TOP ACTIVATIONS
MAX = 4.001

Cuba present . Count ries that once acquies ced
. 8 7 Erit rea 129 .
open and slamming shut at high - speed )
aka The Edgar J . Kau f mann house , but
iz io Aus iliar io Fem min ile ). The commander
a transition of ownership , Civ ica , a major player
the floor while another makes pelvic thrust s at her buttocks
This has been sold to Cape in the UK , with
might say for someone like Cape lla , listen for how
Service story . Ark ansas has already honored the
Finnish parliament in Helsinki , Ves a - Matt i Sa
October was being prepared at Cape Canaveral in Florida for an
West Wing , including Sebastian Gork a , have received enormous
York Times recently , edible tet rah yd roc ann ib
appearance with Treasury Secretary Steven Mn uchin , that the debt
), which was dominated by Kik uy u and Luo ,
Park as they look for virtual creatures . ( Photo :
dinner . I open the door and got the
Glenn C OOK COUNTY CR IM INAL COUR TH
adviser to Inter pol and fut ur ist - in -

MIN ACTIVATIONS
MIN = -3.347

Al - Far ou q Amin u , and J .
Minneapolis . C onsequ ence of Sound reports that
global attacks and the French supermarket chain Car ref our took
match in charge at Old Trafford The class of
murder of Jews and all innoc ents intolerable , and it
times and hard times and unexpected turns . It would not
side face Crystal Palace at Anfield on Sunday .
in Bad Nau heim and Philipp s - n
, essential , no - brainer human rights treaties , undermined
mem b res , les Offic iers et les F ond
Bub bles , Equity and Demand ( March 29 , 2010
to grow , this was instantly recognised as a perfect partnership
pro , his first since elbow reconstruction 15 months earlier ,
> ( & self , ord : Order ing , _
businesses looking for on - prem ise deployment options happy and
announced an alliance with B ibli othe ca to directly sell
who suffer from overly - pres cript ive and overs impl
game in charge at Old Trafford against Swansea on Sunday by
) Andre R ingu ette / Getty Images
United States , 2010 . Pres ented at the 57 th

INTERVAL 2.164 - 4.001
CONTAINS 0.133%

problems such as racism , sexism , trans ph obia ,
co - leader of the GREAT 3 competition .
ky of the University of Colorado Boulder , says that his
a fundraising email this week reminding voters that the last time
longer - term actions . Duke Energy is responsible for 21

INTERVAL 0.327 - 2.164
CONTAINS 22.286%

of pun ks and the cycle continued . No one was
slight differences when the same measure is repeated ; etc .
love you , man . You nailed it !
Atlantic and North Pacific O ceans , is now
media selected him as the ACC Pre season Rookie of the

INTERVAL -1.510 - 0.327
CONTAINS 76.635%

Y anks 45 - 28 . They were led not by
alling together in a cr um pled mess of mutual uncertainty
got there in the last two seasons . Was there an
help but feel that this rapidly changing environment must be tough
Why doesn t the Deep State like Trump

INTERVAL -3.347 - -1.510
CONTAINS 0.946%

suffered a rupt ured Achilles tendon in Thursday 's game against
was too full , and Gl ü ck a poet for
restaurants and shops set among natural activities , and the M
- floor lab space is under construction , and one glass
research ethics . 2000 The Office of Science and
Token Cuba
Token ID14159
Feature activation+5.70e-01

Pos logit contributions
 Ceres+0.517
 vig+0.443
§+0.431
 Stall+0.423
 Equality+0.423
Neg logit contributions
alties-0.552
��-0.538
ingu-0.536
 oun-0.522
iqueness-0.503
Token present
Token ID1944
Feature activation+1.10e+00

Pos logit contributions
 Ceres+1.010
 vig+0.889
§+0.889
 Stall+0.862
 Equality+0.857
Neg logit contributions
alties-0.773
��-0.748
ingu-0.747
 oun-0.722
iqueness-0.688
Token.
Token ID13
Feature activation+1.90e-01

Pos logit contributions
 Ceres+1.957
 vig+1.726
§+1.723
 Stall+1.670
 Equality+1.652
Neg logit contributions
alties-1.476
��-1.436
ingu-1.419
 oun-1.381
iqueness-1.315
Token
Token ID198
Feature activation+9.57e-02

Pos logit contributions
 Ceres+0.309
 vig+0.267
§+0.265
 Stall+0.259
 Equality+0.257
Neg logit contributions
alties-0.290
ingu-0.279
��-0.278
 oun-0.270
cond-0.259
Token
Token ID198
Feature activation+2.43e-01

Pos logit contributions
 Ceres+0.148
§+0.128
 vig+0.126
 Stall+0.123
 Equality+0.123
Neg logit contributions
��-0.154
alties-0.153
 oun-0.153
ingu-0.149
iqueness-0.139
TokenCount
Token ID12332
Feature activation+4.00e+00

Pos logit contributions
 Ceres+0.393
§+0.349
 vig+0.341
 Equality+0.330
 Stall+0.330
Neg logit contributions
alties-0.368
��-0.359
 oun-0.358
ingu-0.355
iqueness-0.331
Tokenries
Token ID1678
Feature activation+7.76e-01

Pos logit contributions
 Ceres+4.297
 vig+3.395
§+3.393
 Stall+3.223
 Equality+3.164
Neg logit contributions
��-7.971
 oun-7.744
alties-7.729
ingu-7.636
iqueness-7.546
Token that
Token ID326
Feature activation+7.22e-02

Pos logit contributions
 Ceres+1.115
 vig+0.946
§+0.941
 Stall+0.910
 Equality+0.900
Neg logit contributions
alties-1.326
��-1.301
ingu-1.280
 oun-1.257
aughs-1.204
Token once
Token ID1752
Feature activation+9.31e-01

Pos logit contributions
 Ceres+0.104
 vig+0.088
§+0.087
 Stall+0.084
 Equality+0.083
Neg logit contributions
alties-0.124
��-0.120
ingu-0.120
 oun-0.117
iqueness-0.112
Token acquies
Token ID47752
Feature activation-1.15e+00

Pos logit contributions
 Ceres+1.355
 vig+1.165
§+1.135
 Stall+1.103
 Equality+1.086
Neg logit contributions
alties-1.568
��-1.538
ingu-1.522
 oun-1.473
iqueness-1.426
Tokenced
Token ID771
Feature activation-4.03e-01

Pos logit contributions
alties+1.724
ingu+1.658
��+1.638
 oun+1.594
cond+1.541
Neg logit contributions
 Ceres-1.905
 vig-1.644
§-1.634
 Stall-1.599
 Equality-1.588
Token.
Token ID13
Feature activation-1.50e-02

Pos logit contributions
 Ceres+0.782
 vig+0.672
§+0.671
 Stall+0.653
 Equality+0.648
Neg logit contributions
alties-0.763
ingu-0.737
��-0.736
 oun-0.712
iqueness-0.687
Token8
Token ID23
Feature activation+3.68e-01

Pos logit contributions
alties+0.025
ingu+0.024
��+0.024
 oun+0.023
cond+0.022
Neg logit contributions
 Ceres-0.022
 vig-0.019
§-0.019
 Stall-0.018
 Equality-0.018
Token
Token ID198
Feature activation-1.01e-01

Pos logit contributions
 Ceres+0.579
 vig+0.496
§+0.495
 Stall+0.482
 Equality+0.478
Neg logit contributions
alties-0.580
ingu-0.560
��-0.559
 oun-0.542
iqueness-0.521
Token
Token ID198
Feature activation-6.01e-02

Pos logit contributions
alties+0.158
��+0.155
ingu+0.153
 oun+0.153
iqueness+0.142
Neg logit contributions
 Ceres-0.161
§-0.139
 vig-0.138
 Stall-0.134
 Equality-0.134
Token7
Token ID22
Feature activation+4.53e-01

Pos logit contributions
alties+0.099
ingu+0.096
��+0.095
 oun+0.092
cond+0.089
Neg logit contributions
 Ceres-0.090
 vig-0.077
§-0.076
 Stall-0.074
 Equality-0.074
Token Erit
Token ID43349
Feature activation+3.64e+00

Pos logit contributions
 Ceres+0.764
 vig+0.663
§+0.662
 Stall+0.645
 Equality+0.640
Neg logit contributions
alties-0.660
ingu-0.639
��-0.637
 oun-0.616
iqueness-0.591
Tokenrea
Token ID21468
Feature activation+1.07e+00

Pos logit contributions
 Ceres+3.565
§+2.825
 vig+2.787
 Stall+2.742
 Equality+2.604
Neg logit contributions
��-7.572
alties-7.558
ingu-7.402
 oun-7.352
iqueness-7.023
Token
Token ID198
Feature activation+1.49e-01

Pos logit contributions
 Ceres+1.662
§+1.423
 vig+1.420
 Stall+1.382
 Equality+1.372
Neg logit contributions
alties-1.697
��-1.643
ingu-1.640
 oun-1.593
iqueness-1.527
Token
Token ID198
Feature activation+1.51e-01

Pos logit contributions
 Ceres+0.234
§+0.203
 vig+0.201
 Stall+0.195
 Equality+0.195
Neg logit contributions
alties-0.235
��-0.233
 oun-0.230
ingu-0.227
iqueness-0.212
Token129
Token ID18741
Feature activation+6.31e-01

Pos logit contributions
 Ceres+0.213
 vig+0.179
§+0.178
 Stall+0.173
 Equality+0.171
Neg logit contributions
alties-0.263
ingu-0.255
��-0.253
 oun-0.247
cond-0.239
Token.
Token ID13
Feature activation+2.50e-01

Pos logit contributions
 Ceres+1.191
 vig+1.051
§+1.045
 Stall+1.026
 Equality+1.018
Neg logit contributions
alties-0.793
ingu-0.758
��-0.754
 oun-0.727
cond-0.691
Token open
Token ID1280
Feature activation+2.62e+00

Pos logit contributions
alties+0.536
ingu+0.517
��+0.514
 oun+0.501
cond+0.483
Neg logit contributions
 Ceres-0.496
 vig-0.422
§-0.420
 Stall-0.409
 Equality-0.406
Token and
Token ID290
Feature activation+1.49e-01

Pos logit contributions
 Ceres+3.798
 vig+3.283
§+3.220
 Stall+3.180
 Equality+3.043
Neg logit contributions
alties-4.314
��-4.201
ingu-4.168
 oun-4.073
iqueness-3.903
Token slamming
Token ID39603
Feature activation+5.38e-01

Pos logit contributions
 Ceres+0.232
 vig+0.200
§+0.198
 Stall+0.193
 Equality+0.191
Neg logit contributions
alties-0.235
ingu-0.227
��-0.227
 oun-0.220
iqueness-0.211
Token
Token ID198
Feature activation-2.69e-01

Pos logit contributions
 Ceres+0.834
 vig+0.717
§+0.711
 Stall+0.693
 Equality+0.683
Neg logit contributions
alties-0.858
��-0.830
ingu-0.830
 oun-0.804
iqueness-0.773
Token
Token ID198
Feature activation-1.58e-02

Pos logit contributions
alties+0.413
��+0.400
ingu+0.398
 oun+0.391
iqueness+0.369
Neg logit contributions
 Ceres-0.432
 vig-0.372
§-0.372
 Stall-0.362
 Equality-0.359
Tokenshut
Token ID49625
Feature activation+3.47e+00

Pos logit contributions
alties+0.022
ingu+0.021
��+0.021
 oun+0.021
iqueness+0.020
Neg logit contributions
 Ceres-0.027
§-0.024
 vig-0.024
 Stall-0.023
 Equality-0.023
Token at
Token ID379
Feature activation-2.33e-01

Pos logit contributions
 Ceres+4.728
 vig+4.064
§+4.029
 Stall+3.823
 Equality+3.701
Neg logit contributions
alties-5.853
��-5.844
ingu-5.708
 oun-5.631
iqueness-5.372
Token high
Token ID1029
Feature activation+6.78e-03

Pos logit contributions
alties+0.371
ingu+0.359
��+0.357
 oun+0.347
cond+0.334
Neg logit contributions
 Ceres-0.359
 vig-0.307
§-0.306
 Stall-0.298
 Equality-0.296
Token-
Token ID12
Feature activation+8.76e-02

Pos logit contributions
 Ceres+0.010
 vig+0.009
§+0.009
 Stall+0.008
 Equality+0.008
Neg logit contributions
alties-0.011
ingu-0.011
��-0.011
 oun-0.010
iqueness-0.010
Tokenspeed
Token ID12287
Feature activation-6.22e-02

Pos logit contributions
 Ceres+0.166
 vig+0.147
§+0.147
 Stall+0.143
 Equality+0.142
Neg logit contributions
alties-0.109
��-0.104
ingu-0.104
 oun-0.100
iqueness-0.095
Token)
Token ID8
Feature activation+5.56e-01

Pos logit contributions
alties+0.103
ingu+0.099
��+0.099
 oun+0.096
iqueness+0.093
Neg logit contributions
 Ceres-0.093
 vig-0.079
§-0.079
 Stall-0.076
 Equality-0.076
Token aka
Token ID22430
Feature activation+7.29e-01

Pos logit contributions
alties+0.219
ingu+0.212
��+0.210
 oun+0.205
cond+0.197
Neg logit contributions
 Ceres-0.208
 vig-0.178
§-0.177
 Stall-0.172
 Equality-0.171
Token The
Token ID383
Feature activation-2.60e-01

Pos logit contributions
 Ceres+1.101
 vig+0.935
§+0.929
 Stall+0.911
 Equality+0.898
Neg logit contributions
alties-1.200
��-1.156
ingu-1.155
 oun-1.123
iqueness-1.078
Token Edgar
Token ID29166
Feature activation-3.68e-02

Pos logit contributions
alties+0.429
ingu+0.416
��+0.414
 oun+0.403
cond+0.389
Neg logit contributions
 Ceres-0.386
 vig-0.330
§-0.327
 Stall-0.316
 Equality-0.314
Token J
Token ID449
Feature activation-4.33e-01

Pos logit contributions
alties+0.062
��+0.061
ingu+0.060
 oun+0.059
cond+0.057
Neg logit contributions
 Ceres-0.054
 vig-0.045
§-0.045
 Stall-0.044
 Equality-0.043
Token.
Token ID13
Feature activation+5.67e-01

Pos logit contributions
alties+0.550
ingu+0.529
��+0.522
 oun+0.505
cond+0.479
Neg logit contributions
 Ceres-0.809
 vig-0.714
§-0.711
 Stall-0.694
 Equality-0.692
Token Kau
Token ID28148
Feature activation+3.40e+00

Pos logit contributions
 Ceres+0.839
 vig+0.708
§+0.705
 Stall+0.692
 Equality+0.682
Neg logit contributions
alties-0.949
��-0.918
ingu-0.917
 oun-0.892
iqueness-0.856
Tokenf
Token ID69
Feature activation+6.28e-01

Pos logit contributions
 Ceres+4.577
§+3.926
 vig+3.877
 Stall+3.769
 Robotics+3.670
Neg logit contributions
��-5.843
alties-5.815
ingu-5.656
 oun-5.597
ollah-5.234
Tokenmann
Token ID9038
Feature activation-9.49e-01

Pos logit contributions
 Ceres+0.930
 vig+0.791
§+0.786
 Stall+0.763
 Equality+0.762
Neg logit contributions
alties-1.045
ingu-1.011
��-1.001
 oun-0.978
cond-0.946
Token house
Token ID2156
Feature activation-8.79e-01

Pos logit contributions
alties+1.543
ingu+1.511
��+1.476
 oun+1.448
cond+1.410
Neg logit contributions
 Ceres-1.413
 vig-1.210
§-1.185
 Equality-1.154
 Stall-1.148
Token,
Token ID11
Feature activation-7.03e-02

Pos logit contributions
alties+1.328
ingu+1.289
��+1.268
 oun+1.237
cond+1.201
Neg logit contributions
 Ceres-1.421
 vig-1.225
§-1.212
 Stall-1.185
 Equality-1.183
Token but
Token ID475
Feature activation+1.42e-01

Pos logit contributions
alties+0.112
ingu+0.108
��+0.107
 oun+0.105
cond+0.101
Neg logit contributions
 Ceres-0.109
 vig-0.093
§-0.093
 Stall-0.090
 Equality-0.089
Tokeniz
Token ID528
Feature activation-6.97e-01

Pos logit contributions
 Ceres+1.314
§+1.164
 vig+1.162
 Stall+1.136
 Equality+1.126
Neg logit contributions
alties-0.836
��-0.802
ingu-0.797
 oun-0.772
iqueness-0.730
Tokenio
Token ID952
Feature activation+1.68e-01

Pos logit contributions
alties+0.900
ingu+0.862
��+0.855
 oun+0.828
cond+0.787
Neg logit contributions
 Ceres-1.291
 vig-1.136
§-1.131
 Stall-1.108
 Equality-1.100
Token Aus
Token ID27545
Feature activation+1.86e+00

Pos logit contributions
 Ceres+0.242
 vig+0.205
§+0.203
 Stall+0.198
 Equality+0.196
Neg logit contributions
alties-0.286
ingu-0.277
��-0.275
 oun-0.268
cond-0.259
Tokeniliar
Token ID4797
Feature activation-7.78e-01

Pos logit contributions
 Ceres+3.289
§+2.915
 vig+2.891
 Stall+2.805
 Equality+2.781
Neg logit contributions
��-2.458
alties-2.444
 oun-2.301
ingu-2.295
aughs-2.196
Tokenio
Token ID952
Feature activation+2.31e-02

Pos logit contributions
alties+1.170
ingu+1.135
��+1.115
 oun+1.089
cond+1.047
Neg logit contributions
 Ceres-1.272
 vig-1.097
§-1.090
 Stall-1.069
 Equality-1.061
Token Fem
Token ID31149
Feature activation+3.33e+00

Pos logit contributions
 Ceres+0.034
 vig+0.029
§+0.029
 Stall+0.028
 Equality+0.028
Neg logit contributions
alties-0.039
ingu-0.037
��-0.037
 oun-0.036
cond-0.035
Tokenmin
Token ID1084
Feature activation+5.44e-01

Pos logit contributions
 Ceres+5.564
§+4.971
 vig+4.795
 Stall+4.735
 Equality+4.698
Neg logit contributions
��-4.613
alties-4.581
 oun-4.430
ingu-4.247
iqueness-4.221
Tokenile
Token ID576
Feature activation+5.04e-01

Pos logit contributions
 Ceres+0.993
§+0.879
 vig+0.877
 Stall+0.853
 Equality+0.846
Neg logit contributions
alties-0.709
��-0.690
ingu-0.672
 oun-0.663
iqueness-0.620
Token).
Token ID737
Feature activation+1.25e-01

Pos logit contributions
 Ceres+0.724
 vig+0.612
§+0.609
 Stall+0.591
 Equality+0.585
Neg logit contributions
alties-0.862
��-0.833
ingu-0.832
 oun-0.810
iqueness-0.783
Token The
Token ID383
Feature activation-5.31e-01

Pos logit contributions
 Ceres+0.195
§+0.167
 vig+0.167
 Stall+0.162
 Equality+0.161
Neg logit contributions
alties-0.200
��-0.194
ingu-0.194
 oun-0.188
iqueness-0.181
Token commander
Token ID11561
Feature activation+1.81e-01

Pos logit contributions
alties+0.879
ingu+0.850
��+0.846
 oun+0.823
cond+0.793
Neg logit contributions
 Ceres-0.788
 vig-0.669
§-0.666
 Stall-0.648
 Equality-0.641
Token a
Token ID257
Feature activation+8.29e-03

Pos logit contributions
 Ceres+0.149
 vig+0.125
§+0.125
 Stall+0.121
 Equality+0.120
Neg logit contributions
alties-0.188
ingu-0.182
��-0.182
 oun-0.177
iqueness-0.171
Token transition
Token ID6801
Feature activation-1.36e+00

Pos logit contributions
 Ceres+0.012
 vig+0.010
§+0.010
 Stall+0.010
 Equality+0.010
Neg logit contributions
alties-0.014
ingu-0.014
��-0.014
 oun-0.013
cond-0.013
Token of
Token ID286
Feature activation-8.72e-02

Pos logit contributions
alties+2.174
ingu+2.108
��+2.080
 oun+2.034
cond+1.965
Neg logit contributions
 Ceres-2.085
 vig-1.782
§-1.773
 Stall-1.727
 Equality-1.714
Token ownership
Token ID9238
Feature activation+5.26e-01

Pos logit contributions
alties+0.144
ingu+0.140
��+0.139
 oun+0.135
cond+0.130
Neg logit contributions
 Ceres-0.130
 vig-0.110
§-0.110
 Stall-0.107
 Equality-0.106
Token,
Token ID11
Feature activation-4.45e-01

Pos logit contributions
 Ceres+0.819
 vig+0.703
§+0.698
 Stall+0.680
 Equality+0.673
Neg logit contributions
alties-0.833
��-0.814
ingu-0.810
 oun-0.787
iqueness-0.755
Token Civ
Token ID47909
Feature activation+3.31e+00

Pos logit contributions
alties+0.736
ingu+0.711
��+0.708
 oun+0.690
cond+0.664
Neg logit contributions
 Ceres-0.661
 vig-0.563
§-0.560
 Stall-0.544
 Equality-0.540
Tokenica
Token ID3970
Feature activation+9.70e-01

Pos logit contributions
 Ceres+5.364
§+4.765
 Robotics+4.605
 vig+4.600
 Equality+4.569
Neg logit contributions
alties-4.755
��-4.709
 oun-4.455
ingu-4.443
iqueness-4.263
Token,
Token ID11
Feature activation-2.17e-01

Pos logit contributions
 Ceres+1.390
 vig+1.169
§+1.168
 Stall+1.135
 Equality+1.126
Neg logit contributions
alties-1.653
��-1.610
ingu-1.598
 oun-1.557
iqueness-1.510
Token a
Token ID257
Feature activation-1.17e-01

Pos logit contributions
alties+0.368
ingu+0.356
��+0.353
 oun+0.346
cond+0.333
Neg logit contributions
 Ceres-0.312
 vig-0.265
§-0.263
 Stall-0.256
 Equality-0.253
Token major
Token ID1688
Feature activation+2.10e-01

Pos logit contributions
alties+0.194
��+0.189
ingu+0.188
 oun+0.183
iqueness+0.176
Neg logit contributions
 Ceres-0.175
 vig-0.149
§-0.147
 Stall-0.144
 Equality-0.142
Token player
Token ID2137
Feature activation+5.62e-02

Pos logit contributions
 Ceres+0.326
 vig+0.279
§+0.275
 Stall+0.270
 Equality+0.267
Neg logit contributions
alties-0.337
��-0.330
ingu-0.327
 oun-0.318
iqueness-0.308
Token the
Token ID262
Feature activation+7.38e-02

Pos logit contributions
 Ceres+0.566
 vig+0.484
§+0.478
 Stall+0.471
 Equality+0.463
Neg logit contributions
alties-0.599
��-0.578
ingu-0.578
 oun-0.560
iqueness-0.540
Token floor
Token ID4314
Feature activation-8.85e-01

Pos logit contributions
 Ceres+0.107
 vig+0.091
§+0.090
 Stall+0.088
 Equality+0.087
Neg logit contributions
alties-0.125
ingu-0.121
��-0.121
 oun-0.118
iqueness-0.114
Token while
Token ID981
Feature activation+9.09e-02

Pos logit contributions
alties+1.406
ingu+1.358
��+1.345
 oun+1.313
cond+1.276
Neg logit contributions
 Ceres-1.365
§-1.162
 vig-1.160
 Stall-1.133
 Equality-1.126
Token another
Token ID1194
Feature activation-1.34e-01

Pos logit contributions
 Ceres+0.150
 vig+0.130
§+0.129
 Stall+0.126
 Equality+0.125
Neg logit contributions
alties-0.136
ingu-0.131
��-0.131
 oun-0.127
iqueness-0.121
Token makes
Token ID1838
Feature activation+2.45e-01

Pos logit contributions
alties+0.242
ingu+0.234
��+0.234
 oun+0.228
cond+0.220
Neg logit contributions
 Ceres-0.179
 vig-0.149
§-0.148
 Stall-0.144
 Equality-0.142
Token pelvic
Token ID43510
Feature activation+3.31e+00

Pos logit contributions
 Ceres+0.377
 vig+0.323
§+0.319
 Stall+0.312
 Equality+0.308
Neg logit contributions
alties-0.396
ingu-0.383
��-0.383
 oun-0.371
iqueness-0.357
Token thrust
Token ID14613
Feature activation+1.70e+00

Pos logit contributions
 Ceres+2.816
 vig+2.262
 Stall+2.137
§+2.116
 Stretch+1.987
Neg logit contributions
��-7.399
alties-7.336
ingu-7.229
iqueness-7.039
 oun-7.038
Tokens
Token ID82
Feature activation+3.74e-01

Pos logit contributions
 Ceres+2.341
 vig+1.999
§+1.950
 Stall+1.898
 Equality+1.873
Neg logit contributions
alties-2.954
��-2.920
ingu-2.876
 oun-2.812
iqueness-2.727
Token at
Token ID379
Feature activation-1.33e-01

Pos logit contributions
 Ceres+0.602
 vig+0.521
§+0.517
 Stall+0.505
 Equality+0.499
Neg logit contributions
alties-0.574
��-0.556
ingu-0.554
 oun-0.537
iqueness-0.514
Token her
Token ID607
Feature activation-1.89e-01

Pos logit contributions
alties+0.222
ingu+0.215
��+0.214
 oun+0.208
cond+0.201
Neg logit contributions
 Ceres-0.195
 vig-0.165
§-0.165
 Stall-0.160
 Equality-0.159
Token buttocks
Token ID50193
Feature activation+5.65e-01

Pos logit contributions
alties+0.304
ingu+0.294
��+0.293
 oun+0.284
cond+0.273
Neg logit contributions
 Ceres-0.290
 vig-0.249
§-0.247
 Stall-0.241
 Equality-0.239
Token This
Token ID770
Feature activation-4.44e-01

Pos logit contributions
alties+0.624
ingu+0.604
��+0.600
 oun+0.584
cond+0.562
Neg logit contributions
 Ceres-0.597
 vig-0.511
§-0.508
 Stall-0.495
 Equality-0.490
Token has
Token ID468
Feature activation-3.01e-02

Pos logit contributions
alties+0.718
ingu+0.694
��+0.692
 oun+0.673
cond+0.646
Neg logit contributions
 Ceres-0.679
 vig-0.580
§-0.577
 Stall-0.562
 Equality-0.558
Token been
Token ID587
Feature activation-4.84e-01

Pos logit contributions
alties+0.048
ingu+0.047
��+0.046
 oun+0.045
iqueness+0.043
Neg logit contributions
 Ceres-0.047
 vig-0.040
§-0.040
 Stall-0.039
 Equality-0.038
Token sold
Token ID2702
Feature activation-7.34e-01

Pos logit contributions
alties+0.787
��+0.762
ingu+0.761
 oun+0.735
iqueness+0.709
Neg logit contributions
 Ceres-0.739
 vig-0.632
§-0.624
 Stall-0.610
 Equality-0.605
Token to
Token ID284
Feature activation-2.26e-01

Pos logit contributions
alties+1.120
ingu+1.079
��+1.068
 oun+1.043
cond+1.001
Neg logit contributions
 Ceres-1.186
 vig-1.022
§-1.016
 Stall-0.993
 Equality-0.984
Token Cape
Token ID15725
Feature activation+3.30e+00

Pos logit contributions
alties+0.364
ingu+0.352
��+0.351
 oun+0.341
iqueness+0.328
Neg logit contributions
 Ceres-0.347
 vig-0.297
§-0.294
 Stall-0.288
 Equality-0.286
Token in
Token ID287
Feature activation+7.80e-02

Pos logit contributions
 Ceres+4.116
§+3.413
 vig+3.366
 Stall+3.319
 Equality+3.260
Neg logit contributions
alties-6.232
��-6.139
ingu-6.011
 oun-5.876
iqueness-5.720
Token the
Token ID262
Feature activation+1.17e-01

Pos logit contributions
 Ceres+0.121
 vig+0.104
§+0.103
 Stall+0.100
 Equality+0.100
Neg logit contributions
alties-0.125
ingu-0.121
��-0.121
 oun-0.117
iqueness-0.112
Token UK
Token ID3482
Feature activation+5.15e-02

Pos logit contributions
 Ceres+0.186
 vig+0.160
§+0.158
 Stall+0.155
 Equality+0.155
Neg logit contributions
alties-0.184
��-0.178
ingu-0.178
 oun-0.171
iqueness-0.165
Token,
Token ID11
Feature activation-1.00e-01

Pos logit contributions
 Ceres+0.084
 vig+0.073
§+0.073
 Stall+0.071
 Equality+0.070
Neg logit contributions
alties-0.077
ingu-0.075
��-0.075
 oun-0.072
iqueness-0.069
Token with
Token ID351
Feature activation-1.01e-01

Pos logit contributions
alties+0.167
ingu+0.162
��+0.162
 oun+0.157
cond+0.151
Neg logit contributions
 Ceres-0.149
 vig-0.126
§-0.125
 Stall-0.122
 Equality-0.121
Token might
Token ID1244
Feature activation+2.20e-01

Pos logit contributions
 Ceres+0.045
 vig+0.038
§+0.038
 Stall+0.037
 Equality+0.036
Neg logit contributions
alties-0.051
��-0.050
ingu-0.050
 oun-0.048
iqueness-0.046
Token say
Token ID910
Feature activation-9.17e-01

Pos logit contributions
 Ceres+0.330
 vig+0.284
§+0.281
 Stall+0.273
 Equality+0.271
Neg logit contributions
alties-0.362
��-0.354
ingu-0.348
 oun-0.340
iqueness-0.326
Token for
Token ID329
Feature activation+1.33e-01

Pos logit contributions
alties+1.504
ingu+1.454
��+1.442
 oun+1.407
cond+1.356
Neg logit contributions
 Ceres-1.376
 vig-1.173
§-1.166
 Stall-1.135
 Equality-1.125
Token someone
Token ID2130
Feature activation-1.21e+00

Pos logit contributions
 Ceres+0.200
 vig+0.171
§+0.169
 Stall+0.165
 Equality+0.164
Neg logit contributions
alties-0.218
��-0.210
ingu-0.210
 oun-0.204
iqueness-0.196
Token like
Token ID588
Feature activation-3.23e-01

Pos logit contributions
alties+1.985
ingu+1.930
��+1.900
 oun+1.858
cond+1.797
Neg logit contributions
 Ceres-1.798
§-1.525
 vig-1.525
 Stall-1.480
 Equality-1.467
Token Cape
Token ID15725
Feature activation+3.27e+00

Pos logit contributions
alties+0.541
ingu+0.524
��+0.523
 oun+0.509
iqueness+0.489
Neg logit contributions
 Ceres-0.474
 vig-0.402
§-0.400
 Stall-0.390
 Equality-0.386
Tokenlla
Token ID8466
Feature activation+8.45e-01

Pos logit contributions
 Ceres+4.248
§+3.531
 Stall+3.485
 vig+3.466
 Equality+3.347
Neg logit contributions
alties-6.016
��-5.869
ingu-5.735
 oun-5.627
iqueness-5.455
Token,
Token ID11
Feature activation-1.60e-01

Pos logit contributions
 Ceres+1.306
 vig+1.114
§+1.114
 Stall+1.088
 Equality+1.075
Neg logit contributions
alties-1.353
��-1.306
ingu-1.295
 oun-1.266
iqueness-1.215
Token listen
Token ID6004
Feature activation+7.34e-01

Pos logit contributions
alties+0.269
ingu+0.260
��+0.259
 oun+0.252
cond+0.243
Neg logit contributions
 Ceres-0.235
 vig-0.199
§-0.198
 Stall-0.193
 Equality-0.191
Token for
Token ID329
Feature activation+3.44e-01

Pos logit contributions
 Ceres+1.102
 vig+0.940
§+0.936
 Stall+0.909
 Equality+0.902
Neg logit contributions
alties-1.198
��-1.169
ingu-1.162
 oun-1.128
iqueness-1.088
Token how
Token ID703
Feature activation+2.90e-01

Pos logit contributions
 Ceres+0.501
 vig+0.427
§+0.421
 Stall+0.411
 Equality+0.407
Neg logit contributions
alties-0.579
��-0.564
ingu-0.560
 oun-0.546
cond-0.524
TokenService
Token ID16177
Feature activation+7.44e-01

Pos logit contributions
alties+0.391
ingu+0.378
��+0.377
 oun+0.368
cond+0.354
Neg logit contributions
 Ceres-0.320
 vig-0.270
§-0.269
 Stall-0.261
 Equality-0.259
Token story
Token ID1621
Feature activation+1.82e-01

Pos logit contributions
 Ceres+1.207
§+1.042
 vig+1.040
 Stall+1.009
 Equality+1.007
Neg logit contributions
alties-1.137
��-1.092
ingu-1.089
 oun-1.057
iqueness-1.014
Token.
Token ID13
Feature activation+4.49e-02

Pos logit contributions
 Ceres+0.308
 vig+0.268
§+0.267
 Stall+0.260
 Equality+0.259
Neg logit contributions
alties-0.265
ingu-0.255
��-0.255
 oun-0.246
iqueness-0.235
Token
Token ID198
Feature activation-4.96e-02

Pos logit contributions
 Ceres+0.072
 vig+0.063
§+0.061
 Stall+0.060
 Equality+0.059
Neg logit contributions
alties-0.069
ingu-0.066
��-0.065
 oun-0.063
cond-0.062
Token
Token ID198
Feature activation+4.50e-02

Pos logit contributions
alties+0.078
��+0.077
 oun+0.076
ingu+0.076
iqueness+0.070
Neg logit contributions
 Ceres-0.078
§-0.068
 vig-0.067
 Stall-0.065
 Equality-0.065
TokenArk
Token ID42007
Feature activation+3.21e+00

Pos logit contributions
 Ceres+0.072
 vig+0.062
§+0.061
 Stall+0.060
 Equality+0.059
Neg logit contributions
alties-0.070
ingu-0.068
��-0.067
 oun-0.065
cond-0.063
Tokenansas
Token ID6618
Feature activation+3.58e-01

Pos logit contributions
 Ceres+1.903
§+1.392
 vig+1.333
 Equality+1.207
 Stall+1.204
Neg logit contributions
alties-7.874
��-7.822
 oun-7.647
ingu-7.634
iqueness-7.411
Token has
Token ID468
Feature activation+2.81e-01

Pos logit contributions
 Ceres+0.499
 vig+0.421
§+0.418
 Stall+0.406
 Equality+0.404
Neg logit contributions
alties-0.627
��-0.607
ingu-0.607
 oun-0.590
iqueness-0.569
Token already
Token ID1541
Feature activation+2.70e-01

Pos logit contributions
 Ceres+0.426
 vig+0.364
§+0.361
 Stall+0.350
 Equality+0.349
Neg logit contributions
alties-0.459
��-0.447
ingu-0.446
 oun-0.432
iqueness-0.415
Token honored
Token ID21014
Feature activation-9.81e-02

Pos logit contributions
 Ceres+0.413
 vig+0.354
§+0.351
 Stall+0.341
 Equality+0.339
Neg logit contributions
alties-0.436
ingu-0.423
��-0.423
 oun-0.408
iqueness-0.393
Token the
Token ID262
Feature activation-1.31e-03

Pos logit contributions
alties+0.156
ingu+0.150
��+0.149
 oun+0.145
cond+0.140
Neg logit contributions
 Ceres-0.152
 vig-0.130
§-0.130
 Stall-0.126
 Equality-0.125
Token Finnish
Token ID26838
Feature activation+1.61e-01

Pos logit contributions
 Ceres+0.129
 vig+0.110
§+0.109
 Stall+0.106
 Equality+0.106
Neg logit contributions
alties-0.143
��-0.138
ingu-0.137
 oun-0.134
iqueness-0.129
Token parliament
Token ID8540
Feature activation-6.38e-01

Pos logit contributions
 Ceres+0.290
 vig+0.255
§+0.255
 Equality+0.250
 Stall+0.250
Neg logit contributions
alties-0.216
��-0.208
ingu-0.205
 oun-0.200
iqueness-0.191
Token in
Token ID287
Feature activation+6.14e-02

Pos logit contributions
alties+0.958
ingu+0.925
��+0.909
 oun+0.887
cond+0.856
Neg logit contributions
 Ceres-1.049
 vig-0.909
§-0.898
 Stall-0.881
 Equality-0.871
Token Helsinki
Token ID47688
Feature activation+4.23e-01

Pos logit contributions
 Ceres+0.095
 vig+0.081
§+0.081
 Stall+0.079
 Equality+0.078
Neg logit contributions
alties-0.098
��-0.095
ingu-0.095
 oun-0.092
iqueness-0.088
Token,
Token ID11
Feature activation-8.56e-02

Pos logit contributions
 Ceres+0.686
 vig+0.592
§+0.589
 Stall+0.575
 Equality+0.571
Neg logit contributions
alties-0.644
ingu-0.621
��-0.618
 oun-0.600
cond-0.576
Token Ves
Token ID36448
Feature activation+3.16e+00

Pos logit contributions
alties+0.142
ingu+0.137
��+0.136
 oun+0.133
cond+0.128
Neg logit contributions
 Ceres-0.128
 vig-0.109
§-0.108
 Stall-0.105
 Equality-0.104
Tokena
Token ID64
Feature activation+1.20e+00

Pos logit contributions
 Ceres+5.527
§+4.948
 vig+4.897
 Stall+4.838
 Equality+4.700
Neg logit contributions
��-4.039
alties-3.957
ingu-3.906
 oun-3.832
iqueness-3.598
Token-
Token ID12
Feature activation+1.57e-01

Pos logit contributions
 Ceres+1.799
 vig+1.529
§+1.525
 Stall+1.496
 Equality+1.470
Neg logit contributions
alties-1.963
��-1.894
ingu-1.891
 oun-1.846
iqueness-1.780
TokenMatt
Token ID13448
Feature activation-7.33e-01

Pos logit contributions
 Ceres+0.242
 vig+0.207
§+0.206
 Stall+0.200
 Equality+0.199
Neg logit contributions
alties-0.252
ingu-0.243
��-0.242
 oun-0.236
cond-0.226
Tokeni
Token ID72
Feature activation+8.51e-01

Pos logit contributions
alties+0.957
ingu+0.917
��+0.907
 oun+0.877
cond+0.836
Neg logit contributions
 Ceres-1.351
 vig-1.188
§-1.182
 Stall-1.157
 Equality-1.150
Token Sa
Token ID10318
Feature activation+8.48e-01

Pos logit contributions
 Ceres+1.261
 vig+1.070
§+1.066
 Stall+1.048
 Equality+1.029
Neg logit contributions
alties-1.412
��-1.365
ingu-1.364
 oun-1.328
iqueness-1.281
Token October
Token ID3267
Feature activation-1.32e-01

Pos logit contributions
 Ceres+0.673
§+0.526
 vig+0.517
 Stall+0.491
 Equality+0.485
Neg logit contributions
alties-1.318
��-1.288
ingu-1.278
 oun-1.252
iqueness-1.221
Token was
Token ID373
Feature activation+3.12e-01

Pos logit contributions
alties+0.232
ingu+0.225
��+0.225
 oun+0.219
iqueness+0.211
Neg logit contributions
 Ceres-0.184
 vig-0.154
§-0.154
 Stall-0.149
 Equality-0.147
Token being
Token ID852
Feature activation+9.13e-01

Pos logit contributions
 Ceres+0.466
 vig+0.392
§+0.390
 Stall+0.382
 Equality+0.371
Neg logit contributions
alties-0.521
��-0.511
ingu-0.506
 oun-0.493
iqueness-0.475
Token prepared
Token ID5597
Feature activation-4.92e-02

Pos logit contributions
 Ceres+1.315
 vig+1.104
§+1.086
 Stall+1.063
 Equality+1.031
Neg logit contributions
alties-1.565
��-1.542
ingu-1.530
 oun-1.482
iqueness-1.439
Token at
Token ID379
Feature activation-5.24e-01

Pos logit contributions
alties+0.077
ingu+0.074
��+0.073
 oun+0.071
cond+0.069
Neg logit contributions
 Ceres-0.078
 vig-0.067
§-0.067
 Stall-0.065
 Equality-0.065
Token Cape
Token ID15725
Feature activation+3.07e+00

Pos logit contributions
alties+0.833
ingu+0.802
��+0.797
 oun+0.776
cond+0.749
Neg logit contributions
 Ceres-0.805
 vig-0.694
§-0.691
 Stall-0.670
 Equality-0.669
Token Canaveral
Token ID49433
Feature activation+1.56e+00

Pos logit contributions
 Ceres+2.949
§+2.204
 vig+2.137
 Stall+2.131
 Equality+2.022
Neg logit contributions
alties-6.824
��-6.730
ingu-6.703
 oun-6.575
iqueness-6.349
Token in
Token ID287
Feature activation+1.40e-01

Pos logit contributions
 Ceres+2.370
§+2.028
 vig+2.021
 Stall+1.975
 Equality+1.950
Neg logit contributions
alties-2.523
��-2.495
ingu-2.445
 oun-2.381
iqueness-2.292
Token Florida
Token ID4744
Feature activation+1.25e+00

Pos logit contributions
 Ceres+0.175
 vig+0.144
§+0.143
 Stall+0.138
 Equality+0.137
Neg logit contributions
alties-0.264
ingu-0.256
��-0.255
 oun-0.249
cond-0.242
Token for
Token ID329
Feature activation+2.15e-01

Pos logit contributions
 Ceres+1.997
§+1.738
 vig+1.724
 Stall+1.680
 Equality+1.661
Neg logit contributions
alties-1.920
��-1.874
ingu-1.853
 oun-1.799
iqueness-1.731
Token an
Token ID281
Feature activation+5.03e-02

Pos logit contributions
 Ceres+0.320
 vig+0.271
§+0.269
 Stall+0.264
 Equality+0.259
Neg logit contributions
alties-0.358
ingu-0.347
��-0.346
 oun-0.336
iqueness-0.323
Token West
Token ID2688
Feature activation+4.68e-01

Pos logit contributions
alties+0.553
ingu+0.535
��+0.533
 oun+0.518
cond+0.499
Neg logit contributions
 Ceres-0.505
 vig-0.430
§-0.428
 Stall-0.416
 Equality-0.413
Token Wing
Token ID13405
Feature activation-4.77e-01

Pos logit contributions
 Ceres+0.719
 vig+0.614
§+0.611
 Stall+0.597
 Equality+0.591
Neg logit contributions
alties-0.753
��-0.733
ingu-0.727
 oun-0.706
iqueness-0.680
Token,
Token ID11
Feature activation-3.69e-01

Pos logit contributions
alties+0.717
ingu+0.690
��+0.686
 oun+0.667
cond+0.640
Neg logit contributions
 Ceres-0.782
 vig-0.676
§-0.672
 Stall-0.657
 Equality-0.651
Token including
Token ID1390
Feature activation-1.89e+00

Pos logit contributions
alties+0.608
ingu+0.587
��+0.584
 oun+0.569
cond+0.548
Neg logit contributions
 Ceres-0.551
 vig-0.470
§-0.467
 Stall-0.454
 Equality-0.450
Token Sebastian
Token ID26190
Feature activation+7.61e-01

Pos logit contributions
alties+3.032
ingu+2.944
��+2.880
 oun+2.849
cond+2.759
Neg logit contributions
 Ceres-2.810
§-2.420
 vig-2.389
 Stall-2.320
 Equality-2.304
Token Gork
Token ID49882
Feature activation+3.07e+00

Pos logit contributions
 Ceres+1.088
 vig+0.914
§+0.903
 Stall+0.903
 Equality+0.882
Neg logit contributions
alties-1.327
��-1.291
ingu-1.280
 oun-1.244
iqueness-1.204
Tokena
Token ID64
Feature activation+8.75e-01

Pos logit contributions
 Ceres+4.751
 vig+4.463
§+4.355
 Stall+4.248
rity+4.021
Neg logit contributions
��-4.533
 oun-4.470
alties-4.455
ingu-4.286
iqueness-3.993
Token,
Token ID11
Feature activation-2.18e-01

Pos logit contributions
 Ceres+1.383
 vig+1.192
§+1.184
 Stall+1.155
 Equality+1.143
Neg logit contributions
alties-1.368
��-1.318
ingu-1.316
 oun-1.278
iqueness-1.226
Token have
Token ID423
Feature activation-4.17e-01

Pos logit contributions
alties+0.361
ingu+0.350
��+0.347
 oun+0.339
cond+0.327
Neg logit contributions
 Ceres-0.322
 vig-0.274
§-0.274
 Stall-0.265
 Equality-0.263
Token received
Token ID2722
Feature activation-1.38e+00

Pos logit contributions
alties+0.670
ingu+0.648
��+0.645
 oun+0.627
cond+0.602
Neg logit contributions
 Ceres-0.640
 vig-0.548
§-0.544
 Stall-0.531
 Equality-0.526
Token enormous
Token ID9812
Feature activation+6.47e-01

Pos logit contributions
alties+2.160
ingu+2.077
��+2.062
 oun+2.010
cond+1.943
Neg logit contributions
 Ceres-2.154
§-1.854
 vig-1.843
 Stall-1.798
 Equality-1.781
Token York
Token ID1971
Feature activation-2.66e-01

Pos logit contributions
alties+0.877
ingu+0.850
��+0.841
 oun+0.816
iqueness+0.789
Neg logit contributions
 Ceres-0.914
 vig-0.786
§-0.785
 Stall-0.762
 Equality-0.753
Token Times
Token ID3782
Feature activation+2.59e-01

Pos logit contributions
alties+0.452
��+0.435
ingu+0.435
 oun+0.422
cond+0.408
Neg logit contributions
 Ceres-0.387
 vig-0.329
§-0.326
 Stall-0.318
 Equality-0.315
Token recently
Token ID2904
Feature activation+8.48e-01

Pos logit contributions
 Ceres+0.403
 vig+0.346
§+0.346
 Stall+0.336
 Equality+0.333
Neg logit contributions
alties-0.413
��-0.398
ingu-0.395
 oun-0.384
iqueness-0.368
Token,
Token ID11
Feature activation-1.23e-01

Pos logit contributions
 Ceres+1.358
 vig+1.171
§+1.168
 Stall+1.135
 Equality+1.126
Neg logit contributions
alties-1.309
��-1.260
ingu-1.259
 oun-1.219
iqueness-1.167
Token edible
Token ID35988
Feature activation-1.87e-01

Pos logit contributions
alties+0.199
��+0.194
ingu+0.193
 oun+0.186
iqueness+0.179
Neg logit contributions
 Ceres-0.188
 vig-0.161
§-0.159
 Stall-0.156
 Equality-0.154
Token tet
Token ID28408
Feature activation+3.06e+00

Pos logit contributions
alties+0.295
ingu+0.285
��+0.285
 oun+0.276
iqueness+0.265
Neg logit contributions
 Ceres-0.292
 vig-0.252
§-0.250
 Stall-0.243
 Equality-0.241
Tokenrah
Token ID11392
Feature activation+1.70e-01

Pos logit contributions
 Ceres+5.231
 vig+4.560
§+4.544
 Equality+4.374
 Stall+4.373
Neg logit contributions
��-4.202
alties-4.148
ingu-3.991
 oun-3.968
iqueness-3.763
Tokenyd
Token ID5173
Feature activation-4.73e-01

Pos logit contributions
 Ceres+0.391
 vig+0.354
§+0.353
 Stall+0.346
 Equality+0.345
Neg logit contributions
alties-0.145
��-0.134
ingu-0.133
 oun-0.125
iqueness-0.117
Tokenroc
Token ID12204
Feature activation-4.88e-01

Pos logit contributions
alties+0.500
ingu+0.475
��+0.469
 oun+0.450
cond+0.424
Neg logit contributions
 Ceres-0.988
 vig-0.883
§-0.878
 Stall-0.864
 Equality-0.858
Tokenann
Token ID1236
Feature activation+3.31e-01

Pos logit contributions
alties+0.741
��+0.714
ingu+0.713
 oun+0.692
iqueness+0.663
Neg logit contributions
 Ceres-0.794
 vig-0.684
§-0.682
 Stall-0.666
 Equality-0.659
Tokenib
Token ID571
Feature activation+1.48e-01

Pos logit contributions
 Ceres+0.565
 vig+0.492
§+0.489
 Stall+0.479
 Equality+0.475
Neg logit contributions
alties-0.475
ingu-0.457
��-0.454
 oun-0.441
cond-0.422
Token appearance
Token ID5585
Feature activation+7.90e-01

Pos logit contributions
 Ceres+0.088
 vig+0.072
§+0.071
 Equality+0.069
 Stall+0.068
Neg logit contributions
alties-0.134
��-0.131
ingu-0.130
 oun-0.127
cond-0.123
Token with
Token ID351
Feature activation-2.57e-01

Pos logit contributions
 Ceres+1.161
 vig+0.986
§+0.983
 Stall+0.953
 Equality+0.942
Neg logit contributions
alties-1.326
ingu-1.281
��-1.281
 oun-1.245
iqueness-1.194
Token Treasury
Token ID13419
Feature activation-1.31e-01

Pos logit contributions
alties+0.415
ingu+0.401
��+0.399
 oun+0.388
cond+0.373
Neg logit contributions
 Ceres-0.394
 vig-0.337
§-0.335
 Stall-0.326
 Equality-0.324
Token Secretary
Token ID4986
Feature activation+2.05e+00

Pos logit contributions
alties+0.290
��+0.284
ingu+0.283
 oun+0.276
iqueness+0.269
Neg logit contributions
 Ceres-0.124
 vig-0.094
§-0.093
 Stall-0.089
 Equality-0.088
Token Steven
Token ID8239
Feature activation+4.51e-01

Pos logit contributions
 Ceres+2.758
§+2.288
 vig+2.244
 Stall+2.195
 Equality+2.174
Neg logit contributions
��-3.737
ingu-3.660
alties-3.653
 oun-3.572
iqueness-3.413
Token Mn
Token ID36030
Feature activation+3.03e+00

Pos logit contributions
 Ceres+0.257
§+0.149
 Stall+0.146
 vig+0.139
 Equality+0.127
Neg logit contributions
alties-1.170
��-1.165
ingu-1.155
 oun-1.145
cond-1.108
Tokenuchin
Token ID43416
Feature activation+2.03e-01

Pos logit contributions
 Ceres+2.686
§+2.325
 vig+2.075
 Stretch+1.858
 Stall+1.853
Neg logit contributions
��-6.622
alties-6.566
 oun-6.541
ingu-6.230
cond-6.138
Token,
Token ID11
Feature activation-3.59e-01

Pos logit contributions
 Ceres+0.324
 vig+0.279
§+0.276
 Stall+0.270
 Equality+0.269
Neg logit contributions
alties-0.313
ingu-0.303
��-0.300
 oun-0.292
cond-0.281
Token that
Token ID326
Feature activation-2.90e-01

Pos logit contributions
alties+0.589
ingu+0.570
��+0.564
 oun+0.551
cond+0.534
Neg logit contributions
 Ceres-0.535
 vig-0.455
§-0.447
 Stall-0.441
 Equality-0.438
Token the
Token ID262
Feature activation-2.06e-01

Pos logit contributions
alties+0.473
ingu+0.457
��+0.455
 oun+0.443
cond+0.426
Neg logit contributions
 Ceres-0.441
 vig-0.376
§-0.374
 Stall-0.364
 Equality-0.361
Token debt
Token ID5057
Feature activation+4.75e-01

Pos logit contributions
alties+0.309
ingu+0.298
��+0.297
 oun+0.288
cond+0.276
Neg logit contributions
 Ceres-0.339
 vig-0.293
§-0.292
 Stall-0.285
 Equality-0.283
Token),
Token ID828
Feature activation+6.79e-03

Pos logit contributions
 Ceres+0.867
 vig+0.735
§+0.731
 Stall+0.710
 Equality+0.704
Neg logit contributions
alties-1.008
ingu-0.975
��-0.971
 oun-0.946
iqueness-0.911
Token which
Token ID543
Feature activation-1.16e-01

Pos logit contributions
 Ceres+0.010
 vig+0.008
§+0.008
 Stall+0.008
 Equality+0.008
Neg logit contributions
alties-0.012
ingu-0.011
��-0.011
 oun-0.011
cond-0.011
Token was
Token ID373
Feature activation+2.08e-01

Pos logit contributions
alties+0.204
��+0.198
ingu+0.198
 oun+0.193
iqueness+0.186
Neg logit contributions
 Ceres-0.161
 vig-0.135
§-0.134
 Stall-0.131
 Equality-0.129
Token dominated
Token ID13354
Feature activation+2.48e-01

Pos logit contributions
 Ceres+0.269
 vig+0.223
§+0.220
 Stall+0.214
 Equality+0.211
Neg logit contributions
alties-0.386
��-0.377
ingu-0.376
 oun-0.366
iqueness-0.354
Token by
Token ID416
Feature activation-4.20e-01

Pos logit contributions
 Ceres+0.423
 vig+0.368
§+0.366
 Stall+0.358
 Equality+0.355
Neg logit contributions
alties-0.357
ingu-0.344
��-0.343
 oun-0.332
iqueness-0.318
Token Kik
Token ID40571
Feature activation+3.03e+00

Pos logit contributions
alties+0.679
ingu+0.656
��+0.654
 oun+0.636
cond+0.611
Neg logit contributions
 Ceres-0.641
 vig-0.548
§-0.544
 Stall-0.531
 Equality-0.527
Tokenuy
Token ID4669
Feature activation+7.44e-01

Pos logit contributions
 Ceres+5.279
§+4.916
 vig+4.892
 Stall+4.686
 Equality+4.662
Neg logit contributions
��-3.891
alties-3.873
 oun-3.630
iqueness-3.511
aughs-3.471
Tokenu
Token ID84
Feature activation+4.49e-03

Pos logit contributions
 Ceres+1.775
 vig+1.613
§+1.608
 Stall+1.580
 Equality+1.570
Neg logit contributions
alties-0.557
��-0.518
ingu-0.510
 oun-0.486
iqueness-0.443
Token and
Token ID290
Feature activation-1.95e-01

Pos logit contributions
 Ceres+0.007
 vig+0.006
§+0.006
 Stall+0.006
 Equality+0.006
Neg logit contributions
alties-0.007
ingu-0.007
��-0.007
 oun-0.007
cond-0.007
Token Luo
Token ID25956
Feature activation+1.28e-02

Pos logit contributions
alties+0.323
ingu+0.312
��+0.311
 oun+0.302
cond+0.291
Neg logit contributions
 Ceres-0.290
 vig-0.247
§-0.246
 Stall-0.239
 Equality-0.237
Token,
Token ID11
Feature activation-3.94e-01

Pos logit contributions
 Ceres+0.020
 vig+0.017
§+0.017
 Stall+0.017
 Equality+0.017
Neg logit contributions
alties-0.020
ingu-0.019
��-0.019
 oun-0.019
iqueness-0.018
Token Park
Token ID3250
Feature activation+4.53e-01

Pos logit contributions
 Ceres+0.891
 vig+0.766
§+0.757
 Stall+0.748
 Equality+0.737
Neg logit contributions
alties-0.895
��-0.865
ingu-0.864
 oun-0.836
iqueness-0.803
Token as
Token ID355
Feature activation+2.64e-01

Pos logit contributions
 Ceres+0.667
 vig+0.567
§+0.562
 Stall+0.549
 Equality+0.543
Neg logit contributions
alties-0.758
��-0.733
ingu-0.733
 oun-0.712
iqueness-0.686
Token they
Token ID484
Feature activation+6.23e-01

Pos logit contributions
 Ceres+0.398
 vig+0.339
§+0.337
 Stall+0.329
 Equality+0.326
Neg logit contributions
alties-0.433
ingu-0.419
��-0.418
 oun-0.406
iqueness-0.391
Token look
Token ID804
Feature activation+1.70e+00

Pos logit contributions
 Ceres+0.877
 vig+0.742
§+0.729
 Stall+0.714
 Equality+0.704
Neg logit contributions
alties-1.086
��-1.060
ingu-1.047
 oun-1.025
iqueness-0.990
Token for
Token ID329
Feature activation+3.23e-01

Pos logit contributions
 Ceres+2.449
 vig+2.082
§+2.037
 Stall+1.977
 Equality+1.969
Neg logit contributions
alties-2.871
��-2.843
ingu-2.808
 oun-2.729
iqueness-2.641
Token virtual
Token ID7166
Feature activation+3.03e+00

Pos logit contributions
 Ceres+0.471
 vig+0.399
§+0.394
 Stall+0.386
 Equality+0.382
Neg logit contributions
alties-0.547
��-0.530
ingu-0.529
 oun-0.515
iqueness-0.496
Token creatures
Token ID8109
Feature activation+8.65e-01

Pos logit contributions
 Ceres+4.332
 vig+3.678
 Stall+3.556
§+3.536
 Tyrann+3.471
Neg logit contributions
��-5.110
alties-5.057
ingu-4.981
 oun-4.853
iqueness-4.710
Token.
Token ID13
Feature activation-4.55e-01

Pos logit contributions
 Ceres+1.342
 vig+1.144
§+1.136
 Stall+1.115
 Equality+1.097
Neg logit contributions
alties-1.385
��-1.351
ingu-1.340
 oun-1.306
iqueness-1.251
Token (
Token ID357
Feature activation-4.89e-02

Pos logit contributions
alties+0.786
ingu+0.759
��+0.742
 oun+0.729
cond+0.713
Neg logit contributions
 Ceres-0.631
 vig-0.538
§-0.530
 Stall-0.506
 Equality-0.504
TokenPhoto
Token ID6191
Feature activation+4.40e-01

Pos logit contributions
alties+0.067
ingu+0.065
��+0.064
 oun+0.062
cond+0.060
Neg logit contributions
 Ceres-0.086
 vig-0.075
§-0.075
 Stall-0.073
 Equality-0.073
Token:
Token ID25
Feature activation-2.60e-02

Pos logit contributions
 Ceres+0.772
§+0.677
 vig+0.673
 Stall+0.656
 Equality+0.655
Neg logit contributions
��-0.624
alties-0.623
 oun-0.607
ingu-0.598
iqueness-0.556
Token dinner
Token ID8073
Feature activation+9.27e-02

Pos logit contributions
 Ceres+1.392
 vig+1.169
§+1.162
 Stall+1.133
 Equality+1.115
Neg logit contributions
alties-1.670
��-1.623
ingu-1.623
 oun-1.579
iqueness-1.534
Token.
Token ID13
Feature activation+5.37e-01

Pos logit contributions
 Ceres+0.134
 vig+0.113
§+0.112
 Stall+0.109
 Equality+0.108
Neg logit contributions
alties-0.158
ingu-0.153
��-0.152
 oun-0.148
iqueness-0.143
Token
Token ID198
Feature activation+3.75e-01

Pos logit contributions
 Ceres+0.832
§+0.704
 vig+0.699
 Stall+0.690
 Equality+0.684
Neg logit contributions
alties-0.870
��-0.849
ingu-0.844
 oun-0.824
iqueness-0.788
Token
Token ID198
Feature activation+5.59e-01

Pos logit contributions
 Ceres+0.577
§+0.501
 vig+0.491
 Stall+0.480
 Equality+0.479
Neg logit contributions
��-0.609
 oun-0.607
alties-0.606
ingu-0.587
iqueness-0.547
TokenI
Token ID40
Feature activation+5.30e-01

Pos logit contributions
 Ceres+0.883
§+0.771
 vig+0.759
 Stall+0.740
 Equality+0.735
Neg logit contributions
alties-0.872
��-0.842
ingu-0.839
 oun-0.827
iqueness-0.780
Token open
Token ID1280
Feature activation+3.00e+00

Pos logit contributions
 Ceres+0.746
 vig+0.633
§+0.626
 Stall+0.608
 Equality+0.600
Neg logit contributions
alties-0.921
��-0.899
ingu-0.892
 oun-0.870
iqueness-0.839
Token the
Token ID262
Feature activation+2.85e-01

Pos logit contributions
 Ceres+3.956
 vig+3.320
§+3.244
 Stall+3.184
 Equality+3.105
Neg logit contributions
alties-5.411
��-5.276
ingu-5.146
 oun-5.084
iqueness-5.007
Token door
Token ID3420
Feature activation+4.84e-01

Pos logit contributions
 Ceres+0.303
 vig+0.242
§+0.238
 Stall+0.230
 Equality+0.226
Neg logit contributions
alties-0.594
ingu-0.577
��-0.577
 oun-0.563
iqueness-0.547
Token and
Token ID290
Feature activation+5.52e-01

Pos logit contributions
 Ceres+0.698
 vig+0.591
§+0.587
 Stall+0.571
 Equality+0.566
Neg logit contributions
alties-0.822
ingu-0.796
��-0.793
 oun-0.772
cond-0.744
Token got
Token ID1392
Feature activation+5.28e-01

Pos logit contributions
 Ceres+0.790
 vig+0.669
§+0.662
 Stall+0.647
 Equality+0.637
Neg logit contributions
alties-0.951
��-0.924
ingu-0.923
 oun-0.895
iqueness-0.868
Token the
Token ID262
Feature activation+1.69e-01

Pos logit contributions
 Ceres+0.791
 vig+0.674
§+0.668
 Stall+0.652
 Equality+0.644
Neg logit contributions
alties-0.873
��-0.847
ingu-0.845
 oun-0.818
iqueness-0.790
Token Glenn
Token ID17551
Feature activation+1.44e-01

Pos logit contributions
 Ceres+0.161
 vig+0.137
§+0.136
 Stall+0.133
 Equality+0.132
Neg logit contributions
alties-0.177
ingu-0.171
��-0.170
 oun-0.166
cond-0.159
Token
Token ID198
Feature activation-8.44e-02

Pos logit contributions
 Ceres+0.217
 vig+0.185
§+0.184
 Stall+0.179
 Equality+0.177
Neg logit contributions
alties-0.235
ingu-0.228
��-0.227
 oun-0.221
iqueness-0.213
Token
Token ID198
Feature activation+2.89e-02

Pos logit contributions
alties+0.134
��+0.132
 oun+0.131
ingu+0.129
iqueness+0.120
Neg logit contributions
 Ceres-0.132
§-0.115
 vig-0.113
 Stall-0.111
 Equality-0.110
TokenC
Token ID34
Feature activation+7.44e-01

Pos logit contributions
 Ceres+0.047
 vig+0.040
§+0.040
 Stall+0.039
 Equality+0.039
Neg logit contributions
alties-0.044
ingu-0.042
��-0.042
 oun-0.041
cond-0.039
TokenOOK
Token ID15308
Feature activation+9.99e-01

Pos logit contributions
 Ceres+1.258
 vig+1.096
§+1.089
 Stall+1.066
 Equality+1.058
Neg logit contributions
alties-1.076
��-1.042
ingu-1.033
 oun-0.997
iqueness-0.959
Token COUNTY
Token ID41012
Feature activation+3.00e+00

Pos logit contributions
 Ceres+1.590
§+1.362
 vig+1.359
 Stall+1.329
 Equality+1.315
Neg logit contributions
alties-1.566
��-1.504
ingu-1.500
 oun-1.461
iqueness-1.407
Token CR
Token ID8740
Feature activation+1.33e+00

Pos logit contributions
 Ceres+4.366
§+3.674
 vig+3.637
 Stall+3.627
 Robotics+3.537
Neg logit contributions
alties-5.023
��-4.922
ingu-4.867
iqueness-4.651
 oun-4.633
TokenIM
Token ID3955
Feature activation+1.29e+00

Pos logit contributions
 Ceres+2.134
§+1.856
 vig+1.829
 Stall+1.805
 Robotics+1.775
Neg logit contributions
alties-2.049
��-1.991
ingu-1.922
 oun-1.915
iqueness-1.861
TokenINAL
Token ID17961
Feature activation+1.20e+00

Pos logit contributions
 Ceres+1.841
§+1.563
 vig+1.519
 Stall+1.517
 Equality+1.477
Neg logit contributions
alties-2.236
��-2.188
ingu-2.166
 oun-2.133
iqueness-2.067
Token COUR
Token ID37384
Feature activation+2.29e+00

Pos logit contributions
 Ceres+1.614
§+1.361
 vig+1.341
 Stall+1.317
 Equality+1.287
Neg logit contributions
alties-2.135
��-2.095
ingu-2.084
 oun-2.034
iqueness-1.977
TokenTH
Token ID4221
Feature activation+1.75e+00

Pos logit contributions
 Ceres+3.320
§+2.879
 vig+2.795
 Stall+2.759
 Equality+2.725
Neg logit contributions
alties-3.820
��-3.728
ingu-3.725
 oun-3.673
iqueness-3.521
Token adviser
Token ID12534
Feature activation+1.13e-01

Pos logit contributions
alties+1.697
ingu+1.644
��+1.635
 oun+1.593
cond+1.534
Neg logit contributions
 Ceres-1.535
 vig-1.308
§-1.302
 Stall-1.267
 Equality-1.254
Token to
Token ID284
Feature activation-2.36e-01

Pos logit contributions
 Ceres+0.175
 vig+0.150
§+0.149
 Stall+0.146
 Equality+0.144
Neg logit contributions
alties-0.179
ingu-0.173
��-0.172
 oun-0.168
cond-0.161
Token Inter
Token ID4225
Feature activation-2.07e-01

Pos logit contributions
alties+0.392
ingu+0.378
��+0.377
 oun+0.367
cond+0.353
Neg logit contributions
 Ceres-0.351
 vig-0.299
§-0.296
 Stall-0.289
 Equality-0.287
Tokenpol
Token ID16104
Feature activation-8.20e-01

Pos logit contributions
alties+0.337
ingu+0.325
��+0.323
 oun+0.314
cond+0.303
Neg logit contributions
 Ceres-0.315
 vig-0.269
§-0.268
 Stall-0.260
 Equality-0.258
Token and
Token ID290
Feature activation-2.71e-01

Pos logit contributions
alties+1.279
ingu+1.237
��+1.213
 oun+1.192
cond+1.143
Neg logit contributions
 Ceres-1.291
 vig-1.116
§-1.105
 Stall-1.084
 Equality-1.071
Token fut
Token ID13294
Feature activation+2.99e+00

Pos logit contributions
alties+0.450
ingu+0.435
��+0.433
 oun+0.421
cond+0.406
Neg logit contributions
 Ceres-0.401
 vig-0.341
§-0.339
 Stall-0.330
 Equality-0.327
Tokenur
Token ID333
Feature activation+6.02e-01

Pos logit contributions
 Ceres+3.948
§+3.493
 vig+3.418
 Stall+3.249
 Equality+3.245
Neg logit contributions
alties-5.242
��-5.164
ingu-4.996
 oun-4.932
iqueness-4.698
Tokenist
Token ID396
Feature activation+3.63e-01

Pos logit contributions
 Ceres+1.194
 vig+1.061
§+1.056
 Stall+1.035
 Equality+1.028
Neg logit contributions
alties-0.700
��-0.668
ingu-0.666
 oun-0.638
cond-0.602
Token-
Token ID12
Feature activation-2.53e-01

Pos logit contributions
 Ceres+0.571
 vig+0.490
§+0.488
 Stall+0.476
 Equality+0.472
Neg logit contributions
alties-0.571
ingu-0.551
��-0.548
 oun-0.533
cond-0.512
Tokenin
Token ID259
Feature activation-1.65e-01

Pos logit contributions
alties+0.334
ingu+0.320
��+0.319
 oun+0.308
cond+0.293
Neg logit contributions
 Ceres-0.460
 vig-0.404
§-0.402
 Stall-0.394
 Equality-0.391
Token-
Token ID12
Feature activation-3.42e-01

Pos logit contributions
alties+0.241
ingu+0.233
��+0.231
 oun+0.224
cond+0.216
Neg logit contributions
 Ceres-0.277
 vig-0.240
§-0.238
 Stall-0.233
 Equality-0.232
Token Al
Token ID978
Feature activation-9.94e-01

Pos logit contributions
 Ceres+0.191
 vig+0.162
§+0.161
 Stall+0.157
 Equality+0.156
Neg logit contributions
alties-0.216
ingu-0.209
��-0.209
 oun-0.203
iqueness-0.195
Token-
Token ID12
Feature activation-4.43e-02

Pos logit contributions
alties+1.448
ingu+1.402
��+1.372
 oun+1.332
cond+1.304
Neg logit contributions
 Ceres-1.676
 vig-1.452
§-1.435
 Stall-1.399
 Equality-1.394
TokenFar
Token ID21428
Feature activation+3.99e-01

Pos logit contributions
alties+0.088
ingu+0.086
��+0.085
 oun+0.082
cond+0.081
Neg logit contributions
 Ceres-0.052
 vig-0.041
§-0.041
 Equality-0.039
 Stall-0.039
Tokenou
Token ID280
Feature activation-4.23e-01

Pos logit contributions
 Ceres+0.752
§+0.673
 vig+0.659
 Stall+0.654
 Equality+0.645
Neg logit contributions
alties-0.496
��-0.488
ingu-0.459
 oun-0.456
cond-0.424
Tokenq
Token ID80
Feature activation-7.22e-01

Pos logit contributions
alties+0.544
��+0.535
ingu+0.527
 oun+0.522
iqueness+0.486
Neg logit contributions
 Ceres-0.782
§-0.691
 vig-0.682
 Stall-0.676
 Equality-0.664
Token Amin
Token ID39869
Feature activation-3.35e+00

Pos logit contributions
alties+1.074
��+1.045
ingu+1.033
 oun+1.022
iqueness+0.965
Neg logit contributions
 Ceres-1.195
 vig-1.034
§-1.031
 Stall-1.006
 Equality-1.001
Tokenu
Token ID84
Feature activation-6.56e-01

Pos logit contributions
alties+1.681
ingu+1.438
��+1.299
 oun+1.266
ien+1.172
Neg logit contributions
 Ceres-8.733
 Stall-7.828
 vig-7.818
§-7.708
 Robotics-7.701
Token,
Token ID11
Feature activation+1.68e-01

Pos logit contributions
alties+0.941
ingu+0.909
��+0.906
 oun+0.876
iqueness+0.838
Neg logit contributions
 Ceres-1.112
 vig-0.971
§-0.960
 Stall-0.942
 Equality-0.940
Token and
Token ID290
Feature activation+1.65e-01

Pos logit contributions
 Ceres+0.245
 vig+0.207
§+0.207
 Stall+0.200
 Equality+0.199
Neg logit contributions
alties-0.284
ingu-0.274
��-0.273
 oun-0.266
cond-0.257
Token J
Token ID449
Feature activation-7.30e-01

Pos logit contributions
 Ceres+0.238
 vig+0.202
§+0.200
 Stall+0.195
 Equality+0.193
Neg logit contributions
alties-0.280
ingu-0.271
��-0.270
 oun-0.263
cond-0.254
Token.
Token ID13
Feature activation+2.29e-01

Pos logit contributions
alties+0.959
ingu+0.934
��+0.894
 oun+0.871
cond+0.838
Neg logit contributions
 Ceres-1.341
 vig-1.174
§-1.163
 Equality-1.142
 Stall-1.142
Token Minneapolis
Token ID20455
Feature activation-1.11e-02

Pos logit contributions
 Ceres+0.297
 vig+0.247
§+0.245
 Stall+0.239
 Equality+0.237
Neg logit contributions
alties-0.404
��-0.391
ingu-0.391
 oun-0.381
iqueness-0.367
Token.
Token ID13
Feature activation+2.74e-02

Pos logit contributions
alties+0.015
ingu+0.015
��+0.015
 oun+0.014
cond+0.014
Neg logit contributions
 Ceres-0.020
 vig-0.017
§-0.017
 Stall-0.017
 Equality-0.017
Token
Token ID198
Feature activation+1.07e-02

Pos logit contributions
 Ceres+0.043
 vig+0.037
§+0.037
 Stall+0.036
 Equality+0.035
Neg logit contributions
alties-0.043
ingu-0.042
��-0.042
 oun-0.040
cond-0.039
Token
Token ID198
Feature activation+4.54e-02

Pos logit contributions
 Ceres+0.017
§+0.015
 vig+0.014
 Stall+0.014
 Equality+0.014
Neg logit contributions
alties-0.017
��-0.017
 oun-0.017
ingu-0.016
iqueness-0.015
TokenC
Token ID34
Feature activation+5.86e-01

Pos logit contributions
 Ceres+0.074
§+0.065
 vig+0.064
 Stall+0.063
 Equality+0.062
Neg logit contributions
alties-0.068
��-0.066
ingu-0.066
 oun-0.064
iqueness-0.061
Tokenonsequ
Token ID40819
Feature activation-3.28e+00

Pos logit contributions
 Ceres+1.004
 vig+0.874
§+0.870
 Stall+0.850
 Equality+0.844
Neg logit contributions
alties-0.838
ingu-0.806
��-0.805
 oun-0.778
cond-0.744
Tokenence
Token ID594
Feature activation-9.63e-01

Pos logit contributions
alties+3.739
ingu+3.523
 oun+3.217
ient+3.192
��+3.192
Neg logit contributions
 Ceres-6.436
 vig-5.829
§-5.687
 Stall-5.670
 Equality-5.635
Token of
Token ID286
Feature activation+1.15e-01

Pos logit contributions
alties+1.546
ingu+1.491
��+1.473
 oun+1.444
cond+1.388
Neg logit contributions
 Ceres-1.475
 vig-1.262
§-1.252
 Stall-1.220
 Equality-1.205
Token Sound
Token ID9506
Feature activation-3.11e-03

Pos logit contributions
 Ceres+0.168
 vig+0.143
§+0.141
 Stall+0.138
 Equality+0.137
Neg logit contributions
alties-0.194
��-0.187
ingu-0.187
 oun-0.182
iqueness-0.175
Token reports
Token ID3136
Feature activation+1.33e+00

Pos logit contributions
alties+0.005
ingu+0.005
��+0.005
 oun+0.005
cond+0.005
Neg logit contributions
 Ceres-0.005
 vig-0.004
§-0.004
 Stall-0.004
 Equality-0.004
Token that
Token ID326
Feature activation-9.07e-02

Pos logit contributions
 Ceres+2.032
 vig+1.729
§+1.724
 Stall+1.679
 Equality+1.670
Neg logit contributions
alties-2.151
��-2.122
ingu-2.072
 oun-2.019
iqueness-1.949
Token global
Token ID3298
Feature activation+1.75e-01

Pos logit contributions
alties+0.041
ingu+0.040
��+0.040
 oun+0.039
iqueness+0.037
Neg logit contributions
 Ceres-0.040
 vig-0.035
§-0.034
 Stall-0.034
 Equality-0.033
Token attacks
Token ID3434
Feature activation+6.47e-01

Pos logit contributions
 Ceres+0.258
 vig+0.221
§+0.219
 Stall+0.213
 Equality+0.211
Neg logit contributions
alties-0.292
��-0.283
ingu-0.283
 oun-0.275
iqueness-0.265
Token and
Token ID290
Feature activation+7.00e-02

Pos logit contributions
 Ceres+1.015
§+0.882
 vig+0.878
 Stall+0.851
 Equality+0.838
Neg logit contributions
alties-1.010
��-0.994
ingu-0.978
 oun-0.953
iqueness-0.916
Token the
Token ID262
Feature activation-2.22e-01

Pos logit contributions
 Ceres+0.106
 vig+0.091
§+0.090
 Stall+0.088
 Equality+0.086
Neg logit contributions
alties-0.114
��-0.111
ingu-0.111
 oun-0.107
iqueness-0.104
Token French
Token ID4141
Feature activation-1.59e+00

Pos logit contributions
alties+0.364
ingu+0.352
��+0.351
 oun+0.341
iqueness+0.328
Neg logit contributions
 Ceres-0.335
 vig-0.287
§-0.285
 Stall-0.277
 Equality-0.275
Token supermarket
Token ID26454
Feature activation-3.26e+00

Pos logit contributions
alties+2.568
ingu+2.480
��+2.436
 oun+2.396
cond+2.315
Neg logit contributions
 Ceres-2.432
 vig-2.055
§-2.042
 Stall-1.994
 Equality-1.978
Token chain
Token ID6333
Feature activation+4.90e-01

Pos logit contributions
alties+5.189
ingu+4.908
��+4.769
 oun+4.747
cond+4.669
Neg logit contributions
 Ceres-4.932
 vig-4.101
§-4.052
 Stall-4.015
 Robotics-3.973
Token Car
Token ID1879
Feature activation+3.20e-01

Pos logit contributions
 Ceres+0.707
§+0.601
 vig+0.601
 Stall+0.581
 Equality+0.578
Neg logit contributions
alties-0.831
��-0.811
ingu-0.808
 oun-0.784
iqueness-0.757
Tokenref
Token ID5420
Feature activation-3.29e-01

Pos logit contributions
 Ceres+0.658
 vig+0.589
§+0.588
 Stall+0.577
 Equality+0.574
Neg logit contributions
alties-0.347
��-0.332
ingu-0.329
 oun-0.315
iqueness-0.294
Tokenour
Token ID454
Feature activation-1.34e+00

Pos logit contributions
alties+0.478
ingu+0.462
��+0.459
 oun+0.444
cond+0.426
Neg logit contributions
 Ceres-0.554
 vig-0.481
§-0.479
 Stall-0.468
 Equality-0.464
Token took
Token ID1718
Feature activation-2.80e-01

Pos logit contributions
alties+2.240
ingu+2.158
��+2.128
 oun+2.092
cond+2.023
Neg logit contributions
 Ceres-1.983
 vig-1.671
§-1.649
 Stall-1.628
 Equality-1.606
Token match
Token ID2872
Feature activation-3.18e-01

Pos logit contributions
 Ceres+0.152
 vig+0.132
§+0.131
 Stall+0.127
 Equality+0.127
Neg logit contributions
alties-0.148
��-0.143
ingu-0.143
 oun-0.139
iqueness-0.133
Token in
Token ID287
Feature activation-4.56e-02

Pos logit contributions
alties+0.502
ingu+0.485
��+0.483
 oun+0.469
cond+0.451
Neg logit contributions
 Ceres-0.497
 vig-0.426
§-0.424
 Stall-0.413
 Equality-0.410
Token charge
Token ID3877
Feature activation-7.65e-03

Pos logit contributions
alties+0.083
��+0.081
ingu+0.081
 oun+0.079
iqueness+0.076
Neg logit contributions
 Ceres-0.060
 vig-0.050
§-0.049
 Stall-0.048
 Equality-0.048
Token at
Token ID379
Feature activation-4.97e-01

Pos logit contributions
alties+0.012
ingu+0.011
��+0.011
 oun+0.011
cond+0.010
Neg logit contributions
 Ceres-0.012
 vig-0.011
§-0.011
 Stall-0.010
 Equality-0.010
Token Old
Token ID5706
Feature activation-7.27e-01

Pos logit contributions
alties+0.783
ingu+0.755
��+0.753
 oun+0.732
iqueness+0.702
Neg logit contributions
 Ceres-0.783
 vig-0.670
§-0.665
 Stall-0.652
 Equality-0.646
Token Trafford
Token ID41107
Feature activation-3.18e+00

Pos logit contributions
alties+0.637
ingu+0.599
��+0.595
 oun+0.564
iqueness+0.520
Neg logit contributions
 Ceres-1.649
 vig-1.486
§-1.480
 Stall-1.459
 Equality-1.449
Token
Token ID198
Feature activation-1.79e-01

Pos logit contributions
alties+4.338
ingu+4.122
��+3.995
 oun+3.872
cond+3.778
Neg logit contributions
 Ceres-5.479
 vig-4.743
§-4.737
 Stall-4.629
 Equality-4.542
Token
Token ID198
Feature activation+5.45e-02

Pos logit contributions
alties+0.279
��+0.275
ingu+0.270
 oun+0.269
iqueness+0.250
Neg logit contributions
 Ceres-0.284
§-0.245
 vig-0.244
 Stall-0.237
 Equality-0.236
TokenThe
Token ID464
Feature activation+1.98e-01

Pos logit contributions
 Ceres+0.083
§+0.072
 vig+0.071
 Stall+0.069
 Equality+0.068
Neg logit contributions
alties-0.088
��-0.086
ingu-0.085
 oun-0.084
iqueness-0.079
Token class
Token ID1398
Feature activation+1.03e+00

Pos logit contributions
 Ceres+0.300
 vig+0.259
§+0.254
 Stall+0.250
 Equality+0.247
Neg logit contributions
alties-0.324
ingu-0.314
��-0.313
 oun-0.304
iqueness-0.292
Token of
Token ID286
Feature activation+1.36e-01

Pos logit contributions
 Ceres+1.514
 vig+1.307
§+1.291
 Stall+1.252
 Equality+1.245
Neg logit contributions
alties-1.686
��-1.640
ingu-1.636
 oun-1.599
cond-1.537
Token murder
Token ID5123
Feature activation-2.64e-01

Pos logit contributions
 Ceres+0.075
 vig+0.065
§+0.064
 Stall+0.062
 Equality+0.062
Neg logit contributions
alties-0.082
ingu-0.079
��-0.079
 oun-0.077
iqueness-0.074
Token of
Token ID286
Feature activation-3.90e-02

Pos logit contributions
alties+0.424
ingu+0.409
��+0.407
 oun+0.396
cond+0.383
Neg logit contributions
 Ceres-0.405
 vig-0.345
§-0.343
 Stall-0.335
 Equality-0.332
Token Jews
Token ID6771
Feature activation-5.25e-01

Pos logit contributions
alties+0.064
ingu+0.062
��+0.062
 oun+0.060
iqueness+0.058
Neg logit contributions
 Ceres-0.059
 vig-0.050
§-0.050
 Stall-0.049
 Equality-0.048
Token and
Token ID290
Feature activation+6.31e-02

Pos logit contributions
alties+0.854
ingu+0.827
��+0.823
 oun+0.801
cond+0.772
Neg logit contributions
 Ceres-0.795
 vig-0.678
§-0.673
 Stall-0.658
 Equality-0.651
Token all
Token ID477
Feature activation+2.33e-02

Pos logit contributions
 Ceres+0.097
 vig+0.084
§+0.083
 Stall+0.081
 Equality+0.081
Neg logit contributions
alties-0.101
ingu-0.098
��-0.098
 oun-0.094
iqueness-0.091
Token innoc
Token ID7909
Feature activation-3.14e+00

Pos logit contributions
 Ceres+0.034
 vig+0.029
§+0.029
 Stall+0.028
 Equality+0.028
Neg logit contributions
alties-0.039
ingu-0.038
��-0.038
 oun-0.037
iqueness-0.036
Tokenents
Token ID658
Feature activation-9.16e-01

Pos logit contributions
alties+1.678
ingu+1.460
��+1.260
 oun+1.178
cond+1.139
Neg logit contributions
 Ceres-8.194
 vig-7.517
§-7.439
 Stall-7.362
 Equality-7.351
Token intolerable
Token ID49469
Feature activation-1.67e+00

Pos logit contributions
alties+1.452
ingu+1.401
��+1.394
 oun+1.356
cond+1.309
Neg logit contributions
 Ceres-1.422
 vig-1.217
§-1.210
 Stall-1.184
 Equality-1.172
Token,
Token ID11
Feature activation+1.13e-02

Pos logit contributions
alties+2.379
ingu+2.285
��+2.240
 oun+2.193
cond+2.146
Neg logit contributions
 Ceres-2.829
§-2.435
 vig-2.427
 Stall-2.400
 Robotics-2.356
Token and
Token ID290
Feature activation+1.34e-01

Pos logit contributions
 Ceres+0.017
 vig+0.014
§+0.014
 Stall+0.014
 Equality+0.014
Neg logit contributions
alties-0.019
ingu-0.018
��-0.018
 oun-0.018
iqueness-0.017
Token it
Token ID340
Feature activation-5.78e-01

Pos logit contributions
 Ceres+0.197
 vig+0.169
§+0.166
 Stall+0.161
 Equality+0.161
Neg logit contributions
alties-0.227
��-0.219
ingu-0.219
 oun-0.213
iqueness-0.206
Token times
Token ID1661
Feature activation+9.96e-02

Pos logit contributions
 Ceres+2.308
 vig+1.998
§+1.926
 Equality+1.874
 Stall+1.870
Neg logit contributions
alties-2.869
ingu-2.780
��-2.778
 oun-2.700
iqueness-2.645
Token and
Token ID290
Feature activation+1.26e-01

Pos logit contributions
 Ceres+0.152
 vig+0.130
§+0.130
 Stall+0.126
 Equality+0.125
Neg logit contributions
alties-0.161
ingu-0.156
��-0.155
 oun-0.151
cond-0.145
Token hard
Token ID1327
Feature activation-1.03e+00

Pos logit contributions
 Ceres+0.178
 vig+0.150
§+0.149
 Stall+0.145
 Equality+0.143
Neg logit contributions
alties-0.218
ingu-0.211
��-0.210
 oun-0.205
iqueness-0.197
Token times
Token ID1661
Feature activation-2.40e-01

Pos logit contributions
alties+1.989
ingu+1.915
��+1.911
 oun+1.872
cond+1.815
Neg logit contributions
 Ceres-1.246
 vig-1.012
§-1.005
 Stall-0.971
 Equality-0.965
Token and
Token ID290
Feature activation+1.14e-01

Pos logit contributions
alties+0.380
ingu+0.367
��+0.365
 oun+0.355
cond+0.341
Neg logit contributions
 Ceres-0.373
 vig-0.320
§-0.318
 Stall-0.310
 Equality-0.308
Token unexpected
Token ID10059
Feature activation-3.10e+00

Pos logit contributions
 Ceres+0.166
 vig+0.142
§+0.141
 Stall+0.137
 Equality+0.135
Neg logit contributions
alties-0.194
ingu-0.188
��-0.187
 oun-0.182
iqueness-0.176
Token turns
Token ID4962
Feature activation+1.86e-01

Pos logit contributions
alties+5.307
ingu+5.030
��+4.965
 oun+4.945
cond+4.808
Neg logit contributions
 Ceres-4.305
§-3.600
 vig-3.488
 Stall-3.483
 Equality-3.459
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
 Ceres+0.253
 vig+0.212
§+0.211
 Stall+0.205
 Equality+0.203
Neg logit contributions
alties-0.332
ingu-0.322
��-0.321
 oun-0.313
iqueness-0.302
Token It
Token ID632
Feature activation-2.51e-01

Pos logit contributions
alties+0.397
ingu+0.384
��+0.383
 oun+0.373
cond+0.360
Neg logit contributions
 Ceres-0.327
 vig-0.277
§-0.274
 Stall-0.267
 Equality-0.264
Token would
Token ID561
Feature activation+3.66e-01

Pos logit contributions
alties+0.455
ingu+0.442
��+0.439
 oun+0.429
cond+0.415
Neg logit contributions
 Ceres-0.334
 vig-0.277
§-0.275
 Stall-0.267
 Equality-0.265
Token not
Token ID407
Feature activation+2.09e-01

Pos logit contributions
 Ceres+0.531
 vig+0.452
§+0.451
 Stall+0.437
 Equality+0.432
Neg logit contributions
alties-0.618
��-0.596
ingu-0.596
 oun-0.580
cond-0.559
Token side
Token ID1735
Feature activation-1.70e+00

Pos logit contributions
 Ceres+1.152
 vig+1.010
§+0.987
 Stall+0.975
 Equality+0.964
Neg logit contributions
alties-1.029
��-1.007
ingu-0.999
 oun-0.969
iqueness-0.922
Token face
Token ID1986
Feature activation-1.91e-01

Pos logit contributions
alties+2.770
ingu+2.676
��+2.645
 oun+2.586
cond+2.504
Neg logit contributions
 Ceres-2.538
 vig-2.152
§-2.136
 Stall-2.095
 Equality-2.069
Token Crystal
Token ID12969
Feature activation-2.39e-01

Pos logit contributions
alties+0.277
ingu+0.267
��+0.265
 oun+0.257
cond+0.246
Neg logit contributions
 Ceres-0.324
 vig-0.282
§-0.280
 Stall-0.274
 Equality-0.272
Token Palace
Token ID15301
Feature activation-1.24e+00

Pos logit contributions
alties+0.428
ingu+0.414
��+0.413
 oun+0.403
iqueness+0.389
Neg logit contributions
 Ceres-0.325
 vig-0.272
§-0.270
 Stall-0.262
 Equality-0.260
Token at
Token ID379
Feature activation-5.69e-01

Pos logit contributions
alties+1.958
ingu+1.892
��+1.867
 oun+1.818
cond+1.769
Neg logit contributions
 Ceres-1.925
 vig-1.641
§-1.631
 Stall-1.602
 Equality-1.580
Token Anfield
Token ID44303
Feature activation-3.06e+00

Pos logit contributions
alties+0.707
ingu+0.676
��+0.672
 oun+0.648
cond+0.615
Neg logit contributions
 Ceres-1.081
 vig-0.955
§-0.951
 Stall-0.931
 Equality-0.925
Token on
Token ID319
Feature activation+3.20e-01

Pos logit contributions
alties+4.353
ingu+4.208
��+4.081
 oun+3.971
cond+3.878
Neg logit contributions
 Ceres-5.015
§-4.313
 vig-4.304
 Stall-4.223
 Equality-4.161
Token Sunday
Token ID3502
Feature activation+1.71e-01

Pos logit contributions
 Ceres+0.344
 vig+0.272
§+0.269
 Stall+0.258
 Equality+0.256
Neg logit contributions
alties-0.665
ingu-0.647
��-0.647
 oun-0.634
iqueness-0.613
Token.
Token ID13
Feature activation-2.19e-01

Pos logit contributions
 Ceres+0.301
 vig+0.262
§+0.261
 Stall+0.256
 Equality+0.253
Neg logit contributions
alties-0.237
ingu-0.228
��-0.226
 oun-0.219
cond-0.209
Token
Token ID564
Feature activation+3.05e-01

Pos logit contributions
alties+0.314
ingu+0.302
��+0.299
 oun+0.289
cond+0.280
Neg logit contributions
 Ceres-0.372
 vig-0.327
§-0.321
 Stall-0.314
 Equality-0.312
Token
Token ID250
Feature activation+8.98e-02

Pos logit contributions
 Ceres+0.495
 vig+0.427
§+0.423
 Stall+0.415
 Equality+0.411
Neg logit contributions
alties-0.464
��-0.448
ingu-0.447
 oun-0.433
cond-0.414
Token in
Token ID287
Feature activation+7.52e-06

Pos logit contributions
alties+0.970
ingu+0.943
��+0.925
 oun+0.909
cond+0.883
Neg logit contributions
 Ceres-0.870
 vig-0.740
§-0.732
 Stall-0.716
 Equality-0.711
Token Bad
Token ID7772
Feature activation+4.55e-01

Feature not active on prev token;
no predictions were affected.
Token Nau
Token ID50071
Feature activation+1.26e+00

Pos logit contributions
 Ceres+0.899
 vig+0.797
§+0.794
 Stall+0.779
 Equality+0.774
Neg logit contributions
alties-0.531
ingu-0.506
��-0.504
 oun-0.484
cond-0.457
Tokenheim
Token ID9096
Feature activation-2.96e-01

Pos logit contributions
 Ceres+2.108
 vig+1.829
§+1.828
 Stall+1.779
 Equality+1.764
Neg logit contributions
alties-1.857
��-1.785
ingu-1.782
 oun-1.730
iqueness-1.651
Token and
Token ID290
Feature activation+5.08e-02

Pos logit contributions
alties+0.450
ingu+0.435
��+0.430
 oun+0.419
cond+0.404
Neg logit contributions
 Ceres-0.481
 vig-0.415
§-0.412
 Stall-0.404
 Equality-0.400
Token Philipp
Token ID9844
Feature activation-3.04e+00

Pos logit contributions
 Ceres+0.076
 vig+0.065
§+0.064
 Stall+0.063
 Equality+0.062
Neg logit contributions
alties-0.084
ingu-0.081
��-0.081
 oun-0.079
iqueness-0.076
Tokens
Token ID82
Feature activation-2.61e-01

Pos logit contributions
alties+4.929
ingu+4.809
��+4.705
 oun+4.544
ien+4.468
Neg logit contributions
 Ceres-4.406
 vig-3.829
§-3.765
 Stall-3.583
 Equality-3.582
Token-
Token ID12
Feature activation-1.13e-01

Pos logit contributions
alties+0.431
ingu+0.416
��+0.414
 oun+0.403
cond+0.388
Neg logit contributions
 Ceres-0.390
 vig-0.332
§-0.330
 Stall-0.321
 Equality-0.319
Token
Token ID127
Feature activation-1.47e+00

Pos logit contributions
alties+0.179
ingu+0.173
��+0.172
 oun+0.166
cond+0.162
Neg logit contributions
 Ceres-0.175
 vig-0.150
§-0.148
 Stall-0.145
 Equality-0.144
Token
Token ID250
Feature activation-1.26e-01

Pos logit contributions
alties+2.069
��+2.026
ingu+2.010
 oun+1.935
cond+1.834
Neg logit contributions
 Ceres-2.521
 vig-2.193
§-2.165
 Stall-2.131
 Equality-2.120
Tokenn
Token ID77
Feature activation-9.16e-02

Pos logit contributions
alties+0.200
ingu+0.194
��+0.190
 oun+0.185
cond+0.180
Neg logit contributions
 Ceres-0.197
 vig-0.170
§-0.167
 Stall-0.164
 Equality-0.163
Token,
Token ID11
Feature activation-4.96e-02

Pos logit contributions
alties+0.994
��+0.962
ingu+0.962
 oun+0.935
iqueness+0.901
Neg logit contributions
 Ceres-0.921
 vig-0.788
§-0.779
 Stall-0.760
 Equality-0.757
Token essential
Token ID6393
Feature activation-1.43e+00

Pos logit contributions
alties+0.074
ingu+0.072
��+0.072
 oun+0.069
iqueness+0.067
Neg logit contributions
 Ceres-0.082
 vig-0.071
§-0.070
 Stall-0.069
 Equality-0.069
Token,
Token ID11
Feature activation-1.13e-01

Pos logit contributions
alties+2.273
ingu+2.189
��+2.167
 oun+2.109
cond+2.039
Neg logit contributions
 Ceres-2.222
 vig-1.896
§-1.895
 Stall-1.847
 Equality-1.822
Token no
Token ID645
Feature activation+2.42e-01

Pos logit contributions
alties+0.186
ingu+0.181
��+0.180
 oun+0.175
iqueness+0.168
Neg logit contributions
 Ceres-0.170
 vig-0.146
§-0.144
 Stall-0.140
 Equality-0.140
Token-
Token ID12
Feature activation-2.36e-01

Pos logit contributions
 Ceres+0.396
 vig+0.343
§+0.341
 Stall+0.333
 Equality+0.330
Neg logit contributions
alties-0.363
ingu-0.350
��-0.349
 oun-0.338
iqueness-0.324
Tokenbrainer
Token ID49334
Feature activation-2.99e+00

Pos logit contributions
alties+0.341
ingu+0.328
��+0.323
 oun+0.314
cond+0.305
Neg logit contributions
 Ceres-0.404
 vig-0.350
§-0.348
 Stall-0.339
 Equality-0.338
Token human
Token ID1692
Feature activation-9.57e-01

Pos logit contributions
alties+4.772
ingu+4.560
��+4.459
 oun+4.363
cond+4.326
Neg logit contributions
 Ceres-4.482
§-3.828
 vig-3.778
 Stall-3.747
 Robotics-3.660
Token rights
Token ID2489
Feature activation+8.61e-01

Pos logit contributions
alties+1.412
ingu+1.357
��+1.347
 oun+1.309
cond+1.257
Neg logit contributions
 Ceres-1.598
 vig-1.382
§-1.377
 Stall-1.347
 Equality-1.332
Token treaties
Token ID30734
Feature activation+2.85e-01

Pos logit contributions
 Ceres+1.295
 vig+1.117
§+1.101
 Equality+1.081
 Stall+1.056
Neg logit contributions
alties-1.407
��-1.388
ingu-1.363
 oun-1.329
iqueness-1.287
Token,
Token ID11
Feature activation-9.95e-02

Pos logit contributions
 Ceres+0.429
 vig+0.366
§+0.365
 Stall+0.353
 Equality+0.351
Neg logit contributions
alties-0.468
��-0.454
ingu-0.452
 oun-0.439
iqueness-0.422
Token undermined
Token ID31256
Feature activation+7.68e-01

Pos logit contributions
alties+0.163
ingu+0.158
��+0.158
 oun+0.153
iqueness+0.147
Neg logit contributions
 Ceres-0.150
 vig-0.128
§-0.127
 Stall-0.124
 Equality-0.123
Token mem
Token ID1066
Feature activation+6.55e-01

Pos logit contributions
 Ceres+0.337
 vig+0.289
§+0.286
 Stall+0.279
 Equality+0.275
Neg logit contributions
alties-0.372
��-0.360
ingu-0.359
 oun-0.349
iqueness-0.337
Tokenb
Token ID65
Feature activation-8.76e-04

Pos logit contributions
 Ceres+0.992
 vig+0.847
§+0.842
 Stall+0.820
 Equality+0.813
Neg logit contributions
alties-1.065
ingu-1.029
��-1.027
 oun-0.998
iqueness-0.959
Tokenres
Token ID411
Feature activation-9.71e-01

Pos logit contributions
alties+0.001
ingu+0.001
��+0.001
 oun+0.001
cond+0.001
Neg logit contributions
 Ceres-0.002
 vig-0.001
§-0.001
 Stall-0.001
 Equality-0.001
Token,
Token ID11
Feature activation+2.22e-01

Pos logit contributions
alties+1.680
ingu+1.629
��+1.615
 oun+1.581
cond+1.527
Neg logit contributions
 Ceres-1.372
 vig-1.150
§-1.145
 Stall-1.114
 Equality-1.105
Token les
Token ID10287
Feature activation-5.57e-02

Pos logit contributions
 Ceres+0.311
 vig+0.263
§+0.261
 Stall+0.253
 Equality+0.250
Neg logit contributions
alties-0.389
��-0.377
ingu-0.377
 oun-0.364
iqueness-0.354
Token Offic
Token ID6328
Feature activation-2.98e+00

Pos logit contributions
alties+0.095
ingu+0.092
��+0.092
 oun+0.089
iqueness+0.086
Neg logit contributions
 Ceres-0.080
 vig-0.068
§-0.067
 Stall-0.066
 Equality-0.065
Tokeniers
Token ID3183
Feature activation-9.81e-01

Pos logit contributions
alties+3.605
ingu+3.477
��+3.158
 oun+3.154
ien+3.121
Neg logit contributions
 Ceres-5.731
 vig-5.087
§-4.951
 Stall-4.950
 Equality-4.894
Token et
Token ID2123
Feature activation-7.47e-01

Pos logit contributions
alties+1.828
ingu+1.775
��+1.758
 oun+1.728
cond+1.672
Neg logit contributions
 Ceres-1.252
 vig-1.028
§-1.023
 Stall-0.990
 Equality-0.980
Token les
Token ID10287
Feature activation+1.37e-02

Pos logit contributions
alties+1.361
ingu+1.320
��+1.310
 oun+1.283
cond+1.241
Neg logit contributions
 Ceres-0.986
 vig-0.818
§-0.814
 Stall-0.788
 Equality-0.780
Token F
Token ID376
Feature activation-8.72e-01

Pos logit contributions
 Ceres+0.020
 vig+0.017
§+0.017
 Stall+0.016
 Equality+0.016
Neg logit contributions
alties-0.023
ingu-0.022
��-0.022
 oun-0.022
iqueness-0.021
Tokenond
Token ID623
Feature activation-1.17e+00

Pos logit contributions
alties+0.861
ingu+0.816
��+0.794
 oun+0.765
cond+0.721
Neg logit contributions
 Ceres-1.887
 vig-1.689
§-1.678
 Stall-1.658
 Equality-1.645
Token Bub
Token ID19645
Feature activation+1.62e-01

Pos logit contributions
 Ceres+1.409
 vig+1.193
§+1.187
 Stall+1.159
 Equality+1.153
Neg logit contributions
alties-1.624
��-1.580
ingu-1.570
 oun-1.531
cond-1.467
Tokenbles
Token ID7689
Feature activation-3.56e-01

Pos logit contributions
 Ceres+0.269
 vig+0.232
§+0.231
 Stall+0.226
 Equality+0.224
Neg logit contributions
alties-0.241
ingu-0.233
��-0.230
 oun-0.223
cond-0.215
Token,
Token ID11
Feature activation-5.13e-02

Pos logit contributions
alties+0.626
ingu+0.609
��+0.602
 oun+0.588
cond+0.570
Neg logit contributions
 Ceres-0.490
 vig-0.413
§-0.409
 Stall-0.396
 Equality-0.392
Token Equity
Token ID35659
Feature activation+8.63e-01

Pos logit contributions
alties+0.085
ingu+0.082
��+0.082
 oun+0.079
cond+0.076
Neg logit contributions
 Ceres-0.077
 vig-0.066
§-0.065
 Stall-0.063
 Equality-0.063
Token and
Token ID290
Feature activation-1.90e-01

Pos logit contributions
 Ceres+1.281
 vig+1.088
§+1.081
 Stall+1.062
 Equality+1.053
Neg logit contributions
alties-1.428
��-1.391
ingu-1.382
 oun-1.346
iqueness-1.291
Token Demand
Token ID34479
Feature activation-2.97e+00

Pos logit contributions
alties+0.311
ingu+0.301
��+0.299
 oun+0.291
cond+0.281
Neg logit contributions
 Ceres-0.285
 vig-0.243
§-0.242
 Stall-0.235
 Equality-0.232
Token (
Token ID357
Feature activation+5.03e-02

Pos logit contributions
alties+4.882
ingu+4.788
 oun+4.522
��+4.494
cond+4.494
Neg logit contributions
 Ceres-4.206
 vig-3.510
§-3.477
 Stall-3.372
rity-3.309
TokenMarch
Token ID16192
Feature activation+1.13e+00

Pos logit contributions
 Ceres+0.066
§+0.056
 vig+0.055
 Stall+0.054
 Equality+0.053
Neg logit contributions
alties-0.091
��-0.089
ingu-0.089
 oun-0.088
iqueness-0.083
Token 29
Token ID2808
Feature activation+2.35e-01

Pos logit contributions
 Ceres+1.418
§+1.182
 vig+1.133
 Stall+1.126
 Equality+1.108
Neg logit contributions
��-2.213
alties-2.171
 oun-2.165
ingu-2.120
iqueness-2.027
Token,
Token ID11
Feature activation+1.40e-02

Pos logit contributions
 Ceres+0.374
§+0.322
 vig+0.320
 Stall+0.312
 Equality+0.311
Neg logit contributions
alties-0.368
��-0.360
ingu-0.356
 oun-0.355
iqueness-0.331
Token 2010
Token ID3050
Feature activation-1.25e+00

Pos logit contributions
 Ceres+0.019
§+0.016
 vig+0.016
 Stall+0.016
 Equality+0.015
Neg logit contributions
alties-0.025
��-0.024
ingu-0.024
 oun-0.023
iqueness-0.023
Token to
Token ID284
Feature activation+1.03e-01

Pos logit contributions
 Ceres+2.526
 vig+2.161
§+2.101
 Stall+2.081
 Equality+2.064
Neg logit contributions
alties-2.983
��-2.901
ingu-2.881
 oun-2.813
iqueness-2.712
Token grow
Token ID1663
Feature activation+3.45e-01

Pos logit contributions
 Ceres+0.145
 vig+0.123
§+0.121
 Stall+0.119
 Equality+0.118
Neg logit contributions
alties-0.178
��-0.173
ingu-0.172
 oun-0.167
iqueness-0.162
Token,
Token ID11
Feature activation-6.42e-02

Pos logit contributions
 Ceres+0.519
 vig+0.442
§+0.438
 Stall+0.429
 Equality+0.425
Neg logit contributions
alties-0.567
��-0.551
ingu-0.548
 oun-0.534
iqueness-0.513
Token this
Token ID428
Feature activation-2.27e-01

Pos logit contributions
alties+0.115
ingu+0.112
��+0.112
 oun+0.109
cond+0.105
Neg logit contributions
 Ceres-0.086
 vig-0.073
§-0.072
 Stall-0.070
 Equality-0.069
Token was
Token ID373
Feature activation+1.44e-01

Pos logit contributions
alties+0.385
��+0.378
ingu+0.372
 oun+0.362
cond+0.347
Neg logit contributions
 Ceres-0.330
 vig-0.282
§-0.277
 Stall-0.273
 Equality-0.273
Token instantly
Token ID11101
Feature activation-2.96e+00

Pos logit contributions
 Ceres+0.218
 vig+0.187
§+0.184
 Equality+0.180
 Stall+0.180
Neg logit contributions
alties-0.234
��-0.229
ingu-0.227
 oun-0.221
iqueness-0.211
Token recognised
Token ID20915
Feature activation-4.97e-01

Pos logit contributions
alties+4.567
ingu+4.425
��+4.370
 oun+4.302
iqueness+4.122
Neg logit contributions
 Ceres-4.657
 vig-4.020
§-4.019
 Stall-3.907
 Equality-3.855
Token as
Token ID355
Feature activation+1.71e-01

Pos logit contributions
alties+0.808
��+0.783
ingu+0.781
 oun+0.759
iqueness+0.727
Neg logit contributions
 Ceres-0.756
 vig-0.647
§-0.643
 Stall-0.625
 Equality-0.621
Token a
Token ID257
Feature activation+8.76e-02

Pos logit contributions
 Ceres+0.264
 vig+0.227
§+0.224
 Stall+0.220
 Equality+0.220
Neg logit contributions
alties-0.274
ingu-0.265
��-0.265
 oun-0.256
cond-0.246
Token perfect
Token ID2818
Feature activation+4.88e-01

Pos logit contributions
 Ceres+0.138
 vig+0.119
§+0.116
 Stall+0.116
 Equality+0.116
Neg logit contributions
alties-0.140
��-0.135
ingu-0.134
 oun-0.130
aughs-0.124
Token partnership
Token ID10413
Feature activation+5.75e-01

Pos logit contributions
 Ceres+0.730
 vig+0.630
 Equality+0.614
 Stall+0.610
§+0.609
Neg logit contributions
alties-0.812
��-0.787
ingu-0.778
 oun-0.762
iqueness-0.726
Token pro
Token ID386
Feature activation-4.52e-01

Pos logit contributions
alties+0.020
ingu+0.019
��+0.019
 oun+0.018
iqueness+0.018
Neg logit contributions
 Ceres-0.019
 vig-0.016
§-0.016
 Stall-0.016
 Equality-0.016
Token,
Token ID11
Feature activation-2.62e-01

Pos logit contributions
alties+0.717
ingu+0.692
��+0.689
 oun+0.670
cond+0.644
Neg logit contributions
 Ceres-0.704
 vig-0.604
§-0.600
 Stall-0.585
 Equality-0.580
Token his
Token ID465
Feature activation-9.69e-02

Pos logit contributions
alties+0.449
ingu+0.434
��+0.432
 oun+0.421
cond+0.406
Neg logit contributions
 Ceres-0.374
 vig-0.315
§-0.314
 Stall-0.305
 Equality-0.302
Token first
Token ID717
Feature activation-8.28e-02

Pos logit contributions
alties+0.159
ingu+0.154
��+0.154
 oun+0.149
cond+0.144
Neg logit contributions
 Ceres-0.146
 vig-0.124
§-0.123
 Stall-0.120
 Equality-0.119
Token since
Token ID1201
Feature activation-1.03e-01

Pos logit contributions
alties+0.135
ingu+0.131
��+0.131
 oun+0.127
iqueness+0.122
Neg logit contributions
 Ceres-0.125
 vig-0.108
§-0.106
 Stall-0.104
 Equality-0.103
Token elbow
Token ID22662
Feature activation-2.96e+00

Pos logit contributions
alties+0.179
��+0.174
ingu+0.174
 oun+0.169
iqueness+0.163
Neg logit contributions
 Ceres-0.147
 vig-0.124
§-0.123
 Stall-0.120
 Equality-0.119
Token reconstruction
Token ID25056
Feature activation-6.16e-01

Pos logit contributions
alties+3.486
ingu+3.281
 oun+3.210
��+3.203
cond+3.039
Neg logit contributions
 Ceres-5.687
§-4.949
 vig-4.936
 Stall-4.906
 Equality-4.852
Token 15
Token ID1315
Feature activation+3.30e-01

Pos logit contributions
alties+0.948
ingu+0.914
��+0.906
 oun+0.884
cond+0.849
Neg logit contributions
 Ceres-0.988
 vig-0.849
§-0.845
 Stall-0.826
 Equality-0.819
Token months
Token ID1933
Feature activation+1.25e-01

Pos logit contributions
 Ceres+0.478
 vig+0.406
§+0.403
 Stall+0.391
 Equality+0.388
Neg logit contributions
alties-0.558
ingu-0.541
��-0.541
 oun-0.524
iqueness-0.508
Token earlier
Token ID2961
Feature activation+7.35e-01

Pos logit contributions
 Ceres+0.186
 vig+0.158
§+0.157
 Stall+0.153
 Equality+0.152
Neg logit contributions
alties-0.206
ingu-0.200
��-0.199
 oun-0.194
cond-0.186
Token,
Token ID11
Feature activation-9.53e-03

Pos logit contributions
 Ceres+1.109
 vig+0.951
§+0.942
 Stall+0.918
 Equality+0.909
Neg logit contributions
alties-1.201
��-1.167
ingu-1.162
 oun-1.130
iqueness-1.084
Token >
Token ID1875
Feature activation-2.22e-02

Pos logit contributions
 Ceres+0.322
 vig+0.267
§+0.265
 Stall+0.257
 Equality+0.255
Neg logit contributions
alties-0.461
ingu-0.447
��-0.446
 oun-0.434
cond-0.420
Token (
Token ID357
Feature activation+2.63e-01

Pos logit contributions
alties+0.041
ingu+0.039
��+0.039
 oun+0.038
iqueness+0.037
Neg logit contributions
 Ceres-0.029
 vig-0.024
§-0.024
 Stall-0.023
 Equality-0.023
Token &
Token ID1222
Feature activation-4.83e-01

Pos logit contributions
 Ceres+0.356
 vig+0.300
§+0.297
 Stall+0.287
 Equality+0.284
Neg logit contributions
alties-0.471
��-0.459
ingu-0.458
 oun-0.444
iqueness-0.429
Token self
Token ID2116
Feature activation-1.28e+00

Pos logit contributions
alties+1.003
ingu+0.977
��+0.973
 oun+0.953
cond+0.926
Neg logit contributions
 Ceres-0.515
 vig-0.407
§-0.403
 Stall-0.387
 Equality-0.382
Token,
Token ID837
Feature activation-2.26e-01

Pos logit contributions
alties+2.018
ingu+1.976
��+1.898
 oun+1.874
cond+1.853
Neg logit contributions
 Ceres-1.975
 vig-1.696
§-1.664
 Stall-1.651
 Equality-1.607
Token ord
Token ID2760
Feature activation-2.95e+00

Pos logit contributions
alties+0.376
��+0.364
ingu+0.364
 oun+0.352
iqueness+0.340
Neg logit contributions
 Ceres-0.336
 vig-0.287
§-0.284
 Stall-0.276
 Equality-0.274
Token :
Token ID1058
Feature activation-1.67e-02

Pos logit contributions
alties+4.359
ingu+4.233
 oun+3.992
��+3.984
cond+3.888
Neg logit contributions
 Ceres-4.805
§-4.156
 vig-4.144
 Stall-4.035
 Robotics-3.986
Token Order
Token ID8284
Feature activation-4.09e-01

Pos logit contributions
alties+0.030
��+0.029
ingu+0.029
 oun+0.028
iqueness+0.027
Neg logit contributions
 Ceres-0.023
 vig-0.019
§-0.019
 Equality-0.018
 Stall-0.018
Tokening
Token ID278
Feature activation-1.22e+00

Pos logit contributions
alties+0.601
ingu+0.579
��+0.575
 oun+0.558
cond+0.535
Neg logit contributions
 Ceres-0.685
 vig-0.594
§-0.590
 Stall-0.577
 Equality-0.572
Token,
Token ID837
Feature activation-6.91e-02

Pos logit contributions
alties+1.841
ingu+1.782
��+1.748
 oun+1.708
cond+1.661
Neg logit contributions
 Ceres-1.963
 vig-1.701
§-1.649
 Stall-1.642
 Equality-1.597
Token _
Token ID4808
Feature activation+4.83e-02

Pos logit contributions
alties+0.112
��+0.109
ingu+0.109
 oun+0.105
iqueness+0.102
Neg logit contributions
 Ceres-0.105
 vig-0.090
§-0.089
 Stall-0.087
 Equality-0.087
Token businesses
Token ID5692
Feature activation+1.24e-01

Pos logit contributions
 Ceres+0.022
 vig+0.019
§+0.019
 Stall+0.018
 Equality+0.018
Neg logit contributions
alties-0.026
ingu-0.025
��-0.025
 oun-0.025
cond-0.024
Token looking
Token ID2045
Feature activation+1.38e+00

Pos logit contributions
 Ceres+0.181
 vig+0.154
§+0.152
 Stall+0.148
 Equality+0.147
Neg logit contributions
alties-0.210
��-0.203
ingu-0.202
 oun-0.197
iqueness-0.190
Token for
Token ID329
Feature activation+3.88e-01

Pos logit contributions
 Ceres+2.021
 vig+1.732
§+1.698
 Stall+1.669
 Equality+1.638
Neg logit contributions
alties-2.291
��-2.231
ingu-2.231
 oun-2.159
iqueness-2.084
Token on
Token ID319
Feature activation+3.08e-01

Pos logit contributions
 Ceres+0.608
 vig+0.527
§+0.517
 Stall+0.509
 Equality+0.506
Neg logit contributions
alties-0.614
��-0.593
ingu-0.592
 oun-0.576
cond-0.551
Token-
Token ID12
Feature activation-2.15e-01

Pos logit contributions
 Ceres+0.497
 vig+0.429
§+0.425
 Stall+0.416
 Equality+0.412
Neg logit contributions
alties-0.472
ingu-0.455
��-0.455
 oun-0.441
cond-0.422
Tokenprem
Token ID31605
Feature activation-2.94e+00

Pos logit contributions
alties+0.269
ingu+0.257
��+0.256
 oun+0.247
cond+0.233
Neg logit contributions
 Ceres-0.408
 vig-0.360
§-0.358
 Stall-0.351
 Equality-0.349
Tokenise
Token ID786
Feature activation-8.62e-01

Pos logit contributions
alties+4.554
ingu+4.508
��+4.294
 oun+4.262
cond+4.124
Neg logit contributions
 Ceres-4.506
 vig-3.820
§-3.819
 Stall-3.818
 Equality-3.705
Token deployment
Token ID14833
Feature activation+1.98e-01

Pos logit contributions
alties+1.326
ingu+1.278
��+1.271
 oun+1.235
cond+1.187
Neg logit contributions
 Ceres-1.381
 vig-1.188
§-1.184
 Stall-1.154
 Equality-1.143
Token options
Token ID3689
Feature activation-4.33e-02

Pos logit contributions
 Ceres+0.309
 vig+0.267
§+0.262
 Equality+0.256
 Stall+0.256
Neg logit contributions
alties-0.315
ingu-0.306
��-0.305
 oun-0.295
iqueness-0.285
Token happy
Token ID3772
Feature activation+8.29e-01

Pos logit contributions
alties+0.070
ingu+0.068
��+0.068
 oun+0.066
iqueness+0.064
Neg logit contributions
 Ceres-0.066
 vig-0.057
§-0.056
 Stall-0.055
 Equality-0.055
Token and
Token ID290
Feature activation+6.93e-02

Pos logit contributions
 Ceres+1.306
 vig+1.127
§+1.118
 Stall+1.086
 Equality+1.076
Neg logit contributions
alties-1.298
��-1.266
ingu-1.254
 oun-1.225
iqueness-1.170
Token announced
Token ID3414
Feature activation+6.98e-01

Pos logit contributions
alties+0.141
ingu+0.137
��+0.136
 oun+0.133
iqueness+0.128
Neg logit contributions
 Ceres-0.114
 vig-0.096
§-0.096
 Stall-0.093
 Equality-0.092
Token an
Token ID281
Feature activation+9.00e-02

Pos logit contributions
 Ceres+0.993
 vig+0.836
§+0.830
 Stall+0.806
 Equality+0.801
Neg logit contributions
alties-1.199
ingu-1.174
��-1.172
 oun-1.132
iqueness-1.093
Token alliance
Token ID12801
Feature activation+8.37e-01

Pos logit contributions
 Ceres+0.150
 vig+0.129
§+0.128
 Equality+0.125
 Stall+0.124
Neg logit contributions
alties-0.134
��-0.131
ingu-0.130
 oun-0.126
cond-0.121
Token with
Token ID351
Feature activation-1.81e-01

Pos logit contributions
 Ceres+1.329
 vig+1.144
§+1.142
 Stall+1.109
 Equality+1.100
Neg logit contributions
alties-1.302
��-1.258
ingu-1.256
 oun-1.216
cond-1.167
Token B
Token ID347
Feature activation-9.17e-01

Pos logit contributions
alties+0.307
ingu+0.297
��+0.296
 oun+0.288
cond+0.277
Neg logit contributions
 Ceres-0.263
 vig-0.223
§-0.221
 Stall-0.216
 Equality-0.214
Tokenibli
Token ID29142
Feature activation-2.92e+00

Pos logit contributions
alties+1.375
ingu+1.327
��+1.306
 oun+1.268
cond+1.224
Neg logit contributions
 Ceres-1.517
 vig-1.309
§-1.292
 Stall-1.274
 Equality-1.260
Tokenothe
Token ID20388
Feature activation-2.66e-01

Pos logit contributions
alties+3.610
ingu+3.597
��+3.322
 oun+3.278
cond+3.181
Neg logit contributions
 Ceres-5.442
 vig-4.757
§-4.624
 Equality-4.576
 Stall-4.574
Tokenca
Token ID6888
Feature activation+5.77e-02

Pos logit contributions
alties+0.273
ingu+0.259
��+0.257
 oun+0.246
cond+0.231
Neg logit contributions
 Ceres-0.563
 vig-0.504
§-0.502
 Stall-0.493
 Equality-0.490
Token to
Token ID284
Feature activation-1.86e-01

Pos logit contributions
 Ceres+0.085
 vig+0.072
§+0.071
 Stall+0.069
 Equality+0.069
Neg logit contributions
alties-0.097
ingu-0.094
��-0.093
 oun-0.091
cond-0.088
Token directly
Token ID3264
Feature activation-8.34e-01

Pos logit contributions
alties+0.301
��+0.292
ingu+0.290
 oun+0.282
iqueness+0.272
Neg logit contributions
 Ceres-0.284
 vig-0.243
§-0.242
 Stall-0.236
 Equality-0.234
Token sell
Token ID3677
Feature activation-6.44e-01

Pos logit contributions
alties+1.373
ingu+1.328
��+1.322
 oun+1.287
cond+1.238
Neg logit contributions
 Ceres-1.249
 vig-1.063
§-1.057
 Stall-1.029
 Equality-1.020
Token who
Token ID508
Feature activation+6.10e-01

Pos logit contributions
alties+0.753
ingu+0.725
��+0.722
 oun+0.700
cond+0.670
Neg logit contributions
 Ceres-0.855
 vig-0.742
§-0.738
 Stall-0.721
 Equality-0.715
Token suffer
Token ID8659
Feature activation-6.45e-01

Pos logit contributions
 Ceres+0.878
 vig+0.755
§+0.739
 Stall+0.716
 Equality+0.705
Neg logit contributions
alties-1.044
��-1.024
ingu-1.002
 oun-0.985
iqueness-0.944
Token from
Token ID422
Feature activation+2.89e-01

Pos logit contributions
alties+0.979
ingu+0.943
��+0.940
 oun+0.912
iqueness+0.874
Neg logit contributions
 Ceres-1.050
 vig-0.908
§-0.902
 Stall-0.881
 Equality-0.873
Token overly
Token ID17698
Feature activation-1.15e+00

Pos logit contributions
 Ceres+0.462
 vig+0.403
§+0.395
 Stall+0.387
 Equality+0.383
Neg logit contributions
alties-0.446
��-0.432
ingu-0.430
 oun-0.417
iqueness-0.400
Token-
Token ID12
Feature activation-7.19e-02

Pos logit contributions
alties+1.699
ingu+1.629
��+1.627
 oun+1.568
iqueness+1.509
Neg logit contributions
 Ceres-1.939
 vig-1.696
§-1.676
 Stall-1.638
 Equality-1.620
Tokenpres
Token ID18302
Feature activation-2.92e+00

Pos logit contributions
alties+0.098
��+0.095
ingu+0.093
 oun+0.092
aughs+0.086
Neg logit contributions
 Ceres-0.127
 vig-0.113
§-0.113
 Stall-0.109
 Equality-0.107
Tokencript
Token ID6519
Feature activation+2.12e-01

Pos logit contributions
alties+2.377
ingu+2.191
cond+2.029
��+2.027
 oun+2.022
Neg logit contributions
 Ceres-6.712
 vig-6.051
 Equality-5.988
§-5.976
 Stall-5.897
Tokenive
Token ID425
Feature activation-5.45e-01

Pos logit contributions
 Ceres+0.365
 vig+0.321
§+0.317
 Stall+0.309
 Equality+0.304
Neg logit contributions
alties-0.307
��-0.297
 oun-0.285
ingu-0.284
iqueness-0.272
Token and
Token ID290
Feature activation-1.74e-01

Pos logit contributions
alties+0.800
��+0.773
ingu+0.772
 oun+0.746
iqueness+0.715
Neg logit contributions
 Ceres-0.915
 vig-0.797
§-0.789
 Stall-0.772
 Equality-0.764
Token overs
Token ID10753
Feature activation-3.73e-01

Pos logit contributions
alties+0.273
��+0.264
ingu+0.262
 oun+0.254
iqueness+0.245
Neg logit contributions
 Ceres-0.276
 vig-0.240
§-0.236
 Stall-0.230
 Equality-0.226
Tokenimpl
Token ID23928
Feature activation-6.12e-01

Pos logit contributions
alties+0.470
ingu+0.450
��+0.446
 oun+0.431
cond+0.410
Neg logit contributions
 Ceres-0.704
 vig-0.621
§-0.618
 Stall-0.606
 Equality-0.602
Token game
Token ID983
Feature activation-3.03e-01

Pos logit contributions
 Ceres+0.127
 vig+0.109
§+0.108
 Stall+0.106
 Equality+0.105
Neg logit contributions
alties-0.130
��-0.125
ingu-0.125
 oun-0.122
iqueness-0.117
Token in
Token ID287
Feature activation-6.40e-02

Pos logit contributions
alties+0.503
ingu+0.486
��+0.484
 oun+0.471
cond+0.454
Neg logit contributions
 Ceres-0.451
 vig-0.384
§-0.381
 Stall-0.371
 Equality-0.368
Token charge
Token ID3877
Feature activation+7.27e-02

Pos logit contributions
alties+0.141
ingu+0.137
��+0.137
 oun+0.134
cond+0.130
Neg logit contributions
 Ceres-0.060
 vig-0.046
§-0.046
 Stall-0.044
 Equality-0.043
Token at
Token ID379
Feature activation-6.54e-01

Pos logit contributions
 Ceres+0.103
 vig+0.087
§+0.087
 Stall+0.084
 Equality+0.084
Neg logit contributions
alties-0.125
ingu-0.121
��-0.121
 oun-0.118
cond-0.113
Token Old
Token ID5706
Feature activation-7.54e-01

Pos logit contributions
alties+0.889
ingu+0.854
��+0.848
 oun+0.821
cond+0.784
Neg logit contributions
 Ceres-1.164
 vig-1.019
§-1.016
 Stall-0.991
 Equality-0.985
Token Trafford
Token ID41107
Feature activation-2.87e+00

Pos logit contributions
alties+0.648
ingu+0.609
��+0.608
 oun+0.578
iqueness+0.527
Neg logit contributions
 Ceres-1.724
 vig-1.555
§-1.551
 Stall-1.527
 Equality-1.519
Token against
Token ID1028
Feature activation+6.21e-02

Pos logit contributions
alties+4.237
ingu+4.080
��+3.922
 oun+3.882
cond+3.765
Neg logit contributions
 Ceres-4.635
§-3.992
 vig-3.987
 Stall-3.877
 Equality-3.789
Token Swansea
Token ID36019
Feature activation-7.67e-02

Pos logit contributions
 Ceres+0.096
 vig+0.082
§+0.082
 Stall+0.080
 Equality+0.079
Neg logit contributions
alties-0.100
ingu-0.096
��-0.096
 oun-0.093
iqueness-0.089
Token on
Token ID319
Feature activation+1.46e-01

Pos logit contributions
alties+0.124
ingu+0.120
��+0.120
 oun+0.116
iqueness+0.112
Neg logit contributions
 Ceres-0.117
 vig-0.100
§-0.099
 Stall-0.097
 Equality-0.097
Token Sunday
Token ID3502
Feature activation+1.02e-01

Pos logit contributions
 Ceres+0.183
 vig+0.150
§+0.148
 Stall+0.144
 Equality+0.143
Neg logit contributions
alties-0.278
��-0.270
ingu-0.270
 oun-0.264
iqueness-0.254
Token by
Token ID416
Feature activation-5.02e-01

Pos logit contributions
 Ceres+0.154
 vig+0.131
§+0.130
 Stall+0.127
 Equality+0.126
Neg logit contributions
alties-0.168
ingu-0.162
��-0.162
 oun-0.157
iqueness-0.151
Token)
Token ID8
Feature activation+3.93e-01

Pos logit contributions
 Ceres+0.785
 vig+0.683
§+0.681
 Stall+0.663
 Equality+0.660
Neg logit contributions
alties-0.665
��-0.638
ingu-0.636
 oun-0.618
iqueness-0.591
Token
Token ID198
Feature activation+1.72e-01

Pos logit contributions
 Ceres+0.597
 vig+0.510
§+0.504
 Stall+0.492
 Equality+0.488
Neg logit contributions
alties-0.639
ingu-0.618
��-0.612
 oun-0.597
cond-0.575
Token
Token ID198
Feature activation+2.40e-01

Pos logit contributions
 Ceres+0.264
§+0.230
 vig+0.225
 Stall+0.220
 Equality+0.219
Neg logit contributions
��-0.279
 oun-0.278
alties-0.277
ingu-0.269
iqueness-0.251
TokenAndre
Token ID31258
Feature activation-8.67e-01

Pos logit contributions
 Ceres+0.377
§+0.330
 vig+0.326
 Stall+0.317
 Equality+0.315
Neg logit contributions
alties-0.375
��-0.363
ingu-0.361
 oun-0.356
iqueness-0.336
Token R
Token ID371
Feature activation-2.71e-01

Pos logit contributions
alties+1.122
ingu+1.086
��+1.056
 oun+1.027
cond+0.988
Neg logit contributions
 Ceres-1.600
 vig-1.405
§-1.384
 Stall-1.360
 Equality-1.358
Tokeningu
Token ID6680
Feature activation-2.87e+00

Pos logit contributions
alties+0.338
ingu+0.329
��+0.314
 oun+0.305
cond+0.293
Neg logit contributions
 Ceres-0.517
 vig-0.455
§-0.448
 Stall-0.443
 Equality-0.442
Tokenette
Token ID5857
Feature activation-3.73e-01

Pos logit contributions
alties+4.149
ingu+4.072
��+3.824
 oun+3.738
ien+3.692
Neg logit contributions
 Ceres-4.820
 vig-4.225
 Equality-4.055
§-4.051
 Stall-4.044
Token/
Token ID14
Feature activation-1.52e-01

Pos logit contributions
alties+0.655
ingu+0.637
��+0.628
 oun+0.615
cond+0.594
Neg logit contributions
 Ceres-0.520
 vig-0.435
§-0.429
 Stall-0.417
 Equality-0.416
TokenGetty
Token ID6633
Feature activation-3.51e-01

Pos logit contributions
alties+0.222
ingu+0.213
��+0.213
 oun+0.207
iqueness+0.197
Neg logit contributions
 Ceres-0.258
 vig-0.224
§-0.223
 Stall-0.218
 Equality-0.216
Token Images
Token ID5382
Feature activation+1.05e+00

Pos logit contributions
alties+0.580
ingu+0.560
��+0.554
 oun+0.539
iqueness+0.519
Neg logit contributions
 Ceres-0.524
 vig-0.449
§-0.443
 Stall-0.431
 Equality-0.427
Token
Token ID198
Feature activation+1.25e-01

Pos logit contributions
 Ceres+1.636
§+1.404
 vig+1.403
 Stall+1.366
 Equality+1.362
Neg logit contributions
alties-1.643
��-1.609
ingu-1.587
 oun-1.554
iqueness-1.483
Token United
Token ID1578
Feature activation-6.79e-01

Pos logit contributions
 Ceres+0.078
 vig+0.067
§+0.066
 Stall+0.065
 Equality+0.064
Neg logit contributions
alties-0.086
ingu-0.083
��-0.083
 oun-0.081
cond-0.078
Token States
Token ID1829
Feature activation-1.33e-02

Pos logit contributions
alties+1.327
ingu+1.292
��+1.289
 oun+1.261
cond+1.218
Neg logit contributions
 Ceres-0.808
 vig-0.657
§-0.653
 Stall-0.630
 Equality-0.623
Token,
Token ID11
Feature activation+4.89e-02

Pos logit contributions
alties+0.020
ingu+0.019
��+0.019
 oun+0.018
cond+0.017
Neg logit contributions
 Ceres-0.022
 vig-0.019
§-0.019
 Stall-0.019
 Equality-0.019
Token 2010
Token ID3050
Feature activation-1.25e+00

Pos logit contributions
 Ceres+0.071
 vig+0.060
§+0.060
 Stall+0.058
 Equality+0.058
Neg logit contributions
alties-0.083
ingu-0.080
��-0.080
 oun-0.078
iqueness-0.075
Token.
Token ID13
Feature activation+1.82e-01

Pos logit contributions
alties+2.309
ingu+2.247
��+2.203
 oun+2.174
cond+2.124
Neg logit contributions
 Ceres-1.573
 vig-1.296
 Stall-1.259
§-1.250
 Equality-1.226
Token Pres
Token ID1763
Feature activation-2.87e+00

Pos logit contributions
 Ceres+0.273
 vig+0.233
§+0.232
 Stall+0.225
 Equality+0.223
Neg logit contributions
alties-0.300
ingu-0.290
��-0.288
 oun-0.280
cond-0.270
Tokenented
Token ID4714
Feature activation-6.72e-01

Pos logit contributions
alties+2.217
ingu+2.076
��+1.961
 oun+1.908
cond+1.852
Neg logit contributions
 Ceres-6.612
 vig-6.058
§-5.932
 Equality-5.926
 Stall-5.899
Token at
Token ID379
Feature activation-7.90e-01

Pos logit contributions
alties+1.210
ingu+1.176
��+1.161
 oun+1.137
cond+1.106
Neg logit contributions
 Ceres-0.897
 vig-0.748
§-0.742
 Stall-0.722
 Equality-0.715
Token the
Token ID262
Feature activation-2.93e-01

Pos logit contributions
alties+1.277
ingu+1.244
��+1.223
 oun+1.194
cond+1.157
Neg logit contributions
 Ceres-1.183
 vig-1.013
§-1.008
 Stall-0.981
 Equality-0.969
Token 57
Token ID7632
Feature activation+1.60e-01

Pos logit contributions
alties+0.497
ingu+0.481
��+0.479
 oun+0.467
cond+0.449
Neg logit contributions
 Ceres-0.423
 vig-0.358
§-0.356
 Stall-0.346
 Equality-0.343
Tokenth
Token ID400
Feature activation+4.86e-01

Pos logit contributions
 Ceres+0.275
 vig+0.240
§+0.239
 Stall+0.233
 Equality+0.232
Neg logit contributions
alties-0.227
ingu-0.218
��-0.217
 oun-0.211
cond-0.201
Token problems
Token ID2761
Feature activation+1.35e-01

Pos logit contributions
alties+0.031
��+0.030
ingu+0.030
 oun+0.029
iqueness+0.028
Neg logit contributions
 Ceres-0.030
 vig-0.026
§-0.026
 Stall-0.025
 Equality-0.025
Token such
Token ID884
Feature activation-6.34e-01

Pos logit contributions
 Ceres+0.209
 vig+0.179
§+0.179
 Stall+0.173
 Equality+0.172
Neg logit contributions
alties-0.216
��-0.209
ingu-0.208
 oun-0.203
iqueness-0.194
Token as
Token ID355
Feature activation-6.43e-01

Pos logit contributions
alties+1.060
ingu+1.025
��+1.007
 oun+0.988
cond+0.967
Neg logit contributions
 Ceres-0.930
§-0.782
 vig-0.780
 Stall-0.764
 Equality-0.754
Token racism
Token ID10713
Feature activation+1.20e+00

Pos logit contributions
alties+1.125
ingu+1.092
��+1.081
 oun+1.057
cond+1.024
Neg logit contributions
 Ceres-0.893
 vig-0.747
§-0.744
 Stall-0.723
 Equality-0.714
Token,
Token ID11
Feature activation-1.81e-01

Pos logit contributions
 Ceres+1.872
§+1.623
 vig+1.612
 Stall+1.566
 Equality+1.555
Neg logit contributions
alties-1.874
��-1.812
ingu-1.804
 oun-1.757
iqueness-1.683
Token sexism
Token ID24953
Feature activation+2.21e+00

Pos logit contributions
alties+0.331
ingu+0.321
��+0.320
 oun+0.312
cond+0.301
Neg logit contributions
 Ceres-0.240
 vig-0.200
§-0.198
 Stall-0.192
 Equality-0.190
Token,
Token ID11
Feature activation-5.92e-02

Pos logit contributions
 Ceres+3.482
§+3.011
 vig+3.005
 Stall+2.915
 Equality+2.899
Neg logit contributions
alties-3.425
ingu-3.303
��-3.289
 oun-3.202
cond-3.061
Token trans
Token ID1007
Feature activation+8.67e-01

Pos logit contributions
alties+0.098
ingu+0.095
��+0.094
 oun+0.092
cond+0.089
Neg logit contributions
 Ceres-0.088
 vig-0.074
§-0.074
 Stall-0.072
 Equality-0.071
Tokenph
Token ID746
Feature activation+3.73e-01

Pos logit contributions
 Ceres+1.457
 vig+1.273
§+1.270
 Stall+1.234
 Equality+1.231
Neg logit contributions
alties-1.256
ingu-1.211
��-1.210
 oun-1.173
iqueness-1.113
Tokenobia
Token ID30665
Feature activation+3.65e-01

Pos logit contributions
 Ceres+0.808
 vig+0.724
§+0.721
 Stall+0.709
 Equality+0.704
Neg logit contributions
alties-0.366
ingu-0.347
��-0.342
 oun-0.327
cond-0.306
Token,
Token ID11
Feature activation-9.25e-02

Pos logit contributions
 Ceres+0.604
 vig+0.522
§+0.515
 Stall+0.507
 Equality+0.504
Neg logit contributions
alties-0.544
ingu-0.524
��-0.521
 oun-0.505
cond-0.486
Token co
Token ID763
Feature activation+1.22e-01

Pos logit contributions
alties+0.395
ingu+0.382
��+0.381
 oun+0.373
cond+0.358
Neg logit contributions
 Ceres-0.302
 vig-0.255
§-0.251
 Stall-0.245
 Equality-0.242
Token-
Token ID12
Feature activation-4.05e-01

Pos logit contributions
 Ceres+0.206
 vig+0.179
§+0.178
 Stall+0.174
 Equality+0.173
Neg logit contributions
alties-0.176
ingu-0.170
��-0.169
 oun-0.164
cond-0.157
Tokenleader
Token ID27940
Feature activation-1.55e-01

Pos logit contributions
alties+0.516
ingu+0.494
��+0.490
 oun+0.473
cond+0.452
Neg logit contributions
 Ceres-0.757
 vig-0.667
§-0.663
 Stall-0.651
 Equality-0.647
Token of
Token ID286
Feature activation-4.50e-01

Pos logit contributions
alties+0.243
ingu+0.234
��+0.232
 oun+0.225
cond+0.217
Neg logit contributions
 Ceres-0.244
 vig-0.209
§-0.206
 Stall-0.202
 Equality-0.202
Token the
Token ID262
Feature activation-3.74e-01

Pos logit contributions
alties+0.730
ingu+0.706
��+0.703
 oun+0.681
iqueness+0.659
Neg logit contributions
 Ceres-0.679
§-0.579
 vig-0.579
 Stall-0.562
 Equality-0.556
Token GREAT
Token ID37853
Feature activation+2.48e+00

Pos logit contributions
alties+0.699
ingu+0.679
��+0.676
 oun+0.660
cond+0.639
Neg logit contributions
 Ceres-0.476
 vig-0.392
§-0.390
 Stall-0.377
 Equality-0.373
Token3
Token ID18
Feature activation-2.50e-01

Pos logit contributions
 Ceres+2.682
§+2.109
 vig+2.101
 Equality+2.006
 Stall+2.002
Neg logit contributions
alties-5.128
��-4.963
ingu-4.920
 oun-4.902
iqueness-4.700
Token competition
Token ID5449
Feature activation-6.21e-01

Pos logit contributions
alties+0.453
ingu+0.440
��+0.438
 oun+0.428
cond+0.413
Neg logit contributions
 Ceres-0.332
 vig-0.276
§-0.275
 Stall-0.266
 Equality-0.263
Token.
Token ID13
Feature activation-2.65e-01

Pos logit contributions
alties+0.804
ingu+0.772
��+0.757
 oun+0.733
cond+0.703
Neg logit contributions
 Ceres-1.145
 vig-1.006
§-0.995
 Stall-0.983
 Equality-0.976
Token
Token ID198
Feature activation-2.75e-01

Pos logit contributions
alties+0.407
ingu+0.392
��+0.388
 oun+0.377
cond+0.365
Neg logit contributions
 Ceres-0.423
 vig-0.368
§-0.362
 Stall-0.354
 Equality-0.351
Token
Token ID198
Feature activation-1.22e-01

Pos logit contributions
alties+0.431
��+0.423
ingu+0.416
 oun+0.416
iqueness+0.386
Neg logit contributions
 Ceres-0.437
§-0.378
 vig-0.375
 Stall-0.365
 Equality-0.363
Tokenky
Token ID2584
Feature activation+1.76e-01

Pos logit contributions
 Ceres+0.231
 vig+0.202
§+0.201
 Stall+0.197
 Equality+0.195
Neg logit contributions
alties-0.181
ingu-0.174
��-0.174
 oun-0.168
cond-0.160
Token of
Token ID286
Feature activation-3.37e-03

Pos logit contributions
 Ceres+0.293
 vig+0.255
§+0.250
 Equality+0.247
 Stall+0.245
Neg logit contributions
alties-0.260
ingu-0.252
��-0.250
 oun-0.242
cond-0.233
Token the
Token ID262
Feature activation-2.32e-01

Pos logit contributions
alties+0.006
ingu+0.005
��+0.005
 oun+0.005
iqueness+0.005
Neg logit contributions
 Ceres-0.005
 vig-0.004
§-0.004
 Stall-0.004
 Equality-0.004
Token University
Token ID2059
Feature activation-8.74e-01

Pos logit contributions
alties+0.411
ingu+0.399
��+0.397
 oun+0.387
cond+0.374
Neg logit contributions
 Ceres-0.316
 vig-0.266
§-0.264
 Stall-0.256
 Equality-0.253
Token of
Token ID286
Feature activation+4.36e-02

Pos logit contributions
alties+1.403
ingu+1.355
��+1.342
 oun+1.306
cond+1.265
Neg logit contributions
 Ceres-1.340
 vig-1.149
§-1.139
 Stall-1.110
 Equality-1.101
Token Colorado
Token ID7492
Feature activation+2.18e+00

Pos logit contributions
 Ceres+0.052
 vig+0.043
§+0.043
 Stall+0.041
 Equality+0.041
Neg logit contributions
alties-0.084
ingu-0.082
��-0.081
 oun-0.079
cond-0.077
Token Boulder
Token ID27437
Feature activation+5.65e-01

Pos logit contributions
 Ceres+3.136
§+2.612
 vig+2.580
 Stall+2.549
 Robotics+2.499
Neg logit contributions
alties-3.746
��-3.640
ingu-3.597
 oun-3.529
iqueness-3.408
Token,
Token ID11
Feature activation-1.91e-01

Pos logit contributions
 Ceres+0.912
 vig+0.788
§+0.782
 Stall+0.764
 Equality+0.758
Neg logit contributions
alties-0.865
ingu-0.835
��-0.829
 oun-0.806
cond-0.774
Token says
Token ID1139
Feature activation-4.79e-01

Pos logit contributions
alties+0.324
ingu+0.313
��+0.310
 oun+0.303
cond+0.293
Neg logit contributions
 Ceres-0.276
 vig-0.234
§-0.232
 Stall-0.226
 Equality-0.224
Token that
Token ID326
Feature activation-5.62e-02

Pos logit contributions
alties+0.814
ingu+0.789
��+0.787
 oun+0.766
iqueness+0.737
Neg logit contributions
 Ceres-0.694
 vig-0.587
§-0.584
 Stall-0.568
 Equality-0.561
Token his
Token ID465
Feature activation-2.31e-02

Pos logit contributions
alties+0.093
ingu+0.090
��+0.090
 oun+0.088
iqueness+0.084
Neg logit contributions
 Ceres-0.085
 vig-0.072
§-0.071
 Stall-0.069
 Equality-0.068
Token a
Token ID257
Feature activation+1.66e-02

Pos logit contributions
 Ceres+0.655
 vig+0.562
§+0.556
 Stall+0.541
 Equality+0.537
Neg logit contributions
alties-0.717
��-0.695
ingu-0.694
 oun-0.674
iqueness-0.650
Token fundraising
Token ID18066
Feature activation+5.25e-01

Pos logit contributions
 Ceres+0.024
 vig+0.021
§+0.020
 Stall+0.020
 Equality+0.020
Neg logit contributions
alties-0.028
ingu-0.027
��-0.027
 oun-0.026
cond-0.025
Token email
Token ID3053
Feature activation+1.09e+00

Pos logit contributions
 Ceres+0.744
 vig+0.630
§+0.622
 Stall+0.604
 Equality+0.600
Neg logit contributions
alties-0.909
��-0.882
ingu-0.881
 oun-0.854
iqueness-0.824
Token this
Token ID428
Feature activation-3.60e-01

Pos logit contributions
 Ceres+1.463
 vig+1.232
§+1.213
 Stall+1.180
 Equality+1.171
Neg logit contributions
alties-1.954
��-1.933
ingu-1.895
 oun-1.847
iqueness-1.792
Token week
Token ID1285
Feature activation+3.59e-01

Pos logit contributions
alties+0.657
ingu+0.636
��+0.631
 oun+0.620
cond+0.602
Neg logit contributions
 Ceres-0.471
§-0.389
 vig-0.388
 Stall-0.377
 Equality-0.375
Token reminding
Token ID30047
Feature activation+2.41e+00

Pos logit contributions
 Ceres+0.560
 vig+0.483
§+0.479
 Stall+0.466
 Equality+0.461
Neg logit contributions
alties-0.571
��-0.555
ingu-0.550
 oun-0.534
iqueness-0.514
Token voters
Token ID4446
Feature activation-5.37e-01

Pos logit contributions
 Ceres+3.587
 vig+3.094
§+3.008
 Stall+2.962
 Equality+2.958
Neg logit contributions
alties-3.948
��-3.885
ingu-3.876
 oun-3.723
iqueness-3.616
Token that
Token ID326
Feature activation+3.47e-02

Pos logit contributions
alties+0.868
ingu+0.836
��+0.826
 oun+0.812
cond+0.786
Neg logit contributions
 Ceres-0.813
 vig-0.693
§-0.686
 Stall-0.672
 Equality-0.665
Token the
Token ID262
Feature activation+3.67e-02

Pos logit contributions
 Ceres+0.050
 vig+0.043
§+0.042
 Stall+0.041
 Equality+0.041
Neg logit contributions
alties-0.059
ingu-0.057
��-0.057
 oun-0.055
iqueness-0.053
Token last
Token ID938
Feature activation+1.57e+00

Pos logit contributions
 Ceres+0.054
 vig+0.046
§+0.045
 Stall+0.044
 Equality+0.044
Neg logit contributions
alties-0.062
ingu-0.060
��-0.060
 oun-0.058
cond-0.056
Token time
Token ID640
Feature activation+7.96e-04

Pos logit contributions
 Ceres+2.078
 vig+1.745
§+1.719
 Stall+1.671
 Equality+1.649
Neg logit contributions
alties-2.868
ingu-2.802
��-2.787
 oun-2.706
iqueness-2.640
Token longer
Token ID2392
Feature activation+5.03e-01

Pos logit contributions
alties+0.658
ingu+0.639
��+0.637
 oun+0.621
iqueness+0.599
Neg logit contributions
 Ceres-0.496
 vig-0.415
§-0.412
 Stall-0.400
 Equality-0.395
Token-
Token ID12
Feature activation-2.72e-01

Pos logit contributions
 Ceres+0.819
 vig+0.711
§+0.704
 Stall+0.687
 Equality+0.680
Neg logit contributions
alties-0.761
��-0.737
ingu-0.737
 oun-0.710
iqueness-0.681
Tokenterm
Token ID4354
Feature activation-9.38e-01

Pos logit contributions
alties+0.344
ingu+0.327
��+0.320
 oun+0.312
cond+0.304
Neg logit contributions
 Ceres-0.513
 vig-0.451
§-0.448
 Stall-0.440
 Equality-0.440
Token actions
Token ID4028
Feature activation+1.98e-01

Pos logit contributions
alties+1.553
ingu+1.501
��+1.494
 oun+1.455
cond+1.401
Neg logit contributions
 Ceres-1.394
 vig-1.186
§-1.180
 Stall-1.148
 Equality-1.138
Token.
Token ID13
Feature activation-5.46e-01

Pos logit contributions
 Ceres+0.312
 vig+0.268
§+0.267
 Stall+0.259
 Equality+0.257
Neg logit contributions
alties-0.310
ingu-0.300
��-0.299
 oun-0.290
iqueness-0.279
Token Duke
Token ID11083
Feature activation+2.75e+00

Pos logit contributions
alties+0.853
ingu+0.819
��+0.809
 oun+0.790
cond+0.766
Neg logit contributions
 Ceres-0.856
 vig-0.742
§-0.730
 Stall-0.714
 Equality-0.709
Token Energy
Token ID6682
Feature activation+1.33e+00

Pos logit contributions
 Ceres+3.724
§+3.122
 vig+3.097
 Equality+3.016
 Robotics+3.009
Neg logit contributions
alties-4.790
��-4.737
ingu-4.662
 oun-4.547
iqueness-4.442
Token is
Token ID318
Feature activation-3.27e-01

Pos logit contributions
 Ceres+1.861
§+1.566
 vig+1.565
 Equality+1.498
 Stall+1.498
Neg logit contributions
alties-2.304
��-2.276
ingu-2.248
 oun-2.178
iqueness-2.113
Token responsible
Token ID4497
Feature activation-2.11e-01

Pos logit contributions
alties+0.540
��+0.523
ingu+0.522
 oun+0.507
iqueness+0.489
Neg logit contributions
 Ceres-0.494
 vig-0.420
§-0.418
 Stall-0.406
 Equality-0.401
Token for
Token ID329
Feature activation-3.35e-01

Pos logit contributions
alties+0.324
ingu+0.309
��+0.306
 oun+0.300
cond+0.288
Neg logit contributions
 Ceres-0.342
 vig-0.295
§-0.291
 Stall-0.287
 Equality-0.285
Token 21
Token ID2310
Feature activation+5.32e-01

Pos logit contributions
alties+0.539
ingu+0.521
��+0.520
 oun+0.505
iqueness+0.485
Neg logit contributions
 Ceres-0.515
 vig-0.441
§-0.438
 Stall-0.426
 Equality-0.423
Token of
Token ID286
Feature activation-5.99e-03

Pos logit contributions
alties+0.214
ingu+0.207
��+0.206
 oun+0.200
cond+0.192
Neg logit contributions
 Ceres-0.200
 vig-0.171
§-0.170
 Stall-0.165
 Equality-0.164
Token pun
Token ID4000
Feature activation-2.63e-01

Pos logit contributions
alties+0.010
ingu+0.009
��+0.009
 oun+0.009
iqueness+0.009
Neg logit contributions
 Ceres-0.009
 vig-0.008
§-0.008
 Stall-0.007
 Equality-0.007
Tokenks
Token ID591
Feature activation+7.98e-01

Pos logit contributions
alties+0.288
ingu+0.273
��+0.271
 oun+0.260
cond+0.246
Neg logit contributions
 Ceres-0.540
 vig-0.481
§-0.479
 Stall-0.471
 Equality-0.468
Token and
Token ID290
Feature activation+6.39e-02

Pos logit contributions
 Ceres+1.205
 vig+1.041
§+1.026
 Stall+1.000
 Equality+0.991
Neg logit contributions
alties-1.295
��-1.276
ingu-1.254
 oun-1.221
iqueness-1.170
Token the
Token ID262
Feature activation-1.69e-01

Pos logit contributions
 Ceres+0.093
 vig+0.079
§+0.078
 Stall+0.076
 Equality+0.076
Neg logit contributions
alties-0.108
��-0.105
ingu-0.104
 oun-0.101
iqueness-0.098
Token cycle
Token ID6772
Feature activation+6.84e-01

Pos logit contributions
alties+0.287
ingu+0.278
��+0.277
 oun+0.270
iqueness+0.259
Neg logit contributions
 Ceres-0.246
 vig-0.209
§-0.207
 Stall-0.202
 Equality-0.200
Token continued
Token ID3767
Feature activation+1.23e+00

Pos logit contributions
 Ceres+1.021
 vig+0.873
§+0.867
 Stall+0.843
 Equality+0.834
Neg logit contributions
alties-1.131
ingu-1.096
��-1.094
 oun-1.062
iqueness-1.021
Token.
Token ID13
Feature activation+6.20e-02

Pos logit contributions
 Ceres+1.905
 vig+1.660
§+1.629
 Stall+1.590
 Equality+1.571
Neg logit contributions
alties-1.956
��-1.936
ingu-1.882
 oun-1.839
iqueness-1.766
Token No
Token ID1400
Feature activation-6.74e-02

Pos logit contributions
 Ceres+0.097
 vig+0.083
§+0.083
 Stall+0.081
 Equality+0.080
Neg logit contributions
alties-0.099
��-0.096
ingu-0.096
 oun-0.093
iqueness-0.089
Token one
Token ID530
Feature activation-6.73e-01

Pos logit contributions
alties+0.107
ingu+0.103
��+0.103
 oun+0.100
iqueness+0.096
Neg logit contributions
 Ceres-0.105
 vig-0.090
§-0.089
 Stall-0.087
 Equality-0.086
Token was
Token ID373
Feature activation-1.34e-01

Pos logit contributions
alties+1.145
ingu+1.108
��+1.104
 oun+1.075
cond+1.036
Neg logit contributions
 Ceres-0.970
 vig-0.820
§-0.815
 Stall-0.793
 Equality-0.786
Token slight
Token ID3731
Feature activation+1.59e+00

Pos logit contributions
alties+0.197
ingu+0.190
��+0.190
 oun+0.184
iqueness+0.177
Neg logit contributions
 Ceres-0.180
 vig-0.153
§-0.152
 Stall-0.148
 Equality-0.147
Token differences
Token ID5400
Feature activation-2.23e-01

Pos logit contributions
 Ceres+2.073
 vig+1.773
§+1.725
 Stall+1.684
 Equality+1.657
Neg logit contributions
alties-2.848
��-2.799
ingu-2.776
 oun-2.709
iqueness-2.637
Token when
Token ID618
Feature activation-3.36e-01

Pos logit contributions
alties+0.329
ingu+0.317
��+0.315
 oun+0.306
cond+0.293
Neg logit contributions
 Ceres-0.373
 vig-0.323
§-0.322
 Stall-0.314
 Equality-0.312
Token the
Token ID262
Feature activation+3.61e-01

Pos logit contributions
alties+0.554
ingu+0.535
��+0.534
 oun+0.519
iqueness+0.499
Neg logit contributions
 Ceres-0.504
 vig-0.429
§-0.427
 Stall-0.415
 Equality-0.411
Token same
Token ID976
Feature activation+1.03e+00

Pos logit contributions
 Ceres+0.539
 vig+0.460
§+0.456
 Stall+0.445
 Equality+0.440
Neg logit contributions
alties-0.597
ingu-0.578
��-0.576
 oun-0.559
iqueness-0.538
Token measure
Token ID3953
Feature activation+8.42e-01

Pos logit contributions
 Ceres+1.534
 vig+1.320
§+1.302
 Stall+1.267
 Equality+1.251
Neg logit contributions
alties-1.699
ingu-1.646
��-1.640
 oun-1.594
iqueness-1.539
Token is
Token ID318
Feature activation+2.12e-01

Pos logit contributions
 Ceres+1.132
§+0.949
 vig+0.947
 Stall+0.906
 Equality+0.905
Neg logit contributions
alties-1.511
��-1.477
ingu-1.463
 oun-1.429
iqueness-1.376
Token repeated
Token ID5100
Feature activation+3.10e-01

Pos logit contributions
 Ceres+0.336
 vig+0.290
§+0.288
 Stall+0.279
 Equality+0.279
Neg logit contributions
alties-0.333
��-0.324
ingu-0.321
 oun-0.309
iqueness-0.298
Token;
Token ID26
Feature activation-3.08e-01

Pos logit contributions
 Ceres+0.488
 vig+0.421
§+0.421
 Stall+0.407
 Equality+0.404
Neg logit contributions
alties-0.487
��-0.474
ingu-0.469
 oun-0.455
iqueness-0.438
Token etc
Token ID3503
Feature activation-2.03e-01

Pos logit contributions
alties+0.529
ingu+0.512
��+0.510
 oun+0.497
cond+0.479
Neg logit contributions
 Ceres-0.440
 vig-0.372
§-0.369
 Stall-0.359
 Equality-0.356
Token.
Token ID13
Feature activation+1.97e-01

Pos logit contributions
alties+0.259
ingu+0.249
��+0.247
 oun+0.239
cond+0.227
Neg logit contributions
 Ceres-0.380
 vig-0.334
§-0.331
 Stall-0.326
 Equality-0.323
Token love
Token ID1842
Feature activation-7.91e-01

Pos logit contributions
alties+0.577
ingu+0.559
��+0.555
 oun+0.542
cond+0.524
Neg logit contributions
 Ceres-0.464
 vig-0.391
§-0.386
 Stall-0.377
 Equality-0.374
Token you
Token ID345
Feature activation+9.59e-01

Pos logit contributions
alties+1.477
ingu+1.434
��+1.428
 oun+1.394
cond+1.349
Neg logit contributions
 Ceres-1.008
 vig-0.832
§-0.826
 Stall-0.800
 Equality-0.790
Token,
Token ID11
Feature activation+1.27e-01

Pos logit contributions
 Ceres+1.449
§+1.240
 vig+1.237
 Stall+1.199
 Equality+1.186
Neg logit contributions
alties-1.562
��-1.524
ingu-1.514
 oun-1.470
iqueness-1.416
Token man
Token ID582
Feature activation+1.23e+00

Pos logit contributions
 Ceres+0.183
 vig+0.155
§+0.154
 Stall+0.149
 Equality+0.148
Neg logit contributions
alties-0.217
ingu-0.210
��-0.210
 oun-0.204
cond-0.197
Token.
Token ID13
Feature activation+3.83e-02

Pos logit contributions
 Ceres+2.175
 vig+1.906
§+1.904
 Stall+1.855
 Equality+1.840
Neg logit contributions
alties-1.685
ingu-1.620
��-1.619
 oun-1.562
iqueness-1.494
Token You
Token ID921
Feature activation+7.89e-01

Pos logit contributions
 Ceres+0.052
 vig+0.044
§+0.043
 Stall+0.042
 Equality+0.042
Neg logit contributions
alties-0.068
ingu-0.066
��-0.066
 oun-0.064
cond-0.062
Token nailed
Token ID37708
Feature activation-4.18e-01

Pos logit contributions
 Ceres+1.051
§+0.879
 vig+0.877
 Stall+0.847
 Equality+0.835
Neg logit contributions
alties-1.425
��-1.387
ingu-1.384
 oun-1.348
cond-1.297
Token it
Token ID340
Feature activation-2.14e-01

Pos logit contributions
alties+0.717
ingu+0.694
��+0.693
 oun+0.675
iqueness+0.649
Neg logit contributions
 Ceres-0.598
 vig-0.505
§-0.502
 Stall-0.488
 Equality-0.484
Token!
Token ID0
Feature activation+2.32e-01

Pos logit contributions
alties+0.300
ingu+0.289
��+0.288
 oun+0.279
cond+0.266
Neg logit contributions
 Ceres-0.372
 vig-0.324
§-0.323
 Stall-0.316
 Equality-0.313
Token
Token ID447
Feature activation-2.03e-02

Pos logit contributions
 Ceres+0.325
§+0.273
 vig+0.273
 Stall+0.265
 Equality+0.263
Neg logit contributions
alties-0.405
ingu-0.393
��-0.393
 oun-0.382
iqueness-0.368
Token
Token ID251
Feature activation+1.44e-01

Pos logit contributions
alties+0.032
��+0.031
ingu+0.031
 oun+0.030
cond+0.029
Neg logit contributions
 Ceres-0.032
 vig-0.027
§-0.027
 Stall-0.026
 Equality-0.026
Token Atlantic
Token ID10596
Feature activation+8.92e-01

Pos logit contributions
 Ceres+1.096
 vig+0.929
§+0.924
 Stall+0.899
 Equality+0.891
Neg logit contributions
alties-1.263
ingu-1.222
��-1.218
 oun-1.185
cond-1.141
Token and
Token ID290
Feature activation+1.58e-01

Pos logit contributions
 Ceres+1.248
§+1.048
 vig+1.046
 Stall+1.012
 Equality+1.001
Neg logit contributions
alties-1.558
��-1.511
ingu-1.506
 oun-1.464
iqueness-1.412
Token North
Token ID2258
Feature activation+4.39e-01

Pos logit contributions
 Ceres+0.258
 vig+0.224
§+0.223
 Stall+0.217
 Equality+0.216
Neg logit contributions
alties-0.238
ingu-0.229
��-0.227
 oun-0.222
cond-0.212
Token Pacific
Token ID8211
Feature activation+1.37e+00

Pos logit contributions
 Ceres+0.593
 vig+0.490
§+0.490
 Stall+0.479
 Equality+0.471
Neg logit contributions
alties-0.791
��-0.781
ingu-0.765
 oun-0.760
iqueness-0.724
Token O
Token ID440
Feature activation-1.73e-01

Pos logit contributions
 Ceres+1.657
 vig+1.340
§+1.340
 Stall+1.283
 Equality+1.267
Neg logit contributions
alties-2.639
��-2.588
ingu-2.555
 oun-2.489
iqueness-2.424
Tokenceans
Token ID19961
Feature activation+5.80e-01

Pos logit contributions
alties+0.296
��+0.289
ingu+0.287
 oun+0.283
iqueness+0.269
Neg logit contributions
 Ceres-0.249
 vig-0.210
§-0.210
 Stall-0.202
 Equality-0.200
Token,
Token ID11
Feature activation+5.77e-02

Pos logit contributions
 Ceres+0.943
 vig+0.813
§+0.811
 Stall+0.789
 Equality+0.783
Neg logit contributions
alties-0.881
ingu-0.848
��-0.847
 oun-0.820
iqueness-0.786
Token
Token ID447
Feature activation-1.98e-01

Pos logit contributions
 Ceres+0.067
 vig+0.055
§+0.054
 Stall+0.052
 Equality+0.052
Neg logit contributions
alties-0.114
ingu-0.111
��-0.110
 oun-0.108
cond-0.104
Token
Token ID251
Feature activation+2.11e-01

Pos logit contributions
alties+0.334
��+0.328
ingu+0.322
 oun+0.315
iqueness+0.303
Neg logit contributions
 Ceres-0.287
 vig-0.243
§-0.239
 Stall-0.237
 Equality-0.233
Token is
Token ID318
Feature activation+1.55e-01

Pos logit contributions
 Ceres+0.378
 vig+0.331
§+0.331
 Stall+0.322
 Equality+0.321
Neg logit contributions
alties-0.284
��-0.274
ingu-0.273
 oun-0.263
iqueness-0.250
Token now
Token ID783
Feature activation+9.51e-01

Pos logit contributions
 Ceres+0.234
 vig+0.198
§+0.198
 Stall+0.192
 Equality+0.191
Neg logit contributions
alties-0.256
��-0.248
ingu-0.247
 oun-0.240
iqueness-0.231
Token media
Token ID2056
Feature activation+3.37e-01

Pos logit contributions
 Ceres+0.594
 vig+0.504
§+0.500
 Stall+0.486
 Equality+0.484
Neg logit contributions
alties-0.724
��-0.710
ingu-0.706
 oun-0.684
iqueness-0.663
Token selected
Token ID6163
Feature activation-9.98e-01

Pos logit contributions
 Ceres+0.486
 vig+0.413
§+0.406
 Stall+0.396
 Equality+0.393
Neg logit contributions
alties-0.575
��-0.564
ingu-0.557
 oun-0.539
iqueness-0.524
Token him
Token ID683
Feature activation-2.64e-01

Pos logit contributions
alties+1.690
ingu+1.631
��+1.618
 oun+1.583
cond+1.528
Neg logit contributions
 Ceres-1.436
 vig-1.218
§-1.213
 Stall-1.177
 Equality-1.163
Token as
Token ID355
Feature activation+6.35e-02

Pos logit contributions
alties+0.440
ingu+0.425
��+0.420
 oun+0.412
cond+0.398
Neg logit contributions
 Ceres-0.390
 vig-0.331
§-0.328
 Stall-0.320
 Equality-0.318
Token the
Token ID262
Feature activation-1.19e-01

Pos logit contributions
 Ceres+0.095
 vig+0.081
§+0.080
 Stall+0.078
 Equality+0.077
Neg logit contributions
alties-0.105
ingu-0.102
��-0.101
 oun-0.099
cond-0.095
Token ACC
Token ID15859
Feature activation+9.75e-01

Pos logit contributions
alties+0.190
ingu+0.183
��+0.183
 oun+0.177
iqueness+0.170
Neg logit contributions
 Ceres-0.186
 vig-0.160
§-0.159
 Stall-0.155
 Equality-0.154
Token Pre
Token ID3771
Feature activation-7.80e-01

Pos logit contributions
 Ceres+1.444
§+1.240
 vig+1.236
 Stall+1.208
 Equality+1.203
Neg logit contributions
alties-1.622
��-1.565
ingu-1.554
 oun-1.503
iqueness-1.471
Tokenseason
Token ID6230
Feature activation-3.28e-01

Pos logit contributions
alties+0.858
ingu+0.817
��+0.810
 oun+0.774
cond+0.737
Neg logit contributions
 Ceres-1.595
 vig-1.419
§-1.411
 Stall-1.386
 Equality-1.379
Token Rookie
Token ID36595
Feature activation-3.76e-01

Pos logit contributions
alties+0.534
ingu+0.516
��+0.514
 oun+0.500
iqueness+0.481
Neg logit contributions
 Ceres-0.498
 vig-0.425
§-0.423
 Stall-0.412
 Equality-0.409
Token of
Token ID286
Feature activation-1.37e-01

Pos logit contributions
alties+0.593
ingu+0.574
��+0.569
 oun+0.557
cond+0.536
Neg logit contributions
 Ceres-0.587
 vig-0.503
§-0.497
 Stall-0.485
 Equality-0.479
Token the
Token ID262
Feature activation-2.01e-01

Pos logit contributions
alties+0.231
��+0.225
ingu+0.223
 oun+0.221
iqueness+0.209
Neg logit contributions
 Ceres-0.200
§-0.169
 vig-0.168
 Stall-0.164
 Equality-0.163
Token Y
Token ID575
Feature activation-6.79e-01

Pos logit contributions
alties+0.177
ingu+0.170
��+0.170
 oun+0.164
cond+0.158
Neg logit contributions
 Ceres-0.200
 vig-0.174
§-0.173
 Stall-0.170
 Equality-0.168
Tokenanks
Token ID2283
Feature activation+2.67e-01

Pos logit contributions
alties+0.872
ingu+0.836
��+0.830
 oun+0.802
cond+0.762
Neg logit contributions
 Ceres-1.264
 vig-1.113
§-1.108
 Stall-1.085
 Equality-1.078
Token 45
Token ID4153
Feature activation+2.10e-01

Pos logit contributions
 Ceres+0.398
 vig+0.339
§+0.338
 Stall+0.329
 Equality+0.325
Neg logit contributions
alties-0.443
��-0.429
ingu-0.429
 oun-0.416
iqueness-0.401
Token-
Token ID12
Feature activation-4.74e-02

Pos logit contributions
 Ceres+0.340
§+0.293
 vig+0.293
 Stall+0.285
 Equality+0.283
Neg logit contributions
alties-0.321
��-0.313
ingu-0.311
 oun-0.305
iqueness-0.288
Token28
Token ID2078
Feature activation-7.24e-01

Pos logit contributions
alties+0.082
��+0.079
ingu+0.079
 oun+0.077
iqueness+0.074
Neg logit contributions
 Ceres-0.067
§-0.057
 vig-0.057
 Stall-0.055
 Equality-0.055
Token.
Token ID13
Feature activation-9.01e-03

Pos logit contributions
alties+1.113
ingu+1.075
��+1.065
 oun+1.037
cond+1.000
Neg logit contributions
 Ceres-1.155
 vig-0.995
§-0.987
 Stall-0.965
 Equality-0.959
Token They
Token ID1119
Feature activation+1.67e-01

Pos logit contributions
alties+0.014
ingu+0.014
��+0.014
 oun+0.014
iqueness+0.013
Neg logit contributions
 Ceres-0.014
 vig-0.012
§-0.012
 Stall-0.012
 Equality-0.011
Token were
Token ID547
Feature activation-4.69e-02

Pos logit contributions
 Ceres+0.239
 vig+0.203
§+0.202
 Stall+0.196
 Equality+0.194
Neg logit contributions
alties-0.284
ingu-0.275
��-0.274
 oun-0.267
cond-0.257
Token led
Token ID2957
Feature activation+1.94e-01

Pos logit contributions
alties+0.078
ingu+0.075
��+0.075
 oun+0.073
iqueness+0.070
Neg logit contributions
 Ceres-0.070
 vig-0.060
§-0.059
 Stall-0.058
 Equality-0.057
Token not
Token ID407
Feature activation+2.75e-01

Pos logit contributions
 Ceres+0.320
 vig+0.277
§+0.276
 Stall+0.269
 Equality+0.267
Neg logit contributions
alties-0.289
ingu-0.279
��-0.278
 oun-0.269
cond-0.258
Token by
Token ID416
Feature activation-4.44e-01

Pos logit contributions
 Ceres+0.453
 vig+0.392
§+0.389
 Stall+0.380
 Equality+0.378
Neg logit contributions
alties-0.411
ingu-0.396
��-0.393
 oun-0.382
cond-0.366
Tokenalling
Token ID9221
Feature activation-1.10e+00

Pos logit contributions
 Ceres+0.618
 vig+0.543
§+0.542
 Stall+0.529
 Equality+0.526
Neg logit contributions
alties-0.436
ingu-0.418
��-0.417
 oun-0.402
iqueness-0.382
Token together
Token ID1978
Feature activation+4.68e-01

Pos logit contributions
alties+1.744
ingu+1.683
��+1.668
 oun+1.626
cond+1.563
Neg logit contributions
 Ceres-1.713
 vig-1.466
§-1.456
 Stall-1.421
 Equality-1.409
Token in
Token ID287
Feature activation+1.06e-01

Pos logit contributions
 Ceres+0.710
 vig+0.605
§+0.604
 Stall+0.587
 Equality+0.580
Neg logit contributions
alties-0.762
��-0.737
ingu-0.735
 oun-0.715
iqueness-0.686
Token a
Token ID257
Feature activation+3.02e-02

Pos logit contributions
 Ceres+0.160
 vig+0.137
§+0.136
 Stall+0.132
 Equality+0.131
Neg logit contributions
alties-0.174
ingu-0.169
��-0.169
 oun-0.163
iqueness-0.157
Token cr
Token ID1067
Feature activation+3.11e-01

Pos logit contributions
 Ceres+0.047
 vig+0.040
§+0.040
 Stall+0.039
 Equality+0.038
Neg logit contributions
alties-0.049
��-0.047
ingu-0.047
 oun-0.045
iqueness-0.044
Tokenum
Token ID388
Feature activation+6.00e-02

Pos logit contributions
 Ceres+0.605
 vig+0.536
§+0.532
 Stall+0.523
 Equality+0.519
Neg logit contributions
alties-0.374
ingu-0.358
��-0.352
 oun-0.341
cond-0.325
Tokenpled
Token ID10137
Feature activation-9.88e-02

Pos logit contributions
 Ceres+0.093
 vig+0.079
§+0.079
 Stall+0.077
 Equality+0.077
Neg logit contributions
alties-0.096
ingu-0.093
��-0.091
 oun-0.090
cond-0.087
Token mess
Token ID2085
Feature activation-5.30e-01

Pos logit contributions
alties+0.159
ingu+0.153
��+0.153
 oun+0.148
cond+0.143
Neg logit contributions
 Ceres-0.152
 vig-0.130
§-0.129
 Stall-0.126
 Equality-0.125
Token of
Token ID286
Feature activation+2.08e-01

Pos logit contributions
alties+0.833
ingu+0.807
��+0.799
 oun+0.776
cond+0.752
Neg logit contributions
 Ceres-0.827
 vig-0.713
§-0.706
 Stall-0.692
 Equality-0.687
Token mutual
Token ID13584
Feature activation+6.32e-01

Pos logit contributions
 Ceres+0.317
 vig+0.272
§+0.270
 Stall+0.263
 Equality+0.261
Neg logit contributions
alties-0.336
ingu-0.325
��-0.324
 oun-0.315
iqueness-0.303
Token uncertainty
Token ID13479
Feature activation-1.26e+00

Pos logit contributions
 Ceres+0.883
 vig+0.749
§+0.741
 Stall+0.714
 Equality+0.710
Neg logit contributions
alties-1.101
��-1.074
ingu-1.069
 oun-1.041
iqueness-1.002
Token got
Token ID1392
Feature activation+2.00e-02

Pos logit contributions
 Ceres+1.834
§+1.554
 vig+1.543
 Stall+1.495
 Equality+1.473
Neg logit contributions
alties-2.241
��-2.222
ingu-2.171
 oun-2.128
cond-2.042
Token there
Token ID612
Feature activation-7.99e-01

Pos logit contributions
 Ceres+0.030
 vig+0.026
§+0.025
 Stall+0.025
 Equality+0.025
Neg logit contributions
alties-0.033
ingu-0.032
��-0.032
 oun-0.031
cond-0.030
Token in
Token ID287
Feature activation+1.25e-01

Pos logit contributions
alties+1.270
ingu+1.225
��+1.213
 oun+1.186
cond+1.140
Neg logit contributions
 Ceres-1.239
 vig-1.062
§-1.056
 Stall-1.029
 Equality-1.022
Token the
Token ID262
Feature activation+1.00e-01

Pos logit contributions
 Ceres+0.183
 vig+0.155
§+0.154
 Stall+0.150
 Equality+0.149
Neg logit contributions
alties-0.210
��-0.203
ingu-0.203
 oun-0.197
iqueness-0.190
Token last
Token ID938
Feature activation+1.59e+00

Pos logit contributions
 Ceres+0.156
 vig+0.134
§+0.133
 Stall+0.130
 Equality+0.129
Neg logit contributions
alties-0.160
ingu-0.154
��-0.153
 oun-0.149
iqueness-0.144
Token two
Token ID734
Feature activation-3.74e-01

Pos logit contributions
 Ceres+2.317
 vig+1.955
§+1.929
 Stall+1.888
 Equality+1.852
Neg logit contributions
alties-2.723
��-2.654
ingu-2.640
 oun-2.555
iqueness-2.484
Token seasons
Token ID7028
Feature activation-4.87e-01

Pos logit contributions
alties+0.612
ingu+0.591
��+0.589
 oun+0.574
cond+0.552
Neg logit contributions
 Ceres-0.562
 vig-0.479
§-0.476
 Stall-0.464
 Equality-0.460
Token.
Token ID13
Feature activation+1.21e-01

Pos logit contributions
alties+0.779
ingu+0.752
��+0.748
 oun+0.729
cond+0.700
Neg logit contributions
 Ceres-0.751
 vig-0.643
§-0.639
 Stall-0.623
 Equality-0.618
Token Was
Token ID8920
Feature activation+6.05e-01

Pos logit contributions
 Ceres+0.176
§+0.148
 vig+0.147
 Stall+0.144
 Equality+0.142
Neg logit contributions
alties-0.207
��-0.200
ingu-0.200
 oun-0.195
iqueness-0.187
Token there
Token ID612
Feature activation-8.52e-01

Pos logit contributions
 Ceres+0.859
 vig+0.722
§+0.719
 Stall+0.699
 Equality+0.690
Neg logit contributions
alties-1.048
��-1.014
ingu-1.012
 oun-0.985
iqueness-0.951
Token an
Token ID281
Feature activation+1.59e-01

Pos logit contributions
alties+1.262
ingu+1.217
��+1.206
 oun+1.174
cond+1.127
Neg logit contributions
 Ceres-1.413
 vig-1.221
§-1.218
 Stall-1.189
 Equality-1.181
Token help
Token ID1037
Feature activation+8.33e-02

Pos logit contributions
 Ceres+0.575
 vig+0.489
§+0.486
 Stall+0.468
 Equality+0.468
Neg logit contributions
alties-0.643
��-0.632
ingu-0.615
 oun-0.604
iqueness-0.584
Token but
Token ID475
Feature activation+3.42e-01

Pos logit contributions
 Ceres+0.117
 vig+0.098
§+0.097
 Stall+0.094
 Equality+0.093
Neg logit contributions
alties-0.147
��-0.142
ingu-0.142
 oun-0.138
iqueness-0.134
Token feel
Token ID1254
Feature activation-3.12e-01

Pos logit contributions
 Ceres+0.406
 vig+0.331
§+0.329
 Stall+0.316
 Equality+0.312
Neg logit contributions
alties-0.670
��-0.652
ingu-0.650
 oun-0.634
iqueness-0.615
Token that
Token ID326
Feature activation+1.06e-01

Pos logit contributions
alties+0.531
ingu+0.514
��+0.511
 oun+0.499
cond+0.481
Neg logit contributions
 Ceres-0.450
 vig-0.381
§-0.378
 Stall-0.369
 Equality-0.366
Token this
Token ID428
Feature activation+3.48e-02

Pos logit contributions
 Ceres+0.154
 vig+0.131
§+0.130
 Stall+0.126
 Equality+0.125
Neg logit contributions
alties-0.180
��-0.175
ingu-0.174
 oun-0.169
iqueness-0.163
Token rapidly
Token ID8902
Feature activation-7.68e-02

Pos logit contributions
 Ceres+0.051
 vig+0.043
§+0.043
 Stall+0.042
 Equality+0.041
Neg logit contributions
alties-0.059
ingu-0.057
��-0.057
 oun-0.055
iqueness-0.053
Token changing
Token ID5609
Feature activation+2.79e-01

Pos logit contributions
alties+0.127
��+0.123
ingu+0.122
 oun+0.119
iqueness+0.114
Neg logit contributions
 Ceres-0.115
 vig-0.098
§-0.097
 Stall-0.095
 Equality-0.093
Token environment
Token ID2858
Feature activation+7.67e-01

Pos logit contributions
 Ceres+0.427
 vig+0.365
§+0.363
 Stall+0.352
 Equality+0.349
Neg logit contributions
alties-0.453
��-0.440
ingu-0.436
 oun-0.425
iqueness-0.408
Token must
Token ID1276
Feature activation-2.79e-01

Pos logit contributions
 Ceres+1.046
 vig+0.874
§+0.874
 Stall+0.838
 Equality+0.829
Neg logit contributions
alties-1.372
��-1.340
ingu-1.325
 oun-1.299
iqueness-1.244
Token be
Token ID307
Feature activation+3.25e-01

Pos logit contributions
alties+0.469
ingu+0.456
��+0.450
 oun+0.439
cond+0.424
Neg logit contributions
 Ceres-0.410
 vig-0.347
§-0.343
 Stall-0.336
 Equality-0.334
Token tough
Token ID5802
Feature activation-4.51e-01

Pos logit contributions
 Ceres+0.487
 vig+0.413
§+0.411
 Stall+0.398
 Equality+0.394
Neg logit contributions
alties-0.539
ingu-0.520
��-0.519
 oun-0.503
iqueness-0.486
Token
Token ID198
Feature activation-1.11e-04

Pos logit contributions
alties+0.335
��+0.329
ingu+0.324
 oun+0.323
iqueness+0.301
Neg logit contributions
 Ceres-0.340
§-0.294
 vig-0.293
 Stall-0.285
 Equality-0.283
TokenWhy
Token ID5195
Feature activation+7.14e-01

Pos logit contributions
alties+0.000
ingu+0.000
��+0.000
 oun+0.000
iqueness+0.000
Neg logit contributions
 Ceres-0.000
§-0.000
 vig-0.000
 Stall-0.000
 Equality-0.000
Token doesn
Token ID1595
Feature activation+3.62e-01

Pos logit contributions
 Ceres+1.024
 vig+0.861
§+0.859
 Stall+0.844
 Equality+0.831
Neg logit contributions
alties-1.238
��-1.202
ingu-1.192
 oun-1.171
iqueness-1.122
Token
Token ID447
Feature activation-2.22e-01

Pos logit contributions
 Ceres+0.635
§+0.570
 vig+0.555
 Stall+0.541
 Equality+0.540
Neg logit contributions
alties-0.505
��-0.500
 oun-0.496
ingu-0.485
iqueness-0.446
Token
Token ID247
Feature activation-1.26e-01

Pos logit contributions
alties+0.320
ingu+0.308
��+0.307
 oun+0.297
cond+0.284
Neg logit contributions
 Ceres-0.377
 vig-0.328
§-0.326
 Stall-0.319
 Equality-0.317
Tokent
Token ID83
Feature activation+1.97e-01

Pos logit contributions
alties+0.191
ingu+0.184
��+0.183
 oun+0.178
iqueness+0.170
Neg logit contributions
 Ceres-0.206
 vig-0.178
§-0.177
 Stall-0.173
 Equality-0.171
Token the
Token ID262
Feature activation+3.64e-02

Pos logit contributions
 Ceres+0.286
 vig+0.242
§+0.241
 Stall+0.234
 Equality+0.232
Neg logit contributions
alties-0.336
ingu-0.325
��-0.324
 oun-0.316
iqueness-0.304
Token Deep
Token ID10766
Feature activation+3.17e-01

Pos logit contributions
 Ceres+0.056
 vig+0.048
§+0.047
 Stall+0.046
 Equality+0.046
Neg logit contributions
alties-0.059
ingu-0.057
��-0.056
 oun-0.055
cond-0.053
Token State
Token ID1812
Feature activation+4.34e-01

Pos logit contributions
 Ceres+0.370
 vig+0.301
§+0.295
 Stall+0.281
 Equality+0.281
Neg logit contributions
alties-0.626
ingu-0.609
��-0.605
 oun-0.592
cond-0.576
Token like
Token ID588
Feature activation-2.14e-01

Pos logit contributions
 Ceres+0.642
 vig+0.548
§+0.548
 Stall+0.536
 Equality+0.530
Neg logit contributions
alties-0.722
��-0.703
ingu-0.700
 oun-0.679
iqueness-0.657
Token Trump
Token ID1301
Feature activation-4.22e-01

Pos logit contributions
alties+0.357
ingu+0.346
��+0.345
 oun+0.336
iqueness+0.323
Neg logit contributions
 Ceres-0.316
 vig-0.268
§-0.266
 Stall-0.260
 Equality-0.257
Token suffered
Token ID6989
Feature activation-7.43e-01

Pos logit contributions
alties+0.335
ingu+0.323
��+0.321
 oun+0.314
cond+0.303
Neg logit contributions
 Ceres-0.284
 vig-0.240
§-0.239
 Stall-0.232
 Equality-0.230
Token a
Token ID257
Feature activation-1.05e-01

Pos logit contributions
alties+1.120
ingu+1.076
��+1.071
 oun+1.041
cond+1.006
Neg logit contributions
 Ceres-1.207
 vig-1.040
§-1.038
 Stall-1.012
 Equality-1.005
Token rupt
Token ID31580
Feature activation-2.00e-01

Pos logit contributions
alties+0.164
ingu+0.158
��+0.158
 oun+0.153
iqueness+0.147
Neg logit contributions
 Ceres-0.165
 vig-0.142
§-0.141
 Stall-0.138
 Equality-0.137
Tokenured
Token ID1522
Feature activation-1.19e+00

Pos logit contributions
alties+0.374
��+0.363
ingu+0.362
 oun+0.354
iqueness+0.341
Neg logit contributions
 Ceres-0.256
 vig-0.211
§-0.210
 Stall-0.203
 Equality-0.200
Token Achilles
Token ID39561
Feature activation-3.55e-01

Pos logit contributions
alties+1.935
ingu+1.902
��+1.855
 oun+1.834
cond+1.769
Neg logit contributions
 Ceres-1.757
 vig-1.503
§-1.495
 Stall-1.452
 Equality-1.443
Token tendon
Token ID42205
Feature activation-2.39e+00

Pos logit contributions
alties+0.570
ingu+0.555
��+0.549
 oun+0.537
cond+0.518
Neg logit contributions
 Ceres-0.542
 vig-0.461
§-0.459
 Stall-0.446
 Equality-0.443
Token in
Token ID287
Feature activation-1.55e-01

Pos logit contributions
alties+3.640
ingu+3.516
��+3.419
 oun+3.372
cond+3.345
Neg logit contributions
 Ceres-3.706
 vig-3.181
§-3.151
 Equality-3.099
 Stall-3.077
Token Thursday
Token ID3635
Feature activation+1.54e+00

Pos logit contributions
alties+0.306
ingu+0.297
��+0.296
 oun+0.289
cond+0.281
Neg logit contributions
 Ceres-0.182
 vig-0.148
§-0.147
 Stall-0.142
 Equality-0.140
Token's
Token ID338
Feature activation-8.74e-02

Pos logit contributions
 Ceres+2.268
 vig+1.934
§+1.919
 Stall+1.868
 Equality+1.844
Neg logit contributions
alties-2.558
ingu-2.479
��-2.471
 oun-2.403
iqueness-2.317
Token game
Token ID983
Feature activation+1.06e-01

Pos logit contributions
alties+0.135
ingu+0.130
��+0.129
 oun+0.126
cond+0.121
Neg logit contributions
 Ceres-0.140
 vig-0.121
§-0.120
 Stall-0.117
 Equality-0.116
Token against
Token ID1028
Feature activation+2.71e-01

Pos logit contributions
 Ceres+0.167
 vig+0.143
§+0.142
 Stall+0.139
 Equality+0.137
Neg logit contributions
alties-0.168
ingu-0.162
��-0.161
 oun-0.157
iqueness-0.150
Token was
Token ID373
Feature activation+9.84e-02

Pos logit contributions
alties+0.415
ingu+0.402
��+0.400
 oun+0.390
cond+0.377
Neg logit contributions
 Ceres-0.332
 vig-0.279
§-0.277
 Stall-0.270
 Equality-0.267
Token too
Token ID1165
Feature activation-6.40e-01

Pos logit contributions
 Ceres+0.146
 vig+0.124
§+0.124
 Stall+0.119
 Equality+0.118
Neg logit contributions
alties-0.164
ingu-0.158
��-0.158
 oun-0.154
iqueness-0.148
Token full
Token ID1336
Feature activation+1.01e+00

Pos logit contributions
alties+1.001
ingu+0.965
��+0.961
 oun+0.935
iqueness+0.898
Neg logit contributions
 Ceres-1.016
 vig-0.877
§-0.870
 Stall-0.846
 Equality-0.839
Token,
Token ID11
Feature activation-7.18e-02

Pos logit contributions
 Ceres+1.531
 vig+1.310
§+1.303
 Stall+1.266
 Equality+1.255
Neg logit contributions
alties-1.644
��-1.589
ingu-1.589
 oun-1.541
iqueness-1.482
Token and
Token ID290
Feature activation+1.17e-01

Pos logit contributions
alties+0.130
ingu+0.127
��+0.126
 oun+0.123
cond+0.119
Neg logit contributions
 Ceres-0.095
 vig-0.079
§-0.079
 Stall-0.076
 Equality-0.076
Token Gl
Token ID2671
Feature activation-1.64e+00

Pos logit contributions
 Ceres+0.170
 vig+0.144
§+0.142
 Stall+0.139
 Equality+0.137
Neg logit contributions
alties-0.199
��-0.196
ingu-0.195
 oun-0.189
iqueness-0.181
Tokenü
Token ID9116
Feature activation+2.59e-01

Pos logit contributions
alties+1.986
ingu+1.898
��+1.857
 oun+1.787
cond+1.720
Neg logit contributions
 Ceres-3.181
 vig-2.813
§-2.786
 Stall-2.753
 Equality-2.724
Tokenck
Token ID694
Feature activation+9.73e-01

Pos logit contributions
 Ceres+0.389
§+0.337
 vig+0.328
 Equality+0.320
 Stall+0.320
Neg logit contributions
��-0.426
alties-0.426
 oun-0.422
ingu-0.409
iqueness-0.383
Token a
Token ID257
Feature activation-4.13e-03

Pos logit contributions
 Ceres+1.340
 vig+1.135
§+1.134
 Stall+1.086
 Equality+1.069
Neg logit contributions
alties-1.702
��-1.667
ingu-1.645
 oun-1.605
iqueness-1.552
Token poet
Token ID21810
Feature activation-1.28e-01

Pos logit contributions
alties+0.007
ingu+0.006
��+0.006
 oun+0.006
cond+0.006
Neg logit contributions
 Ceres-0.006
 vig-0.006
§-0.006
 Stall-0.005
 Equality-0.005
Token for
Token ID329
Feature activation+2.38e-02

Pos logit contributions
alties+0.203
ingu+0.196
��+0.195
 oun+0.190
cond+0.182
Neg logit contributions
 Ceres-0.200
 vig-0.172
§-0.171
 Stall-0.167
 Equality-0.165
Token restaurants
Token ID10808
Feature activation-6.67e-02

Pos logit contributions
 Ceres+1.566
 vig+1.345
§+1.315
 Stall+1.303
 Equality+1.274
Neg logit contributions
alties-1.712
��-1.679
ingu-1.669
 oun-1.616
iqueness-1.560
Token and
Token ID290
Feature activation+9.24e-02

Pos logit contributions
alties+0.108
ingu+0.104
��+0.103
 oun+0.101
cond+0.097
Neg logit contributions
 Ceres-0.102
 vig-0.087
§-0.086
 Stall-0.084
 Equality-0.083
Token shops
Token ID12437
Feature activation-9.38e-01

Pos logit contributions
 Ceres+0.148
 vig+0.128
§+0.126
 Stall+0.124
 Equality+0.122
Neg logit contributions
alties-0.144
��-0.139
ingu-0.138
 oun-0.134
iqueness-0.129
Token set
Token ID900
Feature activation+5.96e-01

Pos logit contributions
alties+1.426
ingu+1.373
��+1.351
 oun+1.320
cond+1.280
Neg logit contributions
 Ceres-1.517
 vig-1.308
§-1.298
 Stall-1.263
 Equality-1.259
Token among
Token ID1871
Feature activation-2.19e-01

Pos logit contributions
 Ceres+0.871
 vig+0.740
§+0.737
 Stall+0.717
 Equality+0.708
Neg logit contributions
alties-1.002
��-0.970
ingu-0.970
 oun-0.941
iqueness-0.905
Token natural
Token ID3288
Feature activation-1.53e+00

Pos logit contributions
alties+0.358
ingu+0.346
��+0.345
 oun+0.335
cond+0.322
Neg logit contributions
 Ceres-0.330
 vig-0.282
§-0.280
 Stall-0.273
 Equality-0.270
Token activities
Token ID4568
Feature activation+1.25e+00

Pos logit contributions
alties+2.560
ingu+2.468
��+2.431
 oun+2.378
cond+2.328
Neg logit contributions
 Ceres-2.209
§-1.890
 vig-1.847
 Equality-1.825
 Stall-1.814
Token,
Token ID11
Feature activation-7.63e-02

Pos logit contributions
 Ceres+1.883
 vig+1.607
§+1.598
 Stall+1.554
 Equality+1.531
Neg logit contributions
alties-2.053
��-2.009
ingu-1.990
 oun-1.939
iqueness-1.863
Token and
Token ID290
Feature activation-1.36e-02

Pos logit contributions
alties+0.133
ingu+0.129
��+0.128
 oun+0.125
cond+0.121
Neg logit contributions
 Ceres-0.107
 vig-0.090
§-0.089
 Stall-0.087
 Equality-0.086
Token the
Token ID262
Feature activation-1.93e-01

Pos logit contributions
alties+0.023
��+0.022
ingu+0.022
 oun+0.021
iqueness+0.020
Neg logit contributions
 Ceres-0.020
 vig-0.017
§-0.017
 Stall-0.017
 Equality-0.017
Token M
Token ID337
Feature activation-8.05e-01

Pos logit contributions
alties+0.311
ingu+0.301
��+0.300
 oun+0.291
iqueness+0.280
Neg logit contributions
 Ceres-0.298
 vig-0.255
§-0.253
 Stall-0.248
 Equality-0.245
Token-
Token ID12
Feature activation-3.38e-01

Pos logit contributions
 Ceres+0.111
 vig+0.094
§+0.094
 Stall+0.091
 Equality+0.090
Neg logit contributions
alties-0.129
ingu-0.125
��-0.125
 oun-0.122
iqueness-0.117
Tokenfloor
Token ID28300
Feature activation-1.90e+00

Pos logit contributions
alties+0.483
ingu+0.462
��+0.456
 oun+0.443
cond+0.430
Neg logit contributions
 Ceres-0.580
 vig-0.505
§-0.501
 Stall-0.492
 Equality-0.488
Token lab
Token ID2248
Feature activation-1.34e+00

Pos logit contributions
alties+3.007
ingu+2.897
��+2.855
 oun+2.789
cond+2.739
Neg logit contributions
 Ceres-2.892
 vig-2.467
§-2.464
 Stall-2.403
 Equality-2.379
Token space
Token ID2272
Feature activation+5.15e-01

Pos logit contributions
alties+2.218
ingu+2.160
��+2.113
 oun+2.067
cond+2.026
Neg logit contributions
 Ceres-1.991
 vig-1.687
§-1.657
 Stall-1.627
 Equality-1.622
Token is
Token ID318
Feature activation-7.01e-04

Pos logit contributions
 Ceres+0.735
 vig+0.620
§+0.617
 Stall+0.600
 Equality+0.593
Neg logit contributions
alties-0.885
ingu-0.857
��-0.856
 oun-0.833
iqueness-0.802
Token under
Token ID739
Feature activation-1.93e+00

Pos logit contributions
alties+0.001
��+0.001
ingu+0.001
 oun+0.001
iqueness+0.001
Neg logit contributions
 Ceres-0.001
 vig-0.001
§-0.001
 Stall-0.001
 Equality-0.001
Token construction
Token ID5103
Feature activation+3.34e-01

Pos logit contributions
alties+2.574
ingu+2.518
��+2.446
 oun+2.409
cond+2.356
Neg logit contributions
 Ceres-3.408
§-2.953
 vig-2.941
 Equality-2.884
 Stall-2.856
Token,
Token ID11
Feature activation-3.47e-02

Pos logit contributions
 Ceres+0.536
 vig+0.462
§+0.459
 Stall+0.448
 Equality+0.444
Neg logit contributions
alties-0.515
��-0.498
ingu-0.498
 oun-0.482
iqueness-0.462
Token and
Token ID290
Feature activation+1.84e-01

Pos logit contributions
alties+0.057
ingu+0.055
��+0.055
 oun+0.054
cond+0.052
Neg logit contributions
 Ceres-0.052
 vig-0.044
§-0.044
 Stall-0.043
 Equality-0.042
Token one
Token ID530
Feature activation-6.34e-01

Pos logit contributions
 Ceres+0.275
 vig+0.234
§+0.231
 Stall+0.227
 Equality+0.224
Neg logit contributions
alties-0.305
��-0.296
ingu-0.296
 oun-0.287
iqueness-0.277
Token glass
Token ID5405
Feature activation-6.40e-01

Pos logit contributions
alties+1.031
ingu+0.998
��+0.990
 oun+0.965
cond+0.933
Neg logit contributions
 Ceres-0.956
 vig-0.815
§-0.812
 Stall-0.790
 Equality-0.784
Token research
Token ID2267
Feature activation-9.42e-01

Pos logit contributions
alties+0.324
��+0.312
ingu+0.311
 oun+0.303
iqueness+0.291
Neg logit contributions
 Ceres-0.295
 vig-0.251
§-0.249
 Stall-0.244
 Equality-0.242
Token ethics
Token ID14458
Feature activation+4.30e-01

Pos logit contributions
alties+1.687
ingu+1.639
��+1.627
 oun+1.590
cond+1.539
Neg logit contributions
 Ceres-1.274
 vig-1.060
§-1.055
 Stall-1.022
 Equality-1.013
Token.
Token ID13
Feature activation-2.70e-01

Pos logit contributions
 Ceres+0.712
 vig+0.617
§+0.616
 Stall+0.599
 Equality+0.594
Neg logit contributions
alties-0.639
��-0.618
ingu-0.616
 oun-0.597
iqueness-0.571
Token
Token ID198
Feature activation-1.62e-01

Pos logit contributions
alties+0.432
ingu+0.418
��+0.417
 oun+0.405
iqueness+0.388
Neg logit contributions
 Ceres-0.417
 vig-0.356
§-0.356
 Stall-0.347
 Equality-0.344
Token
Token ID198
Feature activation-5.46e-02

Pos logit contributions
alties+0.252
��+0.247
ingu+0.244
 oun+0.243
iqueness+0.226
Neg logit contributions
 Ceres-0.257
§-0.222
 vig-0.221
 Stall-0.215
 Equality-0.214
Token2000
Token ID11024
Feature activation-1.80e+00

Pos logit contributions
alties+0.078
��+0.075
ingu+0.075
 oun+0.073
iqueness+0.069
Neg logit contributions
 Ceres-0.093
§-0.082
 vig-0.082
 Stall-0.079
 Equality-0.079
Token The
Token ID383
Feature activation-5.02e-01

Pos logit contributions
alties+2.986
ingu+2.887
��+2.844
 oun+2.784
cond+2.711
Neg logit contributions
 Ceres-2.621
 vig-2.241
§-2.221
 Stall-2.135
 Equality-2.121
Token Office
Token ID4452
Feature activation-7.54e-01

Pos logit contributions
alties+0.841
ingu+0.814
��+0.811
 oun+0.790
cond+0.760
Neg logit contributions
 Ceres-0.738
 vig-0.627
§-0.623
 Stall-0.606
 Equality-0.601
Token of
Token ID286
Feature activation-9.06e-02

Pos logit contributions
alties+1.137
ingu+1.081
��+1.077
 oun+1.062
cond+1.016
Neg logit contributions
 Ceres-1.228
 vig-1.062
§-1.047
 Stall-1.019
 Equality-1.011
Token Science
Token ID5800
Feature activation-5.24e-01

Pos logit contributions
alties+0.154
ingu+0.151
��+0.148
 oun+0.145
cond+0.141
Neg logit contributions
 Ceres-0.129
 vig-0.109
§-0.108
 Stall-0.105
 Equality-0.103
Token and
Token ID290
Feature activation-1.69e-01

Pos logit contributions
alties+0.900
ingu+0.878
��+0.869
 oun+0.852
cond+0.822
Neg logit contributions
 Ceres-0.738
 vig-0.621
§-0.617
 Stall-0.601
 Equality-0.587