TOP ACTIVATIONS
MAX = 3.477

the polls , will cons ign their votes into computers that
is long ; he comm ends the Monday job - seekers
murder and first - degree robbery . The other
charged for not using inter lock device A Kirk
Last week , he comm ended Trump for conducting military strikes
eagerly listened to the exp iring gro ans of his victims
Stay tuned ! <|endoftext|> Mer kel told parliament that while she
Comm ander of the On mits uk
of violation of ignition inter lock driver 's license .
such as installing more bike racks and decreasing j ay walking
. Rel ieved , Par amount jumped at the chance to
for not using an inter lock device he was still under
hon ne ur d ' ê tre , Madame , l
Lee . Mc Greg or highlighted how important it
always be the price of inflation ." He correctly predicted the
s sake , American Horror Story : Murder House opened
which time the firms coll uded to set each others '
. Follow @ greg ory k ort e on
their graphics cards . Comp eting with flagship design of Lightning
writing that a book compet es with Snapchat and Instagram and

MIN ACTIVATIONS
MIN = -3.284

<|endoftext|> Image copyright publicity Image caption The Greeks were totally "
the fashionable Downtown . He fty tax breaks have been granted
000 . Image caption Council staff found paperwork with
ACE ). Image caption Council staff s ifted through
. <|endoftext|> Image copyright publicity Image caption The Greeks
<|endoftext|> Image copyright AP Image caption Peace talks between the government
it mean to an intro vert online ? <|endoftext|> Update :
<|endoftext|> Image copyright ESA Image caption Go ce uses an electric
million years . <|endoftext|> Image copyright AFP The Bank
media reported . <|endoftext|> Image caption Some talked of " b
identified as Joshua E ern isse and Beau Sch oen hard
Image copyright AFP Image caption The Kurds ' push has
Russia . Image copyright AFP Image caption The Kurds
Off On The Train - Meat head - The War -
. <|endoftext|> Image copyright Deadline Police have
frequently comes off as sn obb ish and insult ingly dismissive
Are More Similar Than You Think The biggest difference
: Let s bring in good quality touring talent
appeared in the Melbourne Mag istrates Court this afternoon , where
Sciences in Memory of Alfred Nobel 2017 was awarded to Richard

INTERVAL 1.787 - 3.477
CONTAINS 0.247%

think today the team has found a lot of the weaknesses
are vying desperately for the millennial vote , and with good
teamed with the producer of Alice , long form veteran Robert
Reviewer : Mike M anners - favorite favorite favorite favorite favorite
masters . New research has gone further , exposing how American

INTERVAL 0.097 - 1.787
CONTAINS 39.936%

Rica demonstrating the value of playing this way . Their success
end up with is a strange , national istic union ism
- not ch studio specializing in ecc les ial art had
for players from the two title teams held about a decade
her friends were more prone to use prof anity and party

INTERVAL -1.594 - 0.097
CONTAINS 59.328%

and the Giant Peach , could not be sold to British
You have a technical fault in your action , go out
insky Ag rop ost ach , is a recipient of $
experiences ; the first time I got t umbled by a
, doesn 't mean that you should tell us what to

INTERVAL -3.284 - -1.594
CONTAINS 0.488%

Pasadena , California . Hide Caption 12 of 15 Photos :
by showing him how to bring the kn otted thread up
at Anfield in October . Photograph : Clive Brun skill /
gave me an excuse to look up some characters I 'm
va Ki ir in the capital , J uba .
Token the
Token ID262
Feature activation-7.79e-03

Pos logit contributions
burg+0.029
selves+0.029
lishing+0.027
aye+0.026
 Publication+0.026
Neg logit contributions
quartered-0.026
 DOT-0.025
Senator-0.023
 divides-0.023
idth-0.022
Token polls
Token ID9231
Feature activation-1.85e-01

Pos logit contributions
quartered+0.003
 DOT+0.003
Senator+0.003
idth+0.002
 divides+0.002
Neg logit contributions
burg-0.009
selves-0.008
lishing-0.008
aye-0.008
 Publication-0.008
Token,
Token ID11
Feature activation-1.09e-01

Pos logit contributions
quartered+0.135
 DOT+0.129
Senator+0.120
 divides+0.117
idth+0.115
Neg logit contributions
burg-0.144
selves-0.144
lishing-0.131
aye-0.128
 Publication-0.125
Token will
Token ID481
Feature activation+2.17e-01

Pos logit contributions
quartered+0.078
 DOT+0.074
Senator+0.069
 divides+0.067
idth+0.066
Neg logit contributions
burg-0.086
selves-0.086
lishing-0.079
aye-0.077
 Publication-0.075
Token cons
Token ID762
Feature activation+1.10e-01

Pos logit contributions
burg+0.184
selves+0.183
lishing+0.169
aye+0.164
 Publication+0.160
Neg logit contributions
quartered-0.146
 DOT-0.136
Senator-0.127
 divides-0.123
idth-0.120
Tokenign
Token ID570
Feature activation+3.48e+00

Pos logit contributions
burg+0.065
selves+0.063
lishing+0.057
aye+0.055
 Publication+0.053
Neg logit contributions
quartered-0.102
 DOT-0.098
Senator-0.092
 divides-0.092
idth-0.089
Token their
Token ID511
Feature activation-2.40e-01

Pos logit contributions
selves+2.755
burg+2.694
lishing+2.463
aye+2.374
 Publication+2.354
Neg logit contributions
quartered-2.512
 DOT-2.314
Senator-2.168
idth-2.117
 divides-2.075
Token votes
Token ID5690
Feature activation+2.97e-01

Pos logit contributions
quartered+0.136
 DOT+0.124
Senator+0.113
idth+0.110
 divides+0.108
Neg logit contributions
selves-0.229
burg-0.229
lishing-0.212
aye-0.207
 Publication-0.204
Token into
Token ID656
Feature activation+3.01e-01

Pos logit contributions
burg+0.256
selves+0.254
lishing+0.237
aye+0.231
 Publication+0.225
Neg logit contributions
quartered-0.193
 DOT-0.183
Senator-0.169
 divides-0.164
idth-0.159
Token computers
Token ID9061
Feature activation+5.67e-01

Pos logit contributions
burg+0.247
selves+0.245
lishing+0.228
aye+0.220
 Publication+0.217
Neg logit contributions
quartered-0.207
 DOT-0.196
Senator-0.183
 divides-0.178
idth-0.176
Token that
Token ID326
Feature activation+1.36e-01

Pos logit contributions
burg+0.446
selves+0.443
lishing+0.411
aye+0.397
 Publication+0.387
Neg logit contributions
quartered-0.410
 DOT-0.390
Senator-0.364
 divides-0.355
idth-0.348
Token is
Token ID318
Feature activation+1.16e-01

Pos logit contributions
quartered+0.269
 DOT+0.257
Senator+0.235
 divides+0.229
idth+0.221
Neg logit contributions
burg-0.362
selves-0.359
lishing-0.336
aye-0.326
 Publication-0.319
Token long
Token ID890
Feature activation+5.60e-01

Pos logit contributions
burg+0.091
selves+0.090
lishing+0.083
aye+0.081
 Publication+0.079
Neg logit contributions
quartered-0.084
 DOT-0.081
Senator-0.075
 divides-0.073
idth-0.071
Token;
Token ID26
Feature activation-1.05e-02

Pos logit contributions
burg+0.415
selves+0.411
lishing+0.382
aye+0.368
 Publication+0.357
Neg logit contributions
quartered-0.429
 DOT-0.413
Senator-0.383
 divides-0.375
idth-0.366
Token he
Token ID339
Feature activation+1.70e-01

Pos logit contributions
quartered+0.008
 DOT+0.007
Senator+0.007
 divides+0.007
idth+0.006
Neg logit contributions
burg-0.008
selves-0.008
lishing-0.008
aye-0.007
 Publication-0.007
Token comm
Token ID725
Feature activation+1.01e+00

Pos logit contributions
burg+0.148
selves+0.147
lishing+0.137
aye+0.133
 Publication+0.130
Neg logit contributions
quartered-0.109
 DOT-0.103
Senator-0.095
 divides-0.091
idth-0.089
Tokenends
Token ID2412
Feature activation+3.39e+00

Pos logit contributions
burg+0.746
selves+0.744
lishing+0.687
aye+0.662
 Publication+0.635
Neg logit contributions
quartered-0.778
 DOT-0.741
Senator-0.693
 divides-0.674
 elector-0.660
Token the
Token ID262
Feature activation+2.14e-02

Pos logit contributions
burg+2.624
selves+2.612
lishing+2.391
aye+2.322
 Publication+2.289
Neg logit contributions
quartered-2.469
 DOT-2.321
Senator-2.185
 divides-2.093
idth-2.077
Token Monday
Token ID3321
Feature activation+7.26e-01

Pos logit contributions
burg+0.015
selves+0.015
lishing+0.014
aye+0.014
 Publication+0.013
Neg logit contributions
quartered-0.017
 DOT-0.016
Senator-0.015
 divides-0.015
idth-0.014
Token job
Token ID1693
Feature activation-2.07e-01

Pos logit contributions
burg+0.527
selves+0.519
lishing+0.479
aye+0.464
 Publication+0.454
Neg logit contributions
quartered-0.568
 DOT-0.545
Senator-0.508
 divides-0.499
idth-0.485
Token-
Token ID12
Feature activation+6.43e-02

Pos logit contributions
quartered+0.150
 DOT+0.144
Senator+0.133
 divides+0.131
idth+0.126
Neg logit contributions
burg-0.162
selves-0.161
lishing-0.149
aye-0.144
 Publication-0.141
Tokenseekers
Token ID47971
Feature activation-1.13e+00

Pos logit contributions
burg+0.046
selves+0.046
lishing+0.042
aye+0.041
 Publication+0.040
Neg logit contributions
quartered-0.051
 DOT-0.049
Senator-0.046
 divides-0.045
idth-0.043
Token murder
Token ID5123
Feature activation+7.94e-01

Pos logit contributions
quartered+1.251
 DOT+1.205
Senator+1.104
 divides+1.088
 elector+1.048
Neg logit contributions
selves-1.462
burg-1.459
lishing-1.365
aye-1.319
 Publication-1.282
Token and
Token ID290
Feature activation-1.51e-01

Pos logit contributions
burg+0.596
selves+0.586
lishing+0.544
aye+0.525
 Publication+0.512
Neg logit contributions
quartered-0.603
 DOT-0.576
Senator-0.540
 divides-0.528
idth-0.515
Token first
Token ID717
Feature activation+2.46e-01

Pos logit contributions
quartered+0.113
 DOT+0.106
Senator+0.100
 divides+0.097
idth+0.095
Neg logit contributions
burg-0.117
selves-0.115
lishing-0.106
aye-0.103
 Publication-0.101
Token-
Token ID12
Feature activation-9.80e-02

Pos logit contributions
burg+0.185
selves+0.183
lishing+0.170
aye+0.164
 Publication+0.160
Neg logit contributions
quartered-0.185
 DOT-0.178
Senator-0.165
 divides-0.162
idth-0.157
Tokendegree
Token ID16863
Feature activation-1.58e+00

Pos logit contributions
quartered+0.065
 DOT+0.062
Senator+0.057
 divides+0.056
idth+0.054
Neg logit contributions
burg-0.083
selves-0.083
lishing-0.077
aye-0.075
 Publication-0.073
Token robbery
Token ID18609
Feature activation+3.09e+00

Pos logit contributions
quartered+1.251
 DOT+1.212
Senator+1.125
 divides+1.110
 elector+1.075
Neg logit contributions
burg-1.111
selves-1.110
lishing-1.025
aye-0.985
 Publication-0.956
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
burg+2.209
selves+2.149
lishing+1.991
aye+1.895
 Publication+1.857
Neg logit contributions
quartered-2.459
 DOT-2.317
Senator-2.225
idth-2.138
 divides-2.134
Token
Token ID198
Feature activation-4.48e-01

Pos logit contributions
quartered+0.155
 DOT+0.148
Senator+0.136
 divides+0.134
idth+0.129
Neg logit contributions
burg-0.187
selves-0.186
lishing-0.173
aye-0.167
 Publication-0.165
Token
Token ID198
Feature activation-3.88e-01

Pos logit contributions
quartered+0.339
 DOT+0.309
Senator+0.286
 divides+0.285
idth+0.282
Neg logit contributions
burg-0.355
selves-0.348
lishing-0.322
aye-0.308
 Publication-0.304
TokenThe
Token ID464
Feature activation+7.71e-03

Pos logit contributions
quartered+0.303
 DOT+0.292
Senator+0.272
 divides+0.267
idth+0.259
Neg logit contributions
burg-0.280
selves-0.278
lishing-0.257
aye-0.247
 Publication-0.241
Token other
Token ID584
Feature activation+2.57e-01

Pos logit contributions
burg+0.006
selves+0.006
lishing+0.006
aye+0.005
 Publication+0.005
Neg logit contributions
quartered-0.006
 DOT-0.005
Senator-0.005
 divides-0.005
idth-0.005
Token charged
Token ID5047
Feature activation-8.06e-01

Pos logit contributions
quartered+0.060
 DOT+0.057
Senator+0.053
 divides+0.051
idth+0.050
Neg logit contributions
burg-0.069
selves-0.069
lishing-0.063
aye-0.061
 Publication-0.060
Token for
Token ID329
Feature activation+3.92e-01

Pos logit contributions
quartered+0.594
 DOT+0.565
Senator+0.528
 divides+0.515
idth+0.503
Neg logit contributions
burg-0.625
selves-0.617
lishing-0.570
aye-0.554
 Publication-0.539
Token not
Token ID407
Feature activation+5.16e-01

Pos logit contributions
burg+0.282
selves+0.279
lishing+0.256
aye+0.247
 Publication+0.241
Neg logit contributions
quartered-0.311
 DOT-0.296
Senator-0.278
 divides-0.272
idth-0.265
Token using
Token ID1262
Feature activation+5.96e-01

Pos logit contributions
burg+0.396
selves+0.392
lishing+0.362
aye+0.350
 Publication+0.340
Neg logit contributions
quartered-0.387
 DOT-0.367
Senator-0.343
 divides-0.335
idth-0.327
Token inter
Token ID987
Feature activation+3.51e-01

Pos logit contributions
burg+0.462
selves+0.457
lishing+0.420
aye+0.409
 Publication+0.400
Neg logit contributions
quartered-0.443
 DOT-0.414
Senator-0.391
 divides-0.379
idth-0.372
Tokenlock
Token ID5354
Feature activation+3.06e+00

Pos logit contributions
burg+0.257
selves+0.255
lishing+0.234
aye+0.225
 Publication+0.218
Neg logit contributions
quartered-0.275
 DOT-0.261
Senator-0.245
 divides-0.240
idth-0.234
Token device
Token ID3335
Feature activation-4.47e-01

Pos logit contributions
burg+2.314
selves+2.253
lishing+2.081
aye+2.010
 Wit+1.948
Neg logit contributions
quartered-2.332
 DOT-2.174
Senator-2.090
 divides-1.984
idth-1.982
Token
Token ID198
Feature activation+3.48e-01

Pos logit contributions
quartered+0.310
 DOT+0.294
Senator+0.272
 divides+0.264
idth+0.258
Neg logit contributions
burg-0.367
selves-0.363
lishing-0.337
aye-0.328
 Publication-0.320
Token
Token ID198
Feature activation+3.29e-01

Pos logit contributions
burg+0.274
selves+0.268
lishing+0.247
aye+0.236
 Publication+0.235
Neg logit contributions
quartered-0.268
 DOT-0.240
 divides-0.223
idth-0.223
Senator-0.222
TokenA
Token ID32
Feature activation+1.89e-01

Pos logit contributions
burg+0.232
selves+0.227
lishing+0.211
aye+0.202
 Publication+0.197
Neg logit contributions
quartered-0.267
 DOT-0.255
Senator-0.236
 divides-0.236
idth-0.229
Token Kirk
Token ID13855
Feature activation+3.01e-01

Pos logit contributions
burg+0.143
selves+0.142
lishing+0.132
aye+0.127
 Publication+0.124
Neg logit contributions
quartered-0.141
 DOT-0.136
Senator-0.126
 divides-0.124
idth-0.120
Token Last
Token ID4586
Feature activation+5.61e-01

Pos logit contributions
quartered+0.002
 DOT+0.002
Senator+0.002
 divides+0.002
idth+0.002
Neg logit contributions
burg-0.002
selves-0.002
lishing-0.002
 Publication-0.002
aye-0.002
Token week
Token ID1285
Feature activation-6.56e-01

Pos logit contributions
burg+0.456
selves+0.450
lishing+0.423
aye+0.408
 Publication+0.401
Neg logit contributions
quartered-0.392
 DOT-0.371
Senator-0.342
 divides-0.334
idth-0.327
Token,
Token ID11
Feature activation-7.94e-02

Pos logit contributions
quartered+0.494
 DOT+0.470
Senator+0.435
 divides+0.429
idth+0.420
Neg logit contributions
burg-0.497
selves-0.492
lishing-0.459
aye-0.441
 Publication-0.432
Token he
Token ID339
Feature activation+1.39e-01

Pos logit contributions
quartered+0.061
 DOT+0.058
Senator+0.054
 divides+0.053
idth+0.052
Neg logit contributions
burg-0.059
selves-0.058
lishing-0.054
aye-0.052
 Publication-0.051
Token comm
Token ID725
Feature activation+7.60e-01

Pos logit contributions
burg+0.125
selves+0.124
lishing+0.117
aye+0.113
 Publication+0.111
Neg logit contributions
quartered-0.085
 DOT-0.080
Senator-0.073
 divides-0.070
idth-0.069
Tokenended
Token ID1631
Feature activation+2.98e+00

Pos logit contributions
burg+0.563
selves+0.555
lishing+0.519
aye+0.500
 Publication+0.479
Neg logit contributions
quartered-0.586
 DOT-0.564
Senator-0.525
 divides-0.514
idth-0.498
Token Trump
Token ID1301
Feature activation+4.25e-01

Pos logit contributions
burg+2.530
selves+2.527
lishing+2.374
aye+2.280
 Publication+2.258
Neg logit contributions
quartered-1.946
 DOT-1.782
Senator-1.629
 divides-1.617
idth-1.591
Token for
Token ID329
Feature activation+4.32e-01

Pos logit contributions
burg+0.336
selves+0.331
lishing+0.308
aye+0.298
 Publication+0.291
Neg logit contributions
quartered-0.307
 DOT-0.294
Senator-0.271
 divides-0.267
idth-0.259
Token conducting
Token ID14523
Feature activation-1.17e-01

Pos logit contributions
burg+0.341
selves+0.337
lishing+0.315
aye+0.303
 Publication+0.298
Neg logit contributions
quartered-0.313
 DOT-0.294
Senator-0.273
 divides-0.267
idth-0.264
Token military
Token ID2422
Feature activation+4.08e-01

Pos logit contributions
quartered+0.079
 DOT+0.073
Senator+0.068
 divides+0.066
idth+0.065
Neg logit contributions
burg-0.098
selves-0.097
lishing-0.091
aye-0.088
 Publication-0.087
Token strikes
Token ID8956
Feature activation+2.26e-01

Pos logit contributions
burg+0.326
selves+0.318
lishing+0.299
aye+0.288
 Publication+0.283
Neg logit contributions
quartered-0.293
 DOT-0.280
Senator-0.261
 divides-0.252
idth-0.246
Token eagerly
Token ID30130
Feature activation+8.13e-01

Pos logit contributions
quartered+0.232
 DOT+0.222
Senator+0.204
 divides+0.201
idth+0.193
Neg logit contributions
burg-0.276
selves-0.274
lishing-0.256
aye-0.247
 Publication-0.242
Token listened
Token ID16399
Feature activation-1.17e+00

Pos logit contributions
burg+0.658
selves+0.652
lishing+0.608
aye+0.588
 Publication+0.574
Neg logit contributions
quartered-0.569
 DOT-0.542
Senator-0.503
 divides-0.488
idth-0.476
Token to
Token ID284
Feature activation+2.42e-01

Pos logit contributions
quartered+0.735
 DOT+0.699
Senator+0.640
 divides+0.631
 elector+0.603
Neg logit contributions
burg-1.025
selves-1.016
lishing-0.957
aye-0.928
 Publication-0.916
Token the
Token ID262
Feature activation-1.43e-01

Pos logit contributions
burg+0.193
selves+0.191
lishing+0.178
aye+0.172
 Publication+0.168
Neg logit contributions
quartered-0.172
 DOT-0.163
Senator-0.152
 divides-0.148
idth-0.144
Token exp
Token ID1033
Feature activation+1.01e+00

Pos logit contributions
quartered+0.112
 DOT+0.108
Senator+0.100
 divides+0.099
idth+0.096
Neg logit contributions
burg-0.103
selves-0.102
lishing-0.094
aye-0.091
 Publication-0.088
Tokeniring
Token ID3428
Feature activation+2.93e+00

Pos logit contributions
burg+0.666
selves+0.660
lishing+0.608
aye+0.583
 Publication+0.562
Neg logit contributions
quartered-0.856
 DOT-0.824
Senator-0.772
 divides-0.761
idth-0.737
Token gro
Token ID7128
Feature activation-1.62e+00

Pos logit contributions
burg+2.177
selves+2.157
lishing+1.994
aye+1.911
 Publication+1.874
Neg logit contributions
quartered-2.243
 DOT-2.104
Senator-1.998
idth-1.923
 divides-1.910
Tokenans
Token ID504
Feature activation+9.65e-01

Pos logit contributions
quartered+0.925
 DOT+0.883
Senator+0.797
 divides+0.778
idth+0.745
Neg logit contributions
burg-1.503
selves-1.497
lishing-1.410
aye-1.368
 Publication-1.347
Token of
Token ID286
Feature activation+4.17e-02

Pos logit contributions
burg+0.796
selves+0.786
lishing+0.735
aye+0.710
 Publication+0.693
Neg logit contributions
quartered-0.662
 DOT-0.629
Senator-0.581
 divides-0.568
idth-0.550
Token his
Token ID465
Feature activation-1.71e-01

Pos logit contributions
burg+0.034
selves+0.033
lishing+0.031
aye+0.030
 Publication+0.029
Neg logit contributions
quartered-0.029
 DOT-0.028
Senator-0.026
 divides-0.025
idth-0.024
Token victims
Token ID4970
Feature activation-3.41e-01

Pos logit contributions
quartered+0.133
 DOT+0.128
Senator+0.119
 divides+0.118
 elector+0.113
Neg logit contributions
burg-0.123
selves-0.122
lishing-0.113
aye-0.109
 Publication-0.107
Token Stay
Token ID16160
Feature activation+7.88e-01

Pos logit contributions
burg+0.245
selves+0.244
lishing+0.227
aye+0.218
 Publication+0.216
Neg logit contributions
quartered-0.216
 DOT-0.203
Senator-0.188
 divides-0.185
idth-0.180
Token tuned
Token ID16524
Feature activation-1.30e+00

Pos logit contributions
burg+0.564
selves+0.556
lishing+0.517
aye+0.496
 Publication+0.481
Neg logit contributions
quartered-0.624
 DOT-0.598
Senator-0.562
 divides-0.550
idth-0.539
Token!
Token ID0
Feature activation+2.02e-02

Pos logit contributions
quartered+1.033
 DOT+0.999
Senator+0.929
 divides+0.915
idth+0.884
Neg logit contributions
burg-0.918
selves-0.911
lishing-0.841
aye-0.809
 Publication-0.788
Token<|endoftext|>
Token ID50256
Feature activation+1.68e-01

Pos logit contributions
burg+0.016
selves+0.016
lishing+0.015
aye+0.015
 Publication+0.014
Neg logit contributions
quartered-0.014
 DOT-0.013
Senator-0.012
 divides-0.012
idth-0.012
TokenMer
Token ID13102
Feature activation+8.49e-01

Pos logit contributions
burg+0.123
selves+0.121
lishing+0.112
aye+0.108
 Publication+0.105
Neg logit contributions
quartered-0.132
 DOT-0.127
 divides-0.117
Senator-0.116
idth-0.113
Tokenkel
Token ID7750
Feature activation+2.92e+00

Pos logit contributions
burg+0.643
selves+0.633
lishing+0.593
aye+0.573
 Publication+0.554
Neg logit contributions
quartered-0.639
 DOT-0.612
Senator-0.568
 divides-0.559
idth-0.538
Token told
Token ID1297
Feature activation+2.69e-02

Pos logit contributions
burg+2.428
selves+2.360
lishing+2.199
aye+2.149
 Publication+2.097
Neg logit contributions
quartered-2.004
 DOT-1.876
Senator-1.743
 divides-1.655
idth-1.639
Token parliament
Token ID8540
Feature activation-9.49e-01

Pos logit contributions
burg+0.022
selves+0.022
lishing+0.020
aye+0.020
 Publication+0.020
Neg logit contributions
quartered-0.019
 DOT-0.017
Senator-0.016
 divides-0.016
idth-0.015
Token that
Token ID326
Feature activation+8.61e-01

Pos logit contributions
quartered+0.707
 DOT+0.677
Senator+0.628
 divides+0.617
idth+0.597
Neg logit contributions
burg-0.725
selves-0.719
lishing-0.667
aye-0.644
 Publication-0.629
Token while
Token ID981
Feature activation+9.44e-01

Pos logit contributions
burg+0.639
selves+0.627
lishing+0.580
aye+0.561
 Publication+0.548
Neg logit contributions
quartered-0.666
 DOT-0.629
Senator-0.591
 divides-0.573
idth-0.564
Token she
Token ID673
Feature activation+5.46e-01

Pos logit contributions
burg+0.708
selves+0.695
lishing+0.642
aye+0.620
 Publication+0.607
Neg logit contributions
quartered-0.724
 DOT-0.682
Senator-0.641
 divides-0.619
idth-0.611
Token
Token ID447
Feature activation+3.67e-01

Pos logit contributions
burg+0.071
selves+0.071
lishing+0.065
aye+0.063
 Publication+0.062
Neg logit contributions
quartered-0.064
 DOT-0.060
Senator-0.056
 divides-0.055
idth-0.053
Token
Token ID251
Feature activation+4.85e-01

Pos logit contributions
burg+0.183
selves+0.181
lishing+0.161
aye+0.152
 Publication+0.146
Neg logit contributions
quartered-0.370
 DOT-0.359
Senator-0.340
 divides-0.336
idth-0.327
Token
Token ID564
Feature activation+1.80e-02

Pos logit contributions
burg+0.390
selves+0.388
lishing+0.358
aye+0.346
 Publication+0.343
Neg logit contributions
quartered-0.344
 DOT-0.323
Senator-0.298
 divides-0.295
idth-0.286
Token
Token ID250
Feature activation+4.22e-01

Pos logit contributions
burg+0.008
selves+0.008
lishing+0.007
aye+0.007
 Publication+0.006
Neg logit contributions
quartered-0.019
 DOT-0.018
Senator-0.017
 divides-0.017
idth-0.017
TokenComm
Token ID6935
Feature activation+7.82e-01

Pos logit contributions
burg+0.310
selves+0.307
lishing+0.283
aye+0.274
 Publication+0.266
Neg logit contributions
quartered-0.329
 DOT-0.315
 divides-0.289
Senator-0.289
idth-0.279
Tokenander
Token ID4066
Feature activation+2.92e+00

Pos logit contributions
burg+0.887
selves+0.882
lishing+0.837
aye+0.824
 Publication+0.805
Neg logit contributions
quartered-0.292
 DOT-0.270
Senator-0.227
 divides-0.220
idth-0.203
Token of
Token ID286
Feature activation+6.86e-01

Pos logit contributions
burg+2.199
selves+2.150
lishing+1.956
aye+1.939
 Wit+1.895
Neg logit contributions
quartered-2.193
 DOT-2.085
Senator-1.952
 divides-1.898
idth-1.852
Token the
Token ID262
Feature activation+5.87e-01

Pos logit contributions
burg+0.552
selves+0.550
lishing+0.510
aye+0.491
 Publication+0.488
Neg logit contributions
quartered-0.480
 DOT-0.455
Senator-0.426
 divides-0.413
idth-0.401
Token On
Token ID1550
Feature activation+3.98e-01

Pos logit contributions
burg+0.496
selves+0.494
lishing+0.458
aye+0.444
 Publication+0.439
Neg logit contributions
quartered-0.388
 DOT-0.367
Senator-0.341
 divides-0.330
idth-0.322
Tokenmits
Token ID24883
Feature activation-3.34e-01

Pos logit contributions
burg+0.274
selves+0.271
lishing+0.250
aye+0.242
 Publication+0.234
Neg logit contributions
quartered-0.325
 DOT-0.314
Senator-0.292
 divides-0.289
 elector-0.280
Tokenuk
Token ID2724
Feature activation+3.41e-01

Pos logit contributions
quartered+0.253
 DOT+0.243
Senator+0.226
 divides+0.222
idth+0.215
Neg logit contributions
burg-0.252
selves-0.248
lishing-0.228
aye-0.222
 Publication-0.217
Token of
Token ID286
Feature activation+2.23e-01

Pos logit contributions
quartered+0.234
 DOT+0.224
Senator+0.204
 divides+0.200
 elector+0.191
Neg logit contributions
burg-0.319
selves-0.317
lishing-0.298
aye-0.289
 Publication-0.282
Token violation
Token ID8747
Feature activation+7.03e-01

Pos logit contributions
burg+0.180
selves+0.177
lishing+0.164
aye+0.159
 Publication+0.157
Neg logit contributions
quartered-0.159
 DOT-0.147
Senator-0.140
 divides-0.136
idth-0.133
Token of
Token ID286
Feature activation+3.35e-01

Pos logit contributions
burg+0.587
selves+0.580
lishing+0.543
aye+0.522
 Publication+0.516
Neg logit contributions
quartered-0.474
 DOT-0.446
Senator-0.417
 divides-0.405
idth-0.393
Token ignition
Token ID37594
Feature activation+1.20e+00

Pos logit contributions
burg+0.267
selves+0.264
lishing+0.245
aye+0.235
 Publication+0.234
Neg logit contributions
quartered-0.241
 DOT-0.224
Senator-0.212
 divides-0.207
idth-0.202
Token inter
Token ID987
Feature activation+1.75e-01

Pos logit contributions
burg+0.915
selves+0.904
lishing+0.843
aye+0.806
 Publication+0.799
Neg logit contributions
quartered-0.906
 DOT-0.816
Senator-0.795
 divides-0.761
idth-0.758
Tokenlock
Token ID5354
Feature activation+2.91e+00

Pos logit contributions
burg+0.114
selves+0.112
lishing+0.103
aye+0.098
seq+0.094
Neg logit contributions
quartered-0.153
 DOT-0.145
Senator-0.137
 divides-0.135
idth-0.132
Token driver
Token ID4639
Feature activation-1.02e+00

Pos logit contributions
burg+2.065
selves+2.025
lishing+1.903
aye+1.785
 Publication+1.773
Neg logit contributions
quartered-2.312
 DOT-2.140
Senator-2.068
 divides-1.986
idth-1.977
Token's
Token ID338
Feature activation-1.98e-01

Pos logit contributions
quartered+0.702
 DOT+0.670
Senator+0.617
 divides+0.605
idth+0.584
Neg logit contributions
burg-0.829
selves-0.823
lishing-0.768
aye-0.742
 Publication-0.726
Token license
Token ID5964
Feature activation+4.56e-01

Pos logit contributions
quartered+0.160
 DOT+0.152
Senator+0.143
 divides+0.140
idth+0.136
Neg logit contributions
burg-0.139
selves-0.136
lishing-0.126
aye-0.121
 Publication-0.118
Token.
Token ID13
Feature activation+1.56e-01

Pos logit contributions
burg+0.340
selves+0.335
lishing+0.312
aye+0.301
 Publication+0.295
Neg logit contributions
quartered-0.347
 DOT-0.331
Senator-0.310
 divides-0.303
idth-0.296
Token
Token ID198
Feature activation-1.14e-01

Pos logit contributions
burg+0.130
selves+0.128
lishing+0.119
aye+0.115
 Publication+0.114
Neg logit contributions
quartered-0.107
 DOT-0.101
Senator-0.093
 divides-0.092
idth-0.089
Token such
Token ID884
Feature activation-3.98e-03

Pos logit contributions
burg+0.009
selves+0.009
lishing+0.008
aye+0.008
 Publication+0.008
Neg logit contributions
quartered-0.006
 DOT-0.006
Senator-0.006
 divides-0.005
idth-0.005
Token as
Token ID355
Feature activation+7.97e-01

Pos logit contributions
quartered+0.003
 DOT+0.003
Senator+0.003
 divides+0.003
idth+0.003
Neg logit contributions
burg-0.003
selves-0.003
lishing-0.003
aye-0.003
 Publication-0.003
Token installing
Token ID15975
Feature activation-9.32e-01

Pos logit contributions
burg+0.640
selves+0.635
lishing+0.590
aye+0.569
 Publication+0.560
Neg logit contributions
quartered-0.564
 DOT-0.529
Senator-0.497
 divides-0.484
idth-0.469
Token more
Token ID517
Feature activation+5.51e-01

Pos logit contributions
quartered+0.684
 DOT+0.658
Senator+0.608
 divides+0.598
idth+0.577
Neg logit contributions
burg-0.718
selves-0.713
lishing-0.663
aye-0.640
 Publication-0.625
Token bike
Token ID7161
Feature activation+1.34e+00

Pos logit contributions
burg+0.407
selves+0.403
lishing+0.371
aye+0.358
 Publication+0.350
Neg logit contributions
quartered-0.426
 DOT-0.402
Senator-0.381
 divides-0.370
idth-0.362
Token racks
Token ID39843
Feature activation+2.88e+00

Pos logit contributions
burg+0.997
selves+0.991
lishing+0.909
aye+0.879
 Publication+0.855
Neg logit contributions
quartered-1.034
 DOT-0.964
Senator-0.916
 divides-0.878
idth-0.871
Token and
Token ID290
Feature activation+1.05e-01

Pos logit contributions
burg+2.145
selves+2.132
lishing+1.943
aye+1.886
 Publication+1.826
Neg logit contributions
quartered-2.212
 DOT-2.071
Senator-1.966
 divides-1.901
idth-1.878
Token decreasing
Token ID24030
Feature activation+1.14e+00

Pos logit contributions
burg+0.089
selves+0.089
lishing+0.083
aye+0.080
 Publication+0.079
Neg logit contributions
quartered-0.069
 DOT-0.064
Senator-0.060
 divides-0.058
idth-0.057
Token j
Token ID474
Feature activation+3.10e-02

Pos logit contributions
burg+0.898
selves+0.889
lishing+0.823
aye+0.794
 Publication+0.783
Neg logit contributions
quartered-0.819
 DOT-0.769
Senator-0.726
 divides-0.697
idth-0.684
Tokenay
Token ID323
Feature activation-8.58e-01

Pos logit contributions
burg+0.021
selves+0.021
lishing+0.019
aye+0.018
 Publication+0.018
Neg logit contributions
quartered-0.026
 DOT-0.025
Senator-0.023
 divides-0.023
idth-0.022
Tokenwalking
Token ID44065
Feature activation+4.81e-02

Pos logit contributions
quartered+0.608
 DOT+0.579
Senator+0.537
 divides+0.524
idth+0.510
Neg logit contributions
burg-0.689
selves-0.679
lishing-0.632
aye-0.613
 Publication-0.598
Token.
Token ID13
Feature activation+1.38e-01

Pos logit contributions
burg+0.334
selves+0.332
lishing+0.308
aye+0.297
 Publication+0.290
Neg logit contributions
quartered-0.343
 DOT-0.330
Senator-0.306
 divides-0.301
idth-0.291
Token Rel
Token ID4718
Feature activation-1.58e-01

Pos logit contributions
burg+0.111
selves+0.110
lishing+0.103
aye+0.099
 Publication+0.097
Neg logit contributions
quartered-0.097
 DOT-0.092
Senator-0.085
 divides-0.084
idth-0.081
Tokenieved
Token ID39591
Feature activation-2.15e-02

Pos logit contributions
quartered+0.091
 DOT+0.087
Senator+0.079
 divides+0.077
 elector+0.074
Neg logit contributions
burg-0.146
selves-0.144
lishing-0.137
aye-0.133
 Publication-0.129
Token,
Token ID11
Feature activation-1.11e-01

Pos logit contributions
quartered+0.015
 DOT+0.015
Senator+0.014
 divides+0.014
idth+0.013
Neg logit contributions
burg-0.017
selves-0.017
lishing-0.016
aye-0.015
 Publication-0.015
TokenPar
Token ID10044
Feature activation+1.52e-01

Pos logit contributions
quartered+0.072
 DOT+0.069
Senator+0.063
 divides+0.062
idth+0.060
Neg logit contributions
burg-0.094
selves-0.094
lishing-0.087
aye-0.085
 Publication-0.083
Tokenamount
Token ID17287
Feature activation+2.88e+00

Pos logit contributions
burg+0.114
selves+0.111
lishing+0.103
aye+0.101
 Publication+0.096
Neg logit contributions
quartered-0.118
 DOT-0.112
Senator-0.105
 divides-0.103
 elector-0.100
Token jumped
Token ID11687
Feature activation+1.74e-02

Pos logit contributions
burg+2.527
selves+2.500
lishing+2.333
aye+2.280
 Publication+2.228
Neg logit contributions
quartered-1.814
 DOT-1.705
Senator-1.561
idth-1.513
 divides-1.510
Token at
Token ID379
Feature activation-4.43e-01

Pos logit contributions
burg+0.014
selves+0.014
lishing+0.013
aye+0.013
 Publication+0.012
Neg logit contributions
quartered-0.012
 DOT-0.012
Senator-0.011
 divides-0.010
idth-0.010
Token the
Token ID262
Feature activation-1.56e-01

Pos logit contributions
quartered+0.301
 DOT+0.289
Senator+0.265
 divides+0.260
idth+0.250
Neg logit contributions
burg-0.364
selves-0.364
lishing-0.340
aye-0.327
 Publication-0.321
Token chance
Token ID2863
Feature activation+5.80e-01

Pos logit contributions
quartered+0.122
 DOT+0.117
Senator+0.109
 divides+0.107
idth+0.104
Neg logit contributions
burg-0.113
selves-0.112
lishing-0.104
aye-0.100
 Publication-0.097
Token to
Token ID284
Feature activation-5.16e-01

Pos logit contributions
burg+0.449
selves+0.446
lishing+0.414
aye+0.400
 Publication+0.389
Neg logit contributions
quartered-0.424
 DOT-0.408
Senator-0.377
 divides-0.373
idth-0.357
Token for
Token ID329
Feature activation+3.97e-02

Pos logit contributions
quartered+0.877
 DOT+0.843
Senator+0.778
 divides+0.765
idth+0.737
Neg logit contributions
burg-0.937
selves-0.931
lishing-0.867
aye-0.836
 Publication-0.817
Token not
Token ID407
Feature activation+3.90e-01

Pos logit contributions
burg+0.031
selves+0.031
lishing+0.028
aye+0.028
 Publication+0.027
Neg logit contributions
quartered-0.029
 DOT-0.027
Senator-0.026
 divides-0.025
idth-0.024
Token using
Token ID1262
Feature activation+1.56e-01

Pos logit contributions
burg+0.315
selves+0.311
lishing+0.289
aye+0.278
 Publication+0.273
Neg logit contributions
quartered-0.279
 DOT-0.258
Senator-0.242
 divides-0.237
idth-0.233
Token an
Token ID281
Feature activation+2.95e-01

Pos logit contributions
burg+0.135
selves+0.133
lishing+0.125
aye+0.121
 Publication+0.119
Neg logit contributions
quartered-0.100
 DOT-0.094
Senator-0.087
 divides-0.084
idth-0.082
Token inter
Token ID987
Feature activation+4.23e-01

Pos logit contributions
burg+0.235
selves+0.230
lishing+0.215
aye+0.209
 Publication+0.204
Neg logit contributions
quartered-0.210
 DOT-0.196
Senator-0.187
 divides-0.181
idth-0.177
Tokenlock
Token ID5354
Feature activation+2.85e+00

Pos logit contributions
burg+0.273
selves+0.269
lishing+0.245
aye+0.234
seq+0.223
Neg logit contributions
quartered-0.370
 DOT-0.351
Senator-0.332
 divides-0.329
idth-0.321
Token device
Token ID3335
Feature activation-6.24e-01

Pos logit contributions
burg+1.836
selves+1.799
lishing+1.674
aye+1.581
 Publication+1.533
Neg logit contributions
quartered-2.446
 DOT-2.252
Senator-2.195
 divides-2.118
idth-2.090
Token he
Token ID339
Feature activation+2.49e-01

Pos logit contributions
quartered+0.481
 DOT+0.459
Senator+0.428
 divides+0.421
idth+0.408
Neg logit contributions
burg-0.462
selves-0.457
lishing-0.423
aye-0.408
 Publication-0.397
Token was
Token ID373
Feature activation+8.82e-02

Pos logit contributions
burg+0.222
selves+0.221
lishing+0.205
aye+0.199
 Publication+0.195
Neg logit contributions
quartered-0.156
 DOT-0.144
Senator-0.133
 divides-0.128
idth-0.126
Token still
Token ID991
Feature activation+2.16e-01

Pos logit contributions
burg+0.072
selves+0.072
lishing+0.065
aye+0.064
 Publication+0.062
Neg logit contributions
quartered-0.062
 DOT-0.057
Senator-0.054
 divides-0.053
idth-0.052
Token under
Token ID739
Feature activation+5.18e-01

Pos logit contributions
burg+0.172
selves+0.170
lishing+0.156
aye+0.150
 Publication+0.148
Neg logit contributions
quartered-0.157
 DOT-0.145
Senator-0.138
 divides-0.134
idth-0.134
Tokenhon
Token ID24130
Feature activation+8.89e-01

Pos logit contributions
burg+0.495
selves+0.488
lishing+0.452
aye+0.443
 Publication+0.424
Neg logit contributions
quartered-0.539
 DOT-0.520
Senator-0.482
 divides-0.479
idth-0.462
Tokenne
Token ID710
Feature activation+5.61e-01

Pos logit contributions
burg+0.598
selves+0.590
lishing+0.545
aye+0.529
 Publication+0.509
Neg logit contributions
quartered-0.741
 DOT-0.714
Senator-0.665
 divides-0.659
idth-0.637
Tokenur
Token ID333
Feature activation-6.60e-01

Pos logit contributions
burg+0.421
selves+0.414
lishing+0.382
aye+0.371
 Publication+0.361
Neg logit contributions
quartered-0.429
 DOT-0.410
Senator-0.381
 divides-0.373
idth-0.365
Token d
Token ID288
Feature activation-7.63e-01

Pos logit contributions
quartered+0.523
 DOT+0.503
Senator+0.468
 divides+0.461
idth+0.447
Neg logit contributions
burg-0.472
selves-0.468
lishing-0.432
aye-0.417
 Publication-0.406
Token'
Token ID6
Feature activation+4.81e-01

Pos logit contributions
quartered+0.468
 DOT+0.443
Senator+0.401
 divides+0.394
idth+0.375
Neg logit contributions
burg-0.687
selves-0.682
lishing-0.639
aye-0.624
 Publication-0.609
Tokenê
Token ID25792
Feature activation+2.85e+00

Pos logit contributions
burg+0.328
selves+0.322
lishing+0.296
aye+0.290
 Publication+0.277
Neg logit contributions
quartered-0.399
 DOT-0.386
Senator-0.358
 divides-0.356
idth-0.344
Tokentre
Token ID33945
Feature activation-2.70e-03

Pos logit contributions
burg+2.286
selves+2.266
lishing+2.106
aye+2.061
 Publication+1.973
Neg logit contributions
quartered-1.997
 DOT-1.931
Senator-1.752
 divides-1.749
idth-1.674
Token,
Token ID11
Feature activation+7.42e-01

Pos logit contributions
quartered+0.002
 DOT+0.002
Senator+0.002
 divides+0.002
idth+0.002
Neg logit contributions
burg-0.002
selves-0.002
lishing-0.002
aye-0.002
 Publication-0.002
Token Madame
Token ID43116
Feature activation+5.81e-02

Pos logit contributions
burg+0.511
selves+0.508
lishing+0.467
aye+0.451
 Publication+0.439
Neg logit contributions
quartered-0.606
 DOT-0.583
Senator-0.542
 divides-0.536
idth-0.522
Token,
Token ID11
Feature activation+6.15e-01

Pos logit contributions
burg+0.044
selves+0.043
lishing+0.040
aye+0.039
 Publication+0.038
Neg logit contributions
quartered-0.044
 DOT-0.042
Senator-0.039
 divides-0.038
idth-0.037
Token l
Token ID300
Feature activation-5.80e-01

Pos logit contributions
burg+0.458
selves+0.453
lishing+0.421
aye+0.410
 Publication+0.399
Neg logit contributions
quartered-0.469
 DOT-0.448
Senator-0.415
 divides-0.409
idth-0.398
Token Lee
Token ID5741
Feature activation+1.23e+00

Pos logit contributions
burg+0.429
selves+0.427
lishing+0.392
aye+0.379
 Publication+0.374
Neg logit contributions
quartered-0.431
 DOT-0.402
Senator-0.376
 divides-0.369
 elector-0.359
Token.
Token ID13
Feature activation-1.85e-01

Pos logit contributions
burg+0.909
selves+0.888
lishing+0.820
aye+0.795
 Publication+0.772
Neg logit contributions
quartered-0.950
 DOT-0.907
Senator-0.850
 divides-0.834
idth-0.816
Token
Token ID198
Feature activation-2.80e-01

Pos logit contributions
quartered+0.131
 DOT+0.123
Senator+0.114
 divides+0.112
idth+0.109
Neg logit contributions
burg-0.149
selves-0.148
lishing-0.138
aye-0.133
 Publication-0.131
Token
Token ID198
Feature activation-3.05e-01

Pos logit contributions
quartered+0.212
 DOT+0.194
 divides+0.179
Senator+0.178
idth+0.176
Neg logit contributions
burg-0.222
selves-0.218
lishing-0.201
aye-0.193
 Publication-0.190
TokenMc
Token ID9742
Feature activation-1.43e-01

Pos logit contributions
quartered+0.225
 DOT+0.215
 divides+0.197
Senator+0.196
idth+0.190
Neg logit contributions
burg-0.236
selves-0.234
lishing-0.218
aye-0.210
 Publication-0.204
TokenGreg
Token ID25025
Feature activation+2.82e+00

Pos logit contributions
quartered+0.124
 DOT+0.122
Senator+0.115
 divides+0.113
 elector+0.108
Neg logit contributions
burg-0.089
selves-0.089
lishing-0.081
aye-0.077
 Publication-0.075
Tokenor
Token ID273
Feature activation+3.09e-02

Pos logit contributions
burg+2.045
selves+1.867
aye+1.859
lishing+1.779
yah+1.706
Neg logit contributions
quartered-2.299
 DOT-2.068
Senator-2.003
 divides-1.949
idth-1.905
Token highlighted
Token ID14537
Feature activation-1.60e-01

Pos logit contributions
burg+0.027
selves+0.026
lishing+0.024
aye+0.024
 Publication+0.023
Neg logit contributions
quartered-0.020
 DOT-0.019
Senator-0.018
 divides-0.017
idth-0.017
Token how
Token ID703
Feature activation+2.82e-02

Pos logit contributions
quartered+0.117
 DOT+0.110
Senator+0.102
 divides+0.099
idth+0.097
Neg logit contributions
burg-0.125
selves-0.124
lishing-0.115
aye-0.111
 Publication-0.109
Token important
Token ID1593
Feature activation-1.20e+00

Pos logit contributions
burg+0.022
selves+0.022
lishing+0.020
aye+0.020
 Publication+0.019
Neg logit contributions
quartered-0.020
 DOT-0.019
Senator-0.018
 divides-0.017
idth-0.017
Token it
Token ID340
Feature activation-4.25e-01

Pos logit contributions
quartered+0.862
 DOT+0.822
Senator+0.761
 divides+0.743
idth+0.719
Neg logit contributions
burg-0.956
selves-0.948
lishing-0.882
aye-0.854
 Publication-0.837
Token always
Token ID1464
Feature activation+3.24e-01

Pos logit contributions
burg+0.297
selves+0.294
lishing+0.270
aye+0.263
 Publication+0.256
Neg logit contributions
quartered-0.291
 DOT-0.279
Senator-0.258
 divides-0.253
idth-0.246
Token be
Token ID307
Feature activation+2.74e-01

Pos logit contributions
burg+0.242
selves+0.241
lishing+0.221
aye+0.214
 Publication+0.209
Neg logit contributions
quartered-0.248
 DOT-0.237
Senator-0.220
 divides-0.214
idth-0.209
Token the
Token ID262
Feature activation-1.16e-02

Pos logit contributions
burg+0.217
selves+0.215
lishing+0.199
aye+0.192
 Publication+0.188
Neg logit contributions
quartered-0.198
 DOT-0.188
Senator-0.174
 divides-0.171
idth-0.166
Token price
Token ID2756
Feature activation+3.88e-01

Pos logit contributions
quartered+0.008
 DOT+0.008
Senator+0.007
 divides+0.007
idth+0.007
Neg logit contributions
burg-0.009
selves-0.009
lishing-0.008
aye-0.008
 Publication-0.008
Token of
Token ID286
Feature activation-2.65e-05

Pos logit contributions
burg+0.318
selves+0.313
lishing+0.291
aye+0.283
 Publication+0.276
Neg logit contributions
quartered-0.270
 DOT-0.256
Senator-0.236
 divides-0.230
idth-0.225
Token inflation
Token ID10610
Feature activation+2.82e+00

Pos logit contributions
quartered+0.000
 DOT+0.000
Senator+0.000
 divides+0.000
idth+0.000
Neg logit contributions
burg-0.000
selves-0.000
lishing-0.000
aye-0.000
 Publication-0.000
Token."
Token ID526
Feature activation+4.28e-01

Pos logit contributions
burg+1.962
selves+1.950
lishing+1.779
aye+1.720
 Publication+1.671
Neg logit contributions
quartered-2.300
 DOT-2.189
Senator-2.036
 divides-1.994
idth-1.953
TokenHe
Token ID1544
Feature activation-5.56e-02

Pos logit contributions
burg+0.331
selves+0.328
lishing+0.303
aye+0.292
 Publication+0.288
Neg logit contributions
quartered-0.318
 DOT-0.302
Senator-0.276
 divides-0.276
idth-0.268
Token correctly
Token ID9380
Feature activation-8.67e-02

Pos logit contributions
quartered+0.034
 DOT+0.032
Senator+0.030
 divides+0.029
idth+0.028
Neg logit contributions
burg-0.050
selves-0.049
lishing-0.046
aye-0.045
 Publication-0.044
Token predicted
Token ID11001
Feature activation+2.27e-01

Pos logit contributions
quartered+0.050
 DOT+0.047
Senator+0.042
 divides+0.040
idth+0.040
Neg logit contributions
burg-0.081
selves-0.081
lishing-0.076
aye-0.074
 Publication-0.072
Token the
Token ID262
Feature activation-3.72e-02

Pos logit contributions
burg+0.176
selves+0.175
lishing+0.162
aye+0.156
 Publication+0.154
Neg logit contributions
quartered-0.167
 DOT-0.159
Senator-0.147
 divides-0.144
idth-0.140
Token
Token ID247
Feature activation-1.73e-01

Pos logit contributions
burg+0.098
selves+0.097
lishing+0.085
aye+0.079
 Publication+0.075
Neg logit contributions
quartered-0.241
 DOT-0.234
Senator-0.222
 divides-0.220
idth-0.215
Tokens
Token ID82
Feature activation+2.12e-01

Pos logit contributions
quartered+0.161
 DOT+0.157
Senator+0.147
 divides+0.145
 elector+0.142
Neg logit contributions
selves-0.100
burg-0.099
lishing-0.089
aye-0.084
 Publication-0.082
Token sake
Token ID11060
Feature activation-2.17e-01

Pos logit contributions
selves+0.164
burg+0.164
lishing+0.151
aye+0.146
 Publication+0.143
Neg logit contributions
quartered-0.155
 DOT-0.148
Senator-0.139
 divides-0.135
idth-0.131
Token,
Token ID11
Feature activation+1.83e-01

Pos logit contributions
quartered+0.158
 DOT+0.151
Senator+0.140
 divides+0.137
idth+0.133
Neg logit contributions
burg-0.168
selves-0.167
lishing-0.156
aye-0.150
 Publication-0.147
Token American
Token ID1605
Feature activation+1.82e-01

Pos logit contributions
burg+0.150
selves+0.149
lishing+0.138
aye+0.134
 Publication+0.132
Neg logit contributions
quartered-0.125
 DOT-0.119
Senator-0.111
 divides-0.108
idth-0.105
Token Horror
Token ID22732
Feature activation+2.82e+00

Pos logit contributions
burg+0.131
selves+0.129
lishing+0.119
aye+0.114
 Publication+0.112
Neg logit contributions
quartered-0.144
 DOT-0.138
Senator-0.129
 divides-0.126
idth-0.123
Token Story
Token ID8362
Feature activation-5.89e-01

Pos logit contributions
burg+2.125
selves+2.063
lishing+1.931
 Publication+1.891
 Wit+1.861
Neg logit contributions
quartered-2.163
 DOT-1.994
Senator-1.862
 divides-1.825
idth-1.776
Token:
Token ID25
Feature activation-1.16e-01

Pos logit contributions
quartered+0.405
 DOT+0.386
Senator+0.358
 divides+0.345
idth+0.338
Neg logit contributions
burg-0.484
selves-0.479
lishing-0.447
aye-0.430
 Publication-0.425
Token Murder
Token ID23375
Feature activation+4.77e-01

Pos logit contributions
quartered+0.107
 DOT+0.103
Senator+0.097
 divides+0.095
idth+0.094
Neg logit contributions
burg-0.069
selves-0.067
lishing-0.062
 Publication-0.058
aye-0.057
Token House
Token ID2097
Feature activation-2.77e-01

Pos logit contributions
burg+0.369
selves+0.357
lishing+0.333
aye+0.323
 Publication+0.318
Neg logit contributions
quartered-0.353
 DOT-0.339
Senator-0.315
 divides-0.308
idth-0.305
Token opened
Token ID4721
Feature activation-2.11e-01

Pos logit contributions
quartered+0.184
 DOT+0.175
Senator+0.162
 divides+0.155
idth+0.153
Neg logit contributions
burg-0.234
selves-0.231
lishing-0.216
aye-0.208
 Publication-0.206
Token which
Token ID543
Feature activation+4.19e-02

Pos logit contributions
quartered+0.370
 DOT+0.353
Senator+0.325
 divides+0.318
idth+0.307
Neg logit contributions
burg-0.449
selves-0.445
lishing-0.416
aye-0.402
 Publication-0.394
Token time
Token ID640
Feature activation+5.57e-01

Pos logit contributions
burg+0.028
selves+0.028
lishing+0.025
aye+0.025
 Publication+0.024
Neg logit contributions
quartered-0.035
 DOT-0.033
Senator-0.032
 divides-0.031
idth-0.030
Token the
Token ID262
Feature activation+3.07e-02

Pos logit contributions
burg+0.440
selves+0.434
lishing+0.402
aye+0.389
 Publication+0.383
Neg logit contributions
quartered-0.397
 DOT-0.376
Senator-0.354
 divides-0.343
idth-0.336
Token firms
Token ID9611
Feature activation-7.38e-02

Pos logit contributions
burg+0.024
selves+0.023
lishing+0.022
aye+0.021
 Publication+0.020
Neg logit contributions
quartered-0.023
 DOT-0.021
Senator-0.020
 divides-0.020
idth-0.019
Token coll
Token ID2927
Feature activation+2.99e-01

Pos logit contributions
quartered+0.049
 DOT+0.047
Senator+0.043
 divides+0.042
idth+0.041
Neg logit contributions
burg-0.062
selves-0.062
lishing-0.057
aye-0.056
 Publication-0.055
Tokenuded
Token ID19289
Feature activation+2.79e+00

Pos logit contributions
burg+0.233
selves+0.230
lishing+0.212
aye+0.206
 Publication+0.198
Neg logit contributions
quartered-0.219
 DOT-0.209
Senator-0.195
 divides-0.190
idth-0.185
Token to
Token ID284
Feature activation+3.02e-01

Pos logit contributions
burg+2.219
selves+2.210
lishing+2.023
aye+1.941
 Wit+1.927
Neg logit contributions
quartered-1.972
 DOT-1.876
Senator-1.731
idth-1.678
 divides-1.655
Token set
Token ID900
Feature activation+8.73e-01

Pos logit contributions
burg+0.253
selves+0.252
lishing+0.234
aye+0.227
 Publication+0.224
Neg logit contributions
quartered-0.204
 DOT-0.191
Senator-0.177
 divides-0.170
idth-0.169
Token each
Token ID1123
Feature activation+2.41e-01

Pos logit contributions
selves+0.531
burg+0.531
lishing+0.478
aye+0.458
 Publication+0.446
Neg logit contributions
quartered-0.783
 DOT-0.751
Senator-0.714
 divides-0.696
idth-0.682
Token others
Token ID1854
Feature activation-1.47e-01

Pos logit contributions
burg+0.212
selves+0.211
lishing+0.197
aye+0.190
 Publication+0.188
Neg logit contributions
quartered-0.151
 DOT-0.143
Senator-0.132
 divides-0.128
idth-0.123
Token'
Token ID6
Feature activation+1.96e-01

Pos logit contributions
quartered+0.116
 DOT+0.111
Senator+0.104
 divides+0.102
idth+0.099
Neg logit contributions
selves-0.106
burg-0.106
lishing-0.097
aye-0.093
 Publication-0.091
Token.
Token ID13
Feature activation+2.82e-03

Pos logit contributions
burg+0.392
selves+0.389
lishing+0.358
aye+0.347
 Publication+0.338
Neg logit contributions
quartered-0.402
 DOT-0.384
Senator-0.358
 divides-0.349
idth-0.341
Token
Token ID198
Feature activation-2.71e-01

Pos logit contributions
burg+0.002
selves+0.002
lishing+0.002
 Publication+0.002
aye+0.002
Neg logit contributions
quartered-0.002
 DOT-0.002
Senator-0.002
 divides-0.002
idth-0.002
Token
Token ID198
Feature activation-2.34e-01

Pos logit contributions
quartered+0.208
 DOT+0.188
Senator+0.174
 divides+0.174
idth+0.173
Neg logit contributions
burg-0.213
selves-0.209
lishing-0.192
aye-0.184
 Publication-0.182
TokenFollow
Token ID7155
Feature activation+6.03e-01

Pos logit contributions
quartered+0.175
 DOT+0.165
 divides+0.153
Senator+0.150
idth+0.148
Neg logit contributions
burg-0.181
selves-0.178
lishing-0.166
aye-0.160
 Publication-0.156
Token @
Token ID2488
Feature activation+1.09e+00

Pos logit contributions
burg+0.480
selves+0.474
lishing+0.442
aye+0.428
 Publication+0.421
Neg logit contributions
quartered-0.429
 DOT-0.408
Senator-0.379
 divides-0.372
idth-0.361
Tokengreg
Token ID9903
Feature activation+2.79e+00

Pos logit contributions
burg+0.844
selves+0.828
lishing+0.773
aye+0.748
 Publication+0.728
Neg logit contributions
quartered-0.804
 DOT-0.772
Senator-0.705
 divides-0.703
idth-0.677
Tokenory
Token ID652
Feature activation-2.30e-01

Pos logit contributions
burg+2.094
selves+2.001
lishing+1.896
aye+1.814
seq+1.722
Neg logit contributions
quartered-2.179
 DOT-2.064
Senator-1.934
 divides-1.897
 elector-1.825
Tokenk
Token ID74
Feature activation+5.43e-01

Pos logit contributions
quartered+0.204
 DOT+0.198
Senator+0.185
 divides+0.183
idth+0.178
Neg logit contributions
selves-0.140
burg-0.137
lishing-0.126
aye-0.121
 Publication-0.120
Tokenort
Token ID419
Feature activation+1.49e-01

Pos logit contributions
burg+0.386
selves+0.374
lishing+0.347
aye+0.338
 Publication+0.324
Neg logit contributions
quartered-0.438
 DOT-0.419
Senator-0.395
 divides-0.390
 elector-0.376
Tokene
Token ID68
Feature activation+7.43e-01

Pos logit contributions
burg+0.076
selves+0.073
lishing+0.067
aye+0.063
 Publication+0.061
Neg logit contributions
quartered-0.149
 DOT-0.144
Senator-0.137
 divides-0.134
idth-0.132
Token on
Token ID319
Feature activation+9.89e-02

Pos logit contributions
burg+0.578
selves+0.555
lishing+0.521
aye+0.515
 Publication+0.496
Neg logit contributions
quartered-0.556
 DOT-0.521
Senator-0.488
 divides-0.481
idth-0.467
Token their
Token ID511
Feature activation+2.75e-01

Pos logit contributions
burg+0.712
selves+0.709
lishing+0.656
aye+0.632
 Publication+0.621
Neg logit contributions
quartered-0.647
 DOT-0.614
Senator-0.576
 divides-0.557
idth-0.542
Token graphics
Token ID9382
Feature activation+1.19e+00

Pos logit contributions
burg+0.211
selves+0.211
lishing+0.195
aye+0.187
 Publication+0.184
Neg logit contributions
quartered-0.202
 DOT-0.193
Senator-0.180
 divides-0.176
idth-0.170
Token cards
Token ID4116
Feature activation+1.73e-01

Pos logit contributions
burg+0.928
selves+0.920
lishing+0.852
aye+0.821
 Publication+0.805
Neg logit contributions
quartered-0.868
 DOT-0.828
Senator-0.769
 divides-0.748
idth-0.728
Token.
Token ID13
Feature activation+3.20e-01

Pos logit contributions
burg+0.135
selves+0.134
lishing+0.124
aye+0.120
 Publication+0.117
Neg logit contributions
quartered-0.127
 DOT-0.121
Senator-0.113
 divides-0.110
idth-0.107
Token Comp
Token ID3082
Feature activation+9.89e-02

Pos logit contributions
burg+0.240
selves+0.238
lishing+0.221
aye+0.212
 Publication+0.208
Neg logit contributions
quartered-0.242
 DOT-0.232
Senator-0.215
 divides-0.212
idth-0.205
Tokeneting
Token ID13629
Feature activation+2.77e+00

Pos logit contributions
burg+0.080
selves+0.079
lishing+0.075
aye+0.072
 Publication+0.070
Neg logit contributions
quartered-0.069
 DOT-0.066
Senator-0.061
 divides-0.060
 elector-0.057
Token with
Token ID351
Feature activation+2.63e-01

Pos logit contributions
burg+2.158
selves+2.157
lishing+2.001
aye+1.904
 Publication+1.897
Neg logit contributions
quartered-1.949
 DOT-1.864
Senator-1.777
 divides-1.679
idth-1.679
Token flagship
Token ID21336
Feature activation+3.29e-02

Pos logit contributions
burg+0.212
selves+0.210
lishing+0.195
aye+0.189
 Publication+0.186
Neg logit contributions
quartered-0.185
 DOT-0.175
Senator-0.164
 divides-0.159
idth-0.155
Token design
Token ID1486
Feature activation-2.27e-01

Pos logit contributions
burg+0.026
selves+0.025
lishing+0.024
aye+0.023
 Publication+0.022
Neg logit contributions
quartered-0.024
 DOT-0.023
Senator-0.021
 divides-0.021
idth-0.020
Token of
Token ID286
Feature activation+5.04e-01

Pos logit contributions
quartered+0.158
 DOT+0.152
Senator+0.139
 divides+0.137
idth+0.132
Neg logit contributions
burg-0.184
selves-0.183
lishing-0.171
aye-0.165
 Publication-0.161
Token Lightning
Token ID12469
Feature activation-3.70e-01

Pos logit contributions
burg+0.382
selves+0.379
lishing+0.350
aye+0.336
 Publication+0.331
Neg logit contributions
quartered-0.378
 DOT-0.358
Senator-0.338
 divides-0.330
idth-0.322
Token writing
Token ID3597
Feature activation-1.95e-01

Pos logit contributions
burg+0.186
selves+0.183
lishing+0.172
aye+0.166
 Publication+0.163
Neg logit contributions
quartered-0.166
 DOT-0.158
Senator-0.147
 divides-0.143
idth-0.139
Token that
Token ID326
Feature activation+2.36e-01

Pos logit contributions
quartered+0.139
 DOT+0.133
Senator+0.123
 divides+0.120
idth+0.116
Neg logit contributions
burg-0.155
selves-0.154
lishing-0.143
aye-0.139
 Publication-0.136
Token a
Token ID257
Feature activation+4.29e-01

Pos logit contributions
burg+0.185
selves+0.183
lishing+0.171
aye+0.165
 Publication+0.162
Neg logit contributions
quartered-0.171
 DOT-0.163
Senator-0.151
 divides-0.147
idth-0.143
Token book
Token ID1492
Feature activation-3.18e-03

Pos logit contributions
burg+0.335
selves+0.332
lishing+0.311
aye+0.299
 Publication+0.293
Neg logit contributions
quartered-0.311
 DOT-0.298
Senator-0.275
 divides-0.270
idth-0.261
Token compet
Token ID2307
Feature activation+9.45e-01

Pos logit contributions
quartered+0.002
 DOT+0.002
Senator+0.002
 divides+0.002
idth+0.002
Neg logit contributions
burg-0.003
selves-0.003
lishing-0.002
aye-0.002
 Publication-0.002
Tokenes
Token ID274
Feature activation+2.77e+00

Pos logit contributions
burg+0.663
selves+0.656
lishing+0.605
aye+0.581
 Publication+0.564
Neg logit contributions
quartered-0.764
 DOT-0.735
Senator-0.682
 divides-0.672
idth-0.654
Token with
Token ID351
Feature activation+4.06e-01

Pos logit contributions
burg+2.045
selves+2.039
lishing+1.871
aye+1.803
 Wit+1.764
Neg logit contributions
quartered-2.112
 DOT-2.005
Senator-1.877
 divides-1.812
idth-1.806
Token Snapchat
Token ID31546
Feature activation+1.32e+00

Pos logit contributions
burg+0.331
selves+0.328
lishing+0.307
aye+0.296
 Publication+0.291
Neg logit contributions
quartered-0.283
 DOT-0.268
Senator-0.247
 divides-0.241
idth-0.235
Token and
Token ID290
Feature activation+2.13e-01

Pos logit contributions
burg+1.026
selves+1.012
lishing+0.943
aye+0.905
 Publication+0.887
Neg logit contributions
quartered-0.985
 DOT-0.932
Senator-0.866
 divides-0.847
idth-0.827
Token Instagram
Token ID10767
Feature activation+6.05e-01

Pos logit contributions
burg+0.145
selves+0.143
lishing+0.132
aye+0.127
 Publication+0.124
Neg logit contributions
quartered-0.177
 DOT-0.169
Senator-0.159
 divides-0.155
idth-0.152
Token and
Token ID290
Feature activation-6.95e-02

Pos logit contributions
burg+0.471
selves+0.466
lishing+0.434
aye+0.417
 Publication+0.408
Neg logit contributions
quartered-0.446
 DOT-0.424
Senator-0.394
 divides-0.384
idth-0.375
Token<|endoftext|>
Token ID50256
Feature activation+1.33e-01

Pos logit contributions
quartered+0.046
 DOT+0.043
Senator+0.039
 divides+0.039
idth+0.038
Neg logit contributions
burg-0.053
selves-0.052
lishing-0.049
aye-0.047
 Publication-0.046
TokenImage
Token ID5159
Feature activation-3.55e-01

Pos logit contributions
burg+0.097
selves+0.096
lishing+0.089
aye+0.086
 Publication+0.083
Neg logit contributions
quartered-0.104
 DOT-0.100
 divides-0.092
Senator-0.091
idth-0.089
Token copyright
Token ID6634
Feature activation-3.17e+00

Pos logit contributions
quartered+0.243
 DOT+0.233
Senator+0.213
 divides+0.209
idth+0.203
Neg logit contributions
burg-0.292
selves-0.289
lishing-0.271
aye-0.262
 Publication-0.258
Token publicity
Token ID19403
Feature activation-5.42e-01

Pos logit contributions
quartered+2.114
 DOT+2.059
Senator+1.879
 divides+1.839
 elector+1.750
Neg logit contributions
burg-2.619
selves-2.596
lishing-2.433
aye-2.352
 Publication-2.286
Token Image
Token ID7412
Feature activation-6.76e-01

Pos logit contributions
quartered+0.382
 DOT+0.366
Senator+0.338
 divides+0.331
idth+0.319
Neg logit contributions
burg-0.434
selves-0.430
lishing-0.402
aye-0.388
 Publication-0.380
Token caption
Token ID8305
Feature activation-3.28e+00

Pos logit contributions
quartered+0.506
 DOT+0.476
Senator+0.439
 divides+0.431
idth+0.423
Neg logit contributions
burg-0.522
selves-0.520
lishing-0.482
aye-0.463
 Publication-0.458
Token The
Token ID383
Feature activation+2.41e-01

Pos logit contributions
quartered+2.367
 DOT+2.313
Senator+2.138
 divides+2.085
 elector+1.994
Neg logit contributions
burg-2.523
selves-2.503
lishing-2.343
aye-2.266
 Publication-2.178
Token Greeks
Token ID25059
Feature activation-3.62e-01

Pos logit contributions
burg+0.190
selves+0.189
lishing+0.176
aye+0.169
 Publication+0.166
Neg logit contributions
quartered-0.173
 DOT-0.165
Senator-0.153
 divides-0.150
idth-0.146
Token were
Token ID547
Feature activation+4.08e-01

Pos logit contributions
quartered+0.238
 DOT+0.225
Senator+0.207
 divides+0.201
idth+0.198
Neg logit contributions
burg-0.311
selves-0.308
lishing-0.285
aye-0.278
 Publication-0.273
Token totally
Token ID6635
Feature activation-2.63e-01

Pos logit contributions
burg+0.331
selves+0.329
lishing+0.303
aye+0.293
 Publication+0.288
Neg logit contributions
quartered-0.286
 DOT-0.270
Senator-0.250
idth-0.243
 divides-0.242
Token "
Token ID366
Feature activation-4.84e-01

Pos logit contributions
quartered+0.195
 DOT+0.186
Senator+0.173
 divides+0.169
idth+0.165
Neg logit contributions
burg-0.201
selves-0.200
lishing-0.186
aye-0.179
 Publication-0.175
Token the
Token ID262
Feature activation-2.00e-01

Pos logit contributions
burg+0.177
selves+0.175
lishing+0.164
aye+0.156
 Publication+0.155
Neg logit contributions
quartered-0.153
 DOT-0.143
Senator-0.135
 divides-0.130
idth-0.128
Token fashionable
Token ID38378
Feature activation-2.69e-01

Pos logit contributions
quartered+0.141
 DOT+0.131
Senator+0.125
 divides+0.120
idth+0.120
Neg logit contributions
burg-0.162
selves-0.160
lishing-0.148
 Publication-0.142
aye-0.142
Token Downtown
Token ID25657
Feature activation+1.26e+00

Pos logit contributions
quartered+0.191
 DOT+0.179
Senator+0.170
 divides+0.163
idth+0.161
Neg logit contributions
burg-0.217
selves-0.213
lishing-0.199
aye-0.190
 Publication-0.189
Token.
Token ID13
Feature activation-9.39e-02

Pos logit contributions
burg+0.977
selves+0.953
lishing+0.885
 Publication+0.846
aye+0.839
Neg logit contributions
quartered-0.937
 DOT-0.869
Senator-0.836
 divides-0.800
idth-0.794
Token He
Token ID679
Feature activation-4.49e-02

Pos logit contributions
quartered+0.066
 DOT+0.062
Senator+0.058
 divides+0.057
idth+0.055
Neg logit contributions
burg-0.076
selves-0.076
lishing-0.070
aye-0.067
 Publication-0.067
Tokenfty
Token ID19628
Feature activation-3.24e+00

Pos logit contributions
quartered+0.035
 DOT+0.033
Senator+0.031
 divides+0.030
 elector+0.029
Neg logit contributions
burg-0.033
selves-0.033
lishing-0.030
aye-0.029
 Publication-0.028
Token tax
Token ID1687
Feature activation-9.40e-01

Pos logit contributions
quartered+2.300
 DOT+2.212
Senator+2.032
 divides+2.004
idth+1.923
Neg logit contributions
burg-2.576
selves-2.561
lishing-2.387
aye-2.308
 Publication-2.253
Token breaks
Token ID9457
Feature activation-3.44e-01

Pos logit contributions
quartered+0.616
 DOT+0.577
Senator+0.537
idth+0.511
 divides+0.509
Neg logit contributions
burg-0.812
selves-0.796
lishing-0.746
aye-0.728
 Publication-0.711
Token have
Token ID423
Feature activation+5.77e-02

Pos logit contributions
quartered+0.248
 DOT+0.237
Senator+0.221
 divides+0.214
idth+0.210
Neg logit contributions
burg-0.272
selves-0.269
lishing-0.249
aye-0.242
 Publication-0.236
Token been
Token ID587
Feature activation-3.36e-01

Pos logit contributions
burg+0.046
selves+0.046
lishing+0.043
aye+0.041
 Publication+0.040
Neg logit contributions
quartered-0.041
 DOT-0.039
Senator-0.036
 divides-0.035
idth-0.034
Token granted
Token ID7520
Feature activation-8.96e-01

Pos logit contributions
quartered+0.220
 DOT+0.207
Senator+0.192
 divides+0.186
idth+0.182
Neg logit contributions
burg-0.288
selves-0.284
lishing-0.267
aye-0.258
 Publication-0.253
Token000
Token ID830
Feature activation+3.83e-01

Pos logit contributions
burg+0.189
selves+0.187
lishing+0.173
aye+0.166
 Publication+0.162
Neg logit contributions
quartered-0.203
 DOT-0.189
Senator-0.176
 divides-0.174
idth-0.172
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
burg+0.270
selves+0.266
lishing+0.246
aye+0.237
 Publication+0.232
Neg logit contributions
quartered-0.308
 DOT-0.293
Senator-0.277
 divides-0.270
idth-0.268
Token
Token ID198
Feature activation-2.21e-02

Pos logit contributions
burg+0.122
selves+0.122
lishing+0.114
aye+0.110
 Publication+0.108
Neg logit contributions
quartered-0.101
 DOT-0.096
Senator-0.089
 divides-0.087
idth-0.084
Token
Token ID198
Feature activation-5.63e-02

Pos logit contributions
quartered+0.017
 DOT+0.015
 divides+0.014
Senator+0.014
idth+0.014
Neg logit contributions
burg-0.017
selves-0.017
lishing-0.016
aye-0.015
 Publication-0.015
TokenImage
Token ID5159
Feature activation-6.14e-02

Pos logit contributions
quartered+0.043
 DOT+0.041
 divides+0.038
Senator+0.038
idth+0.037
Neg logit contributions
burg-0.042
selves-0.042
lishing-0.039
aye-0.037
 Publication-0.036
Token caption
Token ID8305
Feature activation-3.23e+00

Pos logit contributions
quartered+0.047
 DOT+0.044
Senator+0.040
idth+0.040
 divides+0.040
Neg logit contributions
burg-0.047
selves-0.046
lishing-0.043
aye-0.041
 Publication-0.041
Token Council
Token ID4281
Feature activation-4.12e-01

Pos logit contributions
quartered+2.384
 DOT+2.347
Senator+2.162
 divides+2.109
 Colts+2.028
Neg logit contributions
burg-2.419
selves-2.393
lishing-2.246
aye-2.172
 Publication-2.075
Token staff
Token ID3085
Feature activation-1.20e-02

Pos logit contributions
quartered+0.307
 DOT+0.295
Senator+0.274
 divides+0.268
idth+0.260
Neg logit contributions
burg-0.313
selves-0.310
lishing-0.288
aye-0.278
 Publication-0.272
Token found
Token ID1043
Feature activation+1.34e+00

Pos logit contributions
quartered+0.008
 DOT+0.008
Senator+0.007
 divides+0.007
idth+0.007
Neg logit contributions
burg-0.010
selves-0.010
lishing-0.009
aye-0.009
 Publication-0.009
Token paperwork
Token ID23143
Feature activation+1.48e-02

Pos logit contributions
burg+1.076
selves+1.075
lishing+0.994
aye+0.961
 Publication+0.943
Neg logit contributions
quartered-0.949
 DOT-0.887
Senator-0.834
 divides-0.803
idth-0.796
Token with
Token ID351
Feature activation+1.60e-01

Pos logit contributions
burg+0.012
selves+0.012
lishing+0.011
aye+0.010
 Publication+0.010
Neg logit contributions
quartered-0.011
 DOT-0.010
Senator-0.009
 divides-0.009
idth-0.009
TokenACE
Token ID11598
Feature activation+1.54e-01

Pos logit contributions
burg+0.450
selves+0.440
lishing+0.407
aye+0.397
 Publication+0.381
Neg logit contributions
quartered-0.530
 DOT-0.509
Senator-0.477
 divides-0.473
 elector-0.457
Token).
Token ID737
Feature activation+3.88e-01

Pos logit contributions
burg+0.129
selves+0.128
lishing+0.120
aye+0.116
 Publication+0.114
Neg logit contributions
quartered-0.104
 DOT-0.099
Senator-0.091
 divides-0.089
idth-0.086
Token
Token ID198
Feature activation-3.13e-01

Pos logit contributions
burg+0.323
selves+0.321
lishing+0.300
aye+0.291
 Publication+0.284
Neg logit contributions
quartered-0.261
 DOT-0.250
Senator-0.230
 divides-0.225
idth-0.216
Token
Token ID198
Feature activation-3.02e-01

Pos logit contributions
quartered+0.236
 DOT+0.216
 divides+0.199
Senator+0.198
idth+0.196
Neg logit contributions
burg-0.248
selves-0.244
lishing-0.226
aye-0.215
 Publication-0.214
TokenImage
Token ID5159
Feature activation-8.32e-03

Pos logit contributions
quartered+0.229
 DOT+0.220
Senator+0.202
 divides+0.202
idth+0.194
Neg logit contributions
burg-0.227
selves-0.225
lishing-0.209
aye-0.201
 Publication-0.196
Token caption
Token ID8305
Feature activation-3.23e+00

Pos logit contributions
quartered+0.006
 DOT+0.005
Senator+0.005
 divides+0.005
idth+0.005
Neg logit contributions
burg-0.007
selves-0.007
lishing-0.006
aye-0.006
 Publication-0.006
Token Council
Token ID4281
Feature activation-3.99e-01

Pos logit contributions
quartered+2.460
 DOT+2.420
Senator+2.237
 divides+2.183
 Colts+2.104
Neg logit contributions
burg-2.336
selves-2.307
lishing-2.162
aye-2.089
 Publication-1.990
Token staff
Token ID3085
Feature activation+1.44e-01

Pos logit contributions
quartered+0.251
 DOT+0.239
Senator+0.219
 divides+0.213
idth+0.207
Neg logit contributions
burg-0.351
selves-0.348
lishing-0.326
aye-0.316
 Publication-0.311
Token s
Token ID264
Feature activation+5.68e-01

Pos logit contributions
burg+0.126
selves+0.125
lishing+0.116
aye+0.113
 Publication+0.111
Neg logit contributions
quartered-0.092
 DOT-0.087
Senator-0.080
 divides-0.077
idth-0.076
Tokenifted
Token ID21715
Feature activation-2.94e-01

Pos logit contributions
selves+0.482
burg+0.480
lishing+0.443
aye+0.433
 Publication+0.419
Neg logit contributions
quartered-0.377
 DOT-0.361
Senator-0.334
 divides-0.324
 elector-0.310
Token through
Token ID832
Feature activation+7.89e-01

Pos logit contributions
quartered+0.226
 DOT+0.217
Senator+0.202
 divides+0.198
idth+0.192
Neg logit contributions
burg-0.216
selves-0.215
lishing-0.199
aye-0.192
 Publication-0.187
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
burg+0.006
selves+0.006
lishing+0.006
aye+0.006
 Publication+0.005
Neg logit contributions
quartered-0.007
 DOT-0.007
Senator-0.006
 divides-0.006
idth-0.006
Token
Token ID447
Feature activation+2.69e-02

Pos logit contributions
quartered+0.085
 DOT+0.080
 divides+0.074
Senator+0.074
idth+0.072
Neg logit contributions
burg-0.079
selves-0.079
lishing-0.072
aye-0.070
 Publication-0.069
Token
Token ID251
Feature activation-6.50e-02

Pos logit contributions
burg+0.013
selves+0.013
lishing+0.012
aye+0.011
 Publication+0.011
Neg logit contributions
quartered-0.027
 DOT-0.026
Senator-0.025
 divides-0.025
idth-0.024
Token<|endoftext|>
Token ID50256
Feature activation+1.33e-01

Pos logit contributions
quartered+0.046
 DOT+0.043
Senator+0.039
 divides+0.039
idth+0.038
Neg logit contributions
burg-0.053
selves-0.052
lishing-0.049
aye-0.047
 Publication-0.046
TokenImage
Token ID5159
Feature activation-3.55e-01

Pos logit contributions
burg+0.097
selves+0.096
lishing+0.089
aye+0.086
 Publication+0.083
Neg logit contributions
quartered-0.104
 DOT-0.100
 divides-0.092
Senator-0.091
idth-0.089
Token copyright
Token ID6634
Feature activation-3.17e+00

Pos logit contributions
quartered+0.243
 DOT+0.233
Senator+0.213
 divides+0.209
idth+0.203
Neg logit contributions
burg-0.292
selves-0.289
lishing-0.271
aye-0.262
 Publication-0.258
Token publicity
Token ID19403
Feature activation-5.42e-01

Pos logit contributions
quartered+2.114
 DOT+2.059
Senator+1.879
 divides+1.839
 elector+1.750
Neg logit contributions
burg-2.619
selves-2.596
lishing-2.433
aye-2.352
 Publication-2.286
Token Image
Token ID7412
Feature activation-6.76e-01

Pos logit contributions
quartered+0.382
 DOT+0.366
Senator+0.338
 divides+0.331
idth+0.319
Neg logit contributions
burg-0.434
selves-0.430
lishing-0.402
aye-0.388
 Publication-0.380
Token caption
Token ID8305
Feature activation-3.28e+00

Pos logit contributions
quartered+0.506
 DOT+0.476
Senator+0.439
 divides+0.431
idth+0.423
Neg logit contributions
burg-0.522
selves-0.520
lishing-0.482
aye-0.463
 Publication-0.458
Token The
Token ID383
Feature activation+2.41e-01

Pos logit contributions
quartered+2.367
 DOT+2.313
Senator+2.138
 divides+2.085
 elector+1.994
Neg logit contributions
burg-2.523
selves-2.503
lishing-2.343
aye-2.266
 Publication-2.178
Token Greeks
Token ID25059
Feature activation-3.62e-01

Pos logit contributions
burg+0.190
selves+0.189
lishing+0.176
aye+0.169
 Publication+0.166
Neg logit contributions
quartered-0.173
 DOT-0.165
Senator-0.153
 divides-0.150
idth-0.146
Token<|endoftext|>
Token ID50256
Feature activation+8.02e-02

Pos logit contributions
quartered+0.093
 DOT+0.089
Senator+0.082
 divides+0.081
idth+0.078
Neg logit contributions
burg-0.104
selves-0.103
lishing-0.096
aye-0.093
 Publication-0.091
TokenImage
Token ID5159
Feature activation-9.56e-02

Pos logit contributions
burg+0.058
selves+0.058
lishing+0.053
aye+0.052
 Publication+0.050
Neg logit contributions
quartered-0.063
 DOT-0.060
 divides-0.056
Senator-0.055
idth-0.054
Token copyright
Token ID6634
Feature activation-2.86e+00

Pos logit contributions
quartered+0.066
 DOT+0.063
Senator+0.058
 divides+0.056
idth+0.055
Neg logit contributions
burg-0.078
selves-0.078
lishing-0.073
aye-0.070
 Publication-0.069
Token AP
Token ID3486
Feature activation+4.45e-02

Pos logit contributions
quartered+1.906
 DOT+1.850
Senator+1.689
 divides+1.654
idth+1.574
Neg logit contributions
burg-2.382
selves-2.362
lishing-2.214
aye-2.141
 Publication-2.083
Token Image
Token ID7412
Feature activation-3.41e-01

Pos logit contributions
burg+0.039
selves+0.038
lishing+0.036
aye+0.035
 Publication+0.034
Neg logit contributions
quartered-0.028
 DOT-0.027
Senator-0.025
 divides-0.024
idth-0.023
Token caption
Token ID8305
Feature activation-3.14e+00

Pos logit contributions
quartered+0.257
 DOT+0.240
Senator+0.221
 divides+0.218
idth+0.214
Neg logit contributions
burg-0.263
selves-0.262
lishing-0.242
aye-0.233
 Publication-0.230
Token Peace
Token ID12689
Feature activation-4.58e-02

Pos logit contributions
quartered+2.199
 DOT+2.153
Senator+1.992
 divides+1.936
 Colts+1.847
Neg logit contributions
burg-2.467
selves-2.445
lishing-2.301
aye-2.225
 Publication-2.141
Token talks
Token ID6130
Feature activation+9.42e-02

Pos logit contributions
quartered+0.030
 DOT+0.028
Senator+0.026
 divides+0.025
idth+0.025
Neg logit contributions
burg-0.039
selves-0.039
lishing-0.036
aye-0.035
 Publication-0.035
Token between
Token ID1022
Feature activation+1.22e+00

Pos logit contributions
burg+0.077
selves+0.076
lishing+0.070
aye+0.068
 Publication+0.067
Neg logit contributions
quartered-0.065
 DOT-0.063
Senator-0.058
 divides-0.056
idth-0.056
Token the
Token ID262
Feature activation+4.66e-01

Pos logit contributions
burg+0.933
selves+0.918
lishing+0.848
aye+0.820
 Publication+0.805
Neg logit contributions
quartered-0.912
 DOT-0.861
Senator-0.806
idth-0.779
 divides-0.776
Token government
Token ID1230
Feature activation+3.27e-01

Pos logit contributions
burg+0.396
selves+0.394
lishing+0.365
aye+0.356
 Publication+0.349
Neg logit contributions
quartered-0.306
 DOT-0.290
Senator-0.268
 divides-0.259
idth-0.255
Token it
Token ID340
Feature activation+9.87e-02

Pos logit contributions
burg+0.609
selves+0.603
lishing+0.561
aye+0.541
 Publication+0.533
Neg logit contributions
quartered-0.556
 DOT-0.528
Senator-0.490
 divides-0.479
idth-0.467
Token mean
Token ID1612
Feature activation+5.05e-01

Pos logit contributions
burg+0.068
selves+0.067
lishing+0.061
aye+0.059
 Publication+0.058
Neg logit contributions
quartered-0.082
 DOT-0.078
Senator-0.073
 divides-0.071
idth-0.070
Token to
Token ID284
Feature activation-6.91e-02

Pos logit contributions
burg+0.431
selves+0.429
lishing+0.400
aye+0.388
 Publication+0.380
Neg logit contributions
quartered-0.331
 DOT-0.314
Senator-0.290
 divides-0.280
idth-0.272
Token an
Token ID281
Feature activation-2.72e-02

Pos logit contributions
quartered+0.050
 DOT+0.048
Senator+0.044
 divides+0.043
idth+0.042
Neg logit contributions
burg-0.054
selves-0.054
lishing-0.050
aye-0.048
 Publication-0.047
Token intro
Token ID18951
Feature activation+1.01e-01

Pos logit contributions
quartered+0.018
 DOT+0.017
Senator+0.016
 divides+0.015
idth+0.015
Neg logit contributions
burg-0.023
selves-0.023
lishing-0.021
aye-0.021
 Publication-0.021
Tokenvert
Token ID1851
Feature activation-3.14e+00

Pos logit contributions
burg+0.062
selves+0.061
lishing+0.056
aye+0.053
 Publication+0.052
Neg logit contributions
quartered-0.090
 DOT-0.087
Senator-0.082
 divides-0.081
idth-0.079
Token online
Token ID2691
Feature activation-5.64e-01

Pos logit contributions
quartered+2.079
 DOT+2.003
Senator+1.823
 divides+1.808
 elector+1.723
Neg logit contributions
burg-2.608
selves-2.583
lishing-2.439
aye-2.354
 Publication-2.304
Token?
Token ID30
Feature activation-4.91e-01

Pos logit contributions
quartered+0.431
 DOT+0.414
Senator+0.385
 divides+0.378
idth+0.366
Neg logit contributions
burg-0.418
selves-0.416
lishing-0.385
aye-0.370
 Publication-0.362
Token<|endoftext|>
Token ID50256
Feature activation+9.64e-03

Pos logit contributions
quartered+0.351
 DOT+0.333
Senator+0.309
 divides+0.303
idth+0.294
Neg logit contributions
burg-0.390
selves-0.388
lishing-0.360
aye-0.347
 Publication-0.342
TokenUpdate
Token ID10260
Feature activation+4.18e-01

Pos logit contributions
burg+0.007
selves+0.007
lishing+0.006
aye+0.006
 Publication+0.006
Neg logit contributions
quartered-0.008
 DOT-0.007
 divides-0.007
Senator-0.007
idth-0.006
Token:
Token ID25
Feature activation+6.38e-01

Pos logit contributions
burg+0.319
selves+0.316
lishing+0.293
aye+0.282
 Publication+0.279
Neg logit contributions
quartered-0.313
 DOT-0.297
Senator-0.277
 divides-0.271
idth-0.264
Token<|endoftext|>
Token ID50256
Feature activation+7.61e-01

Pos logit contributions
burg+0.247
selves+0.246
lishing+0.229
aye+0.222
 Publication+0.218
Neg logit contributions
quartered-0.206
 DOT-0.197
Senator-0.181
 divides-0.177
idth-0.171
TokenImage
Token ID5159
Feature activation+2.86e-01

Pos logit contributions
burg+0.557
selves+0.546
lishing+0.507
aye+0.490
 Publication+0.473
Neg logit contributions
quartered-0.596
 DOT-0.573
 divides-0.529
Senator-0.520
idth-0.509
Token copyright
Token ID6634
Feature activation-2.76e+00

Pos logit contributions
burg+0.233
selves+0.230
lishing+0.216
aye+0.208
 Publication+0.206
Neg logit contributions
quartered-0.198
 DOT-0.190
Senator-0.173
 divides-0.170
idth-0.166
Token ESA
Token ID42212
Feature activation-9.52e-01

Pos logit contributions
quartered+1.842
 DOT+1.786
Senator+1.631
 divides+1.598
idth+1.523
Neg logit contributions
burg-2.291
selves-2.273
lishing-2.129
aye-2.059
 Publication-2.004
Token Image
Token ID7412
Feature activation+8.26e-02

Pos logit contributions
quartered+0.644
 DOT+0.619
Senator+0.568
 divides+0.557
idth+0.535
Neg logit contributions
burg-0.787
selves-0.783
lishing-0.732
aye-0.707
 Publication-0.690
Token caption
Token ID8305
Feature activation-3.11e+00

Pos logit contributions
burg+0.064
selves+0.064
lishing+0.059
aye+0.057
 Publication+0.056
Neg logit contributions
quartered-0.061
 DOT-0.058
Senator-0.053
 divides-0.052
idth-0.051
Token Go
Token ID1514
Feature activation+7.41e-01

Pos logit contributions
quartered+2.184
 DOT+2.139
Senator+1.967
 divides+1.914
idth+1.833
Neg logit contributions
burg-2.446
selves-2.419
lishing-2.276
aye-2.204
 Publication-2.114
Tokence
Token ID344
Feature activation+2.59e-01

Pos logit contributions
burg+0.536
selves+0.528
lishing+0.489
aye+0.472
 Publication+0.457
Neg logit contributions
quartered-0.576
 DOT-0.558
Senator-0.518
 divides-0.512
idth-0.491
Token uses
Token ID3544
Feature activation-1.88e-01

Pos logit contributions
burg+0.206
selves+0.202
lishing+0.188
aye+0.182
 Publication+0.179
Neg logit contributions
quartered-0.185
 DOT-0.176
Senator-0.164
 divides-0.160
idth-0.155
Token an
Token ID281
Feature activation+6.71e-01

Pos logit contributions
quartered+0.132
 DOT+0.125
Senator+0.117
 divides+0.114
idth+0.110
Neg logit contributions
burg-0.152
selves-0.151
lishing-0.140
aye-0.136
 Publication-0.133
Token electric
Token ID5186
Feature activation-2.12e-01

Pos logit contributions
burg+0.537
selves+0.524
lishing+0.492
aye+0.480
 Publication+0.469
Neg logit contributions
quartered-0.471
 DOT-0.451
Senator-0.426
 divides-0.411
idth-0.402
Token million
Token ID1510
Feature activation-4.26e-01

Pos logit contributions
quartered+0.206
 DOT+0.198
Senator+0.184
 divides+0.180
idth+0.175
Neg logit contributions
burg-0.192
selves-0.190
lishing-0.176
aye-0.170
 Publication-0.165
Token years
Token ID812
Feature activation+1.63e+00

Pos logit contributions
quartered+0.328
 DOT+0.314
Senator+0.293
 divides+0.287
idth+0.278
Neg logit contributions
burg-0.314
selves-0.311
lishing-0.288
aye-0.278
 Publication-0.271
Token.
Token ID13
Feature activation+2.39e-01

Pos logit contributions
burg+1.163
selves+1.144
lishing+1.061
aye+1.025
 Publication+1.000
Neg logit contributions
quartered-1.291
 DOT-1.232
Senator-1.155
 divides-1.121
idth-1.104
Token<|endoftext|>
Token ID50256
Feature activation+1.14e-01

Pos logit contributions
burg+0.192
selves+0.191
lishing+0.178
aye+0.172
 Publication+0.169
Neg logit contributions
quartered-0.169
 DOT-0.160
Senator-0.148
 divides-0.145
idth-0.141
TokenImage
Token ID5159
Feature activation-2.39e-01

Pos logit contributions
burg+0.084
selves+0.082
lishing+0.076
aye+0.074
 Publication+0.071
Neg logit contributions
quartered-0.090
 DOT-0.086
 divides-0.079
Senator-0.079
idth-0.076
Token copyright
Token ID6634
Feature activation-3.11e+00

Pos logit contributions
quartered+0.164
 DOT+0.157
Senator+0.144
 divides+0.141
idth+0.137
Neg logit contributions
burg-0.197
selves-0.195
lishing-0.183
aye-0.176
 Publication-0.174
Token AFP
Token ID8916
Feature activation-1.43e+00

Pos logit contributions
quartered+2.072
 DOT+2.016
Senator+1.839
 divides+1.801
 elector+1.714
Neg logit contributions
burg-2.576
selves-2.553
lishing-2.393
aye-2.314
 Publication-2.249
Token
Token ID198
Feature activation+3.92e-01

Pos logit contributions
quartered+0.887
 DOT+0.855
Senator+0.778
 divides+0.762
idth+0.725
Neg logit contributions
burg-1.256
selves-1.253
lishing-1.169
aye-1.137
 Publication-1.106
Token
Token ID198
Feature activation+2.57e-01

Pos logit contributions
burg+0.311
selves+0.305
lishing+0.282
aye+0.268
 Publication+0.268
Neg logit contributions
quartered-0.298
 DOT-0.272
 divides-0.250
Senator-0.249
idth-0.248
TokenThe
Token ID464
Feature activation+2.27e-01

Pos logit contributions
burg+0.184
selves+0.182
lishing+0.168
aye+0.162
 Publication+0.157
Neg logit contributions
quartered-0.205
 DOT-0.197
Senator-0.182
 divides-0.181
idth-0.175
Token Bank
Token ID5018
Feature activation+4.59e-01

Pos logit contributions
burg+0.186
selves+0.185
lishing+0.173
aye+0.167
 Publication+0.164
Neg logit contributions
quartered-0.156
 DOT-0.149
Senator-0.137
 divides-0.134
idth-0.130
Token media
Token ID2056
Feature activation+6.76e-03

Pos logit contributions
quartered+0.027
 DOT+0.026
Senator+0.024
 divides+0.024
idth+0.023
Neg logit contributions
burg-0.030
selves-0.030
lishing-0.028
aye-0.027
 Publication-0.026
Token reported
Token ID2098
Feature activation-3.05e-02

Pos logit contributions
burg+0.006
selves+0.006
lishing+0.005
aye+0.005
 Publication+0.005
Neg logit contributions
quartered-0.004
 DOT-0.004
Senator-0.004
 divides-0.004
idth-0.004
Token.
Token ID13
Feature activation+3.53e-02

Pos logit contributions
quartered+0.025
 DOT+0.024
Senator+0.022
 divides+0.022
idth+0.021
Neg logit contributions
burg-0.021
selves-0.021
lishing-0.020
aye-0.019
 Publication-0.018
Token<|endoftext|>
Token ID50256
Feature activation+1.88e-01

Pos logit contributions
burg+0.028
selves+0.028
lishing+0.026
aye+0.025
 Publication+0.025
Neg logit contributions
quartered-0.025
 DOT-0.024
Senator-0.022
 divides-0.022
idth-0.021
TokenImage
Token ID5159
Feature activation-2.00e-01

Pos logit contributions
burg+0.137
selves+0.135
lishing+0.125
aye+0.121
 Publication+0.117
Neg logit contributions
quartered-0.147
 DOT-0.141
 divides-0.130
Senator-0.129
idth-0.126
Token caption
Token ID8305
Feature activation-3.09e+00

Pos logit contributions
quartered+0.137
 DOT+0.131
Senator+0.120
 divides+0.118
idth+0.114
Neg logit contributions
burg-0.164
selves-0.162
lishing-0.152
aye-0.147
 Publication-0.145
Token Some
Token ID2773
Feature activation-4.57e-01

Pos logit contributions
quartered+2.206
 DOT+2.147
Senator+1.987
 divides+1.935
 elector+1.852
Neg logit contributions
burg-2.401
selves-2.381
lishing-2.232
aye-2.154
 Publication-2.079
Token talked
Token ID6619
Feature activation-1.17e+00

Pos logit contributions
quartered+0.306
 DOT+0.293
Senator+0.269
 divides+0.263
idth+0.257
Neg logit contributions
burg-0.382
selves-0.380
lishing-0.354
aye-0.343
 Publication-0.336
Token of
Token ID286
Feature activation+2.62e-01

Pos logit contributions
quartered+0.748
 DOT+0.714
Senator+0.653
 divides+0.639
idth+0.614
Neg logit contributions
burg-1.006
selves-1.000
lishing-0.937
aye-0.908
 Publication-0.889
Token "
Token ID366
Feature activation-4.34e-01

Pos logit contributions
burg+0.208
selves+0.207
lishing+0.192
aye+0.185
 Publication+0.182
Neg logit contributions
quartered-0.188
 DOT-0.178
Senator-0.165
 divides-0.161
idth-0.158
Tokenb
Token ID65
Feature activation+6.81e-01

Pos logit contributions
quartered+0.350
 DOT+0.336
Senator+0.314
 divides+0.309
idth+0.299
Neg logit contributions
burg-0.302
selves-0.300
lishing-0.277
aye-0.266
 Publication-0.260
Token identified
Token ID5174
Feature activation-7.62e-01

Pos logit contributions
burg+0.639
selves+0.636
lishing+0.583
aye+0.574
 Publication+0.557
Neg logit contributions
quartered-0.543
 DOT-0.514
Senator-0.475
 divides-0.466
idth-0.453
Token as
Token ID355
Feature activation-9.41e-02

Pos logit contributions
quartered+0.540
 DOT+0.517
Senator+0.477
 divides+0.468
idth+0.453
Neg logit contributions
burg-0.609
selves-0.606
lishing-0.563
aye-0.545
 Publication-0.532
Token Joshua
Token ID20700
Feature activation+1.18e+00

Pos logit contributions
quartered+0.066
 DOT+0.062
Senator+0.058
 divides+0.057
idth+0.055
Neg logit contributions
burg-0.076
selves-0.076
lishing-0.070
aye-0.068
 Publication-0.067
Token E
Token ID412
Feature activation+1.63e-01

Pos logit contributions
burg+0.924
selves+0.910
lishing+0.837
aye+0.820
 Wit+0.799
Neg logit contributions
quartered-0.874
 DOT-0.815
Senator-0.765
 divides-0.748
idth-0.724
Tokenern
Token ID1142
Feature activation-1.06e+00

Pos logit contributions
burg+0.114
selves+0.113
lishing+0.104
aye+0.100
 Publication+0.097
Neg logit contributions
quartered-0.133
 DOT-0.127
Senator-0.119
 divides-0.118
 elector-0.113
Tokenisse
Token ID20782
Feature activation-3.06e+00

Pos logit contributions
quartered+0.902
 DOT+0.874
Senator+0.819
 divides+0.805
idth+0.780
Neg logit contributions
burg-0.703
selves-0.692
lishing-0.633
aye-0.610
 Publication-0.591
Token and
Token ID290
Feature activation+1.96e-01

Pos logit contributions
quartered+2.227
 DOT+2.150
Senator+1.983
 divides+1.948
idth+1.877
Neg logit contributions
selves-2.347
burg-2.347
lishing-2.189
aye-2.096
 Publication-2.058
Token Beau
Token ID32831
Feature activation-9.86e-02

Pos logit contributions
burg+0.155
selves+0.153
lishing+0.142
aye+0.138
 Publication+0.135
Neg logit contributions
quartered-0.141
 DOT-0.134
Senator-0.124
 divides-0.122
idth-0.118
Token Sch
Token ID3059
Feature activation-4.25e-01

Pos logit contributions
quartered+0.073
 DOT+0.070
Senator+0.065
 divides+0.064
idth+0.061
Neg logit contributions
burg-0.076
selves-0.075
lishing-0.070
aye-0.068
 Publication-0.066
Tokenoen
Token ID6571
Feature activation-1.33e+00

Pos logit contributions
quartered+0.341
 DOT+0.328
Senator+0.305
 divides+0.302
 elector+0.291
Neg logit contributions
burg-0.304
selves-0.298
lishing-0.277
aye-0.268
 Publication-0.257
Tokenhard
Token ID10424
Feature activation+9.22e-01

Pos logit contributions
quartered+1.116
 DOT+1.077
Senator+1.007
 divides+0.992
idth+0.962
Neg logit contributions
burg-0.883
selves-0.875
lishing-0.803
aye-0.770
 Publication-0.749
Token
Token ID198
Feature activation-3.87e-02

Pos logit contributions
quartered+0.020
 DOT+0.018
 divides+0.017
Senator+0.017
idth+0.017
Neg logit contributions
burg-0.021
selves-0.020
lishing-0.019
aye-0.018
 Publication-0.018
TokenImage
Token ID5159
Feature activation-1.30e-01

Pos logit contributions
quartered+0.030
 DOT+0.029
 divides+0.026
Senator+0.026
idth+0.026
Neg logit contributions
burg-0.029
selves-0.028
lishing-0.026
aye-0.025
 Publication-0.025
Token copyright
Token ID6634
Feature activation-3.00e+00

Pos logit contributions
quartered+0.088
 DOT+0.084
Senator+0.077
 divides+0.075
idth+0.073
Neg logit contributions
burg-0.108
selves-0.107
lishing-0.101
aye-0.097
 Publication-0.095
Token AFP
Token ID8916
Feature activation-1.25e+00

Pos logit contributions
quartered+1.981
 DOT+1.937
Senator+1.766
 divides+1.726
 elector+1.641
Neg logit contributions
burg-2.495
selves-2.475
lishing-2.324
aye-2.242
 Publication-2.180
Token Image
Token ID7412
Feature activation-3.70e-01

Pos logit contributions
quartered+0.761
 DOT+0.733
Senator+0.669
 divides+0.654
idth+0.620
Neg logit contributions
burg-1.116
selves-1.114
lishing-1.040
aye-1.008
 Publication-0.982
Token caption
Token ID8305
Feature activation-3.06e+00

Pos logit contributions
quartered+0.279
 DOT+0.260
Senator+0.240
 divides+0.237
idth+0.233
Neg logit contributions
burg-0.286
selves-0.284
lishing-0.263
aye-0.253
 Publication-0.251
Token The
Token ID383
Feature activation+2.44e-01

Pos logit contributions
quartered+2.141
 DOT+2.094
Senator+1.937
 divides+1.890
idth+1.795
Neg logit contributions
burg-2.410
selves-2.390
lishing-2.245
aye-2.173
 Publication-2.087
Token Kurds
Token ID23880
Feature activation-5.69e-01

Pos logit contributions
burg+0.191
selves+0.189
lishing+0.176
aye+0.170
 Publication+0.167
Neg logit contributions
quartered-0.177
 DOT-0.168
Senator-0.157
 divides-0.152
idth-0.150
Token'
Token ID6
Feature activation+9.01e-02

Pos logit contributions
quartered+0.361
 DOT+0.343
Senator+0.315
 divides+0.303
idth+0.301
Neg logit contributions
burg-0.502
selves-0.498
lishing-0.459
aye-0.451
 Publication-0.440
Token push
Token ID4574
Feature activation+1.18e+00

Pos logit contributions
selves+0.074
burg+0.074
lishing+0.068
aye+0.066
 Publication+0.065
Neg logit contributions
quartered-0.062
 DOT-0.059
Senator-0.055
 divides-0.053
idth-0.052
Token has
Token ID468
Feature activation-1.72e-01

Pos logit contributions
burg+0.958
selves+0.949
lishing+0.881
aye+0.859
 Publication+0.825
Neg logit contributions
quartered-0.818
 DOT-0.784
Senator-0.727
 divides-0.691
idth-0.684
Token Russia
Token ID3284
Feature activation+4.25e-01

Pos logit contributions
burg+0.033
selves+0.032
lishing+0.030
aye+0.029
 Publication+0.029
Neg logit contributions
quartered-0.028
 DOT-0.027
Senator-0.025
 divides-0.024
idth-0.024
Token.
Token ID13
Feature activation-5.23e-03

Pos logit contributions
burg+0.319
selves+0.315
lishing+0.291
aye+0.282
 Publication+0.274
Neg logit contributions
quartered-0.323
 DOT-0.309
Senator-0.287
 divides-0.281
idth-0.275
Token
Token ID198
Feature activation-2.64e-02

Pos logit contributions
quartered+0.004
 DOT+0.003
Senator+0.003
 divides+0.003
idth+0.003
Neg logit contributions
burg-0.004
selves-0.004
lishing-0.004
aye-0.004
 Publication-0.004
Token
Token ID198
Feature activation-3.87e-02

Pos logit contributions
quartered+0.020
 DOT+0.018
 divides+0.017
Senator+0.017
idth+0.017
Neg logit contributions
burg-0.021
selves-0.020
lishing-0.019
aye-0.018
 Publication-0.018
TokenImage
Token ID5159
Feature activation-1.30e-01

Pos logit contributions
quartered+0.030
 DOT+0.029
 divides+0.026
Senator+0.026
idth+0.026
Neg logit contributions
burg-0.029
selves-0.028
lishing-0.026
aye-0.025
 Publication-0.025
Token copyright
Token ID6634
Feature activation-3.00e+00

Pos logit contributions
quartered+0.088
 DOT+0.084
Senator+0.077
 divides+0.075
idth+0.073
Neg logit contributions
burg-0.108
selves-0.107
lishing-0.101
aye-0.097
 Publication-0.095
Token AFP
Token ID8916
Feature activation-1.25e+00

Pos logit contributions
quartered+1.981
 DOT+1.937
Senator+1.766
 divides+1.726
 elector+1.641
Neg logit contributions
burg-2.495
selves-2.475
lishing-2.324
aye-2.242
 Publication-2.180
Token Image
Token ID7412
Feature activation-3.70e-01

Pos logit contributions
quartered+0.761
 DOT+0.733
Senator+0.669
 divides+0.654
idth+0.620
Neg logit contributions
burg-1.116
selves-1.114
lishing-1.040
aye-1.008
 Publication-0.982
Token caption
Token ID8305
Feature activation-3.06e+00

Pos logit contributions
quartered+0.279
 DOT+0.260
Senator+0.240
 divides+0.237
idth+0.233
Neg logit contributions
burg-0.286
selves-0.284
lishing-0.263
aye-0.253
 Publication-0.251
Token The
Token ID383
Feature activation+2.44e-01

Pos logit contributions
quartered+2.141
 DOT+2.094
Senator+1.937
 divides+1.890
idth+1.795
Neg logit contributions
burg-2.410
selves-2.390
lishing-2.245
aye-2.173
 Publication-2.087
Token Kurds
Token ID23880
Feature activation-5.69e-01

Pos logit contributions
burg+0.191
selves+0.189
lishing+0.176
aye+0.170
 Publication+0.167
Neg logit contributions
quartered-0.177
 DOT-0.168
Senator-0.157
 divides-0.152
idth-0.150
Token Off
Token ID3242
Feature activation+9.78e-01

Pos logit contributions
burg+0.153
selves+0.151
lishing+0.139
aye+0.134
 Publication+0.132
Neg logit contributions
quartered-0.154
 DOT-0.147
Senator-0.137
 divides-0.135
idth-0.130
Token On
Token ID1550
Feature activation+2.05e-01

Pos logit contributions
burg+0.743
selves+0.735
lishing+0.676
aye+0.653
 Publication+0.642
Neg logit contributions
quartered-0.738
 DOT-0.709
Senator-0.657
 divides-0.646
 elector-0.624
Token The
Token ID383
Feature activation+5.96e-01

Pos logit contributions
burg+0.157
selves+0.156
lishing+0.144
aye+0.139
 Publication+0.137
Neg logit contributions
quartered-0.153
 DOT-0.145
Senator-0.135
 divides-0.133
 elector-0.128
Token Train
Token ID16835
Feature activation-1.48e-01

Pos logit contributions
burg+0.455
selves+0.451
lishing+0.416
aye+0.400
 Publication+0.395
Neg logit contributions
quartered-0.450
 DOT-0.423
Senator-0.395
 divides-0.390
 elector-0.378
Token-
Token ID12
Feature activation+3.81e-01

Pos logit contributions
quartered+0.103
 DOT+0.099
Senator+0.092
 divides+0.090
idth+0.087
Neg logit contributions
burg-0.119
selves-0.118
lishing-0.110
aye-0.107
 Publication-0.104
TokenMeat
Token ID35620
Feature activation-2.97e+00

Pos logit contributions
burg+0.262
selves+0.259
lishing+0.238
aye+0.229
 Publication+0.223
Neg logit contributions
quartered-0.314
 DOT-0.302
Senator-0.280
 divides-0.278
idth-0.270
Tokenhead
Token ID2256
Feature activation-1.33e+00

Pos logit contributions
quartered+2.396
 DOT+2.331
Senator+2.160
 divides+2.130
idth+2.058
Neg logit contributions
selves-2.038
burg-2.025
lishing-1.888
aye-1.797
 Publication-1.753
Token-
Token ID12
Feature activation+3.69e-01

Pos logit contributions
quartered+1.029
 DOT+0.997
Senator+0.926
 divides+0.910
idth+0.880
Neg logit contributions
burg-0.958
selves-0.953
lishing-0.885
aye-0.854
 Publication-0.825
TokenThe
Token ID464
Feature activation+4.01e-01

Pos logit contributions
burg+0.253
selves+0.250
lishing+0.230
aye+0.221
 Publication+0.215
Neg logit contributions
quartered-0.305
 DOT-0.293
Senator-0.272
 divides-0.270
idth-0.261
Token War
Token ID1810
Feature activation+2.09e-01

Pos logit contributions
burg+0.304
selves+0.301
lishing+0.279
aye+0.269
 Publication+0.264
Neg logit contributions
quartered-0.302
 DOT-0.288
Senator-0.267
 divides-0.263
idth-0.254
Token-
Token ID12
Feature activation+4.53e-01

Pos logit contributions
burg+0.151
selves+0.149
lishing+0.138
aye+0.133
 Publication+0.130
Neg logit contributions
quartered-0.164
 DOT-0.158
Senator-0.147
 divides-0.145
idth-0.140
Token.
Token ID13
Feature activation+2.57e-01

Pos logit contributions
burg+0.259
selves+0.257
lishing+0.238
aye+0.230
 Publication+0.225
Neg logit contributions
quartered-0.242
 DOT-0.231
Senator-0.215
 divides-0.210
idth-0.203
Token
Token ID447
Feature activation+4.30e-01

Pos logit contributions
selves+0.195
burg+0.195
lishing+0.178
aye+0.172
 Publication+0.170
Neg logit contributions
quartered-0.196
 DOT-0.183
Senator-0.171
 divides-0.169
idth-0.163
Token
Token ID251
Feature activation+1.44e-01

Pos logit contributions
burg+0.217
selves+0.214
lishing+0.191
aye+0.181
 Publication+0.174
Neg logit contributions
quartered-0.429
 DOT-0.416
Senator-0.394
 divides-0.390
idth-0.380
Token<|endoftext|>
Token ID50256
Feature activation+6.11e-01

Pos logit contributions
burg+0.113
selves+0.112
lishing+0.104
aye+0.100
 Publication+0.098
Neg logit contributions
quartered-0.106
 DOT-0.100
Senator-0.093
 divides-0.091
idth-0.089
TokenImage
Token ID5159
Feature activation+7.51e-02

Pos logit contributions
burg+0.447
selves+0.439
lishing+0.408
aye+0.394
 Publication+0.381
Neg logit contributions
quartered-0.477
 DOT-0.459
 divides-0.423
Senator-0.418
idth-0.408
Token copyright
Token ID6634
Feature activation-2.96e+00

Pos logit contributions
burg+0.061
selves+0.061
lishing+0.057
aye+0.055
 Publication+0.054
Neg logit contributions
quartered-0.052
 DOT-0.050
Senator-0.045
 divides-0.045
idth-0.043
Token Deadline
Token ID35954
Feature activation+3.27e-01

Pos logit contributions
quartered+1.984
 DOT+1.929
Senator+1.761
 divides+1.725
idth+1.642
Neg logit contributions
burg-2.452
selves-2.431
lishing-2.278
aye-2.202
 Publication-2.142
Token
Token ID198
Feature activation+4.67e-01

Pos logit contributions
burg+0.268
selves+0.265
lishing+0.248
aye+0.240
 Publication+0.236
Neg logit contributions
quartered-0.225
 DOT-0.215
Senator-0.198
 divides-0.195
idth-0.188
Token
Token ID198
Feature activation+3.77e-01

Pos logit contributions
burg+0.369
selves+0.362
lishing+0.334
 Publication+0.319
aye+0.318
Neg logit contributions
quartered-0.355
 DOT-0.323
 divides-0.298
Senator-0.296
idth-0.296
TokenPolice
Token ID9039
Feature activation-8.96e-01

Pos logit contributions
burg+0.272
selves+0.268
lishing+0.249
aye+0.239
 Publication+0.233
Neg logit contributions
quartered-0.299
 DOT-0.287
 divides-0.264
Senator-0.263
idth-0.255
Token have
Token ID423
Feature activation+4.78e-01

Pos logit contributions
quartered+0.577
 DOT+0.552
Senator+0.504
 divides+0.495
idth+0.473
Neg logit contributions
burg-0.771
selves-0.766
lishing-0.718
aye-0.696
 Publication-0.681
Token frequently
Token ID6777
Feature activation-5.14e-01

Pos logit contributions
burg+0.593
selves+0.583
lishing+0.538
aye+0.521
 Publication+0.507
Neg logit contributions
quartered-0.637
 DOT-0.610
Senator-0.568
 divides-0.556
idth-0.541
Token comes
Token ID2058
Feature activation-4.11e-01

Pos logit contributions
quartered+0.332
 DOT+0.316
Senator+0.290
 divides+0.283
idth+0.273
Neg logit contributions
burg-0.442
selves-0.439
lishing-0.411
aye-0.398
 Publication-0.390
Token off
Token ID572
Feature activation+3.50e-01

Pos logit contributions
quartered+0.287
 DOT+0.275
Senator+0.253
 divides+0.248
idth+0.240
Neg logit contributions
burg-0.332
selves-0.330
lishing-0.307
aye-0.298
 Publication-0.290
Token as
Token ID355
Feature activation-2.20e-01

Pos logit contributions
burg+0.275
selves+0.274
lishing+0.254
aye+0.246
 Publication+0.240
Neg logit contributions
quartered-0.253
 DOT-0.242
Senator-0.224
 divides-0.220
idth-0.213
Token sn
Token ID3013
Feature activation-1.49e+00

Pos logit contributions
quartered+0.161
 DOT+0.154
Senator+0.143
 divides+0.140
idth+0.136
Neg logit contributions
burg-0.170
selves-0.169
lishing-0.157
aye-0.151
 Publication-0.148
Tokenobb
Token ID21963
Feature activation-2.93e+00

Pos logit contributions
quartered+1.035
 DOT+0.995
Senator+0.916
 divides+0.896
idth+0.872
Neg logit contributions
burg-1.203
selves-1.199
lishing-1.114
aye-1.076
 Publication-1.060
Tokenish
Token ID680
Feature activation-8.16e-01

Pos logit contributions
quartered+2.236
 DOT+2.140
Senator+1.986
 divides+1.953
 elector+1.881
Neg logit contributions
burg-2.194
selves-2.185
lishing-2.026
aye-1.956
 Publication-1.897
Token and
Token ID290
Feature activation+1.47e-01

Pos logit contributions
quartered+0.599
 DOT+0.574
Senator+0.531
 divides+0.522
idth+0.505
Neg logit contributions
burg-0.632
selves-0.627
lishing-0.582
aye-0.562
 Publication-0.549
Token insult
Token ID13277
Feature activation+5.37e-01

Pos logit contributions
burg+0.107
selves+0.106
lishing+0.097
aye+0.094
 Publication+0.092
Neg logit contributions
quartered-0.115
 DOT-0.110
Senator-0.102
 divides-0.100
idth-0.098
Tokeningly
Token ID4420
Feature activation-6.46e-01

Pos logit contributions
burg+0.408
selves+0.399
lishing+0.372
aye+0.359
 Publication+0.350
Neg logit contributions
quartered-0.408
 DOT-0.385
Senator-0.359
 divides-0.351
idth-0.347
Token dismissive
Token ID49010
Feature activation-2.70e-01

Pos logit contributions
quartered+0.512
 DOT+0.493
Senator+0.459
 divides+0.452
idth+0.438
Neg logit contributions
burg-0.461
selves-0.457
lishing-0.422
aye-0.406
 Publication-0.396
Token Are
Token ID4231
Feature activation-8.72e-01

Pos logit contributions
quartered+0.463
 DOT+0.438
Senator+0.400
 divides+0.387
idth+0.382
Neg logit contributions
burg-0.621
selves-0.616
lishing-0.570
aye-0.556
 Publication-0.544
Token More
Token ID3125
Feature activation-5.41e-01

Pos logit contributions
quartered+0.649
 DOT+0.621
Senator+0.576
 divides+0.566
idth+0.549
Neg logit contributions
burg-0.666
selves-0.663
lishing-0.613
aye-0.591
 Publication-0.579
Token Similar
Token ID11014
Feature activation-8.09e-01

Pos logit contributions
quartered+0.389
 DOT+0.370
Senator+0.342
 divides+0.335
idth+0.328
Neg logit contributions
burg-0.429
selves-0.426
lishing-0.396
aye-0.383
 Publication-0.375
Token Than
Token ID17924
Feature activation-1.37e+00

Pos logit contributions
quartered+0.552
 DOT+0.525
Senator+0.483
 divides+0.472
idth+0.460
Neg logit contributions
selves-0.676
burg-0.669
lishing-0.618
aye-0.599
 Publication-0.590
Token You
Token ID921
Feature activation-8.17e-01

Pos logit contributions
quartered+0.954
 DOT+0.903
Senator+0.832
 divides+0.808
idth+0.795
Neg logit contributions
burg-1.128
selves-1.127
lishing-1.034
aye-1.007
 Publication-0.997
Token Think
Token ID11382
Feature activation-2.93e+00

Pos logit contributions
quartered+0.495
 DOT+0.459
Senator+0.417
 divides+0.409
idth+0.396
Neg logit contributions
selves-0.749
burg-0.749
lishing-0.691
 Publication-0.671
aye-0.670
Token
Token ID198
Feature activation-4.40e-02

Pos logit contributions
quartered+2.224
 DOT+2.145
Senator+1.992
 divides+1.958
idth+1.886
Neg logit contributions
burg-2.179
selves-2.157
lishing-2.010
aye-1.934
 Publication-1.883
Token
Token ID198
Feature activation-7.90e-03

Pos logit contributions
quartered+0.033
 DOT+0.031
 divides+0.028
idth+0.028
Senator+0.027
Neg logit contributions
burg-0.035
selves-0.034
lishing-0.032
aye-0.030
 Publication-0.030
TokenThe
Token ID464
Feature activation-1.51e-01

Pos logit contributions
quartered+0.006
 DOT+0.006
 divides+0.005
Senator+0.005
idth+0.005
Neg logit contributions
burg-0.006
selves-0.006
lishing-0.005
aye-0.005
 Publication-0.005
Token biggest
Token ID4094
Feature activation-8.49e-01

Pos logit contributions
quartered+0.099
 DOT+0.094
Senator+0.086
 divides+0.083
idth+0.082
Neg logit contributions
burg-0.128
selves-0.128
lishing-0.120
aye-0.116
 Publication-0.114
Token difference
Token ID3580
Feature activation-5.11e-01

Pos logit contributions
quartered+0.535
 DOT+0.505
Senator+0.463
 divides+0.443
idth+0.439
Neg logit contributions
burg-0.745
selves-0.741
lishing-0.695
aye-0.672
 Publication-0.663
Token:
Token ID25
Feature activation-1.04e-01

Pos logit contributions
quartered+0.258
 DOT+0.246
Senator+0.226
 divides+0.222
idth+0.214
Neg logit contributions
burg-0.318
selves-0.315
lishing-0.295
aye-0.286
 Publication-0.279
Token Let
Token ID3914
Feature activation-2.30e-01

Pos logit contributions
quartered+0.074
 DOT+0.070
Senator+0.065
 divides+0.063
idth+0.061
Neg logit contributions
burg-0.084
selves-0.083
lishing-0.077
aye-0.075
 Publication-0.073
Token
Token ID447
Feature activation-9.87e-01

Pos logit contributions
quartered+0.165
 DOT+0.157
Senator+0.146
 divides+0.142
idth+0.138
Neg logit contributions
burg-0.182
selves-0.182
lishing-0.168
aye-0.163
 Publication-0.159
Token
Token ID247
Feature activation-6.47e-01

Pos logit contributions
quartered+0.976
 DOT+0.948
Senator+0.899
 divides+0.888
idth+0.864
Neg logit contributions
burg-0.502
selves-0.499
lishing-0.448
aye-0.421
 Publication-0.407
Tokens
Token ID82
Feature activation-5.50e-01

Pos logit contributions
quartered+0.604
 DOT+0.585
Senator+0.551
 divides+0.544
idth+0.529
Neg logit contributions
burg-0.370
selves-0.365
lishing-0.332
aye-0.316
 Publication-0.306
Token bring
Token ID2222
Feature activation-2.92e+00

Pos logit contributions
quartered+0.358
 DOT+0.340
Senator+0.312
 divides+0.305
idth+0.294
Neg logit contributions
burg-0.472
selves-0.469
lishing-0.438
aye-0.425
 Publication-0.416
Token in
Token ID287
Feature activation-3.02e-01

Pos logit contributions
quartered+1.979
 DOT+1.931
Senator+1.759
 divides+1.749
 elector+1.667
Neg logit contributions
burg-2.369
selves-2.346
lishing-2.216
aye-2.137
 Publication-2.083
Token good
Token ID922
Feature activation+2.16e-01

Pos logit contributions
quartered+0.208
 DOT+0.198
Senator+0.183
 divides+0.179
idth+0.173
Neg logit contributions
burg-0.247
selves-0.245
lishing-0.229
aye-0.221
 Publication-0.216
Token quality
Token ID3081
Feature activation+4.75e-01

Pos logit contributions
burg+0.170
selves+0.168
lishing+0.155
aye+0.150
 Publication+0.146
Neg logit contributions
quartered-0.158
 DOT-0.150
Senator-0.140
 divides-0.136
idth-0.132
Token touring
Token ID25583
Feature activation-4.94e-01

Pos logit contributions
burg+0.365
selves+0.361
lishing+0.332
aye+0.320
 Publication+0.313
Neg logit contributions
quartered-0.354
 DOT-0.336
Senator-0.314
 divides-0.305
idth-0.297
Token talent
Token ID7401
Feature activation-1.73e-01

Pos logit contributions
quartered+0.391
 DOT+0.373
Senator+0.350
 divides+0.341
idth+0.332
Neg logit contributions
burg-0.357
selves-0.353
lishing-0.325
aye-0.313
 Publication-0.305
Token appeared
Token ID4120
Feature activation-6.14e-01

Pos logit contributions
quartered+0.062
 DOT+0.059
Senator+0.054
 divides+0.053
idth+0.052
Neg logit contributions
burg-0.077
selves-0.076
lishing-0.071
aye-0.069
 Publication-0.068
Token in
Token ID287
Feature activation+4.80e-02

Pos logit contributions
quartered+0.434
 DOT+0.414
Senator+0.381
 divides+0.374
idth+0.363
Neg logit contributions
burg-0.493
selves-0.489
lishing-0.455
aye-0.440
 Publication-0.428
Token the
Token ID262
Feature activation-9.82e-02

Pos logit contributions
burg+0.036
selves+0.035
lishing+0.033
aye+0.031
 Publication+0.031
Neg logit contributions
quartered-0.037
 DOT-0.035
Senator-0.033
 divides-0.032
idth-0.031
Token Melbourne
Token ID14819
Feature activation-8.66e-01

Pos logit contributions
quartered+0.072
 DOT+0.068
Senator+0.064
 divides+0.063
idth+0.061
Neg logit contributions
burg-0.076
selves-0.075
lishing-0.070
aye-0.068
 Publication-0.066
Token Mag
Token ID2944
Feature activation-2.48e-01

Pos logit contributions
quartered+0.695
 DOT+0.668
Senator+0.621
 divides+0.616
idth+0.599
Neg logit contributions
burg-0.616
selves-0.601
lishing-0.554
aye-0.534
 Publication-0.523
Tokenistrates
Token ID37909
Feature activation-2.92e+00

Pos logit contributions
quartered+0.148
 DOT+0.141
Senator+0.128
 divides+0.125
idth+0.119
Neg logit contributions
burg-0.224
selves-0.223
lishing-0.210
aye-0.204
 Publication-0.200
Token Court
Token ID3078
Feature activation-8.79e-01

Pos logit contributions
quartered+1.842
 DOT+1.764
Senator+1.633
 divides+1.585
 elector+1.529
Neg logit contributions
burg-2.488
selves-2.484
lishing-2.353
aye-2.269
 Publication-2.206
Token this
Token ID428
Feature activation-9.34e-02

Pos logit contributions
quartered+0.642
 DOT+0.616
Senator+0.570
 divides+0.559
idth+0.543
Neg logit contributions
burg-0.683
selves-0.677
lishing-0.630
aye-0.609
 Publication-0.594
Token afternoon
Token ID6672
Feature activation-3.35e-01

Pos logit contributions
quartered+0.079
 DOT+0.076
Senator+0.071
 divides+0.070
idth+0.068
Neg logit contributions
burg-0.062
selves-0.061
lishing-0.057
aye-0.054
 Publication-0.053
Token,
Token ID11
Feature activation-4.21e-01

Pos logit contributions
quartered+0.246
 DOT+0.234
Senator+0.218
 divides+0.214
idth+0.208
Neg logit contributions
burg-0.260
selves-0.258
lishing-0.239
aye-0.231
 Publication-0.225
Token where
Token ID810
Feature activation-3.17e-01

Pos logit contributions
quartered+0.295
 DOT+0.278
Senator+0.257
 divides+0.251
idth+0.247
Neg logit contributions
burg-0.343
selves-0.338
lishing-0.316
aye-0.306
 Publication-0.301
Token Sciences
Token ID13473
Feature activation-5.79e-01

Pos logit contributions
burg+0.201
selves+0.200
lishing+0.186
aye+0.179
 Publication+0.178
Neg logit contributions
quartered-0.173
 DOT-0.163
Senator-0.150
 divides-0.147
idth-0.144
Token in
Token ID287
Feature activation+2.91e-01

Pos logit contributions
quartered+0.392
 DOT+0.374
Senator+0.344
 divides+0.334
idth+0.326
Neg logit contributions
burg-0.483
selves-0.477
lishing-0.447
aye-0.432
 Publication-0.426
Token Memory
Token ID14059
Feature activation-4.89e-01

Pos logit contributions
burg+0.211
selves+0.209
lishing+0.193
aye+0.185
 Publication+0.183
Neg logit contributions
quartered-0.229
 DOT-0.219
Senator-0.205
 divides-0.201
idth-0.196
Token of
Token ID286
Feature activation+1.67e-01

Pos logit contributions
quartered+0.310
 DOT+0.296
Senator+0.271
 divides+0.267
 elector+0.253
Neg logit contributions
selves-0.424
burg-0.423
lishing-0.396
aye-0.385
 Publication-0.374
Token Alfred
Token ID22044
Feature activation-1.14e+00

Pos logit contributions
burg+0.128
selves+0.127
lishing+0.118
aye+0.114
 Publication+0.112
Neg logit contributions
quartered-0.124
 DOT-0.118
Senator-0.110
 divides-0.108
idth-0.105
Token Nobel
Token ID20715
Feature activation-2.88e+00

Pos logit contributions
quartered+0.899
 DOT+0.865
Senator+0.805
 divides+0.792
idth+0.766
Neg logit contributions
burg-0.822
selves-0.815
lishing-0.754
aye-0.725
 Publication-0.707
Token 2017
Token ID2177
Feature activation+6.94e-01

Pos logit contributions
quartered+1.965
 DOT+1.878
Senator+1.727
 divides+1.710
 elector+1.631
Neg logit contributions
burg-2.357
selves-2.356
lishing-2.197
aye-2.122
 Publication-2.071
Token was
Token ID373
Feature activation-8.64e-02

Pos logit contributions
burg+0.563
selves+0.553
lishing+0.517
aye+0.501
 Publication+0.497
Neg logit contributions
quartered-0.486
 DOT-0.463
Senator-0.427
 divides-0.416
idth-0.408
Token awarded
Token ID11343
Feature activation-9.54e-01

Pos logit contributions
quartered+0.050
 DOT+0.047
Senator+0.043
 divides+0.042
idth+0.040
Neg logit contributions
burg-0.080
selves-0.080
lishing-0.075
aye-0.073
 Publication-0.072
Token to
Token ID284
Feature activation+3.05e-01

Pos logit contributions
quartered+0.620
 DOT+0.593
Senator+0.543
 divides+0.532
idth+0.512
Neg logit contributions
burg-0.815
selves-0.810
lishing-0.758
aye-0.735
 Publication-0.720
Token Richard
Token ID6219
Feature activation+3.91e-01

Pos logit contributions
burg+0.247
selves+0.244
lishing+0.228
aye+0.220
 Publication+0.218
Neg logit contributions
quartered-0.213
 DOT-0.203
Senator-0.188
 divides-0.184
idth-0.178
Token think
Token ID892
Feature activation-1.11e+00

Pos logit contributions
quartered+0.062
 DOT+0.059
Senator+0.053
 divides+0.052
idth+0.050
Neg logit contributions
burg-0.091
selves-0.090
lishing-0.084
aye-0.082
 Publication-0.080
Token today
Token ID1909
Feature activation-4.39e-02

Pos logit contributions
quartered+0.785
 DOT+0.747
Senator+0.693
 divides+0.678
idth+0.656
Neg logit contributions
burg-0.892
selves-0.886
lishing-0.824
aye-0.799
 Publication-0.779
Token the
Token ID262
Feature activation+2.51e-01

Pos logit contributions
quartered+0.030
 DOT+0.029
Senator+0.026
 divides+0.026
idth+0.025
Neg logit contributions
burg-0.036
selves-0.036
lishing-0.033
aye-0.033
 Publication-0.032
Token team
Token ID1074
Feature activation+3.38e-01

Pos logit contributions
burg+0.193
selves+0.192
lishing+0.178
aye+0.172
 Publication+0.168
Neg logit contributions
quartered-0.186
 DOT-0.176
Senator-0.165
 divides-0.161
idth-0.157
Token has
Token ID468
Feature activation+2.00e-01

Pos logit contributions
burg+0.294
selves+0.292
lishing+0.273
aye+0.265
 Publication+0.259
Neg logit contributions
quartered-0.216
 DOT-0.205
Senator-0.188
 divides-0.183
idth-0.177
Token found
Token ID1043
Feature activation+1.92e+00

Pos logit contributions
burg+0.164
selves+0.164
lishing+0.152
aye+0.147
 Publication+0.144
Neg logit contributions
quartered-0.137
 DOT-0.131
Senator-0.121
 divides-0.118
idth-0.114
Token a
Token ID257
Feature activation+3.55e-01

Pos logit contributions
selves+1.556
burg+1.549
lishing+1.426
aye+1.387
 Publication+1.356
Neg logit contributions
quartered-1.355
 DOT-1.264
Senator-1.186
 divides-1.137
idth-1.119
Token lot
Token ID1256
Feature activation-8.42e-01

Pos logit contributions
burg+0.249
selves+0.247
lishing+0.228
aye+0.219
 Publication+0.213
Neg logit contributions
quartered-0.286
 DOT-0.274
Senator-0.256
 divides-0.251
idth-0.245
Token of
Token ID286
Feature activation+3.85e-01

Pos logit contributions
quartered+0.563
 DOT+0.538
Senator+0.494
 divides+0.485
idth+0.466
Neg logit contributions
burg-0.705
selves-0.700
lishing-0.655
aye-0.634
 Publication-0.620
Token the
Token ID262
Feature activation+4.31e-01

Pos logit contributions
burg+0.311
selves+0.309
lishing+0.287
aye+0.277
 Publication+0.271
Neg logit contributions
quartered-0.272
 DOT-0.256
Senator-0.239
 divides-0.231
idth-0.227
Token weaknesses
Token ID20256
Feature activation-8.33e-01

Pos logit contributions
burg+0.357
selves+0.355
lishing+0.331
aye+0.320
 Publication+0.312
Neg logit contributions
quartered-0.295
 DOT-0.278
Senator-0.258
 divides-0.250
idth-0.244
Tokenare
Token ID533
Feature activation-8.85e-01

Pos logit contributions
quartered+0.133
 DOT+0.127
Senator+0.115
 divides+0.113
idth+0.109
Neg logit contributions
burg-0.182
selves-0.181
lishing-0.169
aye-0.165
 Publication-0.160
Token vying
Token ID48067
Feature activation-3.16e-02

Pos logit contributions
quartered+0.620
 DOT+0.594
Senator+0.548
 divides+0.538
idth+0.517
Neg logit contributions
burg-0.712
selves-0.707
lishing-0.659
aye-0.637
 Publication-0.623
Token desperately
Token ID16459
Feature activation-4.29e-01

Pos logit contributions
quartered+0.021
 DOT+0.020
Senator+0.019
 divides+0.018
idth+0.018
Neg logit contributions
burg-0.026
selves-0.026
lishing-0.024
aye-0.024
 Publication-0.023
Token for
Token ID329
Feature activation+1.27e-01

Pos logit contributions
quartered+0.276
 DOT+0.264
Senator+0.242
 divides+0.239
 elector+0.228
Neg logit contributions
burg-0.367
selves-0.364
lishing-0.343
aye-0.332
 Publication-0.326
Token the
Token ID262
Feature activation-1.67e-01

Pos logit contributions
burg+0.091
selves+0.091
lishing+0.083
aye+0.080
 Publication+0.078
Neg logit contributions
quartered-0.100
 DOT-0.095
Senator-0.089
 divides-0.086
idth-0.085
Token millennial
Token ID46451
Feature activation+2.21e+00

Pos logit contributions
quartered+0.127
 DOT+0.121
Senator+0.112
 divides+0.110
idth+0.107
Neg logit contributions
burg-0.126
selves-0.125
lishing-0.116
aye-0.112
 Publication-0.109
Token vote
Token ID3015
Feature activation-3.83e-01

Pos logit contributions
burg+2.279
selves+2.248
lishing+2.102
aye+2.069
 Publication+2.030
Neg logit contributions
quartered-1.067
 DOT-0.975
Senator-0.859
 divides-0.797
idth-0.786
Token,
Token ID11
Feature activation-1.65e-01

Pos logit contributions
quartered+0.297
 DOT+0.285
Senator+0.266
 divides+0.261
idth+0.253
Neg logit contributions
burg-0.280
selves-0.278
lishing-0.257
aye-0.248
 Publication-0.241
Token and
Token ID290
Feature activation-1.79e-01

Pos logit contributions
quartered+0.118
 DOT+0.113
Senator+0.104
 divides+0.101
idth+0.099
Neg logit contributions
burg-0.131
selves-0.130
lishing-0.121
aye-0.117
 Publication-0.114
Token with
Token ID351
Feature activation-3.16e-02

Pos logit contributions
quartered+0.127
 DOT+0.121
Senator+0.112
 divides+0.109
idth+0.107
Neg logit contributions
burg-0.143
selves-0.142
lishing-0.131
aye-0.128
 Publication-0.124
Token good
Token ID922
Feature activation+3.21e-01

Pos logit contributions
quartered+0.022
 DOT+0.021
Senator+0.019
 divides+0.019
idth+0.018
Neg logit contributions
burg-0.026
selves-0.025
lishing-0.024
aye-0.023
 Publication-0.022
Token teamed
Token ID31230
Feature activation-8.97e-01

Pos logit contributions
burg+0.076
selves+0.075
lishing+0.070
aye+0.068
 Publication+0.066
Neg logit contributions
quartered-0.067
 DOT-0.064
Senator-0.059
 divides-0.058
idth-0.056
Token with
Token ID351
Feature activation+1.05e+00

Pos logit contributions
quartered+0.794
 DOT+0.768
Senator+0.719
 divides+0.711
idth+0.689
Neg logit contributions
burg-0.557
selves-0.552
lishing-0.503
aye-0.481
 Publication-0.466
Token the
Token ID262
Feature activation+4.49e-01

Pos logit contributions
burg+0.880
selves+0.870
lishing+0.816
aye+0.790
 Publication+0.780
Neg logit contributions
quartered-0.706
 DOT-0.662
Senator-0.622
 divides-0.601
idth-0.584
Token producer
Token ID9920
Feature activation+1.97e-01

Pos logit contributions
burg+0.366
selves+0.363
lishing+0.339
aye+0.327
 Publication+0.322
Neg logit contributions
quartered-0.311
 DOT-0.295
Senator-0.275
 divides-0.268
idth-0.260
Token of
Token ID286
Feature activation+7.03e-01

Pos logit contributions
burg+0.157
selves+0.155
lishing+0.144
aye+0.139
 Publication+0.136
Neg logit contributions
quartered-0.141
 DOT-0.135
Senator-0.126
 divides-0.123
 elector-0.118
Token Alice
Token ID14862
Feature activation+2.07e+00

Pos logit contributions
burg+0.556
selves+0.550
lishing+0.513
aye+0.497
 Publication+0.486
Neg logit contributions
quartered-0.503
 DOT-0.480
Senator-0.446
 divides-0.436
idth-0.421
Token,
Token ID11
Feature activation+3.14e-01

Pos logit contributions
burg+1.674
selves+1.643
lishing+1.532
aye+1.484
 Publication+1.462
Neg logit contributions
quartered-1.450
 DOT-1.380
Senator-1.285
 divides-1.257
 elector-1.204
Token long
Token ID890
Feature activation+6.12e-01

Pos logit contributions
burg+0.246
selves+0.244
lishing+0.226
aye+0.219
 Publication+0.215
Neg logit contributions
quartered-0.227
 DOT-0.216
Senator-0.201
 divides-0.197
idth-0.191
Tokenform
Token ID687
Feature activation-1.41e-01

Pos logit contributions
burg+0.479
selves+0.472
lishing+0.442
aye+0.424
 Publication+0.417
Neg logit contributions
quartered-0.443
 DOT-0.424
Senator-0.392
 divides-0.385
idth-0.375
Token veteran
Token ID9298
Feature activation+5.55e-01

Pos logit contributions
quartered+0.096
 DOT+0.092
Senator+0.085
 divides+0.083
idth+0.080
Neg logit contributions
burg-0.116
selves-0.115
lishing-0.108
aye-0.103
 Publication-0.102
Token Robert
Token ID5199
Feature activation+6.67e-01

Pos logit contributions
burg+0.437
selves+0.432
lishing+0.402
aye+0.389
 Publication+0.382
Neg logit contributions
quartered-0.401
 DOT-0.381
Senator-0.356
 divides-0.346
idth-0.337
TokenReviewer
Token ID35407
Feature activation-3.23e-01

Pos logit contributions
burg+0.326
selves+0.322
lishing+0.302
aye+0.294
 Publication+0.285
Neg logit contributions
quartered-0.250
 DOT-0.239
Senator-0.216
 divides-0.215
 elector-0.206
Token:
Token ID25
Feature activation+2.52e-01

Pos logit contributions
quartered+0.257
 DOT+0.241
Senator+0.224
 divides+0.222
idth+0.216
Neg logit contributions
burg-0.238
selves-0.236
lishing-0.216
aye-0.207
 Publication-0.203
Token Mike
Token ID4995
Feature activation+6.01e-01

Pos logit contributions
burg+0.184
selves+0.183
lishing+0.169
aye+0.163
 Publication+0.159
Neg logit contributions
quartered-0.196
 DOT-0.187
Senator-0.175
 divides-0.172
idth-0.167
TokenM
Token ID44
Feature activation-3.84e-01

Pos logit contributions
burg+0.421
selves+0.415
lishing+0.382
aye+0.368
 Publication+0.359
Neg logit contributions
quartered-0.487
 DOT-0.466
Senator-0.435
 divides-0.430
idth-0.417
Tokenanners
Token ID15672
Feature activation+7.23e-01

Pos logit contributions
quartered+0.253
 DOT+0.240
Senator+0.219
 divides+0.216
 elector+0.207
Neg logit contributions
burg-0.327
selves-0.325
lishing-0.303
aye-0.294
 Publication-0.287
Token -
Token ID532
Feature activation+1.79e+00

Pos logit contributions
burg+0.641
selves+0.634
lishing+0.595
aye+0.578
 Publication+0.567
Neg logit contributions
quartered-0.450
 DOT-0.427
Senator-0.388
 divides-0.383
idth-0.367
Token favorite
Token ID4004
Feature activation-3.13e-01

Pos logit contributions
burg+1.871
selves+1.837
lishing+1.746
aye+1.704
 Publication+1.693
Neg logit contributions
quartered-0.866
 DOT-0.762
Senator-0.681
 divides-0.660
idth-0.655
Token favorite
Token ID4004
Feature activation-2.83e-01

Pos logit contributions
quartered+0.142
 DOT+0.125
Senator+0.112
 divides+0.107
idth+0.104
Neg logit contributions
burg-0.334
selves-0.330
lishing-0.313
aye-0.306
 Publication-0.302
Token favorite
Token ID4004
Feature activation-1.68e-01

Pos logit contributions
quartered+0.129
 DOT+0.113
Senator+0.101
 divides+0.097
idth+0.094
Neg logit contributions
burg-0.302
selves-0.299
lishing-0.283
aye-0.276
 Publication-0.274
Token favorite
Token ID4004
Feature activation-4.25e-02

Pos logit contributions
quartered+0.078
 DOT+0.068
Senator+0.061
 divides+0.058
idth+0.057
Neg logit contributions
burg-0.178
selves-0.176
lishing-0.166
aye-0.163
 Publication-0.161
Token favorite
Token ID4004
Feature activation+7.12e-02

Pos logit contributions
quartered+0.019
 DOT+0.017
Senator+0.015
 divides+0.015
idth+0.014
Neg logit contributions
burg-0.045
selves-0.045
lishing-0.043
aye-0.042
 Publication-0.041
Token masters
Token ID18159
Feature activation+1.01e+00

Pos logit contributions
quartered+0.599
 DOT+0.573
Senator+0.531
 divides+0.519
idth+0.507
Neg logit contributions
burg-0.633
selves-0.627
lishing-0.580
aye-0.561
 Publication-0.549
Token.
Token ID13
Feature activation+6.39e-02

Pos logit contributions
burg+0.739
selves+0.729
lishing+0.676
aye+0.648
 Publication+0.631
Neg logit contributions
quartered-0.780
 DOT-0.751
Senator-0.701
 divides-0.683
idth-0.670
Token New
Token ID968
Feature activation+3.02e-01

Pos logit contributions
burg+0.052
selves+0.052
lishing+0.048
 Publication+0.046
aye+0.046
Neg logit contributions
quartered-0.045
 DOT-0.042
Senator-0.039
 divides-0.038
idth-0.038
Token research
Token ID2267
Feature activation-2.42e-01

Pos logit contributions
burg+0.260
selves+0.256
lishing+0.241
aye+0.232
 Publication+0.228
Neg logit contributions
quartered-0.198
 DOT-0.187
Senator-0.172
 divides-0.167
idth-0.163
Token has
Token ID468
Feature activation+6.14e-03

Pos logit contributions
quartered+0.165
 DOT+0.156
Senator+0.145
 divides+0.139
idth+0.138
Neg logit contributions
burg-0.201
selves-0.199
lishing-0.186
aye-0.180
 Publication-0.176
Token gone
Token ID3750
Feature activation+1.85e+00

Pos logit contributions
burg+0.005
selves+0.005
lishing+0.005
aye+0.005
 Publication+0.004
Neg logit contributions
quartered-0.004
 DOT-0.004
Senator-0.004
 divides-0.004
idth-0.003
Token further
Token ID2252
Feature activation+4.68e-01

Pos logit contributions
burg+1.424
selves+1.404
lishing+1.305
aye+1.271
 Publication+1.222
Neg logit contributions
quartered-1.369
 DOT-1.304
Senator-1.214
 divides-1.182
idth-1.154
Token,
Token ID11
Feature activation+3.65e-02

Pos logit contributions
burg+0.350
selves+0.347
lishing+0.321
aye+0.312
 Publication+0.300
Neg logit contributions
quartered-0.356
 DOT-0.341
Senator-0.317
 divides-0.305
idth-0.303
Token exposing
Token ID21294
Feature activation+1.07e-01

Pos logit contributions
burg+0.030
selves+0.030
lishing+0.028
aye+0.027
 Publication+0.026
Neg logit contributions
quartered-0.025
 DOT-0.024
Senator-0.022
 divides-0.021
idth-0.021
Token how
Token ID703
Feature activation+3.16e-03

Pos logit contributions
burg+0.087
selves+0.086
lishing+0.080
aye+0.077
 Publication+0.076
Neg logit contributions
quartered-0.075
 DOT-0.070
Senator-0.065
idth-0.062
 divides-0.062
Token American
Token ID1605
Feature activation+9.43e-02

Pos logit contributions
burg+0.003
selves+0.003
lishing+0.002
aye+0.002
 Publication+0.002
Neg logit contributions
quartered-0.002
 DOT-0.002
Senator-0.002
 divides-0.002
idth-0.002
Token Rica
Token ID31656
Feature activation-1.11e-01

Pos logit contributions
burg+0.399
selves+0.389
lishing+0.350
aye+0.336
 Publication+0.324
Neg logit contributions
quartered-0.683
 DOT-0.657
Senator-0.621
 divides-0.611
idth-0.598
Token demonstrating
Token ID21135
Feature activation+7.66e-02

Pos logit contributions
quartered+0.079
 DOT+0.075
Senator+0.070
 divides+0.068
idth+0.066
Neg logit contributions
burg-0.089
selves-0.089
lishing-0.082
aye-0.080
 Publication-0.078
Token the
Token ID262
Feature activation+1.62e-02

Pos logit contributions
burg+0.062
selves+0.062
lishing+0.057
aye+0.055
 Publication+0.054
Neg logit contributions
quartered-0.054
 DOT-0.051
Senator-0.048
 divides-0.046
idth-0.045
Token value
Token ID1988
Feature activation+1.80e-01

Pos logit contributions
burg+0.013
selves+0.013
lishing+0.012
aye+0.012
 Publication+0.011
Neg logit contributions
quartered-0.011
 DOT-0.011
Senator-0.010
 divides-0.010
idth-0.010
Token of
Token ID286
Feature activation+2.05e-02

Pos logit contributions
burg+0.153
selves+0.152
lishing+0.142
aye+0.138
 Publication+0.134
Neg logit contributions
quartered-0.119
 DOT-0.113
Senator-0.104
 divides-0.101
idth-0.098
Token playing
Token ID2712
Feature activation+1.97e-01

Pos logit contributions
burg+0.017
selves+0.017
lishing+0.015
aye+0.015
 Publication+0.015
Neg logit contributions
quartered-0.014
 DOT-0.014
Senator-0.013
 divides-0.012
idth-0.012
Token this
Token ID428
Feature activation+1.34e-01

Pos logit contributions
burg+0.158
selves+0.157
lishing+0.145
aye+0.141
 Publication+0.138
Neg logit contributions
quartered-0.140
 DOT-0.133
Senator-0.124
 divides-0.120
idth-0.118
Token way
Token ID835
Feature activation-9.89e-02

Pos logit contributions
burg+0.091
selves+0.091
lishing+0.082
aye+0.079
 Publication+0.077
Neg logit contributions
quartered-0.111
 DOT-0.106
Senator-0.100
 divides-0.097
idth-0.096
Token.
Token ID13
Feature activation-3.20e-02

Pos logit contributions
quartered+0.076
 DOT+0.073
Senator+0.068
 divides+0.066
idth+0.064
Neg logit contributions
burg-0.073
selves-0.073
lishing-0.067
aye-0.065
 Publication-0.063
Token Their
Token ID5334
Feature activation-8.45e-02

Pos logit contributions
quartered+0.023
 DOT+0.022
Senator+0.020
 divides+0.020
idth+0.019
Neg logit contributions
burg-0.025
selves-0.025
lishing-0.023
aye-0.022
 Publication-0.022
Token success
Token ID1943
Feature activation-3.80e-01

Pos logit contributions
quartered+0.060
 DOT+0.057
Senator+0.053
 divides+0.052
idth+0.050
Neg logit contributions
burg-0.067
selves-0.067
lishing-0.062
aye-0.060
 Publication-0.059
Token end
Token ID886
Feature activation-2.47e-01

Pos logit contributions
burg+0.020
selves+0.019
lishing+0.018
aye+0.018
 Publication+0.017
Neg logit contributions
quartered-0.015
 DOT-0.014
Senator-0.013
 divides-0.013
idth-0.012
Token up
Token ID510
Feature activation-4.39e-01

Pos logit contributions
quartered+0.227
 DOT+0.219
Senator+0.206
 divides+0.204
idth+0.198
Neg logit contributions
burg-0.146
selves-0.144
lishing-0.131
aye-0.125
 Publication-0.121
Token with
Token ID351
Feature activation-1.04e-01

Pos logit contributions
quartered+0.329
 DOT+0.316
Senator+0.293
 divides+0.286
idth+0.279
Neg logit contributions
burg-0.334
selves-0.331
lishing-0.307
aye-0.295
 Publication-0.288
Token is
Token ID318
Feature activation-2.16e-01

Pos logit contributions
quartered+0.072
 DOT+0.068
Senator+0.063
 divides+0.061
idth+0.060
Neg logit contributions
burg-0.085
selves-0.085
lishing-0.079
aye-0.077
 Publication-0.074
Token a
Token ID257
Feature activation-2.72e-01

Pos logit contributions
quartered+0.142
 DOT+0.135
Senator+0.124
 divides+0.121
idth+0.117
Neg logit contributions
burg-0.184
selves-0.183
lishing-0.171
aye-0.166
 Publication-0.162
Token strange
Token ID6283
Feature activation+4.85e-01

Pos logit contributions
quartered+0.200
 DOT+0.192
Senator+0.177
 divides+0.174
idth+0.168
Neg logit contributions
burg-0.210
selves-0.209
lishing-0.194
aye-0.188
 Publication-0.183
Token,
Token ID11
Feature activation-1.19e-01

Pos logit contributions
burg+0.376
selves+0.373
lishing+0.347
aye+0.336
 Publication+0.327
Neg logit contributions
quartered-0.354
 DOT-0.338
Senator-0.314
 divides-0.307
idth-0.299
Token national
Token ID2260
Feature activation+1.58e-01

Pos logit contributions
quartered+0.089
 DOT+0.085
Senator+0.079
 divides+0.077
idth+0.075
Neg logit contributions
burg-0.091
selves-0.090
lishing-0.083
aye-0.080
 Publication-0.079
Tokenistic
Token ID2569
Feature activation+9.01e-01

Pos logit contributions
burg+0.118
selves+0.117
lishing+0.109
aye+0.105
 Publication+0.102
Neg logit contributions
quartered-0.120
 DOT-0.115
Senator-0.107
 divides-0.105
idth-0.102
Token union
Token ID6441
Feature activation+1.36e+00

Pos logit contributions
burg+0.706
selves+0.702
lishing+0.649
aye+0.631
 Publication+0.615
Neg logit contributions
quartered-0.651
 DOT-0.622
Senator-0.577
 divides-0.561
idth-0.552
Tokenism
Token ID1042
Feature activation+9.54e-01

Pos logit contributions
burg+1.052
selves+1.043
lishing+0.966
aye+0.932
 Publication+0.906
Neg logit contributions
quartered-0.986
 DOT-0.949
Senator-0.882
 divides-0.855
idth-0.841
Token-
Token ID12
Feature activation+2.54e-01

Pos logit contributions
burg+0.031
selves+0.030
lishing+0.029
aye+0.028
 Publication+0.027
Neg logit contributions
quartered-0.022
 DOT-0.021
Senator-0.019
 divides-0.018
idth-0.018
Tokennot
Token ID1662
Feature activation-1.69e-01

Pos logit contributions
burg+0.194
selves+0.193
lishing+0.179
aye+0.173
 Publication+0.168
Neg logit contributions
quartered-0.187
 DOT-0.182
Senator-0.166
 divides-0.165
idth-0.159
Tokench
Token ID354
Feature activation-3.53e-01

Pos logit contributions
quartered+0.119
 DOT+0.114
Senator+0.105
 divides+0.103
idth+0.099
Neg logit contributions
burg-0.135
selves-0.134
lishing-0.125
aye-0.121
 Publication-0.118
Token studio
Token ID8034
Feature activation-6.31e-01

Pos logit contributions
quartered+0.252
 DOT+0.241
Senator+0.224
 divides+0.218
idth+0.212
Neg logit contributions
burg-0.281
selves-0.278
lishing-0.259
aye-0.251
 Publication-0.246
Token specializing
Token ID40847
Feature activation-8.17e-02

Pos logit contributions
quartered+0.437
 DOT+0.418
Senator+0.385
 divides+0.376
idth+0.364
Neg logit contributions
burg-0.513
selves-0.509
lishing-0.476
aye-0.459
 Publication-0.450
Token in
Token ID287
Feature activation+7.03e-01

Pos logit contributions
quartered+0.056
 DOT+0.053
Senator+0.049
 divides+0.048
idth+0.046
Neg logit contributions
burg-0.067
selves-0.067
lishing-0.062
aye-0.060
 Publication-0.059
Token ecc
Token ID21399
Feature activation+9.99e-01

Pos logit contributions
burg+0.563
selves+0.556
lishing+0.519
aye+0.501
 Publication+0.491
Neg logit contributions
quartered-0.500
 DOT-0.473
Senator-0.442
 divides-0.429
idth-0.420
Tokenles
Token ID829
Feature activation+1.71e-01

Pos logit contributions
burg+0.646
selves+0.635
lishing+0.585
aye+0.561
 Publication+0.543
Neg logit contributions
quartered-0.865
 DOT-0.830
Senator-0.779
 divides-0.770
idth-0.746
Tokenial
Token ID498
Feature activation+1.03e+00

Pos logit contributions
burg+0.111
selves+0.109
lishing+0.101
aye+0.096
 Publication+0.092
Neg logit contributions
quartered-0.148
 DOT-0.142
Senator-0.134
 divides-0.131
idth-0.128
Token art
Token ID1242
Feature activation-3.01e-01

Pos logit contributions
burg+0.792
selves+0.782
lishing+0.728
aye+0.699
 Publication+0.688
Neg logit contributions
quartered-0.766
 DOT-0.728
Senator-0.680
 divides-0.657
idth-0.647
Token had
Token ID550
Feature activation+1.61e-01

Pos logit contributions
quartered+0.206
 DOT+0.197
Senator+0.181
 divides+0.177
idth+0.171
Neg logit contributions
burg-0.248
selves-0.246
lishing-0.230
aye-0.222
 Publication-0.217
Token for
Token ID329
Feature activation+2.62e-01

Pos logit contributions
burg+0.283
selves+0.278
lishing+0.259
aye+0.249
 Publication+0.244
Neg logit contributions
quartered-0.279
 DOT-0.264
Senator-0.247
 divides-0.240
idth-0.235
Token players
Token ID1938
Feature activation+1.83e-01

Pos logit contributions
burg+0.214
selves+0.213
lishing+0.198
aye+0.190
 Publication+0.188
Neg logit contributions
quartered-0.182
 DOT-0.171
Senator-0.159
 divides-0.155
idth-0.151
Token from
Token ID422
Feature activation+4.37e-01

Pos logit contributions
burg+0.144
selves+0.142
lishing+0.131
aye+0.127
 Publication+0.124
Neg logit contributions
quartered-0.134
 DOT-0.127
Senator-0.119
 divides-0.116
idth-0.113
Token the
Token ID262
Feature activation+2.33e-01

Pos logit contributions
burg+0.351
selves+0.347
lishing+0.324
aye+0.311
 Publication+0.307
Neg logit contributions
quartered-0.308
 DOT-0.291
Senator-0.272
 divides-0.265
idth-0.259
Token two
Token ID734
Feature activation+4.24e-01

Pos logit contributions
burg+0.190
selves+0.188
lishing+0.175
aye+0.169
 Publication+0.167
Neg logit contributions
quartered-0.162
 DOT-0.152
Senator-0.142
 divides-0.138
idth-0.135
Token title
Token ID3670
Feature activation+8.12e-01

Pos logit contributions
burg+0.357
selves+0.354
lishing+0.328
aye+0.316
 Publication+0.313
Neg logit contributions
quartered-0.281
 DOT-0.266
Senator-0.248
 divides-0.239
idth-0.235
Token teams
Token ID3466
Feature activation-1.53e-01

Pos logit contributions
burg+0.664
selves+0.659
lishing+0.615
aye+0.591
 Publication+0.580
Neg logit contributions
quartered-0.561
 DOT-0.532
Senator-0.493
 divides-0.477
idth-0.467
Token held
Token ID2714
Feature activation+1.89e-01

Pos logit contributions
quartered+0.113
 DOT+0.108
Senator+0.100
 divides+0.098
idth+0.096
Neg logit contributions
burg-0.118
selves-0.117
lishing-0.108
aye-0.105
 Publication-0.102
Token about
Token ID546
Feature activation+1.06e-01

Pos logit contributions
burg+0.154
selves+0.154
lishing+0.143
aye+0.138
 Publication+0.134
Neg logit contributions
quartered-0.130
 DOT-0.125
Senator-0.116
 divides-0.113
idth-0.110
Token a
Token ID257
Feature activation+1.96e-01

Pos logit contributions
burg+0.082
selves+0.081
lishing+0.075
aye+0.073
 Publication+0.071
Neg logit contributions
quartered-0.079
 DOT-0.075
Senator-0.070
 divides-0.068
idth-0.067
Token decade
Token ID5707
Feature activation+4.17e-01

Pos logit contributions
burg+0.155
selves+0.153
lishing+0.143
aye+0.139
 Publication+0.135
Neg logit contributions
quartered-0.142
 DOT-0.133
Senator-0.124
 divides-0.120
idth-0.119
Token her
Token ID607
Feature activation+7.52e-01

Pos logit contributions
burg+0.173
selves+0.171
lishing+0.158
aye+0.152
 Publication+0.149
Neg logit contributions
quartered-0.183
 DOT-0.175
Senator-0.163
 divides-0.160
idth-0.156
Token friends
Token ID2460
Feature activation-7.10e-01

Pos logit contributions
burg+0.574
selves+0.573
lishing+0.527
aye+0.507
 Publication+0.497
Neg logit contributions
quartered-0.562
 DOT-0.534
Senator-0.496
 divides-0.484
idth-0.475
Token were
Token ID547
Feature activation+8.28e-01

Pos logit contributions
quartered+0.458
 DOT+0.435
Senator+0.398
 divides+0.388
idth+0.376
Neg logit contributions
burg-0.615
selves-0.610
lishing-0.569
aye-0.552
 Publication-0.542
Token more
Token ID517
Feature activation+6.70e-01

Pos logit contributions
burg+0.633
selves+0.630
lishing+0.579
aye+0.556
 Publication+0.546
Neg logit contributions
quartered-0.620
 DOT-0.587
Senator-0.546
 divides-0.533
idth-0.529
Token prone
Token ID17592
Feature activation+9.28e-01

Pos logit contributions
burg+0.508
selves+0.502
lishing+0.467
aye+0.450
 Publication+0.438
Neg logit contributions
quartered-0.504
 DOT-0.481
Senator-0.447
 divides-0.435
idth-0.429
Token to
Token ID284
Feature activation+4.69e-01

Pos logit contributions
burg+0.824
selves+0.820
lishing+0.768
aye+0.745
 Publication+0.729
Neg logit contributions
quartered-0.574
 DOT-0.547
Senator-0.498
 divides-0.486
idth-0.466
Token use
Token ID779
Feature activation+3.27e-01

Pos logit contributions
burg+0.364
selves+0.360
lishing+0.335
aye+0.323
 Publication+0.316
Neg logit contributions
quartered-0.344
 DOT-0.329
Senator-0.305
 divides-0.298
idth-0.291
Token prof
Token ID1534
Feature activation-2.34e-02

Pos logit contributions
burg+0.252
selves+0.250
lishing+0.232
aye+0.224
 Publication+0.219
Neg logit contributions
quartered-0.241
 DOT-0.230
Senator-0.214
 divides-0.209
idth-0.203
Tokenanity
Token ID19689
Feature activation+1.16e+00

Pos logit contributions
quartered+0.021
 DOT+0.020
Senator+0.019
 divides+0.019
idth+0.018
Neg logit contributions
burg-0.014
selves-0.014
lishing-0.013
aye-0.012
 Publication-0.012
Token and
Token ID290
Feature activation+2.25e-01

Pos logit contributions
selves+0.871
burg+0.871
lishing+0.795
aye+0.764
 Publication+0.741
Neg logit contributions
quartered-0.890
 DOT-0.837
Senator-0.786
 divides-0.759
idth-0.755
Token party
Token ID2151
Feature activation-4.44e-01

Pos logit contributions
burg+0.180
selves+0.179
lishing+0.165
aye+0.159
 Publication+0.157
Neg logit contributions
quartered-0.160
 DOT-0.153
Senator-0.141
 divides-0.137
idth-0.135
Token and
Token ID290
Feature activation-1.48e-01

Pos logit contributions
burg+0.421
selves+0.416
lishing+0.389
aye+0.376
 Publication+0.369
Neg logit contributions
quartered-0.359
 DOT-0.344
Senator-0.317
 divides-0.310
idth-0.299
Token the
Token ID262
Feature activation-5.96e-02

Pos logit contributions
quartered+0.101
 DOT+0.096
Senator+0.088
 divides+0.087
idth+0.084
Neg logit contributions
selves-0.121
burg-0.121
lishing-0.112
aye-0.109
 Publication-0.106
Token Giant
Token ID17384
Feature activation+1.01e+00

Pos logit contributions
quartered+0.036
 DOT+0.034
Senator+0.031
 divides+0.030
idth+0.029
Neg logit contributions
burg-0.054
selves-0.053
lishing-0.050
aye-0.049
 Publication-0.048
Token Peach
Token ID33230
Feature activation-1.85e-01

Pos logit contributions
burg+0.690
selves+0.684
lishing+0.628
aye+0.604
 Publication+0.590
Neg logit contributions
quartered-0.832
 DOT-0.802
Senator-0.749
 divides-0.738
idth-0.715
Token,
Token ID11
Feature activation-7.19e-02

Pos logit contributions
quartered+0.130
 DOT+0.124
Senator+0.115
 divides+0.112
idth+0.108
Neg logit contributions
burg-0.149
selves-0.148
lishing-0.138
aye-0.133
 Publication-0.130
Token could
Token ID714
Feature activation-2.44e-01

Pos logit contributions
quartered+0.046
 DOT+0.044
Senator+0.040
 divides+0.039
idth+0.038
Neg logit contributions
burg-0.062
selves-0.062
lishing-0.058
aye-0.056
 Publication-0.055
Token not
Token ID407
Feature activation-6.32e-03

Pos logit contributions
quartered+0.168
 DOT+0.161
Senator+0.148
 divides+0.145
idth+0.140
Neg logit contributions
burg-0.200
selves-0.199
lishing-0.186
aye-0.180
 Publication-0.176
Token be
Token ID307
Feature activation+1.65e-01

Pos logit contributions
quartered+0.004
 DOT+0.004
Senator+0.004
 divides+0.004
idth+0.003
Neg logit contributions
burg-0.005
selves-0.005
lishing-0.005
aye-0.005
 Publication-0.005
Token sold
Token ID2702
Feature activation-4.72e-01

Pos logit contributions
burg+0.132
selves+0.131
lishing+0.122
aye+0.118
 Publication+0.115
Neg logit contributions
quartered-0.117
 DOT-0.112
Senator-0.104
 divides-0.101
idth-0.098
Token to
Token ID284
Feature activation+2.17e-01

Pos logit contributions
quartered+0.328
 DOT+0.314
Senator+0.289
 divides+0.283
idth+0.273
Neg logit contributions
burg-0.383
selves-0.381
lishing-0.355
aye-0.343
 Publication-0.335
Token British
Token ID3517
Feature activation+5.11e-01

Pos logit contributions
burg+0.176
selves+0.174
lishing+0.163
aye+0.157
 Publication+0.154
Neg logit contributions
quartered-0.151
 DOT-0.145
Senator-0.133
 divides-0.131
idth-0.126
Token You
Token ID921
Feature activation+4.06e-01

Pos logit contributions
burg+0.295
selves+0.293
lishing+0.270
 Publication+0.260
aye+0.260
Neg logit contributions
quartered-0.266
 DOT-0.247
Senator-0.230
 divides-0.227
idth-0.222
Token have
Token ID423
Feature activation+2.16e-01

Pos logit contributions
burg+0.350
selves+0.347
lishing+0.323
aye+0.312
 Publication+0.307
Neg logit contributions
quartered-0.266
 DOT-0.250
Senator-0.231
 divides-0.224
idth-0.219
Token a
Token ID257
Feature activation+7.40e-02

Pos logit contributions
burg+0.185
selves+0.183
lishing+0.170
aye+0.165
 Publication+0.162
Neg logit contributions
quartered-0.143
 DOT-0.134
Senator-0.124
 divides-0.119
idth-0.117
Token technical
Token ID6276
Feature activation-1.88e-02

Pos logit contributions
burg+0.059
selves+0.058
lishing+0.054
aye+0.052
 Publication+0.051
Neg logit contributions
quartered-0.053
 DOT-0.050
Senator-0.047
 divides-0.046
idth-0.045
Token fault
Token ID8046
Feature activation+4.74e-01

Pos logit contributions
quartered+0.015
 DOT+0.014
Senator+0.013
 divides+0.013
idth+0.012
Neg logit contributions
burg-0.014
selves-0.014
lishing-0.013
aye-0.012
 Publication-0.012
Token in
Token ID287
Feature activation+2.16e-02

Pos logit contributions
burg+0.362
selves+0.358
lishing+0.331
aye+0.319
 Publication+0.312
Neg logit contributions
quartered-0.354
 DOT-0.337
Senator-0.315
 divides-0.306
idth-0.300
Token your
Token ID534
Feature activation+5.17e-01

Pos logit contributions
burg+0.018
selves+0.017
lishing+0.016
aye+0.016
 Publication+0.015
Neg logit contributions
quartered-0.015
 DOT-0.014
Senator-0.013
 divides-0.013
idth-0.012
Token action
Token ID2223
Feature activation+2.30e-01

Pos logit contributions
burg+0.343
selves+0.342
lishing+0.310
aye+0.298
 Publication+0.290
Neg logit contributions
quartered-0.437
 DOT-0.419
Senator-0.394
 divides-0.386
idth-0.378
Token,
Token ID11
Feature activation+2.89e-02

Pos logit contributions
burg+0.174
selves+0.173
lishing+0.160
aye+0.155
 Publication+0.151
Neg logit contributions
quartered-0.172
 DOT-0.165
Senator-0.153
 divides-0.150
idth-0.145
Token go
Token ID467
Feature activation+1.26e+00

Pos logit contributions
burg+0.023
selves+0.023
lishing+0.021
aye+0.021
 Publication+0.020
Neg logit contributions
quartered-0.020
 DOT-0.019
Senator-0.018
 divides-0.017
idth-0.017
Token out
Token ID503
Feature activation-4.43e-01

Pos logit contributions
burg+0.987
selves+0.968
lishing+0.891
aye+0.876
 Publication+0.841
Neg logit contributions
quartered-0.918
 DOT-0.873
Senator-0.815
 divides-0.799
idth-0.776
Tokeninsky
Token ID19870
Feature activation+6.03e-01

Pos logit contributions
burg+0.457
selves+0.446
lishing+0.412
aye+0.401
 Publication+0.390
Neg logit contributions
quartered-0.484
 DOT-0.463
Senator-0.437
 divides-0.427
idth-0.414
Token Ag
Token ID2449
Feature activation+1.48e-01

Pos logit contributions
burg+0.441
selves+0.429
lishing+0.397
aye+0.386
 Publication+0.376
Neg logit contributions
quartered-0.472
 DOT-0.451
Senator-0.424
 divides-0.414
idth-0.402
Tokenrop
Token ID1773
Feature activation+7.24e-01

Pos logit contributions
burg+0.102
selves+0.101
lishing+0.093
aye+0.090
 Publication+0.087
Neg logit contributions
quartered-0.121
 DOT-0.117
Senator-0.109
 divides-0.108
idth-0.104
Tokenost
Token ID455
Feature activation+6.19e-01

Pos logit contributions
burg+0.511
selves+0.504
lishing+0.465
aye+0.450
 Publication+0.434
Neg logit contributions
quartered-0.583
 DOT-0.560
Senator-0.523
 divides-0.515
 elector-0.501
Tokenach
Token ID620
Feature activation-5.06e-01

Pos logit contributions
burg+0.438
selves+0.429
lishing+0.397
aye+0.384
 Publication+0.373
Neg logit contributions
quartered-0.496
 DOT-0.479
Senator-0.448
 divides-0.439
idth-0.426
Token,
Token ID11
Feature activation+5.48e-02

Pos logit contributions
quartered+0.402
 DOT+0.386
Senator+0.362
 divides+0.355
 elector+0.344
Neg logit contributions
burg-0.364
selves-0.357
lishing-0.330
aye-0.320
 Publication-0.309
Token is
Token ID318
Feature activation-1.88e-01

Pos logit contributions
burg+0.046
selves+0.046
lishing+0.043
aye+0.041
 Publication+0.041
Neg logit contributions
quartered-0.036
 DOT-0.034
Senator-0.032
 divides-0.031
idth-0.030
Token a
Token ID257
Feature activation-2.11e-01

Pos logit contributions
quartered+0.123
 DOT+0.116
Senator+0.108
 divides+0.105
idth+0.102
Neg logit contributions
burg-0.161
selves-0.159
lishing-0.148
aye-0.144
 Publication-0.141
Token recipient
Token ID17800
Feature activation+1.19e-02

Pos logit contributions
quartered+0.124
 DOT+0.117
Senator+0.106
 divides+0.104
idth+0.099
Neg logit contributions
burg-0.195
selves-0.193
lishing-0.182
aye-0.176
 Publication-0.173
Token of
Token ID286
Feature activation+6.31e-02

Pos logit contributions
burg+0.010
selves+0.010
lishing+0.010
aye+0.009
 Publication+0.009
Neg logit contributions
quartered-0.008
 DOT-0.007
Senator-0.007
 divides-0.007
idth-0.006
Token $
Token ID720
Feature activation-7.53e-01

Pos logit contributions
burg+0.048
selves+0.048
lishing+0.045
aye+0.043
 Publication+0.042
Neg logit contributions
quartered-0.047
 DOT-0.045
Senator-0.041
 divides-0.041
idth-0.039
Token experiences
Token ID6461
Feature activation-7.95e-02

Pos logit contributions
burg+0.250
selves+0.249
lishing+0.229
aye+0.222
 Publication+0.217
Neg logit contributions
quartered-0.245
 DOT-0.233
Senator-0.218
 divides-0.211
idth-0.208
Token;
Token ID26
Feature activation+5.35e-03

Pos logit contributions
quartered+0.057
 DOT+0.055
Senator+0.051
 divides+0.050
idth+0.049
Neg logit contributions
burg-0.062
selves-0.062
lishing-0.058
aye-0.056
 Publication-0.054
Token the
Token ID262
Feature activation+4.61e-02

Pos logit contributions
burg+0.004
selves+0.004
lishing+0.004
aye+0.004
 Publication+0.004
Neg logit contributions
quartered-0.004
 DOT-0.004
Senator-0.003
 divides-0.003
idth-0.003
Token first
Token ID717
Feature activation-1.44e-02

Pos logit contributions
burg+0.035
selves+0.034
lishing+0.032
aye+0.031
 Publication+0.030
Neg logit contributions
quartered-0.035
 DOT-0.033
Senator-0.031
 divides-0.030
idth-0.030
Token time
Token ID640
Feature activation+1.21e-01

Pos logit contributions
quartered+0.011
 DOT+0.011
Senator+0.010
 divides+0.010
idth+0.009
Neg logit contributions
burg-0.011
selves-0.010
lishing-0.010
aye-0.009
 Publication-0.009
Token I
Token ID314
Feature activation-2.46e-01

Pos logit contributions
burg+0.095
selves+0.094
lishing+0.087
aye+0.084
 Publication+0.082
Neg logit contributions
quartered-0.088
 DOT-0.083
Senator-0.077
 divides-0.076
idth-0.074
Token got
Token ID1392
Feature activation+8.25e-01

Pos logit contributions
quartered+0.164
 DOT+0.155
Senator+0.143
 divides+0.140
idth+0.135
Neg logit contributions
burg-0.208
selves-0.207
lishing-0.192
aye-0.187
 Publication-0.182
Token t
Token ID256
Feature activation-4.69e-01

Pos logit contributions
burg+0.656
selves+0.653
lishing+0.604
aye+0.584
 Publication+0.573
Neg logit contributions
quartered-0.588
 DOT-0.558
Senator-0.519
 divides-0.506
idth-0.495
Tokenumbled
Token ID11137
Feature activation+7.27e-01

Pos logit contributions
quartered+0.358
 DOT+0.343
Senator+0.318
 divides+0.313
idth+0.302
Neg logit contributions
burg-0.350
selves-0.347
lishing-0.321
aye-0.312
 Publication-0.301
Token by
Token ID416
Feature activation+2.21e-02

Pos logit contributions
burg+0.542
selves+0.540
lishing+0.498
aye+0.482
 Publication+0.464
Neg logit contributions
quartered-0.554
 DOT-0.532
Senator-0.496
 divides-0.485
idth-0.470
Token a
Token ID257
Feature activation-6.31e-02

Pos logit contributions
burg+0.019
selves+0.018
lishing+0.017
aye+0.017
 Publication+0.016
Neg logit contributions
quartered-0.015
 DOT-0.014
Senator-0.013
 divides-0.013
idth-0.012
Token,
Token ID11
Feature activation+9.19e-02

Pos logit contributions
quartered+0.215
 DOT+0.203
Senator+0.188
idth+0.180
 divides+0.180
Neg logit contributions
burg-0.244
selves-0.242
lishing-0.223
aye-0.217
 Publication-0.213
Token doesn
Token ID1595
Feature activation+2.05e-01

Pos logit contributions
burg+0.076
selves+0.075
lishing+0.070
aye+0.068
 Publication+0.067
Neg logit contributions
quartered-0.063
 DOT-0.060
Senator-0.055
 divides-0.053
idth-0.052
Token't
Token ID470
Feature activation-2.94e-01

Pos logit contributions
burg+0.171
selves+0.169
lishing+0.156
aye+0.150
 Publication+0.148
Neg logit contributions
quartered-0.146
 DOT-0.133
 divides-0.121
Senator-0.121
idth-0.119
Token mean
Token ID1612
Feature activation+8.19e-01

Pos logit contributions
quartered+0.261
 DOT+0.251
Senator+0.236
 divides+0.231
idth+0.226
Neg logit contributions
burg-0.183
selves-0.181
lishing-0.165
aye-0.158
 Publication-0.153
Token that
Token ID326
Feature activation+1.63e-01

Pos logit contributions
burg+0.677
selves+0.668
lishing+0.623
aye+0.607
 Publication+0.593
Neg logit contributions
quartered-0.559
 DOT-0.534
Senator-0.491
 divides-0.478
idth-0.466
Token you
Token ID345
Feature activation-6.56e-02

Pos logit contributions
burg+0.136
selves+0.134
lishing+0.125
aye+0.121
 Publication+0.119
Neg logit contributions
quartered-0.111
 DOT-0.105
Senator-0.097
 divides-0.094
idth-0.092
Token should
Token ID815
Feature activation+1.33e-01

Pos logit contributions
quartered+0.045
 DOT+0.043
Senator+0.039
 divides+0.038
idth+0.037
Neg logit contributions
burg-0.054
selves-0.054
lishing-0.050
aye-0.049
 Publication-0.047
Token tell
Token ID1560
Feature activation+2.56e-01

Pos logit contributions
burg+0.109
selves+0.108
lishing+0.100
aye+0.097
 Publication+0.095
Neg logit contributions
quartered-0.093
 DOT-0.088
Senator-0.082
 divides-0.079
idth-0.077
Token us
Token ID514
Feature activation-2.74e-01

Pos logit contributions
burg+0.217
selves+0.216
lishing+0.201
aye+0.195
 Publication+0.191
Neg logit contributions
quartered-0.170
 DOT-0.162
Senator-0.149
 divides-0.145
idth-0.141
Token what
Token ID644
Feature activation+6.68e-02

Pos logit contributions
quartered+0.197
 DOT+0.189
Senator+0.174
 divides+0.170
idth+0.166
Neg logit contributions
burg-0.216
selves-0.215
lishing-0.199
aye-0.192
 Publication-0.188
Token to
Token ID284
Feature activation+3.21e-03

Pos logit contributions
burg+0.057
selves+0.057
lishing+0.052
aye+0.051
 Publication+0.050
Neg logit contributions
quartered-0.044
 DOT-0.042
Senator-0.038
 divides-0.037
idth-0.036
Token Pasadena
Token ID41010
Feature activation+3.04e-01

Pos logit contributions
burg+0.091
selves+0.089
lishing+0.083
aye+0.080
 Publication+0.079
Neg logit contributions
quartered-0.092
 DOT-0.088
Senator-0.082
 divides-0.081
idth-0.078
Token,
Token ID11
Feature activation-1.23e-01

Pos logit contributions
burg+0.230
selves+0.227
lishing+0.211
aye+0.203
 Publication+0.199
Neg logit contributions
quartered-0.228
 DOT-0.219
Senator-0.203
 divides-0.200
idth-0.193
Token California
Token ID3442
Feature activation+4.08e-01

Pos logit contributions
quartered+0.095
 DOT+0.092
Senator+0.085
 divides+0.083
idth+0.081
Neg logit contributions
burg-0.090
selves-0.089
lishing-0.083
aye-0.079
 Publication-0.078
Token.
Token ID13
Feature activation-1.51e-01

Pos logit contributions
burg+0.280
selves+0.278
lishing+0.256
aye+0.246
 Publication+0.239
Neg logit contributions
quartered-0.334
 DOT-0.322
Senator-0.300
 divides-0.296
idth-0.286
Token Hide
Token ID10415
Feature activation-2.01e-01

Pos logit contributions
quartered+0.116
 DOT+0.109
Senator+0.101
 divides+0.100
idth+0.098
Neg logit contributions
burg-0.114
selves-0.113
lishing-0.105
aye-0.100
 Publication-0.100
Token Caption
Token ID11260
Feature activation-2.23e+00

Pos logit contributions
quartered+0.150
 DOT+0.136
 divides+0.126
Senator+0.125
 athlet+0.123
Neg logit contributions
burg-0.161
selves-0.160
lishing-0.147
 Publication-0.143
aye-0.139
Token 12
Token ID1105
Feature activation-7.44e-01

Pos logit contributions
quartered+1.694
 DOT+1.604
Senator+1.485
 divides+1.467
idth+1.430
Neg logit contributions
burg-1.687
selves-1.674
lishing-1.546
aye-1.489
 Publication-1.463
Token of
Token ID286
Feature activation-1.56e-01

Pos logit contributions
quartered+0.538
 DOT+0.493
Senator+0.455
 divides+0.449
idth+0.445
Neg logit contributions
burg-0.612
selves-0.599
lishing-0.556
aye-0.532
 Publication-0.526
Token 15
Token ID1315
Feature activation-9.74e-01

Pos logit contributions
quartered+0.140
 DOT+0.132
Senator+0.125
 divides+0.123
idth+0.120
Neg logit contributions
burg-0.098
selves-0.098
lishing-0.088
aye-0.084
 Publication-0.082
Token Photos
Token ID9434
Feature activation+5.00e-01

Pos logit contributions
quartered+0.651
 DOT+0.590
Senator+0.546
 divides+0.536
 elector+0.522
Neg logit contributions
burg-0.851
selves-0.843
lishing-0.781
aye-0.754
 Publication-0.753
Token:
Token ID25
Feature activation-2.21e-01

Pos logit contributions
burg+0.357
selves+0.352
lishing+0.319
aye+0.302
 Publication+0.301
Neg logit contributions
quartered-0.423
 DOT-0.385
 divides-0.358
Senator-0.358
idth-0.354
Token by
Token ID416
Feature activation-2.62e-01

Pos logit contributions
quartered+0.328
 DOT+0.314
Senator+0.292
 divides+0.285
idth+0.276
Neg logit contributions
burg-0.360
selves-0.357
lishing-0.332
aye-0.320
 Publication-0.314
Token showing
Token ID4478
Feature activation+9.55e-01

Pos logit contributions
quartered+0.188
 DOT+0.180
Senator+0.167
 divides+0.163
idth+0.158
Neg logit contributions
burg-0.207
selves-0.205
lishing-0.191
aye-0.184
 Publication-0.180
Token him
Token ID683
Feature activation-3.53e-01

Pos logit contributions
selves+0.795
burg+0.794
lishing+0.737
aye+0.714
 Publication+0.702
Neg logit contributions
quartered-0.644
 DOT-0.605
Senator-0.562
 divides-0.540
idth-0.531
Token how
Token ID703
Feature activation-3.21e-01

Pos logit contributions
quartered+0.253
 DOT+0.240
Senator+0.224
 divides+0.217
idth+0.212
Neg logit contributions
burg-0.281
selves-0.279
lishing-0.258
aye-0.249
 Publication-0.244
Token to
Token ID284
Feature activation-4.45e-01

Pos logit contributions
quartered+0.203
 DOT+0.193
Senator+0.177
 divides+0.171
idth+0.166
Neg logit contributions
burg-0.281
selves-0.278
lishing-0.260
aye-0.252
 Publication-0.247
Token bring
Token ID2222
Feature activation-2.25e+00

Pos logit contributions
quartered+0.274
 DOT+0.258
Senator+0.237
 divides+0.229
idth+0.221
Neg logit contributions
burg-0.399
selves-0.395
lishing-0.370
aye-0.359
 Publication-0.354
Token the
Token ID262
Feature activation-3.13e-01

Pos logit contributions
quartered+1.560
 DOT+1.496
Senator+1.376
 divides+1.353
idth+1.301
Neg logit contributions
burg-1.822
selves-1.810
lishing-1.690
aye-1.634
 Publication-1.597
Token kn
Token ID638
Feature activation-4.17e-01

Pos logit contributions
quartered+0.227
 DOT+0.217
Senator+0.202
 divides+0.197
idth+0.191
Neg logit contributions
burg-0.244
selves-0.242
lishing-0.225
aye-0.217
 Publication-0.213
Tokenotted
Token ID8426
Feature activation+1.03e+00

Pos logit contributions
quartered+0.306
 DOT+0.293
Senator+0.271
 divides+0.267
 elector+0.256
Neg logit contributions
burg-0.324
selves-0.321
lishing-0.299
aye-0.290
 Publication-0.281
Token thread
Token ID4704
Feature activation+7.18e-01

Pos logit contributions
burg+0.825
selves+0.814
lishing+0.756
aye+0.733
 Publication+0.721
Neg logit contributions
quartered-0.733
 DOT-0.693
Senator-0.653
 divides-0.624
idth-0.610
Token up
Token ID510
Feature activation-3.30e-01

Pos logit contributions
burg+0.531
selves+0.527
lishing+0.487
aye+0.470
 Publication+0.458
Neg logit contributions
quartered-0.550
 DOT-0.528
Senator-0.493
 divides-0.482
idth-0.467
Token at
Token ID379
Feature activation-2.17e-02

Pos logit contributions
burg+0.086
selves+0.085
lishing+0.079
aye+0.076
 Publication+0.074
Neg logit contributions
quartered-0.088
 DOT-0.085
Senator-0.079
 divides-0.077
idth-0.076
Token Anfield
Token ID44303
Feature activation+4.34e-01

Pos logit contributions
quartered+0.017
 DOT+0.016
Senator+0.015
 divides+0.015
idth+0.014
Neg logit contributions
burg-0.016
selves-0.016
lishing-0.015
aye-0.014
 Publication-0.014
Token in
Token ID287
Feature activation+3.16e-01

Pos logit contributions
burg+0.312
selves+0.308
lishing+0.284
aye+0.274
 Publication+0.268
Neg logit contributions
quartered-0.343
 DOT-0.329
Senator-0.308
 divides-0.300
idth-0.294
Token October
Token ID3267
Feature activation+3.61e-01

Pos logit contributions
burg+0.252
selves+0.250
lishing+0.233
aye+0.223
 Publication+0.221
Neg logit contributions
quartered-0.225
 DOT-0.214
Senator-0.199
 divides-0.194
idth-0.189
Token.
Token ID13
Feature activation+8.10e-02

Pos logit contributions
burg+0.285
selves+0.282
lishing+0.262
aye+0.254
 Publication+0.249
Neg logit contributions
quartered-0.261
 DOT-0.248
Senator-0.231
 divides-0.225
idth-0.220
Token Photograph
Token ID18903
Feature activation-1.89e+00

Pos logit contributions
burg+0.070
selves+0.070
lishing+0.065
aye+0.063
 Publication+0.062
Neg logit contributions
quartered-0.052
 DOT-0.049
Senator-0.045
 divides-0.044
idth-0.043
Token:
Token ID25
Feature activation-2.70e-02

Pos logit contributions
quartered+1.513
 DOT+1.425
Senator+1.319
 divides+1.312
idth+1.272
Neg logit contributions
burg-1.379
selves-1.362
lishing-1.259
aye-1.204
 Publication-1.184
Token Clive
Token ID47722
Feature activation+1.73e-01

Pos logit contributions
quartered+0.019
 DOT+0.019
Senator+0.017
 divides+0.017
idth+0.016
Neg logit contributions
burg-0.021
selves-0.021
lishing-0.020
aye-0.019
 Publication-0.019
Token Brun
Token ID15700
Feature activation-8.40e-02

Pos logit contributions
burg+0.129
selves+0.127
lishing+0.118
aye+0.114
 Publication+0.113
Neg logit contributions
quartered-0.134
 DOT-0.125
Senator-0.118
 divides-0.116
idth-0.112
Tokenskill
Token ID42401
Feature activation+8.92e-01

Pos logit contributions
quartered+0.061
 DOT+0.058
Senator+0.054
 divides+0.053
 elector+0.051
Neg logit contributions
burg-0.066
selves-0.065
lishing-0.060
aye-0.059
 Wit-0.056
Token/
Token ID14
Feature activation+8.34e-02

Pos logit contributions
burg+0.614
selves+0.604
lishing+0.557
aye+0.537
 Publication+0.525
Neg logit contributions
quartered-0.737
 DOT-0.707
Senator-0.661
 divides-0.649
idth-0.634
Token gave
Token ID2921
Feature activation+4.80e-01

Pos logit contributions
quartered+0.169
 DOT+0.161
Senator+0.148
 divides+0.144
idth+0.140
Neg logit contributions
burg-0.217
selves-0.215
lishing-0.201
aye-0.194
 Publication-0.190
Token me
Token ID502
Feature activation+2.06e-01

Pos logit contributions
burg+0.353
selves+0.349
lishing+0.324
aye+0.313
 Publication+0.306
Neg logit contributions
quartered-0.373
 DOT-0.353
Senator-0.332
 divides-0.324
idth-0.315
Token an
Token ID281
Feature activation-7.83e-02

Pos logit contributions
burg+0.171
selves+0.170
lishing+0.158
aye+0.153
 Publication+0.151
Neg logit contributions
quartered-0.141
 DOT-0.132
Senator-0.122
 divides-0.119
idth-0.115
Token excuse
Token ID12226
Feature activation-6.43e-01

Pos logit contributions
quartered+0.059
 DOT+0.056
Senator+0.053
 divides+0.052
idth+0.050
Neg logit contributions
burg-0.059
selves-0.058
lishing-0.054
aye-0.053
 Publication-0.051
Token to
Token ID284
Feature activation-2.49e-01

Pos logit contributions
quartered+0.421
 DOT+0.401
Senator+0.369
 divides+0.360
idth+0.347
Neg logit contributions
burg-0.548
selves-0.545
lishing-0.507
aye-0.493
 Publication-0.481
Token look
Token ID804
Feature activation-1.91e+00

Pos logit contributions
quartered+0.164
 DOT+0.152
Senator+0.142
 divides+0.137
idth+0.135
Neg logit contributions
burg-0.214
selves-0.211
lishing-0.198
aye-0.192
 Publication-0.189
Token up
Token ID510
Feature activation-4.12e-01

Pos logit contributions
quartered+1.326
 DOT+1.279
Senator+1.172
 divides+1.153
idth+1.106
Neg logit contributions
burg-1.535
selves-1.525
lishing-1.424
aye-1.375
 Publication-1.347
Token some
Token ID617
Feature activation-4.11e-01

Pos logit contributions
quartered+0.281
 DOT+0.265
Senator+0.245
 divides+0.239
idth+0.232
Neg logit contributions
burg-0.342
selves-0.340
lishing-0.317
aye-0.307
 Publication-0.303
Token characters
Token ID3435
Feature activation-2.77e-01

Pos logit contributions
quartered+0.279
 DOT+0.264
Senator+0.244
 divides+0.238
idth+0.231
Neg logit contributions
burg-0.343
selves-0.340
lishing-0.317
aye-0.307
 Publication-0.302
Token I
Token ID314
Feature activation-3.05e-01

Pos logit contributions
quartered+0.205
 DOT+0.193
Senator+0.180
 divides+0.176
idth+0.172
Neg logit contributions
burg-0.215
selves-0.213
lishing-0.196
aye-0.190
 Publication-0.186
Token'm
Token ID1101
Feature activation-9.82e-02

Pos logit contributions
quartered+0.207
 DOT+0.193
Senator+0.179
 divides+0.174
idth+0.172
Neg logit contributions
burg-0.257
selves-0.255
lishing-0.236
aye-0.228
 Publication-0.224
Tokenva
Token ID6862
Feature activation-1.53e+00

Pos logit contributions
quartered+0.515
 DOT+0.495
Senator+0.457
 divides+0.450
idth+0.434
Neg logit contributions
burg-0.540
selves-0.534
lishing-0.497
aye-0.480
 Publication-0.468
Token Ki
Token ID21927
Feature activation-8.42e-01

Pos logit contributions
quartered+1.312
 DOT+1.267
Senator+1.186
 divides+1.170
idth+1.135
Neg logit contributions
burg-0.991
selves-0.984
lishing-0.901
aye-0.862
 Publication-0.837
Tokenir
Token ID343
Feature activation-4.14e-01

Pos logit contributions
quartered+0.715
 DOT+0.690
Senator+0.645
 divides+0.635
idth+0.621
Neg logit contributions
burg-0.555
selves-0.547
lishing-0.501
aye-0.487
 Publication-0.469
Token in
Token ID287
Feature activation+4.08e-01

Pos logit contributions
quartered+0.310
 DOT+0.298
Senator+0.277
 divides+0.272
idth+0.263
Neg logit contributions
burg-0.313
selves-0.310
lishing-0.287
aye-0.278
 Publication-0.271
Token the
Token ID262
Feature activation+2.35e-01

Pos logit contributions
burg+0.328
selves+0.323
lishing+0.303
aye+0.291
 Publication+0.286
Neg logit contributions
quartered-0.288
 DOT-0.274
Senator-0.254
 divides-0.248
idth-0.240
Token capital
Token ID3139
Feature activation-1.80e+00

Pos logit contributions
burg+0.204
selves+0.202
lishing+0.189
aye+0.184
 Publication+0.180
Neg logit contributions
quartered-0.151
 DOT-0.143
Senator-0.132
 divides-0.127
idth-0.125
Token,
Token ID11
Feature activation-2.75e-03

Pos logit contributions
quartered+1.282
 DOT+1.233
Senator+1.137
 divides+1.119
idth+1.075
Neg logit contributions
burg-1.418
selves-1.418
lishing-1.317
aye-1.269
 Publication-1.241
Token J
Token ID449
Feature activation+6.97e-01

Pos logit contributions
quartered+0.002
 DOT+0.002
Senator+0.002
 divides+0.002
idth+0.002
Neg logit contributions
burg-0.002
selves-0.002
lishing-0.002
aye-0.002
 Publication-0.002
Tokenuba
Token ID22013
Feature activation+4.00e-02

Pos logit contributions
burg+0.411
selves+0.402
lishing+0.367
aye+0.353
 Publication+0.340
Neg logit contributions
quartered-0.643
 DOT-0.623
Senator-0.584
 divides-0.576
 elector-0.560
Token.
Token ID13
Feature activation-4.79e-01

Pos logit contributions
burg+0.031
selves+0.030
lishing+0.028
aye+0.027
 Publication+0.026
Neg logit contributions
quartered-0.030
 DOT-0.029
Senator-0.026
 divides-0.026
idth-0.025
Token
Token ID198
Feature activation-1.33e-01

Pos logit contributions
quartered+0.331
 DOT+0.315
Senator+0.290
 divides+0.285
idth+0.275
Neg logit contributions
burg-0.391
selves-0.389
lishing-0.362
aye-0.350
 Publication-0.344