TOP ACTIVATIONS
MAX = 4.755

<|endoftext|> Donald Mir alle / Getty Images The latest
<|endoftext|> OT TA WA ( Reuters ) - Canada
) party | AFP / Getty Opinion It won
holes in American politics ' MORE as a 68 - year
holes in American politics ' MORE completed a swing through North
- a ... <|endoftext|> Buy Photo F SU target Marvin Wilson
to murder east Ukrainians und ist urbed . At
than 10 years in prison MORE and tick ed through her
is important to keep in mind that teachers who choose to
<|endoftext|> BR USS ELS ( Reuters ) - The world
Photo by John Moore / Getty Images ) Jeff Sessions Sen
holes in American politics ' MORE , told 77 WA BC
<|endoftext|> HA V ANA ( Reuters ) - Latin American and
professional help . Continue Reading Below Advertisement 9 Work
. <|endoftext|> Abstract Background Depression presents a significant burden
ast and T UG developer Nerd Kingdom last week signed an
: Clive Brun skill / Getty Images That low
gun ! R ally - Maximum strength , but
: Jason Bah r / Getty Images ) South
your Vend read Deck , but just about any strategy you

MIN ACTIVATIONS
MIN = -4.682

urgency for the 50 or so workers at the airport .
, even I can 't deny how visually stunning and colorful
43 ( S 4 ) DOI : 10 . 10 86
his s remind us not only of lost fidelity , but
Google Scholar Crossref | ISI M ol cho
--> src / main . rs : 2 :
apples - to - app les comparison . "
idea of a St . Louis purge during the Ferguson protests
Fourth Div isions all at once on S oky oku Hill
Scholar Crossref | Medline | ISI My ers ,
Scholar Crossref | Medline | ISI P el leg
to this guy . Not only did he plan a huge
I saw how it not only truly helped sick people ,
advice of Gand alf , chooses companions for him . The
national drug program is not only possible it is already
judicial races because it not only ske ws voters
to temperatures that reached 185 degrees F and 85 percent relative
changed significantly , but other than that , it mostly just
body . I feel as though Eve was like ,
GDP have already changed not only their legal headquarters but ,

INTERVAL 2.396 - 4.755
CONTAINS 0.321%

shopping malls , car show rooms , banks and houses of
, wasted no time in expressing its delight after that deal
has opened the door for nude and sexually explicit photos and
aled felon ious actions rise to the level of treason in
, let me offer a story about my experience with tape

INTERVAL 0.036 - 2.396
CONTAINS 48.111%

. Never mind the fact that he
itz ki , Mont a Ellis , Devin Harris , Raymond
Secretary Sk var la made it clear that Good
the official said , speaking to Ha aretz on condition of
. The combo had never been done before Mitchell applied it

INTERVAL -2.323 - 0.036
CONTAINS 51.123%

massive time capsule in which , depending on the neighborhood ,
after - tax income would total $ 150 million .
second innings . " When I look back on
count it . I felt bad rejecting all the Ny x
and the government s finances collapse .

INTERVAL -4.682 - -2.323
CONTAINS 0.444%

such an approach alien ates young voters , says D ella
May , but at that time discussions were still said to
pull . http :// api . c ensus .
Thompson came within 7 percentage points of defeating Republican Ron Est
us hers *) Total guaranteed at signing : $
Token<|endoftext|>
Token ID50256
Feature activation+2.89e-01

Pos logit contributions
zsche+0.520
:,+0.496
athing+0.489
ilan+0.487
 df+0.465
Neg logit contributions
ooters-0.566
 Krish-0.515
 Chaser-0.508
 Gutenberg-0.504
 Rib-0.501
TokenDonald
Token ID7371
Feature activation+9.66e-01

Pos logit contributions
ooters+0.218
 Krish+0.199
 Chaser+0.196
 Gutenberg+0.195
 Rib+0.194
Neg logit contributions
zsche-0.203
:,-0.191
athing-0.190
ilan-0.189
 df-0.180
Token Mir
Token ID7381
Feature activation-1.84e+00

Pos logit contributions
ooters+0.602
 Krish+0.542
 Chaser+0.529
 Gutenberg+0.526
 Rib+0.524
Neg logit contributions
zsche-0.806
:,-0.770
athing-0.764
ilan-0.762
 df-0.730
Tokenalle
Token ID6765
Feature activation-7.38e-01

Pos logit contributions
zsche+1.402
:,+1.336
athing+1.324
ilan+1.318
 df+1.258
Neg logit contributions
ooters-1.281
 Krish-1.164
 Chaser-1.141
 Gutenberg-1.136
 Rib-1.128
Token/
Token ID14
Feature activation+5.96e-01

Pos logit contributions
zsche+0.573
:,+0.547
athing+0.541
ilan+0.539
 df+0.515
Neg logit contributions
ooters-0.502
 Krish-0.457
 Chaser-0.446
 Gutenberg-0.444
 Rib-0.443
TokenGetty
Token ID6633
Feature activation+4.76e+00

Pos logit contributions
ooters+0.432
 Krish+0.395
 Chaser+0.388
 Rib+0.385
 Gutenberg+0.385
Neg logit contributions
zsche-0.437
:,-0.415
athing-0.410
ilan-0.408
 df-0.390
Token Images
Token ID5382
Feature activation+1.54e+00

Pos logit contributions
ooters+1.620
 Krish+1.355
 Gutenberg+1.293
 Rib+1.272
 Chaser+1.260
Neg logit contributions
zsche-5.237
:,-5.101
athing-5.017
ilan-5.015
 df-4.876
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
ooters+0.975
 Krish+0.881
 Chaser+0.855
 Gutenberg+0.855
 Rib+0.852
Neg logit contributions
zsche-1.270
:,-1.212
athing-1.196
ilan-1.195
 df-1.146
Token
Token ID198
Feature activation-1.20e+00

Pos logit contributions
zsche+0.854
:,+0.817
athing+0.810
ilan+0.807
 df+0.772
Neg logit contributions
ooters-0.679
 Krish-0.612
 Chaser-0.599
 Gutenberg-0.595
 Rib-0.591
TokenThe
Token ID464
Feature activation-5.59e-01

Pos logit contributions
zsche+0.833
:,+0.791
athing+0.783
ilan+0.779
 df+0.740
Neg logit contributions
ooters-0.913
 Krish-0.837
 Chaser-0.822
 Gutenberg-0.818
 Rib-0.814
Token latest
Token ID3452
Feature activation-1.02e+00

Pos logit contributions
zsche+0.435
:,+0.414
athing+0.410
ilan+0.410
 df+0.389
Neg logit contributions
ooters-0.380
 Krish-0.346
 Chaser-0.339
 Gutenberg-0.337
 Rib-0.336
Token<|endoftext|>
Token ID50256
Feature activation-1.08e-01

Pos logit contributions
zsche+0.229
:,+0.218
athing+0.216
ilan+0.215
 df+0.206
Neg logit contributions
ooters-0.200
 Krish-0.182
 Chaser-0.178
 Gutenberg-0.177
 Rib-0.176
TokenOT
Token ID2394
Feature activation-3.16e-01

Pos logit contributions
zsche+0.076
:,+0.072
athing+0.071
ilan+0.071
 df+0.067
Neg logit contributions
ooters-0.082
 Krish-0.075
 Chaser-0.073
 Gutenberg-0.073
 Rib-0.073
TokenTA
Token ID5603
Feature activation-2.34e-01

Pos logit contributions
zsche+0.197
:,+0.185
athing+0.184
ilan+0.183
 df+0.173
Neg logit contributions
ooters-0.263
 Krish-0.243
 Chaser-0.239
 Gutenberg-0.238
 Rib-0.237
TokenWA
Token ID15543
Feature activation+6.62e-01

Pos logit contributions
zsche+0.148
:,+0.138
athing+0.137
ilan+0.136
 df+0.129
Neg logit contributions
ooters-0.194
 Krish-0.179
 Chaser-0.176
 Gutenberg-0.175
 Rib-0.175
Token (
Token ID357
Feature activation+1.75e+00

Pos logit contributions
ooters+0.399
 Krish+0.357
 Chaser+0.349
 Rib+0.347
 Gutenberg+0.346
Neg logit contributions
zsche-0.566
:,-0.539
athing-0.537
ilan-0.535
 df-0.512
TokenReuters
Token ID12637
Feature activation+4.73e+00

Pos logit contributions
ooters+1.069
 Krish+0.968
 Chaser+0.946
 Gutenberg+0.936
 Rib+0.934
Neg logit contributions
zsche-1.476
:,-1.405
athing-1.403
ilan-1.388
 df-1.334
Token)
Token ID8
Feature activation+2.77e+00

Pos logit contributions
ooters+1.830
 Krish+1.599
 Rib+1.555
 Observer+1.512
 Chaser+1.511
Neg logit contributions
zsche-5.010
athing-4.749
:,-4.726
ilan-4.721
 df-4.584
Token -
Token ID532
Feature activation+2.42e+00

Pos logit contributions
ooters+2.063
 Krish+1.961
 Rib+1.908
 Gutenberg+1.895
 Chaser+1.886
Neg logit contributions
zsche-1.957
:,-1.817
athing-1.809
ilan-1.800
 df-1.715
Token Canada
Token ID3340
Feature activation+7.91e-01

Pos logit contributions
ooters+1.793
 Krish+1.678
 Rib+1.650
 Chaser+1.638
 Gutenberg+1.632
Neg logit contributions
zsche-1.745
:,-1.615
athing-1.605
ilan-1.602
 df-1.507
Token
Token ID447
Feature activation+1.31e+00

Pos logit contributions
ooters+0.532
 Krish+0.484
 Chaser+0.475
 Gutenberg+0.472
 Rib+0.471
Neg logit contributions
zsche-0.625
:,-0.591
ilan-0.586
athing-0.584
 df-0.554
Token
Token ID247
Feature activation+6.33e-01

Pos logit contributions
ooters+0.794
 Krish+0.712
 Chaser+0.695
 Rib+0.692
 Gutenberg+0.680
Neg logit contributions
zsche-1.132
ilan-1.068
athing-1.061
:,-1.060
 df-1.003
Token)
Token ID8
Feature activation+2.04e+00

Pos logit contributions
ooters+0.304
 Krish+0.268
 Chaser+0.259
 Gutenberg+0.259
 Rib+0.254
Neg logit contributions
zsche-0.550
:,-0.530
ilan-0.525
athing-0.525
 df-0.506
Token party
Token ID2151
Feature activation+1.57e-01

Pos logit contributions
ooters+1.274
 Krish+1.165
 Chaser+1.149
 Rib+1.135
 Gutenberg+1.129
Neg logit contributions
zsche-1.691
:,-1.604
ilan-1.580
athing-1.578
 df-1.489
Token |
Token ID930
Feature activation+1.53e+00

Pos logit contributions
ooters+0.105
 Krish+0.096
 Chaser+0.095
 Gutenberg+0.094
 Rib+0.094
Neg logit contributions
zsche-0.123
:,-0.117
ilan-0.115
athing-0.115
 df-0.109
Token AFP
Token ID8916
Feature activation+1.62e+00

Pos logit contributions
ooters+0.925
 Krish+0.837
 Chaser+0.819
 Gutenberg+0.815
 Rib+0.813
Neg logit contributions
zsche-1.299
:,-1.243
athing-1.237
ilan-1.228
 df-1.176
Token/
Token ID14
Feature activation+5.29e-01

Pos logit contributions
ooters+1.015
 Krish+0.925
 Chaser+0.904
 Rib+0.900
 Gutenberg+0.892
Neg logit contributions
zsche-1.341
:,-1.279
athing-1.274
ilan-1.262
 df-1.208
TokenGetty
Token ID6633
Feature activation+4.72e+00

Pos logit contributions
ooters+0.447
 Krish+0.414
 Chaser+0.407
 Gutenberg+0.405
 Rib+0.404
Neg logit contributions
zsche-0.325
:,-0.305
athing-0.302
ilan-0.300
 df-0.283
Token Opinion
Token ID29525
Feature activation+1.46e+00

Pos logit contributions
ooters+2.029
 Krish+1.795
 Gutenberg+1.782
 Chaser+1.776
 Rib+1.755
Neg logit contributions
zsche-4.698
:,-4.579
athing-4.494
ilan-4.452
 df-4.278
Token It
Token ID632
Feature activation+3.45e-01

Pos logit contributions
ooters+1.018
 Krish+0.927
 Chaser+0.919
 Gutenberg+0.917
 Rib+0.916
Neg logit contributions
zsche-1.097
:,-1.050
ilan-1.039
athing-1.038
 df-0.973
Token won
Token ID1839
Feature activation-2.37e-01

Pos logit contributions
ooters+0.241
 Krish+0.219
 Chaser+0.214
 Gutenberg+0.214
 Rib+0.212
Neg logit contributions
zsche-0.261
:,-0.249
athing-0.246
ilan-0.246
 df-0.234
Token
Token ID447
Feature activation+9.13e-01

Pos logit contributions
zsche+0.225
:,+0.217
athing+0.215
ilan+0.214
 df+0.207
Neg logit contributions
ooters-0.121
 Krish-0.106
 Chaser-0.103
 Gutenberg-0.102
 Rib-0.101
Token
Token ID247
Feature activation+4.06e-01

Pos logit contributions
ooters+0.600
 Krish+0.544
 Chaser+0.534
 Gutenberg+0.529
 Rib+0.528
Neg logit contributions
zsche-0.732
:,-0.694
athing-0.689
ilan-0.689
 df-0.655
Tokenholes
Token ID28439
Feature activation-3.41e-01

Pos logit contributions
zsche+0.614
:,+0.589
athing+0.586
ilan+0.559
 df+0.548
Neg logit contributions
ooters-0.885
 Gutenberg-0.820
 Chaser-0.820
 Krish-0.819
 Rib-0.814
Token in
Token ID287
Feature activation-6.16e-01

Pos logit contributions
zsche+0.278
:,+0.264
ilan+0.263
athing+0.260
 df+0.248
Neg logit contributions
ooters-0.222
 Krish-0.197
 Chaser-0.195
 Gutenberg-0.193
 Stephenson-0.191
Token American
Token ID1605
Feature activation+2.90e-01

Pos logit contributions
zsche+0.503
:,+0.483
athing+0.479
ilan+0.476
 df+0.458
Neg logit contributions
ooters-0.392
 Krish-0.352
 Chaser-0.345
 Gutenberg-0.344
 Rib-0.341
Token politics
Token ID4819
Feature activation-1.38e+00

Pos logit contributions
ooters+0.231
 Krish+0.212
 Chaser+0.209
 Gutenberg+0.208
 Rib+0.207
Neg logit contributions
zsche-0.192
:,-0.181
ilan-0.180
athing-0.179
 df-0.169
Token'
Token ID6
Feature activation-1.01e+00

Pos logit contributions
zsche+1.084
:,+1.034
athing+1.024
ilan+1.022
 df+0.975
Neg logit contributions
ooters-0.932
 Krish-0.842
 Chaser-0.824
 Gutenberg-0.822
 Rib-0.815
Token MORE
Token ID12011
Feature activation+4.69e+00

Pos logit contributions
zsche+0.910
:,+0.865
athing+0.858
ilan+0.856
 df+0.825
Neg logit contributions
ooters-0.563
 Krish-0.500
 Chaser-0.488
 Gutenberg-0.483
 Rib-0.481
Token as
Token ID355
Feature activation+1.73e+00

Pos logit contributions
ooters+2.777
 Krish+2.526
 Chaser+2.463
 Rib+2.434
 Gutenberg+2.425
Neg logit contributions
zsche-4.046
ilan-3.748
athing-3.709
:,-3.680
 df-3.538
Token a
Token ID257
Feature activation+1.51e+00

Pos logit contributions
ooters+1.112
 Krish+1.011
 Chaser+0.990
 Gutenberg+0.987
 Rib+0.980
Neg logit contributions
zsche-1.409
:,-1.336
ilan-1.332
athing-1.318
 df-1.265
Token 68
Token ID8257
Feature activation+1.29e+00

Pos logit contributions
ooters+1.087
 Krish+1.006
 Chaser+1.001
 Gutenberg+0.987
 Rib+0.979
Neg logit contributions
zsche-1.099
ilan-1.040
:,-1.038
athing-1.023
 df-0.975
Token-
Token ID12
Feature activation+9.19e-01

Pos logit contributions
ooters+0.722
 Krish+0.640
 Chaser+0.624
 Gutenberg+0.620
 Rib+0.615
Neg logit contributions
zsche-1.157
:,-1.111
athing-1.102
ilan-1.098
 df-1.056
Tokenyear
Token ID1941
Feature activation+5.79e-01

Pos logit contributions
ooters+0.707
 Krish+0.649
 Chaser+0.637
 Gutenberg+0.634
 Rib+0.631
Neg logit contributions
zsche-0.635
:,-0.598
athing-0.592
ilan-0.590
 df-0.561
Tokenholes
Token ID28439
Feature activation-3.19e-01

Pos logit contributions
zsche+0.566
:,+0.542
athing+0.538
ilan+0.518
 df+0.504
Neg logit contributions
ooters-0.798
 Chaser-0.738
 Krish-0.737
 Gutenberg-0.737
 Rib-0.731
Token in
Token ID287
Feature activation-6.31e-01

Pos logit contributions
zsche+0.261
:,+0.247
ilan+0.247
athing+0.244
 df+0.232
Neg logit contributions
ooters-0.206
 Krish-0.183
 Chaser-0.180
 Gutenberg-0.179
 Stephenson-0.177
Token American
Token ID1605
Feature activation+4.37e-01

Pos logit contributions
zsche+0.502
:,+0.482
athing+0.478
ilan+0.475
 df+0.456
Neg logit contributions
ooters-0.414
 Krish-0.373
 Chaser-0.366
 Gutenberg-0.365
 Rib-0.361
Token politics
Token ID4819
Feature activation-1.38e+00

Pos logit contributions
ooters+0.350
 Krish+0.322
 Chaser+0.317
 Gutenberg+0.316
 Rib+0.313
Neg logit contributions
zsche-0.288
:,-0.272
ilan-0.270
athing-0.268
 df-0.253
Token'
Token ID6
Feature activation-1.22e+00

Pos logit contributions
zsche+1.089
:,+1.038
athing+1.029
ilan+1.027
 df+0.979
Neg logit contributions
ooters-0.932
 Krish-0.842
 Chaser-0.823
 Gutenberg-0.821
 Rib-0.814
Token MORE
Token ID12011
Feature activation+4.50e+00

Pos logit contributions
zsche+1.088
:,+1.042
athing+1.034
ilan+1.031
 df+0.992
Neg logit contributions
ooters-0.694
 Krish-0.617
 Chaser-0.601
 Gutenberg-0.597
 Rib-0.593
Token completed
Token ID5668
Feature activation+1.18e+00

Pos logit contributions
ooters+2.841
 Krish+2.629
 Rib+2.539
 Chaser+2.537
 Observer+2.529
Neg logit contributions
zsche-3.704
ilan-3.424
:,-3.375
athing-3.345
 df-3.172
Token a
Token ID257
Feature activation+7.98e-01

Pos logit contributions
ooters+0.771
 Krish+0.698
 Chaser+0.683
 Gutenberg+0.681
 Rib+0.676
Neg logit contributions
zsche-0.956
:,-0.910
athing-0.902
ilan-0.900
 df-0.859
Token swing
Token ID9628
Feature activation+5.81e-01

Pos logit contributions
ooters+0.516
 Krish+0.473
 Gutenberg+0.462
 Chaser+0.462
 Rib+0.458
Neg logit contributions
zsche-0.643
:,-0.612
ilan-0.608
athing-0.606
 df-0.574
Token through
Token ID832
Feature activation+2.31e+00

Pos logit contributions
ooters+0.420
 Krish+0.383
 Chaser+0.375
 Gutenberg+0.373
 Rib+0.371
Neg logit contributions
zsche-0.427
:,-0.406
athing-0.403
ilan-0.401
 df-0.382
Token North
Token ID2258
Feature activation+5.76e-01

Pos logit contributions
ooters+1.581
 Krish+1.460
 Gutenberg+1.420
 Rib+1.418
 Chaser+1.406
Neg logit contributions
zsche-1.791
:,-1.686
ilan-1.668
athing-1.658
 df-1.583
Token-
Token ID12
Feature activation+1.58e-01

Pos logit contributions
zsche+0.529
:,+0.507
athing+0.503
ilan+0.500
 df+0.480
Neg logit contributions
ooters-0.402
 Krish-0.361
 Chaser-0.353
 Gutenberg-0.351
 Rib-0.348
Tokena
Token ID64
Feature activation-3.41e-01

Pos logit contributions
ooters+0.102
 Krish+0.092
 Chaser+0.090
 Gutenberg+0.090
 Rib+0.089
Neg logit contributions
zsche-0.129
:,-0.123
athing-0.122
ilan-0.122
 df-0.116
Token...
Token ID986
Feature activation-2.05e+00

Pos logit contributions
zsche+0.281
:,+0.270
athing+0.267
ilan+0.266
 df+0.256
Neg logit contributions
ooters-0.215
 Krish-0.194
 Chaser-0.190
 Gutenberg-0.189
 Rib-0.187
Token<|endoftext|>
Token ID50256
Feature activation+4.35e-01

Pos logit contributions
zsche+1.592
:,+1.524
athing+1.503
ilan+1.493
 df+1.437
Neg logit contributions
ooters-1.392
 Krish-1.244
 Chaser-1.236
 Gutenberg-1.224
 Rib-1.213
TokenBuy
Token ID14518
Feature activation+3.85e-01

Pos logit contributions
ooters+0.328
 Krish+0.300
 Chaser+0.295
 Gutenberg+0.293
 Rib+0.292
Neg logit contributions
zsche-0.307
:,-0.289
athing-0.287
ilan-0.285
 df-0.272
Token Photo
Token ID5555
Feature activation+4.49e+00

Pos logit contributions
ooters+0.157
 Krish+0.134
 Chaser+0.129
 Gutenberg+0.128
 Rib+0.127
Neg logit contributions
zsche-0.404
:,-0.389
athing-0.386
ilan-0.386
 df-0.373
Token F
Token ID376
Feature activation-4.32e-01

Pos logit contributions
ooters+2.900
 Krish+2.720
 Rib+2.644
 Chaser+2.627
 Gutenberg+2.613
Neg logit contributions
zsche-3.634
:,-3.384
ilan-3.378
athing-3.372
 df-3.195
TokenSU
Token ID12564
Feature activation+1.71e+00

Pos logit contributions
zsche+0.313
:,+0.298
athing+0.294
ilan+0.293
 df+0.279
Neg logit contributions
ooters-0.317
 Krish-0.290
 Chaser-0.284
 Gutenberg-0.282
 Rib-0.281
Token target
Token ID2496
Feature activation+2.05e+00

Pos logit contributions
ooters+1.218
 Krish+1.127
 Chaser+1.099
 Rib+1.095
 Gutenberg+1.089
Neg logit contributions
zsche-1.277
:,-1.197
ilan-1.196
athing-1.180
 df-1.115
Token Marvin
Token ID35105
Feature activation+5.32e-01

Pos logit contributions
ooters+1.357
 Krish+1.269
 Chaser+1.212
 Rib+1.210
 Gutenberg+1.208
Neg logit contributions
zsche-1.634
:,-1.542
ilan-1.538
athing-1.527
 df-1.448
Token Wilson
Token ID8127
Feature activation+1.25e+00

Pos logit contributions
ooters+0.323
 Krish+0.291
 Chaser+0.284
 Gutenberg+0.284
 Rib+0.282
Neg logit contributions
zsche-0.453
:,-0.432
athing-0.430
ilan-0.428
 df-0.410
Token to
Token ID284
Feature activation-4.89e-01

Pos logit contributions
zsche+1.027
:,+0.987
ilan+0.974
athing+0.973
 df+0.936
Neg logit contributions
ooters-0.791
 Krish-0.709
 Chaser-0.694
 Gutenberg-0.688
 Rib-0.684
Token murder
Token ID5123
Feature activation+8.27e-01

Pos logit contributions
zsche+0.347
:,+0.326
athing+0.324
ilan+0.323
 df+0.304
Neg logit contributions
ooters-0.369
 Krish-0.337
 Chaser-0.331
 Gutenberg-0.329
 Rib-0.328
Token east
Token ID7627
Feature activation+7.42e-01

Pos logit contributions
ooters+0.513
 Krish+0.467
 Rib+0.455
 Gutenberg+0.453
 Chaser+0.453
Neg logit contributions
zsche-0.694
:,-0.654
athing-0.654
ilan-0.649
 df-0.620
Token Ukrainians
Token ID49114
Feature activation+5.39e-01

Pos logit contributions
ooters+0.471
 Krish+0.427
 Rib+0.415
 Chaser+0.412
 Gutenberg+0.409
Neg logit contributions
zsche-0.616
:,-0.582
athing-0.580
ilan-0.574
 df-0.553
Token und
Token ID3318
Feature activation+1.28e+00

Pos logit contributions
ooters+0.358
 Krish+0.325
 Chaser+0.317
 Rib+0.315
 Gutenberg+0.315
Neg logit contributions
zsche-0.429
:,-0.404
athing-0.402
ilan-0.402
 df-0.383
Tokenist
Token ID396
Feature activation+4.32e+00

Pos logit contributions
ooters+0.994
 Krish+0.916
 Chaser+0.897
 Rib+0.895
 Gutenberg+0.887
Neg logit contributions
zsche-0.871
:,-0.822
athing-0.810
ilan-0.807
 df-0.767
Tokenurbed
Token ID37694
Feature activation-1.71e+00

Pos logit contributions
ooters+2.426
 Krish+2.138
 Rib+2.099
 Gutenberg+2.077
 Observer+2.035
Neg logit contributions
zsche-3.847
:,-3.658
athing-3.598
ilan-3.545
 df-3.490
Token.
Token ID13
Feature activation-2.33e-01

Pos logit contributions
zsche+1.288
:,+1.230
athing+1.217
ilan+1.211
 df+1.157
Neg logit contributions
ooters-1.202
 Krish-1.093
 Chaser-1.073
 Gutenberg-1.066
 Rib-1.059
Token
Token ID198
Feature activation+3.31e-01

Pos logit contributions
zsche+0.192
:,+0.182
athing+0.181
ilan+0.181
 df+0.173
Neg logit contributions
ooters-0.148
 Krish-0.134
 Chaser-0.131
 Gutenberg-0.130
 Rib-0.130
Token
Token ID198
Feature activation+3.52e-01

Pos logit contributions
ooters+0.202
 Krish+0.181
 Chaser+0.176
 Rib+0.176
 Gutenberg+0.176
Neg logit contributions
zsche-0.283
:,-0.266
athing-0.266
ilan-0.265
 df-0.254
TokenAt
Token ID2953
Feature activation-1.75e-01

Pos logit contributions
ooters+0.269
 Krish+0.246
 Chaser+0.242
Barn+0.241
 Gutenberg+0.240
Neg logit contributions
zsche-0.247
:,-0.230
athing-0.229
ilan-0.228
 df-0.217
Token than
Token ID621
Feature activation+6.84e-01

Pos logit contributions
zsche+0.442
:,+0.419
athing+0.417
ilan+0.416
 df+0.398
Neg logit contributions
ooters-0.325
 Krish-0.294
 Chaser-0.288
 Gutenberg-0.288
 Rib-0.287
Token 10
Token ID838
Feature activation-7.91e-01

Pos logit contributions
ooters+0.417
 Krish+0.378
 Rib+0.369
 Stephenson+0.369
 Chaser+0.367
Neg logit contributions
zsche-0.586
:,-0.548
ilan-0.545
athing-0.541
 df-0.521
Token years
Token ID812
Feature activation-3.40e-02

Pos logit contributions
zsche+0.668
:,+0.647
ilan+0.637
athing+0.634
 df+0.614
Neg logit contributions
ooters-0.479
 Krish-0.425
 Gutenberg-0.425
 Chaser-0.421
 Rib-0.416
Token in
Token ID287
Feature activation-5.39e-01

Pos logit contributions
zsche+0.030
:,+0.029
athing+0.028
ilan+0.028
 df+0.027
Neg logit contributions
ooters-0.020
 Krish-0.018
 Chaser-0.017
 Gutenberg-0.017
 Rib-0.017
Token prison
Token ID3770
Feature activation-1.13e-01

Pos logit contributions
zsche+0.476
:,+0.458
athing+0.454
ilan+0.452
 df+0.435
Neg logit contributions
ooters-0.309
 Krish-0.275
 Chaser-0.268
 Gutenberg-0.266
 Rib-0.264
Token MORE
Token ID12011
Feature activation+4.15e+00

Pos logit contributions
zsche+0.096
:,+0.090
ilan+0.090
athing+0.089
 df+0.085
Neg logit contributions
ooters-0.069
 Krish-0.064
 Gutenberg-0.063
 Rib-0.061
 Stephenson-0.061
Token and
Token ID290
Feature activation+6.64e-01

Pos logit contributions
ooters+2.498
 Krish+2.277
 Chaser+2.214
 Rib+2.199
 Gutenberg+2.179
Neg logit contributions
zsche-3.571
ilan-3.285
athing-3.251
:,-3.231
 df-3.135
Token tick
Token ID4378
Feature activation+2.29e+00

Pos logit contributions
ooters+0.426
 Krish+0.387
 Gutenberg+0.377
 Chaser+0.375
 Rib+0.373
Neg logit contributions
zsche-0.546
:,-0.515
ilan-0.514
athing-0.511
 df-0.489
Tokened
Token ID276
Feature activation+1.26e+00

Pos logit contributions
ooters+1.535
 Krish+1.398
 Chaser+1.372
 Rib+1.356
 Gutenberg+1.355
Neg logit contributions
zsche-1.794
:,-1.712
athing-1.692
ilan-1.679
 df-1.609
Token through
Token ID832
Feature activation+6.70e-01

Pos logit contributions
ooters+0.792
 Krish+0.718
 Chaser+0.703
 Gutenberg+0.697
 Rib+0.695
Neg logit contributions
zsche-1.049
:,-0.995
athing-0.989
ilan-0.983
 df-0.940
Token her
Token ID607
Feature activation-2.48e-01

Pos logit contributions
ooters+0.439
 Krish+0.402
 Gutenberg+0.391
 Chaser+0.391
 Rib+0.388
Neg logit contributions
zsche-0.537
:,-0.511
ilan-0.505
athing-0.504
 df-0.481
Token is
Token ID318
Feature activation+1.37e-01

Pos logit contributions
zsche+0.450
:,+0.426
ilan+0.423
athing+0.422
 df+0.401
Neg logit contributions
ooters-0.457
 Krish-0.418
 Chaser-0.410
 Gutenberg-0.408
 Rib-0.406
Token important
Token ID1593
Feature activation+8.71e-01

Pos logit contributions
ooters+0.089
 Krish+0.082
 Chaser+0.080
 Gutenberg+0.079
 Rib+0.079
Neg logit contributions
zsche-0.110
:,-0.104
athing-0.104
ilan-0.103
 df-0.098
Token to
Token ID284
Feature activation+7.15e-01

Pos logit contributions
ooters+0.546
 Krish+0.491
 Chaser+0.482
 Gutenberg+0.479
 Rib+0.474
Neg logit contributions
zsche-0.723
:,-0.695
athing-0.684
ilan-0.682
 df-0.657
Token keep
Token ID1394
Feature activation+1.29e+00

Pos logit contributions
ooters+0.440
 Krish+0.397
 Rib+0.385
 Chaser+0.384
 Gutenberg+0.381
Neg logit contributions
zsche-0.602
ilan-0.570
athing-0.569
:,-0.569
 df-0.539
Token in
Token ID287
Feature activation+6.87e-02

Pos logit contributions
ooters+0.736
 Krish+0.661
 Chaser+0.640
 Gutenberg+0.640
 Rib+0.635
Neg logit contributions
zsche-1.138
:,-1.092
athing-1.085
ilan-1.080
 df-1.036
Token mind
Token ID2000
Feature activation+4.14e+00

Pos logit contributions
ooters+0.056
 Krish+0.051
 Chaser+0.051
 Gutenberg+0.050
 Rib+0.050
Neg logit contributions
zsche-0.044
:,-0.042
athing-0.041
ilan-0.041
 df-0.039
Token that
Token ID326
Feature activation+6.62e-01

Pos logit contributions
ooters+2.353
 Krish+2.148
 Gutenberg+2.072
 Rib+2.048
 Chaser+2.035
Neg logit contributions
zsche-3.623
ilan-3.470
athing-3.462
:,-3.448
 df-3.279
Token teachers
Token ID7799
Feature activation+4.14e-01

Pos logit contributions
ooters+0.433
 Krish+0.397
 Gutenberg+0.385
 Chaser+0.384
 Rib+0.383
Neg logit contributions
zsche-0.529
:,-0.504
athing-0.502
ilan-0.501
 df-0.474
Token who
Token ID508
Feature activation+8.64e-01

Pos logit contributions
ooters+0.275
 Krish+0.250
 Gutenberg+0.243
 Chaser+0.242
 Rib+0.240
Neg logit contributions
zsche-0.330
:,-0.312
ilan-0.310
athing-0.308
 df-0.293
Token choose
Token ID3853
Feature activation-2.19e+00

Pos logit contributions
ooters+0.611
 Krish+0.559
 Chaser+0.543
 Gutenberg+0.542
 Rib+0.535
Neg logit contributions
zsche-0.654
:,-0.613
ilan-0.613
athing-0.608
 df-0.577
Token to
Token ID284
Feature activation-2.23e-01

Pos logit contributions
zsche+1.737
:,+1.673
ilan+1.647
athing+1.647
 df+1.580
Neg logit contributions
ooters-1.433
 Krish-1.284
 Chaser-1.266
 Gutenberg-1.251
 Rib-1.247
Token<|endoftext|>
Token ID50256
Feature activation+2.79e-01

Pos logit contributions
ooters+0.896
 Krish+0.820
 Chaser+0.804
 Rib+0.799
 Gutenberg+0.798
Neg logit contributions
zsche-0.997
:,-0.946
athing-0.938
ilan-0.937
 df-0.884
TokenBR
Token ID11473
Feature activation-5.48e-02

Pos logit contributions
ooters+0.210
 Krish+0.193
 Chaser+0.189
 Gutenberg+0.188
 Rib+0.187
Neg logit contributions
zsche-0.196
:,-0.185
athing-0.184
ilan-0.183
 df-0.174
TokenUSS
Token ID32835
Feature activation+1.11e-01

Pos logit contributions
zsche+0.040
:,+0.038
athing+0.037
ilan+0.037
 df+0.035
Neg logit contributions
ooters-0.040
 Krish-0.037
 Chaser-0.036
 Gutenberg-0.036
 Rib-0.036
TokenELS
Token ID37142
Feature activation+9.90e-01

Pos logit contributions
ooters+0.060
 Krish+0.055
 Rib+0.052
 Stephenson+0.051
 Observer+0.051
Neg logit contributions
zsche-0.103
athing-0.098
:,-0.097
ilan-0.096
 df-0.093
Token (
Token ID357
Feature activation+1.75e+00

Pos logit contributions
ooters+0.674
 Krish+0.613
 Chaser+0.601
 Gutenberg+0.598
 Rib+0.598
Neg logit contributions
zsche-0.767
:,-0.728
athing-0.725
ilan-0.722
 df-0.687
TokenReuters
Token ID12637
Feature activation+4.08e+00

Pos logit contributions
ooters+1.018
 Krish+0.922
 Chaser+0.897
Barn+0.884
 Gutenberg+0.880
Neg logit contributions
zsche-1.523
:,-1.448
athing-1.447
ilan-1.427
 df-1.380
Token)
Token ID8
Feature activation+2.60e+00

Pos logit contributions
ooters+1.637
 Krish+1.422
 Rib+1.386
 Chaser+1.363
 Observer+1.355
Neg logit contributions
zsche-4.274
athing-4.054
:,-4.043
ilan-4.032
 df-3.916
Token -
Token ID532
Feature activation+2.48e+00

Pos logit contributions
ooters+1.962
 Krish+1.856
 Rib+1.806
 Gutenberg+1.799
 Chaser+1.791
Neg logit contributions
zsche-1.814
:,-1.682
athing-1.679
ilan-1.670
 df-1.589
Token The
Token ID383
Feature activation-7.02e-01

Pos logit contributions
ooters+1.668
 Krish+1.546
 Rib+1.516
 Chaser+1.509
 Gutenberg+1.505
Neg logit contributions
zsche-1.965
:,-1.851
ilan-1.824
athing-1.821
 df-1.725
Token world
Token ID995
Feature activation+3.18e-01

Pos logit contributions
zsche+0.489
:,+0.463
athing+0.457
ilan+0.456
 df+0.431
Neg logit contributions
ooters-0.537
 Krish-0.492
 Chaser-0.485
 Gutenberg-0.483
 Rib-0.480
Token
Token ID447
Feature activation-1.78e-01

Pos logit contributions
ooters+0.211
 Krish+0.191
 Gutenberg+0.188
 Chaser+0.188
 Rib+0.186
Neg logit contributions
zsche-0.252
:,-0.240
ilan-0.237
athing-0.237
 df-0.225
TokenPhoto
Token ID6191
Feature activation-2.19e-01

Pos logit contributions
zsche+0.026
:,+0.025
ilan+0.025
athing+0.025
 df+0.024
Neg logit contributions
ooters-0.020
 Krish-0.018
 Chaser-0.017
 Gutenberg-0.017
 Rib-0.017
Token by
Token ID416
Feature activation+1.61e-01

Pos logit contributions
zsche+0.157
:,+0.150
athing+0.148
ilan+0.148
 df+0.141
Neg logit contributions
ooters-0.162
 Krish-0.148
 Chaser-0.145
 Gutenberg-0.144
 Rib-0.143
Token John
Token ID1757
Feature activation+1.77e+00

Pos logit contributions
ooters+0.097
 Krish+0.088
 Chaser+0.085
 Gutenberg+0.085
 Rib+0.084
Neg logit contributions
zsche-0.137
:,-0.131
athing-0.130
ilan-0.130
 df-0.124
Token Moore
Token ID8877
Feature activation+3.32e-01

Pos logit contributions
ooters+1.099
 Krish+0.995
 Chaser+0.969
 Gutenberg+0.960
 Rib+0.960
Neg logit contributions
zsche-1.482
:,-1.416
athing-1.406
ilan-1.400
 df-1.343
Token/
Token ID14
Feature activation+2.96e-01

Pos logit contributions
ooters+0.224
 Krish+0.203
 Chaser+0.199
 Gutenberg+0.198
 Rib+0.198
Neg logit contributions
zsche-0.260
:,-0.248
athing-0.245
ilan-0.244
 df-0.233
TokenGetty
Token ID6633
Feature activation+4.07e+00

Pos logit contributions
ooters+0.223
 Krish+0.204
 Chaser+0.201
 Gutenberg+0.200
 Rib+0.199
Neg logit contributions
zsche-0.208
:,-0.198
athing-0.196
ilan-0.195
 df-0.185
Token Images
Token ID5382
Feature activation-1.84e-01

Pos logit contributions
ooters+1.303
 Krish+1.047
 Chaser+1.017
 Rib+1.004
 Gutenberg+1.001
Neg logit contributions
zsche-4.600
:,-4.456
athing-4.416
ilan-4.396
 df-4.267
Token)
Token ID8
Feature activation-7.61e-01

Pos logit contributions
zsche+0.172
:,+0.164
athing+0.164
ilan+0.163
 df+0.158
Neg logit contributions
ooters-0.096
 Krish-0.085
 Chaser-0.082
 Gutenberg-0.081
 Rib-0.080
Token Jeff
Token ID5502
Feature activation-5.69e-01

Pos logit contributions
zsche+0.620
:,+0.590
athing+0.584
ilan+0.583
 df+0.557
Neg logit contributions
ooters-0.492
 Krish-0.448
 Chaser-0.436
 Gutenberg-0.435
 Rib-0.433
Token Sessions
Token ID15184
Feature activation-6.71e-01

Pos logit contributions
zsche+0.489
:,+0.467
athing+0.464
ilan+0.461
 df+0.444
Neg logit contributions
ooters-0.342
 Krish-0.306
 Chaser-0.299
 Gutenberg-0.297
 Rib-0.296
Token Sen
Token ID2311
Feature activation-6.46e-01

Pos logit contributions
zsche+0.545
:,+0.519
athing+0.515
ilan+0.514
 df+0.491
Neg logit contributions
ooters-0.434
 Krish-0.394
 Chaser-0.385
 Gutenberg-0.382
 Rib-0.381
Tokenholes
Token ID28439
Feature activation-5.15e-01

Pos logit contributions
zsche+0.570
:,+0.547
athing+0.543
ilan+0.519
 df+0.506
Neg logit contributions
ooters-0.830
 Chaser-0.772
 Krish-0.769
 Gutenberg-0.769
 Rib-0.763
Token in
Token ID287
Feature activation-7.38e-01

Pos logit contributions
zsche+0.418
:,+0.397
ilan+0.396
athing+0.392
 df+0.373
Neg logit contributions
ooters-0.335
 Krish-0.299
 Chaser-0.294
 Gutenberg-0.292
 Rib-0.289
Token American
Token ID1605
Feature activation+8.50e-02

Pos logit contributions
zsche+0.584
:,+0.560
athing+0.555
ilan+0.551
 df+0.529
Neg logit contributions
ooters-0.489
 Krish-0.442
 Chaser-0.433
 Gutenberg-0.432
 Rib-0.428
Token politics
Token ID4819
Feature activation-1.55e+00

Pos logit contributions
ooters+0.067
 Krish+0.062
 Chaser+0.061
 Gutenberg+0.061
 Rib+0.060
Neg logit contributions
zsche-0.057
:,-0.053
ilan-0.053
athing-0.053
 df-0.050
Token'
Token ID6
Feature activation-9.93e-01

Pos logit contributions
zsche+1.204
:,+1.149
athing+1.138
ilan+1.134
 df+1.083
Neg logit contributions
ooters-1.056
 Krish-0.958
 Chaser-0.938
 Gutenberg-0.933
 Rib-0.928
Token MORE
Token ID12011
Feature activation+4.07e+00

Pos logit contributions
zsche+0.906
:,+0.859
athing+0.852
ilan+0.852
 df+0.820
Neg logit contributions
ooters-0.550
 Krish-0.489
 Chaser-0.478
 Gutenberg-0.472
 Rib-0.471
Token,
Token ID11
Feature activation+7.46e-01

Pos logit contributions
ooters+2.556
 Krish+2.313
 Chaser+2.241
 Gutenberg+2.231
 Rib+2.225
Neg logit contributions
zsche-3.394
ilan-3.144
athing-3.116
:,-3.098
 df-2.991
Token told
Token ID1297
Feature activation+1.44e+00

Pos logit contributions
ooters+0.502
 Krish+0.457
 Chaser+0.447
 Rib+0.443
 Gutenberg+0.443
Neg logit contributions
zsche-0.587
:,-0.554
ilan-0.552
athing-0.550
 df-0.523
Token 77
Token ID8541
Feature activation+1.55e+00

Pos logit contributions
ooters+0.929
 Krish+0.844
 Chaser+0.821
 Rib+0.813
 Gutenberg+0.806
Neg logit contributions
zsche-1.184
:,-1.125
ilan-1.109
athing-1.108
 df-1.057
TokenWA
Token ID15543
Feature activation+9.14e-01

Pos logit contributions
ooters+1.053
 Krish+0.966
 Chaser+0.948
 Rib+0.929
 Observer+0.925
Neg logit contributions
zsche-1.224
:,-1.147
ilan-1.138
athing-1.133
 df-1.078
TokenBC
Token ID2749
Feature activation+9.02e-01

Pos logit contributions
ooters+0.645
 Krish+0.590
 Chaser+0.576
 Rib+0.571
 Gutenberg+0.571
Neg logit contributions
zsche-0.694
:,-0.652
ilan-0.648
athing-0.648
 df-0.612
Token<|endoftext|>
Token ID50256
Feature activation+1.43e-01

Pos logit contributions
zsche+0.285
:,+0.271
athing+0.270
ilan+0.269
 df+0.257
Neg logit contributions
ooters-0.228
 Krish-0.208
 Chaser-0.203
 Gutenberg-0.202
 Rib-0.201
TokenHA
Token ID7801
Feature activation-5.02e-01

Pos logit contributions
ooters+0.108
 Krish+0.099
 Chaser+0.097
 Gutenberg+0.096
 Rib+0.096
Neg logit contributions
zsche-0.100
:,-0.095
athing-0.094
ilan-0.093
 df-0.089
TokenV
Token ID53
Feature activation+4.94e-02

Pos logit contributions
zsche+0.350
:,+0.330
athing+0.327
ilan+0.325
 df+0.309
Neg logit contributions
ooters-0.384
 Krish-0.354
 Chaser-0.344
 Gutenberg-0.343
 Rib-0.343
TokenANA
Token ID31574
Feature activation-2.74e-01

Pos logit contributions
ooters+0.041
 Krish+0.038
 Chaser+0.037
 Rib+0.037
 Gutenberg+0.037
Neg logit contributions
zsche-0.031
:,-0.029
athing-0.029
ilan-0.029
 df-0.027
Token (
Token ID357
Feature activation+1.48e+00

Pos logit contributions
zsche+0.216
:,+0.206
athing+0.204
ilan+0.204
 df+0.194
Neg logit contributions
ooters-0.183
 Krish-0.166
 Chaser-0.163
 Rib-0.162
 Gutenberg-0.161
TokenReuters
Token ID12637
Feature activation+4.05e+00

Pos logit contributions
ooters+0.892
 Krish+0.810
 Chaser+0.784
 Rib+0.774
 Gutenberg+0.771
Neg logit contributions
zsche-1.261
:,-1.200
athing-1.193
ilan-1.179
 df-1.142
Token)
Token ID8
Feature activation+2.90e+00

Pos logit contributions
ooters+1.631
 Krish+1.421
 Rib+1.382
 Chaser+1.359
 Observer+1.349
Neg logit contributions
zsche-4.243
athing-4.026
:,-4.003
ilan-4.002
 df-3.880
Token -
Token ID532
Feature activation+2.56e+00

Pos logit contributions
ooters+2.161
 Krish+2.052
 Rib+1.995
 Gutenberg+1.982
 Chaser+1.973
Neg logit contributions
zsche-2.044
:,-1.892
athing-1.889
ilan-1.880
 df-1.791
Token Latin
Token ID9133
Feature activation-6.76e-01

Pos logit contributions
ooters+1.651
 Krish+1.519
 Rib+1.485
 Chaser+1.469
 Gutenberg+1.460
Neg logit contributions
zsche-2.107
:,-1.990
ilan-1.952
athing-1.949
 df-1.870
Token American
Token ID1605
Feature activation-2.37e-01

Pos logit contributions
zsche+0.395
:,+0.369
athing+0.364
ilan+0.362
 df+0.339
Neg logit contributions
ooters-0.592
 Krish-0.549
 Chaser-0.542
 Gutenberg-0.540
 Rib-0.538
Token and
Token ID290
Feature activation-7.44e-01

Pos logit contributions
zsche+0.165
:,+0.156
athing+0.153
ilan+0.153
 df+0.144
Neg logit contributions
ooters-0.182
 Krish-0.167
 Chaser-0.165
 Gutenberg-0.164
 Rib-0.163
Token professional
Token ID4708
Feature activation-2.61e-01

Pos logit contributions
zsche+0.217
:,+0.206
athing+0.204
ilan+0.204
 df+0.193
Neg logit contributions
ooters-0.209
 Krish-0.190
 Chaser-0.187
 Gutenberg-0.186
 Rib-0.185
Token help
Token ID1037
Feature activation+5.23e-01

Pos logit contributions
zsche+0.202
:,+0.192
athing+0.190
ilan+0.190
 df+0.181
Neg logit contributions
ooters-0.179
 Krish-0.163
 Chaser-0.160
 Gutenberg-0.159
 Rib-0.158
Token.
Token ID13
Feature activation-3.74e-01

Pos logit contributions
ooters+0.350
 Krish+0.316
 Chaser+0.309
 Gutenberg+0.308
 Rib+0.306
Neg logit contributions
zsche-0.415
ilan-0.391
athing-0.390
:,-0.389
 df-0.371
Token Continue
Token ID10054
Feature activation+7.33e-01

Pos logit contributions
zsche+0.306
:,+0.292
athing+0.290
ilan+0.289
 df+0.276
Neg logit contributions
ooters-0.239
 Krish-0.216
 Chaser-0.212
 Gutenberg-0.210
 Rib-0.209
Token Reading
Token ID11725
Feature activation+2.92e+00

Pos logit contributions
ooters+0.458
 Krish+0.416
 Chaser+0.408
 Gutenberg+0.405
 Rib+0.402
Neg logit contributions
zsche-0.610
:,-0.578
athing-0.576
ilan-0.575
 df-0.550
Token Below
Token ID10383
Feature activation+4.03e+00

Pos logit contributions
ooters+1.701
 Krish+1.536
 Gutenberg+1.522
 Chaser+1.518
 Rib+1.483
Neg logit contributions
zsche-2.533
ilan-2.399
athing-2.396
:,-2.392
 df-2.281
Token Advertisement
Token ID39711
Feature activation+1.52e+00

Pos logit contributions
ooters+0.771
 Krish+0.542
 Gutenberg+0.533
 Chaser+0.528
 Observer+0.454
Neg logit contributions
zsche-5.042
:,-4.874
athing-4.865
ilan-4.817
 df-4.680
Token
Token ID198
Feature activation+1.08e-01

Pos logit contributions
ooters+0.975
 Krish+0.882
 Chaser+0.870
 Gutenberg+0.864
 Rib+0.858
Neg logit contributions
zsche-1.248
:,-1.184
athing-1.178
ilan-1.178
 df-1.119
Token
Token ID198
Feature activation-5.10e-02

Pos logit contributions
ooters+0.066
 Krish+0.060
 Chaser+0.058
 Rib+0.058
 Gutenberg+0.058
Neg logit contributions
zsche-0.092
:,-0.087
athing-0.086
ilan-0.086
 df-0.082
Token9
Token ID24
Feature activation+5.87e-01

Pos logit contributions
zsche+0.028
:,+0.026
athing+0.025
ilan+0.025
 df+0.024
Neg logit contributions
ooters-0.047
 Krish-0.043
 Chaser-0.043
 Gutenberg-0.043
Barn-0.042
Token Work
Token ID5521
Feature activation+3.05e-01

Pos logit contributions
ooters+0.378
 Krish+0.344
 Chaser+0.337
 Gutenberg+0.335
 Rib+0.334
Neg logit contributions
zsche-0.478
:,-0.454
athing-0.451
ilan-0.450
 df-0.429
Token.
Token ID13
Feature activation+1.22e-01

Pos logit contributions
ooters+0.532
 Krish+0.482
 Chaser+0.473
 Gutenberg+0.469
 Rib+0.467
Neg logit contributions
zsche-0.620
:,-0.586
athing-0.584
ilan-0.582
 df-0.556
Token<|endoftext|>
Token ID50256
Feature activation+2.64e-01

Pos logit contributions
ooters+0.075
 Krish+0.068
 Chaser+0.067
 Gutenberg+0.066
 Rib+0.066
Neg logit contributions
zsche-0.103
:,-0.097
athing-0.097
ilan-0.097
 df-0.092
TokenAbstract
Token ID23839
Feature activation+3.24e+00

Pos logit contributions
ooters+0.199
 Krish+0.182
 Chaser+0.179
 Gutenberg+0.178
 Rib+0.177
Neg logit contributions
zsche-0.186
:,-0.175
athing-0.174
ilan-0.173
 df-0.165
Token
Token ID198
Feature activation+2.52e-01

Pos logit contributions
ooters+2.010
 Krish+1.867
 Rib+1.811
 Gutenberg+1.802
 Chaser+1.788
Neg logit contributions
zsche-2.685
:,-2.539
ilan-2.533
athing-2.520
 df-2.401
Token
Token ID198
Feature activation-5.08e-01

Pos logit contributions
ooters+0.154
 Krish+0.139
 Chaser+0.135
 Gutenberg+0.135
 Rib+0.135
Neg logit contributions
zsche-0.214
:,-0.201
athing-0.201
ilan-0.201
 df-0.192
TokenBackground
Token ID21756
Feature activation+4.01e+00

Pos logit contributions
zsche+0.351
:,+0.333
athing+0.329
ilan+0.328
 df+0.311
Neg logit contributions
ooters-0.390
 Krish-0.358
 Chaser-0.351
 Gutenberg-0.350
 Rib-0.348
Token Depression
Token ID22483
Feature activation+2.19e-01

Pos logit contributions
ooters+2.740
 Krish+2.589
 Rib+2.544
 Gutenberg+2.496
 Chaser+2.493
Neg logit contributions
zsche-3.047
ilan-2.851
athing-2.825
:,-2.811
 df-2.678
Token presents
Token ID10969
Feature activation-2.38e-01

Pos logit contributions
ooters+0.154
 Krish+0.141
 Chaser+0.139
 Rib+0.137
 Gutenberg+0.137
Neg logit contributions
zsche-0.165
ilan-0.156
:,-0.156
athing-0.154
 df-0.146
Token a
Token ID257
Feature activation+3.70e-01

Pos logit contributions
zsche+0.191
:,+0.183
athing+0.181
ilan+0.180
 df+0.173
Neg logit contributions
ooters-0.155
 Krish-0.139
 Chaser-0.137
 Gutenberg-0.136
 Rib-0.135
Token significant
Token ID2383
Feature activation-1.97e-01

Pos logit contributions
ooters+0.253
 Krish+0.234
 Chaser+0.229
 Gutenberg+0.227
 Rib+0.225
Neg logit contributions
zsche-0.284
:,-0.269
athing-0.268
ilan-0.267
 df-0.253
Token burden
Token ID10538
Feature activation+1.49e-01

Pos logit contributions
zsche+0.143
:,+0.134
athing+0.134
ilan+0.133
 df+0.125
Neg logit contributions
ooters-0.145
 Krish-0.135
 Chaser-0.131
 Gutenberg-0.130
 Rib-0.130
Tokenast
Token ID459
Feature activation-4.11e-01

Pos logit contributions
ooters+0.382
 Krish+0.338
 Chaser+0.336
 Gutenberg+0.334
Barn+0.329
Neg logit contributions
zsche-0.378
:,-0.352
ilan-0.349
athing-0.343
 df-0.337
Token and
Token ID290
Feature activation+1.36e-01

Pos logit contributions
zsche+0.322
:,+0.305
athing+0.303
ilan+0.302
 df+0.288
Neg logit contributions
ooters-0.278
 Krish-0.253
 Chaser-0.247
 Gutenberg-0.246
 Rib-0.245
Token T
Token ID309
Feature activation+6.58e-02

Pos logit contributions
ooters+0.089
 Krish+0.081
 Chaser+0.079
 Gutenberg+0.079
 Rib+0.079
Neg logit contributions
zsche-0.110
:,-0.104
athing-0.103
ilan-0.103
 df-0.098
TokenUG
Token ID7340
Feature activation+1.08e+00

Pos logit contributions
ooters+0.046
 Krish+0.041
 Gutenberg+0.040
 Chaser+0.040
 Rib+0.040
Neg logit contributions
zsche-0.051
:,-0.048
athing-0.048
ilan-0.047
 df-0.046
Token developer
Token ID8517
Feature activation-1.45e+00

Pos logit contributions
ooters+0.718
 Krish+0.648
 Chaser+0.636
 Gutenberg+0.632
 Rib+0.631
Neg logit contributions
zsche-0.858
:,-0.809
athing-0.806
ilan-0.805
 df-0.767
Token Nerd
Token ID44031
Feature activation+3.95e+00

Pos logit contributions
zsche+1.175
:,+1.121
athing+1.112
ilan+1.108
 df+1.061
Neg logit contributions
ooters-0.939
 Krish-0.848
 Chaser-0.830
 Gutenberg-0.825
 Rib-0.820
Token Kingdom
Token ID7526
Feature activation+2.72e+00

Pos logit contributions
ooters+3.486
 Krish+3.279
 Chaser+3.212
 Gutenberg+3.196
 Rib+3.169
Neg logit contributions
zsche-2.220
:,-2.067
athing-2.014
ilan-2.006
 df-1.909
Token last
Token ID938
Feature activation+2.60e+00

Pos logit contributions
ooters+1.747
 Krish+1.599
 Rib+1.564
 Chaser+1.553
 Gutenberg+1.549
Neg logit contributions
zsche-2.213
ilan-2.081
athing-2.074
:,-2.070
 df-1.967
Token week
Token ID1285
Feature activation+4.22e-01

Pos logit contributions
ooters+1.594
 Krish+1.443
 Rib+1.403
 Gutenberg+1.403
 Chaser+1.397
Neg logit contributions
zsche-2.187
:,-2.078
athing-2.069
ilan-2.058
 df-1.961
Token signed
Token ID4488
Feature activation-3.15e-01

Pos logit contributions
ooters+0.272
 Krish+0.246
 Chaser+0.240
 Gutenberg+0.238
 Rib+0.238
Neg logit contributions
zsche-0.345
:,-0.328
athing-0.325
ilan-0.325
 df-0.310
Token an
Token ID281
Feature activation-2.11e-01

Pos logit contributions
zsche+0.264
:,+0.252
athing+0.250
ilan+0.249
 df+0.239
Neg logit contributions
ooters-0.195
 Krish-0.176
 Chaser-0.172
 Gutenberg-0.171
 Rib-0.170
Token:
Token ID25
Feature activation+1.51e-02

Pos logit contributions
zsche+1.531
:,+1.420
athing+1.410
ilan+1.402
 df+1.315
Neg logit contributions
ooters-2.446
 Krish-2.274
 Chaser-2.239
 Gutenberg-2.231
 Rib-2.223
Token Clive
Token ID47722
Feature activation+2.29e-01

Pos logit contributions
ooters+0.009
 Krish+0.009
 Chaser+0.008
 Rib+0.008
 Gutenberg+0.008
Neg logit contributions
zsche-0.013
:,-0.012
athing-0.012
ilan-0.012
 df-0.011
Token Brun
Token ID15700
Feature activation+7.15e-01

Pos logit contributions
ooters+0.128
 Krish+0.114
 Chaser+0.111
 Gutenberg+0.111
 Rib+0.110
Neg logit contributions
zsche-0.205
:,-0.197
athing-0.196
ilan-0.195
 df-0.187
Tokenskill
Token ID42401
Feature activation+1.56e-01

Pos logit contributions
ooters+0.333
 Krish+0.288
 Chaser+0.279
 Gutenberg+0.276
 Rib+0.274
Neg logit contributions
zsche-0.710
:,-0.684
athing-0.678
ilan-0.677
 df-0.653
Token/
Token ID14
Feature activation+7.81e-01

Pos logit contributions
ooters+0.106
 Krish+0.096
 Chaser+0.094
 Gutenberg+0.094
 Rib+0.093
Neg logit contributions
zsche-0.122
:,-0.116
athing-0.115
ilan-0.115
 df-0.110
TokenGetty
Token ID6633
Feature activation+3.95e+00

Pos logit contributions
ooters+0.589
 Krish+0.538
 Chaser+0.529
 Gutenberg+0.525
 Rib+0.524
Neg logit contributions
zsche-0.552
:,-0.522
athing-0.516
ilan-0.512
 df-0.488
Token Images
Token ID5382
Feature activation+1.23e+00

Pos logit contributions
ooters+1.293
 Krish+1.053
 Gutenberg+0.999
 Rib+0.991
 Chaser+0.990
Neg logit contributions
zsche-4.432
:,-4.300
athing-4.237
ilan-4.221
 df-4.113
Token
Token ID198
Feature activation-5.62e-01

Pos logit contributions
ooters+0.782
 Krish+0.704
 Chaser+0.688
 Gutenberg+0.684
 Rib+0.682
Neg logit contributions
zsche-1.011
:,-0.970
athing-0.958
ilan-0.955
 df-0.915
Token
Token ID198
Feature activation-8.22e-01

Pos logit contributions
zsche+0.470
:,+0.445
athing+0.443
ilan+0.443
 df+0.423
Neg logit contributions
ooters-0.351
 Krish-0.316
 Chaser-0.308
 Gutenberg-0.307
 Rib-0.306
TokenThat
Token ID2504
Feature activation+8.22e-01

Pos logit contributions
zsche+0.583
:,+0.549
athing+0.545
ilan+0.543
 df+0.517
Neg logit contributions
ooters-0.616
 Krish-0.565
 Chaser-0.555
 Gutenberg-0.551
 Rib-0.549
Token low
Token ID1877
Feature activation-1.74e-01

Pos logit contributions
ooters+0.543
 Krish+0.498
 Chaser+0.487
 Rib+0.487
 Gutenberg+0.486
Neg logit contributions
zsche-0.653
:,-0.616
ilan-0.612
athing-0.610
 df-0.583
Token gun
Token ID2485
Feature activation+2.70e-01

Pos logit contributions
ooters+0.106
 Krish+0.096
 Chaser+0.095
 Rib+0.093
 Gutenberg+0.093
Neg logit contributions
zsche-0.116
:,-0.110
athing-0.108
ilan-0.108
 df-0.103
Token!
Token ID0
Feature activation+5.03e-01

Pos logit contributions
ooters+0.182
 Krish+0.165
 Chaser+0.161
 Gutenberg+0.161
 Rib+0.160
Neg logit contributions
zsche-0.212
:,-0.202
athing-0.200
ilan-0.200
 df-0.191
Token
Token ID198
Feature activation+1.60e-01

Pos logit contributions
ooters+0.323
 Krish+0.292
 Chaser+0.286
 Gutenberg+0.283
 Rib+0.283
Neg logit contributions
zsche-0.411
:,-0.392
athing-0.389
ilan-0.387
 df-0.371
Token
Token ID198
Feature activation-9.80e-02

Pos logit contributions
ooters+0.098
 Krish+0.089
 Chaser+0.086
 Rib+0.086
 Gutenberg+0.086
Neg logit contributions
zsche-0.137
:,-0.128
athing-0.128
ilan-0.128
 df-0.122
TokenR
Token ID49
Feature activation+1.63e-01

Pos logit contributions
zsche+0.074
:,+0.071
athing+0.070
ilan+0.070
 df+0.067
Neg logit contributions
ooters-0.068
 Krish-0.062
 Chaser-0.061
 Gutenberg-0.061
 Rib-0.060
Tokenally
Token ID453
Feature activation+3.94e+00

Pos logit contributions
ooters+0.105
 Krish+0.093
 Chaser+0.091
 Rib+0.090
 Gutenberg+0.090
Neg logit contributions
zsche-0.133
:,-0.127
ilan-0.125
athing-0.125
 df-0.121
Token -
Token ID532
Feature activation+8.96e-01

Pos logit contributions
ooters+3.097
 Krish+2.838
 Chaser+2.836
 Rib+2.801
 Gutenberg+2.748
Neg logit contributions
zsche-2.647
:,-2.476
ilan-2.456
athing-2.437
 df-2.337
Token Maximum
Token ID22246
Feature activation+2.52e-01

Pos logit contributions
ooters+0.575
 Krish+0.520
 Chaser+0.512
 Rib+0.505
 Gutenberg+0.504
Neg logit contributions
zsche-0.733
:,-0.698
athing-0.693
ilan-0.691
 df-0.661
Token strength
Token ID4202
Feature activation-7.08e-01

Pos logit contributions
ooters+0.184
 Krish+0.168
 Chaser+0.166
 Gutenberg+0.164
 Rib+0.164
Neg logit contributions
zsche-0.184
:,-0.175
ilan-0.173
athing-0.172
 df-0.164
Token,
Token ID11
Feature activation-7.26e-01

Pos logit contributions
zsche+0.545
:,+0.520
athing+0.515
ilan+0.513
 df+0.490
Neg logit contributions
ooters-0.487
 Krish-0.442
 Chaser-0.433
 Gutenberg-0.431
 Rib-0.429
Token but
Token ID475
Feature activation-3.40e-01

Pos logit contributions
zsche+0.603
:,+0.577
athing+0.573
ilan+0.570
 df+0.547
Neg logit contributions
ooters-0.455
 Krish-0.409
 Chaser-0.399
 Gutenberg-0.397
 Rib-0.394
Token:
Token ID25
Feature activation-4.83e-01

Pos logit contributions
zsche+0.217
athing+0.198
ilan+0.197
:,+0.194
 df+0.183
Neg logit contributions
ooters-0.309
 Krish-0.288
 Chaser-0.284
 Gutenberg-0.283
 Rib-0.283
Token Jason
Token ID8982
Feature activation+2.42e-01

Pos logit contributions
zsche+0.389
:,+0.371
athing+0.368
ilan+0.366
 df+0.350
Neg logit contributions
ooters-0.315
 Krish-0.286
 Chaser-0.279
 Gutenberg-0.278
 Rib-0.277
Token Bah
Token ID13081
Feature activation+2.38e-01

Pos logit contributions
ooters+0.147
 Krish+0.133
 Chaser+0.129
 Gutenberg+0.129
 Rib+0.129
Neg logit contributions
zsche-0.205
:,-0.196
athing-0.195
ilan-0.194
 df-0.186
Tokenr
Token ID81
Feature activation-1.51e-03

Pos logit contributions
ooters+0.104
 Krish+0.089
 Chaser+0.085
 Gutenberg+0.084
 Rib+0.084
Neg logit contributions
zsche-0.244
:,-0.235
athing-0.233
ilan-0.232
 df-0.226
Token/
Token ID14
Feature activation+6.03e-01

Pos logit contributions
zsche+0.001
:,+0.001
athing+0.001
ilan+0.001
 df+0.001
Neg logit contributions
ooters-0.001
 Krish-0.001
 Gutenberg-0.001
 Chaser-0.001
 Rib-0.001
TokenGetty
Token ID6633
Feature activation+3.94e+00

Pos logit contributions
ooters+0.442
 Krish+0.403
 Chaser+0.398
 Rib+0.395
 Gutenberg+0.393
Neg logit contributions
zsche-0.439
:,-0.415
athing-0.410
ilan-0.407
 df-0.389
Token Images
Token ID5382
Feature activation+4.13e-01

Pos logit contributions
ooters+1.316
 Krish+1.065
 Chaser+1.026
 Rib+1.018
 Gutenberg+1.016
Neg logit contributions
zsche-4.397
:,-4.268
athing-4.229
ilan-4.216
 df-4.089
Token)
Token ID8
Feature activation-8.83e-02

Pos logit contributions
ooters+0.212
 Krish+0.187
 Chaser+0.180
 Gutenberg+0.179
 Rib+0.176
Neg logit contributions
zsche-0.389
:,-0.374
athing-0.372
ilan-0.370
 df-0.358
Token
Token ID198
Feature activation-5.31e-01

Pos logit contributions
zsche+0.069
:,+0.066
athing+0.066
ilan+0.066
 df+0.063
Neg logit contributions
ooters-0.059
 Krish-0.053
 Chaser-0.053
 Gutenberg-0.052
 Rib-0.051
Token
Token ID198
Feature activation-2.93e-01

Pos logit contributions
zsche+0.438
:,+0.417
athing+0.414
ilan+0.412
 df+0.395
Neg logit contributions
ooters-0.337
 Krish-0.304
 Chaser-0.297
 Gutenberg-0.295
 Rib-0.294
TokenSouth
Token ID14942
Feature activation-6.42e-01

Pos logit contributions
zsche+0.212
:,+0.199
athing+0.197
ilan+0.197
 df+0.188
Neg logit contributions
ooters-0.216
 Krish-0.198
 Chaser-0.194
 Gutenberg-0.192
 Rib-0.192
Token your
Token ID534
Feature activation+9.94e-01

Pos logit contributions
zsche+2.727
:,+2.652
athing+2.613
ilan+2.580
 df+2.503
Neg logit contributions
ooters-2.314
 Krish-2.066
 Chaser-2.025
 Gutenberg-2.017
 Rib-2.001
Token Vend
Token ID44220
Feature activation+3.10e+00

Pos logit contributions
ooters+0.679
 Krish+0.620
 Chaser+0.612
 Gutenberg+0.608
 Rib+0.604
Neg logit contributions
zsche-0.774
:,-0.727
ilan-0.725
athing-0.723
 df-0.682
Tokenread
Token ID961
Feature activation+8.80e-01

Pos logit contributions
ooters+1.287
 Krish+1.082
 Gutenberg+1.051
 Chaser+1.033
 Rib+1.010
Neg logit contributions
zsche-3.201
:,-3.091
ilan-3.090
athing-3.070
 df-2.993
Token Deck
Token ID20961
Feature activation-3.59e-02

Pos logit contributions
ooters+0.561
 Krish+0.506
 Chaser+0.499
 Gutenberg+0.493
 Rib+0.491
Neg logit contributions
zsche-0.724
:,-0.686
athing-0.682
ilan-0.681
 df-0.649
Token,
Token ID11
Feature activation+2.66e+00

Pos logit contributions
zsche+0.028
:,+0.027
athing+0.026
ilan+0.026
 df+0.025
Neg logit contributions
ooters-0.024
 Krish-0.022
 Chaser-0.022
 Gutenberg-0.021
 Rib-0.021
Token but
Token ID475
Feature activation+3.93e+00

Pos logit contributions
ooters+1.572
 Krish+1.431
 Chaser+1.413
 Rib+1.384
 Gutenberg+1.379
Neg logit contributions
zsche-2.296
ilan-2.159
:,-2.155
athing-2.146
 df-2.041
Token just
Token ID655
Feature activation+2.73e+00

Pos logit contributions
ooters+2.188
 Krish+1.983
 Chaser+1.934
 Gutenberg+1.919
 Rib+1.914
Neg logit contributions
zsche-3.533
ilan-3.353
athing-3.289
:,-3.274
 df-3.140
Token about
Token ID546
Feature activation+8.86e-01

Pos logit contributions
ooters+1.541
 Krish+1.376
 Chaser+1.339
 Gutenberg+1.333
 Rib+1.328
Neg logit contributions
zsche-2.426
ilan-2.289
:,-2.278
athing-2.266
 df-2.180
Token any
Token ID597
Feature activation+9.94e-01

Pos logit contributions
ooters+0.527
 Krish+0.472
 Chaser+0.460
 Gutenberg+0.458
 Rib+0.455
Neg logit contributions
zsche-0.766
:,-0.731
athing-0.727
ilan-0.726
 df-0.694
Token strategy
Token ID4811
Feature activation+7.19e-01

Pos logit contributions
ooters+0.630
 Krish+0.569
 Chaser+0.557
 Gutenberg+0.554
 Rib+0.549
Neg logit contributions
zsche-0.820
:,-0.780
athing-0.774
ilan-0.773
 df-0.737
Token you
Token ID345
Feature activation+1.23e+00

Pos logit contributions
ooters+0.437
 Krish+0.392
 Chaser+0.384
 Gutenberg+0.382
 Rib+0.378
Neg logit contributions
zsche-0.611
:,-0.582
ilan-0.579
athing-0.578
 df-0.552
Token urgency
Token ID25615
Feature activation-1.58e+00

Pos logit contributions
ooters+0.185
 Krish+0.170
 Chaser+0.166
 Gutenberg+0.166
 Rib+0.165
Neg logit contributions
zsche-0.178
:,-0.169
athing-0.166
ilan-0.166
 df-0.157
Token for
Token ID329
Feature activation+8.61e-01

Pos logit contributions
zsche+1.257
:,+1.208
athing+1.194
ilan+1.185
 df+1.139
Neg logit contributions
ooters-1.048
 Krish-0.945
 Chaser-0.927
 Gutenberg-0.925
 Rib-0.912
Token the
Token ID262
Feature activation+8.61e-02

Pos logit contributions
ooters+0.575
 Krish+0.530
 Chaser+0.521
 Rib+0.518
 Gutenberg+0.513
Neg logit contributions
zsche-0.684
:,-0.643
athing-0.635
ilan-0.634
 df-0.598
Token 50
Token ID2026
Feature activation-1.69e-01

Pos logit contributions
ooters+0.063
 Krish+0.059
 Chaser+0.058
 Rib+0.057
 Gutenberg+0.057
Neg logit contributions
zsche-0.063
:,-0.059
athing-0.058
ilan-0.058
 df-0.055
Token or
Token ID393
Feature activation-1.39e+00

Pos logit contributions
zsche+0.138
:,+0.131
athing+0.130
ilan+0.129
 df+0.124
Neg logit contributions
ooters-0.109
 Krish-0.099
 Chaser-0.097
 Gutenberg-0.096
 Rib-0.096
Token so
Token ID523
Feature activation-4.68e+00

Pos logit contributions
zsche+1.211
:,+1.158
athing+1.149
ilan+1.141
 df+1.098
Neg logit contributions
ooters-0.829
 Krish-0.739
 Chaser-0.721
 Gutenberg-0.716
 Rib-0.708
Token workers
Token ID3259
Feature activation-3.00e-01

Pos logit contributions
zsche+3.359
:,+3.245
ilan+3.205
athing+3.204
 df+3.081
Neg logit contributions
ooters-3.369
 Krish-3.035
 Gutenberg-2.969
 Chaser-2.955
Barn-2.936
Token at
Token ID379
Feature activation+7.37e-01

Pos logit contributions
zsche+0.235
:,+0.223
athing+0.221
ilan+0.220
 df+0.210
Neg logit contributions
ooters-0.203
 Krish-0.185
 Chaser-0.181
 Gutenberg-0.180
 Rib-0.179
Token the
Token ID262
Feature activation-1.85e-01

Pos logit contributions
ooters+0.462
 Krish+0.418
 Chaser+0.411
 Rib+0.407
 Gutenberg+0.406
Neg logit contributions
zsche-0.616
:,-0.584
athing-0.580
ilan-0.578
 df-0.550
Token airport
Token ID9003
Feature activation-4.18e-01

Pos logit contributions
zsche+0.147
:,+0.140
athing+0.138
ilan+0.138
 df+0.131
Neg logit contributions
ooters-0.123
 Krish-0.112
 Chaser-0.110
 Gutenberg-0.109
 Rib-0.109
Token.
Token ID13
Feature activation-2.52e-01

Pos logit contributions
zsche+0.335
:,+0.319
athing+0.316
ilan+0.315
 df+0.301
Neg logit contributions
ooters-0.275
 Krish-0.250
 Chaser-0.244
 Gutenberg-0.243
 Rib-0.242
Token,
Token ID11
Feature activation-2.29e-01

Pos logit contributions
zsche+0.386
:,+0.372
athing+0.364
ilan+0.364
 df+0.347
Neg logit contributions
ooters-0.367
 Krish-0.334
 Chaser-0.328
 Gutenberg-0.325
 Rib-0.323
Token even
Token ID772
Feature activation+2.20e-01

Pos logit contributions
zsche+0.190
:,+0.180
athing+0.179
ilan+0.179
 df+0.171
Neg logit contributions
ooters-0.145
 Krish-0.131
 Chaser-0.128
 Gutenberg-0.127
 Rib-0.126
Token I
Token ID314
Feature activation-9.16e-01

Pos logit contributions
ooters+0.132
 Krish+0.119
 Chaser+0.116
 Gutenberg+0.116
 Rib+0.115
Neg logit contributions
zsche-0.189
:,-0.179
athing-0.178
ilan-0.178
 df-0.170
Token can
Token ID460
Feature activation-6.15e-01

Pos logit contributions
zsche+0.712
:,+0.673
ilan+0.667
athing+0.663
 df+0.634
Neg logit contributions
ooters-0.627
 Krish-0.570
 Chaser-0.557
 Rib-0.553
 Gutenberg-0.551
Token't
Token ID470
Feature activation-6.45e-01

Pos logit contributions
zsche+0.576
:,+0.551
ilan+0.547
athing+0.546
 df+0.524
Neg logit contributions
ooters-0.322
 Krish-0.283
 Chaser-0.275
 Rib-0.272
 Gutenberg-0.271
Token deny
Token ID10129
Feature activation-4.65e+00

Pos logit contributions
zsche+0.502
:,+0.474
ilan+0.472
athing+0.469
 df+0.443
Neg logit contributions
ooters-0.439
 Krish-0.399
 Chaser-0.389
 Rib-0.389
 Gutenberg-0.386
Token how
Token ID703
Feature activation-6.19e-01

Pos logit contributions
zsche+3.630
:,+3.558
athing+3.504
ilan+3.458
 df+3.357
Neg logit contributions
ooters-3.042
 Krish-2.704
 Chaser-2.648
 Gutenberg-2.632
Barn-2.596
Token visually
Token ID22632
Feature activation-3.97e-02

Pos logit contributions
zsche+0.517
:,+0.494
athing+0.489
ilan+0.488
 df+0.468
Neg logit contributions
ooters-0.385
 Krish-0.347
 Chaser-0.339
 Gutenberg-0.336
 Rib-0.335
Token stunning
Token ID13393
Feature activation-1.48e-01

Pos logit contributions
zsche+0.033
:,+0.031
athing+0.031
ilan+0.031
 df+0.030
Neg logit contributions
ooters-0.025
 Krish-0.023
 Chaser-0.022
 Gutenberg-0.022
 Rib-0.022
Token and
Token ID290
Feature activation+4.86e-01

Pos logit contributions
zsche+0.121
:,+0.115
athing+0.114
ilan+0.113
 df+0.108
Neg logit contributions
ooters-0.096
 Krish-0.087
 Chaser-0.085
 Gutenberg-0.084
 Rib-0.084
Token colorful
Token ID20239
Feature activation-2.54e-03

Pos logit contributions
ooters+0.312
 Krish+0.285
 Chaser+0.278
 Gutenberg+0.276
 Rib+0.275
Neg logit contributions
zsche-0.395
:,-0.377
ilan-0.374
athing-0.373
 df-0.355
Token 43
Token ID5946
Feature activation-1.77e+00

Pos logit contributions
zsche+0.255
athing+0.241
:,+0.241
ilan+0.239
 df+0.227
Neg logit contributions
ooters-0.193
 Krish-0.175
 Chaser-0.170
 Gutenberg-0.170
 Rib-0.170
Token (
Token ID357
Feature activation+5.10e-01

Pos logit contributions
zsche+1.074
:,+1.021
athing+0.994
ilan+0.993
 df+0.941
Neg logit contributions
ooters-1.501
 Krish-1.386
 Chaser-1.366
 Gutenberg-1.360
 Rib-1.352
TokenS
Token ID50
Feature activation-7.22e-01

Pos logit contributions
ooters+0.389
 Krish+0.358
 Chaser+0.350
 Gutenberg+0.349
 Rib+0.348
Neg logit contributions
zsche-0.353
athing-0.331
:,-0.329
ilan-0.327
 df-0.311
Token4
Token ID19
Feature activation-2.73e-01

Pos logit contributions
zsche+0.494
:,+0.465
athing+0.463
ilan+0.460
 df+0.436
Neg logit contributions
ooters-0.559
 Krish-0.513
 Gutenberg-0.503
 Chaser-0.501
 Rib-0.499
Token)
Token ID8
Feature activation+1.41e-01

Pos logit contributions
zsche+0.203
:,+0.192
athing+0.191
ilan+0.191
 df+0.181
Neg logit contributions
ooters-0.194
 Krish-0.177
 Chaser-0.174
 Gutenberg-0.173
 Rib-0.172
Token DOI
Token ID40722
Feature activation-4.44e+00

Pos logit contributions
ooters+0.091
 Krish+0.083
 Gutenberg+0.081
 Chaser+0.081
 Rib+0.081
Neg logit contributions
zsche-0.114
:,-0.108
athing-0.107
ilan-0.107
 df-0.102
Token:
Token ID25
Feature activation+1.74e-01

Pos logit contributions
zsche+2.362
:,+2.337
athing+2.178
ilan+2.166
 df+2.102
Neg logit contributions
ooters-4.013
 Krish-3.712
 Chaser-3.699
 Gutenberg-3.637
 Rib-3.613
Token 10
Token ID838
Feature activation-1.55e+00

Pos logit contributions
ooters+0.132
 Krish+0.121
 Chaser+0.119
 Gutenberg+0.119
 Rib+0.118
Neg logit contributions
zsche-0.122
:,-0.115
athing-0.115
ilan-0.114
 df-0.108
Token.
Token ID13
Feature activation-2.36e+00

Pos logit contributions
zsche+1.173
:,+1.112
athing+1.105
ilan+1.100
 df+1.049
Neg logit contributions
ooters-1.082
 Krish-0.985
 Chaser-0.965
 Gutenberg-0.961
 Rib-0.956
Token10
Token ID940
Feature activation-7.83e-01

Pos logit contributions
zsche+1.530
:,+1.463
athing+1.439
ilan+1.427
 df+1.364
Neg logit contributions
ooters-1.887
 Krish-1.740
 Chaser-1.728
 Gutenberg-1.705
 Rib-1.699
Token86
Token ID4521
Feature activation+3.33e-01

Pos logit contributions
zsche+0.530
athing+0.500
:,+0.496
ilan+0.489
 df+0.461
Neg logit contributions
ooters-0.612
 Krish-0.561
 Rib-0.548
 Chaser-0.547
 Gutenberg-0.545
Token his
Token ID465
Feature activation-7.15e-01

Pos logit contributions
zsche+0.172
:,+0.164
athing+0.162
ilan+0.161
 df+0.154
Neg logit contributions
ooters-0.151
 Krish-0.136
 Chaser-0.134
 Gutenberg-0.133
 Rib-0.132
Tokens
Token ID82
Feature activation-1.92e+00

Pos logit contributions
zsche+0.648
:,+0.622
athing+0.617
ilan+0.615
 df+0.592
Neg logit contributions
ooters-0.394
 Krish-0.349
 Chaser-0.340
 Gutenberg-0.337
 Rib-0.335
Token remind
Token ID7101
Feature activation+1.01e+00

Pos logit contributions
zsche+1.449
:,+1.381
athing+1.369
ilan+1.362
 df+1.303
Neg logit contributions
ooters-1.346
 Krish-1.225
 Chaser-1.202
 Gutenberg-1.193
 Rib-1.188
Token us
Token ID514
Feature activation+3.50e-01

Pos logit contributions
ooters+0.652
 Krish+0.593
 Chaser+0.580
 Gutenberg+0.578
 Rib+0.575
Neg logit contributions
zsche-0.810
:,-0.775
athing-0.771
ilan-0.768
 df-0.732
Token not
Token ID407
Feature activation-1.09e+00

Pos logit contributions
ooters+0.224
 Krish+0.203
 Chaser+0.197
 Gutenberg+0.197
 Rib+0.196
Neg logit contributions
zsche-0.286
:,-0.272
athing-0.271
ilan-0.271
 df-0.258
Token only
Token ID691
Feature activation-4.31e+00

Pos logit contributions
zsche+0.868
:,+0.828
athing+0.821
ilan+0.816
 df+0.783
Neg logit contributions
ooters-0.730
 Krish-0.659
 Chaser-0.648
 Gutenberg-0.640
 Rib-0.637
Token of
Token ID286
Feature activation-1.12e+00

Pos logit contributions
zsche+3.208
:,+3.073
athing+3.030
ilan+2.980
 df+2.912
Neg logit contributions
ooters-3.031
 Krish-2.721
 Chaser-2.705
 Gutenberg-2.659
 Rib-2.636
Token lost
Token ID2626
Feature activation-6.06e-01

Pos logit contributions
zsche+0.922
:,+0.882
athing+0.874
ilan+0.871
 df+0.834
Neg logit contributions
ooters-0.708
 Krish-0.637
 Chaser-0.623
 Gutenberg-0.620
 Rib-0.615
Token fidelity
Token ID37744
Feature activation-9.79e-02

Pos logit contributions
zsche+0.469
:,+0.447
athing+0.445
ilan+0.441
 df+0.421
Neg logit contributions
ooters-0.412
 Krish-0.377
 Gutenberg-0.369
 Chaser-0.366
 Rib-0.365
Token,
Token ID11
Feature activation+1.56e+00

Pos logit contributions
zsche+0.078
:,+0.075
athing+0.074
ilan+0.074
 df+0.071
Neg logit contributions
ooters-0.064
 Krish-0.058
 Chaser-0.057
 Gutenberg-0.057
 Rib-0.056
Token but
Token ID475
Feature activation+3.70e+00

Pos logit contributions
ooters+0.947
 Krish+0.864
 Chaser+0.844
 Gutenberg+0.843
 Rib+0.835
Neg logit contributions
zsche-1.311
:,-1.251
athing-1.246
ilan-1.241
 df-1.180
Token
Token ID198
Feature activation-1.17e+00

Pos logit contributions
zsche+1.056
:,+1.008
athing+1.000
ilan+0.996
 df+0.954
Neg logit contributions
ooters-0.829
 Krish-0.747
 Chaser-0.731
 Gutenberg-0.727
 Rib-0.722
TokenGoogle
Token ID11708
Feature activation-1.94e+00

Pos logit contributions
zsche+0.868
:,+0.832
athing+0.819
ilan+0.819
 df+0.778
Neg logit contributions
ooters-0.828
 Krish-0.752
 Chaser-0.740
 Gutenberg-0.733
 Rib-0.731
Token Scholar
Token ID11713
Feature activation-1.62e+00

Pos logit contributions
zsche+1.600
:,+1.542
athing+1.520
ilan+1.512
 df+1.459
Neg logit contributions
ooters-1.223
 Krish-1.093
 Chaser-1.065
 Gutenberg-1.057
 Rib-1.052
Token Crossref
Token ID23613
Feature activation-2.15e+00

Pos logit contributions
zsche+1.237
:,+1.183
athing+1.170
ilan+1.166
 df+1.120
Neg logit contributions
ooters-1.114
 Krish-1.012
 Chaser-0.992
 Gutenberg-0.986
 Rib-0.981
Token |
Token ID930
Feature activation-4.20e-01

Pos logit contributions
zsche+1.589
:,+1.531
athing+1.494
ilan+1.492
 df+1.432
Neg logit contributions
ooters-1.532
 Krish-1.394
 Chaser-1.374
 Gutenberg-1.357
 Rib-1.345
Token ISI
Token ID22048
Feature activation-4.20e+00

Pos logit contributions
zsche+0.259
:,+0.245
athing+0.241
ilan+0.240
 df+0.227
Neg logit contributions
ooters-0.354
 Krish-0.326
 Chaser-0.321
 Gutenberg-0.319
 Rib-0.318
Token
Token ID198
Feature activation-6.90e-01

Pos logit contributions
zsche+3.128
:,+3.041
athing+2.973
ilan+2.950
 df+2.874
Neg logit contributions
ooters-2.911
 Chaser-2.646
 Krish-2.639
 Stephenson-2.546
 Gutenberg-2.543
Token
Token ID198
Feature activation-9.94e-01

Pos logit contributions
zsche+0.568
:,+0.541
athing+0.537
ilan+0.535
 df+0.513
Neg logit contributions
ooters-0.438
 Krish-0.395
 Chaser-0.386
 Gutenberg-0.384
 Rib-0.382
TokenM
Token ID44
Feature activation-3.42e-01

Pos logit contributions
zsche+0.737
:,+0.708
ilan+0.696
athing+0.695
 df+0.660
Neg logit contributions
ooters-0.708
 Krish-0.642
 Chaser-0.634
 Gutenberg-0.629
 Rib-0.624
Tokenol
Token ID349
Feature activation+5.74e-01

Pos logit contributions
zsche+0.256
:,+0.245
athing+0.242
ilan+0.240
 df+0.230
Neg logit contributions
ooters-0.243
 Krish-0.220
 Chaser-0.215
 Gutenberg-0.214
 Rib-0.213
Tokencho
Token ID6679
Feature activation-1.69e+00

Pos logit contributions
ooters+0.433
 Krish+0.393
 Gutenberg+0.384
 Chaser+0.383
 Rib+0.383
Neg logit contributions
zsche-0.400
:,-0.382
athing-0.375
ilan-0.375
 df-0.363
Token
Token ID198
Feature activation-3.85e-01

Pos logit contributions
zsche+0.379
:,+0.362
athing+0.359
ilan+0.358
 df+0.342
Neg logit contributions
ooters-0.300
 Krish-0.271
 Chaser-0.265
 Gutenberg-0.263
 Rib-0.262
Token
Token ID198
Feature activation-5.13e-01

Pos logit contributions
zsche+0.320
:,+0.304
athing+0.302
ilan+0.301
 df+0.288
Neg logit contributions
ooters-0.243
 Krish-0.218
 Chaser-0.213
 Gutenberg-0.212
 Rib-0.211
Token-->
Token ID46904
Feature activation-3.65e-01

Pos logit contributions
zsche+0.371
:,+0.352
athing+0.348
ilan+0.347
 df+0.331
Neg logit contributions
ooters-0.378
 Krish-0.345
 Chaser-0.339
 Gutenberg-0.336
 Rib-0.335
Token src
Token ID12351
Feature activation-2.10e+00

Pos logit contributions
zsche+0.292
:,+0.278
athing+0.276
ilan+0.275
 df+0.262
Neg logit contributions
ooters-0.240
 Krish-0.217
 Chaser-0.213
 Gutenberg-0.211
 Rib-0.210
Token/
Token ID14
Feature activation-1.53e+00

Pos logit contributions
zsche+1.586
:,+1.517
athing+1.500
ilan+1.486
 df+1.429
Neg logit contributions
ooters-1.471
 Krish-1.335
 Chaser-1.313
 Gutenberg-1.299
 Rib-1.295
Tokenmain
Token ID12417
Feature activation-4.19e+00

Pos logit contributions
zsche+1.112
:,+1.064
athing+1.051
ilan+1.050
 df+0.997
Neg logit contributions
ooters-1.118
 Krish-1.025
 Gutenberg-1.002
 Chaser-1.001
 Rib-0.992
Token.
Token ID13
Feature activation-1.51e+00

Pos logit contributions
zsche+2.899
:,+2.772
ilan+2.751
athing+2.734
 df+2.644
Neg logit contributions
ooters-3.182
 Krish-2.878
 Chaser-2.876
Barn-2.828
 Gutenberg-2.818
Tokenrs
Token ID3808
Feature activation-2.07e+00

Pos logit contributions
zsche+1.297
:,+1.244
athing+1.243
ilan+1.238
 df+1.180
Neg logit contributions
ooters-0.898
 Krish-0.806
 Chaser-0.790
 Gutenberg-0.786
 Rib-0.780
Token:
Token ID25
Feature activation-4.80e-01

Pos logit contributions
zsche+1.546
:,+1.472
athing+1.457
ilan+1.449
 df+1.386
Neg logit contributions
ooters-1.477
 Krish-1.344
 Chaser-1.319
 Gutenberg-1.312
 Rib-1.303
Token2
Token ID17
Feature activation+4.83e-01

Pos logit contributions
zsche+0.313
:,+0.295
athing+0.292
ilan+0.291
 df+0.274
Neg logit contributions
ooters-0.386
 Krish-0.356
 Chaser-0.349
 Gutenberg-0.348
 Rib-0.347
Token:
Token ID25
Feature activation-1.10e-01

Pos logit contributions
ooters+0.374
 Krish+0.346
 Chaser+0.339
 Gutenberg+0.337
 Rib+0.336
Neg logit contributions
zsche-0.328
:,-0.309
ilan-0.307
athing-0.307
 df-0.288
Token apples
Token ID22514
Feature activation+1.07e+00

Pos logit contributions
ooters+0.217
 Krish+0.192
 Chaser+0.187
 Gutenberg+0.186
 Rib+0.184
Neg logit contributions
zsche-0.349
:,-0.335
athing-0.332
ilan-0.331
 df-0.319
Token-
Token ID12
Feature activation+9.35e-01

Pos logit contributions
ooters+0.589
 Krish+0.525
 Chaser+0.511
 Rib+0.506
 Gutenberg+0.506
Neg logit contributions
zsche-0.977
:,-0.931
athing-0.924
ilan-0.921
 df-0.886
Tokento
Token ID1462
Feature activation+9.34e-01

Pos logit contributions
ooters+0.608
 Krish+0.549
 Chaser+0.537
 Gutenberg+0.535
 Rib+0.531
Neg logit contributions
zsche-0.754
:,-0.721
athing-0.716
ilan-0.713
 df-0.682
Token-
Token ID12
Feature activation-3.37e-01

Pos logit contributions
ooters+0.494
 Krish+0.431
 Chaser+0.421
 Rib+0.417
 Gutenberg+0.416
Neg logit contributions
zsche-0.872
:,-0.831
ilan-0.827
athing-0.827
 df-0.793
Tokenapp
Token ID1324
Feature activation-3.41e-01

Pos logit contributions
zsche+0.279
athing+0.274
:,+0.270
ilan+0.266
 df+0.254
Neg logit contributions
ooters-0.205
 Krish-0.192
 Gutenberg-0.190
 Chaser-0.185
 Rib-0.181
Tokenles
Token ID829
Feature activation-4.14e+00

Pos logit contributions
zsche+0.305
:,+0.294
athing+0.288
ilan+0.288
 df+0.278
Neg logit contributions
ooters-0.194
 Krish-0.172
 Chaser-0.167
 Rib-0.165
 Gutenberg-0.164
Token comparison
Token ID7208
Feature activation-1.41e+00

Pos logit contributions
zsche+3.297
:,+3.175
athing+3.148
ilan+3.108
 df+3.044
Neg logit contributions
ooters-2.704
 Krish-2.399
 Chaser-2.358
 Gutenberg-2.336
 Rib-2.307
Token.
Token ID13
Feature activation+9.68e-02

Pos logit contributions
zsche+1.083
:,+1.026
athing+1.021
ilan+1.018
 df+0.968
Neg logit contributions
ooters-0.979
 Krish-0.891
 Chaser-0.872
 Gutenberg-0.868
 Rib-0.864
Token
Token ID198
Feature activation-3.75e-01

Pos logit contributions
ooters+0.058
 Krish+0.052
 Chaser+0.051
 Rib+0.051
 Gutenberg+0.050
Neg logit contributions
zsche-0.084
:,-0.079
athing-0.079
ilan-0.079
 df-0.075
Token
Token ID198
Feature activation-1.60e-01

Pos logit contributions
zsche+0.315
:,+0.298
athing+0.297
ilan+0.296
 df+0.283
Neg logit contributions
ooters-0.233
 Krish-0.210
 Chaser-0.205
 Gutenberg-0.203
 Rib-0.203
Token"
Token ID1
Feature activation-1.01e+00

Pos logit contributions
zsche+0.120
:,+0.113
athing+0.112
ilan+0.112
 df+0.106
Neg logit contributions
ooters-0.114
 Krish-0.104
 Chaser-0.102
 Gutenberg-0.101
Barn-0.101
Token idea
Token ID2126
Feature activation-1.21e+00

Pos logit contributions
ooters+0.076
 Krish+0.069
 Chaser+0.067
 Gutenberg+0.067
 Rib+0.067
Neg logit contributions
zsche-0.092
:,-0.088
ilan-0.087
athing-0.087
 df-0.083
Token of
Token ID286
Feature activation+3.32e-01

Pos logit contributions
zsche+0.954
:,+0.911
athing+0.903
ilan+0.897
 df+0.860
Neg logit contributions
ooters-0.812
 Krish-0.735
 Chaser-0.720
 Gutenberg-0.716
 Rib-0.712
Token a
Token ID257
Feature activation+1.75e-01

Pos logit contributions
ooters+0.211
 Krish+0.191
 Chaser+0.186
 Gutenberg+0.186
 Rib+0.184
Neg logit contributions
zsche-0.273
:,-0.261
ilan-0.259
athing-0.258
 df-0.247
Token St
Token ID520
Feature activation+2.08e-02

Pos logit contributions
ooters+0.106
 Krish+0.096
 Chaser+0.093
 Gutenberg+0.093
 Rib+0.092
Neg logit contributions
zsche-0.149
:,-0.143
athing-0.141
ilan-0.141
 df-0.135
Token.
Token ID13
Feature activation-1.94e+00

Pos logit contributions
ooters+0.015
 Krish+0.013
 Chaser+0.013
 Gutenberg+0.013
 Rib+0.013
Neg logit contributions
zsche-0.016
:,-0.015
athing-0.015
ilan-0.015
 df-0.014
Token Louis
Token ID5593
Feature activation-4.13e+00

Pos logit contributions
zsche+1.320
:,+1.254
athing+1.246
ilan+1.237
 df+1.173
Neg logit contributions
ooters-1.499
 Krish-1.367
 Chaser-1.351
 Gutenberg-1.341
 Rib-1.331
Token purge
Token ID35714
Feature activation-1.46e+00

Pos logit contributions
zsche+3.588
:,+3.433
athing+3.422
ilan+3.378
 df+3.265
Neg logit contributions
ooters-2.377
 Krish-2.095
 Chaser-2.043
 Stephenson-2.029
 Rib-2.026
Token during
Token ID1141
Feature activation-5.38e-01

Pos logit contributions
zsche+1.127
:,+1.074
athing+1.064
ilan+1.060
 df+1.012
Neg logit contributions
ooters-1.008
 Krish-0.915
 Chaser-0.897
 Gutenberg-0.892
 Rib-0.886
Token the
Token ID262
Feature activation-1.25e-01

Pos logit contributions
zsche+0.454
:,+0.435
athing+0.431
ilan+0.429
 df+0.413
Neg logit contributions
ooters-0.330
 Krish-0.295
 Chaser-0.288
 Gutenberg-0.286
 Rib-0.284
Token Ferguson
Token ID15031
Feature activation+4.46e-01

Pos logit contributions
zsche+0.099
:,+0.094
athing+0.093
ilan+0.093
 df+0.089
Neg logit contributions
ooters-0.083
 Krish-0.075
 Chaser-0.074
 Gutenberg-0.074
 Rib-0.073
Token protests
Token ID8536
Feature activation+3.91e-01

Pos logit contributions
ooters+0.314
 Krish+0.286
 Chaser+0.280
 Gutenberg+0.279
 Rib+0.277
Neg logit contributions
zsche-0.336
:,-0.320
athing-0.317
ilan-0.316
 df-0.300
Token Fourth
Token ID15692
Feature activation+2.44e+00

Pos logit contributions
ooters+0.061
 Krish+0.054
 Chaser+0.053
 Gutenberg+0.053
 Rib+0.052
Neg logit contributions
zsche-0.098
:,-0.094
athing-0.093
ilan-0.093
 df-0.089
Token Div
Token ID4777
Feature activation-6.75e-01

Pos logit contributions
ooters+1.830
 Krish+1.716
 Rib+1.685
 Chaser+1.677
 Observer+1.655
Neg logit contributions
zsche-1.738
athing-1.599
:,-1.598
ilan-1.593
 df-1.473
Tokenisions
Token ID3279
Feature activation-1.11e-01

Pos logit contributions
zsche+0.610
:,+0.587
athing+0.583
ilan+0.582
 df+0.559
Neg logit contributions
ooters-0.371
 Krish-0.329
 Chaser-0.320
 Gutenberg-0.319
 Rib-0.315
Token all
Token ID477
Feature activation+3.41e-01

Pos logit contributions
zsche+0.088
:,+0.083
athing+0.083
ilan+0.082
 df+0.079
Neg logit contributions
ooters-0.074
 Krish-0.068
 Chaser-0.066
 Rib-0.066
 Gutenberg-0.066
Token at
Token ID379
Feature activation+1.38e-02

Pos logit contributions
ooters+0.211
 Krish+0.190
 Gutenberg+0.185
 Rib+0.185
 Chaser+0.185
Neg logit contributions
zsche-0.287
:,-0.272
athing-0.269
ilan-0.268
 df-0.258
Token once
Token ID1752
Feature activation-4.11e+00

Pos logit contributions
ooters+0.010
 Krish+0.009
 Chaser+0.009
 Gutenberg+0.009
 Rib+0.009
Neg logit contributions
zsche-0.010
:,-0.010
athing-0.010
ilan-0.010
 df-0.009
Token on
Token ID319
Feature activation-8.23e-01

Pos logit contributions
zsche+2.970
:,+2.879
athing+2.833
ilan+2.811
 df+2.676
Neg logit contributions
ooters-2.976
 Krish-2.709
 Chaser-2.666
 Gutenberg-2.657
 Rib-2.610
Token S
Token ID311
Feature activation+3.18e-01

Pos logit contributions
zsche+0.681
:,+0.654
athing+0.649
ilan+0.645
 df+0.620
Neg logit contributions
ooters-0.517
 Krish-0.462
 Chaser-0.451
 Gutenberg-0.450
 Rib-0.444
Tokenoky
Token ID31375
Feature activation-4.07e-01

Pos logit contributions
ooters+0.233
 Krish+0.211
 Chaser+0.207
 Rib+0.204
 Gutenberg+0.204
Neg logit contributions
zsche-0.234
:,-0.220
athing-0.217
ilan-0.216
 df-0.207
Tokenoku
Token ID11601
Feature activation+1.43e+00

Pos logit contributions
zsche+0.301
:,+0.286
athing+0.283
ilan+0.282
 df+0.270
Neg logit contributions
ooters-0.294
 Krish-0.267
 Chaser-0.262
 Gutenberg-0.260
 Rib-0.259
Token Hill
Token ID3327
Feature activation+7.49e-01

Pos logit contributions
ooters+1.000
 Krish+0.911
 Chaser+0.897
 Rib+0.891
 Gutenberg+0.872
Neg logit contributions
zsche-1.093
:,-1.026
ilan-1.022
athing-1.020
 df-0.970
Token Scholar
Token ID11713
Feature activation-1.72e+00

Pos logit contributions
zsche+1.784
:,+1.722
athing+1.695
ilan+1.685
 df+1.628
Neg logit contributions
ooters-1.381
 Krish-1.234
 Chaser-1.202
 Gutenberg-1.191
 Rib-1.186
Token Crossref
Token ID23613
Feature activation-2.12e+00

Pos logit contributions
zsche+1.307
:,+1.250
athing+1.235
ilan+1.231
 df+1.183
Neg logit contributions
ooters-1.198
 Krish-1.089
 Chaser-1.068
 Gutenberg-1.062
 Rib-1.054
Token |
Token ID930
Feature activation-8.66e-01

Pos logit contributions
zsche+1.581
:,+1.532
athing+1.488
ilan+1.486
 df+1.430
Neg logit contributions
ooters-1.498
 Krish-1.357
 Chaser-1.337
 Gutenberg-1.323
 Rib-1.307
Token Medline
Token ID34582
Feature activation-2.64e+00

Pos logit contributions
zsche+0.572
athing+0.533
:,+0.532
ilan+0.529
 df+0.496
Neg logit contributions
ooters-0.693
 Krish-0.642
 Chaser-0.632
 Gutenberg-0.629
 Rib-0.627
Token |
Token ID930
Feature activation-4.34e-01

Pos logit contributions
zsche+2.012
:,+1.944
ilan+1.892
athing+1.887
 df+1.826
Neg logit contributions
ooters-1.819
 Krish-1.652
 Chaser-1.630
 Gutenberg-1.614
 Rib-1.587
Token ISI
Token ID22048
Feature activation-4.09e+00

Pos logit contributions
zsche+0.266
:,+0.250
athing+0.247
ilan+0.246
 df+0.231
Neg logit contributions
ooters-0.367
 Krish-0.339
 Chaser-0.334
 Gutenberg-0.332
 Rib-0.331
Token
Token ID198
Feature activation-1.01e+00

Pos logit contributions
zsche+3.019
:,+2.904
athing+2.832
ilan+2.812
 df+2.719
Neg logit contributions
ooters-2.837
 Chaser-2.594
 Krish-2.590
 Gutenberg-2.510
 Stephenson-2.500
Token
Token ID198
Feature activation-1.17e+00

Pos logit contributions
zsche+0.825
:,+0.788
athing+0.781
ilan+0.778
 df+0.746
Neg logit contributions
ooters-0.651
 Krish-0.587
 Chaser-0.574
 Gutenberg-0.571
 Rib-0.567
TokenMy
Token ID3666
Feature activation-7.87e-01

Pos logit contributions
zsche+0.881
:,+0.850
ilan+0.834
athing+0.831
Internal+0.792
Neg logit contributions
ooters-0.818
 Krish-0.741
 Chaser-0.733
 Gutenberg-0.727
 Rib-0.721
Tokeners
Token ID364
Feature activation-7.66e-01

Pos logit contributions
zsche+0.640
:,+0.611
athing+0.606
ilan+0.604
 df+0.578
Neg logit contributions
ooters-0.508
 Krish-0.458
 Chaser-0.448
 Gutenberg-0.445
 Rib-0.443
Token,
Token ID11
Feature activation-1.84e-01

Pos logit contributions
zsche+0.577
:,+0.547
athing+0.541
ilan+0.540
 df+0.514
Neg logit contributions
ooters-0.541
 Krish-0.492
 Chaser-0.485
 Gutenberg-0.480
 Rib-0.476
Token Scholar
Token ID11713
Feature activation-1.24e+00

Pos logit contributions
zsche+1.550
:,+1.492
athing+1.473
ilan+1.465
 df+1.412
Neg logit contributions
ooters-1.172
 Krish-1.048
 Chaser-1.022
 Gutenberg-1.014
 Rib-1.008
Token Crossref
Token ID23613
Feature activation-1.66e+00

Pos logit contributions
zsche+0.962
:,+0.918
athing+0.908
ilan+0.905
 df+0.868
Neg logit contributions
ooters-0.847
 Krish-0.768
 Chaser-0.753
 Gutenberg-0.749
 Rib-0.743
Token |
Token ID930
Feature activation-3.32e-01

Pos logit contributions
zsche+1.250
:,+1.207
athing+1.175
ilan+1.172
 df+1.128
Neg logit contributions
ooters-1.169
 Krish-1.058
 Chaser-1.042
 Gutenberg-1.032
 Rib-1.020
Token Medline
Token ID34582
Feature activation-1.87e+00

Pos logit contributions
zsche+0.214
:,+0.202
athing+0.200
ilan+0.199
 df+0.188
Neg logit contributions
ooters-0.270
 Krish-0.249
 Chaser-0.245
 Gutenberg-0.243
 Rib-0.242
Token |
Token ID930
Feature activation-5.96e-01

Pos logit contributions
zsche+1.463
:,+1.406
ilan+1.374
athing+1.374
 df+1.325
Neg logit contributions
ooters-1.263
 Krish-1.140
 Chaser-1.124
 Gutenberg-1.111
 Rib-1.093
Token ISI
Token ID22048
Feature activation-4.05e+00

Pos logit contributions
zsche+0.365
athing+0.338
:,+0.336
ilan+0.334
 df+0.312
Neg logit contributions
ooters-0.508
 Krish-0.471
 Chaser-0.464
 Gutenberg-0.463
 Rib-0.461
Token
Token ID198
Feature activation-9.59e-01

Pos logit contributions
zsche+2.961
:,+2.871
athing+2.782
ilan+2.762
 df+2.681
Neg logit contributions
ooters-2.863
 Chaser-2.612
 Krish-2.599
 Gutenberg-2.524
 Stephenson-2.493
Token
Token ID198
Feature activation-1.09e+00

Pos logit contributions
zsche+0.782
:,+0.747
athing+0.741
ilan+0.738
 df+0.707
Neg logit contributions
ooters-0.615
 Krish-0.555
 Chaser-0.543
 Gutenberg-0.539
 Rib-0.536
TokenP
Token ID47
Feature activation+3.12e-01

Pos logit contributions
zsche+0.885
:,+0.855
ilan+0.843
athing+0.838
Internal+0.802
Neg logit contributions
ooters-0.702
 Krish-0.629
 Chaser-0.620
 Gutenberg-0.618
 Rib-0.607
Tokenel
Token ID417
Feature activation+1.30e-01

Pos logit contributions
ooters+0.222
 Krish+0.201
 Chaser+0.197
 Gutenberg+0.196
 Rib+0.195
Neg logit contributions
zsche-0.234
:,-0.223
athing-0.220
ilan-0.219
 df-0.210
Tokenleg
Token ID1455
Feature activation-5.94e-02

Pos logit contributions
ooters+0.094
 Krish+0.085
 Chaser+0.084
 Rib+0.084
 Gutenberg+0.083
Neg logit contributions
zsche-0.095
:,-0.091
athing-0.089
ilan-0.089
 df-0.086
Token to
Token ID284
Feature activation+1.24e+00

Pos logit contributions
ooters+0.262
 Krish+0.235
 Chaser+0.231
 Gutenberg+0.229
 Rib+0.227
Neg logit contributions
zsche-0.340
:,-0.326
athing-0.324
ilan-0.323
 df-0.309
Token this
Token ID428
Feature activation+8.83e-01

Pos logit contributions
ooters+0.740
 Krish+0.665
 Rib+0.648
 Chaser+0.646
 Gutenberg+0.643
Neg logit contributions
zsche-1.071
:,-1.020
ilan-1.016
athing-1.014
 df-0.962
Token guy
Token ID3516
Feature activation+1.14e+00

Pos logit contributions
ooters+0.549
 Krish+0.502
 Chaser+0.488
 Gutenberg+0.485
 Rib+0.484
Neg logit contributions
zsche-0.738
:,-0.696
ilan-0.693
athing-0.692
 df-0.657
Token.
Token ID13
Feature activation-6.05e-03

Pos logit contributions
ooters+0.745
 Krish+0.679
 Chaser+0.661
 Rib+0.658
 Gutenberg+0.655
Neg logit contributions
zsche-0.921
:,-0.870
athing-0.869
ilan-0.867
 df-0.826
Token Not
Token ID1892
Feature activation+1.52e-01

Pos logit contributions
zsche+0.005
:,+0.005
athing+0.005
ilan+0.005
 df+0.005
Neg logit contributions
ooters-0.004
 Krish-0.003
 Chaser-0.003
 Gutenberg-0.003
 Rib-0.003
Token only
Token ID691
Feature activation-4.03e+00

Pos logit contributions
ooters+0.103
 Krish+0.094
 Chaser+0.092
 Gutenberg+0.091
 Rib+0.091
Neg logit contributions
zsche-0.118
:,-0.113
athing-0.111
ilan-0.111
 df-0.106
Token did
Token ID750
Feature activation-1.27e+00

Pos logit contributions
zsche+3.191
:,+3.103
athing+3.083
ilan+3.030
 df+2.935
Neg logit contributions
ooters-2.612
 Krish-2.328
 Chaser-2.297
 Gutenberg-2.293
 Rib-2.241
Token he
Token ID339
Feature activation+1.33e+00

Pos logit contributions
zsche+1.073
:,+1.031
athing+1.021
ilan+1.015
 df+0.979
Neg logit contributions
ooters-0.777
 Krish-0.692
 Chaser-0.678
 Gutenberg-0.674
 Rib-0.665
Token plan
Token ID1410
Feature activation-6.93e-02

Pos logit contributions
ooters+0.916
 Krish+0.835
 Chaser+0.813
 Rib+0.806
 Gutenberg+0.798
Neg logit contributions
zsche-1.030
:,-0.966
ilan-0.965
athing-0.954
Internal-0.901
Token a
Token ID257
Feature activation-1.37e-01

Pos logit contributions
zsche+0.056
:,+0.053
athing+0.053
ilan+0.053
 df+0.051
Neg logit contributions
ooters-0.045
 Krish-0.041
 Chaser-0.040
 Gutenberg-0.040
 Rib-0.039
Token huge
Token ID3236
Feature activation+3.81e-01

Pos logit contributions
zsche+0.111
:,+0.106
athing+0.105
ilan+0.105
 df+0.100
Neg logit contributions
ooters-0.089
 Krish-0.081
 Chaser-0.079
 Gutenberg-0.078
 Rib-0.078
Token I
Token ID314
Feature activation-7.42e-01

Pos logit contributions
zsche+0.510
:,+0.487
athing+0.483
ilan+0.482
 df+0.462
Neg logit contributions
ooters-0.372
 Krish-0.334
 Chaser-0.326
 Gutenberg-0.324
 Rib-0.322
Token saw
Token ID2497
Feature activation-1.38e-01

Pos logit contributions
zsche+0.590
:,+0.561
athing+0.555
ilan+0.554
 df+0.529
Neg logit contributions
ooters-0.494
 Krish-0.447
 Chaser-0.435
 Gutenberg-0.433
 Rib-0.431
Token how
Token ID703
Feature activation+4.01e-01

Pos logit contributions
zsche+0.111
:,+0.106
athing+0.105
ilan+0.105
 df+0.100
Neg logit contributions
ooters-0.090
 Krish-0.081
 Chaser-0.080
 Gutenberg-0.079
 Rib-0.079
Token it
Token ID340
Feature activation+6.73e-01

Pos logit contributions
ooters+0.260
 Krish+0.237
 Chaser+0.231
 Gutenberg+0.230
 Rib+0.229
Neg logit contributions
zsche-0.325
:,-0.309
athing-0.307
ilan-0.306
 df-0.291
Token not
Token ID407
Feature activation-8.98e-03

Pos logit contributions
ooters+0.467
 Krish+0.424
 Chaser+0.414
 Gutenberg+0.412
 Rib+0.410
Neg logit contributions
zsche-0.515
:,-0.490
ilan-0.485
athing-0.484
 df-0.460
Token only
Token ID691
Feature activation-4.03e+00

Pos logit contributions
zsche+0.007
:,+0.007
athing+0.006
ilan+0.006
 df+0.006
Neg logit contributions
ooters-0.006
 Krish-0.006
 Chaser-0.006
 Gutenberg-0.006
 Rib-0.005
Token truly
Token ID4988
Feature activation-6.33e-01

Pos logit contributions
zsche+2.799
:,+2.682
athing+2.645
ilan+2.606
 df+2.542
Neg logit contributions
ooters-2.989
 Krish-2.730
 Chaser-2.716
 Gutenberg-2.682
 Rib-2.676
Token helped
Token ID4193
Feature activation+1.10e+00

Pos logit contributions
zsche+0.458
:,+0.435
athing+0.431
ilan+0.430
 df+0.408
Neg logit contributions
ooters-0.465
 Krish-0.425
 Chaser-0.417
 Gutenberg-0.415
 Rib-0.412
Token sick
Token ID6639
Feature activation+1.14e+00

Pos logit contributions
ooters+0.719
 Krish+0.656
 Chaser+0.634
 Gutenberg+0.633
 Rib+0.628
Neg logit contributions
zsche-0.894
:,-0.837
athing-0.837
ilan-0.837
 df-0.795
Token people
Token ID661
Feature activation+4.43e-01

Pos logit contributions
ooters+0.691
 Krish+0.619
 Chaser+0.598
 Rib+0.598
 Gutenberg+0.591
Neg logit contributions
zsche-0.976
:,-0.921
athing-0.914
ilan-0.910
 df-0.877
Token,
Token ID11
Feature activation+8.06e-01

Pos logit contributions
ooters+0.290
 Krish+0.263
 Chaser+0.257
 Gutenberg+0.254
 Rib+0.253
Neg logit contributions
zsche-0.359
ilan-0.337
:,-0.335
athing-0.334
 df-0.318
Token advice
Token ID5608
Feature activation-1.34e-01

Pos logit contributions
zsche+0.401
:,+0.380
athing+0.376
ilan+0.375
 df+0.357
Neg logit contributions
ooters-0.373
 Krish-0.341
 Chaser-0.336
 Gutenberg-0.333
 Rib-0.333
Token of
Token ID286
Feature activation+4.37e-01

Pos logit contributions
zsche+0.106
:,+0.101
athing+0.100
ilan+0.100
 df+0.095
Neg logit contributions
ooters-0.089
 Krish-0.081
 Chaser-0.079
 Gutenberg-0.079
 Rib-0.078
Token Gand
Token ID17727
Feature activation+1.71e-01

Pos logit contributions
ooters+0.282
 Krish+0.258
 Chaser+0.252
 Rib+0.252
 Gutenberg+0.250
Neg logit contributions
zsche-0.355
:,-0.337
athing-0.336
ilan-0.334
 df-0.317
Tokenalf
Token ID1604
Feature activation-8.24e-01

Pos logit contributions
ooters+0.179
 Rib+0.165
 Krish+0.165
 Gutenberg+0.165
 Chaser+0.164
Neg logit contributions
zsche-0.072
:,-0.064
ilan-0.063
athing-0.062
 df-0.057
Token,
Token ID11
Feature activation-8.44e-01

Pos logit contributions
zsche+0.640
:,+0.607
athing+0.603
ilan+0.602
 df+0.574
Neg logit contributions
ooters-0.560
 Krish-0.510
 Chaser-0.500
 Gutenberg-0.497
 Rib-0.496
Token chooses
Token ID19769
Feature activation-3.96e+00

Pos logit contributions
zsche+0.610
:,+0.575
athing+0.570
ilan+0.570
 df+0.539
Neg logit contributions
ooters-0.621
 Krish-0.571
 Chaser-0.561
 Gutenberg-0.557
 Rib-0.555
Token companions
Token ID19429
Feature activation-1.28e+00

Pos logit contributions
zsche+3.197
:,+3.110
athing+3.054
ilan+3.045
 df+2.937
Neg logit contributions
ooters-2.541
 Krish-2.253
 Chaser-2.191
 Gutenberg-2.188
Barn-2.145
Token for
Token ID329
Feature activation-4.91e-01

Pos logit contributions
zsche+1.011
:,+0.965
athing+0.956
ilan+0.952
 df+0.911
Neg logit contributions
ooters-0.852
 Krish-0.771
 Chaser-0.755
 Gutenberg-0.751
 Rib-0.746
Token him
Token ID683
Feature activation-1.62e-01

Pos logit contributions
zsche+0.414
:,+0.393
athing+0.390
ilan+0.390
 df+0.372
Neg logit contributions
ooters-0.302
 Krish-0.273
 Chaser-0.268
 Gutenberg-0.265
 Rib-0.265
Token.
Token ID13
Feature activation-6.37e-01

Pos logit contributions
zsche+0.130
:,+0.122
athing+0.122
ilan+0.122
 df+0.116
Neg logit contributions
ooters-0.106
 Krish-0.096
 Chaser-0.094
 Gutenberg-0.094
 Rib-0.093
Token The
Token ID383
Feature activation-9.76e-01

Pos logit contributions
zsche+0.533
:,+0.505
athing+0.503
ilan+0.501
 df+0.478
Neg logit contributions
ooters-0.397
 Krish-0.361
 Chaser-0.353
 Gutenberg-0.351
 Rib-0.350
Token national
Token ID2260
Feature activation-7.32e-01

Pos logit contributions
zsche+0.580
:,+0.553
athing+0.549
ilan+0.546
 df+0.523
Neg logit contributions
ooters-0.488
 Krish-0.442
 Chaser-0.432
 Gutenberg-0.430
 Rib-0.427
Token drug
Token ID2563
Feature activation-2.69e-01

Pos logit contributions
zsche+0.582
:,+0.555
athing+0.550
ilan+0.547
 df+0.525
Neg logit contributions
ooters-0.486
 Krish-0.439
 Chaser-0.430
 Gutenberg-0.427
 Rib-0.425
Token program
Token ID1430
Feature activation-1.36e+00

Pos logit contributions
zsche+0.197
:,+0.187
athing+0.185
ilan+0.185
 df+0.176
Neg logit contributions
ooters-0.195
 Krish-0.178
 Chaser-0.175
 Gutenberg-0.174
 Rib-0.173
Token is
Token ID318
Feature activation-1.33e+00

Pos logit contributions
zsche+1.003
:,+0.953
athing+0.944
ilan+0.941
 df+0.896
Neg logit contributions
ooters-0.981
 Krish-0.895
 Chaser-0.877
 Gutenberg-0.873
 Rib-0.868
Token not
Token ID407
Feature activation-2.04e+00

Pos logit contributions
zsche+1.063
:,+1.019
athing+1.009
ilan+1.006
 df+0.964
Neg logit contributions
ooters-0.866
 Krish-0.780
 Chaser-0.766
 Gutenberg-0.761
 Rib-0.755
Token only
Token ID691
Feature activation-3.96e+00

Pos logit contributions
zsche+1.630
:,+1.563
athing+1.547
ilan+1.543
 df+1.480
Neg logit contributions
ooters-1.334
 Krish-1.200
 Chaser-1.178
 Gutenberg-1.164
 Rib-1.157
Token possible
Token ID1744
Feature activation+3.58e-01

Pos logit contributions
zsche+3.141
:,+3.036
athing+2.997
ilan+2.993
 df+2.876
Neg logit contributions
ooters-2.565
 Krish-2.298
 Chaser-2.273
 Gutenberg-2.245
 Rib-2.223
Token
Token ID851
Feature activation+2.16e-01

Pos logit contributions
ooters+0.233
 Krish+0.211
 Chaser+0.206
 Gutenberg+0.205
 Rib+0.204
Neg logit contributions
zsche-0.290
:,-0.274
athing-0.274
ilan-0.273
 df-0.260
Token it
Token ID340
Feature activation+1.62e+00

Pos logit contributions
ooters+0.145
 Krish+0.132
 Chaser+0.129
 Gutenberg+0.128
 Rib+0.128
Neg logit contributions
zsche-0.169
:,-0.161
athing-0.160
ilan-0.159
 df-0.152
Token is
Token ID318
Feature activation+1.79e+00

Pos logit contributions
ooters+1.153
 Krish+1.058
 Chaser+1.030
 Gutenberg+1.028
 Rib+1.021
Neg logit contributions
zsche-1.204
:,-1.135
ilan-1.126
athing-1.121
 df-1.066
Token already
Token ID1541
Feature activation-5.54e-02

Pos logit contributions
ooters+1.185
 Krish+1.085
 Chaser+1.061
 Gutenberg+1.056
 Rib+1.050
Neg logit contributions
zsche-1.420
:,-1.339
athing-1.335
ilan-1.331
 df-1.267
Token judicial
Token ID13657
Feature activation-6.69e-01

Pos logit contributions
ooters+0.291
 Krish+0.269
 Rib+0.266
 Gutenberg+0.265
 Stephenson+0.264
Neg logit contributions
zsche-0.268
:,-0.251
athing-0.250
ilan-0.249
 df-0.234
Token races
Token ID9558
Feature activation+1.61e-01

Pos logit contributions
zsche+0.513
:,+0.488
athing+0.484
ilan+0.482
 df+0.460
Neg logit contributions
ooters-0.463
 Krish-0.420
 Chaser-0.412
 Gutenberg-0.410
 Rib-0.407
Token because
Token ID780
Feature activation+5.16e-01

Pos logit contributions
ooters+0.106
 Krish+0.097
 Chaser+0.094
 Gutenberg+0.094
 Rib+0.094
Neg logit contributions
zsche-0.128
:,-0.121
ilan-0.121
athing-0.121
 df-0.115
Token it
Token ID340
Feature activation+2.66e-01

Pos logit contributions
ooters+0.319
 Krish+0.289
 Gutenberg+0.281
 Chaser+0.280
 Rib+0.279
Neg logit contributions
zsche-0.433
:,-0.411
athing-0.411
ilan-0.411
 df-0.390
Token not
Token ID407
Feature activation+5.46e-01

Pos logit contributions
ooters+0.187
 Krish+0.171
 Chaser+0.167
 Gutenberg+0.167
 Rib+0.166
Neg logit contributions
zsche-0.199
:,-0.189
ilan-0.188
athing-0.186
 df-0.177
Token only
Token ID691
Feature activation-3.96e+00

Pos logit contributions
ooters+0.377
 Krish+0.342
 Gutenberg+0.335
 Chaser+0.334
 Rib+0.331
Neg logit contributions
zsche-0.418
:,-0.397
athing-0.393
ilan-0.392
 df-0.374
Token ske
Token ID6146
Feature activation+1.02e+00

Pos logit contributions
zsche+2.824
:,+2.696
athing+2.677
ilan+2.636
 df+2.547
Neg logit contributions
ooters-2.912
 Krish-2.647
 Chaser-2.616
 Gutenberg-2.590
 Rib-2.576
Tokenws
Token ID18504
Feature activation-1.49e+00

Pos logit contributions
ooters+0.543
 Krish+0.476
 Chaser+0.460
 Gutenberg+0.460
 Rib+0.456
Neg logit contributions
zsche-0.950
:,-0.914
athing-0.903
ilan-0.902
 df-0.870
Token voters
Token ID4446
Feature activation+5.53e-01

Pos logit contributions
zsche+1.219
:,+1.170
athing+1.160
ilan+1.152
 df+1.110
Neg logit contributions
ooters-0.949
 Krish-0.847
 Chaser-0.833
 Gutenberg-0.824
 Rib-0.818
Token
Token ID447
Feature activation+1.09e+00

Pos logit contributions
ooters+0.355
 Krish+0.321
 Chaser+0.313
 Rib+0.311
 Gutenberg+0.311
Neg logit contributions
zsche-0.452
:,-0.427
ilan-0.426
athing-0.424
 df-0.406
Token
Token ID247
Feature activation+3.04e-01

Pos logit contributions
ooters+0.726
 Krish+0.661
 Chaser+0.647
 Rib+0.641
 Gutenberg+0.637
Neg logit contributions
zsche-0.863
ilan-0.817
athing-0.816
:,-0.815
 df-0.769
Token to
Token ID284
Feature activation-7.39e-01

Pos logit contributions
zsche+0.271
:,+0.261
athing+0.259
ilan+0.257
 df+0.246
Neg logit contributions
ooters-0.212
 Krish-0.190
 Chaser-0.186
 Gutenberg-0.184
 Rib-0.183
Token temperatures
Token ID10101
Feature activation-1.79e+00

Pos logit contributions
zsche+0.563
:,+0.536
athing+0.531
ilan+0.529
 df+0.504
Neg logit contributions
ooters-0.515
 Krish-0.469
 Chaser-0.460
 Gutenberg-0.458
 Rib-0.455
Token that
Token ID326
Feature activation-1.38e+00

Pos logit contributions
zsche+1.421
:,+1.374
athing+1.354
ilan+1.341
 df+1.298
Neg logit contributions
ooters-1.182
 Krish-1.064
 Chaser-1.041
 Gutenberg-1.032
 Rib-1.023
Token reached
Token ID4251
Feature activation-2.07e-01

Pos logit contributions
zsche+1.119
:,+1.067
athing+1.059
ilan+1.055
 df+1.008
Neg logit contributions
ooters-0.898
 Krish-0.811
 Chaser-0.796
 Gutenberg-0.790
 Rib-0.785
Token 185
Token ID22855
Feature activation-1.42e+00

Pos logit contributions
zsche+0.172
:,+0.163
athing+0.163
ilan+0.162
 df+0.155
Neg logit contributions
ooters-0.130
 Krish-0.117
 Chaser-0.115
 Gutenberg-0.114
 Rib-0.113
Token degrees
Token ID7370
Feature activation-3.95e+00

Pos logit contributions
zsche+0.908
:,+0.849
athing+0.847
ilan+0.842
 df+0.787
Neg logit contributions
ooters-1.160
 Krish-1.076
 Chaser-1.056
 Gutenberg-1.052
 Rib-1.044
Token F
Token ID376
Feature activation+1.78e-02

Pos logit contributions
zsche+2.394
:,+2.311
athing+2.248
ilan+2.246
 df+2.157
Neg logit contributions
ooters-3.310
 Krish-3.036
 Chaser-2.997
 Gutenberg-2.960
 Rib-2.956
Token and
Token ID290
Feature activation-2.28e-01

Pos logit contributions
ooters+0.012
 Krish+0.011
 Chaser+0.011
 Gutenberg+0.011
 Rib+0.011
Neg logit contributions
zsche-0.014
:,-0.013
athing-0.013
ilan-0.013
 df-0.012
Token 85
Token ID7600
Feature activation-7.41e-01

Pos logit contributions
zsche+0.167
:,+0.158
ilan+0.156
athing+0.156
 df+0.147
Neg logit contributions
ooters-0.165
 Krish-0.152
 Gutenberg-0.150
 Chaser-0.149
 Rib-0.148
Token percent
Token ID1411
Feature activation-5.46e-01

Pos logit contributions
zsche+0.463
:,+0.437
athing+0.432
ilan+0.430
 df+0.407
Neg logit contributions
ooters-0.616
 Krish-0.568
 Chaser-0.559
 Gutenberg-0.555
 Rib-0.553
Token relative
Token ID3585
Feature activation+7.64e-02

Pos logit contributions
zsche+0.301
:,+0.278
athing+0.278
ilan+0.272
 df+0.254
Neg logit contributions
ooters-0.497
 Krish-0.463
 Gutenberg-0.458
 Chaser-0.456
 Rib-0.454
Token changed
Token ID3421
Feature activation-7.06e-01

Pos logit contributions
zsche+1.200
:,+1.152
athing+1.137
ilan+1.131
 df+1.086
Neg logit contributions
ooters-1.066
 Krish-0.963
 Chaser-0.942
 Gutenberg-0.936
 Rib-0.931
Token significantly
Token ID5566
Feature activation-7.24e-01

Pos logit contributions
zsche+0.564
:,+0.538
athing+0.533
ilan+0.532
 df+0.508
Neg logit contributions
ooters-0.465
 Krish-0.421
 Chaser-0.412
 Gutenberg-0.410
 Rib-0.408
Token,
Token ID11
Feature activation-3.64e-01

Pos logit contributions
zsche+0.552
:,+0.525
athing+0.521
ilan+0.519
 df+0.495
Neg logit contributions
ooters-0.503
 Krish-0.457
 Chaser-0.448
 Gutenberg-0.446
 Rib-0.443
Token but
Token ID475
Feature activation+4.21e-01

Pos logit contributions
zsche+0.294
:,+0.280
athing+0.278
ilan+0.277
 df+0.264
Neg logit contributions
ooters-0.237
 Krish-0.215
 Chaser-0.210
 Gutenberg-0.209
 Rib-0.208
Token other
Token ID584
Feature activation+1.61e-01

Pos logit contributions
ooters+0.272
 Krish+0.247
 Chaser+0.241
 Gutenberg+0.240
 Rib+0.239
Neg logit contributions
zsche-0.343
:,-0.326
athing-0.324
ilan-0.323
 df-0.308
Token than
Token ID621
Feature activation-3.94e+00

Pos logit contributions
ooters+0.107
 Krish+0.096
 Chaser+0.094
 Gutenberg+0.094
 Rib+0.093
Neg logit contributions
zsche-0.128
:,-0.122
athing-0.121
ilan-0.121
 df-0.115
Token that
Token ID326
Feature activation-5.25e-01

Pos logit contributions
zsche+3.105
:,+3.039
athing+2.997
ilan+2.953
 df+2.878
Neg logit contributions
ooters-2.573
 Krish-2.276
 Chaser-2.243
 Gutenberg-2.217
Barn-2.194
Token,
Token ID11
Feature activation-3.49e-01

Pos logit contributions
zsche+0.413
:,+0.394
athing+0.391
ilan+0.389
 df+0.372
Neg logit contributions
ooters-0.352
 Krish-0.319
 Chaser-0.312
 Gutenberg-0.311
 Rib-0.309
Token it
Token ID340
Feature activation+2.57e-01

Pos logit contributions
zsche+0.282
:,+0.270
athing+0.267
ilan+0.266
 df+0.255
Neg logit contributions
ooters-0.226
 Krish-0.204
 Chaser-0.199
 Gutenberg-0.198
 Rib-0.197
Token mostly
Token ID4632
Feature activation+8.19e-01

Pos logit contributions
ooters+0.165
 Krish+0.149
 Chaser+0.145
 Gutenberg+0.145
 Rib+0.144
Neg logit contributions
zsche-0.210
:,-0.200
athing-0.198
ilan-0.198
 df-0.189
Token just
Token ID655
Feature activation+2.05e-01

Pos logit contributions
ooters+0.534
 Krish+0.484
 Chaser+0.472
 Gutenberg+0.471
 Rib+0.468
Neg logit contributions
zsche-0.661
:,-0.627
athing-0.623
ilan-0.623
 df-0.592
Token body
Token ID1767
Feature activation+2.85e-02

Pos logit contributions
ooters+0.199
 Krish+0.182
 Chaser+0.178
 Gutenberg+0.177
 Rib+0.177
Neg logit contributions
zsche-0.212
:,-0.201
athing-0.199
ilan-0.198
 df-0.189
Token.
Token ID13
Feature activation+8.90e-01

Pos logit contributions
ooters+0.019
 Krish+0.017
 Chaser+0.017
 Gutenberg+0.016
 Rib+0.016
Neg logit contributions
zsche-0.023
:,-0.022
athing-0.022
ilan-0.021
 df-0.021
Token I
Token ID314
Feature activation+5.21e-01

Pos logit contributions
ooters+0.530
 Krish+0.483
 Chaser+0.476
 Rib+0.470
 Gutenberg+0.467
Neg logit contributions
zsche-0.772
:,-0.726
athing-0.725
ilan-0.724
 df-0.690
Token feel
Token ID1254
Feature activation+1.20e+00

Pos logit contributions
ooters+0.330
 Krish+0.298
 Chaser+0.289
 Rib+0.287
 Gutenberg+0.286
Neg logit contributions
zsche-0.430
:,-0.408
ilan-0.406
athing-0.404
 df-0.386
Token as
Token ID355
Feature activation-9.67e-01

Pos logit contributions
ooters+0.818
 Krish+0.749
 Chaser+0.737
 Rib+0.731
 Gutenberg+0.727
Neg logit contributions
zsche-0.938
:,-0.885
athing-0.877
ilan-0.874
 df-0.830
Token though
Token ID996
Feature activation-3.94e+00

Pos logit contributions
zsche+0.851
:,+0.817
athing+0.811
ilan+0.809
 df+0.777
Neg logit contributions
ooters-0.558
 Krish-0.495
 Chaser-0.483
 Gutenberg-0.481
 Rib-0.476
Token Eve
Token ID12882
Feature activation-9.15e-01

Pos logit contributions
zsche+3.091
:,+3.021
athing+2.970
ilan+2.934
 df+2.853
Neg logit contributions
ooters-2.582
 Krish-2.286
 Gutenberg-2.231
 Chaser-2.224
 Rib-2.194
Token was
Token ID373
Feature activation-8.48e-01

Pos logit contributions
zsche+0.684
:,+0.649
ilan+0.642
athing+0.642
 df+0.611
Neg logit contributions
ooters-0.649
 Krish-0.593
 Chaser-0.582
 Gutenberg-0.577
 Rib-0.576
Token like
Token ID588
Feature activation-1.66e+00

Pos logit contributions
zsche+0.655
:,+0.623
athing+0.617
ilan+0.616
 df+0.587
Neg logit contributions
ooters-0.582
 Krish-0.529
 Chaser-0.518
 Gutenberg-0.514
 Rib-0.513
Token,
Token ID11
Feature activation-2.05e+00

Pos logit contributions
zsche+1.304
:,+1.253
athing+1.238
ilan+1.229
 df+1.181
Neg logit contributions
ooters-1.119
 Krish-1.007
 Chaser-0.983
 Gutenberg-0.982
 Rib-0.971
Token
Token ID564
Feature activation-6.84e-01

Pos logit contributions
zsche+1.797
:,+1.732
athing+1.717
ilan+1.705
 df+1.648
Neg logit contributions
ooters-1.183
 Krish-1.047
 Chaser-1.019
 Gutenberg-1.017
 Rib-1.006
Token GDP
Token ID12396
Feature activation+3.77e-01

Pos logit contributions
ooters+0.561
 Krish+0.511
 Gutenberg+0.500
 Chaser+0.499
 Rib+0.498
Neg logit contributions
zsche-0.609
:,-0.578
athing-0.573
ilan-0.568
 df-0.540
Token have
Token ID423
Feature activation-5.45e-02

Pos logit contributions
ooters+0.248
 Krish+0.224
 Chaser+0.219
 Gutenberg+0.219
 Rib+0.217
Neg logit contributions
zsche-0.301
:,-0.286
athing-0.284
ilan-0.283
 df-0.270
Token already
Token ID1541
Feature activation-8.04e-01

Pos logit contributions
zsche+0.041
:,+0.039
athing+0.039
ilan+0.038
 df+0.036
Neg logit contributions
ooters-0.039
 Krish-0.035
 Chaser-0.034
 Gutenberg-0.034
 Rib-0.034
Token changed
Token ID3421
Feature activation-5.59e-01

Pos logit contributions
zsche+0.560
:,+0.527
athing+0.521
ilan+0.521
 df+0.491
Neg logit contributions
ooters-0.615
 Krish-0.566
 Chaser-0.555
 Gutenberg-0.552
 Rib-0.549
Token not
Token ID407
Feature activation-1.18e+00

Pos logit contributions
zsche+0.448
:,+0.426
athing+0.423
ilan+0.421
 df+0.403
Neg logit contributions
ooters-0.368
 Krish-0.333
 Chaser-0.326
 Gutenberg-0.324
 Rib-0.322
Token only
Token ID691
Feature activation-3.94e+00

Pos logit contributions
zsche+0.946
:,+0.901
athing+0.891
ilan+0.887
 df+0.849
Neg logit contributions
ooters-0.779
 Krish-0.701
 Gutenberg-0.686
 Chaser-0.684
 Rib-0.680
Token their
Token ID511
Feature activation-3.55e-01

Pos logit contributions
zsche+3.079
:,+2.955
athing+2.923
ilan+2.913
 df+2.797
Neg logit contributions
ooters-2.642
 Krish-2.377
 Chaser-2.341
 Gutenberg-2.322
 Rib-2.286
Token legal
Token ID2742
Feature activation-5.30e-01

Pos logit contributions
zsche+0.263
:,+0.249
athing+0.246
ilan+0.245
 df+0.233
Neg logit contributions
ooters-0.255
 Krish-0.234
 Chaser-0.229
 Gutenberg-0.228
 Rib-0.227
Token headquarters
Token ID10043
Feature activation+1.07e+00

Pos logit contributions
zsche+0.359
:,+0.338
athing+0.334
ilan+0.333
 df+0.315
Neg logit contributions
ooters-0.415
 Krish-0.383
 Chaser-0.375
 Rib-0.373
 Gutenberg-0.373
Token but
Token ID475
Feature activation+3.27e+00

Pos logit contributions
ooters+0.649
 Krish+0.589
 Rib+0.572
 Chaser+0.571
 Gutenberg+0.567
Neg logit contributions
zsche-0.892
:,-0.847
ilan-0.846
athing-0.845
 df-0.807
Token,
Token ID11
Feature activation+3.75e-01

Pos logit contributions
ooters+1.845
 Krish+1.664
 Gutenberg+1.612
 Chaser+1.607
 Rib+1.605
Neg logit contributions
zsche-2.895
athing-2.732
:,-2.729
ilan-2.720
 df-2.602
Token shopping
Token ID9735
Feature activation+1.05e+00

Pos logit contributions
ooters+0.210
 Krish+0.191
 Gutenberg+0.186
 Chaser+0.185
 Rib+0.185
Neg logit contributions
zsche-0.234
:,-0.221
athing-0.220
ilan-0.219
 df-0.208
Token malls
Token ID43890
Feature activation+1.01e+00

Pos logit contributions
ooters+0.742
 Krish+0.687
 Rib+0.659
 Gutenberg+0.659
 Chaser+0.655
Neg logit contributions
zsche-0.786
athing-0.738
:,-0.735
ilan-0.733
 df-0.698
Token,
Token ID11
Feature activation+9.80e-01

Pos logit contributions
ooters+0.674
 Krish+0.612
 Chaser+0.595
 Gutenberg+0.593
 Rib+0.592
Neg logit contributions
zsche-0.795
:,-0.755
athing-0.751
ilan-0.748
 df-0.714
Token car
Token ID1097
Feature activation-8.14e-01

Pos logit contributions
ooters+0.678
 Krish+0.622
 Gutenberg+0.600
 Rib+0.600
 Chaser+0.599
Neg logit contributions
zsche-0.758
:,-0.716
athing-0.712
ilan-0.708
 df-0.673
Token show
Token ID905
Feature activation+7.82e-02

Pos logit contributions
zsche+0.605
:,+0.572
athing+0.567
ilan+0.566
 df+0.538
Neg logit contributions
ooters-0.587
 Krish-0.535
 Chaser-0.520
 Gutenberg-0.517
 Rib-0.516
Token rooms
Token ID9519
Feature activation+2.69e+00

Pos logit contributions
ooters+0.050
 Krish+0.045
 Chaser+0.044
 Rib+0.043
 Gutenberg+0.043
Neg logit contributions
zsche-0.065
:,-0.061
athing-0.061
ilan-0.061
 df-0.058
Token,
Token ID11
Feature activation+8.86e-01

Pos logit contributions
ooters+1.750
 Krish+1.577
 Rib+1.521
 Chaser+1.517
 Gutenberg+1.517
Neg logit contributions
zsche-2.172
:,-2.051
athing-2.050
ilan-2.044
 df-1.947
Token banks
Token ID6341
Feature activation+2.81e-01

Pos logit contributions
ooters+0.608
 Krish+0.556
 Chaser+0.537
 Rib+0.537
 Gutenberg+0.537
Neg logit contributions
zsche-0.690
:,-0.653
athing-0.649
ilan-0.646
 df-0.614
Token and
Token ID290
Feature activation+6.75e-01

Pos logit contributions
ooters+0.191
 Krish+0.173
 Chaser+0.169
 Gutenberg+0.168
 Rib+0.167
Neg logit contributions
zsche-0.219
:,-0.208
athing-0.206
ilan-0.206
 df-0.196
Token houses
Token ID7777
Feature activation+1.43e+00

Pos logit contributions
ooters+0.446
 Krish+0.406
 Chaser+0.393
 Gutenberg+0.392
 Rib+0.392
Neg logit contributions
zsche-0.542
:,-0.515
athing-0.511
ilan-0.509
 df-0.485
Token of
Token ID286
Feature activation-2.66e-02

Pos logit contributions
ooters+0.785
 Krish+0.695
 Chaser+0.671
 Rib+0.665
 Gutenberg+0.665
Neg logit contributions
zsche-1.309
:,-1.253
athing-1.243
ilan-1.242
 df-1.192
Token,
Token ID11
Feature activation+3.34e-01

Pos logit contributions
ooters+0.213
 Krish+0.195
 Gutenberg+0.190
 Rib+0.189
 Chaser+0.189
Neg logit contributions
zsche-0.250
:,-0.235
athing-0.235
ilan-0.234
 df-0.222
Token wasted
Token ID18359
Feature activation-1.01e+00

Pos logit contributions
ooters+0.226
 Krish+0.206
 Chaser+0.201
 Gutenberg+0.200
 Rib+0.200
Neg logit contributions
zsche-0.261
:,-0.247
athing-0.245
ilan-0.245
 df-0.232
Token no
Token ID645
Feature activation+8.70e-02

Pos logit contributions
zsche+0.935
:,+0.900
athing+0.894
ilan+0.891
 df+0.859
Neg logit contributions
ooters-0.541
 Krish-0.473
 Chaser-0.462
 Gutenberg-0.458
 Rib-0.452
Token time
Token ID640
Feature activation+9.62e-01

Pos logit contributions
ooters+0.064
 Krish+0.060
 Chaser+0.059
 Gutenberg+0.058
 Rib+0.058
Neg logit contributions
zsche-0.063
:,-0.059
athing-0.058
ilan-0.058
 df-0.055
Token in
Token ID287
Feature activation-1.97e-01

Pos logit contributions
ooters+0.584
 Krish+0.530
 Chaser+0.514
 Gutenberg+0.513
 Rib+0.510
Neg logit contributions
zsche-0.819
:,-0.774
ilan-0.769
athing-0.768
 df-0.734
Token expressing
Token ID16621
Feature activation+2.58e+00

Pos logit contributions
zsche+0.149
:,+0.140
ilan+0.139
athing+0.139
 df+0.131
Neg logit contributions
ooters-0.139
 Krish-0.127
 Chaser-0.125
 Gutenberg-0.124
 Rib-0.124
Token its
Token ID663
Feature activation+1.34e+00

Pos logit contributions
ooters+1.713
 Krish+1.598
 Rib+1.550
 Chaser+1.545
 Stephenson+1.541
Neg logit contributions
zsche-2.025
:,-1.888
ilan-1.883
athing-1.874
 df-1.778
Token delight
Token ID10974
Feature activation-5.58e-01

Pos logit contributions
ooters+0.970
 Krish+0.898
 Chaser+0.879
 Rib+0.875
 Gutenberg+0.873
Neg logit contributions
zsche-0.977
:,-0.920
ilan-0.911
athing-0.902
 df-0.857
Tokenafter
Token ID8499
Feature activation+3.40e-01

Pos logit contributions
zsche+0.451
:,+0.431
athing+0.428
ilan+0.426
 df+0.408
Neg logit contributions
ooters-0.362
 Krish-0.326
 Chaser-0.319
 Gutenberg-0.317
 Rib-0.315
Token that
Token ID326
Feature activation+6.40e-01

Pos logit contributions
ooters+0.232
 Krish+0.211
 Chaser+0.207
 Gutenberg+0.206
 Rib+0.205
Neg logit contributions
zsche-0.264
:,-0.250
athing-0.248
ilan-0.248
 df-0.236
Token deal
Token ID1730
Feature activation-9.38e-01

Pos logit contributions
ooters+0.431
 Krish+0.391
 Chaser+0.384
 Gutenberg+0.382
 Rib+0.382
Neg logit contributions
zsche-0.503
:,-0.474
athing-0.474
ilan-0.472
 df-0.449
Token has
Token ID468
Feature activation-2.10e-01

Pos logit contributions
ooters+0.155
 Krish+0.141
 Gutenberg+0.138
 Chaser+0.138
 Rib+0.137
Neg logit contributions
zsche-0.181
:,-0.171
athing-0.171
ilan-0.171
 df-0.162
Token opened
Token ID4721
Feature activation-2.47e-01

Pos logit contributions
zsche+0.164
:,+0.155
athing+0.154
ilan+0.154
 df+0.146
Neg logit contributions
ooters-0.143
 Krish-0.131
 Chaser-0.128
 Gutenberg-0.127
 Rib-0.126
Token the
Token ID262
Feature activation-2.44e-01

Pos logit contributions
zsche+0.196
:,+0.187
athing+0.186
ilan+0.184
 df+0.177
Neg logit contributions
ooters-0.164
 Krish-0.148
 Chaser-0.145
 Gutenberg-0.144
 Rib-0.143
Token door
Token ID3420
Feature activation+1.07e+00

Pos logit contributions
zsche+0.200
:,+0.192
athing+0.190
ilan+0.189
 df+0.182
Neg logit contributions
ooters-0.155
 Krish-0.140
 Chaser-0.137
 Gutenberg-0.136
 Rib-0.135
Token for
Token ID329
Feature activation-1.35e-01

Pos logit contributions
ooters+0.714
 Krish+0.649
 Chaser+0.635
 Gutenberg+0.632
 Rib+0.625
Neg logit contributions
zsche-0.848
:,-0.812
athing-0.801
ilan-0.798
 df-0.765
Token nude
Token ID26349
Feature activation+2.43e+00

Pos logit contributions
zsche+0.113
:,+0.108
athing+0.107
ilan+0.106
 df+0.102
Neg logit contributions
ooters-0.085
 Krish-0.076
 Chaser-0.074
 Gutenberg-0.074
 Rib-0.073
Token and
Token ID290
Feature activation+2.30e-01

Pos logit contributions
ooters+1.365
 Krish+1.218
 Gutenberg+1.191
 Chaser+1.189
 Rib+1.181
Neg logit contributions
zsche-2.165
:,-2.068
athing-2.055
ilan-2.046
 df-1.968
Token sexually
Token ID11363
Feature activation+9.24e-01

Pos logit contributions
ooters+0.152
 Krish+0.138
 Chaser+0.135
 Gutenberg+0.134
 Rib+0.133
Neg logit contributions
zsche-0.183
:,-0.175
athing-0.173
ilan-0.172
 df-0.165
Token explicit
Token ID7952
Feature activation+2.03e+00

Pos logit contributions
ooters+0.414
 Krish+0.356
 Chaser+0.344
 Gutenberg+0.341
 Rib+0.338
Neg logit contributions
zsche-0.932
:,-0.898
athing-0.892
ilan-0.889
 df-0.859
Token photos
Token ID5205
Feature activation+1.69e+00

Pos logit contributions
ooters+1.161
 Krish+1.043
 Gutenberg+1.021
 Rib+1.015
 Chaser+1.012
Neg logit contributions
zsche-1.793
:,-1.702
athing-1.692
ilan-1.687
 df-1.615
Token and
Token ID290
Feature activation+2.58e-01

Pos logit contributions
ooters+1.086
 Krish+0.983
 Chaser+0.960
 Gutenberg+0.959
 Rib+0.953
Neg logit contributions
zsche-1.382
ilan-1.301
athing-1.299
:,-1.294
 df-1.236
Tokenaled
Token ID3021
Feature activation-8.42e-01

Pos logit contributions
zsche+0.532
:,+0.495
athing+0.486
ilan+0.484
 df+0.453
Neg logit contributions
ooters-0.920
 Krish-0.857
 Chaser-0.843
 Gutenberg-0.840
 Rib-0.837
Token felon
Token ID46849
Feature activation+2.57e-01

Pos logit contributions
zsche+0.587
:,+0.557
athing+0.551
ilan+0.549
 df+0.521
Neg logit contributions
ooters-0.641
 Krish-0.587
 Chaser-0.576
 Gutenberg-0.574
 Rib-0.571
Tokenious
Token ID699
Feature activation+4.65e-01

Pos logit contributions
ooters+0.186
 Krish+0.169
 Gutenberg+0.166
 Chaser+0.166
 Rib+0.165
Neg logit contributions
zsche-0.189
:,-0.179
athing-0.176
ilan-0.176
 df-0.168
Token actions
Token ID4028
Feature activation+5.14e-01

Pos logit contributions
ooters+0.332
 Krish+0.303
 Chaser+0.297
 Gutenberg+0.296
 Rib+0.295
Neg logit contributions
zsche-0.346
:,-0.328
athing-0.325
ilan-0.324
 df-0.309
Token rise
Token ID4485
Feature activation-5.27e-01

Pos logit contributions
ooters+0.346
 Krish+0.312
 Chaser+0.304
 Gutenberg+0.304
 Rib+0.302
Neg logit contributions
zsche-0.405
:,-0.384
ilan-0.379
athing-0.379
 df-0.361
Token to
Token ID284
Feature activation+2.62e+00

Pos logit contributions
zsche+0.434
:,+0.416
athing+0.413
ilan+0.410
 df+0.394
Neg logit contributions
ooters-0.335
 Krish-0.299
 Chaser-0.294
 Gutenberg-0.291
 Rib-0.289
Token the
Token ID262
Feature activation+1.32e+00

Pos logit contributions
ooters+1.526
 Krish+1.358
 Gutenberg+1.335
 Chaser+1.319
 Rib+1.315
Neg logit contributions
zsche-2.281
:,-2.177
ilan-2.169
athing-2.150
 df-2.060
Token level
Token ID1241
Feature activation+1.12e+00

Pos logit contributions
ooters+0.632
 Krish+0.556
 Chaser+0.539
 Gutenberg+0.537
 Rib+0.530
Neg logit contributions
zsche-1.300
:,-1.242
ilan-1.241
athing-1.230
 df-1.183
Token of
Token ID286
Feature activation+8.63e-01

Pos logit contributions
ooters+0.762
 Krish+0.692
 Chaser+0.680
 Gutenberg+0.675
 Rib+0.670
Neg logit contributions
zsche-0.867
:,-0.829
athing-0.822
ilan-0.813
 df-0.781
Token treason
Token ID32204
Feature activation-3.47e-02

Pos logit contributions
ooters+0.628
 Krish+0.574
 Gutenberg+0.565
 Chaser+0.563
 Rib+0.560
Neg logit contributions
zsche-0.630
:,-0.597
ilan-0.592
athing-0.589
 df-0.558
Token in
Token ID287
Feature activation+9.76e-01

Pos logit contributions
zsche+0.027
:,+0.025
ilan+0.025
athing+0.025
 df+0.024
Neg logit contributions
ooters-0.024
 Krish-0.022
 Chaser-0.021
 Gutenberg-0.021
 Rib-0.021
Token,
Token ID11
Feature activation+4.01e-02

Pos logit contributions
ooters+0.829
 Krish+0.760
 Gutenberg+0.746
 Chaser+0.738
 Rib+0.736
Neg logit contributions
zsche-0.996
ilan-0.949
athing-0.943
:,-0.940
 df-0.889
Token let
Token ID1309
Feature activation-1.72e-02

Pos logit contributions
ooters+0.025
 Krish+0.023
 Chaser+0.022
 Gutenberg+0.022
 Rib+0.022
Neg logit contributions
zsche-0.033
:,-0.032
athing-0.031
ilan-0.031
 df-0.030
Token me
Token ID502
Feature activation+4.54e-01

Pos logit contributions
zsche+0.014
:,+0.013
athing+0.013
ilan+0.013
 df+0.012
Neg logit contributions
ooters-0.012
 Krish-0.010
 Chaser-0.010
 Gutenberg-0.010
 Rib-0.010
Token offer
Token ID2897
Feature activation+3.92e-01

Pos logit contributions
ooters+0.322
 Krish+0.294
 Chaser+0.286
 Rib+0.285
 Gutenberg+0.284
Neg logit contributions
zsche-0.341
:,-0.321
ilan-0.319
athing-0.319
 df-0.301
Token a
Token ID257
Feature activation+2.91e-01

Pos logit contributions
ooters+0.240
 Krish+0.217
 Chaser+0.212
 Gutenberg+0.210
 Rib+0.210
Neg logit contributions
zsche-0.331
:,-0.316
ilan-0.314
athing-0.313
 df-0.299
Token story
Token ID1621
Feature activation+2.44e+00

Pos logit contributions
ooters+0.193
 Krish+0.177
 Chaser+0.173
 Gutenberg+0.172
 Rib+0.172
Neg logit contributions
zsche-0.229
:,-0.218
ilan-0.217
athing-0.217
 df-0.204
Token about
Token ID546
Feature activation+6.26e-01

Pos logit contributions
ooters+1.543
 Krish+1.390
 Chaser+1.383
 Rib+1.358
 Gutenberg+1.354
Neg logit contributions
zsche-1.999
ilan-1.899
athing-1.882
:,-1.877
 df-1.785
Token my
Token ID616
Feature activation-6.38e-01

Pos logit contributions
ooters+0.396
 Krish+0.363
 Gutenberg+0.355
 Chaser+0.353
 Rib+0.350
Neg logit contributions
zsche-0.512
:,-0.489
athing-0.486
ilan-0.485
 df-0.461
Token experience
Token ID1998
Feature activation-6.03e-01

Pos logit contributions
zsche+0.477
:,+0.454
athing+0.450
ilan+0.448
 df+0.426
Neg logit contributions
ooters-0.452
 Krish-0.414
 Gutenberg-0.404
 Chaser-0.404
 Rib-0.401
Token with
Token ID351
Feature activation-3.26e-01

Pos logit contributions
zsche+0.470
:,+0.445
athing+0.442
ilan+0.441
 df+0.420
Neg logit contributions
ooters-0.410
 Krish-0.373
 Chaser-0.365
 Gutenberg-0.365
 Rib-0.362
Token tape
Token ID9154
Feature activation+3.20e-01

Pos logit contributions
zsche+0.257
:,+0.245
athing+0.243
ilan+0.242
 df+0.230
Neg logit contributions
ooters-0.217
 Krish-0.199
 Gutenberg-0.195
 Chaser-0.194
 Rib-0.192
Token.
Token ID13
Feature activation-6.16e-01

Pos logit contributions
ooters+0.365
 Krish+0.333
 Chaser+0.325
 Gutenberg+0.323
 Rib+0.322
Neg logit contributions
zsche-0.424
:,-0.400
athing-0.399
ilan-0.398
 df-0.379
Token
Token ID198
Feature activation+2.63e-01

Pos logit contributions
zsche+0.520
:,+0.496
athing+0.493
ilan+0.491
 df+0.471
Neg logit contributions
ooters-0.378
 Krish-0.340
 Chaser-0.333
 Gutenberg-0.330
 Rib-0.329
Token
Token ID198
Feature activation+2.19e-01

Pos logit contributions
ooters+0.161
 Krish+0.145
 Chaser+0.141
 Rib+0.140
 Gutenberg+0.140
Neg logit contributions
zsche-0.224
:,-0.210
athing-0.210
ilan-0.210
 df-0.200
TokenNever
Token ID12295
Feature activation+8.64e-01

Pos logit contributions
ooters+0.168
 Krish+0.154
 Chaser+0.152
 Gutenberg+0.150
 Rib+0.150
Neg logit contributions
zsche-0.153
:,-0.143
athing-0.142
ilan-0.141
 df-0.134
Token mind
Token ID2000
Feature activation+1.31e+00

Pos logit contributions
ooters+0.644
 Krish+0.592
 Chaser+0.579
 Rib+0.578
 Gutenberg+0.575
Neg logit contributions
zsche-0.615
:,-0.578
ilan-0.573
athing-0.569
 df-0.543
Token the
Token ID262
Feature activation+1.18e+00

Pos logit contributions
ooters+0.774
 Krish+0.699
 Chaser+0.680
 Rib+0.678
 Gutenberg+0.674
Neg logit contributions
zsche-1.132
ilan-1.072
athing-1.071
:,-1.069
 df-1.020
Token fact
Token ID1109
Feature activation+4.95e-01

Pos logit contributions
ooters+0.685
 Krish+0.619
 Chaser+0.609
 Rib+0.601
 Gutenberg+0.598
Neg logit contributions
zsche-1.027
:,-0.977
ilan-0.974
athing-0.970
 df-0.924
Token that
Token ID326
Feature activation+7.30e-01

Pos logit contributions
ooters+0.301
 Krish+0.270
 Chaser+0.264
 Gutenberg+0.262
 Rib+0.260
Neg logit contributions
zsche-0.420
:,-0.402
athing-0.399
ilan-0.397
 df-0.381
Token he
Token ID339
Feature activation+3.44e-01

Pos logit contributions
ooters+0.461
 Krish+0.420
 Chaser+0.411
 Rib+0.408
 Gutenberg+0.405
Neg logit contributions
zsche-0.604
athing-0.571
ilan-0.570
:,-0.570
 df-0.542
Token
Token ID447
Feature activation-4.24e-01

Pos logit contributions
ooters+0.240
 Krish+0.217
 Chaser+0.213
 Rib+0.211
 Gutenberg+0.210
Neg logit contributions
zsche-0.262
:,-0.248
ilan-0.246
athing-0.244
 df-0.233
Token
Token ID247
Feature activation-9.73e-02

Pos logit contributions
zsche+0.245
:,+0.227
athing+0.225
ilan+0.224
 df+0.210
Neg logit contributions
ooters-0.374
 Krish-0.348
 Chaser-0.343
 Gutenberg-0.341
 Rib-0.340
Tokenitz
Token ID4224
Feature activation+2.90e-01

Pos logit contributions
zsche+0.440
:,+0.423
athing+0.421
ilan+0.420
 df+0.397
Neg logit contributions
ooters-0.758
 Krish-0.721
 Gutenberg-0.709
 Chaser-0.702
 Pru-0.698
Tokenki
Token ID4106
Feature activation-6.48e-01

Pos logit contributions
ooters+0.157
 Krish+0.139
 Chaser+0.135
 Gutenberg+0.134
 Rib+0.133
Neg logit contributions
zsche-0.265
:,-0.255
athing-0.253
ilan-0.252
 df-0.243
Token,
Token ID11
Feature activation-1.13e-01

Pos logit contributions
zsche+0.479
:,+0.460
athing+0.453
ilan+0.451
 df+0.432
Neg logit contributions
ooters-0.462
 Krish-0.420
 Gutenberg-0.412
 Chaser-0.412
 Rib-0.405
Token Mont
Token ID5575
Feature activation+1.18e+00

Pos logit contributions
zsche+0.089
:,+0.085
athing+0.084
ilan+0.084
 df+0.081
Neg logit contributions
ooters-0.075
 Krish-0.068
 Chaser-0.067
 Gutenberg-0.066
 Rib-0.066
Tokena
Token ID64
Feature activation-9.11e-01

Pos logit contributions
ooters+0.946
 Krish+0.871
 Chaser+0.856
 Gutenberg+0.854
 Rib+0.848
Neg logit contributions
zsche-0.768
:,-0.726
athing-0.719
ilan-0.715
 df-0.676
Token Ellis
Token ID21299
Feature activation+7.69e-01

Pos logit contributions
zsche+0.711
:,+0.683
athing+0.677
ilan+0.670
 df+0.645
Neg logit contributions
ooters-0.612
 Krish-0.549
 Chaser-0.540
 Gutenberg-0.538
 Rib-0.530
Token,
Token ID11
Feature activation-5.15e-01

Pos logit contributions
ooters+0.536
 Krish+0.487
 Chaser+0.477
 Gutenberg+0.474
 Rib+0.471
Neg logit contributions
zsche-0.585
:,-0.559
athing-0.553
ilan-0.551
 df-0.526
Token Devin
Token ID37081
Feature activation+5.56e-01

Pos logit contributions
zsche+0.408
:,+0.390
athing+0.386
ilan+0.385
 df+0.369
Neg logit contributions
ooters-0.343
 Krish-0.309
 Chaser-0.303
 Gutenberg-0.301
 Rib-0.299
Token Harris
Token ID10026
Feature activation+4.75e-01

Pos logit contributions
ooters+0.289
 Krish+0.254
 Chaser+0.247
 Gutenberg+0.245
 Rib+0.243
Neg logit contributions
zsche-0.521
:,-0.502
athing-0.498
ilan-0.496
 df-0.478
Token,
Token ID11
Feature activation+4.67e-02

Pos logit contributions
ooters+0.335
 Krish+0.305
 Chaser+0.298
 Gutenberg+0.297
 Rib+0.294
Neg logit contributions
zsche-0.357
:,-0.342
athing-0.337
ilan-0.336
 df-0.322
Token Raymond
Token ID24238
Feature activation+2.33e-01

Pos logit contributions
ooters+0.031
 Krish+0.028
 Chaser+0.027
 Gutenberg+0.027
 Rib+0.027
Neg logit contributions
zsche-0.037
:,-0.035
athing-0.035
ilan-0.035
 df-0.033
Token Secretary
Token ID4986
Feature activation-5.68e-01

Pos logit contributions
zsche+0.251
:,+0.235
ilan+0.234
athing+0.234
 df+0.220
Neg logit contributions
ooters-0.281
 Krish-0.259
 Chaser-0.255
 Gutenberg-0.253
 Rib-0.253
Token Sk
Token ID3661
Feature activation+7.11e-01

Pos logit contributions
zsche+0.457
:,+0.435
athing+0.431
ilan+0.430
 df+0.411
Neg logit contributions
ooters-0.371
 Krish-0.337
 Chaser-0.329
 Gutenberg-0.327
 Rib-0.326
Tokenvar
Token ID7785
Feature activation-2.34e-01

Pos logit contributions
ooters+0.515
 Krish+0.468
 Chaser+0.457
 Gutenberg+0.454
 Rib+0.452
Neg logit contributions
zsche-0.525
:,-0.495
athing-0.487
ilan-0.484
 df-0.469
Tokenla
Token ID5031
Feature activation-7.47e-01

Pos logit contributions
zsche+0.174
:,+0.165
athing+0.163
ilan+0.163
 df+0.155
Neg logit contributions
ooters-0.167
 Krish-0.153
 Chaser-0.149
 Gutenberg-0.149
 Rib-0.148
Token made
Token ID925
Feature activation+6.91e-01

Pos logit contributions
zsche+0.606
:,+0.579
athing+0.574
ilan+0.572
 df+0.548
Neg logit contributions
ooters-0.483
 Krish-0.436
 Chaser-0.426
 Gutenberg-0.424
 Rib-0.421
Token it
Token ID340
Feature activation+2.74e-01

Pos logit contributions
ooters+0.415
 Krish+0.376
 Chaser+0.367
 Gutenberg+0.365
 Rib+0.364
Neg logit contributions
zsche-0.589
:,-0.562
athing-0.558
ilan-0.558
 df-0.533
Token clear
Token ID1598
Feature activation+2.01e+00

Pos logit contributions
ooters+0.140
 Krish+0.122
 Chaser+0.119
 Gutenberg+0.118
 Rib+0.117
Neg logit contributions
zsche-0.260
:,-0.250
athing-0.248
ilan-0.247
 df-0.238
Token that
Token ID326
Feature activation-5.18e-01

Pos logit contributions
ooters+1.169
 Krish+1.060
 Chaser+1.026
 Gutenberg+1.020
 Rib+1.020
Neg logit contributions
zsche-1.758
ilan-1.670
athing-1.663
:,-1.654
 df-1.581
Token Good
Token ID4599
Feature activation-9.37e-01

Pos logit contributions
zsche+0.423
:,+0.402
athing+0.399
ilan+0.399
 df+0.380
Neg logit contributions
ooters-0.332
 Krish-0.302
 Chaser-0.294
 Gutenberg-0.293
 Rib-0.291
Token
Token ID447
Feature activation+2.96e-01

Pos logit contributions
zsche+0.722
:,+0.688
athing+0.682
ilan+0.679
 df+0.649
Neg logit contributions
ooters-0.643
 Krish-0.584
 Chaser-0.572
 Gutenberg-0.569
 Rib-0.565
Token
Token ID247
Feature activation-1.17e-01

Pos logit contributions
ooters+0.254
 Krish+0.236
 Chaser+0.232
 Rib+0.231
 Gutenberg+0.231
Neg logit contributions
zsche-0.178
athing-0.163
:,-0.163
ilan-0.162
 df-0.152
Token the
Token ID262
Feature activation+8.63e-01

Pos logit contributions
ooters+0.336
 Krish+0.305
 Chaser+0.298
 Rib+0.298
 Gutenberg+0.295
Neg logit contributions
zsche-0.417
:,-0.394
athing-0.392
ilan-0.391
 df-0.373
Token official
Token ID1743
Feature activation+9.79e-01

Pos logit contributions
ooters+0.573
 Krish+0.521
 Chaser+0.514
 Observer+0.513
 Rib+0.513
Neg logit contributions
zsche-0.688
:,-0.651
athing-0.644
ilan-0.644
 df-0.610
Token said
Token ID531
Feature activation+9.69e-01

Pos logit contributions
ooters+0.610
 Krish+0.553
 Rib+0.544
 Gutenberg+0.538
 Chaser+0.537
Neg logit contributions
zsche-0.820
:,-0.777
athing-0.767
ilan-0.766
 df-0.731
Token,
Token ID11
Feature activation+1.17e+00

Pos logit contributions
ooters+0.614
 Krish+0.556
 Chaser+0.541
 Rib+0.540
 Gutenberg+0.539
Neg logit contributions
zsche-0.797
:,-0.757
athing-0.754
ilan-0.751
 df-0.719
Token speaking
Token ID5486
Feature activation+2.39e-01

Pos logit contributions
ooters+0.775
 Krish+0.715
 Rib+0.701
 Chaser+0.687
 Gutenberg+0.684
Neg logit contributions
zsche-0.925
:,-0.871
ilan-0.857
athing-0.857
 df-0.818
Token to
Token ID284
Feature activation+7.36e-01

Pos logit contributions
ooters+0.162
 Krish+0.148
 Chaser+0.144
 Gutenberg+0.144
 Rib+0.143
Neg logit contributions
zsche-0.186
:,-0.177
athing-0.176
ilan-0.175
 df-0.167
Token Ha
Token ID9398
Feature activation-1.85e-01

Pos logit contributions
ooters+0.467
 Krish+0.423
 Chaser+0.412
 Rib+0.410
 Gutenberg+0.409
Neg logit contributions
zsche-0.608
:,-0.578
athing-0.574
ilan-0.571
 df-0.546
Tokenaretz
Token ID48338
Feature activation+4.07e-01

Pos logit contributions
zsche+0.132
:,+0.127
athing+0.126
ilan+0.125
 df+0.119
Neg logit contributions
ooters-0.137
 Krish-0.125
 Chaser-0.123
 Gutenberg-0.122
 Rib-0.121
Token on
Token ID319
Feature activation+3.77e-01

Pos logit contributions
ooters+0.275
 Krish+0.250
 Rib+0.245
 Chaser+0.244
 Gutenberg+0.243
Neg logit contributions
zsche-0.319
:,-0.301
athing-0.300
ilan-0.298
 df-0.285
Token condition
Token ID4006
Feature activation+1.78e+00

Pos logit contributions
ooters+0.268
 Krish+0.242
 Chaser+0.238
 Gutenberg+0.237
 Rib+0.234
Neg logit contributions
zsche-0.281
:,-0.268
athing-0.266
ilan-0.265
 df-0.253
Token of
Token ID286
Feature activation+7.12e-01

Pos logit contributions
ooters+1.173
 Krish+1.060
 Chaser+1.037
 Gutenberg+1.029
 Rib+1.026
Neg logit contributions
zsche-1.421
:,-1.353
athing-1.346
ilan-1.341
 df-1.278
Token.
Token ID13
Feature activation+4.97e-01

Pos logit contributions
zsche+0.677
:,+0.645
athing+0.639
ilan+0.637
 df+0.608
Neg logit contributions
ooters-0.606
 Krish-0.551
 Chaser-0.539
 Gutenberg-0.537
 Rib-0.533
Token The
Token ID383
Feature activation-1.89e-01

Pos logit contributions
ooters+0.294
 Krish+0.269
 Chaser+0.263
 Rib+0.260
 Gutenberg+0.259
Neg logit contributions
zsche-0.432
:,-0.409
ilan-0.407
athing-0.407
 df-0.388
Token combo
Token ID14831
Feature activation-8.35e-01

Pos logit contributions
zsche+0.147
:,+0.140
ilan+0.138
athing+0.138
 df+0.131
Neg logit contributions
ooters-0.129
 Krish-0.119
 Chaser-0.116
 Gutenberg-0.116
 Rib-0.115
Token had
Token ID550
Feature activation+9.10e-01

Pos logit contributions
zsche+0.637
:,+0.607
athing+0.601
ilan+0.599
 df+0.571
Neg logit contributions
ooters-0.580
 Krish-0.528
 Chaser-0.517
 Gutenberg-0.514
 Rib-0.511
Token never
Token ID1239
Feature activation+2.83e-01

Pos logit contributions
ooters+0.584
 Krish+0.538
 Chaser+0.523
 Gutenberg+0.521
 Rib+0.517
Neg logit contributions
zsche-0.737
:,-0.702
athing-0.698
ilan-0.697
 df-0.661
Token been
Token ID587
Feature activation+1.82e+00

Pos logit contributions
ooters+0.190
 Krish+0.174
 Chaser+0.169
 Gutenberg+0.168
 Rib+0.168
Neg logit contributions
zsche-0.222
:,-0.210
ilan-0.209
athing-0.208
 df-0.198
Token done
Token ID1760
Feature activation+1.59e-01

Pos logit contributions
ooters+1.320
 Krish+1.222
 Chaser+1.192
 Gutenberg+1.186
 Rib+1.186
Neg logit contributions
zsche-1.338
:,-1.256
ilan-1.244
athing-1.243
 df-1.176
Token before
Token ID878
Feature activation-2.25e-01

Pos logit contributions
ooters+0.111
 Krish+0.102
 Chaser+0.099
 Gutenberg+0.099
 Rib+0.099
Neg logit contributions
zsche-0.120
:,-0.114
ilan-0.113
athing-0.113
 df-0.107
Token Mitchell
Token ID14526
Feature activation+1.55e+00

Pos logit contributions
zsche+0.177
:,+0.168
athing+0.167
ilan+0.167
 df+0.159
Neg logit contributions
ooters-0.151
 Krish-0.137
 Chaser-0.134
 Gutenberg-0.134
 Rib-0.133
Token applied
Token ID5625
Feature activation-1.81e+00

Pos logit contributions
ooters+1.003
 Krish+0.924
 Rib+0.897
 Chaser+0.893
 Stephenson+0.888
Neg logit contributions
zsche-1.253
:,-1.181
ilan-1.171
athing-1.168
 df-1.112
Token it
Token ID340
Feature activation+5.77e-01

Pos logit contributions
zsche+1.407
:,+1.345
athing+1.332
ilan+1.325
 df+1.270
Neg logit contributions
ooters-1.237
 Krish-1.110
 Chaser-1.086
 Gutenberg-1.085
 Rib-1.075
Token massive
Token ID4858
Feature activation+5.23e-01

Pos logit contributions
zsche+0.755
:,+0.720
athing+0.713
ilan+0.710
 df+0.679
Neg logit contributions
ooters-0.677
 Krish-0.612
 Chaser-0.601
 Gutenberg-0.595
 Rib-0.593
Token time
Token ID640
Feature activation+4.09e-01

Pos logit contributions
ooters+0.359
 Krish+0.330
 Gutenberg+0.325
 Chaser+0.321
 Rib+0.320
Neg logit contributions
zsche-0.402
:,-0.381
athing-0.379
ilan-0.378
 df-0.360
Token capsule
Token ID27855
Feature activation+2.75e-01

Pos logit contributions
ooters+0.248
 Krish+0.224
 Gutenberg+0.219
 Chaser+0.218
 Rib+0.216
Neg logit contributions
zsche-0.346
athing-0.331
:,-0.330
ilan-0.328
 df-0.314
Token in
Token ID287
Feature activation+8.03e-01

Pos logit contributions
ooters+0.182
 Krish+0.167
 Gutenberg+0.163
 Chaser+0.163
 Rib+0.162
Neg logit contributions
zsche-0.218
ilan-0.205
athing-0.205
:,-0.204
 df-0.195
Token which
Token ID543
Feature activation-1.61e+00

Pos logit contributions
ooters+0.487
 Krish+0.440
 Gutenberg+0.428
 Chaser+0.428
 Rib+0.425
Neg logit contributions
zsche-0.683
:,-0.652
athing-0.647
ilan-0.644
 df-0.617
Token,
Token ID11
Feature activation-1.12e+00

Pos logit contributions
zsche+1.292
:,+1.236
athing+1.223
ilan+1.219
 df+1.169
Neg logit contributions
ooters-1.048
 Krish-0.940
 Chaser-0.925
 Gutenberg-0.913
 Rib-0.910
Token depending
Token ID6906
Feature activation-1.51e+00

Pos logit contributions
zsche+0.901
:,+0.860
athing+0.853
ilan+0.850
 df+0.813
Neg logit contributions
ooters-0.725
 Krish-0.656
 Chaser-0.641
 Gutenberg-0.638
 Rib-0.634
Token on
Token ID319
Feature activation+8.32e-01

Pos logit contributions
zsche+1.087
:,+1.040
athing+1.022
ilan+1.018
 df+0.974
Neg logit contributions
ooters-1.107
 Krish-1.006
 Chaser-0.995
 Gutenberg-0.990
 Rib-0.978
Token the
Token ID262
Feature activation-7.97e-02

Pos logit contributions
ooters+0.519
 Krish+0.469
 Gutenberg+0.457
 Chaser+0.456
 Rib+0.452
Neg logit contributions
zsche-0.692
:,-0.663
athing-0.657
ilan-0.656
 df-0.626
Token neighborhood
Token ID6232
Feature activation-3.02e-01

Pos logit contributions
zsche+0.061
:,+0.058
athing+0.057
ilan+0.057
 df+0.054
Neg logit contributions
ooters-0.055
 Krish-0.051
 Gutenberg-0.050
 Chaser-0.050
 Rib-0.049
Token,
Token ID11
Feature activation+7.51e-02

Pos logit contributions
zsche+0.240
:,+0.229
ilan+0.227
athing+0.227
 df+0.216
Neg logit contributions
ooters-0.200
 Krish-0.182
 Chaser-0.177
 Gutenberg-0.177
 Rib-0.176
Token after
Token ID706
Feature activation+6.85e-01

Pos logit contributions
zsche+0.379
:,+0.359
ilan+0.355
athing+0.355
 df+0.336
Neg logit contributions
ooters-0.361
 Krish-0.329
 Gutenberg-0.324
 Chaser-0.322
 Rib-0.321
Token-
Token ID12
Feature activation-2.58e-01

Pos logit contributions
ooters+0.469
 Krish+0.425
 Gutenberg+0.420
 Chaser+0.419
 Rib+0.419
Neg logit contributions
zsche-0.532
:,-0.498
ilan-0.495
athing-0.494
 df-0.470
Tokentax
Token ID19290
Feature activation+1.19e-01

Pos logit contributions
zsche+0.171
:,+0.165
athing+0.163
ilan+0.161
 df+0.152
Neg logit contributions
ooters-0.204
 Krish-0.188
 Chaser-0.185
 Gutenberg-0.183
 Rib-0.182
Token income
Token ID3739
Feature activation-9.14e-02

Pos logit contributions
ooters+0.083
 Krish+0.076
 Gutenberg+0.075
 Chaser+0.074
 Rib+0.074
Neg logit contributions
zsche-0.090
:,-0.085
athing-0.085
ilan-0.085
 df-0.080
Token would
Token ID561
Feature activation-3.23e-01

Pos logit contributions
zsche+0.066
:,+0.062
ilan+0.061
athing+0.061
 df+0.058
Neg logit contributions
ooters-0.068
 Krish-0.062
 Chaser-0.061
 Gutenberg-0.061
 Rib-0.060
Token total
Token ID2472
Feature activation-4.50e-01

Pos logit contributions
zsche+0.263
:,+0.250
athing+0.248
ilan+0.248
 df+0.236
Neg logit contributions
ooters-0.209
 Krish-0.188
 Chaser-0.184
 Gutenberg-0.183
 Rib-0.182
Token $
Token ID720
Feature activation-1.47e-02

Pos logit contributions
zsche+0.438
:,+0.421
athing+0.419
ilan+0.418
 df+0.402
Neg logit contributions
ooters-0.217
 Krish-0.189
 Chaser-0.184
 Gutenberg-0.182
 Rib-0.181
Token150
Token ID8628
Feature activation+4.13e-01

Pos logit contributions
zsche+0.010
:,+0.010
athing+0.010
ilan+0.009
 df+0.009
Neg logit contributions
ooters-0.011
 Krish-0.010
 Chaser-0.010
 Gutenberg-0.010
 Rib-0.010
Token million
Token ID1510
Feature activation+9.79e-01

Pos logit contributions
ooters+0.306
 Krish+0.279
 Chaser+0.274
 Gutenberg+0.273
 Rib+0.271
Neg logit contributions
zsche-0.296
:,-0.281
athing-0.278
ilan-0.277
 df-0.264
Token.
Token ID13
Feature activation+7.67e-01

Pos logit contributions
ooters+0.644
 Krish+0.583
 Gutenberg+0.572
 Chaser+0.571
 Rib+0.569
Neg logit contributions
zsche-0.781
athing-0.734
:,-0.734
ilan-0.730
 df-0.696
Token
Token ID198
Feature activation+7.08e-01

Pos logit contributions
ooters+0.464
 Krish+0.423
 Chaser+0.413
 Rib+0.413
 Gutenberg+0.413
Neg logit contributions
zsche-0.655
:,-0.617
athing-0.617
ilan-0.615
heastern-0.583
Token second
Token ID1218
Feature activation+1.47e+00

Pos logit contributions
ooters+0.020
 Krish+0.018
 Chaser+0.018
 Gutenberg+0.017
 Rib+0.017
Neg logit contributions
zsche-0.030
:,-0.028
athing-0.028
ilan-0.028
 df-0.027
Token innings
Token ID16267
Feature activation+7.51e-01

Pos logit contributions
ooters+0.837
 Krish+0.757
 Chaser+0.737
 Gutenberg+0.735
 Rib+0.730
Neg logit contributions
zsche-1.309
athing-1.237
:,-1.234
ilan-1.230
 df-1.172
Token.
Token ID13
Feature activation+5.22e-01

Pos logit contributions
ooters+0.501
 Krish+0.456
 Chaser+0.445
 Gutenberg+0.442
 Rib+0.440
Neg logit contributions
zsche-0.595
:,-0.564
athing-0.561
ilan-0.558
 df-0.533
Token
Token ID198
Feature activation+4.39e-01

Pos logit contributions
ooters+0.311
 Krish+0.285
 Chaser+0.276
 Rib+0.275
 Gutenberg+0.273
Neg logit contributions
zsche-0.453
:,-0.427
athing-0.426
ilan-0.425
 df-0.407
Token
Token ID198
Feature activation+3.95e-01

Pos logit contributions
ooters+0.267
 Krish+0.241
 Chaser+0.234
 Rib+0.233
 Gutenberg+0.233
Neg logit contributions
zsche-0.375
:,-0.353
athing-0.352
ilan-0.352
 df-0.337
Token"
Token ID1
Feature activation-9.13e-02

Pos logit contributions
ooters+0.260
 Krish+0.235
 Chaser+0.231
 Gutenberg+0.230
 Rib+0.228
Neg logit contributions
zsche-0.314
:,-0.301
athing-0.298
ilan-0.297
 df-0.284
TokenWhen
Token ID2215
Feature activation+4.28e-01

Pos logit contributions
zsche+0.063
:,+0.059
athing+0.059
ilan+0.058
 df+0.055
Neg logit contributions
ooters-0.071
 Krish-0.065
 Chaser-0.064
 Gutenberg-0.063
Barn-0.063
Token I
Token ID314
Feature activation+6.08e-01

Pos logit contributions
ooters+0.243
 Krish+0.219
 Chaser+0.212
 Gutenberg+0.211
 Rib+0.211
Neg logit contributions
zsche-0.382
:,-0.364
athing-0.362
ilan-0.361
 df-0.346
Token look
Token ID804
Feature activation-6.56e-01

Pos logit contributions
ooters+0.395
 Krish+0.361
 Chaser+0.347
 Gutenberg+0.345
 Rib+0.345
Neg logit contributions
zsche-0.499
:,-0.466
ilan-0.460
athing-0.459
Internal-0.438
Token back
Token ID736
Feature activation+2.25e-01

Pos logit contributions
zsche+0.541
:,+0.517
athing+0.512
ilan+0.511
 df+0.489
Neg logit contributions
ooters-0.416
 Krish-0.375
 Chaser-0.367
 Gutenberg-0.364
 Rib-0.362
Token on
Token ID319
Feature activation-6.94e-01

Pos logit contributions
ooters+0.144
 Krish+0.131
 Chaser+0.128
 Gutenberg+0.127
 Rib+0.127
Neg logit contributions
zsche-0.185
:,-0.175
athing-0.174
ilan-0.174
 df-0.166
Token count
Token ID954
Feature activation-1.55e+00

Pos logit contributions
ooters+0.184
 Krish+0.168
 Chaser+0.164
 Gutenberg+0.163
 Rib+0.163
Neg logit contributions
zsche-0.182
athing-0.171
ilan-0.171
:,-0.171
 df-0.159
Token it
Token ID340
Feature activation-6.84e-01

Pos logit contributions
zsche+1.265
:,+1.212
athing+1.199
ilan+1.194
 df+1.147
Neg logit contributions
ooters-0.999
 Krish-0.901
 Chaser-0.880
 Gutenberg-0.876
 Rib-0.869
Token.
Token ID13
Feature activation+8.02e-02

Pos logit contributions
zsche+0.509
:,+0.483
athing+0.479
ilan+0.477
 df+0.455
Neg logit contributions
ooters-0.489
 Krish-0.445
 Chaser-0.436
 Gutenberg-0.434
 Rib-0.432
Token I
Token ID314
Feature activation+3.55e-01

Pos logit contributions
ooters+0.052
 Krish+0.047
 Chaser+0.046
 Rib+0.046
 Gutenberg+0.046
Neg logit contributions
zsche-0.065
:,-0.062
athing-0.062
ilan-0.062
 df-0.059
Token felt
Token ID2936
Feature activation+5.14e-01

Pos logit contributions
ooters+0.239
 Krish+0.219
 Chaser+0.213
 Rib+0.211
 Gutenberg+0.211
Neg logit contributions
zsche-0.278
:,-0.260
ilan-0.259
athing-0.258
 df-0.245
Token bad
Token ID2089
Feature activation-7.11e-01

Pos logit contributions
ooters+0.347
 Krish+0.316
 Chaser+0.311
 Rib+0.307
 Gutenberg+0.307
Neg logit contributions
zsche-0.403
:,-0.380
athing-0.379
ilan-0.376
 df-0.357
Token rejecting
Token ID26734
Feature activation-2.20e+00

Pos logit contributions
zsche+0.561
:,+0.531
athing+0.527
ilan+0.527
 df+0.502
Neg logit contributions
ooters-0.477
 Krish-0.433
 Chaser-0.424
 Gutenberg-0.421
 Rib-0.419
Token all
Token ID477
Feature activation-7.50e-01

Pos logit contributions
zsche+1.782
:,+1.704
athing+1.688
ilan+1.682
 df+1.611
Neg logit contributions
ooters-1.426
 Krish-1.285
 Chaser-1.257
 Gutenberg-1.250
 Rib-1.242
Token the
Token ID262
Feature activation-1.02e+00

Pos logit contributions
zsche+0.606
:,+0.577
athing+0.573
ilan+0.570
 df+0.545
Neg logit contributions
ooters-0.489
 Krish-0.441
 Chaser-0.433
 Gutenberg-0.430
 Rib-0.428
Token Ny
Token ID17735
Feature activation+1.33e+00

Pos logit contributions
zsche+0.821
:,+0.782
athing+0.775
ilan+0.772
 df+0.737
Neg logit contributions
ooters-0.669
 Krish-0.605
 Chaser-0.593
 Gutenberg-0.589
 Rib-0.587
Tokenx
Token ID87
Feature activation+9.97e-02

Pos logit contributions
ooters+0.816
 Krish+0.727
 Chaser+0.711
 Gutenberg+0.702
 Rib+0.702
Neg logit contributions
zsche-1.119
:,-1.076
athing-1.063
ilan-1.057
 df-1.023
Token and
Token ID290
Feature activation-3.94e-01

Pos logit contributions
zsche+0.226
:,+0.215
athing+0.213
ilan+0.213
 df+0.202
Neg logit contributions
ooters-0.203
 Krish-0.184
 Chaser-0.180
 Gutenberg-0.180
 Rib-0.179
Token the
Token ID262
Feature activation-1.21e-01

Pos logit contributions
zsche+0.319
:,+0.305
athing+0.302
ilan+0.301
 df+0.288
Neg logit contributions
ooters-0.256
 Krish-0.231
 Gutenberg-0.226
 Chaser-0.226
 Rib-0.224
Token government
Token ID1230
Feature activation-6.37e-01

Pos logit contributions
zsche+0.083
:,+0.078
athing+0.077
ilan+0.077
 df+0.072
Neg logit contributions
ooters-0.093
 Krish-0.086
 Chaser-0.084
 Gutenberg-0.084
 Rib-0.084
Token
Token ID447
Feature activation-2.65e-01

Pos logit contributions
zsche+0.464
:,+0.440
athing+0.436
ilan+0.435
 df+0.413
Neg logit contributions
ooters-0.466
 Krish-0.426
 Chaser-0.417
 Gutenberg-0.416
 Rib-0.413
Token
Token ID247
Feature activation-8.17e-02

Pos logit contributions
zsche+0.156
:,+0.145
athing+0.144
ilan+0.143
 df+0.134
Neg logit contributions
ooters-0.230
 Krish-0.214
 Chaser-0.210
 Gutenberg-0.209
 Rib-0.209
Tokens
Token ID82
Feature activation-1.84e-01

Pos logit contributions
zsche+0.085
:,+0.082
athing+0.081
ilan+0.081
 df+0.079
Neg logit contributions
ooters-0.034
 Krish-0.029
 Chaser-0.028
 Gutenberg-0.028
 Rib-0.027
Token finances
Token ID20903
Feature activation-1.26e+00

Pos logit contributions
zsche+0.133
:,+0.126
athing+0.124
ilan+0.124
 df+0.118
Neg logit contributions
ooters-0.135
 Krish-0.124
 Gutenberg-0.121
 Chaser-0.121
 Rib-0.121
Token collapse
Token ID9807
Feature activation-4.04e-01

Pos logit contributions
zsche+0.886
:,+0.840
athing+0.832
ilan+0.828
 df+0.787
Neg logit contributions
ooters-0.946
 Krish-0.866
 Chaser-0.851
 Gutenberg-0.846
 Rib-0.842
Token.
Token ID13
Feature activation-2.99e-01

Pos logit contributions
zsche+0.306
:,+0.292
athing+0.289
ilan+0.288
 df+0.275
Neg logit contributions
ooters-0.282
 Krish-0.257
 Chaser-0.251
 Gutenberg-0.250
 Rib-0.249
Token
Token ID198
Feature activation+3.90e-01

Pos logit contributions
zsche+0.245
:,+0.233
athing+0.232
ilan+0.231
 df+0.220
Neg logit contributions
ooters-0.191
 Krish-0.173
 Chaser-0.169
 Gutenberg-0.168
 Rib-0.167
Token
Token ID198
Feature activation+2.62e-01

Pos logit contributions
ooters+0.238
 Krish+0.214
 Chaser+0.208
 Rib+0.207
 Gutenberg+0.207
Neg logit contributions
zsche-0.334
:,-0.314
athing-0.313
ilan-0.313
 df-0.299
Token such
Token ID884
Feature activation+2.52e-01

Pos logit contributions
ooters+0.495
 Krish+0.454
 Gutenberg+0.444
 Chaser+0.442
 Rib+0.441
Neg logit contributions
zsche-0.616
:,-0.583
ilan-0.581
athing-0.578
 df-0.550
Token an
Token ID281
Feature activation-6.43e-01

Pos logit contributions
ooters+0.158
 Krish+0.143
 Chaser+0.140
 Gutenberg+0.139
 Rib+0.139
Neg logit contributions
zsche-0.209
:,-0.200
athing-0.197
ilan-0.197
 df-0.189
Token approach
Token ID3164
Feature activation+8.13e-02

Pos logit contributions
zsche+0.508
:,+0.485
athing+0.481
ilan+0.479
 df+0.458
Neg logit contributions
ooters-0.429
 Krish-0.388
 Chaser-0.380
 Gutenberg-0.377
 Rib-0.375
Token alien
Token ID8756
Feature activation-3.07e-01

Pos logit contributions
ooters+0.059
 Krish+0.054
 Chaser+0.053
 Gutenberg+0.053
 Rib+0.053
Neg logit contributions
zsche-0.059
:,-0.056
ilan-0.055
athing-0.055
 df-0.052
Tokenates
Token ID689
Feature activation+1.41e-01

Pos logit contributions
zsche+0.205
:,+0.194
ilan+0.191
athing+0.191
 df+0.181
Neg logit contributions
ooters-0.243
 Krish-0.224
 Chaser-0.219
 Rib-0.219
 Gutenberg-0.219
Token young
Token ID1862
Feature activation-2.56e+00

Pos logit contributions
ooters+0.097
 Krish+0.090
 Chaser+0.088
 Gutenberg+0.088
 Rib+0.087
Neg logit contributions
zsche-0.108
:,-0.102
ilan-0.102
athing-0.101
 df-0.096
Token voters
Token ID4446
Feature activation+8.46e-02

Pos logit contributions
zsche+2.040
:,+1.959
athing+1.955
ilan+1.925
 df+1.853
Neg logit contributions
ooters-1.677
 Krish-1.499
 Chaser-1.477
 Gutenberg-1.459
 Rib-1.446
Token,
Token ID11
Feature activation-7.79e-01

Pos logit contributions
ooters+0.057
 Krish+0.052
 Chaser+0.050
 Rib+0.050
 Gutenberg+0.050
Neg logit contributions
zsche-0.067
:,-0.063
ilan-0.063
athing-0.062
 df-0.060
Token says
Token ID1139
Feature activation-3.87e-02

Pos logit contributions
zsche+0.633
:,+0.601
ilan+0.595
athing+0.594
 df+0.566
Neg logit contributions
ooters-0.504
 Krish-0.462
 Chaser-0.448
 Gutenberg-0.447
 Rib-0.445
Token D
Token ID360
Feature activation-9.63e-01

Pos logit contributions
zsche+0.031
:,+0.030
athing+0.029
ilan+0.029
 df+0.028
Neg logit contributions
ooters-0.025
 Krish-0.023
 Chaser-0.023
 Gutenberg-0.023
 Rib-0.022
Tokenella
Token ID12627
Feature activation+2.03e-01

Pos logit contributions
zsche+0.825
:,+0.780
athing+0.774
ilan+0.767
 df+0.746
Neg logit contributions
ooters-0.590
 Krish-0.523
 Chaser-0.507
 Gutenberg-0.506
 Rib-0.500
Token May
Token ID1737
Feature activation-6.30e-02

Pos logit contributions
ooters+0.571
 Krish+0.513
 Chaser+0.496
 Rib+0.493
 Gutenberg+0.493
Neg logit contributions
zsche-0.932
:,-0.889
athing-0.884
ilan-0.882
 df-0.844
Token,
Token ID11
Feature activation-2.93e-01

Pos logit contributions
zsche+0.050
:,+0.047
athing+0.047
ilan+0.047
 df+0.045
Neg logit contributions
ooters-0.042
 Krish-0.038
 Chaser-0.037
 Rib-0.037
 Gutenberg-0.037
Token but
Token ID475
Feature activation-1.41e-01

Pos logit contributions
zsche+0.237
:,+0.225
athing+0.222
ilan+0.222
 df+0.212
Neg logit contributions
ooters-0.191
 Krish-0.175
 Chaser-0.170
 Gutenberg-0.169
 Rib-0.169
Token at
Token ID379
Feature activation-3.71e-01

Pos logit contributions
zsche+0.117
:,+0.111
athing+0.110
ilan+0.110
 df+0.105
Neg logit contributions
ooters-0.090
 Krish-0.081
 Chaser-0.079
 Gutenberg-0.079
 Rib-0.079
Token that
Token ID326
Feature activation-1.29e+00

Pos logit contributions
zsche+0.315
:,+0.302
athing+0.299
ilan+0.298
 df+0.287
Neg logit contributions
ooters-0.225
 Krish-0.201
 Chaser-0.197
 Gutenberg-0.195
 Rib-0.194
Token time
Token ID640
Feature activation-2.66e+00

Pos logit contributions
zsche+0.878
:,+0.833
athing+0.824
ilan+0.819
 df+0.779
Neg logit contributions
ooters-1.007
 Krish-0.923
 Chaser-0.908
 Gutenberg-0.902
 Rib-0.898
Token discussions
Token ID9984
Feature activation-2.94e-01

Pos logit contributions
zsche+2.041
:,+1.972
athing+1.938
ilan+1.928
 df+1.857
Neg logit contributions
ooters-1.800
 Krish-1.619
 Chaser-1.579
 Gutenberg-1.577
 Rib-1.555
Token were
Token ID547
Feature activation-2.80e-01

Pos logit contributions
zsche+0.246
:,+0.234
ilan+0.232
athing+0.232
 df+0.221
Neg logit contributions
ooters-0.184
 Krish-0.166
 Chaser-0.162
 Gutenberg-0.161
 Rib-0.160
Token still
Token ID991
Feature activation-1.69e+00

Pos logit contributions
zsche+0.217
:,+0.204
ilan+0.202
athing+0.202
 df+0.192
Neg logit contributions
ooters-0.193
 Krish-0.177
 Chaser-0.173
 Gutenberg-0.171
 Rib-0.171
Token said
Token ID531
Feature activation-4.91e-01

Pos logit contributions
zsche+1.274
:,+1.219
athing+1.209
ilan+1.201
 df+1.148
Neg logit contributions
ooters-1.178
 Krish-1.068
 Chaser-1.049
 Gutenberg-1.044
 Rib-1.036
Token to
Token ID284
Feature activation+9.76e-01

Pos logit contributions
zsche+0.408
:,+0.394
athing+0.390
ilan+0.388
 df+0.372
Neg logit contributions
ooters-0.305
 Krish-0.273
 Chaser-0.268
 Gutenberg-0.266
 Rib-0.264
Token pull
Token ID2834
Feature activation-3.47e-01

Pos logit contributions
zsche+0.377
:,+0.358
athing+0.356
ilan+0.354
 df+0.334
Neg logit contributions
ooters-0.368
 Krish-0.337
 Gutenberg-0.331
 Chaser-0.330
 Rib-0.328
Token.
Token ID13
Feature activation-5.91e-01

Pos logit contributions
zsche+0.253
:,+0.240
athing+0.238
ilan+0.237
 df+0.226
Neg logit contributions
ooters-0.252
 Krish-0.230
 Chaser-0.226
 Gutenberg-0.225
 Rib-0.224
Token
Token ID198
Feature activation-2.60e+00

Pos logit contributions
zsche+0.475
:,+0.454
athing+0.450
ilan+0.448
 df+0.428
Neg logit contributions
ooters-0.386
 Krish-0.349
 Chaser-0.341
 Gutenberg-0.340
 Rib-0.338
Token
Token ID198
Feature activation-2.39e+00

Pos logit contributions
zsche+2.021
:,+1.964
athing+1.933
ilan+1.924
 df+1.846
Neg logit contributions
ooters-1.738
 Krish-1.572
 Chaser-1.543
 Gutenberg-1.534
 Rib-1.513
Tokenhttp
Token ID4023
Feature activation-3.82e+00

Pos logit contributions
zsche+1.621
:,+1.546
athing+1.525
ilan+1.521
 df+1.447
Neg logit contributions
ooters-1.857
 Krish-1.708
 Chaser-1.677
 Gutenberg-1.667
 Rib-1.660
Token://
Token ID1378
Feature activation-2.99e+00

Pos logit contributions
zsche+1.783
:,+1.716
athing+1.643
ilan+1.633
 df+1.523
Neg logit contributions
ooters-3.741
 Krish-3.502
 Chaser-3.455
 Gutenberg-3.431
 Rib-3.415
Tokenapi
Token ID15042
Feature activation-2.52e-01

Pos logit contributions
zsche+2.304
:,+2.198
ilan+2.181
athing+2.173
 df+2.089
Neg logit contributions
ooters-2.040
 Krish-1.867
 Chaser-1.830
 Gutenberg-1.811
 Rib-1.806
Token.
Token ID13
Feature activation+8.94e-01

Pos logit contributions
zsche+0.189
:,+0.180
athing+0.178
ilan+0.178
 df+0.169
Neg logit contributions
ooters-0.178
 Krish-0.162
 Chaser-0.159
 Gutenberg-0.158
 Rib-0.157
Tokenc
Token ID66
Feature activation+1.54e+00

Pos logit contributions
ooters+0.554
 Krish+0.491
 Gutenberg+0.479
 Chaser+0.478
 Rib+0.475
Neg logit contributions
zsche-0.751
:,-0.718
athing-0.711
ilan-0.708
 df-0.678
Tokenensus
Token ID7314
Feature activation+2.27e-01

Pos logit contributions
ooters+0.935
 Krish+0.810
 Gutenberg+0.797
 Rib+0.788
Barn+0.778
Neg logit contributions
zsche-1.319
:,-1.247
ilan-1.244
athing-1.241
 df-1.181
Token.
Token ID13
Feature activation-4.08e-02

Pos logit contributions
ooters+0.161
 Krish+0.147
 Chaser+0.144
 Gutenberg+0.143
 Rib+0.142
Neg logit contributions
zsche-0.170
:,-0.162
athing-0.160
ilan-0.160
 df-0.152
Token Thompson
Token ID11654
Feature activation+1.08e+00

Pos logit contributions
ooters+1.035
 Krish+0.951
 Rib+0.917
 Gutenberg+0.914
 Chaser+0.910
Neg logit contributions
zsche-1.351
:,-1.288
athing-1.277
ilan-1.274
 df-1.223
Token came
Token ID1625
Feature activation+2.61e-01

Pos logit contributions
ooters+0.808
 Krish+0.747
 Rib+0.731
 Chaser+0.725
 Gutenberg+0.723
Neg logit contributions
zsche-0.773
:,-0.722
ilan-0.717
athing-0.713
 df-0.676
Token within
Token ID1626
Feature activation+1.75e-01

Pos logit contributions
ooters+0.165
 Krish+0.146
 Chaser+0.144
 Gutenberg+0.142
 Rib+0.141
Neg logit contributions
zsche-0.215
:,-0.207
athing-0.207
ilan-0.205
 df-0.198
Token 7
Token ID767
Feature activation-1.04e+00

Pos logit contributions
ooters+0.108
 Krish+0.097
 Chaser+0.095
 Gutenberg+0.095
 Rib+0.094
Neg logit contributions
zsche-0.148
:,-0.141
athing-0.140
ilan-0.139
 df-0.134
Token percentage
Token ID5873
Feature activation-1.73e+00

Pos logit contributions
zsche+0.730
:,+0.695
athing+0.686
ilan+0.686
 df+0.651
Neg logit contributions
ooters-0.778
 Krish-0.712
 Chaser-0.699
 Gutenberg-0.696
 Rib-0.692
Token points
Token ID2173
Feature activation-2.68e+00

Pos logit contributions
zsche+1.185
:,+1.128
athing+1.115
ilan+1.109
 df+1.056
Neg logit contributions
ooters-1.332
 Krish-1.221
 Chaser-1.200
 Gutenberg-1.194
 Rib-1.186
Token of
Token ID286
Feature activation-1.80e+00

Pos logit contributions
zsche+1.987
:,+1.931
athing+1.906
ilan+1.891
 df+1.813
Neg logit contributions
ooters-1.887
 Krish-1.708
 Chaser-1.685
 Gutenberg-1.667
 Rib-1.643
Token defeating
Token ID21320
Feature activation-2.50e-01

Pos logit contributions
zsche+1.346
:,+1.293
athing+1.283
ilan+1.272
 df+1.220
Neg logit contributions
ooters-1.269
 Krish-1.146
 Chaser-1.125
 Gutenberg-1.119
 Rib-1.108
Token Republican
Token ID3415
Feature activation+6.61e-01

Pos logit contributions
zsche+0.203
:,+0.192
ilan+0.190
athing+0.190
 df+0.181
Neg logit contributions
ooters-0.163
 Krish-0.148
 Chaser-0.144
 Gutenberg-0.144
 Rib-0.143
Token Ron
Token ID6575
Feature activation+3.44e-01

Pos logit contributions
ooters+0.412
 Krish+0.381
 Chaser+0.367
 Rib+0.366
 Gutenberg+0.363
Neg logit contributions
zsche-0.558
:,-0.525
ilan-0.522
athing-0.521
 df-0.494
Token Est
Token ID10062
Feature activation-2.33e-01

Pos logit contributions
ooters+0.208
 Krish+0.189
 Chaser+0.183
 Rib+0.183
 Gutenberg+0.181
Neg logit contributions
zsche-0.293
:,-0.280
athing-0.278
ilan-0.276
 df-0.266
Tokenus
Token ID385
Feature activation+7.88e-01

Pos logit contributions
ooters+0.587
 Krish+0.535
 Chaser+0.522
 Gutenberg+0.520
 Rib+0.518
Neg logit contributions
zsche-0.509
:,-0.479
athing-0.472
ilan-0.471
 df-0.449
Tokenhers
Token ID7084
Feature activation-1.16e-01

Pos logit contributions
ooters+0.558
 Krish+0.501
 Gutenberg+0.486
 Rib+0.484
 Chaser+0.479
Neg logit contributions
zsche-0.598
:,-0.565
athing-0.556
ilan-0.555
 df-0.532
Token*)
Token ID28104
Feature activation+5.42e-01

Pos logit contributions
zsche+0.108
:,+0.103
athing+0.103
ilan+0.102
 df+0.098
Neg logit contributions
ooters-0.061
 Krish-0.054
 Chaser-0.052
 Gutenberg-0.052
 Rib-0.052
Token
Token ID198
Feature activation-6.06e-01

Pos logit contributions
ooters+0.353
 Krish+0.318
 Chaser+0.311
 Gutenberg+0.310
 Rib+0.307
Neg logit contributions
zsche-0.437
:,-0.417
athing-0.414
ilan-0.412
 df-0.395
Token
Token ID198
Feature activation-9.54e-01

Pos logit contributions
zsche+0.505
:,+0.479
athing+0.476
ilan+0.475
 df+0.455
Neg logit contributions
ooters-0.380
 Krish-0.342
 Chaser-0.334
 Gutenberg-0.332
 Rib-0.331
TokenTotal
Token ID14957
Feature activation-2.49e+00

Pos logit contributions
zsche+0.566
:,+0.531
athing+0.525
ilan+0.522
 df+0.491
Neg logit contributions
ooters-0.825
 Krish-0.764
 Chaser-0.752
 Gutenberg-0.749
 Rib-0.746
Token guaranteed
Token ID11462
Feature activation-8.84e-01

Pos logit contributions
zsche+1.868
:,+1.784
athing+1.764
ilan+1.759
 df+1.678
Neg logit contributions
ooters-1.756
 Krish-1.595
 Chaser-1.563
 Gutenberg-1.553
 Rib-1.546
Token at
Token ID379
Feature activation-1.51e+00

Pos logit contributions
zsche+0.765
:,+0.728
ilan+0.725
athing+0.721
 df+0.692
Neg logit contributions
ooters-0.524
 Krish-0.470
 Chaser-0.457
 Rib-0.454
 Gutenberg-0.453
Token signing
Token ID8415
Feature activation-4.73e-01

Pos logit contributions
zsche+1.270
:,+1.210
athing+1.200
ilan+1.197
 df+1.146
Neg logit contributions
ooters-0.940
 Krish-0.848
 Chaser-0.828
 Gutenberg-0.824
 Rib-0.820
Token:
Token ID25
Feature activation-1.21e+00

Pos logit contributions
zsche+0.281
athing+0.257
ilan+0.257
:,+0.252
 df+0.239
Neg logit contributions
ooters-0.411
 Krish-0.383
 Chaser-0.377
 Rib-0.377
 Gutenberg-0.376
Token $
Token ID720
Feature activation-2.14e+00

Pos logit contributions
zsche+1.272
:,+1.233
athing+1.224
ilan+1.216
 df+1.187
Neg logit contributions
ooters-0.491
 Krish-0.408
 Gutenberg-0.395
 Chaser-0.392
Barn-0.375