TOP ACTIVATIONS
MAX = 1.600

andr Vas ile vs ky .[ 27 ] Life
failure to provide any form of confrontation of the accuser made
the W aff en SS .[ 27 ] On
ts lasted for 25 days .[ 1 ] Exec
endants ' failure to provide any form of confrontation of the
Tam pe re Art Museum .[ 27 ] Twenty
Officials said that blaze had been contained on Tuesday ,
a grinding stone upon him .[ 1 ] This is also
be headed on the spot .[ 1 ] According
7 { ( 17 s )} 27 . R xd 8
made the proceeding against John Doe fundamentally unfair ," the ruling
the Abu N idal gang .[ 47 ] Indeed , after
as a reason for it .[ 28 ] On the next
Def endants ' failure to provide any form of confrontation of
opposed being cons cript ed .[ 28 ] Aut onomous military
21 22 23 24 25 26 27 # loop over the
of the accuser made the proceeding against John Doe fundamentally unfair
% opposed ), but a small plurality of women also backed
%. " The Wisconsin straw poll and huge turn
ibe products . [ 7 ] Because Chevron and

MIN ACTIVATIONS
MIN = -1.499

b = fit ( space ); // 2 . Sl ice
Is Not Valid ) ) return space ; // 4 .
same dinosaurs . " Comp aring such different molecules as minerals
terminate if any are not valid // This is the base
weakness . As its name would suggest , this is one
( part in parts ) return ob b _ sub div
. If you were to compare them at lower refresh rates
the space along the shorter axis of the O BB let
tissue separating the living from the dead most unmist ak ably
meal for two on Z om ato comes at Rs 100
orter Ax is , space ); // 3 . Check validity
) ) = 0 . 1 // metres _ P ulse
strong feelings one way or the other because of their taste
of course directly against the only other 144 Hz IPS screen
ara î t c epend ant que le Command ement est
is by war . In " Post Captain ," Stephen and
ob b _ sub div ide ( part ); }
s meddling in the US election and possible ties to
as 23 people since he entered the U . S .
Valid ) ) return space ; // 4 . Rec urs

INTERVAL 0.825 - 1.600
CONTAINS 0.257%

Now , I suspect some GOP senators and their allies
the argument is specific to feminist political and moral commitments because
't I tell you I 'd burn your ( re ar
UC officials needed to provide fundamental fairness to a state university
adults who still need or want nicotine could get it from

INTERVAL 0.050 - 0.825
CONTAINS 42.029%

reducing carbon dioxide emissions by 40 % by 2020 from 1990
it , said Moh an . "
was cancelled A man was able to successfully buy
( D ACA ) program . Trump formally ended
Ricky G erv ais 2007 Alec Baldwin

INTERVAL -0.724 - 0.050
CONTAINS 57.063%

get back where we were last year . "
remember being truly afraid . The first time , he genuinely
s younger brother Vin od , brother - in - law
RF : And it varies a bit along the
ahead of her Dec . 6 runoff against Rep . Bill

INTERVAL -1.499 - -0.724
CONTAINS 0.652%

with AWS . We will deploy the application in one region
A - Fat C ops - Cl over field - K
c 7 { ( 108 s )} 32 . K f
competition while at the same time we are not neglect ing
him on Twitter @ jon chest o <|endoftext|> An Open Letter
Tokenandr
Token ID46273
Feature activation+5.29e-01

Pos logit contributions
semble+3.495
 Nurs+3.088
 Crate+2.955
 COL+2.387
+2.383
Neg logit contributions
ogene-8.644
obi-8.320
hene-8.181
aucus-8.168
imum-7.993
Token Vas
Token ID23663
Feature activation+1.16e-02

Pos logit contributions
semble+5.040
 Nurs+4.806
 Crate+4.690
 COL+4.148
+3.896
Neg logit contributions
ogene-6.391
hene-6.139
obi-6.133
aucus-5.930
imum-5.831
Tokenile
Token ID576
Feature activation+6.92e-01

Pos logit contributions
semble+0.187
 Crate+0.174
 Nurs+0.172
CENT+0.160
 COL+0.159
Neg logit contributions
ogene-0.074
obi-0.070
hene-0.069
aucus-0.062
imum-0.061
Tokenvs
Token ID14259
Feature activation+4.99e-01

Pos logit contributions
semble+5.516
 Nurs+4.667
 Crate+4.474
forms+4.014
 COL+3.941
Neg logit contributions
ogene-9.327
hene-8.971
obi-8.937
imum-8.774
aucus-8.749
Tokenky
Token ID2584
Feature activation+5.91e-01

Pos logit contributions
semble+4.007
 Crate+3.352
 Nurs+3.319
CENT+2.798
 COL+2.794
Neg logit contributions
ogene-7.025
obi-6.702
hene-6.602
aucus-6.501
imum-6.446
Token.[
Token ID3693
Feature activation+1.60e+00

Pos logit contributions
semble+5.338
 Nurs+4.822
 Crate+4.674
 COL+4.213
 prosecutors+4.166
Neg logit contributions
ogene-7.350
obi-7.143
hene-7.129
aucus-7.011
imum-6.951
Token27
Token ID1983
Feature activation-1.40e-01

Pos logit contributions
semble+10.754
 Nurs+9.143
+8.892
 Crate+8.702
CENT+8.673
Neg logit contributions
ogene-17.330
hene-16.555
imum-16.436
aucus-16.212
ochet-15.932
Token]
Token ID60
Feature activation+4.19e-01

Pos logit contributions
ogene+1.638
obi+1.601
hene+1.591
aucus+1.491
imum+1.485
Neg logit contributions
semble-1.498
 Crate-1.361
 Nurs-1.339
CENT-1.197
 COL-1.182
Token\n
Token ID198
Feature activation+3.30e-01

Pos logit contributions
semble+3.692
 Nurs+3.452
 Crate+3.388
 COL+2.933
CENT+2.874
Neg logit contributions
ogene-5.581
hene-5.336
obi-5.305
imum-5.145
aucus-5.119
Token\n
Token ID198
Feature activation+9.62e-02

Pos logit contributions
semble+2.887
 Crate+2.614
 Nurs+2.553
 COL+2.232
CENT+2.231
Neg logit contributions
ogene-4.478
hene-4.324
obi-4.286
aucus-4.102
imum-4.095
TokenLife
Token ID14662
Feature activation+1.71e-01

Pos logit contributions
semble+1.084
 Crate+0.982
 Nurs+0.973
CENT+0.894
 COL+0.864
Neg logit contributions
ogene-1.078
hene-1.039
obi-1.037
aucus-0.976
imum-0.971
Token failure
Token ID5287
Feature activation+5.08e-01

Pos logit contributions
semble+7.507
 Crate+6.916
 Nurs+6.874
 prosecutors+6.515
 COL+6.440
Neg logit contributions
ogene-10.138
hene-9.949
obi-9.774
aucus-9.523
ochet-9.343
Token to
Token ID284
Feature activation+1.24e+00

Pos logit contributions
semble+4.644
 Crate+4.024
 Nurs+3.944
+3.605
 prosecutors+3.603
Neg logit contributions
ogene-6.512
hene-6.218
obi-6.198
ochet-5.940
aucus-5.921
Token provide
Token ID2148
Feature activation+1.32e+00

Pos logit contributions
semble+9.786
 COL+8.047
 prosecutors+8.032
 Crate+8.023
 Nurs+7.783
Neg logit contributions
hene-15.088
ogene-14.840
aucus-14.547
obi-14.376
imum-14.251
Token any
Token ID597
Feature activation+1.54e+00

Pos logit contributions
semble+9.564
 prosecutors+8.988
 Nurs+8.765
 Crate+8.479
 COL+8.405
Neg logit contributions
hene-15.651
ogene-15.332
obi-14.808
ochet-14.604
imum-14.536
Token form
Token ID1296
Feature activation+1.05e+00

Pos logit contributions
semble+11.123
 prosecutors+10.558
 procedural+10.067
 Crate+9.957
 Nurs+9.926
Neg logit contributions
hene-16.643
aucus-16.198
ochet-16.092
ogene-15.915
obi-15.517
Token of
Token ID286
Feature activation+1.60e+00

Pos logit contributions
semble+7.854
 Crate+7.096
 Nurs+6.857
+6.757
 prosecutors+6.548
Neg logit contributions
ogene-12.081
obi-11.708
hene-11.670
ochet-11.562
anski-11.294
Token confrontation
Token ID19207
Feature activation+8.17e-01

Pos logit contributions
semble+10.851
 prosecutors+10.502
 procedural+10.256
 Nurs+10.047
 reversible+9.990
Neg logit contributions
hene-17.418
ogene-16.962
ochet-16.730
aucus-16.672
imum-16.588
Token of
Token ID286
Feature activation+6.85e-01

Pos logit contributions
semble+7.405
 Crate+6.438
 Nurs+6.363
 COL+5.957
 prosecutors+5.948
Neg logit contributions
ogene-9.832
hene-9.645
obi-9.489
anski-9.393
ochet-9.288
Token the
Token ID262
Feature activation+6.70e-01

Pos logit contributions
semble+6.032
 Nurs+5.608
 Crate+5.431
 COL+5.222
 prosecutors+5.110
Neg logit contributions
ogene-8.450
hene-8.402
obi-8.251
aucus-7.887
imum-7.874
Token accuser
Token ID43738
Feature activation+4.24e-01

Pos logit contributions
semble+5.770
 Nurs+5.268
 Crate+5.139
 COL+4.834
 prosecutors+4.716
Neg logit contributions
ogene-8.406
obi-8.349
hene-8.321
ochet-8.024
aucus-8.017
Token made
Token ID925
Feature activation+8.59e-01

Pos logit contributions
semble+4.192
 Crate+3.769
 Nurs+3.713
 COL+3.355
 prosecutors+3.308
Neg logit contributions
ogene-5.117
hene-4.995
obi-4.972
aucus-4.761
imum-4.760
Token the
Token ID262
Feature activation+5.06e-01

Pos logit contributions
semble+2.997
 Nurs+2.739
 Crate+2.687
 COL+2.444
CENT+2.388
Neg logit contributions
ogene-3.487
obi-3.451
hene-3.436
aucus-3.240
imum-3.195
Token W
Token ID370
Feature activation+3.32e-01

Pos logit contributions
semble+5.499
 Nurs+5.132
 Crate+5.021
 COL+4.630
 prosecutors+4.481
Neg logit contributions
obi-5.476
ogene-5.474
hene-5.381
imum-5.070
aucus-5.042
Tokenaff
Token ID2001
Feature activation+2.20e-01

Pos logit contributions
semble+4.191
 Crate+3.794
 Nurs+3.764
CENT+3.545
 COL+3.469
Neg logit contributions
ogene-3.047
hene-2.963
obi-2.952
imum-2.744
aucus-2.715
Tokenen
Token ID268
Feature activation+2.00e-02

Pos logit contributions
semble+1.684
 Crate+1.427
 Nurs+1.419
CENT+1.188
 COL+1.163
Neg logit contributions
ogene-3.243
obi-3.167
hene-3.152
aucus-3.031
imum-3.016
Token SS
Token ID6723
Feature activation+4.30e-01

Pos logit contributions
semble+0.185
 Crate+0.164
 Nurs+0.162
CENT+0.141
 COL+0.141
Neg logit contributions
ogene-0.263
obi-0.258
hene-0.257
aucus-0.244
imum-0.243
Token.[
Token ID3693
Feature activation+1.55e+00

Pos logit contributions
semble+4.297
 Nurs+3.880
 Crate+3.806
 COL+3.458
CENT+3.407
Neg logit contributions
ogene-5.158
obi-5.025
hene-4.957
imum-4.757
aucus-4.711
Token27
Token ID1983
Feature activation-1.72e-01

Pos logit contributions
semble+10.189
 Crate+8.442
 Nurs+8.399
+8.051
CENT+7.914
Neg logit contributions
ogene-17.438
hene-17.175
imum-16.604
obi-16.240
ochet-16.227
Token]
Token ID60
Feature activation+1.33e-01

Pos logit contributions
ogene+2.014
obi+1.971
hene+1.957
aucus+1.832
imum+1.824
Neg logit contributions
semble-1.831
 Crate-1.663
 Nurs-1.636
CENT-1.461
 COL-1.441
Token\n
Token ID198
Feature activation-1.86e-02

Pos logit contributions
semble+1.291
 Crate+1.166
 Nurs+1.162
 COL+1.010
CENT+1.009
Neg logit contributions
ogene-1.685
obi-1.642
hene-1.633
aucus-1.539
imum-1.538
Token\n
Token ID198
Feature activation+2.24e-01

Pos logit contributions
ogene+0.223
obi+0.219
hene+0.217
aucus+0.203
imum+0.203
Neg logit contributions
semble-0.194
 Crate-0.175
 Nurs-0.172
CENT-0.153
 COL-0.151
TokenOn
Token ID2202
Feature activation+3.02e-01

Pos logit contributions
semble+2.527
 Nurs+2.270
 Crate+2.252
CENT+2.106
 COL+2.010
Neg logit contributions
ogene-2.496
hene-2.405
obi-2.389
aucus-2.261
imum-2.249
Tokents
Token ID912
Feature activation-3.77e-01

Pos logit contributions
semble+1.086
 Crate+0.984
 Nurs+0.972
CENT+0.865
 COL+0.855
Neg logit contributions
ogene-1.187
obi-1.156
hene-1.148
aucus-1.086
imum-1.074
Token lasted
Token ID15436
Feature activation-4.62e-02

Pos logit contributions
ogene+3.963
hene+3.950
obi+3.937
anski+3.585
aucus+3.550
Neg logit contributions
semble-4.291
 Crate-3.927
 Nurs-3.831
CENT-3.477
 COL-3.428
Token for
Token ID329
Feature activation+1.78e-01

Pos logit contributions
ogene+0.509
obi+0.498
hene+0.494
aucus+0.458
imum+0.458
Neg logit contributions
semble-0.524
 Crate-0.480
 Nurs-0.472
CENT-0.427
 COL-0.420
Token 25
Token ID1679
Feature activation-4.92e-01

Pos logit contributions
semble+2.001
 Crate+1.827
 Nurs+1.807
 COL+1.610
CENT+1.609
Neg logit contributions
ogene-1.986
obi-1.932
hene-1.921
aucus-1.812
imum-1.793
Token days
Token ID1528
Feature activation-7.40e-02

Pos logit contributions
ogene+6.707
hene+6.602
obi+6.594
imum+6.143
anski+6.120
Neg logit contributions
semble-3.976
 Crate-3.551
 Nurs-3.407
CENT-2.914
 COL-2.883
Token.[
Token ID3693
Feature activation+1.54e+00

Pos logit contributions
ogene+0.857
obi+0.838
hene+0.834
aucus+0.777
imum+0.775
Neg logit contributions
semble-0.799
 Crate-0.728
 Nurs-0.714
CENT-0.640
 COL-0.631
Token1
Token ID16
Feature activation-2.21e-02

Pos logit contributions
semble+10.859
 Nurs+9.671
 Crate+9.460
+8.718
CENT+8.612
Neg logit contributions
ogene-17.115
hene-16.483
ochet-15.942
imum-15.920
ipeg-15.635
Token]
Token ID60
Feature activation+3.03e-01

Pos logit contributions
ogene+0.274
obi+0.265
hene+0.265
aucus+0.250
imum+0.249
Neg logit contributions
semble-0.222
 Crate-0.201
 Nurs-0.198
CENT-0.175
 COL-0.173
Token\n
Token ID198
Feature activation+3.57e-01

Pos logit contributions
semble+2.789
 Crate+2.572
 Nurs+2.546
CENT+2.177
 COL+2.172
Neg logit contributions
ogene-3.976
hene-3.826
obi-3.800
aucus-3.645
anski-3.612
Token\n
Token ID198
Feature activation+2.27e-01

Pos logit contributions
semble+3.075
 Crate+2.794
 Nurs+2.724
 COL+2.380
CENT+2.373
Neg logit contributions
ogene-4.879
hene-4.710
obi-4.658
aucus-4.478
anski-4.463
TokenExec
Token ID23002
Feature activation-1.64e-01

Pos logit contributions
semble+2.505
 Crate+2.270
 Nurs+2.231
CENT+2.083
+1.989
Neg logit contributions
ogene-2.575
hene-2.491
obi-2.462
aucus-2.354
imum-2.311
Tokenendants
Token ID30841
Feature activation+6.44e-01

Pos logit contributions
semble+7.298
 Crate+6.396
 Nurs+6.364
CENT+5.900
+5.830
Neg logit contributions
ogene-8.505
obi-8.202
hene-8.200
ochet-7.804
aucus-7.769
Token'
Token ID6
Feature activation+8.51e-01

Pos logit contributions
semble+5.831
 Crate+5.357
 Nurs+5.299
 COL+4.723
 prosecutors+4.682
Neg logit contributions
ogene-8.003
obi-7.892
hene-7.751
aucus-7.508
imum-7.451
Token failure
Token ID5287
Feature activation+5.08e-01

Pos logit contributions
semble+7.507
 Crate+6.916
 Nurs+6.874
 prosecutors+6.515
 COL+6.440
Neg logit contributions
ogene-10.138
hene-9.949
obi-9.774
aucus-9.523
ochet-9.343
Token to
Token ID284
Feature activation+1.24e+00

Pos logit contributions
semble+4.644
 Crate+4.024
 Nurs+3.944
+3.605
 prosecutors+3.603
Neg logit contributions
ogene-6.512
hene-6.218
obi-6.198
ochet-5.940
aucus-5.921
Token provide
Token ID2148
Feature activation+1.32e+00

Pos logit contributions
semble+9.786
 COL+8.047
 prosecutors+8.032
 Crate+8.023
 Nurs+7.783
Neg logit contributions
hene-15.088
ogene-14.840
aucus-14.547
obi-14.376
imum-14.251
Token any
Token ID597
Feature activation+1.54e+00

Pos logit contributions
semble+9.564
 prosecutors+8.988
 Nurs+8.765
 Crate+8.479
 COL+8.405
Neg logit contributions
hene-15.651
ogene-15.332
obi-14.808
ochet-14.604
imum-14.536
Token form
Token ID1296
Feature activation+1.05e+00

Pos logit contributions
semble+11.123
 prosecutors+10.558
 procedural+10.067
 Crate+9.957
 Nurs+9.926
Neg logit contributions
hene-16.643
aucus-16.198
ochet-16.092
ogene-15.915
obi-15.517
Token of
Token ID286
Feature activation+1.60e+00

Pos logit contributions
semble+7.854
 Crate+7.096
 Nurs+6.857
+6.757
 prosecutors+6.548
Neg logit contributions
ogene-12.081
obi-11.708
hene-11.670
ochet-11.562
anski-11.294
Token confrontation
Token ID19207
Feature activation+8.17e-01

Pos logit contributions
semble+10.851
 prosecutors+10.502
 procedural+10.256
 Nurs+10.047
 reversible+9.990
Neg logit contributions
hene-17.418
ogene-16.962
ochet-16.730
aucus-16.672
imum-16.588
Token of
Token ID286
Feature activation+6.85e-01

Pos logit contributions
semble+7.405
 Crate+6.438
 Nurs+6.363
 COL+5.957
 prosecutors+5.948
Neg logit contributions
ogene-9.832
hene-9.645
obi-9.489
anski-9.393
ochet-9.288
Token the
Token ID262
Feature activation+6.70e-01

Pos logit contributions
semble+6.032
 Nurs+5.608
 Crate+5.431
 COL+5.222
 prosecutors+5.110
Neg logit contributions
ogene-8.450
hene-8.402
obi-8.251
aucus-7.887
imum-7.874
Token Tam
Token ID11552
Feature activation-1.48e-01

Pos logit contributions
semble+3.291
 Nurs+3.012
 Crate+3.005
 COL+2.646
CENT+2.595
Neg logit contributions
ogene-3.783
hene-3.694
obi-3.693
aucus-3.495
imum-3.428
Tokenpe
Token ID431
Feature activation-4.67e-02

Pos logit contributions
ogene+1.507
obi+1.476
hene+1.461
aucus+1.346
imum+1.343
Neg logit contributions
semble-1.810
 Crate-1.673
 Nurs-1.644
CENT-1.502
 COL-1.479
Tokenre
Token ID260
Feature activation-1.04e-01

Pos logit contributions
ogene+0.244
obi+0.231
hene+0.228
aucus+0.194
imum+0.192
Neg logit contributions
semble-0.804
 Crate-0.758
 Nurs-0.751
CENT-0.703
 COL-0.698
Token Art
Token ID3683
Feature activation-4.21e-02

Pos logit contributions
ogene+1.234
obi+1.208
hene+1.203
imum+1.120
aucus+1.117
Neg logit contributions
semble-1.097
 Crate-0.995
 Nurs-0.976
CENT-0.877
 COL-0.859
Token Museum
Token ID9594
Feature activation+3.43e-02

Pos logit contributions
ogene+0.476
obi+0.465
hene+0.462
aucus+0.431
imum+0.430
Neg logit contributions
semble-0.466
 Crate-0.425
 Nurs-0.419
CENT-0.376
 COL-0.371
Token.[
Token ID3693
Feature activation+1.49e+00

Pos logit contributions
semble+0.376
 Crate+0.342
 Nurs+0.338
CENT+0.301
 COL+0.299
Neg logit contributions
ogene-0.394
obi-0.385
hene-0.382
aucus-0.358
imum-0.357
Token27
Token ID1983
Feature activation-2.93e-01

Pos logit contributions
semble+10.099
 Nurs+8.432
 Crate+8.336
+7.902
CENT+7.588
Neg logit contributions
ogene-16.812
hene-16.742
imum-16.089
ochet-15.887
aucus-15.776
Token]
Token ID60
Feature activation-8.82e-02

Pos logit contributions
ogene+3.313
obi+3.264
hene+3.228
aucus+2.996
imum+2.985
Neg logit contributions
semble-3.214
 Crate-2.925
 Nurs-2.876
CENT-2.581
 COL-2.532
Token\n
Token ID198
Feature activation+6.03e-02

Pos logit contributions
ogene+1.073
obi+1.052
hene+1.044
aucus+0.980
imum+0.976
Neg logit contributions
semble-0.899
 Crate-0.814
 Nurs-0.798
CENT-0.711
 COL-0.699
Token\n
Token ID198
Feature activation-1.95e-01

Pos logit contributions
semble+0.605
 Crate+0.547
 Nurs+0.537
CENT+0.476
 COL+0.470
Neg logit contributions
ogene-0.748
obi-0.730
hene-0.727
aucus-0.683
imum-0.681
TokenTwenty
Token ID34096
Feature activation-4.13e-01

Pos logit contributions
ogene+2.124
obi+2.092
hene+2.061
aucus+1.917
imum+1.913
Neg logit contributions
semble-2.202
 Crate-2.037
 Nurs-1.996
 COL-1.779
CENT-1.774
Token\n
Token ID198
Feature activation+8.59e-02

Pos logit contributions
semble+1.881
 Crate+1.702
 Nurs+1.671
CENT+1.472
 COL+1.462
Neg logit contributions
ogene-2.640
hene-2.556
obi-2.550
aucus-2.416
imum-2.408
TokenOfficials
Token ID25883
Feature activation+6.08e-01

Pos logit contributions
semble+0.967
 Crate+0.879
 Nurs+0.869
CENT+0.797
 COL+0.772
Neg logit contributions
ogene-0.965
hene-0.933
obi-0.931
aucus-0.872
imum-0.867
Token said
Token ID531
Feature activation+5.09e-01

Pos logit contributions
semble+6.427
 Nurs+5.840
 Crate+5.722
 COL+5.146
 prosecutors+5.115
Neg logit contributions
ogene-6.926
obi-6.744
hene-6.555
imum-6.389
aucus-6.302
Token that
Token ID326
Feature activation+5.05e-01

Pos logit contributions
semble+4.946
 Crate+4.482
 Nurs+4.450
 COL+3.995
 prosecutors+3.941
Neg logit contributions
ogene-6.251
obi-6.108
hene-6.018
imum-5.746
aucus-5.624
Token blaze
Token ID31259
Feature activation+7.67e-01

Pos logit contributions
semble+5.002
 Crate+4.537
 Nurs+4.497
 COL+4.031
 prosecutors+3.966
Neg logit contributions
ogene-6.124
obi-6.014
hene-5.894
imum-5.664
aucus-5.599
Token had
Token ID550
Feature activation+1.49e+00

Pos logit contributions
semble+6.708
 Nurs+6.199
 Crate+6.106
 prosecutors+5.669
 COL+5.554
Neg logit contributions
ogene-9.398
hene-9.070
obi-8.973
imum-8.931
aucus-8.759
Token been
Token ID587
Feature activation+7.81e-01

Pos logit contributions
semble+10.277
 inciner+8.984
 Nurs+8.749
 Crate+8.591
 engulf+8.371
Neg logit contributions
ogene-17.003
hene-16.854
imum-16.682
obi-16.329
isSpecialOrderable-16.316
Token contained
Token ID7763
Feature activation+4.20e-01

Pos logit contributions
semble+6.995
 Crate+6.172
 Nurs+6.041
 COL+5.549
 extingu+5.547
Neg logit contributions
ogene-9.299
hene-9.244
obi-9.238
imum-9.070
aucus-8.967
Token on
Token ID319
Feature activation+3.11e-01

Pos logit contributions
semble+4.003
 Crate+3.569
 Nurs+3.493
 COL+3.098
 prosecutors+3.066
Neg logit contributions
ogene-5.209
obi-5.127
hene-5.101
aucus-4.864
imum-4.841
Token Tuesday
Token ID3431
Feature activation+4.02e-01

Pos logit contributions
semble+3.342
 Crate+3.060
 Nurs+3.022
 COL+2.695
CENT+2.667
Neg logit contributions
ogene-3.589
obi-3.515
hene-3.481
aucus-3.275
imum-3.270
Token,
Token ID11
Feature activation+3.41e-01

Pos logit contributions
semble+3.545
 Crate+3.170
 Nurs+3.152
 COL+2.741
 prosecutors+2.737
Neg logit contributions
ogene-5.213
obi-5.187
hene-5.117
aucus-4.920
imum-4.863
Token a
Token ID257
Feature activation-4.12e-01

Pos logit contributions
ogene+6.455
obi+6.272
hene+6.266
ochet+5.729
aucus+5.722
Neg logit contributions
semble-7.293
 Crate-6.576
 Nurs-6.541
CENT-6.125
 COL-5.823
Token grinding
Token ID28436
Feature activation-3.90e-01

Pos logit contributions
ogene+4.408
hene+4.347
obi+4.332
ochet+3.947
aucus+3.944
Neg logit contributions
semble-4.621
 Nurs-4.142
 Crate-4.112
CENT-3.783
 COL-3.629
Token stone
Token ID7815
Feature activation+9.24e-02

Pos logit contributions
ogene+4.290
hene+4.240
obi+4.218
ochet+3.860
aucus+3.847
Neg logit contributions
semble-4.323
 Nurs-3.893
 Crate-3.852
CENT-3.504
-3.409
Token upon
Token ID2402
Feature activation-6.08e-02

Pos logit contributions
semble+0.861
 Crate+0.773
 Nurs+0.756
CENT+0.660
 COL+0.653
Neg logit contributions
ogene-1.210
obi-1.185
hene-1.178
aucus-1.115
imum-1.110
Token him
Token ID683
Feature activation+1.63e-01

Pos logit contributions
ogene+0.705
obi+0.689
hene+0.685
aucus+0.641
imum+0.639
Neg logit contributions
semble-0.657
 Crate-0.598
 Nurs-0.588
CENT-0.526
 COL-0.520
Token.[
Token ID3693
Feature activation+1.46e+00

Pos logit contributions
semble+1.675
 Crate+1.511
 Nurs+1.487
 COL+1.312
CENT+1.310
Neg logit contributions
ogene-1.965
obi-1.919
hene-1.911
aucus-1.805
imum-1.798
Token1
Token ID16
Feature activation-1.05e-01

Pos logit contributions
semble+9.595
 Nurs+8.603
 Crate+8.370
+7.549
CENT+7.525
Neg logit contributions
ogene-17.385
hene-16.695
imum-16.196
ochet-16.169
ipeg-16.017
Token]
Token ID60
Feature activation-7.47e-02

Pos logit contributions
ogene+1.282
obi+1.250
hene+1.244
aucus+1.171
imum+1.166
Neg logit contributions
semble-1.073
 Crate-0.971
 Nurs-0.955
CENT-0.847
 COL-0.838
Token This
Token ID770
Feature activation+4.29e-02

Pos logit contributions
ogene+0.910
obi+0.895
hene+0.887
aucus+0.831
imum+0.830
Neg logit contributions
semble-0.763
 Crate-0.685
 Nurs-0.671
CENT-0.599
 COL-0.589
Token is
Token ID318
Feature activation-9.48e-02

Pos logit contributions
semble+0.454
 Crate+0.413
 Nurs+0.406
CENT+0.362
 COL+0.358
Neg logit contributions
ogene-0.508
obi-0.497
hene-0.494
aucus-0.464
imum-0.462
Token also
Token ID635
Feature activation-3.03e-02

Pos logit contributions
ogene+1.067
obi+1.046
hene+1.034
aucus+0.961
imum+0.961
Neg logit contributions
semble-1.056
 Crate-0.957
 Nurs-0.944
CENT-0.851
 COL-0.836
Token be
Token ID307
Feature activation-4.30e-01

Pos logit contributions
ogene+1.271
obi+1.247
hene+1.238
aucus+1.149
anski+1.140
Neg logit contributions
semble-1.180
 Crate-1.086
 Nurs-1.062
CENT-0.952
 COL-0.937
Tokenheaded
Token ID15353
Feature activation-2.63e-01

Pos logit contributions
obi+2.544
ogene+2.541
hene+2.521
anski+2.096
aucus+2.089
Neg logit contributions
semble-6.807
 Crate-6.570
 Nurs-6.489
CENT-5.962
 COL-5.959
Token on
Token ID319
Feature activation-9.17e-02

Pos logit contributions
ogene+2.882
obi+2.820
hene+2.793
aucus+2.580
ochet+2.567
Neg logit contributions
semble-2.916
 Crate-2.710
 Nurs-2.646
CENT-2.394
 COL-2.346
Token the
Token ID262
Feature activation-1.04e-01

Pos logit contributions
ogene+1.039
obi+1.013
hene+1.008
aucus+0.938
imum+0.936
Neg logit contributions
semble-1.013
 Crate-0.922
 Nurs-0.908
CENT-0.817
 COL-0.805
Token spot
Token ID4136
Feature activation-2.69e-01

Pos logit contributions
ogene+1.264
obi+1.231
hene+1.223
aucus+1.147
imum+1.142
Neg logit contributions
semble-1.062
 Crate-0.956
 Nurs-0.942
CENT-0.839
 COL-0.825
Token.[
Token ID3693
Feature activation+1.40e+00

Pos logit contributions
ogene+3.006
obi+2.944
hene+2.927
aucus+2.690
imum+2.687
Neg logit contributions
semble-2.948
 Crate-2.734
 Nurs-2.668
CENT-2.406
 COL-2.360
Token1
Token ID16
Feature activation-2.24e-02

Pos logit contributions
semble+9.531
 Nurs+8.454
 Crate+8.317
+7.482
CENT+7.424
Neg logit contributions
ogene-16.725
hene-15.968
imum-15.652
ochet-15.520
ipeg-15.294
Token]
Token ID60
Feature activation+9.61e-02

Pos logit contributions
ogene+0.287
obi+0.279
hene+0.278
aucus+0.264
imum+0.263
Neg logit contributions
semble-0.215
 Crate-0.194
 Nurs-0.191
CENT-0.167
 COL-0.166
Token\n
Token ID198
Feature activation+2.25e-01

Pos logit contributions
semble+0.948
 Crate+0.864
 Nurs+0.849
CENT+0.747
 COL+0.739
Neg logit contributions
ogene-1.206
obi-1.173
hene-1.169
aucus-1.103
imum-1.099
Token\n
Token ID198
Feature activation+9.24e-02

Pos logit contributions
semble+2.080
 Crate+1.884
 Nurs+1.844
CENT+1.618
 COL+1.612
Neg logit contributions
ogene-2.960
hene-2.867
obi-2.852
aucus-2.712
imum-2.703
TokenAccording
Token ID4821
Feature activation+3.05e-01

Pos logit contributions
semble+1.036
 Crate+0.944
 Nurs+0.928
CENT+0.850
 COL+0.825
Neg logit contributions
ogene-1.037
hene-1.003
obi-1.002
aucus-0.942
imum-0.933
Token7
Token ID22
Feature activation-1.57e-01

Pos logit contributions
semble+2.289
 Crate+2.020
 Nurs+1.966
CENT+1.735
 COL+1.703
Neg logit contributions
ogene-3.234
hene-3.171
obi-3.158
aucus-3.004
imum-2.967
Token {
Token ID1391
Feature activation+1.60e-01

Pos logit contributions
ogene+1.503
obi+1.475
hene+1.437
imum+1.331
aucus+1.327
Neg logit contributions
semble-1.976
 Crate-1.806
 Nurs-1.786
CENT-1.653
-1.612
Token(
Token ID7
Feature activation-1.83e-01

Pos logit contributions
semble+1.409
 Crate+1.255
 Nurs+1.233
CENT+1.079
 COL+1.056
Neg logit contributions
ogene-2.178
hene-2.119
obi-2.109
aucus-2.010
imum-1.998
Token17
Token ID1558
Feature activation-1.78e-01

Pos logit contributions
ogene+2.261
obi+2.226
hene+2.194
aucus+2.076
imum+2.063
Neg logit contributions
semble-1.820
 Crate-1.638
 Nurs-1.598
CENT-1.410
 COL-1.385
Tokens
Token ID82
Feature activation-6.38e-01

Pos logit contributions
ogene+1.630
obi+1.589
hene+1.570
aucus+1.443
imum+1.437
Neg logit contributions
semble-2.350
 Crate-2.176
 Nurs-2.146
CENT-1.967
 COL-1.945
Token)}
Token ID38165
Feature activation+1.38e+00

Pos logit contributions
obi+2.672
ogene+2.478
hene+2.222
ochet+1.847
imum+1.752
Neg logit contributions
semble-10.951
 Crate-10.333
 Nurs-10.329
-9.794
CENT-9.706
Token 27
Token ID2681
Feature activation+4.24e-01

Pos logit contributions
semble+7.305
 Nurs+6.778
 Crate+6.594
 Bash+6.039
 COL+5.415
Neg logit contributions
ogene-18.162
hene-17.727
obi-17.652
imum-17.370
anski-16.923
Token.
Token ID13
Feature activation+6.10e-01

Pos logit contributions
semble+4.074
 Crate+3.729
 Nurs+3.639
 COL+3.204
CENT+3.161
Neg logit contributions
ogene-5.349
hene-5.162
obi-5.148
aucus-4.917
imum-4.897
Token R
Token ID371
Feature activation+8.67e-01

Pos logit contributions
semble+5.292
 Crate+4.997
 Nurs+4.832
 COL+4.261
 prosecutors+3.968
Neg logit contributions
ogene-7.854
obi-7.569
hene-7.533
aucus-7.417
imum-7.349
Tokenxd
Token ID24954
Feature activation+3.94e-01

Pos logit contributions
semble+8.458
 Crate+7.302
 Nurs+7.024
 COL+6.355
forms+6.348
Neg logit contributions
ogene-9.422
hene-8.971
obi-8.844
imum-8.820
aucus-8.650
Token8
Token ID23
Feature activation-2.62e-02

Pos logit contributions
semble+3.374
 Crate+2.925
 Nurs+2.881
CENT+2.454
 COL+2.438
Neg logit contributions
ogene-5.378
hene-5.273
obi-5.202
aucus-4.931
imum-4.925
Token made
Token ID925
Feature activation+8.59e-01

Pos logit contributions
semble+4.192
 Crate+3.769
 Nurs+3.713
 COL+3.355
 prosecutors+3.308
Neg logit contributions
ogene-5.117
hene-4.995
obi-4.972
aucus-4.761
imum-4.760
Token the
Token ID262
Feature activation+8.43e-01

Pos logit contributions
semble+7.285
 Crate+6.751
 Nurs+6.674
 prosecutors+6.408
 COL+6.245
Neg logit contributions
ogene-10.515
hene-10.422
obi-10.149
aucus-9.771
anski-9.723
Token proceeding
Token ID18788
Feature activation+1.31e+00

Pos logit contributions
semble+7.400
 Nurs+7.052
 Crate+6.915
 COL+6.475
 prosecutors+6.463
Neg logit contributions
hene-9.923
obi-9.901
ogene-9.887
ochet-9.430
aucus-9.365
Token against
Token ID1028
Feature activation+6.18e-01

Pos logit contributions
semble+10.673
 Nurs+9.508
 prosecutors+9.454
 Crate+9.336
 COL+9.006
Neg logit contributions
ogene-14.481
hene-14.144
aucus-13.595
obi-13.475
imum-13.354
Token John
Token ID1757
Feature activation+5.13e-01

Pos logit contributions
semble+5.784
 Nurs+5.488
 Crate+5.298
 COL+4.962
 prosecutors+4.884
Neg logit contributions
ogene-7.437
hene-7.342
obi-7.209
aucus-6.811
imum-6.730
Token Doe
Token ID31780
Feature activation+1.38e+00

Pos logit contributions
semble+4.402
 Crate+4.100
 Nurs+4.043
 COL+3.587
 prosecutors+3.517
Neg logit contributions
ogene-6.545
hene-6.412
obi-6.290
aucus-6.150
ochet-5.939
Token fundamentally
Token ID17640
Feature activation+1.18e+00

Pos logit contributions
semble+10.537
 Nurs+9.988
 Crate+9.611
 prosecutors+9.534
 COL+9.506
Neg logit contributions
ogene-14.962
aucus-14.432
hene-14.377
obi-14.068
ipeg-13.870
Token unfair
Token ID11675
Feature activation+1.86e-01

Pos logit contributions
semble+9.328
 Crate+8.384
 reversible+8.228
 Nurs+8.069
 prosecutors+7.933
Neg logit contributions
ogene-13.851
hene-13.381
aucus-13.280
anski-13.087
obi-12.807
Token,"
Token ID553
Feature activation-5.66e-01

Pos logit contributions
semble+1.959
 Crate+1.756
 Nurs+1.747
 COL+1.562
CENT+1.561
Neg logit contributions
ogene-2.187
obi-2.117
hene-2.116
aucus-2.004
anski-1.977
Token the
Token ID262
Feature activation+5.38e-02

Pos logit contributions
ogene+6.152
hene+5.975
obi+5.925
anski+5.598
aucus+5.506
Neg logit contributions
semble-6.154
 Crate-5.649
 Nurs-5.430
CENT-5.059
 COL-4.818
Token ruling
Token ID6922
Feature activation-3.00e-01

Pos logit contributions
semble+0.547
 Crate+0.494
 Nurs+0.489
CENT+0.432
 COL+0.429
Neg logit contributions
ogene-0.657
obi-0.645
hene-0.641
aucus-0.601
imum-0.600
Token the
Token ID262
Feature activation-4.02e-02

Pos logit contributions
ogene+3.870
obi+3.783
hene+3.756
anski+3.507
aucus+3.487
Neg logit contributions
semble-3.810
 Crate-3.476
 Nurs-3.378
CENT-3.100
 COL-3.009
Token Abu
Token ID13098
Feature activation-1.49e-01

Pos logit contributions
ogene+0.462
obi+0.452
hene+0.449
aucus+0.420
imum+0.418
Neg logit contributions
semble-0.438
 Crate-0.399
 Nurs-0.392
CENT-0.352
 COL-0.347
Token N
Token ID399
Feature activation+3.56e-01

Pos logit contributions
ogene+1.919
obi+1.898
hene+1.879
imum+1.762
aucus+1.760
Neg logit contributions
semble-1.383
 Crate-1.241
 Nurs-1.209
CENT-1.079
 COL-1.058
Tokenidal
Token ID11624
Feature activation+5.41e-01

Pos logit contributions
semble+3.810
 Crate+3.380
 Nurs+3.329
CENT+2.988
+2.955
Neg logit contributions
ogene-4.039
hene-3.926
obi-3.884
aucus-3.693
anski-3.635
Token gang
Token ID7706
Feature activation-2.49e-01

Pos logit contributions
semble+5.325
 Nurs+4.914
 Crate+4.837
 prosecutors+4.230
 COL+4.208
Neg logit contributions
ogene-6.479
obi-6.231
hene-6.215
imum-5.983
aucus-5.968
Token.[
Token ID3693
Feature activation+1.36e+00

Pos logit contributions
ogene+2.800
obi+2.758
hene+2.734
aucus+2.535
anski+2.532
Neg logit contributions
semble-2.732
 Crate-2.514
 Nurs-2.460
CENT-2.212
 COL-2.189
Token47
Token ID2857
Feature activation-1.21e-01

Pos logit contributions
semble+5.206
 Nurs+3.627
 Crate+3.542
+3.118
CENT+3.064
Neg logit contributions
ogene-21.721
hene-20.699
imum-20.230
obi-19.988
ochet-19.450
Token]
Token ID60
Feature activation-2.73e-01

Pos logit contributions
ogene+1.319
obi+1.302
hene+1.282
aucus+1.189
imum+1.185
Neg logit contributions
semble-1.376
 Crate-1.249
 Nurs-1.233
CENT-1.107
-1.090
Token Indeed
Token ID9676
Feature activation-5.65e-02

Pos logit contributions
ogene+3.174
obi+3.160
hene+3.157
aucus+2.929
imum+2.914
Neg logit contributions
semble-2.799
 Crate-2.464
 Nurs-2.407
CENT-2.194
-2.170
Token,
Token ID11
Feature activation-2.23e-01

Pos logit contributions
ogene+0.670
obi+0.654
hene+0.650
aucus+0.610
imum+0.608
Neg logit contributions
semble-0.599
 Crate-0.542
 Nurs-0.534
CENT-0.476
 COL-0.470
Token after
Token ID706
Feature activation-2.65e-01

Pos logit contributions
ogene+2.484
obi+2.430
hene+2.423
anski+2.243
aucus+2.242
Neg logit contributions
semble-2.433
 Crate-2.220
 Nurs-2.179
CENT-1.984
-1.942
Token as
Token ID355
Feature activation-2.21e-01

Pos logit contributions
semble+1.755
 Crate+1.582
 Nurs+1.567
CENT+1.402
 COL+1.397
Neg logit contributions
ogene-1.883
obi-1.831
hene-1.821
aucus-1.723
imum-1.708
Token a
Token ID257
Feature activation-3.22e-01

Pos logit contributions
ogene+2.450
obi+2.410
hene+2.377
aucus+2.202
anski+2.198
Neg logit contributions
semble-2.459
 Crate-2.268
 Nurs-2.209
CENT-2.001
 COL-1.957
Token reason
Token ID1738
Feature activation-4.67e-01

Pos logit contributions
ogene+3.580
obi+3.535
hene+3.456
anski+3.216
ochet+3.185
Neg logit contributions
semble-3.555
 Crate-3.217
 Nurs-3.128
CENT-2.855
-2.777
Token for
Token ID329
Feature activation-5.15e-01

Pos logit contributions
ogene+4.835
obi+4.760
hene+4.673
imum+4.302
anski+4.268
Neg logit contributions
semble-5.536
 Crate-5.071
 Nurs-4.954
CENT-4.525
 COL-4.377
Token it
Token ID340
Feature activation+1.04e-01

Pos logit contributions
ogene+5.582
obi+5.425
hene+5.312
anski+4.958
ochet+4.935
Neg logit contributions
semble-5.697
 Crate-5.126
 Nurs-4.895
CENT-4.595
-4.487
Token.[
Token ID3693
Feature activation+1.34e+00

Pos logit contributions
semble+1.072
 Crate+0.968
 Nurs+0.961
CENT+0.849
 COL+0.846
Neg logit contributions
ogene-1.250
obi-1.222
hene-1.216
aucus-1.154
imum-1.146
Token28
Token ID2078
Feature activation-3.22e-01

Pos logit contributions
semble+9.635
 Crate+8.131
 Nurs+8.089
+7.219
CENT+7.075
Neg logit contributions
ogene-15.614
hene-14.708
obi-14.664
imum-14.618
anski-14.370
Token]
Token ID60
Feature activation+9.48e-02

Pos logit contributions
ogene+3.599
obi+3.538
hene+3.498
aucus+3.239
imum+3.224
Neg logit contributions
semble-3.576
 Crate-3.266
 Nurs-3.214
CENT-2.884
 COL-2.829
Token On
Token ID1550
Feature activation+7.47e-02

Pos logit contributions
semble+0.911
 Crate+0.822
 Nurs+0.812
CENT+0.710
 COL+0.703
Neg logit contributions
ogene-1.216
obi-1.185
hene-1.180
aucus-1.114
imum-1.113
Token the
Token ID262
Feature activation+2.82e-01

Pos logit contributions
semble+0.776
 Crate+0.703
 Nurs+0.692
CENT+0.615
 COL+0.608
Neg logit contributions
ogene-0.897
obi-0.878
hene-0.872
aucus-0.819
imum-0.816
Token next
Token ID1306
Feature activation-3.28e-02

Pos logit contributions
semble+2.841
 Crate+2.586
 Nurs+2.571
 COL+2.257
CENT+2.241
Neg logit contributions
ogene-3.392
hene-3.354
obi-3.347
aucus-3.151
imum-3.146
TokenDef
Token ID7469
Feature activation+7.55e-01

Pos logit contributions
semble+7.138
 Nurs+6.578
 Crate+6.460
CENT+6.365
 COL+6.007
Neg logit contributions
ogene-8.915
hene-8.431
obi-8.222
aucus-7.915
ochet-7.821
Tokenendants
Token ID30841
Feature activation+6.44e-01

Pos logit contributions
semble+7.298
 Crate+6.396
 Nurs+6.364
CENT+5.900
+5.830
Neg logit contributions
ogene-8.505
obi-8.202
hene-8.200
ochet-7.804
aucus-7.769
Token'
Token ID6
Feature activation+8.51e-01

Pos logit contributions
semble+5.831
 Crate+5.357
 Nurs+5.299
 COL+4.723
 prosecutors+4.682
Neg logit contributions
ogene-8.003
obi-7.892
hene-7.751
aucus-7.508
imum-7.451
Token failure
Token ID5287
Feature activation+5.08e-01

Pos logit contributions
semble+7.507
 Crate+6.916
 Nurs+6.874
 prosecutors+6.515
 COL+6.440
Neg logit contributions
ogene-10.138
hene-9.949
obi-9.774
aucus-9.523
ochet-9.343
Token to
Token ID284
Feature activation+1.24e+00

Pos logit contributions
semble+4.644
 Crate+4.024
 Nurs+3.944
+3.605
 prosecutors+3.603
Neg logit contributions
ogene-6.512
hene-6.218
obi-6.198
ochet-5.940
aucus-5.921
Token provide
Token ID2148
Feature activation+1.32e+00

Pos logit contributions
semble+9.786
 COL+8.047
 prosecutors+8.032
 Crate+8.023
 Nurs+7.783
Neg logit contributions
hene-15.088
ogene-14.840
aucus-14.547
obi-14.376
imum-14.251
Token any
Token ID597
Feature activation+1.54e+00

Pos logit contributions
semble+9.564
 prosecutors+8.988
 Nurs+8.765
 Crate+8.479
 COL+8.405
Neg logit contributions
hene-15.651
ogene-15.332
obi-14.808
ochet-14.604
imum-14.536
Token form
Token ID1296
Feature activation+1.05e+00

Pos logit contributions
semble+11.123
 prosecutors+10.558
 procedural+10.067
 Crate+9.957
 Nurs+9.926
Neg logit contributions
hene-16.643
aucus-16.198
ochet-16.092
ogene-15.915
obi-15.517
Token of
Token ID286
Feature activation+1.60e+00

Pos logit contributions
semble+7.854
 Crate+7.096
 Nurs+6.857
+6.757
 prosecutors+6.548
Neg logit contributions
ogene-12.081
obi-11.708
hene-11.670
ochet-11.562
anski-11.294
Token confrontation
Token ID19207
Feature activation+8.17e-01

Pos logit contributions
semble+10.851
 prosecutors+10.502
 procedural+10.256
 Nurs+10.047
 reversible+9.990
Neg logit contributions
hene-17.418
ogene-16.962
ochet-16.730
aucus-16.672
imum-16.588
Token of
Token ID286
Feature activation+6.85e-01

Pos logit contributions
semble+7.405
 Crate+6.438
 Nurs+6.363
 COL+5.957
 prosecutors+5.948
Neg logit contributions
ogene-9.832
hene-9.645
obi-9.489
anski-9.393
ochet-9.288
Token opposed
Token ID6886
Feature activation-4.83e-01

Pos logit contributions
ogene+5.666
obi+5.645
hene+5.468
anski+5.152
ochet+4.996
Neg logit contributions
semble-6.192
 Crate-5.708
 Nurs-5.618
CENT-5.245
-5.020
Token being
Token ID852
Feature activation-4.57e-01

Pos logit contributions
ogene+4.928
obi+4.764
hene+4.695
ochet+4.402
imum+4.353
Neg logit contributions
semble-5.680
 Crate-5.225
 Nurs-5.007
CENT-4.700
-4.454
Token cons
Token ID762
Feature activation+8.15e-02

Pos logit contributions
ogene+4.939
obi+4.884
hene+4.794
ochet+4.465
anski+4.445
Neg logit contributions
semble-4.957
 Crate-4.622
 Nurs-4.465
CENT-4.096
-3.924
Tokencript
Token ID6519
Feature activation-3.27e-01

Pos logit contributions
semble+0.736
 Crate+0.651
 Nurs+0.643
CENT+0.555
 COL+0.551
Neg logit contributions
ogene-1.088
hene-1.063
obi-1.063
aucus-1.006
imum-1.003
Tokened
Token ID276
Feature activation-3.50e-01

Pos logit contributions
ogene+3.306
obi+3.258
hene+3.197
imum+2.970
anski+2.964
Neg logit contributions
semble-3.954
 Crate-3.664
 Nurs-3.579
CENT-3.312
 COL-3.189
Token.[
Token ID3693
Feature activation+1.32e+00

Pos logit contributions
ogene+3.932
obi+3.817
hene+3.809
ochet+3.502
imum+3.495
Neg logit contributions
semble-3.829
 Crate-3.560
 Nurs-3.456
CENT-3.149
-3.033
Token28
Token ID2078
Feature activation-2.63e-01

Pos logit contributions
semble+8.686
 Crate+7.139
 Nurs+6.903
CENT+6.118
+5.920
Neg logit contributions
ogene-16.500
hene-15.776
obi-15.622
imum-15.545
anski-15.232
Token]
Token ID60
Feature activation-7.06e-02

Pos logit contributions
ogene+3.020
obi+2.966
hene+2.934
aucus+2.732
imum+2.723
Neg logit contributions
semble-2.859
 Crate-2.605
 Nurs-2.563
CENT-2.291
 COL-2.252
Token Aut
Token ID5231
Feature activation-3.47e-01

Pos logit contributions
ogene+0.867
obi+0.850
hene+0.844
aucus+0.793
imum+0.789
Neg logit contributions
semble-0.714
 Crate-0.642
 Nurs-0.628
CENT-0.561
 COL-0.550
Tokenonomous
Token ID38175
Feature activation-1.16e-01

Pos logit contributions
ogene+3.081
obi+3.066
hene+2.875
imum+2.658
aucus+2.656
Neg logit contributions
semble-4.672
 Crate-4.415
 Nurs-4.298
CENT-3.973
 COL-3.933
Token military
Token ID2422
Feature activation+1.23e-02

Pos logit contributions
ogene+1.393
obi+1.366
hene+1.354
aucus+1.269
anski+1.263
Neg logit contributions
semble-1.199
 Crate-1.086
 Nurs-1.054
CENT-0.945
-0.926
Token 21
Token ID2310
Feature activation+4.37e-01

Pos logit contributions
semble+5.391
 Crate+5.006
 Nurs+4.881
 COL+4.353
CENT+4.014
Neg logit contributions
ogene-8.858
hene-8.720
obi-8.563
aucus-8.255
anski-8.231
Token 22
Token ID2534
Feature activation+4.25e-01

Pos logit contributions
semble+3.612
 Crate+3.274
 Nurs+3.211
 COL+2.831
+2.668
Neg logit contributions
ogene-6.007
hene-5.901
obi-5.879
aucus-5.642
imum-5.511
Token 23
Token ID2242
Feature activation-4.07e-02

Pos logit contributions
semble+3.694
 Crate+3.277
 Nurs+3.150
 COL+2.852
CENT+2.757
Neg logit contributions
ogene-5.632
obi-5.604
hene-5.532
aucus-5.313
anski-5.287
Token 24
Token ID1987
Feature activation+3.12e-01

Pos logit contributions
ogene+0.567
obi+0.556
hene+0.553
aucus+0.523
imum+0.521
Neg logit contributions
semble-0.345
 Crate-0.304
 Nurs-0.299
CENT-0.258
 COL-0.252
Token 25
Token ID1679
Feature activation+7.02e-01

Pos logit contributions
semble+2.826
 Crate+2.527
 Nurs+2.423
 COL+2.189
CENT+2.119
Neg logit contributions
ogene-4.165
obi-4.085
hene-4.081
aucus-3.937
imum-3.870
Token 26
Token ID2608
Feature activation+1.31e+00

Pos logit contributions
semble+4.181
 Crate+3.908
 Nurs+3.634
 COL+3.326
 prosecutors+2.836
Neg logit contributions
ogene-10.660
hene-10.443
obi-10.325
aucus-10.245
imum-10.080
Token 27
Token ID2681
Feature activation+1.01e+00

Pos logit contributions
semble+6.378
 Crate+4.718
 Nurs+4.547
+4.313
 COL+4.212
Neg logit contributions
hene-18.792
ogene-18.668
obi-18.539
anski-18.016
nai-17.994
Token #
Token ID1303
Feature activation+4.61e-03

Pos logit contributions
semble+6.111
 Crate+5.250
 Nurs+4.765
 COL+4.124
 prosecutors+3.691
Neg logit contributions
ogene-15.149
hene-14.693
obi-14.608
aucus-14.279
anski-14.212
Token loop
Token ID9052
Feature activation+4.71e-01

Pos logit contributions
semble+0.047
 Crate+0.043
 Nurs+0.042
CENT+0.037
 COL+0.037
Neg logit contributions
ogene-0.056
obi-0.055
hene-0.054
aucus-0.051
imum-0.051
Token over
Token ID625
Feature activation-5.14e-01

Pos logit contributions
semble+2.714
 Crate+2.080
 Nurs+2.030
 COL+1.689
+1.561
Neg logit contributions
ogene-7.718
hene-7.496
obi-7.440
aucus-7.303
anski-7.226
Token the
Token ID262
Feature activation-2.43e-01

Pos logit contributions
ogene+5.342
obi+5.219
hene+5.143
imum+4.777
anski+4.727
Neg logit contributions
semble-5.863
 Crate-5.429
 Nurs-5.300
CENT-4.880
-4.719
Token of
Token ID286
Feature activation+6.85e-01

Pos logit contributions
semble+7.405
 Crate+6.438
 Nurs+6.363
 COL+5.957
 prosecutors+5.948
Neg logit contributions
ogene-9.832
hene-9.645
obi-9.489
anski-9.393
ochet-9.288
Token the
Token ID262
Feature activation+6.70e-01

Pos logit contributions
semble+6.032
 Nurs+5.608
 Crate+5.431
 COL+5.222
 prosecutors+5.110
Neg logit contributions
ogene-8.450
hene-8.402
obi-8.251
aucus-7.887
imum-7.874
Token accuser
Token ID43738
Feature activation+4.24e-01

Pos logit contributions
semble+5.770
 Nurs+5.268
 Crate+5.139
 COL+4.834
 prosecutors+4.716
Neg logit contributions
ogene-8.406
obi-8.349
hene-8.321
ochet-8.024
aucus-8.017
Token made
Token ID925
Feature activation+8.59e-01

Pos logit contributions
semble+4.192
 Crate+3.769
 Nurs+3.713
 COL+3.355
 prosecutors+3.308
Neg logit contributions
ogene-5.117
hene-4.995
obi-4.972
aucus-4.761
imum-4.760
Token the
Token ID262
Feature activation+8.43e-01

Pos logit contributions
semble+7.285
 Crate+6.751
 Nurs+6.674
 prosecutors+6.408
 COL+6.245
Neg logit contributions
ogene-10.515
hene-10.422
obi-10.149
aucus-9.771
anski-9.723
Token proceeding
Token ID18788
Feature activation+1.31e+00

Pos logit contributions
semble+7.400
 Nurs+7.052
 Crate+6.915
 COL+6.475
 prosecutors+6.463
Neg logit contributions
hene-9.923
obi-9.901
ogene-9.887
ochet-9.430
aucus-9.365
Token against
Token ID1028
Feature activation+6.18e-01

Pos logit contributions
semble+10.673
 Nurs+9.508
 prosecutors+9.454
 Crate+9.336
 COL+9.006
Neg logit contributions
ogene-14.481
hene-14.144
aucus-13.595
obi-13.475
imum-13.354
Token John
Token ID1757
Feature activation+5.13e-01

Pos logit contributions
semble+5.784
 Nurs+5.488
 Crate+5.298
 COL+4.962
 prosecutors+4.884
Neg logit contributions
ogene-7.437
hene-7.342
obi-7.209
aucus-6.811
imum-6.730
Token Doe
Token ID31780
Feature activation+1.38e+00

Pos logit contributions
semble+4.402
 Crate+4.100
 Nurs+4.043
 COL+3.587
 prosecutors+3.517
Neg logit contributions
ogene-6.545
hene-6.412
obi-6.290
aucus-6.150
ochet-5.939
Token fundamentally
Token ID17640
Feature activation+1.18e+00

Pos logit contributions
semble+10.537
 Nurs+9.988
 Crate+9.611
 prosecutors+9.534
 COL+9.506
Neg logit contributions
ogene-14.962
aucus-14.432
hene-14.377
obi-14.068
ipeg-13.870
Token unfair
Token ID11675
Feature activation+1.86e-01

Pos logit contributions
semble+9.328
 Crate+8.384
 reversible+8.228
 Nurs+8.069
 prosecutors+7.933
Neg logit contributions
ogene-13.851
hene-13.381
aucus-13.280
anski-13.087
obi-12.807
Token%
Token ID4
Feature activation-2.20e-01

Pos logit contributions
semble+2.767
 Crate+2.380
 Nurs+2.325
 COL+1.925
CENT+1.923
Neg logit contributions
ogene-6.548
obi-6.412
hene-6.377
imum-6.132
aucus-6.084
Token opposed
Token ID6886
Feature activation+1.80e-01

Pos logit contributions
ogene+2.282
obi+2.241
hene+2.208
aucus+2.062
anski+2.035
Neg logit contributions
semble-2.626
 Crate-2.410
 Nurs-2.370
CENT-2.146
 COL-2.105
Token),
Token ID828
Feature activation+6.79e-01

Pos logit contributions
semble+1.716
 Crate+1.543
 Nurs+1.517
CENT+1.335
 COL+1.332
Neg logit contributions
ogene-2.308
obi-2.244
hene-2.238
imum-2.112
aucus-2.104
Token but
Token ID475
Feature activation+6.65e-01

Pos logit contributions
semble+6.231
 Nurs+5.830
 Crate+5.658
 COL+5.260
 prosecutors+5.143
Neg logit contributions
ogene-8.094
hene-7.917
imum-7.897
obi-7.889
ochet-7.512
Token a
Token ID257
Feature activation+8.56e-01

Pos logit contributions
semble+6.041
 Nurs+5.689
 Crate+5.606
 COL+5.031
 prosecutors+4.927
Neg logit contributions
ogene-8.270
hene-7.856
obi-7.761
imum-7.651
anski-7.382
Token small
Token ID1402
Feature activation+1.30e+00

Pos logit contributions
semble+7.422
 Nurs+7.005
 Crate+6.698
 COL+6.269
 hallmark+5.919
Neg logit contributions
ogene-10.468
obi-10.129
hene-10.036
imum-9.620
anski-9.540
Token plurality
Token ID36325
Feature activation-1.89e-01

Pos logit contributions
semble+9.279
 Nurs+9.113
 COL+8.269
 Crate+8.085
 Liber+7.650
Neg logit contributions
ogene-14.923
obi-14.685
aucus-14.550
imum-14.462
hene-14.432
Token of
Token ID286
Feature activation+1.63e-01

Pos logit contributions
ogene+1.998
obi+1.963
hene+1.936
aucus+1.802
anski+1.790
Neg logit contributions
semble-2.220
 Crate-2.040
 Nurs-2.001
CENT-1.808
 COL-1.780
Token women
Token ID1466
Feature activation-1.15e-01

Pos logit contributions
semble+1.512
 Crate+1.357
 Nurs+1.351
 COL+1.180
CENT+1.171
Neg logit contributions
ogene-2.124
hene-2.078
obi-2.076
imum-1.974
aucus-1.950
Token also
Token ID635
Feature activation-1.48e-01

Pos logit contributions
ogene+1.330
obi+1.305
hene+1.291
aucus+1.209
anski+1.201
Neg logit contributions
semble-1.242
 Crate-1.133
 Nurs-1.109
CENT-0.990
 COL-0.977
Token backed
Token ID9763
Feature activation-4.67e-02

Pos logit contributions
ogene+1.453
obi+1.425
hene+1.400
aucus+1.298
anski+1.292
Neg logit contributions
semble-1.840
 Crate-1.705
 Nurs-1.676
CENT-1.520
 COL-1.495
Token%.
Token ID7225
Feature activation+5.20e-01

Pos logit contributions
semble+0.682
 Nurs+0.577
 Crate+0.576
CENT+0.487
 COL+0.475
Neg logit contributions
ogene-1.419
obi-1.384
hene-1.381
imum-1.323
aucus-1.314
Token
Token ID198
Feature activation+8.24e-02

Pos logit contributions
semble+4.464
 Nurs+4.379
 Crate+4.202
 COL+3.687
CENT+3.646
Neg logit contributions
ogene-6.924
hene-6.572
obi-6.561
anski-6.318
imum-6.303
Token
Token ID198
Feature activation+3.60e-01

Pos logit contributions
semble+0.816
 Crate+0.738
 Nurs+0.724
CENT+0.641
 COL+0.633
Neg logit contributions
ogene-1.033
obi-1.006
hene-1.004
aucus-0.944
imum-0.941
Token"
Token ID1
Feature activation+5.01e-01

Pos logit contributions
semble+3.864
 Nurs+3.497
 Crate+3.486
CENT+3.322
+3.080
Neg logit contributions
ogene-4.146
hene-3.966
obi-3.906
imum-3.713
anski-3.713
TokenThe
Token ID464
Feature activation+7.05e-01

Pos logit contributions
semble+5.103
 Nurs+4.776
 Crate+4.681
CENT+4.435
 COL+4.139
Neg logit contributions
ogene-5.970
hene-5.641
obi-5.566
anski-5.346
imum-5.325
Token Wisconsin
Token ID9279
Feature activation+1.28e+00

Pos logit contributions
semble+6.858
 Nurs+6.564
 Crate+6.219
 COL+5.697
 prosecutors+5.572
Neg logit contributions
ogene-8.218
obi-7.969
hene-7.951
imum-7.570
anski-7.563
Token straw
Token ID14787
Feature activation-2.71e-01

Pos logit contributions
 Nurs+11.174
semble+10.854
 Crate+9.698
 COL+9.615
 sweep+9.034
Neg logit contributions
ogene-13.803
hene-13.208
obi-13.087
ochet-12.619
anski-12.589
Token poll
Token ID3278
Feature activation+2.83e-03

Pos logit contributions
ogene+3.621
obi+3.580
hene+3.540
aucus+3.370
imum+3.331
Neg logit contributions
semble-2.436
 Crate-2.171
 Nurs-2.105
CENT-1.835
 COL-1.791
Token and
Token ID290
Feature activation+1.62e-01

Pos logit contributions
semble+0.030
 Crate+0.027
 Nurs+0.027
CENT+0.024
 COL+0.024
Neg logit contributions
ogene-0.033
obi-0.033
hene-0.032
aucus-0.030
imum-0.030
Token huge
Token ID3236
Feature activation+8.82e-01

Pos logit contributions
semble+1.755
 Crate+1.593
 Nurs+1.587
 COL+1.407
CENT+1.401
Neg logit contributions
ogene-1.879
obi-1.833
hene-1.824
imum-1.714
anski-1.700
Token turn
Token ID1210
Feature activation+4.16e-01

Pos logit contributions
semble+7.911
 Nurs+7.332
 Crate+7.015
 COL+6.400
 sweep+6.089
Neg logit contributions
ogene-10.298
hene-10.262
obi-10.114
imum-9.819
anski-9.780
Tokenibe
Token ID32438
Feature activation-2.53e-01

Pos logit contributions
ogene+2.705
obi+2.703
hene+2.610
imum+2.381
ochet+2.352
Neg logit contributions
semble-4.193
 Crate-3.988
 Nurs-3.980
CENT-3.641
 COL-3.604
Token products
Token ID3186
Feature activation+5.07e-02

Pos logit contributions
ogene+2.701
obi+2.672
hene+2.607
imum+2.420
anski+2.409
Neg logit contributions
semble-2.917
 Crate-2.655
 Nurs-2.587
CENT-2.375
-2.316
Token.
Token ID13
Feature activation+9.99e-02

Pos logit contributions
semble+0.527
 Crate+0.477
 Nurs+0.469
CENT+0.417
 COL+0.413
Neg logit contributions
ogene-0.610
obi-0.596
hene-0.594
aucus-0.558
imum-0.555
Token
Token ID447
Feature activation+1.83e-01

Pos logit contributions
semble+0.944
 Crate+0.862
 Nurs+0.848
CENT+0.740
 COL+0.735
Neg logit contributions
ogene-1.294
obi-1.263
hene-1.257
aucus-1.184
imum-1.180
Token
Token ID251
Feature activation+2.71e-01

Pos logit contributions
semble+1.158
 Crate+1.014
 Nurs+0.980
+0.876
CENT+0.762
Neg logit contributions
ogene-2.919
obi-2.848
hene-2.835
anski-2.724
imum-2.723
Token[
Token ID58
Feature activation+1.27e+00

Pos logit contributions
semble+2.009
 Crate+1.833
 Nurs+1.814
 COL+1.506
CENT+1.497
Neg logit contributions
ogene-4.028
obi-3.916
hene-3.907
imum-3.723
aucus-3.718
Token7
Token ID22
Feature activation-2.62e-01

Pos logit contributions
semble+7.888
 Crate+6.269
 Nurs+6.078
+6.037
CENT+6.005
Neg logit contributions
ogene-16.885
hene-16.543
obi-15.885
aucus-15.535
anski-15.099
Token]
Token ID60
Feature activation-2.87e-01

Pos logit contributions
ogene+2.706
obi+2.699
hene+2.635
aucus+2.432
imum+2.429
Neg logit contributions
semble-3.094
 Crate-2.788
 Nurs-2.766
CENT-2.506
-2.464
Token Because
Token ID4362
Feature activation-5.26e-01

Pos logit contributions
ogene+3.386
obi+3.359
hene+3.332
aucus+3.129
ochet+3.098
Neg logit contributions
semble-2.949
 Crate-2.588
 Nurs-2.513
CENT-2.279
-2.227
Token Chevron
Token ID50083
Feature activation-5.88e-01

Pos logit contributions
ogene+5.472
obi+5.297
hene+5.277
aucus+4.916
ochet+4.904
Neg logit contributions
semble-5.963
 Crate-5.315
 Nurs-5.215
CENT-4.909
-4.727
Token and
Token ID290
Feature activation-5.24e-01

Pos logit contributions
ogene+6.176
obi+6.024
hene+5.914
anski+5.630
imum+5.534
Neg logit contributions
semble-6.593
 Crate-5.848
 Nurs-5.684
CENT-5.305
-5.094
Tokenb
Token ID65
Feature activation-6.98e-01

Pos logit contributions
ogene+2.712
obi+2.649
hene+2.624
aucus+2.442
imum+2.438
Neg logit contributions
semble-2.960
 Crate-2.711
 Nurs-2.682
CENT-2.414
 COL-2.391
Token =
Token ID796
Feature activation-8.03e-01

Pos logit contributions
ogene+6.399
obi+6.175
hene+6.067
imum+5.809
aucus+5.494
Neg logit contributions
semble-8.534
 Nurs-7.920
 Crate-7.817
 COL-7.059
CENT-7.038
Token fit
Token ID4197
Feature activation-7.44e-01

Pos logit contributions
ogene+8.625
obi+8.475
hene+8.267
imum+7.655
ochet+7.486
Neg logit contributions
semble-8.121
 Nurs-7.223
 Crate-7.038
-6.645
CENT-6.428
Token(
Token ID7
Feature activation-7.83e-01

Pos logit contributions
ogene+8.132
obi+7.916
hene+7.887
imum+7.434
ochet+7.119
Neg logit contributions
semble-7.655
 Nurs-7.009
 Crate-6.941
 prosecutors-6.191
 COL-6.039
Tokenspace
Token ID13200
Feature activation-2.45e-01

Pos logit contributions
ogene+7.665
obi+7.637
hene+7.385
imum+6.869
ochet+6.485
Neg logit contributions
semble-8.839
 Nurs-8.155
 Crate-8.057
-7.160
 prosecutors-7.091
Token);
Token ID1776
Feature activation-1.50e+00

Pos logit contributions
ogene+2.717
obi+2.661
hene+2.643
imum+2.458
aucus+2.448
Neg logit contributions
semble-2.733
 Crate-2.486
 Nurs-2.452
CENT-2.191
-2.157
Token //
Token ID3373
Feature activation-4.32e-01

Pos logit contributions
obi+13.152
ogene+12.989
hene+12.744
 ob+12.003
imum+11.525
Neg logit contributions
semble-14.022
 Nurs-12.715
 Crate-12.209
-11.881
CENT-11.699
Token 2
Token ID362
Feature activation-2.25e-01

Pos logit contributions
ogene+4.562
obi+4.504
hene+4.420
imum+4.192
ochet+4.059
Neg logit contributions
semble-4.850
 Crate-4.393
 Nurs-4.346
CENT-3.967
-3.839
Token.
Token ID13
Feature activation-7.11e-02

Pos logit contributions
ogene+2.505
obi+2.450
hene+2.436
imum+2.268
aucus+2.256
Neg logit contributions
semble-2.508
 Crate-2.282
 Nurs-2.257
CENT-2.031
 COL-1.990
Token Sl
Token ID3454
Feature activation-4.84e-02

Pos logit contributions
ogene+0.902
obi+0.884
hene+0.881
imum+0.827
aucus+0.824
Neg logit contributions
semble-0.685
 Crate-0.610
 Nurs-0.602
CENT-0.533
 COL-0.519
Tokenice
Token ID501
Feature activation-1.10e+00

Pos logit contributions
ogene+0.579
obi+0.567
hene+0.564
aucus+0.529
imum+0.526
Neg logit contributions
semble-0.505
 Crate-0.458
 Nurs-0.450
CENT-0.400
 COL-0.396
Token Is
Token ID1148
Feature activation-2.00e-02

Pos logit contributions
ogene+6.820
obi+6.718
hene+6.463
imum+6.097
anski+5.905
Neg logit contributions
semble-6.515
 Crate-6.071
 Nurs-6.037
CENT-5.261
 COL-5.211
TokenNot
Token ID3673
Feature activation-1.77e-02

Pos logit contributions
ogene+0.215
obi+0.210
hene+0.208
aucus+0.194
imum+0.193
Neg logit contributions
semble-0.233
 Crate-0.214
 Nurs-0.210
CENT-0.190
 COL-0.188
TokenValid
Token ID47139
Feature activation-1.94e-01

Pos logit contributions
ogene+0.193
obi+0.188
hene+0.187
aucus+0.175
imum+0.173
Neg logit contributions
semble-0.204
 Crate-0.187
 Nurs-0.184
CENT-0.166
 COL-0.164
Token )
Token ID1267
Feature activation-5.01e-02

Pos logit contributions
ogene+2.330
obi+2.280
hene+2.255
imum+2.124
aucus+2.116
Neg logit contributions
semble-1.994
 Crate-1.802
 Nurs-1.789
CENT-1.579
 COL-1.567
Token )
Token ID1267
Feature activation-9.33e-01

Pos logit contributions
ogene+0.550
obi+0.537
hene+0.533
aucus+0.497
imum+0.495
Neg logit contributions
semble-0.573
 Crate-0.524
 Nurs-0.516
CENT-0.465
 COL-0.459
Token return
Token ID1441
Feature activation-1.47e+00

Pos logit contributions
ogene+8.321
obi+8.211
hene+7.845
ochet+7.264
imum+7.099
Neg logit contributions
semble-10.484
 Nurs-9.810
 Crate-9.553
CENT-9.105
-8.519
Token space
Token ID2272
Feature activation-5.30e-01

Pos logit contributions
ogene+13.078
obi+12.355
hene+12.116
 ob+11.897
imum+11.173
Neg logit contributions
semble-13.584
 Nurs-12.053
 Crate-11.823
CENT-11.652
-11.114
Token;
Token ID26
Feature activation-1.26e+00

Pos logit contributions
ogene+5.253
hene+5.024
obi+5.011
imum+4.628
ochet+4.614
Neg logit contributions
semble-6.327
 Crate-5.780
 Nurs-5.698
CENT-5.232
 COL-5.041
Token //
Token ID3373
Feature activation-5.18e-01

Pos logit contributions
ogene+10.508
obi+10.123
hene+10.090
ochet+9.032
imum+9.005
Neg logit contributions
semble-13.388
 Nurs-11.915
 Crate-11.775
CENT-11.659
-11.018
Token 4
Token ID604
Feature activation-3.76e-01

Pos logit contributions
ogene+5.318
obi+5.165
hene+5.121
imum+4.715
anski+4.677
Neg logit contributions
semble-5.912
 Nurs-5.322
 Crate-5.287
CENT-4.941
-4.796
Token.
Token ID13
Feature activation+7.65e-02

Pos logit contributions
ogene+3.629
obi+3.583
hene+3.525
imum+3.273
ochet+3.260
Neg logit contributions
semble-4.548
 Crate-4.129
 Nurs-4.115
CENT-3.816
-3.753
Token same
Token ID976
Feature activation-4.98e-02

Pos logit contributions
semble+1.396
 Crate+1.284
 Nurs+1.266
 COL+1.124
CENT+1.121
Neg logit contributions
ogene-1.446
hene-1.416
obi-1.416
aucus-1.322
imum-1.312
Token dinosaurs
Token ID27199
Feature activation+2.30e-01

Pos logit contributions
ogene+0.582
obi+0.568
hene+0.565
aucus+0.528
imum+0.526
Neg logit contributions
semble-0.534
 Crate-0.485
 Nurs-0.477
CENT-0.427
 COL-0.420
Token.
Token ID13
Feature activation-8.70e-01

Pos logit contributions
semble+2.377
 Crate+2.146
 Nurs+2.113
 COL+1.875
CENT+1.872
Neg logit contributions
ogene-2.735
obi-2.681
hene-2.661
aucus-2.526
anski-2.498
Token "
Token ID366
Feature activation-8.87e-01

Pos logit contributions
hene+8.984
ogene+8.975
obi+8.916
aucus+8.506
imum+8.330
Neg logit contributions
semble-8.676
 Crate-7.367
 Nurs-7.166
-6.705
 prosecutors-6.689
Token Comp
Token ID3082
Feature activation-3.58e-01

Pos logit contributions
obi+8.001
ogene+7.798
hene+7.639
aucus+7.437
imum+7.261
Neg logit contributions
semble-10.080
 Crate-9.107
 Nurs-9.011
 prosecutors-8.387
 decoration-8.153
Tokenaring
Token ID1723
Feature activation-1.35e+00

Pos logit contributions
ogene+3.103
obi+3.020
hene+3.017
imum+2.740
aucus+2.733
Neg logit contributions
semble-4.818
 Crate-4.491
 Nurs-4.475
CENT-4.046
-4.038
Token such
Token ID884
Feature activation-5.98e-01

Pos logit contributions
ogene+13.145
obi+12.477
hene+11.900
imum+11.672
aucus+11.648
Neg logit contributions
semble-11.933
 Crate-11.179
 Nurs-11.026
CENT-10.368
Cola-10.134
Token different
Token ID1180
Feature activation-5.30e-01

Pos logit contributions
ogene+6.899
obi+6.682
hene+6.544
imum+6.167
anski+6.135
Neg logit contributions
semble-5.964
 Crate-5.531
 Nurs-5.366
CENT-4.881
-4.854
Token molecules
Token ID17745
Feature activation-2.15e-01

Pos logit contributions
ogene+6.426
obi+6.225
hene+6.133
imum+5.761
anski+5.739
Neg logit contributions
semble-5.170
 Crate-4.782
 Nurs-4.629
CENT-4.173
-4.106
Token as
Token ID355
Feature activation-3.86e-01

Pos logit contributions
ogene+2.450
obi+2.401
hene+2.375
imum+2.207
aucus+2.193
Neg logit contributions
semble-2.349
 Crate-2.147
 Nurs-2.113
CENT-1.914
-1.885
Token minerals
Token ID21622
Feature activation+1.48e-01

Pos logit contributions
ogene+4.471
obi+4.372
hene+4.285
imum+4.023
aucus+4.018
Neg logit contributions
semble-4.056
 Nurs-3.647
 Crate-3.646
CENT-3.275
-3.264
Token terminate
Token ID23654
Feature activation-5.33e-01

Pos logit contributions
ogene+6.354
obi+6.202
hene+6.113
imum+5.751
ochet+5.662
Neg logit contributions
semble-6.038
 Crate-5.455
 Nurs-5.364
-4.906
CENT-4.893
Token if
Token ID611
Feature activation-1.21e+00

Pos logit contributions
ogene+5.945
hene+5.662
obi+5.634
ochet+5.371
imum+5.299
Neg logit contributions
semble-5.527
 Crate-5.053
 Nurs-5.015
CENT-4.630
-4.449
Token any
Token ID597
Feature activation-6.89e-01

Pos logit contributions
ogene+11.685
obi+11.114
hene+10.983
ochet+9.992
imum+9.854
Neg logit contributions
semble-11.822
 Nurs-10.844
 Crate-10.574
CENT-10.308
ctory-9.655
Token are
Token ID389
Feature activation-1.83e-01

Pos logit contributions
ogene+7.696
hene+7.246
obi+7.184
ochet+6.730
imum+6.641
Neg logit contributions
semble-7.078
 Nurs-6.293
 Crate-6.247
CENT-5.761
-5.646
Token not
Token ID407
Feature activation-7.84e-01

Pos logit contributions
ogene+2.144
obi+2.079
hene+2.071
imum+1.934
anski+1.909
Neg logit contributions
semble-1.937
 Crate-1.758
 Nurs-1.726
CENT-1.547
 COL-1.515
Token valid
Token ID4938
Feature activation-1.35e+00

Pos logit contributions
ogene+8.114
hene+7.925
obi+7.798
imum+7.298
ochet+7.234
Neg logit contributions
semble-8.080
 Crate-7.297
 Nurs-7.290
CENT-6.709
-6.572
Token //
Token ID3373
Feature activation-5.96e-01

Pos logit contributions
ogene+12.637
obi+12.499
hene+11.686
ochet+10.965
 ob+10.807
Neg logit contributions
semble-12.035
 Nurs-11.414
 Crate-10.895
CENT-10.622
-10.202
Token This
Token ID770
Feature activation-2.00e-01

Pos logit contributions
ogene+6.125
obi+5.914
hene+5.852
imum+5.401
anski+5.388
Neg logit contributions
semble-6.672
 Nurs-6.064
 Crate-5.949
CENT-5.594
-5.536
Token is
Token ID318
Feature activation+7.23e-02

Pos logit contributions
ogene+2.364
obi+2.302
hene+2.299
imum+2.151
aucus+2.132
Neg logit contributions
semble-2.086
 Crate-1.878
 Nurs-1.860
CENT-1.685
-1.634
Token the
Token ID262
Feature activation-3.48e-02

Pos logit contributions
semble+0.767
 Crate+0.698
 Nurs+0.686
CENT+0.610
 COL+0.605
Neg logit contributions
ogene-0.855
obi-0.835
hene-0.830
aucus-0.779
imum-0.774
Token base
Token ID2779
Feature activation-4.02e-01

Pos logit contributions
ogene+0.424
obi+0.415
hene+0.413
imum+0.387
aucus+0.385
Neg logit contributions
semble-0.354
 Crate-0.319
 Nurs-0.314
CENT-0.280
 COL-0.274
Token weakness
Token ID10453
Feature activation-6.39e-01

Pos logit contributions
ogene+2.617
obi+2.570
hene+2.556
imum+2.343
aucus+2.342
Neg logit contributions
semble-2.639
 Crate-2.391
 Nurs-2.372
CENT-2.160
-2.087
Token.
Token ID13
Feature activation-3.07e-01

Pos logit contributions
ogene+6.387
obi+6.306
hene+6.109
imum+5.738
ochet+5.717
Neg logit contributions
semble-7.347
 Crate-6.644
 Nurs-6.572
CENT-6.027
-5.816
Token As
Token ID1081
Feature activation-2.09e-01

Pos logit contributions
obi+3.528
ogene+3.520
hene+3.497
imum+3.261
aucus+3.234
Neg logit contributions
semble-3.201
 Crate-2.783
 Nurs-2.727
CENT-2.494
-2.474
Token its
Token ID663
Feature activation-2.36e-01

Pos logit contributions
ogene+2.338
obi+2.305
hene+2.287
imum+2.125
aucus+2.112
Neg logit contributions
semble-2.259
 Crate-2.084
 Nurs-2.064
CENT-1.858
-1.824
Token name
Token ID1438
Feature activation-1.16e+00

Pos logit contributions
ogene+2.976
obi+2.912
hene+2.891
imum+2.706
aucus+2.703
Neg logit contributions
semble-2.266
 Crate-2.028
 Nurs-2.009
CENT-1.795
-1.745
Token would
Token ID561
Feature activation-1.34e+00

Pos logit contributions
obi+9.595
ogene+9.463
hene+8.632
aucus+8.490
anski+8.309
Neg logit contributions
semble-13.142
 Crate-12.465
 Nurs-11.900
CENT-11.653
-10.942
Token suggest
Token ID1950
Feature activation-1.43e-01

Pos logit contributions
obi+10.924
ogene+10.639
hene+9.636
ochet+9.566
aucus+9.305
Neg logit contributions
 Crate-13.840
semble-13.374
 Nurs-12.570
CENT-12.470
-12.021
Token,
Token ID11
Feature activation-2.51e-01

Pos logit contributions
ogene+1.677
obi+1.639
hene+1.628
aucus+1.525
imum+1.520
Neg logit contributions
semble-1.525
 Crate-1.385
 Nurs-1.363
CENT-1.218
 COL-1.202
Token this
Token ID428
Feature activation-1.97e-01

Pos logit contributions
ogene+2.846
obi+2.786
hene+2.767
aucus+2.582
imum+2.582
Neg logit contributions
semble-2.730
 Crate-2.481
 Nurs-2.449
CENT-2.200
 COL-2.151
Token is
Token ID318
Feature activation-1.04e-01

Pos logit contributions
ogene+2.205
obi+2.160
hene+2.132
imum+1.993
aucus+1.984
Neg logit contributions
semble-2.185
 Crate-1.983
 Nurs-1.952
CENT-1.779
-1.739
Token one
Token ID530
Feature activation+1.59e-02

Pos logit contributions
ogene+1.223
obi+1.198
hene+1.189
aucus+1.113
imum+1.110
Neg logit contributions
semble-1.105
 Crate-1.002
 Nurs-0.987
CENT-0.880
 COL-0.867
Token (
Token ID357
Feature activation-1.00e+00

Pos logit contributions
ogene+4.563
obi+4.449
hene+4.448
imum+4.162
ochet+4.094
Neg logit contributions
semble-4.504
 Crate-4.059
 Nurs-4.024
CENT-3.667
-3.547
Tokenpart
Token ID3911
Feature activation-4.15e-01

Pos logit contributions
obi+9.909
ogene+9.759
hene+9.396
imum+8.837
ochet+8.450
Neg logit contributions
semble-10.521
 Nurs-9.632
 Crate-9.134
CENT-8.906
-8.399
Token in
Token ID287
Feature activation-6.99e-01

Pos logit contributions
ogene+4.636
obi+4.608
hene+4.532
imum+4.243
aucus+4.142
Neg logit contributions
semble-4.485
 Crate-4.069
 Nurs-4.032
CENT-3.671
 COL-3.550
Token parts
Token ID3354
Feature activation-2.17e-01

Pos logit contributions
obi+6.474
ogene+6.449
hene+6.149
imum+5.544
anski+5.493
Neg logit contributions
semble-8.274
 Nurs-7.610
 Crate-7.581
CENT-7.256
-6.926
Token)
Token ID8
Feature activation-5.37e-01

Pos logit contributions
ogene+2.514
obi+2.470
hene+2.449
imum+2.288
aucus+2.279
Neg logit contributions
semble-2.328
 Crate-2.123
 Nurs-2.079
CENT-1.872
 COL-1.828
Token return
Token ID1441
Feature activation-1.34e+00

Pos logit contributions
ogene+5.117
obi+5.013
hene+4.895
imum+4.483
ochet+4.467
Neg logit contributions
semble-6.563
 Crate-6.034
 Nurs-5.974
CENT-5.593
-5.335
Token ob
Token ID909
Feature activation-6.92e-02

Pos logit contributions
ogene+12.289
obi+12.152
hene+11.623
 ob+11.093
ochet+10.713
Neg logit contributions
semble-12.148
 Nurs-11.402
 Crate-11.009
CENT-10.666
-10.464
Tokenb
Token ID65
Feature activation-1.05e+00

Pos logit contributions
ogene+0.746
obi+0.731
hene+0.722
aucus+0.672
imum+0.671
Neg logit contributions
semble-0.804
 Crate-0.739
 Nurs-0.730
CENT-0.658
 COL-0.650
Token_
Token ID62
Feature activation-2.70e-01

Pos logit contributions
ogene+10.479
obi+10.151
hene+10.040
ochet+9.594
imum+9.531
Neg logit contributions
semble-10.571
 Crate-9.757
 Nurs-9.628
CENT-8.863
 prosecutors-8.445
Tokensub
Token ID7266
Feature activation+2.22e-01

Pos logit contributions
ogene+2.310
obi+2.238
hene+2.220
imum+2.021
aucus+2.003
Neg logit contributions
semble-3.652
 Crate-3.450
 Nurs-3.403
CENT-3.109
 COL-3.102
Tokendiv
Token ID7146
Feature activation+1.22e-01

Pos logit contributions
semble+1.679
 Crate+1.377
 Nurs+1.355
CENT+1.138
 COL+1.123
Neg logit contributions
ogene-3.300
obi-3.260
hene-3.233
aucus-3.095
ochet-3.062
Token.
Token ID13
Feature activation-8.61e-01

Pos logit contributions
ogene+3.856
obi+3.776
hene+3.707
ochet+3.432
aucus+3.426
Neg logit contributions
semble-4.119
 Crate-3.774
 Nurs-3.721
CENT-3.385
-3.276
Token If
Token ID1002
Feature activation-6.80e-01

Pos logit contributions
obi+8.792
hene+8.542
ogene+8.492
aucus+8.070
ochet+7.993
Neg logit contributions
semble-9.002
 Crate-7.462
 Nurs-7.199
 prosecutors-7.002
CENT-6.829
Token you
Token ID345
Feature activation-7.18e-01

Pos logit contributions
ogene+7.056
obi+7.008
hene+6.912
aucus+6.497
anski+6.440
Neg logit contributions
semble-7.335
 Crate-6.723
 Nurs-6.553
CENT-5.969
-5.803
Token were
Token ID547
Feature activation-6.04e-01

Pos logit contributions
obi+7.057
ogene+7.009
hene+7.000
anski+6.775
ochet+6.681
Neg logit contributions
semble-7.775
 Crate-7.119
 Nurs-7.050
CENT-6.489
-6.272
Token to
Token ID284
Feature activation-6.03e-01

Pos logit contributions
ogene+6.082
obi+5.807
hene+5.775
anski+5.436
imum+5.389
Neg logit contributions
semble-6.816
 Crate-6.327
 Nurs-6.134
CENT-5.695
izable-5.368
Token compare
Token ID8996
Feature activation-1.34e+00

Pos logit contributions
ogene+6.395
obi+6.311
hene+6.099
anski+6.072
ochet+5.881
Neg logit contributions
semble-6.227
 Crate-5.838
 Nurs-5.734
CENT-5.240
-5.070
Token them
Token ID606
Feature activation-6.18e-01

Pos logit contributions
ogene+10.895
obi+10.794
hene+10.560
aucus+10.264
ochet+10.093
Neg logit contributions
semble-13.568
 Crate-12.502
 Nurs-12.200
CENT-11.495
-10.977
Token at
Token ID379
Feature activation-7.84e-01

Pos logit contributions
ogene+6.345
obi+6.247
hene+6.158
imum+5.787
aucus+5.777
Neg logit contributions
semble-6.744
 Crate-6.231
 Nurs-6.139
CENT-5.623
-5.477
Token lower
Token ID2793
Feature activation-8.02e-01

Pos logit contributions
ogene+7.269
obi+7.221
hene+7.021
aucus+6.660
anski+6.624
Neg logit contributions
semble-9.028
 Crate-8.127
 Nurs-7.981
CENT-7.547
-7.215
Token refresh
Token ID14976
Feature activation-4.69e-01

Pos logit contributions
obi+7.771
ogene+7.708
hene+7.546
aucus+7.090
anski+6.958
Neg logit contributions
semble-8.948
 Crate-8.211
 Nurs-8.009
CENT-7.378
-7.026
Token rates
Token ID3965
Feature activation-6.29e-01

Pos logit contributions
obi+5.138
ogene+5.019
hene+4.973
imum+4.734
aucus+4.685
Neg logit contributions
semble-5.150
 Crate-4.717
 Nurs-4.561
CENT-3.972
-3.915
Token the
Token ID262
Feature activation-4.85e-01

Pos logit contributions
ogene+10.448
hene+10.337
obi+10.182
imum+9.575
ochet+9.428
Neg logit contributions
semble-11.029
 Nurs-9.768
 Crate-9.629
-9.141
CENT-9.018
Token space
Token ID2272
Feature activation-1.18e+00

Pos logit contributions
ogene+5.227
hene+5.066
obi+5.058
imum+4.694
ochet+4.556
Neg logit contributions
semble-5.367
 Nurs-4.757
 Crate-4.733
CENT-4.350
-4.331
Token along
Token ID1863
Feature activation-1.42e-01

Pos logit contributions
ogene+11.476
obi+11.311
hene+11.241
imum+10.345
ochet+9.979
Neg logit contributions
semble-11.192
 Nurs-9.967
 Crate-9.639
-9.326
CENT-9.100
Token the
Token ID262
Feature activation-2.31e-01

Pos logit contributions
ogene+1.601
obi+1.563
hene+1.550
imum+1.443
aucus+1.443
Neg logit contributions
semble-1.576
 Crate-1.437
 Nurs-1.418
CENT-1.273
 COL-1.251
Token shorter
Token ID12238
Feature activation-3.40e-01

Pos logit contributions
ogene+2.813
obi+2.746
hene+2.736
imum+2.544
anski+2.517
Neg logit contributions
semble-2.348
 Crate-2.106
 Nurs-2.083
CENT-1.853
-1.816
Token axis
Token ID16488
Feature activation-1.32e+00

Pos logit contributions
ogene+4.136
obi+4.056
hene+4.010
imum+3.723
anski+3.671
Neg logit contributions
semble-3.432
 Crate-3.048
 Nurs-3.034
CENT-2.713
-2.620
Token of
Token ID286
Feature activation-2.98e-01

Pos logit contributions
ogene+12.444
obi+12.093
hene+11.467
imum+10.898
 ob+10.778
Neg logit contributions
semble-12.600
 Nurs-11.276
 Crate-11.086
CENT-10.527
-10.467
Token the
Token ID262
Feature activation-3.45e-01

Pos logit contributions
ogene+3.245
obi+3.156
hene+3.121
imum+2.918
ochet+2.863
Neg logit contributions
semble-3.390
 Crate-3.078
 Nurs-3.048
CENT-2.755
-2.693
Token O
Token ID440
Feature activation+5.81e-02

Pos logit contributions
ogene+3.974
hene+3.857
obi+3.844
imum+3.570
anski+3.503
Neg logit contributions
semble-3.668
 Crate-3.274
 Nurs-3.265
CENT-2.942
-2.888
TokenBB
Token ID15199
Feature activation-2.00e-01

Pos logit contributions
semble+0.732
 Crate+0.676
 Nurs+0.663
CENT+0.607
 COL+0.603
Neg logit contributions
ogene-0.566
obi-0.549
hene-0.546
aucus-0.511
imum-0.505
Token let
Token ID1309
Feature activation-8.50e-01

Pos logit contributions
ogene+2.392
obi+2.330
hene+2.323
imum+2.172
aucus+2.158
Neg logit contributions
semble-2.062
 Crate-1.846
 Nurs-1.838
CENT-1.639
 COL-1.597
Token tissue
Token ID10712
Feature activation+3.40e-02

Pos logit contributions
obi+6.485
ogene+6.462
hene+6.304
imum+5.778
anski+5.747
Neg logit contributions
semble-6.162
 Crate-5.629
 Nurs-5.445
CENT-5.081
-4.816
Token separating
Token ID27259
Feature activation-2.86e-02

Pos logit contributions
semble+0.372
 Crate+0.339
 Nurs+0.334
CENT+0.299
 COL+0.297
Neg logit contributions
ogene-0.389
obi-0.380
hene-0.378
aucus-0.355
imum-0.353
Token the
Token ID262
Feature activation-2.19e-01

Pos logit contributions
ogene+0.332
obi+0.325
hene+0.323
aucus+0.302
imum+0.301
Neg logit contributions
semble-0.310
 Crate-0.282
 Nurs-0.278
CENT-0.248
 COL-0.246
Token living
Token ID2877
Feature activation-1.94e-01

Pos logit contributions
ogene+2.614
obi+2.548
hene+2.530
aucus+2.364
imum+2.350
Neg logit contributions
semble-2.277
 Crate-2.054
 Nurs-2.014
CENT-1.823
-1.770
Token from
Token ID422
Feature activation-1.10e+00

Pos logit contributions
ogene+2.224
obi+2.178
hene+2.163
aucus+2.016
imum+2.003
Neg logit contributions
semble-2.106
 Crate-1.914
 Nurs-1.883
CENT-1.690
 COL-1.652
Token the
Token ID262
Feature activation-1.32e+00

Pos logit contributions
ogene+8.971
obi+8.513
hene+8.370
aucus+7.832
anski+7.691
Neg logit contributions
semble-13.347
 Crate-12.232
 Nurs-12.100
CENT-11.697
 COL-10.779
Token dead
Token ID2636
Feature activation-2.75e-01

Pos logit contributions
ogene+10.945
obi+10.888
hene+10.625
aucus+9.938
anski+9.769
Neg logit contributions
semble-14.407
 Crate-12.830
 Nurs-12.823
CENT-12.611
 COL-11.130
Token most
Token ID749
Feature activation-1.09e-01

Pos logit contributions
ogene+3.059
obi+2.999
hene+2.969
aucus+2.750
imum+2.732
Neg logit contributions
semble-3.052
 Crate-2.783
 Nurs-2.727
CENT-2.462
 COL-2.400
Token unmist
Token ID40016
Feature activation-5.79e-01

Pos logit contributions
ogene+1.219
obi+1.193
hene+1.181
aucus+1.099
imum+1.095
Neg logit contributions
semble-1.223
 Crate-1.116
 Nurs-1.099
CENT-0.989
 COL-0.973
Tokenak
Token ID461
Feature activation-3.15e-02

Pos logit contributions
ogene+5.520
obi+5.470
hene+5.294
anski+4.894
imum+4.787
Neg logit contributions
semble-7.027
 Nurs-6.699
 Crate-6.681
CENT-6.108
-5.948
Tokenably
Token ID1346
Feature activation-1.59e-01

Pos logit contributions
ogene+0.292
obi+0.284
hene+0.281
aucus+0.258
imum+0.258
Neg logit contributions
semble-0.414
 Crate-0.384
 Nurs-0.379
CENT-0.347
 COL-0.344
Token meal
Token ID9799
Feature activation+3.15e-02

Pos logit contributions
ogene+0.704
obi+0.689
hene+0.684
aucus+0.638
imum+0.636
Neg logit contributions
semble-0.649
 Crate-0.588
 Nurs-0.578
CENT-0.519
 COL-0.511
Token for
Token ID329
Feature activation-2.89e-01

Pos logit contributions
semble+0.326
 Crate+0.296
 Nurs+0.290
CENT+0.258
 COL+0.255
Neg logit contributions
ogene-0.380
obi-0.371
hene-0.369
aucus-0.347
imum-0.345
Token two
Token ID734
Feature activation-2.41e-01

Pos logit contributions
ogene+3.182
obi+3.075
hene+3.053
aucus+2.868
anski+2.860
Neg logit contributions
semble-3.230
 Crate-2.894
 Nurs-2.832
CENT-2.602
-2.554
Token on
Token ID319
Feature activation-1.78e-01

Pos logit contributions
ogene+2.621
obi+2.542
hene+2.522
anski+2.341
aucus+2.337
Neg logit contributions
semble-2.729
 Crate-2.484
 Nurs-2.412
CENT-2.225
-2.199
Token Z
Token ID1168
Feature activation-3.28e-01

Pos logit contributions
ogene+2.004
obi+1.962
hene+1.941
aucus+1.803
imum+1.803
Neg logit contributions
semble-1.968
 Crate-1.782
 Nurs-1.750
CENT-1.588
 COL-1.557
Tokenom
Token ID296
Feature activation-1.31e+00

Pos logit contributions
ogene+2.825
obi+2.771
hene+2.679
imum+2.477
aucus+2.450
Neg logit contributions
semble-4.513
 Crate-4.221
 Nurs-4.159
CENT-3.836
 COL-3.809
Tokenato
Token ID5549
Feature activation-3.78e-01

Pos logit contributions
ogene+6.031
obi+5.579
ochet+4.336
hene+4.319
aroo+4.178
Neg logit contributions
 Nurs-19.330
 Crate-19.272
semble-19.158
[undecodable bytes]-19.121
 COL-18.338
Token comes
Token ID2058
Feature activation+4.21e-02

Pos logit contributions
ogene+4.003
obi+3.975
hene+3.898
imum+3.608
anski+3.602
Neg logit contributions
semble-4.374
 Crate-3.924
 Nurs-3.821
CENT-3.541
-3.467
Token at
Token ID379
Feature activation-3.13e-01

Pos logit contributions
semble+0.444
 Crate+0.402
 Nurs+0.395
CENT+0.352
 COL+0.349
Neg logit contributions
ogene-0.501
obi-0.490
hene-0.488
aucus-0.458
imum-0.456
Token Rs
Token ID12820
Feature activation-4.02e-01

Pos logit contributions
ogene+3.270
obi+3.186
hene+3.167
aucus+2.942
imum+2.937
Neg logit contributions
semble-3.652
 Crate-3.304
 Nurs-3.254
CENT-2.994
-2.905
Token100
Token ID3064
Feature activation+1.81e-02

Pos logit contributions
ogene+4.100
obi+4.033
hene+3.935
aucus+3.697
imum+3.670
Neg logit contributions
semble-4.720
 Crate-4.302
 Nurs-4.196
CENT-3.821
 prosecutors-3.743
Tokenorter
Token ID4337
Feature activation-6.27e-01

Pos logit contributions
semble+1.007
 Crate+0.912
 Nurs+0.899
CENT+0.811
 COL+0.802
Neg logit contributions
ogene-0.998
obi-0.975
hene-0.969
aucus-0.916
imum-0.896
TokenAx
Token ID31554
Feature activation+7.60e-02

Pos logit contributions
ogene+6.870
obi+6.732
hene+6.681
imum+6.236
anski+6.111
Neg logit contributions
semble-6.653
 Nurs-6.052
 Crate-6.045
 COL-5.280
 prosecutors-5.248
Tokenis
Token ID271
Feature activation-5.59e-01

Pos logit contributions
semble+0.527
 Crate+0.451
 Nurs+0.438
CENT+0.365
 COL+0.355
Neg logit contributions
ogene-1.174
obi-1.157
hene-1.150
aucus-1.101
anski-1.092
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
ogene+5.949
hene+5.856
obi+5.828
imum+5.443
anski+5.340
Neg logit contributions
semble-6.154
 Crate-5.632
 Nurs-5.516
 prosecutors-4.857
-4.842
Token space
Token ID2272
Feature activation-3.18e-01

Pos logit contributions
ogene+10.101
obi+10.064
hene+9.544
imum+8.903
anski+8.579
Neg logit contributions
semble-10.309
 Crate-9.497
 Nurs-9.430
-8.250
 prosecutors-8.141
Token );
Token ID5619
Feature activation-1.30e+00

Pos logit contributions
ogene+3.419
obi+3.342
hene+3.329
imum+3.079
aucus+3.069
Neg logit contributions
semble-3.667
 Crate-3.328
 Nurs-3.279
CENT-2.922
-2.896
Token //
Token ID3373
Feature activation-4.14e-01

Pos logit contributions
obi+11.706
ogene+11.438
hene+11.258
imum+10.285
 ob+10.269
Neg logit contributions
semble-12.407
 Nurs-11.615
 Crate-11.344
CENT-10.854
-10.724
Token 3
Token ID513
Feature activation+6.54e-03

Pos logit contributions
ogene+4.253
hene+4.122
obi+4.103
imum+3.779
ochet+3.732
Neg logit contributions
semble-4.784
 Crate-4.423
 Nurs-4.395
CENT-3.967
-3.914
Token.
Token ID13
Feature activation+4.02e-02

Pos logit contributions
semble+0.073
 Crate+0.067
 Nurs+0.066
CENT+0.059
 COL+0.058
Neg logit contributions
ogene-0.073
obi-0.072
hene-0.071
aucus-0.066
imum-0.066
Token Check
Token ID6822
Feature activation-7.13e-01

Pos logit contributions
semble+0.378
 Crate+0.339
 Nurs+0.332
CENT+0.291
 COL+0.287
Neg logit contributions
ogene-0.523
obi-0.513
hene-0.510
aucus-0.481
imum-0.479
Token validity
Token ID19648
Feature activation-7.60e-01

Pos logit contributions
ogene+6.984
obi+6.773
hene+6.631
imum+6.359
ochet+6.137
Neg logit contributions
semble-8.020
 Nurs-7.328
 Crate-7.265
CENT-6.880
-6.566
Token )
Token ID1267
Feature activation-5.36e-01

Pos logit contributions
obi+5.520
ogene+5.463
hene+5.311
aucus+4.882
imum+4.800
Neg logit contributions
semble-8.980
 Crate-8.032
 Nurs-7.960
CENT-7.467
 prosecutors-7.425
Token )
Token ID1267
Feature activation-6.84e-01

Pos logit contributions
obi+4.616
ogene+4.601
hene+4.524
imum+4.129
aucus+4.097
Neg logit contributions
semble-6.977
 Nurs-6.277
 Crate-6.234
-5.838
CENT-5.775
Token =
Token ID796
Feature activation-4.47e-01

Pos logit contributions
obi+5.333
ogene+5.248
hene+5.155
imum+4.870
aucus+4.615
Neg logit contributions
semble-9.253
 Nurs-8.230
 Crate-8.146
[undecodable bytes]-8.017
 hallmark-7.697
Token 0
Token ID657
Feature activation-9.81e-01

Pos logit contributions
obi+3.468
ogene+3.448
hene+3.390
imum+3.095
ochet+3.007
Neg logit contributions
semble-6.340
 Crate-5.685
 Nurs-5.636
CENT-5.382
-5.301
Token.
Token ID13
Feature activation-6.66e-02

Pos logit contributions
obi+7.375
ogene+7.129
hene+7.007
imum+6.518
ochet+6.447
Neg logit contributions
semble-12.637
[undecodable bytes]-11.943
etheless-10.993
 Nurs-10.771
 Crate-10.758
Token1
Token ID16
Feature activation-1.30e+00

Pos logit contributions
ogene+0.945
obi+0.933
hene+0.924
imum+0.877
aucus+0.875
Neg logit contributions
semble-0.547
 Crate-0.479
 Nurs-0.468
CENT-0.399
 COL-0.389
Token //
Token ID3373
Feature activation-6.55e-01

Pos logit contributions
hene+6.292
obi+6.158
imum+5.805
ogene+5.803
ochet+5.690
Neg logit contributions
semble-18.273
[undecodable bytes]-16.542
 Nurs-16.090
 Crate-15.841
iences-15.483
Token metres
Token ID18985
Feature activation-7.52e-01

Pos logit contributions
hene+6.801
ogene+6.686
obi+6.677
imum+6.373
aucus+6.159
Neg logit contributions
semble-7.331
 Crate-6.616
 Nurs-6.227
-5.748
CENT-5.709
Token _
Token ID4808
Feature activation-4.01e-01

Pos logit contributions
obi+6.838
ogene+6.835
hene+6.678
imum+6.264
ochet+6.183
Neg logit contributions
semble-9.179
 Nurs-7.931
 Crate-7.838
CENT-7.456
 prosecutors-7.195
TokenP
Token ID47
Feature activation-1.41e-01

Pos logit contributions
ogene+3.466
obi+3.454
hene+3.358
ochet+3.110
imum+3.084
Neg logit contributions
semble-5.351
 Nurs-4.820
 Crate-4.805
 prosecutors-4.445
 hallmark-4.326
Tokenulse
Token ID9615
Feature activation+1.52e-02

Pos logit contributions
ogene+1.028
hene+0.995
obi+0.992
imum+0.890
ochet+0.876
Neg logit contributions
semble-2.124
 Crate-2.003
 Nurs-1.992
 COL-1.829
CENT-1.828
Token strong
Token ID1913
Feature activation+2.65e-02

Pos logit contributions
semble+0.359
 Crate+0.327
 Nurs+0.322
CENT+0.288
 COL+0.285
Neg logit contributions
ogene-0.372
obi-0.363
hene-0.361
aucus-0.337
imum-0.336
Token feelings
Token ID7666
Feature activation-2.12e-01

Pos logit contributions
semble+0.269
 Crate+0.244
 Nurs+0.240
CENT+0.212
 COL+0.210
Neg logit contributions
ogene-0.325
obi-0.317
hene-0.316
aucus-0.297
imum-0.296
Token one
Token ID530
Feature activation+3.38e-01

Pos logit contributions
ogene+2.262
obi+2.234
hene+2.183
aucus+2.035
imum+2.017
Neg logit contributions
semble-2.473
 Crate-2.252
 Nurs-2.221
CENT-2.005
-1.969
Token way
Token ID835
Feature activation-1.85e-01

Pos logit contributions
semble+3.516
 Crate+3.210
 Nurs+3.099
 COL+2.783
CENT+2.752
Neg logit contributions
ogene-3.979
hene-3.868
obi-3.866
anski-3.639
aucus-3.637
Token or
Token ID393
Feature activation-7.52e-01

Pos logit contributions
ogene+1.916
obi+1.872
hene+1.854
aucus+1.722
imum+1.713
Neg logit contributions
semble-2.231
 Crate-2.048
 Nurs-2.021
CENT-1.836
 COL-1.809
Token the
Token ID262
Feature activation-1.29e+00

Pos logit contributions
ogene+6.819
obi+6.705
hene+6.391
anski+6.091
imum+6.014
Neg logit contributions
semble-9.224
 Crate-8.452
 Nurs-8.324
CENT-7.733
 COL-7.538
Token other
Token ID584
Feature activation+9.01e-02

Pos logit contributions
obi+6.683
ogene+6.352
hene+5.429
imum+4.867
anski+4.726
Neg logit contributions
semble-19.161
 Crate-17.449
 Nurs-17.068
CENT-17.044
ctory-15.728
Token because
Token ID780
Feature activation-1.65e-01

Pos logit contributions
semble+0.955
 Crate+0.869
 Nurs+0.854
CENT+0.762
 COL+0.757
Neg logit contributions
ogene-1.059
hene-1.033
obi-1.032
aucus-0.969
imum-0.964
Token of
Token ID286
Feature activation+2.02e-01

Pos logit contributions
ogene+1.925
obi+1.880
hene+1.863
aucus+1.749
imum+1.741
Neg logit contributions
semble-1.764
 Crate-1.597
 Nurs-1.572
CENT-1.414
 COL-1.387
Token their
Token ID511
Feature activation+1.04e-02

Pos logit contributions
semble+2.026
 Crate+1.863
 Nurs+1.823
 COL+1.608
CENT+1.568
Neg logit contributions
ogene-2.461
hene-2.417
obi-2.411
aucus-2.263
anski-2.261
Token taste
Token ID6938
Feature activation-4.15e-01

Pos logit contributions
semble+0.109
 Crate+0.098
 Nurs+0.097
CENT+0.086
 COL+0.085
Neg logit contributions
ogene-0.124
obi-0.122
hene-0.121
aucus-0.113
imum-0.113
Token of
Token ID286
Feature activation-3.66e-01

Pos logit contributions
ogene+7.156
obi+7.025
hene+6.747
aucus+6.475
imum+6.422
Neg logit contributions
semble-7.641
 Crate-6.976
 Nurs-6.695
CENT-6.225
-5.998
Token course
Token ID1781
Feature activation-5.62e-01

Pos logit contributions
ogene+4.874
obi+4.807
hene+4.723
aucus+4.533
anski+4.499
Neg logit contributions
semble-3.220
 Crate-2.869
 Nurs-2.776
CENT-2.473
-2.356
Token directly
Token ID3264
Feature activation-7.11e-01

Pos logit contributions
ogene+5.856
obi+5.734
hene+5.593
aucus+5.289
ochet+5.232
Neg logit contributions
semble-6.260
 Crate-5.806
 Nurs-5.644
CENT-5.200
-5.027
Token against
Token ID1028
Feature activation-6.27e-01

Pos logit contributions
obi+7.209
ogene+7.145
hene+6.858
ochet+6.591
aucus+6.501
Neg logit contributions
semble-7.793
 Crate-7.323
 Nurs-7.102
CENT-6.469
-6.400
Token the
Token ID262
Feature activation-5.72e-01

Pos logit contributions
ogene+6.383
obi+6.314
hene+6.006
aucus+5.825
anski+5.804
Neg logit contributions
semble-7.113
 Crate-6.390
 Nurs-6.283
CENT-5.870
-5.658
Token only
Token ID691
Feature activation-1.29e+00

Pos logit contributions
ogene+6.557
obi+6.382
hene+6.221
aucus+6.009
anski+5.964
Neg logit contributions
semble-5.840
 Crate-5.240
 Nurs-5.042
CENT-4.741
-4.499
Token other
Token ID584
Feature activation-1.17e+00

Pos logit contributions
ogene+12.748
obi+12.269
hene+11.735
aucus+11.265
anski+11.251
Neg logit contributions
semble-12.166
 Crate-10.755
 Nurs-10.354
CENT-10.100
iences-9.112
Token 144
Token ID20224
Feature activation-2.63e-01

Pos logit contributions
ogene+11.761
obi+11.517
hene+10.922
aucus+10.580
anski+10.525
Neg logit contributions
semble-11.355
 Crate-9.897
 Nurs-9.659
CENT-9.274
-8.574
TokenHz
Token ID7399
Feature activation-9.05e-01

Pos logit contributions
ogene+2.261
obi+2.221
hene+2.183
aucus+2.000
imum+1.983
Neg logit contributions
semble-3.600
 Crate-3.344
 Nurs-3.289
CENT-3.014
 COL-2.978
Token IPS
Token ID37632
Feature activation-5.85e-01

Pos logit contributions
ogene+9.385
obi+9.356
hene+8.967
aucus+8.670
ochet+8.385
Neg logit contributions
semble-9.147
 Crate-8.213
 Nurs-7.900
CENT-7.363
-6.909
Token screen
Token ID3159
Feature activation-5.66e-01

Pos logit contributions
obi+6.355
ogene+6.287
hene+6.139
aucus+5.859
anski+5.685
Neg logit contributions
semble-6.287
 Crate-5.777
 Nurs-5.621
CENT-5.012
-4.846
Tokenara
Token ID3301
Feature activation+2.71e-01

Pos logit contributions
ogene+3.449
obi+3.374
hene+3.304
imum+3.052
aucus+3.046
Neg logit contributions
semble-4.576
 Crate-4.286
 Nurs-4.255
CENT-3.840
 COL-3.833
Tokenî
Token ID34803
Feature activation-4.55e-01

Pos logit contributions
semble+2.474
 Crate+2.145
 Nurs+2.145
CENT+1.883
 COL+1.849
Neg logit contributions
ogene-3.573
obi-3.508
hene-3.486
aucus-3.336
anski-3.258
Tokent
Token ID83
Feature activation-7.12e-01

Pos logit contributions
ogene+4.667
obi+4.646
hene+4.547
imum+4.245
ochet+4.226
Neg logit contributions
semble-5.156
 Crate-4.882
 Nurs-4.725
 COL-4.246
CENT-4.163
Token c
Token ID269
Feature activation-7.85e-02

Pos logit contributions
ogene+7.251
obi+7.242
hene+7.061
imum+6.729
ochet+6.464
Neg logit contributions
semble-7.434
 Crate-7.213
 Nurs-6.895
CENT-6.351
-6.258
Tokenepend
Token ID2690
Feature activation-5.14e-01

Pos logit contributions
ogene+0.838
obi+0.817
hene+0.812
aucus+0.755
imum+0.751
Neg logit contributions
semble-0.922
 Crate-0.848
 Nurs-0.836
CENT-0.756
 COL-0.746
Tokenant
Token ID415
Feature activation-1.29e+00

Pos logit contributions
ogene+4.570
hene+4.429
obi+4.409
anski+4.022
imum+4.018
Neg logit contributions
semble-6.620
 Crate-6.302
 Nurs-6.169
CENT-5.657
 COL-5.584
Token que
Token ID8358
Feature activation-4.90e-01

Pos logit contributions
ogene+12.184
hene+12.113
obi+12.010
imum+11.244
ochet+10.968
Neg logit contributions
semble-11.414
 Crate-11.202
 Nurs-10.618
CENT-9.711
-9.676
Token le
Token ID443
Feature activation-2.32e-01

Pos logit contributions
ogene+5.148
obi+5.040
hene+5.037
ochet+4.620
anski+4.595
Neg logit contributions
semble-5.351
 Crate-4.998
 Nurs-4.833
CENT-4.449
-4.362
Token Command
Token ID9455
Feature activation-3.36e-02

Pos logit contributions
ogene+2.673
obi+2.607
hene+2.600
imum+2.427
aucus+2.409
Neg logit contributions
semble-2.450
 Crate-2.241
 Nurs-2.206
CENT-1.987
-1.942
Tokenement
Token ID972
Feature activation-1.73e-01

Pos logit contributions
ogene+0.377
obi+0.368
hene+0.365
aucus+0.341
imum+0.339
Neg logit contributions
semble-0.378
 Crate-0.345
 Nurs-0.339
CENT-0.305
 COL-0.302
Token est
Token ID1556
Feature activation-1.12e+00

Pos logit contributions
ogene+2.100
obi+2.056
hene+2.043
aucus+1.914
imum+1.912
Neg logit contributions
semble-1.771
 Crate-1.598
 Nurs-1.566
CENT-1.398
 COL-1.371
Token is
Token ID318
Feature activation+5.52e-01

Pos logit contributions
semble+4.454
 Crate+3.861
 Nurs+3.791
 COL+3.356
 prosecutors+3.343
Neg logit contributions
ogene-7.748
obi-7.497
hene-7.479
imum-7.347
ochet-7.261
Token by
Token ID416
Feature activation-1.23e-01

Pos logit contributions
semble+5.335
 Crate+4.783
 Nurs+4.700
 COL+4.244
 prosecutors+4.195
Neg logit contributions
ogene-6.640
imum-6.475
hene-6.470
obi-6.393
aucus-6.138
Token war
Token ID1175
Feature activation-3.65e-01

Pos logit contributions
ogene+1.413
obi+1.381
hene+1.368
aucus+1.279
imum+1.275
Neg logit contributions
semble-1.339
 Crate-1.217
 Nurs-1.196
CENT-1.076
 COL-1.056
Token.
Token ID13
Feature activation-4.22e-01

Pos logit contributions
ogene+3.925
obi+3.896
hene+3.841
aucus+3.552
imum+3.527
Neg logit contributions
semble-4.183
 Crate-3.808
 Nurs-3.734
CENT-3.400
 COL-3.321
Token In
Token ID554
Feature activation-4.03e-01

Pos logit contributions
obi+4.857
ogene+4.791
hene+4.750
aucus+4.439
anski+4.391
Neg logit contributions
semble-4.383
 Crate-3.716
 Nurs-3.579
CENT-3.364
-3.329
Token "
Token ID366
Feature activation-1.29e+00

Pos logit contributions
ogene+4.478
obi+4.370
hene+4.250
aucus+4.016
anski+4.013
Neg logit contributions
semble-4.397
 Crate-3.969
 Nurs-3.876
CENT-3.568
-3.470
TokenPost
Token ID6307
Feature activation-8.84e-01

Pos logit contributions
obi+11.417
ogene+11.146
hene+10.756
aucus+10.197
imum+10.166
Neg logit contributions
semble-12.840
 Crate-11.457
 Nurs-11.133
 prosecutors-10.623
 COL-10.514
Token Captain
Token ID8599
Feature activation-1.39e-01

Pos logit contributions
ogene+9.052
obi+8.938
hene+8.720
aucus+8.249
ochet+8.032
Neg logit contributions
semble-9.193
 Crate-8.425
 Nurs-7.935
CENT-7.606
-7.321
Token,"
Token ID553
Feature activation-4.70e-01

Pos logit contributions
ogene+1.470
obi+1.436
hene+1.422
aucus+1.323
imum+1.317
Neg logit contributions
semble-1.646
 Crate-1.510
 Nurs-1.486
CENT-1.349
 COL-1.330
Token Stephen
Token ID7970
Feature activation+1.75e-01

Pos logit contributions
ogene+5.429
obi+5.410
hene+5.225
aucus+4.930
anski+4.905
Neg logit contributions
semble-4.824
 Crate-4.334
 Nurs-4.201
CENT-3.927
-3.790
Token and
Token ID290
Feature activation-3.47e-01

Pos logit contributions
semble+1.846
 Crate+1.686
 Nurs+1.670
 COL+1.475
CENT+1.457
Neg logit contributions
ogene-2.048
obi-1.998
hene-1.998
imum-1.874
aucus-1.868
Token ob
Token ID909
Feature activation-6.92e-02

Pos logit contributions
ogene+12.289
obi+12.152
hene+11.623
 ob+11.093
ochet+10.713
Neg logit contributions
semble-12.148
 Nurs-11.402
 Crate-11.009
CENT-10.666
-10.464
Tokenb
Token ID65
Feature activation-1.05e+00

Pos logit contributions
ogene+0.746
obi+0.731
hene+0.722
aucus+0.672
imum+0.671
Neg logit contributions
semble-0.804
 Crate-0.739
 Nurs-0.730
CENT-0.658
 COL-0.650
Token_
Token ID62
Feature activation-2.70e-01

Pos logit contributions
ogene+10.479
obi+10.151
hene+10.040
ochet+9.594
imum+9.531
Neg logit contributions
semble-10.571
 Crate-9.757
 Nurs-9.628
CENT-8.863
 prosecutors-8.445
Tokensub
Token ID7266
Feature activation+2.22e-01

Pos logit contributions
ogene+2.310
obi+2.238
hene+2.220
imum+2.021
aucus+2.003
Neg logit contributions
semble-3.652
 Crate-3.450
 Nurs-3.403
CENT-3.109
 COL-3.102
Tokendiv
Token ID7146
Feature activation+1.22e-01

Pos logit contributions
semble+1.679
 Crate+1.377
 Nurs+1.355
CENT+1.138
 COL+1.123
Neg logit contributions
ogene-3.300
obi-3.260
hene-3.233
aucus-3.095
ochet-3.062
Tokenide
Token ID485
Feature activation-1.29e+00

Pos logit contributions
semble+1.090
 Crate+0.957
 Nurs+0.938
CENT+0.821
 COL+0.805
Neg logit contributions
ogene-1.650
obi-1.619
hene-1.605
aucus-1.524
imum-1.512
Token(
Token ID7
Feature activation-9.59e-01

Pos logit contributions
ogene+11.469
hene+11.285
obi+11.186
ochet+10.475
imum+10.323
Neg logit contributions
semble-12.902
 Crate-12.147
 Nurs-12.029
CENT-10.941
 prosecutors-10.717
Token part
Token ID636
Feature activation-4.35e-01

Pos logit contributions
obi+8.668
ogene+8.649
hene+8.231
imum+7.631
anski+7.451
Neg logit contributions
semble-10.530
 Nurs-10.280
 Crate-10.142
CENT-8.987
 COL-8.901
Token );
Token ID5619
Feature activation-6.87e-01

Pos logit contributions
ogene+4.646
obi+4.539
hene+4.490
imum+4.174
aucus+4.148
Neg logit contributions
semble-4.908
 Crate-4.515
 Nurs-4.461
CENT-3.986
 COL-3.911
Token }
Token ID1782
Feature activation-6.98e-01

Pos logit contributions
ogene+5.378
obi+5.167
hene+5.159
imum+4.639
ochet+4.596
Neg logit contributions
semble-9.219
 Nurs-8.391
 Crate-8.366
CENT-7.968
-7.620
Token
Token ID198
Feature activation+2.92e-01

Pos logit contributions
ogene+7.100
obi+7.027
hene+6.950
ochet+6.388
imum+6.384
Neg logit contributions
semble-7.658
 Nurs-6.969
 Crate-6.932
CENT-6.398
-6.162
Token
Token ID247
Feature activation-3.88e-01

Pos logit contributions
ogene+3.333
obi+3.281
hene+3.272
aucus+3.120
imum+3.096
Neg logit contributions
semble-1.394
 Crate-1.179
 Nurs-1.144
CENT-0.936
 COL-0.910
Tokens
Token ID82
Feature activation-3.68e-01

Pos logit contributions
ogene+4.305
obi+4.299
hene+4.250
aucus+3.946
anski+3.920
Neg logit contributions
semble-4.165
 Crate-3.835
 Nurs-3.701
CENT-3.307
 COL-3.259
Token meddling
Token ID33545
Feature activation-7.57e-01

Pos logit contributions
ogene+4.066
obi+4.020
hene+3.947
anski+3.664
imum+3.662
Neg logit contributions
semble-4.068
 Crate-3.733
 Nurs-3.616
CENT-3.310
-3.202
Token in
Token ID287
Feature activation-6.25e-01

Pos logit contributions
ogene+6.824
obi+6.713
hene+6.482
aucus+6.027
ochet+5.939
Neg logit contributions
semble-9.284
 Crate-8.677
 Nurs-8.361
CENT-7.811
 COL-7.512
Token the
Token ID262
Feature activation-8.23e-01

Pos logit contributions
ogene+5.971
obi+5.934
hene+5.778
imum+5.348
anski+5.333
Neg logit contributions
semble-7.527
 Crate-6.955
 Nurs-6.794
CENT-6.316
-6.113
Token US
Token ID1294
Feature activation-1.28e+00

Pos logit contributions
obi+6.391
ogene+6.328
hene+6.116
anski+5.604
aucus+5.542
Neg logit contributions
semble-10.966
 Crate-10.123
 Nurs-9.603
CENT-9.246
-9.176
Token election
Token ID3071
Feature activation-9.06e-01

Pos logit contributions
obi+8.730
hene+8.494
ogene+8.297
aucus+7.418
imum+7.396
Neg logit contributions
semble-15.977
 Crate-15.544
 Nurs-14.095
CENT-13.514
-13.408
Token and
Token ID290
Feature activation-6.76e-01

Pos logit contributions
ogene+8.719
hene+8.695
obi+8.641
aucus+7.992
imum+7.872
Neg logit contributions
semble-9.550
 Crate-9.076
 Nurs-8.373
CENT-7.888
-7.392
Token possible
Token ID1744
Feature activation-1.14e+00

Pos logit contributions
ogene+6.381
obi+6.273
hene+6.098
aucus+5.630
anski+5.535
Neg logit contributions
semble-8.068
 Crate-7.521
 Nurs-7.172
CENT-6.786
-6.564
Token ties
Token ID8470
Feature activation-8.12e-01

Pos logit contributions
obi+10.138
ogene+9.959
hene+9.243
aucus+9.122
anski+8.890
Neg logit contributions
semble-12.937
 Crate-11.760
 Nurs-11.118
CENT-10.558
-10.357
Token to
Token ID284
Feature activation-8.39e-01

Pos logit contributions
ogene+6.948
hene+6.882
obi+6.872
anski+6.341
aucus+6.246
Neg logit contributions
semble-10.065
 Crate-9.643
 Nurs-9.113
CENT-8.249
 COL-8.017
Token as
Token ID355
Feature activation-7.70e-02

Pos logit contributions
ogene+0.692
obi+0.673
hene+0.669
aucus+0.621
anski+0.620
Neg logit contributions
semble-0.716
 Crate-0.654
 Nurs-0.642
CENT-0.581
 COL-0.570
Token 23
Token ID2242
Feature activation-4.38e-01

Pos logit contributions
ogene+0.788
obi+0.763
hene+0.755
aucus+0.701
anski+0.699
Neg logit contributions
semble-0.935
 Crate-0.863
 Nurs-0.847
CENT-0.774
 COL-0.758
Token people
Token ID661
Feature activation-5.23e-01

Pos logit contributions
ogene+4.752
obi+4.587
hene+4.512
anski+4.261
aucus+4.258
Neg logit contributions
semble-4.890
 Crate-4.502
 Nurs-4.316
CENT-3.952
 COL-3.832
Token since
Token ID1201
Feature activation-5.60e-01

Pos logit contributions
ogene+5.592
obi+5.470
hene+5.398
ochet+5.071
anski+5.023
Neg logit contributions
semble-5.729
 Crate-5.311
 Nurs-5.199
CENT-4.690
-4.637
Token he
Token ID339
Feature activation-3.51e-01

Pos logit contributions
ogene+5.568
obi+5.420
hene+5.232
anski+4.925
aucus+4.891
Neg logit contributions
semble-6.587
 Crate-5.934
 Nurs-5.878
-5.371
CENT-5.343
Token entered
Token ID5982
Feature activation-1.27e+00

Pos logit contributions
ogene+3.760
obi+3.734
hene+3.696
anski+3.401
ochet+3.389
Neg logit contributions
semble-3.979
 Crate-3.635
 Nurs-3.555
CENT-3.253
-3.187
Token the
Token ID262
Feature activation-6.22e-01

Pos logit contributions
ogene+9.559
obi+9.423
hene+9.209
ochet+8.639
imum+8.410
Neg logit contributions
semble-15.065
 Crate-14.301
 Nurs-13.442
CENT-12.890
-12.129
Token U
Token ID471
Feature activation-2.91e-01

Pos logit contributions
ogene+5.566
obi+5.477
hene+5.373
anski+5.025
aucus+4.957
Neg logit contributions
semble-7.868
 Crate-7.190
 Nurs-6.909
CENT-6.599
-6.398
Token.
Token ID13
Feature activation+1.26e-01

Pos logit contributions
ogene+2.893
obi+2.884
hene+2.812
aucus+2.576
ochet+2.576
Neg logit contributions
semble-3.553
 Crate-3.267
 Nurs-3.214
CENT-2.902
-2.891
TokenS
Token ID50
Feature activation-4.83e-04

Pos logit contributions
semble+1.609
 Crate+1.490
 Nurs+1.471
CENT+1.349
 COL+1.330
Neg logit contributions
ogene-1.211
obi-1.169
hene-1.167
aucus-1.077
imum-1.070
Token.
Token ID13
Feature activation-1.37e-01

Pos logit contributions
ogene+0.006
obi+0.006
hene+0.006
aucus+0.005
imum+0.005
Neg logit contributions
semble-0.005
 Crate-0.005
 Nurs-0.004
CENT-0.004
 COL-0.004
TokenValid
Token ID47139
Feature activation-1.94e-01

Pos logit contributions
ogene+0.193
obi+0.188
hene+0.187
aucus+0.175
imum+0.173
Neg logit contributions
semble-0.204
 Crate-0.187
 Nurs-0.184
CENT-0.166
 COL-0.164
Token )
Token ID1267
Feature activation-5.01e-02

Pos logit contributions
ogene+2.330
obi+2.280
hene+2.255
imum+2.124
aucus+2.116
Neg logit contributions
semble-1.994
 Crate-1.802
 Nurs-1.789
CENT-1.579
 COL-1.567
Token )
Token ID1267
Feature activation-9.33e-01

Pos logit contributions
ogene+0.550
obi+0.537
hene+0.533
aucus+0.497
imum+0.495
Neg logit contributions
semble-0.573
 Crate-0.524
 Nurs-0.516
CENT-0.465
 COL-0.459
Token return
Token ID1441
Feature activation-1.47e+00

Pos logit contributions
ogene+8.321
obi+8.211
hene+7.845
ochet+7.264
imum+7.099
Neg logit contributions
semble-10.484
 Nurs-9.810
 Crate-9.553
CENT-9.105
-8.519
Token space
Token ID2272
Feature activation-5.30e-01

Pos logit contributions
ogene+13.078
obi+12.355
hene+12.116
 ob+11.897
imum+11.173
Neg logit contributions
semble-13.584
 Nurs-12.053
 Crate-11.823
CENT-11.652
-11.114
Token;
Token ID26
Feature activation-1.26e+00

Pos logit contributions
ogene+5.253
hene+5.024
obi+5.011
imum+4.628
ochet+4.614
Neg logit contributions
semble-6.327
 Crate-5.780
 Nurs-5.698
CENT-5.232
 COL-5.041
Token //
Token ID3373
Feature activation-5.18e-01

Pos logit contributions
ogene+10.508
obi+10.123
hene+10.090
ochet+9.032
imum+9.005
Neg logit contributions
semble-13.388
 Nurs-11.915
 Crate-11.775
CENT-11.659
-11.018
Token 4
Token ID604
Feature activation-3.76e-01

Pos logit contributions
ogene+5.318
obi+5.165
hene+5.121
imum+4.715
anski+4.677
Neg logit contributions
semble-5.912
 Nurs-5.322
 Crate-5.287
CENT-4.941
-4.796
Token.
Token ID13
Feature activation+7.65e-02

Pos logit contributions
ogene+3.629
obi+3.583
hene+3.525
imum+3.273
ochet+3.260
Neg logit contributions
semble-4.548
 Crate-4.129
 Nurs-4.115
CENT-3.816
-3.753
Token Rec
Token ID3311
Feature activation+3.20e-01

Pos logit contributions
semble+0.741
 Crate+0.668
 Nurs+0.655
CENT+0.575
 COL+0.569
Neg logit contributions
ogene-0.975
obi-0.955
hene-0.949
aucus-0.895
imum-0.891
Tokenurs
Token ID1834
Feature activation+3.13e-01

Pos logit contributions
semble+4.035
 Crate+3.664
 Nurs+3.555
CENT+3.235
 COL+3.214
Neg logit contributions
ogene-3.075
obi-3.007
hene-2.915
anski-2.776
aucus-2.740
Token
Token ID198
Feature activation+1.81e-01

Pos logit contributions
ogene+0.174
obi+0.171
hene+0.169
aucus+0.159
imum+0.158
Neg logit contributions
semble-0.150
 Crate-0.135
 Nurs-0.133
CENT-0.118
 COL-0.116
TokenNow
Token ID3844
Feature activation-5.79e-02

Pos logit contributions
semble+1.977
 Crate+1.798
 Nurs+1.790
CENT+1.637
 COL+1.587
Neg logit contributions
ogene-2.083
hene-2.019
obi-2.000
imum-1.876
anski-1.874
Token,
Token ID11
Feature activation+1.30e-01

Pos logit contributions
ogene+0.660
obi+0.645
hene+0.641
aucus+0.600
imum+0.596
Neg logit contributions
semble-0.637
 Crate-0.580
 Nurs-0.569
CENT-0.513
 COL-0.504
Token I
Token ID314
Feature activation+1.29e-01

Pos logit contributions
semble+1.409
 Crate+1.285
 Nurs+1.271
CENT+1.127
 COL+1.125
Neg logit contributions
ogene-1.503
obi-1.470
hene-1.463
imum-1.368
aucus-1.361
Token suspect
Token ID4099
Feature activation+3.88e-01

Pos logit contributions
semble+1.296
 Crate+1.172
 Nurs+1.155
CENT+1.020
 COL+1.014
Neg logit contributions
ogene-1.579
obi-1.545
hene-1.542
aucus-1.450
imum-1.447
Token some
Token ID617
Feature activation+8.32e-01

Pos logit contributions
semble+3.694
 Nurs+3.422
 Crate+3.398
 COL+2.977
 prosecutors+2.908
Neg logit contributions
ogene-4.828
hene-4.738
obi-4.703
imum-4.449
anski-4.436
Token GOP
Token ID6796
Feature activation+6.70e-01

Pos logit contributions
semble+7.753
 Nurs+7.275
 Crate+7.204
 COL+6.403
 prosecutors+6.294
Neg logit contributions
ogene-9.454
obi-9.275
hene-9.242
imum-9.049
anski-9.036
Token senators
Token ID15469
Feature activation+3.94e-01

Pos logit contributions
semble+6.591
 Nurs+6.061
 Crate+5.976
 prosecutors+5.355
CENT+5.326
Neg logit contributions
ogene-7.836
hene-7.586
obi-7.443
anski-7.259
imum-7.226
Token and
Token ID290
Feature activation+1.12e-01

Pos logit contributions
semble+3.624
 Crate+3.263
 Nurs+3.211
CENT+2.800
 COL+2.797
Neg logit contributions
ogene-5.078
obi-4.947
hene-4.919
imum-4.705
aucus-4.667
Token their
Token ID511
Feature activation+1.01e-01

Pos logit contributions
semble+1.199
 Crate+1.091
 Nurs+1.079
CENT+0.960
 COL+0.951
Neg logit contributions
ogene-1.300
obi-1.272
hene-1.265
imum-1.183
aucus-1.180
Token allies
Token ID7681
Feature activation+4.65e-01

Pos logit contributions
semble+1.111
 Crate+1.015
 Nurs+1.000
CENT+0.894
 COL+0.885
Neg logit contributions
ogene-1.157
obi-1.129
hene-1.120
imum-1.047
aucus-1.044
Token the
Token ID262
Feature activation+1.07e-01

Pos logit contributions
semble+2.773
 Crate+2.542
 Nurs+2.525
 COL+2.193
+2.192
Neg logit contributions
ogene-3.493
obi-3.406
hene-3.338
aucus-3.179
imum-3.168
Token argument
Token ID4578
Feature activation+4.90e-01

Pos logit contributions
semble+1.077
 Crate+0.976
 Nurs+0.961
CENT+0.847
 COL+0.840
Neg logit contributions
ogene-1.312
obi-1.285
hene-1.280
aucus-1.205
imum-1.197
Token is
Token ID318
Feature activation+1.35e-01

Pos logit contributions
semble+4.789
 Crate+4.338
 Nurs+4.266
+3.786
 COL+3.760
Neg logit contributions
ogene-5.861
obi-5.721
hene-5.702
aucus-5.456
ochet-5.448
Token specific
Token ID2176
Feature activation+1.09e-01

Pos logit contributions
semble+1.374
 Crate+1.244
 Nurs+1.223
CENT+1.083
 COL+1.074
Neg logit contributions
ogene-1.651
obi-1.610
hene-1.606
aucus-1.511
imum-1.504
Token to
Token ID284
Feature activation+5.72e-01

Pos logit contributions
semble+1.146
 Crate+1.033
 Nurs+1.022
 COL+0.905
CENT+0.904
Neg logit contributions
ogene-1.293
hene-1.259
obi-1.254
aucus-1.188
anski-1.180
Token feminist
Token ID14314
Feature activation+8.39e-01

Pos logit contributions
semble+4.969
 Nurs+4.682
 Crate+4.632
 prosecutors+3.998
 COL+3.974
Neg logit contributions
ogene-7.386
obi-7.158
hene-6.961
aucus-6.752
imum-6.649
Token political
Token ID1964
Feature activation+2.59e-01

Pos logit contributions
semble+6.600
 Nurs+6.209
 Crate+6.162
 prosecutors+5.640
 COL+5.454
Neg logit contributions
ogene-10.731
hene-10.264
obi-10.151
imum-10.130
aucus-10.125
Token and
Token ID290
Feature activation-1.91e-01

Pos logit contributions
semble+2.488
 Nurs+2.241
 Crate+2.225
CENT+1.953
 COL+1.943
Neg logit contributions
ogene-3.292
obi-3.187
hene-3.181
aucus-3.008
imum-2.997
Token moral
Token ID6573
Feature activation+6.13e-01

Pos logit contributions
ogene+2.238
obi+2.206
hene+2.189
anski+2.022
imum+2.020
Neg logit contributions
semble-2.020
 Crate-1.846
 Nurs-1.795
CENT-1.609
 COL-1.567
Token commitments
Token ID19405
Feature activation+5.73e-02

Pos logit contributions
semble+5.105
 Nurs+4.615
 Crate+4.573
 prosecutors+4.085
 COL+4.003
Neg logit contributions
ogene-8.117
hene-7.936
obi-7.711
aucus-7.520
imum-7.384
Token because
Token ID780
Feature activation+3.48e-02

Pos logit contributions
semble+0.615
 Crate+0.560
 Nurs+0.551
CENT+0.491
 COL+0.487
Neg logit contributions
ogene-0.668
obi-0.652
hene-0.648
aucus-0.610
imum-0.606
Token't
Token ID470
Feature activation+1.15e-02

Pos logit contributions
semble+1.170
 Crate+1.054
 Nurs+1.033
CENT+0.910
 COL+0.898
Neg logit contributions
ogene-1.545
hene-1.505
obi-1.504
aucus-1.423
imum-1.413
Token I
Token ID314
Feature activation+2.80e-01

Pos logit contributions
semble+0.121
 Crate+0.110
 Nurs+0.109
CENT+0.096
 COL+0.095
Neg logit contributions
ogene-0.136
obi-0.132
hene-0.132
aucus-0.124
imum-0.124
Token tell
Token ID1560
Feature activation+1.03e-01

Pos logit contributions
semble+2.886
 Nurs+2.591
 Crate+2.590
 COL+2.287
CENT+2.235
Neg logit contributions
ogene-3.310
hene-3.270
obi-3.238
imum-3.115
aucus-3.101
Token you
Token ID345
Feature activation+1.27e-01

Pos logit contributions
semble+0.961
 Crate+0.863
 Nurs+0.847
CENT+0.738
 COL+0.733
Neg logit contributions
ogene-1.337
obi-1.311
hene-1.305
aucus-1.231
imum-1.228
Token I
Token ID314
Feature activation-1.91e-01

Pos logit contributions
semble+1.352
 Crate+1.251
 Nurs+1.228
 COL+1.091
CENT+1.084
Neg logit contributions
ogene-1.475
obi-1.434
hene-1.423
aucus-1.347
imum-1.343
Token'd
Token ID1549
Feature activation+8.35e-01

Pos logit contributions
ogene+2.180
obi+2.138
hene+2.122
aucus+1.978
anski+1.970
Neg logit contributions
semble-2.090
 Crate-1.900
 Nurs-1.868
CENT-1.678
 COL-1.653
Token burn
Token ID4245
Feature activation-8.04e-02

Pos logit contributions
semble+4.803
 Crate+4.090
 Nurs+3.820
 COL+3.598
 inciner+3.304
Neg logit contributions
ogene-12.285
hene-12.270
obi-12.088
imum-12.020
aucus-11.862
Token your
Token ID534
Feature activation+2.64e-01

Pos logit contributions
ogene+1.056
obi+1.035
hene+1.030
aucus+0.972
imum+0.969
Neg logit contributions
semble-0.746
 Crate-0.669
 Nurs-0.656
CENT-0.571
 COL-0.567
Token (
Token ID357
Feature activation-6.11e-01

Pos logit contributions
semble+2.426
 Crate+2.253
 Nurs+2.171
 COL+1.914
+1.857
Neg logit contributions
ogene-3.437
obi-3.309
hene-3.285
imum-3.154
aucus-3.144
Tokenre
Token ID260
Feature activation+4.76e-01

Pos logit contributions
obi+2.812
ogene+2.754
hene+2.746
anski+2.252
imum+2.197
Neg logit contributions
semble-10.256
 Nurs-9.734
 Crate-9.712
 COL-9.145
CENT-9.072
Tokenar
Token ID283
Feature activation-1.90e-01

Pos logit contributions
semble+4.554
 Crate+3.969
 Nurs+3.691
CENT+3.338
 COL+3.285
Neg logit contributions
ogene-6.094
hene-5.840
obi-5.784
aucus-5.567
anski-5.488
Token UC
Token ID14417
Feature activation+3.83e-01

Pos logit contributions
semble+2.327
 Crate+2.126
 Nurs+2.124
 COL+1.889
 prosecutors+1.881
Neg logit contributions
ogene-2.767
obi-2.714
hene-2.707
aucus-2.544
imum-2.543
Token officials
Token ID2828
Feature activation+7.20e-01

Pos logit contributions
semble+3.658
 Crate+3.423
 Nurs+3.391
 prosecutors+3.052
 COL+3.046
Neg logit contributions
ogene-4.689
obi-4.575
hene-4.562
aucus-4.298
imum-4.284
Token needed
Token ID2622
Feature activation+8.07e-01

Pos logit contributions
semble+6.393
 Crate+5.868
 Nurs+5.676
 prosecutors+5.285
 COL+5.142
Neg logit contributions
ogene-9.023
obi-8.731
hene-8.574
aucus-8.402
imum-8.325
Token to
Token ID284
Feature activation+6.38e-01

Pos logit contributions
semble+6.324
 prosecutors+5.753
 Crate+5.751
 Nurs+5.726
+5.327
Neg logit contributions
ogene-10.410
hene-10.168
obi-10.104
ochet-9.503
aucus-9.452
Token provide
Token ID2148
Feature activation+6.39e-01

Pos logit contributions
semble+5.656
 Crate+5.086
 Nurs+4.797
 COL+4.485
 prosecutors+4.477
Neg logit contributions
ogene-8.274
hene-8.135
obi-8.068
aucus-7.751
imum-7.592
Token fundamental
Token ID7531
Feature activation+1.08e+00

Pos logit contributions
semble+5.632
 Crate+5.269
 Nurs+5.233
 prosecutors+4.977
 COL+4.736
Neg logit contributions
ogene-8.013
hene-7.983
obi-7.894
aucus-7.475
imum-7.417
Token fairness
Token ID22692
Feature activation+7.75e-01

Pos logit contributions
semble+7.880
 prosecutors+7.387
 Nurs+7.318
 Crate+7.288
 COL+6.806
Neg logit contributions
hene-13.349
ogene-13.140
aucus-13.134
ochet-12.779
obi-12.701
Token to
Token ID284
Feature activation+4.21e-01

Pos logit contributions
semble+6.501
 Crate+5.659
 Nurs+5.623
 prosecutors+5.494
 COL+5.194
Neg logit contributions
ogene-9.793
hene-9.672
obi-9.410
aucus-9.217
anski-9.110
Token a
Token ID257
Feature activation+5.72e-01

Pos logit contributions
semble+4.078
 Nurs+3.720
 Crate+3.717
 prosecutors+3.384
 COL+3.347
Neg logit contributions
ogene-5.074
hene-5.051
obi-5.037
aucus-4.769
imum-4.751
Token state
Token ID1181
Feature activation+4.09e-01

Pos logit contributions
semble+5.270
 Crate+4.760
 Nurs+4.711
 COL+4.294
 prosecutors+4.234
Neg logit contributions
ogene-7.140
obi-6.989
hene-6.929
aucus-6.690
imum-6.602
Token university
Token ID6403
Feature activation+3.30e-01

Pos logit contributions
semble+3.787
 Nurs+3.433
 Crate+3.427
 prosecutors+3.093
 COL+3.087
Neg logit contributions
ogene-5.197
hene-5.077
obi-5.020
imum-4.833
aucus-4.751
Token adults
Token ID6490
Feature activation+3.00e-01

Pos logit contributions
semble+6.642
 Crate+5.922
 Nurs+5.881
 prosecutors+5.488
 COL+5.476
Neg logit contributions
ogene-8.359
hene-8.196
obi-8.066
imum-7.676
ochet-7.670
Token who
Token ID508
Feature activation+3.03e-01

Pos logit contributions
semble+2.860
 Crate+2.554
 Nurs+2.512
CENT+2.255
 COL+2.232
Neg logit contributions
ogene-3.782
obi-3.704
hene-3.690
imum-3.524
aucus-3.514
Token still
Token ID991
Feature activation+1.22e-01

Pos logit contributions
semble+3.031
 Crate+2.732
 Nurs+2.663
 COL+2.400
CENT+2.379
Neg logit contributions
ogene-3.692
obi-3.620
hene-3.595
imum-3.434
aucus-3.413
Token need
Token ID761
Feature activation+9.06e-01

Pos logit contributions
semble+1.272
 Crate+1.151
 Nurs+1.128
CENT+1.009
 COL+1.005
Neg logit contributions
ogene-1.448
obi-1.419
hene-1.412
imum-1.328
aucus-1.327
Token or
Token ID393
Feature activation+4.89e-01

Pos logit contributions
semble+7.557
 Crate+6.924
 Nurs+6.893
 COL+6.260
 prosecutors+6.226
Neg logit contributions
ogene-10.966
hene-10.740
obi-10.542
anski-10.435
ochet-10.242
Token want
Token ID765
Feature activation+9.54e-01

Pos logit contributions
semble+4.530
 Crate+3.929
 Nurs+3.920
 COL+3.545
 prosecutors+3.495
Neg logit contributions
ogene-6.196
hene-6.098
obi-6.087
imum-5.845
anski-5.737
Token nicotine
Token ID21856
Feature activation+4.53e-01

Pos logit contributions
semble+7.510
 Crate+6.944
 Nurs+6.703
 COL+6.338
 prosecutors+6.184
Neg logit contributions
ogene-11.681
hene-11.397
obi-11.243
anski-11.211
ochet-10.966
Token could
Token ID714
Feature activation+4.92e-01

Pos logit contributions
semble+4.307
 Crate+3.837
 Nurs+3.730
 COL+3.385
CENT+3.371
Neg logit contributions
ogene-5.580
hene-5.519
obi-5.386
anski-5.243
aucus-5.213
Token get
Token ID651
Feature activation+7.36e-01

Pos logit contributions
semble+4.841
 Crate+4.247
 Nurs+4.081
 COL+3.822
CENT+3.682
Neg logit contributions
ogene-6.000
hene-5.954
obi-5.879
aucus-5.607
imum-5.532
Token it
Token ID340
Feature activation+6.90e-01

Pos logit contributions
semble+6.238
 Crate+5.779
 Nurs+5.777
 COL+5.271
 prosecutors+5.139
Neg logit contributions
ogene-9.176
hene-8.809
obi-8.766
anski-8.705
imum-8.429
Token from
Token ID422
Feature activation+6.60e-01

Pos logit contributions
semble+6.180
 Crate+5.496
 Nurs+5.296
 prosecutors+4.850
 COL+4.792
Neg logit contributions
ogene-8.457
hene-8.408
obi-8.204
imum-7.928
aucus-7.926
Token reducing
Token ID8868
Feature activation-2.48e-01

Pos logit contributions
ogene+0.013
obi+0.013
hene+0.013
aucus+0.012
imum+0.012
Neg logit contributions
semble-0.012
 Crate-0.011
 Nurs-0.011
CENT-0.010
 COL-0.009
Token carbon
Token ID6588
Feature activation-1.60e-01

Pos logit contributions
ogene+2.875
obi+2.803
hene+2.768
aucus+2.607
imum+2.600
Neg logit contributions
semble-2.646
 Crate-2.426
 Nurs-2.374
CENT-2.137
-2.068
Token dioxide
Token ID17556
Feature activation-2.68e-01

Pos logit contributions
ogene+1.949
obi+1.908
hene+1.894
aucus+1.778
imum+1.770
Neg logit contributions
semble-1.643
 Crate-1.488
 Nurs-1.462
CENT-1.299
 COL-1.272
Token emissions
Token ID8971
Feature activation-2.01e-02

Pos logit contributions
ogene+3.195
obi+3.133
hene+3.102
imum+2.896
aucus+2.891
Neg logit contributions
semble-2.780
 Crate-2.543
 Nurs-2.475
CENT-2.215
-2.126
Token by
Token ID416
Feature activation+5.09e-02

Pos logit contributions
ogene+0.230
obi+0.225
hene+0.223
aucus+0.209
imum+0.208
Neg logit contributions
semble-0.220
 Crate-0.200
 Nurs-0.197
CENT-0.176
 COL-0.174
Token 40
Token ID2319
Feature activation+4.64e-01

Pos logit contributions
semble+0.556
 Crate+0.507
 Nurs+0.499
CENT+0.447
 COL+0.442
Neg logit contributions
ogene-0.585
obi-0.572
hene-0.568
aucus-0.531
imum-0.529
Token%
Token ID4
Feature activation-1.86e-01

Pos logit contributions
semble+3.646
 Nurs+3.158
 Crate+3.094
CENT+2.850
 COL+2.816
Neg logit contributions
ogene-6.491
obi-6.341
hene-6.297
imum-6.051
aucus-6.045
Token by
Token ID416
Feature activation-6.60e-01

Pos logit contributions
ogene+2.008
obi+1.977
hene+1.966
aucus+1.815
imum+1.815
Neg logit contributions
semble-2.123
 Crate-1.958
 Nurs-1.919
CENT-1.723
 COL-1.698
Token 2020
Token ID12131
Feature activation-1.48e-01

Pos logit contributions
ogene+4.848
obi+4.802
hene+4.698
anski+4.394
aucus+4.211
Neg logit contributions
semble-9.122
 Crate-8.499
 Nurs-8.190
CENT-7.682
-7.625
Token from
Token ID422
Feature activation-3.13e-01

Pos logit contributions
ogene+1.677
obi+1.641
hene+1.631
aucus+1.522
imum+1.517
Neg logit contributions
semble-1.644
 Crate-1.502
 Nurs-1.470
CENT-1.318
 COL-1.300
Token 1990
Token ID6303
Feature activation-1.44e-01

Pos logit contributions
ogene+3.126
obi+3.074
hene+3.038
aucus+2.834
anski+2.824
Neg logit contributions
semble-3.816
 Crate-3.525
 Nurs-3.415
CENT-3.129
 COL-3.062
Token it
Token ID340
Feature activation-8.12e-01

Pos logit contributions
ogene+7.305
obi+7.235
hene+6.922
ochet+6.618
anski+6.443
Neg logit contributions
semble-9.118
 Crate-8.062
 Nurs-7.850
CENT-7.530
-7.165
Token,
Token ID11
Feature activation-6.97e-01

Pos logit contributions
ogene+7.309
obi+7.265
hene+6.700
ochet+6.467
anski+6.343
Neg logit contributions
semble-9.792
 Crate-8.925
 Nurs-8.685
CENT-8.176
-7.864
Token
Token ID447
Feature activation-1.10e-01

Pos logit contributions
ogene+7.208
obi+7.148
hene+6.754
anski+6.477
ochet+6.381
Neg logit contributions
semble-7.679
 Crate-6.853
 Nurs-6.721
CENT-6.215
-6.074
Token
Token ID251
Feature activation-3.80e-01

Pos logit contributions
ogene+1.797
obi+1.767
hene+1.763
aucus+1.684
imum+1.675
Neg logit contributions
semble-0.679
 Crate-0.567
 Nurs-0.549
CENT-0.440
 COL-0.425
Token said
Token ID531
Feature activation-5.68e-01

Pos logit contributions
ogene+4.175
obi+4.152
hene+4.020
anski+3.784
aucus+3.768
Neg logit contributions
semble-4.222
 Crate-3.810
 Nurs-3.756
CENT-3.384
 COL-3.305
Token Moh
Token ID9719
Feature activation+2.60e-01

Pos logit contributions
ogene+6.274
hene+6.231
obi+6.219
anski+5.764
aucus+5.737
Neg logit contributions
semble-6.023
 Crate-5.351
 Nurs-5.180
CENT-4.823
-4.580
Tokenan
Token ID272
Feature activation-2.43e-01

Pos logit contributions
semble+1.697
 Crate+1.410
 Nurs+1.391
CENT+1.131
 COL+1.088
Neg logit contributions
ogene-4.115
hene-4.017
obi-4.007
aucus-3.858
imum-3.823
Token.
Token ID13
Feature activation-7.56e-02

Pos logit contributions
ogene+2.709
obi+2.656
hene+2.624
aucus+2.441
imum+2.437
Neg logit contributions
semble-2.733
 Crate-2.495
 Nurs-2.428
CENT-2.207
 COL-2.152
Token
Token ID198
Feature activation-1.39e-01

Pos logit contributions
ogene+0.946
obi+0.923
hene+0.920
aucus+0.866
imum+0.862
Neg logit contributions
semble-0.749
 Crate-0.680
 Nurs-0.668
CENT-0.590
 COL-0.583
Token
Token ID198
Feature activation-2.55e-01

Pos logit contributions
ogene+1.590
obi+1.582
hene+1.553
aucus+1.451
imum+1.445
Neg logit contributions
semble-1.510
 Crate-1.360
 Nurs-1.340
CENT-1.197
-1.194
Token"
Token ID1
Feature activation-5.85e-01

Pos logit contributions
ogene+2.679
obi+2.649
hene+2.599
imum+2.417
aucus+2.411
Neg logit contributions
semble-2.994
 Crate-2.748
 Nurs-2.699
 COL-2.405
CENT-2.401
Token was
Token ID373
Feature activation+3.78e-01

Pos logit contributions
semble+0.007
 Crate+0.006
 Nurs+0.006
CENT+0.006
 COL+0.006
Neg logit contributions
ogene-0.008
obi-0.008
hene-0.008
aucus-0.007
imum-0.007
Token cancelled
Token ID16769
Feature activation-2.31e-01

Pos logit contributions
semble+3.566
 Crate+3.172
 Nurs+3.084
 COL+2.758
CENT+2.742
Neg logit contributions
ogene-4.773
obi-4.626
hene-4.590
imum-4.440
aucus-4.377
Token
Token ID198
Feature activation+9.73e-02

Pos logit contributions
ogene+2.588
obi+2.519
hene+2.506
aucus+2.349
anski+2.326
Neg logit contributions
semble-2.566
 Crate-2.347
 Nurs-2.298
CENT-2.070
 COL-2.018
Token
Token ID198
Feature activation-2.66e-01

Pos logit contributions
semble+0.964
 Crate+0.873
 Nurs+0.857
CENT+0.761
 COL+0.750
Neg logit contributions
ogene-1.217
obi-1.187
hene-1.183
aucus-1.113
imum-1.109
TokenA
Token ID32
Feature activation-2.53e-03

Pos logit contributions
ogene+2.785
obi+2.770
hene+2.709
imum+2.508
aucus+2.495
Neg logit contributions
semble-3.112
 Crate-2.883
 Nurs-2.843
 COL-2.520
-2.493
Token man
Token ID582
Feature activation+1.20e-01

Pos logit contributions
ogene+0.029
obi+0.028
hene+0.028
aucus+0.026
imum+0.026
Neg logit contributions
semble-0.028
 Crate-0.025
 Nurs-0.025
CENT-0.022
 COL-0.022
Token was
Token ID373
Feature activation+1.07e-01

Pos logit contributions
semble+1.261
 Crate+1.143
 Nurs+1.123
CENT+0.997
 COL+0.994
Neg logit contributions
ogene-1.423
obi-1.387
hene-1.383
aucus-1.301
imum-1.300
Token able
Token ID1498
Feature activation-5.72e-02

Pos logit contributions
semble+1.045
 Crate+0.943
 Nurs+0.923
CENT+0.815
 COL+0.812
Neg logit contributions
ogene-1.349
obi-1.317
hene-1.312
aucus-1.248
imum-1.240
Token to
Token ID284
Feature activation+4.10e-02

Pos logit contributions
ogene+0.626
obi+0.611
hene+0.607
imum+0.564
aucus+0.563
Neg logit contributions
semble-0.651
 Crate-0.600
 Nurs-0.590
CENT-0.531
 COL-0.523
Token successfully
Token ID7675
Feature activation-1.96e-01

Pos logit contributions
semble+0.510
 Crate+0.470
 Nurs+0.463
CENT+0.421
 COL+0.418
Neg logit contributions
ogene-0.409
obi-0.398
hene-0.396
aucus-0.366
imum-0.364
Token buy
Token ID2822
Feature activation-2.23e-01

Pos logit contributions
ogene+1.975
obi+1.934
hene+1.892
imum+1.760
anski+1.757
Neg logit contributions
semble-2.398
 Crate-2.214
 Nurs-2.185
CENT-1.985
-1.955
Token (
Token ID357
Feature activation+6.03e-01

Pos logit contributions
ogene+1.307
obi+1.281
hene+1.277
aucus+1.201
imum+1.196
Neg logit contributions
semble-1.001
 Crate-0.898
 Nurs-0.881
CENT-0.777
 COL-0.762
TokenD
Token ID35
Feature activation+2.93e-01

Pos logit contributions
semble+5.958
 Crate+5.228
 Nurs+5.004
+4.908
 COL+4.882
Neg logit contributions
ogene-7.294
obi-6.978
hene-6.917
anski-6.639
aucus-6.515
TokenACA
Token ID26576
Feature activation-4.31e-01

Pos logit contributions
semble+3.490
 Crate+3.236
 Nurs+3.162
 COL+2.910
CENT+2.882
Neg logit contributions
ogene-3.000
hene-2.876
obi-2.856
imum-2.659
anski-2.640
Token)
Token ID8
Feature activation+1.04e-01

Pos logit contributions
ogene+4.865
obi+4.799
hene+4.755
aucus+4.510
imum+4.445
Neg logit contributions
semble-4.673
 Crate-4.200
 Nurs-4.113
CENT-3.741
 COL-3.650
Token program
Token ID1430
Feature activation+3.74e-01

Pos logit contributions
semble+0.872
 Crate+0.771
 Nurs+0.762
CENT+0.653
 COL+0.648
Neg logit contributions
ogene-1.466
obi-1.433
hene-1.423
aucus-1.350
imum-1.347
Token.
Token ID13
Feature activation+2.76e-01

Pos logit contributions
semble+3.582
 Crate+3.266
 Nurs+3.251
 COL+2.882
CENT+2.842
Neg logit contributions
ogene-4.622
obi-4.514
hene-4.423
imum-4.216
aucus-4.198
Token
Token ID198
Feature activation-2.70e-01

Pos logit contributions
semble+2.557
 Crate+2.373
 Nurs+2.368
 COL+2.048
CENT+2.033
Neg logit contributions
ogene-3.592
hene-3.461
obi-3.455
imum-3.240
anski-3.238
Token
Token ID198
Feature activation+8.61e-02

Pos logit contributions
obi+2.891
ogene+2.863
hene+2.809
aucus+2.629
imum+2.607
Neg logit contributions
semble-3.062
 Crate-2.757
 Nurs-2.725
-2.461
CENT-2.439
TokenTrump
Token ID6170
Feature activation-2.74e-01

Pos logit contributions
semble+0.991
 Crate+0.905
 Nurs+0.896
CENT+0.822
 COL+0.800
Neg logit contributions
ogene-0.942
hene-0.910
obi-0.906
aucus-0.845
imum-0.841
Token formally
Token ID15832
Feature activation+9.19e-02

Pos logit contributions
ogene+2.865
obi+2.833
hene+2.780
aucus+2.596
anski+2.578
Neg logit contributions
semble-3.248
 Crate-2.946
 Nurs-2.876
CENT-2.653
 COL-2.578
Token ended
Token ID4444
Feature activation+4.58e-01

Pos logit contributions
semble+1.059
 Crate+0.969
 Nurs+0.956
CENT+0.860
 COL+0.853
Neg logit contributions
ogene-1.000
obi-0.974
hene-0.969
aucus-0.902
imum-0.900
Token Ricky
Token ID31668
Feature activation-3.53e-01

Pos logit contributions
ogene+2.615
obi+2.559
hene+2.531
aucus+2.364
imum+2.350
Neg logit contributions
semble-2.615
 Crate-2.369
 Nurs-2.323
CENT-2.121
-2.069
Token G
Token ID402
Feature activation+1.09e-01

Pos logit contributions
obi+3.876
ogene+3.864
hene+3.823
anski+3.592
aucus+3.569
Neg logit contributions
semble-3.895
 Crate-3.480
 Nurs-3.374
CENT-3.135
-3.026
Tokenerv
Token ID712
Feature activation-2.78e-01

Pos logit contributions
semble+1.488
 Crate+1.402
 Nurs+1.385
 COL+1.261
+1.252
Neg logit contributions
ogene-0.943
obi-0.904
hene-0.898
imum-0.836
aucus-0.821
Tokenais
Token ID15152
Feature activation-2.56e-01

Pos logit contributions
ogene+3.355
obi+3.324
hene+3.290
anski+3.145
aucus+3.040
Neg logit contributions
semble-2.830
 Crate-2.597
 Nurs-2.499
CENT-2.327
-2.238
Token
Token ID198
Feature activation-1.44e-01

Pos logit contributions
ogene+3.047
obi+2.992
hene+2.966
aucus+2.776
imum+2.774
Neg logit contributions
semble-2.662
 Crate-2.407
 Nurs-2.354
CENT-2.116
 COL-2.074
Token
Token ID198
Feature activation+1.41e-01

Pos logit contributions
ogene+1.632
obi+1.624
hene+1.592
aucus+1.492
imum+1.482
Neg logit contributions
semble-1.570
 Crate-1.416
 Nurs-1.395
-1.251
CENT-1.250
Token2007
Token ID12726
Feature activation-1.13e-02

Pos logit contributions
semble+1.381
 Crate+1.229
 Nurs+1.218
CENT+1.102
+1.055
Neg logit contributions
ogene-1.769
hene-1.717
obi-1.711
aucus-1.614
imum-1.607
Token Alec
Token ID38285
Feature activation+3.98e-01

Pos logit contributions
ogene+0.133
obi+0.130
hene+0.130
aucus+0.121
imum+0.121
Neg logit contributions
semble-0.121
 Crate-0.110
 Nurs-0.108
CENT-0.096
 COL-0.096
Token Baldwin
Token ID26402
Feature activation-2.09e-01

Pos logit contributions
semble+4.376
 Crate+4.099
 Nurs+4.076
 COL+3.627
 prosecutors+3.524
Neg logit contributions
hene-4.363
ogene-4.358
obi-4.207
ochet-3.991
anski-3.952
Token
Token ID198
Feature activation-1.92e-01

Pos logit contributions
ogene+2.510
obi+2.459
hene+2.441
aucus+2.289
imum+2.283
Neg logit contributions
semble-2.172
 Crate-1.965
 Nurs-1.926
CENT-1.727
 COL-1.694
Token
Token ID198
Feature activation+3.82e-02

Pos logit contributions
obi+2.122
ogene+2.121
hene+2.072
aucus+1.936
imum+1.923
Neg logit contributions
semble-2.134
 Crate-1.924
 Nurs-1.896
-1.718
CENT-1.702
Token get
Token ID651
Feature activation-5.93e-01

Pos logit contributions
ogene+6.218
obi+6.077
hene+5.930
anski+5.597
ochet+5.495
Neg logit contributions
semble-5.457
 Nurs-5.004
 Crate-4.982
CENT-4.523
-4.410
Token back
Token ID736
Feature activation+1.44e-01

Pos logit contributions
ogene+5.925
obi+5.890
hene+5.663
ochet+5.196
imum+5.161
Neg logit contributions
semble-6.945
 Crate-6.244
 Nurs-6.200
CENT-5.696
-5.470
Token where
Token ID810
Feature activation-5.81e-02

Pos logit contributions
semble+1.487
 Crate+1.342
 Nurs+1.318
 COL+1.171
CENT+1.170
Neg logit contributions
ogene-1.746
hene-1.702
obi-1.702
aucus-1.605
imum-1.590
Token we
Token ID356
Feature activation+6.05e-01

Pos logit contributions
ogene+0.627
obi+0.611
hene+0.608
aucus+0.566
imum+0.563
Neg logit contributions
semble-0.675
 Crate-0.619
 Nurs-0.609
CENT-0.550
 COL-0.545
Token were
Token ID547
Feature activation+1.21e-01

Pos logit contributions
semble+5.625
 Crate+4.906
 Nurs+4.756
 COL+4.371
 prosecutors+4.317
Neg logit contributions
ogene-7.512
obi-7.278
hene-7.263
imum-6.993
aucus-6.949
Token last
Token ID938
Feature activation-4.88e-01

Pos logit contributions
semble+1.210
 Crate+1.090
 Nurs+1.064
CENT+0.943
 COL+0.940
Neg logit contributions
ogene-1.508
obi-1.472
hene-1.466
aucus-1.378
imum-1.374
Token year
Token ID614
Feature activation-3.78e-02

Pos logit contributions
ogene+4.699
obi+4.677
hene+4.675
aucus+4.177
imum+4.164
Neg logit contributions
semble-5.862
 Nurs-5.372
 Crate-5.366
CENT-4.934
-4.679
Token.
Token ID13
Feature activation-3.93e-01

Pos logit contributions
ogene+0.437
obi+0.426
hene+0.424
aucus+0.397
imum+0.395
Neg logit contributions
semble-0.412
 Crate-0.375
 Nurs-0.368
CENT-0.330
 COL-0.326
Token
Token ID198
Feature activation-8.21e-02

Pos logit contributions
obi+4.467
ogene+4.411
hene+4.337
imum+4.060
aucus+4.048
Neg logit contributions
semble-4.211
 Crate-3.598
 Nurs-3.533
 prosecutors-3.262
CENT-3.243
Token
Token ID198
Feature activation-2.69e-01

Pos logit contributions
ogene+0.961
obi+0.950
hene+0.938
aucus+0.877
imum+0.875
Neg logit contributions
semble-0.871
 Crate-0.787
 Nurs-0.775
CENT-0.691
-0.684
Token"
Token ID1
Feature activation-6.61e-01

Pos logit contributions
ogene+2.864
obi+2.812
hene+2.777
aucus+2.578
imum+2.576
Neg logit contributions
semble-3.155
 Crate-2.898
 Nurs-2.860
CENT-2.554
 COL-2.548
Token remember
Token ID3505
Feature activation-3.97e-02

Pos logit contributions
semble+1.674
 Crate+1.500
 Nurs+1.471
CENT+1.305
 COL+1.295
Neg logit contributions
ogene-2.058
obi-1.998
hene-1.997
aucus-1.888
imum-1.869
Token being
Token ID852
Feature activation+2.97e-01

Pos logit contributions
ogene+0.471
obi+0.460
hene+0.458
aucus+0.428
imum+0.427
Neg logit contributions
semble-0.418
 Crate-0.378
 Nurs-0.372
CENT-0.332
 COL-0.327
Token truly
Token ID4988
Feature activation+2.69e-01

Pos logit contributions
semble+3.116
 Crate+2.802
 Nurs+2.788
CENT+2.471
 COL+2.461
Neg logit contributions
ogene-3.511
obi-3.415
hene-3.401
aucus-3.231
imum-3.213
Token afraid
Token ID7787
Feature activation-2.50e-02

Pos logit contributions
semble+2.815
 Crate+2.532
 Nurs+2.519
CENT+2.239
 COL+2.231
Neg logit contributions
ogene-3.186
obi-3.087
hene-3.078
aucus-2.929
imum-2.915
Token.
Token ID13
Feature activation-1.95e-01

Pos logit contributions
ogene+0.286
obi+0.279
hene+0.277
aucus+0.259
imum+0.258
Neg logit contributions
semble-0.274
 Crate-0.250
 Nurs-0.246
CENT-0.221
 COL-0.218
Token The
Token ID383
Feature activation-1.76e-01

Pos logit contributions
ogene+2.343
obi+2.319
hene+2.292
aucus+2.148
imum+2.135
Neg logit contributions
semble-1.994
 Crate-1.767
 Nurs-1.719
CENT-1.545
-1.521
Token first
Token ID717
Feature activation-6.16e-01

Pos logit contributions
ogene+2.228
obi+2.188
hene+2.162
imum+2.030
aucus+2.027
Neg logit contributions
semble-1.707
 Crate-1.522
 Nurs-1.484
CENT-1.326
-1.290
Token time
Token ID640
Feature activation-4.11e-01

Pos logit contributions
obi+6.394
ogene+6.392
hene+6.252
aucus+5.702
imum+5.696
Neg logit contributions
semble-6.759
 Crate-6.133
 Nurs-6.013
CENT-5.556
-5.413
Token,
Token ID11
Feature activation-4.44e-01

Pos logit contributions
ogene+4.203
obi+4.127
hene+4.086
aucus+3.762
imum+3.682
Neg logit contributions
semble-4.883
 Crate-4.390
 Nurs-4.331
CENT-3.965
-3.880
Token he
Token ID339
Feature activation+7.19e-02

Pos logit contributions
ogene+5.086
obi+4.988
hene+4.882
aucus+4.607
ochet+4.516
Neg logit contributions
semble-4.693
 Crate-4.188
 Nurs-4.085
CENT-3.696
-3.665
Token genuinely
Token ID17170
Feature activation+7.73e-03

Pos logit contributions
semble+0.797
 Crate+0.729
 Nurs+0.718
CENT+0.642
 COL+0.637
Neg logit contributions
ogene-0.814
obi-0.792
hene-0.789
aucus-0.738
imum-0.737
Tokens
Token ID82
Feature activation-8.81e-02

Pos logit contributions
ogene+2.277
obi+2.254
hene+2.233
aucus+2.076
anski+2.064
Neg logit contributions
semble-2.186
 Crate-2.001
 Nurs-1.953
CENT-1.762
 COL-1.734
Token younger
Token ID7099
Feature activation+8.27e-02

Pos logit contributions
ogene+0.958
obi+0.940
hene+0.930
aucus+0.864
imum+0.861
Neg logit contributions
semble-1.015
 Crate-0.928
 Nurs-0.909
CENT-0.827
 COL-0.810
Token brother
Token ID3956
Feature activation-2.24e-01

Pos logit contributions
semble+0.881
 Crate+0.798
 Nurs+0.792
CENT+0.702
 COL+0.698
Neg logit contributions
ogene-0.972
obi-0.945
hene-0.944
aucus-0.884
imum-0.883
Token Vin
Token ID11820
Feature activation+1.39e-01

Pos logit contributions
ogene+2.554
obi+2.529
hene+2.508
aucus+2.321
imum+2.314
Neg logit contributions
semble-2.418
 Crate-2.189
 Nurs-2.128
CENT-1.939
 COL-1.883
Tokenod
Token ID375
Feature activation+5.77e-02

Pos logit contributions
semble+1.832
 Crate+1.696
 Nurs+1.679
CENT+1.538
 COL+1.522
Neg logit contributions
ogene-1.260
hene-1.208
obi-1.200
aucus-1.134
imum-1.106
Token,
Token ID11
Feature activation-4.12e-01

Pos logit contributions
semble+0.593
 Crate+0.540
 Nurs+0.533
CENT+0.467
 COL+0.466
Neg logit contributions
ogene-0.697
obi-0.679
hene-0.675
aucus-0.635
imum-0.633
Token brother
Token ID3956
Feature activation+2.73e-01

Pos logit contributions
ogene+4.619
obi+4.589
hene+4.494
aucus+4.175
anski+4.131
Neg logit contributions
semble-4.465
 Crate-4.032
 Nurs-3.846
CENT-3.560
-3.445
Token-
Token ID12
Feature activation-1.65e-01

Pos logit contributions
semble+2.912
 Nurs+2.696
 Crate+2.644
 COL+2.357
+2.321
Neg logit contributions
ogene-3.166
obi-3.023
hene-3.018
aucus-2.903
anski-2.858
Tokenin
Token ID259
Feature activation-5.92e-01

Pos logit contributions
ogene+1.933
obi+1.901
hene+1.876
aucus+1.753
imum+1.747
Neg logit contributions
semble-1.760
 Crate-1.614
 Nurs-1.573
CENT-1.412
 COL-1.389
Token-
Token ID12
Feature activation-2.34e-01

Pos logit contributions
ogene+5.105
obi+4.928
hene+4.905
anski+4.402
aucus+4.394
Neg logit contributions
semble-7.441
 Crate-7.149
 Nurs-6.844
CENT-6.597
-6.261
Tokenlaw
Token ID6270
Feature activation+3.02e-02

Pos logit contributions
ogene+2.284
obi+2.276
hene+2.256
anski+2.066
ochet+2.044
Neg logit contributions
semble-2.849
 Crate-2.662
 Nurs-2.584
CENT-2.401
 COL-2.333
Token\n
Token ID198
Feature activation+1.85e-01

Pos logit contributions
ogene+0.831
obi+0.814
hene+0.809
aucus+0.761
imum+0.758
Neg logit contributions
semble-0.668
 Crate-0.600
 Nurs-0.588
CENT-0.522
 COL-0.513
Token\n
Token ID198
Feature activation+3.54e-02

Pos logit contributions
semble+1.733
 Crate+1.570
 Nurs+1.543
CENT+1.385
 COL+1.350
Neg logit contributions
ogene-2.401
hene-2.323
obi-2.314
anski-2.184
aucus-2.180
TokenRF
Token ID32754
Feature activation-4.08e-02

Pos logit contributions
semble+0.395
 Crate+0.358
 Nurs+0.354
CENT+0.323
 COL+0.314
Neg logit contributions
ogene-0.400
hene-0.387
obi-0.387
aucus-0.362
imum-0.360
Token:
Token ID25
Feature activation+2.17e-01

Pos logit contributions
ogene+0.480
obi+0.468
hene+0.466
aucus+0.436
imum+0.435
Neg logit contributions
semble-0.435
 Crate-0.395
 Nurs-0.389
CENT-0.347
 COL-0.343
Token And
Token ID843
Feature activation+3.27e-01

Pos logit contributions
semble+2.078
 Crate+1.897
 Nurs+1.873
CENT+1.627
 COL+1.626
Neg logit contributions
ogene-2.779
obi-2.703
hene-2.693
imum-2.537
aucus-2.537
Token it
Token ID340
Feature activation+1.64e-03

Pos logit contributions
semble+3.213
 Crate+2.928
 Nurs+2.865
 COL+2.528
CENT+2.488
Neg logit contributions
ogene-4.036
obi-3.958
hene-3.916
imum-3.732
aucus-3.722
Token varies
Token ID17806
Feature activation-2.29e-01

Pos logit contributions
semble+0.018
 Crate+0.016
 Nurs+0.016
CENT+0.014
 COL+0.014
Neg logit contributions
ogene-0.019
obi-0.019
hene-0.019
aucus-0.017
imum-0.017
Token a
Token ID257
Feature activation+7.99e-02

Pos logit contributions
ogene+2.513
obi+2.444
hene+2.420
aucus+2.270
anski+2.250
Neg logit contributions
semble-2.576
 Crate-2.341
 Nurs-2.323
CENT-2.108
-2.070
Token bit
Token ID1643
Feature activation+8.41e-02

Pos logit contributions
semble+0.958
 Crate+0.881
 Nurs+0.867
CENT+0.786
 COL+0.780
Neg logit contributions
ogene-0.831
obi-0.809
hene-0.804
aucus-0.746
imum-0.744
Token along
Token ID1863
Feature activation-2.66e-01

Pos logit contributions
semble+0.918
 Crate+0.836
 Nurs+0.823
CENT+0.736
 COL+0.731
Neg logit contributions
ogene-0.964
obi-0.941
hene-0.936
aucus-0.876
imum-0.875
Token the
Token ID262
Feature activation-1.93e-01

Pos logit contributions
ogene+2.916
obi+2.836
hene+2.799
aucus+2.610
anski+2.594
Neg logit contributions
semble-2.998
 Crate-2.758
 Nurs-2.723
CENT-2.467
-2.401
Token ahead
Token ID4058
Feature activation-2.13e-01

Pos logit contributions
semble+1.049
 Crate+0.950
 Nurs+0.932
CENT+0.832
 COL+0.825
Neg logit contributions
ogene-1.231
obi-1.207
hene-1.198
aucus-1.130
imum-1.127
Token of
Token ID286
Feature activation-4.22e-01

Pos logit contributions
ogene+2.057
obi+1.988
hene+1.961
aucus+1.805
imum+1.798
Neg logit contributions
semble-2.705
 Crate-2.490
 Nurs-2.452
CENT-2.256
-2.236
Token her
Token ID607
Feature activation-4.45e-01

Pos logit contributions
ogene+4.356
hene+4.168
obi+4.139
aucus+3.981
imum+3.850
Neg logit contributions
semble-4.893
 Crate-4.484
 Nurs-4.414
CENT-4.063
-4.022
Token Dec
Token ID4280
Feature activation+1.82e-01

Pos logit contributions
ogene+4.786
obi+4.665
hene+4.630
aucus+4.300
ochet+4.203
Neg logit contributions
semble-5.009
 Crate-4.545
 Nurs-4.414
CENT-4.042
-4.037
Token.
Token ID13
Feature activation+3.60e-01

Pos logit contributions
semble+1.952
 Crate+1.776
 Nurs+1.747
CENT+1.557
 COL+1.544
Neg logit contributions
ogene-2.102
obi-2.054
hene-2.048
aucus-1.904
anski-1.902
Token 6
Token ID718
Feature activation-2.86e-01

Pos logit contributions
semble+3.594
 Crate+3.300
 Nurs+3.250
 COL+2.862
CENT+2.838
Neg logit contributions
ogene-4.390
hene-4.289
obi-4.278
anski-4.059
imum-3.998
Token runoff
Token ID37536
Feature activation-4.42e-02

Pos logit contributions
ogene+3.073
obi+3.045
hene+3.014
aucus+2.857
imum+2.769
Neg logit contributions
semble-3.266
 Crate-2.994
 Nurs-2.902
CENT-2.624
-2.619
Token against
Token ID1028
Feature activation+1.35e-01

Pos logit contributions
ogene+0.461
obi+0.450
hene+0.447
aucus+0.416
imum+0.413
Neg logit contributions
semble-0.528
 Crate-0.486
 Nurs-0.478
CENT-0.433
 COL-0.427
Token Rep
Token ID1432
Feature activation+4.80e-01

Pos logit contributions
semble+1.382
 Crate+1.255
 Nurs+1.241
 COL+1.097
CENT+1.085
Neg logit contributions
ogene-1.628
obi-1.596
hene-1.594
imum-1.487
aucus-1.485
Token.
Token ID13
Feature activation+5.44e-01

Pos logit contributions
semble+4.625
 Crate+4.200
 Nurs+4.115
 COL+3.648
CENT+3.520
Neg logit contributions
ogene-5.907
obi-5.747
hene-5.725
imum-5.401
aucus-5.373
Token Bill
Token ID3941
Feature activation+7.99e-02

Pos logit contributions
semble+5.258
 Crate+4.974
 Nurs+4.830
 COL+4.429
 prosecutors+4.106
Neg logit contributions
ogene-6.483
obi-6.268
hene-6.259
imum-5.948
anski-5.872
Token with
Token ID351
Feature activation-2.35e-01

Pos logit contributions
ogene+2.989
obi+2.931
hene+2.914
imum+2.705
aucus+2.695
Neg logit contributions
semble-3.304
 Crate-3.017
 Nurs-2.991
CENT-2.695
 COL-2.658
Token AWS
Token ID30865
Feature activation-3.28e-01

Pos logit contributions
ogene+2.575
obi+2.500
hene+2.498
imum+2.320
aucus+2.318
Neg logit contributions
semble-2.637
 Crate-2.403
 Nurs-2.381
CENT-2.165
 COL-2.108
Token.
Token ID13
Feature activation-2.02e-01

Pos logit contributions
ogene+3.464
obi+3.445
hene+3.382
imum+3.165
aucus+3.145
Neg logit contributions
semble-3.731
 Crate-3.339
 Nurs-3.326
CENT-3.017
-2.965
Token We
Token ID775
Feature activation+3.08e-01

Pos logit contributions
ogene+2.411
obi+2.389
hene+2.371
aucus+2.211
imum+2.208
Neg logit contributions
semble-2.051
 Crate-1.856
 Nurs-1.829
CENT-1.638
 COL-1.604
Token will
Token ID481
Feature activation+3.07e-01

Pos logit contributions
semble+2.777
 Crate+2.440
 Nurs+2.391
 COL+2.096
 prosecutors+2.035
Neg logit contributions
ogene-4.114
obi-4.014
hene-3.995
imum-3.816
aucus-3.808
Token deploy
Token ID6061
Feature activation-7.42e-01

Pos logit contributions
semble+3.195
 Crate+2.853
 Nurs+2.795
 COL+2.521
CENT+2.454
Neg logit contributions
ogene-3.717
hene-3.609
obi-3.607
aucus-3.418
imum-3.398
Token the
Token ID262
Feature activation-2.78e-01

Pos logit contributions
ogene+7.121
obi+6.980
hene+6.900
aucus+6.477
ochet+6.468
Neg logit contributions
semble-8.142
 Crate-7.424
 Nurs-7.408
CENT-6.984
 COL-6.603
Token application
Token ID3586
Feature activation-1.03e-01

Pos logit contributions
ogene+2.964
obi+2.906
hene+2.880
aucus+2.654
imum+2.652
Neg logit contributions
semble-3.185
 Crate-2.904
 Nurs-2.863
CENT-2.634
 COL-2.551
Token in
Token ID287
Feature activation+1.04e-01

Pos logit contributions
ogene+1.156
obi+1.127
hene+1.117
aucus+1.041
imum+1.040
Neg logit contributions
semble-1.148
 Crate-1.048
 Nurs-1.035
CENT-0.933
 COL-0.918
Token one
Token ID530
Feature activation+9.24e-02

Pos logit contributions
semble+1.125
 Crate+1.032
 Nurs+1.006
 COL+0.895
CENT+0.888
Neg logit contributions
ogene-1.214
obi-1.181
hene-1.175
aucus-1.108
imum-1.098
Token region
Token ID3814
Feature activation-2.62e-01

Pos logit contributions
semble+0.950
 Crate+0.865
 Nurs+0.847
CENT+0.748
 COL+0.742
Neg logit contributions
ogene-1.124
obi-1.098
hene-1.087
aucus-1.033
imum-1.027
TokenA
Token ID32
Feature activation-4.46e-01

Pos logit contributions
obi+4.134
hene+3.671
ogene+3.596
aucus+3.368
anski+3.243
Neg logit contributions
semble-10.978
 Crate-9.918
 Nurs-9.891
etheless-9.856
���-9.695
Token-
Token ID12
Feature activation-5.46e-01

Pos logit contributions
obi+3.843
ogene+3.790
hene+3.689
aucus+3.369
imum+3.318
Neg logit contributions
semble-5.828
 Crate-5.372
 Nurs-5.243
-4.948
 prosecutors-4.871
TokenFat
Token ID33804
Feature activation-4.25e-01

Pos logit contributions
ogene+5.764
obi+5.740
hene+5.557
aucus+5.259
imum+5.230
Neg logit contributions
semble-6.081
 Crate-5.515
 Nurs-5.354
 prosecutors-4.856
-4.782
Token C
Token ID327
Feature activation+2.37e-01

Pos logit contributions
ogene+5.114
obi+5.021
hene+4.964
ochet+4.658
imum+4.653
Neg logit contributions
semble-4.261
 Crate-3.775
 Nurs-3.650
CENT-3.334
 prosecutors-3.180
Tokenops
Token ID2840
Feature activation-1.64e-01

Pos logit contributions
semble+2.738
 Crate+2.514
 Nurs+2.513
 COL+2.211
CENT+2.205
Neg logit contributions
ogene-2.518
obi-2.408
hene-2.407
imum-2.261
aucus-2.255
Token-
Token ID12
Feature activation-7.50e-01

Pos logit contributions
ogene+1.796
obi+1.755
hene+1.741
aucus+1.623
imum+1.617
Neg logit contributions
semble-1.871
 Crate-1.708
 Nurs-1.677
CENT-1.519
 COL-1.499
TokenCl
Token ID2601
Feature activation+2.63e-01

Pos logit contributions
obi+7.405
ogene+7.367
hene+7.121
aucus+6.687
ochet+6.616
Neg logit contributions
semble-8.425
 Crate-7.608
 Nurs-7.370
 prosecutors-6.778
 reversible-6.659
Tokenover
Token ID2502
Feature activation-2.88e-02

Pos logit contributions
semble+2.905
 Crate+2.655
 Nurs+2.607
CENT+2.312
 COL+2.310
Neg logit contributions
ogene-2.905
hene-2.818
obi-2.800
aucus-2.614
imum-2.604
Tokenfield
Token ID3245
Feature activation-1.61e-01

Pos logit contributions
ogene+0.305
obi+0.297
hene+0.295
aucus+0.274
imum+0.273
Neg logit contributions
semble-0.342
 Crate-0.316
 Nurs-0.312
CENT-0.281
 COL-0.279
Token-
Token ID12
Feature activation-6.75e-01

Pos logit contributions
ogene+1.830
obi+1.790
hene+1.777
aucus+1.661
imum+1.652
Neg logit contributions
semble-1.771
 Crate-1.606
 Nurs-1.577
CENT-1.427
 COL-1.400
TokenK
Token ID42
Feature activation-5.16e-01

Pos logit contributions
obi+6.714
ogene+6.696
hene+6.461
aucus+6.065
ochet+5.992
Neg logit contributions
semble-7.665
 Crate-6.965
 Nurs-6.772
 prosecutors-6.190
 COL-6.077
Tokenc
Token ID66
Feature activation+4.83e-01

Pos logit contributions
semble+9.014
 Crate+7.889
 Nurs+7.479
forms+6.852
CENT+6.675
Neg logit contributions
ogene-10.818
hene-10.211
imum-10.037
obi-10.015
aucus-9.997
Token7
Token ID22
Feature activation-1.48e-01

Pos logit contributions
semble+4.137
 Crate+3.526
 Nurs+3.433
CENT+3.051
 COL+2.954
Neg logit contributions
ogene-6.444
hene-6.300
obi-6.264
aucus-6.037
imum-5.923
Token {
Token ID1391
Feature activation+1.82e-01

Pos logit contributions
ogene+1.440
obi+1.414
hene+1.380
imum+1.279
aucus+1.271
Neg logit contributions
semble-1.862
 Crate-1.702
 Nurs-1.682
CENT-1.550
-1.519
Token(
Token ID7
Feature activation-8.85e-03

Pos logit contributions
semble+1.581
 Crate+1.406
 Nurs+1.381
CENT+1.210
 COL+1.181
Neg logit contributions
ogene-2.489
hene-2.420
obi-2.406
aucus-2.299
imum-2.283
Token108
Token ID15711
Feature activation-1.05e-01

Pos logit contributions
ogene+0.113
obi+0.110
hene+0.110
aucus+0.103
imum+0.103
Neg logit contributions
semble-0.086
 Crate-0.077
 Nurs-0.076
CENT-0.067
 COL-0.066
Tokens
Token ID82
Feature activation-7.26e-01

Pos logit contributions
ogene+0.895
obi+0.880
hene+0.857
imum+0.785
aucus+0.785
Neg logit contributions
semble-1.448
 Crate-1.351
 Nurs-1.328
CENT-1.226
 COL-1.211
Token)}
Token ID38165
Feature activation+6.35e-01

Pos logit contributions
obi+3.217
ogene+2.981
hene+2.731
ochet+2.336
imum+2.152
Neg logit contributions
semble-12.102
 Nurs-11.464
 Crate-11.414
���-10.883
-10.817
Token 32
Token ID3933
Feature activation+3.83e-01

Pos logit contributions
semble+4.163
 Crate+3.774
 Nurs+3.761
 COL+3.187
+2.788
Neg logit contributions
ogene-9.288
obi-9.277
hene-9.139
aucus-8.816
anski-8.790
Token.
Token ID13
Feature activation+5.05e-01

Pos logit contributions
semble+3.755
 Crate+3.420
 Nurs+3.346
 COL+2.960
CENT+2.914
Neg logit contributions
ogene-4.796
hene-4.641
obi-4.630
aucus-4.397
imum-4.386
Token K
Token ID509
Feature activation+5.41e-01

Pos logit contributions
semble+4.623
 Crate+4.347
 Nurs+4.214
 COL+3.715
 prosecutors+3.493
Neg logit contributions
ogene-6.383
obi-6.159
hene-6.144
aucus-6.000
imum-5.950
Tokenf
Token ID69
Feature activation+7.61e-01

Pos logit contributions
semble+6.051
 Crate+5.284
 Nurs+5.162
CENT+4.699
forms+4.666
Neg logit contributions
ogene-5.781
hene-5.487
obi-5.369
imum-5.290
aucus-5.288
Token competition
Token ID5449
Feature activation-4.31e-01

Pos logit contributions
ogene+0.632
obi+0.617
hene+0.612
aucus+0.573
imum+0.572
Neg logit contributions
semble-0.577
 Crate-0.523
 Nurs-0.514
CENT-0.460
 COL-0.452
Token while
Token ID981
Feature activation-4.57e-01

Pos logit contributions
ogene+4.588
obi+4.514
hene+4.419
imum+4.133
aucus+4.120
Neg logit contributions
semble-4.871
 Crate-4.430
 Nurs-4.356
CENT-3.905
-3.856
Token at
Token ID379
Feature activation-6.12e-01

Pos logit contributions
ogene+4.906
hene+4.703
obi+4.674
aucus+4.384
anski+4.372
Neg logit contributions
semble-5.087
 Crate-4.616
 Nurs-4.572
CENT-4.135
-4.055
Token the
Token ID262
Feature activation-5.83e-01

Pos logit contributions
ogene+5.991
hene+5.812
obi+5.751
imum+5.435
anski+5.391
Neg logit contributions
semble-7.216
 Crate-6.564
 Nurs-6.478
CENT-5.921
-5.777
Token same
Token ID976
Feature activation-8.49e-02

Pos logit contributions
ogene+5.023
obi+4.991
hene+4.981
imum+4.576
anski+4.472
Neg logit contributions
semble-7.326
 Crate-6.677
 Nurs-6.580
CENT-6.226
-6.049
Token time
Token ID640
Feature activation-1.04e+00

Pos logit contributions
ogene+1.070
obi+1.047
hene+1.043
aucus+0.980
imum+0.978
Neg logit contributions
semble-0.829
 Crate-0.747
 Nurs-0.735
CENT-0.649
 COL-0.635
Token we
Token ID356
Feature activation-5.35e-01

Pos logit contributions
ogene+9.975
hene+9.467
obi+9.446
anski+8.968
imum+8.774
Neg logit contributions
semble-10.574
 Nurs-9.486
 Crate-9.292
CENT-8.651
-8.542
Token are
Token ID389
Feature activation-5.99e-01

Pos logit contributions
ogene+5.714
hene+5.585
obi+5.558
anski+5.159
ochet+5.157
Neg logit contributions
semble-5.774
 Crate-5.231
 Nurs-5.226
CENT-4.686
-4.616
Token not
Token ID407
Feature activation-3.59e-01

Pos logit contributions
ogene+6.328
hene+6.043
obi+6.022
aucus+5.582
anski+5.544
Neg logit contributions
semble-6.493
 Crate-5.866
 Nurs-5.830
CENT-5.306
-5.258
Token neglect
Token ID17985
Feature activation-2.48e-01

Pos logit contributions
ogene+3.926
obi+3.816
hene+3.755
aucus+3.516
imum+3.484
Neg logit contributions
semble-4.010
 Crate-3.627
 Nurs-3.593
CENT-3.261
-3.156
Tokening
Token ID278
Feature activation-4.78e-02

Pos logit contributions
ogene+2.171
obi+2.120
hene+2.093
imum+1.922
aucus+1.912
Neg logit contributions
semble-3.344
 Crate-3.102
 Nurs-3.054
CENT-2.818
 COL-2.762
Token him
Token ID683
Feature activation+5.45e-01

Pos logit contributions
semble+2.790
 Crate+2.563
 Nurs+2.517
 COL+2.221
CENT+2.204
Neg logit contributions
ogene-3.958
obi-3.884
hene-3.860
aucus-3.661
anski-3.646
Token on
Token ID319
Feature activation-8.49e-02

Pos logit contributions
semble+4.348
 Crate+3.967
 Nurs+3.769
CENT+3.370
+3.305
Neg logit contributions
ogene-7.467
obi-7.227
hene-7.120
imum-6.901
ochet-6.796
Token Twitter
Token ID3009
Feature activation+2.48e-01

Pos logit contributions
ogene+1.030
obi+1.008
hene+1.002
aucus+0.940
imum+0.937
Neg logit contributions
semble-0.874
 Crate-0.790
 Nurs-0.776
CENT-0.690
 COL-0.679
Token @
Token ID2488
Feature activation+2.44e-01

Pos logit contributions
semble+2.089
 Crate+1.876
 Nurs+1.840
CENT+1.601
 COL+1.580
Neg logit contributions
ogene-3.423
obi-3.335
hene-3.313
aucus-3.163
imum-3.147
Tokenjon
Token ID46286
Feature activation+6.96e-01

Pos logit contributions
semble+2.690
 Crate+2.433
 Nurs+2.396
CENT+2.235
+2.171
Neg logit contributions
ogene-2.687
hene-2.605
obi-2.589
imum-2.467
aucus-2.441
Tokenchest
Token ID46713
Feature activation-7.57e-01

Pos logit contributions
semble+6.809
 Crate+5.960
 Nurs+5.704
CENT+5.653
forms+5.345
Neg logit contributions
ogene-7.946
obi-7.296
hene-7.256
anski-7.097
imum-7.047
Tokeno
Token ID78
Feature activation-2.05e-01

Pos logit contributions
obi+6.529
ogene+6.454
hene+6.028
ochet+5.675
aucus+5.610
Neg logit contributions
semble-9.795
 Crate-9.411
 Nurs-9.205
CENT-8.535
 COL-8.339
Token<|endoftext|>
Token ID50256
Feature activation+1.21e-01

Pos logit contributions
ogene+2.863
obi+2.836
hene+2.814
aucus+2.666
imum+2.650
Neg logit contributions
semble-1.718
 Crate-1.523
 Nurs-1.476
CENT-1.265
 COL-1.233
TokenAn
Token ID2025
Feature activation+3.05e-01

Pos logit contributions
semble+1.376
 Crate+1.244
 Nurs+1.228
CENT+1.142
 COL+1.098
Neg logit contributions
ogene-1.351
hene-1.304
obi-1.294
aucus-1.222
imum-1.211
Token Open
Token ID4946
Feature activation+3.22e-01

Pos logit contributions
semble+3.343
 Nurs+2.995
 Crate+2.992
 COL+2.678
CENT+2.644
Neg logit contributions
ogene-3.381
obi-3.300
hene-3.271
aucus-3.139
anski-3.081
Token Letter
Token ID18121
Feature activation-1.03e-01

Pos logit contributions
semble+2.757
 Crate+2.532
 Nurs+2.478
 COL+2.094
CENT+2.068
Neg logit contributions
ogene-4.419
obi-4.281
hene-4.250
anski-4.056
aucus-4.043