TOP ACTIVATIONS
MAX = 20.451

ized all unions into signing an unprecedented solidarity pact to oppose
Sciences in Memory of Alfred Nobel A ward ed
Jo ong Ang Il bo reported that in W
Vol ody my r Gol ov ko , Chairman of
Collective Barg aining Agreement ( C BA ) process involving the
according to the University of Hawai i , there
and low in magnesium and iron ." Chem Cam
of dialogue (" Hey , Ker mit !" and " Yeah
farmer who has lived in Shang ba all his life .
doesn 't , we 'll play ball ." Also
com or in ebook and Kindle formats . A 20 %
. Sa arak k ala , 28 , said
villagers use well water in Shang ba for drinking but tests
, and must roll to avoid following the commands in his
job of getting low and launches himself into Bo z eman
19 . Brian Al ain u ' uese (
increases levels of the neurotrans mitter acet yl ch oline in
About B ala Han uman Temple Here 7 .
Senior public servants superv ise ang l ophone and franc
with ker atin 1 and ker atin 16 .[ 23 ]

MIN ACTIVATIONS
MIN = -22.720

x . x . x . 118 10 95 607 x
8 x . x . x . 118 10 95 607
0000 8 x . x . x . 118 10 95
12 x . x . x . 114 112 59 07
x . x . x . 114 112 59 07 x
397 12 x . x . x . 114 112 59
2010 Matthew Morrison 2010 Tony Shal h oub
mel Kan , pour le Command ement des Cors aires de
x . x . x . 120 10 25 290 x
59 326 x . x . x . 116 104 17
25 151 x . x . x . 113 12 397
beat or other . 12 . Donkey Kong Country
ussed joint venture , S BI Wire x Asia . The
c epend ant que le Command ement est total ement d
full of possibility . 10 . Red Dead Redemption
36 203 x . x . x . 112 10 25
203 x . x . x . 112 10 25 151
2010 Larry David 2010 Matthew Morrison
25 290 x . x . x . 121 119 47
git im ité , le Command ement a ag i comm

INTERVAL 9.658 - 20.451
CONTAINS 0.640%

moderator Randy Sh and obil , right , after a debate
Rivals Scout Others Marvin Wilson DT 6 ' 4 / 329
a very good knowledge of ocean circulation " But
Canadian Federation of Independent Business ) The provincial government and its
best Sa isons made in Ireland . So crisp , refreshing

INTERVAL -1.134 - 9.658
CONTAINS 62.075%

the w ane . They are at their heart a data
adic gunfire rocked the new nation s capital over
and the Human Gen ome Project both complete 99 % complete
support when criticised . Second , a Europe that is getting
s actually black ? These kinds of

INTERVAL -11.927 - -1.134
CONTAINS 37.080%

on Democrats by emphasizing future spending reductions to Medicare that would
re going to take the first strike ,
sociology of race and gender to score a de cept ively
academic problems such as racism , sexism , trans ph obia
, assist them and even as happened last week

INTERVAL -22.720 - -11.927
CONTAINS 0.205%

6 { ( 6 s )} 38 . N d 4
15 x . x . x . 130 120 45 25
59 326 x . x . x . 116 104 17
05 87 x . x . x . 101 114 65
... at 4 a . m . ... after a prim
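
NOTE ON THE INTERVALS
The four INTERVAL ranges above are consistent with splitting the span from MIN to MAX into four equal-width bins, and the four CONTAINS percentages add up to 100%. The sketch below reproduces that arithmetic as a sanity check. It assumes equal-width binning, which the dump itself does not state, and the names equal_width_edges, reported_edges, and shares are hypothetical and not part of any dashboard tooling.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced bin edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# Values copied from the dump above.
MIN_ACT = -22.720
MAX_ACT = 20.451

# Interval boundaries as printed in the dump, lowest first.
reported_edges = [-22.720, -11.927, -1.134, 9.658, 20.451]

# CONTAINS percentages as printed, top interval first.
shares = [0.640, 62.075, 37.080, 0.205]

# Assumption: four equal-width bins between MIN and MAX.
edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# Largest gap between a computed edge and the printed one.
max_gap = max(abs(e - r) for e, r in zip(edges, reported_edges))
total_share = sum(shares)

print("computed edges:", [round(e, 3) for e in edges])
print("largest gap vs. printed edges:", max_gap)
print("sum of CONTAINS percentages:", total_share)
```

Under this assumption the computed edges agree with the printed ones to within the three-decimal rounding used in the dump, so the two middle intervals between them hold roughly 99% of the sampled activations.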
Tokenized
Token ID1143
Feature activation+6.86e+00

Pos logit contributions
ycle+0.514
ardless+0.506
 Yen+0.506
aunder+0.504
esy+0.499
Neg logit contributions
 Binding-0.622
 LAR-0.620
rador-0.619
NPR-0.597
 Goo-0.596
Token all
Token ID477
Feature activation+1.25e+00

Pos logit contributions
 Binding+4.117
 LAR+4.091
rador+4.014
 antitrust+3.821
NPR+3.807
Neg logit contributions
ycle-4.968
ardless-4.955
aunder-4.947
esy-4.865
erial-4.826
Token unions
Token ID11936
Feature activation+5.82e+00

Pos logit contributions
 Binding+0.818
 LAR+0.817
rador+0.812
NPR+0.780
 Goo+0.776
Neg logit contributions
ycle-0.869
ardless-0.859
 Yen-0.856
aunder-0.855
esy-0.846
Token into
Token ID656
Feature activation+1.12e+01

Pos logit contributions
 LAR+3.470
 Binding+3.460
rador+3.429
NPR+3.291
 Ala+3.244
Neg logit contributions
ycle-4.253
aunder-4.210
ardless-4.199
esy-4.189
 Yen-4.186
Token signing
Token ID8415
Feature activation+1.04e+01

Pos logit contributions
 Binding+5.478
 LAR+5.475
rador+5.207
 antitrust+5.041
 Ala+4.954
Neg logit contributions
aunder-9.230
ycle-9.108
ardless-8.986
esy-8.859
 Paste-8.815
Token an
Token ID281
Feature activation+2.05e+01

Pos logit contributions
 Binding+6.288
 LAR+6.082
rador+5.791
 Ala+5.751
 antitrust+5.706
Neg logit contributions
ycle-7.341
aunder-7.319
ardless-7.293
erial-7.192
esy-7.185
Token unprecedented
Token ID13029
Feature activation+1.61e+01

Pos logit contributions
 antitrust+10.296
 Binding+10.164
 LAR+9.969
 Ala+9.826
NPR+9.771
Neg logit contributions
ycle-13.917
erial-13.435
 Paste-13.307
ardless-13.070
esy-12.949
Token solidarity
Token ID17803
Feature activation+6.61e+00

Pos logit contributions
 LAR+8.718
 Binding+8.696
 antitrust+8.368
rador+7.798
 Goo+7.763
Neg logit contributions
ycle-11.406
aunder-11.169
ardless-11.074
esy-10.953
ּ-10.794
Token pact
Token ID28242
Feature activation+8.42e+00

Pos logit contributions
 LAR+3.185
 Binding+3.147
rador+2.998
 Goo+2.869
NPR+2.821
Neg logit contributions
aunder-5.671
ycle-5.644
esy-5.629
ardless-5.580
 Paste-5.535
Token to
Token ID284
Feature activation+5.35e+00

Pos logit contributions
 Binding+4.837
 LAR+4.820
rador+4.652
 Ala+4.528
NPR+4.528
Neg logit contributions
ycle-6.269
aunder-6.135
esy-6.090
 Yen-6.076
ardless-6.072
Token oppose
Token ID12856
Feature activation-3.59e+00

Pos logit contributions
 Binding+3.453
 LAR+3.432
rador+3.353
 Ala+3.252
NPR+3.251
Neg logit contributions
ycle-3.678
ardless-3.645
aunder-3.627
esy-3.621
 Yen-3.591
Token Sciences
Token ID13473
Feature activation+4.85e+00

Pos logit contributions
ycle+0.745
 Yen+0.737
ardless+0.736
aunder+0.729
 Paste+0.724
Neg logit contributions
 Binding-1.078
 LAR-1.077
rador-1.075
NPR-1.040
 Goo-1.035
Token in
Token ID287
Feature activation+6.79e+00

Pos logit contributions
 Binding+2.828
 LAR+2.814
rador+2.790
 Ala+2.658
NPR+2.655
Neg logit contributions
ardless-3.634
ycle-3.630
esy-3.587
aunder-3.566
 Yen-3.562
Token Memory
Token ID14059
Feature activation+4.23e+00

Pos logit contributions
 Binding+3.247
 LAR+3.177
 Goo+2.991
rador+2.968
 Ala+2.891
Neg logit contributions
ycle-5.895
ardless-5.874
aunder-5.864
esy-5.809
erial-5.702
Token of
Token ID286
Feature activation+7.06e+00

Pos logit contributions
 LAR+2.600
 Binding+2.552
rador+2.490
 Ala+2.435
 Goo+2.433
Neg logit contributions
ardless-3.093
ycle-3.069
aunder-3.046
esy-2.998
 Yen-2.996
Token Alfred
Token ID22044
Feature activation+7.60e+00

Pos logit contributions
 LAR+3.363
 Binding+3.322
 Ala+3.175
rador+3.150
 Goo+3.064
Neg logit contributions
ardless-6.068
ycle-5.971
aunder-5.944
esy-5.921
erial-5.896
Token Nobel
Token ID20715
Feature activation+2.03e+01

Pos logit contributions
 LAR+4.189
 Binding+4.055
 Goo+3.731
 Ala+3.711
rador+3.704
Neg logit contributions
ycle-6.103
ardless-6.090
erial-5.980
aunder-5.969
esy-5.943
Token
Token ID198
Feature activation+9.74e-01

Pos logit contributions
 LAR+10.237
 Binding+9.648
rador+9.448
 az+9.001
 Az+8.915
Neg logit contributions
esy-13.711
ardless-13.543
ycle-13.053
erial-13.003
aunder-12.971
Token
Token ID198
Feature activation+5.77e+00

Pos logit contributions
 LAR+0.539
 Binding+0.537
rador+0.529
NPR+0.512
 Ala+0.506
Neg logit contributions
ycle-0.773
ardless-0.766
aunder-0.763
 Yen-0.763
esy-0.753
TokenA
Token ID32
Feature activation+6.57e+00

Pos logit contributions
 LAR+3.509
NPR+3.479
rador+3.479
 Binding+3.464
NBA+3.339
Neg logit contributions
 Yen-4.139
ardless-4.134
ycle-4.099
aunder-4.017
 Paste-3.982
Tokenward
Token ID904
Feature activation+7.50e-01

Pos logit contributions
 LAR+3.987
 Binding+3.848
rador+3.836
 Goo+3.552
 Ala+3.528
Neg logit contributions
ycle-4.879
aunder-4.830
ardless-4.818
 Yen-4.771
esy-4.722
Tokened
Token ID276
Feature activation+6.23e+00

Pos logit contributions
 Binding+0.615
 LAR+0.614
rador+0.611
NPR+0.591
 Goo+0.589
Neg logit contributions
ycle-0.394
ardless-0.390
 Yen-0.387
aunder-0.387
esy-0.381
Token
Token ID198
Feature activation-5.60e-01

Pos logit contributions
 Binding+3.646
 LAR+3.623
rador+3.511
 Ala+3.480
 Goo+3.457
Neg logit contributions
ycle-4.524
ardless-4.506
aunder-4.490
esy-4.422
 Paste-4.334
Token
Token ID198
Feature activation+3.92e+00

Pos logit contributions
ycle+0.426
ardless+0.423
 Yen+0.423
aunder+0.420
esy+0.417
Neg logit contributions
 Binding-0.326
 LAR-0.324
rador-0.324
NPR-0.309
 Goo-0.307
TokenJo
Token ID9908
Feature activation+7.93e+00

Pos logit contributions
NPR+2.344
 LAR+2.343
 Binding+2.323
rador+2.306
NBA+2.243
Neg logit contributions
ycle-2.922
ardless-2.890
 Yen-2.852
aunder-2.852
esy-2.803
Tokenong
Token ID506
Feature activation+1.20e+01

Pos logit contributions
rador+4.175
 LAR+4.101
 Binding+4.032
 Goo+3.999
NPR+3.942
Neg logit contributions
ycle-6.157
ardless-6.097
 Paste-6.065
aunder-6.022
 Yen-6.021
TokenAng
Token ID13450
Feature activation+2.33e+00

Pos logit contributions
rador+2.769
 Binding+2.440
 Goo+2.312
Az+2.272
 LAR+2.204
Neg logit contributions
aunder-12.657
ycle-12.621
 Paste-12.593
ardless-12.591
 Cantor-12.520
Token Il
Token ID13778
Feature activation+2.01e+01

Pos logit contributions
 Binding+1.104
 LAR+1.097
rador+1.089
 Ala+1.038
 Goo+1.029
Neg logit contributions
ycle-2.032
ardless-2.019
aunder-1.998
 Yen-1.995
esy-1.983
Tokenbo
Token ID2127
Feature activation-1.36e+00

Pos logit contributions
ob+4.831
rador+4.229
NBA+3.787
 Goo+3.664
Az+3.619
Neg logit contributions
 Cantor-19.427
 Yen-19.075
ּ-18.904
 Reincarn-18.831
aunder-18.677
Token reported
Token ID2098
Feature activation-2.04e-01

Pos logit contributions
ycle+0.883
 Yen+0.877
ardless+0.873
aunder+0.870
 Paste+0.862
Neg logit contributions
 Binding-0.939
 LAR-0.936
rador-0.932
NPR-0.899
 Goo-0.893
Token that
Token ID326
Feature activation+4.11e+00

Pos logit contributions
ycle+0.132
ardless+0.131
aunder+0.130
 Yen+0.130
esy+0.129
Neg logit contributions
 Binding-0.142
 LAR-0.142
rador-0.141
NPR-0.136
 Ala-0.136
Token in
Token ID287
Feature activation+4.33e+00

Pos logit contributions
 LAR+2.616
 Binding+2.584
rador+2.510
 Ala+2.491
 Goo+2.467
Neg logit contributions
ycle-2.927
ardless-2.917
aunder-2.909
esy-2.841
 Paste-2.801
Token W
Token ID370
Feature activation-2.40e+00

Pos logit contributions
 LAR+2.715
 Binding+2.706
rador+2.664
 Goo+2.594
 Ala+2.588
Neg logit contributions
ycle-3.095
ardless-3.081
aunder-3.038
esy-3.014
 Paste-2.967
Token
Token ID198
Feature activation+8.75e-01

Pos logit contributions
 LAR+0.535
 Binding+0.533
rador+0.526
NPR+0.507
 Ala+0.502
Neg logit contributions
ycle-0.760
ardless-0.753
 Yen-0.751
aunder-0.750
esy-0.741
TokenVol
Token ID16598
Feature activation-4.15e+00

Pos logit contributions
 LAR+0.531
 Binding+0.530
rador+0.527
NPR+0.514
 Ala+0.503
Neg logit contributions
ycle-0.643
ardless-0.639
 Yen-0.637
aunder-0.633
 Paste-0.627
Tokenody
Token ID1118
Feature activation-8.57e+00

Pos logit contributions
ycle+3.231
ardless+3.144
aunder+3.139
 Yen+3.136
esy+3.121
Neg logit contributions
 Binding-2.410
 LAR-2.399
rador-2.336
 Ala-2.269
 Goo-2.268
Tokenmy
Token ID1820
Feature activation-9.80e-01

Pos logit contributions
ycle+3.361
esy+3.281
 Yen+3.204
ardless+3.195
aunder+3.195
Neg logit contributions
 Binding-8.045
 LAR-7.867
 Ala-7.781
 Goo-7.776
rador-7.691
Tokenr
Token ID81
Feature activation+4.18e+00

Pos logit contributions
ycle+0.478
ardless+0.470
 Yen+0.467
aunder+0.466
esy+0.461
Neg logit contributions
 Binding-0.842
 LAR-0.840
rador-0.833
NPR-0.812
 Goo-0.810
Token Gol
Token ID26849
Feature activation+1.87e+01

Pos logit contributions
 Binding+2.527
 LAR+2.514
rador+2.473
 Goo+2.428
 Ala+2.424
Neg logit contributions
ardless-3.066
ycle-3.063
aunder-3.018
esy-3.000
 Yen-2.968
Tokenov
Token ID709
Feature activation+1.22e+01

Pos logit contributions
ob+4.986
vas+4.151
rador+4.143
 LAR+3.782
lov+3.606
Neg logit contributions
ardless-17.733
 Yen-17.689
 Clapper-17.250
 Paste-17.172
 Cantor-17.083
Tokenko
Token ID7204
Feature activation+1.11e+00

Pos logit contributions
rador+6.046
 LAR+5.945
ob+5.523
 Goo+5.505
 Binding+5.472
Neg logit contributions
ardless-9.316
 Yen-9.235
 Paste-9.062
ycle-9.058
 Cantor-8.966
Token,
Token ID11
Feature activation+2.74e+00

Pos logit contributions
 LAR+0.783
 Binding+0.780
rador+0.775
 Ala+0.750
 Goo+0.747
Neg logit contributions
ycle-0.709
ardless-0.706
esy-0.692
 Yen-0.691
aunder-0.689
Token Chairman
Token ID12787
Feature activation+3.66e+00

Pos logit contributions
 LAR+1.716
 Binding+1.709
rador+1.672
 Ala+1.642
 Goo+1.630
Neg logit contributions
ycle-1.973
ardless-1.963
aunder-1.940
esy-1.937
 Yen-1.911
Token of
Token ID286
Feature activation-2.72e-01

Pos logit contributions
 LAR+2.343
rador+2.320
 Binding+2.319
 Ala+2.242
 Goo+2.229
Neg logit contributions
ycle-2.559
ardless-2.540
esy-2.534
aunder-2.524
 Yen-2.482
Token Collective
Token ID29128
Feature activation+3.24e+00

Pos logit contributions
 Binding+3.029
 LAR+3.000
rador+2.829
 Goo+2.780
 antitrust+2.728
Neg logit contributions
ycle-4.158
ardless-4.119
aunder-4.087
 Yen-4.055
esy-4.047
Token Barg
Token ID50062
Feature activation+3.64e+00

Pos logit contributions
 Binding+1.718
 LAR+1.691
rador+1.628
 Goo+1.585
NPR+1.571
Neg logit contributions
ycle-2.676
aunder-2.670
 Yen-2.658
ardless-2.642
esy-2.622
Tokenaining
Token ID1397
Feature activation+5.15e+00

Pos logit contributions
 Binding+2.423
rador+2.377
 LAR+2.367
NPR+2.281
NBA+2.275
Neg logit contributions
ycle-2.482
 Yen-2.478
aunder-2.441
 Paste-2.389
ardless-2.353
Token Agreement
Token ID12729
Feature activation+7.95e-01

Pos logit contributions
 Binding+1.741
 LAR+1.641
 Goo+1.512
rador+1.485
NPR+1.416
Neg logit contributions
aunder-5.322
ycle-5.286
ardless-5.240
 Yen-5.196
esy-5.191
Token (
Token ID357
Feature activation+4.75e+00

Pos logit contributions
 Binding+0.541
 LAR+0.539
rador+0.535
NPR+0.515
 Goo+0.513
Neg logit contributions
ycle-0.529
ardless-0.524
 Yen-0.522
aunder-0.521
esy-0.515
TokenC
Token ID34
Feature activation+1.82e+01

Pos logit contributions
 Binding+3.179
 LAR+3.134
rador+3.070
NPR+3.066
NBA+3.004
Neg logit contributions
 Yen-3.261
ycle-3.172
ardless-3.159
aunder-3.134
 Paste-3.103
TokenBA
Token ID4339
Feature activation-1.53e+00

Pos logit contributions
 Binding+3.617
NBA+3.527
LB+2.948
ob+2.917
NPR+2.906
Neg logit contributions
 Yen-19.309
esy-18.745
 Paste-18.521
 Nunes-18.518
 Reincarn-18.310
Token)
Token ID8
Feature activation-1.82e+00

Pos logit contributions
ycle+0.937
ardless+0.930
 Yen+0.922
esy+0.908
aunder+0.908
Neg logit contributions
 LAR-1.121
rador-1.121
 Binding-1.115
NPR-1.076
 Goo-1.074
Token process
Token ID1429
Feature activation-3.79e+00

Pos logit contributions
ycle+1.213
ardless+1.208
 Yen+1.203
 Paste+1.189
aunder+1.186
Neg logit contributions
rador-1.217
 LAR-1.212
 Binding-1.203
NPR-1.169
 Goo-1.160
Token involving
Token ID7411
Feature activation-1.35e+00

Pos logit contributions
ycle+2.389
ardless+2.389
 Yen+2.368
 Paste+2.337
aunder+2.332
Neg logit contributions
rador-2.649
 LAR-2.629
 Binding-2.621
NPR-2.558
 Goo-2.541
Token the
Token ID262
Feature activation+2.08e+00

Pos logit contributions
ycle+0.950
 Yen+0.943
ardless+0.942
aunder+0.936
 Paste+0.930
Neg logit contributions
rador-0.852
 Binding-0.851
 LAR-0.849
NPR-0.818
 Goo-0.810
Token according
Token ID1864
Feature activation+1.81e+00

Pos logit contributions
ycle+0.668
ardless+0.662
 Yen+0.660
aunder+0.657
esy+0.651
Neg logit contributions
 Binding-0.642
 LAR-0.641
rador-0.638
NPR-0.613
 Goo-0.610
Token to
Token ID284
Feature activation-8.55e+00

Pos logit contributions
 LAR+1.162
 Binding+1.154
rador+1.139
NPR+1.105
 Goo+1.102
Neg logit contributions
ycle-1.272
ardless-1.261
aunder-1.251
 Yen-1.247
esy-1.239
Token the
Token ID262
Feature activation-7.01e+00

Pos logit contributions
 Yen+5.418
 Paste+5.389
ycle+5.288
 Schumer+5.217
aunder+5.198
Neg logit contributions
rador-5.865
 Binding-5.649
 LAR-5.601
NPR-5.441
 Goo-5.370
Token University
Token ID2059
Feature activation+1.25e+00

Pos logit contributions
 Yen+4.550
 Paste+4.523
ycle+4.431
ardless+4.367
aunder+4.359
Neg logit contributions
rador-4.795
 Binding-4.595
 LAR-4.561
NPR-4.435
 Goo-4.370
Token of
Token ID286
Feature activation+4.50e+00

Pos logit contributions
 Binding+0.839
 LAR+0.837
rador+0.830
NPR+0.802
 Goo+0.799
Neg logit contributions
ycle-0.837
ardless-0.833
 Yen-0.828
aunder-0.826
esy-0.817
Token Hawai
Token ID10847
Feature activation+1.81e+01

Pos logit contributions
 LAR+2.213
 Binding+2.208
 Ala+2.170
 Goo+2.113
rador+2.061
Neg logit contributions
ycle-3.855
ardless-3.837
aunder-3.803
esy-3.769
 Yen-3.717
Token
Token ID447
Feature activation+4.77e+00

Pos logit contributions
 Ala+8.373
 LAR+8.251
rador+7.906
Az+7.854
 Goo+7.747
Neg logit contributions
ardless-13.444
aunder-13.269
ycle-12.815
 Paste-12.755
 Yen-12.736
Token
Token ID247
Feature activation+6.17e+00

Pos logit contributions
rador+3.611
 LAR+3.595
 Binding+3.564
NPR+3.468
 Ala+3.465
Neg logit contributions
ycle-2.755
 Yen-2.750
aunder-2.731
ardless-2.722
 Paste-2.693
Tokeni
Token ID72
Feature activation+3.19e+00

Pos logit contributions
 Binding+3.777
rador+3.748
 LAR+3.738
 Ala+3.664
 Goo+3.602
Neg logit contributions
ycle-4.354
 Yen-4.301
ardless-4.292
aunder-4.268
esy-4.220
Token,
Token ID11
Feature activation-5.32e-01

Pos logit contributions
 Binding+2.093
 LAR+2.090
rador+2.054
NPR+1.994
 Ala+1.990
Neg logit contributions
ardless-2.186
ycle-2.181
esy-2.154
 Yen-2.153
aunder-2.145
Token there
Token ID612
Feature activation-2.39e+00

Pos logit contributions
ycle+0.366
ardless+0.363
 Yen+0.362
aunder+0.360
esy+0.358
Neg logit contributions
 Binding-0.349
 LAR-0.349
rador-0.345
NPR-0.333
 Ala-0.332
Token and
Token ID290
Feature activation+5.76e+00

Pos logit contributions
 LAR+4.499
 Binding+4.436
rador+4.278
 Ala+4.241
 az+4.198
Neg logit contributions
ycle-6.053
ardless-6.035
aunder-5.946
esy-5.888
 Paste-5.834
Token low
Token ID1877
Feature activation+2.95e+00

Pos logit contributions
 LAR+3.519
 Binding+3.493
rador+3.371
 Ala+3.305
 Goo+3.285
Neg logit contributions
ycle-4.231
aunder-4.204
ardless-4.184
esy-4.108
 Yen-4.065
Token in
Token ID287
Feature activation+1.03e+01

Pos logit contributions
 Binding+1.758
 LAR+1.745
rador+1.727
NPR+1.667
 Ala+1.652
Neg logit contributions
ycle-2.213
aunder-2.197
ardless-2.189
 Yen-2.189
esy-2.174
Token magnesium
Token ID33525
Feature activation+1.63e+01

Pos logit contributions
 LAR+6.095
 Binding+6.089
 az+5.935
rador+5.847
 Ala+5.826
Neg logit contributions
aunder-7.355
ycle-7.309
ardless-7.265
esy-7.111
erial-6.959
Token and
Token ID290
Feature activation+7.66e+00

Pos logit contributions
 az+8.307
 Binding+8.287
 LAR+8.238
 Ala+7.796
rador+7.523
Neg logit contributions
aunder-11.478
ycle-11.441
ardless-11.418
erial-11.298
esy-11.275
Token iron
Token ID6953
Feature activation+1.80e+01

Pos logit contributions
 LAR+4.315
 Binding+4.247
 az+4.149
 Ala+4.102
rador+4.068
Neg logit contributions
ycle-5.919
aunder-5.853
ardless-5.839
erial-5.700
esy-5.698
Token."
Token ID526
Feature activation+3.94e+00

Pos logit contributions
 Binding+9.025
 az+8.638
 LAR+8.625
ob+8.345
 Ala+8.310
Neg logit contributions
aunder-12.727
ycle-12.342
esy-12.317
ardless-12.316
 Yen-12.263
Token
Token ID198
Feature activation+1.88e+00

Pos logit contributions
 Binding+2.272
 LAR+2.225
rador+2.144
 Goo+2.126
 Ala+2.092
Neg logit contributions
ycle-3.111
ardless-3.071
aunder-3.040
esy-3.020
 Yen-2.998
Token
Token ID198
Feature activation+5.73e+00

Pos logit contributions
 LAR+1.015
 Binding+1.009
rador+0.985
NPR+0.964
 Ala+0.950
Neg logit contributions
ycle-1.521
ardless-1.504
aunder-1.504
 Yen-1.496
esy-1.478
TokenChem
Token ID41829
Feature activation+1.67e+00

Pos logit contributions
NPR+3.381
 Binding+3.291
 LAR+3.273
rador+3.227
NBA+3.159
Neg logit contributions
ycle-4.384
 Yen-4.382
ardless-4.344
aunder-4.268
 Paste-4.190
TokenCam
Token ID21701
Feature activation+1.58e+00

Pos logit contributions
 Binding+0.580
rador+0.574
 LAR+0.574
NPR+0.532
 Goo+0.524
Neg logit contributions
ycle-1.668
ardless-1.662
 Yen-1.659
aunder-1.652
esy-1.638
Token of
Token ID286
Feature activation+1.33e+00

Pos logit contributions
ycle+2.360
 Yen+2.357
 Paste+2.329
aunder+2.329
ardless+2.326
Neg logit contributions
rador-2.484
 Binding-2.478
 LAR-2.456
NPR-2.360
 Goo-2.336
Token dialogue
Token ID10721
Feature activation-1.44e+00

Pos logit contributions
 LAR+0.895
 Binding+0.892
rador+0.881
 Goo+0.855
 Ala+0.850
Neg logit contributions
ycle-0.899
ardless-0.893
aunder-0.887
 Yen-0.879
esy-0.872
Token ("
Token ID5855
Feature activation+3.04e+00

Pos logit contributions
ycle+1.024
ardless+1.015
 Yen+1.014
aunder+1.009
esy+1.000
Neg logit contributions
 Binding-0.908
 LAR-0.905
rador-0.902
NPR-0.864
 Goo-0.860
TokenHey
Token ID10814
Feature activation+4.93e+00

Pos logit contributions
 LAR+1.952
 Binding+1.942
rador+1.939
NPR+1.901
 Goo+1.873
Neg logit contributions
ardless-2.095
 Yen-2.093
ycle-2.093
aunder-2.078
esy-2.046
Token,
Token ID11
Feature activation+6.11e+00

Pos logit contributions
 LAR+3.126
 Binding+3.058
rador+3.005
 Goo+2.987
 Ala+2.935
Neg logit contributions
ycle-3.470
aunder-3.453
ardless-3.451
 Yen-3.370
esy-3.361
Token Ker
Token ID17337
Feature activation+1.76e+01

Pos logit contributions
 LAR+3.944
 Binding+3.850
 Goo+3.779
rador+3.757
 Ala+3.686
Neg logit contributions
ycle-4.234
aunder-4.222
ardless-4.212
esy-4.130
 Yen-4.095
Tokenmit
Token ID2781
Feature activation+7.86e+00

Pos logit contributions
rador+3.871
ob+3.641
 LAR+3.467
NPR+3.306
 Goo+3.256
Neg logit contributions
ּ-16.974
ycle-16.809
 Yen-16.683
ardless-16.601
aunder-16.217
Token!"
Token ID2474
Feature activation-4.43e+00

Pos logit contributions
 LAR+4.789
 Binding+4.647
 Goo+4.545
rador+4.506
 Ala+4.494
Neg logit contributions
ardless-5.522
ycle-5.502
aunder-5.438
 Yen-5.394
erial-5.357
Token and
Token ID290
Feature activation-2.10e+00

Pos logit contributions
 Yen+2.782
ycle+2.778
ardless+2.733
aunder+2.711
 Paste+2.707
Neg logit contributions
rador-3.127
 Binding-3.117
 LAR-3.106
NPR-2.991
 Ala-2.959
Token "
Token ID366
Feature activation+3.14e+00

Pos logit contributions
ycle+1.380
 Yen+1.375
ardless+1.366
aunder+1.355
 Paste+1.352
Neg logit contributions
 Binding-1.433
rador-1.427
 LAR-1.422
NPR-1.370
 Ala-1.356
TokenYeah
Token ID10995
Feature activation+4.71e+00

Pos logit contributions
 LAR+1.979
 Binding+1.960
rador+1.941
NPR+1.938
 Goo+1.907
Neg logit contributions
ardless-2.210
ycle-2.209
 Yen-2.206
aunder-2.191
esy-2.151
Token farmer
Token ID18739
Feature activation+4.86e-01

Pos logit contributions
ardless+2.984
 Yen+2.970
ycle+2.967
 Paste+2.963
aunder+2.923
Neg logit contributions
 Binding-3.134
 LAR-3.105
rador-3.079
 Goo-2.990
NPR-2.960
Token who
Token ID508
Feature activation-6.40e+00

Pos logit contributions
 Binding+0.313
 LAR+0.313
rador+0.311
NPR+0.299
 Ala+0.297
Neg logit contributions
ycle-0.341
ardless-0.338
 Yen-0.336
aunder-0.336
esy-0.333
Token has
Token ID468
Feature activation-3.38e+00

Pos logit contributions
ycle+3.814
 Yen+3.812
aunder+3.783
 Paste+3.773
esy+3.693
Neg logit contributions
 Binding-4.692
rador-4.603
 LAR-4.563
 Goo-4.456
NPR-4.419
Token lived
Token ID5615
Feature activation-1.27e+00

Pos logit contributions
 Yen+2.338
ycle+2.326
 Paste+2.305
aunder+2.300
ardless+2.299
Neg logit contributions
 Binding-2.184
rador-2.149
 LAR-2.147
NPR-2.070
 Goo-2.067
Token in
Token ID287
Feature activation-8.79e-01

Pos logit contributions
ycle+0.901
 Yen+0.895
ardless+0.893
aunder+0.886
 Paste+0.880
Neg logit contributions
 Binding-0.801
 LAR-0.796
rador-0.795
NPR-0.763
 Goo-0.759
Token Shang
Token ID45797
Feature activation+1.74e+01

Pos logit contributions
ycle+0.581
 Yen+0.579
ardless+0.574
aunder+0.572
 Paste+0.568
Neg logit contributions
 Binding-0.599
 LAR-0.596
rador-0.594
NPR-0.573
 Goo-0.567
Tokenba
Token ID7012
Feature activation+2.31e+00

Pos logit contributions
rador+4.597
ob+4.198
 LAR+4.117
 Goo+4.114
 Binding+4.068
Neg logit contributions
ּ-15.990
aunder-15.975
ardless-15.871
 Paste-15.832
 Construct-15.776
Token all
Token ID477
Feature activation-5.32e+00

Pos logit contributions
 LAR+1.478
 Binding+1.467
rador+1.451
 Ala+1.411
NPR+1.403
Neg logit contributions
ardless-1.628
ycle-1.623
aunder-1.599
esy-1.589
 Yen-1.584
Token his
Token ID465
Feature activation+6.00e+00

Pos logit contributions
ardless+3.557
ycle+3.548
 Yen+3.528
aunder+3.525
 Paste+3.515
Neg logit contributions
 Binding-3.518
 LAR-3.476
rador-3.438
NPR-3.346
 Goo-3.329
Token life
Token ID1204
Feature activation+6.61e-01

Pos logit contributions
 LAR+3.196
rador+3.148
 Binding+3.130
 Ala+3.029
NPR+2.979
Neg logit contributions
ycle-4.734
ardless-4.721
 Yen-4.689
aunder-4.687
 Paste-4.618
Token.
Token ID13
Feature activation-1.63e+00

Pos logit contributions
 Binding+0.403
 LAR+0.402
rador+0.400
NPR+0.383
 Ala+0.381
Neg logit contributions
ycle-0.485
ardless-0.481
 Yen-0.480
aunder-0.478
esy-0.474
Token doesn
Token ID1595
Feature activation-3.26e+00

Pos logit contributions
ycle+1.077
ardless+1.067
 Yen+1.065
aunder+1.054
esy+1.048
Neg logit contributions
 Binding-1.561
rador-1.547
 LAR-1.546
NPR-1.499
 Goo-1.490
Token't
Token ID470
Feature activation-7.22e+00

Pos logit contributions
ycle+2.266
ardless+2.258
 Yen+2.244
esy+2.244
aunder+2.235
Neg logit contributions
 Binding-2.103
rador-2.098
 LAR-2.071
 Goo-1.980
 Ala-1.975
Token,
Token ID11
Feature activation+1.94e+00

Pos logit contributions
ardless+4.537
aunder+4.533
ycle+4.492
 Yen+4.485
 Paste+4.411
Neg logit contributions
 Binding-4.957
rador-4.856
 LAR-4.813
NPR-4.756
 Goo-4.613
Token we
Token ID356
Feature activation+1.87e+00

Pos logit contributions
 LAR+1.408
 Binding+1.399
rador+1.386
 Goo+1.345
NPR+1.343
Neg logit contributions
ycle-1.207
ardless-1.199
 Yen-1.189
aunder-1.185
esy-1.178
Token'll
Token ID1183
Feature activation-1.17e-01

Pos logit contributions
 LAR+0.847
rador+0.834
 Binding+0.829
NPR+0.791
 Goo+0.789
Neg logit contributions
ycle-1.668
 Yen-1.662
ardless-1.657
aunder-1.656
 Paste-1.642
Token play
Token ID711
Feature activation+1.74e+01

Pos logit contributions
ycle+0.063
ardless+0.062
 Yen+0.061
aunder+0.061
esy+0.061
Neg logit contributions
 Binding-0.094
rador-0.094
 LAR-0.094
NPR-0.091
 Goo-0.090
Token ball
Token ID2613
Feature activation+3.14e+00

Pos logit contributions
 LAR+3.891
BALL+3.704
 Goo+3.620
 ball+3.522
 Ala+3.455
Neg logit contributions
aunder-17.202
esy-17.036
ycle-16.983
ardless-16.963
erial-16.878
Token."
Token ID526
Feature activation-6.47e-01

Pos logit contributions
 LAR+1.773
 Binding+1.759
rador+1.727
 Ala+1.666
NPR+1.663
Neg logit contributions
ycle-2.444
ardless-2.424
 Yen-2.419
aunder-2.405
 Paste-2.396
Token
Token ID198
Feature activation+3.09e+00

Pos logit contributions
ycle+0.473
ardless+0.469
 Yen+0.468
aunder+0.466
 Paste+0.462
Neg logit contributions
 Binding-0.396
rador-0.395
 LAR-0.394
NPR-0.377
 Goo-0.375
Token
Token ID198
Feature activation+4.78e-01

Pos logit contributions
 LAR+1.591
 Binding+1.564
rador+1.509
NPR+1.507
 Ala+1.480
Neg logit contributions
ycle-2.585
aunder-2.560
ardless-2.553
 Yen-2.538
esy-2.499
TokenAlso
Token ID7583
Feature activation-6.38e-01

Pos logit contributions
 Binding+0.295
 LAR+0.295
rador+0.293
NPR+0.284
 Goo+0.280
Neg logit contributions
ycle-0.346
 Yen-0.343
ardless-0.343
aunder-0.341
 Paste-0.337
Tokencom
Token ID785
Feature activation+6.79e+00

Pos logit contributions
 Binding+3.598
 LAR+3.528
rador+3.372
 Goo+3.327
 Ala+3.315
Neg logit contributions
ardless-5.674
ycle-5.620
 Yen-5.552
aunder-5.527
erial-5.483
Token or
Token ID393
Feature activation+4.18e+00

Pos logit contributions
 Binding+3.954
 LAR+3.954
rador+3.847
 Goo+3.676
 Ala+3.675
Neg logit contributions
ardless-5.025
ycle-4.988
aunder-4.967
 Yen-4.927
esy-4.908
Token in
Token ID287
Feature activation+4.21e+00

Pos logit contributions
 Binding+2.634
 LAR+2.621
rador+2.573
 Goo+2.516
NPR+2.478
Neg logit contributions
ycle-2.990
ardless-2.951
aunder-2.938
 Yen-2.926
esy-2.912
Token ebook
Token ID47179
Feature activation+1.05e+01

Pos logit contributions
 LAR+2.636
 Binding+2.628
rador+2.541
 Goo+2.473
NPR+2.457
Neg logit contributions
ycle-3.028
ardless-3.018
aunder-3.018
 Yen-2.961
esy-2.939
Token and
Token ID290
Feature activation+1.09e+01

Pos logit contributions
 LAR+5.859
 Binding+5.654
rador+5.502
NPR+5.460
 Goo+5.436
Neg logit contributions
aunder-7.725
ycle-7.697
esy-7.680
ardless-7.607
erial-7.587
Token Kindle
Token ID27114
Feature activation+1.73e+01

Pos logit contributions
 LAR+6.288
 Binding+6.172
rador+5.937
 az+5.824
 Goo+5.784
Neg logit contributions
ycle-7.863
ardless-7.778
esy-7.766
erial-7.735
aunder-7.691
Token formats
Token ID17519
Feature activation+1.20e+01

Pos logit contributions
 LAR+9.197
 Binding+8.588
 Goo+8.220
 Ala+8.184
 az+8.107
Neg logit contributions
aunder-12.147
ardless-12.102
esy-11.984
erial-11.958
ycle-11.720
Token.
Token ID13
Feature activation+2.11e+00

Pos logit contributions
 LAR+6.463
 Binding+6.307
rador+6.083
 Goo+5.995
NPR+5.969
Neg logit contributions
ycle-8.782
ardless-8.733
aunder-8.677
esy-8.648
erial-8.568
TokenA
Token ID32
Feature activation+4.28e+00

Pos logit contributions
 LAR+1.315
 Binding+1.314
rador+1.282
 Goo+1.245
NPR+1.243
Neg logit contributions
ycle-1.533
ardless-1.522
aunder-1.507
 Yen-1.500
esy-1.493
Token 20
Token ID1160
Feature activation+5.52e+00

Pos logit contributions
 LAR+2.758
 Binding+2.718
rador+2.659
 Goo+2.559
NPR+2.543
Neg logit contributions
ardless-3.018
ycle-3.018
aunder-2.991
 Yen-2.981
esy-2.946
Token%
Token ID4
Feature activation-7.96e+00

Pos logit contributions
 LAR+3.320
 Binding+3.314
rador+3.170
NPR+3.095
 Ala+3.083
Neg logit contributions
ardless-4.054
ycle-4.031
aunder-4.015
esy-3.965
 Yen-3.925
Token.
Token ID13
Feature activation+2.89e+00

Pos logit contributions
 LAR+3.069
 Binding+3.063
rador+3.030
NPR+2.912
 Ala+2.907
Neg logit contributions
ycle-4.050
aunder-4.044
esy-3.980
 Yen-3.980
ardless-3.965
Token
Token ID198
Feature activation+1.52e+00

Pos logit contributions
 Binding+1.645
 LAR+1.611
 Ala+1.565
rador+1.544
NPR+1.520
Neg logit contributions
ycle-2.305
aunder-2.252
ardless-2.247
esy-2.224
 Yen-2.193
Token
Token ID198
Feature activation+4.92e+00

Pos logit contributions
 LAR+0.823
 Binding+0.819
rador+0.802
NPR+0.785
 Ala+0.772
Neg logit contributions
ycle-1.220
ardless-1.208
aunder-1.205
 Yen-1.203
esy-1.187
TokenSa
Token ID33890
Feature activation+9.04e+00

Pos logit contributions
NPR+2.950
 Binding+2.906
 LAR+2.883
rador+2.841
NBA+2.830
Neg logit contributions
ycle-3.638
ardless-3.590
 Yen-3.590
aunder-3.560
 Paste-3.486
Tokenarak
Token ID30447
Feature activation+1.15e+01

Pos logit contributions
rador+4.373
 LAR+4.234
 Ala+4.168
 Binding+4.137
NPR+3.980
Neg logit contributions
 Yen-7.516
 Paste-7.427
ycle-7.403
aunder-7.244
ardless-7.241
Tokenk
Token ID74
Feature activation+1.72e+01

Pos logit contributions
 LAR+4.737
rador+4.483
 Ala+4.284
NBA+4.273
ob+4.240
Neg logit contributions
 Yen-10.094
 Paste-9.956
ycle-9.955
ardless-9.928
aunder-9.879
Tokenala
Token ID6081
Feature activation-3.33e+00

Pos logit contributions
rador+3.542
 LAR+3.337
alan+3.184
 Ala+3.174
ob+2.562
Neg logit contributions
 Yen-17.439
 Construct-17.257
 Paste-17.202
 Wick-17.105
ardless-17.059
Token,
Token ID11
Feature activation+4.39e+00

Pos logit contributions
 Yen+2.191
ycle+2.188
ardless+2.171
aunder+2.152
 Paste+2.152
Neg logit contributions
 Binding-2.260
rador-2.255
 LAR-2.250
NPR-2.164
 Goo-2.147
Token 28
Token ID2579
Feature activation+8.74e-01

Pos logit contributions
 Binding+2.477
 LAR+2.469
 Ala+2.408
rador+2.394
 Goo+2.324
Neg logit contributions
ycle-3.401
ardless-3.361
aunder-3.353
esy-3.339
erial-3.291
Token,
Token ID11
Feature activation-2.93e+00

Pos logit contributions
 Binding+0.592
 LAR+0.591
rador+0.588
NPR+0.566
 Goo+0.563
Neg logit contributions
ycle-0.582
ardless-0.577
 Yen-0.576
aunder-0.573
esy-0.567
Token said
Token ID531
Feature activation-4.59e+00

Pos logit contributions
 Yen+1.963
ycle+1.960
ardless+1.942
aunder+1.931
 Paste+1.930
Neg logit contributions
 Binding-1.958
rador-1.953
 LAR-1.950
NPR-1.878
 Goo-1.859
Token villagers
Token ID25263
Feature activation-3.72e+00

Pos logit contributions
 Yen+1.287
ycle+1.285
ardless+1.268
 Paste+1.266
aunder+1.262
Neg logit contributions
 Binding-1.257
rador-1.254
 LAR-1.250
NPR-1.202
 Goo-1.192
Token use
Token ID779
Feature activation-3.91e+00

Pos logit contributions
ycle+2.451
 Yen+2.447
aunder+2.425
ardless+2.414
 Paste+2.402
Neg logit contributions
 Binding-2.518
rador-2.506
 LAR-2.487
 Goo-2.398
NPR-2.381
Token well
Token ID880
Feature activation-2.06e+00

Pos logit contributions
 Yen+2.619
ycle+2.617
 Paste+2.581
ardless+2.559
aunder+2.550
Neg logit contributions
 Binding-2.602
rador-2.587
 LAR-2.583
NPR-2.492
 Goo-2.457
Token water
Token ID1660
Feature activation-2.51e+00

Pos logit contributions
ycle+1.558
ardless+1.555
 Yen+1.547
 Paste+1.544
aunder+1.543
Neg logit contributions
 Binding-1.203
 LAR-1.188
rador-1.186
 Goo-1.130
NPR-1.125
Token in
Token ID287
Feature activation-5.51e-01

Pos logit contributions
ycle+1.642
 Yen+1.630
ardless+1.624
 Paste+1.615
aunder+1.613
Neg logit contributions
 Binding-1.729
rador-1.716
 LAR-1.713
 Goo-1.650
NPR-1.643
Token Shang
Token ID45797
Feature activation+1.71e+01

Pos logit contributions
ycle+0.386
 Yen+0.383
ardless+0.382
aunder+0.380
 Paste+0.377
Neg logit contributions
 Binding-0.355
 LAR-0.354
rador-0.353
NPR-0.339
 Goo-0.337
Tokenba
Token ID7012
Feature activation-8.76e-01

Pos logit contributions
rador+4.848
 Binding+4.571
ob+4.482
 LAR+4.461
 Goo+4.412
Neg logit contributions
aunder-15.321
ּ-15.247
ardless-15.174
 Paste-15.137
ycle-14.983
Token for
Token ID329
Feature activation-4.97e+00

Pos logit contributions
ycle+0.595
 Yen+0.589
ardless+0.588
aunder+0.585
 Paste+0.580
Neg logit contributions
 Binding-0.582
 LAR-0.581
rador-0.579
NPR-0.556
 Goo-0.554
Token drinking
Token ID7722
Feature activation-1.38e-01

Pos logit contributions
 Yen+3.158
ycle+3.154
 Paste+3.142
ardless+3.095
aunder+3.088
Neg logit contributions
rador-3.440
 Binding-3.433
 LAR-3.388
NPR-3.304
 Goo-3.254
Token but
Token ID475
Feature activation-4.58e+00

Pos logit contributions
ycle+0.096
ardless+0.096
 Yen+0.095
aunder+0.095
esy+0.094
Neg logit contributions
 LAR-0.089
 Binding-0.088
rador-0.088
NPR-0.084
 Goo-0.084
Token tests
Token ID5254
Feature activation-2.38e+00

Pos logit contributions
ycle+3.041
 Yen+3.031
aunder+2.991
ardless+2.966
 Paste+2.959
Neg logit contributions
 Binding-3.100
 LAR-3.054
rador-3.051
NPR-2.949
 Goo-2.946
Token,
Token ID11
Feature activation-7.22e-01

Pos logit contributions
 LAR+2.611
 Binding+2.589
rador+2.479
NPR+2.473
 Ala+2.455
Neg logit contributions
ycle-3.468
aunder-3.389
 Yen-3.380
ardless-3.373
 Paste-3.315
Token and
Token ID290
Feature activation+6.56e-01

Pos logit contributions
ycle+0.474
ardless+0.470
 Yen+0.469
aunder+0.467
esy+0.462
Neg logit contributions
 Binding-0.496
 LAR-0.495
rador-0.493
NPR-0.474
 Goo-0.472
Token must
Token ID1276
Feature activation+2.28e+00

Pos logit contributions
 Binding+0.406
 LAR+0.405
rador+0.403
NPR+0.387
 Goo+0.384
Neg logit contributions
ycle-0.476
ardless-0.472
 Yen-0.471
aunder-0.469
esy-0.465
Token roll
Token ID4836
Feature activation+5.50e+00

Pos logit contributions
 LAR+1.331
 Binding+1.312
rador+1.309
 Goo+1.250
NPR+1.238
Neg logit contributions
ycle-1.729
ardless-1.720
 Yen-1.719
aunder-1.706
 Paste-1.698
Token to
Token ID284
Feature activation+1.24e+01

Pos logit contributions
 LAR+3.175
 Binding+3.070
rador+3.029
 Goo+2.925
NPR+2.917
Neg logit contributions
ycle-4.242
ardless-4.192
aunder-4.170
 Yen-4.096
erial-4.053
Token avoid
Token ID3368
Feature activation+1.70e+01

Pos logit contributions
 LAR+3.045
 Goo+2.821
 Binding+2.793
rador+2.697
 Ala+2.549
Neg logit contributions
erial-12.850
ycle-12.708
aunder-12.619
 Paste-12.617
 Clapper-12.596
Token following
Token ID1708
Feature activation+1.98e-01

Pos logit contributions
 LAR+4.661
 Binding+4.520
 Goo+3.889
NPR+3.711
 az+3.707
Neg logit contributions
aunder-16.836
ycle-16.666
 Yen-16.375
erial-16.346
ardless-16.173
Token the
Token ID262
Feature activation+1.88e+00

Pos logit contributions
 LAR+0.127
 Binding+0.127
rador+0.125
NPR+0.121
 Goo+0.120
Neg logit contributions
ycle-0.140
ardless-0.139
aunder-0.138
 Yen-0.138
esy-0.137
Token commands
Token ID9729
Feature activation-1.48e-01

Pos logit contributions
 LAR+1.187
 Binding+1.187
rador+1.170
NPR+1.130
 Goo+1.124
Neg logit contributions
ycle-1.347
ardless-1.337
 Yen-1.325
aunder-1.322
esy-1.307
Token in
Token ID287
Feature activation-2.73e+00

Pos logit contributions
ycle+0.107
ardless+0.106
 Yen+0.106
aunder+0.106
esy+0.105
Neg logit contributions
 Binding-0.092
 LAR-0.092
rador-0.091
NPR-0.088
 Goo-0.087
Token his
Token ID465
Feature activation-4.18e+00

Pos logit contributions
 Yen+1.825
ardless+1.821
ycle+1.813
 Paste+1.797
esy+1.791
Neg logit contributions
 Binding-1.819
rador-1.802
 LAR-1.788
NPR-1.719
 Goo-1.709
Token job
Token ID1693
Feature activation+8.67e+00

Pos logit contributions
ycle+1.896
 Yen+1.884
ardless+1.883
aunder+1.866
esy+1.852
Neg logit contributions
 Binding-1.927
rador-1.922
 LAR-1.915
NPR-1.857
 Ala-1.835
Token of
Token ID286
Feature activation+6.90e+00

Pos logit contributions
 LAR+4.998
 Binding+4.924
rador+4.817
 Ala+4.693
 Goo+4.646
Neg logit contributions
ycle-6.441
aunder-6.323
ardless-6.272
 Yen-6.187
 Paste-6.140
Token getting
Token ID1972
Feature activation+4.24e+00

Pos logit contributions
 LAR+4.185
 Binding+4.111
rador+4.039
 Ala+3.948
 Goo+3.898
Neg logit contributions
ycle-5.035
aunder-4.930
ardless-4.922
 Yen-4.830
 Paste-4.819
Token low
Token ID1877
Feature activation-1.86e+00

Pos logit contributions
 LAR+2.650
 Binding+2.601
rador+2.567
 Ala+2.517
 Goo+2.496
Neg logit contributions
ycle-3.088
aunder-3.028
ardless-3.018
 Yen-2.974
 Paste-2.958
Token and
Token ID290
Feature activation+4.47e+00

Pos logit contributions
ycle+1.192
ardless+1.181
 Yen+1.177
esy+1.166
aunder+1.165
Neg logit contributions
 Binding-1.303
 LAR-1.296
rador-1.295
NPR-1.252
 Goo-1.246
Token launches
Token ID18617
Feature activation+1.70e+01

Pos logit contributions
 LAR+2.844
 Binding+2.806
rador+2.764
 Ala+2.698
 Goo+2.667
Neg logit contributions
ycle-3.188
ardless-3.160
aunder-3.134
 Yen-3.092
 Paste-3.060
Token himself
Token ID2241
Feature activation+1.24e+01

Pos logit contributions
 LAR+9.126
 Ala+8.262
 Binding+8.183
 az+8.066
 Goo+8.035
Neg logit contributions
ardless-11.982
aunder-11.897
ycle-11.879
 Paste-11.541
 Yen-11.425
Token into
Token ID656
Feature activation+9.45e+00

Pos logit contributions
 LAR+6.762
 Ala+6.273
 Binding+6.221
rador+6.138
 az+6.114
Neg logit contributions
ycle-9.236
aunder-9.198
ardless-8.991
 Yen-8.898
 Paste-8.794
Token Bo
Token ID3248
Feature activation+8.12e+00

Pos logit contributions
 LAR+5.410
 Binding+5.287
 Ala+5.119
rador+5.114
 Goo+4.976
Neg logit contributions
ardless-7.075
aunder-6.965
ycle-6.928
 Paste-6.864
esy-6.786
Tokenz
Token ID89
Feature activation-9.94e-01

Pos logit contributions
 LAR+3.415
rador+3.282
 Binding+3.206
 Goo+3.135
 Ala+3.080
Neg logit contributions
ardless-7.210
aunder-7.150
ycle-7.144
 Yen-7.134
 Paste-7.092
Tokeneman
Token ID8463
Feature activation+3.24e+00

Pos logit contributions
ycle+0.536
ardless+0.531
 Yen+0.528
aunder+0.526
esy+0.521
Neg logit contributions
 Binding-0.800
 LAR-0.797
rador-0.793
NPR-0.769
 Goo-0.767
Token
Token ID198
Feature activation-3.83e+00

Pos logit contributions
ycle+0.711
ardless+0.704
 Yen+0.703
aunder+0.700
 Paste+0.696
Neg logit contributions
 Binding-0.594
rador-0.591
 LAR-0.589
NPR-0.567
 Goo-0.562
Token
Token ID198
Feature activation-5.30e+00

Pos logit contributions
ardless+2.601
ycle+2.595
 Yen+2.593
 Paste+2.593
esy+2.587
Neg logit contributions
rador-2.508
 Binding-2.475
 LAR-2.424
���-2.353
 Goo-2.343
Token19
Token ID1129
Feature activation-2.30e+00

Pos logit contributions
ycle+3.436
aunder+3.371
 Paste+3.359
ardless+3.318
esy+3.303
Neg logit contributions
rador-3.601
 Binding-3.599
 LAR-3.586
 Goo-3.456
 Ala-3.453
Token.
Token ID13
Feature activation-6.40e-01

Pos logit contributions
 Yen+1.589
ycle+1.579
ardless+1.567
 Paste+1.563
aunder+1.559
Neg logit contributions
rador-1.498
 Binding-1.481
 LAR-1.462
NPR-1.412
 Goo-1.402
Token Brian
Token ID8403
Feature activation+4.42e+00

Pos logit contributions
ycle+0.451
 Yen+0.448
ardless+0.447
aunder+0.444
 Paste+0.442
Neg logit contributions
 Binding-0.407
rador-0.406
 LAR-0.406
NPR-0.389
 Goo-0.386
Token Al
Token ID978
Feature activation+1.70e+01

Pos logit contributions
 Binding+2.586
 LAR+2.550
rador+2.511
 Ala+2.485
 Goo+2.445
Neg logit contributions
ardless-3.360
ycle-3.321
aunder-3.286
 Yen-3.269
esy-3.253
Tokenain
Token ID391
Feature activation+1.12e+00

Pos logit contributions
rador+6.429
 az+5.764
riel+5.629
 Az+5.569
vas+5.509
Neg logit contributions
ּ-14.315
 Yen-14.305
 Cantor-14.142
 Clapper-14.100
ardless-13.934
Tokenu
Token ID84
Feature activation+3.67e+00

Pos logit contributions
rador+0.709
 LAR+0.707
 Binding+0.706
NPR+0.682
 Goo+0.678
Neg logit contributions
ycle-0.787
ardless-0.785
 Yen-0.783
aunder-0.773
 Paste-0.768
Token'
Token ID6
Feature activation+2.35e+00

Pos logit contributions
 LAR+2.572
 Binding+2.547
rador+2.535
NPR+2.479
 Ala+2.466
Neg logit contributions
ycle-2.359
ardless-2.345
 Yen-2.303
aunder-2.300
esy-2.281
Tokenuese
Token ID20506
Feature activation-4.25e-01

Pos logit contributions
rador+1.395
 Binding+1.387
 LAR+1.372
NPR+1.337
 Ala+1.313
Neg logit contributions
ycle-1.756
 Yen-1.754
ardless-1.743
aunder-1.727
 Paste-1.709
Token (
Token ID357
Feature activation-6.64e+00

Pos logit contributions
ycle+0.235
ardless+0.232
 Yen+0.232
aunder+0.231
 Paste+0.228
Neg logit contributions
 Binding-0.336
 LAR-0.334
rador-0.334
NPR-0.323
 Goo-0.322
Token increases
Token ID5732
Feature activation+5.07e+00

Pos logit contributions
 Yen+1.670
ycle+1.666
ardless+1.643
 Paste+1.639
aunder+1.638
Neg logit contributions
rador-1.861
 Binding-1.838
 LAR-1.826
NPR-1.778
 Goo-1.754
Token levels
Token ID2974
Feature activation+1.77e+00

Pos logit contributions
 Binding+3.031
 LAR+2.990
rador+2.855
 Ala+2.839
 Goo+2.819
Neg logit contributions
ycle-3.866
ardless-3.840
aunder-3.766
esy-3.742
erial-3.685
Token of
Token ID286
Feature activation+5.42e+00

Pos logit contributions
 Binding+1.157
 LAR+1.154
rador+1.136
NPR+1.103
 Goo+1.098
Neg logit contributions
ycle-1.223
ardless-1.216
aunder-1.210
 Yen-1.205
esy-1.192
Token the
Token ID262
Feature activation+3.94e+00

Pos logit contributions
 Binding+3.203
 LAR+3.186
 Ala+3.054
rador+3.020
 Goo+3.006
Neg logit contributions
ardless-4.167
ycle-4.156
aunder-4.054
esy-4.009
erial-3.980
Token neurotrans
Token ID39307
Feature activation+4.51e+00

Pos logit contributions
 Binding+2.517
 LAR+2.488
rador+2.400
 Goo+2.364
 Ala+2.362
Neg logit contributions
ycle-2.852
ardless-2.851
aunder-2.780
esy-2.750
 Yen-2.723
Tokenmitter
Token ID37974
Feature activation+1.70e+01

Pos logit contributions
 Binding+3.016
 LAR+2.958
rador+2.948
 Ala+2.824
 Goo+2.817
Neg logit contributions
 Yen-3.001
aunder-2.996
ardless-2.986
ycle-2.976
 Paste-2.917
Token acet
Token ID25174
Feature activation+1.11e+01

Pos logit contributions
 Binding+7.544
 Ala+7.524
 LAR+7.444
 az+7.370
NPR+6.749
Neg logit contributions
ardless-14.017
ycle-13.790
erial-13.293
esy-13.146
aunder-13.015
Tokenyl
Token ID2645
Feature activation+1.02e+01

Pos logit contributions
 Binding+5.006
rador+4.911
NPR+4.781
 LAR+4.720
ob+4.697
Neg logit contributions
 Yen-9.255
ardless-9.232
 Paste-9.111
 Wick-9.065
aunder-9.039
Tokench
Token ID354
Feature activation+8.53e+00

Pos logit contributions
 Binding+6.691
 LAR+6.519
 Ala+6.434
 az+6.288
NPR+6.222
Neg logit contributions
ardless-7.014
ycle-6.672
aunder-6.367
 Yen-6.344
 Paste-6.260
Tokenoline
Token ID14453
Feature activation+1.12e+01

Pos logit contributions
rador+4.200
 Binding+4.133
 LAR+4.129
ob+4.106
NPR+4.080
Neg logit contributions
ardless-6.782
 Yen-6.739
 Paste-6.728
ycle-6.694
 Cantor-6.658
Token in
Token ID287
Feature activation+5.01e+00

Pos logit contributions
 Binding+6.179
 LAR+5.961
NPR+5.734
 Ala+5.619
rador+5.604
Neg logit contributions
ardless-8.423
ycle-8.329
esy-8.320
aunder-8.295
erial-8.190
Token About
Token ID7994
Feature activation+1.07e+01

Pos logit contributions
 Binding+5.436
 LAR+5.092
 Ala+5.022
 Goo+4.998
rador+4.771
Neg logit contributions
ycle-7.655
aunder-7.570
ardless-7.481
esy-7.465
erial-7.350
Token B
Token ID347
Feature activation+8.99e+00

Pos logit contributions
 Binding+5.503
 LAR+4.984
 Goo+4.890
 Ala+4.862
rador+4.788
Neg logit contributions
aunder-8.681
ycle-8.653
esy-8.489
ardless-8.434
erial-8.388
Tokenala
Token ID6081
Feature activation-9.42e-02

Pos logit contributions
 LAR+2.861
rador+2.845
NPR+2.641
 Binding+2.620
 Ala+2.620
Neg logit contributions
 Yen-8.779
 Paste-8.727
ycle-8.641
aunder-8.555
ardless-8.552
Token Han
Token ID9530
Feature activation+3.66e+00

Pos logit contributions
ycle+0.065
ardless+0.065
 Yen+0.065
aunder+0.064
esy+0.064
Neg logit contributions
 Binding-0.061
 LAR-0.061
rador-0.061
NPR-0.058
 Goo-0.058
Tokenuman
Token ID3778
Feature activation+3.23e+00

Pos logit contributions
rador+3.183
 LAR+3.158
 Binding+3.122
NPR+3.067
 Ala+3.042
Neg logit contributions
ycle-1.711
ardless-1.710
 Yen-1.695
 Paste-1.664
aunder-1.659
Token Temple
Token ID10857
Feature activation+1.69e+01

Pos logit contributions
 Binding+1.914
 LAR+1.911
rador+1.852
 Ala+1.828
 Goo+1.806
Neg logit contributions
ycle-2.459
aunder-2.420
ardless-2.402
esy-2.378
 Yen-2.357
Token Here
Token ID3423
Feature activation+4.92e+00

Pos logit contributions
 Binding+6.428
 Ala+6.291
 Goo+5.999
 LAR+5.937
 az+5.431
Neg logit contributions
aunder-14.241
ycle-14.072
erial-13.825
ardless-13.800
esy-13.753
Token
Token ID198
Feature activation-1.70e-01

Pos logit contributions
 Binding+2.632
 LAR+2.586
rador+2.547
 Ala+2.466
 Goo+2.440
Neg logit contributions
ycle-3.934
aunder-3.918
ardless-3.915
 Yen-3.885
esy-3.876
Token
Token ID198
Feature activation-1.63e+00

Pos logit contributions
ycle+0.130
ardless+0.129
 Yen+0.129
aunder+0.128
esy+0.127
Neg logit contributions
 Binding-0.098
 LAR-0.098
rador-0.097
NPR-0.093
 Goo-0.092
Token7
Token ID22
Feature activation+6.25e-01

Pos logit contributions
ycle+1.201
ardless+1.180
 Yen+1.172
aunder+1.171
esy+1.170
Neg logit contributions
 Binding-0.990
 LAR-0.989
rador-0.978
 Ala-0.936
 Goo-0.932
Token.
Token ID13
Feature activation-7.97e-01

Pos logit contributions
 Binding+0.374
 LAR+0.374
rador+0.370
NPR+0.356
 Goo+0.354
Neg logit contributions
ycle-0.466
ardless-0.462
 Yen-0.460
aunder-0.459
esy-0.455
Token
Token ID198
Feature activation+2.03e+00

Pos logit contributions
ycle+0.439
ardless+0.437
 Yen+0.436
aunder+0.433
esy+0.431
Neg logit contributions
 Binding-0.342
rador-0.342
 LAR-0.341
NPR-0.324
 Goo-0.323
TokenSenior
Token ID31224
Feature activation+1.25e+00

Pos logit contributions
 Binding+1.225
 LAR+1.224
rador+1.221
NPR+1.197
 Ala+1.157
Neg logit contributions
ycle-1.499
 Yen-1.497
ardless-1.491
aunder-1.476
 Paste-1.459
Token public
Token ID1171
Feature activation-3.18e+00

Pos logit contributions
 Binding+0.691
 LAR+0.688
rador+0.685
NPR+0.652
 Goo+0.650
Neg logit contributions
ycle-0.998
ardless-0.988
aunder-0.985
 Yen-0.983
esy-0.976
Token servants
Token ID17523
Feature activation+2.84e+00

Pos logit contributions
ycle+2.014
 Yen+1.987
ardless+1.973
esy+1.953
aunder+1.942
Neg logit contributions
 Binding-2.267
 LAR-2.239
rador-2.203
NPR-2.172
 Goo-2.146
Token superv
Token ID29745
Feature activation+1.06e+00

Pos logit contributions
 LAR+1.829
 Binding+1.821
rador+1.811
 Ala+1.742
NPR+1.741
Neg logit contributions
ycle-1.986
ardless-1.975
aunder-1.946
esy-1.944
 Yen-1.941
Tokenise
Token ID786
Feature activation+1.68e+01

Pos logit contributions
rador+0.676
 Binding+0.675
 LAR+0.674
 Ala+0.643
NPR+0.643
Neg logit contributions
ycle-0.736
ardless-0.731
 Yen-0.731
aunder-0.723
 Paste-0.717
Token ang
Token ID3550
Feature activation+1.54e+01

Pos logit contributions
rador+6.895
 az+6.541
 LAR+6.424
 Binding+6.319
 Ala+6.204
Neg logit contributions
ycle-13.722
esy-13.482
ardless-13.443
aunder-13.231
erial-13.162
Tokenl
Token ID75
Feature activation+6.44e-01

Pos logit contributions
rador+5.189
 LAR+4.611
scrib+4.348
riel+4.317
NPR+4.309
Neg logit contributions
 Yen-13.969
ּ-13.962
ardless-13.716
 Clapper-13.703
ycle-13.628
Tokenophone
Token ID38656
Feature activation+2.79e+00

Pos logit contributions
 Binding+0.451
 LAR+0.451
rador+0.450
NPR+0.434
 Goo+0.430
Neg logit contributions
ycle-0.413
ardless-0.409
 Yen-0.409
aunder-0.406
esy-0.402
Token and
Token ID290
Feature activation+9.52e+00

Pos logit contributions
 LAR+1.824
 Binding+1.819
rador+1.807
NPR+1.738
 Ala+1.734
Neg logit contributions
ycle-1.913
ardless-1.901
aunder-1.898
 Yen-1.880
esy-1.875
Token franc
Token ID46754
Feature activation+7.26e+00

Pos logit contributions
 LAR+3.617
rador+3.417
 az+3.204
 Binding+3.065
NPR+3.033
Neg logit contributions
ycle-9.217
ardless-8.979
aunder-8.780
esy-8.775
erial-8.654
Token with
Token ID351
Feature activation+8.10e-01

Pos logit contributions
 Binding+3.809
 LAR+3.727
rador+3.704
 Ala+3.550
 Goo+3.540
Neg logit contributions
ycle-5.166
 Yen-5.136
ardless-5.116
aunder-5.114
esy-5.059
Token ker
Token ID41927
Feature activation+1.59e+01

Pos logit contributions
 Binding+0.538
 LAR+0.538
rador+0.530
 Goo+0.511
 Ala+0.511
Neg logit contributions
ycle-0.557
ardless-0.550
aunder-0.546
 Yen-0.544
esy-0.539
Tokenatin
Token ID10680
Feature activation+7.59e+00

Pos logit contributions
 Binding+7.311
rador+7.238
 LAR+6.985
ob+6.939
 Goo+6.784
Neg logit contributions
ycle-11.914
ardless-11.611
 Yen-11.584
 Paste-11.402
 Clapper-11.356
Token 1
Token ID352
Feature activation+7.52e+00

Pos logit contributions
 LAR+4.247
 Binding+4.247
 Ala+4.015
 Goo+3.943
 az+3.869
Neg logit contributions
ycle-5.946
ardless-5.816
aunder-5.769
erial-5.647
esy-5.626
Token and
Token ID290
Feature activation+4.70e+00

Pos logit contributions
 Binding+4.915
 LAR+4.769
rador+4.616
 Ala+4.592
 Goo+4.533
Neg logit contributions
ycle-5.072
ardless-4.992
aunder-4.972
 Yen-4.922
esy-4.914
Token ker
Token ID41927
Feature activation+1.68e+01

Pos logit contributions
 LAR+2.933
 Binding+2.918
rador+2.822
 Goo+2.778
 Ala+2.758
Neg logit contributions
ycle-3.413
ardless-3.380
aunder-3.306
 Yen-3.280
esy-3.271
Tokenatin
Token ID10680
Feature activation+7.50e+00

Pos logit contributions
ob+6.670
rador+6.604
vas+6.522
 Binding+6.469
alan+6.203
Neg logit contributions
 Clapper-13.252
ycle-13.227
 Yen-13.226
ּ-13.192
ardless-13.135
Token 16
Token ID1467
Feature activation+7.84e+00

Pos logit contributions
 LAR+4.047
 Binding+4.005
 Ala+3.820
 Goo+3.711
 az+3.630
Neg logit contributions
ycle-6.095
ardless-5.988
aunder-5.918
esy-5.801
erial-5.773
Token.[
Token ID3693
Feature activation+3.02e+00

Pos logit contributions
 Binding+5.688
 LAR+5.476
rador+5.378
 Ala+5.339
 Goo+5.278
Neg logit contributions
ycle-4.698
ardless-4.618
aunder-4.605
 Yen-4.588
esy-4.541
Token23
Token ID1954
Feature activation+2.55e+00

Pos logit contributions
 Binding+2.087
 LAR+2.084
rador+2.042
NPR+2.014
 Ala+1.964
Neg logit contributions
ycle-1.975
 Yen-1.952
aunder-1.933
ardless-1.929
esy-1.897
Token]
Token ID60
Feature activation-2.79e-01

Pos logit contributions
 Binding+2.172
 LAR+2.162
rador+2.149
NPR+2.089
 Ala+2.085
Neg logit contributions
ycle-1.258
ardless-1.241
aunder-1.237
 Yen-1.234
esy-1.216
Token x
Token ID2124
Feature activation-2.70e+00

Pos logit contributions
ycle+0.823
 Yen+0.821
ardless+0.812
aunder+0.808
 Paste+0.805
Neg logit contributions
rador-1.116
 Binding-1.114
 LAR-1.111
NPR-1.076
 Goo-1.065
Token.
Token ID13
Feature activation-9.20e+00

Pos logit contributions
ycle+1.867
ardless+1.855
 Yen+1.854
aunder+1.846
esy+1.842
Neg logit contributions
rador-1.749
 Binding-1.743
 LAR-1.725
NPR-1.655
 Goo-1.654
Tokenx
Token ID87
Feature activation-1.35e+01

Pos logit contributions
esy+3.737
ycle+3.695
ardless+3.641
erial+3.575
 Yen+3.574
Neg logit contributions
���-8.809
 incent-8.449
 Binding-8.413
rador-8.390
 LAR-8.295
Token.
Token ID13
Feature activation-2.03e+01

Pos logit contributions
esy+6.438
 Yen+6.410
 Paste+6.318
aunder+6.224
 Schumer+6.172
Neg logit contributions
���-12.574
 incent-11.414
rador-10.737
 rall-10.695
 Binding-10.457
Tokenx
Token ID87
Feature activation-2.19e+01

Pos logit contributions
esy+7.010
alter+6.810
erial+6.662
ardless+6.619
ycle+6.536
Neg logit contributions
���-20.496
 incent-18.346
 rall-18.051
 oun-17.865
 Binding-17.805
Token.
Token ID13
Feature activation-2.27e+01

Pos logit contributions
esy+8.386
alter+8.090
ardless+8.015
aunder+7.868
 Yen+7.808
Neg logit contributions
���-20.326
 incent-18.023
 tiss-17.925
 oun-17.907
 rall-17.888
Token118
Token ID16817
Feature activation-1.78e+01

Pos logit contributions
alter+3.324
esy+3.246
ardless+2.624
 Wick+2.492
erial+2.439
Neg logit contributions
 Binding-23.630
rador-23.443
 Goo-23.196
 Ala-22.807
���-22.764
Token 10
Token ID838
Feature activation-7.53e+00

Pos logit contributions
 Yen+8.155
 Cantor+7.760
 Schumer+7.684
 Construct+7.662
 Paste+7.631
Neg logit contributions
rador-13.558
���-12.743
Reviewer-12.623
 Binding-12.588
 antitrust-12.429
Token95
Token ID3865
Feature activation-1.59e+00

Pos logit contributions
ardless+4.401
esy+4.328
 Paste+4.322
ycle+4.312
aunder+4.308
Neg logit contributions
rador-5.570
 Binding-5.478
 LAR-5.348
 Goo-5.228
 antitrust-5.205
Token607
Token ID31980
Feature activation-2.23e+00

Pos logit contributions
ycle+0.990
ardless+0.981
 Yen+0.979
aunder+0.974
esy+0.964
Neg logit contributions
 Binding-1.140
 LAR-1.137
rador-1.134
NPR-1.091
 Goo-1.088
Token x
Token ID2124
Feature activation-1.83e+00

Pos logit contributions
 Yen+1.243
ycle+1.236
ardless+1.221
 Paste+1.219
esy+1.214
Neg logit contributions
rador-1.743
 Binding-1.732
 LAR-1.727
NPR-1.678
 Ala-1.659
Token8
Token ID23
Feature activation-1.45e+00

Pos logit contributions
 Yen+3.494
 Paste+3.391
aunder+3.385
ycle+3.385
esy+3.356
Neg logit contributions
rador-3.709
 Binding-3.606
 LAR-3.580
NPR-3.530
 Ala-3.455
Token x
Token ID2124
Feature activation-2.70e+00

Pos logit contributions
ycle+0.823
 Yen+0.821
ardless+0.812
aunder+0.808
 Paste+0.805
Neg logit contributions
rador-1.116
 Binding-1.114
 LAR-1.111
NPR-1.076
 Goo-1.065
Token.
Token ID13
Feature activation-9.20e+00

Pos logit contributions
ycle+1.867
ardless+1.855
 Yen+1.854
aunder+1.846
esy+1.842
Neg logit contributions
rador-1.749
 Binding-1.743
 LAR-1.725
NPR-1.655
 Goo-1.654
Tokenx
Token ID87
Feature activation-1.35e+01

Pos logit contributions
esy+3.737
ycle+3.695
ardless+3.641
erial+3.575
 Yen+3.574
Neg logit contributions
���-8.809
 incent-8.449
 Binding-8.413
rador-8.390
 LAR-8.295
Token.
Token ID13
Feature activation-2.03e+01

Pos logit contributions
esy+6.438
 Yen+6.410
 Paste+6.318
aunder+6.224
 Schumer+6.172
Neg logit contributions
���-12.574
 incent-11.414
rador-10.737
 rall-10.695
 Binding-10.457
Tokenx
Token ID87
Feature activation-2.19e+01

Pos logit contributions
esy+7.010
alter+6.810
erial+6.662
ardless+6.619
ycle+6.536
Neg logit contributions
���-20.496
 incent-18.346
 rall-18.051
 oun-17.865
 Binding-17.805
Token.
Token ID13
Feature activation-2.27e+01

Pos logit contributions
esy+8.386
alter+8.090
ardless+8.015
aunder+7.868
 Yen+7.808
Neg logit contributions
���-20.326
 incent-18.023
 tiss-17.925
 oun-17.907
 rall-17.888
Token118
Token ID16817
Feature activation-1.78e+01

Pos logit contributions
alter+3.324
esy+3.246
ardless+2.624
 Wick+2.492
erial+2.439
Neg logit contributions
 Binding-23.630
rador-23.443
 Goo-23.196
 Ala-22.807
���-22.764
Token 10
Token ID838
Feature activation-7.53e+00

Pos logit contributions
 Yen+8.155
 Cantor+7.760
 Schumer+7.684
 Construct+7.662
 Paste+7.631
Neg logit contributions
rador-13.558
���-12.743
Reviewer-12.623
 Binding-12.588
 antitrust-12.429
Token95
Token ID3865
Feature activation-1.59e+00

Pos logit contributions
ardless+4.401
esy+4.328
 Paste+4.322
ycle+4.312
aunder+4.308
Neg logit contributions
rador-5.570
 Binding-5.478
 LAR-5.348
 Goo-5.228
 antitrust-5.205
Token607
Token ID31980
Feature activation-2.23e+00

Pos logit contributions
ycle+0.990
ardless+0.981
 Yen+0.979
aunder+0.974
esy+0.964
Neg logit contributions
 Binding-1.140
 LAR-1.137
rador-1.134
NPR-1.091
 Goo-1.088
Token0000
Token ID2388
Feature activation-5.38e+00

Pos logit contributions
ardless+3.956
 Yen+3.907
 Paste+3.899
esy+3.898
ycle+3.870
Neg logit contributions
rador-4.973
 Binding-4.870
 LAR-4.793
 Goo-4.638
 Ala-4.611
Token8
Token ID23
Feature activation-1.45e+00

Pos logit contributions
 Yen+3.494
 Paste+3.391
aunder+3.385
ycle+3.385
esy+3.356
Neg logit contributions
rador-3.709
 Binding-3.606
 LAR-3.580
NPR-3.530
 Ala-3.455
Token x
Token ID2124
Feature activation-2.70e+00

Pos logit contributions
ycle+0.823
 Yen+0.821
ardless+0.812
aunder+0.808
 Paste+0.805
Neg logit contributions
rador-1.116
 Binding-1.114
 LAR-1.111
NPR-1.076
 Goo-1.065
Token.
Token ID13
Feature activation-9.20e+00

Pos logit contributions
ycle+1.867
ardless+1.855
 Yen+1.854
aunder+1.846
esy+1.842
Neg logit contributions
rador-1.749
 Binding-1.743
 LAR-1.725
NPR-1.655
 Goo-1.654
Tokenx
Token ID87
Feature activation-1.35e+01

Pos logit contributions
esy+3.737
ycle+3.695
ardless+3.641
erial+3.575
 Yen+3.574
Neg logit contributions
���-8.809
 incent-8.449
 Binding-8.413
rador-8.390
 LAR-8.295
Token.
Token ID13
Feature activation-2.03e+01

Pos logit contributions
esy+6.438
 Yen+6.410
 Paste+6.318
aunder+6.224
 Schumer+6.172
Neg logit contributions
���-12.574
 incent-11.414
rador-10.737
 rall-10.695
 Binding-10.457
Tokenx
Token ID87
Feature activation-2.19e+01

Pos logit contributions
esy+7.010
alter+6.810
erial+6.662
ardless+6.619
ycle+6.536
Neg logit contributions
���-20.496
 incent-18.346
 rall-18.051
 oun-17.865
 Binding-17.805
Token.
Token ID13
Feature activation-2.27e+01

Pos logit contributions
esy+8.386
alter+8.090
ardless+8.015
aunder+7.868
 Yen+7.808
Neg logit contributions
���-20.326
 incent-18.023
 tiss-17.925
 oun-17.907
 rall-17.888
Token118
Token ID16817
Feature activation-1.78e+01

Pos logit contributions
alter+3.324
esy+3.246
ardless+2.624
 Wick+2.492
erial+2.439
Neg logit contributions
 Binding-23.630
rador-23.443
 Goo-23.196
 Ala-22.807
���-22.764
Token 10
Token ID838
Feature activation-7.53e+00

Pos logit contributions
 Yen+8.155
 Cantor+7.760
 Schumer+7.684
 Construct+7.662
 Paste+7.631
Neg logit contributions
rador-13.558
���-12.743
Reviewer-12.623
 Binding-12.588
 antitrust-12.429
Token95
Token ID3865
Feature activation-1.59e+00

Pos logit contributions
ardless+4.401
esy+4.328
 Paste+4.322
ycle+4.312
aunder+4.308
Neg logit contributions
rador-5.570
 Binding-5.478
 LAR-5.348
 Goo-5.228
 antitrust-5.205
Token12
Token ID1065
Feature activation-3.73e+00

Pos logit contributions
ycle+1.398
 Yen+1.398
ardless+1.393
aunder+1.382
 Paste+1.374
Neg logit contributions
rador-1.754
 Binding-1.752
 LAR-1.743
NPR-1.677
 Goo-1.672
Token x
Token ID2124
Feature activation-3.12e+00

Pos logit contributions
 Yen+1.982
 Paste+1.940
ycle+1.924
esy+1.901
ardless+1.901
Neg logit contributions
rador-3.027
 Binding-2.963
 LAR-2.956
NPR-2.908
 Ala-2.851
Token.
Token ID13
Feature activation-8.54e+00

Pos logit contributions
ycle+2.128
 Yen+2.119
ardless+2.117
esy+2.105
aunder+2.104
Neg logit contributions
rador-2.052
 Binding-2.041
 LAR-2.013
NPR-1.937
 Goo-1.934
Tokenx
Token ID87
Feature activation-1.24e+01

Pos logit contributions
esy+3.561
ycle+3.529
ardless+3.508
 Yen+3.411
erial+3.401
Neg logit contributions
���-7.976
 Binding-7.749
rador-7.726
 incent-7.704
 LAR-7.667
Token.
Token ID13
Feature activation-1.99e+01

Pos logit contributions
esy+6.205
 Yen+6.151
 Paste+6.087
aunder+6.006
ardless+5.985
Neg logit contributions
���-11.219
 incent-10.278
rador-9.759
 rall-9.533
 Binding-9.495
Tokenx
Token ID87
Feature activation-2.02e+01

Pos logit contributions
esy+6.862
alter+6.472
ardless+6.363
erial+6.350
ycle+6.345
Neg logit contributions
���-20.542
 incent-18.436
 rall-18.130
 oun-17.731
 corrid-17.549
Token.
Token ID13
Feature activation-1.99e+01

Pos logit contributions
esy+8.073
aunder+7.845
ardless+7.720
alter+7.600
erial+7.577
Neg logit contributions
���-18.859
 incent-16.943
 rall-16.666
 corrid-16.536
 tiss-16.327
Token114
Token ID16562
Feature activation-1.44e+01

Pos logit contributions
esy+4.176
worth+4.062
alter+4.035
aunder+3.965
erial+3.943
Neg logit contributions
 Binding-19.565
���-19.544
 Goo-19.369
rador-19.141
 LAR-19.030
Token 112
Token ID13539
Feature activation-4.40e+00

Pos logit contributions
 Yen+6.800
 Paste+6.564
 Cantor+6.397
 Schumer+6.367
 Wick+6.349
Neg logit contributions
rador-11.342
 Binding-10.745
���-10.531
Reviewer-10.524
 antitrust-10.418
Token59
Token ID3270
Feature activation-2.72e+00

Pos logit contributions
ardless+2.675
 Yen+2.645
ycle+2.640
 Paste+2.626
aunder+2.626
Neg logit contributions
rador-3.195
 Binding-3.169
 LAR-3.138
 Goo-3.037
 Ala-3.009
Token07
Token ID2998
Feature activation+3.51e-01

Pos logit contributions
 Yen+1.588
ardless+1.578
ycle+1.573
 Paste+1.563
aunder+1.560
Neg logit contributions
rador-2.057
 Binding-2.038
 LAR-2.026
 Goo-1.947
NPR-1.940
Token x
Token ID2124
Feature activation-3.12e+00

Pos logit contributions
 Yen+1.982
 Paste+1.940
ycle+1.924
esy+1.901
ardless+1.901
Neg logit contributions
rador-3.027
 Binding-2.963
 LAR-2.956
NPR-2.908
 Ala-2.851
Token.
Token ID13
Feature activation-8.54e+00

Pos logit contributions
ycle+2.128
 Yen+2.119
ardless+2.117
esy+2.105
aunder+2.104
Neg logit contributions
rador-2.052
 Binding-2.041
 LAR-2.013
NPR-1.937
 Goo-1.934
Tokenx
Token ID87
Feature activation-1.24e+01

Pos logit contributions
esy+3.561
ycle+3.529
ardless+3.508
 Yen+3.411
erial+3.401
Neg logit contributions
���-7.976
 Binding-7.749
rador-7.726
 incent-7.704
 LAR-7.667
Token.
Token ID13
Feature activation-1.99e+01

Pos logit contributions
esy+6.205
 Yen+6.151
 Paste+6.087
aunder+6.006
ardless+5.985
Neg logit contributions
���-11.219
 incent-10.278
rador-9.759
 rall-9.533
 Binding-9.495
Tokenx
Token ID87
Feature activation-2.02e+01

Pos logit contributions
esy+6.862
alter+6.472
ardless+6.363
erial+6.350
ycle+6.345
Neg logit contributions
���-20.542
 incent-18.436
 rall-18.130
 oun-17.731
 corrid-17.549
Token.
Token ID13
Feature activation-1.99e+01

Pos logit contributions
esy+8.073
aunder+7.845
ardless+7.720
alter+7.600
erial+7.577
Neg logit contributions
���-18.859
 incent-16.943
 rall-16.666
 corrid-16.536
 tiss-16.327
Token114
Token ID16562
Feature activation-1.44e+01

Pos logit contributions
esy+4.176
worth+4.062
alter+4.035
aunder+3.965
erial+3.943
Neg logit contributions
 Binding-19.565
���-19.544
 Goo-19.369
rador-19.141
 LAR-19.030
Token 112
Token ID13539
Feature activation-4.40e+00

Pos logit contributions
 Yen+6.800
 Paste+6.564
 Cantor+6.397
 Schumer+6.367
 Wick+6.349
Neg logit contributions
rador-11.342
 Binding-10.745
���-10.531
Reviewer-10.524
 antitrust-10.418
Token59
Token ID3270
Feature activation-2.72e+00

Pos logit contributions
ardless+2.675
 Yen+2.645
ycle+2.640
 Paste+2.626
aunder+2.626
Neg logit contributions
rador-3.195
 Binding-3.169
 LAR-3.138
 Goo-3.037
 Ala-3.009
Token07
Token ID2998
Feature activation+3.51e-01

Pos logit contributions
 Yen+1.588
ardless+1.578
ycle+1.573
 Paste+1.563
aunder+1.560
Neg logit contributions
rador-2.057
 Binding-2.038
 LAR-2.026
 Goo-1.947
NPR-1.940
Token x
Token ID2124
Feature activation-3.08e+00

Pos logit contributions
 Binding+0.262
 LAR+0.262
rador+0.260
NPR+0.251
 Goo+0.251
Neg logit contributions
ycle-0.210
ardless-0.208
 Yen-0.207
aunder-0.207
esy-0.204
Token397
Token ID33372
Feature activation-2.36e+00

Pos logit contributions
ardless+3.806
 Yen+3.764
 Paste+3.731
esy+3.693
ycle+3.673
Neg logit contributions
rador-5.773
 Binding-5.635
 LAR-5.567
 Goo-5.416
 antitrust-5.402
Token12
Token ID1065
Feature activation-3.73e+00

Pos logit contributions
ycle+1.398
 Yen+1.398
ardless+1.393
aunder+1.382
 Paste+1.374
Neg logit contributions
rador-1.754
 Binding-1.752
 LAR-1.743
NPR-1.677
 Goo-1.672
Token x
Token ID2124
Feature activation-3.12e+00

Pos logit contributions
 Yen+1.982
 Paste+1.940
ycle+1.924
esy+1.901
ardless+1.901
Neg logit contributions
rador-3.027
 Binding-2.963
 LAR-2.956
NPR-2.908
 Ala-2.851
Token.
Token ID13
Feature activation-8.54e+00

Pos logit contributions
ycle+2.128
 Yen+2.119
ardless+2.117
esy+2.105
aunder+2.104
Neg logit contributions
rador-2.052
 Binding-2.041
 LAR-2.013
NPR-1.937
 Goo-1.934
Tokenx
Token ID87
Feature activation-1.24e+01

Pos logit contributions
esy+3.561
ycle+3.529
ardless+3.508
 Yen+3.411
erial+3.401
Neg logit contributions
���-7.976
 Binding-7.749
rador-7.726
 incent-7.704
 LAR-7.667
Token.
Token ID13
Feature activation-1.99e+01

Pos logit contributions
esy+6.205
 Yen+6.151
 Paste+6.087
aunder+6.006
ardless+5.985
Neg logit contributions
���-11.219
 incent-10.278
rador-9.759
 rall-9.533
 Binding-9.495
Tokenx
Token ID87
Feature activation-2.02e+01

Pos logit contributions
esy+6.862
alter+6.472
ardless+6.363
erial+6.350
ycle+6.345
Neg logit contributions
���-20.542
 incent-18.436
 rall-18.130
 oun-17.731
 corrid-17.549
Token.
Token ID13
Feature activation-1.99e+01

Pos logit contributions
esy+8.073
aunder+7.845
ardless+7.720
alter+7.600
erial+7.577
Neg logit contributions
���-18.859
 incent-16.943
 rall-16.666
 corrid-16.536
 tiss-16.327
Token114
Token ID16562
Feature activation-1.44e+01

Pos logit contributions
esy+4.176
worth+4.062
alter+4.035
aunder+3.965
erial+3.943
Neg logit contributions
 Binding-19.565
���-19.544
 Goo-19.369
rador-19.141
 LAR-19.030
Token 112
Token ID13539
Feature activation-4.40e+00

Pos logit contributions
 Yen+6.800
 Paste+6.564
 Cantor+6.397
 Schumer+6.367
 Wick+6.349
Neg logit contributions
rador-11.342
 Binding-10.745
���-10.531
Reviewer-10.524
 antitrust-10.418
Token59
Token ID3270
Feature activation-2.72e+00

Pos logit contributions
ardless+2.675
 Yen+2.645
ycle+2.640
 Paste+2.626
aunder+2.626
Neg logit contributions
rador-3.195
 Binding-3.169
 LAR-3.138
 Goo-3.037
 Ala-3.009
Token
Token ID198
Feature activation-1.91e+01

Pos logit contributions
ycle+0.704
ardless+0.700
 Yen+0.699
aunder+0.694
 Paste+0.692
Neg logit contributions
 Binding-0.553
rador-0.552
 LAR-0.550
NPR-0.523
 Goo-0.522
Token2010
Token ID10333
Feature activation+4.86e+00

Pos logit contributions
alter+5.086
ycle+4.699
esy+4.583
+4.566
 Paste+4.523
Neg logit contributions
 incent-19.006
���-18.957
rador-18.125
 LAR-17.941
 Binding-17.814
Token Matthew
Token ID9308
Feature activation-4.98e+00

Pos logit contributions
 LAR+2.928
 Binding+2.863
 Goo+2.772
 Ala+2.751
rador+2.723
Neg logit contributions
ycle-3.671
aunder-3.657
ardless-3.645
esy-3.511
 Yen-3.490
Token Morrison
Token ID24454
Feature activation-4.96e+00

Pos logit contributions
 Yen+2.523
 Paste+2.512
ycle+2.493
ardless+2.459
aunder+2.419
Neg logit contributions
rador-4.191
 Binding-4.089
 LAR-4.080
NPR-4.061
 Goo-3.930
Token
Token ID198
Feature activation+1.31e+00

Pos logit contributions
ycle+3.326
 Yen+3.305
 Paste+3.266
aunder+3.217
ardless+3.213
Neg logit contributions
 Binding-3.274
rador-3.267
 LAR-3.238
NPR-3.127
 Ala-3.082
Token
Token ID198
Feature activation-1.98e+01

Pos logit contributions
 LAR+0.715
 Binding+0.709
rador+0.696
NPR+0.677
 Ala+0.669
Neg logit contributions
ycle-1.052
ardless-1.040
aunder-1.039
 Yen-1.036
esy-1.024
Token2010
Token ID10333
Feature activation+1.53e+00

Pos logit contributions
alter+5.423
ycle+5.336
esy+4.967
 Paste+4.747
+4.716
Neg logit contributions
 incent-18.216
���-18.143
 LAR-18.040
 Binding-17.981
rador-17.979
Token Tony
Token ID8832
Feature activation+7.95e+00

Pos logit contributions
 LAR+0.914
 Binding+0.909
rador+0.888
 Goo+0.866
 Ala+0.862
Neg logit contributions
ycle-1.156
ardless-1.148
aunder-1.144
 Yen-1.129
esy-1.123
Token Shal
Token ID47974
Feature activation+7.34e+00

Pos logit contributions
 LAR+3.979
 Binding+3.951
 Goo+3.830
 Ala+3.799
rador+3.709
Neg logit contributions
ardless-6.647
ycle-6.502
aunder-6.388
esy-6.372
apon-6.267
Tokenh
Token ID71
Feature activation+6.89e+00

Pos logit contributions
rador+2.697
 LAR+2.665
 Binding+2.657
 Goo+2.522
 Ala+2.491
Neg logit contributions
 Yen-6.909
ardless-6.793
ycle-6.779
 Paste-6.753
aunder-6.642
Tokenoub
Token ID12944
Feature activation-1.25e+00

Pos logit contributions
rador+2.340
 Binding+2.315
 LAR+2.240
NPR+2.156
ob+2.095
Neg logit contributions
 Yen-6.781
ycle-6.643
 Paste-6.579
ardless-6.572
esy-6.525
Tokenmel
Token ID17694
Feature activation-3.93e+00

Pos logit contributions
ycle+3.059
 Yen+3.043
ardless+3.021
esy+3.019
aunder+3.019
Neg logit contributions
 Binding-1.830
 LAR-1.785
rador-1.780
 Goo-1.709
NPR-1.695
Token Kan
Token ID14248
Feature activation-3.99e-01

Pos logit contributions
 Yen+1.998
ycle+1.949
 Paste+1.938
ardless+1.933
aunder+1.930
Neg logit contributions
rador-3.280
 Binding-3.266
 LAR-3.236
NPR-3.165
 Goo-3.115
Token,
Token ID11
Feature activation-5.40e+00

Pos logit contributions
ycle+0.274
ardless+0.272
 Yen+0.270
aunder+0.269
esy+0.267
Neg logit contributions
 Binding-0.263
 LAR-0.263
rador-0.261
NPR-0.251
 Ala-0.250
Token pour
Token ID12797
Feature activation+1.05e+00

Pos logit contributions
 Yen+3.617
ycle+3.597
 Paste+3.584
ardless+3.571
aunder+3.565
Neg logit contributions
 Binding-3.542
rador-3.518
 LAR-3.496
NPR-3.371
 Goo-3.371
Token le
Token ID443
Feature activation+8.49e-01

Pos logit contributions
 LAR+0.712
 Binding+0.709
rador+0.707
NPR+0.680
 Ala+0.679
Neg logit contributions
ycle-0.699
ardless-0.690
 Yen-0.687
aunder-0.687
esy-0.679
Token Command
Token ID9455
Feature activation-1.97e+01

Pos logit contributions
 Binding+0.566
 LAR+0.566
rador+0.561
NPR+0.539
 Ala+0.538
Neg logit contributions
ycle-0.577
ardless-0.571
aunder-0.568
 Yen-0.567
esy-0.560
Tokenement
Token ID972
Feature activation-2.99e+00

Pos logit contributions
esy+5.977
erial+5.921
 Paste+5.645
 Yen+5.594
ycle+5.472
Neg logit contributions
���-17.261
 Binding-17.242
 LAR-17.021
 Goo-16.920
NPR-16.704
Token des
Token ID748
Feature activation-7.83e-01

Pos logit contributions
ycle+2.022
 Yen+2.021
 Paste+2.007
ardless+2.004
aunder+1.999
Neg logit contributions
 Binding-1.964
rador-1.957
 LAR-1.951
NPR-1.882
 Goo-1.864
Token Cors
Token ID26978
Feature activation-8.93e+00

Pos logit contributions
ycle+0.544
ardless+0.539
 Yen+0.538
aunder+0.535
 Paste+0.531
Neg logit contributions
 Binding-0.508
 LAR-0.507
rador-0.505
NPR-0.485
 Goo-0.483
Tokenaires
Token ID17693
Feature activation-9.98e+00

Pos logit contributions
esy+2.766
ycle+2.692
ardless+2.674
erial+2.655
aunder+2.643
Neg logit contributions
 Binding-9.080
 LAR-8.949
 Goo-8.821
NPR-8.690
rador-8.658
Token de
Token ID390
Feature activation+7.37e+00

Pos logit contributions
 Paste+6.600
 Yen+6.589
aunder+6.342
 Pon+6.307
ardless+6.287
Neg logit contributions
 Binding-6.250
rador-6.231
 LAR-6.011
NPR-5.989
 Goo-5.894
Token x
Token ID2124
Feature activation-2.25e+00

Pos logit contributions
 LAR+0.273
 Binding+0.273
rador+0.269
 Goo+0.261
NPR+0.261
Neg logit contributions
ycle-0.224
ardless-0.222
aunder-0.220
 Yen-0.219
esy-0.216
Token.
Token ID13
Feature activation-8.94e+00

Pos logit contributions
ycle+1.579
ardless+1.568
 Yen+1.568
aunder+1.560
esy+1.555
Neg logit contributions
rador-1.443
 Binding-1.441
 LAR-1.426
NPR-1.367
 Goo-1.366
Tokenx
Token ID87
Feature activation-1.20e+01

Pos logit contributions
esy+3.676
ycle+3.663
ardless+3.622
erial+3.548
aunder+3.543
Neg logit contributions
���-8.438
 Binding-8.113
rador-8.106
 incent-8.082
 LAR-8.021
Token.
Token ID13
Feature activation-1.72e+01

Pos logit contributions
esy+6.097
 Yen+6.080
 Paste+5.959
aunder+5.925
ycle+5.894
Neg logit contributions
���-10.749
 incent-9.874
rador-9.412
 Binding-9.198
 rall-9.180
Tokenx
Token ID87
Feature activation-1.67e+01

Pos logit contributions
esy+5.233
erial+4.982
ycle+4.934
alter+4.842
 Yen+4.767
Neg logit contributions
���-18.703
 incent-17.161
 rall-16.683
 Binding-16.352
rador-15.973
Token.
Token ID13
Feature activation-1.97e+01

Pos logit contributions
 Yen+7.259
esy+7.249
aunder+7.156
 Schumer+6.979
 Paste+6.947
Neg logit contributions
���-15.398
 incent-14.096
 rall-13.445
rador-13.212
 Binding-13.119
Token120
Token ID10232
Feature activation-1.42e+01

Pos logit contributions
aunder+3.372
erial+3.351
esy+3.317
alter+3.189
 Yen+3.154
Neg logit contributions
 Binding-19.953
rador-19.888
 Goo-19.792
 LAR-19.605
���-19.559
Token 10
Token ID838
Feature activation-6.89e+00

Pos logit contributions
 Yen+7.248
 Paste+6.829
 Schumer+6.827
aunder+6.803
 Cantor+6.799
Neg logit contributions
rador-10.664
 Binding-10.106
Reviewer-9.934
 LAR-9.850
 antitrust-9.839
Token25
Token ID1495
Feature activation-4.44e-02

Pos logit contributions
ardless+3.990
 Paste+3.929
esy+3.925
ycle+3.915
aunder+3.908
Neg logit contributions
rador-5.155
 Binding-5.061
 LAR-4.976
 Goo-4.867
 antitrust-4.822
Token290
Token ID24369
Feature activation-2.95e+00

Pos logit contributions
ycle+0.028
ardless+0.028
 Yen+0.028
aunder+0.028
esy+0.027
Neg logit contributions
 LAR-0.032
 Binding-0.032
rador-0.031
NPR-0.030
 Goo-0.030
Token x
Token ID2124
Feature activation-2.69e+00

Pos logit contributions
 Yen+1.618
ycle+1.594
 Paste+1.574
ardless+1.572
aunder+1.568
Neg logit contributions
rador-2.353
 Binding-2.332
 LAR-2.319
NPR-2.262
 Ala-2.233
Token59
Token ID3270
Feature activation-4.29e+00

Pos logit contributions
ardless+3.936
ycle+3.878
esy+3.874
 Paste+3.868
 Yen+3.862
Neg logit contributions
rador-4.919
 Binding-4.836
 LAR-4.725
 Goo-4.623
 Ala-4.578
Token326
Token ID39195
Feature activation-6.17e-01

Pos logit contributions
ycle+2.608
ardless+2.592
 Yen+2.588
aunder+2.573
esy+2.568
Neg logit contributions
rador-3.122
 Binding-3.111
 LAR-3.083
 Goo-2.972
NPR-2.952
Token x
Token ID2124
Feature activation-2.68e+00

Pos logit contributions
ycle+0.364
ardless+0.360
 Yen+0.359
aunder+0.357
esy+0.353
Neg logit contributions
 Binding-0.466
 LAR-0.465
rador-0.463
NPR-0.448
 Goo-0.446
Token.
Token ID13
Feature activation-1.01e+01

Pos logit contributions
ycle+1.847
 Yen+1.839
ardless+1.836
aunder+1.826
esy+1.824
Neg logit contributions
rador-1.742
 Binding-1.733
 LAR-1.712
NPR-1.645
 Goo-1.642
Tokenx
Token ID87
Feature activation-1.42e+01

Pos logit contributions
esy+3.963
ycle+3.881
ardless+3.864
erial+3.791
 Yen+3.781
Neg logit contributions
���-9.878
 incent-9.427
rador-9.306
 Binding-9.292
 LAR-9.163
Token.
Token ID13
Feature activation-1.96e+01

Pos logit contributions
esy+6.595
 Yen+6.525
 Paste+6.454
aunder+6.341
ardless+6.302
Neg logit contributions
���-13.412
 incent-12.156
 rall-11.389
rador-11.332
 tiss-11.168
Tokenx
Token ID87
Feature activation-1.79e+01

Pos logit contributions
ardless+6.205
esy+6.205
alter+6.083
erial+6.069
ycle+6.018
Neg logit contributions
���-20.601
 incent-18.608
 rall-18.301
 oun-17.780
 Binding-17.768
Token.
Token ID13
Feature activation-1.81e+01

Pos logit contributions
ardless+7.722
aunder+7.538
alter+7.473
esy+7.351
erial+7.190
Neg logit contributions
���-16.367
 incent-14.813
 rall-14.396
 tiss-14.370
 Binding-14.131
Token116
Token ID18298
Feature activation-1.53e+01

Pos logit contributions
ardless+1.366
alter+1.298
ycle+0.793
esy+0.646
 Paste+0.630
Neg logit contributions
 LAR-21.137
 Binding-20.882
 Goo-20.551
rador-20.255
���-20.195
Token 104
Token ID14436
Feature activation-3.09e+00

Pos logit contributions
 Yen+7.160
 Paste+6.867
 Schumer+6.847
 Cantor+6.794
 Wick+6.785
Neg logit contributions
rador-11.913
 Binding-11.169
���-11.086
 antitrust-11.010
Reviewer-10.978
Token17
Token ID1558
Feature activation-3.28e+00

Pos logit contributions
ycle+1.834
 Yen+1.834
ardless+1.834
aunder+1.811
 Paste+1.804
Neg logit contributions
rador-2.282
 Binding-2.279
 LAR-2.266
 Goo-2.175
NPR-2.175
Token25
Token ID1495
Feature activation-2.67e+00

Pos logit contributions
ardless+4.079
ycle+4.009
esy+3.993
aunder+3.990
 Yen+3.985
Neg logit contributions
rador-5.377
 Binding-5.281
 LAR-5.173
 Goo-5.069
 antitrust-5.015
Token151
Token ID24309
Feature activation-1.26e+00

Pos logit contributions
ycle+1.670
ardless+1.656
 Yen+1.651
aunder+1.646
esy+1.635
Neg logit contributions
 Binding-1.909
rador-1.906
 LAR-1.898
 Goo-1.820
NPR-1.816
Token x
Token ID2124
Feature activation-1.53e+00

Pos logit contributions
ycle+0.723
 Yen+0.719
ardless+0.716
aunder+0.710
esy+0.705
Neg logit contributions
 Binding-0.960
rador-0.959
 LAR-0.958
NPR-0.925
 Goo-0.918
Token.
Token ID13
Feature activation-8.05e+00

Pos logit contributions
ycle+1.090
ardless+1.083
 Yen+1.081
aunder+1.076
esy+1.070
Neg logit contributions
 Binding-0.956
rador-0.955
 LAR-0.950
NPR-0.909
 Goo-0.906
Tokenx
Token ID87
Feature activation-1.24e+01

Pos logit contributions
esy+3.409
ycle+3.395
ardless+3.378
 Yen+3.301
aunder+3.287
Neg logit contributions
���-7.392
rador-7.256
 Binding-7.239
 LAR-7.171
 incent-7.157
Token.
Token ID13
Feature activation-1.96e+01

Pos logit contributions
esy+6.189
 Yen+6.166
 Paste+6.078
aunder+5.966
ardless+5.963
Neg logit contributions
���-11.236
 incent-10.271
rador-9.781
 rall-9.520
 Binding-9.492
Tokenx
Token ID87
Feature activation-1.83e+01

Pos logit contributions
esy+6.266
alter+6.010
erial+5.951
ycle+5.860
ardless+5.777
Neg logit contributions
���-21.018
 incent-18.861
 rall-18.344
 oun-18.141
 corrid-17.779
Token.
Token ID13
Feature activation-1.76e+01

Pos logit contributions
esy+7.621
aunder+7.276
erial+7.219
alter+7.188
ardless+7.124
Neg logit contributions
���-17.520
 incent-15.729
 rall-15.046
 oun-14.857
 tiss-14.680
Token113
Token ID16616
Feature activation-1.25e+01

Pos logit contributions
esy+2.250
ycle+2.151
emi+2.043
alter+2.008
yan+1.935
Neg logit contributions
���-18.946
 Binding-18.876
rador-18.669
 Goo-18.523
 Ala-18.433
Token 12
Token ID1105
Feature activation-7.22e+00

Pos logit contributions
 Yen+6.663
 Paste+6.346
 Schumer+6.271
ardless+6.267
aunder+6.264
Neg logit contributions
rador-9.316
 Binding-8.885
 antitrust-8.704
Reviewer-8.690
 LAR-8.671
Token397
Token ID33372
Feature activation-2.36e+00

Pos logit contributions
ardless+3.806
 Yen+3.764
 Paste+3.731
esy+3.693
ycle+3.673
Neg logit contributions
rador-5.773
 Binding-5.635
 LAR-5.567
 Goo-5.416
 antitrust-5.402
Token beat
Token ID4405
Feature activation-9.11e-01

Pos logit contributions
 Binding+0.304
 LAR+0.304
rador+0.299
NPR+0.289
 Goo+0.289
Neg logit contributions
ycle-0.343
ardless-0.342
aunder-0.339
 Yen-0.339
esy-0.335
Token or
Token ID393
Feature activation-2.25e-02

Pos logit contributions
ycle+0.638
ardless+0.631
 Yen+0.629
aunder+0.628
esy+0.622
Neg logit contributions
 Binding-0.586
 LAR-0.585
rador-0.582
NPR-0.559
 Goo-0.557
Token other
Token ID584
Feature activation+2.46e+00

Pos logit contributions
ycle+0.016
ardless+0.016
 Yen+0.016
aunder+0.016
esy+0.016
Neg logit contributions
 Binding-0.014
 LAR-0.014
rador-0.014
NPR-0.014
 Goo-0.013
Token.
Token ID13
Feature activation-6.21e+00

Pos logit contributions
 Binding+1.555
 LAR+1.555
rador+1.507
 Goo+1.482
NPR+1.469
Neg logit contributions
ycle-1.757
ardless-1.756
aunder-1.742
 Yen-1.724
esy-1.709
Token
Token ID198
Feature activation-1.79e-01

Pos logit contributions
 Yen+4.264
 Paste+4.263
ardless+4.142
ycle+4.129
 Schumer+4.066
Neg logit contributions
rador-4.063
 LAR-3.862
 Binding-3.846
NPR-3.751
 antitrust-3.693
Token
Token ID198
Feature activation-1.96e+01

Pos logit contributions
ycle+0.137
ardless+0.136
 Yen+0.136
aunder+0.136
esy+0.134
Neg logit contributions
 Binding-0.103
 LAR-0.103
rador-0.102
NPR-0.097
 Goo-0.097
Token12
Token ID1065
Feature activation-2.24e+00

Pos logit contributions
ּ+7.933
esy+7.788
alter+7.770
ycle+7.612
ardless+7.388
Neg logit contributions
 Binding-15.353
 LAR-15.091
rador-14.728
 Ala-14.441
 incent-14.093
Token.
Token ID13
Feature activation-6.23e+00

Pos logit contributions
 Yen+1.535
ycle+1.529
 Paste+1.525
ardless+1.518
aunder+1.511
Neg logit contributions
rador-1.472
 Binding-1.443
 LAR-1.432
NPR-1.376
 Goo-1.370
Token Donkey
Token ID43823
Feature activation-1.19e+00

Pos logit contributions
 Yen+4.313
 Paste+4.254
ycle+4.147
ardless+4.140
aunder+4.080
Neg logit contributions
rador-4.050
 LAR-3.864
NPR-3.816
 Binding-3.761
 antitrust-3.660
Token Kong
Token ID9071
Feature activation+7.13e+00

Pos logit contributions
 Yen+0.841
ycle+0.839
ardless+0.837
aunder+0.827
 Paste+0.827
Neg logit contributions
rador-0.748
 LAR-0.746
 Binding-0.741
NPR-0.714
 Ala-0.708
Token Country
Token ID12946
Feature activation-3.22e-01

Pos logit contributions
 Binding+3.894
 LAR+3.747
rador+3.653
 Goo+3.558
 Ala+3.551
Neg logit contributions
ycle-5.705
aunder-5.693
ardless-5.662
esy-5.578
erial-5.522
Tokenussed
Token ID29569
Feature activation-4.39e+00

Pos logit contributions
ycle+0.116
ardless+0.115
 Yen+0.115
aunder+0.115
 Paste+0.113
Neg logit contributions
 Binding-0.099
 LAR-0.099
rador-0.099
NPR-0.094
 Goo-0.094
Token joint
Token ID6466
Feature activation+1.52e+00

Pos logit contributions
 Yen+2.866
ycle+2.831
ardless+2.800
 Paste+2.793
aunder+2.787
Neg logit contributions
rador-2.985
 Binding-2.973
 LAR-2.957
NPR-2.894
 Goo-2.852
Token venture
Token ID13189
Feature activation-2.96e+00

Pos logit contributions
 LAR+0.810
rador+0.802
 Binding+0.801
 Goo+0.756
 Ala+0.749
Neg logit contributions
ycle-1.232
ardless-1.228
aunder-1.219
 Yen-1.212
esy-1.210
Token,
Token ID11
Feature activation-6.35e+00

Pos logit contributions
ycle+1.985
 Yen+1.985
ardless+1.961
 Paste+1.948
aunder+1.943
Neg logit contributions
 Binding-1.970
 LAR-1.957
rador-1.950
NPR-1.894
 Goo-1.875
Token S
Token ID311
Feature activation-1.78e+00

Pos logit contributions
 Yen+4.177
ycle+4.078
 Paste+4.050
ardless+4.019
aunder+4.018
Neg logit contributions
rador-4.310
 LAR-4.206
 Binding-4.180
NPR-4.134
 Goo-4.013
TokenBI
Token ID3483
Feature activation-1.95e+01

Pos logit contributions
ycle+1.656
ardless+1.643
aunder+1.636
 Yen+1.633
 Paste+1.617
Neg logit contributions
 LAR-0.738
 Binding-0.737
rador-0.732
NPR-0.686
 Ala-0.685
Token Wire
Token ID14712
Feature activation-1.65e+01

Pos logit contributions
 Wick+10.182
 Yen+10.122
 Paste+9.431
erial+9.400
ycle+9.373
Neg logit contributions
rador-13.339
NPR-12.683
 LAR-12.319
Reviewer-12.062
 Binding-12.049
Tokenx
Token ID87
Feature activation-1.46e+01

Pos logit contributions
 Yen+5.136
ycle+5.094
esy+5.020
erial+4.750
 Paste+4.720
Neg logit contributions
���-16.127
 incent-15.252
rador-15.184
 LAR-15.145
 Binding-14.880
Token Asia
Token ID7229
Feature activation-4.23e+00

Pos logit contributions
 Yen+7.126
 Paste+6.689
ycle+6.462
 Wick+6.453
aunder+6.383
Neg logit contributions
rador-11.375
NPR-11.179
 LAR-10.789
 Binding-10.655
quickShipAvailable-10.582
Token.
Token ID13
Feature activation-5.48e+00

Pos logit contributions
 Yen+2.749
ycle+2.724
aunder+2.683
ardless+2.681
 Paste+2.680
Neg logit contributions
rador-2.902
 LAR-2.873
 Binding-2.867
NPR-2.824
 Goo-2.753
Token The
Token ID383
Feature activation-1.33e+00

Pos logit contributions
 Yen+3.868
 Paste+3.779
ardless+3.760
ycle+3.742
aunder+3.709
Neg logit contributions
rador-3.490
 LAR-3.411
 Binding-3.369
NPR-3.275
 Goo-3.218
Token c
Token ID269
Feature activation-2.05e+00

Pos logit contributions
ycle+2.508
ardless+2.488
 Yen+2.485
esy+2.477
aunder+2.468
Neg logit contributions
 Binding-2.531
 LAR-2.498
rador-2.498
NPR-2.411
 Goo-2.404
Tokenepend
Token ID2690
Feature activation-1.21e+01

Pos logit contributions
ycle+1.073
ardless+1.061
 Yen+1.056
aunder+1.051
esy+1.039
Neg logit contributions
 Binding-1.678
 LAR-1.676
rador-1.667
NPR-1.616
 Goo-1.611
Tokenant
Token ID415
Feature activation-5.21e+00

Pos logit contributions
ycle+6.265
esy+6.244
ardless+6.220
aunder+6.018
 Yen+5.854
Neg logit contributions
 Binding-9.465
 LAR-9.375
 Goo-9.123
 Ala-8.924
NPR-8.855
Token que
Token ID8358
Feature activation-1.09e+00

Pos logit contributions
 Yen+3.441
ycle+3.428
esy+3.416
ardless+3.409
 Paste+3.409
Neg logit contributions
 Binding-3.462
 LAR-3.413
rador-3.410
NPR-3.332
 Goo-3.289
Token le
Token ID443
Feature activation+2.42e+00

Pos logit contributions
ycle+0.715
ardless+0.708
 Yen+0.706
aunder+0.703
esy+0.696
Neg logit contributions
 Binding-0.751
 LAR-0.750
rador-0.746
NPR-0.718
 Goo-0.715
Token Command
Token ID9455
Feature activation-1.94e+01

Pos logit contributions
 Binding+1.592
 LAR+1.585
rador+1.570
 Ala+1.509
 Goo+1.502
Neg logit contributions
ycle-1.666
ardless-1.642
aunder-1.640
 Yen-1.635
esy-1.611
Tokenement
Token ID972
Feature activation-5.43e+00

Pos logit contributions
esy+8.865
erial+8.844
 Paste+8.634
 Yen+8.555
ardless+8.478
Neg logit contributions
 Binding-13.739
 LAR-13.688
 Goo-13.471
NPR-13.401
BALL-12.956
Token est
Token ID1556
Feature activation-2.05e+00

Pos logit contributions
 Paste+3.215
 Yen+3.197
ycle+3.163
ardless+3.150
aunder+3.131
Neg logit contributions
 Binding-3.973
rador-3.966
 LAR-3.953
NPR-3.831
 Goo-3.805
Token total
Token ID2472
Feature activation-7.80e+00

Pos logit contributions
ycle+1.390
 Yen+1.383
ardless+1.380
aunder+1.371
 Paste+1.364
Neg logit contributions
 Binding-1.356
 LAR-1.350
rador-1.343
NPR-1.298
 Goo-1.293
Tokenement
Token ID972
Feature activation-3.77e+00

Pos logit contributions
ycle+5.018
esy+4.984
aunder+4.919
 Yen+4.907
ardless+4.899
Neg logit contributions
 Binding-5.236
rador-5.157
 LAR-5.148
NPR-5.037
 Goo-4.993
Token d
Token ID288
Feature activation+1.21e+00

Pos logit contributions
ycle+2.521
 Yen+2.509
 Paste+2.499
ardless+2.498
esy+2.485
Neg logit contributions
 Binding-2.494
 LAR-2.472
rador-2.471
NPR-2.398
 Goo-2.375
Token full
Token ID1336
Feature activation-2.94e+00

Pos logit contributions
 LAR+2.133
 Binding+2.117
rador+2.067
 Ala+2.012
 Goo+2.009
Neg logit contributions
ycle-2.365
ardless-2.352
aunder-2.340
 Yen-2.323
 Paste-2.283
Token of
Token ID286
Feature activation+5.16e+00

Pos logit contributions
ycle+1.820
ardless+1.802
 Yen+1.796
esy+1.766
 Paste+1.756
Neg logit contributions
rador-2.128
 Binding-2.112
 LAR-2.092
NPR-2.033
 Goo-2.024
Token possibility
Token ID5885
Feature activation+4.11e-01

Pos logit contributions
 LAR+3.162
 Binding+3.154
rador+3.038
 Goo+3.017
 Ala+3.003
Neg logit contributions
ardless-3.773
aunder-3.733
ycle-3.733
 Yen-3.644
 Paste-3.627
Token.
Token ID13
Feature activation-3.16e+00

Pos logit contributions
 Binding+0.262
 LAR+0.262
rador+0.260
NPR+0.250
 Goo+0.249
Neg logit contributions
ycle-0.290
ardless-0.288
 Yen-0.287
aunder-0.286
esy-0.283
Token
Token ID198
Feature activation-1.15e+00

Pos logit contributions
 Yen+2.262
ycle+2.239
 Paste+2.239
ardless+2.234
aunder+2.201
Neg logit contributions
rador-1.984
 LAR-1.932
 Binding-1.926
NPR-1.862
 Ala-1.835
Token
Token ID198
Feature activation-1.94e+01

Pos logit contributions
ycle+0.861
ardless+0.856
 Yen+0.854
aunder+0.848
 Paste+0.846
Neg logit contributions
 Binding-0.686
rador-0.683
 LAR-0.678
 Goo-0.646
NPR-0.645
Token10
Token ID940
Feature activation-2.82e+00

Pos logit contributions
alter+7.394
ycle+7.315
+7.150
aunder+7.010
esy+6.977
Neg logit contributions
 Binding-15.902
 LAR-15.715
rador-15.513
 Ala-15.374
 antitrust-14.947
Token.
Token ID13
Feature activation-5.54e+00

Pos logit contributions
 Yen+1.908
 Paste+1.895
ycle+1.894
ardless+1.877
aunder+1.873
Neg logit contributions
rador-1.883
 Binding-1.834
 LAR-1.815
NPR-1.750
 Ala-1.743
Token Red
Token ID2297
Feature activation-2.29e+00

Pos logit contributions
 Yen+3.939
 Paste+3.884
ardless+3.819
ycle+3.815
esy+3.751
Neg logit contributions
rador-3.509
 LAR-3.344
NPR-3.299
 Binding-3.243
 antitrust-3.170
Token Dead
Token ID5542
Feature activation-3.33e+00

Pos logit contributions
ycle+1.832
 Yen+1.814
ardless+1.806
aunder+1.793
 Paste+1.792
Neg logit contributions
 LAR-1.239
rador-1.237
 Binding-1.227
NPR-1.175
 Ala-1.159
Token Redemption
Token ID34433
Feature activation-1.02e-01

Pos logit contributions
ycle+1.791
ardless+1.784
 Yen+1.772
 Paste+1.765
aunder+1.755
Neg logit contributions
rador-2.616
 Binding-2.608
 LAR-2.607
NPR-2.571
 Ala-2.515
Token36
Token ID2623
Feature activation-2.47e+00

Pos logit contributions
ardless+4.692
 Paste+4.614
esy+4.613
ycle+4.591
 Yen+4.589
Neg logit contributions
rador-6.063
 Binding-5.932
 LAR-5.757
 Goo-5.669
 antitrust-5.647
Token203
Token ID22416
Feature activation-1.33e+00

Pos logit contributions
ycle+1.538
 Yen+1.523
ardless+1.523
aunder+1.517
esy+1.506
Neg logit contributions
 Binding-1.776
rador-1.774
 LAR-1.767
 Goo-1.695
NPR-1.694
Token x
Token ID2124
Feature activation-2.38e+00

Pos logit contributions
ycle+0.756
 Yen+0.755
ardless+0.748
aunder+0.742
 Paste+0.740
Neg logit contributions
rador-1.021
 Binding-1.021
 LAR-1.017
NPR-0.985
 Ala-0.976
Token.
Token ID13
Feature activation-1.05e+01

Pos logit contributions
ycle+1.655
 Yen+1.646
ardless+1.645
aunder+1.635
esy+1.632
Neg logit contributions
rador-1.537
 Binding-1.532
 LAR-1.515
NPR-1.456
 Goo-1.452
Tokenx
Token ID87
Feature activation-1.29e+01

Pos logit contributions
esy+4.077
ycle+3.975
ardless+3.953
erial+3.893
 Yen+3.852
Neg logit contributions
���-10.415
 incent-9.908
 Binding-9.715
rador-9.702
 LAR-9.572
Token.
Token ID13
Feature activation-1.94e+01

Pos logit contributions
esy+6.377
 Yen+6.319
 Paste+6.240
aunder+6.141
erial+6.123
Neg logit contributions
���-11.785
 incent-10.782
rador-10.147
 rall-10.053
 Binding-9.889
Tokenx
Token ID87
Feature activation-1.91e+01

Pos logit contributions
esy+5.911
alter+5.749
erial+5.690
ycle+5.635
ardless+5.402
Neg logit contributions
���-20.869
 incent-19.065
 rall-18.457
 oun-18.260
 corrid-17.703
Token.
Token ID13
Feature activation-1.89e+01

Pos logit contributions
esy+8.064
erial+7.845
aunder+7.824
alter+7.663
 Schumer+7.502
Neg logit contributions
���-17.584
 incent-16.140
 rall-15.443
 oun-15.309
 tiss-14.945
Token112
Token ID14686
Feature activation-1.09e+01

Pos logit contributions
esy+3.652
alter+3.482
ycle+3.312
erial+3.307
aunder+3.173
Neg logit contributions
���-19.607
 incent-18.981
 Binding-18.873
 LAR-18.751
 rall-18.535
Token 10
Token ID838
Feature activation-7.11e+00

Pos logit contributions
 Yen+5.592
 Paste+5.400
ardless+5.395
ycle+5.381
aunder+5.377
Neg logit contributions
rador-8.395
 Binding-8.095
 LAR-7.928
 antitrust-7.799
Reviewer-7.791
Token25
Token ID1495
Feature activation-2.67e+00

Pos logit contributions
ardless+4.079
ycle+4.009
esy+3.993
aunder+3.990
 Yen+3.985
Neg logit contributions
rador-5.377
 Binding-5.281
 LAR-5.173
 Goo-5.069
 antitrust-5.015
Token203
Token ID22416
Feature activation-1.33e+00

Pos logit contributions
ycle+1.538
 Yen+1.523
ardless+1.523
aunder+1.517
esy+1.506
Neg logit contributions
 Binding-1.776
rador-1.774
 LAR-1.767
 Goo-1.695
NPR-1.694
Token x
Token ID2124
Feature activation-2.38e+00

Pos logit contributions
ycle+0.756
 Yen+0.755
ardless+0.748
aunder+0.742
 Paste+0.740
Neg logit contributions
rador-1.021
 Binding-1.021
 LAR-1.017
NPR-0.985
 Ala-0.976
Token.
Token ID13
Feature activation-1.05e+01

Pos logit contributions
ycle+1.655
 Yen+1.646
ardless+1.645
aunder+1.635
esy+1.632
Neg logit contributions
rador-1.537
 Binding-1.532
 LAR-1.515
NPR-1.456
 Goo-1.452
Tokenx
Token ID87
Feature activation-1.29e+01

Pos logit contributions
esy+4.077
ycle+3.975
ardless+3.953
erial+3.893
 Yen+3.852
Neg logit contributions
���-10.415
 incent-9.908
 Binding-9.715
rador-9.702
 LAR-9.572
Token.
Token ID13
Feature activation-1.94e+01

Pos logit contributions
esy+6.377
 Yen+6.319
 Paste+6.240
aunder+6.141
erial+6.123
Neg logit contributions
���-11.785
 incent-10.782
rador-10.147
 rall-10.053
 Binding-9.889
Tokenx
Token ID87
Feature activation-1.91e+01

Pos logit contributions
esy+5.911
alter+5.749
erial+5.690
ycle+5.635
ardless+5.402
Neg logit contributions
���-20.869
 incent-19.065
 rall-18.457
 oun-18.260
 corrid-17.703
Token.
Token ID13
Feature activation-1.89e+01

Pos logit contributions
esy+8.064
erial+7.845
aunder+7.824
alter+7.663
 Schumer+7.502
Neg logit contributions
���-17.584
 incent-16.140
 rall-15.443
 oun-15.309
 tiss-14.945
Token112
Token ID14686
Feature activation-1.09e+01

Pos logit contributions
esy+3.652
alter+3.482
ycle+3.312
erial+3.307
aunder+3.173
Neg logit contributions
���-19.607
 incent-18.981
 Binding-18.873
 LAR-18.751
 rall-18.535
Token 10
Token ID838
Feature activation-7.11e+00

Pos logit contributions
 Yen+5.592
 Paste+5.400
ardless+5.395
ycle+5.381
aunder+5.377
Neg logit contributions
rador-8.395
 Binding-8.095
 LAR-7.928
 antitrust-7.799
Reviewer-7.791
Token25
Token ID1495
Feature activation-2.67e+00

Pos logit contributions
ardless+4.079
ycle+4.009
esy+3.993
aunder+3.990
 Yen+3.985
Neg logit contributions
rador-5.377
 Binding-5.281
 LAR-5.173
 Goo-5.069
 antitrust-5.015
Token151
Token ID24309
Feature activation-1.26e+00

Pos logit contributions
ycle+1.670
ardless+1.656
 Yen+1.651
aunder+1.646
esy+1.635
Neg logit contributions
 Binding-1.909
rador-1.906
 LAR-1.898
 Goo-1.820
NPR-1.816
Token
Token ID198
Feature activation-1.88e+01

Pos logit contributions
ycle+0.696
ardless+0.692
 Yen+0.691
aunder+0.686
 Paste+0.683
Neg logit contributions
 Binding-0.541
rador-0.540
 LAR-0.538
NPR-0.512
 Goo-0.511
Token2010
Token ID10333
Feature activation+2.10e+00

Pos logit contributions
alter+4.622
esy+4.464
ycle+4.417
 Paste+4.240
+4.216
Neg logit contributions
���-18.881
 incent-18.644
rador-18.152
 LAR-18.057
 Binding-17.818
Token Larry
Token ID13633
Feature activation-4.24e+00

Pos logit contributions
 LAR+1.254
 Binding+1.244
rador+1.214
 Goo+1.188
 Ala+1.182
Neg logit contributions
ycle-1.589
ardless-1.574
aunder-1.572
 Yen-1.542
esy-1.541
Token David
Token ID3271
Feature activation-2.80e+00

Pos logit contributions
 Yen+3.140
 Paste+3.113
ycle+3.089
ardless+3.086
aunder+3.054
Neg logit contributions
rador-2.587
 Binding-2.489
 LAR-2.481
NPR-2.429
 Ala-2.346
Token
Token ID198
Feature activation-9.38e-01

Pos logit contributions
ycle+1.973
 Yen+1.955
ardless+1.945
 Paste+1.936
aunder+1.927
Neg logit contributions
rador-1.761
 Binding-1.758
 LAR-1.757
NPR-1.684
 Goo-1.672
Token
Token ID198
Feature activation-1.91e+01

Pos logit contributions
ycle+0.704
ardless+0.700
 Yen+0.699
aunder+0.694
 Paste+0.692
Neg logit contributions
 Binding-0.553
rador-0.552
 LAR-0.550
NPR-0.523
 Goo-0.522
Token2010
Token ID10333
Feature activation+4.86e+00

Pos logit contributions
alter+5.086
ycle+4.699
esy+4.583
+4.566
 Paste+4.523
Neg logit contributions
 incent-19.006
���-18.957
rador-18.125
 LAR-17.941
 Binding-17.814
Token Matthew
Token ID9308
Feature activation-4.98e+00

Pos logit contributions
 LAR+2.928
 Binding+2.863
 Goo+2.772
 Ala+2.751
rador+2.723
Neg logit contributions
ycle-3.671
aunder-3.657
ardless-3.645
esy-3.511
 Yen-3.490
Token Morrison
Token ID24454
Feature activation-4.96e+00

Pos logit contributions
 Yen+2.523
 Paste+2.512
ycle+2.493
ardless+2.459
aunder+2.419
Neg logit contributions
rador-4.191
 Binding-4.089
 LAR-4.080
NPR-4.061
 Goo-3.930
Token
Token ID198
Feature activation+1.31e+00

Pos logit contributions
ycle+3.326
 Yen+3.305
 Paste+3.266
aunder+3.217
ardless+3.213
Neg logit contributions
 Binding-3.274
rador-3.267
 LAR-3.238
NPR-3.127
 Ala-3.082
Token
Token ID198
Feature activation-1.98e+01

Pos logit contributions
 LAR+0.715
 Binding+0.709
rador+0.696
NPR+0.677
 Ala+0.669
Neg logit contributions
ycle-1.052
ardless-1.040
aunder-1.039
 Yen-1.036
esy-1.024
Token25
Token ID1495
Feature activation-4.44e-02

Pos logit contributions
ardless+3.990
 Paste+3.929
esy+3.925
ycle+3.915
aunder+3.908
Neg logit contributions
rador-5.155
 Binding-5.061
 LAR-4.976
 Goo-4.867
 antitrust-4.822
Token290
Token ID24369
Feature activation-2.95e+00

Pos logit contributions
ycle+0.028
ardless+0.028
 Yen+0.028
aunder+0.028
esy+0.027
Neg logit contributions
 LAR-0.032
 Binding-0.032
rador-0.031
NPR-0.030
 Goo-0.030
Token x
Token ID2124
Feature activation-2.69e+00

Pos logit contributions
 Yen+1.618
ycle+1.594
 Paste+1.574
ardless+1.572
aunder+1.568
Neg logit contributions
rador-2.353
 Binding-2.332
 LAR-2.319
NPR-2.262
 Ala-2.233
Token.
Token ID13
Feature activation-9.86e+00

Pos logit contributions
ycle+1.858
 Yen+1.850
ardless+1.846
aunder+1.836
 Paste+1.834
Neg logit contributions
rador-1.754
 Binding-1.744
 LAR-1.723
NPR-1.656
 Goo-1.656
Tokenx
Token ID87
Feature activation-1.43e+01

Pos logit contributions
esy+3.883
ycle+3.838
ardless+3.797
erial+3.740
 Yen+3.728
Neg logit contributions
���-9.588
 incent-9.136
rador-9.059
 Binding-9.033
 LAR-8.925
Token.
Token ID13
Feature activation-1.91e+01

Pos logit contributions
 Yen+6.671
esy+6.653
 Paste+6.537
alter+6.460
erial+6.381
Neg logit contributions
���-13.559
 incent-12.290
 rall-11.535
rador-11.437
 tiss-11.329
Tokenx
Token ID87
Feature activation-1.29e+01

Pos logit contributions
esy+5.813
alter+5.689
erial+5.621
ycle+5.335
 Yen+5.297
Neg logit contributions
���-20.604
 incent-18.763
 rall-18.192
 oun-17.819
 corrid-17.690
Token.
Token ID13
Feature activation-1.10e+01

Pos logit contributions
 Paste+6.451
 Yen+6.441
aunder+6.439
esy+6.430
erial+6.394
Neg logit contributions
���-10.794
 incent-10.133
 Binding-9.845
rador-9.663
 Goo-9.563
Token121
Token ID19244
Feature activation-1.16e+01

Pos logit contributions
 Paste+2.293
erial+2.155
ycle+2.119
 Schumer+2.088
 Yen+2.000
Neg logit contributions
 Binding-11.943
 Goo-11.877
 LAR-11.790
rador-11.683
 Ala-11.441
Token 119
Token ID15136
Feature activation-5.13e+00

Pos logit contributions
 Yen+6.194
 Paste+5.894
ardless+5.890
 Schumer+5.836
 Cantor+5.833
Neg logit contributions
rador-8.833
 Binding-8.397
 LAR-8.280
 antitrust-8.179
Reviewer-8.148
Token47
Token ID2857
Feature activation-7.40e-01

Pos logit contributions
ardless+2.992
 Yen+2.966
 Paste+2.927
ycle+2.921
esy+2.911
Neg logit contributions
rador-3.847
 Binding-3.806
 LAR-3.761
 Goo-3.661
 antitrust-3.627
Tokengit
Token ID18300
Feature activation-9.36e+00

Pos logit contributions
rador+1.542
 LAR+1.536
 Binding+1.526
NPR+1.457
 Goo+1.443
Neg logit contributions
 Yen-2.018
ycle-2.016
ardless-1.993
aunder-1.980
 Paste-1.970
Tokenim
Token ID320
Feature activation-6.63e+00

Pos logit contributions
ardless+5.799
aunder+5.788
 Paste+5.745
esy+5.734
 Yen+5.704
Neg logit contributions
 Binding-6.462
 LAR-6.400
 Goo-6.206
rador-6.179
NPR-6.121
Tokenité
Token ID43816
Feature activation-1.07e+00

Pos logit contributions
ycle+4.223
ardless+4.204
aunder+4.182
 Paste+4.182
esy+4.146
Neg logit contributions
 Binding-4.576
 LAR-4.501
rador-4.436
NPR-4.364
 Goo-4.340
Token,
Token ID11
Feature activation-1.62e+00

Pos logit contributions
ycle+0.721
ardless+0.714
 Yen+0.713
aunder+0.709
esy+0.703
Neg logit contributions
 Binding-0.719
 LAR-0.717
rador-0.714
NPR-0.687
 Goo-0.684
Token le
Token ID443
Feature activation+3.58e+00

Pos logit contributions
ycle+1.087
ardless+1.078
 Yen+1.076
aunder+1.070
 Paste+1.061
Neg logit contributions
 Binding-1.085
 LAR-1.082
rador-1.077
NPR-1.036
 Goo-1.032
Token Command
Token ID9455
Feature activation-1.89e+01

Pos logit contributions
 Binding+2.333
 LAR+2.319
rador+2.292
 Ala+2.212
 Goo+2.196
Neg logit contributions
ycle-2.494
ardless-2.454
aunder-2.443
 Yen-2.440
esy-2.400
Tokenement
Token ID972
Feature activation-5.11e+00

Pos logit contributions
esy+7.779
erial+7.713
 Yen+7.496
 Paste+7.461
ardless+7.434
Neg logit contributions
 Binding-14.554
 LAR-14.496
 Goo-14.262
NPR-14.193
BALL-13.740
Token a
Token ID257
Feature activation-3.63e+00

Pos logit contributions
 Paste+3.200
 Yen+3.189
ycle+3.157
ardless+3.148
aunder+3.138
Neg logit contributions
 Binding-3.580
rador-3.568
 LAR-3.550
NPR-3.446
 Goo-3.413
Token ag
Token ID556
Feature activation-5.01e+00

Pos logit contributions
ycle+2.639
ardless+2.632
 Yen+2.619
 Paste+2.611
aunder+2.601
Neg logit contributions
 Binding-2.204
rador-2.188
 LAR-2.175
NPR-2.103
 Goo-2.100
Tokeni
Token ID72
Feature activation-4.81e+00

Pos logit contributions
ardless+3.552
ycle+3.552
esy+3.500
aunder+3.469
erial+3.422
Neg logit contributions
 LAR-3.172
 Binding-3.160
 Goo-3.029
rador-3.005
 Ala-2.998
Token comm
Token ID725
Feature activation-1.13e+01

Pos logit contributions
aunder+2.926
ardless+2.913
ycle+2.903
 Yen+2.859
esy+2.845
Neg logit contributions
 Binding-3.543
 LAR-3.514
rador-3.452
 Goo-3.364
NPR-3.320
Token moderator
Token ID31847
Feature activation+6.63e+00

Pos logit contributions
 LAR+3.719
 Binding+3.683
rador+3.617
 Ala+3.617
NPR+3.549
Neg logit contributions
aunder-4.181
ardless-4.137
ycle-4.095
esy-4.022
erial-3.960
Token Randy
Token ID21248
Feature activation-3.83e+00

Pos logit contributions
 LAR+4.173
 Binding+4.071
 Ala+3.963
 Goo+3.911
NPR+3.841
Neg logit contributions
ardless-4.790
ycle-4.760
aunder-4.745
esy-4.639
erial-4.594
Token Sh
Token ID911
Feature activation+1.67e+00

Pos logit contributions
 Yen+2.628
 Paste+2.580
ycle+2.552
ardless+2.549
aunder+2.516
Neg logit contributions
rador-2.590
 Binding-2.493
 LAR-2.486
NPR-2.432
 antitrust-2.336
Tokenand
Token ID392
Feature activation-6.21e+00

Pos logit contributions
rador+1.142
 LAR+1.126
 Binding+1.109
NPR+1.082
 Goo+1.066
Neg logit contributions
 Yen-1.103
 Paste-1.093
ycle-1.090
aunder-1.074
ardless-1.072
Tokenobil
Token ID25898
Feature activation+4.86e+00

Pos logit contributions
ycle+4.434
 Yen+4.385
ardless+4.359
esy+4.323
aunder+4.304
Neg logit contributions
 Binding-3.855
 LAR-3.778
rador-3.740
 Goo-3.639
 Ala-3.604
Token,
Token ID11
Feature activation+1.12e+01

Pos logit contributions
 LAR+2.860
 Binding+2.815
rador+2.805
 Ala+2.728
 Goo+2.696
Neg logit contributions
ycle-3.597
ardless-3.581
aunder-3.565
 Yen-3.507
 Paste-3.506
Token right
Token ID826
Feature activation+3.13e+00

Pos logit contributions
 LAR+6.628
 Ala+6.461
NPR+6.305
 Binding+6.276
 az+6.211
Neg logit contributions
ycle-8.021
ardless-7.999
aunder-7.966
esy-7.757
erial-7.560
Token,
Token ID11
Feature activation+1.67e+00

Pos logit contributions
 Binding+2.014
 LAR+2.014
rador+1.993
NPR+1.924
 Ala+1.921
Neg logit contributions
ycle-2.179
ardless-2.162
aunder-2.155
 Yen-2.151
 Paste-2.130
Token after
Token ID706
Feature activation+4.05e+00

Pos logit contributions
 LAR+0.941
 Binding+0.937
rador+0.927
 Ala+0.892
NPR+0.890
Neg logit contributions
ycle-1.315
ardless-1.303
aunder-1.298
 Yen-1.288
esy-1.282
Token a
Token ID257
Feature activation+1.60e+00

Pos logit contributions
 LAR+2.533
 Binding+2.509
rador+2.439
 Ala+2.399
NPR+2.372
Neg logit contributions
ycle-2.924
ardless-2.921
aunder-2.917
esy-2.868
 Yen-2.826
Token debate
Token ID4384
Feature activation-3.52e+00

Pos logit contributions
 LAR+1.067
 Binding+1.060
rador+1.044
NPR+1.019
 Goo+1.005
Neg logit contributions
ardless-1.087
ycle-1.086
aunder-1.078
 Yen-1.064
esy-1.063
Token Rivals
Token ID42307
Feature activation+3.53e+00

Pos logit contributions
 LAR+2.951
 Binding+2.893
rador+2.813
 Ala+2.794
NPR+2.720
Neg logit contributions
ardless-3.432
ycle-3.409
 Yen-3.351
aunder-3.337
 Paste-3.273
Token Scout
Token ID22529
Feature activation+8.84e+00

Pos logit contributions
 LAR+1.965
 Binding+1.936
rador+1.877
 Ala+1.854
 Goo+1.827
Neg logit contributions
ycle-2.822
ardless-2.814
aunder-2.767
 Yen-2.750
esy-2.716
Token Others
Token ID12691
Feature activation-3.66e+00

Pos logit contributions
 LAR+5.408
 Binding+5.331
 Ala+5.158
rador+5.131
 Goo+5.063
Neg logit contributions
ardless-6.386
ycle-6.319
aunder-6.255
 Yen-6.031
 Paste-5.954
Token Marvin
Token ID35105
Feature activation+3.21e+00

Pos logit contributions
 Yen+2.651
ycle+2.629
 Paste+2.620
aunder+2.616
esy+2.610
Neg logit contributions
rador-2.242
 Binding-2.192
 LAR-2.171
NPR-2.117
 Goo-2.076
Token Wilson
Token ID8127
Feature activation+7.77e+00

Pos logit contributions
 LAR+2.429
 Binding+2.420
rador+2.378
 Ala+2.332
 Goo+2.323
Neg logit contributions
ycle-1.877
ardless-1.871
aunder-1.838
 Paste-1.825
 Yen-1.825
Token DT
Token ID24311
Feature activation+1.09e+01

Pos logit contributions
 LAR+4.930
 Binding+4.716
 Ala+4.698
 Goo+4.466
rador+4.443
Neg logit contributions
ycle-5.627
ardless-5.529
aunder-5.485
 Yen-5.360
esy-5.288
Token 6
Token ID718
Feature activation-3.37e-01

Pos logit contributions
 LAR+6.958
 Ala+6.654
 Binding+6.567
 Goo+6.194
NPR+6.130
Neg logit contributions
ycle-7.593
ardless-7.439
aunder-7.371
erial-7.177
esy-7.168
Token'
Token ID6
Feature activation-2.67e+00

Pos logit contributions
ycle+0.220
ardless+0.218
 Yen+0.218
aunder+0.217
esy+0.215
Neg logit contributions
 Binding-0.232
 LAR-0.232
rador-0.232
NPR-0.223
 Goo-0.222
Token4
Token ID19
Feature activation+1.56e-01

Pos logit contributions
ycle+1.814
 Yen+1.811
ardless+1.804
aunder+1.792
 Paste+1.782
Neg logit contributions
rador-1.756
 Binding-1.746
 LAR-1.731
 Goo-1.668
NPR-1.668
Token/
Token ID14
Feature activation-4.21e+00

Pos logit contributions
 Binding+0.103
 LAR+0.103
rador+0.102
NPR+0.098
 Goo+0.098
Neg logit contributions
ycle-0.107
ardless-0.106
 Yen-0.106
aunder-0.105
esy-0.104
Token329
Token ID37967
Feature activation+4.56e+00

Pos logit contributions
ycle+2.514
ardless+2.465
 Yen+2.454
esy+2.449
aunder+2.438
Neg logit contributions
rador-3.126
 Binding-3.095
 LAR-3.069
 Goo-2.997
NPR-2.956
Token a
Token ID257
Feature activation-3.66e+00

Pos logit contributions
 Yen+1.493
ycle+1.490
 Paste+1.478
ardless+1.471
esy+1.468
Neg logit contributions
rador-1.417
 Binding-1.415
 LAR-1.407
NPR-1.362
 Ala-1.339
Token very
Token ID845
Feature activation-4.22e+00

Pos logit contributions
 Yen+2.476
 Paste+2.471
ycle+2.453
esy+2.407
erial+2.398
Neg logit contributions
rador-2.416
 Binding-2.387
 LAR-2.359
NPR-2.342
 Ala-2.287
Token good
Token ID922
Feature activation-5.51e+00

Pos logit contributions
 Yen+2.759
 Paste+2.728
esy+2.693
ycle+2.678
aunder+2.658
Neg logit contributions
rador-2.891
 Binding-2.859
 LAR-2.794
NPR-2.790
 Ala-2.729
Token knowledge
Token ID3725
Feature activation-6.44e-01

Pos logit contributions
 Yen+3.193
 Paste+3.169
esy+3.104
ycle+3.095
erial+3.072
Neg logit contributions
rador-4.108
 Binding-4.020
NPR-4.002
 LAR-3.997
 Ala-3.903
Token of
Token ID286
Feature activation+4.46e+00

Pos logit contributions
ycle+0.414
 Yen+0.411
ardless+0.410
aunder+0.407
 Paste+0.404
Neg logit contributions
 Binding-0.449
 LAR-0.448
rador-0.447
NPR-0.432
 Goo-0.429
Token ocean
Token ID9151
Feature activation+9.72e+00

Pos logit contributions
 LAR+2.571
 Binding+2.560
rador+2.500
 Goo+2.496
 Ala+2.414
Neg logit contributions
ycle-3.441
ardless-3.428
aunder-3.390
esy-3.331
 Yen-3.284
Token circulation
Token ID19133
Feature activation+4.78e+00

Pos logit contributions
 Binding+5.752
 LAR+5.735
rador+5.520
 Goo+5.475
 az+5.358
Neg logit contributions
ardless-6.910
ycle-6.834
aunder-6.745
 Yen-6.683
esy-6.643
Token"
Token ID1
Feature activation+1.83e+00

Pos logit contributions
 Binding+2.900
 LAR+2.898
rador+2.811
 Goo+2.735
 Ala+2.734
Neg logit contributions
ardless-3.540
ycle-3.539
aunder-3.487
esy-3.427
 Yen-3.410
Token
Token ID198
Feature activation-4.12e+00

Pos logit contributions
 LAR+1.032
 Binding+1.031
rador+1.019
NPR+0.978
 Goo+0.972
Neg logit contributions
ycle-1.432
ardless-1.422
aunder-1.414
 Yen-1.413
esy-1.399
Token
Token ID198
Feature activation-4.47e+00

Pos logit contributions
 Yen+2.754
 Paste+2.752
ardless+2.732
ycle+2.730
esy+2.717
Neg logit contributions
rador-2.756
 Binding-2.696
���-2.640
 LAR-2.633
 incent-2.607
TokenBut
Token ID1537
Feature activation-5.00e+00

Pos logit contributions
 Paste+2.969
esy+2.962
ycle+2.957
aunder+2.925
 Yen+2.912
Neg logit contributions
 Binding-2.987
rador-2.966
 LAR-2.964
 Ala-2.815
 Goo-2.780
TokenCanadian
Token ID28203
Feature activation-4.84e+00

Pos logit contributions
 LAR+1.791
rador+1.781
NPR+1.766
 Binding+1.757
 Goo+1.638
Neg logit contributions
ardless-3.932
ycle-3.929
 Yen-3.922
aunder-3.880
 Paste-3.861
Token Federation
Token ID11937
Feature activation+2.45e+00

Pos logit contributions
 Yen+3.369
ycle+3.365
 Paste+3.334
ardless+3.323
aunder+3.297
Neg logit contributions
rador-3.057
 LAR-3.029
 Binding-3.019
 Goo-2.908
 Ala-2.884
Token of
Token ID286
Feature activation+2.36e+00

Pos logit contributions
 LAR+1.615
 Binding+1.614
rador+1.597
NPR+1.543
 Ala+1.534
Neg logit contributions
ycle-1.679
ardless-1.675
aunder-1.665
 Yen-1.655
esy-1.647
Token Independent
Token ID13362
Feature activation+4.36e+00

Pos logit contributions
 Binding+1.654
 LAR+1.632
rador+1.596
 Goo+1.552
 Ala+1.552
Neg logit contributions
ycle-1.568
aunder-1.550
ardless-1.542
esy-1.538
 Yen-1.501
Token Business
Token ID7320
Feature activation+1.50e+00

Pos logit contributions
 Binding+2.682
 LAR+2.622
rador+2.491
 Ala+2.443
 Goo+2.433
Neg logit contributions
ycle-3.306
aunder-3.293
esy-3.280
ardless-3.273
erial-3.178
Token)
Token ID8
Feature activation+1.01e+01

Pos logit contributions
 Binding+0.966
 LAR+0.964
rador+0.956
NPR+0.917
 Ala+0.910
Neg logit contributions
ycle-1.057
ardless-1.054
aunder-1.047
 Yen-1.037
esy-1.020
Token The
Token ID383
Feature activation+2.63e+00

Pos logit contributions
 LAR+5.645
 Binding+5.487
 Ala+5.240
 Goo+5.238
rador+5.114
Neg logit contributions
ycle-7.743
ardless-7.683
aunder-7.593
esy-7.439
erial-7.263
Token provincial
Token ID15818
Feature activation+2.00e+00

Pos logit contributions
 LAR+1.644
 Binding+1.641
rador+1.618
 Goo+1.560
 Ala+1.551
Neg logit contributions
ycle-1.893
ardless-1.878
aunder-1.874
 Yen-1.856
esy-1.853
Token government
Token ID1230
Feature activation+1.00e+00

Pos logit contributions
 Binding+1.330
 LAR+1.325
rador+1.302
NPR+1.259
 Goo+1.257
Neg logit contributions
ycle-1.368
ardless-1.362
aunder-1.357
esy-1.339
 Yen-1.332
Token and
Token ID290
Feature activation+5.86e+00

Pos logit contributions
 LAR+0.669
 Binding+0.666
rador+0.661
NPR+0.636
 Ala+0.636
Neg logit contributions
ycle-0.684
ardless-0.678
aunder-0.676
 Yen-0.671
esy-0.668
Token its
Token ID663
Feature activation+7.30e+00

Pos logit contributions
 LAR+3.487
rador+3.473
 Binding+3.455
 Ala+3.320
 Goo+3.292
Neg logit contributions
ycle-4.335
ardless-4.334
aunder-4.323
 Yen-4.218
esy-4.198
Token best
Token ID1266
Feature activation+1.40e+00

Pos logit contributions
 LAR+0.004
 Binding+0.004
rador+0.004
NPR+0.003
 Goo+0.003
Neg logit contributions
ycle-0.006
ardless-0.006
 Yen-0.006
aunder-0.006
esy-0.006
Token Sa
Token ID10318
Feature activation+5.94e+00

Pos logit contributions
 Binding+0.937
 LAR+0.935
rador+0.923
 Goo+0.891
 Ala+0.888
Neg logit contributions
ycle-0.947
ardless-0.937
aunder-0.934
 Yen-0.924
esy-0.917
Tokenisons
Token ID9886
Feature activation+9.58e+00

Pos logit contributions
 Binding+2.913
rador+2.907
 LAR+2.868
NPR+2.716
 Ala+2.706
Neg logit contributions
ycle-4.933
 Yen-4.910
aunder-4.889
ardless-4.873
 Paste-4.836
Token made
Token ID925
Feature activation+3.71e+00

Pos logit contributions
 LAR+5.552
 Binding+5.504
rador+5.333
 Ala+5.277
 Goo+5.178
Neg logit contributions
ardless-6.830
ycle-6.772
aunder-6.690
esy-6.503
erial-6.453
Token in
Token ID287
Feature activation+3.41e+00

Pos logit contributions
 LAR+2.191
 Binding+2.191
rador+2.158
 Ala+2.075
 Goo+2.049
Neg logit contributions
ycle-2.770
ardless-2.752
aunder-2.733
 Yen-2.719
esy-2.690
Token Ireland
Token ID7517
Feature activation+1.05e+01

Pos logit contributions
 LAR+2.460
 Binding+2.449
rador+2.421
 Goo+2.342
 Ala+2.342
Neg logit contributions
ycle-2.115
ardless-2.115
aunder-2.101
 Yen-2.067
esy-2.038
Token.
Token ID13
Feature activation+3.56e-01

Pos logit contributions
 LAR+5.545
 Binding+5.523
rador+5.346
 Ala+5.174
 az+5.142
Neg logit contributions
aunder-7.796
ycle-7.734
ardless-7.720
 Yen-7.665
esy-7.641
Token So
Token ID1406
Feature activation+2.28e+00

Pos logit contributions
 Binding+0.224
 LAR+0.223
rador+0.219
 Goo+0.211
NPR+0.211
Neg logit contributions
ycle-0.257
ardless-0.254
aunder-0.253
 Yen-0.252
esy-0.250
Token crisp
Token ID23453
Feature activation-2.21e+00

Pos logit contributions
 LAR+1.458
 Binding+1.455
rador+1.438
 Goo+1.388
 Ala+1.382
Neg logit contributions
ycle-1.602
ardless-1.584
aunder-1.580
 Yen-1.575
esy-1.556
Token,
Token ID11
Feature activation-2.88e+00

Pos logit contributions
ycle+1.435
 Yen+1.427
ardless+1.422
esy+1.409
aunder+1.405
Neg logit contributions
 LAR-1.529
 Binding-1.529
rador-1.520
NPR-1.471
 Goo-1.460
Token refreshing
Token ID23056
Feature activation-1.24e+00

Pos logit contributions
 Yen+2.021
ycle+2.018
ardless+1.999
 Paste+1.987
esy+1.985
Neg logit contributions
 Binding-1.824
rador-1.808
 LAR-1.801
NPR-1.756
 Goo-1.727
Token the
Token ID262
Feature activation+4.04e+00

Pos logit contributions
 LAR+3.486
 Binding+3.439
rador+3.353
 Goo+3.343
 antitrust+3.262
Neg logit contributions
ardless-4.373
ycle-4.359
aunder-4.305
 Yen-4.248
 Paste-4.216
Token w
Token ID266
Feature activation+3.48e+00

Pos logit contributions
 LAR+2.702
 Binding+2.643
rador+2.590
 Goo+2.562
 Ala+2.531
Neg logit contributions
ycle-2.761
ardless-2.756
aunder-2.741
esy-2.685
erial-2.656
Tokenane
Token ID1531
Feature activation-2.42e+00

Pos logit contributions
 LAR+1.633
 Binding+1.619
rador+1.616
NPR+1.585
 Ala+1.529
Neg logit contributions
ycle-2.988
 Yen-2.961
ardless-2.955
aunder-2.945
 Paste-2.942
Token.
Token ID13
Feature activation+2.26e+00

Pos logit contributions
ycle+1.703
ardless+1.688
 Yen+1.686
aunder+1.670
esy+1.660
Neg logit contributions
 Binding-1.546
rador-1.545
 LAR-1.537
NPR-1.481
 Goo-1.465
Token They
Token ID1119
Feature activation-4.13e-01

Pos logit contributions
 LAR+1.404
 Binding+1.402
rador+1.360
 Goo+1.328
NPR+1.323
Neg logit contributions
ycle-1.660
ardless-1.633
aunder-1.630
esy-1.614
 Yen-1.611
Token are
Token ID389
Feature activation+1.34e+00

Pos logit contributions
ycle+0.277
ardless+0.275
 Yen+0.274
aunder+0.273
esy+0.270
Neg logit contributions
 Binding-0.278
 LAR-0.278
rador-0.276
NPR-0.266
 Goo-0.265
Token at
Token ID379
Feature activation+8.01e+00

Pos logit contributions
 LAR+0.855
 Binding+0.851
rador+0.844
NPR+0.810
 Goo+0.810
Neg logit contributions
ycle-0.948
ardless-0.935
aunder-0.931
 Yen-0.930
esy-0.919
Token their
Token ID511
Feature activation-4.59e+00

Pos logit contributions
 LAR+4.572
 Binding+4.336
rador+4.328
 Goo+4.216
 Ala+4.156
Neg logit contributions
ycle-6.106
ardless-5.953
aunder-5.943
esy-5.883
 Paste-5.881
Token heart
Token ID2612
Feature activation+3.03e+00

Pos logit contributions
ycle+3.654
aunder+3.645
 Yen+3.640
 Paste+3.600
esy+3.599
Neg logit contributions
 Binding-2.451
rador-2.439
 LAR-2.363
NPR-2.292
 Ala-2.280
Token a
Token ID257
Feature activation+2.10e+00

Pos logit contributions
 LAR+1.911
 Binding+1.891
rador+1.884
 Goo+1.804
NPR+1.804
Neg logit contributions
ycle-2.180
ardless-2.150
 Yen-2.135
esy-2.124
aunder-2.119
Token data
Token ID1366
Feature activation-3.29e+00

Pos logit contributions
 LAR+1.370
 Binding+1.349
rador+1.342
 Goo+1.293
NPR+1.287
Neg logit contributions
ycle-1.472
ardless-1.464
aunder-1.453
 Yen-1.444
esy-1.437
Tokenadic
Token ID23876
Feature activation+2.11e+00

Pos logit contributions
ycle+0.412
ardless+0.407
 Yen+0.405
aunder+0.403
esy+0.398
Neg logit contributions
 Binding-0.621
 LAR-0.620
rador-0.617
NPR-0.598
 Goo-0.596
Token gunfire
Token ID29276
Feature activation+4.89e-02

Pos logit contributions
 LAR+1.314
 Binding+1.310
rador+1.284
 Ala+1.246
 Goo+1.245
Neg logit contributions
ycle-1.527
aunder-1.517
ardless-1.515
 Yen-1.493
esy-1.489
Token rocked
Token ID36872
Feature activation+1.04e-01

Pos logit contributions
 LAR+0.033
 Binding+0.033
rador+0.033
NPR+0.032
 Goo+0.032
Neg logit contributions
ycle-0.033
ardless-0.032
 Yen-0.032
aunder-0.032
esy-0.032
Token the
Token ID262
Feature activation-3.70e-01

Pos logit contributions
 LAR+0.068
 Binding+0.068
rador+0.067
NPR+0.065
 Goo+0.064
Neg logit contributions
ycle-0.072
ardless-0.071
 Yen-0.071
aunder-0.071
esy-0.070
Token new
Token ID649
Feature activation+8.90e-01

Pos logit contributions
ycle+0.247
ardless+0.245
 Yen+0.245
aunder+0.243
esy+0.241
Neg logit contributions
 Binding-0.249
 LAR-0.249
rador-0.248
NPR-0.238
 Goo-0.237
Token nation
Token ID3277
Feature activation+3.33e+00

Pos logit contributions
 LAR+0.638
 Binding+0.638
rador+0.633
NPR+0.610
 Goo+0.609
Neg logit contributions
ycle-0.558
ardless-0.552
aunder-0.551
 Yen-0.550
esy-0.543
Token
Token ID447
Feature activation+1.04e+00

Pos logit contributions
 LAR+2.094
 Binding+2.074
rador+2.067
 Ala+2.003
NPR+1.995
Neg logit contributions
ycle-2.390
aunder-2.370
ardless-2.342
 Yen-2.337
esy-2.329
Token
Token ID247
Feature activation+6.61e+00

Pos logit contributions
 Binding+0.845
 LAR+0.845
rador+0.842
NPR+0.816
 Goo+0.812
Neg logit contributions
ycle-0.550
ardless-0.544
 Yen-0.543
aunder-0.539
 Paste-0.532
Tokens
Token ID82
Feature activation+2.02e+00

Pos logit contributions
 LAR+3.073
 Binding+3.058
rador+2.977
 Ala+2.948
NPR+2.871
Neg logit contributions
ycle-5.692
aunder-5.667
 Yen-5.573
 Paste-5.509
ardless-5.507
Token capital
Token ID3139
Feature activation+4.03e+00

Pos logit contributions
 LAR+1.326
 Binding+1.316
rador+1.294
 Ala+1.266
 Goo+1.256
Neg logit contributions
ycle-1.406
aunder-1.397
ardless-1.379
esy-1.364
 Yen-1.358
Token over
Token ID625
Feature activation+3.14e-01

Pos logit contributions
 LAR+2.522
 Binding+2.496
 Ala+2.431
rador+2.400
 Goo+2.367
Neg logit contributions
ycle-2.902
aunder-2.892
ardless-2.857
esy-2.803
 Yen-2.788
Token and
Token ID290
Feature activation+2.13e+00

Pos logit contributions
 Paste+5.882
 Yen+5.698
ycle+5.569
 Construct+5.446
 Schumer+5.419
Neg logit contributions
rador-6.644
 LAR-6.456
 Binding-6.428
NPR-6.273
Reviewer-6.216
Token the
Token ID262
Feature activation+1.73e+00

Pos logit contributions
 LAR+1.367
 Binding+1.363
rador+1.333
 Ala+1.298
 Goo+1.295
Neg logit contributions
ycle-1.513
ardless-1.511
aunder-1.494
esy-1.473
 Yen-1.468
Token Human
Token ID5524
Feature activation+8.88e-01

Pos logit contributions
 LAR+1.145
 Binding+1.144
rador+1.116
 Goo+1.088
 Ala+1.085
Neg logit contributions
ycle-1.194
ardless-1.187
aunder-1.177
esy-1.168
 Yen-1.160
Token Gen
Token ID5215
Feature activation+2.65e+00

Pos logit contributions
 Binding+0.607
 LAR+0.601
rador+0.596
 Goo+0.577
 Ala+0.576
Neg logit contributions
ardless-0.591
ycle-0.590
aunder-0.582
 Yen-0.580
esy-0.577
Tokenome
Token ID462
Feature activation+5.22e+00

Pos logit contributions
 Binding+2.142
 LAR+2.094
rador+2.093
 Goo+2.054
 Ala+2.034
Neg logit contributions
ardless-1.416
aunder-1.415
ycle-1.400
 Yen-1.387
erial-1.364
Token Project
Token ID4935
Feature activation-8.31e-01

Pos logit contributions
 Binding+2.783
 LAR+2.708
 Goo+2.582
rador+2.578
 Ala+2.541
Neg logit contributions
ardless-4.294
ycle-4.239
aunder-4.194
esy-4.182
 Yen-4.151
Token both
Token ID1111
Feature activation+1.79e+00

Pos logit contributions
ycle+0.557
ardless+0.551
 Yen+0.550
aunder+0.548
 Paste+0.542
Neg logit contributions
 Binding-0.561
 LAR-0.560
rador-0.557
NPR-0.536
 Goo-0.533
Token complete
Token ID1844
Feature activation+1.46e+00

Pos logit contributions
 LAR+1.175
 Binding+1.169
rador+1.161
NPR+1.118
 Goo+1.117
Neg logit contributions
ycle-1.223
ardless-1.223
 Yen-1.209
aunder-1.204
esy-1.201
Token 99
Token ID7388
Feature activation+5.48e+00

Pos logit contributions
 LAR+0.938
 Binding+0.937
rador+0.923
 Goo+0.888
 Ala+0.887
Neg logit contributions
ardless-1.033
ycle-1.033
aunder-1.019
 Yen-1.016
esy-1.012
Token%
Token ID4
Feature activation-3.89e+00

Pos logit contributions
 Binding+3.318
 LAR+3.311
rador+3.222
NPR+3.149
 Ala+3.143
Neg logit contributions
ardless-4.038
ycle-3.970
aunder-3.907
esy-3.878
 Paste-3.868
Token complete
Token ID1844
Feature activation-1.97e+00

Pos logit contributions
 Yen+2.382
ycle+2.366
 Paste+2.357
ardless+2.350
aunder+2.343
Neg logit contributions
rador-2.798
 Binding-2.770
 LAR-2.755
NPR-2.673
 Goo-2.645
Token support
Token ID1104
Feature activation-2.51e+00

Pos logit contributions
 LAR+0.856
 Binding+0.853
rador+0.841
 Ala+0.814
 Goo+0.812
Neg logit contributions
ycle-0.807
ardless-0.798
aunder-0.794
 Yen-0.790
esy-0.784
Token when
Token ID618
Feature activation-4.99e-01

Pos logit contributions
ycle+1.667
ardless+1.661
 Yen+1.652
aunder+1.640
 Paste+1.621
Neg logit contributions
 Binding-1.699
 LAR-1.692
rador-1.691
 Goo-1.633
NPR-1.630
Token criticised
Token ID23894
Feature activation-5.60e+00

Pos logit contributions
ycle+0.349
ardless+0.346
 Yen+0.346
aunder+0.344
esy+0.341
Neg logit contributions
 Binding-0.321
 LAR-0.320
rador-0.319
NPR-0.306
 Goo-0.305
Token.
Token ID13
Feature activation-7.47e-01

Pos logit contributions
ycle+3.565
 Yen+3.524
ardless+3.499
 Paste+3.459
aunder+3.447
Neg logit contributions
 Binding-3.879
rador-3.862
 LAR-3.838
 Goo-3.775
NPR-3.724
Token Second
Token ID5498
Feature activation+4.43e+00

Pos logit contributions
ycle+0.528
ardless+0.523
 Yen+0.522
aunder+0.520
esy+0.515
Neg logit contributions
 Binding-0.477
 LAR-0.477
rador-0.473
NPR-0.455
 Goo-0.453
Token,
Token ID11
Feature activation-2.59e-01

Pos logit contributions
 Binding+2.781
 LAR+2.781
rador+2.728
 Ala+2.668
 Goo+2.625
Neg logit contributions
ardless-3.131
ycle-3.116
aunder-3.114
 Yen-3.086
esy-3.077
Token a
Token ID257
Feature activation-1.19e+00

Pos logit contributions
ycle+0.181
ardless+0.180
 Yen+0.179
aunder+0.178
esy+0.177
Neg logit contributions
 Binding-0.167
 LAR-0.167
rador-0.166
NPR-0.160
 Goo-0.159
Token Europe
Token ID2031
Feature activation-4.01e+00

Pos logit contributions
ycle+0.823
 Yen+0.818
ardless+0.816
aunder+0.810
 Paste+0.806
Neg logit contributions
 Binding-0.771
 LAR-0.768
rador-0.768
NPR-0.738
 Goo-0.733
Token that
Token ID326
Feature activation-3.06e+00

Pos logit contributions
ardless+2.520
ycle+2.511
 Yen+2.505
 Paste+2.488
aunder+2.456
Neg logit contributions
 Binding-2.797
 LAR-2.789
rador-2.789
 Goo-2.692
NPR-2.688
Token is
Token ID318
Feature activation+9.50e-01

Pos logit contributions
ycle+1.889
 Yen+1.888
ardless+1.876
 Paste+1.861
aunder+1.852
Neg logit contributions
 Binding-2.190
rador-2.184
 LAR-2.168
NPR-2.106
 Goo-2.087
Token getting
Token ID1972
Feature activation-1.03e+00

Pos logit contributions
 LAR+0.598
 Binding+0.597
rador+0.590
NPR+0.568
 Goo+0.566
Neg logit contributions
ycle-0.685
aunder-0.674
ardless-0.674
 Yen-0.670
esy-0.665
Tokens
Token ID82
Feature activation-3.59e+00

Pos logit contributions
ycle+0.066
 Yen+0.065
ardless+0.065
aunder+0.065
 Paste+0.064
Neg logit contributions
 LAR-0.036
 Binding-0.036
rador-0.036
NPR-0.033
 Goo-0.033
Token actually
Token ID1682
Feature activation-6.98e+00

Pos logit contributions
ardless+2.417
ycle+2.391
 Yen+2.384
aunder+2.373
 Paste+2.359
Neg logit contributions
rador-2.380
 Binding-2.377
 LAR-2.356
NPR-2.285
 Ala-2.241
Token black
Token ID2042
Feature activation-1.15e+01

Pos logit contributions
ardless+4.526
 Yen+4.396
ycle+4.371
 Paste+4.349
aunder+4.328
Neg logit contributions
rador-4.735
 Binding-4.718
 LAR-4.652
NPR-4.543
 Ala-4.430
Token?
Token ID30
Feature activation-5.29e+00

Pos logit contributions
ardless+6.536
ycle+6.463
 Paste+6.255
erial+6.213
esy+6.157
Neg logit contributions
 Binding-8.210
rador-8.013
 LAR-7.983
NPR-7.889
 Ala-7.850
Token
Token ID447
Feature activation+1.64e+00

Pos logit contributions
ardless+3.417
 Paste+3.398
ycle+3.389
 Yen+3.377
aunder+3.324
Neg logit contributions
rador-3.616
 Binding-3.570
 LAR-3.540
 Goo-3.417
 Ala-3.385
Token
Token ID251
Feature activation+3.76e-02

Pos logit contributions
 Binding+1.140
 LAR+1.137
rador+1.130
NPR+1.089
 Goo+1.086
Neg logit contributions
ycle-1.062
ardless-1.053
 Yen-1.051
aunder-1.048
esy-1.035
Token
Token ID198
Feature activation-1.33e+00

Pos logit contributions
 Binding+0.022
 LAR+0.022
rador+0.022
NPR+0.021
 Goo+0.021
Neg logit contributions
ycle-0.028
ardless-0.028
 Yen-0.028
aunder-0.028
esy-0.028
Token
Token ID198
Feature activation-2.77e+00

Pos logit contributions
ycle+0.983
ardless+0.978
 Yen+0.978
 Paste+0.969
aunder+0.968
Neg logit contributions
rador-0.800
 Binding-0.798
 LAR-0.790
 Goo-0.754
NPR-0.750
TokenThese
Token ID4711
Feature activation-6.25e+00

Pos logit contributions
ycle+1.887
ardless+1.877
aunder+1.862
 Paste+1.860
esy+1.854
Neg logit contributions
 Binding-1.825
rador-1.824
 LAR-1.820
 Goo-1.730
 Ala-1.723
Token kinds
Token ID6982
Feature activation+1.37e+00

Pos logit contributions
 Paste+3.962
 Yen+3.955
ardless+3.934
ycle+3.924
esy+3.858
Neg logit contributions
 Binding-4.270
rador-4.266
 LAR-4.163
 Goo-4.074
 Ala-4.034
Token of
Token ID286
Feature activation-5.35e+00

Pos logit contributions
 LAR+0.902
 Binding+0.895
rador+0.884
NPR+0.860
 Ala+0.851
Neg logit contributions
ycle-0.952
ardless-0.942
aunder-0.939
 Yen-0.937
esy-0.923
Token on
Token ID319
Feature activation+6.88e-01

Pos logit contributions
ycle+0.543
ardless+0.538
 Yen+0.536
aunder+0.532
esy+0.527
Neg logit contributions
 Binding-0.716
 LAR-0.714
rador-0.712
NPR-0.687
 Goo-0.685
Token Democrats
Token ID4956
Feature activation-1.96e+00

Pos logit contributions
 LAR+0.479
 Binding+0.478
rador+0.471
NPR+0.456
 Ala+0.454
Neg logit contributions
ycle-0.449
aunder-0.445
ardless-0.444
 Yen-0.442
esy-0.436
Token by
Token ID416
Feature activation+3.55e+00

Pos logit contributions
ycle+1.333
ardless+1.324
 Yen+1.319
aunder+1.307
 Paste+1.301
Neg logit contributions
 Binding-1.300
 LAR-1.299
rador-1.297
NPR-1.246
 Goo-1.239
Token emphasizing
Token ID36360
Feature activation+5.91e-01

Pos logit contributions
 LAR+2.237
 Binding+2.230
rador+2.187
 Ala+2.127
 Goo+2.125
Neg logit contributions
ycle-2.557
aunder-2.542
ardless-2.506
 Yen-2.471
esy-2.463
Token future
Token ID2003
Feature activation-7.61e-01

Pos logit contributions
 Binding+0.393
 LAR+0.393
rador+0.388
NPR+0.374
 Goo+0.373
Neg logit contributions
ycle-0.403
ardless-0.399
aunder-0.398
 Yen-0.396
esy-0.392
Token spending
Token ID4581
Feature activation-6.69e+00

Pos logit contributions
ycle+0.516
ardless+0.511
 Yen+0.510
aunder+0.507
esy+0.503
Neg logit contributions
 Binding-0.507
 LAR-0.506
rador-0.504
NPR-0.484
 Goo-0.482
Token reductions
Token ID20691
Feature activation+4.41e+00

Pos logit contributions
 Yen+3.416
ycle+3.399
ardless+3.374
 Paste+3.371
esy+3.338
Neg logit contributions
rador-5.476
 Binding-5.363
 LAR-5.308
NPR-5.260
 Goo-5.179
Token to
Token ID284
Feature activation+1.42e+00

Pos logit contributions
 LAR+2.663
 Binding+2.660
rador+2.567
NPR+2.518
 Ala+2.512
Neg logit contributions
aunder-3.221
ycle-3.217
ardless-3.167
 Yen-3.162
esy-3.145
Token Medicare
Token ID13771
Feature activation+4.42e+00

Pos logit contributions
 LAR+0.946
 Binding+0.946
rador+0.935
NPR+0.903
 Goo+0.900
Neg logit contributions
ycle-0.966
ardless-0.960
aunder-0.957
 Yen-0.956
esy-0.942
Token that
Token ID326
Feature activation+1.07e+00

Pos logit contributions
 LAR+2.644
 Binding+2.634
rador+2.562
 Goo+2.492
NPR+2.486
Neg logit contributions
aunder-3.267
ycle-3.245
 Yen-3.236
ardless-3.207
erial-3.194
Token would
Token ID561
Feature activation+2.90e+00

Pos logit contributions
 LAR+0.749
 Binding+0.747
rador+0.738
NPR+0.716
 Goo+0.712
Neg logit contributions
ycle-0.685
ardless-0.683
aunder-0.677
 Yen-0.674
esy-0.669
Token
Token ID447
Feature activation-6.04e+00

Pos logit contributions
 LAR+1.567
rador+1.547
 Binding+1.535
NPR+1.490
 Goo+1.483
Neg logit contributions
ycle-1.849
 Yen-1.837
ardless-1.836
aunder-1.830
esy-1.807
Token
Token ID247
Feature activation+3.68e+00

Pos logit contributions
ycle+3.828
ardless+3.764
 Yen+3.683
esy+3.652
 Paste+3.632
Neg logit contributions
 LAR-4.194
 Binding-4.176
rador-4.136
NPR-4.015
 Ala-3.949
Tokenre
Token ID260
Feature activation-3.17e+00

Pos logit contributions
 LAR+2.344
rador+2.328
 Binding+2.289
NPR+2.234
 Goo+2.217
Neg logit contributions
ycle-2.555
 Yen-2.553
ardless-2.515
 Paste-2.510
aunder-2.509
Token going
Token ID1016
Feature activation-3.76e+00

Pos logit contributions
ardless+2.164
ycle+2.154
 Yen+2.137
esy+2.120
aunder+2.116
Neg logit contributions
 Binding-2.082
rador-2.064
 LAR-2.055
 Goo-1.962
NPR-1.961
Token to
Token ID284
Feature activation-1.97e+00

Pos logit contributions
ycle+2.366
ardless+2.361
aunder+2.355
 Yen+2.352
esy+2.314
Neg logit contributions
 Binding-2.648
rador-2.631
 LAR-2.623
 Goo-2.515
NPR-2.511
Token take
Token ID1011
Feature activation-2.51e+00

Pos logit contributions
ycle+1.291
 Yen+1.279
ardless+1.278
aunder+1.263
 Paste+1.261
Neg logit contributions
 Binding-1.361
rador-1.349
 LAR-1.347
NPR-1.296
 Goo-1.284
Token the
Token ID262
Feature activation-1.96e+00

Pos logit contributions
 Yen+1.646
ycle+1.646
ardless+1.640
aunder+1.620
 Paste+1.608
Neg logit contributions
 Binding-1.711
rador-1.701
 LAR-1.691
NPR-1.633
 Goo-1.616
Token first
Token ID717
Feature activation+4.64e+00

Pos logit contributions
ycle+1.474
 Yen+1.468
ardless+1.465
aunder+1.454
 Paste+1.442
Neg logit contributions
 Binding-1.153
 LAR-1.145
rador-1.143
NPR-1.095
 Goo-1.087
Token strike
Token ID5587
Feature activation+1.31e+00

Pos logit contributions
 LAR+2.878
 Binding+2.806
rador+2.805
 Ala+2.715
 Goo+2.710
Neg logit contributions
ycle-3.326
esy-3.310
aunder-3.304
ardless-3.289
 Paste-3.286
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
 LAR+0.839
 Binding+0.834
rador+0.831
NPR+0.797
 Goo+0.794
Neg logit contributions
ycle-0.918
 Yen-0.911
ardless-0.909
aunder-0.907
esy-0.899
Token
Token ID447
Feature activation-9.41e-01

Pos logit contributions
ycle+0.743
ardless+0.737
 Yen+0.736
aunder+0.732
 Paste+0.726
Neg logit contributions
 Binding-0.720
 LAR-0.716
rador-0.714
NPR-0.684
 Goo-0.682
Token sociology
Token ID39625
Feature activation+2.49e-01

Pos logit contributions
 Paste+6.047
ycle+6.018
 Yen+5.996
ardless+5.945
esy+5.936
Neg logit contributions
rador-5.816
 Binding-5.766
 LAR-5.727
NPR-5.579
 Goo-5.398
Token of
Token ID286
Feature activation-4.08e+00

Pos logit contributions
 Binding+0.167
 LAR+0.167
rador+0.166
NPR+0.160
 Goo+0.159
Neg logit contributions
ycle-0.167
ardless-0.166
 Yen-0.165
aunder-0.165
esy-0.163
Token race
Token ID3234
Feature activation-4.19e+00

Pos logit contributions
ycle+2.598
 Yen+2.582
ardless+2.565
 Paste+2.564
aunder+2.536
Neg logit contributions
rador-2.846
 Binding-2.837
 LAR-2.820
NPR-2.726
 Goo-2.697
Token and
Token ID290
Feature activation-4.91e+00

Pos logit contributions
ycle+2.665
ardless+2.642
 Yen+2.621
aunder+2.599
 Paste+2.576
Neg logit contributions
 Binding-2.943
rador-2.936
 LAR-2.928
NPR-2.806
 Goo-2.804
Token gender
Token ID5279
Feature activation-3.89e+00

Pos logit contributions
ycle+3.523
 Yen+3.513
ardless+3.507
 Paste+3.478
aunder+3.475
Neg logit contributions
rador-3.003
 Binding-2.987
 LAR-2.977
NPR-2.861
 Goo-2.838
Token to
Token ID284
Feature activation-4.51e+00

Pos logit contributions
ycle+2.528
ardless+2.512
 Yen+2.498
aunder+2.474
 Paste+2.458
Neg logit contributions
rador-2.678
 Binding-2.673
 LAR-2.662
 Goo-2.561
NPR-2.547
Token score
Token ID4776
Feature activation-5.76e+00

Pos logit contributions
ycle+2.944
 Yen+2.910
ardless+2.909
 Paste+2.908
aunder+2.900
Neg logit contributions
rador-3.056
 Binding-3.051
 LAR-3.024
NPR-2.902
 Goo-2.888
Token a
Token ID257
Feature activation-2.47e+00

Pos logit contributions
ardless+3.810
ycle+3.788
 Yen+3.768
 Paste+3.746
 Schumer+3.714
Neg logit contributions
 Binding-3.780
rador-3.772
 LAR-3.728
NPR-3.629
 Goo-3.571
Token de
Token ID390
Feature activation+2.70e+00

Pos logit contributions
ycle+1.795
 Yen+1.790
ardless+1.784
aunder+1.767
 Paste+1.765
Neg logit contributions
rador-1.512
 Binding-1.511
 LAR-1.498
NPR-1.441
 Goo-1.437
Tokencept
Token ID984
Feature activation-1.51e+00

Pos logit contributions
 LAR+1.287
 Binding+1.277
rador+1.244
 Ala+1.198
NPR+1.190
Neg logit contributions
 Yen-2.344
ycle-2.326
aunder-2.322
ardless-2.307
esy-2.293
Tokenively
Token ID2280
Feature activation-5.22e+00

Pos logit contributions
ycle+0.805
ardless+0.793
 Yen+0.790
aunder+0.787
esy+0.778
Neg logit contributions
 Binding-1.226
 LAR-1.225
rador-1.217
NPR-1.182
 Goo-1.177
Token academic
Token ID8233
Feature activation-1.40e+00

Pos logit contributions
ycle+1.300
ardless+1.292
 Yen+1.285
aunder+1.279
esy+1.269
Neg logit contributions
 Binding-1.315
 LAR-1.310
rador-1.305
NPR-1.257
 Goo-1.250
Token problems
Token ID2761
Feature activation-5.80e+00

Pos logit contributions
ycle+0.999
ardless+0.991
 Yen+0.988
aunder+0.984
esy+0.975
Neg logit contributions
 Binding-0.877
 LAR-0.873
rador-0.871
NPR-0.836
 Goo-0.832
Token such
Token ID884
Feature activation-2.57e+00

Pos logit contributions
ycle+3.828
 Yen+3.794
ardless+3.781
 Paste+3.756
aunder+3.754
Neg logit contributions
rador-3.858
 Binding-3.836
 LAR-3.759
 Goo-3.708
NPR-3.694
Token as
Token ID355
Feature activation-4.90e+00

Pos logit contributions
ycle+1.840
ardless+1.827
 Yen+1.823
esy+1.805
aunder+1.804
Neg logit contributions
rador-1.607
 Binding-1.599
 LAR-1.585
NPR-1.525
 Goo-1.521
Token racism
Token ID10713
Feature activation-6.50e+00

Pos logit contributions
ycle+3.178
 Yen+3.176
 Paste+3.173
ardless+3.158
esy+3.128
Neg logit contributions
rador-3.299
 Binding-3.288
 LAR-3.245
NPR-3.158
 Goo-3.147
Token,
Token ID11
Feature activation-7.90e+00

Pos logit contributions
ycle+4.034
ardless+4.022
aunder+3.966
 Yen+3.915
 Paste+3.866
Neg logit contributions
 Binding-4.591
rador-4.583
 LAR-4.479
 Goo-4.399
NPR-4.323
Token sexism
Token ID24953
Feature activation-5.10e+00

Pos logit contributions
 Paste+4.780
 Yen+4.753
ardless+4.691
ycle+4.645
aunder+4.616
Neg logit contributions
rador-5.620
 Binding-5.526
 LAR-5.384
 Goo-5.377
NPR-5.373
Token,
Token ID11
Feature activation-9.04e+00

Pos logit contributions
ardless+3.099
ycle+3.067
aunder+3.027
 Yen+3.013
esy+2.975
Neg logit contributions
 Binding-3.714
rador-3.681
 LAR-3.642
 Goo-3.563
NPR-3.562
Token trans
Token ID1007
Feature activation-7.60e+00

Pos logit contributions
 Paste+5.344
ardless+5.317
 Yen+5.305
ycle+5.181
aunder+5.149
Neg logit contributions
rador-6.403
 Binding-6.382
NPR-6.291
 Goo-6.187
 LAR-6.156
Tokenph
Token ID746
Feature activation-2.11e+00

Pos logit contributions
ardless+4.365
ycle+4.284
 Yen+4.261
aunder+4.232
 Paste+4.170
Neg logit contributions
 Binding-5.676
rador-5.615
 LAR-5.529
NPR-5.418
 Goo-5.381
Tokenobia
Token ID30665
Feature activation-3.77e+00

Pos logit contributions
ycle+1.658
ardless+1.653
aunder+1.635
 Yen+1.630
esy+1.625
Neg logit contributions
 Binding-1.177
 LAR-1.165
rador-1.156
 Goo-1.110
NPR-1.104
Token,
Token ID11
Feature activation-6.85e+00

Pos logit contributions
ardless+3.406
ycle+3.403
aunder+3.359
 Yen+3.333
esy+3.286
Neg logit contributions
 Binding-3.547
 LAR-3.508
rador-3.468
 Goo-3.393
NPR-3.384
Token assist
Token ID3342
Feature activation-3.52e+00

Pos logit contributions
ardless+4.438
ycle+4.420
 Yen+4.391
aunder+4.385
 Paste+4.342
Neg logit contributions
 Binding-4.610
rador-4.542
 LAR-4.501
 Goo-4.384
NPR-4.321
Token them
Token ID606
Feature activation-3.63e+00

Pos logit contributions
ycle+2.197
ardless+2.174
aunder+2.159
 Yen+2.154
esy+2.129
Neg logit contributions
 Binding-2.530
 LAR-2.505
rador-2.500
 Goo-2.415
NPR-2.412
Token and
Token ID290
Feature activation-6.60e+00

Pos logit contributions
ycle+2.345
ardless+2.339
aunder+2.325
 Yen+2.294
esy+2.261
Neg logit contributions
 Binding-2.530
 LAR-2.509
rador-2.482
 Goo-2.415
NPR-2.405
Token even
Token ID772
Feature activation-5.99e+00

Pos logit contributions
ycle+4.240
aunder+4.226
 Yen+4.209
ardless+4.156
 Paste+4.136
Neg logit contributions
 Binding-4.482
rador-4.388
 LAR-4.355
NPR-4.251
 Goo-4.219
Token
Token ID784
Feature activation-1.67e+00

Pos logit contributions
ycle+3.850
aunder+3.823
 Yen+3.817
 Paste+3.785
ardless+3.768
Neg logit contributions
 Binding-4.084
rador-4.013
 LAR-3.930
NPR-3.848
 Goo-3.813
Token as
Token ID355
Feature activation-2.11e+00

Pos logit contributions
ycle+1.156
ardless+1.148
 Yen+1.145
aunder+1.139
 Paste+1.129
Neg logit contributions
 Binding-1.084
rador-1.076
 LAR-1.074
NPR-1.031
 Goo-1.026
Token happened
Token ID3022
Feature activation-4.79e+00

Pos logit contributions
ycle+1.417
 Yen+1.407
ardless+1.405
aunder+1.394
 Paste+1.392
Neg logit contributions
rador-1.410
 Binding-1.408
 LAR-1.400
NPR-1.346
 Goo-1.333
Token last
Token ID938
Feature activation-7.77e+00

Pos logit contributions
ycle+3.198
 Yen+3.191
ardless+3.166
aunder+3.148
 Paste+3.141
Neg logit contributions
rador-3.164
 Binding-3.154
 LAR-3.110
NPR-3.014
 Goo-3.000
Token week
Token ID1285
Feature activation-3.79e+00

Pos logit contributions
ardless+4.566
 Yen+4.521
ycle+4.492
 Paste+4.448
esy+4.405
Neg logit contributions
rador-5.638
 Binding-5.609
 LAR-5.417
 Goo-5.315
NPR-5.238
Token
Token ID784
Feature activation-4.25e+00

Pos logit contributions
ycle+2.699
ardless+2.686
 Yen+2.662
aunder+2.652
 Paste+2.629
Neg logit contributions
 Binding-2.367
rador-2.352
 LAR-2.335
 Goo-2.249
NPR-2.235
Token6
Token ID21
Feature activation-1.43e+01

Pos logit contributions
 LAR+0.902
 Binding+0.896
rador+0.891
NPR+0.855
 Goo+0.851
Neg logit contributions
ycle-1.147
ardless-1.140
 Yen-1.135
aunder-1.132
esy-1.122
Token {
Token ID1391
Feature activation-3.93e+00

Pos logit contributions
 Yen+5.004
esy+4.714
 Paste+4.679
 Schumer+4.593
erial+4.585
Neg logit contributions
���-13.216
rador-13.042
NPR-12.613
 LAR-12.477
 Binding-12.458
Token(
Token ID7
Feature activation-4.60e+00

Pos logit contributions
ycle+1.950
ardless+1.929
esy+1.922
aunder+1.916
 Yen+1.915
Neg logit contributions
rador-3.313
 Binding-3.285
 LAR-3.264
 Goo-3.163
 Ala-3.154
Token6
Token ID21
Feature activation+4.99e+00

Pos logit contributions
 Yen+2.833
ycle+2.832
ardless+2.829
aunder+2.821
 Paste+2.799
Neg logit contributions
rador-3.271
 Binding-3.248
 LAR-3.211
 Goo-3.102
NPR-3.085
Tokens
Token ID82
Feature activation+1.97e+00

Pos logit contributions
 Binding+2.082
 LAR+2.070
rador+1.991
 Ala+1.917
 Goo+1.907
Neg logit contributions
ycle-4.565
ardless-4.548
 Yen-4.530
aunder-4.519
 Paste-4.483
Token)}
Token ID38165
Feature activation-1.75e+01

Pos logit contributions
 LAR+1.332
 Binding+1.329
rador+1.319
NPR+1.281
 Goo+1.268
Neg logit contributions
ycle-1.309
ardless-1.296
 Yen-1.294
aunder-1.284
 Paste-1.268
Token 38
Token ID4353
Feature activation-1.57e+00

Pos logit contributions
 Yen+6.701
esy+6.361
 Wick+6.356
 Cantor+6.279
ycle+6.136
Neg logit contributions
rador-14.780
NPR-14.750
 LAR-13.705
NBA-13.592
���-13.507
Token.
Token ID13
Feature activation+1.96e+00

Pos logit contributions
ycle+1.108
 Yen+1.102
ardless+1.097
aunder+1.091
esy+1.087
Neg logit contributions
rador-1.000
 Binding-0.997
 LAR-0.991
NPR-0.950
 Goo-0.945
Token N
Token ID399
Feature activation+2.05e+00

Pos logit contributions
 Binding+1.179
 LAR+1.172
rador+1.153
 Ala+1.121
 Goo+1.117
Neg logit contributions
ycle-1.473
ardless-1.457
aunder-1.448
 Yen-1.437
esy-1.436
Tokend
Token ID67
Feature activation-2.29e+00

Pos logit contributions
 Binding+1.448
rador+1.446
 LAR+1.440
NPR+1.395
 Ala+1.388
Neg logit contributions
ycle-1.283
 Yen-1.273
ardless-1.272
aunder-1.268
 Paste-1.258
Token4
Token ID19
Feature activation-7.62e+00

Pos logit contributions
ycle+1.613
ardless+1.600
 Yen+1.595
aunder+1.586
esy+1.579
Neg logit contributions
 LAR-1.463
 Binding-1.461
rador-1.452
NPR-1.397
 Goo-1.390
Token15
Token ID1314
Feature activation-1.72e+00

Pos logit contributions
ycle+0.672
 Yen+0.667
ardless+0.667
aunder+0.662
esy+0.656
Neg logit contributions
 Binding-0.767
 LAR-0.765
rador-0.765
NPR-0.735
 Goo-0.732
Token x
Token ID2124
Feature activation-2.06e+00

Pos logit contributions
 Yen+0.975
ycle+0.974
ardless+0.963
aunder+0.958
 Paste+0.956
Neg logit contributions
rador-1.334
 Binding-1.328
 LAR-1.325
NPR-1.285
 Ala-1.271
Token.
Token ID13
Feature activation-8.85e+00

Pos logit contributions
ycle+1.450
ardless+1.440
 Yen+1.439
aunder+1.433
esy+1.427
Neg logit contributions
rador-1.318
 Binding-1.314
 LAR-1.303
NPR-1.249
 Goo-1.247
Tokenx
Token ID87
Feature activation-1.05e+01

Pos logit contributions
esy+3.650
ycle+3.636
ardless+3.617
erial+3.526
aunder+3.518
Neg logit contributions
���-8.306
 Binding-8.013
rador-7.996
 incent-7.977
 LAR-7.926
Token.
Token ID13
Feature activation-1.47e+01

Pos logit contributions
esy+5.624
 Yen+5.622
 Paste+5.515
aunder+5.456
ardless+5.455
Neg logit contributions
���-8.923
 incent-8.275
rador-8.060
 Binding-7.839
 rall-7.624
Tokenx
Token ID87
Feature activation-1.29e+01

Pos logit contributions
esy+4.908
ycle+4.626
erial+4.609
ardless+4.592
alter+4.589
Neg logit contributions
���-15.598
 incent-14.307
 rall-13.891
 Binding-13.761
rador-13.680
Token.
Token ID13
Feature activation-1.37e+01

Pos logit contributions
esy+6.411
 Yen+6.410
aunder+6.232
 Paste+6.162
alter+6.125
Neg logit contributions
���-11.490
 incent-10.384
rador-10.072
 rall-9.831
 Binding-9.819
Token130
Token ID12952
Feature activation-7.03e+00

Pos logit contributions
alter+3.940
esy+3.901
aunder+3.773
 Yen+3.770
ycle+3.690
Neg logit contributions
���-13.712
 LAR-13.160
rador-13.083
 Binding-13.058
 Goo-12.771
Token 120
Token ID7982
Feature activation-3.68e+00

Pos logit contributions
 Yen+4.488
 Paste+4.376
ycle+4.333
ardless+4.321
aunder+4.310
Neg logit contributions
rador-4.866
 Binding-4.704
 LAR-4.638
NPR-4.556
Reviewer-4.458
Token45
Token ID2231
Feature activation-8.33e-01

Pos logit contributions
 Yen+2.228
ardless+2.227
ycle+2.217
 Paste+2.204
aunder+2.196
Neg logit contributions
rador-2.686
 Binding-2.663
 LAR-2.631
 Goo-2.539
 Ala-2.531
Token25
Token ID1495
Feature activation+2.80e-01

Pos logit contributions
ycle+0.526
ardless+0.522
 Yen+0.521
aunder+0.518
esy+0.513
Neg logit contributions
 Binding-0.592
 LAR-0.591
rador-0.589
NPR-0.567
 Goo-0.565
Token59
Token ID3270
Feature activation-4.29e+00

Pos logit contributions
ardless+3.936
ycle+3.878
esy+3.874
 Paste+3.868
 Yen+3.862
Neg logit contributions
rador-4.919
 Binding-4.836
 LAR-4.725
 Goo-4.623
 Ala-4.578
Token326
Token ID39195
Feature activation-6.17e-01

Pos logit contributions
ycle+2.608
ardless+2.592
 Yen+2.588
aunder+2.573
esy+2.568
Neg logit contributions
rador-3.122
 Binding-3.111
 LAR-3.083
 Goo-2.972
NPR-2.952
Token x
Token ID2124
Feature activation-2.68e+00

Pos logit contributions
ycle+0.364
ardless+0.360
 Yen+0.359
aunder+0.357
esy+0.353
Neg logit contributions
 Binding-0.466
 LAR-0.465
rador-0.463
NPR-0.448
 Goo-0.446
Token.
Token ID13
Feature activation-1.01e+01

Pos logit contributions
ycle+1.847
 Yen+1.839
ardless+1.836
aunder+1.826
esy+1.824
Neg logit contributions
rador-1.742
 Binding-1.733
 LAR-1.712
NPR-1.645
 Goo-1.642
Tokenx
Token ID87
Feature activation-1.42e+01

Pos logit contributions
esy+3.963
ycle+3.881
ardless+3.864
erial+3.791
 Yen+3.781
Neg logit contributions
���-9.878
 incent-9.427
rador-9.306
 Binding-9.292
 LAR-9.163
Token.
Token ID13
Feature activation-1.96e+01

Pos logit contributions
esy+6.595
 Yen+6.525
 Paste+6.454
aunder+6.341
ardless+6.302
Neg logit contributions
���-13.412
 incent-12.156
 rall-11.389
rador-11.332
 tiss-11.168
Tokenx
Token ID87
Feature activation-1.79e+01

Pos logit contributions
ardless+6.205
esy+6.205
alter+6.083
erial+6.069
ycle+6.018
Neg logit contributions
���-20.601
 incent-18.608
 rall-18.301
 oun-17.780
 Binding-17.768
Token.
Token ID13
Feature activation-1.81e+01

Pos logit contributions
ardless+7.722
aunder+7.538
alter+7.473
esy+7.351
erial+7.190
Neg logit contributions
���-16.367
 incent-14.813
 rall-14.396
 tiss-14.370
 Binding-14.131
Token116
Token ID18298
Feature activation-1.53e+01

Pos logit contributions
ardless+1.366
alter+1.298
ycle+0.793
esy+0.646
 Paste+0.630
Neg logit contributions
 LAR-21.137
 Binding-20.882
 Goo-20.551
rador-20.255
���-20.195
Token 104
Token ID14436
Feature activation-3.09e+00

Pos logit contributions
 Yen+7.160
 Paste+6.867
 Schumer+6.847
 Cantor+6.794
 Wick+6.785
Neg logit contributions
rador-11.913
 Binding-11.169
���-11.086
 antitrust-11.010
Reviewer-10.978
Token17
Token ID1558
Feature activation-3.28e+00

Pos logit contributions
ycle+1.834
 Yen+1.834
ardless+1.834
aunder+1.811
 Paste+1.804
Neg logit contributions
rador-2.282
 Binding-2.279
 LAR-2.266
 Goo-2.175
NPR-2.175
Token05
Token ID2713
Feature activation-1.84e+00

Pos logit contributions
ardless+3.885
 Yen+3.858
 Paste+3.818
esy+3.801
erial+3.772
Neg logit contributions
rador-5.446
 Binding-5.379
 LAR-5.293
 antitrust-5.163
 Goo-5.149
Token87
Token ID5774
Feature activation+9.97e-01

Pos logit contributions
ycle+1.129
 Yen+1.124
ardless+1.124
aunder+1.116
 Paste+1.108
Neg logit contributions
rador-1.332
 Binding-1.331
 LAR-1.325
 Goo-1.271
NPR-1.268
Token x
Token ID2124
Feature activation-1.54e+00

Pos logit contributions
 Binding+0.736
 LAR+0.736
rador+0.725
 Goo+0.704
NPR+0.701
Neg logit contributions
ycle-0.608
ardless-0.604
aunder-0.597
 Yen-0.593
esy-0.588
Token.
Token ID13
Feature activation-7.80e+00

Pos logit contributions
ycle+1.097
 Yen+1.089
ardless+1.089
aunder+1.083
esy+1.077
Neg logit contributions
 Binding-0.964
rador-0.963
 LAR-0.958
NPR-0.917
 Goo-0.913
Tokenx
Token ID87
Feature activation-1.07e+01

Pos logit contributions
ycle+3.372
esy+3.370
ardless+3.315
 Yen+3.280
aunder+3.249
Neg logit contributions
���-7.052
rador-6.981
 Binding-6.978
 LAR-6.926
 incent-6.884
Token.
Token ID13
Feature activation-1.44e+01

Pos logit contributions
 Yen+5.768
esy+5.746
 Paste+5.647
ycle+5.606
aunder+5.580
Neg logit contributions
���-9.058
 incent-8.432
rador-8.185
 Binding-7.981
 rall-7.771
Tokenx
Token ID87
Feature activation-1.24e+01

Pos logit contributions
esy+4.828
ycle+4.791
 Yen+4.623
ardless+4.541
erial+4.531
Neg logit contributions
���-14.970
 incent-13.991
 Binding-13.477
 rall-13.417
 LAR-13.340
Token.
Token ID13
Feature activation-2.71e+00

Pos logit contributions
 Yen+6.228
ycle+6.219
 Paste+6.216
esy+6.205
aunder+6.198
Neg logit contributions
���-10.773
 incent-10.040
rador-9.500
 Binding-9.371
 rall-9.342
Token101
Token ID8784
Feature activation-1.18e+01

Pos logit contributions
ycle+1.334
ardless+1.285
 Yen+1.282
 Paste+1.281
aunder+1.264
Neg logit contributions
 LAR-2.335
 Binding-2.314
rador-2.301
 Ala-2.225
 Goo-2.219
Token 114
Token ID17342
Feature activation-5.31e+00

Pos logit contributions
 Yen+5.841
 Schumer+5.636
 Paste+5.623
 Cantor+5.517
ardless+5.496
Neg logit contributions
rador-9.504
 Binding-8.997
 LAR-8.894
Reviewer-8.866
 antitrust-8.690
Token65
Token ID2996
Feature activation+1.09e-02

Pos logit contributions
 Yen+2.995
ardless+2.983
 Paste+2.962
ycle+2.945
esy+2.933
Neg logit contributions
rador-4.075
 Binding-4.013
 LAR-3.981
 Goo-3.835
 antitrust-3.830
Token...
Token ID2644
Feature activation-1.96e+00

Pos logit contributions
ycle+2.427
ardless+2.407
 Yen+2.385
aunder+2.365
esy+2.356
Neg logit contributions
 Binding-2.305
rador-2.281
 LAR-2.281
NPR-2.214
 Goo-2.179
Token at
Token ID379
Feature activation+5.45e+00

Pos logit contributions
ycle+1.320
ardless+1.315
 Yen+1.302
aunder+1.300
 Paste+1.290
Neg logit contributions
 Binding-1.305
rador-1.298
 LAR-1.295
NPR-1.244
 Goo-1.235
Token 4
Token ID604
Feature activation-4.33e+00

Pos logit contributions
 LAR+3.287
 Binding+3.182
rador+3.130
 Ala+3.120
 Goo+3.055
Neg logit contributions
ycle-4.050
aunder-4.000
ardless-3.979
 Yen-3.973
esy-3.959
Token a
Token ID257
Feature activation-5.31e+00

Pos logit contributions
ycle+2.796
 Yen+2.761
ardless+2.760
aunder+2.751
esy+2.726
Neg logit contributions
rador-3.013
 Binding-2.976
 LAR-2.951
 Goo-2.827
NPR-2.818
Token.
Token ID13
Feature activation-4.39e+00

Pos logit contributions
ycle+3.325
ardless+3.302
 Yen+3.287
esy+3.285
 Paste+3.276
Neg logit contributions
rador-3.716
 Binding-3.677
 LAR-3.565
���-3.527
NPR-3.522
Tokenm
Token ID76
Feature activation-1.27e+01

Pos logit contributions
ardless+2.949
ycle+2.946
aunder+2.895
esy+2.889
 Yen+2.882
Neg logit contributions
 Binding-2.942
rador-2.937
 LAR-2.913
 Goo-2.788
NPR-2.757
Token.
Token ID13
Feature activation-8.13e+00

Pos logit contributions
ycle+6.996
ardless+6.862
 Yen+6.790
aunder+6.758
esy+6.757
Neg logit contributions
rador-8.934
 Binding-8.873
 LAR-8.584
Reviewer-8.509
NPR-8.485
Token...
Token ID2644
Feature activation-8.93e-01

Pos logit contributions
ycle+5.222
 Yen+5.207
 Paste+5.160
ardless+5.140
aunder+5.048
Neg logit contributions
rador-5.468
 Binding-5.321
 LAR-5.227
NPR-5.120
Reviewer-5.091
Token after
Token ID706
Feature activation+5.30e-01

Pos logit contributions
ycle+0.623
ardless+0.618
 Yen+0.615
aunder+0.613
esy+0.608
Neg logit contributions
 Binding-0.578
 LAR-0.576
rador-0.574
NPR-0.551
 Goo-0.548
Token a
Token ID257
Feature activation-2.47e-01

Pos logit contributions
 LAR+0.340
 Binding+0.339
rador+0.336
 Ala+0.323
 Goo+0.323
Neg logit contributions
ycle-0.372
ardless-0.370
 Yen-0.368
aunder-0.368
esy-0.364
Token prim
Token ID2684
Feature activation-1.61e+00

Pos logit contributions
ycle+0.172
ardless+0.170
 Yen+0.170
aunder+0.169
esy+0.168
Neg logit contributions
 Binding-0.160
 LAR-0.160
rador-0.159
NPR-0.153
 Goo-0.153