TOP ACTIVATIONS
MAX = 13.100

what he knows about student loans I contend they would
violations committed during Wisconsin Governor Scott Walker s 2012
reduce the problem of student debt , which has an impact
Linux desktop , server , cloud or device . This community
describe their feelings about college debt in three em oj is
mean the US debt - to - G DP
our daily newsletter Upgrade your inbox and get our Daily Dispatch
on giant pools of home loans . The following year ,
at her ties to Wall Street and has questioned her judgment
On Monday afternoon Governor Rick Scott declared a state of emergency
needs to know about college debt before voting in November .
our daily newsletter Upgrade your inbox and get our Daily Dispatch
ceiling in time to ensure debt payment . White
are already facing a declining consumer base for their products .
Mn uchin , that the debt ceiling would be raised .
servic ers , and student loan servicing companies also fall within
programme , designed to stimulate lending in the economy , beyond
equality , bas hes Wall Street " It is
of Americans support Graham - Cass idy , the health care
2008 . The American people bailed these sociop aths out ,

MIN ACTIVATIONS
MIN = -12.765

in Gr ims by an Islamic Cultural Centre was fire bomb
to 2015 . Its only nuclear - powered carrier project was
s was the work of Islamic extremists or by others seeking
how he was inside an Islamic cultural centre with a young
' wing span and a fu selage about 5 ' wide
Mohammad School teaching " a Qur ' an - guided journey
Vessel s N uclear propulsion has proven technically and
the first Ghostbusters , the Fire house in Trib eca neighborhood
. A leader of the Islamic militant group Hamas said to
it houses 10 , 000 worsh ippers and ooz es confidence
Arab League and Organisation of Islamic Co - operation and Gulf
.[ 26 ] Mus ics c apes from M
ama 's kitchen , the Fire Station , Sn uf kin
the fo iling of an Islamic Jihad plan to bomb the
in his book Mu hammad , Prophet of God
In 2012 construction of a nuclear - powered deep - sea
As a ball of fire envelop ed the launch pad
Local tourist site Fort de Bre gan con ,
India ` s two prominent Islamic hel pl ines are discouraging
on Mount Sin jar from Islamic State in Iraq and Syria

INTERVAL 6.634 - 13.100
CONTAINS 0.448%

October . Photograph : Clive Brun skill / Getty Images
expectations , the Canadian housing market has just kept on rolling
player , former St . Louis Rams kicker Brett Ba er
{ ( 171 s )} 12 . Na 3 { (
especially when it is put so bluntly . And as my

INTERVAL 0.168 - 6.634
CONTAINS 44.114%

keep your road trip in the right direction stay at Hart
When he himself might his quiet us make With
been painted as a secret baby incest weird o . And
years to become a Canadian citizen has been charged with 83
size increase proposals ( such as 2 / 4 / 8

INTERVAL -6.299 - 0.168
CONTAINS 54.906%

Repl acements : 16 . Pat MacArthur
, the trend has continued . The implications are startling .
be established by Law According to Stephenson ,
regime , Gaddafi had said : Oil companies are
% of population , whose occupation structure more closely matches our

INTERVAL -12.765 - -6.299
CONTAINS 0.533%

exasper ation that as a religion professor , it is difficult
I will use Rad ical Accept ance
s a good number of kids and staff spending
long viewed itself as a bridge between the United States and
; The pin function marks the thread as active
Token what
Token ID644
Feature activation-5.16e-01

Pos logit contributions
SOURCE+0.297
ALD+0.287
+0.284
vernment+0.283
lance+0.283
Neg logit contributions
ering-0.243
 Dupl-0.222
iced-0.220
 Cox-0.213
 Hoover-0.213
Token he
Token ID339
Feature activation-1.49e+00

Pos logit contributions
SOURCE+0.464
ALD+0.456
+0.455
vernment+0.451
lance+0.449
Neg logit contributions
ering-0.331
 Dupl-0.300
iced-0.296
 Hoover-0.287
 Cox-0.287
Token knows
Token ID4206
Feature activation-3.58e+00

Pos logit contributions
SOURCE+1.140
ALD+1.107
+1.097
vernment+1.091
heads+1.084
Neg logit contributions
ering-1.164
 Dupl-1.098
iced-1.068
 Hoover-1.053
 Cox-1.045
Token about
Token ID546
Feature activation-3.49e+00

Pos logit contributions
SOURCE+2.804
ALD+2.730
vernment+2.698
heads+2.683
+2.683
Neg logit contributions
ering-2.652
 Dupl-2.482
iced-2.454
 Hoover-2.388
 Cox-2.378
Token student
Token ID3710
Feature activation+8.33e+00

Pos logit contributions
SOURCE+2.868
ALD+2.843
vernment+2.842
+2.815
heads+2.792
Neg logit contributions
ering-2.391
 Dupl-2.210
iced-2.203
 Cox-2.110
 Hoover-2.095
Token loans
Token ID10021
Feature activation+1.31e+01

Pos logit contributions
ering+3.200
 Dupl+2.703
iced+2.596
 Hoover+2.573
 Cox+2.495
Neg logit contributions
SOURCE-9.634
ALD-9.452
-9.432
vernment-9.324
lance-9.302
Token
Token ID960
Feature activation+8.27e-01

Pos logit contributions
ering+7.252
 Dupl+6.433
 Hoover+6.258
iced+6.238
 Cox+6.089
Neg logit contributions
SOURCE-11.730
ALD-11.691
ITNESS-11.601
-11.560
lance-11.357
TokenI
Token ID40
Feature activation+1.09e+00

Pos logit contributions
ering+0.639
 Dupl+0.592
iced+0.581
 Hoover+0.574
 Cox+0.570
Neg logit contributions
SOURCE-0.644
ALD-0.626
-0.621
lance-0.615
vernment-0.614
Token contend
Token ID24351
Feature activation+1.18e+00

Pos logit contributions
ering+0.834
 Dupl+0.780
iced+0.765
 Hoover+0.752
 Cox+0.744
Neg logit contributions
SOURCE-0.851
ALD-0.835
-0.828
vernment-0.824
lance-0.812
Token they
Token ID484
Feature activation-1.89e+00

Pos logit contributions
ering+0.726
 Dupl+0.659
iced+0.648
 Hoover+0.634
 Cox+0.626
Neg logit contributions
SOURCE-1.099
ALD-1.076
-1.069
vernment-1.059
lance-1.057
Token would
Token ID561
Feature activation+1.19e+00

Pos logit contributions
SOURCE+1.435
ALD+1.402
+1.390
vernment+1.388
heads+1.375
Neg logit contributions
ering-1.441
 Dupl-1.358
iced-1.330
 Hoover-1.322
 Cox-1.315
Token violations
Token ID11734
Feature activation+3.39e+00

Pos logit contributions
SOURCE+0.043
ALD+0.042
heads+0.041
vernment+0.041
lance+0.041
Neg logit contributions
ering-0.038
iced-0.035
 Dupl-0.035
 Hoover-0.034
 Cox-0.034
Token committed
Token ID5364
Feature activation+4.78e-01

Pos logit contributions
ering+2.319
 Dupl+2.119
iced+2.086
 Hoover+2.069
 Cox+2.044
Neg logit contributions
SOURCE-2.915
ALD-2.856
-2.830
vernment-2.809
lance-2.797
Token during
Token ID1141
Feature activation-2.16e+00

Pos logit contributions
ering+0.298
 Dupl+0.270
iced+0.264
 Hoover+0.259
 Cox+0.255
Neg logit contributions
SOURCE-0.442
ALD-0.432
-0.429
vernment-0.425
lance-0.424
Token Wisconsin
Token ID9279
Feature activation+2.78e+00

Pos logit contributions
SOURCE+1.931
ALD+1.894
+1.876
lance+1.858
vernment+1.854
Neg logit contributions
ering-1.383
 Dupl-1.259
iced-1.257
 swallow-1.185
 Hoover-1.183
Token Governor
Token ID10807
Feature activation+5.12e+00

Pos logit contributions
ering+2.070
 Dupl+1.909
iced+1.877
 Hoover+1.858
 Cox+1.839
Neg logit contributions
SOURCE-2.238
ALD-2.181
-2.167
vernment-2.147
lance-2.136
Token Scott
Token ID4746
Feature activation+1.29e+01

Pos logit contributions
ering+3.182
 Dupl+2.884
iced+2.815
 Hoover+2.802
 Cox+2.790
Neg logit contributions
SOURCE-4.735
ALD-4.644
-4.633
vernment-4.630
ITNESS-4.569
Token Walker
Token ID10120
Feature activation+6.44e+00

Pos logit contributions
ering+4.570
 Hoover+4.034
 Dupl+3.981
 Cox+3.910
iced+3.788
Neg logit contributions
SOURCE-14.921
-14.668
vernment-14.643
ALD-14.545
ITNESS-14.491
Token
Token ID447
Feature activation+1.33e+00

Pos logit contributions
ering+4.334
 Dupl+3.881
 Hoover+3.833
 Cox+3.809
iced+3.794
Neg logit contributions
SOURCE-5.475
lance-5.194
ITNESS-5.184
ALD-5.180
heads-5.137
Token
Token ID247
Feature activation+8.45e-01

Pos logit contributions
ering+1.236
 Dupl+1.162
iced+1.147
 Hoover+1.136
 Cox+1.127
Neg logit contributions
SOURCE-0.815
ALD-0.787
-0.781
vernment-0.772
lance-0.770
Tokens
Token ID82
Feature activation+1.78e+00

Pos logit contributions
ering+0.620
 Dupl+0.574
iced+0.562
 Hoover+0.558
 Cox+0.551
Neg logit contributions
SOURCE-0.687
ALD-0.669
-0.665
vernment-0.660
lance-0.656
Token 2012
Token ID2321
Feature activation+6.48e+00

Pos logit contributions
ering+1.187
 Dupl+1.095
iced+1.086
 Hoover+1.051
 Cox+1.044
Neg logit contributions
SOURCE-1.547
ALD-1.525
-1.503
lance-1.490
vernment-1.486
Token reduce
Token ID4646
Feature activation+1.07e-01

Pos logit contributions
SOURCE+0.104
+0.102
ALD+0.101
lance+0.100
heads+0.099
Neg logit contributions
ering-0.092
 Dupl-0.087
iced-0.086
 Hoover-0.084
 Cox-0.084
Token the
Token ID262
Feature activation+1.26e+00

Pos logit contributions
ering+0.067
iced+0.061
 Dupl+0.060
 Hoover+0.058
 Cox+0.058
Neg logit contributions
SOURCE-0.097
ALD-0.095
heads-0.094
vernment-0.094
-0.093
Token problem
Token ID1917
Feature activation+4.21e+00

Pos logit contributions
ering+0.903
iced+0.828
 Dupl+0.824
 Hoover+0.802
 Cox+0.798
Neg logit contributions
SOURCE-1.033
ALD-1.005
-0.997
vernment-0.991
heads-0.989
Token of
Token ID286
Feature activation+1.77e+00

Pos logit contributions
ering+3.052
 Dupl+2.798
iced+2.736
 Hoover+2.722
 Cox+2.685
Neg logit contributions
SOURCE-3.463
ALD-3.382
-3.370
vernment-3.329
ITNESS-3.307
Token student
Token ID3710
Feature activation+6.99e+00

Pos logit contributions
ering+1.180
 Dupl+1.065
iced+1.064
 Hoover+1.042
 Cox+1.031
Neg logit contributions
SOURCE-1.562
ALD-1.530
-1.506
vernment-1.498
lance-1.497
Token debt
Token ID5057
Feature activation+1.28e+01

Pos logit contributions
ering+4.187
 Dupl+3.799
iced+3.672
 Hoover+3.655
 Cox+3.595
Neg logit contributions
SOURCE-6.581
-6.449
ALD-6.441
vernment-6.352
lance-6.340
Token,
Token ID11
Feature activation+1.34e+00

Pos logit contributions
ering+7.820
 Dupl+6.898
 Hoover+6.755
iced+6.720
 messaging+6.635
Neg logit contributions
SOURCE-11.275
-11.175
ITNESS-11.088
ALD-11.055
-10.852
Token which
Token ID543
Feature activation+2.77e+00

Pos logit contributions
ering+0.966
 Dupl+0.892
iced+0.883
 Hoover+0.870
 Cox+0.859
Neg logit contributions
SOURCE-1.108
ALD-1.079
-1.072
vernment-1.061
heads-1.051
Token has
Token ID468
Feature activation+3.74e-01

Pos logit contributions
ering+1.974
 Dupl+1.814
iced+1.788
 Hoover+1.754
 Cox+1.737
Neg logit contributions
SOURCE-2.324
ALD-2.251
-2.228
heads-2.214
vernment-2.214
Token an
Token ID281
Feature activation+9.78e-01

Pos logit contributions
ering+0.279
 Dupl+0.260
iced+0.252
 Cox+0.247
 Hoover+0.245
Neg logit contributions
SOURCE-0.298
ALD-0.287
heads-0.286
-0.281
vernment-0.281
Token impact
Token ID2928
Feature activation+2.84e+00

Pos logit contributions
ering+0.674
 Dupl+0.625
iced+0.608
 Hoover+0.600
 Cox+0.596
Neg logit contributions
SOURCE-0.833
ALD-0.808
vernment-0.794
heads-0.791
lance-0.789
Token Linux
Token ID7020
Feature activation-4.02e-01

Pos logit contributions
ering+0.786
 Dupl+0.721
iced+0.716
 Hoover+0.706
 Cox+0.694
Neg logit contributions
SOURCE-0.823
ALD-0.799
-0.788
vernment-0.782
lance-0.781
Token desktop
Token ID11364
Feature activation+5.07e+00

Pos logit contributions
SOURCE+0.248
ALD+0.237
vernment+0.233
+0.232
heads+0.231
Neg logit contributions
ering-0.373
 Dupl-0.350
iced-0.346
 Hoover-0.346
 Cox-0.339
Token,
Token ID11
Feature activation+1.20e+00

Pos logit contributions
ering+3.546
 Dupl+3.249
iced+3.183
 Hoover+3.144
 Cox+3.124
Neg logit contributions
SOURCE-4.292
ALD-4.200
-4.187
vernment-4.148
lance-4.119
Token server
Token ID4382
Feature activation+4.62e+00

Pos logit contributions
ering+0.858
 Dupl+0.784
iced+0.773
 Hoover+0.763
 Cox+0.752
Neg logit contributions
SOURCE-1.001
ALD-0.972
-0.960
heads-0.955
vernment-0.953
Token,
Token ID11
Feature activation+2.21e+00

Pos logit contributions
ering+3.005
 Dupl+2.731
iced+2.696
 Hoover+2.649
 Cox+2.632
Neg logit contributions
SOURCE-4.119
-4.029
ALD-4.008
vernment-3.993
ITNESS-3.962
Token cloud
Token ID6279
Feature activation+1.24e+01

Pos logit contributions
ering+1.517
 Dupl+1.384
iced+1.364
 Hoover+1.346
 Cox+1.325
Neg logit contributions
SOURCE-1.903
ALD-1.857
-1.839
vernment-1.824
lance-1.821
Token or
Token ID393
Feature activation+3.98e-01

Pos logit contributions
ering+7.054
 Dupl+6.329
 messaging+6.276
iced+6.193
 Hoover+6.022
Neg logit contributions
-11.103
SOURCE-11.085
vernment-10.978
ALD-10.900
ITNESS-10.711
Token device
Token ID3335
Feature activation+5.49e+00

Pos logit contributions
ering+0.287
 Dupl+0.261
iced+0.259
 Hoover+0.254
 Cox+0.249
Neg logit contributions
SOURCE-0.328
ALD-0.318
lance-0.314
vernment-0.313
heads-0.312
Token.
Token ID13
Feature activation+2.19e-01

Pos logit contributions
ering+3.841
 Dupl+3.515
iced+3.443
 Hoover+3.406
 Cox+3.388
Neg logit contributions
SOURCE-4.625
-4.527
ALD-4.519
vernment-4.500
lance-4.450
Token This
Token ID770
Feature activation+1.40e+00

Pos logit contributions
ering+0.154
iced+0.138
 Dupl+0.137
 Hoover+0.135
 swallow+0.134
Neg logit contributions
SOURCE-0.187
ALD-0.181
heads-0.178
-0.178
lance-0.176
Token community
Token ID2055
Feature activation+4.70e-01

Pos logit contributions
ering+1.026
 Dupl+0.937
iced+0.933
 Hoover+0.915
 Cox+0.905
Neg logit contributions
SOURCE-1.151
ALD-1.113
-1.096
heads-1.095
lance-1.094
Token describe
Token ID6901
Feature activation-4.93e+00

Pos logit contributions
ering+0.622
 Dupl+0.578
iced+0.577
 Hoover+0.562
 Cox+0.558
Neg logit contributions
SOURCE-0.662
ALD-0.645
heads-0.638
vernment-0.635
lance-0.633
Token their
Token ID511
Feature activation-2.71e+00

Pos logit contributions
SOURCE+3.814
ALD+3.764
heads+3.729
lance+3.669
vernment+3.667
Neg logit contributions
ering-3.508
 Dupl-3.274
iced-3.260
 swallow-3.111
 Hoover-3.104
Token feelings
Token ID7666
Feature activation-1.19e+00

Pos logit contributions
SOURCE+2.302
ALD+2.253
heads+2.217
+2.194
lance+2.192
Neg logit contributions
ering-1.823
 Dupl-1.672
iced-1.671
imaru-1.593
 Hoover-1.582
Token about
Token ID546
Feature activation-8.68e-01

Pos logit contributions
SOURCE+0.977
ALD+0.949
+0.944
vernment+0.943
heads+0.937
Neg logit contributions
ering-0.850
 Dupl-0.789
iced-0.777
 Hoover-0.762
 Cox-0.754
Token college
Token ID4152
Feature activation+4.72e+00

Pos logit contributions
SOURCE+0.781
ALD+0.767
vernment+0.755
+0.753
heads+0.753
Neg logit contributions
ering-0.553
 Dupl-0.503
iced-0.503
 Hoover-0.481
 Cox-0.474
Token debt
Token ID5057
Feature activation+1.23e+01

Pos logit contributions
ering+3.209
 Dupl+2.937
iced+2.883
 Hoover+2.851
 Cox+2.815
Neg logit contributions
SOURCE-4.098
ALD-4.005
-3.978
vernment-3.941
lance-3.930
Token in
Token ID287
Feature activation+2.12e+00

Pos logit contributions
ering+7.205
 Dupl+6.388
 Hoover+6.283
iced+6.223
 Cox+6.193
Neg logit contributions
SOURCE-11.253
ALD-10.973
ITNESS-10.911
-10.888
-10.629
Token three
Token ID1115
Feature activation+3.10e+00

Pos logit contributions
ering+1.430
 Dupl+1.307
iced+1.294
 Hoover+1.260
 Cox+1.242
Neg logit contributions
SOURCE-1.850
ALD-1.801
-1.790
lance-1.780
vernment-1.780
Token em
Token ID795
Feature activation+3.88e+00

Pos logit contributions
ering+2.267
 Dupl+2.087
iced+2.053
 Hoover+2.030
 Cox+2.007
Neg logit contributions
SOURCE-2.527
ALD-2.465
-2.449
vernment-2.422
lance-2.416
Tokenoj
Token ID13210
Feature activation+1.32e+00

Pos logit contributions
ering+3.251
 Dupl+3.000
iced+2.977
 Hoover+2.913
 Cox+2.907
Neg logit contributions
SOURCE-2.767
ALD-2.685
-2.649
vernment-2.622
ITNESS-2.605
Tokenis
Token ID271
Feature activation-8.54e-01

Pos logit contributions
ering+0.712
 Dupl+0.653
 Hoover+0.627
iced+0.617
 Cox+0.609
Neg logit contributions
SOURCE-1.320
ALD-1.302
-1.291
lance-1.290
heads-1.287
Token mean
Token ID1612
Feature activation+4.25e+00

Pos logit contributions
ering+1.280
 Dupl+1.195
iced+1.181
 Hoover+1.157
 Cox+1.151
Neg logit contributions
SOURCE-1.399
ALD-1.363
-1.362
vernment-1.346
heads-1.338
Token
Token ID564
Feature activation+1.38e+00

Pos logit contributions
ering+2.719
 Dupl+2.471
iced+2.418
 Hoover+2.399
 Cox+2.365
Neg logit contributions
SOURCE-3.855
ALD-3.770
-3.747
vernment-3.710
lance-3.704
Token
Token ID250
Feature activation-1.16e+00

Pos logit contributions
ering+1.158
 Dupl+1.099
iced+1.068
 Hoover+1.061
 Cox+1.048
Neg logit contributions
SOURCE-0.962
-0.942
ALD-0.929
heads-0.918
vernment-0.918
Tokenthe
Token ID1169
Feature activation-5.54e-01

Pos logit contributions
SOURCE+0.916
ALD+0.890
+0.890
vernment+0.886
heads+0.884
Neg logit contributions
ering-0.857
 Dupl-0.809
iced-0.786
 Hoover-0.778
 Cox-0.772
Token US
Token ID1294
Feature activation+6.25e+00

Pos logit contributions
SOURCE+0.495
ALD+0.487
+0.484
vernment+0.482
heads+0.481
Neg logit contributions
ering-0.353
 Dupl-0.321
iced-0.319
 Cox-0.304
 Hoover-0.303
Token debt
Token ID5057
Feature activation+1.21e+01

Pos logit contributions
ering+4.341
 Dupl+3.917
iced+3.850
 Hoover+3.828
 Cox+3.764
Neg logit contributions
SOURCE-5.311
-5.182
ALD-5.178
lance-5.119
ITNESS-5.082
Token-
Token ID12
Feature activation+1.95e+00

Pos logit contributions
ering+7.742
 swallow+6.856
 Hoover+6.851
iced+6.812
 Dupl+6.751
Neg logit contributions
SOURCE-10.429
ALD-10.247
-10.234
ITNESS-10.125
lance-9.970
Tokento
Token ID1462
Feature activation+2.99e+00

Pos logit contributions
ering+1.460
 Dupl+1.383
 Cox+1.328
 Hoover+1.319
iced+1.316
Neg logit contributions
SOURCE-1.523
vernment-1.499
heads-1.494
lance-1.488
ALD-1.487
Token-
Token ID12
Feature activation+9.45e-01

Pos logit contributions
ering+2.178
 Dupl+2.021
iced+1.979
 Hoover+1.957
 Cox+1.940
Neg logit contributions
SOURCE-2.444
ALD-2.380
-2.359
vernment-2.357
heads-2.346
TokenG
Token ID38
Feature activation-4.19e+00

Pos logit contributions
ering+0.746
 Dupl+0.704
iced+0.679
 Cox+0.678
 Hoover+0.669
Neg logit contributions
SOURCE-0.715
vernment-0.697
ALD-0.695
heads-0.689
-0.685
TokenDP
Token ID6322
Feature activation+3.99e+00

Pos logit contributions
SOURCE+2.399
ALD+2.338
vernment+2.294
heads+2.232
+2.218
Neg logit contributions
ering-3.968
 Dupl-3.831
iced-3.774
 Cox-3.759
 Hoover-3.699
Token our
Token ID674
Feature activation+3.10e+00

Pos logit contributions
ering+0.468
 Dupl+0.413
iced+0.402
 Hoover+0.397
 Cox+0.389
Neg logit contributions
SOURCE-0.971
ALD-0.950
-0.944
lance-0.937
vernment-0.936
Token daily
Token ID4445
Feature activation+4.01e+00

Pos logit contributions
ering+1.507
 Dupl+1.330
iced+1.298
 Hoover+1.278
 Cox+1.243
Neg logit contributions
SOURCE-3.279
ALD-3.227
-3.209
vernment-3.184
lance-3.179
Token newsletter
Token ID13129
Feature activation+2.99e+00

Pos logit contributions
ering+2.588
iced+2.319
 Hoover+2.314
 Dupl+2.312
 Cox+2.262
Neg logit contributions
SOURCE-3.599
-3.519
ALD-3.515
vernment-3.483
ITNESS-3.464
Token Upgrade
Token ID24236
Feature activation-2.79e+00

Pos logit contributions
ering+2.406
 Dupl+2.235
iced+2.194
 Hoover+2.179
 Cox+2.159
Neg logit contributions
SOURCE-2.212
ALD-2.163
-2.146
vernment-2.122
lance-2.108
Token your
Token ID534
Feature activation+2.85e+00

Pos logit contributions
SOURCE+2.035
heads+1.972
ALD+1.957
+1.920
ITNESS+1.890
Neg logit contributions
ering-2.205
 Dupl-2.061
iced-2.058
 Hoover-2.028
 Cox-2.005
Token inbox
Token ID13734
Feature activation+1.20e+01

Pos logit contributions
ering+1.452
 Dupl+1.287
iced+1.255
 Hoover+1.232
 Cox+1.213
Neg logit contributions
SOURCE-2.959
ALD-2.899
-2.883
vernment-2.856
lance-2.854
Token and
Token ID290
Feature activation+3.11e+00

Pos logit contributions
ering+5.612
 Dupl+5.052
iced+4.904
 Hoover+4.779
 Cox+4.765
Neg logit contributions
SOURCE-12.185
ALD-12.099
-12.052
ITNESS-12.049
-11.998
Token get
Token ID651
Feature activation+1.25e+00

Pos logit contributions
ering+2.542
 Dupl+2.366
iced+2.323
 Hoover+2.301
 Cox+2.278
Neg logit contributions
SOURCE-2.272
ALD-2.199
-2.191
heads-2.162
lance-2.155
Token our
Token ID674
Feature activation+2.64e+00

Pos logit contributions
ering+0.692
iced+0.615
 Dupl+0.614
 Hoover+0.590
 Cox+0.587
Neg logit contributions
SOURCE-1.242
ALD-1.217
-1.201
heads-1.199
lance-1.193
Token Daily
Token ID6714
Feature activation+1.58e+00

Pos logit contributions
ering+1.339
 Dupl+1.186
iced+1.163
 Hoover+1.134
 Cox+1.123
Neg logit contributions
SOURCE-2.745
ALD-2.673
-2.669
vernment-2.644
heads-2.636
Token Dispatch
Token ID35934
Feature activation+5.92e-01

Pos logit contributions
ering+0.350
 Dupl+0.256
iced+0.247
 Hoover+0.225
 Cox+0.214
Neg logit contributions
SOURCE-2.087
ALD-2.053
-2.043
heads-2.037
vernment-2.033
Token on
Token ID319
Feature activation+1.70e+00

Pos logit contributions
SOURCE+0.602
ALD+0.586
+0.581
vernment+0.578
lance+0.576
Neg logit contributions
ering-0.496
 Dupl-0.459
iced-0.451
 Hoover-0.444
 Cox-0.438
Token giant
Token ID6175
Feature activation+1.88e-01

Pos logit contributions
ering+1.090
 Dupl+0.994
iced+0.979
 Hoover+0.956
 Cox+0.944
Neg logit contributions
SOURCE-1.543
ALD-1.506
-1.492
vernment-1.488
lance-1.479
Token pools
Token ID20354
Feature activation+4.05e-01

Pos logit contributions
ering+0.113
 Dupl+0.104
iced+0.103
 Hoover+0.099
 Cox+0.098
Neg logit contributions
SOURCE-0.175
heads-0.171
ALD-0.170
vernment-0.168
lance-0.168
Token of
Token ID286
Feature activation+1.99e+00

Pos logit contributions
ering+0.305
 Dupl+0.284
iced+0.278
 Hoover+0.277
 Cox+0.273
Neg logit contributions
SOURCE-0.320
ALD-0.310
-0.308
vernment-0.305
lance-0.305
Token home
Token ID1363
Feature activation+8.14e+00

Pos logit contributions
ering+1.075
 Dupl+0.962
iced+0.947
 Hoover+0.919
 Cox+0.902
Neg logit contributions
SOURCE-2.001
ALD-1.950
vernment-1.931
heads-1.928
-1.914
Token loans
Token ID10021
Feature activation+1.19e+01

Pos logit contributions
ering+4.395
 Dupl+3.814
iced+3.764
 Hoover+3.722
 Dollar+3.705
Neg logit contributions
SOURCE-8.071
ALD-7.897
-7.852
ITNESS-7.795
lance-7.763
Token.
Token ID13
Feature activation+1.62e-01

Pos logit contributions
ering+7.325
iced+6.551
 Dupl+6.528
 Hoover+6.430
 Cox+6.355
Neg logit contributions
ALD-10.489
-10.390
SOURCE-10.382
ITNESS-10.339
lance-10.141
Token The
Token ID383
Feature activation-1.21e-01

Pos logit contributions
ering+0.118
iced+0.108
 Dupl+0.106
 swallow+0.105
 messaging+0.104
Neg logit contributions
SOURCE-0.134
ALD-0.129
-0.128
heads-0.126
lance-0.126
Token following
Token ID1708
Feature activation+1.84e-01

Pos logit contributions
SOURCE+0.107
ALD+0.104
+0.103
vernment+0.103
lance+0.102
Neg logit contributions
ering-0.080
 Dupl-0.072
iced-0.072
 Hoover-0.069
 Cox-0.068
Token year
Token ID614
Feature activation-3.83e+00

Pos logit contributions
ering+0.130
 Dupl+0.120
iced+0.119
 Hoover+0.116
 Cox+0.113
Neg logit contributions
SOURCE-0.154
ALD-0.149
heads-0.147
vernment-0.147
-0.146
Token,
Token ID11
Feature activation+1.54e-02

Pos logit contributions
SOURCE+2.978
vernment+2.905
ALD+2.897
lance+2.891
+2.872
Neg logit contributions
ering-2.811
 Dupl-2.580
iced-2.571
 swallow-2.449
arget-2.444
Token at
Token ID379
Feature activation+1.78e+00

Pos logit contributions
ering+1.871
 Dupl+1.704
iced+1.671
 Hoover+1.651
 Cox+1.629
Neg logit contributions
SOURCE-2.612
ALD-2.555
-2.539
vernment-2.516
lance-2.509
Token her
Token ID607
Feature activation+1.59e+00

Pos logit contributions
ering+1.059
 Dupl+0.959
iced+0.943
 Hoover+0.920
 Cox+0.910
Neg logit contributions
SOURCE-1.685
ALD-1.651
-1.635
vernment-1.631
lance-1.622
Token ties
Token ID8470
Feature activation+2.77e+00

Pos logit contributions
ering+1.040
 Dupl+0.941
iced+0.936
 Hoover+0.918
 Cox+0.906
Neg logit contributions
SOURCE-1.423
ALD-1.398
-1.371
vernment-1.367
heads-1.358
Token to
Token ID284
Feature activation+1.29e+00

Pos logit contributions
ering+1.485
 Dupl+1.259
iced+1.249
 Hoover+1.223
 Cox+1.196
Neg logit contributions
ALD-2.796
SOURCE-2.795
-2.732
ITNESS-2.726
lance-2.705
Token Wall
Token ID5007
Feature activation+3.37e+00

Pos logit contributions
ering+0.834
 Dupl+0.762
iced+0.750
 Hoover+0.734
 Cox+0.725
Neg logit contributions
SOURCE-1.158
ALD-1.133
-1.121
vernment-1.118
lance-1.112
Token Street
Token ID3530
Feature activation+1.18e+01

Pos logit contributions
ering+2.339
 Dupl+2.152
iced+2.115
 Hoover+2.086
 Cox+2.062
Neg logit contributions
SOURCE-2.878
ALD-2.808
-2.786
vernment-2.757
lance-2.756
Token and
Token ID290
Feature activation+1.89e+00

Pos logit contributions
ering+6.826
 Dupl+6.114
 Hoover+5.920
iced+5.887
 Cox+5.839
Neg logit contributions
SOURCE-10.775
ITNESS-10.625
ALD-10.571
-10.390
lance-10.289
Token has
Token ID468
Feature activation-2.09e-01

Pos logit contributions
ering+1.273
 Dupl+1.172
iced+1.146
 Hoover+1.129
 Cox+1.118
Neg logit contributions
SOURCE-1.655
ALD-1.616
-1.599
vernment-1.593
lance-1.582
Token questioned
Token ID11434
Feature activation-2.10e-01

Pos logit contributions
SOURCE+0.175
ALD+0.170
+0.169
vernment+0.169
heads+0.168
Neg logit contributions
ering-0.147
 Dupl-0.137
iced-0.133
 Hoover-0.131
 Cox-0.129
Token her
Token ID607
Feature activation+1.12e+00

Pos logit contributions
SOURCE+0.168
ALD+0.164
+0.162
vernment+0.161
heads+0.160
Neg logit contributions
ering-0.155
 Dupl-0.144
iced-0.142
 Hoover-0.139
 Cox-0.138
Token judgment
Token ID8492
Feature activation+3.00e+00

Pos logit contributions
ering+0.773
 Dupl+0.705
iced+0.699
 Hoover+0.685
 Cox+0.682
Neg logit contributions
SOURCE-0.960
ALD-0.937
-0.924
vernment-0.919
heads-0.913
Token On
Token ID1550
Feature activation+1.23e+00

Pos logit contributions
SOURCE+0.723
ALD+0.702
+0.693
heads+0.687
ITNESS+0.684
Neg logit contributions
ering-0.611
 Dupl-0.556
iced-0.556
 swallow-0.538
 Hoover-0.537
Token Monday
Token ID3321
Feature activation-4.39e+00

Pos logit contributions
ering+0.885
 Dupl+0.820
iced+0.803
 Hoover+0.792
 Cox+0.781
Neg logit contributions
SOURCE-1.007
ALD-0.984
-0.970
vernment-0.969
heads-0.963
Token afternoon
Token ID6672
Feature activation-1.13e+00

Pos logit contributions
SOURCE+3.206
ALD+3.166
heads+3.106
vernment+3.104
lance+3.089
Neg logit contributions
ering-3.381
 Dupl-3.166
iced-3.102
 Hoover-3.023
imaru-2.996
Token Governor
Token ID10807
Feature activation+2.08e+00

Pos logit contributions
SOURCE+0.915
ALD+0.900
vernment+0.886
lance+0.886
heads+0.885
Neg logit contributions
ering-0.809
 Dupl-0.748
iced-0.738
 Hoover-0.718
 Cox-0.704
Token Rick
Token ID8759
Feature activation+1.83e+00

Pos logit contributions
ering+1.331
 Dupl+1.201
iced+1.193
 Hoover+1.165
 Cox+1.145
Neg logit contributions
SOURCE-1.885
ALD-1.844
-1.821
lance-1.811
heads-1.806
Token Scott
Token ID4746
Feature activation+1.17e+01

Pos logit contributions
ering+1.223
 Dupl+1.118
iced+1.101
 Hoover+1.084
 Cox+1.067
Neg logit contributions
SOURCE-1.606
ALD-1.573
-1.554
lance-1.542
vernment-1.542
Token declared
Token ID6875
Feature activation-6.59e-02

Pos logit contributions
ering+7.499
 Cox+6.647
iced+6.615
 Dupl+6.603
 Hoover+6.593
Neg logit contributions
SOURCE-10.183
-9.919
ITNESS-9.770
ALD-9.766
lance-9.693
Token a
Token ID257
Feature activation-8.94e-01

Pos logit contributions
SOURCE+0.059
ALD+0.058
+0.057
vernment+0.057
lance+0.057
Neg logit contributions
ering-0.042
 Dupl-0.039
iced-0.038
 Hoover-0.037
 Cox-0.037
Token state
Token ID1181
Feature activation-1.27e+00

Pos logit contributions
SOURCE+0.747
ALD+0.739
+0.727
heads+0.727
vernment+0.725
Neg logit contributions
ering-0.619
 Dupl-0.570
iced-0.564
 Hoover-0.549
 Cox-0.543
Token of
Token ID286
Feature activation+9.88e-01

Pos logit contributions
SOURCE+1.035
ALD+1.004
+1.003
vernment+0.990
lance+0.985
Neg logit contributions
ering-0.940
 Dupl-0.857
iced-0.840
 Hoover-0.834
 Cox-0.827
Token emergency
Token ID6334
Feature activation-3.45e-01

Pos logit contributions
ering+0.594
 Dupl+0.538
iced+0.527
 Hoover+0.517
 Cox+0.510
Neg logit contributions
SOURCE-0.930
ALD-0.914
-0.906
vernment-0.900
lance-0.897
Tokenneeds
Token ID50032
Feature activation+2.94e+00

Pos logit contributions
ering+0.534
 Dupl+0.504
 Hoover+0.492
iced+0.490
 Cox+0.487
Neg logit contributions
SOURCE-0.468
ALD-0.449
heads-0.447
lance-0.446
-0.441
Token to
Token ID284
Feature activation+1.23e-01

Pos logit contributions
ering+1.771
 Dupl+1.601
iced+1.568
 Hoover+1.549
 Cox+1.526
Neg logit contributions
SOURCE-2.778
ALD-2.721
-2.705
vernment-2.682
lance-2.675
Token know
Token ID760
Feature activation-1.71e+00

Pos logit contributions
ering+0.113
 Dupl+0.109
iced+0.108
 Hoover+0.105
 Cox+0.105
Neg logit contributions
SOURCE-0.075
ALD-0.072
-0.070
vernment-0.070
lance-0.070
Token about
Token ID546
Feature activation-3.17e+00

Pos logit contributions
SOURCE+1.346
ALD+1.302
+1.288
heads+1.285
vernment+1.281
Neg logit contributions
ering-1.286
 Dupl-1.191
iced-1.175
 Hoover-1.154
 Cox-1.145
Token college
Token ID4152
Feature activation+4.02e+00

Pos logit contributions
SOURCE+2.748
ALD+2.734
+2.693
vernment+2.678
heads+2.676
Neg logit contributions
ering-2.023
 Dupl-1.838
iced-1.828
 Cox-1.753
 Hoover-1.744
Token debt
Token ID5057
Feature activation+1.17e+01

Pos logit contributions
ering+2.720
 Dupl+2.484
iced+2.436
 Hoover+2.417
 Cox+2.379
Neg logit contributions
SOURCE-3.499
ALD-3.417
-3.392
vernment-3.370
lance-3.358
Token before
Token ID878
Feature activation-1.13e+00

Pos logit contributions
ering+7.378
 Dupl+6.666
 Hoover+6.580
iced+6.476
 Cox+6.356
Neg logit contributions
SOURCE-10.184
ALD-9.893
-9.799
ITNESS-9.741
lance-9.671
Token voting
Token ID6709
Feature activation+1.85e+00

Pos logit contributions
SOURCE+0.992
ALD+0.977
heads+0.965
vernment+0.955
lance+0.953
Neg logit contributions
ering-0.719
 Dupl-0.672
iced-0.670
arget-0.625
 Hoover-0.625
Token in
Token ID287
Feature activation+9.51e-01

Pos logit contributions
ering+1.237
 Dupl+1.130
iced+1.109
 Hoover+1.096
 Cox+1.081
Neg logit contributions
SOURCE-1.632
ALD-1.597
-1.585
vernment-1.572
lance-1.566
Token November
Token ID3389
Feature activation-2.33e+00

Pos logit contributions
ering+0.675
 Dupl+0.617
iced+0.610
 Hoover+0.598
 Cox+0.589
Neg logit contributions
SOURCE-0.792
ALD-0.781
-0.769
vernment-0.768
heads-0.761
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
SOURCE+1.754
ALD+1.749
+1.715
heads+1.696
vernment+1.695
Neg logit contributions
ering-1.817
 Dupl-1.680
iced-1.655
 Hoover-1.630
 swallow-1.606
Token our
Token ID674
Feature activation+2.65e+00

Pos logit contributions
SOURCE+0.347
ALD+0.338
+0.336
lance+0.335
heads+0.333
Neg logit contributions
ering-0.176
 Dupl-0.157
iced-0.152
 Hoover-0.152
 Cox-0.148
Token daily
Token ID4445
Feature activation+4.17e+00

Pos logit contributions
ering+1.307
 Dupl+1.155
iced+1.128
 Hoover+1.108
 Cox+1.085
Neg logit contributions
SOURCE-2.788
ALD-2.738
-2.722
vernment-2.702
lance-2.696
Token newsletter
Token ID13129
Feature activation+2.84e+00

Pos logit contributions
ering+2.701
 Hoover+2.434
iced+2.431
 Dupl+2.431
 Cox+2.385
Neg logit contributions
SOURCE-3.712
ALD-3.636
-3.627
vernment-3.602
ITNESS-3.581
Token Upgrade
Token ID24236
Feature activation-2.73e+00

Pos logit contributions
ering+2.248
 Dupl+2.087
iced+2.042
 Hoover+2.029
 Cox+2.016
Neg logit contributions
SOURCE-2.138
ALD-2.101
-2.082
vernment-2.062
lance-2.042
Token your
Token ID534
Feature activation+2.80e+00

Pos logit contributions
SOURCE+1.998
heads+1.934
ALD+1.934
+1.893
ITNESS+1.856
Neg logit contributions
ering-2.161
iced-2.017
 Dupl-2.016
 Hoover-1.986
 Cox-1.953
Token inbox
Token ID13734
Feature activation+1.16e+01

Pos logit contributions
ering+1.426
 Dupl+1.263
iced+1.232
 Hoover+1.210
 Cox+1.190
Neg logit contributions
SOURCE-2.913
ALD-2.856
-2.839
vernment-2.813
lance-2.811
Token and
Token ID290
Feature activation+3.10e+00

Pos logit contributions
ering+5.443
 Dupl+4.919
iced+4.753
 Hoover+4.636
 Cox+4.629
Neg logit contributions
SOURCE-11.810
ALD-11.712
-11.661
ITNESS-11.659
-11.585
Token get
Token ID651
Feature activation+1.18e+00

Pos logit contributions
ering+2.510
 Dupl+2.335
iced+2.292
 Hoover+2.272
 Cox+2.247
Neg logit contributions
SOURCE-2.288
ALD-2.223
-2.208
heads-2.175
vernment-2.174
Token our
Token ID674
Feature activation+2.86e+00

Pos logit contributions
ering+0.655
iced+0.585
 Dupl+0.582
 Hoover+0.561
 Cox+0.559
Neg logit contributions
SOURCE-1.164
ALD-1.144
-1.127
heads-1.124
lance-1.121
Token Daily
Token ID6714
Feature activation+1.72e+00

Pos logit contributions
ering+1.458
 Dupl+1.291
iced+1.266
 Hoover+1.236
 Cox+1.229
Neg logit contributions
SOURCE-2.965
ALD-2.890
-2.889
vernment-2.853
heads-2.849
Token Dispatch
Token ID35934
Feature activation+9.82e-02

Pos logit contributions
ering+0.376
 Dupl+0.273
iced+0.261
 Hoover+0.240
 Cox+0.228
Neg logit contributions
SOURCE-2.283
ALD-2.247
-2.237
heads-2.225
vernment-2.225
Token ceiling
Token ID13387
Feature activation+2.86e+00

Pos logit contributions
ering+0.455
 Dupl+0.400
iced+0.391
 Hoover+0.369
 Cox+0.369
Neg logit contributions
SOURCE-1.181
ALD-1.168
vernment-1.164
-1.156
heads-1.152
Token in
Token ID287
Feature activation+1.60e+00

Pos logit contributions
ering+1.929
 Dupl+1.764
iced+1.733
 Hoover+1.712
 Cox+1.690
Neg logit contributions
SOURCE-2.496
ALD-2.440
-2.424
vernment-2.403
lance-2.396
Token time
Token ID640
Feature activation+3.37e+00

Pos logit contributions
ering+1.103
 Dupl+1.008
iced+1.007
 Cox+0.962
 Hoover+0.961
Neg logit contributions
SOURCE-1.353
ALD-1.335
vernment-1.321
heads-1.314
lance-1.312
Token to
Token ID284
Feature activation+1.57e+00

Pos logit contributions
ering+2.070
 Dupl+1.867
iced+1.833
 Hoover+1.813
 Cox+1.790
Neg logit contributions
SOURCE-3.125
ALD-3.066
-3.036
vernment-3.001
ITNESS-2.999
Token ensure
Token ID4155
Feature activation+6.55e+00

Pos logit contributions
ering+1.119
 Dupl+1.052
iced+1.043
 Cox+0.993
 Hoover+0.981
Neg logit contributions
SOURCE-1.295
ALD-1.265
-1.254
vernment-1.249
lance-1.239
Token debt
Token ID5057
Feature activation+1.16e+01

Pos logit contributions
ering+3.982
 Dupl+3.596
 Hoover+3.514
iced+3.490
 Cox+3.442
Neg logit contributions
SOURCE-6.153
ALD-5.998
-5.993
vernment-5.902
lance-5.896
Token payment
Token ID6074
Feature activation+4.27e+00

Pos logit contributions
ering+6.953
 Hoover+6.208
 Dupl+6.173
iced+6.148
 messaging+6.017
Neg logit contributions
SOURCE-10.656
ALD-10.283
-10.266
ITNESS-10.120
lance-10.016
Token.
Token ID13
Feature activation+1.57e-01

Pos logit contributions
ering+2.958
 Dupl+2.713
iced+2.664
 Hoover+2.634
 Cox+2.602
Neg logit contributions
SOURCE-3.641
ALD-3.557
-3.533
vernment-3.502
lance-3.491
Token
Token ID198
Feature activation+3.99e-02

Pos logit contributions
ering+0.114
 Dupl+0.105
iced+0.104
 Hoover+0.101
 swallow+0.100
Neg logit contributions
SOURCE-0.128
ALD-0.125
-0.124
vernment-0.122
lance-0.122
Token
Token ID198
Feature activation+3.68e-01

Pos logit contributions
ering+0.031
 Dupl+0.029
iced+0.029
 Hoover+0.028
 swallow+0.028
Neg logit contributions
SOURCE-0.031
ALD-0.029
heads-0.029
-0.029
lance-0.029
TokenWhite
Token ID12256
Feature activation-1.58e+00

Pos logit contributions
ering+0.304
 Dupl+0.287
iced+0.280
 Hoover+0.280
 Cox+0.276
Neg logit contributions
SOURCE-0.267
ALD-0.259
-0.254
lance-0.251
heads-0.251
Token are
Token ID389
Feature activation-8.05e-02

Pos logit contributions
ering+1.487
 Dupl+1.388
iced+1.369
 Hoover+1.349
 Cox+1.331
Neg logit contributions
SOURCE-1.391
ALD-1.346
-1.332
vernment-1.324
lance-1.317
Token already
Token ID1541
Feature activation-2.20e+00

Pos logit contributions
SOURCE+0.072
heads+0.070
ALD+0.069
vernment+0.068
+0.067
Neg logit contributions
ering-0.052
 Dupl-0.049
iced-0.048
 Hoover-0.046
imaru-0.045
Token facing
Token ID6476
Feature activation-1.35e+00

Pos logit contributions
SOURCE+1.916
heads+1.874
ALD+1.822
vernment+1.785
unknown+1.765
Neg logit contributions
ering-1.413
 Dupl-1.367
iced-1.340
 Hoover-1.273
imaru-1.252
Token a
Token ID257
Feature activation+5.56e-02

Pos logit contributions
SOURCE+1.186
ALD+1.148
heads+1.147
vernment+1.142
+1.134
Neg logit contributions
ering-0.877
iced-0.815
 Dupl-0.807
 Hoover-0.775
 Cox-0.755
Token declining
Token ID18080
Feature activation+3.39e+00

Pos logit contributions
ering+0.039
iced+0.036
 Dupl+0.036
 Hoover+0.034
 Cox+0.034
Neg logit contributions
SOURCE-0.046
ALD-0.045
heads-0.045
-0.045
vernment-0.045
Token consumer
Token ID7172
Feature activation+1.14e+01

Pos logit contributions
ering+2.177
 Dupl+1.997
iced+1.960
 Hoover+1.929
 Cox+1.899
Neg logit contributions
SOURCE-3.056
ALD-2.985
-2.957
lance-2.934
vernment-2.934
Token base
Token ID2779
Feature activation+1.36e+00

Pos logit contributions
ering+6.319
 swallow+5.691
 messaging+5.595
iced+5.547
 Cox+5.515
Neg logit contributions
SOURCE-10.747
-10.701
vernment-10.577
ALD-10.531
ITNESS-10.425
Token for
Token ID329
Feature activation+6.81e-01

Pos logit contributions
ering+0.950
 Dupl+0.877
iced+0.857
 Hoover+0.849
 Cox+0.835
Neg logit contributions
SOURCE-1.150
ALD-1.116
-1.113
vernment-1.104
lance-1.101
Token their
Token ID511
Feature activation-1.93e+00

Pos logit contributions
ering+0.437
 Dupl+0.406
iced+0.392
 Hoover+0.388
 Cox+0.377
Neg logit contributions
SOURCE-0.612
ALD-0.593
heads-0.589
vernment-0.587
-0.586
Token products
Token ID3186
Feature activation+5.00e+00

Pos logit contributions
SOURCE+1.589
ALD+1.535
heads+1.532
+1.514
vernment+1.507
Neg logit contributions
ering-1.343
 Dupl-1.264
iced-1.219
 Hoover-1.209
imaru-1.200
Token.
Token ID13
Feature activation+3.40e-01

Pos logit contributions
ering+3.342
iced+3.032
 Dupl+3.016
 Cox+2.974
 Hoover+2.952
Neg logit contributions
SOURCE-4.333
ALD-4.263
-4.196
ITNESS-4.192
vernment-4.152
Token Mn
Token ID36030
Feature activation+2.36e+00

Pos logit contributions
ering+1.310
 Dupl+1.170
iced+1.144
 Hoover+1.095
 swallow+1.090
Neg logit contributions
SOURCE-2.444
ALD-2.397
-2.396
vernment-2.389
lance-2.380
Tokenuchin
Token ID43416
Feature activation+3.67e+00

Pos logit contributions
ering+1.531
 Dupl+1.392
iced+1.368
 Hoover+1.349
 Cox+1.333
Neg logit contributions
SOURCE-2.127
ALD-2.081
-2.066
vernment-2.047
lance-2.042
Token,
Token ID11
Feature activation+1.43e+00

Pos logit contributions
ering+2.448
 Dupl+2.226
 Hoover+2.186
iced+2.175
 Cox+2.158
Neg logit contributions
SOURCE-3.220
ALD-3.120
lance-3.085
-3.082
vernment-3.067
Token that
Token ID326
Feature activation+1.40e+00

Pos logit contributions
ering+0.883
 Dupl+0.800
iced+0.784
 Hoover+0.774
 Cox+0.763
Neg logit contributions
SOURCE-1.333
ALD-1.305
-1.297
vernment-1.285
lance-1.282
Token the
Token ID262
Feature activation+1.86e-01

Pos logit contributions
ering+0.850
 Dupl+0.772
iced+0.763
 Hoover+0.738
 Cox+0.733
Neg logit contributions
SOURCE-1.307
ALD-1.293
-1.276
vernment-1.272
heads-1.265
Token debt
Token ID5057
Feature activation+1.14e+01

Pos logit contributions
ering+0.132
iced+0.121
 Dupl+0.120
 Cox+0.116
 Hoover+0.115
Neg logit contributions
SOURCE-0.154
ALD-0.152
-0.150
vernment-0.149
heads-0.149
Token ceiling
Token ID13387
Feature activation+5.60e+00

Pos logit contributions
ering+6.131
 Hoover+5.336
 Dupl+5.325
iced+5.297
 messaging+5.293
Neg logit contributions
SOURCE-11.149
-10.909
ALD-10.795
ITNESS-10.744
lance-10.679
Token would
Token ID561
Feature activation+2.62e+00

Pos logit contributions
ering+3.966
 Dupl+3.618
iced+3.538
 Hoover+3.523
 Cox+3.481
Neg logit contributions
SOURCE-4.720
ALD-4.587
-4.576
vernment-4.510
lance-4.505
Token be
Token ID307
Feature activation+3.33e-01

Pos logit contributions
ering+1.853
 Dupl+1.733
iced+1.700
 Hoover+1.668
 Cox+1.659
Neg logit contributions
SOURCE-2.162
ALD-2.120
-2.116
vernment-2.086
lance-2.085
Token raised
Token ID4376
Feature activation+2.01e-01

Pos logit contributions
ering+0.228
 Dupl+0.217
iced+0.209
 Cox+0.205
 Hoover+0.203
Neg logit contributions
SOURCE-0.276
ALD-0.274
heads-0.273
-0.270
lance-0.267
Token.
Token ID13
Feature activation+1.97e-01

Pos logit contributions
ering+0.135
 Dupl+0.125
iced+0.122
 Hoover+0.119
 Cox+0.118
Neg logit contributions
SOURCE-0.173
ALD-0.171
-0.170
heads-0.169
vernment-0.168
Token servic
Token ID37756
Feature activation+8.84e+00

Pos logit contributions
ering+4.706
 Dupl+4.344
 Hoover+4.149
 Cox+4.123
iced+4.116
Neg logit contributions
SOURCE-7.500
ALD-7.393
-7.364
ITNESS-7.201
vernment-7.179
Tokeners
Token ID364
Feature activation+1.58e+00

Pos logit contributions
ering+6.169
iced+5.540
 Dupl+5.492
 Hoover+5.370
 Cox+5.329
Neg logit contributions
SOURCE-7.614
-7.451
ALD-7.367
ITNESS-7.234
vernment-7.174
Token,
Token ID11
Feature activation+2.80e-01

Pos logit contributions
ering+1.050
 Dupl+0.959
iced+0.936
 Hoover+0.927
 Cox+0.922
Neg logit contributions
SOURCE-1.394
ALD-1.366
-1.343
lance-1.334
ITNESS-1.334
Token and
Token ID290
Feature activation+3.90e-01

Pos logit contributions
ering+0.176
 Dupl+0.160
iced+0.159
 Hoover+0.154
 Cox+0.150
Neg logit contributions
SOURCE-0.256
vernment-0.248
ALD-0.248
heads-0.246
-0.245
Token student
Token ID3710
Feature activation+7.95e+00

Pos logit contributions
ering+0.235
 Dupl+0.212
iced+0.211
 Hoover+0.203
 Cox+0.198
Neg logit contributions
SOURCE-0.366
vernment-0.355
ALD-0.354
lance-0.352
heads-0.351
Token loan
Token ID8063
Feature activation+1.13e+01

Pos logit contributions
ering+3.612
 Dupl+3.217
iced+3.058
 Hoover+3.039
 Cox+3.005
Neg logit contributions
SOURCE-8.520
ALD-8.420
-8.379
lance-8.286
ITNESS-8.261
Token servicing
Token ID43346
Feature activation+4.54e+00

Pos logit contributions
ering+6.479
 Dupl+5.988
 Hoover+5.662
 swallow+5.643
 Cox+5.614
Neg logit contributions
-10.386
SOURCE-10.365
ALD-10.311
lance-10.143
ITNESS-10.063
Token companies
Token ID2706
Feature activation+2.22e+00

Pos logit contributions
ering+2.039
 Dupl+1.782
iced+1.725
 Hoover+1.696
 Cox+1.662
Neg logit contributions
SOURCE-4.986
ALD-4.898
-4.876
vernment-4.834
lance-4.824
Token also
Token ID635
Feature activation-1.74e+00

Pos logit contributions
ering+1.655
 Dupl+1.526
iced+1.504
 Hoover+1.485
 Cox+1.468
Neg logit contributions
SOURCE-1.775
ALD-1.728
-1.716
vernment-1.700
lance-1.697
Token fall
Token ID2121
Feature activation+2.05e+00

Pos logit contributions
SOURCE+1.338
heads+1.293
ALD+1.287
lance+1.286
vernment+1.278
Neg logit contributions
ering-1.310
 Dupl-1.228
iced-1.199
 Hoover-1.182
 Cox-1.168
Token within
Token ID1626
Feature activation+8.57e-01

Pos logit contributions
ering+1.387
 Dupl+1.269
iced+1.246
 Hoover+1.231
 Cox+1.216
Neg logit contributions
SOURCE-1.783
ALD-1.742
-1.731
vernment-1.715
lance-1.710
Token programme
Token ID11383
Feature activation+2.52e+00

Pos logit contributions
ering+1.262
 Dupl+1.190
iced+1.174
 Hoover+1.153
 Cox+1.140
Neg logit contributions
SOURCE-1.033
ALD-1.004
-1.000
lance-0.994
heads-0.992
Token,
Token ID11
Feature activation-1.33e+00

Pos logit contributions
ering+1.731
 Dupl+1.587
iced+1.558
 Hoover+1.540
 Cox+1.520
Neg logit contributions
SOURCE-2.166
ALD-2.115
-2.102
vernment-2.083
lance-2.077
Token designed
Token ID3562
Feature activation+7.97e-02

Pos logit contributions
SOURCE+0.986
heads+0.938
ALD+0.938
vernment+0.934
lance+0.919
Neg logit contributions
ering-1.047
 Dupl-0.981
iced-0.973
 Hoover-0.947
 Cox-0.928
Token to
Token ID284
Feature activation-6.08e-01

Pos logit contributions
ering+0.047
 Dupl+0.043
iced+0.042
 Hoover+0.041
 Cox+0.041
Neg logit contributions
SOURCE-0.076
ALD-0.074
-0.074
vernment-0.073
lance-0.073
Token stimulate
Token ID26020
Feature activation+1.41e+00

Pos logit contributions
SOURCE+0.448
vernment+0.434
+0.433
heads+0.426
ALD+0.425
Neg logit contributions
ering-0.473
 Dupl-0.451
iced-0.444
 Cox-0.431
 Hoover-0.429
Token lending
Token ID20983
Feature activation+1.13e+01

Pos logit contributions
ering+0.819
 Dupl+0.747
iced+0.738
 Hoover+0.702
 Cox+0.702
Neg logit contributions
SOURCE-1.329
ALD-1.305
vernment-1.301
-1.296
heads-1.294
Token in
Token ID287
Feature activation+2.70e+00

Pos logit contributions
ering+6.529
 Dupl+5.778
 Hoover+5.722
 Cox+5.688
iced+5.644
Neg logit contributions
SOURCE-10.501
ALD-10.452
-10.305
ITNESS-10.300
vernment-10.183
Token the
Token ID262
Feature activation+1.79e+00

Pos logit contributions
ering+1.750
 Dupl+1.604
iced+1.575
 Hoover+1.543
 Cox+1.511
Neg logit contributions
SOURCE-2.425
ALD-2.360
-2.348
vernment-2.344
lance-2.331
Token economy
Token ID3773
Feature activation+3.95e+00

Pos logit contributions
ering+1.190
 Dupl+1.094
iced+1.073
 Hoover+1.041
 Cox+1.026
Neg logit contributions
SOURCE-1.571
-1.533
ALD-1.524
vernment-1.514
lance-1.509
Token,
Token ID11
Feature activation-3.56e-02

Pos logit contributions
ering+2.668
 Dupl+2.417
iced+2.388
 Hoover+2.361
 Cox+2.343
Neg logit contributions
SOURCE-3.452
ALD-3.393
-3.332
ITNESS-3.319
lance-3.298
Token beyond
Token ID3675
Feature activation-7.45e-01

Pos logit contributions
SOURCE+0.030
ALD+0.029
vernment+0.029
heads+0.029
lance+0.029
Neg logit contributions
ering-0.024
 Dupl-0.023
iced-0.022
 Hoover-0.022
 Cox-0.021
Token equality
Token ID10537
Feature activation-9.11e-02

Pos logit contributions
ering+1.995
 Dupl+1.802
iced+1.773
 Hoover+1.739
 Cox+1.718
Neg logit contributions
SOURCE-3.061
ALD-2.990
-2.972
vernment-2.945
heads-2.934
Token,
Token ID11
Feature activation+8.98e-02

Pos logit contributions
SOURCE+0.076
ALD+0.074
+0.073
heads+0.073
lance+0.073
Neg logit contributions
ering-0.064
 Dupl-0.059
iced-0.058
 Hoover-0.057
 Cox-0.056
Token bas
Token ID1615
Feature activation+1.57e+00

Pos logit contributions
ering+0.060
 Dupl+0.055
iced+0.055
 Hoover+0.053
 Cox+0.052
Neg logit contributions
SOURCE-0.078
ALD-0.075
vernment-0.075
heads-0.075
-0.074
Tokenhes
Token ID956
Feature activation+3.89e+00

Pos logit contributions
ering+1.206
 Dupl+1.119
iced+1.099
 Hoover+1.090
 Cox+1.078
Neg logit contributions
SOURCE-1.222
ALD-1.191
-1.182
vernment-1.172
lance-1.168
Token Wall
Token ID5007
Feature activation+3.20e+00

Pos logit contributions
ering+2.325
 Dupl+2.099
iced+2.063
 Hoover+2.023
 Cox+1.997
Neg logit contributions
SOURCE-3.696
ALD-3.618
-3.590
vernment-3.565
lance-3.557
Token Street
Token ID3530
Feature activation+1.13e+01

Pos logit contributions
ering+2.219
 Dupl+2.044
iced+2.007
 Hoover+1.980
 Cox+1.960
Neg logit contributions
SOURCE-2.731
ALD-2.663
-2.640
lance-2.618
vernment-2.616
Token
Token ID198
Feature activation+1.61e+00

Pos logit contributions
ering+6.918
 Dupl+6.259
 Cox+6.015
 Hoover+6.006
iced+5.964
Neg logit contributions
SOURCE-9.840
-9.798
ALD-9.737
ITNESS-9.705
vernment-9.657
Token
Token ID198
Feature activation+1.07e+00

Pos logit contributions
ering+1.208
 Dupl+1.121
iced+1.111
 Hoover+1.098
 swallow+1.078
Neg logit contributions
SOURCE-1.276
ALD-1.234
heads-1.219
-1.218
lance-1.214
Token"
Token ID1
Feature activation+1.46e+00

Pos logit contributions
ering+0.889
 Dupl+0.828
iced+0.816
 Hoover+0.809
 Cox+0.800
Neg logit contributions
SOURCE-0.773
ALD-0.751
-0.743
vernment-0.733
lance-0.733
TokenIt
Token ID1026
Feature activation-1.72e-03

Pos logit contributions
ering+1.252
 Dupl+1.167
iced+1.159
 Hoover+1.145
 swallow+1.141
Neg logit contributions
SOURCE-1.021
ALD-0.991
-0.972
lance-0.963
heads-0.959
Token is
Token ID318
Feature activation+1.47e+00

Pos logit contributions
SOURCE+0.001
ALD+0.001
+0.001
heads+0.001
vernment+0.001
Neg logit contributions
ering-0.001
 Dupl-0.001
iced-0.001
 Hoover-0.001
 Cox-0.001
Token of
Token ID286
Feature activation+2.51e+00

Pos logit contributions
ering+2.866
 Dupl+2.592
iced+2.566
 Hoover+2.551
 Cox+2.520
Neg logit contributions
SOURCE-3.124
ALD-3.054
-3.039
lance-2.982
vernment-2.975
Token Americans
Token ID3399
Feature activation+4.15e+00

Pos logit contributions
ering+1.449
 Dupl+1.312
iced+1.283
 Hoover+1.247
 Cox+1.231
Neg logit contributions
SOURCE-2.429
ALD-2.383
-2.358
vernment-2.350
heads-2.336
Token support
Token ID1104
Feature activation+5.96e-01

Pos logit contributions
ering+3.165
 Dupl+2.913
iced+2.869
 Hoover+2.850
 Cox+2.817
Neg logit contributions
SOURCE-3.231
ALD-3.156
-3.138
lance-3.086
vernment-3.085
Token Graham
Token ID11520
Feature activation+4.77e+00

Pos logit contributions
ering+0.382
 Dupl+0.351
iced+0.347
 Hoover+0.337
 Cox+0.330
Neg logit contributions
SOURCE-0.536
ALD-0.523
-0.519
vernment-0.517
heads-0.512
Token-
Token ID12
Feature activation+1.52e+00

Pos logit contributions
ering+3.560
 Dupl+3.273
iced+3.224
 Hoover+3.189
 Cox+3.170
Neg logit contributions
SOURCE-3.812
ALD-3.714
-3.688
lance-3.654
vernment-3.649
TokenCass
Token ID43529
Feature activation+1.12e+01

Pos logit contributions
ering+0.815
 Dupl+0.749
iced+0.709
 Hoover+0.702
 swallow+0.692
Neg logit contributions
SOURCE-1.531
ALD-1.518
lance-1.482
-1.481
heads-1.480
Tokenidy
Token ID19325
Feature activation+5.52e+00

Pos logit contributions
ering+5.431
 Dupl+4.809
iced+4.758
 Hoover+4.606
 Cox+4.602
Neg logit contributions
SOURCE-11.457
vernment-11.332
lance-11.176
-11.169
heads-11.137
Token,
Token ID11
Feature activation+1.09e+00

Pos logit contributions
ering+3.778
 Dupl+3.422
iced+3.358
 Hoover+3.328
 Cox+3.297
Neg logit contributions
SOURCE-4.758
ALD-4.629
lance-4.578
-4.568
ITNESS-4.559
Token the
Token ID262
Feature activation+6.07e-01

Pos logit contributions
ering+0.755
 Dupl+0.702
iced+0.694
 Hoover+0.675
 Cox+0.663
Neg logit contributions
SOURCE-0.927
ALD-0.900
vernment-0.885
-0.882
heads-0.877
Token health
Token ID1535
Feature activation+8.54e-01

Pos logit contributions
ering+0.385
iced+0.351
 Dupl+0.349
 Hoover+0.332
 Cox+0.331
Neg logit contributions
SOURCE-0.543
ALD-0.532
-0.531
vernment-0.529
heads-0.524
Token care
Token ID1337
Feature activation+6.33e+00

Pos logit contributions
ering+0.535
 Dupl+0.492
iced+0.482
 Hoover+0.471
 Cox+0.468
Neg logit contributions
SOURCE-0.780
ALD-0.762
-0.757
vernment-0.753
heads-0.750
Token 2008
Token ID3648
Feature activation+1.91e+00

Pos logit contributions
ering+1.491
 Dupl+1.359
iced+1.345
 Hoover+1.303
 Cox+1.292
Neg logit contributions
SOURCE-2.036
ALD-1.990
-1.976
vernment-1.964
lance-1.960
Token.
Token ID13
Feature activation+8.00e-01

Pos logit contributions
ering+1.323
 Dupl+1.213
iced+1.191
 Hoover+1.179
 Cox+1.164
Neg logit contributions
SOURCE-1.628
ALD-1.591
-1.580
vernment-1.565
lance-1.560
Token The
Token ID383
Feature activation+5.90e-01

Pos logit contributions
ering+0.594
iced+0.548
 Dupl+0.531
 swallow+0.531
 messaging+0.524
Neg logit contributions
SOURCE-0.646
ALD-0.627
-0.618
heads-0.614
lance-0.608
Token American
Token ID1605
Feature activation+6.64e+00

Pos logit contributions
ering+0.385
 Dupl+0.351
iced+0.346
 Cox+0.334
 Hoover+0.334
Neg logit contributions
SOURCE-0.521
ALD-0.511
-0.502
lance-0.501
heads-0.501
Token people
Token ID661
Feature activation+3.49e+00

Pos logit contributions
ering+4.256
 Dupl+3.850
iced+3.780
 Hoover+3.748
 Cox+3.662
Neg logit contributions
SOURCE-6.010
-5.878
ALD-5.845
vernment-5.773
lance-5.752
Token bailed
Token ID47739
Feature activation+1.11e+01

Pos logit contributions
ering+2.717
 Dupl+2.515
iced+2.475
 Hoover+2.452
 Cox+2.425
Neg logit contributions
SOURCE-2.683
ALD-2.614
-2.594
vernment-2.567
lance-2.559
Token these
Token ID777
Feature activation+1.63e+00

Pos logit contributions
ering+6.085
 Hoover+5.404
 Dupl+5.290
 Cox+5.238
iced+5.120
Neg logit contributions
SOURCE-10.751
-10.722
ITNESS-10.590
ALD-10.556
lance-10.458
Token sociop
Token ID41221
Feature activation+3.55e+00

Pos logit contributions
ering+1.001
 Dupl+0.909
iced+0.895
 Hoover+0.865
 Cox+0.858
Neg logit contributions
SOURCE-1.504
ALD-1.478
heads-1.456
vernment-1.453
-1.442
Tokenaths
Token ID33148
Feature activation+2.61e+00

Pos logit contributions
ering+3.200
 Dupl+2.979
iced+2.958
 Hoover+2.917
 Cox+2.892
Neg logit contributions
SOURCE-2.313
ALD-2.225
-2.224
vernment-2.177
lance-2.177
Token out
Token ID503
Feature activation+2.15e+00

Pos logit contributions
ering+1.519
 Dupl+1.383
iced+1.347
 Hoover+1.314
 Cox+1.308
Neg logit contributions
SOURCE-2.511
ALD-2.462
-2.443
vernment-2.434
heads-2.423
Token,
Token ID11
Feature activation-3.41e-01

Pos logit contributions
ering+1.530
 Dupl+1.405
iced+1.380
 Hoover+1.366
 Cox+1.350
Neg logit contributions
SOURCE-1.792
ALD-1.750
-1.738
vernment-1.721
lance-1.716
Token in
Token ID287
Feature activation-8.16e-01

Pos logit contributions
SOURCE+0.416
ALD+0.407
+0.403
heads+0.402
vernment+0.400
Neg logit contributions
ering-0.377
 Dupl-0.348
iced-0.340
 Hoover-0.336
 Cox-0.330
Token Gr
Token ID1902
Feature activation-4.09e+00

Pos logit contributions
SOURCE+0.687
ALD+0.675
+0.670
vernment+0.669
lance+0.668
Neg logit contributions
ering-0.559
 Dupl-0.511
iced-0.504
 Hoover-0.493
 swallow-0.486
Tokenims
Token ID12078
Feature activation+2.85e+00

Pos logit contributions
SOURCE+2.973
heads+2.949
ALD+2.905
lance+2.904
unknown+2.899
Neg logit contributions
ering-3.132
 Dupl-3.015
 Hoover-2.962
 messaging-2.864
iced-2.843
Tokenby
Token ID1525
Feature activation-1.61e+00

Pos logit contributions
ering+1.848
 Dupl+1.680
iced+1.652
 Hoover+1.631
 Cox+1.614
Neg logit contributions
SOURCE-2.563
ALD-2.507
-2.492
vernment-2.467
lance-2.461
Token an
Token ID281
Feature activation+1.28e+00

Pos logit contributions
SOURCE+1.343
+1.328
ALD+1.328
heads+1.314
vernment+1.299
Neg logit contributions
ering-1.102
 Dupl-1.015
iced-0.997
 Hoover-0.985
 Cox-0.950
Token Islamic
Token ID5533
Feature activation-1.28e+01

Pos logit contributions
ering+0.960
 Dupl+0.888
iced+0.872
 Hoover+0.863
 Cox+0.853
Neg logit contributions
SOURCE-1.011
ALD-0.987
-0.977
vernment-0.969
lance-0.965
Token Cultural
Token ID23897
Feature activation+7.33e-01

Pos logit contributions
SOURCE+8.765
heads+8.756
ALD+8.411
vernment+8.387
lance+8.358
Neg logit contributions
ering-8.841
iced-8.393
arget-8.182
 Dupl-8.133
imaru-7.968
Token Centre
Token ID9072
Feature activation-3.43e+00

Pos logit contributions
ering+0.553
 Dupl+0.509
iced+0.507
 Hoover+0.492
 Cox+0.487
Neg logit contributions
SOURCE-0.573
ALD-0.562
-0.555
heads-0.552
lance-0.550
Token was
Token ID373
Feature activation+5.79e-01

Pos logit contributions
SOURCE+2.614
ALD+2.590
heads+2.559
+2.537
lance+2.503
Neg logit contributions
ering-2.549
 Dupl-2.393
iced-2.349
 Hoover-2.317
imaru-2.256
Token fire
Token ID2046
Feature activation-8.63e+00

Pos logit contributions
ering+0.433
 Dupl+0.402
iced+0.391
 Hoover+0.388
 Cox+0.383
Neg logit contributions
SOURCE-0.460
ALD-0.450
-0.443
heads-0.441
vernment-0.438
Tokenbomb
Token ID27657
Feature activation-1.15e+00

Pos logit contributions
heads+2.983
+2.940
SOURCE+2.901
lance+2.861
ALD+2.859
Neg logit contributions
ering-9.620
 Dupl-9.331
 Dollar-9.186
 Hoover-9.157
 Cox-9.124
Token to
Token ID284
Feature activation-7.91e-01

Pos logit contributions
SOURCE+2.229
ALD+2.172
+2.165
lance+2.155
heads+2.152
Neg logit contributions
ering-1.650
 Dupl-1.531
iced-1.481
 Hoover-1.464
 Cox-1.455
Token 2015
Token ID1853
Feature activation-1.18e+00

Pos logit contributions
SOURCE+0.610
ALD+0.585
+0.584
heads+0.579
vernment+0.573
Neg logit contributions
ering-0.606
iced-0.568
 Dupl-0.564
 Hoover-0.544
 Cox-0.544
Token.
Token ID13
Feature activation-5.13e-01

Pos logit contributions
SOURCE+0.952
ALD+0.925
+0.921
vernment+0.914
lance+0.909
Neg logit contributions
ering-0.862
 Dupl-0.798
iced-0.786
 Hoover-0.771
 Cox-0.764
Token Its
Token ID6363
Feature activation-3.69e-01

Pos logit contributions
SOURCE+0.418
ALD+0.405
+0.401
vernment+0.397
heads+0.397
Neg logit contributions
ering-0.375
 Dupl-0.344
iced-0.342
 Hoover-0.334
 Cox-0.331
Token only
Token ID691
Feature activation+6.47e-01

Pos logit contributions
SOURCE+0.305
ALD+0.297
+0.293
heads+0.292
lance+0.290
Neg logit contributions
ering-0.264
 Dupl-0.242
iced-0.242
 Hoover-0.233
 Cox-0.231
Token nuclear
Token ID4523
Feature activation-1.25e+01

Pos logit contributions
ering+0.487
 Dupl+0.450
iced+0.445
 Hoover+0.435
 Cox+0.432
Neg logit contributions
SOURCE-0.514
ALD-0.498
-0.492
heads-0.489
vernment-0.487
Token-
Token ID12
Feature activation-3.92e+00

Pos logit contributions
SOURCE+8.623
heads+8.480
ighter+7.977
ALD+7.905
+7.866
Neg logit contributions
ering-8.865
 Dupl-8.484
iced-8.338
imaru-8.281
 Dollar-8.012
Tokenpowered
Token ID12293
Feature activation-7.46e+00

Pos logit contributions
SOURCE+1.999
heads+1.885
ALD+1.848
lance+1.845
+1.844
Neg logit contributions
ering-4.025
 Dupl-3.865
 Hoover-3.770
iced-3.758
 Cox-3.756
Token carrier
Token ID11920
Feature activation-4.78e+00

Pos logit contributions
SOURCE+5.309
heads+5.128
ALD+4.990
+4.866
lance+4.851
Neg logit contributions
ering-5.713
iced-5.397
 Dupl-5.331
imaru-5.290
arget-5.003
Token project
Token ID1628
Feature activation+3.64e-01

Pos logit contributions
SOURCE+3.749
heads+3.670
ALD+3.642
lance+3.582
+3.572
Neg logit contributions
ering-3.461
 Dupl-3.257
iced-3.240
 Hoover-3.086
imaru-3.076
Token was
Token ID373
Feature activation+1.21e+00

Pos logit contributions
ering+0.256
 Dupl+0.237
iced+0.232
 Hoover+0.229
 Cox+0.226
Neg logit contributions
SOURCE-0.306
ALD-0.298
-0.295
lance-0.293
vernment-0.293
Tokens
Token ID82
Feature activation-1.86e+00

Pos logit contributions
SOURCE+1.248
ALD+1.219
+1.215
heads+1.214
vernment+1.202
Neg logit contributions
ering-1.160
 Dupl-1.080
iced-1.047
 Hoover-1.031
 Cox-1.010
Token was
Token ID373
Feature activation+5.40e-01

Pos logit contributions
SOURCE+1.467
ALD+1.434
heads+1.434
+1.415
vernment+1.405
Neg logit contributions
ering-1.367
 Dupl-1.282
iced-1.246
 Hoover-1.229
imaru-1.205
Token the
Token ID262
Feature activation-1.10e+00

Pos logit contributions
ering+0.386
 Dupl+0.357
iced+0.349
 Hoover+0.344
 Cox+0.336
Neg logit contributions
SOURCE-0.442
ALD-0.436
-0.430
heads-0.429
vernment-0.424
Token work
Token ID670
Feature activation-1.33e+00

Pos logit contributions
SOURCE+0.888
ALD+0.870
+0.859
heads+0.856
lance+0.851
Neg logit contributions
ering-0.787
 Dupl-0.732
iced-0.728
 Hoover-0.697
imaru-0.686
Token of
Token ID286
Feature activation+4.26e-01

Pos logit contributions
SOURCE+1.073
ALD+1.044
+1.033
vernment+1.030
lance+1.020
Neg logit contributions
ering-0.992
 Dupl-0.906
iced-0.894
 Hoover-0.884
 Cox-0.877
Token Islamic
Token ID5533
Feature activation-1.24e+01

Pos logit contributions
ering+0.289
 Dupl+0.265
iced+0.260
 Hoover+0.256
 Cox+0.251
Neg logit contributions
SOURCE-0.365
ALD-0.359
-0.355
vernment-0.353
lance-0.352
Token extremists
Token ID20674
Feature activation-1.95e+00

Pos logit contributions
heads+8.080
SOURCE+7.923
unknown+7.632
lance+7.576
vernment+7.550
Neg logit contributions
ering-9.309
iced-8.716
arget-8.652
 Dupl-8.577
imaru-8.559
Token or
Token ID393
Feature activation-5.31e+00

Pos logit contributions
SOURCE+1.566
ALD+1.523
heads+1.521
+1.508
vernment+1.501
Neg logit contributions
ering-1.413
 Dupl-1.321
iced-1.288
 Hoover-1.266
 Cox-1.247
Token by
Token ID416
Feature activation-1.03e-01

Pos logit contributions
SOURCE+4.180
ALD+4.161
heads+4.139
vernment+4.058
+4.043
Neg logit contributions
ering-3.634
 Dupl-3.400
iced-3.321
arget-3.233
imaru-3.225
Token others
Token ID1854
Feature activation-3.88e+00

Pos logit contributions
SOURCE+0.088
ALD+0.086
+0.085
vernment+0.085
heads+0.085
Neg logit contributions
ering-0.070
 Dupl-0.064
iced-0.063
 Hoover-0.062
 Cox-0.061
Token seeking
Token ID6095
Feature activation+5.77e-01

Pos logit contributions
SOURCE+2.872
heads+2.838
ALD+2.794
unknown+2.790
+2.768
Neg logit contributions
ering-2.936
 Dupl-2.765
iced-2.693
 Hoover-2.655
imaru-2.630
Token how
Token ID703
Feature activation-6.55e-01

Pos logit contributions
ALD+4.997
heads+4.912
SOURCE+4.904
+4.902
ITNESS+4.834
Neg logit contributions
ering-4.155
 Dupl-3.903
iced-3.789
 Hoover-3.742
arget-3.689
Token he
Token ID339
Feature activation-8.94e-01

Pos logit contributions
SOURCE+0.555
ALD+0.554
heads+0.545
+0.544
vernment+0.540
Neg logit contributions
ering-0.443
 Dupl-0.409
iced-0.401
 Hoover-0.394
 swallow-0.384
Token was
Token ID373
Feature activation+1.29e+00

Pos logit contributions
SOURCE+0.672
ALD+0.665
+0.656
heads+0.654
lance+0.647
Neg logit contributions
ering-0.691
 Dupl-0.651
iced-0.631
 Hoover-0.628
 Cox-0.615
Token inside
Token ID2641
Feature activation-3.04e+00

Pos logit contributions
ering+0.910
 Dupl+0.858
iced+0.823
 Hoover+0.819
 Cox+0.804
Neg logit contributions
SOURCE-1.068
ALD-1.051
heads-1.033
-1.033
lance-1.016
Token an
Token ID281
Feature activation+2.04e+00

Pos logit contributions
SOURCE+2.580
ALD+2.569
+2.521
vernment+2.508
heads+2.503
Neg logit contributions
ering-2.018
 Dupl-1.850
iced-1.805
 Hoover-1.770
arget-1.770
Token Islamic
Token ID5533
Feature activation-1.23e+01

Pos logit contributions
ering+1.513
 Dupl+1.398
iced+1.372
 Hoover+1.358
 Cox+1.339
Neg logit contributions
SOURCE-1.636
ALD-1.599
-1.582
vernment-1.570
lance-1.562
Token cultural
Token ID6467
Feature activation+2.86e+00

Pos logit contributions
SOURCE+8.094
heads+7.987
ALD+7.786
+7.739
lance+7.728
Neg logit contributions
ering-8.985
 Dupl-8.479
iced-8.331
arget-8.247
imaru-8.142
Token centre
Token ID7372
Feature activation-1.76e+00

Pos logit contributions
ering+2.294
 Dupl+2.108
iced+2.084
 Hoover+2.070
 Cox+2.055
Neg logit contributions
SOURCE-2.139
ALD-2.073
-2.063
vernment-2.043
lance-2.027
Token with
Token ID351
Feature activation+2.33e-01

Pos logit contributions
SOURCE+1.477
ALD+1.453
heads+1.445
+1.439
lance+1.427
Neg logit contributions
ering-1.198
 Dupl-1.122
iced-1.080
 Hoover-1.073
 Cox-1.048
Token a
Token ID257
Feature activation-1.26e+00

Pos logit contributions
ering+0.161
 Dupl+0.148
iced+0.145
 Hoover+0.143
 Cox+0.140
Neg logit contributions
SOURCE-0.198
ALD-0.195
-0.192
heads-0.191
lance-0.191
Token young
Token ID1862
Feature activation+1.96e+00

Pos logit contributions
SOURCE+0.968
ALD+0.956
+0.945
heads+0.939
lance+0.937
Neg logit contributions
ering-0.947
 Dupl-0.884
iced-0.868
 Hoover-0.848
 Cox-0.828
Token'
Token ID6
Feature activation-3.10e+00

Pos logit contributions
SOURCE+3.019
ALD+2.915
vernment+2.861
+2.860
lance+2.852
Neg logit contributions
ering-2.681
 Dupl-2.531
iced-2.464
 Hoover-2.408
imaru-2.397
Token wing
Token ID8539
Feature activation-6.42e+00

Pos logit contributions
SOURCE+2.598
ALD+2.528
+2.497
lance+2.494
heads+2.470
Neg logit contributions
ering-2.137
 Dupl-1.982
iced-1.930
 Hoover-1.897
 Cox-1.885
Token span
Token ID11506
Feature activation-2.58e+00

Pos logit contributions
SOURCE+4.035
heads+3.948
lance+3.943
ALD+3.861
+3.783
Neg logit contributions
ering-5.548
 Dupl-5.328
iced-5.130
 Cox-5.124
 Hoover-5.121
Token and
Token ID290
Feature activation-1.04e+00

Pos logit contributions
SOURCE+2.104
heads+2.048
ALD+2.047
lance+2.047
+2.016
Neg logit contributions
ering-1.809
 Dupl-1.711
iced-1.660
 Hoover-1.632
 Cox-1.629
Token a
Token ID257
Feature activation-1.36e-01

Pos logit contributions
SOURCE+0.870
ALD+0.844
heads+0.836
lance+0.833
vernment+0.824
Neg logit contributions
ering-0.725
 Dupl-0.671
iced-0.658
 Hoover-0.638
 Cox-0.633
Token fu
Token ID14035
Feature activation-1.20e+01

Pos logit contributions
SOURCE+0.115
ALD+0.111
lance+0.111
heads+0.110
+0.110
Neg logit contributions
ering-0.094
 Dupl-0.087
iced-0.085
 Hoover-0.083
 Cox-0.082
Tokenselage
Token ID45217
Feature activation-6.09e+00

Pos logit contributions
SOURCE+1.448
lance+1.414
vernment+1.266
ALD+1.217
+1.131
Neg logit contributions
ering-15.186
 Dupl-15.137
 Lans-14.749
 Hoover-14.710
 Decay-14.602
Token about
Token ID546
Feature activation-5.20e+00

Pos logit contributions
SOURCE+4.749
heads+4.667
lance+4.542
ALD+4.523
unknown+4.439
Neg logit contributions
ering-4.297
 Dupl-4.090
iced-3.984
imaru-3.919
 Hoover-3.877
Token 5
Token ID642
Feature activation-1.98e+00

Pos logit contributions
SOURCE+3.921
heads+3.873
ALD+3.777
lance+3.723
unknown+3.651
Neg logit contributions
ering-3.835
 Dupl-3.518
iced-3.473
oner-3.364
imaru-3.333
Token'
Token ID6
Feature activation-2.00e+00

Pos logit contributions
SOURCE+1.456
ALD+1.401
heads+1.378
lance+1.373
+1.371
Neg logit contributions
ering-1.589
 Dupl-1.489
iced-1.456
 Hoover-1.441
 Cox-1.424
Token wide
Token ID3094
Feature activation-4.35e+00

Pos logit contributions
SOURCE+1.747
ALD+1.698
heads+1.665
lance+1.665
+1.664
Neg logit contributions
ering-1.329
 Dupl-1.228
iced-1.196
 Hoover-1.180
 Cox-1.160
Token Mohammad
Token ID29674
Feature activation-4.39e+00

Pos logit contributions
ering+1.459
 Dupl+1.342
iced+1.314
 Hoover+1.310
 Cox+1.296
Neg logit contributions
SOURCE-1.673
ALD-1.634
-1.622
vernment-1.611
lance-1.606
Token School
Token ID3961
Feature activation-6.55e-01

Pos logit contributions
SOURCE+3.663
ALD+3.585
heads+3.509
+3.505
lance+3.487
Neg logit contributions
ering-2.961
iced-2.741
 Dupl-2.725
 swallow-2.610
 Hoover-2.592
Token teaching
Token ID7743
Feature activation+3.93e+00

Pos logit contributions
SOURCE+0.565
ALD+0.552
+0.543
lance+0.541
heads+0.539
Neg logit contributions
ering-0.443
 Dupl-0.409
iced-0.405
 Hoover-0.394
 Cox-0.389
Token "
Token ID366
Feature activation+8.71e-01

Pos logit contributions
ering+2.737
 Dupl+2.510
iced+2.463
 Hoover+2.441
 Cox+2.412
Neg logit contributions
SOURCE-3.340
ALD-3.262
-3.241
vernment-3.211
lance-3.200
Tokena
Token ID64
Feature activation-1.43e+00

Pos logit contributions
ering+0.727
 Dupl+0.678
iced+0.666
 Hoover+0.662
 Cox+0.655
Neg logit contributions
SOURCE-0.621
ALD-0.603
-0.598
vernment-0.591
lance-0.590
Token Qur
Token ID17599
Feature activation-1.20e+01

Pos logit contributions
SOURCE+1.157
ALD+1.122
+1.117
lance+1.111
heads+1.108
Neg logit contributions
ering-1.023
 Dupl-0.952
iced-0.937
 Hoover-0.907
 Cox-0.896
Token'
Token ID6
Feature activation-4.72e+00

Pos logit contributions
SOURCE+6.999
ALD+6.995
+6.811
lance+6.792
vernment+6.774
Neg logit contributions
ering-9.944
 Dupl-9.516
iced-9.353
 Hoover-9.331
imaru-9.236
Tokenan
Token ID272
Feature activation-3.87e+00

Pos logit contributions
SOURCE+3.377
ALD+3.305
vernment+3.260
+3.257
lance+3.215
Neg logit contributions
ering-3.782
 Dupl-3.608
iced-3.473
 Hoover-3.472
 Cox-3.398
Token-
Token ID12
Feature activation-4.00e+00

Pos logit contributions
SOURCE+3.070
ALD+3.009
heads+2.943
lance+2.934
+2.934
Neg logit contributions
ering-2.795
 Dupl-2.624
iced-2.564
 Hoover-2.509
 Cox-2.491
Tokenguided
Token ID23657
Feature activation-1.40e+00

Pos logit contributions
SOURCE+2.455
ALD+2.355
heads+2.352
lance+2.340
unknown+2.302
Neg logit contributions
ering-3.621
 Dupl-3.499
 Cox-3.384
 Hoover-3.372
iced-3.350
Token journey
Token ID7002
Feature activation+1.33e+00

Pos logit contributions
SOURCE+1.114
ALD+1.095
lance+1.081
+1.077
vernment+1.064
Neg logit contributions
ering-1.016
 Dupl-0.958
iced-0.940
 Hoover-0.910
imaru-0.905
Token Vessel
Token ID44734
Feature activation-1.42e+00

Pos logit contributions
SOURCE+0.223
ALD+0.216
+0.214
vernment+0.213
lance+0.213
Neg logit contributions
ering-0.245
 Dupl-0.227
iced-0.225
 Hoover-0.223
 Cox-0.220
Tokens
Token ID82
Feature activation-1.38e+00

Pos logit contributions
SOURCE+1.204
ALD+1.168
+1.153
heads+1.152
lance+1.143
Neg logit contributions
ering-0.978
iced-0.900
 Dupl-0.898
 Hoover-0.877
 Cox-0.859
Token
Token ID198
Feature activation-4.76e-01

Pos logit contributions
SOURCE+1.143
ALD+1.105
+1.095
vernment+1.086
heads+1.086
Neg logit contributions
ering-0.977
 Dupl-0.898
iced-0.894
 Hoover-0.875
 Cox-0.861
Token
Token ID198
Feature activation-3.69e-01

Pos logit contributions
SOURCE+0.364
ALD+0.348
heads+0.346
lance+0.342
unknown+0.341
Neg logit contributions
ering-0.370
 Dupl-0.345
iced-0.345
 Hoover-0.341
 swallow-0.337
TokenN
Token ID45
Feature activation-5.98e+00

Pos logit contributions
SOURCE+0.258
ALD+0.248
+0.243
lance+0.241
heads+0.240
Neg logit contributions
ering-0.317
 Dupl-0.298
iced-0.293
 Hoover-0.291
 Cox-0.288
Tokenuclear
Token ID4016
Feature activation-1.20e+01

Pos logit contributions
SOURCE+3.205
ALD+3.121
vernment+3.027
heads+3.016
lance+2.999
Neg logit contributions
ering-5.775
 Dupl-5.589
 Hoover-5.483
 Cox-5.437
iced-5.426
Token propulsion
Token ID38873
Feature activation-6.63e+00

Pos logit contributions
SOURCE+9.148
heads+8.973
vernment+8.764
+8.634
ALD+8.617
Neg logit contributions
ering-7.758
 Dupl-7.234
iced-7.223
imaru-7.091
 Hoover-6.963
Token has
Token ID468
Feature activation-2.24e+00

Pos logit contributions
SOURCE+5.051
heads+4.906
ALD+4.795
unknown+4.722
lance+4.721
Neg logit contributions
ering-4.772
iced-4.540
 Dupl-4.526
imaru-4.425
 Hoover-4.350
Token proven
Token ID9157
Feature activation+1.21e+00

Pos logit contributions
SOURCE+1.771
ALD+1.691
heads+1.684
vernment+1.675
unknown+1.642
Neg logit contributions
ering-1.669
 Dupl-1.565
iced-1.521
 Hoover-1.489
imaru-1.487
Token technically
Token ID14497
Feature activation-7.02e+00

Pos logit contributions
ering+0.788
 Dupl+0.723
iced+0.710
 Hoover+0.700
 Cox+0.691
Neg logit contributions
SOURCE-1.079
ALD-1.057
-1.046
vernment-1.041
lance-1.036
Token and
Token ID290
Feature activation-3.58e+00

Pos logit contributions
SOURCE+5.317
ALD+5.245
heads+5.211
unknown+5.067
vernment+5.027
Neg logit contributions
ering-5.039
 Dupl-4.776
iced-4.750
 Hoover-4.677
imaru-4.639
Token the
Token ID262
Feature activation-1.90e+00

Pos logit contributions
SOURCE+2.076
ALD+2.042
+2.007
heads+1.999
vernment+1.998
Neg logit contributions
ering-1.399
iced-1.260
 Dupl-1.247
 swallow-1.198
 Hoover-1.174
Token first
Token ID717
Feature activation-4.05e-01

Pos logit contributions
SOURCE+1.649
ALD+1.620
+1.591
heads+1.588
vernment+1.575
Neg logit contributions
ering-1.257
iced-1.142
 Dupl-1.120
 swallow-1.076
arget-1.073
Token Ghostbusters
Token ID50037
Feature activation-1.06e+01

Pos logit contributions
SOURCE+0.356
ALD+0.350
+0.346
vernment+0.345
heads+0.343
Neg logit contributions
ering-0.267
iced-0.241
 Dupl-0.240
 Hoover-0.230
 Cox-0.228
Token,
Token ID11
Feature activation-1.49e+00

Pos logit contributions
ALD+7.485
vernment+7.480
+7.464
heads+7.460
SOURCE+7.448
Neg logit contributions
ering-7.456
iced-6.775
imaru-6.738
arget-6.719
 Dupl-6.629
Token the
Token ID262
Feature activation-1.50e+00

Pos logit contributions
SOURCE+1.237
ALD+1.229
+1.195
heads+1.190
vernment+1.177
Neg logit contributions
ering-1.050
 Dupl-0.949
iced-0.938
 swallow-0.917
oner-0.908
Token Fire
Token ID3764
Feature activation-1.19e+01

Pos logit contributions
SOURCE+1.322
ALD+1.302
+1.286
vernment+1.263
heads+1.263
Neg logit contributions
ering-0.977
iced-0.875
 Dupl-0.871
 swallow-0.833
 Hoover-0.825
Tokenhouse
Token ID4803
Feature activation-4.88e+00

Pos logit contributions
heads+8.684
lance+8.386
SOURCE+8.385
ALD+8.122
vernment+8.019
Neg logit contributions
ering-8.423
imaru-8.018
iced-7.759
 Dupl-7.640
 messaging-7.543
Token in
Token ID287
Feature activation-1.04e+00

Pos logit contributions
SOURCE+3.798
heads+3.746
+3.744
ALD+3.724
ITNESS+3.666
Neg logit contributions
ering-3.521
iced-3.206
 Dupl-3.203
imaru-3.062
 Hoover-3.056
Token Trib
Token ID10841
Feature activation+4.54e+00

Pos logit contributions
SOURCE+0.933
ALD+0.920
+0.912
heads+0.904
vernment+0.899
Neg logit contributions
ering-0.665
iced-0.597
 Dupl-0.594
 swallow-0.572
 Hoover-0.564
Tokeneca
Token ID31047
Feature activation+1.85e+00

Pos logit contributions
ering+0.728
 Dupl+0.464
iced+0.409
 Hoover+0.385
 Cox+0.343
Neg logit contributions
SOURCE-6.294
ALD-6.203
-6.168
vernment-6.145
lance-6.135
Token neighborhood
Token ID6232
Feature activation+2.15e+00

Pos logit contributions
ering+1.326
 Dupl+1.220
iced+1.198
 Hoover+1.186
 Cox+1.172
Neg logit contributions
SOURCE-1.538
ALD-1.502
-1.491
vernment-1.477
lance-1.472
Token.
Token ID13
Feature activation-8.24e-01

Pos logit contributions
SOURCE+0.317
ALD+0.310
+0.308
lance+0.304
vernment+0.304
Neg logit contributions
ering-0.280
 Dupl-0.257
iced-0.252
 Hoover-0.250
 Cox-0.247
Token A
Token ID317
Feature activation-1.22e+00

Pos logit contributions
SOURCE+0.663
ALD+0.640
+0.635
vernment+0.630
heads+0.627
Neg logit contributions
ering-0.615
 Dupl-0.561
iced-0.561
 Hoover-0.548
 swallow-0.543
Token leader
Token ID3554
Feature activation-1.48e+00

Pos logit contributions
SOURCE+0.921
ALD+0.890
vernment+0.889
heads+0.882
+0.879
Neg logit contributions
ering-0.963
 Dupl-0.899
iced-0.882
 Hoover-0.872
 Cox-0.863
Token of
Token ID286
Feature activation-8.96e-01

Pos logit contributions
SOURCE+1.179
ALD+1.147
+1.137
vernment+1.131
heads+1.128
Neg logit contributions
ering-1.108
 Dupl-1.028
iced-1.009
 Hoover-0.997
 Cox-0.985
Token the
Token ID262
Feature activation-4.16e-01

Pos logit contributions
SOURCE+0.761
ALD+0.742
vernment+0.738
+0.738
lance+0.730
Neg logit contributions
ering-0.622
 Dupl-0.571
iced-0.561
 Hoover-0.553
 Cox-0.544
Token Islamic
Token ID5533
Feature activation-1.18e+01

Pos logit contributions
SOURCE+0.297
vernment+0.289
ALD+0.285
+0.283
heads+0.283
Neg logit contributions
ering-0.344
 Dupl-0.319
iced-0.315
 Hoover-0.310
 Cox-0.305
Token militant
Token ID18614
Feature activation-2.43e+00

Pos logit contributions
SOURCE+8.101
vernment+7.873
heads+7.633
unknown+7.530
ğ+7.530
Neg logit contributions
ering-8.757
iced-8.136
arget-8.129
 Dupl-8.078
imaru-8.023
Token group
Token ID1448
Feature activation-3.66e+00

Pos logit contributions
SOURCE+1.445
vernment+1.435
heads+1.395
ALD+1.392
lance+1.378
Neg logit contributions
ering-2.269
 Dupl-2.134
iced-2.098
 Hoover-2.044
 Dollar-2.029
Token Hamas
Token ID15767
Feature activation-8.89e+00

Pos logit contributions
SOURCE+2.682
heads+2.648
vernment+2.610
ALD+2.557
lance+2.556
Neg logit contributions
ering-2.870
 Dupl-2.680
iced-2.667
 Hoover-2.588
 swallow-2.543
Token said
Token ID531
Feature activation-2.64e+00

Pos logit contributions
SOURCE+6.594
vernment+6.457
heads+6.404
ALD+6.292
+6.239
Neg logit contributions
ering-6.433
 Dupl-6.042
iced-5.970
 Hoover-5.911
arget-5.826
Token to
Token ID284
Feature activation-8.33e-01

Pos logit contributions
SOURCE+2.200
heads+2.139
ALD+2.130
vernment+2.122
lance+2.105
Neg logit contributions
ering-1.826
 Dupl-1.689
iced-1.675
 Hoover-1.619
arget-1.592
Token it
Token ID340
Feature activation-6.29e-01

Pos logit contributions
ering+0.108
 Dupl+0.098
iced+0.096
 Hoover+0.095
 Cox+0.093
Neg logit contributions
SOURCE-0.157
ALD-0.154
-0.152
vernment-0.151
heads-0.151
Token houses
Token ID7777
Feature activation-2.44e+00

Pos logit contributions
SOURCE+0.490
ALD+0.475
+0.473
vernment+0.471
heads+0.471
Neg logit contributions
ering-0.475
 Dupl-0.444
iced-0.435
 Hoover-0.429
 Cox-0.424
Token 10
Token ID838
Feature activation-4.70e-01

Pos logit contributions
SOURCE+2.078
ALD+2.051
vernment+2.031
+2.021
heads+2.012
Neg logit contributions
ering-1.632
 Dupl-1.495
iced-1.485
 Hoover-1.436
 Cox-1.414
Token,
Token ID11
Feature activation+5.61e-01

Pos logit contributions
SOURCE+0.375
ALD+0.365
+0.363
vernment+0.361
heads+0.358
Neg logit contributions
ering-0.350
 Dupl-0.324
iced-0.319
 Hoover-0.314
 Cox-0.309
Token000
Token ID830
Feature activation-2.44e+00

Pos logit contributions
ering+0.397
 Dupl+0.366
iced+0.358
 Hoover+0.355
 Cox+0.351
Neg logit contributions
SOURCE-0.471
ALD-0.460
-0.457
vernment-0.452
lance-0.451
Token worsh
Token ID26511
Feature activation-1.18e+01

Pos logit contributions
SOURCE+1.844
ALD+1.793
heads+1.783
+1.775
vernment+1.768
Neg logit contributions
ering-1.887
 Dupl-1.768
iced-1.732
 Hoover-1.715
 Cox-1.680
Tokenippers
Token ID16415
Feature activation-5.79e+00

Pos logit contributions
heads+6.695
ighters+5.954
lance+5.867
ighter+5.675
unknown+5.616
Neg logit contributions
 Dupl-10.656
ering-10.524
 Dollar-10.342
 Decay-10.338
 Hoover-10.321
Token and
Token ID290
Feature activation-2.58e+00

Pos logit contributions
SOURCE+4.352
heads+4.299
+4.281
vernment+4.271
ALD+4.208
Neg logit contributions
ering-4.259
 Dupl-4.032
iced-3.895
 Hoover-3.882
imaru-3.805
Token ooz
Token ID49564
Feature activation+2.29e+00

Pos logit contributions
SOURCE+2.031
heads+1.970
ALD+1.967
vernment+1.960
lance+1.945
Neg logit contributions
ering-1.885
 Dupl-1.756
iced-1.744
 Hoover-1.680
arget-1.670
Tokenes
Token ID274
Feature activation+1.13e+00

Pos logit contributions
ering+0.925
 Dupl+0.800
iced+0.773
 Hoover+0.757
 Cox+0.740
Neg logit contributions
SOURCE-2.607
ALD-2.564
-2.545
vernment-2.539
lance-2.527
Token confidence
Token ID6628
Feature activation+4.97e-01

Pos logit contributions
ering+0.782
 Dupl+0.722
iced+0.710
 Hoover+0.695
 Cox+0.690
Neg logit contributions
SOURCE-0.964
ALD-0.941
-0.930
vernment-0.927
lance-0.925
Token Arab
Token ID4498
Feature activation-1.07e+00

Pos logit contributions
SOURCE+0.365
ALD+0.354
+0.352
vernment+0.350
heads+0.350
Neg logit contributions
ering-0.318
 Dupl-0.293
iced-0.292
 Hoover-0.284
 Cox-0.279
Token League
Token ID4041
Feature activation+2.55e+00

Pos logit contributions
SOURCE+0.893
ALD+0.867
heads+0.860
+0.856
vernment+0.855
Neg logit contributions
ering-0.745
 Dupl-0.692
iced-0.683
 Hoover-0.675
 Cox-0.662
Token and
Token ID290
Feature activation-6.30e-01

Pos logit contributions
ering+1.835
 Dupl+1.690
iced+1.661
 Hoover+1.643
 Cox+1.622
Neg logit contributions
SOURCE-2.108
ALD-2.057
-2.042
vernment-2.025
lance-2.016
Token Organisation
Token ID30801
Feature activation-2.19e+00

Pos logit contributions
SOURCE+0.530
ALD+0.517
vernment+0.515
heads+0.512
lance+0.508
Neg logit contributions
ering-0.433
 Dupl-0.402
iced-0.396
 Hoover-0.385
 Cox-0.380
Token of
Token ID286
Feature activation-1.14e+00

Pos logit contributions
SOURCE+1.818
ALD+1.759
vernment+1.754
+1.744
heads+1.732
Neg logit contributions
ering-1.553
 Dupl-1.435
iced-1.419
 Hoover-1.398
 Cox-1.375
Token Islamic
Token ID5533
Feature activation-1.18e+01

Pos logit contributions
SOURCE+0.730
heads+0.712
ALD+0.710
vernment+0.699
+0.694
Neg logit contributions
ering-1.000
 Dupl-0.939
iced-0.932
 Hoover-0.922
 Cox-0.913
Token Co
Token ID1766
Feature activation+1.94e+00

Pos logit contributions
SOURCE+5.898
heads+5.877
unknown+5.352
lance+5.249
+5.084
Neg logit contributions
ering-10.903
arget-10.659
iced-10.634
 swallow-10.161
 Dupl-10.146
Token-
Token ID12
Feature activation-1.24e+00

Pos logit contributions
ering+1.363
 Dupl+1.251
iced+1.229
 Hoover+1.216
 Cox+1.201
Neg logit contributions
SOURCE-1.633
ALD-1.595
-1.584
vernment-1.569
lance-1.564
Tokenoperation
Token ID27184
Feature activation-1.65e+00

Pos logit contributions
SOURCE+0.840
vernment+0.808
heads+0.807
ALD+0.801
ITNESS+0.787
Neg logit contributions
ering-1.051
 Dupl-1.003
iced-0.993
 Hoover-0.989
 Cox-0.968
Token and
Token ID290
Feature activation-2.04e+00

Pos logit contributions
SOURCE+1.335
ALD+1.297
+1.293
vernment+1.292
heads+1.277
Neg logit contributions
ering-1.209
 Dupl-1.116
iced-1.104
 Hoover-1.089
 Cox-1.064
Token Gulf
Token ID12108
Feature activation-8.80e+00

Pos logit contributions
vernment+1.741
heads+1.724
SOURCE+1.724
ALD+1.699
lance+1.695
Neg logit contributions
ering-1.296
iced-1.214
 Dupl-1.185
 swallow-1.155
 Hoover-1.149
Token.[
Token ID3693
Feature activation-3.10e+00

Pos logit contributions
SOURCE+4.848
heads+4.708
ALD+4.662
+4.612
unknown+4.563
Neg logit contributions
ering-4.532
 Dupl-4.176
iced-4.149
 Hoover-4.054
 swallow-4.007
Token26
Token ID2075
Feature activation-4.91e-01

Pos logit contributions
SOURCE+2.624
ALD+2.499
+2.498
lance+2.479
heads+2.470
Neg logit contributions
ering-2.161
 Dupl-1.995
 Hoover-1.958
iced-1.949
 Cox-1.914
Token]
Token ID60
Feature activation+2.65e+00

Pos logit contributions
SOURCE+0.422
ALD+0.413
+0.412
vernment+0.405
lance+0.405
Neg logit contributions
ering-0.338
 Dupl-0.310
iced-0.304
 Hoover-0.299
 Cox-0.298
Token
Token ID198
Feature activation-6.91e-01

Pos logit contributions
ering+1.873
 Dupl+1.726
iced+1.690
 Hoover+1.677
 Cox+1.657
Neg logit contributions
SOURCE-2.223
ALD-2.172
-2.158
vernment-2.140
lance-2.131
Token
Token ID198
Feature activation-4.52e-01

Pos logit contributions
SOURCE+0.524
heads+0.498
ALD+0.497
unknown+0.492
lance+0.489
Neg logit contributions
ering-0.545
iced-0.507
 Dupl-0.505
 Hoover-0.500
 swallow-0.498
TokenMus
Token ID10694
Feature activation-1.18e+01

Pos logit contributions
SOURCE+0.313
ALD+0.302
+0.298
lance+0.295
heads+0.294
Neg logit contributions
ering-0.389
 Dupl-0.363
iced-0.357
 Hoover-0.357
 Cox-0.352
Tokenics
Token ID873
Feature activation-3.99e+00

Pos logit contributions
heads+5.883
lance+5.814
+5.724
SOURCE+5.538
unknown+5.528
Neg logit contributions
ering-10.675
 Dupl-10.549
 Hoover-10.299
 Dollar-10.153
 Cox-10.100
Tokenc
Token ID66
Feature activation-1.95e+00

Pos logit contributions
SOURCE+3.217
ALD+3.134
heads+3.112
lance+3.094
+3.080
Neg logit contributions
ering-2.859
 Dupl-2.611
iced-2.590
 Hoover-2.551
 Cox-2.517
Tokenapes
Token ID7916
Feature activation-4.54e-01

Pos logit contributions
SOURCE+1.619
ALD+1.603
heads+1.581
vernment+1.574
lance+1.568
Neg logit contributions
ering-1.351
 Dupl-1.261
 Hoover-1.238
iced-1.225
 Cox-1.205
Token from
Token ID422
Feature activation-9.76e-02

Pos logit contributions
SOURCE+0.378
ALD+0.368
+0.366
vernment+0.362
lance+0.362
Neg logit contributions
ering-0.324
 Dupl-0.296
iced-0.292
 Hoover-0.288
 Cox-0.285
Token M
Token ID337
Feature activation-1.66e+00

Pos logit contributions
SOURCE+0.085
ALD+0.083
+0.083
vernment+0.082
lance+0.082
Neg logit contributions
ering-0.066
 Dupl-0.059
iced-0.059
 Hoover-0.058
 Cox-0.057
Tokenama
Token ID1689
Feature activation+6.52e-01

Pos logit contributions
SOURCE+1.178
ALD+1.164
vernment+1.146
lance+1.142
heads+1.140
Neg logit contributions
ering-1.209
 Dupl-1.139
 Hoover-1.116
iced-1.108
 Cox-1.097
Token's
Token ID338
Feature activation-1.40e+00

Pos logit contributions
ering+0.452
 Dupl+0.417
iced+0.408
 Hoover+0.404
 Cox+0.399
Neg logit contributions
SOURCE-0.550
ALD-0.539
-0.535
vernment-0.532
lance-0.529
Token kitchen
Token ID9592
Feature activation-1.09e+00

Pos logit contributions
SOURCE+1.221
ALD+1.206
+1.183
lance+1.178
heads+1.176
Neg logit contributions
ering-0.922
 Dupl-0.834
iced-0.829
 Hoover-0.803
 Cox-0.795
Token,
Token ID11
Feature activation-4.87e-01

Pos logit contributions
SOURCE+0.911
ALD+0.893
+0.883
heads+0.880
lance+0.878
Neg logit contributions
ering-0.759
 Dupl-0.702
iced-0.685
 Hoover-0.676
 Cox-0.671
Token the
Token ID262
Feature activation-1.49e+00

Pos logit contributions
SOURCE+0.425
ALD+0.418
+0.413
vernment+0.410
lance+0.408
Neg logit contributions
ering-0.325
 Dupl-0.292
iced-0.291
 Hoover-0.283
 Cox-0.280
Token Fire
Token ID3764
Feature activation-1.16e+01

Pos logit contributions
SOURCE+1.260
ALD+1.239
+1.226
vernment+1.214
lance+1.210
Neg logit contributions
ering-1.020
 Dupl-0.918
iced-0.918
 Hoover-0.886
 Cox-0.876
Token Station
Token ID9327
Feature activation-1.64e+00

Pos logit contributions
heads+9.305
lance+9.241
SOURCE+9.111
ALD+8.860
unknown+8.747
Neg logit contributions
ering-7.206
 Dupl-6.668
imaru-6.620
iced-6.616
 messaging-6.464
Token,
Token ID11
Feature activation-8.47e-01

Pos logit contributions
SOURCE+1.353
ALD+1.328
+1.314
heads+1.311
vernment+1.302
Neg logit contributions
ering-1.142
 Dupl-1.051
iced-1.039
 Hoover-1.016
 Cox-1.010
Token Sn
Token ID5489
Feature activation+1.39e-01

Pos logit contributions
SOURCE+0.750
ALD+0.739
+0.731
vernment+0.727
heads+0.720
Neg logit contributions
ering-0.551
 Dupl-0.497
iced-0.492
 Hoover-0.480
 Cox-0.475
Tokenuf
Token ID3046
Feature activation-5.54e+00

Pos logit contributions
ering+0.101
 Dupl+0.094
iced+0.091
 Hoover+0.091
 Cox+0.090
Neg logit contributions
SOURCE-0.113
ALD-0.111
-0.109
vernment-0.109
lance-0.109
Tokenkin
Token ID5116
Feature activation-4.33e-01

Pos logit contributions
SOURCE+4.604
ALD+4.598
lance+4.551
vernment+4.519
heads+4.508
Neg logit contributions
ering-3.495
 Dupl-3.380
 Hoover-3.198
iced-3.177
 Cox-3.160
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
SOURCE+2.660
ALD+2.603
heads+2.583
vernment+2.575
+2.565
Neg logit contributions
ering-2.030
iced-1.865
 Dupl-1.860
 Hoover-1.780
 Cox-1.754
Token fo
Token ID11511
Feature activation-1.94e+00

Pos logit contributions
SOURCE+1.293
heads+1.246
ALD+1.245
vernment+1.235
+1.227
Neg logit contributions
ering-1.228
iced-1.128
 Dupl-1.118
 Hoover-1.083
 Cox-1.070
Tokeniling
Token ID4386
Feature activation+1.31e-01

Pos logit contributions
SOURCE+1.678
vernment+1.649
ALD+1.634
heads+1.632
+1.623
Neg logit contributions
ering-1.277
 Dupl-1.197
iced-1.155
 Hoover-1.150
 Cox-1.131
Token of
Token ID286
Feature activation+5.93e-01

Pos logit contributions
ering+0.095
 Dupl+0.085
 Hoover+0.084
iced+0.084
 Cox+0.083
Neg logit contributions
SOURCE-0.110
ALD-0.108
-0.107
vernment-0.104
lance-0.104
Token an
Token ID281
Feature activation+1.35e+00

Pos logit contributions
ering+0.415
 Dupl+0.382
iced+0.376
 Hoover+0.369
 Cox+0.365
Neg logit contributions
SOURCE-0.501
ALD-0.489
-0.486
vernment-0.483
lance-0.480
Token Islamic
Token ID5533
Feature activation-1.16e+01

Pos logit contributions
ering+1.032
 Dupl+0.955
iced+0.940
 Hoover+0.928
 Cox+0.918
Neg logit contributions
SOURCE-1.060
ALD-1.032
-1.021
vernment-1.016
lance-1.010
Token Jihad
Token ID35243
Feature activation-7.85e+00

Pos logit contributions
SOURCE+7.633
heads+7.314
lance+7.300
vernment+7.244
+7.117
Neg logit contributions
ering-8.897
iced-8.532
 Dupl-8.332
 swallow-8.006
imaru-8.000
Token plan
Token ID1410
Feature activation-4.51e+00

Pos logit contributions
SOURCE+5.695
heads+5.549
lance+5.428
ALD+5.406
vernment+5.382
Neg logit contributions
ering-5.708
 Dupl-5.492
iced-5.430
imaru-5.229
 Hoover-5.168
Token to
Token ID284
Feature activation-2.04e+00

Pos logit contributions
SOURCE+3.772
heads+3.713
vernment+3.670
lance+3.662
ALD+3.631
Neg logit contributions
ering-2.972
 Dupl-2.811
iced-2.726
 Hoover-2.677
 Cox-2.657
Token bomb
Token ID5194
Feature activation-3.36e+00

Pos logit contributions
SOURCE+1.577
heads+1.502
+1.495
vernment+1.492
ALD+1.492
Neg logit contributions
ering-1.557
iced-1.450
 Dupl-1.446
 Hoover-1.380
 Cox-1.376
Token the
Token ID262
Feature activation+2.99e-01

Pos logit contributions
SOURCE+2.600
vernment+2.561
heads+2.544
ALD+2.521
+2.487
Neg logit contributions
ering-2.500
 Dupl-2.310
iced-2.288
 Hoover-2.182
 swallow-2.175
Token in
Token ID287
Feature activation-1.12e+00

Pos logit contributions
ering+0.966
 Dupl+0.881
iced+0.870
 Hoover+0.853
 Cox+0.844
Neg logit contributions
SOURCE-1.258
ALD-1.230
-1.218
lance-1.207
vernment-1.207
Token his
Token ID465
Feature activation-8.30e-01

Pos logit contributions
SOURCE+0.981
ALD+0.958
lance+0.946
+0.944
vernment+0.938
Neg logit contributions
ering-0.742
 Dupl-0.678
iced-0.677
 Hoover-0.652
 swallow-0.648
Token book
Token ID1492
Feature activation-2.76e+00

Pos logit contributions
SOURCE+0.707
ALD+0.695
lance+0.688
+0.683
heads+0.679
Neg logit contributions
ering-0.562
iced-0.513
 Dupl-0.512
 Hoover-0.494
 Cox-0.488
Token
Token ID564
Feature activation+1.18e+00

Pos logit contributions
SOURCE+2.331
ALD+2.278
+2.253
lance+2.249
heads+2.245
Neg logit contributions
ering-1.889
iced-1.724
 Dupl-1.715
 Hoover-1.658
 Cox-1.649
Token
Token ID250
Feature activation-1.42e+00

Pos logit contributions
ering+0.973
 Dupl+0.910
iced+0.892
 Hoover+0.886
 Cox+0.876
Neg logit contributions
SOURCE-0.854
ALD-0.830
-0.826
vernment-0.816
lance-0.811
TokenMu
Token ID33239
Feature activation-1.15e+01

Pos logit contributions
SOURCE+1.109
ALD+1.079
+1.072
lance+1.061
vernment+1.057
Neg logit contributions
ering-1.084
 Dupl-1.010
iced-0.995
 Hoover-0.982
 Cox-0.972
Tokenhammad
Token ID14875
Feature activation-5.24e+00

Pos logit contributions
vernment+7.364
ALD+7.149
ğ+7.093
SOURCE+7.083
heads+7.049
Neg logit contributions
ering-8.757
 Dupl-8.481
 Hoover-8.401
imaru-8.226
 swallow-8.191
Token,
Token ID11
Feature activation+3.18e-01

Pos logit contributions
SOURCE+4.066
ALD+3.941
heads+3.927
lance+3.889
+3.883
Neg logit contributions
ering-3.886
iced-3.562
 Dupl-3.524
 swallow-3.435
 Hoover-3.414
Token Prophet
Token ID13583
Feature activation-5.71e+00

Pos logit contributions
ering+0.221
 Dupl+0.202
iced+0.201
 Hoover+0.197
 Cox+0.194
Neg logit contributions
SOURCE-0.270
ALD-0.263
-0.261
lance-0.259
vernment-0.259
Token of
Token ID286
Feature activation-1.51e+00

Pos logit contributions
SOURCE+4.285
heads+4.239
+4.209
ALD+4.159
lance+4.137
Neg logit contributions
ering-4.282
iced-3.943
 Dupl-3.942
 Hoover-3.836
 swallow-3.817
Token God
Token ID1793
Feature activation-6.45e+00

Pos logit contributions
SOURCE+1.130
ALD+1.095
lance+1.087
+1.086
heads+1.081
Neg logit contributions
ering-1.198
iced-1.096
 Dupl-1.092
 Hoover-1.067
 Cox-1.059
TokenIn
Token ID818
Feature activation-1.29e+00

Pos logit contributions
SOURCE+0.519
ALD+0.499
+0.491
lance+0.485
heads+0.485
Neg logit contributions
ering-0.672
 Dupl-0.634
iced-0.623
 Hoover-0.620
 Cox-0.613
Token 2012
Token ID2321
Feature activation+3.50e+00

Pos logit contributions
SOURCE+1.090
ALD+1.054
lance+1.044
+1.042
heads+1.041
Neg logit contributions
ering-0.896
 Dupl-0.830
iced-0.820
 Hoover-0.796
 Cox-0.787
Token construction
Token ID5103
Feature activation-3.16e+00

Pos logit contributions
ering+2.329
 Dupl+2.132
 Hoover+2.083
iced+2.058
 Cox+2.057
Neg logit contributions
SOURCE-3.068
ALD-2.998
-2.991
vernment-2.942
lance-2.937
Token of
Token ID286
Feature activation-4.05e-01

Pos logit contributions
SOURCE+2.410
heads+2.334
ALD+2.331
lance+2.290
+2.288
Neg logit contributions
ering-2.385
 Dupl-2.233
iced-2.226
imaru-2.158
 Hoover-2.153
Token a
Token ID257
Feature activation-1.04e+00

Pos logit contributions
SOURCE+0.344
ALD+0.337
+0.333
vernment+0.331
heads+0.331
Neg logit contributions
ering-0.280
 Dupl-0.256
iced-0.254
 Hoover-0.246
 Cox-0.244
Token nuclear
Token ID4523
Feature activation-1.14e+01

Pos logit contributions
SOURCE+0.907
ALD+0.890
heads+0.883
+0.881
lance+0.873
Neg logit contributions
ering-0.681
iced-0.617
 Dupl-0.617
 Hoover-0.586
 Cox-0.585
Token-
Token ID12
Feature activation-4.04e+00

Pos logit contributions
heads+7.914
SOURCE+7.912
ALD+7.614
+7.601
ighter+7.467
Neg logit contributions
ering-8.165
 Dupl-7.615
iced-7.518
imaru-7.420
 Dollar-7.259
Tokenpowered
Token ID12293
Feature activation-6.81e+00

Pos logit contributions
SOURCE+1.974
heads+1.875
+1.826
ALD+1.813
lance+1.807
Neg logit contributions
ering-4.208
 Dupl-4.048
 Hoover-3.942
 Cox-3.941
iced-3.929
Token deep
Token ID2769
Feature activation+1.08e-01

Pos logit contributions
SOURCE+5.199
heads+5.068
ALD+5.030
+4.967
lance+4.897
Neg logit contributions
ering-4.833
iced-4.449
 Dupl-4.357
imaru-4.353
arget-4.144
Token-
Token ID12
Feature activation-1.35e+00

Pos logit contributions
ering+0.076
 Dupl+0.070
iced+0.068
 Hoover+0.067
 Cox+0.067
Neg logit contributions
SOURCE-0.091
ALD-0.089
-0.088
heads-0.088
lance-0.088
Tokensea
Token ID8583
Feature activation+1.17e+00

Pos logit contributions
SOURCE+1.032
ALD+0.995
lance+0.992
+0.992
heads+0.988
Neg logit contributions
ering-1.052
 Dupl-0.987
iced-0.956
 Hoover-0.953
 Cox-0.947
Token
Token ID198
Feature activation-1.20e+00

Pos logit contributions
SOURCE+0.941
ALD+0.901
heads+0.897
unknown+0.885
lance+0.882
Neg logit contributions
ering-0.999
iced-0.935
 Dupl-0.932
 Hoover-0.922
 swallow-0.920
TokenAs
Token ID1722
Feature activation+2.82e+00

Pos logit contributions
SOURCE+0.830
ALD+0.805
+0.792
heads+0.778
lance+0.774
Neg logit contributions
ering-1.032
 Dupl-0.972
iced-0.954
 Hoover-0.953
 Cox-0.941
Token a
Token ID257
Feature activation-4.08e-01

Pos logit contributions
ering+1.741
 Dupl+1.581
iced+1.549
 Hoover+1.523
 Cox+1.504
Neg logit contributions
SOURCE-2.612
ALD-2.564
-2.540
vernment-2.529
lance-2.521
Token ball
Token ID2613
Feature activation+1.65e+00

Pos logit contributions
SOURCE+0.335
ALD+0.328
+0.327
lance+0.326
vernment+0.324
Neg logit contributions
ering-0.292
 Dupl-0.268
iced-0.264
 Hoover-0.257
 Cox-0.256
Token of
Token ID286
Feature activation+4.47e-01

Pos logit contributions
ering+1.250
 Dupl+1.157
iced+1.137
 Hoover+1.127
 Cox+1.114
Neg logit contributions
SOURCE-1.299
ALD-1.267
-1.257
vernment-1.244
lance-1.242
Token fire
Token ID2046
Feature activation-1.14e+01

Pos logit contributions
ering+0.346
 Dupl+0.321
iced+0.315
 Hoover+0.312
 Cox+0.308
Neg logit contributions
SOURCE-0.344
ALD-0.336
-0.333
vernment-0.329
lance-0.328
Token envelop
Token ID16441
Feature activation-5.42e-01

Pos logit contributions
heads+8.549
lance+8.123
SOURCE+8.081
+7.868
ALD+7.808
Neg logit contributions
ering-7.707
 Dupl-7.503
 Dollar-7.149
imaru-7.104
iced-7.027
Tokened
Token ID276
Feature activation+1.17e-01

Pos logit contributions
SOURCE+0.427
ALD+0.418
heads+0.412
+0.411
vernment+0.410
Neg logit contributions
ering-0.405
 Dupl-0.373
iced-0.364
 Hoover-0.362
 Cox-0.357
Token the
Token ID262
Feature activation-1.08e+00

Pos logit contributions
ering+0.076
 Dupl+0.069
iced+0.068
 Hoover+0.066
 Cox+0.065
Neg logit contributions
SOURCE-0.105
ALD-0.103
-0.102
vernment-0.102
lance-0.101
Token launch
Token ID4219
Feature activation-3.14e+00

Pos logit contributions
SOURCE+0.865
ALD+0.840
+0.836
heads+0.830
vernment+0.828
Neg logit contributions
ering-0.785
 Dupl-0.725
iced-0.720
 Hoover-0.693
 Cox-0.687
Token pad
Token ID14841
Feature activation-2.44e+00

Pos logit contributions
SOURCE+2.172
ALD+2.129
+2.112
lance+2.104
heads+2.101
Neg logit contributions
ering-2.596
 Dupl-2.465
iced-2.425
 Hoover-2.384
 Cox-2.338
Token
Token ID198
Feature activation-9.98e-01

Pos logit contributions
ering+0.267
 Dupl+0.244
iced+0.240
 Hoover+0.236
 Cox+0.232
Neg logit contributions
SOURCE-0.349
ALD-0.340
-0.336
vernment-0.334
lance-0.332
Token
Token ID198
Feature activation-9.72e-01

Pos logit contributions
SOURCE+0.782
ALD+0.750
heads+0.735
+0.731
lance+0.728
Neg logit contributions
ering-0.771
 Dupl-0.718
iced-0.712
 Hoover-0.708
 swallow-0.705
TokenLocal
Token ID14565
Feature activation+2.12e-01

Pos logit contributions
SOURCE+0.688
ALD+0.661
+0.646
lance+0.642
heads+0.640
Neg logit contributions
ering-0.825
 Dupl-0.778
 Hoover-0.761
iced-0.758
 Cox-0.749
Token tourist
Token ID18473
Feature activation+4.04e+00

Pos logit contributions
ering+0.149
 Dupl+0.138
iced+0.135
 Hoover+0.134
 Cox+0.131
Neg logit contributions
SOURCE-0.177
ALD-0.173
vernment-0.170
-0.170
lance-0.169
Token site
Token ID2524
Feature activation-9.84e-01

Pos logit contributions
ering+2.937
 Dupl+2.702
iced+2.655
 Hoover+2.631
 Cox+2.603
Neg logit contributions
SOURCE-3.312
ALD-3.230
-3.215
vernment-3.180
lance-3.168
Token Fort
Token ID6401
Feature activation-1.14e+01

Pos logit contributions
SOURCE+0.839
ALD+0.819
+0.805
heads+0.805
vernment+0.802
Neg logit contributions
ering-0.674
 Dupl-0.617
iced-0.611
 Hoover-0.597
 Cox-0.587
Token de
Token ID390
Feature activation-1.99e+00

Pos logit contributions
heads+9.465
ALD+9.423
SOURCE+9.422
unknown+9.403
+9.403
Neg logit contributions
ering-6.214
 Dupl-5.695
 messaging-5.612
iced-5.405
imaru-5.370
Token Bre
Token ID3719
Feature activation-1.00e+00

Pos logit contributions
SOURCE+1.745
ALD+1.701
+1.692
lance+1.689
vernment+1.671
Neg logit contributions
ering-1.296
 Dupl-1.181
iced-1.171
 Hoover-1.146
 swallow-1.122
Tokengan
Token ID1030
Feature activation+8.82e-04

Pos logit contributions
SOURCE+0.830
ALD+0.799
lance+0.799
+0.799
heads+0.798
Neg logit contributions
ering-0.707
 Dupl-0.658
 Hoover-0.646
iced-0.641
 swallow-0.627
Tokencon
Token ID1102
Feature activation-3.37e+00

Pos logit contributions
ering+0.001
 Dupl+0.001
iced+0.001
 Hoover+0.001
 Cox+0.001
Neg logit contributions
SOURCE-0.001
ALD-0.001
-0.001
vernment-0.001
lance-0.001
Token,
Token ID11
Feature activation-1.54e+00

Pos logit contributions
SOURCE+2.749
ALD+2.706
heads+2.664
vernment+2.646
lance+2.645
Neg logit contributions
ering-2.345
 Dupl-2.199
 Hoover-2.118
iced-2.108
 swallow-2.076
Token India
Token ID3794
Feature activation+1.68e+00

Pos logit contributions
ering+2.414
 Dupl+2.219
iced+2.165
 Hoover+2.151
 Cox+2.127
Neg logit contributions
SOURCE-3.066
ALD-2.997
-2.979
vernment-2.951
lance-2.942
Token`
Token ID63
Feature activation-3.67e+00

Pos logit contributions
ering+1.250
 Dupl+1.156
iced+1.136
 Hoover+1.126
 Cox+1.112
Neg logit contributions
SOURCE-1.344
ALD-1.310
-1.300
vernment-1.290
lance-1.284
Tokens
Token ID82
Feature activation+1.98e+00

Pos logit contributions
SOURCE+2.764
ALD+2.664
vernment+2.637
heads+2.626
lance+2.613
Neg logit contributions
ering-2.866
 Dupl-2.670
 Hoover-2.614
iced-2.603
 Cox-2.560
Token two
Token ID734
Feature activation+4.16e+00

Pos logit contributions
ering+1.344
 Dupl+1.225
iced+1.210
 Hoover+1.191
 Cox+1.177
Neg logit contributions
SOURCE-1.717
ALD-1.680
-1.663
vernment-1.658
lance-1.651
Token prominent
Token ID9208
Feature activation+1.98e+00

Pos logit contributions
ering+2.707
 Dupl+2.470
iced+2.420
 Hoover+2.398
 Cox+2.362
Neg logit contributions
SOURCE-3.729
ALD-3.646
-3.626
vernment-3.589
lance-3.585
Token Islamic
Token ID5533
Feature activation-1.14e+01

Pos logit contributions
ering+1.377
 Dupl+1.254
iced+1.238
 Hoover+1.211
 Cox+1.199
Neg logit contributions
SOURCE-1.683
ALD-1.645
vernment-1.626
-1.624
heads-1.622
Token hel
Token ID932
Feature activation+8.11e-01

Pos logit contributions
heads+8.275
SOURCE+8.272
vernment+8.010
ighters+7.750
lance+7.738
Neg logit contributions
ering-8.084
iced-7.313
 Dupl-7.148
arget-7.097
imaru-6.970
Tokenpl
Token ID489
Feature activation-1.33e+00

Pos logit contributions
ering+0.482
 Dupl+0.436
iced+0.425
 Hoover+0.423
 Cox+0.415
Neg logit contributions
SOURCE-0.768
ALD-0.750
-0.748
vernment-0.746
lance-0.746
Tokenines
Token ID1127
Feature activation-2.15e+00

Pos logit contributions
SOURCE+0.988
ALD+0.956
lance+0.955
heads+0.954
+0.951
Neg logit contributions
ering-1.042
 Dupl-0.985
 Hoover-0.961
iced-0.951
 Cox-0.938
Token are
Token ID389
Feature activation-7.95e-01

Pos logit contributions
SOURCE+1.710
vernment+1.653
ALD+1.652
heads+1.643
lance+1.639
Neg logit contributions
ering-1.578
 Dupl-1.454
iced-1.438
 Hoover-1.431
 swallow-1.394
Token discouraging
Token ID48001
Feature activation+1.33e+00

Pos logit contributions
SOURCE+0.682
ALD+0.661
heads+0.658
vernment+0.648
+0.646
Neg logit contributions
ering-0.533
 Dupl-0.496
iced-0.490
 Hoover-0.479
 Cox-0.469
Token on
Token ID319
Feature activation+1.30e+00

Pos logit contributions
SOURCE+2.855
heads+2.790
ALD+2.778
vernment+2.742
lance+2.723
Neg logit contributions
ering-2.256
 Dupl-2.153
iced-2.071
 Hoover-2.043
 Cox-1.996
Token Mount
Token ID5628
Feature activation-7.29e-01

Pos logit contributions
ering+0.855
 Dupl+0.784
iced+0.768
 Hoover+0.755
 Cox+0.744
Neg logit contributions
SOURCE-1.150
ALD-1.121
-1.112
vernment-1.112
heads-1.102
Token Sin
Token ID10884
Feature activation+2.74e+00

Pos logit contributions
SOURCE+0.745
ALD+0.728
+0.727
lance+0.722
vernment+0.722
Neg logit contributions
ering-0.380
 Dupl-0.337
iced-0.331
 Hoover-0.322
 Cox-0.315
Tokenjar
Token ID9491
Feature activation+2.31e-01

Pos logit contributions
ering+1.725
 Dupl+1.566
iced+1.536
 Hoover+1.517
 Cox+1.496
Neg logit contributions
SOURCE-2.512
ALD-2.458
-2.443
vernment-2.421
lance-2.414
Token from
Token ID422
Feature activation+4.27e-01

Pos logit contributions
ering+0.167
 Dupl+0.154
iced+0.151
 Hoover+0.150
 Cox+0.148
Neg logit contributions
SOURCE-0.189
ALD-0.185
-0.183
vernment-0.182
lance-0.181
Token Islamic
Token ID5533
Feature activation-1.13e+01

Pos logit contributions
ering+0.304
 Dupl+0.281
iced+0.278
 Hoover+0.271
 Cox+0.268
Neg logit contributions
SOURCE-0.354
ALD-0.344
vernment-0.342
-0.341
heads-0.339
Token State
Token ID1812
Feature activation-3.63e+00

Pos logit contributions
heads+6.661
lance+6.454
SOURCE+6.410
vernment+6.333
unknown+6.239
Neg logit contributions
ering-9.216
iced-9.137
 Dupl-8.814
arget-8.746
imaru-8.550
Token in
Token ID287
Feature activation-9.04e-01

Pos logit contributions
SOURCE+2.730
heads+2.673
lance+2.635
ALD+2.632
vernment+2.628
Neg logit contributions
ering-2.759
iced-2.598
 Dupl-2.582
 Hoover-2.522
 Cox-2.476
Token Iraq
Token ID3908
Feature activation-6.99e+00

Pos logit contributions
SOURCE+0.783
vernment+0.766
+0.764
ALD+0.763
lance+0.763
Neg logit contributions
ering-0.592
 Dupl-0.550
iced-0.547
 Hoover-0.529
 swallow-0.520
Token and
Token ID290
Feature activation+3.78e-01

Pos logit contributions
SOURCE+5.761
vernment+5.694
+5.665
ALD+5.663
heads+5.661
Neg logit contributions
ering-4.648
 Dupl-4.329
iced-4.284
 Hoover-4.228
 Cox-4.139
Token Syria
Token ID4392
Feature activation-8.23e-01

Pos logit contributions
ering+0.264
 Dupl+0.243
iced+0.238
 Hoover+0.236
 Cox+0.233
Neg logit contributions
SOURCE-0.320
ALD-0.313
-0.311
vernment-0.308
lance-0.307
Token October
Token ID3267
Feature activation-6.43e+00

Pos logit contributions
ering+0.325
 Dupl+0.299
iced+0.296
 Hoover+0.288
 swallow+0.284
Neg logit contributions
SOURCE-0.357
ALD-0.345
heads-0.342
lance-0.342
-0.341
Token.
Token ID13
Feature activation-1.43e-01

Pos logit contributions
SOURCE+4.825
heads+4.662
+4.640
ALD+4.587
lance+4.516
Neg logit contributions
ering-4.797
 Dupl-4.493
iced-4.373
 Hoover-4.342
imaru-4.276
Token Photograph
Token ID18903
Feature activation-1.46e+00

Pos logit contributions
SOURCE+0.136
ALD+0.133
+0.133
lance+0.130
heads+0.130
Neg logit contributions
ering-0.086
 Dupl-0.077
iced-0.076
 Hoover-0.075
 swallow-0.073
Token:
Token ID25
Feature activation+2.24e+00

Pos logit contributions
SOURCE+1.027
heads+0.981
ALD+0.980
lance+0.959
unknown+0.958
Neg logit contributions
ering-1.195
 Dupl-1.127
 Hoover-1.111
iced-1.109
 swallow-1.107
Token Clive
Token ID47722
Feature activation-1.79e+00

Pos logit contributions
ering+1.398
 Dupl+1.268
iced+1.244
 Hoover+1.227
 Cox+1.209
Neg logit contributions
SOURCE-2.064
ALD-2.020
-2.008
vernment-1.988
lance-1.984
Token Brun
Token ID15700
Feature activation+6.77e+00

Pos logit contributions
SOURCE+1.765
ALD+1.712
+1.708
heads+1.707
lance+1.699
Neg logit contributions
ering-0.975
iced-0.885
 Dupl-0.870
 Hoover-0.823
 swallow-0.813
Tokenskill
Token ID42401
Feature activation+1.25e+00

Pos logit contributions
ering+4.352
iced+3.857
 Dupl+3.846
 Cox+3.834
 Hoover+3.673
Neg logit contributions
SOURCE-6.276
-6.074
ALD-6.062
vernment-6.024
ITNESS-6.005
Token/
Token ID14
Feature activation-1.43e+00

Pos logit contributions
ering+1.009
 Dupl+0.937
iced+0.921
 Hoover+0.914
 Cox+0.907
Neg logit contributions
SOURCE-0.932
ALD-0.906
-0.902
vernment-0.892
lance-0.889
TokenGetty
Token ID6633
Feature activation-4.05e+00

Pos logit contributions
SOURCE+0.955
ALD+0.923
+0.896
heads+0.892
lance+0.881
Neg logit contributions
ering-1.266
 Dupl-1.187
iced-1.168
 Hoover-1.166
 swallow-1.151
Token Images
Token ID5382
Feature activation+1.23e+00

Pos logit contributions
SOURCE+3.779
ALD+3.704
+3.656
heads+3.638
lance+3.635
Neg logit contributions
ering-2.447
 Dupl-2.216
iced-2.181
 Hoover-2.148
 Cox-2.099
Token
Token ID198
Feature activation+6.02e-01

Pos logit contributions
ering+0.859
 Dupl+0.786
iced+0.767
 Hoover+0.760
 Cox+0.758
Neg logit contributions
SOURCE-1.052
ALD-1.028
-1.026
vernment-1.017
lance-1.011
Token expectations
Token ID9027
Feature activation+2.31e+00

Pos logit contributions
ering+1.650
 Dupl+1.516
iced+1.483
 Hoover+1.461
 Cox+1.444
Neg logit contributions
SOURCE-2.271
ALD-2.219
-2.199
vernment-2.183
lance-2.179
Token,
Token ID11
Feature activation+1.37e-01

Pos logit contributions
ering+1.567
 Dupl+1.429
iced+1.405
 Hoover+1.389
 Cox+1.372
Neg logit contributions
SOURCE-1.994
ALD-1.959
-1.952
vernment-1.924
ITNESS-1.919
Token the
Token ID262
Feature activation+4.98e-01

Pos logit contributions
ering+0.089
 Dupl+0.082
iced+0.081
 Hoover+0.079
 Cox+0.078
Neg logit contributions
SOURCE-0.121
ALD-0.118
heads-0.116
lance-0.116
vernment-0.116
Token Canadian
Token ID5398
Feature activation+4.67e+00

Pos logit contributions
ering+0.324
 Dupl+0.295
iced+0.294
 Hoover+0.284
 Cox+0.280
Neg logit contributions
SOURCE-0.443
ALD-0.432
-0.426
heads-0.424
vernment-0.422
Token housing
Token ID5627
Feature activation+4.56e+00

Pos logit contributions
ering+2.270
 Dupl+1.997
iced+1.944
 Hoover+1.913
 Cox+1.878
Neg logit contributions
SOURCE-4.955
ALD-4.862
-4.838
vernment-4.802
lance-4.788
Token market
Token ID1910
Feature activation+7.72e+00

Pos logit contributions
ering+2.605
 Dupl+2.330
iced+2.277
 Hoover+2.254
 Cox+2.217
Neg logit contributions
SOURCE-4.458
ALD-4.358
-4.347
vernment-4.319
lance-4.293
Token has
Token ID468
Feature activation+1.36e+00

Pos logit contributions
ering+5.156
 Dupl+4.536
iced+4.535
 Hoover+4.519
 Cox+4.487
Neg logit contributions
SOURCE-6.688
ALD-6.556
-6.533
ITNESS-6.481
vernment-6.453
Token just
Token ID655
Feature activation-6.61e-01

Pos logit contributions
ering+1.005
 Dupl+0.942
iced+0.913
 Hoover+0.899
 Cox+0.893
Neg logit contributions
SOURCE-1.091
ALD-1.065
-1.053
vernment-1.045
heads-1.043
Token kept
Token ID4030
Feature activation-2.42e-01

Pos logit contributions
SOURCE+0.528
ALD+0.521
+0.511
heads+0.506
vernment+0.502
Neg logit contributions
ering-0.484
 Dupl-0.456
 Hoover-0.435
iced-0.432
 Cox-0.427
Token on
Token ID319
Feature activation+2.04e+00

Pos logit contributions
SOURCE+0.196
ALD+0.190
heads+0.186
+0.186
lance+0.185
Neg logit contributions
ering-0.173
 Dupl-0.163
iced-0.161
 Hoover-0.157
 Cox-0.156
Token rolling
Token ID10708
Feature activation+4.21e+00

Pos logit contributions
ering+1.524
 Dupl+1.417
iced+1.390
 Hoover+1.369
 Cox+1.349
Neg logit contributions
SOURCE-1.629
ALD-1.587
-1.561
lance-1.548
heads-1.548
Token player
Token ID2137
Feature activation-1.58e+00

Pos logit contributions
SOURCE+2.892
ALD+2.843
+2.836
heads+2.835
lance+2.827
Neg logit contributions
ering-2.394
 Dupl-2.233
iced-2.211
 Hoover-2.147
 Cox-2.114
Token,
Token ID11
Feature activation-1.77e-01

Pos logit contributions
SOURCE+1.293
ALD+1.271
+1.264
vernment+1.255
lance+1.253
Neg logit contributions
ering-1.125
 Dupl-1.039
iced-1.021
 Hoover-1.006
 Cox-0.990
Token former
Token ID1966
Feature activation-3.21e+00

Pos logit contributions
SOURCE+0.159
ALD+0.157
vernment+0.155
+0.155
lance+0.154
Neg logit contributions
ering-0.112
 Dupl-0.102
iced-0.102
 Hoover-0.098
 swallow-0.096
Token St
Token ID520
Feature activation+5.63e+00

Pos logit contributions
SOURCE+2.855
ALD+2.825
+2.795
heads+2.795
lance+2.792
Neg logit contributions
ering-1.976
iced-1.824
 Dupl-1.775
imaru-1.679
 swallow-1.663
Token.
Token ID13
Feature activation+6.16e+00

Pos logit contributions
ering+3.898
 Dupl+3.557
iced+3.491
 Cox+3.473
 Hoover+3.452
Neg logit contributions
SOURCE-4.790
-4.632
ALD-4.624
vernment-4.560
lance-4.558
Token Louis
Token ID5593
Feature activation+7.10e+00

Pos logit contributions
ering+3.635
 Dupl+3.291
 Hoover+3.204
 Cox+3.197
iced+3.177
Neg logit contributions
SOURCE-5.860
-5.699
ALD-5.689
vernment-5.676
ITNESS-5.615
Token Rams
Token ID16077
Feature activation+3.32e+00

Pos logit contributions
ering+4.354
 Dupl+3.991
 Cox+3.904
 Hoover+3.860
 Perkins+3.770
Neg logit contributions
SOURCE-6.475
ALD-6.325
-6.290
vernment-6.202
ITNESS-6.180
Token kicker
Token ID38297
Feature activation+1.41e-02

Pos logit contributions
ering+2.231
 Dupl+2.041
iced+1.982
 Hoover+1.981
 Cox+1.970
Neg logit contributions
SOURCE-2.903
ALD-2.828
-2.819
vernment-2.792
lance-2.766
Token Brett
Token ID18726
Feature activation+5.47e+00

Pos logit contributions
ering+0.008
 Dupl+0.007
iced+0.007
 Hoover+0.007
 Cox+0.007
Neg logit contributions
SOURCE-0.013
ALD-0.013
-0.013
lance-0.013
vernment-0.013
Token Ba
Token ID8999
Feature activation+2.75e+00

Pos logit contributions
ering+3.389
 Dupl+3.101
 Hoover+3.048
 Cox+3.024
iced+2.999
Neg logit contributions
SOURCE-5.063
ALD-4.933
-4.883
vernment-4.863
ITNESS-4.839
Tokener
Token ID263
Feature activation-4.13e-01

Pos logit contributions
ering+1.746
 Dupl+1.595
iced+1.557
 Hoover+1.542
 Cox+1.521
Neg logit contributions
SOURCE-2.497
ALD-2.447
-2.426
vernment-2.409
lance-2.400
Token {
Token ID1391
Feature activation-7.12e-02

Pos logit contributions
SOURCE+1.420
ALD+1.389
+1.380
vernment+1.368
lance+1.365
Neg logit contributions
ering-1.015
 Dupl-0.924
iced-0.906
 Hoover-0.894
 Cox-0.882
Token(
Token ID7
Feature activation-3.04e+00

Pos logit contributions
SOURCE+0.047
ALD+0.044
+0.044
heads+0.043
lance+0.043
Neg logit contributions
ering-0.064
 Dupl-0.060
 Hoover-0.060
iced-0.060
 swallow-0.059
Token171
Token ID27192
Feature activation-5.99e+00

Pos logit contributions
SOURCE+2.524
ALD+2.454
+2.434
heads+2.404
unknown+2.390
Neg logit contributions
ering-2.157
 Dupl-1.973
iced-1.957
 Hoover-1.909
 swallow-1.904
Tokens
Token ID82
Feature activation-3.37e-01

Pos logit contributions
heads+4.040
SOURCE+3.985
lance+3.823
unknown+3.756
ALD+3.739
Neg logit contributions
ering-4.933
 Dupl-4.635
 Hoover-4.597
imaru-4.551
 swallow-4.538
Token)}
Token ID38165
Feature activation+1.65e+00

Pos logit contributions
SOURCE+0.206
ALD+0.198
lance+0.195
heads+0.193
+0.193
Neg logit contributions
ering-0.315
 Dupl-0.296
iced-0.293
 Hoover-0.292
 Cox-0.288
Token 12
Token ID1105
Feature activation+7.28e+00

Pos logit contributions
ering+0.993
 Dupl+0.895
iced+0.874
 Hoover+0.862
 Cox+0.851
Neg logit contributions
SOURCE-1.565
ALD-1.532
-1.529
vernment-1.512
lance-1.512
Token.
Token ID13
Feature activation+3.14e+00

Pos logit contributions
ering+5.042
 Dupl+4.631
iced+4.472
 Hoover+4.434
 Cox+4.419
Neg logit contributions
SOURCE-6.082
vernment-6.011
-6.001
ALD-5.998
lance-5.895
Token Na
Token ID11013
Feature activation-2.95e+00

Pos logit contributions
ering+2.227
 Dupl+2.044
iced+2.011
 Hoover+1.989
 Cox+1.964
Neg logit contributions
SOURCE-2.632
ALD-2.572
-2.551
vernment-2.527
lance-2.521
Token3
Token ID18
Feature activation+1.01e+00

Pos logit contributions
SOURCE+2.223
ALD+2.174
heads+2.129
+2.118
lance+2.104
Neg logit contributions
ering-2.260
 Dupl-2.134
 Hoover-2.123
 Cox-2.069
 swallow-2.059
Token {
Token ID1391
Feature activation-7.93e-01

Pos logit contributions
ering+0.646
 Dupl+0.589
iced+0.575
 Hoover+0.567
 Cox+0.561
Neg logit contributions
SOURCE-0.924
ALD-0.903
-0.897
vernment-0.890
lance-0.887
Token(
Token ID7
Feature activation-3.42e+00

Pos logit contributions
SOURCE+0.511
ALD+0.484
+0.478
heads+0.471
lance+0.471
Neg logit contributions
ering-0.725
 Dupl-0.681
 Hoover-0.679
iced-0.674
 swallow-0.670
Token especially
Token ID2592
Feature activation+1.82e+00

Pos logit contributions
SOURCE+0.525
ALD+0.513
+0.511
vernment+0.504
heads+0.501
Neg logit contributions
ering-0.435
 Dupl-0.403
iced-0.395
 Hoover-0.389
 Cox-0.383
Token when
Token ID618
Feature activation+1.39e+00

Pos logit contributions
ering+1.200
 Dupl+1.098
iced+1.077
 Hoover+1.063
 Cox+1.050
Neg logit contributions
SOURCE-1.619
ALD-1.585
-1.572
vernment-1.562
lance-1.553
Token it
Token ID340
Feature activation+6.03e-02

Pos logit contributions
ering+0.892
 Dupl+0.816
iced+0.802
 Hoover+0.789
 Cox+0.779
Neg logit contributions
SOURCE-1.249
ALD-1.221
-1.213
vernment-1.203
heads-1.197
Token is
Token ID318
Feature activation+1.06e+00

Pos logit contributions
ering+0.041
 Dupl+0.038
iced+0.037
 Hoover+0.037
 Cox+0.036
Neg logit contributions
SOURCE-0.052
-0.051
ALD-0.051
vernment-0.050
heads-0.050
Token put
Token ID1234
Feature activation+1.33e+00

Pos logit contributions
ering+0.747
 Dupl+0.689
iced+0.679
 Hoover+0.665
 Cox+0.660
Neg logit contributions
SOURCE-0.895
ALD-0.876
-0.865
vernment-0.859
heads-0.856
Token so
Token ID523
Feature activation+7.42e+00

Pos logit contributions
ering+0.841
 Dupl+0.763
iced+0.748
 Hoover+0.739
 Cox+0.729
Neg logit contributions
SOURCE-1.212
ALD-1.186
-1.178
vernment-1.168
lance-1.165
Token bluntly
Token ID44514
Feature activation+3.16e-01

Pos logit contributions
ering+4.203
 Dupl+3.806
 Hoover+3.683
iced+3.677
 Cox+3.636
Neg logit contributions
SOURCE-7.029
ALD-6.905
vernment-6.873
lance-6.865
-6.849
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
ering+0.230
 Dupl+0.212
iced+0.208
 Hoover+0.206
 Cox+0.203
Neg logit contributions
SOURCE-0.259
ALD-0.253
-0.251
vernment-0.248
lance-0.247
Token And
Token ID843
Feature activation+2.98e+00

Pos logit contributions
SOURCE+1.060
ALD+1.033
+1.028
heads+1.010
ITNESS+1.004
Neg logit contributions
ering-1.007
iced-0.929
 Dupl-0.901
 swallow-0.897
 messaging-0.881
Token as
Token ID355
Feature activation+5.12e+00

Pos logit contributions
ering+1.968
 Dupl+1.799
iced+1.765
 Hoover+1.743
 Cox+1.720
Neg logit contributions
SOURCE-2.639
ALD-2.580
-2.563
vernment-2.539
lance-2.534
Token my
Token ID616
Feature activation+2.03e+00

Pos logit contributions
ering+3.244
 Dupl+2.945
iced+2.885
 Hoover+2.864
 Cox+2.820
Neg logit contributions
SOURCE-4.669
ALD-4.569
-4.541
vernment-4.496
lance-4.479
Token keep
Token ID1394
Feature activation-3.65e+00

Pos logit contributions
ering+0.490
 Dupl+0.455
iced+0.446
 Hoover+0.439
 Cox+0.434
Neg logit contributions
SOURCE-0.568
ALD-0.554
-0.552
vernment-0.547
lance-0.546
Token your
Token ID534
Feature activation-3.78e-01

Pos logit contributions
SOURCE+2.931
heads+2.884
ALD+2.874
+2.838
ITNESS+2.830
Neg logit contributions
ering-2.522
 Dupl-2.326
iced-2.297
imaru-2.242
oner-2.240
Token road
Token ID2975
Feature activation+9.88e-01

Pos logit contributions
SOURCE+0.314
ALD+0.306
+0.305
heads+0.304
vernment+0.300
Neg logit contributions
ering-0.264
 Dupl-0.243
iced-0.242
 Hoover-0.233
imaru-0.232
Token trip
Token ID5296
Feature activation+4.20e+00

Pos logit contributions
ering+0.725
 Dupl+0.675
iced+0.661
 Hoover+0.656
 Cox+0.649
Neg logit contributions
SOURCE-0.796
ALD-0.776
-0.769
heads-0.766
lance-0.764
Token in
Token ID287
Feature activation+8.38e-01

Pos logit contributions
ering+2.845
 Dupl+2.598
iced+2.551
 Hoover+2.526
 Cox+2.494
Neg logit contributions
SOURCE-3.649
ALD-3.565
-3.543
vernment-3.517
lance-3.502
Token the
Token ID262
Feature activation+1.48e+00

Pos logit contributions
ering+0.574
 Dupl+0.524
iced+0.516
 Hoover+0.507
 Cox+0.505
Neg logit contributions
SOURCE-0.720
ALD-0.702
-0.700
vernment-0.691
lance-0.691
Token right
Token ID826
Feature activation+7.65e-01

Pos logit contributions
ering+1.059
 Dupl+0.967
iced+0.955
 Hoover+0.936
 Cox+0.926
Neg logit contributions
SOURCE-1.230
ALD-1.195
-1.185
lance-1.179
heads-1.178
Token direction
Token ID4571
Feature activation-4.66e+00

Pos logit contributions
ering+0.550
 Dupl+0.507
iced+0.498
 Hoover+0.492
 Cox+0.486
Neg logit contributions
SOURCE-0.633
ALD-0.617
-0.613
vernment-0.606
lance-0.606
Token stay
Token ID2652
Feature activation-1.70e+00

Pos logit contributions
heads+3.451
SOURCE+3.443
+3.411
vernment+3.395
ALD+3.389
Neg logit contributions
ering-3.471
 Dupl-3.214
iced-3.196
 Hoover-3.107
arget-3.076
Token at
Token ID379
Feature activation+7.25e-01

Pos logit contributions
SOURCE+1.420
heads+1.410
ALD+1.408
+1.391
vernment+1.379
Neg logit contributions
ering-1.131
 Dupl-1.061
 Cox-1.016
iced-1.014
 Hoover-1.013
Token Hart
Token ID11345
Feature activation+2.22e+00

Pos logit contributions
ering+0.467
 Dupl+0.422
iced+0.418
 Hoover+0.405
 Cox+0.400
Neg logit contributions
SOURCE-0.650
ALD-0.638
-0.632
vernment-0.632
lance-0.626
TokenWhen
Token ID2215
Feature activation+5.81e-01

Pos logit contributions
SOURCE+0.436
ALD+0.421
+0.413
heads+0.413
lance+0.411
Neg logit contributions
ering-0.549
 Dupl-0.518
 Hoover-0.508
iced-0.508
 Cox-0.501
Token he
Token ID339
Feature activation-6.39e-01

Pos logit contributions
ering+0.384
 Dupl+0.354
iced+0.348
 Hoover+0.341
 Cox+0.336
Neg logit contributions
SOURCE-0.509
ALD-0.496
lance-0.493
-0.491
heads-0.491
Token himself
Token ID2241
Feature activation+6.64e-01

Pos logit contributions
SOURCE+0.516
ALD+0.502
heads+0.502
lance+0.500
vernment+0.497
Neg logit contributions
ering-0.454
 Dupl-0.430
iced-0.423
 Hoover-0.416
 Cox-0.412
Token might
Token ID1244
Feature activation-2.50e+00

Pos logit contributions
ering+0.477
 Dupl+0.448
iced+0.442
 Hoover+0.435
 Cox+0.428
Neg logit contributions
SOURCE-0.539
ALD-0.523
heads-0.523
lance-0.519
vernment-0.517
Token his
Token ID465
Feature activation-1.23e+00

Pos logit contributions
SOURCE+1.988
lance+1.941
+1.939
heads+1.936
ALD+1.935
Neg logit contributions
ering-1.718
 Dupl-1.682
iced-1.644
 Hoover-1.620
 Cox-1.611
Token quiet
Token ID5897
Feature activation+1.01e+00

Pos logit contributions
SOURCE+1.070
ALD+1.051
+1.044
lance+1.040
heads+1.034
Neg logit contributions
ering-0.808
 Dupl-0.752
iced-0.740
 Hoover-0.725
 Cox-0.715
Tokenus
Token ID385
Feature activation-1.04e+00

Pos logit contributions
ering+0.669
 Dupl+0.622
iced+0.609
 Hoover+0.600
 Cox+0.592
Neg logit contributions
SOURCE-0.876
ALD-0.854
-0.848
lance-0.845
vernment-0.843
Token make
Token ID787
Feature activation-4.02e+00

Pos logit contributions
SOURCE+0.876
ALD+0.852
+0.849
heads+0.847
lance+0.844
Neg logit contributions
ering-0.713
 Dupl-0.670
iced-0.655
 Hoover-0.647
 Cox-0.638
Token
Token ID198
Feature activation-9.69e-01

Pos logit contributions
SOURCE+3.228
lance+3.121
heads+3.119
+3.105
ALD+3.083
Neg logit contributions
ering-2.789
 Dupl-2.620
iced-2.618
 Hoover-2.510
 Cox-2.495
Token
Token ID198
Feature activation-1.12e+00

Pos logit contributions
SOURCE+0.733
heads+0.698
ALD+0.698
unknown+0.688
lance+0.687
Neg logit contributions
ering-0.760
 Dupl-0.709
iced-0.708
 Hoover-0.701
 swallow-0.690
TokenWith
Token ID3152
Feature activation-1.82e+00

Pos logit contributions
SOURCE+0.795
ALD+0.763
+0.758
lance+0.748
heads+0.747
Neg logit contributions
ering-0.938
 Dupl-0.883
iced-0.870
 Hoover-0.866
 Cox-0.854
Token been
Token ID587
Feature activation-1.73e-01

Pos logit contributions
SOURCE+1.492
ALD+1.446
heads+1.431
vernment+1.428
+1.426
Neg logit contributions
ering-1.151
 Dupl-1.052
iced-1.029
 Hoover-1.015
 Cox-0.996
Token painted
Token ID13055
Feature activation-3.17e-01

Pos logit contributions
SOURCE+0.151
ALD+0.146
heads+0.145
+0.143
vernment+0.143
Neg logit contributions
ering-0.115
 Dupl-0.106
iced-0.104
 Hoover-0.102
 Cox-0.100
Token as
Token ID355
Feature activation+2.14e+00

Pos logit contributions
SOURCE+0.270
ALD+0.265
+0.262
heads+0.260
lance+0.259
Neg logit contributions
ering-0.216
 Dupl-0.199
iced-0.195
 Hoover-0.194
 Cox-0.191
Token a
Token ID257
Feature activation-6.19e-01

Pos logit contributions
ering+1.355
 Dupl+1.230
iced+1.206
 Hoover+1.190
 Cox+1.172
Neg logit contributions
SOURCE-1.956
ALD-1.913
-1.897
vernment-1.883
lance-1.880
Token secret
Token ID3200
Feature activation-3.44e+00

Pos logit contributions
SOURCE+0.551
ALD+0.537
+0.532
lance+0.532
heads+0.532
Neg logit contributions
ering-0.399
 Dupl-0.357
iced-0.353
 Hoover-0.346
 Cox-0.342
Token baby
Token ID5156
Feature activation+1.88e-01

Pos logit contributions
SOURCE+2.818
ALD+2.745
lance+2.732
vernment+2.716
heads+2.686
Neg logit contributions
ering-2.368
 Dupl-2.157
iced-2.135
 Hoover-2.082
imaru-2.082
Token incest
Token ID35961
Feature activation+3.19e+00

Pos logit contributions
ering+0.123
 Dupl+0.112
iced+0.110
 Hoover+0.108
 Cox+0.107
Neg logit contributions
SOURCE-0.167
ALD-0.163
heads-0.162
-0.162
vernment-0.161
Token weird
Token ID7650
Feature activation+5.20e-01

Pos logit contributions
ering+2.101
 Dupl+1.917
iced+1.880
 Hoover+1.858
 Cox+1.834
Neg logit contributions
SOURCE-2.829
ALD-2.766
-2.748
vernment-2.723
lance-2.715
Tokeno
Token ID78
Feature activation-1.64e+00

Pos logit contributions
ering+0.228
 Dupl+0.200
iced+0.193
 Hoover+0.189
 Cox+0.186
Neg logit contributions
SOURCE-0.572
ALD-0.563
-0.560
lance-0.558
vernment-0.557
Token.
Token ID13
Feature activation-7.93e-01

Pos logit contributions
SOURCE+1.271
ALD+1.239
heads+1.237
vernment+1.236
+1.228
Neg logit contributions
ering-1.214
 Dupl-1.131
iced-1.112
 Hoover-1.100
 Cox-1.088
Token And
Token ID843
Feature activation+2.92e+00

Pos logit contributions
SOURCE+0.639
ALD+0.618
+0.608
heads+0.606
lance+0.599
Neg logit contributions
ering-0.593
iced-0.544
 Dupl-0.528
 swallow-0.524
 messaging-0.517
Token years
Token ID812
Feature activation-3.35e+00

Pos logit contributions
SOURCE+1.797
vernment+1.740
ALD+1.739
heads+1.730
lance+1.716
Neg logit contributions
ering-1.441
 Dupl-1.319
iced-1.315
 Hoover-1.273
arget-1.256
Token to
Token ID284
Feature activation+2.03e-01

Pos logit contributions
SOURCE+2.822
vernment+2.752
heads+2.749
ALD+2.724
+2.693
Neg logit contributions
ering-2.259
 Dupl-2.085
iced-2.024
 Hoover-2.016
arget-1.963
Token become
Token ID1716
Feature activation-1.27e+00

Pos logit contributions
ering+0.157
 Dupl+0.148
iced+0.146
 Hoover+0.144
 Cox+0.141
Neg logit contributions
SOURCE-0.153
vernment-0.147
ALD-0.147
heads-0.145
-0.145
Token a
Token ID257
Feature activation+2.80e-03

Pos logit contributions
SOURCE+1.112
ALD+1.095
vernment+1.083
+1.077
lance+1.073
Neg logit contributions
ering-0.833
 Dupl-0.758
iced-0.747
 Hoover-0.734
 Cox-0.720
Token Canadian
Token ID5398
Feature activation+1.97e+00

Pos logit contributions
ering+0.002
 Dupl+0.002
iced+0.002
 Hoover+0.002
 Cox+0.002
Neg logit contributions
SOURCE-0.002
ALD-0.002
-0.002
vernment-0.002
lance-0.002
Token citizen
Token ID9511
Feature activation+2.37e+00

Pos logit contributions
ering+1.334
 Dupl+1.219
iced+1.197
 Hoover+1.189
 Cox+1.165
Neg logit contributions
SOURCE-1.709
ALD-1.674
-1.659
vernment-1.646
lance-1.642
Token has
Token ID468
Feature activation-1.58e+00

Pos logit contributions
ering+1.704
 Dupl+1.566
iced+1.539
 Hoover+1.522
 Cox+1.507
Neg logit contributions
SOURCE-1.965
ALD-1.920
-1.906
vernment-1.885
lance-1.881
Token been
Token ID587
Feature activation-7.46e-01

Pos logit contributions
SOURCE+1.229
ALD+1.184
heads+1.177
vernment+1.174
+1.156
Neg logit contributions
ering-1.180
 Dupl-1.112
iced-1.088
 Hoover-1.074
imaru-1.051
Token charged
Token ID5047
Feature activation-2.83e+00

Pos logit contributions
SOURCE+0.593
heads+0.581
ALD+0.580
vernment+0.565
+0.560
Neg logit contributions
ering-0.538
 Dupl-0.509
iced-0.495
 Hoover-0.489
 Cox-0.478
Token with
Token ID351
Feature activation-1.51e+00

Pos logit contributions
SOURCE+2.369
ALD+2.301
heads+2.301
lance+2.288
vernment+2.278
Neg logit contributions
ering-1.932
 Dupl-1.791
iced-1.766
 Hoover-1.751
 Cox-1.701
Token 83
Token ID9698
Feature activation-8.62e-01

Pos logit contributions
SOURCE+1.241
ALD+1.197
vernment+1.189
heads+1.185
+1.171
Neg logit contributions
ering-1.085
iced-1.001
 Dupl-0.992
 Hoover-0.978
 Cox-0.941
Token size
Token ID2546
Feature activation+2.64e-01

Pos logit contributions
ering+0.789
 Dupl+0.709
iced+0.694
 Hoover+0.683
 Cox+0.672
Neg logit contributions
SOURCE-1.430
ALD-1.399
-1.389
lance-1.378
heads-1.377
Token increase
Token ID2620
Feature activation+4.47e-01

Pos logit contributions
ering+0.179
 Dupl+0.164
iced+0.163
 Hoover+0.161
 Cox+0.158
Neg logit contributions
SOURCE-0.227
ALD-0.221
heads-0.220
-0.219
vernment-0.218
Token proposals
Token ID11628
Feature activation+1.77e+00

Pos logit contributions
ering+0.307
 Dupl+0.284
iced+0.283
 Hoover+0.278
 Cox+0.273
Neg logit contributions
SOURCE-0.377
ALD-0.369
heads-0.366
-0.366
vernment-0.365
Token (
Token ID357
Feature activation-1.48e+00

Pos logit contributions
ering+1.263
 Dupl+1.162
iced+1.145
 Hoover+1.132
 Cox+1.116
Neg logit contributions
SOURCE-1.471
ALD-1.434
-1.423
vernment-1.410
lance-1.407
Tokensuch
Token ID10508
Feature activation+1.22e+00

Pos logit contributions
SOURCE+1.155
ALD+1.118
lance+1.109
heads+1.108
+1.100
Neg logit contributions
ering-1.134
 Dupl-1.058
 Hoover-1.034
iced-1.030
 swallow-1.018
Token as
Token ID355
Feature activation+3.01e+00

Pos logit contributions
ering+0.778
 Dupl+0.706
iced+0.690
 Hoover+0.679
 Cox+0.675
Neg logit contributions
SOURCE-1.120
ALD-1.092
-1.085
lance-1.070
vernment-1.069
Token 2
Token ID362
Feature activation+1.96e+00

Pos logit contributions
ering+2.031
 Dupl+1.855
iced+1.824
 Hoover+1.801
 Cox+1.776
Neg logit contributions
SOURCE-2.625
ALD-2.567
-2.547
vernment-2.526
lance-2.520
Token/
Token ID14
Feature activation-1.30e+00

Pos logit contributions
ering+1.636
 Dupl+1.524
iced+1.502
 Hoover+1.489
 Cox+1.472
Neg logit contributions
SOURCE-1.396
ALD-1.353
-1.344
vernment-1.326
lance-1.321
Token4
Token ID19
Feature activation-1.51e+00

Pos logit contributions
SOURCE+1.030
ALD+0.988
heads+0.983
+0.978
lance+0.966
Neg logit contributions
ering-0.990
 Dupl-0.921
 Hoover-0.902
iced-0.899
 Cox-0.886
Token/
Token ID14
Feature activation-1.45e+00

Pos logit contributions
SOURCE+1.203
ALD+1.159
heads+1.150
+1.148
ITNESS+1.145
Neg logit contributions
ering-1.115
 Dupl-1.032
iced-1.023
 Hoover-1.010
 Cox-0.986
Token8
Token ID23
Feature activation+8.07e-01

Pos logit contributions
SOURCE+1.221
ALD+1.174
heads+1.173
+1.161
lance+1.157
Neg logit contributions
ering-1.025
 Dupl-0.953
 Hoover-0.933
iced-0.919
 Cox-0.907
Token
Token ID198
Feature activation+1.57e-01

Pos logit contributions
ering+1.003
 Dupl+0.921
iced+0.900
 Cox+0.894
 Hoover+0.890
Neg logit contributions
SOURCE-1.259
vernment-1.237
ALD-1.235
-1.235
lance-1.214
Token
Token ID198
Feature activation+1.72e-01

Pos logit contributions
ering+0.122
 Dupl+0.114
iced+0.113
 Hoover+0.112
 swallow+0.111
Neg logit contributions
SOURCE-0.122
ALD-0.116
heads-0.116
lance-0.114
-0.114
TokenRepl
Token ID39232
Feature activation-6.20e+00

Pos logit contributions
ering+0.111
 Dupl+0.101
iced+0.099
 Hoover+0.097
 Cox+0.097
Neg logit contributions
SOURCE-0.155
ALD-0.151
-0.151
vernment-0.150
lance-0.149
Tokenacements
Token ID28613
Feature activation-6.71e+00

Pos logit contributions
lance+1.565
+1.513
SOURCE+1.498
ALD+1.411
heads+1.399
Neg logit contributions
ering-7.480
 Hoover-7.357
 Dupl-7.340
imaru-7.142
 Dollar-7.136
Token:
Token ID25
Feature activation-4.19e-01

Pos logit contributions
SOURCE+4.851
ALD+4.661
+4.648
heads+4.639
lance+4.567
Neg logit contributions
ering-5.245
 Dupl-4.831
 Hoover-4.779
iced-4.766
imaru-4.693
Token
Token ID198
Feature activation-8.30e-01

Pos logit contributions
SOURCE+0.366
ALD+0.358
+0.354
lance+0.353
heads+0.352
Neg logit contributions
ering-0.280
 Dupl-0.254
iced-0.252
 Hoover-0.249
 swallow-0.241
Token
Token ID198
Feature activation-6.62e-01

Pos logit contributions
SOURCE+0.626
ALD+0.598
heads+0.597
lance+0.589
unknown+0.586
Neg logit contributions
ering-0.655
 Dupl-0.612
iced-0.611
 Hoover-0.606
 swallow-0.602
Token16
Token ID1433
Feature activation-2.39e+00

Pos logit contributions
SOURCE+0.528
ALD+0.514
+0.504
lance+0.502
heads+0.501
Neg logit contributions
ering-0.500
 Dupl-0.466
 Hoover-0.458
iced-0.455
 swallow-0.447
Token.
Token ID13
Feature activation+2.84e+00

Pos logit contributions
SOURCE+1.970
ALD+1.921
+1.908
vernment+1.894
lance+1.882
Neg logit contributions
ering-1.741
 Dupl-1.602
iced-1.573
 Hoover-1.557
 Cox-1.545
Token Pat
Token ID3208
Feature activation+1.31e+00

Pos logit contributions
ering+1.744
 Dupl+1.583
iced+1.546
 Hoover+1.527
 Cox+1.512
Neg logit contributions
SOURCE-2.652
ALD-2.598
-2.583
vernment-2.567
lance-2.548
Token MacArthur
Token ID46626
Feature activation-1.94e+00

Pos logit contributions
ering+0.826
 Dupl+0.751
iced+0.735
 Hoover+0.725
 Cox+0.713
Neg logit contributions
SOURCE-1.198
ALD-1.175
-1.168
vernment-1.158
lance-1.156
Token,
Token ID11
Feature activation+1.24e-01

Pos logit contributions
ering+0.213
 Dupl+0.195
iced+0.192
 Hoover+0.190
 Cox+0.187
Neg logit contributions
SOURCE-0.267
ALD-0.261
-0.259
vernment-0.256
lance-0.256
Token the
Token ID262
Feature activation-1.63e-01

Pos logit contributions
ering+0.079
iced+0.072
 Dupl+0.071
 Hoover+0.069
 swallow+0.068
Neg logit contributions
SOURCE-0.112
ALD-0.110
lance-0.108
heads-0.108
vernment-0.108
Token trend
Token ID5182
Feature activation+4.44e+00

Pos logit contributions
SOURCE+0.140
ALD+0.137
heads+0.135
lance+0.135
vernment+0.134
Neg logit contributions
ering-0.110
iced-0.101
 Dupl-0.100
 Hoover-0.096
 Cox-0.096
Token has
Token ID468
Feature activation-5.43e-01

Pos logit contributions
ering+3.189
 Dupl+2.868
iced+2.861
 Hoover+2.799
 Cox+2.797
Neg logit contributions
SOURCE-3.672
ALD-3.604
-3.597
ITNESS-3.551
vernment-3.531
Token continued
Token ID3767
Feature activation+2.74e+00

Pos logit contributions
SOURCE+0.418
ALD+0.403
+0.402
heads+0.398
vernment+0.396
Neg logit contributions
ering-0.418
 Dupl-0.392
iced-0.378
 Hoover-0.377
 Cox-0.371
Token.
Token ID13
Feature activation-2.67e-01

Pos logit contributions
ering+1.803
 Dupl+1.639
iced+1.608
 Hoover+1.590
 Cox+1.574
Neg logit contributions
SOURCE-2.431
ALD-2.377
-2.370
vernment-2.343
lance-2.333
Token The
Token ID383
Feature activation+2.56e-01

Pos logit contributions
SOURCE+0.218
ALD+0.212
heads+0.210
lance+0.208
+0.208
Neg logit contributions
ering-0.194
iced-0.178
 swallow-0.174
 Dupl-0.171
 Hoover-0.168
Token implications
Token ID10939
Feature activation-5.72e-01

Pos logit contributions
ering+0.171
iced+0.156
 Dupl+0.155
 Hoover+0.150
 Cox+0.149
Neg logit contributions
SOURCE-0.223
ALD-0.217
heads-0.214
lance-0.214
-0.213
Token are
Token ID389
Feature activation-1.00e+00

Pos logit contributions
SOURCE+0.449
ALD+0.437
+0.436
vernment+0.431
lance+0.430
Neg logit contributions
ering-0.432
 Dupl-0.399
iced-0.395
 Hoover-0.390
 Cox-0.384
Token startling
Token ID28027
Feature activation+9.57e-01

Pos logit contributions
SOURCE+0.865
ALD+0.845
heads+0.837
lance+0.829
+0.828
Neg logit contributions
ering-0.678
 Dupl-0.617
iced-0.616
 Hoover-0.602
 Cox-0.590
Token.
Token ID13
Feature activation-3.42e-01

Pos logit contributions
ering+0.693
 Dupl+0.639
iced+0.627
 Hoover+0.621
 Cox+0.614
Neg logit contributions
SOURCE-0.786
ALD-0.767
-0.762
vernment-0.755
lance-0.752
Token be
Token ID307
Feature activation-1.97e+00

Pos logit contributions
SOURCE+1.858
+1.794
ALD+1.789
lance+1.783
heads+1.781
Neg logit contributions
ering-1.711
iced-1.614
 Dupl-1.582
 Hoover-1.560
 Cox-1.546
Token established
Token ID4920
Feature activation-1.80e+00

Pos logit contributions
SOURCE+1.617
heads+1.582
ALD+1.573
+1.558
ITNESS+1.533
Neg logit contributions
ering-1.388
iced-1.247
 Hoover-1.239
 Dupl-1.230
 swallow-1.220
Token by
Token ID416
Feature activation-3.94e-02

Pos logit contributions
SOURCE+1.602
ALD+1.562
+1.551
heads+1.536
vernment+1.523
Neg logit contributions
ering-1.165
iced-1.048
 Dupl-1.048
 Hoover-1.023
 Cox-1.013
Token Law
Token ID3854
Feature activation+1.10e+00

Pos logit contributions
SOURCE+0.037
ALD+0.037
+0.036
lance+0.036
vernment+0.036
Neg logit contributions
ering-0.023
iced-0.021
 Dupl-0.021
 Hoover-0.020
 Cox-0.020
Token
Token ID1399
Feature activation+1.70e+00

Pos logit contributions
ering+0.774
 Dupl+0.709
iced+0.697
 Hoover+0.691
 Cox+0.682
Neg logit contributions
SOURCE-0.930
ALD-0.907
-0.902
vernment-0.892
lance-0.891
Token
Token ID198
Feature activation-1.14e-01

Pos logit contributions
ering+1.260
 Dupl+1.162
iced+1.142
 Hoover+1.130
 Cox+1.118
Neg logit contributions
SOURCE-1.365
ALD-1.332
-1.322
vernment-1.309
lance-1.305
Token
Token ID198
Feature activation-9.55e-02

Pos logit contributions
SOURCE+0.089
ALD+0.085
heads+0.084
+0.084
lance+0.083
Neg logit contributions
ering-0.088
 Dupl-0.082
iced-0.082
 Hoover-0.081
 swallow-0.080
TokenAccording
Token ID4821
Feature activation-4.71e+00

Pos logit contributions
SOURCE+0.063
ALD+0.061
+0.060
lance+0.059
heads+0.059
Neg logit contributions
ering-0.086
 Dupl-0.081
iced-0.079
 Hoover-0.079
 Cox-0.078
Token to
Token ID284
Feature activation-1.94e+00

Pos logit contributions
SOURCE+4.185
ALD+4.062
+4.031
lance+4.029
vernment+4.006
Neg logit contributions
ering-3.029
 Dupl-2.811
iced-2.731
 Hoover-2.680
 Cox-2.641
Token Stephenson
Token ID42838
Feature activation-4.27e+00

Pos logit contributions
SOURCE+1.679
ALD+1.642
lance+1.625
+1.614
heads+1.599
Neg logit contributions
ering-1.297
iced-1.177
 Dupl-1.175
 swallow-1.115
 Hoover-1.113
Token,
Token ID11
Feature activation+1.22e+00

Pos logit contributions
SOURCE+3.482
ALD+3.414
+3.397
lance+3.351
vernment+3.349
Neg logit contributions
ering-3.082
 Dupl-2.843
iced-2.800
 Hoover-2.750
 Cox-2.707
Token regime
Token ID7142
Feature activation+5.02e-01

Pos logit contributions
SOURCE+0.125
ALD+0.125
+0.124
vernment+0.124
lance+0.121
Neg logit contributions
ering-0.104
iced-0.095
 Dupl-0.094
 Hoover-0.090
 swallow-0.089
Token,
Token ID11
Feature activation-4.96e-01

Pos logit contributions
ering+0.344
 Dupl+0.316
iced+0.310
 Hoover+0.306
 Cox+0.302
Neg logit contributions
SOURCE-0.433
ALD-0.423
-0.420
vernment-0.416
lance-0.415
Token Gaddafi
Token ID40320
Feature activation-4.20e+00

Pos logit contributions
SOURCE+0.428
ALD+0.424
vernment+0.419
lance+0.414
heads+0.413
Neg logit contributions
ering-0.332
 Dupl-0.302
iced-0.301
 swallow-0.288
 Hoover-0.288
Token had
Token ID550
Feature activation+2.71e+00

Pos logit contributions
SOURCE+3.333
ALD+3.249
vernment+3.223
heads+3.221
+3.152
Neg logit contributions
ering-2.998
 Dupl-2.805
iced-2.784
 Hoover-2.680
 swallow-2.661
Token said
Token ID531
Feature activation-2.14e+00

Pos logit contributions
ering+1.913
 Dupl+1.770
iced+1.734
 Hoover+1.704
 Cox+1.686
Neg logit contributions
SOURCE-2.273
ALD-2.215
-2.195
vernment-2.188
heads-2.174
Token:
Token ID25
Feature activation+2.16e-02

Pos logit contributions
SOURCE+1.793
ALD+1.753
+1.740
vernment+1.727
lance+1.727
Neg logit contributions
ering-1.478
iced-1.368
 Dupl-1.359
 Hoover-1.318
 Cox-1.303
Token
Token ID564
Feature activation+3.16e+00

Pos logit contributions
ering+0.015
 Dupl+0.013
iced+0.013
 Hoover+0.013
 Cox+0.013
Neg logit contributions
SOURCE-0.019
ALD-0.019
-0.018
vernment-0.018
lance-0.018
Token
Token ID246
Feature activation+4.72e-01

Pos logit contributions
ering+2.800
 Dupl+2.620
iced+2.581
 Hoover+2.560
 Cox+2.536
Neg logit contributions
SOURCE-2.083
ALD-2.020
-2.006
vernment-1.979
lance-1.972
TokenOil
Token ID44142
Feature activation-3.89e+00

Pos logit contributions
ering+0.381
 Dupl+0.355
iced+0.350
 Hoover+0.347
 Cox+0.344
Neg logit contributions
SOURCE-0.350
ALD-0.340
-0.336
vernment-0.334
lance-0.333
Token companies
Token ID2706
Feature activation+3.61e+00

Pos logit contributions
heads+3.132
SOURCE+3.126
vernment+3.055
ALD+3.032
lance+3.014
Neg logit contributions
ering-2.717
iced-2.536
 Dupl-2.510
 Hoover-2.471
 Cox-2.464
Token are
Token ID389
Feature activation+6.57e-01

Pos logit contributions
ering+2.669
 Dupl+2.460
iced+2.417
 Hoover+2.394
 Cox+2.368
Neg logit contributions
SOURCE-2.907
ALD-2.838
-2.818
vernment-2.788
lance-2.779
Token%
Token ID4
Feature activation+2.88e+00

Pos logit contributions
ering+1.558
 Dupl+1.417
iced+1.390
 Hoover+1.374
 Cox+1.359
Neg logit contributions
SOURCE-2.081
ALD-2.034
-2.026
vernment-2.006
lance-2.004
Token of
Token ID286
Feature activation+2.27e+00

Pos logit contributions
ering+2.087
 Dupl+1.917
iced+1.884
 Hoover+1.862
 Cox+1.844
Neg logit contributions
SOURCE-2.366
ALD-2.312
-2.298
vernment-2.272
lance-2.267
Token population
Token ID3265
Feature activation+3.19e-01

Pos logit contributions
ering+1.460
 Dupl+1.329
iced+1.303
 Hoover+1.284
 Cox+1.267
Neg logit contributions
SOURCE-2.052
ALD-2.009
-1.986
vernment-1.978
lance-1.970
Token,
Token ID11
Feature activation-1.07e-01

Pos logit contributions
ering+0.225
 Dupl+0.207
iced+0.203
 Hoover+0.201
 Cox+0.198
Neg logit contributions
SOURCE-0.269
ALD-0.262
-0.260
vernment-0.258
heads-0.257
Token whose
Token ID3025
Feature activation-1.17e+00

Pos logit contributions
SOURCE+0.088
ALD+0.085
heads+0.084
vernment+0.083
+0.083
Neg logit contributions
ering-0.077
 Dupl-0.071
iced-0.070
 Hoover-0.069
 Cox-0.068
Token occupation
Token ID13755
Feature activation-1.93e+00

Pos logit contributions
SOURCE+0.975
ALD+0.948
heads+0.939
lance+0.930
+0.929
Neg logit contributions
ering-0.819
iced-0.754
 Dupl-0.751
 Hoover-0.727
 Cox-0.719
Token structure
Token ID4645
Feature activation-2.23e+00

Pos logit contributions
SOURCE+1.480
ALD+1.436
+1.428
heads+1.416
lance+1.407
Neg logit contributions
ering-1.499
 Dupl-1.385
iced-1.367
 Hoover-1.348
 Cox-1.326
Token more
Token ID517
Feature activation-1.40e+00

Pos logit contributions
SOURCE+1.777
ALD+1.710
heads+1.707
+1.706
vernment+1.685
Neg logit contributions
ering-1.642
 Dupl-1.510
iced-1.500
 Hoover-1.473
arget-1.436
Token closely
Token ID7173
Feature activation-3.58e+00

Pos logit contributions
SOURCE+1.380
ALD+1.337
heads+1.330
+1.327
lance+1.319
Neg logit contributions
ering-0.783
 Dupl-0.716
iced-0.696
 Hoover-0.685
 Cox-0.669
Token matches
Token ID7466
Feature activation+1.22e+00

Pos logit contributions
SOURCE+2.282
heads+2.161
lance+2.154
ALD+2.148
+2.131
Neg logit contributions
ering-3.201
 Dupl-3.037
 Hoover-2.981
iced-2.976
 Dollar-2.913
Token our
Token ID674
Feature activation+6.92e-01

Pos logit contributions
ering+0.774
 Dupl+0.704
iced+0.691
 Hoover+0.679
 Cox+0.668
Neg logit contributions
SOURCE-1.123
ALD-1.095
-1.081
vernment-1.077
lance-1.075
Token exasper
Token ID41851
Feature activation+4.79e+00

Pos logit contributions
ering+0.730
 Dupl+0.676
iced+0.667
 Hoover+0.648
 Cox+0.642
Neg logit contributions
SOURCE-0.839
ALD-0.818
-0.807
lance-0.802
vernment-0.801
Tokenation
Token ID341
Feature activation+3.35e+00

Pos logit contributions
ering+3.402
iced+3.053
 Dupl+3.046
 Hoover+2.949
 Cox+2.926
Neg logit contributions
SOURCE-4.069
ALD-3.964
-3.926
vernment-3.877
ITNESS-3.840
Token that
Token ID326
Feature activation+2.59e+00

Pos logit contributions
ering+2.159
 Dupl+1.959
iced+1.928
 Hoover+1.907
 Cox+1.884
Neg logit contributions
SOURCE-3.018
ALD-2.959
-2.933
vernment-2.907
lance-2.896
Token as
Token ID355
Feature activation+5.62e+00

Pos logit contributions
ering+1.661
 Dupl+1.511
iced+1.486
 Hoover+1.461
 Cox+1.439
Neg logit contributions
SOURCE-2.349
ALD-2.299
-2.276
vernment-2.263
lance-2.259
Token a
Token ID257
Feature activation+9.11e-01

Pos logit contributions
ering+3.486
 Dupl+3.156
iced+3.083
 Hoover+3.068
 Cox+3.030
Neg logit contributions
SOURCE-5.191
ALD-5.092
-5.084
vernment-5.007
lance-4.981
Token religion
Token ID5737
Feature activation-6.41e+00

Pos logit contributions
ering+0.633
 Dupl+0.576
iced+0.570
 Hoover+0.554
 Cox+0.547
Neg logit contributions
SOURCE-0.780
ALD-0.759
-0.751
lance-0.749
vernment-0.747
Token professor
Token ID6240
Feature activation+4.20e-01

Pos logit contributions
SOURCE+5.238
heads+5.093
lance+5.016
ALD+4.951
vernment+4.844
Neg logit contributions
ering-4.395
 Dupl-4.168
iced-4.094
 Hoover-3.963
imaru-3.951
Token,
Token ID11
Feature activation-2.80e-01

Pos logit contributions
ering+0.288
 Dupl+0.265
iced+0.260
 Hoover+0.255
 Cox+0.251
Neg logit contributions
SOURCE-0.360
ALD-0.352
-0.349
lance-0.347
heads-0.346
Token it
Token ID340
Feature activation+2.42e-02

Pos logit contributions
SOURCE+0.236
ALD+0.233
lance+0.228
heads+0.227
vernment+0.227
Neg logit contributions
ering-0.193
 Dupl-0.177
iced-0.176
 Hoover-0.170
 Cox-0.166
Token is
Token ID318
Feature activation+1.33e+00

Pos logit contributions
ering+0.018
 Dupl+0.017
iced+0.016
 Hoover+0.016
 Cox+0.016
Neg logit contributions
SOURCE-0.019
ALD-0.019
-0.019
vernment-0.019
heads-0.019
Token difficult
Token ID2408
Feature activation+6.20e+00

Pos logit contributions
ering+0.927
 Dupl+0.853
iced+0.839
 Hoover+0.815
 Cox+0.803
Neg logit contributions
SOURCE-1.134
ALD-1.108
heads-1.088
vernment-1.083
lance-1.073
Token I
Token ID314
Feature activation+2.12e+00

Pos logit contributions
SOURCE+1.159
ALD+1.131
+1.126
vernment+1.121
heads+1.117
Neg logit contributions
ering-0.784
 Dupl-0.718
iced-0.704
 Hoover-0.702
 Cox-0.688
Token will
Token ID481
Feature activation+5.71e-01

Pos logit contributions
ering+1.554
 Dupl+1.432
iced+1.409
 Hoover+1.397
 Cox+1.380
Neg logit contributions
SOURCE-1.720
ALD-1.681
-1.671
vernment-1.655
lance-1.643
Token use
Token ID779
Feature activation-1.91e-01

Pos logit contributions
ering+0.411
iced+0.378
 Dupl+0.376
 Hoover+0.370
 Cox+0.366
Neg logit contributions
SOURCE-0.467
ALD-0.455
-0.449
vernment-0.445
ITNESS-0.445
Token
Token ID564
Feature activation+1.47e+00

Pos logit contributions
SOURCE+0.168
ALD+0.164
+0.162
vernment+0.161
heads+0.161
Neg logit contributions
ering-0.127
 Dupl-0.115
iced-0.115
 Hoover-0.113
 Cox-0.110
Token
Token ID250
Feature activation-1.82e+00

Pos logit contributions
ering+1.343
 Dupl+1.273
iced+1.241
 Hoover+1.235
 Cox+1.227
Neg logit contributions
SOURCE-0.924
-0.898
ALD-0.892
vernment-0.878
lance-0.875
TokenRad
Token ID15546
Feature activation-8.47e+00

Pos logit contributions
SOURCE+1.474
+1.436
ALD+1.429
vernment+1.417
heads+1.411
Neg logit contributions
ering-1.317
 Dupl-1.242
 Hoover-1.202
iced-1.189
 Cox-1.185
Tokenical
Token ID605
Feature activation-1.72e+00

Pos logit contributions
SOURCE+6.519
lance+6.449
ALD+6.419
heads+6.334
vernment+6.302
Neg logit contributions
ering-5.865
 Dupl-5.568
 Hoover-5.446
 Cox-5.366
iced-5.322
Token Accept
Token ID21699
Feature activation-1.37e+00

Pos logit contributions
SOURCE+1.496
ALD+1.459
+1.453
vernment+1.441
heads+1.432
Neg logit contributions
ering-1.154
 Dupl-1.056
iced-1.035
 Hoover-1.020
 Cox-1.011
Tokenance
Token ID590
Feature activation-2.43e+00

Pos logit contributions
SOURCE+1.253
ALD+1.220
+1.214
lance+1.211
vernment+1.210
Neg logit contributions
ering-0.841
 Dupl-0.770
iced-0.756
 Hoover-0.752
 Cox-0.738
Token
Token ID447
Feature activation-2.18e+00

Pos logit contributions
SOURCE+1.975
ALD+1.923
+1.919
lance+1.900
vernment+1.900
Neg logit contributions
ering-1.730
 Dupl-1.596
iced-1.581
 Hoover-1.560
 Cox-1.535
Token
Token ID251
Feature activation-2.01e+00

Pos logit contributions
SOURCE+1.221
+1.199
ALD+1.191
lance+1.180
heads+1.145
Neg logit contributions
ering-2.108
 Dupl-2.008
iced-1.975
 swallow-1.961
 Perkins-1.944
Token
Token ID447
Feature activation-7.52e-01

Pos logit contributions
SOURCE+1.869
ALD+1.845
+1.843
vernment+1.813
heads+1.803
Neg logit contributions
ering-1.882
 Dupl-1.755
iced-1.722
 Hoover-1.710
 Cox-1.679
Token
Token ID247
Feature activation-3.74e-01

Pos logit contributions
SOURCE+0.429
ALD+0.413
+0.413
lance+0.408
heads+0.402
Neg logit contributions
ering-0.729
 Dupl-0.693
iced-0.679
 Cox-0.674
 Hoover-0.671
Tokens
Token ID82
Feature activation-1.45e+00

Pos logit contributions
SOURCE+0.297
ALD+0.290
+0.289
vernment+0.288
lance+0.287
Neg logit contributions
ering-0.279
 Dupl-0.260
iced-0.253
 Hoover-0.252
 Cox-0.249
Token a
Token ID257
Feature activation-8.66e-01

Pos logit contributions
SOURCE+1.278
ALD+1.254
+1.245
vernment+1.237
lance+1.236
Neg logit contributions
ering-0.946
 Dupl-0.868
iced-0.840
 Hoover-0.831
 Cox-0.822
Token good
Token ID922
Feature activation-5.36e-01

Pos logit contributions
SOURCE+0.739
ALD+0.725
+0.719
vernment+0.716
lance+0.713
Neg logit contributions
ering-0.594
 Dupl-0.543
iced-0.531
 Hoover-0.522
 Cox-0.514
Token number
Token ID1271
Feature activation-7.03e+00

Pos logit contributions
SOURCE+0.412
ALD+0.403
+0.398
lance+0.396
vernment+0.395
Neg logit contributions
ering-0.413
 Dupl-0.384
iced-0.376
 Hoover-0.373
 Cox-0.367
Token of
Token ID286
Feature activation-1.76e+00

Pos logit contributions
SOURCE+5.123
+4.974
ALD+4.967
vernment+4.954
heads+4.923
Neg logit contributions
ering-5.492
 Dupl-5.184
 Hoover-5.032
iced-5.019
 Cox-4.955
Token kids
Token ID3988
Feature activation+6.88e-01

Pos logit contributions
SOURCE+1.465
vernment+1.431
ALD+1.430
heads+1.420
+1.417
Neg logit contributions
ering-1.224
 Dupl-1.126
iced-1.089
 Hoover-1.074
imaru-1.066
Token and
Token ID290
Feature activation-1.16e+00

Pos logit contributions
ering+0.474
 Dupl+0.437
iced+0.428
 Hoover+0.425
 Cox+0.419
Neg logit contributions
SOURCE-0.583
ALD-0.570
-0.565
heads-0.564
vernment-0.562
Token staff
Token ID3085
Feature activation-1.97e+00

Pos logit contributions
SOURCE+1.005
ALD+0.991
+0.984
heads+0.978
vernment+0.976
Neg logit contributions
ering-0.773
 Dupl-0.707
iced-0.691
 Hoover-0.681
 Cox-0.672
Token spending
Token ID4581
Feature activation-2.77e+00

Pos logit contributions
SOURCE+1.622
ALD+1.600
heads+1.600
+1.580
vernment+1.577
Neg logit contributions
ering-1.360
 Dupl-1.272
iced-1.235
 Hoover-1.232
 Cox-1.215
Token long
Token ID890
Feature activation+2.68e+00

Pos logit contributions
ering+1.224
 Dupl+1.134
iced+1.105
 Hoover+1.087
 Cox+1.070
Neg logit contributions
SOURCE-1.497
ALD-1.454
-1.434
heads-1.434
vernment-1.433
Token viewed
Token ID9569
Feature activation-9.28e-01

Pos logit contributions
ering+1.955
 Dupl+1.815
iced+1.772
 Hoover+1.754
 Cox+1.733
Neg logit contributions
SOURCE-2.189
ALD-2.129
-2.113
vernment-2.096
heads-2.089
Token itself
Token ID2346
Feature activation+6.40e-01

Pos logit contributions
SOURCE+0.773
ALD+0.755
heads+0.751
+0.748
vernment+0.748
Neg logit contributions
ering-0.643
 Dupl-0.600
iced-0.593
 Hoover-0.570
 Cox-0.563
Token as
Token ID355
Feature activation+3.16e+00

Pos logit contributions
ering+0.434
 Dupl+0.400
iced+0.393
 Hoover+0.387
 Cox+0.381
Neg logit contributions
SOURCE-0.554
ALD-0.540
-0.535
heads-0.532
vernment-0.532
Token a
Token ID257
Feature activation+2.40e-01

Pos logit contributions
ering+1.981
 Dupl+1.802
iced+1.765
 Hoover+1.741
 Cox+1.716
Neg logit contributions
SOURCE-2.915
ALD-2.853
-2.832
vernment-2.810
lance-2.800
Token bridge
Token ID7696
Feature activation-7.05e+00

Pos logit contributions
ering+0.169
 Dupl+0.157
iced+0.154
 Hoover+0.148
 Cox+0.146
Neg logit contributions
SOURCE-0.202
ALD-0.196
heads-0.194
-0.194
vernment-0.192
Token between
Token ID1022
Feature activation-3.16e+00

Pos logit contributions
heads+5.434
SOURCE+5.353
ALD+5.240
vernment+5.194
lance+5.152
Neg logit contributions
ering-4.910
 Dupl-4.751
iced-4.633
 Perkins-4.608
 Hoover-4.599
Token the
Token ID262
Feature activation+1.70e-02

Pos logit contributions
SOURCE+2.576
heads+2.500
ALD+2.496
vernment+2.485
+2.441
Neg logit contributions
ering-2.201
 Dupl-2.059
iced-2.025
arget-1.935
 Perkins-1.924
Token United
Token ID1578
Feature activation+3.19e-02

Pos logit contributions
ering+0.012
 Dupl+0.011
iced+0.011
 Hoover+0.011
 Cox+0.011
Neg logit contributions
SOURCE-0.014
ALD-0.013
-0.013
heads-0.013
vernment-0.013
Token States
Token ID1829
Feature activation+3.45e+00

Pos logit contributions
ering+0.023
iced+0.021
 Dupl+0.021
 Hoover+0.020
 Cox+0.020
Neg logit contributions
SOURCE-0.027
heads-0.026
ALD-0.025
lance-0.025
vernment-0.025
Token and
Token ID290
Feature activation+7.13e-01

Pos logit contributions
ering+1.933
 Dupl+1.671
iced+1.667
 Hoover+1.646
 Cox+1.626
Neg logit contributions
SOURCE-3.431
-3.336
ALD-3.333
lance-3.291
ITNESS-3.283
Token ;
Token ID2162
Feature activation+5.13e+00

Pos logit contributions
SOURCE+3.375
+3.299
ALD+3.299
ITNESS+3.247
unknown+3.232
Neg logit contributions
ering-2.993
 Dupl-2.756
iced-2.717
 Hoover-2.712
 Cox-2.693
Token
Token ID198
Feature activation+1.03e+00

Pos logit contributions
ering+3.392
 Dupl+3.147
iced+3.036
 Hoover+3.010
 Cox+2.976
Neg logit contributions
SOURCE-4.475
ALD-4.418
-4.357
vernment-4.330
lance-4.293
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
ering+0.782
 Dupl+0.725
iced+0.718
 Hoover+0.713
 swallow+0.700
Neg logit contributions
SOURCE-0.813
ALD-0.783
heads-0.776
-0.772
lance-0.771
TokenThe
Token ID464
Feature activation-1.01e+00

Pos logit contributions
ering+0.858
 Dupl+0.803
iced+0.789
 Hoover+0.787
 Cox+0.779
Neg logit contributions
SOURCE-0.702
ALD-0.679
-0.672
lance-0.666
vernment-0.664
Token pin
Token ID6757
Feature activation+1.32e+00

Pos logit contributions
SOURCE+0.821
ALD+0.787
lance+0.784
ITNESS+0.784
heads+0.782
Neg logit contributions
ering-0.737
iced-0.672
 Dupl-0.668
 Hoover-0.653
 Cox-0.649
Token function
Token ID2163
Feature activation-7.15e+00

Pos logit contributions
ering+0.991
 Dupl+0.918
iced+0.904
 Hoover+0.894
 Cox+0.884
Neg logit contributions
SOURCE-1.044
ALD-1.018
-1.010
vernment-1.001
lance-0.999
Token marks
Token ID8849
Feature activation-6.72e-01

Pos logit contributions
SOURCE+5.155
heads+5.098
lance+4.946
ALD+4.844
vernment+4.738
Neg logit contributions
ering-5.294
iced-5.027
 Dupl-4.962
 Hoover-4.883
imaru-4.794
Token the
Token ID262
Feature activation+1.30e-01

Pos logit contributions
SOURCE+0.588
ALD+0.575
vernment+0.568
+0.568
heads+0.567
Neg logit contributions
ering-0.445
 Dupl-0.404
iced-0.402
 Hoover-0.396
 Cox-0.392
Token thread
Token ID4704
Feature activation+3.71e+00

Pos logit contributions
ering+0.094
 Dupl+0.086
iced+0.086
 Hoover+0.084
 Cox+0.083
Neg logit contributions
SOURCE-0.107
ALD-0.104
-0.102
ITNESS-0.102
vernment-0.102
Token as
Token ID355
Feature activation+5.23e+00

Pos logit contributions
ering+2.533
 Dupl+2.308
iced+2.259
 Hoover+2.233
 Cox+2.209
Neg logit contributions
SOURCE-3.212
ALD-3.142
-3.123
vernment-3.086
lance-3.077
Token active
Token ID4075
Feature activation-1.51e+00

Pos logit contributions
ering+3.541
 Dupl+3.244
iced+3.177
 Hoover+3.148
 Cox+3.101
Neg logit contributions
SOURCE-4.545
ALD-4.445
-4.420
vernment-4.376
lance-4.361