TOP ACTIVATIONS
MAX = 2.022

FC RA regulates the collection , dissemination , and use of
ing demands . AFP PHOTO / Saul LO EB SA UL
s President , Car les Pu ig dem ont ,
the French supermarket chain Car ref our took Danish goods off
Time to get the Lead b elly records out .
drew maybe the first three or four . And the idea
p atted himself on the back for public service
sometimes , especially in B . C ., some of the
significantly , completely shutting off fat - burning and also triggering
some of which involve four or five cooking techniques and begin
calories to be stored as fat . After the
Moving ash from the River b end site to a lined
to buying drugs " five or six times " from Jones
would put wind at the back of the opposition party ,
Alert s newsletter delivered to your inbox We 're sorry ,
has preserved a striking car ap ace of multi - lay
was thought to have only a few bacterial species and mostly
the left : Bo Scar b rough in 10 th grade
care bill , especially since Rep . Tom MacArthur of New
& Bonnie & Baby 2008 01 09 O & A -

MIN ACTIVATIONS
MIN = -1.741

religion in Brazil are inex act . According to the census
America , C iti , Chase , Wells Fargo , and
of a military response and almost no news play on this
possible , in the old Naval Air Station in South We
to funding , requiring F annie Mae and Freddie Mac to
and you would be right . But what
been living under military law for nearly five decades , while
, there is one major difference between the two types of
. New York : William Morrow and Company , Inc .
an investment banker now with JPMorgan who specializes in big energy
credit - reporting agency , JPMorgan Chase , and Cit ib
13 Sep , 2014 , 11 : 13 AM
, though it later came out that the report had been
It also features unusual men 's ur inals in the
percent on the news . Imperial Brands , formerly known as
What most fail to point out is that those executive actions
Choose Install from Zip file and locate
America . Wells Fargo , JPMorgan Chase , and Cit ib
claims that have been pending for at least a decade
] prohibition , slavery , women s rights .

INTERVAL 1.081 - 2.022
CONTAINS 0.238%

projects like hospitals and schools .[ 5 ] Additionally , the
as evidence that Kim Jong Un is losing his grip on
by the Nobel Foundation in 1968 from S ver ig es
, and they need eight to 10 ." On
cases - a four - fold increase in a week .

INTERVAL 0.141 - 1.081
CONTAINS 33.527%

and he did not fight back . Jal en was doing
but drawing conclusions based on them wouldn t be
of a few weeks , Sanders will need to prevail
to that I had to hold as the A - story
the rankings . Maybe now it would

INTERVAL -0.800 - 0.141
CONTAINS 64.644%

Ethereum to handle one to 10 m ln active users ,
L D att u and C K Pr as ad also
no interest in seeking a patent for the Love t ron
. The views of contributors are their own and
think of recording your live performances ? RF :

INTERVAL -1.741 - -0.800
CONTAINS 1.591%

lawyer during the George W . Bush administration , said there
1940 s . The most famous case was a Norwegian woman
which I have kept for almost 30 years .
+ Loyal , Def erential , Level - headed
National Geographic A Place of Wild Beauty
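
The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX. That is an inference from the printed bounds, not something the dashboard states. The sketch below is a minimal check of that reading: it recomputes the bin edges from the MIN and MAX values and sums the CONTAINS percentages. The names ACT_MIN, ACT_MAX, N_BINS, and bin_edges are hypothetical and chosen for illustration only.

```python
# Values copied from the MAX, MIN, and CONTAINS lines above.
ACT_MIN, ACT_MAX = -1.741, 2.022
N_BINS = 4  # assumption: four equal-width intervals, inferred from the printed bounds


def bin_edges(lo, hi, n):
    """Return n + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / n
    return [lo + i * width for i in range(n + 1)]


edges = bin_edges(ACT_MIN, ACT_MAX, N_BINS)

# CONTAINS percentages, ordered from the lowest interval to the highest.
shares = [1.591, 64.644, 33.527, 0.238]
total = sum(shares)

for lo, hi, share in zip(edges, edges[1:], shares):
    print(f"{lo:+.3f} to {hi:+.3f}: {share:.3f}%")
print(f"total: {total:.3f}%")
```

The recomputed edges agree with the printed interval bounds to within rounding of the three-decimal MIN and MAX values, and the four shares account for all of the sampled activations.
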
Token FC
Token ID10029
Feature activation-3.14e-01

Pos logit contributions
icide+1.051
heastern+0.914
ingers+0.904
icides+0.893
ppers+0.884
Neg logit contributions
 agre-0.703
 agreeing-0.650
 dere-0.640
lio-0.625
andum-0.582
TokenRA
Token ID3861
Feature activation+3.68e-01

Pos logit contributions
 agre+1.665
 agreeing+1.578
 dere+1.549
lio+1.480
 agreement+1.412
Neg logit contributions
icide-1.898
heastern-1.646
ingers-1.623
icides-1.586
ppers-1.559
Token regulates
Token ID39474
Feature activation+4.01e-01

Pos logit contributions
icide+2.117
heastern+1.810
ingers+1.774
icides+1.753
ppers+1.741
Neg logit contributions
 agre-1.980
 agreeing-1.881
 dere-1.844
lio-1.825
elli-1.717
Token the
Token ID262
Feature activation+4.03e-01

Pos logit contributions
icide+2.297
heastern+1.938
ingers+1.904
icides+1.903
ppers+1.872
Neg logit contributions
 agre-2.161
 agreeing-1.983
 dere-1.976
lio-1.961
andum-1.908
Token collection
Token ID4947
Feature activation+3.30e-01

Pos logit contributions
icide+2.402
heastern+2.034
icides+1.990
ingers+1.984
ppers+1.975
Neg logit contributions
 agre-2.073
 agreeing-1.917
 dere-1.887
lio-1.866
andum-1.830
Token,
Token ID11
Feature activation+2.02e+00

Pos logit contributions
icide+1.943
heastern+1.675
ingers+1.658
icides+1.623
ppers+1.583
Neg logit contributions
 agre-1.774
 agreeing-1.692
 dere-1.662
lio-1.595
elli-1.524
Token dissemination
Token ID44832
Feature activation-4.08e-01

Pos logit contributions
icide+9.953
ingers+7.760
icides+7.750
heastern+7.726
ppers+7.655
Neg logit contributions
 agre-9.999
elli-9.463
andum-9.325
lio-9.297
solete-9.041
Token,
Token ID11
Feature activation+8.08e-01

Pos logit contributions
 agre+1.896
 agreeing+1.881
 dere+1.840
lio+1.721
 agreement+1.716
Neg logit contributions
icide-2.572
heastern-2.312
ingers-2.283
icides-2.192
ppers-2.190
Token and
Token ID290
Feature activation+1.01e+00

Pos logit contributions
icide+4.517
ingers+3.709
heastern+3.695
icides+3.641
ppers+3.598
Neg logit contributions
 agre-4.391
lio-4.001
 agreeing-3.991
elli-3.954
andum-3.949
Token use
Token ID779
Feature activation-1.70e-01

Pos logit contributions
icide+5.749
ingers+4.733
heastern+4.696
ppers+4.696
icides+4.633
Neg logit contributions
 agre-5.148
lio-4.750
andum-4.664
elli-4.607
 agreeing-4.592
Token of
Token ID286
Feature activation-1.47e-01

Pos logit contributions
 agre+0.804
 agreeing+0.779
 dere+0.756
lio+0.708
 agreement+0.707
Neg logit contributions
icide-1.099
heastern-0.972
ingers-0.965
icides-0.932
ppers-0.925
Tokening
Token ID278
Feature activation+3.88e-01

Pos logit contributions
icide+1.442
ingers+1.185
heastern+1.179
icides+1.107
ebus+1.067
Neg logit contributions
 agre-2.032
 agreeing-1.848
 dere-1.799
lio-1.747
yrights-1.715
Token demands
Token ID8665
Feature activation+2.75e-02

Pos logit contributions
icide+2.311
ingers+1.966
heastern+1.952
icides+1.928
ppers+1.901
Neg logit contributions
 agre-2.052
 agreeing-1.901
 dere-1.870
lio-1.843
andum-1.778
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
icide+0.150
heastern+0.129
ingers+0.127
icides+0.123
ppers+0.121
Neg logit contributions
 agre-0.160
 agreeing-0.153
 dere-0.151
lio-0.145
 agreement-0.139
Token AFP
Token ID8916
Feature activation+9.09e-01

Pos logit contributions
icide+4.998
heastern+4.163
 BACK+4.116
ingers+4.112
icides+4.010
Neg logit contributions
 agre-6.010
 agreeing-5.655
 dere-5.545
lio-5.328
andum-5.185
Token PHOTO
Token ID41153
Feature activation+4.44e-02

Pos logit contributions
icide+3.446
 BACK+2.613
heastern+2.579
icides+2.541
ingers+2.519
Neg logit contributions
 agre-6.484
 agreeing-6.226
 dere-6.221
lio-5.993
andum-5.888
Token /
Token ID1220
Feature activation+2.01e+00

Pos logit contributions
icide+0.305
heastern+0.270
ingers+0.267
icides+0.261
ppers+0.257
Neg logit contributions
 agre-0.195
 agreeing-0.184
 dere-0.180
lio-0.171
 agreement-0.161
Token Saul
Token ID31603
Feature activation+8.86e-01

Pos logit contributions
icide+9.816
 BACK+8.746
ingers+7.996
heastern+7.899
icides+7.872
Neg logit contributions
 agre-10.341
 agreeing-9.615
 dere-9.187
lio-9.101
iqueness-8.871
Token LO
Token ID17579
Feature activation+1.31e-01

Pos logit contributions
icide+4.490
 BACK+3.718
icides+3.626
ppers+3.606
ingers+3.589
Neg logit contributions
 agre-5.469
 agreeing-5.024
solete-4.813
 dere-4.808
iqueness-4.781
TokenEB
Token ID30195
Feature activation-1.87e-02

Pos logit contributions
icide+0.861
heastern+0.757
ingers+0.744
icides+0.734
ppers+0.727
Neg logit contributions
 agre-0.616
 agreeing-0.578
 dere-0.567
lio-0.544
andum-0.517
TokenSA
Token ID4090
Feature activation-3.42e-01

Pos logit contributions
 agre+0.113
 agreeing+0.109
 dere+0.107
lio+0.103
 agreement+0.099
Neg logit contributions
icide-0.098
heastern-0.083
ingers-0.082
icides-0.080
ppers-0.078
TokenUL
Token ID6239
Feature activation+5.97e-01

Pos logit contributions
 agre+2.027
 agreeing+1.968
 dere+1.932
lio+1.868
elli+1.801
Neg logit contributions
icide-1.792
heastern-1.549
ingers-1.525
icides-1.467
ppers-1.423
Token
Token ID247
Feature activation+8.31e-02

Pos logit contributions
icide+0.593
ingers+0.239
heastern+0.221
ppers+0.119
ommel+0.109
Neg logit contributions
 agre-5.056
 dere-4.817
 agreeing-4.811
lio-4.627
yrights-4.566
Tokens
Token ID82
Feature activation-7.05e-02

Pos logit contributions
icide+0.510
heastern+0.439
ingers+0.438
icides+0.426
ppers+0.426
Neg logit contributions
 agre-0.440
 agreeing-0.406
 dere-0.401
lio-0.377
yrights-0.362
Token President
Token ID1992
Feature activation+5.89e-02

Pos logit contributions
 agre+0.397
 agreeing+0.377
 dere+0.370
lio+0.358
elli+0.343
Neg logit contributions
icide-0.397
heastern-0.339
ingers-0.335
icides-0.327
ppers-0.323
Token,
Token ID11
Feature activation+4.63e-01

Pos logit contributions
icide+0.339
heastern+0.291
ingers+0.287
icides+0.280
ppers+0.275
Neg logit contributions
 agre-0.324
 agreeing-0.310
 dere-0.305
lio-0.291
 agreement-0.280
Token Car
Token ID1879
Feature activation+2.57e-01

Pos logit contributions
icide+2.654
ingers+2.263
heastern+2.211
icides+2.167
ommel+2.136
Neg logit contributions
 agre-2.525
 dere-2.305
 agreeing-2.288
lio-2.177
andum-2.124
Tokenles
Token ID829
Feature activation+1.96e+00

Pos logit contributions
icide+1.138
heastern+0.914
ppers+0.914
ingers+0.896
ommel+0.882
Neg logit contributions
 agre-1.850
 dere-1.710
 agreeing-1.679
 tenancy-1.565
solete-1.543
Token Pu
Token ID11634
Feature activation+3.14e-01

Pos logit contributions
icide+3.531
ieu+3.499
chy+3.179
 BACK+3.021
pal+2.905
Neg logit contributions
 agre-15.990
iqueness-14.660
yrights-13.901
 tenancy-13.876
sites-13.833
Tokenig
Token ID328
Feature activation+7.42e-01

Pos logit contributions
icide+1.852
ingers+1.533
icides+1.517
heastern+1.485
ppers+1.459
Neg logit contributions
 agre-1.824
 agreeing-1.629
 dere-1.597
lio-1.544
yrights-1.481
Tokendem
Token ID9536
Feature activation+8.23e-01

Pos logit contributions
icide+2.639
heastern+1.813
ingers+1.805
ppers+1.793
ommel+1.753
Neg logit contributions
 agre-5.689
 agreeing-5.229
 dere-5.127
yrights-5.105
 Malf-4.995
Tokenont
Token ID756
Feature activation+1.03e+00

Pos logit contributions
icide+3.840
ingers+3.476
ppers+3.275
heastern+3.244
chy+3.101
Neg logit contributions
 agre-5.233
 dere-4.808
 agreeing-4.773
 Malf-4.540
 tenancy-4.492
Token,
Token ID11
Feature activation+7.87e-01

Pos logit contributions
icide+5.027
ingers+4.239
heastern+4.202
ppers+4.065
ommel+4.027
Neg logit contributions
 agre-6.077
 dere-5.725
 agreeing-5.709
lio-5.476
yrights-5.336
Token the
Token ID262
Feature activation+5.13e-01

Pos logit contributions
icide+3.504
icides+2.926
heastern+2.857
ingers+2.854
ppers+2.799
Neg logit contributions
 agre-3.279
 agreeing-3.033
lio-3.006
 dere-2.926
elli-2.826
Token French
Token ID4141
Feature activation+6.76e-01

Pos logit contributions
icide+2.936
heastern+2.457
icides+2.436
ingers+2.433
ppers+2.353
Neg logit contributions
 agre-2.766
 agreeing-2.579
lio-2.557
 dere-2.502
yrights-2.422
Token supermarket
Token ID26454
Feature activation-5.50e-02

Pos logit contributions
icide+3.788
icides+3.138
ingers+3.109
heastern+3.104
ppers+3.008
Neg logit contributions
 agre-3.666
lio-3.530
 agreeing-3.500
 dere-3.440
yrights-3.285
Token chain
Token ID6333
Feature activation+3.20e-01

Pos logit contributions
 agre+0.280
 agreeing+0.268
 dere+0.263
lio+0.253
 agreement+0.240
Neg logit contributions
icide-0.339
heastern-0.295
ingers-0.291
icides-0.286
ppers-0.280
Token Car
Token ID1879
Feature activation-1.95e-01

Pos logit contributions
icide+1.778
heastern+1.512
ingers+1.482
icides+1.460
ppers+1.423
Neg logit contributions
 agre-1.790
 agreeing-1.720
 dere-1.702
lio-1.682
yrights-1.583
Tokenref
Token ID5420
Feature activation+1.91e+00

Pos logit contributions
 agre+0.914
 agreeing+0.869
 dere+0.852
lio+0.803
 agreement+0.772
Neg logit contributions
icide-1.287
heastern-1.134
ingers-1.119
icides-1.095
ppers-1.080
Tokenour
Token ID454
Feature activation+2.35e-01

Pos logit contributions
icide+5.850
ingers+5.181
ommel+4.684
heastern+4.648
 BACK+4.344
Neg logit contributions
 agre-14.001
 dere-12.872
 agreeing-12.491
iqueness-12.351
 charact-12.072
Token took
Token ID1718
Feature activation+9.27e-02

Pos logit contributions
icide+1.243
heastern+1.039
ingers+1.038
icides+1.010
ppers+0.987
Neg logit contributions
 agre-1.390
 agreeing-1.329
 dere-1.323
lio-1.293
yrights-1.232
Token Danish
Token ID20849
Feature activation+1.51e-01

Pos logit contributions
icide+0.540
heastern+0.458
ingers+0.451
icides+0.449
ppers+0.441
Neg logit contributions
 agre-0.511
 agreeing-0.481
 dere-0.470
lio-0.465
elli-0.436
Token goods
Token ID7017
Feature activation+4.15e-02

Pos logit contributions
icide+0.958
heastern+0.818
ingers+0.816
icides+0.808
ppers+0.785
Neg logit contributions
 agre-0.746
 agreeing-0.704
 dere-0.691
lio-0.690
andum-0.630
Token off
Token ID572
Feature activation-2.40e-01

Pos logit contributions
icide+0.236
heastern+0.203
ingers+0.200
icides+0.195
ppers+0.189
Neg logit contributions
 agre-0.232
 agreeing-0.219
 dere-0.216
lio-0.212
elli-0.200
TokenTime
Token ID7575
Feature activation-5.03e-01

Pos logit contributions
 agre+2.126
 agreeing+2.040
 dere+1.998
lio+1.912
 agreement+1.830
Neg logit contributions
icide-2.564
heastern-2.239
ingers-2.207
icides-2.159
ppers-2.113
Token to
Token ID284
Feature activation+6.06e-02

Pos logit contributions
 agre+2.822
 agreeing+2.717
 dere+2.669
lio+2.554
 agreement+2.467
Neg logit contributions
icide-2.833
heastern-2.439
ingers-2.404
icides-2.348
ppers-2.297
Token get
Token ID651
Feature activation+5.40e-01

Pos logit contributions
icide+0.371
heastern+0.319
ingers+0.318
icides+0.310
ppers+0.309
Neg logit contributions
 agre-0.313
 agreeing-0.298
 dere-0.288
lio-0.283
yrights-0.269
Token the
Token ID262
Feature activation+5.63e-03

Pos logit contributions
icide+2.891
heastern+2.446
ingers+2.395
 BACK+2.354
icides+2.338
Neg logit contributions
 agre-3.114
 agreeing-2.954
lio-2.902
 dere-2.834
andum-2.736
Token Lead
Token ID20116
Feature activation+2.65e-01

Pos logit contributions
icide+0.034
ingers+0.029
heastern+0.029
icides+0.028
ppers+0.028
Neg logit contributions
 agre-0.030
 agreeing-0.029
 dere-0.028
lio-0.027
andum-0.026
Tokenb
Token ID65
Feature activation+1.83e+00

Pos logit contributions
icide+1.492
ingers+1.304
heastern+1.282
ppers+1.229
icides+1.225
Neg logit contributions
 agre-1.484
 agreeing-1.410
 dere-1.382
lio-1.305
 agreement-1.280
Tokenelly
Token ID6148
Feature activation+6.34e-01

Pos logit contributions
icide+7.917
ingers+6.908
icides+6.221
ppers+6.069
ocy+6.007
Neg logit contributions
 agre-11.130
 dere-11.101
 agreeing-10.716
yrights-10.602
lio-10.248
Token records
Token ID4406
Feature activation+2.96e-01

Pos logit contributions
icide+3.686
ingers+3.236
heastern+3.090
icides+3.054
ppers+3.048
Neg logit contributions
 agre-3.331
 agreeing-3.190
 dere-3.065
lio-2.977
andum-2.860
Token out
Token ID503
Feature activation-1.40e-01

Pos logit contributions
icide+1.607
ingers+1.378
heastern+1.378
icides+1.310
ppers+1.309
Neg logit contributions
 agre-1.716
 agreeing-1.622
 dere-1.579
lio-1.573
andum-1.490
Token.
Token ID13
Feature activation-7.04e-02

Pos logit contributions
 agre+0.729
 agreeing+0.699
 dere+0.683
lio+0.655
 agreement+0.627
Neg logit contributions
icide-0.844
heastern-0.735
ingers-0.726
icides-0.708
ppers-0.694
Token
Token ID447
Feature activation+2.26e-02

Pos logit contributions
 agre+0.383
 agreeing+0.366
 dere+0.359
lio+0.343
 agreement+0.330
Neg logit contributions
icide-0.412
heastern-0.356
ingers-0.353
icides-0.343
ppers-0.337
Token drew
Token ID9859
Feature activation+3.16e-01

Pos logit contributions
icide+1.032
heastern+0.874
ingers+0.858
icides+0.838
ppers+0.828
Neg logit contributions
 agre-1.161
 agreeing-1.114
 dere-1.093
lio-1.081
andum-1.025
Token maybe
Token ID3863
Feature activation+5.04e-01

Pos logit contributions
icide+1.646
heastern+1.366
ingers+1.363
icides+1.334
ppers+1.294
Neg logit contributions
 agre-1.910
 agreeing-1.822
 dere-1.772
lio-1.745
andum-1.677
Token the
Token ID262
Feature activation+5.00e-01

Pos logit contributions
icide+2.565
heastern+2.130
ingers+2.099
icides+2.056
ppers+2.040
Neg logit contributions
 agre-3.007
 agreeing-2.837
 dere-2.811
lio-2.800
yrights-2.675
Token first
Token ID717
Feature activation+5.07e-01

Pos logit contributions
icide+2.915
ingers+2.479
heastern+2.479
icides+2.409
ppers+2.381
Neg logit contributions
 agre-2.644
 agreeing-2.502
 dere-2.404
lio-2.383
andum-2.287
Token three
Token ID1115
Feature activation+2.98e-01

Pos logit contributions
icide+2.711
heastern+2.218
ingers+2.206
icides+2.150
ppers+2.099
Neg logit contributions
 agre-2.936
 agreeing-2.783
 dere-2.697
lio-2.690
yrights-2.603
Token or
Token ID393
Feature activation+1.83e+00

Pos logit contributions
icide+1.664
ingers+1.407
heastern+1.401
icides+1.370
ppers+1.332
Neg logit contributions
 agre-1.693
 agreeing-1.589
 dere-1.553
lio-1.538
andum-1.477
Token four
Token ID1440
Feature activation+3.10e-01

Pos logit contributions
icide+6.451
ingers+5.275
heastern+4.904
ppers+4.808
icides+4.802
Neg logit contributions
 agre-12.278
 agreeing-11.640
 dere-11.476
lio-11.447
yrights-11.366
Token.
Token ID13
Feature activation+4.51e-01

Pos logit contributions
icide+1.765
ingers+1.511
heastern+1.498
icides+1.459
ppers+1.417
Neg logit contributions
 agre-1.722
 agreeing-1.614
 dere-1.575
lio-1.562
andum-1.493
Token And
Token ID843
Feature activation+4.09e-01

Pos logit contributions
icide+2.694
ingers+2.279
 BACK+2.246
heastern+2.220
icides+2.209
Neg logit contributions
 agre-2.401
 agreeing-2.224
 dere-2.189
lio-2.095
andum-1.991
Token the
Token ID262
Feature activation+3.15e-01

Pos logit contributions
icide+2.197
ingers+1.835
heastern+1.831
icides+1.765
ppers+1.765
Neg logit contributions
 agre-2.367
 agreeing-2.208
 dere-2.204
lio-2.187
yrights-2.062
Token idea
Token ID2126
Feature activation+5.14e-01

Pos logit contributions
icide+1.918
ingers+1.647
heastern+1.637
icides+1.596
ppers+1.558
Neg logit contributions
 agre-1.606
 agreeing-1.520
 dere-1.492
lio-1.449
elli-1.378
Token p
Token ID279
Feature activation-3.13e-01

Pos logit contributions
icide+2.430
ingers+2.068
heastern+2.063
icides+2.026
ppers+2.007
Neg logit contributions
 agre-2.256
 agreeing-2.128
lio-2.060
 dere-2.053
andum-1.983
Tokenatted
Token ID16898
Feature activation+2.70e-01

Pos logit contributions
 agre+1.717
 agreeing+1.648
 dere+1.614
lio+1.545
 agreement+1.486
Neg logit contributions
icide-1.823
heastern-1.573
ingers-1.558
icides-1.519
ppers-1.484
Token himself
Token ID2241
Feature activation+3.73e-01

Pos logit contributions
icide+1.162
heastern+0.956
ingers+0.926
icides+0.894
ppers+0.893
Neg logit contributions
 agre-1.885
 agreeing-1.795
 dere-1.761
lio-1.715
andum-1.680
Token on
Token ID319
Feature activation+1.13e+00

Pos logit contributions
icide+2.130
heastern+1.820
ingers+1.798
icides+1.728
ppers+1.724
Neg logit contributions
 agre-2.071
 agreeing-1.961
 dere-1.916
lio-1.882
andum-1.811
Token the
Token ID262
Feature activation+1.07e+00

Pos logit contributions
icide+5.569
ingers+4.726
heastern+4.657
ppers+4.508
 BACK+4.449
Neg logit contributions
 agre-6.733
 agreeing-6.435
 dere-6.275
lio-6.252
elli-6.028
Token back
Token ID736
Feature activation+1.78e+00

Pos logit contributions
icide+2.793
 BACK+2.317
ingers+2.156
heastern+2.046
ppers+1.990
Neg logit contributions
 agre-8.724
 agreeing-8.230
lio-8.167
elli-7.999
 dere-7.982
Token for
Token ID329
Feature activation+7.58e-02

Pos logit contributions
icide+7.788
ingers+6.379
heastern+6.287
icides+5.795
ommel+5.693
Neg logit contributions
 agre-10.969
lio-10.186
andum-10.061
 dere-9.906
 agreeing-9.771
Token
Token ID564
Feature activation-3.40e-01

Pos logit contributions
icide+0.453
heastern+0.393
ingers+0.391
icides+0.379
ppers+0.378
Neg logit contributions
 agre-0.398
 agreeing-0.368
 dere-0.365
lio-0.361
andum-0.344
Token
Token ID250
Feature activation-1.90e-02

Pos logit contributions
 agre+2.382
 agreeing+2.334
 dere+2.279
lio+2.203
elli+2.149
Neg logit contributions
icide-1.421
heastern-1.148
ingers-1.125
icides-1.105
ppers-1.057
Tokenpublic
Token ID11377
Feature activation-3.84e-01

Pos logit contributions
 agre+0.098
 agreeing+0.091
 dere+0.091
lio+0.087
 agreement+0.083
Neg logit contributions
icide-0.117
heastern-0.102
ingers-0.101
icides-0.099
ppers-0.097
Token service
Token ID2139
Feature activation-5.75e-03

Pos logit contributions
 agre+1.893
 agreeing+1.812
 dere+1.772
lio+1.689
 agreement+1.618
Neg logit contributions
icide-2.426
heastern-2.126
ingers-2.100
icides-2.053
ppers-2.013
Token sometimes
Token ID3360
Feature activation+5.56e-02

Pos logit contributions
icide+0.217
heastern+0.185
ingers+0.183
icides+0.178
ppers+0.176
Neg logit contributions
 agre-0.232
 agreeing-0.222
 dere-0.217
lio-0.211
elli-0.201
Token,
Token ID11
Feature activation-3.40e-02

Pos logit contributions
icide+0.314
heastern+0.271
ingers+0.267
icides+0.260
ppers+0.257
Neg logit contributions
 agre-0.309
 agreeing-0.296
 dere-0.289
lio-0.282
elli-0.268
Token especially
Token ID2592
Feature activation+5.98e-01

Pos logit contributions
 agre+0.189
 agreeing+0.179
 dere+0.176
lio+0.171
andum+0.162
Neg logit contributions
icide-0.194
heastern-0.167
ingers-0.166
icides-0.161
ppers-0.158
Token in
Token ID287
Feature activation+5.14e-01

Pos logit contributions
icide+3.038
ingers+2.593
heastern+2.582
ppers+2.511
icides+2.474
Neg logit contributions
 agre-3.571
 agreeing-3.319
 dere-3.284
lio-3.277
andum-3.137
Token B
Token ID347
Feature activation+9.02e-01

Pos logit contributions
icide+2.699
heastern+2.299
ingers+2.291
ppers+2.221
icides+2.200
Neg logit contributions
 agre-2.999
 agreeing-2.804
 dere-2.758
lio-2.716
yrights-2.623
Token.
Token ID13
Feature activation+1.74e+00

Pos logit contributions
icide+4.185
ingers+3.561
heastern+3.397
ommel+3.332
icides+3.319
Neg logit contributions
 agre-5.889
 dere-5.540
 agreeing-5.425
lio-5.358
 tenancy-5.099
TokenC
Token ID34
Feature activation+8.95e-01

Pos logit contributions
icide+3.957
heastern+2.408
Page+2.389
 BACK+2.370
ppers+2.355
Neg logit contributions
 agre-15.508
iqueness-13.178
 dere-13.004
solete-12.961
 agreeing-12.940
Token.,
Token ID1539
Feature activation+1.14e-01

Pos logit contributions
icide+4.731
heastern+4.106
ingers+4.007
icides+3.871
ppers+3.818
Neg logit contributions
 agre-5.205
 agreeing-4.945
 dere-4.905
lio-4.685
 agreement-4.495
Token some
Token ID617
Feature activation+2.74e-02

Pos logit contributions
icide+0.634
heastern+0.544
ingers+0.537
icides+0.522
ppers+0.514
Neg logit contributions
 agre-0.649
 agreeing-0.619
 dere-0.607
lio-0.589
andum-0.564
Token of
Token ID286
Feature activation+2.54e-01

Pos logit contributions
icide+0.157
heastern+0.136
ingers+0.135
icides+0.130
ppers+0.129
Neg logit contributions
 agre-0.151
 agreeing-0.145
 dere-0.141
lio-0.138
andum-0.131
Token the
Token ID262
Feature activation+1.56e-01

Pos logit contributions
icide+1.241
heastern+1.044
ingers+1.029
icides+0.992
ppers+0.978
Neg logit contributions
 agre-1.619
 agreeing-1.543
 dere-1.520
lio-1.474
elli-1.413
Token significantly
Token ID5566
Feature activation+2.45e-01

Pos logit contributions
icide+0.298
heastern+0.257
ingers+0.251
icides+0.245
ppers+0.239
Neg logit contributions
 agre-0.299
 agreeing-0.285
 dere-0.281
lio-0.272
andum-0.260
Token,
Token ID11
Feature activation-7.83e-02

Pos logit contributions
icide+1.280
heastern+1.085
ingers+1.063
icides+1.032
ppers+1.007
Neg logit contributions
 agre-1.480
 agreeing-1.398
 dere-1.390
lio-1.355
andum-1.298
Token completely
Token ID3190
Feature activation-4.38e-01

Pos logit contributions
 agre+0.437
 agreeing+0.416
 dere+0.412
lio+0.399
andum+0.381
Neg logit contributions
icide-0.446
heastern-0.382
ingers-0.377
icides-0.369
ppers-0.359
Token shutting
Token ID25136
Feature activation+3.70e-01

Pos logit contributions
 agre+2.371
 agreeing+2.289
 dere+2.238
lio+2.132
 agreement+2.064
Neg logit contributions
icide-2.555
heastern-2.221
ingers-2.187
icides-2.130
ppers-2.087
Token off
Token ID572
Feature activation+2.91e-01

Pos logit contributions
icide+1.525
heastern+1.229
ingers+1.204
icides+1.160
ppers+1.126
Neg logit contributions
 agre-2.611
 agreeing-2.516
 dere-2.478
lio-2.435
andum-2.365
Token fat
Token ID3735
Feature activation+1.71e+00

Pos logit contributions
icide+1.563
heastern+1.317
ingers+1.306
icides+1.277
ppers+1.233
Neg logit contributions
 agre-1.693
 agreeing-1.620
 dere-1.590
lio-1.568
andum-1.512
Token-
Token ID12
Feature activation+1.08e+00

Pos logit contributions
icide+8.668
heastern+7.005
icides+6.954
ingers+6.889
ocy+6.810
Neg logit contributions
 agre-9.237
lio-8.673
 dere-8.562
andum-8.464
 agreeing-8.455
Tokenburning
Token ID44313
Feature activation+7.99e-01

Pos logit contributions
icide+5.867
ingers+4.861
heastern+4.856
icides+4.717
chy+4.591
Neg logit contributions
 agre-6.142
 agreeing-5.561
 dere-5.502
andum-5.309
lio-5.236
Token and
Token ID290
Feature activation+4.18e-01

Pos logit contributions
icide+4.189
ingers+3.568
heastern+3.517
icides+3.432
 BACK+3.409
Neg logit contributions
 agre-4.526
 agreeing-4.269
lio-4.234
 dere-4.209
andum-4.093
Token also
Token ID635
Feature activation+3.19e-01

Pos logit contributions
icide+2.359
heastern+1.989
ingers+1.949
icides+1.927
 BACK+1.893
Neg logit contributions
 agre-2.289
 agreeing-2.151
 dere-2.149
lio-2.138
yrights-2.048
Token triggering
Token ID26555
Feature activation+3.58e-01

Pos logit contributions
icide+1.885
heastern+1.614
ingers+1.591
icides+1.570
ppers+1.529
Neg logit contributions
 agre-1.680
 dere-1.571
 agreeing-1.560
lio-1.555
andum-1.485
Token some
Token ID617
Feature activation+5.53e-02

Pos logit contributions
icide+0.772
ingers+0.657
heastern+0.657
icides+0.635
ppers+0.624
Neg logit contributions
 agre-0.813
 agreeing-0.768
 dere-0.762
lio-0.739
andum-0.717
Token of
Token ID286
Feature activation+2.03e-01

Pos logit contributions
icide+0.316
heastern+0.274
ingers+0.273
icides+0.261
ppers+0.259
Neg logit contributions
 agre-0.304
 agreeing-0.289
 dere-0.285
lio-0.276
andum-0.266
Token which
Token ID543
Feature activation+2.64e-01

Pos logit contributions
icide+1.025
heastern+0.868
ingers+0.854
icides+0.825
ppers+0.813
Neg logit contributions
 agre-1.266
 agreeing-1.212
 dere-1.192
lio-1.145
 agreement-1.108
Token involve
Token ID6211
Feature activation+3.43e-01

Pos logit contributions
icide+1.553
heastern+1.344
ingers+1.338
ppers+1.305
icides+1.286
Neg logit contributions
 agre-1.378
 agreeing-1.331
 dere-1.298
lio-1.266
andum-1.213
Token four
Token ID1440
Feature activation+5.24e-01

Pos logit contributions
icide+1.947
ingers+1.669
heastern+1.668
icides+1.612
ppers+1.589
Neg logit contributions
 agre-1.876
 agreeing-1.751
 dere-1.729
lio-1.701
yrights-1.644
Token or
Token ID393
Feature activation+1.70e+00

Pos logit contributions
icide+2.821
ingers+2.449
heastern+2.433
icides+2.332
ppers+2.304
Neg logit contributions
 agre-3.023
 agreeing-2.785
 dere-2.738
lio-2.721
andum-2.638
Token five
Token ID1936
Feature activation+4.44e-01

Pos logit contributions
icide+6.524
ingers+5.632
heastern+5.242
ppers+4.936
icides+4.834
Neg logit contributions
 agre-11.224
 dere-10.710
 agreeing-10.591
lio-10.354
yrights-10.297
Token cooking
Token ID10801
Feature activation+1.02e+00

Pos logit contributions
icide+2.466
ingers+2.172
heastern+2.147
icides+2.049
ppers+2.035
Neg logit contributions
 agre-2.468
 agreeing-2.293
 dere-2.255
lio-2.228
yrights-2.174
Token techniques
Token ID7605
Feature activation+5.43e-01

Pos logit contributions
icide+5.594
ingers+4.976
heastern+4.916
icides+4.875
ppers+4.749
Neg logit contributions
 agre-5.237
 agreeing-4.955
lio-4.765
 dere-4.764
iqueness-4.734
Token and
Token ID290
Feature activation-1.07e-02

Pos logit contributions
icide+2.908
ingers+2.462
heastern+2.456
icides+2.368
ommel+2.318
Neg logit contributions
 agre-3.203
 dere-2.967
 agreeing-2.959
lio-2.892
yrights-2.769
Token begin
Token ID2221
Feature activation+2.47e-02

Pos logit contributions
 agre+0.056
 agreeing+0.053
 dere+0.052
lio+0.050
andum+0.048
Neg logit contributions
icide-0.064
heastern-0.056
ingers-0.056
icides-0.054
ppers-0.053
Token calories
Token ID14653
Feature activation+7.47e-01

Pos logit contributions
icide+1.693
heastern+1.435
ingers+1.395
icides+1.377
 BACK+1.338
Neg logit contributions
 agre-1.799
 agreeing-1.723
 dere-1.696
lio-1.658
andum-1.595
Token to
Token ID284
Feature activation+1.99e-01

Pos logit contributions
icide+3.934
heastern+3.242
ingers+3.216
icides+3.164
 BACK+3.106
Neg logit contributions
 agre-4.453
 agreeing-4.169
 dere-4.161
lio-4.090
andum-3.906
Token be
Token ID307
Feature activation+8.87e-01

Pos logit contributions
icide+1.129
heastern+0.966
ingers+0.944
icides+0.927
ppers+0.925
Neg logit contributions
 agre-1.108
 agreeing-1.063
 dere-1.035
lio-1.008
andum-0.965
Token stored
Token ID8574
Feature activation+5.45e-01

Pos logit contributions
icide+4.363
heastern+3.553
ingers+3.472
icides+3.442
ppers+3.427
Neg logit contributions
 agre-5.420
 agreeing-5.153
lio-5.096
 dere-5.005
andum-4.882
Token as
Token ID355
Feature activation+2.51e-01

Pos logit contributions
icide+2.939
heastern+2.554
ingers+2.504
 BACK+2.391
icides+2.382
Neg logit contributions
 agre-3.139
 agreeing-2.978
 dere-2.893
lio-2.880
andum-2.754
Token fat
Token ID3735
Feature activation+1.68e+00

Pos logit contributions
icide+1.340
heastern+1.125
ingers+1.113
icides+1.090
ppers+1.066
Neg logit contributions
 agre-1.465
 agreeing-1.415
 dere-1.380
lio-1.366
andum-1.309
Token.
Token ID13
Feature activation+1.92e-01

Pos logit contributions
icide+7.990
heastern+6.337
icides+6.288
ingers+6.275
 BACK+6.133
Neg logit contributions
 agre-9.902
 dere-9.192
lio-9.178
 agreeing-9.169
andum-9.000
Token
Token ID198
Feature activation+1.36e-01

Pos logit contributions
icide+1.079
ingers+0.915
heastern+0.914
 BACK+0.889
icides+0.881
Neg logit contributions
 agre-1.093
 agreeing-1.027
 dere-1.015
lio-0.970
andum-0.937
Token
Token ID198
Feature activation+2.32e-01

Pos logit contributions
icide+0.739
ingers+0.618
heastern+0.614
ppers+0.596
icides+0.595
Neg logit contributions
 agre-0.865
 agreeing-0.766
 dere-0.751
lio-0.716
andum-0.695
TokenAfter
Token ID3260
Feature activation+5.15e-01

Pos logit contributions
icide+1.245
ingers+1.051
heastern+1.050
icides+1.006
ppers+1.004
Neg logit contributions
 agre-1.409
 agreeing-1.317
 dere-1.305
lio-1.218
yrights-1.188
Token the
Token ID262
Feature activation+1.32e-01

Pos logit contributions
icide+2.819
heastern+2.390
ingers+2.301
icides+2.279
ppers+2.261
Neg logit contributions
 agre-2.951
 dere-2.764
 agreeing-2.723
lio-2.690
yrights-2.607
TokenMoving
Token ID33622
Feature activation+4.70e-01

Pos logit contributions
 agre+0.799
 agreeing+0.752
 dere+0.744
lio+0.713
 agreement+0.688
Neg logit contributions
icide-0.742
heastern-0.627
ingers-0.624
icides-0.607
ppers-0.595
Token ash
Token ID12530
Feature activation+1.52e-01

Pos logit contributions
icide+2.746
heastern+2.306
icides+2.287
ingers+2.280
 BACK+2.197
Neg logit contributions
 agre-2.492
 agreeing-2.380
lio-2.264
 dere-2.234
yrights-2.187
Token from
Token ID422
Feature activation-4.22e-01

Pos logit contributions
icide+0.823
heastern+0.684
ingers+0.675
icides+0.674
ppers+0.645
Neg logit contributions
 agre-0.903
 agreeing-0.860
 dere-0.836
lio-0.826
andum-0.794
Token the
Token ID262
Feature activation-2.45e-01

Pos logit contributions
 agre+2.215
 agreeing+2.124
 dere+2.079
lio+1.992
 agreement+1.910
Neg logit contributions
icide-2.535
heastern-2.205
ingers-2.175
icides-2.124
ppers-2.081
Token River
Token ID5866
Feature activation+1.65e-01

Pos logit contributions
 agre+1.302
 agreeing+1.247
 dere+1.211
lio+1.172
elli+1.124
Neg logit contributions
icide-1.461
heastern-1.265
ingers-1.245
icides-1.222
ppers-1.195
Tokenb
Token ID65
Feature activation+1.67e+00

Pos logit contributions
icide+0.977
heastern+0.843
ingers+0.819
icides+0.806
ppers+0.788
Neg logit contributions
 agre-0.886
 agreeing-0.852
 dere-0.824
lio-0.801
yrights-0.774
Tokenend
Token ID437
Feature activation+1.71e-01

Pos logit contributions
icide+7.083
icides+5.350
ingers+5.170
heastern+4.809
ppers+4.594
Neg logit contributions
 agre-11.018
 agreeing-10.612
lio-10.499
 dere-10.308
yrights-9.985
Token site
Token ID2524
Feature activation+1.73e-01

Pos logit contributions
icide+1.055
heastern+0.911
ingers+0.893
icides+0.883
ppers+0.854
Neg logit contributions
 agre-0.869
 agreeing-0.838
 dere-0.803
lio-0.785
andum-0.756
Token to
Token ID284
Feature activation-8.51e-01

Pos logit contributions
icide+0.920
heastern+0.766
ingers+0.759
icides+0.743
ppers+0.716
Neg logit contributions
 agre-1.035
 agreeing-0.991
 dere-0.965
lio-0.952
andum-0.917
Token a
Token ID257
Feature activation-3.71e-01

Pos logit contributions
 agre+4.138
 agreeing+3.990
 dere+3.944
lio+3.691
 agreement+3.606
Neg logit contributions
icide-5.312
heastern-4.685
ingers-4.619
icides-4.501
ppers-4.390
Token lined
Token ID16566
Feature activation-9.99e-02

Pos logit contributions
 agre+1.893
 agreeing+1.813
 dere+1.768
lio+1.697
 agreement+1.621
Neg logit contributions
icide-2.287
heastern-1.994
ingers-1.968
icides-1.921
ppers-1.889
Token to
Token ID284
Feature activation+2.16e-01

Pos logit contributions
 agre+2.912
 agreeing+2.861
 dere+2.790
lio+2.628
 agreement+2.570
Neg logit contributions
icide-3.079
heastern-2.702
ingers-2.654
icides-2.566
ppers-2.505
Token buying
Token ID7067
Feature activation-2.90e-02

Pos logit contributions
icide+1.300
ingers+1.097
heastern+1.089
icides+1.085
ppers+1.070
Neg logit contributions
 agre-1.142
 agreeing-1.040
 dere-1.039
lio-1.028
yrights-0.989
Token drugs
Token ID5010
Feature activation+2.77e-01

Pos logit contributions
 agre+0.146
 agreeing+0.138
 dere+0.134
lio+0.131
elli+0.124
Neg logit contributions
icide-0.182
heastern-0.156
ingers-0.155
icides-0.154
ppers-0.151
Token "
Token ID366
Feature activation-9.18e-02

Pos logit contributions
icide+1.504
ingers+1.257
heastern+1.242
icides+1.220
ppers+1.194
Neg logit contributions
 agre-1.637
 agreeing-1.518
 dere-1.506
lio-1.478
andum-1.424
Tokenfive
Token ID13261
Feature activation+3.35e-01

Pos logit contributions
 agre+0.482
 agreeing+0.455
 dere+0.445
lio+0.421
 agreement+0.410
Neg logit contributions
icide-0.566
heastern-0.489
ingers-0.488
icides-0.473
ppers-0.463
Token or
Token ID393
Feature activation+1.66e+00

Pos logit contributions
icide+2.133
ingers+1.856
heastern+1.818
icides+1.804
ppers+1.756
Neg logit contributions
 agre-1.679
 agreeing-1.544
 dere-1.479
lio-1.473
andum-1.400
Token six
Token ID2237
Feature activation+2.63e-01

Pos logit contributions
icide+5.255
ingers+4.396
heastern+3.766
ppers+3.760
icides+3.645
Neg logit contributions
 agre-12.300
 dere-11.434
 agreeing-11.306
yrights-11.210
lio-11.023
Token times
Token ID1661
Feature activation+3.93e-01

Pos logit contributions
icide+1.694
ingers+1.497
heastern+1.459
icides+1.440
ppers+1.418
Neg logit contributions
 agre-1.280
 agreeing-1.177
 dere-1.130
lio-1.123
yrights-1.064
Token"
Token ID1
Feature activation-2.32e-01

Pos logit contributions
icide+2.222
ingers+1.864
heastern+1.830
icides+1.811
ppers+1.791
Neg logit contributions
 agre-2.209
 agreeing-2.046
lio-2.041
 dere-2.016
yrights-1.931
Token from
Token ID422
Feature activation-7.12e-01

Pos logit contributions
 agre+1.301
 agreeing+1.238
 dere+1.219
lio+1.176
 agreement+1.123
Neg logit contributions
icide-1.318
heastern-1.130
ingers-1.118
icides-1.089
ppers-1.066
Token Jones
Token ID5437
Feature activation-3.74e-01

Pos logit contributions
 agre+3.692
 agreeing+3.643
 dere+3.582
lio+3.349
 agreement+3.293
Neg logit contributions
icide-4.205
heastern-3.708
ingers-3.625
icides-3.531
uliffe-3.411
Token would
Token ID561
Feature activation-5.92e-01

Pos logit contributions
icide+0.322
heastern+0.276
ingers+0.274
icides+0.265
ppers+0.260
Neg logit contributions
 agre-0.348
 agreeing-0.332
 dere-0.325
lio-0.318
andum-0.304
Token put
Token ID1234
Feature activation-4.10e-01

Pos logit contributions
 agre+2.848
 agreeing+2.733
 dere+2.693
lio+2.525
 agreement+2.450
Neg logit contributions
icide-3.786
heastern-3.336
ingers-3.284
icides-3.221
ppers-3.137
Token wind
Token ID2344
Feature activation+3.00e-01

Pos logit contributions
 agre+2.334
 agreeing+2.271
 dere+2.233
lio+2.127
 agreement+2.079
Neg logit contributions
icide-2.259
heastern-1.927
ingers-1.897
icides-1.857
ppers-1.805
Token at
Token ID379
Feature activation+4.31e-01

Pos logit contributions
icide+1.582
ingers+1.385
heastern+1.360
icides+1.303
ppers+1.291
Neg logit contributions
 agre-1.796
 agreeing-1.686
 dere-1.654
andum-1.578
lio-1.576
Token the
Token ID262
Feature activation+7.56e-01

Pos logit contributions
icide+2.302
heastern+1.984
ingers+1.975
icides+1.890
ppers+1.865
Neg logit contributions
 agre-2.533
 agreeing-2.404
 dere-2.369
lio-2.296
elli-2.189
Token back
Token ID736
Feature activation+1.63e+00

Pos logit contributions
icide+3.312
ingers+2.846
heastern+2.810
icides+2.593
 BACK+2.576
Neg logit contributions
 agre-4.979
 agreeing-4.644
lio-4.621
 dere-4.517
yrights-4.453
Token of
Token ID286
Feature activation+6.01e-01

Pos logit contributions
icide+8.075
ingers+7.050
heastern+6.982
icides+6.529
ppers+6.442
Neg logit contributions
 agre-9.258
 dere-8.798
 agreeing-8.786
lio-8.626
andum-8.376
Token the
Token ID262
Feature activation+3.53e-01

Pos logit contributions
icide+3.205
ingers+2.811
heastern+2.794
ppers+2.667
icides+2.647
Neg logit contributions
 agre-3.446
 agreeing-3.206
 dere-3.169
lio-3.084
yrights-3.029
Token opposition
Token ID5471
Feature activation+6.28e-01

Pos logit contributions
icide+1.739
ingers+1.483
heastern+1.477
ppers+1.399
icides+1.394
Neg logit contributions
 agre-2.197
 agreeing-2.079
 dere-2.033
lio-2.018
yrights-1.972
Token party
Token ID2151
Feature activation+2.24e-02

Pos logit contributions
icide+3.448
heastern+2.981
ingers+2.945
icides+2.822
ppers+2.811
Neg logit contributions
 agre-3.513
 agreeing-3.317
 dere-3.279
lio-3.264
yrights-3.101
Token,
Token ID11
Feature activation-4.82e-01

Pos logit contributions
icide+0.133
heastern+0.116
ingers+0.115
icides+0.111
ppers+0.110
Neg logit contributions
 agre-0.119
 agreeing-0.113
 dere-0.111
lio-0.107
andum-0.102
Token Alert
Token ID23276
Feature activation+6.50e-01

Pos logit contributions
icide+2.747
heastern+2.383
ingers+2.353
icides+2.250
ppers+2.231
Neg logit contributions
 agre-2.549
 agreeing-2.419
 dere-2.358
lio-2.246
 agreement-2.180
Tokens
Token ID82
Feature activation+4.27e-01

Pos logit contributions
icide+3.674
ingers+3.149
heastern+3.137
icides+2.996
ppers+2.966
Neg logit contributions
 agre-3.628
 agreeing-3.371
 dere-3.344
lio-3.126
iqueness-3.058
Token newsletter
Token ID13129
Feature activation+1.29e+00

Pos logit contributions
icide+2.316
heastern+1.956
ingers+1.944
icides+1.865
ppers+1.853
Neg logit contributions
 agre-2.506
 agreeing-2.357
 dere-2.325
lio-2.229
andum-2.143
Token delivered
Token ID6793
Feature activation+1.27e+00

Pos logit contributions
icide+6.602
heastern+5.505
ingers+5.389
icides+5.141
 BACK+5.138
Neg logit contributions
 agre-7.534
 dere-6.998
 agreeing-6.920
lio-6.586
iqueness-6.570
Token to
Token ID284
Feature activation+1.41e+00

Pos logit contributions
icide+5.847
heastern+4.983
chy+4.462
ingers+4.453
ppers+4.390
Neg logit contributions
 agre-8.009
 dere-7.484
 agreeing-7.453
lio-7.255
 Malf-6.951
Token your
Token ID534
Feature activation+1.62e+00

Pos logit contributions
icide+4.105
heastern+3.029
ingers+2.926
 BACK+2.824
ppers+2.782
Neg logit contributions
 agre-11.000
 dere-10.418
 agreeing-10.400
andum-10.116
lio-10.090
Token inbox
Token ID13734
Feature activation+1.01e+00

Pos logit contributions
icide+4.563
heastern+3.093
ingers+3.054
ppers+2.891
 BACK+2.868
Neg logit contributions
 agre-12.359
 agreeing-11.806
andum-11.688
 dere-11.644
lio-11.615
Token We
Token ID775
Feature activation+1.05e+00

Pos logit contributions
icide+4.874
heastern+3.982
ingers+3.817
icides+3.724
ppers+3.610
Neg logit contributions
 agre-6.405
 dere-5.986
 agreeing-5.941
lio-5.696
andum-5.598
Token're
Token ID821
Feature activation+9.33e-01

Pos logit contributions
icide+5.948
ingers+4.982
heastern+4.936
ppers+4.884
icides+4.834
Neg logit contributions
 agre-5.353
 dere-5.148
lio-5.074
 agreeing-5.056
elli-4.857
Token sorry
Token ID7926
Feature activation+1.26e+00

Pos logit contributions
icide+5.348
ingers+4.576
 BACK+4.501
heastern+4.481
icides+4.390
Neg logit contributions
 agre-4.924
lio-4.480
 agreeing-4.411
 dere-4.365
andum-4.227
Token,
Token ID11
Feature activation+6.46e-01

Pos logit contributions
icide+6.004
heastern+4.839
icides+4.734
ingers+4.649
ppers+4.533
Neg logit contributions
 agre-8.027
 agreeing-7.322
 dere-7.297
lio-6.990
iqueness-6.878
Token has
Token ID468
Feature activation+1.19e-01

Pos logit contributions
 agre+0.370
 agreeing+0.356
 dere+0.348
lio+0.337
andum+0.321
Neg logit contributions
icide-0.390
heastern-0.338
ingers-0.333
icides-0.324
ppers-0.317
Token preserved
Token ID17232
Feature activation+1.46e-01

Pos logit contributions
icide+0.669
heastern+0.573
ingers+0.569
icides+0.553
ppers+0.544
Neg logit contributions
 agre-0.672
 agreeing-0.637
 dere-0.615
lio-0.614
andum-0.582
Token a
Token ID257
Feature activation+1.52e-01

Pos logit contributions
icide+0.841
heastern+0.733
ingers+0.720
icides+0.702
ppers+0.686
Neg logit contributions
 agre-0.799
 agreeing-0.762
 dere-0.728
lio-0.720
elli-0.687
Token striking
Token ID8871
Feature activation-3.32e-01

Pos logit contributions
icide+0.915
heastern+0.806
ingers+0.788
icides+0.763
ppers+0.753
Neg logit contributions
 agre-0.786
 agreeing-0.751
 dere-0.713
lio-0.710
andum-0.678
Token car
Token ID1097
Feature activation-1.17e-01

Pos logit contributions
 agre+1.733
 agreeing+1.662
 dere+1.627
lio+1.558
 agreement+1.493
Neg logit contributions
icide-2.001
heastern-1.742
ingers-1.718
icides-1.678
ppers-1.644
Tokenap
Token ID499
Feature activation+1.62e+00

Pos logit contributions
 agre+0.686
 agreeing+0.663
 dere+0.645
lio+0.625
 agreement+0.601
Neg logit contributions
icide-0.632
heastern-0.539
ingers-0.530
icides-0.516
ppers-0.509
Tokenace
Token ID558
Feature activation-1.81e-01

Pos logit contributions
icide+7.618
ingers+5.805
heastern+5.680
ommel+5.600
icides+5.526
Neg logit contributions
 agre-10.433
 agreeing-10.078
 dere-9.441
 redes-9.091
 Malf-8.898
Token of
Token ID286
Feature activation-3.28e-01

Pos logit contributions
 agre+0.993
 agreeing+0.955
 dere+0.932
lio+0.903
 agreement+0.858
Neg logit contributions
icide-1.047
heastern-0.908
ingers-0.891
icides-0.866
ppers-0.848
Token multi
Token ID5021
Feature activation-3.40e-02

Pos logit contributions
 agre+1.737
 agreeing+1.666
 dere+1.625
lio+1.564
 agreement+1.496
Neg logit contributions
icide-1.954
heastern-1.697
ingers-1.674
icides-1.636
ppers-1.602
Token-
Token ID12
Feature activation+1.93e-02

Pos logit contributions
 agre+0.204
 agreeing+0.197
 dere+0.194
lio+0.186
 agreement+0.180
Neg logit contributions
icide-0.179
heastern-0.153
ingers-0.150
icides-0.146
ppers-0.142
Tokenlay
Token ID10724
Feature activation+4.83e-01

Pos logit contributions
icide+0.117
heastern+0.101
ingers+0.100
icides+0.097
ppers+0.096
Neg logit contributions
 agre-0.101
 agreeing-0.097
 dere-0.093
lio-0.090
 agreement-0.086
Token was
Token ID373
Feature activation-2.02e-01

Pos logit contributions
icide+2.005
heastern+1.714
ingers+1.643
icides+1.600
ppers+1.570
Neg logit contributions
 agre-2.218
 agreeing-2.125
 dere-2.100
lio-2.044
andum-1.990
Token thought
Token ID1807
Feature activation-2.62e-03

Pos logit contributions
 agre+1.105
 agreeing+1.060
 dere+1.040
lio+0.999
 agreement+0.957
Neg logit contributions
icide-1.166
heastern-1.009
ingers-0.994
icides-0.969
ppers-0.949
Token to
Token ID284
Feature activation+2.07e-02

Pos logit contributions
 agre+0.015
 agreeing+0.014
 dere+0.014
lio+0.013
 agreement+0.013
Neg logit contributions
icide-0.015
heastern-0.013
ingers-0.013
icides-0.012
ppers-0.012
Token have
Token ID423
Feature activation+4.34e-01

Pos logit contributions
icide+0.128
heastern+0.112
ingers+0.110
icides+0.107
ppers+0.107
Neg logit contributions
 agre-0.105
 agreeing-0.100
 dere-0.098
lio-0.095
elli-0.091
Token only
Token ID691
Feature activation+1.28e+00

Pos logit contributions
icide+2.438
heastern+2.151
ingers+2.044
icides+2.016
ppers+1.993
Neg logit contributions
 agre-2.357
 agreeing-2.237
 dere-2.205
lio-2.169
andum-2.079
Token a
Token ID257
Feature activation+1.62e+00

Pos logit contributions
icide+6.329
heastern+5.796
 BACK+5.278
ppers+5.258
ingers+5.166
Neg logit contributions
 agre-6.829
lio-6.508
 dere-6.471
 agreeing-6.452
andum-6.362
Token few
Token ID1178
Feature activation+2.53e-01

Pos logit contributions
icide+7.847
heastern+7.220
 subset+6.908
 BACK+6.419
ppers+6.311
Neg logit contributions
 agre-8.741
 agreeing-8.213
yrights-8.164
lio-8.073
elli-7.861
Token bacterial
Token ID23462
Feature activation-1.31e-02

Pos logit contributions
icide+1.422
heastern+1.256
ingers+1.208
icides+1.177
ppers+1.165
Neg logit contributions
 agre-1.397
 agreeing-1.318
 dere-1.294
lio-1.288
andum-1.242
Token species
Token ID4693
Feature activation+5.15e-01

Pos logit contributions
 agre+0.070
 agreeing+0.067
 dere+0.065
lio+0.063
elli+0.060
Neg logit contributions
icide-0.078
heastern-0.068
ingers-0.067
icides-0.066
ppers-0.064
Token and
Token ID290
Feature activation+7.43e-01

Pos logit contributions
icide+2.786
heastern+2.357
ingers+2.314
icides+2.269
ppers+2.200
Neg logit contributions
 agre-3.002
 agreeing-2.791
 dere-2.790
lio-2.754
andum-2.631
Token mostly
Token ID4632
Feature activation+4.26e-01

Pos logit contributions
icide+4.046
heastern+3.525
icides+3.362
ingers+3.353
ppers+3.326
Neg logit contributions
 agre-4.035
 agreeing-3.797
lio-3.756
 dere-3.739
yrights-3.608
Token the
Token ID262
Feature activation+3.14e-01

Pos logit contributions
icide+0.425
heastern+0.361
ingers+0.356
icides+0.348
ppers+0.342
Neg logit contributions
 agre-0.444
 agreeing-0.420
 dere-0.415
lio-0.403
andum-0.386
Token left
Token ID1364
Feature activation-4.04e-02

Pos logit contributions
icide+1.765
heastern+1.508
ingers+1.478
icides+1.448
ppers+1.428
Neg logit contributions
 agre-1.783
 agreeing-1.659
 dere-1.618
lio-1.597
elli-1.544
Token:
Token ID25
Feature activation-1.45e-01

Pos logit contributions
 agre+0.229
 agreeing+0.219
 dere+0.216
lio+0.209
 agreement+0.199
Neg logit contributions
icide-0.226
heastern-0.195
ingers-0.192
icides-0.187
ppers-0.184
Token Bo
Token ID3248
Feature activation+3.42e-01

Pos logit contributions
 agre+0.793
 agreeing+0.759
 dere+0.745
lio+0.718
 agreement+0.685
Neg logit contributions
icide-0.841
heastern-0.726
ingers-0.717
icides-0.698
ppers-0.686
Token Scar
Token ID12089
Feature activation-6.09e-01

Pos logit contributions
icide+1.833
heastern+1.548
ingers+1.528
icides+1.490
ppers+1.476
Neg logit contributions
 agre-2.063
 agreeing-1.959
 dere-1.921
lio-1.872
andum-1.780
Tokenb
Token ID65
Feature activation+1.61e+00

Pos logit contributions
 agre+2.566
 agreeing+2.453
 dere+2.368
lio+2.271
elli+2.180
Neg logit contributions
icide-4.216
heastern-3.793
ingers-3.725
icides-3.638
ppers-3.567
Tokenrough
Token ID740
Feature activation+4.63e-01

Pos logit contributions
icide+6.219
icides+4.436
chy+4.297
ommel+4.266
ingers+4.244
Neg logit contributions
 agre-11.524
 agreeing-10.941
 dere-10.429
 tenancy-9.993
lio-9.984
Token in
Token ID287
Feature activation-2.56e-02

Pos logit contributions
icide+2.516
heastern+2.154
ingers+2.152
icides+2.048
ppers+2.035
Neg logit contributions
 agre-2.664
 agreeing-2.478
 dere-2.470
lio-2.397
andum-2.328
Token 10
Token ID838
Feature activation-4.96e-02

Pos logit contributions
 agre+0.141
 agreeing+0.133
 dere+0.130
lio+0.126
andum+0.122
Neg logit contributions
icide-0.148
heastern-0.127
ingers-0.127
icides-0.123
ppers-0.122
Tokenth
Token ID400
Feature activation+1.85e-02

Pos logit contributions
 agre+0.254
 agreeing+0.240
 dere+0.235
lio+0.226
andum+0.216
Neg logit contributions
icide-0.308
heastern-0.267
ingers-0.265
icides-0.258
ppers-0.253
Token grade
Token ID9559
Feature activation+8.27e-01

Pos logit contributions
icide+0.102
ingers+0.086
heastern+0.085
icides+0.083
ppers+0.081
Neg logit contributions
 agre-0.107
 agreeing-0.102
 dere-0.101
lio-0.099
andum-0.094
Token care
Token ID1337
Feature activation-2.29e-02

Pos logit contributions
 agre+0.391
 agreeing+0.372
 dere+0.363
lio+0.346
 agreement+0.327
Neg logit contributions
icide-0.596
heastern-0.527
ingers-0.521
icides-0.511
ppers-0.502
Token bill
Token ID2855
Feature activation+4.35e-01

Pos logit contributions
 agre+0.127
 agreeing+0.122
 dere+0.120
lio+0.117
andum+0.112
Neg logit contributions
icide-0.130
heastern-0.111
ingers-0.110
icides-0.107
ppers-0.106
Token,
Token ID11
Feature activation+4.20e-01

Pos logit contributions
icide+2.280
heastern+1.931
ingers+1.902
 BACK+1.842
ppers+1.817
Neg logit contributions
 agre-2.556
 dere-2.399
 agreeing-2.353
lio-2.330
andum-2.267
Token especially
Token ID2592
Feature activation+6.44e-01

Pos logit contributions
icide+2.141
ingers+1.787
heastern+1.761
icides+1.718
ppers+1.679
Neg logit contributions
 agre-2.559
 dere-2.371
 agreeing-2.340
lio-2.333
andum-2.251
Token since
Token ID1201
Feature activation+5.57e-02

Pos logit contributions
icide+3.151
ingers+2.701
heastern+2.661
ppers+2.579
 BACK+2.556
Neg logit contributions
 agre-3.874
lio-3.628
 dere-3.616
 agreeing-3.540
andum-3.490
Token Rep
Token ID1432
Feature activation+1.59e+00

Pos logit contributions
icide+0.320
heastern+0.276
ingers+0.274
icides+0.266
ppers+0.263
Neg logit contributions
 agre-0.305
 agreeing-0.285
 dere-0.285
lio-0.276
andum-0.266
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
icide+7.560
heastern+6.430
ingers+6.414
ppers+6.254
icides+6.045
Neg logit contributions
 agre-9.353
 dere-8.776
 agreeing-8.723
lio-8.237
iqueness-8.185
Token Tom
Token ID4186
Feature activation+3.94e-01

Pos logit contributions
icide+4.926
ingers+4.104
 BACK+4.043
ppers+4.035
heastern+3.908
Neg logit contributions
 agre-6.405
 agreeing-5.822
 dere-5.781
andum-5.504
iqueness-5.487
Token MacArthur
Token ID46626
Feature activation+5.11e-01

Pos logit contributions
icide+2.165
heastern+1.839
ingers+1.833
ppers+1.785
icides+1.762
Neg logit contributions
 agre-2.278
 agreeing-2.155
 dere-2.103
lio-1.989
andum-1.975
Token of
Token ID286
Feature activation+4.09e-01

Pos logit contributions
icide+2.576
heastern+2.181
ingers+2.177
ppers+2.062
icides+2.059
Neg logit contributions
 agre-3.096
 agreeing-2.942
 dere-2.938
lio-2.829
yrights-2.766
Token New
Token ID968
Feature activation-3.39e-02

Pos logit contributions
icide+1.901
heastern+1.561
ingers+1.535
ppers+1.482
icides+1.471
Neg logit contributions
 agre-2.743
 agreeing-2.537
 dere-2.508
lio-2.443
andum-2.392
Token &
Token ID1222
Feature activation-1.71e-01

Pos logit contributions
icide+2.871
ingers+2.487
heastern+2.457
icides+2.409
ppers+2.373
Neg logit contributions
 agre-2.453
 agreeing-2.310
 dere-2.282
lio-2.134
andum-2.071
Token Bonnie
Token ID39203
Feature activation+5.90e-02

Pos logit contributions
 agre+0.934
 agreeing+0.897
 dere+0.879
lio+0.843
 agreement+0.810
Neg logit contributions
icide-0.995
heastern-0.861
ingers-0.849
icides-0.829
ppers-0.811
Token &
Token ID1222
Feature activation+1.42e-01

Pos logit contributions
icide+0.348
heastern+0.301
ingers+0.298
icides+0.290
ppers+0.285
Neg logit contributions
 agre-0.318
 agreeing-0.304
 dere-0.299
lio-0.285
 agreement-0.274
Token Baby
Token ID14801
Feature activation+2.24e-01

Pos logit contributions
icide+0.816
heastern+0.691
ingers+0.691
icides+0.675
ppers+0.665
Neg logit contributions
 agre-0.790
 agreeing-0.749
 dere-0.734
lio-0.705
 agreement-0.677
Token 2008
Token ID3648
Feature activation+4.44e-01

Pos logit contributions
icide+1.286
heastern+1.102
ingers+1.100
icides+1.065
ppers+1.052
Neg logit contributions
 agre-1.246
 agreeing-1.189
 dere-1.170
lio-1.112
 agreement-1.074
Token01
Token ID486
Feature activation+1.59e+00

Pos logit contributions
icide+2.319
heastern+1.951
ingers+1.934
icides+1.879
ppers+1.831
Neg logit contributions
 agre-2.693
 agreeing-2.567
 dere-2.536
lio-2.458
 agreement-2.345
Token09
Token ID2931
Feature activation+6.35e-01

Pos logit contributions
icide+5.531
ingers+4.443
heastern+4.368
+4.140
icides+4.075
Neg logit contributions
 agre-11.439
 dere-10.692
 agreeing-10.627
lio-10.435
elli-10.058
Token O
Token ID440
Feature activation+8.19e-02

Pos logit contributions
icide+2.349
 BACK+1.818
ingers+1.764
ppers+1.697
icides+1.670
Neg logit contributions
 agre-4.985
 agreeing-4.509
 dere-4.464
lio-4.324
andum-4.246
Token&
Token ID5
Feature activation+5.14e-01

Pos logit contributions
icide+0.595
ingers+0.522
heastern+0.522
icides+0.509
ppers+0.508
Neg logit contributions
 agre-0.356
 agreeing-0.312
 dere-0.308
lio-0.284
andum-0.269
TokenA
Token ID32
Feature activation+3.38e-01

Pos logit contributions
icide+2.172
ingers+1.679
 BACK+1.625
heastern+1.623
+1.618
Neg logit contributions
 agre-3.950
 agreeing-3.439
 dere-3.415
 redes-3.265
lio-3.253
Token-
Token ID12
Feature activation+2.43e-01

Pos logit contributions
icide+1.502
ingers+1.202
heastern+1.178
ppers+1.147
 BACK+1.138
Neg logit contributions
 agre-2.475
 agreeing-2.214
 dere-2.185
lio-2.085
andum-2.043
Token religion
Token ID5737
Feature activation-1.23e-01

Pos logit contributions
 agre+0.442
 agreeing+0.421
 dere+0.411
lio+0.400
elli+0.386
Neg logit contributions
icide-0.441
heastern-0.378
ingers-0.372
icides-0.365
ppers-0.360
Token in
Token ID287
Feature activation-7.96e-02

Pos logit contributions
 agre+0.665
 agreeing+0.638
 dere+0.624
lio+0.605
andum+0.581
Neg logit contributions
icide-0.723
heastern-0.625
ingers-0.613
icides-0.602
ppers-0.589
Token Brazil
Token ID7595
Feature activation+1.82e-01

Pos logit contributions
 agre+0.509
 agreeing+0.487
 dere+0.476
lio+0.461
elli+0.448
Neg logit contributions
icide-0.389
heastern-0.327
ingers-0.319
icides-0.311
ppers-0.305
Token are
Token ID389
Feature activation-3.80e-01

Pos logit contributions
icide+1.048
heastern+0.904
ingers+0.882
icides+0.868
ppers+0.851
Neg logit contributions
 agre-0.995
 agreeing-0.954
 dere-0.943
lio-0.897
andum-0.867
Token inex
Token ID16087
Feature activation-3.84e-01

Pos logit contributions
 agre+1.911
 agreeing+1.823
 dere+1.785
lio+1.710
 agreement+1.631
Neg logit contributions
icide-2.372
heastern-2.073
ingers-2.049
icides-2.000
ppers-1.963
Tokenact
Token ID529
Feature activation-1.74e+00

Pos logit contributions
 agre+2.226
 agreeing+2.144
 dere+2.104
lio+2.022
 agreement+1.953
Neg logit contributions
icide-2.094
heastern-1.796
ingers-1.767
icides-1.726
ppers-1.679
Token.
Token ID13
Feature activation-9.69e-01

Pos logit contributions
 agre+7.753
 agreeing+7.630
 dere+7.400
lio+7.133
 agreement+7.061
Neg logit contributions
icide-10.281
heastern-9.170
ingers-8.871
ebus-8.718
icides-8.694
Token According
Token ID4784
Feature activation-5.64e-01

Pos logit contributions
 agre+4.532
 agreeing+4.485
 dere+4.415
lio+4.259
 Stat+4.192
Neg logit contributions
icide-6.010
heastern-5.394
ingers-5.275
icides-5.123
uliffe-5.107
Token to
Token ID284
Feature activation-7.20e-01

Pos logit contributions
 agre+3.077
 agreeing+3.046
 dere+2.986
lio+2.850
 agreement+2.782
Neg logit contributions
icide-3.153
heastern-2.740
ingers-2.710
icides-2.630
ppers-2.565
Token the
Token ID262
Feature activation-6.48e-01

Pos logit contributions
 agre+3.735
 agreeing+3.604
 dere+3.536
lio+3.370
 agreement+3.259
Neg logit contributions
icide-4.326
heastern-3.786
ingers-3.744
icides-3.633
ppers-3.556
Token census
Token ID21649
Feature activation-6.36e-01

Pos logit contributions
 agre+3.403
 agreeing+3.272
 dere+3.217
lio+3.078
 agreement+2.962
Neg logit contributions
icide-3.874
heastern-3.372
ingers-3.335
icides-3.250
ppers-3.181
Token America
Token ID2253
Feature activation+1.19e-02

Pos logit contributions
icide+1.223
heastern+0.842
ingers+0.783
icides+0.744
ppers+0.739
Neg logit contributions
 agre-3.805
 agreeing-3.488
 dere-3.468
lio-3.386
andum-3.293
Token,
Token ID11
Feature activation-2.76e-01

Pos logit contributions
icide+0.058
heastern+0.048
ingers+0.047
icides+0.046
ppers+0.045
Neg logit contributions
 agre-0.077
 agreeing-0.074
 dere-0.073
lio-0.071
andum-0.068
Token C
Token ID327
Feature activation-9.03e-01

Pos logit contributions
 agre+1.425
 agreeing+1.358
 dere+1.334
lio+1.277
 agreement+1.220
Neg logit contributions
icide-1.690
heastern-1.471
ingers-1.451
icides-1.418
ppers-1.390
Tokeniti
Token ID8846
Feature activation-6.39e-01

Pos logit contributions
 agre+4.204
 agreeing+4.188
 dere+4.076
lio+3.892
elli+3.831
Neg logit contributions
icide-5.614
heastern-4.900
icides-4.865
ingers-4.828
ppers-4.660
Token,
Token ID11
Feature activation-7.76e-01

Pos logit contributions
 agre+3.073
 agreeing+3.070
 dere+2.906
lio+2.788
elli+2.741
Neg logit contributions
icide-3.892
ingers-3.420
heastern-3.411
icides-3.359
ppers-3.277
Token Chase
Token ID16346
Feature activation-1.69e+00

Pos logit contributions
 agre+3.511
 agreeing+3.448
 dere+3.322
lio+3.152
 agreement+3.053
Neg logit contributions
icide-5.074
heastern-4.514
ingers-4.468
icides-4.367
ppers-4.295
Token,
Token ID11
Feature activation-8.91e-01

Pos logit contributions
 agreeing+6.967
 agre+6.667
 dere+6.184
 agreement+6.102
 Stat+6.009
Neg logit contributions
icide-10.590
heastern-9.384
ingers-9.328
icides-9.326
ommel-8.824
Token Wells
Token ID18292
Feature activation-7.74e-01

Pos logit contributions
 agre+3.978
 agreeing+3.939
 dere+3.777
lio+3.574
 agreement+3.485
Neg logit contributions
icide-5.811
heastern-5.186
ingers-5.141
icides-5.015
ppers-4.936
Token Fargo
Token ID28125
Feature activation-9.89e-01

Pos logit contributions
 agreeing+3.334
 agre+3.331
 dere+3.197
lio+3.055
 agreement+2.976
Neg logit contributions
icide-5.143
heastern-4.537
ingers-4.502
icides-4.462
ppers-4.340
Token,
Token ID11
Feature activation-1.02e+00

Pos logit contributions
 agreeing+4.379
 agre+4.319
 dere+4.146
 agreement+3.894
elli+3.844
Neg logit contributions
icide-6.158
heastern-5.482
ingers-5.448
icides-5.284
uliffe-5.177
Token and
Token ID290
Feature activation-1.09e+00

Pos logit contributions
 agreeing+4.423
 agre+4.422
 dere+4.238
lio+3.998
 Stat+3.912
Neg logit contributions
icide-6.661
heastern-5.958
ingers-5.926
icides-5.765
ppers-5.684
Token of
Token ID286
Feature activation-7.02e-01

Pos logit contributions
 agre+3.467
 agreeing+3.329
 dere+3.271
lio+3.081
 agreement+3.013
Neg logit contributions
icide-3.922
heastern-3.421
ingers-3.357
icides-3.291
ppers-3.217
Token a
Token ID257
Feature activation-7.93e-01

Pos logit contributions
 agre+3.462
 agreeing+3.358
 dere+3.288
lio+3.093
 agreement+3.036
Neg logit contributions
icide-4.368
heastern-3.848
ingers-3.789
icides-3.699
ppers-3.610
Token military
Token ID2422
Feature activation-7.02e-01

Pos logit contributions
 agre+4.063
 agreeing+3.926
 dere+3.905
lio+3.630
 agreement+3.589
Neg logit contributions
icide-4.770
heastern-4.163
ingers-4.084
icides-4.035
ppers-3.907
Token response
Token ID2882
Feature activation-4.24e-01

Pos logit contributions
 agre+3.652
 agreeing+3.497
 dere+3.445
lio+3.241
 agreement+3.198
Neg logit contributions
icide-4.217
heastern-3.709
ingers-3.656
icides-3.572
ppers-3.483
Token and
Token ID290
Feature activation-5.16e-01

Pos logit contributions
 agre+2.352
 agreeing+2.261
 dere+2.217
lio+2.126
 agreement+2.047
Neg logit contributions
icide-2.428
heastern-2.097
ingers-2.067
icides-2.015
ppers-1.972
Token almost
Token ID2048
Feature activation-1.68e+00

Pos logit contributions
 agre+2.748
 agreeing+2.640
 dere+2.586
lio+2.473
 agreement+2.380
Neg logit contributions
icide-3.065
heastern-2.664
ingers-2.625
icides-2.562
ppers-2.509
Token no
Token ID645
Feature activation-3.38e-01

Pos logit contributions
 agre+7.650
 agreeing+7.563
 dere+7.426
 agreement+6.832
 compl+6.466
Neg logit contributions
icide-10.156
heastern-9.320
ingers-9.037
uliffe-9.026
icides-8.729
Token news
Token ID1705
Feature activation-2.61e-02

Pos logit contributions
 agre+1.781
 agreeing+1.703
 dere+1.667
lio+1.611
elli+1.536
Neg logit contributions
icide-2.025
heastern-1.755
ingers-1.734
icides-1.696
ppers-1.662
Token play
Token ID711
Feature activation-8.26e-01

Pos logit contributions
 agre+0.140
 agreeing+0.134
 dere+0.132
lio+0.127
elli+0.122
Neg logit contributions
icide-0.153
heastern-0.132
ingers-0.130
icides-0.127
ppers-0.124
Token on
Token ID319
Feature activation+1.16e-02

Pos logit contributions
 agre+4.284
 agreeing+4.173
 dere+4.061
lio+3.857
 agreement+3.775
Neg logit contributions
icide-4.847
heastern-4.266
ingers-4.133
icides-4.084
ppers-3.978
Token this
Token ID428
Feature activation+9.06e-02

Pos logit contributions
icide+0.064
heastern+0.054
ingers+0.054
icides+0.053
ppers+0.052
Neg logit contributions
 agre-0.066
 agreeing-0.063
 dere-0.062
lio-0.061
elli-0.058
Token possible
Token ID1744
Feature activation-2.36e-01

Pos logit contributions
 agre+2.862
 agreeing+2.784
 dere+2.724
lio+2.548
 agreement+2.497
Neg logit contributions
icide-4.159
heastern-3.687
ingers-3.641
icides-3.574
ppers-3.465
Token,
Token ID11
Feature activation-1.47e-01

Pos logit contributions
 agre+1.299
 agreeing+1.244
 dere+1.220
lio+1.170
 agreement+1.124
Neg logit contributions
icide-1.360
heastern-1.174
ingers-1.158
icides-1.129
ppers-1.106
Tokenin
Token ID259
Feature activation-9.47e-02

Pos logit contributions
 agre+0.844
 agreeing+0.798
 dere+0.784
lio+0.755
elli+0.723
Neg logit contributions
icide-0.819
heastern-0.702
ingers-0.692
icides-0.674
ppers-0.662
Token the
Token ID262
Feature activation+9.44e-03

Pos logit contributions
 agre+0.518
 agreeing+0.499
 dere+0.489
lio+0.468
 agreement+0.451
Neg logit contributions
icide-0.548
heastern-0.474
ingers-0.468
icides-0.456
ppers-0.446
Token old
Token ID1468
Feature activation-7.27e-01

Pos logit contributions
icide+0.066
heastern+0.059
ingers+0.058
icides+0.056
ppers+0.056
Neg logit contributions
 agre-0.040
 agreeing-0.038
 dere-0.037
lio-0.035
elli-0.033
Token Naval
Token ID20818
Feature activation-1.67e+00

Pos logit contributions
 agre+3.326
 dere+3.165
 agreeing+3.162
lio+2.960
 agreement+2.836
Neg logit contributions
icide-4.729
heastern-4.156
ingers-4.142
icides-4.050
ppers-3.982
Token Air
Token ID3701
Feature activation-7.72e-01

Pos logit contributions
 agreeing+5.455
 dere+5.337
 Stat+5.185
 agreement+5.120
elli+5.112
Neg logit contributions
icide-11.314
heastern-10.622
ingers-10.451
uliffe-10.397
icides-10.194
Token Station
Token ID9327
Feature activation-7.08e-01

Pos logit contributions
 agreeing+1.939
 agre+1.917
 dere+1.893
lio+1.701
 agreement+1.608
Neg logit contributions
icide-6.518
heastern-5.929
ingers-5.851
icides-5.815
uliffe-5.664
Token in
Token ID287
Feature activation-4.10e-01

Pos logit contributions
 agre+3.210
 agreeing+3.145
 dere+3.094
lio+2.920
 agreement+2.813
Neg logit contributions
icide-4.640
heastern-4.073
ingers-4.061
icides-3.981
ppers-3.866
Token South
Token ID2520
Feature activation-2.15e-01

Pos logit contributions
 agre+2.062
 agreeing+2.008
 dere+1.962
lio+1.868
 agreement+1.819
Neg logit contributions
icide-2.504
heastern-2.206
ingers-2.180
icides-2.126
ommel-2.049
Token We
Token ID775
Feature activation+1.34e-01

Pos logit contributions
 agre+1.018
 agreeing+0.971
 dere+0.948
lio+0.902
 agreement+0.867
Neg logit contributions
icide-1.397
heastern-1.223
ingers-1.214
icides-1.197
ppers-1.162
Token to
Token ID284
Feature activation-4.08e-01

Pos logit contributions
 agre+0.831
 agreeing+0.806
 dere+0.790
lio+0.754
 agreement+0.736
Neg logit contributions
icide-0.830
heastern-0.724
ingers-0.710
icides-0.686
ppers-0.677
Token funding
Token ID4918
Feature activation+6.50e-02

Pos logit contributions
 agre+2.055
 agreeing+1.969
 dere+1.927
lio+1.839
 agreement+1.762
Neg logit contributions
icide-2.539
heastern-2.222
ingers-2.192
icides-2.142
ppers-2.101
Token,
Token ID11
Feature activation+3.45e-02

Pos logit contributions
icide+0.361
heastern+0.307
ingers+0.305
icides+0.298
ppers+0.290
Neg logit contributions
 agre-0.370
 agreeing-0.354
 dere-0.349
lio-0.338
elli-0.322
Token requiring
Token ID10616
Feature activation-1.13e-01

Pos logit contributions
icide+0.196
heastern+0.167
ingers+0.165
icides+0.163
ppers+0.160
Neg logit contributions
 agre-0.191
 agreeing-0.177
 dere-0.177
lio-0.173
elli-0.167
Token F
Token ID376
Feature activation-3.33e-01

Pos logit contributions
 agre+0.588
 agreeing+0.555
 dere+0.544
lio+0.531
andum+0.503
Neg logit contributions
icide-0.688
heastern-0.597
ingers-0.593
icides-0.579
ppers-0.570
Tokenannie
Token ID42883
Feature activation-1.66e+00

Pos logit contributions
 agre+1.441
 agreeing+1.369
 dere+1.335
lio+1.265
 agreement+1.200
Neg logit contributions
icide-2.309
heastern-2.049
ingers-2.026
icides-1.985
ppers-1.951
Token Mae
Token ID34673
Feature activation-1.12e+00

Pos logit contributions
 agreeing+5.128
 agre+4.916
 dere+4.595
 agreement+4.444
elli+4.265
Neg logit contributions
icide-12.371
heastern-11.226
ingers-11.019
ommel-10.929
icides-10.890
Token and
Token ID290
Feature activation-8.33e-01

Pos logit contributions
 agreeing+5.439
 agre+5.395
 dere+5.144
 agreement+4.890
elli+4.641
Neg logit contributions
icide-6.809
heastern-6.122
ingers-5.973
icides-5.767
ommel-5.745
Token Freddie
Token ID31658
Feature activation-1.22e+00

Pos logit contributions
 agre+3.910
 agreeing+3.896
 dere+3.741
lio+3.481
 agreement+3.434
Neg logit contributions
icide-5.272
heastern-4.692
ingers-4.685
icides-4.556
ppers-4.472
Token Mac
Token ID4100
Feature activation-3.26e-01

Pos logit contributions
 agreeing+4.865
 agre+4.683
 dere+4.534
lio+4.452
 agreement+4.402
Neg logit contributions
icide-8.511
ommel-7.448
heastern-7.410
icides-7.410
ingers-7.390
Token to
Token ID284
Feature activation-1.63e-01

Pos logit contributions
 agre+1.864
 agreeing+1.802
 dere+1.763
lio+1.689
 agreement+1.638
Neg logit contributions
icide-1.801
heastern-1.552
ingers-1.524
icides-1.484
ppers-1.450
Token
Token ID784
Feature activation-6.15e-01

Pos logit contributions
 agre+0.450
 agreeing+0.432
 dere+0.424
lio+0.406
 agreement+0.390
Neg logit contributions
icide-0.484
heastern-0.420
ingers-0.414
icides-0.404
ppers-0.395
Token and
Token ID290
Feature activation-3.93e-01

Pos logit contributions
 agre+3.447
 agreeing+3.341
 dere+3.258
lio+3.094
 agreement+3.033
Neg logit contributions
icide-3.467
heastern-3.000
ingers-2.941
icides-2.873
ppers-2.812
Token you
Token ID345
Feature activation-6.97e-01

Pos logit contributions
 agre+2.148
 agreeing+2.053
 dere+2.019
lio+1.952
elli+1.858
Neg logit contributions
icide-2.275
heastern-1.966
ingers-1.943
icides-1.889
ppers-1.857
Token would
Token ID561
Feature activation-7.41e-01

Pos logit contributions
 agre+4.151
 agreeing+4.028
 dere+3.934
lio+3.759
 agreement+3.674
Neg logit contributions
icide-3.669
heastern-3.161
ingers-3.073
icides-3.006
ppers-2.902
Token be
Token ID307
Feature activation-1.23e+00

Pos logit contributions
 agre+3.483
 agreeing+3.354
 dere+3.252
lio+3.050
 agreement+2.970
Neg logit contributions
icide-4.831
heastern-4.302
ingers-4.198
icides-4.135
ppers-4.020
Token right
Token ID826
Feature activation-1.65e+00

Pos logit contributions
 agreeing+5.129
 agre+4.991
 dere+4.890
 agreement+4.279
lio+4.211
Neg logit contributions
icide-8.435
heastern-7.822
icides-7.466
ingers-7.401
uliffe-7.325
Token.
Token ID13
Feature activation-5.71e-01

Pos logit contributions
 agreeing+7.488
 agre+7.066
 dere+6.996
lio+6.474
 agreement+6.474
Neg logit contributions
icide-9.869
heastern-8.887
ingers-8.630
icides-8.582
uliffe-8.491
Token
Token ID198
Feature activation-2.95e-01

Pos logit contributions
 agre+2.945
 agreeing+2.843
 dere+2.776
lio+2.649
 agreement+2.563
Neg logit contributions
icide-3.461
heastern-3.032
ingers-2.976
icides-2.911
ppers-2.848
Token
Token ID198
Feature activation-3.43e-01

Pos logit contributions
 agre+1.583
 agreeing+1.503
 dere+1.473
lio+1.406
 agreement+1.349
Neg logit contributions
icide-1.753
heastern-1.519
ingers-1.501
icides-1.462
ppers-1.438
TokenBut
Token ID1537
Feature activation-4.09e-03

Pos logit contributions
 agre+1.909
 agreeing+1.823
 dere+1.792
lio+1.713
 agreement+1.645
Neg logit contributions
icide-1.973
heastern-1.698
ingers-1.681
icides-1.636
ppers-1.607
Token what
Token ID644
Feature activation-6.80e-01

Pos logit contributions
 agre+0.023
 agreeing+0.022
 dere+0.022
lio+0.021
elli+0.020
Neg logit contributions
icide-0.023
heastern-0.019
ingers-0.019
icides-0.018
ppers-0.018
Token been
Token ID587
Feature activation-6.79e-01

Pos logit contributions
 agre+3.272
 agreeing+3.167
 dere+3.109
lio+2.921
 agreement+2.870
Neg logit contributions
icide-3.781
heastern-3.319
ingers-3.264
icides-3.169
ppers-3.110
Token living
Token ID2877
Feature activation-9.24e-01

Pos logit contributions
 agre+3.624
 agreeing+3.540
 dere+3.469
lio+3.263
 agreement+3.177
Neg logit contributions
icide-3.961
heastern-3.454
ingers-3.383
icides-3.313
ppers-3.238
Token under
Token ID739
Feature activation-6.16e-01

Pos logit contributions
 agre+4.847
 agreeing+4.681
 dere+4.597
lio+4.298
 agreement+4.264
Neg logit contributions
icide-5.513
heastern-4.750
ingers-4.713
icides-4.614
ppers-4.491
Token military
Token ID2422
Feature activation-6.45e-01

Pos logit contributions
 agre+3.414
 agreeing+3.293
 dere+3.232
lio+3.079
 agreement+3.006
Neg logit contributions
icide-3.496
heastern-3.033
ingers-2.993
icides-2.900
ppers-2.851
Token law
Token ID1099
Feature activation-2.48e-01

Pos logit contributions
 agre+3.630
 agreeing+3.509
 dere+3.448
lio+3.260
 agreement+3.243
Neg logit contributions
icide-3.564
heastern-3.127
ingers-3.072
icides-2.948
ppers-2.904
Token for
Token ID329
Feature activation-1.65e+00

Pos logit contributions
 agre+1.548
 agreeing+1.489
 dere+1.467
lio+1.420
elli+1.365
Neg logit contributions
icide-1.247
heastern-1.052
ingers-1.031
icides-1.005
ppers-0.978
Token nearly
Token ID3016
Feature activation-1.40e+00

Pos logit contributions
 agreeing+6.718
 agre+6.673
 dere+6.519
 agreement+5.941
lio+5.790
Neg logit contributions
icide-10.684
ingers-9.684
heastern-9.676
uliffe-9.262
icides-9.240
Token five
Token ID1936
Feature activation-2.00e-01

Pos logit contributions
 agre+6.542
 agreeing+6.477
 dere+6.316
 agreement+5.880
lio+5.737
Neg logit contributions
icide-8.533
ingers-7.590
heastern-7.562
icides-7.324
-7.156
Token decades
Token ID4647
Feature activation+3.52e-01

Pos logit contributions
 agre+0.737
 agreeing+0.683
 dere+0.660
lio+0.631
elli+0.586
Neg logit contributions
icide-1.524
heastern-1.361
ingers-1.349
icides-1.326
ppers-1.302
Token,
Token ID11
Feature activation-5.09e-01

Pos logit contributions
icide+1.878
heastern+1.579
ingers+1.551
icides+1.525
ppers+1.468
Neg logit contributions
 agre-2.088
 agreeing-1.949
 dere-1.944
lio-1.898
elli-1.804
Token while
Token ID981
Feature activation-4.25e-01

Pos logit contributions
 agre+2.686
 agreeing+2.584
 dere+2.530
lio+2.419
 agreement+2.324
Neg logit contributions
icide-3.040
heastern-2.648
ingers-2.609
icides-2.545
ppers-2.496
Token,
Token ID11
Feature activation-1.16e-01

Pos logit contributions
 agre+2.232
 agreeing+2.143
 dere+2.103
lio+2.018
 agreement+1.937
Neg logit contributions
icide-2.325
heastern-2.008
ingers-1.982
icides-1.930
ppers-1.891
Token there
Token ID612
Feature activation-7.26e-01

Pos logit contributions
 agre+0.661
 agreeing+0.626
 dere+0.622
lio+0.597
andum+0.572
Neg logit contributions
icide-0.644
heastern-0.557
ingers-0.554
icides-0.530
ppers-0.528
Token is
Token ID318
Feature activation-6.56e-01

Pos logit contributions
 agre+3.490
 agreeing+3.349
 dere+3.255
lio+3.098
 agreement+3.003
Neg logit contributions
icide-4.640
heastern-4.079
ingers-4.023
icides-3.945
ppers-3.834
Token one
Token ID530
Feature activation-8.44e-01

Pos logit contributions
 agre+3.396
 agreeing+3.272
 dere+3.190
lio+3.044
 agreement+2.971
Neg logit contributions
icide-3.986
heastern-3.457
ingers-3.425
icides-3.363
ppers-3.274
Token major
Token ID1688
Feature activation-1.24e+00

Pos logit contributions
 agre+4.104
 agreeing+3.958
 dere+3.871
lio+3.633
 agreement+3.589
Neg logit contributions
icide-5.354
heastern-4.690
ingers-4.661
icides-4.581
ppers-4.457
Token difference
Token ID3580
Feature activation-1.61e+00

Pos logit contributions
 agre+3.756
 agreeing+3.585
 dere+3.481
 agreement+3.188
lio+2.988
Neg logit contributions
icide-9.967
ingers-9.003
heastern-8.940
icides-8.930
ppers-8.659
Token between
Token ID1022
Feature activation-4.02e-01

Pos logit contributions
 agreeing+7.745
 agre+7.647
 dere+7.347
 agreement+7.105
elli+6.897
Neg logit contributions
icide-9.430
heastern-8.303
ingers-8.194
icides-8.006
uliffe-7.800
Token the
Token ID262
Feature activation-1.57e-01

Pos logit contributions
 agre+2.328
 agreeing+2.244
 dere+2.201
lio+2.116
 agreement+2.040
Neg logit contributions
icide-2.196
heastern-1.882
ingers-1.853
icides-1.805
ppers-1.762
Token two
Token ID734
Feature activation-1.80e-01

Pos logit contributions
 agre+0.800
 agreeing+0.761
 dere+0.749
lio+0.717
andum+0.682
Neg logit contributions
icide-0.966
heastern-0.846
ingers-0.831
icides-0.810
ppers-0.807
Token types
Token ID3858
Feature activation-3.08e-01

Pos logit contributions
 agre+0.916
 agreeing+0.871
 dere+0.856
lio+0.829
andum+0.788
Neg logit contributions
icide-1.098
heastern-0.957
ingers-0.944
icides-0.923
ppers-0.916
Token of
Token ID286
Feature activation-2.84e-02

Pos logit contributions
 agre+1.486
 agreeing+1.439
 dere+1.396
lio+1.327
 agreement+1.294
Neg logit contributions
icide-1.966
heastern-1.730
ingers-1.702
icides-1.671
ppers-1.633
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
icide+2.082
heastern+1.754
ingers+1.736
icides+1.697
ppers+1.679
Neg logit contributions
 agre-2.047
 agreeing-1.929
 dere-1.877
lio-1.818
 agreement-1.732
Token New
Token ID968
Feature activation-4.16e-01

Pos logit contributions
 agre+0.067
 agreeing+0.064
 dere+0.063
lio+0.060
 agreement+0.058
Neg logit contributions
icide-0.060
heastern-0.051
ingers-0.050
icides-0.049
ppers-0.048
Token York
Token ID1971
Feature activation-5.97e-01

Pos logit contributions
 agre+1.692
 agreeing+1.608
 dere+1.566
lio+1.478
 agreement+1.403
Neg logit contributions
icide-2.989
heastern-2.662
ingers-2.633
icides-2.587
ppers-2.544
Token:
Token ID25
Feature activation-6.44e-01

Pos logit contributions
 agre+2.458
 agreeing+2.426
 dere+2.387
lio+2.221
 agreement+2.194
Neg logit contributions
icide-4.093
ingers-3.619
heastern-3.611
icides-3.565
ppers-3.470
Token William
Token ID3977
Feature activation-5.66e-01

Pos logit contributions
 agre+3.303
 agreeing+3.225
 dere+3.155
lio+3.005
 agreement+2.921
Neg logit contributions
icide-3.851
heastern-3.383
ingers-3.303
icides-3.264
ppers-3.147
Token Morrow
Token ID34693
Feature activation-1.61e+00

Pos logit contributions
 agre+1.982
 agreeing+1.902
 dere+1.865
lio+1.739
 agreement+1.653
Neg logit contributions
icide-4.274
heastern-3.840
ingers-3.796
icides-3.753
ppers-3.678
Token and
Token ID290
Feature activation-6.96e-01

Pos logit contributions
 agreeing+6.854
 agre+6.774
 dere+6.640
lio+6.339
 agreement+6.299
Neg logit contributions
icide-9.894
icides-8.624
heastern-8.595
ingers-8.378
ebus-8.282
Token Company
Token ID5834
Feature activation-8.02e-01

Pos logit contributions
 agre+2.438
 agreeing+2.323
 dere+2.273
lio+2.092
 agreement+2.012
Neg logit contributions
icide-5.306
heastern-4.789
ingers-4.736
icides-4.647
ppers-4.561
Token,
Token ID11
Feature activation-3.04e-01

Pos logit contributions
 agre+3.614
 agreeing+3.575
 dere+3.509
lio+3.280
 agreement+3.248
Neg logit contributions
icide-5.110
heastern-4.507
icides-4.413
ingers-4.412
ppers-4.219
Token Inc
Token ID3457
Feature activation-3.74e-01

Pos logit contributions
 agre+1.447
 agreeing+1.374
 dere+1.341
lio+1.281
 agreement+1.216
Neg logit contributions
icide-1.989
heastern-1.744
ingers-1.728
icides-1.687
ppers-1.663
Token.
Token ID13
Feature activation-4.29e-01

Pos logit contributions
 agre+1.792
 agreeing+1.752
 dere+1.717
lio+1.636
 agreement+1.583
Neg logit contributions
icide-2.316
heastern-2.057
ingers-2.020
icides-1.978
ppers-1.925
Token an
Token ID281
Feature activation+6.11e-02

Pos logit contributions
icide+0.081
heastern+0.069
ingers+0.069
icides+0.066
ppers+0.065
Neg logit contributions
 agre-0.080
 agreeing-0.075
 dere-0.074
lio-0.072
yrights-0.069
Token investment
Token ID4896
Feature activation-3.27e-01

Pos logit contributions
icide+0.375
ingers+0.322
heastern+0.322
icides+0.312
ppers+0.310
Neg logit contributions
 agre-0.314
 agreeing-0.290
 dere-0.287
lio-0.281
yrights-0.267
Token banker
Token ID33371
Feature activation-6.03e-01

Pos logit contributions
 agre+1.494
 agreeing+1.406
 dere+1.376
lio+1.318
elli+1.237
Neg logit contributions
icide-2.212
heastern-1.950
ingers-1.916
icides-1.887
ppers-1.846
Token now
Token ID783
Feature activation+7.23e-02

Pos logit contributions
 agre+3.034
 agreeing+2.932
 dere+2.861
lio+2.731
 agreement+2.626
Neg logit contributions
icide-3.737
heastern-3.263
ingers-3.221
icides-3.158
ppers-3.088
Token with
Token ID351
Feature activation-7.59e-01

Pos logit contributions
icide+0.428
heastern+0.370
ingers+0.367
icides+0.353
ppers+0.347
Neg logit contributions
 agre-0.388
 agreeing-0.353
 dere-0.350
lio-0.343
yrights-0.334
Token JPMorgan
Token ID43318
Feature activation-1.60e+00

Pos logit contributions
 agre+3.957
 agreeing+3.827
 dere+3.772
lio+3.554
 agreement+3.468
Neg logit contributions
icide-4.501
heastern-3.927
ingers-3.889
icides-3.788
ppers-3.698
Token who
Token ID508
Feature activation-6.15e-01

Pos logit contributions
 agreeing+6.915
 agre+6.802
 dere+6.532
elli+6.105
 agreement+6.046
Neg logit contributions
icide-10.109
ingers-8.900
icides-8.832
heastern-8.806
ommel-8.548
Token specializes
Token ID29786
Feature activation-1.75e-01

Pos logit contributions
 agre+3.446
 agreeing+3.340
 dere+3.292
lio+3.117
 agreement+3.033
Neg logit contributions
icide-3.418
heastern-2.977
ingers-2.916
icides-2.825
ommel-2.737
Token in
Token ID287
Feature activation-2.74e-01

Pos logit contributions
 agre+0.945
 agreeing+0.879
 dere+0.859
lio+0.838
andum+0.795
Neg logit contributions
icide-1.048
heastern-0.917
ingers-0.891
icides-0.873
ppers-0.859
Token big
Token ID1263
Feature activation-1.29e-01

Pos logit contributions
 agre+1.377
 agreeing+1.301
 dere+1.269
lio+1.235
elli+1.169
Neg logit contributions
icide-1.717
heastern-1.499
ingers-1.482
icides-1.448
ppers-1.425
Token energy
Token ID2568
Feature activation-1.59e-01

Pos logit contributions
 agre+0.661
 agreeing+0.621
 dere+0.597
lio+0.587
andum+0.554
Neg logit contributions
icide-0.808
heastern-0.705
ingers-0.697
icides-0.683
ppers-0.673
Token credit
Token ID3884
Feature activation-8.22e-01

Pos logit contributions
icide+2.653
heastern+2.289
ingers+2.253
ppers+2.158
icides+2.155
Neg logit contributions
 agre-2.532
 agreeing-2.343
lio-2.298
 dere-2.255
yrights-2.229
Token-
Token ID12
Feature activation-2.21e-01

Pos logit contributions
 agre+3.798
 agreeing+3.785
 dere+3.624
lio+3.454
 agreement+3.432
Neg logit contributions
icide-5.246
heastern-4.642
ingers-4.616
icides-4.527
ppers-4.466
Tokenreporting
Token ID49914
Feature activation-1.11e-01

Pos logit contributions
 agre+1.164
 agreeing+1.100
 dere+1.089
lio+1.022
andum+0.993
Neg logit contributions
icide-1.341
heastern-1.158
ingers-1.142
icides-1.121
ppers-1.094
Token agency
Token ID4086
Feature activation-2.28e-01

Pos logit contributions
 agre+0.596
 agreeing+0.564
 dere+0.555
lio+0.539
andum+0.516
Neg logit contributions
icide-0.651
heastern-0.559
ingers-0.552
icides-0.538
ppers-0.528
Token,
Token ID11
Feature activation-2.06e-01

Pos logit contributions
 agre+1.244
 agreeing+1.198
 dere+1.173
lio+1.122
 agreement+1.083
Neg logit contributions
icide-1.319
heastern-1.142
ingers-1.126
icides-1.099
ppers-1.074
Token JPMorgan
Token ID43318
Feature activation-1.60e+00

Pos logit contributions
 agre+1.008
 agreeing+0.957
 dere+0.940
lio+0.898
 agreement+0.855
Neg logit contributions
icide-1.317
heastern-1.152
ingers-1.136
icides-1.113
ppers-1.091
Token Chase
Token ID16346
Feature activation-1.36e+00

Pos logit contributions
 agreeing+5.830
 agre+5.576
 dere+5.155
 agreement+4.928
 Stat+4.906
Neg logit contributions
icide-10.945
heastern-9.966
ingers-9.921
icides-9.800
ommel-9.656
Token,
Token ID11
Feature activation-7.16e-01

Pos logit contributions
 agreeing+6.125
 agre+5.885
 dere+5.420
 agreement+5.254
 Stat+5.188
Neg logit contributions
icide-8.475
heastern-7.696
ingers-7.479
icides-7.366
ppers-7.185
Token and
Token ID290
Feature activation-8.18e-01

Pos logit contributions
 agre+3.342
 agreeing+3.275
 dere+3.145
lio+2.988
 agreement+2.886
Neg logit contributions
icide-4.615
heastern-4.108
ingers-4.053
icides-3.963
ppers-3.896
Token Cit
Token ID15792
Feature activation-8.41e-01

Pos logit contributions
 agre+3.760
 agreeing+3.685
 dere+3.539
lio+3.333
 agreement+3.247
Neg logit contributions
icide-5.331
heastern-4.740
ingers-4.681
icides-4.583
ppers-4.491
Tokenib
Token ID571
Feature activation+1.21e-01

Pos logit contributions
 agre+2.625
 agreeing+2.614
 dere+2.481
lio+2.410
elli+2.265
Neg logit contributions
icide-6.452
heastern-5.847
ingers-5.815
icides-5.714
ppers-5.645
Token 13
Token ID1511
Feature activation-1.06e+00

Pos logit contributions
 agre+4.496
 agreeing+4.450
 dere+4.336
lio+4.121
 agreement+3.950
Neg logit contributions
icide-6.572
heastern-5.895
ingers-5.789
icides-5.660
uliffe-5.540
Token Sep
Token ID8621
Feature activation-3.37e-01

Pos logit contributions
 agre+4.179
 agreeing+4.173
 dere+4.017
lio+3.778
 agreement+3.655
Neg logit contributions
icide-7.472
heastern-6.705
ingers-6.603
icides-6.515
uliffe-6.346
Token,
Token ID11
Feature activation-1.10e+00

Pos logit contributions
 agre+1.807
 agreeing+1.729
 dere+1.698
lio+1.624
 agreement+1.556
Neg logit contributions
icide-1.989
heastern-1.724
ingers-1.700
icides-1.658
ppers-1.626
Token 2014
Token ID1946
Feature activation-4.88e-01

Pos logit contributions
 agre+4.906
 agreeing+4.818
 dere+4.666
lio+4.448
 agreement+4.289
Neg logit contributions
icide-7.136
heastern-6.350
ingers-6.273
icides-6.155
ppers-5.980
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
 agre+2.536
 agreeing+2.439
 dere+2.384
lio+2.281
 agreement+2.194
Neg logit contributions
icide-2.941
heastern-2.568
ingers-2.533
icides-2.475
ppers-2.424
Token 11
Token ID1367
Feature activation-1.60e+00

Pos logit contributions
 agre+6.012
 agreeing+5.894
 dere+5.751
lio+5.328
 agreement+5.298
Neg logit contributions
icide-8.010
heastern-7.162
ingers-7.026
icides-6.893
uliffe-6.795
Token:
Token ID25
Feature activation-2.19e-01

Pos logit contributions
 agreeing+5.636
 agre+5.439
 dere+5.409
 agreement+5.035
 Stat+4.980
Neg logit contributions
icide-11.329
heastern-10.229
ingers-10.035
icides-10.013
uliffe-9.891
Token13
Token ID1485
Feature activation+9.39e-02

Pos logit contributions
 agre+1.487
 agreeing+1.400
 dere+1.383
lio+1.329
 agreement+1.286
Neg logit contributions
icide-1.017
ingers-0.832
heastern-0.828
icides-0.797
ppers-0.780
Token AM
Token ID3001
Feature activation-5.50e-01

Pos logit contributions
icide+0.552
heastern+0.481
ingers+0.473
icides+0.459
ppers+0.452
Neg logit contributions
 agre-0.503
 agreeing-0.482
 dere-0.474
lio-0.456
 agreement-0.434
Token
Token ID198
Feature activation+1.55e-02

Pos logit contributions
 agre+3.079
 agreeing+2.971
 dere+2.893
lio+2.768
 agreement+2.701
Neg logit contributions
icide-3.088
heastern-2.661
ingers-2.615
icides-2.568
ppers-2.507
Token
Token ID198
Feature activation-5.68e-02

Pos logit contributions
icide+0.086
heastern+0.073
ingers+0.072
icides+0.070
ppers+0.070
Neg logit contributions
 agre-0.093
 agreeing-0.085
 dere-0.083
lio-0.079
andum-0.076
Token,
Token ID11
Feature activation-4.70e-01

Pos logit contributions
 agre+1.289
 agreeing+1.232
 dere+1.214
lio+1.168
elli+1.114
Neg logit contributions
icide-1.288
heastern-1.109
ingers-1.091
icides-1.060
ppers-1.040
Token though
Token ID996
Feature activation-9.02e-02

Pos logit contributions
 agre+2.648
 agreeing+2.545
 dere+2.498
lio+2.399
 agreement+2.307
Neg logit contributions
icide-2.650
heastern-2.282
ingers-2.250
icides-2.192
ppers-2.144
Token it
Token ID340
Feature activation-2.32e-01

Pos logit contributions
 agre+0.504
 agreeing+0.472
 dere+0.467
lio+0.454
elli+0.434
Neg logit contributions
icide-0.513
heastern-0.438
ingers-0.434
icides-0.423
ppers-0.419
Token later
Token ID1568
Feature activation+5.90e-01

Pos logit contributions
 agre+1.393
 agreeing+1.335
 dere+1.313
lio+1.274
elli+1.220
Neg logit contributions
icide-1.221
heastern-1.035
ingers-1.019
icides-0.993
ppers-0.972
Token came
Token ID1625
Feature activation-5.49e-01

Pos logit contributions
icide+2.815
heastern+2.310
ingers+2.284
ppers+2.264
 BACK+2.231
Neg logit contributions
 agre-3.651
lio-3.379
 agreeing-3.363
 dere-3.357
andum-3.255
Token out
Token ID503
Feature activation-1.59e+00

Pos logit contributions
 agre+2.988
 agreeing+2.919
 dere+2.869
lio+2.727
 agreement+2.635
Neg logit contributions
icide-3.145
heastern-2.739
ingers-2.690
icides-2.624
ppers-2.562
Token that
Token ID326
Feature activation-7.20e-01

Pos logit contributions
 agreeing+8.203
 agre+7.909
 dere+7.794
 agreement+7.276
lio+7.006
Neg logit contributions
icide-8.948
heastern-7.943
ingers-7.768
icides-7.689
ppers-7.570
Token the
Token ID262
Feature activation-2.18e-01

Pos logit contributions
 agre+3.693
 agreeing+3.588
 dere+3.504
lio+3.307
 agreement+3.235
Neg logit contributions
icide-4.346
heastern-3.824
ingers-3.745
icides-3.660
ppers-3.572
Token report
Token ID989
Feature activation-2.97e-01

Pos logit contributions
 agre+1.153
 agreeing+1.093
 dere+1.061
lio+1.032
elli+0.989
Neg logit contributions
icide-1.320
heastern-1.131
ingers-1.126
icides-1.100
ppers-1.082
Token had
Token ID550
Feature activation+1.86e-02

Pos logit contributions
 agre+1.809
 agreeing+1.746
 dere+1.716
lio+1.653
 agreement+1.596
Neg logit contributions
icide-1.531
heastern-1.300
ingers-1.279
icides-1.242
ppers-1.212
Token been
Token ID587
Feature activation-2.61e-01

Pos logit contributions
icide+0.102
heastern+0.086
ingers+0.085
icides+0.083
ppers+0.081
Neg logit contributions
 agre-0.109
 agreeing-0.104
 dere-0.101
lio-0.099
elli-0.095
Token
Token ID198
Feature activation+1.00e-01

Pos logit contributions
 agre+1.398
 agreeing+1.319
 dere+1.292
lio+1.235
 agreement+1.185
Neg logit contributions
icide-1.527
heastern-1.321
ingers-1.307
icides-1.272
ppers-1.252
TokenIt
Token ID1026
Feature activation-1.18e-01

Pos logit contributions
icide+0.562
ingers+0.477
heastern+0.476
icides+0.460
ppers+0.460
Neg logit contributions
 agre-0.584
 agreeing-0.549
 dere-0.537
lio-0.505
 agreement-0.494
Token also
Token ID635
Feature activation-1.97e-01

Pos logit contributions
 agre+0.654
 agreeing+0.624
 dere+0.609
lio+0.591
elli+0.561
Neg logit contributions
icide-0.685
heastern-0.586
ingers-0.582
icides-0.566
ppers-0.561
Token features
Token ID3033
Feature activation-5.70e-01

Pos logit contributions
 agre+1.089
 agreeing+1.034
 dere+1.006
lio+0.991
elli+0.938
Neg logit contributions
icide-1.131
heastern-0.966
ingers-0.963
icides-0.936
ppers-0.930
Token unusual
Token ID8468
Feature activation+2.35e-01

Pos logit contributions
 agre+2.921
 agreeing+2.804
 dere+2.755
lio+2.615
 agreement+2.521
Neg logit contributions
icide-3.487
heastern-3.046
ingers-3.003
icides-2.934
ppers-2.871
Token men
Token ID1450
Feature activation-1.59e+00

Pos logit contributions
icide+1.375
heastern+1.167
ingers+1.164
icides+1.152
ppers+1.128
Neg logit contributions
 agre-1.258
 agreeing-1.188
lio-1.137
 dere-1.130
andum-1.107
Token's
Token ID338
Feature activation-7.22e-01

Pos logit contributions
 agre+6.864
 agreeing+6.824
 dere+6.622
lio+6.436
elli+6.433
Neg logit contributions
icide-9.768
heastern-8.917
ingers-8.421
uliffe-8.413
icides-8.359
Token ur
Token ID2956
Feature activation+6.45e-02

Pos logit contributions
 agre+3.838
 agreeing+3.703
 dere+3.660
lio+3.448
 agreement+3.374
Neg logit contributions
icide-4.227
heastern-3.711
ingers-3.648
icides-3.526
ppers-3.453
Tokeninals
Token ID6897
Feature activation-2.98e-01

Pos logit contributions
icide+0.234
ingers+0.179
heastern+0.178
icides+0.173
ppers+0.166
Neg logit contributions
 agre-0.501
 agreeing-0.486
 dere-0.476
lio-0.460
 agreement-0.451
Token in
Token ID287
Feature activation-6.33e-02

Pos logit contributions
 agre+1.616
 agreeing+1.548
 dere+1.518
lio+1.458
 agreement+1.398
Neg logit contributions
icide-1.738
heastern-1.505
ingers-1.486
icides-1.446
ppers-1.420
Token the
Token ID262
Feature activation+1.16e-01

Pos logit contributions
 agre+0.348
 agreeing+0.333
 dere+0.323
lio+0.315
andum+0.304
Neg logit contributions
icide-0.362
ingers-0.311
heastern-0.311
icides-0.299
ppers-0.297
Token percent
Token ID1411
Feature activation-6.85e-01

Pos logit contributions
 agre+0.246
 agreeing+0.233
 dere+0.230
lio+0.222
elli+0.213
Neg logit contributions
icide-0.223
heastern-0.192
ingers-0.189
icides-0.182
ppers-0.180
Token on
Token ID319
Feature activation-8.29e-01

Pos logit contributions
 agre+3.485
 agreeing+3.428
 dere+3.289
lio+3.134
 agreement+3.097
Neg logit contributions
icide-4.113
heastern-3.568
ingers-3.559
icides-3.501
ppers-3.400
Token the
Token ID262
Feature activation-4.86e-01

Pos logit contributions
 agreeing+3.929
 agre+3.908
 dere+3.721
 agreement+3.593
lio+3.495
Neg logit contributions
icide-5.234
heastern-4.592
ingers-4.511
icides-4.460
ebus-4.312
Token news
Token ID1705
Feature activation+3.80e-01

Pos logit contributions
 agre+2.659
 agreeing+2.558
 dere+2.508
lio+2.401
 agreement+2.314
Neg logit contributions
icide-2.813
heastern-2.433
ingers-2.398
icides-2.339
ppers-2.289
Token.
Token ID13
Feature activation+4.12e-02

Pos logit contributions
icide+2.051
heastern+1.759
ingers+1.690
icides+1.661
ppers+1.630
Neg logit contributions
 agre-2.200
 agreeing-2.090
 dere-2.071
lio-2.022
elli-1.912
Token Imperial
Token ID11773
Feature activation-1.58e+00

Pos logit contributions
icide+0.243
heastern+0.208
ingers+0.206
icides+0.202
ppers+0.198
Neg logit contributions
 agre-0.224
 agreeing-0.213
 dere-0.209
lio-0.198
 agreement-0.191
Token Brands
Token ID46129
Feature activation+5.93e-01

Pos logit contributions
 agre+6.739
 agreeing+6.532
 Stat+6.093
 dere+6.039
lio+6.031
Neg logit contributions
icide-9.836
heastern-9.035
ingers-8.712
uliffe-8.513
icides-8.392
Token,
Token ID11
Feature activation+2.91e-01

Pos logit contributions
icide+3.001
ingers+2.521
heastern+2.465
icides+2.414
ppers+2.397
Neg logit contributions
 agre-3.557
 dere-3.365
 agreeing-3.344
lio-3.282
yrights-3.166
Token formerly
Token ID15734
Feature activation+5.13e-01

Pos logit contributions
icide+1.573
ingers+1.306
heastern+1.287
icides+1.272
ppers+1.231
Neg logit contributions
 agre-1.718
 agreeing-1.613
 dere-1.613
lio-1.551
yrights-1.505
Token known
Token ID1900
Feature activation+2.89e-01

Pos logit contributions
icide+2.875
ingers+2.414
heastern+2.407
icides+2.341
ppers+2.290
Neg logit contributions
 agre-2.841
 agreeing-2.683
 dere-2.656
lio-2.571
andum-2.498
Token as
Token ID355
Feature activation+1.00e-02

Pos logit contributions
icide+1.611
heastern+1.385
ingers+1.365
icides+1.331
ppers+1.300
Neg logit contributions
 agre-1.644
 agreeing-1.583
 dere-1.554
lio-1.490
 agreement-1.438
Token What
Token ID1867
Feature activation-3.81e-01

Pos logit contributions
icide+0.077
heastern+0.067
ingers+0.066
icides+0.065
ppers+0.063
Neg logit contributions
 agre-0.072
 agreeing-0.069
 dere-0.067
lio-0.064
 agreement-0.062
Token most
Token ID749
Feature activation-1.08e+00

Pos logit contributions
 agre+2.091
 agreeing+1.997
 dere+1.959
lio+1.895
elli+1.804
Neg logit contributions
icide-2.215
heastern-1.896
ingers-1.873
icides-1.842
ppers-1.803
Token fail
Token ID2038
Feature activation-9.93e-01

Pos logit contributions
 agre+5.636
 agreeing+5.500
 dere+5.364
 agreement+5.004
lio+4.992
Neg logit contributions
icide-6.336
heastern-5.533
ingers-5.397
icides-5.247
uliffe-5.130
Token to
Token ID284
Feature activation-1.35e+00

Pos logit contributions
 agreeing+4.517
 dere+4.460
 agre+4.414
 agreement+4.139
lio+4.029
Neg logit contributions
icide-6.036
heastern-5.346
ingers-5.217
icides-5.172
uliffe-5.125
Token point
Token ID966
Feature activation-6.54e-01

Pos logit contributions
 agre+4.984
 agreeing+4.872
 dere+4.693
 agreement+4.275
lio+4.245
Neg logit contributions
icide-9.890
heastern-8.974
ingers-8.801
icides-8.662
ebus-8.588
Token out
Token ID503
Feature activation-1.58e+00

Pos logit contributions
 agreeing+2.857
 agre+2.760
 dere+2.749
 agreement+2.633
elli+2.483
Neg logit contributions
icide-4.281
heastern-3.858
ingers-3.780
icides-3.697
ppers-3.655
Token is
Token ID318
Feature activation+1.40e-01

Pos logit contributions
 agreeing+6.223
 dere+6.068
 agre+6.027
 agreement+5.663
elli+5.345
Neg logit contributions
icide-10.286
heastern-9.350
ingers-9.118
-8.903
icides-8.857
Token that
Token ID326
Feature activation+2.44e-02

Pos logit contributions
icide+0.707
heastern+0.595
ingers+0.587
icides+0.572
ppers+0.555
Neg logit contributions
 agre-0.867
 agreeing-0.831
 dere-0.818
lio-0.794
elli-0.763
Token those
Token ID883
Feature activation+3.17e-01

Pos logit contributions
icide+0.130
heastern+0.109
ingers+0.108
icides+0.106
ppers+0.103
Neg logit contributions
 agre-0.146
 agreeing-0.138
 dere-0.136
lio-0.134
andum-0.128
Token executive
Token ID4640
Feature activation+1.19e-01

Pos logit contributions
icide+1.791
ingers+1.524
heastern+1.517
icides+1.501
ppers+1.452
Neg logit contributions
 agre-1.765
 agreeing-1.646
 dere-1.616
lio-1.611
andum-1.559
Token actions
Token ID4028
Feature activation-6.49e-02

Pos logit contributions
icide+0.659
heastern+0.560
ingers+0.554
icides+0.546
ppers+0.533
Neg logit contributions
 agre-0.678
 agreeing-0.647
 dere-0.634
lio-0.622
andum-0.591
Token Choose
Token ID17489
Feature activation-1.93e-01

Pos logit contributions
icide+0.319
heastern+0.274
ingers+0.274
ppers+0.263
icides+0.262
Neg logit contributions
 agre-0.307
 agreeing-0.291
 dere-0.287
lio-0.275
andum-0.263
Token
Token ID564
Feature activation-7.04e-01

Pos logit contributions
 agre+1.030
 agreeing+0.988
 dere+0.969
lio+0.930
 agreement+0.889
Neg logit contributions
icide-1.146
heastern-0.993
ingers-0.981
icides-0.956
ppers-0.938
Token
Token ID246
Feature activation+8.02e-02

Pos logit contributions
 agre+4.685
 agreeing+4.650
 dere+4.536
lio+4.413
 agreement+4.304
Neg logit contributions
icide-3.060
heastern-2.491
ingers-2.450
icides-2.429
ppers-2.298
TokenInstall
Token ID15798
Feature activation-1.04e+00

Pos logit contributions
icide+0.459
heastern+0.394
ingers+0.391
icides+0.378
ppers+0.375
Neg logit contributions
 agre-0.446
 agreeing-0.424
 dere-0.420
lio-0.399
 agreement-0.383
Token from
Token ID422
Feature activation-4.54e-01

Pos logit contributions
 agre+5.477
 agreeing+5.286
 dere+5.135
lio+4.916
 agreement+4.847
Neg logit contributions
icide-5.988
heastern-5.279
ingers-5.072
icides-5.061
ppers-4.856
Token Zip
Token ID38636
Feature activation-1.58e+00

Pos logit contributions
 agre+2.442
 agreeing+2.347
 dere+2.301
lio+2.195
 agreement+2.123
Neg logit contributions
icide-2.673
heastern-2.320
ingers-2.282
icides-2.235
ppers-2.177
Token file
Token ID2393
Feature activation-5.89e-01

Pos logit contributions
 agre+6.995
 agreeing+6.913
 dere+6.560
 agreement+6.333
 Stat+6.308
Neg logit contributions
icide-9.765
heastern-8.618
icides-8.608
ingers-8.368
ommel-8.107
Token
Token ID447
Feature activation+1.69e-02

Pos logit contributions
 agre+3.195
 agreeing+3.087
 dere+2.997
lio+2.867
 agreement+2.810
Neg logit contributions
icide-3.411
heastern-2.956
ingers-2.903
icides-2.862
ppers-2.768
Token
Token ID247
Feature activation-3.29e-02

Pos logit contributions
icide+0.033
heastern+0.020
ingers+0.019
icides+0.016
ppers+0.015
Neg logit contributions
 agre-0.158
 agreeing-0.154
 dere-0.153
lio-0.148
 agreement-0.145
Token and
Token ID290
Feature activation-1.10e-01

Pos logit contributions
 agre+0.184
 agreeing+0.175
 dere+0.174
lio+0.167
andum+0.159
Neg logit contributions
icide-0.187
heastern-0.161
ingers-0.159
icides-0.154
ppers-0.152
Token locate
Token ID17276
Feature activation-5.06e-01

Pos logit contributions
 agre+0.609
 agreeing+0.585
 dere+0.576
lio+0.559
elli+0.531
Neg logit contributions
icide-0.624
heastern-0.538
ingers-0.534
icides-0.514
ppers-0.512
Token America
Token ID2253
Feature activation-7.60e-02

Pos logit contributions
icide+2.104
heastern+1.504
ingers+1.435
ppers+1.362
icides+1.293
Neg logit contributions
 agre-6.281
 agreeing-5.880
 dere-5.807
lio-5.614
elli-5.511
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
 agre+0.428
 agreeing+0.408
 dere+0.403
lio+0.387
 agreement+0.371
Neg logit contributions
icide-0.429
heastern-0.369
ingers-0.364
icides-0.355
ppers-0.347
Token Wells
Token ID18292
Feature activation-2.88e-01

Pos logit contributions
icide+0.824
heastern+0.706
ingers+0.700
icides+0.683
ppers+0.669
Neg logit contributions
 agre-0.786
 agreeing-0.739
 dere-0.724
lio-0.690
 agreement-0.664
Token Fargo
Token ID28125
Feature activation-2.40e-01

Pos logit contributions
 agre+1.376
 agreeing+1.318
 dere+1.284
lio+1.222
 agreement+1.171
Neg logit contributions
icide-1.864
heastern-1.641
ingers-1.619
icides-1.585
ppers-1.555
Token,
Token ID11
Feature activation+9.08e-02

Pos logit contributions
 agre+1.354
 agreeing+1.302
 dere+1.278
lio+1.228
 agreement+1.180
Neg logit contributions
icide-1.348
heastern-1.160
ingers-1.143
icides-1.114
ppers-1.090
Token JPMorgan
Token ID43318
Feature activation-1.57e+00

Pos logit contributions
icide+0.497
heastern+0.422
ingers+0.417
icides+0.404
ppers+0.397
Neg logit contributions
 agre-0.532
 agreeing-0.498
 dere-0.496
lio-0.478
yrights-0.461
Token Chase
Token ID16346
Feature activation-1.32e+00

Pos logit contributions
 agreeing+5.949
 agre+5.697
 dere+5.366
 agreement+5.102
 Stat+5.025
Neg logit contributions
icide-10.708
heastern-9.599
ingers-9.580
icides-9.437
ommel-9.331
Token,
Token ID11
Feature activation-6.54e-01

Pos logit contributions
 agreeing+5.843
 agre+5.660
 dere+5.280
 agreement+5.066
elli+5.039
Neg logit contributions
icide-8.407
heastern-7.479
ingers-7.261
icides-7.250
ommel-6.998
Token and
Token ID290
Feature activation-7.79e-01

Pos logit contributions
 agre+3.002
 agreeing+2.938
 dere+2.838
lio+2.681
 agreement+2.594
Neg logit contributions
icide-4.284
heastern-3.793
ingers-3.751
icides-3.678
ppers-3.604
Token Cit
Token ID15792
Feature activation-5.03e-01

Pos logit contributions
 agre+3.682
 agreeing+3.630
 dere+3.483
lio+3.281
 agreement+3.220
Neg logit contributions
icide-4.969
heastern-4.383
ingers-4.346
icides-4.261
ppers-4.158
Tokenib
Token ID571
Feature activation+7.65e-01

Pos logit contributions
 agre+2.105
 agreeing+2.026
 dere+1.957
lio+1.870
elli+1.774
Neg logit contributions
icide-3.507
heastern-3.132
ingers-3.099
icides-3.034
ppers-2.987
Token claims
Token ID3667
Feature activation+4.99e-01

Pos logit contributions
icide+1.125
heastern+0.976
ingers+0.972
icides+0.965
ppers+0.945
Neg logit contributions
 agre-0.750
 agreeing-0.705
 dere-0.679
lio-0.668
andum-0.632
Token that
Token ID326
Feature activation-5.16e-02

Pos logit contributions
icide+2.645
heastern+2.230
ingers+2.215
icides+2.152
ppers+2.129
Neg logit contributions
 agre-2.947
 agreeing-2.710
 dere-2.680
lio-2.661
andum-2.553
Token have
Token ID423
Feature activation-6.29e-02

Pos logit contributions
 agre+0.300
 agreeing+0.288
 dere+0.280
lio+0.271
andum+0.261
Neg logit contributions
icide-0.282
heastern-0.241
ingers-0.236
icides-0.231
ppers-0.228
Token been
Token ID587
Feature activation-1.47e-01

Pos logit contributions
 agre+0.361
 agreeing+0.344
 dere+0.336
lio+0.325
andum+0.312
Neg logit contributions
icide-0.350
heastern-0.300
ingers-0.294
icides-0.289
ppers-0.283
Token pending
Token ID13310
Feature activation-7.97e-01

Pos logit contributions
 agre+0.852
 agreeing+0.812
 dere+0.794
lio+0.768
 agreement+0.739
Neg logit contributions
icide-0.812
heastern-0.696
ingers-0.685
icides-0.668
ppers-0.654
Token for
Token ID329
Feature activation-1.57e+00

Pos logit contributions
 agre+4.799
 agreeing+4.714
 dere+4.637
lio+4.385
 agreement+4.341
Neg logit contributions
icide-4.036
heastern-3.430
ingers-3.390
icides-3.279
ppers-3.208
Token at
Token ID379
Feature activation-5.15e-01

Pos logit contributions
 agreeing+6.059
 agre+5.938
 dere+5.917
 agreement+5.380
lio+5.144
Neg logit contributions
icide-10.662
heastern-9.687
ingers-9.615
icides-9.252
ommel-9.205
Token least
Token ID1551
Feature activation-3.63e-01

Pos logit contributions
 agre+2.630
 agreeing+2.558
 dere+2.510
lio+2.380
 agreement+2.312
Neg logit contributions
icide-3.120
heastern-2.728
ingers-2.693
icides-2.629
ppers-2.558
Token a
Token ID257
Feature activation-1.80e-01

Pos logit contributions
 agre+2.094
 agreeing+1.997
 dere+1.964
lio+1.909
elli+1.823
Neg logit contributions
icide-2.005
heastern-1.720
ingers-1.690
icides-1.646
ppers-1.617
Token decade
Token ID5707
Feature activation+2.38e-01

Pos logit contributions
 agre+0.729
 agreeing+0.681
 dere+0.660
lio+0.641
elli+0.597
Neg logit contributions
icide-1.305
heastern-1.159
ingers-1.140
icides-1.122
ppers-1.104
Token
Token ID851
Feature activation-3.39e-01

Pos logit contributions
icide+1.298
heastern+1.102
ingers+1.079
icides+1.060
ppers+1.030
Neg logit contributions
 agre-1.407
 agreeing-1.316
 dere-1.303
lio-1.262
andum-1.198
Token]
Token ID60
Feature activation-2.82e-01

Pos logit contributions
 agre+0.568
 agreeing+0.544
 dere+0.533
lio+0.514
andum+0.489
Neg logit contributions
icide-0.638
heastern-0.545
ingers-0.539
icides-0.532
ppers-0.515
Token prohibition
Token ID19548
Feature activation+6.86e-01

Pos logit contributions
 agre+1.483
 agreeing+1.421
 dere+1.393
lio+1.338
elli+1.272
Neg logit contributions
icide-1.705
heastern-1.474
ingers-1.460
icides-1.431
ppers-1.400
Token,
Token ID11
Feature activation+1.70e-01

Pos logit contributions
icide+3.684
icides+3.068
ingers+3.019
heastern+3.016
ppers+2.956
Neg logit contributions
 agre-3.945
 dere-3.722
 agreeing-3.695
lio-3.593
andum-3.420
Token slavery
Token ID13503
Feature activation+9.08e-02

Pos logit contributions
icide+0.955
heastern+0.811
ingers+0.803
icides+0.787
ppers+0.767
Neg logit contributions
 agre-0.951
 agreeing-0.911
 dere-0.900
lio-0.867
andum-0.829
Token,
Token ID11
Feature activation-1.64e-01

Pos logit contributions
icide+0.521
heastern+0.445
ingers+0.441
icides+0.433
ppers+0.423
Neg logit contributions
 agre-0.502
 agreeing-0.479
 dere-0.473
lio-0.455
andum-0.436
Token women
Token ID1466
Feature activation-1.57e+00

Pos logit contributions
 agre+0.868
 agreeing+0.832
 dere+0.814
lio+0.783
 agreement+0.747
Neg logit contributions
icide-0.985
heastern-0.850
ingers-0.840
icides-0.824
ppers-0.804
Token
Token ID447
Feature activation+1.21e-01

Pos logit contributions
 agreeing+7.132
 agre+7.056
 dere+6.714
 agreement+6.217
elli+6.107
Neg logit contributions
icide-9.380
heastern-8.835
ebus-8.420
ingers-8.360
uliffe-8.144
Token
Token ID247
Feature activation+1.32e-02

Pos logit contributions
icide+0.345
heastern+0.250
ingers+0.244
icides+0.226
ppers+0.217
Neg logit contributions
 agre-1.019
 agreeing-0.989
 dere-0.981
lio-0.953
 agreement-0.930
Tokens
Token ID82
Feature activation-2.54e-01

Pos logit contributions
icide+0.077
heastern+0.066
ingers+0.066
icides+0.065
ppers+0.063
Neg logit contributions
 agre-0.072
 agreeing-0.068
 dere-0.067
lio-0.065
andum-0.062
Token rights
Token ID2489
Feature activation-6.06e-02

Pos logit contributions
 agre+1.237
 agreeing+1.178
 dere+1.153
lio+1.105
 agreement+1.047
Neg logit contributions
icide-1.636
heastern-1.426
ingers-1.413
icides-1.387
ppers-1.357
Token.
Token ID13
Feature activation-4.62e-02

Pos logit contributions
 agre+0.334
 agreeing+0.320
 dere+0.315
lio+0.303
 agreement+0.288
Neg logit contributions
icide-0.349
heastern-0.300
ingers-0.297
icides-0.290
ppers-0.284
Token projects
Token ID4493
Feature activation-4.58e-01

Pos logit contributions
 agre+1.989
 agreeing+1.899
 dere+1.854
lio+1.772
 agreement+1.682
Neg logit contributions
icide-2.745
heastern-2.415
ingers-2.386
icides-2.336
ppers-2.291
Token like
Token ID588
Feature activation-9.45e-02

Pos logit contributions
 agre+2.639
 agreeing+2.566
 dere+2.507
lio+2.384
 agreement+2.333
Neg logit contributions
icide-2.494
heastern-2.137
ingers-2.108
icides-2.056
ppers-2.003
Token hospitals
Token ID11301
Feature activation-6.24e-01

Pos logit contributions
 agre+0.499
 agreeing+0.480
 dere+0.465
lio+0.456
elli+0.434
Neg logit contributions
icide-0.558
heastern-0.483
ingers-0.482
icides-0.465
ppers-0.463
Token and
Token ID290
Feature activation-1.61e-01

Pos logit contributions
 agre+3.039
 agreeing+2.972
 dere+2.893
lio+2.692
 agreement+2.646
Neg logit contributions
icide-3.862
heastern-3.394
ingers-3.337
icides-3.280
ppers-3.205
Token schools
Token ID4266
Feature activation-2.67e-01

Pos logit contributions
 agre+0.847
 agreeing+0.812
 dere+0.787
lio+0.769
elli+0.735
Neg logit contributions
icide-0.962
heastern-0.830
ingers-0.826
icides-0.804
ppers-0.795
Token.[
Token ID3693
Feature activation+1.13e+00

Pos logit contributions
 agre+1.585
 agreeing+1.533
 dere+1.503
lio+1.440
 agreement+1.397
Neg logit contributions
icide-1.413
heastern-1.204
ingers-1.186
icides-1.156
ppers-1.126
Token5
Token ID20
Feature activation+2.73e-01

Pos logit contributions
icide+3.662
heastern+2.754
ingers+2.588
icides+2.510
ppers+2.378
Neg logit contributions
 agre-8.788
 dere-8.506
 agreeing-8.473
lio-8.071
 agreement-7.873
Token]
Token ID60
Feature activation+5.11e-02

Pos logit contributions
icide+1.289
heastern+1.046
ingers+1.027
icides+0.997
ppers+0.985
Neg logit contributions
 agre-1.871
 agreeing-1.718
 dere-1.694
lio-1.625
andum-1.572
Token Additionally
Token ID12032
Feature activation+8.55e-02

Pos logit contributions
icide+0.296
heastern+0.254
ingers+0.252
icides+0.245
ppers+0.241
Neg logit contributions
 agre-0.281
 agreeing-0.267
 dere-0.263
lio-0.252
 agreement-0.240
Token,
Token ID11
Feature activation-4.15e-02

Pos logit contributions
icide+0.508
heastern+0.441
ingers+0.436
icides+0.425
ppers+0.415
Neg logit contributions
 agre-0.456
 agreeing-0.441
 dere-0.430
lio-0.408
 agreement-0.398
Token the
Token ID262
Feature activation+6.97e-02

Pos logit contributions
 agre+0.237
 agreeing+0.226
 dere+0.224
lio+0.218
elli+0.209
Neg logit contributions
icide-0.229
heastern-0.196
ingers-0.194
icides-0.189
ppers-0.185
Token as
Token ID355
Feature activation+4.31e-01

Pos logit contributions
 agre+2.388
 agreeing+2.305
 dere+2.257
lio+2.157
 agreement+2.089
Neg logit contributions
icide-2.422
heastern-2.088
ingers-2.060
icides-2.010
ppers-1.963
Token evidence
Token ID2370
Feature activation+8.16e-01

Pos logit contributions
icide+2.518
ingers+2.097
heastern+2.091
icides+2.075
ppers+2.052
Neg logit contributions
 agre-2.252
lio-2.073
 agreeing-2.029
elli-2.025
 dere-2.025
Token that
Token ID326
Feature activation+6.94e-01

Pos logit contributions
icide+4.440
heastern+3.711
ingers+3.675
icides+3.611
ommel+3.523
Neg logit contributions
 agre-4.660
 agreeing-4.361
 dere-4.343
lio-4.305
elli-4.065
Token Kim
Token ID6502
Feature activation+6.89e-01

Pos logit contributions
icide+3.638
ingers+3.070
heastern+3.068
icides+2.999
ppers+2.997
Neg logit contributions
 agre-3.952
lio-3.708
 dere-3.651
 agreeing-3.599
elli-3.530
Token Jong
Token ID17960
Feature activation+1.47e-01

Pos logit contributions
icide+4.564
ingers+3.890
icides+3.863
ppers+3.856
heastern+3.846
Neg logit contributions
 agre-3.107
 dere-2.889
 agreeing-2.844
lio-2.814
andum-2.646
Token Un
Token ID791
Feature activation+1.28e+00

Pos logit contributions
icide+0.937
heastern+0.817
ingers+0.806
icides+0.793
ppers+0.785
Neg logit contributions
 agre-0.724
 agreeing-0.688
 dere-0.674
lio-0.648
elli-0.618
Token is
Token ID318
Feature activation+1.01e+00

Pos logit contributions
icide+6.457
ingers+5.221
icides+5.194
ppers+5.153
heastern+5.012
Neg logit contributions
 agre-7.331
lio-6.884
 agreeing-6.763
 dere-6.704
andum-6.568
Token losing
Token ID6078
Feature activation+1.14e+00

Pos logit contributions
icide+5.503
ingers+4.676
ppers+4.479
 BACK+4.479
icides+4.385
Neg logit contributions
 agre-5.529
lio-5.081
yrights-5.014
andum-4.873
iqueness-4.821
Token his
Token ID465
Feature activation-1.93e-01

Pos logit contributions
icide+5.763
ingers+5.039
icides+4.655
ppers+4.642
heastern+4.599
Neg logit contributions
 agre-6.379
lio-6.006
 dere-5.896
 agreeing-5.875
andum-5.772
Token grip
Token ID11762
Feature activation-2.99e-01

Pos logit contributions
 agre+1.154
 agreeing+1.107
 dere+1.086
lio+1.052
elli+1.009
Neg logit contributions
icide-1.027
heastern-0.871
ingers-0.865
icides-0.838
ppers-0.818
Token on
Token ID319
Feature activation-3.02e-01

Pos logit contributions
 agre+1.521
 agreeing+1.466
 dere+1.431
lio+1.362
 agreement+1.319
Neg logit contributions
icide-1.838
heastern-1.609
ingers-1.577
icides-1.551
ppers-1.517
Token by
Token ID416
Feature activation-1.05e-01

Pos logit contributions
icide+0.116
heastern+0.098
ingers+0.096
icides+0.093
ppers+0.090
Neg logit contributions
 agre-0.145
 agreeing-0.138
 dere-0.137
lio-0.132
elli-0.127
Token the
Token ID262
Feature activation+1.16e-01

Pos logit contributions
 agre+0.575
 agreeing+0.546
 dere+0.539
lio+0.519
andum+0.494
Neg logit contributions
icide-0.607
heastern-0.523
ingers-0.515
icides-0.503
ppers-0.496
Token Nobel
Token ID20715
Feature activation+7.04e-03

Pos logit contributions
icide+0.662
heastern+0.575
ingers+0.558
icides+0.544
ppers+0.540
Neg logit contributions
 agre-0.633
 agreeing-0.602
 dere-0.592
lio-0.572
andum-0.550
Token Foundation
Token ID5693
Feature activation+6.87e-01

Pos logit contributions
icide+0.039
heastern+0.033
ingers+0.032
icides+0.032
ppers+0.031
Neg logit contributions
 agre-0.041
 agreeing-0.039
 dere-0.039
lio-0.037
 agreement-0.035
Token in
Token ID287
Feature activation+3.35e-01

Pos logit contributions
icide+3.534
heastern+2.970
ingers+2.930
ommel+2.814
icides+2.783
Neg logit contributions
 agre-4.057
 dere-3.852
 agreeing-3.771
lio-3.750
yrights-3.570
Token 1968
Token ID15963
Feature activation+1.11e+00

Pos logit contributions
icide+2.015
ingers+1.714
heastern+1.704
ppers+1.670
icides+1.662
Neg logit contributions
 agre-1.708
 agreeing-1.576
 dere-1.567
lio-1.513
andum-1.477
Token from
Token ID422
Feature activation-1.25e-01

Pos logit contributions
icide+5.241
ingers+4.227
heastern+4.196
ommel+4.073
icides+3.999
Neg logit contributions
 agre-6.788
 dere-6.390
lio-6.196
 agreeing-6.051
yrights-5.972
Token S
Token ID311
Feature activation-1.32e-02

Pos logit contributions
 agre+0.669
 agreeing+0.637
 dere+0.627
lio+0.602
elli+0.573
Neg logit contributions
icide-0.742
heastern-0.643
ingers-0.634
icides-0.619
ppers-0.608
Tokenver
Token ID332
Feature activation+1.00e+00

Pos logit contributions
 agre+0.058
 agreeing+0.055
 dere+0.053
lio+0.050
 agreement+0.048
Neg logit contributions
icide-0.091
heastern-0.081
ingers-0.080
icides-0.078
ppers-0.077
Tokenig
Token ID328
Feature activation+2.23e-01

Pos logit contributions
icide+5.512
ingers+4.790
heastern+4.469
ppers+4.429
icides+4.393
Neg logit contributions
 agre-6.344
 redes-5.395
 agreeing-5.392
 dere-5.347
ortunately-5.112
Tokenes
Token ID274
Feature activation+3.91e-01

Pos logit contributions
icide+1.276
heastern+1.096
ingers+1.088
icides+1.063
ppers+1.056
Neg logit contributions
 agre-1.327
 agreeing-1.192
 dere-1.173
lio-1.097
andum-1.062
Token,
Token ID11
Feature activation+1.27e-01

Pos logit contributions
 agre+3.450
 agreeing+3.383
 dere+3.277
lio+3.086
 agreement+3.044
Neg logit contributions
icide-3.958
heastern-3.473
ingers-3.437
icides-3.348
ppers-3.260
Token and
Token ID290
Feature activation+5.09e-01

Pos logit contributions
icide+0.710
heastern+0.602
ingers+0.593
icides+0.581
ppers+0.571
Neg logit contributions
 agre-0.725
 dere-0.676
 agreeing-0.675
lio-0.651
andum-0.642
Token they
Token ID484
Feature activation+4.48e-02

Pos logit contributions
icide+2.584
heastern+2.166
ingers+2.121
ppers+2.085
icides+2.068
Neg logit contributions
 agre-3.073
 dere-2.865
lio-2.844
 agreeing-2.842
andum-2.747
Token need
Token ID761
Feature activation+6.80e-01

Pos logit contributions
icide+0.243
heastern+0.203
ingers+0.201
icides+0.197
ppers+0.197
Neg logit contributions
 agre-0.263
 agreeing-0.249
 dere-0.246
lio-0.242
andum-0.232
Token eight
Token ID3624
Feature activation+8.91e-01

Pos logit contributions
icide+3.399
heastern+2.813
ingers+2.767
icides+2.735
ppers+2.714
Neg logit contributions
 agre-4.176
 agreeing-3.884
 dere-3.847
lio-3.824
andum-3.702
Token to
Token ID284
Feature activation+1.34e+00

Pos logit contributions
icide+4.603
ingers+3.914
heastern+3.879
icides+3.759
ppers+3.665
Neg logit contributions
 agre-5.189
 agreeing-4.799
lio-4.799
 dere-4.744
andum-4.568
Token 10
Token ID838
Feature activation+5.58e-01

Pos logit contributions
icide+5.641
heastern+4.906
ppers+4.750
ingers+4.608
icides+4.316
Neg logit contributions
 agre-8.360
 agreeing-7.893
lio-7.889
 dere-7.867
andum-7.656
Token."
Token ID526
Feature activation-3.31e-01

Pos logit contributions
icide+3.245
ingers+2.850
heastern+2.840
icides+2.694
ppers+2.662
Neg logit contributions
 agre-2.923
 agreeing-2.733
 dere-2.694
lio-2.680
yrights-2.559
Token\n
Token ID198
Feature activation-2.30e-01

Pos logit contributions
 agre+1.524
 agreeing+1.498
 dere+1.448
lio+1.406
 Stat+1.346
Neg logit contributions
icide-2.137
heastern-1.862
ingers-1.862
icides-1.818
ppers-1.789
Token\n
Token ID198
Feature activation-2.69e-01

Pos logit contributions
 agre+1.266
 agreeing+1.188
 dere+1.164
lio+1.112
 agreement+1.066
Neg logit contributions
icide-1.356
heastern-1.170
ingers-1.157
icides-1.127
ppers-1.110
TokenOn
Token ID2202
Feature activation-1.20e-01

Pos logit contributions
 agre+1.464
 agreeing+1.398
 dere+1.372
lio+1.307
 agreement+1.260
Neg logit contributions
icide-1.577
heastern-1.367
ingers-1.348
icides-1.314
ppers-1.290
Token cases
Token ID2663
Feature activation+1.56e-01

Pos logit contributions
icide+0.318
heastern+0.279
ingers+0.277
icides+0.269
ppers+0.264
Neg logit contributions
 agre-0.251
 agreeing-0.237
 dere-0.230
lio-0.226
elli-0.214
Token -
Token ID532
Feature activation+1.41e-01

Pos logit contributions
icide+0.904
heastern+0.780
ingers+0.768
icides+0.745
ppers+0.731
Neg logit contributions
 agre-0.854
 agreeing-0.799
 dere-0.796
lio-0.776
yrights-0.737
Token a
Token ID257
Feature activation+6.78e-01

Pos logit contributions
icide+0.826
ingers+0.703
heastern+0.700
icides+0.687
ppers+0.669
Neg logit contributions
 agre-0.768
 agreeing-0.709
 dere-0.708
lio-0.689
yrights-0.660
Token four
Token ID1440
Feature activation+9.06e-01

Pos logit contributions
icide+3.901
heastern+3.406
ingers+3.311
ppers+3.266
 BACK+3.191
Neg logit contributions
 agre-3.525
 agreeing-3.259
lio-3.212
 dere-3.131
elli-3.130
Token-
Token ID12
Feature activation+7.26e-01

Pos logit contributions
icide+4.196
heastern+3.484
ingers+3.457
ppers+3.261
icides+3.206
Neg logit contributions
 agre-5.862
 agreeing-5.470
 dere-5.363
andum-5.171
lio-5.167
Tokenfold
Token ID11379
Feature activation+1.16e+00

Pos logit contributions
icide+3.484
ingers+2.863
heastern+2.841
woman+2.669
icides+2.658
Neg logit contributions
 agre-4.757
 agreeing-4.409
 dere-4.324
yrights-4.109
lio-4.085
Token increase
Token ID2620
Feature activation-5.51e-02

Pos logit contributions
icide+5.025
heastern+3.847
icides+3.772
ingers+3.750
ppers+3.715
Neg logit contributions
 agre-7.611
lio-7.122
 agreeing-7.029
yrights-6.948
 dere-6.909
Token in
Token ID287
Feature activation-8.03e-02

Pos logit contributions
 agre+0.314
 agreeing+0.301
 dere+0.296
lio+0.286
elli+0.274
Neg logit contributions
icide-0.307
heastern-0.263
ingers-0.259
icides-0.253
ppers-0.247
Token a
Token ID257
Feature activation-2.06e-01

Pos logit contributions
 agre+0.460
 agreeing+0.436
 dere+0.429
lio+0.418
elli+0.401
Neg logit contributions
icide-0.449
heastern-0.384
ingers-0.377
icides-0.371
ppers-0.361
Token week
Token ID1285
Feature activation-1.59e-01

Pos logit contributions
 agre+0.937
 agreeing+0.886
 dere+0.858
lio+0.828
elli+0.787
Neg logit contributions
icide-1.388
heastern-1.226
ingers-1.206
icides-1.184
ppers-1.165
Token.
Token ID13
Feature activation+4.12e-01

Pos logit contributions
 agre+0.899
 agreeing+0.864
 dere+0.848
lio+0.814
 agreement+0.783
Neg logit contributions
icide-0.897
heastern-0.772
ingers-0.760
icides-0.741
ppers-0.725
Token and
Token ID290
Feature activation-7.60e-02

Pos logit contributions
 agre+0.783
 agreeing+0.738
 dere+0.725
lio+0.702
andum+0.679
Neg logit contributions
icide-0.790
heastern-0.678
ingers-0.674
icides-0.653
ppers-0.641
Token he
Token ID339
Feature activation-7.03e-02

Pos logit contributions
 agre+0.414
 agreeing+0.389
 dere+0.384
lio+0.373
andum+0.359
Neg logit contributions
icide-0.441
heastern-0.383
ingers-0.377
icides-0.365
ppers-0.363
Token did
Token ID750
Feature activation+2.33e-01

Pos logit contributions
 agre+0.442
 agreeing+0.421
 dere+0.414
lio+0.406
andum+0.393
Neg logit contributions
icide-0.351
heastern-0.292
ingers-0.289
icides-0.280
ppers-0.276
Token not
Token ID407
Feature activation-2.88e-01

Pos logit contributions
icide+1.184
heastern+1.006
ingers+0.987
ppers+0.957
icides+0.946
Neg logit contributions
 agre-1.407
 agreeing-1.326
 dere-1.309
lio-1.290
andum-1.249
Token fight
Token ID1907
Feature activation+4.64e-01

Pos logit contributions
 agre+1.544
 agreeing+1.472
 dere+1.436
lio+1.397
elli+1.334
Neg logit contributions
icide-1.703
heastern-1.462
ingers-1.456
icides-1.413
ppers-1.400
Token back
Token ID736
Feature activation+5.57e-01

Pos logit contributions
icide+2.151
ingers+1.794
heastern+1.784
 BACK+1.763
ppers+1.689
Neg logit contributions
 agre-3.036
 agreeing-2.823
 dere-2.775
lio-2.735
andum-2.717
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
icide+2.830
heastern+2.376
ingers+2.357
icides+2.230
 BACK+2.215
Neg logit contributions
 agre-3.426
 agreeing-3.127
 dere-3.127
lio-3.083
andum-3.052
Token Jal
Token ID28649
Feature activation-5.81e-01

Pos logit contributions
 agre+1.906
 agreeing+1.815
 dere+1.780
lio+1.696
 agreement+1.629
Neg logit contributions
icide-2.193
heastern-1.906
ingers-1.883
icides-1.835
ppers-1.804
Tokenen
Token ID268
Feature activation-1.12e-01

Pos logit contributions
 agre+3.037
 agreeing+2.971
 dere+2.916
lio+2.802
elli+2.700
Neg logit contributions
icide-3.418
heastern-2.963
ingers-2.926
icides-2.862
ppers-2.787
Token was
Token ID373
Feature activation-3.77e-01

Pos logit contributions
 agre+0.663
 agreeing+0.631
 dere+0.623
lio+0.604
andum+0.580
Neg logit contributions
icide-0.606
heastern-0.518
ingers-0.506
icides-0.492
ppers-0.485
Token doing
Token ID1804
Feature activation+2.19e-01

Pos logit contributions
 agre+1.996
 agreeing+1.879
 dere+1.845
lio+1.794
andum+1.710
Neg logit contributions
icide-2.267
heastern-1.968
ingers-1.951
icides-1.889
ppers-1.865
Token but
Token ID475
Feature activation-1.56e-01

Pos logit contributions
 agre+0.342
 agreeing+0.328
 dere+0.324
lio+0.315
andum+0.304
Neg logit contributions
icide-0.228
heastern-0.188
ingers-0.186
icides-0.179
ppers-0.173
Token drawing
Token ID8263
Feature activation+3.31e-01

Pos logit contributions
 agre+0.839
 agreeing+0.797
 dere+0.785
lio+0.764
andum+0.726
Neg logit contributions
icide-0.916
heastern-0.794
ingers-0.786
icides-0.765
ppers-0.754
Token conclusions
Token ID13242
Feature activation-5.48e-01

Pos logit contributions
icide+1.833
heastern+1.556
ingers+1.554
icides+1.522
ppers+1.491
Neg logit contributions
 agre-1.874
 agreeing-1.769
 dere-1.737
lio-1.715
andum-1.652
Token based
Token ID1912
Feature activation-2.45e-02

Pos logit contributions
 agre+3.012
 agreeing+2.928
 dere+2.848
lio+2.715
 agreement+2.648
Neg logit contributions
icide-3.124
heastern-2.719
ingers-2.668
icides-2.596
ppers-2.547
Token on
Token ID319
Feature activation+9.92e-02

Pos logit contributions
 agre+0.124
 agreeing+0.120
 dere+0.117
lio+0.111
 agreement+0.108
Neg logit contributions
icide-0.152
heastern-0.133
ingers-0.131
icides-0.128
ppers-0.125
Token them
Token ID606
Feature activation+1.54e-01

Pos logit contributions
icide+0.571
heastern+0.492
ingers+0.487
icides+0.475
ppers+0.467
Neg logit contributions
 agre-0.544
 agreeing-0.513
 dere-0.505
lio-0.493
andum-0.469
Token wouldn
Token ID3636
Feature activation-3.77e-01

Pos logit contributions
icide+0.883
heastern+0.748
ingers+0.744
icides+0.729
ppers+0.707
Neg logit contributions
 agre-0.864
 agreeing-0.817
 dere-0.800
lio-0.787
andum-0.748
Token
Token ID447
Feature activation-1.46e-01

Pos logit contributions
 agre+1.596
 agreeing+1.514
 dere+1.475
lio+1.395
 agreement+1.322
Neg logit contributions
icide-2.655
heastern-2.360
ingers-2.334
icides-2.288
ppers-2.249
Token
Token ID247
Feature activation-1.45e-01

Pos logit contributions
 agre+1.109
 agreeing+1.066
 dere+1.055
lio+1.018
 agreement+0.990
Neg logit contributions
icide-0.537
heastern-0.429
ingers-0.426
icides-0.391
ppers-0.390
Tokent
Token ID83
Feature activation-2.57e-01

Pos logit contributions
 agre+0.770
 agreeing+0.723
 dere+0.709
lio+0.672
andum+0.647
Neg logit contributions
icide-0.876
ingers-0.754
heastern-0.751
ppers-0.735
icides-0.729
Token be
Token ID307
Feature activation-2.36e-01

Pos logit contributions
 agre+1.267
 agreeing+1.207
 dere+1.182
lio+1.136
elli+1.086
Neg logit contributions
icide-1.635
heastern-1.424
ingers-1.397
icides-1.378
ppers-1.363
Token of
Token ID286
Feature activation-4.09e-01

Pos logit contributions
icide+1.008
heastern+0.851
ingers+0.847
icides+0.822
ppers+0.803
Neg logit contributions
 agre-1.061
 agreeing-0.957
 dere-0.954
lio-0.922
yrights-0.890
Token a
Token ID257
Feature activation+5.66e-02

Pos logit contributions
 agre+2.200
 agreeing+2.114
 dere+2.072
lio+1.984
 agreement+1.907
Neg logit contributions
icide-2.407
heastern-2.088
ingers-2.059
icides-2.009
ppers-1.967
Token few
Token ID1178
Feature activation+1.58e-01

Pos logit contributions
icide+0.339
heastern+0.294
ingers+0.289
icides+0.282
ppers+0.278
Neg logit contributions
 agre-0.301
 agreeing-0.283
 dere-0.276
lio-0.269
elli-0.259
Token weeks
Token ID2745
Feature activation+1.80e-01

Pos logit contributions
icide+0.864
heastern+0.741
ingers+0.730
icides+0.710
ppers+0.695
Neg logit contributions
 agre-0.920
 agreeing-0.868
 dere-0.850
lio-0.833
andum-0.802
Token,
Token ID11
Feature activation+3.18e-02

Pos logit contributions
icide+1.012
heastern+0.868
ingers+0.859
icides+0.835
ppers+0.818
Neg logit contributions
 agre-1.026
 agreeing-0.969
 dere-0.959
lio-0.922
elli-0.884
Token Sanders
Token ID5831
Feature activation+3.76e-01

Pos logit contributions
icide+0.175
heastern+0.151
ingers+0.148
icides+0.144
ppers+0.141
Neg logit contributions
 agre-0.184
 agreeing-0.173
 dere-0.172
lio-0.166
yrights-0.160
Token will
Token ID481
Feature activation-4.05e-02

Pos logit contributions
icide+1.933
ingers+1.626
heastern+1.606
ppers+1.565
icides+1.561
Neg logit contributions
 agre-2.274
 agreeing-2.138
 dere-2.135
lio-2.092
andum-2.016
Token need
Token ID761
Feature activation+4.25e-01

Pos logit contributions
 agre+0.217
 agreeing+0.205
 dere+0.201
lio+0.196
elli+0.189
Neg logit contributions
icide-0.240
ingers-0.206
heastern-0.205
ppers-0.200
icides-0.199
Token to
Token ID284
Feature activation+4.21e-01

Pos logit contributions
icide+2.285
heastern+1.970
ingers+1.926
icides+1.876
ppers+1.848
Neg logit contributions
 agre-2.500
 agreeing-2.328
 dere-2.326
lio-2.252
andum-2.167
Token prevail
Token ID28615
Feature activation+4.20e-01

Pos logit contributions
icide+2.486
ppers+2.150
ingers+2.133
heastern+2.123
icides+2.052
Neg logit contributions
 agre-2.198
 agreeing-2.052
 dere-2.020
lio-1.978
yrights-1.931
Token
Token ID851
Feature activation-3.56e-01

Pos logit contributions
icide+2.301
heastern+2.022
ingers+1.954
icides+1.889
ppers+1.846
Neg logit contributions
 agre-2.384
 agreeing-2.220
lio-2.200
 dere-2.191
andum-2.099
Token to
Token ID284
Feature activation+5.04e-01

Pos logit contributions
icide+2.121
heastern+1.759
ingers+1.757
icides+1.695
 BACK+1.688
Neg logit contributions
 agre-2.416
 agreeing-2.267
 dere-2.241
lio-2.228
andum-2.138
Token that
Token ID326
Feature activation+5.04e-01

Pos logit contributions
icide+2.669
heastern+2.245
ingers+2.231
ppers+2.173
icides+2.143
Neg logit contributions
 agre-2.938
 agreeing-2.783
 dere-2.718
lio-2.701
andum-2.593
Token I
Token ID314
Feature activation+2.62e-02

Pos logit contributions
icide+2.807
ingers+2.363
heastern+2.346
ppers+2.322
icides+2.251
Neg logit contributions
 agre-2.740
 agreeing-2.628
 dere-2.539
lio-2.500
andum-2.468
Token had
Token ID550
Feature activation+1.69e-01

Pos logit contributions
icide+0.128
ingers+0.105
heastern+0.105
icides+0.101
ppers+0.100
Neg logit contributions
 agre-0.167
 agreeing-0.161
 dere-0.160
lio-0.155
andum-0.148
Token to
Token ID284
Feature activation+5.41e-01

Pos logit contributions
icide+0.958
heastern+0.815
ingers+0.811
icides+0.786
ppers+0.775
Neg logit contributions
 agre-0.932
 agreeing-0.885
 dere-0.864
lio-0.850
andum-0.821
Token hold
Token ID1745
Feature activation+7.41e-01

Pos logit contributions
icide+3.247
ppers+2.750
ingers+2.743
heastern+2.742
 BACK+2.715
Neg logit contributions
 agre-2.684
 agreeing-2.527
lio-2.494
 dere-2.474
yrights-2.444
Token as
Token ID355
Feature activation+4.79e-01

Pos logit contributions
icide+3.672
ingers+3.127
heastern+3.083
 BACK+3.072
ppers+2.956
Neg logit contributions
 agre-4.435
 agreeing-4.127
lio-4.072
 dere-4.012
andum-3.987
Token the
Token ID262
Feature activation+3.61e-01

Pos logit contributions
icide+2.768
ingers+2.338
heastern+2.263
icides+2.249
ppers+2.227
Neg logit contributions
 agre-2.572
 agreeing-2.411
 dere-2.367
lio-2.357
andum-2.271
Token A
Token ID317
Feature activation+4.96e-01

Pos logit contributions
icide+2.137
ingers+1.832
heastern+1.799
 BACK+1.754
icides+1.748
Neg logit contributions
 agre-1.913
 agreeing-1.779
 dere-1.701
lio-1.700
andum-1.649
Token-
Token ID12
Feature activation+3.64e-01

Pos logit contributions
icide+2.772
ingers+2.400
heastern+2.389
ppers+2.227
icides+2.225
Neg logit contributions
 agre-2.784
 agreeing-2.621
 dere-2.505
lio-2.490
andum-2.404
Tokenstory
Token ID13571
Feature activation+6.93e-01

Pos logit contributions
icide+2.089
ingers+1.823
heastern+1.785
ommel+1.701
icides+1.701
Neg logit contributions
 agre-1.980
 agreeing-1.868
 dere-1.819
lio-1.716
andum-1.710
Token the
Token ID262
Feature activation-2.63e-01

Pos logit contributions
icide+0.183
heastern+0.154
ingers+0.153
icides+0.149
ppers+0.147
Neg logit contributions
 agre-0.189
 agreeing-0.178
 dere-0.178
lio-0.174
elli-0.164
Token rankings
Token ID16905
Feature activation+1.84e-01

Pos logit contributions
 agre+1.392
 agreeing+1.332
 dere+1.305
lio+1.252
elli+1.194
Neg logit contributions
icide-1.576
heastern-1.368
ingers-1.351
icides-1.318
ppers-1.295
Token.
Token ID13
Feature activation-7.60e-02

Pos logit contributions
icide+1.023
ingers+0.869
heastern+0.869
icides+0.835
ppers+0.832
Neg logit contributions
 agre-1.053
 agreeing-0.996
 dere-0.991
lio-0.962
andum-0.910
Token\n
Token ID198
Feature activation-2.80e-01

Pos logit contributions
 agre+0.410
 agreeing+0.386
 dere+0.380
lio+0.363
 agreement+0.345
Neg logit contributions
icide-0.456
heastern-0.389
ingers-0.387
icides-0.378
ppers-0.372
Token\n
Token ID198
Feature activation-2.54e-01

Pos logit contributions
 agre+1.509
 agreeing+1.430
 dere+1.400
lio+1.339
 agreement+1.284
Neg logit contributions
icide-1.669
heastern-1.445
ingers-1.429
icides-1.393
ppers-1.368
Token
Token ID447
Feature activation+3.62e-01

Pos logit contributions
 agre+1.386
 agreeing+1.331
 dere+1.305
lio+1.250
 agreement+1.202
Neg logit contributions
icide-1.475
heastern-1.277
ingers-1.259
icides-1.228
ppers-1.202
Token
Token ID250
Feature activation-5.42e-02

Pos logit contributions
icide+1.153
heastern+0.896
ingers+0.873
ppers+0.796
icides+0.783
Neg logit contributions
 agre-2.950
 agreeing-2.798
 dere-2.760
lio-2.680
 agreement-2.603
TokenMaybe
Token ID13300
Feature activation+5.04e-02

Pos logit contributions
 agre+0.306
 agreeing+0.286
 dere+0.282
lio+0.268
 agreement+0.257
Neg logit contributions
icide-0.317
heastern-0.269
ingers-0.268
icides-0.261
ppers-0.259
Token now
Token ID783
Feature activation-4.11e-01

Pos logit contributions
icide+0.289
heastern+0.246
ingers+0.243
icides+0.237
ppers+0.236
Neg logit contributions
 agre-0.280
 agreeing-0.263
 dere-0.262
lio-0.256
elli-0.240
Token it
Token ID340
Feature activation-3.21e-01

Pos logit contributions
 agre+2.200
 agreeing+2.121
 dere+2.075
lio+1.982
 agreement+1.914
Neg logit contributions
icide-2.422
heastern-2.107
ingers-2.076
icides-2.026
ppers-1.979
Token would
Token ID561
Feature activation-6.24e-01

Pos logit contributions
 agre+1.660
 agreeing+1.591
 dere+1.558
lio+1.489
 agreement+1.429
Neg logit contributions
icide-1.951
heastern-1.702
ingers-1.679
icides-1.639
ppers-1.606
Token Ethereum
Token ID20313
Feature activation+3.06e-01

Pos logit contributions
 agre+0.805
 agreeing+0.760
 dere+0.750
lio+0.726
elli+0.688
Neg logit contributions
icide-0.917
heastern-0.797
ingers-0.787
icides-0.767
ppers-0.759
Token to
Token ID284
Feature activation-3.80e-01

Pos logit contributions
icide+1.668
heastern+1.415
ingers+1.400
icides+1.362
ppers+1.345
Neg logit contributions
 agre-1.781
 agreeing-1.692
 dere-1.680
lio-1.619
elli-1.547
Token handle
Token ID5412
Feature activation+5.00e-01

Pos logit contributions
 agre+1.883
 agreeing+1.795
 dere+1.754
lio+1.679
elli+1.605
Neg logit contributions
icide-2.406
heastern-2.099
ingers-2.075
icides-2.029
ppers-2.012
Token one
Token ID530
Feature activation+2.79e-01

Pos logit contributions
icide+2.730
heastern+2.320
ingers+2.312
ppers+2.292
icides+2.255
Neg logit contributions
 agre-2.763
 dere-2.552
 agreeing-2.532
lio-2.519
andum-2.429
Token to
Token ID284
Feature activation+7.65e-01

Pos logit contributions
icide+1.542
heastern+1.306
ingers+1.288
ppers+1.270
icides+1.247
Neg logit contributions
 agre-1.588
 agreeing-1.490
 dere-1.478
lio-1.443
andum-1.389
Token 10
Token ID838
Feature activation-1.68e-02

Pos logit contributions
icide+3.826
heastern+3.222
ppers+3.179
ingers+3.029
icides+3.014
Neg logit contributions
 agre-4.649
 dere-4.404
 agreeing-4.357
andum-4.060
lio-4.053
Token m
Token ID285
Feature activation+4.60e-01

Pos logit contributions
 agre+0.092
 agreeing+0.087
 dere+0.087
lio+0.083
andum+0.079
Neg logit contributions
icide-0.097
heastern-0.084
ingers-0.083
icides-0.081
ppers-0.080
Tokenln
Token ID18755
Feature activation+6.28e-01

Pos logit contributions
icide+2.422
ingers+2.094
heastern+2.049
icides+1.992
ppers+1.982
Neg logit contributions
 agre-2.684
 dere-2.543
 agreeing-2.533
lio-2.362
 Malf-2.334
Token active
Token ID4075
Feature activation-2.38e-01

Pos logit contributions
icide+3.947
ingers+3.442
heastern+3.441
ppers+3.337
icides+3.334
Neg logit contributions
 agre-2.999
 dere-2.821
 agreeing-2.776
lio-2.621
yrights-2.518
Token users
Token ID2985
Feature activation-2.67e-01

Pos logit contributions
 agre+1.277
 agreeing+1.221
 dere+1.199
lio+1.151
elli+1.099
Neg logit contributions
icide-1.402
heastern-1.213
ingers-1.198
icides-1.170
ppers-1.148
Token,
Token ID11
Feature activation-7.29e-02

Pos logit contributions
 agre+1.487
 agreeing+1.426
 dere+1.402
lio+1.345
 agreement+1.291
Neg logit contributions
icide-1.519
heastern-1.309
ingers-1.293
icides-1.258
ppers-1.233
Token L
Token ID406
Feature activation-7.13e-02

Pos logit contributions
icide+2.177
heastern+1.828
ingers+1.803
icides+1.766
ommel+1.734
Neg logit contributions
 agre-2.377
 agreeing-2.245
 dere-2.234
lio-2.138
 agreement-2.031
Token D
Token ID360
Feature activation-3.25e-01

Pos logit contributions
 agre+0.395
 agreeing+0.377
 dere+0.371
lio+0.356
 agreement+0.341
Neg logit contributions
icide-0.412
heastern-0.356
ingers-0.350
icides-0.342
ppers-0.335
Tokenatt
Token ID1078
Feature activation-2.62e-01

Pos logit contributions
 agre+1.874
 agreeing+1.814
 dere+1.780
lio+1.708
 agreement+1.649
Neg logit contributions
icide-1.769
heastern-1.515
ingers-1.496
icides-1.458
ppers-1.421
Tokenu
Token ID84
Feature activation+2.10e-01

Pos logit contributions
 agre+1.232
 agreeing+1.178
 dere+1.150
lio+1.096
 agreement+1.046
Neg logit contributions
icide-1.710
heastern-1.508
ingers-1.488
icides-1.457
ppers-1.430
Token and
Token ID290
Feature activation-4.47e-01

Pos logit contributions
icide+1.231
heastern+1.068
ingers+1.050
icides+1.019
ppers+1.006
Neg logit contributions
 agre-1.128
 agreeing-1.067
 dere-1.059
lio-1.015
elli-0.970
Token C
Token ID327
Feature activation-3.30e-01

Pos logit contributions
 agre+2.398
 agreeing+2.312
 dere+2.263
lio+2.166
 agreement+2.085
Neg logit contributions
icide-2.631
heastern-2.285
ingers-2.256
icides-2.198
ppers-2.152
Token K
Token ID509
Feature activation-9.13e-02

Pos logit contributions
 agre+1.777
 agreeing+1.710
 dere+1.675
lio+1.604
 agreement+1.544
Neg logit contributions
icide-1.935
heastern-1.675
ingers-1.654
icides-1.616
ppers-1.578
Token Pr
Token ID1736
Feature activation+5.30e-02

Pos logit contributions
 agre+0.480
 agreeing+0.457
 dere+0.450
lio+0.429
 agreement+0.411
Neg logit contributions
icide-0.552
heastern-0.482
ingers-0.473
icides-0.463
ppers-0.454
Tokenas
Token ID292
Feature activation+4.69e-01

Pos logit contributions
icide+0.377
ingers+0.336
heastern+0.333
icides+0.325
ppers+0.320
Neg logit contributions
 agre-0.225
 agreeing-0.210
 dere-0.203
lio-0.185
 agreement-0.184
Tokenad
Token ID324
Feature activation-2.12e-01

Pos logit contributions
icide+3.207
heastern+2.773
ingers+2.756
icides+2.740
ommel+2.712
Neg logit contributions
 agre-2.153
 agreeing-1.998
 dere-1.931
lio-1.762
 agreement-1.727
Token also
Token ID635
Feature activation+3.14e-01

Pos logit contributions
 agre+1.221
 agreeing+1.175
 dere+1.154
lio+1.110
 agreement+1.068
Neg logit contributions
icide-1.164
heastern-0.999
ingers-0.984
icides-0.956
ppers-0.936
Token no
Token ID645
Feature activation+4.13e-01

Pos logit contributions
icide+0.461
heastern+0.393
ingers+0.390
icides+0.381
ppers+0.374
Neg logit contributions
 agre-0.473
 agreeing-0.452
 dere-0.444
lio-0.432
andum-0.411
Token interest
Token ID1393
Feature activation-2.64e-01

Pos logit contributions
icide+2.617
ingers+2.304
heastern+2.270
ppers+2.232
icides+2.222
Neg logit contributions
 agre-1.961
 agreeing-1.838
lio-1.786
 dere-1.758
elli-1.675
Token in
Token ID287
Feature activation-5.18e-02

Pos logit contributions
 agre+1.219
 agreeing+1.198
 dere+1.149
lio+1.074
 agreement+1.052
Neg logit contributions
icide-1.735
heastern-1.534
ingers-1.507
icides-1.476
ppers-1.458
Token seeking
Token ID6095
Feature activation-1.80e-02

Pos logit contributions
 agre+0.269
 agreeing+0.250
 dere+0.248
lio+0.243
elli+0.229
Neg logit contributions
icide-0.316
ingers-0.273
heastern-0.273
icides-0.266
ppers-0.262
Token a
Token ID257
Feature activation-3.30e-02

Pos logit contributions
 agre+0.096
 agreeing+0.091
 dere+0.090
lio+0.088
elli+0.083
Neg logit contributions
icide-0.106
heastern-0.091
ingers-0.091
icides-0.089
ppers-0.087
Token patent
Token ID12701
Feature activation-1.17e-02

Pos logit contributions
 agre+0.166
 agreeing+0.157
 dere+0.154
lio+0.151
elli+0.143
Neg logit contributions
icide-0.205
heastern-0.178
ingers-0.177
icides-0.171
ppers-0.170
Token for
Token ID329
Feature activation-3.85e-01

Pos logit contributions
 agre+0.072
 agreeing+0.068
 dere+0.068
lio+0.066
elli+0.063
Neg logit contributions
icide-0.061
heastern-0.051
ingers-0.050
icides-0.049
ppers-0.048
Token the
Token ID262
Feature activation-1.30e-01

Pos logit contributions
 agre+2.037
 agreeing+1.966
 dere+1.922
lio+1.828
 agreement+1.772
Neg logit contributions
icide-2.286
heastern-1.992
ingers-1.949
icides-1.914
ppers-1.872
Token Love
Token ID5896
Feature activation+6.57e-02

Pos logit contributions
 agre+0.630
 agreeing+0.603
 dere+0.588
lio+0.566
elli+0.537
Neg logit contributions
icide-0.836
ingers-0.728
heastern-0.727
icides-0.704
ppers-0.694
Tokent
Token ID83
Feature activation+2.45e-01

Pos logit contributions
icide+0.398
ingers+0.345
heastern+0.342
icides+0.330
ppers+0.328
Neg logit contributions
 agre-0.362
 agreeing-0.330
 dere-0.324
lio-0.304
 agreement-0.293
Tokenron
Token ID1313
Feature activation+1.17e-01

Pos logit contributions
icide+1.735
heastern+1.525
ingers+1.518
icides+1.473
ppers+1.471
Neg logit contributions
 agre-1.155
 agreeing-1.004
 dere-0.968
lio-0.883
 agreement-0.856
Token.
Token ID13
Feature activation-1.04e-01

Pos logit contributions
 agre+0.406
 agreeing+0.389
 dere+0.383
lio+0.366
 agreement+0.351
Neg logit contributions
icide-0.440
heastern-0.381
ingers-0.376
icides-0.366
ppers-0.359
Token
Token ID198
Feature activation+2.28e-01

Pos logit contributions
 agre+0.555
 agreeing+0.533
 dere+0.522
lio+0.501
 agreement+0.480
Neg logit contributions
icide-0.621
heastern-0.539
ingers-0.532
icides-0.519
ppers-0.508
Token
Token ID198
Feature activation+3.38e-01

Pos logit contributions
icide+1.218
ingers+1.007
heastern+1.004
icides+0.974
ppers+0.971
Neg logit contributions
 agre-1.476
 agreeing-1.300
 dere-1.276
lio-1.208
yrights-1.178
TokenThe
Token ID464
Feature activation+2.30e-02

Pos logit contributions
icide+1.838
heastern+1.546
ingers+1.535
icides+1.500
ppers+1.477
Neg logit contributions
 agre-2.014
 agreeing-1.905
 dere-1.880
lio-1.752
 agreement-1.719
Token views
Token ID5009
Feature activation+4.97e-01

Pos logit contributions
icide+0.135
heastern+0.116
ingers+0.116
icides+0.112
ppers+0.110
Neg logit contributions
 agre-0.125
 agreeing-0.119
 dere-0.117
lio-0.112
elli-0.107
Token of
Token ID286
Feature activation+1.39e-02

Pos logit contributions
icide+2.384
heastern+1.964
ingers+1.940
icides+1.901
ppers+1.804
Neg logit contributions
 agre-3.238
 agreeing-3.043
 dere-3.015
lio-2.958
andum-2.841
Token contributors
Token ID20420
Feature activation-4.16e-01

Pos logit contributions
icide+0.094
heastern+0.082
ingers+0.080
icides+0.080
ppers+0.078
Neg logit contributions
 agre-0.065
 agreeing-0.060
 dere-0.059
lio-0.057
andum-0.054
Token are
Token ID389
Feature activation+5.02e-01

Pos logit contributions
 agre+1.891
 agreeing+1.880
 dere+1.813
lio+1.704
 agreement+1.667
Neg logit contributions
icide-2.699
heastern-2.398
ingers-2.382
icides-2.321
ppers-2.264
Token their
Token ID511
Feature activation+2.49e-01

Pos logit contributions
icide+2.423
ingers+1.963
icides+1.944
heastern+1.880
ppers+1.808
Neg logit contributions
 agre-3.267
 dere-3.015
 agreeing-2.998
lio-2.972
solete-2.858
Token own
Token ID898
Feature activation+3.50e-01

Pos logit contributions
icide+1.745
heastern+1.550
icides+1.512
ingers+1.508
ppers+1.489
Neg logit contributions
 agre-1.115
 agreeing-0.979
lio-0.977
 dere-0.960
andum-0.890
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
icide+1.760
heastern+1.455
ingers+1.430
+1.411
icides+1.406
Neg logit contributions
 agre-2.336
 dere-2.021
 agreeing-1.999
lio-1.994
andum-1.898
Token think
Token ID892
Feature activation+1.73e-01

Pos logit contributions
icide+1.574
ingers+1.351
ppers+1.325
icides+1.307
heastern+1.303
Neg logit contributions
 agre-1.227
 agreeing-1.157
 dere-1.146
lio-1.140
yrights-1.110
Token of
Token ID286
Feature activation+3.00e-01

Pos logit contributions
icide+1.000
heastern+0.850
ingers+0.841
icides+0.822
ppers+0.809
Neg logit contributions
 agre-0.962
 agreeing-0.901
 dere-0.895
lio-0.868
elli-0.827
Token recording
Token ID8296
Feature activation+3.08e-01

Pos logit contributions
icide+1.685
ingers+1.417
heastern+1.408
icides+1.390
ppers+1.368
Neg logit contributions
 agre-1.675
 dere-1.525
lio-1.521
 agreeing-1.514
yrights-1.460
Token your
Token ID534
Feature activation+6.95e-02

Pos logit contributions
icide+1.724
heastern+1.482
ingers+1.475
icides+1.429
ppers+1.411
Neg logit contributions
 agre-1.712
 agreeing-1.615
 dere-1.585
lio-1.552
andum-1.497
Token live
Token ID2107
Feature activation+4.61e-01

Pos logit contributions
icide+0.475
heastern+0.412
ingers+0.410
icides+0.407
ppers+0.397
Neg logit contributions
 agre-0.313
 agreeing-0.292
 dere-0.284
lio-0.273
elli-0.264
Token performances
Token ID13289
Feature activation+1.37e-01

Pos logit contributions
icide+2.696
heastern+2.317
ingers+2.295
icides+2.256
ppers+2.229
Neg logit contributions
 agre-2.422
 agreeing-2.268
 dere-2.234
lio-2.153
andum-2.105
Token?
Token ID30
Feature activation-1.33e-01

Pos logit contributions
icide+0.772
ingers+0.662
heastern+0.660
icides+0.636
ppers+0.627
Neg logit contributions
 agre-0.766
 agreeing-0.714
 dere-0.705
lio-0.699
andum-0.663
Token
Token ID198
Feature activation-2.63e-01

Pos logit contributions
 agre+0.684
 agreeing+0.658
 dere+0.644
lio+0.617
 agreement+0.591
Neg logit contributions
icide-0.814
heastern-0.712
ingers-0.701
icides-0.685
ppers-0.671
Token
Token ID198
Feature activation-4.44e-01

Pos logit contributions
 agre+1.413
 agreeing+1.341
 dere+1.313
lio+1.256
 agreement+1.204
Neg logit contributions
icide-1.572
heastern-1.362
ingers-1.346
icides-1.313
ppers-1.289
TokenRF
Token ID32754
Feature activation+1.06e-02

Pos logit contributions
 agre+2.071
 agreeing+2.004
 dere+1.936
lio+1.882
elli+1.792
Neg logit contributions
icide-2.875
heastern-2.547
ingers-2.494
icides-2.448
ppers-2.399
Token:
Token ID25
Feature activation-8.26e-01

Pos logit contributions
icide+0.064
heastern+0.054
ingers+0.054
icides+0.053
ppers+0.053
Neg logit contributions
 agre-0.060
 agreeing-0.053
 dere-0.052
lio-0.050
andum-0.048
Token lawyer
Token ID6853
Feature activation+3.28e-01

Pos logit contributions
icide+1.749
heastern+1.501
ingers+1.484
ppers+1.402
icides+1.400
Neg logit contributions
 agre-1.844
 agreeing-1.738
 dere-1.679
lio-1.671
 Malf-1.612
Token during
Token ID1141
Feature activation+2.91e-01

Pos logit contributions
icide+1.767
heastern+1.531
ingers+1.512
icides+1.426
ppers+1.423
Neg logit contributions
 agre-1.934
 agreeing-1.797
 dere-1.791
lio-1.726
 agreement-1.646
Token the
Token ID262
Feature activation+1.51e-01

Pos logit contributions
icide+1.446
heastern+1.204
ingers+1.196
icides+1.159
ppers+1.128
Neg logit contributions
 agre-1.831
 agreeing-1.728
 dere-1.708
lio-1.656
andum-1.586
Token George
Token ID4502
Feature activation-9.43e-01

Pos logit contributions
icide+0.656
heastern+0.538
ingers+0.529
icides+0.507
ppers+0.499
Neg logit contributions
 agre-1.042
 agreeing-0.991
 dere-0.969
lio-0.954
yrights-0.926
Token W
Token ID370
Feature activation-6.67e-02

Pos logit contributions
 agreeing+5.027
 dere+4.995
 agre+4.990
 agreement+4.703
lio+4.572
Neg logit contributions
icide-5.228
heastern-4.594
ingers-4.508
icides-4.338
ppers-4.323
Token.
Token ID13
Feature activation-8.78e-01

Pos logit contributions
 agre+0.396
 agreeing+0.372
 dere+0.366
lio+0.350
andum+0.335
Neg logit contributions
icide-0.365
heastern-0.312
ingers-0.307
icides-0.298
ppers-0.294
Token Bush
Token ID5511
Feature activation-3.13e-02

Pos logit contributions
 agreeing+4.879
 dere+4.853
 agre+4.766
lio+4.587
 agreement+4.528
Neg logit contributions
icide-4.659
heastern-4.141
ingers-4.030
icides-3.885
ppers-3.797
Token administration
Token ID3662
Feature activation+4.65e-01

Pos logit contributions
 agre+0.227
 agreeing+0.219
 dere+0.214
lio+0.211
andum+0.203
Neg logit contributions
icide-0.127
heastern-0.101
ingers-0.099
icides-0.096
ppers-0.091
Token,
Token ID11
Feature activation-1.25e-02

Pos logit contributions
icide+2.503
heastern+2.144
ingers+2.111
icides+2.030
ppers+1.990
Neg logit contributions
 agre-2.709
 agreeing-2.544
 dere-2.515
lio-2.445
 agreement-2.324
Token said
Token ID531
Feature activation-2.21e-01

Pos logit contributions
 agre+0.073
 agreeing+0.070
 dere+0.069
lio+0.066
andum+0.064
Neg logit contributions
icide-0.067
heastern-0.057
ingers-0.057
icides-0.055
ppers-0.054
Token there
Token ID612
Feature activation+1.88e-01

Pos logit contributions
 agre+1.177
 agreeing+1.120
 dere+1.100
lio+1.066
elli+1.011
Neg logit contributions
icide-1.311
heastern-1.140
ingers-1.121
icides-1.094
ppers-1.076
Token 1940
Token ID16236
Feature activation+8.06e-01

Pos logit contributions
icide+0.678
heastern+0.574
ingers+0.566
icides+0.550
ppers+0.544
Neg logit contributions
 agre-0.778
 agreeing-0.732
 dere-0.708
lio-0.699
andum-0.678
Tokens
Token ID82
Feature activation+4.60e-01

Pos logit contributions
icide+4.371
heastern+3.587
icides+3.496
ingers+3.489
ppers+3.477
Neg logit contributions
 agre-4.892
 agreeing-4.265
 dere-4.140
lio-4.108
andum-3.969
Token.
Token ID13
Feature activation+3.69e-01

Pos logit contributions
icide+2.482
heastern+2.083
ingers+2.020
icides+2.007
ppers+1.947
Neg logit contributions
 agre-2.758
 agreeing-2.559
 dere-2.540
lio-2.466
yrights-2.364
Token The
Token ID383
Feature activation+2.26e-01

Pos logit contributions
icide+2.044
heastern+1.722
ingers+1.714
 BACK+1.675
icides+1.674
Neg logit contributions
 agre-2.190
 agreeing-2.015
 dere-1.983
lio-1.854
andum-1.830
Token most
Token ID749
Feature activation-7.88e-01

Pos logit contributions
icide+1.326
heastern+1.143
ingers+1.132
icides+1.102
ppers+1.084
Neg logit contributions
 agre-1.224
 agreeing-1.148
 dere-1.113
lio-1.071
andum-1.049
Token famous
Token ID5863
Feature activation-8.95e-01

Pos logit contributions
 agre+3.559
 agreeing+3.474
 dere+3.423
lio+3.113
 agreement+3.085
Neg logit contributions
icide-5.201
heastern-4.608
ingers-4.507
icides-4.436
ommel-4.375
Token case
Token ID1339
Feature activation-4.00e-01

Pos logit contributions
 agre+4.297
 agreeing+4.229
 dere+4.168
 agreement+3.856
lio+3.854
Neg logit contributions
icide-5.573
heastern-4.923
ingers-4.860
icides-4.747
ppers-4.653
Token was
Token ID373
Feature activation+3.31e-01

Pos logit contributions
 agre+2.158
 agreeing+2.072
 dere+2.031
lio+1.946
 agreement+1.869
Neg logit contributions
icide-2.347
heastern-2.034
ingers-2.006
icides-1.957
ppers-1.916
Token a
Token ID257
Feature activation+2.03e-01

Pos logit contributions
icide+1.877
heastern+1.578
ingers+1.571
icides+1.555
ppers+1.514
Neg logit contributions
 agre-1.856
 agreeing-1.725
 dere-1.704
lio-1.664
yrights-1.612
Token Norwegian
Token ID22158
Feature activation+4.51e-01

Pos logit contributions
icide+1.195
heastern+1.033
ingers+1.012
icides+0.981
ppers+0.976
Neg logit contributions
 agre-1.087
 agreeing-1.017
 dere-0.980
lio-0.979
andum-0.938
Token woman
Token ID2415
Feature activation-1.12e-01

Pos logit contributions
icide+2.490
heastern+2.086
ingers+2.050
icides+1.982
woman+1.965
Neg logit contributions
 agre-2.565
 agreeing-2.395
lio-2.344
 dere-2.325
yrights-2.253
Token which
Token ID543
Feature activation+3.16e-02

Pos logit contributions
 agre+2.118
 agreeing+2.037
 dere+1.993
lio+1.906
 agreement+1.835
Neg logit contributions
icide-2.333
heastern-2.026
ingers-1.995
icides-1.950
ppers-1.907
Token I
Token ID314
Feature activation+5.21e-02

Pos logit contributions
icide+0.186
heastern+0.162
ingers+0.160
icides+0.155
ppers+0.155
Neg logit contributions
 agre-0.167
 agreeing-0.160
 dere-0.157
lio-0.153
andum-0.145
Token have
Token ID423
Feature activation+3.74e-01

Pos logit contributions
icide+0.291
ingers+0.249
heastern+0.249
ppers+0.242
icides+0.237
Neg logit contributions
 agre-0.289
 agreeing-0.279
 dere-0.277
lio-0.270
elli-0.255
Token kept
Token ID4030
Feature activation+5.08e-01

Pos logit contributions
icide+1.965
ingers+1.685
heastern+1.676
ppers+1.607
icides+1.577
Neg logit contributions
 agre-2.162
 agreeing-2.055
lio-2.043
 dere-2.024
andum-1.926
Token for
Token ID329
Feature activation-2.93e-01

Pos logit contributions
icide+2.665
ingers+2.298
heastern+2.277
ppers+2.180
ommel+2.127
Neg logit contributions
 agre-2.970
lio-2.778
 agreeing-2.734
 dere-2.660
yrights-2.599
Token almost
Token ID2048
Feature activation-9.57e-01

Pos logit contributions
 agre+1.567
 agreeing+1.495
 dere+1.465
lio+1.420
elli+1.354
Neg logit contributions
icide-1.739
heastern-1.503
ingers-1.484
icides-1.449
ppers-1.426
Token 30
Token ID1542
Feature activation-3.54e-01

Pos logit contributions
 agre+4.810
 agreeing+4.740
 dere+4.621
 agreement+4.264
lio+4.223
Neg logit contributions
icide-5.756
heastern-5.123
ingers-5.021
icides-4.906
uliffe-4.815
Token years
Token ID812
Feature activation+4.04e-01

Pos logit contributions
 agre+1.163
 agreeing+1.102
 dere+1.068
lio+0.977
 agreement+0.925
Neg logit contributions
icide-2.799
heastern-2.532
ingers-2.486
icides-2.464
ppers-2.430
Token.
Token ID13
Feature activation+1.86e-01

Pos logit contributions
icide+2.163
heastern+1.822
ingers+1.813
icides+1.742
ppers+1.706
Neg logit contributions
 agre-2.388
 agreeing-2.227
 dere-2.209
lio-2.165
yrights-2.049
Token
Token ID198
Feature activation-1.30e-01

Pos logit contributions
icide+1.087
ingers+0.934
heastern+0.926
 BACK+0.896
icides+0.895
Neg logit contributions
 agre-1.019
 agreeing-0.953
 dere-0.934
lio-0.895
 agreement-0.853
Token
Token ID198
Feature activation+1.51e-01

Pos logit contributions
 agre+0.743
 agreeing+0.687
 dere+0.673
lio+0.642
 agreement+0.616
Neg logit contributions
icide-0.752
heastern-0.641
ingers-0.638
icides-0.619
ppers-0.612
Token
Token ID198
Feature activation+5.17e-01

Pos logit contributions
icide+2.171
ingers+1.767
heastern+1.724
 BACK+1.718
ppers+1.709
Neg logit contributions
 agre-2.992
 agreeing-2.574
 dere-2.529
 redes-2.416
ortunately-2.414
Token+
Token ID10
Feature activation-1.21e-01

Pos logit contributions
icide+2.765
ingers+2.328
heastern+2.291
icides+2.227
ppers+2.207
Neg logit contributions
 agre-3.083
 agreeing-2.916
 dere-2.873
lio-2.711
 agreement-2.652
Token Loyal
Token ID34986
Feature activation-8.51e-01

Pos logit contributions
 agre+0.633
 agreeing+0.607
 dere+0.594
lio+0.569
 agreement+0.545
Neg logit contributions
icide-0.735
heastern-0.637
ingers-0.631
icides-0.614
ppers-0.603
Token,
Token ID11
Feature activation-3.17e-01

Pos logit contributions
 agre+4.371
 agreeing+4.205
 dere+4.150
lio+3.906
 agreement+3.820
Neg logit contributions
icide-5.028
heastern-4.441
ingers-4.381
icides-4.280
ppers-4.187
Token Def
Token ID2896
Feature activation-8.75e-01

Pos logit contributions
 agre+1.605
 agreeing+1.539
 dere+1.503
lio+1.445
elli+1.377
Neg logit contributions
icide-1.966
heastern-1.713
ingers-1.694
icides-1.650
ppers-1.620
Tokenerential
Token ID33369
Feature activation-9.31e-01

Pos logit contributions
 agre+3.616
 agreeing+3.451
 dere+3.385
lio+3.243
elli+3.115
Neg logit contributions
icide-6.012
heastern-5.320
ingers-5.284
icides-5.217
ppers-5.115
Token,
Token ID11
Feature activation-9.11e-01

Pos logit contributions
 agre+4.791
 agreeing+4.602
 dere+4.551
lio+4.225
 agreement+4.177
Neg logit contributions
icide-5.517
heastern-4.851
ingers-4.788
icides-4.722
ppers-4.618
Token Level
Token ID5684
Feature activation-6.37e-01

Pos logit contributions
 agre+4.617
 agreeing+4.410
 dere+4.333
lio+4.082
 agreement+3.962
Neg logit contributions
icide-5.555
heastern-4.880
ingers-4.790
icides-4.727
ppers-4.624
Token-
Token ID12
Feature activation+4.34e-01

Pos logit contributions
 agre+3.702
 agreeing+3.596
 dere+3.507
lio+3.358
 agreement+3.283
Neg logit contributions
icide-3.420
heastern-2.943
ingers-2.888
icides-2.839
ppers-2.743
Tokenheaded
Token ID15353
Feature activation-4.81e-01

Pos logit contributions
icide+1.902
heastern+1.537
ingers+1.524
ppers+1.477
icides+1.429
Neg logit contributions
 agre-3.002
 agreeing-2.800
 dere-2.782
lio-2.664
andum-2.659
Token
Token ID198
Feature activation-2.54e-01

Pos logit contributions
 agre+2.567
 agreeing+2.464
 dere+2.419
lio+2.305
 agreement+2.224
Neg logit contributions
icide-2.842
heastern-2.475
ingers-2.434
icides-2.386
ppers-2.332
Token National
Token ID2351
Feature activation-8.05e-01

Pos logit contributions
icide+1.616
heastern+1.387
ingers+1.356
icides+1.315
 BACK+1.305
Neg logit contributions
 agre-1.570
 agreeing-1.494
 dere-1.447
lio-1.417
andum-1.369
Token Geographic
Token ID33636
Feature activation-1.02e-01

Pos logit contributions
 agre+3.380
 agreeing+3.216
 dere+3.179
lio+2.980
 agreement+2.869
Neg logit contributions
icide-5.522
heastern-4.923
ingers-4.903
icides-4.782
ppers-4.694
Token
Token ID198
Feature activation+8.00e-03

Pos logit contributions
 agre+0.551
 agreeing+0.527
 dere+0.516
lio+0.496
elli+0.474
Neg logit contributions
icide-0.605
heastern-0.523
ingers-0.517
icides-0.503
ppers-0.494
Token
Token ID198
Feature activation+1.51e-01

Pos logit contributions
icide+0.045
heastern+0.038
ingers+0.038
icides+0.037
ppers+0.037
Neg logit contributions
 agre-0.048
 agreeing-0.043
 dere-0.043
lio-0.041
andum-0.039
TokenA
Token ID32
Feature activation-1.56e-01

Pos logit contributions
icide+0.849
heastern+0.715
ingers+0.711
icides+0.701
ppers+0.685
Neg logit contributions
 agre-0.891
 agreeing-0.832
 dere-0.809
lio-0.765
 agreement-0.748
Token Place
Token ID8474
Feature activation-9.60e-01

Pos logit contributions
 agre+0.806
 agreeing+0.769
 dere+0.741
lio+0.723
elli+0.691
Neg logit contributions
icide-0.959
heastern-0.842
ingers-0.823
icides-0.799
ppers-0.788
Token of
Token ID286
Feature activation-3.18e-01

Pos logit contributions
 agre+4.915
 agreeing+4.801
 dere+4.790
lio+4.492
 agreement+4.351
Neg logit contributions
icide-5.610
heastern-4.923
ingers-4.826
icides-4.780
ppers-4.644
Token Wild
Token ID6183
Feature activation-1.95e-01

Pos logit contributions
 agre+1.606
 agreeing+1.533
 dere+1.496
lio+1.432
 agreement+1.367
Neg logit contributions
icide-1.983
heastern-1.729
ingers-1.712
icides-1.672
ppers-1.641
Token Beauty
Token ID20774
Feature activation-2.65e-01

Pos logit contributions
 agre+0.769
 agreeing+0.720
 dere+0.694
lio+0.652
andum+0.615
Neg logit contributions
icide-1.442
heastern-1.286
ingers-1.281
icides-1.251
ppers-1.235
Token
Token ID198
Feature activation-1.29e-01

Pos logit contributions
 agre+1.393
 agreeing+1.336
 dere+1.308
lio+1.253
 agreement+1.201
Neg logit contributions
icide-1.592
heastern-1.384
ingers-1.366
icides-1.333
ppers-1.306
Token
Token ID198
Feature activation+5.41e-02

Pos logit contributions
 agre+0.726
 agreeing+0.676
 dere+0.663
lio+0.633
 agreement+0.607
Neg logit contributions
icide-0.756
heastern-0.648
ingers-0.643
icides-0.626
ppers-0.617