TOP ACTIVATIONS
MAX = 3.785

") is an autos omal dominant ker at oder ma of
at all seems an ab ject failure of that primary
- q aim ish , Ab ish qa , Q ut
. When she stops at ab ar ren roadside motel ,
said F idelity CEO Ab ig ail Johnson .
border , densely populated villages ab ut protected areas set aside
of it is directed or ab et ted by states who
India Read More Ab ut Sun Temple Here
graves of more than 400 enslaved and free blacks , was
a super - p reg nant Ann al ise knowing
in - Chief of Amber Un mask ed . com and
" London " have been banned from business advertisements so as
t be one dec ibel softer ,
You get five classic old - school camera styles to shoot
even hundreds of thousands of enslaved people including Archer Alexander
all Muslims is rep ug nant to me , and a
Alternative Facts in Eastern U topia ) is an official
actually show a toilet in ab ath room seemed oddly appropriate
is created and the archived file is uploaded to it
Probably YOU RS . Most Un sett ling Moment : Nothing

MIN ACTIVATIONS
MIN = -3.667

handle multiple spikes in the angle - of - attack data
sensitivity . Increases lip oly sis and free fatty
computers then reacted to the angle - of - attack data
le pro sy - induced nerve damage .[ 25 ]
En bridge s Line 9 pipeline was
entering into force of the Lisbon Treaty ); -
p > {{ phone . sn ipp et }} p
Trump discl oses on his financial disclosures , which he sometimes
does show that what the optical bench sees correlates with reality
Andre Gre ipel ( L otto - Bel isol Team )
Beyond the question of the Keystone XL Pipeline , Clinton has
conference committee was pushing the financial reform bill to the finish
Jim Bob s line that this was
and Russia s part in destabil izing eastern Ukraine
drawing up the " Str ategic plan for deployment of the
Ke Kan ak ol u ki i pa ia
stems from his sense of the word
Phi K aps part to pursue dis aff iliation
in Trump s financial disclosure this year from last
al ise s point of view ? Does that

INTERVAL 1.922 - 3.785
CONTAINS 0.602%

. com / api is deprecated and you should use https
investment which would be a catastrophe for the area ) and
she was covert ly recorded und ressing in a changing room
his party suffered a humiliating defeat to the euros cept ic
moving along . @ Master D al K | Technical Director

INTERVAL 0.059 - 1.922
CONTAINS 44.418%

The younger you are ," D ella Vol pe says ,
cups of water into a sauce pan and bring it to
ong Ang Il bo reported that in W ons an ,
that 's goofy , fun , and impossible not to forward
well integrity required under provincial regulations , researchers were told the

INTERVAL -1.804 - 0.059
CONTAINS 54.226%

when we enter system () , not to the bottom of
starting with the rich countries , publishes them before
spent much time thinking about the conditions out of which the
their kids toys in Pakistan from their homes . An array
out bur sts marked the SB 6 hearing , however .

INTERVAL -3.667 - -1.804
CONTAINS 0.753%

wrote a review of the scientific evidence on teen sleep needs
to answer questions from a Globe reporter on Wednesday about why
nothing will happen ( in terms of visible catast rop hes
, the Fire Station , Sn uf kin 's Camp ,
Convention , the American Ge ophysical Union in 2011 and 2012
Token")
Token ID4943
Feature activation+7.51e-01

Pos logit contributions
ess+6.043
der+6.024
rely+5.658
quished+5.639
nesday+5.633
Neg logit contributions
 assurances-6.329
 calculations-6.249
 Preferences-6.068
 negotiations-6.002
 Integrity-6.000
Token is
Token ID318
Feature activation+7.05e-01

Pos logit contributions
ess+2.840
der+2.806
nesday+2.800
quished+2.784
rely+2.658
Neg logit contributions
 assurances-3.565
 calculations-3.547
 Preferences-3.456
 negotiations-3.450
 Integrity-3.436
Token an
Token ID281
Feature activation+1.56e+00

Pos logit contributions
ess+3.375
der+3.343
nesday+3.308
quished+3.293
rely+3.200
Neg logit contributions
 assurances-2.632
 calculations-2.608
 Preferences-2.528
 negotiations-2.515
 Integrity-2.509
Token autos
Token ID44619
Feature activation+5.71e-01

Pos logit contributions
ess+7.063
der+6.933
quished+6.665
rely+6.658
nesday+6.633
Neg logit contributions
 assurances-6.026
 calculations-5.936
 Preferences-5.860
 Integrity-5.841
 negotiations-5.717
Tokenomal
Token ID18048
Feature activation+1.47e+00

Pos logit contributions
ess+3.068
der+3.060
nesday+3.057
quished+3.031
rely+2.926
Neg logit contributions
 calculations-1.768
 assurances-1.766
 Preferences-1.699
 negotiations-1.690
 Integrity-1.672
Token dominant
Token ID11410
Feature activation+3.79e+00

Pos logit contributions
ess+7.673
der+7.590
quished+7.392
rely+7.296
nesday+7.295
Neg logit contributions
 assurances-4.621
 calculations-4.597
 Integrity-4.465
 Preferences-4.446
 negotiations-4.288
Token ker
Token ID41927
Feature activation+2.64e-01

Pos logit contributions
der+13.269
ess+12.826
rely+11.980
escent+11.953
die+11.937
Neg logit contributions
 assurances-15.450
 Preferences-15.073
 Integrity-14.859
 calculations-14.649
 briefings-14.249
Tokenat
Token ID265
Feature activation+1.24e+00

Pos logit contributions
nesday+1.033
quished+1.027
ess+0.994
der+0.971
rely+0.934
Neg logit contributions
 calculations-1.218
 assurances-1.210
 negotiations-1.181
 remarks-1.172
 Integrity-1.164
Tokenoder
Token ID12342
Feature activation+1.35e+00

Pos logit contributions
ess+5.161
der+5.160
nesday+4.948
quished+4.927
rely+4.882
Neg logit contributions
 assurances-5.473
 calculations-5.356
 Preferences-5.255
 Integrity-5.212
 negotiations-5.192
Tokenma
Token ID2611
Feature activation+1.96e+00

Pos logit contributions
ess+6.061
der+6.047
nesday+5.819
quished+5.783
rely+5.733
Neg logit contributions
 assurances-5.422
 calculations-5.332
 Preferences-5.198
 Integrity-5.165
 negotiations-5.150
Token of
Token ID286
Feature activation+1.86e+00

Pos logit contributions
ess+8.969
der+8.818
rely+8.331
die+8.182
joined+8.110
Neg logit contributions
 assurances-7.457
 Preferences-7.257
 calculations-7.251
 Integrity-7.151
 briefings-6.919
Token at
Token ID379
Feature activation+2.61e-01

Pos logit contributions
ess+4.035
der+3.994
nesday+3.944
quished+3.924
rely+3.805
Neg logit contributions
 assurances-3.929
 calculations-3.898
 Preferences-3.792
 negotiations-3.774
 Integrity-3.767
Token all
Token ID477
Feature activation-2.50e-01

Pos logit contributions
ess+0.986
der+0.975
nesday+0.967
quished+0.959
rely+0.922
Neg logit contributions
 assurances-1.244
 calculations-1.238
 negotiations-1.205
 Preferences-1.204
 Integrity-1.196
Token
Token ID851
Feature activation+4.79e-01

Pos logit contributions
 calculations+0.943
 assurances+0.938
 negotiations+0.915
 Preferences+0.904
 remarks+0.902
Neg logit contributions
nesday-1.171
ess-1.163
quished-1.156
der-1.148
rely-1.118
Token seems
Token ID2331
Feature activation-9.00e-01

Pos logit contributions
ess+1.928
nesday+1.913
der+1.912
quished+1.909
rely+1.814
Neg logit contributions
 assurances-2.154
 calculations-2.141
 Preferences-2.083
 negotiations-2.080
 Integrity-2.066
Token an
Token ID281
Feature activation+8.31e-01

Pos logit contributions
 calculations+3.268
 assurances+3.253
 negotiations+3.138
 remarks+3.097
 judgments+3.087
Neg logit contributions
quished-4.282
nesday-4.266
ess-4.144
der-4.143
joined-4.088
Token ab
Token ID450
Feature activation+3.56e+00

Pos logit contributions
ess+4.612
der+4.565
quished+4.470
nesday+4.469
rely+4.397
Neg logit contributions
 assurances-2.448
 calculations-2.430
 Preferences-2.349
 Integrity-2.317
 negotiations-2.314
Tokenject
Token ID752
Feature activation+2.90e+00

Pos logit contributions
ess+11.183
der+11.092
rely+9.820
die+9.588
joined+9.249
Neg logit contributions
 assurances-17.206
 Preferences-16.703
 briefings-16.616
 Integrity-16.603
 calculations-16.372
Token failure
Token ID5287
Feature activation+1.27e+00

Pos logit contributions
der+13.489
ess+13.450
rely+12.768
die+12.347
nesday+12.304
Neg logit contributions
 Preferences-9.264
 calculations-9.218
 assurances-9.177
 Integrity-9.000
 coefficients-8.826
Token of
Token ID286
Feature activation+1.18e+00

Pos logit contributions
ess+5.297
der+5.219
nesday+5.060
quished+5.039
rely+4.911
Neg logit contributions
 assurances-5.470
 calculations-5.413
 Preferences-5.326
 Integrity-5.309
 negotiations-5.238
Token that
Token ID326
Feature activation+1.26e+00

Pos logit contributions
ess+6.017
der+5.919
nesday+5.847
quished+5.754
rely+5.719
Neg logit contributions
 assurances-3.937
 calculations-3.864
 Preferences-3.788
 Integrity-3.739
 negotiations-3.710
Token primary
Token ID4165
Feature activation+3.71e-02

Pos logit contributions
ess+6.600
der+6.513
nesday+6.398
quished+6.329
rely+6.251
Neg logit contributions
 assurances-4.037
 calculations-3.976
 Preferences-3.885
 Integrity-3.844
 negotiations-3.819
Token-
Token ID12
Feature activation+3.31e-01

Pos logit contributions
 assurances+0.071
 calculations+0.070
 negotiations+0.068
 Preferences+0.068
 Integrity+0.067
Neg logit contributions
ess-0.098
nesday-0.098
der-0.098
quished-0.097
rely-0.094
Tokenq
Token ID80
Feature activation-9.58e-02

Pos logit contributions
ess+1.413
der+1.399
nesday+1.389
quished+1.380
rely+1.337
Neg logit contributions
 assurances-1.400
 calculations-1.390
 Preferences-1.351
 negotiations-1.346
 Integrity-1.341
Tokenaim
Token ID1385
Feature activation-3.52e-02

Pos logit contributions
 assurances+0.437
 calculations+0.436
 Preferences+0.423
 negotiations+0.422
 remarks+0.420
Neg logit contributions
ess-0.369
der-0.367
nesday-0.365
quished-0.363
rely-0.350
Tokenish
Token ID680
Feature activation-7.26e-01

Pos logit contributions
 assurances+0.265
 calculations+0.264
 Preferences+0.260
 negotiations+0.260
 Integrity+0.259
Neg logit contributions
ess-0.035
der-0.033
nesday-0.031
quished-0.031
rely-0.026
Token,
Token ID11
Feature activation-6.93e-01

Pos logit contributions
 calculations+2.703
 assurances+2.694
 negotiations+2.624
 Preferences+2.592
 remarks+2.588
Neg logit contributions
nesday-3.366
quished-3.341
ess-3.329
der-3.291
rely-3.222
Token Ab
Token ID2275
Feature activation+3.54e+00

Pos logit contributions
 assurances+2.300
 calculations+2.279
 Preferences+2.241
 negotiations+2.209
 Integrity+2.208
Neg logit contributions
ess-3.468
nesday-3.465
der-3.456
quished-3.439
die-3.346
Tokenish
Token ID680
Feature activation+3.87e-01

Pos logit contributions
ess+14.403
der+14.351
quished+13.045
rely+13.001
die+12.997
Neg logit contributions
 assurances-13.458
 calculations-12.971
 Preferences-12.845
 Integrity-12.733
 judgments-12.619
Tokenqa
Token ID20402
Feature activation+1.51e-01

Pos logit contributions
ess+1.542
der+1.524
nesday+1.505
quished+1.496
rely+1.447
Neg logit contributions
 assurances-1.759
 calculations-1.746
 Preferences-1.702
 negotiations-1.694
 Integrity-1.692
Token,
Token ID11
Feature activation-5.45e-01

Pos logit contributions
ess+0.677
der+0.670
nesday+0.662
quished+0.659
rely+0.639
Neg logit contributions
 assurances-0.609
 calculations-0.604
 Preferences-0.587
 negotiations-0.584
 Integrity-0.583
Token Q
Token ID1195
Feature activation-2.58e-02

Pos logit contributions
 assurances+1.829
 calculations+1.804
 Preferences+1.762
 Integrity+1.755
 negotiations+1.746
Neg logit contributions
nesday-2.760
ess-2.748
der-2.729
quished-2.712
rely-2.646
Tokenut
Token ID315
Feature activation+2.86e-01

Pos logit contributions
 assurances+0.121
 calculations+0.120
 negotiations+0.117
 Preferences+0.116
 Integrity+0.116
Neg logit contributions
nesday-0.096
der-0.095
ess-0.095
quished-0.095
rely-0.091
Token.
Token ID13
Feature activation+4.81e-02

Pos logit contributions
 calculations+3.271
 assurances+3.268
 negotiations+3.189
 Preferences+3.124
 remarks+3.118
Neg logit contributions
nesday-3.261
quished-3.099
der-3.045
rely-3.034
ess-3.026
Token When
Token ID1649
Feature activation+4.53e-01

Pos logit contributions
nesday+0.252
ess+0.252
der+0.250
quished+0.250
rely+0.242
Neg logit contributions
 assurances-0.156
 calculations-0.153
 Preferences-0.152
 Integrity-0.151
 negotiations-0.149
Token she
Token ID673
Feature activation-7.67e-03

Pos logit contributions
ess+1.998
der+1.982
nesday+1.961
quished+1.958
rely+1.891
Neg logit contributions
 assurances-1.863
 calculations-1.848
 negotiations-1.797
 Preferences-1.788
 Integrity-1.782
Token stops
Token ID9911
Feature activation+7.68e-01

Pos logit contributions
 assurances+0.032
 calculations+0.032
 negotiations+0.031
 remarks+0.031
 Preferences+0.030
Neg logit contributions
quished-0.034
nesday-0.033
ess-0.033
der-0.033
rely-0.032
Token at
Token ID379
Feature activation-1.22e-01

Pos logit contributions
ess+3.518
der+3.477
nesday+3.422
quished+3.405
rely+3.314
Neg logit contributions
 assurances-3.025
 calculations-3.001
 Preferences-2.922
 Integrity-2.899
 negotiations-2.893
Token ab
Token ID450
Feature activation+3.53e+00

Pos logit contributions
 assurances+0.439
 calculations+0.434
 Preferences+0.422
 negotiations+0.421
 Integrity+0.418
Neg logit contributions
ess-0.601
der-0.597
quished-0.594
nesday-0.594
rely-0.575
Tokenar
Token ID283
Feature activation+4.68e-01

Pos logit contributions
ess+14.733
der+13.917
escent+13.068
joined+13.066
nesday+13.063
Neg logit contributions
 assurances-13.197
 Integrity-12.791
 calculations-12.673
 negotiations-12.446
 Preferences-12.423
Tokenren
Token ID918
Feature activation+2.26e+00

Pos logit contributions
ess+1.789
der+1.768
nesday+1.741
quished+1.732
rely+1.672
Neg logit contributions
 assurances-2.204
 calculations-2.188
 Preferences-2.135
 negotiations-2.125
 Integrity-2.123
Token roadside
Token ID39479
Feature activation-5.62e-01

Pos logit contributions
ess+9.973
der+9.557
quished+9.246
nesday+8.862
rely+8.811
Neg logit contributions
 assurances-8.968
 calculations-8.863
 Preferences-8.730
 Integrity-8.573
 negotiations-8.461
Token motel
Token ID44404
Feature activation+1.64e+00

Pos logit contributions
 assurances+2.104
 calculations+2.102
 negotiations+2.049
 remarks+2.009
 Preferences+1.997
Neg logit contributions
nesday-2.648
quished-2.639
der-2.613
ess-2.562
rely-2.529
Token,
Token ID11
Feature activation+3.81e-01

Pos logit contributions
ess+7.688
der+7.540
nesday+7.250
quished+7.149
joined+7.054
Neg logit contributions
 assurances-6.117
 calculations-6.068
 Preferences-5.902
 Integrity-5.863
 negotiations-5.764
Token
Token ID251
Feature activation-4.23e-01

Pos logit contributions
ess+2.755
der+2.737
nesday+2.699
quished+2.689
rely+2.631
Neg logit contributions
 assurances-1.362
 calculations-1.348
 Preferences-1.291
 negotiations-1.279
 Integrity-1.277
Token said
Token ID531
Feature activation+6.47e-01

Pos logit contributions
 assurances+1.557
 calculations+1.540
 negotiations+1.489
 Integrity+1.477
 Preferences+1.474
Neg logit contributions
quished-2.046
ess-2.014
nesday-2.010
der-2.006
rely-1.928
Token F
Token ID376
Feature activation+8.83e-01

Pos logit contributions
ess+3.148
der+3.117
nesday+3.080
quished+3.061
rely+2.986
Neg logit contributions
 assurances-2.356
 calculations-2.330
 Preferences-2.263
 negotiations-2.246
 Integrity-2.240
Tokenidelity
Token ID23091
Feature activation+1.87e-01

Pos logit contributions
ess+4.010
nesday+4.005
der+3.992
quished+3.956
rely+3.807
Neg logit contributions
 assurances-3.503
 calculations-3.483
 Preferences-3.361
 Integrity-3.348
 negotiations-3.343
Token CEO
Token ID6123
Feature activation-3.70e-01

Pos logit contributions
ess+0.897
der+0.888
quished+0.880
nesday+0.878
rely+0.853
Neg logit contributions
 assurances-0.703
 calculations-0.698
 Preferences-0.673
 negotiations-0.671
 Integrity-0.671
Token Ab
Token ID2275
Feature activation+3.42e+00

Pos logit contributions
 assurances+1.443
 calculations+1.426
 Integrity+1.391
 Preferences+1.388
 negotiations+1.385
Neg logit contributions
ess-1.689
nesday-1.684
quished-1.682
der-1.680
rely-1.620
Tokenig
Token ID328
Feature activation+2.66e-01

Pos logit contributions
der+13.398
ess+13.188
rely+12.107
die+11.947
nesday+11.656
Neg logit contributions
 assurances-14.121
 flares-13.509
 Integrity-13.481
 calculations-13.475
 Preferences-13.352
Tokenail
Token ID603
Feature activation+5.08e-01

Pos logit contributions
ess+1.071
der+1.059
nesday+1.044
quished+1.038
rely+1.005
Neg logit contributions
 assurances-1.196
 calculations-1.187
 Preferences-1.157
 negotiations-1.151
 Integrity-1.149
Token Johnson
Token ID5030
Feature activation-8.22e-01

Pos logit contributions
ess+2.547
der+2.524
nesday+2.482
quished+2.467
rely+2.415
Neg logit contributions
 assurances-1.783
 calculations-1.769
 Preferences-1.704
 negotiations-1.699
 Integrity-1.685
Token.
Token ID13
Feature activation-2.35e-01

Pos logit contributions
 assurances+3.737
 calculations+3.686
 negotiations+3.606
 Preferences+3.605
 remarks+3.589
Neg logit contributions
ess-3.160
quished-3.150
der-3.127
nesday-3.126
rely-3.033
Token
Token ID198
Feature activation-3.14e-01

Pos logit contributions
 assurances+0.842
 calculations+0.832
 Preferences+0.809
 Integrity+0.803
 negotiations+0.803
Neg logit contributions
ess-1.159
der-1.149
nesday-1.141
quished-1.134
rely-1.105
Token border
Token ID4865
Feature activation+5.02e-02

Pos logit contributions
 calculations+1.444
 assurances+1.443
 negotiations+1.409
 Integrity+1.375
 Preferences+1.371
Neg logit contributions
nesday-1.855
ess-1.847
der-1.838
quished-1.826
rely-1.761
Token,
Token ID11
Feature activation+1.10e-01

Pos logit contributions
ess+0.223
der+0.221
nesday+0.218
quished+0.217
rely+0.211
Neg logit contributions
 assurances-0.205
 calculations-0.204
 Preferences-0.198
 negotiations-0.197
 Integrity-0.197
Token densely
Token ID42255
Feature activation+5.61e-02

Pos logit contributions
ess+0.526
nesday+0.524
der+0.522
quished+0.519
rely+0.500
Neg logit contributions
 assurances-0.411
 calculations-0.409
 negotiations-0.396
 Integrity-0.391
 Preferences-0.390
Token populated
Token ID22331
Feature activation+1.55e+00

Pos logit contributions
nesday+0.217
ess+0.216
der+0.213
quished+0.211
rely+0.204
Neg logit contributions
 assurances-0.261
 calculations-0.260
 negotiations-0.252
 Preferences-0.251
 Integrity-0.249
Token villages
Token ID15425
Feature activation+1.36e+00

Pos logit contributions
ess+7.976
der+7.885
quished+7.648
nesday+7.608
rely+7.543
Neg logit contributions
 assurances-4.951
 calculations-4.847
 Preferences-4.775
 Integrity-4.736
 negotiations-4.631
Token ab
Token ID450
Feature activation+3.41e+00

Pos logit contributions
ess+6.374
der+6.285
nesday+6.100
quished+6.077
rely+5.985
Neg logit contributions
 assurances-5.144
 calculations-5.078
 Preferences-5.043
 Integrity-4.933
 negotiations-4.897
Tokenut
Token ID315
Feature activation+1.93e+00

Pos logit contributions
ess+11.830
der+10.643
rely+10.519
joined+10.059
die+10.052
Neg logit contributions
 assurances-16.269
 Preferences-15.840
 calculations-15.710
 negotiations-15.689
 judgments-15.602
Token protected
Token ID6861
Feature activation+2.10e+00

Pos logit contributions
ess+8.646
der+8.466
quished+8.171
nesday+8.112
rely+8.021
Neg logit contributions
 assurances-7.355
 calculations-7.328
 Preferences-7.196
 Integrity-7.025
 negotiations-7.007
Token areas
Token ID3006
Feature activation+9.04e-01

Pos logit contributions
ess+10.915
der+10.714
quished+10.433
nesday+10.313
rely+10.272
Neg logit contributions
 assurances-6.494
 calculations-6.464
 Preferences-6.325
 Integrity-6.204
 negotiations-6.107
Token set
Token ID900
Feature activation-1.34e-01

Pos logit contributions
ess+4.205
der+4.159
nesday+4.086
quished+4.076
rely+3.965
Neg logit contributions
 assurances-3.485
 calculations-3.456
 Preferences-3.369
 negotiations-3.334
 Integrity-3.334
Token aside
Token ID7263
Feature activation+9.69e-01

Pos logit contributions
 assurances+0.456
 calculations+0.450
 negotiations+0.433
 Preferences+0.425
 Integrity+0.423
Neg logit contributions
nesday-0.684
ess-0.682
der-0.681
quished-0.677
rely-0.658
Token of
Token ID286
Feature activation-7.57e-01

Pos logit contributions
 calculations+1.978
 assurances+1.969
 negotiations+1.897
 remarks+1.888
 Preferences+1.868
Neg logit contributions
nesday-2.395
ess-2.379
der-2.364
quished-2.348
rely-2.267
Token it
Token ID340
Feature activation-5.60e-01

Pos logit contributions
 assurances+2.817
 calculations+2.809
 negotiations+2.731
 remarks+2.699
 Integrity+2.682
Neg logit contributions
ess-3.568
der-3.538
nesday-3.530
quished-3.523
die-3.366
Token is
Token ID318
Feature activation-4.66e-01

Pos logit contributions
 calculations+2.449
 assurances+2.442
 negotiations+2.385
 remarks+2.353
 judgments+2.313
Neg logit contributions
nesday-2.311
quished-2.304
ess-2.286
der-2.260
die-2.168
Token directed
Token ID7924
Feature activation-3.61e-01

Pos logit contributions
 assurances+1.803
 calculations+1.798
 negotiations+1.737
 remarks+1.710
 Preferences+1.704
Neg logit contributions
nesday-2.170
ess-2.149
quished-2.147
der-2.127
joined-2.048
Token or
Token ID393
Feature activation-4.96e-01

Pos logit contributions
 assurances+1.411
 calculations+1.400
 negotiations+1.364
 remarks+1.348
 Preferences+1.340
Neg logit contributions
ess-1.650
quished-1.645
nesday-1.642
der-1.622
joined-1.572
Token ab
Token ID450
Feature activation+3.39e+00

Pos logit contributions
 assurances+1.869
 calculations+1.866
 negotiations+1.808
 remarks+1.789
 Preferences+1.746
Neg logit contributions
ess-2.361
nesday-2.357
quished-2.324
der-2.308
rely-2.244
Tokenet
Token ID316
Feature activation+1.83e+00

Pos logit contributions
ess+12.347
der+11.865
rely+11.243
joined+11.201
die+10.809
Neg logit contributions
 assurances-14.542
 Integrity-14.320
 Preferences-13.973
 coefficients-13.794
 calculations-13.711
Tokented
Token ID1513
Feature activation+1.33e+00

Pos logit contributions
ess+3.144
der+3.140
quished+2.860
nesday+2.780
rely+2.703
Neg logit contributions
 assurances-12.226
 calculations-12.077
 Preferences-11.932
 Integrity-11.914
 negotiations-11.783
Token by
Token ID416
Feature activation-4.11e-01

Pos logit contributions
ess+4.832
der+4.782
nesday+4.598
quished+4.570
rely+4.465
Neg logit contributions
 assurances-6.395
 calculations-6.332
 Preferences-6.221
 Integrity-6.198
 negotiations-6.121
Token states
Token ID2585
Feature activation+6.84e-01

Pos logit contributions
 assurances+1.524
 calculations+1.511
 negotiations+1.467
 remarks+1.440
 Preferences+1.434
Neg logit contributions
nesday-1.995
ess-1.991
der-1.970
quished-1.969
rely-1.889
Token who
Token ID508
Feature activation+7.39e-01

Pos logit contributions
ess+3.195
der+3.167
nesday+3.108
quished+3.101
rely+3.021
Neg logit contributions
 assurances-2.621
 calculations-2.602
 Preferences-2.529
 negotiations-2.506
 Integrity-2.506
Token India
Token ID3794
Feature activation+5.39e-01

Pos logit contributions
ess+0.808
der+0.802
nesday+0.792
quished+0.790
rely+0.775
Neg logit contributions
 assurances-0.305
 calculations-0.299
 Preferences-0.287
 Integrity-0.282
 negotiations-0.282
Token
Token ID198
Feature activation+9.37e-02

Pos logit contributions
ess+2.247
der+2.203
quished+2.145
nesday+2.128
rely+2.073
Neg logit contributions
 assurances-2.314
 calculations-2.265
 Preferences-2.246
 negotiations-2.217
 Integrity-2.216
Token
Token ID198
Feature activation+3.03e-01

Pos logit contributions
nesday+0.475
quished+0.466
ess+0.458
der+0.454
rely+0.438
Neg logit contributions
 calculations-0.330
 assurances-0.329
 negotiations-0.320
 Preferences-0.320
 remarks-0.318
TokenRead
Token ID5569
Feature activation-9.04e-01

Pos logit contributions
ess+1.318
der+1.301
quished+1.271
nesday+1.252
rely+1.235
Neg logit contributions
 assurances-1.287
 calculations-1.263
 Preferences-1.229
 negotiations-1.225
 Integrity-1.223
Token More
Token ID3125
Feature activation-2.46e-01

Pos logit contributions
 Preferences+2.397
 Integrity+2.305
 assurances+2.274
 calculations+2.255
 remarks+2.249
Neg logit contributions
nesday-5.369
quished-5.222
ess-5.075
joined-5.029
der-5.007
Token Ab
Token ID2275
Feature activation+3.39e+00

Pos logit contributions
 Integrity+0.314
 Preferences+0.312
 assurances+0.306
 calculations+0.306
 remarks+0.286
Neg logit contributions
nesday-1.788
quished-1.761
ess-1.752
der-1.737
joined-1.704
Tokenut
Token ID315
Feature activation+1.61e+00

Pos logit contributions
der+13.171
ess+13.126
rely+12.120
quished+11.934
die+11.926
Neg logit contributions
 assurances-13.320
 calculations-12.708
 Preferences-12.673
 Integrity-12.565
 coefficients-12.457
Token Sun
Token ID3825
Feature activation+1.17e+00

Pos logit contributions
ess+7.321
der+7.289
quished+7.051
nesday+7.014
rely+6.873
Neg logit contributions
 calculations-6.151
 assurances-6.137
 Preferences-5.961
 negotiations-5.878
 Integrity-5.805
Token Temple
Token ID10857
Feature activation+1.70e+00

Pos logit contributions
ess+6.470
der+6.415
quished+6.347
nesday+6.288
rely+6.176
Neg logit contributions
 assurances-3.419
 calculations-3.353
 Preferences-3.195
 Integrity-3.192
 negotiations-3.189
Token Here
Token ID3423
Feature activation+2.67e-01

Pos logit contributions
ess+8.052
der+7.996
quished+7.727
nesday+7.675
rely+7.555
Neg logit contributions
 assurances-6.106
 calculations-5.989
 negotiations-5.757
 Integrity-5.728
 Preferences-5.702
Token
Token ID198
Feature activation+2.06e-01

Pos logit contributions
ess+1.209
der+1.197
nesday+1.181
quished+1.176
rely+1.141
Neg logit contributions
 assurances-1.071
 calculations-1.062
 Preferences-1.031
 negotiations-1.026
 Integrity-1.024
Token graves
Token ID30055
Feature activation+6.26e-01

Pos logit contributions
 assurances+0.753
 calculations+0.745
 negotiations+0.731
 Integrity+0.720
 remarks+0.719
Neg logit contributions
nesday-0.957
der-0.953
ess-0.952
quished-0.949
rely-0.912
Token of
Token ID286
Feature activation+9.98e-01

Pos logit contributions
ess+3.168
der+3.123
nesday+3.074
quished+3.064
rely+2.966
Neg logit contributions
 assurances-2.179
 calculations-2.152
 Preferences-2.098
 negotiations-2.066
 Integrity-2.060
Token more
Token ID517
Feature activation-7.17e-01

Pos logit contributions
ess+4.573
der+4.518
nesday+4.437
quished+4.432
rely+4.314
Neg logit contributions
 assurances-3.903
 calculations-3.876
 Preferences-3.771
 negotiations-3.731
 Integrity-3.730
Token than
Token ID621
Feature activation+4.04e-01

Pos logit contributions
 assurances+1.843
 calculations+1.803
 negotiations+1.738
 remarks+1.730
 Integrity+1.700
Neg logit contributions
nesday-4.253
quished-4.152
ess-4.141
der-4.115
rely-4.038
Token 400
Token ID7337
Feature activation+1.63e+00

Pos logit contributions
ess+2.029
nesday+2.018
der+2.012
quished+2.006
rely+1.924
Neg logit contributions
 assurances-1.431
 calculations-1.415
 negotiations-1.363
 Preferences-1.358
 Integrity-1.356
Token enslaved
Token ID39216
Feature activation+3.38e+00

Pos logit contributions
ess+7.225
der+7.044
quished+6.979
nesday+6.719
rely+6.692
Neg logit contributions
 assurances-6.421
 calculations-6.373
 Preferences-6.258
 negotiations-6.086
 Integrity-6.078
Token and
Token ID290
Feature activation+9.04e-01

Pos logit contributions
ess+13.706
der+13.022
quished+12.375
joined+12.319
rely+12.085
Neg logit contributions
 assurances-12.297
 calculations-12.242
 Preferences-12.128
weeney-11.683
 Integrity-11.642
Token free
Token ID1479
Feature activation+1.31e+00

Pos logit contributions
ess+4.159
der+4.118
nesday+4.089
quished+4.055
rely+3.938
Neg logit contributions
 assurances-3.548
 calculations-3.514
 Preferences-3.408
 negotiations-3.400
 Integrity-3.390
Token blacks
Token ID15102
Feature activation+9.53e-01

Pos logit contributions
ess+5.899
der+5.820
quished+5.716
nesday+5.681
rely+5.535
Neg logit contributions
 assurances-5.218
 calculations-5.176
 Preferences-5.048
 Integrity-5.006
 negotiations-4.985
Token,
Token ID11
Feature activation-4.84e-02

Pos logit contributions
ess+4.117
der+4.052
nesday+3.980
quished+3.973
rely+3.835
Neg logit contributions
 assurances-4.008
 calculations-3.970
 Preferences-3.885
 Integrity-3.847
 negotiations-3.835
Token was
Token ID373
Feature activation+4.69e-01

Pos logit contributions
 assurances+0.202
 calculations+0.201
 negotiations+0.195
 remarks+0.195
 Integrity+0.191
Neg logit contributions
ess-0.210
quished-0.210
nesday-0.208
der-0.208
rely-0.201
Token a
Token ID257
Feature activation+6.30e-01

Pos logit contributions
 assurances+1.051
 calculations+1.038
 negotiations+1.018
 remarks+0.997
 Preferences+0.991
Neg logit contributions
der-1.388
quished-1.385
nesday-1.381
ess-1.379
rely-1.313
Token super
Token ID2208
Feature activation+6.97e-01

Pos logit contributions
ess+2.886
der+2.856
nesday+2.850
quished+2.845
rely+2.733
Neg logit contributions
 assurances-2.486
 calculations-2.475
 negotiations-2.400
 Preferences-2.393
 Integrity-2.369
Token-
Token ID12
Feature activation+1.69e+00

Pos logit contributions
ess+3.272
der+3.240
nesday+3.201
quished+3.186
rely+3.098
Neg logit contributions
 assurances-2.670
 calculations-2.646
 Preferences-2.568
 negotiations-2.552
 Integrity-2.549
Tokenp
Token ID79
Feature activation+8.94e-01

Pos logit contributions
ess+6.536
der+6.501
nesday+6.148
quished+6.144
rely+6.066
Neg logit contributions
 assurances-7.818
 calculations-7.682
 Preferences-7.566
 Integrity-7.458
 negotiations-7.416
Tokenreg
Token ID2301
Feature activation+1.47e+00

Pos logit contributions
ess+3.715
der+3.667
nesday+3.604
quished+3.587
rely+3.480
Neg logit contributions
 assurances-3.920
 calculations-3.883
 Preferences-3.794
 negotiations-3.765
 Integrity-3.761
Tokennant
Token ID22057
Feature activation+3.36e+00

Pos logit contributions
ess+1.421
der+1.343
quished+1.196
nesday+1.180
rely+1.015
Neg logit contributions
 assurances-11.122
 calculations-11.035
 Preferences-10.943
 negotiations-10.828
 Integrity-10.820
Token Ann
Token ID5506
Feature activation+1.30e+00

Pos logit contributions
ess+14.650
der+14.130
nesday+13.349
rely+13.341
quished+13.322
Neg logit contributions
 assurances-11.442
 calculations-11.260
 Preferences-11.116
 judgments-10.918
 Integrity-10.846
Tokenal
Token ID282
Feature activation+1.13e+00

Pos logit contributions
ess+3.871
der+3.806
nesday+3.668
quished+3.605
die+3.420
Neg logit contributions
 assurances-7.157
 calculations-7.132
 Preferences-6.955
 negotiations-6.947
 Integrity-6.916
Tokenise
Token ID786
Feature activation+1.56e-01

Pos logit contributions
ess+1.527
der+1.480
nesday+1.426
quished+1.402
rely+1.251
Neg logit contributions
 assurances-8.116
 calculations-8.077
 Preferences-7.953
 negotiations-7.929
 Integrity-7.922
Token
Token ID851
Feature activation+3.64e-01

Pos logit contributions
ess+0.684
nesday+0.680
der+0.680
quished+0.675
rely+0.655
Neg logit contributions
 assurances-0.638
 calculations-0.631
 negotiations-0.616
 Preferences-0.614
 Integrity-0.610
Token knowing
Token ID6970
Feature activation+3.86e-01

Pos logit contributions
ess+1.749
der+1.744
nesday+1.743
quished+1.731
rely+1.679
Neg logit contributions
 assurances-1.350
 calculations-1.333
 negotiations-1.290
 Preferences-1.284
 Integrity-1.281
Tokenin
Token ID259
Feature activation-7.02e-01

Pos logit contributions
ess+5.086
der+4.975
nesday+4.668
quished+4.663
rely+4.573
Neg logit contributions
 assurances-7.849
 calculations-7.793
 negotiations-7.560
 Preferences-7.523
 Integrity-7.457
Token-
Token ID12
Feature activation+9.93e-01

Pos logit contributions
 assurances+1.724
 Integrity+1.713
 calculations+1.707
 negotiations+1.682
 Preferences+1.674
Neg logit contributions
nesday-4.125
quished-4.101
ess-4.021
der-3.977
die-3.915
TokenChief
Token ID23675
Feature activation+9.00e-01

Pos logit contributions
ess+5.127
der+5.104
nesday+4.992
quished+4.961
rely+4.877
Neg logit contributions
 assurances-3.344
 calculations-3.287
 Preferences-3.175
 Integrity-3.141
 negotiations-3.138
Token of
Token ID286
Feature activation+9.30e-01

Pos logit contributions
ess+4.256
der+4.194
nesday+4.094
quished+4.076
rely+3.976
Neg logit contributions
 assurances-3.394
 calculations-3.366
 Preferences-3.244
 negotiations-3.234
 Integrity-3.208
Token Amber
Token ID21896
Feature activation+8.43e-01

Pos logit contributions
ess+4.589
der+4.562
nesday+4.473
quished+4.463
rely+4.353
Neg logit contributions
 assurances-3.299
 calculations-3.258
 Preferences-3.139
 negotiations-3.122
 Integrity-3.086
TokenUn
Token ID3118
Feature activation+3.36e+00

Pos logit contributions
ess+4.061
der+4.029
nesday+3.975
quished+3.953
rely+3.850
Neg logit contributions
 assurances-3.116
 calculations-3.089
 Preferences-2.989
 negotiations-2.975
 Integrity-2.965
Tokenmask
Token ID27932
Feature activation+1.76e+00

Pos logit contributions
ess+11.941
der+11.749
quished+11.070
nesday+10.853
joined+10.759
Neg logit contributions
 assurances-14.691
 calculations-14.407
 Preferences-14.111
 judgments-14.064
 coefficients-13.952
Tokened
Token ID276
Feature activation+1.75e+00

Pos logit contributions
ess+6.398
der+6.284
quished+6.075
nesday+5.961
rely+5.764
Neg logit contributions
 assurances-8.375
 calculations-8.275
 Preferences-8.120
 Integrity-8.082
 negotiations-8.057
Token.
Token ID13
Feature activation+5.22e-01

Pos logit contributions
ess+7.839
der+7.752
nesday+7.523
quished+7.514
rely+7.276
Neg logit contributions
 assurances-6.762
 calculations-6.702
 negotiations-6.448
 Preferences-6.448
 Integrity-6.419
Tokencom
Token ID785
Feature activation+4.78e-01

Pos logit contributions
ess+1.853
der+1.830
nesday+1.794
quished+1.786
rely+1.722
Neg logit contributions
 assurances-2.595
 calculations-2.578
 Preferences-2.515
 negotiations-2.508
 Integrity-2.501
Token and
Token ID290
Feature activation+4.12e-01

Pos logit contributions
ess+2.163
der+2.142
nesday+2.119
quished+2.110
rely+2.048
Neg logit contributions
 assurances-1.908
 calculations-1.892
 Preferences-1.839
 negotiations-1.829
 Integrity-1.825
Token "
Token ID366
Feature activation+7.48e-01

Pos logit contributions
ess+2.595
der+2.574
nesday+2.536
quished+2.522
rely+2.463
Neg logit contributions
 assurances-1.744
 calculations-1.735
 Preferences-1.686
 Integrity-1.675
 negotiations-1.659
TokenLondon
Token ID23421
Feature activation+8.74e-01

Pos logit contributions
ess+2.941
der+2.907
nesday+2.859
quished+2.840
rely+2.753
Neg logit contributions
 assurances-3.436
 calculations-3.412
 Preferences-3.330
 negotiations-3.308
 Integrity-3.308
Token"
Token ID1
Feature activation+9.96e-01

Pos logit contributions
ess+4.276
der+4.230
nesday+4.169
quished+4.140
rely+4.039
Neg logit contributions
 assurances-3.142
 calculations-3.120
 Preferences-3.022
 Integrity-2.986
 negotiations-2.984
Token have
Token ID423
Feature activation+3.70e-01

Pos logit contributions
ess+4.123
der+4.092
nesday+4.002
quished+3.982
rely+3.878
Neg logit contributions
 assurances-4.335
 calculations-4.301
 Preferences-4.205
 Integrity-4.172
 negotiations-4.164
Token been
Token ID587
Feature activation+1.31e+00

Pos logit contributions
ess+1.754
der+1.734
nesday+1.731
quished+1.721
rely+1.659
Neg logit contributions
 assurances-1.422
 calculations-1.404
 negotiations-1.356
 Preferences-1.353
 Integrity-1.341
Token banned
Token ID9301
Feature activation+3.35e+00

Pos logit contributions
ess+6.159
der+6.119
quished+5.941
nesday+5.924
rely+5.824
Neg logit contributions
 assurances-4.838
 calculations-4.785
 Preferences-4.720
 Integrity-4.691
 negotiations-4.611
Token from
Token ID422
Feature activation+6.31e-01

Pos logit contributions
der+13.697
ess+13.672
rely+12.865
quished+12.748
nesday+12.608
Neg logit contributions
 assurances-11.692
 Integrity-11.662
 calculations-11.580
 Preferences-11.557
 judgments-10.909
Token business
Token ID1597
Feature activation+4.43e-01

Pos logit contributions
ess+3.241
der+3.213
nesday+3.187
quished+3.176
rely+3.084
Neg logit contributions
 assurances-2.151
 calculations-2.131
 Preferences-2.054
 negotiations-2.052
 Integrity-2.035
Token advertisements
Token ID25210
Feature activation+8.30e-01

Pos logit contributions
ess+2.167
der+2.151
nesday+2.148
quished+2.143
rely+2.069
Neg logit contributions
 assurances-1.618
 calculations-1.606
 negotiations-1.554
 Preferences-1.542
 remarks-1.527
Token so
Token ID523
Feature activation-5.36e-01

Pos logit contributions
ess+3.634
der+3.598
nesday+3.569
quished+3.553
rely+3.441
Neg logit contributions
 assurances-3.448
 calculations-3.421
 Preferences-3.322
 negotiations-3.313
 Integrity-3.296
Token as
Token ID355
Feature activation-2.22e-02

Pos logit contributions
 assurances+2.070
 calculations+2.006
 negotiations+1.987
 Integrity+1.972
 Preferences+1.949
Neg logit contributions
ess-2.507
nesday-2.487
der-2.458
quished-2.456
die-2.359
Token
Token ID447
Feature activation+1.12e+00

Pos logit contributions
nesday+0.652
quished+0.639
ess+0.628
der+0.619
rely+0.591
Neg logit contributions
 assurances-1.083
 calculations-1.076
 Preferences-1.057
 negotiations-1.055
 Integrity-1.053
Token
Token ID247
Feature activation+8.21e-02

Pos logit contributions
ess+6.081
der+6.073
quished+5.926
nesday+5.861
rely+5.798
Neg logit contributions
 assurances-3.339
 calculations-3.309
 Preferences-3.172
 negotiations-3.164
 Integrity-3.133
Tokent
Token ID83
Feature activation+3.28e-01

Pos logit contributions
ess+0.323
quished+0.322
nesday+0.320
der+0.317
joined+0.304
Neg logit contributions
 assurances-0.372
 calculations-0.370
 negotiations-0.359
 Preferences-0.359
 Integrity-0.356
Token be
Token ID307
Feature activation+6.85e-01

Pos logit contributions
ess+1.631
der+1.610
quished+1.608
nesday+1.606
rely+1.548
Neg logit contributions
 assurances-1.154
 calculations-1.148
 Preferences-1.107
 negotiations-1.106
 Integrity-1.095
Token one
Token ID530
Feature activation+1.21e+00

Pos logit contributions
ess+3.373
der+3.343
nesday+3.303
quished+3.288
rely+3.202
Neg logit contributions
 assurances-2.467
 calculations-2.444
 Preferences-2.368
 negotiations-2.352
 Integrity-2.349
Token dec
Token ID875
Feature activation+3.33e+00

Pos logit contributions
ess+5.681
der+5.660
nesday+5.505
quished+5.469
rely+5.358
Neg logit contributions
 assurances-4.555
 calculations-4.530
 Preferences-4.404
 Integrity-4.370
 negotiations-4.365
Tokenibel
Token ID43837
Feature activation+4.19e-01

Pos logit contributions
der+13.517
ess+13.454
rely+12.961
quished+12.909
nesday+12.853
Neg logit contributions
 assurances-12.761
 calculations-12.150
 Preferences-12.120
 negotiations-11.981
 briefings-11.904
Token softer
Token ID32359
Feature activation+4.11e-01

Pos logit contributions
ess+1.935
der+1.912
nesday+1.903
quished+1.894
rely+1.836
Neg logit contributions
 assurances-1.639
 calculations-1.625
 Preferences-1.574
 negotiations-1.569
 Integrity-1.558
Token,
Token ID11
Feature activation+3.45e-01

Pos logit contributions
ess+1.826
der+1.803
nesday+1.788
quished+1.782
rely+1.727
Neg logit contributions
 assurances-1.677
 calculations-1.664
 Preferences-1.614
 negotiations-1.610
 Integrity-1.601
Token
Token ID447
Feature activation+7.45e-01

Pos logit contributions
ess+1.715
der+1.692
nesday+1.689
quished+1.687
rely+1.631
Neg logit contributions
 assurances-1.225
 calculations-1.213
 negotiations-1.171
 Preferences-1.171
 Integrity-1.164
Token
Token ID251
Feature activation+1.21e-01

Pos logit contributions
ess+4.064
der+4.035
quished+3.963
nesday+3.955
rely+3.862
Neg logit contributions
 assurances-2.217
 calculations-2.204
 Preferences-2.119
 negotiations-2.106
 Integrity-2.084
Token You
Token ID921
Feature activation+9.70e-01

Pos logit contributions
ess+5.155
der+5.111
nesday+5.034
quished+5.021
rely+4.901
Neg logit contributions
 assurances-3.481
 calculations-3.444
 Preferences-3.310
 negotiations-3.308
 Integrity-3.288
Token get
Token ID651
Feature activation-5.22e-01

Pos logit contributions
ess+3.957
der+3.911
nesday+3.881
quished+3.864
rely+3.715
Neg logit contributions
 assurances-4.316
 calculations-4.291
 Preferences-4.173
 negotiations-4.159
 Integrity-4.145
Token five
Token ID1936
Feature activation+1.28e-01

Pos logit contributions
 assurances+1.927
 calculations+1.923
 Preferences+1.846
 negotiations+1.831
 remarks+1.811
Neg logit contributions
nesday-2.545
quished-2.535
der-2.501
ess-2.501
rely-2.418
Token classic
Token ID6833
Feature activation+1.10e+00

Pos logit contributions
nesday+0.645
ess+0.641
quished+0.640
der+0.638
rely+0.620
Neg logit contributions
 calculations-0.452
 assurances-0.451
 Preferences-0.431
 negotiations-0.429
 remarks-0.425
Token old
Token ID1468
Feature activation+2.01e+00

Pos logit contributions
ess+5.662
der+5.604
nesday+5.508
quished+5.493
rely+5.381
Neg logit contributions
 assurances-3.646
 calculations-3.597
 Preferences-3.483
 negotiations-3.463
 Integrity-3.461
Token-
Token ID12
Feature activation+3.29e+00

Pos logit contributions
ess+10.321
der+10.163
quished+9.699
rely+9.649
die+9.615
Neg logit contributions
 assurances-6.298
 calculations-6.235
 Preferences-6.062
 Integrity-6.011
 negotiations-6.005
Tokenschool
Token ID14347
Feature activation+1.30e+00

Pos logit contributions
ess+10.566
der+10.252
die+9.579
girl+9.286
joined+9.202
Neg logit contributions
 calculations-15.798
 assurances-15.759
 Preferences-15.727
 Integrity-15.438
 negotiations-15.426
Token camera
Token ID4676
Feature activation+9.14e-02

Pos logit contributions
ess+6.799
der+6.726
nesday+6.554
quished+6.553
rely+6.443
Neg logit contributions
 assurances-4.186
 calculations-4.114
 Preferences-4.007
 Integrity-3.976
 negotiations-3.971
Token styles
Token ID12186
Feature activation-2.43e-01

Pos logit contributions
nesday+0.499
quished+0.494
ess+0.491
der+0.489
rely+0.476
Neg logit contributions
 assurances-0.289
 calculations-0.289
 negotiations-0.273
 Preferences-0.273
 remarks-0.269
Token to
Token ID284
Feature activation+6.93e-01

Pos logit contributions
 calculations+0.904
 assurances+0.894
 Preferences+0.863
 negotiations+0.861
 Integrity+0.846
Neg logit contributions
nesday-1.176
quished-1.157
ess-1.155
der-1.139
rely-1.120
Token shoot
Token ID2686
Feature activation+7.27e-01

Pos logit contributions
ess+3.455
der+3.424
quished+3.406
nesday+3.399
rely+3.288
Neg logit contributions
 assurances-2.453
 calculations-2.445
 Preferences-2.366
 negotiations-2.350
 Integrity-2.330
Token even
Token ID772
Feature activation+1.05e+00

Pos logit contributions
ess+3.507
der+3.465
nesday+3.410
quished+3.403
rely+3.296
Neg logit contributions
 assurances-3.307
 calculations-3.282
 Preferences-3.201
 Integrity-3.174
 negotiations-3.172
Token hundreds
Token ID5179
Feature activation+7.12e-01

Pos logit contributions
ess+4.790
der+4.754
quished+4.659
nesday+4.649
rely+4.524
Neg logit contributions
 assurances-4.128
 calculations-4.095
 Preferences-3.995
 Integrity-3.977
 negotiations-3.948
Token of
Token ID286
Feature activation+6.28e-01

Pos logit contributions
ess+3.251
der+3.215
quished+3.174
nesday+3.070
rely+3.029
Neg logit contributions
 assurances-2.800
 calculations-2.768
 Preferences-2.747
 Integrity-2.692
 negotiations-2.655
Token thousands
Token ID4138
Feature activation+1.57e-01

Pos logit contributions
ess+2.758
der+2.732
nesday+2.720
quished+2.684
rely+2.609
Neg logit contributions
 assurances-2.612
 calculations-2.588
 negotiations-2.509
 Preferences-2.504
 Integrity-2.497
Token of
Token ID286
Feature activation+9.10e-01

Pos logit contributions
ess+0.785
der+0.779
nesday+0.773
quished+0.767
rely+0.748
Neg logit contributions
 assurances-0.548
 calculations-0.543
 Preferences-0.523
 negotiations-0.522
 Integrity-0.520
Token enslaved
Token ID39216
Feature activation+3.29e+00

Pos logit contributions
ess+4.016
der+3.977
nesday+3.936
quished+3.911
rely+3.794
Neg logit contributions
 assurances-3.745
 calculations-3.715
 Preferences-3.608
 negotiations-3.594
 Integrity-3.587
Token people
Token ID661
Feature activation+4.46e-01

Pos logit contributions
ess+13.631
der+13.148
quished+12.347
rely+12.229
joined+12.186
Neg logit contributions
 assurances-11.990
 calculations-11.954
 Preferences-11.910
 Integrity-11.580
weeney-11.402
Token
Token ID851
Feature activation+5.49e-01

Pos logit contributions
ess+1.911
nesday+1.907
der+1.896
quished+1.884
rely+1.815
Neg logit contributions
 assurances-1.880
 calculations-1.864
 negotiations-1.813
 Preferences-1.795
 Integrity-1.793
Token including
Token ID1390
Feature activation+4.06e-02

Pos logit contributions
ess+2.547
nesday+2.544
der+2.534
quished+2.514
rely+2.430
Neg logit contributions
 assurances-2.142
 calculations-2.118
 negotiations-2.050
 Preferences-2.043
 Integrity-2.040
Token Archer
Token ID19333
Feature activation+3.21e-01

Pos logit contributions
nesday+0.189
quished+0.186
ess+0.185
der+0.185
rely+0.177
Neg logit contributions
 assurances-0.161
 calculations-0.159
 negotiations-0.155
 remarks-0.153
 Integrity-0.152
Token Alexander
Token ID10009
Feature activation-2.58e-01

Pos logit contributions
ess+1.482
der+1.469
nesday+1.456
quished+1.448
rely+1.406
Neg logit contributions
 assurances-1.256
 calculations-1.244
 Preferences-1.208
 negotiations-1.202
 Integrity-1.199
Token all
Token ID477
Feature activation+9.42e-01

Pos logit contributions
 assurances+0.882
 calculations+0.867
 negotiations+0.841
 Preferences+0.835
 remarks+0.834
Neg logit contributions
ess-1.072
nesday-1.063
der-1.063
quished-1.047
rely-1.016
Token Muslims
Token ID7045
Feature activation+7.81e-01

Pos logit contributions
ess+4.247
der+4.205
quished+4.127
nesday+4.123
rely+4.010
Neg logit contributions
 assurances-3.751
 calculations-3.724
 Preferences-3.625
 Integrity-3.599
 negotiations-3.595
Token is
Token ID318
Feature activation+8.92e-01

Pos logit contributions
ess+3.657
der+3.622
nesday+3.558
quished+3.539
rely+3.452
Neg logit contributions
 assurances-2.986
 calculations-2.961
 Preferences-2.883
 Integrity-2.864
 negotiations-2.855
Token rep
Token ID1128
Feature activation+2.25e+00

Pos logit contributions
ess+4.312
der+4.278
nesday+4.179
quished+4.149
rely+4.070
Neg logit contributions
 assurances-3.256
 calculations-3.224
 Preferences-3.153
 Integrity-3.115
 negotiations-3.112
Tokenug
Token ID1018
Feature activation+7.00e-01

Pos logit contributions
ess+6.471
der+6.168
lete+6.074
rely+5.806
nesday+5.599
Neg logit contributions
 assurances-12.170
 Preferences-12.101
 calculations-12.022
 negotiations-11.878
 Integrity-11.765
Tokennant
Token ID22057
Feature activation+3.26e+00

Pos logit contributions
ess+0.905
nesday+0.869
quished+0.864
der+0.864
rely+0.740
Neg logit contributions
 assurances-5.022
 calculations-5.018
 negotiations-4.919
 remarks-4.917
 Integrity-4.914
Token to
Token ID284
Feature activation+5.27e-01

Pos logit contributions
der+13.816
ess+13.560
rely+12.112
nesday+11.966
quished+11.939
Neg logit contributions
 Preferences-11.245
 Integrity-11.187
 assurances-11.014
 coefficients-10.737
 calculations-10.673
Token me
Token ID502
Feature activation+2.97e-01

Pos logit contributions
ess+2.380
der+2.360
nesday+2.344
quished+2.336
rely+2.252
Neg logit contributions
 assurances-2.128
 calculations-2.115
 Preferences-2.042
 negotiations-2.040
 Integrity-2.039
Token,
Token ID11
Feature activation+1.36e-01

Pos logit contributions
ess+1.149
der+1.136
nesday+1.124
quished+1.116
rely+1.079
Neg logit contributions
 assurances-1.385
 calculations-1.375
 Preferences-1.340
 negotiations-1.335
 Integrity-1.332
Token and
Token ID290
Feature activation+5.19e-01

Pos logit contributions
nesday+0.670
ess+0.670
quished+0.668
der+0.665
rely+0.642
Neg logit contributions
 assurances-0.489
 calculations-0.483
 negotiations-0.464
 remarks-0.463
 Integrity-0.463
Token a
Token ID257
Feature activation+1.60e+00

Pos logit contributions
ess+2.441
quished+2.426
nesday+2.423
der+2.413
rely+2.316
Neg logit contributions
 assurances-2.004
 calculations-1.985
 negotiations-1.913
 remarks-1.905
 Preferences-1.903
Token
Token ID252
Feature activation-4.01e-01

Pos logit contributions
 assurances+2.225
 calculations+2.202
 Preferences+2.110
 Integrity+2.079
 negotiations+2.070
Neg logit contributions
ess-4.942
nesday-4.887
der-4.841
quished-4.816
rely-4.740
Token Alternative
Token ID27182
Feature activation+6.36e-01

Pos logit contributions
 calculations+1.348
 assurances+1.345
 Preferences+1.305
 negotiations+1.294
 Integrity+1.293
Neg logit contributions
ess-2.033
der-2.021
nesday-2.012
quished-1.996
rely-1.953
Token Facts
Token ID26972
Feature activation+2.33e-01

Pos logit contributions
ess+3.567
der+3.528
quished+3.478
nesday+3.477
rely+3.398
Neg logit contributions
 assurances-1.842
 calculations-1.799
 Preferences-1.736
 negotiations-1.728
 Integrity-1.714
Token in
Token ID287
Feature activation+1.17e-01

Pos logit contributions
ess+0.991
der+0.982
nesday+0.972
quished+0.966
rely+0.937
Neg logit contributions
 assurances-0.996
 calculations-0.991
 Preferences-0.964
 negotiations-0.959
 Integrity-0.959
Token Eastern
Token ID8345
Feature activation+1.24e+00

Pos logit contributions
ess+0.550
der+0.545
nesday+0.537
quished+0.536
rely+0.520
Neg logit contributions
 assurances-0.451
 calculations-0.446
 Preferences-0.431
 negotiations-0.429
 Integrity-0.427
Token U
Token ID471
Feature activation+3.25e+00

Pos logit contributions
ess+5.665
der+5.608
quished+5.434
nesday+5.389
rely+5.276
Neg logit contributions
 assurances-4.664
 calculations-4.567
 negotiations-4.365
 negatives-4.349
 flares-4.338
Tokentopia
Token ID46575
Feature activation+1.20e+00

Pos logit contributions
der+7.305
ess+7.304
rely+6.482
die+6.404
species+6.126
Neg logit contributions
 assurances-18.728
 Integrity-18.340
 flares-18.308
 calculations-18.273
 Preferences-18.033
Token)
Token ID8
Feature activation+5.40e-01

Pos logit contributions
ess+5.477
der+5.408
quished+5.276
nesday+5.246
rely+5.099
Neg logit contributions
 assurances-4.700
 calculations-4.603
 Preferences-4.454
 negotiations-4.429
 Integrity-4.413
Token is
Token ID318
Feature activation+5.27e-01

Pos logit contributions
ess+2.337
der+2.313
nesday+2.288
quished+2.274
rely+2.207
Neg logit contributions
 assurances-2.267
 calculations-2.250
 Preferences-2.189
 negotiations-2.180
 Integrity-2.177
Token an
Token ID281
Feature activation+1.05e+00

Pos logit contributions
ess+2.560
der+2.536
nesday+2.512
quished+2.499
rely+2.431
Neg logit contributions
 assurances-1.927
 calculations-1.912
 Preferences-1.851
 negotiations-1.842
 Integrity-1.841
Token official
Token ID1743
Feature activation+1.10e+00

Pos logit contributions
ess+5.266
der+5.172
quished+5.088
nesday+5.073
rely+4.983
Neg logit contributions
 assurances-3.689
 calculations-3.616
 Preferences-3.540
 negotiations-3.468
 Integrity-3.457
Token actually
Token ID1682
Feature activation+3.36e-01

Pos logit contributions
ess+3.934
der+3.890
nesday+3.856
quished+3.851
rely+3.725
Neg logit contributions
 assurances-2.905
 calculations-2.877
 Preferences-2.783
 negotiations-2.776
 Integrity-2.763
Token show
Token ID905
Feature activation+7.01e-01

Pos logit contributions
ess+1.632
quished+1.617
nesday+1.611
der+1.610
rely+1.540
Neg logit contributions
 assurances-1.223
 calculations-1.210
 negotiations-1.178
 Preferences-1.165
 Integrity-1.164
Token a
Token ID257
Feature activation+1.41e+00

Pos logit contributions
ess+3.369
der+3.340
nesday+3.318
quished+3.301
rely+3.203
Neg logit contributions
 assurances-2.613
 calculations-2.587
 Preferences-2.503
 negotiations-2.501
 Integrity-2.488
Token toilet
Token ID16146
Feature activation+1.48e+00

Pos logit contributions
ess+6.530
der+6.469
nesday+6.347
quished+6.318
rely+6.167
Neg logit contributions
 assurances-5.408
 calculations-5.361
 Preferences-5.207
 Integrity-5.165
 negotiations-5.152
Token in
Token ID287
Feature activation+2.67e-01

Pos logit contributions
der+7.081
ess+7.060
nesday+6.866
quished+6.830
rely+6.718
Neg logit contributions
 assurances-5.422
 calculations-5.339
 Preferences-5.200
 Integrity-5.140
 negotiations-5.116
Token ab
Token ID450
Feature activation+3.22e+00

Pos logit contributions
ess+1.314
der+1.304
nesday+1.295
quished+1.288
rely+1.250
Neg logit contributions
 assurances-0.959
 calculations-0.950
 Preferences-0.921
 negotiations-0.919
 Integrity-0.912
Tokenath
Token ID776
Feature activation+7.85e-01

Pos logit contributions
ess+11.699
der+11.158
rely+10.424
joined+10.300
die+10.180
Neg logit contributions
 assurances-14.167
 Preferences-14.015
 Integrity-13.906
 calculations-13.823
 negotiations-13.672
Tokenroom
Token ID3823
Feature activation+1.18e+00

Pos logit contributions
ess+2.391
der+2.372
nesday+2.290
quished+2.287
rely+2.193
Neg logit contributions
 assurances-4.279
 calculations-4.257
 Preferences-4.189
 Integrity-4.157
 negotiations-4.144
Token seemed
Token ID3947
Feature activation-6.11e-01

Pos logit contributions
ess+5.367
der+5.317
nesday+5.220
quished+5.193
rely+5.061
Neg logit contributions
 assurances-4.601
 calculations-4.557
 Preferences-4.433
 Integrity-4.426
 negotiations-4.380
Token oddly
Token ID31414
Feature activation-4.87e-01

Pos logit contributions
 assurances+2.283
 calculations+2.257
 negotiations+2.168
 remarks+2.146
 Preferences+2.129
Neg logit contributions
nesday-2.858
quished-2.834
ess-2.794
der-2.780
rely-2.745
Token appropriate
Token ID5035
Feature activation+5.02e-01

Pos logit contributions
 assurances+1.797
 calculations+1.781
 negotiations+1.715
 remarks+1.704
 Preferences+1.693
Neg logit contributions
quished-2.316
nesday-2.315
ess-2.288
der-2.265
rely-2.222
Token
Token ID198
Feature activation+6.80e-01

Pos logit contributions
 calculations+0.485
 assurances+0.482
 negotiations+0.470
 remarks+0.469
 Preferences+0.467
Neg logit contributions
nesday-0.767
quished-0.751
ess-0.720
der-0.717
rely-0.695
Tokenis
Token ID271
Feature activation+3.19e-01

Pos logit contributions
ess+2.905
der+2.876
nesday+2.855
quished+2.842
rely+2.745
Neg logit contributions
 assurances-2.882
 calculations-2.867
 Preferences-2.787
 negotiations-2.772
 Integrity-2.767
Token created
Token ID2727
Feature activation+1.85e+00

Pos logit contributions
ess+1.309
der+1.297
nesday+1.282
quished+1.271
rely+1.240
Neg logit contributions
 assurances-1.404
 calculations-1.399
 Preferences-1.358
 negotiations-1.353
 Integrity-1.350
Token and
Token ID290
Feature activation+1.44e+00

Pos logit contributions
ess+8.848
der+8.724
rely+8.308
quished+8.240
nesday+8.224
Neg logit contributions
 assurances-6.561
 calculations-6.425
 Preferences-6.326
 Integrity-6.302
 negotiations-6.109
Token the
Token ID262
Feature activation+8.66e-01

Pos logit contributions
ess+6.608
der+6.527
quished+6.324
nesday+6.302
rely+6.213
Neg logit contributions
 assurances-5.559
 calculations-5.462
 Preferences-5.360
 Integrity-5.338
 negotiations-5.258
Token archived
Token ID33962
Feature activation+3.20e+00

Pos logit contributions
ess+4.006
der+3.916
nesday+3.865
quished+3.857
joined+3.770
Neg logit contributions
 assurances-3.345
 calculations-3.231
 Preferences-3.181
 negotiations-3.141
 Integrity-3.125
Token file
Token ID2393
Feature activation+9.26e-01

Pos logit contributions
ess+13.656
der+13.530
die+12.691
rely+12.654
joined+12.231
Neg logit contributions
 assurances-11.588
 Integrity-10.780
 Preferences-10.722
 calculations-10.709
 judgments-10.613
Token is
Token ID318
Feature activation+6.62e-01

Pos logit contributions
ess+3.394
der+3.349
nesday+3.279
quished+3.262
rely+3.158
Neg logit contributions
 assurances-4.493
 calculations-4.454
 Preferences-4.359
 Integrity-4.334
 negotiations-4.330
Token uploaded
Token ID19144
Feature activation+2.30e+00

Pos logit contributions
ess+2.557
der+2.537
nesday+2.527
quished+2.497
rely+2.416
Neg logit contributions
 assurances-3.085
 calculations-3.085
 negotiations-2.991
 Preferences-2.990
 remarks-2.959
Token to
Token ID284
Feature activation+1.61e+00

Pos logit contributions
ess+9.745
der+9.489
rely+8.992
die+8.760
joined+8.659
Neg logit contributions
 assurances-9.117
 calculations-8.928
 Integrity-8.883
 Preferences-8.879
 judgments-8.580
Token it
Token ID340
Feature activation+2.10e-01

Pos logit contributions
ess+6.372
der+6.285
nesday+6.000
quished+5.941
rely+5.889
Neg logit contributions
 assurances-7.251
 calculations-7.119
 Preferences-7.030
 Integrity-7.005
 negotiations-6.887
Token Probably
Token ID18578
Feature activation-1.74e+00

Pos logit contributions
ess+2.799
der+2.778
nesday+2.763
quished+2.750
rely+2.665
Neg logit contributions
 assurances-1.792
 calculations-1.771
 Preferences-1.723
 Integrity-1.714
 negotiations-1.706
Token YOU
Token ID7013
Feature activation+4.54e-01

Pos logit contributions
 assurances+6.453
 calculations+6.432
 negotiations+6.263
 remarks+6.090
 judgments+6.086
Neg logit contributions
quished-8.093
nesday-7.826
der-7.514
ess-7.349
joined-7.169
TokenRS
Token ID6998
Feature activation-1.21e-01

Pos logit contributions
ess+1.874
nesday+1.856
der+1.855
quished+1.848
rely+1.768
Neg logit contributions
 assurances-1.984
 calculations-1.979
 Preferences-1.921
 negotiations-1.917
 Integrity-1.912
Token.
Token ID13
Feature activation+4.20e-01

Pos logit contributions
 assurances+0.488
 calculations+0.488
 Preferences+0.471
 Integrity+0.470
 negotiations+0.469
Neg logit contributions
quished-0.534
der-0.534
ess-0.532
nesday-0.531
joined-0.508
Token Most
Token ID4042
Feature activation-1.07e-01

Pos logit contributions
ess+2.179
der+2.162
nesday+2.156
quished+2.146
rely+2.073
Neg logit contributions
 assurances-1.403
 calculations-1.386
 Preferences-1.354
 Integrity-1.347
 negotiations-1.336
Token Un
Token ID791
Feature activation+3.19e+00

Pos logit contributions
 assurances+0.391
 calculations+0.386
 negotiations+0.375
 Preferences+0.374
 Integrity+0.371
Neg logit contributions
ess-0.527
nesday-0.525
quished-0.523
der-0.522
joined-0.502
Tokensett
Token ID17744
Feature activation+1.47e+00

Pos logit contributions
ess+7.922
der+7.407
rely+6.859
joined+6.734
quished+6.694
Neg logit contributions
 assurances-18.037
 calculations-17.852
 coefficients-17.568
 Preferences-17.509
 Integrity-17.469
Tokenling
Token ID1359
Feature activation+1.74e+00

Pos logit contributions
ess+4.887
der+4.822
nesday+4.709
quished+4.679
rely+4.509
Neg logit contributions
 assurances-7.655
 calculations-7.605
 Preferences-7.448
 Integrity-7.409
 negotiations-7.403
Token Moment
Token ID29278
Feature activation+8.34e-01

Pos logit contributions
ess+6.682
der+6.659
nesday+6.502
quished+6.420
rely+6.098
Neg logit contributions
 calculations-7.650
 assurances-7.640
 negotiations-7.354
 briefings-7.302
 coefficients-7.297
Token:
Token ID25
Feature activation+4.03e-01

Pos logit contributions
ess+3.425
der+3.351
nesday+3.291
quished+3.272
rely+3.145
Neg logit contributions
 assurances-3.643
 calculations-3.606
 negotiations-3.480
 Preferences-3.479
 Integrity-3.433
Token Nothing
Token ID10528
Feature activation-1.29e+00

Pos logit contributions
ess+2.112
der+2.096
nesday+2.090
quished+2.086
rely+2.003
Neg logit contributions
 assurances-1.332
 calculations-1.315
 Preferences-1.286
 Integrity-1.283
 negotiations-1.269
Token handle
Token ID5412
Feature activation-8.28e-02

Pos logit contributions
 calculations+2.243
 assurances+2.147
 remarks+2.073
 negotiations+2.071
 Preferences+2.033
Neg logit contributions
quished-3.692
nesday-3.521
ess-3.503
der-3.456
joined-3.368
Token multiple
Token ID3294
Feature activation-4.99e-01

Pos logit contributions
 calculations+0.290
 assurances+0.288
 negotiations+0.274
 Integrity+0.271
 Preferences+0.271
Neg logit contributions
ess-0.420
quished-0.420
nesday-0.419
der-0.418
rely-0.403
Token spikes
Token ID27198
Feature activation-3.36e-01

Pos logit contributions
 calculations+1.615
 assurances+1.580
 negotiations+1.487
 remarks+1.473
 Preferences+1.460
Neg logit contributions
nesday-2.699
quished-2.685
ess-2.663
der-2.649
rely-2.570
Token in
Token ID287
Feature activation-1.10e+00

Pos logit contributions
 calculations+1.343
 assurances+1.316
 negotiations+1.268
 remarks+1.263
 judgments+1.238
Neg logit contributions
quished-1.550
nesday-1.527
ess-1.506
der-1.503
rely-1.460
Token the
Token ID262
Feature activation-9.46e-01

Pos logit contributions
 calculations+3.527
 assurances+3.406
 negotiations+3.279
 remarks+3.230
 coefficients+3.205
Neg logit contributions
quished-5.963
nesday-5.895
der-5.831
ess-5.790
rely-5.634
Token angle
Token ID9848
Feature activation-3.67e+00

Pos logit contributions
 calculations+3.021
 assurances+2.906
 negotiations+2.774
 remarks+2.737
 Integrity+2.732
Neg logit contributions
quished-5.157
nesday-5.086
der-5.030
ess-5.014
rely-4.868
Token-
Token ID12
Feature activation-6.26e-01

Pos logit contributions
 calculations+11.764
 assurances+10.903
 coefficients+10.748
 negotiations+10.543
 judgments+10.453
Neg logit contributions
nesday-17.127
quished-16.852
rely-16.553
der-16.425
ess-16.360
Tokenof
Token ID1659
Feature activation-1.87e+00

Pos logit contributions
 calculations+2.784
 assurances+2.675
 negotiations+2.648
 remarks+2.632
 coefficients+2.595
Neg logit contributions
nesday-2.599
quished-2.555
der-2.329
ess-2.318
 citiz-2.274
Token-
Token ID12
Feature activation+4.36e-03

Pos logit contributions
 calculations+3.798
 negotiations+3.583
 assurances+3.505
 remarks+3.453
 Integrity+3.260
Neg logit contributions
nesday-11.718
quished-11.414
 citiz-11.190
 thous-10.794
eatures-10.740
Tokenattack
Token ID20358
Feature activation-4.96e-01

Pos logit contributions
nesday+0.022
quished+0.022
der+0.022
ess+0.022
rely+0.021
Neg logit contributions
 calculations-0.015
 assurances-0.014
 negotiations-0.014
 remarks-0.014
 Preferences-0.014
Token data
Token ID1366
Feature activation-1.34e+00

Pos logit contributions
 calculations+1.648
 assurances+1.584
 negotiations+1.516
 remarks+1.495
 coefficients+1.490
Neg logit contributions
quished-2.666
nesday-2.657
ess-2.609
der-2.588
joined-2.533
Token sensitivity
Token ID14233
Feature activation-1.16e+00

Pos logit contributions
 calculations+3.033
 assurances+2.965
 negotiations+2.865
 Preferences+2.826
 Integrity+2.799
Neg logit contributions
nesday-9.316
quished-9.211
der-8.840
joined-8.794
ess-8.724
Token.
Token ID13
Feature activation-1.61e-01

Pos logit contributions
 calculations+5.274
 assurances+5.224
 negotiations+5.110
 Preferences+5.109
 Integrity+5.068
Neg logit contributions
nesday-4.397
ess-4.348
quished-4.337
der-4.287
rely-4.200
Token
Token ID198
Feature activation-3.61e-01

Pos logit contributions
 assurances+0.665
 calculations+0.655
 negotiations+0.633
 Preferences+0.631
 remarks+0.627
Neg logit contributions
ess-0.721
der-0.712
quished-0.694
nesday-0.692
rely-0.674
Token
Token ID198
Feature activation+3.63e-01

Pos logit contributions
 calculations+1.160
 assurances+1.155
 Preferences+1.132
 negotiations+1.131
 remarks+1.128
Neg logit contributions
nesday-1.976
quished-1.920
ess-1.817
der-1.814
 citiz-1.794
TokenIncreases
Token ID28544
Feature activation-9.08e-02

Pos logit contributions
ess+1.753
nesday+1.748
der+1.738
quished+1.731
rely+1.672
Neg logit contributions
 assurances-1.323
 calculations-1.313
 Preferences-1.279
 negotiations-1.267
 Integrity-1.265
Token lip
Token ID10645
Feature activation-3.66e+00

Pos logit contributions
 calculations+0.292
 assurances+0.292
 negotiations+0.281
 Preferences+0.279
 Integrity+0.278
Neg logit contributions
nesday-0.482
ess-0.480
quished-0.480
der-0.476
joined-0.461
Tokenoly
Token ID3366
Feature activation-1.87e+00

Pos logit contributions
 Integrity+11.837
 calculations+11.822
 negotiations+11.696
 coefficients+11.616
 assurances+11.500
Neg logit contributions
nesday-15.264
quished-15.067
der-13.814
rely-13.580
ess-13.356
Tokensis
Token ID13429
Feature activation-9.53e-01

Pos logit contributions
 calculations+6.418
 negotiations+6.051
 assurances+6.046
 Integrity+6.035
 remarks+5.947
Neg logit contributions
nesday-9.108
quished-9.100
ess-8.577
joined-8.264
der-8.258
Token and
Token ID290
Feature activation-3.46e-01

Pos logit contributions
 calculations+4.189
 assurances+4.134
 negotiations+4.032
 Preferences+4.025
 Integrity+4.014
Neg logit contributions
nesday-3.935
quished-3.870
ess-3.816
der-3.773
rely-3.682
Token free
Token ID1479
Feature activation-9.24e-01

Pos logit contributions
 assurances+1.197
 calculations+1.197
 negotiations+1.165
 Integrity+1.145
 Preferences+1.139
Neg logit contributions
quished-1.779
nesday-1.777
ess-1.728
der-1.719
joined-1.681
Token fatty
Token ID19251
Feature activation-1.37e+00

Pos logit contributions
 calculations+3.040
 assurances+3.038
 negotiations+2.965
 Preferences+2.891
 remarks+2.882
Neg logit contributions
nesday-4.807
quished-4.733
der-4.643
ess-4.604
joined-4.540
Token computers
Token ID9061
Feature activation+1.21e-01

Pos logit contributions
 calculations+3.273
 assurances+3.112
 negotiations+2.972
 judgments+2.922
 Preferences+2.921
Neg logit contributions
nesday-5.377
quished-5.289
der-5.122
ess-5.111
rely-5.019
Token then
Token ID788
Feature activation-8.94e-01

Pos logit contributions
quished+0.551
ess+0.547
nesday+0.546
der+0.545
rely+0.526
Neg logit contributions
 calculations-0.482
 assurances-0.479
 negotiations-0.459
 remarks-0.457
 Preferences-0.457
Token reacted
Token ID22798
Feature activation-3.95e-02

Pos logit contributions
 calculations+3.733
 assurances+3.628
 remarks+3.571
 negotiations+3.509
 judgments+3.463
Neg logit contributions
quished-3.962
nesday-3.865
der-3.774
ess-3.759
escent-3.644
Token to
Token ID284
Feature activation-3.54e-01

Pos logit contributions
 calculations+0.152
 assurances+0.150
 negotiations+0.145
 Preferences+0.144
 remarks+0.143
Neg logit contributions
quished-0.183
nesday-0.183
ess-0.183
der-0.181
rely-0.177
Token the
Token ID262
Feature activation-7.13e-01

Pos logit contributions
 calculations+1.236
 assurances+1.229
 negotiations+1.167
 Preferences+1.157
 remarks+1.157
Neg logit contributions
quished-1.805
ess-1.787
der-1.786
nesday-1.786
rely-1.730
Token angle
Token ID9848
Feature activation-3.56e+00

Pos logit contributions
 calculations+2.035
 assurances+1.978
 remarks+1.855
 negotiations+1.832
 Preferences+1.809
Neg logit contributions
quished-4.197
nesday-4.105
ess-4.074
der-4.072
rely-3.952
Token-
Token ID12
Feature activation-5.29e-01

Pos logit contributions
 calculations+11.818
 assurances+10.598
 coefficients+10.390
 judgments+10.372
 remarks+10.343
Neg logit contributions
quished-16.582
nesday-16.521
joined-15.776
rely-15.648
escent-15.489
Tokenof
Token ID1659
Feature activation-1.89e+00

Pos logit contributions
 assurances+2.677
 calculations+2.669
 Preferences+2.599
 negotiations+2.591
 Integrity+2.586
Neg logit contributions
ess-1.808
der-1.788
nesday-1.774
quished-1.762
rely-1.687
Token-
Token ID12
Feature activation-1.22e-01

Pos logit contributions
 calculations+4.241
 negotiations+4.016
 assurances+3.942
 remarks+3.817
 judgments+3.639
Neg logit contributions
nesday-11.479
quished-11.267
der-10.593
ess-10.591
 citiz-10.552
Tokenattack
Token ID20358
Feature activation-3.35e-01

Pos logit contributions
 assurances+0.448
 calculations+0.446
 Preferences+0.432
 negotiations+0.430
 Integrity+0.428
Neg logit contributions
ess-0.586
der-0.582
nesday-0.579
quished-0.577
rely-0.558
Token data
Token ID1366
Feature activation-1.40e+00

Pos logit contributions
 calculations+0.953
 assurances+0.928
 negotiations+0.870
 remarks+0.866
 Preferences+0.863
Neg logit contributions
quished-1.952
nesday-1.939
ess-1.927
der-1.919
rely-1.872
Token le
Token ID443
Feature activation-1.02e+00

Pos logit contributions
 assurances+2.341
 calculations+2.340
 negotiations+2.271
 remarks+2.246
 Preferences+2.236
Neg logit contributions
nesday-3.259
quished-3.199
ess-3.191
der-3.143
joined-3.045
Tokenpro
Token ID1676
Feature activation-1.67e-01

Pos logit contributions
 calculations+4.018
 assurances+3.984
 remarks+3.948
 Preferences+3.890
 negotiations+3.871
Neg logit contributions
nesday-4.362
quished-4.244
ess-4.122
 citiz-4.044
der-3.957
Tokensy
Token ID1837
Feature activation-1.00e+00

Pos logit contributions
 calculations+0.760
 assurances+0.752
 remarks+0.740
 negotiations+0.739
 Preferences+0.726
Neg logit contributions
nesday-0.660
quished-0.654
ess-0.627
der-0.609
joined-0.586
Token-
Token ID12
Feature activation+6.96e-01

Pos logit contributions
 calculations+4.285
 assurances+4.275
 negotiations+4.197
 remarks+4.135
 Preferences+4.126
Neg logit contributions
nesday-4.230
quished-4.124
ess-4.052
der-3.973
rely-3.861
Tokeninduced
Token ID17223
Feature activation-6.96e-01

Pos logit contributions
ess+2.353
der+2.320
nesday+2.293
quished+2.276
rely+2.180
Neg logit contributions
 assurances-3.574
 calculations-3.550
 Preferences-3.474
 negotiations-3.459
 Integrity-3.453
Token nerve
Token ID16384
Feature activation-3.41e+00

Pos logit contributions
 assurances+2.316
 calculations+2.314
 negotiations+2.243
 remarks+2.228
 judgments+2.189
Neg logit contributions
nesday-3.656
quished-3.600
ess-3.588
der-3.500
joined-3.429
Token damage
Token ID2465
Feature activation-7.56e-01

Pos logit contributions
 flares+10.142
 calculations+9.933
 negotiations+9.850
 assurances+9.714
 negatives+9.676
Neg logit contributions
nesday-16.587
quished-16.355
ess-14.815
rely-14.323
joined-14.295
Token.[
Token ID3693
Feature activation-1.15e+00

Pos logit contributions
 calculations+3.169
 assurances+3.152
 negotiations+3.066
 remarks+3.026
 Preferences+3.025
Neg logit contributions
nesday-3.268
quished-3.214
ess-3.166
der-3.156
rely-3.075
Token25
Token ID1495
Feature activation-4.91e-01

Pos logit contributions
 calculations+3.793
 assurances+3.769
 negotiations+3.611
 Preferences+3.597
 remarks+3.594
Neg logit contributions
nesday-5.820
ess-5.744
quished-5.723
der-5.639
rely-5.555
Token]
Token ID60
Feature activation-1.02e+00

Pos logit contributions
 assurances+1.692
 calculations+1.670
 Preferences+1.614
 negotiations+1.607
 Integrity+1.600
Neg logit contributions
ess-2.515
der-2.491
quished-2.445
nesday-2.441
rely-2.381
Token
Token ID198
Feature activation-6.43e-01

Pos logit contributions
 Preferences+3.319
 assurances+3.299
 Integrity+3.284
 calculations+3.284
 remarks+3.183
Neg logit contributions
nesday-5.315
quished-5.197
ess-5.150
der-5.087
rely-5.004
Token En
Token ID2039
Feature activation-8.29e-01

Pos logit contributions
 assurances+2.209
 calculations+2.182
 negotiations+2.116
 Preferences+2.099
 Integrity+2.091
Neg logit contributions
ess-3.186
nesday-3.165
der-3.164
quished-3.142
rely-3.059
Tokenbridge
Token ID9458
Feature activation-1.86e+00

Pos logit contributions
 calculations+3.893
 assurances+3.856
 negotiations+3.801
 remarks+3.790
 Integrity+3.774
Neg logit contributions
nesday-3.034
quished-2.914
der-2.755
ess-2.745
rely-2.586
Token
Token ID447
Feature activation-5.20e-01

Pos logit contributions
 assurances+7.930
 calculations+7.907
 negotiations+7.728
 Integrity+7.657
 remarks+7.523
Neg logit contributions
nesday-7.375
ess-7.296
quished-7.245
der-7.201
rely-7.121
Token
Token ID247
Feature activation-5.32e-01

Pos logit contributions
 assurances+1.287
 calculations+1.249
 Integrity+1.213
 Preferences+1.196
 negotiations+1.187
Neg logit contributions
nesday-3.159
quished-3.108
ess-3.099
der-3.068
rely-2.989
Tokens
Token ID82
Feature activation-2.79e+00

Pos logit contributions
 calculations+2.805
 assurances+2.798
 negotiations+2.753
 remarks+2.717
 Preferences+2.706
Neg logit contributions
nesday-1.736
quished-1.726
ess-1.613
der-1.561
joined-1.528
Token Line
Token ID6910
Feature activation-3.35e+00

Pos logit contributions
 assurances+9.233
 calculations+9.231
 negotiations+9.066
 Integrity+8.829
 remarks+8.674
Neg logit contributions
nesday-13.111
quished-12.923
joined-12.266
lete-12.156
der-12.150
Token 9
Token ID860
Feature activation-2.13e+00

Pos logit contributions
 Integrity+8.647
 negotiations+8.262
 calculations+8.235
 assurances+8.046
 Preferences+7.827
Neg logit contributions
nesday-16.624
quished-16.340
der-15.796
ess-15.681
joined-15.612
Token pipeline
Token ID11523
Feature activation-2.16e+00

Pos logit contributions
 calculations+7.198
 assurances+7.021
 negotiations+6.994
 Integrity+6.812
 remarks+6.617
Neg logit contributions
quished-10.052
nesday-10.042
ess-9.724
joined-9.653
rely-9.557
Token was
Token ID373
Feature activation-4.50e-01

Pos logit contributions
 calculations+7.976
 assurances+7.960
 negotiations+7.949
 Integrity+7.475
 preferences+7.382
Neg logit contributions
quished-9.871
nesday-9.858
ess-9.258
rely-9.192
der-9.191
Token
Token ID564
Feature activation-8.04e-01

Pos logit contributions
 assurances+1.317
 calculations+1.276
 negotiations+1.239
 remarks+1.202
 Integrity+1.195
Neg logit contributions
nesday-2.544
quished-2.537
der-2.496
ess-2.491
joined-2.427
Token
Token ID250
Feature activation-9.26e-01

Pos logit contributions
 assurances+2.212
 calculations+2.146
 Integrity+2.114
 Preferences+2.099
 negotiations+2.089
Neg logit contributions
nesday-4.632
quished-4.511
ess-4.426
der-4.404
die-4.266
Token entering
Token ID8218
Feature activation-1.57e+00

Pos logit contributions
 calculations+1.242
 assurances+1.230
 negotiations+1.209
 remarks+1.171
 Integrity+1.159
Neg logit contributions
der-1.970
ess-1.964
quished-1.962
nesday-1.958
die-1.872
Token into
Token ID656
Feature activation-9.56e-01

Pos logit contributions
 negotiations+5.319
 calculations+5.258
 assurances+5.117
 remarks+5.051
 judgments+4.928
Neg logit contributions
quished-7.928
der-7.821
nesday-7.802
ess-7.770
die-7.641
Token force
Token ID2700
Feature activation-4.05e-01

Pos logit contributions
 calculations+3.061
 negotiations+3.011
 assurances+2.941
 remarks+2.853
 Preferences+2.852
Neg logit contributions
der-5.049
nesday-5.044
ess-5.020
quished-5.000
die-4.890
Token of
Token ID286
Feature activation-2.88e-01

Pos logit contributions
 calculations+1.476
 assurances+1.440
 negotiations+1.439
 remarks+1.385
 Preferences+1.383
Neg logit contributions
der-1.997
quished-1.993
ess-1.987
nesday-1.986
die-1.902
Token the
Token ID262
Feature activation-3.98e-01

Pos logit contributions
 assurances+1.023
 calculations+1.018
 negotiations+0.993
 Preferences+0.976
 Integrity+0.970
Neg logit contributions
ess-1.431
der-1.428
quished-1.418
nesday-1.411
rely-1.362
Token Lisbon
Token ID41898
Feature activation-3.35e+00

Pos logit contributions
 assurances+1.238
 calculations+1.236
 negotiations+1.206
 Preferences+1.171
 Integrity+1.168
Neg logit contributions
der-2.156
quished-2.154
ess-2.145
nesday-2.139
rely-2.066
Token Treaty
Token ID21345
Feature activation-9.52e-01

Pos logit contributions
 negotiations+9.796
 assurances+9.262
 calculations+9.199
 judgments+8.841
 remarks+8.783
Neg logit contributions
nesday-16.166
quished-15.933
der-15.488
escent-15.169
rely-15.030
Token);
Token ID1776
Feature activation-1.36e+00

Pos logit contributions
 calculations+3.927
 negotiations+3.916
 assurances+3.855
 Preferences+3.772
 remarks+3.729
Neg logit contributions
quished-4.153
nesday-4.065
der-4.049
ess-4.025
escent-3.862
Token
Token ID198
Feature activation-8.64e-02

Pos logit contributions
 assurances+4.815
 calculations+4.735
 negotiations+4.730
 Integrity+4.548
 remarks+4.519
Neg logit contributions
quished-6.556
nesday-6.511
der-6.436
ess-6.405
die-6.214
Token
Token ID198
Feature activation-1.68e-01

Pos logit contributions
 calculations+0.294
 assurances+0.293
 remarks+0.286
 negotiations+0.286
 Preferences+0.284
Neg logit contributions
nesday-0.451
quished-0.442
ess-0.426
der-0.425
rely-0.412
Token-
Token ID12
Feature activation-2.16e-01

Pos logit contributions
 assurances+0.573
 calculations+0.564
 Preferences+0.549
 Integrity+0.544
 negotiations+0.540
Neg logit contributions
ess-0.877
der-0.864
nesday-0.847
quished-0.844
rely-0.823
Tokenp
Token ID79
Feature activation-9.65e-03

Pos logit contributions
 calculations+1.540
 assurances+1.533
 Preferences+1.494
 negotiations+1.486
 remarks+1.476
Neg logit contributions
nesday-1.486
ess-1.475
quished-1.470
der-1.465
rely-1.408
Token>
Token ID29
Feature activation-3.85e-01

Pos logit contributions
 assurances+0.036
 calculations+0.036
 Preferences+0.035
 negotiations+0.035
 Integrity+0.035
Neg logit contributions
ess-0.046
der-0.045
nesday-0.045
quished-0.045
rely-0.043
Token{{
Token ID27007
Feature activation+3.69e-01

Pos logit contributions
 calculations+1.527
 assurances+1.513
 Preferences+1.492
 negotiations+1.464
 Integrity+1.457
Neg logit contributions
nesday-1.713
ess-1.713
der-1.696
quished-1.694
rely-1.634
Token phone
Token ID3072
Feature activation-5.58e-01

Pos logit contributions
ess+1.976
der+1.961
nesday+1.944
quished+1.937
rely+1.888
Neg logit contributions
 assurances-1.164
 calculations-1.155
 Preferences-1.111
 negotiations-1.104
 Integrity-1.097
Token.
Token ID13
Feature activation-1.37e-01

Pos logit contributions
 calculations+2.758
 assurances+2.757
 Preferences+2.685
 negotiations+2.672
 Integrity+2.659
Neg logit contributions
ess-1.961
der-1.946
nesday-1.933
quished-1.914
rely-1.853
Tokensn
Token ID16184
Feature activation-3.33e+00

Pos logit contributions
 calculations+0.612
 assurances+0.611
 Preferences+0.595
 negotiations+0.592
 remarks+0.587
Neg logit contributions
quished-0.545
nesday-0.543
der-0.537
ess-0.536
rely-0.509
Tokenipp
Token ID3974
Feature activation-1.50e+00

Pos logit contributions
 preferences+11.434
 Preferences+10.983
 remarks+10.941
 calculations+10.860
 polls+10.726
Neg logit contributions
nesday-12.652
die-11.874
rely-11.668
quished-11.623
mable-11.495
Tokenet
Token ID316
Feature activation-4.61e-01

Pos logit contributions
 calculations+4.683
 remarks+4.644
 assurances+4.592
 Preferences+4.397
 negotiations+4.368
Neg logit contributions
nesday-7.467
quished-7.430
rely-7.284
joined-7.142
der-7.059
Token }}
Token ID34949
Feature activation+2.93e-01

Pos logit contributions
 calculations+1.995
 assurances+1.961
 Preferences+1.939
 negotiations+1.918
 remarks+1.908
Neg logit contributions
ess-1.875
quished-1.875
der-1.874
nesday-1.860
rely-1.819
Token
Token ID3556
Feature activation-1.36e-01

Pos logit contributions
ess+1.357
der+1.345
nesday+1.331
quished+1.324
rely+1.287
Neg logit contributions
 assurances-1.138
 calculations-1.132
 Preferences-1.099
 negotiations-1.090
 Integrity-1.086
Tokenp
Token ID79
Feature activation+1.92e-01

Pos logit contributions
 calculations+0.570
 assurances+0.565
 Preferences+0.556
 remarks+0.551
 negotiations+0.548
Neg logit contributions
quished-0.586
nesday-0.582
der-0.573
ess-0.566
rely-0.537
Token Trump
Token ID1301
Feature activation+2.04e-01

Pos logit contributions
ess+2.671
der+2.644
nesday+2.621
quished+2.607
rely+2.524
Neg logit contributions
 assurances-2.170
 calculations-2.156
 Preferences-2.090
 negotiations-2.081
 Integrity-2.073
Token discl
Token ID5862
Feature activation-3.52e-01

Pos logit contributions
ess+0.992
der+0.986
nesday+0.985
quished+0.980
rely+0.949
Neg logit contributions
 assurances-0.738
 calculations-0.737
 Preferences-0.712
 negotiations-0.709
 Integrity-0.704
Tokenoses
Token ID4629
Feature activation+3.66e-01

Pos logit contributions
 calculations+0.924
 remarks+0.884
 assurances+0.882
 Preferences+0.856
 negotiations+0.854
Neg logit contributions
nesday-2.041
der-2.023
quished-2.020
ess-1.955
die-1.949
Token on
Token ID319
Feature activation-1.07e+00

Pos logit contributions
ess+1.680
der+1.660
nesday+1.652
quished+1.644
rely+1.590
Neg logit contributions
 assurances-1.440
 calculations-1.435
 Preferences-1.391
 negotiations-1.384
 Integrity-1.378
Token his
Token ID465
Feature activation-1.20e+00

Pos logit contributions
 calculations+3.156
 assurances+3.139
 negotiations+3.047
 Integrity+2.989
 Preferences+2.989
Neg logit contributions
nesday-5.873
quished-5.783
ess-5.779
der-5.709
die-5.595
Token financial
Token ID3176
Feature activation-3.29e+00

Pos logit contributions
 calculations+3.878
 assurances+3.801
 negotiations+3.681
 Preferences+3.631
 remarks+3.629
Neg logit contributions
nesday-6.335
quished-6.206
ess-6.118
der-6.085
die-5.968
Token disclosures
Token ID29535
Feature activation-8.94e-01

Pos logit contributions
 calculations+10.862
 assurances+10.653
 remarks+10.556
 statements+10.386
 briefings+10.298
Neg logit contributions
nesday-15.106
quished-14.022
der-13.794
joined-13.671
die-13.524
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
 calculations+3.594
 assurances+3.521
 negotiations+3.415
 Preferences+3.411
 remarks+3.375
Neg logit contributions
nesday-3.964
ess-3.922
quished-3.889
der-3.851
joined-3.763
Token which
Token ID543
Feature activation+2.46e-01

Pos logit contributions
 calculations+3.719
 assurances+3.641
 negotiations+3.564
 Integrity+3.523
 remarks+3.514
Neg logit contributions
nesday-4.798
quished-4.723
ess-4.680
der-4.677
die-4.569
Token he
Token ID339
Feature activation-3.47e-01

Pos logit contributions
ess+1.063
nesday+1.060
quished+1.055
der+1.050
die+1.003
Neg logit contributions
 calculations-1.036
 assurances-1.034
 negotiations-1.001
 Preferences-0.995
 Integrity-0.994
Token sometimes
Token ID3360
Feature activation-7.81e-01

Pos logit contributions
 calculations+1.380
 assurances+1.375
 remarks+1.338
 negotiations+1.333
 Preferences+1.311
Neg logit contributions
nesday-1.572
quished-1.558
der-1.543
ess-1.536
rely-1.498
Token does
Token ID857
Feature activation+9.90e-02

Pos logit contributions
ess+1.288
quished+1.284
nesday+1.282
der+1.280
rely+1.233
Neg logit contributions
 assurances-1.250
 calculations-1.237
 negotiations-1.198
 Preferences-1.193
 remarks-1.185
Token show
Token ID905
Feature activation+7.42e-02

Pos logit contributions
quished+0.427
ess+0.427
nesday+0.424
der+0.422
rely+0.407
Neg logit contributions
 assurances-0.415
 calculations-0.415
 negotiations-0.400
 Preferences-0.397
 remarks-0.394
Token that
Token ID326
Feature activation+1.26e+00

Pos logit contributions
quished+0.357
nesday+0.357
ess+0.355
der+0.353
joined+0.340
Neg logit contributions
 calculations-0.276
 assurances-0.276
 negotiations-0.265
 Preferences-0.264
 Integrity-0.263
Token what
Token ID644
Feature activation+2.22e-01

Pos logit contributions
ess+5.913
der+5.864
nesday+5.692
quished+5.661
rely+5.619
Neg logit contributions
 assurances-4.782
 calculations-4.704
 Preferences-4.579
 negotiations-4.545
 Integrity-4.537
Token the
Token ID262
Feature activation+3.21e-02

Pos logit contributions
ess+0.910
nesday+0.908
quished+0.904
der+0.899
joined+0.854
Neg logit contributions
 assurances-0.985
 calculations-0.979
 Preferences-0.953
 negotiations-0.949
 Integrity-0.946
Token optical
Token ID18480
Feature activation-3.21e+00

Pos logit contributions
nesday+0.164
ess+0.164
quished+0.162
der+0.162
rely+0.157
Neg logit contributions
 calculations-0.110
 assurances-0.110
 Preferences-0.105
 negotiations-0.104
 Integrity-0.104
Token bench
Token ID7624
Feature activation-3.74e-01

Pos logit contributions
 calculations+12.391
 assurances+12.040
 preferences+11.686
 judgments+11.615
 coefficients+11.559
Neg logit contributions
nesday-13.490
quished-12.684
rely-12.220
joined-11.926
escent-11.916
Token sees
Token ID7224
Feature activation-9.56e-01

Pos logit contributions
 calculations+1.507
 assurances+1.478
 remarks+1.432
 Preferences+1.426
 negotiations+1.425
Neg logit contributions
nesday-1.677
quished-1.659
ess-1.653
der-1.645
rely-1.619
Token correlates
Token ID42567
Feature activation-4.06e-02

Pos logit contributions
 calculations+3.944
 assurances+3.813
 negotiations+3.746
 Preferences+3.740
 remarks+3.688
Neg logit contributions
quished-4.149
nesday-4.111
ess-3.996
joined-3.937
der-3.919
Token with
Token ID351
Feature activation-5.23e-01

Pos logit contributions
 assurances+0.171
 calculations+0.171
 Preferences+0.166
 negotiations+0.165
 Integrity+0.164
Neg logit contributions
ess-0.174
quished-0.173
nesday-0.172
der-0.172
rely-0.165
Token reality
Token ID3950
Feature activation-2.53e-01

Pos logit contributions
 calculations+1.845
 assurances+1.818
 Preferences+1.771
 negotiations+1.757
 Integrity+1.755
Neg logit contributions
quished-2.629
nesday-2.627
ess-2.578
der-2.550
joined-2.507
Token Andre
Token ID10948
Feature activation-1.07e+00

Pos logit contributions
 assurances+3.454
 calculations+3.447
 Preferences+3.380
 Integrity+3.358
 negotiations+3.356
Neg logit contributions
nesday-4.575
quished-4.560
ess-4.512
der-4.416
rely-4.366
Token Gre
Token ID11955
Feature activation-2.40e+00

Pos logit contributions
 assurances+2.904
 calculations+2.799
 negotiations+2.795
 remarks+2.786
 Preferences+2.704
Neg logit contributions
quished-6.106
ess-6.026
nesday-5.984
rely-5.946
joined-5.822
Tokenipel
Token ID40634
Feature activation-1.31e+00

Pos logit contributions
 Preferences+8.198
 remarks+8.004
 calculations+7.977
 negotiations+7.971
 preferences+7.809
Neg logit contributions
nesday-10.082
escent-9.609
quished-9.569
joined-9.462
mable-9.291
Token (
Token ID357
Feature activation-1.01e+00

Pos logit contributions
 assurances+4.380
 calculations+4.311
 Preferences+4.248
 remarks+4.239
 negotiations+4.234
Neg logit contributions
quished-6.542
nesday-6.484
ess-6.379
der-6.271
rely-6.234
TokenL
Token ID43
Feature activation-1.33e+00

Pos logit contributions
 assurances+3.616
 calculations+3.603
 negotiations+3.560
 Preferences+3.542
 Integrity+3.506
Neg logit contributions
nesday-4.786
quished-4.754
ess-4.584
rely-4.465
der-4.419
Tokenotto
Token ID17631
Feature activation-3.18e+00

Pos logit contributions
 calculations+4.917
 remarks+4.781
 negotiations+4.760
 assurances+4.759
 Preferences+4.718
Neg logit contributions
nesday-6.004
quished-6.003
rely-5.687
der-5.620
ess-5.544
Token -
Token ID532
Feature activation-7.48e-01

Pos logit contributions
 calculations+7.329
 negotiations+7.278
 Preferences+7.272
 assurances+7.195
 Integrity+6.941
Neg logit contributions
quished-17.482
nesday-16.677
der-16.188
escent-16.094
ess-16.071
Token Bel
Token ID3944
Feature activation+6.09e-01

Pos logit contributions
 assurances+2.337
 calculations+2.311
 negotiations+2.300
 Preferences+2.282
 Integrity+2.263
Neg logit contributions
quished-4.030
nesday-3.934
rely-3.888
der-3.875
ess-3.869
Tokenisol
Token ID30152
Feature activation-1.03e+00

Pos logit contributions
ess+1.702
der+1.674
nesday+1.639
quished+1.625
rely+1.547
Neg logit contributions
 assurances-3.488
 calculations-3.468
 Preferences-3.399
 negotiations-3.385
 Integrity-3.383
Token Team
Token ID4816
Feature activation-1.13e+00

Pos logit contributions
 assurances+3.708
 calculations+3.639
 negotiations+3.586
 Preferences+3.565
 remarks+3.534
Neg logit contributions
ess-4.908
quished-4.879
nesday-4.815
der-4.754
rely-4.722
Token)
Token ID8
Feature activation-8.27e-01

Pos logit contributions
 calculations+4.005
 assurances+3.965
 Preferences+3.929
 negotiations+3.920
 Integrity+3.916
Neg logit contributions
nesday-5.396
quished-5.370
ess-5.308
der-5.198
rely-5.122
TokenBeyond
Token ID24102
Feature activation-2.16e+00

Pos logit contributions
 assurances+0.651
 calculations+0.648
 Preferences+0.629
 negotiations+0.627
 Integrity+0.624
Neg logit contributions
nesday-0.797
quished-0.787
ess-0.782
der-0.777
rely-0.749
Token the
Token ID262
Feature activation-1.61e+00

Pos logit contributions
 assurances+7.110
 calculations+6.903
 negotiations+6.830
 remarks+6.733
 Integrity+6.543
Neg logit contributions
quished-10.772
nesday-10.326
ess-10.299
der-10.296
die-10.126
Token question
Token ID1808
Feature activation-1.22e+00

Pos logit contributions
 assurances+5.669
 calculations+5.598
 negotiations+5.434
 remarks+5.423
 negatives+5.276
Neg logit contributions
quished-7.972
nesday-7.825
der-7.712
ess-7.694
die-7.476
Token of
Token ID286
Feature activation-9.32e-01

Pos logit contributions
 assurances+3.777
 calculations+3.766
 negotiations+3.632
 Integrity+3.613
 Preferences+3.592
Neg logit contributions
nesday-6.352
quished-6.336
der-6.311
ess-6.292
rely-6.131
Token the
Token ID262
Feature activation-1.10e+00

Pos logit contributions
 assurances+3.131
 calculations+3.080
 negotiations+2.998
 remarks+2.941
 Integrity+2.931
Neg logit contributions
quished-4.907
nesday-4.783
der-4.774
ess-4.761
die-4.586
Token Keystone
Token ID29055
Feature activation-3.17e+00

Pos logit contributions
 assurances+3.687
 calculations+3.628
 negotiations+3.553
 remarks+3.499
 Integrity+3.440
Neg logit contributions
quished-5.812
nesday-5.681
der-5.655
ess-5.588
die-5.397
Token XL
Token ID16276
Feature activation-2.20e+00

Pos logit contributions
 negotiations+9.679
 calculations+9.233
 remarks+9.143
 assurances+9.086
 Preferences+8.589
Neg logit contributions
nesday-15.411
die-14.433
quished-14.427
joined-14.420
ess-14.418
Token Pipeline
Token ID37709
Feature activation-1.79e+00

Pos logit contributions
 negotiations+6.596
 calculations+6.214
 assurances+6.204
 remarks+6.179
 Preferences+5.776
Neg logit contributions
nesday-11.265
quished-11.039
joined-11.038
der-10.984
ess-10.961
Token,
Token ID11
Feature activation-1.19e+00

Pos logit contributions
 negotiations+6.399
 assurances+6.320
 calculations+6.273
 remarks+6.097
 Integrity+6.064
Neg logit contributions
nesday-8.233
quished-8.209
der-8.127
ess-8.050
joined-7.952
Token Clinton
Token ID2605
Feature activation-7.20e-01

Pos logit contributions
 assurances+4.304
 negotiations+4.223
 calculations+4.200
 remarks+4.124
 Integrity+4.079
Neg logit contributions
quished-5.764
nesday-5.678
der-5.536
ess-5.524
die-5.380
Token has
Token ID468
Feature activation-1.30e+00

Pos logit contributions
 assurances+3.043
 calculations+3.004
 negotiations+2.959
 remarks+2.955
 Preferences+2.891
Neg logit contributions
quished-3.029
ess-3.003
nesday-2.998
der-2.971
rely-2.876
Token conference
Token ID4495
Feature activation+3.67e-01

Pos logit contributions
 assurances+0.784
 calculations+0.784
 negotiations+0.762
 Integrity+0.753
 remarks+0.746
Neg logit contributions
ess-1.197
der-1.185
quished-1.179
nesday-1.174
rely-1.131
Token committee
Token ID5583
Feature activation+3.89e-01

Pos logit contributions
ess+1.812
der+1.800
nesday+1.782
quished+1.780
rely+1.725
Neg logit contributions
 assurances-1.325
 calculations-1.316
 negotiations-1.275
 Preferences-1.263
 Integrity-1.258
Token was
Token ID373
Feature activation+4.08e-01

Pos logit contributions
ess+1.889
der+1.875
nesday+1.857
quished+1.857
rely+1.799
Neg logit contributions
 assurances-1.431
 calculations-1.422
 negotiations-1.376
 Preferences-1.365
 Integrity-1.363
Token pushing
Token ID7796
Feature activation-6.77e-02

Pos logit contributions
ess+2.116
quished+2.106
der+2.104
nesday+2.101
rely+2.032
Neg logit contributions
 assurances-1.360
 calculations-1.343
 negotiations-1.312
 remarks-1.284
 Preferences-1.275
Token the
Token ID262
Feature activation-3.94e-01

Pos logit contributions
 assurances+0.249
 calculations+0.247
 negotiations+0.240
 Preferences+0.238
 Integrity+0.237
Neg logit contributions
ess-0.329
der-0.325
quished-0.323
nesday-0.322
rely-0.312
Token financial
Token ID3176
Feature activation-3.16e+00

Pos logit contributions
 assurances+1.237
 calculations+1.235
 negotiations+1.203
 Integrity+1.173
 remarks+1.170
Neg logit contributions
ess-2.117
quished-2.106
der-2.104
nesday-2.094
rely-2.016
Token reform
Token ID4975
Feature activation-8.13e-01

Pos logit contributions
 calculations+11.542
 assurances+11.472
 negotiations+11.378
 remarks+10.875
 briefings+10.615
Neg logit contributions
nesday-13.309
quished-13.257
ess-13.017
der-12.984
rely-12.777
Token bill
Token ID2855
Feature activation-4.74e-02

Pos logit contributions
 assurances+2.654
 calculations+2.637
 negotiations+2.617
 remarks+2.497
 Integrity+2.466
Neg logit contributions
quished-4.219
ess-4.219
nesday-4.195
der-4.167
rely-4.035
Token to
Token ID284
Feature activation-3.45e-01

Pos logit contributions
 assurances+0.181
 calculations+0.179
 negotiations+0.176
 Preferences+0.173
 Integrity+0.172
Neg logit contributions
ess-0.222
quished-0.221
der-0.220
nesday-0.218
rely-0.212
Token the
Token ID262
Feature activation-7.37e-01

Pos logit contributions
 assurances+1.169
 calculations+1.150
 negotiations+1.139
 Preferences+1.109
 Integrity+1.102
Neg logit contributions
ess-1.772
quished-1.770
nesday-1.755
der-1.750
rely-1.675
Token finish
Token ID5461
Feature activation-1.25e+00

Pos logit contributions
 assurances+2.398
 negotiations+2.380
 calculations+2.373
 remarks+2.297
 Integrity+2.285
Neg logit contributions
nesday-3.875
quished-3.867
der-3.836
ess-3.812
escent-3.652
Token Jim
Token ID5395
Feature activation-1.00e-01

Pos logit contributions
 assurances+6.931
 remarks+6.843
 calculations+6.723
 negotiations+6.658
 judgments+6.494
Neg logit contributions
quished-9.074
nesday-8.973
ess-8.891
der-8.674
joined-8.572
Token Bob
Token ID5811
Feature activation-5.12e-01

Pos logit contributions
 assurances+0.402
 calculations+0.399
 Preferences+0.387
 negotiations+0.385
 Integrity+0.384
Neg logit contributions
ess-0.454
der-0.451
nesday-0.444
quished-0.442
rely-0.429
Token
Token ID447
Feature activation-4.82e-02

Pos logit contributions
 assurances+2.325
 calculations+2.298
 negotiations+2.245
 Preferences+2.242
 Integrity+2.234
Neg logit contributions
ess-2.004
quished-1.983
nesday-1.978
der-1.974
rely-1.906
Token
Token ID247
Feature activation-8.17e-03

Pos logit contributions
 assurances+0.127
 calculations+0.124
 Integrity+0.121
 Preferences+0.119
 negotiations+0.119
Neg logit contributions
nesday-0.286
ess-0.284
quished-0.282
der-0.280
rely-0.273
Tokens
Token ID82
Feature activation-1.31e+00

Pos logit contributions
 assurances+0.047
 calculations+0.046
 Preferences+0.046
 negotiations+0.045
 Integrity+0.045
Neg logit contributions
ess-0.023
der-0.023
nesday-0.022
quished-0.022
rely-0.021
Token line
Token ID1627
Feature activation-3.16e+00

Pos logit contributions
 assurances+4.564
 remarks+4.443
 calculations+4.441
 negotiations+4.299
 judgments+4.228
Neg logit contributions
quished-6.538
nesday-6.477
ess-6.382
der-6.347
rely-6.183
Token that
Token ID326
Feature activation+1.83e-01

Pos logit contributions
 remarks+10.672
 assurances+10.572
 calculations+10.377
 negotiations+10.205
 judgments+10.143
Neg logit contributions
nesday-13.576
quished-13.567
der-12.977
ess-12.938
rely-12.758
Token
Token ID564
Feature activation-1.27e+00

Pos logit contributions
ess+0.851
der+0.840
nesday+0.838
quished+0.835
rely+0.803
Neg logit contributions
 assurances-0.717
 calculations-0.707
 negotiations-0.691
 Integrity-0.687
 Preferences-0.687
Token
Token ID250
Feature activation-1.39e+00

Pos logit contributions
 assurances+3.937
 calculations+3.884
 Integrity+3.873
 Preferences+3.789
 negotiations+3.779
Neg logit contributions
nesday-6.647
ess-6.310
quished-6.274
der-6.139
rely-5.954
Tokenthis
Token ID5661
Feature activation+4.82e-01

Pos logit contributions
 assurances+6.329
 calculations+6.259
 remarks+6.185
 Integrity+6.170
 negotiations+6.150
Neg logit contributions
nesday-4.939
quished-4.834
ess-4.492
der-4.343
escent-4.335
Token was
Token ID373
Feature activation-1.65e-01

Pos logit contributions
ess+2.378
der+2.352
nesday+2.343
quished+2.341
rely+2.270
Neg logit contributions
 assurances-1.732
 calculations-1.719
 negotiations-1.662
 Preferences-1.657
 Integrity-1.655
Token and
Token ID290
Feature activation-1.23e+00

Pos logit contributions
 assurances+4.969
 negotiations+4.863
 calculations+4.854
 remarks+4.717
 Integrity+4.602
Neg logit contributions
quished-5.725
nesday-5.640
ess-5.567
der-5.562
rely-5.463
Token Russia
Token ID3284
Feature activation-1.09e+00

Pos logit contributions
 assurances+4.039
 calculations+3.896
 negotiations+3.882
 remarks+3.840
 judgments+3.608
Neg logit contributions
quished-6.480
nesday-6.410
ess-6.370
der-6.248
die-6.137
Token
Token ID447
Feature activation+6.61e-02

Pos logit contributions
 assurances+5.083
 calculations+4.972
 negotiations+4.895
 remarks+4.851
 Preferences+4.798
Neg logit contributions
quished-4.070
nesday-4.058
ess-4.055
der-4.025
rely-3.925
Token
Token ID247
Feature activation-4.29e-01

Pos logit contributions
ess+0.380
nesday+0.379
der+0.377
quished+0.375
rely+0.364
Neg logit contributions
 assurances-0.185
 calculations-0.180
 Integrity-0.175
 Preferences-0.175
 negotiations-0.173
Tokens
Token ID82
Feature activation-2.05e+00

Pos logit contributions
 assurances+2.366
 calculations+2.334
 negotiations+2.301
 remarks+2.279
 Integrity+2.274
Neg logit contributions
quished-1.262
ess-1.262
nesday-1.262
der-1.235
rely-1.165
Token part
Token ID636
Feature activation-3.10e+00

Pos logit contributions
 assurances+5.861
 calculations+5.558
 negotiations+5.541
 remarks+5.524
 backing+5.197
Neg logit contributions
quished-11.609
nesday-11.154
ess-10.827
der-10.812
die-10.720
Token in
Token ID287
Feature activation-1.69e+00

Pos logit contributions
 negotiations+11.091
 calculations+11.049
 assurances+10.898
 remarks+10.546
 Preferences+10.464
Neg logit contributions
rely-12.425
nesday-12.410
die-12.073
quished-12.044
der-12.042
Token destabil
Token ID27397
Feature activation+2.38e-01

Pos logit contributions
 negotiations+5.430
 calculations+5.360
 assurances+5.357
 remarks+5.106
 Integrity+4.887
Neg logit contributions
quished-8.629
nesday-8.570
ess-8.549
der-8.461
rely-8.391
Tokenizing
Token ID2890
Feature activation+2.67e-01

Pos logit contributions
ess+0.766
der+0.755
nesday+0.742
quished+0.737
rely+0.707
Neg logit contributions
 assurances-1.261
 calculations-1.253
 Preferences-1.226
 negotiations-1.221
 Integrity-1.220
Token eastern
Token ID10183
Feature activation-2.23e-02

Pos logit contributions
ess+1.540
der+1.527
nesday+1.524
quished+1.517
rely+1.477
Neg logit contributions
 assurances-0.740
 calculations-0.733
 negotiations-0.700
 Preferences-0.694
 Integrity-0.688
Token Ukraine
Token ID7049
Feature activation+5.44e-01

Pos logit contributions
 assurances+0.058
 calculations+0.057
 negotiations+0.055
 Preferences+0.054
 Integrity+0.053
Neg logit contributions
ess-0.132
nesday-0.132
der-0.131
quished-0.131
rely-0.128
Token drawing
Token ID8263
Feature activation-1.80e+00

Pos logit contributions
 calculations+2.190
 assurances+2.186
 negotiations+2.167
 remarks+2.076
 Preferences+2.036
Neg logit contributions
ess-3.284
der-3.262
nesday-3.259
quished-3.259
rely-3.146
Token up
Token ID510
Feature activation-1.03e+00

Pos logit contributions
 calculations+5.124
 assurances+5.043
 negotiations+4.976
 remarks+4.828
 judgments+4.643
Neg logit contributions
nesday-9.871
quished-9.668
ess-9.564
der-9.514
die-9.504
Token the
Token ID262
Feature activation-6.22e-01

Pos logit contributions
 calculations+3.358
 assurances+3.313
 negotiations+3.232
 remarks+3.133
 Preferences+3.047
Neg logit contributions
nesday-5.478
quished-5.416
ess-5.403
der-5.394
rely-5.222
Token "
Token ID366
Feature activation-5.59e-01

Pos logit contributions
 calculations+2.001
 assurances+1.993
 negotiations+1.928
 remarks+1.868
 Preferences+1.853
Neg logit contributions
nesday-3.321
ess-3.311
quished-3.306
der-3.305
rely-3.172
TokenStr
Token ID13290
Feature activation-2.86e+00

Pos logit contributions
 calculations+2.308
 assurances+2.297
 negotiations+2.244
 Preferences+2.202
 Integrity+2.185
Neg logit contributions
nesday-2.405
quished-2.368
ess-2.353
der-2.329
rely-2.257
Tokenategic
Token ID47917
Feature activation-3.09e+00

Pos logit contributions
 calculations+7.106
 negotiations+7.102
 assurances+6.824
 Preferences+6.421
 remarks+6.401
Neg logit contributions
quished-14.560
nesday-14.519
der-13.926
rely-13.857
ess-13.766
Token plan
Token ID1410
Feature activation-1.33e+00

Pos logit contributions
 Preferences+8.711
 calculations+8.634
 assurances+8.388
 Integrity+8.373
 negotiations+8.271
Neg logit contributions
quished-15.682
nesday-15.567
der-15.004
ess-14.998
escent-14.913
Token for
Token ID329
Feature activation-6.21e-01

Pos logit contributions
 calculations+4.976
 assurances+4.925
 negotiations+4.850
 remarks+4.745
 Preferences+4.707
Neg logit contributions
quished-6.128
nesday-6.124
ess-6.020
der-6.001
rely-5.932
Token deployment
Token ID14833
Feature activation-5.59e-01

Pos logit contributions
 calculations+2.207
 assurances+2.207
 negotiations+2.169
 Preferences+2.112
 Integrity+2.078
Neg logit contributions
ess-3.063
nesday-3.044
der-3.044
quished-3.027
rely-2.911
Token of
Token ID286
Feature activation-1.28e-02

Pos logit contributions
 assurances+1.851
 calculations+1.847
 negotiations+1.789
 Preferences+1.756
 remarks+1.740
Neg logit contributions
der-2.877
ess-2.868
nesday-2.863
quished-2.849
rely-2.773
Token the
Token ID262
Feature activation-4.16e-01

Pos logit contributions
 assurances+0.047
 calculations+0.046
 negotiations+0.045
 Preferences+0.044
 Integrity+0.044
Neg logit contributions
ess-0.062
nesday-0.062
der-0.062
quished-0.061
rely-0.059
Token Ke
Token ID3873
Feature activation-4.98e-01

Pos logit contributions
 assurances+1.411
 calculations+1.396
 negotiations+1.360
 Preferences+1.356
 remarks+1.356
Neg logit contributions
nesday-1.209
ess-1.206
quished-1.201
der-1.190
rely-1.141
Token Kan
Token ID14248
Feature activation-2.44e-01

Pos logit contributions
 assurances+1.925
 calculations+1.906
 Preferences+1.854
 Integrity+1.849
 negotiations+1.848
Neg logit contributions
nesday-2.275
ess-2.248
der-2.243
quished-2.238
rely-2.160
Tokenak
Token ID461
Feature activation-6.71e-01

Pos logit contributions
 assurances+1.049
 calculations+1.042
 Preferences+1.011
 negotiations+1.009
 Integrity+1.007
Neg logit contributions
ess-1.013
der-1.007
nesday-1.001
quished-0.992
rely-0.963
Tokenol
Token ID349
Feature activation+7.48e-01

Pos logit contributions
 assurances+3.170
 calculations+3.166
 Integrity+3.090
 Preferences+3.079
 negotiations+3.079
Neg logit contributions
nesday-2.413
der-2.400
ess-2.378
quished-2.378
rely-2.275
Tokenu
Token ID84
Feature activation+6.21e-01

Pos logit contributions
ess+2.952
der+2.878
quished+2.804
nesday+2.775
rely+2.685
Neg logit contributions
 assurances-3.460
 calculations-3.430
 Preferences-3.344
 Integrity-3.309
 negotiations-3.307
Token ki
Token ID47748
Feature activation-3.09e+00

Pos logit contributions
ess+2.883
der+2.856
nesday+2.822
quished+2.809
rely+2.730
Neg logit contributions
 assurances-2.411
 calculations-2.390
 Preferences-2.320
 negotiations-2.307
 Integrity-2.303
Token
Token ID134
Feature activation-2.58e+00

Pos logit contributions
 assurances+11.327
 calculations+11.204
 remarks+11.051
 negotiations+11.023
 Integrity+10.933
Neg logit contributions
nesday-11.472
quished-11.216
der-10.953
rely-10.860
die-10.777
Token
Token ID119
Feature activation-1.23e+00

Pos logit contributions
 assurances+3.760
 calculations+3.565
 Preferences+3.307
 remarks+3.200
 negotiations+3.193
Neg logit contributions
nesday-16.562
quished-16.092
ess-15.927
die-15.796
rely-15.782
Tokeni
Token ID72
Feature activation-1.67e+00

Pos logit contributions
 calculations+5.300
 assurances+5.287
 negotiations+5.184
 remarks+5.181
 Integrity+5.178
Neg logit contributions
nesday-4.593
quished-4.537
der-4.424
ess-4.319
rely-4.255
Token pa
Token ID14187
Feature activation+3.71e-03

Pos logit contributions
 assurances+5.997
 calculations+5.937
 remarks+5.803
 negotiations+5.790
 Integrity+5.713
Neg logit contributions
nesday-7.530
quished-7.390
der-7.286
rely-7.131
ess-7.098
Tokenia
Token ID544
Feature activation-2.21e-01

Pos logit contributions
nesday+0.017
ess+0.017
der+0.017
quished+0.016
rely+0.016
Neg logit contributions
 assurances-0.015
 calculations-0.015
 negotiations-0.014
 Preferences-0.014
 Integrity-0.014
Token
Token ID447
Feature activation+1.23e-01

Pos logit contributions
 calculations+3.968
 assurances+3.908
 remarks+3.812
 negotiations+3.786
 Preferences+3.774
Neg logit contributions
nesday-3.913
quished-3.809
der-3.748
ess-3.744
rely-3.684
Token
Token ID251
Feature activation-5.34e-01

Pos logit contributions
ess+0.767
der+0.762
nesday+0.757
quished+0.754
rely+0.737
Neg logit contributions
 assurances-0.283
 calculations-0.279
 Preferences-0.265
 negotiations-0.262
 Integrity-0.262
Token stems
Token ID21552
Feature activation-1.01e+00

Pos logit contributions
 assurances+2.075
 calculations+2.064
 remarks+2.012
 negotiations+2.004
 Preferences+1.988
Neg logit contributions
quished-2.469
nesday-2.442
ess-2.412
der-2.394
rely-2.322
Token from
Token ID422
Feature activation-7.85e-01

Pos logit contributions
 assurances+3.877
 calculations+3.842
 negotiations+3.723
 Preferences+3.700
 remarks+3.679
Neg logit contributions
ess-4.642
der-4.586
nesday-4.573
quished-4.566
rely-4.401
Token his
Token ID465
Feature activation-1.21e+00

Pos logit contributions
 calculations+2.666
 assurances+2.664
 negotiations+2.571
 remarks+2.558
 Preferences+2.538
Neg logit contributions
ess-3.973
quished-3.961
nesday-3.952
der-3.927
rely-3.808
Token sense
Token ID2565
Feature activation-3.08e+00

Pos logit contributions
 calculations+3.838
 assurances+3.795
 remarks+3.694
 negotiations+3.662
 judgments+3.640
Neg logit contributions
quished-6.388
nesday-6.347
ess-6.233
der-6.138
rely-6.029
Token of
Token ID286
Feature activation-1.77e+00

Pos logit contributions
 calculations+10.221
 assurances+10.124
 judgments+10.009
 Preferences+9.964
 remarks+9.818
Neg logit contributions
nesday-13.693
quished-13.492
die-13.054
ess-13.030
der-13.001
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
 assurances+6.188
 calculations+6.069
 negotiations+5.972
 Preferences+5.968
 remarks+5.931
Neg logit contributions
quished-8.289
nesday-8.216
der-7.954
die-7.926
ess-7.899
Token word
Token ID1573
Feature activation-1.92e+00

Pos logit contributions
 assurances+4.485
 calculations+4.414
 negotiations+4.314
 remarks+4.310
 Preferences+4.268
Neg logit contributions
quished-6.013
nesday-5.966
der-5.834
ess-5.757
rely-5.702
Token
Token ID564
Feature activation-1.77e+00

Pos logit contributions
 assurances+7.098
 calculations+6.902
 remarks+6.840
 negotiations+6.840
 Preferences+6.775
Neg logit contributions
nesday-8.270
quished-8.246
der-8.087
ess-7.979
rely-7.909
Token
Token ID250
Feature activation-1.11e+00

Pos logit contributions
 assurances+5.246
 Integrity+5.174
 calculations+5.150
 Preferences+5.041
 judgments+4.981
Neg logit contributions
nesday-8.992
ess-8.667
quished-8.634
der-8.525
rely-8.342
Token Phi
Token ID47256
Feature activation-2.85e+00

Pos logit contributions
 assurances+2.614
 calculations+2.600
 negotiations+2.538
 Integrity+2.520
 Preferences+2.478
Neg logit contributions
quished-3.953
ess-3.945
nesday-3.937
der-3.926
rely-3.766
Token K
Token ID509
Feature activation-7.96e-01

Pos logit contributions
 calculations+7.459
 Integrity+7.283
 negotiations+7.238
 assurances+7.202
 Preferences+6.897
Neg logit contributions
nesday-15.190
quished-14.655
der-14.499
ess-14.306
die-14.133
Tokenaps
Token ID1686
Feature activation-1.02e+00

Pos logit contributions
 calculations+2.289
 assurances+2.218
 negotiations+2.149
 remarks+2.130
 coefficients+2.064
Neg logit contributions
nesday-4.481
quished-4.304
der-4.302
ess-4.257
die-4.220
Token
Token ID447
Feature activation-1.04e+00

Pos logit contributions
 assurances+3.953
 calculations+3.946
 negotiations+3.858
 Preferences+3.819
 Integrity+3.817
Neg logit contributions
quished-4.631
ess-4.622
der-4.572
nesday-4.535
rely-4.423
Token
Token ID247
Feature activation-1.57e+00

Pos logit contributions
 assurances+2.794
 calculations+2.752
 Integrity+2.663
 Preferences+2.644
 negotiations+2.618
Neg logit contributions
ess-5.911
der-5.897
nesday-5.876
quished-5.823
rely-5.718
Token part
Token ID636
Feature activation-3.07e+00

Pos logit contributions
 calculations+5.639
 assurances+5.593
 negotiations+5.461
 Integrity+5.433
 remarks+5.376
Neg logit contributions
quished-7.421
nesday-7.325
ess-7.258
der-7.119
rely-6.957
Token to
Token ID284
Feature activation-8.34e-01

Pos logit contributions
 calculations+11.112
 negotiations+11.086
 assurances+11.083
 remarks+10.679
 preferences+10.487
Neg logit contributions
rely-11.611
quished-11.517
nesday-11.386
ess-11.184
der-11.087
Token pursue
Token ID10660
Feature activation-5.16e-01

Pos logit contributions
 calculations+2.322
 assurances+2.297
 negotiations+2.257
 Integrity+2.205
 Preferences+2.199
Neg logit contributions
quished-4.663
nesday-4.656
ess-4.656
der-4.598
rely-4.418
Token dis
Token ID595
Feature activation+1.05e+00

Pos logit contributions
 assurances+1.731
 calculations+1.712
 negotiations+1.693
 Preferences+1.650
 Integrity+1.649
Neg logit contributions
ess-2.656
nesday-2.641
quished-2.630
der-2.624
rely-2.540
Tokenaff
Token ID2001
Feature activation+7.90e-01

Pos logit contributions
ess+4.032
der+3.988
nesday+3.952
quished+3.925
rely+3.780
Neg logit contributions
 assurances-4.922
 calculations-4.889
 Preferences-4.766
 negotiations-4.752
 Integrity-4.740
Tokeniliation
Token ID15547
Feature activation+1.95e-01

Pos logit contributions
ess+3.200
der+3.187
nesday+3.172
quished+3.148
rely+3.023
Neg logit contributions
 assurances-3.513
 calculations-3.502
 Preferences-3.397
 negotiations-3.396
 Integrity-3.388
Token in
Token ID287
Feature activation-4.06e-01

Pos logit contributions
ess+1.257
der+1.247
nesday+1.235
quished+1.228
rely+1.190
Neg logit contributions
 assurances-1.316
 calculations-1.311
 Preferences-1.275
 negotiations-1.270
 Integrity-1.264
Token Trump
Token ID1301
Feature activation+2.11e-01

Pos logit contributions
 calculations+1.375
 assurances+1.360
 negotiations+1.319
 Preferences+1.304
 remarks+1.295
Neg logit contributions
ess-2.085
der-2.073
quished-2.070
nesday-2.064
rely-1.986
Token
Token ID447
Feature activation-3.95e-01

Pos logit contributions
ess+0.878
der+0.867
nesday+0.852
quished+0.846
rely+0.822
Neg logit contributions
 assurances-0.924
 calculations-0.913
 Preferences-0.892
 Integrity-0.887
 negotiations-0.885
Token
Token ID247
Feature activation-3.71e-01

Pos logit contributions
 assurances+0.989
 calculations+0.959
 Integrity+0.938
 Preferences+0.921
 negotiations+0.917
Neg logit contributions
nesday-2.399
quished-2.369
ess-2.348
der-2.347
rely-2.275
Tokens
Token ID82
Feature activation-1.55e+00

Pos logit contributions
 calculations+2.017
 assurances+2.000
 negotiations+1.972
 Preferences+1.952
 remarks+1.949
Neg logit contributions
quished-1.172
nesday-1.168
ess-1.111
der-1.100
joined-1.046
Token financial
Token ID3176
Feature activation-3.07e+00

Pos logit contributions
 calculations+4.771
 assurances+4.576
 negotiations+4.517
 remarks+4.504
 judgments+4.262
Neg logit contributions
quished-8.436
nesday-8.263
der-8.128
ess-8.123
die-8.038
Token disclosure
Token ID13019
Feature activation-1.51e+00

Pos logit contributions
 calculations+12.867
 assurances+12.281
 remarks+12.142
 negotiations+11.931
 judgments+11.728
Neg logit contributions
nesday-11.648
quished-11.554
joined-10.894
die-10.771
escent-10.761
Token this
Token ID428
Feature activation+6.98e-01

Pos logit contributions
 calculations+5.800
 assurances+5.604
 negotiations+5.454
 remarks+5.359
 briefings+5.327
Neg logit contributions
nesday-6.931
quished-6.822
der-6.759
ess-6.720
joined-6.688
Token year
Token ID614
Feature activation-1.23e+00

Pos logit contributions
ess+2.275
der+2.237
nesday+2.193
quished+2.160
rely+2.100
Neg logit contributions
 assurances-3.660
 calculations-3.624
 Preferences-3.563
 Integrity-3.531
 negotiations-3.529
Token from
Token ID422
Feature activation-6.21e-01

Pos logit contributions
 calculations+5.134
 assurances+5.075
 negotiations+4.949
 remarks+4.897
 Preferences+4.863
Neg logit contributions
nesday-5.052
ess-5.024
quished-5.012
der-4.975
rely-4.852
Token last
Token ID938
Feature activation+1.14e-01

Pos logit contributions
 calculations+2.018
 assurances+1.993
 negotiations+1.935
 Integrity+1.907
 remarks+1.894
Neg logit contributions
ess-3.262
quished-3.244
der-3.238
nesday-3.222
die-3.109
Tokenal
Token ID282
Feature activation+8.57e-01

Pos logit contributions
ess+3.165
der+3.121
nesday+3.088
quished+3.073
rely+2.920
Neg logit contributions
 assurances-5.606
 calculations-5.576
 Preferences-5.455
 negotiations-5.434
 Integrity-5.431
Tokenise
Token ID786
Feature activation-4.45e-01

Pos logit contributions
ess+1.190
der+1.172
nesday+1.167
quished+1.160
rely+0.996
Neg logit contributions
 assurances-6.081
 calculations-6.052
 Preferences-5.956
 negotiations-5.946
 Integrity-5.941
Token
Token ID447
Feature activation-1.70e-01

Pos logit contributions
 assurances+1.831
 calculations+1.820
 Preferences+1.765
 negotiations+1.764
 Integrity+1.744
Neg logit contributions
ess-1.929
quished-1.919
nesday-1.915
der-1.910
rely-1.847
Token
Token ID247
Feature activation-2.94e-01

Pos logit contributions
 assurances+0.434
 calculations+0.421
 Integrity+0.411
 Preferences+0.405
 negotiations+0.403
Neg logit contributions
nesday-1.029
quished-1.021
ess-1.010
der-1.003
rely-0.974
Tokens
Token ID82
Feature activation-1.27e+00

Pos logit contributions
 assurances+1.616
 calculations+1.602
 Preferences+1.571
 negotiations+1.568
 Integrity+1.567
Neg logit contributions
quished-0.882
nesday-0.862
ess-0.848
der-0.836
joined-0.801
Token point
Token ID966
Feature activation-3.05e+00

Pos logit contributions
 calculations+4.456
 assurances+4.446
 remarks+4.237
 negotiations+4.233
 judgments+4.211
Neg logit contributions
quished-6.492
nesday-6.423
ess-6.210
der-6.175
joined-6.082
Token of
Token ID286
Feature activation-1.67e+00

Pos logit contributions
 calculations+9.578
 assurances+9.312
 negotiations+8.988
 judgments+8.913
 preferences+8.758
Neg logit contributions
nesday-14.275
quished-14.239
joined-13.870
rely-13.583
lete-13.527
Token view
Token ID1570
Feature activation-1.81e+00

Pos logit contributions
 calculations+3.136
 assurances+3.075
 negotiations+2.918
 remarks+2.904
 judgments+2.792
Neg logit contributions
nesday-10.764
quished-10.763
ess-10.559
der-10.515
die-10.340
Token?
Token ID30
Feature activation-2.90e-01

Pos logit contributions
 calculations+6.798
 assurances+6.561
 negotiations+6.512
 remarks+6.414
 judgments+6.374
Neg logit contributions
quished-8.024
nesday-7.963
ess-7.680
rely-7.654
joined-7.649
Token Does
Token ID8314
Feature activation+1.17e-01

Pos logit contributions
 assurances+0.955
 Integrity+0.953
 Preferences+0.952
 calculations+0.938
 remarks+0.917
Neg logit contributions
nesday-1.532
quished-1.502
der-1.475
ess-1.468
joined-1.438
Token that
Token ID326
Feature activation+1.41e+00

Pos logit contributions
ess+0.542
der+0.538
quished+0.535
nesday+0.534
rely+0.516
Neg logit contributions
 assurances-0.452
 calculations-0.447
 negotiations-0.433
 Integrity-0.433
 Preferences-0.431
Token.
Token ID13
Feature activation-1.30e-01

Pos logit contributions
ess+0.793
der+0.785
nesday+0.761
quished+0.752
rely+0.724
Neg logit contributions
 assurances-1.150
 calculations-1.138
 Preferences-1.121
 Integrity-1.116
 negotiations-1.109
Tokencom
Token ID785
Feature activation+8.69e-02

Pos logit contributions
 assurances+0.683
 calculations+0.679
 Preferences+0.664
 Integrity+0.662
 negotiations+0.661
Neg logit contributions
ess-0.428
der-0.422
nesday-0.412
quished-0.408
rely-0.394
Token/
Token ID14
Feature activation-1.85e-01

Pos logit contributions
ess+0.355
quished+0.350
der+0.349
nesday+0.348
rely+0.336
Neg logit contributions
 calculations-0.386
 assurances-0.385
 negotiations-0.373
 Preferences-0.373
 Integrity-0.370
Tokenapi
Token ID15042
Feature activation-8.92e-01

Pos logit contributions
 calculations+0.805
 assurances+0.804
 Preferences+0.783
 negotiations+0.780
 Integrity+0.773
Neg logit contributions
ess-0.760
quished-0.753
nesday-0.752
der-0.744
rely-0.721
Token is
Token ID318
Feature activation+1.00e-02

Pos logit contributions
 calculations+3.631
 assurances+3.606
 negotiations+3.530
 Preferences+3.495
 Integrity+3.431
Neg logit contributions
quished-3.886
nesday-3.858
ess-3.857
der-3.740
rely-3.711
Token deprecated
Token ID39224
Feature activation+2.00e+00

Pos logit contributions
quished+0.047
nesday+0.047
ess+0.046
der+0.046
rely+0.044
Neg logit contributions
 assurances-0.038
 calculations-0.038
 negotiations-0.037
 Preferences-0.037
 Integrity-0.037
Token and
Token ID290
Feature activation+5.41e-01

Pos logit contributions
ess+8.999
der+8.978
nesday+8.562
quished+8.531
rely+8.501
Neg logit contributions
 assurances-7.569
 calculations-7.355
 Integrity-7.226
 Preferences-7.185
 negotiations-7.129
Token you
Token ID345
Feature activation+9.58e-01

Pos logit contributions
ess+2.320
nesday+2.287
quished+2.286
der+2.285
rely+2.172
Neg logit contributions
 assurances-2.317
 calculations-2.305
 Preferences-2.236
 negotiations-2.235
 Integrity-2.222
Token should
Token ID815
Feature activation+3.42e-01

Pos logit contributions
ess+3.669
der+3.633
nesday+3.558
quished+3.529
rely+3.439
Neg logit contributions
 assurances-4.474
 calculations-4.443
 Preferences-4.336
 Integrity-4.319
 negotiations-4.315
Token use
Token ID779
Feature activation+5.87e-01

Pos logit contributions
ess+1.759
quished+1.749
der+1.735
nesday+1.730
rely+1.661
Neg logit contributions
 assurances-1.158
 calculations-1.149
 Preferences-1.117
 negotiations-1.104
 Integrity-1.095
Token https
Token ID3740
Feature activation-9.77e-01

Pos logit contributions
ess+2.780
der+2.754
nesday+2.721
quished+2.705
rely+2.635
Neg logit contributions
 assurances-2.213
 calculations-2.195
 Preferences-2.124
 negotiations-2.114
 Integrity-2.112
Token investment
Token ID4896
Feature activation-1.44e-01

Pos logit contributions
ess+1.374
der+1.362
nesday+1.355
quished+1.355
rely+1.319
Neg logit contributions
 assurances-0.985
 calculations-0.981
 negotiations-0.957
 Preferences-0.939
 Integrity-0.935
Token which
Token ID543
Feature activation+9.00e-01

Pos logit contributions
 assurances+0.542
 calculations+0.539
 negotiations+0.530
 Preferences+0.519
 remarks+0.514
Neg logit contributions
ess-0.681
der-0.678
quished-0.677
nesday-0.675
rely-0.660
Token would
Token ID561
Feature activation+1.98e-01

Pos logit contributions
ess+3.754
der+3.712
nesday+3.669
quished+3.653
rely+3.531
Neg logit contributions
 assurances-3.929
 calculations-3.901
 Preferences-3.794
 negotiations-3.785
 Integrity-3.769
Token be
Token ID307
Feature activation+5.10e-01

Pos logit contributions
quished+1.025
ess+1.018
nesday+1.009
der+1.003
joined+0.975
Neg logit contributions
 assurances-0.665
 calculations-0.664
 negotiations-0.643
 Preferences-0.632
 remarks-0.625
Token a
Token ID257
Feature activation+1.45e+00

Pos logit contributions
ess+2.440
der+2.423
nesday+2.407
quished+2.395
rely+2.322
Neg logit contributions
 assurances-1.907
 calculations-1.892
 negotiations-1.829
 Preferences-1.823
 Integrity-1.810
Token catastrophe
Token ID27767
Feature activation+1.97e+00

Pos logit contributions
ess+6.891
der+6.857
quished+6.678
nesday+6.672
rely+6.522
Neg logit contributions
 assurances-5.277
 calculations-5.223
 Preferences-5.096
 Integrity-5.090
 negotiations-5.000
Token for
Token ID329
Feature activation+6.35e-01

Pos logit contributions
ess+8.081
der+7.960
quished+7.683
nesday+7.537
rely+7.440
Neg logit contributions
 assurances-8.344
 calculations-8.179
 Integrity-8.129
 Preferences-8.116
 remarks-7.839
Token the
Token ID262
Feature activation+1.42e-01

Pos logit contributions
ess+3.032
der+3.004
nesday+2.977
quished+2.967
rely+2.875
Neg logit contributions
 assurances-2.387
 calculations-2.367
 Preferences-2.291
 negotiations-2.289
 Integrity-2.278
Token area
Token ID1989
Feature activation-6.50e-01

Pos logit contributions
ess+0.710
nesday+0.707
der+0.706
quished+0.703
rely+0.680
Neg logit contributions
 assurances-0.504
 calculations-0.501
 negotiations-0.486
 Integrity-0.480
 Preferences-0.478
Token)
Token ID8
Feature activation-1.74e-01

Pos logit contributions
 calculations+2.601
 assurances+2.579
 negotiations+2.537
 Preferences+2.458
 Integrity+2.456
Neg logit contributions
nesday-2.932
ess-2.874
quished-2.862
der-2.840
rely-2.773
Token and
Token ID290
Feature activation-1.40e-01

Pos logit contributions
 assurances+0.571
 calculations+0.567
 negotiations+0.551
 Preferences+0.545
 Integrity+0.542
Neg logit contributions
ess-0.903
der-0.898
nesday-0.893
quished-0.891
rely-0.870
Token she
Token ID673
Feature activation+4.64e-01

Pos logit contributions
ess+5.360
der+5.315
nesday+5.172
quished+5.129
rely+5.068
Neg logit contributions
 assurances-4.558
 calculations-4.550
 Preferences-4.446
 Integrity-4.404
 negotiations-4.353
Token was
Token ID373
Feature activation+3.94e-01

Pos logit contributions
ess+1.848
der+1.824
nesday+1.815
quished+1.809
rely+1.738
Neg logit contributions
 assurances-2.107
 calculations-2.093
 Preferences-2.035
 negotiations-2.032
 Integrity-2.024
Token covert
Token ID22624
Feature activation+5.25e-01

Pos logit contributions
ess+1.975
nesday+1.954
der+1.953
quished+1.952
rely+1.883
Neg logit contributions
 assurances-1.382
 calculations-1.366
 negotiations-1.322
 Preferences-1.315
 Integrity-1.310
Tokenly
Token ID306
Feature activation-4.22e-01

Pos logit contributions
ess+1.078
der+1.046
nesday+1.039
quished+1.028
rely+0.943
Neg logit contributions
 assurances-3.407
 calculations-3.388
 negotiations-3.327
 Preferences-3.321
 Integrity-3.303
Token recorded
Token ID6264
Feature activation+4.23e-01

Pos logit contributions
 assurances+1.462
 calculations+1.440
 negotiations+1.420
 remarks+1.408
 judgments+1.365
Neg logit contributions
nesday-2.122
quished-2.107
ess-2.076
der-2.051
rely-2.023
Token und
Token ID3318
Feature activation+1.99e+00

Pos logit contributions
ess+2.110
nesday+2.110
quished+2.107
der+2.090
rely+2.022
Neg logit contributions
 assurances-1.503
 calculations-1.481
 negotiations-1.448
 remarks-1.434
 Preferences-1.420
Tokenressing
Token ID11697
Feature activation+1.00e+00

Pos logit contributions
ess+4.316
der+4.176
quished+3.861
nesday+3.700
joined+3.667
Neg logit contributions
 assurances-12.471
 calculations-12.322
 Preferences-12.299
 Integrity-12.088
 negotiations-12.030
Token in
Token ID287
Feature activation-4.97e-01

Pos logit contributions
ess+4.498
der+4.453
nesday+4.393
quished+4.372
rely+4.247
Neg logit contributions
 assurances-4.060
 calculations-4.027
 Preferences-3.916
 negotiations-3.892
 Integrity-3.888
Token a
Token ID257
Feature activation+5.41e-01

Pos logit contributions
 assurances+1.657
 calculations+1.633
 negotiations+1.597
 Preferences+1.584
 remarks+1.584
Neg logit contributions
nesday-2.562
ess-2.557
der-2.542
quished-2.539
rely-2.456
Token changing
Token ID5609
Feature activation-5.14e-01

Pos logit contributions
ess+2.541
der+2.517
nesday+2.503
quished+2.490
rely+2.411
Neg logit contributions
 assurances-2.076
 calculations-2.056
 Preferences-1.995
 negotiations-1.987
 Integrity-1.978
Token room
Token ID2119
Feature activation+9.05e-01

Pos logit contributions
 assurances+1.543
 calculations+1.528
 negotiations+1.489
 remarks+1.452
 Preferences+1.447
Neg logit contributions
nesday-2.864
quished-2.793
ess-2.775
der-2.735
die-2.703
Token his
Token ID465
Feature activation-1.42e+00

Pos logit contributions
 assurances+1.830
 calculations+1.784
 negotiations+1.763
 remarks+1.727
 Preferences+1.670
Neg logit contributions
ess-2.542
quished-2.541
nesday-2.529
der-2.507
die-2.437
Tokenparty
Token ID10608
Feature activation-1.15e-01

Pos logit contributions
 assurances+4.895
 calculations+4.823
 remarks+4.699
 negotiations+4.691
 judgments+4.415
Neg logit contributions
quished-7.324
nesday-7.201
ess-7.140
der-7.046
rely-6.877
Token suffered
Token ID6989
Feature activation+2.41e-01

Pos logit contributions
 assurances+0.447
 calculations+0.445
 negotiations+0.431
 Preferences+0.426
 remarks+0.424
Neg logit contributions
ess-0.535
nesday-0.533
quished-0.531
der-0.527
rely-0.513
Token a
Token ID257
Feature activation+1.29e+00

Pos logit contributions
ess+1.114
der+1.100
quished+1.091
nesday+1.090
rely+1.038
Neg logit contributions
 assurances-0.919
 calculations-0.911
 Preferences-0.909
 Integrity-0.895
 negotiations-0.875
Token humiliating
Token ID37743
Feature activation+1.85e+00

Pos logit contributions
ess+5.909
der+5.869
nesday+5.744
quished+5.712
rely+5.587
Neg logit contributions
 assurances-4.868
 calculations-4.842
 Preferences-4.787
 Integrity-4.737
 negotiations-4.636
Token defeat
Token ID7433
Feature activation+2.08e+00

Pos logit contributions
ess+9.235
der+9.159
nesday+8.811
quished+8.787
rely+8.784
Neg logit contributions
 Preferences-6.013
 assurances-6.011
 calculations-5.960
 Integrity-5.845
 negotiations-5.665
Token to
Token ID284
Feature activation+9.45e-01

Pos logit contributions
ess+9.356
der+9.115
nesday+8.799
quished+8.730
rely+8.651
Neg logit contributions
 assurances-7.834
 calculations-7.730
 Preferences-7.702
 Integrity-7.642
 coefficients-7.393
Token the
Token ID262
Feature activation+3.33e-01

Pos logit contributions
ess+4.092
der+4.065
nesday+4.017
quished+3.835
die+3.783
Neg logit contributions
 assurances-3.791
 calculations-3.750
 Preferences-3.746
 Integrity-3.672
 coefficients-3.628
Token euros
Token ID21138
Feature activation-1.26e+00

Pos logit contributions
ess+1.824
der+1.812
nesday+1.787
quished+1.777
rely+1.736
Neg logit contributions
 assurances-1.006
 calculations-0.994
 Preferences-0.964
 Integrity-0.955
 negotiations-0.947
Tokencept
Token ID984
Feature activation+2.05e-01

Pos logit contributions
 calculations+3.487
 assurances+3.434
 remarks+3.428
 negotiations+3.389
 Integrity+3.203
Neg logit contributions
quished-6.611
nesday-6.565
ess-6.456
der-6.191
 thous-6.185
Tokenic
Token ID291
Feature activation+7.02e-01

Pos logit contributions
nesday+0.858
quished+0.845
ess+0.840
der+0.817
rely+0.793
Neg logit contributions
 calculations-0.879
 assurances-0.865
 negotiations-0.853
 remarks-0.848
 Integrity-0.845
Token moving
Token ID3867
Feature activation+3.47e-01

Pos logit contributions
ess+3.684
der+3.667
nesday+3.652
quished+3.641
rely+3.542
Neg logit contributions
 assurances-1.688
 calculations-1.676
 negotiations-1.602
 Preferences-1.584
 Integrity-1.565
Token along
Token ID1863
Feature activation+1.18e+00

Pos logit contributions
nesday+1.715
ess+1.710
der+1.707
quished+1.702
rely+1.644
Neg logit contributions
 assurances-1.241
 calculations-1.234
 negotiations-1.199
 Preferences-1.173
 remarks-1.169
Token.
Token ID13
Feature activation+7.14e-01

Pos logit contributions
ess+4.001
der+3.946
nesday+3.840
quished+3.831
rely+3.697
Neg logit contributions
 assurances-6.068
 calculations-6.021
 Preferences-5.890
 negotiations-5.855
 Integrity-5.854
Token @
Token ID2488
Feature activation+4.40e-01

Pos logit contributions
ess+3.454
der+3.419
nesday+3.363
quished+3.342
rely+3.282
Neg logit contributions
 assurances-2.627
 calculations-2.596
 Preferences-2.502
 negotiations-2.495
 Integrity-2.478
TokenMaster
Token ID18254
Feature activation+1.27e+00

Pos logit contributions
ess+1.917
der+1.896
nesday+1.875
quished+1.865
rely+1.808
Neg logit contributions
 assurances-1.828
 calculations-1.813
 Preferences-1.764
 negotiations-1.755
 Integrity-1.753
TokenD
Token ID35
Feature activation+2.07e+00

Pos logit contributions
ess+5.798
der+5.648
quished+5.368
nesday+5.309
die+5.240
Neg logit contributions
 assurances-5.084
 calculations-5.012
 Preferences-4.919
 negotiations-4.814
 remarks-4.760
Tokenal
Token ID282
Feature activation+4.66e-01

Pos logit contributions
ess+7.553
der+7.427
nesday+6.957
quished+6.904
die+6.833
Neg logit contributions
 assurances-9.949
 Preferences-9.732
 calculations-9.606
 negotiations-9.476
 Integrity-9.402
TokenK
Token ID42
Feature activation-3.43e-01

Pos logit contributions
ess+1.873
der+1.863
quished+1.812
nesday+1.802
rely+1.739
Neg logit contributions
 assurances-2.115
 calculations-2.090
 Preferences-2.051
 negotiations-2.031
 Integrity-2.027
Token |
Token ID930
Feature activation-9.94e-01

Pos logit contributions
 assurances+1.301
 calculations+1.295
 Preferences+1.253
 negotiations+1.250
 Integrity+1.249
Neg logit contributions
ess-1.604
nesday-1.590
der-1.584
quished-1.575
rely-1.525
Token Technical
Token ID20671
Feature activation-1.35e+00

Pos logit contributions
 calculations+3.371
 assurances+3.350
 Preferences+3.323
 Integrity+3.289
 negotiations+3.238
Neg logit contributions
nesday-4.897
ess-4.891
quished-4.860
der-4.791
joined-4.642
Token Director
Token ID5890
Feature activation+1.62e-01

Pos logit contributions
 calculations+4.231
 Integrity+4.183
 assurances+4.145
 Preferences+4.068
 remarks+4.015
Neg logit contributions
nesday-6.999
ess-6.935
quished-6.906
der-6.725
joined-6.684
TokenThe
Token ID464
Feature activation-4.36e-01

Pos logit contributions
nesday+0.718
quished+0.709
ess+0.706
der+0.699
rely+0.675
Neg logit contributions
 assurances-0.595
 calculations-0.590
 Preferences-0.582
 Integrity-0.576
 negotiations-0.574
Token younger
Token ID7099
Feature activation-1.06e+00

Pos logit contributions
 assurances+1.576
 calculations+1.572
 negotiations+1.509
 Preferences+1.496
 remarks+1.488
Neg logit contributions
quished-2.170
nesday-2.151
ess-2.146
der-2.122
rely-2.053
Token you
Token ID345
Feature activation+1.41e-01

Pos logit contributions
 calculations+3.960
 assurances+3.958
 remarks+3.835
 negotiations+3.800
 judgments+3.747
Neg logit contributions
quished-5.027
nesday-5.005
ess-4.811
der-4.721
joined-4.647
Token are
Token ID389
Feature activation+3.81e-01

Pos logit contributions
ess+0.639
nesday+0.633
quished+0.633
der+0.629
rely+0.604
Neg logit contributions
 assurances-0.557
 calculations-0.555
 negotiations-0.541
 Preferences-0.535
 remarks-0.532
Token,"
Token ID553
Feature activation+4.23e-02

Pos logit contributions
ess+1.652
der+1.639
nesday+1.598
quished+1.590
rely+1.547
Neg logit contributions
 assurances-1.594
 calculations-1.580
 Preferences-1.541
 Integrity-1.536
 negotiations-1.524
Token D
Token ID360
Feature activation+1.88e+00

Pos logit contributions
ess+0.205
nesday+0.204
quished+0.203
der+0.203
rely+0.196
Neg logit contributions
 assurances-0.155
 calculations-0.154
 negotiations-0.149
 remarks-0.148
 Preferences-0.148
Tokenella
Token ID12627
Feature activation-2.01e-01

Pos logit contributions
ess+2.569
der+2.436
nesday+2.074
quished+2.025
rely+2.000
Neg logit contributions
 assurances-13.463
 calculations-13.300
 Preferences-13.186
 negotiations-13.106
 Integrity-13.066
Token Vol
Token ID4709
Feature activation+1.72e-01

Pos logit contributions
 calculations+0.462
 assurances+0.454
 negotiations+0.435
 Preferences+0.434
 remarks+0.433
Neg logit contributions
nesday-1.282
quished-1.271
der-1.234
ess-1.231
rely-1.204
Tokenpe
Token ID431
Feature activation-2.75e-01

Pos logit contributions
quished+0.726
nesday+0.719
der+0.688
ess+0.682
joined+0.654
Neg logit contributions
 calculations-0.741
 assurances-0.739
 Preferences-0.722
 negotiations-0.721
 remarks-0.719
Token says
Token ID1139
Feature activation+3.97e-01

Pos logit contributions
 assurances+1.176
 calculations+1.168
 Preferences+1.131
 negotiations+1.131
 Integrity+1.124
Neg logit contributions
ess-1.164
der-1.152
nesday-1.143
quished-1.138
rely-1.102
Token,
Token ID11
Feature activation-3.39e-01

Pos logit contributions
ess+1.597
der+1.575
quished+1.517
nesday+1.511
rely+1.453
Neg logit contributions
 assurances-1.777
 calculations-1.753
 Preferences-1.747
 Integrity-1.733
 negotiations-1.695
Token cups
Token ID14180
Feature activation+5.25e-01

Pos logit contributions
ess+0.783
der+0.773
nesday+0.772
quished+0.770
rely+0.750
Neg logit contributions
 assurances-0.478
 calculations-0.475
 Preferences-0.455
 negotiations-0.455
 Integrity-0.450
Token of
Token ID286
Feature activation-1.77e-01

Pos logit contributions
ess+2.475
der+2.451
nesday+2.434
quished+2.422
rely+2.349
Neg logit contributions
 assurances-2.001
 calculations-1.984
 Preferences-1.920
 negotiations-1.914
 Integrity-1.905
Token water
Token ID1660
Feature activation-1.38e-03

Pos logit contributions
 assurances+0.681
 calculations+0.676
 negotiations+0.654
 Preferences+0.652
 Integrity+0.650
Neg logit contributions
nesday-0.821
ess-0.820
quished-0.814
der-0.810
rely-0.786
Token into
Token ID656
Feature activation-9.84e-01

Pos logit contributions
 assurances+0.006
 calculations+0.006
 Preferences+0.006
 negotiations+0.006
 Integrity+0.005
Neg logit contributions
ess-0.006
nesday-0.006
der-0.006
quished-0.006
rely-0.006
Token a
Token ID257
Feature activation+8.20e-01

Pos logit contributions
 calculations+3.233
 assurances+3.176
 negotiations+3.096
 Preferences+3.091
 Integrity+3.058
Neg logit contributions
ess-5.006
nesday-4.994
der-4.994
quished-4.983
rely-4.797
Token sauce
Token ID10746
Feature activation+6.30e-01

Pos logit contributions
ess+3.677
der+3.643
nesday+3.603
quished+3.586
rely+3.477
Neg logit contributions
 assurances-3.310
 calculations-3.285
 Preferences-3.190
 negotiations-3.175
 Integrity-3.170
Tokenpan
Token ID6839
Feature activation+4.61e-01

Pos logit contributions
ess+3.873
der+3.846
nesday+3.789
quished+3.778
rely+3.714
Neg logit contributions
 assurances-1.501
 calculations-1.470
 Preferences-1.403
 Integrity-1.390
 negotiations-1.387
Token and
Token ID290
Feature activation+4.52e-01

Pos logit contributions
ess+2.138
der+2.117
nesday+2.094
quished+2.083
rely+2.025
Neg logit contributions
 assurances-1.793
 calculations-1.777
 Preferences-1.725
 negotiations-1.716
 Integrity-1.713
Token bring
Token ID2222
Feature activation+4.53e-01

Pos logit contributions
ess+2.184
quished+2.170
nesday+2.167
der+2.156
rely+2.065
Neg logit contributions
 assurances-1.683
 calculations-1.667
 negotiations-1.620
 Preferences-1.602
 remarks-1.596
Token it
Token ID340
Feature activation+3.62e-01

Pos logit contributions
ess+2.278
der+2.259
nesday+2.238
quished+2.231
rely+2.169
Neg logit contributions
 assurances-1.588
 calculations-1.573
 Preferences-1.519
 negotiations-1.514
 Integrity-1.507
Token to
Token ID284
Feature activation+1.40e-01

Pos logit contributions
ess+1.865
der+1.850
nesday+1.835
quished+1.829
rely+1.780
Neg logit contributions
 assurances-1.221
 calculations-1.208
 Preferences-1.165
 negotiations-1.163
 Integrity-1.154
Tokenong
Token ID506
Feature activation-1.07e+00

Pos logit contributions
 calculations+1.620
 assurances+1.615
 negotiations+1.587
 remarks+1.576
 Preferences+1.558
Neg logit contributions
nesday-1.725
quished-1.722
der-1.684
ess-1.658
rely-1.608
TokenAng
Token ID13450
Feature activation-6.48e-01

Pos logit contributions
 calculations+3.345
 assurances+3.327
 remarks+3.168
 negotiations+3.167
 Preferences+3.140
Neg logit contributions
der-5.610
ess-5.595
nesday-5.590
quished-5.573
rely-5.458
Token Il
Token ID13778
Feature activation-4.72e-01

Pos logit contributions
 assurances+2.304
 calculations+2.300
 Integrity+2.212
 negotiations+2.210
 Preferences+2.195
Neg logit contributions
ess-3.182
der-3.154
nesday-3.152
quished-3.123
rely-3.043
Tokenbo
Token ID2127
Feature activation-6.63e-01

Pos logit contributions
 calculations+2.378
 assurances+2.327
 remarks+2.304
 negotiations+2.290
 Preferences+2.285
Neg logit contributions
nesday-1.664
quished-1.609
ess-1.549
der-1.456
escent-1.427
Token reported
Token ID2098
Feature activation+5.32e-01

Pos logit contributions
 calculations+2.755
 assurances+2.701
 remarks+2.657
 negotiations+2.619
 Preferences+2.545
Neg logit contributions
quished-2.911
nesday-2.875
ess-2.843
der-2.832
die-2.761
Token that
Token ID326
Feature activation+1.29e+00

Pos logit contributions
ess+2.659
der+2.636
nesday+2.605
quished+2.591
rely+2.526
Neg logit contributions
 assurances-1.868
 calculations-1.848
 Preferences-1.795
 Integrity-1.777
 negotiations-1.776
Token in
Token ID287
Feature activation-6.37e-01

Pos logit contributions
ess+6.270
der+6.228
nesday+6.081
quished+6.052
rely+5.934
Neg logit contributions
 assurances-4.537
 calculations-4.460
 Preferences-4.415
 Integrity-4.343
 negotiations-4.271
Token W
Token ID370
Feature activation+1.45e+00

Pos logit contributions
 calculations+2.458
 assurances+2.437
 negotiations+2.392
 remarks+2.372
 Preferences+2.340
Neg logit contributions
ess-2.961
quished-2.940
nesday-2.930
der-2.926
rely-2.814
Tokenons
Token ID684
Feature activation-8.37e-01

Pos logit contributions
ess+7.161
der+7.003
nesday+6.756
quished+6.698
rely+6.629
Neg logit contributions
 assurances-5.282
 calculations-5.112
 Integrity-5.093
 Preferences-5.041
 negotiations-4.949
Tokenan
Token ID272
Feature activation-1.17e-01

Pos logit contributions
 calculations+3.584
 assurances+3.526
 negotiations+3.471
 Preferences+3.387
 remarks+3.386
Neg logit contributions
quished-3.387
der-3.288
ess-3.257
nesday-3.204
die-3.125
Token,
Token ID11
Feature activation-3.37e-01

Pos logit contributions
 assurances+0.444
 calculations+0.444
 negotiations+0.431
 Preferences+0.427
 Integrity+0.426
Neg logit contributions
ess-0.549
quished-0.539
der-0.539
nesday-0.538
rely-0.524
Token that
Token ID326
Feature activation+1.95e+00

Pos logit contributions
nesday+1.096
ess+1.093
quished+1.088
der+1.085
rely+1.040
Neg logit contributions
 calculations-0.899
 assurances-0.893
 Preferences-0.864
 negotiations-0.858
 Integrity-0.851
Token's
Token ID338
Feature activation+6.09e-01

Pos logit contributions
ess+9.099
der+8.981
nesday+8.741
rely+8.658
quished+8.611
Neg logit contributions
 assurances-7.151
 calculations-7.014
 Integrity-6.846
 Preferences-6.837
 negotiations-6.797
Token goofy
Token ID44089
Feature activation+9.25e-01

Pos logit contributions
ess+2.846
nesday+2.827
quished+2.822
der+2.821
rely+2.699
Neg logit contributions
 assurances-2.368
 calculations-2.359
 negotiations-2.270
 Preferences-2.265
 Integrity-2.243
Token,
Token ID11
Feature activation+7.58e-01

Pos logit contributions
ess+4.473
der+4.418
nesday+4.402
quished+4.386
rely+4.244
Neg logit contributions
 assurances-3.428
 calculations-3.411
 Preferences-3.291
 negotiations-3.282
 Integrity-3.253
Token fun
Token ID1257
Feature activation+1.40e+00

Pos logit contributions
ess+3.528
der+3.493
nesday+3.457
quished+3.444
rely+3.341
Neg logit contributions
 assurances-2.934
 calculations-2.911
 Preferences-2.822
 negotiations-2.809
 Integrity-2.801
Token,
Token ID11
Feature activation+9.70e-01

Pos logit contributions
der+5.856
ess+5.822
quished+5.506
nesday+5.498
rely+5.355
Neg logit contributions
 assurances-5.923
 Integrity-5.763
 calculations-5.727
 Preferences-5.714
 negotiations-5.630
Token and
Token ID290
Feature activation+1.27e+00

Pos logit contributions
ess+4.799
der+4.750
nesday+4.647
quished+4.607
rely+4.531
Neg logit contributions
 assurances-3.440
 calculations-3.386
 Preferences-3.309
 Integrity-3.279
 negotiations-3.259
Token impossible
Token ID5340
Feature activation+2.21e+00

Pos logit contributions
ess+5.821
der+5.758
nesday+5.626
quished+5.574
rely+5.482
Neg logit contributions
 assurances-4.980
 calculations-4.919
 Preferences-4.808
 Integrity-4.776
 negotiations-4.756
Token not
Token ID407
Feature activation+1.27e-01

Pos logit contributions
ess+9.147
der+9.121
nesday+8.508
quished+8.496
rely+8.472
Neg logit contributions
 assurances-8.502
 Integrity-8.425
 Preferences-8.110
 calculations-7.975
 negotiations-7.954
Token to
Token ID284
Feature activation+9.81e-01

Pos logit contributions
ess+0.635
der+0.629
nesday+0.618
quished+0.615
rely+0.599
Neg logit contributions
 assurances-0.454
 calculations-0.447
 Preferences-0.437
 Integrity-0.434
 negotiations-0.432
Token forward
Token ID2651
Feature activation+7.78e-01

Pos logit contributions
ess+5.333
der+5.291
nesday+5.274
quished+5.270
rely+5.090
Neg logit contributions
 assurances-3.023
 calculations-3.009
 Preferences-2.894
 negotiations-2.866
 Integrity-2.841
Token well
Token ID880
Feature activation-1.19e+00

Pos logit contributions
 assurances+0.358
 calculations+0.355
 Integrity+0.340
 negotiations+0.340
 Preferences+0.338
Neg logit contributions
ess-0.525
nesday-0.521
der-0.521
quished-0.520
rely-0.501
Token integrity
Token ID11540
Feature activation-3.89e-01

Pos logit contributions
 assurances+3.752
 calculations+3.745
 Integrity+3.626
 negotiations+3.501
 remarks+3.449
Neg logit contributions
nesday-6.318
quished-6.300
ess-6.175
rely-6.162
der-6.154
Token required
Token ID2672
Feature activation+6.64e-01

Pos logit contributions
 assurances+1.513
 calculations+1.497
 Integrity+1.442
 negotiations+1.433
 Preferences+1.423
Neg logit contributions
ess-1.800
quished-1.800
der-1.794
nesday-1.787
rely-1.738
Token under
Token ID739
Feature activation+6.66e-01

Pos logit contributions
ess+2.740
der+2.705
nesday+2.647
quished+2.630
rely+2.562
Neg logit contributions
 assurances-2.902
 calculations-2.880
 Preferences-2.824
 Integrity-2.792
 negotiations-2.792
Token provincial
Token ID15818
Feature activation-8.32e-01

Pos logit contributions
ess+3.327
der+3.303
nesday+3.240
quished+3.240
rely+3.166
Neg logit contributions
 assurances-2.306
 calculations-2.282
 Preferences-2.227
 Integrity-2.190
 negotiations-2.189
Token regulations
Token ID6647
Feature activation+1.28e-01

Pos logit contributions
 assurances+2.434
 calculations+2.412
 negotiations+2.287
 Integrity+2.277
 Preferences+2.201
Neg logit contributions
nesday-4.742
quished-4.704
ess-4.633
der-4.601
die-4.474
Token,
Token ID11
Feature activation-1.13e-01

Pos logit contributions
ess+0.580
der+0.575
nesday+0.571
quished+0.569
rely+0.551
Neg logit contributions
 assurances-0.512
 calculations-0.509
 Preferences-0.492
 negotiations-0.492
 Integrity-0.490
Token researchers
Token ID4837
Feature activation+5.84e-02

Pos logit contributions
 assurances+0.406
 calculations+0.402
 negotiations+0.388
 Preferences+0.387
 Integrity+0.386
Neg logit contributions
ess-0.561
der-0.556
nesday-0.553
quished-0.552
rely-0.533
Token were
Token ID547
Feature activation+3.35e-01

Pos logit contributions
quished+0.264
ess+0.263
nesday+0.262
der+0.261
rely+0.251
Neg logit contributions
 assurances-0.234
 calculations-0.232
 negotiations-0.224
 Integrity-0.223
 Preferences-0.223
Token told
Token ID1297
Feature activation+9.55e-01

Pos logit contributions
ess+1.537
nesday+1.530
der+1.524
quished+1.523
rely+1.465
Neg logit contributions
 assurances-1.325
 calculations-1.300
 negotiations-1.263
 remarks-1.249
 Preferences-1.246
Token the
Token ID262
Feature activation+3.28e-01

Pos logit contributions
ess+4.563
der+4.499
nesday+4.414
quished+4.363
rely+4.313
Neg logit contributions
 assurances-3.476
 calculations-3.468
 Preferences-3.382
 Integrity-3.335
 negotiations-3.328
Token when
Token ID618
Feature activation-1.16e-01

Pos logit contributions
 calculations+3.472
 assurances+3.370
 negotiations+3.296
 Preferences+3.272
 Integrity+3.264
Neg logit contributions
nesday-3.551
quished-3.544
ess-3.477
der-3.439
rely-3.401
Token we
Token ID356
Feature activation+9.45e-02

Pos logit contributions
 calculations+0.448
 assurances+0.442
 negotiations+0.429
 Preferences+0.428
 Integrity+0.425
Neg logit contributions
ess-0.540
nesday-0.537
der-0.536
quished-0.535
rely-0.512
Token enter
Token ID3802
Feature activation-7.69e-01

Pos logit contributions
quished+0.466
ess+0.462
nesday+0.462
der+0.457
rely+0.437
Neg logit contributions
 calculations-0.342
 assurances-0.338
 negotiations-0.326
 remarks-0.325
 Integrity-0.323
Token system
Token ID1080
Feature activation+7.28e-02

Pos logit contributions
 calculations+2.761
 assurances+2.667
 negotiations+2.619
 Preferences+2.615
 remarks+2.591
Neg logit contributions
quished-3.788
nesday-3.771
ess-3.753
der-3.697
rely-3.621
Token()
Token ID3419
Feature activation+1.62e-01

Pos logit contributions
ess+0.311
der+0.307
nesday+0.306
quished+0.305
rely+0.295
Neg logit contributions
 calculations-0.309
 assurances-0.308
 Preferences-0.299
 negotiations-0.298
 Integrity-0.297
Token,
Token ID837
Feature activation-9.28e-01

Pos logit contributions
ess+0.857
der+0.848
quished+0.844
nesday+0.843
rely+0.820
Neg logit contributions
 assurances-0.525
 calculations-0.524
 Preferences-0.503
 negotiations-0.501
 Integrity-0.499
Token not
Token ID407
Feature activation-5.23e-01

Pos logit contributions
 calculations+3.022
 assurances+2.975
 negotiations+2.875
 remarks+2.865
 Preferences+2.842
Neg logit contributions
quished-4.907
nesday-4.764
ess-4.700
der-4.686
rely-4.614
Token to
Token ID284
Feature activation-1.65e-01

Pos logit contributions
 calculations+1.905
 assurances+1.890
 Preferences+1.831
 negotiations+1.827
 Integrity+1.825
Neg logit contributions
quished-2.522
ess-2.507
der-2.485
nesday-2.478
rely-2.393
Token the
Token ID262
Feature activation-6.26e-01

Pos logit contributions
 assurances+0.588
 calculations+0.587
 Preferences+0.568
 negotiations+0.565
 Integrity+0.563
Neg logit contributions
ess-0.815
der-0.808
quished-0.803
nesday-0.800
rely-0.775
Token bottom
Token ID4220
Feature activation-6.57e-01

Pos logit contributions
 calculations+1.921
 assurances+1.889
 Preferences+1.824
 negotiations+1.822
 Integrity+1.818
Neg logit contributions
quished-3.379
nesday-3.377
ess-3.349
der-3.331
rely-3.250
Token of
Token ID286
Feature activation+2.24e-01

Pos logit contributions
 calculations+2.568
 assurances+2.531
 remarks+2.468
 Preferences+2.466
 negotiations+2.457
Neg logit contributions
nesday-2.962
quished-2.918
ess-2.917
der-2.910
rely-2.851
Token starting
Token ID3599
Feature activation-1.42e+00

Pos logit contributions
ess+1.802
der+1.795
quished+1.784
nesday+1.782
rely+1.714
Neg logit contributions
 assurances-1.438
 calculations-1.424
 negotiations-1.382
 Preferences-1.372
 Integrity-1.369
Token with
Token ID351
Feature activation-1.18e+00

Pos logit contributions
 assurances+5.418
 negotiations+5.401
 calculations+5.382
 remarks+5.162
 Preferences+5.147
Neg logit contributions
quished-6.410
ess-6.285
der-6.236
nesday-6.180
die-5.978
Token the
Token ID262
Feature activation-1.03e+00

Pos logit contributions
 calculations+4.334
 negotiations+4.320
 assurances+4.306
 remarks+4.106
 Preferences+4.019
Neg logit contributions
quished-5.666
nesday-5.581
ess-5.526
der-5.442
die-5.245
Token rich
Token ID5527
Feature activation+1.73e-01

Pos logit contributions
 calculations+3.759
 assurances+3.754
 negotiations+3.711
 remarks+3.556
 Preferences+3.524
Neg logit contributions
nesday-5.023
quished-5.012
ess-4.968
der-4.907
rely-4.729
Token countries
Token ID2678
Feature activation+2.22e-01

Pos logit contributions
ess+0.877
der+0.870
nesday+0.867
quished+0.862
rely+0.838
Neg logit contributions
 assurances-0.593
 calculations-0.589
 negotiations-0.568
 Preferences-0.567
 Integrity-0.561
Token,
Token ID11
Feature activation-3.94e-01

Pos logit contributions
ess+1.032
der+1.024
nesday+1.013
quished+1.009
rely+0.979
Neg logit contributions
 assurances-0.857
 calculations-0.850
 Preferences-0.824
 negotiations-0.821
 Integrity-0.818
Token publishes
Token ID34134
Feature activation+4.40e-01

Pos logit contributions
 assurances+1.503
 calculations+1.498
 negotiations+1.465
 remarks+1.428
 Preferences+1.416
Neg logit contributions
quished-1.885
nesday-1.844
der-1.832
ess-1.819
rely-1.759
Token them
Token ID606
Feature activation+1.42e-01

Pos logit contributions
ess+2.249
der+2.231
nesday+2.209
quished+2.201
rely+2.141
Neg logit contributions
 assurances-1.509
 calculations-1.495
 Preferences-1.442
 negotiations-1.436
 Integrity-1.428
Token
Token ID447
Feature activation+2.93e-01

Pos logit contributions
ess+0.661
der+0.659
nesday+0.658
quished+0.658
rely+0.635
Neg logit contributions
 assurances-0.547
 calculations-0.545
 negotiations-0.527
 Preferences-0.523
 remarks-0.517
Token
Token ID251
Feature activation+1.18e-01

Pos logit contributions
ess+1.548
der+1.535
nesday+1.516
quished+1.512
rely+1.475
Neg logit contributions
 assurances-0.947
 calculations-0.938
 Preferences-0.906
 negotiations-0.900
 Integrity-0.895
Token before
Token ID878
Feature activation+7.63e-03

Pos logit contributions
ess+0.542
der+0.539
nesday+0.533
quished+0.533
rely+0.515
Neg logit contributions
 assurances-0.468
 calculations-0.463
 Preferences-0.448
 negotiations-0.448
 Integrity-0.445
Token spent
Token ID3377
Feature activation+5.48e-01

Pos logit contributions
nesday+1.169
ess+1.167
quished+1.164
der+1.152
rely+1.104
Neg logit contributions
 assurances-0.950
 calculations-0.948
 negotiations-0.916
 remarks-0.909
 Integrity-0.895
Token much
Token ID881
Feature activation+2.03e-02

Pos logit contributions
ess+2.488
der+2.460
nesday+2.438
quished+2.431
rely+2.349
Neg logit contributions
 assurances-2.194
 calculations-2.175
 Preferences-2.106
 negotiations-2.105
 Integrity-2.093
Token time
Token ID640
Feature activation+4.17e-01

Pos logit contributions
ess+0.109
quished+0.108
nesday+0.108
der+0.108
rely+0.104
Neg logit contributions
 assurances-0.064
 calculations-0.064
 negotiations-0.061
 Preferences-0.060
 Integrity-0.060
Token thinking
Token ID3612
Feature activation-8.86e-01

Pos logit contributions
ess+2.123
nesday+2.119
quished+2.111
der+2.105
rely+2.032
Neg logit contributions
 assurances-1.435
 calculations-1.433
 negotiations-1.383
 remarks-1.358
 Preferences-1.356
Token about
Token ID546
Feature activation+1.99e-01

Pos logit contributions
 calculations+2.740
 assurances+2.709
 negotiations+2.649
 remarks+2.575
 judgments+2.563
Neg logit contributions
quished-4.739
nesday-4.736
ess-4.630
der-4.576
joined-4.523
Token the
Token ID262
Feature activation-4.62e-01

Pos logit contributions
quished+0.951
ess+0.948
nesday+0.946
der+0.944
rely+0.901
Neg logit contributions
 assurances-0.755
 calculations-0.749
 negotiations-0.726
 Preferences-0.718
 Integrity-0.714
Token conditions
Token ID3403
Feature activation-2.26e-01

Pos logit contributions
 assurances+1.585
 calculations+1.582
 negotiations+1.520
 remarks+1.504
 Preferences+1.477
Neg logit contributions
quished-2.398
nesday-2.376
der-2.340
ess-2.330
die-2.241
Token out
Token ID503
Feature activation-1.16e-01

Pos logit contributions
 assurances+0.756
 calculations+0.749
 negotiations+0.732
 Preferences+0.716
 remarks+0.703
Neg logit contributions
nesday-1.179
quished-1.177
ess-1.161
der-1.159
rely-1.111
Token of
Token ID286
Feature activation-6.90e-01

Pos logit contributions
 assurances+0.432
 calculations+0.430
 negotiations+0.416
 Preferences+0.415
 Integrity+0.410
Neg logit contributions
ess-0.554
nesday-0.551
der-0.548
quished-0.545
rely-0.528
Token which
Token ID543
Feature activation+3.03e-01

Pos logit contributions
 calculations+2.216
 assurances+2.210
 negotiations+2.139
 Preferences+2.105
 remarks+2.077
Neg logit contributions
quished-3.634
der-3.632
ess-3.629
nesday-3.606
die-3.456
Token the
Token ID262
Feature activation+2.60e-01

Pos logit contributions
ess+1.401
nesday+1.397
quished+1.395
der+1.392
rely+1.325
Neg logit contributions
 assurances-1.199
 calculations-1.191
 negotiations-1.159
 Preferences-1.142
 Integrity-1.137
Token their
Token ID511
Feature activation+5.36e-02

Pos logit contributions
 assurances+0.792
 calculations+0.787
 negotiations+0.757
 Preferences+0.745
 remarks+0.741
Neg logit contributions
quished-1.133
nesday-1.125
ess-1.116
der-1.100
rely-1.069
Token kids
Token ID3988
Feature activation-4.03e-01

Pos logit contributions
quished+0.208
ess+0.207
der+0.206
nesday+0.205
rely+0.195
Neg logit contributions
 calculations-0.250
 assurances-0.249
 negotiations-0.242
 Preferences-0.241
 remarks-0.240
Token toys
Token ID14958
Feature activation-5.45e-01

Pos logit contributions
 calculations+1.556
 assurances+1.550
 Preferences+1.496
 negotiations+1.494
 remarks+1.476
Neg logit contributions
quished-1.870
nesday-1.844
ess-1.825
der-1.813
rely-1.779
Token in
Token ID287
Feature activation-5.64e-01

Pos logit contributions
 calculations+1.921
 assurances+1.909
 negotiations+1.850
 Preferences+1.832
 remarks+1.807
Neg logit contributions
quished-2.694
nesday-2.654
ess-2.603
der-2.590
rely-2.549
Token Pakistan
Token ID7648
Feature activation+1.06e+00

Pos logit contributions
 calculations+1.865
 assurances+1.858
 negotiations+1.817
 Preferences+1.777
 remarks+1.756
Neg logit contributions
quished-2.935
ess-2.929
nesday-2.921
der-2.873
rely-2.790
Token from
Token ID422
Feature activation-6.85e-01

Pos logit contributions
ess+4.200
der+4.155
nesday+4.080
quished+4.049
rely+3.925
Neg logit contributions
 assurances-4.787
 calculations-4.746
 Preferences-4.626
 Integrity-4.607
 negotiations-4.603
Token their
Token ID511
Feature activation-5.77e-01

Pos logit contributions
 calculations+2.457
 assurances+2.447
 negotiations+2.390
 Preferences+2.327
 Integrity+2.321
Neg logit contributions
quished-3.364
nesday-3.320
ess-3.307
der-3.207
joined-3.144
Token homes
Token ID5682
Feature activation+2.31e-01

Pos logit contributions
 calculations+2.072
 assurances+2.022
 Preferences+1.975
 negotiations+1.966
 remarks+1.942
Neg logit contributions
quished-2.860
nesday-2.825
ess-2.817
der-2.782
die-2.711
Token.
Token ID13
Feature activation+3.01e-01

Pos logit contributions
ess+0.945
quished+0.943
der+0.940
nesday+0.937
rely+0.900
Neg logit contributions
 assurances-1.013
 calculations-1.012
 Preferences-0.983
 negotiations-0.978
 remarks-0.967
Token An
Token ID1052
Feature activation+6.24e-01

Pos logit contributions
ess+1.565
nesday+1.562
quished+1.557
der+1.550
rely+1.493
Neg logit contributions
 assurances-1.000
 calculations-0.994
 Preferences-0.983
 Integrity-0.967
 negotiations-0.958
Token array
Token ID7177
Feature activation-6.66e-01

Pos logit contributions
ess+3.113
der+3.088
nesday+3.061
quished+3.047
rely+2.960
Neg logit contributions
 assurances-2.200
 calculations-2.182
 Preferences-2.108
 negotiations-2.097
 Integrity-2.093
Token out
Token ID503
Feature activation+5.23e-01

Pos logit contributions
 assurances+0.780
 calculations+0.768
 negotiations+0.752
 remarks+0.747
 judgments+0.718
Neg logit contributions
nesday-1.314
ess-1.307
der-1.298
quished-1.289
rely-1.257
Tokenbur
Token ID6236
Feature activation+1.05e+00

Pos logit contributions
ess+2.468
der+2.444
nesday+2.403
quished+2.395
rely+2.329
Neg logit contributions
 assurances-1.986
 calculations-1.967
 Preferences-1.914
 Integrity-1.900
 negotiations-1.894
Tokensts
Token ID6448
Feature activation+6.41e-01

Pos logit contributions
ess+3.626
der+3.590
nesday+3.526
quished+3.480
rely+3.343
Neg logit contributions
 assurances-5.319
 calculations-5.287
 Preferences-5.178
 Integrity-5.171
 negotiations-5.147
Token marked
Token ID7498
Feature activation+5.41e-01

Pos logit contributions
ess+2.605
der+2.576
nesday+2.545
quished+2.529
rely+2.449
Neg logit contributions
 assurances-2.860
 calculations-2.840
 Preferences-2.764
 negotiations-2.757
 Integrity-2.747
Token the
Token ID262
Feature activation-1.87e-01

Pos logit contributions
ess+2.612
der+2.590
nesday+2.559
quished+2.549
rely+2.479
Neg logit contributions
 assurances-2.000
 calculations-1.980
 Preferences-1.919
 negotiations-1.910
 Integrity-1.905
Token SB
Token ID18056
Feature activation-9.63e-01

Pos logit contributions
 assurances+0.668
 calculations+0.655
 negotiations+0.648
 remarks+0.643
 Integrity+0.632
Neg logit contributions
der-0.930
ess-0.928
quished-0.928
nesday-0.923
joined-0.887
Token6
Token ID21
Feature activation-1.71e+00

Pos logit contributions
 assurances+3.273
 calculations+3.216
 Integrity+3.145
 negotiations+3.137
 remarks+3.126
Neg logit contributions
nesday-4.799
quished-4.783
der-4.738
ess-4.711
rely-4.591
Token hearing
Token ID4854
Feature activation-7.23e-01

Pos logit contributions
 negotiations+6.018
 assurances+5.934
 remarks+5.892
 calculations+5.845
 Integrity+5.720
Neg logit contributions
quished-7.958
nesday-7.835
ess-7.783
der-7.715
joined-7.687
Token,
Token ID11
Feature activation+1.67e-02

Pos logit contributions
 assurances+2.853
 negotiations+2.850
 remarks+2.833
 calculations+2.823
 briefings+2.703
Neg logit contributions
nesday-3.188
ess-3.181
quished-3.160
der-3.157
rely-3.068
Token however
Token ID2158
Feature activation-1.18e+00

Pos logit contributions
ess+0.078
der+0.078
nesday+0.077
quished+0.077
rely+0.075
Neg logit contributions
 assurances-0.064
 calculations-0.063
 negotiations-0.062
 Integrity-0.061
 Preferences-0.061
Token.
Token ID13
Feature activation-1.84e-01

Pos logit contributions
 assurances+5.394
 calculations+5.339
 negotiations+5.224
 remarks+5.165
 Preferences+5.131
Neg logit contributions
ess-4.557
der-4.521
nesday-4.516
quished-4.452
rely-4.320
Token wrote
Token ID2630
Feature activation-3.05e-01

Pos logit contributions
ess+3.314
der+3.280
nesday+3.244
quished+3.231
rely+3.134
Neg logit contributions
 assurances-3.009
 calculations-2.986
 Preferences-2.894
 negotiations-2.887
 Integrity-2.876
Token a
Token ID257
Feature activation+1.03e+00

Pos logit contributions
 assurances+1.092
 calculations+1.084
 negotiations+1.043
 Integrity+1.038
 Preferences+1.037
Neg logit contributions
ess-1.508
der-1.496
quished-1.490
nesday-1.483
rely-1.440
Token review
Token ID2423
Feature activation+2.29e-02

Pos logit contributions
ess+5.432
der+5.386
nesday+5.315
quished+5.292
rely+5.172
Neg logit contributions
 assurances-3.289
 calculations-3.242
 Preferences-3.145
 negotiations-3.111
 Integrity-3.108
Token of
Token ID286
Feature activation+5.29e-01

Pos logit contributions
ess+0.119
der+0.118
nesday+0.118
quished+0.117
rely+0.114
Neg logit contributions
 assurances-0.076
 calculations-0.076
 negotiations-0.072
 Integrity-0.072
 remarks-0.072
Token the
Token ID262
Feature activation-1.39e-01

Pos logit contributions
ess+2.613
der+2.587
nesday+2.544
quished+2.537
rely+2.479
Neg logit contributions
 assurances-1.873
 calculations-1.851
 Preferences-1.812
 Integrity-1.792
 negotiations-1.782
Token scientific
Token ID5654
Feature activation-1.81e+00

Pos logit contributions
 assurances+0.402
 calculations+0.401
 negotiations+0.381
 remarks+0.378
 Preferences+0.376
Neg logit contributions
ess-0.785
der-0.781
nesday-0.776
quished-0.775
rely-0.748
Token evidence
Token ID2370
Feature activation-1.19e+00

Pos logit contributions
 calculations+5.221
 assurances+5.009
 negotiations+4.795
 remarks+4.658
 judgments+4.638
Neg logit contributions
nesday-10.279
quished-10.229
der-9.917
ess-9.664
die-9.632
Token on
Token ID319
Feature activation-3.93e-01

Pos logit contributions
 calculations+4.318
 assurances+4.287
 negotiations+4.167
 remarks+4.125
 backing+4.038
Neg logit contributions
quished-5.655
nesday-5.553
der-5.454
rely-5.369
ess-5.358
Token teen
Token ID6036
Feature activation+3.07e-01

Pos logit contributions
 assurances+1.448
 calculations+1.445
 negotiations+1.394
 Preferences+1.373
 remarks+1.368
Neg logit contributions
quished-1.897
der-1.887
ess-1.879
nesday-1.876
rely-1.788
Token sleep
Token ID3993
Feature activation-1.21e+00

Pos logit contributions
ess+1.596
der+1.587
nesday+1.582
quished+1.579
rely+1.529
Neg logit contributions
 assurances-1.022
 calculations-1.015
 negotiations-0.975
 Preferences-0.973
 Integrity-0.964
Token needs
Token ID2476
Feature activation-1.50e+00

Pos logit contributions
 calculations+4.912
 assurances+4.839
 Preferences+4.704
 negotiations+4.697
 preferences+4.696
Neg logit contributions
quished-5.287
nesday-5.211
der-5.181
ess-5.144
rely-5.028
Token to
Token ID284
Feature activation-9.66e-01

Pos logit contributions
 assurances+1.913
 calculations+1.890
 Preferences+1.818
 negotiations+1.816
 Integrity+1.808
Neg logit contributions
ess-2.855
der-2.831
nesday-2.799
quished-2.798
rely-2.720
Token answer
Token ID3280
Feature activation-6.23e-01

Pos logit contributions
 assurances+2.399
 calculations+2.385
 remarks+2.367
 negotiations+2.285
 briefings+2.177
Neg logit contributions
quished-5.784
nesday-5.596
ess-5.566
der-5.474
joined-5.426
Token questions
Token ID2683
Feature activation-4.58e-01

Pos logit contributions
 assurances+1.695
 calculations+1.639
 remarks+1.580
 negotiations+1.579
 Integrity+1.556
Neg logit contributions
quished-3.638
ess-3.606
nesday-3.586
der-3.580
rely-3.514
Token from
Token ID422
Feature activation-4.38e-02

Pos logit contributions
 assurances+1.539
 calculations+1.507
 remarks+1.484
 negotiations+1.475
 briefings+1.426
Neg logit contributions
quished-2.361
ess-2.297
der-2.290
nesday-2.290
rely-2.251
Token a
Token ID257
Feature activation+4.67e-01

Pos logit contributions
 assurances+0.176
 calculations+0.174
 Integrity+0.170
 negotiations+0.170
 Preferences+0.168
Neg logit contributions
ess-0.199
der-0.196
quished-0.195
nesday-0.194
rely-0.188
Token Globe
Token ID18260
Feature activation-1.92e+00

Pos logit contributions
ess+2.002
der+1.981
quished+1.962
nesday+1.954
rely+1.893
Neg logit contributions
 assurances-1.982
 calculations-1.975
 negotiations-1.919
 Integrity-1.917
 Preferences-1.916
Token reporter
Token ID9095
Feature activation-8.47e-02

Pos logit contributions
 Integrity+7.118
 calculations+7.113
 assurances+6.960
 negotiations+6.943
 remarks+6.877
Neg logit contributions
quished-8.758
der-8.497
nesday-8.401
ess-8.311
rely-8.255
Token on
Token ID319
Feature activation-8.48e-01

Pos logit contributions
 assurances+0.311
 calculations+0.308
 negotiations+0.301
 remarks+0.297
 Integrity+0.293
Neg logit contributions
quished-0.414
nesday-0.405
der-0.403
ess-0.403
rely-0.394
Token Wednesday
Token ID3583
Feature activation+4.54e-01

Pos logit contributions
 assurances+2.845
 calculations+2.800
 negotiations+2.734
 remarks+2.676
 Integrity+2.669
Neg logit contributions
quished-4.397
ess-4.376
der-4.343
nesday-4.288
rely-4.183
Token about
Token ID546
Feature activation-9.68e-01

Pos logit contributions
ess+2.027
der+2.010
nesday+1.997
quished+1.996
rely+1.928
Neg logit contributions
 assurances-1.845
 calculations-1.827
 negotiations-1.775
 Preferences-1.766
 Integrity-1.758
Token why
Token ID1521
Feature activation-2.22e-01

Pos logit contributions
 assurances+3.267
 calculations+3.237
 negotiations+3.136
 remarks+3.082
 Integrity+3.051
Neg logit contributions
quished-4.989
ess-4.945
der-4.930
nesday-4.906
die-4.797
Tokennothing
Token ID22366
Feature activation-1.39e+00

Pos logit contributions
nesday+0.111
quished+0.110
ess+0.109
der+0.108
rely+0.103
Neg logit contributions
 assurances-0.106
 calculations-0.106
 negotiations-0.102
 Preferences-0.102
 Integrity-0.102
Token will
Token ID481
Feature activation-9.47e-01

Pos logit contributions
 assurances+5.888
 calculations+5.779
 negotiations+5.733
 remarks+5.600
 judgments+5.498
Neg logit contributions
quished-5.576
nesday-5.456
der-5.387
rely-5.378
die-5.313
Token happen
Token ID1645
Feature activation-2.62e-02

Pos logit contributions
 assurances+3.551
 calculations+3.525
 negotiations+3.451
 remarks+3.366
 Preferences+3.306
Neg logit contributions
quished-4.456
nesday-4.238
der-4.227
ess-4.206
joined-4.136
Token (
Token ID357
Feature activation-4.57e-02

Pos logit contributions
 calculations+0.109
 assurances+0.109
 negotiations+0.106
 Preferences+0.105
 remarks+0.104
Neg logit contributions
ess-0.112
quished-0.112
nesday-0.111
der-0.111
rely-0.108
Tokenin
Token ID259
Feature activation-1.24e+00

Pos logit contributions
 calculations+0.219
 assurances+0.219
 negotiations+0.213
 Preferences+0.211
 Integrity+0.211
Neg logit contributions
quished-0.166
nesday-0.165
ess-0.163
der-0.161
rely-0.153
Token terms
Token ID2846
Feature activation-1.98e+00

Pos logit contributions
 calculations+4.390
 assurances+4.353
 negotiations+4.280
 remarks+4.167
 Preferences+4.167
Neg logit contributions
quished-5.913
ess-5.901
nesday-5.878
der-5.825
escent-5.632
Token of
Token ID286
Feature activation-1.48e+00

Pos logit contributions
 calculations+4.588
 negotiations+4.401
 assurances+4.276
 remarks+4.177
 Integrity+3.939
Neg logit contributions
nesday-11.746
quished-11.708
der-11.107
ess-10.959
joined-10.916
Token visible
Token ID7424
Feature activation-1.08e+00

Pos logit contributions
 calculations+5.369
 assurances+5.259
 negotiations+5.182
 remarks+4.971
 briefings+4.856
Neg logit contributions
quished-7.212
der-7.002
ess-6.968
nesday-6.913
die-6.748
Token catast
Token ID14911
Feature activation-2.72e-01

Pos logit contributions
 calculations+3.718
 assurances+3.642
 negotiations+3.540
 remarks+3.482
 judgments+3.406
Neg logit contributions
quished-5.472
nesday-5.434
der-5.416
ess-5.349
joined-5.267
Tokenrop
Token ID1773
Feature activation+3.33e-01

Pos logit contributions
 calculations+1.572
 assurances+1.562
 negotiations+1.526
 Preferences+1.525
 remarks+1.517
Neg logit contributions
nesday-0.726
der-0.725
ess-0.712
quished-0.710
rely-0.657
Tokenhes
Token ID956
Feature activation+5.58e-01

Pos logit contributions
ess+1.196
der+1.190
nesday+1.176
quished+1.167
rely+1.121
Neg logit contributions
 assurances-1.634
 calculations-1.625
 Preferences-1.581
 negotiations-1.577
 Integrity-1.573
Token,
Token ID11
Feature activation-2.34e-01

Pos logit contributions
 calculations+1.268
 assurances+1.246
 remarks+1.215
 negotiations+1.214
 Preferences+1.212
Neg logit contributions
nesday-1.702
der-1.675
ess-1.667
quished-1.662
rely-1.629
Token the
Token ID262
Feature activation-1.29e-01

Pos logit contributions
 assurances+0.825
 calculations+0.824
 Preferences+0.796
 negotiations+0.794
 Integrity+0.788
Neg logit contributions
ess-1.159
der-1.154
nesday-1.151
quished-1.141
rely-1.111
Token Fire
Token ID3764
Feature activation+8.46e-01

Pos logit contributions
 assurances+0.483
 calculations+0.483
 negotiations+0.467
 Preferences+0.466
 Integrity+0.462
Neg logit contributions
ess-0.613
nesday-0.611
der-0.609
quished-0.606
rely-0.586
Token Station
Token ID9327
Feature activation+3.76e-01

Pos logit contributions
ess+3.806
der+3.765
nesday+3.700
quished+3.673
rely+3.589
Neg logit contributions
 assurances-3.408
 calculations-3.365
 Preferences-3.269
 negotiations-3.252
 Integrity-3.247
Token,
Token ID11
Feature activation-1.17e-01

Pos logit contributions
ess+1.780
der+1.766
nesday+1.751
quished+1.741
rely+1.693
Neg logit contributions
 assurances-1.423
 calculations-1.414
 Preferences-1.371
 negotiations-1.365
 Integrity-1.361
Token Sn
Token ID5489
Feature activation-1.92e+00

Pos logit contributions
 assurances+0.409
 calculations+0.407
 Preferences+0.393
 negotiations+0.392
 Integrity+0.390
Neg logit contributions
ess-0.582
der-0.579
nesday-0.575
quished-0.571
rely-0.556
Tokenuf
Token ID3046
Feature activation+4.22e-01

Pos logit contributions
 calculations+8.431
 assurances+8.212
 remarks+8.023
 negotiations+8.022
 judgments+7.992
Neg logit contributions
nesday-6.646
quished-6.432
der-6.206
ess-6.168
rely-6.166
Tokenkin
Token ID5116
Feature activation-6.66e-01

Pos logit contributions
ess+1.601
der+1.585
nesday+1.572
quished+1.567
rely+1.502
Neg logit contributions
 assurances-1.986
 calculations-1.977
 Preferences-1.922
 negotiations-1.921
 Integrity-1.912
Token's
Token ID338
Feature activation-1.20e+00

Pos logit contributions
 calculations+2.863
 assurances+2.844
 Preferences+2.775
 negotiations+2.774
 remarks+2.752
Neg logit contributions
nesday-2.710
quished-2.664
ess-2.661
der-2.651
rely-2.596
Token Camp
Token ID5425
Feature activation+1.16e+00

Pos logit contributions
 calculations+4.680
 assurances+4.608
 Preferences+4.535
 negotiations+4.516
 remarks+4.502
Neg logit contributions
nesday-5.362
quished-5.296
ess-5.225
der-5.192
rely-5.041
Token,
Token ID11
Feature activation-2.22e-02

Pos logit contributions
ess+5.005
der+4.930
quished+4.783
nesday+4.745
rely+4.633
Neg logit contributions
 assurances-4.857
 calculations-4.797
 Preferences-4.653
 negotiations-4.641
 Integrity-4.626
Token Convention
Token ID11680
Feature activation+1.39e+00

Pos logit contributions
 assurances+0.202
 calculations+0.200
 Integrity+0.199
 negotiations+0.194
 Preferences+0.194
Neg logit contributions
quished-0.430
nesday-0.430
ess-0.429
der-0.427
rely-0.418
Token,
Token ID11
Feature activation+1.39e-01

Pos logit contributions
ess+6.273
der+6.156
nesday+5.990
quished+5.953
rely+5.787
Neg logit contributions
 assurances-5.409
 calculations-5.393
 Preferences-5.238
 Integrity-5.096
 negotiations-5.091
Token the
Token ID262
Feature activation-1.74e-03

Pos logit contributions
ess+0.710
der+0.706
quished+0.702
nesday+0.701
rely+0.679
Neg logit contributions
 assurances-0.473
 calculations-0.466
 negotiations-0.453
 Integrity-0.450
 Preferences-0.449
Token American
Token ID1605
Feature activation+8.49e-01

Pos logit contributions
 assurances+0.006
 calculations+0.006
 negotiations+0.006
 Integrity+0.005
 Preferences+0.005
Neg logit contributions
ess-0.009
quished-0.009
der-0.009
nesday-0.009
joined-0.009
Token Ge
Token ID2269
Feature activation-1.45e+00

Pos logit contributions
ess+4.450
der+4.414
nesday+4.337
quished+4.326
rely+4.230
Neg logit contributions
 assurances-2.763
 calculations-2.726
 Preferences-2.635
 negotiations-2.603
 Integrity-2.599
Tokenophysical
Token ID41789
Feature activation-1.92e+00

Pos logit contributions
 calculations+2.228
 remarks+2.118
 Preferences+2.047
 assurances+2.024
 Integrity+2.015
Neg logit contributions
nesday-9.556
quished-9.396
der-9.279
ess-9.237
rely-9.091
Token Union
Token ID4479
Feature activation+3.23e-01

Pos logit contributions
 Integrity+5.375
 Preferences+5.094
 negotiations+5.046
 calculations+4.863
 assurances+4.788
Neg logit contributions
nesday-10.764
quished-10.700
ess-10.289
der-10.220
rely-10.082
Token in
Token ID287
Feature activation-9.12e-01

Pos logit contributions
ess+1.489
der+1.475
nesday+1.459
quished+1.453
rely+1.410
Neg logit contributions
 assurances-1.265
 calculations-1.255
 Preferences-1.217
 negotiations-1.213
 Integrity-1.210
Token 2011
Token ID2813
Feature activation-4.48e-01

Pos logit contributions
 assurances+2.754
 calculations+2.743
 negotiations+2.732
 remarks+2.675
 Integrity+2.594
Neg logit contributions
quished-4.955
nesday-4.930
ess-4.901
der-4.848
escent-4.694
Token and
Token ID290
Feature activation-2.16e-01

Pos logit contributions
 assurances+1.782
 calculations+1.760
 negotiations+1.722
 Preferences+1.705
 Integrity+1.705
Neg logit contributions
ess-2.025
der-2.007
nesday-2.001
quished-1.981
rely-1.920
Token 2012
Token ID2321
Feature activation+5.90e-03

Pos logit contributions
 assurances+0.782
 calculations+0.780
 negotiations+0.764
 remarks+0.748
 Integrity+0.745
Neg logit contributions
ess-1.054
der-1.051
quished-1.047
nesday-1.039
joined-1.006