TOP ACTIVATIONS
MAX = 4.087

specific to feminist political and moral commitments because it concerns the
bonds , which will be worth around £ 200 million ,
most valuable bank notes , worth USD 8000 as of September
, public and private property worth more than Rs 20 ,
third in DV OA ( value per play ), and fourth
conducting searches for material that might help us with this ,
package of potential arms sales worth more than $ 3 .
ick ory , apple , pear , plum , and paw
- ag itated editor maybe these two guys
+ Self less , Moral , Vigil ant
ultimately decide the validity and worth iness of a project by
, noted Kap ur and Va ish nav .
some people out there who might be offended by the reporter
forward and , while K uy t and Bruno Mart ins
of logos , of the moral and intellectual standard the West
the firm under wrote deals worth $ 131 billion .
year contract that offseason , worth a reported $ 22 million
, a senior atmospheric , ocean ic and space sciences major
used for religious rituals , medicinal purposes , or even electro
, especially those living in Wa ik iki where the tourism

MIN ACTIVATIONS
MIN = -4.462

million years . <|endoftext|> Image copyright AFP The Bank
. <|endoftext|> Image copyright publicity Image caption The Greeks
t weet / <|endoftext|> Story highlights Secretary of Defense Chuck Hag
Serve hot ! <|endoftext|> Image copyright ESA Image caption Go ce
. <|endoftext|> Image copyright Deadline Police have
ported License . <|endoftext|> Image copyright AP Image caption Peace talks
vote accordingly . <|endoftext|> Story highlights Police chief vows wanted suspect
didn t start with Netanyahu ; physical movement restrictions
media reported . <|endoftext|> Image caption Some talked of " b
Russia . Image copyright AFP Image caption The Kurds
didn t lose control ; it was stolen from
ACE ). Image caption Council staff s ifted through
Brexit is the prime challenge Belfast Te legraph . co .
didn t lose control it was stolen
We ve lost control of our personal data It
Moscow , May 28 , 2014 . REUTERS / Michael K
<|endoftext|> Image copyright AP Image caption Peace talks between the government
and then choosing " Show Spe lling and Gram mar "
$ 8 million due to Carlos Quentin . But fact oring
I ba H Ex TT j B orm es les

INTERVAL 1.950 - 4.087
CONTAINS 0.674%

." 2003 annual report , Joyce Foundation : " R oc
several construction companies , B oral and CS R for example
s face facts . Turkey and Azerbaijan threaten
In an article for Pitch fork , music critic Adam Ward
Americas ) summit in Venezuela in 2007 . At the summit

INTERVAL -0.187 - 1.950
CONTAINS 58.325%

Daily Caller . <|endoftext|> In a press release sent to Gaming
so much that happens . As a matter of fact ,
than increased bandwidth . Crit ically , these approaches strike right
" It can become very complex very quickly ,"
The loss to the Flyers was Chicago 's fourth

INTERVAL -2.324 - -0.187
CONTAINS 40.835%

ll deserve more discussion if he stays in contention the
2000 , but back then the universal view of the comment
, four million envelop es and 100 ballot boxes in a
star tracker , was re - task ed to fly by
Samsung , a major maker of Honey comb tablets , argue

INTERVAL -4.462 - -2.324
CONTAINS 0.165%

We ll start with the 17 - 40 f
natural gas field with Turkey buying gas from Israel
. When we use curly brace notation this is not a
sites just for the dominant browser is the kind of thing
parse _ args ()) # initialize a rectangular and square struct

Token specific
Token ID2176
Feature activation-1.86e+00

Pos logit contributions
 initials+2.203
ses+2.075
 forces+2.055
 actions+1.965
advertisement+1.938
Neg logit contributions
osate-2.599
etheless-2.522
ufact-2.347
 Lauder-2.175
veyard-2.162
Token to
Token ID284
Feature activation-7.62e-01

Pos logit contributions
 initials+3.595
ses+3.301
 forces+3.287
 actions+3.105
advertisement+2.950
Neg logit contributions
osate-6.941
etheless-6.800
ufact-6.438
 Lauder-6.049
veyard-6.018
Token feminist
Token ID14314
Feature activation+7.31e-01

Pos logit contributions
 initials+2.018
ses+1.893
 forces+1.879
 actions+1.795
advertisement+1.738
Neg logit contributions
osate-2.342
etheless-2.293
ufact-2.133
 Lauder-1.977
veyard-1.943
Token political
Token ID1964
Feature activation+9.96e-01

Pos logit contributions
osate+2.407
etheless+2.358
ufact+2.194
 Lauder+2.100
 matter+2.031
Neg logit contributions
 initials-1.679
ses-1.539
advertisement-1.480
 forces-1.469
official-1.417
Token and
Token ID290
Feature activation+1.15e-02

Pos logit contributions
osate+3.147
etheless+3.107
ufact+2.906
 Lauder+2.794
 matter+2.731
Neg logit contributions
 initials-2.312
ses-2.107
advertisement-2.075
 forces-1.976
official-1.963
Token moral
Token ID6573
Feature activation+4.09e+00

Pos logit contributions
osate+0.036
etheless+0.035
ufact+0.033
 Lauder+0.031
veyard+0.030
Neg logit contributions
 initials-0.029
ses-0.027
 forces-0.027
advertisement-0.026
 actions-0.025
Token commitments
Token ID19405
Feature activation+8.70e-01

Pos logit contributions
osate+11.201
etheless+11.109
 matter+10.860
omas+10.424
ufact+10.421
Neg logit contributions
 initials-8.302
advertisement-7.838
Reviewer-7.521
MSN-7.461
parser-7.410
Token because
Token ID780
Feature activation-1.40e+00

Pos logit contributions
osate+2.689
etheless+2.666
ufact+2.434
 Lauder+2.330
 matter+2.283
Neg logit contributions
 initials-2.187
ses-1.992
 forces-1.912
advertisement-1.910
 actions-1.827
Token it
Token ID340
Feature activation+1.36e-02

Pos logit contributions
 initials+3.279
 forces+3.036
ses+3.014
 actions+2.896
advertisement+2.757
Neg logit contributions
osate-4.692
etheless-4.616
ufact-4.341
 Lauder-4.033
veyard-3.970
Token concerns
Token ID4786
Feature activation-5.49e-01

Pos logit contributions
osate+0.042
etheless+0.041
ufact+0.038
 Lauder+0.036
veyard+0.035
Neg logit contributions
 initials-0.035
ses-0.033
 forces-0.033
 actions-0.031
advertisement-0.031
Token the
Token ID262
Feature activation-2.72e-01

Pos logit contributions
 initials+1.277
ses+1.184
 forces+1.167
 actions+1.108
advertisement+1.095
Neg logit contributions
osate-1.861
etheless-1.825
ufact-1.701
 Lauder-1.605
veyard-1.572
Token bonds
Token ID13100
Feature activation+1.26e+00

Pos logit contributions
 initials+0.888
ses+0.831
 forces+0.820
 actions+0.784
advertisement+0.778
Neg logit contributions
osate-0.996
etheless-0.977
ufact-0.902
 Lauder-0.841
veyard-0.827
Token,
Token ID11
Feature activation+1.67e+00

Pos logit contributions
osate+3.359
etheless+3.297
ufact+3.016
 Lauder+2.814
veyard+2.714
Neg logit contributions
 initials-3.698
ses-3.480
 forces-3.384
advertisement-3.295
 actions-3.255
Token which
Token ID543
Feature activation+1.47e+00

Pos logit contributions
etheless+4.562
osate+4.533
ufact+4.177
 Lauder+4.003
 Scotia+3.849
Neg logit contributions
 initials-4.528
ses-4.306
 forces-4.108
advertisement-4.104
parser-3.966
Token will
Token ID481
Feature activation+1.47e+00

Pos logit contributions
osate+3.664
etheless+3.584
ufact+3.243
 Lauder+3.095
 Scotia+2.935
Neg logit contributions
 initials-4.497
ses-4.289
advertisement-4.109
 forces-4.090
 actions-3.974
Token be
Token ID307
Feature activation+8.52e-01

Pos logit contributions
osate+3.000
etheless+2.976
ufact+2.608
 Lauder+2.395
veyard+2.270
Neg logit contributions
 initials-5.214
ses-4.984
 forces-4.847
advertisement-4.783
 actions-4.722
Token worth
Token ID2861
Feature activation+3.99e+00

Pos logit contributions
etheless+2.657
osate+2.642
ufact+2.490
 Lauder+2.344
 Scotia+2.268
Neg logit contributions
 initials-2.045
ses-1.928
 forces-1.844
advertisement-1.812
parser-1.756
Token around
Token ID1088
Feature activation+1.86e+00

Pos logit contributions
etheless+10.429
osate+10.040
 Lauder+9.670
ufact+9.539
ige+9.171
Neg logit contributions
 initials-9.022
ses-8.835
advertisement-8.541
Reviewer-8.360
parser-8.356
Token £
Token ID4248
Feature activation+2.45e+00

Pos logit contributions
osate+5.187
etheless+5.149
ufact+4.705
 Lauder+4.579
 Scotia+4.362
Neg logit contributions
 initials-5.001
ses-4.655
 forces-4.438
advertisement-4.428
parser-4.318
Token200
Token ID2167
Feature activation+1.52e+00

Pos logit contributions
osate+6.338
etheless+5.986
ufact+5.556
 Lauder+5.484
 Scotia+5.148
Neg logit contributions
 initials-7.020
ses-6.356
 forces-6.294
 actions-6.033
 motivations-5.906
Token million
Token ID1510
Feature activation+1.41e+00

Pos logit contributions
osate+4.120
etheless+4.062
ufact+3.687
 Lauder+3.483
veyard+3.372
Neg logit contributions
 initials-4.272
ses-4.045
 forces-3.923
advertisement-3.812
 actions-3.769
Token,
Token ID11
Feature activation+4.71e-01

Pos logit contributions
osate+4.091
etheless+4.052
ufact+3.741
 Lauder+3.537
 Scotia+3.420
Neg logit contributions
 initials-3.691
ses-3.470
 forces-3.348
advertisement-3.288
 actions-3.226
Token most
Token ID749
Feature activation-6.34e-01

Pos logit contributions
osate+0.286
etheless+0.282
ufact+0.259
 Lauder+0.247
veyard+0.237
Neg logit contributions
 initials-0.236
ses-0.221
 forces-0.213
advertisement-0.210
 actions-0.203
Token valuable
Token ID8119
Feature activation+1.36e+00

Pos logit contributions
 initials+1.666
ses+1.549
 forces+1.540
 actions+1.470
advertisement+1.458
Neg logit contributions
osate-1.965
etheless-1.928
ufact-1.783
 Lauder-1.663
veyard-1.634
Token bank
Token ID3331
Feature activation+1.70e-01

Pos logit contributions
osate+4.035
etheless+3.937
ufact+3.695
 Lauder+3.587
 matter+3.478
Neg logit contributions
 initials-3.306
ses-3.210
advertisement-3.045
official-2.941
 forces-2.907
Tokennotes
Token ID17815
Feature activation+1.78e+00

Pos logit contributions
osate+0.562
etheless+0.553
ufact+0.513
 Lauder+0.486
veyard+0.472
Neg logit contributions
 initials-0.399
ses-0.374
 forces-0.361
advertisement-0.348
 actions-0.342
Token,
Token ID11
Feature activation+4.78e-01

Pos logit contributions
osate+5.255
etheless+5.229
ufact+4.817
 Lauder+4.553
 matter+4.476
Neg logit contributions
 initials-4.412
ses-4.165
 forces-3.952
advertisement-3.870
parser-3.822
Token worth
Token ID2861
Feature activation+3.92e+00

Pos logit contributions
osate+1.472
etheless+1.462
ufact+1.349
 Lauder+1.288
 Scotia+1.235
Neg logit contributions
 initials-1.204
ses-1.133
 forces-1.097
advertisement-1.072
 actions-1.044
Token USD
Token ID11403
Feature activation+1.73e+00

Pos logit contributions
etheless+9.733
osate+9.413
 Lauder+8.962
ufact+8.715
 matter+8.445
Neg logit contributions
 initials-9.389
ses-9.094
advertisement-8.835
Reviewer-8.658
parser-8.598
Token 8000
Token ID38055
Feature activation+6.22e-01

Pos logit contributions
osate+5.342
etheless+5.229
ufact+4.813
 Lauder+4.703
 Scotia+4.449
Neg logit contributions
 initials-4.197
ses-3.904
 forces-3.773
advertisement-3.660
 actions-3.597
Token as
Token ID355
Feature activation+1.44e-01

Pos logit contributions
osate+1.658
etheless+1.631
ufact+1.476
 Lauder+1.390
veyard+1.332
Neg logit contributions
 initials-1.857
ses-1.753
 forces-1.721
advertisement-1.668
 actions-1.653
Token of
Token ID286
Feature activation+4.47e-01

Pos logit contributions
osate+0.440
etheless+0.434
ufact+0.400
 Lauder+0.377
veyard+0.364
Neg logit contributions
 initials-0.376
ses-0.353
 forces-0.345
advertisement-0.332
 actions-0.330
Token September
Token ID2693
Feature activation-1.09e-01

Pos logit contributions
osate+1.248
etheless+1.227
ufact+1.120
 Lauder+1.067
veyard+1.013
Neg logit contributions
 initials-1.273
ses-1.191
 forces-1.167
advertisement-1.138
 actions-1.118
Token,
Token ID11
Feature activation+4.90e-02

Pos logit contributions
osate+1.939
etheless+1.905
ufact+1.787
 Lauder+1.692
veyard+1.633
Neg logit contributions
 initials-1.372
ses-1.270
 forces-1.225
advertisement-1.200
 actions-1.169
Token public
Token ID1171
Feature activation+7.67e-01

Pos logit contributions
osate+0.159
etheless+0.156
ufact+0.144
 Lauder+0.137
veyard+0.132
Neg logit contributions
 initials-0.120
ses-0.112
 forces-0.108
advertisement-0.105
 actions-0.103
Token and
Token ID290
Feature activation-5.66e-01

Pos logit contributions
osate+2.467
etheless+2.431
ufact+2.245
 Lauder+2.153
isance+2.082
Neg logit contributions
 initials-1.841
ses-1.708
advertisement-1.620
 forces-1.596
parser-1.538
Token private
Token ID2839
Feature activation+1.52e+00

Pos logit contributions
 initials+1.413
ses+1.317
 forces+1.290
advertisement+1.233
 actions+1.228
Neg logit contributions
osate-1.817
etheless-1.781
ufact-1.656
 Lauder-1.559
veyard-1.521
Token property
Token ID3119
Feature activation+1.02e+00

Pos logit contributions
osate+4.538
etheless+4.453
ufact+4.120
 Lauder+3.943
 matter+3.868
Neg logit contributions
 initials-3.756
ses-3.513
advertisement-3.453
parser-3.291
Position-3.266
Token worth
Token ID2861
Feature activation+3.92e+00

Pos logit contributions
osate+3.017
etheless+2.959
ufact+2.753
 Lauder+2.586
veyard+2.492
Neg logit contributions
 initials-2.676
ses-2.472
advertisement-2.404
 forces-2.356
parser-2.293
Token more
Token ID517
Feature activation+1.23e+00

Pos logit contributions
etheless+9.991
osate+9.978
ufact+9.346
 Lauder+9.011
 matter+8.573
Neg logit contributions
 initials-9.457
ses-8.772
advertisement-8.708
Reviewer-8.643
parser-8.524
Token than
Token ID621
Feature activation+1.09e+00

Pos logit contributions
osate+3.573
etheless+3.534
ufact+3.276
 Lauder+3.103
veyard+2.970
Neg logit contributions
 initials-3.222
ses-3.051
 forces-2.924
advertisement-2.869
 actions-2.787
Token Rs
Token ID12820
Feature activation+1.86e+00

Pos logit contributions
osate+3.318
etheless+3.292
ufact+3.054
 Lauder+2.911
pload+2.743
Neg logit contributions
 initials-2.763
ses-2.579
 forces-2.496
advertisement-2.457
 actions-2.390
Token 20
Token ID1160
Feature activation+4.97e-01

Pos logit contributions
osate+5.677
etheless+5.520
ufact+5.134
 Lauder+5.013
 matter+4.747
Neg logit contributions
 initials-4.560
ses-4.184
 forces-4.062
advertisement-3.918
 actions-3.869
Token,
Token ID11
Feature activation+1.03e+00

Pos logit contributions
osate+1.194
etheless+1.169
ufact+1.053
 Lauder+0.973
veyard+0.932
Neg logit contributions
 initials-1.624
ses-1.536
 forces-1.509
advertisement-1.466
 actions-1.457
Token third
Token ID2368
Feature activation+5.97e-01

Pos logit contributions
etheless+1.435
osate+1.433
ufact+1.311
 Lauder+1.247
 Scotia+1.189
Neg logit contributions
 initials-1.250
ses-1.186
 forces-1.137
advertisement-1.104
 actions-1.089
Token in
Token ID287
Feature activation+1.54e+00

Pos logit contributions
etheless+1.984
osate+1.974
ufact+1.813
 Lauder+1.743
veyard+1.682
Neg logit contributions
 initials-1.335
ses-1.272
 forces-1.196
advertisement-1.187
 actions-1.145
Token DV
Token ID29854
Feature activation+3.02e+00

Pos logit contributions
etheless+4.666
osate+4.608
ufact+4.185
 Lauder+4.074
 Scotia+3.946
Neg logit contributions
 initials-3.722
ses-3.573
advertisement-3.354
 forces-3.336
MSN-3.294
TokenOA
Token ID23621
Feature activation+1.85e+00

Pos logit contributions
osate+5.932
etheless+5.825
omas+5.288
ufact+5.141
 Lauder+5.090
Neg logit contributions
ses-9.329
 initials-9.276
advertisement-8.969
parser-8.689
official-8.622
Token (
Token ID357
Feature activation+8.83e-01

Pos logit contributions
osate+5.155
etheless+5.127
ufact+4.769
 Lauder+4.344
veyard+4.337
Neg logit contributions
 initials-4.758
ses-4.621
advertisement-4.345
parser-4.228
 forces-4.216
Tokenvalue
Token ID8367
Feature activation+3.86e+00

Pos logit contributions
etheless+2.658
osate+2.633
ufact+2.375
 Lauder+2.253
veyard+2.247
Neg logit contributions
 initials-2.302
ses-2.103
 forces-2.061
 actions-1.983
advertisement-1.949
Token per
Token ID583
Feature activation+9.90e-01

Pos logit contributions
etheless+10.239
osate+10.097
 matter+9.443
ufact+9.077
veyard+8.910
Neg logit contributions
 initials-8.441
ses-8.407
MSN-8.022
advertisement-7.930
parser-7.533
Token play
Token ID711
Feature activation-5.16e-01

Pos logit contributions
etheless+2.420
osate+2.390
ufact+2.136
 Lauder+2.024
veyard+1.946
Neg logit contributions
 initials-3.120
ses-2.965
 forces-2.858
advertisement-2.812
parser-2.747
Token),
Token ID828
Feature activation+2.38e-01

Pos logit contributions
 initials+1.140
ses+1.051
 forces+1.028
advertisement+0.974
 actions+0.970
Neg logit contributions
osate-1.804
etheless-1.775
ufact-1.656
 Lauder-1.569
veyard-1.536
Token and
Token ID290
Feature activation-4.57e-01

Pos logit contributions
osate+0.744
etheless+0.735
ufact+0.678
 Lauder+0.641
veyard+0.620
Neg logit contributions
 initials-0.599
ses-0.565
 forces-0.548
advertisement-0.527
 actions-0.522
Token fourth
Token ID5544
Feature activation+1.57e+00

Pos logit contributions
 initials+1.276
ses+1.196
 forces+1.180
 actions+1.129
advertisement+1.127
Neg logit contributions
osate-1.334
etheless-1.305
ufact-1.201
 Lauder-1.122
veyard-1.095
Token conducting
Token ID14523
Feature activation-2.34e-01

Pos logit contributions
 initials+2.843
ses+2.627
 forces+2.620
 actions+2.492
advertisement+2.439
Neg logit contributions
osate-3.803
etheless-3.701
ufact-3.429
 Lauder-3.185
veyard-3.164
Token searches
Token ID15455
Feature activation+2.30e-01

Pos logit contributions
 initials+0.573
ses+0.534
 forces+0.520
advertisement+0.500
 actions+0.492
Neg logit contributions
osate-0.757
etheless-0.741
ufact-0.689
 Lauder-0.654
veyard-0.636
Token for
Token ID329
Feature activation+2.23e-01

Pos logit contributions
osate+0.768
etheless+0.761
ufact+0.703
 Lauder+0.671
veyard+0.649
Neg logit contributions
 initials-0.529
ses-0.491
 forces-0.466
advertisement-0.459
 actions-0.446
Token material
Token ID2587
Feature activation+1.45e+00

Pos logit contributions
osate+0.728
etheless+0.718
ufact+0.666
 Lauder+0.640
veyard+0.620
Neg logit contributions
 initials-0.524
ses-0.489
 forces-0.467
advertisement-0.461
 actions-0.446
Token that
Token ID326
Feature activation+2.14e+00

Pos logit contributions
osate+4.418
etheless+4.372
ufact+4.100
 Lauder+3.935
 matter+3.881
Neg logit contributions
 initials-3.468
ses-3.215
advertisement-3.166
official-2.998
 forces-2.975
Token might
Token ID1244
Feature activation+3.85e+00

Pos logit contributions
osate+4.558
etheless+4.431
ufact+4.071
 Lauder+4.008
 matter+3.875
Neg logit contributions
 initials-6.750
advertisement-6.396
ses-6.287
parser-6.104
official-6.045
Token help
Token ID1037
Feature activation+9.68e-01

Pos logit contributions
etheless+6.883
 matter+6.662
osate+6.392
 comprise+6.364
 signify+6.115
Neg logit contributions
 initials-12.272
advertisement-11.660
ses-11.518
Choice-11.489
MSN-11.410
Token us
Token ID514
Feature activation+5.52e-01

Pos logit contributions
osate+3.197
etheless+3.160
ufact+2.943
 Lauder+2.849
veyard+2.739
Neg logit contributions
 initials-2.171
ses-2.019
advertisement-1.912
 forces-1.899
 actions-1.827
Token with
Token ID351
Feature activation-5.36e-02

Pos logit contributions
osate+1.467
etheless+1.453
ufact+1.328
 Lauder+1.256
veyard+1.202
Neg logit contributions
 initials-1.618
ses-1.532
 forces-1.485
advertisement-1.458
 actions-1.432
Token this
Token ID428
Feature activation+1.16e+00

Pos logit contributions
 initials+0.131
ses+0.122
 forces+0.120
advertisement+0.114
 actions+0.114
Neg logit contributions
osate-0.173
etheless-0.170
ufact-0.158
 Lauder-0.150
veyard-0.146
Token,
Token ID11
Feature activation+7.67e-01

Pos logit contributions
osate+3.378
etheless+3.340
ufact+3.115
 Lauder+2.989
 matter+2.965
Neg logit contributions
 initials-2.994
ses-2.794
 forces-2.700
advertisement-2.658
official-2.572
Token package
Token ID5301
Feature activation-1.11e-02

Pos logit contributions
osate+1.183
etheless+1.160
ufact+1.066
 Lauder+1.004
veyard+0.970
Neg logit contributions
 initials-1.113
ses-1.042
 forces-1.023
advertisement-0.987
 actions-0.979
Token of
Token ID286
Feature activation+1.37e-01

Pos logit contributions
 initials+0.028
ses+0.027
 forces+0.026
advertisement+0.025
 actions+0.025
Neg logit contributions
osate-0.035
etheless-0.034
ufact-0.032
 Lauder-0.030
veyard-0.029
Token potential
Token ID2785
Feature activation+1.16e+00

Pos logit contributions
osate+0.432
etheless+0.425
ufact+0.393
 Lauder+0.373
veyard+0.359
Neg logit contributions
 initials-0.343
ses-0.319
 forces-0.310
advertisement-0.304
 actions-0.293
Token arms
Token ID5101
Feature activation-6.66e-02

Pos logit contributions
osate+3.552
etheless+3.447
ufact+3.201
 Lauder+3.088
 Scotia+2.914
Neg logit contributions
 initials-2.909
ses-2.704
advertisement-2.636
 forces-2.596
parser-2.496
Token sales
Token ID4200
Feature activation+9.11e-01

Pos logit contributions
 initials+0.160
ses+0.148
 forces+0.144
advertisement+0.139
 actions+0.137
Neg logit contributions
osate-0.219
etheless-0.215
ufact-0.201
 Lauder-0.190
veyard-0.185
Token worth
Token ID2861
Feature activation+3.81e+00

Pos logit contributions
osate+3.054
etheless+3.036
ufact+2.833
 Lauder+2.680
veyard+2.605
Neg logit contributions
 initials-2.024
ses-1.873
advertisement-1.772
 forces-1.768
official-1.679
Token more
Token ID517
Feature activation+9.43e-01

Pos logit contributions
etheless+9.726
osate+9.592
 Lauder+8.835
ufact+8.822
 matter+8.381
Neg logit contributions
 initials-9.106
advertisement-8.669
ses-8.602
Reviewer-8.425
parser-8.330
Token than
Token ID621
Feature activation+8.78e-01

Pos logit contributions
osate+2.777
etheless+2.726
ufact+2.520
 Lauder+2.410
 Scotia+2.309
Neg logit contributions
 initials-2.481
ses-2.335
 forces-2.247
advertisement-2.192
 actions-2.139
Token $
Token ID720
Feature activation+1.63e+00

Pos logit contributions
osate+2.618
etheless+2.586
ufact+2.380
 Lauder+2.272
 Scotia+2.159
Neg logit contributions
 initials-2.298
ses-2.134
 forces-2.077
advertisement-2.063
 actions-1.988
Token3
Token ID18
Feature activation+5.17e-01

Pos logit contributions
osate+4.535
etheless+4.327
ufact+4.005
 Lauder+3.880
veyard+3.684
Neg logit contributions
 initials-4.594
ses-4.161
 forces-4.145
 actions-3.954
advertisement-3.864
Token.
Token ID13
Feature activation+9.59e-01

Pos logit contributions
osate+1.163
etheless+1.128
ufact+1.010
 Lauder+0.921
veyard+0.891
Neg logit contributions
 initials-1.781
ses-1.696
 forces-1.669
advertisement-1.615
 actions-1.609
Tokenick
Token ID624
Feature activation+2.33e-01

Pos logit contributions
 initials+1.057
ses+1.002
 forces+0.987
 actions+0.949
advertisement+0.948
Neg logit contributions
osate-0.889
etheless-0.875
ufact-0.794
 Lauder-0.734
veyard-0.710
Tokenory
Token ID652
Feature activation+1.69e+00

Pos logit contributions
osate+0.712
etheless+0.694
ufact+0.644
 Lauder+0.603
veyard+0.587
Neg logit contributions
 initials-0.620
ses-0.577
 forces-0.570
advertisement-0.548
 actions-0.544
Token,
Token ID11
Feature activation+6.13e-01

Pos logit contributions
osate+5.100
etheless+4.899
ufact+4.556
 Lauder+4.288
 matter+4.178
Neg logit contributions
 initials-4.239
advertisement-3.905
ses-3.889
 forces-3.857
official-3.716
Token apple
Token ID17180
Feature activation+1.59e+00

Pos logit contributions
osate+1.919
etheless+1.866
ufact+1.736
 Lauder+1.645
veyard+1.590
Neg logit contributions
 initials-1.556
ses-1.443
 forces-1.426
advertisement-1.381
 actions-1.359
Token,
Token ID11
Feature activation+5.60e-01

Pos logit contributions
osate+4.721
etheless+4.559
ufact+4.270
 Lauder+4.032
omas+3.876
Neg logit contributions
 initials-4.076
ses-3.799
 forces-3.745
advertisement-3.713
 actions-3.569
Token pear
Token ID25286
Feature activation+3.78e+00

Pos logit contributions
osate+1.772
etheless+1.727
ufact+1.605
 Lauder+1.526
veyard+1.478
Neg logit contributions
 initials-1.402
ses-1.300
 forces-1.287
advertisement-1.240
 actions-1.224
Token,
Token ID11
Feature activation+8.67e-01

Pos logit contributions
osate+9.678
etheless+9.113
ufact+8.622
 Lauder+8.173
ige+8.135
Neg logit contributions
 initials-9.507
advertisement-9.095
 forces-8.913
ses-8.873
 actions-8.428
Token plum
Token ID22802
Feature activation+2.44e+00

Pos logit contributions
osate+2.692
etheless+2.617
ufact+2.431
 Lauder+2.326
veyard+2.236
Neg logit contributions
 initials-2.183
ses-2.027
 forces-2.012
advertisement-1.954
 actions-1.913
Token,
Token ID11
Feature activation+1.03e+00

Pos logit contributions
osate+6.921
etheless+6.589
ufact+6.111
 Lauder+5.810
omas+5.686
Neg logit contributions
 initials-6.352
advertisement-5.919
ses-5.880
 forces-5.875
official-5.580
Token and
Token ID290
Feature activation-3.79e-01

Pos logit contributions
osate+3.192
etheless+3.100
ufact+2.879
 Lauder+2.755
veyard+2.648
Neg logit contributions
 initials-2.602
ses-2.413
 forces-2.397
advertisement-2.343
 actions-2.282
Token paw
Token ID41827
Feature activation+2.12e+00

Pos logit contributions
 initials+0.981
ses+0.918
 forces+0.900
 actions+0.858
advertisement+0.851
Neg logit contributions
osate-1.186
etheless-1.165
ufact-1.078
 Lauder-1.008
veyard-0.988
Token-
Token ID12
Feature activation-9.73e-01

Pos logit contributions
osate+1.311
etheless+1.286
ufact+1.182
 Lauder+1.110
veyard+1.073
Neg logit contributions
 initials-1.211
ses-1.130
 forces-1.112
advertisement-1.067
 actions-1.064
Tokenag
Token ID363
Feature activation-9.42e-01

Pos logit contributions
 initials+2.444
ses+2.353
 forces+2.273
advertisement+2.210
 actions+2.160
Neg logit contributions
osate-3.056
etheless-2.976
ufact-2.781
 Lauder-2.625
veyard-2.566
Tokenitated
Token ID13939
Feature activation-4.42e-01

Pos logit contributions
 initials+2.501
ses+2.394
 forces+2.327
 actions+2.237
advertisement+2.228
Neg logit contributions
osate-2.828
etheless-2.785
ufact-2.565
 Lauder-2.419
veyard-2.345
Token editor
Token ID5464
Feature activation-1.01e+00

Pos logit contributions
 initials+1.107
ses+1.031
 forces+1.013
 actions+0.965
advertisement+0.963
Neg logit contributions
osate-1.418
etheless-1.391
ufact-1.291
 Lauder-1.214
veyard-1.187
Token
Token ID960
Feature activation+8.37e-01

Pos logit contributions
 initials+2.685
ses+2.511
 forces+2.479
 actions+2.388
advertisement+2.340
Neg logit contributions
osate-3.075
etheless-3.002
ufact-2.783
 Lauder-2.583
veyard-2.560
Tokenmaybe
Token ID25991
Feature activation+3.77e+00

Pos logit contributions
osate+2.593
etheless+2.564
ufact+2.348
 Lauder+2.238
veyard+2.172
Neg logit contributions
 initials-2.124
ses-1.926
 forces-1.919
 actions-1.818
advertisement-1.787
Token these
Token ID777
Feature activation-8.29e-02

Pos logit contributions
etheless+10.232
osate+10.152
 Lauder+9.230
ufact+9.066
 Scotia+8.927
Neg logit contributions
 initials-8.461
ses-8.207
advertisement-7.839
 forces-7.512
Position-7.480
Token two
Token ID734
Feature activation-2.53e-01

Pos logit contributions
 initials+0.207
ses+0.193
 forces+0.189
advertisement+0.181
 actions+0.180
Neg logit contributions
osate-0.265
etheless-0.261
ufact-0.241
 Lauder-0.228
veyard-0.222
Token guys
Token ID3730
Feature activation-3.72e-01

Pos logit contributions
 initials+0.675
ses+0.630
 forces+0.623
 actions+0.596
advertisement+0.588
Neg logit contributions
osate-0.779
etheless-0.759
ufact-0.704
 Lauder-0.657
veyard-0.646
Token
Token ID447
Feature activation-5.04e-01

Pos logit contributions
 initials+1.061
ses+0.996
 forces+0.987
 actions+0.946
advertisement+0.938
Neg logit contributions
osate-1.073
etheless-1.042
ufact-0.960
 Lauder-0.893
veyard-0.875
Token
Token ID247
Feature activation+2.62e-01

Pos logit contributions
 initials+1.250
ses+1.162
 forces+1.144
advertisement+1.087
 actions+1.087
Neg logit contributions
osate-1.629
etheless-1.599
ufact-1.487
 Lauder-1.398
veyard-1.365
Token
Token ID198
Feature activation-6.74e-01

Pos logit contributions
osate+3.015
etheless+2.749
 Lauder+2.600
ufact+2.569
 matter+2.486
Neg logit contributions
 initials-3.610
ses-3.291
 forces-3.209
 actions-3.100
advertisement-3.069
Token+
Token ID10
Feature activation-1.29e+00

Pos logit contributions
 initials+1.590
ses+1.474
 forces+1.447
 actions+1.372
advertisement+1.372
Neg logit contributions
osate-2.261
etheless-2.221
ufact-2.066
 Lauder-1.951
veyard-1.908
Token Self
Token ID12189
Feature activation+2.40e-01

Pos logit contributions
 initials+3.080
ses+2.900
 forces+2.864
 actions+2.730
advertisement+2.696
Neg logit contributions
osate-4.224
etheless-4.172
ufact-3.835
veyard-3.583
 Lauder-3.536
Tokenless
Token ID1203
Feature activation+1.16e+00

Pos logit contributions
osate+0.573
etheless+0.557
ufact+0.502
 Lauder+0.466
veyard+0.441
Neg logit contributions
 initials-0.799
ses-0.746
 forces-0.736
advertisement-0.718
 actions-0.709
Token,
Token ID11
Feature activation-3.80e-01

Pos logit contributions
osate+3.381
etheless+3.338
ufact+3.052
 Lauder+2.920
 Scotia+2.770
Neg logit contributions
 initials-3.138
ses-2.954
 forces-2.788
advertisement-2.723
official-2.632
Token Moral
Token ID39020
Feature activation+3.63e+00

Pos logit contributions
 initials+0.934
ses+0.868
 forces+0.852
 actions+0.810
advertisement+0.810
Neg logit contributions
osate-1.238
etheless-1.215
ufact-1.129
 Lauder-1.064
veyard-1.039
Token,
Token ID11
Feature activation+5.73e-01

Pos logit contributions
osate+9.620
etheless+9.280
ufact+8.250
 Lauder+8.164
idge+7.987
Neg logit contributions
 initials-9.669
ses-8.895
advertisement-8.599
 forces-8.155
official-7.907
Token Vigil
Token ID39840
Feature activation+2.19e+00

Pos logit contributions
osate+1.820
etheless+1.774
ufact+1.667
 Lauder+1.604
 Scotia+1.530
Neg logit contributions
 initials-1.423
ses-1.301
 forces-1.267
advertisement-1.229
 actions-1.197
Tokenant
Token ID415
Feature activation+1.35e+00

Pos logit contributions
osate+5.925
etheless+5.809
ufact+5.444
 Lauder+5.145
isance+4.936
Neg logit contributions
 initials-6.213
ses-5.706
 forces-5.446
advertisement-5.251
 actions-5.219
Token
Token ID198
Feature activation+1.05e+00

Pos logit contributions
osate+3.871
etheless+3.806
ufact+3.434
 Lauder+3.301
veyard+3.102
Neg logit contributions
 initials-3.777
ses-3.523
 forces-3.316
advertisement-3.286
 actions-3.150
Token
Token ID198
Feature activation-8.68e-01

Pos logit contributions
osate+2.692
etheless+2.475
 Lauder+2.318
ufact+2.316
 matter+2.216
Neg logit contributions
 initials-3.133
ses-2.852
 forces-2.786
 actions-2.690
advertisement-2.675
Token ultimately
Token ID6165
Feature activation+9.07e-01

Pos logit contributions
osate+0.263
etheless+0.256
ufact+0.233
 Lauder+0.216
veyard+0.209
Neg logit contributions
 initials-0.318
ses-0.300
 forces-0.296
 actions-0.285
advertisement-0.285
Token decide
Token ID5409
Feature activation+5.85e-01

Pos logit contributions
osate+2.039
etheless+1.995
ufact+1.795
 Lauder+1.647
veyard+1.576
Neg logit contributions
 initials-3.053
ses-2.917
 forces-2.850
advertisement-2.787
 actions-2.744
Token the
Token ID262
Feature activation-6.62e-01

Pos logit contributions
osate+1.745
etheless+1.711
ufact+1.601
 Lauder+1.527
 matter+1.466
Neg logit contributions
 initials-1.508
ses-1.423
 forces-1.364
advertisement-1.340
parser-1.286
Token validity
Token ID19648
Feature activation+8.04e-01

Pos logit contributions
 initials+1.746
ses+1.632
 forces+1.606
 actions+1.547
advertisement+1.518
Neg logit contributions
osate-2.061
etheless-2.007
ufact-1.858
 Lauder-1.744
veyard-1.713
Token and
Token ID290
Feature activation-5.84e-01

Pos logit contributions
osate+2.387
etheless+2.314
ufact+2.132
 Lauder+1.990
veyard+1.955
Neg logit contributions
 initials-2.130
ses-2.003
 forces-1.940
advertisement-1.905
parser-1.844
Token worth
Token ID2861
Feature activation+3.59e+00

Pos logit contributions
 initials+1.357
ses+1.256
 forces+1.232
advertisement+1.168
 actions+1.168
Neg logit contributions
osate-1.979
etheless-1.944
ufact-1.811
 Lauder-1.711
veyard-1.673
Tokeniness
Token ID1272
Feature activation+1.30e+00

Pos logit contributions
osate+8.973
etheless+8.585
ufact+7.654
 matter+7.439
veyard+7.369
Neg logit contributions
 initials-9.432
ses-8.653
MSN-8.521
 forces-8.445
parser-8.444
Token of
Token ID286
Feature activation+2.03e-01

Pos logit contributions
osate+3.616
etheless+3.507
ufact+3.187
 Lauder+3.007
 matter+2.976
Neg logit contributions
 initials-3.544
ses-3.353
 forces-3.211
advertisement-3.177
parser-3.139
Token a
Token ID257
Feature activation+1.22e+00

Pos logit contributions
osate+0.650
etheless+0.642
ufact+0.599
 Lauder+0.570
veyard+0.548
Neg logit contributions
 initials-0.484
ses-0.451
 forces-0.439
advertisement-0.421
parser-0.409
Token project
Token ID1628
Feature activation-4.77e-01

Pos logit contributions
osate+2.519
etheless+2.495
ufact+2.246
 Lauder+2.048
 matter+1.967
Neg logit contributions
 initials-4.198
ses-3.981
 forces-3.947
advertisement-3.807
parser-3.794
Token by
Token ID416
Feature activation-9.28e-01

Pos logit contributions
 initials+1.138
ses+1.056
 forces+1.036
advertisement+0.983
 actions+0.983
Neg logit contributions
osate-1.584
etheless-1.555
ufact-1.446
 Lauder-1.365
veyard-1.334
Token,
Token ID11
Feature activation+2.59e-01

Pos logit contributions
osate+0.222
etheless+0.218
ufact+0.202
 Lauder+0.193
veyard+0.186
Neg logit contributions
 initials-0.166
ses-0.154
 forces-0.150
advertisement-0.144
 actions-0.142
Token noted
Token ID4367
Feature activation+3.26e-01

Pos logit contributions
osate+0.783
etheless+0.773
ufact+0.715
 Lauder+0.678
veyard+0.652
Neg logit contributions
 initials-0.676
ses-0.636
 forces-0.619
advertisement-0.601
 actions-0.592
Token Kap
Token ID27344
Feature activation+2.07e-01

Pos logit contributions
osate+1.067
etheless+1.044
ufact+0.970
 Lauder+0.933
veyard+0.901
Neg logit contributions
 initials-0.778
ses-0.725
 forces-0.702
advertisement-0.683
 actions-0.668
Tokenur
Token ID333
Feature activation+1.09e+00

Pos logit contributions
osate+0.429
etheless+0.418
ufact+0.369
 Lauder+0.334
veyard+0.320
Neg logit contributions
 initials-0.750
ses-0.716
 forces-0.708
 actions-0.685
advertisement-0.685
Token and
Token ID290
Feature activation-1.09e+00

Pos logit contributions
osate+3.317
etheless+3.260
ufact+3.005
 Lauder+2.922
 matter+2.772
Neg logit contributions
 initials-2.731
ses-2.568
 forces-2.452
advertisement-2.388
 actions-2.345
Token Va
Token ID17668
Feature activation+3.59e+00

Pos logit contributions
 initials+3.274
 forces+3.083
ses+3.073
 actions+2.950
advertisement+2.877
Neg logit contributions
osate-2.975
etheless-2.890
ufact-2.665
 Lauder-2.414
veyard-2.386
Tokenish
Token ID680
Feature activation+2.60e+00

Pos logit contributions
osate+5.031
ige+4.153
idge+4.144
isance+3.913
issance+3.762
Neg logit contributions
 initials-14.363
advertisement-12.868
 OFFIC-12.747
MSN-12.694
 actions-12.638
Tokennav
Token ID28341
Feature activation+1.50e+00

Pos logit contributions
osate+7.489
etheless+7.029
ufact+6.615
uras+6.411
ige+6.354
Neg logit contributions
 initials-6.794
 forces-5.821
ses-5.732
 actions-5.654
advertisement-5.500
Token.
Token ID13
Feature activation+1.28e-01

Pos logit contributions
osate+4.430
etheless+4.382
ufact+3.952
 Lauder+3.858
 matter+3.729
Neg logit contributions
 initials-3.784
ses-3.581
 forces-3.405
advertisement-3.355
parser-3.284
Token\n
Token ID198
Feature activation+1.07e+00

Pos logit contributions
osate+0.398
etheless+0.389
ufact+0.360
 Lauder+0.341
veyard+0.331
Neg logit contributions
 initials-0.333
ses-0.310
 forces-0.305
 actions-0.291
advertisement-0.291
Token\n
Token ID198
Feature activation-6.25e-01

Pos logit contributions
osate+2.768
etheless+2.542
 Lauder+2.372
ufact+2.371
 Scotia+2.266
Neg logit contributions
 initials-3.200
ses-2.912
 forces-2.850
 actions-2.747
advertisement-2.708
Token some
Token ID617
Feature activation+7.00e-01

Pos logit contributions
 initials+0.313
ses+0.291
 forces+0.283
advertisement+0.272
 actions+0.269
Neg logit contributions
osate-0.420
etheless-0.413
ufact-0.384
 Lauder-0.365
veyard-0.353
Token people
Token ID661
Feature activation+6.55e-01

Pos logit contributions
osate+2.180
etheless+2.154
ufact+1.992
 Lauder+1.921
veyard+1.825
Neg logit contributions
 initials-1.741
ses-1.612
 forces-1.538
advertisement-1.526
official-1.475
Token out
Token ID503
Feature activation+1.33e+00

Pos logit contributions
osate+2.094
etheless+2.071
ufact+1.906
 Lauder+1.835
veyard+1.752
Neg logit contributions
 initials-1.594
ses-1.477
 forces-1.426
advertisement-1.384
 actions-1.360
Token there
Token ID612
Feature activation+9.73e-01

Pos logit contributions
osate+4.379
etheless+4.272
ufact+3.967
 Lauder+3.884
 matter+3.730
Neg logit contributions
 initials-2.997
ses-2.673
 forces-2.593
advertisement-2.541
 actions-2.452
Token who
Token ID508
Feature activation+1.10e+00

Pos logit contributions
osate+2.909
etheless+2.906
ufact+2.664
 Lauder+2.542
 matter+2.420
Neg logit contributions
 initials-2.522
ses-2.337
 forces-2.257
advertisement-2.205
official-2.152
Token might
Token ID1244
Feature activation+3.56e+00

Pos logit contributions
osate+2.699
etheless+2.665
ufact+2.396
 Lauder+2.229
veyard+2.117
Neg logit contributions
 initials-3.510
ses-3.319
 forces-3.238
advertisement-3.168
 actions-3.119
Token be
Token ID307
Feature activation+7.73e-01

Pos logit contributions
etheless+8.166
osate+7.620
 matter+7.281
 comprise+6.905
ufact+6.903
Neg logit contributions
 initials-10.390
ses-9.751
advertisement-9.470
MSN-9.457
 forces-9.256
Token offended
Token ID22231
Feature activation+1.11e+00

Pos logit contributions
osate+2.246
etheless+2.231
ufact+2.048
 Lauder+1.957
veyard+1.854
Neg logit contributions
 initials-2.081
ses-1.933
 forces-1.888
advertisement-1.854
 actions-1.797
Token by
Token ID416
Feature activation-6.38e-01

Pos logit contributions
etheless+3.299
osate+3.290
ufact+3.013
 Lauder+2.904
 matter+2.774
Neg logit contributions
 initials-2.887
ses-2.666
 forces-2.571
advertisement-2.555
parser-2.456
Token the
Token ID262
Feature activation-4.07e-01

Pos logit contributions
 initials+1.521
ses+1.413
 forces+1.392
 actions+1.324
advertisement+1.315
Neg logit contributions
osate-2.122
etheless-2.082
ufact-1.939
 Lauder-1.821
veyard-1.791
Token reporter
Token ID9095
Feature activation-1.17e+00

Pos logit contributions
 initials+1.097
ses+1.029
 forces+1.012
 actions+0.970
advertisement+0.964
Neg logit contributions
osate-1.235
etheless-1.209
ufact-1.116
 Lauder-1.043
veyard-1.021
Token forward
Token ID2651
Feature activation-9.65e-01

Pos logit contributions
 initials+0.761
ses+0.700
 forces+0.687
 actions+0.648
advertisement+0.645
Neg logit contributions
osate-1.249
etheless-1.227
ufact-1.147
 Lauder-1.086
veyard-1.065
Token and
Token ID290
Feature activation-1.32e+00

Pos logit contributions
 initials+2.204
ses+2.037
 forces+2.025
 actions+1.910
advertisement+1.871
Neg logit contributions
osate-3.318
etheless-3.236
ufact-3.026
 Lauder-2.848
veyard-2.803
Token,
Token ID11
Feature activation+1.36e+00

Pos logit contributions
 initials+4.083
 forces+3.899
ses+3.859
 actions+3.704
advertisement+3.605
Neg logit contributions
osate-3.489
etheless-3.404
ufact-3.114
 Lauder-2.782
veyard-2.753
Token while
Token ID981
Feature activation-4.32e-01

Pos logit contributions
etheless+4.212
osate+4.199
ufact+3.850
 Lauder+3.792
 matter+3.655
Neg logit contributions
 initials-3.192
ses-2.994
advertisement-2.820
 forces-2.777
official-2.732
Token K
Token ID509
Feature activation-4.50e-01

Pos logit contributions
 initials+1.055
ses+0.985
 forces+0.943
advertisement+0.935
 actions+0.905
Neg logit contributions
osate-1.384
etheless-1.374
ufact-1.262
 Lauder-1.218
veyard-1.172
Tokenuy
Token ID4669
Feature activation+3.54e+00

Pos logit contributions
 initials+1.391
ses+1.315
 forces+1.298
 actions+1.247
advertisement+1.245
Neg logit contributions
osate-1.175
etheless-1.150
ufact-1.045
 Lauder-0.969
veyard-0.939
Tokent
Token ID83
Feature activation+8.10e-01

Pos logit contributions
osate+9.415
etheless+8.564
idge+8.384
pton+8.294
rak+8.236
Neg logit contributions
 initials-9.139
advertisement-7.808
 forces-7.409
 ANGEL-7.406
 motivations-7.369
Token and
Token ID290
Feature activation-7.21e-01

Pos logit contributions
osate+2.270
etheless+2.252
ufact+2.072
 Lauder+1.976
pload+1.903
Neg logit contributions
 initials-2.242
ses-2.083
advertisement-2.019
 forces-1.987
 actions-1.937
Token Bruno
Token ID31045
Feature activation-3.44e-01

Pos logit contributions
 initials+1.721
ses+1.597
 forces+1.563
advertisement+1.490
 actions+1.485
Neg logit contributions
osate-2.393
etheless-2.350
ufact-2.185
 Lauder-2.069
veyard-2.017
Token Mart
Token ID3981
Feature activation+8.65e-02

Pos logit contributions
 initials+0.866
ses+0.808
 forces+0.795
 actions+0.757
advertisement+0.755
Neg logit contributions
osate-1.098
etheless-1.078
ufact-0.998
 Lauder-0.939
veyard-0.917
Tokenins
Token ID1040
Feature activation+1.40e-01

Pos logit contributions
osate+0.266
etheless+0.257
ufact+0.239
 Lauder+0.225
veyard+0.220
Neg logit contributions
 initials-0.229
ses-0.212
 forces-0.208
advertisement-0.201
 actions-0.199
Token of
Token ID286
Feature activation-9.70e-01

Pos logit contributions
osate+3.531
etheless+3.526
ufact+3.199
 Lauder+3.052
 matter+3.007
Neg logit contributions
 initials-3.030
ses-2.878
 forces-2.707
advertisement-2.661
 actions-2.616
Token logos
Token ID29645
Feature activation-1.15e+00

Pos logit contributions
 initials+2.409
 forces+2.242
ses+2.240
 actions+2.129
advertisement+2.103
Neg logit contributions
osate-3.127
etheless-3.046
ufact-2.832
 Lauder-2.657
veyard-2.625
Token,
Token ID11
Feature activation+4.12e-01

Pos logit contributions
 initials+2.608
ses+2.410
 forces+2.389
 actions+2.258
advertisement+2.253
Neg logit contributions
osate-3.916
etheless-3.852
ufact-3.550
 Lauder-3.386
veyard-3.336
Token of
Token ID286
Feature activation+2.52e+00

Pos logit contributions
osate+1.308
etheless+1.297
ufact+1.207
 Lauder+1.138
veyard+1.097
Neg logit contributions
 initials-1.012
ses-0.952
 forces-0.921
 actions-0.876
advertisement-0.876
Token the
Token ID262
Feature activation+8.73e-01

Pos logit contributions
etheless+7.910
osate+7.742
ufact+7.401
 Lauder+6.962
 matter+6.942
Neg logit contributions
 initials-5.290
ses-4.990
advertisement-4.697
 forces-4.661
MSN-4.594
Token moral
Token ID6573
Feature activation+3.50e+00

Pos logit contributions
osate+2.730
etheless+2.697
ufact+2.505
 Lauder+2.350
veyard+2.280
Neg logit contributions
 initials-2.167
ses-2.013
 forces-1.955
advertisement-1.908
 actions-1.866
Token and
Token ID290
Feature activation-5.05e-01

Pos logit contributions
osate+10.606
etheless+10.433
ufact+9.779
 matter+9.385
omas+9.368
Neg logit contributions
 initials-7.229
ses-6.720
advertisement-6.310
official-6.161
parser-6.092
Token intellectual
Token ID9028
Feature activation+1.86e+00

Pos logit contributions
 initials+1.269
ses+1.182
 forces+1.163
 actions+1.108
advertisement+1.106
Neg logit contributions
osate-1.616
etheless-1.584
ufact-1.468
 Lauder-1.383
veyard-1.352
Token standard
Token ID3210
Feature activation-2.33e-01

Pos logit contributions
osate+6.018
etheless+5.961
ufact+5.646
 Lauder+5.395
 matter+5.252
Neg logit contributions
 initials-4.005
ses-3.714
advertisement-3.522
parser-3.442
MSN-3.419
Token the
Token ID262
Feature activation+5.90e-01

Pos logit contributions
 initials+0.577
ses+0.537
 forces+0.528
 actions+0.502
advertisement+0.502
Neg logit contributions
osate-0.753
etheless-0.739
ufact-0.686
 Lauder-0.646
veyard-0.631
Token West
Token ID2688
Feature activation+6.06e-01

Pos logit contributions
osate+1.821
etheless+1.787
ufact+1.654
 Lauder+1.556
veyard+1.513
Neg logit contributions
 initials-1.523
ses-1.408
 forces-1.385
advertisement-1.345
 actions-1.326
Token the
Token ID262
Feature activation-1.99e-01

Pos logit contributions
 initials+0.531
ses+0.497
 forces+0.482
advertisement+0.467
 actions+0.459
Neg logit contributions
osate-0.678
etheless-0.664
ufact-0.619
 Lauder-0.588
veyard-0.566
Token firm
Token ID4081
Feature activation-3.43e-01

Pos logit contributions
 initials+0.534
ses+0.499
 forces+0.491
advertisement+0.473
 actions+0.469
Neg logit contributions
osate-0.596
etheless-0.583
ufact-0.537
 Lauder-0.508
veyard-0.491
Token under
Token ID739
Feature activation+4.91e-01

Pos logit contributions
 initials+0.919
ses+0.861
 forces+0.846
advertisement+0.809
 actions+0.808
Neg logit contributions
osate-1.035
etheless-1.014
ufact-0.937
 Lauder-0.880
veyard-0.856
Tokenwrote
Token ID42910
Feature activation-7.35e-01

Pos logit contributions
osate+1.217
etheless+1.184
ufact+1.072
 Lauder+0.999
veyard+0.954
Neg logit contributions
 initials-1.580
ses-1.482
 forces-1.464
advertisement-1.418
 actions-1.410
Token deals
Token ID7529
Feature activation-6.05e-01

Pos logit contributions
 initials+1.778
ses+1.651
 forces+1.621
 actions+1.540
advertisement+1.540
Neg logit contributions
osate-2.421
etheless-2.376
ufact-2.208
 Lauder-2.083
veyard-2.036
Token worth
Token ID2861
Feature activation+3.49e+00

Pos logit contributions
 initials+1.329
ses+1.225
 forces+1.199
advertisement+1.134
 actions+1.133
Neg logit contributions
osate-2.120
etheless-2.085
ufact-1.948
 Lauder-1.846
veyard-1.806
Token $
Token ID720
Feature activation+1.36e+00

Pos logit contributions
etheless+8.728
osate+8.584
 Lauder+8.082
ufact+8.071
 matter+7.618
Neg logit contributions
 initials-8.503
ses-8.250
advertisement-7.968
parser-7.870
Reviewer-7.783
Token131
Token ID22042
Feature activation+8.41e-01

Pos logit contributions
osate+3.852
etheless+3.692
ufact+3.433
 Lauder+3.298
veyard+3.139
Neg logit contributions
 initials-3.808
ses-3.488
 forces-3.451
 actions-3.300
advertisement-3.228
Token billion
Token ID2997
Feature activation+2.56e-01

Pos logit contributions
osate+1.698
etheless+1.639
ufact+1.449
 Lauder+1.317
veyard+1.253
Neg logit contributions
 initials-3.052
ses-2.935
 forces-2.863
 actions-2.763
advertisement-2.761
Token.
Token ID13
Feature activation-3.17e-02

Pos logit contributions
osate+0.849
etheless+0.839
ufact+0.782
 Lauder+0.744
veyard+0.720
Neg logit contributions
 initials-0.595
ses-0.554
 forces-0.536
advertisement-0.511
 actions-0.510
Token\n
Token ID198
Feature activation+1.09e+00

Pos logit contributions
 initials+0.083
ses+0.077
 forces+0.076
 actions+0.072
advertisement+0.072
Neg logit contributions
osate-0.098
etheless-0.096
ufact-0.089
 Lauder-0.084
veyard-0.081
Tokenyear
Token ID1941
Feature activation+1.09e+00

Pos logit contributions
 initials+1.708
ses+1.635
 forces+1.608
advertisement+1.552
 actions+1.544
Neg logit contributions
osate-1.533
etheless-1.469
ufact-1.357
 Lauder-1.274
veyard-1.232
Token contract
Token ID2775
Feature activation-1.90e-01

Pos logit contributions
etheless+3.065
osate+3.035
ufact+2.752
 Lauder+2.543
 Scotia+2.529
Neg logit contributions
 initials-2.994
ses-2.852
 forces-2.748
advertisement-2.722
 actions-2.633
Token that
Token ID326
Feature activation+1.45e+00

Pos logit contributions
 initials+0.441
ses+0.410
 forces+0.399
advertisement+0.381
 actions+0.378
Neg logit contributions
osate-0.638
etheless-0.632
ufact-0.589
 Lauder-0.556
veyard-0.541
Token offseason
Token ID16349
Feature activation+6.40e-01

Pos logit contributions
osate+3.807
etheless+3.785
ufact+3.512
 Lauder+3.401
 matter+3.225
Neg logit contributions
 initials-4.030
ses-3.845
advertisement-3.765
MSN-3.604
parser-3.588
Token,
Token ID11
Feature activation+3.45e-01

Pos logit contributions
etheless+2.001
osate+1.982
ufact+1.846
 Lauder+1.746
pload+1.683
Neg logit contributions
 initials-1.569
ses-1.490
 forces-1.405
advertisement-1.388
 actions-1.345
Token worth
Token ID2861
Feature activation+3.47e+00

Pos logit contributions
osate+1.053
etheless+1.048
ufact+0.974
 Lauder+0.920
veyard+0.890
Neg logit contributions
 initials-0.881
ses-0.830
 forces-0.799
advertisement-0.775
 actions-0.765
Token a
Token ID257
Feature activation+1.99e+00

Pos logit contributions
etheless+8.323
osate+7.891
 Lauder+7.577
ufact+7.360
 matter+7.197
Neg logit contributions
 initials-9.042
ses-8.853
advertisement-8.673
Reviewer-8.393
MSN-8.307
Token reported
Token ID2098
Feature activation+8.29e-01

Pos logit contributions
etheless+5.089
osate+5.008
ufact+4.496
 Lauder+4.302
 matter+4.188
Neg logit contributions
 initials-5.855
ses-5.567
 forces-5.399
advertisement-5.287
 actions-5.182
Token $
Token ID720
Feature activation+1.85e+00

Pos logit contributions
osate+2.570
etheless+2.538
ufact+2.348
 Lauder+2.226
veyard+2.135
Neg logit contributions
 initials-2.098
ses-1.966
 forces-1.923
advertisement-1.849
 actions-1.829
Token22
Token ID1828
Feature activation+1.01e+00

Pos logit contributions
osate+4.938
etheless+4.608
ufact+4.273
 Lauder+4.261
veyard+4.006
Neg logit contributions
 initials-5.317
 forces-4.769
ses-4.753
 actions-4.574
 motivations-4.438
Token million
Token ID1510
Feature activation+8.14e-01

Pos logit contributions
osate+2.623
etheless+2.580
ufact+2.287
 Lauder+2.145
veyard+2.106
Neg logit contributions
 initials-3.062
ses-2.930
 forces-2.838
 actions-2.723
advertisement-2.706
Token,
Token ID11
Feature activation-2.99e-01

Pos logit contributions
osate+1.585
etheless+1.523
ufact+1.439
 Lauder+1.358
veyard+1.296
Neg logit contributions
 initials-1.301
ses-1.195
 forces-1.180
advertisement-1.149
 actions-1.127
Token a
Token ID257
Feature activation+6.46e-01

Pos logit contributions
 initials+0.703
ses+0.650
 forces+0.635
advertisement+0.610
 actions+0.603
Neg logit contributions
osate-0.999
etheless-0.982
ufact-0.917
 Lauder-0.871
veyard-0.835
Token senior
Token ID4664
Feature activation-1.02e+00

Pos logit contributions
osate+1.996
etheless+1.961
ufact+1.828
 Lauder+1.730
 Scotia+1.646
Neg logit contributions
 initials-1.654
ses-1.531
 forces-1.492
advertisement-1.459
 actions-1.433
Token atmospheric
Token ID20938
Feature activation+1.06e+00

Pos logit contributions
 initials+2.418
ses+2.239
 forces+2.218
 actions+2.102
advertisement+2.076
Neg logit contributions
osate-3.404
etheless-3.332
ufact-3.084
 Lauder-2.911
veyard-2.889
Token,
Token ID11
Feature activation+6.08e-01

Pos logit contributions
osate+3.145
etheless+3.042
ufact+2.902
 Lauder+2.777
 matter+2.669
Neg logit contributions
 initials-2.741
ses-2.595
advertisement-2.473
 forces-2.409
official-2.357
Token ocean
Token ID9151
Feature activation+3.43e+00

Pos logit contributions
osate+1.689
etheless+1.655
ufact+1.519
 Lauder+1.429
veyard+1.348
Neg logit contributions
 initials-1.757
ses-1.658
 forces-1.592
advertisement-1.556
 actions-1.528
Tokenic
Token ID291
Feature activation+2.69e+00

Pos logit contributions
osate+9.174
etheless+8.801
ufact+8.246
 Lauder+7.663
 matter+7.581
Neg logit contributions
 initials-8.981
ses-8.264
advertisement-7.942
Alias-7.792
 forces-7.692
Token and
Token ID290
Feature activation+1.61e-01

Pos logit contributions
osate+7.183
etheless+6.964
ufact+6.680
 matter+6.242
 Lauder+6.208
Neg logit contributions
 initials-7.038
ses-6.575
advertisement-6.411
Alias-6.243
official-6.156
Token space
Token ID2272
Feature activation+1.22e+00

Pos logit contributions
osate+0.519
etheless+0.510
ufact+0.474
 Lauder+0.449
veyard+0.434
Neg logit contributions
 initials-0.396
ses-0.366
 forces-0.353
advertisement-0.343
 actions-0.337
Token sciences
Token ID19838
Feature activation+8.97e-01

Pos logit contributions
osate+3.200
etheless+3.155
ufact+2.954
 Lauder+2.707
 matter+2.626
Neg logit contributions
 initials-3.576
ses-3.333
advertisement-3.206
 forces-3.186
 actions-3.079
Token major
Token ID1688
Feature activation-2.04e-01

Pos logit contributions
osate+2.828
etheless+2.795
ufact+2.651
 Lauder+2.530
 matter+2.453
Neg logit contributions
 initials-2.100
ses-1.924
advertisement-1.882
 forces-1.837
MSN-1.782
Token used
Token ID973
Feature activation-3.37e-01

Pos logit contributions
osate+0.258
etheless+0.254
ufact+0.237
 Lauder+0.223
veyard+0.216
Neg logit contributions
 initials-0.203
ses-0.190
 forces-0.185
advertisement-0.179
 actions-0.175
Token for
Token ID329
Feature activation+1.38e-01

Pos logit contributions
 initials+0.751
ses+0.693
 forces+0.678
advertisement+0.643
 actions+0.642
Neg logit contributions
osate-1.170
etheless-1.150
ufact-1.074
 Lauder-1.016
veyard-0.994
Token religious
Token ID4158
Feature activation+7.85e-01

Pos logit contributions
osate+0.436
etheless+0.428
ufact+0.399
 Lauder+0.376
veyard+0.366
Neg logit contributions
 initials-0.346
ses-0.322
 forces-0.315
advertisement-0.304
 actions-0.299
Token rituals
Token ID25797
Feature activation+2.83e-01

Pos logit contributions
osate+2.552
etheless+2.437
ufact+2.312
 Lauder+2.233
 matter+2.146
Neg logit contributions
 initials-1.816
ses-1.714
advertisement-1.617
 forces-1.602
parser-1.550
Token,
Token ID11
Feature activation+9.43e-01

Pos logit contributions
osate+0.880
etheless+0.867
ufact+0.807
 Lauder+0.767
veyard+0.741
Neg logit contributions
 initials-0.714
ses-0.673
 forces-0.642
advertisement-0.626
 actions-0.616
Token medicinal
Token ID30381
Feature activation+3.41e+00

Pos logit contributions
osate+2.808
etheless+2.723
ufact+2.572
 Lauder+2.469
veyard+2.356
Neg logit contributions
 initials-2.402
ses-2.268
 forces-2.174
advertisement-2.101
parser-2.074
Token purposes
Token ID4959
Feature activation+2.71e+00

Pos logit contributions
osate+9.735
 matter+9.192
etheless+8.933
ufact+8.848
 Lauder+8.842
Neg logit contributions
 initials-7.232
advertisement-7.004
parser-6.780
ses-6.734
MSN-6.711
Token,
Token ID11
Feature activation+8.27e-01

Pos logit contributions
osate+6.552
etheless+6.539
ufact+6.009
 matter+5.657
 Lauder+5.593
Neg logit contributions
 initials-7.446
ses-7.116
advertisement-6.759
Position-6.646
MSN-6.619
Token or
Token ID393
Feature activation+4.63e-01

Pos logit contributions
osate+2.512
etheless+2.440
ufact+2.312
 Lauder+2.180
veyard+2.109
Neg logit contributions
 initials-2.087
ses-1.963
 forces-1.892
advertisement-1.831
 actions-1.798
Token even
Token ID772
Feature activation+4.82e-01

Pos logit contributions
osate+1.465
etheless+1.429
ufact+1.340
 Lauder+1.262
veyard+1.227
Neg logit contributions
 initials-1.148
ses-1.073
 forces-1.040
advertisement-1.008
 actions-0.989
Token electro
Token ID15206
Feature activation+6.32e-01

Pos logit contributions
osate+1.551
etheless+1.516
ufact+1.426
 Lauder+1.343
veyard+1.303
Neg logit contributions
 initials-1.154
ses-1.077
 forces-1.041
advertisement-1.008
 actions-0.987
Token,
Token ID11
Feature activation+9.38e-01

Pos logit contributions
osate+2.416
etheless+2.374
ufact+2.211
 Lauder+2.108
veyard+2.034
Neg logit contributions
 initials-1.936
ses-1.792
 forces-1.712
advertisement-1.702
 actions-1.649
Token especially
Token ID2592
Feature activation-1.08e+00

Pos logit contributions
osate+2.780
etheless+2.724
ufact+2.514
 Lauder+2.401
veyard+2.315
Neg logit contributions
 initials-2.514
ses-2.343
 forces-2.271
advertisement-2.182
 actions-2.177
Token those
Token ID883
Feature activation+4.34e-01

Pos logit contributions
 initials+2.316
ses+2.129
 forces+2.122
 actions+1.997
advertisement+1.964
Neg logit contributions
osate-3.881
etheless-3.793
ufact-3.539
 Lauder-3.327
veyard-3.277
Token living
Token ID2877
Feature activation+1.87e+00

Pos logit contributions
osate+1.356
etheless+1.347
ufact+1.246
 Lauder+1.188
veyard+1.151
Neg logit contributions
 initials-1.079
ses-1.005
 forces-0.973
advertisement-0.943
 actions-0.930
Token in
Token ID287
Feature activation+1.01e+00

Pos logit contributions
osate+6.027
etheless+6.027
ufact+5.736
 Lauder+5.503
 matter+5.305
Neg logit contributions
 initials-3.963
ses-3.713
 forces-3.460
advertisement-3.443
MSN-3.406
Token Wa
Token ID15329
Feature activation+3.41e+00

Pos logit contributions
osate+3.002
etheless+2.913
ufact+2.744
 Lauder+2.619
veyard+2.524
Neg logit contributions
 initials-2.626
ses-2.450
 forces-2.378
advertisement-2.328
 actions-2.271
Tokenik
Token ID1134
Feature activation+1.03e+00

Pos logit contributions
osate+5.347
ige+4.840
etheless+4.637
isance+4.610
ufact+4.560
Neg logit contributions
 initials-12.539
ses-11.611
 forces-11.320
advertisement-11.083
 actions-11.049
Tokeniki
Token ID5580
Feature activation+2.99e-01

Pos logit contributions
osate+1.957
etheless+1.859
ufact+1.661
 Lauder+1.477
veyard+1.457
Neg logit contributions
 initials-3.894
ses-3.644
 forces-3.614
advertisement-3.519
 actions-3.515
Token where
Token ID810
Feature activation+4.11e-01

Pos logit contributions
osate+0.976
etheless+0.957
ufact+0.893
 Lauder+0.843
veyard+0.822
Neg logit contributions
 initials-0.722
ses-0.672
 forces-0.654
advertisement-0.627
 actions-0.619
Token the
Token ID262
Feature activation-1.21e-01

Pos logit contributions
osate+1.313
etheless+1.289
ufact+1.197
 Lauder+1.139
veyard+1.107
Neg logit contributions
 initials-1.012
ses-0.945
 forces-0.912
advertisement-0.884
 actions-0.866
Token tourism
Token ID19277
Feature activation+7.28e-01

Pos logit contributions
 initials+0.340
ses+0.318
 forces+0.313
advertisement+0.301
 actions+0.300
Neg logit contributions
osate-0.348
etheless-0.340
ufact-0.313
 Lauder-0.294
veyard-0.285
Token million
Token ID1510
Feature activation+8.82e-01

Pos logit contributions
osate+2.537
etheless+2.474
ufact+2.289
 Lauder+2.138
veyard+2.080
Neg logit contributions
 initials-2.349
ses-2.198
 forces-2.145
advertisement-2.081
 actions-2.058
Token years
Token ID812
Feature activation+1.56e+00

Pos logit contributions
osate+2.151
etheless+2.076
ufact+1.888
 Lauder+1.739
veyard+1.676
Neg logit contributions
 initials-2.850
ses-2.699
 forces-2.637
advertisement-2.605
 actions-2.547
Token.
Token ID13
Feature activation+6.71e-01

Pos logit contributions
osate+4.819
etheless+4.719
ufact+4.371
 Lauder+4.115
veyard+4.007
Neg logit contributions
 initials-3.850
ses-3.602
advertisement-3.450
 forces-3.419
 actions-3.315
Token<|endoftext|>
Token ID50256
Feature activation-1.47e+00

Pos logit contributions
osate+2.028
etheless+1.933
ufact+1.821
 Lauder+1.746
 Scotia+1.669
Neg logit contributions
 initials-1.790
ses-1.659
 forces-1.627
advertisement-1.559
 actions-1.557
TokenImage
Token ID5159
Feature activation-8.09e-01

Pos logit contributions
 initials+3.283
ses+3.121
 forces+3.017
advertisement+2.967
 actions+2.859
Neg logit contributions
osate-5.031
etheless-4.993
ufact-4.659
 Lauder-4.371
veyard-4.234
Token copyright
Token ID6634
Feature activation-4.46e+00

Pos logit contributions
 initials+1.643
ses+1.489
 forces+1.482
 actions+1.389
advertisement+1.373
Neg logit contributions
osate-2.976
etheless-2.947
ufact-2.734
 Lauder-2.602
veyard-2.553
Token AFP
Token ID8916
Feature activation-7.57e-01

Pos logit contributions
 initials+9.659
 forces+9.259
ses+8.896
 actions+8.677
 Comed+8.357
Neg logit contributions
etheless-13.652
osate-13.247
ufact-12.263
veyard-11.767
hemer-11.002
Token\n
Token ID198
Feature activation+1.46e-01

Pos logit contributions
 initials+1.863
ses+1.731
 forces+1.704
 actions+1.621
advertisement+1.617
Neg logit contributions
osate-2.455
etheless-2.412
ufact-2.236
 Lauder-2.107
veyard-2.059
Token\n
Token ID198
Feature activation-1.23e+00

Pos logit contributions
osate+0.425
etheless+0.406
ufact+0.377
 Lauder+0.363
veyard+0.348
Neg logit contributions
 initials-0.406
ses-0.377
 forces-0.369
 actions-0.354
advertisement-0.353
TokenThe
Token ID464
Feature activation-3.08e-01

Pos logit contributions
 initials+2.765
ses+2.586
 forces+2.534
advertisement+2.432
 actions+2.398
Neg logit contributions
osate-4.194
etheless-4.165
ufact-3.884
 Lauder-3.630
veyard-3.556
Token Bank
Token ID5018
Feature activation+4.43e-01

Pos logit contributions
 initials+0.788
ses+0.736
 forces+0.721
advertisement+0.696
 actions+0.687
Neg logit contributions
osate-0.966
etheless-0.943
ufact-0.870
 Lauder-0.826
veyard-0.802
Token.
Token ID13
Feature activation+1.54e-01

Pos logit contributions
 initials+1.763
ses+1.630
 forces+1.605
 actions+1.520
advertisement+1.514
Neg logit contributions
osate-2.654
etheless-2.600
ufact-2.423
 Lauder-2.293
veyard-2.247
Token
Token ID447
Feature activation-7.31e-01

Pos logit contributions
osate+0.435
etheless+0.423
ufact+0.390
 Lauder+0.365
veyard+0.353
Neg logit contributions
 initials-0.447
ses-0.419
 forces-0.412
 actions-0.394
advertisement-0.394
Token
Token ID251
Feature activation-1.39e+00

Pos logit contributions
 initials+1.935
ses+1.810
 forces+1.785
advertisement+1.705
 actions+1.704
Neg logit contributions
osate-2.236
etheless-2.198
ufact-2.050
 Lauder-1.906
veyard-1.849
Token<|endoftext|>
Token ID50256
Feature activation-1.58e+00

Pos logit contributions
 initials+3.357
ses+3.142
 forces+3.099
 actions+2.967
advertisement+2.956
Neg logit contributions
osate-4.526
etheless-4.456
ufact-4.130
 Lauder-3.847
veyard-3.796
TokenImage
Token ID5159
Feature activation-8.11e-01

Pos logit contributions
 initials+3.528
ses+3.365
 forces+3.249
advertisement+3.212
 actions+3.080
Neg logit contributions
osate-5.376
etheless-5.336
ufact-4.981
 Lauder-4.665
veyard-4.520
Token copyright
Token ID6634
Feature activation-4.41e+00

Pos logit contributions
 initials+1.644
ses+1.489
 forces+1.483
 actions+1.390
advertisement+1.375
Neg logit contributions
osate-2.987
etheless-2.955
ufact-2.743
 Lauder-2.610
veyard-2.563
Token publicity
Token ID19403
Feature activation-4.53e-01

Pos logit contributions
 initials+9.564
 forces+9.163
ses+8.793
 actions+8.600
 Comed+8.259
Neg logit contributions
etheless-13.469
osate-13.146
ufact-12.153
veyard-11.661
hemer-10.865
Token Image
Token ID7412
Feature activation-1.24e+00

Pos logit contributions
 initials+1.111
ses+1.030
 forces+1.020
 actions+0.969
advertisement+0.964
Neg logit contributions
osate-1.478
etheless-1.451
ufact-1.346
 Lauder-1.264
veyard-1.243
Token caption
Token ID8305
Feature activation-3.37e+00

Pos logit contributions
 initials+2.270
 forces+2.057
ses+2.017
 actions+1.907
advertisement+1.868
Neg logit contributions
etheless-4.899
osate-4.866
ufact-4.506
 Lauder-4.240
veyard-4.189
Token The
Token ID383
Feature activation-5.15e-01

Pos logit contributions
 initials+7.779
ses+7.396
 forces+7.386
 Comed+7.012
 actions+6.977
Neg logit contributions
etheless-10.607
osate-10.205
ufact-9.421
veyard-8.855
 conclud-8.551
Token Greeks
Token ID25059
Feature activation-8.72e-02

Pos logit contributions
 initials+1.387
ses+1.297
 forces+1.278
 actions+1.221
advertisement+1.216
Neg logit contributions
osate-1.559
etheless-1.529
ufact-1.411
 Lauder-1.319
veyard-1.290
Tokent
Token ID83
Feature activation-8.85e-01

Pos logit contributions
 initials+2.378
ses+2.271
 forces+2.204
advertisement+2.121
 actions+2.109
Neg logit contributions
osate-3.266
etheless-3.191
ufact-2.985
 Lauder-2.804
veyard-2.736
Tokenweet
Token ID7277
Feature activation-7.66e-01

Pos logit contributions
 initials+2.124
ses+2.029
 forces+1.985
 actions+1.894
advertisement+1.883
Neg logit contributions
osate-2.892
etheless-2.852
ufact-2.635
 Lauder-2.494
veyard-2.414
Token/
Token ID14
Feature activation-5.95e-01

Pos logit contributions
 initials+2.246
ses+2.127
 forces+2.096
 actions+2.013
advertisement+2.010
Neg logit contributions
osate-2.110
etheless-2.064
ufact-1.890
 Lauder-1.763
veyard-1.710
Token<|endoftext|>
Token ID50256
Feature activation-1.64e+00

Pos logit contributions
 initials+1.547
ses+1.447
 forces+1.424
 actions+1.361
advertisement+1.360
Neg logit contributions
osate-1.843
etheless-1.808
ufact-1.674
 Lauder-1.572
veyard-1.534
TokenStory
Token ID11605
Feature activation-4.07e-01

Pos logit contributions
 initials+3.640
ses+3.481
 forces+3.356
advertisement+3.331
 actions+3.184
Neg logit contributions
osate-5.575
etheless-5.538
ufact-5.171
 Lauder-4.839
veyard-4.685
Token highlights
Token ID11330
Feature activation-4.40e+00

Pos logit contributions
 initials+0.893
ses+0.822
 forces+0.805
advertisement+0.760
 actions+0.760
Neg logit contributions
osate-1.430
etheless-1.405
ufact-1.313
 Lauder-1.243
veyard-1.217
Token Secretary
Token ID4986
Feature activation-4.40e-01

Pos logit contributions
 initials+9.798
 forces+9.499
ses+9.454
 Comed+9.326
 actions+9.066
Neg logit contributions
etheless-13.424
osate-12.485
ufact-11.839
veyard-10.864
 conclud-10.794
Token of
Token ID286
Feature activation+1.42e+00

Pos logit contributions
 initials+1.118
ses+1.042
 forces+1.024
advertisement+0.975
 actions+0.975
Neg logit contributions
osate-1.394
etheless-1.366
ufact-1.266
 Lauder-1.192
veyard-1.163
Token Defense
Token ID5947
Feature activation+2.17e-01

Pos logit contributions
osate+3.530
etheless+3.398
ufact+3.085
 Lauder+3.003
 Scotia+2.804
Neg logit contributions
 initials-4.387
ses-4.163
advertisement-4.035
 forces-3.955
parser-3.884
Token Chuck
Token ID14663
Feature activation-9.61e-01

Pos logit contributions
osate+0.688
etheless+0.675
ufact+0.623
 Lauder+0.594
veyard+0.576
Neg logit contributions
 initials-0.547
ses-0.508
 forces-0.496
advertisement-0.480
 actions-0.473
Token Hag
Token ID21375
Feature activation-5.46e-01

Pos logit contributions
 initials+2.521
ses+2.395
 forces+2.385
 actions+2.258
advertisement+2.192
Neg logit contributions
osate-2.963
etheless-2.882
ufact-2.711
 Lauder-2.477
pload-2.424
Token Serve
Token ID35557
Feature activation-7.00e-01

Pos logit contributions
 initials+1.337
ses+1.285
 forces+1.261
 actions+1.204
advertisement+1.203
Neg logit contributions
etheless-1.441
osate-1.423
ufact-1.304
 Lauder-1.204
veyard-1.177
Token hot
Token ID3024
Feature activation-1.23e+00

Pos logit contributions
 initials+1.865
ses+1.748
 forces+1.726
 actions+1.648
advertisement+1.637
Neg logit contributions
osate-2.148
etheless-2.110
ufact-1.945
 Lauder-1.821
veyard-1.779
Token!
Token ID0
Feature activation-7.93e-01

Pos logit contributions
 initials+2.121
ses+1.950
 forces+1.913
 actions+1.775
advertisement+1.735
Neg logit contributions
osate-4.877
etheless-4.810
ufact-4.561
 Lauder-4.286
veyard-4.247
Token<|endoftext|>
Token ID50256
Feature activation-1.48e+00

Pos logit contributions
 initials+2.058
ses+1.919
 forces+1.889
 actions+1.801
advertisement+1.800
Neg logit contributions
osate-2.473
etheless-2.423
ufact-2.244
 Lauder-2.109
veyard-2.058
TokenImage
Token ID5159
Feature activation-8.99e-01

Pos logit contributions
 initials+3.299
ses+3.138
 forces+3.031
advertisement+2.983
 actions+2.873
Neg logit contributions
osate-5.050
etheless-5.010
ufact-4.676
 Lauder-4.382
veyard-4.249
Token copyright
Token ID6634
Feature activation-4.38e+00

Pos logit contributions
 initials+1.815
ses+1.640
 forces+1.636
 actions+1.532
advertisement+1.512
Neg logit contributions
osate-3.323
etheless-3.290
ufact-3.051
 Lauder-2.903
veyard-2.851
Token ESA
Token ID42212
Feature activation+2.29e-01

Pos logit contributions
 initials+9.509
 forces+9.069
ses+8.747
 actions+8.514
 Comed+8.190
Neg logit contributions
etheless-13.485
osate-13.148
ufact-12.145
veyard-11.640
hemer-10.862
Token Image
Token ID7412
Feature activation-1.02e+00

Pos logit contributions
osate+0.752
etheless+0.731
ufact+0.685
 Lauder+0.653
veyard+0.629
Neg logit contributions
 initials-0.552
ses-0.513
 forces-0.496
advertisement-0.478
 actions-0.472
Token caption
Token ID8305
Feature activation-3.29e+00

Pos logit contributions
 initials+1.922
 forces+1.743
ses+1.707
 actions+1.614
advertisement+1.565
Neg logit contributions
osate-3.968
etheless-3.961
ufact-3.672
 Lauder-3.465
veyard-3.413
Token Go
Token ID1514
Feature activation+4.34e-01

Pos logit contributions
 initials+7.273
ses+6.979
 forces+6.969
 actions+6.579
 Comed+6.513
Neg logit contributions
etheless-10.663
osate-10.178
ufact-9.507
veyard-8.847
 conclud-8.613
Tokence
Token ID344
Feature activation+5.59e-01

Pos logit contributions
osate+1.403
etheless+1.351
ufact+1.258
 Lauder+1.201
veyard+1.174
Neg logit contributions
 initials-1.075
ses-0.997
 forces-0.969
advertisement-0.940
 actions-0.923
Token.
Token ID13
Feature activation+5.96e-02

Pos logit contributions
 initials+0.148
ses+0.137
 forces+0.133
 actions+0.126
advertisement+0.126
Neg logit contributions
osate-0.221
etheless-0.217
ufact-0.203
 Lauder-0.193
veyard-0.188
Token
Token ID447
Feature activation-1.16e+00

Pos logit contributions
osate+0.183
etheless+0.179
ufact+0.167
 Lauder+0.158
veyard+0.153
Neg logit contributions
 initials-0.156
ses-0.145
 forces-0.142
 actions-0.135
advertisement-0.135
Token
Token ID251
Feature activation-1.56e+00

Pos logit contributions
 initials+3.077
ses+2.879
 forces+2.830
advertisement+2.711
 actions+2.703
Neg logit contributions
osate-3.505
etheless-3.432
ufact-3.193
 Lauder-2.982
veyard-2.881
Token<|endoftext|>
Token ID50256
Feature activation-1.54e+00

Pos logit contributions
 initials+3.728
ses+3.524
 forces+3.487
 actions+3.312
advertisement+3.263
Neg logit contributions
osate-5.079
etheless-5.022
ufact-4.644
 Lauder-4.332
veyard-4.252
TokenImage
Token ID5159
Feature activation-8.65e-01

Pos logit contributions
 initials+3.430
ses+3.269
 forces+3.157
advertisement+3.118
 actions+2.992
Neg logit contributions
osate-5.236
etheless-5.197
ufact-4.848
 Lauder-4.544
veyard-4.403
Token copyright
Token ID6634
Feature activation-4.30e+00

Pos logit contributions
 initials+1.753
ses+1.586
 forces+1.582
 actions+1.482
advertisement+1.466
Neg logit contributions
osate-3.189
etheless-3.156
ufact-2.927
 Lauder-2.786
veyard-2.735
Token Deadline
Token ID35954
Feature activation+4.25e-01

Pos logit contributions
 initials+9.419
 forces+8.996
ses+8.665
 actions+8.437
 Comed+8.106
Neg logit contributions
etheless-13.186
osate-12.874
ufact-11.883
veyard-11.385
 Lauder-10.608
Token\n
Token ID198
Feature activation+2.36e-01

Pos logit contributions
osate+1.343
etheless+1.309
ufact+1.223
 Lauder+1.162
veyard+1.121
Neg logit contributions
 initials-1.083
ses-1.006
 forces-0.982
advertisement-0.938
 actions-0.937
Token\n
Token ID198
Feature activation-1.38e+00

Pos logit contributions
osate+0.682
etheless+0.648
ufact+0.602
 Lauder+0.584
veyard+0.557
Neg logit contributions
 initials-0.659
ses-0.611
 forces-0.599
 actions-0.573
advertisement-0.572
TokenPolice
Token ID9039
Feature activation-1.77e-01

Pos logit contributions
 initials+3.016
ses+2.819
 forces+2.753
advertisement+2.670
 actions+2.604
Neg logit contributions
osate-4.796
etheless-4.768
ufact-4.441
 Lauder-4.153
veyard-4.066
Token have
Token ID423
Feature activation-3.42e-02

Pos logit contributions
 initials+0.484
ses+0.454
 forces+0.444
advertisement+0.428
 actions+0.425
Neg logit contributions
osate-0.529
etheless-0.517
ufact-0.476
 Lauder-0.448
veyard-0.435
Tokenported
Token ID9213
Feature activation-1.22e-01

Pos logit contributions
 initials+1.916
ses+1.878
 forces+1.824
advertisement+1.788
official+1.761
Neg logit contributions
etheless-3.152
osate-3.097
ufact-2.943
 Lauder-2.810
veyard-2.693
Token License
Token ID13789
Feature activation-3.66e-01

Pos logit contributions
 initials+0.234
ses+0.214
 forces+0.207
advertisement+0.194
 actions+0.194
Neg logit contributions
osate-0.463
etheless-0.454
ufact-0.427
 Lauder-0.408
veyard-0.398
Token.
Token ID13
Feature activation+7.60e-01

Pos logit contributions
 initials+0.893
ses+0.830
 forces+0.811
advertisement+0.775
 actions+0.771
Neg logit contributions
osate-1.195
etheless-1.175
ufact-1.086
 Lauder-1.029
veyard-0.999
Token<|endoftext|>
Token ID50256
Feature activation-1.54e+00

Pos logit contributions
osate+2.324
etheless+2.240
ufact+2.091
 Lauder+2.005
veyard+1.915
Neg logit contributions
 initials-1.978
ses-1.837
 forces-1.790
advertisement-1.713
 actions-1.699
TokenImage
Token ID5159
Feature activation-7.07e-01

Pos logit contributions
 initials+3.429
ses+3.269
 forces+3.156
advertisement+3.114
 actions+2.991
Neg logit contributions
osate-5.254
etheless-5.217
ufact-4.867
 Lauder-4.560
veyard-4.417
Token copyright
Token ID6634
Feature activation-4.24e+00

Pos logit contributions
 initials+1.447
ses+1.315
 forces+1.305
 actions+1.224
advertisement+1.213
Neg logit contributions
osate-2.593
etheless-2.563
ufact-2.383
 Lauder-2.266
veyard-2.222
Token AP
Token ID3486
Feature activation-1.99e+00

Pos logit contributions
 initials+9.297
 forces+8.853
ses+8.544
 actions+8.316
official+7.955
Neg logit contributions
etheless-13.029
osate-12.725
ufact-11.762
veyard-11.241
 Lauder-10.520
Token Image
Token ID7412
Feature activation-1.23e+00

Pos logit contributions
 initials+4.431
 forces+4.074
ses+4.040
 actions+3.867
advertisement+3.774
Neg logit contributions
etheless-6.670
osate-6.666
ufact-6.141
 Lauder-5.798
veyard-5.667
Token caption
Token ID8305
Feature activation-3.49e+00

Pos logit contributions
 initials+2.211
 forces+2.012
ses+1.950
 actions+1.858
advertisement+1.808
Neg logit contributions
etheless-4.951
osate-4.885
ufact-4.525
 Lauder-4.237
veyard-4.205
Token Peace
Token ID12689
Feature activation+8.88e-01

Pos logit contributions
 initials+7.959
 forces+7.725
ses+7.621
 Comed+7.315
 actions+7.236
Neg logit contributions
etheless-10.932
osate-10.302
ufact-9.653
veyard-9.072
 conclud-8.954
Token talks
Token ID6130
Feature activation-2.12e-01

Pos logit contributions
osate+2.772
etheless+2.682
ufact+2.511
 Lauder+2.382
veyard+2.280
Neg logit contributions
 initials-2.266
ses-2.133
advertisement-2.024
 forces-2.016
 actions-1.932
Token vote
Token ID3015
Feature activation+7.47e-01

Pos logit contributions
osate+2.074
etheless+2.041
ufact+1.810
 Lauder+1.691
veyard+1.613
Neg logit contributions
 initials-3.055
ses-2.910
 forces-2.846
advertisement-2.780
 actions-2.740
Token accordingly
Token ID16062
Feature activation+1.99e-01

Pos logit contributions
osate+2.510
etheless+2.484
ufact+2.289
 Lauder+2.200
veyard+2.129
Neg logit contributions
 initials-1.707
ses-1.585
 forces-1.521
advertisement-1.479
 actions-1.435
Token.
Token ID13
Feature activation+3.77e-01

Pos logit contributions
osate+0.656
etheless+0.646
ufact+0.597
 Lauder+0.569
veyard+0.553
Neg logit contributions
 initials-0.476
ses-0.442
 forces-0.430
advertisement-0.413
 actions-0.406
Token<|endoftext|>
Token ID50256
Feature activation-1.59e+00

Pos logit contributions
osate+1.143
etheless+1.107
ufact+1.029
 Lauder+0.983
veyard+0.942
Neg logit contributions
 initials-1.007
ses-0.932
 forces-0.915
advertisement-0.876
 actions-0.872
TokenStory
Token ID11605
Feature activation-2.45e-01

Pos logit contributions
 initials+3.548
ses+3.389
 forces+3.271
advertisement+3.238
 actions+3.099
Neg logit contributions
osate-5.425
etheless-5.390
ufact-5.029
 Lauder-4.712
veyard-4.560
Token highlights
Token ID11330
Feature activation-4.18e+00

Pos logit contributions
 initials+0.541
ses+0.499
 forces+0.487
advertisement+0.461
 actions+0.460
Neg logit contributions
osate-0.855
etheless-0.840
ufact-0.785
 Lauder-0.743
veyard-0.728
Token Police
Token ID4287
Feature activation-6.06e-01

Pos logit contributions
 initials+9.418
 forces+9.105
ses+9.057
 Comed+8.863
 actions+8.671
Neg logit contributions
etheless-12.839
osate-11.986
ufact-11.329
veyard-10.409
 conclud-10.325
Token chief
Token ID4039
Feature activation-4.42e-01

Pos logit contributions
 initials+1.933
 forces+1.820
ses+1.817
 actions+1.748
advertisement+1.718
Neg logit contributions
osate-1.546
etheless-1.515
ufact-1.376
 Lauder-1.253
veyard-1.232
Token vows
Token ID36605
Feature activation+6.79e-01

Pos logit contributions
 initials+1.161
ses+1.085
 forces+1.066
advertisement+1.018
 actions+1.018
Neg logit contributions
osate-1.364
etheless-1.336
ufact-1.236
 Lauder-1.161
veyard-1.132
Token wanted
Token ID2227
Feature activation+4.08e-01

Pos logit contributions
osate+2.196
etheless+2.159
ufact+2.006
 Lauder+1.948
 Scotia+1.855
Neg logit contributions
 initials-1.577
ses-1.490
 forces-1.409
advertisement-1.395
official-1.345
Token suspect
Token ID4099
Feature activation-9.73e-01

Pos logit contributions
osate+1.330
etheless+1.298
ufact+1.203
 Lauder+1.156
veyard+1.117
Neg logit contributions
 initials-0.981
ses-0.918
 forces-0.890
advertisement-0.857
 actions-0.843
Token didn
Token ID1422
Feature activation+6.95e-01

Pos logit contributions
osate+2.409
etheless+2.358
ufact+2.182
 Lauder+2.057
veyard+1.971
Neg logit contributions
 initials-2.490
ses-2.349
 forces-2.247
advertisement-2.235
 actions-2.188
Token
Token ID447
Feature activation-6.51e-01

Pos logit contributions
osate+1.993
etheless+1.918
ufact+1.786
 Lauder+1.709
 Scotia+1.647
Neg logit contributions
 initials-1.921
ses-1.757
 forces-1.718
 actions-1.653
advertisement-1.632
Token
Token ID247
Feature activation+7.56e-01

Pos logit contributions
 initials+1.890
ses+1.781
 forces+1.755
 actions+1.683
advertisement+1.682
Neg logit contributions
osate-1.826
etheless-1.782
ufact-1.627
 Lauder-1.523
veyard-1.477
Tokent
Token ID83
Feature activation-1.92e-02

Pos logit contributions
osate+2.706
etheless+2.666
ufact+2.527
 Lauder+2.376
veyard+2.326
Neg logit contributions
 initials-1.584
ses-1.425
 forces-1.397
 actions-1.319
advertisement-1.317
Token start
Token ID923
Feature activation-2.41e+00

Pos logit contributions
 initials+0.049
ses+0.046
 forces+0.045
 actions+0.043
advertisement+0.043
Neg logit contributions
osate-0.061
etheless-0.059
ufact-0.055
 Lauder-0.052
veyard-0.050
Token with
Token ID351
Feature activation-3.94e+00

Pos logit contributions
 initials+4.820
 forces+4.521
ses+4.412
 actions+4.314
advertisement+4.133
Neg logit contributions
osate-8.611
etheless-8.382
ufact-7.814
 Lauder-7.371
veyard-7.361
Token Netanyahu
Token ID17104
Feature activation-1.18e+00

Pos logit contributions
 initials+9.626
 forces+9.174
 actions+8.868
ses+8.612
official+7.902
Neg logit contributions
osate-11.763
etheless-11.238
ufact-10.534
veyard-9.733
pload-9.700
Token;
Token ID26
Feature activation-5.80e-01

Pos logit contributions
 initials+3.030
ses+2.823
 forces+2.808
 actions+2.671
advertisement+2.634
Neg logit contributions
osate-3.689
etheless-3.622
ufact-3.332
 Lauder-3.115
veyard-3.082
Token physical
Token ID3518
Feature activation-3.12e-01

Pos logit contributions
 initials+1.353
ses+1.246
 forces+1.238
 actions+1.175
advertisement+1.157
Neg logit contributions
osate-1.962
etheless-1.920
ufact-1.794
 Lauder-1.681
veyard-1.662
Token movement
Token ID3356
Feature activation-2.33e-01

Pos logit contributions
 initials+0.712
ses+0.659
 forces+0.644
advertisement+0.613
 actions+0.610
Neg logit contributions
osate-1.066
etheless-1.046
ufact-0.976
 Lauder-0.924
veyard-0.903
Token restrictions
Token ID8733
Feature activation-1.08e+00

Pos logit contributions
 initials+0.611
ses+0.571
 forces+0.560
advertisement+0.535
 actions+0.534
Neg logit contributions
osate-0.721
etheless-0.707
ufact-0.654
 Lauder-0.615
veyard-0.599
Token media
Token ID2056
Feature activation+2.20e-01

Pos logit contributions
 initials+2.413
 forces+2.225
ses+2.221
 actions+2.104
advertisement+2.072
Neg logit contributions
osate-3.131
etheless-3.063
ufact-2.841
 Lauder-2.680
veyard-2.635
Token reported
Token ID2098
Feature activation+8.06e-01

Pos logit contributions
osate+0.633
etheless+0.622
ufact+0.574
 Lauder+0.540
veyard+0.517
Neg logit contributions
 initials-0.614
ses-0.579
 forces-0.564
advertisement-0.547
 actions-0.542
Token.
Token ID13
Feature activation+4.53e-01

Pos logit contributions
etheless+2.528
osate+2.513
ufact+2.332
 Lauder+2.210
veyard+2.137
Neg logit contributions
 initials-1.984
ses-1.891
 forces-1.782
advertisement-1.721
 actions-1.709
Token<|endoftext|>
Token ID50256
Feature activation-1.56e+00

Pos logit contributions
osate+1.352
etheless+1.328
ufact+1.226
 Lauder+1.178
veyard+1.120
Neg logit contributions
 initials-1.209
ses-1.128
 forces-1.099
advertisement-1.058
 actions-1.054
TokenImage
Token ID5159
Feature activation-8.31e-01

Pos logit contributions
 initials+3.481
ses+3.323
 forces+3.207
advertisement+3.169
 actions+3.040
Neg logit contributions
osate-5.317
etheless-5.281
ufact-4.928
 Lauder-4.614
veyard-4.469
Token caption
Token ID8305
Feature activation-3.90e+00

Pos logit contributions
 initials+1.692
ses+1.533
 forces+1.527
 actions+1.431
advertisement+1.415
Neg logit contributions
osate-3.056
etheless-3.023
ufact-2.806
 Lauder-2.668
veyard-2.620
Token Some
Token ID2773
Feature activation+3.59e-01

Pos logit contributions
 initials+8.848
 forces+8.542
ses+8.459
 Comed+8.079
 actions+8.022
Neg logit contributions
etheless-11.946
osate-11.310
ufact-10.545
veyard-9.878
 conclud-9.647
Token talked
Token ID6619
Feature activation-2.45e-01

Pos logit contributions
osate+1.136
etheless+1.110
ufact+1.024
 Lauder+0.979
veyard+0.939
Neg logit contributions
 initials-0.896
ses-0.840
 forces-0.807
advertisement-0.799
 actions-0.771
Token of
Token ID286
Feature activation-2.12e-01

Pos logit contributions
 initials+0.610
ses+0.568
 forces+0.557
advertisement+0.533
 actions+0.529
Neg logit contributions
osate-0.789
etheless-0.774
ufact-0.718
 Lauder-0.680
veyard-0.660
Token "
Token ID366
Feature activation+3.39e-02

Pos logit contributions
 initials+0.520
ses+0.487
 forces+0.473
advertisement+0.457
 actions+0.450
Neg logit contributions
osate-0.683
etheless-0.671
ufact-0.624
 Lauder-0.592
veyard-0.573
Tokenb
Token ID65
Feature activation-6.74e-01

Pos logit contributions
osate+0.107
etheless+0.105
ufact+0.097
 Lauder+0.092
veyard+0.089
Neg logit contributions
 initials-0.087
ses-0.080
 forces-0.079
 actions-0.075
advertisement-0.075
Token Russia
Token ID3284
Feature activation+3.49e-01

Pos logit contributions
 initials+0.861
ses+0.800
 forces+0.787
 actions+0.748
advertisement+0.746
Neg logit contributions
osate-1.138
etheless-1.116
ufact-1.036
 Lauder-0.976
veyard-0.955
Token.
Token ID13
Feature activation+8.82e-01

Pos logit contributions
osate+1.120
etheless+1.106
ufact+1.015
 Lauder+0.969
veyard+0.931
Neg logit contributions
 initials-0.862
ses-0.805
 forces-0.772
advertisement-0.765
 actions-0.739
Token\n
Token ID198
Feature activation+1.19e+00

Pos logit contributions
osate+2.519
etheless+2.421
ufact+2.219
 Lauder+2.152
 Scotia+2.028
Neg logit contributions
 initials-2.499
ses-2.307
 forces-2.243
advertisement-2.227
 actions-2.158
Token\n
Token ID198
Feature activation-5.52e-01

Pos logit contributions
osate+2.965
etheless+2.666
 Lauder+2.541
ufact+2.485
 Scotia+2.438
Neg logit contributions
 initials-3.620
ses-3.280
 forces-3.212
advertisement-3.121
 actions-3.110
TokenImage
Token ID5159
Feature activation-9.52e-01

Pos logit contributions
 initials+1.260
ses+1.165
 forces+1.143
 actions+1.082
advertisement+1.081
Neg logit contributions
osate-1.893
etheless-1.860
ufact-1.734
 Lauder-1.639
veyard-1.604
Token copyright
Token ID6634
Feature activation-3.88e+00

Pos logit contributions
 initials+1.910
ses+1.728
 forces+1.716
 actions+1.605
advertisement+1.572
Neg logit contributions
osate-3.533
etheless-3.485
ufact-3.250
 Lauder-3.088
veyard-3.039
Token AFP
Token ID8916
Feature activation-5.41e-01

Pos logit contributions
 initials+7.959
 forces+7.553
ses+7.263
 actions+7.039
official+6.609
Neg logit contributions
etheless-12.838
osate-12.571
ufact-11.757
veyard-11.279
 Lauder-10.751
Token Image
Token ID7412
Feature activation-1.38e+00

Pos logit contributions
 initials+1.353
ses+1.260
 forces+1.237
advertisement+1.179
 actions+1.177
Neg logit contributions
osate-1.736
etheless-1.701
ufact-1.579
 Lauder-1.487
veyard-1.453
Token caption
Token ID8305
Feature activation-3.28e+00

Pos logit contributions
 initials+2.489
 forces+2.266
ses+2.203
 actions+2.093
advertisement+2.034
Neg logit contributions
etheless-5.515
osate-5.447
ufact-5.069
 Lauder-4.715
 conclud-4.689
Token The
Token ID383
Feature activation-7.29e-01

Pos logit contributions
 initials+7.069
 forces+7.049
ses+6.838
 actions+6.504
 Comed+6.270
Neg logit contributions
etheless-10.847
osate-10.319
ufact-9.638
veyard-9.148
 conclud-9.097
Token Kurds
Token ID23880
Feature activation-6.54e-02

Pos logit contributions
 initials+1.726
ses+1.600
 forces+1.579
 actions+1.495
advertisement+1.474
Neg logit contributions
osate-2.442
etheless-2.402
ufact-2.237
 Lauder-2.101
veyard-2.063
Token didn
Token ID1422
Feature activation-9.29e-01

Pos logit contributions
osate+1.036
etheless+1.010
ufact+0.918
 Lauder+0.850
veyard+0.823
Neg logit contributions
 initials-1.269
ses-1.200
 forces-1.180
advertisement-1.138
 actions-1.136
Token
Token ID447
Feature activation-9.60e-01

Pos logit contributions
 initials+2.601
ses+2.497
 forces+2.457
advertisement+2.380
 actions+2.360
Neg logit contributions
osate-2.679
etheless-2.663
ufact-2.419
 Lauder-2.244
veyard-2.205
Token
Token ID247
Feature activation+5.93e-01

Pos logit contributions
 initials+1.997
ses+1.839
 forces+1.817
advertisement+1.722
 actions+1.697
Neg logit contributions
osate-3.456
etheless-3.433
ufact-3.203
 Lauder-3.017
veyard-2.949
Tokent
Token ID83
Feature activation-1.10e+00

Pos logit contributions
osate+2.241
etheless+2.199
ufact+2.074
 Lauder+1.967
veyard+1.934
Neg logit contributions
 initials-1.142
ses-1.023
 forces-1.004
 actions-0.943
advertisement-0.938
Token lose
Token ID4425
Feature activation-1.26e+00

Pos logit contributions
 initials+3.485
 forces+3.344
ses+3.294
 actions+3.195
advertisement+3.138
Neg logit contributions
osate-2.789
etheless-2.703
ufact-2.460
 Lauder-2.259
veyard-2.221
Token control
Token ID1630
Feature activation-3.71e+00

Pos logit contributions
 initials+3.568
 forces+3.367
ses+3.347
 actions+3.233
advertisement+3.169
Neg logit contributions
osate-3.591
etheless-3.527
ufact-3.221
 Lauder-3.003
veyard-2.967
Token;
Token ID26
Feature activation-1.59e+00

Pos logit contributions
 initials+7.455
 forces+7.272
ses+7.083
 actions+6.971
advertisement+6.562
Neg logit contributions
osate-11.809
etheless-11.617
ufact-10.968
veyard-10.433
 Lauder-10.344
Token it
Token ID340
Feature activation-2.27e-01

Pos logit contributions
 initials+3.430
ses+3.133
 forces+3.122
 actions+2.979
advertisement+2.943
Neg logit contributions
osate-5.551
etheless-5.454
ufact-5.110
 Lauder-4.847
veyard-4.777
Token was
Token ID373
Feature activation-1.84e+00

Pos logit contributions
 initials+0.661
ses+0.622
 forces+0.611
advertisement+0.587
 actions+0.586
Neg logit contributions
osate-0.636
etheless-0.622
ufact-0.571
 Lauder-0.532
veyard-0.516
Token stolen
Token ID9909
Feature activation-9.39e-01

Pos logit contributions
 initials+4.678
ses+4.371
 forces+4.369
 actions+4.176
advertisement+4.114
Neg logit contributions
osate-5.716
etheless-5.547
ufact-5.078
 Lauder-4.855
veyard-4.806
Token from
Token ID422
Feature activation-6.23e-01

Pos logit contributions
 initials+2.279
ses+2.124
 forces+2.105
 actions+2.009
advertisement+1.988
Neg logit contributions
osate-3.060
etheless-2.999
ufact-2.783
 Lauder-2.625
veyard-2.582
TokenACE
Token ID11598
Feature activation+4.13e-01

Pos logit contributions
osate+0.742
etheless+0.725
ufact+0.672
 Lauder+0.626
veyard+0.624
Neg logit contributions
 initials-0.629
ses-0.574
 forces-0.567
advertisement-0.541
 actions-0.538
Token).
Token ID737
Feature activation+4.01e-01

Pos logit contributions
osate+1.525
etheless+1.485
ufact+1.401
 Lauder+1.331
veyard+1.309
Neg logit contributions
 initials-0.817
ses-0.746
 forces-0.724
advertisement-0.690
 actions-0.672
Token\n
Token ID198
Feature activation+7.75e-01

Pos logit contributions
osate+1.163
etheless+1.118
ufact+1.029
 Lauder+0.980
veyard+0.944
Neg logit contributions
 initials-1.133
ses-1.056
 forces-1.030
advertisement-1.006
 actions-0.986
Token\n
Token ID198
Feature activation-8.66e-01

Pos logit contributions
osate+2.061
etheless+1.884
ufact+1.769
 Lauder+1.764
 Scotia+1.684
Neg logit contributions
 initials-2.290
ses-2.085
 forces-2.052
advertisement-1.984
 actions-1.980
TokenImage
Token ID5159
Feature activation-8.38e-01

Pos logit contributions
 initials+1.905
ses+1.774
 forces+1.733
advertisement+1.652
 actions+1.638
Neg logit contributions
osate-3.030
etheless-2.995
ufact-2.793
 Lauder-2.633
veyard-2.568
Token caption
Token ID8305
Feature activation-3.69e+00

Pos logit contributions
 initials+1.721
ses+1.567
 forces+1.546
 actions+1.452
advertisement+1.437
Neg logit contributions
osate-3.072
etheless-3.030
ufact-2.822
 Lauder-2.682
veyard-2.630
Token Council
Token ID4281
Feature activation-9.96e-02

Pos logit contributions
 initials+7.870
 forces+7.727
ses+7.603
 actions+7.385
 Comed+7.277
Neg logit contributions
etheless-11.980
osate-11.254
ufact-10.556
veyard-9.634
iqueness-9.542
Token staff
Token ID3085
Feature activation-2.16e+00

Pos logit contributions
 initials+0.237
ses+0.220
 forces+0.215
advertisement+0.206
 actions+0.205
Neg logit contributions
osate-0.331
etheless-0.325
ufact-0.302
 Lauder-0.286
veyard-0.279
Token s
Token ID264
Feature activation-8.88e-01

Pos logit contributions
 initials+6.497
 forces+6.244
ses+6.106
 actions+5.980
official+5.583
Neg logit contributions
osate-5.754
etheless-5.628
ufact-5.133
veyard-4.576
 Lauder-4.512
Tokenifted
Token ID21715
Feature activation+9.63e-01

Pos logit contributions
 initials+2.045
ses+1.921
 forces+1.891
 actions+1.798
advertisement+1.759
Neg logit contributions
osate-3.000
etheless-2.950
ufact-2.726
 Lauder-2.596
veyard-2.530
Token through
Token ID832
Feature activation-2.88e-01

Pos logit contributions
osate+2.971
etheless+2.890
ufact+2.695
 Lauder+2.583
veyard+2.489
Neg logit contributions
 initials-2.415
ses-2.232
 forces-2.164
advertisement-2.155
 actions-2.077
Token Brexit
Token ID11435
Feature activation-3.01e-02

Pos logit contributions
 initials+0.227
ses+0.212
 forces+0.205
advertisement+0.197
 actions+0.195
Neg logit contributions
osate-0.284
etheless-0.278
ufact-0.259
 Lauder-0.244
veyard-0.235
Token is
Token ID318
Feature activation-8.80e-01

Pos logit contributions
 initials+0.081
ses+0.075
 forces+0.074
advertisement+0.071
 actions+0.071
Neg logit contributions
osate-0.091
etheless-0.089
ufact-0.082
 Lauder-0.077
veyard-0.075
Token the
Token ID262
Feature activation-5.39e-01

Pos logit contributions
 initials+2.097
ses+1.949
 forces+1.932
 actions+1.831
advertisement+1.820
Neg logit contributions
osate-2.929
etheless-2.861
ufact-2.674
 Lauder-2.507
veyard-2.468
Token prime
Token ID6994
Feature activation-9.56e-01

Pos logit contributions
 initials+1.287
ses+1.196
 forces+1.177
 actions+1.116
advertisement+1.111
Neg logit contributions
osate-1.795
etheless-1.763
ufact-1.640
 Lauder-1.545
veyard-1.514
Token challenge
Token ID4427
Feature activation-5.81e-01

Pos logit contributions
 initials+2.482
ses+2.326
 forces+2.307
 actions+2.196
advertisement+2.186
Neg logit contributions
osate-2.968
etheless-2.909
ufact-2.702
 Lauder-2.527
veyard-2.486
Token Belfast
Token ID29713
Feature activation-3.64e+00

Pos logit contributions
 initials+1.315
ses+1.212
 forces+1.183
 actions+1.122
advertisement+1.122
Neg logit contributions
osate-2.009
etheless-1.973
ufact-1.836
 Lauder-1.742
veyard-1.701
TokenTe
Token ID6767
Feature activation-2.00e+00

Pos logit contributions
 initials+3.479
ses+3.419
 forces+3.344
 actions+3.279
Position+2.701
Neg logit contributions
osate-15.977
etheless-15.627
ufact-15.238
veyard-14.292
uras-14.166
Tokenlegraph
Token ID16606
Feature activation-2.02e+00

Pos logit contributions
 initials+5.070
 forces+4.811
ses+4.639
 actions+4.486
advertisement+4.457
Neg logit contributions
osate-6.391
etheless-6.205
ufact-5.683
 Lauder-5.318
veyard-5.111
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
 initials+3.393
 forces+3.131
ses+3.123
advertisement+3.099
 actions+3.006
Neg logit contributions
etheless-8.047
osate-7.983
ufact-7.511
 Lauder-6.930
iqueness-6.737
Tokenco
Token ID1073
Feature activation-1.34e+00

Pos logit contributions
 initials+2.840
ses+2.754
 forces+2.694
official+2.528
advertisement+2.521
Neg logit contributions
etheless-3.775
osate-3.729
ufact-3.527
 Lauder-3.335
veyard-3.216
Token.
Token ID13
Feature activation-5.50e-01

Pos logit contributions
 initials+2.657
ses+2.511
 forces+2.473
advertisement+2.352
 actions+2.290
Neg logit contributions
etheless-4.972
osate-4.915
ufact-4.598
 Lauder-4.312
 conclud-4.204
Token didn
Token ID1422
Feature activation+1.11e-01

Pos logit contributions
 initials+1.042
ses+0.982
 forces+0.978
 actions+0.936
advertisement+0.933
Neg logit contributions
osate-0.890
etheless-0.874
ufact-0.797
 Lauder-0.735
veyard-0.715
Token
Token ID447
Feature activation-1.11e+00

Pos logit contributions
osate+0.336
etheless+0.324
ufact+0.302
 Lauder+0.288
veyard+0.277
Neg logit contributions
 initials-0.296
ses-0.272
 forces-0.267
 actions-0.256
advertisement-0.253
Token
Token ID247
Feature activation+6.07e-01

Pos logit contributions
 initials+2.850
ses+2.687
 forces+2.644
advertisement+2.543
 actions+2.503
Neg logit contributions
osate-3.427
etheless-3.407
ufact-3.118
 Lauder-2.912
veyard-2.849
Tokent
Token ID83
Feature activation-3.28e-01

Pos logit contributions
osate+2.116
etheless+2.082
ufact+1.947
 Lauder+1.839
veyard+1.800
Neg logit contributions
 initials-1.348
ses-1.237
 forces-1.216
 actions-1.149
advertisement-1.146
Token lose
Token ID4425
Feature activation-5.56e-01

Pos logit contributions
 initials+1.086
ses+1.030
 forces+1.024
 actions+0.983
advertisement+0.980
Neg logit contributions
osate-0.800
etheless-0.776
ufact-0.703
 Lauder-0.640
veyard-0.623
Token control
Token ID1630
Feature activation-3.63e+00

Pos logit contributions
 initials+1.378
ses+1.282
 forces+1.259
advertisement+1.198
 actions+1.198
Neg logit contributions
osate-1.796
etheless-1.763
ufact-1.636
 Lauder-1.542
veyard-1.506
Token
Token ID784
Feature activation-9.88e-01

Pos logit contributions
 initials+7.232
 forces+7.060
ses+6.716
 actions+6.503
advertisement+6.353
Neg logit contributions
osate-12.233
etheless-11.839
ufact-11.188
veyard-10.455
 Lauder-10.364
Token it
Token ID340
Feature activation+4.91e-01

Pos logit contributions
 initials+2.396
ses+2.222
 forces+2.204
 actions+2.094
advertisement+2.082
Neg logit contributions
osate-3.243
etheless-3.170
ufact-2.955
 Lauder-2.776
veyard-2.730
Token was
Token ID373
Feature activation-1.09e+00

Pos logit contributions
osate+1.354
etheless+1.327
ufact+1.217
 Lauder+1.136
veyard+1.095
Neg logit contributions
 initials-1.438
ses-1.351
 forces-1.315
advertisement-1.279
 actions-1.271
Token stolen
Token ID9909
Feature activation+1.49e-02

Pos logit contributions
 initials+2.777
ses+2.586
 forces+2.566
 actions+2.443
advertisement+2.427
Neg logit contributions
osate-3.459
etheless-3.374
ufact-3.124
 Lauder-2.931
veyard-2.885
Token\n
Token ID198
Feature activation+1.11e+00

Pos logit contributions
osate+0.049
etheless+0.049
ufact+0.046
 Lauder+0.043
veyard+0.042
Neg logit contributions
 initials-0.035
ses-0.033
 forces-0.032
advertisement-0.030
 actions-0.030
TokenWe
Token ID1135
Feature activation-2.95e-01

Pos logit contributions
 initials+0.043
ses+0.039
 forces+0.039
 actions+0.037
advertisement+0.036
Neg logit contributions
osate-0.056
etheless-0.054
ufact-0.050
 Lauder-0.048
veyard-0.047
Token
Token ID447
Feature activation+1.48e-01

Pos logit contributions
 initials+0.905
ses+0.851
 forces+0.848
 actions+0.815
advertisement+0.810
Neg logit contributions
osate-0.775
etheless-0.761
ufact-0.694
 Lauder-0.643
veyard-0.626
Token
Token ID247
Feature activation+4.23e-01

Pos logit contributions
osate+0.477
etheless+0.467
ufact+0.433
 Lauder+0.409
veyard+0.401
Neg logit contributions
 initials-0.366
ses-0.342
 forces-0.333
 actions-0.319
advertisement-0.316
Tokenve
Token ID303
Feature activation+8.27e-01

Pos logit contributions
osate+1.327
etheless+1.302
ufact+1.207
 Lauder+1.133
veyard+1.106
Neg logit contributions
 initials-1.088
ses-1.010
 forces-0.994
 actions-0.949
advertisement-0.947
Token lost
Token ID2626
Feature activation+2.25e-01

Pos logit contributions
osate+2.378
etheless+2.324
ufact+2.159
 Lauder+2.002
pload+1.935
Neg logit contributions
 initials-2.271
ses-2.162
 forces-2.063
advertisement-2.015
 actions-1.982
Token control
Token ID1630
Feature activation-3.62e+00

Pos logit contributions
osate+0.620
etheless+0.605
ufact+0.560
 Lauder+0.515
veyard+0.497
Neg logit contributions
 initials-0.657
ses-0.624
 forces-0.599
advertisement-0.586
 actions-0.574
Token of
Token ID286
Feature activation-1.43e+00

Pos logit contributions
 initials+7.262
 forces+7.159
ses+6.736
 actions+6.678
advertisement+6.374
Neg logit contributions
osate-11.883
etheless-11.631
ufact-10.893
 Lauder-10.379
veyard-10.313
Token our
Token ID674
Feature activation-7.39e-01

Pos logit contributions
 initials+3.150
ses+2.900
 forces+2.892
 actions+2.747
advertisement+2.703
Neg logit contributions
osate-4.947
etheless-4.883
ufact-4.538
 Lauder-4.325
veyard-4.266
Token personal
Token ID2614
Feature activation+7.09e-01

Pos logit contributions
 initials+1.687
ses+1.559
 forces+1.531
 actions+1.450
advertisement+1.446
Neg logit contributions
osate-2.535
etheless-2.491
ufact-2.321
 Lauder-2.196
veyard-2.150
Token data
Token ID1366
Feature activation+2.21e-01

Pos logit contributions
osate+2.466
etheless+2.422
ufact+2.264
 Lauder+2.124
 Scotia+2.068
Neg logit contributions
 initials-1.519
ses-1.417
 forces-1.348
advertisement-1.308
official-1.268
Token It
Token ID632
Feature activation-6.30e-01

Pos logit contributions
osate+0.729
etheless+0.715
ufact+0.668
 Lauder+0.627
veyard+0.609
Neg logit contributions
 initials-0.532
ses-0.493
 forces-0.476
advertisement-0.459
 actions-0.452
Token Moscow
Token ID9070
Feature activation-1.26e+00

Pos logit contributions
osate+1.073
etheless+1.047
ufact+0.952
 Lauder+0.893
veyard+0.853
Neg logit contributions
 initials-1.341
ses-1.269
 forces-1.245
advertisement-1.219
 actions-1.200
Token,
Token ID11
Feature activation-1.08e-01

Pos logit contributions
 initials+3.310
 forces+3.072
ses+3.054
 actions+2.936
advertisement+2.814
Neg logit contributions
osate-3.936
etheless-3.859
ufact-3.573
 Lauder-3.332
veyard-3.237
Token May
Token ID1737
Feature activation+1.07e+00

Pos logit contributions
 initials+0.277
ses+0.258
 forces+0.255
 actions+0.244
advertisement+0.241
Neg logit contributions
osate-0.339
etheless-0.332
ufact-0.308
 Lauder-0.289
veyard-0.282
Token 28
Token ID2579
Feature activation-1.00e+00

Pos logit contributions
osate+3.357
etheless+3.202
ufact+3.002
 Lauder+2.944
 Scotia+2.827
Neg logit contributions
 initials-2.605
ses-2.390
 forces-2.340
advertisement-2.257
 actions-2.211
Token,
Token ID11
Feature activation-1.86e+00

Pos logit contributions
 initials+2.163
 forces+2.028
ses+1.992
 actions+1.910
advertisement+1.837
Neg logit contributions
etheless-3.567
osate-3.524
ufact-3.328
 Lauder-3.069
veyard-3.050
Token 2014
Token ID1946
Feature activation-3.62e+00

Pos logit contributions
 initials+5.415
 forces+5.272
ses+5.074
 actions+5.045
official+4.741
Neg logit contributions
etheless-5.193
osate-5.085
ufact-4.668
veyard-4.256
 Lauder-4.178
Token.
Token ID13
Feature activation-1.35e+00

Pos logit contributions
 forces+5.233
 initials+5.116
 actions+4.974
ses+4.647
official+4.057
Neg logit contributions
etheless-14.567
osate-13.990
ufact-13.801
 conclud-12.783
veyard-12.692
Token REUTERS
Token ID15862
Feature activation-7.25e-01

Pos logit contributions
 initials+2.347
ses+2.146
 forces+2.084
 actions+1.941
advertisement+1.875
Neg logit contributions
osate-5.319
etheless-5.313
ufact-4.958
 Lauder-4.657
veyard-4.650
Token/
Token ID14
Feature activation-1.39e+00

Pos logit contributions
 initials+1.019
ses+0.893
 forces+0.864
 actions+0.784
advertisement+0.784
Neg logit contributions
osate-3.122
etheless-3.078
ufact-2.912
 Lauder-2.788
veyard-2.742
TokenMichael
Token ID13256
Feature activation+1.01e+00

Pos logit contributions
 initials+2.718
ses+2.516
 forces+2.446
advertisement+2.317
 actions+2.297
Neg logit contributions
osate-5.220
etheless-5.152
ufact-4.796
 Lauder-4.524
veyard-4.455
Token K
Token ID509
Feature activation-8.24e-01

Pos logit contributions
osate+3.113
etheless+2.994
ufact+2.829
 Lauder+2.717
veyard+2.603
Neg logit contributions
 initials-2.647
ses-2.422
 forces-2.385
 actions-2.323
advertisement-2.294
Token<|endoftext|>
Token ID50256
Feature activation-1.54e+00

Pos logit contributions
osate+2.324
etheless+2.240
ufact+2.091
 Lauder+2.005
veyard+1.915
Neg logit contributions
 initials-1.978
ses-1.837
 forces-1.790
advertisement-1.713
 actions-1.699
TokenImage
Token ID5159
Feature activation-7.07e-01

Pos logit contributions
 initials+3.429
ses+3.269
 forces+3.156
advertisement+3.114
 actions+2.991
Neg logit contributions
osate-5.254
etheless-5.217
ufact-4.867
 Lauder-4.560
veyard-4.417
Token copyright
Token ID6634
Feature activation-4.24e+00

Pos logit contributions
 initials+1.447
ses+1.315
 forces+1.305
 actions+1.224
advertisement+1.213
Neg logit contributions
osate-2.593
etheless-2.563
ufact-2.383
 Lauder-2.266
veyard-2.222
Token AP
Token ID3486
Feature activation-1.99e+00

Pos logit contributions
 initials+9.297
 forces+8.853
ses+8.544
 actions+8.316
official+7.955
Neg logit contributions
etheless-13.029
osate-12.725
ufact-11.762
veyard-11.241
 Lauder-10.520
Token Image
Token ID7412
Feature activation-1.23e+00

Pos logit contributions
 initials+4.431
 forces+4.074
ses+4.040
 actions+3.867
advertisement+3.774
Neg logit contributions
etheless-6.670
osate-6.666
ufact-6.141
 Lauder-5.798
veyard-5.667
Token caption
Token ID8305
Feature activation-3.49e+00

Pos logit contributions
 initials+2.211
 forces+2.012
ses+1.950
 actions+1.858
advertisement+1.808
Neg logit contributions
etheless-4.951
osate-4.885
ufact-4.525
 Lauder-4.237
veyard-4.205
Token Peace
Token ID12689
Feature activation+8.88e-01

Pos logit contributions
 initials+7.959
 forces+7.725
ses+7.621
 Comed+7.315
 actions+7.236
Neg logit contributions
etheless-10.932
osate-10.302
ufact-9.653
veyard-9.072
 conclud-8.954
Token talks
Token ID6130
Feature activation-2.12e-01

Pos logit contributions
osate+2.772
etheless+2.682
ufact+2.511
 Lauder+2.382
veyard+2.280
Neg logit contributions
 initials-2.266
ses-2.133
advertisement-2.024
 forces-2.016
 actions-1.932
Token between
Token ID1022
Feature activation-4.01e-02

Pos logit contributions
 initials+0.625
ses+0.588
 forces+0.580
 actions+0.557
advertisement+0.556
Neg logit contributions
osate-0.588
etheless-0.575
ufact-0.527
 Lauder-0.490
veyard-0.477
Token the
Token ID262
Feature activation+2.99e-01

Pos logit contributions
 initials+0.107
ses+0.100
 forces+0.098
advertisement+0.096
 actions+0.094
Neg logit contributions
osate-0.121
etheless-0.119
ufact-0.109
 Lauder-0.103
veyard-0.100
Token government
Token ID1230
Feature activation-4.54e-01

Pos logit contributions
osate+0.960
etheless+0.938
ufact+0.863
 Lauder+0.828
veyard+0.799
Neg logit contributions
 initials-0.737
ses-0.684
 forces-0.662
advertisement-0.656
 actions-0.637
Token and
Token ID290
Feature activation-1.68e+00

Pos logit contributions
osate+1.761
etheless+1.742
ufact+1.632
 Lauder+1.549
veyard+1.496
Neg logit contributions
 initials-1.215
ses-1.146
 forces-1.102
advertisement-1.064
 actions-1.045
Token then
Token ID788
Feature activation-8.42e-01

Pos logit contributions
 initials+3.845
 forces+3.493
ses+3.464
 actions+3.303
advertisement+3.182
Neg logit contributions
osate-5.914
etheless-5.739
ufact-5.327
 Lauder-4.988
veyard-4.925
Token choosing
Token ID11236
Feature activation-8.97e-01

Pos logit contributions
 initials+1.870
ses+1.715
 forces+1.692
 actions+1.600
advertisement+1.582
Neg logit contributions
osate-2.968
etheless-2.906
ufact-2.711
 Lauder-2.555
veyard-2.511
Token "
Token ID366
Feature activation-9.22e-01

Pos logit contributions
 initials+2.181
ses+2.008
 forces+1.985
 actions+1.899
advertisement+1.864
Neg logit contributions
osate-2.971
etheless-2.913
ufact-2.707
 Lauder-2.538
veyard-2.487
TokenShow
Token ID15307
Feature activation-2.59e-01

Pos logit contributions
 initials+2.228
ses+2.088
 forces+2.034
advertisement+1.949
 actions+1.939
Neg logit contributions
osate-3.048
etheless-2.985
ufact-2.781
 Lauder-2.613
veyard-2.545
Token Spe
Token ID2531
Feature activation-3.47e+00

Pos logit contributions
 initials+0.627
ses+0.584
 forces+0.573
advertisement+0.546
 actions+0.543
Neg logit contributions
osate-0.846
etheless-0.830
ufact-0.771
 Lauder-0.729
veyard-0.712
Tokenlling
Token ID2680
Feature activation+5.71e-01

Pos logit contributions
ses+10.888
 initials+10.845
 forces+10.498
official+10.271
advertisement+10.256
Neg logit contributions
etheless-7.620
osate-7.373
ufact-6.379
 Lauder-5.863
 conclud-5.787
Token and
Token ID290
Feature activation-1.09e+00

Pos logit contributions
osate+1.904
etheless+1.811
ufact+1.705
 Lauder+1.664
 Scotia+1.612
Neg logit contributions
 initials-1.301
ses-1.224
 forces-1.166
advertisement-1.146
 actions-1.098
Token Gram
Token ID20159
Feature activation+1.75e+00

Pos logit contributions
 initials+2.672
ses+2.474
 forces+2.437
 actions+2.320
advertisement+2.309
Neg logit contributions
osate-3.559
etheless-3.493
ufact-3.237
 Lauder-3.042
veyard-2.980
Tokenmar
Token ID3876
Feature activation+5.43e-01

Pos logit contributions
osate+3.972
etheless+3.662
ufact+3.383
 Lauder+3.229
veyard+3.017
Neg logit contributions
 initials-5.799
ses-5.390
 forces-5.379
advertisement-5.261
 actions-5.193
Token"
Token ID1
Feature activation-4.40e-01

Pos logit contributions
osate+1.846
etheless+1.767
ufact+1.656
 Lauder+1.587
 Scotia+1.547
Neg logit contributions
 initials-1.216
ses-1.137
 forces-1.095
advertisement-1.069
 actions-1.028
Token $
Token ID720
Feature activation+1.09e+00

Pos logit contributions
 initials+2.442
ses+2.281
 forces+2.264
 actions+2.158
advertisement+2.131
Neg logit contributions
osate-2.745
etheless-2.688
ufact-2.467
 Lauder-2.307
veyard-2.254
Token8
Token ID23
Feature activation+8.25e-01

Pos logit contributions
osate+3.153
etheless+3.024
ufact+2.808
 Lauder+2.724
veyard+2.603
Neg logit contributions
 initials-2.966
ses-2.692
 forces-2.675
 actions-2.559
advertisement-2.486
Token million
Token ID1510
Feature activation+1.56e+00

Pos logit contributions
osate+2.128
etheless+2.083
ufact+1.865
 Lauder+1.758
veyard+1.700
Neg logit contributions
 initials-2.516
ses-2.415
 forces-2.344
 actions-2.247
advertisement-2.235
Token due
Token ID2233
Feature activation+3.59e-01

Pos logit contributions
etheless+4.753
osate+4.749
ufact+4.434
 Lauder+4.184
 matter+4.141
Neg logit contributions
 initials-3.653
ses-3.408
advertisement-3.287
 forces-3.232
official-3.144
Token to
Token ID284
Feature activation+5.70e-02

Pos logit contributions
osate+1.245
etheless+1.229
ufact+1.150
 Lauder+1.101
veyard+1.076
Neg logit contributions
 initials-0.784
ses-0.724
 forces-0.693
advertisement-0.678
 actions-0.653
Token Carlos
Token ID17409
Feature activation-3.44e+00

Pos logit contributions
osate+0.198
etheless+0.195
ufact+0.183
 Lauder+0.175
veyard+0.171
Neg logit contributions
 initials-0.124
ses-0.114
 forces-0.110
advertisement-0.107
 actions-0.104
Token Quentin
Token ID42447
Feature activation+1.17e+00

Pos logit contributions
 forces+7.436
 initials+7.377
ses+7.176
 actions+7.026
 Comed+6.332
Neg logit contributions
osate-11.091
etheless-11.013
ufact-10.026
 Lauder-9.241
 conclud-9.145
Token.
Token ID13
Feature activation+3.47e-01

Pos logit contributions
osate+3.732
etheless+3.650
ufact+3.427
 Lauder+3.266
veyard+3.146
Neg logit contributions
 initials-2.815
ses-2.580
 forces-2.503
advertisement-2.463
 actions-2.410
Token But
Token ID887
Feature activation-6.82e-01

Pos logit contributions
osate+1.059
etheless+1.027
ufact+0.964
 Lauder+0.925
 Scotia+0.882
Neg logit contributions
 initials-0.909
ses-0.835
 forces-0.818
advertisement-0.787
 actions-0.782
Token fact
Token ID1109
Feature activation+4.88e-01

Pos logit contributions
 initials+1.621
ses+1.500
 forces+1.487
 actions+1.412
advertisement+1.392
Neg logit contributions
osate-2.283
etheless-2.238
ufact-2.082
 Lauder-1.952
veyard-1.917
Tokenoring
Token ID3255
Feature activation-6.50e-01

Pos logit contributions
osate+1.241
etheless+1.211
ufact+1.094
 Lauder+1.015
veyard+0.980
Neg logit contributions
 initials-1.535
ses-1.453
 forces-1.421
advertisement-1.380
 actions-1.372
TokenI
Token ID40
Feature activation-2.37e-01

Pos logit contributions
 initials+0.107
ses+0.099
 forces+0.098
 actions+0.093
advertisement+0.092
Neg logit contributions
osate-0.150
etheless-0.147
ufact-0.137
 Lauder-0.129
veyard-0.126
Tokenba
Token ID7012
Feature activation-1.05e+00

Pos logit contributions
 initials+0.599
ses+0.564
 forces+0.552
advertisement+0.526
 actions+0.525
Neg logit contributions
osate-0.755
etheless-0.745
ufact-0.687
 Lauder-0.646
veyard-0.628
TokenH
Token ID39
Feature activation-1.94e+00

Pos logit contributions
 initials+2.376
ses+2.257
 forces+2.194
advertisement+2.102
 actions+2.078
Neg logit contributions
etheless-3.590
osate-3.571
ufact-3.282
 Lauder-3.099
veyard-2.983
TokenEx
Token ID3109
Feature activation-8.76e-01

Pos logit contributions
 initials+4.606
ses+4.581
 forces+4.343
advertisement+4.135
 actions+4.117
Neg logit contributions
etheless-6.265
osate-6.108
ufact-5.682
 Lauder-5.320
 conclud-5.071
TokenTT
Token ID15751
Feature activation-2.40e+00

Pos logit contributions
 initials+2.106
ses+2.005
 forces+1.946
advertisement+1.852
 actions+1.847
Neg logit contributions
osate-2.862
etheless-2.858
ufact-2.621
 Lauder-2.469
veyard-2.391
Tokenj
Token ID73
Feature activation-3.43e+00

Pos logit contributions
 initials+5.299
ses+5.212
 forces+5.025
 actions+4.738
official+4.642
Neg logit contributions
etheless-8.088
osate-7.850
ufact-7.260
 Lauder-6.736
veyard-6.589
Token
Token ID851
Feature activation-1.23e+00

Pos logit contributions
 initials+5.997
ses+5.665
 forces+5.645
 actions+5.353
official+5.089
Neg logit contributions
etheless-12.602
osate-12.263
ufact-11.608
 Lauder-10.851
veyard-10.621
Token B
Token ID347
Feature activation-5.95e-01

Pos logit contributions
 initials+3.074
ses+2.861
 forces+2.834
 actions+2.690
advertisement+2.663
Neg logit contributions
osate-3.906
etheless-3.842
ufact-3.556
 Lauder-3.324
veyard-3.287
Tokenorm
Token ID579
Feature activation+9.10e-01

Pos logit contributions
 initials+1.642
ses+1.548
 forces+1.526
 actions+1.455
advertisement+1.449
Neg logit contributions
osate-1.747
etheless-1.719
ufact-1.570
 Lauder-1.479
veyard-1.434
Tokenes
Token ID274
Feature activation+8.33e-01

Pos logit contributions
osate+2.982
etheless+2.768
ufact+2.660
 Lauder+2.479
veyard+2.448
Neg logit contributions
 initials-2.271
 forces-2.013
ses-2.005
advertisement-1.977
 actions-1.932
Token les
Token ID10287
Feature activation-3.66e-01

Pos logit contributions
osate+2.631
etheless+2.546
ufact+2.392
 Lauder+2.280
veyard+2.185
Neg logit contributions
 initials-2.094
ses-1.890
 forces-1.872
advertisement-1.842
 actions-1.812
Token."
Token ID526
Feature activation-6.08e-01

Pos logit contributions
osate+2.392
etheless+2.311
ufact+2.151
 Lauder+2.105
 Scotia+1.991
Neg logit contributions
 initials-1.893
ses-1.768
 forces-1.729
advertisement-1.640
 actions-1.635
Token 2003
Token ID5816
Feature activation-1.53e+00

Pos logit contributions
 initials+2.041
ses+1.934
 forces+1.911
 actions+1.845
advertisement+1.843
Neg logit contributions
osate-1.435
etheless-1.402
ufact-1.262
 Lauder-1.146
veyard-1.119
Token annual
Token ID5079
Feature activation+5.93e-01

Pos logit contributions
 initials+4.245
ses+3.946
 forces+3.896
 actions+3.758
advertisement+3.723
Neg logit contributions
osate-4.528
etheless-4.460
ufact-4.085
 Lauder-3.744
veyard-3.738
Token report
Token ID989
Feature activation-3.74e-02

Pos logit contributions
osate+1.097
etheless+1.024
ufact+0.908
 Lauder+0.855
 Scotia+0.781
Neg logit contributions
 initials-2.248
ses-2.143
 forces-2.104
advertisement-2.063
 actions-2.032
Token,
Token ID11
Feature activation-3.07e-01

Pos logit contributions
 initials+0.094
ses+0.088
 forces+0.086
advertisement+0.082
 actions+0.082
Neg logit contributions
osate-0.119
etheless-0.116
ufact-0.108
 Lauder-0.103
veyard-0.100
Token Joyce
Token ID25936
Feature activation+2.01e+00

Pos logit contributions
 initials+0.769
ses+0.719
 forces+0.704
advertisement+0.672
 actions+0.671
Neg logit contributions
osate-0.982
etheless-0.959
ufact-0.894
 Lauder-0.848
veyard-0.822
Token Foundation
Token ID5693
Feature activation+6.38e-03

Pos logit contributions
osate+3.738
 Lauder+3.245
etheless+3.234
ufact+2.985
 Scotia+2.864
Neg logit contributions
 initials-7.102
ses-6.866
 forces-6.536
advertisement-6.494
parser-6.421
Token:
Token ID25
Feature activation+2.38e-01

Pos logit contributions
osate+0.021
etheless+0.020
ufact+0.019
 Lauder+0.018
veyard+0.017
Neg logit contributions
 initials-0.016
ses-0.015
 forces-0.014
advertisement-0.014
 actions-0.014
Token "
Token ID366
Feature activation-9.12e-01

Pos logit contributions
osate+0.637
etheless+0.614
ufact+0.564
 Lauder+0.535
veyard+0.507
Neg logit contributions
 initials-0.718
ses-0.686
 forces-0.671
advertisement-0.647
 actions-0.643
TokenR
Token ID49
Feature activation-5.47e-02

Pos logit contributions
 initials+1.859
ses+1.703
 forces+1.666
advertisement+1.579
 actions+1.567
Neg logit contributions
osate-3.346
etheless-3.316
ufact-3.088
 Lauder-2.907
veyard-2.873
Tokenoc
Token ID420
Feature activation+1.28e-02

Pos logit contributions
 initials+0.133
ses+0.123
 forces+0.121
 actions+0.115
advertisement+0.115
Neg logit contributions
osate-0.179
etheless-0.176
ufact-0.164
 Lauder-0.154
veyard-0.151
Token several
Token ID1811
Feature activation+4.04e-01

Pos logit contributions
osate+0.446
etheless+0.443
ufact+0.413
 Lauder+0.389
veyard+0.376
Neg logit contributions
 initials-0.388
ses-0.363
 forces-0.353
advertisement-0.345
 actions-0.336
Token construction
Token ID5103
Feature activation-4.64e-01

Pos logit contributions
osate+1.228
etheless+1.220
ufact+1.135
 Lauder+1.072
veyard+1.025
Neg logit contributions
 initials-1.040
ses-0.974
 forces-0.947
advertisement-0.924
 actions-0.897
Token companies
Token ID2706
Feature activation-7.19e-01

Pos logit contributions
 initials+1.128
ses+1.048
 forces+1.030
 actions+0.979
advertisement+0.977
Neg logit contributions
osate-1.522
etheless-1.494
ufact-1.388
 Lauder-1.309
veyard-1.279
Token,
Token ID11
Feature activation+8.37e-01

Pos logit contributions
 initials+1.911
ses+1.790
 forces+1.771
 actions+1.689
advertisement+1.672
Neg logit contributions
osate-2.221
etheless-2.163
ufact-1.999
 Lauder-1.864
veyard-1.830
Token B
Token ID347
Feature activation-3.75e-01

Pos logit contributions
osate+2.264
etheless+2.253
ufact+2.072
 Lauder+1.962
 Scotia+1.872
Neg logit contributions
 initials-2.390
ses-2.258
 forces-2.194
advertisement-2.164
 actions-2.103
Tokenoral
Token ID6864
Feature activation+2.01e+00

Pos logit contributions
 initials+0.887
ses+0.818
 forces+0.806
advertisement+0.769
 actions+0.764
Neg logit contributions
osate-1.246
etheless-1.226
ufact-1.146
 Lauder-1.082
veyard-1.057
Token and
Token ID290
Feature activation-7.15e-01

Pos logit contributions
osate+6.023
etheless+5.871
ufact+5.597
 Lauder+5.246
idge+5.095
Neg logit contributions
 initials-5.030
ses-4.725
advertisement-4.624
 forces-4.495
 actions-4.320
Token CS
Token ID9429
Feature activation-5.63e-01

Pos logit contributions
 initials+1.857
ses+1.733
 forces+1.706
 actions+1.627
advertisement+1.622
Neg logit contributions
osate-2.226
etheless-2.183
ufact-2.017
 Lauder-1.893
veyard-1.853
TokenR
Token ID49
Feature activation+1.16e+00

Pos logit contributions
 initials+1.489
ses+1.392
 forces+1.369
advertisement+1.308
 actions+1.307
Neg logit contributions
osate-1.725
etheless-1.690
ufact-1.563
 Lauder-1.468
veyard-1.431
Token for
Token ID329
Feature activation+9.28e-01

Pos logit contributions
osate+3.561
etheless+3.481
ufact+3.231
 Lauder+3.076
 Scotia+2.955
Neg logit contributions
 initials-2.949
ses-2.759
 forces-2.671
advertisement-2.658
 actions-2.554
Token example
Token ID1672
Feature activation+3.83e-01

Pos logit contributions
osate+2.569
etheless+2.537
ufact+2.323
 Lauder+2.200
veyard+2.098
Neg logit contributions
 initials-2.619
ses-2.463
 forces-2.422
advertisement-2.389
 actions-2.306
Token
Token ID247
Feature activation+6.26e-01

Pos logit contributions
 initials+2.987
ses+2.809
 forces+2.774
advertisement+2.669
 actions+2.657
Neg logit contributions
osate-2.851
etheless-2.795
ufact-2.543
 Lauder-2.376
veyard-2.309
Tokens
Token ID82
Feature activation+1.48e-01

Pos logit contributions
osate+2.035
etheless+1.993
ufact+1.873
 Lauder+1.762
veyard+1.712
Neg logit contributions
 initials-1.513
ses-1.396
 forces-1.373
 actions-1.300
advertisement-1.294
Token face
Token ID1986
Feature activation-6.24e-01

Pos logit contributions
osate+0.364
etheless+0.356
ufact+0.321
 Lauder+0.296
veyard+0.286
Neg logit contributions
 initials-0.483
ses-0.457
 forces-0.451
 actions-0.435
advertisement-0.434
Token facts
Token ID6419
Feature activation+8.82e-02

Pos logit contributions
 initials+1.423
ses+1.316
 forces+1.289
advertisement+1.222
 actions+1.221
Neg logit contributions
osate-2.137
etheless-2.100
ufact-1.958
 Lauder-1.852
veyard-1.810
Token.
Token ID13
Feature activation+3.53e-01

Pos logit contributions
osate+0.287
etheless+0.281
ufact+0.260
 Lauder+0.248
veyard+0.239
Neg logit contributions
 initials-0.217
ses-0.200
 forces-0.195
advertisement-0.187
 actions-0.186
Token
Token ID198
Feature activation+1.96e+00

Pos logit contributions
osate+1.125
etheless+1.093
ufact+1.013
 Lauder+0.964
veyard+0.928
Neg logit contributions
 initials-0.894
ses-0.829
 forces-0.809
advertisement-0.772
 actions-0.768
Token
Token ID198
Feature activation-2.16e-01

Pos logit contributions
osate+4.786
etheless+4.143
 Lauder+4.008
ufact+3.865
 Scotia+3.840
Neg logit contributions
 initials-6.094
ses-5.469
 forces-5.414
 actions-5.206
 motivations-5.106
TokenTurkey
Token ID31632
Feature activation-1.93e+00

Pos logit contributions
 initials+0.504
ses+0.463
 forces+0.455
 actions+0.431
advertisement+0.426
Neg logit contributions
osate-0.732
etheless-0.716
ufact-0.666
 Lauder-0.632
veyard-0.620
Token and
Token ID290
Feature activation-6.66e-01

Pos logit contributions
 initials+4.924
 forces+4.718
ses+4.556
 actions+4.450
official+4.245
Neg logit contributions
etheless-5.834
osate-5.833
ufact-5.402
 Lauder-5.027
veyard-4.982
Token Azerbaijan
Token ID33605
Feature activation+8.44e-02

Pos logit contributions
 initials+1.783
ses+1.661
 forces+1.652
 actions+1.577
advertisement+1.557
Neg logit contributions
osate-2.017
etheless-1.975
ufact-1.831
 Lauder-1.706
veyard-1.676
Token threaten
Token ID16180
Feature activation-2.73e-01

Pos logit contributions
osate+0.221
etheless+0.216
ufact+0.197
 Lauder+0.182
veyard+0.177
Neg logit contributions
 initials-0.261
ses-0.247
 forces-0.244
 actions-0.234
advertisement-0.234
TokenIn
Token ID818
Feature activation+7.89e-01

Pos logit contributions
 initials+3.034
ses+2.871
 forces+2.782
advertisement+2.716
 actions+2.636
Neg logit contributions
osate-4.603
etheless-4.561
ufact-4.251
 Lauder-3.991
veyard-3.878
Token an
Token ID281
Feature activation+5.66e-01

Pos logit contributions
osate+2.445
etheless+2.388
ufact+2.220
 Lauder+2.102
veyard+2.043
Neg logit contributions
 initials-2.025
ses-1.880
 forces-1.839
advertisement-1.771
 actions-1.742
Token article
Token ID2708
Feature activation+1.33e+00

Pos logit contributions
osate+1.684
etheless+1.625
ufact+1.527
 Lauder+1.434
 Scotia+1.369
Neg logit contributions
 initials-1.495
ses-1.422
 forces-1.368
advertisement-1.319
 actions-1.288
Token for
Token ID329
Feature activation+5.61e-01

Pos logit contributions
osate+4.052
etheless+4.033
ufact+3.738
 Lauder+3.550
omas+3.421
Neg logit contributions
 initials-3.267
ses-3.069
 forces-2.952
advertisement-2.847
 actions-2.825
Token Pitch
Token ID33517
Feature activation+9.58e-01

Pos logit contributions
osate+1.740
etheless+1.704
ufact+1.584
 Lauder+1.503
veyard+1.443
Neg logit contributions
 initials-1.414
ses-1.337
 forces-1.284
advertisement-1.252
 actions-1.232
Tokenfork
Token ID32523
Feature activation+1.96e+00

Pos logit contributions
osate+2.916
etheless+2.777
ufact+2.595
veyard+2.475
 Lauder+2.450
Neg logit contributions
 initials-2.522
ses-2.359
 forces-2.268
 actions-2.154
advertisement-2.147
Token,
Token ID11
Feature activation+7.10e-01

Pos logit contributions
osate+5.748
etheless+5.562
ufact+5.198
 Lauder+4.839
pload+4.774
Neg logit contributions
 initials-5.068
ses-4.817
 forces-4.567
advertisement-4.389
 actions-4.288
Token music
Token ID2647
Feature activation+9.78e-02

Pos logit contributions
osate+2.346
etheless+2.300
ufact+2.146
 Lauder+2.069
 Scotia+1.986
Neg logit contributions
 initials-1.634
ses-1.510
 forces-1.446
advertisement-1.411
 actions-1.391
Token critic
Token ID4014
Feature activation+1.06e+00

Pos logit contributions
osate+0.290
etheless+0.284
ufact+0.262
 Lauder+0.245
veyard+0.239
Neg logit contributions
 initials-0.268
ses-0.251
 forces-0.246
advertisement-0.236
 actions-0.235
Token Adam
Token ID7244
Feature activation-4.63e-01

Pos logit contributions
osate+3.472
etheless+3.340
ufact+3.188
 Lauder+3.103
 Scotia+2.964
Neg logit contributions
 initials-2.386
ses-2.196
 forces-2.098
advertisement-2.074
 actions-2.025
Token Ward
Token ID12150
Feature activation+6.51e-01

Pos logit contributions
 initials+1.203
ses+1.132
 forces+1.115
 actions+1.058
advertisement+1.051
Neg logit contributions
osate-1.435
etheless-1.416
ufact-1.304
 Lauder-1.208
veyard-1.190
Token Americas
Token ID25733
Feature activation-7.91e-01

Pos logit contributions
osate+3.109
etheless+3.003
ufact+2.837
 Lauder+2.677
veyard+2.548
Neg logit contributions
 initials-2.861
ses-2.616
 forces-2.569
advertisement-2.552
 actions-2.473
Token)
Token ID8
Feature activation+1.10e-01

Pos logit contributions
 initials+1.679
ses+1.539
 forces+1.522
 actions+1.436
advertisement+1.416
Neg logit contributions
osate-2.828
etheless-2.808
ufact-2.614
 Lauder-2.462
veyard-2.443
Token summit
Token ID14237
Feature activation+1.36e+00

Pos logit contributions
osate+0.280
etheless+0.273
ufact+0.248
 Lauder+0.229
veyard+0.222
Neg logit contributions
 initials-0.351
ses-0.331
 forces-0.327
advertisement-0.315
 actions-0.315
Token in
Token ID287
Feature activation+2.12e+00

Pos logit contributions
osate+4.580
etheless+4.508
ufact+4.186
 Lauder+4.051
veyard+3.898
Neg logit contributions
 initials-2.990
ses-2.769
 forces-2.606
advertisement-2.525
 actions-2.485
Token Venezuela
Token ID16666
Feature activation+5.67e-01

Pos logit contributions
osate+5.242
etheless+5.118
 Lauder+4.793
ufact+4.687
 Scotia+4.488
Neg logit contributions
 initials-6.219
ses-5.817
advertisement-5.707
 forces-5.569
Reviewer-5.524
Token in
Token ID287
Feature activation+2.00e+00

Pos logit contributions
osate+2.040
etheless+2.014
ufact+1.886
 Lauder+1.798
veyard+1.752
Neg logit contributions
 initials-1.171
ses-1.076
 forces-1.033
advertisement-0.989
 actions-0.974
Token 2007
Token ID4343
Feature activation+5.58e-01

Pos logit contributions
osate+4.180
etheless+4.122
 Lauder+3.708
ufact+3.674
 Scotia+3.377
Neg logit contributions
 initials-6.641
ses-6.324
advertisement-6.153
 forces-6.057
Reviewer-5.983
Token.
Token ID13
Feature activation+5.06e-01

Pos logit contributions
osate+1.755
etheless+1.736
ufact+1.597
 Lauder+1.510
veyard+1.467
Neg logit contributions
 initials-1.402
ses-1.307
 forces-1.256
advertisement-1.210
 actions-1.201
Token At
Token ID1629
Feature activation+1.78e-01

Pos logit contributions
osate+1.563
etheless+1.518
ufact+1.416
 Lauder+1.347
 Scotia+1.284
Neg logit contributions
 initials-1.326
ses-1.228
 forces-1.203
 actions-1.144
advertisement-1.143
Token the
Token ID262
Feature activation+1.20e+00

Pos logit contributions
osate+0.588
etheless+0.576
ufact+0.536
 Lauder+0.510
veyard+0.491
Neg logit contributions
 initials-0.428
ses-0.397
 forces-0.388
advertisement-0.372
 actions-0.367
Token summit
Token ID14237
Feature activation+1.35e+00

Pos logit contributions
osate+3.261
etheless+3.217
ufact+2.891
 Lauder+2.841
 Scotia+2.659
Neg logit contributions
 initials-3.386
ses-3.183
 forces-3.109
advertisement-3.064
 actions-2.960
Token Daily
Token ID6714
Feature activation+1.25e+00

Pos logit contributions
 initials+0.238
ses+0.224
 forces+0.220
advertisement+0.211
 actions+0.211
Neg logit contributions
osate-0.221
etheless-0.216
ufact-0.198
 Lauder-0.187
veyard-0.179
Token Caller
Token ID10244
Feature activation+2.54e-01

Pos logit contributions
osate+3.476
etheless+3.444
ufact+3.151
 Lauder+2.992
 Scotia+2.914
Neg logit contributions
 initials-3.419
ses-3.331
 forces-3.124
advertisement-3.048
 actions-2.968
Token.
Token ID13
Feature activation+1.70e-01

Pos logit contributions
osate+0.800
etheless+0.789
ufact+0.723
 Lauder+0.689
pload+0.661
Neg logit contributions
 initials-0.639
ses-0.601
 forces-0.577
advertisement-0.556
 actions-0.546
Token<|endoftext|>
Token ID50256
Feature activation-1.73e+00

Pos logit contributions
osate+0.508
etheless+0.497
ufact+0.459
 Lauder+0.435
veyard+0.417
Neg logit contributions
 initials-0.462
ses-0.432
 forces-0.422
 actions-0.402
advertisement-0.402
TokenIn
Token ID818
Feature activation+6.09e-01

Pos logit contributions
 initials+3.843
ses+3.686
 forces+3.550
advertisement+3.534
 actions+3.366
Neg logit contributions
osate-5.894
etheless-5.864
ufact-5.476
 Lauder-5.120
veyard-4.951
Token a
Token ID257
Feature activation+1.12e+00

Pos logit contributions
osate+1.892
etheless+1.850
ufact+1.717
 Lauder+1.623
veyard+1.578
Neg logit contributions
 initials-1.571
ses-1.460
 forces-1.429
advertisement-1.371
 actions-1.357
Token press
Token ID1803
Feature activation+5.29e-01

Pos logit contributions
osate+3.050
etheless+2.986
ufact+2.732
 Lauder+2.604
veyard+2.482
Neg logit contributions
 initials-3.240
ses-3.048
 forces-2.971
advertisement-2.879
 actions-2.850
Token release
Token ID2650
Feature activation+8.49e-01

Pos logit contributions
osate+1.345
etheless+1.312
ufact+1.197
 Lauder+1.123
veyard+1.066
Neg logit contributions
 initials-1.647
ses-1.563
 forces-1.523
advertisement-1.478
 actions-1.467
Token sent
Token ID1908
Feature activation+2.77e-01

Pos logit contributions
osate+2.655
etheless+2.609
ufact+2.425
 Lauder+2.336
 Scotia+2.228
Neg logit contributions
 initials-2.087
ses-1.959
 forces-1.878
advertisement-1.825
 actions-1.786
Token to
Token ID284
Feature activation+8.00e-01

Pos logit contributions
osate+0.948
etheless+0.938
ufact+0.875
 Lauder+0.833
veyard+0.808
Neg logit contributions
 initials-0.615
ses-0.574
 forces-0.553
advertisement-0.531
 actions-0.522
Token Gaming
Token ID14426
Feature activation+4.24e-03

Pos logit contributions
osate+2.545
etheless+2.500
ufact+2.322
 Lauder+2.253
 Scotia+2.152
Neg logit contributions
 initials-1.913
ses-1.800
 forces-1.704
advertisement-1.692
 actions-1.636
Token so
Token ID523
Feature activation+9.84e-01

Pos logit contributions
osate+2.855
etheless+2.808
ufact+2.610
 Lauder+2.464
veyard+2.363
Neg logit contributions
 initials-2.310
ses-2.155
 forces-2.065
advertisement-2.021
 actions-1.960
Token much
Token ID881
Feature activation+1.02e+00

Pos logit contributions
osate+2.692
etheless+2.633
ufact+2.389
 Lauder+2.252
pload+2.173
Neg logit contributions
 initials-2.885
ses-2.710
 forces-2.620
advertisement-2.570
 actions-2.508
Token that
Token ID326
Feature activation+5.55e-01

Pos logit contributions
osate+3.073
etheless+3.026
ufact+2.778
 Lauder+2.669
 matter+2.525
Neg logit contributions
 initials-2.680
ses-2.498
 forces-2.395
advertisement-2.332
 actions-2.256
Token happens
Token ID4325
Feature activation+9.61e-01

Pos logit contributions
osate+1.657
etheless+1.622
ufact+1.498
 Lauder+1.405
veyard+1.360
Neg logit contributions
 initials-1.509
ses-1.408
 forces-1.374
advertisement-1.324
 actions-1.313
Token.
Token ID13
Feature activation+8.77e-02

Pos logit contributions
osate+3.225
etheless+3.221
ufact+3.000
 Lauder+2.833
veyard+2.746
Neg logit contributions
 initials-2.155
ses-1.975
 forces-1.868
advertisement-1.802
 actions-1.780
Token As
Token ID1081
Feature activation-8.98e-02

Pos logit contributions
osate+0.281
etheless+0.275
ufact+0.256
 Lauder+0.242
veyard+0.235
Neg logit contributions
 initials-0.219
ses-0.204
 forces-0.200
 actions-0.190
advertisement-0.190
Token a
Token ID257
Feature activation+1.25e+00

Pos logit contributions
 initials+0.226
ses+0.211
 forces+0.207
advertisement+0.197
 actions+0.197
Neg logit contributions
osate-0.286
etheless-0.281
ufact-0.260
 Lauder-0.245
veyard-0.239
Token matter
Token ID2300
Feature activation+1.93e+00

Pos logit contributions
osate+3.226
etheless+3.198
ufact+2.903
 Lauder+2.730
 matter+2.637
Neg logit contributions
 initials-3.802
ses-3.571
 forces-3.486
advertisement-3.424
 actions-3.350
Token of
Token ID286
Feature activation+4.94e-01

Pos logit contributions
osate+5.291
etheless+5.078
ufact+4.576
 Lauder+4.523
 matter+4.464
Neg logit contributions
 initials-5.318
ses-4.984
 forces-4.763
advertisement-4.632
 actions-4.544
Token fact
Token ID1109
Feature activation-5.65e-01

Pos logit contributions
osate+1.112
etheless+1.103
ufact+0.985
 Lauder+0.906
veyard+0.856
Neg logit contributions
 initials-1.675
ses-1.591
 forces-1.557
advertisement-1.523
 actions-1.496
Token,
Token ID11
Feature activation-8.14e-01

Pos logit contributions
 initials+1.365
ses+1.268
 forces+1.246
 actions+1.183
advertisement+1.182
Neg logit contributions
osate-1.862
etheless-1.828
ufact-1.699
 Lauder-1.603
veyard-1.567
Token than
Token ID621
Feature activation-5.13e-01

Pos logit contributions
 initials+0.265
ses+0.246
 forces+0.242
 actions+0.231
advertisement+0.230
Neg logit contributions
osate-0.349
etheless-0.342
ufact-0.318
 Lauder-0.299
veyard-0.293
Token increased
Token ID3220
Feature activation-9.86e-02

Pos logit contributions
 initials+1.296
ses+1.207
 forces+1.188
 actions+1.132
advertisement+1.130
Neg logit contributions
osate-1.637
etheless-1.604
ufact-1.488
 Lauder-1.400
veyard-1.368
Token bandwidth
Token ID19484
Feature activation-1.04e+00

Pos logit contributions
 initials+0.250
ses+0.233
 forces+0.227
advertisement+0.218
 actions+0.216
Neg logit contributions
osate-0.310
etheless-0.305
ufact-0.281
 Lauder-0.266
veyard-0.258
Token.
Token ID13
Feature activation+2.41e-01

Pos logit contributions
 initials+2.425
ses+2.267
 forces+2.251
 actions+2.139
advertisement+2.096
Neg logit contributions
osate-3.510
etheless-3.449
ufact-3.213
 Lauder-3.030
veyard-2.985
Token Crit
Token ID10056
Feature activation-1.40e+00

Pos logit contributions
osate+0.760
etheless+0.742
ufact+0.689
 Lauder+0.652
veyard+0.630
Neg logit contributions
 initials-0.616
ses-0.572
 forces-0.559
 actions-0.531
advertisement-0.530
Tokenically
Token ID1146
Feature activation+7.86e-01

Pos logit contributions
 initials+3.762
ses+3.627
 forces+3.578
 actions+3.394
advertisement+3.312
Neg logit contributions
osate-4.121
etheless-4.018
ufact-3.655
 Lauder-3.487
veyard-3.444
Token,
Token ID11
Feature activation-4.34e-01

Pos logit contributions
osate+2.372
etheless+2.335
ufact+2.117
 Lauder+2.009
 Scotia+1.967
Neg logit contributions
 initials-2.066
ses-1.914
 forces-1.844
advertisement-1.812
official-1.742
Token these
Token ID777
Feature activation+3.61e-01

Pos logit contributions
 initials+1.082
ses+1.008
 forces+0.987
advertisement+0.941
 actions+0.938
Neg logit contributions
osate-1.392
etheless-1.367
ufact-1.267
 Lauder-1.195
veyard-1.165
Token approaches
Token ID10581
Feature activation+5.71e-03

Pos logit contributions
osate+1.240
etheless+1.227
ufact+1.138
 Lauder+1.083
veyard+1.053
Neg logit contributions
 initials-0.802
ses-0.739
 forces-0.712
advertisement-0.688
 actions-0.667
Token strike
Token ID5587
Feature activation-2.91e-01

Pos logit contributions
osate+0.014
etheless+0.014
ufact+0.013
 Lauder+0.012
veyard+0.011
Neg logit contributions
 initials-0.018
ses-0.017
 forces-0.017
 actions-0.016
advertisement-0.016
Token right
Token ID826
Feature activation-6.55e-01

Pos logit contributions
 initials+0.669
ses+0.623
 forces+0.598
advertisement+0.578
 actions+0.567
Neg logit contributions
osate-0.986
etheless-0.973
ufact-0.903
 Lauder-0.860
veyard-0.835
Token\n
Token ID198
Feature activation+1.33e+00

Pos logit contributions
osate+1.225
etheless+1.186
ufact+1.108
 Lauder+1.056
veyard+1.008
Neg logit contributions
 initials-1.050
ses-0.970
 forces-0.951
advertisement-0.909
 actions-0.908
Token\n
Token ID198
Feature activation-3.33e-01

Pos logit contributions
osate+3.272
etheless+2.921
 Lauder+2.789
ufact+2.724
 matter+2.689
Neg logit contributions
 initials-4.083
ses-3.704
 forces-3.622
 actions-3.509
advertisement-3.454
Token"
Token ID1
Feature activation+1.42e-01

Pos logit contributions
 initials+0.776
ses+0.716
 forces+0.703
 actions+0.666
advertisement+0.663
Neg logit contributions
osate-1.129
etheless-1.107
ufact-1.031
 Lauder-0.976
veyard-0.955
TokenIt
Token ID1026
Feature activation+9.08e-01

Pos logit contributions
osate+0.475
etheless+0.461
ufact+0.430
 Lauder+0.411
veyard+0.401
Neg logit contributions
 initials-0.334
ses-0.306
 forces-0.300
 actions-0.284
advertisement-0.281
Token can
Token ID460
Feature activation+8.28e-01

Pos logit contributions
osate+2.453
etheless+2.367
ufact+2.208
 Lauder+2.056
veyard+1.970
Neg logit contributions
 initials-2.661
ses-2.511
 forces-2.397
advertisement-2.387
 actions-2.346
Token become
Token ID1716
Feature activation+3.25e-01

Pos logit contributions
osate+1.863
etheless+1.816
ufact+1.627
 Lauder+1.488
veyard+1.438
Neg logit contributions
 initials-2.844
ses-2.702
 forces-2.653
advertisement-2.576
 actions-2.571
Token very
Token ID845
Feature activation+7.09e-01

Pos logit contributions
osate+1.035
etheless+1.016
ufact+0.944
 Lauder+0.901
veyard+0.867
Neg logit contributions
 initials-0.806
ses-0.752
 forces-0.725
advertisement-0.707
 actions-0.691
Token complex
Token ID3716
Feature activation+2.79e-01

Pos logit contributions
osate+2.014
etheless+1.960
ufact+1.791
 Lauder+1.729
veyard+1.644
Neg logit contributions
 initials-1.983
ses-1.878
 forces-1.797
advertisement-1.768
 actions-1.718
Token very
Token ID845
Feature activation+8.20e-01

Pos logit contributions
osate+0.914
etheless+0.898
ufact+0.841
 Lauder+0.794
veyard+0.764
Neg logit contributions
 initials-0.672
ses-0.625
 forces-0.597
advertisement-0.580
 actions-0.567
Token quickly
Token ID2952
Feature activation-9.54e-01

Pos logit contributions
osate+2.031
etheless+2.001
ufact+1.807
 Lauder+1.693
veyard+1.621
Neg logit contributions
 initials-2.573
ses-2.464
 forces-2.363
advertisement-2.340
 actions-2.276
Token,"
Token ID553
Feature activation-3.57e-01

Pos logit contributions
 initials+2.143
 forces+1.989
ses+1.985
 actions+1.877
advertisement+1.838
Neg logit contributions
osate-3.296
etheless-3.232
ufact-3.000
 Lauder-2.845
veyard-2.803
Token\n
Token ID198
Feature activation+7.91e-01

Pos logit contributions
osate+1.323
etheless+1.282
ufact+1.201
 Lauder+1.144
veyard+1.094
Neg logit contributions
 initials-1.321
ses-1.230
 forces-1.194
advertisement-1.162
 actions-1.149
Token\n
Token ID198
Feature activation-6.47e-01

Pos logit contributions
osate+2.113
etheless+1.963
ufact+1.826
 Lauder+1.799
 Scotia+1.726
Neg logit contributions
 initials-2.331
ses-2.123
 forces-2.079
 actions-2.009
advertisement-1.999
TokenThe
Token ID464
Feature activation-3.38e-01

Pos logit contributions
 initials+1.465
ses+1.346
 forces+1.322
 actions+1.251
advertisement+1.242
Neg logit contributions
osate-2.230
etheless-2.189
ufact-2.041
 Lauder-1.934
veyard-1.897
Token loss
Token ID2994
Feature activation+5.96e-01

Pos logit contributions
 initials+0.929
ses+0.870
 forces+0.857
advertisement+0.820
 actions+0.819
Neg logit contributions
osate-1.002
etheless-0.982
ufact-0.905
 Lauder-0.847
veyard-0.826
Token to
Token ID284
Feature activation+4.56e-02

Pos logit contributions
osate+1.789
etheless+1.783
ufact+1.645
 Lauder+1.564
veyard+1.520
Neg logit contributions
 initials-1.545
ses-1.446
 forces-1.385
advertisement-1.369
 actions-1.338
Token the
Token ID262
Feature activation+2.24e-01

Pos logit contributions
osate+0.146
etheless+0.145
ufact+0.134
 Lauder+0.128
veyard+0.126
Neg logit contributions
 initials-0.111
ses-0.104
 forces-0.099
advertisement-0.097
 actions-0.095
Token Flyers
Token ID30652
Feature activation-4.41e-01

Pos logit contributions
osate+0.594
etheless+0.588
ufact+0.532
 Lauder+0.505
veyard+0.490
Neg logit contributions
 initials-0.665
ses-0.627
 forces-0.611
advertisement-0.602
 actions-0.591
Token was
Token ID373
Feature activation-6.99e-01

Pos logit contributions
 initials+1.233
ses+1.155
 forces+1.147
 actions+1.093
advertisement+1.085
Neg logit contributions
osate-1.297
etheless-1.264
ufact-1.164
 Lauder-1.082
veyard-1.057
Token Chicago
Token ID4842
Feature activation-1.22e+00

Pos logit contributions
 initials+1.693
ses+1.567
 forces+1.548
 actions+1.471
advertisement+1.463
Neg logit contributions
osate-2.310
etheless-2.257
ufact-2.098
 Lauder-1.979
veyard-1.936
Token's
Token ID338
Feature activation-1.04e+00

Pos logit contributions
 initials+3.314
ses+3.128
 forces+3.117
 actions+2.969
advertisement+2.902
Neg logit contributions
osate-3.699
etheless-3.627
ufact-3.331
 Lauder-3.103
veyard-3.033
Token fourth
Token ID5544
Feature activation-3.77e-01

Pos logit contributions
 initials+2.654
ses+2.469
 forces+2.442
 actions+2.339
advertisement+2.260
Neg logit contributions
osate-3.378
etheless-3.298
ufact-3.064
 Lauder-2.829
veyard-2.805
Tokenll
Token ID297
Feature activation+1.03e+00

Pos logit contributions
osate+2.822
etheless+2.795
ufact+2.635
 Lauder+2.462
veyard+2.418
Neg logit contributions
 initials-1.527
ses-1.339
 forces-1.337
advertisement-1.262
 actions-1.259
Token deserve
Token ID10925
Feature activation+7.29e-01

Pos logit contributions
osate+2.540
etheless+2.498
ufact+2.254
 Lauder+2.077
veyard+2.011
Neg logit contributions
 initials-3.328
ses-3.146
 forces-3.096
advertisement-3.004
 actions-2.983
Token more
Token ID517
Feature activation+2.17e-02

Pos logit contributions
osate+2.350
etheless+2.308
ufact+2.141
 Lauder+2.032
veyard+1.987
Neg logit contributions
 initials-1.761
ses-1.638
 forces-1.604
advertisement-1.536
 actions-1.510
Token discussion
Token ID5114
Feature activation-5.47e-02

Pos logit contributions
osate+0.061
etheless+0.059
ufact+0.054
 Lauder+0.051
veyard+0.050
Neg logit contributions
 initials-0.062
ses-0.059
 forces-0.058
advertisement-0.056
 actions-0.055
Token if
Token ID611
Feature activation+3.07e-01

Pos logit contributions
 initials+0.127
ses+0.117
 forces+0.114
advertisement+0.110
 actions+0.109
Neg logit contributions
osate-0.185
etheless-0.182
ufact-0.169
 Lauder-0.160
veyard-0.156
Token he
Token ID339
Feature activation-4.93e-01

Pos logit contributions
osate+0.978
etheless+0.958
ufact+0.884
 Lauder+0.841
veyard+0.824
Neg logit contributions
 initials-0.762
ses-0.709
 forces-0.687
advertisement-0.670
 actions-0.651
Token stays
Token ID14768
Feature activation+8.76e-02

Pos logit contributions
 initials+1.329
ses+1.250
 forces+1.236
 actions+1.176
advertisement+1.165
Neg logit contributions
osate-1.489
etheless-1.463
ufact-1.341
 Lauder-1.265
veyard-1.238
Token in
Token ID287
Feature activation+3.93e-01

Pos logit contributions
osate+0.308
etheless+0.306
ufact+0.284
 Lauder+0.272
veyard+0.263
Neg logit contributions
 initials-0.186
ses-0.172
 forces-0.165
advertisement-0.160
 actions-0.157
Token contention
Token ID22627
Feature activation-3.50e-01

Pos logit contributions
osate+1.314
etheless+1.291
ufact+1.201
 Lauder+1.143
veyard+1.112
Neg logit contributions
 initials-0.918
ses-0.851
 forces-0.826
advertisement-0.793
 actions-0.784
Token
Token ID851
Feature activation-8.80e-01

Pos logit contributions
 initials+0.777
ses+0.716
 forces+0.701
advertisement+0.663
 actions+0.662
Neg logit contributions
osate-1.220
etheless-1.200
ufact-1.120
 Lauder-1.061
veyard-1.037
Token the
Token ID262
Feature activation-3.24e-01

Pos logit contributions
 initials+2.093
ses+1.943
 forces+1.927
 actions+1.836
advertisement+1.802
Neg logit contributions
osate-2.925
etheless-2.858
ufact-2.675
 Lauder-2.512
veyard-2.459
Token 2000
Token ID4751
Feature activation-7.79e-01

Pos logit contributions
osate+0.132
etheless+0.130
ufact+0.118
 Lauder+0.109
veyard+0.107
Neg logit contributions
 initials-0.147
ses-0.138
 forces-0.137
 actions-0.131
advertisement-0.130
Token,
Token ID11
Feature activation+8.20e-02

Pos logit contributions
 initials+1.762
ses+1.628
 forces+1.606
 actions+1.519
advertisement+1.508
Neg logit contributions
osate-2.683
etheless-2.638
ufact-2.456
 Lauder-2.316
veyard-2.277
Token but
Token ID475
Feature activation-1.75e+00

Pos logit contributions
osate+0.291
etheless+0.286
ufact+0.267
 Lauder+0.255
veyard+0.248
Neg logit contributions
 initials-0.175
ses-0.162
 forces-0.157
advertisement-0.150
 actions-0.148
Token back
Token ID736
Feature activation-1.64e-01

Pos logit contributions
 initials+4.973
 forces+4.660
ses+4.572
 actions+4.497
advertisement+4.277
Neg logit contributions
osate-5.065
etheless-4.915
ufact-4.551
veyard-4.124
 Lauder-4.032
Token then
Token ID788
Feature activation-6.24e-01

Pos logit contributions
 initials+0.313
ses+0.286
 forces+0.274
advertisement+0.262
 actions+0.257
Neg logit contributions
osate-0.618
etheless-0.612
ufact-0.572
 Lauder-0.547
veyard-0.532
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
 initials+1.699
ses+1.589
 forces+1.568
 actions+1.500
advertisement+1.493
Neg logit contributions
osate-1.867
etheless-1.828
ufact-1.687
 Lauder-1.575
veyard-1.540
Token universal
Token ID10112
Feature activation+2.56e-01

Pos logit contributions
 initials+3.564
ses+3.359
 forces+3.333
 actions+3.198
advertisement+3.120
Neg logit contributions
osate-3.445
etheless-3.358
ufact-3.104
 Lauder-2.808
veyard-2.795
Token view
Token ID1570
Feature activation+1.05e+00

Pos logit contributions
osate+0.824
etheless+0.807
ufact+0.746
 Lauder+0.709
veyard+0.686
Neg logit contributions
 initials-0.630
ses-0.586
 forces-0.571
advertisement-0.551
 actions-0.544
Token of
Token ID286
Feature activation-8.16e-02

Pos logit contributions
osate+2.987
etheless+2.917
ufact+2.689
 Lauder+2.525
pload+2.440
Neg logit contributions
 initials-2.877
ses-2.722
 forces-2.583
advertisement-2.571
 actions-2.517
Token the
Token ID262
Feature activation+4.05e-02

Pos logit contributions
 initials+0.221
ses+0.208
 forces+0.203
advertisement+0.196
 actions+0.194
Neg logit contributions
osate-0.243
etheless-0.238
ufact-0.220
 Lauder-0.207
veyard-0.201
Token comment
Token ID2912
Feature activation+4.44e-01

Pos logit contributions
osate+0.114
etheless+0.111
ufact+0.102
 Lauder+0.096
veyard+0.093
Neg logit contributions
 initials-0.117
ses-0.110
 forces-0.108
advertisement-0.104
 actions-0.103
Token,
Token ID11
Feature activation+7.81e-01

Pos logit contributions
osate+2.771
etheless+2.750
ufact+2.560
 Lauder+2.429
veyard+2.352
Neg logit contributions
 initials-2.103
ses-1.945
advertisement-1.880
 forces-1.864
 actions-1.809
Token four
Token ID1440
Feature activation+4.62e-01

Pos logit contributions
osate+2.356
etheless+2.323
ufact+2.130
 Lauder+2.030
veyard+1.943
Neg logit contributions
 initials-2.032
ses-1.887
 forces-1.826
advertisement-1.824
 actions-1.760
Token million
Token ID1510
Feature activation+1.95e+00

Pos logit contributions
osate+1.299
etheless+1.271
ufact+1.160
 Lauder+1.093
veyard+1.051
Neg logit contributions
 initials-1.323
ses-1.246
 forces-1.218
advertisement-1.185
 actions-1.168
Token envelop
Token ID16441
Feature activation+7.55e-01

Pos logit contributions
etheless+5.715
osate+5.712
ufact+5.153
 Lauder+5.012
 Scotia+4.794
Neg logit contributions
 initials-4.758
advertisement-4.470
ses-4.457
 forces-4.301
parser-4.246
Tokenes
Token ID274
Feature activation+1.65e+00

Pos logit contributions
osate+2.414
etheless+2.340
ufact+2.227
 Lauder+2.084
pload+2.010
Neg logit contributions
 initials-1.825
 forces-1.652
ses-1.630
advertisement-1.602
 actions-1.561
Token and
Token ID290
Feature activation-6.89e-01

Pos logit contributions
osate+4.757
etheless+4.742
ufact+4.404
 Lauder+4.117
 matter+4.047
Neg logit contributions
 initials-4.175
ses-3.928
 forces-3.808
advertisement-3.802
 actions-3.668
Token 100
Token ID1802
Feature activation+8.50e-01

Pos logit contributions
 initials+1.727
ses+1.606
 forces+1.586
 actions+1.508
advertisement+1.489
Neg logit contributions
osate-2.220
etheless-2.176
ufact-2.020
 Lauder-1.891
veyard-1.859
Token ballot
Token ID11100
Feature activation+1.34e+00

Pos logit contributions
osate+2.597
etheless+2.529
ufact+2.326
 Lauder+2.219
veyard+2.137
Neg logit contributions
 initials-2.207
ses-2.051
 forces-2.004
advertisement-1.952
 actions-1.913
Token boxes
Token ID10559
Feature activation+5.98e-01

Pos logit contributions
osate+4.709
etheless+4.629
ufact+4.350
 Lauder+4.200
veyard+4.086
Neg logit contributions
 initials-2.634
ses-2.398
advertisement-2.367
 forces-2.348
official-2.232
Token in
Token ID287
Feature activation+1.26e+00

Pos logit contributions
osate+1.891
etheless+1.870
ufact+1.740
 Lauder+1.636
veyard+1.602
Neg logit contributions
 initials-1.467
ses-1.354
 forces-1.316
advertisement-1.295
 actions-1.265
Token a
Token ID257
Feature activation+1.46e+00

Pos logit contributions
osate+4.010
etheless+3.901
ufact+3.604
 Lauder+3.545
veyard+3.379
Neg logit contributions
 initials-3.048
ses-2.811
advertisement-2.749
 forces-2.698
MSN-2.571
Token star
Token ID3491
Feature activation-1.08e+00

Pos logit contributions
 initials+1.900
ses+1.766
 forces+1.743
 actions+1.659
advertisement+1.643
Neg logit contributions
osate-2.397
etheless-2.345
ufact-2.173
 Lauder-2.034
veyard-2.005
Token tracker
Token ID30013
Feature activation-6.37e-01

Pos logit contributions
 initials+3.000
ses+2.787
 forces+2.777
 actions+2.641
advertisement+2.598
Neg logit contributions
osate-3.208
etheless-3.164
ufact-2.896
 Lauder-2.684
veyard-2.661
Token,
Token ID11
Feature activation-6.91e-01

Pos logit contributions
 initials+1.538
ses+1.427
 forces+1.402
 actions+1.332
advertisement+1.331
Neg logit contributions
osate-2.103
etheless-2.064
ufact-1.918
 Lauder-1.809
veyard-1.769
Token was
Token ID373
Feature activation+3.33e-01

Pos logit contributions
 initials+1.843
ses+1.716
 forces+1.703
 actions+1.621
advertisement+1.603
Neg logit contributions
osate-2.113
etheless-2.064
ufact-1.908
 Lauder-1.783
veyard-1.749
Token re
Token ID302
Feature activation+8.01e-01

Pos logit contributions
osate+1.007
etheless+1.004
ufact+0.927
 Lauder+0.872
veyard+0.847
Neg logit contributions
 initials-0.866
ses-0.815
 forces-0.790
advertisement-0.777
 actions-0.757
Token-
Token ID12
Feature activation-3.02e-01

Pos logit contributions
osate+2.121
etheless+2.088
ufact+1.905
 Lauder+1.746
veyard+1.710
Neg logit contributions
 initials-2.430
ses-2.258
 forces-2.223
advertisement-2.148
 actions-2.140
Tokentask
Token ID35943
Feature activation-6.29e-01

Pos logit contributions
 initials+0.735
ses+0.681
 forces+0.670
 actions+0.637
advertisement+0.636
Neg logit contributions
osate-0.990
etheless-0.972
ufact-0.903
 Lauder-0.851
veyard-0.832
Tokened
Token ID276
Feature activation-6.98e-01

Pos logit contributions
 initials+1.334
ses+1.225
 forces+1.217
 actions+1.138
advertisement+1.126
Neg logit contributions
osate-2.264
etheless-2.221
ufact-2.074
 Lauder-1.975
veyard-1.936
Token to
Token ID284
Feature activation+3.50e-01

Pos logit contributions
 initials+1.473
ses+1.349
 forces+1.331
 actions+1.253
advertisement+1.245
Neg logit contributions
osate-2.522
etheless-2.470
ufact-2.315
 Lauder-2.193
veyard-2.151
Token fly
Token ID6129
Feature activation+6.64e-01

Pos logit contributions
osate+0.965
etheless+0.944
ufact+0.864
 Lauder+0.806
veyard+0.782
Neg logit contributions
 initials-1.032
ses-0.972
 forces-0.957
advertisement-0.920
 actions-0.918
Token by
Token ID416
Feature activation-5.68e-01

Pos logit contributions
osate+2.207
etheless+2.192
ufact+2.019
 Lauder+1.923
veyard+1.877
Neg logit contributions
 initials-1.544
ses-1.427
 forces-1.393
advertisement-1.346
 actions-1.321
Token Samsung
Token ID10397
Feature activation+5.48e-02

Pos logit contributions
 initials+2.169
ses+1.994
 forces+1.964
 actions+1.854
advertisement+1.833
Neg logit contributions
osate-3.589
etheless-3.529
ufact-3.299
 Lauder-3.107
veyard-3.062
Token,
Token ID11
Feature activation+1.27e+00

Pos logit contributions
osate+0.147
etheless+0.143
ufact+0.131
 Lauder+0.122
veyard+0.118
Neg logit contributions
 initials-0.166
ses-0.156
 forces-0.154
advertisement-0.148
 actions-0.148
Token a
Token ID257
Feature activation+1.33e+00

Pos logit contributions
osate+3.772
etheless+3.673
ufact+3.376
 Lauder+3.330
veyard+3.146
Neg logit contributions
 initials-3.205
ses-3.056
 forces-2.916
advertisement-2.887
official-2.806
Token major
Token ID1688
Feature activation-3.11e-01

Pos logit contributions
osate+3.922
etheless+3.849
ufact+3.563
 Lauder+3.434
veyard+3.266
Neg logit contributions
 initials-3.465
ses-3.270
 forces-3.155
advertisement-3.070
 actions-3.056
Token maker
Token ID16009
Feature activation-9.95e-01

Pos logit contributions
 initials+0.685
ses+0.632
 forces+0.617
advertisement+0.586
 actions+0.584
Neg logit contributions
osate-1.088
etheless-1.068
ufact-0.997
 Lauder-0.947
veyard-0.924
Token of
Token ID286
Feature activation-2.65e-01

Pos logit contributions
 initials+2.449
ses+2.275
 forces+2.239
 actions+2.127
advertisement+2.126
Neg logit contributions
osate-3.232
etheless-3.171
ufact-2.944
 Lauder-2.773
veyard-2.712
Token Honey
Token ID21788
Feature activation+7.71e-01

Pos logit contributions
 initials+0.467
ses+0.422
 forces+0.409
advertisement+0.387
 actions+0.379
Neg logit contributions
osate-1.037
etheless-1.023
ufact-0.961
 Lauder-0.923
veyard-0.897
Tokencomb
Token ID24011
Feature activation+1.42e+00

Pos logit contributions
osate+1.555
etheless+1.480
ufact+1.301
 Lauder+1.214
veyard+1.131
Neg logit contributions
 initials-2.819
ses-2.657
 forces-2.627
advertisement-2.565
 actions-2.542
Token tablets
Token ID17255
Feature activation+2.16e-01

Pos logit contributions
osate+4.595
etheless+4.439
ufact+4.170
 Lauder+3.980
 matter+3.809
Neg logit contributions
 initials-3.296
ses-3.023
advertisement-2.935
 forces-2.923
official-2.848
Token,
Token ID11
Feature activation-1.76e-01

Pos logit contributions
osate+0.678
etheless+0.670
ufact+0.619
 Lauder+0.581
veyard+0.561
Neg logit contributions
 initials-0.547
ses-0.509
 forces-0.493
advertisement-0.477
 actions-0.474
Token argue
Token ID7267
Feature activation+1.06e+00

Pos logit contributions
 initials+0.551
ses+0.520
 forces+0.516
 actions+0.495
advertisement+0.491
Neg logit contributions
osate-0.459
etheless-0.447
ufact-0.409
 Lauder-0.376
veyard-0.368
TokenWe
Token ID1135
Feature activation-9.01e-02

Pos logit contributions
 initials+0.988
ses+0.916
 forces+0.896
advertisement+0.851
 actions+0.846
Neg logit contributions
osate-1.558
etheless-1.533
ufact-1.431
 Lauder-1.353
veyard-1.323
Token
Token ID447
Feature activation+6.94e-01

Pos logit contributions
 initials+0.266
ses+0.249
 forces+0.248
 actions+0.237
advertisement+0.236
Neg logit contributions
osate-0.250
etheless-0.244
ufact-0.223
 Lauder-0.208
veyard-0.202
Token
Token ID247
Feature activation+1.17e+00

Pos logit contributions
osate+2.148
etheless+2.094
ufact+1.927
 Lauder+1.834
veyard+1.789
Neg logit contributions
 initials-1.780
ses-1.663
 forces-1.612
 actions-1.552
advertisement-1.526
Tokenll
Token ID297
Feature activation+1.97e+00

Pos logit contributions
osate+3.383
etheless+3.356
ufact+3.106
 Lauder+2.867
veyard+2.806
Neg logit contributions
 initials-3.246
ses-2.989
 forces-2.965
 actions-2.857
advertisement-2.831
Token start
Token ID923
Feature activation-8.70e-01

Pos logit contributions
osate+4.528
etheless+4.492
ufact+4.028
 Lauder+3.719
veyard+3.629
Neg logit contributions
 initials-6.333
ses-6.035
 forces-5.876
advertisement-5.786
 actions-5.719
Token with
Token ID351
Feature activation-2.60e+00

Pos logit contributions
 initials+1.812
ses+1.650
 forces+1.635
 actions+1.539
advertisement+1.526
Neg logit contributions
osate-3.171
etheless-3.109
ufact-2.899
 Lauder-2.748
veyard-2.708
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
 initials+5.737
ses+5.220
 forces+5.220
 actions+4.961
advertisement+4.676
Neg logit contributions
osate-8.940
etheless-8.609
ufact-8.100
veyard-7.559
 Lauder-7.535
Token 17
Token ID1596
Feature activation-1.08e-01

Pos logit contributions
 initials+2.577
ses+2.395
 forces+2.355
 actions+2.238
advertisement+2.199
Neg logit contributions
osate-3.420
etheless-3.343
ufact-3.112
 Lauder-2.910
veyard-2.871
Token-
Token ID12
Feature activation-1.10e-01

Pos logit contributions
 initials+0.320
ses+0.301
 forces+0.297
advertisement+0.286
 actions+0.285
Neg logit contributions
osate-0.295
etheless-0.289
ufact-0.265
 Lauder-0.247
veyard-0.239
Token40
Token ID1821
Feature activation+5.46e-01

Pos logit contributions
 initials+0.298
ses+0.276
 forces+0.273
advertisement+0.261
 actions+0.261
Neg logit contributions
osate-0.330
etheless-0.326
ufact-0.301
 Lauder-0.282
veyard-0.274
Token f
Token ID277
Feature activation+2.00e-01

Pos logit contributions
osate+1.503
etheless+1.481
ufact+1.355
 Lauder+1.269
veyard+1.218
Neg logit contributions
 initials-1.577
ses-1.482
 forces-1.457
advertisement-1.418
 actions-1.396
Token natural
Token ID3288
Feature activation+2.38e-01

Pos logit contributions
osate+2.502
etheless+2.442
ufact+2.247
 Lauder+2.125
veyard+2.044
Neg logit contributions
 initials-2.437
ses-2.271
 forces-2.221
advertisement-2.193
 actions-2.122
Token gas
Token ID3623
Feature activation+9.00e-01

Pos logit contributions
osate+0.488
etheless+0.472
ufact+0.420
 Lauder+0.378
veyard+0.360
Neg logit contributions
 initials-0.868
ses-0.826
 forces-0.804
advertisement-0.795
 actions-0.782
Token field
Token ID2214
Feature activation+3.65e-01

Pos logit contributions
osate+2.460
etheless+2.383
ufact+2.236
 Lauder+2.089
veyard+1.989
Neg logit contributions
 initials-2.608
ses-2.439
 forces-2.371
advertisement-2.331
official-2.275
Token
Token ID784
Feature activation-1.99e-02

Pos logit contributions
osate+1.160
etheless+1.135
ufact+1.059
 Lauder+0.997
veyard+0.969
Neg logit contributions
 initials-0.920
ses-0.851
 forces-0.834
advertisement-0.802
 actions-0.793
Token with
Token ID351
Feature activation-5.56e-02

Pos logit contributions
 initials+0.050
ses+0.046
 forces+0.045
advertisement+0.043
 actions+0.043
Neg logit contributions
osate-0.064
etheless-0.063
ufact-0.058
 Lauder-0.055
veyard-0.053
Token Turkey
Token ID7137
Feature activation-2.66e+00

Pos logit contributions
 initials+0.137
ses+0.128
 forces+0.125
advertisement+0.120
 actions+0.118
Neg logit contributions
osate-0.180
etheless-0.177
ufact-0.164
 Lauder-0.155
veyard-0.150
Token buying
Token ID7067
Feature activation-3.74e-01

Pos logit contributions
 initials+6.497
 forces+6.111
ses+5.980
 actions+5.818
official+5.417
Neg logit contributions
osate-8.325
etheless-8.109
ufact-7.489
veyard-7.081
 Lauder-6.909
Token gas
Token ID3623
Feature activation+3.62e-01

Pos logit contributions
 initials+0.977
ses+0.912
 forces+0.898
 actions+0.856
advertisement+0.855
Neg logit contributions
osate-1.159
etheless-1.136
ufact-1.050
 Lauder-0.986
veyard-0.963
Token from
Token ID422
Feature activation-3.78e-01

Pos logit contributions
osate+1.139
etheless+1.121
ufact+1.045
 Lauder+0.986
veyard+0.947
Neg logit contributions
 initials-0.914
ses-0.847
 forces-0.823
advertisement-0.801
 actions-0.786
Token Israel
Token ID2692
Feature activation-1.50e+00

Pos logit contributions
 initials+0.985
ses+0.920
 forces+0.906
 actions+0.864
advertisement+0.861
Neg logit contributions
osate-1.172
etheless-1.149
ufact-1.063
 Lauder-0.998
veyard-0.975
Token
Token ID447
Feature activation-1.07e+00

Pos logit contributions
 initials+3.373
 forces+3.164
ses+3.151
 actions+2.972
advertisement+2.880
Neg logit contributions
osate-5.084
etheless-4.963
ufact-4.600
veyard-4.381
 Lauder-4.371
Token.
Token ID13
Feature activation-1.14e-02

Pos logit contributions
osate+0.120
etheless+0.118
ufact+0.110
 Lauder+0.104
veyard+0.101
Neg logit contributions
 initials-0.081
ses-0.075
 forces-0.073
advertisement-0.070
 actions-0.069
Token When
Token ID1649
Feature activation-3.06e-01

Pos logit contributions
 initials+0.030
ses+0.027
 forces+0.027
advertisement+0.026
 actions+0.026
Neg logit contributions
osate-0.035
etheless-0.035
ufact-0.032
 Lauder-0.030
veyard-0.029
Token we
Token ID356
Feature activation+5.32e-02

Pos logit contributions
 initials+0.744
ses+0.691
 forces+0.673
advertisement+0.646
 actions+0.636
Neg logit contributions
osate-0.999
etheless-0.982
ufact-0.911
 Lauder-0.862
veyard-0.840
Token use
Token ID779
Feature activation-1.61e+00

Pos logit contributions
osate+0.131
etheless+0.128
ufact+0.116
 Lauder+0.106
veyard+0.103
Neg logit contributions
 initials-0.173
ses-0.164
 forces-0.162
 actions-0.156
advertisement-0.156
Token curly
Token ID45731
Feature activation-1.24e+00

Pos logit contributions
 initials+3.808
 forces+3.520
ses+3.505
 actions+3.356
advertisement+3.267
Neg logit contributions
osate-5.439
etheless-5.297
ufact-4.962
 Lauder-4.635
veyard-4.581
Token brace
Token ID22581
Feature activation-2.33e+00

Pos logit contributions
 initials+3.618
ses+3.368
 forces+3.364
 actions+3.240
advertisement+3.193
Neg logit contributions
osate-3.469
etheless-3.393
ufact-3.143
 Lauder-2.891
veyard-2.822
Token notation
Token ID33274
Feature activation-8.94e-01

Pos logit contributions
 initials+4.912
ses+4.571
 forces+4.512
 actions+4.407
advertisement+4.060
Neg logit contributions
osate-8.345
etheless-8.187
ufact-7.718
veyard-7.244
 Lauder-7.213
Token this
Token ID428
Feature activation+1.62e-01

Pos logit contributions
 initials+2.098
ses+1.942
 forces+1.913
 actions+1.818
advertisement+1.803
Neg logit contributions
osate-3.015
etheless-2.963
ufact-2.758
 Lauder-2.601
veyard-2.553
Token is
Token ID318
Feature activation-8.73e-01

Pos logit contributions
osate+0.448
etheless+0.438
ufact+0.402
 Lauder+0.378
veyard+0.363
Neg logit contributions
 initials-0.467
ses-0.440
 forces-0.428
advertisement-0.415
 actions-0.412
Token not
Token ID407
Feature activation-3.57e-01

Pos logit contributions
 initials+2.214
ses+2.059
 forces+2.043
 actions+1.950
advertisement+1.933
Neg logit contributions
osate-2.783
etheless-2.716
ufact-2.517
 Lauder-2.367
veyard-2.327
Token a
Token ID257
Feature activation-4.54e-01

Pos logit contributions
 initials+0.848
ses+0.784
 forces+0.765
advertisement+0.728
 actions+0.725
Neg logit contributions
osate-1.187
etheless-1.170
ufact-1.087
 Lauder-1.029
veyard-0.999
Token sites
Token ID5043
Feature activation-9.35e-01

Pos logit contributions
 initials+1.309
ses+1.211
 forces+1.192
 actions+1.129
advertisement+1.124
Neg logit contributions
osate-1.931
etheless-1.895
ufact-1.768
 Lauder-1.667
veyard-1.636
Token just
Token ID655
Feature activation+5.18e-01

Pos logit contributions
 initials+2.317
ses+2.155
 forces+2.142
 actions+2.032
advertisement+2.012
Neg logit contributions
osate-3.041
etheless-2.963
ufact-2.762
 Lauder-2.595
veyard-2.549
Token for
Token ID329
Feature activation-2.82e-01

Pos logit contributions
osate+1.679
etheless+1.655
ufact+1.533
 Lauder+1.450
veyard+1.407
Neg logit contributions
 initials-1.254
ses-1.171
 forces-1.130
advertisement-1.084
 actions-1.080
Token the
Token ID262
Feature activation+8.37e-02

Pos logit contributions
 initials+0.625
ses+0.578
 forces+0.563
advertisement+0.539
 actions+0.532
Neg logit contributions
osate-0.976
etheless-0.962
ufact-0.895
 Lauder-0.853
veyard-0.828
Token dominant
Token ID11410
Feature activation-1.39e+00

Pos logit contributions
osate+0.286
etheless+0.282
ufact+0.263
 Lauder+0.251
veyard+0.243
Neg logit contributions
 initials-0.186
ses-0.172
 forces-0.167
advertisement-0.162
 actions-0.158
Token browser
Token ID6444
Feature activation-2.42e+00

Pos logit contributions
 initials+3.010
 forces+2.764
ses+2.758
 actions+2.591
advertisement+2.498
Neg logit contributions
osate-4.983
etheless-4.883
ufact-4.602
 Lauder-4.306
veyard-4.285
Token is
Token ID318
Feature activation-1.02e+00

Pos logit contributions
 initials+6.274
 forces+6.014
ses+5.802
 actions+5.608
advertisement+5.284
Neg logit contributions
osate-7.424
etheless-7.309
ufact-6.777
veyard-6.330
 Lauder-6.188
Token the
Token ID262
Feature activation-5.69e-01

Pos logit contributions
 initials+2.546
ses+2.359
 forces+2.344
 actions+2.231
advertisement+2.217
Neg logit contributions
osate-3.295
etheless-3.213
ufact-2.986
 Lauder-2.793
veyard-2.755
Token kind
Token ID1611
Feature activation+6.18e-01

Pos logit contributions
 initials+1.421
ses+1.323
 forces+1.301
 actions+1.238
advertisement+1.234
Neg logit contributions
osate-1.832
etheless-1.797
ufact-1.667
 Lauder-1.568
veyard-1.535
Token of
Token ID286
Feature activation+2.82e-01

Pos logit contributions
osate+1.861
etheless+1.844
ufact+1.666
 Lauder+1.564
veyard+1.534
Neg logit contributions
 initials-1.656
ses-1.554
 forces-1.486
advertisement-1.444
 actions-1.426
Token thing
Token ID1517
Feature activation+1.08e+00

Pos logit contributions
osate+0.811
etheless+0.799
ufact+0.736
 Lauder+0.692
veyard+0.668
Neg logit contributions
 initials-0.784
ses-0.735
 forces-0.714
advertisement-0.693
 actions-0.682
Tokenparse
Token ID29572
Feature activation-4.33e-01

Pos logit contributions
 initials+1.121
ses+1.078
 forces+1.046
advertisement+0.999
 actions+0.992
Neg logit contributions
osate-1.536
etheless-1.518
ufact-1.418
 Lauder-1.329
veyard-1.289
Token_
Token ID 62
Feature activation -1.64e+00

Pos logit contributions
 initials+1.097
ses+1.027
 forces+1.010
advertisement+0.964
 actions+0.963
Neg logit contributions
osate-1.377
etheless-1.360
ufact-1.256
 Lauder-1.176
veyard-1.151
Tokenargs
Token ID 22046
Feature activation -1.58e+00

Pos logit contributions
 initials+3.141
ses+3.056
 forces+2.956
 actions+2.745
advertisement+2.705
Neg logit contributions
osate-6.188
etheless-6.174
ufact-5.845
 Lauder-5.506
 conclud-5.318
Token())
Token ID 28955
Feature activation -1.61e+00

Pos logit contributions
 initials+2.412
ses+2.243
 forces+2.167
 actions+2.020
advertisement+1.952
Neg logit contributions
etheless-6.526
osate-6.508
ufact-6.133
 Lauder-5.750
veyard-5.644
Token #
Token ID 1303
Feature activation -1.33e+00

Pos logit contributions
 initials+2.343
 forces+2.068
ses+2.056
 actions+1.895
advertisement+1.775
Neg logit contributions
osate-6.796
etheless-6.714
ufact-6.319
 Lauder-6.076
veyard-5.990
Token initialize
Token ID 41216
Feature activation -2.46e+00

Pos logit contributions
 initials+3.111
 forces+2.877
ses+2.852
 actions+2.723
advertisement+2.620
Neg logit contributions
osate-4.502
etheless-4.401
ufact-4.124
veyard-3.819
 Lauder-3.808
Token a
Token ID 257
Feature activation -7.33e-01

Pos logit contributions
 initials+3.447
 forces+3.355
ses+3.226
 actions+3.048
advertisement+2.694
Neg logit contributions
etheless-10.671
osate-10.473
ufact-9.941
veyard-9.213
 conclud-9.194
Token rectangular
Token ID 36954
Feature activation -7.60e-01

Pos logit contributions
 initials+1.688
ses+1.565
 forces+1.545
 actions+1.465
advertisement+1.448
Neg logit contributions
osate-2.522
etheless-2.474
ufact-2.303
 Lauder-2.157
veyard-2.135
Token and
Token ID 290
Feature activation -1.61e+00

Pos logit contributions
 initials+1.664
ses+1.542
 forces+1.535
 actions+1.445
advertisement+1.423
Neg logit contributions
osate-2.691
etheless-2.659
ufact-2.479
 Lauder-2.323
veyard-2.289
Token square
Token ID 6616
Feature activation +6.60e-01

Pos logit contributions
 initials+5.049
 forces+4.909
ses+4.832
 actions+4.697
advertisement+4.620
Neg logit contributions
osate-4.177
etheless-4.132
ufact-3.747
 Lauder-3.336
veyard-3.306
Token struct
Token ID 2878
Feature activation -1.27e+00

Pos logit contributions
osate+1.753
etheless+1.702
ufact+1.587
 Lauder+1.459
pload+1.415
Neg logit contributions
 initials-1.962
ses-1.851
 forces-1.792
advertisement-1.746
 actions-1.732