TOP ACTIVATIONS
MAX = 5.637

of thousands of dollars selling e books on niche categories .
China . This would mean co - operation with at least
for more news from G io Wolf and the European Knights
free , legal copies of e books . Make sure your
in rural regions with limited access to health care . In
pu u . E pen a mo ek ahi
to help our millions of e - news letter subscribers and
the child the name M ö ng ke ( meaning "
a marvelous framework for modern web applications . But as with
. <|endoftext|> Power BI has come a long way
diagnosis and treatment , and access to information .[ 109 ]
schools and playing fields ," HR W researcher La et itia
front of a bull . http :// ce pr . net
olla is also negotiating new co - oper ations in relation
iki m ai ana . E p au ka p ā
gorith ms and more about web development and databases .
[ 10 : 47 ] <@ t ay l ort ot
a marvelous framework for modern web applications . But as with
- The premature access ion of Bulgaria and Romania
the past two years , HR W said .

MIN ACTIVATIONS
MIN = -6.194

arse wrote : Show nested quote +
the referendum . It was already known that the
and the world . <|endoftext|> It is impossible to identify what
WATCH : <|endoftext|> It can t be
of the investigations . <|endoftext|> It was supposed to be a
's Facebook page . <|endoftext|> It 's back - to -
salary next year . <|endoftext|> It all starts with Media Day
mainstream debate . It 's stunning when you think
. curfew . It 's all been worth it
and for References : <|endoftext|> It took until Sunday morning for
. <|endoftext|> EPA It 's the kind of beast
Batman has always driven the Bat mobile : an incredible car
hack . ' It 's a theory that is
read : " Bar ack Obama would be the
his remarks . It will reduce the problem of
21 : 20 IST <|endoftext|> It 's a question I 've
own some bitcoins . <|endoftext|> Their solution involves encaps ulating per
their authority . It 's understandable parents would want
statement added . It is not clear yet how
requirement . " It just says you can 't

INTERVAL 2.679 - 5.637
CONTAINS 0.824%

both national governments and international co - operation .
from an array . The ability to undo a move and
ort > haha j ason [ 10 : 47 ] <
page articles , but health care always seems to be on
:" or " ex :" [ white ] optional white space

INTERVAL -0.278 - 2.679
CONTAINS 61.690%

' ils ont mont é une n ou vel le organisation
- 2002 - 3 Daniel Craig films , 2006 - 2012
m a social worker ! she yelled rapid -
impe ached within the next year - and - a -
is , of course , informed about the issue . In

INTERVAL -3.236 - -0.278
CONTAINS 37.133%

and the shadows , exceptionally cheerful . That is , cheerful
of the Jewish people in the land of Israel , are
they are bringing to the convention hall , and so now
- 16 jets . The Government of Bahrain has
still greater on women ( to look good ),

INTERVAL -6.194 - -3.236
CONTAINS 0.353%

at "] [ Black " Ant ip ov , Mikhail Al
admins always grant it . It 's time , with $
WA Fresh man All - America Team by the United States
what generation could do . It was still noise generated terrain
Order walk in Glasgow on Saturday . The girl
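
Note on the INTERVAL ranges above: the four ranges look like equal-width bins spanning MIN = -6.194 to MAX = 5.637. This is an inference from the printed boundaries, not something the dump states. The sketch below is a minimal illustration under that assumption. The function name `equal_width_edges` is hypothetical and is not taken from the tool that produced this output.

```python
# Minimal sketch: reproduce the INTERVAL boundaries, ASSUMING they are
# equal-width bins between the reported MIN and MAX activations.
# `equal_width_edges` is a hypothetical helper, not part of the original tool.

def equal_width_edges(lo: float, hi: float, n_bins: int) -> list[float]:
    """Return the n_bins + 1 edges of equal-width bins covering [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# MIN and MAX as printed in the dump; four intervals are listed.
edges = equal_width_edges(-6.194, 5.637, 4)

# Print each (lower, upper) pair for comparison with the INTERVAL lines.
for lower, upper in zip(edges[:-1], edges[1:]):
    print(f"{lower:.5f} to {upper:.5f}")
```

The interior edges land within rounding of the printed boundaries (-3.236, -0.278, 2.679), which is consistent with the equal-width assumption. The CONTAINS percentages (0.824, 61.690, 37.133, 0.353) sum to 100, so each appears to be the share of activations falling in that bin.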
Token of
Token ID286
Feature activation+1.24e+00

Pos logit contributions
anwhile+1.814
PN+1.788
DR+1.750
hy+1.738
engineering+1.684
Neg logit contributions
ional-1.507
 Favor-1.481
inas-1.469
Jar-1.432
 deviation-1.383
Token thousands
Token ID4138
Feature activation+6.77e-01

Pos logit contributions
anwhile+1.877
PN+1.850
DR+1.815
hy+1.763
estern+1.718
Neg logit contributions
ional-1.942
inas-1.851
 Favor-1.838
Jar-1.810
 deviation-1.737
Token of
Token ID286
Feature activation+1.25e+00

Pos logit contributions
anwhile+1.138
PN+1.116
DR+1.088
hy+1.071
engineering+1.049
Neg logit contributions
ional-0.947
 Favor-0.920
inas-0.918
Jar-0.898
 deviation-0.856
Token dollars
Token ID5054
Feature activation+9.32e-01

Pos logit contributions
anwhile+2.073
PN+2.055
DR+2.008
hy+1.959
estern+1.894
Neg logit contributions
ional-1.783
inas-1.679
 Favor-1.668
Jar-1.651
 deviation-1.562
Token selling
Token ID6301
Feature activation-7.06e-01

Pos logit contributions
anwhile+1.533
PN+1.489
DR+1.460
hy+1.420
engineering+1.403
Neg logit contributions
ional-1.341
inas-1.314
 Favor-1.311
Jar-1.282
 deviation-1.220
Token e
Token ID304
Feature activation+5.05e+00

Pos logit contributions
ional+0.995
inas+0.944
 Favor+0.938
Jar+0.914
 deviation+0.877
Neg logit contributions
anwhile-1.238
PN-1.202
DR-1.162
hy-1.149
estern-1.115
Tokenbooks
Token ID12106
Feature activation+1.48e+00

Pos logit contributions
DR+6.524
PN+6.473
anwhile+6.459
hy+6.067
sp+5.802
Neg logit contributions
ional-8.585
 Favor-8.135
inas-7.956
 deviation-7.925
icularly-7.886
Token on
Token ID319
Feature activation-9.80e-01

Pos logit contributions
anwhile+2.460
PN+2.391
DR+2.342
hy+2.325
estern+2.244
Neg logit contributions
ional-2.091
inas-2.018
 Favor-2.006
Jar-1.969
 deviation-1.887
Token niche
Token ID21404
Feature activation+3.68e-01

Pos logit contributions
ional+1.364
inas+1.303
 Favor+1.295
Jar+1.262
 deviation+1.207
Neg logit contributions
anwhile-1.735
PN-1.678
DR-1.620
hy-1.603
estern-1.563
Token categories
Token ID9376
Feature activation-1.75e+00

Pos logit contributions
anwhile+0.534
PN+0.525
DR+0.506
hy+0.499
estern+0.474
Neg logit contributions
ional-0.616
inas-0.588
 Favor-0.582
Jar-0.577
 deviation-0.551
Token.
Token ID13
Feature activation+2.01e-01

Pos logit contributions
ional+2.394
 Favor+2.278
inas+2.264
Jar+2.198
 deviation+2.116
Neg logit contributions
anwhile-3.145
PN-3.028
DR-2.907
hy-2.887
estern-2.832
Token China
Token ID2807
Feature activation-1.54e+00

Pos logit contributions
ional+0.072
inas+0.068
 Favor+0.068
Jar+0.066
 deviation+0.063
Neg logit contributions
anwhile-0.090
PN-0.087
DR-0.084
hy-0.083
engineering-0.081
Token.
Token ID13
Feature activation+2.69e-01

Pos logit contributions
ional+2.095
 Favor+1.993
inas+1.982
Jar+1.923
 deviation+1.852
Neg logit contributions
anwhile-2.805
PN-2.704
DR-2.599
hy-2.580
estern-2.525
Token This
Token ID770
Feature activation-8.24e-01

Pos logit contributions
anwhile+0.464
PN+0.454
DR+0.443
hy+0.429
engineering+0.420
Neg logit contributions
ional-0.385
inas-0.365
 Favor-0.363
Jar-0.346
 deviation-0.340
Token would
Token ID561
Feature activation+5.76e-01

Pos logit contributions
ional+1.166
 Favor+1.110
inas+1.107
Jar+1.074
 deviation+1.027
Neg logit contributions
anwhile-1.448
PN-1.398
DR-1.343
hy-1.335
engineering-1.298
Token mean
Token ID1612
Feature activation-2.72e+00

Pos logit contributions
anwhile+0.991
PN+0.952
DR+0.925
hy+0.923
engineering+0.880
Neg logit contributions
ional-0.817
inas-0.776
 Favor-0.775
Jar-0.760
 deviation-0.721
Token co
Token ID763
Feature activation+4.74e+00

Pos logit contributions
ional+3.709
 Favor+3.513
inas+3.491
Jar+3.401
 deviation+3.284
Neg logit contributions
anwhile-4.926
PN-4.719
DR-4.497
hy-4.487
estern-4.458
Token-
Token ID12
Feature activation+1.46e+00

Pos logit contributions
anwhile+6.652
PN+6.645
DR+6.422
hy+6.371
engineering+6.271
Neg logit contributions
ional-7.320
 Favor-7.212
inas-7.027
 deviation-6.870
Jar-6.842
Tokenoperation
Token ID27184
Feature activation+2.63e+00

Pos logit contributions
PN+1.510
anwhile+1.500
DR+1.427
hy+1.410
engineering+1.389
Neg logit contributions
ional-3.007
 Favor-2.944
inas-2.907
Jar-2.844
 deviation-2.810
Token with
Token ID351
Feature activation+4.07e-01

Pos logit contributions
anwhile+3.880
PN+3.825
DR+3.657
hy+3.643
engineering+3.505
Neg logit contributions
ional-4.090
 Favor-4.047
inas-4.007
Jar-3.865
 deviation-3.747
Token at
Token ID379
Feature activation-3.57e-01

Pos logit contributions
anwhile+0.705
PN+0.685
DR+0.663
hy+0.652
engineering+0.635
Neg logit contributions
ional-0.578
inas-0.549
 Favor-0.547
Jar-0.532
 deviation-0.517
Token least
Token ID1551
Feature activation-1.80e+00

Pos logit contributions
ional+0.431
 Favor+0.418
inas+0.409
Jar+0.399
 deviation+0.384
Neg logit contributions
anwhile-0.688
PN-0.672
DR-0.649
hy-0.644
engineering-0.628
Tokenfor
Token ID1640
Feature activation-1.07e+00

Pos logit contributions
anwhile+0.685
PN+0.655
DR+0.632
hy+0.628
engineering+0.603
Neg logit contributions
ional-0.636
inas-0.608
 Favor-0.608
Jar-0.589
 deviation-0.572
Token more
Token ID517
Feature activation+2.59e+00

Pos logit contributions
ional+1.563
 Favor+1.495
inas+1.484
Jar+1.450
 deviation+1.391
Neg logit contributions
anwhile-1.846
PN-1.766
DR-1.701
hy-1.696
estern-1.651
Token news
Token ID1705
Feature activation+1.38e+00

Pos logit contributions
anwhile+3.932
PN+3.785
DR+3.639
hy+3.637
engineering+3.457
Neg logit contributions
ional-4.096
 Favor-3.899
inas-3.893
Jar-3.876
 deviation-3.595
Token from
Token ID422
Feature activation+1.28e+00

Pos logit contributions
anwhile+2.320
PN+2.245
hy+2.173
DR+2.170
engineering+2.080
Neg logit contributions
ional-2.003
inas-1.910
 Favor-1.903
Jar-1.841
 deviation-1.780
Token G
Token ID402
Feature activation+1.56e+00

Pos logit contributions
anwhile+2.270
PN+2.190
DR+2.120
hy+2.096
estern+2.051
Neg logit contributions
ional-1.747
inas-1.651
 Favor-1.628
Jar-1.592
 deviation-1.528
Tokenio
Token ID952
Feature activation+4.64e+00

Pos logit contributions
PN+2.225
anwhile+2.224
DR+2.117
hy+2.064
estern+2.000
Neg logit contributions
ional-2.645
 Favor-2.572
inas-2.491
Jar-2.457
 deviation-2.436
Token Wolf
Token ID8662
Feature activation-6.85e-01

Pos logit contributions
anwhile+7.617
PN+7.598
hy+7.477
DR+7.440
engineering+7.121
Neg logit contributions
ional-6.324
 Favor-5.801
 deviation-5.717
inas-5.682
icularly-5.342
Token and
Token ID290
Feature activation+5.24e-01

Pos logit contributions
ional+1.022
 Favor+0.964
inas+0.960
Jar+0.944
 deviation+0.914
Neg logit contributions
anwhile-1.149
PN-1.118
hy-1.081
DR-1.081
estern-1.043
Token the
Token ID262
Feature activation+1.49e+00

Pos logit contributions
anwhile+0.924
PN+0.894
DR+0.866
hy+0.859
estern+0.832
Neg logit contributions
ional-0.725
inas-0.690
 Favor-0.685
Jar-0.670
 deviation-0.641
Token European
Token ID3427
Feature activation+2.39e-01

Pos logit contributions
anwhile+2.588
PN+2.510
DR+2.429
hy+2.405
estern+2.327
Neg logit contributions
ional-2.106
inas-1.985
 Favor-1.972
Jar-1.925
 deviation-1.836
Token Knights
Token ID15188
Feature activation-1.31e+00

Pos logit contributions
anwhile+0.414
PN+0.400
DR+0.386
hy+0.384
engineering+0.371
Neg logit contributions
ional-0.342
 Favor-0.324
inas-0.324
Jar-0.316
 deviation-0.302
Token free
Token ID1479
Feature activation+5.80e-01

Pos logit contributions
ional+1.685
inas+1.597
 Favor+1.583
Jar+1.545
 deviation+1.484
Neg logit contributions
anwhile-2.170
PN-2.102
DR-2.023
hy-2.013
estern-1.962
Token,
Token ID11
Feature activation-4.32e-02

Pos logit contributions
anwhile+0.976
PN+0.955
DR+0.922
hy+0.921
estern+0.887
Neg logit contributions
ional-0.837
inas-0.800
 Favor-0.781
Jar-0.769
 deviation-0.745
Token legal
Token ID2742
Feature activation+4.21e-01

Pos logit contributions
ional+0.061
inas+0.058
 Favor+0.057
Jar+0.056
 deviation+0.054
Neg logit contributions
anwhile-0.076
PN-0.074
DR-0.071
hy-0.070
estern-0.068
Token copies
Token ID9088
Feature activation+1.58e-01

Pos logit contributions
anwhile+0.719
PN+0.707
DR+0.684
hy+0.680
estern+0.656
Neg logit contributions
ional-0.600
inas-0.573
 Favor-0.565
Jar-0.550
 deviation-0.528
Token of
Token ID286
Feature activation+4.94e-01

Pos logit contributions
anwhile+0.278
PN+0.271
hy+0.260
DR+0.260
engineering+0.253
Neg logit contributions
ional-0.218
 Favor-0.208
inas-0.208
Jar-0.201
 deviation-0.195
Token e
Token ID304
Feature activation+4.62e+00

Pos logit contributions
anwhile+0.876
PN+0.854
DR+0.831
hy+0.818
estern+0.796
Neg logit contributions
ional-0.676
inas-0.640
 Favor-0.625
Jar-0.612
 deviation-0.594
Tokenbooks
Token ID12106
Feature activation+1.11e+00

Pos logit contributions
PN+5.625
anwhile+5.532
DR+5.434
hy+5.146
gypt+4.988
Neg logit contributions
ional-8.451
 Favor-7.857
inas-7.855
 deviation-7.832
icularly-7.724
Token.
Token ID13
Feature activation+7.04e-02

Pos logit contributions
anwhile+1.831
PN+1.771
DR+1.708
hy+1.706
estern+1.649
Neg logit contributions
ional-1.632
inas-1.561
 Favor-1.550
Jar-1.514
 deviation-1.478
Token Make
Token ID6889
Feature activation-1.41e+00

Pos logit contributions
anwhile+0.120
PN+0.117
DR+0.113
hy+0.112
estern+0.108
Neg logit contributions
ional-0.102
inas-0.097
 Favor-0.096
Jar-0.093
 deviation-0.091
Token sure
Token ID1654
Feature activation+4.13e-01

Pos logit contributions
ional+1.374
inas+1.281
 Favor+1.281
Jar+1.219
 deviation+1.158
Neg logit contributions
anwhile-3.076
PN-2.993
DR-2.903
hy-2.885
estern-2.833
Token your
Token ID534
Feature activation+1.02e-01

Pos logit contributions
anwhile+0.643
PN+0.623
DR+0.596
hy+0.593
estern+0.569
Neg logit contributions
ional-0.656
inas-0.634
 Favor-0.627
Jar-0.612
 deviation-0.594
Token in
Token ID287
Feature activation-1.91e-01

Pos logit contributions
ional+0.249
inas+0.239
 Favor+0.238
Jar+0.232
 deviation+0.221
Neg logit contributions
anwhile-0.304
PN-0.293
DR-0.284
hy-0.281
estern-0.272
Token rural
Token ID10016
Feature activation-1.80e-01

Pos logit contributions
ional+0.279
 Favor+0.267
inas+0.266
Jar+0.262
 deviation+0.249
Neg logit contributions
anwhile-0.326
PN-0.313
DR-0.303
hy-0.301
estern-0.293
Token regions
Token ID7652
Feature activation-4.87e-01

Pos logit contributions
ional+0.265
 Favor+0.253
inas+0.249
Jar+0.249
 deviation+0.235
Neg logit contributions
anwhile-0.300
PN-0.291
DR-0.282
hy-0.281
estern-0.274
Token with
Token ID351
Feature activation+4.17e-01

Pos logit contributions
ional+0.683
 Favor+0.654
inas+0.650
Jar+0.634
 deviation+0.608
Neg logit contributions
anwhile-0.854
PN-0.826
DR-0.795
hy-0.791
estern-0.770
Token limited
Token ID3614
Feature activation+1.11e-01

Pos logit contributions
anwhile+0.705
PN+0.680
DR+0.657
hy+0.654
estern+0.633
Neg logit contributions
ional-0.605
 Favor-0.583
inas-0.582
Jar-0.570
 deviation-0.538
Token access
Token ID1895
Feature activation+4.54e+00

Pos logit contributions
anwhile+0.154
PN+0.148
DR+0.141
hy+0.141
estern+0.134
Neg logit contributions
ional-0.193
 Favor-0.188
inas-0.187
Jar-0.184
 deviation-0.175
Token to
Token ID284
Feature activation+3.52e-02

Pos logit contributions
anwhile+5.276
PN+5.137
DR+4.824
hy+4.774
estern+4.454
Neg logit contributions
 Favor-8.394
ional-8.355
inas-8.244
Jar-8.100
 deviation-7.950
Token health
Token ID1535
Feature activation+2.80e+00

Pos logit contributions
anwhile+0.058
PN+0.057
hy+0.054
DR+0.054
estern+0.052
Neg logit contributions
ional-0.053
 Favor-0.050
inas-0.050
Jar-0.049
 deviation-0.047
Token care
Token ID1337
Feature activation+2.96e+00

Pos logit contributions
PN+3.210
hy+3.155
anwhile+3.135
DR+3.106
engineering+2.847
Neg logit contributions
ional-5.270
 Favor-5.206
inas-5.071
Jar-5.031
 deviation-4.797
Token.
Token ID13
Feature activation+2.84e-01

Pos logit contributions
PN+4.475
anwhile+4.463
DR+4.339
hy+4.299
engineering+4.037
Neg logit contributions
ional-4.510
 Favor-4.367
inas-4.322
Jar-4.244
 deviation-4.037
Token In
Token ID554
Feature activation-1.03e+00

Pos logit contributions
anwhile+0.487
PN+0.474
DR+0.459
hy+0.451
estern+0.439
Neg logit contributions
ional-0.410
inas-0.389
 Favor-0.384
Jar-0.375
 deviation-0.365
Tokenpu
Token ID19944
Feature activation-1.71e+00

Pos logit contributions
ional+0.804
 Favor+0.770
inas+0.764
Jar+0.745
 deviation+0.722
Neg logit contributions
anwhile-0.920
PN-0.889
DR-0.850
hy-0.849
estern-0.824
Token
Token ID134
Feature activation-8.70e-01

Pos logit contributions
ional+2.514
 Favor+2.400
inas+2.389
Jar+2.324
 deviation+2.243
Neg logit contributions
anwhile-2.904
PN-2.789
DR-2.672
hy-2.652
estern-2.597
Token
Token ID119
Feature activation-2.12e+00

Pos logit contributions
ional+1.131
 Favor+1.072
inas+1.067
Jar+1.034
 deviation+0.993
Neg logit contributions
anwhile-1.628
PN-1.576
DR-1.517
hy-1.510
estern-1.472
Tokenu
Token ID84
Feature activation-6.81e-01

Pos logit contributions
ional+3.578
inas+3.424
 Favor+3.404
Jar+3.322
 deviation+3.204
Neg logit contributions
anwhile-3.148
PN-2.984
DR-2.851
hy-2.801
estern-2.748
Token.
Token ID13
Feature activation-2.81e-02

Pos logit contributions
ional+0.976
 Favor+0.936
inas+0.926
Jar+0.903
 deviation+0.873
Neg logit contributions
anwhile-1.177
PN-1.138
DR-1.090
hy-1.088
estern-1.058
Token E
Token ID412
Feature activation+4.51e+00

Pos logit contributions
ional+0.042
inas+0.040
 Favor+0.040
Jar+0.039
 deviation+0.037
Neg logit contributions
anwhile-0.047
PN-0.046
DR-0.044
hy-0.044
estern-0.042
Token pen
Token ID3112
Feature activation+1.03e+00

Pos logit contributions
anwhile+6.399
hy+6.188
PN+6.129
DR+5.833
estern+5.736
Neg logit contributions
ional-7.001
 Favor-6.755
inas-6.657
Jar-6.547
 deviation-6.345
Tokena
Token ID64
Feature activation+8.26e-01

Pos logit contributions
anwhile+1.674
PN+1.608
hy+1.572
DR+1.537
estern+1.498
Neg logit contributions
ional-1.527
 Favor-1.502
inas-1.443
Jar-1.432
 deviation-1.405
Token mo
Token ID6941
Feature activation+1.91e+00

Pos logit contributions
anwhile+1.368
PN+1.325
hy+1.280
DR+1.263
estern+1.229
Neg logit contributions
ional-1.210
 Favor-1.177
inas-1.158
Jar-1.127
 deviation-1.101
Tokenek
Token ID988
Feature activation+1.70e+00

Pos logit contributions
anwhile+3.233
PN+3.111
hy+3.038
DR+2.975
estern+2.910
Neg logit contributions
ional-2.717
 Favor-2.616
inas-2.544
Jar-2.493
 deviation-2.461
Tokenahi
Token ID32810
Feature activation+9.68e-01

Pos logit contributions
anwhile+2.788
PN+2.684
hy+2.624
DR+2.546
estern+2.490
Neg logit contributions
ional-2.485
 Favor-2.438
inas-2.343
Jar-2.337
 deviation-2.291
Token to
Token ID284
Feature activation+2.54e-01

Pos logit contributions
anwhile+0.163
PN+0.156
DR+0.147
hy+0.147
estern+0.141
Neg logit contributions
ional-0.215
inas-0.208
 Favor-0.208
Jar-0.204
 deviation-0.196
Token help
Token ID1037
Feature activation+7.40e-01

Pos logit contributions
anwhile+0.438
PN+0.421
DR+0.405
hy+0.403
engineering+0.390
Neg logit contributions
ional-0.363
inas-0.347
 Favor-0.347
Jar-0.340
 deviation-0.323
Token our
Token ID674
Feature activation+4.13e-01

Pos logit contributions
anwhile+1.128
PN+1.086
DR+1.048
hy+1.032
engine+0.997
Neg logit contributions
ional-1.181
inas-1.137
Jar-1.131
 Favor-1.128
 deviation-1.066
Token millions
Token ID5242
Feature activation+1.67e-01

Pos logit contributions
anwhile+0.594
PN+0.576
DR+0.545
hy+0.539
estern+0.528
Neg logit contributions
ional-0.701
inas-0.674
 Favor-0.668
Jar-0.665
 deviation-0.637
Token of
Token ID286
Feature activation+9.55e-01

Pos logit contributions
anwhile+0.305
PN+0.295
DR+0.285
hy+0.283
estern+0.276
Neg logit contributions
ional-0.222
 Favor-0.213
inas-0.211
Jar-0.205
 deviation-0.198
Token e
Token ID304
Feature activation+4.49e+00

Pos logit contributions
anwhile+1.329
PN+1.304
DR+1.235
hy+1.230
estern+1.178
Neg logit contributions
ional-1.633
inas-1.566
 Favor-1.556
Jar-1.553
 deviation-1.488
Token-
Token ID12
Feature activation+1.13e+00

Pos logit contributions
anwhile+6.311
PN+6.217
DR+6.139
hy+5.894
engine+5.688
Neg logit contributions
ional-6.990
 Favor-6.875
inas-6.643
 deviation-6.625
icularly-6.546
Tokennews
Token ID10827
Feature activation+1.61e+00

Pos logit contributions
anwhile+1.468
PN+1.426
DR+1.364
hy+1.353
engineering+1.298
Neg logit contributions
ional-2.070
inas-1.988
 Favor-1.988
Jar-1.940
 deviation-1.926
Tokenletter
Token ID9291
Feature activation+6.76e-01

Pos logit contributions
anwhile+2.197
PN+2.089
hy+1.987
DR+1.969
engine+1.902
Neg logit contributions
ional-2.803
 Favor-2.739
inas-2.711
Jar-2.698
 deviation-2.640
Token subscribers
Token ID18327
Feature activation-5.87e-01

Pos logit contributions
anwhile+0.983
PN+0.952
DR+0.907
hy+0.901
estern+0.872
Neg logit contributions
ional-1.137
inas-1.083
 Favor-1.081
Jar-1.069
 deviation-1.034
Token and
Token ID290
Feature activation-7.41e-01

Pos logit contributions
ional+0.848
 Favor+0.809
inas+0.808
Jar+0.788
 deviation+0.757
Neg logit contributions
anwhile-1.011
PN-0.971
DR-0.934
hy-0.928
estern-0.903
Token the
Token ID262
Feature activation+6.76e-01

Pos logit contributions
ional+0.752
inas+0.710
 Favor+0.707
Jar+0.681
 deviation+0.653
Neg logit contributions
anwhile-1.222
PN-1.185
DR-1.148
hy-1.141
estern-1.116
Token child
Token ID1200
Feature activation-9.39e-01

Pos logit contributions
anwhile+1.174
PN+1.133
DR+1.092
hy+1.081
estern+1.055
Neg logit contributions
ional-0.962
inas-0.909
 Favor-0.900
Jar-0.886
 deviation-0.832
Token the
Token ID262
Feature activation+5.68e-01

Pos logit contributions
ional+1.321
inas+1.258
 Favor+1.253
Jar+1.222
 deviation+1.171
Neg logit contributions
anwhile-1.644
PN-1.592
DR-1.533
hy-1.527
estern-1.485
Token name
Token ID1438
Feature activation-1.53e+00

Pos logit contributions
anwhile+1.034
PN+1.009
DR+0.977
hy+0.973
estern+0.941
Neg logit contributions
ional-0.740
inas-0.710
 Favor-0.684
Jar-0.683
 deviation-0.645
Token M
Token ID337
Feature activation+1.49e+00

Pos logit contributions
ional+2.067
 Favor+1.961
inas+1.956
Jar+1.895
 deviation+1.824
Neg logit contributions
anwhile-2.789
PN-2.690
DR-2.586
hy-2.572
estern-2.513
Tokenö
Token ID9101
Feature activation+4.45e+00

Pos logit contributions
anwhile+2.453
PN+2.381
hy+2.333
DR+2.320
estern+2.235
Neg logit contributions
ional-2.150
 Favor-2.148
inas-2.076
Jar-2.027
 deviation-2.019
Tokenng
Token ID782
Feature activation+2.10e+00

Pos logit contributions
anwhile+5.962
hy+5.840
PN+5.765
DR+5.662
gypt+5.484
Neg logit contributions
ional-7.228
 Favor-7.206
Jar-6.892
 deviation-6.860
inas-6.806
Tokenke
Token ID365
Feature activation+9.69e-01

Pos logit contributions
anwhile+3.267
PN+3.154
hy+3.102
DR+3.074
estern+2.970
Neg logit contributions
ional-3.206
 Favor-3.134
inas-3.060
Jar-3.011
 deviation-2.951
Token (
Token ID357
Feature activation+9.44e-01

Pos logit contributions
anwhile+1.541
PN+1.501
hy+1.457
DR+1.443
estern+1.381
Neg logit contributions
ional-1.499
 Favor-1.433
inas-1.421
Jar-1.395
 deviation-1.344
Tokenmeaning
Token ID24815
Feature activation-2.34e+00

Pos logit contributions
anwhile+1.424
PN+1.405
DR+1.356
hy+1.348
engineering+1.298
Neg logit contributions
ional-1.522
 Favor-1.463
inas-1.460
Jar-1.395
 deviation-1.393
Token "
Token ID366
Feature activation+6.34e-02

Pos logit contributions
ional+3.158
 Favor+3.016
inas+2.973
Jar+2.889
 deviation+2.793
Neg logit contributions
anwhile-4.260
PN-4.108
DR-3.941
hy-3.908
estern-3.846
Token a
Token ID257
Feature activation+2.22e-01

Pos logit contributions
ional+1.740
 Favor+1.642
inas+1.625
Jar+1.588
 deviation+1.525
Neg logit contributions
anwhile-2.591
PN-2.421
DR-2.329
hy-2.289
 destro-2.272
Token marvelous
Token ID43937
Feature activation+5.75e-01

Pos logit contributions
anwhile+0.355
PN+0.336
DR+0.320
hy+0.318
estern+0.311
Neg logit contributions
ional-0.356
 Favor-0.340
inas-0.338
Jar-0.330
 deviation-0.321
Token framework
Token ID9355
Feature activation+1.91e+00

Pos logit contributions
anwhile+0.906
PN+0.877
DR+0.838
hy+0.836
estern+0.812
Neg logit contributions
ional-0.905
 Favor-0.867
inas-0.867
Jar-0.844
 deviation-0.814
Token for
Token ID329
Feature activation-2.22e-01

Pos logit contributions
PN+2.910
anwhile+2.889
DR+2.822
hy+2.811
estern+2.674
Neg logit contributions
ional-2.923
inas-2.841
 Favor-2.830
Jar-2.765
 deviation-2.633
Token modern
Token ID3660
Feature activation+1.68e+00

Pos logit contributions
ional+0.406
 Favor+0.391
inas+0.390
Jar+0.381
 deviation+0.371
Neg logit contributions
anwhile-0.299
PN-0.283
DR-0.268
hy-0.265
estern-0.259
Token web
Token ID3992
Feature activation+4.44e+00

Pos logit contributions
PN+1.982
anwhile+1.972
hy+1.895
DR+1.870
engineering+1.787
Neg logit contributions
ional-3.184
inas-3.079
 Favor-3.069
Jar-3.049
 deviation-2.879
Token applications
Token ID5479
Feature activation+1.74e+00

Pos logit contributions
PN+4.131
hy+3.792
anwhile+3.704
DR+3.704
engineering+3.594
Neg logit contributions
ional-9.035
Jar-8.934
 Favor-8.793
inas-8.665
tle-8.398
Token.
Token ID13
Feature activation+2.51e-01

Pos logit contributions
PN+2.698
anwhile+2.679
DR+2.619
hy+2.584
engineering+2.509
Neg logit contributions
ional-2.644
inas-2.545
 Favor-2.522
Jar-2.454
 deviation-2.379
Token But
Token ID887
Feature activation-4.27e-01

Pos logit contributions
anwhile+0.451
PN+0.440
DR+0.425
hy+0.418
engineering+0.410
Neg logit contributions
ional-0.342
inas-0.325
 Favor-0.317
Jar-0.309
 deviation-0.299
Token as
Token ID355
Feature activation+2.98e+00

Pos logit contributions
ional+0.555
 Favor+0.526
inas+0.523
Jar+0.506
 deviation+0.486
Neg logit contributions
anwhile-0.808
PN-0.776
DR-0.744
hy-0.734
estern-0.727
Token with
Token ID351
Feature activation+1.20e+00

Pos logit contributions
PN+4.226
hy+4.138
DR+4.087
anwhile+4.053
estern+3.838
Neg logit contributions
ional-4.807
inas-4.666
 Favor-4.593
Jar-4.467
 deviation-4.344
Token.
Token ID13
Feature activation+3.55e-01

Pos logit contributions
anwhile+2.898
PN+2.820
hy+2.760
DR+2.734
engineering+2.619
Neg logit contributions
ional-2.640
 Favor-2.561
inas-2.550
Jar-2.471
 deviation-2.410
Token
Token ID447
Feature activation+2.82e-01

Pos logit contributions
anwhile+0.565
PN+0.558
DR+0.536
hy+0.528
engineering+0.505
Neg logit contributions
ional-0.553
inas-0.525
 Favor-0.516
Jar-0.506
 deviation-0.497
Token
Token ID251
Feature activation-8.19e-01

Pos logit contributions
anwhile+0.426
PN+0.411
DR+0.394
hy+0.391
engineering+0.378
Neg logit contributions
ional-0.464
inas-0.442
 Favor-0.441
Jar-0.432
 deviation-0.418
Token<|endoftext|>
Token ID50256
Feature activation-5.21e-01

Pos logit contributions
ional+1.232
inas+1.173
 Favor+1.168
Jar+1.137
 deviation+1.104
Neg logit contributions
anwhile-1.364
PN-1.317
DR-1.268
hy-1.253
estern-1.217
TokenPower
Token ID13434
Feature activation-2.26e+00

Pos logit contributions
ional+0.832
 Favor+0.803
inas+0.794
Jar+0.766
 deviation+0.756
Neg logit contributions
anwhile-0.806
PN-0.795
DR-0.761
hy-0.752
engineering-0.729
Token BI
Token ID20068
Feature activation+4.42e+00

Pos logit contributions
ional+3.575
 Favor+3.429
inas+3.408
Jar+3.330
 deviation+3.226
Neg logit contributions
anwhile-3.589
PN-3.421
DR-3.269
hy-3.242
estern-3.180
Token has
Token ID468
Feature activation-4.96e-02

Pos logit contributions
anwhile+7.265
PN+7.178
DR+7.067
hy+6.797
engine+6.653
Neg logit contributions
ional-6.043
inas-5.662
Jar-5.451
 Favor-5.346
 deviation-5.176
Token come
Token ID1282
Feature activation+8.49e-02

Pos logit contributions
ional+0.083
 Favor+0.080
inas+0.080
Jar+0.078
 deviation+0.075
Neg logit contributions
anwhile-0.074
PN-0.071
DR-0.068
hy-0.067
estern-0.065
Token a
Token ID257
Feature activation-6.62e-02

Pos logit contributions
anwhile+0.145
PN+0.140
hy+0.134
DR+0.133
estern+0.130
Neg logit contributions
ional-0.124
 Favor-0.119
inas-0.118
Jar-0.115
 deviation-0.111
Token long
Token ID890
Feature activation+4.12e-01

Pos logit contributions
ional+0.111
 Favor+0.107
inas+0.107
Jar+0.104
 deviation+0.101
Neg logit contributions
anwhile-0.098
PN-0.094
DR-0.090
hy-0.090
estern-0.087
Token way
Token ID835
Feature activation+2.50e+00

Pos logit contributions
anwhile+0.675
PN+0.648
DR+0.628
hy+0.622
estern+0.610
Neg logit contributions
ional-0.624
inas-0.600
 Favor-0.600
Jar-0.580
 deviation-0.555
Token diagnosis
Token ID13669
Feature activation-3.71e-01

Pos logit contributions
anwhile+1.137
PN+1.092
hy+1.052
DR+1.048
estern+0.991
Neg logit contributions
ional-1.286
 Favor-1.267
inas-1.253
Jar-1.236
 deviation-1.169
Token and
Token ID290
Feature activation-1.78e-01

Pos logit contributions
ional+0.498
 Favor+0.483
inas+0.478
Jar+0.464
 deviation+0.440
Neg logit contributions
anwhile-0.672
PN-0.650
DR-0.626
hy-0.624
estern-0.606
Token treatment
Token ID3513
Feature activation+9.23e-01

Pos logit contributions
ional+0.311
inas+0.302
 Favor+0.302
Jar+0.296
 deviation+0.280
Neg logit contributions
anwhile-0.251
PN-0.239
hy-0.228
DR-0.228
engineering-0.219
Token,
Token ID11
Feature activation-1.20e-01

Pos logit contributions
anwhile+1.560
PN+1.515
DR+1.468
hy+1.466
engineering+1.410
Neg logit contributions
ional-1.314
 Favor-1.292
inas-1.280
Jar-1.242
 deviation-1.164
Token and
Token ID290
Feature activation-2.08e-02

Pos logit contributions
ional+0.169
 Favor+0.162
inas+0.162
Jar+0.158
 deviation+0.149
Neg logit contributions
anwhile-0.212
PN-0.204
hy-0.196
DR-0.196
estern-0.189
Token access
Token ID1895
Feature activation+4.38e+00

Pos logit contributions
ional+0.032
inas+0.031
 Favor+0.031
Jar+0.030
 deviation+0.029
Neg logit contributions
anwhile-0.033
PN-0.032
hy-0.031
DR-0.031
estern-0.029
Token to
Token ID284
Feature activation+6.31e-01

Pos logit contributions
anwhile+5.597
PN+5.417
hy+5.119
DR+5.081
engineering+4.818
Neg logit contributions
 Favor-7.680
inas-7.533
ional-7.485
Jar-7.372
 deviation-7.126
Token information
Token ID1321
Feature activation+2.58e+00

Pos logit contributions
anwhile+0.978
PN+0.949
hy+0.912
DR+0.908
engineering+0.866
Neg logit contributions
ional-1.005
 Favor-0.966
inas-0.963
Jar-0.949
 deviation-0.896
Token.[
Token ID3693
Feature activation+2.18e+00

Pos logit contributions
anwhile+4.079
PN+3.962
hy+3.871
DR+3.855
engineering+3.677
Neg logit contributions
ional-3.823
 Favor-3.786
inas-3.743
Jar-3.644
 deviation-3.465
Token109
Token ID14454
Feature activation+1.50e+00

Pos logit contributions
anwhile+2.672
PN+2.577
hy+2.486
DR+2.479
engineering+2.331
Neg logit contributions
ional-4.168
 Favor-4.026
inas-3.979
Jar-3.853
 deviation-3.805
Token]
Token ID60
Feature activation+4.98e-01

Pos logit contributions
anwhile+2.453
PN+2.396
DR+2.327
hy+2.320
engineering+2.216
Neg logit contributions
ional-2.241
inas-2.150
 Favor-2.111
Jar-2.070
 deviation-1.977
Token schools
Token ID4266
Feature activation-1.68e-01

Pos logit contributions
ional+2.526
 Favor+2.399
inas+2.383
Jar+2.316
 deviation+2.220
Neg logit contributions
anwhile-3.551
PN-3.425
DR-3.294
hy-3.276
estern-3.208
Token and
Token ID290
Feature activation-2.77e-01

Pos logit contributions
ional+0.222
 Favor+0.215
inas+0.211
Jar+0.206
 deviation+0.199
Neg logit contributions
anwhile-0.304
PN-0.295
DR-0.287
hy-0.286
engineering-0.276
Token playing
Token ID2712
Feature activation-1.12e-01

Pos logit contributions
ional+0.401
 Favor+0.383
inas+0.380
Jar+0.374
 deviation+0.354
Neg logit contributions
anwhile-0.476
PN-0.454
DR-0.439
hy-0.437
engineering-0.423
Token fields
Token ID7032
Feature activation-5.25e-01

Pos logit contributions
ional+0.172
inas+0.167
 Favor+0.165
Jar+0.162
 deviation+0.156
Neg logit contributions
anwhile-0.178
PN-0.172
DR-0.167
hy-0.167
engineering-0.159
Token,"
Token ID553
Feature activation+1.95e+00

Pos logit contributions
ional+0.685
 Favor+0.658
inas+0.648
Jar+0.631
 deviation+0.607
Neg logit contributions
anwhile-0.970
PN-0.936
hy-0.906
DR-0.906
engineering-0.876
Token HR
Token ID15172
Feature activation+4.37e+00

Pos logit contributions
anwhile+2.980
PN+2.849
DR+2.786
hy+2.766
ITNESS+2.619
Neg logit contributions
ional-3.134
inas-3.023
 Favor-2.987
Jar-2.921
 deviation-2.840
TokenW
Token ID54
Feature activation+3.32e+00

Pos logit contributions
PN+5.087
DR+4.941
hy+4.429
engineering+4.163
ITNESS+4.128
Neg logit contributions
ional-8.430
 Favor-8.101
inas-8.030
icularly-7.867
 deviation-7.809
Token researcher
Token ID13453
Feature activation+2.60e+00

Pos logit contributions
anwhile+4.953
PN+4.765
DR+4.747
hy+4.557
estern+4.359
Neg logit contributions
ional-5.094
 Favor-4.988
inas-4.942
Jar-4.821
 deviation-4.622
Token La
Token ID4689
Feature activation+1.46e+00

Pos logit contributions
anwhile+4.428
PN+4.360
hy+4.303
DR+4.255
engineering+4.044
Neg logit contributions
ional-3.448
inas-3.279
 Favor-3.229
Jar-3.123
 deviation-3.111
Tokenet
Token ID316
Feature activation+9.12e-01

Pos logit contributions
anwhile+2.335
PN+2.285
hy+2.229
DR+2.217
estern+2.116
Neg logit contributions
ional-2.197
 Favor-2.118
inas-2.071
 deviation-2.043
Jar-2.007
Tokenitia
Token ID36723
Feature activation+1.04e+00

Pos logit contributions
anwhile+1.636
PN+1.582
hy+1.559
DR+1.552
engineering+1.478
Neg logit contributions
ional-1.216
 Favor-1.169
 deviation-1.122
inas-1.122
Jar-1.114
Token front
Token ID2166
Feature activation-1.29e+00

Pos logit contributions
ional+1.304
 Favor+1.228
inas+1.227
Jar+1.200
 deviation+1.147
Neg logit contributions
anwhile-1.695
PN-1.636
DR-1.581
hy-1.571
estern-1.527
Token of
Token ID286
Feature activation-2.50e-01

Pos logit contributions
ional+1.602
 Favor+1.513
inas+1.505
Jar+1.455
 deviation+1.397
Neg logit contributions
anwhile-2.514
PN-2.395
DR-2.304
hy-2.281
estern-2.262
Token a
Token ID257
Feature activation-2.43e-01

Pos logit contributions
ional+0.361
 Favor+0.346
inas+0.344
Jar+0.335
 deviation+0.322
Neg logit contributions
anwhile-0.429
PN-0.414
DR-0.401
hy-0.397
estern-0.385
Token bull
Token ID6473
Feature activation-2.30e+00

Pos logit contributions
ional+0.250
 Favor+0.235
inas+0.233
Jar+0.225
 deviation+0.209
Neg logit contributions
anwhile-0.519
PN-0.504
DR-0.490
hy-0.489
estern-0.476
Token.
Token ID13
Feature activation+4.20e-01

Pos logit contributions
ional+3.230
 Favor+3.076
inas+3.060
Jar+2.974
 deviation+2.865
Neg logit contributions
anwhile-4.054
PN-3.900
DR-3.742
hy-3.717
estern-3.644
Token http
Token ID2638
Feature activation+4.37e+00

Pos logit contributions
PN+0.727
anwhile+0.726
DR+0.698
hy+0.676
ITNESS+0.655
Neg logit contributions
ional-0.590
inas-0.565
 Favor-0.562
Jar-0.533
 deviation-0.525
Token://
Token ID1378
Feature activation+2.05e+00

Pos logit contributions
PN+5.865
DR+5.640
hy+5.534
../+5.198
engineering+5.183
Neg logit contributions
ional-7.299
 Favor-7.110
inas-7.017
 deviation-6.786
icularly-6.752
Tokence
Token ID344
Feature activation+1.59e+00

Pos logit contributions
anwhile+2.505
PN+2.424
hy+2.358
DR+2.328
engineering+2.249
Neg logit contributions
ional-3.838
 Favor-3.802
inas-3.682
Jar-3.653
 deviation-3.590
Tokenpr
Token ID1050
Feature activation-9.63e-01

Pos logit contributions
anwhile+2.236
PN+2.138
hy+2.069
DR+2.009
engineering+1.981
Neg logit contributions
ional-2.726
 Favor-2.713
inas-2.588
Jar-2.580
 deviation-2.553
Token.
Token ID13
Feature activation-4.19e-01

Pos logit contributions
ional+1.471
 Favor+1.421
inas+1.399
Jar+1.372
 deviation+1.331
Neg logit contributions
anwhile-1.581
PN-1.518
hy-1.452
DR-1.451
engineering-1.409
Tokennet
Token ID3262
Feature activation+7.80e-02

Pos logit contributions
ional+0.836
 Favor+0.815
inas+0.805
Jar+0.792
 deviation+0.775
Neg logit contributions
anwhile-0.490
PN-0.465
DR-0.436
hy-0.436
engineering-0.418
Tokenolla
Token ID33011
Feature activation+2.37e+00

Pos logit contributions
PN+2.465
anwhile+2.311
DR+2.264
hy+2.189
engineering+2.054
Neg logit contributions
ional-6.348
 Favor-6.263
inas-6.065
Jar-5.985
 deviation-5.957
Token is
Token ID318
Feature activation+1.01e+00

Pos logit contributions
anwhile+4.050
PN+3.926
DR+3.849
hy+3.742
engineering+3.682
Neg logit contributions
ional-3.277
inas-3.127
 Favor-3.091
Jar-3.029
 deviation-2.918
Token also
Token ID635
Feature activation+5.66e-01

Pos logit contributions
anwhile+1.526
PN+1.478
hy+1.411
DR+1.406
engineering+1.361
Neg logit contributions
ional-1.638
 Favor-1.577
inas-1.573
Jar-1.546
 deviation-1.487
Token negotiating
Token ID19025
Feature activation+9.72e-01

Pos logit contributions
anwhile+0.898
PN+0.870
DR+0.831
hy+0.830
engineering+0.805
Neg logit contributions
ional-0.881
 Favor-0.847
inas-0.842
Jar-0.828
 deviation-0.795
Token new
Token ID649
Feature activation+1.16e+00

Pos logit contributions
anwhile+1.549
PN+1.507
DR+1.465
hy+1.448
acqu+1.388
Neg logit contributions
ional-1.460
inas-1.438
 Favor-1.414
Jar-1.381
 deviation-1.314
Token co
Token ID763
Feature activation+4.34e+00

Pos logit contributions
anwhile+1.819
PN+1.776
DR+1.711
hy+1.687
engineering+1.638
Neg logit contributions
ional-1.813
inas-1.738
 Favor-1.734
Jar-1.697
 deviation-1.608
Token-
Token ID12
Feature activation+1.18e+00

Pos logit contributions
PN+6.175
anwhile+6.062
DR+5.968
hy+5.873
engineering+5.763
Neg logit contributions
ional-6.756
 Favor-6.673
inas-6.582
 deviation-6.399
Jar-6.304
Tokenoper
Token ID3575
Feature activation+1.71e-02

Pos logit contributions
anwhile+1.145
PN+1.131
DR+1.063
hy+1.048
engineering+1.024
Neg logit contributions
ional-2.533
 Favor-2.475
inas-2.450
Jar-2.396
 deviation-2.370
Tokenations
Token ID602
Feature activation+1.85e+00

Pos logit contributions
anwhile+0.029
PN+0.028
hy+0.027
DR+0.027
engineering+0.026
Neg logit contributions
ional-0.025
 Favor-0.024
inas-0.023
Jar-0.023
 deviation-0.022
Token in
Token ID287
Feature activation-6.33e-01

Pos logit contributions
anwhile+2.835
PN+2.788
hy+2.670
DR+2.670
engineering+2.564
Neg logit contributions
ional-2.834
 Favor-2.784
inas-2.770
Jar-2.671
 deviation-2.611
Token relation
Token ID8695
Feature activation+1.52e+00

Pos logit contributions
ional+0.883
inas+0.837
 Favor+0.834
Jar+0.816
 deviation+0.779
Neg logit contributions
anwhile-1.122
PN-1.085
DR-1.047
hy-1.036
engineering-1.010
Tokeniki
Token ID5580
Feature activation+1.84e+00

Pos logit contributions
anwhile+5.158
PN+5.056
hy+4.936
DR+4.793
estern+4.729
Neg logit contributions
 Favor-4.557
ional-4.488
 deviation-4.273
Jar-4.260
inas-4.198
Token m
Token ID285
Feature activation+1.42e+00

Pos logit contributions
anwhile+3.026
PN+2.956
hy+2.873
DR+2.817
engineering+2.721
Neg logit contributions
ional-2.658
 Favor-2.585
inas-2.537
Jar-2.482
 deviation-2.426
Tokenai
Token ID1872
Feature activation+1.65e+00

Pos logit contributions
anwhile+2.369
PN+2.280
hy+2.230
DR+2.188
estern+2.137
Neg logit contributions
ional-2.053
 Favor-2.030
inas-1.943
Jar-1.940
 deviation-1.921
Token ana
Token ID44763
Feature activation+9.70e-01

Pos logit contributions
anwhile+2.739
PN+2.666
hy+2.603
DR+2.542
estern+2.462
Neg logit contributions
ional-2.366
 Favor-2.299
inas-2.239
Jar-2.200
 deviation-2.155
Token.
Token ID13
Feature activation+9.00e-01

Pos logit contributions
anwhile+1.618
PN+1.565
hy+1.517
DR+1.501
engineering+1.453
Neg logit contributions
ional-1.414
 Favor-1.364
inas-1.345
Jar-1.316
 deviation-1.283
Token E
Token ID412
Feature activation+4.34e+00

Pos logit contributions
anwhile+1.473
PN+1.444
DR+1.386
hy+1.385
estern+1.334
Neg logit contributions
ional-1.338
inas-1.272
 Favor-1.254
Jar-1.229
 deviation-1.205
Token p
Token ID279
Feature activation+9.22e-01

Pos logit contributions
anwhile+6.868
PN+6.719
hy+6.637
DR+6.417
estern+6.242
Neg logit contributions
ional-6.018
 Favor-5.758
inas-5.683
Jar-5.538
 deviation-5.389
Tokenau
Token ID559
Feature activation+2.49e+00

Pos logit contributions
anwhile+1.506
PN+1.447
hy+1.398
DR+1.383
estern+1.349
Neg logit contributions
ional-1.385
 Favor-1.371
inas-1.321
Jar-1.308
 deviation-1.284
Token ka
Token ID38387
Feature activation+1.60e+00

Pos logit contributions
anwhile+3.985
PN+3.871
hy+3.827
DR+3.689
estern+3.568
Neg logit contributions
ional-3.642
 Favor-3.575
inas-3.449
Jar-3.394
 deviation-3.331
Token p
Token ID279
Feature activation-4.22e-01

Pos logit contributions
anwhile+2.601
PN+2.530
hy+2.447
DR+2.411
estern+2.337
Neg logit contributions
ional-2.376
 Favor-2.282
inas-2.260
Jar-2.197
 deviation-2.139
Tokenā
Token ID10235
Feature activation-2.00e-02

Pos logit contributions
ional+0.652
 Favor+0.631
inas+0.621
Jar+0.609
 deviation+0.592
Neg logit contributions
anwhile-0.685
PN-0.658
DR-0.628
hy-0.627
estern-0.610
Tokengorith
Token ID7727
Feature activation+1.14e+00

Pos logit contributions
anwhile+0.010
PN+0.010
DR+0.009
hy+0.009
engineering+0.009
Neg logit contributions
ional-0.020
 Favor-0.019
inas-0.019
Jar-0.019
 deviation-0.019
Tokenms
Token ID907
Feature activation+1.71e-01

Pos logit contributions
PN+1.294
anwhile+1.286
hy+1.226
DR+1.222
engineering+1.190
Neg logit contributions
ional-2.214
 Favor-2.183
Jar-2.123
 deviation-2.100
inas-2.084
Token and
Token ID290
Feature activation-4.29e-01

Pos logit contributions
anwhile+0.300
PN+0.290
DR+0.279
hy+0.279
engineering+0.271
Neg logit contributions
ional-0.238
 Favor-0.229
inas-0.227
Jar-0.221
 deviation-0.211
Token more
Token ID517
Feature activation+1.84e+00

Pos logit contributions
ional+0.643
 Favor+0.615
inas+0.612
Jar+0.595
 deviation+0.575
Neg logit contributions
anwhile-0.720
PN-0.691
DR-0.662
hy-0.657
estern-0.643
Token about
Token ID546
Feature activation+1.10e+00

Pos logit contributions
anwhile+2.925
PN+2.818
hy+2.737
DR+2.705
engineering+2.616
Neg logit contributions
ional-2.782
 Favor-2.673
inas-2.666
Jar-2.646
 deviation-2.508
Token web
Token ID3992
Feature activation+4.33e+00

Pos logit contributions
anwhile+1.806
PN+1.758
DR+1.688
hy+1.681
engineering+1.636
Neg logit contributions
ional-1.625
inas-1.549
 Favor-1.541
Jar-1.531
icularly-1.430
Token development
Token ID2478
Feature activation+2.55e+00

Pos logit contributions
PN+6.037
anwhile+6.019
DR+5.822
engineering+5.671
hy+5.669
Neg logit contributions
ional-7.064
 Favor-6.700
inas-6.524
Jar-6.432
icularly-6.215
Token and
Token ID290
Feature activation-1.32e-01

Pos logit contributions
anwhile+4.007
PN+3.929
DR+3.795
hy+3.793
engineering+3.687
Neg logit contributions
ional-3.806
 Favor-3.661
inas-3.629
Jar-3.585
 deviation-3.385
Token databases
Token ID20083
Feature activation+5.25e-01

Pos logit contributions
ional+0.216
 Favor+0.207
inas+0.206
Jar+0.202
 deviation+0.194
Neg logit contributions
anwhile-0.202
PN-0.194
DR-0.185
hy-0.184
engineering-0.179
Token.
Token ID13
Feature activation-6.07e-03

Pos logit contributions
anwhile+0.890
PN+0.862
DR+0.831
hy+0.827
engineering+0.810
Neg logit contributions
ional-0.762
 Favor-0.731
inas-0.724
Jar-0.709
 deviation-0.677
Token\n
Token ID198
Feature activation+5.66e-01

Pos logit contributions
ional+0.009
inas+0.009
 Favor+0.009
Jar+0.008
 deviation+0.008
Neg logit contributions
anwhile-0.010
PN-0.010
DR-0.009
hy-0.009
engineering-0.009
Token [
Token ID685
Feature activation+2.64e+00

Pos logit contributions
anwhile+1.043
PN+1.004
DR+0.970
hy+0.968
engineering+0.929
Neg logit contributions
ional-0.795
 Favor-0.759
inas-0.750
Jar-0.731
 deviation-0.698
Token10
Token ID940
Feature activation+1.76e+00

Pos logit contributions
PN+2.821
DR+2.648
hy+2.623
anwhile+2.491
engineering+2.461
Neg logit contributions
ional-5.349
 Favor-5.251
inas-5.204
Jar-5.029
 deviation-5.025
Token:
Token ID25
Feature activation+6.26e-01

Pos logit contributions
PN+2.891
DR+2.809
hy+2.784
anwhile+2.733
engineering+2.672
Neg logit contributions
ional-2.504
 Favor-2.417
inas-2.387
Jar-2.316
 deviation-2.264
Token47
Token ID2857
Feature activation-7.51e-01

Pos logit contributions
anwhile+0.850
PN+0.810
DR+0.773
hy+0.760
estern+0.738
Neg logit contributions
ional-1.132
 Favor-1.090
inas-1.084
Jar-1.057
 deviation-1.036
Token]
Token ID60
Feature activation+1.25e+00

Pos logit contributions
ional+0.980
 Favor+0.929
inas+0.927
Jar+0.898
 deviation+0.859
Neg logit contributions
anwhile-1.421
PN-1.352
DR-1.298
hy-1.290
estern-1.272
Token <@
Token ID32406
Feature activation+4.32e+00

Pos logit contributions
anwhile+1.945
PN+1.863
DR+1.780
hy+1.769
estern+1.718
Neg logit contributions
ional-2.024
 Favor-1.928
inas-1.920
Jar-1.879
 deviation-1.818
Tokent
Token ID83
Feature activation-1.08e+00

Pos logit contributions
PN+6.221
hy+5.944
DR+5.865
engineering+5.602
anwhile+5.536
Neg logit contributions
 Favor-7.129
ional-6.746
 deviation-6.585
inas-6.555
icularly-6.436
Tokenay
Token ID323
Feature activation-5.21e-01

Pos logit contributions
ional+1.576
 Favor+1.541
inas+1.494
Jar+1.474
 deviation+1.438
Neg logit contributions
anwhile-1.820
PN-1.758
DR-1.694
hy-1.691
engineering-1.646
Tokenl
Token ID75
Feature activation+5.91e-01

Pos logit contributions
ional+0.882
 Favor+0.855
inas+0.842
Jar+0.825
 deviation+0.808
Neg logit contributions
anwhile-0.768
PN-0.738
DR-0.702
hy-0.702
engineering-0.678
Tokenort
Token ID419
Feature activation+1.72e-01

Pos logit contributions
anwhile+0.845
PN+0.814
hy+0.781
DR+0.781
engineering+0.750
Neg logit contributions
ional-1.009
 Favor-0.994
inas-0.962
Jar-0.952
 deviation-0.940
Tokenot
Token ID313
Feature activation-2.59e+00

Pos logit contributions
anwhile+0.265
PN+0.257
hy+0.248
DR+0.247
estern+0.238
Neg logit contributions
ional-0.275
 Favor-0.270
inas-0.261
Jar-0.257
 deviation-0.253
Token a
Token ID257
Feature activation+3.03e-02

Pos logit contributions
ional+2.406
 Favor+2.305
inas+2.285
Jar+2.230
 deviation+2.161
Neg logit contributions
anwhile-2.556
PN-2.446
DR-2.337
hy-2.313
estern-2.270
Token marvelous
Token ID43937
Feature activation-4.93e-01

Pos logit contributions
anwhile+0.048
PN+0.046
DR+0.044
hy+0.044
engineering+0.042
Neg logit contributions
ional-0.048
inas-0.046
 Favor-0.046
Jar-0.045
 deviation-0.043
Token framework
Token ID9355
Feature activation+1.59e+00

Pos logit contributions
ional+0.826
inas+0.789
 Favor+0.789
Jar+0.770
 deviation+0.740
Neg logit contributions
anwhile-0.732
PN-0.704
DR-0.675
hy-0.670
engineering-0.648
Token for
Token ID329
Feature activation-5.43e-01

Pos logit contributions
anwhile+2.649
PN+2.582
DR+2.521
hy+2.493
engineering+2.411
Neg logit contributions
ional-2.269
inas-2.226
 Favor-2.186
Jar-2.118
 deviation-2.028
Token modern
Token ID3660
Feature activation+1.24e+00

Pos logit contributions
ional+0.924
inas+0.887
 Favor+0.886
Jar+0.867
 deviation+0.831
Neg logit contributions
anwhile-0.792
PN-0.761
DR-0.727
hy-0.719
engineering-0.699
Token web
Token ID3992
Feature activation+4.32e+00

Pos logit contributions
anwhile+1.626
PN+1.579
DR+1.525
hy+1.510
engineering+1.471
Neg logit contributions
ional-2.202
inas-2.119
 Favor-2.093
Jar-2.092
tle-1.970
Token applications
Token ID5479
Feature activation+1.42e+00

Pos logit contributions
PN+5.017
anwhile+5.001
DR+4.897
engineering+4.804
engine+4.794
Neg logit contributions
ional-7.915
 Favor-7.539
inas-7.484
Jar-7.386
icularly-7.052
Token.
Token ID13
Feature activation+6.70e-02

Pos logit contributions
anwhile+2.379
PN+2.320
DR+2.262
hy+2.239
engineering+2.193
Neg logit contributions
ional-2.021
inas-1.960
 Favor-1.946
Jar-1.881
 deviation-1.806
Token But
Token ID887
Feature activation-7.63e-01

Pos logit contributions
anwhile+0.113
PN+0.110
DR+0.106
hy+0.105
engineering+0.102
Neg logit contributions
ional-0.099
inas-0.094
 Favor-0.092
Jar-0.090
 deviation-0.087
Token as
Token ID355
Feature activation+2.87e+00

Pos logit contributions
ional+1.105
 Favor+1.055
inas+1.049
Jar+1.020
 deviation+0.984
Neg logit contributions
anwhile-1.316
PN-1.265
DR-1.213
hy-1.205
estern-1.179
Token with
Token ID351
Feature activation+1.21e+00

Pos logit contributions
anwhile+4.183
PN+4.086
hy+3.982
DR+3.943
engineering+3.781
Neg logit contributions
ional-4.535
inas-4.427
 Favor-4.304
Jar-4.246
 deviation-4.068
Token\n
Token ID198
Feature activation-1.70e-01

Pos logit contributions
anwhile+0.019
PN+0.018
DR+0.017
hy+0.017
engineering+0.017
Neg logit contributions
ional-0.017
inas-0.016
 Favor-0.016
Jar-0.016
 deviation-0.015
Token\n
Token ID198
Feature activation-4.88e-01

Pos logit contributions
ional+0.256
 Favor+0.245
inas+0.243
Jar+0.237
 deviation+0.229
Neg logit contributions
anwhile-0.279
PN-0.271
DR-0.260
hy-0.258
estern-0.251
Token-
Token ID12
Feature activation-8.86e-01

Pos logit contributions
ional+0.701
 Favor+0.669
inas+0.666
Jar+0.647
 deviation+0.624
Neg logit contributions
anwhile-0.847
PN-0.815
DR-0.782
hy-0.776
estern-0.760
Token The
Token ID383
Feature activation-3.31e-02

Pos logit contributions
ional+1.193
inas+1.131
 Favor+1.130
Jar+1.097
 deviation+1.052
Neg logit contributions
anwhile-1.621
PN-1.564
DR-1.502
hy-1.491
estern-1.457
Token premature
Token ID19905
Feature activation-3.38e-01

Pos logit contributions
ional+0.052
inas+0.049
 Favor+0.049
Jar+0.048
 deviation+0.046
Neg logit contributions
anwhile-0.053
PN-0.051
DR-0.049
hy-0.048
engineering-0.047
Token access
Token ID1895
Feature activation+4.31e+00

Pos logit contributions
ional+0.483
 Favor+0.460
inas+0.459
Jar+0.446
 deviation+0.421
Neg logit contributions
anwhile-0.588
PN-0.568
DR-0.543
hy-0.540
engineering-0.527
Tokenion
Token ID295
Feature activation+5.07e-01

Pos logit contributions
anwhile+9.269
PN+9.175
hy+8.852
DR+8.830
engineering+8.638
Neg logit contributions
 Favor-3.551
inas-3.432
Jar-3.215
ional-3.031
 deviation-2.958
Token of
Token ID286
Feature activation+2.86e-02

Pos logit contributions
anwhile+0.872
PN+0.848
DR+0.809
hy+0.804
engineering+0.778
Neg logit contributions
ional-0.715
inas-0.694
 Favor-0.693
Jar-0.673
 deviation-0.637
Token Bulgaria
Token ID27902
Feature activation-2.29e+00

Pos logit contributions
anwhile+0.050
PN+0.048
DR+0.046
hy+0.046
estern+0.045
Neg logit contributions
ional-0.040
inas-0.039
 Favor-0.038
Jar-0.037
 deviation-0.036
Token and
Token ID290
Feature activation-3.34e-01

Pos logit contributions
ional+3.701
 Favor+3.532
inas+3.513
Jar+3.424
 deviation+3.328
Neg logit contributions
anwhile-3.567
PN-3.403
DR-3.243
hy-3.221
estern-3.168
Token Romania
Token ID23356
Feature activation-2.24e+00

Pos logit contributions
ional+0.460
inas+0.437
 Favor+0.437
Jar+0.424
 deviation+0.405
Neg logit contributions
anwhile-0.598
PN-0.577
DR-0.555
hy-0.551
estern-0.537
Token the
Token ID262
Feature activation+3.29e-01

Pos logit contributions
anwhile+2.034
PN+1.967
DR+1.916
hy+1.892
estern+1.849
Neg logit contributions
ional-1.912
inas-1.817
 Favor-1.800
Jar-1.760
 deviation-1.673
Token past
Token ID1613
Feature activation+2.25e+00

Pos logit contributions
anwhile+0.524
PN+0.502
DR+0.486
hy+0.477
estern+0.465
Neg logit contributions
ional-0.519
 Favor-0.492
inas-0.492
Jar-0.478
 deviation-0.457
Token two
Token ID734
Feature activation+4.65e-01

Pos logit contributions
anwhile+3.370
PN+3.294
DR+3.241
hy+3.167
estern+3.029
Neg logit contributions
ional-3.529
 Favor-3.348
inas-3.344
Jar-3.278
icularly-3.061
Token years
Token ID812
Feature activation+7.96e-01

Pos logit contributions
anwhile+0.673
PN+0.650
DR+0.627
hy+0.623
estern+0.598
Neg logit contributions
ional-0.788
 Favor-0.759
inas-0.749
Jar-0.734
 deviation-0.711
Token,
Token ID11
Feature activation-3.20e-02

Pos logit contributions
anwhile+1.342
PN+1.295
DR+1.251
hy+1.246
engineering+1.209
Neg logit contributions
ional-1.129
 Favor-1.110
inas-1.107
Jar-1.079
 deviation-1.034
Token HR
Token ID15172
Feature activation+4.30e+00

Pos logit contributions
ional+0.047
inas+0.045
 Favor+0.045
Jar+0.043
 deviation+0.042
Neg logit contributions
anwhile-0.055
PN-0.053
DR-0.051
hy-0.050
engineering-0.049
TokenW
Token ID54
Feature activation+3.66e+00

Pos logit contributions
PN+5.053
DR+4.955
hy+4.370
engineering+4.165
ITNESS+4.145
Neg logit contributions
ional-8.360
 Favor-7.922
inas-7.887
icularly-7.709
 deviation-7.625
Token said
Token ID531
Feature activation+1.55e+00

Pos logit contributions
anwhile+5.701
PN+5.537
DR+5.504
hy+5.280
estern+5.103
Neg logit contributions
ional-5.435
 Favor-5.243
inas-5.223
Jar-5.017
 deviation-4.909
Token.
Token ID13
Feature activation-9.31e-02

Pos logit contributions
anwhile+2.487
PN+2.363
hy+2.316
DR+2.308
engineering+2.183
Neg logit contributions
 Favor-2.283
ional-2.274
inas-2.272
Jar-2.198
 deviation-2.124
Token\n
Token ID198
Feature activation-7.11e-02

Pos logit contributions
ional+0.144
inas+0.137
 Favor+0.136
Jar+0.133
 deviation+0.130
Neg logit contributions
anwhile-0.150
PN-0.145
DR-0.140
hy-0.138
estern-0.134
Token\n
Token ID198
Feature activation-3.21e-01

Pos logit contributions
ional+0.107
 Favor+0.102
inas+0.102
Jar+0.099
 deviation+0.096
Neg logit contributions
anwhile-0.118
PN-0.114
DR-0.109
hy-0.109
estern-0.106
Tokenarse
Token ID17208
Feature activation-1.01e+00

Pos logit contributions
anwhile+3.822
PN+3.687
hy+3.647
DR+3.607
engineering+3.506
Neg logit contributions
ional-4.815
 Favor-4.749
inas-4.624
 deviation-4.507
Jar-4.503
Token wrote
Token ID2630
Feature activation+1.46e+00

Pos logit contributions
ional+1.993
inas+1.922
 Favor+1.922
Jar+1.884
 deviation+1.833
Neg logit contributions
anwhile-1.226
PN-1.141
DR-1.069
hy-1.051
estern-1.033
Token:
Token ID25
Feature activation+7.42e-01

Pos logit contributions
PN+2.420
DR+2.350
hy+2.336
anwhile+2.318
engineering+2.243
Neg logit contributions
ional-2.053
 Favor-1.974
inas-1.950
Jar-1.911
 deviation-1.865
Token\n
Token ID198
Feature activation+4.98e-01

Pos logit contributions
anwhile+1.167
PN+1.156
DR+1.112
hy+1.102
engineering+1.067
Neg logit contributions
ional-1.146
 Favor-1.098
inas-1.092
Jar-1.060
 deviation-1.033
Token\n
Token ID198
Feature activation+7.70e-02

Pos logit contributions
anwhile+0.782
PN+0.777
DR+0.749
hy+0.742
engineering+0.716
Neg logit contributions
ional-0.768
 Favor-0.736
inas-0.733
Jar-0.710
 deviation-0.692
TokenShow
Token ID15307
Feature activation-6.19e+00

Pos logit contributions
anwhile+0.089
PN+0.087
DR+0.083
hy+0.082
engineering+0.078
Neg logit contributions
ional-0.152
 Favor-0.148
inas-0.147
Jar-0.142
 deviation-0.141
Token nested
Token ID28376
Feature activation-4.29e-01

Pos logit contributions
 Favor+7.451
ional+7.419
Jar+6.936
inas+6.836
 deviation+6.664
Neg logit contributions
anwhile-11.939
PN-10.773
estern-10.740
DR-10.500
osponsors-10.372
Token quote
Token ID9577
Feature activation-2.72e+00

Pos logit contributions
ional+0.586
 Favor+0.566
inas+0.562
Jar+0.542
 deviation+0.523
Neg logit contributions
anwhile-0.775
PN-0.740
DR-0.708
hy-0.704
estern-0.699
Token +
Token ID1343
Feature activation+3.86e-01

Pos logit contributions
ional+3.726
 Favor+3.605
inas+3.431
Jar+3.339
 deviation+3.081
Neg logit contributions
anwhile-5.176
PN-4.772
 destro-4.691
osponsors-4.689
estern-4.676
Token\n
Token ID198
Feature activation+4.57e-02

Pos logit contributions
anwhile+0.645
PN+0.621
DR+0.595
hy+0.590
estern+0.577
Neg logit contributions
ional-0.577
 Favor-0.551
inas-0.549
Jar-0.533
 deviation-0.516
Token\n
Token ID198
Feature activation-3.24e-01

Pos logit contributions
anwhile+0.074
PN+0.073
DR+0.070
hy+0.069
estern+0.067
Neg logit contributions
ional-0.069
 Favor-0.066
inas-0.066
Jar-0.064
 deviation-0.062
Token the
Token ID262
Feature activation-5.12e-01

Pos logit contributions
ional+2.092
inas+1.984
 Favor+1.971
Jar+1.917
 deviation+1.835
Neg logit contributions
anwhile-2.896
PN-2.796
DR-2.693
hy-2.672
estern-2.617
Token referendum
Token ID11467
Feature activation-7.96e-01

Pos logit contributions
ional+0.718
inas+0.683
 Favor+0.679
Jar+0.663
 deviation+0.635
Neg logit contributions
anwhile-0.902
PN-0.874
DR-0.836
hy-0.831
engineering-0.811
Token.
Token ID13
Feature activation-5.20e-01

Pos logit contributions
ional+1.132
inas+1.085
 Favor+1.084
Jar+1.051
 deviation+1.013
Neg logit contributions
anwhile-1.376
PN-1.331
DR-1.281
hy-1.274
engineering-1.236
Token\n
Token ID198
Feature activation-2.28e-01

Pos logit contributions
ional+0.788
inas+0.752
 Favor+0.742
Jar+0.722
 deviation+0.706
Neg logit contributions
anwhile-0.852
PN-0.836
DR-0.801
hy-0.790
estern-0.762
Token\n
Token ID198
Feature activation-4.87e-01

Pos logit contributions
ional+0.344
 Favor+0.329
inas+0.327
Jar+0.318
 deviation+0.308
Neg logit contributions
anwhile-0.377
PN-0.366
DR-0.351
hy-0.348
estern-0.339
TokenIt
Token ID1026
Feature activation-5.82e+00

Pos logit contributions
ional+0.727
 Favor+0.704
inas+0.694
Jar+0.662
 deviation+0.660
Neg logit contributions
PN-0.797
anwhile-0.796
DR-0.764
hy-0.754
engineering-0.730
Token was
Token ID373
Feature activation+2.28e-02

Pos logit contributions
ional+7.758
inas+7.301
 Favor+7.104
Jar+6.884
 deviation+6.683
Neg logit contributions
anwhile-10.101
PN-9.887
DR-9.459
estern-9.268
osponsors-9.188
Token already
Token ID1541
Feature activation+7.19e-01

Pos logit contributions
anwhile+0.034
PN+0.033
hy+0.031
DR+0.031
estern+0.030
Neg logit contributions
ional-0.038
inas-0.037
 Favor-0.036
Jar-0.035
 deviation-0.034
Token known
Token ID1900
Feature activation-3.32e-01

Pos logit contributions
anwhile+1.229
PN+1.194
hy+1.158
DR+1.132
estern+1.109
Neg logit contributions
ional-1.003
inas-0.988
 Favor-0.970
Jar-0.940
 deviation-0.908
Token that
Token ID326
Feature activation-1.60e-01

Pos logit contributions
ional+0.425
inas+0.410
 Favor+0.408
Jar+0.394
 deviation+0.378
Neg logit contributions
anwhile-0.622
PN-0.601
DR-0.579
hy-0.578
estern-0.562
Token the
Token ID262
Feature activation+2.47e-01

Pos logit contributions
ional+0.218
inas+0.209
 Favor+0.206
Jar+0.201
 deviation+0.193
Neg logit contributions
anwhile-0.286
PN-0.276
DR-0.265
hy-0.264
estern-0.255
Token and
Token ID290
Feature activation-5.57e-01

Pos logit contributions
anwhile+0.354
PN+0.342
DR+0.331
hy+0.331
estern+0.321
Neg logit contributions
ional-0.269
 Favor-0.257
inas-0.256
Jar-0.252
 deviation-0.240
Token the
Token ID262
Feature activation+7.69e-01

Pos logit contributions
ional+0.792
 Favor+0.754
inas+0.754
Jar+0.737
 deviation+0.703
Neg logit contributions
anwhile-0.972
PN-0.936
DR-0.900
hy-0.893
estern-0.866
Token world
Token ID995
Feature activation+6.68e-01

Pos logit contributions
anwhile+1.096
PN+1.059
DR+1.007
hy+0.998
estern+0.965
Neg logit contributions
ional-1.325
inas-1.269
 Favor-1.263
Jar-1.253
 deviation-1.190
Token.
Token ID13
Feature activation+1.17e-01

Pos logit contributions
anwhile+1.116
PN+1.077
DR+1.039
hy+1.034
estern+1.000
Neg logit contributions
ional-0.985
 Favor-0.941
inas-0.941
Jar-0.922
 deviation-0.883
Token<|endoftext|>
Token ID50256
Feature activation-8.32e-01

Pos logit contributions
anwhile+0.182
PN+0.178
DR+0.171
hy+0.169
estern+0.163
Neg logit contributions
ional-0.185
inas-0.177
 Favor-0.173
Jar-0.170
 deviation-0.167
TokenIt
Token ID1026
Feature activation-5.77e+00

Pos logit contributions
ional+1.335
 Favor+1.285
inas+1.273
Jar+1.233
 deviation+1.209
Neg logit contributions
anwhile-1.292
PN-1.261
DR-1.205
hy-1.192
engineering-1.157
Token is
Token ID318
Feature activation-1.50e+00

Pos logit contributions
ional+6.959
inas+6.666
 Favor+6.535
Jar+6.261
tle+5.929
Neg logit contributions
anwhile-10.818
PN-10.457
DR-9.973
estern-9.789
osponsors-9.709
Token impossible
Token ID5340
Feature activation-3.92e-01

Pos logit contributions
ional+2.323
 Favor+2.224
inas+2.214
Jar+2.157
 deviation+2.086
Neg logit contributions
anwhile-2.433
PN-2.333
DR-2.231
hy-2.215
estern-2.165
Token to
Token ID284
Feature activation-1.32e-01

Pos logit contributions
ional+0.682
 Favor+0.656
inas+0.653
Jar+0.639
 deviation+0.620
Neg logit contributions
anwhile-0.562
PN-0.536
DR-0.510
hy-0.505
estern-0.492
Token identify
Token ID5911
Feature activation-1.70e+00

Pos logit contributions
ional+0.198
 Favor+0.189
inas+0.189
Jar+0.186
 deviation+0.177
Neg logit contributions
anwhile-0.219
PN-0.210
DR-0.202
hy-0.202
estern-0.195
Token what
Token ID644
Feature activation+5.35e-01

Pos logit contributions
ional+2.355
 Favor+2.241
inas+2.237
Jar+2.171
 deviation+2.079
Neg logit contributions
anwhile-3.037
PN-2.929
DR-2.818
hy-2.800
estern-2.733
TokenWATCH
Token ID35192
Feature activation+1.96e-01

Pos logit contributions
PN+0.155
anwhile+0.153
DR+0.148
hy+0.146
engineering+0.141
Neg logit contributions
ional-0.153
 Favor-0.148
inas-0.146
 deviation-0.139
Jar-0.138
Token\n
Token ID198
Feature activation-2.46e-01

Pos logit contributions
anwhile+0.344
PN+0.336
DR+0.324
hy+0.319
estern+0.310
Neg logit contributions
ional-0.277
 Favor-0.263
inas-0.262
Jar-0.255
 deviation-0.246
Token\n
Token ID198
Feature activation-4.93e-01

Pos logit contributions
ional+0.366
 Favor+0.350
inas+0.348
Jar+0.338
 deviation+0.327
Neg logit contributions
anwhile-0.414
PN-0.397
DR-0.381
hy-0.378
estern-0.370
Token:
Token ID25
Feature activation-8.54e-02

Pos logit contributions
ional+0.779
 Favor+0.749
inas+0.742
Jar+0.715
 deviation+0.705
Neg logit contributions
anwhile-0.774
PN-0.760
DR-0.726
hy-0.718
estern-0.695
Token<|endoftext|>
Token ID50256
Feature activation-2.66e-01

Pos logit contributions
ional+0.139
inas+0.133
 Favor+0.132
Jar+0.129
 deviation+0.126
Neg logit contributions
anwhile-0.131
PN-0.128
DR-0.122
hy-0.121
estern-0.116
TokenIt
Token ID1026
Feature activation-5.68e+00

Pos logit contributions
ional+0.426
 Favor+0.412
inas+0.406
Jar+0.390
 deviation+0.388
Neg logit contributions
anwhile-0.409
PN-0.407
DR-0.389
hy-0.384
engineering-0.372
Token can
Token ID460
Feature activation-1.49e+00

Pos logit contributions
ional+6.839
inas+6.550
 Favor+6.428
Jar+6.155
tle+5.817
Neg logit contributions
anwhile-10.699
PN-10.345
DR-9.868
estern-9.685
osponsors-9.600
Token’ (first two UTF-8 bytes)
Token ID447
Feature activation-1.23e+00

Pos logit contributions
ional+1.970
 Favor+1.876
inas+1.858
Jar+1.807
 deviation+1.738
Neg logit contributions
anwhile-2.770
PN-2.667
DR-2.576
hy-2.562
estern-2.501
Token’ (final UTF-8 byte)
Token ID247
Feature activation-4.28e-01

Pos logit contributions
ional+1.762
inas+1.673
 Favor+1.672
Jar+1.620
 deviation+1.568
Neg logit contributions
anwhile-2.120
PN-2.059
DR-1.981
hy-1.963
estern-1.917
Tokent
Token ID83
Feature activation-2.34e+00

Pos logit contributions
ional+0.678
 Favor+0.659
inas+0.655
Jar+0.637
 deviation+0.624
Neg logit contributions
anwhile-0.667
PN-0.642
DR-0.620
hy-0.616
estern-0.595
Token be
Token ID307
Feature activation-1.62e+00

Pos logit contributions
ional+3.054
 Favor+2.901
inas+2.882
Jar+2.795
 deviation+2.687
Neg logit contributions
anwhile-4.382
PN-4.224
DR-4.067
hy-4.042
estern-3.961
Token of
Token ID286
Feature activation+8.30e-02

Pos logit contributions
ional+1.515
 Favor+1.446
inas+1.433
Jar+1.394
 deviation+1.331
Neg logit contributions
anwhile-2.041
PN-1.982
DR-1.900
hy-1.887
engineering-1.838
Token the
Token ID262
Feature activation+5.16e-01

Pos logit contributions
anwhile+0.138
PN+0.135
DR+0.129
hy+0.127
estern+0.124
Neg logit contributions
ional-0.124
 Favor-0.119
inas-0.117
Jar-0.114
 deviation-0.111
Token investigations
Token ID10986
Feature activation+1.04e-01

Pos logit contributions
anwhile+0.883
PN+0.865
DR+0.834
hy+0.819
engineering+0.793
Neg logit contributions
ional-0.732
 Favor-0.701
inas-0.686
Jar-0.682
 deviation-0.644
Token.
Token ID13
Feature activation-3.57e-01

Pos logit contributions
anwhile+0.175
PN+0.170
DR+0.163
hy+0.162
engineering+0.157
Neg logit contributions
ional-0.153
 Favor-0.147
inas-0.146
Jar-0.142
 deviation-0.137
Token<|endoftext|>
Token ID50256
Feature activation-7.90e-01

Pos logit contributions
ional+0.582
inas+0.553
 Favor+0.550
Jar+0.536
 deviation+0.526
Neg logit contributions
anwhile-0.544
PN-0.533
DR-0.509
hy-0.503
engineering-0.486
TokenIt
Token ID1026
Feature activation-5.68e+00

Pos logit contributions
ional+1.272
 Favor+1.226
inas+1.213
Jar+1.174
 deviation+1.154
Neg logit contributions
anwhile-1.219
PN-1.194
DR-1.142
hy-1.129
engineering-1.095
Token was
Token ID373
Feature activation+4.12e-02

Pos logit contributions
ional+6.805
inas+6.518
 Favor+6.394
Jar+6.127
tle+5.790
Neg logit contributions
anwhile-10.755
PN-10.392
DR-9.916
estern-9.735
osponsors-9.655
Token supposed
Token ID4385
Feature activation-1.21e+00

Pos logit contributions
anwhile+0.069
PN+0.067
hy+0.064
DR+0.064
estern+0.063
Neg logit contributions
ional-0.061
inas-0.058
 Favor-0.058
Jar-0.056
 deviation-0.054
Token to
Token ID284
Feature activation-1.02e+00

Pos logit contributions
ional+2.079
 Favor+1.987
inas+1.967
Jar+1.931
 deviation+1.882
Neg logit contributions
anwhile-1.807
PN-1.678
DR-1.591
hy-1.571
estern-1.547
Token be
Token ID307
Feature activation-2.00e+00

Pos logit contributions
ional+1.315
 Favor+1.245
inas+1.240
Jar+1.201
 deviation+1.157
Neg logit contributions
anwhile-1.910
PN-1.844
DR-1.784
hy-1.774
estern-1.733
Token a
Token ID257
Feature activation-3.10e-01

Pos logit contributions
ional+2.804
 Favor+2.671
inas+2.657
Jar+2.580
 deviation+2.486
Neg logit contributions
anwhile-3.554
PN-3.420
DR-3.283
hy-3.261
estern-3.195
Token's
Token ID338
Feature activation-1.19e+00

Pos logit contributions
anwhile+1.078
PN+1.038
DR+1.006
hy+1.003
engineering+0.969
Neg logit contributions
ional-0.845
inas-0.812
 Favor-0.809
Jar-0.779
 deviation-0.744
Token Facebook
Token ID3203
Feature activation+1.55e+00

Pos logit contributions
ional+1.736
 Favor+1.659
inas+1.648
Jar+1.603
 deviation+1.551
Neg logit contributions
anwhile-2.037
PN-1.955
DR-1.871
hy-1.856
estern-1.826
Token page
Token ID2443
Feature activation+2.70e-01

Pos logit contributions
anwhile+2.113
PN+2.019
DR+1.943
hy+1.909
engineering+1.837
Neg logit contributions
ional-2.740
inas-2.661
 Favor-2.601
Jar-2.593
 deviation-2.472
Token.
Token ID13
Feature activation-1.71e-01

Pos logit contributions
anwhile+0.459
PN+0.439
DR+0.422
hy+0.420
engineering+0.407
Neg logit contributions
ional-0.399
inas-0.382
 Favor-0.380
Jar-0.371
 deviation-0.357
Token<|endoftext|>
Token ID50256
Feature activation-8.70e-01

Pos logit contributions
ional+0.263
 Favor+0.252
inas+0.251
Jar+0.244
 deviation+0.237
Neg logit contributions
anwhile-0.278
PN-0.266
DR-0.255
hy-0.254
estern-0.247
TokenIt
Token ID1026
Feature activation-5.67e+00

Pos logit contributions
ional+1.396
 Favor+1.344
inas+1.331
Jar+1.290
 deviation+1.264
Neg logit contributions
anwhile-1.352
PN-1.318
DR-1.261
hy-1.247
engineering-1.210
Token's
Token ID338
Feature activation-2.60e+00

Pos logit contributions
ional+6.806
inas+6.510
 Favor+6.386
Jar+6.112
tle+5.785
Neg logit contributions
anwhile-10.706
PN-10.373
DR-9.897
estern-9.709
osponsors-9.628
Token back
Token ID736
Feature activation+1.38e+00

Pos logit contributions
ional+3.959
 Favor+3.793
inas+3.753
Jar+3.666
 deviation+3.549
Neg logit contributions
anwhile-4.301
PN-4.119
DR-3.938
hy-3.887
estern-3.809
Token-
Token ID12
Feature activation+6.63e-01

Pos logit contributions
anwhile+2.117
PN+2.047
hy+1.981
DR+1.977
estern+1.925
Neg logit contributions
ional-2.200
inas-2.118
 Favor-2.099
Jar-2.058
 deviation-2.001
Tokento
Token ID1462
Feature activation-2.41e-01

Pos logit contributions
anwhile+0.802
PN+0.773
hy+0.740
DR+0.737
engineering+0.718
Neg logit contributions
ional-1.279
 Favor-1.244
inas-1.239
Jar-1.200
 deviation-1.186
Token-
Token ID12
Feature activation-8.71e-01

Pos logit contributions
ional+0.346
 Favor+0.330
inas+0.328
Jar+0.319
 deviation+0.308
Neg logit contributions
anwhile-0.418
PN-0.402
DR-0.386
hy-0.383
estern-0.375
Token salary
Token ID9588
Feature activation-4.29e-01

Pos logit contributions
ional+1.550
 Favor+1.478
inas+1.477
Jar+1.429
 deviation+1.371
Neg logit contributions
anwhile-1.784
PN-1.717
DR-1.654
hy-1.636
estern-1.592
Token next
Token ID1306
Feature activation+5.05e-01

Pos logit contributions
ional+0.612
inas+0.590
 Favor+0.588
Jar+0.559
 deviation+0.537
Neg logit contributions
anwhile-0.741
PN-0.717
DR-0.697
hy-0.688
engineering-0.666
Token year
Token ID614
Feature activation-7.53e-03

Pos logit contributions
anwhile+0.789
PN+0.762
DR+0.732
hy+0.726
engineering+0.703
Neg logit contributions
ional-0.795
 Favor-0.771
inas-0.769
Jar-0.740
 deviation-0.712
Token.
Token ID13
Feature activation+2.63e-01

Pos logit contributions
ional+0.011
 Favor+0.011
inas+0.011
Jar+0.010
 deviation+0.010
Neg logit contributions
anwhile-0.013
PN-0.012
DR-0.012
hy-0.012
engineering-0.011
Token<|endoftext|>
Token ID50256
Feature activation-7.99e-01

Pos logit contributions
anwhile+0.415
PN+0.405
DR+0.391
hy+0.386
estern+0.371
Neg logit contributions
ional-0.410
inas-0.393
 Favor-0.384
Jar-0.374
 deviation-0.369
TokenIt
Token ID1026
Feature activation-5.65e+00

Pos logit contributions
ional+1.281
 Favor+1.234
inas+1.222
Jar+1.182
 deviation+1.162
Neg logit contributions
anwhile-1.239
PN-1.214
DR-1.161
hy-1.148
engineering-1.113
Token all
Token ID477
Feature activation-9.96e-01

Pos logit contributions
ional+6.827
inas+6.534
 Favor+6.418
Jar+6.155
tle+5.809
Neg logit contributions
anwhile-10.649
PN-10.295
DR-9.823
estern-9.626
osponsors-9.550
Token starts
Token ID4940
Feature activation-1.50e+00

Pos logit contributions
ional+1.670
 Favor+1.616
inas+1.604
Jar+1.570
 deviation+1.529
Neg logit contributions
anwhile-1.480
PN-1.417
hy-1.356
DR-1.349
engineering-1.301
Token with
Token ID351
Feature activation-8.27e-02

Pos logit contributions
ional+2.165
 Favor+2.071
inas+2.059
Jar+2.003
 deviation+1.932
Neg logit contributions
anwhile-2.590
PN-2.489
DR-2.390
hy-2.374
estern-2.320
Token Media
Token ID6343
Feature activation+6.05e-01

Pos logit contributions
ional+0.117
 Favor+0.111
inas+0.111
Jar+0.108
 deviation+0.103
Neg logit contributions
anwhile-0.144
PN-0.139
DR-0.134
hy-0.134
engineering-0.130
Token Day
Token ID3596
Feature activation-5.79e-01

Pos logit contributions
anwhile+1.026
PN+1.005
DR+0.984
hy+0.962
engineering+0.944
Neg logit contributions
ional-0.872
inas-0.823
 Favor-0.817
 deviation-0.786
Jar-0.784
Token mainstream
Token ID8661
Feature activation-1.51e+00

Pos logit contributions
ional+0.924
inas+0.879
 Favor+0.873
Jar+0.856
 deviation+0.818
Neg logit contributions
anwhile-1.065
PN-1.026
DR-0.983
hy-0.977
engineering-0.950
Token debate
Token ID4384
Feature activation-1.29e+00

Pos logit contributions
ional+2.454
 Favor+2.346
inas+2.343
Jar+2.293
 deviation+2.203
Neg logit contributions
anwhile-2.345
PN-2.257
DR-2.153
hy-2.137
estern-2.070
Token.
Token ID13
Feature activation-6.35e-01

Pos logit contributions
ional+1.833
 Favor+1.750
inas+1.742
Jar+1.692
 deviation+1.633
Neg logit contributions
anwhile-2.271
PN-2.187
DR-2.099
hy-2.088
estern-2.038
Token\n
Token ID198
Feature activation-2.04e-01

Pos logit contributions
ional+0.920
inas+0.874
 Favor+0.865
Jar+0.843
 deviation+0.818
Neg logit contributions
anwhile-1.090
PN-1.061
DR-1.020
hy-1.009
estern-0.978
Token\n
Token ID198
Feature activation-4.18e-01

Pos logit contributions
ional+0.308
 Favor+0.295
inas+0.293
Jar+0.284
 deviation+0.276
Neg logit contributions
anwhile-0.334
PN-0.326
DR-0.313
hy-0.310
estern-0.301
TokenIt
Token ID1026
Feature activation-5.60e+00

Pos logit contributions
ional+0.636
 Favor+0.613
inas+0.606
Jar+0.581
 deviation+0.576
Neg logit contributions
anwhile-0.675
PN-0.672
DR-0.645
hy-0.636
engineering-0.616
Token's
Token ID338
Feature activation-2.68e+00

Pos logit contributions
ional+6.695
inas+6.294
 Favor+6.259
Jar+5.965
 deviation+5.708
Neg logit contributions
anwhile-10.543
PN-10.328
DR-9.837
estern-9.732
ITNESS-9.601
Token stunning
Token ID13393
Feature activation-1.94e+00

Pos logit contributions
ional+4.029
 Favor+3.861
inas+3.805
Jar+3.721
 deviation+3.613
Neg logit contributions
anwhile-4.456
PN-4.275
DR-4.088
hy-4.027
estern-3.973
Token when
Token ID618
Feature activation+1.37e+00

Pos logit contributions
ional+2.605
 Favor+2.472
inas+2.457
Jar+2.384
 deviation+2.300
Neg logit contributions
anwhile-3.543
PN-3.410
DR-3.274
hy-3.252
estern-3.196
Token you
Token ID345
Feature activation+5.23e-01

Pos logit contributions
anwhile+2.226
PN+2.158
DR+2.070
hy+2.057
estern+1.989
Neg logit contributions
ional-2.067
inas-1.991
 Favor-1.970
Jar-1.938
 deviation-1.834
Token think
Token ID892
Feature activation-1.37e+00

Pos logit contributions
anwhile+0.829
PN+0.794
DR+0.761
hy+0.754
estern+0.732
Neg logit contributions
ional-0.828
 Favor-0.793
inas-0.792
Jar-0.775
 deviation-0.745
Token.
Token ID13
Feature activation+4.18e-01

Pos logit contributions
ional+3.768
inas+3.597
 Favor+3.589
Jar+3.492
 deviation+3.397
Neg logit contributions
anwhile-3.192
PN-3.079
DR-2.912
hy-2.860
estern-2.826
Token curfew
Token ID49565
Feature activation-3.12e-01

Pos logit contributions
anwhile+0.721
PN+0.692
DR+0.662
hy+0.658
estern+0.646
Neg logit contributions
ional-0.607
 Favor-0.580
inas-0.576
Jar-0.561
 deviation-0.541
Token.
Token ID13
Feature activation-6.80e-01

Pos logit contributions
ional+0.450
 Favor+0.433
inas+0.430
Jar+0.416
 deviation+0.394
Neg logit contributions
anwhile-0.539
PN-0.521
DR-0.499
hy-0.495
engineering-0.481
Token\n
Token ID198
Feature activation-3.98e-01

Pos logit contributions
ional+0.981
inas+0.931
 Favor+0.926
Jar+0.900
 deviation+0.869
Neg logit contributions
anwhile-1.179
PN-1.139
DR-1.098
hy-1.087
engineering-1.054
Token\n
Token ID198
Feature activation-6.89e-01

Pos logit contributions
ional+0.596
 Favor+0.569
inas+0.566
Jar+0.551
 deviation+0.533
Neg logit contributions
anwhile-0.664
PN-0.641
DR-0.614
hy-0.609
estern-0.595
TokenIt
Token ID1026
Feature activation-5.60e+00

Pos logit contributions
ional+1.046
 Favor+1.010
inas+0.996
Jar+0.956
 deviation+0.947
Neg logit contributions
anwhile-1.117
PN-1.107
DR-1.063
hy-1.049
engineering-1.017
Token's
Token ID338
Feature activation-2.53e+00

Pos logit contributions
ional+7.640
inas+7.246
 Favor+7.051
Jar+6.852
 deviation+6.642
Neg logit contributions
anwhile-9.588
PN-9.416
DR-8.963
estern-8.825
osponsors-8.702
Token all
Token ID477
Feature activation-9.92e-01

Pos logit contributions
ional+3.880
 Favor+3.712
inas+3.680
Jar+3.589
 deviation+3.486
Neg logit contributions
anwhile-4.148
PN-3.980
DR-3.807
hy-3.759
estern-3.695
Token been
Token ID587
Feature activation-9.26e-02

Pos logit contributions
ional+1.504
 Favor+1.442
inas+1.436
Jar+1.398
 deviation+1.347
Neg logit contributions
anwhile-1.641
PN-1.575
DR-1.508
hy-1.502
estern-1.461
Token worth
Token ID2861
Feature activation-8.11e-01

Pos logit contributions
ional+0.154
inas+0.149
 Favor+0.148
Jar+0.144
 deviation+0.140
Neg logit contributions
anwhile-0.138
PN-0.133
DR-0.127
hy-0.126
estern-0.122
Token it
Token ID340
Feature activation-2.16e+00

Pos logit contributions
ional+1.191
 Favor+1.134
inas+1.134
Jar+1.101
 deviation+1.064
Neg logit contributions
anwhile-1.376
PN-1.333
DR-1.279
hy-1.271
engineering-1.233
Tokenand
Token ID392
Feature activation+8.84e-01

Pos logit contributions
anwhile+1.033
PN+0.989
DR+0.935
hy+0.928
estern+0.915
Neg logit contributions
ional-1.099
 Favor-1.072
inas-1.059
Jar-1.042
 deviation-1.017
Token for
Token ID329
Feature activation-3.93e-01

Pos logit contributions
anwhile+1.459
PN+1.432
hy+1.357
DR+1.345
estern+1.321
Neg logit contributions
ional-1.287
inas-1.254
 Favor-1.240
Jar-1.232
 deviation-1.184
Token References
Token ID31458
Feature activation-2.46e-02

Pos logit contributions
ional+0.637
inas+0.607
 Favor+0.598
Jar+0.594
 deviation+0.570
Neg logit contributions
anwhile-0.608
PN-0.593
DR-0.559
hy-0.558
estern-0.539
Token:
Token ID25
Feature activation+3.74e-01

Pos logit contributions
ional+0.040
inas+0.038
 Favor+0.038
Jar+0.037
 deviation+0.036
Neg logit contributions
anwhile-0.038
PN-0.036
DR-0.035
hy-0.034
engineering-0.033
Token<|endoftext|>
Token ID50256
Feature activation-6.11e-01

Pos logit contributions
anwhile+0.589
PN+0.567
DR+0.547
hy+0.542
engineering+0.524
Neg logit contributions
ional-0.594
inas-0.569
 Favor-0.560
Jar-0.549
 deviation-0.534
TokenIt
Token ID1026
Feature activation-5.52e+00

Pos logit contributions
ional+0.977
 Favor+0.942
inas+0.931
Jar+0.900
 deviation+0.887
Neg logit contributions
anwhile-0.949
PN-0.933
DR-0.893
hy-0.882
engineering-0.856
Token took
Token ID1718
Feature activation-8.76e-01

Pos logit contributions
ional+6.659
inas+6.369
 Favor+6.261
Jar+5.988
tle+5.659
Neg logit contributions
anwhile-10.420
PN-10.079
DR-9.625
estern-9.437
osponsors-9.360
Token until
Token ID1566
Feature activation-3.05e-01

Pos logit contributions
ional+1.213
 Favor+1.150
inas+1.147
Jar+1.115
 deviation+1.064
Neg logit contributions
anwhile-1.566
PN-1.511
DR-1.454
hy-1.448
estern-1.409
Token Sunday
Token ID3502
Feature activation-3.26e+00

Pos logit contributions
ional+0.385
inas+0.365
 Favor+0.365
Jar+0.353
 deviation+0.336
Neg logit contributions
anwhile-0.578
PN-0.557
DR-0.538
hy-0.537
estern-0.524
Token morning
Token ID3329
Feature activation-6.53e-01

Pos logit contributions
ional+4.263
 Favor+4.068
inas+3.979
Jar+3.860
 deviation+3.735
Neg logit contributions
anwhile-6.052
PN-5.798
DR-5.573
hy-5.527
estern-5.467
Token for
Token ID329
Feature activation-8.80e-01

Pos logit contributions
ional+1.051
inas+1.015
 Favor+1.014
Jar+0.989
 deviation+0.950
Neg logit contributions
anwhile-1.011
PN-0.974
DR-0.930
hy-0.923
estern-0.897
Token.
Token ID13
Feature activation+6.39e-02

Pos logit contributions
anwhile+2.096
PN+2.020
hy+1.978
DR+1.969
engineering+1.893
Neg logit contributions
ional-1.845
 Favor-1.793
inas-1.776
Jar-1.747
 deviation-1.640
Token<|endoftext|>
Token ID50256
Feature activation-4.96e-01

Pos logit contributions
anwhile+0.102
PN+0.099
DR+0.095
hy+0.094
engineering+0.092
Neg logit contributions
ional-0.100
inas-0.095
 Favor-0.094
Jar-0.092
 deviation-0.090
TokenEPA
Token ID40906
Feature activation+7.81e-01

Pos logit contributions
ional+0.793
 Favor+0.766
inas+0.756
Jar+0.729
 deviation+0.721
Neg logit contributions
anwhile-0.767
PN-0.758
DR-0.725
hy-0.716
engineering-0.695
Token\n
Token ID198
Feature activation-1.88e+00

Pos logit contributions
anwhile+1.336
PN+1.301
DR+1.265
hy+1.237
engineering+1.218
Neg logit contributions
ional-1.115
inas-1.062
 Favor-1.042
Jar-1.035
 deviation-0.988
Token\n
Token ID198
Feature activation-1.90e+00

Pos logit contributions
ional+2.576
 Favor+2.445
inas+2.435
Jar+2.391
 deviation+2.253
Neg logit contributions
anwhile-3.561
PN-3.210
 destro-3.188
osponsors-3.077
estern-3.061
TokenIt
Token ID1026
Feature activation-5.51e+00

Pos logit contributions
ional+2.964
 Favor+2.826
inas+2.825
Jar+2.765
 deviation+2.647
Neg logit contributions
anwhile-3.104
PN-2.928
DR-2.796
hy-2.785
estern-2.741
Token's
Token ID338
Feature activation-2.68e+00

Pos logit contributions
ional+6.509
inas+6.237
 Favor+6.088
Jar+5.817
tle+5.500
Neg logit contributions
anwhile-10.528
PN-10.258
DR-9.793
estern-9.623
ITNESS-9.535
Token the
Token ID262
Feature activation-9.81e-02

Pos logit contributions
ional+4.035
 Favor+3.862
inas+3.820
Jar+3.727
 deviation+3.611
Neg logit contributions
anwhile-4.471
PN-4.285
DR-4.101
hy-4.036
estern-3.966
Token kind
Token ID1611
Feature activation-4.70e-01

Pos logit contributions
ional+0.144
 Favor+0.136
inas+0.136
Jar+0.133
 deviation+0.126
Neg logit contributions
anwhile-0.167
PN-0.161
DR-0.155
hy-0.154
estern-0.150
Token of
Token ID286
Feature activation+2.93e-01

Pos logit contributions
ional+0.614
 Favor+0.584
inas+0.581
Jar+0.563
 deviation+0.539
Neg logit contributions
anwhile-0.869
PN-0.845
DR-0.813
hy-0.809
estern-0.789
Token beast
Token ID13824
Feature activation-1.90e+00

Pos logit contributions
anwhile+0.467
PN+0.453
hy+0.434
DR+0.433
engineering+0.419
Neg logit contributions
ional-0.454
inas-0.435
 Favor-0.435
Jar-0.424
icularly-0.398
Token Batman
Token ID9827
Feature activation-1.60e+00

Pos logit contributions
anwhile+4.997
PN+4.849
hy+4.721
DR+4.701
engineering+4.487
Neg logit contributions
ional-4.082
inas-4.017
 Favor-3.915
Jar-3.767
 deviation-3.680
Token has
Token ID468
Feature activation+1.09e-01

Pos logit contributions
ional+2.190
 Favor+2.084
inas+2.074
Jar+2.012
 deviation+1.938
Neg logit contributions
anwhile-2.901
PN-2.795
DR-2.686
hy-2.670
estern-2.613
Token always
Token ID1464
Feature activation-2.58e-01

Pos logit contributions
anwhile+0.170
PN+0.163
DR+0.156
hy+0.156
estern+0.152
Neg logit contributions
ional-0.174
inas-0.168
 Favor-0.168
Jar-0.163
 deviation-0.157
Token driven
Token ID7986
Feature activation-1.28e-01

Pos logit contributions
ional+0.436
 Favor+0.422
inas+0.422
Jar+0.412
 deviation+0.399
Neg logit contributions
anwhile-0.374
PN-0.361
DR-0.344
hy-0.343
estern-0.331
Token the
Token ID262
Feature activation+5.98e-01

Pos logit contributions
ional+0.184
 Favor+0.174
inas+0.174
Jar+0.168
 deviation+0.162
Neg logit contributions
anwhile-0.221
PN-0.214
DR-0.207
hy-0.207
estern-0.198
Token Bat
Token ID6577
Feature activation-5.48e+00

Pos logit contributions
anwhile+1.015
PN+0.977
DR+0.949
hy+0.942
estern+0.912
Neg logit contributions
ional-0.879
inas-0.823
 Favor-0.819
Jar-0.802
 deviation-0.765
Tokenmobile
Token ID24896
Feature activation+1.77e+00

Pos logit contributions
ional+7.056
inas+6.827
 Favor+6.684
Jar+6.453
face+6.200
Neg logit contributions
anwhile-9.881
PN-9.480
DR-8.802
estern-8.801
ITNESS-8.767
Token:
Token ID25
Feature activation+4.60e-01

Pos logit contributions
anwhile+2.754
PN+2.657
hy+2.593
DR+2.581
engineering+2.464
Neg logit contributions
ional-2.777
 Favor-2.649
inas-2.618
Jar-2.557
 deviation-2.470
Token an
Token ID281
Feature activation-3.52e-01

Pos logit contributions
anwhile+0.778
PN+0.743
DR+0.721
hy+0.720
engineering+0.692
Neg logit contributions
ional-0.683
inas-0.651
 Favor-0.644
Jar-0.626
 deviation-0.603
Token incredible
Token ID8082
Feature activation-6.79e-01

Pos logit contributions
ional+0.505
 Favor+0.481
inas+0.475
Jar+0.461
 deviation+0.446
Neg logit contributions
anwhile-0.612
PN-0.587
hy-0.574
DR-0.568
engineering-0.554
Token car
Token ID1097
Feature activation+2.69e-01

Pos logit contributions
ional+1.049
 Favor+1.005
inas+1.000
Jar+0.976
 deviation+0.928
Neg logit contributions
anwhile-1.096
PN-1.054
hy-1.015
DR-1.014
engineering-0.982
Token hack
Token ID8156
Feature activation+5.66e-01

Pos logit contributions
anwhile+3.160
PN+3.039
hy+2.942
DR+2.927
acqu+2.764
Neg logit contributions
ional-3.433
 Favor-3.380
inas-3.319
Jar-3.296
icularly-3.113
Token.
Token ID13
Feature activation-3.98e-01

Pos logit contributions
anwhile+0.977
PN+0.951
hy+0.920
DR+0.911
estern+0.875
Neg logit contributions
ional-0.780
 Favor-0.759
inas-0.759
Jar-0.736
 deviation-0.696
Token\n
Token ID198
Feature activation-6.99e-01

Pos logit contributions
ional+0.600
inas+0.572
 Favor+0.563
Jar+0.552
 deviation+0.537
Neg logit contributions
anwhile-0.655
PN-0.643
DR-0.617
hy-0.605
estern-0.592
Token\n
Token ID198
Feature activation-8.26e-01

Pos logit contributions
ional+1.022
 Favor+0.976
inas+0.971
Jar+0.947
 deviation+0.910
Neg logit contributions
anwhile-1.212
PN-1.146
DR-1.096
hy-1.089
estern-1.073
Token'
Token ID6
Feature activation-8.51e-01

Pos logit contributions
ional+1.142
 Favor+1.097
inas+1.084
Jar+1.035
 deviation+1.022
Neg logit contributions
anwhile-1.451
PN-1.444
DR-1.389
hy-1.368
engineering-1.331
TokenIt
Token ID1026
Feature activation-5.42e+00

Pos logit contributions
ional+1.324
 Favor+1.267
inas+1.261
Jar+1.225
 deviation+1.191
Neg logit contributions
anwhile-1.372
PN-1.326
DR-1.269
hy-1.257
estern-1.223
Token's
Token ID338
Feature activation-2.69e+00

Pos logit contributions
ional+6.224
inas+5.939
 Favor+5.827
Jar+5.547
 deviation+5.287
Neg logit contributions
anwhile-10.668
PN-10.295
DR-9.777
estern-9.716
osponsors-9.609
Token a
Token ID257
Feature activation-5.25e-01

Pos logit contributions
ional+3.995
 Favor+3.825
inas+3.773
Jar+3.675
 deviation+3.581
Neg logit contributions
anwhile-4.548
PN-4.330
DR-4.142
hy-4.062
estern-4.039
Token theory
Token ID4583
Feature activation-1.20e+00

Pos logit contributions
ional+0.784
 Favor+0.750
inas+0.748
Jar+0.729
 deviation+0.698
Neg logit contributions
anwhile-0.877
PN-0.845
DR-0.809
hy-0.807
estern-0.785
Token that
Token ID326
Feature activation-9.22e-01

Pos logit contributions
ional+1.637
 Favor+1.564
inas+1.557
Jar+1.510
 deviation+1.452
Neg logit contributions
anwhile-2.149
PN-2.073
DR-1.992
hy-1.984
estern-1.937
Token is
Token ID318
Feature activation-7.50e-01

Pos logit contributions
ional+1.248
 Favor+1.185
inas+1.184
Jar+1.150
 deviation+1.099
Neg logit contributions
anwhile-1.671
PN-1.615
DR-1.553
hy-1.547
estern-1.506
Token read
Token ID1100
Feature activation-1.42e+00

Pos logit contributions
ional+1.610
 Favor+1.539
inas+1.531
Jar+1.490
 deviation+1.440
Neg logit contributions
anwhile-1.804
PN-1.733
DR-1.659
hy-1.647
estern-1.612
Token:
Token ID25
Feature activation-3.75e-01

Pos logit contributions
ional+1.865
 Favor+1.769
inas+1.765
Jar+1.711
 deviation+1.643
Neg logit contributions
anwhile-2.639
PN-2.543
DR-2.449
hy-2.435
estern-2.381
Token\n
Token ID198
Feature activation-6.51e-01

Pos logit contributions
ional+0.520
 Favor+0.493
inas+0.492
Jar+0.478
 deviation+0.460
Neg logit contributions
anwhile-0.672
PN-0.647
DR-0.622
hy-0.618
estern-0.605
Token\n
Token ID198
Feature activation-9.07e-01

Pos logit contributions
ional+0.954
 Favor+0.911
inas+0.906
Jar+0.883
 deviation+0.849
Neg logit contributions
anwhile-1.128
PN-1.067
DR-1.019
hy-1.015
estern-0.999
Token"
Token ID1
Feature activation-1.92e+00

Pos logit contributions
ional+1.162
 Favor+1.105
inas+1.097
Jar+1.054
 deviation+1.025
Neg logit contributions
anwhile-1.704
PN-1.665
DR-1.608
hy-1.591
engineering-1.550
TokenBar
Token ID10374
Feature activation-5.35e+00

Pos logit contributions
ional+3.016
 Favor+2.898
inas+2.869
Jar+2.827
 deviation+2.698
Neg logit contributions
anwhile-3.090
PN-2.904
hy-2.762
DR-2.756
estern-2.735
Tokenack
Token ID441
Feature activation+8.23e-01

Pos logit contributions
ional+8.044
inas+7.631
 Favor+7.616
Jar+7.376
 deviation+7.054
Neg logit contributions
anwhile-8.776
PN-8.232
estern-7.814
hy-7.781
DR-7.778
Token Obama
Token ID2486
Feature activation+5.98e-01

Pos logit contributions
anwhile+1.543
PN+1.478
DR+1.429
hy+1.401
engineering+1.374
Neg logit contributions
ional-1.043
 Favor-0.996
inas-0.979
Jar-0.971
 deviation-0.940
Token would
Token ID561
Feature activation+7.21e-02

Pos logit contributions
anwhile+1.072
PN+1.036
DR+1.009
hy+1.002
estern+0.968
Neg logit contributions
ional-0.791
inas-0.763
 Favor-0.740
Jar-0.736
 deviation-0.707
Token be
Token ID307
Feature activation-1.61e+00

Pos logit contributions
anwhile+0.126
PN+0.121
DR+0.117
hy+0.116
estern+0.112
Neg logit contributions
ional-0.102
inas-0.097
 Favor-0.097
Jar-0.095
 deviation-0.091
Token the
Token ID262
Feature activation+4.66e-01

Pos logit contributions
ional+2.435
 Favor+2.329
inas+2.309
Jar+2.249
 deviation+2.177
Neg logit contributions
anwhile-2.702
PN-2.584
DR-2.471
hy-2.446
estern-2.402
Token his
Token ID465
Feature activation-1.57e+00

Pos logit contributions
anwhile+0.571
PN+0.554
DR+0.534
hy+0.528
estern+0.521
Neg logit contributions
ional-0.465
inas-0.442
 Favor-0.439
Jar-0.431
 deviation-0.410
Token remarks
Token ID10252
Feature activation-1.51e+00

Pos logit contributions
ional+2.165
 Favor+2.055
inas+2.050
Jar+1.993
 deviation+1.906
Neg logit contributions
anwhile-2.817
PN-2.714
DR-2.609
hy-2.590
estern-2.535
Token.
Token ID13
Feature activation-8.63e-01

Pos logit contributions
ional+2.106
 Favor+2.013
inas+2.003
Jar+1.944
 deviation+1.873
Neg logit contributions
anwhile-2.695
PN-2.596
DR-2.495
hy-2.477
estern-2.423
Token
Token ID564
Feature activation-5.64e-01

Pos logit contributions
ional+1.284
inas+1.220
 Favor+1.220
Jar+1.177
 deviation+1.147
Neg logit contributions
anwhile-1.453
PN-1.409
DR-1.354
hy-1.337
estern-1.304
Token
Token ID250
Feature activation-5.82e-01

Pos logit contributions
ional+0.883
inas+0.844
 Favor+0.837
Jar+0.814
 deviation+0.792
Neg logit contributions
anwhile-0.902
PN-0.882
DR-0.844
hy-0.832
engineering-0.809
TokenIt
Token ID1026
Feature activation-5.34e+00

Pos logit contributions
ional+0.886
 Favor+0.849
inas+0.845
Jar+0.819
 deviation+0.801
Neg logit contributions
anwhile-0.955
PN-0.930
DR-0.893
hy-0.878
estern-0.854
Token will
Token ID481
Feature activation-1.45e+00

Pos logit contributions
ional+6.147
inas+5.794
 Favor+5.669
Jar+5.400
 deviation+5.163
Neg logit contributions
anwhile-10.424
PN-10.160
DR-9.688
estern-9.512
ITNESS-9.469
Token reduce
Token ID4646
Feature activation-1.12e+00

Pos logit contributions
ional+2.056
 Favor+1.961
inas+1.950
Jar+1.896
 deviation+1.827
Neg logit contributions
anwhile-2.546
PN-2.448
DR-2.350
hy-2.335
estern-2.284
Token the
Token ID262
Feature activation+4.87e-01

Pos logit contributions
ional+1.653
 Favor+1.582
inas+1.581
Jar+1.539
 deviation+1.467
Neg logit contributions
anwhile-1.901
PN-1.828
DR-1.755
hy-1.743
estern-1.702
Token problem
Token ID1917
Feature activation+1.26e+00

Pos logit contributions
anwhile+0.851
PN+0.823
DR+0.791
hy+0.787
estern+0.764
Neg logit contributions
ional-0.678
inas-0.651
 Favor-0.646
Jar-0.636
icularly-0.591
Token of
Token ID286
Feature activation+1.06e-01

Pos logit contributions
anwhile+2.153
PN+2.087
hy+2.019
DR+1.999
engineering+1.938
Neg logit contributions
ional-1.727
 Favor-1.707
inas-1.679
Jar-1.643
 deviation-1.558
Token21
Token ID2481
Feature activation-6.29e-02

Pos logit contributions
PN+1.141
anwhile+1.108
DR+1.089
hy+1.063
engineering+1.007
Neg logit contributions
ional-1.847
 Favor-1.775
inas-1.772
Jar-1.720
 deviation-1.702
Token:
Token ID25
Feature activation+4.03e-01

Pos logit contributions
ional+0.103
inas+0.100
 Favor+0.098
Jar+0.096
 deviation+0.093
Neg logit contributions
anwhile-0.095
PN-0.094
DR-0.090
hy-0.087
estern-0.085
Token20
Token ID1238
Feature activation-6.81e-01

Pos logit contributions
PN+0.490
anwhile+0.490
DR+0.466
hy+0.456
engineering+0.434
Neg logit contributions
ional-0.769
inas-0.738
 Favor-0.736
Jar-0.718
 deviation-0.704
Token IST
Token ID45790
Feature activation+3.26e-02

Pos logit contributions
ional+1.098
inas+1.051
 Favor+1.047
Jar+1.020
 deviation+0.986
Neg logit contributions
anwhile-1.062
PN-1.028
DR-0.982
hy-0.967
estern-0.943
Token<|endoftext|>
Token ID50256
Feature activation-3.65e-01

Pos logit contributions
anwhile+0.043
PN+0.042
DR+0.040
hy+0.038
engineering+0.037
Neg logit contributions
ional-0.060
inas-0.058
 Favor-0.058
Jar-0.056
 deviation-0.055
TokenIt
Token ID1026
Feature activation-5.34e+00

Pos logit contributions
ional+0.582
 Favor+0.563
inas+0.555
Jar+0.534
 deviation+0.530
Neg logit contributions
anwhile-0.563
PN-0.558
DR-0.535
hy-0.527
engineering-0.511
Token's
Token ID338
Feature activation-2.43e+00

Pos logit contributions
ional+6.384
inas+6.098
 Favor+5.995
Jar+5.743
tle+5.404
Neg logit contributions
anwhile-10.169
PN-9.842
DR-9.402
estern-9.223
hy-9.151
Token a
Token ID257
Feature activation-5.64e-01

Pos logit contributions
ional+3.682
 Favor+3.526
inas+3.493
Jar+3.409
 deviation+3.299
Neg logit contributions
anwhile-4.031
PN-3.864
DR-3.694
hy-3.651
estern-3.576
Token question
Token ID1808
Feature activation-8.57e-01

Pos logit contributions
ional+0.837
 Favor+0.798
inas+0.798
Jar+0.775
 deviation+0.743
Neg logit contributions
anwhile-0.944
PN-0.912
DR-0.876
hy-0.871
estern-0.851
Token I
Token ID314
Feature activation+2.14e+00

Pos logit contributions
ional+1.221
 Favor+1.165
inas+1.163
Jar+1.126
 deviation+1.088
Neg logit contributions
anwhile-1.495
PN-1.440
DR-1.384
hy-1.373
estern-1.344
Token've
Token ID1053
Feature activation+2.58e-01

Pos logit contributions
anwhile+2.738
PN+2.589
DR+2.537
hy+2.474
estern+2.364
Neg logit contributions
ional-3.896
inas-3.722
 Favor-3.721
Jar-3.702
 deviation-3.573
Token own
Token ID898
Feature activation-1.22e+00

Pos logit contributions
anwhile+3.368
PN+3.274
DR+3.194
hy+3.130
estern+3.119
Neg logit contributions
ional-3.283
inas-3.218
 Favor-3.183
Jar-3.123
 deviation-3.037
Token some
Token ID617
Feature activation-7.56e-01

Pos logit contributions
ional+1.742
 Favor+1.663
inas+1.655
Jar+1.608
 deviation+1.550
Neg logit contributions
anwhile-2.116
PN-2.037
DR-1.953
hy-1.939
estern-1.898
Token bitcoins
Token ID22690
Feature activation-5.09e-01

Pos logit contributions
ional+0.987
 Favor+0.935
inas+0.932
Jar+0.902
 deviation+0.865
Neg logit contributions
anwhile-1.412
PN-1.366
DR-1.314
hy-1.307
estern-1.275
Token.
Token ID13
Feature activation-4.90e-01

Pos logit contributions
ional+0.731
inas+0.701
 Favor+0.700
Jar+0.676
 deviation+0.651
Neg logit contributions
anwhile-0.884
PN-0.846
DR-0.819
hy-0.812
engineering-0.791
Token<|endoftext|>
Token ID50256
Feature activation-7.74e-01

Pos logit contributions
ional+0.804
inas+0.772
 Favor+0.761
Jar+0.742
 deviation+0.726
Neg logit contributions
anwhile-0.746
PN-0.728
DR-0.698
hy-0.689
engineering-0.664
TokenTheir
Token ID14574
Feature activation-5.16e+00

Pos logit contributions
ional+1.241
 Favor+1.196
inas+1.184
Jar+1.146
 deviation+1.125
Neg logit contributions
anwhile-1.201
PN-1.175
DR-1.124
hy-1.111
engineering-1.078
Token solution
Token ID4610
Feature activation+3.52e-01

Pos logit contributions
ional+7.806
 Favor+7.592
inas+7.512
Jar+7.173
 deviation+7.139
Neg logit contributions
anwhile-8.083
PN-7.737
DR-7.354
estern-7.150
hy-7.146
Token involves
Token ID9018
Feature activation-1.20e+00

Pos logit contributions
anwhile+0.588
PN+0.568
hy+0.548
DR+0.548
engineering+0.530
Neg logit contributions
ional-0.517
 Favor-0.496
inas-0.495
Jar-0.483
 deviation-0.463
Token encaps
Token ID32652
Feature activation+9.39e-01

Pos logit contributions
ional+1.842
 Favor+1.763
inas+1.756
Jar+1.711
 deviation+1.650
Neg logit contributions
anwhile-1.951
PN-1.873
DR-1.793
hy-1.780
estern-1.737
Tokenulating
Token ID8306
Feature activation+1.67e-01

Pos logit contributions
anwhile+1.304
PN+1.278
DR+1.236
hy+1.218
engineering+1.194
Neg logit contributions
ional-1.600
 Favor-1.568
inas-1.530
 deviation-1.504
Jar-1.502
Token per
Token ID583
Feature activation-6.74e-02

Pos logit contributions
anwhile+0.283
PN+0.274
DR+0.265
hy+0.263
engineering+0.254
Neg logit contributions
ional-0.244
inas-0.232
 Favor-0.230
Jar-0.226
 deviation-0.215
Token their
Token ID511
Feature activation-2.43e+00

Pos logit contributions
ional+1.969
 Favor+1.882
inas+1.869
Jar+1.821
 deviation+1.746
Neg logit contributions
anwhile-2.348
PN-2.263
DR-2.177
hy-2.165
estern-2.103
Token authority
Token ID4934
Feature activation-6.30e-02

Pos logit contributions
ional+3.440
 Favor+3.279
inas+3.265
Jar+3.150
 deviation+3.062
Neg logit contributions
anwhile-4.300
PN-4.123
DR-3.946
hy-3.903
estern-3.861
Token.
Token ID13
Feature activation-6.10e-02

Pos logit contributions
ional+0.101
 Favor+0.099
inas+0.098
Jar+0.096
 deviation+0.092
Neg logit contributions
anwhile-0.096
PN-0.093
DR-0.089
hy-0.089
engineering-0.085
Token\n
Token ID198
Feature activation-3.66e-01

Pos logit contributions
ional+0.092
inas+0.087
 Favor+0.085
Jar+0.084
 deviation+0.082
Neg logit contributions
anwhile-0.100
PN-0.099
DR-0.095
hy-0.093
engineering-0.090
Token\n
Token ID198
Feature activation-5.15e-01

Pos logit contributions
ional+0.544
 Favor+0.519
inas+0.517
Jar+0.503
 deviation+0.485
Neg logit contributions
anwhile-0.622
PN-0.595
DR-0.569
hy-0.565
estern-0.554
TokenIt
Token ID1026
Feature activation-5.16e+00

Pos logit contributions
ional+0.768
 Favor+0.740
inas+0.729
Jar+0.698
 deviation+0.694
Neg logit contributions
PN-0.846
anwhile-0.843
DR-0.813
hy-0.797
engineering-0.773
Token's
Token ID338
Feature activation-2.31e+00

Pos logit contributions
ional+5.925
 Favor+5.495
inas+5.484
Jar+5.196
 deviation+5.005
Neg logit contributions
anwhile-10.092
PN-9.855
DR-9.436
estern-9.395
ITNESS-9.270
Token understandable
Token ID21977
Feature activation-3.11e-01

Pos logit contributions
ional+3.437
 Favor+3.282
inas+3.251
Jar+3.165
 deviation+3.074
Neg logit contributions
anwhile-3.892
PN-3.738
DR-3.579
hy-3.534
estern-3.477
Token parents
Token ID3397
Feature activation+1.63e-01

Pos logit contributions
ional+0.392
 Favor+0.376
inas+0.375
Jar+0.361
 deviation+0.343
Neg logit contributions
anwhile-0.589
PN-0.573
DR-0.554
hy-0.550
estern-0.534
Token would
Token ID561
Feature activation+1.10e-01

Pos logit contributions
anwhile+0.278
PN+0.268
DR+0.259
hy+0.258
estern+0.249
Neg logit contributions
ional-0.234
inas-0.226
 Favor-0.225
Jar-0.219
 deviation-0.208
Token want
Token ID765
Feature activation-1.58e+00

Pos logit contributions
anwhile+0.186
PN+0.178
DR+0.173
hy+0.172
estern+0.166
Neg logit contributions
ional-0.158
inas-0.152
 Favor-0.151
Jar-0.149
 deviation-0.141
Token statement
Token ID2643
Feature activation-2.23e-02

Pos logit contributions
anwhile+1.542
PN+1.486
DR+1.434
hy+1.406
engineering+1.364
Neg logit contributions
ional-1.492
 Favor-1.442
inas-1.438
Jar-1.402
 deviation-1.327
Token added
Token ID2087
Feature activation+2.33e-01

Pos logit contributions
ional+0.038
 Favor+0.036
inas+0.036
Jar+0.035
 deviation+0.034
Neg logit contributions
anwhile-0.033
PN-0.031
DR-0.030
hy-0.030
engineering-0.029
Token.
Token ID13
Feature activation-2.77e-01

Pos logit contributions
anwhile+0.402
PN+0.387
DR+0.372
hy+0.369
estern+0.358
Neg logit contributions
ional-0.336
 Favor-0.322
inas-0.320
Jar-0.312
 deviation-0.301
Token\n
Token ID198
Feature activation-3.77e-02

Pos logit contributions
ional+0.432
inas+0.413
 Favor+0.412
Jar+0.400
 deviation+0.389
Neg logit contributions
anwhile-0.449
PN-0.431
DR-0.413
hy-0.408
estern-0.398
Token\n
Token ID198
Feature activation-3.14e-01

Pos logit contributions
ional+0.057
 Favor+0.054
inas+0.054
Jar+0.052
 deviation+0.051
Neg logit contributions
anwhile-0.063
PN-0.061
DR-0.058
hy-0.058
estern-0.056
TokenIt
Token ID1026
Feature activation-5.13e+00

Pos logit contributions
ional+0.470
 Favor+0.455
inas+0.449
Jar+0.431
 deviation+0.426
Neg logit contributions
anwhile-0.516
PN-0.510
DR-0.489
hy-0.481
engineering-0.468
Token is
Token ID318
Feature activation-1.28e+00

Pos logit contributions
ional+7.007
inas+6.671
 Favor+6.522
Jar+6.349
tle+6.095
Neg logit contributions
anwhile-8.739
PN-8.521
DR-8.086
estern-7.994
ITNESS-7.932
Token not
Token ID407
Feature activation-1.35e+00

Pos logit contributions
ional+1.997
 Favor+1.912
inas+1.902
Jar+1.853
 deviation+1.794
Neg logit contributions
anwhile-2.073
PN-1.988
DR-1.900
hy-1.885
estern-1.843
Token clear
Token ID1598
Feature activation-7.24e-01

Pos logit contributions
ional+1.821
 Favor+1.746
inas+1.736
Jar+1.689
 deviation+1.617
Neg logit contributions
anwhile-2.449
PN-2.361
DR-2.271
hy-2.262
estern-2.200
Token yet
Token ID1865
Feature activation+5.68e-01

Pos logit contributions
ional+0.925
 Favor+0.889
inas+0.882
Jar+0.855
 deviation+0.822
Neg logit contributions
anwhile-1.365
PN-1.312
DR-1.269
hy-1.267
engineering-1.226
Token how
Token ID703
Feature activation+2.21e+00

Pos logit contributions
anwhile+1.045
PN+1.008
hy+0.979
DR+0.976
engineering+0.943
Neg logit contributions
ional-0.728
 Favor-0.711
inas-0.707
Jar-0.688
 deviation-0.660
Token requirement
Token ID9079
Feature activation-7.83e-01

Pos logit contributions
ional+0.611
 Favor+0.588
inas+0.583
Jar+0.568
 deviation+0.539
Neg logit contributions
anwhile-0.688
PN-0.669
DR-0.643
hy-0.639
engineering-0.619
Token.
Token ID13
Feature activation-2.61e-01

Pos logit contributions
ional+1.125
 Favor+1.079
inas+1.071
Jar+1.041
 deviation+1.001
Neg logit contributions
anwhile-1.354
PN-1.306
DR-1.254
hy-1.247
engineering-1.211
Token
Token ID198
Feature activation-1.47e-01

Pos logit contributions
ional+0.379
inas+0.359
 Favor+0.355
Jar+0.347
 deviation+0.339
Neg logit contributions
anwhile-0.444
PN-0.435
DR-0.419
hy-0.411
engineering-0.400
Token
Token ID198
Feature activation-3.59e-01

Pos logit contributions
ional+0.222
 Favor+0.212
inas+0.211
Jar+0.205
 deviation+0.199
Neg logit contributions
anwhile-0.243
PN-0.236
DR-0.226
hy-0.224
estern-0.219
Token"
Token ID1
Feature activation-1.62e+00

Pos logit contributions
ional+0.499
 Favor+0.479
inas+0.472
Jar+0.451
 deviation+0.449
Neg logit contributions
anwhile-0.623
PN-0.622
DR-0.599
hy-0.590
engineering-0.575
TokenIt
Token ID1026
Feature activation-5.09e+00

Pos logit contributions
ional+2.556
 Favor+2.446
inas+2.437
Jar+2.395
 deviation+2.290
Neg logit contributions
anwhile-2.600
PN-2.444
DR-2.326
hy-2.320
estern-2.296
Token just
Token ID655
Feature activation-1.25e+00

Pos logit contributions
ional+5.550
inas+5.134
 Favor+5.085
Jar+4.845
 deviation+4.730
Neg logit contributions
anwhile-10.348
PN-10.125
DR-9.602
estern-9.586
ITNESS-9.468
Token says
Token ID1139
Feature activation-1.97e+00

Pos logit contributions
ional+1.654
 Favor+1.583
inas+1.576
Jar+1.531
 deviation+1.460
Neg logit contributions
anwhile-2.296
PN-2.213
hy-2.149
DR-2.136
engineering-2.074
Token you
Token ID345
Feature activation+3.78e-01

Pos logit contributions
ional+2.765
 Favor+2.632
inas+2.618
Jar+2.539
 deviation+2.455
Neg logit contributions
anwhile-3.495
PN-3.364
DR-3.221
hy-3.197
estern-3.152
Token can
Token ID460
Feature activation-1.13e-01

Pos logit contributions
anwhile+0.565
PN+0.540
DR+0.521
hy+0.519
estern+0.497
Neg logit contributions
ional-0.627
 Favor-0.603
inas-0.600
Jar-0.590
 deviation-0.564
Token't
Token ID470
Feature activation-1.00e+00

Pos logit contributions
ional+0.161
 Favor+0.154
inas+0.153
Jar+0.150
 deviation+0.142
Neg logit contributions
anwhile-0.197
PN-0.188
DR-0.183
hy-0.182
engineering-0.175
Token both
Token ID1111
Feature activation+1.04e+00

Pos logit contributions
anwhile+1.670
PN+1.626
DR+1.581
hy+1.548
engineering+1.508
Neg logit contributions
ional-1.438
 Favor-1.387
inas-1.387
Jar-1.342
 deviation-1.282
Token national
Token ID2260
Feature activation+3.31e-01

Pos logit contributions
anwhile+1.711
PN+1.680
DR+1.622
hy+1.591
engineering+1.548
Neg logit contributions
ional-1.510
inas-1.454
 Favor-1.445
Jar-1.419
 deviation-1.356
Token governments
Token ID6905
Feature activation+5.35e-01

Pos logit contributions
anwhile+0.509
PN+0.498
DR+0.483
hy+0.476
engineering+0.459
Neg logit contributions
ional-0.521
 Favor-0.507
inas-0.495
Jar-0.494
 deviation-0.472
Token and
Token ID290
Feature activation-4.86e-01

Pos logit contributions
anwhile+0.888
PN+0.869
DR+0.844
hy+0.834
engineering+0.809
Neg logit contributions
ional-0.771
 Favor-0.752
inas-0.750
Jar-0.726
 deviation-0.701
Token international
Token ID3230
Feature activation+2.04e+00

Pos logit contributions
ional+0.679
 Favor+0.648
inas+0.645
Jar+0.627
 deviation+0.602
Neg logit contributions
anwhile-0.863
PN-0.832
DR-0.799
hy-0.793
estern-0.774
Token co
Token ID763
Feature activation+3.76e+00

Pos logit contributions
PN+2.737
anwhile+2.733
DR+2.633
hy+2.559
engineering+2.497
Neg logit contributions
ional-3.500
 Favor-3.409
inas-3.390
Jar-3.368
 deviation-3.183
Token-
Token ID12
Feature activation+1.02e+00

Pos logit contributions
PN+5.338
anwhile+5.141
DR+5.128
hy+5.087
engineering+5.042
Neg logit contributions
ional-5.895
 Favor-5.816
 deviation-5.670
inas-5.623
Jar-5.614
Tokenoperation
Token ID27184
Feature activation+2.89e+00

Pos logit contributions
anwhile+1.008
PN+0.992
DR+0.939
hy+0.910
engineering+0.901
Neg logit contributions
ional-2.148
 Favor-2.115
inas-2.067
Jar-2.050
 deviation-2.015
Token.
Token ID13
Feature activation+2.68e-01

Pos logit contributions
PN+4.077
anwhile+3.998
DR+3.874
hy+3.858
engineering+3.694
Neg logit contributions
 Favor-4.687
ional-4.624
inas-4.567
Jar-4.498
 deviation-4.325
Token
Token ID198
Feature activation+2.46e-01

Pos logit contributions
anwhile+0.457
PN+0.451
DR+0.437
hy+0.428
engineering+0.415
Neg logit contributions
ional-0.379
inas-0.365
 Favor-0.357
Jar-0.347
 deviation-0.341
Token
Token ID198
Feature activation-3.29e-02

Pos logit contributions
anwhile+0.398
PN+0.390
DR+0.374
hy+0.371
engineering+0.360
Neg logit contributions
ional-0.375
 Favor-0.359
inas-0.358
Jar-0.347
 deviation-0.337
Token from
Token ID422
Feature activation+5.24e-01

Pos logit contributions
anwhile+0.066
PN+0.064
hy+0.061
DR+0.061
engineering+0.059
Neg logit contributions
ional-0.059
 Favor-0.057
inas-0.057
Jar-0.056
 deviation-0.053
Token an
Token ID281
Feature activation-4.30e-01

Pos logit contributions
anwhile+0.890
PN+0.865
DR+0.830
hy+0.820
estern+0.797
Neg logit contributions
ional-0.762
inas-0.721
 Favor-0.709
Jar-0.707
 deviation-0.673
Token array
Token ID7177
Feature activation+2.40e-01

Pos logit contributions
ional+0.629
inas+0.596
 Favor+0.595
Jar+0.580
 deviation+0.563
Neg logit contributions
anwhile-0.731
PN-0.708
DR-0.677
hy-0.676
engineering-0.656
Token.
Token ID13
Feature activation+6.26e-01

Pos logit contributions
anwhile+0.413
PN+0.402
DR+0.386
hy+0.383
engineering+0.373
Neg logit contributions
ional-0.342
inas-0.326
 Favor-0.326
Jar-0.318
 deviation-0.304
Token The
Token ID383
Feature activation+4.38e-01

Pos logit contributions
anwhile+1.051
PN+1.028
DR+0.989
hy+0.973
engineering+0.949
Neg logit contributions
ional-0.917
inas-0.870
 Favor-0.855
Jar-0.847
 deviation-0.810
Token ability
Token ID2694
Feature activation+3.03e+00

Pos logit contributions
anwhile+0.749
PN+0.723
DR+0.694
hy+0.687
engineering+0.668
Neg logit contributions
ional-0.636
inas-0.604
 Favor-0.599
Jar-0.592
 deviation-0.558
Token to
Token ID284
Feature activation+8.66e-01

Pos logit contributions
anwhile+3.984
PN+3.822
DR+3.583
hy+3.538
estern+3.426
Neg logit contributions
ional-5.401
inas-5.321
 Favor-5.298
Jar-5.152
 deviation-4.943
Token undo
Token ID23981
Feature activation+3.81e-01

Pos logit contributions
anwhile+1.397
PN+1.339
DR+1.288
hy+1.274
engineering+1.224
Neg logit contributions
ional-1.331
inas-1.281
 Favor-1.268
Jar-1.260
 deviation-1.188
Token a
Token ID257
Feature activation+1.59e-01

Pos logit contributions
anwhile+0.672
PN+0.658
DR+0.635
hy+0.628
estern+0.611
Neg logit contributions
ional-0.522
inas-0.501
 Favor-0.499
Jar-0.492
 deviation-0.457
Token move
Token ID1445
Feature activation+1.15e+00

Pos logit contributions
anwhile+0.285
PN+0.279
DR+0.267
hy+0.264
estern+0.258
Neg logit contributions
ional-0.216
inas-0.208
 Favor-0.204
Jar-0.202
 deviation-0.187
Token and
Token ID290
Feature activation-3.30e-01

Pos logit contributions
anwhile+1.990
PN+1.906
hy+1.852
DR+1.847
engineering+1.788
Neg logit contributions
ional-1.592
 Favor-1.555
inas-1.553
Jar-1.521
 deviation-1.421
Tokenort
Token ID419
Feature activation+9.71e-01

Pos logit contributions
anwhile+0.178
PN+0.169
hy+0.165
DR+0.163
engineering+0.159
Neg logit contributions
ional-0.174
 Favor-0.172
Jar-0.165
inas-0.164
 deviation-0.161
Token>
Token ID29
Feature activation-1.02e-02

Pos logit contributions
anwhile+1.407
PN+1.377
hy+1.344
DR+1.331
estern+1.272
Neg logit contributions
ional-1.596
 Favor-1.589
Jar-1.516
inas-1.507
 deviation-1.489
Token haha
Token ID42254
Feature activation+1.66e+00

Pos logit contributions
ional+0.016
 Favor+0.015
inas+0.015
Jar+0.015
 deviation+0.014
Neg logit contributions
anwhile-0.016
PN-0.016
DR-0.015
hy-0.015
estern-0.014
Token j
Token ID474
Feature activation+2.62e+00

Pos logit contributions
anwhile+2.778
PN+2.676
hy+2.591
DR+2.584
estern+2.465
Neg logit contributions
ional-2.408
 Favor-2.348
inas-2.317
Jar-2.252
 deviation-2.159
Tokenason
Token ID888
Feature activation+1.60e-01

Pos logit contributions
PN+3.983
anwhile+3.928
hy+3.800
DR+3.766
pn+3.599
Neg logit contributions
ional-4.077
 Favor-4.034
inas-3.822
Jar-3.802
 deviation-3.761
Token [
Token ID685
Feature activation+2.76e+00

Pos logit contributions
anwhile+0.282
PN+0.269
DR+0.262
hy+0.260
estern+0.255
Neg logit contributions
ional-0.226
 Favor-0.217
inas-0.215
Jar-0.206
 deviation-0.199
Token10
Token ID940
Feature activation+1.86e+00

Pos logit contributions
PN+2.911
DR+2.738
hy+2.689
engineering+2.533
anwhile+2.508
Neg logit contributions
ional-5.615
 Favor-5.520
inas-5.477
Jar-5.317
icularly-5.288
Token:
Token ID25
Feature activation+4.23e-01

Pos logit contributions
PN+3.051
DR+2.963
hy+2.932
anwhile+2.867
engineering+2.812
Neg logit contributions
ional-2.668
 Favor-2.573
inas-2.542
Jar-2.471
 deviation-2.413
Token47
Token ID2857
Feature activation-7.31e-01

Pos logit contributions
anwhile+0.572
PN+0.546
DR+0.521
hy+0.512
estern+0.498
Neg logit contributions
ional-0.767
 Favor-0.739
inas-0.734
Jar-0.717
 deviation-0.702
Token]
Token ID60
Feature activation+1.02e+00

Pos logit contributions
ional+0.953
 Favor+0.903
inas+0.901
Jar+0.872
 deviation+0.835
Neg logit contributions
anwhile-1.384
PN-1.315
DR-1.264
hy-1.256
estern-1.238
Token <
Token ID1279
Feature activation+1.72e+00

Pos logit contributions
anwhile+1.580
PN+1.513
DR+1.441
hy+1.435
estern+1.393
Neg logit contributions
ional-1.671
 Favor-1.593
inas-1.588
Jar-1.553
 deviation-1.503
Token page
Token ID2443
Feature activation+6.90e-02

Pos logit contributions
ional+0.431
 Favor+0.416
inas+0.411
Jar+0.403
 deviation+0.388
Neg logit contributions
anwhile-0.438
PN-0.426
DR-0.410
hy-0.408
engineering-0.396
Token articles
Token ID6685
Feature activation-1.65e+00

Pos logit contributions
anwhile+0.106
PN+0.102
DR+0.098
hy+0.098
engineering+0.094
Neg logit contributions
ional-0.112
 Favor-0.107
inas-0.106
Jar-0.104
 deviation-0.099
Token,
Token ID11
Feature activation-7.69e-01

Pos logit contributions
ional+2.234
 Favor+2.124
inas+2.111
Jar+2.048
 deviation+1.972
Neg logit contributions
anwhile-3.003
PN-2.892
DR-2.777
hy-2.757
estern-2.707
Token but
Token ID475
Feature activation+3.25e-01

Pos logit contributions
ional+1.001
inas+0.949
 Favor+0.947
Jar+0.918
 deviation+0.872
Neg logit contributions
anwhile-1.442
PN-1.394
DR-1.341
hy-1.335
estern-1.299
Token health
Token ID1535
Feature activation+2.91e+00

Pos logit contributions
anwhile+0.536
PN+0.514
DR+0.496
hy+0.493
engineering+0.475
Neg logit contributions
ional-0.488
inas-0.467
 Favor-0.463
Jar-0.452
 deviation-0.435
Token care
Token ID1337
Feature activation+3.75e+00

Pos logit contributions
anwhile+4.159
PN+4.082
hy+4.027
DR+3.925
engineering+3.771
Neg logit contributions
ional-4.682
 Favor-4.551
inas-4.488
Jar-4.385
 deviation-4.158
Token always
Token ID1464
Feature activation-6.76e-01

Pos logit contributions
anwhile+5.591
PN+5.462
hy+5.327
DR+5.258
engineering+5.019
Neg logit contributions
ional-5.580
inas-5.541
 Favor-5.325
Jar-5.294
icularly-4.967
Token seems
Token ID2331
Feature activation-5.53e-01

Pos logit contributions
ional+0.859
 Favor+0.819
inas+0.817
Jar+0.792
 deviation+0.755
Neg logit contributions
anwhile-1.275
PN-1.233
hy-1.191
DR-1.191
engineering-1.154
Token to
Token ID284
Feature activation-3.96e-01

Pos logit contributions
ional+0.971
inas+0.941
 Favor+0.938
Jar+0.914
 deviation+0.887
Neg logit contributions
anwhile-0.777
PN-0.744
DR-0.706
hy-0.704
estern-0.676
Token be
Token ID307
Feature activation-9.35e-01

Pos logit contributions
ional+0.539
inas+0.516
 Favor+0.513
Jar+0.503
 deviation+0.477
Neg logit contributions
anwhile-0.707
PN-0.681
DR-0.660
hy-0.659
engineering-0.633
Token on
Token ID319
Feature activation-3.69e-01

Pos logit contributions
ional+1.362
inas+1.305
 Favor+1.303
Jar+1.265
 deviation+1.216
Neg logit contributions
anwhile-1.595
PN-1.536
DR-1.473
hy-1.472
estern-1.424
Token:"
Token ID11097
Feature activation+2.28e-01

Pos logit contributions
anwhile+0.298
PN+0.289
DR+0.275
hy+0.274
estern+0.268
Neg logit contributions
ional-0.266
 Favor-0.257
inas-0.254
Jar-0.248
 deviation-0.240
Token or
Token ID393
Feature activation+2.13e-01

Pos logit contributions
anwhile+0.432
PN+0.424
DR+0.409
hy+0.407
estern+0.396
Neg logit contributions
ional-0.286
inas-0.269
 Favor-0.269
Jar-0.261
 deviation-0.247
Token "
Token ID366
Feature activation-2.79e-01

Pos logit contributions
anwhile+0.385
PN+0.378
DR+0.364
hy+0.362
engineering+0.352
Neg logit contributions
ional-0.282
 Favor-0.267
inas-0.267
Jar-0.259
 deviation-0.248
Tokenex
Token ID1069
Feature activation+3.16e-01

Pos logit contributions
ional+0.656
 Favor+0.637
inas+0.635
Jar+0.625
 deviation+0.611
Neg logit contributions
anwhile-0.230
PN-0.211
DR-0.191
hy-0.188
estern-0.179
Token:"
Token ID11097
Feature activation+2.08e-01

Pos logit contributions
anwhile+0.528
PN+0.515
DR+0.492
hy+0.491
estern+0.479
Neg logit contributions
ional-0.463
 Favor-0.448
inas-0.443
Jar-0.433
 deviation-0.418
Token [
Token ID685
Feature activation+2.71e+00

Pos logit contributions
anwhile+0.385
PN+0.376
DR+0.362
hy+0.361
estern+0.347
Neg logit contributions
ional-0.272
inas-0.256
 Favor-0.253
Jar-0.247
 deviation-0.236
Tokenwhite
Token ID11186
Feature activation-1.59e-01

Pos logit contributions
anwhile+4.224
PN+4.026
DR+3.882
hy+3.862
engineering+3.809
Neg logit contributions
ional-4.211
 Favor-4.059
inas-3.998
Jar-3.895
 deviation-3.828
Token]
Token ID60
Feature activation+4.33e-01

Pos logit contributions
ional+0.218
 Favor+0.207
inas+0.207
Jar+0.201
 deviation+0.192
Neg logit contributions
anwhile-0.286
PN-0.276
DR-0.264
hy-0.264
estern-0.257
Token optional
Token ID11902
Feature activation-2.86e-01

Pos logit contributions
anwhile+0.661
PN+0.642
DR+0.612
hy+0.611
engineering+0.586
Neg logit contributions
ional-0.702
inas-0.671
 Favor-0.670
Jar-0.657
 deviation-0.628
Token white
Token ID2330
Feature activation+5.08e-02

Pos logit contributions
ional+0.384
inas+0.363
 Favor+0.361
Jar+0.353
 deviation+0.333
Neg logit contributions
anwhile-0.526
PN-0.505
hy-0.485
DR-0.485
estern-0.471
Token space
Token ID2272
Feature activation+1.29e+00

Pos logit contributions
anwhile+0.073
PN+0.070
hy+0.067
DR+0.066
engineering+0.064
Neg logit contributions
ional-0.087
inas-0.084
 Favor-0.084
Jar-0.083
 deviation-0.078
Token'
Token ID6
Feature activation+3.91e-01

Pos logit contributions
anwhile+0.391
PN+0.375
DR+0.361
hy+0.359
estern+0.351
Neg logit contributions
ional-0.357
 Favor-0.346
inas-0.339
Jar-0.334
 deviation-0.324
Tokenils
Token ID4487
Feature activation+1.60e-01

Pos logit contributions
anwhile+0.591
PN+0.565
hy+0.541
DR+0.541
estern+0.526
Neg logit contributions
ional-0.646
 Favor-0.626
inas-0.614
Jar-0.603
 deviation-0.589
Token ont
Token ID39585
Feature activation-1.47e+00

Pos logit contributions
anwhile+0.262
PN+0.251
DR+0.240
hy+0.238
estern+0.232
Neg logit contributions
ional-0.244
inas-0.235
 Favor-0.234
Jar-0.229
 deviation-0.219
Token mont
Token ID40689
Feature activation-1.45e-01

Pos logit contributions
ional+2.318
 Favor+2.223
inas+2.210
Jar+2.154
 deviation+2.089
Neg logit contributions
anwhile-2.331
PN-2.234
DR-2.132
hy-2.117
estern-2.071
Tokené
Token ID2634
Feature activation+2.15e+00

Pos logit contributions
ional+0.218
 Favor+0.212
inas+0.209
Jar+0.207
 deviation+0.200
Neg logit contributions
anwhile-0.240
PN-0.229
DR-0.220
hy-0.219
estern-0.213
Token une
Token ID17809
Feature activation+1.97e+00

Pos logit contributions
anwhile+3.449
PN+3.261
DR+3.153
hy+3.123
gypt+3.008
Neg logit contributions
ional-3.227
 Favor-3.100
inas-3.094
Jar-3.072
 deviation-2.912
Token n
Token ID299
Feature activation+5.21e-01

Pos logit contributions
anwhile+3.157
PN+3.018
DR+2.903
hy+2.874
engineering+2.782
Neg logit contributions
ional-3.007
inas-2.874
 Favor-2.836
Jar-2.805
icularly-2.634
Tokenou
Token ID280
Feature activation-5.17e-01

Pos logit contributions
anwhile+0.931
PN+0.913
DR+0.878
hy+0.872
engineering+0.848
Neg logit contributions
ional-0.701
 Favor-0.688
inas-0.663
Jar-0.656
 deviation-0.631
Tokenvel
Token ID626
Feature activation-7.66e-01

Pos logit contributions
ional+0.935
 Favor+0.913
inas+0.905
Jar+0.888
 deviation+0.865
Neg logit contributions
anwhile-0.693
PN-0.655
hy-0.629
DR-0.625
engineering-0.597
Tokenle
Token ID293
Feature activation+4.71e-01

Pos logit contributions
ional+1.497
 Favor+1.451
inas+1.443
Jar+1.424
 deviation+1.386
Neg logit contributions
anwhile-0.929
PN-0.876
DR-0.826
hy-0.819
estern-0.788
Token organisation
Token ID12684
Feature activation+9.86e-01

Pos logit contributions
anwhile+0.767
PN+0.737
DR+0.706
hy+0.699
engineering+0.680
Neg logit contributions
ional-0.722
inas-0.693
 Favor-0.689
Jar-0.674
 deviation-0.642
Token-
Token ID12
Feature activation+7.76e-01

Pos logit contributions
ional+0.126
 Favor+0.120
inas+0.120
Jar+0.117
 deviation+0.113
Neg logit contributions
anwhile-0.146
PN-0.141
DR-0.135
hy-0.134
estern-0.131
Token2002
Token ID16942
Feature activation+1.45e+00

Pos logit contributions
anwhile+1.236
PN+1.230
DR+1.168
hy+1.150
engineering+1.109
Neg logit contributions
ional-1.200
inas-1.143
 Favor-1.125
Jar-1.103
 deviation-1.079
Token-
Token ID12
Feature activation+3.35e-01

Pos logit contributions
anwhile+2.376
PN+2.326
DR+2.205
hy+2.178
estern+2.116
Neg logit contributions
ional-2.179
inas-2.101
 Favor-2.077
Jar-2.007
 deviation-1.972
Token 3
Token ID513
Feature activation+3.96e-01

Pos logit contributions
anwhile+0.576
PN+0.566
DR+0.545
hy+0.536
engineering+0.519
Neg logit contributions
ional-0.480
inas-0.455
 Favor-0.449
Jar-0.436
 deviation-0.426
Token Daniel
Token ID7806
Feature activation-6.01e-01

Pos logit contributions
anwhile+0.737
PN+0.721
DR+0.695
hy+0.686
engineering+0.666
Neg logit contributions
ional-0.514
inas-0.484
 Favor-0.478
Jar-0.462
 deviation-0.447
Token Craig
Token ID13854
Feature activation+2.00e-01

Pos logit contributions
ional+0.736
inas+0.688
 Favor+0.683
Jar+0.671
 deviation+0.655
Neg logit contributions
anwhile-1.170
PN-1.123
DR-1.092
hy-1.080
estern-1.055
Token films
Token ID7328
Feature activation-9.64e-02

Pos logit contributions
anwhile+0.369
PN+0.357
DR+0.343
hy+0.341
estern+0.334
Neg logit contributions
ional-0.264
 Favor-0.250
inas-0.246
Jar-0.245
 deviation-0.231
Token,
Token ID11
Feature activation+3.84e-01

Pos logit contributions
ional+0.123
 Favor+0.116
inas+0.116
Jar+0.112
 deviation+0.107
Neg logit contributions
anwhile-0.183
PN-0.177
DR-0.170
hy-0.169
estern-0.166
Token 2006
Token ID4793
Feature activation+2.53e+00

Pos logit contributions
anwhile+0.738
PN+0.724
DR+0.696
hy+0.692
engineering+0.672
Neg logit contributions
ional-0.471
inas-0.442
 Favor-0.436
Jar-0.425
 deviation-0.406
Token-
Token ID12
Feature activation+7.70e-01

Pos logit contributions
anwhile+3.984
PN+3.886
hy+3.692
DR+3.689
estern+3.577
Neg logit contributions
ional-3.834
 Favor-3.674
inas-3.673
Jar-3.562
 deviation-3.476
Token2012
Token ID6999
Feature activation+7.88e-01

Pos logit contributions
anwhile+1.171
PN+1.148
DR+1.098
hy+1.081
engineering+1.043
Neg logit contributions
ional-1.257
inas-1.194
 Favor-1.187
Jar-1.154
 deviation-1.139
Tokenm
Token ID76
Feature activation+1.28e-01

Pos logit contributions
ional+0.037
 Favor+0.036
inas+0.036
Jar+0.035
 deviation+0.034
Neg logit contributions
anwhile-0.045
PN-0.043
DR-0.042
hy-0.041
estern-0.040
Token a
Token ID257
Feature activation+7.40e-01

Pos logit contributions
anwhile+0.207
PN+0.199
hy+0.191
DR+0.190
estern+0.184
Neg logit contributions
ional-0.198
inas-0.190
 Favor-0.189
Jar-0.184
 deviation-0.177
Token social
Token ID1919
Feature activation+1.04e+00

Pos logit contributions
anwhile+1.263
PN+1.230
hy+1.183
DR+1.181
estern+1.148
Neg logit contributions
ional-1.054
inas-1.005
 Favor-0.996
Jar-0.976
 deviation-0.925
Token worker
Token ID8383
Feature activation+1.81e+00

Pos logit contributions
anwhile+1.629
PN+1.605
DR+1.540
hy+1.537
engineering+1.511
Neg logit contributions
ional-1.610
 Favor-1.556
inas-1.536
Jar-1.518
tle-1.414
Token!
Token ID0
Feature activation+1.09e-01

Pos logit contributions
anwhile+3.010
PN+2.947
DR+2.838
hy+2.829
estern+2.722
Neg logit contributions
ional-2.562
inas-2.504
 Favor-2.433
Jar-2.378
 deviation-2.277
Token
Token ID447
Feature activation-1.12e-01

Pos logit contributions
anwhile+0.182
PN+0.176
DR+0.169
hy+0.168
estern+0.163
Neg logit contributions
ional-0.162
inas-0.155
 Favor-0.153
Jar-0.149
 deviation-0.144
Token
Token ID251
Feature activation-9.93e-01

Pos logit contributions
ional+0.164
inas+0.157
 Favor+0.156
Jar+0.151
 deviation+0.146
Neg logit contributions
anwhile-0.189
PN-0.185
DR-0.178
hy-0.176
engineering-0.171
Token she
Token ID673
Feature activation-1.17e+00

Pos logit contributions
ional+1.473
 Favor+1.406
inas+1.403
Jar+1.362
 deviation+1.317
Neg logit contributions
anwhile-1.678
PN-1.615
DR-1.549
hy-1.538
estern-1.502
Token yelled
Token ID22760
Feature activation-1.39e+00

Pos logit contributions
ional+1.893
 Favor+1.818
inas+1.809
Jar+1.764
 deviation+1.710
Neg logit contributions
anwhile-1.831
PN-1.752
DR-1.673
hy-1.663
estern-1.620
Token rapid
Token ID5801
Feature activation-1.19e+00

Pos logit contributions
ional+1.925
 Favor+1.833
inas+1.827
Jar+1.772
 deviation+1.704
Neg logit contributions
anwhile-2.480
PN-2.387
DR-2.294
hy-2.280
estern-2.229
Token-
Token ID12
Feature activation+4.21e-03

Pos logit contributions
ional+1.685
 Favor+1.614
inas+1.599
Jar+1.562
 deviation+1.489
Neg logit contributions
anwhile-2.088
PN-2.001
hy-1.926
DR-1.925
estern-1.875
Token impe
Token ID18516
Feature activation+2.24e-01

Pos logit contributions
ional+1.652
 Favor+1.589
inas+1.588
Jar+1.544
 deviation+1.494
Neg logit contributions
anwhile-1.649
PN-1.585
DR-1.512
hy-1.507
estern-1.464
Tokenached
Token ID2317
Feature activation-4.38e-01

Pos logit contributions
anwhile+0.361
PN+0.352
hy+0.338
DR+0.337
engineering+0.327
Neg logit contributions
ional-0.340
 Favor-0.328
inas-0.325
Jar-0.316
 deviation-0.310
Token within
Token ID1626
Feature activation+1.79e-01

Pos logit contributions
ional+0.595
 Favor+0.574
inas+0.573
Jar+0.552
 deviation+0.536
Neg logit contributions
anwhile-0.784
PN-0.756
DR-0.730
hy-0.727
estern-0.707
Token the
Token ID262
Feature activation+6.23e-02

Pos logit contributions
anwhile+0.313
PN+0.301
DR+0.291
hy+0.289
estern+0.283
Neg logit contributions
ional-0.252
inas-0.240
 Favor-0.238
Jar-0.234
 deviation-0.223
Token next
Token ID1306
Feature activation+9.36e-01

Pos logit contributions
anwhile+0.108
PN+0.104
DR+0.100
hy+0.099
estern+0.097
Neg logit contributions
ional-0.089
 Favor-0.085
inas-0.085
Jar-0.083
 deviation-0.079
Token year
Token ID614
Feature activation+3.09e-01

Pos logit contributions
anwhile+1.589
PN+1.560
DR+1.511
hy+1.478
estern+1.440
Neg logit contributions
ional-1.334
inas-1.280
 Favor-1.270
Jar-1.237
 deviation-1.173
Token-
Token ID12
Feature activation-2.14e-01

Pos logit contributions
anwhile+0.551
PN+0.533
DR+0.514
hy+0.511
engineering+0.497
Neg logit contributions
ional-0.420
 Favor-0.409
inas-0.405
Jar-0.391
 deviation-0.381
Tokenand
Token ID392
Feature activation-6.71e-01

Pos logit contributions
ional+0.340
 Favor+0.328
inas+0.326
Jar+0.316
 deviation+0.308
Neg logit contributions
anwhile-0.338
PN-0.324
DR-0.312
hy-0.309
engineering-0.301
Token-
Token ID12
Feature activation-3.12e-01

Pos logit contributions
ional+0.979
 Favor+0.937
inas+0.932
Jar+0.903
 deviation+0.877
Neg logit contributions
anwhile-1.148
PN-1.103
DR-1.060
hy-1.055
estern-1.029
Tokena
Token ID64
Feature activation-2.96e-01

Pos logit contributions
ional+0.522
 Favor+0.502
inas+0.499
Jar+0.486
 deviation+0.475
Neg logit contributions
anwhile-0.466
PN-0.447
DR-0.429
hy-0.426
engineering-0.414
Token-
Token ID12
Feature activation-8.34e-01

Pos logit contributions
ional+0.425
 Favor+0.406
inas+0.404
Jar+0.393
 deviation+0.379
Neg logit contributions
anwhile-0.512
PN-0.493
DR-0.473
hy-0.470
estern-0.460
Token is
Token ID318
Feature activation+3.92e-01

Pos logit contributions
anwhile+1.718
PN+1.663
hy+1.609
DR+1.609
engineering+1.560
Neg logit contributions
ional-1.507
inas-1.489
 Favor-1.481
Jar-1.439
 deviation-1.347
Token,
Token ID11
Feature activation-4.20e-01

Pos logit contributions
anwhile+0.567
PN+0.545
hy+0.519
DR+0.518
engineering+0.499
Neg logit contributions
ional-0.665
inas-0.643
 Favor-0.642
Jar-0.625
 deviation-0.606
Token of
Token ID286
Feature activation+6.91e-01

Pos logit contributions
ional+0.601
 Favor+0.574
inas+0.571
Jar+0.556
 deviation+0.534
Neg logit contributions
anwhile-0.728
PN-0.701
DR-0.673
hy-0.670
estern-0.651
Token course
Token ID1781
Feature activation-6.37e-01

Pos logit contributions
anwhile+0.980
PN+0.971
DR+0.932
hy+0.929
engineering+0.889
Neg logit contributions
ional-1.149
 Favor-1.110
inas-1.107
Jar-1.086
 deviation-1.034
Token,
Token ID11
Feature activation-9.23e-01

Pos logit contributions
ional+0.795
 Favor+0.753
inas+0.748
Jar+0.724
 deviation+0.694
Neg logit contributions
anwhile-1.227
PN-1.183
DR-1.139
hy-1.132
estern-1.112
Token informed
Token ID7981
Feature activation+7.53e-01

Pos logit contributions
ional+1.551
 Favor+1.488
inas+1.479
Jar+1.447
 deviation+1.404
Neg logit contributions
anwhile-1.385
PN-1.320
DR-1.258
hy-1.243
estern-1.218
Token about
Token ID546
Feature activation+2.93e-01

Pos logit contributions
anwhile+1.307
PN+1.268
hy+1.227
DR+1.218
engineering+1.174
Neg logit contributions
ional-1.035
 Favor-1.016
inas-1.011
Jar-0.979
 deviation-0.939
Token the
Token ID262
Feature activation+5.79e-01

Pos logit contributions
anwhile+0.492
PN+0.479
hy+0.462
DR+0.458
estern+0.442
Neg logit contributions
ional-0.423
inas-0.414
 Favor-0.407
Jar-0.398
 deviation-0.371
Token issue
Token ID2071
Feature activation+6.86e-02

Pos logit contributions
anwhile+0.926
PN+0.895
hy+0.853
DR+0.847
engineering+0.814
Neg logit contributions
ional-0.895
inas-0.866
 Favor-0.851
Jar-0.843
icularly-0.787
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
anwhile+0.121
PN+0.116
hy+0.112
DR+0.112
engineering+0.109
Neg logit contributions
ional-0.095
 Favor-0.092
inas-0.092
Jar-0.089
 deviation-0.085
Token In
Token ID554
Feature activation-1.18e+00

Pos logit contributions
anwhile+0.182
PN+0.178
DR+0.171
hy+0.169
estern+0.163
Neg logit contributions
ional-0.166
inas-0.159
 Favor-0.156
Jar-0.153
 deviation-0.149
Token and
Token ID290
Feature activation-1.19e-01

Pos logit contributions
anwhile+0.899
PN+0.881
hy+0.856
DR+0.850
engineering+0.829
Neg logit contributions
ional-0.754
 Favor-0.726
inas-0.721
Jar-0.706
 deviation-0.661
Token the
Token ID262
Feature activation+1.16e+00

Pos logit contributions
ional+0.178
 Favor+0.171
inas+0.170
Jar+0.167
 deviation+0.158
Neg logit contributions
anwhile-0.197
PN-0.190
hy-0.183
DR-0.183
engineering-0.177
Token shadows
Token ID16187
Feature activation-1.29e+00

Pos logit contributions
anwhile+1.914
PN+1.841
hy+1.792
DR+1.779
engineering+1.719
Neg logit contributions
ional-1.710
 Favor-1.648
inas-1.634
Jar-1.626
 deviation-1.510
Token,
Token ID11
Feature activation-2.35e-02

Pos logit contributions
ional+1.677
 Favor+1.594
inas+1.585
Jar+1.536
 deviation+1.473
Neg logit contributions
anwhile-2.405
PN-2.322
DR-2.234
hy-2.221
estern-2.176
Token exceptionally
Token ID24822
Feature activation-9.86e-02

Pos logit contributions
ional+0.036
inas+0.034
 Favor+0.034
Jar+0.033
 deviation+0.032
Neg logit contributions
anwhile-0.038
PN-0.037
hy-0.036
DR-0.036
engineering-0.034
Token cheerful
Token ID37999
Feature activation-6.73e-01

Pos logit contributions
ional+0.156
inas+0.150
 Favor+0.150
Jar+0.147
 deviation+0.140
Neg logit contributions
anwhile-0.154
PN-0.149
hy-0.143
DR-0.143
engineering-0.137
Token.
Token ID13
Feature activation+3.62e-01

Pos logit contributions
ional+0.930
inas+0.889
 Favor+0.887
Jar+0.863
 deviation+0.824
Neg logit contributions
anwhile-1.192
PN-1.153
DR-1.113
hy-1.109
engineering-1.077
Token That
Token ID1320
Feature activation-1.74e+00

Pos logit contributions
anwhile+0.622
PN+0.614
DR+0.593
hy+0.588
engineering+0.566
Neg logit contributions
ional-0.510
inas-0.484
 Favor-0.472
Jar-0.463
 deviation-0.452
Token is
Token ID318
Feature activation-1.07e-03

Pos logit contributions
ional+2.388
 Favor+2.272
inas+2.261
Jar+2.194
 deviation+2.110
Neg logit contributions
anwhile-3.149
PN-3.033
DR-2.914
hy-2.898
estern-2.835
Token,
Token ID11
Feature activation+1.52e-01

Pos logit contributions
ional+0.002
inas+0.001
 Favor+0.001
Jar+0.001
 deviation+0.001
Neg logit contributions
anwhile-0.002
PN-0.002
DR-0.002
hy-0.002
engineering-0.002
Token cheerful
Token ID37999
Feature activation-2.36e-01

Pos logit contributions
anwhile+0.271
PN+0.262
hy+0.253
DR+0.252
engineering+0.244
Neg logit contributions
ional-0.207
 Favor-0.198
inas-0.197
Jar-0.191
 deviation-0.183
Token of
Token ID286
Feature activation-9.07e-04

Pos logit contributions
anwhile+2.348
PN+2.331
hy+2.245
DR+2.226
estern+2.163
Neg logit contributions
ional-1.982
 Favor-1.966
inas-1.943
Jar-1.865
icularly-1.771
Token the
Token ID262
Feature activation+3.66e-01

Pos logit contributions
ional+0.001
inas+0.001
 Favor+0.001
Jar+0.001
 deviation+0.001
Neg logit contributions
anwhile-0.002
PN-0.002
DR-0.001
hy-0.001
estern-0.001
Token Jewish
Token ID5582
Feature activation-2.23e+00

Pos logit contributions
anwhile+0.609
PN+0.588
DR+0.569
hy+0.563
estern+0.549
Neg logit contributions
ional-0.549
inas-0.520
 Favor-0.518
Jar-0.505
 deviation-0.484
Token people
Token ID661
Feature activation-8.94e-02

Pos logit contributions
ional+3.634
 Favor+3.518
inas+3.486
Jar+3.396
 deviation+3.308
Neg logit contributions
anwhile-3.429
PN-3.251
DR-3.098
hy-3.072
estern-3.002
Token in
Token ID287
Feature activation-1.13e+00

Pos logit contributions
ional+0.122
 Favor+0.117
inas+0.117
Jar+0.113
 deviation+0.108
Neg logit contributions
anwhile-0.159
PN-0.153
DR-0.149
hy-0.149
estern-0.144
Token the
Token ID262
Feature activation-3.31e-01

Pos logit contributions
ional+1.577
inas+1.488
 Favor+1.472
Jar+1.429
 deviation+1.368
Neg logit contributions
anwhile-2.021
PN-1.954
DR-1.893
hy-1.876
estern-1.837
Token land
Token ID1956
Feature activation+1.82e-01

Pos logit contributions
ional+0.482
inas+0.455
 Favor+0.452
Jar+0.441
 deviation+0.422
Neg logit contributions
anwhile-0.568
PN-0.550
DR-0.528
hy-0.526
estern-0.514
Token of
Token ID286
Feature activation-3.67e-01

Pos logit contributions
anwhile+0.314
PN+0.303
hy+0.293
DR+0.292
estern+0.283
Neg logit contributions
ional-0.257
 Favor-0.248
inas-0.246
Jar-0.239
 deviation-0.229
Token Israel
Token ID2692
Feature activation-7.97e-01

Pos logit contributions
ional+0.497
inas+0.463
 Favor+0.462
Jar+0.447
 deviation+0.428
Neg logit contributions
anwhile-0.665
PN-0.644
DR-0.624
hy-0.620
estern-0.607
Token,
Token ID11
Feature activation-3.42e-01

Pos logit contributions
ional+1.065
 Favor+1.017
inas+1.010
Jar+0.981
 deviation+0.939
Neg logit contributions
anwhile-1.457
PN-1.408
DR-1.360
hy-1.351
estern-1.318
Token are
Token ID389
Feature activation+1.01e+00

Pos logit contributions
ional+0.496
inas+0.474
 Favor+0.473
Jar+0.459
 deviation+0.439
Neg logit contributions
anwhile-0.588
PN-0.565
DR-0.545
hy-0.540
estern-0.524
Token they
Token ID484
Feature activation-2.11e+00

Pos logit contributions
anwhile+1.781
PN+1.767
DR+1.688
hy+1.657
estern+1.597
Neg logit contributions
ional-2.018
inas-1.930
 Favor-1.900
Jar-1.896
 deviation-1.791
Token are
Token ID389
Feature activation-6.13e-01

Pos logit contributions
ional+3.030
 Favor+2.890
inas+2.868
Jar+2.759
 deviation+2.683
Neg logit contributions
anwhile-3.687
PN-3.514
DR-3.360
hy-3.343
estern-3.282
Token bringing
Token ID6079
Feature activation-1.36e+00

Pos logit contributions
ional+0.968
 Favor+0.929
inas+0.925
Jar+0.904
 deviation+0.873
Neg logit contributions
anwhile-0.971
PN-0.935
DR-0.893
hy-0.886
estern-0.863
Token to
Token ID284
Feature activation-5.63e-01

Pos logit contributions
ional+2.266
 Favor+2.179
inas+2.159
Jar+2.105
 deviation+2.047
Neg logit contributions
anwhile-2.053
PN-1.954
DR-1.862
hy-1.849
estern-1.805
Token the
Token ID262
Feature activation+2.93e-01

Pos logit contributions
ional+0.815
inas+0.774
 Favor+0.770
Jar+0.760
 deviation+0.725
Neg logit contributions
anwhile-0.965
PN-0.939
DR-0.899
hy-0.890
estern-0.870
Token convention
Token ID9831
Feature activation-1.30e+00

Pos logit contributions
anwhile+0.558
PN+0.545
DR+0.526
hy+0.522
estern+0.509
Neg logit contributions
ional-0.364
inas-0.342
 Favor-0.339
Jar-0.335
 deviation-0.315
Token hall
Token ID6899
Feature activation-6.34e-01

Pos logit contributions
ional+1.777
 Favor+1.692
inas+1.682
Jar+1.633
 deviation+1.572
Neg logit contributions
anwhile-2.339
PN-2.253
DR-2.164
hy-2.150
estern-2.107
Token,
Token ID11
Feature activation-3.84e-01

Pos logit contributions
ional+0.875
 Favor+0.833
inas+0.830
Jar+0.809
 deviation+0.774
Neg logit contributions
anwhile-1.129
PN-1.094
DR-1.053
hy-1.044
estern-1.017
Token and
Token ID290
Feature activation+6.28e-02

Pos logit contributions
ional+0.521
inas+0.493
 Favor+0.493
Jar+0.479
 deviation+0.458
Neg logit contributions
anwhile-0.696
PN-0.674
DR-0.648
hy-0.644
estern-0.627
Token so
Token ID523
Feature activation+1.51e+00

Pos logit contributions
anwhile+0.106
PN+0.103
DR+0.098
hy+0.098
estern+0.095
Neg logit contributions
ional-0.093
inas-0.088
 Favor-0.088
Jar-0.086
 deviation-0.082
Token now
Token ID783
Feature activation+1.52e+00

Pos logit contributions
anwhile+2.331
PN+2.256
DR+2.173
hy+2.154
estern+2.067
Neg logit contributions
ional-2.368
inas-2.289
 Favor-2.273
Jar-2.228
 deviation-2.138
Token-
Token ID12
Feature activation+8.20e-01

Pos logit contributions
anwhile+0.873
PN+0.859
DR+0.829
hy+0.825
engineering+0.783
Neg logit contributions
ional-0.815
 Favor-0.788
inas-0.780
Jar-0.758
 deviation-0.743
Token16
Token ID1433
Feature activation+1.94e+00

Pos logit contributions
anwhile+0.917
PN+0.873
DR+0.823
hy+0.819
engineering+0.775
Neg logit contributions
ional-1.670
 Favor-1.616
inas-1.613
Jar-1.577
 deviation-1.548
Token jets
Token ID20792
Feature activation-5.31e-03

Pos logit contributions
anwhile+2.872
PN+2.807
DR+2.772
hy+2.730
engine+2.571
Neg logit contributions
ional-3.149
inas-2.976
 Favor-2.967
Jar-2.870
 deviation-2.825
Token.
Token ID13
Feature activation+2.77e-01

Pos logit contributions
ional+0.008
inas+0.007
 Favor+0.007
Jar+0.007
 deviation+0.007
Neg logit contributions
anwhile-0.009
PN-0.009
DR-0.009
hy-0.009
engineering-0.008
Token
Token ID198
Feature activation-1.35e-01

Pos logit contributions
anwhile+0.446
PN+0.434
DR+0.416
hy+0.409
engineering+0.401
Neg logit contributions
ional-0.425
inas-0.408
 Favor-0.401
Jar-0.396
 deviation-0.387
Token
Token ID198
Feature activation-3.57e-01

Pos logit contributions
ional+0.203
 Favor+0.194
inas+0.193
Jar+0.188
 deviation+0.182
Neg logit contributions
anwhile-0.225
PN-0.218
DR-0.209
hy-0.207
estern-0.202
TokenThe
Token ID464
Feature activation-1.37e+00

Pos logit contributions
ional+0.536
 Favor+0.520
inas+0.512
 deviation+0.492
Jar+0.488
Neg logit contributions
PN-0.584
anwhile-0.572
DR-0.559
hy-0.546
engineering-0.532
Token Government
Token ID5070
Feature activation-4.62e-02

Pos logit contributions
ional+2.001
 Favor+1.910
inas+1.902
Jar+1.851
 deviation+1.782
Neg logit contributions
anwhile-2.343
PN-2.253
DR-2.160
hy-2.144
estern-2.095
Token of
Token ID286
Feature activation+4.20e-01

Pos logit contributions
ional+0.062
inas+0.060
 Favor+0.060
Jar+0.059
 deviation+0.056
Neg logit contributions
anwhile-0.081
PN-0.080
DR-0.077
hy-0.077
engineering-0.075
Token Bahrain
Token ID29066
Feature activation-2.20e+00

Pos logit contributions
anwhile+0.741
PN+0.721
DR+0.695
hy+0.687
engineering+0.673
Neg logit contributions
ional-0.576
inas-0.550
 Favor-0.545
Jar-0.537
 deviation-0.520
Token has
Token ID468
Feature activation+3.10e-01

Pos logit contributions
ional+3.317
 Favor+3.166
inas+3.150
Jar+3.064
 deviation+2.961
Neg logit contributions
anwhile-3.671
PN-3.519
DR-3.368
hy-3.346
estern-3.284
Token still
Token ID991
Feature activation+1.06e+00

Pos logit contributions
ional+0.485
 Favor+0.470
inas+0.469
Jar+0.455
 deviation+0.438
Neg logit contributions
anwhile-0.535
PN-0.517
hy-0.497
DR-0.497
engineering-0.480
Token greater
Token ID3744
Feature activation-8.48e-02

Pos logit contributions
anwhile+1.654
PN+1.627
hy+1.592
DR+1.572
engineering+1.512
Neg logit contributions
inas-1.558
ional-1.557
 Favor-1.555
Jar-1.510
 deviation-1.444
Token on
Token ID319
Feature activation-7.33e-02

Pos logit contributions
ional+0.125
 Favor+0.121
inas+0.121
Jar+0.118
 deviation+0.113
Neg logit contributions
anwhile-0.141
PN-0.136
DR-0.131
hy-0.131
estern-0.126
Token women
Token ID1466
Feature activation-4.72e-01

Pos logit contributions
ional+0.106
 Favor+0.102
inas+0.101
Jar+0.100
 deviation+0.094
Neg logit contributions
anwhile-0.124
PN-0.120
DR-0.116
hy-0.115
engineering-0.111
Token (
Token ID357
Feature activation+3.04e-01

Pos logit contributions
ional+0.726
 Favor+0.700
inas+0.696
Jar+0.681
 deviation+0.655
Neg logit contributions
anwhile-0.758
PN-0.736
DR-0.707
hy-0.704
estern-0.680
Tokento
Token ID1462
Feature activation-1.30e+00

Pos logit contributions
anwhile+0.455
PN+0.448
hy+0.428
DR+0.427
engineering+0.412
Neg logit contributions
ional-0.496
 Favor-0.480
inas-0.474
Jar-0.462
 deviation-0.454
Token look
Token ID804
Feature activation+8.18e-02

Pos logit contributions
ional+1.787
 Favor+1.701
inas+1.692
Jar+1.643
 deviation+1.581
Neg logit contributions
anwhile-2.347
PN-2.261
DR-2.172
hy-2.158
estern-2.113
Token good
Token ID922
Feature activation-4.96e-01

Pos logit contributions
anwhile+0.134
PN+0.131
hy+0.127
DR+0.126
engineering+0.121
Neg logit contributions
ional-0.122
 Favor-0.118
inas-0.116
Jar-0.114
 deviation-0.109
Token),
Token ID828
Feature activation-4.05e-01

Pos logit contributions
ional+0.671
 Favor+0.649
inas+0.639
Jar+0.624
 deviation+0.601
Neg logit contributions
anwhile-0.891
PN-0.862
DR-0.834
hy-0.833
estern-0.806
Token
Token ID447
Feature activation+7.41e-01

Pos logit contributions
ional+0.586
 Favor+0.560
inas+0.559
Jar+0.544
 deviation+0.524
Neg logit contributions
anwhile-0.694
PN-0.672
DR-0.645
hy-0.642
engineering-0.621
Token
Token ID251
Feature activation-2.59e-01

Pos logit contributions
anwhile+1.242
PN+1.234
DR+1.193
hy+1.185
engineering+1.140
Neg logit contributions
ional-1.069
inas-1.016
 Favor-1.014
Jar-0.984
 deviation-0.944
Tokenat
Token ID265
Feature activation-1.96e+00

Pos logit contributions
anwhile+3.011
PN+2.932
hy+2.913
DR+2.831
estern+2.780
Neg logit contributions
ional-2.611
 Favor-2.593
 deviation-2.526
inas-2.481
Jar-2.474
Token"]
Token ID8973
Feature activation+1.21e+00

Pos logit contributions
ional+2.740
 Favor+2.609
inas+2.594
Jar+2.523
 deviation+2.432
Neg logit contributions
anwhile-3.476
PN-3.350
DR-3.215
hy-3.195
estern-3.128
Token [
Token ID685
Feature activation+1.55e+00

Pos logit contributions
PN+2.204
anwhile+2.169
DR+2.132
hy+2.077
engineering+2.012
Neg logit contributions
ional-1.566
inas-1.457
 Favor-1.448
Jar-1.408
 deviation-1.359
TokenBlack
Token ID9915
Feature activation-2.45e+00

Pos logit contributions
PN+2.775
DR+2.679
anwhile+2.609
hy+2.604
engineering+2.535
Neg logit contributions
ional-2.004
inas-1.981
 Favor-1.977
Jar-1.871
 deviation-1.825
Token "
Token ID366
Feature activation+9.63e-02

Pos logit contributions
ional+2.815
 Favor+2.728
inas+2.651
Jar+2.576
 deviation+2.482
Neg logit contributions
anwhile-5.056
PN-4.675
 destro-4.526
DR-4.517
hy-4.482
TokenAnt
Token ID13217
Feature activation-3.54e+00

Pos logit contributions
PN+0.142
anwhile+0.141
DR+0.135
hy+0.133
engineering+0.128
Neg logit contributions
ional-0.160
 Favor-0.155
inas-0.152
 deviation-0.146
Jar-0.146
Tokenip
Token ID541
Feature activation-2.13e-01

Pos logit contributions
ional+5.745
inas+5.514
 Favor+5.366
Jar+5.286
tle+5.057
Neg logit contributions
anwhile-5.442
PN-5.135
DR-4.897
ITNESS-4.764
engineering-4.703
Tokenov
Token ID709
Feature activation+2.15e-02

Pos logit contributions
ional+0.337
 Favor+0.325
inas+0.318
Jar+0.315
 deviation+0.308
Neg logit contributions
anwhile-0.337
PN-0.321
hy-0.313
DR-0.306
estern-0.301
Token,
Token ID11
Feature activation-3.23e-01

Pos logit contributions
anwhile+0.040
PN+0.039
DR+0.038
hy+0.037
estern+0.037
Neg logit contributions
ional-0.028
 Favor-0.026
inas-0.026
Jar-0.025
 deviation-0.024
Token Mikhail
Token ID42040
Feature activation-1.32e+00

Pos logit contributions
ional+0.427
inas+0.402
 Favor+0.399
Jar+0.389
 deviation+0.375
Neg logit contributions
anwhile-0.598
PN-0.581
DR-0.557
hy-0.556
estern-0.540
Token Al
Token ID978
Feature activation+4.93e-01

Pos logit contributions
ional+1.847
 Favor+1.750
inas+1.741
Jar+1.702
 deviation+1.644
Neg logit contributions
anwhile-2.332
PN-2.251
DR-2.157
hy-2.155
estern-2.093
Token admins
Token ID44563
Feature activation+1.24e+00

Pos logit contributions
ional+0.057
 Favor+0.054
inas+0.054
Jar+0.053
 deviation+0.050
Neg logit contributions
anwhile-0.066
PN-0.064
DR-0.061
hy-0.061
estern-0.059
Token always
Token ID1464
Feature activation-4.93e-01

Pos logit contributions
anwhile+2.012
PN+1.913
DR+1.870
hy+1.839
estern+1.773
Neg logit contributions
ional-1.851
 Favor-1.796
inas-1.795
Jar-1.760
 deviation-1.664
Token grant
Token ID7264
Feature activation-2.01e+00

Pos logit contributions
ional+0.706
 Favor+0.673
inas+0.669
Jar+0.650
 deviation+0.627
Neg logit contributions
anwhile-0.858
PN-0.825
DR-0.791
hy-0.786
estern-0.770
Token it
Token ID340
Feature activation-2.30e+00

Pos logit contributions
ional+2.818
 Favor+2.683
inas+2.667
Jar+2.589
 deviation+2.500
Neg logit contributions
anwhile-3.579
PN-3.439
DR-3.302
hy-3.280
estern-3.216
Token.
Token ID13
Feature activation+2.02e-01

Pos logit contributions
ional+3.207
 Favor+3.053
inas+3.017
Jar+2.919
 deviation+2.840
Neg logit contributions
anwhile-4.105
PN-3.920
DR-3.760
hy-3.730
estern-3.693
Token It
Token ID632
Feature activation-3.38e+00

Pos logit contributions
anwhile+0.332
PN+0.326
DR+0.314
hy+0.309
engineering+0.299
Neg logit contributions
ional-0.303
inas-0.288
 Favor-0.284
Jar-0.279
 deviation-0.270
Token's
Token ID338
Feature activation-1.46e+00

Pos logit contributions
ional+3.881
inas+3.627
 Favor+3.612
Jar+3.446
 deviation+3.324
Neg logit contributions
anwhile-6.801
PN-6.612
DR-6.332
estern-6.261
hy-6.219
Token time
Token ID640
Feature activation+1.09e+00

Pos logit contributions
ional+2.198
 Favor+2.100
inas+2.088
Jar+2.033
 deviation+1.968
Neg logit contributions
anwhile-2.436
PN-2.337
DR-2.236
hy-2.217
estern-2.173
Token,
Token ID11
Feature activation-1.69e-01

Pos logit contributions
anwhile+1.557
PN+1.491
DR+1.433
hy+1.407
engineering+1.356
Neg logit contributions
ional-1.865
 Favor-1.808
inas-1.804
Jar-1.760
 deviation-1.699
Token with
Token ID351
Feature activation+7.99e-01

Pos logit contributions
ional+0.251
 Favor+0.240
inas+0.239
Jar+0.233
 deviation+0.224
Neg logit contributions
anwhile-0.283
PN-0.273
DR-0.262
hy-0.260
engineering-0.252
Token $
Token ID720
Feature activation+2.33e+00

Pos logit contributions
anwhile+1.363
PN+1.338
DR+1.283
hy+1.273
estern+1.237
Neg logit contributions
ional-1.129
 Favor-1.076
inas-1.070
Jar-1.057
 deviation-0.987
TokenWA
Token ID15543
Feature activation+1.52e+00

Pos logit contributions
PN+1.708
anwhile+1.696
DR+1.630
hy+1.569
estern+1.531
Neg logit contributions
ional-2.028
 Favor-1.927
inas-1.924
Jar-1.879
 deviation-1.841
Token Fresh
Token ID20138
Feature activation-1.21e+00

Pos logit contributions
anwhile+2.627
PN+2.603
DR+2.550
hy+2.493
engineering+2.417
Neg logit contributions
ional-2.052
inas-1.935
Jar-1.930
 Favor-1.909
 deviation-1.812
Tokenman
Token ID805
Feature activation-8.69e-01

Pos logit contributions
ional+1.246
 Favor+1.166
inas+1.160
Jar+1.116
 deviation+1.063
Neg logit contributions
anwhile-2.588
PN-2.525
DR-2.441
hy-2.433
engineering-2.378
Token All
Token ID1439
Feature activation-1.51e+00

Pos logit contributions
ional+1.157
inas+1.097
 Favor+1.092
Jar+1.072
 deviation+1.019
Neg logit contributions
anwhile-1.585
PN-1.540
DR-1.485
hy-1.472
estern-1.436
Token-
Token ID12
Feature activation-8.04e-01

Pos logit contributions
ional+2.098
 Favor+1.986
inas+1.962
Jar+1.906
 deviation+1.826
Neg logit contributions
anwhile-2.720
PN-2.599
DR-2.482
hy-2.466
engineering-2.419
TokenAmerica
Token ID18165
Feature activation-3.40e+00

Pos logit contributions
ional+1.203
 Favor+1.152
inas+1.145
Jar+1.110
 deviation+1.079
Neg logit contributions
anwhile-1.336
PN-1.300
DR-1.247
hy-1.235
estern-1.202
Token Team
Token ID4816
Feature activation-1.59e+00

Pos logit contributions
ional+4.395
 Favor+4.234
inas+4.113
Jar+3.945
 deviation+3.862
Neg logit contributions
anwhile-6.348
PN-6.099
DR-5.830
hy-5.814
estern-5.728
Token by
Token ID416
Feature activation+1.04e+00

Pos logit contributions
ional+2.192
inas+2.085
 Favor+2.081
Jar+2.029
 deviation+1.942
Neg logit contributions
anwhile-2.851
PN-2.750
DR-2.649
hy-2.623
estern-2.570
Token the
Token ID262
Feature activation+6.59e-01

Pos logit contributions
anwhile+1.823
PN+1.762
DR+1.699
hy+1.672
estern+1.635
Neg logit contributions
ional-1.466
inas-1.373
Jar-1.362
 Favor-1.361
 deviation-1.290
Token United
Token ID1578
Feature activation-1.99e-01

Pos logit contributions
anwhile+1.098
PN+1.060
DR+1.020
hy+1.008
engineering+0.984
Neg logit contributions
ional-0.985
 Favor-0.928
inas-0.925
Jar-0.913
 deviation-0.875
Token States
Token ID1829
Feature activation-9.82e-01

Pos logit contributions
ional+0.328
 Favor+0.312
inas+0.312
Jar+0.306
 deviation+0.297
Neg logit contributions
anwhile-0.301
PN-0.292
DR-0.279
hy-0.275
estern-0.268
Token what
Token ID644
Feature activation+8.15e-01

Pos logit contributions
ional+0.101
inas+0.097
 Favor+0.097
Jar+0.094
 deviation+0.090
Neg logit contributions
anwhile-0.125
PN-0.122
DR-0.117
hy-0.116
engineering-0.113
Token generation
Token ID5270
Feature activation-6.69e-02

Pos logit contributions
anwhile+1.313
PN+1.286
DR+1.227
hy+1.220
engineering+1.177
Neg logit contributions
ional-1.246
inas-1.194
 Favor-1.193
Jar-1.155
 deviation-1.102
Token could
Token ID714
Feature activation+6.58e-01

Pos logit contributions
ional+0.099
 Favor+0.095
inas+0.094
Jar+0.092
 deviation+0.088
Neg logit contributions
anwhile-0.111
PN-0.109
DR-0.104
hy-0.103
engineering-0.100
Token do
Token ID466
Feature activation+4.44e-01

Pos logit contributions
anwhile+1.117
PN+1.078
DR+1.041
hy+1.034
estern+0.993
Neg logit contributions
ional-0.945
 Favor-0.915
inas-0.904
Jar-0.883
 deviation-0.845
Token.
Token ID13
Feature activation-2.43e-01

Pos logit contributions
anwhile+0.756
PN+0.734
DR+0.708
hy+0.703
engineering+0.678
Neg logit contributions
ional-0.633
inas-0.613
 Favor-0.611
Jar-0.598
 deviation-0.571
Token It
Token ID632
Feature activation-3.51e+00

Pos logit contributions
ional+0.346
inas+0.329
 Favor+0.326
Jar+0.318
 deviation+0.306
Neg logit contributions
anwhile-0.420
PN-0.409
DR-0.394
hy-0.389
engineering-0.379
Token was
Token ID373
Feature activation+8.59e-01

Pos logit contributions
ional+4.948
inas+4.684
 Favor+4.654
Jar+4.517
 deviation+4.314
Neg logit contributions
anwhile-6.118
PN-5.899
DR-5.643
estern-5.539
hy-5.517
Token still
Token ID991
Feature activation+8.09e-01

Pos logit contributions
anwhile+1.364
PN+1.325
DR+1.273
hy+1.272
estern+1.229
Neg logit contributions
ional-1.317
inas-1.275
 Favor-1.264
Jar-1.233
 deviation-1.182
Token noise
Token ID7838
Feature activation-1.02e+00

Pos logit contributions
anwhile+1.278
PN+1.251
hy+1.206
DR+1.203
engineering+1.159
Neg logit contributions
ional-1.238
inas-1.196
 Favor-1.189
Jar-1.168
 deviation-1.105
Token generated
Token ID7560
Feature activation-1.64e-01

Pos logit contributions
ional+1.443
 Favor+1.377
inas+1.370
Jar+1.330
 deviation+1.280
Neg logit contributions
anwhile-1.793
PN-1.727
DR-1.658
hy-1.646
estern-1.611
Token terrain
Token ID15510
Feature activation+6.87e-01

Pos logit contributions
ional+0.233
 Favor+0.224
inas+0.223
Jar+0.217
 deviation+0.207
Neg logit contributions
anwhile-0.283
PN-0.274
DR-0.264
hy-0.261
engineering-0.254
Token Order
Token ID8284
Feature activation+3.16e-01

Pos logit contributions
ional+2.625
inas+2.512
 Favor+2.446
Jar+2.321
 deviation+2.271
Neg logit contributions
anwhile-4.253
PN-4.106
DR-4.001
hy-3.969
engineering-3.841
Token walk
Token ID2513
Feature activation+9.43e-01

Pos logit contributions
anwhile+0.552
PN+0.546
DR+0.527
hy+0.523
engineering+0.503
Neg logit contributions
ional-0.427
inas-0.413
 Favor-0.398
Jar-0.396
 deviation-0.372
Token in
Token ID287
Feature activation-1.05e+00

Pos logit contributions
anwhile+1.538
PN+1.502
hy+1.483
DR+1.442
estern+1.415
Neg logit contributions
ional-1.372
 Favor-1.347
inas-1.343
Jar-1.292
 deviation-1.262
Token Glasgow
Token ID23995
Feature activation-1.51e+00

Pos logit contributions
ional+1.400
inas+1.323
 Favor+1.315
Jar+1.281
 deviation+1.229
Neg logit contributions
anwhile-1.932
PN-1.863
DR-1.799
hy-1.789
estern-1.746
Token on
Token ID319
Feature activation-8.94e-01

Pos logit contributions
ional+2.092
 Favor+1.986
inas+1.981
Jar+1.925
 deviation+1.849
Neg logit contributions
anwhile-2.713
PN-2.608
DR-2.520
hy-2.494
engineering-2.433
Token Saturday
Token ID3909
Feature activation-4.05e+00

Pos logit contributions
ional+0.671
inas+0.618
 Favor+0.611
Jar+0.578
 deviation+0.531
Neg logit contributions
anwhile-2.158
PN-2.100
DR-2.047
hy-2.043
estern-1.995
Token.
Token ID13
Feature activation+3.88e-01

Pos logit contributions
ional+5.012
 Favor+4.704
inas+4.665
Jar+4.492
 deviation+4.289
Neg logit contributions
anwhile-7.691
PN-7.351
osponsors-7.103
estern-7.076
DR-7.070
Token
Token ID198
Feature activation-1.42e-01

Pos logit contributions
PN+0.586
anwhile+0.565
DR+0.560
hy+0.547
engineering+0.524
Neg logit contributions
ional-0.631
inas-0.598
 Favor-0.592
 deviation-0.574
Jar-0.571
Token
Token ID198
Feature activation-3.05e-01

Pos logit contributions
ional+0.213
 Favor+0.203
inas+0.203
Jar+0.197
 deviation+0.191
Neg logit contributions
anwhile-0.236
PN-0.228
DR-0.219
hy-0.217
estern-0.212
TokenThe
Token ID464
Feature activation-1.12e+00

Pos logit contributions
ional+0.471
 Favor+0.456
inas+0.446
 deviation+0.429
Jar+0.425
Neg logit contributions
PN-0.489
anwhile-0.480
DR-0.467
hy-0.458
engineering-0.443
Token girl
Token ID2576
Feature activation-1.85e+00

Pos logit contributions
ional+1.407
 Favor+1.330
inas+1.323
Jar+1.282
 deviation+1.223
Neg logit contributions
anwhile-2.138
PN-2.069
DR-1.993
hy-1.980
estern-1.935