TOP ACTIVATIONS
MAX = 27.530

to levels where it becomes totally unaff ordable for people .
this attack on encryption is utterly baff ling , unless it
can make the ultimate end significantly more expensive .
Un certain ty is making Britain a less attractive destination ,
Madrid Dri ving slowly along the city 's broad
emissions that would make global warming worse are " essential parts
out of there , smoke pouring out of the tank .
. " Air strikes were carried out to target
warns the policy would " drive away inward investment ".
economists think this will be pushed back due to recent poor
are always trying to throw bullets at me ,
become more powerful now makes Damascus wary . Turkey
. We need to be absolutely certain the FBI is worthy
umbling and rolling iceberg pushes water out wards through the f
in Madrid Dri ving slowly along the city 's
education institutions - and how Brexit plays out in relation to
it 's difficult , if not impossible , for them to
; they simply make it increasingly impossible to achieve in the
perpetually terrified and keep defense budgets up , if the Deep
However , to make all this happen , it is necessary

MIN ACTIVATIONS
MIN = -37.982

genius of a mathematician had originally sent me the paper
central to the story , had a collection of autom at
re originally from Ireland and have travelled and worked all over
the living world . He had once lived inside a par
, American man . I have a history of depression ,
origin that Batman and Dick have of having gone through this
was a British citizen who had been born in Pakistan ,
that Xer xes , who had probably never left dry land
of transport , and he had discovered that his father was
wrote on LinkedIn that she had also worked as a professor
born in New Jersey and has been a United States citizen
to this point , I have been living a very good
in Gund uz , who had helped him bring in the
the way ( these players had missed the Valve - led
development camera , although it had been upgraded regularly over the
hike . Bob In man had written a guide book about
Win kel ried , who had started at Goldman eight years
player , the one who had carried them to an upset
most of the Italian Army had been intern ed by German
including my wife ). I had no job lined up ,

INTERVAL 11.152 - 27.530
CONTAINS 2.350%

kn olls fathers and sons kick football s to each other
further along he walks , the greater the destruction and injuries
A few days before my last phone call
, which looks to be heading toward needing a full bailout
production . In addition , a Quad ren n ial Energy

INTERVAL -5.226 - 11.152
CONTAINS 82.136%

as Daniel Sparks , the Goldman partner overseeing the firm
which denied wrongdoing . Pr ather s
is no one who can define humanitarian ism or morality when
city held . The settlement was arguably an inevitable in light
Over drive is a piece I ve talked

INTERVAL -21.604 - -5.226
CONTAINS 15.066%

Or , as he once told an interviewer ,
. For instance , she argues that Hume 's instrumental ism
ter . You DO NOT need to remove the four 10
is the eastern short - eared rock - wall aby .
in the Mil burn Stand ). They will then

INTERVAL -37.982 - -21.604
CONTAINS 0.447%

loving long - term relationship has reached that point where ,
Week 13 . Ryan also has scored 20 Fantasy points outdoors
and territories Australia has six states New South
the opposite of what we have seen when we unp acked
into launch . The rocket had to adjust its trajectory to
Token to
Token ID284
Feature activation+6.57e+00

Pos logit contributions
 counterproductive+0.691
 futile+0.674
 undermines+0.654
 urgently+0.638
 vulnerable+0.635
Neg logit contributions
tymology-0.900
ledged-0.897
been-0.880
essor-0.861
 origins-0.855
Token levels
Token ID2974
Feature activation+2.44e+00

Pos logit contributions
 counterproductive+5.244
 futile+5.076
 vulnerable+4.947
 undermines+4.947
 urgently+4.899
Neg logit contributions
tymology-6.536
ledged-6.460
been-6.269
essor-6.132
iths-6.106
Token where
Token ID810
Feature activation+4.05e+00

Pos logit contributions
 counterproductive+1.900
 futile+1.846
 undermines+1.794
 vulnerable+1.767
 urgently+1.766
Neg logit contributions
tymology-2.492
ledged-2.474
been-2.420
essor-2.364
iths-2.352
Token it
Token ID340
Feature activation+1.23e+01

Pos logit contributions
 counterproductive+3.033
 futile+2.942
 undermines+2.861
 vulnerable+2.818
 urgently+2.816
Neg logit contributions
tymology-4.246
ledged-4.229
been-4.135
essor-4.021
iths-4.005
Token becomes
Token ID4329
Feature activation+1.44e+01

Pos logit contributions
 undermines+7.060
 counterproductive+7.029
 futile+6.843
 urgently+6.764
 threatens+6.761
Neg logit contributions
tymology-14.162
ledged-14.106
been-13.682
essor-13.457
iths-13.354
Token totally
Token ID6635
Feature activation+2.75e+01

Pos logit contributions
 counterproductive+11.500
 futile+10.964
 harder+10.763
 vulnerable+10.656
 unsafe+10.538
Neg logit contributions
tymology-13.778
ledged-13.558
been-13.079
essor-12.837
iths-12.740
Token unaff
Token ID25312
Feature activation+1.81e+00

Pos logit contributions
 counterproductive+21.038
 futile+20.040
 unsafe+19.113
 vulnerable+19.101
 pointless+18.929
Neg logit contributions
ledged-22.323
tymology-22.008
essor-20.471
been-20.377
iths-19.688
Tokenordable
Token ID16819
Feature activation+2.47e+00

Pos logit contributions
 counterproductive+1.223
 futile+1.185
 undermines+1.146
 urgently+1.118
 vulnerable+1.117
Neg logit contributions
tymology-2.036
ledged-2.024
been-1.986
essor-1.945
iths-1.936
Token for
Token ID329
Feature activation+4.69e+00

Pos logit contributions
 counterproductive+1.891
 futile+1.839
 undermines+1.791
 urgently+1.763
 vulnerable+1.758
Neg logit contributions
tymology-2.566
ledged-2.545
been-2.492
essor-2.430
iths-2.423
Token people
Token ID661
Feature activation+7.57e-01

Pos logit contributions
 counterproductive+3.912
 futile+3.811
 vulnerable+3.728
 undermines+3.709
 urgently+3.669
Neg logit contributions
tymology-4.522
ledged-4.507
been-4.383
iths-4.257
essor-4.249
Token.
Token ID13
Feature activation+2.42e+00

Pos logit contributions
 counterproductive+0.614
 futile+0.599
 undermines+0.581
 urgently+0.568
 vulnerable+0.568
Neg logit contributions
tymology-0.745
ledged-0.739
been-0.726
essor-0.708
iths-0.703
Token this
Token ID428
Feature activation+9.95e+00

Pos logit contributions
tymology+4.263
ledged+4.177
been+4.150
essor+4.049
 origins+4.031
Neg logit contributions
 counterproductive-3.783
 futile-3.690
 undermines-3.618
 urgently-3.523
 vulnerable-3.470
Token attack
Token ID1368
Feature activation+9.32e+00

Pos logit contributions
 counterproductive+9.232
 futile+9.041
 undermines+8.888
 vulnerable+8.633
 urgently+8.606
Neg logit contributions
ledged-8.434
tymology-8.377
been-8.051
essor-7.986
iths-7.845
Token on
Token ID319
Feature activation+1.00e+01

Pos logit contributions
 counterproductive+6.789
 undermines+6.725
 futile+6.694
 vulnerable+6.427
 threatens+6.412
Neg logit contributions
ledged-9.510
tymology-9.495
been-9.298
essor-9.108
iths-9.086
Token encryption
Token ID15835
Feature activation+1.23e+01

Pos logit contributions
 counterproductive+7.536
 futile+7.461
 vulnerable+7.377
 undermines+7.178
 harder+7.103
Neg logit contributions
ledged-9.873
tymology-9.865
been-9.629
iths-9.459
essor-9.265
Token is
Token ID318
Feature activation+1.05e+01

Pos logit contributions
 undermines+9.865
 counterproductive+9.828
 futile+9.658
 threatens+9.405
 urgently+9.292
Neg logit contributions
ledged-11.520
tymology-11.376
been-11.004
essor-10.911
iths-10.878
Token utterly
Token ID15950
Feature activation+2.47e+01

Pos logit contributions
 counterproductive+8.935
 futile+8.669
 harder+8.244
 urgently+8.231
 vulnerable+8.219
Neg logit contributions
tymology-9.812
ledged-9.705
essor-9.423
been-9.311
iths-9.069
Token baff
Token ID25989
Feature activation+4.81e+00

Pos logit contributions
 counterproductive+20.111
 futile+19.481
 pointless+18.125
 vulnerable+17.956
 undermines+17.906
Neg logit contributions
ledged-19.850
tymology-19.418
essor-19.341
been-18.224
iths-17.533
Tokenling
Token ID1359
Feature activation+6.51e-01

Pos logit contributions
 counterproductive+3.901
 futile+3.824
 undermines+3.688
 vulnerable+3.643
 urgently+3.640
Neg logit contributions
tymology-4.687
ledged-4.620
been-4.550
essor-4.468
iths-4.426
Token,
Token ID11
Feature activation+3.81e+00

Pos logit contributions
 counterproductive+0.556
 futile+0.542
 undermines+0.528
 urgently+0.519
 vulnerable+0.518
Neg logit contributions
tymology-0.616
ledged-0.612
been-0.598
essor-0.586
iths-0.581
Token unless
Token ID4556
Feature activation-4.06e+00

Pos logit contributions
 counterproductive+3.095
 futile+3.018
 undermines+2.928
 urgently+2.870
 vulnerable+2.870
Neg logit contributions
tymology-3.781
ledged-3.760
been-3.654
essor-3.602
iths-3.555
Token it
Token ID340
Feature activation+2.85e+00

Pos logit contributions
tymology+4.031
ledged+3.975
been+3.934
 origins+3.837
iths+3.825
Neg logit contributions
 counterproductive-3.123
 futile-3.048
 undermines-2.963
 urgently-2.898
 vulnerable-2.867
Token can
Token ID460
Feature activation-2.22e+00

Pos logit contributions
 counterproductive+3.499
 futile+3.407
 undermines+3.315
 urgently+3.242
 vulnerable+3.236
Neg logit contributions
tymology-5.219
ledged-5.187
been-5.086
essor-4.975
iths-4.945
Token make
Token ID787
Feature activation+5.47e+00

Pos logit contributions
tymology+2.211
ledged+2.178
been+2.139
 origins+2.104
essor+2.095
Neg logit contributions
 counterproductive-1.709
 futile-1.648
 undermines-1.598
 unsafe-1.550
 vulnerable-1.547
Token the
Token ID262
Feature activation+4.17e+00

Pos logit contributions
 counterproductive+4.300
 futile+4.216
 vulnerable+4.052
 undermines+4.031
 urgently+3.998
Neg logit contributions
tymology-5.507
ledged-5.504
been-5.380
essor-5.239
iths-5.237
Token ultimate
Token ID8713
Feature activation+8.17e+00

Pos logit contributions
 counterproductive+3.632
 futile+3.554
 undermines+3.434
 vulnerable+3.413
 urgently+3.395
Neg logit contributions
tymology-3.869
ledged-3.855
been-3.758
essor-3.649
iths-3.640
Token end
Token ID886
Feature activation+3.71e-01

Pos logit contributions
 counterproductive+7.806
 futile+7.619
 undermines+7.432
 vulnerable+7.387
 harder+7.323
Neg logit contributions
ledged-6.750
tymology-6.643
been-6.567
iths-6.323
essor-6.299
Token significantly
Token ID5566
Feature activation+2.45e+01

Pos logit contributions
 counterproductive+0.294
 futile+0.286
 undermines+0.278
 urgently+0.273
 vulnerable+0.272
Neg logit contributions
tymology-0.373
ledged-0.369
been-0.363
essor-0.355
iths-0.352
Token more
Token ID517
Feature activation+1.92e+01

Pos logit contributions
 harder+18.853
 counterproductive+18.054
 futile+17.958
 vulnerable+17.467
 safer+17.228
Neg logit contributions
ledged-19.997
tymology-19.515
been-18.905
essor-18.888
iths-18.778
Token expensive
Token ID5789
Feature activation+4.06e+00

Pos logit contributions
 counterproductive+13.974
 futile+13.857
 vulnerable+13.654
 urgently+13.211
 harder+13.202
Neg logit contributions
ledged-18.078
tymology-17.801
been-17.517
iths-17.450
essor-17.001
Token.
Token ID13
Feature activation+3.08e+00

Pos logit contributions
 counterproductive+3.092
 futile+3.017
 undermines+2.916
 urgently+2.879
 vulnerable+2.873
Neg logit contributions
tymology-4.186
ledged-4.151
been-4.084
essor-3.973
iths-3.961
Token
Token ID198
Feature activation-2.41e+00

Pos logit contributions
 counterproductive+2.221
 futile+2.159
 undermines+2.093
 urgently+2.048
 vulnerable+2.046
Neg logit contributions
tymology-3.315
ledged-3.291
been-3.227
essor-3.157
iths-3.140
Token
Token ID198
Feature activation+4.79e+00

Pos logit contributions
been+2.218
ledged+2.204
tymology+2.160
 origins+2.111
essor+2.109
Neg logit contributions
 counterproductive-2.087
 futile-2.038
 undermines-1.997
 vulnerable-1.909
 tightening-1.875
TokenUn
Token ID3118
Feature activation-4.03e+00

Pos logit contributions
 counterproductive+3.055
 futile+2.960
 undermines+2.886
 urgently+2.837
 vulnerable+2.827
Neg logit contributions
tymology-4.462
ledged-4.436
been-4.315
essor-4.248
iths-4.219
Tokencertain
Token ID39239
Feature activation-3.36e+00

Pos logit contributions
tymology+3.176
ledged+3.153
been+3.082
essor+2.979
iths+2.970
Neg logit contributions
 counterproductive-4.049
 futile-3.969
 undermines-3.848
 vulnerable-3.777
 urgently-3.773
Tokenty
Token ID774
Feature activation+1.14e+01

Pos logit contributions
tymology+2.690
ledged+2.650
been+2.564
 origins+2.525
iths+2.473
Neg logit contributions
 counterproductive-3.346
 futile-3.281
 undermines-3.181
 urgently-3.141
 vulnerable-3.124
Token is
Token ID318
Feature activation+1.03e+01

Pos logit contributions
 counterproductive+7.547
 undermines+7.512
 futile+7.387
 threatens+7.249
 urgently+7.191
Neg logit contributions
tymology-11.932
ledged-11.886
been-11.698
essor-11.486
iths-11.474
Token making
Token ID1642
Feature activation+8.95e+00

Pos logit contributions
 counterproductive+7.902
 futile+7.691
 urgently+7.388
 vulnerable+7.367
 undermines+7.354
Neg logit contributions
tymology-10.346
ledged-10.211
essor-9.926
been-9.923
iths-9.660
Token Britain
Token ID5491
Feature activation+2.44e+01

Pos logit contributions
 counterproductive+7.355
 futile+7.215
 vulnerable+7.057
 urgently+6.954
 undermines+6.951
Neg logit contributions
ledged-8.561
tymology-8.361
been-8.231
essor-8.013
iths-7.921
Token a
Token ID257
Feature activation+1.75e+01

Pos logit contributions
 vulnerable+20.508
 harder+19.967
 unsafe+19.871
 counterproductive+19.408
 futile+19.306
Neg logit contributions
ledged-18.246
tymology-17.180
been-16.648
iths-16.552
essor-16.220
Token less
Token ID1342
Feature activation+1.03e+01

Pos logit contributions
 counterproductive+13.829
 harder+13.613
 vulnerable+13.597
 futile+13.586
 unsafe+13.191
Neg logit contributions
tymology-15.564
ledged-15.503
been-14.748
iths-14.438
essor-14.337
Token attractive
Token ID10966
Feature activation+5.55e+00

Pos logit contributions
 counterproductive+6.336
 vulnerable+6.165
 futile+6.156
 unsafe+5.960
 urgently+5.870
Neg logit contributions
ledged-11.762
tymology-11.722
been-11.453
iths-11.272
essor-11.186
Token destination
Token ID10965
Feature activation+9.96e-02

Pos logit contributions
 counterproductive+4.245
 futile+4.127
 undermines+4.008
 vulnerable+3.991
 urgently+3.947
Neg logit contributions
ledged-5.666
tymology-5.661
been-5.506
essor-5.397
iths-5.375
Token,
Token ID11
Feature activation+7.73e+00

Pos logit contributions
 counterproductive+0.070
 futile+0.068
 undermines+0.066
 urgently+0.064
 vulnerable+0.064
Neg logit contributions
tymology-0.109
ledged-0.108
been-0.106
essor-0.104
iths-0.103
Token Madrid
Token ID14708
Feature activation+7.29e+00

Pos logit contributions
 counterproductive+3.311
 futile+3.245
 urgently+3.028
 undermines+3.026
 vulnerable+3.009
Neg logit contributions
tymology-6.979
ledged-6.933
been-6.749
essor-6.698
iths-6.597
Token
Token ID198
Feature activation+2.95e-01

Pos logit contributions
 counterproductive+5.207
 futile+5.131
 undermines+4.980
 urgently+4.928
 vulnerable+4.858
Neg logit contributions
tymology-7.684
ledged-7.542
been-7.317
essor-7.263
iths-7.163
Token
Token ID198
Feature activation+7.71e+00

Pos logit contributions
 counterproductive+0.239
 futile+0.232
 undermines+0.226
 vulnerable+0.219
 urgently+0.217
Neg logit contributions
ledged-0.291
been-0.290
tymology-0.288
 origins-0.278
iths-0.278
TokenDri
Token ID20564
Feature activation+8.30e+00

Pos logit contributions
 counterproductive+5.312
 futile+5.214
 undermines+5.040
 urgently+4.984
 vulnerable+4.946
Neg logit contributions
tymology-8.196
ledged-8.055
been-7.767
essor-7.736
iths-7.671
Tokenving
Token ID1075
Feature activation+2.30e+01

Pos logit contributions
 counterproductive+7.601
 futile+7.523
 undermines+7.265
 urgently+7.207
 vulnerable+7.180
Neg logit contributions
tymology-6.689
ledged-6.625
been-6.415
 origins-6.334
essor-6.231
Token slowly
Token ID6364
Feature activation+2.44e+01

Pos logit contributions
 futile+15.870
 counterproductive+15.696
 harder+15.654
 urgently+15.559
 vulnerable+15.284
Neg logit contributions
tymology-19.190
ledged-18.789
been-18.313
essor-18.148
iths-17.648
Token along
Token ID1863
Feature activation+1.53e+01

Pos logit contributions
 futile+12.887
 counterproductive+12.769
 vulnerable+12.401
 harder+12.386
 urgently+12.358
Neg logit contributions
tymology-23.920
ledged-22.774
essor-22.241
been-21.557
iths-20.876
Token the
Token ID262
Feature activation+1.06e+01

Pos logit contributions
 futile+8.000
 counterproductive+7.850
 vulnerable+7.577
 urgently+7.365
 harder+7.360
Neg logit contributions
tymology-17.058
ledged-16.636
been-16.305
essor-15.983
iths-15.834
Token city
Token ID1748
Feature activation+6.67e+00

Pos logit contributions
 counterproductive+5.194
 futile+5.186
 vulnerable+4.865
 urgently+4.728
 harder+4.725
Neg logit contributions
tymology-13.285
ledged-13.091
been-12.741
iths-12.642
essor-12.514
Token's
Token ID338
Feature activation+6.53e+00

Pos logit contributions
 counterproductive+3.297
 futile+3.238
 undermines+3.078
 vulnerable+3.059
 urgently+3.035
Neg logit contributions
tymology-8.530
ledged-8.422
been-8.183
essor-8.153
iths-8.147
Token broad
Token ID3154
Feature activation-1.72e+00

Pos logit contributions
 counterproductive+3.639
 futile+3.605
 vulnerable+3.439
 urgently+3.353
 harder+3.333
Neg logit contributions
tymology-7.988
ledged-7.781
been-7.651
iths-7.586
essor-7.550
Token emissions
Token ID8971
Feature activation+1.42e+01

Pos logit contributions
 counterproductive+7.307
 undermines+7.021
 futile+6.981
 vulnerable+6.879
 threatens+6.795
Neg logit contributions
tymology-10.153
ledged-9.996
essor-9.786
been-9.759
iths-9.506
Token that
Token ID326
Feature activation+1.21e+01

Pos logit contributions
 counterproductive+11.355
 undermines+11.057
 futile+10.856
 threatens+10.813
 vulnerable+10.694
Neg logit contributions
tymology-13.319
ledged-13.152
essor-12.744
been-12.694
iths-12.326
Token would
Token ID561
Feature activation+4.64e+00

Pos logit contributions
 counterproductive+9.147
 undermines+8.896
 futile+8.862
 urgently+8.706
 threatens+8.672
Neg logit contributions
ledged-12.005
tymology-11.974
been-11.659
essor-11.372
iths-11.270
Token make
Token ID787
Feature activation+7.08e+00

Pos logit contributions
 counterproductive+3.725
 futile+3.644
 undermines+3.543
 urgently+3.491
 vulnerable+3.477
Neg logit contributions
tymology-4.612
ledged-4.591
been-4.484
essor-4.383
iths-4.345
Token global
Token ID3298
Feature activation+9.68e+00

Pos logit contributions
 counterproductive+5.452
 futile+5.318
 vulnerable+5.145
 undermines+5.111
 urgently+5.063
Neg logit contributions
tymology-7.207
ledged-7.201
been-7.020
essor-6.865
iths-6.819
Token warming
Token ID9917
Feature activation+2.42e+01

Pos logit contributions
 counterproductive+5.405
 futile+5.209
 vulnerable+5.100
 undermines+4.939
 harder+4.921
Neg logit contributions
ledged-11.633
tymology-11.584
been-11.318
essor-11.197
iths-10.940
Token worse
Token ID4785
Feature activation+1.03e+01

Pos logit contributions
 counterproductive+20.095
 harder+19.625
 futile+19.517
 vulnerable+19.349
 unsafe+19.175
Neg logit contributions
ledged-18.451
tymology-18.121
been-17.443
essor-17.083
iths-15.593
Token are
Token ID389
Feature activation+3.45e+00

Pos logit contributions
 counterproductive+6.827
 undermines+6.696
 futile+6.563
 urgently+6.369
 threatens+6.365
Neg logit contributions
tymology-11.318
ledged-11.177
been-10.884
essor-10.852
iths-10.585
Token "
Token ID366
Feature activation+4.31e+00

Pos logit contributions
 counterproductive+2.785
 futile+2.703
 undermines+2.623
 urgently+2.601
 vulnerable+2.584
Neg logit contributions
tymology-3.444
ledged-3.406
been-3.324
essor-3.273
iths-3.234
Tokenessential
Token ID31195
Feature activation-8.49e-02

Pos logit contributions
 counterproductive+4.204
 futile+4.100
 undermines+3.994
 urgently+3.964
 vulnerable+3.947
Neg logit contributions
tymology-3.527
ledged-3.491
been-3.331
essor-3.327
iths-3.281
Token parts
Token ID3354
Feature activation-4.22e+00

Pos logit contributions
tymology+0.085
ledged+0.084
been+0.083
iths+0.080
 origins+0.080
Neg logit contributions
 counterproductive-0.067
 futile-0.066
 undermines-0.064
 vulnerable-0.062
 urgently-0.062
Token out
Token ID503
Feature activation+8.70e+00

Pos logit contributions
 futile+13.619
 counterproductive+13.157
 harder+12.882
 urgently+12.835
 unsafe+12.595
Neg logit contributions
tymology-20.768
ledged-20.167
essor-20.000
been-19.274
iths-18.685
Token of
Token ID286
Feature activation+5.21e+00

Pos logit contributions
 counterproductive+6.221
 futile+6.150
 undermines+5.868
 urgently+5.833
 vulnerable+5.826
Neg logit contributions
tymology-9.064
ledged-8.909
been-8.736
essor-8.640
iths-8.480
Token there
Token ID612
Feature activation-3.52e+00

Pos logit contributions
 counterproductive+3.897
 futile+3.827
 undermines+3.678
 vulnerable+3.654
 urgently+3.620
Neg logit contributions
tymology-5.413
ledged-5.312
been-5.213
essor-5.119
iths-5.051
Token,
Token ID11
Feature activation+3.26e+00

Pos logit contributions
ledged+3.448
tymology+3.433
been+3.358
 origins+3.267
iths+3.257
Neg logit contributions
 counterproductive-2.797
 futile-2.702
 undermines-2.636
 vulnerable-2.550
 urgently-2.546
Token smoke
Token ID7523
Feature activation+1.61e+01

Pos logit contributions
 counterproductive+2.428
 futile+2.364
 undermines+2.290
 vulnerable+2.244
 urgently+2.244
Neg logit contributions
tymology-3.429
ledged-3.405
been-3.334
essor-3.263
iths-3.243
Token pouring
Token ID23147
Feature activation+2.40e+01

Pos logit contributions
 counterproductive+11.639
 futile+11.598
 undermines+11.136
 vulnerable+11.134
 harder+11.073
Neg logit contributions
tymology-14.811
ledged-14.568
essor-14.279
been-13.760
iths-13.620
Token out
Token ID503
Feature activation+9.55e+00

Pos logit contributions
 futile+15.345
 counterproductive+14.717
 harder+14.591
 urgently+14.243
 unsafe+14.193
Neg logit contributions
tymology-21.360
ledged-20.818
essor-20.460
been-20.082
iths-19.328
Token of
Token ID286
Feature activation+6.63e+00

Pos logit contributions
 counterproductive+6.702
 futile+6.687
 undermines+6.325
 vulnerable+6.318
 urgently+6.312
Neg logit contributions
tymology-9.955
ledged-9.784
been-9.530
essor-9.407
iths-9.267
Token the
Token ID262
Feature activation+2.17e+00

Pos logit contributions
 counterproductive+4.722
 futile+4.641
 undermines+4.452
 vulnerable+4.432
 unsafe+4.378
Neg logit contributions
tymology-7.075
ledged-6.918
been-6.785
essor-6.661
iths-6.578
Token tank
Token ID6873
Feature activation-3.07e-01

Pos logit contributions
 counterproductive+1.670
 futile+1.625
 undermines+1.579
 urgently+1.546
 vulnerable+1.544
Neg logit contributions
tymology-2.227
ledged-2.212
been-2.169
essor-2.119
iths-2.106
Token.
Token ID13
Feature activation+2.28e+00

Pos logit contributions
tymology+0.312
ledged+0.311
been+0.305
essor+0.297
iths+0.296
Neg logit contributions
 counterproductive-0.239
 futile-0.232
 undermines-0.225
 urgently-0.219
 vulnerable-0.219
Token.
Token ID13
Feature activation+7.47e-01

Pos logit contributions
 counterproductive+10.138
 futile+9.874
 urgently+9.588
 undermines+9.489
 vulnerable+9.446
Neg logit contributions
tymology-16.074
ledged-15.252
been-15.090
essor-15.060
iths-14.791
Token
Token ID198
Feature activation+1.34e+00

Pos logit contributions
 counterproductive+0.515
 futile+0.500
 undermines+0.485
 urgently+0.475
 vulnerable+0.473
Neg logit contributions
tymology-0.831
ledged-0.822
been-0.802
essor-0.788
iths-0.783
Token
Token ID198
Feature activation+7.81e+00

Pos logit contributions
 counterproductive+1.066
 futile+1.038
 undermines+1.009
 vulnerable+0.975
 urgently+0.969
Neg logit contributions
ledged-1.344
been-1.331
tymology-1.329
essor-1.284
iths-1.281
Token"
Token ID1
Feature activation+6.30e+00

Pos logit contributions
 counterproductive+5.142
 futile+4.951
 undermines+4.791
 urgently+4.749
 vulnerable+4.678
Neg logit contributions
tymology-8.600
ledged-8.456
essor-8.191
been-8.086
iths-8.060
TokenAir
Token ID16170
Feature activation+8.88e+00

Pos logit contributions
 counterproductive+4.127
 futile+3.999
 undermines+3.862
 urgently+3.835
 vulnerable+3.759
Neg logit contributions
tymology-7.093
ledged-6.985
essor-6.758
been-6.754
iths-6.673
Token strikes
Token ID8956
Feature activation+2.38e+01

Pos logit contributions
 counterproductive+5.121
 futile+5.052
 undermines+4.833
 unsafe+4.692
 vulnerable+4.691
Neg logit contributions
tymology-10.401
ledged-10.387
essor-10.089
been-10.056
iths-9.844
Token were
Token ID547
Feature activation+1.23e+01

Pos logit contributions
 counterproductive+18.553
 futile+18.389
 urgently+18.304
 undermines+18.298
 threatens+17.972
Neg logit contributions
tymology-17.738
ledged-16.745
essor-16.608
iths-15.801
been-15.592
Token carried
Token ID5281
Feature activation+9.69e+00

Pos logit contributions
 counterproductive+9.571
 futile+9.336
 urgently+8.945
 vulnerable+8.641
 harder+8.638
Neg logit contributions
tymology-12.292
ledged-11.908
essor-11.613
been-11.433
iths-11.228
Token out
Token ID503
Feature activation+1.20e+01

Pos logit contributions
 counterproductive+2.427
 futile+2.351
 urgently+2.065
 harder+1.941
 vulnerable+1.894
Neg logit contributions
tymology-14.561
ledged-14.249
been-14.120
essor-14.010
iths-13.810
Token to
Token ID284
Feature activation+1.07e+01

Pos logit contributions
 counterproductive+8.572
 futile+8.450
 urgently+8.302
 undermines+8.091
 harder+8.057
Neg logit contributions
tymology-12.435
ledged-11.969
been-11.774
essor-11.658
iths-11.511
Token target
Token ID2496
Feature activation+1.06e+01

Pos logit contributions
 counterproductive+7.355
 futile+7.287
 urgently+7.001
 undermines+6.965
 vulnerable+6.895
Neg logit contributions
tymology-11.567
ledged-11.373
been-11.152
essor-11.022
iths-10.783
Token warns
Token ID22145
Feature activation+5.03e+00

Pos logit contributions
 counterproductive+1.354
 futile+1.315
 undermines+1.270
 urgently+1.243
 vulnerable+1.243
Neg logit contributions
tymology-2.051
ledged-2.040
been-2.000
essor-1.954
iths-1.947
Token the
Token ID262
Feature activation+5.32e+00

Pos logit contributions
 counterproductive+3.899
 futile+3.780
 undermines+3.680
 vulnerable+3.657
 urgently+3.643
Neg logit contributions
tymology-5.146
ledged-5.087
been-5.033
essor-4.855
iths-4.819
Token policy
Token ID2450
Feature activation+4.01e+00

Pos logit contributions
 counterproductive+3.722
 futile+3.621
 vulnerable+3.451
 undermines+3.443
 urgently+3.415
Neg logit contributions
tymology-5.871
ledged-5.836
been-5.699
essor-5.573
iths-5.562
Token would
Token ID561
Feature activation+6.92e+00

Pos logit contributions
 counterproductive+2.429
 futile+2.343
 undermines+2.278
 urgently+2.202
 vulnerable+2.201
Neg logit contributions
tymology-4.805
ledged-4.770
been-4.690
essor-4.597
iths-4.575
Token "
Token ID366
Feature activation+6.80e+00

Pos logit contributions
 counterproductive+5.194
 futile+5.113
 undermines+4.964
 urgently+4.893
 vulnerable+4.885
Neg logit contributions
tymology-7.220
ledged-7.198
been-7.002
essor-6.935
iths-6.788
Tokendrive
Token ID19472
Feature activation+2.38e+01

Pos logit contributions
 counterproductive+6.365
 futile+6.247
 undermines+6.129
 vulnerable+6.015
 urgently+6.001
Neg logit contributions
tymology-5.641
ledged-5.584
essor-5.367
 origins-5.319
been-5.292
Token away
Token ID1497
Feature activation+1.38e+01

Pos logit contributions
 counterproductive+20.061
 futile+19.191
 vulnerable+19.061
 harder+19.034
 undermines+18.414
Neg logit contributions
ledged-17.417
tymology-16.389
been-16.188
essor-15.517
iths-14.810
Token inward
Token ID29879
Feature activation+7.00e+00

Pos logit contributions
 counterproductive+11.871
 vulnerable+11.574
 futile+11.463
 urgently+11.237
 undermines+11.236
Neg logit contributions
ledged-11.968
tymology-11.671
been-11.179
essor-10.943
iths-10.750
Token investment
Token ID4896
Feature activation+1.09e+01

Pos logit contributions
 counterproductive+5.392
 futile+5.218
 undermines+5.111
 vulnerable+5.090
 urgently+5.026
Neg logit contributions
ledged-7.166
tymology-7.075
been-6.828
essor-6.786
iths-6.551
Token".
Token ID1911
Feature activation+2.04e+00

Pos logit contributions
 counterproductive+7.041
 futile+6.821
 undermines+6.736
 urgently+6.646
 vulnerable+6.581
Neg logit contributions
tymology-11.962
ledged-11.918
been-11.518
essor-11.501
iths-11.154
Token
Token ID198
Feature activation-3.67e-01

Pos logit contributions
 counterproductive+1.505
 futile+1.464
 undermines+1.417
 vulnerable+1.385
 urgently+1.383
Neg logit contributions
tymology-2.155
ledged-2.149
been-2.112
essor-2.064
iths-2.052
Token economists
Token ID16816
Feature activation+1.41e+01

Pos logit contributions
 counterproductive+2.815
 futile+2.737
 undermines+2.663
 urgently+2.618
 vulnerable+2.617
Neg logit contributions
tymology-3.744
ledged-3.710
been-3.627
essor-3.550
iths-3.518
Token think
Token ID892
Feature activation+8.04e-01

Pos logit contributions
 counterproductive+10.608
 urgently+10.421
 futile+10.394
 vulnerable+10.296
 harder+10.045
Neg logit contributions
ledged-13.250
tymology-13.160
been-12.699
essor-12.471
iths-12.278
Token this
Token ID428
Feature activation+1.51e+01

Pos logit contributions
 counterproductive+0.626
 futile+0.611
 undermines+0.592
 urgently+0.578
 vulnerable+0.577
Neg logit contributions
tymology-0.811
ledged-0.807
been-0.791
essor-0.774
iths-0.773
Token will
Token ID481
Feature activation+3.95e+00

Pos logit contributions
 counterproductive+10.889
 undermines+10.602
 futile+10.515
 threatens+10.131
 vulnerable+10.105
Neg logit contributions
ledged-15.119
tymology-15.118
been-14.450
essor-14.441
iths-14.181
Token be
Token ID307
Feature activation+1.67e+01

Pos logit contributions
 counterproductive+3.327
 futile+3.259
 undermines+3.167
 urgently+3.125
 vulnerable+3.119
Neg logit contributions
tymology-3.775
ledged-3.758
been-3.662
essor-3.586
iths-3.543
Token pushed
Token ID7121
Feature activation+2.36e+01

Pos logit contributions
 counterproductive+14.196
 futile+13.610
 harder+13.298
 vulnerable+13.189
 urgently+12.801
Neg logit contributions
ledged-14.814
tymology-14.678
essor-14.380
been-14.053
iths-13.496
Token back
Token ID736
Feature activation+1.39e+01

Pos logit contributions
 harder+15.576
 counterproductive+15.357
 futile+14.935
 vulnerable+14.524
 urgently+14.468
Neg logit contributions
tymology-21.832
ledged-21.195
essor-20.675
been-20.523
iths-19.636
Token due
Token ID2233
Feature activation+1.11e+00

Pos logit contributions
 counterproductive+10.511
 futile+10.221
 harder+10.202
 urgently+10.107
 vulnerable+9.946
Neg logit contributions
tymology-13.757
ledged-13.421
been-12.901
essor-12.822
iths-12.408
Token to
Token ID284
Feature activation+6.84e+00

Pos logit contributions
 counterproductive+0.959
 futile+0.941
 undermines+0.915
 vulnerable+0.892
 urgently+0.892
Neg logit contributions
tymology-1.026
ledged-1.015
been-0.998
iths-0.972
essor-0.972
Token recent
Token ID2274
Feature activation+1.94e+00

Pos logit contributions
 counterproductive+4.951
 futile+4.789
 vulnerable+4.640
 undermines+4.601
 urgently+4.579
Neg logit contributions
tymology-7.348
ledged-7.296
been-7.075
essor-6.936
iths-6.888
Token poor
Token ID3595
Feature activation-1.16e-01

Pos logit contributions
 counterproductive+1.399
 futile+1.363
 undermines+1.323
 urgently+1.290
 vulnerable+1.283
Neg logit contributions
tymology-2.070
ledged-2.054
been-2.018
 origins-1.976
essor-1.973
Token are
Token ID389
Feature activation+7.31e+00

Pos logit contributions
 counterproductive+1.274
 futile+1.238
 undermines+1.202
 vulnerable+1.174
 urgently+1.173
Neg logit contributions
tymology-1.777
ledged-1.764
been-1.731
essor-1.689
iths-1.679
Token always
Token ID1464
Feature activation+1.05e+01

Pos logit contributions
 counterproductive+5.508
 futile+5.373
 undermines+5.168
 vulnerable+5.150
 urgently+5.110
Neg logit contributions
tymology-7.548
ledged-7.467
been-7.268
essor-7.187
iths-7.119
Token trying
Token ID2111
Feature activation+3.90e-01

Pos logit contributions
 counterproductive+8.025
 futile+7.851
 vulnerable+7.640
 undermines+7.549
 harder+7.484
Neg logit contributions
tymology-10.516
ledged-10.409
essor-9.969
been-9.942
iths-9.901
Token to
Token ID284
Feature activation+8.03e+00

Pos logit contributions
 counterproductive+0.315
 futile+0.308
 undermines+0.297
 vulnerable+0.294
 harder+0.292
Neg logit contributions
tymology-0.391
ledged-0.388
been-0.379
essor-0.371
iths-0.367
Token throw
Token ID3714
Feature activation+1.05e+01

Pos logit contributions
 counterproductive+6.074
 futile+5.979
 undermines+5.809
 vulnerable+5.731
 urgently+5.683
Neg logit contributions
tymology-8.182
ledged-8.174
been-7.962
essor-7.806
iths-7.751
Token bullets
Token ID16043
Feature activation+2.34e+01

Pos logit contributions
 counterproductive+8.155
 futile+8.018
 vulnerable+7.833
 undermines+7.768
 harder+7.668
Neg logit contributions
tymology-10.114
ledged-9.920
been-9.829
essor-9.624
iths-9.572
Token at
Token ID379
Feature activation+1.43e+01

Pos logit contributions
 harder+15.313
 counterproductive+15.234
 futile+14.907
 vulnerable+14.846
 undermines+14.589
Neg logit contributions
tymology-20.147
essor-19.168
ledged-19.162
been-18.747
iths-17.976
Token me
Token ID502
Feature activation+4.76e+00

Pos logit contributions
 vulnerable+9.534
 futile+9.522
 counterproductive+9.476
 harder+9.019
 undermines+8.958
Neg logit contributions
tymology-14.553
ledged-13.950
been-13.908
iths-13.697
essor-13.463
Token,
Token ID11
Feature activation+2.05e+00

Pos logit contributions
 counterproductive+3.619
 futile+3.537
 undermines+3.434
 vulnerable+3.377
 urgently+3.374
Neg logit contributions
tymology-4.901
ledged-4.853
been-4.752
essor-4.654
iths-4.640
Token
Token ID447
Feature activation+7.28e+00

Pos logit contributions
 counterproductive+1.624
 futile+1.582
 undermines+1.538
 urgently+1.507
 vulnerable+1.506
Neg logit contributions
tymology-2.055
ledged-2.040
been-1.999
essor-1.952
iths-1.941
Token
Token ID251
Feature activation+9.98e+00

Pos logit contributions
 counterproductive+6.738
 futile+6.567
 undermines+6.431
 vulnerable+6.416
 urgently+6.273
Neg logit contributions
tymology-6.093
ledged-6.010
been-5.795
essor-5.731
iths-5.708
Token become
Token ID1716
Feature activation+8.60e+00

Pos logit contributions
been+20.892
 origins+20.723
ledged+19.920
 ancestors+19.696
iths+19.609
Neg logit contributions
 counterproductive-11.156
 futile-11.010
 undermines-10.563
Repeat-10.320
-10.280
Token more
Token ID517
Feature activation+1.07e+01

Pos logit contributions
 counterproductive+6.553
 futile+6.295
 vulnerable+6.177
 harder+6.064
 undermines+6.057
Neg logit contributions
tymology-8.923
ledged-8.632
been-8.511
essor-8.391
iths-8.309
Token powerful
Token ID3665
Feature activation+6.47e+00

Pos logit contributions
 counterproductive+8.097
 futile+7.835
 vulnerable+7.774
 undermines+7.522
 urgently+7.517
Neg logit contributions
tymology-10.963
ledged-10.670
been-10.511
essor-10.368
iths-10.367
Token now
Token ID783
Feature activation+4.89e-01

Pos logit contributions
 counterproductive+5.937
 futile+5.801
 undermines+5.774
 urgently+5.628
 vulnerable+5.598
Neg logit contributions
tymology-5.670
ledged-5.604
been-5.433
essor-5.349
iths-5.252
Token makes
Token ID1838
Feature activation+9.42e-01

Pos logit contributions
 counterproductive+0.299
 futile+0.289
 undermines+0.270
 vulnerable+0.269
 urgently+0.267
Neg logit contributions
tymology-0.573
ledged-0.573
been-0.564
iths-0.552
 origins-0.551
Token Damascus
Token ID21028
Feature activation+2.32e+01

Pos logit contributions
 counterproductive+0.807
 futile+0.788
 undermines+0.768
 urgently+0.753
 vulnerable+0.752
Neg logit contributions
tymology-0.885
ledged-0.878
been-0.859
essor-0.837
iths-0.833
Token wary
Token ID23300
Feature activation-1.67e+00

Pos logit contributions
 vulnerable+18.757
 counterproductive+18.501
 harder+18.306
 futile+17.993
 urgently+17.868
Neg logit contributions
ledged-17.790
tymology-17.259
been-16.575
essor-16.202
iths-15.793
Token.
Token ID13
Feature activation+1.85e-01

Pos logit contributions
tymology+1.552
ledged+1.541
been+1.502
essor+1.469
iths+1.457
Neg logit contributions
 counterproductive-1.428
 futile-1.397
 undermines-1.357
 vulnerable-1.328
 urgently-1.328
Token
Token ID198
Feature activation-8.27e-01

Pos logit contributions
 counterproductive+0.128
 futile+0.124
 undermines+0.121
 urgently+0.118
 vulnerable+0.118
Neg logit contributions
tymology-0.206
ledged-0.204
been-0.199
essor-0.196
iths-0.195
Token
Token ID198
Feature activation+6.70e+00

Pos logit contributions
ledged+0.787
been+0.785
tymology+0.770
 origins+0.751
iths+0.750
Neg logit contributions
 counterproductive-0.700
 futile-0.682
 undermines-0.664
 vulnerable-0.637
 urgently-0.632
TokenTurkey
Token ID31632
Feature activation+1.00e+01

Pos logit contributions
 counterproductive+4.385
 futile+4.200
 undermines+4.084
 urgently+4.060
 vulnerable+3.987
Neg logit contributions
tymology-7.475
ledged-7.363
essor-7.129
been-7.076
 origins-7.030
Token.
Token ID13
Feature activation+4.89e+00

Pos logit contributions
 counterproductive+1.561
 futile+1.522
 undermines+1.486
 vulnerable+1.457
 urgently+1.453
Neg logit contributions
tymology-1.828
ledged-1.814
been-1.775
essor-1.732
iths-1.724
Token We
Token ID775
Feature activation+7.27e+00

Pos logit contributions
 counterproductive+3.477
 futile+3.367
 undermines+3.292
 urgently+3.213
 vulnerable+3.210
Neg logit contributions
tymology-5.306
ledged-5.245
been-5.107
essor-5.008
iths-4.988
Token need
Token ID761
Feature activation-3.76e+00

Pos logit contributions
 counterproductive+5.399
 futile+5.262
 undermines+5.111
 urgently+5.100
 vulnerable+5.063
Neg logit contributions
tymology-7.443
ledged-7.435
been-7.227
iths-7.099
essor-7.051
Token to
Token ID284
Feature activation+7.53e+00

Pos logit contributions
tymology+3.661
ledged+3.626
been+3.567
iths+3.502
 origins+3.492
Neg logit contributions
 counterproductive-2.935
 futile-2.879
 undermines-2.797
 vulnerable-2.687
 unsafe-2.673
Token be
Token ID307
Feature activation+1.69e+01

Pos logit contributions
 counterproductive+5.676
 futile+5.571
 urgently+5.498
 undermines+5.390
 vulnerable+5.373
Neg logit contributions
ledged-7.721
tymology-7.689
been-7.446
essor-7.283
iths-7.266
Token absolutely
Token ID5543
Feature activation+2.31e+01

Pos logit contributions
 counterproductive+12.397
 vulnerable+12.228
 accountable+12.064
 futile+12.010
 urgently+11.993
Neg logit contributions
tymology-16.114
ledged-15.655
essor-14.756
been-14.661
iths-14.587
Token certain
Token ID1728
Feature activation-3.13e+00

Pos logit contributions
 counterproductive+16.764
 futile+16.531
 vulnerable+16.527
 accountable+16.191
 urgently+16.159
Neg logit contributions
tymology-20.238
ledged-19.050
essor-18.282
been-18.060
iths-17.687
Token the
Token ID262
Feature activation+2.58e+00

Pos logit contributions
ledged+2.923
tymology+2.921
been+2.830
essor+2.795
 origins+2.774
Neg logit contributions
 counterproductive-2.613
 futile-2.555
 undermines-2.471
 urgently-2.420
 vulnerable-2.413
Token FBI
Token ID5349
Feature activation+7.88e+00

Pos logit contributions
 counterproductive+1.810
 futile+1.757
 undermines+1.699
 vulnerable+1.665
 urgently+1.661
Neg logit contributions
tymology-2.838
ledged-2.820
been-2.765
essor-2.704
iths-2.691
Token is
Token ID318
Feature activation+5.93e+00

Pos logit contributions
 counterproductive+5.449
 futile+5.294
 undermines+5.289
 urgently+5.089
 vulnerable+5.054
Neg logit contributions
tymology-8.480
ledged-8.437
been-8.252
iths-8.070
essor-7.985
Token worthy
Token ID12733
Feature activation-5.75e+00

Pos logit contributions
 counterproductive+4.616
 futile+4.496
 undermines+4.353
 vulnerable+4.292
 urgently+4.286
Neg logit contributions
tymology-6.068
ledged-6.009
been-5.856
essor-5.730
iths-5.703
Tokenumbling
Token ID14739
Feature activation+1.41e+01

Pos logit contributions
tymology+1.247
ledged+1.241
been+1.217
essor+1.196
iths+1.183
Neg logit contributions
 counterproductive-0.943
 futile-0.916
 undermines-0.883
 urgently-0.864
 vulnerable-0.863
Token and
Token ID290
Feature activation+1.22e+01

Pos logit contributions
 counterproductive+10.652
 vulnerable+10.399
 futile+10.372
 undermines+10.343
 harder+10.107
Neg logit contributions
ledged-13.037
tymology-12.985
been-12.721
essor-12.187
iths-12.100
Token rolling
Token ID10708
Feature activation+1.61e+01

Pos logit contributions
 counterproductive+8.937
 futile+8.704
 undermines+8.557
 vulnerable+8.486
 harder+8.335
Neg logit contributions
tymology-12.157
ledged-12.117
been-11.774
essor-11.638
iths-11.315
Token iceberg
Token ID34849
Feature activation+7.58e+00

Pos logit contributions
 counterproductive+12.850
 undermines+12.602
 vulnerable+12.529
 futile+12.502
 harder+12.325
Neg logit contributions
tymology-13.825
ledged-13.778
been-13.296
essor-12.941
iths-12.517
Token pushes
Token ID20070
Feature activation+1.80e+01

Pos logit contributions
 counterproductive+6.044
 futile+5.919
 undermines+5.899
 vulnerable+5.735
 urgently+5.708
Neg logit contributions
tymology-7.336
ledged-7.334
been-7.192
essor-6.995
iths-6.861
Token water
Token ID1660
Feature activation+2.30e+01

Pos logit contributions
 harder+13.564
 counterproductive+13.399
 futile+13.272
 vulnerable+13.142
 urgently+12.760
Neg logit contributions
tymology-15.705
ledged-15.360
been-15.234
essor-15.013
iths-14.086
Token out
Token ID503
Feature activation+7.20e+00

Pos logit contributions
 harder+17.967
 futile+17.803
 counterproductive+17.637
 vulnerable+17.506
 undermines+17.089
Neg logit contributions
tymology-16.904
ledged-16.531
essor-16.515
been-16.030
iths-15.204
Tokenwards
Token ID2017
Feature activation+6.75e+00

Pos logit contributions
 counterproductive+5.277
 futile+5.173
 undermines+5.012
 vulnerable+4.960
 harder+4.947
Neg logit contributions
tymology-7.432
ledged-7.352
been-7.233
essor-7.110
iths-7.030
Token through
Token ID832
Feature activation+3.98e+00

Pos logit contributions
 counterproductive+4.667
 futile+4.572
 undermines+4.401
 harder+4.367
 vulnerable+4.362
Neg logit contributions
tymology-7.362
ledged-7.261
been-7.129
essor-7.050
iths-6.909
Token the
Token ID262
Feature activation+2.55e+00

Pos logit contributions
 counterproductive+2.642
 futile+2.568
 undermines+2.475
 vulnerable+2.448
 urgently+2.413
Neg logit contributions
tymology-4.483
ledged-4.438
been-4.360
essor-4.290
iths-4.235
Token f
Token ID277
Feature activation-3.68e+00

Pos logit contributions
 counterproductive+1.888
 futile+1.836
 undermines+1.781
 urgently+1.742
 vulnerable+1.740
Neg logit contributions
tymology-2.703
ledged-2.685
been-2.633
essor-2.574
iths-2.561
Token in
Token ID287
Feature activation+5.79e+00

Pos logit contributions
 counterproductive+6.041
 futile+6.024
 urgently+5.819
 undermines+5.730
 vulnerable+5.683
Neg logit contributions
tymology-12.415
ledged-12.228
been-12.008
iths-11.769
essor-11.705
Token Madrid
Token ID14708
Feature activation+7.29e+00

Pos logit contributions
 counterproductive+3.311
 futile+3.245
 urgently+3.028
 undermines+3.026
 vulnerable+3.009
Neg logit contributions
tymology-6.979
ledged-6.933
been-6.749
essor-6.698
iths-6.597
Token
Token ID198
Feature activation+2.95e-01

Pos logit contributions
 counterproductive+5.207
 futile+5.131
 undermines+4.980
 urgently+4.928
 vulnerable+4.858
Neg logit contributions
tymology-7.684
ledged-7.542
been-7.317
essor-7.263
iths-7.163
Token
Token ID198
Feature activation+7.71e+00

Pos logit contributions
 counterproductive+0.239
 futile+0.232
 undermines+0.226
 vulnerable+0.219
 urgently+0.217
Neg logit contributions
ledged-0.291
been-0.290
tymology-0.288
 origins-0.278
iths-0.278
TokenDri
Token ID20564
Feature activation+8.30e+00

Pos logit contributions
 counterproductive+5.312
 futile+5.214
 undermines+5.040
 urgently+4.984
 vulnerable+4.946
Neg logit contributions
tymology-8.196
ledged-8.055
been-7.767
essor-7.736
iths-7.671
Tokenving
Token ID1075
Feature activation+2.30e+01

Pos logit contributions
 counterproductive+7.601
 futile+7.523
 undermines+7.265
 urgently+7.207
 vulnerable+7.180
Neg logit contributions
tymology-6.689
ledged-6.625
been-6.415
 origins-6.334
essor-6.231
Token slowly
Token ID6364
Feature activation+2.44e+01

Pos logit contributions
 futile+15.870
 counterproductive+15.696
 harder+15.654
 urgently+15.559
 vulnerable+15.284
Neg logit contributions
tymology-19.190
ledged-18.789
been-18.313
essor-18.148
iths-17.648
Token along
Token ID1863
Feature activation+1.53e+01

Pos logit contributions
 futile+12.887
 counterproductive+12.769
 vulnerable+12.401
 harder+12.386
 urgently+12.358
Neg logit contributions
tymology-23.920
ledged-22.774
essor-22.241
been-21.557
iths-20.876
Token the
Token ID262
Feature activation+1.06e+01

Pos logit contributions
 futile+8.000
 counterproductive+7.850
 vulnerable+7.577
 urgently+7.365
 harder+7.360
Neg logit contributions
tymology-17.058
ledged-16.636
been-16.305
essor-15.983
iths-15.834
Token city
Token ID1748
Feature activation+6.67e+00

Pos logit contributions
 counterproductive+5.194
 futile+5.186
 vulnerable+4.865
 urgently+4.728
 harder+4.725
Neg logit contributions
tymology-13.285
ledged-13.091
been-12.741
iths-12.642
essor-12.514
Token's
Token ID338
Feature activation+6.53e+00

Pos logit contributions
 counterproductive+3.297
 futile+3.238
 undermines+3.078
 vulnerable+3.059
 urgently+3.035
Neg logit contributions
tymology-8.530
ledged-8.422
been-8.183
essor-8.153
iths-8.147
Token education
Token ID3707
Feature activation+7.63e-01

Pos logit contributions
tymology+1.611
ledged+1.593
been+1.554
 origins+1.541
essor+1.522
Neg logit contributions
 counterproductive-1.391
 futile-1.352
 undermines-1.321
 urgently-1.288
 unsafe-1.269
Token institutions
Token ID6712
Feature activation-9.71e-01

Pos logit contributions
 counterproductive+0.600
 futile+0.584
 undermines+0.568
 vulnerable+0.557
 urgently+0.556
Neg logit contributions
tymology-0.773
ledged-0.768
been-0.752
essor-0.734
iths-0.730
Token -
Token ID532
Feature activation-1.86e+00

Pos logit contributions
tymology+1.016
ledged+1.009
been+0.991
essor+0.969
iths+0.962
Neg logit contributions
 counterproductive-0.725
 futile-0.704
 undermines-0.680
 urgently-0.664
 vulnerable-0.661
Token and
Token ID290
Feature activation+1.21e+00

Pos logit contributions
tymology+1.925
ledged+1.908
been+1.887
 origins+1.844
iths+1.832
Neg logit contributions
 counterproductive-1.373
 futile-1.339
 undermines-1.280
 urgently-1.252
 vulnerable-1.247
Token how
Token ID703
Feature activation+6.68e+00

Pos logit contributions
 counterproductive+0.952
 futile+0.929
 undermines+0.900
 urgently+0.880
 vulnerable+0.878
Neg logit contributions
tymology-1.211
ledged-1.201
been-1.179
essor-1.150
iths-1.144
Token Brexit
Token ID11435
Feature activation+2.29e+01

Pos logit contributions
 counterproductive+5.314
 futile+5.166
 undermines+5.044
 vulnerable+4.991
 urgently+4.960
Neg logit contributions
ledged-6.658
tymology-6.648
been-6.490
essor-6.336
iths-6.276
Token plays
Token ID5341
Feature activation+1.07e+01

Pos logit contributions
 undermines+19.020
 threatens+18.606
 counterproductive+17.988
 vulnerable+17.454
 harder+17.334
Neg logit contributions
ledged-16.715
tymology-16.565
been-16.051
essor-15.609
iths-15.598
Token out
Token ID503
Feature activation+1.22e+01

Pos logit contributions
 counterproductive+6.168
 undermines+5.875
 futile+5.825
 vulnerable+5.731
 harder+5.644
Neg logit contributions
tymology-12.543
ledged-12.449
been-12.207
essor-11.930
iths-11.862
Token in
Token ID287
Feature activation+9.03e+00

Pos logit contributions
 counterproductive+8.316
 undermines+8.256
 futile+7.958
 threatens+7.882
 urgently+7.841
Neg logit contributions
tymology-12.994
ledged-12.878
been-12.506
essor-12.251
iths-12.131
Token relation
Token ID8695
Feature activation-1.69e+00

Pos logit contributions
 counterproductive+8.166
 futile+7.894
 undermines+7.777
 vulnerable+7.766
 urgently+7.632
Neg logit contributions
ledged-7.857
tymology-7.761
been-7.527
essor-7.381
iths-7.243
Token to
Token ID284
Feature activation+5.39e+00

Pos logit contributions
tymology+1.537
ledged+1.525
been+1.494
essor+1.463
iths+1.459
Neg logit contributions
 counterproductive-1.468
 futile-1.437
 undermines-1.380
 urgently-1.364
 vulnerable-1.364
Token it
Token ID340
Feature activation+1.09e+01

Pos logit contributions
tymology+0.266
ledged+0.264
been+0.259
essor+0.255
iths+0.254
Neg logit contributions
 counterproductive-0.211
 futile-0.207
 undermines-0.200
 urgently-0.196
 vulnerable-0.195
Token's
Token ID338
Feature activation+8.94e+00

Pos logit contributions
 counterproductive+6.790
 undermines+6.698
 futile+6.538
 urgently+6.378
 threatens+6.343
Neg logit contributions
tymology-12.496
ledged-12.493
been-12.084
essor-11.968
iths-11.718
Token difficult
Token ID2408
Feature activation+1.39e+00

Pos logit contributions
 counterproductive+7.722
 futile+7.463
 harder+7.251
 vulnerable+7.250
 undermines+7.150
Neg logit contributions
tymology-8.463
ledged-8.412
essor-7.992
been-7.940
iths-7.806
Token,
Token ID11
Feature activation+1.05e+01

Pos logit contributions
 counterproductive+1.012
 futile+0.986
 undermines+0.949
 urgently+0.945
 vulnerable+0.944
Neg logit contributions
tymology-1.497
ledged-1.488
been-1.464
essor-1.414
iths-1.405
Token if
Token ID611
Feature activation+8.16e+00

Pos logit contributions
 counterproductive+8.279
 futile+8.045
 vulnerable+7.795
 urgently+7.775
 undermines+7.766
Neg logit contributions
tymology-10.327
ledged-10.127
been-9.823
essor-9.506
iths-9.494
Token not
Token ID407
Feature activation+2.28e+01

Pos logit contributions
 counterproductive+6.859
 futile+6.700
 vulnerable+6.365
 unsafe+6.357
 harder+6.313
Neg logit contributions
tymology-7.912
ledged-7.820
been-7.553
essor-7.460
iths-7.257
Token impossible
Token ID5340
Feature activation+1.33e+00

Pos logit contributions
 counterproductive+15.089
 futile+14.647
 unsafe+13.774
 harder+13.584
 urgently+13.082
Neg logit contributions
ledged-22.777
tymology-22.586
essor-21.648
been-21.004
iths-20.291
Token,
Token ID11
Feature activation-4.52e-01

Pos logit contributions
 counterproductive+0.937
 futile+0.910
 undermines+0.877
 urgently+0.862
 vulnerable+0.861
Neg logit contributions
tymology-1.480
ledged-1.466
been-1.436
essor-1.405
iths-1.388
Token for
Token ID329
Feature activation+3.38e+00

Pos logit contributions
tymology+0.483
ledged+0.481
been+0.474
essor+0.458
iths+0.455
Neg logit contributions
 counterproductive-0.335
 futile-0.325
 undermines-0.314
 urgently-0.311
 vulnerable-0.309
Token them
Token ID606
Feature activation-5.50e+00

Pos logit contributions
 counterproductive+2.792
 futile+2.719
 undermines+2.638
 vulnerable+2.612
 urgently+2.598
Neg logit contributions
tymology-3.287
ledged-3.262
been-3.197
essor-3.101
iths-3.087
Token to
Token ID284
Feature activation-2.02e+00

Pos logit contributions
tymology+5.110
ledged+5.078
been+4.971
essor+4.857
iths+4.856
Neg logit contributions
 counterproductive-4.676
 futile-4.565
 undermines-4.455
 vulnerable-4.341
 urgently-4.340
Token;
Token ID26
Feature activation-3.64e+00

Pos logit contributions
 counterproductive+0.474
 futile+0.463
 undermines+0.451
 urgently+0.443
 vulnerable+0.442
Neg logit contributions
tymology-0.549
ledged-0.544
been-0.533
essor-0.521
iths-0.517
Token they
Token ID484
Feature activation+6.09e+00

Pos logit contributions
tymology+3.771
ledged+3.723
been+3.684
 origins+3.620
iths+3.579
Neg logit contributions
 counterproductive-2.630
 futile-2.559
 undermines-2.480
 vulnerable-2.437
 urgently-2.421
Token simply
Token ID2391
Feature activation+3.92e+00

Pos logit contributions
 counterproductive+5.064
 futile+4.940
 undermines+4.834
 urgently+4.757
 vulnerable+4.722
Neg logit contributions
ledged-5.772
tymology-5.772
been-5.581
essor-5.523
iths-5.481
Token make
Token ID787
Feature activation+2.62e+00

Pos logit contributions
 counterproductive+2.871
 futile+2.791
 undermines+2.714
 urgently+2.648
 vulnerable+2.647
Neg logit contributions
tymology-4.165
ledged-4.143
been-4.048
essor-3.979
iths-3.949
Token it
Token ID340
Feature activation+1.98e+01

Pos logit contributions
 counterproductive+2.160
 futile+2.108
 undermines+2.040
 vulnerable+2.007
 urgently+2.001
Neg logit contributions
tymology-2.570
ledged-2.553
been-2.497
essor-2.441
iths-2.427
Token increasingly
Token ID6481
Feature activation+2.27e+01

Pos logit contributions
 harder+15.234
 counterproductive+15.087
 futile+14.769
 vulnerable+14.238
 unsafe+14.046
Neg logit contributions
ledged-17.761
tymology-17.456
essor-17.033
been-16.787
iths-16.399
Token impossible
Token ID5340
Feature activation+1.28e+00

Pos logit contributions
 harder+16.737
 futile+16.517
 counterproductive+16.515
 vulnerable+15.608
 unsafe+15.323
Neg logit contributions
ledged-20.311
tymology-20.144
essor-19.486
been-19.139
iths-18.997
Token to
Token ID284
Feature activation+5.74e+00

Pos logit contributions
 counterproductive+0.975
 futile+0.951
 undermines+0.924
 urgently+0.909
 vulnerable+0.906
Neg logit contributions
tymology-1.332
ledged-1.319
been-1.296
essor-1.262
iths-1.259
Token achieve
Token ID4620
Feature activation-5.26e-01

Pos logit contributions
 counterproductive+4.542
 futile+4.462
 undermines+4.321
 urgently+4.286
 vulnerable+4.248
Neg logit contributions
tymology-5.722
ledged-5.718
been-5.598
essor-5.443
iths-5.434
Token in
Token ID287
Feature activation+4.71e+00

Pos logit contributions
tymology+0.515
ledged+0.512
been+0.500
essor+0.488
 origins+0.486
Neg logit contributions
 counterproductive-0.427
 futile-0.416
 undermines-0.405
 urgently-0.397
 vulnerable-0.396
Token the
Token ID262
Feature activation+3.61e+00

Pos logit contributions
 counterproductive+3.529
 futile+3.435
 undermines+3.304
 vulnerable+3.261
 urgently+3.258
Neg logit contributions
tymology-4.936
ledged-4.881
been-4.789
essor-4.674
iths-4.666
Token perpetually
Token ID43389
Feature activation+2.20e+01

Pos logit contributions
 counterproductive+12.274
 vulnerable+12.212
 futile+12.125
 unsafe+11.977
 accountable+11.937
Neg logit contributions
ledged-16.253
tymology-16.067
been-15.377
essor-15.307
iths-15.073
Token terrified
Token ID22144
Feature activation+6.03e+00

Pos logit contributions
 vulnerable+18.274
 futile+17.835
 counterproductive+17.827
 unsafe+17.734
 accountable+17.563
Neg logit contributions
tymology-17.352
ledged-17.340
essor-16.215
iths-15.855
been-15.614
Token and
Token ID290
Feature activation+1.67e+01

Pos logit contributions
 counterproductive+4.511
 futile+4.419
 undermines+4.301
 vulnerable+4.219
 urgently+4.178
Neg logit contributions
tymology-6.264
ledged-6.207
been-6.082
essor-5.972
iths-5.917
Token keep
Token ID1394
Feature activation+7.61e+00

Pos logit contributions
 counterproductive+13.957
 futile+13.811
 vulnerable+13.734
 unsafe+13.375
 undermines+13.099
Neg logit contributions
ledged-14.353
tymology-14.214
essor-13.523
been-13.477
iths-13.301
Token defense
Token ID3761
Feature activation+9.46e+00

Pos logit contributions
 counterproductive+5.568
 futile+5.458
 vulnerable+5.240
 undermines+5.229
 urgently+5.127
Neg logit contributions
tymology-8.010
ledged-8.000
been-7.784
essor-7.618
iths-7.551
Token budgets
Token ID21011
Feature activation+2.27e+01

Pos logit contributions
 counterproductive+7.270
 futile+7.190
 vulnerable+6.919
 undermines+6.904
 harder+6.761
Neg logit contributions
tymology-9.455
ledged-9.404
been-9.080
essor-9.025
iths-8.874
Token up
Token ID510
Feature activation+5.06e+00

Pos logit contributions
 futile+17.763
 counterproductive+17.666
 vulnerable+17.553
 hostage+17.447
 accountable+17.010
Neg logit contributions
tymology-18.423
ledged-18.298
essor-17.031
been-16.928
iths-16.758
Token,
Token ID11
Feature activation-1.80e+00

Pos logit contributions
 counterproductive+3.664
 futile+3.576
 undermines+3.469
 vulnerable+3.391
 urgently+3.388
Neg logit contributions
tymology-5.415
ledged-5.358
been-5.253
essor-5.158
iths-5.112
Token if
Token ID611
Feature activation-9.31e-01

Pos logit contributions
tymology+1.857
ledged+1.836
been+1.820
 origins+1.781
essor+1.759
Neg logit contributions
 counterproductive-1.339
 futile-1.304
 undermines-1.254
 urgently-1.237
 vulnerable-1.221
Token the
Token ID262
Feature activation+2.89e+00

Pos logit contributions
tymology+0.975
ledged+0.973
been+0.954
essor+0.930
 origins+0.930
Neg logit contributions
 counterproductive-0.686
 futile-0.667
 undermines-0.648
 urgently-0.632
 vulnerable-0.627
Token Deep
Token ID10766
Feature activation+1.94e+00

Pos logit contributions
 counterproductive+2.044
 futile+1.987
 undermines+1.920
 vulnerable+1.883
 urgently+1.880
Neg logit contributions
tymology-3.146
ledged-3.124
been-3.063
essor-2.997
iths-2.983
TokenHowever
Token ID4864
Feature activation+9.30e-01

Pos logit contributions
 counterproductive+3.062
 futile+2.968
 undermines+2.884
 urgently+2.837
 vulnerable+2.815
Neg logit contributions
tymology-4.476
ledged-4.433
been-4.317
essor-4.255
iths-4.235
Token,
Token ID11
Feature activation+2.40e+00

Pos logit contributions
 counterproductive+0.688
 futile+0.669
 undermines+0.649
 urgently+0.635
 vulnerable+0.635
Neg logit contributions
tymology-0.984
ledged-0.978
been-0.959
essor-0.937
iths-0.932
Token to
Token ID284
Feature activation+6.71e+00

Pos logit contributions
 counterproductive+1.854
 futile+1.805
 undermines+1.753
 urgently+1.717
 vulnerable+1.716
Neg logit contributions
tymology-2.464
ledged-2.447
been-2.398
essor-2.343
iths-2.330
Token make
Token ID787
Feature activation+4.47e+00

Pos logit contributions
 counterproductive+5.464
 futile+5.368
 undermines+5.172
 urgently+5.157
 vulnerable+5.122
Neg logit contributions
tymology-6.472
ledged-6.422
been-6.344
essor-6.146
iths-6.105
Token all
Token ID477
Feature activation+4.86e+00

Pos logit contributions
 counterproductive+3.506
 futile+3.425
 undermines+3.299
 vulnerable+3.259
 urgently+3.257
Neg logit contributions
tymology-4.537
ledged-4.522
been-4.426
essor-4.315
iths-4.289
Token this
Token ID428
Feature activation+2.26e+01

Pos logit contributions
 counterproductive+3.741
 futile+3.658
 undermines+3.511
 urgently+3.484
 vulnerable+3.479
Neg logit contributions
tymology-4.958
ledged-4.936
been-4.857
essor-4.724
iths-4.695
Token happen
Token ID1645
Feature activation+1.19e+01

Pos logit contributions
 harder+18.019
 counterproductive+17.906
 futile+17.837
 vulnerable+17.144
 urgently+17.107
Neg logit contributions
ledged-17.106
tymology-16.818
been-16.674
essor-16.104
iths-15.763
Token,
Token ID11
Feature activation+6.20e+00

Pos logit contributions
 counterproductive+8.486
 urgently+8.272
 futile+8.249
 undermines+8.133
 harder+8.009
Neg logit contributions
tymology-12.236
ledged-11.911
been-11.899
essor-11.288
iths-11.255
Token it
Token ID340
Feature activation+1.18e+01

Pos logit contributions
 counterproductive+4.766
 futile+4.625
 undermines+4.508
 urgently+4.465
 vulnerable+4.436
Neg logit contributions
tymology-6.342
ledged-6.293
been-6.176
essor-6.033
iths-5.993
Token is
Token ID318
Feature activation+1.15e+01

Pos logit contributions
 counterproductive+7.044
 undermines+6.954
 futile+6.890
 urgently+6.881
 threatens+6.606
Neg logit contributions
ledged-13.335
tymology-13.204
been-12.904
essor-12.663
iths-12.614
Token necessary
Token ID3306
Feature activation-3.12e+00

Pos logit contributions
 counterproductive+8.718
 futile+8.526
 urgently+8.263
 vulnerable+8.079
 harder+8.077
Neg logit contributions
tymology-11.690
ledged-11.652
been-11.158
essor-11.022
iths-10.903
Token genius
Token ID15632
Feature activation-8.56e+00

Pos logit contributions
tymology+3.418
ledged+3.375
been+3.334
 origins+3.282
essor+3.264
Neg logit contributions
 counterproductive-2.811
 futile-2.715
 undermines-2.692
 urgently-2.612
 vulnerable-2.590
Token of
Token ID286
Feature activation-5.00e+00

Pos logit contributions
tymology+8.205
ledged+8.039
been+7.963
 origins+7.944
essor+7.887
Neg logit contributions
 counterproductive-6.459
 futile-6.257
 undermines-6.064
 urgently-5.914
 vulnerable-5.852
Token a
Token ID257
Feature activation-1.31e+00

Pos logit contributions
tymology+5.238
ledged+5.157
been+5.094
 origins+5.068
essor+4.980
Neg logit contributions
 counterproductive-3.545
 undermines-3.424
 futile-3.419
 urgently-3.270
 threatens-3.240
Token mathematician
Token ID48251
Feature activation-6.13e+00

Pos logit contributions
tymology+1.276
ledged+1.269
been+1.251
essor+1.219
 origins+1.215
Neg logit contributions
 counterproductive-1.058
 futile-1.028
 undermines-1.013
 urgently-0.989
 vulnerable-0.980
Token
Token ID784
Feature activation-1.39e+01

Pos logit contributions
tymology+6.045
ledged+5.973
been+5.912
essor+5.817
 origins+5.807
Neg logit contributions
 counterproductive-4.712
 futile-4.563
 undermines-4.411
 urgently-4.322
 vulnerable-4.302
Token had
Token ID550
Feature activation-3.80e+01

Pos logit contributions
tymology+14.130
been+13.799
ledged+13.749
 origins+13.602
essor+13.378
Neg logit contributions
 counterproductive-8.469
 futile-7.958
 unsafe-7.653
 vulnerable-7.573
 undermines-7.349
Token originally
Token ID6198
Feature activation-3.15e+01

Pos logit contributions
 origins+28.364
been+28.308
 ancestors+27.678
iths+27.584
 predecessors+27.283
Neg logit contributions
 counterproductive-13.845
 undermines-13.607
Repeat-13.293
 hostage-13.107
Alert-12.891
Token sent
Token ID1908
Feature activation-7.03e+00

Pos logit contributions
been+24.769
 origins+23.212
 been+22.908
tymology+22.449
 ancestors+22.219
Neg logit contributions
 counterproductive-14.916
Repeat-14.830
 undermines-14.431
Alert-14.294
 hostage-13.739
Token me
Token ID502
Feature activation-1.05e+01

Pos logit contributions
tymology+7.347
ledged+7.327
been+7.222
 origins+7.123
iths+7.058
Neg logit contributions
 counterproductive-4.756
 futile-4.552
 undermines-4.484
 urgently-4.315
 vulnerable-4.293
Token the
Token ID262
Feature activation-1.12e+01

Pos logit contributions
tymology+9.553
 origins+9.470
been+9.465
ledged+9.423
 ancestors+9.278
Neg logit contributions
 counterproductive-7.790
 futile-7.488
 undermines-7.407
 unsafe-7.115
 vulnerable-7.111
Token paper
Token ID3348
Feature activation-8.35e+00

Pos logit contributions
 origins+10.644
tymology+10.598
been+10.562
ledged+10.402
 ancestors+10.320
Neg logit contributions
 counterproductive-7.842
 undermines-7.688
 futile-7.538
 threatens-7.343
 urgently-7.269
Token central
Token ID4318
Feature activation-1.51e+01

Pos logit contributions
ledged+5.977
tymology+5.935
been+5.879
 origins+5.801
essor+5.767
Neg logit contributions
 counterproductive-5.296
 futile-5.205
 undermines-5.130
 unsafe-4.907
 threatens-4.889
Token to
Token ID284
Feature activation-5.28e+00

Pos logit contributions
tymology+12.928
ledged+12.482
essor+12.380
 origins+12.347
been+12.287
Neg logit contributions
 counterproductive-11.889
 futile-11.606
 undermines-11.135
 unsafe-11.078
 vulnerable-10.680
Token the
Token ID262
Feature activation-5.49e+00

Pos logit contributions
tymology+5.564
ledged+5.506
been+5.410
 origins+5.367
essor+5.347
Neg logit contributions
 counterproductive-3.747
 futile-3.597
 undermines-3.553
 urgently-3.394
 vulnerable-3.378
Token story
Token ID1621
Feature activation-9.80e+00

Pos logit contributions
tymology+5.335
 origins+5.268
ledged+5.201
been+5.194
essor+5.087
Neg logit contributions
 counterproductive-4.267
 undermines-4.136
 futile-4.086
 urgently-3.945
 threatens-3.944
Token,
Token ID11
Feature activation-1.18e+01

Pos logit contributions
tymology+9.296
ledged+9.009
been+8.845
 origins+8.835
essor+8.632
Neg logit contributions
 counterproductive-7.727
 futile-7.492
 undermines-7.162
 urgently-7.085
 vulnerable-7.057
Token had
Token ID550
Feature activation-3.73e+01

Pos logit contributions
tymology+11.150
ledged+11.104
been+11.029
 origins+11.003
 ancestors+10.781
Neg logit contributions
 counterproductive-8.124
 futile-7.814
 undermines-7.407
 unsafe-7.314
 vulnerable-7.297
Token a
Token ID257
Feature activation-1.08e+01

Pos logit contributions
been+27.374
 origins+27.027
 ancestors+26.545
 been+26.496
iths+26.244
Neg logit contributions
Repeat-14.347
 undermines-14.137
 counterproductive-14.108
Alert-13.200
 accountable-12.953
Token collection
Token ID4947
Feature activation-1.45e+01

Pos logit contributions
ledged+10.791
tymology+10.734
been+10.720
 origins+10.634
 ancestors+10.395
Neg logit contributions
 undermines-6.683
 counterproductive-6.681
 futile-6.386
 threatens-6.235
 urgently-6.172
Token of
Token ID286
Feature activation-9.68e+00

Pos logit contributions
tymology+12.282
been+12.035
ledged+12.031
essor+11.967
 origins+11.872
Neg logit contributions
 counterproductive-11.380
 futile-10.936
 undermines-10.831
 threatens-10.324
 accountable-10.261
Token autom
Token ID3557
Feature activation-5.41e+00

Pos logit contributions
tymology+9.450
 origins+9.367
ledged+9.340
been+9.259
 ancestors+9.235
Neg logit contributions
 counterproductive-6.712
 undermines-6.461
 futile-6.423
 threatens-6.111
 urgently-6.051
Tokenat
Token ID265
Feature activation-8.97e+00

Pos logit contributions
tymology+3.588
ledged+3.518
been+3.435
iths+3.370
essor+3.302
Neg logit contributions
 counterproductive-6.097
 futile-5.934
 undermines-5.803
 vulnerable-5.716
 urgently-5.710
Tokenre
Token ID260
Feature activation-5.37e+00

Pos logit contributions
tymology+0.344
ledged+0.340
been+0.333
essor+0.322
iths+0.319
Neg logit contributions
 counterproductive-0.414
 futile-0.405
 undermines-0.396
 vulnerable-0.389
 urgently-0.388
Token originally
Token ID6198
Feature activation-1.28e+01

Pos logit contributions
been+5.477
tymology+5.464
ledged+5.462
 origins+5.327
essor+5.225
Neg logit contributions
 counterproductive-3.977
 futile-3.824
 undermines-3.803
 urgently-3.628
 vulnerable-3.601
Token from
Token ID422
Feature activation-2.93e+00

Pos logit contributions
been+13.338
tymology+13.036
 origins+13.020
ledged+12.965
 ancestors+12.679
Neg logit contributions
 counterproductive-7.780
 undermines-7.537
 futile-7.293
 urgently-7.242
 threatens-6.842
Token Ireland
Token ID7517
Feature activation-5.74e+00

Pos logit contributions
tymology+2.999
ledged+2.999
been+2.965
 origins+2.892
iths+2.884
Neg logit contributions
 counterproductive-2.183
 futile-2.116
 undermines-2.077
 urgently-2.006
 vulnerable-1.996
Token and
Token ID290
Feature activation-1.05e+01

Pos logit contributions
tymology+5.730
ledged+5.704
been+5.661
 origins+5.507
essor+5.455
Neg logit contributions
 counterproductive-4.458
 futile-4.318
 undermines-4.205
 vulnerable-4.066
 urgently-4.055
Token have
Token ID423
Feature activation-3.72e+01

Pos logit contributions
tymology+10.296
been+10.226
ledged+10.219
 origins+10.139
 ancestors+9.902
Neg logit contributions
 counterproductive-7.621
 futile-7.283
 undermines-7.202
 unsafe-6.771
 vulnerable-6.765
Token travelled
Token ID21650
Feature activation-5.38e-01

Pos logit contributions
been+28.176
 origins+26.923
 been+25.818
 ancestors+25.287
ledged+25.237
Neg logit contributions
 counterproductive-17.620
 undermines-17.016
 futile-16.173
 threatens-15.213
 unsafe-14.999
Token and
Token ID290
Feature activation-6.02e+00

Pos logit contributions
tymology+0.574
ledged+0.570
been+0.562
 origins+0.550
essor+0.548
Neg logit contributions
 counterproductive-0.388
 futile-0.376
 undermines-0.365
 vulnerable-0.355
 urgently-0.353
Token worked
Token ID3111
Feature activation-9.48e-01

Pos logit contributions
been+5.400
tymology+5.311
ledged+5.294
 origins+5.115
iths+5.059
Neg logit contributions
 counterproductive-5.228
 futile-5.063
 undermines-4.992
 vulnerable-4.862
 urgently-4.843
Token all
Token ID477
Feature activation-1.32e-01

Pos logit contributions
tymology+1.106
ledged+1.098
been+1.087
essor+1.063
 origins+1.063
Neg logit contributions
 counterproductive-0.588
 futile-0.564
 undermines-0.548
 vulnerable-0.527
 urgently-0.523
Token over
Token ID625
Feature activation+2.12e+00

Pos logit contributions
tymology+0.172
ledged+0.171
been+0.169
essor+0.165
 origins+0.165
Neg logit contributions
 counterproductive-0.064
 futile-0.061
 undermines-0.059
 urgently-0.057
 vulnerable-0.056
Token the
Token ID262
Feature activation-2.09e+00

Pos logit contributions
tymology+2.333
ledged+2.323
been+2.285
 origins+2.231
iths+2.225
Neg logit contributions
 counterproductive-1.532
 futile-1.481
 undermines-1.456
 urgently-1.399
 vulnerable-1.394
Token living
Token ID2877
Feature activation-7.18e+00

Pos logit contributions
tymology+2.177
ledged+2.174
been+2.139
 origins+2.106
iths+2.081
Neg logit contributions
 counterproductive-1.546
 futile-1.489
 undermines-1.477
 urgently-1.413
 vulnerable-1.409
Token world
Token ID995
Feature activation-1.08e+01

Pos logit contributions
ledged+5.747
tymology+5.745
been+5.691
 origins+5.601
 ancestors+5.484
Neg logit contributions
 counterproductive-6.759
 futile-6.523
 undermines-6.377
 vulnerable-6.217
 urgently-6.214
Token.
Token ID13
Feature activation-8.91e+00

Pos logit contributions
ledged+10.173
tymology+10.169
 origins+9.903
been+9.875
 ancestors+9.637
Neg logit contributions
 counterproductive-8.141
 futile-7.727
 undermines-7.727
 vulnerable-7.271
 threatens-7.241
Token He
Token ID679
Feature activation-7.85e+00

Pos logit contributions
been+8.190
ledged+8.119
tymology+8.041
 origins+8.008
iths+7.905
Neg logit contributions
 counterproductive-6.994
 futile-6.730
 undermines-6.608
 vulnerable-6.266
 threatens-6.251
Token had
Token ID550
Feature activation-3.70e+01

Pos logit contributions
tymology+7.578
ledged+7.566
been+7.436
 origins+7.200
iths+7.142
Neg logit contributions
 counterproductive-6.078
 futile-5.667
 undermines-5.656
 vulnerable-5.468
 unsafe-5.466
Token once
Token ID1752
Feature activation-2.57e+01

Pos logit contributions
 origins+28.748
been+28.390
 ancestors+28.190
iths+26.925
 been+26.659
Neg logit contributions
 counterproductive-14.474
 undermines-14.399
 accountable-13.200
Repeat-12.504
 hostage-12.481
Token lived
Token ID5615
Feature activation-4.71e+00

Pos logit contributions
been+22.149
ledged+20.944
iths+20.286
 ancestors+20.190
 origins+20.176
Neg logit contributions
 counterproductive-13.899
 undermines-13.438
Repeat-12.771
 futile-12.098
 tightening-12.096
Token inside
Token ID2641
Feature activation-3.35e+00

Pos logit contributions
ledged+5.332
tymology+5.317
been+5.266
 origins+5.206
essor+5.162
Neg logit contributions
 counterproductive-2.894
 undermines-2.715
 futile-2.709
 urgently-2.544
 vulnerable-2.528
Token a
Token ID257
Feature activation+3.18e+00

Pos logit contributions
tymology+3.440
ledged+3.430
been+3.372
iths+3.292
 origins+3.276
Neg logit contributions
 counterproductive-2.513
 futile-2.427
 undermines-2.392
 urgently-2.313
 vulnerable-2.299
Token par
Token ID1582
Feature activation-1.26e+01

Pos logit contributions
 counterproductive+2.516
 futile+2.458
 undermines+2.371
 vulnerable+2.341
 urgently+2.332
Neg logit contributions
tymology-3.211
ledged-3.182
been-3.113
essor-3.043
iths-3.026
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
tymology+1.112
ledged+1.104
been+1.084
essor+1.055
 origins+1.051
Neg logit contributions
 counterproductive-0.856
 futile-0.834
 undermines-0.813
 urgently-0.793
 vulnerable-0.788
Token American
Token ID1605
Feature activation-6.91e+00

Pos logit contributions
tymology+1.030
ledged+1.021
been+1.003
essor+0.976
 origins+0.974
Neg logit contributions
 counterproductive-0.766
 futile-0.746
 undermines-0.728
 urgently-0.708
 vulnerable-0.702
Token man
Token ID582
Feature activation-1.07e+01

Pos logit contributions
ledged+6.623
tymology+6.618
been+6.501
 origins+6.350
essor+6.238
Neg logit contributions
 counterproductive-5.548
 futile-5.385
 undermines-5.305
 urgently-5.124
 unsafe-5.039
Token.
Token ID13
Feature activation-1.24e+01

Pos logit contributions
tymology+9.963
ledged+9.876
been+9.722
 origins+9.512
essor+9.319
Neg logit contributions
 counterproductive-8.258
 futile-7.932
 undermines-7.830
 unsafe-7.481
 urgently-7.444
Token I
Token ID314
Feature activation-8.63e+00

Pos logit contributions
been+10.475
ledged+10.328
 origins+10.181
tymology+10.137
 ancestors+9.900
Neg logit contributions
 counterproductive-10.124
 futile-9.886
 undermines-9.685
 tightening-9.286
 urgently-9.280
Token have
Token ID423
Feature activation-3.62e+01

Pos logit contributions
tymology+8.711
ledged+8.380
been+8.372
 origins+8.145
 ancestors+7.975
Neg logit contributions
 counterproductive-6.299
 futile-5.931
 undermines-5.905
 hostage-5.630
 vulnerable-5.626
Token a
Token ID257
Feature activation-1.22e+01

Pos logit contributions
 origins+28.085
 ancestors+28.046
been+27.651
ledged+26.318
 been+26.048
Neg logit contributions
 counterproductive-14.072
 undermines-13.932
Repeat-13.444
Alert-13.357
 futile-13.307
Token history
Token ID2106
Feature activation-1.30e+01

Pos logit contributions
ledged+12.040
been+11.859
 origins+11.802
 ancestors+11.673
tymology+11.666
Neg logit contributions
 counterproductive-7.695
 undermines-7.672
 futile-7.459
 urgently-7.319
 threatens-7.082
Token of
Token ID286
Feature activation-7.45e+00

Pos logit contributions
tymology+11.367
ledged+11.132
been+11.033
essor+10.879
 origins+10.820
Neg logit contributions
 counterproductive-10.256
 futile-10.042
 undermines-9.751
 urgently-9.556
 unsafe-9.366
Token depression
Token ID8862
Feature activation-2.26e+00

Pos logit contributions
been+7.788
tymology+7.718
 origins+7.692
ledged+7.661
 ancestors+7.577
Neg logit contributions
 counterproductive-4.843
 futile-4.688
 undermines-4.667
 urgently-4.538
 threatens-4.356
Token,
Token ID11
Feature activation-2.02e+00

Pos logit contributions
tymology+2.407
ledged+2.389
been+2.344
essor+2.285
iths+2.277
Neg logit contributions
 counterproductive-1.657
 futile-1.611
 undermines-1.561
 urgently-1.527
 vulnerable-1.521
Token origin
Token ID8159
Feature activation-1.62e+01

Pos logit contributions
 origins+9.930
tymology+9.634
ledged+9.547
been+9.497
 ancestors+9.440
Neg logit contributions
 counterproductive-7.050
 futile-6.837
 undermines-6.668
 urgently-6.543
 threatens-6.265
Token that
Token ID326
Feature activation-5.31e+00

Pos logit contributions
 origins+15.027
tymology+14.701
ledged+14.681
been+14.165
 ancestors+14.162
Neg logit contributions
 counterproductive-10.631
 futile-10.277
 undermines-9.891
 urgently-9.621
 unsafe-9.604
Token Batman
Token ID9827
Feature activation+1.02e+00

Pos logit contributions
ledged+5.633
tymology+5.604
 origins+5.560
been+5.538
essor+5.430
Neg logit contributions
 counterproductive-3.596
 futile-3.498
 urgently-3.272
 undermines-3.258
 unsafe-3.224
Token and
Token ID290
Feature activation-4.49e+00

Pos logit contributions
 counterproductive+0.730
 futile+0.710
 undermines+0.687
 urgently+0.672
 vulnerable+0.671
Neg logit contributions
tymology-1.102
ledged-1.094
been-1.074
essor-1.051
iths-1.044
Token Dick
Token ID11740
Feature activation-4.93e-01

Pos logit contributions
ledged+4.507
tymology+4.470
been+4.430
 origins+4.401
essor+4.361
Neg logit contributions
 counterproductive-3.351
 futile-3.258
 undermines-3.149
 urgently-3.082
 unsafe-3.033
Token have
Token ID423
Feature activation-3.60e+01

Pos logit contributions
tymology+0.502
ledged+0.499
been+0.489
essor+0.478
iths+0.475
Neg logit contributions
 counterproductive-0.383
 futile-0.373
 undermines-0.362
 urgently-0.354
 vulnerable-0.354
Token of
Token ID286
Feature activation-1.39e+01

Pos logit contributions
 origins+26.078
been+25.452
 ancestors+24.808
 been+24.412
ledged+24.103
Neg logit contributions
 counterproductive-16.378
Repeat-15.636
 futile-15.539
 undermines-14.715
 Protesters-14.655
Token having
Token ID1719
Feature activation-2.57e+01

Pos logit contributions
 origins+13.207
ledged+13.025
been+12.823
 ancestors+12.654
tymology+12.589
Neg logit contributions
 counterproductive-9.557
 undermines-9.057
 futile-8.982
 urgently-8.654
 unsafe-8.367
Token gone
Token ID3750
Feature activation-3.97e+00

Pos logit contributions
 origins+23.528
 ancestors+22.789
been+22.522
 predecessors+21.500
 ancestor+21.421
Neg logit contributions
 counterproductive-12.449
 undermines-11.937
 futile-11.694
 urgently-11.615
 tightening-11.069
Token through
Token ID832
Feature activation-9.09e+00

Pos logit contributions
ledged+4.879
been+4.844
 origins+4.841
tymology+4.826
 ancestors+4.755
Neg logit contributions
 counterproductive-2.008
 futile-1.892
 undermines-1.865
 urgently-1.791
 unsafe-1.719
Token this
Token ID428
Feature activation-3.54e+00

Pos logit contributions
 origins+9.044
been+9.022
ledged+9.014
tymology+8.939
 ancestors+8.834
Neg logit contributions
 counterproductive-6.240
 futile-5.970
 undermines-5.925
 urgently-5.892
 vulnerable-5.674
Token was
Token ID373
Feature activation-9.55e-01

Pos logit contributions
 counterproductive+0.922
 futile+0.898
 undermines+0.870
 urgently+0.851
 vulnerable+0.851
Neg logit contributions
tymology-1.225
ledged-1.218
been-1.195
essor-1.165
iths-1.160
Token a
Token ID257
Feature activation+1.10e-01

Pos logit contributions
tymology+0.951
ledged+0.950
been+0.939
 origins+0.912
essor+0.907
Neg logit contributions
 counterproductive-0.747
 futile-0.727
 undermines-0.711
 urgently-0.681
 vulnerable-0.680
Token British
Token ID3517
Feature activation-4.06e+00

Pos logit contributions
 counterproductive+0.085
 futile+0.083
 undermines+0.081
 urgently+0.079
 unsafe+0.078
Neg logit contributions
ledged-0.112
tymology-0.112
been-0.110
essor-0.107
 origins-0.107
Token citizen
Token ID9511
Feature activation-1.10e+01

Pos logit contributions
tymology+4.215
ledged+4.189
been+4.159
 origins+4.097
essor+4.021
Neg logit contributions
 counterproductive-2.971
 futile-2.874
 undermines-2.801
 urgently-2.678
 unsafe-2.667
Token who
Token ID508
Feature activation-9.71e+00

Pos logit contributions
ledged+9.991
been+9.916
tymology+9.844
 origins+9.807
 ancestors+9.436
Neg logit contributions
 counterproductive-8.441
 futile-8.213
 undermines-7.968
 unsafe-7.613
 threatens-7.542
Token had
Token ID550
Feature activation-3.57e+01

Pos logit contributions
tymology+8.546
 origins+8.497
ledged+8.497
been+8.493
 ancestors+8.273
Neg logit contributions
 counterproductive-7.780
 futile-7.461
 undermines-7.090
 accountable-6.967
 unsafe-6.929
Token been
Token ID587
Feature activation-1.51e+00

Pos logit contributions
 origins+27.140
been+26.712
 ancestors+25.845
 been+25.375
 pedigree+24.461
Neg logit contributions
 counterproductive-15.594
 futile-14.664
 undermines-14.462
 accountable-14.305
Repeat-14.142
Token born
Token ID4642
Feature activation-1.52e+00

Pos logit contributions
ledged+1.553
tymology+1.553
been+1.545
 origins+1.502
iths+1.490
Neg logit contributions
 counterproductive-1.110
 futile-1.078
 undermines-1.050
 urgently-1.002
 vulnerable-0.999
Token in
Token ID287
Feature activation-1.52e-01

Pos logit contributions
tymology+1.774
ledged+1.772
been+1.743
essor+1.706
 origins+1.703
Neg logit contributions
 counterproductive-0.932
 futile-0.899
 undermines-0.870
 urgently-0.836
 vulnerable-0.835
Token Pakistan
Token ID7648
Feature activation-4.70e+00

Pos logit contributions
tymology+0.180
ledged+0.180
been+0.177
 origins+0.173
iths+0.173
Neg logit contributions
 counterproductive-0.091
 futile-0.087
 undermines-0.085
 urgently-0.081
 vulnerable-0.081
Token,
Token ID11
Feature activation-4.81e+00

Pos logit contributions
ledged+4.380
tymology+4.358
been+4.302
 origins+4.187
essor+4.155
Neg logit contributions
 counterproductive-3.956
 futile-3.853
 undermines-3.744
 vulnerable-3.641
 urgently-3.612
Token that
Token ID326
Feature activation-3.94e+00

Pos logit contributions
tymology+8.069
ledged+7.965
 origins+7.758
been+7.704
iths+7.665
Neg logit contributions
 counterproductive-8.862
 futile-8.532
 undermines-8.460
 vulnerable-8.223
 urgently-8.181
Token Xer
Token ID44264
Feature activation+1.13e+00

Pos logit contributions
tymology+4.029
ledged+3.976
been+3.915
iths+3.858
 origins+3.858
Neg logit contributions
 counterproductive-2.961
 futile-2.843
 undermines-2.784
 urgently-2.701
 vulnerable-2.682
Tokenxes
Token ID48169
Feature activation-1.12e+00

Pos logit contributions
 counterproductive+1.195
 futile+1.173
 undermines+1.142
 vulnerable+1.126
 urgently+1.123
Neg logit contributions
tymology-0.839
ledged-0.828
been-0.805
iths-0.783
essor-0.781
Token,
Token ID11
Feature activation-2.76e+00

Pos logit contributions
tymology+1.123
ledged+1.114
been+1.092
essor+1.067
iths+1.060
Neg logit contributions
 counterproductive-0.891
 futile-0.862
 undermines-0.840
 vulnerable-0.822
 urgently-0.814
Token who
Token ID508
Feature activation-1.21e+01

Pos logit contributions
tymology+2.886
ledged+2.854
been+2.811
 origins+2.765
essor+2.745
Neg logit contributions
 counterproductive-2.010
 futile-1.929
 undermines-1.891
 urgently-1.817
 unsafe-1.811
Token had
Token ID550
Feature activation-3.57e+01

Pos logit contributions
tymology+11.697
ledged+11.623
 origins+11.476
 ancestors+11.318
been+11.316
Neg logit contributions
 counterproductive-8.010
 futile-7.425
 undermines-7.141
 unsafe-7.141
 vulnerable-7.045
Token probably
Token ID2192
Feature activation-2.10e+01

Pos logit contributions
 origins+27.862
 ancestors+27.718
been+27.457
iths+26.835
 been+26.549
Neg logit contributions
Repeat-12.940
 counterproductive-12.860
 undermines-12.571
 Reporter-12.163
Alert-11.695
Token never
Token ID1239
Feature activation-2.15e+01

Pos logit contributions
been+19.393
 ancestors+18.935
 origins+18.800
ledged+18.655
tymology+18.270
Neg logit contributions
 counterproductive-11.071
 undermines-11.020
Repeat-10.503
 threatens-10.085
 hostage-10.060
Token left
Token ID1364
Feature activation-5.47e+00

Pos logit contributions
been+18.435
ledged+17.699
 ancestors+17.346
tymology+17.093
iths+17.025
Neg logit contributions
 counterproductive-12.866
 undermines-12.467
Repeat-12.264
 futile-11.874
Subscribe-11.550
Token dry
Token ID5894
Feature activation+3.24e-01

Pos logit contributions
tymology+5.538
ledged+5.509
been+5.365
iths+5.306
 origins+5.302
Neg logit contributions
 counterproductive-4.074
 futile-3.912
 undermines-3.842
 urgently-3.719
 vulnerable-3.658
Token land
Token ID1956
Feature activation-5.71e+00

Pos logit contributions
 counterproductive+0.239
 futile+0.232
 undermines+0.225
 urgently+0.221
 vulnerable+0.220
Neg logit contributions
tymology-0.343
ledged-0.340
been-0.334
essor-0.325
iths-0.325
Token of
Token ID286
Feature activation+5.67e-02

Pos logit contributions
tymology+8.844
ledged+8.784
been+8.616
 origins+8.586
essor+8.514
Neg logit contributions
 counterproductive-7.447
 futile-7.246
 undermines-7.094
 unsafe-6.832
 threatens-6.750
Token transport
Token ID4839
Feature activation-3.79e+00

Pos logit contributions
 counterproductive+0.040
 futile+0.039
 undermines+0.038
 urgently+0.037
 vulnerable+0.037
Neg logit contributions
tymology-0.061
ledged-0.061
been-0.060
 origins-0.058
essor-0.058
Token,
Token ID11
Feature activation-8.02e+00

Pos logit contributions
tymology+3.771
ledged+3.738
been+3.672
 origins+3.591
essor+3.574
Neg logit contributions
 counterproductive-2.971
 futile-2.886
 undermines-2.813
 vulnerable-2.719
 unsafe-2.703
Token and
Token ID290
Feature activation-7.76e+00

Pos logit contributions
tymology+7.980
ledged+7.885
been+7.876
 origins+7.759
essor+7.621
Neg logit contributions
 counterproductive-5.705
 futile-5.526
 undermines-5.476
 threatens-5.124
 unsafe-5.119
Token he
Token ID339
Feature activation-3.66e+00

Pos logit contributions
ledged+7.266
tymology+7.184
been+7.148
 origins+7.070
iths+6.938
Neg logit contributions
 counterproductive-5.918
 futile-5.716
 undermines-5.663
 unsafe-5.335
 threatens-5.327
Token had
Token ID550
Feature activation-3.57e+01

Pos logit contributions
tymology+3.489
ledged+3.483
been+3.414
 origins+3.327
iths+3.307
Neg logit contributions
 counterproductive-2.979
 futile-2.848
 undermines-2.795
 vulnerable-2.724
 unsafe-2.707
Token discovered
Token ID5071
Feature activation-9.74e+00

Pos logit contributions
 origins+27.989
been+27.638
 ancestors+26.980
 been+26.511
iths+26.156
Neg logit contributions
 counterproductive-13.578
Repeat-13.545
 undermines-13.491
 futile-12.287
 accountable-11.983
Token that
Token ID326
Feature activation-4.33e+00

Pos logit contributions
ledged+9.448
tymology+9.194
 origins+9.129
been+8.994
 ancestors+8.836
Neg logit contributions
 counterproductive-7.087
 futile-6.842
 undermines-6.817
 threatens-6.372
 vulnerable-6.367
Token his
Token ID465
Feature activation-1.14e+01

Pos logit contributions
ledged+4.476
tymology+4.467
been+4.377
 origins+4.314
essor+4.275
Neg logit contributions
 counterproductive-3.123
 futile-3.018
 undermines-2.956
 urgently-2.813
 vulnerable-2.805
Token father
Token ID2988
Feature activation-7.60e+00

Pos logit contributions
 origins+10.744
 ancestors+10.733
tymology+10.493
been+10.452
ledged+10.423
Neg logit contributions
 counterproductive-8.131
 undermines-8.060
 futile-7.743
 threatens-7.465
 urgently-7.333
Token was
Token ID373
Feature activation-1.43e+00

Pos logit contributions
ledged+6.820
tymology+6.795
been+6.775
 origins+6.618
 ancestors+6.591
Neg logit contributions
 counterproductive-6.175
 futile-5.940
 undermines-5.855
 vulnerable-5.660
 unsafe-5.625
Token wrote
Token ID2630
Feature activation-8.27e+00

Pos logit contributions
 counterproductive+0.309
 futile+0.300
 undermines+0.290
 vulnerable+0.284
 urgently+0.284
Neg logit contributions
tymology-0.442
ledged-0.438
been-0.431
essor-0.421
iths-0.418
Token on
Token ID319
Feature activation+2.72e-01

Pos logit contributions
tymology+8.361
ledged+8.286
been+8.199
 origins+8.145
essor+8.003
Neg logit contributions
 counterproductive-5.845
 futile-5.647
 undermines-5.526
 vulnerable-5.241
 unsafe-5.216
Token LinkedIn
Token ID27133
Feature activation-4.25e-01

Pos logit contributions
 counterproductive+0.186
 futile+0.180
 undermines+0.175
 urgently+0.170
 vulnerable+0.170
Neg logit contributions
tymology-0.302
ledged-0.300
been-0.295
essor-0.289
iths-0.287
Token that
Token ID326
Feature activation-1.04e+01

Pos logit contributions
tymology+0.425
ledged+0.422
been+0.413
essor+0.404
iths+0.401
Neg logit contributions
 counterproductive-0.338
 futile-0.330
 undermines-0.320
 vulnerable-0.314
 urgently-0.314
Token she
Token ID673
Feature activation-9.08e+00

Pos logit contributions
tymology+9.593
ledged+9.375
been+9.343
 origins+9.243
iths+9.165
Neg logit contributions
 counterproductive-8.058
 futile-7.821
 undermines-7.651
 urgently-7.346
 vulnerable-7.252
Token had
Token ID550
Feature activation-3.54e+01

Pos logit contributions
tymology+9.419
been+9.268
ledged+9.242
 origins+9.096
 ancestors+8.938
Neg logit contributions
 counterproductive-6.168
 futile-5.836
 undermines-5.547
 vulnerable-5.492
 unsafe-5.490
Token also
Token ID635
Feature activation-2.61e+01

Pos logit contributions
been+27.407
 origins+26.340
 ancestors+25.936
 been+25.240
iths+24.782
Neg logit contributions
 counterproductive-15.732
 undermines-15.701
 futile-14.667
 accountable-13.854
 threatens-13.831
Token worked
Token ID3111
Feature activation-2.50e+00

Pos logit contributions
been+21.457
 origins+19.534
 ancestors+19.354
 been+18.991
tymology+18.868
Neg logit contributions
 counterproductive-15.863
 undermines-15.410
 futile-14.597
Repeat-14.096
 accountable-13.878
Token as
Token ID355
Feature activation-3.37e+00

Pos logit contributions
tymology+2.763
ledged+2.729
been+2.704
essor+2.653
 origins+2.635
Neg logit contributions
 counterproductive-1.692
 futile-1.630
 undermines-1.589
 vulnerable-1.518
 urgently-1.518
Token a
Token ID257
Feature activation+2.65e+00

Pos logit contributions
tymology+3.206
ledged+3.166
been+3.122
essor+3.062
 origins+3.028
Neg logit contributions
 counterproductive-2.796
 futile-2.737
 undermines-2.677
 urgently-2.599
 vulnerable-2.581
Token professor
Token ID6240
Feature activation-6.85e+00

Pos logit contributions
 counterproductive+2.048
 futile+1.994
 undermines+1.934
 vulnerable+1.899
 urgently+1.897
Neg logit contributions
tymology-2.709
ledged-2.691
been-2.635
essor-2.573
iths-2.563
Token born
Token ID4642
Feature activation-6.33e-01

Pos logit contributions
ledged+3.016
tymology+2.988
been+2.984
 origins+2.906
essor+2.898
Neg logit contributions
 counterproductive-2.004
 futile-1.933
 undermines-1.924
 urgently-1.812
 threatens-1.803
Token in
Token ID287
Feature activation-1.03e+00

Pos logit contributions
tymology+0.745
ledged+0.742
been+0.729
essor+0.716
 origins+0.713
Neg logit contributions
 counterproductive-0.387
 futile-0.373
 undermines-0.361
 urgently-0.349
 vulnerable-0.348
Token New
Token ID968
Feature activation-4.55e+00

Pos logit contributions
tymology+1.079
ledged+1.076
been+1.063
 origins+1.039
essor+1.036
Neg logit contributions
 counterproductive-0.759
 futile-0.735
 undermines-0.722
 urgently-0.698
 vulnerable-0.691
Token Jersey
Token ID8221
Feature activation-1.80e+00

Pos logit contributions
ledged+4.159
tymology+4.128
been+4.068
essor+3.979
 origins+3.927
Neg logit contributions
 counterproductive-3.933
 futile-3.820
 undermines-3.736
 urgently-3.676
 vulnerable-3.629
Token and
Token ID290
Feature activation-1.05e+01

Pos logit contributions
tymology+1.848
ledged+1.841
been+1.808
essor+1.768
 origins+1.759
Neg logit contributions
 counterproductive-1.370
 futile-1.334
 undermines-1.294
 urgently-1.261
 vulnerable-1.259
Token has
Token ID468
Feature activation-3.52e+01

Pos logit contributions
ledged+9.725
 origins+9.698
tymology+9.620
 ancestors+9.567
been+9.546
Neg logit contributions
 counterproductive-7.830
 futile-7.381
 undermines-7.088
 vulnerable-6.932
 hostage-6.906
Token been
Token ID587
Feature activation-1.25e+00

Pos logit contributions
 origins+25.964
 ancestors+25.643
been+25.182
 been+24.271
ledged+24.092
Neg logit contributions
 counterproductive-17.349
 futile-16.490
 undermines-16.012
Repeat-15.611
 hostage-15.410
Token a
Token ID257
Feature activation+5.33e-01

Pos logit contributions
ledged+1.299
tymology+1.286
been+1.283
 origins+1.250
essor+1.243
Neg logit contributions
 counterproductive-0.923
 futile-0.893
 undermines-0.869
 urgently-0.839
 vulnerable-0.829
Token United
Token ID1578
Feature activation-7.34e+00

Pos logit contributions
 counterproductive+0.402
 futile+0.390
 undermines+0.382
 urgently+0.372
 vulnerable+0.367
Neg logit contributions
ledged-0.552
tymology-0.550
been-0.541
essor-0.529
 origins-0.525
Token States
Token ID1829
Feature activation-3.72e+00

Pos logit contributions
ledged+6.551
been+6.452
 origins+6.344
tymology+6.336
essor+6.275
Neg logit contributions
 counterproductive-6.154
 futile-6.046
 undermines-5.985
 urgently-5.638
 unsafe-5.566
Token citizen
Token ID9511
Feature activation-1.07e+01

Pos logit contributions
ledged+3.699
tymology+3.647
been+3.609
 origins+3.530
essor+3.518
Neg logit contributions
 counterproductive-2.926
 futile-2.823
 undermines-2.763
 urgently-2.665
 unsafe-2.643
Token to
Token ID284
Feature activation-6.64e+00

Pos logit contributions
 counterproductive+0.705
 futile+0.689
 undermines+0.674
 urgently+0.662
 vulnerable+0.661
Neg logit contributions
tymology-0.629
ledged-0.624
been-0.609
essor-0.591
iths-0.588
Token this
Token ID428
Feature activation-1.14e-01

Pos logit contributions
ledged+6.158
tymology+6.145
been+6.059
 origins+5.924
essor+5.786
Neg logit contributions
 counterproductive-5.389
 futile-5.223
 undermines-5.193
 urgently-5.034
 vulnerable-4.936
Token point
Token ID966
Feature activation-1.15e+01

Pos logit contributions
tymology+0.104
ledged+0.104
been+0.101
essor+0.099
iths+0.098
Neg logit contributions
 counterproductive-0.101
 futile-0.099
 undermines-0.096
 urgently-0.094
 vulnerable-0.094
Token,
Token ID11
Feature activation-1.11e+01

Pos logit contributions
been+10.256
tymology+10.176
ledged+10.058
 origins+9.877
essor+9.734
Neg logit contributions
 counterproductive-8.855
 undermines-8.526
 futile-8.442
 urgently-8.186
 unsafe-8.091
Token I
Token ID314
Feature activation-1.25e+01

Pos logit contributions
been+9.468
ledged+9.348
tymology+9.335
 origins+9.199
 ancestors+9.057
Neg logit contributions
 counterproductive-8.745
 undermines-8.579
 futile-8.441
 urgently-8.361
 unsafe-8.167
Token have
Token ID423
Feature activation-3.50e+01

Pos logit contributions
been+11.068
tymology+10.955
ledged+10.749
 origins+10.239
essor+10.161
Neg logit contributions
 counterproductive-9.543
 undermines-9.228
 futile-8.978
 unsafe-8.694
 threatens-8.642
Token been
Token ID587
Feature activation-1.20e+00

Pos logit contributions
been+26.927
 been+25.075
ledged+24.652
 origins+24.371
 ancestors+24.073
Neg logit contributions
 undermines-15.377
 counterproductive-14.544
 threatens-13.903
Repeat-13.857
 hostage-13.573
Token living
Token ID2877
Feature activation+5.63e+00

Pos logit contributions
ledged+1.272
tymology+1.267
been+1.265
 origins+1.225
iths+1.222
Neg logit contributions
 counterproductive-0.838
 futile-0.811
 undermines-0.802
 urgently-0.773
 threatens-0.758
Token a
Token ID257
Feature activation+6.59e+00

Pos logit contributions
 counterproductive+3.851
 futile+3.761
 undermines+3.598
 vulnerable+3.582
 harder+3.541
Neg logit contributions
tymology-6.268
ledged-6.215
been-6.036
essor-5.976
iths-5.881
Token very
Token ID845
Feature activation+3.85e+00

Pos logit contributions
 counterproductive+5.463
 futile+5.360
 vulnerable+5.108
 undermines+5.090
 harder+5.033
Neg logit contributions
tymology-6.434
ledged-6.328
been-6.121
essor-6.050
iths-5.999
Token good
Token ID922
Feature activation-5.23e+00

Pos logit contributions
 counterproductive+2.647
 futile+2.571
 undermines+2.476
 vulnerable+2.430
 urgently+2.420
Neg logit contributions
tymology-4.282
ledged-4.253
been-4.164
essor-4.080
iths-4.063
Tokenin
Token ID259
Feature activation-4.56e+00

Pos logit contributions
tymology+0.517
ledged+0.513
been+0.504
essor+0.491
iths+0.488
Neg logit contributions
 counterproductive-0.433
 futile-0.421
 undermines-0.410
 urgently-0.402
 vulnerable-0.402
Token Gund
Token ID23323
Feature activation-6.47e+00

Pos logit contributions
tymology+4.509
ledged+4.479
been+4.463
essor+4.336
 origins+4.325
Neg logit contributions
 counterproductive-3.528
 futile-3.403
 undermines-3.330
 vulnerable-3.239
 urgently-3.231
Tokenuz
Token ID10277
Feature activation-5.70e+00

Pos logit contributions
tymology+5.353
ledged+5.276
been+5.224
essor+5.012
iths+4.986
Neg logit contributions
 counterproductive-6.098
 futile-5.929
 undermines-5.737
 vulnerable-5.638
 urgently-5.631
Token,
Token ID11
Feature activation-4.14e+00

Pos logit contributions
tymology+5.418
ledged+5.366
been+5.326
essor+5.146
 origins+5.123
Neg logit contributions
 counterproductive-4.653
 futile-4.511
 undermines-4.393
 urgently-4.279
 vulnerable-4.275
Token who
Token ID508
Feature activation-9.85e+00

Pos logit contributions
tymology+4.116
ledged+4.065
been+4.025
 origins+3.928
essor+3.904
Neg logit contributions
 counterproductive-3.209
 futile-3.127
 undermines-3.061
 vulnerable-2.960
 urgently-2.951
Token had
Token ID550
Feature activation-3.48e+01

Pos logit contributions
ledged+9.014
been+8.907
tymology+8.856
 origins+8.712
iths+8.507
Neg logit contributions
 counterproductive-7.442
 futile-7.138
 undermines-6.931
 vulnerable-6.740
 unsafe-6.707
Token helped
Token ID4193
Feature activation-4.49e+00

Pos logit contributions
been+27.050
 origins+26.510
 been+25.701
 ancestors+25.251
iths+25.184
Neg logit contributions
 counterproductive-13.607
 undermines-13.277
Repeat-13.191
 futile-12.676
-12.469
Token him
Token ID683
Feature activation-9.27e+00

Pos logit contributions
tymology+4.413
ledged+4.399
been+4.312
 origins+4.297
essor+4.257
Neg logit contributions
 counterproductive-3.390
 futile-3.253
 undermines-3.203
 urgently-3.088
 vulnerable-3.062
Token bring
Token ID2222
Feature activation-8.78e-01

Pos logit contributions
tymology+9.155
ledged+9.051
 origins+8.998
been+8.943
iths+8.856
Neg logit contributions
 counterproductive-6.315
 futile-5.954
 undermines-5.853
 urgently-5.518
 unsafe-5.509
Token in
Token ID287
Feature activation-4.79e+00

Pos logit contributions
tymology+0.948
ledged+0.938
been+0.922
 origins+0.907
essor+0.905
Neg logit contributions
 counterproductive-0.618
 futile-0.596
 undermines-0.584
 vulnerable-0.563
 urgently-0.562
Token the
Token ID262
Feature activation-4.52e+00

Pos logit contributions
tymology+4.951
ledged+4.897
been+4.846
 origins+4.810
essor+4.715
Neg logit contributions
 counterproductive-3.402
 futile-3.305
 undermines-3.278
 urgently-3.101
 vulnerable-3.086
Token the
Token ID262
Feature activation+3.36e+00

Pos logit contributions
 counterproductive+1.558
 futile+1.517
 undermines+1.465
 vulnerable+1.435
 urgently+1.434
Neg logit contributions
tymology-2.373
ledged-2.355
been-2.312
essor-2.264
iths-2.249
Token way
Token ID835
Feature activation+2.85e+00

Pos logit contributions
 counterproductive+2.757
 futile+2.693
 undermines+2.604
 vulnerable+2.569
 urgently+2.561
Neg logit contributions
tymology-3.280
ledged-3.265
been-3.191
essor-3.120
iths-3.098
Token (
Token ID357
Feature activation-9.11e+00

Pos logit contributions
 counterproductive+2.050
 futile+1.996
 undermines+1.930
 urgently+1.896
 vulnerable+1.890
Neg logit contributions
tymology-3.074
ledged-3.049
been-2.992
essor-2.928
iths-2.906
Tokenthese
Token ID27218
Feature activation+8.48e-01

Pos logit contributions
been+7.693
tymology+7.504
ledged+7.440
iths+7.059
essor+6.992
Neg logit contributions
 counterproductive-8.061
 futile-7.888
 undermines-7.769
 urgently-7.572
 vulnerable-7.551
Token players
Token ID1938
Feature activation-1.90e+00

Pos logit contributions
 counterproductive+0.715
 futile+0.698
 undermines+0.680
 urgently+0.667
 vulnerable+0.665
Neg logit contributions
tymology-0.807
ledged-0.801
been-0.785
essor-0.765
iths-0.762
Token had
Token ID550
Feature activation-3.48e+01

Pos logit contributions
tymology+2.026
ledged+2.008
been+1.984
essor+1.934
iths+1.928
Neg logit contributions
 counterproductive-1.357
 futile-1.309
 undermines-1.276
 vulnerable-1.235
 urgently-1.226
Token missed
Token ID6825
Feature activation-1.04e+01

Pos logit contributions
been+28.081
iths+27.274
 origins+26.955
ledged+26.660
 ancestors+26.594
Neg logit contributions
 counterproductive-13.065
 undermines-13.048
 futile-12.007
 accountable-11.520
 threatens-11.386
Token the
Token ID262
Feature activation-7.41e+00

Pos logit contributions
ledged+11.438
tymology+11.356
been+11.154
 origins+11.076
 ancestors+11.062
Neg logit contributions
 counterproductive-5.833
 futile-5.567
 undermines-5.549
 urgently-5.227
 threatens-5.091
Token Valve
Token ID22479
Feature activation-1.34e+00

Pos logit contributions
tymology+7.284
ledged+7.262
 origins+7.168
been+7.139
 ancestors+7.060
Neg logit contributions
 counterproductive-5.377
 futile-5.201
 undermines-5.179
 urgently-4.975
 threatens-4.850
Token-
Token ID12
Feature activation-6.63e+00

Pos logit contributions
tymology+1.359
ledged+1.345
been+1.325
 origins+1.292
essor+1.289
Neg logit contributions
 counterproductive-1.045
 futile-1.017
 undermines-0.985
 urgently-0.965
 vulnerable-0.958
Tokenled
Token ID992
Feature activation-3.99e+00

Pos logit contributions
ledged+6.016
tymology+6.013
been+5.980
iths+5.637
essor+5.621
Neg logit contributions
 counterproductive-5.617
 futile-5.517
 undermines-5.381
 vulnerable-5.238
 urgently-5.238
Token development
Token ID2478
Feature activation-4.88e+00

Pos logit contributions
tymology+2.075
ledged+2.068
been+2.031
essor+1.981
 origins+1.974
Neg logit contributions
 counterproductive-1.826
 futile-1.778
 undermines-1.727
 urgently-1.693
 vulnerable-1.687
Token camera
Token ID4676
Feature activation-8.56e-02

Pos logit contributions
tymology+4.632
ledged+4.620
been+4.568
 origins+4.502
essor+4.444
Neg logit contributions
 counterproductive-3.912
 futile-3.775
 undermines-3.651
 vulnerable-3.602
 urgently-3.572
Token,
Token ID11
Feature activation-7.08e+00

Pos logit contributions
tymology+0.087
ledged+0.086
been+0.084
essor+0.082
iths+0.082
Neg logit contributions
 counterproductive-0.067
 futile-0.065
 undermines-0.063
 vulnerable-0.062
 urgently-0.062
Token although
Token ID3584
Feature activation-4.18e+00

Pos logit contributions
tymology+7.027
ledged+6.978
been+6.810
 origins+6.704
essor+6.687
Neg logit contributions
 counterproductive-5.244
 futile-5.063
 undermines-4.999
 urgently-4.754
 vulnerable-4.745
Token it
Token ID340
Feature activation-3.64e+00

Pos logit contributions
tymology+4.113
ledged+4.087
been+4.046
 origins+3.948
essor+3.923
Neg logit contributions
 counterproductive-3.199
 undermines-3.081
 futile-3.079
 urgently-2.923
 threatens-2.910
Token had
Token ID550
Feature activation-3.46e+01

Pos logit contributions
tymology+4.029
ledged+3.993
been+3.958
 origins+3.870
essor+3.831
Neg logit contributions
 counterproductive-2.426
 futile-2.325
 undermines-2.217
 vulnerable-2.172
 urgently-2.155
Token been
Token ID587
Feature activation-3.36e+00

Pos logit contributions
been+27.877
 origins+27.120
 predecessors+25.761
 ancestors+25.529
 been+25.524
Neg logit contributions
 counterproductive-14.297
 undermines-14.221
 futile-13.485
Repeat-13.213
 accountable-13.170
Token upgraded
Token ID17955
Feature activation-2.86e+00

Pos logit contributions
ledged+3.243
tymology+3.239
been+3.206
 origins+3.104
essor+3.090
Neg logit contributions
 counterproductive-2.652
 futile-2.574
 undermines-2.562
 urgently-2.430
 vulnerable-2.429
Token regularly
Token ID7987
Feature activation-4.32e-02

Pos logit contributions
tymology+3.066
ledged+3.053
been+2.992
essor+2.937
 origins+2.920
Neg logit contributions
 counterproductive-2.040
 futile-1.973
 undermines-1.919
 vulnerable-1.857
 urgently-1.841
Token over
Token ID625
Feature activation-1.57e+00

Pos logit contributions
tymology+0.045
ledged+0.045
been+0.044
essor+0.043
iths+0.043
Neg logit contributions
 counterproductive-0.033
 futile-0.032
 undermines-0.031
 urgently-0.030
 vulnerable-0.030
Token the
Token ID262
Feature activation-4.98e+00

Pos logit contributions
tymology+1.734
ledged+1.725
been+1.698
essor+1.657
 origins+1.649
Neg logit contributions
 counterproductive-1.085
 futile-1.050
 undermines-1.023
 urgently-0.993
 vulnerable-0.990
Token hike
Token ID18390
Feature activation-5.93e+00

Pos logit contributions
ledged+7.880
tymology+7.876
been+7.723
 origins+7.714
essor+7.603
Neg logit contributions
 counterproductive-5.305
 futile-5.097
 undermines-5.031
 urgently-4.895
 vulnerable-4.821
Token.
Token ID13
Feature activation-5.80e+00

Pos logit contributions
tymology+6.277
ledged+6.267
been+6.164
 origins+6.058
essor+6.011
Neg logit contributions
 counterproductive-4.147
 futile-3.972
 undermines-3.918
 urgently-3.801
 vulnerable-3.767
Token Bob
Token ID5811
Feature activation+6.66e+00

Pos logit contributions
been+5.448
ledged+5.431
tymology+5.366
 origins+5.300
iths+5.278
Neg logit contributions
 counterproductive-4.672
 futile-4.545
 undermines-4.403
 vulnerable-4.286
 urgently-4.208
Token In
Token ID554
Feature activation-4.70e+00

Pos logit contributions
 counterproductive+4.991
 futile+4.915
 undermines+4.750
 urgently+4.665
 vulnerable+4.651
Neg logit contributions
tymology-6.708
ledged-6.642
been-6.417
essor-6.373
iths-6.272
Tokenman
Token ID805
Feature activation-4.86e+00

Pos logit contributions
tymology+4.419
ledged+4.388
been+4.383
essor+4.224
iths+4.214
Neg logit contributions
 counterproductive-3.923
 futile-3.827
 undermines-3.717
 urgently-3.653
 vulnerable-3.617
Token had
Token ID550
Feature activation-3.46e+01

Pos logit contributions
ledged+4.797
tymology+4.785
been+4.697
essor+4.570
 origins+4.547
Neg logit contributions
 counterproductive-3.809
 futile-3.640
 undermines-3.589
 vulnerable-3.492
 urgently-3.468
Token written
Token ID3194
Feature activation-8.70e+00

Pos logit contributions
been+26.820
 ancestors+25.793
 origins+25.506
 been+25.232
ledged+25.027
Neg logit contributions
 undermines-14.545
 counterproductive-14.048
Repeat-13.379
 hostage-12.966
 threatens-12.834
Token a
Token ID257
Feature activation+6.90e-01

Pos logit contributions
ledged+8.576
tymology+8.521
been+8.386
 origins+8.302
essor+8.198
Neg logit contributions
 counterproductive-6.216
 futile-6.000
 undermines-5.964
 vulnerable-5.647
 urgently-5.621
Token guide
Token ID5698
Feature activation-6.25e+00

Pos logit contributions
 counterproductive+0.602
 futile+0.587
 undermines+0.578
 urgently+0.564
 vulnerable+0.562
Neg logit contributions
tymology-0.633
ledged-0.629
been-0.617
essor-0.599
iths-0.596
Tokenbook
Token ID2070
Feature activation-8.08e+00

Pos logit contributions
tymology+6.393
ledged+6.373
been+6.222
essor+6.138
 origins+6.059
Neg logit contributions
 counterproductive-4.583
 futile-4.414
 undermines-4.374
 urgently-4.174
 vulnerable-4.166
Token about
Token ID546
Feature activation-4.35e+00

Pos logit contributions
tymology+8.665
ledged+8.582
been+8.369
 origins+8.301
essor+8.267
Neg logit contributions
 counterproductive-5.276
 futile-5.066
 undermines-5.031
 unsafe-4.735
 threatens-4.728
Token Win
Token ID7178
Feature activation-1.56e+00

Pos logit contributions
ledged+3.103
been+3.069
tymology+3.067
essor+2.982
iths+2.961
Neg logit contributions
 counterproductive-2.380
 futile-2.322
 undermines-2.223
 urgently-2.183
 vulnerable-2.179
Tokenkel
Token ID7750
Feature activation-2.53e+00

Pos logit contributions
tymology+1.176
ledged+1.163
been+1.143
iths+1.106
essor+1.092
Neg logit contributions
 counterproductive-1.643
 futile-1.606
 undermines-1.575
 urgently-1.551
 vulnerable-1.546
Tokenried
Token ID2228
Feature activation-2.25e-02

Pos logit contributions
ledged+1.674
been+1.638
tymology+1.630
iths+1.516
essor+1.497
Neg logit contributions
 counterproductive-2.899
 futile-2.833
 undermines-2.801
 unsafe-2.747
 vulnerable-2.744
Token,
Token ID11
Feature activation-3.44e+00

Pos logit contributions
tymology+0.023
ledged+0.023
been+0.022
essor+0.022
 origins+0.022
Neg logit contributions
 counterproductive-0.017
 futile-0.017
 undermines-0.016
 vulnerable-0.016
 urgently-0.016
Token who
Token ID508
Feature activation-8.69e+00

Pos logit contributions
tymology+3.308
ledged+3.277
been+3.223
 origins+3.160
essor+3.145
Neg logit contributions
 counterproductive-2.757
 futile-2.681
 undermines-2.618
 urgently-2.558
 vulnerable-2.517
Token had
Token ID550
Feature activation-3.45e+01

Pos logit contributions
ledged+8.460
tymology+8.409
been+8.351
 origins+8.272
 ancestors+8.081
Neg logit contributions
 counterproductive-6.241
 futile-5.983
 undermines-5.664
 vulnerable-5.655
 unsafe-5.613
Token started
Token ID2067
Feature activation-3.72e+00

Pos logit contributions
been+26.900
 origins+26.607
 ancestors+25.661
 predecessors+25.433
 been+25.254
Neg logit contributions
 counterproductive-14.114
 undermines-13.637
 futile-13.602
Repeat-13.518
 distur-13.259
Token at
Token ID379
Feature activation-2.08e+00

Pos logit contributions
ledged+4.028
tymology+4.018
been+3.989
 origins+3.898
essor+3.886
Neg logit contributions
 counterproductive-2.490
 futile-2.398
 undermines-2.357
 urgently-2.271
 vulnerable-2.256
Token Goldman
Token ID18740
Feature activation+7.23e-01

Pos logit contributions
tymology+2.126
ledged+2.115
been+2.090
essor+2.044
 origins+2.042
Neg logit contributions
 counterproductive-1.563
 futile-1.525
 undermines-1.497
 urgently-1.450
 vulnerable-1.433
Token eight
Token ID3624
Feature activation-4.54e+00

Pos logit contributions
 counterproductive+0.491
 futile+0.476
 undermines+0.460
 urgently+0.449
 vulnerable+0.448
Neg logit contributions
tymology-0.808
ledged-0.802
been-0.788
essor-0.772
iths-0.767
Token years
Token ID812
Feature activation+4.46e+00

Pos logit contributions
tymology+3.205
ledged+3.180
been+3.116
 origins+3.042
essor+3.024
Neg logit contributions
 counterproductive-4.785
 futile-4.672
 undermines-4.602
 urgently-4.558
 vulnerable-4.500
Token player
Token ID2137
Feature activation-9.19e+00

Pos logit contributions
tymology+12.233
ledged+12.210
been+12.016
 origins+11.928
essor+11.634
Neg logit contributions
 counterproductive-10.823
 futile-10.424
 undermines-10.221
 urgently-10.130
 accountable-9.552
Token,
Token ID11
Feature activation-6.91e+00

Pos logit contributions
tymology+9.221
ledged+9.139
been+9.096
 origins+8.951
essor+8.828
Neg logit contributions
 counterproductive-6.389
 futile-6.082
 undermines-5.998
 urgently-5.660
 vulnerable-5.621
Token the
Token ID262
Feature activation-9.06e+00

Pos logit contributions
tymology+6.798
been+6.717
ledged+6.692
 origins+6.544
essor+6.479
Neg logit contributions
 counterproductive-5.153
 futile-4.955
 undermines-4.899
 urgently-4.701
 vulnerable-4.645
Token one
Token ID530
Feature activation-9.45e+00

Pos logit contributions
been+9.064
tymology+9.047
ledged+9.035
 origins+8.821
iths+8.711
Neg logit contributions
 counterproductive-6.168
 undermines-5.986
 futile-5.969
 urgently-5.634
 threatens-5.541
Token who
Token ID508
Feature activation-1.03e+01

Pos logit contributions
tymology+9.210
ledged+9.113
been+9.036
 origins+8.812
essor+8.743
Neg logit contributions
 counterproductive-6.800
 futile-6.623
 undermines-6.548
 urgently-6.292
 unsafe-6.106
Token had
Token ID550
Feature activation-3.45e+01

Pos logit contributions
tymology+11.119
ledged+11.043
been+10.905
 origins+10.750
essor+10.639
Neg logit contributions
 counterproductive-6.105
 futile-5.790
 undermines-5.375
 vulnerable-5.285
 unsafe-5.248
Token carried
Token ID5281
Feature activation-3.79e+00

Pos logit contributions
been+27.430
 origins+26.395
iths+25.707
ledged+25.664
 ancestors+25.624
Neg logit contributions
 counterproductive-13.744
 undermines-13.459
 futile-13.016
Repeat-12.120
 hostage-11.948
Token them
Token ID606
Feature activation-9.49e-01

Pos logit contributions
ledged+3.774
tymology+3.768
been+3.708
 origins+3.615
iths+3.607
Neg logit contributions
 counterproductive-2.907
 futile-2.815
 undermines-2.776
 urgently-2.700
 vulnerable-2.672
Token to
Token ID284
Feature activation-4.36e+00

Pos logit contributions
ledged+1.086
tymology+1.085
been+1.066
 origins+1.043
iths+1.040
Neg logit contributions
 counterproductive-0.603
 futile-0.579
 undermines-0.564
 urgently-0.547
 vulnerable-0.542
Token an
Token ID281
Feature activation+3.74e-02

Pos logit contributions
tymology+4.458
ledged+4.457
been+4.381
 origins+4.316
iths+4.289
Neg logit contributions
 counterproductive-3.211
 futile-3.103
 undermines-3.036
 urgently-2.977
 vulnerable-2.927
Token upset
Token ID9247
Feature activation-8.12e-01

Pos logit contributions
 counterproductive+0.029
 futile+0.028
 undermines+0.027
 urgently+0.027
 vulnerable+0.027
Neg logit contributions
tymology-0.038
ledged-0.038
been-0.037
essor-0.036
iths-0.036
Token most
Token ID749
Feature activation-5.45e+00

Pos logit contributions
tymology+9.761
ledged+9.678
 origins+9.635
been+9.523
iths+9.432
Neg logit contributions
 counterproductive-7.478
 futile-7.222
 undermines-7.113
 urgently-6.876
 threatens-6.790
Token of
Token ID286
Feature activation-2.37e+00

Pos logit contributions
tymology+5.123
ledged+5.067
been+4.997
 origins+4.949
iths+4.861
Neg logit contributions
 counterproductive-4.393
 futile-4.258
 undermines-4.186
 urgently-4.049
 unsafe-4.031
Token the
Token ID262
Feature activation-5.76e+00

Pos logit contributions
tymology+2.487
ledged+2.465
been+2.420
essor+2.362
iths+2.360
Neg logit contributions
 counterproductive-1.763
 futile-1.712
 undermines-1.669
 urgently-1.622
 vulnerable-1.617
Token Italian
Token ID8200
Feature activation-2.20e+00

Pos logit contributions
tymology+5.445
ledged+5.399
been+5.373
 origins+5.363
iths+5.248
Neg logit contributions
 counterproductive-4.486
 futile-4.333
 undermines-4.326
 urgently-4.151
 threatens-4.123
Token Army
Token ID5407
Feature activation-4.92e+00

Pos logit contributions
tymology+2.149
ledged+2.132
been+2.094
 origins+2.061
iths+2.032
Neg logit contributions
 counterproductive-1.766
 futile-1.713
 undermines-1.676
 urgently-1.634
 unsafe-1.618
Token had
Token ID550
Feature activation-3.41e+01

Pos logit contributions
tymology+4.820
ledged+4.753
been+4.677
 origins+4.625
iths+4.561
Neg logit contributions
 counterproductive-3.845
 futile-3.725
 undermines-3.636
 vulnerable-3.511
 urgently-3.507
Token been
Token ID587
Feature activation-4.18e+00

Pos logit contributions
 origins+27.937
been+27.796
 ancestors+26.762
iths+26.305
 predecessors+26.029
Neg logit contributions
 undermines-12.948
 counterproductive-12.936
Repeat-12.467
 accountable-11.996
 threatens-11.917
Token intern
Token ID1788
Feature activation-2.04e+00

Pos logit contributions
ledged+4.336
tymology+4.334
been+4.314
 origins+4.223
iths+4.195
Neg logit contributions
 counterproductive-2.902
 futile-2.787
 undermines-2.769
 urgently-2.636
 threatens-2.631
Tokened
Token ID276
Feature activation+1.90e+00

Pos logit contributions
ledged+1.535
tymology+1.524
been+1.494
iths+1.440
essor+1.436
Neg logit contributions
 counterproductive-2.125
 futile-2.076
 undermines-2.027
 urgently-1.996
 vulnerable-1.991
Token by
Token ID416
Feature activation+4.72e+00

Pos logit contributions
 counterproductive+1.327
 futile+1.290
 undermines+1.249
 vulnerable+1.223
 urgently+1.223
Neg logit contributions
tymology-2.099
ledged-2.081
been-2.040
essor-2.001
iths-1.984
Token German
Token ID2679
Feature activation+1.89e+00

Pos logit contributions
 counterproductive+3.105
 futile+3.023
 undermines+2.867
 vulnerable+2.865
 urgently+2.856
Neg logit contributions
tymology-5.373
ledged-5.334
been-5.234
essor-5.114
iths-5.075
Tokenincluding
Token ID8201
Feature activation-6.49e+00

Pos logit contributions
been+14.215
ledged+13.632
tymology+13.628
essor+12.999
 origins+12.922
Neg logit contributions
 counterproductive-12.043
 undermines-11.746
 futile-11.725
 urgently-11.227
 vulnerable-11.079
Token my
Token ID616
Feature activation-1.53e+01

Pos logit contributions
tymology+6.245
ledged+6.241
been+6.180
 origins+6.084
essor+6.047
Neg logit contributions
 counterproductive-4.971
 undermines-4.825
 futile-4.820
 urgently-4.598
 vulnerable-4.517
Token wife
Token ID3656
Feature activation-7.66e+00

Pos logit contributions
 ancestors+13.517
 origins+13.260
tymology+13.031
been+12.956
 predecessors+12.950
Neg logit contributions
 undermines-10.954
 counterproductive-10.683
 futile-10.228
 threatens-10.060
 urgently-9.967
Token).
Token ID737
Feature activation-1.00e+01

Pos logit contributions
ledged+7.334
tymology+7.330
been+7.266
 origins+7.098
essor+7.077
Neg logit contributions
 counterproductive-5.840
 futile-5.643
 undermines-5.588
 vulnerable-5.312
 unsafe-5.293
Token I
Token ID314
Feature activation-7.77e+00

Pos logit contributions
been+9.138
ledged+8.869
 origins+8.799
tymology+8.651
essor+8.633
Neg logit contributions
 counterproductive-7.906
 futile-7.624
 undermines-7.575
 vulnerable-7.199
 urgently-7.044
Token had
Token ID550
Feature activation-3.41e+01

Pos logit contributions
tymology+7.422
been+7.285
ledged+7.249
 origins+6.927
essor+6.855
Neg logit contributions
 counterproductive-6.063
 undermines-5.818
 futile-5.721
 vulnerable-5.513
 unsafe-5.457
Token no
Token ID645
Feature activation-1.77e+01

Pos logit contributions
been+26.224
 origins+26.197
 ancestors+25.997
 been+24.754
ledged+24.282
Neg logit contributions
 undermines-15.134
 counterproductive-14.205
 threatens-13.163
Repeat-13.109
 futile-13.040
Token job
Token ID1693
Feature activation-6.56e+00

Pos logit contributions
 origins+15.342
 ancestors+15.277
been+14.807
ledged+14.771
tymology+14.747
Neg logit contributions
 counterproductive-11.542
 undermines-11.358
 futile-11.096
 urgently-10.758
 threatens-10.534
Token lined
Token ID16566
Feature activation-2.68e+00

Pos logit contributions
tymology+6.584
been+6.543
ledged+6.496
 origins+6.404
 ancestors+6.247
Neg logit contributions
 counterproductive-4.898
 futile-4.766
 undermines-4.609
 vulnerable-4.450
 unsafe-4.422
Token up
Token ID510
Feature activation-6.61e+00

Pos logit contributions
tymology+3.425
ledged+3.416
been+3.379
essor+3.328
 origins+3.310
Neg logit contributions
 counterproductive-1.348
 futile-1.277
 undermines-1.227
 urgently-1.158
 vulnerable-1.157
Token,
Token ID11
Feature activation-9.97e+00

Pos logit contributions
ledged+6.493
tymology+6.487
been+6.435
 origins+6.324
iths+6.236
Neg logit contributions
 counterproductive-4.929
 futile-4.774
 undermines-4.631
 vulnerable-4.470
 unsafe-4.428
Token kn
Token ID638
Feature activation-4.62e+00

Pos logit contributions
 counterproductive+3.854
 futile+3.775
 undermines+3.654
 vulnerable+3.603
 urgently+3.589
Neg logit contributions
tymology-4.736
ledged-4.674
been-4.554
essor-4.475
iths-4.442
Tokenolls
Token ID33421
Feature activation+8.02e+00

Pos logit contributions
ledged+3.301
tymology+3.233
been+3.185
iths+3.069
essor+3.062
Neg logit contributions
 counterproductive-5.061
 futile-4.936
 undermines-4.766
 urgently-4.735
 vulnerable-4.703
Token fathers
Token ID17150
Feature activation+5.15e-01

Pos logit contributions
 counterproductive+5.636
 futile+5.551
 undermines+5.366
 vulnerable+5.343
 urgently+5.256
Neg logit contributions
tymology-8.530
ledged-8.409
been-8.197
essor-8.047
iths-8.008
Token and
Token ID290
Feature activation+5.71e+00

Pos logit contributions
 counterproductive+0.359
 futile+0.347
 undermines+0.338
 urgently+0.329
 vulnerable+0.328
Neg logit contributions
tymology-0.566
ledged-0.562
been-0.552
essor-0.540
iths-0.537
Token sons
Token ID11989
Feature activation+4.74e+00

Pos logit contributions
 counterproductive+6.642
 futile+6.570
 undermines+6.423
 vulnerable+6.411
 urgently+6.320
Neg logit contributions
tymology-3.546
ledged-3.466
been-3.377
essor-3.238
iths-3.196
Token kick
Token ID4829
Feature activation+1.45e+01

Pos logit contributions
 counterproductive+3.488
 futile+3.430
 undermines+3.306
 vulnerable+3.258
 urgently+3.258
Neg logit contributions
tymology-5.003
ledged-4.965
been-4.856
essor-4.764
iths-4.718
Token football
Token ID4346
Feature activation+1.02e+01

Pos logit contributions
 futile+11.897
 counterproductive+11.782
 harder+11.418
 vulnerable+11.404
 undermines+11.086
Neg logit contributions
tymology-12.257
ledged-11.864
been-11.863
essor-11.534
iths-11.187
Tokens
Token ID82
Feature activation+1.47e+01

Pos logit contributions
 counterproductive+8.888
 futile+8.792
 undermines+8.513
 harder+8.509
 vulnerable+8.443
Neg logit contributions
tymology-8.962
ledged-8.805
been-8.604
essor-8.425
iths-8.252
Token to
Token ID284
Feature activation+7.58e+00

Pos logit contributions
 futile+10.909
 counterproductive+10.848
 harder+10.559
 vulnerable+10.407
 undermines+10.323
Neg logit contributions
tymology-13.768
ledged-13.231
essor-13.083
been-12.996
iths-12.436
Token each
Token ID1123
Feature activation+3.12e+00

Pos logit contributions
 counterproductive+5.784
 futile+5.763
 undermines+5.473
 vulnerable+5.447
 harder+5.398
Neg logit contributions
tymology-7.649
ledged-7.487
been-7.411
essor-7.290
iths-7.050
Token other
Token ID584
Feature activation+8.19e+00

Pos logit contributions
 counterproductive+2.203
 futile+2.179
 undermines+2.087
 vulnerable+2.073
 urgently+2.035
Neg logit contributions
tymology-3.385
ledged-3.338
been-3.279
essor-3.231
iths-3.167
Token further
Token ID2252
Feature activation+8.34e+00

Pos logit contributions
 counterproductive+2.029
 futile+1.984
 undermines+1.919
 vulnerable+1.892
 urgently+1.891
Neg logit contributions
tymology-2.528
ledged-2.506
been-2.450
essor-2.391
iths-2.383
Token along
Token ID1863
Feature activation+4.97e+00

Pos logit contributions
 counterproductive+6.150
 futile+6.106
 undermines+5.885
 vulnerable+5.828
 urgently+5.825
Neg logit contributions
tymology-8.642
ledged-8.506
been-8.300
essor-8.125
iths-8.121
Token he
Token ID339
Feature activation+1.21e+00

Pos logit contributions
 counterproductive+3.376
 futile+3.310
 undermines+3.191
 urgently+3.133
 vulnerable+3.133
Neg logit contributions
tymology-5.521
ledged-5.464
been-5.354
essor-5.240
iths-5.215
Token walks
Token ID11114
Feature activation+7.24e+00

Pos logit contributions
 counterproductive+0.749
 futile+0.718
 undermines+0.690
 vulnerable+0.674
 urgently+0.671
Neg logit contributions
tymology-1.428
ledged-1.421
been-1.400
essor-1.368
 origins-1.364
Token,
Token ID11
Feature activation+1.14e+01

Pos logit contributions
 counterproductive+4.638
 futile+4.544
 undermines+4.378
 urgently+4.311
 vulnerable+4.288
Neg logit contributions
tymology-8.309
ledged-8.205
been-8.015
essor-7.870
iths-7.797
Token the
Token ID262
Feature activation+1.24e+01

Pos logit contributions
 counterproductive+6.721
 futile+6.626
 harder+6.461
 vulnerable+6.317
 urgently+6.284
Neg logit contributions
tymology-12.913
ledged-12.742
been-12.489
iths-12.067
essor-12.017
Token greater
Token ID3744
Feature activation+6.28e+00

Pos logit contributions
 harder+8.004
 counterproductive+7.921
 futile+7.859
 vulnerable+7.490
 urgently+7.355
Neg logit contributions
tymology-13.710
ledged-13.440
been-12.928
essor-12.689
iths-12.653
Token the
Token ID262
Feature activation+6.38e+00

Pos logit contributions
 counterproductive+4.558
 futile+4.485
 undermines+4.342
 vulnerable+4.289
 urgently+4.257
Neg logit contributions
tymology-6.628
ledged-6.545
been-6.411
essor-6.277
iths-6.272
Token destruction
Token ID8166
Feature activation+5.51e+00

Pos logit contributions
 counterproductive+5.709
 futile+5.628
 undermines+5.419
 vulnerable+5.393
 urgently+5.369
Neg logit contributions
tymology-5.742
ledged-5.661
been-5.490
essor-5.393
iths-5.350
Token and
Token ID290
Feature activation+1.26e+01

Pos logit contributions
 counterproductive+4.419
 futile+4.348
 undermines+4.271
 urgently+4.153
 vulnerable+4.146
Neg logit contributions
tymology-5.494
ledged-5.427
been-5.290
essor-5.166
iths-5.121
Token injuries
Token ID6821
Feature activation+9.04e+00

Pos logit contributions
 counterproductive+10.512
 futile+10.383
 undermines+10.192
 harder+10.059
 vulnerable+9.933
Neg logit contributions
tymology-11.293
ledged-11.217
been-10.799
essor-10.568
iths-10.533
Token
Token ID251
Feature activation+1.10e+01

Pos logit contributions
 counterproductive+7.451
 futile+7.226
 undermines+7.082
 vulnerable+7.072
 urgently+6.878
Neg logit contributions
tymology-6.642
ledged-6.552
essor-6.392
been-6.377
iths-6.217
Token
Token ID198
Feature activation+1.04e+00

Pos logit contributions
 counterproductive+6.570
 futile+6.359
 undermines+6.324
 urgently+6.124
 vulnerable+6.122
Neg logit contributions
tymology-12.213
ledged-11.906
been-11.446
essor-11.323
iths-11.239
Token
Token ID198
Feature activation+5.29e+00

Pos logit contributions
 counterproductive+0.822
 futile+0.799
 undermines+0.778
 vulnerable+0.753
 urgently+0.748
Neg logit contributions
ledged-1.048
tymology-1.044
been-1.040
essor-1.003
iths-1.000
TokenA
Token ID32
Feature activation+1.12e+01

Pos logit contributions
 counterproductive+3.712
 futile+3.583
 undermines+3.506
 urgently+3.421
 vulnerable+3.406
Neg logit contributions
tymology-5.686
ledged-5.645
been-5.462
essor-5.404
iths-5.377
Token few
Token ID1178
Feature activation+1.07e+01

Pos logit contributions
 counterproductive+9.869
 futile+9.716
 vulnerable+9.381
 harder+9.356
 undermines+9.271
Neg logit contributions
tymology-9.436
ledged-9.397
been-8.993
essor-8.961
iths-8.874
Token days
Token ID1528
Feature activation+1.62e+01

Pos logit contributions
 counterproductive+9.656
 futile+9.468
 vulnerable+9.167
 undermines+9.153
 harder+9.124
Neg logit contributions
ledged-8.957
tymology-8.947
been-8.576
essor-8.385
iths-8.197
Token before
Token ID878
Feature activation+4.29e+00

Pos logit contributions
 counterproductive+11.622
 futile+11.392
 harder+11.385
 undermines+11.168
 vulnerable+11.083
Neg logit contributions
tymology-14.817
ledged-14.413
essor-13.634
been-13.531
iths-13.419
Token my
Token ID616
Feature activation-1.44e+00

Pos logit contributions
 counterproductive+3.133
 futile+3.052
 undermines+2.955
 vulnerable+2.907
 urgently+2.900
Neg logit contributions
tymology-4.575
ledged-4.537
been-4.431
essor-4.328
iths-4.307
Token last
Token ID938
Feature activation-5.10e+00

Pos logit contributions
tymology+1.411
ledged+1.397
been+1.378
essor+1.341
 origins+1.341
Neg logit contributions
 counterproductive-1.156
 futile-1.122
 undermines-1.105
 urgently-1.072
 vulnerable-1.066
Token phone
Token ID3072
Feature activation+5.80e+00

Pos logit contributions
tymology+4.906
ledged+4.817
been+4.806
 origins+4.709
essor+4.669
Neg logit contributions
 counterproductive-3.995
 undermines-3.844
 futile-3.842
 urgently-3.707
 vulnerable-3.651
Token call
Token ID869
Feature activation+6.46e+00

Pos logit contributions
 counterproductive+3.851
 futile+3.799
 undermines+3.613
 urgently+3.611
 vulnerable+3.573
Neg logit contributions
tymology-6.419
ledged-6.404
been-6.211
essor-6.160
iths-6.133
Token,
Token ID11
Feature activation+3.87e-02

Pos logit contributions
 counterproductive+3.572
 futile+3.464
 undermines+3.406
 urgently+3.384
 vulnerable+3.349
Neg logit contributions
ledged-4.086
tymology-4.080
been-3.978
essor-3.888
iths-3.871
Token which
Token ID543
Feature activation+2.75e+00

Pos logit contributions
 counterproductive+0.029
 futile+0.029
 undermines+0.028
 vulnerable+0.027
 urgently+0.027
Neg logit contributions
tymology-0.040
ledged-0.039
been-0.039
 origins-0.038
essor-0.038
Token looks
Token ID3073
Feature activation+7.81e+00

Pos logit contributions
 counterproductive+1.844
 futile+1.788
 undermines+1.731
 urgently+1.690
 vulnerable+1.687
Neg logit contributions
tymology-3.102
ledged-3.083
been-3.026
essor-2.962
iths-2.947
Token to
Token ID284
Feature activation-1.78e+00

Pos logit contributions
 counterproductive+5.861
 futile+5.703
 vulnerable+5.536
 urgently+5.474
 undermines+5.442
Neg logit contributions
tymology-8.237
ledged-8.144
been-7.877
essor-7.874
iths-7.721
Token be
Token ID307
Feature activation+7.81e+00

Pos logit contributions
tymology+1.620
ledged+1.608
been+1.580
essor+1.543
 origins+1.534
Neg logit contributions
 counterproductive-1.551
 futile-1.512
 undermines-1.476
 vulnerable-1.425
 urgently-1.423
Token heading
Token ID9087
Feature activation+1.76e+01

Pos logit contributions
 counterproductive+5.310
 futile+5.168
 vulnerable+4.963
 urgently+4.925
 undermines+4.914
Neg logit contributions
tymology-8.719
ledged-8.657
been-8.404
essor-8.287
iths-8.255
Token toward
Token ID3812
Feature activation+6.55e+00

Pos logit contributions
 counterproductive+11.406
 harder+11.177
 futile+11.097
 vulnerable+10.958
 urgently+10.840
Neg logit contributions
tymology-17.976
ledged-17.875
been-17.402
essor-17.014
iths-16.897
Token needing
Token ID18139
Feature activation-5.34e+00

Pos logit contributions
 counterproductive+4.831
 futile+4.665
 vulnerable+4.498
 undermines+4.457
 urgently+4.450
Neg logit contributions
tymology-6.973
ledged-6.905
been-6.723
iths-6.509
essor-6.505
Token a
Token ID257
Feature activation+8.43e+00

Pos logit contributions
tymology+5.591
ledged+5.575
been+5.442
 origins+5.422
iths+5.389
Neg logit contributions
 counterproductive-3.683
 futile-3.586
 undermines-3.497
 vulnerable-3.344
 unsafe-3.338
Token full
Token ID1336
Feature activation+4.50e+00

Pos logit contributions
 counterproductive+6.377
 futile+6.209
 urgently+5.957
 harder+5.946
 vulnerable+5.936
Neg logit contributions
tymology-8.698
ledged-8.570
been-8.428
essor-8.240
iths-8.170
Token bailout
Token ID29928
Feature activation+6.36e+00

Pos logit contributions
 counterproductive+3.211
 futile+3.111
 undermines+3.018
 urgently+2.965
 vulnerable+2.949
Neg logit contributions
tymology-4.868
ledged-4.808
been-4.735
essor-4.640
iths-4.623
Token production
Token ID3227
Feature activation+1.84e+00

Pos logit contributions
 counterproductive+3.316
 futile+3.227
 undermines+3.158
 vulnerable+3.115
 urgently+3.109
Neg logit contributions
tymology-3.816
ledged-3.781
been-3.692
essor-3.626
iths-3.584
Token.
Token ID13
Feature activation+2.41e+00

Pos logit contributions
 counterproductive+1.525
 futile+1.486
 undermines+1.449
 urgently+1.424
 vulnerable+1.421
Neg logit contributions
tymology-1.782
ledged-1.766
been-1.729
essor-1.692
iths-1.673
Token In
Token ID554
Feature activation+4.46e+00

Pos logit contributions
 counterproductive+1.713
 futile+1.663
 undermines+1.615
 urgently+1.581
 vulnerable+1.577
Neg logit contributions
tymology-2.625
ledged-2.602
been-2.549
essor-2.493
iths-2.483
Token addition
Token ID3090
Feature activation-2.42e+00

Pos logit contributions
 counterproductive+3.576
 futile+3.481
 undermines+3.366
 vulnerable+3.337
 urgently+3.315
Neg logit contributions
tymology-4.430
ledged-4.410
been-4.297
essor-4.207
iths-4.169
Token,
Token ID11
Feature activation+5.63e+00

Pos logit contributions
tymology+2.540
ledged+2.521
been+2.472
essor+2.415
iths+2.400
Neg logit contributions
 counterproductive-1.813
 futile-1.763
 undermines-1.712
 urgently-1.677
 vulnerable-1.675
Token a
Token ID257
Feature activation+1.31e+01

Pos logit contributions
 counterproductive+4.106
 futile+3.971
 undermines+3.885
 vulnerable+3.821
 urgently+3.815
Neg logit contributions
tymology-5.980
ledged-5.912
been-5.816
essor-5.668
iths-5.624
Token Quad
Token ID20648
Feature activation+4.56e+00

Pos logit contributions
 counterproductive+10.709
 futile+10.479
 vulnerable+10.203
 harder+10.029
 undermines+9.925
Neg logit contributions
tymology-11.917
ledged-11.523
been-11.366
iths-11.241
essor-11.014
Tokenren
Token ID918
Feature activation+9.00e-01

Pos logit contributions
 counterproductive+4.888
 futile+4.789
 undermines+4.697
 urgently+4.661
 vulnerable+4.648
Neg logit contributions
tymology-3.271
ledged-3.230
been-3.099
essor-3.030
iths-2.995
Tokenn
Token ID77
Feature activation-4.10e+00

Pos logit contributions
 counterproductive+1.026
 futile+1.008
 undermines+0.988
 urgently+0.975
 vulnerable+0.974
Neg logit contributions
tymology-0.593
ledged-0.586
been-0.568
essor-0.547
iths-0.541
Tokenial
Token ID498
Feature activation+4.03e+00

Pos logit contributions
ledged+2.343
tymology+2.296
been+2.233
essor+2.156
iths+2.124
Neg logit contributions
 counterproductive-4.948
 futile-4.881
 undermines-4.748
 urgently-4.689
 unsafe-4.686
Token Energy
Token ID6682
Feature activation+7.40e+00

Pos logit contributions
 counterproductive+2.631
 futile+2.549
 undermines+2.461
 urgently+2.401
 vulnerable+2.399
Neg logit contributions
tymology-4.607
ledged-4.578
been-4.496
essor-4.403
iths-4.381
Token as
Token ID355
Feature activation+5.78e+00

Pos logit contributions
tymology+1.749
ledged+1.735
been+1.716
 origins+1.685
essor+1.670
Neg logit contributions
 counterproductive-1.195
 futile-1.157
 undermines-1.127
 urgently-1.085
 vulnerable-1.081
Token Daniel
Token ID7806
Feature activation+8.80e+00

Pos logit contributions
 counterproductive+4.594
 futile+4.487
 undermines+4.330
 vulnerable+4.304
 urgently+4.261
Neg logit contributions
tymology-5.783
ledged-5.736
been-5.594
essor-5.457
iths-5.429
Token Sparks
Token ID48117
Feature activation+6.97e+00

Pos logit contributions
 counterproductive+6.285
 futile+6.187
 undermines+5.926
 vulnerable+5.912
 harder+5.807
Neg logit contributions
tymology-9.181
ledged-9.103
been-8.824
essor-8.636
iths-8.608
Token,
Token ID11
Feature activation+1.47e+00

Pos logit contributions
 counterproductive+5.396
 futile+5.326
 undermines+5.187
 vulnerable+5.076
 urgently+5.062
Neg logit contributions
tymology-6.955
ledged-6.871
been-6.679
essor-6.557
iths-6.501
Token the
Token ID262
Feature activation-1.88e+00

Pos logit contributions
 counterproductive+1.151
 futile+1.121
 undermines+1.089
 urgently+1.068
 vulnerable+1.067
Neg logit contributions
tymology-1.490
ledged-1.479
been-1.449
essor-1.415
iths-1.408
Token Goldman
Token ID18740
Feature activation+1.02e+00

Pos logit contributions
tymology+1.872
ledged+1.865
been+1.837
 origins+1.807
essor+1.802
Neg logit contributions
 counterproductive-1.461
 futile-1.423
 undermines-1.406
 urgently-1.352
 vulnerable-1.339
Token partner
Token ID5212
Feature activation-4.35e+00

Pos logit contributions
 counterproductive+0.743
 futile+0.722
 undermines+0.700
 urgently+0.682
 vulnerable+0.680
Neg logit contributions
tymology-1.089
ledged-1.082
been-1.063
essor-1.043
 origins-1.034
Token overseeing
Token ID29852
Feature activation-3.61e-01

Pos logit contributions
tymology+4.599
ledged+4.569
been+4.497
essor+4.423
 origins+4.413
Neg logit contributions
 counterproductive-3.086
 futile-3.003
 undermines-2.890
 urgently-2.784
 unsafe-2.781
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
tymology+0.390
ledged+0.387
been+0.381
essor+0.373
 origins+0.373
Neg logit contributions
 counterproductive-0.255
 futile-0.248
 undermines-0.241
 urgently-0.235
 vulnerable-0.233
Token firm
Token ID4081
Feature activation-7.05e+00

Pos logit contributions
tymology+1.246
ledged+1.231
been+1.221
 origins+1.208
essor+1.190
Neg logit contributions
 counterproductive-0.907
 futile-0.882
 undermines-0.871
 urgently-0.841
 vulnerable-0.831
Token
Token ID447
Feature activation-4.09e+00

Pos logit contributions
tymology+7.506
ledged+7.464
been+7.376
 origins+7.242
essor+7.175
Neg logit contributions
 counterproductive-4.772
 futile-4.618
 undermines-4.403
 urgently-4.284
 vulnerable-4.273
Token which
Token ID543
Feature activation-9.79e-01

Pos logit contributions
 counterproductive+0.430
 futile+0.419
 undermines+0.406
 urgently+0.398
 vulnerable+0.397
Neg logit contributions
tymology-0.539
ledged-0.536
been-0.527
essor-0.513
iths-0.511
Token denied
Token ID6699
Feature activation-8.80e+00

Pos logit contributions
tymology+1.075
ledged+1.069
been+1.054
essor+1.030
 origins+1.027
Neg logit contributions
 counterproductive-0.677
 futile-0.656
 undermines-0.622
 vulnerable-0.611
 urgently-0.610
Token wrongdoing
Token ID25013
Feature activation+2.99e-01

Pos logit contributions
ledged+8.847
been+8.705
essor+8.658
tymology+8.653
 origins+8.641
Neg logit contributions
 counterproductive-6.024
 futile-5.880
 undermines-5.630
 urgently-5.548
 harder-5.356
Token.
Token ID13
Feature activation+2.61e+00

Pos logit contributions
 counterproductive+0.231
 futile+0.225
 undermines+0.218
 urgently+0.214
 vulnerable+0.214
Neg logit contributions
tymology-0.306
ledged-0.305
been-0.298
essor-0.291
iths-0.290
Token
Token ID198
Feature activation+8.96e-01

Pos logit contributions
 counterproductive+1.828
 futile+1.775
 undermines+1.725
 urgently+1.692
 vulnerable+1.685
Neg logit contributions
tymology-2.866
ledged-2.842
been-2.774
essor-2.719
iths-2.704
Token
Token ID198
Feature activation+5.90e+00

Pos logit contributions
 counterproductive+0.713
 futile+0.695
 undermines+0.676
 vulnerable+0.653
 urgently+0.648
Neg logit contributions
ledged-0.896
tymology-0.894
been-0.892
essor-0.859
 origins-0.857
TokenPr
Token ID6836
Feature activation-1.21e-03

Pos logit contributions
 counterproductive+4.186
 futile+4.038
 undermines+3.951
 urgently+3.887
 vulnerable+3.862
Neg logit contributions
tymology-6.224
ledged-6.186
been-5.953
essor-5.924
iths-5.877
Tokenather
Token ID1032
Feature activation+1.11e+01

Pos logit contributions
tymology+0.001
ledged+0.001
been+0.001
essor+0.001
iths+0.001
Neg logit contributions
 counterproductive-0.001
 futile-0.001
 undermines-0.001
 vulnerable-0.001
 urgently-0.001
Token
Token ID447
Feature activation-9.62e-01

Pos logit contributions
 counterproductive+8.677
 futile+8.589
 undermines+8.530
 urgently+8.395
 threatens+8.338
Neg logit contributions
tymology-10.182
ledged-10.154
essor-9.808
been-9.772
iths-9.632
Token
Token ID247
Feature activation+5.72e+00

Pos logit contributions
tymology+0.694
ledged+0.689
been+0.669
essor+0.642
iths+0.640
Neg logit contributions
 counterproductive-1.033
 futile-1.017
 undermines-0.994
 urgently-0.983
 vulnerable-0.977
Tokens
Token ID82
Feature activation-6.74e+00

Pos logit contributions
 counterproductive+4.739
 futile+4.639
 undermines+4.580
 urgently+4.501
 vulnerable+4.459
Neg logit contributions
tymology-5.315
ledged-5.275
been-5.156
essor-5.063
iths-5.020
Token is
Token ID318
Feature activation-8.38e+00

Pos logit contributions
tymology+3.550
ledged+3.515
been+3.490
essor+3.402
 origins+3.399
Neg logit contributions
 counterproductive-1.583
 futile-1.517
 undermines-1.415
 vulnerable-1.387
 harder-1.371
Token no
Token ID645
Feature activation+1.49e+00

Pos logit contributions
ledged+7.320
tymology+7.279
been+7.277
 origins+7.162
iths+6.893
Neg logit contributions
 counterproductive-6.692
 futile-6.544
 undermines-6.516
 unsafe-6.210
 vulnerable-6.167
Token one
Token ID530
Feature activation+2.01e+00

Pos logit contributions
 counterproductive+1.154
 futile+1.122
 undermines+1.096
 urgently+1.068
 vulnerable+1.065
Neg logit contributions
tymology-1.518
ledged-1.509
been-1.480
 origins-1.441
essor-1.441
Token who
Token ID508
Feature activation+2.14e+00

Pos logit contributions
 counterproductive+1.442
 futile+1.400
 undermines+1.361
 urgently+1.324
 vulnerable+1.320
Neg logit contributions
tymology-2.161
ledged-2.150
been-2.107
essor-2.062
iths-2.046
Token can
Token ID460
Feature activation-6.90e+00

Pos logit contributions
 counterproductive+1.264
 futile+1.217
 undermines+1.161
 vulnerable+1.137
 urgently+1.130
Neg logit contributions
tymology-2.570
ledged-2.552
been-2.510
essor-2.463
iths-2.453
Token define
Token ID8160
Feature activation+1.84e+00

Pos logit contributions
tymology+6.960
ledged+6.843
been+6.645
 origins+6.622
essor+6.588
Neg logit contributions
 counterproductive-4.787
 futile-4.464
 undermines-4.443
 unsafe-4.263
 vulnerable-4.223
Token humanitarian
Token ID15706
Feature activation+7.43e+00

Pos logit contributions
 counterproductive+1.306
 futile+1.269
 undermines+1.230
 vulnerable+1.200
 urgently+1.199
Neg logit contributions
tymology-1.995
ledged-1.981
been-1.944
essor-1.900
iths-1.892
Tokenism
Token ID1042
Feature activation+3.48e+00

Pos logit contributions
 counterproductive+7.261
 futile+7.114
 undermines+6.878
 vulnerable+6.856
 urgently+6.853
Neg logit contributions
ledged-6.029
tymology-5.943
been-5.771
essor-5.667
iths-5.613
Token or
Token ID393
Feature activation+3.87e+00

Pos logit contributions
 counterproductive+2.504
 futile+2.434
 undermines+2.357
 urgently+2.314
 vulnerable+2.306
Neg logit contributions
tymology-3.745
ledged-3.718
been-3.651
essor-3.569
iths-3.550
Token morality
Token ID18016
Feature activation+1.91e+00

Pos logit contributions
 counterproductive+2.846
 futile+2.768
 undermines+2.685
 urgently+2.627
 vulnerable+2.624
Neg logit contributions
tymology-4.118
ledged-4.092
been-4.016
essor-3.925
iths-3.910
Token when
Token ID618
Feature activation-4.50e-01

Pos logit contributions
 counterproductive+1.325
 futile+1.286
 undermines+1.244
 vulnerable+1.215
 urgently+1.214
Neg logit contributions
tymology-2.112
ledged-2.099
been-2.059
essor-2.015
iths-2.005
Token city
Token ID1748
Feature activation-3.00e+00

Pos logit contributions
tymology+2.006
ledged+1.992
been+1.948
 origins+1.924
essor+1.923
Neg logit contributions
 counterproductive-1.331
 futile-1.284
 undermines-1.266
 urgently-1.227
 vulnerable-1.205
Token held
Token ID2714
Feature activation-9.81e+00

Pos logit contributions
tymology+2.992
ledged+2.969
been+2.900
essor+2.851
 origins+2.842
Neg logit contributions
 counterproductive-2.368
 futile-2.284
 undermines-2.206
 vulnerable-2.167
 urgently-2.162
Token.
Token ID13
Feature activation-1.50e+00

Pos logit contributions
ledged+10.317
tymology+10.211
been+10.188
 origins+10.144
essor+9.995
Neg logit contributions
 counterproductive-6.107
 futile-5.722
 undermines-5.596
 urgently-5.468
 vulnerable-5.208
Token The
Token ID383
Feature activation-1.96e-01

Pos logit contributions
tymology+1.541
ledged+1.540
been+1.525
essor+1.485
iths+1.474
Neg logit contributions
 counterproductive-1.135
 futile-1.108
 undermines-1.070
 vulnerable-1.045
 urgently-1.044
Token settlement
Token ID9443
Feature activation+7.32e+00

Pos logit contributions
tymology+0.209
ledged+0.207
been+0.203
essor+0.199
 origins+0.199
Neg logit contributions
 counterproductive-0.143
 futile-0.139
 undermines-0.136
 urgently-0.132
 vulnerable-0.131
Token was
Token ID373
Feature activation+6.51e+00

Pos logit contributions
 counterproductive+5.228
 futile+5.102
 undermines+5.058
 urgently+4.877
 threatens+4.848
Neg logit contributions
tymology-7.782
ledged-7.692
been-7.569
essor-7.417
iths-7.408
Token arguably
Token ID15242
Feature activation+1.27e+01

Pos logit contributions
 counterproductive+5.078
 futile+4.948
 undermines+4.737
 urgently+4.724
 vulnerable+4.707
Neg logit contributions
tymology-6.598
ledged-6.522
been-6.362
essor-6.284
iths-6.202
Token an
Token ID281
Feature activation+1.11e+01

Pos logit contributions
 counterproductive+9.888
 futile+9.546
 harder+9.090
 vulnerable+8.975
 unsafe+8.961
Neg logit contributions
tymology-12.531
ledged-12.403
essor-12.048
been-11.947
iths-11.834
Token inevitable
Token ID13203
Feature activation+8.57e+00

Pos logit contributions
 counterproductive+8.979
 futile+8.821
 unsafe+8.506
 undermines+8.378
 urgently+8.273
Neg logit contributions
ledged-10.537
tymology-10.328
been-10.233
essor-10.041
iths-9.965
Token in
Token ID287
Feature activation+8.94e-01

Pos logit contributions
 counterproductive+6.251
 futile+6.083
 undermines+5.829
 vulnerable+5.757
 harder+5.683
Neg logit contributions
ledged-9.009
tymology-8.956
been-8.750
iths-8.636
essor-8.621
Token light
Token ID1657
Feature activation-3.18e+00

Pos logit contributions
 counterproductive+0.684
 futile+0.664
 undermines+0.646
 urgently+0.634
 vulnerable+0.631
Neg logit contributions
tymology-0.923
ledged-0.914
been-0.900
essor-0.877
iths-0.872
Token
Token ID198
Feature activation+2.13e+00

Pos logit contributions
been+2.438
ledged+2.412
tymology+2.349
 origins+2.322
iths+2.303
Neg logit contributions
 counterproductive-2.332
 futile-2.273
 undermines-2.226
 vulnerable-2.128
 urgently-2.099
TokenOver
Token ID5886
Feature activation+6.18e-01

Pos logit contributions
 counterproductive+1.563
 futile+1.519
 undermines+1.473
 urgently+1.443
 vulnerable+1.442
Neg logit contributions
tymology-2.254
ledged-2.236
been-2.188
essor-2.143
iths-2.134
Tokendrive
Token ID19472
Feature activation+6.46e+00

Pos logit contributions
 counterproductive+0.480
 futile+0.467
 undermines+0.454
 urgently+0.444
 vulnerable+0.443
Neg logit contributions
tymology-0.631
ledged-0.628
been-0.615
essor-0.601
iths-0.597
Token is
Token ID318
Feature activation+1.61e+00

Pos logit contributions
 counterproductive+4.726
 futile+4.615
 undermines+4.533
 vulnerable+4.420
 urgently+4.402
Neg logit contributions
tymology-6.752
ledged-6.662
been-6.503
essor-6.369
iths-6.368
Token a
Token ID257
Feature activation+3.89e+00

Pos logit contributions
 counterproductive+1.240
 futile+1.208
 undermines+1.175
 urgently+1.149
 vulnerable+1.145
Neg logit contributions
tymology-1.646
ledged-1.637
been-1.606
essor-1.567
iths-1.558
Token piece
Token ID3704
Feature activation-3.70e+00

Pos logit contributions
 counterproductive+3.056
 futile+2.977
 undermines+2.889
 vulnerable+2.836
 urgently+2.832
Neg logit contributions
tymology-3.938
ledged-3.907
been-3.826
essor-3.739
iths-3.718
Token I
Token ID314
Feature activation-5.70e+00

Pos logit contributions
ledged+3.421
tymology+3.402
been+3.336
essor+3.254
 origins+3.223
Neg logit contributions
 counterproductive-3.141
 futile-3.080
 undermines-3.000
 urgently-2.911
 unsafe-2.870
Token
Token ID447
Feature activation-1.70e+00

Pos logit contributions
tymology+5.886
ledged+5.802
been+5.776
essor+5.625
 origins+5.547
Neg logit contributions
 counterproductive-4.114
 futile-3.973
 undermines-3.893
 unsafe-3.686
 vulnerable-3.682
Token
Token ID247
Feature activation-1.23e+00

Pos logit contributions
ledged+1.107
tymology+1.078
been+1.054
essor+1.010
 origins+0.988
Neg logit contributions
 counterproductive-1.933
 futile-1.913
 undermines-1.887
 urgently-1.858
 vulnerable-1.845
Tokenve
Token ID303
Feature activation-2.45e+01

Pos logit contributions
tymology+0.979
ledged+0.974
been+0.953
essor+0.918
iths+0.906
Neg logit contributions
 counterproductive-1.237
 futile-1.210
 undermines-1.188
 vulnerable-1.163
 urgently-1.162
Token talked
Token ID6619
Feature activation-6.31e+00

Pos logit contributions
been+21.035
ledged+19.779
 origins+19.213
 been+19.163
 ancestors+18.943
Neg logit contributions
 undermines-13.063
 counterproductive-12.963
 futile-12.391
 hostage-11.995
 threatens-11.885
Token Or
Token ID1471
Feature activation-1.96e+00

Pos logit contributions
 counterproductive+7.775
 futile+7.511
 undermines+7.401
 urgently+7.258
 harder+7.233
Neg logit contributions
tymology-10.750
ledged-10.662
been-10.238
essor-10.180
iths-9.953
Token,
Token ID11
Feature activation-3.63e+00

Pos logit contributions
tymology+2.048
ledged+2.031
been+1.992
iths+1.952
essor+1.951
Neg logit contributions
 counterproductive-1.464
 futile-1.425
 undermines-1.378
 urgently-1.344
 vulnerable-1.342
Token as
Token ID355
Feature activation+4.12e+00

Pos logit contributions
tymology+3.763
ledged+3.699
been+3.647
 origins+3.602
iths+3.592
Neg logit contributions
 counterproductive-2.653
 futile-2.582
 undermines-2.513
 urgently-2.441
 vulnerable-2.438
Token he
Token ID339
Feature activation-5.60e+00

Pos logit contributions
 counterproductive+3.405
 futile+3.324
 undermines+3.216
 vulnerable+3.179
 urgently+3.168
Neg logit contributions
tymology-3.967
ledged-3.963
been-3.880
essor-3.773
iths-3.726
Token once
Token ID1752
Feature activation-8.91e+00

Pos logit contributions
tymology+6.347
ledged+6.161
been+6.012
 origins+5.983
essor+5.932
Neg logit contributions
 counterproductive-3.552
 futile-3.365
 undermines-3.252
 vulnerable-3.196
 urgently-3.162
Token told
Token ID1297
Feature activation-9.17e+00

Pos logit contributions
tymology+8.288
ledged+8.119
been+7.969
iths+7.822
 origins+7.798
Neg logit contributions
 counterproductive-6.791
 futile-6.397
 undermines-6.391
 vulnerable-6.242
 unsafe-6.180
Token an
Token ID281
Feature activation+1.12e+00

Pos logit contributions
tymology+8.617
ledged+8.476
been+8.310
essor+8.297
 origins+8.271
Neg logit contributions
 counterproductive-7.071
 futile-6.833
 undermines-6.736
 urgently-6.484
 accountable-6.443
Token interviewer
Token ID39877
Feature activation-1.46e+00

Pos logit contributions
 counterproductive+0.999
 futile+0.977
 undermines+0.952
 urgently+0.938
 vulnerable+0.937
Neg logit contributions
tymology-1.012
ledged-1.007
been-0.983
essor-0.956
iths-0.951
Token,
Token ID11
Feature activation-6.57e+00

Pos logit contributions
tymology+1.504
ledged+1.492
been+1.463
essor+1.434
iths+1.424
Neg logit contributions
 counterproductive-1.113
 futile-1.081
 undermines-1.051
 vulnerable-1.026
 urgently-1.024
Token
Token ID564
Feature activation-6.31e-01

Pos logit contributions
tymology+6.383
ledged+6.251
been+6.193
 origins+6.124
iths+6.097
Neg logit contributions
 counterproductive-5.036
 futile-4.889
 undermines-4.775
 urgently-4.661
 vulnerable-4.623
Token
Token ID250
Feature activation+3.09e+00

Pos logit contributions
tymology+0.492
ledged+0.490
been+0.482
essor+0.458
iths+0.456
Neg logit contributions
 counterproductive-0.640
 futile-0.629
 undermines-0.615
 urgently-0.609
 vulnerable-0.602
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
tymology+4.008
ledged+3.991
been+3.908
 origins+3.840
iths+3.800
Neg logit contributions
 counterproductive-3.113
 futile-3.019
 undermines-2.904
 vulnerable-2.874
 urgently-2.870
Token For
Token ID1114
Feature activation-5.36e+00

Pos logit contributions
tymology+1.457
ledged+1.453
been+1.435
essor+1.401
 origins+1.399
Neg logit contributions
 counterproductive-1.142
 futile-1.110
 undermines-1.068
 vulnerable-1.051
 urgently-1.044
Token instance
Token ID4554
Feature activation-5.27e+00

Pos logit contributions
tymology+5.157
ledged+5.080
been+4.996
 origins+4.949
iths+4.907
Neg logit contributions
 counterproductive-4.249
 futile-4.090
 undermines-3.991
 unsafe-3.909
 urgently-3.887
Token,
Token ID11
Feature activation-3.85e+00

Pos logit contributions
tymology+5.459
ledged+5.412
been+5.311
essor+5.189
iths+5.174
Neg logit contributions
 counterproductive-3.959
 futile-3.838
 undermines-3.724
 urgently-3.650
 vulnerable-3.645
Token she
Token ID673
Feature activation-5.18e-01

Pos logit contributions
tymology+3.827
ledged+3.783
been+3.721
 origins+3.680
iths+3.639
Neg logit contributions
 counterproductive-2.963
 futile-2.868
 undermines-2.774
 urgently-2.736
 vulnerable-2.720
Token argues
Token ID11673
Feature activation-7.11e+00

Pos logit contributions
tymology+0.542
ledged+0.535
been+0.526
 origins+0.516
essor+0.515
Neg logit contributions
 counterproductive-0.384
 futile-0.371
 undermines-0.354
 vulnerable-0.352
 urgently-0.350
Token that
Token ID326
Feature activation-3.56e+00

Pos logit contributions
tymology+6.655
ledged+6.572
 origins+6.400
been+6.393
iths+6.296
Neg logit contributions
 counterproductive-5.688
 futile-5.481
 undermines-5.345
 vulnerable-5.228
 urgently-5.210
Token Hume
Token ID45485
Feature activation-1.60e+00

Pos logit contributions
tymology+3.666
ledged+3.621
been+3.552
 origins+3.527
iths+3.486
Neg logit contributions
 counterproductive-2.631
 futile-2.549
 undermines-2.473
 urgently-2.428
 vulnerable-2.412
Token's
Token ID338
Feature activation-1.52e+01

Pos logit contributions
tymology+1.842
ledged+1.822
been+1.799
essor+1.760
 origins+1.758
Neg logit contributions
 counterproductive-1.014
 futile-0.974
 undermines-0.928
 vulnerable-0.914
 urgently-0.910
Token instrumental
Token ID21543
Feature activation-1.13e+01

Pos logit contributions
 origins+14.522
tymology+14.266
 ancestors+14.013
been+13.862
 predecessors+13.862
Neg logit contributions
 counterproductive-9.309
 undermines-8.858
 futile-8.833
 urgently-8.793
 threatens-8.577
Tokenism
Token ID1042
Feature activation-3.35e+00

Pos logit contributions
tymology+9.080
 origins+8.767
been+8.633
ledged+8.553
 ancestors+8.407
Neg logit contributions
 counterproductive-9.937
 futile-9.583
 urgently-9.525
 undermines-9.367
 unsafe-9.321
Tokenter
Token ID353
Feature activation+1.48e+00

Pos logit contributions
tymology+1.269
ledged+1.254
been+1.225
essor+1.158
iths+1.143
Neg logit contributions
 counterproductive-2.868
 futile-2.816
 undermines-2.782
 vulnerable-2.715
 urgently-2.715
Token.
Token ID13
Feature activation-3.17e+00

Pos logit contributions
 counterproductive+1.140
 futile+1.110
 undermines+1.077
 urgently+1.056
 vulnerable+1.055
Neg logit contributions
tymology-1.525
ledged-1.514
been-1.484
essor-1.450
iths-1.442
Token You
Token ID921
Feature activation+1.34e+00

Pos logit contributions
tymology+3.318
been+3.306
ledged+3.301
essor+3.204
iths+3.189
Neg logit contributions
 counterproductive-2.303
 futile-2.225
 undermines-2.163
 vulnerable-2.084
 urgently-2.083
Token DO
Token ID8410
Feature activation-6.10e+00

Pos logit contributions
 counterproductive+0.668
 futile+0.637
 undermines+0.611
 vulnerable+0.586
 urgently+0.583
Neg logit contributions
tymology-1.740
ledged-1.727
been-1.706
essor-1.671
iths-1.664
Token NOT
Token ID5626
Feature activation+4.03e-01

Pos logit contributions
tymology+5.552
ledged+5.481
been+5.420
essor+5.323
 origins+5.265
Neg logit contributions
 counterproductive-5.087
 undermines-4.911
 futile-4.868
 vulnerable-4.608
 threatens-4.608
Token need
Token ID761
Feature activation-1.38e+01

Pos logit contributions
 counterproductive+0.329
 futile+0.318
 undermines+0.313
 vulnerable+0.302
 urgently+0.302
Neg logit contributions
tymology-0.395
ledged-0.391
been-0.384
essor-0.374
 origins-0.372
Token to
Token ID284
Feature activation-4.03e+00

Pos logit contributions
ledged+11.377
been+11.227
tymology+11.175
iths+11.045
essor+10.841
Neg logit contributions
 counterproductive-10.988
 futile-10.491
 undermines-10.441
 threatens-9.958
 vulnerable-9.913
Token remove
Token ID4781
Feature activation-2.98e-01

Pos logit contributions
tymology+4.478
been+4.396
ledged+4.390
 origins+4.297
essor+4.275
Neg logit contributions
 counterproductive-2.603
 futile-2.420
 undermines-2.396
 vulnerable-2.259
 threatens-2.253
Token the
Token ID262
Feature activation-2.01e+00

Pos logit contributions
tymology+0.321
ledged+0.320
been+0.314
essor+0.307
iths+0.305
Neg logit contributions
 counterproductive-0.213
 futile-0.207
 undermines-0.201
 urgently-0.195
 vulnerable-0.195
Token four
Token ID1440
Feature activation-4.73e+00

Pos logit contributions
tymology+2.023
ledged+2.019
been+1.994
 origins+1.941
essor+1.937
Neg logit contributions
 counterproductive-1.538
 futile-1.490
 undermines-1.461
 urgently-1.413
 vulnerable-1.396
Token 10
Token ID838
Feature activation-3.88e+00

Pos logit contributions
tymology+4.674
ledged+4.667
been+4.637
 origins+4.547
essor+4.507
Neg logit contributions
 counterproductive-3.578
 futile-3.432
 undermines-3.390
 urgently-3.257
 threatens-3.192
Token is
Token ID318
Feature activation-9.49e+00

Pos logit contributions
tymology+10.360
ledged+10.081
 origins+10.067
been+10.051
 ancestors+9.939
Neg logit contributions
 counterproductive-5.333
 futile-5.017
 undermines-4.628
 unsafe-4.616
 vulnerable-4.550
Token the
Token ID262
Feature activation-8.01e+00

Pos logit contributions
ledged+9.385
tymology+9.355
been+9.202
 origins+9.081
iths+8.933
Neg logit contributions
 counterproductive-6.825
 futile-6.485
 undermines-6.459
 unsafe-6.153
 urgently-6.119
Token eastern
Token ID10183
Feature activation-8.28e+00

Pos logit contributions
tymology+8.142
been+8.000
ledged+7.989
 origins+7.947
 ancestors+7.807
Neg logit contributions
 counterproductive-5.642
 undermines-5.423
 futile-5.338
 urgently-5.123
 threatens-5.059
Token short
Token ID1790
Feature activation-1.41e+01

Pos logit contributions
tymology+8.337
 origins+8.206
been+8.172
 ancestors+8.042
ledged+8.022
Neg logit contributions
 counterproductive-5.848
 undermines-5.556
 futile-5.529
 urgently-5.303
 unsafe-5.241
Token-
Token ID12
Feature activation-1.76e+01

Pos logit contributions
tymology+12.035
ledged+11.647
been+11.624
 origins+11.409
essor+11.254
Neg logit contributions
 counterproductive-11.433
 futile-10.827
 undermines-10.568
 urgently-10.473
 unsafe-10.403
Tokeneared
Token ID3380
Feature activation-5.92e+00

Pos logit contributions
been+12.186
tymology+12.153
ledged+11.904
essor+11.174
iths+10.854
Neg logit contributions
 counterproductive-16.543
 futile-15.803
 undermines-15.388
 unsafe-15.245
 urgently-15.098
Token rock
Token ID3881
Feature activation-6.99e+00

Pos logit contributions
tymology+6.163
ledged+6.041
been+5.988
 origins+5.913
essor+5.830
Neg logit contributions
 counterproductive-4.272
 futile-4.075
 undermines-4.003
 urgently-3.908
 unsafe-3.841
Token-
Token ID12
Feature activation-6.92e+00

Pos logit contributions
tymology+6.724
ledged+6.604
been+6.538
essor+6.361
 origins+6.342
Neg logit contributions
 counterproductive-5.616
 futile-5.415
 undermines-5.270
 urgently-5.165
 vulnerable-5.098
Tokenwall
Token ID11930
Feature activation-7.73e+00

Pos logit contributions
tymology+5.622
been+5.592
ledged+5.525
essor+5.206
iths+5.157
Neg logit contributions
 counterproductive-6.568
 futile-6.384
 undermines-6.248
 urgently-6.069
 unsafe-6.052
Tokenaby
Token ID3930
Feature activation-7.16e+00

Pos logit contributions
been+6.626
tymology+6.521
ledged+6.376
iths+6.113
essor+6.063
Neg logit contributions
 counterproductive-7.024
 futile-6.734
 undermines-6.576
 unsafe-6.382
 urgently-6.351
Token.
Token ID13
Feature activation-7.04e+00

Pos logit contributions
tymology+6.706
ledged+6.553
been+6.447
 origins+6.418
essor+6.306
Neg logit contributions
 counterproductive-5.893
 futile-5.640
 undermines-5.503
 urgently-5.342
 unsafe-5.326
Token in
Token ID287
Feature activation-2.67e+00

Pos logit contributions
tymology+1.977
ledged+1.977
been+1.934
essor+1.885
 origins+1.884
Neg logit contributions
 counterproductive-1.572
 futile-1.524
 undermines-1.481
 urgently-1.437
 vulnerable-1.436
Token the
Token ID262
Feature activation-3.65e+00

Pos logit contributions
tymology+2.718
ledged+2.683
been+2.649
 origins+2.598
essor+2.584
Neg logit contributions
 counterproductive-2.031
 futile-1.972
 undermines-1.936
 urgently-1.865
 vulnerable-1.846
Token Mil
Token ID4460
Feature activation-5.62e+00

Pos logit contributions
ledged+3.620
tymology+3.617
been+3.584
 origins+3.504
essor+3.493
Neg logit contributions
 counterproductive-2.806
 futile-2.703
 undermines-2.678
 urgently-2.571
 threatens-2.547
Tokenburn
Token ID10899
Feature activation-4.39e+00

Pos logit contributions
tymology+5.141
ledged+5.137
been+5.097
essor+4.888
iths+4.855
Neg logit contributions
 counterproductive-4.891
 futile-4.728
 undermines-4.570
 urgently-4.511
 vulnerable-4.482
Token Stand
Token ID5751
Feature activation-1.07e+01

Pos logit contributions
tymology+4.354
ledged+4.310
been+4.263
 origins+4.167
essor+4.128
Neg logit contributions
 counterproductive-3.452
 futile-3.335
 undermines-3.256
 urgently-3.157
 vulnerable-3.144
Token).
Token ID737
Feature activation-5.30e+00

Pos logit contributions
ledged+10.169
tymology+10.099
been+10.051
 origins+10.015
essor+9.796
Neg logit contributions
 counterproductive-7.654
 futile-7.307
 undermines-7.186
 vulnerable-6.851
 threatens-6.784
Token
Token ID198
Feature activation-2.44e+00

Pos logit contributions
ledged+5.123
been+5.086
tymology+5.048
 origins+4.937
iths+4.935
Neg logit contributions
 counterproductive-4.206
 futile-4.084
 undermines-3.935
 vulnerable-3.796
 urgently-3.794
Token
Token ID198
Feature activation+4.28e-01

Pos logit contributions
been+2.232
ledged+2.214
tymology+2.167
 origins+2.132
iths+2.116
Neg logit contributions
 counterproductive-2.137
 futile-2.081
 undermines-2.033
 vulnerable-1.940
 threatens-1.918
TokenThey
Token ID2990
Feature activation+1.59e+00

Pos logit contributions
 counterproductive+0.313
 futile+0.304
 undermines+0.295
 vulnerable+0.288
 urgently+0.288
Neg logit contributions
tymology-0.457
ledged-0.455
been-0.447
essor-0.436
iths-0.434
Token will
Token ID481
Feature activation-1.03e+01

Pos logit contributions
 counterproductive+0.667
 futile+0.635
 undermines+0.600
 urgently+0.577
 vulnerable+0.576
Neg logit contributions
tymology-2.187
ledged-2.176
been-2.143
essor-2.107
iths-2.098
Token then
Token ID788
Feature activation-3.65e+00

Pos logit contributions
tymology+8.558
ledged+8.355
been+8.286
iths+8.106
 origins+8.038
Neg logit contributions
 counterproductive-8.984
 undermines-8.414
 futile-8.278
 threatens-7.997
 unsafe-7.877
Token loving
Token ID14442
Feature activation+1.72e+00

Pos logit contributions
 counterproductive+7.186
 futile+7.089
 vulnerable+6.927
 harder+6.744
 undermines+6.738
Neg logit contributions
tymology-8.413
ledged-8.336
been-8.055
essor-7.974
iths-7.973
Token long
Token ID890
Feature activation-9.94e+00

Pos logit contributions
 counterproductive+1.807
 futile+1.771
 undermines+1.734
 urgently+1.708
 vulnerable+1.707
Neg logit contributions
tymology-1.288
ledged-1.276
been-1.241
essor-1.201
iths-1.191
Token-
Token ID12
Feature activation-9.66e+00

Pos logit contributions
been+8.833
tymology+8.767
ledged+8.697
essor+8.465
iths+8.349
Neg logit contributions
 counterproductive-8.002
 undermines-7.553
 futile-7.543
 urgently-7.440
 threatens-7.242
Tokenterm
Token ID4354
Feature activation-1.33e+00

Pos logit contributions
been+5.485
tymology+5.272
ledged+5.195
essor+4.895
iths+4.817
Neg logit contributions
 counterproductive-11.354
 futile-10.980
 undermines-10.915
 urgently-10.766
 harder-10.565
Token relationship
Token ID2776
Feature activation+6.50e-01

Pos logit contributions
tymology+0.826
ledged+0.816
been+0.804
 origins+0.759
essor+0.758
Neg logit contributions
 counterproductive-1.546
 futile-1.514
 undermines-1.484
 urgently-1.472
 vulnerable-1.457
Token has
Token ID468
Feature activation-2.26e+01

Pos logit contributions
 counterproductive+0.363
 futile+0.348
 undermines+0.330
 vulnerable+0.322
 urgently+0.322
Neg logit contributions
tymology-0.804
ledged-0.797
been-0.787
essor-0.771
 origins-0.769
Token reached
Token ID4251
Feature activation+1.50e+00

Pos logit contributions
been+21.028
 origins+20.625
tymology+20.329
iths+20.303
ledged+19.898
Neg logit contributions
 counterproductive-11.509
 futile-11.243
 undermines-11.196
 accountable-10.390
 threatens-10.383
Token that
Token ID326
Feature activation+3.55e+00

Pos logit contributions
 counterproductive+1.189
 futile+1.158
 undermines+1.124
 vulnerable+1.104
 urgently+1.103
Neg logit contributions
tymology-1.500
ledged-1.491
been-1.458
essor-1.425
iths-1.415
Token point
Token ID966
Feature activation-6.05e+00

Pos logit contributions
 counterproductive+3.021
 futile+2.952
 undermines+2.863
 vulnerable+2.829
 urgently+2.819
Neg logit contributions
tymology-3.359
ledged-3.344
been-3.257
essor-3.181
iths-3.163
Token where
Token ID810
Feature activation-2.54e+00

Pos logit contributions
tymology+6.089
ledged+6.011
been+5.933
 origins+5.783
essor+5.751
Neg logit contributions
 counterproductive-4.527
 futile-4.390
 undermines-4.261
 urgently-4.064
 vulnerable-4.064
Token,
Token ID11
Feature activation-1.24e+00

Pos logit contributions
tymology+2.515
ledged+2.482
been+2.445
 origins+2.387
iths+2.383
Neg logit contributions
 counterproductive-2.016
 futile-1.956
 undermines-1.907
 urgently-1.859
 vulnerable-1.844
Token Week
Token ID6119
Feature activation-2.43e+00

Pos logit contributions
 counterproductive+0.880
 futile+0.840
 undermines+0.804
 urgently+0.774
 vulnerable+0.768
Neg logit contributions
tymology-2.518
ledged-2.506
been-2.471
essor-2.424
iths-2.413
Token 13
Token ID1511
Feature activation-8.55e-01

Pos logit contributions
ledged+2.051
been+2.023
tymology+2.010
 origins+1.967
essor+1.937
Neg logit contributions
 counterproductive-2.268
 futile-2.203
 undermines-2.169
 urgently-2.107
 vulnerable-2.092
Token.
Token ID13
Feature activation-5.56e-01

Pos logit contributions
tymology+0.836
ledged+0.830
been+0.815
essor+0.792
iths+0.788
Neg logit contributions
 counterproductive-0.697
 futile-0.678
 undermines-0.660
 urgently-0.648
 vulnerable-0.644
Token Ryan
Token ID6047
Feature activation+6.68e+00

Pos logit contributions
ledged+0.563
been+0.559
tymology+0.558
essor+0.542
 origins+0.537
Neg logit contributions
 counterproductive-0.429
 futile-0.418
 undermines-0.405
 urgently-0.396
 vulnerable-0.394
Token also
Token ID635
Feature activation-6.66e+00

Pos logit contributions
 counterproductive+4.683
 futile+4.592
 undermines+4.461
 vulnerable+4.375
 urgently+4.367
Neg logit contributions
tymology-7.219
ledged-7.186
been-6.946
essor-6.877
iths-6.825
Token has
Token ID468
Feature activation-2.66e+01

Pos logit contributions
been+6.846
ledged+6.773
tymology+6.752
 origins+6.644
 ancestors+6.535
Neg logit contributions
 counterproductive-4.577
 futile-4.320
 undermines-4.149
 urgently-4.025
 vulnerable-4.012
Token scored
Token ID7781
Feature activation-2.47e+00

Pos logit contributions
been+22.612
 origins+21.366
stood+21.072
 ancestors+20.914
 been+20.686
Neg logit contributions
 counterproductive-12.998
 undermines-12.962
 futile-12.403
 urgently-12.034
 hostage-11.676
Token 20
Token ID1160
Feature activation+1.43e+00

Pos logit contributions
tymology+2.444
ledged+2.439
been+2.420
 origins+2.372
essor+2.362
Neg logit contributions
 counterproductive-1.897
 futile-1.838
 undermines-1.805
 urgently-1.748
 vulnerable-1.729
Token Fantasy
Token ID10982
Feature activation+2.88e+00

Pos logit contributions
 counterproductive+1.095
 futile+1.072
 undermines+1.038
 vulnerable+1.019
 urgently+1.016
Neg logit contributions
tymology-1.473
ledged-1.462
been-1.431
essor-1.393
iths-1.389
Token points
Token ID2173
Feature activation+4.01e+00

Pos logit contributions
 counterproductive+2.536
 futile+2.474
 undermines+2.414
 urgently+2.368
 vulnerable+2.364
Neg logit contributions
tymology-2.638
ledged-2.620
been-2.563
essor-2.498
iths-2.485
Token outdoors
Token ID24349
Feature activation-5.81e+00

Pos logit contributions
 counterproductive+2.546
 futile+2.489
 undermines+2.380
 vulnerable+2.342
 urgently+2.330
Neg logit contributions
tymology-4.673
ledged-4.626
been-4.529
essor-4.435
iths-4.423
Token and
Token ID290
Feature activation-4.87e+00

Pos logit contributions
tymology+4.854
ledged+4.761
been+4.714
 origins+4.665
essor+4.596
Neg logit contributions
 counterproductive-3.153
 futile-3.068
 undermines-2.869
 urgently-2.853
 vulnerable-2.808
Token territories
Token ID16771
Feature activation-3.41e+00

Pos logit contributions
tymology+3.118
ledged+3.036
 origins+2.939
been+2.913
essor+2.815
Neg logit contributions
 counterproductive-5.521
 futile-5.403
 undermines-5.265
 urgently-5.206
 vulnerable-5.147
Token
Token ID198
Feature activation-7.91e+00

Pos logit contributions
tymology+3.598
ledged+3.540
been+3.495
 origins+3.449
essor+3.416
Neg logit contributions
 counterproductive-2.487
 futile-2.426
 undermines-2.284
 urgently-2.259
 vulnerable-2.238
Token
Token ID198
Feature activation-2.48e+00

Pos logit contributions
been+6.202
ledged+5.924
 origins+5.843
tymology+5.787
iths+5.730
Neg logit contributions
 counterproductive-7.500
 futile-7.272
 undermines-7.176
 tightening-6.842
 vulnerable-6.751
TokenAustralia
Token ID27429
Feature activation+5.38e+00

Pos logit contributions
tymology+2.641
ledged+2.610
been+2.577
essor+2.497
iths+2.487
Neg logit contributions
 counterproductive-1.825
 futile-1.776
 undermines-1.716
 urgently-1.677
 vulnerable-1.672
Token has
Token ID468
Feature activation-2.69e+01

Pos logit contributions
 counterproductive+3.262
 futile+3.193
 undermines+3.167
 urgently+3.116
 vulnerable+3.079
Neg logit contributions
ledged-6.191
tymology-6.120
been-6.062
iths-5.928
essor-5.919
Token six
Token ID2237
Feature activation-1.04e+01

Pos logit contributions
 origins+22.141
been+21.573
 ancestors+20.894
ledged+20.818
tymology+20.780
Neg logit contributions
 counterproductive-14.374
 futile-14.083
 undermines-13.982
 distur-13.042
 hostage-13.018
Token states
Token ID2585
Feature activation-8.25e+00

Pos logit contributions
ledged+9.993
 origins+9.991
tymology+9.985
been+9.655
 ancestors+9.621
Neg logit contributions
 counterproductive-7.401
 futile-7.147
 undermines-7.037
 urgently-6.707
 threatens-6.658
Token
Token ID851
Feature activation-9.04e+00

Pos logit contributions
tymology+8.247
ledged+8.232
been+8.031
 origins+7.935
essor+7.784
Neg logit contributions
 counterproductive-6.097
 futile-5.920
 undermines-5.717
 urgently-5.455
 unsafe-5.398
Token New
Token ID968
Feature activation-3.41e+00

Pos logit contributions
tymology+8.960
ledged+8.922
 origins+8.757
been+8.745
 ancestors+8.539
Neg logit contributions
 counterproductive-6.404
 futile-6.086
 undermines-5.980
 urgently-5.700
 unsafe-5.660
Token South
Token ID2520
Feature activation-6.03e+00

Pos logit contributions
tymology+3.997
ledged+3.985
been+3.922
essor+3.835
 origins+3.833
Neg logit contributions
 counterproductive-2.120
 futile-2.018
 undermines-1.960
 urgently-1.906
 vulnerable-1.856
Token the
Token ID262
Feature activation-4.37e+00

Pos logit contributions
 counterproductive+3.362
 futile+3.255
 undermines+3.103
 vulnerable+3.089
 harder+3.048
Neg logit contributions
tymology-5.479
ledged-5.410
been-5.321
essor-5.235
iths-5.207
Token opposite
Token ID6697
Feature activation-6.54e+00

Pos logit contributions
tymology+3.976
ledged+3.933
been+3.871
 origins+3.819
essor+3.763
Neg logit contributions
 counterproductive-3.690
 futile-3.609
 undermines-3.592
 urgently-3.491
 threatens-3.444
Token of
Token ID286
Feature activation-2.29e-01

Pos logit contributions
tymology+6.021
ledged+5.902
been+5.783
essor+5.660
 origins+5.657
Neg logit contributions
 counterproductive-5.548
 futile-5.407
 undermines-5.305
 urgently-5.213
 vulnerable-5.196
Token what
Token ID644
Feature activation+9.03e-01

Pos logit contributions
tymology+0.252
ledged+0.251
been+0.246
essor+0.240
 origins+0.239
Neg logit contributions
 counterproductive-0.158
 futile-0.154
 undermines-0.149
 urgently-0.146
 vulnerable-0.145
Token we
Token ID356
Feature activation-5.31e+00

Pos logit contributions
 counterproductive+0.640
 futile+0.622
 undermines+0.602
 urgently+0.589
 vulnerable+0.588
Neg logit contributions
tymology-0.983
ledged-0.977
been-0.958
essor-0.937
iths-0.932
Token have
Token ID423
Feature activation-2.74e+01

Pos logit contributions
tymology+5.480
ledged+5.420
been+5.374
essor+5.210
 origins+5.158
Neg logit contributions
 counterproductive-3.832
 futile-3.687
 undermines-3.671
 vulnerable-3.484
 urgently-3.455
Token seen
Token ID1775
Feature activation-7.65e+00

Pos logit contributions
been+22.128
ledged+20.930
 ancestors+20.770
 predecessors+20.267
iths+20.253
Neg logit contributions
 counterproductive-14.506
 undermines-14.431
 futile-13.548
 threatens-13.014
 hostage-12.739
Token when
Token ID618
Feature activation-4.07e+00

Pos logit contributions
tymology+8.103
ledged+8.096
been+7.969
essor+7.761
 origins+7.749
Neg logit contributions
 counterproductive-5.060
 futile-4.883
 undermines-4.810
 urgently-4.577
 threatens-4.505
Token we
Token ID356
Feature activation-3.70e+00

Pos logit contributions
tymology+4.219
ledged+4.182
been+4.125
essor+4.049
 origins+4.042
Neg logit contributions
 counterproductive-2.955
 futile-2.864
 undermines-2.809
 urgently-2.719
 vulnerable-2.692
Token unp
Token ID8593
Feature activation-3.53e+00

Pos logit contributions
tymology+3.584
ledged+3.542
been+3.482
essor+3.408
 origins+3.359
Neg logit contributions
 counterproductive-2.995
 futile-2.892
 undermines-2.852
 urgently-2.739
 vulnerable-2.739
Tokenacked
Token ID6021
Feature activation-6.39e-01

Pos logit contributions
ledged+2.712
tymology+2.677
been+2.595
essor+2.551
iths+2.505
Neg logit contributions
 counterproductive-3.642
 futile-3.562
 undermines-3.485
 urgently-3.421
 vulnerable-3.406
Token into
Token ID656
Feature activation+1.95e+00

Pos logit contributions
 counterproductive+7.790
 futile+7.555
 undermines+7.390
 harder+7.335
 vulnerable+7.270
Neg logit contributions
tymology-11.332
ledged-11.033
essor-10.694
been-10.640
iths-10.340
Token launch
Token ID4219
Feature activation+3.30e+00

Pos logit contributions
 counterproductive+1.350
 futile+1.313
 undermines+1.266
 vulnerable+1.240
 urgently+1.240
Neg logit contributions
tymology-2.164
ledged-2.148
been-2.108
essor-2.063
iths-2.056
Token.
Token ID13
Feature activation+2.77e+00

Pos logit contributions
 counterproductive+2.561
 futile+2.510
 undermines+2.426
 urgently+2.384
 vulnerable+2.384
Neg logit contributions
tymology-3.378
ledged-3.346
been-3.278
essor-3.216
iths-3.201
Token The
Token ID383
Feature activation+2.35e+00

Pos logit contributions
 counterproductive+2.031
 futile+1.976
 undermines+1.917
 urgently+1.878
 vulnerable+1.875
Neg logit contributions
tymology-2.956
ledged-2.932
been-2.872
essor-2.810
iths-2.798
Token rocket
Token ID10701
Feature activation+7.84e+00

Pos logit contributions
 counterproductive+1.890
 futile+1.843
 undermines+1.790
 urgently+1.756
 vulnerable+1.756
Neg logit contributions
tymology-2.329
ledged-2.312
been-2.263
essor-2.209
iths-2.198
Token had
Token ID550
Feature activation-2.41e+01

Pos logit contributions
 counterproductive+6.050
 futile+5.949
 undermines+5.812
 urgently+5.785
 vulnerable+5.759
Neg logit contributions
tymology-7.768
ledged-7.704
been-7.525
essor-7.353
iths-7.349
Token to
Token ID284
Feature activation-6.11e-01

Pos logit contributions
been+21.481
 origins+21.266
iths+20.395
ledged+20.293
 ancestors+20.259
Neg logit contributions
 counterproductive-11.776
 undermines-11.717
 futile-10.954
 hostage-10.810
 Everywhere-10.772
Token adjust
Token ID4532
Feature activation+1.49e+00

Pos logit contributions
tymology+0.556
ledged+0.554
been+0.545
essor+0.527
 origins+0.525
Neg logit contributions
 counterproductive-0.538
 futile-0.520
 undermines-0.512
 vulnerable-0.497
 urgently-0.494
Token its
Token ID663
Feature activation-2.82e+00

Pos logit contributions
 counterproductive+1.159
 futile+1.131
 undermines+1.095
 urgently+1.080
 vulnerable+1.078
Neg logit contributions
tymology-1.522
ledged-1.509
been-1.477
essor-1.444
iths-1.435
Token trajectory
Token ID22942
Feature activation+5.10e+00

Pos logit contributions
tymology+2.647
ledged+2.624
been+2.587
 origins+2.555
essor+2.501
Neg logit contributions
 counterproductive-2.340
 futile-2.275
 undermines-2.260
 urgently-2.179
 vulnerable-2.153
Token to
Token ID284
Feature activation+8.02e+00

Pos logit contributions
 counterproductive+3.825
 futile+3.758
 undermines+3.611
 urgently+3.606
 vulnerable+3.581
Neg logit contributions
tymology-5.354
ledged-5.270
been-5.155
essor-5.081
iths-5.038