TOP ACTIVATIONS
MAX = 8.578

10 metre setback along Y onge Street and a 5 metre
. The following chart shows projected costs for power
, the intersection of Y onge and E gl inton will
ob b . sh orter Ax is , space ); //
Joe V itt Aff idav it Scott ie
man , convicted of veh icular homicide for killing two Federal
the Football This slideshow requires JavaScript .
and the Paradox of Credit Bub bles , Equity and Demand
of inquiry to investigate this mish ap . The March launch
, a fluid with some uniform ambient density ( as the
he 'd be a hockey dad when his kids were toddlers
Sorry , this video has expired Video : Mary
A erial view of Y onge and E gl inton ,
whatever he has to in order to achieve his aims ,
distance from 2 221 Y onge Cond os to the south
than ever . The below chart shows the distribution in the
teens , is charged for driving a vehicle without an inter
. chest o @ gl obe . com . Follow him
the 2011 Disney D 23 Expo to honor Jim H enson
argument offer , partly , too , because frequent repetition brought

MIN ACTIVATIONS
MIN = -7.555

pharmac ological profile of N - cyclop rop yl methyl -
week participated in a Q & A session refer to
. <|endoftext|> Abstract Background Depression presents a significant burden
a start . <|endoftext|> Abstract Background Depression presents a
start . <|endoftext|> Abstract Background Depression presents a significant
. Materials and Methods Experimental subjects Three female r
and sham ta V NS groups at week 4 ; (
active CR F 1 receptor antagonist . Oral administration of a
in 33 cities around the world . ( Wa ub ges
4 64 406 Mem c py Reading 55 80 953 16
500 million members around the world , has relied on its
s a start . <|endoftext|> Abstract Background Depression presents
switching from a three - man defence to a back four
possibly the greatest Spanish - language writer of the 20 th
lla - san collapsed ?」 " Yes . At
N - cyclop rop yl methyl - 7 -( 2 ,
8 ( 3 or 10 mg / kg ), but not
it means lots of attention coming its way . According to
san collapsed ?」 " Yes . At the moment
been the most sought - after feature for me since the

INTERVAL 4.545 - 8.578
CONTAINS 0.159%

riders were routinely delayed by private automobile drivers blocking boarding islands
display . Sl ov akia s lone
this post I 'll round up some got cha 's I
100 passengers injured in the mish ap have settled compensation claims
ering . The flyer shows a newspaper headline taken

INTERVAL 0.512 - 4.545
CONTAINS 34.766%

the situations might very well be reversed . It is
have been diagnosed with post - traumatic stress disorder and ,
s also followed Col fax , the longest bou levard in
city governments . Paris , for example , made headlines in
, developer of Ghost Song . " True discovery is only

INTERVAL -3.522 - 0.512
CONTAINS 64.209%

. In the scene , Row lf and Ker mit sing
you ve helped create . The Unt
supporters and the Club . " The Trust is
are different here , the approach feels worn out ... I
s Parliament to change the 1978 Spanish Constitution .

INTERVAL -7.555 - -3.522
CONTAINS 0.866%

across all women was the vaginal microbiome . Sur
. She also expresses admiration for her mother
to save her son to ber ating herself for letting this
When we get the down
goal , playing in the hitherto disappointing C atter m ole
Token 10
Token ID838
Feature activation-1.39e-01

Pos logit contributions
 tradem+0.711
 undermin+0.546
construct+0.520
cler+0.518
urers+0.517
Neg logit contributions
theless-0.724
-0.634
 DAC-0.588
 DOI-0.579
-0.561
Token metre
Token ID42760
Feature activation-3.03e-01

Pos logit contributions
theless+0.105
+0.089
 DAC+0.081
 DOI+0.080
+0.076
Neg logit contributions
 tradem-0.143
 undermin-0.114
construct-0.109
urers-0.109
cler-0.109
Token setback
Token ID34116
Feature activation+1.70e+00

Pos logit contributions
theless+0.268
+0.234
 DAC+0.217
 DOI+0.214
+0.206
Neg logit contributions
 tradem-0.269
 undermin-0.206
construct-0.197
urers-0.196
cler-0.196
Token along
Token ID1863
Feature activation+2.53e+00

Pos logit contributions
 tradem+1.583
 undermin+1.237
construct+1.184
cler+1.183
urers+1.177
Neg logit contributions
theless-1.420
-1.210
 DAC-1.127
 DOI-1.110
-1.064
Token Y
Token ID575
Feature activation+3.95e+00

Pos logit contributions
 tradem+2.263
 undermin+1.731
construct+1.655
cler+1.650
urers+1.648
Neg logit contributions
theless-2.211
-1.921
 DAC-1.785
 DOI-1.756
-1.701
Tokenonge
Token ID14220
Feature activation+8.58e+00

Pos logit contributions
 tradem+2.152
 undermin+1.290
construct+1.258
cler+1.253
urers+1.235
Neg logit contributions
theless-4.824
-4.344
 DAC-4.142
 DOI-4.100
-4.002
Token Street
Token ID3530
Feature activation-1.21e+00

Pos logit contributions
 tradem+7.520
 undermin+5.709
 spoilers+5.606
construct+5.542
cler+5.492
Neg logit contributions
theless-7.175
-5.900
 DAC-5.661
 DOI-5.570
-5.562
Token and
Token ID290
Feature activation+1.81e+00

Pos logit contributions
theless+0.956
+0.826
 DAC+0.757
 DOI+0.745
+0.710
Neg logit contributions
 tradem-1.183
 undermin-0.933
urers-0.895
construct-0.895
cler-0.891
Token a
Token ID257
Feature activation-7.69e-01

Pos logit contributions
 tradem+1.589
 undermin+1.222
construct+1.166
cler+1.154
urers+1.153
Neg logit contributions
theless-1.600
-1.395
 DAC-1.294
 DOI-1.276
-1.234
Token 5
Token ID642
Feature activation-4.97e-02

Pos logit contributions
theless+0.670
+0.584
 DAC+0.542
 DOI+0.534
+0.512
Neg logit contributions
 tradem-0.694
 undermin-0.535
construct-0.510
urers-0.509
cler-0.507
Token metre
Token ID42760
Feature activation-1.21e-01

Pos logit contributions
theless+0.043
+0.037
 DAC+0.034
 DOI+0.034
+0.032
Neg logit contributions
 tradem-0.045
 undermin-0.035
construct-0.034
urers-0.034
cler-0.033
Token.
Token ID13
Feature activation-7.47e-02

Pos logit contributions
 tradem+1.167
 undermin+0.916
urers+0.884
construct+0.882
cler+0.877
Neg logit contributions
theless-0.977
-0.823
 DAC-0.767
 DOI-0.752
-0.725
Token
Token ID198
Feature activation+5.16e-02

Pos logit contributions
theless+0.060
+0.051
 DAC+0.047
 DOI+0.046
+0.045
Neg logit contributions
 tradem-0.071
 undermin-0.055
construct-0.054
urers-0.054
cler-0.054
Token
Token ID198
Feature activation+3.19e-02

Pos logit contributions
 tradem+0.046
construct+0.037
cler+0.037
urers+0.036
 spoilers+0.036
Neg logit contributions
theless-0.043
-0.036
 DAC-0.034
 DOI-0.033
-0.031
TokenThe
Token ID464
Feature activation+1.44e-01

Pos logit contributions
 tradem+0.029
 undermin+0.023
construct+0.022
cler+0.022
urers+0.022
Neg logit contributions
theless-0.027
-0.023
 DAC-0.021
 DOI-0.021
-0.020
Token following
Token ID1708
Feature activation+2.79e+00

Pos logit contributions
 tradem+0.141
 undermin+0.111
urers+0.107
construct+0.107
cler+0.106
Neg logit contributions
theless-0.113
-0.097
 DAC-0.089
 DOI-0.087
-0.085
Token chart
Token ID8262
Feature activation+8.06e+00

Pos logit contributions
 tradem+2.284
 undermin+1.722
urers+1.647
construct+1.644
cler+1.635
Neg logit contributions
theless-2.606
-2.287
 DAC-2.114
 DOI-2.083
-2.057
Token shows
Token ID2523
Feature activation+4.36e+00

Pos logit contributions
 tradem+5.642
 spoilers+4.258
construct+4.235
 undermin+4.192
 overs+4.190
Neg logit contributions
theless-7.600
-6.509
 DAC-6.386
-6.193
 DOI-6.180
Token projected
Token ID13301
Feature activation+7.78e-01

Pos logit contributions
 tradem+3.755
 undermin+2.891
construct+2.773
urers+2.769
 spoilers+2.746
Neg logit contributions
theless-3.826
-3.281
 DAC-3.042
 DOI-2.990
-2.958
Token costs
Token ID3484
Feature activation+1.00e+00

Pos logit contributions
 tradem+0.725
 undermin+0.567
construct+0.544
urers+0.543
cler+0.540
Neg logit contributions
theless-0.647
-0.557
 DAC-0.514
 DOI-0.506
-0.489
Token for
Token ID329
Feature activation+1.37e+00

Pos logit contributions
 tradem+0.936
 undermin+0.732
urers+0.703
construct+0.701
cler+0.701
Neg logit contributions
theless-0.831
-0.708
 DAC-0.659
 DOI-0.648
-0.623
Token power
Token ID1176
Feature activation+1.49e+00

Pos logit contributions
 tradem+1.212
 undermin+0.937
construct+0.898
urers+0.891
cler+0.885
Neg logit contributions
theless-1.194
-1.033
 DAC-0.957
 DOI-0.942
-0.918
Token,
Token ID11
Feature activation-1.61e-01

Pos logit contributions
 tradem+3.032
 undermin+2.366
construct+2.305
cler+2.295
urers+2.280
Neg logit contributions
theless-2.478
-2.072
 DAC-1.939
 DOI-1.895
-1.842
Token the
Token ID262
Feature activation+8.68e-01

Pos logit contributions
theless+0.138
+0.120
 DAC+0.111
 DOI+0.109
+0.105
Neg logit contributions
 tradem-0.147
 undermin-0.114
construct-0.109
urers-0.109
cler-0.108
Token intersection
Token ID16246
Feature activation+1.35e+00

Pos logit contributions
 tradem+0.784
 undermin+0.602
construct+0.577
urers+0.573
cler+0.573
Neg logit contributions
theless-0.754
-0.657
 DAC-0.606
 DOI-0.598
-0.578
Token of
Token ID286
Feature activation+3.07e+00

Pos logit contributions
 tradem+1.269
 undermin+0.992
construct+0.955
cler+0.951
urers+0.949
Neg logit contributions
theless-1.106
-0.950
 DAC-0.880
 DOI-0.865
-0.832
Token Y
Token ID575
Feature activation+3.25e+00

Pos logit contributions
 tradem+2.764
 undermin+2.109
construct+2.024
cler+2.020
urers+2.008
Neg logit contributions
theless-2.654
-2.317
 DAC-2.123
 DOI-2.099
-2.048
Tokenonge
Token ID14220
Feature activation+7.80e+00

Pos logit contributions
 tradem+2.318
 undermin+1.640
construct+1.560
cler+1.550
urers+1.548
Neg logit contributions
theless-3.427
-3.057
 DAC-2.885
 DOI-2.854
-2.773
Token and
Token ID290
Feature activation+4.12e+00

Pos logit contributions
 tradem+6.868
 undermin+5.185
construct+5.071
 spoilers+5.038
cler+4.982
Neg logit contributions
theless-6.583
-5.477
 DAC-5.224
 DOI-5.162
-5.104
Token E
Token ID412
Feature activation+2.40e+00

Pos logit contributions
 tradem+3.979
 undermin+3.100
construct+3.010
cler+3.003
 spoilers+2.990
Neg logit contributions
theless-3.251
-2.761
 DAC-2.491
 DOI-2.462
-2.439
Tokengl
Token ID4743
Feature activation-8.15e-01

Pos logit contributions
 tradem+2.527
 undermin+2.030
construct+1.957
urers+1.953
cler+1.949
Neg logit contributions
theless-1.718
-1.449
 DAC-1.319
 DOI-1.293
-1.229
Tokeninton
Token ID2371
Feature activation+2.18e+00

Pos logit contributions
theless+0.718
+0.635
 DAC+0.586
 DOI+0.576
AI+0.557
Neg logit contributions
 tradem-0.727
 undermin-0.554
 newcom-0.522
cler-0.522
construct-0.522
Token will
Token ID481
Feature activation+3.07e+00

Pos logit contributions
 tradem+2.055
 undermin+1.597
construct+1.536
urers+1.534
 spoilers+1.530
Neg logit contributions
theless-1.793
-1.519
 DAC-1.422
 DOI-1.397
-1.353
Token ob
Token ID909
Feature activation+2.80e+00

Pos logit contributions
 tradem+3.706
 undermin+2.911
construct+2.874
cler+2.831
urers+2.816
Neg logit contributions
theless-3.103
-2.598
 DAC-2.456
 DOI-2.401
-2.285
Tokenb
Token ID65
Feature activation+5.07e-01

Pos logit contributions
 tradem+2.560
construct+2.128
cler+2.119
urers+2.034
 spoilers+2.009
Neg logit contributions
theless-2.279
-1.790
 DAC-1.774
 DOI-1.683
-1.639
Token.
Token ID13
Feature activation+1.61e+00

Pos logit contributions
 tradem+0.522
 undermin+0.416
construct+0.400
urers+0.399
cler+0.398
Neg logit contributions
theless-0.377
-0.323
 DAC-0.293
 DOI-0.288
-0.274
Tokensh
Token ID1477
Feature activation+3.92e+00

Pos logit contributions
 tradem+1.539
 undermin+1.206
construct+1.173
cler+1.162
urers+1.159
Neg logit contributions
theless-1.315
-1.122
 DAC-1.043
 DOI-1.024
-0.980
Tokenorter
Token ID4337
Feature activation+1.78e+00

Pos logit contributions
 tradem+3.419
 undermin+2.621
construct+2.585
cler+2.572
urers+2.557
Neg logit contributions
theless-3.445
-2.946
 DAC-2.791
 DOI-2.739
-2.631
TokenAx
Token ID31554
Feature activation+7.74e+00

Pos logit contributions
 tradem+1.825
 undermin+1.461
construct+1.417
cler+1.406
urers+1.400
Neg logit contributions
theless-1.321
-1.102
 DAC-1.020
 DOI-0.995
-0.950
Tokenis
Token ID271
Feature activation+6.11e+00

Pos logit contributions
 tradem+7.712
 undermin+6.101
cler+6.062
construct+6.003
urers+5.958
Neg logit contributions
theless-5.972
-4.742
 DAC-4.514
 DOI-4.372
-4.249
Token,
Token ID11
Feature activation+2.99e+00

Pos logit contributions
 tradem+5.879
 undermin+4.708
cler+4.644
construct+4.627
urers+4.531
Neg logit contributions
theless-4.709
-3.712
 DAC-3.572
 DOI-3.432
-3.326
Token space
Token ID2272
Feature activation+2.79e+00

Pos logit contributions
 tradem+2.604
 undermin+1.990
construct+1.927
cler+1.905
urers+1.902
Neg logit contributions
theless-2.671
-2.295
 DAC-2.158
 DOI-2.116
-2.038
Token );
Token ID5619
Feature activation-3.68e-01

Pos logit contributions
 tradem+2.692
 undermin+2.133
construct+2.065
cler+2.060
urers+2.040
Neg logit contributions
theless-2.224
-1.846
 DAC-1.734
 DOI-1.689
-1.626
Token //
Token ID3373
Feature activation+1.66e+00

Pos logit contributions
theless+0.305
+0.264
 DAC+0.244
 DOI+0.239
+0.230
Neg logit contributions
 tradem-0.348
 undermin-0.272
construct-0.260
urers-0.259
cler-0.259
Token
Token ID198
Feature activation+8.54e-01

Pos logit contributions
 tradem+1.617
construct+1.358
cler+1.347
urers+1.314
 spoilers+1.304
Neg logit contributions
theless-1.755
-1.404
 DAC-1.341
 DOI-1.301
 detainee-1.264
TokenJoe
Token ID19585
Feature activation+1.38e+00

Pos logit contributions
 tradem+0.747
 undermin+0.571
construct+0.551
cler+0.548
urers+0.547
Neg logit contributions
theless-0.764
-0.662
 DAC-0.618
 DOI-0.608
-0.584
Token V
Token ID569
Feature activation+1.18e+00

Pos logit contributions
 tradem+1.236
 undermin+0.950
construct+0.907
urers+0.905
cler+0.902
Neg logit contributions
theless-1.210
-1.055
 DAC-0.980
 DOI-0.964
-0.927
Tokenitt
Token ID715
Feature activation+8.32e-01

Pos logit contributions
 tradem+1.127
 undermin+0.882
construct+0.844
urers+0.842
cler+0.839
Neg logit contributions
theless-0.961
-0.832
 DAC-0.766
 DOI-0.753
-0.720
Token Aff
Token ID6708
Feature activation-6.35e-02

Pos logit contributions
 tradem+0.820
 undermin+0.648
construct+0.622
urers+0.621
cler+0.619
Neg logit contributions
theless-0.655
-0.561
 DAC-0.516
 DOI-0.507
-0.484
Tokenidav
Token ID20331
Feature activation+7.63e+00

Pos logit contributions
theless+0.054
+0.047
 DAC+0.043
 DOI+0.043
+0.041
Neg logit contributions
 tradem-0.059
 undermin-0.046
construct-0.044
urers-0.043
cler-0.043
Tokenit
Token ID270
Feature activation+4.17e+00

Pos logit contributions
 tradem+6.958
cler+5.583
urers+5.562
construct+5.531
 undermin+5.473
Neg logit contributions
theless-6.284
-5.092
 DAC-4.809
 DOI-4.649
-4.561
Token
Token ID198
Feature activation+3.11e+00

Pos logit contributions
 tradem+3.860
 undermin+2.992
cler+2.909
urers+2.907
construct+2.900
Neg logit contributions
theless-3.453
-2.911
 DAC-2.688
 DOI-2.619
-2.551
Token
Token ID198
Feature activation+1.59e+00

Pos logit contributions
 tradem+2.427
construct+2.073
cler+2.055
urers+1.987
 spoilers+1.986
Neg logit contributions
theless-2.823
-2.236
 DAC-2.145
 DOI-2.078
 detainee-2.036
TokenScott
Token ID19040
Feature activation+1.36e+00

Pos logit contributions
 tradem+1.375
 undermin+1.048
construct+1.021
cler+1.013
urers+1.007
Neg logit contributions
theless-1.437
-1.239
 DAC-1.161
 DOI-1.142
-1.093
Tokenie
Token ID494
Feature activation+6.15e-01

Pos logit contributions
 tradem+1.256
 undermin+0.979
urers+0.942
construct+0.940
cler+0.937
Neg logit contributions
theless-1.138
-0.980
 DAC-0.907
 DOI-0.892
-0.856
Token man
Token ID582
Feature activation+5.87e+00

Pos logit contributions
theless+0.533
+0.468
 DAC+0.435
 DOI+0.428
+0.412
Neg logit contributions
 tradem-0.516
 undermin-0.393
construct-0.374
urers-0.373
cler-0.372
Token,
Token ID11
Feature activation+1.43e+00

Pos logit contributions
 tradem+5.203
cler+4.086
 undermin+4.069
urers+4.031
 spoilers+4.030
Neg logit contributions
theless-4.863
-4.055
 DAC-3.797
 DOI-3.708
-3.568
Token convicted
Token ID9118
Feature activation+3.40e+00

Pos logit contributions
 tradem+1.352
 undermin+1.057
cler+1.012
construct+1.011
urers+1.010
Neg logit contributions
theless-1.178
-1.012
 DAC-0.932
 DOI-0.916
-0.882
Token of
Token ID286
Feature activation+2.17e+00

Pos logit contributions
 tradem+3.145
 undermin+2.448
cler+2.333
urers+2.328
construct+2.324
Neg logit contributions
theless-2.871
-2.470
 DAC-2.293
 DOI-2.251
-2.167
Token veh
Token ID2844
Feature activation+2.95e+00

Pos logit contributions
 tradem+2.128
 undermin+1.686
cler+1.615
construct+1.613
urers+1.613
Neg logit contributions
theless-1.710
-1.451
 DAC-1.333
 DOI-1.310
-1.256
Tokenicular
Token ID13174
Feature activation+7.58e+00

Pos logit contributions
 tradem+1.856
 undermin+1.248
construct+1.198
cler+1.196
urers+1.151
Neg logit contributions
theless-3.326
-2.959
 DAC-2.855
 DOI-2.778
-2.728
Token homicide
Token ID19625
Feature activation+3.46e+00

Pos logit contributions
 tradem+7.161
 undermin+5.645
 spoilers+5.564
construct+5.485
 overs+5.449
Neg logit contributions
theless-5.865
-4.765
 DAC-4.549
 DOI-4.364
AI-4.316
Token for
Token ID329
Feature activation+2.73e+00

Pos logit contributions
 tradem+3.287
 undermin+2.578
cler+2.493
urers+2.476
construct+2.463
Neg logit contributions
theless-2.789
-2.375
 DAC-2.208
 DOI-2.147
AI-2.083
Token killing
Token ID5170
Feature activation+1.83e+00

Pos logit contributions
 tradem+2.621
 undermin+2.073
cler+1.988
construct+1.985
urers+1.982
Neg logit contributions
theless-2.197
-1.872
 DAC-1.724
 DOI-1.699
AI-1.633
Token two
Token ID734
Feature activation+1.79e+00

Pos logit contributions
 tradem+1.448
 undermin+1.073
cler+1.022
urers+1.021
construct+1.012
Neg logit contributions
theless-1.791
-1.576
 DAC-1.470
 DOI-1.455
-1.413
Token Federal
Token ID5618
Feature activation+2.98e+00

Pos logit contributions
 tradem+1.515
 undermin+1.148
urers+1.099
cler+1.090
construct+1.087
Neg logit contributions
theless-1.646
-1.432
 DAC-1.334
 DOI-1.317
-1.276
Token the
Token ID262
Feature activation+1.47e+00

Pos logit contributions
theless+1.321
+1.161
 DAC+1.071
 DOI+1.056
+1.011
Neg logit contributions
 tradem-1.363
 undermin-1.043
construct-0.999
urers-0.998
cler-0.996
Token Football
Token ID9957
Feature activation-5.38e-01

Pos logit contributions
 tradem+1.328
 undermin+1.031
construct+0.982
urers+0.981
cler+0.980
Neg logit contributions
theless-1.269
-1.098
 DAC-1.019
 DOI-1.002
-0.966
Token
Token ID198
Feature activation+1.24e+00

Pos logit contributions
theless+0.410
+0.354
 DAC+0.322
 DOI+0.316
+0.301
Neg logit contributions
 tradem-0.546
 undermin-0.434
construct-0.414
urers-0.414
cler-0.412
Token
Token ID198
Feature activation-1.90e-01

Pos logit contributions
 tradem+1.067
construct+0.871
cler+0.866
urers+0.850
 spoilers+0.842
Neg logit contributions
theless-1.068
-0.864
 DAC-0.824
 DOI-0.802
 detainee-0.770
TokenThis
Token ID1212
Feature activation+1.04e+00

Pos logit contributions
theless+0.162
+0.140
 DAC+0.130
 DOI+0.128
+0.123
Neg logit contributions
 tradem-0.174
 undermin-0.135
construct-0.130
urers-0.129
cler-0.129
Token slideshow
Token ID48828
Feature activation+7.56e+00

Pos logit contributions
 tradem+0.911
 undermin+0.698
construct+0.665
urers+0.662
cler+0.661
Neg logit contributions
theless-0.931
-0.812
 DAC-0.755
 DOI-0.743
-0.717
Token requires
Token ID4433
Feature activation+3.52e+00

Pos logit contributions
 tradem+4.763
 spoilers+3.533
construct+3.520
cler+3.438
urers+3.422
Neg logit contributions
theless-8.163
-6.854
 DAC-6.467
 DOI-6.428
AI-6.372
Token JavaScript
Token ID11933
Feature activation+3.14e-01

Pos logit contributions
 tradem+2.785
 undermin+2.047
construct+1.988
cler+1.979
 spoilers+1.967
Neg logit contributions
theless-3.434
-2.990
 DAC-2.793
 DOI-2.746
-2.669
Token.
Token ID13
Feature activation-2.28e+00

Pos logit contributions
 tradem+0.278
construct+0.215
cler+0.213
urers+0.211
 undermin+0.211
Neg logit contributions
theless-0.274
-0.229
 DAC-0.215
 DOI-0.209
-0.202
Token
Token ID198
Feature activation-2.44e+00

Pos logit contributions
theless+1.587
+1.371
 DOI+1.255
 DAC+1.253
+1.150
Neg logit contributions
 tradem-2.432
 undermin-1.976
urers-1.878
cler-1.871
construct-1.871
Token
Token ID198
Feature activation-2.24e+00

Pos logit contributions
theless+1.856
+1.592
 DAC+1.454
 DOI+1.428
+1.362
Neg logit contributions
 tradem-2.479
 undermin-1.975
construct-1.883
urers-1.882
cler-1.875
Token and
Token ID290
Feature activation+7.89e-01

Pos logit contributions
 tradem+1.723
 undermin+1.328
construct+1.266
urers+1.264
cler+1.262
Neg logit contributions
theless-1.702
-1.478
 DAC-1.370
 DOI-1.351
-1.304
Token the
Token ID262
Feature activation+4.82e-02

Pos logit contributions
 tradem+0.666
 undermin+0.502
construct+0.477
urers+0.475
cler+0.473
Neg logit contributions
theless-0.732
-0.646
 DAC-0.603
 DOI-0.595
-0.570
Token Paradox
Token ID37983
Feature activation-1.69e+00

Pos logit contributions
 tradem+0.043
 undermin+0.033
construct+0.031
urers+0.031
cler+0.031
Neg logit contributions
theless-0.043
-0.037
 DAC-0.035
 DOI-0.034
-0.033
Token of
Token ID286
Feature activation+1.15e+00

Pos logit contributions
theless+1.334
+1.181
 DAC+1.085
 DOI+1.069
AI+1.019
Neg logit contributions
 tradem-1.632
 undermin-1.290
urers-1.219
construct-1.219
 newcom-1.214
Token Credit
Token ID10504
Feature activation+4.11e-01

Pos logit contributions
 tradem+0.997
 undermin+0.759
construct+0.722
urers+0.720
cler+0.718
Neg logit contributions
theless-1.044
-0.916
 DAC-0.853
 DOI-0.840
-0.808
Token Bub
Token ID19645
Feature activation+7.48e+00

Pos logit contributions
 tradem+0.386
 undermin+0.301
construct+0.288
urers+0.288
cler+0.287
Neg logit contributions
theless-0.343
-0.296
 DAC-0.273
 DOI-0.269
-0.258
Tokenbles
Token ID7689
Feature activation+2.24e+00

Pos logit contributions
 tradem+4.307
 undermin+2.827
cler+2.810
construct+2.708
urers+2.651
Neg logit contributions
theless-8.770
-7.676
 DAC-7.349
 DOI-7.294
-7.249
Token,
Token ID11
Feature activation+1.29e+00

Pos logit contributions
 tradem+2.190
 undermin+1.734
construct+1.666
cler+1.662
urers+1.660
Neg logit contributions
theless-1.776
-1.505
 DAC-1.388
 DOI-1.353
-1.307
Token Equity
Token ID35659
Feature activation-2.10e-01

Pos logit contributions
 tradem+1.208
 undermin+0.943
construct+0.901
urers+0.901
cler+0.899
Neg logit contributions
theless-1.080
-0.929
 DAC-0.857
 DOI-0.841
-0.813
Token and
Token ID290
Feature activation+1.52e+00

Pos logit contributions
theless+0.172
+0.149
 DAC+0.138
 DOI+0.135
+0.129
Neg logit contributions
 tradem-0.200
 undermin-0.156
construct-0.149
urers-0.149
cler-0.148
Token Demand
Token ID34479
Feature activation+1.59e+00

Pos logit contributions
 tradem+1.270
 undermin+0.955
construct+0.907
urers+0.905
cler+0.902
Neg logit contributions
theless-1.424
-1.255
 DAC-1.171
 DOI-1.154
-1.113
Token of
Token ID286
Feature activation+3.71e-01

Pos logit contributions
theless+0.030
+0.026
 DAC+0.024
 DOI+0.024
+0.023
Neg logit contributions
 tradem-0.034
 undermin-0.027
construct-0.026
cler-0.026
urers-0.026
Token inquiry
Token ID12069
Feature activation-5.26e-01

Pos logit contributions
 tradem+0.329
 undermin+0.252
construct+0.241
cler+0.240
urers+0.239
Neg logit contributions
theless-0.331
-0.284
 DAC-0.263
 DOI-0.260
-0.253
Token to
Token ID284
Feature activation-1.74e-01

Pos logit contributions
theless+0.425
+0.363
 DAC+0.336
 DOI+0.329
+0.316
Neg logit contributions
 tradem-0.507
 undermin-0.398
construct-0.383
cler-0.382
urers-0.381
Token investigate
Token ID9161
Feature activation-2.45e+00

Pos logit contributions
theless+0.150
+0.130
 DAC+0.120
 DOI+0.118
+0.114
Neg logit contributions
 tradem-0.157
 undermin-0.122
construct-0.117
cler-0.116
urers-0.116
Token this
Token ID428
Feature activation-1.04e+00

Pos logit contributions
theless+1.965
+1.710
 DAC+1.578
 DOI+1.559
+1.472
Neg logit contributions
 tradem-2.378
 undermin-1.856
urers-1.788
construct-1.787
 newcom-1.763
Token mish
Token ID29406
Feature activation+7.45e+00

Pos logit contributions
theless+0.868
+0.750
 DAC+0.692
 DOI+0.681
+0.656
Neg logit contributions
 tradem-0.970
 undermin-0.759
construct-0.725
cler-0.722
urers-0.722
Tokenap
Token ID499
Feature activation+3.11e+00

Pos logit contributions
 tradem+6.075
construct+4.850
cler+4.805
urers+4.571
 undermin+4.563
Neg logit contributions
theless-6.505
-5.374
 DAC-5.360
-5.117
 DOI-5.044
Token.
Token ID13
Feature activation+5.55e-01

Pos logit contributions
 tradem+2.838
 undermin+2.207
cler+2.159
construct+2.139
urers+2.110
Neg logit contributions
theless-2.618
-2.190
 DAC-2.079
 DOI-2.005
-1.956
Token The
Token ID383
Feature activation+3.24e-01

Pos logit contributions
 tradem+0.510
 undermin+0.396
construct+0.393
cler+0.392
urers+0.386
Neg logit contributions
theless-0.461
-0.390
 DAC-0.358
 DOI-0.350
-0.342
Token March
Token ID2805
Feature activation+5.10e-01

Pos logit contributions
 tradem+0.280
 undermin+0.213
construct+0.204
cler+0.204
urers+0.204
Neg logit contributions
theless-0.292
-0.255
 DAC-0.236
 DOI-0.232
-0.226
Token launch
Token ID4219
Feature activation+3.27e+00

Pos logit contributions
 tradem+0.481
 undermin+0.377
cler+0.366
construct+0.365
urers+0.365
Neg logit contributions
theless-0.417
-0.353
 DAC-0.327
 DOI-0.321
-0.311
Token,
Token ID11
Feature activation+9.37e-01

Pos logit contributions
 tradem+2.652
cler+2.102
construct+2.101
 spoilers+2.093
urers+2.076
Neg logit contributions
theless-2.208
-1.780
 DAC-1.656
 DOI-1.636
-1.579
Token a
Token ID257
Feature activation+6.71e-01

Pos logit contributions
 tradem+0.859
 undermin+0.667
construct+0.637
urers+0.635
cler+0.634
Neg logit contributions
theless-0.800
-0.691
 DAC-0.640
 DOI-0.631
-0.608
Token fluid
Token ID11711
Feature activation+1.63e+00

Pos logit contributions
 tradem+0.601
 undermin+0.463
construct+0.444
cler+0.441
urers+0.440
Neg logit contributions
theless-0.587
-0.509
 DAC-0.470
 DOI-0.465
-0.449
Token with
Token ID351
Feature activation-2.35e-01

Pos logit contributions
 tradem+1.558
 undermin+1.224
construct+1.178
cler+1.177
urers+1.170
Neg logit contributions
theless-1.317
-1.110
 DAC-1.024
 DOI-1.016
-0.979
Token some
Token ID617
Feature activation-3.91e-01

Pos logit contributions
theless+0.200
+0.173
 DAC+0.161
 DOI+0.158
+0.152
Neg logit contributions
 tradem-0.216
 undermin-0.168
construct-0.161
urers-0.160
cler-0.160
Token uniform
Token ID8187
Feature activation+7.39e+00

Pos logit contributions
theless+0.338
+0.293
 DAC+0.271
 DOI+0.267
+0.258
Neg logit contributions
 tradem-0.354
 undermin-0.273
construct-0.261
urers-0.261
cler-0.260
Token ambient
Token ID25237
Feature activation+1.73e+00

Pos logit contributions
 tradem+6.032
 undermin+4.606
 spoilers+4.569
construct+4.453
cler+4.449
Neg logit contributions
theless-6.548
-5.550
 DAC-5.184
-5.085
 DOI-5.081
Token density
Token ID12109
Feature activation+1.81e+00

Pos logit contributions
 tradem+1.561
 undermin+1.207
construct+1.155
cler+1.154
urers+1.148
Neg logit contributions
theless-1.508
-1.283
 DAC-1.195
 DOI-1.177
-1.143
Token (
Token ID357
Feature activation+6.12e-01

Pos logit contributions
 tradem+1.714
 undermin+1.341
construct+1.295
cler+1.292
 spoilers+1.280
Neg logit contributions
theless-1.490
-1.246
 DAC-1.153
 DOI-1.144
-1.110
Tokenas
Token ID292
Feature activation-1.27e+00

Pos logit contributions
 tradem+0.613
 undermin+0.487
construct+0.476
cler+0.472
urers+0.470
Neg logit contributions
theless-0.468
-0.393
 DAC-0.364
 DOI-0.357
-0.340
Token the
Token ID262
Feature activation+1.02e+00

Pos logit contributions
theless+1.062
+0.919
 DAC+0.850
 DOI+0.836
+0.802
Neg logit contributions
 tradem-1.190
 undermin-0.926
construct-0.887
urers-0.885
cler-0.883
Token he
Token ID339
Feature activation+2.44e+00

Pos logit contributions
 tradem+1.369
 undermin+1.071
urers+1.027
construct+1.027
cler+1.025
Neg logit contributions
theless-1.195
-1.030
 DAC-0.945
 DOI-0.934
-0.899
Token'd
Token ID1549
Feature activation+4.16e+00

Pos logit contributions
 tradem+2.320
 undermin+1.834
cler+1.797
construct+1.782
urers+1.779
Neg logit contributions
theless-1.936
-1.658
 DAC-1.541
 DOI-1.514
-1.449
Token be
Token ID307
Feature activation+2.32e+00

Pos logit contributions
 tradem+3.583
 undermin+2.829
construct+2.764
cler+2.754
 spoilers+2.751
Neg logit contributions
theless-3.546
-3.048
 DAC-2.857
 DOI-2.768
-2.714
Token a
Token ID257
Feature activation+3.83e+00

Pos logit contributions
 tradem+2.038
 undermin+1.566
cler+1.511
construct+1.507
urers+1.502
Neg logit contributions
theless-2.043
-1.763
 DAC-1.632
 DOI-1.611
-1.554
Token hockey
Token ID12217
Feature activation-1.54e-01

Pos logit contributions
 tradem+3.402
 undermin+2.581
cler+2.524
urers+2.503
construct+2.498
Neg logit contributions
theless-3.294
-2.860
 DAC-2.630
 DOI-2.598
AI-2.545
Token dad
Token ID9955
Feature activation+7.37e+00

Pos logit contributions
theless+0.141
+0.123
 DAC+0.115
 DOI+0.113
+0.109
Neg logit contributions
 tradem-0.132
 undermin-0.100
construct-0.096
urers-0.095
cler-0.095
Token when
Token ID618
Feature activation+1.71e+00

Pos logit contributions
 tradem+6.510
cler+5.052
 undermin+5.024
 spoilers+4.935
urers+4.933
Neg logit contributions
theless-6.127
-5.084
 DAC-4.887
 DOI-4.707
AI-4.576
Token his
Token ID465
Feature activation+9.66e-01

Pos logit contributions
 tradem+1.613
 undermin+1.260
construct+1.205
urers+1.204
cler+1.203
Neg logit contributions
theless-1.407
-1.215
 DAC-1.117
 DOI-1.100
-1.058
Token kids
Token ID3988
Feature activation+5.22e-01

Pos logit contributions
 tradem+0.824
 undermin+0.626
cler+0.599
urers+0.595
construct+0.594
Neg logit contributions
theless-0.882
-0.770
 DAC-0.711
 DOI-0.705
-0.686
Token were
Token ID547
Feature activation+3.38e-01

Pos logit contributions
 tradem+0.496
 undermin+0.389
cler+0.374
construct+0.373
urers+0.373
Neg logit contributions
theless-0.427
-0.367
 DAC-0.338
 DOI-0.332
-0.320
Token toddlers
Token ID50142
Feature activation+2.14e+00

Pos logit contributions
 tradem+0.291
 undermin+0.221
urers+0.211
construct+0.211
cler+0.210
Neg logit contributions
theless-0.307
-0.268
 DAC-0.249
 DOI-0.246
-0.238
Token
Token ID198
Feature activation-1.82e+00

Pos logit contributions
theless+1.450
+1.248
 DAC+1.128
 DOI+1.124
 µ+1.053
Neg logit contributions
 tradem-2.248
 undermin-1.846
 newcom-1.740
urers-1.735
construct-1.734
Token
Token ID198
Feature activation-1.33e+00

Pos logit contributions
theless+1.436
+1.214
 DAC+1.121
 DOI+1.097
+1.050
Neg logit contributions
 tradem-1.779
 undermin-1.397
construct-1.376
cler-1.369
urers-1.366
TokenSorry
Token ID14385
Feature activation-2.70e+00

Pos logit contributions
theless+1.184
+1.044
 DAC+0.965
 DOI+0.950
+0.917
Neg logit contributions
 tradem-1.171
 undermin-0.896
urers-0.845
construct-0.844
cler-0.839
Token,
Token ID11
Feature activation-5.55e-01

Pos logit contributions
theless+1.870
+1.577
 DAC+1.431
 DOI+1.401
+1.326
Neg logit contributions
 tradem-2.909
 undermin-2.352
construct-2.263
urers-2.257
cler-2.252
Token this
Token ID428
Feature activation+1.42e+00

Pos logit contributions
theless+0.409
+0.346
 DAC+0.315
 DOI+0.309
+0.296
Neg logit contributions
 tradem-0.575
 undermin-0.459
construct-0.443
cler-0.442
urers-0.442
Token video
Token ID2008
Feature activation+7.33e+00

Pos logit contributions
 tradem+1.245
 undermin+0.946
construct+0.912
cler+0.908
urers+0.903
Neg logit contributions
theless-1.271
-1.105
 DAC-1.027
 DOI-1.008
-0.979
Token has
Token ID468
Feature activation+3.01e+00

Pos logit contributions
 tradem+5.721
 overs+4.414
 spoilers+4.353
construct+4.125
cler+4.114
Neg logit contributions
theless-6.797
-5.766
 DAC-5.420
 DOI-5.374
-5.281
Token expired
Token ID21350
Feature activation+3.63e+00

Pos logit contributions
 tradem+3.315
 undermin+2.701
 spoilers+2.621
construct+2.608
cler+2.605
Neg logit contributions
theless-2.003
-1.637
 DAC-1.477
 DOI-1.457
-1.386
Token Video
Token ID7623
Feature activation+4.47e+00

Pos logit contributions
 tradem+3.182
 undermin+2.426
construct+2.393
cler+2.379
 spoilers+2.372
Neg logit contributions
theless-3.196
-2.749
 DAC-2.548
 DOI-2.498
-2.430
Token:
Token ID25
Feature activation-1.47e+00

Pos logit contributions
 tradem+3.852
cler+2.993
 spoilers+2.983
construct+2.955
urers+2.934
Neg logit contributions
theless-3.945
-3.262
 DAC-3.073
 DOI-3.016
-2.890
Token Mary
Token ID5335
Feature activation-3.98e-01

Pos logit contributions
theless+1.240
+1.082
 DAC+1.012
 DOI+0.994
AI+0.944
Neg logit contributions
 tradem-1.378
 undermin-1.069
construct-1.011
urers-1.010
 newcom-1.009
TokenA
Token ID32
Feature activation+8.32e-01

Pos logit contributions
 tradem+1.177
 undermin+0.916
construct+0.906
cler+0.897
urers+0.886
Neg logit contributions
theless-1.089
-0.923
 DAC-0.866
 DOI-0.857
-0.812
Tokenerial
Token ID48499
Feature activation+4.29e+00

Pos logit contributions
 tradem+0.758
 undermin+0.585
construct+0.561
urers+0.558
cler+0.558
Neg logit contributions
theless-0.714
-0.622
 DAC-0.575
 DOI-0.566
-0.545
Token view
Token ID1570
Feature activation+4.49e+00

Pos logit contributions
 tradem+3.343
 undermin+2.475
construct+2.362
 spoilers+2.343
cler+2.340
Neg logit contributions
theless-4.203
-3.660
 DAC-3.439
 DOI-3.397
-3.310
Token of
Token ID286
Feature activation+2.88e+00

Pos logit contributions
 tradem+3.961
 undermin+3.085
construct+2.969
cler+2.955
urers+2.926
Neg logit contributions
theless-3.876
-3.315
 DAC-3.095
 DOI-3.066
-2.953
Token Y
Token ID575
Feature activation+1.68e+00

Pos logit contributions
 tradem+2.556
 undermin+1.958
construct+1.880
cler+1.857
urers+1.851
Neg logit contributions
theless-2.530
-2.209
 DAC-2.036
 DOI-2.019
-1.951
Tokenonge
Token ID14220
Feature activation+7.33e+00

Pos logit contributions
 tradem+0.938
 undermin+0.580
construct+0.554
cler+0.549
urers+0.549
Neg logit contributions
theless-2.030
-1.831
 DAC-1.744
 DOI-1.724
-1.679
Token and
Token ID290
Feature activation+2.19e+00

Pos logit contributions
 tradem+6.714
 undermin+5.172
construct+5.048
 spoilers+5.037
cler+4.988
Neg logit contributions
theless-5.968
-4.994
 DAC-4.664
 DOI-4.593
-4.525
Token E
Token ID412
Feature activation+2.00e+00

Pos logit contributions
 tradem+1.897
 undermin+1.440
construct+1.373
urers+1.366
cler+1.366
Neg logit contributions
theless-1.987
-1.739
 DAC-1.615
 DOI-1.590
-1.539
Tokengl
Token ID4743
Feature activation+5.98e-01

Pos logit contributions
 tradem+2.073
 undermin+1.656
construct+1.605
cler+1.596
urers+1.595
Neg logit contributions
theless-1.468
-1.241
 DAC-1.135
 DOI-1.111
-1.061
Tokeninton
Token ID2371
Feature activation+1.39e+00

Pos logit contributions
 tradem+0.470
construct+0.347
 undermin+0.345
urers+0.343
cler+0.343
Neg logit contributions
theless-0.581
-0.504
 DAC-0.482
 DOI-0.467
-0.454
Token,
Token ID11
Feature activation+8.44e-01

Pos logit contributions
 tradem+1.338
 undermin+1.053
construct+1.009
cler+1.002
urers+1.002
Neg logit contributions
theless-1.117
-0.953
 DAC-0.879
 DOI-0.863
-0.833
Token whatever
Token ID4232
Feature activation-1.34e+00

Pos logit contributions
 tradem+2.100
 undermin+1.650
construct+1.593
urers+1.591
cler+1.580
Neg logit contributions
theless-1.799
-1.542
 DAC-1.433
 DOI-1.404
-1.344
Token he
Token ID339
Feature activation+2.16e+00

Pos logit contributions
theless+1.039
+0.893
 DAC+0.816
 DOI+0.801
+0.762
Neg logit contributions
 tradem-1.332
 undermin-1.050
urers-1.009
construct-1.008
cler-1.004
Token has
Token ID468
Feature activation+5.27e-01

Pos logit contributions
 tradem+1.776
 undermin+1.346
construct+1.300
urers+1.290
cler+1.288
Neg logit contributions
theless-2.001
-1.762
 DAC-1.659
 DOI-1.624
-1.569
Token to
Token ID284
Feature activation+2.25e+00

Pos logit contributions
 tradem+0.532
 undermin+0.424
construct+0.408
urers+0.407
cler+0.406
Neg logit contributions
theless-0.400
-0.340
 DAC-0.313
 DOI-0.307
-0.293
Token in
Token ID287
Feature activation+1.42e+00

Pos logit contributions
 tradem+2.173
 undermin+1.716
construct+1.654
urers+1.645
cler+1.644
Neg logit contributions
theless-1.781
-1.514
 DAC-1.419
 DOI-1.374
-1.322
Token order
Token ID1502
Feature activation+7.07e+00

Pos logit contributions
 tradem+1.204
 undermin+0.911
construct+0.871
urers+0.866
cler+0.863
Neg logit contributions
theless-1.317
-1.153
 DAC-1.078
 DOI-1.064
-1.025
Token to
Token ID284
Feature activation+3.08e+00

Pos logit contributions
 tradem+6.535
 undermin+5.198
cler+4.898
urers+4.887
construct+4.880
Neg logit contributions
theless-5.737
-4.829
 DAC-4.611
 DOI-4.453
-4.237
Token achieve
Token ID4620
Feature activation+2.29e+00

Pos logit contributions
 tradem+2.487
 undermin+1.917
construct+1.837
cler+1.810
 spoilers+1.808
Neg logit contributions
theless-2.867
-2.503
 DAC-2.371
 DOI-2.310
-2.266
Token his
Token ID465
Feature activation+2.85e+00

Pos logit contributions
 tradem+1.970
 undermin+1.501
construct+1.431
urers+1.427
cler+1.426
Neg logit contributions
theless-2.074
-1.805
 DAC-1.690
 DOI-1.659
-1.616
Token aims
Token ID12031
Feature activation-3.21e-01

Pos logit contributions
 tradem+2.458
 undermin+1.894
cler+1.803
construct+1.800
 spoilers+1.798
Neg logit contributions
theless-2.541
-2.203
 DAC-2.049
 DOI-2.023
-1.966
Token,
Token ID11
Feature activation+1.36e+00

Pos logit contributions
theless+0.253
+0.218
 DAC+0.199
 DOI+0.196
+0.187
Neg logit contributions
 tradem-0.316
 undermin-0.250
urers-0.239
construct-0.239
cler-0.238
Token distance
Token ID5253
Feature activation-1.63e+00

Pos logit contributions
 tradem+2.165
 undermin+1.615
construct+1.536
cler+1.520
urers+1.511
Neg logit contributions
theless-2.505
-2.186
 DAC-2.056
 DOI-2.026
-1.961
Token from
Token ID422
Feature activation-3.52e-01

Pos logit contributions
theless+1.328
+1.167
 DAC+1.071
 DOI+1.049
+1.005
Neg logit contributions
 tradem-1.548
 undermin-1.211
urers-1.170
construct-1.161
cler-1.158
Token 2
Token ID362
Feature activation-1.61e+00

Pos logit contributions
theless+0.308
+0.269
 DAC+0.251
 DOI+0.246
+0.235
Neg logit contributions
 tradem-0.316
 undermin-0.243
urers-0.232
construct-0.232
cler-0.231
Token221
Token ID26115
Feature activation+1.29e+00

Pos logit contributions
theless+1.354
+1.181
 DAC+1.093
 DOI+1.073
AI+1.025
Neg logit contributions
 tradem-1.498
 undermin-1.168
urers-1.115
construct-1.111
cler-1.107
Token Y
Token ID575
Feature activation+2.91e+00

Pos logit contributions
 tradem+1.146
 undermin+0.877
construct+0.845
cler+0.837
urers+0.835
Neg logit contributions
theless-1.128
-0.979
 DAC-0.909
 DOI-0.895
-0.866
Tokenonge
Token ID14220
Feature activation+7.04e+00

Pos logit contributions
 tradem+1.650
 undermin+1.030
construct+0.977
cler+0.968
urers+0.961
Neg logit contributions
theless-3.497
-3.151
 DAC-2.998
 DOI-2.963
-2.896
Token Cond
Token ID9724
Feature activation+9.70e-01

Pos logit contributions
 tradem+5.900
 undermin+4.430
construct+4.307
 spoilers+4.293
cler+4.251
Neg logit contributions
theless-6.252
-5.278
 DAC-5.005
 DOI-4.910
-4.867
Tokenos
Token ID418
Feature activation+1.92e+00

Pos logit contributions
 tradem+1.060
 undermin+0.853
construct+0.840
urers+0.837
cler+0.836
Neg logit contributions
theless-0.656
-0.538
 DAC-0.491
 DOI-0.475
-0.454
Token to
Token ID284
Feature activation+4.61e-01

Pos logit contributions
 tradem+1.925
 undermin+1.526
construct+1.466
cler+1.459
urers+1.457
Neg logit contributions
theless-1.481
-1.254
 DAC-1.153
 DOI-1.129
-1.083
Token the
Token ID262
Feature activation+1.63e+00

Pos logit contributions
 tradem+0.420
 undermin+0.327
urers+0.314
construct+0.314
cler+0.312
Neg logit contributions
theless-0.393
-0.343
 DAC-0.319
 DOI-0.314
AI-0.298
Token south
Token ID5366
Feature activation-2.82e+00

Pos logit contributions
 tradem+1.347
 undermin+1.008
construct+0.956
urers+0.953
cler+0.952
Neg logit contributions
theless-1.546
-1.361
 DAC-1.271
 DOI-1.253
-1.211
Token than
Token ID621
Feature activation+8.14e-01

Pos logit contributions
 tradem+1.191
 undermin+0.929
construct+0.891
cler+0.891
urers+0.887
Neg logit contributions
theless-1.067
-0.919
 DAC-0.857
 DOI-0.838
-0.806
Token ever
Token ID1683
Feature activation-1.05e+00

Pos logit contributions
 tradem+0.757
 undermin+0.591
construct+0.566
cler+0.565
urers+0.564
Neg logit contributions
theless-0.682
-0.587
 DAC-0.545
 DOI-0.537
-0.517
Token.
Token ID13
Feature activation+6.17e-01

Pos logit contributions
theless+0.907
+0.793
 DAC+0.728
 DOI+0.719
+0.690
Neg logit contributions
 tradem-0.955
 undermin-0.737
construct-0.703
urers-0.703
cler-0.696
Token The
Token ID383
Feature activation+1.86e-01

Pos logit contributions
 tradem+0.593
 undermin+0.468
construct+0.459
cler+0.457
urers+0.454
Neg logit contributions
theless-0.488
-0.412
 DAC-0.381
 DOI-0.372
-0.357
Token below
Token ID2174
Feature activation+1.49e+00

Pos logit contributions
 tradem+0.174
 undermin+0.136
construct+0.130
urers+0.130
cler+0.130
Neg logit contributions
theless-0.156
-0.135
 DAC-0.124
 DOI-0.122
-0.118
Token chart
Token ID8262
Feature activation+7.02e+00

Pos logit contributions
 tradem+1.255
 undermin+0.950
cler+0.908
construct+0.907
urers+0.906
Neg logit contributions
theless-1.368
-1.187
 DAC-1.109
 DOI-1.095
-1.061
Token shows
Token ID2523
Feature activation+4.10e+00

Pos logit contributions
 tradem+5.282
 spoilers+4.072
construct+4.043
cler+4.023
 undermin+3.992
Neg logit contributions
theless-6.367
-5.461
 DAC-5.312
 DOI-5.157
-5.092
Token the
Token ID262
Feature activation-1.48e-01

Pos logit contributions
 tradem+3.543
 undermin+2.722
 spoilers+2.627
cler+2.617
urers+2.593
Neg logit contributions
theless-3.601
-3.093
 DAC-2.898
 DOI-2.875
AI-2.782
Token distribution
Token ID6082
Feature activation+3.06e+00

Pos logit contributions
theless+0.124
+0.108
 DAC+0.100
 DOI+0.098
+0.094
Neg logit contributions
 tradem-0.138
 undermin-0.107
construct-0.102
urers-0.102
cler-0.102
Token in
Token ID287
Feature activation-7.59e-01

Pos logit contributions
 tradem+2.867
 undermin+2.236
urers+2.137
construct+2.135
cler+2.131
Neg logit contributions
theless-2.553
-2.202
 DAC-2.041
 DOI-2.004
-1.925
Token the
Token ID262
Feature activation-4.27e-01

Pos logit contributions
theless+0.683
+0.599
 DAC+0.557
 DOI+0.548
+0.527
Neg logit contributions
 tradem-0.663
 undermin-0.506
construct-0.482
urers-0.481
cler-0.479
Token teens
Token ID15508
Feature activation+4.21e+00

Pos logit contributions
 tradem+1.237
 undermin+0.935
construct+0.891
urers+0.891
cler+0.889
Neg logit contributions
theless-1.358
-1.184
 DAC-1.108
 DOI-1.091
-1.053
Token,
Token ID11
Feature activation+1.56e+00

Pos logit contributions
 tradem+3.940
 undermin+3.089
cler+3.015
urers+2.994
construct+2.976
Neg logit contributions
theless-3.399
-2.887
 DAC-2.688
 DOI-2.619
-2.530
Token is
Token ID318
Feature activation+3.15e+00

Pos logit contributions
 tradem+1.545
 undermin+1.230
cler+1.192
urers+1.191
construct+1.181
Neg logit contributions
theless-1.205
-1.020
 DAC-0.941
 DOI-0.918
-0.883
Token charged
Token ID5047
Feature activation+2.70e+00

Pos logit contributions
 tradem+3.073
 undermin+2.446
urers+2.358
cler+2.354
 spoilers+2.345
Neg logit contributions
theless-2.472
-2.048
 DAC-1.895
 DOI-1.860
-1.790
Token for
Token ID329
Feature activation+8.09e-01

Pos logit contributions
 tradem+2.707
 undermin+2.149
construct+2.063
urers+2.060
cler+2.057
Neg logit contributions
theless-2.072
-1.764
 DAC-1.618
 DOI-1.587
-1.514
Token driving
Token ID5059
Feature activation+7.01e+00

Pos logit contributions
 tradem+0.773
 undermin+0.605
construct+0.580
urers+0.579
cler+0.577
Neg logit contributions
theless-0.661
-0.570
 DAC-0.525
 DOI-0.517
-0.495
Token a
Token ID257
Feature activation+5.07e+00

Pos logit contributions
 tradem+5.814
 undermin+4.526
urers+4.358
 spoilers+4.343
cler+4.337
Neg logit contributions
theless-6.218
-5.209
 DAC-4.941
 DOI-4.859
AI-4.672
Token vehicle
Token ID4038
Feature activation+2.34e+00

Pos logit contributions
 tradem+4.317
 undermin+3.264
cler+3.178
urers+3.150
 spoilers+3.118
Neg logit contributions
theless-4.558
-3.921
 DAC-3.597
 DOI-3.548
AI-3.464
Token without
Token ID1231
Feature activation+1.85e-02

Pos logit contributions
 tradem+2.147
 undermin+1.689
urers+1.625
construct+1.624
cler+1.624
Neg logit contributions
theless-1.943
-1.660
 DAC-1.540
 DOI-1.522
-1.459
Token an
Token ID281
Feature activation-1.45e-01

Pos logit contributions
 tradem+0.017
 undermin+0.013
construct+0.013
urers+0.013
cler+0.012
Neg logit contributions
theless-0.016
-0.014
 DAC-0.013
 DOI-0.013
-0.012
Token inter
Token ID987
Feature activation+1.56e+00

Pos logit contributions
theless+0.122
+0.106
 DAC+0.098
 DOI+0.096
+0.092
Neg logit contributions
 tradem-0.134
 undermin-0.104
construct-0.100
urers-0.100
cler-0.099
Token.
Token ID13
Feature activation+2.49e+00

Pos logit contributions
theless+0.252
+0.216
 DAC+0.200
 DOI+0.196
+0.188
Neg logit contributions
 tradem-0.302
 undermin-0.237
construct-0.229
cler-0.229
urers-0.228
Tokenchest
Token ID46713
Feature activation-8.91e-01

Pos logit contributions
 tradem+2.571
cler+2.348
construct+2.315
urers+2.192
ettel+2.164
Neg logit contributions
theless-1.561
-1.266
 DAC-1.260
 DOI-1.217
 DDR-1.162
Tokeno
Token ID78
Feature activation+5.81e-01

Pos logit contributions
theless+0.565
+0.456
 DAC+0.414
 DOI+0.399
+0.378
Neg logit contributions
 tradem-1.012
 undermin-0.823
construct-0.815
cler-0.809
urers-0.808
Token@
Token ID31
Feature activation+1.44e-01

Pos logit contributions
 tradem+0.643
 undermin+0.523
construct+0.504
urers+0.503
cler+0.502
Neg logit contributions
theless-0.387
-0.322
 DAC-0.290
 DOI-0.284
-0.268
Tokengl
Token ID4743
Feature activation+2.24e+00

Pos logit contributions
 tradem+0.153
 undermin+0.123
construct+0.119
urers+0.119
cler+0.118
Neg logit contributions
theless-0.102
-0.086
 DAC-0.078
 DOI-0.077
-0.073
Tokenobe
Token ID5910
Feature activation+6.98e+00

Pos logit contributions
 tradem+1.107
construct+0.793
cler+0.770
urers+0.743
ettel+0.659
Neg logit contributions
theless-2.750
-2.363
 DAC-2.323
 DOI-2.279
-2.272
Token.
Token ID13
Feature activation+1.81e+00

Pos logit contributions
 tradem+5.103
construct+4.273
cler+4.220
urers+3.927
 spoilers+3.915
Neg logit contributions
theless-6.746
 DAC-5.427
-5.315
-5.125
 DOI-5.109
Tokencom
Token ID785
Feature activation+2.45e+00

Pos logit contributions
 tradem+1.324
construct+1.007
cler+0.981
urers+0.959
 spoilers+0.943
Neg logit contributions
theless-1.810
-1.549
 DAC-1.485
 DOI-1.433
-1.410
Token.
Token ID764
Feature activation-4.84e-01

Pos logit contributions
 tradem+2.552
 undermin+2.031
construct+1.986
cler+1.980
urers+1.940
Neg logit contributions
theless-1.757
-1.461
 DAC-1.344
 DOI-1.306
-1.261
Token Follow
Token ID7281
Feature activation+6.46e-01

Pos logit contributions
theless+0.257
+0.203
 DAC+0.177
 DOI+0.171
+0.158
Neg logit contributions
 tradem-0.601
 undermin-0.501
construct-0.485
urers-0.485
cler-0.484
Token him
Token ID683
Feature activation+2.05e+00

Pos logit contributions
 tradem+0.614
 undermin+0.480
construct+0.461
cler+0.459
urers+0.457
Neg logit contributions
theless-0.531
-0.455
 DAC-0.422
 DOI-0.412
-0.398
Token the
Token ID262
Feature activation+7.43e-01

Pos logit contributions
theless+0.601
+0.524
 DAC+0.488
 DOI+0.478
+0.459
Neg logit contributions
 tradem-0.636
 undermin-0.491
construct-0.469
urers-0.469
cler-0.467
Token 2011
Token ID2813
Feature activation-7.11e-01

Pos logit contributions
 tradem+0.686
 undermin+0.531
construct+0.509
urers+0.507
cler+0.507
Neg logit contributions
theless-0.630
-0.545
 DAC-0.502
 DOI-0.496
-0.477
Token Disney
Token ID8519
Feature activation+9.95e-01

Pos logit contributions
theless+0.617
+0.539
 DAC+0.501
 DOI+0.492
+0.472
Neg logit contributions
 tradem-0.644
 undermin-0.497
construct-0.473
urers-0.473
cler-0.471
Token D
Token ID360
Feature activation+1.44e+00

Pos logit contributions
 tradem+0.923
 undermin+0.716
construct+0.688
cler+0.686
urers+0.682
Neg logit contributions
theless-0.836
-0.719
 DAC-0.666
 DOI-0.656
-0.632
Token23
Token ID1954
Feature activation+8.98e-01

Pos logit contributions
 tradem+1.357
 undermin+1.055
construct+1.020
urers+1.019
cler+1.017
Neg logit contributions
theless-1.190
-1.020
 DAC-0.946
 DOI-0.929
-0.896
Token Expo
Token ID32330
Feature activation+6.95e+00

Pos logit contributions
 tradem+0.816
 undermin+0.630
construct+0.602
urers+0.601
cler+0.599
Neg logit contributions
theless-0.776
-0.673
 DAC-0.624
 DOI-0.615
-0.591
Token to
Token ID284
Feature activation+1.37e+00

Pos logit contributions
 tradem+6.284
 undermin+4.812
 spoilers+4.704
cler+4.668
urers+4.638
Neg logit contributions
theless-5.898
-4.792
 DAC-4.456
 DOI-4.431
-4.277
Token honor
Token ID7522
Feature activation+1.99e+00

Pos logit contributions
 tradem+1.197
 undermin+0.921
construct+0.883
cler+0.878
urers+0.875
Neg logit contributions
theless-1.223
-1.055
 DAC-0.984
 DOI-0.965
-0.928
Token Jim
Token ID5395
Feature activation+1.34e+00

Pos logit contributions
 tradem+1.784
 undermin+1.370
construct+1.308
urers+1.302
cler+1.302
Neg logit contributions
theless-1.749
-1.517
 DAC-1.406
 DOI-1.387
-1.333
Token H
Token ID367
Feature activation-5.57e-01

Pos logit contributions
 tradem+1.506
 undermin+1.230
construct+1.189
urers+1.187
cler+1.187
Neg logit contributions
theless-0.866
-0.713
 DAC-0.638
 DOI-0.625
-0.591
Tokenenson
Token ID19069
Feature activation+2.69e+00

Pos logit contributions
theless+0.491
+0.420
 DAC+0.395
 DOI+0.387
+0.373
Neg logit contributions
 tradem-0.496
 undermin-0.380
urers-0.372
construct-0.372
cler-0.371
Token argument
Token ID4578
Feature activation-3.76e-01

Pos logit contributions
 tradem+0.562
 undermin+0.436
construct+0.418
urers+0.417
cler+0.417
Neg logit contributions
theless-0.529
-0.459
 DAC-0.425
 DOI-0.419
-0.402
Token offer
Token ID2897
Feature activation-1.33e+00

Pos logit contributions
theless+0.296
+0.254
 DAC+0.233
 DOI+0.229
+0.219
Neg logit contributions
 tradem-0.371
 undermin-0.293
construct-0.280
urers-0.279
cler-0.279
Token,
Token ID11
Feature activation+7.88e-01

Pos logit contributions
theless+1.031
+0.883
 DAC+0.803
 DOI+0.791
+0.755
Neg logit contributions
 tradem-1.341
 undermin-1.062
construct-1.017
cler-1.014
 newcom-1.009
Token partly
Token ID11476
Feature activation+1.47e+00

Pos logit contributions
 tradem+0.736
 undermin+0.573
construct+0.548
urers+0.547
cler+0.546
Neg logit contributions
theless-0.659
-0.571
 DAC-0.528
 DOI-0.519
-0.498
Token,
Token ID11
Feature activation+3.14e-01

Pos logit contributions
 tradem+1.400
 undermin+1.101
construct+1.054
urers+1.054
cler+1.050
Neg logit contributions
theless-1.192
-1.028
 DAC-0.950
 DOI-0.934
-0.894
Token too
Token ID1165
Feature activation+6.83e+00

Pos logit contributions
 tradem+0.304
 undermin+0.239
construct+0.230
urers+0.229
cler+0.228
Neg logit contributions
theless-0.252
-0.216
 DAC-0.199
 DOI-0.196
-0.187
Token,
Token ID11
Feature activation+4.88e-02

Pos logit contributions
 tradem+6.789
 undermin+5.401
construct+5.196
cler+5.189
urers+5.136
Neg logit contributions
theless-5.125
-4.277
 DAC-4.053
 DOI-3.954
-3.746
Token because
Token ID780
Feature activation+7.16e-01

Pos logit contributions
 tradem+0.047
 undermin+0.037
construct+0.036
urers+0.035
cler+0.035
Neg logit contributions
theless-0.039
-0.034
 DAC-0.031
 DOI-0.031
-0.029
Token frequent
Token ID10792
Feature activation+8.25e-01

Pos logit contributions
 tradem+0.686
 undermin+0.538
urers+0.517
construct+0.517
cler+0.515
Neg logit contributions
theless-0.580
-0.500
 DAC-0.461
 DOI-0.453
-0.435
Token repetition
Token ID29693
Feature activation+5.70e-01

Pos logit contributions
 tradem+0.732
 undermin+0.563
construct+0.537
urers+0.537
cler+0.535
Neg logit contributions
theless-0.726
-0.635
 DAC-0.590
 DOI-0.580
-0.558
Token brought
Token ID3181
Feature activation-7.73e-01

Pos logit contributions
 tradem+0.547
 undermin+0.429
construct+0.411
urers+0.411
cler+0.410
Neg logit contributions
theless-0.462
-0.399
 DAC-0.368
 DOI-0.361
-0.346
Token pharmac
Token ID25702
Feature activation-1.78e+00

Pos logit contributions
theless+3.017
+2.627
 DAC+2.412
 DOI+2.373
AI+2.256
Neg logit contributions
 tradem-3.642
 undermin-2.858
construct-2.734
urers-2.733
cler-2.719
Tokenological
Token ID2770
Feature activation-3.17e+00

Pos logit contributions
theless+1.127
+0.932
 DAC+0.831
 DOI+0.814
+0.764
Neg logit contributions
 tradem-2.023
 undermin-1.654
construct-1.595
urers-1.592
cler-1.587
Token profile
Token ID7034
Feature activation-2.14e+00

Pos logit contributions
theless+2.684
+2.340
 DAC+2.162
 DOI+2.129
+2.035
Neg logit contributions
 tradem-2.939
 undermin-2.277
construct-2.178
urers-2.174
cler-2.164
Token of
Token ID286
Feature activation-1.98e+00

Pos logit contributions
theless+1.629
+1.453
 DAC+1.307
 DOI+1.295
AI+1.213
Neg logit contributions
 tradem-2.123
 undermin-1.684
construct-1.619
urers-1.615
cler-1.604
Token N
Token ID399
Feature activation-5.56e-01

Pos logit contributions
theless+1.573
+1.354
 DAC+1.245
 DOI+1.226
+1.169
Neg logit contributions
 tradem-1.931
 undermin-1.521
construct-1.459
urers-1.457
cler-1.451
Token-
Token ID12
Feature activation-7.55e+00

Pos logit contributions
theless+0.385
+0.316
 DAC+0.291
 DOI+0.283
+0.269
Neg logit contributions
 tradem-0.600
 undermin-0.485
construct-0.472
cler-0.471
urers-0.470
Tokencyclop
Token ID22873
Feature activation-2.13e+00

Pos logit contributions
theless+5.587
+4.800
 DOI+4.269
 DAC+4.191
AI+4.066
Neg logit contributions
 tradem-7.712
 undermin-6.202
urers-5.815
 newcom-5.748
 spoilers-5.746
Tokenrop
Token ID1773
Feature activation-2.84e+00

Pos logit contributions
theless+2.223
+1.992
 DAC+1.866
 DOI+1.848
AI+1.786
Neg logit contributions
 tradem-1.544
 undermin-1.109
urers-1.032
construct-1.030
cler-1.019
Tokenyl
Token ID2645
Feature activation-3.67e+00

Pos logit contributions
theless+1.956
+1.639
 DAC+1.482
 DOI+1.451
+1.374
Neg logit contributions
 tradem-3.083
 undermin-2.493
construct-2.403
urers-2.398
cler-2.393
Tokenmethyl
Token ID43654
Feature activation-6.45e+00

Pos logit contributions
theless+2.841
+2.461
 DAC+2.235
 DOI+2.208
AI+2.081
Neg logit contributions
 tradem-3.647
 undermin-2.905
urers-2.777
construct-2.764
 newcom-2.742
Token-
Token ID12
Feature activation-3.75e+00

Pos logit contributions
theless+4.868
+4.247
 DAC+3.834
 DOI+3.834
 DDR+3.550
Neg logit contributions
 tradem-6.440
 undermin-5.220
urers-4.933
construct-4.825
cler-4.819
Token week
Token ID1285
Feature activation-4.38e-01

Pos logit contributions
theless+0.563
+0.490
 DAC+0.453
 DOI+0.446
+0.429
Neg logit contributions
 tradem-0.601
 undermin-0.464
construct-0.443
urers-0.443
cler-0.441
Token participated
Token ID14888
Feature activation-5.19e-01

Pos logit contributions
theless+0.345
+0.295
 DAC+0.274
 DOI+0.269
+0.256
Neg logit contributions
 tradem-0.428
 undermin-0.338
construct-0.326
cler-0.325
urers-0.324
Token in
Token ID287
Feature activation-1.44e+00

Pos logit contributions
theless+0.424
+0.367
 DAC+0.338
 DOI+0.332
+0.319
Neg logit contributions
 tradem-0.496
 undermin-0.388
construct-0.372
urers-0.371
cler-0.370
Token a
Token ID257
Feature activation-8.51e-01

Pos logit contributions
theless+1.211
+1.056
 DAC+0.978
 DOI+0.957
+0.922
Neg logit contributions
 tradem-1.347
 undermin-1.046
urers-0.998
construct-0.994
cler-0.991
Token Q
Token ID1195
Feature activation+8.92e-02

Pos logit contributions
theless+0.735
+0.639
 DAC+0.592
 DOI+0.583
+0.560
Neg logit contributions
 tradem-0.773
 undermin-0.596
construct-0.571
urers-0.569
cler-0.568
Token&
Token ID5
Feature activation-7.48e+00

Pos logit contributions
 tradem+0.087
 undermin+0.068
construct+0.066
urers+0.065
cler+0.065
Neg logit contributions
theless-0.071
-0.061
 DAC-0.056
 DOI-0.055
-0.053
TokenA
Token ID32
Feature activation+5.88e-01

Pos logit contributions
theless+4.485
+4.088
 DAC+3.563
AI+3.528
 DOI+3.389
Neg logit contributions
 tradem-8.654
 undermin-7.025
 newcom-6.791
urers-6.598
senal-6.564
Token session
Token ID6246
Feature activation+5.05e-01

Pos logit contributions
 tradem+0.576
 undermin+0.455
construct+0.440
urers+0.437
cler+0.437
Neg logit contributions
theless-0.463
-0.392
 DAC-0.362
 DOI-0.357
-0.340
Token
Token ID21096
Feature activation-8.13e-01

Pos logit contributions
 tradem+0.503
 undermin+0.399
construct+0.386
cler+0.384
urers+0.384
Neg logit contributions
theless-0.389
-0.327
 DAC-0.304
 DOI-0.298
-0.283
Token refer
Token ID3522
Feature activation-1.39e+00

Pos logit contributions
theless+0.694
+0.603
 DAC+0.559
 DOI+0.550
+0.528
Neg logit contributions
 tradem-0.745
 undermin-0.577
construct-0.552
urers-0.551
cler-0.549
Token to
Token ID284
Feature activation-1.05e+00

Pos logit contributions
theless+0.988
+0.852
 DAC+0.767
 DOI+0.758
+0.725
Neg logit contributions
 tradem-1.463
 undermin-1.173
construct-1.136
urers-1.133
cler-1.126
Token.
Token ID13
Feature activation-2.41e-01

Pos logit contributions
 tradem+0.264
 undermin+0.209
construct+0.201
urers+0.200
cler+0.200
Neg logit contributions
theless-0.204
-0.175
 DAC-0.160
 DOI-0.157
-0.150
Token<|endoftext|>
Token ID50256
Feature activation+8.37e-01

Pos logit contributions
theless+0.188
+0.159
 DAC+0.146
 DOI+0.143
+0.137
Neg logit contributions
 tradem-0.238
 undermin-0.188
construct-0.183
urers-0.182
cler-0.182
TokenAbstract
Token ID23839
Feature activation-6.64e+00

Pos logit contributions
 tradem+0.777
 undermin+0.605
construct+0.587
cler+0.584
urers+0.582
Neg logit contributions
theless-0.702
-0.601
 DAC-0.560
 DOI-0.551
-0.525
Token
Token ID198
Feature activation-7.20e+00

Pos logit contributions
theless+4.791
+4.280
 DAC+3.905
 DOI+3.858
 µ+3.599
Neg logit contributions
 tradem-6.915
 undermin-5.473
 newcom-5.198
urers-5.094
construct-5.066
Token
Token ID198
Feature activation-6.85e+00

Pos logit contributions
theless+4.620
+4.254
 DAC+3.680
 DOI+3.679
AI+3.494
Neg logit contributions
 tradem-8.308
 undermin-6.904
 newcom-6.608
senal-6.488
 srf-6.177
TokenBackground
Token ID21756
Feature activation-7.28e+00

Pos logit contributions
theless+5.233
+4.895
 DAC+4.304
AI+4.279
 DOI+4.232
Neg logit contributions
 tradem-6.811
 undermin-5.293
 newcom-5.086
urers-4.887
senal-4.824
Token Depression
Token ID22483
Feature activation-2.38e+00

Pos logit contributions
theless+5.582
+5.091
 DAC+4.641
 DOI+4.584
 µ+4.289
Neg logit contributions
 tradem-7.289
 undermin-5.682
 newcom-5.402
urers-5.271
construct-5.214
Token presents
Token ID10969
Feature activation-2.55e+00

Pos logit contributions
theless+1.789
+1.561
 DAC+1.412
 DOI+1.391
+1.318
Neg logit contributions
 tradem-2.412
 undermin-1.911
construct-1.835
urers-1.833
 newcom-1.823
Token a
Token ID257
Feature activation-2.75e+00

Pos logit contributions
theless+2.028
+1.757
 DAC+1.612
 DOI+1.593
 detention+1.504
Neg logit contributions
 tradem-2.489
 undermin-1.947
construct-1.876
urers-1.864
 newcom-1.855
Token significant
Token ID2383
Feature activation-3.32e+00

Pos logit contributions
theless+2.379
+2.081
 DAC+1.929
 DOI+1.906
+1.815
Neg logit contributions
 tradem-2.495
 undermin-1.923
urers-1.834
construct-1.833
cler-1.814
Token burden
Token ID10538
Feature activation-1.38e+00

Pos logit contributions
theless+2.911
+2.552
 DAC+2.366
 DOI+2.343
+2.224
Neg logit contributions
 tradem-2.967
 undermin-2.272
urers-2.172
construct-2.171
 newcom-2.146
Token a
Token ID257
Feature activation-8.77e-01

Pos logit contributions
 tradem+0.886
 undermin+0.696
construct+0.670
cler+0.665
urers+0.664
Neg logit contributions
theless-0.799
-0.686
 DAC-0.642
 DOI-0.629
AI-0.603
Token start
Token ID923
Feature activation+2.64e-01

Pos logit contributions
theless+0.767
+0.670
 DAC+0.621
 DOI+0.612
+0.588
Neg logit contributions
 tradem-0.788
 undermin-0.606
construct-0.578
urers-0.577
cler-0.575
Token.
Token ID13
Feature activation-2.41e-01

Pos logit contributions
 tradem+0.264
 undermin+0.209
construct+0.201
urers+0.200
cler+0.200
Neg logit contributions
theless-0.204
-0.175
 DAC-0.160
 DOI-0.157
-0.150
Token<|endoftext|>
Token ID50256
Feature activation+8.37e-01

Pos logit contributions
theless+0.188
+0.159
 DAC+0.146
 DOI+0.143
+0.137
Neg logit contributions
 tradem-0.238
 undermin-0.188
construct-0.183
urers-0.182
cler-0.182
TokenAbstract
Token ID23839
Feature activation-6.64e+00

Pos logit contributions
 tradem+0.777
 undermin+0.605
construct+0.587
cler+0.584
urers+0.582
Neg logit contributions
theless-0.702
-0.601
 DAC-0.560
 DOI-0.551
-0.525
Token
Token ID198
Feature activation-7.20e+00

Pos logit contributions
theless+4.791
+4.280
 DAC+3.905
 DOI+3.858
 µ+3.599
Neg logit contributions
 tradem-6.915
 undermin-5.473
 newcom-5.198
urers-5.094
construct-5.066
Token
Token ID198
Feature activation-6.85e+00

Pos logit contributions
theless+4.620
+4.254
 DAC+3.680
 DOI+3.679
AI+3.494
Neg logit contributions
 tradem-8.308
 undermin-6.904
 newcom-6.608
senal-6.488
 srf-6.177
TokenBackground
Token ID21756
Feature activation-7.28e+00

Pos logit contributions
theless+5.233
+4.895
 DAC+4.304
AI+4.279
 DOI+4.232
Neg logit contributions
 tradem-6.811
 undermin-5.293
 newcom-5.086
urers-4.887
senal-4.824
Token Depression
Token ID22483
Feature activation-2.38e+00

Pos logit contributions
theless+5.582
+5.091
 DAC+4.641
 DOI+4.584
 µ+4.289
Neg logit contributions
 tradem-7.289
 undermin-5.682
 newcom-5.402
urers-5.271
construct-5.214
Token presents
Token ID10969
Feature activation-2.55e+00

Pos logit contributions
theless+1.789
+1.561
 DAC+1.412
 DOI+1.391
+1.318
Neg logit contributions
 tradem-2.412
 undermin-1.911
construct-1.835
urers-1.833
 newcom-1.823
Token a
Token ID257
Feature activation-2.75e+00

Pos logit contributions
theless+2.028
+1.757
 DAC+1.612
 DOI+1.593
 detention+1.504
Neg logit contributions
 tradem-2.489
 undermin-1.947
construct-1.876
urers-1.864
 newcom-1.855
Token start
Token ID923
Feature activation+2.64e-01

Pos logit contributions
theless+0.767
+0.670
 DAC+0.621
 DOI+0.612
+0.588
Neg logit contributions
 tradem-0.788
 undermin-0.606
construct-0.578
urers-0.577
cler-0.575
Token.
Token ID13
Feature activation-2.41e-01

Pos logit contributions
 tradem+0.264
 undermin+0.209
construct+0.201
urers+0.200
cler+0.200
Neg logit contributions
theless-0.204
-0.175
 DAC-0.160
 DOI-0.157
-0.150
Token<|endoftext|>
Token ID50256
Feature activation+8.37e-01

Pos logit contributions
theless+0.188
+0.159
 DAC+0.146
 DOI+0.143
+0.137
Neg logit contributions
 tradem-0.238
 undermin-0.188
construct-0.183
urers-0.182
cler-0.182
TokenAbstract
Token ID23839
Feature activation-6.64e+00

Pos logit contributions
 tradem+0.777
 undermin+0.605
construct+0.587
cler+0.584
urers+0.582
Neg logit contributions
theless-0.702
-0.601
 DAC-0.560
 DOI-0.551
-0.525
Token
Token ID198
Feature activation-7.20e+00

Pos logit contributions
theless+4.791
+4.280
 DAC+3.905
 DOI+3.858
 µ+3.599
Neg logit contributions
 tradem-6.915
 undermin-5.473
 newcom-5.198
urers-5.094
construct-5.066
Token
Token ID198
Feature activation-6.85e+00

Pos logit contributions
theless+4.620
+4.254
 DAC+3.680
 DOI+3.679
AI+3.494
Neg logit contributions
 tradem-8.308
 undermin-6.904
 newcom-6.608
senal-6.488
 srf-6.177
TokenBackground
Token ID21756
Feature activation-7.28e+00

Pos logit contributions
theless+5.233
+4.895
 DAC+4.304
AI+4.279
 DOI+4.232
Neg logit contributions
 tradem-6.811
 undermin-5.293
 newcom-5.086
urers-4.887
senal-4.824
Token Depression
Token ID22483
Feature activation-2.38e+00

Pos logit contributions
theless+5.582
+5.091
 DAC+4.641
 DOI+4.584
 µ+4.289
Neg logit contributions
 tradem-7.289
 undermin-5.682
 newcom-5.402
urers-5.271
construct-5.214
Token presents
Token ID10969
Feature activation-2.55e+00

Pos logit contributions
theless+1.789
+1.561
 DAC+1.412
 DOI+1.391
+1.318
Neg logit contributions
 tradem-2.412
 undermin-1.911
construct-1.835
urers-1.833
 newcom-1.823
Token a
Token ID257
Feature activation-2.75e+00

Pos logit contributions
theless+2.028
+1.757
 DAC+1.612
 DOI+1.593
 detention+1.504
Neg logit contributions
 tradem-2.489
 undermin-1.947
construct-1.876
urers-1.864
 newcom-1.855
Token significant
Token ID2383
Feature activation-3.32e+00

Pos logit contributions
theless+2.379
+2.081
 DAC+1.929
 DOI+1.906
+1.815
Neg logit contributions
 tradem-2.495
 undermin-1.923
urers-1.834
construct-1.833
cler-1.814
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
theless+0.910
+0.793
 DAC+0.729
 DOI+0.716
+0.684
Neg logit contributions
 tradem-1.060
 undermin-0.829
construct-0.792
urers-0.792
cler-0.789
Token
Token ID198
Feature activation-2.21e+00

Pos logit contributions
theless+1.325
+1.136
 DAC+1.044
 DOI+1.026
+0.983
Neg logit contributions
 tradem-1.618
 undermin-1.274
construct-1.226
urers-1.223
cler-1.220
Token
Token ID198
Feature activation-2.13e+00

Pos logit contributions
theless+1.721
+1.458
 DAC+1.343
 DOI+1.317
+1.257
Neg logit contributions
 tradem-2.182
 undermin-1.722
construct-1.677
cler-1.669
urers-1.668
TokenMaterials
Token ID41657
Feature activation-5.21e+00

Pos logit contributions
theless+1.872
+1.641
 DAC+1.520
 DOI+1.496
+1.441
Neg logit contributions
 tradem-1.902
 undermin-1.460
construct-1.387
urers-1.387
cler-1.380
Token and
Token ID290
Feature activation-1.97e+00

Pos logit contributions
theless+3.786
+3.299
 DAC+3.040
 DOI+2.921
 µ+2.767
Neg logit contributions
 tradem-5.408
 undermin-4.287
 newcom-4.107
urers-4.105
construct-4.097
Token Methods
Token ID25458
Feature activation-6.84e+00

Pos logit contributions
theless+1.547
+1.335
 DAC+1.231
 DOI+1.208
+1.149
Neg logit contributions
 tradem-1.950
 undermin-1.535
urers-1.476
construct-1.465
cler-1.458
Token Experimental
Token ID32286
Feature activation-3.93e+00

Pos logit contributions
theless+5.231
+4.649
 DAC+4.338
 DOI+4.214
 µ+3.960
Neg logit contributions
 tradem-6.878
 undermin-5.396
 newcom-5.085
urers-5.081
construct-5.020
Token subjects
Token ID7481
Feature activation-2.45e+00

Pos logit contributions
theless+3.064
+2.679
 DAC+2.471
 DOI+2.413
 detention+2.294
Neg logit contributions
 tradem-3.871
 undermin-3.052
urers-2.917
cler-2.908
construct-2.899
Token Three
Token ID7683
Feature activation-4.80e+00

Pos logit contributions
theless+1.906
+1.652
 DAC+1.514
 DOI+1.481
AI+1.408
Neg logit contributions
 tradem-2.432
 undermin-1.918
construct-1.831
urers-1.830
cler-1.820
Token female
Token ID4048
Feature activation-3.95e-01

Pos logit contributions
theless+3.788
+3.357
 DAC+3.056
 DOI+2.992
 detention+2.846
Neg logit contributions
 tradem-4.661
 undermin-3.675
construct-3.480
urers-3.456
cler-3.442
Token r
Token ID374
Feature activation-4.39e+00

Pos logit contributions
theless+0.321
+0.275
 DAC+0.254
 DOI+0.250
+0.240
Neg logit contributions
 tradem-0.378
 undermin-0.296
construct-0.284
urers-0.284
cler-0.284
Token and
Token ID290
Feature activation+2.48e-01

Pos logit contributions
theless+0.225
+0.194
 DAC+0.179
 DOI+0.176
+0.169
Neg logit contributions
 tradem-0.257
 undermin-0.200
construct-0.192
urers-0.192
cler-0.191
Token sham
Token ID23565
Feature activation-3.29e+00

Pos logit contributions
 tradem+0.238
 undermin+0.187
construct+0.179
urers+0.178
cler+0.178
Neg logit contributions
theless-0.201
-0.173
 DAC-0.160
 DOI-0.157
-0.150
Token ta
Token ID20486
Feature activation-3.07e+00

Pos logit contributions
theless+2.737
+2.402
 DAC+2.211
 DOI+2.173
 detention+2.060
Neg logit contributions
 tradem-3.097
 undermin-2.397
construct-2.296
urers-2.290
cler-2.285
TokenV
Token ID53
Feature activation-7.76e-01

Pos logit contributions
theless+2.714
+2.391
 DAC+2.214
 DOI+2.181
+2.099
Neg logit contributions
 tradem-2.719
 undermin-2.082
construct-1.974
urers-1.969
cler-1.964
TokenNS
Token ID8035
Feature activation-1.56e+00

Pos logit contributions
theless+0.659
+0.564
 DAC+0.526
 DOI+0.517
+0.496
Neg logit contributions
 tradem-0.713
 undermin-0.551
construct-0.536
urers-0.535
cler-0.534
Token groups
Token ID2628
Feature activation-6.78e+00

Pos logit contributions
theless+1.242
+1.069
 DAC+0.982
 DOI+0.965
+0.922
Neg logit contributions
 tradem-1.531
 undermin-1.206
construct-1.157
urers-1.154
cler-1.151
Token at
Token ID379
Feature activation-2.96e+00

Pos logit contributions
theless+4.798
+4.390
 DAC+3.818
 DOI+3.750
 µ+3.558
Neg logit contributions
 tradem-7.002
 undermin-5.574
 newcom-5.341
urers-5.286
construct-5.283
Token week
Token ID1285
Feature activation-1.86e+00

Pos logit contributions
theless+2.267
+1.963
 DAC+1.800
 DOI+1.765
+1.665
Neg logit contributions
 tradem-2.957
 undermin-2.342
construct-2.251
urers-2.240
cler-2.238
Token 4
Token ID604
Feature activation-1.69e-01

Pos logit contributions
theless+1.466
+1.258
 DAC+1.157
 DOI+1.136
+1.086
Neg logit contributions
 tradem-1.825
 undermin-1.441
construct-1.383
urers-1.380
cler-1.376
Token;
Token ID26
Feature activation-1.87e+00

Pos logit contributions
theless+0.131
+0.111
 DAC+0.103
 DOI+0.101
+0.097
Neg logit contributions
 tradem-0.169
 undermin-0.134
construct-0.129
urers-0.128
cler-0.128
Token (
Token ID357
Feature activation-2.72e+00

Pos logit contributions
theless+1.389
+1.195
 DAC+1.084
 DOI+1.062
AI+0.999
Neg logit contributions
 tradem-1.928
 undermin-1.534
urers-1.469
construct-1.466
cler-1.465
Token active
Token ID4075
Feature activation-2.04e+00

Pos logit contributions
theless+2.197
+1.851
 DAC+1.694
 DOI+1.657
+1.580
Neg logit contributions
 tradem-3.130
 undermin-2.509
construct-2.418
urers-2.413
cler-2.408
Token CR
Token ID8740
Feature activation-1.87e+00

Pos logit contributions
theless+1.615
+1.385
 DAC+1.273
 DOI+1.249
+1.197
Neg logit contributions
 tradem-1.993
 undermin-1.572
construct-1.508
urers-1.503
cler-1.502
TokenF
Token ID37
Feature activation-1.93e+00

Pos logit contributions
theless+1.755
+1.513
 DAC+1.424
 DOI+1.396
+1.352
Neg logit contributions
 tradem-1.549
 undermin-1.160
construct-1.131
cler-1.124
urers-1.114
Token1
Token ID16
Feature activation-2.24e+00

Pos logit contributions
theless+1.590
+1.371
 DAC+1.266
 DOI+1.243
+1.193
Neg logit contributions
 tradem-1.833
 undermin-1.433
construct-1.373
urers-1.369
cler-1.367
Token receptor
Token ID17408
Feature activation-3.23e+00

Pos logit contributions
theless+1.773
+1.532
 DAC+1.405
 DOI+1.382
+1.318
Neg logit contributions
 tradem-2.190
 undermin-1.725
urers-1.653
construct-1.653
cler-1.646
Token antagonist
Token ID34667
Feature activation-6.68e+00

Pos logit contributions
theless+2.679
+2.382
 DAC+2.179
 DOI+2.160
+2.038
Neg logit contributions
 tradem-3.014
 undermin-2.346
urers-2.262
construct-2.241
cler-2.227
Token.
Token ID13
Feature activation-2.68e+00

Pos logit contributions
theless+4.423
+4.027
 DOI+3.562
 DAC+3.467
 µ+3.291
Neg logit contributions
 tradem-7.114
 undermin-5.756
urers-5.557
construct-5.514
 newcom-5.467
Token Oral
Token ID42222
Feature activation-2.46e+00

Pos logit contributions
theless+2.161
+1.869
 DAC+1.721
 DOI+1.693
+1.615
Neg logit contributions
 tradem-2.597
 undermin-2.041
construct-1.947
urers-1.947
cler-1.938
Token administration
Token ID3662
Feature activation-4.59e+00

Pos logit contributions
theless+1.896
+1.638
 DAC+1.495
 DOI+1.479
+1.392
Neg logit contributions
 tradem-2.461
 undermin-1.949
construct-1.872
urers-1.865
cler-1.860
Token of
Token ID286
Feature activation-2.25e+00

Pos logit contributions
theless+3.503
+3.108
 DOI+2.747
 DAC+2.741
 µ+2.584
Neg logit contributions
 tradem-4.625
 undermin-3.645
construct-3.482
urers-3.481
 newcom-3.439
Token a
Token ID257
Feature activation-1.92e+00

Pos logit contributions
theless+1.835
+1.587
 DAC+1.463
 DOI+1.441
+1.376
Neg logit contributions
 tradem-2.146
 undermin-1.681
construct-1.611
urers-1.607
cler-1.601
Token in
Token ID287
Feature activation+1.82e-01

Pos logit contributions
 tradem+2.153
 undermin+1.657
construct+1.592
cler+1.585
urers+1.580
Neg logit contributions
theless-2.067
-1.775
 DAC-1.648
 DOI-1.615
-1.565
Token 33
Token ID4747
Feature activation+9.70e-01

Pos logit contributions
 tradem+0.170
 undermin+0.132
construct+0.126
urers+0.126
cler+0.126
Neg logit contributions
theless-0.153
-0.132
 DAC-0.122
 DOI-0.120
-0.116
Token cities
Token ID4736
Feature activation-1.78e+00

Pos logit contributions
 tradem+0.643
 undermin+0.441
construct+0.414
urers+0.412
cler+0.411
Neg logit contributions
theless-1.074
-0.960
 DAC-0.908
 DOI-0.896
-0.874
Token around
Token ID1088
Feature activation-2.91e+00

Pos logit contributions
theless+1.330
+1.146
 DAC+1.039
 DOI+1.027
+0.973
Neg logit contributions
 tradem-1.826
 undermin-1.462
construct-1.394
 newcom-1.388
urers-1.387
Token the
Token ID262
Feature activation-1.86e+00

Pos logit contributions
theless+2.470
+2.170
 DAC+2.011
 DOI+1.985
+1.889
Neg logit contributions
 tradem-2.683
 undermin-2.095
construct-1.976
urers-1.974
 newcom-1.973
Token world
Token ID995
Feature activation-6.66e+00

Pos logit contributions
theless+1.749
+1.557
 DAC+1.456
 DOI+1.437
+1.376
Neg logit contributions
 tradem-1.537
 undermin-1.156
construct-1.090
urers-1.089
cler-1.083
Token.
Token ID13
Feature activation+1.84e-01

Pos logit contributions
theless+4.525
+4.050
 DOI+3.574
 DAC+3.543
AI+3.316
Neg logit contributions
 tradem-6.979
 undermin-5.652
 newcom-5.405
urers-5.338
construct-5.328
Token (
Token ID357
Feature activation+9.55e-01

Pos logit contributions
 tradem+0.182
 undermin+0.144
construct+0.139
urers+0.138
cler+0.138
Neg logit contributions
theless-0.144
-0.123
 DAC-0.113
 DOI-0.111
-0.106
TokenWa
Token ID33484
Feature activation-4.21e-01

Pos logit contributions
 tradem+0.956
 undermin+0.758
construct+0.731
urers+0.728
cler+0.728
Neg logit contributions
theless-0.735
-0.625
 DAC-0.575
 DOI-0.564
-0.537
Tokenub
Token ID549
Feature activation+3.67e+00

Pos logit contributions
theless+0.430
+0.371
 DAC+0.356
 DOI+0.349
+0.338
Neg logit contributions
 tradem-0.315
cler-0.226
urers-0.225
construct-0.224
 undermin-0.222
Tokenges
Token ID3212
Feature activation+7.84e-01

Pos logit contributions
 tradem+3.197
cler+2.576
urers+2.575
construct+2.567
 spoilers+2.502
Neg logit contributions
theless-3.059
-2.595
 DAC-2.443
 DOI-2.368
-2.302
Token 4
Token ID604
Feature activation-5.95e-02

Pos logit contributions
 tradem+0.580
 undermin+0.456
construct+0.437
urers+0.436
cler+0.435
Neg logit contributions
theless-0.483
-0.415
 DAC-0.383
 DOI-0.376
-0.360
Token64
Token ID2414
Feature activation+8.30e-01

Pos logit contributions
theless+0.048
+0.042
 DAC+0.039
 DOI+0.038
+0.036
Neg logit contributions
 tradem-0.057
 undermin-0.045
construct-0.043
urers-0.043
cler-0.043
Token406
Token ID29703
Feature activation-2.58e-01

Pos logit contributions
 tradem+0.824
 undermin+0.652
construct+0.630
urers+0.627
cler+0.627
Neg logit contributions
theless-0.644
-0.547
 DAC-0.504
 DOI-0.493
-0.472
Token Mem
Token ID4942
Feature activation+1.13e+00

Pos logit contributions
theless+0.206
+0.178
 DAC+0.163
 DOI+0.161
+0.153
Neg logit contributions
 tradem-0.252
 undermin-0.198
construct-0.190
urers-0.189
cler-0.189
Tokenc
Token ID66
Feature activation-2.07e+00

Pos logit contributions
 tradem+1.170
 undermin+0.936
construct+0.903
cler+0.901
urers+0.901
Neg logit contributions
theless-0.830
-0.699
 DAC-0.640
 DOI-0.626
-0.596
Tokenpy
Token ID9078
Feature activation-6.66e+00

Pos logit contributions
theless+1.926
+1.698
 DAC+1.581
 DOI+1.559
+1.503
Neg logit contributions
 tradem-1.743
 undermin-1.316
construct-1.244
urers-1.242
cler-1.236
TokenReading
Token ID36120
Feature activation-2.25e+00

Pos logit contributions
theless+5.052
+4.631
 DAC+4.118
 DOI+4.054
AI+3.938
Neg logit contributions
 tradem-6.529
 undermin-5.185
 newcom-4.923
 spoilers-4.830
urers-4.815
Token 55
Token ID5996
Feature activation+3.26e+00

Pos logit contributions
theless+1.824
+1.600
 DAC+1.473
 DOI+1.442
AI+1.381
Neg logit contributions
 tradem-2.156
 undermin-1.701
urers-1.610
construct-1.607
cler-1.596
Token80
Token ID1795
Feature activation+2.50e+00

Pos logit contributions
 tradem+3.061
 undermin+2.400
construct+2.353
cler+2.331
urers+2.321
Neg logit contributions
theless-2.653
-2.211
 DAC-2.063
 DOI-2.008
-1.937
Token953
Token ID49649
Feature activation+1.16e-01

Pos logit contributions
 tradem+2.305
 undermin+1.786
construct+1.744
cler+1.734
urers+1.724
Neg logit contributions
theless-2.091
-1.776
 DAC-1.655
 DOI-1.615
-1.560
Token 16
Token ID1467
Feature activation+2.96e+00

Pos logit contributions
 tradem+0.110
 undermin+0.086
construct+0.082
urers+0.082
cler+0.082
Neg logit contributions
theless-0.095
-0.082
 DAC-0.076
 DOI-0.075
-0.072
Token 500
Token ID5323
Feature activation-2.31e+00

Pos logit contributions
theless+0.124
+0.106
 DAC+0.098
 DOI+0.096
+0.092
Neg logit contributions
 tradem-0.151
 undermin-0.119
construct-0.115
urers-0.114
cler-0.114
Token million
Token ID1510
Feature activation-1.54e+00

Pos logit contributions
theless+1.368
+1.155
 DAC+1.012
 DOI+0.988
AI+0.930
Neg logit contributions
 tradem-2.683
 undermin-2.224
construct-2.151
 newcom-2.138
urers-2.136
Token members
Token ID1866
Feature activation-3.90e+00

Pos logit contributions
theless+1.401
+1.237
 DAC+1.149
 DOI+1.134
+1.089
Neg logit contributions
 tradem-1.330
 undermin-1.012
construct-0.961
urers-0.961
cler-0.954
Token around
Token ID1088
Feature activation-4.01e+00

Pos logit contributions
theless+2.506
+2.165
 DAC+1.903
 DOI+1.893
AI+1.780
Neg logit contributions
 tradem-4.321
 undermin-3.554
construct-3.396
 newcom-3.378
urers-3.360
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
theless+3.224
+2.857
 DAC+2.647
 DOI+2.600
 detention+2.466
Neg logit contributions
 tradem-3.792
 undermin-3.034
 newcom-2.884
urers-2.852
construct-2.844
Token world
Token ID995
Feature activation-6.64e+00

Pos logit contributions
theless+1.850
+1.639
 DAC+1.534
 DOI+1.512
+1.457
Neg logit contributions
 tradem-1.572
 undermin-1.173
construct-1.110
urers-1.107
cler-1.103
Token,
Token ID11
Feature activation-2.29e+00

Pos logit contributions
theless+3.799
+3.303
 DAC+2.787
 DOI+2.777
 detention+2.637
Neg logit contributions
 tradem-7.543
 undermin-6.280
construct-6.101
 newcom-6.018
urers-5.956
Token has
Token ID468
Feature activation-1.14e+00

Pos logit contributions
theless+1.761
+1.511
 DAC+1.381
 DOI+1.358
+1.289
Neg logit contributions
 tradem-2.303
 undermin-1.830
construct-1.751
 newcom-1.739
urers-1.738
Token relied
Token ID17498
Feature activation+1.57e-01

Pos logit contributions
theless+0.884
+0.756
 DAC+0.693
 DOI+0.680
+0.650
Neg logit contributions
 tradem-1.142
 undermin-0.905
construct-0.870
urers-0.868
cler-0.866
Token on
Token ID319
Feature activation+8.91e-02

Pos logit contributions
 tradem+0.145
 undermin+0.112
construct+0.107
urers+0.107
cler+0.107
Neg logit contributions
theless-0.133
-0.116
 DAC-0.107
 DOI-0.105
-0.101
Token its
Token ID663
Feature activation-9.50e-01

Pos logit contributions
 tradem+0.080
 undermin+0.061
construct+0.059
urers+0.059
cler+0.058
Neg logit contributions
theless-0.078
-0.068
 DAC-0.063
 DOI-0.062
-0.060
Tokens
Token ID82
Feature activation+9.57e-01

Pos logit contributions
 tradem+0.268
 undermin+0.193
construct+0.185
urers+0.184
cler+0.183
Neg logit contributions
theless-0.370
-0.329
 DAC-0.311
 DOI-0.306
-0.296
Token a
Token ID257
Feature activation-8.77e-01

Pos logit contributions
 tradem+0.886
 undermin+0.696
construct+0.670
cler+0.665
urers+0.664
Neg logit contributions
theless-0.799
-0.686
 DAC-0.642
 DOI-0.629
AI-0.603
Token start
Token ID923
Feature activation+2.64e-01

Pos logit contributions
theless+0.767
+0.670
 DAC+0.621
 DOI+0.612
+0.588
Neg logit contributions
 tradem-0.788
 undermin-0.606
construct-0.578
urers-0.577
cler-0.575
Token.
Token ID13
Feature activation-2.41e-01

Pos logit contributions
 tradem+0.264
 undermin+0.209
construct+0.201
urers+0.200
cler+0.200
Neg logit contributions
theless-0.204
-0.175
 DAC-0.160
 DOI-0.157
-0.150
Token<|endoftext|>
Token ID50256
Feature activation+8.37e-01

Pos logit contributions
theless+0.188
+0.159
 DAC+0.146
 DOI+0.143
+0.137
Neg logit contributions
 tradem-0.238
 undermin-0.188
construct-0.183
urers-0.182
cler-0.182
TokenAbstract
Token ID23839
Feature activation-6.64e+00

Pos logit contributions
 tradem+0.777
 undermin+0.605
construct+0.587
cler+0.584
urers+0.582
Neg logit contributions
theless-0.702
-0.601
 DAC-0.560
 DOI-0.551
-0.525
Token
Token ID198
Feature activation-7.20e+00

Pos logit contributions
theless+4.791
+4.280
 DAC+3.905
 DOI+3.858
 µ+3.599
Neg logit contributions
 tradem-6.915
 undermin-5.473
 newcom-5.198
urers-5.094
construct-5.066
Token
Token ID198
Feature activation-6.85e+00

Pos logit contributions
theless+4.620
+4.254
 DAC+3.680
 DOI+3.679
AI+3.494
Neg logit contributions
 tradem-8.308
 undermin-6.904
 newcom-6.608
senal-6.488
 srf-6.177
TokenBackground
Token ID21756
Feature activation-7.28e+00

Pos logit contributions
theless+5.233
+4.895
 DAC+4.304
AI+4.279
 DOI+4.232
Neg logit contributions
 tradem-6.811
 undermin-5.293
 newcom-5.086
urers-4.887
senal-4.824
Token Depression
Token ID22483
Feature activation-2.38e+00

Pos logit contributions
theless+5.582
+5.091
 DAC+4.641
 DOI+4.584
 µ+4.289
Neg logit contributions
 tradem-7.289
 undermin-5.682
 newcom-5.402
urers-5.271
construct-5.214
Token presents
Token ID10969
Feature activation-2.55e+00

Pos logit contributions
theless+1.789
+1.561
 DAC+1.412
 DOI+1.391
+1.318
Neg logit contributions
 tradem-2.412
 undermin-1.911
construct-1.835
urers-1.833
 newcom-1.823
Token switching
Token ID15430
Feature activation+2.33e-01

Pos logit contributions
theless+1.072
+0.936
 DAC+0.866
 DOI+0.851
+0.819
Neg logit contributions
 tradem-1.154
 undermin-0.892
urers-0.853
construct-0.851
cler-0.847
Token from
Token ID422
Feature activation-8.10e-01

Pos logit contributions
 tradem+0.217
 undermin+0.168
construct+0.161
urers+0.161
cler+0.160
Neg logit contributions
theless-0.196
-0.170
 DAC-0.158
 DOI-0.155
-0.148
Token a
Token ID257
Feature activation-1.20e+00

Pos logit contributions
theless+0.646
+0.560
 DAC+0.515
 DOI+0.505
+0.481
Neg logit contributions
 tradem-0.788
 undermin-0.620
construct-0.592
urers-0.591
cler-0.591
Token three
Token ID1115
Feature activation-3.14e+00

Pos logit contributions
theless+0.968
+0.838
 DAC+0.772
 DOI+0.760
+0.722
Neg logit contributions
 tradem-1.149
 undermin-0.904
construct-0.860
urers-0.859
cler-0.856
Token-
Token ID12
Feature activation-2.97e+00

Pos logit contributions
theless+1.772
+1.518
 DAC+1.281
 DOI+1.259
AI+1.177
Neg logit contributions
 tradem-3.733
 undermin-3.082
urers-2.966
construct-2.958
 newcom-2.947
Tokenman
Token ID805
Feature activation-6.62e+00

Pos logit contributions
theless+1.657
+1.360
 DAC+1.146
 DOI+1.116
AI+1.079
Neg logit contributions
 tradem-3.621
 undermin-3.008
 newcom-2.872
urers-2.865
 spoilers-2.840
Token defence
Token ID9366
Feature activation-3.69e+00

Pos logit contributions
theless+5.400
+4.840
 DAC+4.404
 DOI+4.358
 detention+4.204
Neg logit contributions
 tradem-6.142
 undermin-4.769
urers-4.482
 newcom-4.481
cler-4.452
Token to
Token ID284
Feature activation+1.10e+00

Pos logit contributions
theless+2.691
+2.355
 DAC+2.094
 DOI+2.077
+1.965
Neg logit contributions
 tradem-3.806
 undermin-3.048
urers-2.903
 newcom-2.898
construct-2.897
Token a
Token ID257
Feature activation+7.94e-01

Pos logit contributions
 tradem+1.013
 undermin+0.784
construct+0.750
urers+0.748
cler+0.746
Neg logit contributions
theless-0.942
-0.818
 DAC-0.758
 DOI-0.745
-0.716
Token back
Token ID736
Feature activation-1.22e+00

Pos logit contributions
 tradem+0.714
 undermin+0.550
construct+0.527
urers+0.525
cler+0.524
Neg logit contributions
theless-0.692
-0.601
 DAC-0.558
 DOI-0.549
-0.529
Token four
Token ID1440
Feature activation-8.65e-01

Pos logit contributions
theless+1.084
+0.958
 DAC+0.886
 DOI+0.873
+0.840
Neg logit contributions
 tradem-1.078
 undermin-0.828
urers-0.783
construct-0.783
cler-0.778
Token possibly
Token ID5457
Feature activation-1.65e+00

Pos logit contributions
 tradem+1.163
 undermin+0.910
construct+0.871
urers+0.868
cler+0.866
Neg logit contributions
theless-1.037
-0.891
 DAC-0.820
 DOI-0.809
-0.775
Token the
Token ID262
Feature activation+4.75e-02

Pos logit contributions
theless+1.373
+1.208
 DAC+1.123
 DOI+1.096
+1.053
Neg logit contributions
 tradem-1.534
 undermin-1.182
construct-1.140
urers-1.132
cler-1.129
Token greatest
Token ID6000
Feature activation+1.07e+00

Pos logit contributions
 tradem+0.042
 undermin+0.033
construct+0.031
urers+0.031
cler+0.031
Neg logit contributions
theless-0.042
-0.036
 DAC-0.034
 DOI-0.033
-0.032
Token Spanish
Token ID7897
Feature activation-1.10e+00

Pos logit contributions
 tradem+0.915
 undermin+0.698
construct+0.664
cler+0.659
urers+0.657
Neg logit contributions
theless-0.985
-0.861
 DAC-0.801
 DOI-0.789
-0.765
Token-
Token ID12
Feature activation-8.71e-01

Pos logit contributions
theless+1.020
+0.897
 DAC+0.836
 DOI+0.824
+0.794
Neg logit contributions
 tradem-0.938
 undermin-0.709
construct-0.674
urers-0.672
cler-0.670
Tokenlanguage
Token ID16129
Feature activation-6.55e+00

Pos logit contributions
theless+0.761
+0.667
 DAC+0.617
 DOI+0.606
+0.585
Neg logit contributions
 tradem-0.781
 undermin-0.601
urers-0.571
construct-0.569
cler-0.566
Token writer
Token ID6260
Feature activation-5.12e-01

Pos logit contributions
theless+5.849
+5.242
 DAC+4.901
 DOI+4.785
 detainee+4.683
Neg logit contributions
 tradem-5.541
 undermin-4.194
urers-4.046
cler-3.938
 newcom-3.930
Token of
Token ID286
Feature activation-1.61e+00

Pos logit contributions
theless+0.413
+0.357
 DAC+0.329
 DOI+0.322
+0.309
Neg logit contributions
 tradem-0.496
 undermin-0.389
construct-0.372
urers-0.372
cler-0.370
Token the
Token ID262
Feature activation-1.33e+00

Pos logit contributions
theless+1.322
+1.155
 DAC+1.070
 DOI+1.044
+1.001
Neg logit contributions
 tradem-1.531
 undermin-1.197
construct-1.146
urers-1.142
cler-1.135
Token 20
Token ID1160
Feature activation-6.63e-01

Pos logit contributions
theless+1.033
+0.890
 DAC+0.816
 DOI+0.802
+0.764
Neg logit contributions
 tradem-1.328
 undermin-1.050
construct-1.005
urers-1.005
cler-1.000
Tokenth
Token ID400
Feature activation-1.29e+00

Pos logit contributions
theless+0.556
+0.485
 DAC+0.447
 DOI+0.439
+0.421
Neg logit contributions
 tradem-0.619
 undermin-0.482
construct-0.460
urers-0.459
cler-0.457
Tokenlla
Token ID8466
Feature activation-6.28e-02

Pos logit contributions
theless+0.665
+0.463
 DAC+0.404
 DOI+0.379
+0.333
Neg logit contributions
 tradem-2.093
cler-1.774
 undermin-1.770
construct-1.767
urers-1.755
Token-
Token ID12
Feature activation-1.40e+00

Pos logit contributions
theless+0.044
+0.037
 DAC+0.033
 DOI+0.033
+0.031
Neg logit contributions
 tradem-0.067
 undermin-0.054
construct-0.052
urers-0.052
cler-0.052
Tokensan
Token ID12807
Feature activation-2.89e+00

Pos logit contributions
theless+1.206
+1.052
 DAC+0.973
 DOI+0.956
+0.924
Neg logit contributions
 tradem-1.268
 undermin-0.980
construct-0.931
urers-0.930
cler-0.925
Token collapsed
Token ID14707
Feature activation-3.16e-01

Pos logit contributions
theless+1.961
+1.657
 DAC+1.473
 DOI+1.449
+1.392
Neg logit contributions
 tradem-3.166
 undermin-2.572
construct-2.463
urers-2.463
cler-2.448
Token?」
Token ID42943
Feature activation-5.90e+00

Pos logit contributions
theless+0.253
+0.216
 DAC+0.200
 DOI+0.196
AI+0.187
Neg logit contributions
 tradem-0.307
 undermin-0.242
construct-0.232
cler-0.232
urers-0.232
Token
Token ID198
Feature activation-6.55e+00

Pos logit contributions
theless+4.002
+3.470
 DAC+3.017
 DOI+2.992
+2.968
Neg logit contributions
 tradem-6.378
 undermin-5.234
construct-4.915
urers-4.904
 newcom-4.885
Token
Token ID198
Feature activation-6.14e+00

Pos logit contributions
theless+4.249
+3.905
 DAC+3.383
 DOI+3.381
+3.223
Neg logit contributions
 tradem-7.530
 undermin-6.289
 newcom-5.983
senal-5.880
 srf-5.598
Token"
Token ID1
Feature activation-6.34e+00

Pos logit contributions
theless+4.475
+4.137
+3.606
 DAC+3.535
AI+3.533
Neg logit contributions
 tradem-6.413
 undermin-5.126
 newcom-4.932
senal-4.715
urers-4.675
TokenYes
Token ID5297
Feature activation-4.17e+00

Pos logit contributions
theless+4.556
+4.092
 DAC+3.614
 DOI+3.549
+3.541
Neg logit contributions
 tradem-6.711
 undermin-5.360
 newcom-5.139
urers-4.980
senal-4.907
Token.
Token ID13
Feature activation-1.87e+00

Pos logit contributions
theless+2.819
+2.397
 DAC+2.121
 DOI+2.091
+2.025
Neg logit contributions
 tradem-4.539
 undermin-3.687
urers-3.533
construct-3.513
 newcom-3.500
Token At
Token ID1629
Feature activation-1.59e+00

Pos logit contributions
theless+1.464
+1.254
 DAC+1.152
 DOI+1.131
+1.080
Neg logit contributions
 tradem-1.838
 undermin-1.453
construct-1.397
urers-1.393
cler-1.391
Token N
Token ID399
Feature activation-5.56e-01

Pos logit contributions
theless+1.573
+1.354
 DAC+1.245
 DOI+1.226
+1.169
Neg logit contributions
 tradem-1.931
 undermin-1.521
construct-1.459
urers-1.457
cler-1.451
Token-
Token ID12
Feature activation-7.55e+00

Pos logit contributions
theless+0.385
+0.316
 DAC+0.291
 DOI+0.283
+0.269
Neg logit contributions
 tradem-0.600
 undermin-0.485
construct-0.472
cler-0.471
urers-0.470
Tokencyclop
Token ID22873
Feature activation-2.13e+00

Pos logit contributions
theless+5.587
+4.800
 DOI+4.269
 DAC+4.191
AI+4.066
Neg logit contributions
 tradem-7.712
 undermin-6.202
urers-5.815
 newcom-5.748
 spoilers-5.746
Tokenrop
Token ID1773
Feature activation-2.84e+00

Pos logit contributions
theless+2.223
+1.992
 DAC+1.866
 DOI+1.848
AI+1.786
Neg logit contributions
 tradem-1.544
 undermin-1.109
urers-1.032
construct-1.030
cler-1.019
Tokenyl
Token ID2645
Feature activation-3.67e+00

Pos logit contributions
theless+1.956
+1.639
 DAC+1.482
 DOI+1.451
+1.374
Neg logit contributions
 tradem-3.083
 undermin-2.493
construct-2.403
urers-2.398
cler-2.393
Tokenmethyl
Token ID43654
Feature activation-6.45e+00

Pos logit contributions
theless+2.841
+2.461
 DAC+2.235
 DOI+2.208
AI+2.081
Neg logit contributions
 tradem-3.647
 undermin-2.905
urers-2.777
construct-2.764
 newcom-2.742
Token-
Token ID12
Feature activation-3.75e+00

Pos logit contributions
theless+4.868
+4.247
 DAC+3.834
 DOI+3.834
 DDR+3.550
Neg logit contributions
 tradem-6.440
 undermin-5.220
urers-4.933
construct-4.825
cler-4.819
Token7
Token ID22
Feature activation-2.10e+00

Pos logit contributions
theless+2.845
+2.477
 DAC+2.237
 DOI+2.209
AI+2.097
Neg logit contributions
 tradem-3.799
 undermin-3.042
urers-2.879
construct-2.865
 newcom-2.864
Token-(
Token ID30420
Feature activation-2.30e+00

Pos logit contributions
theless+1.288
+1.045
 DAC+0.936
 DOI+0.910
+0.858
Neg logit contributions
 tradem-2.427
 undermin-1.991
construct-1.931
urers-1.928
cler-1.924
Token2
Token ID17
Feature activation-5.05e+00

Pos logit contributions
theless+1.685
+1.426
 DAC+1.302
 DOI+1.275
+1.214
Neg logit contributions
 tradem-2.389
 undermin-1.912
construct-1.843
urers-1.838
cler-1.835
Token,
Token ID11
Feature activation-1.04e+00

Pos logit contributions
theless+3.132
+2.650
 DAC+2.315
 DOI+2.291
AI+2.142
Neg logit contributions
 tradem-5.768
 undermin-4.757
construct-4.566
urers-4.561
cler-4.550
Token8
Token ID23
Feature activation-1.06e+00

Pos logit contributions
theless+2.705
+2.367
 DAC+2.158
 DOI+2.127
+2.037
Neg logit contributions
 tradem-3.353
 undermin-2.650
urers-2.503
 newcom-2.499
construct-2.492
Token (
Token ID357
Feature activation-2.98e+00

Pos logit contributions
theless+0.840
+0.716
 DAC+0.662
 DOI+0.648
+0.621
Neg logit contributions
 tradem-1.041
 undermin-0.822
construct-0.790
cler-0.788
urers-0.788
Token3
Token ID18
Feature activation-4.05e+00

Pos logit contributions
theless+2.254
+1.966
 DAC+1.764
 DOI+1.746
AI+1.671
Neg logit contributions
 tradem-3.030
 undermin-2.407
urers-2.286
 newcom-2.274
construct-2.273
Token or
Token ID393
Feature activation-1.63e+00

Pos logit contributions
theless+2.777
+2.436
 DAC+2.157
 DOI+2.117
 µ+2.026
Neg logit contributions
 tradem-4.373
 undermin-3.508
construct-3.390
urers-3.371
 newcom-3.357
Token 10
Token ID838
Feature activation-5.26e+00

Pos logit contributions
theless+1.336
+1.155
 DAC+1.066
 DOI+1.048
+1.003
Neg logit contributions
 tradem-1.547
 undermin-1.211
construct-1.159
urers-1.156
cler-1.153
Tokenmg
Token ID11296
Feature activation-6.45e+00

Pos logit contributions
theless+3.336
+2.910
 DAC+2.525
 DOI+2.481
 µ+2.400
Neg logit contributions
 tradem-5.960
 undermin-4.809
construct-4.675
urers-4.655
cler-4.621
Token/
Token ID14
Feature activation-5.68e+00

Pos logit contributions
theless+4.389
+3.927
 DAC+3.412
 DOI+3.404
 µ+3.205
Neg logit contributions
 tradem-6.924
 undermin-5.564
urers-5.373
construct-5.337
cler-5.276
Tokenkg
Token ID10025
Feature activation-5.68e+00

Pos logit contributions
theless+2.357
+1.878
 DAC+1.443
 DOI+1.396
AI+1.304
Neg logit contributions
 tradem-7.723
 undermin-6.479
 newcom-6.264
urers-6.242
 spoilers-6.227
Token),
Token ID828
Feature activation-2.73e+00

Pos logit contributions
theless+3.997
+3.573
 DAC+3.114
 DOI+3.106
 µ+2.906
Neg logit contributions
 tradem-6.013
 undermin-4.827
urers-4.635
construct-4.613
cler-4.552
Token but
Token ID475
Feature activation-2.08e+00

Pos logit contributions
theless+2.205
+1.925
 DAC+1.758
 DOI+1.739
+1.650
Neg logit contributions
 tradem-2.629
 undermin-2.058
construct-1.969
urers-1.965
cler-1.955
Token not
Token ID407
Feature activation-2.44e+00

Pos logit contributions
theless+1.649
+1.428
 DAC+1.307
 DOI+1.288
+1.230
Neg logit contributions
 tradem-2.024
 undermin-1.598
construct-1.529
urers-1.523
cler-1.516
Token it
Token ID340
Feature activation-1.69e+00

Pos logit contributions
theless+1.047
+0.900
 DAC+0.828
 DOI+0.814
+0.773
Neg logit contributions
 tradem-1.347
 undermin-1.065
construct-1.024
urers-1.017
cler-1.016
Token means
Token ID1724
Feature activation-2.47e+00

Pos logit contributions
theless+1.432
+1.245
 DAC+1.141
 DOI+1.130
+1.080
Neg logit contributions
 tradem-1.592
 undermin-1.229
 newcom-1.171
construct-1.166
urers-1.165
Token lots
Token ID6041
Feature activation-2.12e+00

Pos logit contributions
theless+2.065
+1.794
 DAC+1.667
 DOI+1.637
 detention+1.576
Neg logit contributions
 tradem-2.315
 undermin-1.784
construct-1.713
urers-1.705
 newcom-1.703
Token of
Token ID286
Feature activation-3.92e+00

Pos logit contributions
theless+1.602
+1.391
 DAC+1.272
 DOI+1.247
 detention+1.201
Neg logit contributions
 tradem-2.127
 undermin-1.676
construct-1.621
urers-1.594
cler-1.585
Token attention
Token ID3241
Feature activation-1.59e+00

Pos logit contributions
theless+3.527
+3.112
 DAC+2.914
 DOI+2.866
 detention+2.775
Neg logit contributions
 tradem-3.398
 undermin-2.560
construct-2.457
urers-2.432
cler-2.417
Token coming
Token ID2406
Feature activation-6.43e+00

Pos logit contributions
theless+1.225
+1.064
 DAC+0.967
 DOI+0.954
+0.910
Neg logit contributions
 tradem-1.588
 undermin-1.257
construct-1.208
urers-1.202
cler-1.195
Token its
Token ID663
Feature activation-6.31e-01

Pos logit contributions
theless+4.772
+4.280
 DAC+3.825
 DOI+3.783
 detention+3.632
Neg logit contributions
 tradem-6.391
 undermin-5.000
construct-4.840
urers-4.814
 newcom-4.800
Token way
Token ID835
Feature activation+1.31e+00

Pos logit contributions
theless+0.613
+0.545
 DAC+0.509
 DOI+0.502
+0.486
Neg logit contributions
 tradem-0.504
 undermin-0.371
construct-0.353
urers-0.352
cler-0.349
Token.
Token ID13
Feature activation+2.27e-01

Pos logit contributions
 tradem+1.255
 undermin+0.985
construct+0.945
cler+0.943
urers+0.942
Neg logit contributions
theless-1.060
-0.910
 DAC-0.842
 DOI-0.825
-0.790
TokenAccording
Token ID4821
Feature activation-2.83e+00

Pos logit contributions
 tradem+0.219
 undermin+0.172
construct+0.168
urers+0.167
cler+0.167
Neg logit contributions
theless-0.181
-0.153
 DAC-0.141
 DOI-0.138
-0.132
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
theless+2.101
+1.790
 DAC+1.633
 DOI+1.603
+1.526
Neg logit contributions
 tradem-2.914
 undermin-2.330
construct-2.234
urers-2.228
cler-2.223
Tokensan
Token ID12807
Feature activation-2.89e+00

Pos logit contributions
theless+1.206
+1.052
 DAC+0.973
 DOI+0.956
+0.924
Neg logit contributions
 tradem-1.268
 undermin-0.980
construct-0.931
urers-0.930
cler-0.925
Token collapsed
Token ID14707
Feature activation-3.16e-01

Pos logit contributions
theless+1.961
+1.657
 DAC+1.473
 DOI+1.449
+1.392
Neg logit contributions
 tradem-3.166
 undermin-2.572
construct-2.463
urers-2.463
cler-2.448
Token?」
Token ID42943
Feature activation-5.90e+00

Pos logit contributions
theless+0.253
+0.216
 DAC+0.200
 DOI+0.196
AI+0.187
Neg logit contributions
 tradem-0.307
 undermin-0.242
construct-0.232
cler-0.232
urers-0.232
Token
Token ID198
Feature activation-6.55e+00

Pos logit contributions
theless+4.002
+3.470
 DAC+3.017
 DOI+2.992
+2.968
Neg logit contributions
 tradem-6.378
 undermin-5.234
construct-4.915
urers-4.904
 newcom-4.885
Token
Token ID198
Feature activation-6.14e+00

Pos logit contributions
theless+4.249
+3.905
 DAC+3.383
 DOI+3.381
+3.223
Neg logit contributions
 tradem-7.530
 undermin-6.289
 newcom-5.983
senal-5.880
 srf-5.598
Token"
Token ID1
Feature activation-6.34e+00

Pos logit contributions
theless+4.475
+4.137
+3.606
 DAC+3.535
AI+3.533
Neg logit contributions
 tradem-6.413
 undermin-5.126
 newcom-4.932
senal-4.715
urers-4.675
TokenYes
Token ID5297
Feature activation-4.17e+00

Pos logit contributions
theless+4.556
+4.092
 DAC+3.614
 DOI+3.549
+3.541
Neg logit contributions
 tradem-6.711
 undermin-5.360
 newcom-5.139
urers-4.980
senal-4.907
Token.
Token ID13
Feature activation-1.87e+00

Pos logit contributions
theless+2.819
+2.397
 DAC+2.121
 DOI+2.091
+2.025
Neg logit contributions
 tradem-4.539
 undermin-3.687
urers-3.533
construct-3.513
 newcom-3.500
Token At
Token ID1629
Feature activation-1.59e+00

Pos logit contributions
theless+1.464
+1.254
 DAC+1.152
 DOI+1.131
+1.080
Neg logit contributions
 tradem-1.838
 undermin-1.453
construct-1.397
urers-1.393
cler-1.391
Token the
Token ID262
Feature activation-7.62e-01

Pos logit contributions
theless+1.338
+1.163
 DAC+1.074
 DOI+1.057
+1.014
Neg logit contributions
 tradem-1.486
 undermin-1.158
construct-1.106
urers-1.104
cler-1.100
Token moment
Token ID2589
Feature activation-6.31e-01

Pos logit contributions
theless+0.660
+0.573
 DAC+0.531
 DOI+0.523
+0.503
Neg logit contributions
 tradem-0.691
 undermin-0.532
construct-0.509
urers-0.509
cler-0.508
Token been
Token ID587
Feature activation-1.24e+00

Pos logit contributions
theless+1.436
+1.225
 DAC+1.119
 DOI+1.098
+1.045
Neg logit contributions
 tradem-1.966
 undermin-1.566
construct-1.508
urers-1.503
cler-1.500
Token the
Token ID262
Feature activation-1.55e+00

Pos logit contributions
theless+1.054
+0.916
 DAC+0.848
 DOI+0.833
+0.800
Neg logit contributions
 tradem-1.152
 undermin-0.893
construct-0.854
urers-0.852
cler-0.850
Token most
Token ID749
Feature activation-1.90e+00

Pos logit contributions
theless+1.356
+1.184
 DAC+1.099
 DOI+1.082
+1.040
Neg logit contributions
 tradem-1.388
 undermin-1.066
construct-1.018
urers-1.015
cler-1.012
Token sought
Token ID7194
Feature activation-3.50e+00

Pos logit contributions
theless+1.719
+1.509
 DAC+1.406
 DOI+1.385
+1.330
Neg logit contributions
 tradem-1.645
 undermin-1.246
construct-1.186
urers-1.184
cler-1.179
Token-
Token ID12
Feature activation-1.96e+00

Pos logit contributions
theless+2.522
+2.239
 DAC+2.010
 DOI+1.967
AI+1.884
Neg logit contributions
 tradem-3.564
 undermin-2.860
construct-2.781
cler-2.748
urers-2.747
Tokenafter
Token ID8499
Feature activation-6.32e+00

Pos logit contributions
theless+1.484
+1.275
 DAC+1.161
 DOI+1.137
+1.088
Neg logit contributions
 tradem-2.000
 undermin-1.596
urers-1.517
construct-1.516
cler-1.510
Token feature
Token ID3895
Feature activation-3.39e+00

Pos logit contributions
theless+5.373
+4.749
 DAC+4.510
 DOI+4.378
AI+4.191
Neg logit contributions
 tradem-5.722
 undermin-4.324
construct-4.164
urers-4.143
 newcom-4.119
Token for
Token ID329
Feature activation-2.42e+00

Pos logit contributions
theless+2.686
+2.334
 DAC+2.149
 DOI+2.116
AI+2.009
Neg logit contributions
 tradem-3.316
 undermin-2.603
construct-2.501
urers-2.475
cler-2.467
Token me
Token ID502
Feature activation-3.15e+00

Pos logit contributions
theless+1.984
+1.718
 DAC+1.598
 DOI+1.569
+1.491
Neg logit contributions
 tradem-2.297
 undermin-1.789
construct-1.720
urers-1.709
cler-1.706
Token since
Token ID1201
Feature activation-2.05e-01

Pos logit contributions
theless+2.459
+2.129
 DAC+1.949
 DOI+1.912
AI+1.816
Neg logit contributions
 tradem-3.134
 undermin-2.469
construct-2.361
urers-2.347
 newcom-2.342
Token the
Token ID262
Feature activation-7.88e-01

Pos logit contributions
theless+0.165
+0.142
 DAC+0.131
 DOI+0.129
+0.123
Neg logit contributions
 tradem-0.197
 undermin-0.155
construct-0.149
urers-0.148
cler-0.148
Token riders
Token ID13750
Feature activation+2.31e+00

Pos logit contributions
 tradem+4.564
 undermin+3.635
cler+3.544
construct+3.508
 spoilers+3.493
Neg logit contributions
theless-3.603
-3.000
 DAC-2.762
 DOI-2.727
-2.633
Token were
Token ID547
Feature activation-6.73e-02

Pos logit contributions
 tradem+2.094
 undermin+1.613
construct+1.575
cler+1.574
urers+1.562
Neg logit contributions
theless-1.969
-1.689
 DAC-1.586
 DOI-1.557
-1.486
Token routinely
Token ID16231
Feature activation-4.64e-01

Pos logit contributions
theless+0.056
+0.048
 DAC+0.045
 DOI+0.044
+0.042
Neg logit contributions
 tradem-0.063
 undermin-0.049
construct-0.047
urers-0.047
cler-0.047
Token delayed
Token ID11038
Feature activation+1.02e+00

Pos logit contributions
theless+0.378
+0.326
 DAC+0.301
 DOI+0.295
+0.283
Neg logit contributions
 tradem-0.444
 undermin-0.348
construct-0.334
urers-0.333
cler-0.333
Token by
Token ID416
Feature activation+1.36e-01

Pos logit contributions
 tradem+0.950
 undermin+0.741
construct+0.713
cler+0.710
urers+0.709
Neg logit contributions
theless-0.851
-0.731
 DAC-0.679
 DOI-0.667
-0.643
Token private
Token ID2839
Feature activation+4.55e+00

Pos logit contributions
 tradem+0.125
 undermin+0.097
construct+0.093
urers+0.092
cler+0.092
Neg logit contributions
theless-0.116
-0.101
 DAC-0.093
 DOI-0.092
-0.088
Token automobile
Token ID27930
Feature activation+1.32e+00

Pos logit contributions
 tradem+3.222
 undermin+2.295
cler+2.216
urers+2.172
 spoilers+2.153
Neg logit contributions
theless-4.791
-4.213
 DAC-3.941
 DOI-3.910
-3.824
Token drivers
Token ID6643
Feature activation+1.68e+00

Pos logit contributions
 tradem+1.135
 undermin+0.862
construct+0.821
urers+0.820
cler+0.817
Neg logit contributions
theless-1.200
-1.050
 DAC-0.978
 DOI-0.964
-0.930
Token blocking
Token ID12013
Feature activation-2.99e-01

Pos logit contributions
 tradem+1.580
 undermin+1.239
construct+1.197
cler+1.196
urers+1.190
Neg logit contributions
theless-1.380
-1.174
 DAC-1.090
 DOI-1.070
-1.027
Token boarding
Token ID25523
Feature activation-1.47e+00

Pos logit contributions
theless+0.270
+0.239
 DAC+0.222
 DOI+0.219
+0.210
Neg logit contributions
 tradem-0.259
 undermin-0.197
construct-0.187
urers-0.187
cler-0.185
Token islands
Token ID14807
Feature activation+8.73e-01

Pos logit contributions
theless+1.433
+1.277
 DAC+1.196
 DOI+1.180
+1.137
Neg logit contributions
 tradem-1.162
 undermin-0.860
construct-0.812
urers-0.810
cler-0.804
Token display
Token ID3359
Feature activation-7.97e-01

Pos logit contributions
 tradem+0.032
 undermin+0.025
construct+0.024
urers+0.023
cler+0.023
Neg logit contributions
theless-0.030
-0.026
 DAC-0.024
 DOI-0.023
-0.023
Token.
Token ID13
Feature activation+9.15e-01

Pos logit contributions
theless+0.631
+0.546
 DAC+0.501
 DOI+0.491
+0.469
Neg logit contributions
 tradem-0.785
 undermin-0.618
construct-0.590
urers-0.589
cler-0.588
Token
Token ID198
Feature activation+1.98e+00

Pos logit contributions
 tradem+0.896
 undermin+0.708
construct+0.683
urers+0.680
cler+0.679
Neg logit contributions
theless-0.720
-0.617
 DAC-0.566
 DOI-0.555
-0.532
Token
Token ID198
Feature activation+1.43e+00

Pos logit contributions
 tradem+1.624
construct+1.363
cler+1.345
urers+1.314
 spoilers+1.313
Neg logit contributions
theless-1.767
-1.407
 DAC-1.341
 DOI-1.303
 detainee-1.269
TokenSl
Token ID11122
Feature activation+1.82e-01

Pos logit contributions
 tradem+1.309
 undermin+1.018
construct+0.999
cler+0.992
urers+0.985
Neg logit contributions
theless-1.203
-1.029
 DAC-0.960
 DOI-0.945
-0.897
Tokenov
Token ID709
Feature activation+4.66e+00

Pos logit contributions
 tradem+0.192
 undermin+0.154
construct+0.153
cler+0.153
urers+0.153
Neg logit contributions
theless-0.129
-0.106
 DAC-0.098
 DOI-0.094
-0.090
Tokenakia
Token ID21897
Feature activation+5.12e-01

Pos logit contributions
 tradem+3.904
 undermin+2.983
construct+2.927
urers+2.918
cler+2.898
Neg logit contributions
theless-4.233
-3.669
 DAC-3.413
 DOI-3.376
-3.264
Token
Token ID447
Feature activation+2.76e-01

Pos logit contributions
 tradem+0.493
 undermin+0.388
construct+0.372
urers+0.371
cler+0.370
Neg logit contributions
theless-0.412
-0.355
 DAC-0.327
 DOI-0.321
-0.308
Token
Token ID247
Feature activation+1.57e-01

Pos logit contributions
 tradem+0.239
 undermin+0.182
construct+0.173
urers+0.173
cler+0.172
Neg logit contributions
theless-0.250
-0.219
 DAC-0.204
 DOI-0.201
-0.193
Tokens
Token ID82
Feature activation+8.72e-01

Pos logit contributions
 tradem+0.152
 undermin+0.120
construct+0.115
urers+0.115
cler+0.115
Neg logit contributions
theless-0.125
-0.107
 DAC-0.099
 DOI-0.097
-0.093
Token lone
Token ID13497
Feature activation+2.17e+00

Pos logit contributions
 tradem+0.803
 undermin+0.624
construct+0.598
urers+0.596
cler+0.595
Neg logit contributions
theless-0.738
-0.641
 DAC-0.593
 DOI-0.583
-0.560
Token this
Token ID428
Feature activation+3.63e-01

Pos logit contributions
theless+0.063
+0.053
 DAC+0.048
 DOI+0.047
+0.045
Neg logit contributions
 tradem-0.085
 undermin-0.067
construct-0.066
cler-0.066
urers-0.065
Token post
Token ID1281
Feature activation+7.16e-01

Pos logit contributions
 tradem+0.332
 undermin+0.257
construct+0.250
cler+0.249
urers+0.248
Neg logit contributions
theless-0.310
-0.266
 DAC-0.246
 DOI-0.241
-0.233
Token I
Token ID314
Feature activation+1.10e+00

Pos logit contributions
 tradem+0.698
 undermin+0.548
construct+0.536
cler+0.535
urers+0.533
Neg logit contributions
theless-0.565
-0.481
 DAC-0.442
 DOI-0.433
-0.415
Token'll
Token ID1183
Feature activation+4.41e+00

Pos logit contributions
 tradem+0.994
 undermin+0.767
construct+0.754
cler+0.753
urers+0.752
Neg logit contributions
theless-0.943
-0.817
 DAC-0.760
 DOI-0.745
-0.715
Token round
Token ID2835
Feature activation-9.85e-01

Pos logit contributions
 tradem+3.615
cler+2.887
construct+2.875
 spoilers+2.840
urers+2.795
Neg logit contributions
theless-3.956
-3.293
 DAC-3.156
 DOI-3.062
AI-2.990
Token up
Token ID510
Feature activation+5.33e+00

Pos logit contributions
theless+0.775
+0.659
 DAC+0.607
 DOI+0.596
+0.570
Neg logit contributions
 tradem-0.969
 undermin-0.764
construct-0.739
cler-0.737
urers-0.737
Token some
Token ID617
Feature activation+2.17e-01

Pos logit contributions
 tradem+3.973
cler+2.899
 spoilers+2.894
construct+2.865
 undermin+2.857
Neg logit contributions
theless-5.299
-4.587
 DAC-4.351
 DOI-4.273
-4.139
Token got
Token ID1392
Feature activation+1.09e+00

Pos logit contributions
 tradem+0.159
 undermin+0.114
 spoilers+0.111
cler+0.110
construct+0.109
Neg logit contributions
theless-0.223
-0.196
 DAC-0.187
 DOI-0.183
-0.176
Tokencha
Token ID11693
Feature activation+3.52e+00

Pos logit contributions
 tradem+0.683
construct+0.465
 undermin+0.463
cler+0.460
urers+0.450
Neg logit contributions
theless-1.229
-1.089
 DAC-1.049
 DOI-1.026
-0.990
Token's
Token ID338
Feature activation+1.80e+00

Pos logit contributions
 tradem+2.969
cler+2.343
 spoilers+2.337
construct+2.331
urers+2.309
Neg logit contributions
theless-3.090
-2.520
 DAC-2.425
 DOI-2.382
 detainee-2.280
Token I
Token ID314
Feature activation+2.15e-01

Pos logit contributions
 tradem+1.678
construct+1.300
cler+1.298
 undermin+1.297
 spoilers+1.290
Neg logit contributions
theless-1.485
-1.236
 DAC-1.158
 DOI-1.124
-1.077
Token 100
Token ID1802
Feature activation+2.13e-01

Pos logit contributions
theless+1.746
+1.512
 DAC+1.395
 DOI+1.375
+1.304
Neg logit contributions
 tradem-2.076
 undermin-1.632
construct-1.558
urers-1.550
cler-1.547
Token passengers
Token ID10405
Feature activation+8.24e-01

Pos logit contributions
 tradem+0.185
 undermin+0.141
urers+0.135
construct+0.135
cler+0.134
Neg logit contributions
theless-0.193
-0.168
 DAC-0.157
 DOI-0.154
-0.148
Token injured
Token ID6686
Feature activation-6.17e-02

Pos logit contributions
 tradem+0.790
 undermin+0.619
urers+0.600
cler+0.600
construct+0.599
Neg logit contributions
theless-0.661
-0.567
 DAC-0.526
 DOI-0.514
-0.495
Token in
Token ID287
Feature activation+7.92e-01

Pos logit contributions
theless+0.050
+0.043
 DAC+0.040
 DOI+0.039
+0.037
Neg logit contributions
 tradem-0.060
 undermin-0.047
construct-0.045
urers-0.045
cler-0.044
Token the
Token ID262
Feature activation+8.56e-01

Pos logit contributions
 tradem+0.710
 undermin+0.546
construct+0.521
urers+0.520
cler+0.519
Neg logit contributions
theless-0.694
-0.605
 DAC-0.562
 DOI-0.553
-0.532
Token mish
Token ID29406
Feature activation+5.69e+00

Pos logit contributions
 tradem+0.732
 undermin+0.555
construct+0.531
urers+0.531
cler+0.527
Neg logit contributions
theless-0.784
-0.684
 DAC-0.636
 DOI-0.626
-0.609
Tokenap
Token ID499
Feature activation+2.36e+00

Pos logit contributions
 tradem+4.930
construct+3.927
cler+3.880
urers+3.794
 undermin+3.773
Neg logit contributions
theless-4.831
-4.000
 DAC-3.929
-3.752
 DOI-3.691
Token have
Token ID423
Feature activation-1.41e+00

Pos logit contributions
 tradem+2.180
 undermin+1.694
cler+1.662
urers+1.657
construct+1.649
Neg logit contributions
theless-1.940
-1.661
 DAC-1.569
 DOI-1.509
-1.478
Token settled
Token ID10282
Feature activation+1.38e+00

Pos logit contributions
theless+0.998
+0.841
 DAC+0.763
 DOI+0.749
+0.709
Neg logit contributions
 tradem-1.500
 undermin-1.208
construct-1.164
urers-1.160
cler-1.158
Token compensation
Token ID9836
Feature activation+1.13e+00

Pos logit contributions
 tradem+1.351
 undermin+1.060
urers+1.019
cler+1.017
construct+1.017
Neg logit contributions
theless-1.105
-0.943
 DAC-0.869
 DOI-0.852
-0.819
Token claims
Token ID3667
Feature activation+3.66e-01

Pos logit contributions
 tradem+1.094
 undermin+0.860
cler+0.830
urers+0.830
construct+0.828
Neg logit contributions
theless-0.905
-0.767
 DAC-0.706
 DOI-0.697
-0.667
Tokenering
Token ID1586
Feature activation+2.62e+00

Pos logit contributions
 tradem+2.736
 undermin+2.284
construct+2.220
urers+2.217
cler+2.214
Neg logit contributions
theless-1.188
-0.942
 DAC-0.835
 DOI-0.808
-0.750
Token.
Token ID13
Feature activation+3.24e-01

Pos logit contributions
 tradem+2.463
 undermin+1.950
construct+1.857
 spoilers+1.851
cler+1.849
Neg logit contributions
theless-2.132
-1.819
 DAC-1.706
 DOI-1.672
-1.600
Token
Token ID198
Feature activation-5.68e-01

Pos logit contributions
 tradem+0.313
 undermin+0.247
construct+0.240
cler+0.239
urers+0.239
Neg logit contributions
theless-0.258
-0.219
 DAC-0.201
 DOI-0.197
-0.189
Token
Token ID198
Feature activation-3.45e-01

Pos logit contributions
theless+0.470
+0.388
 DAC+0.364
 DOI+0.355
+0.340
Neg logit contributions
 tradem-0.523
construct-0.414
cler-0.412
urers-0.407
 spoilers-0.404
TokenThe
Token ID464
Feature activation+2.85e-01

Pos logit contributions
theless+0.289
+0.247
 DAC+0.230
 DOI+0.226
+0.216
Neg logit contributions
 tradem-0.320
 undermin-0.249
construct-0.242
cler-0.241
urers-0.240
Token flyer
Token ID42464
Feature activation+5.20e+00

Pos logit contributions
 tradem+0.251
 undermin+0.193
construct+0.184
urers+0.183
cler+0.183
Neg logit contributions
theless-0.253
-0.221
 DAC-0.205
 DOI-0.202
-0.195
Token shows
Token ID2523
Feature activation+1.85e+00

Pos logit contributions
 tradem+4.771
 undermin+3.774
 spoilers+3.688
urers+3.685
cler+3.666
Neg logit contributions
theless-4.170
-3.519
 DAC-3.318
 DOI-3.247
-3.138
Token a
Token ID257
Feature activation-4.58e-01

Pos logit contributions
 tradem+1.777
 undermin+1.404
urers+1.351
cler+1.343
 spoilers+1.342
Neg logit contributions
theless-1.482
-1.252
 DAC-1.154
 DOI-1.136
-1.100
Token newspaper
Token ID7533
Feature activation-2.75e+00

Pos logit contributions
theless+0.400
+0.348
 DAC+0.322
 DOI+0.317
+0.306
Neg logit contributions
 tradem-0.411
 undermin-0.317
cler-0.303
construct-0.303
urers-0.303
Token headline
Token ID16534
Feature activation-3.12e+00

Pos logit contributions
theless+2.405
+2.117
 DAC+1.956
 DOI+1.926
+1.851
Neg logit contributions
 tradem-2.467
 undermin-1.900
construct-1.807
urers-1.793
 newcom-1.782
Token taken
Token ID2077
Feature activation+9.91e-01

Pos logit contributions
theless+2.382
+2.066
 DAC+1.869
 DOI+1.834
+1.761
Neg logit contributions
 tradem-3.142
 undermin-2.482
construct-2.388
 newcom-2.378
urers-2.371
Token
Token ID960
Feature activation-8.61e-01

Pos logit contributions
 tradem+0.523
 undermin+0.417
construct+0.404
cler+0.402
urers+0.402
Neg logit contributions
theless-0.385
-0.323
 DAC-0.300
 DOI-0.294
-0.279
Tokenthe
Token ID1169
Feature activation-1.82e+00

Pos logit contributions
theless+0.666
+0.570
 DAC+0.523
 DOI+0.513
+0.490
Neg logit contributions
 tradem-0.861
 undermin-0.682
construct-0.655
urers-0.654
cler-0.652
Token situations
Token ID7445
Feature activation-1.20e+00

Pos logit contributions
theless+1.699
+1.500
 DAC+1.400
 DOI+1.379
+1.327
Neg logit contributions
 tradem-1.520
 undermin-1.141
construct-1.085
urers-1.081
cler-1.076
Token might
Token ID1244
Feature activation+1.49e+00

Pos logit contributions
theless+0.940
+0.806
 DAC+0.739
 DOI+0.726
+0.693
Neg logit contributions
 tradem-1.195
 undermin-0.945
construct-0.907
urers-0.905
cler-0.903
Token very
Token ID845
Feature activation+1.62e+00

Pos logit contributions
 tradem+1.309
 undermin+1.011
cler+0.980
urers+0.978
construct+0.978
Neg logit contributions
theless-1.310
-1.112
 DAC-1.062
 DOI-1.037
AI-0.995
Token well
Token ID880
Feature activation+1.49e+00

Pos logit contributions
 tradem+1.452
 undermin+1.115
construct+1.070
cler+1.067
urers+1.065
Neg logit contributions
theless-1.416
-1.229
 DAC-1.147
 DOI-1.130
-1.083
Token be
Token ID307
Feature activation-3.02e-01

Pos logit contributions
 tradem+1.287
 undermin+0.989
urers+0.954
cler+0.954
construct+0.953
Neg logit contributions
theless-1.335
-1.144
 DAC-1.091
 DOI-1.058
AI-1.026
Token reversed
Token ID17687
Feature activation-1.33e+00

Pos logit contributions
theless+0.278
+0.243
 DAC+0.228
 DOI+0.224
AI+0.216
Neg logit contributions
 tradem-0.256
 undermin-0.194
construct-0.185
cler-0.184
urers-0.184
Token.
Token ID13
Feature activation+1.58e+00

Pos logit contributions
theless+1.049
+0.910
 DAC+0.826
 DOI+0.815
+0.778
Neg logit contributions
 tradem-1.312
 undermin-1.036
construct-0.992
urers-0.991
cler-0.985
Token It
Token ID632
Feature activation+8.15e-02

Pos logit contributions
 tradem+1.432
construct+1.133
cler+1.118
 undermin+1.118
urers+1.112
Neg logit contributions
theless-1.293
-1.069
 DAC-0.997
 DOI-0.963
-0.948
Token is
Token ID318
Feature activation+1.31e+00

Pos logit contributions
 tradem+0.081
 undermin+0.064
urers+0.062
construct+0.062
cler+0.062
Neg logit contributions
theless-0.063
-0.053
 DAC-0.050
 DOI-0.048
AI-0.046
Token have
Token ID423
Feature activation-2.67e-01

Pos logit contributions
theless+0.281
+0.243
 DAC+0.225
 DOI+0.221
+0.213
Neg logit contributions
 tradem-0.309
 undermin-0.241
urers-0.232
construct-0.231
cler-0.231
Token been
Token ID587
Feature activation+3.50e-01

Pos logit contributions
theless+0.215
+0.185
 DAC+0.170
 DOI+0.167
+0.160
Neg logit contributions
 tradem-0.258
 undermin-0.203
construct-0.195
urers-0.195
cler-0.194
Token diagnosed
Token ID14641
Feature activation-8.78e-01

Pos logit contributions
 tradem+0.321
 undermin+0.250
construct+0.239
urers+0.238
cler+0.238
Neg logit contributions
theless-0.298
-0.258
 DAC-0.239
 DOI-0.235
-0.225
Token with
Token ID351
Feature activation-1.37e+00

Pos logit contributions
theless+0.646
+0.557
 DAC+0.504
 DOI+0.498
+0.473
Neg logit contributions
 tradem-0.904
 undermin-0.719
construct-0.697
urers-0.690
 newcom-0.686
Token post
Token ID1281
Feature activation-4.88e-01

Pos logit contributions
theless+1.110
+0.963
 DAC+0.891
 DOI+0.876
+0.834
Neg logit contributions
 tradem-1.310
 undermin-1.023
urers-0.982
construct-0.981
cler-0.974
Token-
Token ID12
Feature activation+1.80e+00

Pos logit contributions
theless+0.325
+0.272
 DAC+0.244
 DOI+0.239
+0.225
Neg logit contributions
 tradem-0.539
 undermin-0.438
urers-0.422
construct-0.421
cler-0.420
Tokentraumatic
Token ID41521
Feature activation+2.52e+00

Pos logit contributions
 tradem+1.599
 undermin+1.227
construct+1.196
cler+1.185
urers+1.176
Neg logit contributions
theless-1.575
-1.359
 DAC-1.271
 DOI-1.255
-1.204
Token stress
Token ID5503
Feature activation+4.20e+00

Pos logit contributions
 tradem+1.995
 undermin+1.507
construct+1.448
cler+1.432
 spoilers+1.427
Neg logit contributions
theless-2.442
-2.117
 DAC-1.981
 DOI-1.961
-1.909
Token disorder
Token ID8967
Feature activation+1.56e+00

Pos logit contributions
 tradem+3.352
 undermin+2.511
construct+2.378
cler+2.369
urers+2.357
Neg logit contributions
theless-4.110
-3.482
 DAC-3.299
 DOI-3.259
-3.181
Token and
Token ID290
Feature activation+2.43e-02

Pos logit contributions
 tradem+1.554
 undermin+1.233
construct+1.183
cler+1.183
urers+1.180
Neg logit contributions
theless-1.206
-1.021
 DAC-0.941
 DOI-0.921
-0.883
Token,
Token ID11
Feature activation+8.53e-01

Pos logit contributions
 tradem+0.023
 undermin+0.018
construct+0.017
urers+0.017
cler+0.017
Neg logit contributions
theless-0.020
-0.018
 DAC-0.016
 DOI-0.016
-0.015
Tokens
Token ID82
Feature activation+9.73e-01

Pos logit contributions
 tradem+0.847
 undermin+0.624
construct+0.615
cler+0.611
urers+0.609
Neg logit contributions
theless-0.999
-0.877
 DAC-0.832
 DOI-0.817
AI-0.784
Token also
Token ID635
Feature activation-3.86e-01

Pos logit contributions
 tradem+0.901
 undermin+0.703
construct+0.672
urers+0.670
cler+0.668
Neg logit contributions
theless-0.819
-0.706
 DAC-0.654
 DOI-0.644
-0.616
Token followed
Token ID3940
Feature activation+1.26e+00

Pos logit contributions
theless+0.326
+0.283
 DAC+0.262
 DOI+0.257
+0.247
Neg logit contributions
 tradem-0.358
 undermin-0.278
construct-0.266
urers-0.265
cler-0.265
Token Col
Token ID1623
Feature activation+2.58e+00

Pos logit contributions
 tradem+1.089
 undermin+0.827
construct+0.787
cler+0.786
urers+0.786
Neg logit contributions
theless-1.147
-1.004
 DAC-0.934
 DOI-0.918
-0.888
Tokenfax
Token ID23560
Feature activation-1.57e+00

Pos logit contributions
 tradem+2.406
 undermin+1.858
construct+1.798
cler+1.796
urers+1.787
Neg logit contributions
theless-2.173
-1.855
 DAC-1.726
 DOI-1.693
-1.631
Token,
Token ID11
Feature activation+1.86e+00

Pos logit contributions
theless+1.287
+1.128
 DAC+1.035
 DOI+1.021
+0.974
Neg logit contributions
 tradem-1.487
 undermin-1.168
construct-1.115
urers-1.112
 newcom-1.105
Token the
Token ID262
Feature activation+1.73e+00

Pos logit contributions
 tradem+1.619
 undermin+1.232
construct+1.174
urers+1.170
cler+1.170
Neg logit contributions
theless-1.684
-1.470
 DAC-1.368
 DOI-1.347
-1.299
Token longest
Token ID14069
Feature activation+1.94e+00

Pos logit contributions
 tradem+1.538
 undermin+1.176
cler+1.121
construct+1.121
urers+1.117
Neg logit contributions
theless-1.525
-1.329
 DAC-1.227
 DOI-1.206
-1.172
Token bou
Token ID35833
Feature activation+1.91e+00

Pos logit contributions
 tradem+1.796
 undermin+1.387
cler+1.343
construct+1.335
 spoilers+1.325
Neg logit contributions
theless-1.614
-1.380
 DAC-1.294
 DOI-1.257
-1.230
Tokenlevard
Token ID22123
Feature activation+3.64e-01

Pos logit contributions
 tradem+1.152
 undermin+0.754
cler+0.707
construct+0.705
urers+0.694
Neg logit contributions
theless-2.232
-2.011
 DAC-1.910
 DOI-1.886
-1.836
Token in
Token ID287
Feature activation-1.21e+00

Pos logit contributions
 tradem+0.338
 undermin+0.262
cler+0.252
construct+0.252
urers+0.250
Neg logit contributions
theless-0.307
-0.263
 DAC-0.245
 DOI-0.241
-0.231
Token city
Token ID1748
Feature activation-3.63e-01

Pos logit contributions
 tradem+1.127
 undermin+0.870
urers+0.831
construct+0.830
cler+0.829
Neg logit contributions
theless-1.104
-0.957
 DAC-0.880
 DOI-0.873
-0.846
Token governments
Token ID6905
Feature activation+1.72e-01

Pos logit contributions
theless+0.301
+0.259
 DAC+0.240
 DOI+0.236
+0.227
Neg logit contributions
 tradem-0.343
 undermin-0.268
construct-0.257
urers-0.256
cler-0.256
Token.
Token ID13
Feature activation-4.87e-01

Pos logit contributions
 tradem+0.165
 undermin+0.129
construct+0.124
urers+0.124
cler+0.123
Neg logit contributions
theless-0.140
-0.120
 DAC-0.111
 DOI-0.109
-0.105
Token Paris
Token ID6342
Feature activation+1.13e+00

Pos logit contributions
theless+0.394
+0.339
 DAC+0.312
 DOI+0.306
+0.294
Neg logit contributions
 tradem-0.468
 undermin-0.367
construct-0.353
urers-0.352
cler-0.352
Token,
Token ID11
Feature activation+8.29e-01

Pos logit contributions
 tradem+1.124
 undermin+0.887
urers+0.863
construct+0.858
cler+0.857
Neg logit contributions
theless-0.874
-0.734
 DAC-0.678
 DOI-0.673
-0.642
Token for
Token ID329
Feature activation+1.18e+00

Pos logit contributions
 tradem+0.791
 undermin+0.620
construct+0.594
urers+0.593
cler+0.593
Neg logit contributions
theless-0.676
-0.580
 DAC-0.533
 DOI-0.526
-0.504
Token example
Token ID1672
Feature activation+1.75e+00

Pos logit contributions
 tradem+0.844
 undermin+0.599
construct+0.565
urers+0.563
cler+0.561
Neg logit contributions
theless-1.252
-1.114
 DAC-1.048
 DOI-1.036
-1.008
Token,
Token ID11
Feature activation-4.55e-01

Pos logit contributions
 tradem+1.680
cler+1.339
construct+1.335
urers+1.317
 spoilers+1.307
Neg logit contributions
theless-1.368
-1.101
 DAC-1.040
 DOI-1.018
-0.972
Token made
Token ID925
Feature activation+1.83e-01

Pos logit contributions
theless+0.358
+0.306
 DAC+0.282
 DOI+0.277
+0.265
Neg logit contributions
 tradem-0.447
 undermin-0.353
construct-0.339
urers-0.338
cler-0.338
Token headlines
Token ID14408
Feature activation-3.90e+00

Pos logit contributions
 tradem+0.170
 undermin+0.132
construct+0.126
urers+0.126
cler+0.125
Neg logit contributions
theless-0.155
-0.135
 DAC-0.124
 DOI-0.122
-0.118
Token in
Token ID287
Feature activation-1.83e+00

Pos logit contributions
theless+2.985
+2.631
 DAC+2.365
 DOI+2.337
+2.236
Neg logit contributions
 tradem-3.879
 undermin-3.084
construct-2.957
urers-2.943
 newcom-2.935
Token,
Token ID11
Feature activation+3.07e-01

Pos logit contributions
theless+0.066
+0.056
 DAC+0.051
 DOI+0.050
+0.047
Neg logit contributions
 tradem-0.096
 undermin-0.077
urers-0.074
construct-0.074
cler-0.074
Token developer
Token ID8517
Feature activation+2.23e+00

Pos logit contributions
 tradem+0.281
 undermin+0.217
construct+0.208
urers+0.207
cler+0.207
Neg logit contributions
theless-0.262
-0.228
 DAC-0.211
 DOI-0.208
-0.200
Token of
Token ID286
Feature activation+6.36e-01

Pos logit contributions
 tradem+2.122
 undermin+1.660
construct+1.591
urers+1.588
cler+1.582
Neg logit contributions
theless-1.827
-1.582
 DAC-1.460
 DOI-1.434
-1.374
Token Ghost
Token ID9897
Feature activation-3.12e+00

Pos logit contributions
 tradem+0.591
 undermin+0.459
construct+0.439
urers+0.438
cler+0.437
Neg logit contributions
theless-0.537
-0.466
 DAC-0.431
 DOI-0.424
-0.407
Token Song
Token ID10940
Feature activation-3.22e+00

Pos logit contributions
theless+2.563
+2.244
 DAC+2.076
 DOI+2.025
AI+1.962
Neg logit contributions
 tradem-2.938
 undermin-2.292
urers-2.188
 newcom-2.168
construct-2.164
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
theless+2.457
+2.128
 DAC+1.968
 DOI+1.902
AI+1.828
Neg logit contributions
 tradem-3.241
 undermin-2.559
construct-2.458
urers-2.457
cler-2.434
Token "
Token ID366
Feature activation+1.18e-01

Pos logit contributions
 tradem+1.034
 undermin+0.825
construct+0.799
urers+0.796
cler+0.793
Neg logit contributions
theless-0.757
-0.641
 DAC-0.584
 DOI-0.573
-0.547
TokenTrue
Token ID17821
Feature activation+1.76e+00

Pos logit contributions
 tradem+0.112
 undermin+0.088
construct+0.087
cler+0.086
urers+0.086
Neg logit contributions
theless-0.096
-0.080
 DAC-0.074
 DOI-0.073
 detention-0.070
Token discovery
Token ID9412
Feature activation-1.14e+00

Pos logit contributions
 tradem+1.598
 undermin+1.242
urers+1.187
construct+1.186
 spoilers+1.181
Neg logit contributions
theless-1.510
-1.308
 DAC-1.211
 DOI-1.193
-1.147
Token is
Token ID318
Feature activation+9.96e-02

Pos logit contributions
theless+0.890
+0.764
 DAC+0.700
 DOI+0.689
+0.655
Neg logit contributions
 tradem-1.137
 undermin-0.897
construct-0.859
urers-0.859
cler-0.856
Token only
Token ID691
Feature activation+4.36e-01

Pos logit contributions
 tradem+0.091
 undermin+0.071
construct+0.068
urers+0.068
cler+0.067
Neg logit contributions
theless-0.085
-0.074
 DAC-0.068
 DOI-0.067
-0.065
Token.
Token ID13
Feature activation-2.06e-01

Pos logit contributions
theless+0.647
+0.560
 DAC+0.514
 DOI+0.502
+0.482
Neg logit contributions
 tradem-0.826
 undermin-0.652
construct-0.630
urers-0.627
cler-0.622
Token In
Token ID554
Feature activation+7.30e-01

Pos logit contributions
theless+0.169
+0.146
 DAC+0.134
 DOI+0.132
+0.127
Neg logit contributions
 tradem-0.198
 undermin-0.155
construct-0.148
urers-0.148
cler-0.147
Token the
Token ID262
Feature activation-8.19e-01

Pos logit contributions
 tradem+0.664
 undermin+0.513
construct+0.489
urers+0.489
cler+0.487
Neg logit contributions
theless-0.630
-0.549
 DAC-0.509
 DOI-0.500
-0.481
Token scene
Token ID3715
Feature activation+1.01e+00

Pos logit contributions
theless+0.620
+0.529
 DAC+0.484
 DOI+0.475
+0.453
Neg logit contributions
 tradem-0.830
 undermin-0.661
construct-0.636
urers-0.633
cler-0.633
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
 tradem+1.006
 undermin+0.799
construct+0.767
urers+0.766
cler+0.766
Neg logit contributions
theless-0.780
-0.663
 DAC-0.610
 DOI-0.599
-0.571
Token Row
Token ID11314
Feature activation-1.84e-01

Pos logit contributions
theless+0.964
+0.847
 DAC+0.789
 DOI+0.778
+0.747
Neg logit contributions
 tradem-0.944
 undermin-0.716
construct-0.685
urers-0.681
 newcom-0.675
Tokenlf
Token ID1652
Feature activation-2.84e+00

Pos logit contributions
theless+0.128
+0.106
 DAC+0.097
 DOI+0.094
+0.090
Neg logit contributions
 tradem-0.197
 undermin-0.157
construct-0.156
cler-0.156
urers-0.155
Token and
Token ID290
Feature activation-1.89e-01

Pos logit contributions
theless+2.338
+2.023
 DAC+1.861
 DOI+1.827
+1.748
Neg logit contributions
 tradem-2.732
 undermin-2.116
 newcom-2.030
urers-2.018
construct-2.016
Token Ker
Token ID17337
Feature activation+5.11e-01

Pos logit contributions
theless+0.160
+0.139
 DAC+0.129
 DOI+0.127
+0.121
Neg logit contributions
 tradem-0.176
 undermin-0.136
construct-0.131
urers-0.130
cler-0.129
Tokenmit
Token ID2781
Feature activation+9.02e-01

Pos logit contributions
 tradem+0.540
construct+0.434
 undermin+0.433
cler+0.432
urers+0.427
Neg logit contributions
theless-0.358
-0.292
 DAC-0.273
 DOI-0.264
-0.252
Token sing
Token ID1702
Feature activation-1.16e+00

Pos logit contributions
 tradem+0.821
 undermin+0.643
construct+0.618
cler+0.615
urers+0.614
Neg logit contributions
theless-0.762
-0.661
 DAC-0.614
 DOI-0.603
-0.582
Token you
Token ID345
Feature activation+1.14e+00

Pos logit contributions
 tradem+0.832
 undermin+0.657
construct+0.632
cler+0.627
urers+0.626
Neg logit contributions
theless-0.698
-0.597
 DAC-0.551
 DOI-0.541
-0.519
Token
Token ID447
Feature activation-1.44e-01

Pos logit contributions
 tradem+0.971
 undermin+0.749
construct+0.730
cler+0.718
urers+0.717
Neg logit contributions
theless-1.037
-0.884
 DAC-0.839
 DOI-0.820
-0.788
Token
Token ID247
Feature activation+5.48e-01

Pos logit contributions
theless+0.096
+0.076
 DAC+0.070
 DOI+0.068
+0.064
Neg logit contributions
 tradem-0.160
 undermin-0.129
cler-0.126
construct-0.126
urers-0.126
Tokenve
Token ID303
Feature activation+2.03e-01

Pos logit contributions
 tradem+0.447
 undermin+0.336
construct+0.332
cler+0.328
urers+0.327
Neg logit contributions
theless-0.514
-0.446
 DAC-0.424
 DOI-0.416
-0.399
Token helped
Token ID4193
Feature activation+1.10e+00

Pos logit contributions
 tradem+0.178
 undermin+0.138
construct+0.134
urers+0.132
cler+0.131
Neg logit contributions
theless-0.178
-0.154
 DAC-0.145
 DOI-0.142
AI-0.136
Token create
Token ID2251
Feature activation+3.80e-01

Pos logit contributions
 tradem+0.937
 undermin+0.725
construct+0.694
urers+0.685
cler+0.681
Neg logit contributions
theless-1.007
-0.867
 DAC-0.812
 DOI-0.797
-0.766
Token.
Token ID13
Feature activation+5.33e-01

Pos logit contributions
 tradem+0.362
 undermin+0.284
construct+0.273
urers+0.272
cler+0.271
Neg logit contributions
theless-0.310
-0.266
 DAC-0.246
 DOI-0.242
-0.232
Token
Token ID198
Feature activation+9.63e-01

Pos logit contributions
 tradem+0.519
 undermin+0.410
construct+0.402
urers+0.398
cler+0.397
Neg logit contributions
theless-0.418
-0.353
 DAC-0.325
 DOI-0.319
-0.305
Token
Token ID198
Feature activation+2.84e-01

Pos logit contributions
 tradem+0.823
construct+0.677
cler+0.672
urers+0.659
 spoilers+0.653
Neg logit contributions
theless-0.834
-0.677
 DAC-0.643
 DOI-0.624
 detainee-0.602
TokenThe
Token ID464
Feature activation+3.32e-01

Pos logit contributions
 tradem+0.270
 undermin+0.212
construct+0.210
cler+0.207
urers+0.206
Neg logit contributions
theless-0.230
-0.192
 DAC-0.181
 DOI-0.178
 detention-0.168
Token Unt
Token ID26970
Feature activation-1.10e+00

Pos logit contributions
 tradem+0.294
 undermin+0.226
construct+0.217
urers+0.216
cler+0.215
Neg logit contributions
theless-0.293
-0.255
 DAC-0.236
 DOI-0.233
-0.225
Token supporters
Token ID5941
Feature activation-9.03e-01

Pos logit contributions
theless+0.310
+0.273
 DAC+0.256
 DOI+0.252
+0.243
Neg logit contributions
 tradem-0.269
 undermin-0.202
construct-0.191
urers-0.190
cler-0.190
Token and
Token ID290
Feature activation-1.13e-02

Pos logit contributions
theless+0.684
+0.590
 DAC+0.541
 DOI+0.529
AI+0.502
Neg logit contributions
 tradem-0.914
 undermin-0.726
urers-0.694
construct-0.694
cler-0.691
Token the
Token ID262
Feature activation+1.33e-01

Pos logit contributions
theless+0.010
+0.009
 DAC+0.008
 DOI+0.008
+0.008
Neg logit contributions
 tradem-0.010
 undermin-0.008
construct-0.007
urers-0.007
cler-0.007
Token Club
Token ID6289
Feature activation-2.69e+00

Pos logit contributions
 tradem+0.119
 undermin+0.092
construct+0.088
urers+0.087
cler+0.087
Neg logit contributions
theless-0.116
-0.101
 DAC-0.094
 DOI-0.092
-0.089
Token.
Token ID13
Feature activation-4.90e-01

Pos logit contributions
theless+2.032
+1.758
 DAC+1.609
 DOI+1.567
AI+1.491
Neg logit contributions
 tradem-2.718
 undermin-2.159
urers-2.064
construct-2.063
 newcom-2.050
Token
Token ID198
Feature activation+5.05e-01

Pos logit contributions
theless+0.386
+0.328
 DAC+0.300
 DOI+0.295
+0.283
Neg logit contributions
 tradem-0.477
 undermin-0.376
construct-0.368
urers-0.365
cler-0.364
Token
Token ID198
Feature activation+2.17e-01

Pos logit contributions
 tradem+0.448
construct+0.362
cler+0.358
urers+0.353
 spoilers+0.350
Neg logit contributions
theless-0.428
-0.349
 DAC-0.329
 DOI-0.320
-0.308
Token"
Token ID1
Feature activation-8.93e-01

Pos logit contributions
 tradem+0.244
 undermin+0.199
urers+0.191
construct+0.190
cler+0.190
Neg logit contributions
theless-0.141
-0.119
 DAC-0.106
 DOI-0.103
-0.098
TokenThe
Token ID464
Feature activation+4.44e-01

Pos logit contributions
theless+0.714
+0.607
 DAC+0.561
 DOI+0.552
+0.525
Neg logit contributions
 tradem-0.862
 undermin-0.679
construct-0.662
urers-0.657
cler-0.657
Token Trust
Token ID9870
Feature activation+2.31e+00

Pos logit contributions
 tradem+0.417
 undermin+0.324
construct+0.312
urers+0.311
cler+0.309
Neg logit contributions
theless-0.371
-0.321
 DAC-0.295
 DOI-0.291
-0.280
Token is
Token ID318
Feature activation+1.15e+00

Pos logit contributions
 tradem+2.133
 undermin+1.668
construct+1.619
urers+1.618
cler+1.607
Neg logit contributions
theless-1.923
-1.641
 DAC-1.516
 DOI-1.501
-1.448
Token are
Token ID389
Feature activation-2.75e+00

Pos logit contributions
theless+0.096
+0.083
 DAC+0.077
 DOI+0.075
+0.072
Neg logit contributions
 tradem-0.111
 undermin-0.086
construct-0.083
urers-0.083
cler-0.082
Token different
Token ID1180
Feature activation-8.41e-01

Pos logit contributions
theless+2.299
+1.989
 DAC+1.830
 DOI+1.809
+1.729
Neg logit contributions
 tradem-2.588
 undermin-1.995
cler-1.912
construct-1.908
urers-1.905
Token here
Token ID994
Feature activation+1.15e+00

Pos logit contributions
theless+0.626
+0.534
 DAC+0.487
 DOI+0.479
+0.455
Neg logit contributions
 tradem-0.864
 undermin-0.690
construct-0.663
urers-0.662
cler-0.660
Token,
Token ID11
Feature activation-8.03e-01

Pos logit contributions
 tradem+1.204
 undermin+0.966
construct+0.931
urers+0.929
cler+0.927
Neg logit contributions
theless-0.831
-0.702
 DAC-0.640
 DOI-0.627
-0.596
Token the
Token ID262
Feature activation+1.87e-02

Pos logit contributions
theless+0.650
+0.560
 DAC+0.517
 DOI+0.509
+0.486
Neg logit contributions
 tradem-0.774
 undermin-0.607
construct-0.581
urers-0.580
cler-0.578
Token approach
Token ID3164
Feature activation-1.25e+00

Pos logit contributions
 tradem+0.017
 undermin+0.013
construct+0.012
urers+0.012
cler+0.012
Neg logit contributions
theless-0.016
-0.014
 DAC-0.013
 DOI-0.013
-0.013
Token feels
Token ID5300
Feature activation-1.62e+00

Pos logit contributions
theless+0.991
+0.855
 DAC+0.785
 DOI+0.771
+0.738
Neg logit contributions
 tradem-1.219
 undermin-0.958
construct-0.918
urers-0.915
cler-0.914
Token worn
Token ID12666
Feature activation+1.18e+00

Pos logit contributions
theless+1.371
+1.191
 DAC+1.103
 DOI+1.086
+1.041
Neg logit contributions
 tradem-1.507
 undermin-1.164
construct-1.110
cler-1.109
urers-1.108
Token out
Token ID503
Feature activation+2.74e+00

Pos logit contributions
 tradem+1.155
 undermin+0.912
construct+0.875
urers+0.874
cler+0.872
Neg logit contributions
theless-0.930
-0.795
 DAC-0.734
 DOI-0.720
-0.687
Token...
Token ID986
Feature activation+4.78e-01

Pos logit contributions
 tradem+2.606
 undermin+2.051
construct+1.976
cler+1.974
urers+1.965
Neg logit contributions
theless-2.220
-1.890
 DAC-1.764
 DOI-1.726
-1.649
Token I
Token ID314
Feature activation-9.20e-01

Pos logit contributions
 tradem+0.467
 undermin+0.369
construct+0.355
urers+0.354
cler+0.354
Neg logit contributions
theless-0.377
-0.324
 DAC-0.298
 DOI-0.292
-0.279
Token
Token ID447
Feature activation-2.11e+00

Pos logit contributions
theless+2.645
+2.278
 DAC+2.058
 DOI+2.017
+1.921
Neg logit contributions
 tradem-3.751
 undermin-3.009
urers-2.887
construct-2.883
cler-2.871
Token
Token ID247
Feature activation-2.32e+00

Pos logit contributions
theless+1.307
+1.045
 DAC+0.939
 DOI+0.917
+0.857
Neg logit contributions
 tradem-2.436
 undermin-1.991
construct-1.945
cler-1.940
urers-1.938
Tokens
Token ID82
Feature activation-1.51e+00

Pos logit contributions
theless+2.317
+2.064
 DAC+1.931
 DOI+1.902
+1.842
Neg logit contributions
 tradem-1.798
 undermin-1.318
urers-1.239
construct-1.237
cler-1.231
Token Parliament
Token ID8411
Feature activation-3.14e+00

Pos logit contributions
theless+1.287
+1.121
 DAC+1.038
 DOI+1.020
+0.978
Neg logit contributions
 tradem-1.397
 undermin-1.084
construct-1.034
urers-1.033
cler-1.028
Token to
Token ID284
Feature activation-1.52e+00

Pos logit contributions
theless+2.453
+2.127
 DAC+1.937
 DOI+1.910
+1.823
Neg logit contributions
 tradem-3.100
 undermin-2.455
urers-2.342
construct-2.341
cler-2.331
Token change
Token ID1487
Feature activation-1.17e+00

Pos logit contributions
theless+1.324
+1.161
 DAC+1.076
 DOI+1.060
+1.015
Neg logit contributions
 tradem-1.366
 undermin-1.049
urers-1.002
construct-1.000
cler-0.997
Token the
Token ID262
Feature activation-2.46e+00

Pos logit contributions
theless+0.989
+0.863
 DAC+0.798
 DOI+0.783
+0.750
Neg logit contributions
 tradem-1.084
 undermin-0.843
construct-0.805
urers-0.805
cler-0.799
Token 1978
Token ID15524
Feature activation-4.89e+00

Pos logit contributions
theless+2.159
+1.893
 DAC+1.761
 DOI+1.732
+1.664
Neg logit contributions
 tradem-2.191
 undermin-1.683
urers-1.603
construct-1.602
cler-1.590
Token Spanish
Token ID7897
Feature activation-3.10e+00

Pos logit contributions
theless+3.939
+3.504
 DAC+3.224
 DOI+3.136
 detention+2.999
Neg logit contributions
 tradem-4.640
 undermin-3.635
urers-3.479
construct-3.465
cler-3.442
Token Constitution
Token ID7965
Feature activation-2.94e+00

Pos logit contributions
theless+2.500
+2.192
 DAC+2.023
 DOI+1.983
+1.886
Neg logit contributions
 tradem-2.968
 undermin-2.332
urers-2.225
construct-2.220
cler-2.213
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
theless+2.222
+1.926
 DAC+1.746
 DOI+1.725
+1.632
Neg logit contributions
 tradem-2.979
 undermin-2.374
urers-2.278
construct-2.267
cler-2.262
Token across
Token ID1973
Feature activation+3.29e-01

Pos logit contributions
theless+0.173
+0.149
 DAC+0.138
 DOI+0.135
+0.130
Neg logit contributions
 tradem-0.205
 undermin-0.161
construct-0.155
urers-0.154
cler-0.154
Token all
Token ID477
Feature activation-6.06e-01

Pos logit contributions
 tradem+0.293
 undermin+0.224
construct+0.214
urers+0.214
cler+0.214
Neg logit contributions
theless-0.291
-0.252
 DAC-0.234
 DOI-0.231
-0.224
Token women
Token ID1466
Feature activation-5.24e-01

Pos logit contributions
theless+0.547
+0.479
 DAC+0.445
 DOI+0.439
+0.424
Neg logit contributions
 tradem-0.527
 undermin-0.401
construct-0.382
urers-0.381
cler-0.381
Token was
Token ID373
Feature activation-2.64e+00

Pos logit contributions
theless+0.398
+0.340
 DAC+0.311
 DOI+0.305
+0.291
Neg logit contributions
 tradem-0.530
 undermin-0.422
construct-0.405
urers-0.404
cler-0.403
Token the
Token ID262
Feature activation-3.42e+00

Pos logit contributions
theless+2.183
+1.902
 DAC+1.752
 DOI+1.718
+1.643
Neg logit contributions
 tradem-2.499
 undermin-1.949
construct-1.867
urers-1.863
cler-1.851
Token vaginal
Token ID30131
Feature activation-4.22e+00

Pos logit contributions
theless+2.698
+2.329
 DAC+2.142
 DOI+2.101
+2.001
Neg logit contributions
 tradem-3.365
 undermin-2.651
urers-2.542
construct-2.539
cler-2.525
Token microbiome
Token ID48465
Feature activation-3.20e-01

Pos logit contributions
theless+3.423
+3.036
 DAC+2.784
 DOI+2.734
+2.585
Neg logit contributions
 tradem-4.011
 undermin-3.148
urers-3.023
construct-2.999
cler-2.980
Token.
Token ID13
Feature activation-7.03e-01

Pos logit contributions
theless+0.251
+0.216
 DAC+0.198
 DOI+0.195
+0.186
Neg logit contributions
 tradem-0.316
 undermin-0.250
construct-0.239
urers-0.239
cler-0.238
Token
Token ID198
Feature activation-3.85e-01

Pos logit contributions
theless+0.567
+0.480
 DAC+0.441
 DOI+0.434
+0.419
Neg logit contributions
 tradem-0.668
 undermin-0.525
construct-0.514
cler-0.512
urers-0.510
Token
Token ID198
Feature activation-4.90e-01

Pos logit contributions
theless+0.312
+0.260
 DAC+0.242
 DOI+0.238
 detention+0.227
Neg logit contributions
 tradem-0.363
construct-0.284
cler-0.284
 undermin-0.282
urers-0.281
TokenSur
Token ID14214
Feature activation+1.15e+00

Pos logit contributions
theless+0.402
+0.340
 DAC+0.317
 DOI+0.312
+0.297
Neg logit contributions
 tradem-0.462
 undermin-0.362
construct-0.355
cler-0.353
urers-0.350
Token.
Token ID13
Feature activation+8.22e-01

Pos logit contributions
theless+1.052
+0.908
 DAC+0.833
 DOI+0.820
+0.784
Neg logit contributions
 tradem-1.287
 undermin-1.013
construct-0.972
urers-0.969
cler-0.964
Token
Token ID447
Feature activation+7.68e-01

Pos logit contributions
 tradem+0.849
 undermin+0.680
construct+0.669
cler+0.664
urers+0.663
Neg logit contributions
theless-0.593
-0.491
 DAC-0.448
 DOI-0.436
-0.417
Token
Token ID251
Feature activation+5.29e-01

Pos logit contributions
 tradem+0.753
 undermin+0.593
construct+0.570
urers+0.568
cler+0.567
Neg logit contributions
theless-0.607
-0.525
 DAC-0.481
 DOI-0.473
-0.452
Token She
Token ID1375
Feature activation-1.46e+00

Pos logit contributions
 tradem+0.514
 undermin+0.406
construct+0.396
cler+0.394
urers+0.393
Neg logit contributions
theless-0.415
-0.353
 DAC-0.325
 DOI-0.317
-0.304
Token also
Token ID635
Feature activation-2.50e+00

Pos logit contributions
theless+1.174
+1.012
 DAC+0.928
 DOI+0.913
+0.875
Neg logit contributions
 tradem-1.424
 undermin-1.119
construct-1.071
urers-1.069
cler-1.065
Token expresses
Token ID27505
Feature activation-4.08e+00

Pos logit contributions
theless+2.131
+1.857
 DAC+1.707
 DOI+1.686
+1.618
Neg logit contributions
 tradem-2.319
 undermin-1.792
construct-1.704
urers-1.704
cler-1.695
Token admiration
Token ID28506
Feature activation-7.37e-01

Pos logit contributions
theless+3.368
+2.988
 DAC+2.753
 DOI+2.696
 detention+2.575
Neg logit contributions
 tradem-3.824
 undermin-2.947
 newcom-2.803
construct-2.800
urers-2.797
Token for
Token ID329
Feature activation-5.94e-01

Pos logit contributions
theless+0.594
+0.515
 DAC+0.473
 DOI+0.464
+0.445
Neg logit contributions
 tradem-0.713
 undermin-0.559
construct-0.537
urers-0.536
cler-0.533
Token her
Token ID607
Feature activation-7.41e-01

Pos logit contributions
theless+0.526
+0.460
 DAC+0.427
 DOI+0.421
+0.405
Neg logit contributions
 tradem-0.527
 undermin-0.404
construct-0.385
urers-0.384
cler-0.383
Token mother
Token ID2802
Feature activation+1.77e+00

Pos logit contributions
theless+0.664
+0.581
 DAC+0.540
 DOI+0.532
+0.512
Neg logit contributions
 tradem-0.648
 undermin-0.495
construct-0.472
urers-0.471
cler-0.470
Token
Token ID447
Feature activation-4.13e-01

Pos logit contributions
 tradem+1.793
 undermin+1.427
cler+1.374
construct+1.373
urers+1.369
Neg logit contributions
theless-1.337
-1.123
 DAC-1.038
 DOI-1.019
-0.963
Token to
Token ID284
Feature activation+9.29e-02

Pos logit contributions
 tradem+1.539
 undermin+1.216
urers+1.174
cler+1.170
construct+1.170
Neg logit contributions
theless-1.224
-1.034
 DAC-0.963
 DOI-0.941
AI-0.895
Token save
Token ID3613
Feature activation-1.71e+00

Pos logit contributions
 tradem+0.086
 undermin+0.067
construct+0.064
urers+0.064
cler+0.064
Neg logit contributions
theless-0.079
-0.068
 DAC-0.063
 DOI-0.062
-0.060
Token her
Token ID607
Feature activation-1.63e+00

Pos logit contributions
theless+1.428
+1.245
 DAC+1.150
 DOI+1.132
+1.079
Neg logit contributions
 tradem-1.598
 undermin-1.243
construct-1.192
urers-1.187
cler-1.178
Token son
Token ID3367
Feature activation+2.15e-02

Pos logit contributions
theless+1.397
+1.220
 DAC+1.130
 DOI+1.112
+1.066
Neg logit contributions
 tradem-1.486
 undermin-1.147
construct-1.098
urers-1.095
cler-1.089
Token to
Token ID284
Feature activation+1.07e+00

Pos logit contributions
 tradem+0.022
 undermin+0.017
construct+0.017
urers+0.017
cler+0.017
Neg logit contributions
theless-0.016
-0.014
 DAC-0.013
 DOI-0.012
-0.012
Token ber
Token ID18157
Feature activation-3.58e+00

Pos logit contributions
 tradem+0.980
 undermin+0.764
construct+0.730
cler+0.727
urers+0.725
Neg logit contributions
theless-0.908
-0.788
 DAC-0.729
 DOI-0.716
-0.687
Tokenating
Token ID803
Feature activation-4.65e-01

Pos logit contributions
theless+3.306
+2.972
 DAC+2.702
 DOI+2.686
+2.554
Neg logit contributions
 tradem-3.052
 undermin-2.312
 newcom-2.168
urers-2.130
construct-2.126
Token herself
Token ID5223
Feature activation-2.42e-01

Pos logit contributions
theless+0.397
+0.343
 DAC+0.317
 DOI+0.312
+0.301
Neg logit contributions
 tradem-0.426
 undermin-0.331
cler-0.316
urers-0.316
construct-0.315
Token for
Token ID329
Feature activation+3.33e-01

Pos logit contributions
theless+0.197
+0.169
 DAC+0.156
 DOI+0.154
+0.147
Neg logit contributions
 tradem-0.232
 undermin-0.182
construct-0.175
urers-0.175
cler-0.174
Token letting
Token ID9616
Feature activation+1.96e+00

Pos logit contributions
 tradem+0.309
 undermin+0.242
construct+0.231
cler+0.230
urers+0.229
Neg logit contributions
theless-0.280
-0.242
 DAC-0.224
 DOI-0.221
-0.211
Token this
Token ID428
Feature activation-1.25e-01

Pos logit contributions
 tradem+1.785
 undermin+1.384
cler+1.324
urers+1.322
construct+1.319
Neg logit contributions
theless-1.672
-1.441
 DAC-1.339
 DOI-1.310
-1.263
Token
Token ID447
Feature activation+9.20e-01

Pos logit contributions
theless+0.138
+0.118
 DAC+0.108
 DOI+0.106
+0.102
Neg logit contributions
 tradem-0.178
 undermin-0.141
construct-0.135
urers-0.135
cler-0.135
Token
Token ID247
Feature activation+6.45e-01

Pos logit contributions
 tradem+0.854
 undermin+0.663
construct+0.635
urers+0.633
cler+0.631
Neg logit contributions
theless-0.777
-0.675
 DAC-0.624
 DOI-0.614
-0.589
Token
Token ID198
Feature activation-4.12e-01

Pos logit contributions
 tradem+0.632
 undermin+0.500
construct+0.484
urers+0.482
cler+0.481
Neg logit contributions
theless-0.505
-0.431
 DAC-0.399
 DOI-0.389
-0.372
Token
Token ID198
Feature activation-7.47e-01

Pos logit contributions
theless+0.340
+0.281
 DAC+0.264
 DOI+0.256
+0.246
Neg logit contributions
 tradem-0.380
construct-0.302
cler-0.300
urers-0.297
 spoilers-0.293
Token
Token ID447
Feature activation-6.32e-01

Pos logit contributions
theless+0.544
+0.457
 DAC+0.419
 DOI+0.409
+0.388
Neg logit contributions
 tradem-0.779
 undermin-0.625
construct-0.605
urers-0.602
cler-0.602
Token
Token ID246
Feature activation-4.01e+00

Pos logit contributions
theless+0.642
+0.576
 DAC+0.539
 DOI+0.533
+0.516
Neg logit contributions
 tradem-0.477
 undermin-0.345
construct-0.326
urers-0.324
cler-0.322
TokenWhen
Token ID2215
Feature activation-4.12e-01

Pos logit contributions
theless+3.031
+2.639
 DAC+2.392
 DOI+2.355
AI+2.271
Neg logit contributions
 tradem-4.085
 undermin-3.243
 newcom-3.080
urers-3.065
cler-3.034
Token we
Token ID356
Feature activation+3.06e-01

Pos logit contributions
theless+0.326
+0.278
 DAC+0.256
 DOI+0.251
+0.241
Neg logit contributions
 tradem-0.403
 undermin-0.319
construct-0.307
urers-0.306
cler-0.305
Token get
Token ID651
Feature activation+1.81e-01

Pos logit contributions
 tradem+0.278
 undermin+0.216
construct+0.210
urers+0.209
cler+0.209
Neg logit contributions
theless-0.261
-0.222
 DAC-0.209
 DOI-0.204
-0.196
Token the
Token ID262
Feature activation-2.37e-01

Pos logit contributions
 tradem+0.173
 undermin+0.136
construct+0.131
urers+0.130
cler+0.130
Neg logit contributions
theless-0.148
-0.126
 DAC-0.116
 DOI-0.114
-0.110
Token down
Token ID866
Feature activation-3.14e-01

Pos logit contributions
theless+0.228
+0.201
 DAC+0.188
 DOI+0.186
+0.180
Neg logit contributions
 tradem-0.191
 undermin-0.142
construct-0.135
urers-0.134
cler-0.134
Token goal
Token ID3061
Feature activation-1.06e+00

Pos logit contributions
 tradem+1.686
 undermin+1.313
construct+1.253
cler+1.249
urers+1.249
Neg logit contributions
theless-1.557
-1.332
 DAC-1.237
 DOI-1.226
-1.178
Token,
Token ID11
Feature activation-4.00e-01

Pos logit contributions
theless+0.847
+0.730
 DAC+0.670
 DOI+0.659
+0.629
Neg logit contributions
 tradem-1.035
 undermin-0.815
construct-0.780
urers-0.779
cler-0.777
Token playing
Token ID2712
Feature activation+4.19e-01

Pos logit contributions
theless+0.329
+0.285
 DAC+0.263
 DOI+0.258
+0.247
Neg logit contributions
 tradem-0.380
 undermin-0.297
construct-0.284
urers-0.284
cler-0.283
Token in
Token ID287
Feature activation+1.65e-01

Pos logit contributions
 tradem+0.384
 undermin+0.298
construct+0.285
urers+0.284
cler+0.283
Neg logit contributions
theless-0.358
-0.311
 DAC-0.288
 DOI-0.283
-0.272
Token the
Token ID262
Feature activation-3.51e-01

Pos logit contributions
 tradem+0.150
 undermin+0.115
construct+0.110
urers+0.110
cler+0.110
Neg logit contributions
theless-0.143
-0.125
 DAC-0.116
 DOI-0.114
-0.110
Token hitherto
Token ID45053
Feature activation-4.19e+00

Pos logit contributions
theless+0.305
+0.266
 DAC+0.246
 DOI+0.242
+0.233
Neg logit contributions
 tradem-0.318
 undermin-0.245
construct-0.234
urers-0.233
cler-0.232
Token disappointing
Token ID17185
Feature activation-1.29e+00

Pos logit contributions
theless+3.391
+2.962
 DAC+2.711
 DOI+2.650
+2.544
Neg logit contributions
 tradem-4.023
 undermin-3.146
urers-3.007
construct-2.994
cler-2.980
Token C
Token ID327
Feature activation-5.26e-02

Pos logit contributions
theless+1.117
+0.974
 DAC+0.903
 DOI+0.888
+0.852
Neg logit contributions
 tradem-1.178
 undermin-0.910
construct-0.868
urers-0.866
cler-0.863
Tokenatter
Token ID1436
Feature activation-6.13e-01

Pos logit contributions
theless+0.044
+0.038
 DAC+0.035
 DOI+0.034
+0.033
Neg logit contributions
 tradem-0.050
 undermin-0.039
construct-0.037
urers-0.037
cler-0.037
Tokenm
Token ID76
Feature activation-7.79e-01

Pos logit contributions
theless+0.466
+0.402
 DAC+0.367
 DOI+0.364
AI+0.345
Neg logit contributions
 tradem-0.621
 undermin-0.496
 newcom-0.466
construct-0.466
urers-0.465
Tokenole
Token ID2305
Feature activation+1.51e+00

Pos logit contributions
theless+0.740
+0.661
 DAC+0.610
 DOI+0.604
AI+0.583
Neg logit contributions
 tradem-0.642
 undermin-0.479
cler-0.444
 newcom-0.444
construct-0.444