TOP ACTIVATIONS
MAX = 1.612

vibration need not be of major import ance for
of sleep is having a major impact on workers with over
unemployment and wages will have an influence on prices over and
Ichigo m umbled to himself as he watched his good friend
are appropriate to make in an interview about improvis ation .
all that stuff has an impact on our long -
itability about what happens in an improvis ation , while the
hasn 't had more of an impact on prices .
Manchester United [ 95 ][ 96 ] International
said these would have a negative impact on current peace talks
hand in today s time . The ultimate objective
This version had the most motion stems from
000 each are having an effect . Many who are
the D NR commissioner who set our research on the road
nuclear weapons viewed in exactly this light before . <|endoftext|> The
. On the way , they encounter Far am ir ,
! Ichigo yelled to Chad s far
8 { ( 113 s )} 56 . f 5 {
that is going to have significant implications for things like trade
conversion to L NG has major drawbacks . Firstly , it

MIN ACTIVATIONS
MIN = -1.518

to keep this contest in - un iverse . If you
= c v 2 . get Struct uring Element ( cv
sum of money ," the bench said . "
[ return address ] | + -------------------- --+ + -------------------- --+
# initialize a rectangular and square struct uring kernel rect K
- Cook man University celebrated its hundred th anniversary from its
of the state ," the bench said . The
from an out - of - comp etition test conducted on
can be strengthened over time with practice . But this ability
from the 1980 s to the 1990 s , his family
refusal to attend a meeting , Pu ig dem ont said
bank operates ," Z et i said at a media briefing
: Frank Miller . How could this go wrong ?!
We built it entirely in - house ? the hardware and
| void * arg | + -------------------- --+ | [ return
v 2 . im read ( image Path ) image =
work with our civil servants ," he said .
With the Parliament being in session , the cant een
[ return address ] | + -------------------- --+ + -------------------- --+
| void * arg | + -------------------- --+ | [ return

INTERVAL 0.830 - 1.612
CONTAINS 0.491%

importance in today s scenario ? Today
( commit ) Levi Jones LB 6 ' 3 / 215
* Tim Read triumph s as New Zealand 's Master Che
shot and killed in 1982 as officers from the Anaheim and
Have you ever gone to see a movie that

INTERVAL 0.047 - 0.830
CONTAINS 43.684%

they said models on Monday afternoon shifted in response to
According to num ism at ist Neil Car other s ,
Video Add ons -> Netflix . Finally
influence that has crept into me in my own weird way
is it that GOP officials are trying to ram through a

INTERVAL -0.736 - 0.047
CONTAINS 54.870%

B AC limits even further , to 0 . 05 %,
s own clan or tribe . For example , El j
. 158 10 95 228 x . x . x .
investigation found that , since the mid - 1990 s ,
people . Its exports to Saudi Arabia total $ 1 .

INTERVAL -1.518 - -0.736
CONTAINS 0.955%

T ER TI P ART IS .
: Frank Miller . How could this go wrong ?!
made , and where they might fit with the Canadiens .
punishing the company after Dish lost an appeal in a multi
benefits for low - income and middle - class Canadians .
Token vibration
Token ID30999
Feature activation-3.33e-01

Pos logit contributions
gement+3.887
llah+3.746
gment+3.577
Brit+3.510
Asset+3.412
Neg logit contributions
 Duo-3.700
enf-3.517
agra-3.457
 Phi-3.407
eny-3.359
Token need
Token ID761
Feature activation+4.15e-02

Pos logit contributions
gement+3.543
llah+3.354
gment+3.196
Brit+3.145
Asset+3.028
Neg logit contributions
 Duo-3.741
enf-3.568
agra-3.504
 Phi-3.489
eny-3.405
Token not
Token ID407
Feature activation+2.17e-01

Pos logit contributions
 Duo+0.446
enf+0.422
 Phi+0.413
agra+0.410
eny+0.403
Neg logit contributions
gement-0.479
llah-0.460
gment-0.440
Brit-0.435
Asset-0.421
Token be
Token ID307
Feature activation+4.12e-01

Pos logit contributions
 Duo+2.179
enf+2.035
 Phi+1.991
agra+1.983
eny+1.963
Neg logit contributions
gement-2.626
llah-2.534
gment-2.414
Brit-2.408
Asset-2.359
Token of
Token ID286
Feature activation+8.09e-01

Pos logit contributions
 Duo+4.288
enf+4.024
 Phi+3.915
agra+3.907
eny+3.852
Neg logit contributions
gement-4.719
llah-4.615
Brit-4.389
gment-4.303
Asset-4.155
Token major
Token ID1688
Feature activation+1.61e+00

Pos logit contributions
 Duo+7.120
enf+6.548
agra+6.403
 Phi+6.348
eny+6.154
Neg logit contributions
gement-9.582
llah-9.347
Brit-9.106
Asset-8.888
gment-8.735
Token
Token ID198
Feature activation+3.87e-01

Pos logit contributions
 Duo+12.336
 cf+11.635
enf+11.249
agra+10.920
 hur+10.864
Neg logit contributions
llah-15.938
gement-14.905
Brit-14.858
Asset-14.365
vironment-13.996
Token
Token ID198
Feature activation+1.17e+00

Pos logit contributions
 Duo+3.589
enf+3.346
 Phi+3.323
agra+3.205
cum+3.191
Neg logit contributions
gement-4.922
llah-4.725
gment-4.504
��-4.476
Brit-4.367
Tokenimport
Token ID11748
Feature activation+7.36e-02

Pos logit contributions
enf+11.854
cum+11.227
if+10.818
eny+10.817
agra+10.631
Neg logit contributions
llah-11.090
gement-10.280
��-9.725
Brit-9.480
vironment-9.201
Tokenance
Token ID590
Feature activation+4.77e-01

Pos logit contributions
 Duo+0.689
enf+0.648
 Phi+0.629
agra+0.626
eny+0.613
Neg logit contributions
gement-0.949
llah-0.919
gment-0.879
Brit-0.873
Asset-0.851
Token for
Token ID329
Feature activation+3.11e-01

Pos logit contributions
 Duo+4.491
enf+4.218
 Phi+4.104
agra+4.034
eny+3.981
Neg logit contributions
gement-5.883
llah-5.691
Brit-5.460
gment-5.450
Asset-5.318
Token of
Token ID286
Feature activation-5.60e-01

Pos logit contributions
gement+1.395
llah+1.339
gment+1.270
Brit+1.254
Asset+1.209
Neg logit contributions
 Duo-1.447
enf-1.368
 Phi-1.350
agra-1.344
eny-1.310
Token sleep
Token ID3993
Feature activation-1.65e-01

Pos logit contributions
gement+5.762
llah+5.447
Brit+5.173
gment+5.152
Asset+4.904
Neg logit contributions
 Duo-6.159
enf-6.013
agra-6.012
 Phi-5.812
eny-5.756
Token is
Token ID318
Feature activation-1.14e-02

Pos logit contributions
gement+1.832
llah+1.738
gment+1.669
Brit+1.645
Asset+1.580
Neg logit contributions
 Duo-1.851
enf-1.743
 Phi-1.716
agra-1.693
eny-1.667
Token having
Token ID1719
Feature activation+9.50e-01

Pos logit contributions
gement+0.129
llah+0.124
gment+0.118
Brit+0.117
Asset+0.113
Neg logit contributions
 Duo-0.125
enf-0.118
 Phi-0.116
agra-0.115
eny-0.113
Token a
Token ID257
Feature activation+1.23e+00

Pos logit contributions
 Duo+7.563
 Phi+6.875
enf+6.812
 hur+6.811
agra+6.807
Neg logit contributions
gement-11.522
llah-11.239
Asset-10.767
gment-10.726
��-10.692
Token major
Token ID1688
Feature activation+1.56e+00

Pos logit contributions
 Duo+9.468
 hur+8.722
 Phi+8.676
 cf+8.296
 dynam+8.295
Neg logit contributions
gement-13.836
llah-13.786
Asset-13.461
ighting-13.069
Brit-13.047
Token impact
Token ID2928
Feature activation+3.69e-01

Pos logit contributions
 Duo+10.053
 hur+9.332
 cf+8.984
 Phi+8.957
cum+8.806
Neg logit contributions
llah-17.534
gement-16.538
Asset-16.532
Brit-16.498
ighting-16.150
Token on
Token ID319
Feature activation+3.15e-02

Pos logit contributions
 Duo+3.487
enf+3.263
agra+3.191
 Phi+3.183
eny+3.145
Neg logit contributions
gement-4.575
llah-4.444
gment-4.256
Asset-4.141
Brit-4.111
Token workers
Token ID3259
Feature activation+7.43e-02

Pos logit contributions
 Duo+0.334
enf+0.315
 Phi+0.308
agra+0.306
eny+0.300
Neg logit contributions
gement-0.369
llah-0.355
gment-0.340
Brit-0.335
Asset-0.325
Token with
Token ID351
Feature activation+1.34e-01

Pos logit contributions
 Duo+0.794
enf+0.751
 Phi+0.733
agra+0.727
eny+0.717
Neg logit contributions
gement-0.859
llah-0.828
gment-0.793
Brit-0.779
Asset-0.760
Token over
Token ID625
Feature activation-6.32e-02

Pos logit contributions
 Duo+1.493
enf+1.411
 Phi+1.386
agra+1.375
eny+1.347
Neg logit contributions
gement-1.485
llah-1.426
gment-1.359
Brit-1.345
Asset-1.307
Token unemployment
Token ID10681
Feature activation-2.82e-01

Pos logit contributions
gement+4.160
llah+3.970
gment+3.780
Brit+3.709
Asset+3.675
Neg logit contributions
 Duo-4.243
enf-4.043
agra-3.939
 Phi-3.929
eny-3.852
Token and
Token ID290
Feature activation-2.71e-01

Pos logit contributions
gement+3.109
llah+2.920
gment+2.795
Brit+2.742
Asset+2.647
Neg logit contributions
 Duo-3.203
enf-2.995
agra-2.980
 Phi-2.961
eny-2.881
Token wages
Token ID9400
Feature activation-2.76e-01

Pos logit contributions
gement+3.003
llah+2.865
gment+2.738
Brit+2.682
Asset+2.642
Neg logit contributions
 Duo-3.003
enf-2.843
agra-2.786
 Phi-2.781
eny-2.705
Token will
Token ID481
Feature activation+1.96e-02

Pos logit contributions
gement+2.783
llah+2.600
gment+2.515
Brit+2.448
Asset+2.365
Neg logit contributions
 Duo-3.357
enf-3.208
 Phi-3.117
agra-3.111
eny-3.018
Token have
Token ID423
Feature activation+4.88e-01

Pos logit contributions
 Duo+0.205
enf+0.193
 Phi+0.189
agra+0.188
eny+0.184
Neg logit contributions
gement-0.233
llah-0.224
gment-0.215
Brit-0.212
Asset-0.206
Token an
Token ID281
Feature activation+1.55e+00

Pos logit contributions
 Duo+4.545
enf+4.277
 Phi+4.222
agra+4.164
eny+4.085
Neg logit contributions
gement-5.966
llah-5.891
gment-5.512
Brit-5.498
Asset-5.304
Token influence
Token ID4588
Feature activation-7.37e-02

Pos logit contributions
 Duo+10.071
eny+9.850
eless+9.744
enf+9.742
 hur+9.245
Neg logit contributions
Brit-16.882
gement-16.707
llah-16.647
��-15.925
gment-15.753
Token on
Token ID319
Feature activation+1.14e-01

Pos logit contributions
gement+0.828
llah+0.794
gment+0.757
Brit+0.748
Asset+0.722
Neg logit contributions
 Duo-0.816
enf-0.773
 Phi-0.756
agra-0.751
eny-0.735
Token prices
Token ID4536
Feature activation-1.88e-01

Pos logit contributions
 Duo+1.265
enf+1.196
 Phi+1.179
agra+1.165
eny+1.148
Neg logit contributions
gement-1.261
llah-1.212
gment-1.156
Brit-1.141
Asset-1.097
Token over
Token ID625
Feature activation+4.18e-01

Pos logit contributions
gement+2.035
llah+1.943
gment+1.860
Brit+1.823
Asset+1.766
Neg logit contributions
 Duo-2.143
enf-2.016
 Phi-1.982
agra-1.978
eny-1.921
Token and
Token ID290
Feature activation+1.29e-01

Pos logit contributions
 Duo+3.739
 Phi+3.466
enf+3.432
agra+3.427
eny+3.375
Neg logit contributions
gement-5.249
llah-5.156
gment-4.942
Brit-4.892
Asset-4.668
Token Ichigo
Token ID44547
Feature activation+1.41e-01

Pos logit contributions
 Duo+5.727
 Phi+5.436
enf+5.297
eny+5.030
 Solo+4.972
Neg logit contributions
gement-6.165
llah-5.804
gment-5.716
Brit-5.460
Asset-5.434
Token m
Token ID285
Feature activation-3.48e-01

Pos logit contributions
 Duo+1.538
enf+1.445
 Phi+1.424
agra+1.396
eny+1.388
Neg logit contributions
gement-1.614
llah-1.552
gment-1.482
Brit-1.464
Asset-1.434
Tokenumbled
Token ID11137
Feature activation-8.73e-02

Pos logit contributions
gement+2.913
llah+2.655
gment+2.539
Brit+2.430
Asset+2.306
Neg logit contributions
 Duo-4.824
enf-4.566
 Phi-4.554
agra-4.500
eny-4.388
Token to
Token ID284
Feature activation+1.21e+00

Pos logit contributions
gement+0.978
llah+0.937
gment+0.895
Brit+0.886
Asset+0.854
Neg logit contributions
 Duo-0.962
enf-0.918
agra-0.896
 Phi-0.891
eny-0.874
Token himself
Token ID2241
Feature activation-2.81e-01

Pos logit contributions
 Duo+10.511
 Phi+10.315
eless+9.309
eny+9.278
enf+9.267
Neg logit contributions
gement-13.208
gment-12.353
llah-12.273
Brit-12.084
Asset-12.043
Token as
Token ID355
Feature activation+1.53e+00

Pos logit contributions
gement+2.959
llah+2.833
gment+2.708
Brit+2.664
Asset+2.547
Neg logit contributions
 Duo-3.222
enf-3.080
agra-3.029
 Phi-3.002
eny-2.924
Token he
Token ID339
Feature activation+5.63e-01

Pos logit contributions
 Phi+11.538
 Duo+11.538
if+10.372
eless+10.291
enf+10.098
Neg logit contributions
gement-16.487
llah-15.751
gment-15.285
Brit-14.878
Asset-14.840
Token watched
Token ID7342
Feature activation+8.71e-01

Pos logit contributions
 Duo+5.859
 Phi+5.360
enf+5.334
eless+5.274
eny+5.241
Neg logit contributions
gement-6.378
llah-6.290
Asset-5.968
Brit-5.853
gment-5.835
Token his
Token ID465
Feature activation-4.20e-02

Pos logit contributions
 Duo+7.561
 Phi+7.301
enf+6.754
eless+6.672
 Solo+6.612
Neg logit contributions
gement-10.718
llah-10.358
gment-9.908
Asset-9.648
Brit-9.628
Token good
Token ID922
Feature activation-2.90e-01

Pos logit contributions
gement+0.446
llah+0.428
gment+0.407
Brit+0.401
Asset+0.387
Neg logit contributions
 Duo-0.487
enf-0.464
 Phi-0.454
agra-0.453
eny-0.444
Token friend
Token ID1545
Feature activation+6.57e-01

Pos logit contributions
gement+2.192
llah+2.065
gment+1.932
Brit+1.888
Asset+1.765
Neg logit contributions
 Duo-4.172
enf-4.018
agra-3.959
 Phi-3.941
eless-3.833
Token are
Token ID389
Feature activation+6.98e-01

Pos logit contributions
 Duo+6.440
 Phi+5.934
enf+5.801
 cf+5.769
 Solo+5.686
Neg logit contributions
gement-7.301
llah-7.261
Asset-6.806
Brit-6.777
gment-6.731
Token appropriate
Token ID5035
Feature activation+5.82e-01

Pos logit contributions
 Duo+6.755
enf+6.393
 Phi+6.233
 cf+6.208
eny+5.927
Neg logit contributions
gement-7.887
llah-7.711
Brit-7.322
gment-7.210
Asset-7.157
Token to
Token ID284
Feature activation+7.03e-01

Pos logit contributions
 Duo+5.172
enf+4.896
 cf+4.813
 Phi+4.765
agra+4.691
Neg logit contributions
gement-7.168
llah-6.922
Brit-6.672
gment-6.609
Asset-6.476
Token make
Token ID787
Feature activation+5.80e-01

Pos logit contributions
 Duo+6.989
 Phi+6.458
enf+6.432
 cf+6.251
agra+6.199
Neg logit contributions
gement-7.803
llah-7.652
Brit-7.373
Asset-7.184
gment-7.116
Token in
Token ID287
Feature activation+1.13e+00

Pos logit contributions
 Duo+4.982
enf+4.624
 cf+4.463
 Phi+4.444
agra+4.393
Neg logit contributions
gement-7.295
llah-7.115
Brit-6.890
gment-6.789
Asset-6.775
Token an
Token ID281
Feature activation+1.51e+00

Pos logit contributions
 Duo+10.540
 Solo+9.699
 Phi+9.598
enf+9.154
 cf+9.024
Neg logit contributions
llah-12.006
gement-11.721
Brit-11.223
gment-10.866
Asset-10.859
Token interview
Token ID2720
Feature activation+7.39e-01

Pos logit contributions
 Duo+13.291
enf+12.217
 Solo+12.139
 Phi+11.812
if+11.444
Neg logit contributions
Brit-14.088
gement-13.937
llah-13.721
gment-13.058
Asset-12.581
Token about
Token ID546
Feature activation+6.79e-01

Pos logit contributions
 Duo+6.591
enf+6.075
 Phi+5.875
eny+5.836
eless+5.743
Neg logit contributions
gement-8.785
llah-8.631
Brit-8.419
Asset-8.138
gment-7.966
Token improvis
Token ID47833
Feature activation+4.16e-01

Pos logit contributions
 Duo+6.887
 Phi+6.374
 Solo+6.108
enf+6.041
 cf+5.911
Neg logit contributions
gement-7.494
llah-7.475
Brit-7.109
Asset-6.932
gment-6.896
Tokenation
Token ID341
Feature activation+5.96e-01

Pos logit contributions
 Duo+6.406
enf+6.068
 Phi+6.023
eny+5.975
agra+5.968
Neg logit contributions
gement-2.671
llah-2.619
Brit-2.363
gment-2.260
Asset-2.249
Token.
Token ID13
Feature activation+5.92e-01

Pos logit contributions
 Duo+5.545
enf+5.036
 Phi+5.034
eny+4.847
 cf+4.813
Neg logit contributions
gement-7.268
llah-7.076
Brit-6.849
gment-6.676
Asset-6.565
Token
Token ID960
Feature activation+3.50e-01

Pos logit contributions
gement+2.146
llah+2.077
gment+1.968
Brit+1.942
Asset+1.875
Neg logit contributions
 Duo-2.256
enf-2.137
 Phi-2.099
agra-2.092
eny-2.031
Tokenall
Token ID439
Feature activation+1.49e-02

Pos logit contributions
 Duo+3.042
enf+2.869
 Phi+2.785
agra+2.752
eny+2.725
Neg logit contributions
gement-4.643
llah-4.486
gment-4.289
Brit-4.150
Asset-4.080
Token that
Token ID326
Feature activation+1.71e-01

Pos logit contributions
 Duo+0.159
enf+0.150
 Phi+0.147
agra+0.146
eny+0.143
Neg logit contributions
gement-0.173
llah-0.166
gment-0.159
Brit-0.157
Asset-0.152
Token stuff
Token ID3404
Feature activation+1.82e-01

Pos logit contributions
 Duo+1.826
enf+1.714
 Phi+1.693
agra+1.671
eny+1.649
Neg logit contributions
gement-1.978
llah-1.900
gment-1.806
Brit-1.787
Asset-1.731
Token has
Token ID468
Feature activation+5.80e-01

Pos logit contributions
 Duo+1.922
enf+1.808
 Phi+1.776
agra+1.760
eny+1.747
Neg logit contributions
gement-2.133
llah-2.027
gment-1.947
Brit-1.921
Asset-1.873
Token an
Token ID281
Feature activation+1.51e+00

Pos logit contributions
 Duo+5.451
 Phi+5.100
enf+5.041
eny+4.895
agra+4.877
Neg logit contributions
gement-6.958
llah-6.751
Brit-6.417
gment-6.302
Asset-6.251
Token impact
Token ID2928
Feature activation+5.60e-01

Pos logit contributions
 Duo+11.924
eless+11.812
 Phi+11.804
enf+11.612
eny+11.393
Neg logit contributions
gement-14.883
Brit-14.630
llah-13.949
gment-13.579
��-13.012
Token on
Token ID319
Feature activation+1.01e-02

Pos logit contributions
 Duo+4.967
enf+4.618
eless+4.616
 Phi+4.600
agra+4.584
Neg logit contributions
gement-7.099
llah-6.697
gment-6.446
Brit-6.387
Asset-6.302
Token our
Token ID674
Feature activation-1.52e-01

Pos logit contributions
 Duo+0.109
enf+0.103
 Phi+0.101
agra+0.101
eny+0.099
Neg logit contributions
gement-0.116
llah-0.111
gment-0.106
Brit-0.105
Asset-0.102
Token long
Token ID890
Feature activation-2.22e-01

Pos logit contributions
gement+1.716
llah+1.638
gment+1.582
Brit+1.551
Asset+1.501
Neg logit contributions
 Duo-1.643
enf-1.570
agra-1.525
 Phi-1.515
eny-1.488
Token-
Token ID12
Feature activation-3.12e-01

Pos logit contributions
gement+2.546
llah+2.417
gment+2.344
Brit+2.302
Asset+2.240
Neg logit contributions
 Duo-2.380
enf-2.236
 Phi-2.199
agra-2.172
eny-2.121
Tokenitability
Token ID34147
Feature activation+3.57e-01

Pos logit contributions
 Duo+3.849
enf+3.654
eny+3.599
 Phi+3.598
eless+3.545
Neg logit contributions
gement-2.873
llah-2.782
gment-2.595
Brit-2.594
Asset-2.546
Token about
Token ID546
Feature activation+9.74e-01

Pos logit contributions
 Duo+3.471
enf+3.216
 Phi+3.179
eny+3.103
agra+3.099
Neg logit contributions
gement-4.308
llah-4.184
Brit-3.995
gment-3.969
Asset-3.914
Token what
Token ID644
Feature activation+8.16e-01

Pos logit contributions
 Duo+9.378
 Phi+8.795
 Solo+8.432
 cf+8.197
enf+8.041
Neg logit contributions
llah-10.096
gement-10.025
Brit-9.669
Asset-9.629
gment-9.470
Token happens
Token ID4325
Feature activation+6.03e-01

Pos logit contributions
 Duo+7.817
 Phi+7.290
 Solo+7.001
 cf+6.747
enf+6.705
Neg logit contributions
llah-9.266
gement-9.035
Brit-8.521
Asset-8.505
gment-8.480
Token in
Token ID287
Feature activation+9.49e-01

Pos logit contributions
 Duo+5.171
 Phi+4.757
enf+4.754
eny+4.579
 Solo+4.579
Neg logit contributions
gement-7.727
llah-7.505
gment-7.164
Asset-7.138
Brit-7.126
Token an
Token ID281
Feature activation+1.50e+00

Pos logit contributions
 Duo+8.730
 Phi+8.000
 Solo+7.818
 cf+7.343
enf+7.313
Neg logit contributions
gement-10.793
llah-10.763
Brit-10.174
Asset-10.046
gment-9.941
Token improvis
Token ID47833
Feature activation+5.36e-01

Pos logit contributions
 Duo+11.918
 Phi+10.706
 Solo+10.674
enf+10.644
+10.361
Neg logit contributions
Brit-15.062
gement-15.034
llah-14.651
gment-13.997
Asset-13.758
Tokenation
Token ID341
Feature activation+6.50e-01

Pos logit contributions
 Duo+8.004
enf+7.532
eless+7.446
eny+7.414
 Phi+7.378
Neg logit contributions
gement-3.501
llah-3.485
Brit-3.249
Asset-3.102
gment-2.884
Token,
Token ID11
Feature activation+1.43e-01

Pos logit contributions
 Duo+5.959
enf+5.399
 Phi+5.366
 Solo+5.247
eny+5.201
Neg logit contributions
gement-7.892
llah-7.721
Brit-7.500
Asset-7.256
gment-7.244
Token while
Token ID981
Feature activation+4.88e-01

Pos logit contributions
 Duo+1.513
enf+1.414
 Phi+1.401
agra+1.382
eny+1.355
Neg logit contributions
gement-1.675
llah-1.619
gment-1.541
Brit-1.529
Asset-1.488
Token the
Token ID262
Feature activation+3.53e-01

Pos logit contributions
 Duo+4.766
 Phi+4.433
enf+4.391
 cf+4.221
agra+4.208
Neg logit contributions
gement-5.901
llah-5.675
gment-5.457
Brit-5.295
Asset-5.272
Token hasn
Token ID5818
Feature activation+3.08e-01

Pos logit contributions
 Duo+2.965
enf+2.767
 Phi+2.731
agra+2.700
eny+2.653
Neg logit contributions
gement-3.416
llah-3.323
gment-3.115
Brit-3.094
Asset-3.034
Token't
Token ID470
Feature activation+4.69e-01

Pos logit contributions
 Duo+2.564
enf+2.433
 Phi+2.348
agra+2.310
eny+2.308
Neg logit contributions
gement-4.200
llah-4.032
gment-3.880
Brit-3.803
��-3.785
Token had
Token ID550
Feature activation+8.50e-01

Pos logit contributions
 Duo+4.749
enf+4.414
 Phi+4.346
eless+4.278
agra+4.274
Neg logit contributions
gement-5.464
llah-5.353
Asset-5.064
Brit-5.051
gment-4.934
Token more
Token ID517
Feature activation+1.18e+00

Pos logit contributions
 Duo+7.116
 Phi+6.686
enf+6.546
agra+6.302
eny+6.213
Neg logit contributions
gement-10.391
llah-10.124
Brit-9.540
gment-9.428
��-9.269
Token of
Token ID286
Feature activation+5.56e-01

Pos logit contributions
 Duo+9.439
 Phi+8.895
 hur+8.720
 dynam+8.657
 cf+8.642
Neg logit contributions
gement-13.413
llah-13.185
Brit-12.140
gment-12.135
Asset-11.896
Token an
Token ID281
Feature activation+1.50e+00

Pos logit contributions
 Duo+5.188
 Phi+4.771
enf+4.761
agra+4.692
eny+4.601
Neg logit contributions
gement-6.805
llah-6.653
Brit-6.245
gment-6.127
Asset-6.000
Token impact
Token ID2928
Feature activation+4.88e-01

Pos logit contributions
 Duo+8.707
 Phi+8.213
 hur+8.208
enf+8.140
eny+7.930
Neg logit contributions
llah-17.792
Brit-17.742
gement-17.599
gment-16.589
ighting-16.366
Token on
Token ID319
Feature activation-2.66e-03

Pos logit contributions
 Duo+4.422
enf+4.131
 Phi+4.060
eny+4.022
agra+4.018
Neg logit contributions
gement-6.163
llah-6.007
gment-5.658
Brit-5.566
Asset-5.538
Token prices
Token ID4536
Feature activation+1.23e-01

Pos logit contributions
gement+0.029
llah+0.028
gment+0.027
Brit+0.026
Asset+0.026
Neg logit contributions
 Duo-0.030
enf-0.028
 Phi-0.028
agra-0.028
eny-0.027
Token.
Token ID13
Feature activation+5.26e-01

Pos logit contributions
 Duo+1.241
enf+1.172
 Phi+1.146
agra+1.133
eny+1.119
Neg logit contributions
gement-1.505
llah-1.452
gment-1.384
Brit-1.366
Asset-1.330
Token
Token ID198
Feature activation-8.36e-02

Pos logit contributions
 Duo+5.207
 Phi+4.819
enf+4.708
 Solo+4.605
 cf+4.574
Neg logit contributions
gement-6.579
llah-6.220
gment-6.058
��-5.669
Brit-5.602
Token
Token ID198
Feature activation+3.43e-01

Pos logit contributions
 Duo+0.573
enf+0.543
 Phi+0.533
agra+0.527
eny+0.518
Neg logit contributions
gement-0.584
llah-0.560
gment-0.534
Brit-0.524
Asset-0.509
TokenManchester
Token ID40744
Feature activation-1.94e-01

Pos logit contributions
 Duo+4.312
enf+4.227
 Phi+4.098
agra+4.093
eny+4.054
Neg logit contributions
gement-3.158
llah-3.052
gment-2.869
Brit-2.635
Asset-2.566
Token United
Token ID1578
Feature activation-6.73e-02

Pos logit contributions
gement+1.978
llah+1.846
gment+1.771
Brit+1.763
Asset+1.680
Neg logit contributions
 Duo-2.295
enf-2.220
agra-2.162
 Phi-2.157
eny-2.131
Token[
Token ID58
Feature activation+7.81e-01

Pos logit contributions
gement+0.759
llah+0.727
gment+0.693
Brit+0.687
Asset+0.663
Neg logit contributions
 Duo-0.737
enf-0.701
 Phi-0.683
agra-0.683
eny-0.668
Token95
Token ID3865
Feature activation-1.39e-01

Pos logit contributions
enf+5.302
 Duo+5.091
cum+4.838
 Phi+4.772
 cf+4.739
Neg logit contributions
gement-10.875
llah-10.644
gment-10.037
Asset-9.478
Brit-9.373
Token][
Token ID7131
Feature activation+1.47e+00

Pos logit contributions
gement+1.290
llah+1.224
gment+1.156
Brit+1.138
Asset+1.095
Neg logit contributions
 Duo-1.791
enf-1.710
 Phi-1.681
agra-1.680
eny-1.649
Token96
Token ID4846
Feature activation-6.38e-02

Pos logit contributions
enf+8.292
cum+8.248
 Duo+8.202
if+8.183
 cf+7.717
Neg logit contributions
gement-18.071
llah-17.156
gment-16.543
Asset-16.363
��-16.362
Token]
Token ID60
Feature activation+1.58e-01

Pos logit contributions
gement+0.610
llah+0.580
gment+0.548
Brit+0.539
Asset+0.519
Neg logit contributions
 Duo-0.812
enf-0.775
 Phi-0.761
agra-0.759
eny-0.746
Token
Token ID198
Feature activation+5.45e-02

Pos logit contributions
 Duo+1.728
enf+1.629
 Phi+1.606
agra+1.579
eny+1.557
Neg logit contributions
gement-1.796
llah-1.737
gment-1.654
Brit-1.619
Asset-1.581
Token
Token ID198
Feature activation+1.59e-01

Pos logit contributions
 Duo+0.602
enf+0.569
 Phi+0.559
agra+0.553
eny+0.544
Neg logit contributions
gement-0.615
llah-0.589
gment-0.562
Brit-0.552
Asset-0.536
TokenInternational
Token ID24274
Feature activation+2.10e-01

Pos logit contributions
 Duo+1.825
enf+1.757
 Phi+1.715
agra+1.707
eny+1.683
Neg logit contributions
gement-1.684
llah-1.619
gment-1.540
Brit-1.472
Asset-1.432
Token said
Token ID531
Feature activation+5.78e-01

Pos logit contributions
 Duo+0.688
enf+0.653
 Phi+0.642
agra+0.637
eny+0.625
Neg logit contributions
gement-0.587
llah-0.559
gment-0.534
Brit-0.524
Asset-0.506
Token these
Token ID777
Feature activation+2.70e-01

Pos logit contributions
 Duo+5.509
 Phi+5.018
agra+4.865
enf+4.860
 Solo+4.791
Neg logit contributions
gement-7.018
llah-6.861
gment-6.666
Brit-6.405
Asset-6.272
Token would
Token ID561
Feature activation+4.92e-01

Pos logit contributions
 Duo+2.961
enf+2.752
 Phi+2.698
agra+2.677
eny+2.634
Neg logit contributions
gement-3.075
llah-2.911
gment-2.832
Brit-2.747
Asset-2.720
Token have
Token ID423
Feature activation+8.61e-01

Pos logit contributions
 Duo+4.577
enf+4.340
eless+4.124
 Phi+4.081
agra+4.069
Neg logit contributions
gement-6.107
llah-5.932
Brit-5.718
Asset-5.622
gment-5.608
Token a
Token ID257
Feature activation+1.23e+00

Pos logit contributions
 Duo+6.901
enf+6.593
eless+6.281
 cf+6.268
 Phi+6.209
Neg logit contributions
gement-10.612
llah-10.514
gment-9.789
Brit-9.766
Asset-9.676
Token negative
Token ID4633
Feature activation+1.45e+00

Pos logit contributions
 Duo+9.048
 Phi+8.221
 hur+8.079
enf+8.039
 deaf+8.015
Neg logit contributions
gement-14.450
llah-14.092
Brit-13.896
Asset-13.862
ighting-13.626
Token impact
Token ID2928
Feature activation+2.78e-01

Pos logit contributions
 Duo+8.715
 Phi+7.652
 cf+7.476
 hur+7.398
 Cambodia+7.390
Neg logit contributions
llah-17.716
gement-17.399
Brit-17.361
Asset-17.065
��-16.687
Token on
Token ID319
Feature activation+4.15e-01

Pos logit contributions
 Duo+2.713
enf+2.564
 Phi+2.502
agra+2.495
eny+2.454
Neg logit contributions
gement-3.446
llah-3.291
gment-3.176
Brit-3.107
Asset-3.073
Token current
Token ID1459
Feature activation+1.87e-01

Pos logit contributions
 Duo+4.393
 Phi+4.150
enf+4.073
agra+4.042
 Solo+3.939
Neg logit contributions
gement-4.694
llah-4.477
gment-4.345
Brit-4.286
Asset-4.153
Token peace
Token ID4167
Feature activation+3.67e-02

Pos logit contributions
 Duo+2.396
enf+2.280
 Phi+2.261
agra+2.228
eny+2.181
Neg logit contributions
gement-1.760
llah-1.667
gment-1.591
Brit-1.570
Asset-1.506
Token talks
Token ID6130
Feature activation-4.74e-01

Pos logit contributions
 Duo+0.426
enf+0.405
 Phi+0.397
agra+0.394
eny+0.387
Neg logit contributions
gement-0.393
llah-0.376
gment-0.357
Brit-0.353
Asset-0.342
Token hand
Token ID1021
Feature activation-5.88e-02

Pos logit contributions
 Duo+4.871
enf+4.619
 Phi+4.538
agra+4.528
eny+4.487
Neg logit contributions
gement-3.414
llah-3.389
Brit-3.099
gment-3.071
Asset-3.008
Token in
Token ID287
Feature activation+4.34e-01

Pos logit contributions
gement+0.672
llah+0.645
gment+0.616
Brit+0.607
Asset+0.589
Neg logit contributions
 Duo-0.638
enf-0.605
 Phi-0.593
agra-0.588
eny-0.576
Token today
Token ID1909
Feature activation+8.41e-01

Pos logit contributions
 Duo+4.538
 Phi+4.162
enf+4.105
agra+4.007
 Solo+3.930
Neg logit contributions
gement-5.041
llah-4.971
Brit-4.757
gment-4.611
Asset-4.444
Token
Token ID447
Feature activation+1.96e-01

Pos logit contributions
 Duo+6.839
enf+6.261
cum+6.018
 Phi+5.991
+5.926
Neg logit contributions
gement-10.274
llah-10.229
Brit-9.930
gment-9.603
Asset-9.357
Token
Token ID247
Feature activation+3.74e-01

Pos logit contributions
 Duo+2.013
enf+1.865
 Phi+1.850
agra+1.821
eny+1.794
Neg logit contributions
gement-2.319
llah-2.239
Brit-2.140
gment-2.138
Asset-2.085
Tokens
Token ID82
Feature activation+1.43e+00

Pos logit contributions
 Duo+4.728
enf+4.470
 Phi+4.364
eny+4.300
agra+4.281
Neg logit contributions
gement-3.452
llah-3.361
Brit-3.176
gment-3.124
Asset-2.989
Token time
Token ID640
Feature activation+6.20e-01

Pos logit contributions
 Duo+12.983
 Phi+11.942
 Solo+11.505
enf+11.146
 dynam+11.107
Neg logit contributions
Brit-13.909
gement-13.557
llah-13.540
gment-12.212
Asset-12.029
Token.
Token ID13
Feature activation+2.86e-01

Pos logit contributions
 Duo+5.815
enf+5.314
 Phi+5.141
 cf+5.083
cum+5.042
Neg logit contributions
gement-7.451
llah-7.143
Brit-7.047
gment-6.799
Asset-6.634
Token The
Token ID383
Feature activation+3.30e-01

Pos logit contributions
 Duo+3.185
 Phi+2.947
enf+2.912
agra+2.829
eny+2.792
Neg logit contributions
gement-3.218
llah-3.077
gment-2.942
Brit-2.886
Asset-2.728
Token ultimate
Token ID8713
Feature activation+5.19e-01

Pos logit contributions
 Duo+3.822
 Phi+3.553
enf+3.496
agra+3.438
eny+3.386
Neg logit contributions
gement-3.498
llah-3.413
Brit-3.256
gment-3.183
Asset-3.072
Token objective
Token ID9432
Feature activation+3.01e-01

Pos logit contributions
 Duo+5.448
 Phi+4.992
enf+4.833
agra+4.695
eny+4.671
Neg logit contributions
gement-5.888
llah-5.887
Brit-5.673
gment-5.440
Asset-5.266
Token
Token ID250
Feature activation+6.90e-01

Pos logit contributions
 Duo+2.161
enf+2.043
 Phi+2.019
agra+1.969
eny+1.956
Neg logit contributions
gement-2.153
llah-2.067
gment-1.963
Brit-1.937
Asset-1.911
TokenThis
Token ID1212
Feature activation+6.06e-01

Pos logit contributions
 Duo+6.414
enf+6.114
cum+5.898
 Phi+5.867
eny+5.762
Neg logit contributions
gement-7.937
llah-7.764
gment-7.284
Asset-7.083
Brit-7.062
Token version
Token ID2196
Feature activation+5.38e-01

Pos logit contributions
 Duo+6.200
 Phi+5.713
enf+5.609
agra+5.493
 Solo+5.469
Neg logit contributions
gement-6.812
llah-6.740
Brit-6.259
gment-6.241
Asset-6.190
Token had
Token ID550
Feature activation+7.30e-01

Pos logit contributions
 Duo+5.151
enf+4.770
 Phi+4.703
eny+4.588
 cf+4.585
Neg logit contributions
gement-6.432
llah-6.277
gment-5.929
Brit-5.874
Asset-5.793
Token the
Token ID262
Feature activation+8.20e-01

Pos logit contributions
 Duo+7.159
 Phi+6.691
enf+6.605
eny+6.379
 Solo+6.377
Neg logit contributions
llah-7.978
gement-7.978
Asset-7.468
Brit-7.464
gment-7.404
Token most
Token ID749
Feature activation+1.43e+00

Pos logit contributions
 Duo+8.356
 Phi+7.765
enf+7.567
 Solo+7.291
eny+7.206
Neg logit contributions
gement-8.777
llah-8.592
Brit-8.123
Asset-8.123
gment-7.939
Token motion
Token ID6268
Feature activation+3.74e-01

Pos logit contributions
 Duo+12.666
 Phi+12.547
 dynam+11.858
 cf+11.698
enf+11.644
Neg logit contributions
llah-13.476
gement-13.310
Asset-12.498
gment-12.169
Brit-12.032
Token
Token ID447
Feature activation+1.15e-01

Pos logit contributions
 Duo+3.742
enf+3.487
eless+3.402
 Phi+3.391
eny+3.373
Neg logit contributions
gement-4.432
llah-4.370
Brit-4.129
gment-4.110
Asset-4.014
Token
Token ID251
Feature activation+5.70e-01

Pos logit contributions
 Duo+1.384
enf+1.314
 Phi+1.297
agra+1.278
eny+1.264
Neg logit contributions
gement-1.165
llah-1.114
gment-1.056
Brit-1.049
Asset-1.020
Token stems
Token ID21552
Feature activation+3.02e-01

Pos logit contributions
 Duo+5.539
enf+5.146
 Phi+5.133
 cf+5.001
eny+4.979
Neg logit contributions
gement-6.652
llah-6.573
Brit-6.122
gment-6.118
Asset-6.050
Token from
Token ID422
Feature activation+3.09e-01

Pos logit contributions
 Duo+2.752
enf+2.601
 Phi+2.544
agra+2.488
eny+2.465
Neg logit contributions
gement-3.858
llah-3.772
gment-3.579
Brit-3.579
Asset-3.507
Token000
Token ID830
Feature activation+2.30e-01

Pos logit contributions
gement+1.159
llah+1.114
gment+1.055
Brit+1.043
Asset+1.006
Neg logit contributions
 Duo-1.299
enf-1.242
 Phi-1.213
agra-1.205
eny-1.187
Token each
Token ID1123
Feature activation+3.85e-01

Pos logit contributions
 Duo+2.595
enf+2.446
 Phi+2.417
agra+2.377
eny+2.357
Neg logit contributions
gement-2.523
llah-2.410
gment-2.285
Brit-2.244
Asset-2.193
Token
Token ID851
Feature activation+2.42e-01

Pos logit contributions
 Duo+4.359
 Phi+4.045
enf+4.045
agra+3.933
eny+3.931
Neg logit contributions
gement-4.167
llah-3.934
gment-3.740
Brit-3.704
Asset-3.631
Token are
Token ID389
Feature activation+2.28e-01

Pos logit contributions
 Duo+2.697
enf+2.571
agra+2.511
 Phi+2.503
eny+2.483
Neg logit contributions
gement-2.644
llah-2.544
gment-2.404
Brit-2.393
Asset-2.337
Token having
Token ID1719
Feature activation+9.14e-01

Pos logit contributions
 Duo+2.448
enf+2.318
 Phi+2.279
agra+2.261
eny+2.223
Neg logit contributions
gement-2.609
llah-2.517
gment-2.384
Brit-2.377
Asset-2.287
Token an
Token ID281
Feature activation+1.42e+00

Pos logit contributions
 Duo+8.073
 Phi+7.671
eny+7.370
agra+7.364
 Solo+7.333
Neg logit contributions
gement-10.624
llah-10.063
Brit-9.770
gment-9.531
Asset-9.504
Token effect
Token ID1245
Feature activation+5.20e-01

Pos logit contributions
 Duo+9.314
eless+9.289
eny+9.012
enf+8.962
 Phi+8.543
Neg logit contributions
gement-16.622
Brit-16.323
llah-15.910
gment-15.255
ighting-15.069
Token.
Token ID13
Feature activation+1.61e-01

Pos logit contributions
 Duo+4.822
enf+4.490
 Phi+4.457
agra+4.433
eny+4.390
Neg logit contributions
gement-6.544
llah-6.146
gment-5.983
Asset-5.781
Brit-5.766
Token Many
Token ID4650
Feature activation+4.04e-01

Pos logit contributions
 Duo+1.776
enf+1.654
 Phi+1.647
agra+1.610
eny+1.569
Neg logit contributions
gement-1.869
llah-1.762
gment-1.714
Brit-1.644
Asset-1.597
Token who
Token ID508
Feature activation+6.83e-01

Pos logit contributions
 Duo+4.345
 Phi+4.009
enf+3.989
agra+3.925
eless+3.886
Neg logit contributions
gement-4.620
llah-4.324
gment-4.215
Brit-4.119
Asset-4.021
Token are
Token ID389
Feature activation+5.75e-01

Pos logit contributions
 Duo+6.932
enf+6.471
eny+6.420
eless+6.338
agra+6.292
Neg logit contributions
gement-7.575
llah-6.973
Asset-6.972
Brit-6.955
gment-6.878
Token the
Token ID262
Feature activation+3.28e-01

Pos logit contributions
 Duo+2.306
enf+2.145
 Phi+2.133
agra+2.085
eny+2.067
Neg logit contributions
gement-2.484
llah-2.388
gment-2.292
Brit-2.244
Asset-2.194
Token D
Token ID360
Feature activation+3.99e-01

Pos logit contributions
 Duo+3.650
 Phi+3.375
enf+3.356
agra+3.263
eny+3.240
Neg logit contributions
gement-3.668
llah-3.547
gment-3.379
Brit-3.331
Asset-3.264
TokenNR
Token ID24723
Feature activation+6.59e-01

Pos logit contributions
 Duo+3.625
enf+3.542
 Phi+3.379
agra+3.376
eny+3.314
Neg logit contributions
gement-5.028
llah-4.829
Brit-4.643
gment-4.612
Asset-4.467
Token commissioner
Token ID16248
Feature activation+1.92e-01

Pos logit contributions
 Duo+6.879
 Phi+6.243
enf+6.210
 cf+5.963
eny+5.961
Neg logit contributions
gement-7.398
llah-7.208
gment-6.855
Brit-6.850
Asset-6.544
Token who
Token ID508
Feature activation+3.97e-01

Pos logit contributions
 Duo+2.014
enf+1.893
 Phi+1.861
agra+1.831
eny+1.816
Neg logit contributions
gement-2.256
llah-2.185
gment-2.077
Brit-2.064
Asset-1.998
Token set
Token ID900
Feature activation+1.41e+00

Pos logit contributions
 Duo+4.256
enf+3.978
 Phi+3.913
eny+3.842
agra+3.833
Neg logit contributions
gement-4.480
llah-4.361
Brit-4.142
gment-4.126
Asset-4.058
Token our
Token ID674
Feature activation+6.22e-01

Pos logit contributions
 Duo+12.029
 Phi+10.888
 Solo+10.410
 Boone+10.274
 cf+10.083
Neg logit contributions
llah-14.561
gement-14.508
Brit-14.379
gment-13.749
Asset-13.010
Token research
Token ID2267
Feature activation+5.10e-01

Pos logit contributions
 Duo+6.873
 Phi+6.237
enf+6.151
eny+6.009
 cf+5.971
Neg logit contributions
gement-6.681
llah-6.553
Brit-6.209
gment-6.146
Asset-5.985
Token on
Token ID319
Feature activation+9.48e-01

Pos logit contributions
 Duo+5.291
enf+4.914
 Phi+4.888
agra+4.719
eny+4.664
Neg logit contributions
gement-5.891
llah-5.680
Brit-5.395
gment-5.377
Asset-5.194
Token the
Token ID262
Feature activation+7.82e-01

Pos logit contributions
 Duo+9.433
 Phi+8.771
enf+8.322
eny+8.173
 Solo+8.169
Neg logit contributions
gement-10.036
llah-9.834
Brit-9.478
gment-9.347
Asset-9.118
Token road
Token ID2975
Feature activation+2.64e-01

Pos logit contributions
 Duo+8.179
 Phi+7.431
enf+7.196
 cf+7.081
 Solo+7.005
Neg logit contributions
gement-8.455
llah-8.451
Brit-7.998
gment-7.885
Asset-7.860
Token nuclear
Token ID4523
Feature activation+3.31e-01

Pos logit contributions
gement+2.609
llah+2.494
gment+2.385
Brit+2.349
Asset+2.286
Neg logit contributions
 Duo-2.527
enf-2.407
agra-2.349
 Phi-2.320
eny-2.300
Token weapons
Token ID3777
Feature activation-2.07e-01

Pos logit contributions
 Duo+3.556
enf+3.390
 Phi+3.345
agra+3.248
eny+3.240
Neg logit contributions
gement-3.723
llah-3.622
gment-3.416
Brit-3.351
Asset-3.262
Token viewed
Token ID9569
Feature activation+8.45e-02

Pos logit contributions
gement+2.285
llah+2.179
gment+2.081
Brit+2.040
Asset+1.981
Neg logit contributions
 Duo-2.299
enf-2.181
agra-2.156
 Phi-2.147
eny-2.077
Token in
Token ID287
Feature activation+4.53e-01

Pos logit contributions
 Duo+0.883
enf+0.835
 Phi+0.816
agra+0.813
eny+0.793
Neg logit contributions
gement-0.995
llah-0.961
gment-0.915
Brit-0.905
Asset-0.883
Token exactly
Token ID3446
Feature activation+2.91e-01

Pos logit contributions
 Duo+4.380
enf+4.124
 Phi+4.058
agra+3.974
eny+3.889
Neg logit contributions
gement-5.473
llah-5.395
gment-5.087
Brit-4.989
Asset-4.904
Token this
Token ID428
Feature activation+1.41e+00

Pos logit contributions
 Duo+2.837
 Phi+2.622
enf+2.614
agra+2.549
eny+2.511
Neg logit contributions
gement-3.600
llah-3.474
gment-3.339
Brit-3.285
Asset-3.226
Token light
Token ID1657
Feature activation-1.37e-01

Pos logit contributions
 Duo+10.624
enf+9.852
 Phi+9.695
 Solo+9.303
 cf+9.205
Neg logit contributions
gement-14.789
llah-14.534
Asset-14.021
Brit-13.862
gment-13.823
Token before
Token ID878
Feature activation+7.43e-02

Pos logit contributions
gement+1.509
llah+1.444
gment+1.380
Brit+1.362
Asset+1.313
Neg logit contributions
 Duo-1.537
enf-1.460
 Phi-1.431
agra-1.425
eny-1.389
Token.
Token ID13
Feature activation-7.96e-02

Pos logit contributions
 Duo+0.800
enf+0.754
 Phi+0.740
agra+0.733
eny+0.721
Neg logit contributions
gement-0.856
llah-0.822
gment-0.786
Brit-0.775
Asset-0.751
Token<|endoftext|>
Token ID50256
Feature activation-2.51e-01

Pos logit contributions
gement+0.849
llah+0.816
gment+0.772
Brit+0.771
Asset+0.746
Neg logit contributions
 Duo-0.914
enf-0.874
agra-0.854
 Phi-0.851
eny-0.839
TokenThe
Token ID464
Feature activation-3.82e-02

Pos logit contributions
gement+2.897
llah+2.783
Brit+2.717
gment+2.673
Asset+2.633
Neg logit contributions
 Duo-2.667
enf-2.470
 Phi-2.458
agra-2.409
eny-2.350
Token.
Token ID13
Feature activation+8.19e-01

Pos logit contributions
gement+0.258
llah+0.248
gment+0.237
Brit+0.234
Asset+0.226
Neg logit contributions
 Duo-0.240
enf-0.228
 Phi-0.222
agra-0.221
eny-0.217
Token On
Token ID1550
Feature activation+5.06e-01

Pos logit contributions
 Duo+8.243
 Phi+7.570
 Solo+7.489
enf+6.882
agra+6.866
Neg logit contributions
gement-9.514
llah-9.184
gment-8.871
Asset-8.423
��-8.266
Token the
Token ID262
Feature activation+1.21e-01

Pos logit contributions
 Duo+4.881
 Phi+4.635
agra+4.498
enf+4.397
eny+4.377
Neg logit contributions
llah-5.979
gement-5.933
Brit-5.657
Asset-5.504
gment-5.472
Token way
Token ID835
Feature activation+5.94e-01

Pos logit contributions
 Duo+0.924
enf+0.843
 Phi+0.833
agra+0.820
eny+0.786
Neg logit contributions
gement-1.778
llah-1.737
gment-1.664
Brit-1.658
Asset-1.626
Token,
Token ID11
Feature activation+7.50e-01

Pos logit contributions
 Duo+5.331
 Phi+4.962
 Solo+4.791
enf+4.772
agra+4.699
Neg logit contributions
gement-7.354
llah-7.232
Asset-6.905
Brit-6.861
gment-6.853
Token they
Token ID484
Feature activation+1.40e+00

Pos logit contributions
 Duo+6.398
 Phi+5.785
 Solo+5.677
agra+5.567
enf+5.486
Neg logit contributions
gement-9.515
llah-9.439
Asset-8.991
gment-8.905
Brit-8.896
Token encounter
Token ID8791
Feature activation+4.91e-01

Pos logit contributions
 Duo+10.564
 cf+9.312
eless+9.204
agra+9.136
 Solo+8.954
Neg logit contributions
Asset-16.412
gement-15.587
llah-15.575
Brit-15.142
��-14.628
Token Far
Token ID6755
Feature activation+7.11e-01

Pos logit contributions
 Duo+5.114
 Phi+4.682
agra+4.606
enf+4.551
 Solo+4.456
Neg logit contributions
gement-5.729
llah-5.589
Brit-5.253
gment-5.244
Asset-5.171
Tokenam
Token ID321
Feature activation+1.92e-01

Pos logit contributions
 Duo+7.386
agra+7.017
eless+6.870
enf+6.859
 Solo+6.510
Neg logit contributions
gement-7.805
llah-7.604
Asset-7.320
Brit-7.135
gment-6.997
Tokenir
Token ID343
Feature activation-5.42e-02

Pos logit contributions
 Duo+2.253
enf+2.139
 Phi+2.103
agra+2.100
eny+2.061
Neg logit contributions
gement-2.024
llah-1.937
Brit-1.829
gment-1.825
Asset-1.778
Token,
Token ID11
Feature activation+1.33e-01

Pos logit contributions
gement+0.617
llah+0.593
gment+0.566
Brit+0.557
Asset+0.539
Neg logit contributions
 Duo-0.589
enf-0.559
 Phi-0.547
agra-0.543
eny-0.533
Token!
Token ID0
Feature activation+8.59e-01

Pos logit contributions
 Duo+7.132
 Phi+6.647
enf+6.534
eny+6.425
cum+6.228
Neg logit contributions
gement-8.389
llah-7.806
gment-7.643
Asset-7.460
Brit-7.452
Token
Token ID447
Feature activation-3.07e-01

Pos logit contributions
 Duo+8.631
 Phi+8.091
enf+7.766
+7.721
eny+7.505
Neg logit contributions
gement-9.705
gment-8.946
llah-8.838
Brit-8.400
Asset-8.255
Token
Token ID251
Feature activation+6.81e-01

Pos logit contributions
gement+2.777
llah+2.622
gment+2.491
Brit+2.402
Asset+2.261
Neg logit contributions
 Duo-4.020
enf-3.865
agra-3.835
 Phi-3.766
eny-3.719
Token Ichigo
Token ID44547
Feature activation+3.25e-01

Pos logit contributions
 Duo+7.407
 Phi+7.068
enf+6.788
 Solo+6.503
eny+6.468
Neg logit contributions
gement-7.425
llah-6.916
gment-6.886
Asset-6.473
Brit-6.448
Token yelled
Token ID22760
Feature activation-5.09e-03

Pos logit contributions
 Duo+3.415
 Phi+3.159
enf+3.138
eny+3.036
eless+3.009
Neg logit contributions
gement-3.838
llah-3.691
gment-3.523
Brit-3.480
Asset-3.442
Token to
Token ID284
Feature activation+1.39e+00

Pos logit contributions
gement+0.060
llah+0.058
gment+0.056
Brit+0.055
Asset+0.053
Neg logit contributions
 Duo-0.053
enf-0.050
 Phi-0.049
agra-0.049
eny-0.048
Token Chad
Token ID19800
Feature activation+2.00e-01

Pos logit contributions
 Duo+12.138
 Phi+11.984
 Solo+10.554
enf+10.490
eless+10.457
Neg logit contributions
gement-14.545
gment-13.478
llah-13.316
Asset-13.135
Brit-12.971
Token
Token ID447
Feature activation+1.13e-01

Pos logit contributions
 Duo+2.080
enf+1.949
 Phi+1.923
agra+1.879
eny+1.860
Neg logit contributions
gement-2.390
llah-2.291
gment-2.192
Brit-2.162
Asset-2.123
Token
Token ID247
Feature activation-3.64e-02

Pos logit contributions
 Duo+1.265
enf+1.196
 Phi+1.176
agra+1.158
eny+1.144
Neg logit contributions
gement-1.243
llah-1.187
gment-1.134
Brit-1.127
Asset-1.096
Tokens
Token ID82
Feature activation+4.74e-01

Pos logit contributions
gement+0.407
llah+0.391
gment+0.373
Brit+0.368
Asset+0.356
Neg logit contributions
 Duo-0.405
enf-0.383
 Phi-0.376
agra-0.374
eny-0.366
Token far
Token ID1290
Feature activation+1.26e-01

Pos logit contributions
 Duo+5.311
 Phi+4.988
enf+4.932
eny+4.778
 Solo+4.707
Neg logit contributions
gement-5.114
llah-4.918
gment-4.620
Brit-4.562
Asset-4.539
Token8
Token ID23
Feature activation+2.55e-01

Pos logit contributions
 Duo+2.933
enf+2.832
agra+2.756
eny+2.726
 Phi+2.710
Neg logit contributions
gement-3.451
llah-3.382
Brit-3.207
gment-3.174
Asset-3.093
Token {
Token ID1391
Feature activation+7.67e-01

Pos logit contributions
 Duo+3.051
 Phi+2.846
enf+2.835
agra+2.790
eny+2.758
Neg logit contributions
gement-2.625
llah-2.523
Brit-2.392
gment-2.383
Asset-2.301
Token(
Token ID7
Feature activation+3.16e-01

Pos logit contributions
 Duo+4.899
cum+4.628
enf+4.605
if+4.602
+4.429
Neg logit contributions
gement-11.085
llah-10.699
��-10.492
gment-10.384
vironment-9.661
Token113
Token ID16616
Feature activation+3.34e-01

Pos logit contributions
 Duo+3.135
enf+2.948
 Phi+2.878
agra+2.847
eny+2.787
Neg logit contributions
gement-3.858
llah-3.685
gment-3.558
Brit-3.442
Asset-3.350
Tokens
Token ID82
Feature activation+7.98e-01

Pos logit contributions
 Duo+4.148
enf+3.983
eny+3.873
agra+3.873
 Phi+3.872
Neg logit contributions
gement-3.222
llah-3.041
gment-2.900
Brit-2.824
��-2.749
Token)}
Token ID38165
Feature activation+1.39e+00

Pos logit contributions
 Duo+4.338
enf+3.845
 Phi+3.817
 cf+3.534
eless+3.442
Neg logit contributions
gement-12.396
llah-11.925
��-11.841
gment-11.672
Asset-11.395
Token 56
Token ID7265
Feature activation+3.83e-01

Pos logit contributions
 Duo+10.956
 cf+9.719
 Phi+9.621
 Solo+9.219
if+8.790
Neg logit contributions
gement-15.545
Brit-15.198
Asset-14.920
llah-14.813
gment-14.500
Token.
Token ID13
Feature activation+8.30e-01

Pos logit contributions
 Duo+3.677
 Phi+3.442
enf+3.348
agra+3.264
eny+3.243
Neg logit contributions
gement-4.816
llah-4.619
gment-4.431
Brit-4.362
��-4.257
Token f
Token ID277
Feature activation+6.20e-01

Pos logit contributions
 Duo+7.467
 Phi+6.806
 cf+6.701
 Solo+6.196
enf+6.163
Neg logit contributions
gement-9.990
llah-9.943
Brit-9.599
gment-9.376
��-9.217
Token5
Token ID20
Feature activation+4.94e-01

Pos logit contributions
 Duo+5.155
enf+5.024
eny+4.815
agra+4.762
 Phi+4.670
Neg logit contributions
llah-7.787
gement-7.625
Brit-7.459
��-7.276
Asset-7.170
Token {
Token ID1391
Feature activation+5.65e-01

Pos logit contributions
 Duo+5.462
 Phi+5.068
enf+4.967
agra+4.954
 cf+4.875
Neg logit contributions
gement-5.254
llah-5.168
Brit-4.960
gment-4.865
Asset-4.682
Token that
Token ID326
Feature activation+1.25e-01

Pos logit contributions
 Duo+0.973
enf+0.917
 Phi+0.900
agra+0.891
eny+0.877
Neg logit contributions
gement-1.071
llah-1.036
gment-0.983
Brit-0.974
Asset-0.947
Token is
Token ID318
Feature activation+3.38e-01

Pos logit contributions
 Duo+1.373
enf+1.295
 Phi+1.273
agra+1.259
eny+1.236
Neg logit contributions
gement-1.413
llah-1.365
gment-1.294
Brit-1.272
Asset-1.240
Token going
Token ID1016
Feature activation+1.13e+00

Pos logit contributions
 Duo+3.449
enf+3.242
 Phi+3.193
agra+3.121
eny+3.080
Neg logit contributions
gement-3.984
llah-3.937
gment-3.648
Brit-3.633
Asset-3.559
Token to
Token ID284
Feature activation+3.07e-01

Pos logit contributions
 Duo+7.821
enf+7.561
eless+7.404
 Solo+7.300
eny+7.217
Neg logit contributions
gement-13.831
llah-13.791
Asset-12.869
gment-12.712
Brit-12.421
Token have
Token ID423
Feature activation+9.55e-01

Pos logit contributions
 Duo+3.099
enf+2.915
 Phi+2.856
agra+2.794
eny+2.757
Neg logit contributions
gement-3.668
llah-3.574
gment-3.348
Brit-3.341
Asset-3.301
Token significant
Token ID2383
Feature activation+1.38e+00

Pos logit contributions
 Duo+7.753
 Phi+7.232
enf+7.114
 cf+7.004
 hur+6.959
Neg logit contributions
llah-11.457
gement-11.274
Asset-10.426
gment-10.374
��-10.341
Token implications
Token ID10939
Feature activation+5.19e-01

Pos logit contributions
 Duo+10.469
 dynam+9.854
 cf+9.710
 hur+9.700
 Phi+9.604
Neg logit contributions
llah-14.911
gement-14.697
��-13.806
Brit-13.791
Asset-13.747
Token for
Token ID329
Feature activation+1.05e-01

Pos logit contributions
 Duo+4.650
enf+4.339
 Phi+4.294
agra+4.213
eny+4.195
Neg logit contributions
gement-6.497
llah-6.318
gment-6.002
Asset-5.980
Brit-5.906
Token things
Token ID1243
Feature activation+1.06e-01

Pos logit contributions
 Duo+1.106
enf+1.035
 Phi+1.025
agra+1.010
eny+0.991
Neg logit contributions
gement-1.228
llah-1.190
gment-1.130
Brit-1.115
Asset-1.086
Token like
Token ID588
Feature activation-2.73e-01

Pos logit contributions
 Duo+1.074
enf+1.005
 Phi+0.993
agra+0.978
eny+0.964
Neg logit contributions
gement-1.289
llah-1.246
gment-1.187
Brit-1.169
Asset-1.142
Token trade
Token ID3292
Feature activation+1.61e-01

Pos logit contributions
gement+3.019
llah+2.844
gment+2.755
Brit+2.721
Asset+2.625
Neg logit contributions
 Duo-3.003
enf-2.886
agra-2.800
 Phi-2.777
eny-2.757
Token conversion
Token ID11315
Feature activation-2.11e-01

Pos logit contributions
 Duo+2.819
 Phi+2.609
enf+2.586
agra+2.496
eny+2.486
Neg logit contributions
gement-3.224
llah-3.135
gment-2.960
Brit-2.960
Asset-2.819
Token to
Token ID284
Feature activation-2.68e-02

Pos logit contributions
gement+2.277
llah+2.146
gment+2.078
Brit+2.029
Asset+1.975
Neg logit contributions
 Duo-2.403
enf-2.288
agra-2.248
 Phi-2.236
eny-2.175
Token L
Token ID406
Feature activation-4.86e-01

Pos logit contributions
gement+0.292
llah+0.280
gment+0.267
Brit+0.263
Asset+0.255
Neg logit contributions
 Duo-0.305
enf-0.290
 Phi-0.283
agra-0.282
eny-0.277
TokenNG
Token ID10503
Feature activation-7.86e-02

Pos logit contributions
gement+4.887
llah+4.632
gment+4.476
Brit+4.281
Asset+4.273
Neg logit contributions
 Duo-5.840
 Phi-5.350
enf-5.337
agra-5.286
eny-5.041
Token has
Token ID468
Feature activation+7.74e-01

Pos logit contributions
gement+0.885
llah+0.846
gment+0.812
Brit+0.799
Asset+0.777
Neg logit contributions
 Duo-0.867
enf-0.821
 Phi-0.802
agra-0.800
eny-0.782
Token major
Token ID1688
Feature activation+1.38e+00

Pos logit contributions
 Duo+7.121
 Phi+6.731
enf+6.631
 Solo+6.349
agra+6.287
Neg logit contributions
gement-9.082
llah-9.013
Brit-8.481
gment-8.331
Asset-7.981
Token drawbacks
Token ID42763
Feature activation+5.61e-01

Pos logit contributions
 Duo+11.340
 cf+10.702
 Phi+10.582
enf+10.411
 dynam+10.321
Neg logit contributions
gement-14.251
llah-13.741
Brit-13.448
gment-12.691
Asset-12.449
Token.
Token ID13
Feature activation-1.70e-01

Pos logit contributions
 Duo+5.119
enf+4.835
eny+4.659
 Phi+4.650
 cf+4.587
Neg logit contributions
gement-6.935
llah-6.563
Brit-6.406
gment-6.353
Asset-6.222
Token Firstly
Token ID41039
Feature activation-2.49e-03

Pos logit contributions
gement+1.794
llah+1.739
gment+1.641
Brit+1.634
Asset+1.590
Neg logit contributions
 Duo-1.926
enf-1.863
agra-1.830
 Phi-1.794
eny-1.785
Token,
Token ID11
Feature activation-1.52e-01

Pos logit contributions
gement+0.028
llah+0.027
gment+0.026
Brit+0.026
Asset+0.025
Neg logit contributions
 Duo-0.027
enf-0.026
 Phi-0.025
agra-0.025
eny-0.025
Token it
Token ID340
Feature activation-2.82e-01

Pos logit contributions
gement+1.686
llah+1.614
gment+1.544
Brit+1.514
Asset+1.474
Neg logit contributions
 Duo-1.692
enf-1.622
agra-1.582
 Phi-1.568
eny-1.536
Token to
Token ID284
Feature activation-9.99e-02

Pos logit contributions
 Duo+0.522
enf+0.491
 Phi+0.484
agra+0.478
eny+0.469
Neg logit contributions
gement-0.566
llah-0.546
gment-0.521
Brit-0.515
Asset-0.499
Token keep
Token ID1394
Feature activation-2.41e-01

Pos logit contributions
gement+1.187
llah+1.139
gment+1.093
Brit+1.071
Asset+1.040
Neg logit contributions
 Duo-1.031
enf-0.984
agra-0.959
 Phi-0.954
eny-0.932
Token this
Token ID428
Feature activation-5.94e-01

Pos logit contributions
gement+2.602
llah+2.440
gment+2.367
Brit+2.316
Asset+2.268
Neg logit contributions
 Duo-2.688
enf-2.626
agra-2.552
 Phi-2.480
eny-2.478
Token contest
Token ID8414
Feature activation-3.25e-01

Pos logit contributions
gement+6.338
llah+5.820
gment+5.770
Brit+5.535
Asset+5.480
Neg logit contributions
enf-6.203
 Duo-6.201
agra-6.027
eny-5.831
eless-5.770
Token in
Token ID287
Feature activation-8.15e-01

Pos logit contributions
gement+3.440
llah+3.295
gment+3.166
Brit+3.058
Asset+2.994
Neg logit contributions
 Duo-3.636
enf-3.544
agra-3.443
 Phi-3.381
eny-3.365
Token-
Token ID12
Feature activation-1.52e+00

Pos logit contributions
gement+8.400
llah+7.889
gment+7.589
Asset+7.375
Brit+7.256
Neg logit contributions
agra-8.112
enf-8.078
 Duo-7.954
eny-7.683
eless-7.441
Tokenun
Token ID403
Feature activation-4.69e-01

Pos logit contributions
gement+14.357
Asset+13.529
llah+13.405
Brit+13.378
priv+13.014
Neg logit contributions
 Duo-12.529
agra-12.500
 Phi-12.172
enf-12.046
eny-11.762
Tokeniverse
Token ID3997
Feature activation-1.20e-01

Pos logit contributions
gement+5.536
llah+5.424
gment+5.179
Brit+5.129
Asset+4.903
Neg logit contributions
 Duo-4.779
agra-4.389
enf-4.385
 Phi-4.365
eny-4.078
Token.
Token ID13
Feature activation-1.82e-01

Pos logit contributions
gement+1.360
llah+1.306
gment+1.245
Brit+1.232
Asset+1.194
Neg logit contributions
 Duo-1.311
enf-1.247
 Phi-1.215
agra-1.210
eny-1.187
TokenIf
Token ID1532
Feature activation+3.21e-01

Pos logit contributions
gement+1.975
llah+1.894
Brit+1.814
gment+1.799
Asset+1.761
Neg logit contributions
 Duo-2.018
enf-1.945
agra-1.900
 Phi-1.872
eny-1.856
Token you
Token ID345
Feature activation+1.65e-01

Pos logit contributions
 Duo+3.472
 Phi+3.221
enf+3.215
agra+3.103
eny+3.098
Neg logit contributions
gement-3.655
llah-3.565
gment-3.377
Brit-3.339
Asset-3.210
Token =
Token ID796
Feature activation-5.01e-01

Pos logit contributions
gement+4.500
llah+4.313
Brit+4.253
gment+4.113
Asset+4.061
Neg logit contributions
 Duo-7.254
enf-7.041
agra-6.794
eny-6.718
 conduc-6.582
Token c
Token ID269
Feature activation-5.56e-01

Pos logit contributions
gement+5.498
llah+5.243
gment+5.163
Brit+4.930
Asset+4.725
Neg logit contributions
 Duo-5.210
enf-5.130
eny-4.963
agra-4.957
 Phi-4.739
Tokenv
Token ID85
Feature activation+3.62e-02

Pos logit contributions
gement+3.854
gment+3.615
llah+3.436
Brit+3.269
Asset+3.000
Neg logit contributions
 Duo-8.272
 Phi-7.858
 conduc-7.753
enf-7.734
agra-7.685
Token2
Token ID17
Feature activation+7.62e-02

Pos logit contributions
 Duo+0.334
enf+0.312
 Phi+0.305
agra+0.302
eny+0.295
Neg logit contributions
gement-0.475
llah-0.458
gment-0.439
Brit-0.434
Asset-0.423
Token.
Token ID13
Feature activation-5.19e-02

Pos logit contributions
 Duo+0.809
enf+0.765
 Phi+0.751
agra+0.742
eny+0.729
Neg logit contributions
gement-0.890
llah-0.852
gment-0.814
Brit-0.801
Asset-0.777
Tokenget
Token ID1136
Feature activation-1.51e+00

Pos logit contributions
gement+0.822
llah+0.797
gment+0.774
Brit+0.767
Asset+0.750
Neg logit contributions
 Duo-0.336
enf-0.302
 Phi-0.294
agra-0.289
eny-0.278
TokenStruct
Token ID44909
Feature activation-2.65e-01

Pos logit contributions
Struct+9.286
Asset+7.414
Brit+6.064
struct+5.181
gment+5.136
Neg logit contributions
 conduc-21.412
 awa-21.241
 unbeliev-21.139
 Duo-20.793
 skelet-20.209
Tokenuring
Token ID870
Feature activation-1.04e+00

Pos logit contributions
gement+2.904
llah+2.700
gment+2.682
Asset+2.538
Brit+2.531
Neg logit contributions
 Duo-3.052
enf-2.826
 Phi-2.792
agra-2.741
 cf-2.698
TokenElement
Token ID20180
Feature activation-2.63e-01

Pos logit contributions
Asset+6.352
gement+6.224
gment+5.833
llah+5.750
Brit+5.653
Neg logit contributions
 conduc-14.069
 Duo-14.057
agra-13.823
enf-13.544
 awa-13.193
Token(
Token ID7
Feature activation+2.23e-02

Pos logit contributions
gement+2.895
llah+2.802
Brit+2.686
gment+2.662
Asset+2.594
Neg logit contributions
 Duo-2.889
enf-2.745
agra-2.694
 Phi-2.682
eny-2.640
Tokencv
Token ID33967
Feature activation-1.01e-01

Pos logit contributions
 Duo+0.210
enf+0.198
 Phi+0.193
agra+0.192
eny+0.187
Neg logit contributions
gement-0.285
llah-0.275
gment-0.264
Brit-0.261
Asset-0.254
Token sum
Token ID2160
Feature activation+1.07e-01

Pos logit contributions
 Duo+2.317
 Phi+2.147
enf+2.146
agra+2.105
eny+2.091
Neg logit contributions
gement-2.831
llah-2.765
Brit-2.631
gment-2.620
Asset-2.537
Token of
Token ID286
Feature activation+2.12e-01

Pos logit contributions
 Duo+1.144
enf+1.080
 Phi+1.057
agra+1.051
eny+1.040
Neg logit contributions
gement-1.229
llah-1.180
gment-1.126
Brit-1.121
Asset-1.085
Token money
Token ID1637
Feature activation+5.90e-02

Pos logit contributions
 Duo+2.145
enf+1.998
 Phi+1.971
agra+1.963
eny+1.925
Neg logit contributions
gement-2.540
llah-2.466
gment-2.366
Brit-2.342
Asset-2.268
Token,"
Token ID553
Feature activation-9.57e-01

Pos logit contributions
 Duo+0.638
enf+0.602
 Phi+0.591
agra+0.586
eny+0.577
Neg logit contributions
gement-0.677
llah-0.651
gment-0.621
Brit-0.615
Asset-0.595
Token the
Token ID262
Feature activation-5.59e-01

Pos logit contributions
gement+8.596
llah+7.858
gment+7.524
Asset+7.028
Brit+6.937
Neg logit contributions
enf-10.410
 Duo-10.329
agra-10.131
eny-10.046
 Phi-9.798
Token bench
Token ID7624
Feature activation-1.48e+00

Pos logit contributions
gement+4.368
llah+3.958
gment+3.822
Brit+3.591
Asset+3.370
Neg logit contributions
 Duo-7.556
enf-7.519
eny-7.242
agra-7.192
 Phi-7.184
Token said
Token ID531
Feature activation-6.18e-01

Pos logit contributions
gement+10.182
gment+8.904
llah+7.999
 Roy+7.640
 warranty+7.504
Neg logit contributions
enf-17.500
 Duo-16.416
eno-16.030
 Phi-15.901
Pros-15.746
Token.
Token ID13
Feature activation-3.56e-01

Pos logit contributions
gement+5.799
llah+5.444
gment+5.294
Brit+5.011
Asset+4.867
Neg logit contributions
 Duo-7.349
enf-7.298
 Phi-6.939
agra-6.838
eny-6.750
Token
Token ID198
Feature activation+3.09e-02

Pos logit contributions
gement+3.571
llah+3.413
gment+3.204
Brit+3.187
Asset+3.092
Neg logit contributions
 Duo-4.224
enf-4.019
 Phi-3.951
agra-3.925
eny-3.898
Token
Token ID198
Feature activation-3.69e-01

Pos logit contributions
 Duo+0.345
enf+0.327
 Phi+0.321
agra+0.318
eny+0.312
Neg logit contributions
gement-0.345
llah-0.330
gment-0.315
Brit-0.310
Asset-0.300
Token"
Token ID1
Feature activation-1.11e-01

Pos logit contributions
gement+4.019
llah+3.814
Brit+3.733
gment+3.642
Asset+3.630
Neg logit contributions
 Duo-4.136
 Phi-3.847
enf-3.805
agra-3.739
eny-3.702
Token [
Token ID685
Feature activation-5.28e-01

Pos logit contributions
gement+4.684
gment+4.188
llah+3.992
Asset+3.695
Brit+3.238
Neg logit contributions
enf-10.433
 Duo-10.408
agra-10.178
eny-9.699
cum-9.688
Tokenreturn
Token ID7783
Feature activation-7.66e-01

Pos logit contributions
gement+5.629
llah+5.262
gment+5.197
Asset+4.999
Brit+4.896
Neg logit contributions
 Duo-6.032
 Phi-5.563
agra-5.548
enf-5.540
 Solo-5.218
Token address
Token ID2209
Feature activation-5.57e-01

Pos logit contributions
gement+7.382
llah+6.934
gment+6.787
Asset+6.285
 warranty+6.207
Neg logit contributions
 Duo-8.423
agra-8.317
enf-8.230
glers-7.797
cum-7.788
Token]
Token ID60
Feature activation-6.07e-01

Pos logit contributions
gement+4.041
llah+3.760
gment+3.619
Brit+3.428
Asset+3.365
Neg logit contributions
 Duo-7.724
enf-7.597
agra-7.553
eny-7.269
 Phi-7.208
Token |
Token ID930
Feature activation-5.36e-01

Pos logit contributions
gement+3.933
llah+3.724
Brit+3.441
gment+3.393
Asset+3.344
Neg logit contributions
 Duo-8.700
enf-8.600
agra-8.437
 Phi-8.284
eny-8.236
Token +
Token ID1343
Feature activation-1.48e+00

Pos logit contributions
gement+5.321
llah+4.998
gment+4.769
Brit+4.739
Asset+4.712
Neg logit contributions
enf-6.020
 Duo-5.948
agra-5.792
eny-5.689
 Phi-5.542
Token--------------------
Token ID19351
Feature activation-5.69e-01

Pos logit contributions
gement+6.786
llah+6.110
Brit+5.728
Asset+5.634
gment+5.302
Neg logit contributions
 Duo-21.371
 conduc-20.704
 unbeliev-20.169
 skelet-19.643
 Phi-19.480
Token--+
Token ID44785
Feature activation-3.28e-01

Pos logit contributions
gement+5.102
llah+4.879
Brit+4.628
gment+4.626
Asset+4.518
Neg logit contributions
 Duo-7.160
enf-6.705
 Phi-6.517
 Solo-6.431
agra-6.404
Token +
Token ID1343
Feature activation-1.18e+00

Pos logit contributions
gement+3.563
llah+3.414
Brit+3.245
gment+3.240
Asset+3.150
Neg logit contributions
 Duo-3.527
enf-3.483
agra-3.365
eny-3.319
 Phi-3.262
Token--------------------
Token ID19351
Feature activation-4.89e-01

Pos logit contributions
gement+6.329
llah+5.842
Brit+5.484
Asset+5.365
gment+5.121
Neg logit contributions
 Duo-17.418
 conduc-16.486
 Phi-15.914
 unbeliev-15.884
 Solo-15.587
Token--+
Token ID44785
Feature activation-4.67e-01

Pos logit contributions
gement+4.603
llah+4.388
gment+4.201
Brit+4.142
Asset+4.048
Neg logit contributions
 Duo-6.038
enf-5.632
 Phi-5.489
 Solo-5.398
agra-5.387
Token #
Token ID1303
Feature activation-3.15e-01

Pos logit contributions
gement+7.641
gment+7.104
llah+7.010
Brit+6.321
 warranty+6.261
Neg logit contributions
 Duo-10.018
enf-9.785
eny-9.379
agra-9.090
 Phi-8.958
Token initialize
Token ID41216
Feature activation-6.49e-01

Pos logit contributions
gement+2.777
llah+2.676
gment+2.525
Brit+2.358
Asset+2.325
Neg logit contributions
 Duo-4.155
enf-3.984
agra-3.930
 Phi-3.872
eny-3.768
Token a
Token ID257
Feature activation-8.57e-01

Pos logit contributions
gement+5.163
llah+4.948
gment+4.622
Brit+4.504
Asset+4.478
Neg logit contributions
 Duo-8.418
enf-8.178
agra-8.080
eny-7.832
 Phi-7.798
Token rectangular
Token ID36954
Feature activation-7.57e-01

Pos logit contributions
gement+6.812
gment+6.343
llah+6.208
Brit+5.454
 warranty+5.286
Neg logit contributions
enf-10.658
 Duo-10.212
eny-10.028
agra-10.027
glers-9.836
Token and
Token ID290
Feature activation-5.18e-01

Pos logit contributions
gement+6.156
llah+5.904
gment+5.631
Asset+5.157
Brit+5.074
Neg logit contributions
 Duo-9.509
enf-9.380
agra-9.161
eny-8.954
 Phi-8.670
Token square
Token ID6616
Feature activation-1.47e+00

Pos logit contributions
gement+4.883
llah+4.571
gment+4.429
Brit+4.300
Asset+4.112
Neg logit contributions
 Duo-6.190
enf-6.050
agra-5.671
eny-5.665
 Phi-5.645
Token struct
Token ID2878
Feature activation-2.45e-01

Pos logit contributions
Struct+3.797
gement+3.665
 warranty+3.538
gment+3.210
llah+3.073
Neg logit contributions
 Duo-21.850
enf-21.532
eny-21.237
agra-21.232
glers-21.188
Tokenuring
Token ID870
Feature activation-8.92e-01

Pos logit contributions
gement+2.675
llah+2.501
gment+2.477
Brit+2.361
Asset+2.313
Neg logit contributions
 Duo-2.776
enf-2.604
 Phi-2.547
agra-2.535
eny-2.477
Token kernel
Token ID9720
Feature activation-1.05e+00

Pos logit contributions
gement+3.969
llah+3.915
gment+3.512
 warranty+3.155
 GPU+2.701
Neg logit contributions
agra-13.654
 Duo-13.640
enf-13.556
 Cosponsors-13.105
eny-13.051
Token rect
Token ID13621
Feature activation-1.33e-01

Pos logit contributions
gement+6.061
gment+5.319
llah+5.181
 rect+4.584
 warranty+4.557
Neg logit contributions
 Duo-14.632
enf-14.526
agra-14.037
eny-13.800
glers-13.780
TokenK
Token ID42
Feature activation-7.65e-01

Pos logit contributions
gement+1.624
llah+1.565
Brit+1.511
gment+1.507
Asset+1.467
Neg logit contributions
 Duo-1.321
enf-1.253
agra-1.210
 Phi-1.203
eny-1.187
Token-
Token ID12
Feature activation-5.52e-02

Pos logit contributions
gement+0.296
llah+0.284
gment+0.272
Brit+0.268
Asset+0.260
Neg logit contributions
 Duo-0.272
enf-0.257
 Phi-0.251
agra-0.251
eny-0.245
TokenCook
Token ID28937
Feature activation+3.67e-01

Pos logit contributions
gement+0.664
llah+0.638
gment+0.611
Brit+0.607
Asset+0.588
Neg logit contributions
 Duo-0.566
enf-0.533
 Phi-0.522
agra-0.519
eny-0.507
Tokenman
Token ID805
Feature activation+9.78e-02

Pos logit contributions
 Duo+4.105
 Phi+3.858
enf+3.842
agra+3.710
eny+3.656
Neg logit contributions
gement-4.060
llah-3.884
gment-3.685
Brit-3.658
Asset-3.483
Token University
Token ID2059
Feature activation+5.99e-02

Pos logit contributions
 Duo+1.087
enf+1.025
 Phi+1.013
agra+0.993
eny+0.977
Neg logit contributions
gement-1.102
llah-1.055
gment-1.008
Brit-0.995
Asset-0.961
Token celebrated
Token ID13943
Feature activation-5.11e-01

Pos logit contributions
 Duo+0.674
enf+0.638
 Phi+0.629
agra+0.618
eny+0.610
Neg logit contributions
gement-0.666
llah-0.639
gment-0.608
Brit-0.600
Asset-0.580
Token its
Token ID663
Feature activation-1.43e+00

Pos logit contributions
gement+5.070
llah+4.800
Brit+4.595
gment+4.526
Asset+4.351
Neg logit contributions
 Duo-5.778
enf-5.752
agra-5.596
eny-5.372
cum-5.315
Token hundred
Token ID3470
Feature activation-5.14e-01

Pos logit contributions
gement+11.517
llah+10.198
Brit+10.141
 Roy+9.809
gment+9.746
Neg logit contributions
enf-14.000
agra-13.908
eless-13.459
 Duo-13.367
cum-13.339
Tokenth
Token ID400
Feature activation-7.42e-01

Pos logit contributions
gement+4.708
llah+4.435
gment+4.202
Brit+4.059
Asset+3.902
Neg logit contributions
 Duo-6.408
enf-6.073
agra-6.004
 Phi-5.870
eny-5.774
Token anniversary
Token ID11162
Feature activation-5.47e-01

Pos logit contributions
gement+5.608
llah+5.015
gment+4.847
Brit+4.742
Asset+4.572
Neg logit contributions
 Duo-9.749
enf-9.737
agra-9.725
eny-9.259
eless-9.068
Token from
Token ID422
Feature activation-2.49e-01

Pos logit contributions
gement+5.576
llah+5.234
gment+5.080
Brit+5.004
Asset+4.636
Neg logit contributions
 Duo-6.144
enf-6.028
agra-5.893
eless-5.580
eny-5.553
Token its
Token ID663
Feature activation-8.01e-01

Pos logit contributions
gement+2.837
llah+2.698
gment+2.591
Brit+2.554
Asset+2.454
Neg logit contributions
 Duo-2.643
enf-2.591
agra-2.490
eny-2.433
eless-2.398
Token of
Token ID286
Feature activation+7.70e-03

Pos logit contributions
 Duo+0.702
enf+0.664
 Phi+0.649
agra+0.646
eny+0.635
Neg logit contributions
gement-0.709
llah-0.682
gment-0.649
Brit-0.644
Asset-0.621
Token the
Token ID262
Feature activation-5.06e-01

Pos logit contributions
 Duo+0.082
enf+0.078
 Phi+0.076
agra+0.076
eny+0.074
Neg logit contributions
gement-0.089
llah-0.086
gment-0.082
Brit-0.081
Asset-0.079
Token state
Token ID1181
Feature activation+1.58e-01

Pos logit contributions
gement+5.476
llah+5.220
gment+4.955
Brit+4.874
Asset+4.696
Neg logit contributions
 Duo-5.432
enf-5.223
 Phi-5.053
agra-4.999
eny-4.954
Token,"
Token ID553
Feature activation-8.61e-01

Pos logit contributions
 Duo+1.741
enf+1.644
 Phi+1.608
agra+1.604
eny+1.578
Neg logit contributions
gement-1.768
llah-1.697
gment-1.616
Brit-1.607
Asset-1.550
Token the
Token ID262
Feature activation-5.62e-01

Pos logit contributions
gement+8.077
llah+7.272
gment+7.111
Asset+6.645
Brit+6.621
Neg logit contributions
enf-9.418
 Duo-9.242
agra-9.116
eny-9.026
 Phi-8.820
Token bench
Token ID7624
Feature activation-1.42e+00

Pos logit contributions
gement+4.706
llah+4.266
gment+4.140
Brit+3.927
Asset+3.646
Neg logit contributions
enf-7.307
 Duo-7.247
eny-6.996
 Phi-6.933
agra-6.915
Token said
Token ID531
Feature activation-6.61e-01

Pos logit contributions
gement+10.445
gment+9.268
llah+8.236
 Roy+7.983
Brit+7.734
Neg logit contributions
enf-16.607
 Duo-15.361
eno-15.193
 Phi-14.923
eny-14.842
Token.
Token ID13
Feature activation-5.06e-01

Pos logit contributions
gement+6.320
llah+5.872
gment+5.765
Brit+5.405
Asset+5.269
Neg logit contributions
enf-7.705
 Duo-7.636
 Phi-7.203
agra-7.166
eny-7.118
Token
Token ID198
Feature activation-1.27e-01

Pos logit contributions
gement+4.837
llah+4.606
Brit+4.335
gment+4.304
Asset+4.182
Neg logit contributions
 Duo-5.962
enf-5.751
eny-5.633
 Phi-5.625
agra-5.623
Token
Token ID198
Feature activation-4.40e-01

Pos logit contributions
gement+1.312
llah+1.264
gment+1.196
Brit+1.195
Asset+1.150
Neg logit contributions
 Duo-1.505
enf-1.438
agra-1.408
 Phi-1.402
eny-1.381
TokenThe
Token ID464
Feature activation-5.50e-01

Pos logit contributions
gement+4.713
llah+4.443
Brit+4.418
Asset+4.272
gment+4.257
Neg logit contributions
 Duo-4.968
 Phi-4.631
enf-4.555
agra-4.489
eny-4.457
Token from
Token ID422
Feature activation+1.38e-01

Pos logit contributions
 Duo+3.039
 Phi+2.832
enf+2.793
agra+2.720
cum+2.673
Neg logit contributions
gement-4.710
llah-4.602
gment-4.401
Brit-4.400
Asset-4.298
Token an
Token ID281
Feature activation+1.74e-02

Pos logit contributions
 Duo+1.439
enf+1.335
 Phi+1.333
agra+1.321
eny+1.284
Neg logit contributions
gement-1.630
llah-1.571
gment-1.496
Brit-1.481
Asset-1.443
Token out
Token ID503
Feature activation-1.81e-01

Pos logit contributions
 Duo+0.195
enf+0.184
 Phi+0.181
agra+0.180
eny+0.176
Neg logit contributions
gement-0.194
llah-0.186
gment-0.177
Brit-0.175
Asset-0.169
Token-
Token ID12
Feature activation-6.19e-01

Pos logit contributions
gement+1.914
llah+1.816
gment+1.749
Brit+1.717
Asset+1.648
Neg logit contributions
 Duo-2.110
enf-2.006
 Phi-1.968
agra-1.947
eny-1.922
Tokenof
Token ID1659
Feature activation-1.22e+00

Pos logit contributions
gement+5.864
Brit+5.596
gment+5.478
llah+5.419
Asset+5.213
Neg logit contributions
 Duo-7.616
 Phi-7.103
enf-6.907
agra-6.826
eny-6.730
Token-
Token ID12
Feature activation-1.42e+00

Pos logit contributions
gement+7.298
llah+6.853
gment+6.679
 warranty+6.337
Brit+6.306
Neg logit contributions
 Duo-15.742
enf-15.182
eny-14.729
 conduc-14.584
 Cosponsors-14.384
Tokencomp
Token ID5589
Feature activation-2.16e-01

Pos logit contributions
Brit+11.045
gement+10.542
priv+10.224
home+10.168
gment+10.136
Neg logit contributions
 Duo-16.148
 Phi-14.663
 Cosponsors-14.284
eny-14.192
 Solo-13.929
Tokenetition
Token ID15620
Feature activation-9.89e-02

Pos logit contributions
gement+2.396
llah+2.255
gment+2.213
Brit+2.147
Asset+2.080
Neg logit contributions
 Duo-2.395
enf-2.277
 Phi-2.263
agra-2.213
eny-2.149
Token test
Token ID1332
Feature activation-1.37e-01

Pos logit contributions
gement+1.107
llah+1.057
gment+1.016
Brit+0.997
Asset+0.960
Neg logit contributions
 Duo-1.091
enf-1.046
 Phi-1.011
agra-1.011
eny-0.995
Token conducted
Token ID5952
Feature activation+4.27e-01

Pos logit contributions
gement+1.553
llah+1.483
gment+1.424
Brit+1.404
Asset+1.343
Neg logit contributions
 Duo-1.480
enf-1.414
 Phi-1.375
agra-1.372
eny-1.343
Token on
Token ID319
Feature activation+2.18e-02

Pos logit contributions
 Duo+3.970
 Phi+3.637
agra+3.575
enf+3.548
eny+3.463
Neg logit contributions
gement-5.422
llah-5.282
gment-5.010
Asset-4.974
Brit-4.949
Token can
Token ID460
Feature activation-2.42e-01

Pos logit contributions
gement+4.463
llah+4.235
gment+4.088
Brit+4.006
Asset+3.872
Neg logit contributions
 Duo-5.053
enf-4.760
 Phi-4.695
agra-4.679
eny-4.562
Token be
Token ID307
Feature activation-5.39e-01

Pos logit contributions
gement+2.780
llah+2.656
gment+2.569
Brit+2.498
Asset+2.405
Neg logit contributions
 Duo-2.559
enf-2.428
 Phi-2.391
agra-2.378
eny-2.314
Token strengthened
Token ID27324
Feature activation-3.30e-01

Pos logit contributions
gement+5.690
gment+5.347
llah+5.241
Brit+4.959
Asset+4.898
Neg logit contributions
 Duo-5.860
enf-5.544
agra-5.437
 Phi-5.369
eny-5.345
Token over
Token ID625
Feature activation+8.24e-02

Pos logit contributions
gement+3.715
llah+3.560
gment+3.423
Brit+3.328
Asset+3.226
Neg logit contributions
 Duo-3.549
enf-3.357
 Phi-3.300
agra-3.287
eny-3.187
Token time
Token ID640
Feature activation-2.47e-01

Pos logit contributions
 Duo+0.812
enf+0.760
 Phi+0.748
agra+0.741
eny+0.726
Neg logit contributions
gement-1.020
llah-0.984
gment-0.943
Brit-0.936
Asset-0.907
Token with
Token ID351
Feature activation-1.41e+00

Pos logit contributions
gement+2.765
llah+2.633
gment+2.549
Brit+2.477
Asset+2.408
Neg logit contributions
 Duo-2.707
enf-2.560
 Phi-2.515
agra-2.509
eny-2.433
Token practice
Token ID3357
Feature activation-5.21e-01

Pos logit contributions
gement+12.438
gment+12.125
llah+11.867
Asset+10.685
 attendance+10.611
Neg logit contributions
enf-13.131
 Duo-12.793
agra-12.493
eny-12.138
eless-11.748
Token.
Token ID13
Feature activation-8.49e-01

Pos logit contributions
gement+5.288
llah+4.966
gment+4.856
Brit+4.601
Asset+4.499
Neg logit contributions
 Duo-6.015
enf-5.729
agra-5.677
 Phi-5.587
eny-5.412
Token But
Token ID887
Feature activation-8.24e-01

Pos logit contributions
gement+7.038
Brit+6.862
llah+6.793
Asset+6.516
 Classes+6.506
Neg logit contributions
enf-9.567
agra-9.506
eny-9.335
 Duo-9.117
eless-8.867
Token this
Token ID428
Feature activation-4.51e-01

Pos logit contributions
gement+7.868
llah+7.485
gment+7.221
Brit+6.954
Asset+6.635
Neg logit contributions
 Duo-8.719
enf-8.690
agra-8.502
eless-8.193
eny-8.144
Token ability
Token ID2694
Feature activation-4.62e-01

Pos logit contributions
gement+4.703
llah+4.375
gment+4.306
Brit+4.131
Asset+4.027
Neg logit contributions
 Duo-5.065
enf-4.846
agra-4.788
 Phi-4.644
eny-4.635
Token from
Token ID422
Feature activation-3.21e-01

Pos logit contributions
 Duo+4.198
 Phi+3.910
enf+3.814
agra+3.710
eny+3.666
Neg logit contributions
gement-4.927
llah-4.718
gment-4.548
Asset-4.470
Brit-4.454
Token the
Token ID262
Feature activation-6.78e-01

Pos logit contributions
gement+3.462
llah+3.385
gment+3.150
Brit+3.139
Asset+2.993
Neg logit contributions
 Duo-3.559
enf-3.466
agra-3.354
 Phi-3.264
eny-3.248
Token 1980
Token ID7169
Feature activation-1.04e-01

Pos logit contributions
gement+6.300
llah+6.297
Brit+5.725
gment+5.617
Asset+5.218
Neg logit contributions
 Duo-7.802
enf-7.745
agra-7.414
cum-7.244
 Phi-7.228
Tokens
Token ID82
Feature activation+4.43e-01

Pos logit contributions
gement+0.869
llah+0.824
gment+0.768
Brit+0.757
Asset+0.721
Neg logit contributions
 Duo-1.449
enf-1.392
 Phi-1.367
agra-1.364
eny-1.340
Token to
Token ID284
Feature activation-5.84e-01

Pos logit contributions
 Duo+4.623
 Phi+4.298
enf+4.246
agra+4.173
eny+4.156
Neg logit contributions
gement-5.105
llah-4.776
gment-4.660
Brit-4.501
Asset-4.484
Token the
Token ID262
Feature activation-1.41e+00

Pos logit contributions
gement+5.490
llah+5.445
gment+4.863
Brit+4.836
Asset+4.518
Neg logit contributions
 Duo-6.954
enf-6.767
agra-6.652
eless-6.486
 Phi-6.400
Token 1990
Token ID6303
Feature activation-3.77e-01

Pos logit contributions
llah+10.178
gement+9.980
Brit+8.364
gment+8.333
 warranty+7.468
Neg logit contributions
 Duo-15.679
enf-15.654
agra-15.209
eless-14.944
cum-14.296
Tokens
Token ID82
Feature activation+6.33e-01

Pos logit contributions
gement+2.699
llah+2.497
gment+2.313
Brit+2.296
Asset+2.130
Neg logit contributions
 Duo-5.632
enf-5.351
 Phi-5.350
agra-5.297
eny-5.157
Token,
Token ID11
Feature activation+4.10e-01

Pos logit contributions
 Duo+5.757
 Phi+5.298
enf+5.126
 Cambodia+5.050
eny+5.029
Neg logit contributions
gement-7.894
llah-7.479
gment-7.181
Asset-7.080
Brit-7.062
Token his
Token ID465
Feature activation+3.99e-01

Pos logit contributions
 Duo+4.084
 Phi+3.773
enf+3.690
agra+3.641
eny+3.575
Neg logit contributions
gement-4.947
llah-4.747
gment-4.536
Asset-4.469
Brit-4.460
Token family
Token ID1641
Feature activation-3.53e-01

Pos logit contributions
 Duo+4.382
 Phi+4.088
enf+4.018
eny+3.921
agra+3.905
Neg logit contributions
gement-4.447
llah-4.253
gment-4.039
Asset-4.004
Brit-3.998
Token refusal
Token ID17387
Feature activation-7.48e-01

Pos logit contributions
gement+1.334
llah+1.272
gment+1.223
Brit+1.202
Asset+1.164
Neg logit contributions
 Duo-1.323
enf-1.261
 Phi-1.226
agra-1.224
eny-1.205
Token to
Token ID284
Feature activation-2.83e-01

Pos logit contributions
gement+7.340
llah+6.574
gment+6.456
Brit+6.289
Asset+5.774
Neg logit contributions
 Duo-8.608
agra-8.244
enf-8.179
 Phi-7.896
eny-7.781
Token attend
Token ID5262
Feature activation-8.44e-01

Pos logit contributions
gement+3.147
llah+3.036
gment+2.883
Brit+2.836
Asset+2.694
Neg logit contributions
 Duo-3.046
enf-2.943
agra-2.862
 Phi-2.824
eny-2.784
Token a
Token ID257
Feature activation-3.99e-01

Pos logit contributions
gement+8.058
llah+7.160
Brit+6.970
gment+6.965
Asset+6.582
Neg logit contributions
 Duo-9.409
enf-9.333
eny-8.909
agra-8.850
eless-8.850
Token meeting
Token ID3249
Feature activation-6.47e-01

Pos logit contributions
gement+4.159
llah+3.871
gment+3.748
Brit+3.695
Asset+3.508
Neg logit contributions
 Duo-4.541
enf-4.459
eny-4.287
agra-4.271
eless-4.184
Token,
Token ID11
Feature activation-1.40e+00

Pos logit contributions
gement+6.417
llah+5.919
gment+5.775
Brit+5.709
Asset+5.181
Neg logit contributions
 Duo-7.373
enf-7.173
agra-7.029
eny-6.773
 Phi-6.758
Token Pu
Token ID11634
Feature activation-8.81e-02

Pos logit contributions
gement+13.131
 Roy+12.469
llah+11.990
gment+11.850
Brit+11.529
Neg logit contributions
 Duo-11.791
enf-11.742
eny-11.340
agra-11.262
eless-10.952
Tokenig
Token ID328
Feature activation-3.77e-01

Pos logit contributions
gement+1.248
llah+1.210
gment+1.166
Brit+1.149
Asset+1.123
Neg logit contributions
 Duo-0.718
enf-0.660
 Phi-0.644
agra-0.635
eny-0.617
Tokendem
Token ID9536
Feature activation-6.36e-01

Pos logit contributions
gement+3.330
llah+3.126
gment+2.992
Brit+2.854
Asset+2.674
Neg logit contributions
 Duo-5.025
 Phi-4.764
agra-4.722
enf-4.684
eny-4.567
Tokenont
Token ID756
Feature activation-7.50e-01

Pos logit contributions
gement+6.438
llah+6.029
gment+5.866
Brit+5.582
Asset+5.209
Neg logit contributions
 Duo-7.374
 Phi-6.740
agra-6.736
enf-6.474
 Solo-6.433
Token said
Token ID531
Feature activation-3.78e-01

Pos logit contributions
gement+6.427
llah+6.202
gment+5.581
Brit+5.513
Asset+5.196
Neg logit contributions
 Duo-9.076
agra-8.844
enf-8.629
 Phi-8.609
cum-8.341
Token bank
Token ID3331
Feature activation+1.41e-01

Pos logit contributions
 Duo+2.868
 Phi+2.654
enf+2.652
agra+2.568
eny+2.538
Neg logit contributions
gement-3.056
llah-2.972
Brit-2.815
gment-2.797
Asset-2.671
Token operates
Token ID14051
Feature activation+5.95e-01

Pos logit contributions
 Duo+1.572
enf+1.476
 Phi+1.457
agra+1.427
eny+1.412
Neg logit contributions
gement-1.577
llah-1.523
Brit-1.434
gment-1.432
Asset-1.378
Token,"
Token ID553
Feature activation-9.59e-01

Pos logit contributions
 Duo+5.948
 Phi+5.531
enf+5.519
 cf+5.300
eny+5.242
Neg logit contributions
gement-6.885
llah-6.650
Brit-6.309
gment-6.273
Asset-5.911
Token Z
Token ID1168
Feature activation+5.23e-02

Pos logit contributions
gement+9.515
llah+9.180
gment+8.438
Asset+8.307
Brit+8.294
Neg logit contributions
enf-9.854
 Duo-9.458
agra-9.404
eny-8.783
 Phi-8.735
Tokenet
Token ID316
Feature activation-7.90e-02

Pos logit contributions
 Duo+0.452
enf+0.425
 Phi+0.411
agra+0.410
eny+0.402
Neg logit contributions
gement-0.712
llah-0.687
gment-0.660
Brit-0.655
Asset-0.637
Tokeni
Token ID72
Feature activation-1.39e+00

Pos logit contributions
gement+0.913
llah+0.879
gment+0.839
Brit+0.827
Asset+0.801
Neg logit contributions
 Duo-0.850
enf-0.800
 Phi-0.784
agra-0.777
eny-0.762
Token said
Token ID531
Feature activation+8.15e-02

Pos logit contributions
gement+11.155
llah+10.817
gment+9.747
Brit+9.638
Asset+9.258
Neg logit contributions
enf-14.351
 Duo-14.064
agra-13.477
 Phi-13.268
ilton-12.919
Token at
Token ID379
Feature activation-3.36e-01

Pos logit contributions
 Duo+0.836
enf+0.784
 Phi+0.772
agra+0.762
eny+0.751
Neg logit contributions
gement-0.983
llah-0.946
gment-0.906
Brit-0.895
Asset-0.869
Token a
Token ID257
Feature activation-6.09e-01

Pos logit contributions
gement+3.470
llah+3.329
gment+3.104
Brit+3.071
Asset+3.063
Neg logit contributions
 Duo-3.887
enf-3.828
agra-3.642
 Phi-3.599
eny-3.590
Token media
Token ID2056
Feature activation-5.12e-02

Pos logit contributions
gement+6.282
llah+5.948
gment+5.672
Asset+5.554
Brit+5.447
Neg logit contributions
enf-6.819
 Duo-6.598
agra-6.228
eny-6.213
 Phi-6.113
Token briefing
Token ID17719
Feature activation-5.39e-01

Pos logit contributions
gement+0.554
llah+0.531
gment+0.505
Brit+0.497
Asset+0.482
Neg logit contributions
 Duo-0.587
enf-0.558
 Phi-0.547
agra-0.544
eny-0.534
Token:
Token ID25
Feature activation-5.60e-01

Pos logit contributions
gement+4.374
llah+4.221
Brit+4.021
gment+3.978
Asset+3.818
Neg logit contributions
 Duo-4.777
enf-4.574
agra-4.566
 Phi-4.409
eny-4.353
Token Frank
Token ID5278
Feature activation-3.62e-01

Pos logit contributions
gement+5.763
llah+5.506
Brit+5.303
gment+5.266
Asset+5.021
Neg logit contributions
 Duo-6.025
enf-5.874
agra-5.828
eny-5.553
 Phi-5.551
Token Miller
Token ID7920
Feature activation-1.18e-01

Pos logit contributions
gement+2.935
llah+2.853
gment+2.648
Brit+2.567
Asset+2.470
Neg logit contributions
 Duo-4.887
agra-4.732
enf-4.700
eny-4.596
 Phi-4.552
Token.
Token ID13
Feature activation-6.09e-01

Pos logit contributions
gement+1.323
llah+1.270
gment+1.209
Brit+1.194
Asset+1.156
Neg logit contributions
 Duo-1.286
enf-1.221
 Phi-1.197
agra-1.197
eny-1.167
Token How
Token ID1374
Feature activation-1.09e+00

Pos logit contributions
gement+5.338
llah+5.211
Brit+5.075
Asset+4.841
gment+4.745
Neg logit contributions
enf-7.064
 Duo-6.980
agra-6.960
eny-6.857
 Phi-6.618
Token could
Token ID714
Feature activation-1.39e+00

Pos logit contributions
gement+10.143
llah+9.504
gment+8.999
Brit+8.902
 Roy+8.669
Neg logit contributions
 Duo-10.912
enf-10.839
agra-10.515
 Phi-10.042
cum-9.943
Token this
Token ID428
Feature activation-1.05e+00

Pos logit contributions
gement+11.100
llah+10.400
 Roy+10.386
Brit+10.200
Asset+9.986
Neg logit contributions
enf-13.530
agra-13.476
 Duo-12.825
eny-12.785
 Phi-12.441
Token go
Token ID467
Feature activation+2.39e-01

Pos logit contributions
gement+10.090
llah+9.498
Brit+9.023
gment+8.806
Asset+8.648
Neg logit contributions
agra-10.534
 Duo-10.367
enf-10.354
 Phi-10.014
eny-9.797
Token wrong
Token ID2642
Feature activation-3.18e-01

Pos logit contributions
 Duo+2.407
enf+2.254
 Phi+2.217
agra+2.195
eny+2.170
Neg logit contributions
gement-2.880
llah-2.768
gment-2.646
Brit-2.633
Asset-2.576
Token?!
Token ID12248
Feature activation-6.40e-01

Pos logit contributions
gement+3.027
llah+2.882
gment+2.725
Brit+2.684
Asset+2.565
Neg logit contributions
 Duo-3.945
enf-3.799
agra-3.737
 Phi-3.725
eny-3.641
Token
Token ID198
Feature activation+6.58e-02

Pos logit contributions
gement+5.704
llah+5.652
Brit+5.473
Asset+5.216
gment+5.065
Neg logit contributions
enf-7.297
agra-7.244
 Duo-7.187
eny-7.022
 Phi-6.812
TokenWe
Token ID1135
Feature activation+3.64e-02

Pos logit contributions
gement+4.395
llah+4.286
Brit+4.171
Asset+4.088
gment+4.058
Neg logit contributions
 Duo-4.610
enf-4.370
agra-4.294
 Phi-4.288
eny-4.186
Token built
Token ID3170
Feature activation-9.13e-02

Pos logit contributions
 Duo+0.420
enf+0.398
 Phi+0.390
agra+0.388
eny+0.382
Neg logit contributions
gement-0.392
llah-0.376
gment-0.357
Brit-0.354
Asset-0.342
Token it
Token ID340
Feature activation+4.18e-01

Pos logit contributions
gement+1.026
llah+0.985
gment+0.943
Brit+0.925
Asset+0.899
Neg logit contributions
 Duo-1.001
enf-0.954
 Phi-0.929
agra-0.927
eny-0.909
Token entirely
Token ID5000
Feature activation+9.05e-02

Pos logit contributions
 Duo+3.926
 Phi+3.596
enf+3.593
agra+3.499
eny+3.468
Neg logit contributions
gement-5.318
llah-5.068
Brit-4.884
gment-4.858
Asset-4.778
Token in
Token ID287
Feature activation-7.25e-01

Pos logit contributions
 Duo+0.916
enf+0.855
 Phi+0.841
agra+0.828
eny+0.816
Neg logit contributions
gement-1.108
llah-1.066
gment-1.018
Brit-1.010
Asset-0.982
Token-
Token ID12
Feature activation-1.39e+00

Pos logit contributions
gement+7.489
llah+7.369
gment+6.998
Asset+6.608
Brit+6.589
Neg logit contributions
enf-7.276
agra-7.158
 Duo-6.997
eless-6.806
eny-6.788
Tokenhouse
Token ID4803
Feature activation-9.06e-02

Pos logit contributions
gement+10.702
llah+10.673
Brit+10.482
Asset+10.469
home+10.206
Neg logit contributions
 Duo-13.913
agra-13.894
 Phi-13.762
enf-13.439
eny-13.121
Token?
Token ID30
Feature activation-2.35e-01

Pos logit contributions
gement+1.083
llah+1.045
gment+1.000
Brit+0.986
Asset+0.956
Neg logit contributions
 Duo-0.930
enf-0.881
 Phi-0.860
agra-0.856
eny-0.837
Tokenthe
Token ID1169
Feature activation-2.00e-01

Pos logit contributions
gement+2.664
llah+2.582
gment+2.452
Brit+2.435
Asset+2.376
Neg logit contributions
 Duo-2.480
enf-2.374
agra-2.336
 Phi-2.303
eny-2.261
Token hardware
Token ID6890
Feature activation+3.24e-01

Pos logit contributions
gement+2.205
llah+2.131
gment+2.029
Brit+1.992
Asset+1.934
Neg logit contributions
 Duo-2.191
enf-2.099
agra-2.039
 Phi-2.036
eny-1.994
Token and
Token ID290
Feature activation-3.98e-01

Pos logit contributions
 Duo+3.335
enf+3.094
 Phi+3.035
eny+2.978
agra+2.971
Neg logit contributions
gement-3.883
llah-3.716
Brit-3.559
gment-3.555
Asset-3.402
Token |
Token ID930
Feature activation-4.47e-01

Pos logit contributions
gement+2.999
llah+2.799
gment+2.637
Brit+2.596
Asset+2.550
Neg logit contributions
 Duo-5.336
enf-5.258
agra-5.119
 Phi-5.023
eny-5.023
Token void
Token ID7951
Feature activation-5.58e-01

Pos logit contributions
gement+5.610
gment+5.323
llah+5.256
Asset+5.032
Brit+4.950
Neg logit contributions
 Duo-4.045
enf-3.985
agra-3.830
eny-3.699
 Phi-3.667
Token *
Token ID1635
Feature activation-8.72e-01

Pos logit contributions
gement+7.191
llah+6.936
gment+6.794
Brit+6.696
Asset+6.585
Neg logit contributions
enf-4.508
agra-4.461
 Duo-4.430
eny-4.175
eless-3.978
Tokenarg
Token ID853
Feature activation-6.97e-01

Pos logit contributions
gement+7.557
gment+7.058
llah+6.830
Brit+6.440
Asset+6.291
Neg logit contributions
 Duo-10.783
 Phi-10.243
 conduc-9.896
enf-9.738
 Solo-9.653
Token |
Token ID930
Feature activation-7.98e-01

Pos logit contributions
gement+4.338
llah+4.002
gment+3.803
Brit+3.662
Asset+3.543
Neg logit contributions
 Duo-9.991
enf-9.956
agra-9.834
 Phi-9.583
 conduc-9.546
Token +
Token ID1343
Feature activation-1.39e+00

Pos logit contributions
gement+7.674
gment+6.911
llah+6.793
Asset+6.410
Brit+6.217
Neg logit contributions
enf-9.004
 Duo-8.893
agra-8.407
 Phi-8.383
eny-8.331
Token--------------------
Token ID19351
Feature activation-5.03e-01

Pos logit contributions
gement+6.756
llah+6.047
Brit+5.482
Asset+5.348
gment+5.340
Neg logit contributions
 Duo-20.091
 conduc-19.159
 unbeliev-18.619
 Phi-18.478
 Solo-18.174
Token--+
Token ID44785
Feature activation-4.89e-01

Pos logit contributions
gement+4.765
llah+4.543
gment+4.349
Brit+4.288
Asset+4.167
Neg logit contributions
 Duo-6.176
enf-5.770
 Phi-5.644
agra-5.536
 Solo-5.529
Token |
Token ID930
Feature activation-7.19e-01

Pos logit contributions
gement+3.496
llah+3.296
Brit+3.070
gment+3.057
Asset+2.963
Neg logit contributions
 Duo-6.888
enf-6.753
agra-6.707
 Phi-6.509
eny-6.495
Token [
Token ID685
Feature activation-5.28e-01

Pos logit contributions
gement+4.684
gment+4.188
llah+3.992
Asset+3.695
Brit+3.238
Neg logit contributions
enf-10.433
 Duo-10.408
agra-10.178
eny-9.699
cum-9.688
Tokenreturn
Token ID7783
Feature activation-7.66e-01

Pos logit contributions
gement+5.629
llah+5.262
gment+5.197
Asset+4.999
Brit+4.896
Neg logit contributions
 Duo-6.032
 Phi-5.563
agra-5.548
enf-5.540
 Solo-5.218
Tokenv
Token ID85
Feature activation-7.00e-02

Pos logit contributions
gement+4.954
gment+4.710
llah+4.319
Brit+3.979
Asset+3.644
Neg logit contributions
 Duo-11.777
 conduc-11.614
 Phi-11.210
enf-11.110
agra-11.057
Token2
Token ID17
Feature activation-2.78e-01

Pos logit contributions
gement+0.882
llah+0.850
gment+0.819
Brit+0.812
Asset+0.788
Neg logit contributions
 Duo-0.675
enf-0.639
 Phi-0.621
agra-0.618
eny-0.607
Token.
Token ID764
Feature activation-3.40e-01

Pos logit contributions
gement+2.529
llah+2.438
gment+2.297
Brit+2.286
Asset+2.208
Neg logit contributions
 Duo-3.543
enf-3.430
agra-3.384
eny-3.304
 Phi-3.287
Token im
Token ID545
Feature activation-1.17e-01

Pos logit contributions
gement+3.891
llah+3.665
gment+3.620
Brit+3.455
Asset+3.375
Neg logit contributions
 Duo-3.571
enf-3.482
agra-3.389
eny-3.352
 Phi-3.285
Tokenread
Token ID961
Feature activation-4.26e-01

Pos logit contributions
gement+1.030
llah+0.967
gment+0.921
Brit+0.895
Asset+0.847
Neg logit contributions
 Duo-1.596
enf-1.511
 Phi-1.497
agra-1.490
eny-1.451
Token (
Token ID357
Feature activation-1.38e+00

Pos logit contributions
gement+4.109
llah+3.839
Brit+3.727
gment+3.687
Asset+3.541
Neg logit contributions
 Duo-5.160
enf-4.979
agra-4.976
eny-4.869
 Phi-4.773
Token image
Token ID2939
Feature activation-1.03e+00

Pos logit contributions
gement+5.291
gment+4.232
llah+3.961
 pal+3.220
 ox+3.020
Neg logit contributions
enf-20.926
agra-20.561
 Duo-20.550
Pros-20.440
glers-20.382
TokenPath
Token ID15235
Feature activation-9.42e-01

Pos logit contributions
gement+4.531
Brit+4.528
Asset+4.371
llah+4.146
gment+3.824
Neg logit contributions
 Duo-15.515
agra-15.408
enf-15.238
 conduc-14.828
eny-14.802
Token )
Token ID1267
Feature activation-1.19e+00

Pos logit contributions
gement+2.267
llah+2.157
Brit+1.937
gment+1.563
Asset+1.547
Neg logit contributions
 Duo-16.300
enf-16.261
agra-16.204
 conduc-16.134
eny-15.753
Token image
Token ID2939
Feature activation-7.46e-01

Pos logit contributions
gement+5.255
gment+4.413
llah+3.719
 warranty+3.514
 pal+3.502
Neg logit contributions
enf-18.311
 Duo-17.704
glers-17.624
agra-17.489
eny-17.361
Token =
Token ID796
Feature activation-4.93e-01

Pos logit contributions
gement+5.150
Brit+5.029
llah+4.920
Asset+4.767
gment+4.643
Neg logit contributions
 Duo-9.970
enf-9.947
agra-9.836
eny-9.606
 conduc-9.491
Token work
Token ID670
Feature activation-3.84e-01

Pos logit contributions
gement+0.158
llah+0.152
gment+0.145
Brit+0.143
Asset+0.139
Neg logit contributions
 Duo-0.149
enf-0.141
 Phi-0.138
agra-0.137
eny-0.134
Token with
Token ID351
Feature activation-2.95e-01

Pos logit contributions
gement+4.080
llah+3.872
gment+3.761
Brit+3.625
Asset+3.469
Neg logit contributions
 Duo-4.299
enf-4.128
 Phi-3.992
agra-3.970
eny-3.943
Token our
Token ID674
Feature activation+8.86e-02

Pos logit contributions
gement+3.268
llah+3.070
gment+2.991
Brit+2.907
Asset+2.814
Neg logit contributions
 Duo-3.244
enf-3.112
agra-3.080
eny-3.006
 Phi-2.991
Token civil
Token ID3026
Feature activation+4.53e-01

Pos logit contributions
 Duo+1.002
enf+0.946
 Phi+0.934
agra+0.917
eny+0.904
Neg logit contributions
gement-0.973
llah-0.936
gment-0.888
Brit-0.877
Asset-0.851
Token servants
Token ID17523
Feature activation+1.54e-01

Pos logit contributions
 Duo+5.079
enf+4.824
 Phi+4.730
agra+4.618
eny+4.580
Neg logit contributions
gement-4.741
llah-4.565
gment-4.283
Brit-4.224
Asset-4.205
Token,"
Token ID553
Feature activation-1.38e+00

Pos logit contributions
 Duo+1.581
enf+1.495
 Phi+1.465
agra+1.444
eny+1.427
Neg logit contributions
gement-1.834
llah-1.756
gment-1.685
Brit-1.661
Asset-1.627
Token he
Token ID339
Feature activation-3.52e-01

Pos logit contributions
gement+11.068
llah+10.449
Asset+9.485
Brit+9.314
gment+9.266
Neg logit contributions
enf-13.743
 Duo-13.712
agra-13.437
 Phi-12.777
eny-12.725
Token said
Token ID531
Feature activation+1.84e-01

Pos logit contributions
gement+3.053
llah+2.882
gment+2.737
Brit+2.649
Asset+2.513
Neg logit contributions
 Duo-4.655
enf-4.449
 Phi-4.416
agra-4.382
eny-4.281
Token.
Token ID13
Feature activation-4.28e-01

Pos logit contributions
 Duo+1.896
enf+1.789
 Phi+1.745
agra+1.728
eny+1.718
Neg logit contributions
gement-2.186
llah-2.108
gment-2.006
Brit-1.986
Asset-1.931
Token
Token ID198
Feature activation-1.69e-01

Pos logit contributions
gement+4.240
llah+4.077
gment+3.821
Brit+3.790
Asset+3.647
Neg logit contributions
 Duo-5.072
enf-4.806
 Phi-4.703
agra-4.682
eny-4.643
Token
Token ID198
Feature activation-9.20e-01

Pos logit contributions
gement+1.702
llah+1.630
Brit+1.551
gment+1.549
Asset+1.488
Neg logit contributions
 Duo-2.031
enf-1.940
agra-1.906
 Phi-1.895
eny-1.868
Token
Token ID198
Feature activation-9.74e-01

Pos logit contributions
gement+1.855
llah+1.788
Brit+1.709
gment+1.689
Asset+1.643
Neg logit contributions
 Duo-2.244
enf-2.153
agra-2.105
 Phi-2.092
eny-2.075
TokenWith
Token ID3152
Feature activation-9.14e-01

Pos logit contributions
Brit+8.785
gement+8.451
llah+8.451
Asset+8.383
Article+7.738
Neg logit contributions
 Duo-10.871
 Phi-10.464
enf-10.187
agra-9.857
 dynam-9.669
Token the
Token ID262
Feature activation-1.01e+00

Pos logit contributions
gement+9.045
llah+8.581
gment+8.165
Brit+7.780
 attendance+7.504
Neg logit contributions
enf-9.402
 Duo-9.118
 Phi-8.760
eny-8.702
eless-8.626
Token Parliament
Token ID8411
Feature activation-7.17e-01

Pos logit contributions
gement+10.149
llah+9.595
gment+9.097
 attendance+8.841
Brit+8.808
Neg logit contributions
enf-9.962
eless-9.258
 Duo-9.131
eny-9.107
agra-9.102
Token being
Token ID852
Feature activation-6.72e-01

Pos logit contributions
gement+7.502
llah+7.059
gment+6.882
Brit+6.480
Asset+6.458
Neg logit contributions
 Duo-7.446
enf-7.329
 Phi-7.034
agra-6.997
eny-6.854
Token in
Token ID287
Feature activation-1.38e+00

Pos logit contributions
gement+6.783
llah+6.308
gment+6.115
Brit+5.801
Asset+5.710
Neg logit contributions
enf-7.321
 Duo-7.317
agra-6.904
eny-6.902
 Phi-6.757
Token session
Token ID6246
Feature activation-9.13e-01

Pos logit contributions
gement+11.324
llah+10.704
 attendance+10.626
gment+9.928
 Roy+9.576
Neg logit contributions
enf-13.850
 Duo-13.049
eless-12.894
eny-12.877
agra-12.582
Token,
Token ID11
Feature activation-9.86e-01

Pos logit contributions
gement+8.010
llah+7.628
gment+7.140
Brit+6.981
Asset+6.506
Neg logit contributions
enf-10.415
 Duo-10.172
agra-9.738
eny-9.542
 Phi-9.491
Token the
Token ID262
Feature activation-7.40e-01

Pos logit contributions
gement+8.813
llah+8.564
gment+7.766
Brit+7.692
 attendance+7.478
Neg logit contributions
enf-10.665
 Duo-10.195
agra-10.062
 Phi-9.870
eny-9.704
Token cant
Token ID18548
Feature activation+1.52e-01

Pos logit contributions
gement+6.849
llah+6.455
gment+6.111
Brit+5.806
 attendance+5.489
Neg logit contributions
enf-8.509
 Duo-8.153
agra-8.070
 Phi-8.059
eless-8.037
Tokeneen
Token ID6429
Feature activation-2.39e-01

Pos logit contributions
 Duo+1.720
enf+1.650
 Phi+1.607
agra+1.599
eny+1.592
Neg logit contributions
gement-1.645
llah-1.581
Brit-1.502
gment-1.501
Asset-1.451
Token [
Token ID685
Feature activation-4.79e-01

Pos logit contributions
gement+4.601
gment+4.140
llah+4.084
Asset+3.774
Brit+3.568
Neg logit contributions
 Duo-8.005
enf-7.895
agra-7.745
eny-7.431
 Phi-7.410
Tokenreturn
Token ID7783
Feature activation-6.25e-01

Pos logit contributions
gement+5.324
llah+5.014
gment+4.932
Asset+4.740
Brit+4.675
Neg logit contributions
 Duo-5.245
enf-4.862
agra-4.856
 Phi-4.820
eny-4.609
Token address
Token ID2209
Feature activation-4.18e-01

Pos logit contributions
gement+6.579
llah+6.152
gment+6.022
Asset+5.631
 warranty+5.428
Neg logit contributions
 Duo-6.676
agra-6.534
enf-6.500
cum-6.111
eny-6.031
Token]
Token ID60
Feature activation-4.96e-01

Pos logit contributions
gement+3.323
llah+3.094
gment+2.977
Brit+2.855
Asset+2.792
Neg logit contributions
 Duo-5.691
enf-5.554
agra-5.517
eny-5.336
 Phi-5.328
Token |
Token ID930
Feature activation-7.31e-01

Pos logit contributions
gement+3.586
llah+3.363
gment+3.126
Brit+3.121
Asset+3.040
Neg logit contributions
 Duo-6.926
enf-6.840
agra-6.705
 Phi-6.579
eny-6.546
Token +
Token ID1343
Feature activation-1.37e+00

Pos logit contributions
gement+7.295
llah+6.566
gment+6.561
Asset+6.218
Brit+6.100
Neg logit contributions
 Duo-8.055
enf-7.883
agra-7.557
 Phi-7.469
eny-7.421
Token--------------------
Token ID19351
Feature activation-4.56e-01

Pos logit contributions
gement+6.383
llah+5.713
Brit+5.340
Asset+5.322
gment+5.041
Neg logit contributions
 Duo-20.259
 conduc-19.364
 unbeliev-18.806
 Phi-18.532
 skelet-18.384
Token--+
Token ID44785
Feature activation-2.77e-01

Pos logit contributions
gement+4.386
llah+4.192
gment+3.998
Brit+3.966
Asset+3.866
Neg logit contributions
 Duo-5.571
enf-5.224
 Phi-5.092
agra-5.008
 Solo-4.976
Token +
Token ID1343
Feature activation-1.12e+00

Pos logit contributions
gement+3.094
llah+2.953
gment+2.817
Brit+2.801
Asset+2.726
Neg logit contributions
 Duo-2.951
enf-2.880
agra-2.796
eny-2.740
 Phi-2.723
Token--------------------
Token ID19351
Feature activation-4.16e-01

Pos logit contributions
gement+5.998
llah+5.471
Brit+5.198
Asset+5.104
gment+4.876
Neg logit contributions
 Duo-16.767
 conduc-15.776
 Phi-15.422
 unbeliev-15.156
 Solo-15.071
Token--+
Token ID44785
Feature activation-4.44e-01

Pos logit contributions
gement+4.085
llah+3.896
gment+3.735
Brit+3.680
Asset+3.582
Neg logit contributions
 Duo-5.034
enf-4.701
 Phi-4.593
agra-4.519
 Solo-4.483
Token |
Token ID930
Feature activation-3.51e-01

Pos logit contributions
gement+2.904
llah+2.721
gment+2.558
Brit+2.531
Asset+2.480
Neg logit contributions
 Duo-5.145
enf-5.074
agra-4.937
eny-4.848
 Phi-4.842
Token void
Token ID7951
Feature activation-5.61e-01

Pos logit contributions
gement+4.265
gment+4.015
llah+4.012
Asset+3.801
Brit+3.766
Neg logit contributions
 Duo-3.383
enf-3.312
agra-3.196
eny-3.096
 Phi-3.094
Token *
Token ID1635
Feature activation-8.48e-01

Pos logit contributions
gement+7.215
llah+6.975
gment+6.819
Brit+6.734
Asset+6.615
Neg logit contributions
enf-4.550
agra-4.499
 Duo-4.457
eny-4.226
eless-4.009
Tokenarg
Token ID853
Feature activation-6.93e-01

Pos logit contributions
gement+7.414
gment+6.931
llah+6.677
Brit+6.335
Asset+6.178
Neg logit contributions
 Duo-10.506
 Phi-9.955
 conduc-9.600
enf-9.483
 Solo-9.377
Token |
Token ID930
Feature activation-7.81e-01

Pos logit contributions
gement+4.234
llah+3.928
gment+3.706
Brit+3.609
Asset+3.477
Neg logit contributions
 Duo-9.978
enf-9.963
agra-9.833
 conduc-9.602
 Phi-9.566
Token +
Token ID1343
Feature activation-1.36e+00

Pos logit contributions
gement+7.558
gment+6.791
llah+6.722
Asset+6.325
Brit+6.107
Neg logit contributions
enf-8.831
 Duo-8.717
 Phi-8.219
agra-8.189
eny-8.170
Token--------------------
Token ID19351
Feature activation-4.61e-01

Pos logit contributions
gement+6.606
llah+5.897
Brit+5.336
Asset+5.182
gment+5.166
Neg logit contributions
 Duo-19.988
 conduc-19.021
 unbeliev-18.527
 Phi-18.386
 Solo-18.124
Token--+
Token ID44785
Feature activation-4.82e-01

Pos logit contributions
gement+4.455
llah+4.250
gment+4.069
Brit+4.007
Asset+3.894
Neg logit contributions
 Duo-5.614
enf-5.255
 Phi-5.132
agra-5.034
 Solo-5.015
Token |
Token ID930
Feature activation-5.65e-01

Pos logit contributions
gement+3.456
llah+3.256
Brit+3.025
gment+3.021
Asset+2.924
Neg logit contributions
 Duo-6.797
enf-6.670
agra-6.608
 Phi-6.426
eny-6.408
Token [
Token ID685
Feature activation-4.73e-01

Pos logit contributions
gement+4.059
gment+3.617
llah+3.578
Asset+3.245
Brit+3.030
Neg logit contributions
 Duo-8.088
enf-8.003
agra-7.799
eny-7.533
 Phi-7.526
Tokenreturn
Token ID7783
Feature activation-6.80e-01

Pos logit contributions
gement+5.151
llah+4.825
gment+4.742
Asset+4.539
Brit+4.480
Neg logit contributions
 Duo-5.376
 Phi-4.952
enf-4.913
agra-4.900
 Solo-4.639
Token importance
Token ID6817
Feature activation+2.60e-01

Pos logit contributions
 Duo+7.762
 Phi+7.470
enf+7.263
agra+6.991
 Solo+6.936
Neg logit contributions
llah-9.023
gement-8.926
Brit-8.259
Asset-8.238
gment-8.119
Token in
Token ID287
Feature activation+4.68e-01

Pos logit contributions
 Duo+2.717
enf+2.554
 Phi+2.522
agra+2.449
eny+2.418
Neg logit contributions
gement-3.033
llah-2.943
Brit-2.792
gment-2.783
Asset-2.694
Token today
Token ID1909
Feature activation+8.76e-01

Pos logit contributions
 Duo+4.873
 Phi+4.564
enf+4.404
agra+4.296
 Solo+4.261
Neg logit contributions
gement-5.386
llah-5.324
Brit-5.053
gment-4.908
Asset-4.713
Token
Token ID447
Feature activation+8.96e-02

Pos logit contributions
 Duo+6.723
 Phi+6.113
enf+6.016
agra+5.709
 Solo+5.692
Neg logit contributions
gement-11.194
llah-11.031
Brit-10.641
gment-10.400
��-10.161
Token
Token ID247
Feature activation+4.19e-01

Pos logit contributions
 Duo+1.114
enf+1.058
 Phi+1.044
agra+1.033
eny+1.019
Neg logit contributions
gement-0.877
llah-0.837
gment-0.791
Brit-0.783
Asset-0.759
Tokens
Token ID82
Feature activation+8.44e-01

Pos logit contributions
 Duo+4.589
enf+4.301
 Phi+4.231
agra+4.105
eny+4.104
Neg logit contributions
gement-4.553
llah-4.404
Brit-4.190
gment-4.140
Asset-3.987
Token scenario
Token ID8883
Feature activation+7.67e-01

Pos logit contributions
 Duo+8.562
 Phi+8.127
enf+7.716
agra+7.515
 Solo+7.470
Neg logit contributions
gement-8.952
llah-8.761
Brit-8.427
gment-8.076
Asset-7.807
Token?
Token ID30
Feature activation+2.02e-01

Pos logit contributions
 Duo+7.484
enf+6.828
 Phi+6.677
 cf+6.631
agra+6.477
Neg logit contributions
gement-8.610
llah-8.361
Brit-7.958
gment-7.936
Asset-7.526
Token
Token ID198
Feature activation+4.71e-01

Pos logit contributions
 Duo+2.186
enf+2.039
 Phi+2.031
agra+1.972
eny+1.956
Neg logit contributions
gement-2.318
llah-2.223
gment-2.125
Brit-2.106
Asset-2.013
Token
Token ID198
Feature activation+2.68e-01

Pos logit contributions
 Duo+4.161
 Phi+3.853
enf+3.828
cum+3.664
 Solo+3.649
Neg logit contributions
gement-6.219
llah-5.931
��-5.727
gment-5.689
Brit-5.422
TokenToday
Token ID8888
Feature activation+6.29e-01

Pos logit contributions
 Duo+2.695
enf+2.578
 Phi+2.506
eny+2.471
agra+2.470
Neg logit contributions
gement-3.177
llah-3.066
gment-2.915
Brit-2.830
��-2.699
Token (
Token ID357
Feature activation+9.34e-02

Pos logit contributions
gement+0.646
llah+0.623
gment+0.599
Brit+0.593
Asset+0.578
Neg logit contributions
 Duo-0.446
enf-0.418
 Phi-0.406
agra-0.405
eny-0.394
Tokencommit
Token ID41509
Feature activation-1.88e-01

Pos logit contributions
 Duo+0.965
enf+0.915
 Phi+0.893
agra+0.883
eny+0.871
Neg logit contributions
gement-1.118
llah-1.075
gment-1.025
Brit-1.007
Asset-0.976
Token)
Token ID8
Feature activation+7.37e-01

Pos logit contributions
gement+1.875
llah+1.757
gment+1.698
Brit+1.647
Asset+1.590
Neg logit contributions
 Duo-2.298
enf-2.203
agra-2.163
 Phi-2.142
eny-2.112
Token Levi
Token ID20196
Feature activation-2.51e-01

Pos logit contributions
 Duo+8.045
 Phi+7.655
 Solo+7.165
enf+7.073
 cf+6.986
Neg logit contributions
gement-8.038
llah-7.594
gment-7.292
Brit-6.904
Asset-6.688
Token Jones
Token ID5437
Feature activation+8.46e-01

Pos logit contributions
gement+2.926
llah+2.844
gment+2.685
Brit+2.672
Asset+2.599
Neg logit contributions
 Duo-2.564
enf-2.465
agra-2.362
 Phi-2.360
eny-2.310
Token LB
Token ID22199
Feature activation+9.95e-01

Pos logit contributions
 Duo+6.744
 Phi+6.179
 Solo+5.732
enf+5.722
 cf+5.578
Neg logit contributions
gement-11.316
llah-10.720
gment-10.600
Brit-10.415
Asset-10.125
Token 6
Token ID718
Feature activation+2.06e-01

Pos logit contributions
 Duo+7.517
 Phi+6.951
 Solo+6.378
 cf+6.160
cum+5.803
Neg logit contributions
gement-13.583
llah-13.170
gment-12.523
Brit-12.124
��-11.819
Token'
Token ID6
Feature activation+3.19e-01

Pos logit contributions
 Duo+2.294
enf+2.156
 Phi+2.146
agra+2.113
eny+2.083
Neg logit contributions
gement-2.308
llah-2.205
gment-2.100
Brit-2.035
Asset-1.958
Token3
Token ID18
Feature activation+5.62e-01

Pos logit contributions
 Duo+3.305
enf+3.075
 Phi+3.073
agra+3.016
eless+2.946
Neg logit contributions
gement-3.779
llah-3.568
gment-3.436
Brit-3.317
Asset-3.206
Token/
Token ID14
Feature activation+2.61e-01

Pos logit contributions
 Duo+4.539
 Phi+4.267
agra+4.124
enf+4.121
 Solo+3.976
Neg logit contributions
gement-7.703
llah-7.353
gment-7.121
��-6.972
Brit-6.742
Token215
Token ID23349
Feature activation+4.96e-01

Pos logit contributions
 Duo+2.515
enf+2.371
 Phi+2.310
agra+2.278
cum+2.230
Neg logit contributions
gement-3.296
llah-3.163
gment-3.042
Brit-2.929
Asset-2.825
Token*
Token ID9
Feature activation-2.33e-01

Pos logit contributions
 Duo+2.331
enf+2.235
agra+2.124
 Phi+2.102
eny+2.083
Neg logit contributions
gement-4.519
llah-4.368
gment-4.191
Brit-3.999
Asset-3.950
Token Tim
Token ID5045
Feature activation+3.65e-01

Pos logit contributions
gement+2.680
llah+2.583
Brit+2.487
gment+2.466
Asset+2.390
Neg logit contributions
 Duo-2.407
enf-2.329
agra-2.287
eny-2.234
 Phi-2.231
Token Read
Token ID4149
Feature activation+2.31e-01

Pos logit contributions
 Duo+3.705
enf+3.459
 Phi+3.402
eless+3.362
eny+3.311
Neg logit contributions
gement-4.364
llah-4.202
gment-4.052
Brit-3.980
Asset-3.920
Token triumph
Token ID15499
Feature activation+1.08e-01

Pos logit contributions
 Duo+2.447
enf+2.293
 Phi+2.251
agra+2.204
eny+2.186
Neg logit contributions
gement-2.679
llah-2.603
gment-2.475
Brit-2.441
Asset-2.394
Tokens
Token ID82
Feature activation+4.01e-01

Pos logit contributions
 Duo+1.399
enf+1.336
 Phi+1.313
agra+1.301
eny+1.288
Neg logit contributions
gement-1.014
llah-0.969
gment-0.916
Brit-0.899
Asset-0.868
Token as
Token ID355
Feature activation+8.48e-01

Pos logit contributions
 Duo+4.155
enf+3.839
 Phi+3.802
agra+3.672
 Solo+3.645
Neg logit contributions
gement-4.756
llah-4.603
gment-4.397
Brit-4.304
Asset-4.258
Token New
Token ID968
Feature activation-9.65e-02

Pos logit contributions
 Duo+8.656
enf+7.874
 Phi+7.865
 Solo+7.722
agra+7.372
Neg logit contributions
gement-9.318
llah-8.930
gment-8.582
Asset-8.441
Brit-8.300
Token Zealand
Token ID8936
Feature activation+7.56e-01

Pos logit contributions
gement+1.119
llah+1.075
gment+1.025
Brit+1.018
Asset+0.983
Neg logit contributions
 Duo-1.023
enf-0.971
 Phi-0.952
agra-0.949
eny-0.928
Token's
Token ID338
Feature activation+7.59e-01

Pos logit contributions
 Duo+8.160
enf+7.420
 Phi+7.335
agra+7.274
 Solo+7.142
Neg logit contributions
gement-8.056
llah-7.602
gment-7.396
Asset-7.089
Brit-7.089
Token Master
Token ID5599
Feature activation+6.40e-02

Pos logit contributions
 Duo+7.646
 Phi+6.946
enf+6.851
 Solo+6.808
eny+6.593
Neg logit contributions
gement-8.616
llah-8.235
gment-7.947
Asset-7.791
Brit-7.639
TokenChe
Token ID7376
Feature activation-1.27e-01

Pos logit contributions
 Duo+0.569
enf+0.531
 Phi+0.517
agra+0.513
eny+0.500
Neg logit contributions
gement-0.856
llah-0.827
gment-0.795
Brit-0.786
Asset-0.768
Token shot
Token ID2823
Feature activation+1.64e-01

Pos logit contributions
 Duo+4.929
 Phi+4.768
enf+4.557
agra+4.429
eny+4.404
Neg logit contributions
gement-5.295
llah-5.072
Brit-4.781
gment-4.772
Asset-4.575
Token and
Token ID290
Feature activation+3.43e-01

Pos logit contributions
 Duo+1.713
enf+1.598
 Phi+1.588
agra+1.549
eny+1.536
Neg logit contributions
gement-1.943
llah-1.864
gment-1.782
Brit-1.760
Asset-1.722
Token killed
Token ID2923
Feature activation+2.21e-01

Pos logit contributions
 Duo+2.648
enf+2.433
 Phi+2.388
agra+2.299
eny+2.251
Neg logit contributions
gement-4.966
llah-4.807
gment-4.606
Brit-4.544
Asset-4.512
Token in
Token ID287
Feature activation+7.08e-02

Pos logit contributions
 Duo+2.198
 Phi+2.039
enf+2.022
agra+1.960
eny+1.955
Neg logit contributions
gement-2.747
llah-2.661
gment-2.528
Brit-2.497
Asset-2.451
Token 1982
Token ID14489
Feature activation+1.48e-01

Pos logit contributions
 Duo+0.720
enf+0.675
 Phi+0.668
agra+0.657
eny+0.644
Neg logit contributions
gement-0.859
llah-0.828
gment-0.792
Brit-0.785
Asset-0.761
Token as
Token ID355
Feature activation+1.00e+00

Pos logit contributions
 Duo+1.539
enf+1.438
 Phi+1.422
agra+1.402
eny+1.386
Neg logit contributions
gement-1.772
llah-1.709
gment-1.624
Brit-1.608
Asset-1.570
Token officers
Token ID3790
Feature activation+6.44e-01

Pos logit contributions
 Duo+9.059
 Phi+8.681
 Solo+8.157
enf+7.657
 Cambodia+7.649
Neg logit contributions
gement-11.902
llah-11.531
gment-10.982
Brit-10.934
Asset-10.706
Token from
Token ID422
Feature activation+2.44e-01

Pos logit contributions
 Duo+6.793
 Phi+6.367
 Solo+6.162
enf+6.151
eny+6.131
Neg logit contributions
gement-7.217
llah-6.919
Brit-6.562
Asset-6.474
gment-6.459
Token the
Token ID262
Feature activation+2.49e-01

Pos logit contributions
 Duo+2.493
 Phi+2.335
enf+2.307
agra+2.264
eny+2.201
Neg logit contributions
gement-2.960
llah-2.844
gment-2.734
Brit-2.699
Asset-2.595
Token Anaheim
Token ID31100
Feature activation+2.38e-01

Pos logit contributions
 Duo+2.480
 Phi+2.331
enf+2.296
agra+2.249
eny+2.188
Neg logit contributions
gement-3.065
llah-2.959
gment-2.849
Brit-2.802
Asset-2.715
Token and
Token ID290
Feature activation+2.21e-01

Pos logit contributions
 Duo+2.437
enf+2.277
 Phi+2.272
agra+2.215
eny+2.160
Neg logit contributions
gement-2.861
llah-2.760
gment-2.638
Brit-2.621
Asset-2.521
Token
Token ID198
Feature activation+4.97e-01

Pos logit contributions
 Duo+0.321
enf+0.303
 Phi+0.297
agra+0.295
eny+0.289
Neg logit contributions
gement-0.355
llah-0.341
gment-0.326
Brit-0.322
Asset-0.313
Token
Token ID198
Feature activation+3.06e-01

Pos logit contributions
 Duo+4.492
enf+4.217
 Phi+4.111
cum+4.040
agra+4.029
Neg logit contributions
gement-6.409
llah-6.091
gment-5.838
��-5.700
Brit-5.459
TokenHave
Token ID11980
Feature activation+3.32e-01

Pos logit contributions
 Duo+3.064
enf+2.929
agra+2.855
 Phi+2.819
eny+2.793
Neg logit contributions
gement-3.642
llah-3.531
gment-3.321
Brit-3.208
Asset-3.099
Token you
Token ID345
Feature activation+5.71e-01

Pos logit contributions
 Duo+3.502
enf+3.317
 Phi+3.242
agra+3.224
eny+3.178
Neg logit contributions
gement-3.803
llah-3.709
Brit-3.511
gment-3.467
Asset-3.400
Token ever
Token ID1683
Feature activation+7.22e-01

Pos logit contributions
 Duo+6.158
 Phi+5.602
enf+5.587
eny+5.581
agra+5.510
Neg logit contributions
gement-6.202
llah-6.112
Brit-5.720
Asset-5.676
gment-5.646
Token gone
Token ID3750
Feature activation+8.58e-01

Pos logit contributions
 Duo+7.022
enf+6.425
 Phi+6.366
eny+6.270
agra+6.270
Neg logit contributions
gement-8.343
llah-8.182
Asset-7.864
Brit-7.752
gment-7.730
Token to
Token ID284
Feature activation+2.54e-01

Pos logit contributions
 Duo+7.320
 Phi+6.860
 Solo+6.732
agra+6.727
eless+6.632
Neg logit contributions
gement-10.610
llah-10.341
Asset-9.827
gment-9.791
Brit-9.599
Token see
Token ID766
Feature activation+4.31e-02

Pos logit contributions
 Duo+2.761
 Phi+2.579
enf+2.561
agra+2.533
eny+2.430
Neg logit contributions
gement-2.909
llah-2.795
gment-2.676
Brit-2.627
Asset-2.567
Token a
Token ID257
Feature activation+1.36e-01

Pos logit contributions
 Duo+0.474
enf+0.448
 Phi+0.440
agra+0.437
eny+0.427
Neg logit contributions
gement-0.487
llah-0.469
gment-0.446
Brit-0.440
Asset-0.427
Token movie
Token ID3807
Feature activation+2.45e-01

Pos logit contributions
 Duo+1.628
enf+1.530
 Phi+1.522
agra+1.502
eny+1.462
Neg logit contributions
gement-1.426
llah-1.369
gment-1.293
Brit-1.271
Asset-1.241
Token that
Token ID326
Feature activation+4.54e-01

Pos logit contributions
 Duo+2.494
enf+2.310
 Phi+2.294
agra+2.252
eny+2.189
Neg logit contributions
gement-2.971
llah-2.880
gment-2.721
Brit-2.716
Asset-2.652
Token
Token ID784
Feature activation+2.93e-01

Pos logit contributions
gement+2.138
llah+2.020
gment+1.944
Brit+1.921
Asset+1.833
Neg logit contributions
 Duo-2.164
enf-2.050
 Phi-2.016
agra-2.002
eny-1.954
Token they
Token ID484
Feature activation+3.55e-01

Pos logit contributions
 Duo+3.033
enf+2.830
 Phi+2.816
agra+2.745
eny+2.696
Neg logit contributions
gement-3.444
llah-3.340
gment-3.172
Brit-3.089
Asset-3.025
Token said
Token ID531
Feature activation+4.10e-01

Pos logit contributions
 Duo+3.796
enf+3.558
 Phi+3.452
agra+3.440
eny+3.431
Neg logit contributions
gement-4.039
llah-3.922
gment-3.728
Asset-3.675
Brit-3.672
Token models
Token ID4981
Feature activation+2.04e-01

Pos logit contributions
 Duo+3.996
 Phi+3.699
enf+3.641
agra+3.527
eless+3.506
Neg logit contributions
gement-5.013
llah-4.910
gment-4.664
Brit-4.496
Asset-4.436
Token on
Token ID319
Feature activation+1.20e-01

Pos logit contributions
 Duo+2.254
enf+2.120
 Phi+2.087
agra+2.047
eny+2.030
Neg logit contributions
gement-2.298
llah-2.218
gment-2.096
Brit-2.073
Asset-2.030
Token Monday
Token ID3321
Feature activation+2.89e-01

Pos logit contributions
 Duo+1.192
enf+1.113
 Phi+1.101
agra+1.084
eny+1.060
Neg logit contributions
gement-1.486
llah-1.434
gment-1.372
Brit-1.356
Asset-1.319
Token afternoon
Token ID6672
Feature activation+1.24e-01

Pos logit contributions
 Duo+3.162
enf+2.978
 Phi+2.906
agra+2.835
eny+2.817
Neg logit contributions
gement-3.235
llah-3.164
Brit-2.973
gment-2.970
Asset-2.902
Token shifted
Token ID14869
Feature activation-3.12e-01

Pos logit contributions
 Duo+1.387
enf+1.310
 Phi+1.287
agra+1.268
eny+1.250
Neg logit contributions
gement-1.373
llah-1.326
gment-1.256
Brit-1.243
Asset-1.208
Token in
Token ID287
Feature activation+3.50e-01

Pos logit contributions
gement+3.363
llah+3.146
Brit+3.043
gment+3.020
Asset+2.905
Neg logit contributions
 Duo-3.506
enf-3.385
agra-3.295
eny-3.224
 Phi-3.221
Token response
Token ID2882
Feature activation-4.51e-02

Pos logit contributions
 Duo+3.534
enf+3.296
 Phi+3.246
agra+3.222
eny+3.170
Neg logit contributions
gement-4.133
llah-4.080
gment-3.844
Brit-3.765
Asset-3.704
Token to
Token ID284
Feature activation-2.04e-01

Pos logit contributions
gement+0.517
llah+0.496
gment+0.474
Brit+0.468
Asset+0.453
Neg logit contributions
 Duo-0.489
enf-0.462
 Phi-0.453
agra-0.450
eny-0.441
Token According
Token ID4784
Feature activation+4.23e-01

Pos logit contributions
 Duo+8.514
 Phi+8.130
 Solo+7.624
 cf+7.524
 Boone+7.303
Neg logit contributions
gement-10.446
gment-9.662
llah-9.558
Brit-8.821
��-8.785
Token to
Token ID284
Feature activation+6.08e-01

Pos logit contributions
 Duo+3.814
 Phi+3.593
enf+3.439
agra+3.354
eny+3.345
Neg logit contributions
gement-5.475
llah-5.247
gment-5.085
Brit-4.983
Asset-4.891
Token num
Token ID997
Feature activation-1.64e-01

Pos logit contributions
 Duo+6.124
 Phi+5.859
enf+5.481
agra+5.468
 Solo+5.444
Neg logit contributions
gement-7.001
llah-6.659
gment-6.426
Brit-6.387
Asset-6.111
Tokenism
Token ID1042
Feature activation+2.57e-01

Pos logit contributions
gement+1.068
llah+0.979
gment+0.919
Brit+0.896
Asset+0.822
Neg logit contributions
 Duo-2.583
enf-2.465
 Phi-2.447
agra-2.410
eny-2.377
Tokenat
Token ID265
Feature activation+3.98e-01

Pos logit contributions
 Duo+2.763
enf+2.676
agra+2.604
eny+2.598
 Phi+2.580
Neg logit contributions
gement-2.888
llah-2.758
gment-2.629
Brit-2.609
Asset-2.552
Tokenist
Token ID396
Feature activation+2.72e-01

Pos logit contributions
 Duo+4.752
enf+4.608
 Phi+4.543
agra+4.462
eny+4.455
Neg logit contributions
gement-3.895
llah-3.775
Brit-3.514
gment-3.483
Asset-3.388
Token Neil
Token ID15929
Feature activation-5.20e-02

Pos logit contributions
 Duo+2.890
 Phi+2.718
enf+2.711
agra+2.637
eny+2.598
Neg logit contributions
gement-3.157
llah-3.016
gment-2.892
Brit-2.844
Asset-2.757
Token Car
Token ID1879
Feature activation-1.12e-01

Pos logit contributions
gement+0.609
llah+0.586
gment+0.559
Brit+0.554
Asset+0.537
Neg logit contributions
 Duo-0.548
enf-0.519
 Phi-0.507
agra-0.506
eny-0.495
Tokenother
Token ID847
Feature activation+1.27e-01

Pos logit contributions
gement+1.357
llah+1.308
gment+1.250
Brit+1.235
Asset+1.192
Neg logit contributions
 Duo-1.143
enf-1.062
 Phi-1.052
agra-1.029
eny-1.011
Tokens
Token ID82
Feature activation+4.85e-01

Pos logit contributions
 Duo+1.674
enf+1.597
 Phi+1.578
agra+1.557
eny+1.552
Neg logit contributions
gement-1.144
llah-1.086
gment-1.024
Brit-1.009
Asset-0.973
Token,
Token ID11
Feature activation+1.42e-01

Pos logit contributions
 Duo+4.693
enf+4.451
 Phi+4.420
eny+4.302
agra+4.200
Neg logit contributions
gement-5.894
llah-5.641
gment-5.397
Brit-5.368
Asset-5.205
Token
Token ID564
Feature activation-2.67e-01

Pos logit contributions
gement+4.765
llah+4.467
gment+4.374
Brit+4.328
Asset+4.231
Neg logit contributions
 Duo-4.518
enf-4.502
agra-4.435
 Phi-4.291
eny-4.261
Token
Token ID246
Feature activation-1.16e-01

Pos logit contributions
gement+2.252
llah+2.065
gment+2.000
Brit+1.922
Asset+1.818
Neg logit contributions
 Duo-3.668
enf-3.563
agra-3.488
 Phi-3.478
eny-3.401
TokenVideo
Token ID10798
Feature activation+2.38e-01

Pos logit contributions
gement+1.361
llah+1.301
gment+1.253
Brit+1.247
Asset+1.215
Neg logit contributions
 Duo-1.218
enf-1.155
 Phi-1.135
agra-1.128
eny-1.098
Token Add
Token ID3060
Feature activation+2.55e-02

Pos logit contributions
 Duo+2.603
enf+2.421
 Phi+2.387
agra+2.349
eny+2.305
Neg logit contributions
gement-2.693
llah-2.619
gment-2.473
Brit-2.437
Asset-2.313
Tokenons
Token ID684
Feature activation+2.20e-01

Pos logit contributions
 Duo+0.137
enf+0.122
 Phi+0.117
agra+0.115
eny+0.110
Neg logit contributions
gement-0.431
llah-0.421
gment-0.407
Brit-0.404
Asset-0.396
Token
Token ID447
Feature activation+8.13e-02

Pos logit contributions
 Duo+2.263
enf+2.092
 Phi+2.078
agra+2.016
eny+2.010
Neg logit contributions
gement-2.617
llah-2.552
gment-2.415
Brit-2.413
Asset-2.306
Token
Token ID247
Feature activation-1.35e-01

Pos logit contributions
 Duo+0.884
enf+0.832
 Phi+0.818
agra+0.810
eny+0.798
Neg logit contributions
gement-0.923
llah-0.889
gment-0.846
Brit-0.841
Asset-0.815
Token ->
Token ID4613
Feature activation-2.77e-01

Pos logit contributions
gement+1.529
llah+1.456
gment+1.398
Brit+1.369
Asset+1.337
Neg logit contributions
 Duo-1.466
enf-1.404
agra-1.371
 Phi-1.363
eny-1.332
Token Netflix
Token ID12074
Feature activation+3.99e-01

Pos logit contributions
gement+3.223
llah+3.058
gment+2.964
Brit+2.921
Asset+2.861
Neg logit contributions
 Duo-2.835
enf-2.771
agra-2.706
 Phi-2.645
eny-2.612
Token.
Token ID13
Feature activation+4.33e-01

Pos logit contributions
 Duo+4.286
enf+3.928
 Phi+3.922
agra+3.811
 Solo+3.803
Neg logit contributions
gement-4.550
llah-4.419
gment-4.169
Brit-4.091
Asset-3.875
Token Finally
Token ID9461
Feature activation+6.86e-01

Pos logit contributions
 Duo+4.558
enf+4.207
 Phi+4.184
eny+4.070
agra+4.066
Neg logit contributions
gement-4.924
llah-4.777
Brit-4.520
gment-4.505
Asset-4.362
Token influence
Token ID4588
Feature activation+1.64e-02

Pos logit contributions
gement+1.294
llah+1.240
gment+1.180
Brit+1.171
Asset+1.133
Neg logit contributions
 Duo-1.315
enf-1.262
 Phi-1.231
agra-1.226
eny-1.198
Token that
Token ID326
Feature activation+1.65e-01

Pos logit contributions
 Duo+0.189
enf+0.179
 Phi+0.176
agra+0.174
eny+0.171
Neg logit contributions
gement-0.178
llah-0.170
gment-0.162
Brit-0.160
Asset-0.155
Token has
Token ID468
Feature activation+2.68e-01

Pos logit contributions
 Duo+1.766
enf+1.650
 Phi+1.630
agra+1.605
eny+1.579
Neg logit contributions
gement-1.909
llah-1.831
gment-1.756
Brit-1.719
Asset-1.681
Token crept
Token ID47092
Feature activation+1.84e-01

Pos logit contributions
 Duo+2.802
enf+2.602
 Phi+2.582
agra+2.520
eny+2.495
Neg logit contributions
gement-3.114
llah-3.020
gment-2.866
Brit-2.833
Asset-2.796
Token into
Token ID656
Feature activation-3.43e-01

Pos logit contributions
 Duo+1.851
enf+1.725
 Phi+1.724
agra+1.677
eny+1.663
Neg logit contributions
gement-2.244
llah-2.170
gment-2.065
Brit-2.040
Asset-2.003
Token me
Token ID502
Feature activation+2.11e-01

Pos logit contributions
gement+3.803
llah+3.651
Brit+3.472
gment+3.462
Asset+3.322
Neg logit contributions
 Duo-3.658
enf-3.575
agra-3.468
 Phi-3.395
eny-3.374
Token in
Token ID287
Feature activation-3.58e-02

Pos logit contributions
 Duo+2.308
enf+2.175
 Phi+2.136
agra+2.122
eny+2.098
Neg logit contributions
gement-2.384
llah-2.265
gment-2.183
Brit-2.146
Asset-2.100
Token my
Token ID616
Feature activation+3.83e-01

Pos logit contributions
gement+0.406
llah+0.390
gment+0.372
Brit+0.368
Asset+0.356
Neg logit contributions
 Duo-0.390
enf-0.370
 Phi-0.362
agra-0.360
eny-0.353
Token own
Token ID898
Feature activation+1.37e-01

Pos logit contributions
 Duo+3.741
 Phi+3.445
enf+3.444
eny+3.305
agra+3.296
Neg logit contributions
gement-4.701
llah-4.462
gment-4.374
Brit-4.285
Asset-4.216
Token weird
Token ID7650
Feature activation+2.47e-01

Pos logit contributions
 Duo+1.349
enf+1.264
 Phi+1.241
agra+1.223
eny+1.205
Neg logit contributions
gement-1.693
llah-1.625
gment-1.565
Brit-1.546
Asset-1.506
Token way
Token ID835
Feature activation+1.25e-01

Pos logit contributions
 Duo+2.453
enf+2.314
 Phi+2.264
agra+2.244
eny+2.224
Neg logit contributions
gement-3.003
llah-2.900
gment-2.766
Brit-2.730
Asset-2.682
Token is
Token ID318
Feature activation-1.13e-01

Pos logit contributions
gement+2.805
llah+2.663
gment+2.547
Brit+2.493
Asset+2.404
Neg logit contributions
 Duo-2.919
enf-2.786
 Phi-2.736
agra-2.725
eny-2.664
Token it
Token ID340
Feature activation-2.19e-01

Pos logit contributions
gement+1.239
llah+1.192
gment+1.135
Brit+1.116
Asset+1.082
Neg logit contributions
 Duo-1.275
enf-1.217
agra-1.188
 Phi-1.184
eny-1.165
Token that
Token ID326
Feature activation-2.60e-01

Pos logit contributions
gement+2.318
llah+2.236
gment+2.117
Brit+2.082
Asset+2.012
Neg logit contributions
 Duo-2.524
enf-2.409
agra-2.355
 Phi-2.344
eny-2.298
Token GOP
Token ID6796
Feature activation-4.48e-01

Pos logit contributions
gement+2.881
llah+2.748
gment+2.642
Brit+2.612
Asset+2.523
Neg logit contributions
 Duo-2.823
enf-2.691
agra-2.616
 Phi-2.605
eny-2.564
Token officials
Token ID2828
Feature activation+3.61e-01

Pos logit contributions
gement+4.036
llah+3.792
gment+3.655
Brit+3.614
Asset+3.442
Neg logit contributions
 Duo-5.667
enf-5.391
 Phi-5.328
agra-5.285
cum-5.106
Token are
Token ID389
Feature activation+3.43e-01

Pos logit contributions
 Duo+3.827
enf+3.580
 Phi+3.526
eny+3.489
agra+3.459
Neg logit contributions
gement-4.100
llah-3.988
gment-3.778
Brit-3.771
Asset-3.675
Token trying
Token ID2111
Feature activation-3.53e-01

Pos logit contributions
 Duo+3.404
enf+3.205
 Phi+3.150
agra+3.137
eny+3.072
Neg logit contributions
gement-4.149
llah-4.060
gment-3.819
Brit-3.805
Asset-3.701
Token to
Token ID284
Feature activation-3.43e-01

Pos logit contributions
gement+3.732
llah+3.566
gment+3.404
Brit+3.356
Asset+3.224
Neg logit contributions
 Duo-4.024
enf-3.822
 Phi-3.740
agra-3.669
eny-3.641
Token ram
Token ID15770
Feature activation-2.23e-01

Pos logit contributions
gement+3.719
llah+3.579
gment+3.480
Brit+3.338
Asset+3.222
Neg logit contributions
 Duo-3.701
enf-3.563
 Phi-3.463
agra-3.454
eny-3.335
Token through
Token ID832
Feature activation-3.04e-01

Pos logit contributions
gement+2.631
llah+2.531
gment+2.426
Brit+2.372
Asset+2.291
Neg logit contributions
 Duo-2.339
enf-2.196
 Phi-2.153
agra-2.125
eny-2.064
Token a
Token ID257
Feature activation-4.63e-01

Pos logit contributions
gement+3.217
llah+3.048
gment+2.929
Brit+2.865
Asset+2.782
Neg logit contributions
 Duo-3.456
enf-3.299
 Phi-3.195
agra-3.173
eny-3.102
Token B
Token ID347
Feature activation-3.86e-01

Pos logit contributions
 Duo+1.838
enf+1.735
 Phi+1.707
agra+1.677
eny+1.652
Neg logit contributions
gement-2.042
llah-1.967
gment-1.868
Brit-1.852
Asset-1.786
TokenAC
Token ID2246
Feature activation-6.94e-01

Pos logit contributions
gement+4.339
llah+4.188
gment+4.058
Brit+3.966
Asset+3.830
Neg logit contributions
 Duo-4.104
enf-3.881
agra-3.781
 Phi-3.682
eless-3.625
Token limits
Token ID7095
Feature activation-3.02e-01

Pos logit contributions
gement+6.604
llah+6.518
gment+6.137
Brit+5.965
Asset+5.822
Neg logit contributions
 Duo-7.818
agra-7.528
enf-7.405
 Phi-7.211
eny-7.022
Token even
Token ID772
Feature activation+2.82e-02

Pos logit contributions
gement+3.216
llah+3.080
gment+2.956
Brit+2.892
Asset+2.784
Neg logit contributions
 Duo-3.417
enf-3.239
agra-3.186
 Phi-3.159
eny-3.072
Token further
Token ID2252
Feature activation-9.56e-02

Pos logit contributions
 Duo+0.357
enf+0.341
 Phi+0.335
agra+0.333
eny+0.328
Neg logit contributions
gement-0.271
llah-0.259
gment-0.244
Brit-0.240
Asset-0.232
Token,
Token ID11
Feature activation-5.87e-02

Pos logit contributions
gement+1.062
llah+1.018
gment+0.972
Brit+0.961
Asset+0.929
Neg logit contributions
 Duo-1.063
enf-1.008
 Phi-0.988
agra-0.985
eny-0.961
Token to
Token ID284
Feature activation+1.03e-01

Pos logit contributions
gement+0.680
llah+0.653
gment+0.625
Brit+0.617
Asset+0.599
Neg logit contributions
 Duo-0.625
enf-0.593
 Phi-0.578
agra-0.576
eny-0.565
Token 0
Token ID657
Feature activation+6.03e-02

Pos logit contributions
 Duo+1.098
enf+1.033
 Phi+1.018
agra+1.006
eny+0.987
Neg logit contributions
gement-1.207
llah-1.160
gment-1.106
Brit-1.097
Asset-1.064
Token.
Token ID13
Feature activation+2.06e-01

Pos logit contributions
 Duo+0.652
enf+0.617
 Phi+0.604
agra+0.599
eny+0.589
Neg logit contributions
gement-0.694
llah-0.664
gment-0.635
Brit-0.627
Asset-0.609
Token05
Token ID2713
Feature activation+1.14e-01

Pos logit contributions
 Duo+2.109
 Phi+1.957
enf+1.951
agra+1.891
eny+1.858
Neg logit contributions
gement-2.527
llah-2.398
gment-2.309
Brit-2.251
Asset-2.174
Token%,
Token ID7441
Feature activation+7.89e-02

Pos logit contributions
 Duo+1.218
enf+1.152
 Phi+1.137
agra+1.120
eny+1.103
Neg logit contributions
gement-1.328
llah-1.270
gment-1.214
Brit-1.204
Asset-1.169
Tokens
Token ID82
Feature activation+3.92e-01

Pos logit contributions
gement+1.333
llah+1.280
gment+1.221
Brit+1.207
Asset+1.163
Neg logit contributions
 Duo-1.239
enf-1.169
 Phi-1.152
agra-1.142
eny-1.112
Token own
Token ID898
Feature activation+4.68e-01

Pos logit contributions
 Duo+4.125
enf+3.826
 Phi+3.821
agra+3.758
eny+3.739
Neg logit contributions
gement-4.453
llah-4.327
gment-4.110
Brit-4.046
Asset-4.006
Token clan
Token ID19065
Feature activation-9.47e-02

Pos logit contributions
 Duo+5.010
 Phi+4.653
enf+4.577
agra+4.544
eny+4.491
Neg logit contributions
gement-5.200
llah-5.052
gment-4.801
Brit-4.730
Asset-4.716
Token or
Token ID393
Feature activation-4.79e-02

Pos logit contributions
gement+1.150
llah+1.104
gment+1.058
Brit+1.045
Asset+1.010
Neg logit contributions
 Duo-0.962
enf-0.907
 Phi-0.884
agra-0.879
eny-0.857
Token tribe
Token ID14893
Feature activation-1.16e-01

Pos logit contributions
gement+0.523
llah+0.501
gment+0.476
Brit+0.470
Asset+0.454
Neg logit contributions
 Duo-0.544
enf-0.518
 Phi-0.506
agra-0.504
eny-0.494
Token.
Token ID13
Feature activation-3.52e-02

Pos logit contributions
gement+1.298
llah+1.240
gment+1.185
Brit+1.169
Asset+1.125
Neg logit contributions
 Duo-1.291
enf-1.225
agra-1.195
 Phi-1.195
eny-1.165
Token For
Token ID1114
Feature activation-5.27e-03

Pos logit contributions
gement+0.377
llah+0.361
gment+0.343
Brit+0.340
Asset+0.328
Neg logit contributions
 Duo-0.406
enf-0.387
 Phi-0.379
agra-0.378
eny-0.371
Token example
Token ID1672
Feature activation-1.34e-01

Pos logit contributions
gement+0.052
llah+0.050
gment+0.047
Brit+0.047
Asset+0.045
Neg logit contributions
 Duo-0.065
enf-0.062
 Phi-0.061
agra-0.061
eny-0.060
Token,
Token ID11
Feature activation-8.17e-02

Pos logit contributions
gement+1.483
llah+1.423
gment+1.351
Brit+1.342
Asset+1.288
Neg logit contributions
 Duo-1.490
enf-1.419
agra-1.383
 Phi-1.382
eny-1.356
Token El
Token ID2574
Feature activation-8.04e-02

Pos logit contributions
gement+0.936
llah+0.899
gment+0.857
Brit+0.849
Asset+0.819
Neg logit contributions
 Duo-0.877
enf-0.837
 Phi-0.814
agra-0.812
eny-0.797
Tokenj
Token ID73
Feature activation+4.39e-02

Pos logit contributions
gement+0.978
llah+0.944
gment+0.900
Brit+0.889
Asset+0.858
Neg logit contributions
 Duo-0.814
enf-0.766
 Phi-0.749
agra-0.743
eny-0.726
Token.
Token ID13
Feature activation+8.49e-01

Pos logit contributions
 Duo+2.234
enf+2.126
 Phi+2.066
agra+2.023
eny+2.019
Neg logit contributions
gement-2.779
llah-2.638
gment-2.538
Brit-2.480
��-2.407
Token158
Token ID21273
Feature activation+4.46e-01

Pos logit contributions
cum+4.494
enf+4.444
 Duo+4.347
eny+4.064
if+3.955
Neg logit contributions
gement-13.188
llah-12.562
Brit-12.232
gment-12.044
��-11.774
Token 10
Token ID838
Feature activation+1.80e-01

Pos logit contributions
 Duo+4.693
 Phi+4.438
enf+4.240
 cf+4.163
 Solo+4.150
Neg logit contributions
gement-5.296
llah-4.946
gment-4.858
Brit-4.643
Asset-4.555
Token95
Token ID3865
Feature activation+1.32e-02

Pos logit contributions
 Duo+1.828
enf+1.724
 Phi+1.675
agra+1.659
eny+1.626
Neg logit contributions
gement-2.202
llah-2.106
gment-2.020
Brit-1.957
Asset-1.902
Token228
Token ID23815
Feature activation+9.53e-02

Pos logit contributions
 Duo+0.136
enf+0.128
 Phi+0.125
agra+0.124
eny+0.122
Neg logit contributions
gement-0.159
llah-0.153
gment-0.146
Brit-0.144
Asset-0.140
Token x
Token ID2124
Feature activation+4.55e-02

Pos logit contributions
 Duo+0.669
enf+0.603
 Phi+0.596
agra+0.571
eny+0.561
Neg logit contributions
gement-1.466
llah-1.421
gment-1.372
Brit-1.361
Asset-1.333
Token.
Token ID13
Feature activation-1.29e-01

Pos logit contributions
 Duo+0.490
enf+0.464
 Phi+0.455
agra+0.450
eny+0.443
Neg logit contributions
gement-0.525
llah-0.503
gment-0.480
Brit-0.473
Asset-0.459
Tokenx
Token ID87
Feature activation-2.64e-01

Pos logit contributions
gement+1.465
llah+1.409
gment+1.352
Brit+1.333
Asset+1.293
Neg logit contributions
 Duo-1.413
enf-1.320
 Phi-1.309
agra-1.299
eny-1.261
Token.
Token ID13
Feature activation-1.12e-01

Pos logit contributions
gement+2.620
llah+2.541
gment+2.405
Brit+2.399
Asset+2.313
Neg logit contributions
 Duo-3.165
enf-2.995
agra-2.963
 Phi-2.944
eny-2.879
Tokenx
Token ID87
Feature activation-3.74e-02

Pos logit contributions
gement+1.275
llah+1.226
gment+1.175
Brit+1.158
Asset+1.124
Neg logit contributions
 Duo-1.214
enf-1.133
 Phi-1.125
agra-1.115
eny-1.081
Token.
Token ID13
Feature activation+3.66e-01

Pos logit contributions
gement+0.416
llah+0.400
gment+0.381
Brit+0.377
Asset+0.365
Neg logit contributions
 Duo-0.416
enf-0.392
 Phi-0.386
agra-0.385
eny-0.376
Token investigation
Token ID3645
Feature activation+1.37e-01

Pos logit contributions
gement+4.602
llah+4.394
gment+4.261
Asset+4.161
Brit+4.136
Neg logit contributions
 Duo-3.244
enf-3.138
agra-3.103
eny-3.004
 Phi-2.970
Token found
Token ID1043
Feature activation+2.99e-01

Pos logit contributions
 Duo+1.436
enf+1.351
 Phi+1.329
agra+1.309
eny+1.294
Neg logit contributions
gement-1.607
llah-1.548
gment-1.467
Brit-1.463
Asset-1.424
Token that
Token ID326
Feature activation+2.99e-01

Pos logit contributions
 Duo+3.065
 Phi+2.848
enf+2.843
agra+2.714
eny+2.704
Neg logit contributions
gement-3.586
llah-3.389
gment-3.307
Brit-3.300
Asset-3.125
Token,
Token ID11
Feature activation-1.77e-01

Pos logit contributions
 Duo+3.204
enf+2.984
 Phi+2.983
agra+2.842
eny+2.842
Neg logit contributions
gement-3.412
llah-3.255
gment-3.154
Brit-3.141
Asset-2.965
Token since
Token ID1201
Feature activation-1.94e-02

Pos logit contributions
gement+1.957
llah+1.873
gment+1.798
Brit+1.762
Asset+1.702
Neg logit contributions
 Duo-1.953
enf-1.873
agra-1.821
 Phi-1.815
eny-1.774
Token the
Token ID262
Feature activation-2.28e-01

Pos logit contributions
gement+0.219
llah+0.210
gment+0.200
Brit+0.197
Asset+0.191
Neg logit contributions
 Duo-0.213
enf-0.202
 Phi-0.198
agra-0.197
eny-0.193
Token mid
Token ID3095
Feature activation-8.44e-01

Pos logit contributions
gement+2.607
llah+2.525
gment+2.382
Brit+2.330
Asset+2.282
Neg logit contributions
 Duo-2.424
enf-2.333
agra-2.278
 Phi-2.230
eny-2.207
Token-
Token ID12
Feature activation-7.92e-01

Pos logit contributions
gement+6.923
llah+6.407
gment+6.222
Brit+6.027
Asset+5.886
Neg logit contributions
 Duo-10.591
agra-9.842
enf-9.820
 Phi-9.809
eny-9.483
Token1990
Token ID19891
Feature activation-1.26e-01

Pos logit contributions
gement+6.444
Brit+6.089
llah+5.835
Asset+5.823
gment+5.705
Neg logit contributions
 Duo-10.161
 Phi-9.539
enf-9.403
agra-9.245
eny-9.055
Tokens
Token ID82
Feature activation+2.36e-01

Pos logit contributions
gement+1.041
llah+0.983
gment+0.918
Brit+0.897
Asset+0.863
Neg logit contributions
 Duo-1.757
enf-1.686
 Phi-1.663
agra-1.656
eny-1.625
Token,
Token ID11
Feature activation+4.93e-02

Pos logit contributions
 Duo+2.395
enf+2.256
 Phi+2.215
agra+2.161
eny+2.143
Neg logit contributions
gement-2.828
llah-2.702
gment-2.604
Brit-2.584
Asset-2.469
Token people
Token ID661
Feature activation+3.20e-01

Pos logit contributions
gement+0.853
llah+0.819
gment+0.773
Brit+0.767
Asset+0.738
Neg logit contributions
 Duo-0.971
enf-0.926
 Phi-0.909
agra-0.905
eny-0.885
Token.
Token ID13
Feature activation+3.67e-01

Pos logit contributions
 Duo+3.289
enf+3.056
 Phi+2.992
eny+2.965
agra+2.960
Neg logit contributions
gement-3.745
llah-3.616
gment-3.462
Brit-3.413
Asset-3.348
Token Its
Token ID6363
Feature activation+1.13e-01

Pos logit contributions
 Duo+4.031
 Phi+3.699
enf+3.639
 Solo+3.582
agra+3.521
Neg logit contributions
gement-4.243
llah-4.044
gment-3.933
Brit-3.658
��-3.635
Token exports
Token ID15319
Feature activation+4.77e-02

Pos logit contributions
 Duo+1.255
enf+1.183
 Phi+1.167
agra+1.152
eny+1.131
Neg logit contributions
gement-1.253
llah-1.207
gment-1.148
Brit-1.135
Asset-1.101
Token to
Token ID284
Feature activation+1.18e-01

Pos logit contributions
 Duo+0.518
enf+0.489
 Phi+0.480
agra+0.476
eny+0.468
Neg logit contributions
gement-0.546
llah-0.524
gment-0.500
Brit-0.494
Asset-0.480
Token Saudi
Token ID7420
Feature activation-3.17e-02

Pos logit contributions
 Duo+1.172
enf+1.094
 Phi+1.082
agra+1.068
eny+1.041
Neg logit contributions
gement-1.444
llah-1.395
gment-1.338
Brit-1.318
Asset-1.287
Token Arabia
Token ID9671
Feature activation-3.10e-01

Pos logit contributions
gement+0.438
llah+0.424
gment+0.408
Brit+0.405
Asset+0.394
Neg logit contributions
 Duo-0.268
enf-0.250
 Phi-0.243
agra-0.241
eny-0.235
Token total
Token ID2472
Feature activation+3.43e-01

Pos logit contributions
gement+3.285
llah+3.141
gment+3.009
Brit+2.943
Asset+2.801
Neg logit contributions
 Duo-3.503
enf-3.372
 Phi-3.277
agra-3.254
eny-3.176
Token $
Token ID720
Feature activation-3.87e-01

Pos logit contributions
 Duo+3.405
enf+3.140
 Phi+3.131
agra+3.046
eny+2.998
Neg logit contributions
gement-4.122
llah-4.031
gment-3.852
Brit-3.787
Asset-3.700
Token1
Token ID16
Feature activation-2.44e-01

Pos logit contributions
gement+4.110
llah+4.007
Brit+3.890
gment+3.765
Asset+3.746
Neg logit contributions
 Duo-4.245
enf-4.043
agra-3.957
 Phi-3.946
eny-3.921
Token.
Token ID13
Feature activation-7.99e-02

Pos logit contributions
gement+2.658
llah+2.529
gment+2.423
Brit+2.402
Asset+2.307
Neg logit contributions
 Duo-2.752
enf-2.625
 Phi-2.562
agra-2.533
eny-2.493
Token T
Token ID309
Feature activation+9.60e-02

Pos logit contributions
 Duo+1.003
enf+0.943
 Phi+0.934
agra+0.916
eny+0.900
Neg logit contributions
gement-1.098
llah-1.051
gment-1.006
Brit-0.987
Asset-0.962
Token ER
Token ID13793
Feature activation-1.28e-01

Pos logit contributions
 Duo+0.914
enf+0.854
 Phi+0.843
agra+0.829
eny+0.809
Neg logit contributions
gement-1.230
llah-1.186
gment-1.140
Brit-1.121
Asset-1.092
TokenTI
Token ID25621
Feature activation-6.85e-02

Pos logit contributions
gement+1.675
llah+1.615
gment+1.553
Brit+1.551
Asset+1.504
Neg logit contributions
 Duo-1.158
enf-1.097
agra-1.061
 Phi-1.059
eny-1.036
Token
Token ID127
Feature activation+2.18e-01

Pos logit contributions
gement+0.777
llah+0.747
gment+0.713
Brit+0.708
Asset+0.684
Neg logit contributions
 Duo-0.745
enf-0.707
agra-0.687
 Phi-0.687
eny-0.675
Token
Token ID228
Feature activation-8.27e-02

Pos logit contributions
 Duo+1.964
enf+1.812
 Phi+1.811
agra+1.737
eny+1.735
Neg logit contributions
gement-2.900
llah-2.769
gment-2.688
Brit-2.627
Asset-2.587
Token P
Token ID350
Feature activation-9.70e-01

Pos logit contributions
gement+1.000
llah+0.964
gment+0.922
Brit+0.917
Asset+0.887
Neg logit contributions
 Duo-0.833
enf-0.791
agra-0.768
 Phi-0.765
eny-0.752
Token ART
Token ID24278
Feature activation-5.54e-02

Pos logit contributions
gement+8.175
llah+7.799
Brit+7.687
gment+7.268
Asset+7.256
Neg logit contributions
enf-10.634
 Duo-10.335
agra-9.981
eny-9.920
eless-9.514
TokenIS
Token ID1797
Feature activation+9.84e-02

Pos logit contributions
gement+0.727
llah+0.701
gment+0.674
Brit+0.669
Asset+0.651
Neg logit contributions
 Duo-0.506
enf-0.474
 Phi-0.461
agra-0.458
eny-0.448
Token.
Token ID764
Feature activation-1.46e-01

Pos logit contributions
 Duo+1.125
enf+1.063
 Phi+1.051
agra+1.036
eny+1.017
Neg logit contributions
gement-1.072
llah-1.026
gment-0.976
Brit-0.961
Asset-0.930
Token
Token ID198
Feature activation-1.06e-01

Pos logit contributions
gement+1.543
llah+1.479
gment+1.406
Brit+1.392
Asset+1.339
Neg logit contributions
 Duo-1.704
enf-1.623
 Phi-1.576
agra-1.576
eny-1.555
Token
Token ID198
Feature activation-1.25e-01

Pos logit contributions
gement+1.099
llah+1.056
gment+1.003
Brit+0.999
Asset+0.962
Neg logit contributions
 Duo-1.246
enf-1.189
agra-1.161
 Phi-1.160
eny-1.139
Token:
Token ID25
Feature activation-5.60e-01

Pos logit contributions
gement+4.374
llah+4.221
Brit+4.021
gment+3.978
Asset+3.818
Neg logit contributions
 Duo-4.777
enf-4.574
agra-4.566
 Phi-4.409
eny-4.353
Token Frank
Token ID5278
Feature activation-3.62e-01

Pos logit contributions
gement+5.763
llah+5.506
Brit+5.303
gment+5.266
Asset+5.021
Neg logit contributions
 Duo-6.025
enf-5.874
agra-5.828
eny-5.553
 Phi-5.551
Token Miller
Token ID7920
Feature activation-1.18e-01

Pos logit contributions
gement+2.935
llah+2.853
gment+2.648
Brit+2.567
Asset+2.470
Neg logit contributions
 Duo-4.887
agra-4.732
enf-4.700
eny-4.596
 Phi-4.552
Token.
Token ID13
Feature activation-6.09e-01

Pos logit contributions
gement+1.323
llah+1.270
gment+1.209
Brit+1.194
Asset+1.156
Neg logit contributions
 Duo-1.286
enf-1.221
 Phi-1.197
agra-1.197
eny-1.167
Token How
Token ID1374
Feature activation-1.09e+00

Pos logit contributions
gement+5.338
llah+5.211
Brit+5.075
Asset+4.841
gment+4.745
Neg logit contributions
enf-7.064
 Duo-6.980
agra-6.960
eny-6.857
 Phi-6.618
Token could
Token ID714
Feature activation-1.39e+00

Pos logit contributions
gement+10.143
llah+9.504
gment+8.999
Brit+8.902
 Roy+8.669
Neg logit contributions
 Duo-10.912
enf-10.839
agra-10.515
 Phi-10.042
cum-9.943
Token this
Token ID428
Feature activation-1.05e+00

Pos logit contributions
gement+11.100
llah+10.400
 Roy+10.386
Brit+10.200
Asset+9.986
Neg logit contributions
enf-13.530
agra-13.476
 Duo-12.825
eny-12.785
 Phi-12.441
Token go
Token ID467
Feature activation+2.39e-01

Pos logit contributions
gement+10.090
llah+9.498
Brit+9.023
gment+8.806
Asset+8.648
Neg logit contributions
agra-10.534
 Duo-10.367
enf-10.354
 Phi-10.014
eny-9.797
Token wrong
Token ID2642
Feature activation-3.18e-01

Pos logit contributions
 Duo+2.407
enf+2.254
 Phi+2.217
agra+2.195
eny+2.170
Neg logit contributions
gement-2.880
llah-2.768
gment-2.646
Brit-2.633
Asset-2.576
Token?!
Token ID12248
Feature activation-6.40e-01

Pos logit contributions
gement+3.027
llah+2.882
gment+2.725
Brit+2.684
Asset+2.565
Neg logit contributions
 Duo-3.945
enf-3.799
agra-3.737
 Phi-3.725
eny-3.641
Token
Token ID198
Feature activation+6.58e-02

Pos logit contributions
gement+5.704
llah+5.652
Brit+5.473
Asset+5.216
gment+5.065
Neg logit contributions
enf-7.297
agra-7.244
 Duo-7.187
eny-7.022
 Phi-6.812
Token made
Token ID925
Feature activation-4.17e-01

Pos logit contributions
 Duo+2.613
enf+2.487
 Phi+2.438
agra+2.406
eny+2.386
Neg logit contributions
gement-2.354
llah-2.281
Brit-2.155
gment-2.146
Asset-2.103
Token,
Token ID11
Feature activation-2.02e-01

Pos logit contributions
gement+4.305
llah+4.017
gment+3.864
Brit+3.783
Asset+3.610
Neg logit contributions
 Duo-4.818
enf-4.621
agra-4.534
 Phi-4.497
eny-4.350
Token and
Token ID290
Feature activation+3.67e-01

Pos logit contributions
gement+2.254
llah+2.141
gment+2.053
Brit+2.004
Asset+1.951
Neg logit contributions
 Duo-2.222
enf-2.135
agra-2.066
 Phi-2.061
eny-2.015
Token where
Token ID810
Feature activation-4.64e-02

Pos logit contributions
 Duo+3.441
 Phi+3.215
enf+3.189
agra+3.118
eny+3.082
Neg logit contributions
gement-4.559
llah-4.422
gment-4.237
Brit-4.235
Asset-4.135
Token they
Token ID484
Feature activation-3.61e-01

Pos logit contributions
gement+0.557
llah+0.535
gment+0.512
Brit+0.506
Asset+0.492
Neg logit contributions
 Duo-0.476
enf-0.451
 Phi-0.439
agra-0.438
eny-0.428
Token might
Token ID1244
Feature activation-7.95e-01

Pos logit contributions
gement+3.785
llah+3.502
gment+3.440
Brit+3.280
Asset+3.142
Neg logit contributions
 Duo-4.135
enf-4.058
 Phi-3.909
agra-3.880
eny-3.762
Token fit
Token ID4197
Feature activation+4.61e-02

Pos logit contributions
gement+7.786
gment+7.135
llah+7.004
Brit+6.564
 warranty+6.505
Neg logit contributions
 Duo-8.513
enf-8.413
agra-8.157
 Phi-7.998
eny-7.795
Token with
Token ID351
Feature activation-1.43e-01

Pos logit contributions
 Duo+0.488
enf+0.460
 Phi+0.452
agra+0.448
eny+0.439
Neg logit contributions
gement-0.540
llah-0.519
gment-0.495
Brit-0.490
Asset-0.475
Token the
Token ID262
Feature activation-2.23e-01

Pos logit contributions
gement+1.638
llah+1.560
gment+1.501
Brit+1.473
Asset+1.431
Neg logit contributions
 Duo-1.543
enf-1.469
agra-1.426
 Phi-1.423
eny-1.392
Token Canadiens
Token ID33664
Feature activation+3.56e-02

Pos logit contributions
gement+2.417
llah+2.278
gment+2.186
Brit+2.151
Asset+2.070
Neg logit contributions
 Duo-2.511
enf-2.423
agra-2.359
 Phi-2.347
eny-2.286
Token.
Token ID13
Feature activation-5.77e-02

Pos logit contributions
 Duo+0.403
enf+0.382
 Phi+0.375
agra+0.371
eny+0.366
Neg logit contributions
gement-0.391
llah-0.375
gment-0.357
Brit-0.352
Asset-0.341
Token punishing
Token ID28641
Feature activation-2.25e-02

Pos logit contributions
 Duo+0.941
enf+0.888
 Phi+0.874
agra+0.867
eny+0.850
Neg logit contributions
gement-1.003
llah-0.968
gment-0.918
Brit-0.912
Asset-0.881
Token the
Token ID262
Feature activation-8.77e-02

Pos logit contributions
gement+0.265
llah+0.255
gment+0.244
Brit+0.241
Asset+0.233
Neg logit contributions
 Duo-0.236
enf-0.223
 Phi-0.218
agra-0.217
eny-0.212
Token company
Token ID1664
Feature activation-2.03e-01

Pos logit contributions
gement+1.025
llah+0.983
gment+0.943
Brit+0.929
Asset+0.900
Neg logit contributions
 Duo-0.922
enf-0.879
 Phi-0.855
agra-0.854
eny-0.838
Token after
Token ID706
Feature activation-3.82e-01

Pos logit contributions
gement+2.248
llah+2.173
gment+2.069
Brit+2.050
Asset+1.963
Neg logit contributions
 Duo-2.241
enf-2.130
 Phi-2.067
agra-2.063
eny-2.017
Token Dish
Token ID33275
Feature activation-1.36e-01

Pos logit contributions
gement+4.054
llah+3.913
gment+3.723
Brit+3.634
Asset+3.562
Neg logit contributions
 Duo-4.247
enf-4.122
agra-3.933
eny-3.902
 Phi-3.898
Token lost
Token ID2626
Feature activation-7.72e-01

Pos logit contributions
gement+1.492
llah+1.429
gment+1.371
Brit+1.346
Asset+1.308
Neg logit contributions
 Duo-1.522
enf-1.456
agra-1.421
 Phi-1.417
eny-1.387
Token an
Token ID281
Feature activation-6.22e-01

Pos logit contributions
gement+7.711
gment+6.981
llah+6.874
 warranty+6.610
Brit+6.569
Neg logit contributions
 Duo-8.369
enf-8.099
eny-7.812
eless-7.590
 Phi-7.589
Token appeal
Token ID5198
Feature activation-3.81e-01

Pos logit contributions
gement+6.579
llah+5.996
gment+5.918
Asset+5.826
Brit+5.593
Neg logit contributions
 Duo-6.601
enf-6.403
eny-6.129
agra-6.049
 Phi-6.026
Token in
Token ID287
Feature activation-2.88e-01

Pos logit contributions
gement+4.172
llah+3.897
gment+3.834
Brit+3.695
Asset+3.520
Neg logit contributions
 Duo-4.186
enf-4.028
agra-3.918
 Phi-3.876
eny-3.862
Token a
Token ID257
Feature activation-1.61e-01

Pos logit contributions
gement+3.093
llah+2.908
gment+2.796
Brit+2.726
Asset+2.655
Neg logit contributions
 Duo-3.241
enf-3.125
agra-3.020
eny-3.000
 Phi-2.998
Token multi
Token ID5021
Feature activation-3.72e-01

Pos logit contributions
gement+1.660
llah+1.567
gment+1.511
Brit+1.477
Asset+1.425
Neg logit contributions
 Duo-1.894
enf-1.828
agra-1.765
 Phi-1.760
eny-1.757
Token benefits
Token ID4034
Feature activation+8.56e-02

Pos logit contributions
gement+0.505
llah+0.481
gment+0.458
Brit+0.450
Asset+0.436
Neg logit contributions
 Duo-0.618
enf-0.589
 Phi-0.577
agra-0.577
eny-0.566
Token for
Token ID329
Feature activation-2.93e-01

Pos logit contributions
 Duo+0.891
enf+0.844
 Phi+0.825
agra+0.816
eny+0.805
Neg logit contributions
gement-1.013
llah-0.975
gment-0.932
Brit-0.918
Asset-0.894
Token low
Token ID1877
Feature activation-6.40e-01

Pos logit contributions
gement+3.086
llah+2.891
gment+2.782
Brit+2.771
Asset+2.670
Neg logit contributions
 Duo-3.385
enf-3.267
agra-3.195
 Phi-3.153
eny-3.099
Token-
Token ID12
Feature activation-5.44e-01

Pos logit contributions
gement+5.566
llah+5.086
gment+4.951
Brit+4.863
Asset+4.748
Neg logit contributions
 Duo-8.184
agra-7.898
 Phi-7.700
enf-7.607
eny-7.403
Tokenincome
Token ID12519
Feature activation-4.82e-01

Pos logit contributions
gement+4.004
llah+3.628
Brit+3.596
gment+3.524
Asset+3.304
Neg logit contributions
 Duo-7.855
 Phi-7.483
agra-7.463
enf-7.300
eny-7.045
Token and
Token ID290
Feature activation-8.31e-01

Pos logit contributions
gement+4.439
llah+4.114
Brit+4.002
gment+3.991
Asset+3.726
Neg logit contributions
 Duo-5.924
enf-5.832
agra-5.825
 Phi-5.653
eny-5.533
Token middle
Token ID3504
Feature activation-1.33e-01

Pos logit contributions
gement+7.392
llah+6.754
Brit+6.684
gment+6.664
 warranty+6.210
Neg logit contributions
 Duo-9.490
agra-9.348
enf-9.106
 Phi-9.046
eny-8.789
Token-
Token ID12
Feature activation-1.30e-03

Pos logit contributions
gement+1.389
llah+1.325
gment+1.267
Brit+1.252
Asset+1.211
Neg logit contributions
 Duo-1.569
enf-1.496
 Phi-1.471
agra-1.468
eny-1.429
Tokenclass
Token ID4871
Feature activation-3.31e-01

Pos logit contributions
gement+0.008
llah+0.008
gment+0.007
Brit+0.007
Asset+0.007
Neg logit contributions
 Duo-0.021
enf-0.020
 Phi-0.020
agra-0.020
eny-0.019
Token Canadians
Token ID14008
Feature activation+5.98e-02

Pos logit contributions
gement+2.909
llah+2.710
gment+2.603
Brit+2.601
Asset+2.425
Neg logit contributions
 Duo-4.329
enf-4.249
agra-4.206
 Phi-4.133
eny-4.017
Token.
Token ID13
Feature activation+3.19e-01

Pos logit contributions
 Duo+0.643
enf+0.608
 Phi+0.596
agra+0.591
eny+0.583
Neg logit contributions
gement-0.691
llah-0.663
gment-0.634
Brit-0.624
Asset-0.607