TOP ACTIVATIONS
MAX = 4.126

first to hear about Nation Travels destinations , and explore the
first to hear about Nation Travels destinations , and explore the
com / om g its amb ry / status / 9
the day after G ant Heart . He was visiting from
which produces the E nerg izer brand . Now
the vulnerability of his prog eny , that he began to
teacher pensions Comp aring the experiences of charter school
PC , developed by Inf ogram es , it allows players
's event - The Walking Dead and An American Werewolf in
where he was the dish washer as pir ates
's L ir an E inav , turned out to have
the wheel of a Ford Fiesta R 5 , but the
attended ) including Shirley Ch ish ol m , the first
long delays apply to the deg assing from the ocean ic
, failed to contain an inherited hyper in flation ary crisis
who have kept up their criticism of the election and the
containing highly radioactive spent fuel rods at the stricken plant had
same dinosaurs . " Comp aring such different molecules as minerals
. Les Imp rompt us de Lac an
while watching an Orange Order walk in Glasgow on Saturday .

MIN ACTIVATIONS
MIN = -3.845

incredible . Bl acks in Philly have a different
Cub ans ( bl acks and mul att os ),
system gets me all tri pped up , and I end
up to leave and tri pped , falling backwards on to
Gand alf reports that Sar uman has betrayed them and is
, when Tor i tri pped and fell onto a hot
forehead a butt - cr ack with an ax : a
case scenario , it ro ts off my penis and then
and stem . View Full Caption Adam Glenn C
Then Friday , Trump u pped the ante by warning dark
41 , he has re pped the 305 since 1973 .
Wiki Leaks pr ides itself as a website devoted
images ( args [" images "] ): # load the image
tests undertaken after the Gren fell Tower fire , according to
: " Bar ack Obama would be the most
More research in ul cer prevention and treatment in le
you can sacrifice to damage tapped creatures , and Hot Soup
want to cr ack the code of
co . uk / s port Back then we
, art works and a detailed write up into the making

INTERVAL 2.134 to 4.126
CONTAINS 0.029%

if you 're wondering about the storms , I
mask ed - Sus ie Ess man 2008 01 28 O
an In Daily supporter . Powered by Press Pat ron <|endoftext|>
focus on exploration that W ats ham touched on is a
similar [ to the Ford Fiesta RS W RC ] so

INTERVAL 0.141 to 2.134
CONTAINS 35.768%

and it is on the floor of the reactor cavity .
doing other things - the pre - emption , referring to
in the White House ( she turned down a request from
ul c et tones . The closed beta test
ra e multiplied in mac roph ages at the site of

INTERVAL -1.852 to 0.141
CONTAINS 64.100%

they will spend it more carefully , and health care costs
content of this cover screen . On the home screen ,
a canopy created by the over hang of the building above
it . I am sure that we have completed
. During my deb rief with George he informed me that

INTERVAL -3.845 to -1.852
CONTAINS 0.104%

, past the ram sh ackle family home . Its waters
also known as " Ac ro ker ato el ast oid
captain . The inability to pinpoint the trigger leaves a crucial
Copyright © 2016 Else vier B . V .
ly golf course revenues and condo sales for a total
Token first
Token ID717
Feature activation+1.32e+00

Pos logit contributions
aan+0.736
xual+0.730
 laun+0.676
ista+0.672
 differe+0.597
Neg logit contributions
 mend-0.995
genic-0.915
 LIST-0.868
phabet-0.863
��-0.861
Token to
Token ID284
Feature activation+1.05e+00

Pos logit contributions
aan+0.918
xual+0.907
ista+0.847
 laun+0.846
 differe+0.762
Neg logit contributions
 mend-1.027
genic-0.939
 LIST-0.883
��-0.881
phabet-0.881
Token hear
Token ID3285
Feature activation+8.49e-01

Pos logit contributions
aan+0.926
xual+0.923
 laun+0.873
ista+0.870
 differe+0.801
Neg logit contributions
 mend-0.620
genic-0.545
 LIST-0.507
phabet-0.498
��-0.494
Token about
Token ID546
Feature activation+7.25e-01

Pos logit contributions
aan+0.768
xual+0.762
 laun+0.725
ista+0.722
 differe+0.669
Neg logit contributions
 mend-0.478
genic-0.422
 LIST-0.387
phabet-0.387
��-0.385
Token Nation
Token ID8741
Feature activation+4.69e-01

Pos logit contributions
aan+0.481
xual+0.463
 laun+0.442
ista+0.438
 differe+0.394
Neg logit contributions
 mend-0.579
genic-0.538
��-0.511
phabet-0.507
 LIST-0.503
Token Travels
Token ID48899
Feature activation+4.13e+00

Pos logit contributions
aan+0.300
xual+0.298
 laun+0.277
ista+0.272
asma+0.246
Neg logit contributions
 mend-0.391
genic-0.358
 LIST-0.341
phabet-0.337
��-0.335
Token destinations
Token ID23982
Feature activation+1.85e+00

Pos logit contributions
aan+2.893
xual+2.794
ista+2.674
 laun+2.584
 differe+2.384
Neg logit contributions
 mend-3.154
genic-2.860
��-2.805
phabet-2.750
 swe-2.699
Token,
Token ID11
Feature activation+9.60e-01

Pos logit contributions
aan+1.255
xual+1.242
ista+1.176
 laun+1.158
asma+1.053
Neg logit contributions
 mend-1.448
genic-1.323
��-1.273
phabet-1.258
 LIST-1.240
Token and
Token ID290
Feature activation+1.06e+00

Pos logit contributions
aan+0.565
xual+0.559
 laun+0.518
ista+0.512
 differe+0.459
Neg logit contributions
 mend-0.833
genic-0.772
��-0.760
phabet-0.746
 LIST-0.730
Token explore
Token ID7301
Feature activation+1.47e+00

Pos logit contributions
aan+0.672
xual+0.663
 laun+0.618
ista+0.613
 differe+0.547
Neg logit contributions
 mend-0.890
genic-0.823
��-0.780
phabet-0.777
 LIST-0.777
Token the
Token ID262
Feature activation+9.93e-01

Pos logit contributions
aan+1.097
xual+1.088
 laun+1.021
ista+1.016
 differe+0.922
Neg logit contributions
 mend-1.065
genic-0.965
 LIST-0.906
phabet-0.900
��-0.896
Token first
Token ID717
Feature activation+7.26e-01

Pos logit contributions
aan+0.512
xual+0.507
 laun+0.470
ista+0.467
 differe+0.415
Neg logit contributions
 mend-0.692
genic-0.636
 LIST-0.603
phabet-0.600
��-0.598
Token to
Token ID284
Feature activation+7.44e-01

Pos logit contributions
aan+0.509
xual+0.505
 laun+0.472
ista+0.470
 differe+0.423
Neg logit contributions
 mend-0.558
genic-0.509
 LIST-0.480
phabet-0.477
��-0.476
Token hear
Token ID3285
Feature activation+2.79e-01

Pos logit contributions
aan+0.624
xual+0.610
ista+0.582
 laun+0.581
 differe+0.536
Neg logit contributions
 mend-0.465
genic-0.423
��-0.393
phabet-0.392
 LIST-0.385
Token about
Token ID546
Feature activation+4.34e-01

Pos logit contributions
aan+0.252
xual+0.250
ista+0.238
 laun+0.237
 differe+0.219
Neg logit contributions
 mend-0.157
genic-0.138
 LIST-0.127
��-0.127
phabet-0.127
Token Nation
Token ID8741
Feature activation+4.95e-01

Pos logit contributions
aan+0.302
xual+0.299
 laun+0.279
ista+0.278
 differe+0.250
Neg logit contributions
 mend-0.337
genic-0.307
 LIST-0.290
��-0.289
phabet-0.288
Token Travels
Token ID48899
Feature activation+3.69e+00

Pos logit contributions
aan+0.309
xual+0.307
 laun+0.284
ista+0.282
 differe+0.251
Neg logit contributions
 mend-0.419
genic-0.385
 LIST-0.365
phabet-0.363
��-0.360
Token destinations
Token ID23982
Feature activation+1.19e+00

Pos logit contributions
aan+2.646
xual+2.579
ista+2.455
 laun+2.392
 differe+2.203
Neg logit contributions
 mend-2.773
genic-2.513
��-2.432
phabet-2.409
 swe-2.352
Token,
Token ID11
Feature activation+7.89e-01

Pos logit contributions
aan+0.839
xual+0.829
ista+0.777
 laun+0.770
asma+0.698
Neg logit contributions
 mend-0.902
genic-0.821
��-0.779
phabet-0.776
 LIST-0.773
Token and
Token ID290
Feature activation+7.66e-01

Pos logit contributions
aan+0.500
xual+0.494
ista+0.458
 laun+0.457
asma+0.406
Neg logit contributions
 mend-0.659
genic-0.606
��-0.574
 LIST-0.574
phabet-0.573
Token explore
Token ID7301
Feature activation+9.98e-01

Pos logit contributions
aan+0.478
xual+0.469
 laun+0.437
ista+0.435
 differe+0.386
Neg logit contributions
 mend-0.645
genic-0.595
 LIST-0.565
phabet-0.565
��-0.564
Token the
Token ID262
Feature activation+6.11e-01

Pos logit contributions
aan+0.753
xual+0.748
 laun+0.704
ista+0.697
 differe+0.635
Neg logit contributions
 mend-0.717
genic-0.648
 LIST-0.609
phabet-0.602
 swe-0.598
Tokencom
Token ID785
Feature activation-1.08e+00

Pos logit contributions
aan+0.093
xual+0.093
 laun+0.087
ista+0.086
 differe+0.079
Neg logit contributions
 mend-0.094
genic-0.085
 LIST-0.080
phabet-0.080
��-0.079
Token/
Token ID14
Feature activation+3.63e-01

Pos logit contributions
 mend+0.789
genic+0.716
 LIST+0.672
phabet+0.669
��+0.666
Neg logit contributions
aan-0.803
xual-0.794
 laun-0.746
ista-0.744
asma-0.674
Tokenom
Token ID296
Feature activation-3.70e-03

Pos logit contributions
aan+0.254
xual+0.252
 laun+0.236
ista+0.234
 differe+0.211
Neg logit contributions
 mend-0.280
genic-0.255
 LIST-0.241
phabet-0.239
��-0.238
Tokeng
Token ID70
Feature activation+3.89e-01

Pos logit contributions
 mend+0.003
genic+0.002
 LIST+0.002
phabet+0.002
��+0.002
Neg logit contributions
aan-0.003
xual-0.003
 laun-0.003
ista-0.003
 differe-0.002
Tokenits
Token ID896
Feature activation+7.06e-01

Pos logit contributions
aan+0.278
xual+0.276
 laun+0.259
ista+0.256
 differe+0.232
Neg logit contributions
 mend-0.294
genic-0.268
 LIST-0.252
phabet-0.250
��-0.249
Tokenamb
Token ID4131
Feature activation+3.23e+00

Pos logit contributions
aan+0.518
xual+0.513
 laun+0.480
ista+0.479
asma+0.433
Neg logit contributions
 mend-0.522
genic-0.473
 LIST-0.446
phabet-0.443
��-0.442
Tokenry
Token ID563
Feature activation+1.24e-01

Pos logit contributions
aan+2.330
xual+2.305
ista+2.169
 laun+2.129
asma+1.947
Neg logit contributions
 mend-2.430
genic-2.201
��-2.087
 LIST-2.082
phabet-2.054
Token/
Token ID14
Feature activation+8.67e-01

Pos logit contributions
aan+0.092
xual+0.091
 laun+0.085
ista+0.085
 differe+0.077
Neg logit contributions
 mend-0.091
genic-0.083
 LIST-0.078
phabet-0.077
��-0.077
Tokenstatus
Token ID13376
Feature activation-6.96e-01

Pos logit contributions
aan+0.635
xual+0.629
 laun+0.589
ista+0.587
 differe+0.531
Neg logit contributions
 mend-0.641
genic-0.583
 LIST-0.548
phabet-0.545
��-0.543
Token/
Token ID14
Feature activation+7.36e-01

Pos logit contributions
 mend+0.490
genic+0.445
 LIST+0.417
 swe+0.407
phabet+0.407
Neg logit contributions
aan-0.533
xual-0.533
 laun-0.504
ista-0.490
 differe-0.457
Token9
Token ID24
Feature activation+5.69e-01

Pos logit contributions
aan+0.504
xual+0.499
 laun+0.466
ista+0.464
 differe+0.416
Neg logit contributions
 mend-0.579
genic-0.529
 LIST-0.499
phabet-0.497
��-0.495
Token the
Token ID262
Feature activation+4.91e-01

Pos logit contributions
aan+0.082
xual+0.082
 laun+0.076
ista+0.076
asma+0.069
Neg logit contributions
 mend-0.075
genic-0.067
 LIST-0.063
 swe-0.062
phabet-0.062
Token day
Token ID1110
Feature activation+4.28e-01

Pos logit contributions
aan+0.358
xual+0.356
 laun+0.332
ista+0.331
asma+0.300
Neg logit contributions
 mend-0.367
genic-0.331
 LIST-0.313
 swe-0.309
phabet-0.309
Token after
Token ID706
Feature activation-5.93e-02

Pos logit contributions
aan+0.333
xual+0.331
 laun+0.311
ista+0.310
 differe+0.282
Neg logit contributions
 mend-0.296
genic-0.267
 LIST-0.250
phabet-0.248
��-0.246
Token G
Token ID402
Feature activation+4.51e-01

Pos logit contributions
 mend+0.043
genic+0.039
 LIST+0.037
phabet+0.036
��+0.036
Neg logit contributions
aan-0.044
xual-0.044
 laun-0.041
ista-0.041
 differe-0.037
Tokenant
Token ID415
Feature activation+2.46e+00

Pos logit contributions
xual+0.316
aan+0.315
 laun+0.295
ista+0.290
 differe+0.265
Neg logit contributions
 mend-0.346
genic-0.314
 LIST-0.298
phabet-0.295
 swe-0.292
Token Heart
Token ID8894
Feature activation+3.09e+00

Pos logit contributions
aan+1.709
xual+1.694
 laun+1.582
ista+1.575
 differe+1.417
Neg logit contributions
 mend-1.905
genic-1.738
 LIST-1.640
phabet-1.630
��-1.625
Token.
Token ID13
Feature activation+5.37e-01

Pos logit contributions
aan+2.243
xual+2.216
ista+2.084
 laun+2.081
 differe+1.874
Neg logit contributions
 mend-2.285
genic-2.089
��-1.962
phabet-1.959
 LIST-1.955
Token He
Token ID679
Feature activation-3.67e-02

Pos logit contributions
aan+0.444
xual+0.441
 laun+0.417
ista+0.415
 differe+0.381
Neg logit contributions
 mend-0.346
genic-0.309
 LIST-0.289
phabet-0.285
��-0.284
Token was
Token ID373
Feature activation+8.31e-01

Pos logit contributions
 mend+0.025
genic+0.022
 LIST+0.021
 swe+0.021
phabet+0.021
Neg logit contributions
aan-0.029
xual-0.029
 laun-0.027
ista-0.027
asma-0.025
Token visiting
Token ID10013
Feature activation+1.12e+00

Pos logit contributions
aan+0.639
xual+0.635
 laun+0.596
ista+0.593
 differe+0.540
Neg logit contributions
 mend-0.583
genic-0.526
 LIST-0.494
phabet-0.489
��-0.487
Token from
Token ID422
Feature activation+5.27e-01

Pos logit contributions
aan+0.864
xual+0.857
 laun+0.808
ista+0.805
 differe+0.733
Neg logit contributions
 mend-0.778
genic-0.704
 LIST-0.657
phabet-0.655
��-0.654
Token which
Token ID543
Feature activation-1.70e-02

Pos logit contributions
 mend+0.008
genic+0.007
 LIST+0.007
phabet+0.007
 swe+0.007
Neg logit contributions
aan-0.007
xual-0.007
 laun-0.006
ista-0.006
asma-0.006
Token produces
Token ID11073
Feature activation+4.55e-01

Pos logit contributions
 mend+0.012
genic+0.011
 LIST+0.010
 swe+0.010
phabet+0.010
Neg logit contributions
aan-0.013
xual-0.013
 laun-0.012
ista-0.012
asma-0.011
Token the
Token ID262
Feature activation+1.99e-01

Pos logit contributions
aan+0.345
xual+0.342
 laun+0.321
ista+0.320
 differe+0.291
Neg logit contributions
 mend-0.324
genic-0.293
 LIST-0.275
phabet-0.273
��-0.272
Token E
Token ID412
Feature activation-7.77e-01

Pos logit contributions
aan+0.156
xual+0.155
 laun+0.145
ista+0.145
 differe+0.132
Neg logit contributions
 mend-0.138
genic-0.125
 LIST-0.116
phabet-0.115
��-0.115
Tokennerg
Token ID25649
Feature activation+1.25e+00

Pos logit contributions
 mend+0.581
genic+0.533
phabet+0.497
 LIST+0.497
 swe+0.490
Neg logit contributions
aan-0.561
xual-0.557
 laun-0.521
ista-0.517
 differe-0.467
Tokenizer
Token ID7509
Feature activation+2.92e+00

Pos logit contributions
aan+0.718
xual+0.712
 laun+0.655
ista+0.646
 differe+0.571
Neg logit contributions
 mend-1.116
genic-1.033
 LIST-0.981
phabet-0.977
��-0.972
Token brand
Token ID4508
Feature activation+8.29e-02

Pos logit contributions
aan+2.202
xual+2.180
 laun+2.064
ista+2.063
 differe+1.869
Neg logit contributions
 mend-2.065
genic-1.857
��-1.777
 LIST-1.753
phabet-1.739
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
aan+0.058
xual+0.058
 laun+0.054
ista+0.054
 differe+0.048
Neg logit contributions
 mend-0.064
genic-0.058
 LIST-0.055
phabet-0.054
��-0.054
Token
Token ID198
Feature activation-3.60e-02

Pos logit contributions
aan+0.098
xual+0.098
 laun+0.092
ista+0.092
 differe+0.084
Neg logit contributions
 mend-0.077
genic-0.069
 LIST-0.064
phabet-0.063
��-0.063
Token
Token ID198
Feature activation-2.20e-01

Pos logit contributions
 mend+0.025
genic+0.023
 LIST+0.021
phabet+0.021
 swe+0.021
Neg logit contributions
aan-0.028
xual-0.028
 laun-0.026
ista-0.026
 differe-0.024
TokenNow
Token ID3844
Feature activation+4.07e-01

Pos logit contributions
 mend+0.186
genic+0.171
 LIST+0.162
phabet+0.161
joining+0.160
Neg logit contributions
xual-0.138
aan-0.138
 laun-0.128
ista-0.126
 differe-0.113
Token the
Token ID262
Feature activation+2.75e-02

Pos logit contributions
aan+0.366
xual+0.363
 laun+0.341
ista+0.340
 differe+0.309
Neg logit contributions
 mend-0.338
genic-0.306
 LIST-0.287
phabet-0.285
��-0.284
Token vulnerability
Token ID15131
Feature activation-3.73e-01

Pos logit contributions
aan+0.020
xual+0.020
 laun+0.019
ista+0.019
 differe+0.017
Neg logit contributions
 mend-0.020
genic-0.018
 LIST-0.017
phabet-0.017
��-0.017
Token of
Token ID286
Feature activation+7.57e-02

Pos logit contributions
 mend+0.290
genic+0.265
 LIST+0.250
phabet+0.248
��+0.247
Neg logit contributions
aan-0.259
xual-0.257
 laun-0.240
ista-0.238
 differe-0.215
Token his
Token ID465
Feature activation+2.28e-01

Pos logit contributions
aan+0.060
xual+0.060
 laun+0.056
ista+0.056
 differe+0.051
Neg logit contributions
 mend-0.051
genic-0.046
 LIST-0.043
phabet-0.043
��-0.043
Token prog
Token ID1172
Feature activation+1.49e-01

Pos logit contributions
aan+0.164
xual+0.163
 laun+0.152
ista+0.152
 differe+0.137
Neg logit contributions
 mend-0.172
genic-0.156
 LIST-0.147
 swe-0.145
phabet-0.145
Tokeneny
Token ID28558
Feature activation+2.81e+00

Pos logit contributions
aan+0.062
xual+0.060
ista+0.054
 laun+0.053
asma+0.043
Neg logit contributions
 mend-0.157
genic-0.147
��-0.142
 LIST-0.141
phabet-0.141
Token,
Token ID11
Feature activation+2.69e-02

Pos logit contributions
aan+1.897
xual+1.896
ista+1.766
 laun+1.760
 differe+1.584
Neg logit contributions
 mend-2.185
genic-2.004
phabet-1.911
��-1.910
 LIST-1.895
Token that
Token ID326
Feature activation+7.25e-02

Pos logit contributions
aan+0.020
xual+0.020
 laun+0.019
ista+0.019
 differe+0.017
Neg logit contributions
 mend-0.019
genic-0.018
 LIST-0.017
phabet-0.016
 swe-0.016
Token he
Token ID339
Feature activation+1.32e-01

Pos logit contributions
aan+0.054
xual+0.053
 laun+0.050
ista+0.050
 differe+0.045
Neg logit contributions
 mend-0.053
genic-0.048
 LIST-0.045
phabet-0.045
��-0.044
Token began
Token ID2540
Feature activation-1.55e-01

Pos logit contributions
aan+0.096
xual+0.096
 laun+0.089
ista+0.089
asma+0.080
Neg logit contributions
 mend-0.099
genic-0.089
 LIST-0.084
 swe-0.084
phabet-0.083
Token to
Token ID284
Feature activation+8.26e-02

Pos logit contributions
 mend+0.111
genic+0.100
 LIST+0.094
phabet+0.093
��+0.093
Neg logit contributions
aan-0.118
xual-0.117
 laun-0.110
ista-0.109
 differe-0.099
Token teacher
Token ID4701
Feature activation+4.26e-01

Pos logit contributions
 mend+0.057
genic+0.051
 LIST+0.047
phabet+0.047
 swe+0.046
Neg logit contributions
aan-0.071
xual-0.070
 laun-0.066
ista-0.066
asma-0.060
Token pensions
Token ID27581
Feature activation+4.13e-02

Pos logit contributions
aan+0.360
xual+0.357
 laun+0.338
ista+0.336
 differe+0.309
Neg logit contributions
 mend-0.267
genic-0.238
 LIST-0.221
phabet-0.219
��-0.218
Token
Token ID198
Feature activation-8.07e-01

Pos logit contributions
aan+0.029
xual+0.029
 laun+0.027
ista+0.027
 differe+0.024
Neg logit contributions
 mend-0.031
genic-0.029
 LIST-0.027
phabet-0.027
 swe-0.027
Token
Token ID198
Feature activation-6.21e-01

Pos logit contributions
 mend+0.548
genic+0.493
 LIST+0.463
 swe+0.452
joining+0.449
Neg logit contributions
xual-0.644
aan-0.641
 laun-0.613
ista-0.589
 differe-0.553
TokenComp
Token ID7293
Feature activation-9.60e-01

Pos logit contributions
 mend+0.516
genic+0.478
 LIST+0.451
phabet+0.447
joining+0.447
Neg logit contributions
xual-0.400
aan-0.393
 laun-0.374
ista-0.356
 differe-0.328
Tokenaring
Token ID1723
Feature activation+2.80e+00

Pos logit contributions
 mend+0.596
genic+0.535
phabet+0.494
 LIST+0.493
joining+0.482
Neg logit contributions
xual-0.810
aan-0.804
 laun-0.768
ista-0.759
 differe-0.703
Token the
Token ID262
Feature activation+4.05e-02

Pos logit contributions
aan+2.213
xual+2.199
ista+2.086
 laun+2.068
 differe+1.908
Neg logit contributions
 mend-1.868
genic-1.689
��-1.587
phabet-1.578
 LIST-1.567
Token experiences
Token ID6461
Feature activation-7.06e-01

Pos logit contributions
aan+0.031
xual+0.031
 laun+0.029
ista+0.029
asma+0.027
Neg logit contributions
 mend-0.028
genic-0.025
 LIST-0.024
phabet-0.024
��-0.023
Token of
Token ID286
Feature activation+2.16e-02

Pos logit contributions
 mend+0.545
genic+0.497
 LIST+0.469
phabet+0.466
��+0.465
Neg logit contributions
aan-0.494
xual-0.489
 laun-0.458
ista-0.453
asma-0.409
Token charter
Token ID17422
Feature activation-2.01e-01

Pos logit contributions
aan+0.018
xual+0.017
 laun+0.016
ista+0.016
 differe+0.015
Neg logit contributions
 mend-0.014
genic-0.013
 LIST-0.012
phabet-0.012
��-0.012
Token school
Token ID1524
Feature activation+2.35e-01

Pos logit contributions
 mend+0.123
genic+0.110
 LIST+0.102
phabet+0.101
��+0.100
Neg logit contributions
aan-0.172
xual-0.171
 laun-0.161
ista-0.161
 differe-0.148
Token PC
Token ID4217
Feature activation+7.48e-01

Pos logit contributions
aan+0.582
xual+0.577
 laun+0.543
ista+0.540
 differe+0.491
Neg logit contributions
 mend-0.544
genic-0.492
 LIST-0.461
phabet-0.458
��-0.456
Token,
Token ID11
Feature activation+6.76e-01

Pos logit contributions
aan+0.532
xual+0.527
 laun+0.495
ista+0.493
 differe+0.444
Neg logit contributions
 mend-0.567
genic-0.515
 LIST-0.485
phabet-0.484
��-0.482
Token developed
Token ID4166
Feature activation+1.75e+00

Pos logit contributions
aan+0.514
xual+0.510
 laun+0.479
ista+0.477
 differe+0.434
Neg logit contributions
 mend-0.480
genic-0.434
 LIST-0.407
phabet-0.404
��-0.403
Token by
Token ID416
Feature activation+3.91e-01

Pos logit contributions
aan+1.273
xual+1.262
 laun+1.184
ista+1.179
 differe+1.066
Neg logit contributions
 mend-1.295
genic-1.176
 LIST-1.107
phabet-1.101
��-1.096
Token Inf
Token ID4806
Feature activation+1.20e+00

Pos logit contributions
aan+0.294
xual+0.292
 laun+0.273
ista+0.271
 differe+0.247
Neg logit contributions
 mend-0.283
genic-0.255
 LIST-0.240
phabet-0.237
��-0.236
Tokenogram
Token ID21857
Feature activation+2.75e+00

Pos logit contributions
xual+0.695
aan+0.689
 laun+0.636
ista+0.615
 differe+0.556
Neg logit contributions
 mend-1.073
genic-0.988
 LIST-0.939
phabet-0.936
��-0.930
Tokenes
Token ID274
Feature activation+6.51e-01

Pos logit contributions
xual+1.216
aan+1.216
ista+1.075
 laun+1.040
asma+0.867
Neg logit contributions
 mend-2.820
genic-2.654
phabet-2.544
 LIST-2.542
��-2.540
Token,
Token ID11
Feature activation+8.60e-01

Pos logit contributions
aan+0.452
xual+0.448
 laun+0.418
ista+0.416
 differe+0.375
Neg logit contributions
 mend-0.506
genic-0.462
 LIST-0.436
phabet-0.433
��-0.431
Token it
Token ID340
Feature activation+5.38e-01

Pos logit contributions
aan+0.656
xual+0.651
 laun+0.611
ista+0.608
 differe+0.554
Neg logit contributions
 mend-0.610
genic-0.551
 LIST-0.517
phabet-0.513
��-0.511
Token allows
Token ID3578
Feature activation+9.83e-01

Pos logit contributions
aan+0.415
xual+0.411
 laun+0.387
ista+0.385
 differe+0.351
Neg logit contributions
 mend-0.377
genic-0.341
 LIST-0.319
phabet-0.317
��-0.315
Token players
Token ID1938
Feature activation+7.83e-01

Pos logit contributions
aan+0.747
xual+0.742
 laun+0.696
ista+0.692
 differe+0.630
Neg logit contributions
 mend-0.700
genic-0.633
 LIST-0.593
phabet-0.586
��-0.584
Token's
Token ID338
Feature activation-4.50e-01

Pos logit contributions
 mend+0.170
genic+0.156
 LIST+0.147
phabet+0.146
��+0.146
Neg logit contributions
aan-0.140
xual-0.139
 laun-0.129
ista-0.128
 differe-0.115
Token event
Token ID1785
Feature activation-1.06e-01

Pos logit contributions
 mend+0.310
genic+0.279
 LIST+0.262
 swe+0.257
phabet+0.257
Neg logit contributions
aan-0.354
xual-0.352
 laun-0.329
ista-0.328
asma-0.299
Token -
Token ID532
Feature activation-3.71e-02

Pos logit contributions
 mend+0.082
genic+0.074
 LIST+0.070
phabet+0.070
��+0.069
Neg logit contributions
aan-0.074
xual-0.073
 laun-0.068
ista-0.068
 differe-0.061
Token The
Token ID383
Feature activation+6.97e-01

Pos logit contributions
 mend+0.026
genic+0.023
 LIST+0.022
phabet+0.022
��+0.022
Neg logit contributions
aan-0.029
xual-0.028
 laun-0.027
ista-0.027
 differe-0.024
Token Walking
Token ID21276
Feature activation+1.72e+00

Pos logit contributions
aan+0.519
xual+0.515
 laun+0.483
ista+0.481
 differe+0.436
Neg logit contributions
 mend-0.506
genic-0.459
 LIST-0.431
phabet-0.428
��-0.426
Token Dead
Token ID5542
Feature activation+2.75e+00

Pos logit contributions
aan+0.999
xual+0.983
 laun+0.908
ista+0.907
asma+0.795
Neg logit contributions
 mend-1.522
genic-1.408
 LIST-1.335
phabet-1.334
��-1.330
Token and
Token ID290
Feature activation+1.90e-01

Pos logit contributions
aan+1.803
xual+1.781
 laun+1.668
ista+1.661
asma+1.484
Neg logit contributions
 mend-2.228
genic-2.047
phabet-1.938
 LIST-1.937
��-1.929
Token An
Token ID1052
Feature activation+6.67e-01

Pos logit contributions
aan+0.149
xual+0.148
 laun+0.139
ista+0.139
 differe+0.127
Neg logit contributions
 mend-0.130
genic-0.117
 LIST-0.109
phabet-0.108
��-0.108
Token American
Token ID1605
Feature activation-1.96e-01

Pos logit contributions
aan+0.527
xual+0.517
ista+0.492
 laun+0.487
asma+0.447
Neg logit contributions
 mend-0.455
genic-0.411
��-0.384
phabet-0.382
 LIST-0.382
Token Werewolf
Token ID40261
Feature activation+8.60e-01

Pos logit contributions
 mend+0.113
genic+0.101
 LIST+0.093
 swe+0.090
phabet+0.089
Neg logit contributions
aan-0.175
xual-0.175
 laun-0.168
ista-0.161
 differe-0.154
Token in
Token ID287
Feature activation+1.49e-01

Pos logit contributions
aan+0.594
xual+0.589
 laun+0.550
ista+0.546
 differe+0.492
Neg logit contributions
 mend-0.673
genic-0.612
 LIST-0.579
phabet-0.575
��-0.572
Token where
Token ID810
Feature activation+6.23e-01

Pos logit contributions
aan+0.390
xual+0.387
 laun+0.362
ista+0.360
 differe+0.326
Neg logit contributions
 mend-0.411
genic-0.374
 LIST-0.352
phabet-0.350
��-0.349
Token he
Token ID339
Feature activation+2.19e-01

Pos logit contributions
aan+0.415
xual+0.411
 laun+0.383
ista+0.381
 differe+0.341
Neg logit contributions
 mend-0.501
genic-0.459
 LIST-0.434
phabet-0.432
��-0.431
Token was
Token ID373
Feature activation+2.94e-01

Pos logit contributions
aan+0.169
xual+0.168
 laun+0.158
ista+0.157
 differe+0.143
Neg logit contributions
 mend-0.153
genic-0.138
 LIST-0.130
phabet-0.128
 swe-0.128
Token the
Token ID262
Feature activation-1.05e-01

Pos logit contributions
aan+0.223
xual+0.223
 laun+0.207
ista+0.206
 differe+0.188
Neg logit contributions
 mend-0.211
genic-0.190
 LIST-0.179
 swe-0.176
phabet-0.176
Token dish
Token ID9433
Feature activation+1.01e+00

Pos logit contributions
 mend+0.073
genic+0.066
 LIST+0.062
phabet+0.061
 swe+0.061
Neg logit contributions
aan-0.082
xual-0.082
 laun-0.077
ista-0.076
asma-0.070
Tokenwasher
Token ID45146
Feature activation+2.69e+00

Pos logit contributions
aan+0.640
xual+0.637
ista+0.593
 laun+0.592
 differe+0.525
Neg logit contributions
 mend-0.835
genic-0.771
��-0.730
 LIST-0.726
phabet-0.720
Token as
Token ID355
Feature activation-2.96e-02

Pos logit contributions
aan+1.817
xual+1.799
 laun+1.679
ista+1.679
 differe+1.499
Neg logit contributions
 mend-2.121
genic-1.944
phabet-1.835
 LIST-1.835
��-1.833
Token
Token ID564
Feature activation+6.15e-02

Pos logit contributions
 mend+0.022
genic+0.020
 LIST+0.019
phabet+0.019
��+0.018
Neg logit contributions
aan-0.022
xual-0.022
 laun-0.020
ista-0.020
 differe-0.018
Token
Token ID250
Feature activation-1.59e-01

Pos logit contributions
aan+0.054
xual+0.054
 laun+0.051
ista+0.051
 differe+0.047
Neg logit contributions
 mend-0.036
genic-0.032
 LIST-0.029
phabet-0.029
��-0.029
Tokenpir
Token ID4063
Feature activation+1.02e+00

Pos logit contributions
 mend+0.128
genic+0.117
 LIST+0.111
phabet+0.110
��+0.109
Neg logit contributions
aan-0.105
xual-0.105
 laun-0.098
ista-0.097
 differe-0.087
Tokenates
Token ID689
Feature activation-1.23e+00

Pos logit contributions
aan+0.554
xual+0.547
 laun+0.501
ista+0.498
 differe+0.432
Neg logit contributions
 mend-0.953
genic-0.883
 LIST-0.843
phabet-0.838
��-0.837
Token's
Token ID338
Feature activation-2.95e-01

Pos logit contributions
 mend+0.133
genic+0.121
 LIST+0.115
phabet+0.113
 swe+0.113
Neg logit contributions
aan-0.115
xual-0.114
 laun-0.106
ista-0.105
asma-0.094
Token L
Token ID406
Feature activation+1.53e-02

Pos logit contributions
 mend+0.216
genic+0.195
 LIST+0.184
phabet+0.181
 swe+0.180
Neg logit contributions
aan-0.219
xual-0.218
 laun-0.204
ista-0.203
asma-0.184
Tokenir
Token ID343
Feature activation-6.55e-01

Pos logit contributions
xual+0.011
aan+0.011
 laun+0.010
ista+0.010
 differe+0.009
Neg logit contributions
 mend-0.012
genic-0.011
phabet-0.010
 LIST-0.010
 swe-0.010
Tokenan
Token ID272
Feature activation-9.74e-02

Pos logit contributions
 mend+0.447
genic+0.404
 LIST+0.377
phabet+0.374
��+0.371
Neg logit contributions
aan-0.515
xual-0.513
 laun-0.483
ista-0.479
 differe-0.438
Token E
Token ID412
Feature activation-8.69e-01

Pos logit contributions
 mend+0.067
genic+0.061
 LIST+0.057
phabet+0.056
 swe+0.056
Neg logit contributions
aan-0.076
xual-0.076
 laun-0.071
ista-0.070
 differe-0.065
Tokeninav
Token ID26802
Feature activation+2.68e+00

Pos logit contributions
 mend+0.666
genic+0.608
 LIST+0.572
phabet+0.569
��+0.565
Neg logit contributions
aan-0.611
xual-0.607
 laun-0.567
ista-0.563
 differe-0.508
Token,
Token ID11
Feature activation+2.99e-01

Pos logit contributions
aan+1.864
xual+1.831
ista+1.720
 laun+1.710
 differe+1.535
Neg logit contributions
 mend-2.076
genic-1.893
��-1.791
 LIST-1.787
phabet-1.780
Token turned
Token ID2900
Feature activation-2.83e-01

Pos logit contributions
aan+0.224
xual+0.223
 laun+0.208
ista+0.207
asma+0.188
Neg logit contributions
 mend-0.218
genic-0.196
 LIST-0.185
 swe-0.183
phabet-0.182
Token out
Token ID503
Feature activation-6.82e-02

Pos logit contributions
 mend+0.190
genic+0.170
 LIST+0.158
phabet+0.156
 swe+0.156
Neg logit contributions
aan-0.229
xual-0.228
 laun-0.214
ista-0.212
asma-0.195
Token to
Token ID284
Feature activation+2.72e-02

Pos logit contributions
 mend+0.053
genic+0.048
 LIST+0.045
phabet+0.045
��+0.045
Neg logit contributions
aan-0.048
xual-0.047
 laun-0.044
ista-0.044
 differe-0.040
Token have
Token ID423
Feature activation+3.91e-01

Pos logit contributions
aan+0.019
xual+0.019
ista+0.018
 laun+0.018
 differe+0.016
Neg logit contributions
 mend-0.021
genic-0.019
phabet-0.018
��-0.018
 LIST-0.018
Token the
Token ID262
Feature activation+2.06e-01

Pos logit contributions
 mend+0.490
genic+0.443
 LIST+0.416
phabet+0.413
��+0.411
Neg logit contributions
aan-0.519
xual-0.516
 laun-0.484
ista-0.482
 differe-0.438
Token wheel
Token ID7825
Feature activation+6.47e-01

Pos logit contributions
aan+0.181
xual+0.179
 laun+0.170
ista+0.170
 differe+0.156
Neg logit contributions
 mend-0.123
genic-0.109
 LIST-0.100
phabet-0.100
��-0.099
Token of
Token ID286
Feature activation+7.73e-02

Pos logit contributions
aan+0.441
xual+0.437
 laun+0.408
ista+0.406
 differe+0.364
Neg logit contributions
 mend-0.511
genic-0.467
 LIST-0.441
phabet-0.438
��-0.438
Token a
Token ID257
Feature activation+2.28e-01

Pos logit contributions
aan+0.060
xual+0.059
 laun+0.056
ista+0.056
 differe+0.051
Neg logit contributions
 mend-0.054
genic-0.049
 LIST-0.046
phabet-0.045
��-0.045
Token Ford
Token ID8092
Feature activation+4.86e-01

Pos logit contributions
aan+0.163
xual+0.161
 laun+0.151
ista+0.151
 differe+0.136
Neg logit contributions
 mend-0.172
genic-0.156
 LIST-0.147
phabet-0.147
��-0.146
Token Fiesta
Token ID49839
Feature activation+2.60e+00

Pos logit contributions
aan+0.321
xual+0.317
 laun+0.296
ista+0.295
 differe+0.263
Neg logit contributions
 mend-0.394
genic-0.361
 LIST-0.341
phabet-0.340
��-0.339
Token R
Token ID371
Feature activation-8.39e-01

Pos logit contributions
aan+1.927
xual+1.910
ista+1.815
 laun+1.800
 differe+1.634
Neg logit contributions
 mend-1.869
genic-1.692
��-1.595
phabet-1.583
 LIST-1.573
Token5
Token ID20
Feature activation-3.50e-01

Pos logit contributions
 mend+0.628
genic+0.571
 LIST+0.537
phabet+0.534
��+0.533
Neg logit contributions
aan-0.606
xual-0.600
 laun-0.562
ista-0.561
asma-0.505
Token,
Token ID11
Feature activation+1.19e-01

Pos logit contributions
 mend+0.271
genic+0.247
 LIST+0.233
phabet+0.232
��+0.231
Neg logit contributions
aan-0.244
xual-0.241
 laun-0.226
ista-0.224
 differe-0.202
Token but
Token ID475
Feature activation+2.07e-01

Pos logit contributions
aan+0.087
xual+0.086
 laun+0.081
ista+0.080
 differe+0.073
Neg logit contributions
 mend-0.088
genic-0.080
 LIST-0.075
phabet-0.074
��-0.074
Token the
Token ID262
Feature activation+3.42e-01

Pos logit contributions
aan+0.169
xual+0.168
 laun+0.158
ista+0.158
 differe+0.145
Neg logit contributions
 mend-0.135
genic-0.121
 LIST-0.113
phabet-0.112
��-0.112
Token attended
Token ID9141
Feature activation-3.96e-01

Pos logit contributions
aan+0.076
xual+0.075
 laun+0.071
ista+0.070
 differe+0.065
Neg logit contributions
 mend-0.055
genic-0.048
 LIST-0.045
 swe-0.045
phabet-0.044
Token)
Token ID8
Feature activation-3.02e-01

Pos logit contributions
 mend+0.283
genic+0.255
 LIST+0.240
phabet+0.238
 swe+0.237
Neg logit contributions
aan-0.301
xual-0.299
 laun-0.280
ista-0.278
asma-0.254
Token including
Token ID1390
Feature activation+8.60e-02

Pos logit contributions
 mend+0.240
genic+0.219
 LIST+0.207
phabet+0.205
 swe+0.205
Neg logit contributions
aan-0.205
xual-0.203
 laun-0.189
ista-0.187
 differe-0.168
Token Shirley
Token ID40902
Feature activation+8.09e-01

Pos logit contributions
aan+0.066
xual+0.066
 laun+0.062
ista+0.062
 differe+0.056
Neg logit contributions
 mend-0.060
genic-0.054
 LIST-0.051
phabet-0.050
��-0.050
Token Ch
Token ID609
Feature activation+1.06e-01

Pos logit contributions
aan+0.613
xual+0.605
 laun+0.569
ista+0.569
 differe+0.516
Neg logit contributions
 mend-0.577
genic-0.524
 LIST-0.490
phabet-0.489
��-0.489
Tokenish
Token ID680
Feature activation+2.60e+00

Pos logit contributions
aan+0.072
xual+0.071
ista+0.067
 laun+0.066
asma+0.060
Neg logit contributions
 mend-0.084
genic-0.077
 LIST-0.073
��-0.072
phabet-0.072
Tokenol
Token ID349
Feature activation+1.86e+00

Pos logit contributions
aan+1.532
xual+1.479
ista+1.388
 laun+1.382
asma+1.223
Neg logit contributions
 mend-2.296
genic-2.134
��-2.052
 LIST-2.028
phabet-2.020
Tokenm
Token ID76
Feature activation+1.38e+00

Pos logit contributions
aan+0.896
xual+0.881
ista+0.844
 laun+0.785
asma+0.709
Neg logit contributions
 mend-1.824
genic-1.696
��-1.633
phabet-1.629
 LIST-1.626
Token,
Token ID11
Feature activation+7.09e-01

Pos logit contributions
aan+0.972
xual+0.964
 laun+0.901
ista+0.901
 differe+0.809
Neg logit contributions
 mend-1.050
genic-0.958
 LIST-0.902
phabet-0.899
��-0.895
Token the
Token ID262
Feature activation+5.96e-01

Pos logit contributions
aan+0.549
xual+0.544
 laun+0.514
ista+0.512
 differe+0.466
Neg logit contributions
 mend-0.491
genic-0.444
 LIST-0.415
phabet-0.414
��-0.413
Token first
Token ID717
Feature activation+3.87e-01

Pos logit contributions
aan+0.433
xual+0.429
 laun+0.403
ista+0.401
 differe+0.363
Neg logit contributions
 mend-0.443
genic-0.403
 LIST-0.379
phabet-0.377
��-0.376
Token long
Token ID890
Feature activation+1.59e-01

Pos logit contributions
aan+0.169
xual+0.168
 laun+0.158
ista+0.157
asma+0.142
Neg logit contributions
 mend-0.164
genic-0.149
 LIST-0.139
phabet-0.138
��-0.138
Token delays
Token ID16119
Feature activation-2.32e-01

Pos logit contributions
aan+0.116
xual+0.115
 laun+0.107
ista+0.107
 differe+0.097
Neg logit contributions
 mend-0.118
genic-0.107
 LIST-0.101
��-0.101
phabet-0.101
Token apply
Token ID4174
Feature activation+4.10e-01

Pos logit contributions
 mend+0.175
genic+0.160
 LIST+0.150
phabet+0.149
��+0.149
Neg logit contributions
aan-0.165
xual-0.164
 laun-0.154
ista-0.153
 differe-0.138
Token to
Token ID284
Feature activation+1.62e-01

Pos logit contributions
aan+0.287
xual+0.285
 laun+0.265
ista+0.265
 differe+0.239
Neg logit contributions
 mend-0.316
genic-0.288
 LIST-0.272
phabet-0.271
��-0.270
Token the
Token ID262
Feature activation-1.15e-02

Pos logit contributions
aan+0.125
xual+0.125
 laun+0.118
ista+0.117
 differe+0.107
Neg logit contributions
 mend-0.112
genic-0.101
��-0.095
phabet-0.095
 LIST-0.095
Token deg
Token ID3396
Feature activation+2.60e+00

Pos logit contributions
 mend+0.008
genic+0.007
 LIST+0.007
phabet+0.007
��+0.007
Neg logit contributions
aan-0.009
xual-0.009
 laun-0.008
ista-0.008
 differe-0.007
Tokenassing
Token ID19696
Feature activation+8.26e-02

Pos logit contributions
aan+1.763
xual+1.743
 laun+1.630
ista+1.624
 differe+1.463
Neg logit contributions
 mend-2.036
genic-1.854
��-1.799
 LIST-1.781
phabet-1.767
Token from
Token ID422
Feature activation+1.32e-01

Pos logit contributions
aan+0.058
xual+0.057
 laun+0.053
ista+0.053
 differe+0.048
Neg logit contributions
 mend-0.064
genic-0.058
 LIST-0.055
phabet-0.055
��-0.055
Token the
Token ID262
Feature activation-8.25e-02

Pos logit contributions
aan+0.102
xual+0.101
 laun+0.096
ista+0.095
 differe+0.087
Neg logit contributions
 mend-0.091
genic-0.082
 LIST-0.077
phabet-0.077
��-0.077
Token ocean
Token ID9151
Feature activation-1.72e-02

Pos logit contributions
 mend+0.057
genic+0.052
 LIST+0.048
phabet+0.048
��+0.048
Neg logit contributions
aan-0.064
xual-0.064
 laun-0.060
ista-0.060
 differe-0.054
Tokenic
Token ID291
Feature activation+5.61e-01

Pos logit contributions
 mend+0.013
genic+0.012
 LIST+0.011
phabet+0.011
��+0.011
Neg logit contributions
aan-0.012
xual-0.012
 laun-0.011
ista-0.011
 differe-0.010
Token,
Token ID11
Feature activation-1.82e-01

Pos logit contributions
 mend+0.145
genic+0.132
 LIST+0.125
phabet+0.124
��+0.124
Neg logit contributions
aan-0.137
xual-0.135
 laun-0.127
ista-0.126
 differe-0.114
Token failed
Token ID4054
Feature activation+3.34e-01

Pos logit contributions
 mend+0.129
genic+0.117
 LIST+0.110
phabet+0.108
 swe+0.108
Neg logit contributions
aan-0.138
xual-0.137
 laun-0.128
ista-0.128
asma-0.116
Token to
Token ID284
Feature activation-1.42e-01

Pos logit contributions
aan+0.235
xual+0.234
 laun+0.218
ista+0.216
 differe+0.196
Neg logit contributions
 mend-0.257
genic-0.233
 LIST-0.220
phabet-0.218
��-0.217
Token contain
Token ID3994
Feature activation-2.84e-01

Pos logit contributions
 mend+0.097
genic+0.087
 LIST+0.081
phabet+0.080
 swe+0.080
Neg logit contributions
aan-0.112
xual-0.112
 laun-0.105
ista-0.105
asma-0.095
Token an
Token ID281
Feature activation-2.59e-01

Pos logit contributions
 mend+0.215
genic+0.194
 LIST+0.182
phabet+0.181
 swe+0.180
Neg logit contributions
aan-0.206
xual-0.204
 laun-0.190
ista-0.188
asma-0.171
Token inherited
Token ID19552
Feature activation+2.56e+00

Pos logit contributions
 mend+0.193
genic+0.174
 LIST+0.164
phabet+0.162
��+0.162
Neg logit contributions
aan-0.190
xual-0.189
 laun-0.176
ista-0.174
asma-0.158
Token hyper
Token ID8718
Feature activation+5.71e-01

Pos logit contributions
aan+1.878
xual+1.862
 laun+1.751
ista+1.749
 differe+1.581
Neg logit contributions
 mend-1.877
genic-1.709
 LIST-1.607
phabet-1.602
��-1.601
Tokenin
Token ID259
Feature activation-1.08e-01

Pos logit contributions
aan+0.433
xual+0.429
 laun+0.403
ista+0.402
 differe+0.365
Neg logit contributions
 mend-0.407
genic-0.368
 LIST-0.346
phabet-0.343
��-0.343
Tokenflation
Token ID33521
Feature activation-2.23e-02

Pos logit contributions
 mend+0.065
genic+0.057
��+0.054
 LIST+0.053
phabet+0.053
Neg logit contributions
aan-0.093
xual-0.093
ista-0.088
 laun-0.087
asma-0.081
Tokenary
Token ID560
Feature activation+8.20e-01

Pos logit contributions
 mend+0.017
genic+0.015
 LIST+0.014
 swe+0.014
phabet+0.014
Neg logit contributions
aan-0.016
xual-0.016
 laun-0.015
ista-0.014
 differe-0.013
Token crisis
Token ID4902
Feature activation-5.83e-01

Pos logit contributions
aan+0.602
xual+0.601
 laun+0.558
ista+0.547
 differe+0.500
Neg logit contributions
 mend-0.615
genic-0.553
 LIST-0.520
 swe-0.516
phabet-0.508
Token who
Token ID508
Feature activation+7.25e-01

Pos logit contributions
aan+0.285
xual+0.283
 laun+0.262
ista+0.261
 differe+0.235
Neg logit contributions
 mend-0.329
genic-0.299
 LIST-0.283
phabet-0.280
 swe-0.278
Token have
Token ID423
Feature activation+9.51e-01

Pos logit contributions
aan+0.529
xual+0.525
 laun+0.490
ista+0.488
 differe+0.442
Neg logit contributions
 mend-0.539
genic-0.488
 LIST-0.459
phabet-0.456
 swe-0.453
Token kept
Token ID4030
Feature activation+1.41e+00

Pos logit contributions
aan+0.748
xual+0.742
 laun+0.700
ista+0.697
 differe+0.636
Neg logit contributions
 mend-0.650
genic-0.586
 LIST-0.548
phabet-0.545
��-0.544
Token up
Token ID510
Feature activation+9.89e-01

Pos logit contributions
aan+1.093
xual+1.082
 laun+1.023
ista+1.020
 differe+0.929
Neg logit contributions
 mend-0.965
genic-0.877
��-0.823
 LIST-0.821
phabet-0.821
Token their
Token ID511
Feature activation+2.43e-01

Pos logit contributions
aan+0.776
xual+0.770
 laun+0.725
ista+0.722
 differe+0.659
Neg logit contributions
 mend-0.678
genic-0.611
 LIST-0.572
phabet-0.568
��-0.566
Token criticism
Token ID7734
Feature activation+2.56e+00

Pos logit contributions
aan+0.190
xual+0.189
 laun+0.177
ista+0.176
 differe+0.161
Neg logit contributions
 mend-0.169
genic-0.151
 LIST-0.142
phabet-0.140
��-0.140
Token of
Token ID286
Feature activation+4.28e-01

Pos logit contributions
aan+1.711
xual+1.707
ista+1.584
 laun+1.584
 differe+1.416
Neg logit contributions
 mend-2.036
genic-1.873
��-1.771
 LIST-1.761
phabet-1.760
Token the
Token ID262
Feature activation+1.98e-01

Pos logit contributions
aan+0.318
xual+0.315
 laun+0.295
ista+0.293
asma+0.266
Neg logit contributions
 mend-0.315
genic-0.283
 LIST-0.266
phabet-0.264
 swe-0.263
Token election
Token ID3071
Feature activation+1.47e-01

Pos logit contributions
aan+0.151
xual+0.150
 laun+0.140
ista+0.139
asma+0.127
Neg logit contributions
 mend-0.141
genic-0.127
 LIST-0.119
phabet-0.118
 swe-0.117
Token and
Token ID290
Feature activation+1.71e-01

Pos logit contributions
aan+0.111
xual+0.110
 laun+0.103
ista+0.102
asma+0.094
Neg logit contributions
 mend-0.106
genic-0.095
 LIST-0.090
phabet-0.089
 swe-0.089
Token the
Token ID262
Feature activation+1.46e-01

Pos logit contributions
aan+0.133
xual+0.132
 laun+0.124
ista+0.123
asma+0.113
Neg logit contributions
 mend-0.120
genic-0.107
 LIST-0.100
phabet-0.099
 swe-0.099
Token containing
Token ID7268
Feature activation-3.21e-01

Pos logit contributions
aan+1.028
xual+1.019
 laun+0.953
ista+0.949
 differe+0.855
Neg logit contributions
 mend-1.116
genic-1.017
 LIST-0.958
phabet-0.953
��-0.951
Token highly
Token ID4047
Feature activation+6.22e-01

Pos logit contributions
 mend+0.221
genic+0.199
 LIST+0.186
phabet+0.185
��+0.185
Neg logit contributions
aan-0.251
xual-0.249
 laun-0.235
ista-0.234
 differe-0.214
Token radioactive
Token ID25521
Feature activation-4.05e-01

Pos logit contributions
aan+0.552
xual+0.548
 laun+0.521
ista+0.519
 differe+0.479
Neg logit contributions
 mend-0.362
genic-0.320
 LIST-0.295
phabet-0.293
��-0.292
Token spent
Token ID3377
Feature activation+2.92e-01

Pos logit contributions
 mend+0.262
genic+0.235
 LIST+0.219
phabet+0.217
��+0.216
Neg logit contributions
aan-0.334
xual-0.331
 laun-0.313
ista-0.312
 differe-0.286
Token fuel
Token ID5252
Feature activation+4.23e-01

Pos logit contributions
aan+0.243
xual+0.240
 laun+0.230
ista+0.229
asma+0.211
Neg logit contributions
 mend-0.184
genic-0.165
��-0.155
 LIST-0.154
phabet-0.153
Token rods
Token ID32858
Feature activation+2.56e+00

Pos logit contributions
aan+0.314
xual+0.312
 laun+0.292
ista+0.291
 differe+0.264
Neg logit contributions
 mend-0.308
genic-0.279
 LIST-0.262
phabet-0.260
��-0.259
Token at
Token ID379
Feature activation+4.21e-01

Pos logit contributions
aan+1.830
xual+1.803
 laun+1.699
ista+1.691
 differe+1.526
Neg logit contributions
 mend-1.919
genic-1.746
phabet-1.651
��-1.651
 LIST-1.643
Token the
Token ID262
Feature activation+4.22e-01

Pos logit contributions
aan+0.313
xual+0.309
 laun+0.292
ista+0.290
 differe+0.263
Neg logit contributions
 mend-0.305
genic-0.277
 LIST-0.260
phabet-0.260
��-0.259
Token stricken
Token ID47455
Feature activation+6.40e-01

Pos logit contributions
aan+0.339
xual+0.335
 laun+0.318
ista+0.316
asma+0.289
Neg logit contributions
 mend-0.280
genic-0.252
phabet-0.236
 LIST-0.235
��-0.235
Token plant
Token ID4618
Feature activation+6.99e-01

Pos logit contributions
aan+0.578
xual+0.574
 laun+0.546
ista+0.544
 differe+0.503
Neg logit contributions
 mend-0.363
genic-0.319
 LIST-0.294
phabet-0.292
��-0.291
Token had
Token ID550
Feature activation+1.75e-01

Pos logit contributions
aan+0.514
xual+0.511
 laun+0.478
ista+0.476
 differe+0.432
Neg logit contributions
 mend-0.514
genic-0.466
 LIST-0.439
phabet-0.434
 swe-0.432
Token same
Token ID976
Feature activation-7.58e-02

Pos logit contributions
 mend+0.072
genic+0.065
 LIST+0.060
phabet+0.060
��+0.060
Neg logit contributions
aan-0.088
xual-0.087
 laun-0.082
ista-0.082
 differe-0.075
Token dinosaurs
Token ID27199
Feature activation-6.26e-01

Pos logit contributions
 mend+0.057
genic+0.052
 LIST+0.048
phabet+0.048
��+0.048
Neg logit contributions
aan-0.055
xual-0.054
 laun-0.051
ista-0.051
asma-0.046
Token.
Token ID13
Feature activation+1.10e-01

Pos logit contributions
 mend+0.477
genic+0.435
 LIST+0.410
phabet+0.405
��+0.405
Neg logit contributions
aan-0.443
xual-0.439
 laun-0.412
ista-0.408
 differe-0.369
Token "
Token ID366
Feature activation-3.57e-01

Pos logit contributions
aan+0.093
xual+0.092
 laun+0.087
ista+0.087
 differe+0.080
Neg logit contributions
 mend-0.070
genic-0.062
 LIST-0.058
phabet-0.057
��-0.057
Token Comp
Token ID3082
Feature activation-1.07e+00

Pos logit contributions
 mend+0.307
genic+0.283
 LIST+0.270
phabet+0.266
��+0.265
Neg logit contributions
aan-0.218
xual-0.217
 laun-0.201
ista-0.198
 differe-0.176
Tokenaring
Token ID1723
Feature activation+2.56e+00

Pos logit contributions
 mend+0.696
genic+0.626
 LIST+0.582
phabet+0.578
 swe+0.569
Neg logit contributions
aan-0.866
xual-0.865
 laun-0.819
ista-0.813
 differe-0.746
Token such
Token ID884
Feature activation-5.23e-01

Pos logit contributions
aan+2.021
xual+2.008
 laun+1.895
ista+1.878
 differe+1.742
Neg logit contributions
 mend-1.697
genic-1.533
phabet-1.450
��-1.448
 LIST-1.431
Token different
Token ID1180
Feature activation+4.29e-01

Pos logit contributions
 mend+0.319
genic+0.283
 LIST+0.262
phabet+0.258
��+0.257
Neg logit contributions
aan-0.452
xual-0.450
 laun-0.425
ista-0.424
asma-0.389
Token molecules
Token ID17745
Feature activation-5.16e-01

Pos logit contributions
aan+0.353
xual+0.350
 laun+0.331
ista+0.330
 differe+0.303
Neg logit contributions
 mend-0.277
genic-0.248
 LIST-0.231
phabet-0.229
��-0.229
Token as
Token ID355
Feature activation-3.86e-01

Pos logit contributions
 mend+0.387
genic+0.351
 LIST+0.331
phabet+0.327
 swe+0.326
Neg logit contributions
aan-0.373
xual-0.370
 laun-0.347
ista-0.345
asma-0.311
Token minerals
Token ID21622
Feature activation-1.62e-01

Pos logit contributions
 mend+0.248
genic+0.222
 LIST+0.206
phabet+0.206
��+0.206
Neg logit contributions
aan-0.319
xual-0.317
 laun-0.300
ista-0.298
 differe-0.274
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
aan+0.294
xual+0.292
 laun+0.274
ista+0.272
asma+0.246
Neg logit contributions
 mend-0.299
genic-0.270
 LIST-0.254
phabet-0.252
 swe-0.251
Token
Token ID198
Feature activation+4.86e-02

Pos logit contributions
 mend+0.102
genic+0.091
 LIST+0.086
phabet+0.084
��+0.084
Neg logit contributions
aan-0.128
xual-0.127
 laun-0.120
ista-0.119
 differe-0.110
Token
Token ID198
Feature activation+1.78e-02

Pos logit contributions
aan+0.038
xual+0.038
 laun+0.036
ista+0.035
 differe+0.032
Neg logit contributions
 mend-0.034
genic-0.031
 LIST-0.029
phabet-0.028
 swe-0.028
TokenLes
Token ID35882
Feature activation+9.95e-01

Pos logit contributions
xual+0.012
aan+0.012
 laun+0.011
ista+0.011
 differe+0.010
Neg logit contributions
 mend-0.014
genic-0.013
 LIST-0.012
phabet-0.012
joining-0.012
Token Imp
Token ID9855
Feature activation+1.59e-02

Pos logit contributions
aan+0.726
xual+0.716
ista+0.670
 laun+0.670
 differe+0.604
Neg logit contributions
 mend-0.735
genic-0.672
��-0.632
 LIST-0.631
phabet-0.629
Tokenrompt
Token ID45700
Feature activation+2.54e+00

Pos logit contributions
aan+0.013
xual+0.013
 laun+0.012
ista+0.012
 differe+0.011
Neg logit contributions
 mend-0.011
genic-0.010
 LIST-0.009
phabet-0.009
��-0.009
Tokenus
Token ID385
Feature activation-5.71e-01

Pos logit contributions
aan+2.189
xual+2.172
ista+2.058
 laun+2.051
asma+1.883
Neg logit contributions
 mend-1.546
genic-1.380
 LIST-1.276
��-1.271
phabet-1.269
Token de
Token ID390
Feature activation+9.14e-01

Pos logit contributions
 mend+0.411
genic+0.370
 LIST+0.349
phabet+0.345
��+0.343
Neg logit contributions
aan-0.430
xual-0.426
 laun-0.400
ista-0.397
 differe-0.361
Token Lac
Token ID23115
Feature activation+1.06e-01

Pos logit contributions
aan+0.750
xual+0.746
 laun+0.705
ista+0.702
 differe+0.646
Neg logit contributions
 mend-0.587
genic-0.530
phabet-0.493
��-0.491
 LIST-0.491
Tokenan
Token ID272
Feature activation-1.37e-01

Pos logit contributions
aan+0.065
xual+0.064
 laun+0.059
ista+0.059
 differe+0.052
Neg logit contributions
 mend-0.091
genic-0.084
 LIST-0.080
phabet-0.079
��-0.079
Token
Token ID198
Feature activation-4.15e-01

Pos logit contributions
 mend+0.099
genic+0.090
 LIST+0.084
phabet+0.084
��+0.083
Neg logit contributions
aan-0.102
xual-0.102
 laun-0.095
ista-0.095
 differe-0.086
Token while
Token ID981
Feature activation+3.62e-01

Pos logit contributions
aan+0.416
xual+0.412
 laun+0.383
ista+0.378
 differe+0.345
Neg logit contributions
 mend-0.459
genic-0.417
 LIST-0.395
 swe-0.391
phabet-0.388
Token watching
Token ID4964
Feature activation+4.90e-02

Pos logit contributions
aan+0.295
xual+0.293
 laun+0.275
ista+0.274
asma+0.251
Neg logit contributions
 mend-0.240
genic-0.213
 LIST-0.200
 swe-0.197
phabet-0.195
Token an
Token ID281
Feature activation+5.35e-01

Pos logit contributions
aan+0.038
xual+0.038
 laun+0.036
ista+0.035
 differe+0.032
Neg logit contributions
 mend-0.034
genic-0.031
 LIST-0.029
phabet-0.028
 swe-0.028
Token Orange
Token ID11942
Feature activation+1.33e+00

Pos logit contributions
aan+0.393
xual+0.390
 laun+0.365
ista+0.363
 differe+0.329
Neg logit contributions
 mend-0.396
genic-0.359
 LIST-0.338
phabet-0.334
��-0.333
Token Order
Token ID8284
Feature activation+1.18e+00

Pos logit contributions
aan+0.974
xual+0.963
ista+0.915
 laun+0.904
 differe+0.815
Neg logit contributions
 mend-0.976
genic-0.884
��-0.841
phabet-0.834
 LIST-0.832
Token walk
Token ID2513
Feature activation+2.53e+00

Pos logit contributions
aan+0.938
xual+0.931
 laun+0.877
ista+0.873
 differe+0.798
Neg logit contributions
 mend-0.802
genic-0.721
 LIST-0.674
phabet-0.669
��-0.666
Token in
Token ID287
Feature activation+4.20e-01

Pos logit contributions
aan+1.754
xual+1.723
ista+1.647
 laun+1.613
 differe+1.447
Neg logit contributions
 mend-1.940
genic-1.781
��-1.697
phabet-1.678
 LIST-1.674
Token Glasgow
Token ID23995
Feature activation+1.17e+00

Pos logit contributions
aan+0.319
xual+0.317
 laun+0.297
ista+0.295
 differe+0.269
Neg logit contributions
 mend-0.300
genic-0.271
 LIST-0.255
phabet-0.253
��-0.252
Token on
Token ID319
Feature activation+5.63e-01

Pos logit contributions
aan+0.785
xual+0.779
 laun+0.726
ista+0.724
 differe+0.647
Neg logit contributions
 mend-0.938
genic-0.860
 LIST-0.812
phabet-0.810
��-0.809
Token Saturday
Token ID3909
Feature activation+7.12e-01

Pos logit contributions
aan+0.453
xual+0.449
 laun+0.424
ista+0.422
 differe+0.386
Neg logit contributions
 mend-0.375
genic-0.337
 LIST-0.314
phabet-0.312
��-0.311
Token.
Token ID13
Feature activation-1.38e-03

Pos logit contributions
aan+0.491
xual+0.488
ista+0.454
 laun+0.453
asma+0.404
Neg logit contributions
 mend-0.555
genic-0.509
 LIST-0.479
phabet-0.478
��-0.478
Token incredible
Token ID8082
Feature activation-1.01e-01

Pos logit contributions
aan+0.224
xual+0.222
 laun+0.209
ista+0.208
 differe+0.189
Neg logit contributions
 mend-0.212
genic-0.192
 LIST-0.180
phabet-0.178
��-0.178
Token.
Token ID13
Feature activation-3.48e-01

Pos logit contributions
 mend+0.077
genic+0.070
 LIST+0.066
phabet+0.066
��+0.066
Neg logit contributions
aan-0.072
xual-0.071
 laun-0.067
ista-0.066
 differe-0.060
Token
Token ID198
Feature activation-3.39e-01

Pos logit contributions
 mend+0.238
genic+0.214
 LIST+0.201
phabet+0.199
��+0.198
Neg logit contributions
aan-0.274
xual-0.272
 laun-0.256
ista-0.255
 differe-0.233
Token
Token ID198
Feature activation-5.12e-01

Pos logit contributions
 mend+0.233
genic+0.210
 LIST+0.198
 swe+0.193
phabet+0.192
Neg logit contributions
xual-0.266
aan-0.265
 laun-0.252
ista-0.244
 differe-0.228
TokenBl
Token ID3629
Feature activation+1.23e-01

Pos logit contributions
 mend+0.424
genic+0.390
 LIST+0.369
phabet+0.367
joining+0.363
Neg logit contributions
xual-0.330
aan-0.326
 laun-0.307
ista-0.299
 differe-0.271
Tokenacks
Token ID4595
Feature activation-3.84e+00

Pos logit contributions
aan+0.088
xual+0.088
 laun+0.083
ista+0.081
 differe+0.074
Neg logit contributions
 mend-0.092
genic-0.083
 LIST-0.078
phabet-0.078
��-0.077
Token in
Token ID287
Feature activation-8.16e-02

Pos logit contributions
 mend+2.746
genic+2.456
 LIST+2.338
 swe+2.321
phabet+2.312
Neg logit contributions
xual-2.877
aan-2.849
 laun-2.682
ista-2.652
asma-2.438
Token Philly
Token ID30029
Feature activation+2.46e-01

Pos logit contributions
 mend+0.057
genic+0.052
 LIST+0.049
phabet+0.048
��+0.048
Neg logit contributions
aan-0.062
xual-0.062
 laun-0.058
ista-0.058
 differe-0.053
Token have
Token ID423
Feature activation+1.22e-01

Pos logit contributions
aan+0.187
xual+0.185
 laun+0.174
ista+0.173
 differe+0.158
Neg logit contributions
 mend-0.175
genic-0.159
 LIST-0.149
phabet-0.148
��-0.148
Token a
Token ID257
Feature activation-1.49e-01

Pos logit contributions
aan+0.094
xual+0.093
 laun+0.087
ista+0.087
 differe+0.079
Neg logit contributions
 mend-0.087
genic-0.078
 LIST-0.073
phabet-0.073
��-0.072
Token different
Token ID1180
Feature activation+9.84e-01

Pos logit contributions
 mend+0.110
genic+0.099
 LIST+0.094
 swe+0.093
phabet+0.093
Neg logit contributions
aan-0.110
xual-0.110
 laun-0.103
ista-0.102
asma-0.093
Token
Token ID20977
Feature activation+9.03e-01

Pos logit contributions
 mend+0.407
genic+0.369
 LIST+0.347
phabet+0.345
��+0.344
Neg logit contributions
aan-0.406
xual-0.403
 laun-0.378
ista-0.376
 differe-0.341
TokenCub
Token ID43632
Feature activation+1.64e-01

Pos logit contributions
aan+0.621
xual+0.613
ista+0.573
 laun+0.572
 differe+0.514
Neg logit contributions
 mend-0.706
genic-0.643
 LIST-0.611
��-0.609
phabet-0.607
Tokenans
Token ID504
Feature activation+5.20e-04

Pos logit contributions
xual+0.128
aan+0.127
 laun+0.121
ista+0.119
 differe+0.110
Neg logit contributions
 mend-0.112
genic-0.101
 LIST-0.094
phabet-0.093
 swe-0.092
Token (
Token ID357
Feature activation+1.31e-01

Pos logit contributions
aan+0.000
xual+0.000
 laun+0.000
ista+0.000
asma+0.000
Neg logit contributions
 mend-0.000
genic-0.000
 LIST-0.000
phabet-0.000
 swe-0.000
Tokenbl
Token ID2436
Feature activation-1.31e-01

Pos logit contributions
aan+0.094
xual+0.093
 laun+0.087
ista+0.087
 differe+0.078
Neg logit contributions
 mend-0.099
genic-0.090
 LIST-0.085
phabet-0.084
��-0.084
Tokenacks
Token ID4595
Feature activation-3.80e+00

Pos logit contributions
 mend+0.059
genic+0.050
 LIST+0.045
phabet+0.044
��+0.043
Neg logit contributions
aan-0.133
xual-0.133
 laun-0.127
ista-0.126
 differe-0.118
Token and
Token ID290
Feature activation-3.63e-02

Pos logit contributions
 mend+2.764
genic+2.500
 LIST+2.342
 swe+2.332
phabet+2.325
Neg logit contributions
aan-2.765
xual-2.747
 laun-2.556
ista-2.535
asma-2.325
Token mul
Token ID35971
Feature activation-9.50e-01

Pos logit contributions
 mend+0.024
genic+0.022
 LIST+0.020
phabet+0.020
��+0.020
Neg logit contributions
aan-0.029
xual-0.029
 laun-0.027
ista-0.027
 differe-0.025
Tokenatt
Token ID1078
Feature activation-1.86e-01

Pos logit contributions
 mend+0.784
genic+0.723
 LIST+0.682
phabet+0.675
joining+0.673
Neg logit contributions
xual-0.609
aan-0.608
 laun-0.566
ista-0.554
 differe-0.502
Tokenos
Token ID418
Feature activation-7.58e-01

Pos logit contributions
 mend+0.157
genic+0.144
 LIST+0.136
phabet+0.135
joining+0.134
Neg logit contributions
xual-0.117
aan-0.116
 laun-0.108
ista-0.105
 differe-0.096
Token),
Token ID828
Feature activation+4.99e-01

Pos logit contributions
 mend+0.601
genic+0.549
 LIST+0.519
phabet+0.515
 swe+0.511
Neg logit contributions
aan-0.515
xual-0.508
 laun-0.473
ista-0.469
asma-0.424
Token system
Token ID1080
Feature activation-3.54e-01

Pos logit contributions
 mend+0.353
genic+0.319
 LIST+0.301
phabet+0.298
 swe+0.297
Neg logit contributions
aan-0.380
xual-0.377
 laun-0.352
ista-0.350
asma-0.319
Token gets
Token ID3011
Feature activation-9.00e-01

Pos logit contributions
 mend+0.263
genic+0.239
 LIST+0.225
phabet+0.223
 swe+0.222
Neg logit contributions
aan-0.258
xual-0.255
 laun-0.239
ista-0.238
asma-0.215
Token me
Token ID502
Feature activation-6.52e-01

Pos logit contributions
 mend+0.622
genic+0.558
 LIST+0.526
 swe+0.526
phabet+0.519
Neg logit contributions
aan-0.704
xual-0.699
 laun-0.654
ista-0.650
asma-0.591
Token all
Token ID477
Feature activation-5.94e-01

Pos logit contributions
 mend+0.480
genic+0.433
 LIST+0.412
 swe+0.410
phabet+0.406
Neg logit contributions
aan-0.480
xual-0.477
 laun-0.444
ista-0.439
asma-0.400
Token tri
Token ID1333
Feature activation+1.67e-01

Pos logit contributions
 mend+0.446
genic+0.403
 LIST+0.382
 swe+0.379
phabet+0.378
Neg logit contributions
aan-0.432
xual-0.428
 laun-0.399
ista-0.396
asma-0.358
Tokenpped
Token ID1496
Feature activation-3.58e+00

Pos logit contributions
aan+0.148
xual+0.148
 laun+0.140
ista+0.139
 differe+0.129
Neg logit contributions
 mend-0.098
genic-0.086
 LIST-0.079
phabet-0.078
 swe-0.078
Token up
Token ID510
Feature activation-3.60e-02

Pos logit contributions
 mend+2.038
genic+1.788
 LIST+1.656
 swe+1.637
phabet+1.620
Neg logit contributions
aan-3.194
xual-3.180
 laun-3.025
ista-2.996
 differe-2.786
Token,
Token ID11
Feature activation-2.63e-01

Pos logit contributions
 mend+0.027
genic+0.025
 LIST+0.023
phabet+0.023
��+0.023
Neg logit contributions
aan-0.026
xual-0.025
 laun-0.024
ista-0.024
 differe-0.021
Token and
Token ID290
Feature activation-3.69e-01

Pos logit contributions
 mend+0.224
genic+0.206
 LIST+0.195
phabet+0.194
��+0.193
Neg logit contributions
aan-0.163
xual-0.162
 laun-0.150
ista-0.149
 differe-0.132
Token I
Token ID314
Feature activation+5.08e-02

Pos logit contributions
 mend+0.239
genic+0.213
 LIST+0.200
phabet+0.197
 swe+0.196
Neg logit contributions
aan-0.304
xual-0.302
 laun-0.285
ista-0.283
asma-0.260
Token end
Token ID886
Feature activation-5.07e-01

Pos logit contributions
aan+0.041
xual+0.040
 laun+0.038
ista+0.038
asma+0.035
Neg logit contributions
 mend-0.034
genic-0.030
 LIST-0.029
phabet-0.028
 swe-0.028
Token up
Token ID510
Feature activation-4.53e-01

Pos logit contributions
 mend+0.578
genic+0.516
 LIST+0.486
 swe+0.486
joining+0.466
Neg logit contributions
xual-0.596
aan-0.595
 laun-0.548
ista-0.544
asma-0.500
Token to
Token ID284
Feature activation-2.32e-01

Pos logit contributions
 mend+0.351
genic+0.319
 LIST+0.302
 swe+0.298
phabet+0.293
Neg logit contributions
aan-0.318
xual-0.316
 laun-0.294
ista-0.290
asma-0.264
Token leave
Token ID2666
Feature activation-1.46e-01

Pos logit contributions
 mend+0.147
genic+0.128
 LIST+0.121
 swe+0.118
phabet+0.117
Neg logit contributions
aan-0.198
xual-0.196
 laun-0.185
ista-0.184
asma-0.170
Token and
Token ID290
Feature activation-4.22e-01

Pos logit contributions
 mend+0.107
genic+0.096
 LIST+0.091
 swe+0.090
phabet+0.089
Neg logit contributions
aan-0.109
xual-0.109
 laun-0.101
ista-0.100
 differe-0.092
Token tri
Token ID1333
Feature activation-1.91e-01

Pos logit contributions
 mend+0.310
genic+0.280
 LIST+0.264
phabet+0.260
 swe+0.260
Neg logit contributions
aan-0.311
xual-0.310
 laun-0.289
ista-0.288
asma-0.261
Tokenpped
Token ID1496
Feature activation-3.30e+00

Pos logit contributions
 mend+0.100
genic+0.088
 LIST+0.079
 swe+0.077
phabet+0.076
Neg logit contributions
xual-0.180
aan-0.179
 laun-0.171
ista-0.168
 differe-0.158
Token,
Token ID11
Feature activation-1.98e-01

Pos logit contributions
 mend+2.406
genic+2.120
 LIST+2.052
 swe+2.028
 enlist+1.947
Neg logit contributions
xual-2.475
aan-2.444
 laun-2.272
ista-2.241
asma-2.071
Token falling
Token ID7463
Feature activation-9.89e-01

Pos logit contributions
 mend+0.138
genic+0.123
 LIST+0.116
 swe+0.115
phabet+0.114
Neg logit contributions
aan-0.154
xual-0.154
 laun-0.144
ista-0.143
asma-0.131
Token backwards
Token ID16196
Feature activation-1.55e+00

Pos logit contributions
 mend+0.696
genic+0.622
 LIST+0.590
 swe+0.579
phabet+0.570
Neg logit contributions
aan-0.763
xual-0.761
 laun-0.712
ista-0.702
 differe-0.644
Token on
Token ID319
Feature activation-7.69e-01

Pos logit contributions
 mend+1.128
genic+1.015
 LIST+0.963
 swe+0.951
phabet+0.927
Neg logit contributions
xual-1.151
aan-1.151
 laun-1.072
ista-1.056
asma-0.971
Token to
Token ID284
Feature activation-5.83e-01

Pos logit contributions
 mend+0.527
genic+0.473
 LIST+0.443
phabet+0.438
 swe+0.437
Neg logit contributions
aan-0.606
xual-0.601
 laun-0.566
ista-0.562
 differe-0.515
Token Gand
Token ID17727
Feature activation-6.47e-01

Pos logit contributions
 mend+0.275
genic+0.244
 LIST+0.229
phabet+0.224
��+0.223
Neg logit contributions
aan-0.365
xual-0.364
 laun-0.343
ista-0.341
 differe-0.314
Tokenalf
Token ID1604
Feature activation+6.43e-01

Pos logit contributions
 mend+0.487
genic+0.444
 LIST+0.417
phabet+0.415
��+0.412
Neg logit contributions
aan-0.463
xual-0.460
 laun-0.431
ista-0.427
 differe-0.388
Token reports
Token ID3136
Feature activation+3.91e-01

Pos logit contributions
aan+0.469
xual+0.465
 laun+0.435
ista+0.433
 differe+0.392
Neg logit contributions
 mend-0.477
genic-0.433
 LIST-0.407
phabet-0.405
��-0.404
Token that
Token ID326
Feature activation+2.40e-01

Pos logit contributions
aan+0.281
xual+0.278
 laun+0.261
ista+0.260
 differe+0.235
Neg logit contributions
 mend-0.294
genic-0.268
 LIST-0.252
phabet-0.251
��-0.250
Token Sar
Token ID6866
Feature activation-1.03e-01

Pos logit contributions
aan+0.179
xual+0.178
 laun+0.167
ista+0.166
 differe+0.151
Neg logit contributions
 mend-0.174
genic-0.158
 LIST-0.148
phabet-0.148
��-0.148
Tokenuman
Token ID3778
Feature activation-3.28e+00

Pos logit contributions
 mend+0.051
genic+0.043
��+0.041
 LIST+0.040
 swe+0.039
Neg logit contributions
aan-0.102
xual-0.099
ista-0.096
 laun-0.095
asma-0.089
Token has
Token ID468
Feature activation+1.01e-01

Pos logit contributions
 mend+2.368
genic+2.114
 LIST+2.001
 swe+1.975
phabet+1.947
Neg logit contributions
xual-2.456
aan-2.434
 laun-2.276
ista-2.245
asma-2.062
Token betrayed
Token ID26281
Feature activation-3.37e-01

Pos logit contributions
aan+0.077
xual+0.076
 laun+0.072
ista+0.071
 differe+0.065
Neg logit contributions
 mend-0.071
genic-0.064
 LIST-0.060
phabet-0.059
 swe-0.059
Token them
Token ID606
Feature activation-4.04e-01

Pos logit contributions
 mend+0.252
genic+0.228
 LIST+0.215
phabet+0.213
��+0.212
Neg logit contributions
aan-0.244
xual-0.242
 laun-0.227
ista-0.225
 differe-0.204
Token and
Token ID290
Feature activation-2.16e-01

Pos logit contributions
 mend+0.311
genic+0.283
 LIST+0.267
phabet+0.266
��+0.265
Neg logit contributions
aan-0.283
xual-0.280
 laun-0.262
ista-0.261
 differe-0.235
Token is
Token ID318
Feature activation-1.65e-01

Pos logit contributions
 mend+0.152
genic+0.137
 LIST+0.129
phabet+0.128
��+0.127
Neg logit contributions
aan-0.165
xual-0.164
 laun-0.154
ista-0.153
 differe-0.140
Token,
Token ID11
Feature activation+1.83e-02

Pos logit contributions
 mend+0.428
genic+0.388
 LIST+0.369
 swe+0.364
phabet+0.362
Neg logit contributions
aan-0.380
xual-0.377
 laun-0.352
ista-0.348
asma-0.316
Token when
Token ID618
Feature activation+3.80e-01

Pos logit contributions
aan+0.013
xual+0.013
 laun+0.012
ista+0.012
 differe+0.010
Neg logit contributions
 mend-0.014
genic-0.013
 LIST-0.012
phabet-0.012
��-0.012
Token Tor
Token ID4022
Feature activation+1.75e-01

Pos logit contributions
aan+0.290
xual+0.290
 laun+0.271
ista+0.269
asma+0.246
Neg logit contributions
 mend-0.269
genic-0.240
 LIST-0.227
 swe-0.224
phabet-0.223
Tokeni
Token ID72
Feature activation-1.77e+00

Pos logit contributions
aan+0.105
xual+0.105
 laun+0.097
ista+0.095
 differe+0.085
Neg logit contributions
 mend-0.152
genic-0.141
 LIST-0.134
phabet-0.133
 swe-0.132
Token tri
Token ID1333
Feature activation+2.69e-01

Pos logit contributions
 mend+1.320
genic+1.167
 swe+1.128
 LIST+1.112
 enlist+1.069
Neg logit contributions
xual-1.307
aan-1.293
 laun-1.184
ista-1.177
asma-1.087
Tokenpped
Token ID1496
Feature activation-3.21e+00

Pos logit contributions
xual+0.252
aan+0.251
 laun+0.238
ista+0.235
 differe+0.220
Neg logit contributions
 mend-0.144
genic-0.126
 LIST-0.115
 swe-0.112
phabet-0.111
Token and
Token ID290
Feature activation-2.05e-01

Pos logit contributions
 mend+2.441
genic+2.184
 LIST+2.098
 swe+2.080
 enlist+2.010
Neg logit contributions
xual-2.310
aan-2.275
 laun-2.107
ista-2.067
asma-1.898
Token fell
Token ID3214
Feature activation-5.93e-01

Pos logit contributions
 mend+0.161
genic+0.145
 LIST+0.138
 swe+0.137
phabet+0.134
Neg logit contributions
xual-0.144
aan-0.143
 laun-0.131
ista-0.130
asma-0.119
Token onto
Token ID4291
Feature activation-1.01e+00

Pos logit contributions
 mend+0.449
genic+0.405
 LIST+0.386
 swe+0.379
phabet+0.375
Neg logit contributions
aan-0.426
xual-0.424
 laun-0.395
ista-0.391
asma-0.355
Token a
Token ID257
Feature activation-5.75e-01

Pos logit contributions
 mend+0.689
genic+0.615
 LIST+0.577
 swe+0.572
phabet+0.569
Neg logit contributions
aan-0.798
xual-0.798
 laun-0.747
ista-0.741
 differe-0.678
Token hot
Token ID3024
Feature activation+2.83e-01

Pos logit contributions
 mend+0.397
genic+0.353
 LIST+0.332
 swe+0.329
phabet+0.327
Neg logit contributions
aan-0.454
xual-0.453
 laun-0.423
ista-0.421
asma-0.384
Token forehead
Token ID22645
Feature activation-5.69e-01

Pos logit contributions
aan+0.143
xual+0.142
 laun+0.133
ista+0.132
 differe+0.121
Neg logit contributions
 mend-0.131
genic-0.118
 LIST-0.111
 swe-0.110
phabet-0.109
Token a
Token ID257
Feature activation-2.03e-01

Pos logit contributions
 mend+0.403
genic+0.363
 LIST+0.339
 swe+0.336
phabet+0.334
Neg logit contributions
aan-0.435
xual-0.434
 laun-0.405
ista-0.405
 differe-0.368
Token butt
Token ID8530
Feature activation-5.85e-01

Pos logit contributions
 mend+0.144
genic+0.129
 LIST+0.122
phabet+0.120
 swe+0.120
Neg logit contributions
aan-0.155
xual-0.154
 laun-0.144
ista-0.144
asma-0.131
Token-
Token ID12
Feature activation-4.27e-01

Pos logit contributions
 mend+0.441
genic+0.400
 LIST+0.377
phabet+0.373
��+0.372
Neg logit contributions
aan-0.420
xual-0.418
 laun-0.390
ista-0.387
 differe-0.351
Tokencr
Token ID6098
Feature activation+3.97e-01

Pos logit contributions
 mend+0.309
genic+0.280
 LIST+0.263
phabet+0.261
��+0.259
Neg logit contributions
aan-0.320
xual-0.318
 laun-0.298
ista-0.296
 differe-0.270
Tokenack
Token ID441
Feature activation-3.17e+00

Pos logit contributions
aan+0.374
xual+0.371
 laun+0.353
ista+0.352
asma+0.326
Neg logit contributions
 mend-0.210
genic-0.183
 LIST-0.167
phabet-0.165
��-0.165
Token with
Token ID351
Feature activation-2.79e-01

Pos logit contributions
 mend+2.344
genic+2.111
 LIST+2.001
 swe+1.983
phabet+1.962
Neg logit contributions
aan-2.290
xual-2.287
 laun-2.132
ista-2.100
asma-1.933
Token an
Token ID281
Feature activation-2.73e-01

Pos logit contributions
 mend+0.190
genic+0.171
 LIST+0.160
phabet+0.158
��+0.158
Neg logit contributions
aan-0.220
xual-0.219
 laun-0.206
ista-0.205
 differe-0.187
Token ax
Token ID7877
Feature activation+8.65e-01

Pos logit contributions
 mend+0.195
genic+0.176
 LIST+0.165
phabet+0.164
��+0.163
Neg logit contributions
aan-0.207
xual-0.205
 laun-0.192
ista-0.191
 differe-0.174
Token:
Token ID25
Feature activation-7.85e-02

Pos logit contributions
aan+0.606
xual+0.601
 laun+0.561
ista+0.559
 differe+0.503
Neg logit contributions
 mend-0.666
genic-0.607
 LIST-0.573
phabet-0.569
��-0.568
Token a
Token ID257
Feature activation+6.98e-04

Pos logit contributions
 mend+0.056
genic+0.050
 LIST+0.047
phabet+0.047
��+0.046
Neg logit contributions
aan-0.060
xual-0.059
 laun-0.056
ista-0.056
 differe-0.051
Token case
Token ID1339
Feature activation-3.25e-01

Pos logit contributions
aan+0.190
xual+0.187
 laun+0.176
ista+0.176
 differe+0.159
Neg logit contributions
 mend-0.183
genic-0.166
phabet-0.156
��-0.155
 LIST-0.155
Token scenario
Token ID8883
Feature activation+6.64e-01

Pos logit contributions
 mend+0.284
genic+0.261
 LIST+0.248
phabet+0.247
��+0.246
Neg logit contributions
aan-0.195
xual-0.193
 laun-0.178
ista-0.177
 differe-0.156
Token,
Token ID11
Feature activation+1.51e-01

Pos logit contributions
aan+0.477
xual+0.471
ista+0.442
 laun+0.441
 differe+0.398
Neg logit contributions
 mend-0.498
genic-0.455
��-0.429
phabet-0.427
 LIST-0.427
Token it
Token ID340
Feature activation+1.35e-01

Pos logit contributions
aan+0.113
xual+0.112
 laun+0.105
ista+0.105
 differe+0.095
Neg logit contributions
 mend-0.110
genic-0.099
 LIST-0.093
phabet-0.092
��-0.092
Token ro
Token ID686
Feature activation-3.82e-01

Pos logit contributions
aan+0.094
xual+0.094
 laun+0.088
ista+0.087
 differe+0.079
Neg logit contributions
 mend-0.103
genic-0.094
 LIST-0.089
phabet-0.088
��-0.088
Tokents
Token ID912
Feature activation-3.11e+00

Pos logit contributions
 mend+0.258
genic+0.233
 LIST+0.217
phabet+0.215
 swe+0.213
Neg logit contributions
aan-0.303
xual-0.300
 laun-0.284
ista-0.281
 differe-0.259
Token off
Token ID572
Feature activation+2.71e-01

Pos logit contributions
 mend+2.212
genic+1.968
 LIST+1.856
 swe+1.824
phabet+1.815
Neg logit contributions
aan-2.372
xual-2.354
 laun-2.215
ista-2.197
asma-2.015
Token my
Token ID616
Feature activation-5.66e-01

Pos logit contributions
aan+0.195
xual+0.193
 laun+0.181
ista+0.180
 differe+0.163
Neg logit contributions
 mend-0.205
genic-0.186
 LIST-0.176
phabet-0.174
 swe-0.173
Token penis
Token ID16360
Feature activation-7.09e-01

Pos logit contributions
 mend+0.389
genic+0.349
 LIST+0.327
phabet+0.323
 swe+0.322
Neg logit contributions
aan-0.445
xual-0.442
 laun-0.415
ista-0.414
 differe-0.378
Token and
Token ID290
Feature activation-1.46e-01

Pos logit contributions
 mend+0.546
genic+0.497
 LIST+0.469
phabet+0.463
 swe+0.462
Neg logit contributions
aan-0.499
xual-0.495
 laun-0.462
ista-0.459
 differe-0.414
Token then
Token ID788
Feature activation-3.15e-01

Pos logit contributions
 mend+0.097
genic+0.087
 LIST+0.081
phabet+0.080
��+0.080
Neg logit contributions
aan-0.118
xual-0.117
 laun-0.111
ista-0.110
 differe-0.101
Token and
Token ID290
Feature activation-1.88e-01

Pos logit contributions
 mend+1.311
genic+1.190
 LIST+1.128
phabet+1.114
 swe+1.110
Neg logit contributions
aan-1.181
xual-1.175
 laun-1.095
ista-1.081
asma-0.982
Token stem
Token ID10717
Feature activation-3.15e-01

Pos logit contributions
 mend+0.128
genic+0.115
 LIST+0.108
phabet+0.107
��+0.107
Neg logit contributions
aan-0.148
xual-0.147
 laun-0.138
ista-0.138
 differe-0.126
Token.
Token ID13
Feature activation-5.37e-01

Pos logit contributions
 mend+0.240
genic+0.218
 LIST+0.206
phabet+0.204
��+0.203
Neg logit contributions
aan-0.224
xual-0.223
 laun-0.208
ista-0.207
asma-0.187
Token View
Token ID3582
Feature activation-5.38e-01

Pos logit contributions
 mend+0.349
genic+0.312
 LIST+0.292
phabet+0.288
��+0.287
Neg logit contributions
aan-0.442
xual-0.439
 laun-0.414
ista-0.412
 differe-0.378
Token Full
Token ID6462
Feature activation-3.73e-01

Pos logit contributions
 mend+0.379
genic+0.343
 LIST+0.321
phabet+0.319
��+0.317
Neg logit contributions
aan-0.412
xual-0.409
 laun-0.384
ista-0.382
 differe-0.348
Token Caption
Token ID11260
Feature activation-3.09e+00

Pos logit contributions
 mend+0.279
genic+0.254
 LIST+0.241
phabet+0.238
��+0.237
Neg logit contributions
aan-0.269
xual-0.267
 laun-0.251
ista-0.249
 differe-0.225
Token Adam
Token ID7244
Feature activation-7.78e-01

Pos logit contributions
 mend+2.338
 LIST+2.118
genic+2.116
phabet+2.017
 swe+1.965
Neg logit contributions
xual-2.204
aan-2.170
 laun-2.071
ista-1.964
asma-1.862
Token Glenn
Token ID17551
Feature activation-1.39e+00

Pos logit contributions
 mend+0.550
genic+0.496
 LIST+0.465
phabet+0.462
��+0.459
Neg logit contributions
aan-0.594
xual-0.592
 laun-0.557
ista-0.551
 differe-0.503
Token
Token ID198
Feature activation-5.91e-01

Pos logit contributions
 mend+1.015
genic+0.915
 LIST+0.863
phabet+0.855
 swe+0.846
Neg logit contributions
aan-1.027
xual-1.025
 laun-0.961
ista-0.950
asma-0.864
Token
Token ID198
Feature activation-5.58e-01

Pos logit contributions
 mend+0.407
genic+0.367
 LIST+0.345
 swe+0.337
phabet+0.336
Neg logit contributions
xual-0.463
aan-0.462
 laun-0.439
ista-0.426
 differe-0.396
TokenC
Token ID34
Feature activation+4.43e-01

Pos logit contributions
 mend+0.453
genic+0.414
 LIST+0.392
phabet+0.390
��+0.386
Neg logit contributions
aan-0.369
xual-0.368
 laun-0.342
ista-0.338
 differe-0.304
TokenThen
Token ID6423
Feature activation-6.30e-01

Pos logit contributions
 mend+0.037
genic+0.034
 LIST+0.032
phabet+0.032
joining+0.032
Neg logit contributions
xual-0.029
aan-0.029
 laun-0.027
ista-0.027
 differe-0.024
Token Friday
Token ID3217
Feature activation+5.24e-01

Pos logit contributions
 mend+0.469
genic+0.421
 LIST+0.400
phabet+0.395
 swe+0.394
Neg logit contributions
aan-0.461
xual-0.461
 laun-0.428
ista-0.425
asma-0.386
Token,
Token ID11
Feature activation+2.91e-01

Pos logit contributions
aan+0.403
xual+0.400
 laun+0.375
ista+0.374
 differe+0.340
Neg logit contributions
 mend-0.369
genic-0.331
 LIST-0.312
phabet-0.309
 swe-0.307
Token Trump
Token ID1301
Feature activation+3.24e-01

Pos logit contributions
aan+0.218
xual+0.217
 laun+0.203
ista+0.202
 differe+0.183
Neg logit contributions
 mend-0.211
genic-0.190
 LIST-0.179
phabet-0.178
 swe-0.177
Token u
Token ID334
Feature activation-2.20e-01

Pos logit contributions
aan+0.241
xual+0.240
 laun+0.221
ista+0.221
asma+0.201
Neg logit contributions
 mend-0.239
genic-0.214
 LIST-0.203
 swe-0.201
phabet-0.199
Tokenpped
Token ID1496
Feature activation-3.08e+00

Pos logit contributions
 mend+0.121
genic+0.106
 LIST+0.097
phabet+0.095
 swe+0.094
Neg logit contributions
aan-0.203
xual-0.202
 laun-0.191
ista-0.191
 differe-0.177
Token the
Token ID262
Feature activation-1.43e-01

Pos logit contributions
 mend+2.074
genic+1.791
 LIST+1.728
 swe+1.706
phabet+1.692
Neg logit contributions
aan-2.490
xual-2.480
 laun-2.325
ista-2.297
 differe-2.113
Token ante
Token ID29692
Feature activation+5.97e-01

Pos logit contributions
 mend+0.128
genic+0.115
 LIST+0.111
 swe+0.110
phabet+0.109
Neg logit contributions
aan-0.085
xual-0.085
 laun-0.078
ista-0.076
 differe-0.068
Token by
Token ID416
Feature activation+1.10e-01

Pos logit contributions
aan+0.419
xual+0.416
 laun+0.388
ista+0.386
 differe+0.348
Neg logit contributions
 mend-0.460
genic-0.418
 LIST-0.395
phabet-0.391
 swe-0.389
Token warning
Token ID6509
Feature activation+6.33e-01

Pos logit contributions
aan+0.087
xual+0.087
 laun+0.081
ista+0.081
asma+0.074
Neg logit contributions
 mend-0.076
genic-0.067
 LIST-0.063
 swe-0.062
phabet-0.062
Token dark
Token ID3223
Feature activation+9.82e-01

Pos logit contributions
aan+0.485
xual+0.480
 laun+0.450
ista+0.448
 differe+0.407
Neg logit contributions
 mend-0.450
genic-0.405
 LIST-0.382
phabet-0.377
 swe-0.374
Token 41
Token ID6073
Feature activation-3.73e-01

Pos logit contributions
aan+0.000
xual+0.000
 laun+0.000
ista+0.000
 differe+0.000
Neg logit contributions
 mend-0.000
genic-0.000
 LIST-0.000
phabet-0.000
 swe-0.000
Token,
Token ID11
Feature activation+1.89e-01

Pos logit contributions
 mend+0.274
genic+0.248
 LIST+0.233
 swe+0.231
phabet+0.230
Neg logit contributions
aan-0.276
xual-0.275
 laun-0.257
ista-0.253
asma-0.233
Token he
Token ID339
Feature activation+7.66e-02

Pos logit contributions
aan+0.129
xual+0.128
 laun+0.119
ista+0.118
asma+0.107
Neg logit contributions
 mend-0.149
genic-0.136
 LIST-0.129
 swe-0.127
phabet-0.127
Token has
Token ID468
Feature activation+2.88e-01

Pos logit contributions
aan+0.056
xual+0.055
 laun+0.051
ista+0.051
 differe+0.046
Neg logit contributions
 mend-0.057
genic-0.052
 LIST-0.049
 swe-0.048
phabet-0.048
Token re
Token ID302
Feature activation+6.99e-01

Pos logit contributions
aan+0.220
xual+0.219
 laun+0.204
ista+0.204
asma+0.186
Neg logit contributions
 mend-0.204
genic-0.183
 LIST-0.173
 swe-0.171
phabet-0.169
Tokenpped
Token ID1496
Feature activation-2.98e+00

Pos logit contributions
aan+0.508
xual+0.504
 laun+0.472
ista+0.470
 differe+0.425
Neg logit contributions
 mend-0.520
genic-0.472
 LIST-0.444
phabet-0.441
��-0.440
Token the
Token ID262
Feature activation-9.50e-02

Pos logit contributions
 mend+2.169
genic+1.957
 LIST+1.843
 swe+1.837
phabet+1.817
Neg logit contributions
aan-2.209
xual-2.177
 laun-2.043
ista-2.022
asma-1.852
Token 305
Token ID32747
Feature activation-4.59e-01

Pos logit contributions
 mend+0.071
genic+0.064
 LIST+0.061
phabet+0.060
��+0.060
Neg logit contributions
aan-0.069
xual-0.069
 laun-0.064
ista-0.064
 differe-0.058
Token since
Token ID1201
Feature activation-1.15e+00

Pos logit contributions
 mend+0.322
genic+0.291
 LIST+0.273
phabet+0.270
��+0.268
Neg logit contributions
aan-0.354
xual-0.351
 laun-0.330
ista-0.327
 differe-0.299
Token 1973
Token ID15674
Feature activation-1.77e-01

Pos logit contributions
 mend+0.857
genic+0.774
 LIST+0.734
 swe+0.723
joining+0.721
Neg logit contributions
aan-0.842
xual-0.840
ista-0.770
 laun-0.770
asma-0.702
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
 mend+0.140
genic+0.128
 LIST+0.121
phabet+0.120
��+0.120
Neg logit contributions
aan-0.120
xual-0.119
 laun-0.111
ista-0.111
 differe-0.099
Token
Token ID198
Feature activation+2.92e-01

Pos logit contributions
 mend+0.041
genic+0.037
 LIST+0.034
phabet+0.034
��+0.034
Neg logit contributions
aan-0.045
xual-0.045
 laun-0.042
ista-0.042
 differe-0.038
Token
Token ID198
Feature activation-3.73e-02

Pos logit contributions
aan+0.223
xual+0.221
 laun+0.208
ista+0.206
 differe+0.188
Neg logit contributions
 mend-0.207
genic-0.187
 LIST-0.175
phabet-0.173
��-0.172
TokenWiki
Token ID32603
Feature activation+9.16e-02

Pos logit contributions
 mend+0.032
genic+0.029
 LIST+0.028
phabet+0.027
joining+0.027
Neg logit contributions
xual-0.023
aan-0.023
 laun-0.022
ista-0.021
 differe-0.019
TokenLeaks
Token ID17874
Feature activation-4.58e-01

Pos logit contributions
xual+0.019
aan+0.018
 laun+0.015
ista+0.013
 differe+0.008
Neg logit contributions
 mend-0.116
genic-0.110
 LIST-0.107
phabet-0.106
joining-0.106
Token pr
Token ID778
Feature activation-4.96e-01

Pos logit contributions
 mend+0.335
genic+0.301
 LIST+0.285
 swe+0.282
phabet+0.281
Neg logit contributions
aan-0.340
xual-0.338
 laun-0.315
ista-0.312
asma-0.284
Tokenides
Token ID1460
Feature activation-2.94e+00

Pos logit contributions
 mend+0.377
genic+0.344
 LIST+0.324
phabet+0.320
 swe+0.318
Neg logit contributions
aan-0.351
xual-0.351
 laun-0.328
ista-0.323
 differe-0.295
Token itself
Token ID2346
Feature activation+9.76e-01

Pos logit contributions
 mend+1.852
genic+1.609
 LIST+1.498
 swe+1.491
phabet+1.481
Neg logit contributions
aan-2.482
xual-2.460
 laun-2.326
ista-2.298
asma-2.132
Token as
Token ID355
Feature activation+2.03e-02

Pos logit contributions
aan+0.599
xual+0.591
 laun+0.550
ista+0.549
 differe+0.484
Neg logit contributions
 mend-0.832
genic-0.770
 LIST-0.731
��-0.726
phabet-0.725
Token a
Token ID257
Feature activation-3.56e-01

Pos logit contributions
aan+0.015
xual+0.015
 laun+0.014
ista+0.014
asma+0.013
Neg logit contributions
 mend-0.015
genic-0.013
 LIST-0.013
phabet-0.013
��-0.012
Token website
Token ID3052
Feature activation-7.75e-01

Pos logit contributions
 mend+0.255
genic+0.228
 LIST+0.214
 swe+0.212
phabet+0.211
Neg logit contributions
xual-0.271
aan-0.271
 laun-0.253
ista-0.250
asma-0.230
Token devoted
Token ID13378
Feature activation-1.18e+00

Pos logit contributions
 mend+0.577
genic+0.520
 LIST+0.490
 swe+0.485
phabet+0.484
Neg logit contributions
aan-0.566
xual-0.563
 laun-0.524
ista-0.520
asma-0.474
Tokenimages
Token ID17566
Feature activation+5.86e-02

Pos logit contributions
aan+0.409
xual+0.403
ista+0.379
 laun+0.371
asma+0.338
Neg logit contributions
 mend-0.447
genic-0.402
��-0.390
 LIST-0.381
phabet-0.379
Token(
Token ID7
Feature activation-9.87e-01

Pos logit contributions
aan+0.035
xual+0.035
 laun+0.032
ista+0.032
asma+0.028
Neg logit contributions
 mend-0.051
genic-0.047
 LIST-0.045
��-0.045
phabet-0.045
Tokenargs
Token ID22046
Feature activation+3.14e-01

Pos logit contributions
 mend+0.705
genic+0.639
 LIST+0.599
phabet+0.594
��+0.590
Neg logit contributions
aan-0.747
xual-0.742
 laun-0.698
ista-0.693
 differe-0.631
Token["
Token ID14692
Feature activation-1.77e+00

Pos logit contributions
aan+0.199
xual+0.197
 laun+0.183
ista+0.183
asma+0.162
Neg logit contributions
 mend-0.262
genic-0.241
 LIST-0.228
��-0.227
phabet-0.227
Tokenimages
Token ID17566
Feature activation-1.39e+00

Pos logit contributions
 mend+1.303
genic+1.196
 LIST+1.118
phabet+1.113
joining+1.099
Neg logit contributions
aan-1.286
xual-1.285
 laun-1.220
ista-1.195
 differe-1.086
Token"]
Token ID8973
Feature activation-2.91e+00

Pos logit contributions
 mend+1.033
genic+0.941
 LIST+0.887
phabet+0.880
��+0.873
Neg logit contributions
aan-1.008
xual-0.997
 laun-0.937
ista-0.926
 differe-0.841
Token):
Token ID2599
Feature activation-3.68e-01

Pos logit contributions
 mend+2.384
genic+2.176
 LIST+2.075
phabet+2.049
 swe+2.034
Neg logit contributions
aan-1.898
xual-1.893
 laun-1.753
ista-1.727
 differe-1.560
Token #
Token ID1303
Feature activation-1.15e+00

Pos logit contributions
 mend+0.242
genic+0.216
 LIST+0.202
phabet+0.200
��+0.198
Neg logit contributions
aan-0.300
xual-0.298
 laun-0.281
ista-0.279
 differe-0.256
Token load
Token ID3440
Feature activation-1.87e-01

Pos logit contributions
 mend+0.842
genic+0.758
 LIST+0.723
phabet+0.710
 swe+0.703
Neg logit contributions
aan-0.852
xual-0.848
 laun-0.793
ista-0.789
asma-0.714
Token the
Token ID262
Feature activation-4.84e-02

Pos logit contributions
 mend+0.124
genic+0.112
 LIST+0.104
phabet+0.103
��+0.103
Neg logit contributions
aan-0.151
xual-0.150
 laun-0.141
ista-0.141
 differe-0.129
Token image
Token ID2939
Feature activation-1.73e-01

Pos logit contributions
 mend+0.030
genic+0.027
 LIST+0.025
phabet+0.025
��+0.025
Neg logit contributions
aan-0.041
xual-0.041
 laun-0.039
ista-0.038
 differe-0.036
Token tests
Token ID5254
Feature activation+1.08e+00

Pos logit contributions
aan+0.330
xual+0.327
 laun+0.306
ista+0.304
 differe+0.275
Neg logit contributions
 mend-0.351
genic-0.319
 LIST-0.301
phabet-0.299
��-0.299
Token undertaken
Token ID21846
Feature activation+6.98e-01

Pos logit contributions
aan+0.753
xual+0.747
 laun+0.698
ista+0.693
 differe+0.625
Neg logit contributions
 mend-0.837
genic-0.763
 LIST-0.721
phabet-0.715
 swe-0.710
Token after
Token ID706
Feature activation-2.17e-01

Pos logit contributions
aan+0.503
xual+0.499
 laun+0.467
ista+0.464
 differe+0.420
Neg logit contributions
 mend-0.525
genic-0.476
 LIST-0.449
phabet-0.444
 swe-0.442
Token the
Token ID262
Feature activation+2.46e-01

Pos logit contributions
 mend+0.155
genic+0.140
 LIST+0.131
 swe+0.129
phabet+0.129
Neg logit contributions
aan-0.164
xual-0.164
 laun-0.153
ista-0.153
 differe-0.139
Token Gren
Token ID19674
Feature activation-3.44e-01

Pos logit contributions
aan+0.184
xual+0.183
 laun+0.172
ista+0.171
 differe+0.155
Neg logit contributions
 mend-0.178
genic-0.161
 LIST-0.151
phabet-0.150
��-0.150
Tokenfell
Token ID23299
Feature activation-2.91e+00

Pos logit contributions
 mend+0.132
genic+0.109
 LIST+0.095
phabet+0.092
 swe+0.091
Neg logit contributions
aan-0.373
xual-0.373
 laun-0.358
ista-0.354
 differe-0.334
Token Tower
Token ID8765
Feature activation+1.91e-03

Pos logit contributions
 mend+2.336
genic+2.118
 LIST+2.018
 swe+2.010
joining+1.969
Neg logit contributions
aan-1.947
xual-1.905
 laun-1.801
ista-1.742
 differe-1.590
Token fire
Token ID2046
Feature activation+2.16e-01

Pos logit contributions
aan+0.001
xual+0.001
 laun+0.001
ista+0.001
asma+0.001
Neg logit contributions
 mend-0.001
genic-0.001
 LIST-0.001
 swe-0.001
joining-0.001
Token,
Token ID11
Feature activation+2.11e-01

Pos logit contributions
aan+0.148
xual+0.146
 laun+0.137
ista+0.136
 differe+0.122
Neg logit contributions
 mend-0.170
genic-0.155
 LIST-0.146
phabet-0.145
 swe-0.144
Token according
Token ID1864
Feature activation-4.89e-01

Pos logit contributions
aan+0.149
xual+0.148
 laun+0.138
ista+0.138
 differe+0.124
Neg logit contributions
 mend-0.162
genic-0.148
 LIST-0.139
phabet-0.138
��-0.138
Token to
Token ID284
Feature activation+7.77e-01

Pos logit contributions
 mend+0.369
genic+0.335
 LIST+0.317
phabet+0.312
 swe+0.311
Neg logit contributions
aan-0.351
xual-0.349
 laun-0.328
ista-0.323
 differe-0.295
Token:
Token ID25
Feature activation-6.62e-02

Pos logit contributions
 mend+0.448
genic+0.406
 LIST+0.383
phabet+0.379
 swe+0.376
Neg logit contributions
aan-0.439
xual-0.436
 laun-0.408
ista-0.405
 differe-0.367
Token
Token ID198
Feature activation+1.24e-01

Pos logit contributions
 mend+0.051
genic+0.047
 LIST+0.044
phabet+0.044
��+0.044
Neg logit contributions
aan-0.046
xual-0.045
 laun-0.042
ista-0.042
asma-0.038
Token
Token ID198
Feature activation+1.76e-01

Pos logit contributions
xual+0.097
aan+0.097
 laun+0.092
ista+0.089
 differe+0.083
Neg logit contributions
 mend-0.086
genic-0.077
 LIST-0.073
 swe-0.071
phabet-0.071
Token"
Token ID1
Feature activation+3.53e-01

Pos logit contributions
xual+0.118
aan+0.117
 laun+0.109
ista+0.108
 differe+0.097
Neg logit contributions
 mend-0.141
genic-0.129
 LIST-0.123
phabet-0.121
joining-0.120
TokenBar
Token ID10374
Feature activation-1.62e+00

Pos logit contributions
xual+0.221
aan+0.221
 laun+0.204
ista+0.202
 differe+0.181
Neg logit contributions
 mend-0.298
genic-0.273
 LIST-0.260
phabet-0.257
joining-0.255
Tokenack
Token ID441
Feature activation-2.89e+00

Pos logit contributions
 mend+0.746
genic+0.641
 LIST+0.579
phabet+0.573
 swe+0.560
Neg logit contributions
xual-1.638
aan-1.634
 laun-1.561
ista-1.528
 differe-1.443
Token Obama
Token ID2486
Feature activation-4.87e-01

Pos logit contributions
 mend+1.914
genic+1.679
 LIST+1.610
 swe+1.567
phabet+1.561
Neg logit contributions
xual-2.348
aan-2.309
 laun-2.192
ista-2.139
asma-1.993
Token would
Token ID561
Feature activation-8.27e-01

Pos logit contributions
 mend+0.355
genic+0.318
 LIST+0.303
 swe+0.297
phabet+0.295
Neg logit contributions
aan-0.364
xual-0.362
 laun-0.337
ista-0.335
asma-0.307
Token be
Token ID307
Feature activation-1.36e+00

Pos logit contributions
 mend+0.584
genic+0.510
 LIST+0.489
 swe+0.479
 enlist+0.471
Neg logit contributions
aan-0.647
xual-0.642
 laun-0.598
ista-0.596
asma-0.548
Token the
Token ID262
Feature activation-5.57e-01

Pos logit contributions
 mend+0.971
genic+0.867
 LIST+0.818
 swe+0.812
joining+0.797
Neg logit contributions
xual-1.049
aan-1.041
 laun-0.971
ista-0.962
asma-0.884
Token most
Token ID749
Feature activation-3.70e-01

Pos logit contributions
 mend+0.424
genic+0.383
 LIST+0.361
 swe+0.359
phabet+0.352
Neg logit contributions
aan-0.399
xual-0.399
 laun-0.371
ista-0.367
asma-0.334
Token
Token ID198
Feature activation+4.09e-01

Pos logit contributions
aan+0.227
xual+0.226
 laun+0.213
ista+0.210
 differe+0.193
Neg logit contributions
 mend-0.208
genic-0.188
 LIST-0.177
phabet-0.174
 swe-0.173
TokenMore
Token ID5167
Feature activation+2.38e-01

Pos logit contributions
xual+0.259
aan+0.258
 laun+0.240
ista+0.235
 differe+0.212
Neg logit contributions
 mend-0.343
genic-0.316
 LIST-0.299
phabet-0.296
joining-0.294
Token research
Token ID2267
Feature activation+3.90e-01

Pos logit contributions
aan+0.188
xual+0.187
 laun+0.176
ista+0.175
 differe+0.160
Neg logit contributions
 mend-0.162
genic-0.146
 LIST-0.136
phabet-0.135
��-0.134
Token in
Token ID287
Feature activation+2.65e-01

Pos logit contributions
aan+0.291
xual+0.288
 laun+0.271
ista+0.270
 differe+0.245
Neg logit contributions
 mend-0.283
genic-0.256
 LIST-0.240
��-0.239
phabet-0.239
Token ul
Token ID14856
Feature activation+7.32e-01

Pos logit contributions
aan+0.206
xual+0.205
 laun+0.192
ista+0.192
 differe+0.174
Neg logit contributions
 mend-0.184
genic-0.166
 LIST-0.156
phabet-0.154
��-0.153
Tokencer
Token ID2189
Feature activation-2.87e+00

Pos logit contributions
aan+0.666
xual+0.660
ista+0.629
 laun+0.623
asma+0.580
Neg logit contributions
 mend-0.409
genic-0.356
 LIST-0.333
��-0.331
phabet-0.328
Token prevention
Token ID14196
Feature activation+8.82e-01

Pos logit contributions
 mend+2.295
genic+2.103
 LIST+1.937
phabet+1.918
 swe+1.915
Neg logit contributions
xual-1.953
aan-1.920
 laun-1.809
ista-1.787
asma-1.609
Token and
Token ID290
Feature activation+5.19e-01

Pos logit contributions
aan+0.640
xual+0.633
 laun+0.595
ista+0.594
 differe+0.536
Neg logit contributions
 mend-0.655
genic-0.596
phabet-0.561
��-0.560
 LIST-0.560
Token treatment
Token ID3513
Feature activation-8.06e-01

Pos logit contributions
aan+0.429
xual+0.426
 laun+0.403
ista+0.401
 differe+0.368
Neg logit contributions
 mend-0.333
genic-0.298
 LIST-0.277
phabet-0.277
��-0.276
Token in
Token ID287
Feature activation-5.91e-03

Pos logit contributions
 mend+0.570
genic+0.515
 LIST+0.483
phabet+0.477
��+0.474
Neg logit contributions
aan-0.616
xual-0.612
 laun-0.575
ista-0.571
 differe-0.521
Token le
Token ID443
Feature activation+2.64e-01

Pos logit contributions
 mend+0.004
genic+0.004
 LIST+0.004
phabet+0.004
��+0.004
Neg logit contributions
aan-0.004
xual-0.004
 laun-0.004
ista-0.004
 differe-0.004
Token you
Token ID345
Feature activation-4.80e-01

Pos logit contributions
aan+0.143
xual+0.142
 laun+0.134
ista+0.133
 differe+0.121
Neg logit contributions
 mend-0.133
genic-0.120
 LIST-0.113
phabet-0.112
��-0.112
Token can
Token ID460
Feature activation-8.80e-02

Pos logit contributions
 mend+0.317
genic+0.281
 LIST+0.264
 swe+0.260
phabet+0.257
Neg logit contributions
aan-0.393
xual-0.391
 laun-0.366
ista-0.365
asma-0.335
Token sacrifice
Token ID11728
Feature activation-2.89e-01

Pos logit contributions
 mend+0.059
genic+0.052
 LIST+0.049
 swe+0.048
phabet+0.048
Neg logit contributions
aan-0.071
xual-0.071
 laun-0.066
ista-0.066
asma-0.060
Token to
Token ID284
Feature activation+3.05e-01

Pos logit contributions
 mend+0.219
genic+0.199
 LIST+0.189
 swe+0.186
phabet+0.185
Neg logit contributions
aan-0.206
xual-0.205
 laun-0.192
ista-0.189
 differe-0.172
Token damage
Token ID2465
Feature activation-6.22e-01

Pos logit contributions
aan+0.247
xual+0.246
 laun+0.231
ista+0.230
asma+0.210
Neg logit contributions
 mend-0.204
genic-0.182
 LIST-0.170
 swe-0.168
phabet-0.166
Token tapped
Token ID22499
Feature activation-2.87e+00

Pos logit contributions
 mend+0.429
genic+0.385
 LIST+0.362
 swe+0.356
phabet+0.355
Neg logit contributions
aan-0.486
xual-0.485
 laun-0.455
ista-0.453
 differe-0.413
Token creatures
Token ID8109
Feature activation-2.11e+00

Pos logit contributions
 mend+1.887
genic+1.688
 LIST+1.583
 swe+1.580
phabet+1.515
Neg logit contributions
xual-2.351
aan-2.322
 laun-2.171
ista-2.162
 differe-2.006
Token,
Token ID11
Feature activation-3.04e-02

Pos logit contributions
 mend+1.591
genic+1.449
 LIST+1.382
 swe+1.360
joining+1.330
Neg logit contributions
xual-1.503
aan-1.500
 laun-1.393
ista-1.376
asma-1.260
Token and
Token ID290
Feature activation-3.10e-02

Pos logit contributions
 mend+0.023
genic+0.021
 LIST+0.020
phabet+0.020
 swe+0.020
Neg logit contributions
aan-0.021
xual-0.021
 laun-0.020
ista-0.020
 differe-0.018
Token Hot
Token ID6964
Feature activation-7.87e-01

Pos logit contributions
 mend+0.022
genic+0.020
 LIST+0.019
phabet+0.019
��+0.019
Neg logit contributions
aan-0.023
xual-0.023
 laun-0.022
ista-0.022
 differe-0.020
Token Soup
Token ID34011
Feature activation+5.78e-01

Pos logit contributions
 mend+0.594
genic+0.541
 LIST+0.512
phabet+0.502
 swe+0.502
Neg logit contributions
xual-0.570
aan-0.564
 laun-0.524
ista-0.519
 differe-0.473
Token want
Token ID765
Feature activation-4.68e-01

Pos logit contributions
aan+0.314
xual+0.311
 laun+0.292
ista+0.291
 differe+0.264
Neg logit contributions
 mend-0.300
genic-0.271
 LIST-0.255
phabet-0.253
��-0.252
Token to
Token ID284
Feature activation+2.05e-01

Pos logit contributions
 mend+0.350
genic+0.318
 LIST+0.299
phabet+0.297
��+0.296
Neg logit contributions
aan-0.338
xual-0.335
 laun-0.314
ista-0.312
 differe-0.283
Token
Token ID564
Feature activation+2.78e-03

Pos logit contributions
aan+0.170
xual+0.169
 laun+0.160
ista+0.159
 differe+0.146
Neg logit contributions
 mend-0.131
genic-0.117
 LIST-0.109
phabet-0.107
 swe-0.107
Token
Token ID246
Feature activation-2.19e-01

Pos logit contributions
aan+0.003
xual+0.002
 laun+0.002
ista+0.002
 differe+0.002
Neg logit contributions
 mend-0.002
genic-0.001
 LIST-0.001
phabet-0.001
��-0.001
Tokencr
Token ID6098
Feature activation+2.11e-01

Pos logit contributions
 mend+0.169
genic+0.154
 LIST+0.145
phabet+0.144
��+0.143
Neg logit contributions
aan-0.154
xual-0.152
 laun-0.143
ista-0.142
 differe-0.128
Tokenack
Token ID441
Feature activation-2.84e+00

Pos logit contributions
aan+0.220
xual+0.219
 laun+0.209
ista+0.209
 differe+0.195
Neg logit contributions
 mend-0.090
genic-0.076
 LIST-0.068
phabet-0.067
��-0.067
Token the
Token ID262
Feature activation-1.84e-02

Pos logit contributions
 mend+1.854
genic+1.639
 LIST+1.544
phabet+1.523
 swe+1.511
Neg logit contributions
xual-2.319
aan-2.318
 laun-2.187
ista-2.168
asma-2.000
Token code
Token ID2438
Feature activation-3.89e-01

Pos logit contributions
 mend+0.013
genic+0.012
 LIST+0.011
phabet+0.011
��+0.011
Neg logit contributions
aan-0.014
xual-0.014
 laun-0.013
ista-0.013
 differe-0.012
Token
Token ID447
Feature activation-1.76e-01

Pos logit contributions
 mend+0.299
genic+0.272
 LIST+0.258
phabet+0.255
 swe+0.254
Neg logit contributions
aan-0.273
xual-0.271
 laun-0.253
ista-0.251
 differe-0.227
Token
Token ID247
Feature activation+8.64e-02

Pos logit contributions
 mend+0.100
genic+0.088
 LIST+0.081
phabet+0.080
��+0.079
Neg logit contributions
aan-0.159
xual-0.158
 laun-0.150
ista-0.150
 differe-0.139
Token of
Token ID286
Feature activation+1.73e-01

Pos logit contributions
aan+0.060
xual+0.060
 laun+0.056
ista+0.055
 differe+0.050
Neg logit contributions
 mend-0.067
genic-0.061
 LIST-0.058
phabet-0.057
��-0.057
Tokenco
Token ID1073
Feature activation-6.39e-01

Pos logit contributions
 mend+0.263
genic+0.240
��+0.233
 LIST+0.227
phabet+0.227
Neg logit contributions
aan-0.211
xual-0.208
ista-0.195
 laun-0.191
asma-0.172
Token.
Token ID13
Feature activation-1.36e-01

Pos logit contributions
 mend+0.505
genic+0.462
 LIST+0.436
phabet+0.434
��+0.432
Neg logit contributions
aan-0.434
xual-0.430
 laun-0.401
ista-0.399
 differe-0.358
Tokenuk
Token ID2724
Feature activation+3.64e-01

Pos logit contributions
 mend+0.089
genic+0.079
��+0.074
 LIST+0.074
phabet+0.074
Neg logit contributions
aan-0.112
xual-0.111
ista-0.105
 laun-0.104
asma-0.095
Token/
Token ID14
Feature activation-3.30e-02

Pos logit contributions
aan+0.268
xual+0.265
ista+0.250
 laun+0.248
asma+0.224
Neg logit contributions
 mend-0.267
genic-0.243
 LIST-0.226
phabet-0.226
��-0.226
Tokens
Token ID82
Feature activation-5.81e-01

Pos logit contributions
 mend+0.021
genic+0.019
 LIST+0.018
��+0.018
phabet+0.018
Neg logit contributions
aan-0.027
xual-0.027
 laun-0.025
ista-0.025
 differe-0.023
Tokenport
Token ID634
Feature activation-2.82e+00

Pos logit contributions
 mend+0.373
genic+0.333
 LIST+0.310
phabet+0.307
��+0.306
Neg logit contributions
aan-0.481
xual-0.478
 laun-0.452
ista-0.450
 differe-0.413
Token
Token ID198
Feature activation+8.23e-02

Pos logit contributions
 mend+1.970
genic+1.792
 LIST+1.674
phabet+1.656
joining+1.651
Neg logit contributions
xual-2.143
aan-2.142
 laun-2.024
ista-1.978
 differe-1.833
Token
Token ID198
Feature activation+8.62e-02

Pos logit contributions
aan+0.064
xual+0.063
 laun+0.060
ista+0.059
 differe+0.054
Neg logit contributions
 mend-0.058
genic-0.052
 LIST-0.049
phabet-0.048
 swe-0.048
TokenBack
Token ID7282
Feature activation-3.84e-01

Pos logit contributions
aan+0.055
xual+0.055
 laun+0.051
ista+0.050
 differe+0.045
Neg logit contributions
 mend-0.072
genic-0.066
 LIST-0.062
phabet-0.062
��-0.062
Token then
Token ID788
Feature activation-6.40e-01

Pos logit contributions
 mend+0.304
genic+0.278
 LIST+0.263
phabet+0.261
��+0.260
Neg logit contributions
aan-0.261
xual-0.259
 laun-0.241
ista-0.240
 differe-0.215
Token we
Token ID356
Feature activation+4.42e-02

Pos logit contributions
 mend+0.458
genic+0.414
 LIST+0.389
phabet+0.386
��+0.384
Neg logit contributions
aan-0.483
xual-0.480
 laun-0.450
ista-0.448
 differe-0.407
Token,
Token ID11
Feature activation-1.59e-01

Pos logit contributions
aan+0.266
xual+0.265
 laun+0.249
ista+0.246
 differe+0.224
Neg logit contributions
 mend-0.268
genic-0.243
 LIST-0.229
phabet-0.225
 swe-0.225
Token art
Token ID1242
Feature activation-1.34e+00

Pos logit contributions
 mend+0.047
genic+0.037
��+0.031
 LIST+0.031
phabet+0.031
Neg logit contributions
aan-0.187
xual-0.185
ista-0.178
 laun-0.177
asma-0.167
Tokenworks
Token ID5225
Feature activation-6.41e-01

Pos logit contributions
 mend+0.647
genic+0.557
 LIST+0.502
phabet+0.493
 swe+0.489
Neg logit contributions
aan-1.318
xual-1.313
 laun-1.257
ista-1.241
 differe-1.161
Token and
Token ID290
Feature activation-2.36e-01

Pos logit contributions
 mend+0.508
genic+0.464
 LIST+0.439
 swe+0.435
phabet+0.429
Neg logit contributions
xual-0.437
aan-0.435
 laun-0.411
ista-0.397
 differe-0.363
Token a
Token ID257
Feature activation-1.39e-01

Pos logit contributions
 mend+0.168
genic+0.151
 LIST+0.142
 swe+0.141
phabet+0.138
Neg logit contributions
xual-0.180
aan-0.179
 laun-0.171
ista-0.166
 differe-0.153
Token detailed
Token ID6496
Feature activation-2.82e+00

Pos logit contributions
 mend+0.102
genic+0.093
 LIST+0.088
 swe+0.086
phabet+0.086
Neg logit contributions
aan-0.101
xual-0.101
 laun-0.096
ista-0.093
 differe-0.085
Token write
Token ID3551
Feature activation-1.01e+00

Pos logit contributions
 mend+1.974
genic+1.738
 LIST+1.653
 swe+1.628
phabet+1.594
Neg logit contributions
aan-2.205
xual-2.185
 laun-2.073
ista-2.029
asma-1.875
Tokenup
Token ID929
Feature activation-1.08e-01

Pos logit contributions
 mend+0.596
genic+0.528
 LIST+0.487
phabet+0.482
��+0.476
Neg logit contributions
aan-0.891
xual-0.886
 laun-0.841
ista-0.835
 differe-0.773
Token into
Token ID656
Feature activation+9.17e-02

Pos logit contributions
 mend+0.078
genic+0.071
 LIST+0.067
phabet+0.066
 swe+0.066
Neg logit contributions
aan-0.081
xual-0.080
 laun-0.076
ista-0.075
 differe-0.068
Token the
Token ID262
Feature activation-2.17e-01

Pos logit contributions
aan+0.071
xual+0.070
 laun+0.067
ista+0.065
 differe+0.060
Neg logit contributions
 mend-0.064
genic-0.058
 LIST-0.055
 swe-0.054
phabet-0.053
Token making
Token ID1642
Feature activation-4.48e-01

Pos logit contributions
 mend+0.164
genic+0.148
 LIST+0.140
 swe+0.138
joining+0.137
Neg logit contributions
aan-0.156
xual-0.155
 laun-0.145
ista-0.143
asma-0.131
Token\n
Token ID198
Feature activation-7.52e-02

Pos logit contributions
aan+0.056
xual+0.056
 laun+0.053
ista+0.052
asma+0.047
Neg logit contributions
 mend-0.055
genic-0.050
 LIST-0.047
phabet-0.046
 swe-0.046
Token\n
Token ID198
Feature activation+3.34e-02

Pos logit contributions
 mend+0.052
genic+0.047
 LIST+0.044
phabet+0.044
 swe+0.043
Neg logit contributions
aan-0.058
xual-0.058
 laun-0.055
ista-0.054
 differe-0.050
Tokenif
Token ID361
Feature activation+1.19e+00

Pos logit contributions
aan+0.023
xual+0.023
 laun+0.021
ista+0.021
 differe+0.019
Neg logit contributions
 mend-0.026
genic-0.024
 LIST-0.023
phabet-0.022
��-0.022
Token you
Token ID345
Feature activation+5.43e-01

Pos logit contributions
aan+0.899
xual+0.889
 laun+0.836
ista+0.834
 differe+0.758
Neg logit contributions
 mend-0.847
genic-0.770
 LIST-0.720
��-0.717
phabet-0.717
Token're
Token ID821
Feature activation+1.01e+00

Pos logit contributions
aan+0.421
xual+0.418
 laun+0.393
ista+0.391
 differe+0.357
Neg logit contributions
 mend-0.378
genic-0.341
 LIST-0.319
phabet-0.317
��-0.316
Token wondering
Token ID11263
Feature activation+2.48e+00

Pos logit contributions
aan+0.739
xual+0.734
 laun+0.687
ista+0.683
 differe+0.619
Neg logit contributions
 mend-0.745
genic-0.676
 LIST-0.637
phabet-0.630
��-0.627
Token about
Token ID546
Feature activation+4.48e-01

Pos logit contributions
aan+1.772
xual+1.752
ista+1.651
 laun+1.648
 differe+1.494
Neg logit contributions
 mend-1.848
genic-1.699
��-1.612
phabet-1.595
 LIST-1.579
Token the
Token ID262
Feature activation+5.21e-01

Pos logit contributions
aan+0.345
xual+0.343
 laun+0.322
ista+0.321
 differe+0.292
Neg logit contributions
 mend-0.313
genic-0.283
 LIST-0.265
phabet-0.263
��-0.262
Token storms
Token ID20228
Feature activation+5.45e-01

Pos logit contributions
aan+0.386
xual+0.384
 laun+0.359
ista+0.358
asma+0.324
Neg logit contributions
 mend-0.381
genic-0.345
 LIST-0.324
phabet-0.321
��-0.319
Token,
Token ID11
Feature activation+3.06e-01

Pos logit contributions
aan+0.393
xual+0.389
 laun+0.364
ista+0.363
 differe+0.328
Neg logit contributions
 mend-0.409
genic-0.372
 LIST-0.350
phabet-0.348
��-0.348
Token I
Token ID314
Feature activation+3.54e-01

Pos logit contributions
aan+0.237
xual+0.235
 laun+0.221
ista+0.220
 differe+0.201
Neg logit contributions
 mend-0.213
genic-0.193
 LIST-0.180
phabet-0.179
��-0.178
Tokenmask
Token ID27932
Feature activation-4.86e-01

Pos logit contributions
aan+0.646
xual+0.643
 laun+0.606
ista+0.600
 differe+0.548
Neg logit contributions
 mend-0.594
genic-0.537
 LIST-0.503
phabet-0.499
��-0.495
Tokened
Token ID276
Feature activation-7.73e-01

Pos logit contributions
 mend+0.382
genic+0.349
 LIST+0.330
phabet+0.325
 swe+0.324
Neg logit contributions
aan-0.331
xual-0.329
 laun-0.310
ista-0.303
 differe-0.276
Token-
Token ID12
Feature activation-7.23e-01

Pos logit contributions
 mend+0.568
genic+0.516
 LIST+0.484
phabet+0.482
��+0.480
Neg logit contributions
aan-0.568
xual-0.562
 laun-0.528
ista-0.526
 differe-0.477
TokenSus
Token ID30746
Feature activation+4.75e-01

Pos logit contributions
 mend+0.599
genic+0.550
 LIST+0.523
phabet+0.516
joining+0.515
Neg logit contributions
xual-0.464
aan-0.462
 laun-0.433
ista-0.421
 differe-0.381
Tokenie
Token ID494
Feature activation-3.31e-01

Pos logit contributions
aan+0.397
xual+0.394
 laun+0.373
ista+0.371
 differe+0.341
Neg logit contributions
 mend-0.301
genic-0.270
 LIST-0.250
phabet-0.248
��-0.247
Token Ess
Token ID11985
Feature activation+2.18e+00

Pos logit contributions
 mend+0.239
genic+0.217
 LIST+0.204
phabet+0.201
��+0.200
Neg logit contributions
aan-0.248
xual-0.247
 laun-0.233
ista-0.229
 differe-0.210
Tokenman
Token ID805
Feature activation-3.05e-01

Pos logit contributions
aan+1.469
xual+1.445
ista+1.347
 laun+1.342
 differe+1.201
Neg logit contributions
 mend-1.739
genic-1.592
 LIST-1.510
phabet-1.502
��-1.494
Token 2008
Token ID3648
Feature activation-3.01e-01

Pos logit contributions
 mend+0.234
genic+0.213
 LIST+0.201
phabet+0.199
��+0.199
Neg logit contributions
aan-0.214
xual-0.213
 laun-0.199
ista-0.197
 differe-0.178
Token01
Token ID486
Feature activation-1.14e+00

Pos logit contributions
 mend+0.219
genic+0.198
 LIST+0.186
phabet+0.185
��+0.184
Neg logit contributions
aan-0.224
xual-0.222
 laun-0.208
ista-0.207
 differe-0.188
Token28
Token ID2078
Feature activation-7.69e-01

Pos logit contributions
 mend+0.873
genic+0.797
 LIST+0.751
��+0.736
joining+0.733
Neg logit contributions
xual-0.803
aan-0.802
 laun-0.754
ista-0.735
 differe-0.675
Token O
Token ID440
Feature activation-6.48e-01

Pos logit contributions
 mend+0.458
genic+0.399
 LIST+0.383
 swe+0.365
phabet+0.359
Neg logit contributions
xual-0.683
aan-0.675
 laun-0.643
ista-0.629
 differe-0.588
Token an
Token ID281
Feature activation+4.57e-02

Pos logit contributions
aan+0.539
xual+0.529
ista+0.501
 laun+0.500
 differe+0.451
Neg logit contributions
 mend-0.539
genic-0.494
��-0.469
phabet-0.460
 LIST-0.457
Token In
Token ID554
Feature activation+5.40e-01

Pos logit contributions
xual+0.039
aan+0.039
 laun+0.037
ista+0.036
asma+0.034
Neg logit contributions
 mend-0.028
genic-0.025
 LIST-0.023
phabet-0.023
 swe-0.023
TokenDaily
Token ID28545
Feature activation+1.26e+00

Pos logit contributions
aan+0.264
xual+0.260
 laun+0.235
ista+0.234
 differe+0.199
Neg logit contributions
 mend-0.531
genic-0.494
 LIST-0.473
phabet-0.471
��-0.471
Token supporter
Token ID15525
Feature activation-6.52e-02

Pos logit contributions
aan+0.895
xual+0.878
ista+0.828
 laun+0.821
 differe+0.742
Neg logit contributions
 mend-0.958
genic-0.878
��-0.826
phabet-0.821
 LIST-0.819
Token.
Token ID13
Feature activation+4.43e-01

Pos logit contributions
 mend+0.049
genic+0.045
 LIST+0.043
phabet+0.043
joining+0.042
Neg logit contributions
aan-0.047
xual-0.046
 laun-0.043
ista-0.043
 differe-0.039
Token Powered
Token ID45090
Feature activation+2.14e+00

Pos logit contributions
xual+0.228
aan+0.224
 laun+0.194
ista+0.190
 differe+0.173
Neg logit contributions
 mend-0.433
genic-0.397
 LIST-0.389
joining-0.383
phabet-0.381
Token by
Token ID416
Feature activation+7.07e-01

Pos logit contributions
aan+1.524
xual+1.514
ista+1.428
 laun+1.394
asma+1.267
Neg logit contributions
 mend-1.616
genic-1.464
��-1.399
phabet-1.381
 LIST-1.378
Token Press
Token ID4332
Feature activation+8.06e-02

Pos logit contributions
xual+0.614
aan+0.612
 laun+0.577
ista+0.574
 differe+0.531
Neg logit contributions
 mend-0.425
genic-0.376
 LIST-0.350
 swe-0.341
phabet-0.341
TokenPat
Token ID12130
Feature activation+1.49e+00

Pos logit contributions
aan+0.042
xual+0.042
 laun+0.038
ista+0.038
 differe+0.033
Neg logit contributions
 mend-0.076
genic-0.071
 LIST-0.068
phabet-0.067
��-0.067
Tokenron
Token ID1313
Feature activation+1.27e-01

Pos logit contributions
xual+1.083
aan+1.079
 laun+1.015
ista+0.991
 differe+0.917
Neg logit contributions
 mend-1.113
genic-1.006
 LIST-0.951
phabet-0.937
 swe-0.926
Token<|endoftext|>
Token ID50256
Feature activation+2.21e-01

Pos logit contributions
aan+0.120
xual+0.119
 laun+0.114
ista+0.112
 differe+0.106
Neg logit contributions
 mend-0.067
genic-0.058
 LIST-0.054
phabet-0.053
 swe-0.052
Token focus
Token ID2962
Feature activation+3.66e-01

Pos logit contributions
aan+0.456
xual+0.452
 laun+0.424
ista+0.422
 differe+0.383
Neg logit contributions
 mend-0.446
genic-0.404
 LIST-0.380
phabet-0.377
��-0.376
Token on
Token ID319
Feature activation+6.85e-01

Pos logit contributions
aan+0.253
xual+0.251
 laun+0.234
ista+0.233
 differe+0.210
Neg logit contributions
 mend-0.285
genic-0.260
 LIST-0.246
phabet-0.244
��-0.244
Token exploration
Token ID13936
Feature activation+7.45e-01

Pos logit contributions
aan+0.475
xual+0.471
 laun+0.440
ista+0.437
 differe+0.394
Neg logit contributions
 mend-0.531
genic-0.485
 LIST-0.458
phabet-0.456
��-0.454
Token that
Token ID326
Feature activation+8.27e-02

Pos logit contributions
aan+0.548
xual+0.543
 laun+0.509
ista+0.507
 differe+0.460
Neg logit contributions
 mend-0.549
genic-0.498
 LIST-0.468
phabet-0.465
��-0.464
Token W
Token ID370
Feature activation-6.98e-01

Pos logit contributions
aan+0.063
xual+0.063
 laun+0.059
ista+0.059
 differe+0.053
Neg logit contributions
 mend-0.059
genic-0.053
 LIST-0.050
phabet-0.049
��-0.049
Tokenats
Token ID1381
Feature activation+2.20e+00

Pos logit contributions
 mend+0.477
genic+0.429
phabet+0.401
 LIST+0.400
 swe+0.395
Neg logit contributions
xual-0.548
aan-0.545
 laun-0.517
ista-0.504
 differe-0.469
Tokenham
Token ID2763
Feature activation+5.45e-01

Pos logit contributions
aan+1.656
xual+1.634
ista+1.537
 laun+1.529
asma+1.384
Neg logit contributions
 mend-1.591
genic-1.446
 LIST-1.355
��-1.355
phabet-1.351
Token touched
Token ID12615
Feature activation+2.10e-01

Pos logit contributions
aan+0.390
xual+0.387
 laun+0.362
ista+0.360
 differe+0.326
Neg logit contributions
 mend-0.411
genic-0.373
 LIST-0.352
phabet-0.349
��-0.348
Token on
Token ID319
Feature activation+2.67e-01

Pos logit contributions
aan+0.142
xual+0.140
 laun+0.131
ista+0.130
 differe+0.117
Neg logit contributions
 mend-0.167
genic-0.153
 LIST-0.145
phabet-0.144
��-0.144
Token is
Token ID318
Feature activation+1.32e-01

Pos logit contributions
aan+0.192
xual+0.190
 laun+0.178
ista+0.177
asma+0.160
Neg logit contributions
 mend-0.201
genic-0.183
 LIST-0.173
 swe-0.170
phabet-0.170
Token a
Token ID257
Feature activation+9.61e-03

Pos logit contributions
aan+0.097
xual+0.096
 laun+0.090
ista+0.089
asma+0.081
Neg logit contributions
 mend-0.098
genic-0.089
 LIST-0.084
phabet-0.082
 swe-0.082
Token similar
Token ID2092
Feature activation-9.80e-02

Pos logit contributions
 mend+0.636
genic+0.573
 LIST+0.543
 swe+0.538
phabet+0.533
Neg logit contributions
aan-0.595
xual-0.594
 laun-0.550
ista-0.547
asma-0.496
Token [
Token ID685
Feature activation-5.89e-02

Pos logit contributions
 mend+0.075
genic+0.068
 LIST+0.065
phabet+0.064
 swe+0.064
Neg logit contributions
aan-0.069
xual-0.069
 laun-0.064
ista-0.064
 differe-0.057
Tokento
Token ID1462
Feature activation+2.12e-01

Pos logit contributions
 mend+0.041
genic+0.037
 LIST+0.035
phabet+0.035
��+0.035
Neg logit contributions
aan-0.045
xual-0.045
 laun-0.042
ista-0.042
 differe-0.038
Token the
Token ID262
Feature activation+2.91e-01

Pos logit contributions
aan+0.163
xual+0.162
 laun+0.152
ista+0.152
 differe+0.138
Neg logit contributions
 mend-0.148
genic-0.134
 LIST-0.125
phabet-0.125
��-0.124
Token Ford
Token ID8092
Feature activation+2.95e-01

Pos logit contributions
aan+0.210
xual+0.208
 laun+0.195
ista+0.194
 differe+0.176
Neg logit contributions
 mend-0.218
genic-0.198
 LIST-0.186
phabet-0.186
��-0.185
Token Fiesta
Token ID49839
Feature activation+2.36e+00

Pos logit contributions
aan+0.204
xual+0.202
 laun+0.189
ista+0.188
 differe+0.169
Neg logit contributions
 mend-0.230
genic-0.210
 LIST-0.198
phabet-0.197
��-0.196
Token RS
Token ID19340
Feature activation-7.82e-01

Pos logit contributions
aan+1.796
xual+1.785
ista+1.689
 laun+1.679
asma+1.524
Neg logit contributions
 mend-1.666
genic-1.499
��-1.405
phabet-1.397
 LIST-1.392
Token W
Token ID370
Feature activation-1.04e+00

Pos logit contributions
 mend+0.605
genic+0.554
 LIST+0.525
phabet+0.519
 swe+0.515
Neg logit contributions
aan-0.544
xual-0.540
 laun-0.504
ista-0.497
 differe-0.452
TokenRC
Token ID7397
Feature activation+3.80e-01

Pos logit contributions
 mend+0.666
genic+0.597
 LIST+0.557
phabet+0.552
��+0.546
Neg logit contributions
aan-0.865
xual-0.858
 laun-0.814
ista-0.804
 differe-0.741
Token]
Token ID60
Feature activation+4.91e-01

Pos logit contributions
aan+0.262
xual+0.260
 laun+0.243
ista+0.242
 differe+0.217
Neg logit contributions
 mend-0.296
genic-0.270
 LIST-0.255
phabet-0.253
��-0.253
Token so
Token ID523
Feature activation+5.38e-01

Pos logit contributions
aan+0.351
xual+0.348
 laun+0.325
ista+0.324
 differe+0.293
Neg logit contributions
 mend-0.372
genic-0.338
 LIST-0.319
phabet-0.317
��-0.316
Token and
Token ID290
Feature activation+4.30e-01

Pos logit contributions
aan+0.635
xual+0.630
 laun+0.588
ista+0.585
 differe+0.526
Neg logit contributions
 mend-0.717
genic-0.655
 LIST-0.618
phabet-0.614
��-0.612
Token it
Token ID340
Feature activation+3.55e-01

Pos logit contributions
aan+0.323
xual+0.321
 laun+0.301
ista+0.299
 differe+0.272
Neg logit contributions
 mend-0.310
genic-0.280
 LIST-0.263
phabet-0.261
��-0.259
Token is
Token ID318
Feature activation+2.59e-01

Pos logit contributions
aan+0.244
xual+0.242
 laun+0.226
ista+0.225
 differe+0.202
Neg logit contributions
 mend-0.278
genic-0.254
 LIST-0.240
phabet-0.238
��-0.236
Token on
Token ID319
Feature activation+5.48e-01

Pos logit contributions
aan+0.191
xual+0.191
 laun+0.178
ista+0.177
 differe+0.160
Neg logit contributions
 mend-0.190
genic-0.172
 LIST-0.162
 swe-0.160
phabet-0.158
Token the
Token ID262
Feature activation+2.44e-01

Pos logit contributions
aan+0.419
xual+0.414
 laun+0.390
ista+0.388
 differe+0.353
Neg logit contributions
 mend-0.387
genic-0.350
 LIST-0.328
��-0.327
phabet-0.327
Token floor
Token ID4314
Feature activation+1.22e+00

Pos logit contributions
aan+0.189
xual+0.187
 laun+0.177
ista+0.176
 differe+0.161
Neg logit contributions
 mend-0.168
genic-0.153
phabet-0.143
��-0.143
 LIST-0.143
Token of
Token ID286
Feature activation+9.11e-02

Pos logit contributions
aan+0.853
xual+0.844
ista+0.790
 laun+0.788
 differe+0.708
Neg logit contributions
 mend-0.944
genic-0.863
��-0.816
phabet-0.813
 LIST-0.812
Token the
Token ID262
Feature activation+3.52e-01

Pos logit contributions
aan+0.068
xual+0.067
 laun+0.063
ista+0.063
 differe+0.057
Neg logit contributions
 mend-0.066
genic-0.060
 LIST-0.056
phabet-0.056
��-0.056
Token reactor
Token ID21905
Feature activation+6.94e-01

Pos logit contributions
aan+0.239
xual+0.237
 laun+0.221
ista+0.220
 differe+0.197
Neg logit contributions
 mend-0.278
genic-0.254
 LIST-0.240
phabet-0.239
��-0.238
Token cavity
Token ID31643
Feature activation+1.09e+00

Pos logit contributions
aan+0.505
xual+0.500
 laun+0.470
ista+0.467
 differe+0.423
Neg logit contributions
 mend-0.515
genic-0.468
 LIST-0.440
��-0.438
phabet-0.438
Token.
Token ID13
Feature activation-9.01e-02

Pos logit contributions
aan+0.765
xual+0.756
 laun+0.709
ista+0.707
 differe+0.635
Neg logit contributions
 mend-0.837
genic-0.764
��-0.720
phabet-0.718
 LIST-0.718
Token doing
Token ID1804
Feature activation+4.55e-02

Pos logit contributions
 mend+0.137
genic+0.123
 LIST+0.116
 swe+0.115
joining+0.113
Neg logit contributions
aan-0.150
xual-0.149
 laun-0.139
ista-0.138
asma-0.127
Token other
Token ID584
Feature activation-3.29e-01

Pos logit contributions
aan+0.037
xual+0.037
 laun+0.035
ista+0.035
 differe+0.032
Neg logit contributions
 mend-0.030
genic-0.027
 LIST-0.025
phabet-0.025
��-0.025
Token things
Token ID1243
Feature activation+2.49e-01

Pos logit contributions
 mend+0.190
genic+0.166
 LIST+0.153
 swe+0.152
phabet+0.150
Neg logit contributions
aan-0.295
xual-0.294
 laun-0.278
ista-0.277
asma-0.257
Token -
Token ID532
Feature activation-5.79e-01

Pos logit contributions
aan+0.189
xual+0.187
 laun+0.176
ista+0.175
 differe+0.159
Neg logit contributions
 mend-0.178
genic-0.161
 LIST-0.151
phabet-0.150
��-0.149
Token the
Token ID262
Feature activation+1.72e-01

Pos logit contributions
 mend+0.416
genic+0.376
 LIST+0.354
phabet+0.349
 swe+0.349
Neg logit contributions
aan-0.435
xual-0.431
 laun-0.405
ista-0.403
asma-0.367
Token pre
Token ID662
Feature activation+8.83e-01

Pos logit contributions
aan+0.130
xual+0.129
 laun+0.121
ista+0.120
asma+0.109
Neg logit contributions
 mend-0.123
genic-0.111
 LIST-0.105
 swe-0.103
phabet-0.103
Token-
Token ID12
Feature activation+1.34e-01

Pos logit contributions
aan+0.657
xual+0.653
 laun+0.611
ista+0.610
 differe+0.553
Neg logit contributions
 mend-0.640
genic-0.580
 LIST-0.545
��-0.542
phabet-0.542
Tokenemption
Token ID11221
Feature activation+9.03e-01

Pos logit contributions
aan+0.089
xual+0.088
 laun+0.083
ista+0.082
 differe+0.074
Neg logit contributions
 mend-0.107
genic-0.098
 LIST-0.093
phabet-0.092
joining-0.092
Token,
Token ID11
Feature activation+2.55e-01

Pos logit contributions
aan+0.627
xual+0.621
 laun+0.580
ista+0.578
 differe+0.520
Neg logit contributions
 mend-0.701
genic-0.640
 LIST-0.604
phabet-0.600
��-0.599
Token referring
Token ID9759
Feature activation+4.89e-01

Pos logit contributions
aan+0.194
xual+0.192
 laun+0.180
ista+0.180
asma+0.164
Neg logit contributions
 mend-0.182
genic-0.164
 LIST-0.154
phabet-0.152
 swe-0.152
Token to
Token ID284
Feature activation+4.80e-01

Pos logit contributions
aan+0.354
xual+0.352
 laun+0.330
ista+0.328
 differe+0.298
Neg logit contributions
 mend-0.363
genic-0.330
 LIST-0.311
phabet-0.310
��-0.309
Token in
Token ID287
Feature activation-1.60e-02

Pos logit contributions
 mend+0.084
genic+0.076
��+0.070
phabet+0.070
 LIST+0.070
Neg logit contributions
aan-0.108
xual-0.107
 laun-0.101
ista-0.101
 differe-0.092
Token the
Token ID262
Feature activation-8.53e-02

Pos logit contributions
 mend+0.011
genic+0.010
 LIST+0.010
phabet+0.010
��+0.010
Neg logit contributions
aan-0.012
xual-0.012
 laun-0.011
ista-0.011
 differe-0.010
Token White
Token ID2635
Feature activation+6.19e-01

Pos logit contributions
 mend+0.060
genic+0.054
 LIST+0.051
phabet+0.050
��+0.050
Neg logit contributions
aan-0.065
xual-0.065
 laun-0.061
ista-0.061
 differe-0.055
Token House
Token ID2097
Feature activation+4.07e-01

Pos logit contributions
aan+0.305
xual+0.302
 laun+0.273
ista+0.272
 differe+0.232
Neg logit contributions
 mend-0.605
genic-0.563
 LIST-0.537
phabet-0.536
��-0.535
Token (
Token ID357
Feature activation-1.54e-01

Pos logit contributions
aan+0.289
xual+0.286
 laun+0.268
ista+0.266
 differe+0.240
Neg logit contributions
 mend-0.311
genic-0.283
 LIST-0.267
phabet-0.265
��-0.264
Tokenshe
Token ID7091
Feature activation+3.42e-01

Pos logit contributions
 mend+0.109
genic+0.098
 LIST+0.092
phabet+0.091
��+0.091
Neg logit contributions
aan-0.118
xual-0.117
 laun-0.110
ista-0.110
 differe-0.100
Token turned
Token ID2900
Feature activation-4.17e-01

Pos logit contributions
aan+0.259
xual+0.257
 laun+0.240
ista+0.239
 differe+0.218
Neg logit contributions
 mend-0.244
genic-0.220
 LIST-0.207
phabet-0.205
 swe-0.205
Token down
Token ID866
Feature activation+4.27e-02

Pos logit contributions
 mend+0.278
genic+0.248
 LIST+0.230
 swe+0.227
phabet+0.227
Neg logit contributions
aan-0.339
xual-0.336
 laun-0.317
ista-0.313
asma-0.289
Token a
Token ID257
Feature activation-2.92e-01

Pos logit contributions
aan+0.033
xual+0.033
 laun+0.031
ista+0.031
 differe+0.028
Neg logit contributions
 mend-0.030
genic-0.027
 LIST-0.025
phabet-0.025
��-0.025
Token request
Token ID2581
Feature activation+5.94e-01

Pos logit contributions
 mend+0.209
genic+0.185
 LIST+0.177
 swe+0.174
phabet+0.172
Neg logit contributions
aan-0.224
xual-0.224
 laun-0.209
ista-0.205
 differe-0.189
Token from
Token ID422
Feature activation-2.30e-03

Pos logit contributions
aan+0.392
xual+0.388
 laun+0.361
ista+0.359
 differe+0.321
Neg logit contributions
 mend-0.482
genic-0.442
 LIST-0.418
phabet-0.415
��-0.414
Tokenul
Token ID377
Feature activation-6.53e-01

Pos logit contributions
aan+0.147
xual+0.145
ista+0.135
 laun+0.135
asma+0.120
Neg logit contributions
 mend-0.176
genic-0.161
 LIST-0.152
phabet-0.151
��-0.151
Tokenc
Token ID66
Feature activation+3.12e-01

Pos logit contributions
 mend+0.435
genic+0.391
 LIST+0.364
phabet+0.361
��+0.359
Neg logit contributions
aan-0.526
xual-0.523
 laun-0.493
ista-0.491
 differe-0.449
Tokenet
Token ID316
Feature activation-4.02e-01

Pos logit contributions
aan+0.217
xual+0.214
ista+0.201
 laun+0.201
asma+0.180
Neg logit contributions
 mend-0.242
genic-0.219
 LIST-0.208
phabet-0.207
��-0.207
Token tones
Token ID23755
Feature activation+2.22e-01

Pos logit contributions
 mend+0.289
genic+0.261
 LIST+0.245
phabet+0.243
��+0.241
Neg logit contributions
aan-0.302
xual-0.301
 laun-0.283
ista-0.280
 differe-0.256
Token.
Token ID13
Feature activation+1.20e-01

Pos logit contributions
aan+0.152
xual+0.151
 laun+0.141
ista+0.140
 differe+0.126
Neg logit contributions
 mend-0.174
genic-0.159
 LIST-0.150
phabet-0.149
��-0.149
Token\n
Token ID198
Feature activation+1.54e-01

Pos logit contributions
aan+0.098
xual+0.098
 laun+0.092
ista+0.091
 differe+0.084
Neg logit contributions
 mend-0.078
genic-0.070
 LIST-0.066
phabet-0.064
��-0.064
Token\n
Token ID198
Feature activation-5.67e-02

Pos logit contributions
aan+0.119
xual+0.119
 laun+0.112
ista+0.110
 differe+0.101
Neg logit contributions
 mend-0.107
genic-0.097
 LIST-0.091
phabet-0.089
 swe-0.089
TokenThe
Token ID464
Feature activation+8.20e-02

Pos logit contributions
 mend+0.047
genic+0.043
 LIST+0.041
joining+0.041
phabet+0.040
Neg logit contributions
xual-0.036
aan-0.036
 laun-0.034
ista-0.033
 differe-0.030
Token closed
Token ID4838
Feature activation-8.04e-01

Pos logit contributions
aan+0.062
xual+0.062
 laun+0.058
ista+0.058
 differe+0.052
Neg logit contributions
 mend-0.059
genic-0.053
 LIST-0.050
 swe-0.049
phabet-0.049
Token beta
Token ID12159
Feature activation-3.10e-01

Pos logit contributions
 mend+0.504
genic+0.450
 LIST+0.418
phabet+0.414
��+0.413
Neg logit contributions
aan-0.678
xual-0.673
 laun-0.636
ista-0.634
 differe-0.582
Token test
Token ID1332
Feature activation-2.09e-01

Pos logit contributions
 mend+0.215
genic+0.193
 LIST+0.182
phabet+0.179
 swe+0.178
Neg logit contributions
aan-0.242
xual-0.241
 laun-0.225
ista-0.224
 differe-0.205
Tokenra
Token ID430
Feature activation-6.52e-01

Pos logit contributions
 mend+0.607
genic+0.561
 LIST+0.525
phabet+0.525
 swe+0.520
Neg logit contributions
aan-0.509
xual-0.505
 laun-0.479
ista-0.464
 differe-0.424
Tokene
Token ID68
Feature activation-4.88e-01

Pos logit contributions
 mend+0.549
genic+0.508
 LIST+0.477
joining+0.472
 swe+0.471
Neg logit contributions
aan-0.405
xual-0.403
 laun-0.386
ista-0.367
 differe-0.338
Token multiplied
Token ID33096
Feature activation+1.34e+00

Pos logit contributions
 mend+0.350
genic+0.316
 LIST+0.296
phabet+0.292
 swe+0.292
Neg logit contributions
aan-0.370
xual-0.367
 laun-0.344
ista-0.343
 differe-0.310
Token in
Token ID287
Feature activation+2.20e-01

Pos logit contributions
aan+0.933
xual+0.931
 laun+0.870
ista+0.865
 differe+0.784
Neg logit contributions
 mend-1.029
genic-0.941
��-0.888
phabet-0.888
 LIST-0.886
Token mac
Token ID8352
Feature activation+2.87e-01

Pos logit contributions
aan+0.167
xual+0.166
 laun+0.156
ista+0.155
 differe+0.142
Neg logit contributions
 mend-0.156
genic-0.141
 LIST-0.133
phabet-0.132
��-0.132
Tokenroph
Token ID10051
Feature activation+1.56e+00

Pos logit contributions
aan+0.233
xual+0.231
 laun+0.218
ista+0.217
 differe+0.199
Neg logit contributions
 mend-0.189
genic-0.170
 LIST-0.158
phabet-0.157
��-0.157
Tokenages
Token ID1095
Feature activation-5.40e-01

Pos logit contributions
aan+1.215
xual+1.200
ista+1.132
 laun+1.120
asma+1.037
Neg logit contributions
 mend-1.088
genic-0.976
��-0.931
 LIST-0.924
phabet-0.914
Token at
Token ID379
Feature activation+1.28e-01

Pos logit contributions
 mend+0.416
genic+0.379
 LIST+0.358
phabet+0.355
��+0.354
Neg logit contributions
aan-0.379
xual-0.376
 laun-0.351
ista-0.349
 differe-0.315
Token the
Token ID262
Feature activation+3.50e-01

Pos logit contributions
aan+0.096
xual+0.095
 laun+0.090
ista+0.089
 differe+0.082
Neg logit contributions
 mend-0.092
genic-0.083
phabet-0.079
 LIST-0.078
��-0.078
Token site
Token ID2524
Feature activation-4.72e-01

Pos logit contributions
aan+0.233
xual+0.230
 laun+0.216
ista+0.214
 differe+0.193
Neg logit contributions
 mend-0.280
genic-0.257
phabet-0.244
 LIST-0.244
��-0.243
Token of
Token ID286
Feature activation+1.81e-01

Pos logit contributions
 mend+0.370
genic+0.339
 LIST+0.321
phabet+0.320
��+0.318
Neg logit contributions
aan-0.322
xual-0.320
 laun-0.298
ista-0.297
 differe-0.267
Token they
Token ID484
Feature activation-9.03e-02

Pos logit contributions
 mend+0.120
genic+0.107
 LIST+0.101
phabet+0.100
 swe+0.099
Neg logit contributions
aan-0.134
xual-0.134
 laun-0.125
ista-0.125
 differe-0.114
Token will
Token ID481
Feature activation-1.32e-01

Pos logit contributions
 mend+0.062
genic+0.056
 LIST+0.052
 swe+0.052
phabet+0.052
Neg logit contributions
aan-0.071
xual-0.070
 laun-0.066
ista-0.066
asma-0.060
Token spend
Token ID4341
Feature activation-7.03e-01

Pos logit contributions
 mend+0.094
genic+0.083
 LIST+0.079
 swe+0.078
phabet+0.077
Neg logit contributions
aan-0.102
xual-0.102
 laun-0.095
ista-0.094
asma-0.086
Token it
Token ID340
Feature activation+3.66e-02

Pos logit contributions
 mend+0.452
genic+0.403
 LIST+0.376
phabet+0.371
 swe+0.370
Neg logit contributions
aan-0.583
xual-0.580
 laun-0.546
ista-0.544
 differe-0.499
Token more
Token ID517
Feature activation+1.47e-01

Pos logit contributions
aan+0.025
xual+0.025
 laun+0.023
ista+0.023
asma+0.021
Neg logit contributions
 mend-0.029
genic-0.026
 LIST-0.025
 swe-0.025
phabet-0.024
Token carefully
Token ID7773
Feature activation-7.92e-01

Pos logit contributions
aan+0.085
xual+0.085
 laun+0.078
ista+0.077
 differe+0.068
Neg logit contributions
 mend-0.131
genic-0.120
 LIST-0.115
phabet-0.114
��-0.113
Token,
Token ID11
Feature activation-4.84e-03

Pos logit contributions
 mend+0.626
genic+0.571
 LIST+0.540
phabet+0.535
 swe+0.532
Neg logit contributions
aan-0.540
xual-0.536
 laun-0.500
ista-0.497
 differe-0.446
Token and
Token ID290
Feature activation-1.03e-01

Pos logit contributions
 mend+0.004
genic+0.003
 LIST+0.003
phabet+0.003
 swe+0.003
Neg logit contributions
aan-0.003
xual-0.003
 laun-0.003
ista-0.003
asma-0.003
Token health
Token ID1535
Feature activation-1.05e+00

Pos logit contributions
 mend+0.072
genic+0.065
 LIST+0.061
phabet+0.060
 swe+0.060
Neg logit contributions
aan-0.079
xual-0.079
 laun-0.074
ista-0.073
 differe-0.067
Token care
Token ID1337
Feature activation-6.99e-01

Pos logit contributions
 mend+0.490
genic+0.414
 LIST+0.373
phabet+0.368
 swe+0.366
Neg logit contributions
aan-1.059
xual-1.056
 laun-1.004
ista-0.997
asma-0.933
Token costs
Token ID3484
Feature activation+1.13e-01

Pos logit contributions
 mend+0.478
genic+0.427
 LIST+0.401
 swe+0.398
phabet+0.396
Neg logit contributions
aan-0.552
xual-0.548
 laun-0.514
ista-0.510
 differe-0.467
Token content
Token ID2695
Feature activation-1.20e-01

Pos logit contributions
aan+0.046
xual+0.046
 laun+0.043
ista+0.043
asma+0.039
Neg logit contributions
 mend-0.045
genic-0.041
 LIST-0.039
phabet-0.038
 swe-0.038
Token of
Token ID286
Feature activation+1.30e-01

Pos logit contributions
 mend+0.092
genic+0.084
 LIST+0.079
phabet+0.079
��+0.078
Neg logit contributions
aan-0.085
xual-0.084
 laun-0.078
ista-0.078
 differe-0.070
Token this
Token ID428
Feature activation+2.01e-01

Pos logit contributions
aan+0.098
xual+0.097
 laun+0.091
ista+0.091
 differe+0.083
Neg logit contributions
 mend-0.093
genic-0.084
 LIST-0.079
phabet-0.079
��-0.079
Token cover
Token ID3002
Feature activation-2.53e-01

Pos logit contributions
aan+0.144
xual+0.143
 laun+0.133
ista+0.133
asma+0.120
Neg logit contributions
 mend-0.152
genic-0.138
 LIST-0.130
phabet-0.129
��-0.128
Token screen
Token ID3159
Feature activation-8.57e-01

Pos logit contributions
 mend+0.188
genic+0.171
 LIST+0.161
��+0.160
phabet+0.160
Neg logit contributions
aan-0.183
xual-0.181
 laun-0.170
ista-0.170
 differe-0.153
Token.
Token ID13
Feature activation-1.71e-01

Pos logit contributions
 mend+0.645
genic+0.586
 LIST+0.553
phabet+0.548
��+0.544
Neg logit contributions
aan-0.615
xual-0.611
 laun-0.571
ista-0.568
 differe-0.514
Token On
Token ID1550
Feature activation-4.58e-01

Pos logit contributions
 mend+0.130
genic+0.118
 LIST+0.112
phabet+0.110
��+0.109
Neg logit contributions
aan-0.122
xual-0.121
 laun-0.113
ista-0.112
 differe-0.102
Token the
Token ID262
Feature activation-2.84e-01

Pos logit contributions
 mend+0.322
genic+0.290
 LIST+0.272
phabet+0.269
 swe+0.267
Neg logit contributions
aan-0.353
xual-0.351
 laun-0.329
ista-0.326
asma-0.298
Token home
Token ID1363
Feature activation+1.29e-01

Pos logit contributions
 mend+0.199
genic+0.180
 LIST+0.169
phabet+0.167
��+0.167
Neg logit contributions
aan-0.218
xual-0.216
 laun-0.203
ista-0.202
 differe-0.184
Token screen
Token ID3159
Feature activation-1.11e+00

Pos logit contributions
aan+0.099
xual+0.098
ista+0.092
 laun+0.092
 differe+0.084
Neg logit contributions
 mend-0.091
genic-0.082
 LIST-0.077
��-0.077
phabet-0.077
Token,
Token ID11
Feature activation-3.42e-01

Pos logit contributions
 mend+0.800
genic+0.719
 LIST+0.681
phabet+0.668
 swe+0.668
Neg logit contributions
aan-0.832
xual-0.831
 laun-0.774
ista-0.766
asma-0.699
Token a
Token ID257
Feature activation-2.39e-01

Pos logit contributions
 mend+0.055
genic+0.050
 LIST+0.046
phabet+0.046
��+0.046
Neg logit contributions
aan-0.059
xual-0.058
 laun-0.055
ista-0.055
 differe-0.050
Token canopy
Token ID39418
Feature activation+6.82e-01

Pos logit contributions
 mend+0.171
genic+0.154
 LIST+0.145
phabet+0.144
��+0.143
Neg logit contributions
aan-0.182
xual-0.180
 laun-0.169
ista-0.168
asma-0.153
Token created
Token ID2727
Feature activation-1.14e+00

Pos logit contributions
aan+0.470
xual+0.465
 laun+0.435
ista+0.433
 differe+0.389
Neg logit contributions
 mend-0.533
genic-0.487
 LIST-0.459
phabet-0.457
��-0.456
Token by
Token ID416
Feature activation-1.03e-01

Pos logit contributions
 mend+0.871
genic+0.793
 LIST+0.748
 swe+0.741
phabet+0.736
Neg logit contributions
aan-0.799
xual-0.793
 laun-0.740
ista-0.731
asma-0.663
Token the
Token ID262
Feature activation-2.29e-01

Pos logit contributions
 mend+0.073
genic+0.066
 LIST+0.062
phabet+0.061
��+0.061
Neg logit contributions
aan-0.078
xual-0.077
 laun-0.073
ista-0.072
 differe-0.066
Token over
Token ID625
Feature activation-1.23e-01

Pos logit contributions
 mend+0.163
genic+0.148
 LIST+0.139
phabet+0.138
��+0.137
Neg logit contributions
aan-0.174
xual-0.172
 laun-0.162
ista-0.161
 differe-0.147
Tokenhang
Token ID33255
Feature activation-1.19e+00

Pos logit contributions
 mend+0.096
genic+0.088
 LIST+0.082
phabet+0.082
��+0.081
Neg logit contributions
aan-0.085
xual-0.085
 laun-0.079
ista-0.078
 differe-0.070
Token of
Token ID286
Feature activation-1.92e-02

Pos logit contributions
 mend+0.918
genic+0.836
 LIST+0.788
phabet+0.779
 swe+0.779
Neg logit contributions
aan-0.828
xual-0.821
 laun-0.768
ista-0.758
asma-0.687
Token the
Token ID262
Feature activation-1.21e-01

Pos logit contributions
 mend+0.014
genic+0.013
 LIST+0.012
phabet+0.012
��+0.012
Neg logit contributions
aan-0.014
xual-0.014
 laun-0.013
ista-0.013
 differe-0.012
Token building
Token ID2615
Feature activation-3.57e-01

Pos logit contributions
 mend+0.083
genic+0.075
 LIST+0.070
phabet+0.070
��+0.069
Neg logit contributions
aan-0.094
xual-0.094
 laun-0.088
ista-0.088
 differe-0.080
Token above
Token ID2029
Feature activation-6.72e-01

Pos logit contributions
 mend+0.278
genic+0.253
 LIST+0.239
phabet+0.237
 swe+0.236
Neg logit contributions
aan-0.247
xual-0.245
 laun-0.228
ista-0.226
 differe-0.204
Token it
Token ID340
Feature activation+3.31e-01

Pos logit contributions
aan+0.135
xual+0.134
 laun+0.126
ista+0.126
 differe+0.116
Neg logit contributions
 mend-0.104
genic-0.094
��-0.088
 LIST-0.087
phabet-0.087
Token.
Token ID13
Feature activation+2.29e-01

Pos logit contributions
aan+0.233
xual+0.231
 laun+0.216
ista+0.215
 differe+0.194
Neg logit contributions
 mend-0.254
genic-0.232
 LIST-0.218
phabet-0.217
��-0.217
Token\n
Token ID198
Feature activation+2.58e-01

Pos logit contributions
aan+0.184
xual+0.182
 laun+0.172
ista+0.171
 differe+0.157
Neg logit contributions
 mend-0.153
genic-0.138
 LIST-0.128
phabet-0.128
��-0.127
Token\n
Token ID198
Feature activation-1.27e-01

Pos logit contributions
aan+0.198
xual+0.197
 laun+0.186
ista+0.183
 differe+0.168
Neg logit contributions
 mend-0.181
genic-0.164
 LIST-0.154
phabet-0.152
 swe-0.151
TokenI
Token ID40
Feature activation-1.69e-01

Pos logit contributions
 mend+0.103
genic+0.094
 LIST+0.089
phabet+0.088
joining+0.088
Neg logit contributions
xual-0.083
aan-0.083
 laun-0.078
ista-0.076
 differe-0.069
Token am
Token ID716
Feature activation-5.54e-02

Pos logit contributions
 mend+0.116
genic+0.103
 LIST+0.097
phabet+0.096
 swe+0.096
Neg logit contributions
aan-0.133
xual-0.133
 laun-0.125
ista-0.124
asma-0.113
Token sure
Token ID1654
Feature activation-8.81e-01

Pos logit contributions
 mend+0.040
genic+0.036
 LIST+0.034
 swe+0.033
phabet+0.033
Neg logit contributions
aan-0.041
xual-0.041
 laun-0.039
ista-0.038
asma-0.035
Token that
Token ID326
Feature activation+1.72e-01

Pos logit contributions
 mend+0.619
genic+0.557
 LIST+0.524
phabet+0.516
 swe+0.515
Neg logit contributions
aan-0.678
xual-0.674
 laun-0.633
ista-0.629
 differe-0.573
Token we
Token ID356
Feature activation+3.61e-01

Pos logit contributions
aan+0.136
xual+0.135
 laun+0.127
ista+0.126
 differe+0.115
Neg logit contributions
 mend-0.118
genic-0.106
 LIST-0.099
phabet-0.098
��-0.098
Token have
Token ID423
Feature activation+2.10e-01

Pos logit contributions
aan+0.280
xual+0.278
 laun+0.262
ista+0.260
 differe+0.238
Neg logit contributions
 mend-0.250
genic-0.226
 LIST-0.211
phabet-0.210
��-0.210
Token completed
Token ID5668
Feature activation+2.33e-01

Pos logit contributions
aan+0.160
xual+0.159
 laun+0.149
ista+0.149
 differe+0.135
Neg logit contributions
 mend-0.150
genic-0.135
 LIST-0.127
phabet-0.125
��-0.125
Token.
Token ID13
Feature activation-5.29e-01

Pos logit contributions
aan+0.247
xual+0.245
 laun+0.229
ista+0.228
 differe+0.205
Neg logit contributions
 mend-0.277
genic-0.253
 LIST-0.239
phabet-0.237
��-0.237
Token During
Token ID5856
Feature activation-1.56e-01

Pos logit contributions
 mend+0.351
genic+0.313
 LIST+0.296
phabet+0.288
 swe+0.288
Neg logit contributions
aan-0.428
xual-0.427
 laun-0.402
ista-0.399
 differe-0.366
Token my
Token ID616
Feature activation-4.13e-01

Pos logit contributions
 mend+0.108
genic+0.098
 LIST+0.092
phabet+0.091
��+0.090
Neg logit contributions
aan-0.121
xual-0.120
 laun-0.113
ista-0.113
 differe-0.102
Token deb
Token ID1915
Feature activation-4.10e-02

Pos logit contributions
 mend+0.320
genic+0.288
 LIST+0.273
 swe+0.271
phabet+0.268
Neg logit contributions
xual-0.291
aan-0.290
 laun-0.268
ista-0.268
asma-0.241
Tokenrief
Token ID3796
Feature activation+1.78e+00

Pos logit contributions
 mend+0.047
genic+0.045
 LIST+0.043
phabet+0.043
joining+0.042
Neg logit contributions
aan-0.013
xual-0.013
 laun-0.011
ista-0.011
 differe-0.008
Token with
Token ID351
Feature activation-7.20e-02

Pos logit contributions
aan+1.184
xual+1.174
 laun+1.090
ista+1.090
 differe+0.972
Neg logit contributions
 mend-1.428
genic-1.312
 LIST-1.238
��-1.234
phabet-1.231
Token George
Token ID4502
Feature activation-2.54e-01

Pos logit contributions
 mend+0.052
genic+0.047
 LIST+0.045
phabet+0.044
��+0.044
Neg logit contributions
aan-0.054
xual-0.053
 laun-0.050
ista-0.050
 differe-0.045
Token he
Token ID339
Feature activation-1.84e-01

Pos logit contributions
 mend+0.182
genic+0.163
 LIST+0.154
phabet+0.152
 swe+0.151
Neg logit contributions
aan-0.192
xual-0.192
 laun-0.179
ista-0.178
 differe-0.162
Token informed
Token ID7981
Feature activation+7.18e-02

Pos logit contributions
 mend+0.139
genic+0.125
 LIST+0.118
 swe+0.117
phabet+0.116
Neg logit contributions
aan-0.134
xual-0.133
 laun-0.124
ista-0.123
asma-0.111
Token me
Token ID502
Feature activation-1.42e-01

Pos logit contributions
aan+0.066
xual+0.065
 laun+0.062
ista+0.062
 differe+0.057
Neg logit contributions
 mend-0.040
genic-0.035
 LIST-0.032
phabet-0.032
��-0.031
Token that
Token ID326
Feature activation-1.32e-02

Pos logit contributions
 mend+0.102
genic+0.092
 LIST+0.087
phabet+0.086
 swe+0.086
Neg logit contributions
aan-0.107
xual-0.107
 laun-0.100
ista-0.099
 differe-0.090
Token,
Token ID11
Feature activation+2.53e-01

Pos logit contributions
aan+0.531
xual+0.526
 laun+0.494
ista+0.492
 differe+0.444
Neg logit contributions
 mend-0.545
genic-0.494
 LIST-0.464
phabet-0.463
��-0.462
Token past
Token ID1613
Feature activation+5.44e-01

Pos logit contributions
aan+0.180
xual+0.179
 laun+0.167
ista+0.166
 differe+0.150
Neg logit contributions
 mend-0.193
genic-0.175
 LIST-0.165
phabet-0.164
��-0.163
Token the
Token ID262
Feature activation+1.11e-01

Pos logit contributions
aan+0.410
xual+0.407
 laun+0.382
ista+0.380
 differe+0.346
Neg logit contributions
 mend-0.390
genic-0.353
 LIST-0.331
phabet-0.329
��-0.327
Token ram
Token ID15770
Feature activation+4.14e-01

Pos logit contributions
aan+0.084
xual+0.084
 laun+0.079
ista+0.078
 differe+0.071
Neg logit contributions
 mend-0.080
genic-0.072
 LIST-0.068
phabet-0.067
��-0.067
Tokensh
Token ID1477
Feature activation-1.65e+00

Pos logit contributions
aan+0.347
xual+0.346
 laun+0.327
ista+0.325
 differe+0.300
Neg logit contributions
 mend-0.261
genic-0.234
 LIST-0.216
phabet-0.213
 swe-0.211
Tokenackle
Token ID20523
Feature activation-2.40e+00

Pos logit contributions
 mend+0.733
genic+0.624
 LIST+0.551
phabet+0.534
joining+0.532
Neg logit contributions
xual-1.680
aan-1.679
 laun-1.615
ista-1.591
 differe-1.502
Token family
Token ID1641
Feature activation+8.71e-01

Pos logit contributions
 mend+1.683
genic+1.495
 LIST+1.401
 swe+1.400
phabet+1.370
Neg logit contributions
xual-1.869
aan-1.847
 laun-1.723
ista-1.716
asma-1.566
Token home
Token ID1363
Feature activation+9.21e-01

Pos logit contributions
aan+0.651
xual+0.647
 laun+0.606
ista+0.604
 differe+0.548
Neg logit contributions
 mend-0.631
genic-0.571
 LIST-0.536
phabet-0.532
��-0.530
Token.
Token ID13
Feature activation-3.46e-01

Pos logit contributions
aan+0.640
xual+0.633
 laun+0.593
ista+0.590
 differe+0.531
Neg logit contributions
 mend-0.713
genic-0.651
 LIST-0.614
phabet-0.612
��-0.612
Token Its
Token ID6363
Feature activation-3.68e-01

Pos logit contributions
 mend+0.228
genic+0.204
 LIST+0.191
phabet+0.188
��+0.187
Neg logit contributions
aan-0.282
xual-0.280
 laun-0.264
ista-0.263
 differe-0.241
Token waters
Token ID10150
Feature activation+8.81e-01

Pos logit contributions
 mend+0.271
genic+0.244
 LIST+0.230
 swe+0.227
phabet+0.225
Neg logit contributions
aan-0.273
xual-0.273
 laun-0.254
ista-0.252
 differe-0.229
Tokenalso
Token ID14508
Feature activation-7.55e-01

Pos logit contributions
aan+0.007
xual+0.007
 laun+0.006
ista+0.006
asma+0.005
Neg logit contributions
 mend-0.007
genic-0.006
 LIST-0.006
��-0.006
phabet-0.006
Token known
Token ID1900
Feature activation+3.55e-01

Pos logit contributions
 mend+0.445
genic+0.393
 LIST+0.363
phabet+0.359
��+0.357
Neg logit contributions
aan-0.667
xual-0.663
 laun-0.628
ista-0.625
 differe-0.577
Token as
Token ID355
Feature activation+2.03e-01

Pos logit contributions
aan+0.293
xual+0.292
ista+0.275
 laun+0.275
 differe+0.253
Neg logit contributions
 mend-0.227
genic-0.202
 LIST-0.189
��-0.188
phabet-0.188
Token "
Token ID366
Feature activation-8.49e-02

Pos logit contributions
aan+0.145
xual+0.144
 laun+0.135
ista+0.134
 differe+0.121
Neg logit contributions
 mend-0.153
genic-0.139
 LIST-0.131
phabet-0.131
��-0.130
TokenAc
Token ID12832
Feature activation-5.56e-01

Pos logit contributions
 mend+0.067
genic+0.062
 LIST+0.058
phabet+0.058
��+0.058
Neg logit contributions
aan-0.057
xual-0.057
 laun-0.053
ista-0.053
 differe-0.048
Tokenro
Token ID305
Feature activation-2.29e+00

Pos logit contributions
 mend+0.532
genic+0.494
 LIST+0.472
phabet+0.470
��+0.469
Neg logit contributions
aan-0.285
xual-0.282
 laun-0.257
ista-0.255
 differe-0.219
Tokenker
Token ID6122
Feature activation-1.20e+00

Pos logit contributions
 mend+1.582
genic+1.464
 LIST+1.334
phabet+1.332
joining+1.318
Neg logit contributions
aan-1.773
xual-1.767
 laun-1.666
ista-1.641
 differe-1.512
Tokenato
Token ID5549
Feature activation+8.05e-01

Pos logit contributions
 mend+0.892
genic+0.831
phabet+0.768
 LIST+0.756
joining+0.751
Neg logit contributions
xual-0.865
aan-0.851
 laun-0.813
ista-0.785
 differe-0.727
Tokenel
Token ID417
Feature activation-9.72e-01

Pos logit contributions
aan+0.589
xual+0.587
 laun+0.550
ista+0.544
 differe+0.496
Neg logit contributions
 mend-0.593
genic-0.541
 LIST-0.505
phabet-0.504
��-0.498
Tokenast
Token ID459
Feature activation-1.42e+00

Pos logit contributions
 mend+0.730
genic+0.670
phabet+0.620
 LIST+0.616
joining+0.614
Neg logit contributions
xual-0.700
aan-0.694
 laun-0.657
ista-0.634
 differe-0.585
Tokenoid
Token ID1868
Feature activation+2.56e-01

Pos logit contributions
 mend+1.167
genic+1.074
phabet+1.009
 LIST+1.009
joining+0.997
Neg logit contributions
xual-0.919
aan-0.911
 laun-0.857
ista-0.828
 differe-0.753
Token captain
Token ID10654
Feature activation-4.44e-01

Pos logit contributions
 mend+0.126
genic+0.113
 LIST+0.107
 swe+0.106
phabet+0.105
Neg logit contributions
aan-0.125
xual-0.125
 laun-0.116
ista-0.115
asma-0.105
Token.
Token ID13
Feature activation+3.91e-01

Pos logit contributions
 mend+0.338
genic+0.307
 LIST+0.290
phabet+0.287
 swe+0.286
Neg logit contributions
aan-0.316
xual-0.314
 laun-0.294
ista-0.291
 differe-0.264
Token The
Token ID383
Feature activation+7.75e-02

Pos logit contributions
aan+0.300
xual+0.298
 laun+0.280
ista+0.278
 differe+0.254
Neg logit contributions
 mend-0.275
genic-0.248
 LIST-0.234
phabet-0.231
��-0.230
Token inability
Token ID16612
Feature activation-2.55e-01

Pos logit contributions
aan+0.056
xual+0.056
 laun+0.052
ista+0.052
asma+0.047
Neg logit contributions
 mend-0.058
genic-0.052
 LIST-0.050
phabet-0.049
 swe-0.049
Token to
Token ID284
Feature activation-1.55e-01

Pos logit contributions
 mend+0.195
genic+0.178
 LIST+0.168
phabet+0.167
��+0.166
Neg logit contributions
aan-0.179
xual-0.178
 laun-0.166
ista-0.165
 differe-0.149
Token pinpoint
Token ID30534
Feature activation-2.17e+00

Pos logit contributions
 mend+0.112
genic+0.098
 LIST+0.094
 swe+0.092
 enlist+0.090
Neg logit contributions
aan-0.119
xual-0.119
ista-0.111
 laun-0.111
asma-0.101
Token the
Token ID262
Feature activation+5.74e-02

Pos logit contributions
 mend+1.510
genic+1.329
 LIST+1.254
 swe+1.249
 enlist+1.211
Neg logit contributions
xual-1.721
aan-1.698
 laun-1.575
ista-1.574
asma-1.447
Token trigger
Token ID7616
Feature activation-1.32e-01

Pos logit contributions
xual+0.040
aan+0.040
 laun+0.037
ista+0.037
asma+0.033
Neg logit contributions
 mend-0.045
genic-0.040
 LIST-0.038
 swe-0.038
phabet-0.037
Token leaves
Token ID5667
Feature activation-1.91e-01

Pos logit contributions
 mend+0.097
genic+0.088
 LIST+0.083
 swe+0.082
phabet+0.082
Neg logit contributions
aan-0.098
xual-0.097
 laun-0.090
ista-0.089
asma-0.082
Token a
Token ID257
Feature activation-2.92e-01

Pos logit contributions
 mend+0.141
genic+0.126
 LIST+0.120
 swe+0.118
phabet+0.118
Neg logit contributions
aan-0.141
xual-0.140
 laun-0.130
ista-0.130
asma-0.118
Token crucial
Token ID8780
Feature activation+1.55e-01

Pos logit contributions
 mend+0.219
genic+0.195
 LIST+0.186
 swe+0.184
phabet+0.183
Neg logit contributions
xual-0.214
aan-0.213
 laun-0.198
ista-0.197
asma-0.178
Token\n
Token ID198
Feature activation-1.19e-01

Pos logit contributions
 mend+0.051
genic+0.045
 LIST+0.042
phabet+0.041
��+0.041
Neg logit contributions
aan-0.078
xual-0.078
 laun-0.074
ista-0.074
 differe-0.068
Token\n
Token ID198
Feature activation-2.89e-01

Pos logit contributions
 mend+0.081
genic+0.073
 LIST+0.069
 swe+0.067
joining+0.066
Neg logit contributions
xual-0.094
aan-0.094
 laun-0.090
ista-0.086
 differe-0.081
TokenCopyright
Token ID15269
Feature activation+4.63e-01

Pos logit contributions
 mend+0.241
genic+0.223
 LIST+0.210
phabet+0.208
joining+0.206
Neg logit contributions
xual-0.185
aan-0.184
 laun-0.173
ista-0.167
 differe-0.153
Token ©
Token ID10673
Feature activation-7.10e-01

Pos logit contributions
aan+0.360
xual+0.358
 laun+0.337
ista+0.335
 differe+0.305
Neg logit contributions
 mend-0.321
genic-0.290
 LIST-0.271
phabet-0.269
��-0.268
Token 2016
Token ID1584
Feature activation-5.37e-01

Pos logit contributions
 mend+0.528
genic+0.477
 LIST+0.453
phabet+0.441
 swe+0.438
Neg logit contributions
aan-0.519
xual-0.518
 laun-0.484
ista-0.476
 differe-0.436
Token Else
Token ID25974
Feature activation-2.73e+00

Pos logit contributions
 mend+0.354
genic+0.316
 LIST+0.296
phabet+0.291
 swe+0.289
Neg logit contributions
aan-0.437
xual-0.436
 laun-0.409
ista-0.405
 differe-0.374
Tokenvier
Token ID49663
Feature activation+8.44e-01

Pos logit contributions
 mend+1.575
genic+1.395
 LIST+1.281
phabet+1.264
joining+1.238
Neg logit contributions
xual-2.465
aan-2.409
 laun-2.317
ista-2.257
 differe-2.113
Token B
Token ID347
Feature activation-4.60e-01

Pos logit contributions
aan+0.534
xual+0.529
 laun+0.490
ista+0.486
 differe+0.433
Neg logit contributions
 mend-0.709
genic-0.651
 LIST-0.618
phabet-0.614
��-0.611
Token.
Token ID13
Feature activation+7.55e-02

Pos logit contributions
 mend+0.342
genic+0.310
 LIST+0.296
phabet+0.294
 swe+0.290
Neg logit contributions
aan-0.336
xual-0.332
 laun-0.312
ista-0.300
 differe-0.280
TokenV
Token ID53
Feature activation-4.97e-01

Pos logit contributions
aan+0.055
xual+0.055
 laun+0.052
ista+0.051
 differe+0.047
Neg logit contributions
 mend-0.056
genic-0.050
 LIST-0.047
phabet-0.047
��-0.047
Token.
Token ID13
Feature activation+2.00e-01

Pos logit contributions
 mend+0.400
genic+0.365
 LIST+0.346
��+0.344
phabet+0.343
Neg logit contributions
aan-0.332
xual-0.327
ista-0.307
 laun-0.305
asma-0.272
Tokenly
Token ID306
Feature activation-6.69e-01

Pos logit contributions
 mend+0.040
genic+0.036
 LIST+0.033
phabet+0.033
��+0.033
Neg logit contributions
aan-0.052
xual-0.052
 laun-0.049
ista-0.049
 differe-0.045
Token golf
Token ID13126
Feature activation+1.88e-03

Pos logit contributions
 mend+0.484
genic+0.437
 LIST+0.411
phabet+0.406
 swe+0.406
Neg logit contributions
aan-0.502
xual-0.497
ista-0.465
 laun-0.465
asma-0.422
Token course
Token ID1781
Feature activation+8.56e-01

Pos logit contributions
aan+0.001
xual+0.001
 laun+0.001
ista+0.001
asma+0.001
Neg logit contributions
 mend-0.001
genic-0.001
 LIST-0.001
 swe-0.001
phabet-0.001
Token revenues
Token ID13089
Feature activation-2.96e-02

Pos logit contributions
aan+0.698
xual+0.693
 laun+0.655
ista+0.652
 differe+0.597
Neg logit contributions
 mend-0.560
genic-0.502
 LIST-0.468
phabet-0.464
��-0.462
Token and
Token ID290
Feature activation-3.13e-01

Pos logit contributions
 mend+0.023
genic+0.021
 LIST+0.020
phabet+0.020
 swe+0.019
Neg logit contributions
aan-0.021
xual-0.021
 laun-0.019
ista-0.019
 differe-0.017
Token condo
Token ID32119
Feature activation-2.30e+00

Pos logit contributions
 mend+0.210
genic+0.189
 LIST+0.176
phabet+0.175
��+0.174
Neg logit contributions
aan-0.250
xual-0.249
 laun-0.234
ista-0.233
 differe-0.213
Token sales
Token ID4200
Feature activation-7.42e-02

Pos logit contributions
 mend+1.510
genic+1.345
 swe+1.252
 LIST+1.249
phabet+1.233
Neg logit contributions
xual-1.896
aan-1.895
ista-1.751
 laun-1.748
asma-1.613
Token
Token ID960
Feature activation+4.61e-01

Pos logit contributions
 mend+0.057
genic+0.052
 LIST+0.049
phabet+0.049
��+0.049
Neg logit contributions
aan-0.052
xual-0.051
 laun-0.048
ista-0.048
 differe-0.043
Tokenfor
Token ID1640
Feature activation+1.24e-01

Pos logit contributions
aan+0.329
xual+0.325
 laun+0.305
ista+0.303
 differe+0.274
Neg logit contributions
 mend-0.349
genic-0.318
 LIST-0.299
��-0.298
phabet-0.297
Token a
Token ID257
Feature activation-3.95e-02

Pos logit contributions
aan+0.090
xual+0.090
 laun+0.084
ista+0.083
 differe+0.075
Neg logit contributions
 mend-0.093
genic-0.084
 LIST-0.079
phabet-0.079
��-0.078
Token total
Token ID2472
Feature activation+1.16e+00

Pos logit contributions
 mend+0.031
genic+0.028
 LIST+0.027
phabet+0.027
 swe+0.026
Neg logit contributions
aan-0.027
xual-0.027
 laun-0.025
ista-0.025
asma-0.022