TOP ACTIVATIONS
MAX = 2.620

in several initiatives . ( Nick Koz ak file photo for
t have the right to fail ,
ga ess ner , and Nick Hopkins , it was the
. Dr Michael Naz ir - Ali , the
The BBC One TV series Planet Earth had a segment on
groundwater withdrawal , and mitigate salt blow outs and sand storms
. According to Nick Sur gey , C MD
called into question , Dr Naz ir - Ali said .
Thanks for subscribing ! <|endoftext|> Nick Saban has not been hesitant
the other . See , Zak ov itch ,
refused one . From Ed Koch on down , every mayor
, 2016 . REUTERS / Nick Oxford / File Photo
unch kins - J ill Nick all ini - Jim Vs
However , Dr Naz ir - Ali , who
GA : US Department of Health and Human Services , CDC
In 1985 the US Postal Service issued a stamp in
Bible scholar , Y air Zak ov itch of Hebrew University
photo posted by Krist in Cav all ari (@ k rist
written a comment online about Naval ny s protest
s objective of reducing carbon dioxide emissions by 40 %

MIN ACTIVATIONS
MIN = -2.797

deal to save the Def erred Action for Childhood Arri vals
upcoming revival of a historic Broadway musical has gone missing .
this season after tearing his ACL last year .
he spr ained his left ACL after a vicious back hand
irk Cousins ( vs . BU F ): He 's been
der 's season with an ACL sp rain . Brian Bab
was also pursued heavily by Notre Dame , which was the
them near the sunk en fireplace . After that , he
in F ela ! on Broadway , and national and international
cent had been approved by Congress , and the new metal
. Gand alf convin ces Tree beard to send an army
they are befriend ed by Tree beard , the oldest of
and displeasure at how his code has since been abused by
restroom , no current statewide statute addresses entering an opposite -
5 . He Works For Flynn Intel Group , Which
' A wonderful angel ' A retired
in the event of a default but not over late or
a problem , I know Congress will work with the industry
to bring the problems to Congress so we can do that
pas aurus rex in our debugger : 1 $

INTERVAL 1.266 - 2.620
CONTAINS 0.632%

an interest ( in the overweight / ob ese girls )
bis ect ing the region vertically . F illing the land
period , one man was beaten and robbed , three fights
the R FI source ( com puters and monitors in our
. I also did two coats and it had to dry

INTERVAL -0.088 - 1.266
CONTAINS 58.196%

misses . Taking your stance at your starting distance
oses over the entire palm opl ant ar surfaces , beginning
of America recently agreed to pay a fine of $ 16
its share of crises in recent decades , it is worth
released suspect information from any of the investigations . <|endoftext|> It

INTERVAL -1.443 - -0.088
CONTAINS 40.707%

is Moreno Instit uto de Pol í tic as y
That is what everyone was doing . Glenn McGr ath had
others and thus sought to portray me as a r
were also taken .[ 12 ][ 15 ] Results
bring the deficit down to 6 . 3 percent of GDP

INTERVAL -2.797 - -1.443
CONTAINS 0.464%

of the 5 PM Monday deadline for all NHL teams to
his long - prom ised wall along the U . S
life on the streets of Wa ik iki , and feels
ov , along with his cousin , Nikola Gr ue vs
and active bystand er training .
Token in
Token ID287
Feature activation+8.16e-02

Pos logit contributions
elsen+0.074
sole+0.073
 Nanto+0.071
anto+0.065
urus+0.065
Neg logit contributions
geries-0.084
GA-0.078
gotten-0.075
gery-0.075
ODY-0.072
Token several
Token ID1811
Feature activation+5.46e-02

Pos logit contributions
elsen+0.269
sole+0.264
 Nanto+0.259
anto+0.239
urus+0.238
Neg logit contributions
geries-0.261
GA-0.239
gery-0.230
gotten-0.230
ODY-0.223
Token initiatives
Token ID15446
Feature activation+1.46e-01

Pos logit contributions
elsen+0.178
sole+0.174
 Nanto+0.171
anto+0.157
urus+0.156
Neg logit contributions
geries-0.178
GA-0.163
gery-0.156
gotten-0.156
ODY-0.151
Token.
Token ID13
Feature activation-1.97e-01

Pos logit contributions
elsen+0.421
sole+0.412
 Nanto+0.404
anto+0.368
urus+0.365
Neg logit contributions
geries-0.528
GA-0.487
gotten-0.471
gery-0.471
ODY-0.457
Token (
Token ID357
Feature activation+6.00e-01

Pos logit contributions
geries+0.665
GA+0.614
gery+0.590
gotten+0.586
ODY+0.569
Neg logit contributions
elsen-0.617
sole-0.610
 Nanto-0.589
anto-0.546
urus-0.541
Token Nick
Token ID8047
Feature activation+2.62e+00

Pos logit contributions
elsen+2.048
sole+2.008
 Nanto+1.991
anto+1.826
urus+1.819
Neg logit contributions
geries-1.852
GA-1.673
gotten-1.617
gery-1.614
ODY-1.547
Token Koz
Token ID40772
Feature activation+1.61e+00

Pos logit contributions
elsen+7.631
sole+7.252
 Nanto+6.935
anto+6.773
ross+6.641
Neg logit contributions
geries-8.831
GA-8.245
��-7.954
gotten-7.910
gery-7.848
Tokenak
Token ID461
Feature activation+2.70e-01

Pos logit contributions
elsen+4.677
sole+4.490
 Nanto+4.287
anto+4.090
urus+4.043
Neg logit contributions
geries-5.727
GA-5.289
gotten-5.094
ODY-5.048
YP-5.039
Token file
Token ID2393
Feature activation+2.20e-01

Pos logit contributions
elsen+0.741
sole+0.724
 Nanto+0.706
anto+0.642
urus+0.635
Neg logit contributions
geries-1.018
GA-0.941
gotten-0.912
gery-0.910
ODY-0.886
Token photo
Token ID4590
Feature activation+1.19e+00

Pos logit contributions
elsen+0.571
sole+0.556
 Nanto+0.544
anto+0.490
urus+0.486
Neg logit contributions
geries-0.862
GA-0.801
gery-0.776
gotten-0.776
ODY-0.754
Token for
Token ID329
Feature activation-1.50e+00

Pos logit contributions
elsen+2.291
sole+2.189
 Nanto+2.132
anto+1.861
urus+1.836
Neg logit contributions
geries-5.406
GA-5.067
gotten-4.954
gery-4.915
ODY-4.820
Token
Token ID447
Feature activation+1.63e-01

Pos logit contributions
elsen+0.142
sole+0.139
 Nanto+0.136
anto+0.125
urus+0.124
Neg logit contributions
geries-0.173
GA-0.157
gotten-0.153
gery-0.153
ODY-0.148
Token
Token ID247
Feature activation+6.23e-03

Pos logit contributions
elsen+0.638
sole+0.629
 Nanto+0.615
anto+0.578
urus+0.577
Neg logit contributions
geries-0.426
GA-0.376
gery-0.361
gotten-0.358
ODY-0.345
Tokent
Token ID83
Feature activation+3.50e-01

Pos logit contributions
elsen+0.019
sole+0.018
 Nanto+0.018
anto+0.016
urus+0.016
Neg logit contributions
geries-0.022
GA-0.020
gery-0.020
gotten-0.020
ODY-0.019
Token have
Token ID423
Feature activation-4.00e-01

Pos logit contributions
elsen+1.008
sole+0.989
 Nanto+0.966
anto+0.885
urus+0.876
Neg logit contributions
geries-1.271
GA-1.170
gotten-1.132
gery-1.131
ODY-1.099
Token the
Token ID262
Feature activation+1.20e+00

Pos logit contributions
geries+1.270
GA+1.165
gotten+1.121
gery+1.116
ODY+1.078
Neg logit contributions
elsen-1.322
sole-1.292
 Nanto-1.280
anto-1.173
urus-1.167
Token right
Token ID826
Feature activation+2.56e+00

Pos logit contributions
elsen+3.991
sole+3.912
 Nanto+3.732
anto+3.505
urus+3.473
Neg logit contributions
geries-3.855
GA-3.453
gery-3.364
gotten-3.321
��-3.199
Token to
Token ID284
Feature activation+1.17e+00

Pos logit contributions
elsen+7.224
sole+7.221
 Nanto+6.779
urus+6.292
anto+6.109
Neg logit contributions
geries-9.003
GA-7.980
gery-7.935
gotten-7.814
��-7.731
Token fail
Token ID2038
Feature activation-2.26e-01

Pos logit contributions
elsen+3.302
sole+3.276
 Nanto+3.170
anto+2.914
urus+2.873
Neg logit contributions
geries-4.213
GA-3.841
gotten-3.765
gery-3.716
ODY-3.698
Token,
Token ID11
Feature activation+4.80e-01

Pos logit contributions
geries+0.742
GA+0.679
gery+0.654
gotten+0.654
ODY+0.632
Neg logit contributions
elsen-0.730
sole-0.715
 Nanto-0.703
anto-0.647
urus-0.643
Token
Token ID447
Feature activation-6.52e-02

Pos logit contributions
elsen+1.536
sole+1.506
 Nanto+1.480
anto+1.365
urus+1.353
Neg logit contributions
geries-1.590
GA-1.451
gery-1.403
gotten-1.401
ODY-1.349
Token
Token ID251
Feature activation+1.26e-02

Pos logit contributions
geries+0.162
GA+0.144
gery+0.136
gotten+0.136
ODY+0.130
Neg logit contributions
elsen-0.263
sole-0.259
 Nanto-0.256
anto-0.239
urus-0.238
Tokenga
Token ID4908
Feature activation-5.99e-01

Pos logit contributions
elsen+0.595
sole+0.582
 Nanto+0.569
anto+0.518
urus+0.513
Neg logit contributions
geries-0.765
GA-0.707
gery-0.683
gotten-0.682
ODY-0.663
Tokeness
Token ID408
Feature activation+9.19e-01

Pos logit contributions
geries+2.124
GA+1.982
gery+1.909
gotten+1.907
YP+1.840
Neg logit contributions
sole-1.714
elsen-1.685
 Nanto-1.664
anto-1.495
urus-1.479
Tokenner
Token ID1008
Feature activation-3.49e-01

Pos logit contributions
elsen+2.506
sole+2.434
 Nanto+2.350
anto+2.158
urus+2.135
Neg logit contributions
geries-3.516
GA-3.231
gery-3.135
gotten-3.128
ODY-3.051
Token,
Token ID11
Feature activation-3.76e-01

Pos logit contributions
geries+1.091
GA+1.003
gery+0.957
gotten+0.956
ODY+0.923
Neg logit contributions
elsen-1.169
sole-1.151
 Nanto-1.127
urus-1.040
anto-1.040
Token and
Token ID290
Feature activation-2.17e-01

Pos logit contributions
geries+1.224
GA+1.123
gotten+1.077
gery+1.076
ODY+1.043
Neg logit contributions
elsen-1.225
sole-1.202
 Nanto-1.176
anto-1.085
urus-1.082
Token Nick
Token ID8047
Feature activation+2.54e+00

Pos logit contributions
geries+0.667
GA+0.607
gery+0.582
gotten+0.582
ODY+0.561
Neg logit contributions
elsen-0.749
sole-0.735
 Nanto-0.723
anto-0.669
urus-0.666
Token Hopkins
Token ID21183
Feature activation+1.97e+00

Pos logit contributions
elsen+7.872
sole+7.687
 Nanto+7.408
anto+6.998
ross+6.909
Neg logit contributions
geries-8.126
GA-7.440
ODY-7.209
gotten-7.144
gery-7.142
Token,
Token ID11
Feature activation-2.25e-01

Pos logit contributions
elsen+5.884
sole+5.785
 Nanto+5.581
anto+5.361
urst+5.206
Neg logit contributions
geries-6.675
GA-6.056
gery-5.900
gotten-5.899
YP-5.716
Token it
Token ID340
Feature activation-2.87e-01

Pos logit contributions
geries+0.730
GA+0.667
gery+0.641
gotten+0.641
ODY+0.620
Neg logit contributions
elsen-0.735
sole-0.720
 Nanto-0.708
anto-0.651
urus-0.649
Token was
Token ID373
Feature activation+5.25e-01

Pos logit contributions
geries+0.888
GA+0.810
gery+0.776
gotten+0.776
ODY+0.748
Neg logit contributions
elsen-0.978
sole-0.958
 Nanto-0.944
anto-0.871
urus-0.866
Token the
Token ID262
Feature activation+5.45e-01

Pos logit contributions
elsen+1.639
sole+1.614
 Nanto+1.569
anto+1.454
urus+1.439
Neg logit contributions
geries-1.741
GA-1.603
gotten-1.549
gery-1.546
ODY-1.492
Token.
Token ID13
Feature activation+1.92e-01

Pos logit contributions
elsen+0.580
sole+0.569
 Nanto+0.560
anto+0.521
urus+0.515
Neg logit contributions
geries-0.493
GA-0.446
gotten-0.429
gery-0.428
ODY-0.411
Token
Token ID198
Feature activation+4.41e-01

Pos logit contributions
elsen+0.596
sole+0.582
 Nanto+0.571
anto+0.523
urus+0.521
Neg logit contributions
geries-0.657
GA-0.609
gery-0.584
gotten-0.583
ODY-0.567
Token
Token ID198
Feature activation+4.90e-01

Pos logit contributions
elsen+1.211
sole+1.206
 Nanto+1.155
anto+1.058
ross+1.056
Neg logit contributions
geries-1.671
GA-1.489
gotten-1.476
gery-1.469
��-1.438
TokenDr
Token ID6187
Feature activation-4.77e-01

Pos logit contributions
elsen+1.564
sole+1.561
 Nanto+1.482
anto+1.398
ross+1.389
Neg logit contributions
geries-1.625
GA-1.431
gery-1.421
gotten-1.416
��-1.354
Token Michael
Token ID3899
Feature activation+1.02e+00

Pos logit contributions
geries+1.498
GA+1.371
gery+1.319
gotten+1.312
ODY+1.265
Neg logit contributions
elsen-1.593
sole-1.572
 Nanto-1.538
anto-1.413
urus-1.409
Token Naz
Token ID12819
Feature activation+2.46e+00

Pos logit contributions
elsen+3.182
sole+3.095
 Nanto+3.076
anto+2.822
urus+2.770
Neg logit contributions
geries-3.446
GA-3.116
gotten-3.038
gery-3.028
ODY-2.965
Tokenir
Token ID343
Feature activation+2.33e-01

Pos logit contributions
elsen+7.257
sole+7.026
 Nanto+6.736
anto+6.478
urus+6.365
Neg logit contributions
geries-8.373
GA-7.440
ODY-7.391
YP-7.245
gotten-7.243
Token-
Token ID12
Feature activation-2.63e-03

Pos logit contributions
elsen+0.754
sole+0.739
 Nanto+0.726
anto+0.669
urus+0.664
Neg logit contributions
geries-0.765
GA-0.700
gotten-0.674
gery-0.674
ODY-0.651
TokenAli
Token ID37893
Feature activation-1.40e+00

Pos logit contributions
geries+0.008
GA+0.007
gotten+0.007
gery+0.006
ODY+0.006
Neg logit contributions
elsen-0.010
sole-0.009
 Nanto-0.009
anto-0.009
urus-0.009
Token,
Token ID11
Feature activation-4.28e-02

Pos logit contributions
geries+4.215
GA+4.001
gery+3.776
gotten+3.750
ODY+3.627
Neg logit contributions
elsen-4.520
sole-4.495
 Nanto-4.387
urus-4.086
ross-3.987
Token the
Token ID262
Feature activation-3.86e-02

Pos logit contributions
geries+0.141
GA+0.130
gery+0.125
gotten+0.125
ODY+0.121
Neg logit contributions
elsen-0.138
sole-0.135
 Nanto-0.132
anto-0.122
urus-0.121
TokenThe
Token ID464
Feature activation+1.26e-01

Pos logit contributions
geries+0.362
GA+0.331
gery+0.318
gotten+0.318
ODY+0.307
Neg logit contributions
elsen-0.369
sole-0.362
 Nanto-0.356
anto-0.328
urus-0.326
Token BBC
Token ID7823
Feature activation+8.95e-01

Pos logit contributions
elsen+0.380
sole+0.371
 Nanto+0.365
anto+0.333
urus+0.331
Neg logit contributions
geries-0.442
GA-0.406
gery-0.392
gotten-0.392
ODY-0.380
Token One
Token ID1881
Feature activation-4.02e-01

Pos logit contributions
elsen+2.693
sole+2.612
 Nanto+2.599
anto+2.382
urus+2.334
Neg logit contributions
geries-3.105
GA-2.827
gery-2.745
gotten-2.742
ODY-2.661
Token TV
Token ID3195
Feature activation+1.73e-01

Pos logit contributions
geries+1.546
GA+1.436
gery+1.392
gotten+1.390
ODY+1.347
Neg logit contributions
elsen-1.070
sole-1.047
 Nanto-1.022
anto-0.919
urus-0.917
Token series
Token ID2168
Feature activation-1.11e-01

Pos logit contributions
elsen+0.456
sole+0.443
 Nanto+0.435
anto+0.394
urus+0.389
Neg logit contributions
geries-0.670
GA-0.620
gotten-0.602
gery-0.601
ODY-0.586
Token Planet
Token ID11397
Feature activation+2.36e+00

Pos logit contributions
geries+0.378
GA+0.347
gery+0.334
gotten+0.334
ODY+0.323
Neg logit contributions
elsen-0.347
sole-0.339
 Nanto-0.333
anto-0.305
urus-0.304
Token Earth
Token ID3668
Feature activation+1.00e+00

Pos logit contributions
elsen+6.929
sole+6.684
 Nanto+6.673
anto+6.242
ross+6.202
Neg logit contributions
geries-8.040
GA-7.131
gotten-7.045
gery-7.021
ODY-6.932
Token had
Token ID550
Feature activation+4.89e-02

Pos logit contributions
elsen+3.092
sole+2.995
 Nanto+2.959
anto+2.745
urus+2.689
Neg logit contributions
geries-3.410
GA-3.091
gotten-3.000
gery-2.991
ODY-2.940
Token a
Token ID257
Feature activation+5.30e-01

Pos logit contributions
elsen+0.156
sole+0.153
 Nanto+0.150
anto+0.139
urus+0.137
Neg logit contributions
geries-0.162
GA-0.148
gery-0.143
gotten-0.142
ODY-0.139
Token segment
Token ID10618
Feature activation+9.90e-02

Pos logit contributions
elsen+1.592
sole+1.561
 Nanto+1.535
anto+1.402
urus+1.387
Neg logit contributions
geries-1.849
GA-1.707
gotten-1.641
gery-1.639
ODY-1.604
Token on
Token ID319
Feature activation-3.79e-02

Pos logit contributions
elsen+0.297
sole+0.290
 Nanto+0.285
anto+0.260
urus+0.258
Neg logit contributions
geries-0.348
GA-0.321
gotten-0.310
gery-0.309
ODY-0.300
Token groundwater
Token ID34573
Feature activation+5.23e-01

Pos logit contributions
elsen+0.887
sole+0.873
 Nanto+0.857
anto+0.790
urus+0.781
Neg logit contributions
geries-0.961
GA-0.875
gotten-0.848
gery-0.846
ODY-0.822
Token withdrawal
Token ID15220
Feature activation+9.70e-01

Pos logit contributions
elsen+1.649
sole+1.618
 Nanto+1.602
anto+1.466
urus+1.454
Neg logit contributions
geries-1.744
GA-1.594
gery-1.535
gotten-1.531
ODY-1.490
Token,
Token ID11
Feature activation+8.70e-02

Pos logit contributions
elsen+2.969
sole+2.918
 Nanto+2.848
anto+2.625
ross+2.617
Neg logit contributions
geries-3.312
GA-3.007
gotten-2.926
gery-2.907
ODY-2.858
Token and
Token ID290
Feature activation+2.48e-01

Pos logit contributions
elsen+0.270
sole+0.265
 Nanto+0.260
anto+0.238
urus+0.237
Neg logit contributions
geries-0.297
GA-0.273
gery-0.263
gotten-0.263
ODY-0.254
Token mitigate
Token ID24237
Feature activation+1.60e-01

Pos logit contributions
elsen+0.790
sole+0.777
 Nanto+0.764
anto+0.704
urus+0.698
Neg logit contributions
geries-0.824
GA-0.750
gotten-0.726
gery-0.724
ODY-0.703
Token salt
Token ID8268
Feature activation+2.34e+00

Pos logit contributions
elsen+0.535
sole+0.524
 Nanto+0.516
anto+0.477
urus+0.473
Neg logit contributions
geries-0.505
GA-0.460
gery-0.443
gotten-0.442
ODY-0.428
Token blow
Token ID6611
Feature activation+5.04e-01

Pos logit contributions
elsen+6.456
 Nanto+6.382
sole+6.339
anto+5.854
urus+5.821
Neg logit contributions
geries-8.036
GA-7.293
gotten-7.114
gery-7.080
ODY-7.031
Tokenouts
Token ID5269
Feature activation-9.36e-02

Pos logit contributions
elsen+1.529
sole+1.499
 Nanto+1.467
anto+1.345
urus+1.337
Neg logit contributions
geries-1.748
GA-1.611
gotten-1.552
gery-1.551
ODY-1.504
Token and
Token ID290
Feature activation-3.05e-01

Pos logit contributions
geries+0.294
GA+0.269
gery+0.258
gotten+0.258
ODY+0.248
Neg logit contributions
elsen-0.316
sole-0.308
 Nanto-0.304
anto-0.280
urus-0.279
Token sand
Token ID6450
Feature activation+6.50e-01

Pos logit contributions
geries+1.042
GA+0.961
gotten+0.927
gery+0.923
YP+0.891
Neg logit contributions
elsen-0.942
sole-0.921
 Nanto-0.903
anto-0.823
urus-0.819
Token storms
Token ID20228
Feature activation+3.00e-01

Pos logit contributions
elsen+2.281
sole+2.256
 Nanto+2.212
anto+2.055
urus+2.040
Neg logit contributions
geries-1.920
GA-1.747
gotten-1.676
gery-1.668
ODY-1.621
Token.
Token ID13
Feature activation-9.79e-02

Pos logit contributions
elsen+0.516
sole+0.506
 Nanto+0.498
anto+0.462
urus+0.460
Neg logit contributions
geries-0.418
GA-0.378
gery-0.362
gotten-0.362
ODY-0.348
Token
Token ID198
Feature activation-2.34e-01

Pos logit contributions
geries+0.326
GA+0.302
gery+0.289
gotten+0.288
ODY+0.280
Neg logit contributions
elsen-0.311
sole-0.305
 Nanto-0.298
anto-0.275
urus-0.273
Token
Token ID198
Feature activation-3.18e-01

Pos logit contributions
geries+0.826
GA+0.757
gotten+0.732
gery+0.732
ODY+0.709
Neg logit contributions
elsen-0.700
sole-0.685
 Nanto-0.671
anto-0.614
urus-0.609
TokenAccording
Token ID4821
Feature activation+6.90e-01

Pos logit contributions
geries+1.014
GA+0.936
gotten+0.894
gery+0.892
ODY+0.864
Neg logit contributions
elsen-1.051
sole-1.028
 Nanto-1.019
anto-0.935
urus-0.929
Token to
Token ID284
Feature activation+1.39e-01

Pos logit contributions
elsen+1.487
sole+1.456
 Nanto+1.406
anto+1.242
ross+1.219
Neg logit contributions
geries-3.003
GA-2.780
gotten-2.715
gery-2.712
ODY-2.662
Token Nick
Token ID8047
Feature activation+2.32e+00

Pos logit contributions
elsen+0.462
sole+0.453
 Nanto+0.445
anto+0.411
urus+0.408
Neg logit contributions
geries-0.446
GA-0.407
gery-0.392
gotten-0.392
ODY-0.378
Token Sur
Token ID4198
Feature activation-1.77e-01

Pos logit contributions
elsen+7.291
sole+6.999
 Nanto+6.834
anto+6.422
ross+6.327
Neg logit contributions
geries-7.502
GA-6.834
gotten-6.622
gery-6.612
ODY-6.580
Tokengey
Token ID39608
Feature activation+1.51e-01

Pos logit contributions
geries+0.603
GA+0.553
gery+0.534
gotten+0.533
ODY+0.516
Neg logit contributions
elsen-0.551
sole-0.538
 Nanto-0.529
anto-0.485
urus-0.482
Token,
Token ID11
Feature activation-3.59e-01

Pos logit contributions
elsen+0.494
sole+0.484
 Nanto+0.478
urus+0.437
anto+0.437
Neg logit contributions
geries-0.488
GA-0.448
gery-0.429
gotten-0.428
ODY-0.414
Token C
Token ID327
Feature activation-8.05e-01

Pos logit contributions
geries+1.126
GA+1.036
gery+0.989
gotten+0.988
ODY+0.958
Neg logit contributions
elsen-1.205
sole-1.180
 Nanto-1.164
urus-1.069
anto-1.068
TokenMD
Token ID12740
Feature activation+2.72e-01

Pos logit contributions
geries+1.948
GA+1.770
gotten+1.639
gery+1.638
ODY+1.588
Neg logit contributions
elsen-3.259
sole-3.212
 Nanto-3.199
anto-2.954
urus-2.949
Token called
Token ID1444
Feature activation+6.74e-01

Pos logit contributions
geries+1.124
GA+1.032
gery+0.995
gotten+0.994
ODY+0.960
Neg logit contributions
elsen-1.069
sole-1.047
 Nanto-1.030
anto-0.945
urus-0.939
Token into
Token ID656
Feature activation-2.43e-01

Pos logit contributions
elsen+1.650
sole+1.620
 Nanto+1.556
anto+1.422
urus+1.386
Neg logit contributions
geries-2.717
GA-2.533
gotten-2.459
gery-2.442
ODY-2.389
Token question
Token ID1808
Feature activation-9.52e-02

Pos logit contributions
geries+0.777
GA+0.710
gery+0.682
gotten+0.682
ODY+0.658
Neg logit contributions
elsen-0.803
sole-0.788
 Nanto-0.774
anto-0.714
urus-0.710
Token,
Token ID11
Feature activation+4.34e-02

Pos logit contributions
geries+0.321
GA+0.295
gotten+0.284
gery+0.284
ODY+0.275
Neg logit contributions
elsen-0.299
sole-0.293
 Nanto-0.287
anto-0.264
urus-0.262
Token Dr
Token ID1583
Feature activation-3.05e-01

Pos logit contributions
elsen+0.127
sole+0.124
 Nanto+0.121
anto+0.111
urus+0.110
Neg logit contributions
geries-0.156
GA-0.144
gery-0.139
gotten-0.139
ODY-0.135
Token Naz
Token ID12819
Feature activation+2.29e+00

Pos logit contributions
geries+1.177
GA+1.097
gery+1.069
gotten+1.056
ODY+1.023
Neg logit contributions
elsen-0.802
sole-0.787
 Nanto-0.763
anto-0.685
urus-0.683
Tokenir
Token ID343
Feature activation+3.02e-02

Pos logit contributions
sole+5.123
elsen+5.063
urus+4.627
ross+4.620
anto+4.513
Neg logit contributions
geries-9.220
��-8.606
ODY-8.457
GA-8.456
YP-8.289
Token-
Token ID12
Feature activation+6.14e-02

Pos logit contributions
elsen+0.104
sole+0.102
 Nanto+0.100
anto+0.094
urus+0.092
Neg logit contributions
geries-0.093
GA-0.084
gotten-0.081
gery-0.081
ODY-0.078
TokenAli
Token ID37893
Feature activation-6.70e-01

Pos logit contributions
elsen+0.245
sole+0.242
 Nanto+0.237
anto+0.224
urus+0.222
Neg logit contributions
geries-0.155
GA-0.134
gotten-0.132
gery-0.130
ODY-0.124
Token said
Token ID531
Feature activation-4.94e-01

Pos logit contributions
geries+2.262
GA+2.145
gery+2.054
gotten+2.030
ODY+1.979
Neg logit contributions
elsen-2.005
sole-1.957
 Nanto-1.906
urus-1.761
ross-1.721
Token.
Token ID13
Feature activation-1.96e-01

Pos logit contributions
geries+1.352
GA+1.227
gery+1.169
gotten+1.151
ODY+1.113
Neg logit contributions
elsen-1.856
sole-1.815
 Nanto-1.785
urus-1.667
anto-1.667
Token Thanks
Token ID6930
Feature activation+9.87e-01

Pos logit contributions
geries+1.461
GA+1.353
gery+1.309
gotten+1.309
ODY+1.270
Neg logit contributions
elsen-1.117
sole-1.094
 Nanto-1.075
anto-0.971
urus-0.965
Token for
Token ID329
Feature activation-6.20e-01

Pos logit contributions
elsen+2.035
sole+2.007
 Nanto+1.728
anto+1.681
ilon+1.640
Neg logit contributions
geries-4.291
��-3.930
GA-3.907
ODY-3.849
gery-3.816
Token subscribing
Token ID18412
Feature activation+1.25e+00

Pos logit contributions
geries+2.266
GA+2.097
gotten+2.020
gery+2.019
ODY+1.967
Neg logit contributions
elsen-1.773
sole-1.736
 Nanto-1.694
anto-1.546
urus-1.534
Token!
Token ID0
Feature activation+2.65e-01

Pos logit contributions
elsen+3.743
sole+3.491
 Nanto+3.467
urus+3.181
urst+3.168
Neg logit contributions
geries-4.407
GA-3.966
gotten-3.927
gery-3.898
ODY-3.774
Token<|endoftext|>
Token ID50256
Feature activation+3.52e-01

Pos logit contributions
elsen+0.680
sole+0.663
 Nanto+0.650
anto+0.582
urus+0.577
Neg logit contributions
geries-1.045
GA-0.974
gery-0.942
gotten-0.942
ODY-0.916
TokenNick
Token ID23609
Feature activation+2.29e+00

Pos logit contributions
elsen+1.170
sole+1.158
 Nanto+1.112
anto+1.043
urus+1.038
Neg logit contributions
geries-1.124
GA-0.999
gery-0.981
gotten-0.976
ODY-0.937
Token Saban
Token ID41618
Feature activation-4.90e-02

Pos logit contributions
elsen+6.951
sole+6.818
 Nanto+6.581
anto+6.233
urus+6.180
Neg logit contributions
geries-7.465
GA-6.828
gotten-6.551
ODY-6.535
gery-6.534
Token has
Token ID468
Feature activation-2.48e-01

Pos logit contributions
geries+0.159
GA+0.145
gotten+0.140
gery+0.140
ODY+0.135
Neg logit contributions
elsen-0.160
sole-0.157
 Nanto-0.154
anto-0.142
urus-0.141
Token not
Token ID407
Feature activation-2.23e-01

Pos logit contributions
geries+0.790
GA+0.721
gery+0.693
gotten+0.692
ODY+0.669
Neg logit contributions
elsen-0.828
sole-0.812
 Nanto-0.798
anto-0.737
urus-0.733
Token been
Token ID587
Feature activation+8.82e-02

Pos logit contributions
geries+0.695
GA+0.634
gotten+0.609
gery+0.609
ODY+0.585
Neg logit contributions
elsen-0.758
sole-0.744
 Nanto-0.732
anto-0.676
urus-0.672
Token hesitant
Token ID32848
Feature activation-3.57e-02

Pos logit contributions
elsen+0.271
sole+0.266
 Nanto+0.260
anto+0.239
urus+0.237
Neg logit contributions
geries-0.303
GA-0.279
gotten-0.269
gery-0.269
ODY-0.260
Token the
Token ID262
Feature activation-3.09e-01

Pos logit contributions
geries+2.464
GA+2.208
gery+2.153
gotten+2.138
ODY+2.027
Neg logit contributions
elsen-2.832
sole-2.794
 Nanto-2.755
urus-2.547
anto-2.534
Token other
Token ID584
Feature activation-3.29e-01

Pos logit contributions
geries+1.040
GA+0.949
gery+0.920
gotten+0.916
ODY+0.884
Neg logit contributions
elsen-0.973
sole-0.954
 Nanto-0.939
urus-0.860
anto-0.855
Token.
Token ID13
Feature activation-6.37e-02

Pos logit contributions
geries+1.133
GA+1.042
gery+1.008
gotten+1.005
ODY+0.967
Neg logit contributions
elsen-1.008
sole-0.991
 Nanto-0.973
urus-0.887
anto-0.882
Token See
Token ID4091
Feature activation-4.03e-01

Pos logit contributions
geries+0.221
GA+0.204
gery+0.196
gotten+0.196
ODY+0.190
Neg logit contributions
elsen-0.193
sole-0.189
 Nanto-0.186
anto-0.170
urus-0.169
Token,
Token ID11
Feature activation+1.69e-02

Pos logit contributions
geries+1.310
GA+1.204
gotten+1.154
gery+1.154
ODY+1.114
Neg logit contributions
elsen-1.308
sole-1.290
 Nanto-1.259
anto-1.162
urus-1.161
Token Zak
Token ID32605
Feature activation+2.28e+00

Pos logit contributions
elsen+0.052
sole+0.051
 Nanto+0.050
anto+0.046
urus+0.046
Neg logit contributions
geries-0.057
GA-0.053
gery-0.051
gotten-0.051
ODY-0.049
Tokenov
Token ID709
Feature activation+1.31e+00

Pos logit contributions
elsen+4.468
sole+4.203
 Nanto+3.650
ross+3.629
anto+3.579
Neg logit contributions
geries-10.186
��-9.450
GA-9.411
gotten-9.197
ODY-9.156
Tokenitch
Token ID2007
Feature activation+7.48e-01

Pos logit contributions
elsen+3.723
sole+3.656
 Nanto+3.372
ross+3.268
anto+3.260
Neg logit contributions
geries-4.775
GA-4.377
��-4.233
gotten-4.212
ODY-4.156
Token,
Token ID11
Feature activation+6.54e-02

Pos logit contributions
elsen+2.423
sole+2.363
 Nanto+2.331
anto+2.156
urus+2.111
Neg logit contributions
geries-2.444
GA-2.214
gotten-2.167
gery-2.141
ODY-2.075
Token
Token ID564
Feature activation-2.33e-01

Pos logit contributions
elsen+0.202
sole+0.197
 Nanto+0.194
anto+0.178
urus+0.176
Neg logit contributions
geries-0.225
GA-0.206
gotten-0.199
gery-0.199
ODY-0.192
Token
Token ID250
Feature activation-2.71e-01

Pos logit contributions
geries+0.607
GA+0.538
gery+0.514
gotten+0.512
ODY+0.491
Neg logit contributions
elsen-0.913
sole-0.898
 Nanto-0.883
anto-0.827
urus-0.823
Token refused
Token ID6520
Feature activation+7.08e-01

Pos logit contributions
geries+0.149
GA+0.137
gery+0.132
gotten+0.132
ODY+0.128
Neg logit contributions
elsen-0.129
sole-0.127
 Nanto-0.124
anto-0.114
urus-0.113
Token one
Token ID530
Feature activation+3.56e-02

Pos logit contributions
elsen+2.244
sole+2.221
 Nanto+2.121
urus+1.974
anto+1.967
Neg logit contributions
geries-2.338
GA-2.139
gotten-2.059
gery-2.049
YP-1.979
Token.
Token ID13
Feature activation+1.55e-01

Pos logit contributions
elsen+0.123
sole+0.121
 Nanto+0.119
anto+0.110
urus+0.109
Neg logit contributions
geries-0.109
GA-0.099
gotten-0.095
gery-0.095
ODY-0.091
Token From
Token ID3574
Feature activation+5.46e-01

Pos logit contributions
elsen+0.483
sole+0.473
 Nanto+0.467
anto+0.426
urus+0.423
Neg logit contributions
geries-0.527
GA-0.480
gery-0.465
gotten-0.465
ODY-0.448
Token Ed
Token ID1717
Feature activation+1.20e+00

Pos logit contributions
elsen+1.737
sole+1.694
 Nanto+1.666
anto+1.539
urus+1.521
Neg logit contributions
geries-1.820
GA-1.664
gery-1.604
gotten-1.599
ODY-1.548
Token Koch
Token ID17009
Feature activation+2.28e+00

Pos logit contributions
elsen+3.726
 Nanto+3.613
sole+3.601
anto+3.261
urus+3.246
Neg logit contributions
geries-3.993
GA-3.681
gotten-3.549
gery-3.521
ODY-3.477
Token on
Token ID319
Feature activation-2.58e-01

Pos logit contributions
elsen+6.884
sole+6.770
 Nanto+6.567
anto+6.138
urus+6.020
Neg logit contributions
geries-7.647
GA-6.839
gotten-6.715
gery-6.653
YP-6.414
Token down
Token ID866
Feature activation+2.29e-01

Pos logit contributions
geries+1.056
GA+0.988
gery+0.959
gotten+0.954
ODY+0.932
Neg logit contributions
elsen-0.620
sole-0.600
 Nanto-0.585
urus-0.523
anto-0.521
Token,
Token ID11
Feature activation-6.48e-03

Pos logit contributions
elsen+0.748
sole+0.733
 Nanto+0.720
anto+0.663
urus+0.659
Neg logit contributions
geries-0.744
GA-0.680
gery-0.654
gotten-0.654
ODY-0.632
Token every
Token ID790
Feature activation+3.29e-01

Pos logit contributions
geries+0.021
GA+0.019
gery+0.018
gotten+0.018
ODY+0.018
Neg logit contributions
elsen-0.021
sole-0.021
 Nanto-0.020
anto-0.019
urus-0.019
Token mayor
Token ID9591
Feature activation+3.75e-02

Pos logit contributions
elsen+1.021
sole+1.000
 Nanto+0.979
anto+0.900
urus+0.891
Neg logit contributions
geries-1.125
GA-1.030
gotten-0.994
gery-0.991
ODY-0.959
Token,
Token ID11
Feature activation-1.32e-01

Pos logit contributions
elsen+2.017
sole+1.979
 Nanto+1.926
ross+1.775
anto+1.773
Neg logit contributions
geries-2.155
GA-1.928
gotten-1.905
gery-1.881
��-1.824
Token 2016
Token ID1584
Feature activation+1.06e+00

Pos logit contributions
geries+0.395
GA+0.360
gery+0.344
gotten+0.343
ODY+0.332
Neg logit contributions
elsen-0.463
sole-0.454
 Nanto-0.447
anto-0.415
urus-0.412
Token.
Token ID13
Feature activation-6.28e-03

Pos logit contributions
elsen+3.626
sole+3.520
 Nanto+3.459
anto+3.247
urus+3.217
Neg logit contributions
geries-3.230
GA-2.928
gotten-2.857
gery-2.817
YP-2.700
Token REUTERS
Token ID15862
Feature activation-1.80e-01

Pos logit contributions
geries+0.021
GA+0.019
gery+0.018
gotten+0.018
ODY+0.018
Neg logit contributions
elsen-0.020
sole-0.020
 Nanto-0.019
anto-0.018
urus-0.018
Token/
Token ID14
Feature activation+4.19e-01

Pos logit contributions
geries+0.474
GA+0.419
gotten+0.402
gery+0.400
ODY+0.382
Neg logit contributions
elsen-0.698
sole-0.688
 Nanto-0.674
anto-0.632
ross-0.628
TokenNick
Token ID23609
Feature activation+2.27e+00

Pos logit contributions
elsen+1.474
sole+1.449
 Nanto+1.409
anto+1.322
urus+1.317
Neg logit contributions
geries-1.247
GA-1.116
gotten-1.082
gery-1.081
ODY-1.033
Token Oxford
Token ID13643
Feature activation+8.52e-01

Pos logit contributions
elsen+6.578
sole+6.261
 Nanto+6.036
anto+5.749
ross+5.564
Neg logit contributions
geries-7.850
GA-7.435
ODY-7.085
gotten-7.073
��-7.030
Token/
Token ID14
Feature activation-1.95e-01

Pos logit contributions
elsen+3.042
sole+2.968
 Nanto+2.918
anto+2.722
urus+2.695
Neg logit contributions
geries-2.508
GA-2.264
gotten-2.187
gery-2.175
ODY-2.091
TokenFile
Token ID8979
Feature activation-8.33e-01

Pos logit contributions
geries+0.616
GA+0.564
gery+0.540
gotten+0.537
YP+0.523
Neg logit contributions
elsen-0.656
sole-0.644
 Nanto-0.633
anto-0.584
urus-0.580
Token Photo
Token ID5555
Feature activation+6.08e-01

Pos logit contributions
geries+2.958
GA+2.741
gotten+2.662
gery+2.644
ODY+2.578
Neg logit contributions
elsen-2.422
sole-2.399
 Nanto-2.316
anto-2.114
urus-2.101
Token
Token ID198
Feature activation+2.35e-01

Pos logit contributions
elsen+1.812
sole+1.766
 Nanto+1.738
anto+1.587
urus+1.575
Neg logit contributions
geries-2.145
GA-1.974
gotten-1.910
gery-1.907
ODY-1.848
Tokenunch
Token ID3316
Feature activation+2.52e-01

Pos logit contributions
elsen+0.578
sole+0.565
 Nanto+0.554
anto+0.508
urus+0.504
Neg logit contributions
geries-0.668
GA-0.614
gotten-0.593
gery-0.593
ODY-0.574
Tokenkins
Token ID5331
Feature activation+9.64e-01

Pos logit contributions
elsen+0.704
sole+0.691
 Nanto+0.673
anto+0.615
urus+0.608
Neg logit contributions
geries-0.934
GA-0.864
gotten-0.838
gery-0.833
ODY-0.816
Token-
Token ID12
Feature activation-1.97e-01

Pos logit contributions
elsen+3.146
sole+3.069
 Nanto+3.022
anto+2.793
ross+2.762
Neg logit contributions
geries-3.084
GA-2.795
gotten-2.713
gery-2.687
ODY-2.642
TokenJ
Token ID41
Feature activation+1.56e-01

Pos logit contributions
geries+0.592
GA+0.538
gery+0.515
gotten+0.515
ODY+0.496
Neg logit contributions
elsen-0.693
sole-0.680
 Nanto-0.670
anto-0.620
urus-0.617
Tokenill
Token ID359
Feature activation+1.02e+00

Pos logit contributions
elsen+0.489
sole+0.478
 Nanto+0.468
anto+0.432
urus+0.429
Neg logit contributions
geries-0.527
GA-0.482
gery-0.466
gotten-0.466
ODY-0.450
Token Nick
Token ID8047
Feature activation+2.24e+00

Pos logit contributions
elsen+3.278
sole+3.176
 Nanto+3.128
anto+2.918
urus+2.883
Neg logit contributions
geries-3.335
GA-3.044
gotten-2.928
gery-2.913
ODY-2.891
Tokenall
Token ID439
Feature activation-1.41e-01

Pos logit contributions
elsen+6.818
sole+6.407
 Nanto+6.058
ross+5.810
anto+5.794
Neg logit contributions
geries-7.503
GA-6.859
ODY-6.750
gotten-6.625
gery-6.553
Tokenini
Token ID5362
Feature activation-1.57e-01

Pos logit contributions
geries+0.471
GA+0.433
gery+0.417
gotten+0.416
ODY+0.401
Neg logit contributions
elsen-0.446
sole-0.439
 Nanto-0.432
anto-0.395
urus-0.394
Token-
Token ID12
Feature activation-1.99e-01

Pos logit contributions
geries+0.508
GA+0.465
gery+0.447
gotten+0.446
ODY+0.429
Neg logit contributions
elsen-0.515
sole-0.507
 Nanto-0.499
anto-0.460
urus-0.457
TokenJim
Token ID18050
Feature activation-1.94e-01

Pos logit contributions
geries+0.594
GA+0.540
gery+0.517
gotten+0.517
ODY+0.498
Neg logit contributions
elsen-0.699
sole-0.686
 Nanto-0.676
anto-0.626
urus-0.623
Token Vs
Token ID31280
Feature activation-3.91e-01

Pos logit contributions
geries+0.647
GA+0.593
gery+0.571
gotten+0.571
ODY+0.551
Neg logit contributions
elsen-0.614
sole-0.602
 Nanto-0.591
anto-0.543
urus-0.540
Token
Token ID198
Feature activation+1.08e-01

Pos logit contributions
geries+0.341
GA+0.316
gery+0.303
gotten+0.301
ODY+0.293
Neg logit contributions
elsen-0.321
sole-0.314
 Nanto-0.306
anto-0.282
urus-0.282
Token
Token ID198
Feature activation+1.59e-01

Pos logit contributions
elsen+0.310
sole+0.306
 Nanto+0.297
anto+0.271
urus+0.269
Neg logit contributions
geries-0.396
GA-0.359
gotten-0.350
gery-0.349
ODY-0.339
TokenHowever
Token ID4864
Feature activation-7.22e-01

Pos logit contributions
elsen+0.524
sole+0.516
 Nanto+0.504
anto+0.467
urus+0.463
Neg logit contributions
geries-0.511
GA-0.462
gery-0.447
gotten-0.447
ODY-0.430
Token,
Token ID11
Feature activation-3.22e-02

Pos logit contributions
geries+2.188
GA+1.989
gery+1.908
gotten+1.900
ODY+1.833
Neg logit contributions
elsen-2.478
sole-2.437
 Nanto-2.390
urus-2.224
anto-2.218
Token Dr
Token ID1583
Feature activation-4.80e-01

Pos logit contributions
geries+0.115
GA+0.106
gery+0.102
gotten+0.102
ODY+0.099
Neg logit contributions
elsen-0.095
sole-0.093
 Nanto-0.091
anto-0.083
urus-0.083
Token Naz
Token ID12819
Feature activation+2.23e+00

Pos logit contributions
geries+1.821
GA+1.699
gery+1.649
gotten+1.629
ODY+1.579
Neg logit contributions
elsen-1.281
sole-1.262
 Nanto-1.225
anto-1.103
urus-1.097
Tokenir
Token ID343
Feature activation+1.70e-01

Pos logit contributions
sole+5.062
elsen+4.988
ross+4.602
urus+4.545
anto+4.443
Neg logit contributions
geries-8.900
��-8.312
ODY-8.169
GA-8.108
YP-7.978
Token-
Token ID12
Feature activation+7.51e-02

Pos logit contributions
elsen+0.610
sole+0.598
 Nanto+0.588
anto+0.545
urus+0.544
Neg logit contributions
geries-0.499
GA-0.453
gery-0.433
gotten-0.432
ODY-0.415
TokenAli
Token ID37893
Feature activation-1.09e+00

Pos logit contributions
elsen+0.299
sole+0.295
 Nanto+0.288
anto+0.273
urus+0.270
Neg logit contributions
geries-0.191
GA-0.165
gotten-0.162
gery-0.160
ODY-0.153
Token,
Token ID11
Feature activation-2.04e-01

Pos logit contributions
geries+3.530
GA+3.355
gery+3.198
gotten+3.174
ODY+3.075
Neg logit contributions
elsen-3.315
sole-3.257
 Nanto-3.177
urus-2.948
ross-2.878
Token who
Token ID508
Feature activation+2.34e-01

Pos logit contributions
geries+0.654
GA+0.600
gery+0.576
gotten+0.576
ODY+0.557
Neg logit contributions
elsen-0.673
sole-0.659
 Nanto-0.649
anto-0.596
urus-0.594
Token GA
Token ID14545
Feature activation-9.72e-01

Pos logit contributions
geries+1.049
GA+0.958
gery+0.890
gotten+0.881
ODY+0.846
Neg logit contributions
elsen-1.725
sole-1.697
 Nanto-1.669
anto-1.564
urus-1.558
Token:
Token ID25
Feature activation+2.40e-01

Pos logit contributions
geries+2.832
GA+2.651
gery+2.470
gotten+2.436
ODY+2.365
Neg logit contributions
elsen-3.400
sole-3.341
 Nanto-3.310
anto-3.093
urus-3.054
Token US
Token ID1294
Feature activation+2.06e-01

Pos logit contributions
elsen+0.702
sole+0.686
 Nanto+0.676
anto+0.615
urus+0.607
Neg logit contributions
geries-0.862
GA-0.786
gery-0.767
gotten-0.766
ODY-0.740
Token Department
Token ID2732
Feature activation+2.14e+00

Pos logit contributions
elsen+0.647
sole+0.632
 Nanto+0.622
anto+0.572
urus+0.567
Neg logit contributions
geries-0.695
GA-0.627
gery-0.613
gotten-0.613
ODY-0.588
Token of
Token ID286
Feature activation+1.10e+00

Pos logit contributions
sole+3.707
elsen+3.585
 Nanto+3.338
anto+2.978
ross+2.934
Neg logit contributions
geries-9.908
��-8.949
gotten-8.937
��-8.720
gery-8.718
Token Health
Token ID3893
Feature activation+2.23e+00

Pos logit contributions
elsen+3.669
sole+3.608
 Nanto+3.536
anto+3.298
ross+3.262
Neg logit contributions
geries-3.457
GA-3.053
gotten-3.013
gery-2.983
YP-2.879
Token and
Token ID290
Feature activation+3.31e-01

Pos logit contributions
sole+5.300
elsen+5.114
 Nanto+5.100
anto+4.553
ross+4.531
Neg logit contributions
geries-8.639
GA-7.924
gotten-7.827
��-7.817
ODY-7.718
Token Human
Token ID5524
Feature activation+1.96e-01

Pos logit contributions
elsen+0.920
sole+0.900
 Nanto+0.881
anto+0.798
urus+0.793
Neg logit contributions
geries-1.245
GA-1.138
gotten-1.103
gery-1.100
ODY-1.073
Token Services
Token ID6168
Feature activation+1.33e+00

Pos logit contributions
elsen+0.603
sole+0.593
 Nanto+0.580
anto+0.534
urus+0.531
Neg logit contributions
geries-0.667
GA-0.613
gotten-0.593
gery-0.589
ODY-0.576
Token,
Token ID11
Feature activation+3.63e-01

Pos logit contributions
elsen+4.122
sole+4.026
 Nanto+3.965
anto+3.619
urus+3.596
Neg logit contributions
geries-4.451
GA-3.979
gotten-3.959
gery-3.907
ODY-3.760
Token CDC
Token ID20434
Feature activation+5.79e-01

Pos logit contributions
elsen+0.984
sole+0.960
 Nanto+0.940
anto+0.851
urus+0.844
Neg logit contributions
geries-1.379
GA-1.278
gery-1.237
gotten-1.237
ODY-1.201
Token
Token ID198
Feature activation+1.48e-01

Pos logit contributions
elsen+0.331
sole+0.327
 Nanto+0.315
anto+0.289
urus+0.287
Neg logit contributions
geries-0.434
GA-0.391
gotten-0.383
gery-0.382
ODY-0.373
TokenIn
Token ID818
Feature activation+3.57e-01

Pos logit contributions
elsen+0.494
sole+0.486
 Nanto+0.466
anto+0.438
urus+0.435
Neg logit contributions
geries-0.472
GA-0.422
gery-0.412
gotten-0.411
ODY-0.395
Token 1985
Token ID12863
Feature activation+7.94e-02

Pos logit contributions
elsen+1.052
sole+1.031
 Nanto+0.999
anto+0.915
urus+0.911
Neg logit contributions
geries-1.270
GA-1.171
gotten-1.136
gery-1.129
ODY-1.102
Token the
Token ID262
Feature activation+2.96e-01

Pos logit contributions
elsen+0.273
sole+0.267
 Nanto+0.263
anto+0.243
urus+0.242
Neg logit contributions
geries-0.245
GA-0.223
gery-0.214
gotten-0.214
ODY-0.206
Token US
Token ID1294
Feature activation+6.76e-02

Pos logit contributions
elsen+0.924
sole+0.911
 Nanto+0.887
anto+0.818
urus+0.808
Neg logit contributions
geries-0.999
GA-0.912
gotten-0.885
gery-0.882
ODY-0.854
Token Postal
Token ID32881
Feature activation+2.22e+00

Pos logit contributions
elsen+0.224
sole+0.220
 Nanto+0.215
anto+0.200
urus+0.198
Neg logit contributions
geries-0.217
GA-0.194
gery-0.190
gotten-0.189
ODY-0.182
Token Service
Token ID4809
Feature activation+1.14e+00

Pos logit contributions
elsen+6.976
sole+6.901
 Nanto+6.620
ross+6.375
urus+6.356
Neg logit contributions
geries-6.796
gotten-6.244
GA-5.980
ODY-5.896
gery-5.885
Token issued
Token ID4884
Feature activation+7.66e-01

Pos logit contributions
elsen+3.338
sole+3.333
 Nanto+3.211
anto+2.986
ross+2.924
Neg logit contributions
geries-4.036
GA-3.592
gotten-3.552
gery-3.521
ODY-3.432
Token a
Token ID257
Feature activation+5.57e-01

Pos logit contributions
elsen+2.446
sole+2.403
 Nanto+2.325
anto+2.154
urus+2.141
Neg logit contributions
geries-2.493
GA-2.278
gotten-2.212
gery-2.194
ODY-2.146
Token stamp
Token ID17977
Feature activation+3.74e-01

Pos logit contributions
elsen+1.668
sole+1.653
 Nanto+1.594
anto+1.467
urus+1.454
Neg logit contributions
geries-1.943
GA-1.775
gotten-1.729
gery-1.717
ODY-1.680
Token in
Token ID287
Feature activation+2.68e-01

Pos logit contributions
elsen+1.196
sole+1.187
 Nanto+1.155
anto+1.070
urus+1.052
Neg logit contributions
geries-1.231
GA-1.119
gotten-1.088
gery-1.080
ODY-1.053
Token Bible
Token ID9111
Feature activation-4.04e-01

Pos logit contributions
elsen+1.145
sole+1.126
 Nanto+1.107
anto+1.020
urus+1.007
Neg logit contributions
geries-1.241
GA-1.139
gotten-1.099
gery-1.093
ODY-1.065
Token scholar
Token ID15606
Feature activation-4.99e-01

Pos logit contributions
geries+1.319
GA+1.208
gery+1.163
gotten+1.159
ODY+1.120
Neg logit contributions
elsen-1.312
sole-1.285
 Nanto-1.263
anto-1.161
urus-1.155
Token,
Token ID11
Feature activation-7.88e-02

Pos logit contributions
geries+1.602
GA+1.464
gery+1.409
gotten+1.397
ODY+1.350
Neg logit contributions
elsen-1.645
sole-1.616
 Nanto-1.575
anto-1.457
urus-1.454
Token Y
Token ID575
Feature activation+4.88e-01

Pos logit contributions
geries+0.257
GA+0.235
gery+0.226
gotten+0.225
ODY+0.218
Neg logit contributions
elsen-0.257
sole-0.252
 Nanto-0.247
anto-0.227
urus-0.227
Tokenair
Token ID958
Feature activation-6.16e-01

Pos logit contributions
elsen+1.425
sole+1.391
 Nanto+1.354
anto+1.251
urus+1.245
Neg logit contributions
geries-1.758
GA-1.603
gotten-1.568
gery-1.562
ODY-1.513
Token Zak
Token ID32605
Feature activation+2.22e+00

Pos logit contributions
geries+1.995
GA+1.844
gery+1.768
gotten+1.758
ODY+1.689
Neg logit contributions
elsen-1.981
sole-1.951
 Nanto-1.895
urus-1.739
anto-1.737
Tokenov
Token ID709
Feature activation+7.88e-01

Pos logit contributions
elsen+5.601
sole+5.465
 Nanto+5.094
anto+4.888
ross+4.780
Neg logit contributions
geries-8.567
GA-7.798
ODY-7.745
gotten-7.677
YP-7.575
Tokenitch
Token ID2007
Feature activation+2.72e-01

Pos logit contributions
elsen+2.579
sole+2.524
 Nanto+2.473
anto+2.286
urus+2.261
Neg logit contributions
geries-2.558
GA-2.329
gotten-2.259
gery-2.248
ODY-2.177
Token of
Token ID286
Feature activation+3.75e-01

Pos logit contributions
elsen+0.905
sole+0.887
 Nanto+0.872
anto+0.804
urus+0.800
Neg logit contributions
geries-0.869
GA-0.793
gery-0.762
gotten-0.762
ODY-0.735
Token Hebrew
Token ID16505
Feature activation-3.03e-01

Pos logit contributions
elsen+1.231
sole+1.206
 Nanto+1.191
anto+1.096
urus+1.087
Neg logit contributions
geries-1.211
GA-1.107
gotten-1.066
gery-1.064
ODY-1.030
Token University
Token ID2059
Feature activation-7.81e-02

Pos logit contributions
geries+1.031
GA+0.948
gery+0.915
gotten+0.913
ODY+0.882
Neg logit contributions
elsen-0.940
sole-0.919
 Nanto-0.903
anto-0.820
urus-0.820
Token photo
Token ID4590
Feature activation+6.85e-01

Pos logit contributions
geries+2.212
GA+2.056
gery+1.981
gotten+1.978
ODY+1.915
Neg logit contributions
elsen-1.798
sole-1.766
 Nanto-1.737
anto-1.580
urus-1.565
Token posted
Token ID4481
Feature activation+4.52e-02

Pos logit contributions
elsen+2.115
sole+2.078
 Nanto+2.033
anto+1.878
urus+1.862
Neg logit contributions
geries-2.298
GA-2.114
gotten-2.065
gery-2.018
ODY-2.002
Token by
Token ID416
Feature activation+1.44e-01

Pos logit contributions
elsen+0.126
sole+0.123
 Nanto+0.119
anto+0.109
ross+0.108
Neg logit contributions
geries-0.169
GA-0.155
gotten-0.152
gery-0.149
YP-0.147
Token Krist
Token ID14912
Feature activation+5.62e-01

Pos logit contributions
elsen+0.503
sole+0.493
 Nanto+0.485
anto+0.450
urus+0.447
Neg logit contributions
geries-0.433
GA-0.392
gotten-0.378
gery-0.376
ODY-0.362
Tokenin
Token ID259
Feature activation+4.42e-01

Pos logit contributions
elsen+1.821
sole+1.785
 Nanto+1.744
anto+1.615
urus+1.604
Neg logit contributions
geries-1.842
GA-1.686
gotten-1.624
gery-1.623
ODY-1.570
Token Cav
Token ID19931
Feature activation+2.21e+00

Pos logit contributions
elsen+1.415
sole+1.384
 Nanto+1.360
anto+1.253
urus+1.243
Neg logit contributions
geries-1.466
GA-1.337
gotten-1.295
gery-1.291
ODY-1.249
Tokenall
Token ID439
Feature activation+7.65e-02

Pos logit contributions
elsen+6.694
sole+6.511
 Nanto+6.063
anto+6.051
ross+5.885
Neg logit contributions
geries-7.341
GA-6.756
gotten-6.561
��-6.481
gery-6.454
Tokenari
Token ID2743
Feature activation+2.71e-01

Pos logit contributions
elsen+0.234
sole+0.229
 Nanto+0.226
anto+0.206
urus+0.204
Neg logit contributions
geries-0.263
GA-0.243
gery-0.234
gotten-0.234
ODY-0.226
Token (@
Token ID4275
Feature activation+3.85e-01

Pos logit contributions
elsen+0.890
sole+0.871
 Nanto+0.853
anto+0.789
urus+0.783
Neg logit contributions
geries-0.881
GA-0.801
gotten-0.775
gery-0.773
ODY-0.746
Tokenk
Token ID74
Feature activation+1.51e+00

Pos logit contributions
elsen+1.250
sole+1.229
 Nanto+1.194
anto+1.110
urus+1.100
Neg logit contributions
geries-1.256
GA-1.150
gery-1.105
gotten-1.103
ODY-1.068
Tokenrist
Token ID1585
Feature activation+3.95e-01

Pos logit contributions
elsen+4.869
sole+4.828
 Nanto+4.451
anto+4.377
ross+4.300
Neg logit contributions
geries-4.822
GA-4.511
gotten-4.243
gery-4.216
YP-4.168
Token written
Token ID3194
Feature activation+1.90e-01

Pos logit contributions
elsen+3.774
sole+3.618
 Nanto+3.574
anto+3.332
ross+3.279
Neg logit contributions
geries-3.968
GA-3.604
gery-3.447
gotten-3.440
ODY-3.371
Token a
Token ID257
Feature activation+1.03e-01

Pos logit contributions
elsen+0.615
sole+0.605
 Nanto+0.594
anto+0.549
urus+0.542
Neg logit contributions
geries-0.622
GA-0.569
gotten-0.550
gery-0.546
ODY-0.530
Token comment
Token ID2912
Feature activation-4.70e-01

Pos logit contributions
elsen+0.290
sole+0.284
 Nanto+0.278
anto+0.253
urus+0.250
Neg logit contributions
geries-0.380
GA-0.351
gotten-0.339
gery-0.339
ODY-0.329
Token online
Token ID2691
Feature activation-3.54e-02

Pos logit contributions
geries+1.554
GA+1.431
gery+1.376
gotten+1.369
ODY+1.322
Neg logit contributions
elsen-1.510
sole-1.466
 Nanto-1.444
urus-1.326
anto-1.320
Token about
Token ID546
Feature activation-3.27e-01

Pos logit contributions
geries+0.115
GA+0.105
gotten+0.101
gery+0.101
ODY+0.098
Neg logit contributions
elsen-0.115
sole-0.114
 Nanto-0.111
anto-0.103
urus-0.102
Token Naval
Token ID20818
Feature activation+2.20e+00

Pos logit contributions
geries+1.061
GA+0.969
gery+0.934
gotten+0.931
ODY+0.900
Neg logit contributions
elsen-1.069
sole-1.047
 Nanto-1.027
anto-0.948
urus-0.941
Tokenny
Token ID3281
Feature activation+8.65e-01

Pos logit contributions
sole+2.068
elsen+1.988
ross+1.428
 Nanto+1.365
anto+1.365
Neg logit contributions
geries-11.814
GA-11.224
��-10.962
gotten-10.850
��-10.728
Token
Token ID447
Feature activation+1.78e-01

Pos logit contributions
elsen+2.732
sole+2.700
 Nanto+2.642
anto+2.460
urst+2.400
Neg logit contributions
geries-2.843
GA-2.574
gotten-2.513
gery-2.476
��-2.403
Token
Token ID247
Feature activation-9.49e-02

Pos logit contributions
elsen+0.677
sole+0.668
 Nanto+0.649
urus+0.611
anto+0.609
Neg logit contributions
geries-0.480
GA-0.421
gery-0.408
gotten-0.405
ODY-0.392
Tokens
Token ID82
Feature activation-1.80e-01

Pos logit contributions
geries+0.362
GA+0.336
gery+0.325
gotten+0.325
ODY+0.315
Neg logit contributions
elsen-0.256
sole-0.250
 Nanto-0.245
anto-0.221
urus-0.220
Token protest
Token ID5402
Feature activation-2.99e-01

Pos logit contributions
geries+0.584
GA+0.534
gery+0.514
gotten+0.513
ODY+0.496
Neg logit contributions
elsen-0.588
sole-0.577
 Nanto-0.566
anto-0.522
urus-0.519
Token
Token ID247
Feature activation-1.96e-01

Pos logit contributions
elsen+0.234
sole+0.230
 Nanto+0.227
anto+0.213
urus+0.212
Neg logit contributions
geries-0.145
GA-0.128
gotten-0.122
gery-0.122
ODY-0.116
Tokens
Token ID82
Feature activation-1.38e-01

Pos logit contributions
geries+0.741
GA+0.691
gery+0.668
gotten+0.664
ODY+0.646
Neg logit contributions
elsen-0.534
sole-0.522
 Nanto-0.515
anto-0.463
urus-0.460
Token objective
Token ID9432
Feature activation-1.07e-01

Pos logit contributions
geries+0.441
GA+0.402
gotten+0.387
gery+0.387
ODY+0.373
Neg logit contributions
elsen-0.461
sole-0.452
 Nanto-0.444
anto-0.410
urus-0.407
Token of
Token ID286
Feature activation-4.05e-02

Pos logit contributions
geries+0.407
GA+0.381
gery+0.367
gotten+0.367
ODY+0.357
Neg logit contributions
elsen-0.285
sole-0.278
 Nanto-0.272
anto-0.245
urus-0.244
Token reducing
Token ID8868
Feature activation+9.26e-02

Pos logit contributions
geries+0.132
GA+0.121
gery+0.116
gotten+0.116
ODY+0.112
Neg logit contributions
elsen-0.132
sole-0.129
 Nanto-0.127
anto-0.117
urus-0.116
Token carbon
Token ID6588
Feature activation+2.19e+00

Pos logit contributions
elsen+0.298
sole+0.292
 Nanto+0.287
anto+0.264
urus+0.262
Neg logit contributions
geries-0.305
GA-0.280
gery-0.269
gotten-0.269
ODY-0.260
Token dioxide
Token ID17556
Feature activation+1.58e+00

Pos logit contributions
elsen+5.713
sole+5.635
 Nanto+5.594
anto+5.233
ross+5.071
Neg logit contributions
geries-8.124
GA-7.371
gery-7.258
gotten-7.135
YP-7.061
Token emissions
Token ID8971
Feature activation+1.66e+00

Pos logit contributions
elsen+4.213
sole+4.130
 Nanto+4.045
anto+3.765
ross+3.663
Neg logit contributions
geries-5.946
GA-5.400
gery-5.283
gotten-5.269
��-5.164
Token by
Token ID416
Feature activation+8.44e-01

Pos logit contributions
sole+4.469
elsen+4.449
 Nanto+4.280
anto+3.966
ross+3.894
Neg logit contributions
geries-6.081
GA-5.508
gery-5.428
gotten-5.396
��-5.304
Token 40
Token ID2319
Feature activation+9.67e-02

Pos logit contributions
elsen+2.480
sole+2.455
 Nanto+2.357
anto+2.192
ross+2.151
Neg logit contributions
geries-2.952
GA-2.717
gotten-2.638
gery-2.625
ODY-2.583
Token%
Token ID4
Feature activation-1.03e-02

Pos logit contributions
elsen+0.305
sole+0.299
 Nanto+0.294
anto+0.270
urus+0.268
Neg logit contributions
geries-0.325
GA-0.297
gery-0.287
gotten-0.287
ODY-0.277
Token deal
Token ID1730
Feature activation-1.11e+00

Pos logit contributions
geries+0.649
GA+0.594
gery+0.572
gotten+0.572
ODY+0.552
Neg logit contributions
elsen-0.645
sole-0.632
 Nanto-0.621
anto-0.572
urus-0.569
Token to
Token ID284
Feature activation+1.87e-01

Pos logit contributions
geries+3.720
GA+3.429
gery+3.301
gotten+3.289
ODY+3.181
Neg logit contributions
elsen-3.475
sole-3.388
 Nanto-3.307
urus-3.067
anto-3.066
Token save
Token ID3613
Feature activation-6.77e-01

Pos logit contributions
elsen+0.573
sole+0.560
 Nanto+0.552
anto+0.508
urus+0.502
Neg logit contributions
geries-0.641
GA-0.584
gotten-0.566
gery-0.564
ODY-0.551
Token the
Token ID262
Feature activation-6.55e-02

Pos logit contributions
geries+2.021
GA+1.832
gery+1.756
gotten+1.756
ODY+1.689
Neg logit contributions
elsen-2.393
sole-2.349
 Nanto-2.311
anto-2.144
urus-2.131
Token Def
Token ID2896
Feature activation-4.06e-01

Pos logit contributions
geries+0.200
GA+0.181
gery+0.174
gotten+0.174
ODY+0.168
Neg logit contributions
elsen-0.227
sole-0.223
 Nanto-0.218
anto-0.203
urus-0.201
Tokenerred
Token ID17436
Feature activation-2.80e+00

Pos logit contributions
geries+1.172
GA+1.053
gery+1.014
gotten+1.012
ODY+0.972
Neg logit contributions
elsen-1.477
sole-1.446
 Nanto-1.422
anto-1.329
urus-1.318
Token Action
Token ID7561
Feature activation-2.06e+00

Pos logit contributions
geries+6.276
GA+6.195
gotten+5.528
gery+5.493
ODY+5.406
Neg logit contributions
elsen-10.311
 Nanto-10.298
sole-9.879
urst-9.555
urus-9.520
Token for
Token ID329
Feature activation-1.60e+00

Pos logit contributions
geries+6.300
GA+6.223
gery+5.768
gotten+5.708
ODY+5.645
Neg logit contributions
elsen-6.369
 Nanto-6.278
sole-6.066
urst-5.784
ross-5.752
Token Childhood
Token ID40992
Feature activation-1.04e+00

Pos logit contributions
geries+3.568
GA+3.131
gotten+2.992
gery+2.964
ODY+2.816
Neg logit contributions
elsen-6.820
sole-6.744
 Nanto-6.690
anto-6.213
urst-6.210
Token Arri
Token ID27350
Feature activation-4.40e-01

Pos logit contributions
geries+2.505
GA+2.264
gotten+2.118
gery+2.104
ODY+2.015
Neg logit contributions
elsen-4.220
sole-4.186
 Nanto-4.125
anto-3.873
urus-3.829
Tokenvals
Token ID12786
Feature activation-2.67e-01

Pos logit contributions
geries+1.131
GA+1.007
gotten+0.959
gery+0.955
ODY+0.918
Neg logit contributions
elsen-1.743
sole-1.711
 Nanto-1.678
anto-1.585
urus-1.576
Token upcoming
Token ID7865
Feature activation+3.58e-01

Pos logit contributions
elsen+0.907
sole+0.890
 Nanto+0.873
anto+0.802
urus+0.793
Neg logit contributions
geries-1.010
GA-0.930
gotten-0.897
gery-0.895
ODY-0.870
Token revival
Token ID26624
Feature activation-8.54e-01

Pos logit contributions
elsen+1.117
sole+1.096
 Nanto+1.082
anto+0.995
ross+0.976
Neg logit contributions
geries-1.205
GA-1.105
gotten-1.066
gery-1.065
ODY-1.040
Token of
Token ID286
Feature activation-1.53e-01

Pos logit contributions
geries+3.160
GA+2.981
gotten+2.848
gery+2.844
ODY+2.765
Neg logit contributions
elsen-2.330
sole-2.277
 Nanto-2.185
urus-2.028
anto-1.984
Token a
Token ID257
Feature activation-4.54e-01

Pos logit contributions
geries+0.498
GA+0.456
gery+0.438
gotten+0.438
ODY+0.423
Neg logit contributions
elsen-0.499
sole-0.489
 Nanto-0.481
anto-0.443
urus-0.440
Token historic
Token ID9566
Feature activation-1.78e-02

Pos logit contributions
geries+1.531
GA+1.405
gotten+1.356
gery+1.356
ODY+1.303
Neg logit contributions
elsen-1.422
sole-1.393
 Nanto-1.362
anto-1.253
urus-1.249
Token Broadway
Token ID20617
Feature activation-2.76e+00

Pos logit contributions
geries+0.061
GA+0.056
gotten+0.054
gery+0.054
ODY+0.052
Neg logit contributions
elsen-0.055
sole-0.054
 Nanto-0.053
anto-0.048
urus-0.048
Token musical
Token ID10530
Feature activation-8.07e-01

Pos logit contributions
geries+7.582
GA+6.909
gery+6.699
gotten+6.508
YP+6.175
Neg logit contributions
elsen-9.698
sole-9.436
 Nanto-9.363
urus-8.859
urst-8.397
Token has
Token ID468
Feature activation+7.26e-02

Pos logit contributions
geries+2.645
GA+2.425
gery+2.338
gotten+2.335
ODY+2.242
Neg logit contributions
elsen-2.606
sole-2.549
 Nanto-2.497
urus-2.308
anto-2.281
Token gone
Token ID3750
Feature activation+4.32e-01

Pos logit contributions
elsen+0.255
sole+0.250
 Nanto+0.245
anto+0.229
urus+0.227
Neg logit contributions
geries-0.217
GA-0.196
gery-0.188
gotten-0.187
ODY-0.183
Token missing
Token ID4814
Feature activation-3.12e-01

Pos logit contributions
elsen+1.316
sole+1.301
 Nanto+1.263
anto+1.171
urus+1.149
Neg logit contributions
geries-1.493
GA-1.368
gotten-1.322
gery-1.315
ODY-1.273
Token.
Token ID13
Feature activation-2.57e-01

Pos logit contributions
geries+1.026
GA+0.939
gery+0.906
gotten+0.906
ODY+0.874
Neg logit contributions
elsen-1.006
sole-0.982
 Nanto-0.970
anto-0.889
urus-0.885
Token this
Token ID428
Feature activation+5.92e-01

Pos logit contributions
elsen+1.339
sole+1.313
 Nanto+1.281
anto+1.173
urus+1.154
Neg logit contributions
geries-1.743
GA-1.597
gotten-1.557
gery-1.549
ODY-1.511
Token season
Token ID1622
Feature activation+3.84e-01

Pos logit contributions
elsen+1.775
sole+1.729
 Nanto+1.699
anto+1.563
urus+1.541
Neg logit contributions
geries-2.070
GA-1.902
gotten-1.849
gery-1.835
ODY-1.792
Token after
Token ID706
Feature activation+7.81e-02

Pos logit contributions
elsen+1.362
sole+1.337
 Nanto+1.316
anto+1.221
urus+1.214
Neg logit contributions
geries-1.141
GA-1.033
gery-0.990
gotten-0.990
ODY-0.953
Token tearing
Token ID24447
Feature activation-3.35e-01

Pos logit contributions
elsen+0.237
sole+0.232
 Nanto+0.228
anto+0.208
urus+0.207
Neg logit contributions
geries-0.272
GA-0.250
gery-0.241
gotten-0.241
ODY-0.234
Token his
Token ID465
Feature activation-3.77e-01

Pos logit contributions
geries+1.016
GA+0.923
gery+0.889
gotten+0.884
ODY+0.851
Neg logit contributions
elsen-1.169
sole-1.146
 Nanto-1.127
anto-1.042
urus-1.035
Token ACL
Token ID17382
Feature activation-2.71e+00

Pos logit contributions
geries+1.193
GA+1.087
gotten+1.047
gery+1.044
ODY+1.006
Neg logit contributions
elsen-1.263
sole-1.234
 Nanto-1.217
anto-1.120
urus-1.115
Token last
Token ID938
Feature activation+2.71e-01

Pos logit contributions
geries+8.423
GA+8.103
gery+7.647
gotten+7.327
ODY+7.226
Neg logit contributions
elsen-8.223
 Nanto-7.901
sole-7.863
urus-7.256
anto-7.067
Token year
Token ID614
Feature activation+2.30e-01

Pos logit contributions
elsen+0.743
sole+0.726
 Nanto+0.712
anto+0.646
urus+0.638
Neg logit contributions
geries-1.019
GA-0.943
gotten-0.913
gery-0.910
ODY-0.888
Token.
Token ID13
Feature activation-5.05e-02

Pos logit contributions
elsen+0.818
sole+0.804
 Nanto+0.790
anto+0.734
urus+0.729
Neg logit contributions
geries-0.683
GA-0.618
gotten-0.592
gery-0.592
ODY-0.570
Token
Token ID198
Feature activation-2.25e-01

Pos logit contributions
geries+0.174
GA+0.160
gery+0.154
gotten+0.154
ODY+0.149
Neg logit contributions
elsen-0.155
sole-0.152
 Nanto-0.149
anto-0.136
urus-0.135
Token
Token ID198
Feature activation-1.50e-01

Pos logit contributions
geries+0.799
GA+0.731
gotten+0.708
gery+0.707
ODY+0.686
Neg logit contributions
elsen-0.669
sole-0.655
 Nanto-0.640
anto-0.586
urus-0.582
Token he
Token ID339
Feature activation-2.43e-01

Pos logit contributions
geries+1.471
GA+1.360
gery+1.315
gotten+1.310
ODY+1.270
Neg logit contributions
elsen-1.187
sole-1.158
 Nanto-1.136
anto-1.029
urus-1.025
Token spr
Token ID7500
Feature activation-4.81e-01

Pos logit contributions
geries+0.745
GA+0.680
gery+0.653
gotten+0.652
ODY+0.625
Neg logit contributions
elsen-0.837
sole-0.817
 Nanto-0.806
anto-0.743
urus-0.740
Tokenained
Token ID1328
Feature activation-8.68e-01

Pos logit contributions
geries+1.588
GA+1.451
gery+1.400
gotten+1.399
ODY+1.348
Neg logit contributions
elsen-1.547
sole-1.515
 Nanto-1.493
anto-1.367
urus-1.361
Token his
Token ID465
Feature activation-3.00e-01

Pos logit contributions
geries+2.435
GA+2.191
gery+2.116
gotten+2.096
ODY+2.031
Neg logit contributions
elsen-3.177
sole-3.118
 Nanto-3.069
anto-2.840
urus-2.831
Token left
Token ID1364
Feature activation-4.16e-01

Pos logit contributions
geries+0.918
GA+0.834
gotten+0.801
gery+0.800
ODY+0.771
Neg logit contributions
elsen-1.038
sole-1.014
 Nanto-0.999
anto-0.923
urus-0.921
Token ACL
Token ID17382
Feature activation-2.68e+00

Pos logit contributions
geries+1.245
GA+1.118
gotten+1.081
gery+1.076
ODY+1.031
Neg logit contributions
elsen-1.469
sole-1.432
 Nanto-1.418
urus-1.313
anto-1.311
Token after
Token ID706
Feature activation-6.63e-01

Pos logit contributions
geries+7.976
GA+7.623
gery+7.192
gotten+7.010
ODY+6.818
Neg logit contributions
elsen-8.618
sole-8.125
 Nanto-8.122
urus-7.544
anto-7.288
Token a
Token ID257
Feature activation-1.36e-01

Pos logit contributions
geries+2.220
GA+2.045
gery+1.972
gotten+1.954
ODY+1.872
Neg logit contributions
elsen-2.113
sole-2.053
 Nanto-2.011
anto-1.846
urus-1.838
Token vicious
Token ID17686
Feature activation+8.63e-01

Pos logit contributions
geries+0.473
GA+0.437
gery+0.421
gotten+0.420
ODY+0.406
Neg logit contributions
elsen-0.412
sole-0.402
 Nanto-0.394
anto-0.360
urus-0.358
Token back
Token ID736
Feature activation-2.42e-01

Pos logit contributions
elsen+2.533
sole+2.515
 Nanto+2.480
anto+2.273
urus+2.240
Neg logit contributions
geries-3.012
GA-2.755
gotten-2.692
gery-2.647
ODY-2.613
Tokenhand
Token ID4993
Feature activation-9.95e-01

Pos logit contributions
geries+0.787
GA+0.721
gery+0.694
gotten+0.689
ODY+0.666
Neg logit contributions
elsen-0.795
sole-0.771
 Nanto-0.762
anto-0.702
urus-0.700
Tokenirk
Token ID14232
Feature activation+4.36e-01

Pos logit contributions
elsen+2.986
sole+2.890
 Nanto+2.798
anto+2.623
urus+2.606
Neg logit contributions
geries-3.790
GA-3.448
gotten-3.394
gery-3.351
ODY-3.271
Token Cousins
Token ID32644
Feature activation-5.46e-01

Pos logit contributions
elsen+1.164
sole+1.141
 Nanto+1.112
anto+1.014
urus+1.013
Neg logit contributions
geries-1.667
GA-1.543
gotten-1.484
gery-1.477
ODY-1.456
Token (
Token ID357
Feature activation+2.32e-01

Pos logit contributions
geries+1.833
GA+1.707
gery+1.631
gotten+1.611
ODY+1.579
Neg logit contributions
elsen-1.719
sole-1.679
 Nanto-1.648
anto-1.506
urus-1.494
Tokenvs
Token ID14259
Feature activation-8.73e-01

Pos logit contributions
elsen+0.477
sole+0.462
 Nanto+0.449
anto+0.391
urus+0.387
Neg logit contributions
geries-1.036
GA-0.972
gery-0.946
gotten-0.946
ODY-0.923
Token.
Token ID13
Feature activation-6.01e-01

Pos logit contributions
geries+2.859
GA+2.706
gery+2.561
gotten+2.545
ODY+2.485
Neg logit contributions
elsen-2.773
sole-2.735
 Nanto-2.644
anto-2.448
urus-2.415
Token BU
Token ID20571
Feature activation-2.54e+00

Pos logit contributions
geries+1.951
GA+1.825
gery+1.736
gotten+1.724
ODY+1.680
Neg logit contributions
elsen-1.949
sole-1.920
 Nanto-1.860
anto-1.727
urus-1.703
TokenF
Token ID37
Feature activation-5.81e-02

Pos logit contributions
GA+5.269
geries+5.155
ODY+4.486
gery+4.423
YP+4.410
Neg logit contributions
elsen-11.022
sole-10.798
 Nanto-10.538
urus-10.010
urst-9.765
Token):
Token ID2599
Feature activation+6.43e-02

Pos logit contributions
geries+0.247
GA+0.229
gotten+0.224
gery+0.224
ODY+0.218
Neg logit contributions
elsen-0.132
sole-0.128
 Nanto-0.125
anto-0.111
urus-0.110
Token He
Token ID679
Feature activation-6.10e-01

Pos logit contributions
elsen+0.204
sole+0.199
 Nanto+0.196
anto+0.180
urus+0.179
Neg logit contributions
geries-0.215
GA-0.197
gery-0.190
gotten-0.190
ODY-0.184
Token's
Token ID338
Feature activation+7.76e-02

Pos logit contributions
geries+1.762
GA+1.629
gotten+1.554
gery+1.552
ODY+1.476
Neg logit contributions
elsen-2.174
sole-2.140
 Nanto-2.111
anto-1.946
urus-1.937
Token been
Token ID587
Feature activation-2.56e-01

Pos logit contributions
elsen+0.238
sole+0.233
 Nanto+0.229
anto+0.210
urus+0.209
Neg logit contributions
geries-0.267
GA-0.245
gery-0.236
gotten-0.236
ODY-0.229
Tokender
Token ID1082
Feature activation+4.87e-01

Pos logit contributions
geries+1.205
GA+1.091
gery+1.046
gotten+1.044
ODY+1.003
Neg logit contributions
elsen-1.488
sole-1.461
 Nanto-1.441
anto-1.336
urus-1.329
Token's
Token ID338
Feature activation+1.56e-01

Pos logit contributions
elsen+1.728
sole+1.696
 Nanto+1.671
anto+1.552
urus+1.539
Neg logit contributions
geries-1.446
GA-1.304
gotten-1.257
gery-1.252
ODY-1.203
Token season
Token ID1622
Feature activation+3.25e-01

Pos logit contributions
elsen+0.476
sole+0.466
 Nanto+0.457
anto+0.420
urus+0.415
Neg logit contributions
geries-0.539
GA-0.495
gotten-0.478
gery-0.478
ODY-0.463
Token with
Token ID351
Feature activation+8.94e-02

Pos logit contributions
elsen+1.100
sole+1.084
 Nanto+1.066
anto+0.989
urus+0.978
Neg logit contributions
geries-1.014
GA-0.914
gotten-0.888
gery-0.882
ODY-0.851
Token an
Token ID281
Feature activation-1.71e-01

Pos logit contributions
elsen+0.305
sole+0.300
 Nanto+0.294
anto+0.272
urus+0.271
Neg logit contributions
geries-0.277
GA-0.252
gotten-0.242
gery-0.242
ODY-0.234
Token ACL
Token ID17382
Feature activation-2.50e+00

Pos logit contributions
geries+0.506
GA+0.463
gery+0.443
gotten+0.438
ODY+0.425
Neg logit contributions
elsen-0.606
sole-0.595
 Nanto-0.587
anto-0.542
urus-0.542
Token sp
Token ID599
Feature activation+9.72e-02

Pos logit contributions
geries+6.525
GA+6.266
gery+5.789
ODY+5.518
YP+5.421
Neg logit contributions
elsen-9.141
sole-8.711
 Nanto-8.554
urus-7.966
anto-7.949
Tokenrain
Token ID3201
Feature activation-1.02e+00

Pos logit contributions
elsen+0.390
sole+0.384
 Nanto+0.372
anto+0.359
urus+0.355
Neg logit contributions
geries-0.241
GA-0.214
gotten-0.205
gery-0.199
YP-0.197
Token.
Token ID13
Feature activation+6.42e-02

Pos logit contributions
geries+3.046
GA+2.823
gery+2.689
gotten+2.623
ODY+2.585
Neg logit contributions
elsen-3.570
sole-3.475
 Nanto-3.398
urus-3.169
anto-3.148
Token Brian
Token ID8403
Feature activation+3.48e-01

Pos logit contributions
elsen+0.209
sole+0.205
 Nanto+0.201
anto+0.185
urus+0.184
Neg logit contributions
geries-0.209
GA-0.192
gery-0.184
gotten-0.184
ODY-0.178
Token Bab
Token ID12400
Feature activation-4.56e-01

Pos logit contributions
elsen+0.965
sole+0.939
 Nanto+0.927
anto+0.836
ross+0.827
Neg logit contributions
geries-1.298
GA-1.196
gery-1.160
gotten-1.160
ODY-1.130
Token was
Token ID373
Feature activation+4.78e-01

Pos logit contributions
elsen+0.823
sole+0.804
 Nanto+0.791
anto+0.729
urus+0.726
Neg logit contributions
geries-0.831
GA-0.756
gotten-0.731
gery-0.728
ODY-0.707
Token also
Token ID635
Feature activation+1.57e-01

Pos logit contributions
elsen+1.471
sole+1.443
 Nanto+1.413
anto+1.294
urus+1.289
Neg logit contributions
geries-1.639
GA-1.502
gotten-1.450
gery-1.447
ODY-1.408
Token pursued
Token ID19189
Feature activation+1.42e-02

Pos logit contributions
elsen+0.487
sole+0.476
 Nanto+0.468
anto+0.429
urus+0.426
Neg logit contributions
geries-0.533
GA-0.490
gery-0.472
gotten-0.472
ODY-0.457
Token heavily
Token ID7272
Feature activation-3.98e-01

Pos logit contributions
elsen+0.042
sole+0.041
 Nanto+0.040
anto+0.037
urus+0.037
Neg logit contributions
geries-0.050
GA-0.046
gery-0.045
gotten-0.045
ODY-0.043
Token by
Token ID416
Feature activation-2.21e-01

Pos logit contributions
geries+1.408
GA+1.298
gery+1.255
gotten+1.252
ODY+1.215
Neg logit contributions
elsen-1.184
sole-1.154
 Nanto-1.131
anto-1.032
urus-1.027
Token Notre
Token ID23382
Feature activation-2.44e+00

Pos logit contributions
geries+0.728
GA+0.667
gery+0.642
gotten+0.641
ODY+0.620
Neg logit contributions
elsen-0.710
sole-0.695
 Nanto-0.683
anto-0.629
urus-0.624
Token Dame
Token ID20377
Feature activation-5.84e-01

Pos logit contributions
geries+5.027
GA+4.619
gotten+4.306
gery+4.154
YP+3.988
Neg logit contributions
elsen-10.087
 Nanto-10.035
sole-9.907
urst-9.150
urus-9.103
Token,
Token ID11
Feature activation-2.09e-01

Pos logit contributions
geries+1.935
GA+1.779
gery+1.710
gotten+1.707
ODY+1.655
Neg logit contributions
elsen-1.866
sole-1.827
 Nanto-1.797
anto-1.647
urus-1.638
Token which
Token ID543
Feature activation+5.27e-01

Pos logit contributions
geries+0.725
GA+0.673
gotten+0.646
gery+0.646
ODY+0.627
Neg logit contributions
elsen-0.635
sole-0.617
 Nanto-0.605
anto-0.552
urus-0.549
Token was
Token ID373
Feature activation+4.33e-01

Pos logit contributions
elsen+1.650
sole+1.621
 Nanto+1.601
anto+1.475
urus+1.456
Neg logit contributions
geries-1.773
GA-1.607
gotten-1.561
gery-1.552
ODY-1.503
Token the
Token ID262
Feature activation+7.14e-02

Pos logit contributions
elsen+1.323
sole+1.300
 Nanto+1.278
anto+1.168
urus+1.160
Neg logit contributions
geries-1.494
GA-1.366
gotten-1.321
gery-1.319
ODY-1.279
Token them
Token ID606
Feature activation+2.30e-01

Pos logit contributions
elsen+2.338
sole+2.303
 Nanto+2.246
anto+2.080
urus+2.064
Neg logit contributions
geries-2.256
GA-2.053
gotten-1.977
gery-1.975
ODY-1.901
Token near
Token ID1474
Feature activation+1.55e-01

Pos logit contributions
elsen+0.651
sole+0.636
 Nanto+0.623
anto+0.566
urus+0.562
Neg logit contributions
geries-0.845
GA-0.781
gery-0.756
gotten-0.755
ODY-0.733
Token the
Token ID262
Feature activation-2.21e-01

Pos logit contributions
elsen+0.540
sole+0.531
 Nanto+0.522
anto+0.483
urus+0.480
Neg logit contributions
geries-0.472
GA-0.427
gery-0.411
gotten-0.410
ODY-0.395
Token sunk
Token ID24790
Feature activation-5.38e-01

Pos logit contributions
geries+0.739
GA+0.679
gotten+0.656
gery+0.654
ODY+0.632
Neg logit contributions
elsen-0.701
sole-0.685
 Nanto-0.675
anto-0.620
urus-0.617
Tokenen
Token ID268
Feature activation-8.27e-01

Pos logit contributions
geries+1.588
GA+1.455
gery+1.400
gotten+1.397
ODY+1.323
Neg logit contributions
elsen-1.874
sole-1.843
 Nanto-1.837
anto-1.687
urus-1.685
Token fireplace
Token ID44250
Feature activation-2.43e+00

Pos logit contributions
geries+2.454
GA+2.241
gery+2.159
gotten+2.152
ODY+2.059
Neg logit contributions
elsen-2.891
sole-2.824
 Nanto-2.799
urus-2.600
anto-2.590
Token.
Token ID13
Feature activation-7.77e-02

Pos logit contributions
geries+6.370
GA+6.064
gotten+5.741
gery+5.669
ODY+5.497
Neg logit contributions
elsen-8.571
sole-8.105
 Nanto-8.037
urus-7.694
anto-7.518
Token After
Token ID2293
Feature activation+2.96e-01

Pos logit contributions
geries+0.261
GA+0.241
gery+0.231
gotten+0.231
ODY+0.224
Neg logit contributions
elsen-0.245
sole-0.240
 Nanto-0.235
anto-0.216
urus-0.215
Token that
Token ID326
Feature activation-2.92e-02

Pos logit contributions
elsen+0.907
sole+0.889
 Nanto+0.876
anto+0.801
urus+0.790
Neg logit contributions
geries-1.016
GA-0.932
gery-0.897
gotten-0.897
ODY-0.875
Token,
Token ID11
Feature activation+4.38e-02

Pos logit contributions
geries+0.089
GA+0.081
gery+0.078
gotten+0.078
ODY+0.075
Neg logit contributions
elsen-0.101
sole-0.099
 Nanto-0.097
anto-0.090
urus-0.090
Token he
Token ID339
Feature activation+1.81e-01

Pos logit contributions
elsen+0.123
sole+0.120
 Nanto+0.118
anto+0.107
urus+0.106
Neg logit contributions
geries-0.162
GA-0.150
gery-0.145
gotten-0.145
ODY-0.141
Token in
Token ID287
Feature activation-4.55e-01

Pos logit contributions
geries+3.352
GA+3.095
gery+2.970
gotten+2.966
ODY+2.873
Neg logit contributions
elsen-2.881
sole-2.812
 Nanto-2.765
urus-2.522
anto-2.515
Token F
Token ID376
Feature activation+1.16e-01

Pos logit contributions
geries+1.502
GA+1.372
gery+1.321
gotten+1.320
ODY+1.270
Neg logit contributions
elsen-1.461
sole-1.431
 Nanto-1.406
anto-1.294
urus-1.287
Tokenela
Token ID10304
Feature activation-8.98e-01

Pos logit contributions
elsen+0.366
sole+0.354
 Nanto+0.346
anto+0.322
urus+0.320
Neg logit contributions
geries-0.393
GA-0.359
gotten-0.349
gery-0.347
ODY-0.335
Token!
Token ID0
Feature activation-2.46e-01

Pos logit contributions
geries+3.200
GA+2.972
gotten+2.888
gery+2.856
ODY+2.764
Neg logit contributions
sole-2.566
elsen-2.563
 Nanto-2.474
urus-2.261
anto-2.253
Token on
Token ID319
Feature activation-8.87e-01

Pos logit contributions
geries+0.818
GA+0.754
gery+0.723
gotten+0.722
ODY+0.699
Neg logit contributions
elsen-0.779
sole-0.764
 Nanto-0.750
urus-0.687
anto-0.685
Token Broadway
Token ID20617
Feature activation-2.43e+00

Pos logit contributions
geries+2.420
GA+2.182
gery+2.084
gotten+2.071
YP+1.968
Neg logit contributions
elsen-3.338
sole-3.277
 Nanto-3.228
urus-3.011
anto-3.001
Token,
Token ID11
Feature activation-3.41e-01

Pos logit contributions
geries+6.953
GA+6.542
gery+6.071
YP+5.995
gotten+5.978
Neg logit contributions
elsen-8.225
sole-7.835
 Nanto-7.764
urus-7.194
anto-7.148
Token and
Token ID290
Feature activation+1.39e-01

Pos logit contributions
geries+1.138
GA+1.046
gotten+1.006
gery+1.005
ODY+0.973
Neg logit contributions
elsen-1.083
sole-1.061
 Nanto-1.038
anto-0.955
urus-0.951
Token national
Token ID2260
Feature activation+4.13e-01

Pos logit contributions
elsen+0.432
sole+0.422
 Nanto+0.417
anto+0.383
urus+0.379
Neg logit contributions
geries-0.472
GA-0.431
gery-0.417
gotten-0.416
ODY-0.404
Token and
Token ID290
Feature activation-1.24e-01

Pos logit contributions
elsen+1.314
sole+1.293
 Nanto+1.257
anto+1.176
ross+1.161
Neg logit contributions
geries-1.360
GA-1.239
gotten-1.207
gery-1.198
ODY-1.172
Token international
Token ID3230
Feature activation-1.94e-02

Pos logit contributions
geries+0.361
GA+0.327
gery+0.313
gotten+0.312
ODY+0.299
Neg logit contributions
elsen-0.445
sole-0.437
 Nanto-0.430
anto-0.399
urus-0.397
Token cent
Token ID1247
Feature activation+6.93e-01

Pos logit contributions
geries+0.883
GA+0.813
gery+0.788
gotten+0.786
ODY+0.760
Neg logit contributions
elsen-0.697
sole-0.680
 Nanto-0.667
anto-0.604
urus-0.603
Token had
Token ID550
Feature activation-3.29e-01

Pos logit contributions
elsen+2.206
sole+2.180
 Nanto+2.122
anto+1.975
urus+1.956
Neg logit contributions
geries-2.252
GA-2.060
gotten-1.999
gery-1.970
ODY-1.946
Token been
Token ID587
Feature activation-2.09e-01

Pos logit contributions
geries+1.015
GA+0.923
gery+0.886
gotten+0.886
ODY+0.853
Neg logit contributions
elsen-1.130
sole-1.108
 Nanto-1.090
anto-1.008
urus-1.002
Token approved
Token ID6325
Feature activation-1.38e+00

Pos logit contributions
geries+0.684
GA+0.624
gery+0.603
gotten+0.602
ODY+0.577
Neg logit contributions
elsen-0.683
sole-0.668
 Nanto-0.657
anto-0.604
urus-0.601
Token by
Token ID416
Feature activation-6.54e-02

Pos logit contributions
geries+4.485
GA+4.055
gery+3.967
gotten+3.860
ODY+3.731
Neg logit contributions
elsen-4.461
 Nanto-4.332
sole-4.226
urus-3.955
anto-3.905
Token Congress
Token ID3162
Feature activation-2.41e+00

Pos logit contributions
geries+0.204
GA+0.186
gery+0.179
gotten+0.179
ODY+0.172
Neg logit contributions
elsen-0.222
sole-0.218
 Nanto-0.214
anto-0.198
urus-0.197
Token,
Token ID11
Feature activation-4.02e-01

Pos logit contributions
geries+7.150
GA+6.485
gery+6.349
gotten+5.949
ODY+5.870
Neg logit contributions
elsen-8.054
 Nanto-7.999
sole-7.497
urus-7.091
anto-7.091
Token and
Token ID290
Feature activation-3.26e-02

Pos logit contributions
geries+1.355
GA+1.249
gery+1.199
gotten+1.197
ODY+1.158
Neg logit contributions
elsen-1.268
sole-1.234
 Nanto-1.217
anto-1.114
urus-1.108
Token the
Token ID262
Feature activation-2.12e-01

Pos logit contributions
geries+0.107
GA+0.098
gotten+0.094
gery+0.094
ODY+0.091
Neg logit contributions
elsen-0.106
sole-0.103
 Nanto-0.102
anto-0.094
urus-0.093
Token new
Token ID649
Feature activation-3.76e-01

Pos logit contributions
geries+0.741
GA+0.680
gery+0.658
gotten+0.654
ODY+0.633
Neg logit contributions
elsen-0.646
sole-0.630
 Nanto-0.620
anto-0.565
urus-0.564
Token metal
Token ID6147
Feature activation+2.36e-01

Pos logit contributions
geries+1.360
GA+1.251
gery+1.214
gotten+1.208
ODY+1.167
Neg logit contributions
elsen-1.097
sole-1.067
 Nanto-1.049
urus-0.950
anto-0.949
Token.
Token ID13
Feature activation-1.93e-01

Pos logit contributions
geries+2.582
GA+2.375
gery+2.278
gotten+2.228
ODY+2.175
Neg logit contributions
elsen-3.335
sole-3.263
 Nanto-3.256
anto-3.008
urus-2.962
Token Gand
Token ID17727
Feature activation+2.30e-01

Pos logit contributions
geries+0.617
GA+0.562
gotten+0.541
gery+0.541
ODY+0.521
Neg logit contributions
elsen-0.641
sole-0.628
 Nanto-0.618
anto-0.570
urus-0.566
Tokenalf
Token ID1604
Feature activation-1.28e-02

Pos logit contributions
elsen+0.784
sole+0.777
 Nanto+0.730
anto+0.716
ross+0.699
Neg logit contributions
geries-0.712
GA-0.631
gotten-0.610
gery-0.605
YP-0.601
Token convin
Token ID9120
Feature activation+1.13e+00

Pos logit contributions
geries+0.043
GA+0.039
gotten+0.038
gery+0.038
ODY+0.037
Neg logit contributions
elsen-0.040
sole-0.040
 Nanto-0.039
anto-0.036
urus-0.036
Tokences
Token ID728
Feature activation+5.66e-01

Pos logit contributions
sole+3.698
elsen+3.676
 Nanto+3.445
urus+3.373
anto+3.334
Neg logit contributions
geries-3.317
GA-3.080
gotten-3.067
ODY-3.051
��-2.977
Token Tree
Token ID12200
Feature activation-2.38e+00

Pos logit contributions
elsen+1.838
sole+1.801
 Nanto+1.766
anto+1.639
urus+1.619
Neg logit contributions
geries-1.832
GA-1.656
gotten-1.606
gery-1.585
YP-1.554
Tokenbeard
Token ID39433
Feature activation-3.29e-01

Pos logit contributions
geries+4.548
GA+4.088
gery+3.945
gotten+3.899
ODY+3.457
Neg logit contributions
elsen-10.286
 Nanto-10.130
sole-10.016
urus-9.402
anto-9.275
Token to
Token ID284
Feature activation+3.01e-01

Pos logit contributions
geries+1.199
GA+1.114
gery+1.078
gotten+1.061
ODY+1.037
Neg logit contributions
elsen-0.940
sole-0.921
 Nanto-0.897
urus-0.816
anto-0.809
Token send
Token ID3758
Feature activation+2.85e-01

Pos logit contributions
elsen+0.864
sole+0.846
 Nanto+0.830
anto+0.758
urus+0.750
Neg logit contributions
geries-1.093
GA-1.006
gotten-0.975
gery-0.971
ODY-0.949
Token an
Token ID281
Feature activation-4.09e-02

Pos logit contributions
elsen+0.992
sole+0.971
 Nanto+0.955
anto+0.887
urus+0.879
Neg logit contributions
geries-0.862
GA-0.782
gotten-0.752
gery-0.748
ODY-0.722
Token army
Token ID5428
Feature activation+4.50e-01

Pos logit contributions
geries+0.141
GA+0.129
gery+0.125
gotten+0.125
ODY+0.121
Neg logit contributions
elsen-0.126
sole-0.123
 Nanto-0.121
anto-0.111
urus-0.110
Token they
Token ID484
Feature activation+6.36e-01

Pos logit contributions
geries+0.011
GA+0.010
gotten+0.010
gery+0.010
ODY+0.010
Neg logit contributions
elsen-0.011
sole-0.011
 Nanto-0.011
anto-0.010
urus-0.010
Token are
Token ID389
Feature activation+6.44e-01

Pos logit contributions
elsen+1.785
sole+1.758
 Nanto+1.719
anto+1.581
urus+1.559
Neg logit contributions
geries-2.349
GA-2.151
gotten-2.091
gery-2.090
ODY-2.033
Token befriend
Token ID43548
Feature activation-7.18e-02

Pos logit contributions
elsen+1.919
sole+1.889
 Nanto+1.848
anto+1.691
urus+1.675
Neg logit contributions
geries-2.264
GA-2.071
gery-2.001
gotten-1.998
ODY-1.940
Tokened
Token ID276
Feature activation+1.35e-01

Pos logit contributions
geries+0.294
GA+0.273
gery+0.265
gotten+0.264
ODY+0.258
Neg logit contributions
elsen-0.176
sole-0.171
 Nanto-0.166
anto-0.149
urus-0.147
Token by
Token ID416
Feature activation+2.78e-01

Pos logit contributions
elsen+0.403
sole+0.394
 Nanto+0.387
anto+0.353
urus+0.351
Neg logit contributions
geries-0.477
GA-0.439
gery-0.424
gotten-0.424
ODY-0.411
Token Tree
Token ID12200
Feature activation-2.35e+00

Pos logit contributions
elsen+0.924
sole+0.905
 Nanto+0.893
anto+0.823
urus+0.816
Neg logit contributions
geries-0.886
GA-0.810
gery-0.775
gotten-0.773
YP-0.750
Tokenbeard
Token ID39433
Feature activation-5.13e-01

Pos logit contributions
geries+5.951
GA+5.578
gotten+5.292
gery+5.288
ODY+5.045
Neg logit contributions
sole-8.288
elsen-8.260
 Nanto-8.131
anto-7.521
urst-7.477
Token,
Token ID11
Feature activation-3.52e-01

Pos logit contributions
geries+1.578
GA+1.447
gery+1.387
gotten+1.383
ODY+1.338
Neg logit contributions
elsen-1.751
sole-1.717
 Nanto-1.685
anto-1.554
urus-1.548
Token the
Token ID262
Feature activation-1.59e-01

Pos logit contributions
geries+1.123
GA+1.029
gotten+0.990
gery+0.989
ODY+0.956
Neg logit contributions
elsen-1.164
sole-1.140
 Nanto-1.118
anto-1.032
urus-1.028
Token oldest
Token ID13325
Feature activation-1.43e-01

Pos logit contributions
geries+0.527
GA+0.483
gery+0.466
gotten+0.466
ODY+0.449
Neg logit contributions
elsen-0.511
sole-0.501
 Nanto-0.491
anto-0.452
urus-0.449
Token of
Token ID286
Feature activation+7.22e-01

Pos logit contributions
geries+0.517
GA+0.477
gery+0.461
gotten+0.461
ODY+0.447
Neg logit contributions
elsen-0.412
sole-0.403
 Nanto-0.395
anto-0.359
urus-0.357
Token and
Token ID290
Feature activation+1.65e-01

Pos logit contributions
elsen+0.584
sole+0.571
 Nanto+0.559
anto+0.508
urus+0.503
Neg logit contributions
geries-0.772
GA-0.713
gery-0.689
gotten-0.689
ODY-0.669
Token displeasure
Token ID47313
Feature activation+3.80e-01

Pos logit contributions
elsen+0.591
sole+0.581
 Nanto+0.571
anto+0.532
urus+0.527
Neg logit contributions
geries-0.487
GA-0.437
gery-0.419
gotten-0.419
ODY-0.406
Token at
Token ID379
Feature activation+3.68e-01

Pos logit contributions
elsen+1.044
sole+1.024
 Nanto+1.002
anto+0.909
urus+0.895
Neg logit contributions
geries-1.428
GA-1.314
gery-1.273
gotten-1.270
ODY-1.238
Token how
Token ID703
Feature activation+6.39e-01

Pos logit contributions
elsen+1.225
sole+1.212
 Nanto+1.186
anto+1.097
urus+1.090
Neg logit contributions
geries-1.158
GA-1.060
gery-1.013
gotten-1.010
ODY-0.988
Token his
Token ID465
Feature activation-4.91e-02

Pos logit contributions
elsen+2.073
sole+2.056
 Nanto+2.001
anto+1.863
urus+1.845
Neg logit contributions
geries-2.044
GA-1.879
gotten-1.800
gery-1.800
ODY-1.757
Token code
Token ID2438
Feature activation-2.34e+00

Pos logit contributions
geries+0.162
GA+0.148
gery+0.143
gotten+0.143
ODY+0.138
Neg logit contributions
elsen-0.158
sole-0.154
 Nanto-0.152
anto-0.140
urus-0.139
Token has
Token ID468
Feature activation-6.36e-01

Pos logit contributions
geries+6.818
GA+6.225
gery+6.049
gotten+6.048
YP+5.561
Neg logit contributions
elsen-8.005
 Nanto-7.575
sole-7.562
urst-6.977
urus-6.933
Token since
Token ID1201
Feature activation-2.55e-01

Pos logit contributions
geries+1.691
GA+1.514
gotten+1.452
gery+1.443
ODY+1.369
Neg logit contributions
elsen-2.459
sole-2.409
 Nanto-2.382
anto-2.218
urus-2.207
Token been
Token ID587
Feature activation-2.71e-01

Pos logit contributions
geries+0.682
GA+0.611
gery+0.582
gotten+0.580
ODY+0.562
Neg logit contributions
elsen-0.975
sole-0.961
 Nanto-0.944
anto-0.883
urus-0.879
Token abused
Token ID15519
Feature activation-3.99e-01

Pos logit contributions
geries+0.851
GA+0.775
gery+0.745
gotten+0.745
ODY+0.717
Neg logit contributions
elsen-0.916
sole-0.898
 Nanto-0.883
anto-0.816
urus-0.811
Token by
Token ID416
Feature activation-1.05e-01

Pos logit contributions
geries+1.284
GA+1.174
gery+1.130
gotten+1.128
ODY+1.085
Neg logit contributions
elsen-1.314
sole-1.284
 Nanto-1.268
anto-1.168
urus-1.161
Token restroom
Token ID40018
Feature activation-1.64e+00

Pos logit contributions
elsen+1.407
sole+1.376
 Nanto+1.348
anto+1.251
urus+1.249
Neg logit contributions
geries-1.173
GA-1.050
gotten-1.012
gery-1.004
ODY-0.979
Token,
Token ID11
Feature activation-2.75e-01

Pos logit contributions
geries+4.624
GA+4.205
gery+4.059
gotten+3.978
ODY+3.794
Neg logit contributions
elsen-5.907
 Nanto-5.782
sole-5.663
urus-5.343
anto-5.305
Token no
Token ID645
Feature activation-6.31e-01

Pos logit contributions
geries+0.876
GA+0.799
gery+0.768
gotten+0.768
ODY+0.741
Neg logit contributions
elsen-0.914
sole-0.896
 Nanto-0.880
anto-0.812
urus-0.807
Token current
Token ID1459
Feature activation-1.59e-01

Pos logit contributions
geries+2.207
GA+2.046
gery+1.972
gotten+1.957
ODY+1.905
Neg logit contributions
elsen-1.889
sole-1.857
 Nanto-1.833
anto-1.660
urus-1.652
Token statewide
Token ID23605
Feature activation-1.44e-01

Pos logit contributions
geries+0.598
GA+0.553
gotten+0.536
gery+0.535
ODY+0.520
Neg logit contributions
elsen-0.441
sole-0.430
 Nanto-0.421
anto-0.382
urus-0.379
Token statute
Token ID14195
Feature activation-2.34e+00

Pos logit contributions
geries+0.548
GA+0.508
gotten+0.492
gery+0.492
ODY+0.478
Neg logit contributions
elsen-0.388
sole-0.379
 Nanto-0.370
anto-0.335
urus-0.332
Token addresses
Token ID9405
Feature activation-7.90e-02

Pos logit contributions
geries+6.353
GA+5.681
gery+5.593
gotten+5.275
ODY+5.053
Neg logit contributions
elsen-8.498
 Nanto-8.099
sole-8.017
urst-7.440
urus-7.406
Token entering
Token ID8218
Feature activation-1.72e-01

Pos logit contributions
geries+0.247
GA+0.225
gotten+0.216
gery+0.215
ODY+0.209
Neg logit contributions
elsen-0.267
sole-0.263
 Nanto-0.255
anto-0.238
urus-0.237
Token an
Token ID281
Feature activation-1.25e-01

Pos logit contributions
geries+0.546
GA+0.498
gotten+0.479
gery+0.478
ODY+0.462
Neg logit contributions
elsen-0.576
sole-0.565
 Nanto-0.554
anto-0.512
urus-0.510
Token opposite
Token ID6697
Feature activation-1.14e-02

Pos logit contributions
geries+0.414
GA+0.380
gery+0.366
gotten+0.365
ODY+0.353
Neg logit contributions
elsen-0.401
sole-0.393
 Nanto-0.386
anto-0.355
urus-0.353
Token-
Token ID12
Feature activation-7.21e-03

Pos logit contributions
geries+0.032
GA+0.028
gotten+0.027
gery+0.027
ODY+0.026
Neg logit contributions
elsen-0.043
sole-0.042
 Nanto-0.041
anto-0.038
urus-0.038
Token
Token ID198
Feature activation-3.12e-01

Pos logit contributions
geries+1.087
GA+1.000
gotten+0.965
gery+0.965
ODY+0.934
Neg logit contributions
elsen-0.938
sole-0.918
 Nanto-0.900
anto-0.824
urus-0.818
Token5
Token ID20
Feature activation-9.35e-01

Pos logit contributions
geries+1.008
GA+0.921
gery+0.886
gotten+0.886
ODY+0.855
Neg logit contributions
elsen-1.023
sole-1.003
 Nanto-0.985
anto-0.908
urus-0.902
Token.
Token ID13
Feature activation-8.58e-01

Pos logit contributions
geries+2.508
GA+2.298
gery+2.167
gotten+2.159
ODY+2.078
Neg logit contributions
elsen-3.545
sole-3.483
 Nanto-3.435
anto-3.202
urus-3.176
Token He
Token ID679
Feature activation-4.63e-01

Pos logit contributions
geries+2.641
GA+2.432
gery+2.312
gotten+2.299
ODY+2.200
Neg logit contributions
elsen-2.911
sole-2.867
 Nanto-2.805
anto-2.605
urst-2.586
Token Works
Token ID10933
Feature activation-3.80e-01

Pos logit contributions
geries+1.454
GA+1.328
gery+1.276
gotten+1.272
ODY+1.222
Neg logit contributions
elsen-1.562
sole-1.530
 Nanto-1.502
anto-1.390
urus-1.379
Token For
Token ID1114
Feature activation-2.31e+00

Pos logit contributions
geries+1.327
GA+1.228
gery+1.182
gotten+1.178
YP+1.134
Neg logit contributions
elsen-1.143
sole-1.109
 Nanto-1.098
anto-0.999
urus-0.997
Token Flynn
Token ID14654
Feature activation-3.85e-01

Pos logit contributions
geries+6.939
GA+6.316
gery+6.096
gotten+5.918
YP+5.561
Neg logit contributions
elsen-7.710
sole-7.492
 Nanto-7.308
urst-6.972
urus-6.824
Token Intel
Token ID8180
Feature activation+3.87e-01

Pos logit contributions
geries+1.220
GA+1.116
gery+1.070
gotten+1.069
ODY+1.031
Neg logit contributions
elsen-1.287
sole-1.262
 Nanto-1.241
anto-1.143
urus-1.139
Token Group
Token ID4912
Feature activation+3.10e-01

Pos logit contributions
elsen+1.163
sole+1.131
 Nanto+1.112
anto+1.015
urus+1.009
Neg logit contributions
geries-1.363
GA-1.250
gery-1.218
gotten-1.209
ODY-1.176
Token,
Token ID11
Feature activation-3.75e-01

Pos logit contributions
elsen+0.940
sole+0.920
 Nanto+0.901
anto+0.827
urus+0.820
Neg logit contributions
geries-1.084
GA-0.994
gotten-0.962
gery-0.961
ODY-0.933
Token Which
Token ID9022
Feature activation+5.25e-01

Pos logit contributions
geries+1.221
GA+1.126
gery+1.077
gotten+1.075
ODY+1.035
Neg logit contributions
elsen-1.216
sole-1.191
 Nanto-1.167
anto-1.074
urus-1.068
Token
Token ID198
Feature activation+2.48e-03

Pos logit contributions
elsen+1.415
sole+1.384
 Nanto+1.358
anto+1.240
urus+1.231
Neg logit contributions
geries-1.680
GA-1.547
gotten-1.494
gery-1.494
ODY-1.447
Token
Token ID198
Feature activation+5.16e-02

Pos logit contributions
elsen+0.007
sole+0.007
 Nanto+0.007
anto+0.006
urus+0.006
Neg logit contributions
geries-0.009
GA-0.008
gotten-0.008
gery-0.008
ODY-0.008
Token'
Token ID6
Feature activation+2.15e-01

Pos logit contributions
elsen+0.165
sole+0.162
 Nanto+0.157
anto+0.146
urus+0.145
Neg logit contributions
geries-0.172
GA-0.155
gery-0.151
gotten-0.151
ODY-0.145
TokenA
Token ID32
Feature activation-1.14e+00

Pos logit contributions
elsen+0.702
sole+0.697
 Nanto+0.674
anto+0.626
urus+0.623
Neg logit contributions
geries-0.700
GA-0.621
gery-0.611
gotten-0.608
ODY-0.581
Token wonderful
Token ID7932
Feature activation-4.11e-01

Pos logit contributions
geries+3.664
GA+3.383
gotten+3.262
gery+3.257
ODY+3.136
Neg logit contributions
elsen-3.676
sole-3.590
 Nanto-3.551
anto-3.283
urus-3.253
Token angel
Token ID18304
Feature activation-2.30e+00

Pos logit contributions
geries+1.334
GA+1.223
gery+1.180
gotten+1.176
ODY+1.137
Neg logit contributions
elsen-1.336
sole-1.305
 Nanto-1.283
anto-1.182
urus-1.178
Token'
Token ID6
Feature activation+7.56e-01

Pos logit contributions
geries+8.118
GA+7.701
gotten+7.467
gery+7.446
ODY+7.293
Neg logit contributions
elsen-6.136
 Nanto-5.873
sole-5.786
urst-5.186
urus-5.109
Token
Token ID198
Feature activation+2.21e-01

Pos logit contributions
elsen+2.180
sole+2.131
 Nanto+2.086
anto+1.921
urus+1.893
Neg logit contributions
geries-2.737
GA-2.497
gotten-2.427
gery-2.425
ODY-2.348
Token
Token ID198
Feature activation+2.83e-01

Pos logit contributions
elsen+0.614
sole+0.607
 Nanto+0.583
anto+0.535
ross+0.532
Neg logit contributions
geries-0.829
GA-0.745
gotten-0.731
gery-0.728
��-0.714
TokenA
Token ID32
Feature activation-6.85e-01

Pos logit contributions
elsen+0.887
sole+0.876
 Nanto+0.835
ross+0.784
anto+0.783
Neg logit contributions
geries-0.967
GA-0.851
gery-0.847
gotten-0.842
ODY-0.806
Token retired
Token ID9880
Feature activation-1.79e-01

Pos logit contributions
geries+2.286
GA+2.105
gotten+2.029
gery+2.028
ODY+1.959
Neg logit contributions
elsen-2.159
sole-2.107
 Nanto-2.076
anto-1.904
urus-1.891
Token in
Token ID287
Feature activation+3.39e-01

Pos logit contributions
elsen+1.047
sole+1.035
 Nanto+1.006
anto+0.918
urus+0.907
Neg logit contributions
geries-1.305
GA-1.193
gotten-1.161
gery-1.158
ODY-1.124
Token the
Token ID262
Feature activation+2.57e-01

Pos logit contributions
elsen+1.031
sole+1.014
 Nanto+0.989
anto+0.904
urus+0.898
Neg logit contributions
geries-1.175
GA-1.074
gery-1.040
gotten-1.040
ODY-1.006
Token event
Token ID1785
Feature activation+3.83e-01

Pos logit contributions
elsen+0.763
sole+0.749
 Nanto+0.731
anto+0.670
urus+0.663
Neg logit contributions
geries-0.914
GA-0.837
gotten-0.813
gery-0.811
ODY-0.786
Token of
Token ID286
Feature activation+3.20e-01

Pos logit contributions
elsen+1.114
sole+1.085
 Nanto+1.067
anto+0.974
urus+0.971
Neg logit contributions
geries-1.382
GA-1.268
gery-1.229
gotten-1.228
ODY-1.189
Token a
Token ID257
Feature activation+6.21e-03

Pos logit contributions
elsen+1.061
sole+1.035
 Nanto+1.021
anto+0.939
urus+0.936
Neg logit contributions
geries-1.019
GA-0.930
gotten-0.895
gery-0.891
ODY-0.871
Token default
Token ID4277
Feature activation-2.30e+00

Pos logit contributions
elsen+0.019
sole+0.019
 Nanto+0.019
anto+0.017
urus+0.017
Neg logit contributions
geries-0.021
GA-0.019
gotten-0.019
gery-0.019
ODY-0.018
Token but
Token ID475
Feature activation+3.90e-01

Pos logit contributions
geries+6.346
GA+5.943
gery+5.546
gotten+5.529
ODY+5.430
Neg logit contributions
elsen-7.740
 Nanto-7.542
sole-7.439
urus-7.021
urst-6.969
Token not
Token ID407
Feature activation+5.26e-01

Pos logit contributions
elsen+1.221
sole+1.198
 Nanto+1.174
anto+1.080
urus+1.069
Neg logit contributions
geries-1.328
GA-1.202
gery-1.172
gotten-1.167
ODY-1.127
Token over
Token ID625
Feature activation-7.28e-01

Pos logit contributions
elsen+1.620
sole+1.590
 Nanto+1.565
anto+1.434
urus+1.422
Neg logit contributions
geries-1.798
GA-1.631
gotten-1.589
gery-1.577
ODY-1.525
Token late
Token ID2739
Feature activation-1.15e+00

Pos logit contributions
geries+2.306
GA+2.126
gotten+2.037
gery+2.034
ODY+1.956
Neg logit contributions
elsen-2.411
sole-2.350
 Nanto-2.337
anto-2.138
urus-2.122
Token or
Token ID393
Feature activation-4.89e-01

Pos logit contributions
geries+3.854
GA+3.562
gery+3.451
gotten+3.391
ODY+3.295
Neg logit contributions
elsen-3.587
sole-3.502
 Nanto-3.452
urus-3.138
anto-3.112
Token a
Token ID257
Feature activation+1.11e-01

Pos logit contributions
geries+0.048
GA+0.044
gotten+0.042
gery+0.042
ODY+0.041
Neg logit contributions
elsen-0.049
sole-0.048
 Nanto-0.047
anto-0.044
urus-0.043
Token problem
Token ID1917
Feature activation+3.87e-01

Pos logit contributions
elsen+0.362
sole+0.355
 Nanto+0.347
anto+0.320
urus+0.318
Neg logit contributions
geries-0.362
GA-0.330
gotten-0.318
gery-0.318
ODY-0.308
Token,
Token ID11
Feature activation+2.46e-01

Pos logit contributions
elsen+1.159
sole+1.142
 Nanto+1.116
anto+1.029
urus+1.014
Neg logit contributions
geries-1.353
GA-1.238
gotten-1.200
gery-1.197
ODY-1.155
Token I
Token ID314
Feature activation-5.13e-02

Pos logit contributions
elsen+0.702
sole+0.686
 Nanto+0.671
anto+0.617
urus+0.607
Neg logit contributions
geries-0.903
GA-0.828
gery-0.806
gotten-0.805
ODY-0.777
Token know
Token ID760
Feature activation+3.69e-01

Pos logit contributions
geries+0.179
GA+0.162
gery+0.158
gotten+0.158
ODY+0.152
Neg logit contributions
elsen-0.157
sole-0.154
 Nanto-0.150
anto-0.139
urus-0.137
Token Congress
Token ID3162
Feature activation-2.30e+00

Pos logit contributions
elsen+1.096
sole+1.070
 Nanto+1.052
anto+0.970
urus+0.954
Neg logit contributions
geries-1.310
GA-1.190
gotten-1.164
gery-1.163
YP-1.118
Token will
Token ID481
Feature activation+2.09e-02

Pos logit contributions
geries+5.932
GA+5.617
gery+5.229
gotten+5.151
ODY+5.090
Neg logit contributions
elsen-8.398
sole-8.140
 Nanto-8.118
urus-7.441
urst-7.413
Token work
Token ID670
Feature activation-7.08e-01

Pos logit contributions
elsen+0.062
sole+0.061
 Nanto+0.060
anto+0.055
urus+0.054
Neg logit contributions
geries-0.074
GA-0.068
gery-0.066
gotten-0.066
ODY-0.064
Token with
Token ID351
Feature activation-7.12e-01

Pos logit contributions
geries+2.544
GA+2.369
gotten+2.283
gery+2.280
ODY+2.222
Neg logit contributions
elsen-2.051
sole-1.985
 Nanto-1.961
anto-1.781
urus-1.779
Token the
Token ID262
Feature activation-1.19e-01

Pos logit contributions
geries+2.042
GA+1.860
gotten+1.770
gery+1.767
ODY+1.705
Neg logit contributions
elsen-2.586
sole-2.536
 Nanto-2.500
anto-2.315
urus-2.309
Token industry
Token ID2831
Feature activation+4.87e-01

Pos logit contributions
geries+0.409
GA+0.373
gery+0.362
gotten+0.362
ODY+0.349
Neg logit contributions
elsen-0.370
sole-0.362
 Nanto-0.355
anto-0.327
urus-0.324
Token to
Token ID284
Feature activation+3.74e-01

Pos logit contributions
geries+0.525
GA+0.486
gery+0.473
gotten+0.472
ODY+0.459
Neg logit contributions
elsen-0.334
sole-0.326
 Nanto-0.318
anto-0.286
urus-0.283
Token bring
Token ID2222
Feature activation+8.23e-01

Pos logit contributions
elsen+1.059
sole+1.038
 Nanto+1.022
anto+0.940
urus+0.918
Neg logit contributions
geries-1.380
GA-1.254
gery-1.225
gotten-1.218
ODY-1.193
Token the
Token ID262
Feature activation+2.45e-01

Pos logit contributions
elsen+2.490
sole+2.419
 Nanto+2.367
anto+2.215
ross+2.200
Neg logit contributions
geries-2.823
GA-2.544
gotten-2.502
gery-2.499
��-2.417
Token problems
Token ID2761
Feature activation-2.53e-01

Pos logit contributions
elsen+0.771
sole+0.756
 Nanto+0.739
anto+0.682
urus+0.678
Neg logit contributions
geries-0.821
GA-0.751
gery-0.726
gotten-0.725
ODY-0.699
Token to
Token ID284
Feature activation+9.97e-02

Pos logit contributions
geries+0.941
GA+0.879
gotten+0.850
gery+0.846
ODY+0.825
Neg logit contributions
elsen-0.701
sole-0.686
 Nanto-0.670
urus-0.607
anto-0.605
Token Congress
Token ID3162
Feature activation-2.29e+00

Pos logit contributions
elsen+0.333
sole+0.328
 Nanto+0.323
anto+0.300
urus+0.297
Neg logit contributions
geries-0.316
GA-0.282
gotten-0.276
gery-0.275
YP-0.265
Token so
Token ID523
Feature activation+3.37e-01

Pos logit contributions
geries+6.549
GA+6.185
ODY+5.837
gotten+5.805
gery+5.791
Neg logit contributions
elsen-7.762
 Nanto-7.560
sole-7.426
urus-6.806
anto-6.764
Token we
Token ID356
Feature activation+1.10e-01

Pos logit contributions
elsen+0.983
sole+0.964
 Nanto+0.940
anto+0.872
urus+0.858
Neg logit contributions
geries-1.205
GA-1.100
gery-1.074
gotten-1.073
YP-1.037
Token can
Token ID460
Feature activation+7.87e-01

Pos logit contributions
elsen+0.358
sole+0.353
 Nanto+0.347
anto+0.322
urus+0.318
Neg logit contributions
geries-0.360
GA-0.324
gery-0.317
gotten-0.315
ODY-0.305
Token do
Token ID466
Feature activation-1.07e+00

Pos logit contributions
elsen+2.254
sole+2.240
 Nanto+2.210
anto+2.041
urus+1.995
Neg logit contributions
geries-2.818
GA-2.536
gotten-2.506
gery-2.495
ODY-2.437
Token that
Token ID326
Feature activation+3.35e-02

Pos logit contributions
geries+3.128
GA+2.851
gotten+2.717
gery+2.714
ODY+2.637
Neg logit contributions
elsen-3.797
sole-3.712
 Nanto-3.666
urus-3.379
anto-3.368
Tokenpas
Token ID44429
Feature activation+1.08e-01

Pos logit contributions
elsen+2.493
sole+2.463
 Nanto+2.357
anto+2.223
urus+2.201
Neg logit contributions
geries-2.407
GA-2.175
gotten-2.108
gery-2.100
ODY-2.048
Tokenaurus
Token ID22302
Feature activation-6.55e-01

Pos logit contributions
elsen+0.354
sole+0.348
 Nanto+0.340
anto+0.315
urus+0.312
Neg logit contributions
geries-0.349
GA-0.318
gery-0.306
gotten-0.306
ODY-0.296
Tokenrex
Token ID21510
Feature activation-6.52e-01

Pos logit contributions
geries+1.993
GA+1.826
gery+1.750
gotten+1.739
ODY+1.686
Neg logit contributions
elsen-2.252
 Nanto-2.205
sole-2.194
anto-1.995
urus-1.994
Token in
Token ID287
Feature activation+2.79e-01

Pos logit contributions
geries+1.964
GA+1.790
gery+1.723
gotten+1.716
ODY+1.641
Neg logit contributions
elsen-2.258
 Nanto-2.219
sole-2.208
anto-2.019
urus-2.008
Token our
Token ID674
Feature activation+1.84e-01

Pos logit contributions
elsen+0.880
sole+0.863
 Nanto+0.846
anto+0.778
urus+0.773
Neg logit contributions
geries-0.939
GA-0.861
gotten-0.830
gery-0.829
ODY-0.802
Token debugger
Token ID49518
Feature activation-2.28e+00

Pos logit contributions
elsen+0.595
sole+0.585
 Nanto+0.573
anto+0.529
urus+0.526
Neg logit contributions
geries-0.600
GA-0.548
gotten-0.528
gery-0.527
ODY-0.510
Token:
Token ID25
Feature activation+4.95e-01

Pos logit contributions
geries+6.413
GA+5.968
gery+5.707
gotten+5.597
ODY+5.426
Neg logit contributions
elsen-7.776
 Nanto-7.693
sole-7.568
anto-7.002
urus-6.991
Token
Token ID198
Feature activation+1.25e-01

Pos logit contributions
elsen+1.503
sole+1.484
 Nanto+1.435
anto+1.331
urus+1.327
Neg logit contributions
geries-1.712
GA-1.564
gotten-1.510
gery-1.509
ODY-1.458
Token
Token ID198
Feature activation+1.88e-01

Pos logit contributions
elsen+0.358
sole+0.354
 Nanto+0.335
anto+0.315
urus+0.313
Neg logit contributions
geries-0.461
GA-0.416
gery-0.407
gotten-0.406
ODY-0.395
Token1
Token ID16
Feature activation-2.64e-01

Pos logit contributions
elsen+0.628
sole+0.623
 Nanto+0.600
anto+0.563
urus+0.560
Neg logit contributions
geries-0.595
GA-0.537
gery-0.518
gotten-0.518
ODY-0.501
Token $
Token ID720
Feature activation-7.93e-01

Pos logit contributions
geries+0.946
GA+0.875
gery+0.846
gotten+0.842
ODY+0.817
Neg logit contributions
elsen-0.778
sole-0.758
 Nanto-0.745
anto-0.676
urus-0.672
Token an
Token ID281
Feature activation-2.12e-01

Pos logit contributions
geries+3.971
GA+3.606
gotten+3.529
gery+3.457
ODY+3.333
Neg logit contributions
elsen-4.502
sole-4.411
 Nanto-4.317
urus-4.013
anto-4.008
Token interest
Token ID1393
Feature activation-1.78e-02

Pos logit contributions
geries+0.697
GA+0.637
gotten+0.614
gery+0.613
ODY+0.592
Neg logit contributions
elsen-0.687
sole-0.672
 Nanto-0.660
anto-0.609
urus-0.603
Token (
Token ID357
Feature activation+1.24e+00

Pos logit contributions
geries+0.062
GA+0.058
gotten+0.056
gery+0.055
ODY+0.054
Neg logit contributions
elsen-0.053
sole-0.052
 Nanto-0.051
anto-0.047
urus-0.046
Tokenin
Token ID259
Feature activation+3.54e-01

Pos logit contributions
elsen+3.591
sole+3.583
 Nanto+3.411
anto+3.177
urus+3.155
Neg logit contributions
geries-4.359
GA-3.940
gery-3.868
gotten-3.840
ODY-3.744
Token the
Token ID262
Feature activation+5.88e-01

Pos logit contributions
elsen+1.206
sole+1.186
 Nanto+1.160
anto+1.077
urus+1.074
Neg logit contributions
geries-1.093
GA-0.999
gotten-0.955
gery-0.954
ODY-0.921
Token overweight
Token ID22652
Feature activation+1.62e+00

Pos logit contributions
elsen+1.914
sole+1.877
 Nanto+1.844
anto+1.704
urus+1.693
Neg logit contributions
geries-1.894
GA-1.737
gotten-1.670
gery-1.657
ODY-1.605
Token/
Token ID14
Feature activation+7.64e-03

Pos logit contributions
elsen+5.001
sole+4.838
 Nanto+4.804
anto+4.467
urst+4.414
Neg logit contributions
geries-5.320
GA-4.822
gotten-4.722
gery-4.671
ODY-4.480
Tokenob
Token ID672
Feature activation+5.71e-03

Pos logit contributions
elsen+0.024
sole+0.023
 Nanto+0.023
anto+0.021
urus+0.021
Neg logit contributions
geries-0.026
GA-0.024
gery-0.023
gotten-0.023
ODY-0.022
Tokenese
Token ID2771
Feature activation+4.24e-01

Pos logit contributions
elsen+0.013
sole+0.013
 Nanto+0.013
anto+0.011
urus+0.011
Neg logit contributions
geries-0.024
GA-0.022
gery-0.022
gotten-0.022
ODY-0.021
Token girls
Token ID4813
Feature activation+6.81e-02

Pos logit contributions
elsen+1.309
sole+1.281
 Nanto+1.260
anto+1.158
urus+1.146
Neg logit contributions
geries-1.447
GA-1.327
gotten-1.282
gery-1.280
ODY-1.237
Token)
Token ID8
Feature activation+9.57e-01

Pos logit contributions
elsen+0.196
sole+0.191
 Nanto+0.187
anto+0.170
urus+0.169
Neg logit contributions
geries-0.248
GA-0.229
gery-0.222
gotten-0.222
ODY-0.215
Token bis
Token ID47457
Feature activation+7.05e-01

Pos logit contributions
geries+0.233
GA+0.215
gery+0.207
gotten+0.206
ODY+0.199
Neg logit contributions
elsen-0.207
sole-0.203
 Nanto-0.198
anto-0.182
urus-0.181
Tokenect
Token ID478
Feature activation+1.44e-01

Pos logit contributions
elsen+1.322
sole+1.279
 Nanto+1.235
anto+1.063
ross+1.052
Neg logit contributions
geries-3.268
GA-3.072
gotten-2.994
gery-2.992
ODY-2.932
Tokening
Token ID278
Feature activation+1.51e-01

Pos logit contributions
elsen+0.433
sole+0.423
 Nanto+0.415
anto+0.379
urus+0.376
Neg logit contributions
geries-0.503
GA-0.462
gotten-0.447
gery-0.447
ODY-0.434
Token the
Token ID262
Feature activation+3.70e-01

Pos logit contributions
elsen+0.481
sole+0.471
 Nanto+0.463
anto+0.426
urus+0.422
Neg logit contributions
geries-0.500
GA-0.458
gotten-0.442
gery-0.442
ODY-0.429
Token region
Token ID3814
Feature activation+2.80e-01

Pos logit contributions
elsen+1.029
sole+1.006
 Nanto+0.985
anto+0.895
urus+0.880
Neg logit contributions
geries-1.375
GA-1.269
gery-1.232
gotten-1.231
ODY-1.203
Token vertically
Token ID31677
Feature activation+1.45e+00

Pos logit contributions
elsen+0.822
sole+0.807
 Nanto+0.790
anto+0.721
urus+0.713
Neg logit contributions
geries-1.000
GA-0.920
gotten-0.891
gery-0.889
ODY-0.864
Token.
Token ID13
Feature activation+2.92e-02

Pos logit contributions
sole+4.360
elsen+4.357
 Nanto+4.213
anto+3.913
ross+3.892
Neg logit contributions
geries-4.891
GA-4.385
gotten-4.328
gery-4.273
ODY-4.175
Token F
Token ID376
Feature activation+6.19e-01

Pos logit contributions
elsen+0.091
sole+0.089
 Nanto+0.087
anto+0.080
urus+0.079
Neg logit contributions
geries-0.100
GA-0.092
gery-0.088
gotten-0.088
ODY-0.085
Tokenilling
Token ID4509
Feature activation+5.55e-01

Pos logit contributions
elsen+1.901
sole+1.830
 Nanto+1.782
anto+1.661
urus+1.652
Neg logit contributions
geries-2.147
GA-1.957
gotten-1.915
gery-1.912
ODY-1.844
Token the
Token ID262
Feature activation+1.55e-01

Pos logit contributions
elsen+1.744
sole+1.707
 Nanto+1.672
anto+1.541
urus+1.527
Neg logit contributions
geries-1.860
GA-1.701
gery-1.648
gotten-1.647
ODY-1.597
Token land
Token ID1956
Feature activation+3.84e-01

Pos logit contributions
elsen+0.513
sole+0.502
 Nanto+0.494
anto+0.456
urus+0.453
Neg logit contributions
geries-0.497
GA-0.454
gery-0.436
gotten-0.436
ODY-0.421
Token period
Token ID2278
Feature activation+8.03e-01

Pos logit contributions
elsen+1.454
sole+1.423
 Nanto+1.403
anto+1.291
urus+1.268
Neg logit contributions
geries-1.675
GA-1.535
gotten-1.491
gery-1.481
ODY-1.442
Token,
Token ID11
Feature activation+3.31e-01

Pos logit contributions
elsen+2.531
sole+2.481
 Nanto+2.450
anto+2.241
urst+2.219
Neg logit contributions
geries-2.680
GA-2.436
gotten-2.368
gery-2.360
ODY-2.283
Token one
Token ID530
Feature activation+1.65e-01

Pos logit contributions
elsen+1.040
sole+1.015
 Nanto+0.995
anto+0.916
urus+0.908
Neg logit contributions
geries-1.117
GA-1.025
gery-0.990
gotten-0.989
ODY-0.954
Token man
Token ID582
Feature activation+6.66e-01

Pos logit contributions
elsen+0.475
sole+0.464
 Nanto+0.454
anto+0.414
urus+0.410
Neg logit contributions
geries-0.602
GA-0.554
gotten-0.537
gery-0.536
ODY-0.520
Token was
Token ID373
Feature activation+6.23e-01

Pos logit contributions
elsen+2.097
sole+2.063
 Nanto+2.017
anto+1.860
urus+1.834
Neg logit contributions
geries-2.235
GA-2.022
gotten-1.957
gery-1.947
ODY-1.896
Token beaten
Token ID13125
Feature activation+1.57e+00

Pos logit contributions
elsen+1.917
sole+1.896
 Nanto+1.832
anto+1.683
urus+1.670
Neg logit contributions
geries-2.136
GA-1.941
gotten-1.878
gery-1.868
ODY-1.812
Token and
Token ID290
Feature activation-1.16e-01

Pos logit contributions
sole+4.065
elsen+4.006
 Nanto+3.838
urst+3.426
ross+3.395
Neg logit contributions
geries-6.050
GA-5.467
gotten-5.426
gery-5.293
YP-5.235
Token robbed
Token ID26773
Feature activation+6.73e-01

Pos logit contributions
geries+0.377
GA+0.344
gotten+0.331
gery+0.331
ODY+0.320
Neg logit contributions
elsen-0.381
sole-0.373
 Nanto-0.367
anto-0.338
urus-0.336
Token,
Token ID11
Feature activation-1.97e-02

Pos logit contributions
elsen+1.925
sole+1.904
 Nanto+1.844
anto+1.668
urst+1.655
Neg logit contributions
geries-2.447
GA-2.238
gotten-2.185
gery-2.156
YP-2.109
Token three
Token ID1115
Feature activation-1.55e-01

Pos logit contributions
geries+0.068
GA+0.063
gery+0.061
gotten+0.061
ODY+0.059
Neg logit contributions
elsen-0.060
sole-0.059
 Nanto-0.058
anto-0.053
urus-0.053
Token fights
Token ID11418
Feature activation-4.57e-01

Pos logit contributions
geries+0.497
GA+0.454
gery+0.436
gotten+0.436
ODY+0.421
Neg logit contributions
elsen-0.512
sole-0.502
 Nanto-0.493
anto-0.455
urus-0.452
Token the
Token ID262
Feature activation-1.63e-01

Pos logit contributions
elsen+0.240
sole+0.234
 Nanto+0.230
anto+0.211
urus+0.208
Neg logit contributions
geries-0.303
GA-0.279
gotten-0.271
gery-0.270
ODY-0.262
Token R
Token ID371
Feature activation-8.84e-01

Pos logit contributions
geries+0.535
GA+0.489
gery+0.471
gotten+0.471
ODY+0.455
Neg logit contributions
elsen-0.524
sole-0.513
 Nanto-0.505
anto-0.465
urus-0.461
TokenFI
Token ID11674
Feature activation-5.72e-01

Pos logit contributions
geries+2.251
GA+2.090
gotten+1.933
gery+1.918
ODY+1.859
Neg logit contributions
elsen-3.454
sole-3.412
 Nanto-3.370
urus-3.118
anto-3.115
Token source
Token ID2723
Feature activation+1.80e-01

Pos logit contributions
geries+1.799
GA+1.649
gery+1.577
gotten+1.576
ODY+1.522
Neg logit contributions
elsen-1.920
sole-1.883
 Nanto-1.854
anto-1.708
urus-1.700
Token (
Token ID357
Feature activation+9.77e-01

Pos logit contributions
elsen+0.579
sole+0.566
 Nanto+0.554
anto+0.515
urus+0.510
Neg logit contributions
geries-0.592
GA-0.538
gotten-0.520
gery-0.519
ODY-0.503
Tokencom
Token ID785
Feature activation+1.36e+00

Pos logit contributions
sole+2.792
elsen+2.787
 Nanto+2.623
anto+2.469
urus+2.439
Neg logit contributions
geries-3.527
GA-3.172
gery-3.128
gotten-3.100
ODY-3.025
Tokenputers
Token ID41510
Feature activation-2.84e-01

Pos logit contributions
elsen+4.665
sole+4.562
 Nanto+4.394
ross+4.190
anto+4.189
Neg logit contributions
geries-4.128
GA-3.699
gotten-3.535
gery-3.533
ODY-3.525
Token and
Token ID290
Feature activation-6.22e-01

Pos logit contributions
geries+0.978
GA+0.904
gotten+0.869
gery+0.867
ODY+0.840
Neg logit contributions
elsen-0.868
sole-0.851
 Nanto-0.837
anto-0.763
urus-0.758
Token monitors
Token ID19374
Feature activation+1.85e-01

Pos logit contributions
geries+1.893
GA+1.723
gery+1.651
gotten+1.650
ODY+1.589
Neg logit contributions
elsen-2.156
sole-2.121
 Nanto-2.087
anto-1.926
urus-1.916
Token in
Token ID287
Feature activation+7.80e-02

Pos logit contributions
elsen+0.523
sole+0.511
 Nanto+0.499
anto+0.456
urus+0.452
Neg logit contributions
geries-0.683
GA-0.628
gotten-0.610
gery-0.609
ODY-0.591
Token our
Token ID674
Feature activation+3.39e-01

Pos logit contributions
elsen+0.250
sole+0.245
 Nanto+0.241
anto+0.221
urus+0.220
Neg logit contributions
geries-0.258
GA-0.237
gotten-0.228
gery-0.228
ODY-0.220
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
elsen+1.596
sole+1.572
 Nanto+1.531
anto+1.414
urus+1.391
Neg logit contributions
geries-1.809
GA-1.647
gotten-1.603
gery-1.596
ODY-1.544
Token I
Token ID314
Feature activation-4.13e-01

Pos logit contributions
geries+0.711
GA+0.651
gery+0.624
gotten+0.623
ODY+0.602
Neg logit contributions
elsen-0.766
sole-0.751
 Nanto-0.738
anto-0.682
urus-0.678
Token also
Token ID635
Feature activation+4.74e-01

Pos logit contributions
geries+1.297
GA+1.192
gotten+1.143
gery+1.140
ODY+1.098
Neg logit contributions
elsen-1.387
sole-1.355
 Nanto-1.337
anto-1.232
urus-1.225
Token did
Token ID750
Feature activation-9.62e-01

Pos logit contributions
elsen+1.552
sole+1.531
 Nanto+1.499
anto+1.389
urus+1.373
Neg logit contributions
geries-1.542
GA-1.384
gery-1.345
gotten-1.333
ODY-1.300
Token two
Token ID734
Feature activation+5.41e-01

Pos logit contributions
geries+2.954
GA+2.720
gotten+2.589
gery+2.585
ODY+2.510
Neg logit contributions
elsen-3.250
sole-3.183
 Nanto-3.129
urus-2.887
anto-2.865
Token coats
Token ID30720
Feature activation+1.34e+00

Pos logit contributions
elsen+1.657
sole+1.621
 Nanto+1.594
anto+1.460
urus+1.443
Neg logit contributions
geries-1.850
GA-1.696
gotten-1.649
gery-1.643
ODY-1.594
Token and
Token ID290
Feature activation+2.40e-01

Pos logit contributions
sole+3.544
elsen+3.536
 Nanto+3.428
anto+3.129
ross+3.047
Neg logit contributions
geries-5.076
GA-4.590
gotten-4.546
gery-4.502
��-4.371
Token it
Token ID340
Feature activation+6.61e-02

Pos logit contributions
elsen+0.740
sole+0.725
 Nanto+0.714
anto+0.657
urus+0.649
Neg logit contributions
geries-0.820
GA-0.750
gery-0.725
gotten-0.724
ODY-0.702
Token had
Token ID550
Feature activation-2.36e-01

Pos logit contributions
elsen+0.218
sole+0.214
 Nanto+0.210
anto+0.194
urus+0.193
Neg logit contributions
geries-0.213
GA-0.194
gery-0.187
gotten-0.186
ODY-0.180
Token to
Token ID284
Feature activation+2.61e-01

Pos logit contributions
geries+0.765
GA+0.698
gery+0.673
gotten+0.671
ODY+0.649
Neg logit contributions
elsen-0.771
sole-0.757
 Nanto-0.743
anto-0.686
urus-0.681
Token dry
Token ID5894
Feature activation+1.26e+00

Pos logit contributions
elsen+0.753
sole+0.735
 Nanto+0.718
anto+0.657
urus+0.651
Neg logit contributions
geries-0.948
GA-0.873
gotten-0.845
gery-0.844
ODY-0.821
Token misses
Token ID18297
Feature activation-4.73e-01

Pos logit contributions
geries+1.725
GA+1.587
gery+1.522
gotten+1.518
ODY+1.465
Neg logit contributions
elsen-1.689
sole-1.640
 Nanto-1.634
anto-1.495
urus-1.480
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
geries+1.556
GA+1.428
gery+1.377
gotten+1.376
ODY+1.324
Neg logit contributions
elsen-1.519
sole-1.478
 Nanto-1.467
anto-1.340
urus-1.337
Token
Token ID198
Feature activation+1.21e-01

Pos logit contributions
elsen+0.391
sole+0.384
 Nanto+0.378
anto+0.347
urus+0.344
Neg logit contributions
geries-0.412
GA-0.375
gotten-0.363
gery-0.363
ODY-0.350
Token
Token ID198
Feature activation+1.92e-01

Pos logit contributions
elsen+0.342
sole+0.338
 Nanto+0.326
anto+0.299
urus+0.297
Neg logit contributions
geries-0.448
GA-0.404
gotten-0.396
gery-0.394
ODY-0.383
TokenTaking
Token ID26556
Feature activation+1.16e-01

Pos logit contributions
elsen+0.618
sole+0.610
 Nanto+0.590
anto+0.549
urus+0.545
Neg logit contributions
geries-0.634
GA-0.569
gery-0.556
gotten-0.554
ODY-0.533
Token your
Token ID534
Feature activation+2.18e-01

Pos logit contributions
elsen+0.375
sole+0.368
 Nanto+0.361
anto+0.333
urus+0.331
Neg logit contributions
geries-0.382
GA-0.349
gotten-0.337
gery-0.336
ODY-0.324
Token stance
Token ID12046
Feature activation-4.32e-01

Pos logit contributions
elsen+0.695
sole+0.682
 Nanto+0.669
anto+0.615
urus+0.610
Neg logit contributions
geries-0.725
GA-0.664
gotten-0.639
gery-0.639
ODY-0.618
Token at
Token ID379
Feature activation+1.86e-01

Pos logit contributions
geries+1.472
GA+1.360
gery+1.307
gotten+1.304
ODY+1.266
Neg logit contributions
elsen-1.336
sole-1.301
 Nanto-1.284
anto-1.174
urus-1.170
Token your
Token ID534
Feature activation+1.54e-03

Pos logit contributions
elsen+0.612
sole+0.602
 Nanto+0.591
anto+0.545
urus+0.541
Neg logit contributions
geries-0.601
GA-0.548
gotten-0.527
gery-0.527
ODY-0.509
Token starting
Token ID3599
Feature activation-2.85e-01

Pos logit contributions
elsen+0.005
sole+0.005
 Nanto+0.005
anto+0.004
urus+0.004
Neg logit contributions
geries-0.005
GA-0.005
gotten-0.005
gery-0.005
ODY-0.004
Token distance
Token ID5253
Feature activation+8.71e-01

Pos logit contributions
geries+1.047
GA+0.970
gery+0.937
gotten+0.936
ODY+0.909
Neg logit contributions
elsen-0.811
sole-0.791
 Nanto-0.776
anto-0.706
urus-0.700
Tokenoses
Token ID4629
Feature activation+2.62e-01

Pos logit contributions
elsen+0.303
sole+0.296
 Nanto+0.291
anto+0.264
urus+0.263
Neg logit contributions
geries-0.376
GA-0.347
gery-0.335
gotten-0.335
ODY-0.325
Token over
Token ID625
Feature activation-5.70e-01

Pos logit contributions
elsen+0.828
sole+0.813
 Nanto+0.797
anto+0.734
urus+0.728
Neg logit contributions
geries-0.875
GA-0.801
gotten-0.774
gery-0.772
ODY-0.747
Token the
Token ID262
Feature activation-3.91e-01

Pos logit contributions
geries+1.822
GA+1.669
gotten+1.601
gery+1.597
ODY+1.540
Neg logit contributions
elsen-1.885
sole-1.829
 Nanto-1.812
urus-1.664
anto-1.659
Token entire
Token ID2104
Feature activation-7.62e-01

Pos logit contributions
geries+1.370
GA+1.266
gery+1.220
gotten+1.214
ODY+1.177
Neg logit contributions
elsen-1.174
sole-1.137
 Nanto-1.121
urus-1.023
anto-1.019
Token palm
Token ID18057
Feature activation-1.61e+00

Pos logit contributions
geries+2.472
GA+2.289
gery+2.199
gotten+2.168
ODY+2.109
Neg logit contributions
elsen-2.461
sole-2.372
 Nanto-2.347
urus-2.161
urst-2.140
Tokenopl
Token ID20106
Feature activation+4.99e-02

Pos logit contributions
geries+5.164
GA+4.819
gery+4.594
ODY+4.504
gotten+4.497
Neg logit contributions
elsen-5.049
 Nanto-4.828
sole-4.644
urus-4.314
anto-4.240
Tokenant
Token ID415
Feature activation-4.00e-01

Pos logit contributions
elsen+0.178
sole+0.174
 Nanto+0.172
anto+0.159
urus+0.158
Neg logit contributions
geries-0.147
GA-0.134
gery-0.128
gotten-0.128
ODY-0.123
Tokenar
Token ID283
Feature activation-4.53e-01

Pos logit contributions
geries+1.208
GA+1.103
gery+1.053
gotten+1.046
ODY+1.013
Neg logit contributions
elsen-1.391
 Nanto-1.358
sole-1.357
anto-1.235
urus-1.232
Token surfaces
Token ID16649
Feature activation+2.47e-03

Pos logit contributions
geries+1.505
GA+1.383
gery+1.330
gotten+1.321
ODY+1.281
Neg logit contributions
elsen-1.449
sole-1.406
 Nanto-1.391
anto-1.271
urus-1.271
Token,
Token ID11
Feature activation-2.27e-01

Pos logit contributions
elsen+0.008
sole+0.008
 Nanto+0.008
anto+0.007
urus+0.007
Neg logit contributions
geries-0.008
GA-0.007
gery-0.007
gotten-0.007
ODY-0.007
Token beginning
Token ID3726
Feature activation-1.45e-01

Pos logit contributions
geries+0.783
GA+0.721
gery+0.694
gotten+0.692
ODY+0.671
Neg logit contributions
elsen-0.696
sole-0.679
 Nanto-0.666
anto-0.610
urus-0.606
Token of
Token ID286
Feature activation+9.19e-01

Pos logit contributions
elsen+1.677
sole+1.633
 Nanto+1.595
anto+1.432
urus+1.419
Neg logit contributions
geries-2.671
GA-2.472
gotten-2.405
gery-2.404
ODY-2.337
Token America
Token ID2253
Feature activation+7.00e-01

Pos logit contributions
elsen+2.504
sole+2.455
 Nanto+2.366
anto+2.215
urus+2.171
Neg logit contributions
geries-3.437
GA-3.102
gotten-3.070
gery-3.008
YP-2.977
Token recently
Token ID2904
Feature activation+1.42e+00

Pos logit contributions
elsen+2.183
sole+2.135
 Nanto+2.103
anto+1.958
urus+1.909
Neg logit contributions
geries-2.382
GA-2.122
gotten-2.092
gery-2.072
ODY-1.997
Token agreed
Token ID4987
Feature activation+3.30e-01

Pos logit contributions
elsen+4.218
sole+4.173
 Nanto+4.132
anto+3.864
urus+3.720
Neg logit contributions
geries-4.988
GA-4.355
gery-4.293
gotten-4.233
ODY-4.164
Token to
Token ID284
Feature activation+5.43e-01

Pos logit contributions
elsen+0.791
sole+0.769
 Nanto+0.753
anto+0.672
urus+0.665
Neg logit contributions
geries-1.354
GA-1.266
gery-1.226
gotten-1.225
ODY-1.194
Token pay
Token ID1414
Feature activation+9.42e-01

Pos logit contributions
elsen+1.661
sole+1.624
 Nanto+1.593
anto+1.469
urus+1.450
Neg logit contributions
geries-1.884
GA-1.701
gotten-1.663
gery-1.652
ODY-1.610
Token a
Token ID257
Feature activation+6.14e-01

Pos logit contributions
elsen+3.057
sole+2.951
 Nanto+2.893
anto+2.707
urus+2.680
Neg logit contributions
geries-3.008
GA-2.737
gotten-2.644
gery-2.619
ODY-2.586
Token fine
Token ID3734
Feature activation+7.61e-02

Pos logit contributions
elsen+2.008
sole+1.956
 Nanto+1.913
anto+1.764
urus+1.756
Neg logit contributions
geries-1.989
GA-1.792
gotten-1.751
gery-1.727
ODY-1.694
Token of
Token ID286
Feature activation+6.21e-01

Pos logit contributions
elsen+0.202
sole+0.195
 Nanto+0.193
anto+0.174
urus+0.174
Neg logit contributions
geries-0.294
GA-0.275
gery-0.265
gotten-0.264
ODY-0.257
Token $
Token ID720
Feature activation-7.27e-01

Pos logit contributions
elsen+2.081
sole+2.033
 Nanto+1.989
anto+1.846
urus+1.828
Neg logit contributions
geries-1.948
GA-1.775
gotten-1.706
gery-1.698
ODY-1.668
Token16
Token ID1433
Feature activation+3.64e-01

Pos logit contributions
geries+2.042
GA+1.896
gotten+1.781
gery+1.781
ODY+1.723
Neg logit contributions
elsen-2.657
sole-2.596
 Nanto-2.586
anto-2.383
urus-2.373
Token its
Token ID663
Feature activation-2.90e-01

Pos logit contributions
geries+1.637
GA+1.500
gery+1.450
gotten+1.447
ODY+1.391
Neg logit contributions
elsen-1.514
sole-1.479
 Nanto-1.458
anto-1.331
urus-1.322
Token share
Token ID2648
Feature activation-1.30e-01

Pos logit contributions
geries+0.869
GA+0.789
gery+0.756
gotten+0.755
ODY+0.726
Neg logit contributions
elsen-1.021
sole-1.002
 Nanto-0.985
anto-0.915
urus-0.909
Token of
Token ID286
Feature activation-2.64e-01

Pos logit contributions
geries+0.504
GA+0.471
gery+0.455
gotten+0.454
YP+0.439
Neg logit contributions
elsen-0.340
sole-0.333
 Nanto-0.324
anto-0.292
urus-0.288
Token crises
Token ID25712
Feature activation-8.10e-02

Pos logit contributions
geries+0.857
GA+0.780
gery+0.754
gotten+0.753
ODY+0.720
Neg logit contributions
elsen-0.864
sole-0.846
 Nanto-0.831
anto-0.766
urus-0.761
Token in
Token ID287
Feature activation+2.33e-01

Pos logit contributions
geries+0.267
GA+0.245
gery+0.236
gotten+0.235
ODY+0.227
Neg logit contributions
elsen-0.261
sole-0.255
 Nanto-0.251
anto-0.230
urus-0.229
Token recent
Token ID2274
Feature activation+7.97e-01

Pos logit contributions
elsen+0.730
sole+0.717
 Nanto+0.703
anto+0.645
urus+0.641
Neg logit contributions
geries-0.785
GA-0.721
gery-0.693
gotten-0.693
ODY-0.673
Token decades
Token ID4647
Feature activation+4.57e-01

Pos logit contributions
elsen+3.107
sole+3.080
 Nanto+3.018
anto+2.822
urus+2.792
Neg logit contributions
geries-2.023
GA-1.819
gotten-1.724
gery-1.723
ODY-1.701
Token,
Token ID11
Feature activation+4.55e-02

Pos logit contributions
elsen+1.505
sole+1.479
 Nanto+1.449
anto+1.343
urus+1.333
Neg logit contributions
geries-1.463
GA-1.334
gery-1.283
gotten-1.281
ODY-1.242
Token it
Token ID340
Feature activation-9.47e-02

Pos logit contributions
elsen+0.150
sole+0.147
 Nanto+0.145
anto+0.134
urus+0.133
Neg logit contributions
geries-0.146
GA-0.133
gotten-0.128
gery-0.128
ODY-0.124
Token is
Token ID318
Feature activation+4.21e-01

Pos logit contributions
geries+0.295
GA+0.268
gotten+0.257
gery+0.257
ODY+0.248
Neg logit contributions
elsen-0.322
sole-0.317
 Nanto-0.311
anto-0.288
urus-0.286
Token worth
Token ID2861
Feature activation+9.47e-01

Pos logit contributions
elsen+1.342
sole+1.319
 Nanto+1.296
anto+1.189
urus+1.186
Neg logit contributions
geries-1.389
GA-1.269
gotten-1.224
gery-1.223
ODY-1.192
Token released
Token ID2716
Feature activation+5.01e-01

Pos logit contributions
elsen+0.522
sole+0.509
 Nanto+0.503
anto+0.461
urus+0.458
Neg logit contributions
geries-0.523
GA-0.479
gotten-0.464
gery-0.462
ODY-0.444
Token suspect
Token ID4099
Feature activation+2.60e-01

Pos logit contributions
elsen+1.632
sole+1.597
 Nanto+1.565
anto+1.443
urus+1.439
Neg logit contributions
geries-1.632
GA-1.491
gery-1.427
gotten-1.426
ODY-1.379
Token information
Token ID1321
Feature activation+8.50e-01

Pos logit contributions
elsen+0.834
sole+0.813
 Nanto+0.797
anto+0.739
urus+0.731
Neg logit contributions
geries-0.858
GA-0.783
gotten-0.754
gery-0.752
ODY-0.731
Token from
Token ID422
Feature activation+1.46e-01

Pos logit contributions
elsen+2.675
sole+2.639
 Nanto+2.560
anto+2.384
urst+2.344
Neg logit contributions
geries-2.828
GA-2.592
gotten-2.495
gery-2.480
ODY-2.418
Token any
Token ID597
Feature activation-2.96e-01

Pos logit contributions
elsen+0.500
sole+0.489
 Nanto+0.482
anto+0.445
urus+0.442
Neg logit contributions
geries-0.453
GA-0.413
gery-0.395
gotten-0.395
ODY-0.381
Token of
Token ID286
Feature activation-1.12e-03

Pos logit contributions
geries+1.094
GA+1.016
gery+0.982
gotten+0.982
ODY+0.953
Neg logit contributions
elsen-0.831
sole-0.816
 Nanto-0.794
anto-0.721
urus-0.718
Token the
Token ID262
Feature activation+4.07e-02

Pos logit contributions
geries+0.003
GA+0.003
gery+0.003
gotten+0.003
ODY+0.003
Neg logit contributions
elsen-0.004
sole-0.004
 Nanto-0.004
anto-0.004
urus-0.004
Token investigations
Token ID10986
Feature activation+7.32e-02

Pos logit contributions
elsen+0.122
sole+0.119
 Nanto+0.117
anto+0.107
urus+0.106
Neg logit contributions
geries-0.144
GA-0.132
gery-0.128
gotten-0.128
ODY-0.124
Token.
Token ID13
Feature activation-1.02e-01

Pos logit contributions
elsen+0.254
sole+0.249
 Nanto+0.245
anto+0.226
urus+0.225
Neg logit contributions
geries-0.223
GA-0.203
gery-0.195
gotten-0.195
ODY-0.187
Token<|endoftext|>
Token ID50256
Feature activation+3.88e-01

Pos logit contributions
geries+0.359
GA+0.334
gery+0.320
gotten+0.320
ODY+0.312
Neg logit contributions
elsen-0.305
sole-0.299
 Nanto-0.292
anto-0.268
urus-0.265
TokenIt
Token ID1026
Feature activation-1.82e-01

Pos logit contributions
elsen+1.290
sole+1.277
 Nanto+1.225
anto+1.151
urus+1.145
Neg logit contributions
geries-1.236
GA-1.097
gery-1.079
gotten-1.073
ODY-1.029
Tokenis
Token ID271
Feature activation+1.49e-01

Pos logit contributions
geries+1.152
GA+1.069
gery+1.029
gotten+1.022
ODY+0.991
Neg logit contributions
elsen-0.937
sole-0.924
 Nanto-0.918
anto-0.812
ross-0.809
Token Moreno
Token ID43269
Feature activation-5.77e-01

Pos logit contributions
elsen+0.473
sole+0.463
 Nanto+0.455
anto+0.419
urus+0.415
Neg logit contributions
geries-0.501
GA-0.458
gotten-0.442
gery-0.442
ODY-0.428
Token
Token ID784
Feature activation+3.27e-02

Pos logit contributions
geries+1.829
GA+1.688
gery+1.615
gotten+1.609
ODY+1.558
Neg logit contributions
elsen-1.905
sole-1.861
 Nanto-1.838
urus-1.694
anto-1.682
Token Instit
Token ID37931
Feature activation-2.45e-01

Pos logit contributions
elsen+0.103
sole+0.101
 Nanto+0.099
anto+0.091
urus+0.090
Neg logit contributions
geries-0.110
GA-0.101
gery-0.097
gotten-0.097
ODY-0.094
Tokenuto
Token ID9390
Feature activation-3.82e-01

Pos logit contributions
geries+0.703
GA+0.645
gotten+0.615
gery+0.612
ODY+0.588
Neg logit contributions
elsen-0.872
sole-0.863
 Nanto-0.854
anto-0.781
urus-0.779
Token de
Token ID390
Feature activation-1.27e-01

Pos logit contributions
geries+1.304
GA+1.205
gotten+1.155
gery+1.153
ODY+1.118
Neg logit contributions
elsen-1.179
sole-1.150
 Nanto-1.130
urus-1.033
anto-1.032
Token Pol
Token ID2165
Feature activation+2.00e-01

Pos logit contributions
geries+0.411
GA+0.377
gery+0.362
gotten+0.361
ODY+0.348
Neg logit contributions
elsen-0.417
sole-0.407
 Nanto-0.400
anto-0.368
urus-0.367
Tokení
Token ID8836
Feature activation-5.72e-01

Pos logit contributions
elsen+0.565
sole+0.552
 Nanto+0.541
anto+0.491
urus+0.488
Neg logit contributions
geries-0.737
GA-0.682
gery-0.659
gotten-0.659
ODY-0.639
Tokentic
Token ID13370
Feature activation+3.91e-01

Pos logit contributions
geries+1.883
GA+1.719
gery+1.666
gotten+1.655
YP+1.583
Neg logit contributions
elsen-1.846
 Nanto-1.794
sole-1.785
urst-1.621
urus-1.621
Tokenas
Token ID292
Feature activation-1.15e-01

Pos logit contributions
elsen+1.054
sole+1.034
 Nanto+1.006
anto+0.917
urus+0.908
Neg logit contributions
geries-1.492
GA-1.378
gery-1.337
gotten-1.337
ODY-1.297
Token y
Token ID331
Feature activation+8.70e-02

Pos logit contributions
geries+0.396
GA+0.365
gery+0.351
gotten+0.351
ODY+0.340
Neg logit contributions
elsen-0.354
sole-0.346
 Nanto-0.339
anto-0.310
urus-0.309
Token That
Token ID1320
Feature activation-9.19e-03

Pos logit contributions
geries+0.683
GA+0.623
gotten+0.602
gery+0.601
ODY+0.580
Neg logit contributions
elsen-0.672
sole-0.659
 Nanto-0.648
anto-0.596
urus-0.593
Token is
Token ID318
Feature activation-1.85e-02

Pos logit contributions
geries+0.030
GA+0.027
gotten+0.026
gery+0.026
ODY+0.025
Neg logit contributions
elsen-0.030
sole-0.030
 Nanto-0.029
anto-0.027
urus-0.027
Token what
Token ID644
Feature activation-1.87e-01

Pos logit contributions
geries+0.059
GA+0.054
gotten+0.052
gery+0.052
ODY+0.050
Neg logit contributions
elsen-0.061
sole-0.060
 Nanto-0.059
anto-0.055
urus-0.054
Token everyone
Token ID2506
Feature activation-7.51e-01

Pos logit contributions
geries+0.595
GA+0.541
gotten+0.521
gery+0.520
ODY+0.504
Neg logit contributions
elsen-0.622
sole-0.610
 Nanto-0.599
anto-0.555
urus-0.550
Token was
Token ID373
Feature activation-1.05e-01

Pos logit contributions
geries+2.352
GA+2.148
gery+2.078
gotten+2.057
ODY+1.994
Neg logit contributions
elsen-2.516
sole-2.481
 Nanto-2.433
anto-2.229
urus-2.227
Token doing
Token ID1804
Feature activation-1.10e+00

Pos logit contributions
geries+0.347
GA+0.317
gery+0.305
gotten+0.305
ODY+0.295
Neg logit contributions
elsen-0.340
sole-0.333
 Nanto-0.327
anto-0.301
urus-0.299
Token.
Token ID13
Feature activation-1.78e-01

Pos logit contributions
geries+3.500
GA+3.188
gery+3.115
gotten+3.065
ODY+2.981
Neg logit contributions
elsen-3.631
sole-3.532
 Nanto-3.527
anto-3.218
urus-3.197
Token Glenn
Token ID17551
Feature activation-2.67e-01

Pos logit contributions
geries+0.586
GA+0.533
gotten+0.516
gery+0.514
ODY+0.497
Neg logit contributions
elsen-0.571
sole-0.561
 Nanto-0.552
anto-0.507
urus-0.504
Token McGr
Token ID41114
Feature activation-1.71e-01

Pos logit contributions
geries+0.858
GA+0.784
gery+0.754
gotten+0.753
ODY+0.727
Neg logit contributions
elsen-0.883
sole-0.866
 Nanto-0.851
anto-0.785
urus-0.780
Tokenath
Token ID776
Feature activation+1.26e-02

Pos logit contributions
geries+0.589
GA+0.544
gery+0.523
gotten+0.522
ODY+0.509
Neg logit contributions
elsen-0.521
sole-0.509
 Nanto-0.504
anto-0.458
urus-0.451
Token had
Token ID550
Feature activation-5.20e-01

Pos logit contributions
elsen+0.041
sole+0.040
 Nanto+0.039
anto+0.037
urus+0.036
Neg logit contributions
geries-0.041
GA-0.037
gotten-0.036
gery-0.036
ODY-0.035
Token others
Token ID1854
Feature activation-4.34e-01

Pos logit contributions
geries+0.429
GA+0.392
gery+0.378
gotten+0.378
ODY+0.367
Neg logit contributions
elsen-0.407
sole-0.399
 Nanto-0.392
anto-0.360
urus-0.357
Token
Token ID784
Feature activation+1.11e+00

Pos logit contributions
geries+1.471
GA+1.350
gery+1.301
gotten+1.301
ODY+1.258
Neg logit contributions
elsen-1.356
sole-1.327
 Nanto-1.304
anto-1.194
urus-1.188
Token and
Token ID290
Feature activation+5.54e-02

Pos logit contributions
elsen+3.061
sole+3.053
 Nanto+2.909
anto+2.681
urst+2.614
Neg logit contributions
geries-4.089
GA-3.750
gery-3.625
gotten-3.618
ODY-3.556
Token thus
Token ID4145
Feature activation+7.31e-01

Pos logit contributions
elsen+0.172
sole+0.169
 Nanto+0.165
anto+0.152
urus+0.151
Neg logit contributions
geries-0.189
GA-0.173
gotten-0.167
gery-0.167
ODY-0.162
Token sought
Token ID7194
Feature activation-3.94e-01

Pos logit contributions
elsen+2.332
sole+2.315
 Nanto+2.266
anto+2.091
ross+2.076
Neg logit contributions
geries-2.369
GA-2.156
ODY-2.063
gotten-2.061
gery-2.059
Token to
Token ID284
Feature activation-1.22e-01

Pos logit contributions
geries+1.575
GA+1.466
gery+1.426
gotten+1.419
ODY+1.371
Neg logit contributions
elsen-0.996
sole-0.964
 Nanto-0.956
anto-0.858
urus-0.850
Token portray
Token ID10993
Feature activation-5.00e-01

Pos logit contributions
geries+0.423
GA+0.389
gery+0.377
gotten+0.373
ODY+0.359
Neg logit contributions
elsen-0.373
sole-0.363
 Nanto-0.358
anto-0.327
urus-0.325
Token me
Token ID502
Feature activation+3.87e-01

Pos logit contributions
geries+1.477
GA+1.338
gery+1.282
gotten+1.281
ODY+1.228
Neg logit contributions
elsen-1.779
sole-1.744
 Nanto-1.719
anto-1.592
urus-1.585
Token as
Token ID355
Feature activation+4.66e-01

Pos logit contributions
elsen+1.224
sole+1.205
 Nanto+1.172
anto+1.083
urus+1.070
Neg logit contributions
geries-1.289
GA-1.179
gotten-1.142
gery-1.137
ODY-1.103
Token a
Token ID257
Feature activation-6.18e-02

Pos logit contributions
elsen+1.509
sole+1.492
 Nanto+1.451
anto+1.339
urus+1.329
Neg logit contributions
geries-1.509
GA-1.380
gotten-1.336
gery-1.320
ODY-1.304
Token r
Token ID374
Feature activation+6.97e-02

Pos logit contributions
geries+0.201
GA+0.183
gotten+0.176
gery+0.176
ODY+0.171
Neg logit contributions
elsen-0.202
sole-0.198
 Nanto-0.195
anto-0.179
urus-0.178
Token were
Token ID547
Feature activation+1.97e-01

Pos logit contributions
elsen+0.794
sole+0.778
 Nanto+0.757
anto+0.700
urus+0.690
Neg logit contributions
geries-0.967
GA-0.885
gery-0.863
gotten-0.859
ODY-0.830
Token also
Token ID635
Feature activation+1.65e-01

Pos logit contributions
elsen+0.635
sole+0.622
 Nanto+0.611
anto+0.563
urus+0.560
Neg logit contributions
geries-0.646
GA-0.590
gery-0.569
gotten-0.568
ODY-0.551
Token taken
Token ID2077
Feature activation+2.41e-01

Pos logit contributions
elsen+0.542
sole+0.531
 Nanto+0.522
anto+0.481
urus+0.479
Neg logit contributions
geries-0.532
GA-0.486
gery-0.467
gotten-0.467
ODY-0.452
Token.[
Token ID3693
Feature activation-4.01e-01

Pos logit contributions
elsen+0.721
sole+0.708
 Nanto+0.693
anto+0.634
urus+0.627
Neg logit contributions
geries-0.847
GA-0.779
gotten-0.753
gery-0.751
ODY-0.729
Token12
Token ID1065
Feature activation+5.03e-01

Pos logit contributions
geries+1.204
GA+1.112
gotten+1.058
gery+1.052
ODY+1.018
Neg logit contributions
elsen-1.405
sole-1.365
 Nanto-1.364
anto-1.250
urus-1.242
Token][
Token ID7131
Feature activation-3.33e-01

Pos logit contributions
elsen+1.706
sole+1.672
 Nanto+1.640
anto+1.524
urus+1.508
Neg logit contributions
geries-1.572
GA-1.415
gery-1.372
gotten-1.369
ODY-1.316
Token15
Token ID1314
Feature activation-7.42e-02

Pos logit contributions
geries+1.003
GA+0.922
gotten+0.880
gery+0.876
ODY+0.844
Neg logit contributions
elsen-1.164
sole-1.133
 Nanto-1.132
anto-1.037
urus-1.033
Token]
Token ID60
Feature activation-2.41e-02

Pos logit contributions
geries+0.219
GA+0.200
gotten+0.191
gery+0.191
ODY+0.184
Neg logit contributions
elsen-0.264
sole-0.259
 Nanto-0.255
anto-0.236
urus-0.235
Token
Token ID198
Feature activation-2.13e-01

Pos logit contributions
geries+0.082
GA+0.075
gery+0.072
gotten+0.072
ODY+0.070
Neg logit contributions
elsen-0.076
sole-0.074
 Nanto-0.073
anto-0.067
urus-0.066
Token
Token ID198
Feature activation-2.39e-01

Pos logit contributions
geries+0.745
GA+0.685
gotten+0.661
gery+0.661
ODY+0.640
Neg logit contributions
elsen-0.644
sole-0.630
 Nanto-0.618
anto-0.565
urus-0.561
TokenResults
Token ID25468
Feature activation+3.27e-01

Pos logit contributions
geries+0.760
GA+0.695
gotten+0.667
gery+0.667
ODY+0.643
Neg logit contributions
elsen-0.798
sole-0.782
 Nanto-0.771
anto-0.710
urus-0.706
Token bring
Token ID2222
Feature activation+8.76e-01

Pos logit contributions
elsen+0.507
sole+0.496
 Nanto+0.488
anto+0.447
urus+0.444
Neg logit contributions
geries-0.566
GA-0.520
gotten-0.501
gery-0.501
ODY-0.486
Token the
Token ID262
Feature activation+2.85e-01

Pos logit contributions
elsen+2.511
sole+2.424
 Nanto+2.385
anto+2.225
ross+2.197
Neg logit contributions
geries-3.125
GA-2.860
gotten-2.789
gery-2.770
��-2.745
Token deficit
Token ID11807
Feature activation-3.70e-01

Pos logit contributions
elsen+0.900
sole+0.882
 Nanto+0.866
anto+0.801
urus+0.789
Neg logit contributions
geries-0.952
GA-0.870
gotten-0.840
gery-0.840
ODY-0.816
Token down
Token ID866
Feature activation+4.20e-01

Pos logit contributions
geries+1.524
GA+1.433
gery+1.384
gotten+1.382
YP+1.344
Neg logit contributions
elsen-0.874
sole-0.859
 Nanto-0.824
urus-0.744
anto-0.744
Token to
Token ID284
Feature activation+1.01e-01

Pos logit contributions
elsen+1.255
sole+1.228
 Nanto+1.201
anto+1.091
ross+1.080
Neg logit contributions
geries-1.486
GA-1.360
gery-1.317
gotten-1.317
ODY-1.273
Token 6
Token ID718
Feature activation-3.85e-01

Pos logit contributions
elsen+0.310
sole+0.303
 Nanto+0.297
anto+0.273
urus+0.271
Neg logit contributions
geries-0.350
GA-0.321
gotten-0.311
gery-0.310
ODY-0.301
Token.
Token ID13
Feature activation-4.94e-01

Pos logit contributions
geries+1.307
GA+1.217
gery+1.158
gotten+1.158
ODY+1.121
Neg logit contributions
elsen-1.196
sole-1.174
 Nanto-1.155
urus-1.053
anto-1.048
Token3
Token ID18
Feature activation-3.76e-01

Pos logit contributions
geries+1.452
GA+1.357
gotten+1.277
gery+1.270
ODY+1.235
Neg logit contributions
elsen-1.741
sole-1.712
 Nanto-1.683
anto-1.559
urus-1.559
Token percent
Token ID1411
Feature activation-1.05e-01

Pos logit contributions
geries+1.427
GA+1.337
gotten+1.282
gery+1.279
ODY+1.238
Neg logit contributions
elsen-1.022
sole-0.996
 Nanto-0.979
urus-0.880
anto-0.878
Token of
Token ID286
Feature activation-2.02e-01

Pos logit contributions
geries+0.406
GA+0.377
gery+0.365
gotten+0.364
ODY+0.354
Neg logit contributions
elsen-0.281
sole-0.274
 Nanto-0.268
anto-0.242
urus-0.241
Token GDP
Token ID12396
Feature activation+1.48e-01

Pos logit contributions
geries+0.676
GA+0.620
gery+0.597
gotten+0.597
ODY+0.577
Neg logit contributions
elsen-0.638
sole-0.625
 Nanto-0.614
anto-0.564
urus-0.560
Token of
Token ID286
Feature activation-4.47e-01

Pos logit contributions
geries+1.963
GA+1.839
gery+1.772
gotten+1.766
ODY+1.731
Neg logit contributions
elsen-1.409
sole-1.373
 Nanto-1.326
anto-1.210
urus-1.205
Token the
Token ID262
Feature activation-4.55e-01

Pos logit contributions
geries+1.367
GA+1.245
gotten+1.195
gery+1.193
ODY+1.148
Neg logit contributions
elsen-1.542
sole-1.514
 Nanto-1.482
anto-1.378
urus-1.366
Token 5
Token ID642
Feature activation-6.91e-01

Pos logit contributions
geries+1.536
GA+1.418
gery+1.359
gotten+1.358
ODY+1.317
Neg logit contributions
elsen-1.413
sole-1.383
 Nanto-1.350
anto-1.251
urus-1.243
Token PM
Token ID3122
Feature activation-2.07e-01

Pos logit contributions
geries+2.301
GA+2.137
gery+2.037
gotten+2.022
ODY+1.978
Neg logit contributions
elsen-2.161
sole-2.143
 Nanto-2.088
urus-1.925
anto-1.915
Token Monday
Token ID3321
Feature activation-1.46e-01

Pos logit contributions
geries+0.650
GA+0.594
gery+0.568
gotten+0.567
ODY+0.549
Neg logit contributions
elsen-0.699
sole-0.686
 Nanto-0.672
anto-0.623
urus-0.620
Token deadline
Token ID12508
Feature activation-1.74e+00

Pos logit contributions
geries+0.492
GA+0.452
gery+0.435
gotten+0.435
ODY+0.421
Neg logit contributions
elsen-0.456
sole-0.447
 Nanto-0.438
anto-0.403
urus-0.401
Token for
Token ID329
Feature activation-1.23e+00

Pos logit contributions
geries+5.000
GA+4.606
gery+4.390
gotten+4.285
ODY+4.276
Neg logit contributions
elsen-5.972
sole-5.889
 Nanto-5.643
urus-5.416
anto-5.360
Token all
Token ID477
Feature activation-1.98e-01

Pos logit contributions
geries+3.900
GA+3.526
gotten+3.418
gery+3.412
ODY+3.238
Neg logit contributions
elsen-4.090
sole-3.982
 Nanto-3.865
anto-3.648
urus-3.585
Token NHL
Token ID9481
Feature activation-8.54e-01

Pos logit contributions
geries+0.642
GA+0.588
gotten+0.565
gery+0.565
ODY+0.545
Neg logit contributions
elsen-0.648
sole-0.636
 Nanto-0.624
anto-0.575
urus-0.572
Token teams
Token ID3466
Feature activation+3.69e-01

Pos logit contributions
geries+2.623
GA+2.422
gotten+2.300
gery+2.293
ODY+2.224
Neg logit contributions
elsen-2.898
sole-2.852
 Nanto-2.796
anto-2.596
urus-2.571
Token to
Token ID284
Feature activation+2.57e-01

Pos logit contributions
elsen+1.023
sole+0.999
 Nanto+0.984
anto+0.889
urus+0.882
Neg logit contributions
geries-1.378
GA-1.271
gery-1.233
gotten-1.232
ODY-1.193
Token his
Token ID465
Feature activation-6.90e-02

Pos logit contributions
geries+3.222
GA+2.926
gotten+2.801
gery+2.801
ODY+2.677
Neg logit contributions
elsen-3.874
sole-3.793
 Nanto-3.739
anto-3.450
urst-3.420
Token long
Token ID890
Feature activation-1.13e+00

Pos logit contributions
geries+0.246
GA+0.226
gotten+0.219
gery+0.219
ODY+0.212
Neg logit contributions
elsen-0.204
sole-0.199
 Nanto-0.195
anto-0.179
urus-0.177
Token-
Token ID12
Feature activation-4.35e-01

Pos logit contributions
geries+3.100
GA+2.866
gery+2.711
gotten+2.702
ODY+2.569
Neg logit contributions
elsen-4.213
sole-4.122
 Nanto-4.058
urus-3.754
urst-3.745
Tokenprom
Token ID16963
Feature activation-7.00e-01

Pos logit contributions
geries+1.533
GA+1.427
gotten+1.377
gery+1.375
ODY+1.317
Neg logit contributions
elsen-1.303
sole-1.259
 Nanto-1.254
anto-1.136
urus-1.126
Tokenised
Token ID1417
Feature activation-6.53e-01

Pos logit contributions
geries+2.325
GA+2.178
gotten+2.102
gery+2.077
ODY+2.009
Neg logit contributions
elsen-2.160
sole-2.133
 Nanto-2.099
anto-1.921
urus-1.916
Token wall
Token ID3355
Feature activation-1.48e+00

Pos logit contributions
geries+1.992
GA+1.846
gery+1.756
gotten+1.751
ODY+1.691
Neg logit contributions
elsen-2.250
sole-2.186
 Nanto-2.160
urus-1.994
anto-1.990
Token along
Token ID1863
Feature activation+6.49e-01

Pos logit contributions
geries+4.757
GA+4.506
gery+4.319
gotten+4.225
ODY+4.114
Neg logit contributions
elsen-4.722
sole-4.529
 Nanto-4.474
urus-4.234
anto-4.117
Token the
Token ID262
Feature activation-1.57e-01

Pos logit contributions
elsen+2.142
sole+2.085
 Nanto+2.039
anto+1.918
urst+1.906
Neg logit contributions
geries-2.076
GA-1.854
gery-1.804
gotten-1.803
ODY-1.767
Token U
Token ID471
Feature activation-5.21e-01

Pos logit contributions
geries+0.533
GA+0.490
gery+0.472
gotten+0.472
ODY+0.455
Neg logit contributions
elsen-0.491
sole-0.481
 Nanto-0.472
anto-0.432
urus-0.430
Token.
Token ID13
Feature activation+1.22e-01

Pos logit contributions
geries+1.463
GA+1.322
gery+1.261
gotten+1.261
ODY+1.209
Neg logit contributions
elsen-1.929
sole-1.894
 Nanto-1.866
anto-1.736
urus-1.727
TokenS
Token ID50
Feature activation+3.96e-01

Pos logit contributions
elsen+0.457
sole+0.449
 Nanto+0.440
anto+0.413
urus+0.411
Neg logit contributions
geries-0.342
GA-0.296
gery-0.289
gotten-0.288
ODY-0.276
Token life
Token ID1204
Feature activation-3.67e-01

Pos logit contributions
elsen+0.099
sole+0.096
 Nanto+0.095
anto+0.087
urus+0.086
Neg logit contributions
geries-0.107
GA-0.098
gotten-0.095
gery-0.094
ODY-0.091
Token on
Token ID319
Feature activation+1.31e-01

Pos logit contributions
geries+1.239
GA+1.143
gotten+1.102
gery+1.102
ODY+1.065
Neg logit contributions
elsen-1.145
sole-1.112
 Nanto-1.098
anto-1.005
urus-1.001
Token the
Token ID262
Feature activation+5.44e-02

Pos logit contributions
elsen+0.453
sole+0.445
 Nanto+0.437
anto+0.406
urus+0.401
Neg logit contributions
geries-0.402
GA-0.363
gery-0.349
gotten-0.348
ODY-0.338
Token streets
Token ID6483
Feature activation+1.75e-02

Pos logit contributions
elsen+0.179
sole+0.175
 Nanto+0.172
anto+0.159
urus+0.158
Neg logit contributions
geries-0.175
GA-0.160
gery-0.154
gotten-0.154
ODY-0.149
Token of
Token ID286
Feature activation+1.46e-01

Pos logit contributions
elsen+0.058
sole+0.057
 Nanto+0.056
anto+0.052
urus+0.051
Neg logit contributions
geries-0.056
GA-0.051
gery-0.049
gotten-0.049
ODY-0.047
Token Wa
Token ID15329
Feature activation-1.54e+00

Pos logit contributions
elsen+0.519
sole+0.509
 Nanto+0.502
anto+0.465
urus+0.463
Neg logit contributions
geries-0.431
GA-0.390
gotten-0.374
gery-0.374
ODY-0.360
Tokenik
Token ID1134
Feature activation-4.42e-01

Pos logit contributions
geries+5.026
GA+4.666
gotten+4.526
gery+4.469
YP+4.294
Neg logit contributions
elsen-4.710
sole-4.680
 Nanto-4.583
anto-4.106
ross-4.079
Tokeniki
Token ID5580
Feature activation-7.63e-01

Pos logit contributions
geries+1.073
GA+0.950
gery+0.900
gotten+0.900
ODY+0.856
Neg logit contributions
elsen-1.804
sole-1.776
 Nanto-1.751
anto-1.642
urus-1.633
Token,
Token ID11
Feature activation+5.63e-02

Pos logit contributions
geries+2.310
GA+2.116
gery+2.021
gotten+2.016
ODY+1.949
Neg logit contributions
elsen-2.636
sole-2.577
 Nanto-2.544
urus-2.349
anto-2.345
Token and
Token ID290
Feature activation+3.27e-01

Pos logit contributions
elsen+0.166
sole+0.162
 Nanto+0.159
anto+0.145
urus+0.144
Neg logit contributions
geries-0.201
GA-0.185
gotten-0.179
gery-0.179
ODY-0.173
Token feels
Token ID5300
Feature activation+1.47e-01

Pos logit contributions
elsen+0.985
sole+0.964
 Nanto+0.949
anto+0.870
urus+0.861
Neg logit contributions
geries-1.149
GA-1.047
gery-1.015
gotten-1.014
ODY-0.983
Tokenov
Token ID709
Feature activation+7.58e-01

Pos logit contributions
geries+0.267
GA+0.249
gotten+0.241
gery+0.241
ODY+0.235
Neg logit contributions
elsen-0.158
sole-0.154
 Nanto-0.149
anto-0.134
urus-0.133
Token,
Token ID11
Feature activation-6.43e-02

Pos logit contributions
elsen+2.420
sole+2.371
 Nanto+2.306
anto+2.160
urus+2.119
Neg logit contributions
geries-2.508
GA-2.262
gotten-2.198
gery-2.168
ODY-2.123
Token along
Token ID1863
Feature activation+8.03e-01

Pos logit contributions
geries+0.212
GA+0.194
gotten+0.186
gery+0.186
ODY+0.180
Neg logit contributions
elsen-0.208
sole-0.204
 Nanto-0.199
anto-0.184
urus-0.183
Token with
Token ID351
Feature activation-3.49e-02

Pos logit contributions
elsen+1.964
sole+1.950
 Nanto+1.879
anto+1.679
urst+1.669
Neg logit contributions
geries-3.263
GA-2.965
gery-2.892
gotten-2.880
��-2.869
Token his
Token ID465
Feature activation-9.82e-02

Pos logit contributions
geries+0.116
GA+0.106
gotten+0.102
gery+0.102
ODY+0.099
Neg logit contributions
elsen-0.112
sole-0.109
 Nanto-0.107
anto-0.099
urus-0.098
Token cousin
Token ID16933
Feature activation-1.47e+00

Pos logit contributions
geries+0.295
GA+0.268
gotten+0.256
gery+0.256
ODY+0.247
Neg logit contributions
elsen-0.345
sole-0.339
 Nanto-0.333
anto-0.309
urus-0.307
Token,
Token ID11
Feature activation-1.79e-01

Pos logit contributions
geries+4.649
GA+4.272
gery+4.137
gotten+4.093
ODY+3.941
Neg logit contributions
elsen-4.796
sole-4.643
 Nanto-4.512
urus-4.213
anto-4.211
Token Nikola
Token ID47817
Feature activation+3.78e-01

Pos logit contributions
geries+0.594
GA+0.544
gery+0.524
gotten+0.524
ODY+0.506
Neg logit contributions
elsen-0.571
sole-0.559
 Nanto-0.549
anto-0.505
urus-0.502
Token Gr
Token ID1902
Feature activation+8.32e-02

Pos logit contributions
elsen+1.148
sole+1.120
 Nanto+1.098
anto+1.009
urus+1.001
Neg logit contributions
geries-1.322
GA-1.213
gery-1.174
gotten-1.170
ODY-1.138
Tokenue
Token ID518
Feature activation-4.57e-01

Pos logit contributions
elsen+0.274
sole+0.269
 Nanto+0.263
anto+0.244
urus+0.243
Neg logit contributions
geries-0.268
GA-0.245
gotten-0.236
gery-0.235
ODY-0.228
Tokenvs
Token ID14259
Feature activation+2.28e-01

Pos logit contributions
geries+1.881
GA+1.754
gery+1.709
gotten+1.705
ODY+1.644
Neg logit contributions
elsen-1.092
sole-1.058
 Nanto-1.050
anto-0.918
urus-0.915
Token
Token ID251
Feature activation+6.65e-01

Pos logit contributions
elsen+2.618
sole+2.598
 Nanto+2.520
anto+2.375
urus+2.368
Neg logit contributions
geries-1.941
GA-1.707
gery-1.652
gotten-1.649
ODY-1.575
Token and
Token ID290
Feature activation+1.47e-02

Pos logit contributions
elsen+2.138
sole+2.123
 Nanto+2.060
anto+1.920
urus+1.882
Neg logit contributions
geries-2.171
GA-1.961
gotten-1.914
gery-1.905
ODY-1.834
Token
Token ID564
Feature activation+2.08e-01

Pos logit contributions
elsen+0.045
sole+0.044
 Nanto+0.043
anto+0.040
urus+0.039
Neg logit contributions
geries-0.050
GA-0.046
gery-0.045
gotten-0.045
ODY-0.043
Token
Token ID250
Feature activation+4.52e-01

Pos logit contributions
elsen+0.827
sole+0.814
 Nanto+0.800
anto+0.750
urus+0.747
Neg logit contributions
geries-0.528
GA-0.468
gery-0.446
gotten-0.445
ODY-0.425
Tokenactive
Token ID5275
Feature activation-4.63e-01

Pos logit contributions
elsen+1.438
sole+1.430
 Nanto+1.372
anto+1.282
urus+1.266
Neg logit contributions
geries-1.504
GA-1.357
gotten-1.320
gery-1.318
ODY-1.279
Token bystand
Token ID33326
Feature activation-1.80e+00

Pos logit contributions
geries+1.454
GA+1.327
gery+1.274
gotten+1.273
ODY+1.228
Neg logit contributions
elsen-1.564
sole-1.534
 Nanto-1.509
anto-1.393
urus-1.385
Tokener
Token ID263
Feature activation-9.26e-01

Pos logit contributions
geries+5.952
GA+5.493
gery+5.461
gotten+5.275
YP+4.928
Neg logit contributions
 Nanto-5.326
elsen-5.319
sole-5.305
anto-4.657
urus-4.625
Token training
Token ID3047
Feature activation-2.04e-01

Pos logit contributions
geries+3.010
GA+2.791
gery+2.675
gotten+2.650
ODY+2.591
Neg logit contributions
elsen-2.979
sole-2.915
 Nanto-2.866
urus-2.625
anto-2.615
Token
Token ID447
Feature activation+1.27e-01

Pos logit contributions
geries+0.635
GA+0.579
gery+0.555
gotten+0.554
ODY+0.535
Neg logit contributions
elsen-0.695
sole-0.681
 Nanto-0.670
anto-0.619
urus-0.616
Token
Token ID251
Feature activation+2.21e-01

Pos logit contributions
elsen+0.477
sole+0.470
 Nanto+0.459
anto+0.430
urus+0.429
Neg logit contributions
geries-0.350
GA-0.313
gery-0.300
gotten-0.300
ODY-0.288
Token.
Token ID13
Feature activation-2.17e-01

Pos logit contributions
elsen+0.763
sole+0.751
 Nanto+0.736
anto+0.684
urus+0.677
Neg logit contributions
geries-0.678
GA-0.614
gotten-0.591
gery-0.591
ODY-0.568