TOP ACTIVATIONS
MAX = 3.068

by a major , household - name financial institution in New
- human transmission and community - level outbreaks . Phase 5
week and maintained at a stable level during the experiments .
cities Several large - scale public executions of around
to attending my first large - scale tournament in the form
reproductive health services from school - based clinics . Nor are
' Willie C au ley - Ste in suffered an open
d viewed a cell - phone video of Ford inh
also . Due to large - scale arson , looting and
his s of a big - city restaurant kitchen .
that . Large - scale development without competitive bids
He then scored a season - high 28 points against Boston
the workforce than other college - educated prime - age workers
with inadequate and risky account - style plans , we would
," and his recent town - hall response to a man
, President Park Ge un - h ye 's government announced
to buy goods in big - box stores that can be
their conversations involve apologizing and large amounts of guilt on both
based battles . Its cel - sh aded visuals look the
students at eight city school - based clinics . The number

MIN ACTIVATIONS
MIN = -3.042

. Although MIC AS is more sensitive , its field -
and I learnt a lot more than I had before too
not ching repeated wins in off - road race series .
R apt or 's pioneering off - road mode driving technology
2 tires specifically designed for off - road performance .
p uls ating combustion more realistic , too . It
a difference in corners much more clearly on the 16 -
ville 's house on the r ue du Vie ux -
- way tie for second right now in the Central .
it is a little more out there than most . -
class of people who benefit more from immigrant and undocumented workers
to - use version of off - road mode driver -
the most powerful and fastest off - road pickups .
infections older than that are less accurate and can take weeks
monitor cabinet is a lot more difficult and could cause safety
aluminum alloy that helps save more than 500 pounds of weight
a poet of sens ibility more than sense , which means
moderate success . His least anti - Semitic book , Peach
the ultimate high - performance off - road pickup in the
the Desert , Lucas Oil Off Road Racing Series and the

INTERVAL 1.540 - 3.068
CONTAINS 0.264%

unlikely to have impact on large projects that are ultimately decided
officials from eight countries with large flu outbreaks on Wednesday in
was unprecedented in a major - party presidential candidate . So
someone 's 200 - year - old house ," Adams said
a one - percent age - point increase from the prior

INTERVAL 0.013 - 1.540
CONTAINS 47.908%

collaborate more in the coming years . Here s
, to 0 . 05 %, which is already the legal
( eat to survive / drop items on death ) and
end of hang - gl ider performance possibly
7 }, { 1 , 6 } < -> { 2

INTERVAL -1.514 - 0.013
CONTAINS 51.461%

­ pub ­ lic ­ ans want to say something really
that most of the report she was coerced into filing is
Keller that summer in Massachusetts . " I never
mean Montreal s second power - play defensive pairing
can also go ahead and get something for yourself on my

INTERVAL -3.042 - -1.514
CONTAINS 0.367%

cry stal meth makes it so you don 't ejac ulate
everything together . You can lose any one of the 5
apt or is 6 inches wider than the standard F -
: Clive Brun skill / Getty Images That low
ian had published in the New Yorker . In an interview
Token by
Token ID416
Feature activation+5.05e-03

Pos logit contributions
shown+3.756
ardless+3.675
 attRot+3.646
mates+3.253
warming+3.189
Neg logit contributions
stals-3.891
ights-3.832
arty-3.814
usalem-3.784
PLIED-3.741
Token a
Token ID257
Feature activation+1.36e-01

Pos logit contributions
shown+0.032
 attRot+0.031
ardless+0.030
mates+0.028
warming+0.026
Neg logit contributions
ights-0.028
stals-0.028
usalem-0.027
arty-0.027
PLIED-0.027
Token major
Token ID1688
Feature activation+8.23e-01

Pos logit contributions
shown+0.842
 attRot+0.820
ardless+0.784
mates+0.718
warming+0.700
Neg logit contributions
stals-0.756
ights-0.754
arty-0.743
PLIED-0.738
usalem-0.735
Token,
Token ID11
Feature activation+7.01e-02

Pos logit contributions
shown+4.904
 attRot+4.658
ardless+4.481
mates+4.086
warming+4.056
Neg logit contributions
stals-4.498
PLIED-4.434
ights-4.427
usalem-4.408
arty-4.318
Token household
Token ID6641
Feature activation+1.80e+00

Pos logit contributions
shown+0.448
 attRot+0.430
ardless+0.415
mates+0.377
warming+0.376
Neg logit contributions
ights-0.378
stals-0.376
arty-0.371
usalem-0.368
PLIED-0.366
Token-
Token ID12
Feature activation+3.07e+00

Pos logit contributions
shown+10.179
 attRot+9.427
ardless+9.281
mates+8.995
warming+8.975
Neg logit contributions
ights-9.173
usalem-8.945
stals-8.846
PLIED-8.787
burgh-8.685
Tokenname
Token ID3672
Feature activation+5.13e-01

Pos logit contributions
shown+15.591
warming+14.126
mates+13.729
operated+13.318
ardless+13.116
Neg logit contributions
usalem-14.517
PLIED-14.467
ights-14.238
 Twice-13.896
stals-13.811
Token financial
Token ID3176
Feature activation-2.63e-03

Pos logit contributions
shown+2.956
 attRot+2.856
ardless+2.729
mates+2.497
warming+2.463
Neg logit contributions
ights-2.997
stals-2.992
usalem-2.950
PLIED-2.947
arty-2.933
Token institution
Token ID9901
Feature activation+4.92e-01

Pos logit contributions
ights+0.016
arty+0.016
stals+0.015
usalem+0.015
PLIED+0.015
Neg logit contributions
shown-0.015
 attRot-0.015
ardless-0.014
warming-0.013
mates-0.013
Token in
Token ID287
Feature activation+5.45e-01

Pos logit contributions
shown+2.882
 attRot+2.825
ardless+2.729
mates+2.481
warming+2.433
Neg logit contributions
ights-2.841
usalem-2.815
stals-2.814
PLIED-2.771
arty-2.757
Token New
Token ID968
Feature activation-8.82e-01

Pos logit contributions
shown+3.520
 attRot+3.384
ardless+3.246
mates+3.114
warming+2.929
Neg logit contributions
ights-2.818
PLIED-2.804
stals-2.782
arty-2.741
usalem-2.715
Token-
Token ID12
Feature activation+2.69e-01

Pos logit contributions
ights+3.314
ben+3.202
arty+3.201
bite+3.018
stals+3.003
Neg logit contributions
shown-10.587
 attRot-10.387
ardless-10.170
ensical-9.734
mates-9.295
Tokenhuman
Token ID10734
Feature activation+1.85e-01

Pos logit contributions
shown+1.475
 attRot+1.359
ardless+1.320
warming+1.232
mates+1.204
Neg logit contributions
stals-1.730
ights-1.716
 Dimension-1.687
PLIED-1.674
 Twice-1.661
Token transmission
Token ID11478
Feature activation+1.75e-01

Pos logit contributions
shown+0.843
 attRot+0.806
ardless+0.755
warming+0.686
mates+0.679
Neg logit contributions
PLIED-1.344
stals-1.340
ights-1.316
arty-1.303
usalem-1.301
Token and
Token ID290
Feature activation+3.62e-01

Pos logit contributions
shown+1.127
 attRot+1.088
ardless+1.034
mates+0.956
warming+0.956
Neg logit contributions
PLIED-0.949
usalem-0.931
stals-0.924
arty-0.920
ights-0.919
Token community
Token ID2055
Feature activation+1.50e+00

Pos logit contributions
shown+2.064
 attRot+1.978
ardless+1.879
mates+1.704
warming+1.694
Neg logit contributions
stals-2.230
PLIED-2.175
ights-2.171
arty-2.161
usalem-2.136
Token-
Token ID12
Feature activation+2.66e+00

Pos logit contributions
shown+8.140
 attRot+7.891
warming+7.298
ardless+7.243
mates+7.164
Neg logit contributions
stals-8.280
PLIED-8.227
usalem-8.037
ights-8.012
arty-7.757
Tokenlevel
Token ID5715
Feature activation+1.60e+00

Pos logit contributions
shown+11.974
warming+10.631
mates+9.716
bred+9.693
 attRot+9.552
Neg logit contributions
PLIED-15.530
usalem-15.067
 Dimension-14.547
 Twice-14.531
 Venice-14.468
Token outbreaks
Token ID35198
Feature activation+3.39e-01

Pos logit contributions
shown+7.797
 attRot+7.493
warming+6.731
 behaviors+6.621
mates+6.441
Neg logit contributions
PLIED-9.522
stals-9.346
usalem-9.253
arty-8.969
ights-8.940
Token.
Token ID13
Feature activation-5.58e-02

Pos logit contributions
shown+2.099
 attRot+2.057
ardless+1.898
mates+1.751
warming+1.744
Neg logit contributions
PLIED-1.916
usalem-1.873
stals-1.872
arty-1.849
ights-1.830
Token Phase
Token ID18983
Feature activation+1.07e-01

Pos logit contributions
ights+0.308
arty+0.308
stals+0.305
usalem+0.304
PLIED+0.301
Neg logit contributions
shown-0.353
 attRot-0.343
ardless-0.326
mates-0.296
warming-0.290
Token 5
Token ID642
Feature activation+2.19e-01

Pos logit contributions
shown+0.631
 attRot+0.620
ardless+0.584
mates+0.529
warming+0.523
Neg logit contributions
ights-0.631
arty-0.628
stals-0.626
PLIED-0.619
usalem-0.616
Token week
Token ID1285
Feature activation+7.77e-02

Pos logit contributions
arty+0.677
ights+0.672
stals+0.671
PLIED+0.659
usalem+0.657
Neg logit contributions
shown-0.860
 attRot-0.841
ardless-0.812
mates-0.740
warming-0.736
Token and
Token ID290
Feature activation+6.80e-01

Pos logit contributions
shown+0.476
 attRot+0.464
ardless+0.445
mates+0.401
warming+0.397
Neg logit contributions
arty-0.443
ights-0.439
stals-0.438
PLIED-0.434
usalem-0.432
Token maintained
Token ID9456
Feature activation+1.86e+00

Pos logit contributions
shown+4.001
 attRot+3.938
ardless+3.726
mates+3.538
warming+3.409
Neg logit contributions
arty-3.918
stals-3.833
PLIED-3.809
usalem-3.789
ights-3.666
Token at
Token ID379
Feature activation+7.06e-01

Pos logit contributions
 attRot+8.206
shown+8.155
ardless+7.645
 thereafter+7.494
mates+7.429
Neg logit contributions
arty-11.843
usalem-11.292
stals-11.158
PLIED-11.076
phabet-10.954
Token a
Token ID257
Feature activation+9.26e-01

Pos logit contributions
shown+4.218
 attRot+4.101
ardless+3.967
mates+3.733
warming+3.592
Neg logit contributions
arty-3.977
usalem-3.846
stals-3.833
PLIED-3.830
ights-3.824
Token stable
Token ID8245
Feature activation+2.64e+00

Pos logit contributions
shown+5.123
 attRot+4.966
ardless+4.681
mates+4.417
warming+4.381
Neg logit contributions
arty-5.544
usalem-5.415
PLIED-5.414
stals-5.398
ights-5.304
Token level
Token ID1241
Feature activation+1.27e+00

Pos logit contributions
mates+11.582
 attRot+11.495
shown+11.477
warming+11.052
mate+10.759
Neg logit contributions
arty-13.976
PLIED-13.620
stals-13.544
usalem-13.255
ights-12.757
Token during
Token ID1141
Feature activation-9.36e-02

Pos logit contributions
shown+6.672
 attRot+6.504
ardless+5.939
mates+5.555
 thereafter+5.529
Neg logit contributions
arty-7.721
stals-7.578
usalem-7.559
PLIED-7.389
ights-7.331
Token the
Token ID262
Feature activation+2.42e-01

Pos logit contributions
stals+0.495
arty+0.494
ights+0.491
PLIED+0.486
usalem+0.480
Neg logit contributions
shown-0.613
 attRot-0.601
ardless-0.570
mates-0.530
warming-0.524
Token experiments
Token ID10256
Feature activation-1.30e-02

Pos logit contributions
shown+1.449
 attRot+1.427
ardless+1.351
mates+1.250
warming+1.235
Neg logit contributions
arty-1.395
stals-1.390
PLIED-1.358
usalem-1.355
ights-1.351
Token.
Token ID13
Feature activation+6.91e-01

Pos logit contributions
ights+0.085
arty+0.085
stals+0.085
PLIED+0.083
usalem+0.083
Neg logit contributions
shown-0.069
 attRot-0.067
ardless-0.064
mates-0.056
warming-0.056
Token cities
Token ID4736
Feature activation+3.10e-01

Pos logit contributions
shown+3.721
 attRot+3.538
mates+3.279
ardless+3.264
warming+3.126
Neg logit contributions
PLIED-3.743
stals-3.640
arty-3.574
ights-3.518
usalem-3.503
Token
Token ID198
Feature activation-6.11e-01

Pos logit contributions
shown+1.782
 attRot+1.696
ardless+1.624
mates+1.492
warming+1.468
Neg logit contributions
stals-1.907
PLIED-1.873
ights-1.861
arty-1.847
usalem-1.819
Token
Token ID198
Feature activation-6.80e-01

Pos logit contributions
ights+3.102
stals+3.026
arty+3.013
 Twice+2.978
ben+2.959
Neg logit contributions
shown-4.063
 attRot-3.929
ardless-3.883
ensical-3.625
warming-3.447
TokenSeveral
Token ID14945
Feature activation+1.89e-01

Pos logit contributions
ights+3.651
arty+3.601
stals+3.594
PLIED+3.521
usalem+3.507
Neg logit contributions
shown-4.426
 attRot-4.295
ardless-4.129
mates-3.742
warming-3.713
Token large
Token ID1588
Feature activation+1.57e+00

Pos logit contributions
shown+1.248
 attRot+1.184
ardless+1.117
mates+1.073
warming+1.038
Neg logit contributions
arty-0.995
stals-0.987
PLIED-0.980
usalem-0.952
ights-0.951
Token-
Token ID12
Feature activation+2.62e+00

Pos logit contributions
shown+8.570
 attRot+8.338
ardless+7.672
mates+7.452
warming+7.424
Neg logit contributions
PLIED-8.476
stals-8.122
arty-8.116
usalem-7.740
ights-7.694
Tokenscale
Token ID9888
Feature activation+1.16e+00

Pos logit contributions
shown+12.208
bodied+10.876
warming+10.667
ardless+10.534
 attRot+10.350
Neg logit contributions
PLIED-14.857
usalem-13.945
 Twice-13.739
 Dimension-13.598
 dearly-13.367
Token public
Token ID1171
Feature activation+4.89e-01

Pos logit contributions
shown+5.893
 attRot+5.661
warming+4.956
mates+4.840
ardless+4.806
Neg logit contributions
PLIED-7.097
stals-6.928
arty-6.911
usalem-6.803
ights-6.674
Token executions
Token ID30632
Feature activation-3.02e-01

Pos logit contributions
shown+2.132
 attRot+2.025
ardless+1.785
warming+1.688
mates+1.608
Neg logit contributions
PLIED-3.614
stals-3.556
arty-3.555
 Dimension-3.516
 Twice-3.503
Token of
Token ID286
Feature activation+5.67e-01

Pos logit contributions
arty+1.616
ights+1.610
stals+1.595
PLIED+1.569
 Dimension+1.564
Neg logit contributions
shown-1.977
 attRot-1.917
ardless-1.818
mates-1.683
warming-1.674
Token around
Token ID1088
Feature activation-2.77e-01

Pos logit contributions
shown+3.503
 attRot+3.296
ardless+3.118
mates+3.021
warming+2.902
Neg logit contributions
stals-3.096
arty-3.092
PLIED-3.082
usalem-3.041
ights-3.040
Token to
Token ID284
Feature activation-4.33e-02

Pos logit contributions
shown+2.037
 attRot+1.994
ardless+1.865
mates+1.623
warming+1.618
Neg logit contributions
ights-3.076
arty-3.070
PLIED-3.063
stals-3.048
usalem-3.018
Token attending
Token ID11969
Feature activation+2.00e-01

Pos logit contributions
PLIED+0.234
stals+0.233
ights+0.233
arty+0.230
usalem+0.229
Neg logit contributions
shown-0.281
 attRot-0.274
ardless-0.261
mates-0.240
warming-0.236
Token my
Token ID616
Feature activation-8.36e-02

Pos logit contributions
shown+1.318
 attRot+1.285
ardless+1.216
mates+1.122
warming+1.104
Neg logit contributions
PLIED-1.035
ights-1.031
stals-1.024
arty-1.022
usalem-1.018
Token first
Token ID717
Feature activation-5.92e-01

Pos logit contributions
ights+0.435
stals+0.432
PLIED+0.431
arty+0.427
usalem+0.426
Neg logit contributions
shown-0.554
 attRot-0.540
ardless-0.513
warming-0.473
mates-0.473
Token large
Token ID1588
Feature activation+1.38e+00

Pos logit contributions
ights+3.293
stals+3.244
arty+3.238
PLIED+3.192
usalem+3.164
Neg logit contributions
shown-3.741
 attRot-3.634
ardless-3.489
mates-3.144
warming-3.124
Token-
Token ID12
Feature activation+2.61e+00

Pos logit contributions
shown+8.497
 attRot+8.120
ardless+7.730
warming+7.424
FY+6.874
Neg logit contributions
PLIED-6.769
stals-6.629
ights-6.385
usalem-6.322
ben-6.267
Tokenscale
Token ID9888
Feature activation+1.06e+00

Pos logit contributions
shown+10.130
warming+8.710
scale+8.620
bodied+8.556
ardless+8.542
Neg logit contributions
usalem-16.396
PLIED-15.967
 Twice-15.461
 dearly-15.324
 Dimension-15.168
Token tournament
Token ID7756
Feature activation+3.28e-01

Pos logit contributions
shown+6.630
 attRot+6.435
ardless+5.914
warming+5.721
mates+5.418
Neg logit contributions
stals-5.362
PLIED-5.318
ights-5.209
usalem-5.093
ben-5.004
Token in
Token ID287
Feature activation+5.30e-01

Pos logit contributions
shown+1.962
 attRot+1.919
ardless+1.822
warming+1.669
mates+1.660
Neg logit contributions
ights-1.905
PLIED-1.903
stals-1.860
arty-1.855
usalem-1.850
Token the
Token ID262
Feature activation+1.19e-01

Pos logit contributions
shown+3.413
 attRot+3.262
ardless+3.062
mates+2.957
warming+2.799
Neg logit contributions
PLIED-2.817
stals-2.807
ights-2.753
arty-2.737
usalem-2.674
Token form
Token ID1296
Feature activation+2.00e-01

Pos logit contributions
shown+0.814
 attRot+0.786
ardless+0.746
mates+0.690
warming+0.678
Neg logit contributions
stals-0.611
ights-0.599
PLIED-0.594
arty-0.594
usalem-0.578
Token reproductive
Token ID18391
Feature activation+3.36e-01

Pos logit contributions
shown+3.171
 attRot+3.158
ardless+2.988
mates+2.797
warming+2.689
Neg logit contributions
ights-3.147
stals-3.143
arty-3.114
usalem-3.034
PLIED-3.001
Token health
Token ID1535
Feature activation+1.42e+00

Pos logit contributions
shown+1.818
 attRot+1.751
ardless+1.663
mates+1.538
warming+1.464
Neg logit contributions
stals-2.112
PLIED-2.082
ights-2.079
arty-2.062
usalem-2.062
Token services
Token ID2594
Feature activation+5.39e-01

Pos logit contributions
shown+5.447
 attRot+5.096
ardless+4.846
 behaviors+4.829
mates+4.756
Neg logit contributions
stals-9.817
arty-9.737
usalem-9.531
ben-9.364
 actionGroup-9.260
Token from
Token ID422
Feature activation+6.65e-01

Pos logit contributions
shown+2.869
 attRot+2.795
ardless+2.703
mates+2.435
warming+2.347
Neg logit contributions
stals-3.400
usalem-3.342
arty-3.338
ights-3.267
PLIED-3.257
Token school
Token ID1524
Feature activation+1.44e+00

Pos logit contributions
shown+3.621
 attRot+3.481
ardless+3.393
mates+3.196
idate+2.915
Neg logit contributions
stals-4.040
ights-4.040
arty-3.987
usalem-3.872
 actionGroup-3.829
Token-
Token ID12
Feature activation+2.58e+00

Pos logit contributions
mates+7.591
shown+7.175
 attRot+7.175
ardless+7.121
mate+6.409
Neg logit contributions
stals-8.274
arty-8.011
usalem-7.958
 Dimension-7.949
ights-7.895
Tokenbased
Token ID3106
Feature activation+1.09e+00

Pos logit contributions
shown+10.100
mates+9.985
operated+9.336
ardless+9.296
bred+8.629
Neg logit contributions
usalem-16.020
 Dimension-15.310
ights-15.223
 Myster-15.194
PLIED-15.190
Token clinics
Token ID21434
Feature activation+4.63e-01

Pos logit contributions
shown+4.804
 attRot+4.629
ardless+4.492
mates+4.399
 behaviors+3.842
Neg logit contributions
stals-7.265
ights-7.250
usalem-7.196
arty-6.998
PLIED-6.925
Token.
Token ID13
Feature activation-2.06e-01

Pos logit contributions
shown+2.432
 attRot+2.369
ardless+2.271
mates+2.075
warming+1.987
Neg logit contributions
stals-2.969
usalem-2.955
ights-2.898
arty-2.888
 Dimension-2.887
Token Nor
Token ID5414
Feature activation-7.10e-02

Pos logit contributions
ights+1.161
arty+1.155
usalem+1.150
stals+1.144
PLIED+1.118
Neg logit contributions
shown-1.292
 attRot-1.258
ardless-1.210
mates-1.092
warming-1.058
Token are
Token ID389
Feature activation+5.05e-01

Pos logit contributions
stals+0.346
PLIED+0.345
ights+0.345
arty+0.345
usalem+0.341
Neg logit contributions
shown-0.485
 attRot-0.475
ardless-0.464
mates-0.425
warming-0.414
Token'
Token ID6
Feature activation+1.34e-01

Pos logit contributions
shown+2.153
 attRot+2.062
ardless+1.925
mates+1.862
warming+1.804
Neg logit contributions
stals-1.719
arty-1.709
usalem-1.709
PLIED-1.704
ights-1.678
Token Willie
Token ID28623
Feature activation+4.47e-01

Pos logit contributions
shown+0.862
 attRot+0.843
ardless+0.795
mates+0.741
warming+0.726
Neg logit contributions
ights-0.727
stals-0.727
PLIED-0.726
usalem-0.723
arty-0.719
Token C
Token ID327
Feature activation+1.49e+00

Pos logit contributions
shown+2.555
 attRot+2.510
ardless+2.382
mates+2.158
idate+2.093
Neg logit contributions
usalem-2.663
PLIED-2.633
arty-2.613
bite-2.562
ights-2.516
Tokenau
Token ID559
Feature activation+5.62e-01

Pos logit contributions
shown+3.675
ardless+3.661
 attRot+3.213
urch+2.929
warming+2.536
Neg logit contributions
PLIED-12.587
 actionGroup-12.518
usalem-12.242
 Dimension-12.215
stals-12.206
Tokenley
Token ID1636
Feature activation+1.80e-01

Pos logit contributions
shown+2.483
 attRot+2.478
ardless+2.116
warming+1.997
bred+1.953
Neg logit contributions
PLIED-4.194
usalem-4.116
 Dimension-4.020
bite-4.010
stals-3.984
Token-
Token ID12
Feature activation+2.57e+00

Pos logit contributions
shown+1.339
 attRot+1.290
ardless+1.276
warming+1.190
mates+1.144
Neg logit contributions
usalem-0.800
ights-0.799
arty-0.779
stals-0.777
PLIED-0.775
TokenSte
Token ID7447
Feature activation+6.48e-01

Pos logit contributions
shown+10.508
 attRot+9.691
ardless+9.300
operated+8.474
warming+8.066
Neg logit contributions
 actionGroup-17.661
PLIED-17.455
usalem-17.236
stals-17.094
 Dimension-16.556
Tokenin
Token ID259
Feature activation-5.86e-01

Pos logit contributions
shown+3.772
 attRot+3.704
ardless+3.598
warming+3.255
mates+3.191
Neg logit contributions
usalem-3.807
stals-3.735
 actionGroup-3.672
ights-3.666
arty-3.655
Token suffered
Token ID6989
Feature activation-4.03e-01

Pos logit contributions
ights+2.877
stals+2.842
arty+2.832
PLIED+2.790
usalem+2.785
Neg logit contributions
shown-4.081
 attRot-3.969
ardless-3.834
mates-3.507
warming-3.481
Token an
Token ID281
Feature activation-8.12e-01

Pos logit contributions
ights+2.307
stals+2.280
arty+2.272
PLIED+2.229
usalem+2.227
Neg logit contributions
shown-2.513
 attRot-2.435
ardless-2.325
mates-2.098
warming-2.086
Token open
Token ID1280
Feature activation-5.49e-01

Pos logit contributions
ights+3.987
arty+3.929
stals+3.907
PLIED+3.828
 Twice+3.811
Neg logit contributions
shown-5.613
 attRot-5.481
ardless-5.281
mates-4.805
warming-4.772
Token
Token ID247
Feature activation-1.20e-01

Pos logit contributions
shown+0.300
 attRot+0.290
ardless+0.269
mates+0.234
warming+0.227
Neg logit contributions
ights-0.482
arty-0.478
stals-0.478
PLIED-0.477
usalem-0.475
Tokend
Token ID67
Feature activation+2.63e-01

Pos logit contributions
ights+0.716
stals+0.712
PLIED+0.707
 Dimension+0.705
arty+0.705
Neg logit contributions
shown-0.721
 attRot-0.679
ardless-0.660
mates-0.595
warming-0.585
Token viewed
Token ID9569
Feature activation+8.74e-01

Pos logit contributions
shown+1.693
 attRot+1.612
ardless+1.568
mates+1.418
warming+1.386
Neg logit contributions
stals-1.417
PLIED-1.407
ights-1.405
arty-1.397
 Twice-1.389
Token a
Token ID257
Feature activation+6.81e-01

Pos logit contributions
shown+5.353
 attRot+5.119
ardless+4.933
mates+4.664
idate+4.412
Neg logit contributions
stals-4.585
ights-4.576
PLIED-4.508
usalem-4.445
arty-4.409
Token cell
Token ID2685
Feature activation+6.33e-01

Pos logit contributions
shown+4.222
 attRot+4.007
ardless+3.782
mates+3.532
warming+3.486
Neg logit contributions
stals-3.743
PLIED-3.687
ights-3.660
arty-3.562
 actionGroup-3.491
Token-
Token ID12
Feature activation+2.54e+00

Pos logit contributions
shown+4.201
ardless+3.889
 attRot+3.883
mates+3.702
warming+3.633
Neg logit contributions
PLIED-3.292
stals-3.271
ights-3.149
arty-3.100
 actionGroup-3.047
Tokenphone
Token ID4862
Feature activation+6.82e-01

Pos logit contributions
shown+10.871
warming+8.842
operated+8.824
mates+8.705
mate+8.278
Neg logit contributions
PLIED-17.021
ights-15.900
usalem-15.703
 Horizons-15.578
stals-15.495
Token video
Token ID2008
Feature activation+3.69e-01

Pos logit contributions
shown+3.978
 attRot+3.696
ardless+3.414
mates+3.202
warming+3.185
Neg logit contributions
stals-4.043
ights-3.962
PLIED-3.946
arty-3.929
 Twice-3.799
Token of
Token ID286
Feature activation+1.01e+00

Pos logit contributions
shown+2.393
 attRot+2.232
ardless+2.077
warming+1.959
mates+1.932
Neg logit contributions
stals-2.026
ights-2.017
PLIED-1.975
arty-1.959
usalem-1.954
Token Ford
Token ID8092
Feature activation+3.41e-01

Pos logit contributions
shown+5.914
 attRot+5.609
ardless+5.208
mates+4.968
warming+4.775
Neg logit contributions
stals-5.592
ights-5.529
PLIED-5.461
usalem-5.321
arty-5.276
Token inh
Token ID25783
Feature activation-1.07e-01

Pos logit contributions
shown+2.060
 attRot+1.950
ardless+1.880
warming+1.748
mates+1.718
Neg logit contributions
stals-1.954
ights-1.931
PLIED-1.925
 Dimension-1.908
usalem-1.898
Token also
Token ID635
Feature activation+3.53e-01

Pos logit contributions
shown+0.860
 attRot+0.840
ardless+0.802
mates+0.717
warming+0.715
Neg logit contributions
ights-0.803
stals-0.797
arty-0.789
PLIED-0.787
usalem-0.773
Token.
Token ID13
Feature activation+6.68e-01

Pos logit contributions
shown+2.066
 attRot+2.014
ardless+1.919
warming+1.700
mates+1.686
Neg logit contributions
ights-2.097
stals-2.095
PLIED-2.086
arty-2.060
usalem-2.041
Token Due
Token ID14444
Feature activation+7.44e-01

Pos logit contributions
shown+4.048
 attRot+3.972
ardless+3.726
mates+3.317
FY+3.262
Neg logit contributions
usalem-3.731
stals-3.697
ights-3.687
arty-3.684
PLIED-3.656
Token to
Token ID284
Feature activation+3.10e-01

Pos logit contributions
shown+3.480
 attRot+3.447
ardless+3.205
warming+2.785
mates+2.757
Neg logit contributions
ights-5.143
arty-5.105
PLIED-5.066
stals-5.061
usalem-5.015
Token large
Token ID1588
Feature activation+1.55e+00

Pos logit contributions
shown+1.963
 attRot+1.923
ardless+1.779
mates+1.624
warming+1.623
Neg logit contributions
PLIED-1.680
ights-1.677
stals-1.674
arty-1.663
usalem-1.614
Token-
Token ID12
Feature activation+2.52e+00

Pos logit contributions
shown+7.747
 attRot+7.709
ardless+7.058
warming+6.478
mates+6.099
Neg logit contributions
PLIED-9.160
stals-8.762
usalem-8.651
arty-8.640
ights-8.475
Tokenscale
Token ID9888
Feature activation+1.26e+00

Pos logit contributions
shown+9.213
scale+7.857
 attRot+7.677
warming+7.632
ardless+7.430
Neg logit contributions
PLIED-16.648
usalem-16.285
 Twice-15.751
 Venice-15.539
ights-15.430
Token arson
Token ID35181
Feature activation-5.76e-01

Pos logit contributions
 attRot+7.031
shown+6.921
ardless+6.157
warming+5.795
mates+5.492
Neg logit contributions
PLIED-7.099
stals-6.893
usalem-6.876
ights-6.815
arty-6.730
Token,
Token ID11
Feature activation-1.04e-01

Pos logit contributions
ights+2.974
arty+2.930
stals+2.922
PLIED+2.867
 Twice+2.850
Neg logit contributions
shown-3.853
 attRot-3.752
ardless-3.615
mates-3.271
warming-3.256
Token looting
Token ID39473
Feature activation-7.85e-01

Pos logit contributions
ights+0.581
stals+0.576
arty+0.573
PLIED+0.571
usalem+0.563
Neg logit contributions
shown-0.657
 attRot-0.644
ardless-0.613
mates-0.555
warming-0.554
Token and
Token ID290
Feature activation+3.08e-01

Pos logit contributions
ights+3.580
arty+3.514
stals+3.504
 Twice+3.402
PLIED+3.397
Neg logit contributions
shown-5.707
 attRot-5.561
ardless-5.379
mates-4.912
warming-4.874
Token his
Token ID465
Feature activation-8.12e-01

Pos logit contributions
ights+2.572
arty+2.545
stals+2.544
PLIED+2.498
usalem+2.474
Neg logit contributions
shown-3.072
 attRot-2.999
ardless-2.872
mates-2.608
warming-2.587
Tokens
Token ID82
Feature activation-7.81e-01

Pos logit contributions
ights+3.610
arty+3.557
stals+3.534
PLIED+3.431
 Twice+3.419
Neg logit contributions
shown-5.987
 attRot-5.845
ardless-5.647
mates-5.151
warming-5.130
Token of
Token ID286
Feature activation+1.99e-01

Pos logit contributions
ights+3.620
arty+3.549
stals+3.535
 Twice+3.431
PLIED+3.427
Neg logit contributions
shown-5.623
 attRot-5.475
ardless-5.306
mates-4.821
warming-4.800
Token a
Token ID257
Feature activation-2.16e-02

Pos logit contributions
shown+1.253
 attRot+1.252
ardless+1.178
mates+1.113
warming+1.070
Neg logit contributions
stals-1.048
ights-1.048
usalem-1.038
PLIED-1.037
arty-1.026
Token big
Token ID1263
Feature activation+1.21e+00

Pos logit contributions
stals+0.114
ights+0.113
PLIED+0.111
arty+0.111
usalem+0.110
Neg logit contributions
shown-0.141
 attRot-0.140
ardless-0.132
mates-0.122
warming-0.120
Token-
Token ID12
Feature activation+2.49e+00

Pos logit contributions
 attRot+7.464
shown+7.285
ardless+6.932
mates+6.482
warming+6.428
Neg logit contributions
stals-6.085
PLIED-6.070
ights-5.851
usalem-5.773
arty-5.640
Tokencity
Token ID19205
Feature activation+2.41e-01

Pos logit contributions
shown+12.346
bodied+11.661
ardless+11.337
 attRot+11.166
warming+11.118
Neg logit contributions
PLIED-12.502
usalem-12.365
 Dimension-11.692
 Twice-11.447
ights-11.407
Token restaurant
Token ID7072
Feature activation+7.50e-01

Pos logit contributions
 attRot+1.550
shown+1.527
ardless+1.429
warming+1.328
mates+1.326
Neg logit contributions
stals-1.299
PLIED-1.269
ights-1.266
arty-1.249
usalem-1.243
Token kitchen
Token ID9592
Feature activation+9.17e-01

Pos logit contributions
 attRot+4.098
shown+4.024
ardless+3.738
warming+3.555
mates+3.520
Neg logit contributions
stals-4.603
usalem-4.498
ights-4.454
arty-4.405
PLIED-4.375
Token.
Token ID13
Feature activation+1.49e-01

Pos logit contributions
 attRot+4.859
shown+4.837
ardless+4.472
warming+4.289
mates+4.240
Neg logit contributions
stals-5.613
usalem-5.497
ights-5.416
arty-5.397
PLIED-5.391
Token
Token ID447
Feature activation+2.82e-01

Pos logit contributions
shown+0.921
 attRot+0.894
ardless+0.853
mates+0.775
warming+0.749
Neg logit contributions
usalem-0.847
ights-0.846
stals-0.840
arty-0.840
PLIED-0.825
Token that
Token ID326
Feature activation+3.59e-01

Pos logit contributions
shown+2.659
 attRot+2.607
ardless+2.439
warming+2.227
mates+2.210
Neg logit contributions
arty-2.422
PLIED-2.397
stals-2.395
usalem-2.371
ights-2.355
Token.
Token ID13
Feature activation+5.59e-01

Pos logit contributions
shown+1.805
 attRot+1.789
ardless+1.671
warming+1.473
mates+1.463
Neg logit contributions
ights-2.421
arty-2.398
usalem-2.395
stals-2.388
PLIED-2.360
Token
Token ID198
Feature activation+3.45e-01

Pos logit contributions
shown+3.202
 attRot+3.160
ardless+2.963
mates+2.639
warming+2.592
Neg logit contributions
usalem-3.389
arty-3.324
ights-3.299
stals-3.266
PLIED-3.246
Token
Token ID198
Feature activation+3.72e-01

Pos logit contributions
shown+1.952
 attRot+1.905
ardless+1.776
mates+1.615
warming+1.580
Neg logit contributions
ights-2.129
usalem-2.127
arty-2.111
PLIED-2.095
stals-2.093
TokenLarge
Token ID21968
Feature activation+1.83e+00

Pos logit contributions
shown+2.231
 attRot+2.128
ardless+1.973
FY+1.841
warming+1.840
Neg logit contributions
usalem-2.181
stals-2.179
ights-2.154
 Dimension-2.148
arty-2.145
Token-
Token ID12
Feature activation+2.47e+00

Pos logit contributions
shown+11.144
 attRot+10.503
ardless+10.346
warming+9.913
mates+9.295
Neg logit contributions
PLIED-8.175
usalem-8.115
stals-7.921
arty-7.575
vern-7.524
Tokenscale
Token ID9888
Feature activation+1.58e+00

Pos logit contributions
shown+9.853
warming+8.410
scale+8.175
bodied+8.057
 attRot+7.952
Neg logit contributions
usalem-16.252
PLIED-15.125
ights-15.077
 Venice-14.810
zzi-14.732
Token development
Token ID2478
Feature activation+1.24e+00

Pos logit contributions
shown+8.915
 attRot+8.873
ardless+7.795
warming+7.766
mates+7.268
Neg logit contributions
PLIED-8.203
usalem-8.043
stals-7.823
ights-7.740
arty-7.740
Token without
Token ID1231
Feature activation-5.57e-03

Pos logit contributions
shown+7.344
 attRot+7.082
ardless+6.492
warming+6.183
mates+6.037
Neg logit contributions
PLIED-6.678
usalem-6.496
stals-6.485
arty-6.330
ights-6.314
Token competitive
Token ID7606
Feature activation+2.59e-01

Pos logit contributions
ights+0.032
stals+0.031
arty+0.031
PLIED+0.031
usalem+0.031
Neg logit contributions
shown-0.035
 attRot-0.034
ardless-0.032
warming-0.029
mates-0.029
Token bids
Token ID27837
Feature activation-4.56e-01

Pos logit contributions
shown+1.537
 attRot+1.521
ardless+1.403
mates+1.304
warming+1.282
Neg logit contributions
ights-1.487
stals-1.487
usalem-1.486
PLIED-1.486
arty-1.475
Token He
Token ID679
Feature activation-3.16e-01

Pos logit contributions
shown+0.131
 attRot+0.126
ardless+0.119
mates+0.110
warming+0.106
Neg logit contributions
usalem-0.121
arty-0.120
ights-0.119
stals-0.118
PLIED-0.117
Token then
Token ID788
Feature activation-6.87e-02

Pos logit contributions
stals+1.614
ights+1.603
PLIED+1.599
arty+1.590
usalem+1.579
Neg logit contributions
shown-2.153
 attRot-2.071
ardless-1.988
mates-1.841
warming-1.816
Token scored
Token ID7781
Feature activation+4.51e-01

Pos logit contributions
stals+0.363
PLIED+0.359
usalem+0.357
ights+0.356
arty+0.354
Neg logit contributions
shown-0.457
 attRot-0.439
ardless-0.419
mates-0.389
warming-0.382
Token a
Token ID257
Feature activation-1.84e-03

Pos logit contributions
shown+2.822
 attRot+2.686
ardless+2.600
mates+2.473
warming+2.368
Neg logit contributions
stals-2.453
PLIED-2.422
usalem-2.415
arty-2.409
ights-2.382
Token season
Token ID1622
Feature activation+1.28e+00

Pos logit contributions
stals+0.011
PLIED+0.011
ights+0.011
usalem+0.011
arty+0.011
Neg logit contributions
shown-0.011
 attRot-0.011
ardless-0.010
mates-0.009
warming-0.009
Token-
Token ID12
Feature activation+2.45e+00

Pos logit contributions
shown+7.652
 attRot+7.296
ardless+6.827
mates+6.604
warming+6.503
Neg logit contributions
PLIED-6.901
arty-6.544
stals-6.522
ights-6.512
usalem-6.371
Tokenhigh
Token ID8929
Feature activation+7.95e-01

Pos logit contributions
shown+9.400
warming+7.334
mates+6.955
 attRot+6.416
ardless+6.391
Neg logit contributions
PLIED-17.702
 Twice-16.120
 Dimension-15.923
arty-15.915
 actionGroup-15.818
Token 28
Token ID2579
Feature activation+1.55e-01

Pos logit contributions
shown+4.765
 attRot+4.588
ardless+4.396
warming+4.078
mates+4.059
Neg logit contributions
stals-4.474
PLIED-4.469
usalem-4.329
arty-4.196
ights-4.188
Token points
Token ID2173
Feature activation-1.60e-01

Pos logit contributions
shown+0.886
 attRot+0.830
ardless+0.811
mates+0.733
warming+0.723
Neg logit contributions
PLIED-0.947
stals-0.947
ights-0.942
arty-0.942
usalem-0.925
Token against
Token ID1028
Feature activation-5.69e-01

Pos logit contributions
stals+0.843
arty+0.839
PLIED+0.835
ights+0.835
usalem+0.819
Neg logit contributions
shown-1.051
 attRot-1.016
ardless-0.982
mates-0.909
warming-0.899
Token Boston
Token ID6182
Feature activation-1.85e-01

Pos logit contributions
ights+2.926
arty+2.879
stals+2.872
PLIED+2.813
usalem+2.798
Neg logit contributions
shown-3.825
 attRot-3.723
ardless-3.589
mates-3.253
warming-3.230
Token the
Token ID262
Feature activation+2.21e-01

Pos logit contributions
shown+0.406
 attRot+0.404
ardless+0.380
mates+0.346
warming+0.336
Neg logit contributions
stals-0.404
ights-0.401
arty-0.397
usalem-0.394
PLIED-0.393
Token workforce
Token ID16433
Feature activation+5.76e-01

Pos logit contributions
 attRot+1.348
shown+1.348
ardless+1.246
mates+1.126
warming+1.120
Neg logit contributions
stals-1.279
ights-1.247
usalem-1.243
arty-1.222
PLIED-1.207
Token than
Token ID621
Feature activation+3.97e-01

Pos logit contributions
shown+3.382
 attRot+3.369
ardless+3.228
mates+2.908
warming+2.841
Neg logit contributions
stals-3.334
PLIED-3.315
ights-3.301
arty-3.232
usalem-3.225
Token other
Token ID584
Feature activation+8.33e-01

Pos logit contributions
 attRot+2.333
shown+2.307
ardless+2.171
mates+2.025
warming+1.899
Neg logit contributions
stals-2.319
usalem-2.273
ights-2.272
arty-2.239
PLIED-2.224
Token college
Token ID4152
Feature activation+7.12e-01

Pos logit contributions
 attRot+4.284
shown+4.249
ardless+3.955
mates+3.642
 behaviors+3.425
Neg logit contributions
stals-5.309
usalem-5.190
ights-5.135
PLIED-4.988
arty-4.986
Token-
Token ID12
Feature activation+2.45e+00

Pos logit contributions
shown+4.492
 attRot+4.465
ardless+4.281
mates+4.119
warming+3.915
Neg logit contributions
usalem-3.688
stals-3.618
ights-3.540
arty-3.472
 Twice-3.464
Tokeneducated
Token ID27317
Feature activation+8.11e-01

Pos logit contributions
shown+10.407
mates+9.763
ardless+9.419
 attRot+9.074
bred+9.054
Neg logit contributions
usalem-14.624
 Dimension-13.782
stals-13.594
 Myster-13.584
 Twice-13.521
Token prime
Token ID6994
Feature activation+4.62e-01

Pos logit contributions
 attRot+4.104
shown+3.968
ardless+3.697
mates+3.586
 behaviors+3.253
Neg logit contributions
stals-5.238
usalem-5.162
arty-5.079
ights-5.027
PLIED-4.993
Token-
Token ID12
Feature activation+1.79e+00

Pos logit contributions
 attRot+3.094
ardless+3.051
shown+3.003
mates+2.828
warming+2.728
Neg logit contributions
usalem-2.268
arty-2.235
stals-2.229
ights-2.175
PLIED-2.136
Tokenage
Token ID496
Feature activation+7.84e-01

Pos logit contributions
shown+8.047
 attRot+7.753
ardless+7.734
mates+7.458
warming+7.190
Neg logit contributions
usalem-11.059
 Dimension-10.686
 Twice-10.529
stals-10.523
 Horizons-10.475
Token workers
Token ID3259
Feature activation+5.27e-01

Pos logit contributions
 attRot+3.998
shown+3.918
ardless+3.645
mates+3.533
warming+3.192
Neg logit contributions
usalem-5.003
stals-4.981
ights-4.821
arty-4.805
PLIED-4.720
Token with
Token ID351
Feature activation+1.65e-01

Pos logit contributions
ights+3.843
arty+3.753
stals+3.740
 Twice+3.672
PLIED+3.671
Neg logit contributions
shown-4.288
 attRot-4.146
ardless-3.984
mates-3.574
warming-3.549
Token inadequate
Token ID20577
Feature activation+5.27e-01

Pos logit contributions
 attRot+1.002
shown+0.998
ardless+0.931
mates+0.861
warming+0.823
Neg logit contributions
ights-0.943
stals-0.935
arty-0.933
usalem-0.931
PLIED-0.910
Token and
Token ID290
Feature activation+3.39e-01

Pos logit contributions
 attRot+3.023
shown+2.946
ardless+2.780
mates+2.605
warming+2.473
Neg logit contributions
usalem-3.126
stals-3.113
arty-3.082
ights-3.046
PLIED-3.040
Token risky
Token ID17564
Feature activation+6.99e-01

Pos logit contributions
shown+2.072
 attRot+2.020
ardless+1.912
mates+1.770
warming+1.721
Neg logit contributions
ights-1.897
stals-1.897
usalem-1.894
arty-1.882
 Dimension-1.815
Token account
Token ID1848
Feature activation+7.86e-01

Pos logit contributions
 attRot+4.127
shown+3.931
ardless+3.669
mates+3.553
 behaviors+3.455
Neg logit contributions
usalem-3.990
arty-3.964
stals-3.935
ights-3.910
PLIED-3.861
Token-
Token ID12
Feature activation+2.44e+00

Pos logit contributions
 attRot+4.692
shown+4.599
ardless+4.393
mates+4.103
warming+4.049
Neg logit contributions
usalem-4.317
PLIED-4.286
 Twice-4.231
stals-4.217
ights-4.212
Tokenstyle
Token ID7635
Feature activation+9.63e-01

Pos logit contributions
shown+12.568
mates+11.653
 attRot+11.510
warming+11.407
ardless+10.882
Neg logit contributions
PLIED-11.980
 Twice-11.940
usalem-11.921
ights-11.556
 Venice-11.504
Token plans
Token ID3352
Feature activation+2.07e-01

Pos logit contributions
 attRot+5.459
shown+5.168
ardless+4.835
mates+4.576
warming+4.442
Neg logit contributions
usalem-5.611
arty-5.579
PLIED-5.412
stals-5.392
 Twice-5.385
Token,
Token ID11
Feature activation+2.14e-01

Pos logit contributions
shown+1.298
 attRot+1.292
ardless+1.221
mates+1.108
warming+1.096
Neg logit contributions
stals-1.135
ights-1.128
usalem-1.124
arty-1.122
PLIED-1.108
Token we
Token ID356
Feature activation+5.51e-01

Pos logit contributions
shown+1.358
 attRot+1.336
ardless+1.264
mates+1.160
warming+1.135
Neg logit contributions
stals-1.171
ights-1.165
arty-1.164
usalem-1.163
PLIED-1.138
Token would
Token ID561
Feature activation+5.28e-01

Pos logit contributions
shown+3.522
 attRot+3.412
ardless+3.219
mates+3.106
warming+2.929
Neg logit contributions
usalem-2.865
PLIED-2.859
arty-2.845
 Dimension-2.811
 Twice-2.810
Token,"
Token ID553
Feature activation+3.12e-01

Pos logit contributions
shown+1.728
 attRot+1.706
ardless+1.637
warming+1.493
mates+1.473
Neg logit contributions
arty-1.265
ights-1.265
PLIED-1.253
usalem-1.250
stals-1.236
Token and
Token ID290
Feature activation+9.49e-01

Pos logit contributions
shown+2.051
 attRot+2.000
ardless+1.915
warming+1.760
mates+1.750
Neg logit contributions
ights-1.585
usalem-1.584
stals-1.578
arty-1.578
PLIED-1.556
Token his
Token ID465
Feature activation+8.68e-02

Pos logit contributions
shown+5.452
 attRot+5.170
ardless+4.997
mates+4.610
warming+4.542
Neg logit contributions
usalem-5.291
stals-5.179
ights-5.138
arty-5.121
PLIED-5.088
Token recent
Token ID2274
Feature activation+4.51e-01

Pos logit contributions
shown+0.534
 attRot+0.522
ardless+0.502
mates+0.464
warming+0.454
Neg logit contributions
stals-0.484
ights-0.483
usalem-0.478
arty-0.477
PLIED-0.476
Token town
Token ID3240
Feature activation+1.40e+00

Pos logit contributions
shown+2.594
 attRot+2.541
ardless+2.362
mates+2.229
warming+2.218
Neg logit contributions
PLIED-2.660
stals-2.636
ights-2.589
usalem-2.582
arty-2.559
Token-
Token ID12
Feature activation+2.44e+00

Pos logit contributions
shown+5.386
warming+4.840
 attRot+4.818
mates+4.502
ardless+4.436
Neg logit contributions
PLIED-10.133
stals-10.055
ights-9.887
 Dimension-9.841
arty-9.802
Tokenhall
Token ID18323
Feature activation+5.06e-01

Pos logit contributions
shown+9.361
warming+8.616
ardless+7.816
mates+7.649
 attRot+7.091
Neg logit contributions
PLIED-15.915
 Dimension-15.578
 Twice-15.424
ights-15.141
usalem-15.077
Token response
Token ID2882
Feature activation+5.77e-01

Pos logit contributions
shown+3.092
 attRot+2.969
ardless+2.839
warming+2.692
mates+2.643
Neg logit contributions
stals-2.759
PLIED-2.746
ights-2.702
arty-2.693
usalem-2.658
Token to
Token ID284
Feature activation+4.67e-02

Pos logit contributions
shown+3.239
 attRot+3.090
ardless+2.885
warming+2.719
mates+2.713
Neg logit contributions
stals-3.481
ights-3.441
usalem-3.437
PLIED-3.393
arty-3.369
Token a
Token ID257
Feature activation+3.08e-01

Pos logit contributions
shown+0.305
 attRot+0.295
ardless+0.281
mates+0.261
warming+0.260
Neg logit contributions
ights-0.244
PLIED-0.243
stals-0.243
arty-0.241
usalem-0.241
Token man
Token ID582
Feature activation-7.38e-02

Pos logit contributions
shown+1.914
 attRot+1.848
ardless+1.737
warming+1.647
mates+1.605
Neg logit contributions
stals-1.701
PLIED-1.684
arty-1.678
ights-1.670
usalem-1.641
Token,
Token ID11
Feature activation+4.40e-01

Pos logit contributions
shown+1.995
 attRot+1.943
ardless+1.797
mates+1.697
warming+1.649
Neg logit contributions
stals-1.835
usalem-1.826
ights-1.822
PLIED-1.796
arty-1.793
Token President
Token ID1992
Feature activation+6.65e-01

Pos logit contributions
shown+2.858
 attRot+2.790
ardless+2.625
mates+2.435
warming+2.402
Neg logit contributions
stals-2.295
ights-2.275
usalem-2.257
arty-2.247
PLIED-2.241
Token Park
Token ID3250
Feature activation+1.12e+00

Pos logit contributions
shown+3.086
 attRot+2.935
ardless+2.744
mates+2.435
warming+2.406
Neg logit contributions
PLIED-4.683
stals-4.621
usalem-4.575
ights-4.549
arty-4.500
Token Ge
Token ID2269
Feature activation+1.00e+00

Pos logit contributions
shown+6.252
 attRot+5.678
ardless+5.455
idate+5.036
warming+4.993
Neg logit contributions
usalem-6.683
stals-6.494
PLIED-6.467
ights-6.435
 Venice-6.207
Tokenun
Token ID403
Feature activation+1.05e+00

Pos logit contributions
shown+3.811
ardless+3.352
 attRot+3.258
idate+2.819
warming+2.777
Neg logit contributions
PLIED-7.846
 actionGroup-7.808
 Venice-7.557
 Dimension-7.541
bite-7.336
Token-
Token ID12
Feature activation+2.43e+00

Pos logit contributions
shown+6.246
 attRot+6.174
ardless+6.105
warming+5.491
mates+5.411
Neg logit contributions
stals-5.521
PLIED-5.466
ights-5.457
usalem-5.344
arty-5.219
Tokenh
Token ID71
Feature activation+3.34e-01

Pos logit contributions
shown+6.974
 attRot+6.187
ardless+5.414
surprisingly+5.239
warming+4.879
Neg logit contributions
 Dimension-18.822
stals-18.399
 Venice-18.215
ights-18.207
 actionGroup-17.797
Tokenye
Token ID5948
Feature activation+3.10e-01

Pos logit contributions
shown+2.260
 attRot+2.189
ardless+2.064
mates+1.905
warming+1.895
Neg logit contributions
usalem-1.689
ights-1.687
stals-1.682
PLIED-1.677
 Dimension-1.659
Token's
Token ID338
Feature activation+1.29e-01

Pos logit contributions
shown+1.763
 attRot+1.684
ardless+1.595
mates+1.482
warming+1.443
Neg logit contributions
stals-1.885
usalem-1.859
ights-1.850
arty-1.830
PLIED-1.828
Token government
Token ID1230
Feature activation+7.38e-01

Pos logit contributions
shown+0.763
 attRot+0.741
ardless+0.704
mates+0.643
warming+0.633
Neg logit contributions
stals-0.766
ights-0.762
PLIED-0.760
usalem-0.752
arty-0.749
Token announced
Token ID3414
Feature activation+7.83e-01

Pos logit contributions
shown+3.818
 attRot+3.630
ardless+3.360
mates+3.280
warming+3.109
Neg logit contributions
stals-4.679
usalem-4.630
ights-4.571
PLIED-4.502
arty-4.494
Token to
Token ID284
Feature activation+3.44e-01

Pos logit contributions
shown+3.999
 attRot+3.913
ardless+3.682
mates+3.312
warming+3.228
Neg logit contributions
PLIED-5.659
arty-5.646
usalem-5.554
 Dimension-5.550
stals-5.541
Token buy
Token ID2822
Feature activation-4.80e-02

Pos logit contributions
 attRot+2.063
shown+1.997
ardless+1.900
mates+1.769
warming+1.700
Neg logit contributions
PLIED-1.965
usalem-1.957
arty-1.935
ights-1.912
stals-1.910
Token goods
Token ID7017
Feature activation+8.72e-01

Pos logit contributions
ights+0.264
arty+0.261
usalem+0.258
stals+0.258
PLIED+0.254
Neg logit contributions
shown-0.303
 attRot-0.297
ardless-0.287
mates-0.259
warming-0.255
Token in
Token ID287
Feature activation+5.61e-01

Pos logit contributions
shown+5.294
 attRot+5.144
ardless+5.033
mates+4.623
warming+4.540
Neg logit contributions
arty-4.533
stals-4.521
PLIED-4.458
usalem-4.446
ights-4.439
Token big
Token ID1263
Feature activation+1.80e+00

Pos logit contributions
shown+3.356
 attRot+3.260
ardless+3.148
mates+2.935
warming+2.788
Neg logit contributions
ights-3.117
PLIED-3.090
usalem-3.073
arty-3.069
stals-3.068
Token-
Token ID12
Feature activation+2.42e+00

Pos logit contributions
shown+8.857
 attRot+8.697
ardless+8.662
warming+7.813
mates+7.781
Neg logit contributions
PLIED-9.677
usalem-9.360
arty-9.250
ights-9.191
zzi-9.034
Tokenbox
Token ID3524
Feature activation+6.31e-01

Pos logit contributions
shown+10.175
ardless+9.629
warming+9.133
bodied+8.991
enough+8.786
Neg logit contributions
usalem-14.428
PLIED-14.377
 Dimension-13.724
ights-13.478
 Cavern-13.426
Token stores
Token ID7000
Feature activation+8.87e-01

Pos logit contributions
 attRot+2.282
shown+2.233
ardless+2.061
mates+1.750
warming+1.734
Neg logit contributions
usalem-4.982
ights-4.947
arty-4.885
PLIED-4.869
stals-4.855
Token that
Token ID326
Feature activation+2.93e-01

Pos logit contributions
shown+4.989
 attRot+4.968
ardless+4.714
mates+4.303
warming+4.262
Neg logit contributions
stals-5.060
usalem-5.002
arty-4.957
PLIED-4.953
ights-4.948
Token can
Token ID460
Feature activation-4.30e-02

Pos logit contributions
shown+1.941
 attRot+1.904
ardless+1.796
mates+1.693
warming+1.651
Neg logit contributions
usalem-1.480
arty-1.479
PLIED-1.472
stals-1.463
ights-1.442
Token be
Token ID307
Feature activation+9.74e-03

Pos logit contributions
PLIED+0.202
arty+0.202
ights+0.200
usalem+0.200
stals+0.199
Neg logit contributions
shown-0.303
 attRot-0.298
ardless-0.287
mates-0.264
warming-0.261
Token their
Token ID511
Feature activation+3.63e-01

Pos logit contributions
shown+0.001
 attRot+0.001
ardless+0.001
mates+0.001
warming+0.001
Neg logit contributions
ights-0.001
stals-0.001
arty-0.001
PLIED-0.001
usalem-0.001
Token conversations
Token ID10275
Feature activation+7.82e-01

Pos logit contributions
shown+2.173
 attRot+2.138
ardless+2.008
mates+1.923
warming+1.889
Neg logit contributions
stals-2.042
usalem-2.037
ights-2.015
PLIED-2.007
arty-2.000
Token involve
Token ID6211
Feature activation+5.05e-01

Pos logit contributions
 attRot+4.830
shown+4.778
ardless+4.254
mates+4.236
warming+4.115
Neg logit contributions
PLIED-4.094
arty-4.014
stals-4.007
usalem-3.987
 Twice-3.963
Token apologizing
Token ID47401
Feature activation-9.25e-02

Pos logit contributions
shown+2.948
 attRot+2.929
ardless+2.681
mates+2.562
warming+2.558
Neg logit contributions
stals-2.909
ights-2.875
usalem-2.865
arty-2.851
PLIED-2.826
Token and
Token ID290
Feature activation+6.89e-01

Pos logit contributions
ights+0.529
arty+0.525
stals+0.522
PLIED+0.515
usalem+0.515
Neg logit contributions
shown-0.565
 attRot-0.552
ardless-0.523
warming-0.478
mates-0.478
Token large
Token ID1588
Feature activation+2.42e+00

Pos logit contributions
shown+4.088
 attRot+4.018
ardless+3.646
warming+3.620
mates+3.566
Neg logit contributions
stals-3.834
usalem-3.816
PLIED-3.788
arty-3.736
ights-3.729
Token amounts
Token ID6867
Feature activation+9.55e-01

Pos logit contributions
shown+11.971
 attRot+11.602
warming+11.540
ardless+11.073
 sized+10.606
Neg logit contributions
usalem-11.507
PLIED-11.468
stals-11.099
arty-10.788
 Dimension-10.661
Token of
Token ID286
Feature activation+5.02e-01

Pos logit contributions
shown+5.755
 attRot+5.631
ardless+5.151
warming+4.954
mates+4.798
Neg logit contributions
PLIED-5.228
usalem-5.222
stals-5.173
arty-5.167
ights-5.098
Token guilt
Token ID14934
Feature activation-3.55e-01

Pos logit contributions
shown+3.068
 attRot+2.988
ardless+2.760
warming+2.669
mates+2.616
Neg logit contributions
usalem-2.773
stals-2.743
ights-2.730
PLIED-2.704
arty-2.688
Token on
Token ID319
Feature activation+1.10e-01

Pos logit contributions
ights+2.064
arty+2.033
stals+2.028
PLIED+2.004
usalem+1.996
Neg logit contributions
shown-2.162
 attRot-2.107
ardless-2.004
mates-1.809
warming-1.807
Token both
Token ID1111
Feature activation-3.98e-01

Pos logit contributions
shown+0.674
 attRot+0.664
ardless+0.626
mates+0.581
warming+0.568
Neg logit contributions
stals-0.620
arty-0.618
usalem-0.613
ights-0.610
PLIED-0.609
Tokenbased
Token ID3106
Feature activation+7.27e-01

Pos logit contributions
shown+7.705
 attRot+6.784
warming+6.469
mates+6.396
ardless+6.292
Neg logit contributions
PLIED-9.034
 Twice-8.504
usalem-8.404
 Dimension-8.403
 actionGroup-8.256
Token battles
Token ID10181
Feature activation-7.76e-02

Pos logit contributions
shown+4.589
 attRot+4.520
ardless+4.138
mates+4.053
warming+3.889
Neg logit contributions
PLIED-3.889
usalem-3.626
stals-3.602
ights-3.581
arty-3.576
Token.
Token ID13
Feature activation-2.10e-02

Pos logit contributions
ights+0.498
stals+0.496
arty+0.496
PLIED+0.492
usalem+0.485
Neg logit contributions
shown-0.421
 attRot-0.409
ardless-0.387
mates-0.344
warming-0.340
Token Its
Token ID6363
Feature activation-3.29e-01

Pos logit contributions
ights+0.114
arty+0.113
usalem+0.113
PLIED+0.112
stals+0.112
Neg logit contributions
shown-0.136
 attRot-0.133
ardless-0.126
mates-0.116
warming-0.112
Token cel
Token ID18725
Feature activation+7.18e-01

Pos logit contributions
ights+1.710
stals+1.692
arty+1.687
PLIED+1.680
usalem+1.651
Neg logit contributions
shown-2.192
 attRot-2.127
ardless-2.044
mates-1.872
warming-1.843
Token-
Token ID12
Feature activation+2.42e+00

Pos logit contributions
shown+4.932
 attRot+4.816
ardless+4.659
mates+4.418
warming+4.313
Neg logit contributions
PLIED-3.273
stals-3.269
ights-3.121
arty-3.107
usalem-3.068
Tokensh
Token ID1477
Feature activation+3.99e-01

Pos logit contributions
shown+6.806
 attRot+5.198
ardless+5.012
warming+4.299
mates+4.281
Neg logit contributions
PLIED-20.120
stals-19.263
 Horizons-19.248
arty-18.997
usalem-18.994
Tokenaded
Token ID5286
Feature activation-2.85e-01

Pos logit contributions
shown+2.898
ardless+2.819
 attRot+2.790
warming+2.498
mates+2.478
Neg logit contributions
 Twice-1.777
 Dimension-1.768
stals-1.757
usalem-1.745
ights-1.722
Token visuals
Token ID27329
Feature activation+2.68e-01

Pos logit contributions
ights+1.498
stals+1.489
arty+1.480
PLIED+1.477
usalem+1.447
Neg logit contributions
shown-1.878
 attRot-1.842
ardless-1.748
mates-1.613
warming-1.572
Token look
Token ID804
Feature activation+1.91e-01

Pos logit contributions
shown+1.743
 attRot+1.731
ardless+1.584
mates+1.539
warming+1.473
Neg logit contributions
stals-1.417
PLIED-1.391
arty-1.368
usalem-1.356
 Twice-1.356
Token the
Token ID262
Feature activation-2.48e-02

Pos logit contributions
shown+1.238
 attRot+1.233
ardless+1.155
mates+1.090
warming+1.078
Neg logit contributions
stals-0.997
PLIED-0.979
ights-0.969
arty-0.957
usalem-0.944
Token students
Token ID2444
Feature activation+8.67e-01

Pos logit contributions
shown+5.668
 attRot+5.473
ardless+5.319
mates+5.173
idate+4.652
Neg logit contributions
usalem-6.161
arty-6.154
stals-6.124
ights-6.037
PLIED-5.934
Token at
Token ID379
Feature activation+8.31e-01

Pos logit contributions
shown+4.653
 attRot+4.532
ardless+4.512
mates+4.153
warming+3.948
Neg logit contributions
stals-5.209
usalem-5.181
ights-5.127
PLIED-5.119
 Dimension-5.025
Token eight
Token ID3624
Feature activation+3.38e-01

Pos logit contributions
shown+4.678
 attRot+4.538
ardless+4.518
mates+4.153
warming+3.881
Neg logit contributions
ights-4.762
usalem-4.722
stals-4.675
arty-4.627
PLIED-4.619
Token city
Token ID1748
Feature activation+5.04e-01

Pos logit contributions
shown+1.977
 attRot+1.895
ardless+1.881
mates+1.716
warming+1.644
Neg logit contributions
ights-1.967
arty-1.956
usalem-1.934
stals-1.927
PLIED-1.916
Token school
Token ID1524
Feature activation+1.08e+00

Pos logit contributions
shown+2.739
 attRot+2.660
ardless+2.623
mates+2.396
warming+2.284
Neg logit contributions
ights-3.077
PLIED-3.021
stals-3.019
 Dimension-3.001
usalem-2.997
Token-
Token ID12
Feature activation+2.37e+00

Pos logit contributions
shown+5.361
mates+5.277
 attRot+5.099
ardless+4.846
warming+4.528
Neg logit contributions
ights-6.568
usalem-6.522
PLIED-6.451
 Dimension-6.428
stals-6.401
Tokenbased
Token ID3106
Feature activation+7.76e-01

Pos logit contributions
shown+9.777
operated+9.046
mates+8.854
ardless+8.533
bred+7.870
Neg logit contributions
usalem-14.676
PLIED-14.287
 Dimension-14.209
ights-13.967
 Twice-13.914
Token clinics
Token ID21434
Feature activation+3.92e-01

Pos logit contributions
shown+4.088
 attRot+3.999
ardless+3.863
mates+3.655
warming+3.376
Neg logit contributions
ights-4.696
stals-4.678
usalem-4.641
PLIED-4.538
arty-4.538
Token.
Token ID13
Feature activation-1.75e-01

Pos logit contributions
shown+1.980
 attRot+1.908
ardless+1.845
mates+1.669
warming+1.609
Neg logit contributions
stals-2.603
ights-2.595
usalem-2.584
PLIED-2.548
arty-2.544
Token The
Token ID383
Feature activation+1.17e-01

Pos logit contributions
ights+0.951
arty+0.938
stals+0.932
usalem+0.929
PLIED+0.913
Neg logit contributions
shown-1.133
 attRot-1.102
ardless-1.058
mates-0.961
warming-0.935
Token number
Token ID1271
Feature activation+4.57e-03

Pos logit contributions
shown+0.717
 attRot+0.700
ardless+0.669
mates+0.618
warming+0.600
Neg logit contributions
ights-0.663
arty-0.662
stals-0.659
PLIED-0.650
usalem-0.645
Token.
Token ID13
Feature activation-4.33e-02

Pos logit contributions
arty+1.518
ights+1.510
stals+1.501
usalem+1.479
PLIED+1.479
Neg logit contributions
shown-1.325
 attRot-1.288
ardless-1.212
mates-1.093
warming-1.072
Token Although
Token ID4900
Feature activation-6.29e-01

Pos logit contributions
arty+0.243
usalem+0.241
ights+0.240
stals+0.236
PLIED+0.236
Neg logit contributions
shown-0.277
 attRot-0.270
ardless-0.255
mates-0.233
warming-0.224
Token MIC
Token ID35878
Feature activation-3.45e-01

Pos logit contributions
ights+3.166
arty+3.115
stals+3.106
PLIED+3.045
usalem+3.028
Neg logit contributions
shown-4.306
 attRot-4.195
ardless-4.040
mates-3.675
warming-3.649
TokenAS
Token ID1921
Feature activation-3.77e-01

Pos logit contributions
ights+1.655
arty+1.625
stals+1.622
PLIED+1.587
 Twice+1.578
Neg logit contributions
shown-2.431
 attRot-2.371
ardless-2.293
mates-2.085
warming-2.075
Token is
Token ID318
Feature activation-1.29e+00

Pos logit contributions
arty+1.931
ights+1.922
stals+1.916
usalem+1.882
PLIED+1.875
Neg logit contributions
shown-2.552
 attRot-2.469
ardless-2.361
mates-2.187
warming-2.139
Token more
Token ID517
Feature activation-3.04e+00

Pos logit contributions
ights+6.819
arty+6.615
stals+6.611
 Twice+6.522
 Dimension+6.509
Neg logit contributions
shown-8.153
 attRot-8.053
ardless-7.848
mates-6.995
warming-6.906
Token sensitive
Token ID8564
Feature activation-1.12e+00

Pos logit contributions
ights+12.069
 Dimension+11.643
 Venice+11.613
 astronomy+11.286
arty+11.254
Neg logit contributions
ardless-18.725
shown-17.990
 attRot-17.404
idate-15.642
アル-15.594
Token,
Token ID11
Feature activation-8.69e-01

Pos logit contributions
ights+5.447
arty+5.314
stals+5.304
 Dimension+5.204
 Twice+5.192
Neg logit contributions
shown-7.591
 attRot-7.294
ardless-7.190
warming-6.420
mates-6.400
Token its
Token ID663
Feature activation-1.16e+00

Pos logit contributions
ights+4.293
arty+4.220
stals+4.206
PLIED+4.112
 Dimension+4.111
Neg logit contributions
shown-5.979
 attRot-5.811
ardless-5.624
mates-5.101
warming-5.073
Token field
Token ID2214
Feature activation-7.78e-01

Pos logit contributions
ights+6.233
stals+6.062
arty+6.053
 Dimension+6.024
 Twice+5.965
Neg logit contributions
shown-7.323
 attRot-7.086
ardless-6.935
warming-6.158
mates-6.133
Token-
Token ID12
Feature activation+9.48e-01

Pos logit contributions
ights+4.103
arty+4.048
stals+4.044
PLIED+3.964
usalem+3.947
Neg logit contributions
shown-5.135
 attRot-5.000
ardless-4.809
mates-4.362
warming-4.327
Token and
Token ID290
Feature activation-2.95e-02

Pos logit contributions
ights+1.794
stals+1.786
arty+1.776
usalem+1.754
PLIED+1.753
Neg logit contributions
shown-2.387
 attRot-2.319
ardless-2.250
mates-2.050
warming-2.018
Token I
Token ID314
Feature activation-4.74e-01

Pos logit contributions
stals+0.157
ights+0.156
arty+0.155
usalem+0.155
PLIED+0.154
Neg logit contributions
shown-0.193
 attRot-0.188
ardless-0.181
mates-0.166
warming-0.162
Token learnt
Token ID26338
Feature activation-5.61e-01

Pos logit contributions
ights+2.100
arty+2.074
stals+2.068
PLIED+2.033
usalem+2.027
Neg logit contributions
shown-3.514
 attRot-3.436
ardless-3.311
mates-3.067
warming-3.026
Token a
Token ID257
Feature activation-7.97e-01

Pos logit contributions
ights+3.157
arty+3.110
stals+3.107
PLIED+3.049
usalem+3.039
Neg logit contributions
shown-3.511
 attRot-3.413
ardless-3.272
mates-2.953
warming-2.922
Token lot
Token ID1256
Feature activation-1.02e+00

Pos logit contributions
ights+3.029
arty+2.962
stals+2.942
 Twice+2.860
PLIED+2.859
Neg logit contributions
shown-6.402
 attRot-6.246
ardless-6.078
mates-5.596
warming-5.567
Token more
Token ID517
Feature activation-2.78e+00

Pos logit contributions
ights+5.084
arty+4.949
stals+4.933
 Twice+4.874
 Dimension+4.833
Neg logit contributions
shown-6.959
 attRot-6.744
ardless-6.529
warming-5.857
mates-5.855
Token than
Token ID621
Feature activation+3.00e-01

Pos logit contributions
ights+12.206
 Twice+11.851
 Dimension+11.770
 Venice+11.636
 dearly+11.613
Neg logit contributions
shown-16.102
 attRot-15.378
ardless-15.173
idate-13.785
アル-13.512
Token I
Token ID314
Feature activation-3.20e-01

Pos logit contributions
shown+1.950
 attRot+1.889
ardless+1.823
mates+1.709
warming+1.629
Neg logit contributions
arty-1.581
stals-1.573
usalem-1.572
ights-1.548
PLIED-1.542
Token had
Token ID550
Feature activation+7.43e-01

Pos logit contributions
ights+1.677
stals+1.675
arty+1.672
PLIED+1.651
usalem+1.649
Neg logit contributions
shown-2.102
 attRot-2.050
ardless-1.965
mates-1.808
warming-1.764
Token before
Token ID878
Feature activation-5.42e-01

Pos logit contributions
shown+4.023
 attRot+3.848
ardless+3.614
mates+3.390
warming+3.140
Neg logit contributions
usalem-4.743
stals-4.615
arty-4.560
PLIED-4.480
ights-4.413
Token too
Token ID1165
Feature activation-6.47e-01

Pos logit contributions
ights+3.415
arty+3.376
stals+3.369
PLIED+3.320
usalem+3.309
Neg logit contributions
shown-3.020
 attRot-2.932
ardless-2.804
mates-2.487
warming-2.457
Token not
Token ID407
Feature activation+6.18e-02

Pos logit contributions
ights+2.893
stals+2.861
arty+2.856
PLIED+2.803
usalem+2.788
Neg logit contributions
shown-3.664
 attRot-3.562
ardless-3.427
mates-3.119
warming-3.087
Tokenching
Token ID10813
Feature activation+5.05e-01

Pos logit contributions
shown+0.363
ardless+0.342
 attRot+0.342
mates+0.307
warming+0.303
Neg logit contributions
stals-0.373
ights-0.366
arty-0.362
PLIED-0.361
usalem-0.357
Token repeated
Token ID5100
Feature activation+4.38e-02

Pos logit contributions
shown+2.956
 attRot+2.861
ardless+2.657
mates+2.506
warming+2.450
Neg logit contributions
stals-2.929
PLIED-2.919
arty-2.903
ights-2.873
usalem-2.856
Token wins
Token ID7864
Feature activation-9.28e-01

Pos logit contributions
shown+0.283
 attRot+0.274
ardless+0.257
mates+0.242
warming+0.239
Neg logit contributions
PLIED-0.236
stals-0.235
usalem-0.232
ights-0.232
arty-0.231
Token in
Token ID287
Feature activation-6.07e-01

Pos logit contributions
ights+4.828
arty+4.709
stals+4.673
 Twice+4.625
 Dimension+4.585
Neg logit contributions
shown-6.127
 attRot-5.950
ardless-5.739
mates-5.163
warming-5.133
Token off
Token ID572
Feature activation-2.75e+00

Pos logit contributions
ights+3.011
arty+2.974
stals+2.973
PLIED+2.914
usalem+2.893
Neg logit contributions
shown-4.211
 attRot-4.099
ardless-3.945
mates-3.615
warming-3.570
Token-
Token ID12
Feature activation-5.03e-01

Pos logit contributions
ights+9.582
 Venice+8.589
ben+8.570
arty+8.564
bite+8.243
Neg logit contributions
shown-18.252
 attRot-17.914
ardless-17.094
アル-15.730
FY-15.721
Tokenroad
Token ID6344
Feature activation-9.36e-01

Pos logit contributions
ights+2.308
stals+2.278
arty+2.261
PLIED+2.225
 Twice+2.216
Neg logit contributions
shown-3.683
 attRot-3.579
ardless-3.453
mates-3.181
warming-3.167
Token race
Token ID3234
Feature activation-7.15e-01

Pos logit contributions
ights+5.884
arty+5.772
stals+5.744
 Dimension+5.668
 Twice+5.668
Neg logit contributions
shown-5.175
 attRot-5.004
ardless-4.807
mates-4.219
warming-4.185
Token series
Token ID2168
Feature activation+4.08e-02

Pos logit contributions
ights+4.424
arty+4.352
stals+4.347
PLIED+4.268
 Twice+4.266
Neg logit contributions
shown-4.044
 attRot-3.910
ardless-3.746
mates-3.308
warming-3.290
Token.
Token ID13
Feature activation-2.88e-01

Pos logit contributions
shown+0.248
 attRot+0.246
ardless+0.229
mates+0.213
warming+0.207
Neg logit contributions
stals-0.233
arty-0.231
PLIED-0.230
ights-0.229
usalem-0.226
TokenR
Token ID49
Feature activation-7.85e-01

Pos logit contributions
ights+4.199
arty+4.133
stals+4.111
PLIED+4.041
 Twice+4.017
Neg logit contributions
shown-4.960
 attRot-4.827
ardless-4.654
mates-4.186
warming-4.158
Tokenapt
Token ID2373
Feature activation-7.89e-02

Pos logit contributions
ights+4.964
arty+4.850
stals+4.776
PLIED+4.708
usalem+4.699
Neg logit contributions
shown-4.361
 attRot-4.235
ardless-4.031
mates-3.576
warming-3.541
Tokenor
Token ID273
Feature activation+3.98e-03

Pos logit contributions
ights+0.347
arty+0.341
stals+0.340
PLIED+0.334
 Twice+0.330
Neg logit contributions
shown-0.588
 attRot-0.574
ardless-0.556
mates-0.509
warming-0.505
Token's
Token ID338
Feature activation-4.29e-01

Pos logit contributions
shown+0.027
 attRot+0.026
ardless+0.025
mates+0.023
warming+0.023
Neg logit contributions
stals-0.020
PLIED-0.020
usalem-0.020
arty-0.020
ights-0.020
Token pioneering
Token ID36304
Feature activation-3.81e-01

Pos logit contributions
ights+2.276
stals+2.265
arty+2.239
PLIED+2.216
usalem+2.210
Neg logit contributions
shown-2.822
 attRot-2.750
ardless-2.626
mates-2.400
warming-2.365
Token off
Token ID572
Feature activation-2.70e+00

Pos logit contributions
stals+2.057
ights+2.056
arty+2.025
PLIED+2.012
usalem+2.005
Neg logit contributions
shown-2.469
 attRot-2.411
ardless-2.297
mates-2.112
warming-2.073
Token-
Token ID12
Feature activation-5.85e-01

Pos logit contributions
ights+8.547
arty+8.002
ben+7.834
bite+7.581
 Venice+7.565
Neg logit contributions
shown-18.391
 attRot-18.027
ardless-17.423
FY-15.966
idate-15.928
Tokenroad
Token ID6344
Feature activation-5.47e-01

Pos logit contributions
ights+2.628
arty+2.589
stals+2.558
PLIED+2.507
usalem+2.483
Neg logit contributions
shown-4.285
 attRot-4.189
ardless-4.058
mates-3.694
warming-3.673
Token mode
Token ID4235
Feature activation-1.30e-01

Pos logit contributions
ights+3.250
stals+3.226
arty+3.214
PLIED+3.162
usalem+3.152
Neg logit contributions
shown-3.246
 attRot-3.151
ardless-3.007
mates-2.710
warming-2.682
Token driving
Token ID5059
Feature activation-4.80e-01

Pos logit contributions
stals+0.709
ights+0.700
arty+0.695
PLIED+0.690
usalem+0.690
Neg logit contributions
shown-0.846
 attRot-0.816
ardless-0.771
mates-0.723
warming-0.707
Token technology
Token ID3037
Feature activation-4.92e-01

Pos logit contributions
ights+2.490
stals+2.472
arty+2.467
PLIED+2.423
usalem+2.411
Neg logit contributions
shown-3.225
 attRot-3.139
ardless-2.992
mates-2.756
warming-2.717
Token2
Token ID17
Feature activation-4.48e-01

Pos logit contributions
ights+5.069
arty+4.996
stals+4.935
 Twice+4.861
PLIED+4.856
Neg logit contributions
shown-5.600
 attRot-5.419
ardless-5.271
mates-4.689
warming-4.672
Token tires
Token ID17920
Feature activation-8.79e-01

Pos logit contributions
ights+2.375
stals+2.343
arty+2.336
usalem+2.301
PLIED+2.294
Neg logit contributions
shown-2.945
 attRot-2.878
ardless-2.739
mates-2.499
warming-2.468
Token specifically
Token ID5734
Feature activation+1.09e-02

Pos logit contributions
ights+5.223
arty+5.116
stals+5.096
 Twice+5.016
 Dimension+5.005
Neg logit contributions
shown-5.114
 attRot-4.932
ardless-4.798
mates-4.230
warming-4.204
Token designed
Token ID3562
Feature activation+2.20e-01

Pos logit contributions
shown+0.065
 attRot+0.063
ardless+0.059
mates+0.054
warming+0.054
Neg logit contributions
ights-0.063
usalem-0.063
stals-0.063
arty-0.063
PLIED-0.063
Token for
Token ID329
Feature activation-7.17e-01

Pos logit contributions
shown+1.291
 attRot+1.265
ardless+1.200
mates+1.077
warming+1.067
Neg logit contributions
ights-1.301
arty-1.285
stals-1.284
usalem-1.266
PLIED-1.259
Token off
Token ID572
Feature activation-2.68e+00

Pos logit contributions
ights+3.639
arty+3.579
stals+3.578
PLIED+3.501
usalem+3.481
Neg logit contributions
shown-4.875
 attRot-4.749
ardless-4.577
mates-4.156
warming-4.126
Token-
Token ID12
Feature activation-5.18e-01

Pos logit contributions
ights+8.089
arty+7.489
 Venice+7.447
ben+7.430
bite+7.257
Neg logit contributions
shown-18.978
 attRot-18.478
ardless-17.800
FY-16.350
アル-16.323
Tokenroad
Token ID6344
Feature activation-6.28e-01

Pos logit contributions
ights+2.293
stals+2.250
arty+2.244
PLIED+2.190
 Twice+2.183
Neg logit contributions
shown-3.869
 attRot-3.773
ardless-3.646
mates-3.347
warming-3.330
Token performance
Token ID2854
Feature activation-9.05e-02

Pos logit contributions
ights+3.717
arty+3.665
stals+3.657
PLIED+3.592
 Twice+3.575
Neg logit contributions
shown-3.734
 attRot-3.622
ardless-3.475
mates-3.102
warming-3.079
Token.
Token ID13
Feature activation+7.57e-02

Pos logit contributions
stals+0.596
ights+0.594
arty+0.593
PLIED+0.583
usalem+0.582
Neg logit contributions
shown-0.473
 attRot-0.462
ardless-0.438
mates-0.389
warming-0.383
Token
Token ID198
Feature activation-4.76e-01

Pos logit contributions
shown+0.455
 attRot+0.443
ardless+0.419
mates+0.377
warming+0.367
Neg logit contributions
ights-0.442
stals-0.440
arty-0.439
usalem-0.437
PLIED-0.431
Token
Token ID198
Feature activation-1.34e-01

Pos logit contributions
shown+0.632
 attRot+0.616
ardless+0.584
mates+0.528
warming+0.521
Neg logit contributions
ights-0.628
arty-0.622
stals-0.619
usalem-0.615
PLIED-0.614
Tokenp
Token ID79
Feature activation-1.48e-01

Pos logit contributions
ights+0.766
stals+0.758
 Twice+0.756
PLIED+0.756
arty+0.753
Neg logit contributions
shown-0.837
 attRot-0.805
ardless-0.754
warming-0.700
mates-0.694
Tokenuls
Token ID5753
Feature activation-4.43e-01

Pos logit contributions
ights+0.738
stals+0.736
arty+0.726
usalem+0.724
 Twice+0.722
Neg logit contributions
 attRot-1.006
shown-1.001
ardless-0.955
warming-0.863
mates-0.863
Tokenating
Token ID803
Feature activation-7.86e-02

Pos logit contributions
ights+2.160
arty+2.136
stals+2.128
PLIED+2.100
 Twice+2.084
Neg logit contributions
shown-3.088
 attRot-3.012
ardless-2.912
mates-2.650
warming-2.631
Token combustion
Token ID33607
Feature activation-1.84e-01

Pos logit contributions
ights+0.418
arty+0.414
PLIED+0.410
usalem+0.409
stals+0.409
Neg logit contributions
shown-0.514
 attRot-0.508
ardless-0.474
mates-0.440
warming-0.435
Token more
Token ID517
Feature activation-2.59e+00

Pos logit contributions
ights+0.972
arty+0.964
usalem+0.957
PLIED+0.953
stals+0.940
Neg logit contributions
shown-1.215
 attRot-1.179
ardless-1.122
mates-1.041
warming-1.033
Token realistic
Token ID12653
Feature activation-8.20e-01

Pos logit contributions
ights+11.524
arty+10.962
 Dimension+10.842
ben+10.836
 Twice+10.709
Neg logit contributions
shown-15.828
ardless-15.739
 attRot-15.247
idate-13.465
mates-13.409
Token,
Token ID11
Feature activation-7.02e-01

Pos logit contributions
ights+4.406
arty+4.330
stals+4.309
 Twice+4.234
 Dimension+4.220
Neg logit contributions
shown-5.247
 attRot-5.077
ardless-4.927
mates-4.413
warming-4.385
Token too
Token ID1165
Feature activation-6.04e-01

Pos logit contributions
ights+3.745
arty+3.688
stals+3.685
PLIED+3.609
usalem+3.596
Neg logit contributions
shown-4.604
 attRot-4.478
ardless-4.299
mates-3.891
warming-3.869
Token.
Token ID13
Feature activation-4.82e-01

Pos logit contributions
ights+3.856
arty+3.794
stals+3.781
PLIED+3.720
 Twice+3.714
Neg logit contributions
shown-3.301
 attRot-3.174
ardless-3.062
mates-2.696
warming-2.662
Token It
Token ID632
Feature activation-1.01e-01

Pos logit contributions
ights+2.569
arty+2.544
stals+2.527
usalem+2.494
PLIED+2.480
Neg logit contributions
shown-3.169
 attRot-3.092
ardless-2.949
mates-2.673
warming-2.652
Token a
Token ID257
Feature activation-2.66e-01

Pos logit contributions
shown+0.038
 attRot+0.037
ardless+0.035
mates+0.032
warming+0.032
Neg logit contributions
stals-0.033
ights-0.033
arty-0.033
usalem-0.033
PLIED-0.033
Token difference
Token ID3580
Feature activation-3.40e-01

Pos logit contributions
ights+1.446
stals+1.439
arty+1.420
PLIED+1.406
usalem+1.397
Neg logit contributions
shown-1.723
 attRot-1.679
ardless-1.605
warming-1.446
mates-1.440
Token in
Token ID287
Feature activation-4.62e-01

Pos logit contributions
ights+1.931
stals+1.904
arty+1.900
PLIED+1.868
usalem+1.860
Neg logit contributions
shown-2.131
 attRot-2.065
ardless-1.983
mates-1.776
warming-1.773
Token corners
Token ID14371
Feature activation-5.30e-01

Pos logit contributions
ights+2.393
stals+2.371
arty+2.364
PLIED+2.324
usalem+2.303
Neg logit contributions
shown-3.104
 attRot-3.023
ardless-2.916
mates-2.641
warming-2.609
Token much
Token ID881
Feature activation-4.27e-01

Pos logit contributions
ights+2.921
arty+2.880
stals+2.873
PLIED+2.819
usalem+2.805
Neg logit contributions
shown-3.368
 attRot-3.271
ardless-3.148
mates-2.833
warming-2.814
Token more
Token ID517
Feature activation-2.57e+00

Pos logit contributions
ights+2.173
stals+2.148
arty+2.142
PLIED+2.104
usalem+2.094
Neg logit contributions
shown-2.903
 attRot-2.830
ardless-2.727
mates-2.464
warming-2.456
Token clearly
Token ID4084
Feature activation-1.35e+00

Pos logit contributions
ights+11.242
 Venice+10.964
 Twice+10.905
 Dimension+10.735
ben+10.723
Neg logit contributions
shown-14.977
 attRot-14.538
ardless-14.316
mates-13.028
idate-12.968
Token on
Token ID319
Feature activation-7.56e-01

Pos logit contributions
ights+7.030
stals+6.744
arty+6.722
ben+6.691
 Twice+6.646
Neg logit contributions
shown-8.294
 attRot-8.157
ardless-7.786
mates-7.096
warming-7.000
Token the
Token ID262
Feature activation-4.68e-01

Pos logit contributions
ights+3.839
arty+3.770
stals+3.753
PLIED+3.675
 Twice+3.668
Neg logit contributions
shown-5.112
 attRot-4.979
ardless-4.803
mates-4.349
warming-4.335
Token 16
Token ID1467
Feature activation-9.74e-01

Pos logit contributions
ights+2.331
stals+2.309
arty+2.305
PLIED+2.264
usalem+2.250
Neg logit contributions
shown-3.233
 attRot-3.141
ardless-3.030
mates-2.760
warming-2.729
Token-
Token ID12
Feature activation+4.26e-01

Pos logit contributions
ights+2.947
arty+2.745
 Dimension+2.734
stals+2.719
 Twice+2.677
Neg logit contributions
shown-8.437
 attRot-8.195
ardless-7.986
warming-7.418
mates-7.395
Tokenville
Token ID4244
Feature activation-4.16e-01

Pos logit contributions
ights+4.852
arty+4.765
stals+4.726
usalem+4.630
PLIED+4.626
Neg logit contributions
shown-5.784
 attRot-5.620
ardless-5.396
mates-4.874
warming-4.837
Token's
Token ID338
Feature activation-3.54e-01

Pos logit contributions
ights+2.090
stals+2.052
arty+2.051
PLIED+2.014
 Dimension+1.997
Neg logit contributions
shown-2.847
 attRot-2.777
ardless-2.682
mates-2.428
warming-2.420
Token house
Token ID2156
Feature activation+3.84e-01

Pos logit contributions
ights+1.843
stals+1.818
arty+1.807
PLIED+1.795
usalem+1.764
Neg logit contributions
shown-2.368
 attRot-2.317
ardless-2.228
mates-2.022
warming-1.998
Token on
Token ID319
Feature activation-3.17e-01

Pos logit contributions
shown+2.267
 attRot+2.201
ardless+2.130
warming+1.995
mates+1.974
Neg logit contributions
stals-2.213
ights-2.182
arty-2.182
PLIED-2.158
 Dimension-2.150
Token the
Token ID262
Feature activation-3.58e-01

Pos logit contributions
ights+1.589
stals+1.575
arty+1.564
PLIED+1.540
usalem+1.528
Neg logit contributions
shown-2.182
 attRot-2.126
ardless-2.047
mates-1.868
warming-1.849
Token r
Token ID374
Feature activation-2.57e+00

Pos logit contributions
ights+1.792
stals+1.780
arty+1.766
PLIED+1.738
usalem+1.732
Neg logit contributions
shown-2.456
 attRot-2.407
ardless-2.309
mates-2.107
warming-2.089
Tokenue
Token ID518
Feature activation-1.94e+00

Pos logit contributions
ights+15.796
arty+15.478
ben+15.399
stals+14.477
usalem+14.424
Neg logit contributions
shown-11.175
 attRot-11.092
ardless-10.110
アル-9.435
ゼウス-9.339
Token du
Token ID7043
Feature activation-1.23e+00

Pos logit contributions
ights+9.097
ben+9.074
 Dimension+8.815
 Twice+8.789
 Venice+8.776
Neg logit contributions
shown-12.253
 attRot-11.956
ardless-11.642
mates-10.348
warming-10.323
Token Vie
Token ID47154
Feature activation-1.13e+00

Pos logit contributions
ights+6.358
arty+6.232
 Twice+6.177
 Dimension+6.155
stals+6.133
Neg logit contributions
shown-7.974
 attRot-7.687
ardless-7.554
warming-6.789
mates-6.770
Tokenux
Token ID2821
Feature activation-9.42e-01

Pos logit contributions
ights+5.515
arty+5.398
stals+5.365
 Twice+5.280
 Dimension+5.277
Neg logit contributions
shown-7.670
 attRot-7.438
ardless-7.186
mates-6.556
warming-6.479
Token-
Token ID12
Feature activation+9.20e-01

Pos logit contributions
ights+4.739
arty+4.650
stals+4.629
 Dimension+4.543
 Twice+4.538
Neg logit contributions
shown-6.363
 attRot-6.187
ardless-5.976
mates-5.434
warming-5.384
Token-
Token ID12
Feature activation+4.82e-01

Pos logit contributions
ights+4.009
arty+3.917
stals+3.891
 Twice+3.825
 Dimension+3.822
Neg logit contributions
shown-6.514
 attRot-6.400
ardless-6.174
mates-5.613
warming-5.567
Tokenway
Token ID1014
Feature activation-9.71e-01

Pos logit contributions
shown+2.706
 attRot+2.408
ardless+2.388
warming+2.198
mates+2.171
Neg logit contributions
PLIED-3.055
ights-3.019
usalem-2.984
stals-2.970
 Twice-2.970
Token tie
Token ID9839
Feature activation-9.87e-01

Pos logit contributions
ights+4.828
arty+4.744
stals+4.679
 Dimension+4.634
 Twice+4.591
Neg logit contributions
shown-6.592
 attRot-6.380
ardless-6.205
mates-5.590
warming-5.556
Token for
Token ID329
Feature activation-9.99e-01

Pos logit contributions
ights+5.205
arty+5.175
stals+5.135
 Twice+5.055
ben+5.051
Neg logit contributions
shown-6.256
 attRot-6.082
ardless-5.849
mates-5.221
warming-5.127
Token second
Token ID1218
Feature activation-9.77e-01

Pos logit contributions
ights+4.702
arty+4.608
stals+4.572
 Twice+4.515
 Dimension+4.510
Neg logit contributions
shown-6.995
 attRot-6.807
ardless-6.630
mates-5.995
warming-5.954
Token right
Token ID826
Feature activation-2.57e+00

Pos logit contributions
ights+5.279
arty+5.203
stals+5.169
 Dimension+5.118
 Twice+5.104
Neg logit contributions
shown-6.118
 attRot-5.968
ardless-5.715
mates-5.112
warming-5.052
Token now
Token ID783
Feature activation-5.33e-01

Pos logit contributions
ights+8.637
ben+8.449
zzi+8.276
arty+8.099
 Twice+7.938
Neg logit contributions
 attRot-17.731
shown-17.678
ardless-16.665
アル-15.459
ensical-15.214
Token in
Token ID287
Feature activation-2.01e-01

Pos logit contributions
ights+2.945
arty+2.903
stals+2.892
PLIED+2.831
 Twice+2.826
Neg logit contributions
shown-3.365
 attRot-3.267
ardless-3.141
mates-2.822
warming-2.801
Token the
Token ID262
Feature activation-2.22e-01

Pos logit contributions
ights+1.018
stals+1.013
arty+1.005
PLIED+1.000
usalem+0.978
Neg logit contributions
shown-1.375
 attRot-1.339
ardless-1.286
mates-1.177
warming-1.164
Token Central
Token ID5694
Feature activation-2.30e-02

Pos logit contributions
ights+1.134
stals+1.131
arty+1.119
PLIED+1.108
usalem+1.096
Neg logit contributions
shown-1.497
 attRot-1.465
ardless-1.405
mates-1.275
warming-1.268
Token.
Token ID13
Feature activation-1.22e-02

Pos logit contributions
ights+0.123
stals+0.122
arty+0.122
PLIED+0.121
usalem+0.119
Neg logit contributions
shown-0.149
 attRot-0.145
ardless-0.140
mates-0.127
warming-0.126
Token it
Token ID340
Feature activation-5.71e-01

Pos logit contributions
stals+0.807
ights+0.806
arty+0.799
usalem+0.794
PLIED+0.793
Neg logit contributions
shown-1.098
 attRot-1.067
ardless-1.027
mates-0.945
warming-0.928
Token is
Token ID318
Feature activation-6.53e-01

Pos logit contributions
ights+2.748
arty+2.707
stals+2.706
PLIED+2.656
usalem+2.639
Neg logit contributions
shown-4.038
 attRot-3.936
ardless-3.790
mates-3.473
warming-3.440
Token a
Token ID257
Feature activation-3.61e-01

Pos logit contributions
ights+3.519
arty+3.464
stals+3.458
PLIED+3.392
usalem+3.374
Neg logit contributions
shown-4.240
 attRot-4.118
ardless-3.963
mates-3.577
warming-3.553
Token little
Token ID1310
Feature activation-5.31e-01

Pos logit contributions
stals+1.899
ights+1.898
arty+1.883
PLIED+1.860
usalem+1.841
Neg logit contributions
shown-2.397
 attRot-2.322
ardless-2.232
mates-2.021
warming-2.011
Token more
Token ID517
Feature activation-2.36e+00

Pos logit contributions
ights+2.668
stals+2.651
arty+2.630
PLIED+2.596
usalem+2.564
Neg logit contributions
shown-3.664
 attRot-3.559
ardless-3.423
mates-3.128
warming-3.115
Token out
Token ID503
Feature activation-2.55e+00

Pos logit contributions
ights+11.544
 Twice+11.320
 Dimension+11.289
arty+11.276
 Venice+11.163
Neg logit contributions
shown-13.364
ardless-13.204
 attRot-13.085
idate-11.612
mates-11.549
Token there
Token ID612
Feature activation-1.34e+00

Pos logit contributions
ights+12.361
arty+12.132
 Twice+12.044
ben+11.814
stals+11.708
Neg logit contributions
 attRot-13.895
shown-13.758
ardless-13.575
idate-11.978
mates-11.666
Token than
Token ID621
Feature activation+2.83e-01

Pos logit contributions
ights+7.046
arty+6.948
stals+6.830
 Twice+6.808
 Dimension+6.771
Neg logit contributions
shown-8.373
 attRot-8.103
ardless-7.835
mates-7.025
warming-6.945
Token most
Token ID749
Feature activation+3.88e-01

Pos logit contributions
shown+1.859
 attRot+1.800
ardless+1.706
mates+1.593
warming+1.536
Neg logit contributions
stals-1.493
usalem-1.482
arty-1.471
ights-1.454
PLIED-1.453
Token.
Token ID13
Feature activation-1.95e-01

Pos logit contributions
shown+2.350
 attRot+2.316
ardless+2.168
mates+2.020
warming+1.935
Neg logit contributions
stals-2.205
usalem-2.183
arty-2.180
PLIED-2.163
ights-2.103
Token -
Token ID532
Feature activation-5.03e-01

Pos logit contributions
arty+1.000
ights+0.996
stals+0.993
usalem+0.991
PLIED+0.976
Neg logit contributions
shown-1.316
 attRot-1.286
ardless-1.227
mates-1.127
warming-1.102
Token class
Token ID1398
Feature activation+1.23e+00

Pos logit contributions
shown+2.792
 attRot+2.740
ardless+2.570
warming+2.353
mates+2.312
Neg logit contributions
stals-2.434
ights-2.416
arty-2.370
usalem-2.350
PLIED-2.332
Token of
Token ID286
Feature activation+7.84e-01

Pos logit contributions
shown+6.945
 attRot+6.794
ardless+6.498
mates+6.218
warming+5.977
Neg logit contributions
stals-6.800
ights-6.795
PLIED-6.554
arty-6.499
usalem-6.463
Token people
Token ID661
Feature activation+6.32e-01

Pos logit contributions
shown+4.659
 attRot+4.584
ardless+4.447
mates+4.019
warming+3.849
Neg logit contributions
arty-4.275
stals-4.251
ights-4.233
PLIED-4.190
usalem-4.179
Token who
Token ID508
Feature activation+8.88e-02

Pos logit contributions
shown+3.708
 attRot+3.659
ardless+3.490
warming+3.112
mates+3.100
Neg logit contributions
stals-3.687
usalem-3.615
PLIED-3.603
ights-3.585
arty-3.552
Token benefit
Token ID4414
Feature activation-1.43e+00

Pos logit contributions
shown+0.601
 attRot+0.594
ardless+0.569
mates+0.521
warming+0.514
Neg logit contributions
stals-0.440
arty-0.439
PLIED-0.438
ights-0.434
 Dimension-0.433
Token more
Token ID517
Feature activation-2.54e+00

Pos logit contributions
ights+5.872
arty+5.681
stals+5.646
 Twice+5.569
PLIED+5.522
Neg logit contributions
shown-10.553
 attRot-10.323
ardless-9.954
mates-9.069
warming-9.052
Token from
Token ID422
Feature activation-6.66e-01

Pos logit contributions
ights+9.134
 dearly+8.832
arty+8.772
stals+8.600
bite+8.350
Neg logit contributions
shown-17.209
 attRot-16.745
ardless-15.574
ensical-14.692
idate-14.596
Token immigrant
Token ID15773
Feature activation-1.06e-01

Pos logit contributions
ights+3.634
arty+3.578
stals+3.569
PLIED+3.496
 Twice+3.484
Neg logit contributions
shown-4.253
 attRot-4.131
ardless-3.976
mates-3.581
warming-3.555
Token and
Token ID290
Feature activation-7.21e-02

Pos logit contributions
ights+0.636
arty+0.634
stals+0.633
PLIED+0.632
usalem+0.623
Neg logit contributions
shown-0.609
 attRot-0.603
ardless-0.576
mates-0.523
warming-0.516
Token undocumented
Token ID21829
Feature activation+6.99e-02

Pos logit contributions
ights+0.420
stals+0.415
PLIED+0.410
usalem+0.407
arty+0.407
Neg logit contributions
shown-0.437
 attRot-0.423
ardless-0.410
warming-0.367
mates-0.364
Token workers
Token ID3259
Feature activation-3.03e-01

Pos logit contributions
shown+0.380
 attRot+0.375
ardless+0.352
mates+0.324
warming+0.314
Neg logit contributions
arty-0.441
PLIED-0.439
stals-0.439
ights-0.436
usalem-0.431
Tokento
Token ID1462
Feature activation-1.17e+00

Pos logit contributions
ights+0.045
stals+0.045
PLIED+0.045
arty+0.045
 Dimension+0.045
Neg logit contributions
shown-0.027
 attRot-0.026
ardless-0.024
mates-0.021
warming-0.021
Token-
Token ID12
Feature activation-1.96e-01

Pos logit contributions
ights+3.525
arty+3.266
stals+3.143
ben+3.106
bite+3.028
Neg logit contributions
shown-10.181
 attRot-9.910
ardless-9.721
ensical-9.012
mates-8.875
Tokenuse
Token ID1904
Feature activation-3.64e-01

Pos logit contributions
ights+1.147
stals+1.142
 Twice+1.140
PLIED+1.139
arty+1.136
Neg logit contributions
shown-1.191
 attRot-1.128
ardless-1.091
mates-1.005
warming-0.988
Token version
Token ID2196
Feature activation-2.21e-02

Pos logit contributions
ights+1.919
stals+1.909
arty+1.886
PLIED+1.871
usalem+1.852
Neg logit contributions
shown-2.412
 attRot-2.353
ardless-2.246
mates-2.050
warming-2.023
Token of
Token ID286
Feature activation-3.35e-01

Pos logit contributions
ights+0.117
arty+0.117
stals+0.117
PLIED+0.117
usalem+0.115
Neg logit contributions
shown-0.145
 attRot-0.142
ardless-0.133
mates-0.122
warming-0.121
Token off
Token ID572
Feature activation-2.54e+00

Pos logit contributions
ights+1.727
stals+1.718
arty+1.688
PLIED+1.674
usalem+1.672
Neg logit contributions
shown-2.268
 attRot-2.221
ardless-2.114
mates-1.933
warming-1.900
Token-
Token ID12
Feature activation-4.09e-01

Pos logit contributions
ights+8.201
ben+7.840
arty+7.560
 Venice+7.410
bite+7.389
Neg logit contributions
shown-17.487
 attRot-17.245
ardless-16.879
idate-15.178
FY-15.010
Tokenroad
Token ID6344
Feature activation-4.65e-01

Pos logit contributions
ights+2.111
stals+2.084
arty+2.066
PLIED+2.044
usalem+2.038
Neg logit contributions
shown-2.778
 attRot-2.682
ardless-2.572
mates-2.362
warming-2.342
Token mode
Token ID4235
Feature activation-3.03e-01

Pos logit contributions
ights+2.478
stals+2.468
arty+2.445
PLIED+2.416
usalem+2.400
Neg logit contributions
shown-3.053
 attRot-2.979
ardless-2.835
mates-2.601
warming-2.569
Token driver
Token ID4639
Feature activation+4.50e-02

Pos logit contributions
ights+1.648
arty+1.644
stals+1.640
PLIED+1.617
 Twice+1.599
Neg logit contributions
shown-1.956
 attRot-1.893
ardless-1.803
mates-1.658
warming-1.632
Token-
Token ID12
Feature activation+1.07e+00

Pos logit contributions
shown+0.280
 attRot+0.272
ardless+0.255
mates+0.240
warming+0.236
Neg logit contributions
arty-0.252
stals-0.251
ights-0.249
PLIED-0.246
usalem-0.246
Token the
Token ID262
Feature activation-1.77e-01

Pos logit contributions
ights+1.372
stals+1.370
arty+1.345
usalem+1.336
PLIED+1.333
Neg logit contributions
shown-1.669
 attRot-1.624
ardless-1.545
mates-1.401
warming-1.381
Token most
Token ID749
Feature activation-7.80e-01

Pos logit contributions
stals+0.928
ights+0.924
arty+0.908
usalem+0.905
PLIED+0.900
Neg logit contributions
shown-1.178
 attRot-1.150
ardless-1.095
mates-1.004
warming-0.978
Token powerful
Token ID3665
Feature activation+5.26e-01

Pos logit contributions
ights+3.770
stals+3.719
arty+3.710
PLIED+3.630
usalem+3.612
Neg logit contributions
shown-5.507
 attRot-5.368
ardless-5.163
mates-4.710
warming-4.680
Token and
Token ID290
Feature activation-5.00e-01

Pos logit contributions
shown+3.212
 attRot+3.199
ardless+2.915
mates+2.816
warming+2.634
Neg logit contributions
stals-2.936
usalem-2.904
PLIED-2.898
ights-2.873
arty-2.858
Token fastest
Token ID14162
Feature activation+3.11e-01

Pos logit contributions
stals+2.425
ights+2.406
arty+2.384
PLIED+2.370
usalem+2.344
Neg logit contributions
shown-3.543
 attRot-3.451
ardless-3.283
mates-3.037
warming-3.010
Token off
Token ID572
Feature activation-2.51e+00

Pos logit contributions
shown+1.864
 attRot+1.855
ardless+1.705
mates+1.602
warming+1.531
Neg logit contributions
stals-1.809
PLIED-1.778
ights-1.775
arty-1.760
usalem-1.760
Token-
Token ID12
Feature activation-5.25e-01

Pos logit contributions
ights+8.161
arty+7.576
ben+7.407
 Venice+7.356
 Dimension+7.182
Neg logit contributions
shown-17.396
 attRot-16.897
ardless-16.345
FY-15.099
idate-14.997
Tokenroad
Token ID6344
Feature activation-5.51e-03

Pos logit contributions
ights+2.337
arty+2.296
stals+2.284
PLIED+2.231
 Twice+2.216
Neg logit contributions
shown-3.882
 attRot-3.789
ardless-3.668
mates-3.354
warming-3.334
Token pickups
Token ID46462
Feature activation+5.64e-01

Pos logit contributions
stals+0.035
ights+0.035
usalem+0.035
arty+0.035
PLIED+0.035
Neg logit contributions
shown-0.030
 attRot-0.030
ardless-0.028
mates-0.025
warming-0.025
Token.
Token ID13
Feature activation-1.77e-01

Pos logit contributions
shown+3.242
 attRot+3.168
ardless+2.947
mates+2.728
warming+2.610
Neg logit contributions
stals-3.396
arty-3.332
ights-3.300
usalem-3.274
PLIED-3.271
Token
Token ID198
Feature activation-7.86e-01

Pos logit contributions
ights+0.998
arty+0.991
stals+0.990
usalem+0.990
PLIED+0.966
Neg logit contributions
shown-1.118
 attRot-1.090
ardless-1.030
mates-0.934
warming-0.904
Token infections
Token ID16079
Feature activation-6.00e-01

Pos logit contributions
ights+2.490
stals+2.468
arty+2.466
PLIED+2.407
usalem+2.407
Neg logit contributions
shown-3.036
 attRot-2.934
ardless-2.837
mates-2.568
warming-2.538
Token older
Token ID4697
Feature activation-8.02e-01

Pos logit contributions
ights+3.191
arty+3.164
stals+3.158
PLIED+3.101
usalem+3.088
Neg logit contributions
shown-3.924
 attRot-3.812
ardless-3.667
mates-3.324
warming-3.294
Token than
Token ID621
Feature activation-4.54e-01

Pos logit contributions
ights+2.182
stals+2.119
arty+2.065
 Twice+2.013
PLIED+2.009
Neg logit contributions
shown-7.246
 attRot-7.156
ardless-6.909
mates-6.429
warming-6.383
Token that
Token ID326
Feature activation+1.44e-01

Pos logit contributions
ights+2.180
stals+2.170
arty+2.166
PLIED+2.109
usalem+2.104
Neg logit contributions
shown-3.236
 attRot-3.124
ardless-3.040
mates-2.774
warming-2.748
Token are
Token ID389
Feature activation-1.33e+00

Pos logit contributions
shown+0.965
 attRot+0.914
ardless+0.875
mates+0.814
warming+0.798
Neg logit contributions
arty-0.752
stals-0.744
PLIED-0.738
usalem-0.738
ights-0.724
Token less
Token ID1342
Feature activation-2.51e+00

Pos logit contributions
ights+6.713
stals+6.505
arty+6.501
 Twice+6.402
PLIED+6.360
Neg logit contributions
shown-8.525
 attRot-8.490
ardless-8.264
mates-7.399
warming-7.319
Token accurate
Token ID7187
Feature activation-1.36e+00

Pos logit contributions
ights+10.532
 Venice+10.289
 Twice+10.218
arty+10.121
ben+9.964
Neg logit contributions
ardless-15.517
 attRot-15.482
shown-15.317
mates-13.620
idate-13.605
Token and
Token ID290
Feature activation-9.54e-01

Pos logit contributions
ights+7.185
arty+7.078
stals+6.983
 Twice+6.931
ben+6.813
Neg logit contributions
shown-8.386
 attRot-8.074
ardless-7.922
warming-6.993
mates-6.916
Token can
Token ID460
Feature activation-8.96e-01

Pos logit contributions
ights+4.976
arty+4.869
stals+4.850
 Twice+4.766
 Dimension+4.747
Neg logit contributions
shown-6.261
 attRot-6.095
ardless-5.926
warming-5.308
mates-5.305
Token take
Token ID1011
Feature activation-1.29e+00

Pos logit contributions
ights+4.754
stals+4.635
arty+4.629
 Dimension+4.505
 Twice+4.501
Neg logit contributions
shown-5.833
 attRot-5.661
ardless-5.493
mates-4.938
warming-4.925
Token weeks
Token ID2745
Feature activation-5.75e-01

Pos logit contributions
ights+6.138
 Twice+6.015
 Dimension+5.960
arty+5.918
stals+5.864
Neg logit contributions
shown-8.726
 attRot-8.536
ardless-8.257
warming-7.375
idate-7.355
Token monitor
Token ID5671
Feature activation+2.88e-01

Pos logit contributions
ights+0.125
stals+0.124
arty+0.124
usalem+0.121
PLIED+0.121
Neg logit contributions
shown-0.157
 attRot-0.151
ardless-0.145
mates-0.132
warming-0.132
Token cabinet
Token ID13447
Feature activation+1.35e+00

Pos logit contributions
shown+1.842
 attRot+1.773
ardless+1.713
mates+1.587
warming+1.552
Neg logit contributions
stals-1.558
ights-1.558
arty-1.544
PLIED-1.527
usalem-1.525
Token is
Token ID318
Feature activation-2.21e-01

Pos logit contributions
shown+7.706
 attRot+7.309
ardless+7.143
mates+6.964
warming+6.605
Neg logit contributions
arty-7.108
ights-7.056
PLIED-7.048
usalem-7.023
stals-6.959
Token a
Token ID257
Feature activation-4.66e-01

Pos logit contributions
ights+1.191
arty+1.177
stals+1.175
usalem+1.171
PLIED+1.167
Neg logit contributions
shown-1.454
 attRot-1.400
ardless-1.340
mates-1.220
warming-1.213
Token lot
Token ID1256
Feature activation-5.43e-01

Pos logit contributions
ights+2.479
stals+2.463
arty+2.460
PLIED+2.426
usalem+2.406
Neg logit contributions
shown-3.071
 attRot-2.982
ardless-2.854
mates-2.587
warming-2.567
Token more
Token ID517
Feature activation-2.48e+00

Pos logit contributions
ights+2.437
stals+2.413
arty+2.411
PLIED+2.365
usalem+2.365
Neg logit contributions
shown-4.012
 attRot-3.941
ardless-3.771
mates-3.474
warming-3.463
Token difficult
Token ID2408
Feature activation-1.62e+00

Pos logit contributions
ights+10.681
arty+10.415
 Dimension+10.337
 Venice+10.150
ben+10.137
Neg logit contributions
shown-15.695
ardless-15.535
 attRot-15.028
mates-13.552
idate-13.544
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
ights+8.266
arty+8.096
 Twice+8.025
stals+8.006
ben+7.960
Neg logit contributions
shown-9.906
 attRot-9.433
ardless-9.279
warming-8.251
idate-8.218
Token could
Token ID714
Feature activation-4.19e-01

Pos logit contributions
ights+5.182
arty+5.086
stals+5.044
 Dimension+4.966
 Twice+4.960
Neg logit contributions
shown-7.436
 attRot-7.233
ardless-7.047
mates-6.341
warming-6.301
Token cause
Token ID2728
Feature activation-5.43e-01

Pos logit contributions
ights+2.390
arty+2.368
stals+2.352
usalem+2.332
PLIED+2.331
Neg logit contributions
shown-2.602
 attRot-2.544
ardless-2.415
mates-2.187
warming-2.160
Token safety
Token ID3747
Feature activation+4.82e-01

Pos logit contributions
ights+2.849
arty+2.803
stals+2.801
PLIED+2.742
usalem+2.731
Neg logit contributions
shown-3.610
 attRot-3.507
ardless-3.374
mates-3.062
warming-3.039
Token aluminum
Token ID15955
Feature activation+7.75e-02

Pos logit contributions
ights+1.303
stals+1.291
arty+1.279
PLIED+1.265
usalem+1.253
Neg logit contributions
shown-1.104
 attRot-1.068
ardless-1.005
mates-0.896
warming-0.872
Token alloy
Token ID36186
Feature activation-6.28e-01

Pos logit contributions
shown+0.519
 attRot+0.505
ardless+0.481
mates+0.442
warming+0.429
Neg logit contributions
stals-0.407
ights-0.402
PLIED-0.393
arty-0.393
usalem-0.389
Token that
Token ID326
Feature activation-4.17e-01

Pos logit contributions
ights+3.320
arty+3.267
stals+3.261
PLIED+3.198
usalem+3.180
Neg logit contributions
shown-4.138
 attRot-4.026
ardless-3.874
mates-3.504
warming-3.477
Token helps
Token ID5419
Feature activation-7.14e-01

Pos logit contributions
ights+2.081
arty+2.065
stals+2.059
PLIED+2.046
usalem+2.025
Neg logit contributions
shown-2.860
 attRot-2.805
ardless-2.681
mates-2.476
warming-2.434
Token save
Token ID3613
Feature activation-1.54e+00

Pos logit contributions
ights+3.950
arty+3.889
stals+3.874
PLIED+3.801
 Twice+3.789
Neg logit contributions
shown-4.509
 attRot-4.371
ardless-4.215
mates-3.782
warming-3.765
Token more
Token ID517
Feature activation-2.47e+00

Pos logit contributions
ights+7.531
 Dimension+7.329
 Twice+7.266
arty+7.207
stals+7.083
Neg logit contributions
shown-10.098
 attRot-9.810
ardless-9.443
idate-8.467
ensical-8.446
Token than
Token ID621
Feature activation-9.84e-01

Pos logit contributions
 Dimension+5.228
ights+5.173
 Twice+4.834
 dearly+4.660
bite+4.568
Neg logit contributions
shown-20.587
 attRot-19.835
ardless-19.673
idate-18.450
ensical-18.252
Token 500
Token ID5323
Feature activation-4.05e-03

Pos logit contributions
ights+4.206
 Twice+4.185
arty+4.123
stals+4.112
 Dimension+4.078
Neg logit contributions
shown-7.188
 attRot-7.044
ardless-6.805
mates-6.176
warming-6.156
Token pounds
Token ID8059
Feature activation-4.39e-01

Pos logit contributions
ights+0.024
arty+0.024
stals+0.023
PLIED+0.023
usalem+0.023
Neg logit contributions
shown-0.024
 attRot-0.024
ardless-0.023
mates-0.020
warming-0.020
Token of
Token ID286
Feature activation-5.74e-01

Pos logit contributions
ights+2.067
arty+2.025
stals+2.018
PLIED+1.975
 Twice+1.967
Neg logit contributions
shown-3.135
 attRot-3.056
ardless-2.954
mates-2.693
warming-2.674
Token weight
Token ID3463
Feature activation-4.94e-01

Pos logit contributions
ights+3.284
arty+3.238
stals+3.231
PLIED+3.171
 Twice+3.155
Neg logit contributions
shown-3.529
 attRot-3.426
ardless-3.291
mates-2.950
warming-2.930
Token a
Token ID257
Feature activation-1.74e-01

Pos logit contributions
stals+1.130
ights+1.125
arty+1.109
PLIED+1.106
usalem+1.090
Neg logit contributions
shown-1.346
 attRot-1.303
ardless-1.267
mates-1.145
warming-1.136
Token poet
Token ID21810
Feature activation-9.02e-01

Pos logit contributions
stals+0.907
ights+0.893
PLIED+0.886
arty+0.886
usalem+0.865
Neg logit contributions
shown-1.165
 attRot-1.137
ardless-1.092
warming-0.987
mates-0.984
Token of
Token ID286
Feature activation-7.51e-01

Pos logit contributions
ights+5.002
arty+4.901
stals+4.813
 Twice+4.777
ben+4.723
Neg logit contributions
shown-5.654
 attRot-5.471
ardless-5.236
mates-4.681
warming-4.654
Token sens
Token ID3054
Feature activation-6.80e-01

Pos logit contributions
ights+3.987
arty+3.919
stals+3.901
PLIED+3.822
 Twice+3.814
Neg logit contributions
shown-4.915
 attRot-4.777
ardless-4.606
mates-4.154
warming-4.130
Tokenibility
Token ID2247
Feature activation+5.33e-02

Pos logit contributions
ights+4.095
arty+4.036
stals+4.026
PLIED+3.955
usalem+3.939
Neg logit contributions
shown-3.967
 attRot-3.844
ardless-3.685
mates-3.280
warming-3.255
Token more
Token ID517
Feature activation-2.47e+00

Pos logit contributions
shown+0.336
 attRot+0.328
ardless+0.317
mates+0.288
warming+0.286
Neg logit contributions
stals-0.292
ights-0.289
arty-0.287
PLIED-0.285
usalem-0.284
Token than
Token ID621
Feature activation-3.14e-01

Pos logit contributions
ights+11.749
arty+11.493
 Venice+11.262
ben+11.067
bite+11.058
Neg logit contributions
shown-14.445
 attRot-13.617
ardless-13.296
idate-12.394
mates-11.857
Token sense
Token ID2565
Feature activation-3.86e-01

Pos logit contributions
stals+1.762
ights+1.759
arty+1.745
PLIED+1.736
usalem+1.699
Neg logit contributions
shown-1.950
 attRot-1.902
ardless-1.831
mates-1.662
warming-1.641
Token,
Token ID11
Feature activation-6.12e-01

Pos logit contributions
ights+2.307
stals+2.290
arty+2.283
PLIED+2.254
usalem+2.239
Neg logit contributions
shown-2.260
 attRot-2.191
ardless-2.100
mates-1.886
warming-1.870
Token which
Token ID543
Feature activation+4.76e-01

Pos logit contributions
ights+3.164
stals+3.115
arty+3.112
PLIED+3.053
usalem+3.029
Neg logit contributions
shown-4.104
 attRot-3.998
ardless-3.856
mates-3.497
warming-3.476
Token means
Token ID1724
Feature activation-9.35e-01

Pos logit contributions
shown+3.052
 attRot+3.010
ardless+2.819
mates+2.694
warming+2.621
Neg logit contributions
stals-2.460
PLIED-2.445
arty-2.428
usalem-2.386
ights-2.340
Token moderate
Token ID10768
Feature activation+8.15e-01

Pos logit contributions
shown+0.381
 attRot+0.375
ardless+0.353
mates+0.325
warming+0.321
Neg logit contributions
stals-0.349
PLIED-0.343
ights-0.342
arty-0.340
usalem-0.338
Token success
Token ID1943
Feature activation+2.84e-01

Pos logit contributions
 attRot+4.113
shown+4.070
ardless+3.621
warming+3.503
mates+3.450
Neg logit contributions
stals-5.302
PLIED-5.213
usalem-5.122
arty-5.086
ights-5.003
Token.
Token ID13
Feature activation-1.63e-01

Pos logit contributions
shown+1.564
 attRot+1.535
ardless+1.432
mates+1.309
warming+1.298
Neg logit contributions
stals-1.775
ights-1.765
PLIED-1.758
usalem-1.754
arty-1.740
Token His
Token ID2399
Feature activation-6.61e-01

Pos logit contributions
ights+0.847
usalem+0.844
arty+0.842
stals+0.836
PLIED+0.826
Neg logit contributions
shown-1.094
 attRot-1.057
ardless-1.007
mates-0.931
warming-0.905
Token least
Token ID1551
Feature activation-5.43e-01

Pos logit contributions
ights+3.570
arty+3.516
stals+3.508
PLIED+3.442
usalem+3.422
Neg logit contributions
shown-4.274
 attRot-4.157
ardless-3.999
mates-3.611
warming-3.584
Token anti
Token ID3098
Feature activation-2.46e+00

Pos logit contributions
ights+2.907
stals+2.898
arty+2.881
PLIED+2.836
usalem+2.827
Neg logit contributions
shown-3.578
 attRot-3.464
ardless-3.300
mates-3.029
warming-3.011
Token-
Token ID12
Feature activation-5.59e-01

Pos logit contributions
arty+11.147
PLIED+10.469
ights+10.137
bite+9.996
ben+9.926
Neg logit contributions
shown-14.901
 attRot-14.362
ardless-13.554
mates-12.251
FY-12.218
TokenSemitic
Token ID28753
Feature activation-4.21e-01

Pos logit contributions
ights+3.352
stals+3.312
arty+3.287
PLIED+3.252
usalem+3.234
Neg logit contributions
shown-3.326
 attRot-3.197
ardless-3.063
mates-2.758
warming-2.738
Token book
Token ID1492
Feature activation+5.40e-01

Pos logit contributions
ights+2.288
stals+2.281
arty+2.265
PLIED+2.245
usalem+2.215
Neg logit contributions
shown-2.703
 attRot-2.647
ardless-2.506
mates-2.308
warming-2.277
Token,
Token ID11
Feature activation+2.05e-01

Pos logit contributions
shown+3.386
 attRot+3.246
ardless+3.024
mates+2.960
warming+2.848
Neg logit contributions
PLIED-2.982
stals-2.892
usalem-2.883
arty-2.866
ights-2.838
Token Peach
Token ID33230
Feature activation+6.58e-01

Pos logit contributions
shown+1.409
 attRot+1.345
ardless+1.291
mates+1.208
warming+1.170
Neg logit contributions
PLIED-1.026
stals-1.024
ights-1.017
arty-1.017
usalem-1.002
Token the
Token ID262
Feature activation+4.81e-01

Pos logit contributions
shown+1.006
 attRot+0.963
ardless+0.926
mates+0.844
warming+0.826
Neg logit contributions
stals-0.805
ights-0.783
PLIED-0.777
usalem-0.776
arty-0.775
Token ultimate
Token ID8713
Feature activation-3.65e-01

Pos logit contributions
shown+3.229
 attRot+3.149
ardless+2.967
mates+2.751
warming+2.693
Neg logit contributions
stals-2.489
PLIED-2.347
usalem-2.337
ights-2.305
arty-2.265
Token high
Token ID1029
Feature activation+2.75e-01

Pos logit contributions
stals+2.057
ights+2.044
arty+2.016
PLIED+2.003
usalem+1.995
Neg logit contributions
shown-2.290
 attRot-2.228
ardless-2.120
mates-1.927
warming-1.904
Token-
Token ID12
Feature activation+8.06e-01

Pos logit contributions
shown+1.999
 attRot+1.949
ardless+1.857
warming+1.745
mates+1.723
Neg logit contributions
stals-1.275
ights-1.224
arty-1.219
PLIED-1.204
 Twice-1.180
Tokenperformance
Token ID26585
Feature activation+2.49e-01

Pos logit contributions
shown+3.908
 attRot+3.506
ardless+3.265
warming+3.087
bodied+3.005
Neg logit contributions
stals-5.630
PLIED-5.538
 Twice-5.443
usalem-5.393
 Dimension-5.342
Token off
Token ID572
Feature activation-2.46e+00

Pos logit contributions
shown+1.428
 attRot+1.414
ardless+1.289
mates+1.202
warming+1.181
Neg logit contributions
stals-1.540
PLIED-1.489
ights-1.479
usalem-1.467
arty-1.462
Token-
Token ID12
Feature activation-8.87e-01

Pos logit contributions
ights+6.247
arty+6.028
ben+5.908
bite+5.691
 Venice+5.681
Neg logit contributions
shown-18.761
 attRot-18.505
ardless-17.864
mates-16.597
idate-16.543
Tokenroad
Token ID6344
Feature activation-6.65e-02

Pos logit contributions
ights+3.701
arty+3.652
stals+3.581
PLIED+3.491
ben+3.468
Neg logit contributions
shown-6.742
 attRot-6.609
ardless-6.424
mates-5.865
warming-5.820
Token pickup
Token ID19422
Feature activation+6.78e-01

Pos logit contributions
stals+0.405
ights+0.396
arty+0.392
PLIED+0.392
usalem+0.390
Neg logit contributions
shown-0.389
 attRot-0.380
ardless-0.356
mates-0.327
warming-0.323
Token in
Token ID287
Feature activation+5.03e-01

Pos logit contributions
shown+3.766
 attRot+3.705
ardless+3.374
mates+3.195
warming+3.098
Neg logit contributions
stals-4.194
arty-4.103
usalem-4.074
PLIED-4.012
ights-4.011
Token the
Token ID262
Feature activation+6.12e-02

Pos logit contributions
shown+3.152
 attRot+3.063
ardless+2.879
mates+2.712
warming+2.614
Neg logit contributions
stals-2.794
PLIED-2.680
ights-2.672
usalem-2.667
ben-2.641
Token the
Token ID262
Feature activation-4.09e-01

Pos logit contributions
stals+2.536
arty+2.534
ights+2.513
PLIED+2.471
usalem+2.437
Neg logit contributions
shown-2.853
 attRot-2.751
ardless-2.628
mates-2.419
warming-2.375
Token Desert
Token ID18692
Feature activation+5.85e-02

Pos logit contributions
ights+1.759
stals+1.752
arty+1.741
PLIED+1.706
usalem+1.702
Neg logit contributions
shown-3.106
 attRot-3.028
ardless-2.896
mates-2.686
warming-2.667
Token,
Token ID11
Feature activation-1.05e+00

Pos logit contributions
shown+0.376
 attRot+0.368
ardless+0.346
mates+0.319
warming+0.313
Neg logit contributions
arty-0.317
stals-0.316
ights-0.314
usalem-0.309
PLIED-0.307
Token Lucas
Token ID15257
Feature activation-4.68e-01

Pos logit contributions
ights+5.069
arty+4.964
stals+4.927
 Twice+4.903
 Dimension+4.887
Neg logit contributions
shown-7.225
 attRot-7.041
ardless-6.832
warming-6.166
mates-6.164
Token Oil
Token ID11474
Feature activation-7.38e-01

Pos logit contributions
stals+1.863
ights+1.856
arty+1.831
PLIED+1.782
usalem+1.773
Neg logit contributions
shown-3.696
 attRot-3.622
ardless-3.495
mates-3.219
warming-3.208
Token Off
Token ID3242
Feature activation-2.46e+00

Pos logit contributions
ights+3.623
arty+3.566
stals+3.559
PLIED+3.483
usalem+3.470
Neg logit contributions
shown-5.141
 attRot-5.009
ardless-4.830
mates-4.401
warming-4.371
Token Road
Token ID5567
Feature activation-8.07e-01

Pos logit contributions
ights+10.166
 Twice+9.717
 Dimension+9.645
arty+9.636
ben+9.521
Neg logit contributions
 attRot-16.064
shown-15.918
ardless-15.469
FY-14.040
アル-13.992
Token Racing
Token ID14954
Feature activation-1.33e+00

Pos logit contributions
ights+3.976
arty+3.899
stals+3.889
PLIED+3.803
 Twice+3.800
Neg logit contributions
shown-5.580
 attRot-5.439
ardless-5.255
mates-4.761
warming-4.739
Token Series
Token ID7171
Feature activation-4.62e-01

Pos logit contributions
ights+6.319
arty+6.143
stals+6.118
 Twice+6.112
 Dimension+6.109
Neg logit contributions
shown-9.117
 attRot-8.861
ardless-8.679
warming-7.774
idate-7.712
Token and
Token ID290
Feature activation-3.59e-01

Pos logit contributions
ights+2.337
stals+2.327
arty+2.319
PLIED+2.283
usalem+2.269
Neg logit contributions
shown-3.138
 attRot-3.072
ardless-2.931
mates-2.697
warming-2.661
Token the
Token ID262
Feature activation-3.55e-01

Pos logit contributions
ights+1.814
stals+1.808
arty+1.797
PLIED+1.768
usalem+1.764
Neg logit contributions
shown-2.458
 attRot-2.406
ardless-2.295
mates-2.115
warming-2.069
Token unlikely
Token ID7485
Feature activation+2.29e-01

Pos logit contributions
ights+3.639
stals+3.588
arty+3.588
PLIED+3.522
usalem+3.503
Neg logit contributions
shown-4.148
 attRot-4.034
ardless-3.872
mates-3.491
warming-3.465
Token to
Token ID284
Feature activation+1.82e-01

Pos logit contributions
shown+1.140
 attRot+1.110
ardless+1.045
mates+0.915
warming+0.905
Neg logit contributions
ights-1.562
stals-1.542
arty-1.541
PLIED-1.522
usalem-1.516
Token have
Token ID423
Feature activation+5.82e-01

Pos logit contributions
 attRot+1.058
shown+1.022
ardless+0.960
mates+0.900
warming+0.864
Neg logit contributions
stals-1.087
PLIED-1.087
usalem-1.076
arty-1.074
ights-1.064
Token impact
Token ID2928
Feature activation+4.92e-01

Pos logit contributions
shown+3.366
 attRot+3.275
ardless+3.054
mates+2.823
warming+2.808
Neg logit contributions
stals-3.393
ights-3.332
PLIED-3.328
burgh-3.288
usalem-3.277
Token on
Token ID319
Feature activation+2.91e-01

Pos logit contributions
shown+2.952
 attRot+2.864
ardless+2.804
mates+2.488
warming+2.486
Neg logit contributions
stals-2.741
ights-2.729
PLIED-2.700
arty-2.697
 Dimension-2.691
Token large
Token ID1588
Feature activation+2.03e+00

Pos logit contributions
shown+1.782
 attRot+1.751
ardless+1.681
mates+1.524
warming+1.507
Neg logit contributions
stals-1.628
PLIED-1.601
usalem-1.600
arty-1.590
ights-1.586
Token projects
Token ID4493
Feature activation+5.44e-01

Pos logit contributions
shown+10.238
 attRot+10.106
ardless+9.604
warming+9.250
 sized+8.882
Neg logit contributions
stals-10.304
usalem-10.141
PLIED-9.968
zzi-9.607
arty-9.573
Token that
Token ID326
Feature activation-1.26e-01

Pos logit contributions
shown+3.045
 attRot+3.016
ardless+2.792
mates+2.540
warming+2.533
Neg logit contributions
stals-3.337
usalem-3.261
ights-3.243
PLIED-3.225
arty-3.167
Token are
Token ID389
Feature activation-3.00e-01

Pos logit contributions
stals+0.640
PLIED+0.639
arty+0.633
usalem+0.629
ights+0.629
Neg logit contributions
shown-0.841
 attRot-0.833
ardless-0.790
mates-0.742
warming-0.720
Token ultimately
Token ID6165
Feature activation-4.14e-01

Pos logit contributions
stals+1.748
ights+1.737
arty+1.724
usalem+1.715
PLIED+1.708
Neg logit contributions
shown-1.828
 attRot-1.766
ardless-1.676
mates-1.523
warming-1.517
Token decided
Token ID3066
Feature activation+1.09e-01

Pos logit contributions
ights+2.381
stals+2.369
arty+2.352
usalem+2.322
PLIED+2.320
Neg logit contributions
shown-2.539
 attRot-2.450
ardless-2.344
mates-2.121
warming-2.102
Token officials
Token ID2828
Feature activation+5.58e-01

Pos logit contributions
ights+1.595
stals+1.574
arty+1.574
usalem+1.540
PLIED+1.536
Neg logit contributions
shown-1.965
 attRot-1.907
ardless-1.824
mates-1.671
warming-1.654
Token from
Token ID422
Feature activation-2.19e-02

Pos logit contributions
shown+3.394
 attRot+3.233
ardless+3.062
mates+2.953
warming+2.857
Neg logit contributions
PLIED-3.052
stals-2.999
usalem-2.971
ights-2.955
arty-2.897
Token eight
Token ID3624
Feature activation-3.29e-01

Pos logit contributions
stals+0.114
ights+0.113
arty+0.113
PLIED+0.113
usalem+0.111
Neg logit contributions
shown-0.145
 attRot-0.139
ardless-0.133
mates-0.125
warming-0.121
Token countries
Token ID2678
Feature activation+9.90e-02

Pos logit contributions
stals+1.874
ights+1.871
arty+1.855
PLIED+1.848
usalem+1.833
Neg logit contributions
shown-2.032
 attRot-1.955
ardless-1.873
mates-1.718
warming-1.694
Token with
Token ID351
Feature activation+3.35e-01

Pos logit contributions
shown+0.639
 attRot+0.616
ardless+0.588
mates+0.559
warming+0.545
Neg logit contributions
stals-0.527
PLIED-0.523
ights-0.522
usalem-0.515
arty-0.511
Token large
Token ID1588
Feature activation+1.98e+00

Pos logit contributions
shown+2.058
 attRot+1.917
ardless+1.850
mates+1.722
warming+1.719
Neg logit contributions
ights-1.891
stals-1.887
PLIED-1.865
arty-1.862
usalem-1.847
Token flu
Token ID6562
Feature activation+3.31e-01

Pos logit contributions
shown+9.420
 attRot+8.646
ardless+8.454
warming+8.348
 sized+7.946
Neg logit contributions
PLIED-11.306
stals-11.024
usalem-10.567
arty-10.445
ights-10.285
Token outbreaks
Token ID35198
Feature activation+9.19e-02

Pos logit contributions
shown+1.631
 attRot+1.558
ardless+1.464
warming+1.340
mates+1.322
Neg logit contributions
PLIED-2.245
stals-2.237
usalem-2.217
 Dimension-2.198
ights-2.177
Token on
Token ID319
Feature activation-6.44e-02

Pos logit contributions
shown+0.569
 attRot+0.547
ardless+0.519
warming+0.479
mates+0.479
Neg logit contributions
stals-0.516
PLIED-0.515
ights-0.512
usalem-0.510
arty-0.508
Token Wednesday
Token ID3583
Feature activation-7.11e-01

Pos logit contributions
arty+0.293
stals+0.292
ights+0.290
PLIED+0.287
usalem+0.282
Neg logit contributions
shown-0.467
 attRot-0.458
ardless-0.435
mates-0.410
warming-0.407
Token in
Token ID287
Feature activation+4.88e-01

Pos logit contributions
ights+3.978
arty+3.925
stals+3.898
 Twice+3.822
PLIED+3.822
Neg logit contributions
shown-4.429
 attRot-4.299
ardless-4.151
mates-3.715
warming-3.689
Token was
Token ID373
Feature activation+3.28e-01

Pos logit contributions
shown+4.810
 attRot+4.589
ardless+4.398
mates+4.166
warming+4.078
Neg logit contributions
stals-4.050
PLIED-3.989
usalem-3.986
ights-3.949
arty-3.899
Token unprecedented
Token ID13029
Feature activation+2.88e-01

Pos logit contributions
shown+2.094
 attRot+2.023
ardless+1.904
warming+1.781
mates+1.763
Neg logit contributions
stals-1.771
ights-1.753
arty-1.743
PLIED-1.723
usalem-1.720
Token in
Token ID287
Feature activation+5.79e-01

Pos logit contributions
shown+1.668
 attRot+1.625
ardless+1.558
warming+1.396
mates+1.384
Neg logit contributions
ights-1.723
stals-1.722
arty-1.722
usalem-1.719
PLIED-1.688
Token a
Token ID257
Feature activation+5.14e-01

Pos logit contributions
shown+3.471
 attRot+3.321
ardless+3.172
mates+2.998
warming+2.915
Neg logit contributions
stals-3.258
PLIED-3.216
usalem-3.191
ights-3.183
arty-3.128
Token major
Token ID1688
Feature activation+1.27e+00

Pos logit contributions
shown+3.242
 attRot+3.131
ardless+2.997
warming+2.801
mates+2.737
Neg logit contributions
stals-2.773
ights-2.718
usalem-2.676
arty-2.651
PLIED-2.648
Token-
Token ID12
Feature activation+2.32e+00

Pos logit contributions
shown+6.997
 attRot+6.598
warming+6.361
ardless+6.345
FY+5.902
Neg logit contributions
stals-7.006
PLIED-6.761
ights-6.660
 actionGroup-6.506
usalem-6.502
Tokenparty
Token ID10608
Feature activation+3.32e-01

Pos logit contributions
shown+7.411
warming+6.909
ardless+6.354
enough+6.101
idate+5.800
Neg logit contributions
PLIED-16.141
usalem-16.118
 Dimension-15.788
 Myster-15.731
 Cavern-15.701
Token presidential
Token ID4787
Feature activation+5.30e-01

Pos logit contributions
shown+2.081
 attRot+2.058
ardless+1.949
warming+1.813
mates+1.799
Neg logit contributions
stals-1.834
ights-1.785
usalem-1.749
PLIED-1.722
arty-1.691
Token candidate
Token ID4540
Feature activation+4.90e-01

Pos logit contributions
shown+3.118
 attRot+3.081
ardless+2.929
warming+2.699
idate+2.695
Neg logit contributions
stals-3.052
ights-2.993
usalem-2.953
PLIED-2.920
 Dimension-2.855
Token.
Token ID13
Feature activation+3.88e-01

Pos logit contributions
shown+2.786
 attRot+2.743
ardless+2.553
warming+2.327
mates+2.287
Neg logit contributions
ights-2.972
stals-2.951
PLIED-2.941
usalem-2.905
arty-2.879
Token So
Token ID1406
Feature activation-4.52e-01

Pos logit contributions
shown+2.281
 attRot+2.216
ardless+2.126
mates+1.915
warming+1.863
Neg logit contributions
usalem-2.335
arty-2.285
ights-2.284
stals-2.254
PLIED-2.232
Token someone
Token ID2130
Feature activation-1.11e-01

Pos logit contributions
shown+0.690
 attRot+0.685
ardless+0.649
mates+0.601
warming+0.576
Neg logit contributions
ights-0.623
stals-0.618
usalem-0.614
arty-0.614
PLIED-0.611
Token's
Token ID338
Feature activation-1.43e-01

Pos logit contributions
stals+0.600
ights+0.600
arty+0.592
usalem+0.586
PLIED+0.586
Neg logit contributions
shown-0.714
 attRot-0.705
ardless-0.673
mates-0.618
warming-0.613
Token 200
Token ID939
Feature activation-1.98e-02

Pos logit contributions
ights+0.829
stals+0.820
arty+0.817
usalem+0.805
PLIED+0.802
Neg logit contributions
shown-0.871
 attRot-0.857
ardless-0.818
mates-0.745
warming-0.734
Token-
Token ID12
Feature activation+1.59e+00

Pos logit contributions
arty+0.087
stals+0.086
ights+0.086
PLIED+0.084
usalem+0.083
Neg logit contributions
shown-0.147
 attRot-0.144
ardless-0.139
mates-0.130
warming-0.127
Tokenyear
Token ID1941
Feature activation+3.90e-01

Pos logit contributions
shown+4.661
 attRot+4.222
ardless+4.029
warming+3.512
mates+3.505
Neg logit contributions
stals-12.686
 Twice-12.608
arty-12.487
usalem-12.402
PLIED-12.397
Token-
Token ID12
Feature activation+1.63e+00

Pos logit contributions
shown+2.926
 attRot+2.903
ardless+2.797
mates+2.576
warming+2.574
Neg logit contributions
ights-1.644
arty-1.633
stals-1.620
PLIED-1.592
 Twice-1.577
Tokenold
Token ID727
Feature activation+5.33e-02

Pos logit contributions
shown+5.978
 attRot+5.292
mates+5.145
warming+4.971
ardless+4.913
Neg logit contributions
PLIED-12.434
 Twice-12.041
 Dimension-12.034
stals-11.579
 Myster-11.560
Token house
Token ID2156
Feature activation+7.42e-01

Pos logit contributions
 attRot+0.314
shown+0.313
ardless+0.296
mates+0.271
warming+0.264
Neg logit contributions
ights-0.313
stals-0.312
arty-0.310
PLIED-0.308
usalem-0.304
Token,"
Token ID553
Feature activation-7.61e-02

Pos logit contributions
 attRot+4.597
shown+4.524
ardless+4.396
mates+4.202
warming+4.132
Neg logit contributions
arty-3.845
 Dimension-3.815
stals-3.797
usalem-3.775
PLIED-3.722
Token Adams
Token ID12620
Feature activation-2.49e-01

Pos logit contributions
ights+0.418
stals+0.414
arty+0.407
usalem+0.404
PLIED+0.401
Neg logit contributions
shown-0.491
 attRot-0.470
ardless-0.449
mates-0.413
warming-0.406
Token said
Token ID531
Feature activation-4.80e-01

Pos logit contributions
ights+1.158
stals+1.139
arty+1.137
PLIED+1.115
usalem+1.111
Neg logit contributions
shown-1.791
 attRot-1.747
ardless-1.681
mates-1.545
warming-1.533
Token a
Token ID257
Feature activation+8.58e-01

Pos logit contributions
shown+4.447
 attRot+4.261
ardless+4.030
warming+3.730
mates+3.704
Neg logit contributions
stals-4.146
PLIED-4.122
arty-4.076
usalem-4.070
ights-4.019
Token one
Token ID530
Feature activation+1.06e-01

Pos logit contributions
shown+5.184
 attRot+5.081
ardless+4.599
warming+4.329
mates+4.298
Neg logit contributions
PLIED-4.766
stals-4.755
arty-4.685
ights-4.627
usalem-4.559
Token-
Token ID12
Feature activation+1.42e+00

Pos logit contributions
shown+0.778
 attRot+0.761
ardless+0.728
warming+0.681
mates+0.669
Neg logit contributions
ights-0.477
stals-0.477
arty-0.472
usalem-0.465
PLIED-0.462
Tokenpercent
Token ID25067
Feature activation-1.16e-01

Pos logit contributions
shown+7.375
 attRot+6.670
warming+6.242
ardless+6.150
mates+5.845
Neg logit contributions
PLIED-8.591
usalem-8.455
stals-8.417
ights-8.327
 actionGroup-8.203
Tokenage
Token ID496
Feature activation-2.62e-02

Pos logit contributions
arty+0.777
ights+0.775
stals+0.774
PLIED+0.768
usalem+0.763
Neg logit contributions
shown-0.601
 attRot-0.581
ardless-0.554
warming-0.488
mates-0.485
Token-
Token ID12
Feature activation+1.94e+00

Pos logit contributions
ights+0.101
stals+0.099
arty+0.099
PLIED+0.097
usalem+0.096
Neg logit contributions
shown-0.210
 attRot-0.206
ardless-0.199
mates-0.184
warming-0.183
Tokenpoint
Token ID4122
Feature activation+9.48e-01

Pos logit contributions
shown+8.632
 attRot+7.099
warming+6.818
ardless+6.373
mates+6.072
Neg logit contributions
usalem-12.958
 actionGroup-12.719
arty-12.648
PLIED-12.608
stals-12.484
Token increase
Token ID2620
Feature activation+3.27e-01

Pos logit contributions
shown+4.032
 attRot+3.942
ardless+3.421
warming+3.220
mates+3.092
Neg logit contributions
PLIED-6.889
arty-6.864
stals-6.831
usalem-6.697
ights-6.674
Token from
Token ID422
Feature activation+9.24e-01

Pos logit contributions
shown+2.035
 attRot+1.993
ardless+1.879
warming+1.713
mates+1.698
Neg logit contributions
ights-1.825
stals-1.824
PLIED-1.807
arty-1.803
usalem-1.785
Token the
Token ID262
Feature activation+7.52e-01

Pos logit contributions
shown+5.499
 attRot+5.302
ardless+4.910
mates+4.851
warming+4.439
Neg logit contributions
stals-5.114
PLIED-5.090
arty-5.089
usalem-5.086
ights-4.905
Token prior
Token ID3161
Feature activation+6.17e-01

Pos logit contributions
shown+4.657
 attRot+4.489
ardless+4.285
mates+4.089
warming+3.873
Neg logit contributions
stals-4.107
usalem-4.080
arty-4.059
PLIED-4.039
ights-3.940
Token collaborate
Token ID30081
Feature activation-1.40e-01

Pos logit contributions
shown+1.429
 attRot+1.418
ardless+1.337
mates+1.243
idate+1.188
Neg logit contributions
PLIED-1.376
stals-1.345
ights-1.339
usalem-1.327
arty-1.327
Token more
Token ID517
Feature activation-1.84e+00

Pos logit contributions
ights+0.750
stals+0.747
arty+0.740
PLIED+0.737
usalem+0.726
Neg logit contributions
shown-0.905
 attRot-0.888
ardless-0.849
mates-0.780
warming-0.765
Token in
Token ID287
Feature activation-5.47e-01

Pos logit contributions
ights+8.950
arty+8.830
ben+8.746
 Dimension+8.696
 Twice+8.682
Neg logit contributions
shown-11.331
 attRot-10.840
ardless-10.466
warming-9.433
ensical-9.360
Token the
Token ID262
Feature activation-2.58e-01

Pos logit contributions
ights+2.779
stals+2.732
arty+2.731
PLIED+2.676
usalem+2.658
Neg logit contributions
shown-3.714
 attRot-3.616
ardless-3.483
mates-3.166
warming-3.143
Token coming
Token ID2406
Feature activation-3.87e-01

Pos logit contributions
ights+1.475
stals+1.470
arty+1.442
usalem+1.431
PLIED+1.430
Neg logit contributions
shown-1.617
 attRot-1.565
ardless-1.502
mates-1.348
warming-1.340
Token years
Token ID812
Feature activation+1.94e-01

Pos logit contributions
ights+1.819
stals+1.791
arty+1.790
PLIED+1.754
usalem+1.742
Neg logit contributions
shown-2.783
 attRot-2.719
ardless-2.625
mates-2.398
warming-2.381
Token.
Token ID13
Feature activation-5.89e-02

Pos logit contributions
shown+1.109
 attRot+1.096
ardless+1.044
mates+0.934
warming+0.921
Neg logit contributions
PLIED-1.178
ights-1.178
stals-1.174
arty-1.163
usalem-1.152
Token Here
Token ID3423
Feature activation+2.27e-01

Pos logit contributions
ights+0.338
usalem+0.338
stals+0.337
arty+0.333
PLIED+0.332
Neg logit contributions
shown-0.363
 attRot-0.352
ardless-0.337
mates-0.306
warming-0.298
Token
Token ID447
Feature activation-1.47e-01

Pos logit contributions
shown+1.382
 attRot+1.335
ardless+1.272
mates+1.166
warming+1.155
Neg logit contributions
ights-1.296
PLIED-1.292
stals-1.278
usalem-1.259
arty-1.247
Token
Token ID247
Feature activation-3.83e-02

Pos logit contributions
ights+1.089
arty+1.077
stals+1.075
PLIED+1.062
usalem+1.058
Neg logit contributions
shown-0.656
 attRot-0.630
ardless-0.594
mates-0.507
warming-0.500
Tokens
Token ID82
Feature activation+3.97e-01

Pos logit contributions
ights+0.222
PLIED+0.220
stals+0.219
arty+0.218
usalem+0.217
Neg logit contributions
shown-0.234
 attRot-0.222
ardless-0.216
mates-0.194
warming-0.193
Token,
Token ID11
Feature activation+2.64e-01

Pos logit contributions
ights+1.886
arty+1.871
stals+1.864
PLIED+1.839
usalem+1.834
Neg logit contributions
shown-1.699
 attRot-1.653
ardless-1.577
mates-1.392
warming-1.385
Token to
Token ID284
Feature activation+6.39e-01

Pos logit contributions
shown+1.723
 attRot+1.657
ardless+1.596
mates+1.433
warming+1.431
Neg logit contributions
stals-1.407
ights-1.385
usalem-1.380
arty-1.379
PLIED-1.378
Token 0
Token ID657
Feature activation-8.22e-02

Pos logit contributions
shown+3.905
 attRot+3.765
ardless+3.651
mates+3.348
warming+3.169
Neg logit contributions
usalem-3.539
arty-3.532
stals-3.526
PLIED-3.484
ights-3.445
Token.
Token ID13
Feature activation+3.19e-01

Pos logit contributions
ights+0.569
arty+0.568
stals+0.563
usalem+0.557
PLIED+0.555
Neg logit contributions
shown-0.406
 attRot-0.397
ardless-0.376
mates-0.324
warming-0.321
Token05
Token ID2713
Feature activation+1.44e-01

Pos logit contributions
shown+1.408
 attRot+1.371
ardless+1.263
mates+1.068
FY+1.054
Neg logit contributions
arty-2.394
ights-2.370
usalem-2.365
stals-2.355
 Dimension-2.254
Token%,
Token ID7441
Feature activation+1.69e-01

Pos logit contributions
shown+0.831
 attRot+0.821
ardless+0.773
warming+0.686
mates+0.684
Neg logit contributions
arty-0.880
ights-0.877
usalem-0.865
stals-0.863
PLIED-0.852
Token which
Token ID543
Feature activation+1.11e+00

Pos logit contributions
shown+1.093
 attRot+1.057
ardless+1.019
mates+0.907
warming+0.905
Neg logit contributions
stals-0.913
arty-0.912
usalem-0.908
ights-0.905
PLIED-0.889
Token is
Token ID318
Feature activation+5.60e-01

Pos logit contributions
shown+6.352
 attRot+6.327
ardless+5.833
mates+5.491
warming+5.314
Neg logit contributions
usalem-6.079
arty-6.052
PLIED-6.049
stals-6.007
ights-5.757
Token already
Token ID1541
Feature activation+2.40e-01

Pos logit contributions
shown+3.501
 attRot+3.324
ardless+3.173
warming+2.866
mates+2.855
Neg logit contributions
usalem-3.107
arty-3.082
stals-3.078
ights-3.045
PLIED-3.016
Token the
Token ID262
Feature activation+2.28e-01

Pos logit contributions
shown+1.492
 attRot+1.410
ardless+1.360
mates+1.232
warming+1.229
Neg logit contributions
arty-1.366
stals-1.362
ights-1.356
usalem-1.347
PLIED-1.341
Token legal
Token ID2742
Feature activation-3.95e-01

Pos logit contributions
shown+1.461
 attRot+1.407
ardless+1.342
mates+1.227
warming+1.219
Neg logit contributions
arty-1.254
ights-1.245
stals-1.240
usalem-1.233
PLIED-1.215
Token (
Token ID357
Feature activation-2.70e-01

Pos logit contributions
ights+4.067
arty+3.943
stals+3.881
 Dimension+3.877
 Twice+3.834
Neg logit contributions
shown-6.627
 attRot-6.484
ardless-6.242
mates-5.672
warming-5.662
Tokeneat
Token ID4098
Feature activation-7.66e-02

Pos logit contributions
stals+1.570
ights+1.569
PLIED+1.561
arty+1.553
 Twice+1.544
Neg logit contributions
shown-1.645
 attRot-1.558
ardless-1.508
mates-1.382
warming-1.371
Token to
Token ID284
Feature activation-4.44e-01

Pos logit contributions
arty+0.411
ights+0.407
PLIED+0.407
usalem+0.404
stals+0.403
Neg logit contributions
shown-0.481
 attRot-0.479
ardless-0.465
mates-0.431
warming-0.416
Token survive
Token ID7866
Feature activation-1.09e+00

Pos logit contributions
arty+2.362
ights+2.357
PLIED+2.354
stals+2.348
usalem+2.300
Neg logit contributions
shown-2.870
 attRot-2.821
ardless-2.685
mates-2.481
warming-2.438
Token/
Token ID14
Feature activation-4.07e-02

Pos logit contributions
ights+5.122
stals+4.995
arty+4.948
 Twice+4.883
 Dimension+4.852
Neg logit contributions
shown-7.696
 attRot-7.458
ardless-7.190
warming-6.521
mates-6.501
Tokendrop
Token ID14781
Feature activation+4.83e-01

Pos logit contributions
PLIED+0.248
ights+0.246
stals+0.245
 Twice+0.242
usalem+0.242
Neg logit contributions
shown-0.237
 attRot-0.227
ardless-0.217
mates-0.201
warming-0.197
Token items
Token ID3709
Feature activation+6.02e-01

Pos logit contributions
 attRot+2.919
shown+2.901
ardless+2.761
mates+2.585
warming+2.497
Neg logit contributions
arty-2.615
PLIED-2.565
usalem-2.544
stals-2.541
ights-2.529
Token on
Token ID319
Feature activation-1.16e-01

Pos logit contributions
shown+3.695
 attRot+3.584
ardless+3.468
mates+3.189
warming+3.116
Neg logit contributions
arty-3.220
stals-3.204
PLIED-3.200
usalem-3.167
ights-3.113
Token death
Token ID1918
Feature activation-8.04e-01

Pos logit contributions
stals+0.597
PLIED+0.595
arty+0.594
ights+0.586
usalem+0.577
Neg logit contributions
shown-0.775
 attRot-0.760
ardless-0.715
mates-0.688
warming-0.660
Token)
Token ID8
Feature activation-6.90e-01

Pos logit contributions
ights+3.852
stals+3.763
arty+3.756
 Dimension+3.651
 Twice+3.646
Neg logit contributions
shown-5.662
 attRot-5.494
ardless-5.327
mates-4.810
warming-4.796
Token and
Token ID290
Feature activation-2.05e-01

Pos logit contributions
ights+2.509
arty+2.419
stals+2.408
 Dimension+2.364
 Twice+2.353
Neg logit contributions
shown-5.650
 attRot-5.534
ardless-5.362
mates-4.939
warming-4.917
Token end
Token ID886
Feature activation+2.05e-01

Pos logit contributions
shown+2.203
 attRot+2.129
ardless+2.019
warming+1.845
mates+1.814
Neg logit contributions
stals-1.859
PLIED-1.852
ights-1.845
arty-1.824
usalem-1.803
Token of
Token ID286
Feature activation-5.81e-02

Pos logit contributions
shown+1.267
 attRot+1.230
ardless+1.174
mates+1.057
warming+1.054
Neg logit contributions
ights-1.145
stals-1.141
arty-1.138
PLIED-1.137
usalem-1.123
Token hang
Token ID8181
Feature activation-6.58e-01

Pos logit contributions
ights+0.308
stals+0.307
arty+0.305
PLIED+0.302
usalem+0.298
Neg logit contributions
shown-0.385
 attRot-0.373
ardless-0.354
mates-0.322
warming-0.321
Token-
Token ID12
Feature activation+1.18e+00

Pos logit contributions
ights+3.273
arty+3.220
stals+3.213
PLIED+3.146
 Twice+3.127
Neg logit contributions
shown-4.533
 attRot-4.419
ardless-4.260
mates-3.875
warming-3.849
Token
Token ID198
Feature activation+2.86e-01

Pos logit contributions
shown+5.719
 attRot+5.436
ardless+5.036
mates+4.755
warming+4.649
Neg logit contributions
 Dimension-7.566
PLIED-7.565
 Twice-7.464
usalem-7.270
 Horizons-7.267
Token
Token ID198
Feature activation+7.71e-02

Pos logit contributions
shown+1.646
 attRot+1.613
ardless+1.515
mates+1.370
warming+1.344
Neg logit contributions
ights-1.723
arty-1.716
usalem-1.711
PLIED-1.704
stals-1.704
Tokengl
Token ID4743
Feature activation+3.19e-01

Pos logit contributions
shown+0.450
 attRot+0.435
ardless+0.418
mates+0.376
warming+0.373
Neg logit contributions
ights-0.461
 Dimension-0.459
arty-0.458
 Twice-0.456
PLIED-0.455
Tokenider
Token ID1304
Feature activation-3.68e-01

Pos logit contributions
shown+2.615
 attRot+2.582
ardless+2.540
idate+2.355
mates+2.336
Neg logit contributions
PLIED-1.148
 Twice-1.082
 Dimension-1.076
stals-1.069
arty-1.027
Token performance
Token ID2854
Feature activation+2.31e-01

Pos logit contributions
ights+2.016
arty+1.997
stals+1.996
PLIED+1.964
usalem+1.950
Neg logit contributions
shown-2.360
 attRot-2.301
ardless-2.193
mates-1.989
warming-1.973
Token
Token ID784
Feature activation-2.39e-01

Pos logit contributions
shown+1.297
 attRot+1.278
ardless+1.179
warming+1.059
mates+1.058
Neg logit contributions
arty-1.440
stals-1.430
PLIED-1.427
ights-1.427
usalem-1.401
Token possibly
Token ID5457
Feature activation-5.53e-02

Pos logit contributions
ights+1.314
stals+1.313
arty+1.309
usalem+1.295
PLIED+1.285
Neg logit contributions
shown-1.538
 attRot-1.484
ardless-1.418
mates-1.278
warming-1.273
Token 7
Token ID767
Feature activation+1.35e+00

Pos logit contributions
ights+2.213
arty+2.177
stals+2.172
PLIED+2.130
 Twice+2.122
Neg logit contributions
shown-2.661
 attRot-2.587
ardless-2.492
mates-2.247
warming-2.234
Token},
Token ID5512
Feature activation+5.88e-01

Pos logit contributions
 attRot+4.703
shown+4.560
ardless+4.079
mates+3.567
warming+3.429
Neg logit contributions
arty-10.273
ights-10.193
usalem-9.969
PLIED-9.958
bite-9.760
Token {
Token ID1391
Feature activation+5.36e-01

Pos logit contributions
 attRot+2.979
shown+2.964
ardless+2.674
mates+2.416
warming+2.325
Neg logit contributions
usalem-3.946
ights-3.921
arty-3.882
PLIED-3.848
stals-3.847
Token1
Token ID16
Feature activation+6.09e-01

Pos logit contributions
shown+2.204
 attRot+2.138
ardless+1.870
mates+1.638
warming+1.603
Neg logit contributions
ights-4.134
arty-4.125
usalem-4.092
stals-4.078
 Twice-4.062
Token,
Token ID11
Feature activation-1.03e-01

Pos logit contributions
shown+3.555
 attRot+3.548
ardless+3.244
mates+2.988
warming+2.898
Neg logit contributions
ights-3.600
usalem-3.598
arty-3.567
PLIED-3.530
stals-3.507
Token 6
Token ID718
Feature activation+8.61e-01

Pos logit contributions
ights+0.614
arty+0.610
stals+0.605
PLIED+0.595
usalem+0.591
Neg logit contributions
shown-0.608
 attRot-0.590
ardless-0.564
mates-0.504
warming-0.498
Token}
Token ID92
Feature activation+1.15e-01

Pos logit contributions
 attRot+3.297
shown+3.260
ardless+2.817
mates+2.454
warming+2.368
Neg logit contributions
arty-6.719
ights-6.675
usalem-6.652
PLIED-6.533
stals-6.494
Token <
Token ID1279
Feature activation+1.31e-01

Pos logit contributions
shown+0.601
 attRot+0.583
ardless+0.552
mates+0.482
warming+0.482
Neg logit contributions
ights-0.768
stals-0.760
arty-0.757
usalem-0.744
PLIED-0.740
Token->
Token ID3784
Feature activation-8.10e-02

Pos logit contributions
shown+0.776
 attRot+0.754
ardless+0.718
mates+0.647
warming+0.638
Neg logit contributions
ights-0.775
arty-0.765
stals-0.758
usalem-0.755
PLIED-0.751
Token {
Token ID1391
Feature activation+1.59e-01

Pos logit contributions
ights+0.504
arty+0.497
stals+0.494
PLIED+0.489
usalem+0.488
Neg logit contributions
shown-0.458
 attRot-0.448
ardless-0.424
mates-0.378
warming-0.373
Token2
Token ID17
Feature activation+3.64e-01

Pos logit contributions
shown+0.659
 attRot+0.634
ardless+0.575
mates+0.496
warming+0.490
Neg logit contributions
ights-1.223
arty-1.212
stals-1.211
PLIED-1.192
usalem-1.190
Token­
Token ID3907
Feature activation-4.58e-01

Pos logit contributions
ights+6.960
arty+6.850
stals+6.734
ben+6.726
 Twice+6.613
Neg logit contributions
shown-6.765
 attRot-6.538
ardless-6.241
mates-5.550
warming-5.528
Tokenpub
Token ID12984
Feature activation+4.87e-02

Pos logit contributions
ights+3.214
arty+3.178
stals+3.172
PLIED+3.128
 Twice+3.117
Neg logit contributions
shown-2.216
 attRot-2.132
ardless-2.028
mates-1.759
warming-1.738
Token­
Token ID3907
Feature activation-1.26e-01

Pos logit contributions
shown+0.236
 attRot+0.232
ardless+0.217
mates+0.195
warming+0.187
Neg logit contributions
ights-0.336
stals-0.334
PLIED-0.334
 Twice-0.332
 Dimension-0.330
Tokenlic
Token ID677
Feature activation+1.52e-01

Pos logit contributions
ights+0.790
 Twice+0.783
arty+0.781
stals+0.781
PLIED+0.776
Neg logit contributions
shown-0.707
 attRot-0.680
ardless-0.652
mates-0.580
warming-0.574
Token­
Token ID3907
Feature activation-8.47e-02

Pos logit contributions
shown+0.768
 attRot+0.759
ardless+0.706
mates+0.634
warming+0.617
Neg logit contributions
stals-1.017
ights-1.004
PLIED-0.998
 Twice-0.993
arty-0.986
Tokenans
Token ID504
Feature activation-1.39e-01

Pos logit contributions
ights+0.356
 Twice+0.356
stals+0.352
 Dimension+0.350
PLIED+0.350
Neg logit contributions
shown-0.650
 attRot-0.628
ardless-0.615
mates-0.565
warming-0.558
Token want
Token ID765
Feature activation-5.08e-01

Pos logit contributions
stals+0.723
ights+0.721
arty+0.713
PLIED+0.711
usalem+0.700
Neg logit contributions
shown-0.925
 attRot-0.915
ardless-0.867
mates-0.799
warming-0.780
Token to
Token ID284
Feature activation-1.60e-01

Pos logit contributions
ights+3.219
arty+3.177
stals+3.169
PLIED+3.116
 Twice+3.105
Neg logit contributions
shown-2.800
 attRot-2.707
ardless-2.591
mates-2.288
warming-2.270
Token say
Token ID910
Feature activation+3.51e-01

Pos logit contributions
stals+0.855
ights+0.854
arty+0.846
PLIED+0.841
usalem+0.824
Neg logit contributions
shown-1.036
 attRot-1.022
ardless-0.973
mates-0.887
warming-0.872
Token something
Token ID1223
Feature activation+1.78e-02

Pos logit contributions
 attRot+2.234
shown+2.229
ardless+2.070
mates+1.916
warming+1.879
Neg logit contributions
ights-1.889
PLIED-1.854
stals-1.845
usalem-1.827
arty-1.823
Token really
Token ID1107
Feature activation-7.80e-01

Pos logit contributions
shown+0.113
 attRot+0.113
ardless+0.105
mates+0.097
warming+0.097
Neg logit contributions
ights-0.096
stals-0.095
PLIED-0.093
arty-0.093
usalem-0.092
Token that
Token ID326
Feature activation+3.77e-01

Pos logit contributions
shown+0.477
 attRot+0.462
ardless+0.439
mates+0.404
warming+0.394
Neg logit contributions
ights-0.378
stals-0.371
usalem-0.369
arty-0.366
PLIED-0.363
Token most
Token ID749
Feature activation+3.11e-01

Pos logit contributions
shown+2.388
 attRot+2.339
ardless+2.205
mates+2.073
warming+2.003
Neg logit contributions
usalem-2.021
ights-2.021
stals-2.001
PLIED-1.979
arty-1.972
Token of
Token ID286
Feature activation-4.37e-01

Pos logit contributions
shown+1.957
 attRot+1.911
ardless+1.801
mates+1.676
idate+1.601
Neg logit contributions
arty-1.695
usalem-1.690
stals-1.686
PLIED-1.668
ights-1.647
Token the
Token ID262
Feature activation+2.11e-01

Pos logit contributions
ights+2.017
stals+1.985
arty+1.982
usalem+1.939
PLIED+1.939
Neg logit contributions
shown-3.171
 attRot-3.099
ardless-2.972
mates-2.730
warming-2.698
Token report
Token ID989
Feature activation-6.61e-02

Pos logit contributions
shown+1.250
 attRot+1.223
ardless+1.143
mates+1.064
warming+1.020
Neg logit contributions
arty-1.228
stals-1.225
usalem-1.224
PLIED-1.211
ights-1.210
Token she
Token ID673
Feature activation-7.27e-01

Pos logit contributions
ights+0.328
stals+0.321
arty+0.321
usalem+0.320
PLIED+0.319
Neg logit contributions
shown-0.458
 attRot-0.443
ardless-0.422
mates-0.395
warming-0.385
Token was
Token ID373
Feature activation-3.32e-01

Pos logit contributions
ights+3.368
arty+3.305
stals+3.292
PLIED+3.217
 Twice+3.199
Neg logit contributions
shown-5.257
 attRot-5.128
ardless-4.958
mates-4.523
warming-4.500
Token coerced
Token ID48474
Feature activation-2.18e-01

Pos logit contributions
ights+2.335
stals+2.321
arty+2.300
usalem+2.276
PLIED+2.275
Neg logit contributions
shown-1.615
 attRot-1.543
ardless-1.467
mates-1.274
warming-1.253
Token into
Token ID656
Feature activation-8.18e-01

Pos logit contributions
ights+1.559
arty+1.544
stals+1.544
PLIED+1.520
 Dimension+1.512
Neg logit contributions
shown-1.026
 attRot-0.979
ardless-0.936
mates-0.803
warming-0.795
Token filing
Token ID12180
Feature activation-9.90e-01

Pos logit contributions
ights+3.891
arty+3.806
stals+3.747
 Twice+3.697
 Dimension+3.689
Neg logit contributions
shown-5.765
 attRot-5.613
ardless-5.424
mates-4.935
warming-4.890
Token is
Token ID318
Feature activation-8.86e-01

Pos logit contributions
ights+4.710
arty+4.606
stals+4.591
 Twice+4.500
 Dimension+4.489
Neg logit contributions
shown-6.889
 attRot-6.711
ardless-6.508
warming-5.876
mates-5.856
Token Keller
Token ID35999
Feature activation-5.28e-01

Pos logit contributions
ights+1.246
arty+1.236
stals+1.230
usalem+1.201
PLIED+1.199
Neg logit contributions
shown-1.678
 attRot-1.625
ardless-1.565
mates-1.435
warming-1.422
Token that
Token ID326
Feature activation-1.59e-01

Pos logit contributions
ights+2.939
arty+2.898
stals+2.894
PLIED+2.840
usalem+2.824
Neg logit contributions
shown-3.336
 attRot-3.240
ardless-3.115
mates-2.809
warming-2.790
Token summer
Token ID3931
Feature activation-7.24e-02

Pos logit contributions
stals+0.904
arty+0.904
ights+0.901
PLIED+0.884
usalem+0.878
Neg logit contributions
shown-0.991
 attRot-0.959
ardless-0.917
warming-0.842
mates-0.831
Token in
Token ID287
Feature activation+3.50e-01

Pos logit contributions
ights+0.465
arty+0.462
stals+0.461
usalem+0.457
PLIED+0.455
Neg logit contributions
shown-0.390
 attRot-0.383
ardless-0.363
warming-0.324
mates-0.324
Token Massachusetts
Token ID10140
Feature activation-1.87e-01

Pos logit contributions
shown+2.318
 attRot+2.260
ardless+2.108
mates+2.031
warming+1.956
Neg logit contributions
stals-1.807
ights-1.781
arty-1.759
usalem-1.758
PLIED-1.749
Token.
Token ID13
Feature activation-3.52e-01

Pos logit contributions
stals+1.210
ights+1.209
arty+1.199
usalem+1.188
PLIED+1.182
Neg logit contributions
shown-1.006
 attRot-0.977
ardless-0.919
mates-0.831
warming-0.821
Token
Token ID198
Feature activation-2.80e-01

Pos logit contributions
ights+1.953
arty+1.942
stals+1.930
usalem+1.919
PLIED+1.890
Neg logit contributions
shown-2.263
 attRot-2.178
ardless-2.088
mates-1.881
warming-1.864
Token
Token ID198
Feature activation-5.86e-01

Pos logit contributions
ights+1.519
arty+1.491
stals+1.490
 Twice+1.455
PLIED+1.451
Neg logit contributions
shown-1.794
 attRot-1.739
ardless-1.692
mates-1.510
ensical-1.509
Token"
Token ID1
Feature activation-4.44e-01

Pos logit contributions
ights+3.193
stals+3.151
arty+3.149
PLIED+3.078
usalem+3.077
Neg logit contributions
shown-3.785
 attRot-3.662
ardless-3.512
mates-3.179
warming-3.166
TokenI
Token ID40
Feature activation-8.50e-01

Pos logit contributions
ights+2.361
arty+2.339
stals+2.337
usalem+2.306
 Dimension+2.276
Neg logit contributions
shown-2.956
 attRot-2.849
ardless-2.724
mates-2.479
warming-2.475
Token never
Token ID1239
Feature activation+4.39e-02

Pos logit contributions
ights+3.703
arty+3.619
stals+3.606
PLIED+3.511
 Twice+3.498
Neg logit contributions
shown-6.365
 attRot-6.207
ardless-6.018
mates-5.486
warming-5.470
Token mean
Token ID1612
Feature activation+6.56e-02

Pos logit contributions
 attRot+3.896
shown+3.853
ardless+3.606
mates+3.333
warming+3.228
Neg logit contributions
PLIED-3.925
usalem-3.856
stals-3.788
arty-3.777
 Twice-3.728
Token Montreal
Token ID12871
Feature activation+1.26e-02

Pos logit contributions
shown+0.439
 attRot+0.425
ardless+0.410
mates+0.372
warming+0.370
Neg logit contributions
ights-0.340
arty-0.336
stals-0.335
PLIED-0.333
usalem-0.328
Token
Token ID447
Feature activation+2.33e-02

Pos logit contributions
shown+0.090
 attRot+0.087
ardless+0.085
mates+0.079
warming+0.078
Neg logit contributions
PLIED-0.058
ights-0.058
arty-0.057
stals-0.057
usalem-0.056
Token
Token ID247
Feature activation+3.43e-02

Pos logit contributions
shown+0.102
 attRot+0.098
ardless+0.092
mates+0.079
warming+0.077
Neg logit contributions
ights-0.175
arty-0.173
stals-0.172
PLIED-0.170
usalem-0.170
Tokens
Token ID82
Feature activation+1.47e-01

Pos logit contributions
shown+0.187
 attRot+0.180
ardless+0.172
mates+0.153
warming+0.151
Neg logit contributions
ights-0.220
arty-0.217
stals-0.216
PLIED-0.214
usalem-0.213
Token second
Token ID1218
Feature activation-4.63e-01

Pos logit contributions
shown+0.958
 attRot+0.928
ardless+0.888
mates+0.817
warming+0.802
Neg logit contributions
ights-0.775
PLIED-0.774
arty-0.773
stals-0.772
usalem-0.766
Token power
Token ID1176
Feature activation-7.44e-01

Pos logit contributions
ights+2.206
arty+2.174
stals+2.170
PLIED+2.126
usalem+2.106
Neg logit contributions
shown-3.292
 attRot-3.206
ardless-3.095
mates-2.827
warming-2.809
Token-
Token ID12
Feature activation+5.81e-01

Pos logit contributions
ights+2.703
arty+2.595
usalem+2.539
stals+2.535
 Twice+2.519
Neg logit contributions
shown-6.018
 attRot-5.873
ardless-5.685
mates-5.211
warming-5.180
Tokenplay
Token ID1759
Feature activation-2.89e-01

Pos logit contributions
shown+3.839
 attRot+3.577
ardless+3.472
mates+3.267
warming+3.256
Neg logit contributions
 Dimension-3.107
PLIED-3.107
 Twice-3.022
stals-3.020
arty-3.012
Token defensive
Token ID6110
Feature activation-7.44e-01

Pos logit contributions
ights+1.926
arty+1.906
stals+1.901
PLIED+1.877
usalem+1.866
Neg logit contributions
shown-1.498
 attRot-1.445
ardless-1.372
mates-1.214
warming-1.198
Token pairing
Token ID27356
Feature activation+5.57e-01

Pos logit contributions
ights+4.609
stals+4.511
arty+4.499
 Twice+4.445
 Dimension+4.431
Neg logit contributions
shown-4.171
 attRot-4.036
ardless-3.891
warming-3.388
mates-3.386
Token can
Token ID460
Feature activation+7.43e-01

Pos logit contributions
shown+6.281
 attRot+6.084
ardless+5.904
mates+5.419
FY+5.213
Neg logit contributions
PLIED-4.667
stals-4.549
 Dimension-4.503
ights-4.490
usalem-4.427
Token also
Token ID635
Feature activation+5.56e-01

Pos logit contributions
shown+4.300
 attRot+4.282
ardless+4.055
mates+3.736
idate+3.662
Neg logit contributions
PLIED-4.237
usalem-4.142
stals-4.100
ights-4.097
 Twice-4.072
Token go
Token ID467
Feature activation+8.17e-01

Pos logit contributions
 attRot+3.332
shown+3.330
ardless+3.136
mates+2.858
idate+2.813
Neg logit contributions
PLIED-3.095
ights-3.060
usalem-3.052
stals-3.040
arty-2.941
Token ahead
Token ID4058
Feature activation+1.09e+00

Pos logit contributions
shown+4.715
 attRot+4.684
ardless+4.507
mates+4.168
warming+4.077
Neg logit contributions
PLIED-4.473
usalem-4.437
ights-4.409
stals-4.406
 Dimension-4.286
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
shown+7.501
 attRot+7.350
ardless+7.315
mates+6.699
warming+6.662
Neg logit contributions
PLIED-4.671
ights-4.618
usalem-4.582
stals-4.556
 Dimension-4.455
Token get
Token ID651
Feature activation-3.27e-03

Pos logit contributions
 attRot+5.753
shown+5.612
ardless+5.305
mates+4.997
idate+4.741
Neg logit contributions
PLIED-5.831
usalem-5.797
stals-5.768
ights-5.722
arty-5.449
Token something
Token ID1223
Feature activation+6.62e-02

Pos logit contributions
ights+0.017
stals+0.017
arty+0.016
usalem+0.016
PLIED+0.016
Neg logit contributions
shown-0.022
 attRot-0.021
ardless-0.021
mates-0.019
warming-0.019
Token for
Token ID329
Feature activation-9.38e-01

Pos logit contributions
shown+0.409
 attRot+0.402
ardless+0.386
mates+0.348
warming+0.347
Neg logit contributions
ights-0.371
stals-0.365
PLIED-0.361
arty-0.360
usalem-0.360
Token yourself
Token ID3511
Feature activation+2.90e-01

Pos logit contributions
ights+4.222
arty+4.128
stals+4.105
 Twice+4.038
PLIED+4.028
Neg logit contributions
shown-6.768
 attRot-6.603
ardless-6.363
mates-5.816
warming-5.801
Token on
Token ID319
Feature activation-3.55e-02

Pos logit contributions
shown+1.631
 attRot+1.587
ardless+1.543
mates+1.363
warming+1.353
Neg logit contributions
ights-1.786
stals-1.755
arty-1.754
PLIED-1.728
usalem-1.727
Token my
Token ID616
Feature activation-2.60e-01

Pos logit contributions
ights+0.177
stals+0.175
arty+0.174
PLIED+0.172
usalem+0.171
Neg logit contributions
shown-0.243
 attRot-0.238
ardless-0.230
mates-0.210
warming-0.206
Tokencry
Token ID20470
Feature activation-1.10e+00

Pos logit contributions
ights+6.218
arty+6.103
stals+6.093
ben+5.948
 Dimension+5.834
Neg logit contributions
shown-8.729
 attRot-8.566
ardless-8.315
mates-7.409
ensical-7.382
Tokenstal
Token ID7757
Feature activation-2.68e-02

Pos logit contributions
stals+3.232
ights+3.020
arty+2.896
ben+2.862
 Twice+2.786
Neg logit contributions
shown-9.780
 attRot-9.566
ardless-9.335
mates-8.653
warming-8.622
Token meth
Token ID11248
Feature activation-8.96e-01

Pos logit contributions
arty+0.185
ights+0.185
PLIED+0.182
 Dimension+0.182
usalem+0.181
Neg logit contributions
shown-0.130
 attRot-0.128
ardless-0.121
mates-0.108
warming-0.104
Token makes
Token ID1838
Feature activation-1.03e+00

Pos logit contributions
ights+4.215
stals+4.143
arty+4.121
PLIED+4.016
 Twice+3.988
Neg logit contributions
shown-6.388
 attRot-6.207
ardless-6.019
mates-5.463
warming-5.439
Token it
Token ID340
Feature activation-6.11e-01

Pos logit contributions
ights+4.757
arty+4.641
stals+4.629
 Twice+4.544
 Dimension+4.515
Neg logit contributions
shown-7.380
 attRot-7.163
ardless-6.978
warming-6.276
mates-6.268
Token so
Token ID523
Feature activation-1.81e+00

Pos logit contributions
ights+2.937
arty+2.885
stals+2.877
PLIED+2.815
 Twice+2.798
Neg logit contributions
shown-4.307
 attRot-4.199
ardless-4.054
mates-3.694
warming-3.670
Token you
Token ID345
Feature activation-7.29e-01

Pos logit contributions
ights+8.187
stals+8.050
arty+7.900
ben+7.868
 Twice+7.792
Neg logit contributions
shown-12.093
 attRot-11.635
ardless-11.350
idate-10.182
mates-10.136
Token don
Token ID836
Feature activation-6.37e-01

Pos logit contributions
ights+3.321
arty+3.260
stals+3.250
PLIED+3.174
 Twice+3.156
Neg logit contributions
shown-5.321
 attRot-5.189
ardless-5.019
mates-4.585
warming-4.560
Token't
Token ID470
Feature activation-2.38e-01

Pos logit contributions
ights+2.039
arty+1.976
stals+1.969
ben+1.881
 Twice+1.866
Neg logit contributions
shown-5.444
 attRot-5.303
ardless-5.228
ensical-4.988
warming-4.814
Token ejac
Token ID39426
Feature activation-1.42e-01

Pos logit contributions
ights+1.232
stals+1.224
arty+1.216
PLIED+1.214
usalem+1.201
Neg logit contributions
shown-1.569
 attRot-1.562
ardless-1.485
mates-1.364
warming-1.329
Tokenulate
Token ID5039
Feature activation+9.15e-02

Pos logit contributions
ights+1.322
stals+1.308
arty+1.305
PLIED+1.297
usalem+1.286
Neg logit contributions
shown-0.363
 attRot-0.352
ardless-0.308
mates-0.234
warming-0.215
Token everything
Token ID2279
Feature activation+1.17e-01

Pos logit contributions
ights+0.910
stals+0.895
arty+0.894
PLIED+0.886
usalem+0.885
Neg logit contributions
shown-1.165
 attRot-1.146
ardless-1.094
mates-1.001
warming-0.986
Token together
Token ID1978
Feature activation+7.71e-02

Pos logit contributions
shown+0.676
 attRot+0.660
ardless+0.626
mates+0.564
warming+0.558
Neg logit contributions
ights-0.707
stals-0.706
PLIED-0.701
arty-0.700
usalem-0.694
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
shown+0.414
 attRot+0.403
ardless+0.381
mates+0.337
warming+0.334
Neg logit contributions
ights-0.497
arty-0.493
stals-0.493
PLIED-0.491
usalem-0.487
Token You
Token ID921
Feature activation+3.08e-01

Pos logit contributions
shown+0.899
 attRot+0.873
ardless+0.832
mates+0.763
FY+0.738
Neg logit contributions
usalem-0.756
ights-0.748
arty-0.744
PLIED-0.736
stals-0.735
Token can
Token ID460
Feature activation+2.35e-01

Pos logit contributions
shown+2.195
 attRot+2.138
ardless+2.050
mates+1.913
warming+1.860
Neg logit contributions
PLIED-1.423
stals-1.397
usalem-1.394
ights-1.378
arty-1.365
Token lose
Token ID4425
Feature activation-1.53e+00

Pos logit contributions
shown+1.510
 attRot+1.496
ardless+1.421
mates+1.301
idate+1.276
Neg logit contributions
PLIED-1.257
usalem-1.242
ights-1.226
stals-1.223
arty-1.211
Token any
Token ID597
Feature activation-5.49e-01

Pos logit contributions
ights+7.597
arty+7.431
 Twice+7.364
 Dimension+7.350
stals+7.213
Neg logit contributions
shown-9.978
 attRot-9.568
ardless-9.309
warming-8.309
ensical-8.231
Token one
Token ID530
Feature activation-4.90e-01

Pos logit contributions
ights+2.896
arty+2.857
stals+2.855
PLIED+2.803
usalem+2.788
Neg logit contributions
shown-3.615
 attRot-3.515
ardless-3.386
mates-3.072
warming-3.047
Token of
Token ID286
Feature activation-7.37e-01

Pos logit contributions
ights+2.423
stals+2.385
arty+2.379
PLIED+2.334
usalem+2.320
Neg logit contributions
shown-3.397
 attRot-3.310
ardless-3.193
mates-2.910
warming-2.892
Token the
Token ID262
Feature activation-4.60e-01

Pos logit contributions
ights+3.684
arty+3.621
stals+3.606
PLIED+3.530
 Twice+3.522
Neg logit contributions
shown-5.044
 attRot-4.912
ardless-4.743
mates-4.302
warming-4.283
Token 5
Token ID642
Feature activation-4.22e-01

Pos logit contributions
ights+2.442
arty+2.417
stals+2.415
PLIED+2.376
usalem+2.371
Neg logit contributions
shown-3.008
 attRot-2.921
ardless-2.813
mates-2.569
warming-2.530
Tokenapt
Token ID2373
Feature activation-8.97e-02

Pos logit contributions
ights+6.156
arty+5.955
stals+5.798
PLIED+5.748
usalem+5.727
Neg logit contributions
shown-5.538
 attRot-5.384
ardless-5.115
mates-4.571
warming-4.517
Tokenor
Token ID273
Feature activation-2.78e-02

Pos logit contributions
ights+0.402
stals+0.396
arty+0.394
PLIED+0.391
usalem+0.384
Neg logit contributions
shown-0.660
 attRot-0.643
ardless-0.624
mates-0.571
warming-0.565
Token is
Token ID318
Feature activation-3.33e-01

Pos logit contributions
stals+0.142
arty+0.140
PLIED+0.140
usalem+0.139
ights+0.138
Neg logit contributions
shown-0.187
 attRot-0.182
ardless-0.172
mates-0.162
warming-0.156
Token 6
Token ID718
Feature activation+3.52e-01

Pos logit contributions
stals+1.878
ights+1.877
arty+1.855
PLIED+1.841
usalem+1.830
Neg logit contributions
shown-2.089
 attRot-2.025
ardless-1.936
mates-1.743
warming-1.726
Token inches
Token ID8331
Feature activation-4.02e-01

Pos logit contributions
shown+2.229
 attRot+2.206
ardless+2.068
mates+1.907
warming+1.884
Neg logit contributions
arty-1.882
stals-1.877
usalem-1.854
ights-1.843
PLIED-1.828
Token wider
Token ID10595
Feature activation-1.54e+00

Pos logit contributions
ights+2.092
stals+2.079
arty+2.073
PLIED+2.034
usalem+2.019
Neg logit contributions
shown-2.688
 attRot-2.613
ardless-2.500
mates-2.269
warming-2.266
Token than
Token ID621
Feature activation+2.35e-01

Pos logit contributions
ights+5.837
 Dimension+5.729
arty+5.706
stals+5.639
 Twice+5.628
Neg logit contributions
shown-11.494
 attRot-11.288
ardless-10.876
mates-9.984
warming-9.953
Token the
Token ID262
Feature activation+1.51e-01

Pos logit contributions
shown+1.554
 attRot+1.485
ardless+1.433
mates+1.313
warming+1.250
Neg logit contributions
stals-1.252
ights-1.228
usalem-1.227
arty-1.206
PLIED-1.203
Token standard
Token ID3210
Feature activation+1.21e+00

Pos logit contributions
shown+1.039
 attRot+0.996
ardless+0.961
mates+0.891
warming+0.849
Neg logit contributions
stals-0.763
ights-0.744
usalem-0.744
arty-0.738
PLIED-0.734
Token F
Token ID376
Feature activation-6.74e-01

Pos logit contributions
shown+6.522
 attRot+6.462
ardless+5.906
mates+5.685
bred+5.446
Neg logit contributions
stals-7.085
usalem-6.735
PLIED-6.691
ights-6.612
arty-6.484
Token-
Token ID12
Feature activation+9.62e-01

Pos logit contributions
ights+2.534
arty+2.477
stals+2.467
PLIED+2.399
 Twice+2.381
Neg logit contributions
shown-5.462
 attRot-5.341
ardless-5.183
mates-4.783
warming-4.758
Token:
Token ID25
Feature activation+6.89e-04

Pos logit contributions
ights+2.528
arty+2.482
stals+2.472
ben+2.409
 Dimension+2.404
Neg logit contributions
shown-3.854
 attRot-3.730
ardless-3.661
ensical-3.321
mates-3.286
Token Clive
Token ID47722
Feature activation+3.26e-01

Pos logit contributions
shown+0.005
 attRot+0.005
ardless+0.004
mates+0.004
warming+0.004
Neg logit contributions
ights-0.004
stals-0.004
usalem-0.004
arty-0.004
PLIED-0.003
Token Brun
Token ID15700
Feature activation-1.63e-02

Pos logit contributions
shown+2.084
 attRot+2.052
ardless+1.879
warming+1.762
mates+1.723
Neg logit contributions
usalem-1.800
stals-1.737
ights-1.725
arty-1.651
PLIED-1.647
Tokenskill
Token ID42401
Feature activation-8.92e-02

Pos logit contributions
ights+0.112
stals+0.111
usalem+0.111
 Dimension+0.110
arty+0.110
Neg logit contributions
shown-0.081
 attRot-0.076
ardless-0.073
mates-0.065
warming-0.064
Token/
Token ID14
Feature activation-5.33e-01

Pos logit contributions
ights+0.539
stals+0.533
arty+0.528
usalem+0.527
 Dimension+0.517
Neg logit contributions
shown-0.523
 attRot-0.506
ardless-0.471
mates-0.434
warming-0.430
TokenGetty
Token ID6633
Feature activation-1.52e+00

Pos logit contributions
ights+3.401
stals+3.354
arty+3.343
usalem+3.296
 Twice+3.281
Neg logit contributions
shown-2.943
 attRot-2.848
ardless-2.702
mates-2.409
warming-2.384
Token Images
Token ID5382
Feature activation-5.34e-01

Pos logit contributions
ights+4.764
arty+4.762
stals+4.349
PLIED+4.349
 Twice+4.281
Neg logit contributions
shown-12.536
 attRot-12.349
ardless-12.059
ensical-11.161
warming-11.126
Token
Token ID198
Feature activation-6.39e-01

Pos logit contributions
ights+2.943
arty+2.897
stals+2.875
PLIED+2.841
 Twice+2.820
Neg logit contributions
shown-3.379
 attRot-3.276
ardless-3.192
mates-2.834
warming-2.827
Token
Token ID198
Feature activation-2.94e-01

Pos logit contributions
ights+3.192
stals+3.122
arty+3.111
ben+3.071
 Twice+3.067
Neg logit contributions
shown-4.275
 attRot-4.121
ardless-4.081
ensical-3.854
warming-3.634
TokenThat
Token ID2504
Feature activation+6.80e-01

Pos logit contributions
ights+1.668
stals+1.651
arty+1.645
usalem+1.632
PLIED+1.618
Neg logit contributions
shown-1.844
 attRot-1.789
ardless-1.689
mates-1.544
warming-1.530
Token low
Token ID1877
Feature activation+1.38e-01

Pos logit contributions
shown+4.207
 attRot+4.119
ardless+3.736
mates+3.535
warming+3.503
Neg logit contributions
stals-3.685
ights-3.674
usalem-3.672
PLIED-3.668
arty-3.508
Tokenian
Token ID666
Feature activation-2.42e-01

Pos logit contributions
ights+1.228
arty+1.210
stals+1.207
PLIED+1.191
 Twice+1.182
Neg logit contributions
shown-1.335
 attRot-1.297
ardless-1.245
mates-1.117
warming-1.110
Token had
Token ID550
Feature activation-1.45e-02

Pos logit contributions
ights+1.476
stals+1.470
arty+1.454
PLIED+1.449
usalem+1.431
Neg logit contributions
shown-1.409
 attRot-1.343
ardless-1.284
warming-1.157
mates-1.148
Token published
Token ID3199
Feature activation-4.28e-01

Pos logit contributions
stals+0.081
ights+0.081
PLIED+0.081
usalem+0.080
arty+0.079
Neg logit contributions
shown-0.091
 attRot-0.087
ardless-0.082
mates-0.076
warming-0.075
Token in
Token ID287
Feature activation-4.03e-01

Pos logit contributions
ights+2.331
stals+2.302
arty+2.300
PLIED+2.262
usalem+2.247
Neg logit contributions
shown-2.748
 attRot-2.664
ardless-2.556
mates-2.324
warming-2.303
Token the
Token ID262
Feature activation-9.03e-01

Pos logit contributions
ights+2.067
stals+2.061
arty+2.047
PLIED+2.019
usalem+1.992
Neg logit contributions
shown-2.735
 attRot-2.641
ardless-2.536
mates-2.332
warming-2.294
Token New
Token ID968
Feature activation-2.04e+00

Pos logit contributions
ights+4.439
arty+4.334
stals+4.298
 Twice+4.248
 Dimension+4.238
Neg logit contributions
shown-6.218
 attRot-6.079
ardless-5.882
mates-5.338
warming-5.301
Token Yorker
Token ID27980
Feature activation-1.02e+00

Pos logit contributions
 Dimension+3.903
ights+3.719
 Trib+3.552
 Twice+3.527
 Horizons+3.394
Neg logit contributions
shown-18.137
 attRot-17.833
ardless-17.492
mates-16.304
idate-16.217
Token.
Token ID13
Feature activation-5.93e-01

Pos logit contributions
ights+5.980
arty+5.907
stals+5.867
 Twice+5.838
 Dimension+5.810
Neg logit contributions
shown-5.895
 attRot-5.726
ardless-5.571
mates-4.896
warming-4.797
Token In
Token ID554
Feature activation+4.81e-01

Pos logit contributions
ights+3.016
arty+2.972
stals+2.960
PLIED+2.902
usalem+2.901
Neg logit contributions
shown-4.039
 attRot-3.921
ardless-3.780
mates-3.438
warming-3.404
Token an
Token ID281
Feature activation-1.41e-01

Pos logit contributions
shown+2.855
 attRot+2.722
ardless+2.539
mates+2.522
warming+2.312
Neg logit contributions
PLIED-2.806
stals-2.785
arty-2.775
ights-2.739
usalem-2.738
Token interview
Token ID2720
Feature activation-7.23e-02

Pos logit contributions
stals+0.818
ights+0.810
PLIED+0.791
arty+0.788
usalem+0.779
Neg logit contributions
shown-0.876
 attRot-0.834
ardless-0.804
mates-0.732
warming-0.729