TOP ACTIVATIONS
MAX = 2.869

with a French and African mixed heritage , who finds herself
t feel the need to mang le . Sen
ima Fl ott ig lia MAS led by Jun io Val
. <|endoftext|> I personally have mixed feelings about James Gunn 's
subscription behind is that you very often can 't watch things
How can someone with a mixed background better embrace both cultures
the debate came at the very end , when Pence tried
. T RE MAS ON , A UB URN
European democracies is at the very heart of our geopolitical strategy
student who argues he was completely denied his right to confront
the night we were all reeling from shock and grief .
of these interactions , I very possibly could have been more
use apologies to the Motion Picture Academy and to the filmmakers
up for discussion ( even mang achu or bre ade x
with 2 cups of frozen mixed berries : And
fail miser ably at the box office , Hitchcock offered to
bananas , peanut butter , mixed berries , and the
icated in minutes . Political jugg ern auts had fallen to
people , and at the very least , ensure the president
plan the situations might very well be reversed . It

MIN ACTIVATIONS
MIN = -3.060

the # deb ates # Occup y The Deb ates pic
stage win record at the Tour Down Under . He had
the shift less , hook n osed Jew from
leave . OS HA ( Occup ational Health and Safety Administration
David Camp os San Francisco Board of Super visors
76 or ed raper @ den ver post . com
or with your friends or roomm ates , for example .
all non farm sectors ( author s analysis of
got ists with a loyal fan base who will hyper -
Q F 72 incident has lived inside me every single day
not protecting the rights of Occupy demonstrators whose protests are being
inside , and perhaps even rent an apartment in Wa ik
Cuba present . Count ries that once acquies ced
the 3 rd Italian " San Marco " Marine Division and
not a full on " den ial " it was enough
use trails from Claire ville Reserv oir in Bram pton to
under a brutally misogyn istic cler ical culture she isn
set to take on the Occupy Wall Street protests in Wednesday
uit . The contamination was eventually cleared , as the material
App ro ved Author Notes : This could go

INTERVAL 1.387 - 2.869
CONTAINS 0.416%

als are poorer , black er , less educated and younger
could be reprim anded . J SO said since one of
condo in terms we 're all familiar with , from catalog
the 10 " 150 watt J BL before and it sounded
gow 1997 Michael J . Fox 1997

INTERVAL -0.095 - 1.387
CONTAINS 57.943%

which shows the item number and U PC of the Best
t figure any of this out .
, in - house online fundraising tool , Sanders can lean
now , Dr Dum us que said .
in Boston . ( Wa ub ges hig Rice / CBC

INTERVAL -1.578 - -0.095
CONTAINS 41.463%

interested in information about Google allegedly limiting access to rival search
- round . The ice breaker Lenin was the
to vote , and a record 11 . 2 million voted
has set out an ambitious goal for the Paris meeting
sim lock on the device as opposed to the Google Play

INTERVAL -3.060 - -1.578
CONTAINS 0.178%

so much more to it than that like working hard concentrating
NFL need to protect the whistleblower ( s ) in Bounty
the ping tests will be stored in a S 3 bucket
designed houses that can be rented out . The
s what the Consumer Financial Protection Bureau which
Token with
Token ID351
Feature activation+1.43e-01

Pos logit contributions
sburgh+1.818
opsis+1.754
ificant+1.519
ocene+1.496
heastern+1.474
Neg logit contributions
culus-1.898
picture-1.637
moral-1.597
rely-1.570
 footing-1.484
Token a
Token ID257
Feature activation-1.16e-01

Pos logit contributions
culus+0.864
picture+0.742
moral+0.728
rely+0.712
 footing+0.683
Neg logit contributions
sburgh-0.810
opsis-0.779
ificant-0.673
ocene-0.668
heastern-0.656
Token French
Token ID4141
Feature activation+6.20e-01

Pos logit contributions
sburgh+0.616
opsis+0.591
ificant+0.506
ocene+0.499
heastern+0.491
Neg logit contributions
culus-0.743
picture-0.645
moral-0.631
rely-0.620
 footing-0.592
Token and
Token ID290
Feature activation+4.05e-01

Pos logit contributions
culus+3.460
moral+2.936
picture+2.927
rely+2.786
 footing+2.772
Neg logit contributions
sburgh-3.715
opsis-3.565
ificant-3.151
ocene-3.114
heastern-3.046
Token African
Token ID5510
Feature activation+3.30e-01

Pos logit contributions
culus+2.229
picture+1.892
moral+1.862
rely+1.796
 footing+1.732
Neg logit contributions
sburgh-2.475
opsis-2.392
ificant-2.099
ocene-2.094
heastern-2.049
Token mixed
Token ID7668
Feature activation+2.87e+00

Pos logit contributions
culus+1.729
picture+1.448
moral+1.434
rely+1.370
 footing+1.337
Neg logit contributions
sburgh-2.126
opsis-2.025
ificant-1.803
ocene-1.780
heastern-1.761
Token heritage
Token ID15012
Feature activation+2.41e-01

Pos logit contributions
culus+11.960
moral+10.643
picture+10.581
 footing+10.542
 sexes+10.189
Neg logit contributions
sburgh-16.466
opsis-14.866
ocene-13.901
 externalToEVAOnly-13.418
ificant-13.375
Token,
Token ID11
Feature activation-4.25e-01

Pos logit contributions
culus+1.480
picture+1.279
moral+1.251
rely+1.221
 footing+1.182
Neg logit contributions
sburgh-1.342
opsis-1.279
ificant-1.111
ocene-1.098
heastern-1.078
Token who
Token ID508
Feature activation+5.00e-02

Pos logit contributions
sburgh+2.452
opsis+2.371
ificant+2.052
ocene+2.026
heastern+1.993
Neg logit contributions
culus-2.493
picture-2.139
moral-2.085
rely-2.052
 footing-1.923
Token finds
Token ID7228
Feature activation-7.54e-02

Pos logit contributions
culus+0.312
picture+0.269
moral+0.264
rely+0.259
 footing+0.248
Neg logit contributions
sburgh-0.274
opsis-0.262
ificant-0.227
ocene-0.224
heastern-0.220
Token herself
Token ID5223
Feature activation+2.66e-01

Pos logit contributions
sburgh+0.419
opsis+0.403
ificant+0.348
ocene+0.343
heastern+0.337
Neg logit contributions
culus-0.462
picture-0.399
moral-0.390
rely-0.383
 footing-0.364
Tokent
Token ID83
Feature activation+4.10e-01

Pos logit contributions
culus+3.352
picture+2.865
moral+2.796
rely+2.792
 footing+2.587
Neg logit contributions
sburgh-3.410
opsis-3.300
ocene-2.875
heastern-2.836
ificant-2.825
Token feel
Token ID1254
Feature activation+1.20e+00

Pos logit contributions
culus+2.286
picture+1.961
rely+1.894
moral+1.894
 footing+1.795
Neg logit contributions
sburgh-2.490
opsis-2.382
ificant-2.107
ocene-2.095
heastern-2.063
Token the
Token ID262
Feature activation+1.26e+00

Pos logit contributions
culus+6.597
moral+5.569
picture+5.501
rely+5.491
 footing+5.388
Neg logit contributions
sburgh-6.936
opsis-6.737
ificant-5.916
ocene-5.914
 externalToEVAOnly-5.760
Token need
Token ID761
Feature activation+3.75e-01

Pos logit contributions
culus+5.013
moral+3.930
picture+3.854
rely+3.772
 footing+3.712
Neg logit contributions
sburgh-9.263
opsis-9.007
ificant-8.108
ocene-8.014
heastern-7.887
Token to
Token ID284
Feature activation+3.17e-01

Pos logit contributions
culus+2.363
picture+2.045
moral+2.004
rely+1.963
 footing+1.880
Neg logit contributions
sburgh-2.026
opsis-1.944
ificant-1.671
ocene-1.648
heastern-1.619
Token mang
Token ID45663
Feature activation+2.79e+00

Pos logit contributions
culus+1.842
picture+1.587
moral+1.547
rely+1.539
 footing+1.467
Neg logit contributions
sburgh-1.857
opsis-1.773
ificant-1.568
ocene-1.550
heastern-1.519
Tokenle
Token ID293
Feature activation+1.20e+00

Pos logit contributions
culus+9.062
rely+7.988
picture+7.953
moral+7.515
tie+6.875
Neg logit contributions
sburgh-18.442
opsis-17.757
ificant-16.190
ocene-16.165
heastern-16.088
Token.
Token ID13
Feature activation+5.51e-01

Pos logit contributions
culus+6.840
picture+5.995
moral+5.786
rely+5.771
 footing+5.577
Neg logit contributions
sburgh-6.754
opsis-6.322
ocene-5.653
ificant-5.514
heastern-5.445
Token
Token ID198
Feature activation+1.47e-01

Pos logit contributions
culus+3.101
picture+2.684
moral+2.577
rely+2.539
 footing+2.384
Neg logit contributions
sburgh-3.280
opsis-3.174
ificant-2.759
ocene-2.754
heastern-2.721
Token
Token ID198
Feature activation+5.78e-01

Pos logit contributions
culus+0.833
picture+0.721
moral+0.697
rely+0.687
 footing+0.645
Neg logit contributions
sburgh-0.882
opsis-0.853
ificant-0.747
ocene-0.738
heastern-0.730
TokenSen
Token ID10445
Feature activation-4.83e-01

Pos logit contributions
culus+3.604
picture+3.223
moral+3.111
rely+3.056
 footing+2.831
Neg logit contributions
sburgh-3.100
opsis-2.964
ificant-2.556
ocene-2.510
heastern-2.490
Tokenima
Token ID8083
Feature activation-5.41e-01

Pos logit contributions
sburgh+5.055
opsis+4.812
ificant+4.195
ocene+4.138
heastern+4.091
Neg logit contributions
culus-5.732
picture-4.945
moral-4.856
rely-4.716
 footing-4.650
Token Fl
Token ID1610
Feature activation-7.79e-02

Pos logit contributions
sburgh+3.053
opsis+2.944
ificant+2.556
ocene+2.497
heastern+2.477
Neg logit contributions
culus-3.230
picture-2.784
moral-2.714
rely-2.659
 footing-2.551
Tokenott
Token ID1252
Feature activation-3.26e-01

Pos logit contributions
sburgh+0.436
opsis+0.419
ificant+0.362
ocene+0.357
heastern+0.351
Neg logit contributions
culus-0.475
picture-0.409
moral-0.401
rely-0.393
 footing-0.375
Tokenig
Token ID328
Feature activation-1.14e-02

Pos logit contributions
sburgh+1.763
opsis+1.680
ificant+1.448
ocene+1.430
heastern+1.398
Neg logit contributions
culus-2.050
picture-1.782
moral-1.743
rely-1.708
 footing-1.644
Tokenlia
Token ID24660
Feature activation-4.06e-01

Pos logit contributions
sburgh+0.066
opsis+0.063
ificant+0.055
ocene+0.054
heastern+0.053
Neg logit contributions
culus-0.068
picture-0.058
moral-0.057
rely-0.056
 footing-0.053
Token MAS
Token ID32337
Feature activation+2.70e+00

Pos logit contributions
sburgh+2.359
opsis+2.270
ificant+1.971
ocene+1.935
heastern+1.917
Neg logit contributions
culus-2.378
picture-2.043
moral-1.998
rely-1.946
 footing-1.863
Token led
Token ID2957
Feature activation+3.10e-01

Pos logit contributions
culus+14.534
picture+11.934
moral+11.302
rely+11.118
AME+11.074
Neg logit contributions
sburgh-13.099
opsis-12.314
ocene-10.599
ificant-10.409
heastern-10.215
Token by
Token ID416
Feature activation-7.34e-02

Pos logit contributions
culus+1.823
picture+1.561
moral+1.527
rely+1.497
 footing+1.425
Neg logit contributions
sburgh-1.803
opsis-1.735
ificant-1.507
ocene-1.491
heastern-1.466
Token Jun
Token ID7653
Feature activation-2.43e-01

Pos logit contributions
sburgh+0.409
opsis+0.392
ificant+0.338
ocene+0.334
heastern+0.328
Neg logit contributions
culus-0.450
picture-0.388
moral-0.380
rely-0.372
 footing-0.356
Tokenio
Token ID952
Feature activation+4.62e-01

Pos logit contributions
sburgh+1.357
opsis+1.306
ificant+1.126
ocene+1.111
heastern+1.094
Neg logit contributions
culus-1.487
picture-1.284
moral-1.255
rely-1.229
 footing-1.176
Token Val
Token ID3254
Feature activation+3.03e-01

Pos logit contributions
culus+2.771
picture+2.387
moral+2.333
rely+2.294
 footing+2.156
Neg logit contributions
sburgh-2.604
opsis-2.484
ificant-2.178
ocene-2.166
heastern-2.116
Token.
Token ID13
Feature activation-8.58e-03

Pos logit contributions
sburgh+3.505
opsis+3.358
ificant+2.864
ocene+2.825
heastern+2.801
Neg logit contributions
culus-4.172
picture-3.608
moral-3.549
rely-3.468
 footing-3.283
Token<|endoftext|>
Token ID50256
Feature activation-4.58e-01

Pos logit contributions
sburgh+0.047
opsis+0.045
ificant+0.039
ocene+0.039
heastern+0.038
Neg logit contributions
culus-0.053
picture-0.046
moral-0.045
rely-0.044
 footing-0.042
TokenI
Token ID40
Feature activation+2.11e-01

Pos logit contributions
sburgh+2.480
opsis+2.396
ificant+2.034
ocene+2.033
heastern+1.988
Neg logit contributions
culus-2.839
picture-2.414
moral-2.375
rely-2.338
 footing-2.281
Token personally
Token ID7620
Feature activation+4.29e-01

Pos logit contributions
culus+1.244
picture+1.061
moral+1.032
rely+1.018
 footing+0.963
Neg logit contributions
sburgh-1.231
opsis-1.184
ificant-1.036
ocene-1.017
heastern-1.004
Token have
Token ID423
Feature activation+4.02e-01

Pos logit contributions
culus+2.372
picture+2.010
moral+1.939
rely+1.932
 footing+1.826
Neg logit contributions
sburgh-2.644
opsis-2.542
ificant-2.250
ocene-2.215
heastern-2.198
Token mixed
Token ID7668
Feature activation+2.68e+00

Pos logit contributions
culus+2.485
picture+2.146
moral+2.087
rely+2.072
 footing+1.985
Neg logit contributions
sburgh-2.200
opsis-2.109
ificant-1.812
ocene-1.805
heastern-1.771
Token feelings
Token ID7666
Feature activation+2.26e-01

Pos logit contributions
culus+9.547
picture+7.818
moral+7.736
 footing+7.641
 sexes+7.597
Neg logit contributions
sburgh-17.631
opsis-16.389
ocene-15.650
heastern-15.080
ificant-15.074
Token about
Token ID546
Feature activation-1.00e-01

Pos logit contributions
culus+1.362
picture+1.176
moral+1.151
rely+1.128
 footing+1.085
Neg logit contributions
sburgh-1.281
opsis-1.230
ificant-1.069
ocene-1.052
heastern-1.037
Token James
Token ID3700
Feature activation+1.49e+00

Pos logit contributions
sburgh+0.561
opsis+0.540
ificant+0.466
ocene+0.460
heastern+0.453
Neg logit contributions
culus-0.609
picture-0.526
moral-0.514
rely-0.504
 footing-0.480
Token Gunn
Token ID40837
Feature activation+2.01e-01

Pos logit contributions
culus+7.999
moral+6.793
picture+6.791
rely+6.349
 footing+6.234
Neg logit contributions
sburgh-8.732
opsis-8.518
ocene-7.296
ificant-7.281
 externalToEVAOnly-7.068
Token's
Token ID338
Feature activation+1.38e-02

Pos logit contributions
culus+1.355
picture+1.188
moral+1.163
rely+1.140
 footing+1.099
Neg logit contributions
sburgh-0.995
opsis-0.951
ificant-0.801
ocene-0.794
heastern-0.780
Token subscription
Token ID14569
Feature activation-7.16e-01

Pos logit contributions
sburgh+0.321
opsis+0.309
ificant+0.270
ocene+0.267
heastern+0.264
Neg logit contributions
culus-0.299
picture-0.255
moral-0.249
rely-0.243
 footing-0.231
Token behind
Token ID2157
Feature activation-2.14e-01

Pos logit contributions
sburgh+3.901
opsis+3.739
ificant+3.224
heastern+3.153
ocene+3.135
Neg logit contributions
culus-4.369
picture-3.752
moral-3.704
rely-3.598
 footing-3.340
Token is
Token ID318
Feature activation+1.19e-01

Pos logit contributions
sburgh+1.074
opsis+1.027
ificant+0.870
ocene+0.855
heastern+0.843
Neg logit contributions
culus-1.425
picture-1.244
moral-1.222
rely-1.200
 footing-1.147
Token that
Token ID326
Feature activation+8.20e-02

Pos logit contributions
culus+0.727
picture+0.628
moral+0.613
rely+0.604
 footing+0.579
Neg logit contributions
sburgh-0.662
opsis-0.637
ificant-0.550
ocene-0.546
heastern-0.534
Token you
Token ID345
Feature activation+5.34e-01

Pos logit contributions
culus+0.533
picture+0.465
moral+0.453
rely+0.449
 footing+0.431
Neg logit contributions
sburgh-0.427
opsis-0.409
ificant-0.349
ocene-0.347
heastern-0.336
Token very
Token ID845
Feature activation+2.61e+00

Pos logit contributions
culus+3.305
picture+2.862
rely+2.801
moral+2.768
 footing+2.691
Neg logit contributions
sburgh-2.900
opsis-2.770
ificant-2.431
ocene-2.390
heastern-2.345
Token often
Token ID1690
Feature activation+5.64e-01

Pos logit contributions
culus+13.380
rely+12.144
picture+11.607
moral+11.288
 footing+11.134
Neg logit contributions
sburgh-13.389
opsis-12.798
ificant-11.505
ocene-11.486
heastern-10.994
Token can
Token ID460
Feature activation+1.55e-01

Pos logit contributions
culus+3.157
picture+2.695
rely+2.625
moral+2.587
 footing+2.487
Neg logit contributions
sburgh-3.396
opsis-3.266
ificant-2.900
ocene-2.849
heastern-2.816
Token't
Token ID470
Feature activation+1.97e-01

Pos logit contributions
culus+1.042
picture+0.913
moral+0.889
rely+0.885
 footing+0.852
Neg logit contributions
sburgh-0.763
opsis-0.728
ificant-0.622
ocene-0.614
heastern-0.601
Token watch
Token ID2342
Feature activation-4.52e-02

Pos logit contributions
culus+1.105
picture+0.942
moral+0.909
rely+0.905
 footing+0.864
Neg logit contributions
sburgh-1.201
opsis-1.155
ificant-1.020
ocene-1.011
heastern-0.989
Token things
Token ID1243
Feature activation+1.79e-02

Pos logit contributions
sburgh+0.243
opsis+0.234
ificant+0.200
ocene+0.198
heastern+0.194
Neg logit contributions
culus-0.285
picture-0.247
moral-0.242
rely-0.238
 footing-0.227
Token How
Token ID1374
Feature activation-3.52e-01

Pos logit contributions
sburgh+0.727
opsis+0.701
ificant+0.593
ocene+0.586
heastern+0.579
Neg logit contributions
culus-0.944
picture-0.829
moral-0.808
rely-0.796
 footing-0.759
Token can
Token ID460
Feature activation-4.24e-01

Pos logit contributions
sburgh+1.884
opsis+1.807
ificant+1.550
ocene+1.528
heastern+1.503
Neg logit contributions
culus-2.231
picture-1.935
moral-1.895
rely-1.858
 footing-1.778
Token someone
Token ID2130
Feature activation+3.90e-01

Pos logit contributions
sburgh+2.060
opsis+1.962
ificant+1.657
ocene+1.625
heastern+1.599
Neg logit contributions
culus-2.894
picture-2.538
moral-2.487
rely-2.432
 footing-2.344
Token with
Token ID351
Feature activation+8.37e-02

Pos logit contributions
culus+2.268
picture+1.942
moral+1.902
rely+1.872
 footing+1.806
Neg logit contributions
sburgh-2.265
opsis-2.183
ificant-1.899
ocene-1.887
heastern-1.856
Token a
Token ID257
Feature activation-1.85e-01

Pos logit contributions
culus+0.499
picture+0.428
moral+0.419
rely+0.411
 footing+0.392
Neg logit contributions
sburgh-0.480
opsis-0.462
ificant-0.400
ocene-0.396
heastern-0.389
Token mixed
Token ID7668
Feature activation+2.60e+00

Pos logit contributions
sburgh+1.039
opsis+0.998
ificant+0.866
ocene+0.852
heastern+0.842
Neg logit contributions
culus-1.119
picture-0.963
moral-0.941
rely-0.923
 footing-0.879
Token background
Token ID4469
Feature activation+1.75e-01

Pos logit contributions
culus+11.666
picture+10.161
moral+10.023
 footing+9.994
 sexes+9.534
Neg logit contributions
sburgh-14.765
opsis-13.991
ocene-12.900
 externalToEVAOnly-12.152
etsk-12.133
Token better
Token ID1365
Feature activation+1.05e+00

Pos logit contributions
culus+1.025
picture+0.879
moral+0.859
rely+0.841
 footing+0.807
Neg logit contributions
sburgh-1.020
opsis-0.982
ificant-0.855
ocene-0.846
heastern-0.829
Token embrace
Token ID12553
Feature activation+2.81e-02

Pos logit contributions
culus+5.275
picture+4.498
moral+4.324
rely+4.313
 footing+4.212
Neg logit contributions
sburgh-6.500
opsis-6.355
ocene-5.700
ificant-5.696
heastern-5.508
Token both
Token ID1111
Feature activation+7.62e-01

Pos logit contributions
culus+0.166
picture+0.142
moral+0.139
rely+0.136
 footing+0.130
Neg logit contributions
sburgh-0.163
opsis-0.157
ificant-0.136
ocene-0.134
heastern-0.132
Token cultures
Token ID13817
Feature activation+1.56e-01

Pos logit contributions
culus+3.716
picture+3.085
moral+3.023
rely+2.976
 footing+2.865
Neg logit contributions
sburgh-5.082
opsis-4.927
ificant-4.336
ocene-4.334
heastern-4.234
Token the
Token ID262
Feature activation-1.58e-01

Pos logit contributions
culus+0.090
picture+0.077
moral+0.075
rely+0.074
 footing+0.070
Neg logit contributions
sburgh-0.090
opsis-0.086
ificant-0.075
ocene-0.074
heastern-0.073
Token debate
Token ID4384
Feature activation+3.51e-01

Pos logit contributions
sburgh+0.927
opsis+0.892
ificant+0.777
ocene+0.767
heastern+0.756
Neg logit contributions
culus-0.917
picture-0.785
moral-0.767
rely-0.750
 footing-0.714
Token came
Token ID1625
Feature activation+9.34e-01

Pos logit contributions
culus+2.031
picture+1.728
moral+1.686
rely+1.654
 footing+1.578
Neg logit contributions
sburgh-2.061
opsis-2.000
ificant-1.737
ocene-1.722
heastern-1.693
Token at
Token ID379
Feature activation+5.08e-01

Pos logit contributions
culus+5.266
moral+4.424
picture+4.422
rely+4.305
 footing+4.119
Neg logit contributions
sburgh-5.300
opsis-5.242
ocene-4.605
ificant-4.567
heastern-4.368
Token the
Token ID262
Feature activation+9.74e-01

Pos logit contributions
culus+2.895
picture+2.449
moral+2.410
rely+2.349
 footing+2.246
Neg logit contributions
sburgh-3.022
opsis-2.933
ificant-2.531
ocene-2.524
heastern-2.491
Token very
Token ID845
Feature activation+2.47e+00

Pos logit contributions
culus+5.300
moral+4.452
picture+4.443
rely+4.356
 footing+4.180
Neg logit contributions
sburgh-5.889
opsis-5.691
ificant-5.040
ocene-4.941
heastern-4.788
Token end
Token ID886
Feature activation+1.52e-01

Pos logit contributions
culus+11.068
moral+9.207
rely+8.870
 footing+8.825
picture+8.766
Neg logit contributions
sburgh-15.029
opsis-13.868
ificant-12.610
ocene-12.537
phis-12.093
Token,
Token ID11
Feature activation-1.24e-01

Pos logit contributions
culus+0.890
picture+0.761
moral+0.744
rely+0.728
 footing+0.693
Neg logit contributions
sburgh-0.893
opsis-0.859
ificant-0.747
ocene-0.738
heastern-0.728
Token when
Token ID618
Feature activation+5.15e-01

Pos logit contributions
sburgh+0.648
opsis+0.618
ificant+0.529
ocene+0.520
heastern+0.512
Neg logit contributions
culus-0.805
picture-0.701
moral-0.685
rely-0.673
 footing-0.643
Token Pence
Token ID17339
Feature activation+6.78e-01

Pos logit contributions
culus+2.725
picture+2.270
moral+2.228
rely+2.187
 footing+2.061
Neg logit contributions
sburgh-3.263
opsis-3.184
ificant-2.785
ocene-2.772
heastern-2.722
Token tried
Token ID3088
Feature activation+1.13e+00

Pos logit contributions
culus+3.996
picture+3.380
moral+3.362
rely+3.277
 footing+3.195
Neg logit contributions
sburgh-3.811
opsis-3.685
ificant-3.238
ocene-3.186
heastern-3.137
Token.
Token ID13
Feature activation+1.15e-01

Pos logit contributions
sburgh+1.991
opsis+1.872
ificant+1.631
ocene+1.608
heastern+1.595
Neg logit contributions
culus-2.143
picture-1.858
moral-1.797
rely-1.770
 footing-1.672
Token
Token ID198
Feature activation+1.51e-01

Pos logit contributions
culus+0.705
picture+0.611
moral+0.596
rely+0.585
 footing+0.555
Neg logit contributions
sburgh-0.638
opsis-0.617
ificant-0.531
ocene-0.525
heastern-0.518
Token
Token ID198
Feature activation+2.45e-01

Pos logit contributions
culus+0.855
picture+0.739
moral+0.715
rely+0.705
 footing+0.661
Neg logit contributions
sburgh-0.909
opsis-0.878
ificant-0.770
ocene-0.761
heastern-0.752
TokenT
Token ID51
Feature activation-6.12e-01

Pos logit contributions
culus+1.504
picture+1.314
moral+1.277
rely+1.253
 footing+1.178
Neg logit contributions
sburgh-1.359
opsis-1.310
ificant-1.132
ocene-1.112
heastern-1.102
TokenRE
Token ID2200
Feature activation+4.23e-01

Pos logit contributions
sburgh+3.583
opsis+3.461
ificant+2.995
ocene+2.973
heastern+2.932
Neg logit contributions
culus-3.507
picture-2.987
moral-2.930
rely-2.869
 footing-2.768
Token MAS
Token ID32337
Feature activation+2.46e+00

Pos logit contributions
culus+2.399
picture+2.095
moral+2.016
rely+1.984
 footing+1.860
Neg logit contributions
sburgh-2.476
opsis-2.394
ificant-2.113
ocene-2.059
heastern-2.056
TokenON
Token ID1340
Feature activation+8.96e-01

Pos logit contributions
culus+10.953
picture+9.332
AME+8.604
moral+8.562
rely+8.428
Neg logit contributions
sburgh-14.739
opsis-13.761
ificant-12.737
heastern-12.474
ocene-12.031
Token,
Token ID11
Feature activation-5.13e-01

Pos logit contributions
culus+4.337
picture+3.579
moral+3.411
rely+3.363
 footing+3.163
Neg logit contributions
sburgh-5.957
opsis-5.783
ificant-5.188
heastern-5.113
ocene-5.107
Token A
Token ID317
Feature activation+2.33e-01

Pos logit contributions
sburgh+2.834
opsis+2.699
ificant+2.308
ocene+2.292
heastern+2.244
Neg logit contributions
culus-3.148
picture-2.701
moral-2.663
rely-2.586
 footing-2.486
TokenUB
Token ID10526
Feature activation+1.15e+00

Pos logit contributions
culus+1.169
picture+0.966
moral+0.946
rely+0.918
 footing+0.866
Neg logit contributions
sburgh-1.552
opsis-1.502
ificant-1.336
ocene-1.317
heastern-1.301
TokenURN
Token ID27064
Feature activation-1.11e-01

Pos logit contributions
culus+7.178
picture+6.476
rely+6.132
moral+6.091
AME+5.916
Neg logit contributions
sburgh-5.657
opsis-5.528
ificant-4.918
heastern-4.748
ocene-4.705
Token European
Token ID3427
Feature activation-5.94e-01

Pos logit contributions
culus+0.120
picture+0.104
moral+0.102
rely+0.100
 footing+0.096
Neg logit contributions
sburgh-0.098
opsis-0.093
ificant-0.080
ocene-0.079
heastern-0.077
Token democracies
Token ID40416
Feature activation-1.62e-01

Pos logit contributions
sburgh+3.308
opsis+3.150
ificant+2.702
ocene+2.670
heastern+2.664
Neg logit contributions
culus-3.626
picture-3.137
moral-3.041
rely-2.967
 footing-2.800
Token is
Token ID318
Feature activation+2.61e-01

Pos logit contributions
sburgh+0.891
opsis+0.855
ificant+0.736
ocene+0.726
heastern+0.715
Neg logit contributions
culus-1.006
picture-0.869
moral-0.851
rely-0.833
 footing-0.795
Token at
Token ID379
Feature activation+2.00e-02

Pos logit contributions
culus+1.614
picture+1.385
moral+1.372
rely+1.353
 footing+1.309
Neg logit contributions
sburgh-1.434
opsis-1.390
ificant-1.188
ocene-1.184
heastern-1.152
Token the
Token ID262
Feature activation+9.32e-01

Pos logit contributions
culus+0.111
picture+0.094
moral+0.092
rely+0.090
 footing+0.085
Neg logit contributions
sburgh-0.123
opsis-0.118
ificant-0.104
ocene-0.102
heastern-0.101
Token very
Token ID845
Feature activation+2.44e+00

Pos logit contributions
culus+5.192
moral+4.467
picture+4.420
rely+4.375
 footing+4.299
Neg logit contributions
sburgh-5.574
opsis-5.350
ificant-4.651
ocene-4.607
heastern-4.468
Token heart
Token ID2612
Feature activation-2.54e-02

Pos logit contributions
culus+11.170
 footing+9.564
moral+9.452
picture+9.028
rely+9.011
Neg logit contributions
sburgh-14.842
opsis-13.773
ificant-12.162
ocene-12.132
heastern-11.660
Token of
Token ID286
Feature activation-1.47e-01

Pos logit contributions
sburgh+0.151
opsis+0.145
ificant+0.127
ocene+0.126
heastern+0.125
Neg logit contributions
culus-0.146
picture-0.124
moral-0.121
rely-0.120
 footing-0.112
Token our
Token ID674
Feature activation+1.04e-01

Pos logit contributions
sburgh+0.845
opsis+0.813
ificant+0.705
ocene+0.696
heastern+0.685
Neg logit contributions
culus-0.879
picture-0.755
moral-0.739
rely-0.724
 footing-0.690
Token geopolitical
Token ID35493
Feature activation+3.32e-01

Pos logit contributions
culus+0.589
picture+0.505
moral+0.497
rely+0.485
 footing+0.467
Neg logit contributions
sburgh-0.618
opsis-0.601
ificant-0.520
ocene-0.513
heastern-0.506
Token strategy
Token ID4811
Feature activation+2.00e-01

Pos logit contributions
culus+1.863
picture+1.607
moral+1.571
rely+1.540
 footing+1.519
Neg logit contributions
sburgh-1.994
opsis-1.937
ificant-1.695
ocene-1.660
heastern-1.646
Token student
Token ID3710
Feature activation-5.71e-01

Pos logit contributions
culus+2.347
picture+1.958
moral+1.927
rely+1.893
 footing+1.812
Neg logit contributions
sburgh-2.591
opsis-2.505
ificant-2.193
ocene-2.184
heastern-2.134
Token who
Token ID508
Feature activation+1.10e-02

Pos logit contributions
sburgh+3.059
opsis+2.943
ificant+2.525
ocene+2.471
heastern+2.451
Neg logit contributions
culus-3.559
picture-3.067
moral-3.006
rely-2.958
 footing-2.780
Token argues
Token ID11673
Feature activation+4.27e-01

Pos logit contributions
culus+0.066
picture+0.057
moral+0.056
rely+0.055
 footing+0.052
Neg logit contributions
sburgh-0.062
opsis-0.060
ificant-0.052
ocene-0.051
heastern-0.051
Token he
Token ID339
Feature activation+5.75e-01

Pos logit contributions
culus+2.626
picture+2.259
moral+2.217
rely+2.187
 footing+2.093
Neg logit contributions
sburgh-2.344
opsis-2.271
ificant-1.950
ocene-1.924
heastern-1.887
Token was
Token ID373
Feature activation+1.03e+00

Pos logit contributions
culus+3.300
picture+2.813
moral+2.752
rely+2.701
 footing+2.638
Neg logit contributions
sburgh-3.363
opsis-3.235
ificant-2.841
ocene-2.812
heastern-2.771
Token completely
Token ID3190
Feature activation+2.40e+00

Pos logit contributions
culus+5.697
picture+4.815
moral+4.737
 footing+4.612
rely+4.607
Neg logit contributions
sburgh-6.133
opsis-6.006
ocene-5.167
ificant-5.140
heastern-5.093
Token denied
Token ID6699
Feature activation+1.47e+00

Pos logit contributions
culus+12.329
moral+10.463
 footing+10.296
 sexes+10.170
picture+10.117
Neg logit contributions
sburgh-13.494
opsis-12.980
heastern-11.401
ocene-11.377
ificant-11.087
Token his
Token ID465
Feature activation+1.06e+00

Pos logit contributions
culus+7.641
 footing+6.247
rely+6.242
picture+6.210
moral+6.193
Neg logit contributions
sburgh-9.153
opsis-8.653
ocene-7.682
ificant-7.624
heastern-7.474
Token right
Token ID826
Feature activation+1.02e+00

Pos logit contributions
culus+5.519
picture+4.600
moral+4.565
rely+4.510
 footing+4.489
Neg logit contributions
sburgh-6.680
opsis-6.381
ificant-5.656
ocene-5.627
heastern-5.443
Token to
Token ID284
Feature activation+8.93e-01

Pos logit contributions
culus+5.841
picture+5.011
moral+4.932
rely+4.809
 footing+4.733
Neg logit contributions
sburgh-5.934
opsis-5.638
ificant-4.945
ocene-4.929
heastern-4.817
Token confront
Token ID7239
Feature activation+4.16e-01

Pos logit contributions
culus+4.856
picture+4.082
moral+4.014
rely+3.934
 footing+3.834
Neg logit contributions
sburgh-5.496
opsis-5.221
ificant-4.626
ocene-4.578
heastern-4.512
Token the
Token ID262
Feature activation+4.08e-01

Pos logit contributions
culus+0.281
picture+0.242
moral+0.236
rely+0.232
 footing+0.221
Neg logit contributions
sburgh-0.266
opsis-0.256
ificant-0.221
ocene-0.219
heastern-0.215
Token night
Token ID1755
Feature activation+6.15e-01

Pos logit contributions
culus+2.226
picture+1.871
moral+1.844
rely+1.817
 footing+1.725
Neg logit contributions
sburgh-2.531
opsis-2.464
ificant-2.151
ocene-2.122
heastern-2.095
Token we
Token ID356
Feature activation+6.83e-01

Pos logit contributions
culus+3.544
picture+3.026
moral+2.957
rely+2.947
 footing+2.789
Neg logit contributions
sburgh-3.564
opsis-3.460
ificant-2.982
ocene-2.976
heastern-2.912
Token were
Token ID547
Feature activation+1.15e+00

Pos logit contributions
culus+3.931
picture+3.358
rely+3.271
moral+3.240
 footing+3.089
Neg logit contributions
sburgh-3.939
opsis-3.832
ificant-3.349
ocene-3.318
heastern-3.255
Token all
Token ID477
Feature activation+1.56e+00

Pos logit contributions
culus+6.446
picture+5.490
moral+5.392
rely+5.380
 footing+5.281
Neg logit contributions
opsis-6.645
sburgh-6.638
ocene-5.767
ificant-5.655
heastern-5.496
Token reeling
Token ID45341
Feature activation+2.38e+00

Pos logit contributions
culus+8.475
picture+7.383
moral+7.195
 footing+7.189
rely+7.135
Neg logit contributions
opsis-8.655
sburgh-8.573
ocene-7.537
ificant-7.399
heastern-7.203
Token from
Token ID422
Feature activation+1.02e+00

Pos logit contributions
culus+10.994
moral+9.118
rely+9.062
picture+8.890
 footing+8.638
Neg logit contributions
sburgh-13.947
opsis-13.513
ificant-11.924
ocene-11.871
heastern-11.561
Token shock
Token ID6380
Feature activation+1.39e+00

Pos logit contributions
culus+5.614
picture+4.725
rely+4.633
moral+4.630
 footing+4.333
Neg logit contributions
sburgh-5.985
opsis-5.911
ocene-5.085
ificant-5.034
heastern-4.999
Token and
Token ID290
Feature activation+2.17e-01

Pos logit contributions
culus+7.633
picture+6.414
rely+6.397
moral+6.379
 footing+5.935
Neg logit contributions
sburgh-8.067
opsis-7.970
ificant-6.889
heastern-6.773
ocene-6.679
Token grief
Token ID18522
Feature activation+8.50e-01

Pos logit contributions
culus+1.245
picture+1.063
moral+1.039
rely+1.020
 footing+0.971
Neg logit contributions
sburgh-1.289
opsis-1.243
ificant-1.081
ocene-1.072
heastern-1.058
Token.
Token ID13
Feature activation+3.68e-01

Pos logit contributions
culus+4.950
picture+4.242
moral+4.186
rely+4.135
 footing+3.956
Neg logit contributions
sburgh-4.788
opsis-4.653
ificant-4.016
ocene-3.987
heastern-3.898
Token of
Token ID286
Feature activation-1.57e-01

Pos logit contributions
sburgh+1.762
opsis+1.702
ificant+1.501
ocene+1.479
heastern+1.465
Neg logit contributions
culus-1.473
picture-1.237
moral-1.205
rely-1.179
 footing-1.107
Token these
Token ID777
Feature activation+2.90e-02

Pos logit contributions
sburgh+0.823
opsis+0.789
ificant+0.674
ocene+0.664
heastern+0.653
Neg logit contributions
culus-1.012
picture-0.881
moral-0.863
rely-0.846
 footing-0.811
Token interactions
Token ID12213
Feature activation-2.83e-01

Pos logit contributions
culus+0.155
picture+0.131
moral+0.127
rely+0.125
 footing+0.118
Neg logit contributions
sburgh-0.184
opsis-0.178
ificant-0.157
ocene-0.155
heastern-0.153
Token,
Token ID11
Feature activation-4.00e-01

Pos logit contributions
sburgh+1.765
opsis+1.712
ificant+1.503
ocene+1.478
heastern+1.462
Neg logit contributions
culus-1.531
picture-1.293
moral-1.266
rely-1.232
 footing-1.160
Token I
Token ID314
Feature activation-7.72e-01

Pos logit contributions
sburgh+2.119
opsis+2.031
ificant+1.745
ocene+1.711
heastern+1.686
Neg logit contributions
culus-2.552
picture-2.209
moral-2.170
rely-2.115
 footing-2.025
Token very
Token ID845
Feature activation+2.37e+00

Pos logit contributions
sburgh+4.282
opsis+4.125
ificant+3.527
ocene+3.511
heastern+3.425
Neg logit contributions
culus-4.635
picture-4.013
moral-3.971
rely-3.806
 footing-3.630
Token possibly
Token ID5457
Feature activation-2.06e-02

Pos logit contributions
culus+13.117
rely+11.700
picture+11.248
moral+10.771
 footing+10.555
Neg logit contributions
sburgh-11.616
opsis-11.179
ocene-10.057
ificant-9.920
heastern-9.588
Token could
Token ID714
Feature activation+6.52e-01

Pos logit contributions
sburgh+0.121
opsis+0.116
ificant+0.101
ocene+0.100
heastern+0.098
Neg logit contributions
culus-0.121
picture-0.103
moral-0.100
rely-0.099
 footing-0.094
Token have
Token ID423
Feature activation+5.29e-01

Pos logit contributions
culus+3.276
picture+2.755
rely+2.703
moral+2.651
 footing+2.530
Neg logit contributions
sburgh-4.162
opsis-4.058
ificant-3.601
ocene-3.575
heastern-3.489
Token been
Token ID587
Feature activation+2.85e-01

Pos logit contributions
culus+3.207
picture+2.791
rely+2.759
moral+2.703
 footing+2.641
Neg logit contributions
sburgh-2.870
opsis-2.795
ificant-2.385
ocene-2.377
heastern-2.302
Token more
Token ID517
Feature activation+1.07e+00

Pos logit contributions
culus+1.702
picture+1.465
moral+1.426
rely+1.410
 footing+1.374
Neg logit contributions
sburgh-1.625
opsis-1.572
ificant-1.352
ocene-1.345
heastern-1.312
Tokenuse
Token ID1904
Feature activation+6.14e-02

Pos logit contributions
sburgh+1.428
opsis+1.382
ificant+1.215
ocene+1.184
heastern+1.170
Neg logit contributions
culus-1.340
picture-1.141
moral-1.109
rely-1.089
 footing-1.038
Token apologies
Token ID32920
Feature activation-9.14e-02

Pos logit contributions
culus+0.370
picture+0.318
moral+0.311
rely+0.304
 footing+0.290
Neg logit contributions
sburgh-0.349
opsis-0.335
ificant-0.290
ocene-0.287
heastern-0.282
Token to
Token ID284
Feature activation-3.21e-01

Pos logit contributions
sburgh+0.503
opsis+0.483
ificant+0.416
ocene+0.410
heastern+0.404
Neg logit contributions
culus-0.566
picture-0.489
moral-0.479
rely-0.469
 footing-0.449
Token the
Token ID262
Feature activation-2.88e-01

Pos logit contributions
sburgh+1.818
opsis+1.742
ificant+1.513
ocene+1.492
heastern+1.473
Neg logit contributions
culus-1.929
picture-1.662
moral-1.621
rely-1.586
 footing-1.509
Token Motion
Token ID20843
Feature activation+1.51e+00

Pos logit contributions
sburgh+1.690
opsis+1.619
ificant+1.414
ocene+1.394
heastern+1.377
Neg logit contributions
culus-1.676
picture-1.436
moral-1.401
rely-1.370
 footing-1.299
Token Picture
Token ID17741
Feature activation+2.37e+00

Pos logit contributions
culus+4.402
picture+4.107
moral+3.400
rely+3.136
 footing+2.852
Neg logit contributions
sburgh-12.136
opsis-11.564
ificant-10.588
heastern-10.465
ocene-10.393
Token Academy
Token ID8581
Feature activation-8.34e-02

Pos logit contributions
culus+9.725
picture+9.098
moral+8.287
rely+7.927
 footing+7.589
Neg logit contributions
opsis-14.576
sburgh-14.118
heastern-12.967
ocene-12.837
 externalToEVAOnly-12.417
Token and
Token ID290
Feature activation-1.79e-01

Pos logit contributions
sburgh+0.497
opsis+0.478
ificant+0.417
ocene+0.412
heastern+0.406
Neg logit contributions
culus-0.479
picture-0.409
moral-0.399
rely-0.391
 footing-0.372
Token to
Token ID284
Feature activation-9.50e-03

Pos logit contributions
sburgh+1.011
opsis+0.971
ificant+0.842
ocene+0.828
heastern+0.817
Neg logit contributions
culus-1.074
picture-0.923
moral-0.905
rely-0.885
 footing-0.843
Token the
Token ID262
Feature activation+1.12e-01

Pos logit contributions
sburgh+0.054
opsis+0.052
ificant+0.045
ocene+0.045
heastern+0.044
Neg logit contributions
culus-0.057
picture-0.049
moral-0.048
rely-0.047
 footing-0.045
Token filmmakers
Token ID28303
Feature activation+5.11e-01

Pos logit contributions
culus+0.654
picture+0.560
moral+0.547
rely+0.536
 footing+0.511
Neg logit contributions
sburgh-0.658
opsis-0.636
ificant-0.551
ocene-0.548
heastern-0.537
Token up
Token ID510
Feature activation-5.08e-01

Pos logit contributions
sburgh+1.113
opsis+1.068
ificant+0.922
ocene+0.909
heastern+0.895
Neg logit contributions
culus-1.233
picture-1.064
moral-1.042
rely-1.019
 footing-0.973
Token for
Token ID329
Feature activation+3.72e-01

Pos logit contributions
sburgh+2.900
opsis+2.766
ificant+2.403
ocene+2.366
heastern+2.342
Neg logit contributions
culus-3.023
picture-2.593
moral-2.539
rely-2.466
 footing-2.353
Token discussion
Token ID5114
Feature activation+3.27e-01

Pos logit contributions
culus+2.154
picture+1.855
moral+1.813
rely+1.782
 footing+1.718
Neg logit contributions
sburgh-2.171
opsis-2.088
ificant-1.811
ocene-1.790
heastern-1.784
Token (
Token ID357
Feature activation+2.48e-01

Pos logit contributions
culus+1.787
picture+1.528
moral+1.476
rely+1.458
 footing+1.387
Neg logit contributions
sburgh-2.025
opsis-1.948
ificant-1.706
ocene-1.694
heastern-1.678
Tokeneven
Token ID10197
Feature activation-4.03e-01

Pos logit contributions
culus+1.447
picture+1.261
moral+1.229
rely+1.200
 footing+1.129
Neg logit contributions
sburgh-1.449
opsis-1.403
ificant-1.215
ocene-1.197
heastern-1.185
Token mang
Token ID45663
Feature activation+2.37e+00

Pos logit contributions
sburgh+1.981
opsis+1.883
ificant+1.595
ocene+1.559
heastern+1.534
Neg logit contributions
culus-2.729
picture-2.390
moral-2.346
rely-2.294
 footing-2.200
Tokenachu
Token ID32323
Feature activation+3.68e-01

Pos logit contributions
culus+13.146
moral+11.228
rely+11.186
picture+11.151
chat+10.377
Neg logit contributions
sburgh-11.544
opsis-11.059
heastern-9.400
ocene-9.264
ificant-9.252
Token or
Token ID393
Feature activation-6.34e-01

Pos logit contributions
culus+2.165
picture+1.851
moral+1.808
rely+1.782
 footing+1.702
Neg logit contributions
sburgh-2.116
opsis-2.043
ificant-1.769
ocene-1.762
heastern-1.741
Token bre
Token ID1449
Feature activation+2.50e-01

Pos logit contributions
sburgh+3.480
opsis+3.344
ificant+2.890
ocene+2.835
heastern+2.771
Neg logit contributions
culus-3.848
picture-3.345
moral-3.267
rely-3.185
 footing-3.033
Tokenade
Token ID671
Feature activation+3.63e-01

Pos logit contributions
culus+1.303
picture+1.091
moral+1.065
rely+1.034
 footing+0.974
Neg logit contributions
sburgh-1.624
opsis-1.570
ificant-1.390
ocene-1.371
heastern-1.354
Tokenx
Token ID87
Feature activation+4.60e-01

Pos logit contributions
culus+2.124
picture+1.799
moral+1.768
rely+1.742
 footing+1.646
Neg logit contributions
sburgh-2.126
opsis-2.037
ificant-1.776
ocene-1.757
heastern-1.740
Token with
Token ID351
Feature activation+2.84e-01

Pos logit contributions
culus+3.538
picture+3.055
moral+2.953
rely+2.929
 footing+2.803
Neg logit contributions
sburgh-3.734
opsis-3.531
ificant-3.164
ocene-3.139
heastern-3.016
Token 2
Token ID362
Feature activation-3.91e-01

Pos logit contributions
culus+1.714
picture+1.472
moral+1.432
rely+1.405
 footing+1.354
Neg logit contributions
sburgh-1.621
opsis-1.544
ificant-1.347
ocene-1.334
heastern-1.301
Token cups
Token ID14180
Feature activation-3.35e-01

Pos logit contributions
sburgh+2.346
opsis+2.281
ificant+1.980
heastern+1.951
ocene+1.949
Neg logit contributions
culus-2.180
picture-1.845
moral-1.830
rely-1.783
 footing-1.678
Token of
Token ID286
Feature activation-7.35e-02

Pos logit contributions
sburgh+1.895
opsis+1.838
ificant+1.587
ocene+1.552
heastern+1.548
Neg logit contributions
culus-1.999
picture-1.713
moral-1.688
rely-1.651
 footing-1.559
Token frozen
Token ID12912
Feature activation-1.61e-01

Pos logit contributions
sburgh+0.398
opsis+0.381
ificant+0.328
ocene+0.324
heastern+0.318
Neg logit contributions
culus-0.462
picture-0.400
moral-0.391
rely-0.384
 footing-0.369
Token mixed
Token ID7668
Feature activation+2.36e+00

Pos logit contributions
sburgh+0.945
opsis+0.915
ificant+0.795
ocene+0.781
heastern+0.775
Neg logit contributions
culus-0.931
picture-0.794
moral-0.780
rely-0.760
 footing-0.718
Token berries
Token ID36322
Feature activation+1.75e-01

Pos logit contributions
culus+10.948
picture+9.255
rely+8.885
 footing+8.830
moral+8.757
Neg logit contributions
sburgh-13.902
opsis-12.319
ocene-12.191
ificant-11.576
 externalToEVAOnly-11.375
Token:
Token ID25
Feature activation+1.33e-01

Pos logit contributions
culus+1.052
picture+0.906
moral+0.884
rely+0.866
 footing+0.830
Neg logit contributions
sburgh-0.997
opsis-0.956
ificant-0.831
ocene-0.821
heastern-0.806
Token
Token ID198
Feature activation+8.91e-01

Pos logit contributions
culus+0.797
picture+0.686
moral+0.664
rely+0.655
 footing+0.626
Neg logit contributions
sburgh-0.770
opsis-0.736
ificant-0.640
ocene-0.634
heastern-0.621
Token
Token ID198
Feature activation+5.03e-01

Pos logit contributions
culus+4.196
picture+3.787
rely+3.540
moral+3.507
 footing+3.104
Neg logit contributions
sburgh-5.828
opsis-5.681
ificant-5.109
ocene-5.070
heastern-5.052
TokenAnd
Token ID1870
Feature activation+9.02e-01

Pos logit contributions
culus+2.923
picture+2.557
moral+2.455
rely+2.403
 footing+2.244
Neg logit contributions
sburgh-2.974
opsis-2.824
ificant-2.482
ocene-2.447
heastern-2.386
Token fail
Token ID2038
Feature activation+6.86e-01

Pos logit contributions
culus+1.548
picture+1.351
moral+1.320
rely+1.300
 footing+1.237
Neg logit contributions
sburgh-1.324
opsis-1.270
ificant-1.093
ocene-1.082
heastern-1.062
Token miser
Token ID18790
Feature activation-4.91e-01

Pos logit contributions
culus+3.864
picture+3.346
moral+3.272
rely+3.189
 footing+3.041
Neg logit contributions
sburgh-4.068
opsis-3.931
ificant-3.452
ocene-3.424
heastern-3.341
Tokenably
Token ID1346
Feature activation+1.39e-01

Pos logit contributions
sburgh+1.746
opsis+1.624
ificant+1.296
ocene+1.240
heastern+1.218
Neg logit contributions
culus-3.965
picture-3.533
moral-3.504
rely-3.390
 footing-3.349
Token at
Token ID379
Feature activation+2.59e-01

Pos logit contributions
culus+0.742
picture+0.623
moral+0.607
rely+0.593
 footing+0.561
Neg logit contributions
sburgh-0.886
opsis-0.856
ificant-0.753
ocene-0.744
heastern-0.736
Token the
Token ID262
Feature activation+9.07e-01

Pos logit contributions
culus+1.499
picture+1.307
moral+1.268
rely+1.238
 footing+1.190
Neg logit contributions
sburgh-1.520
opsis-1.457
ificant-1.273
ocene-1.261
heastern-1.241
Token box
Token ID3091
Feature activation+2.32e+00

Pos logit contributions
culus+4.899
picture+4.319
moral+4.160
rely+4.040
 footing+3.846
Neg logit contributions
sburgh-5.569
opsis-5.356
ificant-4.706
ocene-4.636
heastern-4.571
Token office
Token ID2607
Feature activation+2.71e-01

Pos logit contributions
culus+7.693
picture+6.832
moral+6.323
 footing+5.564
rely+5.485
Neg logit contributions
sburgh-16.487
opsis-15.976
heastern-14.347
 externalToEVAOnly-14.271
ificant-14.213
Token,
Token ID11
Feature activation+1.44e-01

Pos logit contributions
culus+1.404
picture+1.173
moral+1.143
rely+1.118
 footing+1.052
Neg logit contributions
sburgh-1.770
opsis-1.712
ificant-1.513
ocene-1.497
heastern-1.477
Token Hitchcock
Token ID49817
Feature activation+3.94e-01

Pos logit contributions
culus+0.936
picture+0.818
moral+0.800
rely+0.786
 footing+0.752
Neg logit contributions
sburgh-0.747
opsis-0.716
ificant-0.610
ocene-0.604
heastern-0.591
Token offered
Token ID4438
Feature activation-1.51e-01

Pos logit contributions
culus+2.255
picture+1.934
moral+1.886
rely+1.842
 footing+1.769
Neg logit contributions
sburgh-2.332
opsis-2.241
ificant-1.972
ocene-1.959
heastern-1.909
Token to
Token ID284
Feature activation+2.78e-01

Pos logit contributions
sburgh+0.836
opsis+0.805
ificant+0.694
ocene+0.684
heastern+0.673
Neg logit contributions
culus-0.927
picture-0.797
moral-0.779
rely-0.766
 footing-0.731
Token bananas
Token ID35484
Feature activation+6.95e-01

Pos logit contributions
culus+2.958
picture+2.555
moral+2.482
rely+2.434
 footing+2.354
Neg logit contributions
sburgh-2.715
opsis-2.600
ificant-2.256
ocene-2.238
heastern-2.174
Token,
Token ID11
Feature activation-1.55e-01

Pos logit contributions
culus+3.465
picture+2.888
moral+2.783
rely+2.703
 footing+2.605
Neg logit contributions
sburgh-4.633
opsis-4.442
ificant-3.971
ocene-3.932
heastern-3.859
Token peanut
Token ID26636
Feature activation+1.13e+00

Pos logit contributions
sburgh+0.862
opsis+0.832
ificant+0.715
ocene+0.705
heastern+0.696
Neg logit contributions
culus-0.953
picture-0.820
moral-0.806
rely-0.789
 footing-0.750
Token butter
Token ID9215
Feature activation+2.84e-01

Pos logit contributions
culus+6.123
picture+5.242
moral+5.069
rely+4.994
 footing+4.772
Neg logit contributions
sburgh-6.736
opsis-6.504
ocene-5.773
ificant-5.724
heastern-5.631
Token,
Token ID11
Feature activation+1.38e-01

Pos logit contributions
culus+1.475
picture+1.232
moral+1.208
rely+1.179
 footing+1.104
Neg logit contributions
sburgh-1.850
opsis-1.788
ificant-1.575
ocene-1.557
heastern-1.539
Token mixed
Token ID7668
Feature activation+2.31e+00

Pos logit contributions
culus+0.790
picture+0.675
moral+0.658
rely+0.644
 footing+0.614
Neg logit contributions
sburgh-0.821
opsis-0.791
ificant-0.690
ocene-0.683
heastern-0.672
Token berries
Token ID36322
Feature activation+2.67e-01

Pos logit contributions
culus+11.304
picture+9.561
 footing+9.154
moral+9.092
 sexes+9.074
Neg logit contributions
sburgh-13.005
opsis-11.635
ocene-11.499
ificant-11.139
heastern-10.600
Token,
Token ID11
Feature activation+1.66e-01

Pos logit contributions
culus+1.376
picture+1.150
moral+1.125
rely+1.098
 footing+1.028
Neg logit contributions
sburgh-1.748
opsis-1.696
ificant-1.494
ocene-1.474
heastern-1.464
Token and
Token ID290
Feature activation+1.69e-01

Pos logit contributions
culus+0.959
picture+0.820
moral+0.799
rely+0.782
 footing+0.747
Neg logit contributions
sburgh-0.982
opsis-0.944
ificant-0.824
ocene-0.815
heastern-0.802
Token
Token ID960
Feature activation+1.67e+00

Pos logit contributions
culus+1.041
picture+0.902
moral+0.877
rely+0.861
 footing+0.828
Neg logit contributions
sburgh-0.933
opsis-0.892
ificant-0.770
ocene-0.764
heastern-0.748
Tokenthe
Token ID1169
Feature activation+1.98e-01

Pos logit contributions
culus+9.456
picture+8.540
moral+8.187
rely+8.098
 footing+7.403
Neg logit contributions
sburgh-8.831
opsis-8.518
ificant-7.453
ocene-7.374
 externalToEVAOnly-7.230
Tokenicated
Token ID3474
Feature activation-9.05e-02

Pos logit contributions
sburgh+0.462
opsis+0.439
ificant+0.361
ocene+0.353
heastern+0.344
Neg logit contributions
culus-0.807
picture-0.717
moral-0.703
rely-0.691
 footing-0.669
Token in
Token ID287
Feature activation+2.48e-02

Pos logit contributions
sburgh+0.483
opsis+0.464
ificant+0.398
ocene+0.393
heastern+0.386
Neg logit contributions
culus-0.575
picture-0.499
moral-0.489
rely-0.480
 footing-0.460
Token minutes
Token ID2431
Feature activation+2.62e-01

Pos logit contributions
culus+0.152
picture+0.131
moral+0.129
rely+0.126
 footing+0.121
Neg logit contributions
sburgh-0.137
opsis-0.132
ificant-0.114
ocene-0.112
heastern-0.111
Token.
Token ID13
Feature activation-4.84e-01

Pos logit contributions
culus+1.581
picture+1.361
moral+1.331
rely+1.304
 footing+1.253
Neg logit contributions
sburgh-1.478
opsis-1.417
ificant-1.229
ocene-1.211
heastern-1.194
Token Political
Token ID14611
Feature activation-3.16e-01

Pos logit contributions
sburgh+2.490
opsis+2.369
ificant+2.021
ocene+1.989
heastern+1.934
Neg logit contributions
culus-3.173
picture-2.740
moral-2.696
rely-2.647
 footing-2.553
Token jugg
Token ID24873
Feature activation+2.31e+00

Pos logit contributions
sburgh+1.874
opsis+1.801
ificant+1.567
ocene+1.553
heastern+1.525
Neg logit contributions
culus-1.816
picture-1.553
moral-1.512
rely-1.477
 footing-1.394
Tokenern
Token ID1142
Feature activation-3.96e-01

Pos logit contributions
culus+9.416
picture+7.834
rely+7.436
moral+7.351
eer+6.336
Neg logit contributions
sburgh-14.754
opsis-14.598
ocene-13.397
ificant-12.999
 externalToEVAOnly-12.784
Tokenauts
Token ID17712
Feature activation+9.95e-02

Pos logit contributions
sburgh+1.804
opsis+1.742
ificant+1.461
ocene+1.415
heastern+1.388
Neg logit contributions
culus-2.785
picture-2.483
moral-2.417
rely-2.381
 footing-2.318
Token had
Token ID550
Feature activation-3.87e-01

Pos logit contributions
culus+0.563
picture+0.478
moral+0.467
rely+0.457
 footing+0.436
Neg logit contributions
sburgh-0.602
opsis-0.580
ificant-0.508
ocene-0.503
heastern-0.494
Token fallen
Token ID9292
Feature activation-7.56e-01

Pos logit contributions
sburgh+2.041
opsis+1.953
ificant+1.670
ocene+1.638
heastern+1.618
Neg logit contributions
culus-2.475
picture-2.150
moral-2.105
rely-2.053
 footing-1.956
Token to
Token ID284
Feature activation-1.97e-01

Pos logit contributions
sburgh+4.065
opsis+3.891
ificant+3.357
ocene+3.279
heastern+3.249
Neg logit contributions
culus-4.654
picture-4.056
moral-3.946
rely-3.879
 footing-3.595
Token people
Token ID661
Feature activation+1.41e-01

Pos logit contributions
culus+1.688
picture+1.436
moral+1.407
rely+1.372
 footing+1.311
Neg logit contributions
sburgh-1.968
opsis-1.911
ificant-1.670
ocene-1.660
heastern-1.631
Token,
Token ID11
Feature activation+6.93e-01

Pos logit contributions
culus+0.806
picture+0.689
moral+0.672
rely+0.659
 footing+0.629
Neg logit contributions
sburgh-0.845
opsis-0.815
ificant-0.710
ocene-0.704
heastern-0.691
Token and
Token ID290
Feature activation+5.46e-01

Pos logit contributions
culus+3.718
picture+3.189
moral+3.121
rely+3.059
 footing+2.938
Neg logit contributions
sburgh-4.281
opsis-4.128
ificant-3.599
ocene-3.585
heastern-3.514
Token at
Token ID379
Feature activation-9.37e-02

Pos logit contributions
culus+3.145
picture+2.706
moral+2.651
rely+2.629
 footing+2.534
Neg logit contributions
sburgh-3.181
opsis-3.043
ificant-2.660
ocene-2.646
heastern-2.589
Token the
Token ID262
Feature activation+7.45e-01

Pos logit contributions
sburgh+0.522
opsis+0.501
ificant+0.432
ocene+0.427
heastern+0.420
Neg logit contributions
culus-0.575
picture-0.496
moral-0.486
rely-0.476
 footing-0.455
Token very
Token ID845
Feature activation+2.31e+00

Pos logit contributions
culus+3.706
picture+3.088
moral+3.064
rely+3.006
 footing+2.849
Neg logit contributions
sburgh-4.964
opsis-4.748
ificant-4.252
ocene-4.169
heastern-4.077
Token least
Token ID1551
Feature activation+6.55e-01

Pos logit contributions
culus+10.391
moral+8.703
picture+8.574
rely+8.333
 footing+8.187
Neg logit contributions
sburgh-14.518
opsis-12.972
ificant-12.124
ocene-11.889
heastern-11.321
Token,
Token ID11
Feature activation+7.76e-01

Pos logit contributions
culus+3.702
picture+3.174
moral+3.106
rely+3.070
 footing+2.972
Neg logit contributions
sburgh-3.864
opsis-3.716
ocene-3.246
ificant-3.244
heastern-3.159
Token ensure
Token ID4155
Feature activation-5.90e-02

Pos logit contributions
culus+4.284
picture+3.693
moral+3.621
rely+3.578
 footing+3.477
Neg logit contributions
sburgh-4.618
opsis-4.433
ocene-3.887
ificant-3.878
heastern-3.782
Token the
Token ID262
Feature activation+1.92e-02

Pos logit contributions
sburgh+0.308
opsis+0.295
ificant+0.252
ocene+0.248
heastern+0.244
Neg logit contributions
culus-0.382
picture-0.333
moral-0.326
rely-0.319
 footing-0.306
Token president
Token ID1893
Feature activation+3.47e-01

Pos logit contributions
culus+0.115
picture+0.099
moral+0.097
rely+0.095
 footing+0.091
Neg logit contributions
sburgh-0.110
opsis-0.106
ificant-0.092
ocene-0.090
heastern-0.089
Token plan
Token ID1410
Feature activation-3.79e-01

Pos logit contributions
sburgh+1.600
opsis+1.544
ificant+1.352
ocene+1.336
heastern+1.315
Neg logit contributions
culus-1.459
picture-1.235
moral-1.205
rely-1.178
 footing-1.114
Token
Token ID960
Feature activation+4.30e-01

Pos logit contributions
sburgh+2.162
opsis+2.082
ificant+1.806
ocene+1.777
heastern+1.757
Neg logit contributions
culus-2.251
picture-1.934
moral-1.892
rely-1.859
 footing-1.757
Tokenthe
Token ID1169
Feature activation-3.79e-01

Pos logit contributions
culus+2.340
picture+2.015
moral+1.973
rely+1.921
 footing+1.797
Neg logit contributions
sburgh-2.672
opsis-2.600
ocene-2.250
ificant-2.247
heastern-2.214
Token situations
Token ID7445
Feature activation+3.54e-01

Pos logit contributions
sburgh+2.051
opsis+1.976
ificant+1.702
ocene+1.674
heastern+1.647
Neg logit contributions
culus-2.349
picture-2.026
moral-1.979
rely-1.939
 footing-1.848
Token might
Token ID1244
Feature activation-3.79e-01

Pos logit contributions
culus+2.118
picture+1.830
moral+1.795
rely+1.759
 footing+1.681
Neg logit contributions
sburgh-2.002
opsis-1.938
ificant-1.671
ocene-1.655
heastern-1.618
Token very
Token ID845
Feature activation+2.30e+00

Pos logit contributions
sburgh+2.202
opsis+2.097
ificant+1.835
ocene+1.803
heastern+1.785
Neg logit contributions
culus-2.231
picture-1.911
moral-1.866
rely-1.804
 footing-1.729
Token well
Token ID880
Feature activation-2.11e-01

Pos logit contributions
culus+13.464
rely+12.054
moral+11.850
picture+11.816
 footing+11.072
Neg logit contributions
opsis-10.555
sburgh-10.543
ocene-8.863
ificant-8.670
heastern-8.315
Token be
Token ID307
Feature activation-1.47e-01

Pos logit contributions
sburgh+1.276
opsis+1.224
ificant+1.073
ocene+1.058
heastern+1.044
Neg logit contributions
culus-1.197
picture-1.018
moral-0.994
rely-0.964
 footing-0.920
Token reversed
Token ID17687
Feature activation+3.40e-02

Pos logit contributions
sburgh+0.935
opsis+0.903
ificant+0.795
ocene+0.786
heastern+0.776
Neg logit contributions
culus-0.785
picture-0.661
moral-0.645
rely-0.630
 footing-0.596
Token.
Token ID13
Feature activation+4.26e-01

Pos logit contributions
culus+0.219
picture+0.190
moral+0.186
rely+0.183
 footing+0.175
Neg logit contributions
sburgh-0.179
opsis-0.171
ificant-0.147
ocene-0.144
heastern-0.142
Token It
Token ID632
Feature activation-3.03e-01

Pos logit contributions
culus+2.575
picture+2.270
moral+2.204
rely+2.166
 footing+2.028
Neg logit contributions
sburgh-2.337
opsis-2.281
heastern-1.945
ificant-1.943
ocene-1.937
Token the
Token ID262
Feature activation+1.60e-01

Pos logit contributions
culus+2.644
picture+2.253
moral+2.192
rely+2.162
 footing+2.048
Neg logit contributions
sburgh-2.853
opsis-2.762
ocene-2.401
ificant-2.399
heastern-2.346
Token #
Token ID1303
Feature activation-1.21e-01

Pos logit contributions
culus+0.907
picture+0.772
moral+0.753
rely+0.738
 footing+0.702
Neg logit contributions
sburgh-0.966
opsis-0.934
ificant-0.814
ocene-0.808
heastern-0.793
Tokendeb
Token ID11275
Feature activation+1.58e-01

Pos logit contributions
sburgh+0.662
opsis+0.635
ificant+0.547
ocene+0.539
heastern+0.531
Neg logit contributions
culus-0.755
picture-0.652
moral-0.638
rely-0.626
 footing-0.600
Tokenates
Token ID689
Feature activation+2.32e-01

Pos logit contributions
culus+1.109
picture+0.977
moral+0.958
rely+0.939
 footing+0.901
Neg logit contributions
sburgh-0.742
opsis-0.707
ificant-0.593
ocene-0.586
heastern-0.572
Token #
Token ID1303
Feature activation-1.39e-01

Pos logit contributions
culus+1.343
picture+1.159
moral+1.123
rely+1.097
 footing+1.051
Neg logit contributions
sburgh-1.364
opsis-1.315
ificant-1.145
ocene-1.134
heastern-1.110
TokenOccup
Token ID47658
Feature activation-3.06e+00

Pos logit contributions
sburgh+0.815
opsis+0.784
ificant+0.683
ocene+0.674
heastern+0.664
Neg logit contributions
culus-0.813
picture-0.695
moral-0.679
rely-0.665
 footing-0.634
Tokeny
Token ID88
Feature activation-1.18e+00

Pos logit contributions
sburgh+12.440
opsis+11.205
ificant+9.967
ocene+9.461
heastern+9.029
Neg logit contributions
culus-17.682
picture-15.898
moral-15.253
rely-15.147
 footing-15.074
TokenThe
Token ID464
Feature activation+2.59e-01

Pos logit contributions
sburgh+6.601
opsis+6.237
ificant+5.461
ocene+5.308
heastern+5.284
Neg logit contributions
culus-6.866
picture-5.794
moral-5.737
rely-5.638
 footing-5.349
TokenDeb
Token ID16587
Feature activation+3.10e-01

Pos logit contributions
culus+1.639
picture+1.431
moral+1.397
rely+1.360
 footing+1.292
Neg logit contributions
sburgh-1.376
opsis-1.337
ificant-1.141
ocene-1.130
heastern-1.106
Tokenates
Token ID689
Feature activation-1.06e-01

Pos logit contributions
culus+2.161
picture+1.899
moral+1.863
rely+1.824
 footing+1.748
Neg logit contributions
sburgh-1.457
opsis-1.385
ificant-1.164
ocene-1.151
heastern-1.122
Token pic
Token ID8301
Feature activation+9.56e-02

Pos logit contributions
sburgh+0.618
opsis+0.596
ificant+0.518
ocene+0.512
heastern+0.504
Neg logit contributions
culus-0.618
picture-0.530
moral-0.518
rely-0.506
 footing-0.483
Token stage
Token ID3800
Feature activation-1.08e+00

Pos logit contributions
sburgh+1.763
opsis+1.690
ificant+1.452
ocene+1.433
heastern+1.409
Neg logit contributions
culus-2.048
picture-1.772
moral-1.735
rely-1.703
 footing-1.623
Token win
Token ID1592
Feature activation-1.01e+00

Pos logit contributions
sburgh+5.405
opsis+5.249
ificant+4.405
heastern+4.353
ocene+4.349
Neg logit contributions
culus-7.051
picture-5.981
moral-5.922
rely-5.852
 footing-5.470
Token record
Token ID1700
Feature activation-1.43e+00

Pos logit contributions
sburgh+5.232
opsis+5.006
ificant+4.292
ocene+4.130
heastern+4.096
Neg logit contributions
culus-6.373
picture-5.472
moral-5.404
rely-5.315
 footing-5.021
Token at
Token ID379
Feature activation-7.96e-01

Pos logit contributions
sburgh+7.276
opsis+6.873
ificant+5.890
ocene+5.735
heastern+5.720
Neg logit contributions
culus-8.916
picture-7.556
moral-7.522
rely-7.413
 footing-6.885
Token the
Token ID262
Feature activation+5.18e-02

Pos logit contributions
sburgh+4.399
opsis+4.254
ificant+3.677
ocene+3.612
heastern+3.539
Neg logit contributions
culus-4.789
picture-4.096
moral-4.043
rely-3.984
 footing-3.754
Token Tour
Token ID9852
Feature activation-2.67e+00

Pos logit contributions
culus+0.292
picture+0.249
moral+0.243
rely+0.238
 footing+0.227
Neg logit contributions
sburgh-0.314
opsis-0.303
ificant-0.265
ocene-0.261
heastern-0.258
Token Down
Token ID5588
Feature activation-4.74e-01

Pos logit contributions
sburgh+11.326
opsis+10.809
ificant+9.295
heastern+9.243
 somewhere+8.997
Neg logit contributions
culus-16.009
picture-13.889
moral-13.762
rely-13.437
 footing-12.403
Token Under
Token ID4698
Feature activation-7.64e-01

Pos logit contributions
sburgh+2.609
opsis+2.513
ificant+2.164
ocene+2.136
heastern+2.096
Neg logit contributions
culus-2.940
picture-2.536
moral-2.491
rely-2.435
 footing-2.332
Token.
Token ID13
Feature activation-2.85e-01

Pos logit contributions
sburgh+3.937
opsis+3.754
ificant+3.185
ocene+3.139
heastern+3.095
Neg logit contributions
culus-4.919
picture-4.264
moral-4.206
rely-4.135
 footing-3.913
Token He
Token ID679
Feature activation-8.42e-01

Pos logit contributions
sburgh+1.466
opsis+1.398
ificant+1.191
ocene+1.172
heastern+1.149
Neg logit contributions
culus-1.865
picture-1.618
moral-1.593
rely-1.560
 footing-1.501
Token had
Token ID550
Feature activation-5.05e-01

Pos logit contributions
sburgh+4.502
opsis+4.302
ificant+3.658
ocene+3.604
heastern+3.563
Neg logit contributions
culus-5.247
picture-4.560
moral-4.505
rely-4.392
 footing-4.179
Token the
Token ID262
Feature activation+5.36e-01

Pos logit contributions
culus+0.655
picture+0.560
moral+0.547
rely+0.535
 footing+0.510
Neg logit contributions
sburgh-0.665
opsis-0.641
ificant-0.558
ocene-0.552
heastern-0.544
Token shift
Token ID6482
Feature activation+9.98e-01

Pos logit contributions
culus+3.117
picture+2.679
moral+2.624
rely+2.563
 footing+2.456
Neg logit contributions
sburgh-3.103
opsis-3.022
ificant-2.615
ocene-2.572
heastern-2.562
Tokenless
Token ID1203
Feature activation+4.94e-01

Pos logit contributions
culus+5.753
picture+4.991
moral+4.875
rely+4.763
 footing+4.518
Neg logit contributions
sburgh-5.730
opsis-5.471
ificant-4.762
heastern-4.669
ocene-4.651
Token,
Token ID11
Feature activation-2.46e-01

Pos logit contributions
culus+2.868
picture+2.459
moral+2.402
rely+2.350
 footing+2.254
Neg logit contributions
sburgh-2.887
opsis-2.797
ificant-2.430
ocene-2.397
heastern-2.369
Token hook
Token ID8011
Feature activation-1.07e+00

Pos logit contributions
sburgh+1.381
opsis+1.318
ificant+1.143
ocene+1.128
heastern+1.112
Neg logit contributions
culus-1.497
picture-1.286
moral-1.260
rely-1.235
 footing-1.177
Tokenn
Token ID77
Feature activation-2.62e+00

Pos logit contributions
sburgh+6.216
opsis+6.006
ificant+5.223
ocene+5.167
heastern+5.068
Neg logit contributions
culus-5.838
picture-4.936
rely-4.848
moral-4.750
 footing-4.461
Tokenosed
Token ID1335
Feature activation+4.60e-01

Pos logit contributions
opsis+12.737
sburgh+12.617
ographical+10.286
ificant+10.161
essee+10.145
Neg logit contributions
culus-14.052
picture-11.775
moral-11.659
 sexes-11.224
 footing-11.020
Token Jew
Token ID3370
Feature activation+8.27e-01

Pos logit contributions
culus+2.645
picture+2.263
moral+2.217
rely+2.164
 footing+2.079
Neg logit contributions
sburgh-2.707
opsis-2.621
ificant-2.289
ocene-2.256
heastern-2.232
Token
Token ID447
Feature activation+3.55e-01

Pos logit contributions
culus+4.915
picture+4.237
moral+4.141
rely+4.035
 footing+3.933
Neg logit contributions
sburgh-4.601
opsis-4.435
ificant-3.887
heastern-3.771
ocene-3.770
Token
Token ID251
Feature activation-1.62e-01

Pos logit contributions
culus+2.137
picture+1.839
moral+1.798
rely+1.761
 footing+1.681
Neg logit contributions
sburgh-2.010
opsis-1.933
ificant-1.673
ocene-1.652
heastern-1.627
Token from
Token ID422
Feature activation+1.52e-01

Pos logit contributions
sburgh+0.892
opsis+0.856
ificant+0.737
ocene+0.727
heastern+0.714
Neg logit contributions
culus-0.999
picture-0.863
moral-0.844
rely-0.828
 footing-0.790
Token leave
Token ID2666
Feature activation-7.75e-02

Pos logit contributions
culus+1.035
picture+0.871
moral+0.849
rely+0.828
 footing+0.782
Neg logit contributions
sburgh-1.214
opsis-1.171
ificant-1.030
ocene-1.018
heastern-1.003
Token.
Token ID13
Feature activation-2.20e-01

Pos logit contributions
sburgh+0.427
opsis+0.410
ificant+0.353
ocene+0.348
heastern+0.343
Neg logit contributions
culus-0.480
picture-0.414
moral-0.406
rely-0.398
 footing-0.379
Token OS
Token ID7294
Feature activation+2.60e-01

Pos logit contributions
sburgh+1.125
opsis+1.077
ificant+0.915
ocene+0.903
heastern+0.887
Neg logit contributions
culus-1.451
picture-1.266
moral-1.241
rely-1.218
 footing-1.167
TokenHA
Token ID7801
Feature activation+2.40e-01

Pos logit contributions
culus+1.470
picture+1.245
moral+1.215
rely+1.193
 footing+1.133
Neg logit contributions
sburgh-1.582
opsis-1.521
ificant-1.332
ocene-1.309
heastern-1.291
Token (
Token ID357
Feature activation+2.04e-01

Pos logit contributions
culus+1.431
picture+1.227
moral+1.199
rely+1.182
 footing+1.134
Neg logit contributions
sburgh-1.366
opsis-1.317
ificant-1.143
ocene-1.130
heastern-1.117
TokenOccup
Token ID47658
Feature activation-2.61e+00

Pos logit contributions
culus+1.102
picture+0.943
moral+0.921
rely+0.898
 footing+0.844
Neg logit contributions
sburgh-1.276
opsis-1.234
ificant-1.083
ocene-1.068
heastern-1.055
Tokenational
Token ID864
Feature activation-8.49e-01

Pos logit contributions
sburgh+6.304
opsis+5.136
ificant+4.068
atal+4.054
essee+3.424
Neg logit contributions
culus-21.048
picture-18.772
moral-18.458
rely-18.043
 sexes-17.840
Token Health
Token ID3893
Feature activation+9.38e-03

Pos logit contributions
sburgh+4.192
opsis+3.863
ificant+3.358
ocene+3.263
heastern+3.208
Neg logit contributions
culus-5.626
picture-4.937
moral-4.771
rely-4.717
 footing-4.472
Token and
Token ID290
Feature activation+8.29e-01

Pos logit contributions
culus+0.051
picture+0.043
moral+0.042
rely+0.041
 footing+0.039
Neg logit contributions
sburgh-0.059
opsis-0.057
ificant-0.050
ocene-0.049
heastern-0.049
Token Safety
Token ID11233
Feature activation+6.45e-01

Pos logit contributions
culus+4.544
picture+3.777
moral+3.776
rely+3.708
 footing+3.436
Neg logit contributions
sburgh-5.018
opsis-4.992
ocene-4.350
ificant-4.230
heastern-4.143
Token Administration
Token ID8694
Feature activation-2.55e-01

Pos logit contributions
culus+3.610
picture+3.088
moral+3.002
rely+2.938
 footing+2.806
Neg logit contributions
sburgh-3.859
opsis-3.770
ocene-3.281
ificant-3.256
heastern-3.197
TokenDavid
Token ID11006
Feature activation+8.08e-03

Pos logit contributions
sburgh+3.042
opsis+2.943
ocene+2.502
ificant+2.497
heastern+2.446
Neg logit contributions
culus-3.413
picture-2.888
moral-2.843
rely-2.800
 footing-2.741
Token Camp
Token ID5425
Feature activation-7.44e-01

Pos logit contributions
culus+0.047
picture+0.040
moral+0.039
rely+0.038
 footing+0.036
Neg logit contributions
sburgh-0.048
opsis-0.046
ificant-0.040
ocene-0.039
heastern-0.039
Tokenos
Token ID418
Feature activation+6.83e-01

Pos logit contributions
sburgh+3.942
opsis+3.826
ificant+3.228
ocene+3.173
heastern+3.135
Neg logit contributions
culus-4.635
picture-4.031
moral-3.908
rely-3.823
 footing-3.731
Token
Token ID198
Feature activation+1.79e-01

Pos logit contributions
culus+3.989
picture+3.415
moral+3.315
rely+3.239
 footing+3.103
Neg logit contributions
sburgh-3.921
opsis-3.752
ificant-3.325
ocene-3.254
heastern-3.205
Token
Token ID198
Feature activation-5.61e-02

Pos logit contributions
culus+0.996
picture+0.870
moral+0.835
rely+0.820
 footing+0.759
Neg logit contributions
sburgh-1.087
opsis-1.052
ificant-0.923
ocene-0.908
heastern-0.904
TokenSan
Token ID15017
Feature activation-2.55e+00

Pos logit contributions
sburgh+0.308
opsis+0.295
ificant+0.254
ocene+0.251
heastern+0.247
Neg logit contributions
culus-0.349
picture-0.301
moral-0.295
rely-0.289
 footing-0.277
Token Francisco
Token ID6033
Feature activation-2.91e-01

Pos logit contributions
sburgh+10.984
opsis+10.820
ificant+8.937
ocene+8.693
heastern+8.680
Neg logit contributions
culus-14.859
moral-12.770
 footing-12.701
rely-12.585
picture-12.571
Token Board
Token ID5926
Feature activation+1.73e-01

Pos logit contributions
sburgh+1.752
opsis+1.682
ificant+1.469
ocene+1.454
heastern+1.431
Neg logit contributions
culus-1.646
picture-1.389
moral-1.365
rely-1.337
 footing-1.273
Token of
Token ID286
Feature activation+6.34e-01

Pos logit contributions
culus+0.957
picture+0.811
moral+0.793
rely+0.774
 footing+0.735
Neg logit contributions
sburgh-1.063
opsis-1.024
ificant-0.898
ocene-0.888
heastern-0.874
Token Super
Token ID3115
Feature activation+3.17e-01

Pos logit contributions
culus+3.159
moral+2.592
picture+2.588
rely+2.542
 footing+2.318
Neg logit contributions
sburgh-4.123
opsis-4.080
ificant-3.568
heastern-3.546
ocene-3.478
Tokenvisors
Token ID27681
Feature activation-8.93e-02

Pos logit contributions
culus+2.078
picture+1.816
moral+1.784
rely+1.746
 footing+1.662
Neg logit contributions
sburgh-1.615
opsis-1.555
ificant-1.310
ocene-1.303
heastern-1.285
Token76
Token ID4304
Feature activation-7.77e-02

Pos logit contributions
culus+2.543
picture+2.225
moral+2.032
rely+1.975
chat+1.732
Neg logit contributions
sburgh-4.975
opsis-4.871
heastern-4.477
ificant-4.457
ocene-4.406
Token or
Token ID393
Feature activation-4.63e-01

Pos logit contributions
sburgh+0.509
opsis+0.491
ificant+0.434
ocene+0.430
heastern+0.423
Neg logit contributions
culus-0.400
picture-0.334
moral-0.327
rely-0.318
 footing-0.301
Token ed
Token ID1225
Feature activation+1.55e-01

Pos logit contributions
sburgh+2.400
opsis+2.270
ificant+1.951
ocene+1.948
heastern+1.859
Neg logit contributions
culus-2.992
picture-2.637
moral-2.574
rely-2.518
 footing-2.416
Tokenraper
Token ID38545
Feature activation-7.75e-01

Pos logit contributions
culus+1.043
picture+0.918
moral+0.900
rely+0.879
 footing+0.825
Neg logit contributions
sburgh-0.768
opsis-0.741
ocene-0.624
ificant-0.619
heastern-0.604
Token@
Token ID31
Feature activation-1.11e+00

Pos logit contributions
sburgh+4.905
opsis+4.663
ocene+4.127
ificant+4.078
heastern+4.027
Neg logit contributions
culus-4.077
picture-3.411
moral-3.353
rely-3.238
 footing-3.180
Tokenden
Token ID6559
Feature activation-2.50e+00

Pos logit contributions
sburgh+6.517
opsis+6.203
ocene+5.535
ificant+5.455
heastern+5.386
Neg logit contributions
culus-5.878
picture-5.086
moral-4.781
rely-4.750
 footing-4.625
Tokenver
Token ID332
Feature activation-1.20e+00

Pos logit contributions
opsis+9.446
sburgh+9.418
atal+7.847
olulu+7.522
heastern+7.436
Neg logit contributions
culus-16.465
picture-14.743
rely-14.315
AME-14.054
 footing-13.925
Tokenpost
Token ID7353
Feature activation+5.41e-02

Pos logit contributions
sburgh+5.595
opsis+5.010
ocene+4.425
ificant+4.286
heastern+4.195
Neg logit contributions
culus-8.200
rely-6.969
moral-6.906
picture-6.899
 footing-6.679
Token.
Token ID13
Feature activation-6.59e-02

Pos logit contributions
culus+0.353
picture+0.311
moral+0.301
rely+0.296
 footing+0.282
Neg logit contributions
sburgh-0.278
opsis-0.268
ificant-0.229
ocene-0.224
heastern-0.221
Tokencom
Token ID785
Feature activation-2.54e-01

Pos logit contributions
sburgh+0.423
opsis+0.408
ificant+0.360
ocene+0.356
heastern+0.351
Neg logit contributions
culus-0.348
picture-0.293
moral-0.285
rely-0.278
 footing-0.263
Token
Token ID198
Feature activation+1.10e-01

Pos logit contributions
sburgh+1.367
opsis+1.291
ificant+1.100
ocene+1.096
heastern+1.075
Neg logit contributions
culus-1.593
picture-1.393
moral-1.362
rely-1.337
 footing-1.281
Token or
Token ID393
Feature activation-6.69e-01

Pos logit contributions
culus+1.089
picture+0.947
moral+0.916
rely+0.906
 footing+0.869
Neg logit contributions
sburgh-1.048
opsis-1.004
ificant-0.871
ocene-0.867
heastern-0.846
Token with
Token ID351
Feature activation-5.55e-01

Pos logit contributions
sburgh+3.411
opsis+3.293
ificant+2.809
ocene+2.709
heastern+2.707
Neg logit contributions
culus-4.346
picture-3.768
moral-3.716
rely-3.594
 footing-3.433
Token your
Token ID534
Feature activation-1.74e-01

Pos logit contributions
sburgh+2.909
opsis+2.804
ificant+2.412
ocene+2.338
heastern+2.324
Neg logit contributions
culus-3.532
picture-3.052
moral-3.010
rely-2.915
 footing-2.797
Token friends
Token ID2460
Feature activation-4.97e-01

Pos logit contributions
sburgh+0.954
opsis+0.916
ificant+0.788
ocene+0.779
heastern+0.766
Neg logit contributions
culus-1.079
picture-0.932
moral-0.913
rely-0.895
 footing-0.855
Token or
Token ID393
Feature activation-6.88e-01

Pos logit contributions
sburgh+2.787
opsis+2.681
ificant+2.318
ocene+2.275
heastern+2.266
Neg logit contributions
culus-2.993
picture-2.557
moral-2.522
rely-2.457
 footing-2.325
Token roomm
Token ID26255
Feature activation-2.45e+00

Pos logit contributions
sburgh+3.144
opsis+3.012
ificant+2.558
ocene+2.435
heastern+2.412
Neg logit contributions
culus-4.831
picture-4.229
moral-4.175
rely-4.049
 footing-3.887
Tokenates
Token ID689
Feature activation-7.26e-01

Pos logit contributions
opsis+7.449
sburgh+6.921
atal+5.621
heastern+4.968
ificant+4.904
Neg logit contributions
culus-18.229
moral-16.085
picture-16.046
rely-15.550
 footing-15.547
Token,
Token ID11
Feature activation-2.46e-01

Pos logit contributions
sburgh+3.665
opsis+3.545
ificant+2.984
heastern+2.932
ocene+2.889
Neg logit contributions
culus-4.700
picture-4.083
moral-4.032
rely-3.918
 footing-3.696
Token for
Token ID329
Feature activation+1.41e-01

Pos logit contributions
sburgh+1.346
opsis+1.291
ificant+1.111
ocene+1.098
heastern+1.078
Neg logit contributions
culus-1.530
picture-1.325
moral-1.294
rely-1.272
 footing-1.216
Token example
Token ID1672
Feature activation-4.14e-01

Pos logit contributions
culus+0.958
picture+0.838
moral+0.823
rely+0.811
 footing+0.780
Neg logit contributions
sburgh-0.698
opsis-0.664
ificant-0.558
ocene-0.555
heastern-0.541
Token.
Token ID13
Feature activation+1.26e-01

Pos logit contributions
sburgh+1.978
opsis+1.898
ificant+1.571
heastern+1.542
ocene+1.540
Neg logit contributions
culus-2.830
picture-2.485
moral-2.459
rely-2.399
 footing-2.292
Token all
Token ID477
Feature activation+7.61e-02

Pos logit contributions
sburgh+2.877
opsis+2.768
ificant+2.397
heastern+2.360
ocene+2.353
Neg logit contributions
culus-2.955
picture-2.535
moral-2.491
rely-2.400
 footing-2.289
Token non
Token ID1729
Feature activation+6.74e-01

Pos logit contributions
culus+0.500
picture+0.437
moral+0.427
rely+0.420
 footing+0.404
Neg logit contributions
sburgh-0.389
opsis-0.373
ificant-0.316
ocene-0.313
heastern-0.305
Tokenfarm
Token ID43323
Feature activation-3.28e-01

Pos logit contributions
culus+3.325
picture+2.792
moral+2.721
rely+2.619
 footing+2.450
Neg logit contributions
sburgh-4.485
opsis-4.385
ificant-3.886
ocene-3.830
 externalToEVAOnly-3.733
Token sectors
Token ID16020
Feature activation+7.97e-02

Pos logit contributions
sburgh+2.122
opsis+2.038
ificant+1.818
ocene+1.789
heastern+1.784
Neg logit contributions
culus-1.721
picture-1.430
moral-1.404
rely-1.352
 footing-1.280
Token (
Token ID357
Feature activation+9.77e-02

Pos logit contributions
culus+0.483
picture+0.416
moral+0.407
rely+0.399
 footing+0.381
Neg logit contributions
sburgh-0.449
opsis-0.432
ificant-0.374
ocene-0.369
heastern-0.363
Tokenauthor
Token ID9800
Feature activation-2.45e+00

Pos logit contributions
culus+0.585
picture+0.507
moral+0.495
rely+0.484
 footing+0.459
Neg logit contributions
sburgh-0.557
opsis-0.536
ificant-0.463
ocene-0.459
heastern-0.450
Token
Token ID447
Feature activation-6.82e-01

Pos logit contributions
sburgh+12.443
opsis+11.857
ificant+10.509
ographical+9.971
ocene+9.718
Neg logit contributions
culus-13.176
picture-11.177
moral-11.045
rely-10.638
 footing-10.528
Token
Token ID247
Feature activation-4.17e-01

Pos logit contributions
sburgh+4.013
opsis+3.858
ificant+3.320
ocene+3.267
heastern+3.242
Neg logit contributions
culus-3.889
picture-3.319
moral-3.257
rely-3.190
 footing-3.040
Tokens
Token ID82
Feature activation-5.58e-01

Pos logit contributions
sburgh+2.402
opsis+2.310
ificant+2.007
ocene+1.968
heastern+1.945
Neg logit contributions
culus-2.459
picture-2.102
moral-2.060
rely-2.015
 footing-1.929
Token analysis
Token ID3781
Feature activation-2.92e-01

Pos logit contributions
sburgh+3.235
opsis+3.133
ificant+2.712
ocene+2.646
heastern+2.627
Neg logit contributions
culus-3.234
picture-2.777
moral-2.722
rely-2.654
 footing-2.530
Token of
Token ID286
Feature activation-8.33e-01

Pos logit contributions
sburgh+1.682
opsis+1.625
ificant+1.410
ocene+1.380
heastern+1.369
Neg logit contributions
culus-1.721
picture-1.477
moral-1.445
rely-1.410
 footing-1.346
Tokengot
Token ID23442
Feature activation-1.40e+00

Pos logit contributions
culus+0.947
picture+0.818
moral+0.801
rely+0.784
 footing+0.749
Neg logit contributions
sburgh-0.848
opsis-0.815
ificant-0.702
ocene-0.692
heastern-0.682
Tokenists
Token ID1023
Feature activation-7.22e-01

Pos logit contributions
sburgh+3.568
opsis+3.408
ocene+2.467
ificant+2.439
ists+2.403
Neg logit contributions
culus-11.879
picture-10.802
moral-10.578
rely-10.574
 footing-10.462
Token with
Token ID351
Feature activation-4.30e-02

Pos logit contributions
sburgh+3.798
opsis+3.685
ificant+3.098
ocene+3.063
heastern+3.054
Neg logit contributions
culus-4.484
picture-3.879
moral-3.816
rely-3.736
 footing-3.505
Token a
Token ID257
Feature activation-2.02e-01

Pos logit contributions
sburgh+0.242
opsis+0.233
ificant+0.201
ocene+0.199
heastern+0.196
Neg logit contributions
culus-0.261
picture-0.225
moral-0.220
rely-0.216
 footing-0.206
Token loyal
Token ID9112
Feature activation-1.22e+00

Pos logit contributions
sburgh+1.124
opsis+1.081
ificant+0.933
ocene+0.919
heastern+0.907
Neg logit contributions
culus-1.236
picture-1.065
moral-1.040
rely-1.022
 footing-0.973
Token fan
Token ID4336
Feature activation-2.42e+00

Pos logit contributions
sburgh+6.026
opsis+5.996
ificant+5.090
heastern+4.840
ocene+4.792
Neg logit contributions
culus-7.631
picture-6.549
moral-6.387
rely-6.294
 sexes-6.002
Token base
Token ID2779
Feature activation-9.98e-01

Pos logit contributions
opsis+7.857
sburgh+7.314
ificant+6.102
ocene+5.870
ographical+5.573
Neg logit contributions
culus-16.244
picture-14.664
rely-14.532
moral-14.269
 sexes-13.961
Token who
Token ID508
Feature activation-4.00e-01

Pos logit contributions
sburgh+5.054
opsis+4.935
ificant+4.149
heastern+4.068
ocene+4.045
Neg logit contributions
culus-6.178
picture-5.365
moral-5.326
rely-5.231
 footing-4.820
Token will
Token ID481
Feature activation-5.34e-01

Pos logit contributions
sburgh+2.346
opsis+2.267
ificant+1.957
ocene+1.932
heastern+1.899
Neg logit contributions
culus-2.320
picture-1.983
moral-1.944
rely-1.876
 footing-1.793
Token hyper
Token ID8718
Feature activation+1.42e+00

Pos logit contributions
sburgh+2.991
opsis+2.891
ificant+2.454
ocene+2.415
heastern+2.378
Neg logit contributions
culus-3.231
picture-2.775
moral-2.716
rely-2.604
 footing-2.492
Token-
Token ID12
Feature activation+1.34e+00

Pos logit contributions
culus+6.339
moral+5.334
picture+5.327
rely+5.204
 footing+4.464
Neg logit contributions
sburgh-9.794
opsis-9.583
ificant-8.465
ocene-8.308
heastern-8.284
Token Q
Token ID1195
Feature activation+1.74e-01

Pos logit contributions
sburgh+4.585
opsis+4.379
ificant+3.848
ocene+3.761
heastern+3.699
Neg logit contributions
culus-4.752
picture-4.038
moral-3.957
rely-3.881
 footing-3.648
TokenF
Token ID37
Feature activation-3.05e-01

Pos logit contributions
culus+1.083
picture+0.937
moral+0.917
rely+0.899
 footing+0.859
Neg logit contributions
sburgh-0.950
opsis-0.912
ificant-0.784
ocene-0.774
heastern-0.762
Token72
Token ID4761
Feature activation-1.14e+00

Pos logit contributions
sburgh+2.096
opsis+2.031
ificant+1.820
ocene+1.790
heastern+1.755
Neg logit contributions
culus-1.451
picture-1.203
moral-1.161
rely-1.141
 footing-1.068
Token incident
Token ID4519
Feature activation-1.11e+00

Pos logit contributions
sburgh+5.447
opsis+5.163
ificant+4.353
ocene+4.156
heastern+4.022
Neg logit contributions
culus-7.528
picture-6.542
moral-6.461
rely-6.351
 footing-6.009
Token has
Token ID468
Feature activation-6.87e-01

Pos logit contributions
sburgh+5.692
opsis+5.425
ificant+4.639
ocene+4.508
heastern+4.466
Neg logit contributions
culus-6.996
picture-5.988
moral-5.896
rely-5.761
 footing-5.495
Token lived
Token ID5615
Feature activation-2.42e+00

Pos logit contributions
sburgh+3.584
opsis+3.402
ificant+2.953
ocene+2.869
heastern+2.847
Neg logit contributions
culus-4.394
picture-3.803
moral-3.716
rely-3.602
 footing-3.446
Token inside
Token ID2641
Feature activation-9.34e-01

Pos logit contributions
sburgh+9.927
opsis+8.827
 somewhere+8.421
ificant+8.131
 repeated+7.261
Neg logit contributions
culus-15.300
picture-13.401
moral-13.248
rely-12.448
intent-11.986
Token me
Token ID502
Feature activation-8.84e-01

Pos logit contributions
sburgh+4.290
opsis+4.058
ificant+3.474
ocene+3.376
heastern+3.271
Neg logit contributions
culus-6.472
picture-5.681
moral-5.595
rely-5.451
 footing-5.251
Token every
Token ID790
Feature activation-6.61e-01

Pos logit contributions
sburgh+4.405
opsis+4.218
ificant+3.580
ocene+3.478
heastern+3.451
Neg logit contributions
culus-5.793
picture-5.067
moral-4.970
rely-4.897
 footing-4.633
Token single
Token ID2060
Feature activation-5.93e-01

Pos logit contributions
sburgh+4.198
opsis+4.064
ificant+3.600
ocene+3.529
heastern+3.485
Neg logit contributions
culus-3.462
picture-2.832
moral-2.789
rely-2.724
 footing-2.558
Token day
Token ID1110
Feature activation-6.77e-01

Pos logit contributions
sburgh+3.950
opsis+3.826
ificant+3.411
ocene+3.353
heastern+3.311
Neg logit contributions
culus-2.950
picture-2.384
moral-2.350
rely-2.282
 footing-2.130
Token not
Token ID407
Feature activation-2.49e-01

Pos logit contributions
sburgh+0.863
opsis+0.830
ificant+0.720
ocene+0.712
heastern+0.701
Neg logit contributions
culus-0.889
picture-0.763
moral-0.746
rely-0.731
 footing-0.697
Token protecting
Token ID10192
Feature activation-6.02e-01

Pos logit contributions
sburgh+1.441
opsis+1.387
ificant+1.205
ocene+1.189
heastern+1.172
Neg logit contributions
culus-1.465
picture-1.256
moral-1.228
rely-1.200
 footing-1.142
Token the
Token ID262
Feature activation-3.70e-01

Pos logit contributions
sburgh+3.412
opsis+3.265
ificant+2.831
ocene+2.761
heastern+2.746
Neg logit contributions
culus-3.626
picture-3.095
moral-3.014
rely-2.925
 footing-2.786
Token rights
Token ID2489
Feature activation+3.06e-01

Pos logit contributions
sburgh+2.650
opsis+2.570
ificant+2.298
ocene+2.268
heastern+2.249
Neg logit contributions
culus-1.676
picture-1.365
moral-1.314
rely-1.274
 footing-1.185
Token of
Token ID286
Feature activation-3.88e-01

Pos logit contributions
culus+1.649
picture+1.397
moral+1.364
rely+1.331
 footing+1.267
Neg logit contributions
sburgh-1.923
opsis-1.856
ificant-1.634
ocene-1.614
heastern-1.592
Token Occupy
Token ID25551
Feature activation-2.42e+00

Pos logit contributions
sburgh+1.960
opsis+1.870
ificant+1.589
ocene+1.562
heastern+1.538
Neg logit contributions
culus-2.584
picture-2.252
moral-2.208
rely-2.162
 footing-2.076
Token demonstrators
Token ID25016
Feature activation-1.18e+00

Pos logit contributions
sburgh+10.280
opsis+9.908
ificant+8.158
ists+7.710
heastern+7.693
Neg logit contributions
culus-15.331
picture-13.600
moral-13.176
rely-12.642
 sexes-11.911
Token whose
Token ID3025
Feature activation-1.04e+00

Pos logit contributions
sburgh+5.995
opsis+5.752
ificant+4.841
heastern+4.781
ocene+4.676
Neg logit contributions
culus-7.520
picture-6.462
moral-6.332
rely-6.122
 footing-5.677
Token protests
Token ID8536
Feature activation-3.29e-01

Pos logit contributions
sburgh+5.936
opsis+5.678
ificant+4.986
ocene+4.814
heastern+4.808
Neg logit contributions
culus-5.985
picture-5.059
moral-4.937
rely-4.756
 footing-4.394
Token are
Token ID389
Feature activation-7.35e-01

Pos logit contributions
sburgh+1.924
opsis+1.852
ificant+1.613
ocene+1.583
heastern+1.572
Neg logit contributions
culus-1.929
picture-1.642
moral-1.608
rely-1.561
 footing-1.483
Token being
Token ID852
Feature activation-1.93e-01

Pos logit contributions
sburgh+3.814
opsis+3.648
ificant+3.134
ocene+3.027
heastern+3.020
Neg logit contributions
culus-4.726
picture-4.085
moral-3.991
rely-3.889
 footing-3.664
Token inside
Token ID2641
Feature activation-7.29e-01

Pos logit contributions
culus+1.206
picture+1.039
moral+1.017
rely+0.997
 footing+0.956
Neg logit contributions
sburgh-1.144
opsis-1.108
ificant-0.954
ocene-0.945
heastern-0.918
Token,
Token ID11
Feature activation+2.20e-01

Pos logit contributions
sburgh+4.043
opsis+3.844
ificant+3.307
ocene+3.297
heastern+3.243
Neg logit contributions
culus-4.441
picture-3.833
moral-3.754
rely-3.662
 footing-3.457
Token and
Token ID290
Feature activation+2.72e-01

Pos logit contributions
culus+1.374
picture+1.198
moral+1.164
rely+1.141
 footing+1.111
Neg logit contributions
sburgh-1.187
opsis-1.148
ificant-0.989
ocene-0.972
heastern-0.953
Token perhaps
Token ID3737
Feature activation-4.38e-02

Pos logit contributions
culus+1.643
picture+1.416
moral+1.375
rely+1.367
 footing+1.320
Neg logit contributions
sburgh-1.525
opsis-1.466
ificant-1.277
ocene-1.262
heastern-1.229
Token even
Token ID772
Feature activation+2.60e-01

Pos logit contributions
sburgh+0.245
opsis+0.236
ificant+0.204
ocene+0.202
heastern+0.198
Neg logit contributions
culus-0.267
picture-0.230
moral-0.225
rely-0.222
 footing-0.213
Token rent
Token ID5602
Feature activation-2.41e+00

Pos logit contributions
culus+1.558
picture+1.341
moral+1.306
rely+1.289
 footing+1.250
Neg logit contributions
sburgh-1.465
opsis-1.409
ificant-1.229
ocene-1.211
heastern-1.188
Token an
Token ID281
Feature activation-8.01e-01

Pos logit contributions
sburgh+10.075
opsis+9.799
 somewhere+8.931
ificant+7.964
ographical+7.895
Neg logit contributions
culus-14.959
moral-12.897
picture-12.886
rely-12.685
 sexes-11.684
Token apartment
Token ID7962
Feature activation-1.12e+00

Pos logit contributions
sburgh+4.179
opsis+3.973
ificant+3.453
ocene+3.354
heastern+3.268
Neg logit contributions
culus-5.103
picture-4.440
moral-4.392
rely-4.209
 footing-4.038
Token in
Token ID287
Feature activation-3.15e-01

Pos logit contributions
sburgh+5.698
opsis+5.390
ificant+4.603
ocene+4.461
heastern+4.456
Neg logit contributions
culus-7.202
picture-6.154
moral-6.101
rely-6.064
 sexes-5.541
Token Wa
Token ID15329
Feature activation-7.29e-01

Pos logit contributions
sburgh+1.750
opsis+1.676
ificant+1.448
ocene+1.429
heastern+1.403
Neg logit contributions
culus-1.932
picture-1.668
moral-1.628
rely-1.599
 footing-1.519
Tokenik
Token ID1134
Feature activation+1.00e-01

Pos logit contributions
sburgh+2.968
opsis+2.836
ificant+2.290
ocene+2.268
heastern+2.206
Neg logit contributions
culus-5.459
picture-4.845
moral-4.801
rely-4.711
 footing-4.564
Token Cuba
Token ID14159
Feature activation-1.45e-01

Pos logit contributions
culus+1.639
picture+1.403
moral+1.380
rely+1.360
 footing+1.311
Neg logit contributions
sburgh-1.665
opsis-1.608
ocene-1.390
ificant-1.389
heastern-1.367
Token present
Token ID1944
Feature activation-4.02e-01

Pos logit contributions
sburgh+0.751
opsis+0.719
ificant+0.613
ocene+0.605
heastern+0.594
Neg logit contributions
culus-0.944
picture-0.821
moral-0.806
rely-0.790
 footing-0.760
Token.
Token ID13
Feature activation-1.99e-01

Pos logit contributions
sburgh+2.013
opsis+1.915
ificant+1.618
ocene+1.595
heastern+1.574
Neg logit contributions
culus-2.682
picture-2.338
moral-2.294
rely-2.262
 footing-2.143
Token
Token ID198
Feature activation-5.52e-01

Pos logit contributions
sburgh+1.116
opsis+1.068
ificant+0.927
ocene+0.913
heastern+0.894
Neg logit contributions
culus-1.216
picture-1.039
moral-1.022
rely-1.001
 footing-0.960
Token
Token ID198
Feature activation-3.93e-01

Pos logit contributions
sburgh+2.954
opsis+2.802
ificant+2.395
ocene+2.364
heastern+2.303
Neg logit contributions
culus-3.523
picture-2.950
moral-2.945
rely-2.843
 footing-2.801
TokenCount
Token ID12332
Feature activation-2.40e+00

Pos logit contributions
sburgh+1.974
opsis+1.889
ificant+1.598
ocene+1.582
heastern+1.542
Neg logit contributions
culus-2.613
picture-2.256
moral-2.220
rely-2.185
 footing-2.116
Tokenries
Token ID1678
Feature activation+6.40e-01

Pos logit contributions
sburgh+7.999
opsis+6.866
ificant+5.583
heastern+5.523
ocene+5.095
Neg logit contributions
culus-17.821
picture-15.872
moral-15.800
 footing-15.179
rely-14.935
Token that
Token ID326
Feature activation+5.95e-01

Pos logit contributions
culus+3.664
picture+3.111
moral+3.056
rely+3.049
 footing+2.949
Neg logit contributions
sburgh-3.771
opsis-3.639
ocene-3.162
ificant-3.155
heastern-3.055
Token once
Token ID1752
Feature activation-4.47e-01

Pos logit contributions
culus+3.219
picture+2.739
rely+2.701
moral+2.640
 footing+2.553
Neg logit contributions
sburgh-3.639
opsis-3.530
ocene-3.080
ificant-3.075
heastern-3.043
Token acquies
Token ID47752
Feature activation+5.55e-01

Pos logit contributions
sburgh+2.509
opsis+2.384
ificant+2.070
ocene+2.031
heastern+2.009
Neg logit contributions
culus-2.712
picture-2.335
moral-2.293
rely-2.211
 footing-2.102
Tokenced
Token ID771
Feature activation+3.40e-01

Pos logit contributions
culus+3.311
rely+2.811
picture+2.773
moral+2.762
 footing+2.577
Neg logit contributions
sburgh-3.188
opsis-3.077
ificant-2.620
ocene-2.612
heastern-2.564
Token the
Token ID262
Feature activation+2.95e-01

Pos logit contributions
culus+0.241
picture+0.206
moral+0.201
rely+0.197
 footing+0.187
Neg logit contributions
sburgh-0.248
opsis-0.238
ificant-0.208
ocene-0.205
heastern-0.202
Token 3
Token ID513
Feature activation+4.80e-01

Pos logit contributions
culus+1.526
picture+1.279
moral+1.250
rely+1.225
 footing+1.163
Neg logit contributions
sburgh-1.910
opsis-1.858
ificant-1.638
ocene-1.625
heastern-1.598
Tokenrd
Token ID4372
Feature activation+1.50e-01

Pos logit contributions
culus+1.914
picture+1.522
moral+1.463
rely+1.440
 footing+1.316
Neg logit contributions
sburgh-3.650
opsis-3.568
ocene-3.196
ificant-3.195
heastern-3.125
Token Italian
Token ID8200
Feature activation-4.00e-01

Pos logit contributions
culus+0.880
picture+0.755
moral+0.738
rely+0.722
 footing+0.689
Neg logit contributions
sburgh-0.867
opsis-0.835
ificant-0.725
ocene-0.718
heastern-0.706
Token "
Token ID366
Feature activation+2.04e-01

Pos logit contributions
sburgh+2.545
opsis+2.455
ificant+2.160
ocene+2.123
heastern+2.102
Neg logit contributions
culus-2.119
picture-1.797
moral-1.739
rely-1.700
 footing-1.598
TokenSan
Token ID15017
Feature activation-2.40e+00

Pos logit contributions
culus+1.236
picture+1.073
moral+1.049
rely+1.021
 footing+0.963
Neg logit contributions
sburgh-1.145
opsis-1.099
ificant-0.952
ocene-0.941
heastern-0.926
Token Marco
Token ID16556
Feature activation-1.46e-01

Pos logit contributions
sburgh+13.224
opsis+13.085
ificant+11.545
ocene+11.064
heastern+10.998
Neg logit contributions
culus-11.007
picture-9.286
rely-9.159
 footing-9.154
moral-9.142
Token"
Token ID1
Feature activation-2.64e-01

Pos logit contributions
sburgh+0.747
opsis+0.717
ificant+0.607
ocene+0.598
heastern+0.587
Neg logit contributions
culus-0.965
picture-0.844
moral-0.825
rely-0.812
 footing-0.779
Token Marine
Token ID11000
Feature activation+1.78e-01

Pos logit contributions
sburgh+1.634
opsis+1.576
ificant+1.386
ocene+1.362
heastern+1.348
Neg logit contributions
culus-1.453
picture-1.232
moral-1.200
rely-1.175
 footing-1.109
Token Division
Token ID7458
Feature activation+5.68e-01

Pos logit contributions
culus+0.910
picture+0.761
moral+0.740
rely+0.722
 footing+0.682
Neg logit contributions
sburgh-1.167
opsis-1.129
ificant-0.999
ocene-0.988
heastern-0.975
Token and
Token ID290
Feature activation+3.46e-01

Pos logit contributions
culus+2.902
picture+2.426
moral+2.362
rely+2.300
 footing+2.180
Neg logit contributions
sburgh-3.724
opsis-3.601
ificant-3.185
ocene-3.158
heastern-3.113
Token not
Token ID407
Feature activation+5.66e-01

Pos logit contributions
culus+1.442
picture+1.242
moral+1.212
rely+1.191
 footing+1.147
Neg logit contributions
sburgh-1.378
opsis-1.319
ocene-1.143
ificant-1.141
heastern-1.120
Token a
Token ID257
Feature activation+1.51e-01

Pos logit contributions
culus+3.317
picture+2.832
moral+2.766
rely+2.727
 footing+2.628
Neg logit contributions
sburgh-3.276
opsis-3.127
ocene-2.744
ificant-2.678
heastern-2.662
Token full
Token ID1336
Feature activation+3.66e-01

Pos logit contributions
culus+0.986
picture+0.860
moral+0.842
rely+0.826
 footing+0.798
Neg logit contributions
sburgh-0.787
opsis-0.748
ificant-0.636
ocene-0.636
heastern-0.620
Token on
Token ID319
Feature activation-1.44e-01

Pos logit contributions
culus+2.196
picture+1.914
moral+1.835
rely+1.806
 footing+1.755
Neg logit contributions
sburgh-2.085
opsis-1.973
ocene-1.738
ificant-1.724
heastern-1.679
Token "
Token ID366
Feature activation+2.59e-01

Pos logit contributions
sburgh+0.819
opsis+0.788
ificant+0.683
ocene+0.674
heastern+0.664
Neg logit contributions
culus-0.860
picture-0.739
moral-0.723
rely-0.708
 footing-0.676
Tokenden
Token ID6559
Feature activation-2.39e+00

Pos logit contributions
culus+1.236
picture+1.044
moral+1.013
rely+0.980
 footing+0.904
Neg logit contributions
sburgh-1.782
opsis-1.730
ificant-1.535
ocene-1.530
heastern-1.507
Tokenial
Token ID498
Feature activation-9.21e-02

Pos logit contributions
opsis+10.905
sburgh+10.886
ificant+9.150
atal+8.966
igr+8.797
Neg logit contributions
culus-13.432
picture-11.769
moral-11.506
rely-11.310
 footing-11.037
Token"
Token ID1
Feature activation-5.93e-01

Pos logit contributions
sburgh+0.460
opsis+0.440
ificant+0.372
ocene+0.367
heastern+0.361
Neg logit contributions
culus-0.617
picture-0.539
moral-0.529
rely-0.519
 footing-0.499
Token it
Token ID340
Feature activation+3.74e-01

Pos logit contributions
sburgh+3.290
opsis+3.198
ificant+2.766
ocene+2.695
heastern+2.648
Neg logit contributions
culus-3.598
picture-3.077
moral-3.041
rely-2.960
 footing-2.804
Token was
Token ID373
Feature activation+5.06e-01

Pos logit contributions
culus+2.296
picture+1.996
moral+1.939
rely+1.905
 footing+1.854
Neg logit contributions
sburgh-2.025
opsis-1.928
ocene-1.693
ificant-1.685
heastern-1.636
Token enough
Token ID1576
Feature activation+1.52e-01

Pos logit contributions
culus+2.994
picture+2.569
moral+2.486
rely+2.465
 footing+2.384
Neg logit contributions
sburgh-2.910
opsis-2.775
ocene-2.453
ificant-2.386
heastern-2.364
Tokenuse
Token ID1904
Feature activation-2.78e-01

Pos logit contributions
culus+0.135
picture+0.113
moral+0.111
rely+0.108
 footing+0.102
Neg logit contributions
sburgh-0.165
opsis-0.159
ificant-0.141
ocene-0.139
heastern-0.137
Token trails
Token ID19196
Feature activation-1.41e-02

Pos logit contributions
sburgh+1.605
opsis+1.544
ificant+1.334
ocene+1.316
heastern+1.296
Neg logit contributions
culus-1.647
picture-1.414
moral-1.380
rely-1.345
 footing-1.269
Token from
Token ID422
Feature activation-2.61e-01

Pos logit contributions
sburgh+0.080
opsis+0.076
ificant+0.066
ocene+0.065
heastern+0.064
Neg logit contributions
culus-0.085
picture-0.073
moral-0.072
rely-0.070
 footing-0.067
Token Claire
Token ID20758
Feature activation-2.29e-01

Pos logit contributions
sburgh+1.443
opsis+1.381
ificant+1.193
ocene+1.174
heastern+1.155
Neg logit contributions
culus-1.604
picture-1.387
moral-1.355
rely-1.329
 footing-1.260
Tokenville
Token ID4244
Feature activation-4.36e-01

Pos logit contributions
sburgh+1.283
opsis+1.227
ificant+1.059
ocene+1.053
heastern+1.032
Neg logit contributions
culus-1.393
picture-1.202
moral-1.165
rely-1.146
 footing-1.101
Token Reserv
Token ID40425
Feature activation-2.39e+00

Pos logit contributions
sburgh+2.318
opsis+2.202
ificant+1.874
ocene+1.856
heastern+1.833
Neg logit contributions
culus-2.789
picture-2.424
moral-2.361
rely-2.327
 footing-2.223
Tokenoir
Token ID10840
Feature activation-2.11e-01

Pos logit contributions
opsis+2.968
sburgh+2.811
ificant+1.117
heastern+0.918
oir+0.781
Neg logit contributions
culus-22.023
picture-20.878
rely-20.107
moral-19.865
 sexes-19.611
Token in
Token ID287
Feature activation-3.59e-01

Pos logit contributions
sburgh+1.161
opsis+1.114
ificant+0.960
ocene+0.947
heastern+0.932
Neg logit contributions
culus-1.305
picture-1.129
moral-1.104
rely-1.083
 footing-1.033
Token Bram
Token ID35315
Feature activation-2.43e-01

Pos logit contributions
sburgh+1.932
opsis+1.856
ificant+1.588
ocene+1.566
heastern+1.539
Neg logit contributions
culus-2.263
picture-1.966
moral-1.919
rely-1.879
 footing-1.792
Tokenpton
Token ID10972
Feature activation-5.25e-01

Pos logit contributions
sburgh+1.184
opsis+1.128
ificant+0.946
ocene+0.927
heastern+0.926
Neg logit contributions
culus-1.662
picture-1.453
moral-1.401
rely-1.397
 footing-1.353
Token to
Token ID284
Feature activation-3.03e-01

Pos logit contributions
sburgh+2.787
opsis+2.665
ificant+2.267
ocene+2.237
heastern+2.216
Neg logit contributions
culus-3.353
picture-2.915
moral-2.848
rely-2.806
 footing-2.670
Token under
Token ID739
Feature activation+1.11e+00

Pos logit contributions
culus+6.234
moral+5.314
picture+5.277
rely+5.185
 footing+4.997
Neg logit contributions
sburgh-7.160
opsis-7.029
ocene-6.124
ificant-6.054
heastern-5.904
Token a
Token ID257
Feature activation+5.07e-01

Pos logit contributions
culus+6.181
moral+5.332
picture+5.181
rely+5.110
 footing+4.924
Neg logit contributions
sburgh-6.345
opsis-6.263
ocene-5.419
ificant-5.211
heastern-5.157
Token brutally
Token ID30040
Feature activation+1.92e-01

Pos logit contributions
culus+2.911
moral+2.472
picture+2.468
rely+2.393
 footing+2.299
Neg logit contributions
sburgh-3.012
opsis-2.900
ocene-2.489
ificant-2.478
heastern-2.410
Token misogyn
Token ID22795
Feature activation-1.06e+00

Pos logit contributions
culus+1.133
picture+0.969
moral+0.960
rely+0.935
 footing+0.892
Neg logit contributions
sburgh-1.120
opsis-1.076
ificant-0.928
ocene-0.920
heastern-0.904
Tokenistic
Token ID2569
Feature activation+6.23e-01

Pos logit contributions
sburgh+3.160
opsis+2.909
ificant+2.312
ocene+2.262
ographical+2.115
Neg logit contributions
culus-8.946
picture-8.132
rely-7.861
moral-7.854
 footing-7.686
Token cler
Token ID13451
Feature activation-2.35e+00

Pos logit contributions
culus+3.403
picture+2.890
moral+2.879
rely+2.803
 footing+2.718
Neg logit contributions
sburgh-3.824
opsis-3.688
ificant-3.215
ocene-3.163
heastern-3.128
Tokenical
Token ID605
Feature activation-1.88e-01

Pos logit contributions
opsis+6.554
sburgh+6.400
ocene+5.444
ificant+5.133
ographical+4.768
Neg logit contributions
culus-17.295
picture-16.574
moral-15.972
 footing-15.816
rely-15.644
Token culture
Token ID3968
Feature activation-7.75e-02

Pos logit contributions
sburgh+1.086
opsis+1.048
ificant+0.913
ocene+0.905
heastern+0.889
Neg logit contributions
culus-1.107
picture-0.951
moral-0.925
rely-0.905
 footing-0.856
Token she
Token ID673
Feature activation-8.40e-01

Pos logit contributions
sburgh+0.481
opsis+0.465
ificant+0.407
ocene+0.404
heastern+0.398
Neg logit contributions
culus-0.424
picture-0.359
moral-0.350
rely-0.342
 footing-0.323
Token isn
Token ID2125
Feature activation+5.78e-01

Pos logit contributions
sburgh+4.454
opsis+4.288
ificant+3.615
ocene+3.589
heastern+3.510
Neg logit contributions
culus-5.253
picture-4.567
moral-4.453
rely-4.344
 footing-4.127
Token
Token ID447
Feature activation+1.31e-01

Pos logit contributions
culus+3.641
picture+3.268
moral+3.143
rely+3.104
 footing+2.871
Neg logit contributions
sburgh-2.980
opsis-2.895
ificant-2.477
ocene-2.467
heastern-2.450
Token set
Token ID900
Feature activation-4.70e-02

Pos logit contributions
culus+0.123
picture+0.106
moral+0.103
rely+0.101
 footing+0.097
Neg logit contributions
sburgh-0.112
opsis-0.107
ificant-0.093
ocene-0.092
heastern-0.090
Token to
Token ID284
Feature activation-8.69e-02

Pos logit contributions
sburgh+0.256
opsis+0.247
ificant+0.211
ocene+0.209
heastern+0.206
Neg logit contributions
culus-0.294
picture-0.254
moral-0.249
rely-0.244
 footing-0.233
Token take
Token ID1011
Feature activation+3.02e-01

Pos logit contributions
sburgh+0.495
opsis+0.476
ificant+0.412
ocene+0.407
heastern+0.401
Neg logit contributions
culus-0.521
picture-0.448
moral-0.438
rely-0.429
 footing-0.409
Token on
Token ID319
Feature activation+7.74e-01

Pos logit contributions
culus+1.793
picture+1.546
moral+1.513
rely+1.484
 footing+1.429
Neg logit contributions
sburgh-1.729
opsis-1.676
ificant-1.447
ocene-1.435
heastern-1.410
Token the
Token ID262
Feature activation+4.48e-01

Pos logit contributions
culus+4.382
picture+3.733
moral+3.689
rely+3.578
 footing+3.524
Neg logit contributions
sburgh-4.516
opsis-4.419
ificant-3.839
ocene-3.802
heastern-3.734
Token Occupy
Token ID25551
Feature activation-2.34e+00

Pos logit contributions
culus+2.617
picture+2.238
moral+2.214
rely+2.160
 footing+2.092
Neg logit contributions
sburgh-2.585
opsis-2.522
ificant-2.174
ocene-2.143
heastern-2.114
Token Wall
Token ID5007
Feature activation-1.77e+00

Pos logit contributions
sburgh+9.666
opsis+9.299
ocene+7.510
ificant+7.309
istance+7.280
Neg logit contributions
culus-14.617
picture-13.031
rely-12.764
moral-12.586
 sexes-11.671
Token Street
Token ID3530
Feature activation-5.67e-01

Pos logit contributions
sburgh+7.208
opsis+6.596
ocene+5.102
ographical+4.995
heastern+4.942
Neg logit contributions
culus-12.438
picture-11.107
moral-10.731
rely-10.730
 sexes-10.126
Token protests
Token ID8536
Feature activation-4.66e-01

Pos logit contributions
sburgh+2.946
opsis+2.800
ificant+2.375
ocene+2.354
heastern+2.302
Neg logit contributions
culus-3.663
picture-3.182
moral-3.115
rely-3.075
 footing-2.886
Token in
Token ID287
Feature activation+9.92e-02

Pos logit contributions
sburgh+2.633
opsis+2.534
ificant+2.183
ocene+2.166
heastern+2.130
Neg logit contributions
culus-2.797
picture-2.398
moral-2.350
rely-2.299
 footing-2.171
Token Wednesday
Token ID3583
Feature activation+3.25e-02

Pos logit contributions
culus+0.581
picture+0.498
moral+0.487
rely+0.477
 footing+0.456
Neg logit contributions
sburgh-0.578
opsis-0.558
ificant-0.485
ocene-0.480
heastern-0.472
Tokenuit
Token ID5013
Feature activation+5.15e-01

Pos logit contributions
culus+0.610
picture+0.460
moral+0.433
rely+0.428
 footing+0.372
Neg logit contributions
sburgh-1.515
opsis-1.475
ificant-1.342
ocene-1.329
heastern-1.315
Token.
Token ID13
Feature activation-1.96e-01

Pos logit contributions
culus+2.911
picture+2.480
moral+2.414
rely+2.375
 footing+2.304
Neg logit contributions
sburgh-3.049
opsis-2.954
ocene-2.581
ificant-2.575
heastern-2.508
Token The
Token ID383
Feature activation-1.51e-01

Pos logit contributions
sburgh+1.030
opsis+0.988
ificant+0.843
ocene+0.834
heastern+0.819
Neg logit contributions
culus-1.265
picture-1.101
moral-1.077
rely-1.057
 footing-1.011
Token contamination
Token ID22733
Feature activation-2.83e-02

Pos logit contributions
sburgh+0.842
opsis+0.809
ificant+0.698
ocene+0.689
heastern+0.679
Neg logit contributions
culus-0.925
picture-0.797
moral-0.780
rely-0.765
 footing-0.730
Token was
Token ID373
Feature activation-8.86e-01

Pos logit contributions
sburgh+0.162
opsis+0.155
ificant+0.135
ocene+0.133
heastern+0.131
Neg logit contributions
culus-0.169
picture-0.145
moral-0.142
rely-0.139
 footing-0.132
Token eventually
Token ID4191
Feature activation-2.32e+00

Pos logit contributions
sburgh+4.627
opsis+4.385
ificant+3.853
heastern+3.712
ocene+3.657
Neg logit contributions
culus-5.543
picture-4.850
moral-4.756
rely-4.558
 footing-4.372
Token cleared
Token ID12539
Feature activation-1.19e+00

Pos logit contributions
sburgh+9.858
opsis+8.786
ificant+8.192
heastern+7.701
 repeated+7.629
Neg logit contributions
culus-14.401
picture-12.931
moral-12.786
rely-11.668
chat-11.482
Token,
Token ID11
Feature activation-3.95e-01

Pos logit contributions
sburgh+6.103
opsis+5.967
ificant+5.119
heastern+4.949
ocene+4.869
Neg logit contributions
culus-7.418
picture-6.476
moral-6.317
rely-6.100
 footing-5.725
Token as
Token ID355
Feature activation-8.94e-01

Pos logit contributions
sburgh+2.133
opsis+2.047
ificant+1.761
ocene+1.728
heastern+1.704
Neg logit contributions
culus-2.488
picture-2.150
moral-2.107
rely-2.064
 footing-1.965
Token the
Token ID262
Feature activation-5.61e-02

Pos logit contributions
sburgh+4.716
opsis+4.535
ificant+3.909
heastern+3.750
ocene+3.748
Neg logit contributions
culus-5.611
picture-4.821
moral-4.814
rely-4.625
 footing-4.331
Token material
Token ID2587
Feature activation-6.59e-01

Pos logit contributions
sburgh+0.307
opsis+0.296
ificant+0.254
ocene+0.251
heastern+0.247
Neg logit contributions
culus-0.349
picture-0.301
moral-0.295
rely-0.289
 footing-0.277
TokenApp
Token ID4677
Feature activation+5.92e-01

Pos logit contributions
culus+0.561
picture+0.485
moral+0.473
rely+0.463
 footing+0.439
Neg logit contributions
sburgh-0.548
opsis-0.528
ificant-0.459
ocene-0.453
heastern-0.447
Tokenro
Token ID305
Feature activation+2.57e-01

Pos logit contributions
culus+3.276
picture+2.773
moral+2.718
rely+2.663
 footing+2.493
Neg logit contributions
sburgh-3.560
opsis-3.453
ificant-3.041
ocene-2.992
heastern-2.987
Tokenved
Token ID1079
Feature activation+2.93e-01

Pos logit contributions
culus+1.522
picture+1.310
moral+1.284
rely+1.255
 footing+1.180
Neg logit contributions
sburgh-1.465
opsis-1.408
ificant-1.224
ocene-1.206
heastern-1.197
Token
Token ID198
Feature activation+4.68e-01

Pos logit contributions
culus+1.741
picture+1.501
moral+1.463
rely+1.435
 footing+1.372
Neg logit contributions
sburgh-1.667
opsis-1.606
ificant-1.400
ocene-1.381
heastern-1.358
Token
Token ID198
Feature activation+3.65e-01

Pos logit contributions
culus+2.486
picture+2.184
moral+2.080
rely+2.064
 footing+1.895
Neg logit contributions
sburgh-2.907
opsis-2.817
ificant-2.496
ocene-2.469
heastern-2.452
TokenAuthor
Token ID13838
Feature activation-2.32e+00

Pos logit contributions
culus+2.392
picture+2.109
moral+2.061
rely+2.019
 footing+1.902
Neg logit contributions
sburgh-1.863
opsis-1.782
ificant-1.524
ocene-1.486
heastern-1.470
Token Notes
Token ID11822
Feature activation-7.55e-01

Pos logit contributions
opsis+10.798
sburgh+10.678
ificant+8.681
ocene+8.478
ographical+8.397
Neg logit contributions
culus-14.039
moral-11.736
picture-11.657
rely-11.201
 footing-11.005
Token:
Token ID25
Feature activation-2.45e-01

Pos logit contributions
sburgh+4.153
opsis+4.022
ocene+3.393
ificant+3.392
heastern+3.313
Neg logit contributions
culus-4.569
picture-3.912
moral-3.810
rely-3.754
 footing-3.626
Token This
Token ID770
Feature activation-2.76e-01

Pos logit contributions
sburgh+1.309
opsis+1.257
ificant+1.076
ocene+1.062
heastern+1.043
Neg logit contributions
culus-1.556
picture-1.346
moral-1.323
rely-1.294
 footing-1.243
Token could
Token ID714
Feature activation+1.90e-01

Pos logit contributions
sburgh+1.488
opsis+1.429
ificant+1.223
ocene+1.206
heastern+1.187
Neg logit contributions
culus-1.741
picture-1.506
moral-1.478
rely-1.448
 footing-1.384
Token go
Token ID467
Feature activation-8.51e-01

Pos logit contributions
culus+1.211
picture+1.056
moral+1.032
rely+1.020
 footing+0.970
Neg logit contributions
sburgh-1.005
opsis-0.963
ificant-0.825
ocene-0.816
heastern-0.802
Tokenals
Token ID874
Feature activation-1.12e-01

Pos logit contributions
sburgh+0.467
opsis+0.450
ificant+0.394
ocene+0.387
heastern+0.381
Neg logit contributions
culus-0.467
picture-0.402
moral-0.392
rely-0.383
 footing-0.366
Token are
Token ID389
Feature activation-3.01e-01

Pos logit contributions
sburgh+0.646
opsis+0.623
ificant+0.540
ocene+0.533
heastern+0.525
Neg logit contributions
culus-0.658
picture-0.563
moral-0.552
rely-0.539
 footing-0.513
Token poorer
Token ID26647
Feature activation+5.56e-01

Pos logit contributions
sburgh+1.650
opsis+1.585
ificant+1.367
ocene+1.345
heastern+1.325
Neg logit contributions
culus-1.865
picture-1.612
moral-1.578
rely-1.541
 footing-1.470
Token,
Token ID11
Feature activation+2.45e-01

Pos logit contributions
culus+3.000
picture+2.611
moral+2.537
rely+2.496
 footing+2.405
Neg logit contributions
sburgh-3.418
opsis-3.298
ificant-2.890
ocene-2.859
heastern-2.781
Token black
Token ID2042
Feature activation+8.75e-01

Pos logit contributions
culus+1.480
picture+1.285
moral+1.254
rely+1.230
 footing+1.188
Neg logit contributions
sburgh-1.381
opsis-1.322
ificant-1.149
ocene-1.136
heastern-1.109
Tokener
Token ID263
Feature activation+1.49e+00

Pos logit contributions
culus+4.702
picture+4.038
moral+3.968
rely+3.832
 footing+3.664
Neg logit contributions
sburgh-5.407
opsis-5.181
ificant-4.601
ocene-4.484
heastern-4.444
Token,
Token ID11
Feature activation+5.33e-01

Pos logit contributions
culus+6.786
picture+5.955
moral+5.693
rely+5.509
 footing+5.358
Neg logit contributions
sburgh-9.885
opsis-9.276
ificant-8.360
ocene-8.212
heastern-8.099
Token less
Token ID1342
Feature activation+7.96e-02

Pos logit contributions
culus+3.066
picture+2.638
moral+2.567
rely+2.522
 footing+2.416
Neg logit contributions
sburgh-3.154
opsis-3.020
ificant-2.639
ocene-2.607
heastern-2.550
Token educated
Token ID15657
Feature activation+4.22e-01

Pos logit contributions
culus+0.467
picture+0.400
moral+0.392
rely+0.383
 footing+0.366
Neg logit contributions
sburgh-0.466
opsis-0.448
ificant-0.389
ocene-0.386
heastern-0.378
Token and
Token ID290
Feature activation+8.19e-01

Pos logit contributions
culus+2.157
picture+1.817
moral+1.768
rely+1.725
 footing+1.635
Neg logit contributions
sburgh-2.768
opsis-2.661
ificant-2.361
ocene-2.330
heastern-2.292
Token younger
Token ID7099
Feature activation+2.38e-01

Pos logit contributions
culus+4.841
picture+4.202
moral+4.082
rely+4.068
 footing+3.912
Neg logit contributions
sburgh-4.644
opsis-4.388
ificant-3.865
ocene-3.835
heastern-3.703
Token could
Token ID714
Feature activation-3.69e-01

Pos logit contributions
sburgh+1.389
opsis+1.338
ificant+1.161
ocene+1.142
heastern+1.124
Neg logit contributions
culus-1.410
picture-1.210
moral-1.186
rely-1.153
 footing-1.097
Token be
Token ID307
Feature activation+3.04e-01

Pos logit contributions
sburgh+2.185
opsis+2.102
ificant+1.828
ocene+1.798
heastern+1.774
Neg logit contributions
culus-2.137
picture-1.821
moral-1.785
rely-1.734
 footing-1.651
Token reprim
Token ID41929
Feature activation+1.38e-01

Pos logit contributions
culus+1.891
picture+1.653
moral+1.599
rely+1.583
 footing+1.542
Neg logit contributions
sburgh-1.634
opsis-1.579
ocene-1.346
ificant-1.342
heastern-1.325
Tokenanded
Token ID12249
Feature activation-1.44e-01

Pos logit contributions
culus+0.673
picture+0.564
moral+0.545
rely+0.536
 footing+0.499
Neg logit contributions
sburgh-0.939
opsis-0.907
ificant-0.800
ocene-0.795
heastern-0.785
Token.
Token ID13
Feature activation+3.61e-01

Pos logit contributions
sburgh+0.769
opsis+0.738
ificant+0.633
ocene+0.625
heastern+0.614
Neg logit contributions
culus-0.913
picture-0.792
moral-0.775
rely-0.761
 footing-0.729
Token J
Token ID449
Feature activation+1.70e+00

Pos logit contributions
culus+2.070
picture+1.811
moral+1.726
rely+1.724
 footing+1.604
Neg logit contributions
sburgh-2.085
opsis-2.039
ificant-1.773
ocene-1.768
heastern-1.754
TokenSO
Token ID15821
Feature activation-4.40e-02

Pos logit contributions
culus+8.844
picture+7.547
rely+7.126
moral+7.113
AME+6.715
Neg logit contributions
sburgh-9.807
opsis-9.313
ificant-8.322
heastern-8.222
ocene-8.156
Token said
Token ID531
Feature activation+4.26e-02

Pos logit contributions
sburgh+0.257
opsis+0.248
ificant+0.216
ocene+0.214
heastern+0.210
Neg logit contributions
culus-0.257
picture-0.220
moral-0.215
rely-0.211
 footing-0.201
Token since
Token ID1201
Feature activation+9.40e-01

Pos logit contributions
culus+0.266
picture+0.231
moral+0.226
rely+0.222
 footing+0.212
Neg logit contributions
sburgh-0.231
opsis-0.222
ificant-0.191
ocene-0.189
heastern-0.186
Token one
Token ID530
Feature activation+1.82e-01

Pos logit contributions
culus+5.287
rely+4.726
picture+4.671
moral+4.432
 footing+4.323
Neg logit contributions
sburgh-5.300
opsis-5.165
ocene-4.602
heastern-4.508
ificant-4.507
Token of
Token ID286
Feature activation+4.62e-01

Pos logit contributions
culus+0.981
picture+0.835
moral+0.808
rely+0.798
 footing+0.759
Neg logit contributions
sburgh-1.136
opsis-1.102
ificant-0.965
ocene-0.965
heastern-0.949
Token condo
Token ID32119
Feature activation-4.26e-01

Pos logit contributions
culus+3.355
picture+2.884
moral+2.844
rely+2.753
 footing+2.678
Neg logit contributions
sburgh-3.371
opsis-3.301
ificant-2.820
heastern-2.776
ocene-2.772
Token in
Token ID287
Feature activation+6.43e-01

Pos logit contributions
sburgh+2.752
opsis+2.653
ificant+2.336
ocene+2.331
heastern+2.273
Neg logit contributions
culus-2.229
picture-1.862
moral-1.827
rely-1.789
 footing-1.672
Token terms
Token ID2846
Feature activation+3.74e-01

Pos logit contributions
culus+3.744
picture+3.208
moral+3.135
rely+3.043
 footing+2.935
Neg logit contributions
sburgh-3.722
opsis-3.581
ificant-3.071
ocene-3.064
heastern-3.052
Token we
Token ID356
Feature activation+5.45e-01

Pos logit contributions
culus+2.047
picture+1.739
moral+1.698
rely+1.655
 footing+1.587
Neg logit contributions
sburgh-2.311
opsis-2.228
ificant-1.953
ocene-1.922
heastern-1.904
Token're
Token ID821
Feature activation+1.31e+00

Pos logit contributions
culus+3.221
picture+2.797
rely+2.699
moral+2.693
 footing+2.563
Neg logit contributions
sburgh-3.091
opsis-2.995
ificant-2.615
ocene-2.543
heastern-2.541
Token all
Token ID477
Feature activation+1.69e+00

Pos logit contributions
culus+7.148
picture+6.240
moral+6.125
 footing+6.121
rely+6.038
Neg logit contributions
opsis-7.519
sburgh-7.479
ificant-6.490
ocene-6.398
heastern-6.375
Token familiar
Token ID5385
Feature activation-4.10e-01

Pos logit contributions
culus+7.327
picture+6.494
 footing+6.285
moral+6.193
rely+6.146
Neg logit contributions
opsis-10.848
sburgh-10.761
ificant-9.573
heastern-9.452
ocene-9.433
Token with
Token ID351
Feature activation-1.49e-01

Pos logit contributions
sburgh+2.140
opsis+2.012
ificant+1.710
heastern+1.672
ocene+1.660
Neg logit contributions
culus-2.673
picture-2.277
moral-2.253
rely-2.233
 footing-2.052
Token,
Token ID11
Feature activation+3.53e-01

Pos logit contributions
sburgh+0.869
opsis+0.836
ificant+0.726
ocene+0.717
heastern+0.707
Neg logit contributions
culus-0.872
picture-0.746
moral-0.729
rely-0.714
 footing-0.679
Token from
Token ID422
Feature activation-1.31e-01

Pos logit contributions
culus+2.035
picture+1.746
moral+1.713
rely+1.672
 footing+1.615
Neg logit contributions
sburgh-2.076
opsis-2.003
ificant-1.745
ocene-1.729
heastern-1.697
Token catalog
Token ID18388
Feature activation+2.49e-01

Pos logit contributions
sburgh+0.751
opsis+0.721
ificant+0.625
ocene+0.618
heastern+0.607
Neg logit contributions
culus-0.781
picture-0.670
moral-0.656
rely-0.642
 footing-0.610
Token the
Token ID262
Feature activation+6.68e-01

Pos logit contributions
culus+0.302
picture+0.260
moral+0.251
rely+0.249
 footing+0.238
Neg logit contributions
sburgh-0.280
opsis-0.270
ificant-0.234
ocene-0.231
heastern-0.227
Token 10
Token ID838
Feature activation+6.54e-02

Pos logit contributions
culus+3.862
picture+3.288
moral+3.182
rely+3.141
 footing+3.017
Neg logit contributions
sburgh-3.908
opsis-3.796
ificant-3.296
ocene-3.277
heastern-3.197
Token"
Token ID1
Feature activation+1.82e-02

Pos logit contributions
culus+0.377
picture+0.322
moral+0.314
rely+0.307
 footing+0.293
Neg logit contributions
sburgh-0.388
opsis-0.374
ificant-0.326
ocene-0.322
heastern-0.318
Token 150
Token ID6640
Feature activation+2.70e-01

Pos logit contributions
culus+0.104
picture+0.088
moral+0.086
rely+0.085
 footing+0.080
Neg logit contributions
sburgh-0.110
opsis-0.106
ificant-0.092
ocene-0.091
heastern-0.090
Token watt
Token ID40114
Feature activation-2.39e-01

Pos logit contributions
culus+1.461
picture+1.238
moral+1.196
rely+1.175
 footing+1.123
Neg logit contributions
sburgh-1.687
opsis-1.638
ificant-1.433
ocene-1.424
heastern-1.401
Token J
Token ID449
Feature activation+1.62e+00

Pos logit contributions
sburgh+1.434
opsis+1.375
ificant+1.201
ocene+1.182
heastern+1.171
Neg logit contributions
culus-1.354
picture-1.156
moral-1.138
rely-1.106
 footing-1.047
TokenBL
Token ID9148
Feature activation+3.97e-02

Pos logit contributions
culus+8.541
picture+7.230
moral+6.888
rely+6.723
 footing+6.224
Neg logit contributions
sburgh-9.549
opsis-9.323
ocene-8.122
ificant-8.099
heastern-7.821
Token before
Token ID878
Feature activation+5.09e-01

Pos logit contributions
culus+0.227
picture+0.193
moral+0.189
rely+0.185
 footing+0.176
Neg logit contributions
sburgh-0.238
opsis-0.229
ificant-0.200
ocene-0.198
heastern-0.195
Token and
Token ID290
Feature activation+3.78e-01

Pos logit contributions
culus+3.065
picture+2.644
rely+2.537
moral+2.520
 footing+2.415
Neg logit contributions
sburgh-2.875
opsis-2.788
ificant-2.414
ocene-2.405
heastern-2.346
Token it
Token ID340
Feature activation+2.61e-02

Pos logit contributions
culus+2.324
picture+2.015
rely+1.935
moral+1.923
 footing+1.844
Neg logit contributions
sburgh-2.096
opsis-2.017
ificant-1.746
ocene-1.727
heastern-1.685
Token sounded
Token ID14846
Feature activation+2.19e-01

Pos logit contributions
culus+0.154
picture+0.131
moral+0.128
rely+0.126
 footing+0.121
Neg logit contributions
sburgh-0.151
opsis-0.146
ificant-0.127
ocene-0.126
heastern-0.123
Tokengow
Token ID21175
Feature activation+1.84e-01

Pos logit contributions
sburgh+3.782
opsis+3.595
ocene+3.035
ificant+3.021
heastern+2.931
Neg logit contributions
culus-5.227
picture-4.470
moral-4.397
rely-4.306
 footing-4.259
Token
Token ID198
Feature activation+4.48e-01

Pos logit contributions
culus+1.052
picture+0.905
moral+0.880
rely+0.865
 footing+0.815
Neg logit contributions
sburgh-1.098
opsis-1.058
ificant-0.925
ocene-0.914
heastern-0.903
Token
Token ID198
Feature activation+7.15e-01

Pos logit contributions
culus+2.353
picture+2.069
moral+1.969
rely+1.969
 footing+1.790
Neg logit contributions
sburgh-2.800
opsis-2.721
ificant-2.409
ocene-2.385
heastern-2.374
Token1997
Token ID21498
Feature activation+4.30e-01

Pos logit contributions
culus+3.088
picture+2.649
moral+2.537
rely+2.457
 footing+2.194
Neg logit contributions
sburgh-5.157
opsis-5.021
ificant-4.446
ocene-4.441
heastern-4.403
Token Michael
Token ID3899
Feature activation+4.23e-01

Pos logit contributions
culus+2.496
picture+2.149
moral+2.092
rely+2.065
 footing+1.957
Neg logit contributions
sburgh-2.499
opsis-2.400
ificant-2.110
ocene-2.080
heastern-2.048
Token J
Token ID449
Feature activation+1.54e+00

Pos logit contributions
culus+2.479
picture+2.147
moral+2.094
rely+2.053
 footing+1.953
Neg logit contributions
sburgh-2.416
opsis-2.320
ificant-2.051
ocene-2.019
heastern-1.970
Token.
Token ID13
Feature activation+8.02e-01

Pos logit contributions
culus+8.652
picture+7.284
moral+7.259
rely+7.135
 sexes+6.527
Neg logit contributions
sburgh-8.540
opsis-8.145
ificant-7.174
ocene-6.998
 externalToEVAOnly-6.704
Token Fox
Token ID5426
Feature activation+7.14e-01

Pos logit contributions
culus+3.571
picture+3.003
moral+2.833
rely+2.819
 footing+2.576
Neg logit contributions
sburgh-5.582
opsis-5.487
ocene-4.912
ificant-4.868
heastern-4.822
Token
Token ID198
Feature activation+5.52e-01

Pos logit contributions
culus+4.012
picture+3.437
moral+3.352
rely+3.290
 footing+3.120
Neg logit contributions
sburgh-4.269
opsis-4.119
ificant-3.601
ocene-3.560
heastern-3.511
Token
Token ID198
Feature activation+8.10e-01

Pos logit contributions
culus+2.818
picture+2.497
rely+2.374
moral+2.360
 footing+2.135
Neg logit contributions
sburgh-3.493
opsis-3.401
ificant-3.023
ocene-2.994
heastern-2.988
Token1997
Token ID21498
Feature activation+2.48e-01

Pos logit contributions
culus+3.282
picture+2.901
moral+2.742
rely+2.664
 footing+2.267
Neg logit contributions
sburgh-5.963
opsis-5.844
ificant-5.189
ocene-5.187
heastern-5.180
Token which
Token ID543
Feature activation+6.87e-01

Pos logit contributions
culus+4.185
picture+3.703
moral+3.509
rely+3.448
chat+3.305
Neg logit contributions
sburgh-4.093
opsis-3.778
ificant-3.419
ocene-3.386
heastern-3.277
Token shows
Token ID2523
Feature activation+1.08e+00

Pos logit contributions
culus+4.145
picture+3.605
rely+3.505
moral+3.451
 footing+3.299
Neg logit contributions
sburgh-3.791
opsis-3.615
ificant-3.192
ocene-3.178
heastern-3.045
Token the
Token ID262
Feature activation+7.00e-01

Pos logit contributions
culus+6.035
picture+5.189
rely+5.018
moral+4.881
 footing+4.685
Neg logit contributions
sburgh-6.321
opsis-6.045
ocene-5.332
ificant-5.199
heastern-5.124
Token item
Token ID2378
Feature activation+4.58e-01

Pos logit contributions
culus+3.887
picture+3.329
moral+3.204
rely+3.187
 footing+3.051
Neg logit contributions
sburgh-4.255
opsis-4.084
ocene-3.589
ificant-3.567
heastern-3.477
Token number
Token ID1271
Feature activation+1.77e-01

Pos logit contributions
culus+2.745
picture+2.385
moral+2.307
rely+2.288
 footing+2.201
Neg logit contributions
sburgh-2.589
opsis-2.451
ificant-2.141
ocene-2.134
heastern-2.065
Token and
Token ID290
Feature activation+3.84e-01

Pos logit contributions
culus+1.015
picture+0.871
moral+0.844
rely+0.832
 footing+0.794
Neg logit contributions
sburgh-1.048
opsis-1.007
ificant-0.881
ocene-0.869
heastern-0.855
Token U
Token ID471
Feature activation-6.61e-01

Pos logit contributions
culus+2.204
picture+1.894
moral+1.836
rely+1.813
 footing+1.734
Neg logit contributions
sburgh-2.286
opsis-2.183
ificant-1.917
ocene-1.908
heastern-1.863
TokenPC
Token ID5662
Feature activation-4.72e-01

Pos logit contributions
sburgh+4.034
opsis+3.946
ificant+3.427
ocene+3.421
heastern+3.359
Neg logit contributions
culus-3.582
picture-3.036
moral-2.956
rely-2.872
 footing-2.737
Token of
Token ID286
Feature activation+1.11e-01

Pos logit contributions
sburgh+2.583
opsis+2.488
ificant+2.134
ocene+2.105
heastern+2.075
Neg logit contributions
culus-2.927
picture-2.522
moral-2.486
rely-2.430
 footing-2.314
Token the
Token ID262
Feature activation+1.37e-01

Pos logit contributions
culus+0.661
picture+0.567
moral+0.554
rely+0.544
 footing+0.518
Neg logit contributions
sburgh-0.638
opsis-0.613
ificant-0.532
ocene-0.526
heastern-0.517
Token Best
Token ID6705
Feature activation-6.11e-01

Pos logit contributions
culus+0.744
picture+0.629
moral+0.612
rely+0.601
 footing+0.569
Neg logit contributions
sburgh-0.860
opsis-0.829
ificant-0.728
ocene-0.723
heastern-0.710
Token
Token ID447
Feature activation-2.69e-01

Pos logit contributions
culus+2.494
picture+2.215
moral+2.141
rely+2.123
 footing+1.997
Neg logit contributions
sburgh-1.881
opsis-1.824
ificant-1.538
ocene-1.527
heastern-1.514
Token
Token ID247
Feature activation+2.63e-01

Pos logit contributions
sburgh+1.462
opsis+1.395
ificant+1.198
ocene+1.188
heastern+1.164
Neg logit contributions
culus-1.687
picture-1.459
moral-1.429
rely-1.390
 footing-1.338
Tokent
Token ID83
Feature activation+1.82e-01

Pos logit contributions
culus+1.421
picture+1.196
moral+1.183
rely+1.177
 footing+1.072
Neg logit contributions
sburgh-1.646
opsis-1.588
ificant-1.389
ocene-1.379
heastern-1.365
Token figure
Token ID3785
Feature activation+1.59e-01

Pos logit contributions
culus+1.066
picture+0.916
moral+0.889
rely+0.888
 footing+0.846
Neg logit contributions
sburgh-1.056
opsis-1.014
ificant-0.888
ocene-0.882
heastern-0.862
Token any
Token ID597
Feature activation+6.49e-01

Pos logit contributions
culus+1.002
picture+0.866
moral+0.848
rely+0.838
 footing+0.804
Neg logit contributions
sburgh-0.861
opsis-0.825
ificant-0.708
ocene-0.700
heastern-0.688
Token of
Token ID286
Feature activation+3.55e-01

Pos logit contributions
culus+3.819
picture+3.288
moral+3.188
rely+3.133
 footing+3.072
Neg logit contributions
sburgh-3.667
opsis-3.558
ificant-3.039
ocene-3.023
heastern-2.997
Token this
Token ID428
Feature activation+2.54e-02

Pos logit contributions
culus+2.333
picture+2.019
moral+1.990
rely+1.966
 footing+1.892
Neg logit contributions
sburgh-1.811
opsis-1.733
ificant-1.477
ocene-1.465
heastern-1.421
Token out
Token ID503
Feature activation+1.13e-01

Pos logit contributions
culus+0.161
picture+0.140
moral+0.137
rely+0.134
 footing+0.129
Neg logit contributions
sburgh-0.136
opsis-0.131
ificant-0.112
ocene-0.111
heastern-0.109
Token.
Token ID13
Feature activation+5.10e-01

Pos logit contributions
culus+0.712
picture+0.616
moral+0.602
rely+0.590
 footing+0.567
Neg logit contributions
sburgh-0.611
opsis-0.587
ificant-0.505
ocene-0.500
heastern-0.490
Token
Token ID564
Feature activation-1.60e-01

Pos logit contributions
culus+3.087
picture+2.728
moral+2.624
rely+2.620
 Ivanka+2.471
Neg logit contributions
sburgh-2.746
opsis-2.694
heastern-2.300
ificant-2.297
ocene-2.296
Token
Token ID250
Feature activation-4.08e-01

Pos logit contributions
sburgh+0.866
opsis+0.829
ificant+0.711
ocene+0.699
heastern+0.691
Neg logit contributions
culus-1.009
picture-0.871
moral-0.855
rely-0.838
 footing-0.800
Token,
Token ID11
Feature activation-4.11e-01

Pos logit contributions
culus+2.985
picture+2.544
moral+2.484
rely+2.446
 footing+2.391
Neg logit contributions
sburgh-2.909
opsis-2.798
ificant-2.466
ocene-2.418
heastern-2.369
Token in
Token ID287
Feature activation+6.47e-03

Pos logit contributions
sburgh+2.277
opsis+2.182
ificant+1.876
ocene+1.863
heastern+1.829
Neg logit contributions
culus-2.520
picture-2.170
moral-2.130
rely-2.076
 footing-1.973
Token-
Token ID12
Feature activation-2.10e-02

Pos logit contributions
culus+0.033
picture+0.027
moral+0.027
rely+0.026
 footing+0.025
Neg logit contributions
sburgh-0.043
opsis-0.041
ificant-0.037
ocene-0.036
heastern-0.036
Tokenhouse
Token ID4803
Feature activation-7.39e-01

Pos logit contributions
sburgh+0.139
opsis+0.134
ificant+0.119
ocene+0.117
heastern+0.116
Neg logit contributions
culus-0.107
picture-0.089
moral-0.087
rely-0.084
 footing-0.080
Token online
Token ID2691
Feature activation-3.11e-01

Pos logit contributions
sburgh+4.292
opsis+4.136
ificant+3.569
ocene+3.518
heastern+3.498
Neg logit contributions
culus-4.235
picture-3.624
moral-3.554
rely-3.468
 footing-3.222
Token fundraising
Token ID18066
Feature activation+1.12e-01

Pos logit contributions
sburgh+1.784
opsis+1.717
ificant+1.488
ocene+1.470
heastern+1.447
Neg logit contributions
culus-1.844
picture-1.585
moral-1.552
rely-1.520
 footing-1.438
Token tool
Token ID2891
Feature activation+4.24e-01

Pos logit contributions
culus+0.608
picture+0.513
moral+0.500
rely+0.490
 footing+0.466
Neg logit contributions
sburgh-0.697
opsis-0.673
ificant-0.592
ocene-0.585
heastern-0.577
Token,
Token ID11
Feature activation-7.48e-02

Pos logit contributions
culus+2.474
picture+2.091
moral+2.040
rely+1.997
 footing+1.936
Neg logit contributions
sburgh-2.498
opsis-2.380
ificant-2.094
ocene-2.058
heastern-2.021
Token Sanders
Token ID5831
Feature activation+1.97e-01

Pos logit contributions
sburgh+0.450
opsis+0.434
ificant+0.379
ocene+0.375
heastern+0.369
Neg logit contributions
culus-0.425
picture-0.362
moral-0.354
rely-0.346
 footing-0.330
Token can
Token ID460
Feature activation-2.28e-01

Pos logit contributions
culus+1.221
picture+1.053
moral+1.026
rely+1.009
 footing+0.977
Neg logit contributions
sburgh-1.085
opsis-1.039
ificant-0.901
ocene-0.888
heastern-0.871
Token lean
Token ID10904
Feature activation-8.88e-02

Pos logit contributions
sburgh+1.356
opsis+1.309
ificant+1.136
ocene+1.123
heastern+1.108
Neg logit contributions
culus-1.305
picture-1.113
moral-1.091
rely-1.062
 footing-1.008
Token now
Token ID783
Feature activation+6.18e-02

Pos logit contributions
sburgh+2.425
opsis+2.335
ificant+2.022
ocene+1.996
heastern+1.966
Neg logit contributions
culus-2.540
picture-2.181
moral-2.135
rely-2.091
 footing-1.990
Token,
Token ID11
Feature activation+1.61e-01

Pos logit contributions
culus+0.370
picture+0.318
moral+0.310
rely+0.304
 footing+0.292
Neg logit contributions
sburgh-0.353
opsis-0.340
ificant-0.295
ocene-0.291
heastern-0.286
Token
Token ID447
Feature activation-9.78e-03

Pos logit contributions
culus+1.066
picture+0.936
moral+0.909
rely+0.898
 footing+0.865
Neg logit contributions
sburgh-0.819
opsis-0.777
ificant-0.664
ocene-0.652
heastern-0.642
Token
Token ID251
Feature activation-2.89e-02

Pos logit contributions
sburgh+0.060
opsis+0.058
ificant+0.050
ocene+0.050
heastern+0.049
Neg logit contributions
culus-0.055
picture-0.046
moral-0.045
rely-0.044
 footing-0.042
Token Dr
Token ID1583
Feature activation-1.19e+00

Pos logit contributions
sburgh+0.156
opsis+0.149
ificant+0.128
ocene+0.127
heastern+0.125
Neg logit contributions
culus-0.183
picture-0.159
moral-0.155
rely-0.152
 footing-0.146
Token Dum
Token ID30933
Feature activation+7.26e-01

Pos logit contributions
sburgh+5.432
opsis+5.372
ificant+4.444
ocene+4.339
heastern+4.256
Neg logit contributions
culus-7.850
moral-6.886
picture-6.872
rely-6.724
 footing-6.427
Tokenus
Token ID385
Feature activation+1.08e-01

Pos logit contributions
culus+4.830
picture+4.281
rely+4.144
moral+4.128
chat+3.790
Neg logit contributions
sburgh-3.509
opsis-3.347
ificant-2.914
ocene-2.881
heastern-2.823
Tokenque
Token ID4188
Feature activation-3.00e-01

Pos logit contributions
culus+0.549
picture+0.457
moral+0.443
rely+0.434
 footing+0.405
Neg logit contributions
sburgh-0.720
opsis-0.696
ificant-0.619
ocene-0.610
heastern-0.602
Token said
Token ID531
Feature activation-3.62e-01

Pos logit contributions
sburgh+1.498
opsis+1.446
ificant+1.211
ocene+1.192
heastern+1.177
Neg logit contributions
culus-1.991
picture-1.750
moral-1.720
rely-1.689
 footing-1.615
Token.
Token ID13
Feature activation-2.61e-01

Pos logit contributions
sburgh+1.720
opsis+1.664
ificant+1.378
ocene+1.367
heastern+1.343
Neg logit contributions
culus-2.470
picture-2.174
moral-2.145
rely-2.103
 footing-2.017
Token
Token ID198
Feature activation-3.98e-01

Pos logit contributions
sburgh+1.410
opsis+1.354
ificant+1.162
ocene+1.147
heastern+1.123
Neg logit contributions
culus-1.641
picture-1.416
moral-1.393
rely-1.364
 footing-1.309
Token in
Token ID287
Feature activation+2.00e-01

Pos logit contributions
sburgh+0.795
opsis+0.762
ificant+0.660
ocene+0.650
heastern+0.643
Neg logit contributions
culus-0.850
picture-0.731
moral-0.716
rely-0.700
 footing-0.666
Token Boston
Token ID6182
Feature activation+7.91e-02

Pos logit contributions
culus+1.185
picture+1.016
moral+0.994
rely+0.974
 footing+0.928
Neg logit contributions
sburgh-1.157
opsis-1.113
ificant-0.966
ocene-0.954
heastern-0.940
Token.
Token ID13
Feature activation-3.08e-01

Pos logit contributions
culus+0.439
picture+0.372
moral+0.363
rely+0.355
 footing+0.337
Neg logit contributions
sburgh-0.486
opsis-0.469
ificant-0.411
ocene-0.407
heastern-0.400
Token (
Token ID357
Feature activation+9.50e-03

Pos logit contributions
sburgh+1.669
opsis+1.600
ificant+1.374
ocene+1.354
heastern+1.330
Neg logit contributions
culus-1.939
picture-1.670
moral-1.643
rely-1.608
 footing-1.542
TokenWa
Token ID33484
Feature activation-6.62e-01

Pos logit contributions
culus+0.061
picture+0.053
moral+0.052
rely+0.051
 footing+0.049
Neg logit contributions
sburgh-0.050
opsis-0.048
ificant-0.041
ocene-0.041
heastern-0.040
Tokenub
Token ID549
Feature activation+6.58e-01

Pos logit contributions
sburgh+3.345
opsis+3.218
ificant+2.739
ocene+2.675
heastern+2.649
Neg logit contributions
culus-4.348
picture-3.778
moral-3.723
rely-3.636
 footing-3.556
Tokenges
Token ID3212
Feature activation+3.59e-01

Pos logit contributions
culus+3.870
picture+3.321
moral+3.292
rely+3.260
 footing+2.958
Neg logit contributions
sburgh-3.722
opsis-3.619
ificant-3.113
ocene-3.057
heastern-3.040
Tokenhig
Token ID25196
Feature activation-2.83e-01

Pos logit contributions
culus+1.513
picture+1.204
moral+1.185
rely+1.157
 footing+1.013
Neg logit contributions
sburgh-2.701
opsis-2.606
ificant-2.338
ocene-2.318
heastern-2.288
Token Rice
Token ID13823
Feature activation+3.32e-01

Pos logit contributions
sburgh+1.389
opsis+1.330
ificant+1.122
ocene+1.101
heastern+1.083
Neg logit contributions
culus-1.917
picture-1.674
moral-1.648
rely-1.610
 footing-1.551
Token/
Token ID14
Feature activation-4.35e-01

Pos logit contributions
culus+1.884
picture+1.634
moral+1.573
rely+1.553
 footing+1.457
Neg logit contributions
sburgh-1.980
opsis-1.908
ificant-1.672
ocene-1.656
heastern-1.634
TokenCBC
Token ID29208
Feature activation+4.65e-01

Pos logit contributions
sburgh+1.942
opsis+1.844
ificant+1.527
ocene+1.506
heastern+1.462
Neg logit contributions
culus-3.135
picture-2.731
moral-2.715
rely-2.658
 footing-2.597
Token interested
Token ID4609
Feature activation+6.06e-02

Pos logit contributions
sburgh+0.625
opsis+0.602
ificant+0.525
ocene+0.518
heastern+0.511
Neg logit contributions
culus-0.615
picture-0.526
moral-0.514
rely-0.503
 footing-0.478
Token in
Token ID287
Feature activation-6.22e-02

Pos logit contributions
culus+0.373
picture+0.321
moral+0.314
rely+0.308
 footing+0.292
Neg logit contributions
sburgh-0.336
opsis-0.323
ificant-0.279
ocene-0.276
heastern-0.272
Token information
Token ID1321
Feature activation-4.75e-01

Pos logit contributions
sburgh+0.359
opsis+0.345
ificant+0.300
ocene+0.296
heastern+0.291
Neg logit contributions
culus-0.368
picture-0.316
moral-0.308
rely-0.301
 footing-0.286
Token about
Token ID546
Feature activation-3.75e-01

Pos logit contributions
sburgh+2.548
opsis+2.443
ificant+2.102
ocene+2.068
heastern+2.035
Neg logit contributions
culus-2.991
picture-2.590
moral-2.532
rely-2.483
 footing-2.358
Token Google
Token ID3012
Feature activation-1.78e-01

Pos logit contributions
sburgh+2.075
opsis+1.981
ificant+1.721
ocene+1.684
heastern+1.660
Neg logit contributions
culus-2.292
picture-1.982
moral-1.929
rely-1.882
 footing-1.792
Token allegedly
Token ID7910
Feature activation-1.09e+00

Pos logit contributions
sburgh+0.925
opsis+0.884
ificant+0.756
ocene+0.743
heastern+0.729
Neg logit contributions
culus-1.163
picture-1.013
moral-0.991
rely-0.975
 footing-0.930
Token limiting
Token ID15637
Feature activation+4.58e-01

Pos logit contributions
sburgh+5.871
opsis+5.594
ificant+4.866
heastern+4.708
ocene+4.621
Neg logit contributions
culus-6.513
picture-5.702
moral-5.447
rely-5.361
 sexes-4.896
Token access
Token ID1895
Feature activation-4.26e-02

Pos logit contributions
culus+2.574
picture+2.172
moral+2.141
rely+2.101
 footing+2.033
Neg logit contributions
sburgh-2.769
opsis-2.669
ificant-2.344
ocene-2.327
heastern-2.290
Token to
Token ID284
Feature activation+8.04e-02

Pos logit contributions
sburgh+0.224
opsis+0.215
ificant+0.183
ocene+0.181
heastern+0.178
Neg logit contributions
culus-0.275
picture-0.239
moral-0.234
rely-0.230
 footing-0.219
Token rival
Token ID8976
Feature activation-8.75e-02

Pos logit contributions
culus+0.459
picture+0.391
moral+0.382
rely+0.375
 footing+0.358
Neg logit contributions
sburgh-0.482
opsis-0.465
ificant-0.406
ocene-0.402
heastern-0.396
Token search
Token ID2989
Feature activation+9.41e-02

Pos logit contributions
sburgh+0.541
opsis+0.522
ificant+0.458
ocene+0.452
heastern+0.446
Neg logit contributions
culus-0.482
picture-0.408
moral-0.398
rely-0.389
 footing-0.369
Token-
Token ID12
Feature activation+6.22e-01

Pos logit contributions
culus+3.716
picture+3.117
moral+2.978
rely+2.964
 footing+2.828
Neg logit contributions
sburgh-4.849
opsis-4.650
ocene-4.085
ificant-4.046
heastern-4.024
Tokenround
Token ID744
Feature activation-7.78e-01

Pos logit contributions
culus+3.353
picture+2.898
moral+2.791
rely+2.744
 footing+2.550
Neg logit contributions
sburgh-3.894
opsis-3.747
ificant-3.273
ocene-3.246
heastern-3.209
Token.
Token ID13
Feature activation-3.57e-01

Pos logit contributions
sburgh+4.465
opsis+4.269
ificant+3.733
ocene+3.632
heastern+3.621
Neg logit contributions
culus-4.540
picture-3.888
moral-3.839
rely-3.674
 footing-3.440
Token
Token ID198
Feature activation-4.74e-01

Pos logit contributions
sburgh+1.916
opsis+1.836
ificant+1.577
ocene+1.552
heastern+1.527
Neg logit contributions
culus-2.264
picture-1.960
moral-1.922
rely-1.884
 footing-1.803
Token
Token ID198
Feature activation-2.51e-01

Pos logit contributions
sburgh+2.579
opsis+2.469
ificant+2.110
ocene+2.080
heastern+2.026
Neg logit contributions
culus-2.979
picture-2.512
moral-2.495
rely-2.418
 footing-2.366
TokenThe
Token ID464
Feature activation-1.89e-01

Pos logit contributions
sburgh+1.293
opsis+1.238
ificant+1.054
ocene+1.040
heastern+1.022
Neg logit contributions
culus-1.639
picture-1.429
moral-1.400
rely-1.374
 footing-1.316
Token ice
Token ID4771
Feature activation-4.29e-01

Pos logit contributions
sburgh+1.059
opsis+1.017
ificant+0.879
ocene+0.867
heastern+0.854
Neg logit contributions
culus-1.154
picture-0.995
moral-0.973
rely-0.954
 footing-0.910
Tokenbreaker
Token ID25766
Feature activation+4.77e-02

Pos logit contributions
sburgh+2.733
opsis+2.637
ificant+2.335
ocene+2.296
heastern+2.282
Neg logit contributions
culus-2.248
picture-1.889
moral-1.847
rely-1.815
 footing-1.695
Token Lenin
Token ID22899
Feature activation-2.80e-01

Pos logit contributions
culus+0.281
picture+0.241
moral+0.235
rely+0.231
 footing+0.221
Neg logit contributions
sburgh-0.276
opsis-0.266
ificant-0.231
ocene-0.229
heastern-0.225
Token was
Token ID373
Feature activation-1.35e-01

Pos logit contributions
sburgh+1.600
opsis+1.538
ificant+1.331
ocene+1.314
heastern+1.294
Neg logit contributions
culus-1.677
picture-1.443
moral-1.412
rely-1.382
 footing-1.320
Token the
Token ID262
Feature activation+3.10e-01

Pos logit contributions
sburgh+0.711
opsis+0.683
ificant+0.583
ocene+0.576
heastern+0.565
Neg logit contributions
culus-0.872
picture-0.758
moral-0.742
rely-0.729
 footing-0.699
Token to
Token ID284
Feature activation+3.66e-01

Pos logit contributions
sburgh+2.501
opsis+2.347
ificant+2.037
ocene+1.991
heastern+1.966
Neg logit contributions
culus-3.498
picture-3.076
moral-3.021
rely-2.949
 footing-2.770
Token vote
Token ID3015
Feature activation+1.81e-01

Pos logit contributions
culus+1.911
picture+1.623
rely+1.577
moral+1.567
 footing+1.493
Neg logit contributions
sburgh-2.342
opsis-2.251
ocene-1.982
ificant-1.970
heastern-1.946
Token,
Token ID11
Feature activation+5.73e-01

Pos logit contributions
culus+1.050
picture+0.898
moral+0.877
rely+0.859
 footing+0.818
Neg logit contributions
sburgh-1.066
opsis-1.027
ificant-0.894
ocene-0.883
heastern-0.870
Token and
Token ID290
Feature activation-5.43e-02

Pos logit contributions
culus+3.255
picture+2.817
moral+2.726
rely+2.699
 footing+2.630
Neg logit contributions
sburgh-3.381
opsis-3.279
ocene-2.842
ificant-2.785
heastern-2.777
Token a
Token ID257
Feature activation-3.52e-01

Pos logit contributions
sburgh+0.314
opsis+0.302
ificant+0.262
ocene+0.261
heastern+0.255
Neg logit contributions
culus-0.321
picture-0.276
moral-0.269
rely-0.265
 footing-0.253
Token record
Token ID1700
Feature activation-9.76e-01

Pos logit contributions
sburgh+1.869
opsis+1.794
ificant+1.545
ocene+1.508
heastern+1.497
Neg logit contributions
culus-2.231
picture-1.933
moral-1.896
rely-1.862
 footing-1.773
Token 11
Token ID1367
Feature activation+2.30e-01

Pos logit contributions
sburgh+6.016
opsis+5.952
ificant+5.311
heastern+5.096
ocene+5.062
Neg logit contributions
culus-5.102
picture-4.227
moral-4.152
rely-4.023
 footing-3.698
Token.
Token ID13
Feature activation-1.90e-01

Pos logit contributions
culus+1.546
picture+1.357
moral+1.331
rely+1.305
 footing+1.258
Neg logit contributions
sburgh-1.136
opsis-1.084
ificant-0.908
ocene-0.907
heastern-0.882
Token2
Token ID17
Feature activation-6.98e-02

Pos logit contributions
sburgh+1.211
opsis+1.170
ificant+1.032
ocene+1.019
heastern+1.006
Neg logit contributions
culus-1.011
picture-0.849
moral-0.827
rely-0.809
 footing-0.767
Token million
Token ID1510
Feature activation+6.84e-01

Pos logit contributions
sburgh+0.313
opsis+0.298
ificant+0.248
ocene+0.243
heastern+0.238
Neg logit contributions
culus-0.503
picture-0.443
moral-0.435
rely-0.428
 footing-0.412
Token voted
Token ID7052
Feature activation+7.47e-02

Pos logit contributions
culus+3.615
picture+3.074
rely+2.970
moral+2.965
 footing+2.823
Neg logit contributions
sburgh-4.292
opsis-4.105
ocene-3.650
ificant-3.610
heastern-3.556
Token has
Token ID468
Feature activation+2.07e-01

Pos logit contributions
culus+4.373
picture+3.735
moral+3.645
rely+3.581
 footing+3.497
Neg logit contributions
sburgh-4.322
opsis-4.165
ificant-3.666
ocene-3.621
heastern-3.539
Token set
Token ID900
Feature activation+1.95e-01

Pos logit contributions
culus+1.275
picture+1.106
moral+1.079
rely+1.070
 footing+1.031
Neg logit contributions
sburgh-1.143
opsis-1.097
ificant-0.945
ocene-0.941
heastern-0.922
Token out
Token ID503
Feature activation+3.04e-01

Pos logit contributions
culus+1.201
picture+1.037
moral+1.015
rely+0.994
 footing+0.953
Neg logit contributions
sburgh-1.086
opsis-1.042
ificant-0.895
ocene-0.887
heastern-0.872
Token an
Token ID281
Feature activation+3.50e-01

Pos logit contributions
culus+1.717
picture+1.469
moral+1.435
rely+1.396
 footing+1.344
Neg logit contributions
sburgh-1.832
opsis-1.766
ificant-1.536
ocene-1.527
heastern-1.503
Token ambitious
Token ID14742
Feature activation-2.96e-02

Pos logit contributions
culus+1.793
picture+1.503
moral+1.461
rely+1.438
 footing+1.368
Neg logit contributions
sburgh-2.284
opsis-2.206
ificant-1.930
heastern-1.914
ocene-1.910
Token goal
Token ID3061
Feature activation-5.58e-01

Pos logit contributions
sburgh+0.174
opsis+0.168
ificant+0.146
ocene+0.144
heastern+0.142
Neg logit contributions
culus-0.173
picture-0.148
moral-0.144
rely-0.141
 footing-0.134
Token for
Token ID329
Feature activation+4.36e-01

Pos logit contributions
sburgh+3.159
opsis+3.059
ificant+2.662
ocene+2.618
heastern+2.583
Neg logit contributions
culus-3.318
picture-2.823
moral-2.773
rely-2.730
 footing-2.536
Token the
Token ID262
Feature activation+6.38e-02

Pos logit contributions
culus+2.472
picture+2.125
moral+2.082
rely+2.045
 footing+1.960
Neg logit contributions
sburgh-2.592
opsis-2.508
ificant-2.174
ocene-2.141
heastern-2.129
Token Paris
Token ID6342
Feature activation-4.50e-01

Pos logit contributions
culus+0.367
picture+0.315
moral+0.308
rely+0.300
 footing+0.289
Neg logit contributions
sburgh-0.379
opsis-0.365
ificant-0.318
ocene-0.311
heastern-0.310
Token meeting
Token ID3249
Feature activation-2.29e-01

Pos logit contributions
sburgh+2.528
opsis+2.424
ificant+2.101
ocene+2.084
heastern+2.038
Neg logit contributions
culus-2.738
picture-2.344
moral-2.292
rely-2.256
 footing-2.133
Token
Token ID784
Feature activation+4.11e-01

Pos logit contributions
sburgh+1.341
opsis+1.292
ificant+1.123
ocene+1.110
heastern+1.093
Neg logit contributions
culus-1.333
picture-1.140
moral-1.115
rely-1.093
 footing-1.037
Token sim
Token ID985
Feature activation+2.37e-01

Pos logit contributions
culus+0.698
picture+0.595
moral+0.582
rely+0.570
 footing+0.546
Neg logit contributions
sburgh-0.714
opsis-0.690
ificant-0.599
ocene-0.595
heastern-0.585
Token lock
Token ID5793
Feature activation-6.22e-01

Pos logit contributions
culus+1.402
picture+1.197
moral+1.171
rely+1.142
 footing+1.092
Neg logit contributions
sburgh-1.374
opsis-1.320
ificant-1.142
ocene-1.135
heastern-1.116
Token on
Token ID319
Feature activation+1.14e-01

Pos logit contributions
sburgh+3.501
opsis+3.395
ificant+2.934
ocene+2.860
heastern+2.833
Neg logit contributions
culus-3.699
picture-3.173
moral-3.115
rely-3.065
 footing-2.896
Token the
Token ID262
Feature activation+1.76e-02

Pos logit contributions
culus+0.702
picture+0.606
moral+0.592
rely+0.582
 footing+0.557
Neg logit contributions
sburgh-0.636
opsis-0.610
ificant-0.526
ocene-0.521
heastern-0.511
Token device
Token ID3335
Feature activation-5.27e-02

Pos logit contributions
culus+0.097
picture+0.083
moral+0.081
rely+0.079
 footing+0.075
Neg logit contributions
sburgh-0.109
opsis-0.105
ificant-0.092
ocene-0.091
heastern-0.090
Token as
Token ID355
Feature activation-8.99e-01

Pos logit contributions
sburgh+0.290
opsis+0.277
ificant+0.240
ocene+0.237
heastern+0.233
Neg logit contributions
culus-0.328
picture-0.284
moral-0.277
rely-0.272
 footing-0.262
Token opposed
Token ID6886
Feature activation-4.84e-01

Pos logit contributions
sburgh+4.154
opsis+3.932
ificant+3.295
ocene+3.235
heastern+3.178
Neg logit contributions
culus-6.194
picture-5.419
moral-5.396
rely-5.146
 footing-4.996
Token to
Token ID284
Feature activation-8.67e-01

Pos logit contributions
sburgh+2.463
opsis+2.348
ificant+1.991
ocene+1.954
heastern+1.933
Neg logit contributions
culus-3.185
picture-2.761
moral-2.730
rely-2.651
 footing-2.555
Token the
Token ID262
Feature activation-6.34e-01

Pos logit contributions
sburgh+4.644
opsis+4.395
ificant+3.831
heastern+3.707
ocene+3.691
Neg logit contributions
culus-5.274
picture-4.575
moral-4.519
rely-4.322
 footing-4.133
Token Google
Token ID3012
Feature activation-5.47e-01

Pos logit contributions
sburgh+3.586
opsis+3.419
ificant+2.992
ocene+2.932
heastern+2.915
Neg logit contributions
culus-3.741
picture-3.216
moral-3.171
rely-3.091
 footing-2.927
Token Play
Token ID3811
Feature activation-2.47e-01

Pos logit contributions
sburgh+3.299
opsis+3.164
ificant+2.769
ocene+2.739
heastern+2.686
Neg logit contributions
culus-3.035
picture-2.585
moral-2.536
rely-2.482
 footing-2.314
Token so
Token ID523
Feature activation-6.34e-02

Pos logit contributions
sburgh+2.713
opsis+2.605
ificant+2.250
ocene+2.202
heastern+2.172
Neg logit contributions
culus-3.119
picture-2.698
moral-2.654
rely-2.597
 footing-2.470
Token much
Token ID881
Feature activation-1.92e-01

Pos logit contributions
sburgh+0.339
opsis+0.325
ificant+0.279
ocene+0.275
heastern+0.271
Neg logit contributions
culus-0.403
picture-0.349
moral-0.342
rely-0.336
 footing-0.322
Token more
Token ID517
Feature activation-7.91e-02

Pos logit contributions
sburgh+1.010
opsis+0.968
ificant+0.826
ocene+0.818
heastern+0.803
Neg logit contributions
culus-1.234
picture-1.073
moral-1.049
rely-1.032
 footing-0.990
Token to
Token ID284
Feature activation-5.87e-01

Pos logit contributions
sburgh+0.430
opsis+0.412
ificant+0.353
ocene+0.351
heastern+0.344
Neg logit contributions
culus-0.497
picture-0.431
moral-0.420
rely-0.413
 footing-0.397
Token it
Token ID340
Feature activation-3.75e-01

Pos logit contributions
sburgh+3.039
opsis+2.903
ificant+2.484
ocene+2.437
heastern+2.399
Neg logit contributions
culus-3.802
picture-3.312
moral-3.258
rely-3.197
 footing-3.055
Token than
Token ID621
Feature activation-1.67e+00

Pos logit contributions
sburgh+1.999
opsis+1.917
ificant+1.642
ocene+1.618
heastern+1.592
Neg logit contributions
culus-2.381
picture-2.065
moral-2.025
rely-1.986
 footing-1.899
Token that
Token ID326
Feature activation-4.41e-01

Pos logit contributions
sburgh+7.594
opsis+7.013
ificant+5.871
ocene+5.510
heastern+5.505
Neg logit contributions
culus-11.006
picture-9.808
moral-9.729
rely-9.206
 sexes-8.821
Token like
Token ID588
Feature activation-8.01e-01

Pos logit contributions
sburgh+2.217
opsis+2.122
ificant+1.797
ocene+1.765
heastern+1.728
Neg logit contributions
culus-2.931
picture-2.552
moral-2.518
rely-2.472
 footing-2.363
Token working
Token ID1762
Feature activation-5.79e-01

Pos logit contributions
sburgh+4.380
opsis+4.148
ificant+3.593
ocene+3.472
heastern+3.448
Neg logit contributions
culus-4.934
picture-4.272
moral-4.212
rely-4.038
 footing-3.859
Token hard
Token ID1327
Feature activation-6.18e-02

Pos logit contributions
sburgh+2.932
opsis+2.784
ificant+2.373
ocene+2.324
heastern+2.292
Neg logit contributions
culus-3.831
picture-3.329
moral-3.286
rely-3.196
 footing-3.057
Token concentrating
Token ID37367
Feature activation+3.37e-01

Pos logit contributions
sburgh+0.370
opsis+0.358
ificant+0.312
ocene+0.309
heastern+0.304
Neg logit contributions
culus-0.352
picture-0.301
moral-0.293
rely-0.287
 footing-0.273
Token NFL
Token ID5134
Feature activation+1.05e+00

Pos logit contributions
culus+1.798
picture+1.534
moral+1.501
rely+1.471
 footing+1.407
Neg logit contributions
sburgh-1.913
opsis-1.855
ificant-1.608
ocene-1.590
heastern-1.571
Token need
Token ID761
Feature activation-7.25e-01

Pos logit contributions
culus+5.968
picture+5.139
moral+5.004
rely+4.900
 footing+4.736
Neg logit contributions
sburgh-5.977
opsis-5.893
ocene-5.031
heastern-5.009
ificant-5.001
Token to
Token ID284
Feature activation+1.79e-01

Pos logit contributions
sburgh+3.754
opsis+3.583
ificant+3.084
ocene+3.050
heastern+2.965
Neg logit contributions
culus-4.591
picture-3.990
moral-3.880
rely-3.813
 footing-3.621
Token protect
Token ID1805
Feature activation-4.58e-01

Pos logit contributions
culus+1.059
picture+0.910
moral+0.887
rely+0.883
 footing+0.843
Neg logit contributions
sburgh-1.027
opsis-0.991
ificant-0.862
ocene-0.855
heastern-0.841
Token the
Token ID262
Feature activation-2.08e-01

Pos logit contributions
sburgh+2.559
opsis+2.446
ificant+2.116
ocene+2.075
heastern+2.041
Neg logit contributions
culus-2.792
picture-2.399
moral-2.337
rely-2.281
 footing-2.173
Token whistleblower
Token ID34875
Feature activation-1.72e+00

Pos logit contributions
sburgh+1.184
opsis+1.138
ificant+0.985
ocene+0.971
heastern+0.957
Neg logit contributions
culus-1.252
picture-1.076
moral-1.051
rely-1.030
 footing-0.981
Token(
Token ID7
Feature activation-6.70e-01

Pos logit contributions
sburgh+9.173
opsis+8.847
ificant+7.576
 somewhere+7.434
ocene+7.320
Neg logit contributions
culus-9.827
picture-8.323
moral-8.213
rely-7.871
chat-7.309
Tokens
Token ID82
Feature activation-7.56e-02

Pos logit contributions
sburgh+3.219
opsis+3.037
ificant+2.556
ocene+2.519
heastern+2.441
Neg logit contributions
culus-4.624
picture-4.016
moral-3.935
rely-3.869
 footing-3.743
Token)
Token ID8
Feature activation-4.38e-01

Pos logit contributions
sburgh+0.455
opsis+0.439
ificant+0.383
ocene+0.379
heastern+0.372
Neg logit contributions
culus-0.429
picture-0.365
moral-0.356
rely-0.347
 footing-0.330
Token in
Token ID287
Feature activation-1.72e-01

Pos logit contributions
sburgh+2.473
opsis+2.380
ificant+2.057
ocene+2.030
heastern+1.994
Neg logit contributions
culus-2.643
picture-2.264
moral-2.220
rely-2.173
 footing-2.063
Token Bounty
Token ID38166
Feature activation+8.82e-02

Pos logit contributions
sburgh+0.930
opsis+0.893
ificant+0.767
ocene+0.757
heastern+0.743
Neg logit contributions
culus-1.085
picture-0.940
moral-0.920
rely-0.902
 footing-0.863
Token the
Token ID262
Feature activation-4.26e-01

Pos logit contributions
sburgh+2.389
opsis+2.292
ificant+1.953
ocene+1.914
heastern+1.888
Neg logit contributions
culus-2.914
picture-2.547
moral-2.503
rely-2.433
 footing-2.327
Token ping
Token ID29400
Feature activation+1.37e-01

Pos logit contributions
sburgh+2.262
opsis+2.169
ificant+1.850
ocene+1.819
heastern+1.796
Neg logit contributions
culus-2.703
picture-2.348
moral-2.314
rely-2.256
 footing-2.157
Token tests
Token ID5254
Feature activation-2.98e-01

Pos logit contributions
culus+0.796
picture+0.683
moral+0.661
rely+0.652
 footing+0.619
Neg logit contributions
sburgh-0.809
opsis-0.777
ificant-0.679
ocene-0.674
heastern-0.662
Token will
Token ID481
Feature activation-8.17e-01

Pos logit contributions
sburgh+1.592
opsis+1.528
ificant+1.308
ocene+1.289
heastern+1.269
Neg logit contributions
culus-1.890
picture-1.639
moral-1.607
rely-1.575
 footing-1.507
Token be
Token ID307
Feature activation-9.10e-01

Pos logit contributions
sburgh+3.772
opsis+3.591
ificant+2.985
ocene+2.882
heastern+2.877
Neg logit contributions
culus-5.673
picture-4.948
moral-4.939
rely-4.733
 footing-4.650
Token stored
Token ID8574
Feature activation-1.85e+00

Pos logit contributions
sburgh+4.507
opsis+4.367
ificant+3.700
ocene+3.582
heastern+3.547
Neg logit contributions
culus-5.912
picture-5.147
moral-5.142
rely-4.993
 footing-4.793
Token in
Token ID287
Feature activation-6.50e-01

Pos logit contributions
sburgh+8.391
opsis+8.151
 somewhere+7.031
ificant+6.700
heastern+6.647
Neg logit contributions
culus-11.651
moral-10.357
picture-10.212
rely-9.780
 footing-9.296
Token a
Token ID257
Feature activation-9.63e-01

Pos logit contributions
sburgh+3.455
opsis+3.321
ificant+2.864
ocene+2.784
heastern+2.739
Neg logit contributions
culus-4.032
picture-3.510
moral-3.487
rely-3.382
 footing-3.210
Token S
Token ID311
Feature activation-2.20e-01

Pos logit contributions
sburgh+4.964
opsis+4.749
ificant+4.092
heastern+3.929
ocene+3.916
Neg logit contributions
culus-6.100
moral-5.286
picture-5.271
rely-5.136
 footing-4.818
Token3
Token ID18
Feature activation-1.34e+00

Pos logit contributions
sburgh+1.303
opsis+1.258
ificant+1.100
ocene+1.085
heastern+1.075
Neg logit contributions
culus-1.252
picture-1.073
moral-1.048
rely-1.023
 footing-0.980
Token bucket
Token ID19236
Feature activation-1.01e+00

Pos logit contributions
sburgh+5.099
opsis+4.777
ificant+3.994
heastern+3.801
ocene+3.665
Neg logit contributions
culus-9.961
moral-8.930
picture-8.735
rely-8.619
 sexes-8.250
Tokendesigned
Token ID30473
Feature activation+2.29e-01

Pos logit contributions
sburgh+0.492
opsis+0.477
ificant+0.426
ocene+0.422
heastern+0.417
Neg logit contributions
culus-0.333
picture-0.275
moral-0.266
rely-0.259
 footing-0.243
Token houses
Token ID7777
Feature activation-3.66e-01

Pos logit contributions
culus+1.339
picture+1.149
moral+1.117
rely+1.095
 footing+1.053
Neg logit contributions
sburgh-1.335
opsis-1.284
ificant-1.119
ocene-1.110
heastern-1.093
Token that
Token ID326
Feature activation+9.58e-02

Pos logit contributions
sburgh+2.040
opsis+1.962
ificant+1.689
ocene+1.667
heastern+1.648
Neg logit contributions
culus-2.230
picture-1.920
moral-1.879
rely-1.845
 footing-1.754
Token can
Token ID460
Feature activation-5.81e-01

Pos logit contributions
culus+0.556
picture+0.477
moral+0.463
rely+0.458
 footing+0.436
Neg logit contributions
sburgh-0.562
opsis-0.542
ificant-0.475
ocene-0.469
heastern-0.461
Token be
Token ID307
Feature activation-5.65e-01

Pos logit contributions
sburgh+2.984
opsis+2.859
ificant+2.408
ocene+2.383
heastern+2.331
Neg logit contributions
culus-3.796
picture-3.291
moral-3.262
rely-3.162
 footing-3.036
Token rented
Token ID26399
Feature activation-1.96e+00

Pos logit contributions
sburgh+2.792
opsis+2.661
ificant+2.252
ocene+2.218
heastern+2.175
Neg logit contributions
culus-3.799
picture-3.316
moral-3.276
rely-3.203
 footing-3.064
Token out
Token ID503
Feature activation-5.45e-01

Pos logit contributions
sburgh+8.971
opsis+8.645
 somewhere+7.436
ificant+7.162
heastern+7.102
Neg logit contributions
culus-12.131
picture-10.524
moral-10.376
rely-10.188
 footing-9.251
Token.
Token ID13
Feature activation-1.04e-01

Pos logit contributions
sburgh+2.961
opsis+2.844
ificant+2.435
ocene+2.397
heastern+2.378
Neg logit contributions
culus-3.395
picture-2.935
moral-2.881
rely-2.818
 footing-2.675
Token
Token ID198
Feature activation+1.01e-01

Pos logit contributions
sburgh+0.578
opsis+0.556
ificant+0.480
ocene+0.474
heastern+0.467
Neg logit contributions
culus-0.637
picture-0.551
moral-0.538
rely-0.528
 footing-0.503
Token
Token ID198
Feature activation+7.47e-02

Pos logit contributions
culus+0.573
picture+0.495
moral+0.480
rely+0.472
 footing+0.443
Neg logit contributions
sburgh-0.608
opsis-0.587
ificant-0.514
ocene-0.508
heastern-0.502
TokenThe
Token ID464
Feature activation-5.34e-02

Pos logit contributions
culus+0.475
picture+0.423
moral+0.409
rely+0.400
 footing+0.377
Neg logit contributions
sburgh-0.397
opsis-0.379
ificant-0.329
ocene-0.321
heastern-0.318
Token
Token ID447
Feature activation-2.88e-01

Pos logit contributions
sburgh+0.359
opsis+0.345
ificant+0.290
ocene+0.288
heastern+0.282
Neg logit contributions
culus-0.488
picture-0.428
moral-0.419
rely-0.412
 footing-0.398
Token
Token ID247
Feature activation-1.75e-01

Pos logit contributions
sburgh+1.691
opsis+1.620
ificant+1.405
ocene+1.386
heastern+1.366
Neg logit contributions
culus-1.670
picture-1.430
moral-1.392
rely-1.369
 footing-1.303
Tokens
Token ID82
Feature activation+4.48e-02

Pos logit contributions
sburgh+1.005
opsis+0.967
ificant+0.838
ocene+0.831
heastern+0.816
Neg logit contributions
culus-1.041
picture-0.897
moral-0.876
rely-0.857
 footing-0.817
Token what
Token ID644
Feature activation-6.60e-01

Pos logit contributions
culus+0.269
picture+0.230
moral+0.226
rely+0.222
 footing+0.212
Neg logit contributions
sburgh-0.256
opsis-0.247
ocene-0.213
ificant-0.213
heastern-0.208
Token the
Token ID262
Feature activation-6.35e-01

Pos logit contributions
sburgh+3.501
opsis+3.347
ificant+2.870
ocene+2.804
heastern+2.783
Neg logit contributions
culus-4.194
picture-3.632
moral-3.561
rely-3.481
 footing-3.329
Token Consumer
Token ID18110
Feature activation-1.85e+00

Pos logit contributions
sburgh+3.738
opsis+3.556
ificant+3.109
ocene+3.069
heastern+3.040
Neg logit contributions
culus-3.673
picture-3.142
moral-3.064
rely-3.001
 footing-2.836
Token Financial
Token ID11302
Feature activation-2.25e-01

Pos logit contributions
sburgh+7.476
opsis+7.141
ificant+5.872
ists+5.837
ocene+5.625
Neg logit contributions
culus-12.774
picture-11.053
rely-10.611
moral-10.537
 sexes-10.446
Token Protection
Token ID9985
Feature activation+1.85e-01

Pos logit contributions
sburgh+1.194
opsis+1.145
ificant+0.981
ocene+0.967
heastern+0.951
Neg logit contributions
culus-1.435
picture-1.246
moral-1.220
rely-1.197
 footing-1.146
Token Bureau
Token ID9840
Feature activation-7.82e-01

Pos logit contributions
culus+1.048
picture+0.889
moral+0.866
rely+0.850
 footing+0.810
Neg logit contributions
sburgh-1.115
opsis-1.077
ificant-0.949
ocene-0.936
heastern-0.921
Token
Token ID960
Feature activation+2.70e-01

Pos logit contributions
sburgh+4.083
opsis+3.925
ificant+3.300
ocene+3.285
heastern+3.236
Neg logit contributions
culus-4.913
picture-4.290
moral-4.248
rely-4.165
 footing-3.878
Tokenwhich
Token ID4758
Feature activation+6.11e-01

Pos logit contributions
culus+1.538
picture+1.327
moral+1.292
rely+1.260
 footing+1.203
Neg logit contributions
sburgh-1.607
opsis-1.565
ificant-1.353
ocene-1.338
heastern-1.316