TOP ACTIVATIONS
MAX = 8.109

Irish woman by a grand Jury last year .
was designed as shown in Figure 1 . The maze was
slide Power Point slide PNG larger image
must select a newsletter to subscribe to . Sign Up You
must select a newsletter to subscribe to . Sign Up You
. S . Army active duty roster to 540 , 000
must select a newsletter to subscribe to . Sign Up You
s urgent needs , says Capt . Nate Ter ning ,
must select a newsletter to subscribe to . Sign Up You
others Anyone who follows food in Houston probably sc
d like to follow them . And let us
IFF original image Download : Figure 1 . Maze and task
media lying to follow the hashtag # femin in
room . P PT PowerPoint slide Power Point slide
N air , Ford group vice president , Global Product Development
to help teach others to follow in the same footsteps .
the way and the government follows , that would create the
and believes that because they follow the trends and stay fresh
price - to - ear nings multiple may be justified .
said AP X S Principal Investigator R alf G ell ert

MIN ACTIVATIONS
MIN = -8.785

of Nebraska currently has a ban against gay marriage , the
that the EU lift its ban on washing poultry meat in
call for his temporary travel ban , This piece
a border wall and a ban on chain migration
id wrote : Advertisement Advertisement
blowing up Advertisement N PH :
Claire ville Reserv oir in Bram pton to the shore of
forcing a fumble on Rob Gron kowski during 7 - on
after Trump tweeted about his ban , he offered a tweet
Recently the European union has banned the use of the three
In the face of these discrim inations , it is currently
sites reported . Advertisement Continue reading the main story
hard and we have a physical group that is setting the
jail time . Advertisement Advertisement
. Advertisement Continue reading the main story
Some interviews had already been carried out under the
dead son . Advertisement Yes ! Just
the day . Advertisement : Indeed ,
while if you have a physical manifestation of how much money
only rarely . Advertisement Continue reading the main story

INTERVAL 3.886 - 8.109
CONTAINS 0.757%

so far mostly involved ethnic Korean citizens of South Korea ,
large crowd of pro - government mar chers chanting slogans against
the NDP s deputy critic for democratic reform ,
potential side effects . The aim of this study is to
exploded , said Malcolm Cr ick , Secretary of

INTERVAL -0.338 - 3.886
CONTAINS 59.928%

Meanwhile , in more evidence that Comey was canned in haste
for All Act because it will end
. It 's quite a small one a baby oct
, P uro hit requested the Prime Minister to write in
Pelosi will be the president . Moreover , Letter

INTERVAL -4.562 - -0.338
CONTAINS 39.003%

WE P was ratified as a Wi - Fi
Car ros , north of Nice on the mainland , locals
comparison without being nearly critical enough . The Washington Post reported
on social media as well , such as when he retweet
The Doors "? Yeah , dude , Floyd and the Lizard

INTERVAL -8.785 - -4.562
CONTAINS 0.311%

time , The oppress or 's wrong , the
States . It was originally passed in 1970 ,[ 2 ]
talks and other messages via Mi iverse . There
and efforts to tighten gun control . His family foundation has
Recently the European union has banned the use of the three
Token Irish
Token ID8685
Feature activation-3.91e-02

Pos logit contributions
FontSize+0.776
Term+0.716
 EPS+0.696
arsity+0.673
\/+0.672
Neg logit contributions
undo-0.861
tarian-0.772
 wre-0.695
away-0.691
alogy-0.680
Token woman
Token ID2415
Feature activation-2.11e+00

Pos logit contributions
undo+0.034
tarian+0.031
 wre+0.028
away+0.028
 flung+0.027
Neg logit contributions
FontSize-0.029
Term-0.026
 EPS-0.025
arsity-0.024
\/-0.024
Token by
Token ID416
Feature activation+5.11e-01

Pos logit contributions
undo+1.789
tarian+1.606
 wre+1.476
away+1.457
 flung+1.441
Neg logit contributions
FontSize-1.584
Term-1.460
 EPS-1.407
arsity-1.366
\/-1.350
Token a
Token ID257
Feature activation+3.24e-01

Pos logit contributions
FontSize+0.401
Term+0.371
 EPS+0.361
arsity+0.350
\/+0.350
Neg logit contributions
undo-0.416
tarian-0.371
 wre-0.333
away-0.331
alogy-0.325
Token grand
Token ID4490
Feature activation+8.73e-01

Pos logit contributions
FontSize+0.246
Term+0.227
 EPS+0.220
arsity+0.214
\/+0.213
Neg logit contributions
undo-0.272
tarian-0.244
 wre-0.220
away-0.219
 flung-0.215
Token Jury
Token ID48705
Feature activation+8.11e+00

Pos logit contributions
FontSize+0.540
Term+0.490
 EPS+0.475
arsity+0.454
\/+0.453
Neg logit contributions
undo-0.856
tarian-0.779
 wre-0.711
away-0.709
alogy-0.700
Token last
Token ID938
Feature activation-3.20e+00

Pos logit contributions
FontSize+5.203
Term+4.789
 EPS+4.734
\/+4.568
arsity+4.468
Neg logit contributions
undo-7.163
tarian-6.705
alogy-5.938
 wre-5.852
away-5.800
Token year
Token ID614
Feature activation-2.09e+00

Pos logit contributions
undo+2.752
tarian+2.486
away+2.267
 wre+2.264
 flung+2.232
Neg logit contributions
FontSize-2.349
Term-2.109
 EPS-2.057
arsity-2.004
\/-1.972
Token.
Token ID13
Feature activation-5.97e-01

Pos logit contributions
undo+1.871
tarian+1.669
 wre+1.544
away+1.540
 flung+1.512
Neg logit contributions
FontSize-1.477
Term-1.349
 EPS-1.305
arsity-1.261
\/-1.238
Token
Token ID198
Feature activation-4.25e-01

Pos logit contributions
undo+0.469
tarian+0.416
 wre+0.371
away+0.370
alogy+0.364
Neg logit contributions
FontSize-0.486
Term-0.449
 EPS-0.438
arsity-0.425
\/-0.424
Token
Token ID198
Feature activation-5.47e-01

Pos logit contributions
undo+0.319
tarian+0.279
away+0.253
 wre+0.248
alogy+0.246
Neg logit contributions
FontSize-0.360
Term-0.332
 EPS-0.326
arsity-0.318
\/-0.312
Token was
Token ID373
Feature activation+1.48e+00

Pos logit contributions
FontSize+1.850
Term+1.721
 EPS+1.684
\/+1.623
arsity+1.619
Neg logit contributions
undo-2.052
tarian-1.836
 wre-1.633
away-1.629
alogy-1.609
Token designed
Token ID3562
Feature activation+1.79e-02

Pos logit contributions
FontSize+1.159
Term+1.077
 EPS+1.052
\/+1.016
arsity+1.016
Neg logit contributions
undo-1.215
tarian-1.086
away-0.962
 wre-0.962
alogy-0.950
Token as
Token ID355
Feature activation+1.21e+00

Pos logit contributions
FontSize+0.013
Term+0.012
 EPS+0.012
arsity+0.011
\/+0.011
Neg logit contributions
undo-0.015
tarian-0.014
 wre-0.012
away-0.012
alogy-0.012
Token shown
Token ID3402
Feature activation+2.96e+00

Pos logit contributions
FontSize+0.970
Term+0.901
 EPS+0.880
arsity+0.849
\/+0.848
Neg logit contributions
undo-0.972
tarian-0.866
 wre-0.771
away-0.767
alogy-0.756
Token in
Token ID287
Feature activation-1.05e+00

Pos logit contributions
FontSize+1.867
Term+1.710
 EPS+1.683
\/+1.602
arsity+1.585
Neg logit contributions
undo-2.824
tarian-2.572
 wre-2.323
away-2.305
alogy-2.293
Token Figure
Token ID11291
Feature activation+7.66e+00

Pos logit contributions
undo+1.028
tarian+0.935
 wre+0.859
away+0.858
alogy+0.844
Neg logit contributions
FontSize-0.649
Term-0.580
 EPS-0.556
arsity-0.544
\/-0.540
Token 1
Token ID352
Feature activation+7.60e-01

Pos logit contributions
FontSize+4.056
 EPS+3.827
Term+3.770
\/+3.469
arsity+3.374
Neg logit contributions
undo-7.791
tarian-7.149
 wre-6.472
away-6.415
 flung-6.401
Token.
Token ID13
Feature activation+1.67e+00

Pos logit contributions
FontSize+0.555
Term+0.510
 EPS+0.497
arsity+0.479
\/+0.478
Neg logit contributions
undo-0.662
tarian-0.596
 wre-0.537
away-0.535
alogy-0.527
Token The
Token ID383
Feature activation+1.11e+00

Pos logit contributions
FontSize+1.333
Term+1.242
 EPS+1.214
arsity+1.173
\/+1.169
Neg logit contributions
undo-1.342
tarian-1.196
 wre-1.067
away-1.057
alogy-1.042
Token maze
Token ID31237
Feature activation+2.28e+00

Pos logit contributions
FontSize+0.853
Term+0.791
 EPS+0.773
arsity+0.744
\/+0.742
Neg logit contributions
undo-0.914
tarian-0.818
 wre-0.730
away-0.728
alogy-0.718
Token was
Token ID373
Feature activation+2.05e+00

Pos logit contributions
FontSize+1.678
Term+1.556
 EPS+1.522
arsity+1.462
\/+1.457
Neg logit contributions
undo-1.965
tarian-1.764
away-1.572
 wre-1.571
alogy-1.553
Token slide
Token ID10649
Feature activation+7.23e+00

Pos logit contributions
FontSize+3.811
Term+3.446
 EPS+3.349
\/+3.194
arsity+3.184
Neg logit contributions
undo-6.174
tarian-5.642
 wre-5.153
away-5.131
alogy-5.069
Token
Token ID198
Feature activation+2.81e+00

Pos logit contributions
FontSize+5.234
Term+4.866
 EPS+4.832
\/+4.588
arsity+4.557
Neg logit contributions
undo-6.199
tarian-5.571
 wre-5.025
 flung-5.007
alogy-4.868
Token
Token ID198
Feature activation+2.30e+00

Pos logit contributions
FontSize+2.217
Term+2.060
 EPS+2.010
\/+1.947
arsity+1.938
Neg logit contributions
undo-2.283
tarian-2.042
 wre-1.823
away-1.802
 flung-1.783
TokenPower
Token ID13434
Feature activation+3.72e+00

Pos logit contributions
FontSize+1.520
Term+1.362
 EPS+1.336
arsity+1.303
\/+1.279
Neg logit contributions
undo-2.156
tarian-1.951
away-1.790
 wre-1.781
 flung-1.742
TokenPoint
Token ID12727
Feature activation+5.19e+00

Pos logit contributions
FontSize+2.758
Term+2.541
 EPS+2.478
arsity+2.376
\/+2.376
Neg logit contributions
undo-3.199
tarian-2.885
 wre-2.588
away-2.567
alogy-2.536
Token slide
Token ID10649
Feature activation+7.64e+00

Pos logit contributions
FontSize+3.293
Term+2.991
 EPS+2.878
arsity+2.794
\/+2.767
Neg logit contributions
undo-5.003
tarian-4.556
 wre-4.167
away-4.148
alogy-4.095
Token PNG
Token ID36182
Feature activation+6.01e+00

Pos logit contributions
FontSize+0.568
Term+0.297
 EPS+0.131
 PNG+0.000
\/-0.047
Neg logit contributions
undo-11.486
tarian-10.842
 wre-10.247
 flung-10.208
away-10.103
Token larger
Token ID4025
Feature activation+2.35e+00

Pos logit contributions
FontSize+4.228
Term+3.850
 EPS+3.702
arsity+3.647
\/+3.579
Neg logit contributions
undo-5.429
tarian-4.848
away-4.475
 wre-4.410
alogy-4.373
Token image
Token ID2939
Feature activation+6.91e+00

Pos logit contributions
FontSize+1.827
Term+1.691
 EPS+1.635
arsity+1.603
\/+1.590
Neg logit contributions
undo-1.931
tarian-1.727
 wre-1.553
away-1.543
alogy-1.518
Token
Token ID198
Feature activation+1.96e+00

Pos logit contributions
FontSize+5.135
Term+4.794
 EPS+4.749
\/+4.504
arsity+4.419
Neg logit contributions
undo-5.865
tarian-5.327
 wre-4.779
 flung-4.700
away-4.684
Token
Token ID198
Feature activation+1.68e+00

Pos logit contributions
FontSize+1.574
Term+1.459
 EPS+1.423
arsity+1.378
\/+1.376
Neg logit contributions
undo-1.564
tarian-1.394
 wre-1.243
away-1.237
alogy-1.217
Token must
Token ID1276
Feature activation+9.48e-01

Pos logit contributions
FontSize+0.616
Term+0.565
 EPS+0.550
\/+0.530
arsity+0.528
Neg logit contributions
undo-0.871
tarian-0.788
 wre-0.714
away-0.713
alogy-0.704
Token select
Token ID2922
Feature activation-3.83e-01

Pos logit contributions
FontSize+0.631
Term+0.570
 EPS+0.552
arsity+0.532
\/+0.527
Neg logit contributions
undo-0.890
tarian-0.810
 wre-0.738
away-0.734
alogy-0.722
Token a
Token ID257
Feature activation+1.29e+00

Pos logit contributions
undo+0.290
tarian+0.256
 wre+0.228
away+0.227
alogy+0.222
Neg logit contributions
FontSize-0.324
Term-0.301
 EPS-0.294
arsity-0.286
\/-0.284
Token newsletter
Token ID13129
Feature activation+4.80e+00

Pos logit contributions
FontSize+0.856
Term+0.781
 EPS+0.760
\/+0.727
arsity+0.727
Neg logit contributions
undo-1.210
tarian-1.100
 wre-0.999
away-0.995
alogy-0.983
Token to
Token ID284
Feature activation+1.21e+00

Pos logit contributions
FontSize+3.733
Term+3.452
 EPS+3.401
\/+3.332
arsity+3.249
Neg logit contributions
undo-3.884
tarian-3.487
 wre-3.093
away-3.083
 flung-3.058
Token subscribe
Token ID12383
Feature activation+7.55e+00

Pos logit contributions
FontSize+0.806
Term+0.735
 EPS+0.712
arsity+0.684
\/+0.682
Neg logit contributions
undo-1.128
tarian-1.022
 wre-0.931
away-0.927
alogy-0.914
Token to
Token ID284
Feature activation+2.10e+00

Pos logit contributions
FontSize+5.330
Term+4.983
 EPS+4.900
\/+4.849
arsity+4.569
Neg logit contributions
undo-6.519
tarian-5.911
 cumbers-5.280
 flung-5.265
 wre-5.254
Token.
Token ID13
Feature activation+3.11e+00

Pos logit contributions
FontSize+1.316
Term+1.190
 EPS+1.151
arsity+1.104
\/+1.098
Neg logit contributions
undo-2.047
tarian-1.863
 wre-1.703
away-1.699
alogy-1.676
Token Sign
Token ID5865
Feature activation+1.09e+00

Pos logit contributions
FontSize+2.377
Term+2.213
 EPS+2.159
\/+2.084
arsity+2.076
Neg logit contributions
undo-2.603
tarian-2.328
 wre-2.101
away-2.058
 flung-2.054
Token Up
Token ID3205
Feature activation+2.17e+00

Pos logit contributions
FontSize+0.855
Term+0.787
 EPS+0.767
arsity+0.742
\/+0.740
Neg logit contributions
undo-0.892
tarian-0.797
away-0.715
 wre-0.711
alogy-0.701
Token You
Token ID921
Feature activation+1.24e+00

Pos logit contributions
FontSize+1.598
Term+1.492
 EPS+1.460
\/+1.407
arsity+1.399
Neg logit contributions
undo-1.855
tarian-1.677
 wre-1.509
away-1.487
 flung-1.478
Token must
Token ID1276
Feature activation+7.70e-01

Pos logit contributions
FontSize+0.547
Term+0.502
 EPS+0.488
\/+0.472
arsity+0.469
Neg logit contributions
undo-0.778
tarian-0.705
 wre-0.638
away-0.637
alogy-0.630
Token select
Token ID2922
Feature activation-4.88e-01

Pos logit contributions
FontSize+0.518
Term+0.467
 EPS+0.452
arsity+0.437
\/+0.432
Neg logit contributions
undo-0.719
tarian-0.655
 wre-0.597
away-0.593
alogy-0.582
Token a
Token ID257
Feature activation+1.30e+00

Pos logit contributions
undo+0.367
tarian+0.324
 wre+0.287
away+0.287
alogy+0.279
Neg logit contributions
FontSize-0.415
Term-0.385
 EPS-0.377
arsity-0.367
\/-0.364
Token newsletter
Token ID13129
Feature activation+4.85e+00

Pos logit contributions
FontSize+0.856
Term+0.781
 EPS+0.760
\/+0.728
arsity+0.726
Neg logit contributions
undo-1.222
tarian-1.112
 wre-1.010
away-1.006
alogy-0.995
Token to
Token ID284
Feature activation+1.26e+00

Pos logit contributions
FontSize+3.754
Term+3.467
 EPS+3.416
\/+3.354
arsity+3.261
Neg logit contributions
undo-3.946
tarian-3.544
 wre-3.142
away-3.140
 flung-3.113
Token subscribe
Token ID12383
Feature activation+7.51e+00

Pos logit contributions
FontSize+0.842
Term+0.768
 EPS+0.744
arsity+0.715
\/+0.712
Neg logit contributions
undo-1.174
tarian-1.063
 wre-0.968
away-0.964
alogy-0.950
Token to
Token ID284
Feature activation+2.12e+00

Pos logit contributions
FontSize+5.311
Term+4.961
 EPS+4.901
\/+4.832
arsity+4.557
Neg logit contributions
undo-6.484
tarian-5.851
 flung-5.220
 cumbers-5.204
 wre-5.203
Token.
Token ID13
Feature activation+3.11e+00

Pos logit contributions
FontSize+1.329
Term+1.202
 EPS+1.163
arsity+1.115
\/+1.109
Neg logit contributions
undo-2.071
tarian-1.885
 wre-1.723
away-1.719
alogy-1.695
Token Sign
Token ID5865
Feature activation+1.27e+00

Pos logit contributions
FontSize+2.348
Term+2.193
 EPS+2.137
\/+2.061
arsity+2.053
Neg logit contributions
undo-2.627
tarian-2.356
 wre-2.131
 flung-2.087
away-2.076
Token Up
Token ID3205
Feature activation+2.42e+00

Pos logit contributions
FontSize+0.984
Term+0.908
 EPS+0.885
arsity+0.856
\/+0.854
Neg logit contributions
undo-1.049
tarian-0.938
 wre-0.840
away-0.839
alogy-0.824
Token You
Token ID921
Feature activation+1.15e+00

Pos logit contributions
FontSize+1.760
Term+1.648
 EPS+1.612
\/+1.554
arsity+1.542
Neg logit contributions
undo-2.098
tarian-1.902
 wre-1.713
away-1.683
 flung-1.680
Token.
Token ID13
Feature activation+4.68e+00

Pos logit contributions
undo+0.420
tarian+0.379
away+0.351
 wre+0.344
alogy+0.343
Neg logit contributions
FontSize-0.308
Term-0.279
 EPS-0.273
arsity-0.263
\/-0.254
TokenS
Token ID50
Feature activation+4.24e-01

Pos logit contributions
FontSize+2.349
Term+2.090
 EPS+2.001
\/+1.902
arsity+1.883
Neg logit contributions
undo-5.138
tarian-4.738
 wre-4.364
away-4.354
 flung-4.300
Token.
Token ID13
Feature activation+2.85e+00

Pos logit contributions
FontSize+0.284
Term+0.259
 EPS+0.251
arsity+0.241
\/+0.240
Neg logit contributions
undo-0.395
tarian-0.358
 wre-0.326
away-0.324
alogy-0.320
Token Army
Token ID5407
Feature activation+5.61e+00

Pos logit contributions
FontSize+2.009
Term+1.854
 EPS+1.808
\/+1.749
arsity+1.727
Neg logit contributions
undo-2.533
tarian-2.283
away-2.055
 wre-2.050
alogy-2.031
Token active
Token ID4075
Feature activation+3.55e-01

Pos logit contributions
FontSize+3.887
Term+3.529
 EPS+3.496
\/+3.400
arsity+3.332
Neg logit contributions
undo-4.975
tarian-4.511
away-4.025
 wre-4.022
alogy-3.978
Token duty
Token ID7077
Feature activation+7.50e+00

Pos logit contributions
FontSize+0.254
Term+0.233
 EPS+0.227
arsity+0.219
\/+0.219
Neg logit contributions
undo-0.315
tarian-0.284
 wre-0.256
away-0.255
alogy-0.251
Token roster
Token ID9354
Feature activation+2.17e+00

Pos logit contributions
FontSize+5.563
 EPS+5.106
Term+5.078
\/+4.888
arsity+4.829
Neg logit contributions
undo-6.221
tarian-5.538
alogy-4.921
away-4.900
 wre-4.786
Token to
Token ID284
Feature activation+1.53e-01

Pos logit contributions
FontSize+1.593
Term+1.459
 EPS+1.428
\/+1.382
arsity+1.372
Neg logit contributions
undo-1.867
tarian-1.689
 wre-1.509
away-1.501
alogy-1.484
Token 540
Token ID38190
Feature activation+3.87e+00

Pos logit contributions
FontSize+0.117
Term+0.108
 EPS+0.105
arsity+0.101
\/+0.101
Neg logit contributions
undo-0.129
tarian-0.115
 wre-0.104
away-0.103
alogy-0.101
Token,
Token ID11
Feature activation+1.19e+00

Pos logit contributions
FontSize+2.694
Term+2.484
 EPS+2.415
\/+2.329
arsity+2.326
Neg logit contributions
undo-3.467
tarian-3.139
 wre-2.824
away-2.823
alogy-2.791
Token000
Token ID830
Feature activation+6.66e-01

Pos logit contributions
FontSize+0.719
Term+0.648
 EPS+0.627
arsity+0.599
\/+0.598
Neg logit contributions
undo-1.179
tarian-1.078
 wre-0.984
away-0.981
alogy-0.970
Token must
Token ID1276
Feature activation+7.15e-01

Pos logit contributions
FontSize+0.537
Term+0.492
 EPS+0.478
\/+0.461
arsity+0.459
Neg logit contributions
undo-0.750
tarian-0.679
 wre-0.615
away-0.614
alogy-0.606
Token select
Token ID2922
Feature activation-3.64e-01

Pos logit contributions
FontSize+0.487
Term+0.438
 EPS+0.424
arsity+0.410
\/+0.404
Neg logit contributions
undo-0.662
tarian-0.604
 wre-0.550
away-0.546
alogy-0.536
Token a
Token ID257
Feature activation+1.14e+00

Pos logit contributions
undo+0.276
tarian+0.244
 wre+0.217
away+0.216
alogy+0.211
Neg logit contributions
FontSize-0.307
Term-0.285
 EPS-0.278
arsity-0.271
\/-0.269
Token newsletter
Token ID13129
Feature activation+4.61e+00

Pos logit contributions
FontSize+0.748
Term+0.683
 EPS+0.665
\/+0.636
arsity+0.634
Neg logit contributions
undo-1.069
tarian-0.972
 wre-0.883
away-0.879
 flung-0.870
Token to
Token ID284
Feature activation+1.23e+00

Pos logit contributions
FontSize+3.592
Term+3.322
 EPS+3.266
\/+3.210
arsity+3.126
Neg logit contributions
undo-3.731
tarian-3.353
 wre-2.969
away-2.962
 flung-2.939
Token subscribe
Token ID12383
Feature activation+7.49e+00

Pos logit contributions
FontSize+0.820
Term+0.747
 EPS+0.724
arsity+0.696
\/+0.694
Neg logit contributions
undo-1.154
tarian-1.047
 wre-0.952
away-0.949
alogy-0.936
Token to
Token ID284
Feature activation+2.00e+00

Pos logit contributions
FontSize+5.298
Term+4.948
 EPS+4.874
\/+4.820
arsity+4.534
Neg logit contributions
undo-6.467
tarian-5.839
 cumbers-5.228
 flung-5.226
 wre-5.211
Token.
Token ID13
Feature activation+3.19e+00

Pos logit contributions
FontSize+1.262
Term+1.141
 EPS+1.103
arsity+1.058
\/+1.051
Neg logit contributions
undo-1.948
tarian-1.773
 wre-1.620
away-1.618
alogy-1.595
Token Sign
Token ID5865
Feature activation+8.60e-01

Pos logit contributions
FontSize+2.420
Term+2.257
 EPS+2.198
\/+2.124
arsity+2.113
Neg logit contributions
undo-2.686
tarian-2.407
 wre-2.174
 flung-2.127
away-2.123
Token Up
Token ID3205
Feature activation+1.96e+00

Pos logit contributions
FontSize+0.677
Term+0.622
 EPS+0.607
arsity+0.587
\/+0.586
Neg logit contributions
undo-0.701
tarian-0.626
away-0.562
 wre-0.559
alogy-0.551
Token You
Token ID921
Feature activation+1.08e+00

Pos logit contributions
FontSize+1.447
Term+1.351
 EPS+1.320
\/+1.273
arsity+1.266
Neg logit contributions
undo-1.675
tarian-1.515
 wre-1.362
away-1.342
 flung-1.335
Tokens
Token ID82
Feature activation-4.34e-01

Pos logit contributions
undo+0.063
tarian+0.057
 wre+0.052
away+0.052
alogy+0.051
Neg logit contributions
FontSize-0.049
Term-0.045
 EPS-0.043
arsity-0.042
\/-0.042
Token urgent
Token ID18039
Feature activation+1.35e+00

Pos logit contributions
undo+0.364
tarian+0.327
 wre+0.297
away+0.295
 flung+0.290
Neg logit contributions
FontSize-0.330
Term-0.303
 EPS-0.291
arsity-0.285
\/-0.283
Token needs
Token ID2476
Feature activation+2.91e+00

Pos logit contributions
FontSize+1.029
Term+0.948
 EPS+0.921
arsity+0.892
\/+0.890
Neg logit contributions
undo-1.130
tarian-1.013
 wre-0.912
away-0.907
alogy-0.891
Token,
Token ID11
Feature activation+5.05e-01

Pos logit contributions
FontSize+1.769
Term+1.603
 EPS+1.572
\/+1.519
arsity+1.501
Neg logit contributions
undo-2.857
tarian-2.612
away-2.361
 wre-2.361
alogy-2.337
Token says
Token ID1139
Feature activation+2.57e+00

Pos logit contributions
FontSize+0.391
Term+0.362
 EPS+0.353
arsity+0.342
\/+0.341
Neg logit contributions
undo-0.416
tarian-0.372
 wre-0.333
away-0.332
alogy-0.327
Token Capt
Token ID6790
Feature activation+7.48e+00

Pos logit contributions
FontSize+1.846
Term+1.708
 EPS+1.691
arsity+1.609
\/+1.606
Neg logit contributions
undo-2.240
tarian-2.021
 wre-1.814
away-1.803
alogy-1.781
Token.
Token ID13
Feature activation+5.15e+00

Pos logit contributions
FontSize+3.748
Term+3.573
 EPS+3.437
arsity+3.370
\/+3.321
Neg logit contributions
undo-7.504
tarian-7.030
 wre-6.501
 flung-6.476
away-6.381
Token Nate
Token ID23486
Feature activation+4.10e+00

Pos logit contributions
FontSize+3.683
Term+3.469
 EPS+3.442
\/+3.277
arsity+3.273
Neg logit contributions
undo-4.377
tarian-4.000
 wre-3.569
away-3.501
 flung-3.498
Token Ter
Token ID3813
Feature activation-1.17e+00

Pos logit contributions
FontSize+2.952
Term+2.751
 EPS+2.713
\/+2.598
arsity+2.596
Neg logit contributions
undo-3.520
tarian-3.208
 wre-2.868
away-2.817
 flung-2.813
Tokenning
Token ID768
Feature activation+1.43e+00

Pos logit contributions
undo+0.923
tarian+0.819
away+0.729
 wre+0.726
alogy+0.712
Neg logit contributions
FontSize-0.959
Term-0.887
 EPS-0.868
arsity-0.837
\/-0.835
Token,
Token ID11
Feature activation+2.31e+00

Pos logit contributions
FontSize+1.000
Term+0.923
 EPS+0.900
\/+0.871
arsity+0.868
Neg logit contributions
undo-1.270
tarian-1.148
 wre-1.036
away-1.027
 flung-1.018
Token must
Token ID1276
Feature activation+1.02e+00

Pos logit contributions
FontSize+0.592
Term+0.543
 EPS+0.528
\/+0.510
arsity+0.507
Neg logit contributions
undo-0.838
tarian-0.759
 wre-0.687
away-0.685
alogy-0.678
Token select
Token ID2922
Feature activation-3.99e-01

Pos logit contributions
FontSize+0.676
Term+0.611
 EPS+0.592
arsity+0.570
\/+0.565
Neg logit contributions
undo-0.955
tarian-0.869
 wre-0.792
away-0.788
alogy-0.775
Token a
Token ID257
Feature activation+1.18e+00

Pos logit contributions
undo+0.300
tarian+0.265
 wre+0.235
away+0.235
alogy+0.229
Neg logit contributions
FontSize-0.339
Term-0.314
 EPS-0.307
arsity-0.299
\/-0.297
Token newsletter
Token ID13129
Feature activation+4.86e+00

Pos logit contributions
FontSize+0.783
Term+0.714
 EPS+0.694
\/+0.665
arsity+0.665
Neg logit contributions
undo-1.099
tarian-0.997
 wre-0.906
away-0.903
alogy-0.891
Token to
Token ID284
Feature activation+1.24e+00

Pos logit contributions
FontSize+3.771
Term+3.485
 EPS+3.433
\/+3.368
arsity+3.277
Neg logit contributions
undo-3.952
tarian-3.549
 wre-3.146
away-3.141
 flung-3.113
Token subscribe
Token ID12383
Feature activation+7.47e+00

Pos logit contributions
FontSize+0.825
Term+0.752
 EPS+0.729
arsity+0.701
\/+0.698
Neg logit contributions
undo-1.156
tarian-1.048
 wre-0.954
away-0.950
alogy-0.937
Token to
Token ID284
Feature activation+2.13e+00

Pos logit contributions
FontSize+5.280
Term+4.934
 EPS+4.870
\/+4.805
arsity+4.528
Neg logit contributions
undo-6.452
tarian-5.827
 flung-5.203
 wre-5.186
 cumbers-5.171
Token.
Token ID13
Feature activation+3.24e+00

Pos logit contributions
FontSize+1.337
Term+1.209
 EPS+1.169
arsity+1.121
\/+1.115
Neg logit contributions
undo-2.078
tarian-1.891
 wre-1.728
away-1.725
alogy-1.701
Token Sign
Token ID5865
Feature activation+1.15e+00

Pos logit contributions
FontSize+2.457
Term+2.293
 EPS+2.235
\/+2.156
arsity+2.148
Neg logit contributions
undo-2.736
tarian-2.452
 wre-2.216
 flung-2.168
away-2.161
Token Up
Token ID3205
Feature activation+2.35e+00

Pos logit contributions
FontSize+0.894
Term+0.824
 EPS+0.804
arsity+0.777
\/+0.776
Neg logit contributions
undo-0.944
tarian-0.844
away-0.756
 wre-0.755
alogy-0.742
Token You
Token ID921
Feature activation+1.20e+00

Pos logit contributions
FontSize+1.724
Term+1.612
 EPS+1.575
\/+1.520
arsity+1.510
Neg logit contributions
undo-2.027
tarian-1.835
 wre-1.651
away-1.625
 flung-1.619
Token others
Token ID1854
Feature activation-1.46e+00

Pos logit contributions
undo+0.382
tarian+0.343
 wre+0.310
away+0.309
alogy+0.304
Neg logit contributions
FontSize-0.328
Term-0.301
 EPS-0.292
arsity-0.283
\/-0.283
Token
Token ID198
Feature activation+2.27e-01

Pos logit contributions
undo+1.176
tarian+1.048
 wre+0.936
away+0.932
alogy+0.917
Neg logit contributions
FontSize-1.168
Term-1.082
 EPS-1.055
arsity-1.021
\/-1.020
Token
Token ID198
Feature activation-1.72e-01

Pos logit contributions
FontSize+0.187
Term+0.173
 EPS+0.169
arsity+0.165
\/+0.163
Neg logit contributions
undo-0.176
tarian-0.155
away-0.139
 wre-0.138
alogy-0.136
TokenAnyone
Token ID21129
Feature activation+1.62e+00

Pos logit contributions
undo+0.139
tarian+0.124
away+0.111
 wre+0.110
alogy+0.109
Neg logit contributions
FontSize-0.135
Term-0.125
 EPS-0.123
arsity-0.119
\/-0.118
Token who
Token ID508
Feature activation+1.09e+00

Pos logit contributions
FontSize+1.284
Term+1.195
 EPS+1.170
\/+1.134
arsity+1.122
Neg logit contributions
undo-1.293
tarian-1.152
 wre-1.011
away-1.010
alogy-1.007
Token follows
Token ID5679
Feature activation+7.45e+00

Pos logit contributions
FontSize+0.884
Term+0.820
 EPS+0.806
\/+0.783
arsity+0.778
Neg logit contributions
undo-0.856
tarian-0.759
away-0.666
 wre-0.665
alogy-0.663
Token food
Token ID2057
Feature activation-2.12e+00

Pos logit contributions
FontSize+5.176
 EPS+4.863
Term+4.829
\/+4.664
 Coff+4.558
Neg logit contributions
undo-6.258
tarian-5.590
away-4.927
 wre-4.912
alogy-4.894
Token in
Token ID287
Feature activation-5.81e-01

Pos logit contributions
undo+1.810
tarian+1.624
 wre+1.471
away+1.456
alogy+1.441
Neg logit contributions
FontSize-1.591
Term-1.464
 EPS-1.429
arsity-1.374
\/-1.371
Token Houston
Token ID6995
Feature activation-3.40e+00

Pos logit contributions
undo+0.432
tarian+0.381
 wre+0.338
away+0.336
alogy+0.329
Neg logit contributions
FontSize-0.498
Term-0.463
 EPS-0.452
arsity-0.440
\/-0.439
Token probably
Token ID2192
Feature activation-4.98e-01

Pos logit contributions
undo+2.559
tarian+2.293
 wre+2.052
away+2.041
alogy+2.000
Neg logit contributions
FontSize-2.850
Term-2.625
 EPS-2.561
arsity-2.503
\/-2.452
Token sc
Token ID629
Feature activation-2.10e+00

Pos logit contributions
undo+0.380
tarian+0.337
 wre+0.299
away+0.297
alogy+0.292
Neg logit contributions
FontSize-0.418
Term-0.388
 EPS-0.379
arsity-0.368
\/-0.367
Token
Token ID447
Feature activation-2.31e-01

Pos logit contributions
FontSize+0.570
Term+0.528
 EPS+0.518
\/+0.500
arsity+0.498
Neg logit contributions
undo-0.589
tarian-0.526
 wre-0.469
away-0.468
alogy-0.461
Token
Token ID247
Feature activation+2.66e-01

Pos logit contributions
undo+0.208
tarian+0.188
 wre+0.171
away+0.170
alogy+0.167
Neg logit contributions
FontSize-0.162
Term-0.148
 EPS-0.144
\/-0.139
arsity-0.139
Tokend
Token ID67
Feature activation+6.29e-01

Pos logit contributions
FontSize+0.196
Term+0.181
 EPS+0.176
\/+0.171
arsity+0.169
Neg logit contributions
undo-0.229
tarian-0.206
 wre-0.185
away-0.184
alogy-0.182
Token like
Token ID588
Feature activation-2.12e+00

Pos logit contributions
FontSize+0.491
Term+0.457
 EPS+0.444
\/+0.431
arsity+0.430
Neg logit contributions
undo-0.513
tarian-0.459
 wre-0.409
away-0.407
alogy-0.401
Token to
Token ID284
Feature activation+3.28e-01

Pos logit contributions
undo+1.755
tarian+1.572
 wre+1.423
away+1.411
alogy+1.382
Neg logit contributions
FontSize-1.633
Term-1.507
 EPS-1.461
arsity-1.420
\/-1.396
Token follow
Token ID1061
Feature activation+7.42e+00

Pos logit contributions
FontSize+0.246
Term+0.227
 EPS+0.222
\/+0.215
arsity+0.214
Neg logit contributions
undo-0.277
tarian-0.249
 wre-0.223
away-0.223
alogy-0.220
Token them
Token ID606
Feature activation+5.73e-01

Pos logit contributions
FontSize+5.144
 EPS+4.770
Term+4.763
\/+4.753
arsity+4.517
Neg logit contributions
undo-6.337
tarian-5.727
alogy-4.990
away-4.990
 wre-4.940
Token.
Token ID13
Feature activation-8.34e-01

Pos logit contributions
FontSize+0.387
Term+0.354
 EPS+0.344
\/+0.333
arsity+0.330
Neg logit contributions
undo-0.528
tarian-0.479
 wre-0.433
away-0.432
alogy-0.426
Token And
Token ID843
Feature activation-1.20e+00

Pos logit contributions
undo+0.680
tarian+0.605
 wre+0.543
away+0.542
alogy+0.532
Neg logit contributions
FontSize-0.656
Term-0.605
 EPS-0.590
arsity-0.572
\/-0.568
Token let
Token ID1309
Feature activation+1.73e+00

Pos logit contributions
undo+1.033
tarian+0.918
 wre+0.840
away+0.834
 flung+0.816
Neg logit contributions
FontSize-0.894
Term-0.819
 EPS-0.791
arsity-0.773
\/-0.758
Token us
Token ID514
Feature activation-1.53e+00

Pos logit contributions
FontSize+1.327
Term+1.229
 EPS+1.203
\/+1.168
arsity+1.153
Neg logit contributions
undo-1.419
tarian-1.282
 wre-1.138
away-1.132
 flung-1.119
TokenIFF
Token ID29267
Feature activation+6.72e-01

Pos logit contributions
FontSize+0.950
Term+0.861
arsity+0.794
Merit+0.783
\/+0.779
Neg logit contributions
undo-1.384
tarian-1.239
 wre-1.143
away-1.139
 que-1.104
Token original
Token ID2656
Feature activation+1.86e+00

Pos logit contributions
FontSize+0.393
Term+0.350
arsity+0.326
 EPS+0.318
\/+0.313
Neg logit contributions
undo-0.676
tarian-0.618
 wre-0.577
away-0.575
 flung-0.568
Token image
Token ID2939
Feature activation+4.67e+00

Pos logit contributions
FontSize+1.486
Term+1.377
arsity+1.311
 EPS+1.310
\/+1.283
Neg logit contributions
undo-1.497
tarian-1.324
away-1.193
 wre-1.189
 flung-1.161
Token Download
Token ID10472
Feature activation+3.34e+00

Pos logit contributions
FontSize+3.132
Term+2.867
 EPS+2.807
\/+2.681
arsity+2.666
Neg logit contributions
undo-4.320
tarian-3.938
 wre-3.573
away-3.541
 flung-3.501
Token:
Token ID25
Feature activation+1.52e+00

Pos logit contributions
FontSize+2.177
Term+1.979
 EPS+1.917
arsity+1.842
\/+1.836
Neg logit contributions
undo-3.169
tarian-2.878
 wre-2.622
away-2.614
alogy-2.577
Token Figure
Token ID11291
Feature activation+7.32e+00

Pos logit contributions
FontSize+0.711
Term+0.619
 EPS+0.566
arsity+0.555
\/+0.548
Neg logit contributions
undo-1.710
tarian-1.589
away-1.478
 wre-1.475
alogy-1.454
Token 1
Token ID352
Feature activation+9.46e-01

Pos logit contributions
FontSize+3.731
 EPS+3.473
Term+3.423
\/+3.147
arsity+3.054
Neg logit contributions
undo-7.649
tarian-7.029
 wre-6.425
away-6.422
 flung-6.369
Token.
Token ID13
Feature activation+2.66e+00

Pos logit contributions
FontSize+0.585
Term+0.531
 EPS+0.514
arsity+0.491
\/+0.491
Neg logit contributions
undo-0.929
tarian-0.846
 wre-0.773
away-0.769
alogy-0.760
Token Maze
Token ID33412
Feature activation+4.13e+00

Pos logit contributions
FontSize+1.909
Term+1.794
 EPS+1.756
\/+1.671
arsity+1.670
Neg logit contributions
undo-2.339
tarian-2.100
 wre-1.893
 flung-1.864
away-1.854
Token and
Token ID290
Feature activation+1.53e+00

Pos logit contributions
FontSize+3.146
Term+2.948
 EPS+2.890
\/+2.769
arsity+2.758
Neg logit contributions
undo-3.399
tarian-3.043
 wre-2.673
away-2.669
 flung-2.657
Token task
Token ID4876
Feature activation+3.99e+00

Pos logit contributions
FontSize+1.247
Term+1.164
 EPS+1.143
arsity+1.098
\/+1.096
Neg logit contributions
undo-1.194
tarian-1.063
 wre-0.937
away-0.932
alogy-0.920
Token media
Token ID2056
Feature activation+8.79e-01

Pos logit contributions
undo+0.177
tarian+0.157
away+0.139
 wre+0.139
alogy+0.138
Neg logit contributions
FontSize-0.176
Term-0.164
 EPS-0.160
\/-0.155
arsity-0.155
Token lying
Token ID9105
Feature activation+9.97e-01

Pos logit contributions
FontSize+0.688
Term+0.636
 EPS+0.624
\/+0.609
arsity+0.601
Neg logit contributions
undo-0.718
tarian-0.637
away-0.569
 wre-0.567
alogy-0.562
Token
Token ID447
Feature activation-1.26e+00

Pos logit contributions
FontSize+0.721
Term+0.666
 EPS+0.650
\/+0.632
arsity+0.623
Neg logit contributions
undo-0.868
tarian-0.781
away-0.701
 wre-0.700
alogy-0.693
Token
Token ID251
Feature activation-5.04e-01

Pos logit contributions
undo+1.207
tarian+1.097
 wre+1.000
away+0.996
alogy+0.983
Neg logit contributions
FontSize-0.816
Term-0.742
 EPS-0.719
arsity-0.690
\/-0.688
Token to
Token ID284
Feature activation-1.12e-01

Pos logit contributions
undo+0.426
tarian+0.382
 wre+0.343
away+0.342
alogy+0.337
Neg logit contributions
FontSize-0.379
Term-0.350
 EPS-0.341
\/-0.329
arsity-0.329
Token follow
Token ID1061
Feature activation+7.24e+00

Pos logit contributions
undo+0.092
tarian+0.083
away+0.074
 wre+0.074
alogy+0.073
Neg logit contributions
FontSize-0.086
Term-0.080
 EPS-0.078
arsity-0.075
\/-0.075
Token the
Token ID262
Feature activation-8.08e-02

Pos logit contributions
FontSize+5.211
 EPS+4.834
Term+4.790
\/+4.748
 Coff+4.558
Neg logit contributions
undo-5.996
tarian-5.384
alogy-4.761
 wre-4.738
 flung-4.720
Token hashtag
Token ID23950
Feature activation+1.10e+00

Pos logit contributions
undo+0.067
tarian+0.060
 wre+0.054
away+0.054
alogy+0.053
Neg logit contributions
FontSize-0.062
Term-0.057
 EPS-0.056
arsity-0.054
\/-0.054
Token #
Token ID1303
Feature activation-1.20e+00

Pos logit contributions
FontSize+0.810
Term+0.747
 EPS+0.729
\/+0.703
arsity+0.700
Neg logit contributions
undo-0.947
tarian-0.855
 wre-0.765
away-0.765
alogy-0.753
Tokenfemin
Token ID33594
Feature activation-2.17e+00

Pos logit contributions
undo+0.995
tarian+0.888
 wre+0.795
away+0.795
alogy+0.780
Neg logit contributions
FontSize-0.927
Term-0.855
 EPS-0.836
arsity-0.808
\/-0.805
Tokenin
Token ID259
Feature activation-3.84e-01

Pos logit contributions
undo+1.975
tarian+1.787
away+1.616
 wre+1.610
alogy+1.591
Neg logit contributions
FontSize-1.512
Term-1.374
 EPS-1.340
\/-1.278
arsity-1.277
Token room
Token ID2119
Feature activation+5.50e-01

Pos logit contributions
undo+0.692
tarian+0.604
 wre+0.533
away+0.529
alogy+0.516
Neg logit contributions
FontSize-0.902
Term-0.841
 EPS-0.819
arsity-0.800
\/-0.799
Token.
Token ID13
Feature activation+4.90e-01

Pos logit contributions
FontSize+0.372
Term+0.340
 EPS+0.330
arsity+0.317
\/+0.317
Neg logit contributions
undo-0.508
tarian-0.460
 wre-0.417
away-0.415
alogy-0.410
Token P
Token ID350
Feature activation+2.02e+00

Pos logit contributions
FontSize+0.396
Term+0.367
 EPS+0.357
arsity+0.346
\/+0.346
Neg logit contributions
undo-0.389
tarian-0.346
 wre-0.309
away-0.308
alogy-0.302
TokenPT
Token ID11571
Feature activation+4.46e-01

Pos logit contributions
FontSize+1.525
Term+1.407
 EPS+1.371
arsity+1.324
\/+1.322
Neg logit contributions
undo-1.702
tarian-1.526
 wre-1.372
away-1.365
alogy-1.344
Token PowerPoint
Token ID43563
Feature activation+6.25e+00

Pos logit contributions
FontSize+0.329
Term+0.304
 EPS+0.293
arsity+0.287
\/+0.285
Neg logit contributions
undo-0.384
tarian-0.342
 wre-0.311
away-0.309
 flung-0.305
Token slide
Token ID10649
Feature activation+7.23e+00

Pos logit contributions
FontSize+3.811
Term+3.446
 EPS+3.349
\/+3.194
arsity+3.184
Neg logit contributions
undo-6.174
tarian-5.642
 wre-5.153
away-5.131
alogy-5.069
Token
Token ID198
Feature activation+2.81e+00

Pos logit contributions
FontSize+5.234
Term+4.866
 EPS+4.832
\/+4.588
arsity+4.557
Neg logit contributions
undo-6.199
tarian-5.571
 wre-5.025
 flung-5.007
alogy-4.868
Token
Token ID198
Feature activation+2.30e+00

Pos logit contributions
FontSize+2.217
Term+2.060
 EPS+2.010
\/+1.947
arsity+1.938
Neg logit contributions
undo-2.283
tarian-2.042
 wre-1.823
away-1.802
 flung-1.783
TokenPower
Token ID13434
Feature activation+3.72e+00

Pos logit contributions
FontSize+1.520
Term+1.362
 EPS+1.336
arsity+1.303
\/+1.279
Neg logit contributions
undo-2.156
tarian-1.951
away-1.790
 wre-1.781
 flung-1.742
TokenPoint
Token ID12727
Feature activation+5.19e+00

Pos logit contributions
FontSize+2.758
Term+2.541
 EPS+2.478
arsity+2.376
\/+2.376
Neg logit contributions
undo-3.199
tarian-2.885
 wre-2.588
away-2.567
alogy-2.536
Token slide
Token ID10649
Feature activation+7.64e+00

Pos logit contributions
FontSize+3.293
Term+2.991
 EPS+2.878
arsity+2.794
\/+2.767
Neg logit contributions
undo-5.003
tarian-4.556
 wre-4.167
away-4.148
alogy-4.095
Token N
Token ID399
Feature activation+2.68e+00

Pos logit contributions
FontSize+0.326
Term+0.301
 EPS+0.293
arsity+0.283
\/+0.282
Neg logit contributions
undo-0.374
tarian-0.337
 wre-0.303
away-0.301
alogy-0.297
Tokenair
Token ID958
Feature activation+3.65e+00

Pos logit contributions
FontSize+1.945
Term+1.806
 EPS+1.754
\/+1.708
arsity+1.707
Neg logit contributions
undo-2.290
tarian-2.069
 wre-1.878
away-1.840
 flung-1.836
Token,
Token ID11
Feature activation+2.04e+00

Pos logit contributions
FontSize+2.374
Term+2.219
 EPS+2.170
\/+2.114
arsity+2.082
Neg logit contributions
undo-3.291
tarian-3.005
 wre-2.737
 flung-2.698
alogy-2.672
Token Ford
Token ID8092
Feature activation-1.18e+00

Pos logit contributions
FontSize+1.512
Term+1.407
 EPS+1.388
\/+1.325
arsity+1.319
Neg logit contributions
undo-1.739
tarian-1.566
away-1.397
 wre-1.395
alogy-1.379
Token group
Token ID1448
Feature activation-3.03e-01

Pos logit contributions
undo+0.982
tarian+0.873
 wre+0.791
away+0.787
alogy+0.772
Neg logit contributions
FontSize-0.921
Term-0.845
 EPS-0.814
arsity-0.798
\/-0.791
Token vice
Token ID7927
Feature activation+7.14e+00

Pos logit contributions
undo+0.279
tarian+0.252
 wre+0.231
away+0.230
alogy+0.225
Neg logit contributions
FontSize-0.209
Term-0.189
 EPS-0.181
arsity-0.177
\/-0.175
Token president
Token ID1893
Feature activation+2.74e+00

Pos logit contributions
FontSize+3.508
Term+3.183
 EPS+3.170
\/+2.991
arsity+2.952
Neg logit contributions
undo-7.627
tarian-7.045
alogy-6.383
 wre-6.338
 flung-6.323
Token,
Token ID11
Feature activation+2.43e+00

Pos logit contributions
FontSize+1.842
Term+1.688
 EPS+1.666
\/+1.600
arsity+1.591
Neg logit contributions
undo-2.512
tarian-2.278
 wre-2.058
away-2.049
 flung-2.027
Token Global
Token ID8060
Feature activation+1.90e+00

Pos logit contributions
FontSize+1.839
Term+1.709
 EPS+1.682
\/+1.613
arsity+1.610
Neg logit contributions
undo-2.026
tarian-1.822
 wre-1.619
away-1.617
alogy-1.595
Token Product
Token ID8721
Feature activation+4.42e+00

Pos logit contributions
FontSize+1.232
Term+1.119
 EPS+1.084
arsity+1.041
\/+1.039
Neg logit contributions
undo-1.806
tarian-1.641
 wre-1.496
away-1.490
alogy-1.470
Token Development
Token ID7712
Feature activation-4.10e-01

Pos logit contributions
FontSize+2.654
Term+2.458
 EPS+2.392
\/+2.302
arsity+2.300
Neg logit contributions
undo-4.322
tarian-3.934
 wre-3.577
away-3.568
 flung-3.550
Token to
Token ID284
Feature activation+5.80e-01

Pos logit contributions
FontSize+0.050
Term+0.046
 EPS+0.045
\/+0.043
arsity+0.043
Neg logit contributions
undo-0.057
tarian-0.051
 wre-0.046
away-0.046
alogy-0.045
Token help
Token ID1037
Feature activation+3.06e+00

Pos logit contributions
FontSize+0.447
Term+0.414
 EPS+0.402
arsity+0.389
\/+0.388
Neg logit contributions
undo-0.481
tarian-0.430
 wre-0.387
away-0.385
alogy-0.378
Token teach
Token ID4545
Feature activation+1.87e+00

Pos logit contributions
FontSize+2.231
Term+2.046
 EPS+2.014
\/+1.940
arsity+1.934
Neg logit contributions
undo-2.645
tarian-2.393
 wre-2.132
away-2.130
alogy-2.103
Token others
Token ID1854
Feature activation+1.54e-01

Pos logit contributions
FontSize+1.370
Term+1.260
 EPS+1.224
arsity+1.182
\/+1.178
Neg logit contributions
undo-1.632
tarian-1.466
 wre-1.326
away-1.321
alogy-1.301
Token to
Token ID284
Feature activation+1.35e+00

Pos logit contributions
FontSize+0.122
Term+0.113
 EPS+0.109
arsity+0.106
\/+0.106
Neg logit contributions
undo-0.126
tarian-0.112
 wre-0.101
away-0.100
alogy-0.098
Token follow
Token ID1061
Feature activation+7.08e+00

Pos logit contributions
FontSize+0.930
Term+0.851
 EPS+0.828
arsity+0.796
\/+0.795
Neg logit contributions
undo-1.223
tarian-1.106
 wre-1.002
away-0.998
alogy-0.984
Token in
Token ID287
Feature activation+6.02e-01

Pos logit contributions
FontSize+5.058
Term+4.634
 EPS+4.611
\/+4.458
arsity+4.395
Neg logit contributions
undo-6.095
tarian-5.562
 wre-4.897
away-4.851
 flung-4.841
Token the
Token ID262
Feature activation+8.31e-01

Pos logit contributions
FontSize+0.470
Term+0.435
 EPS+0.423
arsity+0.410
\/+0.409
Neg logit contributions
undo-0.494
tarian-0.441
 wre-0.396
away-0.394
alogy-0.388
Token same
Token ID976
Feature activation+1.92e+00

Pos logit contributions
FontSize+0.707
Term+0.658
 EPS+0.640
arsity+0.623
\/+0.622
Neg logit contributions
undo-0.622
tarian-0.548
 wre-0.489
away-0.486
alogy-0.477
Token footsteps
Token ID27146
Feature activation+1.10e-01

Pos logit contributions
FontSize+1.615
Term+1.503
 EPS+1.467
arsity+1.424
\/+1.421
Neg logit contributions
undo-1.459
tarian-1.291
 wre-1.145
away-1.140
alogy-1.119
Token.
Token ID13
Feature activation-3.67e-01

Pos logit contributions
FontSize+0.076
Term+0.070
 EPS+0.068
arsity+0.065
\/+0.065
Neg logit contributions
undo-0.100
tarian-0.090
 wre-0.083
away-0.082
alogy-0.081
Token the
Token ID262
Feature activation+7.91e-01

Pos logit contributions
FontSize+0.658
Term+0.607
 EPS+0.592
\/+0.572
arsity+0.570
Neg logit contributions
undo-0.728
tarian-0.654
 wre-0.584
away-0.581
alogy-0.574
Token way
Token ID835
Feature activation+1.77e+00

Pos logit contributions
FontSize+0.550
Term+0.504
 EPS+0.489
arsity+0.471
\/+0.470
Neg logit contributions
undo-0.716
tarian-0.647
 wre-0.586
away-0.584
alogy-0.576
Token and
Token ID290
Feature activation+2.32e-01

Pos logit contributions
FontSize+1.258
Term+1.154
 EPS+1.138
\/+1.094
arsity+1.081
Neg logit contributions
undo-1.571
tarian-1.424
 wre-1.273
away-1.261
alogy-1.255
Token the
Token ID262
Feature activation+8.34e-01

Pos logit contributions
FontSize+0.180
Term+0.166
 EPS+0.162
arsity+0.157
\/+0.156
Neg logit contributions
undo-0.191
tarian-0.171
 wre-0.153
away-0.153
alogy-0.150
Token government
Token ID1230
Feature activation+1.61e+00

Pos logit contributions
FontSize+0.626
Term+0.575
 EPS+0.558
arsity+0.542
\/+0.540
Neg logit contributions
undo-0.710
tarian-0.637
 wre-0.576
away-0.572
alogy-0.563
Token follows
Token ID5679
Feature activation+7.07e+00

Pos logit contributions
FontSize+1.276
Term+1.182
 EPS+1.159
\/+1.118
arsity+1.117
Neg logit contributions
undo-1.300
tarian-1.161
 wre-1.031
away-1.029
alogy-1.016
Token,
Token ID11
Feature activation+2.07e-01

Pos logit contributions
FontSize+5.139
 EPS+4.912
Term+4.783
\/+4.612
arsity+4.412
Neg logit contributions
undo-5.957
tarian-5.321
 wre-4.659
away-4.594
 flung-4.584
Token that
Token ID326
Feature activation+8.48e-01

Pos logit contributions
FontSize+0.156
Term+0.144
 EPS+0.141
\/+0.135
arsity+0.135
Neg logit contributions
undo-0.175
tarian-0.157
 wre-0.141
away-0.140
alogy-0.138
Token would
Token ID561
Feature activation-1.07e+00

Pos logit contributions
FontSize+0.648
Term+0.598
 EPS+0.587
\/+0.563
arsity+0.561
Neg logit contributions
undo-0.709
tarian-0.636
 wre-0.566
away-0.565
alogy-0.558
Token create
Token ID2251
Feature activation-7.20e-01

Pos logit contributions
undo+0.861
tarian+0.768
 wre+0.686
away+0.682
alogy+0.671
Neg logit contributions
FontSize-0.857
Term-0.794
 EPS-0.774
arsity-0.750
\/-0.748
Token the
Token ID262
Feature activation+1.26e-01

Pos logit contributions
undo+0.565
tarian+0.502
 wre+0.447
away+0.445
alogy+0.437
Neg logit contributions
FontSize-0.587
Term-0.545
 EPS-0.532
arsity-0.516
\/-0.515
Token and
Token ID290
Feature activation+8.26e-02

Pos logit contributions
undo+1.172
tarian+1.053
 wre+0.958
away+0.951
alogy+0.938
Neg logit contributions
FontSize-0.935
Term-0.853
 EPS-0.831
arsity-0.800
\/-0.792
Token believes
Token ID5804
Feature activation+3.88e+00

Pos logit contributions
FontSize+0.066
Term+0.061
 EPS+0.059
arsity+0.057
\/+0.057
Neg logit contributions
undo-0.067
tarian-0.059
 wre-0.053
away-0.053
alogy-0.052
Token that
Token ID326
Feature activation+1.44e+00

Pos logit contributions
FontSize+2.612
Term+2.392
 EPS+2.354
\/+2.277
arsity+2.224
Neg logit contributions
undo-3.527
tarian-3.210
away-2.866
 wre-2.848
alogy-2.846
Token because
Token ID780
Feature activation+1.13e+00

Pos logit contributions
FontSize+1.090
Term+1.006
 EPS+0.982
\/+0.948
arsity+0.946
Neg logit contributions
undo-1.217
tarian-1.093
 wre-0.975
away-0.973
alogy-0.961
Token they
Token ID484
Feature activation+1.37e+00

Pos logit contributions
FontSize+0.811
Term+0.745
 EPS+0.728
\/+0.703
arsity+0.698
Neg logit contributions
undo-0.996
tarian-0.899
 wre-0.804
away-0.804
alogy-0.795
Token follow
Token ID1061
Feature activation+7.06e+00

Pos logit contributions
FontSize+1.020
Term+0.943
 EPS+0.919
\/+0.891
arsity+0.888
Neg logit contributions
undo-1.165
tarian-1.047
 wre-0.938
away-0.937
alogy-0.925
Token the
Token ID262
Feature activation+5.88e-01

Pos logit contributions
FontSize+4.972
Term+4.625
 EPS+4.570
\/+4.437
arsity+4.341
Neg logit contributions
undo-6.007
tarian-5.460
away-4.740
 wre-4.713
alogy-4.708
Token trends
Token ID11257
Feature activation+1.42e+00

Pos logit contributions
FontSize+0.433
Term+0.399
 EPS+0.388
arsity+0.375
\/+0.374
Neg logit contributions
undo-0.507
tarian-0.456
 wre-0.411
away-0.410
alogy-0.403
Token and
Token ID290
Feature activation+1.72e+00

Pos logit contributions
FontSize+1.007
Term+0.924
 EPS+0.900
\/+0.869
arsity+0.866
Neg logit contributions
undo-1.264
tarian-1.143
 wre-1.028
away-1.026
alogy-1.012
Token stay
Token ID2652
Feature activation+1.14e+00

Pos logit contributions
FontSize+1.334
Term+1.234
 EPS+1.208
\/+1.166
arsity+1.164
Neg logit contributions
undo-1.416
tarian-1.268
 wre-1.128
away-1.126
alogy-1.111
Token fresh
Token ID4713
Feature activation+8.26e-01

Pos logit contributions
FontSize+0.642
Term+0.575
 EPS+0.553
arsity+0.528
\/+0.525
Neg logit contributions
undo-1.188
tarian-1.088
 wre-1.002
away-0.999
alogy-0.985
Token price
Token ID2756
Feature activation+2.50e+00

Pos logit contributions
FontSize+0.149
Term+0.135
 EPS+0.130
arsity+0.127
\/+0.126
Neg logit contributions
undo-0.199
tarian-0.181
 wre-0.165
away-0.164
 flung-0.162
Token-
Token ID12
Feature activation-3.12e-01

Pos logit contributions
FontSize+1.730
Term+1.594
 EPS+1.568
\/+1.497
arsity+1.486
Neg logit contributions
undo-2.259
tarian-2.048
away-1.828
 wre-1.828
alogy-1.816
Tokento
Token ID1462
Feature activation+1.42e+00

Pos logit contributions
undo+0.234
tarian+0.206
away+0.182
 wre+0.182
alogy+0.177
Neg logit contributions
FontSize-0.266
Term-0.247
 EPS-0.241
arsity-0.235
\/-0.233
Token-
Token ID12
Feature activation+1.57e+00

Pos logit contributions
FontSize+0.866
Term+0.782
 EPS+0.755
arsity+0.724
\/+0.718
Neg logit contributions
undo-1.410
tarian-1.287
 wre-1.177
away-1.176
alogy-1.159
Tokenear
Token ID451
Feature activation+3.74e+00

Pos logit contributions
FontSize+1.001
Term+0.908
 EPS+0.880
arsity+0.844
\/+0.842
Neg logit contributions
undo-1.517
tarian-1.379
 wre-1.259
away-1.255
alogy-1.237
Tokennings
Token ID23400
Feature activation+7.04e+00

Pos logit contributions
FontSize+3.539
Term+3.328
 EPS+3.267
\/+3.170
arsity+3.156
Neg logit contributions
undo-2.447
tarian-2.094
 wre-1.811
away-1.775
alogy-1.763
Token multiple
Token ID3294
Feature activation+9.88e-02

Pos logit contributions
FontSize+3.147
 EPS+2.939
Term+2.816
\/+2.635
arsity+2.540
Neg logit contributions
undo-7.787
tarian-7.132
away-6.531
 wre-6.528
alogy-6.502
Token may
Token ID743
Feature activation-1.74e+00

Pos logit contributions
FontSize+0.074
Term+0.068
 EPS+0.066
arsity+0.064
\/+0.064
Neg logit contributions
undo-0.084
tarian-0.075
 wre-0.068
away-0.068
alogy-0.067
Token be
Token ID307
Feature activation-6.79e-01

Pos logit contributions
undo+1.379
tarian+1.221
 wre+1.108
away+1.092
alogy+1.071
Neg logit contributions
FontSize-1.410
Term-1.305
 EPS-1.261
arsity-1.231
\/-1.222
Token justified
Token ID14460
Feature activation-3.11e-01

Pos logit contributions
undo+0.557
tarian+0.498
 wre+0.449
away+0.445
alogy+0.438
Neg logit contributions
FontSize-0.530
Term-0.490
 EPS-0.475
arsity-0.462
\/-0.460
Token.
Token ID13
Feature activation+1.55e-01

Pos logit contributions
undo+0.290
tarian+0.264
 wre+0.240
away+0.239
alogy+0.235
Neg logit contributions
FontSize-0.207
Term-0.189
 EPS-0.182
arsity-0.176
\/-0.174
Token said
Token ID531
Feature activation+9.64e-01

Pos logit contributions
undo+1.118
tarian+0.985
 wre+0.874
away+0.867
 flung+0.852
Neg logit contributions
FontSize-1.332
Term-1.239
 EPS-1.200
\/-1.176
arsity-1.176
Token AP
Token ID3486
Feature activation+1.64e+00

Pos logit contributions
FontSize+0.727
Term+0.672
 EPS+0.657
arsity+0.632
\/+0.631
Neg logit contributions
undo-0.814
tarian-0.731
 wre-0.655
away-0.653
alogy-0.644
TokenX
Token ID55
Feature activation+1.74e-01

Pos logit contributions
FontSize+1.309
Term+1.210
 EPS+1.178
arsity+1.145
\/+1.141
Neg logit contributions
undo-1.312
tarian-1.165
away-1.042
 wre-1.040
alogy-1.022
TokenS
Token ID50
Feature activation-4.10e-01

Pos logit contributions
FontSize+0.098
Term+0.085
 EPS+0.084
arsity+0.081
\/+0.079
Neg logit contributions
undo-0.179
tarian-0.163
away-0.151
 wre-0.150
alogy-0.148
Token Principal
Token ID32641
Feature activation+4.60e+00

Pos logit contributions
undo+0.369
tarian+0.333
 wre+0.303
away+0.301
 flung+0.296
Neg logit contributions
FontSize-0.288
Term-0.262
 EPS-0.253
arsity-0.246
\/-0.246
Token Investigator
Token ID49499
Feature activation+7.02e+00

Pos logit contributions
FontSize+1.583
Term+1.410
 EPS+1.379
arsity+1.187
\/+1.168
Neg logit contributions
undo-5.673
tarian-5.310
 wre-4.904
away-4.902
 flung-4.864
Token R
Token ID371
Feature activation+2.85e+00

Pos logit contributions
FontSize+4.955
 EPS+4.767
Term+4.681
\/+4.470
 Mehran+4.455
Neg logit contributions
undo-5.857
tarian-5.336
 wre-4.696
away-4.629
alogy-4.620
Tokenalf
Token ID1604
Feature activation+9.81e-02

Pos logit contributions
FontSize+2.040
Term+1.900
 EPS+1.851
arsity+1.810
\/+1.806
Neg logit contributions
undo-2.451
tarian-2.222
 wre-2.011
away-1.984
 flung-1.983
Token G
Token ID402
Feature activation+1.19e+00

Pos logit contributions
FontSize+0.070
Term+0.064
 EPS+0.063
arsity+0.060
\/+0.060
Neg logit contributions
undo-0.087
tarian-0.078
 wre-0.071
away-0.070
alogy-0.069
Tokenell
Token ID695
Feature activation+1.97e-01

Pos logit contributions
FontSize+0.922
Term+0.858
 EPS+0.832
arsity+0.812
\/+0.812
Neg logit contributions
undo-0.963
tarian-0.864
 wre-0.777
away-0.763
 flung-0.759
Tokenert
Token ID861
Feature activation-1.08e+00

Pos logit contributions
FontSize+0.151
Term+0.140
 EPS+0.137
arsity+0.132
\/+0.132
Neg logit contributions
undo-0.162
tarian-0.145
 wre-0.130
away-0.128
alogy-0.127
Token of
Token ID286
Feature activation+7.39e-02

Pos logit contributions
undo+0.212
tarian+0.191
 wre+0.172
away+0.172
alogy+0.169
Neg logit contributions
FontSize-0.180
Term-0.165
 EPS-0.161
arsity-0.155
\/-0.155
Token Nebraska
Token ID18329
Feature activation-2.76e+00

Pos logit contributions
FontSize+0.054
Term+0.049
 EPS+0.048
arsity+0.046
\/+0.046
Neg logit contributions
undo-0.064
tarian-0.058
 wre-0.052
away-0.052
alogy-0.051
Token currently
Token ID3058
Feature activation-4.30e+00

Pos logit contributions
undo+2.117
tarian+1.881
 wre+1.682
away+1.672
 flung+1.641
Neg logit contributions
FontSize-2.295
Term-2.122
 EPS-2.067
arsity-2.005
\/-1.998
Token has
Token ID468
Feature activation-1.04e-01

Pos logit contributions
undo+3.015
tarian+2.689
 wre+2.415
away+2.359
 flung+2.319
Neg logit contributions
FontSize-3.845
Term-3.533
 EPS-3.423
\/-3.330
arsity-3.324
Token a
Token ID257
Feature activation-1.66e+00

Pos logit contributions
undo+0.089
tarian+0.080
 wre+0.072
away+0.071
alogy+0.070
Neg logit contributions
FontSize-0.078
Term-0.072
 EPS-0.071
arsity-0.068
\/-0.068
Token ban
Token ID3958
Feature activation-8.79e+00

Pos logit contributions
undo+1.302
tarian+1.157
 wre+1.034
away+1.032
 flung+1.008
Neg logit contributions
FontSize-1.355
Term-1.252
 EPS-1.220
arsity-1.187
\/-1.184
Token against
Token ID1028
Feature activation-1.49e+00

Pos logit contributions
undo+6.036
tarian+5.250
 wre+4.727
away+4.687
alogy+4.593
Neg logit contributions
FontSize-7.842
Term-7.224
 EPS-7.010
arsity-6.825
\/-6.756
Token gay
Token ID5650
Feature activation-2.72e+00

Pos logit contributions
undo+1.205
tarian+1.077
 wre+0.970
away+0.963
 flung+0.943
Neg logit contributions
FontSize-1.182
Term-1.090
 EPS-1.058
\/-1.029
arsity-1.029
Token marriage
Token ID4845
Feature activation-6.13e+00

Pos logit contributions
undo+2.167
tarian+1.957
 wre+1.767
away+1.760
 flung+1.718
Neg logit contributions
FontSize-2.144
Term-1.983
 EPS-1.920
\/-1.860
arsity-1.855
Token,
Token ID11
Feature activation-5.20e-01

Pos logit contributions
undo+4.797
tarian+4.388
 wre+4.003
away+3.956
 flung+3.879
Neg logit contributions
FontSize-4.817
Term-4.424
 EPS-4.275
arsity-4.162
\/-4.159
Token the
Token ID262
Feature activation-7.42e-01

Pos logit contributions
undo+0.424
tarian+0.378
 wre+0.340
away+0.338
alogy+0.332
Neg logit contributions
FontSize-0.408
Term-0.377
 EPS-0.367
\/-0.356
arsity-0.355
Token that
Token ID326
Feature activation+7.25e-01

Pos logit contributions
undo+0.555
tarian+0.499
 wre+0.449
away+0.447
alogy+0.441
Neg logit contributions
FontSize-0.475
Term-0.437
 EPS-0.426
arsity-0.410
\/-0.410
Token the
Token ID262
Feature activation-1.34e-01

Pos logit contributions
FontSize+0.569
Term+0.529
 EPS+0.516
\/+0.498
arsity+0.497
Neg logit contributions
undo-0.591
tarian-0.528
 wre-0.471
away-0.469
alogy-0.463
Token EU
Token ID4576
Feature activation-1.22e+00

Pos logit contributions
undo+0.111
tarian+0.099
 wre+0.089
away+0.089
alogy+0.087
Neg logit contributions
FontSize-0.104
Term-0.096
 EPS-0.094
arsity-0.091
\/-0.090
Token lift
Token ID10303
Feature activation-4.80e+00

Pos logit contributions
undo+0.946
tarian+0.840
 wre+0.751
away+0.745
 flung+0.732
Neg logit contributions
FontSize-1.016
Term-0.938
 EPS-0.914
arsity-0.889
\/-0.885
Token its
Token ID663
Feature activation-2.21e+00

Pos logit contributions
undo+3.454
tarian+3.073
 wre+2.718
away+2.717
alogy+2.640
Neg logit contributions
FontSize-4.152
Term-3.852
 EPS-3.732
arsity-3.696
\/-3.631
Token ban
Token ID3958
Feature activation-8.58e+00

Pos logit contributions
undo+1.427
tarian+1.235
 wre+1.070
away+1.068
alogy+1.037
Neg logit contributions
FontSize-2.107
Term-1.973
 EPS-1.925
arsity-1.888
\/-1.876
Token on
Token ID319
Feature activation-3.11e+00

Pos logit contributions
undo+5.965
tarian+5.208
away+4.635
 wre+4.630
alogy+4.535
Neg logit contributions
FontSize-7.547
Term-6.929
 EPS-6.721
arsity-6.629
Unknown-6.363
Token washing
Token ID20518
Feature activation-3.82e-01

Pos logit contributions
undo+2.667
tarian+2.402
 wre+2.180
away+2.166
 flung+2.126
Neg logit contributions
FontSize-2.307
Term-2.109
 EPS-2.032
arsity-1.992
\/-1.966
Token poultry
Token ID34101
Feature activation-4.30e-01

Pos logit contributions
undo+0.291
tarian+0.258
 wre+0.229
away+0.227
alogy+0.223
Neg logit contributions
FontSize-0.321
Term-0.298
 EPS-0.292
arsity-0.283
\/-0.282
Token meat
Token ID6174
Feature activation-2.80e+00

Pos logit contributions
undo+0.374
tarian+0.338
 wre+0.306
away+0.304
 flung+0.299
Neg logit contributions
FontSize-0.315
Term-0.289
 EPS-0.279
arsity-0.271
\/-0.269
Token in
Token ID287
Feature activation-1.80e+00

Pos logit contributions
undo+2.430
tarian+2.199
away+2.001
 wre+1.998
 flung+1.959
Neg logit contributions
FontSize-2.057
Term-1.870
 EPS-1.803
arsity-1.764
\/-1.724
Token call
Token ID869
Feature activation-1.32e+00

Pos logit contributions
undo+0.871
tarian+0.767
 wre+0.685
away+0.684
 flung+0.667
Neg logit contributions
FontSize-0.927
Term-0.860
 EPS-0.833
arsity-0.814
\/-0.807
Token for
Token ID329
Feature activation-5.64e-01

Pos logit contributions
undo+1.125
tarian+1.011
 wre+0.909
away+0.904
alogy+0.892
Neg logit contributions
FontSize-0.990
Term-0.913
 EPS-0.889
arsity-0.858
\/-0.858
Token his
Token ID465
Feature activation+5.66e-01

Pos logit contributions
undo+0.447
tarian+0.398
 wre+0.354
away+0.353
alogy+0.347
Neg logit contributions
FontSize-0.456
Term-0.422
 EPS-0.412
arsity-0.399
\/-0.399
Token temporary
Token ID8584
Feature activation-1.89e+00

Pos logit contributions
FontSize+0.439
Term+0.406
 EPS+0.396
arsity+0.383
\/+0.382
Neg logit contributions
undo-0.467
tarian-0.418
 wre-0.375
away-0.373
alogy-0.367
Token travel
Token ID3067
Feature activation-3.09e+00

Pos logit contributions
undo+1.070
tarian+0.900
away+0.772
 wre+0.766
 flung+0.743
Neg logit contributions
FontSize-1.953
Term-1.834
 EPS-1.794
arsity-1.770
\/-1.752
Token ban
Token ID3958
Feature activation-8.52e+00

Pos logit contributions
undo+1.623
tarian+1.349
away+1.159
 wre+1.141
 flung+1.100
Neg logit contributions
FontSize-3.318
Term-3.115
 EPS-3.058
arsity-3.009
\/-2.975
Token,
Token ID11
Feature activation-9.21e-01

Pos logit contributions
undo+6.509
tarian+5.715
away+5.330
 wre+5.315
 flung+5.156
Neg logit contributions
FontSize-6.838
Term-6.276
 EPS-6.097
arsity-6.014
\/-5.799
Token
Token ID564
Feature activation-3.20e-01

Pos logit contributions
undo+0.757
tarian+0.674
 wre+0.609
away+0.604
 flung+0.594
Neg logit contributions
FontSize-0.718
Term-0.664
 EPS-0.645
arsity-0.628
\/-0.623
Token
Token ID250
Feature activation-8.32e-01

Pos logit contributions
undo+0.301
tarian+0.273
 wre+0.249
away+0.248
alogy+0.244
Neg logit contributions
FontSize-0.212
Term-0.193
 EPS-0.187
arsity-0.179
\/-0.179
TokenThis
Token ID1212
Feature activation+1.81e-01

Pos logit contributions
undo+0.657
tarian+0.581
away+0.519
 wre+0.519
alogy+0.506
Neg logit contributions
FontSize-0.676
Term-0.627
 EPS-0.614
arsity-0.595
\/-0.591
Token piece
Token ID3704
Feature activation-3.61e+00

Pos logit contributions
FontSize+0.140
Term+0.129
 EPS+0.126
\/+0.121
arsity+0.121
Neg logit contributions
undo-0.151
tarian-0.135
 wre-0.121
away-0.120
alogy-0.119
Token a
Token ID257
Feature activation-5.45e-01

Pos logit contributions
undo+1.553
tarian+1.384
 wre+1.247
away+1.239
alogy+1.212
Neg logit contributions
FontSize-1.496
Term-1.381
 EPS-1.343
arsity-1.310
\/-1.301
Token border
Token ID4865
Feature activation-1.27e+00

Pos logit contributions
undo+0.466
tarian+0.418
 wre+0.376
away+0.375
alogy+0.369
Neg logit contributions
FontSize-0.407
Term-0.375
 EPS-0.365
arsity-0.352
\/-0.352
Token wall
Token ID3355
Feature activation-3.34e+00

Pos logit contributions
undo+0.830
tarian+0.720
 wre+0.622
away+0.619
alogy+0.605
Neg logit contributions
FontSize-1.197
Term-1.123
 EPS-1.099
arsity-1.070
\/-1.069
Token and
Token ID290
Feature activation-9.35e-01

Pos logit contributions
undo+2.671
tarian+2.384
 wre+2.165
away+2.138
 flung+2.110
Neg logit contributions
FontSize-2.660
Term-2.452
 EPS-2.371
arsity-2.332
\/-2.287
Token a
Token ID257
Feature activation-9.17e-01

Pos logit contributions
undo+0.766
tarian+0.685
 wre+0.611
away+0.608
alogy+0.600
Neg logit contributions
FontSize-0.730
Term-0.675
 EPS-0.661
\/-0.639
arsity-0.634
Token ban
Token ID3958
Feature activation-8.39e+00

Pos logit contributions
undo+0.736
tarian+0.657
 wre+0.586
away+0.583
alogy+0.574
Neg logit contributions
FontSize-0.730
Term-0.677
 EPS-0.661
\/-0.639
arsity-0.638
Token on
Token ID319
Feature activation-2.54e+00

Pos logit contributions
undo+5.926
tarian+5.143
 wre+4.543
away+4.540
 hammer+4.462
Neg logit contributions
FontSize-7.426
Term-6.884
 EPS-6.675
arsity-6.586
\/-6.379
Token
Token ID564
Feature activation+7.44e-01

Pos logit contributions
undo+1.995
tarian+1.784
 wre+1.598
away+1.582
alogy+1.555
Neg logit contributions
FontSize-2.062
Term-1.909
 EPS-1.853
arsity-1.804
\/-1.797
Token
Token ID246
Feature activation+1.47e+00

Pos logit contributions
FontSize+0.466
Term+0.424
 EPS+0.409
arsity+0.392
\/+0.392
Neg logit contributions
undo-0.721
tarian-0.658
 wre-0.602
away-0.599
 flung-0.590
Tokenchain
Token ID7983
Feature activation-8.41e-01

Pos logit contributions
FontSize+1.097
Term+1.017
 EPS+0.977
\/+0.951
arsity+0.945
Neg logit contributions
undo-1.243
tarian-1.115
 wre-1.009
away-0.991
 flung-0.988
Token migration
Token ID13472
Feature activation-1.66e+00

Pos logit contributions
undo+0.644
tarian+0.571
away+0.509
 wre+0.509
 flung+0.496
Neg logit contributions
FontSize-0.704
Term-0.654
 EPS-0.636
arsity-0.621
\/-0.616
Tokenid
Token ID312
Feature activation-2.08e-01

Pos logit contributions
FontSize+0.412
Term+0.382
 EPS+0.375
arsity+0.361
\/+0.360
Neg logit contributions
undo-0.357
tarian-0.314
away-0.278
 wre-0.276
alogy-0.271
Token wrote
Token ID2630
Feature activation-9.65e-01

Pos logit contributions
undo+0.157
tarian+0.139
 wre+0.123
away+0.122
alogy+0.120
Neg logit contributions
FontSize-0.176
Term-0.164
 EPS-0.160
arsity-0.156
\/-0.155
Token:
Token ID25
Feature activation-4.29e-01

Pos logit contributions
undo+0.896
tarian+0.809
 wre+0.739
away+0.737
alogy+0.724
Neg logit contributions
FontSize-0.651
Term-0.594
 EPS-0.575
arsity-0.555
\/-0.550
Token
Token ID198
Feature activation-5.39e-01

Pos logit contributions
undo+0.364
tarian+0.328
 wre+0.294
away+0.293
alogy+0.288
Neg logit contributions
FontSize-0.322
Term-0.297
 EPS-0.290
arsity-0.279
\/-0.279
Token
Token ID198
Feature activation-7.43e-01

Pos logit contributions
undo+0.403
tarian+0.351
away+0.319
 wre+0.312
alogy+0.310
Neg logit contributions
FontSize-0.458
Term-0.423
 EPS-0.415
arsity-0.405
\/-0.396
TokenAdvertisement
Token ID4723
Feature activation-8.28e+00

Pos logit contributions
undo+0.628
tarian+0.559
away+0.507
 wre+0.501
alogy+0.494
Neg logit contributions
FontSize-0.561
Term-0.512
 EPS-0.508
arsity-0.487
\/-0.483
Token
Token ID198
Feature activation-1.49e+00

Pos logit contributions
undo+5.577
tarian+4.616
away+4.389
alogy+4.218
 wre+4.210
Neg logit contributions
FontSize-7.477
Term-6.813
arsity-6.579
 EPS-6.558
\/-6.401
Token
Token ID198
Feature activation-1.61e+00

Pos logit contributions
undo+1.109
tarian+0.962
away+0.880
 wre+0.858
alogy+0.848
Neg logit contributions
FontSize-1.279
Term-1.180
 EPS-1.154
arsity-1.131
\/-1.103
TokenAdvertisement
Token ID4723
Feature activation-6.84e+00

Pos logit contributions
undo+1.240
tarian+1.092
away+0.975
 wre+0.966
alogy+0.948
Neg logit contributions
FontSize-1.338
Term-1.238
 EPS-1.221
arsity-1.182
\/-1.173
Token
Token ID198
Feature activation-9.82e-01

Pos logit contributions
undo+4.656
tarian+3.864
away+3.698
alogy+3.564
 que+3.399
Neg logit contributions
FontSize-6.161
Term-5.599
 EPS-5.443
arsity-5.389
\/-5.180
Token
Token ID198
Feature activation-1.15e+00

Pos logit contributions
undo+0.735
tarian+0.639
away+0.582
 wre+0.570
alogy+0.563
Neg logit contributions
FontSize-0.837
Term-0.773
 EPS-0.756
arsity-0.739
\/-0.724
Token blowing
Token ID19280
Feature activation+1.25e+00

Pos logit contributions
FontSize+0.251
Term+0.232
 EPS+0.226
arsity+0.219
\/+0.218
Neg logit contributions
undo-0.262
tarian-0.234
 wre-0.210
away-0.209
alogy-0.205
Token up
Token ID510
Feature activation-8.02e-01

Pos logit contributions
FontSize+0.981
Term+0.909
 EPS+0.888
arsity+0.858
\/+0.857
Neg logit contributions
undo-1.011
tarian-0.902
 wre-0.804
away-0.800
alogy-0.789
Token
Token ID960
Feature activation-1.26e+00

Pos logit contributions
undo+0.650
tarian+0.579
 wre+0.528
away+0.521
 flung+0.513
Neg logit contributions
FontSize-0.634
Term-0.586
 EPS-0.567
arsity-0.553
\/-0.549
Token
Token ID198
Feature activation+2.27e+00

Pos logit contributions
undo+0.948
tarian+0.835
away+0.741
 wre+0.741
alogy+0.722
Neg logit contributions
FontSize-1.069
Term-0.992
 EPS-0.972
arsity-0.943
\/-0.939
Token
Token ID198
Feature activation+1.97e+00

Pos logit contributions
FontSize+1.832
Term+1.698
 EPS+1.656
arsity+1.605
\/+1.601
Neg logit contributions
undo-1.808
tarian-1.609
 wre-1.435
away-1.431
alogy-1.406
TokenAdvertisement
Token ID4723
Feature activation-8.22e+00

Pos logit contributions
FontSize+1.394
Term+1.277
 EPS+1.244
arsity+1.198
\/+1.194
Neg logit contributions
undo-1.766
tarian-1.593
 wre-1.440
away-1.439
alogy-1.417
Token
Token ID198
Feature activation-7.61e-02

Pos logit contributions
undo+5.557
tarian+4.428
away+4.382
alogy+4.100
 flung+4.061
Neg logit contributions
FontSize-7.273
Term-6.833
 EPS-6.551
arsity-6.479
\/-6.292
Token
Token ID198
Feature activation+1.20e-01

Pos logit contributions
undo+0.057
tarian+0.049
away+0.045
 wre+0.044
alogy+0.044
Neg logit contributions
FontSize-0.065
Term-0.060
 EPS-0.058
arsity-0.057
\/-0.056
TokenN
Token ID45
Feature activation+2.35e+00

Pos logit contributions
FontSize+0.084
Term+0.077
 EPS+0.076
arsity+0.073
\/+0.071
Neg logit contributions
undo-0.107
tarian-0.096
away-0.088
alogy-0.086
 wre-0.086
TokenPH
Token ID11909
Feature activation-1.67e+00

Pos logit contributions
FontSize+1.629
Term+1.491
 EPS+1.446
arsity+1.392
\/+1.392
Neg logit contributions
undo-2.132
tarian-1.928
 wre-1.751
away-1.740
alogy-1.715
Token:
Token ID25
Feature activation+1.10e-01

Pos logit contributions
undo+1.567
tarian+1.422
 wre+1.293
away+1.290
alogy+1.273
Neg logit contributions
FontSize-1.101
Term-1.002
 EPS-0.970
arsity-0.934
\/-0.931
Token Claire
Token ID20758
Feature activation-1.35e+00

Pos logit contributions
undo+2.820
tarian+2.505
 wre+2.314
away+2.291
 flung+2.263
Neg logit contributions
FontSize-2.502
Term-2.303
 EPS-2.199
\/-2.142
arsity-2.131
Tokenville
Token ID4244
Feature activation-3.34e-01

Pos logit contributions
undo+1.235
tarian+1.118
away+1.015
 wre+1.015
alogy+0.998
Neg logit contributions
FontSize-0.934
Term-0.850
 EPS-0.826
arsity-0.795
\/-0.793
Token Reserv
Token ID40425
Feature activation-1.34e+00

Pos logit contributions
undo+0.284
tarian+0.255
 wre+0.230
away+0.230
alogy+0.225
Neg logit contributions
FontSize-0.251
Term-0.231
 EPS-0.224
arsity-0.216
\/-0.216
Tokenoir
Token ID10840
Feature activation-2.26e+00

Pos logit contributions
undo+0.627
tarian+0.510
 wre+0.408
away+0.401
 flung+0.387
Neg logit contributions
FontSize-1.516
Term-1.440
 EPS-1.414
arsity-1.385
\/-1.384
Token in
Token ID287
Feature activation-1.81e+00

Pos logit contributions
undo+1.842
tarian+1.642
 wre+1.478
away+1.476
alogy+1.447
Neg logit contributions
FontSize-1.786
Term-1.642
 EPS-1.601
arsity-1.549
\/-1.545
Token Bram
Token ID35315
Feature activation-8.13e+00

Pos logit contributions
undo+1.528
tarian+1.362
 wre+1.243
away+1.241
 flung+1.217
Neg logit contributions
FontSize-1.366
Term-1.256
 EPS-1.209
arsity-1.175
\/-1.173
Tokenpton
Token ID10972
Feature activation-4.15e+00

Pos logit contributions
undo+5.556
tarian+4.757
away+4.408
 wre+4.110
 flung+4.017
Neg logit contributions
FontSize-7.540
Term-6.755
 EPS-6.624
arsity-6.426
 Mehran-6.329
Token to
Token ID284
Feature activation-1.63e+00

Pos logit contributions
undo+3.218
tarian+2.861
away+2.594
 wre+2.584
 flung+2.546
Neg logit contributions
FontSize-3.375
Term-3.122
 EPS-3.027
arsity-2.947
\/-2.886
Token the
Token ID262
Feature activation-3.62e-01

Pos logit contributions
undo+1.392
tarian+1.243
 wre+1.135
away+1.128
 flung+1.110
Neg logit contributions
FontSize-1.205
Term-1.109
 EPS-1.068
\/-1.035
arsity-1.034
Token shore
Token ID15191
Feature activation+1.19e-01

Pos logit contributions
undo+0.308
tarian+0.276
 wre+0.251
away+0.249
 flung+0.245
Neg logit contributions
FontSize-0.272
Term-0.251
 EPS-0.242
\/-0.235
arsity-0.234
Token of
Token ID286
Feature activation-1.71e+00

Pos logit contributions
FontSize+0.078
Term+0.071
 EPS+0.069
arsity+0.066
\/+0.066
Neg logit contributions
undo-0.112
tarian-0.102
 wre-0.092
away-0.092
alogy-0.091
Token forcing
Token ID10833
Feature activation-2.27e+00

Pos logit contributions
FontSize+0.127
Term+0.118
 EPS+0.116
arsity+0.112
\/+0.112
Neg logit contributions
undo-0.104
tarian-0.092
 wre-0.080
away-0.080
alogy-0.079
Token a
Token ID257
Feature activation-8.44e-01

Pos logit contributions
undo+1.824
tarian+1.629
 wre+1.484
away+1.472
 flung+1.437
Neg logit contributions
FontSize-1.790
Term-1.663
 EPS-1.600
arsity-1.574
\/-1.558
Token fumble
Token ID41405
Feature activation-1.84e+00

Pos logit contributions
undo+0.706
tarian+0.632
 wre+0.570
away+0.567
 flung+0.557
Neg logit contributions
FontSize-0.645
Term-0.595
 EPS-0.577
arsity-0.562
\/-0.559
Token on
Token ID319
Feature activation-5.41e-01

Pos logit contributions
undo+1.575
tarian+1.422
away+1.293
 wre+1.290
 flung+1.261
Neg logit contributions
FontSize-1.365
Term-1.258
 EPS-1.212
arsity-1.180
\/-1.168
Token Rob
Token ID3851
Feature activation-1.67e+00

Pos logit contributions
undo+0.448
tarian+0.401
 wre+0.359
away+0.358
alogy+0.352
Neg logit contributions
FontSize-0.418
Term-0.386
 EPS-0.377
arsity-0.364
\/-0.363
Token Gron
Token ID40214
Feature activation-8.13e+00

Pos logit contributions
undo+1.218
tarian+1.072
away+0.945
 wre+0.944
alogy+0.919
Neg logit contributions
FontSize-1.463
Term-1.364
 EPS-1.332
arsity-1.293
\/-1.291
Tokenkowski
Token ID26216
Feature activation-1.25e+00

Pos logit contributions
undo+7.201
tarian+6.582
 wre+5.974
away+5.900
 flung+5.725
Neg logit contributions
FontSize-5.738
Term-5.287
 EPS-5.165
\/-4.900
arsity-4.878
Token during
Token ID1141
Feature activation-6.71e-01

Pos logit contributions
undo+1.111
tarian+1.003
 wre+0.910
away+0.906
 flung+0.890
Neg logit contributions
FontSize-0.895
Term-0.821
 EPS-0.797
arsity-0.769
\/-0.766
Token 7
Token ID767
Feature activation-8.85e-01

Pos logit contributions
undo+0.551
tarian+0.493
 wre+0.439
away+0.438
alogy+0.433
Neg logit contributions
FontSize-0.523
Term-0.483
 EPS-0.473
\/-0.455
arsity-0.455
Token-
Token ID12
Feature activation+1.36e-02

Pos logit contributions
undo+0.865
tarian+0.788
 wre+0.721
away+0.719
alogy+0.707
Neg logit contributions
FontSize-0.553
Term-0.499
 EPS-0.483
arsity-0.464
\/-0.460
Tokenon
Token ID261
Feature activation+1.05e+00

Pos logit contributions
FontSize+0.013
Term+0.012
 EPS+0.012
arsity+0.011
\/+0.011
Neg logit contributions
undo-0.009
tarian-0.008
away-0.007
 wre-0.007
alogy-0.007
Token after
Token ID706
Feature activation-1.57e+00

Pos logit contributions
undo+0.355
tarian+0.324
 wre+0.297
away+0.296
alogy+0.292
Neg logit contributions
FontSize-0.214
Term-0.193
 EPS-0.186
arsity-0.178
\/-0.178
Token Trump
Token ID1301
Feature activation-3.26e-01

Pos logit contributions
undo+1.376
tarian+1.239
 wre+1.125
away+1.115
 flung+1.096
Neg logit contributions
FontSize-1.139
Term-1.045
 EPS-1.013
arsity-0.983
\/-0.978
Token tweeted
Token ID10830
Feature activation+7.84e-01

Pos logit contributions
undo+0.260
tarian+0.232
 wre+0.206
away+0.205
alogy+0.203
Neg logit contributions
FontSize-0.260
Term-0.241
 EPS-0.236
\/-0.228
arsity-0.227
Token about
Token ID546
Feature activation-3.50e+00

Pos logit contributions
FontSize+0.582
Term+0.536
 EPS+0.522
arsity+0.504
\/+0.503
Neg logit contributions
undo-0.673
tarian-0.605
 wre-0.545
away-0.543
alogy-0.534
Token his
Token ID465
Feature activation+3.08e-01

Pos logit contributions
undo+2.731
tarian+2.433
 wre+2.195
away+2.166
 flung+2.110
Neg logit contributions
FontSize-2.848
Term-2.654
 EPS-2.570
arsity-2.515
\/-2.507
Token ban
Token ID3958
Feature activation-8.07e+00

Pos logit contributions
FontSize+0.257
Term+0.239
 EPS+0.233
arsity+0.226
\/+0.226
Neg logit contributions
undo-0.236
tarian-0.209
 wre-0.185
away-0.184
alogy-0.181
Token,
Token ID11
Feature activation-1.37e+00

Pos logit contributions
undo+6.308
tarian+5.626
 wre+5.097
away+5.088
alogy+4.958
Neg logit contributions
FontSize-6.350
Term-5.898
 EPS-5.673
arsity-5.637
\/-5.495
Token he
Token ID339
Feature activation-4.29e-01

Pos logit contributions
undo+1.126
tarian+1.009
 wre+0.906
away+0.900
 flung+0.883
Neg logit contributions
FontSize-1.064
Term-0.983
 EPS-0.950
arsity-0.926
\/-0.924
Token offered
Token ID4438
Feature activation-9.54e-01

Pos logit contributions
undo+0.340
tarian+0.303
 wre+0.270
away+0.269
alogy+0.264
Neg logit contributions
FontSize-0.347
Term-0.322
 EPS-0.314
arsity-0.304
\/-0.304
Token a
Token ID257
Feature activation-1.03e+00

Pos logit contributions
undo+0.766
tarian+0.684
 wre+0.612
away+0.611
alogy+0.598
Neg logit contributions
FontSize-0.761
Term-0.706
 EPS-0.685
arsity-0.671
\/-0.664
Token tweet
Token ID6126
Feature activation+1.47e+00

Pos logit contributions
undo+0.852
tarian+0.757
away+0.682
 wre+0.681
 flung+0.670
Neg logit contributions
FontSize-0.795
Term-0.735
 EPS-0.711
arsity-0.702
\/-0.689
Token Recently
Token ID23413
Feature activation+4.85e-01

Pos logit contributions
undo+0.094
tarian+0.083
 wre+0.075
away+0.074
alogy+0.073
Neg logit contributions
FontSize-0.094
Term-0.087
 EPS-0.084
arsity-0.082
\/-0.082
Token the
Token ID262
Feature activation+8.89e-03

Pos logit contributions
FontSize+0.361
Term+0.334
 EPS+0.326
\/+0.314
arsity+0.313
Neg logit contributions
undo-0.414
tarian-0.372
 wre-0.334
away-0.333
alogy-0.329
Token European
Token ID3427
Feature activation-1.94e-01

Pos logit contributions
FontSize+0.007
Term+0.006
 EPS+0.006
arsity+0.006
\/+0.006
Neg logit contributions
undo-0.008
tarian-0.007
 wre-0.006
away-0.006
alogy-0.006
Token union
Token ID6441
Feature activation-3.35e+00

Pos logit contributions
undo+0.141
tarian+0.124
 wre+0.110
away+0.109
alogy+0.107
Neg logit contributions
FontSize-0.169
Term-0.157
 EPS-0.154
arsity-0.149
\/-0.149
Token has
Token ID468
Feature activation-3.74e-01

Pos logit contributions
undo+2.731
tarian+2.434
 wre+2.200
away+2.184
 flung+2.150
Neg logit contributions
FontSize-2.638
Term-2.421
 EPS-2.326
arsity-2.275
\/-2.260
Token banned
Token ID9301
Feature activation-7.98e+00

Pos logit contributions
undo+0.297
tarian+0.265
 wre+0.235
away+0.234
alogy+0.231
Neg logit contributions
FontSize-0.301
Term-0.280
 EPS-0.274
\/-0.264
arsity-0.264
Token the
Token ID262
Feature activation-1.21e+00

Pos logit contributions
undo+6.033
tarian+5.431
 wre+5.056
away+4.936
 hammer+4.902
Neg logit contributions
FontSize-6.421
Term-5.735
arsity-5.564
\/-5.341
 EPS-5.334
Token use
Token ID779
Feature activation+2.04e+00

Pos logit contributions
undo+0.993
tarian+0.889
 wre+0.797
away+0.793
alogy+0.780
Neg logit contributions
FontSize-0.937
Term-0.865
 EPS-0.842
arsity-0.815
\/-0.814
Token of
Token ID286
Feature activation+2.11e-02

Pos logit contributions
FontSize+1.344
Term+1.234
 EPS+1.215
\/+1.164
arsity+1.151
Neg logit contributions
undo-1.897
tarian-1.717
 wre-1.545
away-1.542
alogy-1.531
Token the
Token ID262
Feature activation-4.78e-01

Pos logit contributions
FontSize+0.016
Term+0.015
 EPS+0.014
\/+0.014
arsity+0.014
Neg logit contributions
undo-0.018
tarian-0.016
 wre-0.014
away-0.014
alogy-0.014
Token three
Token ID1115
Feature activation+1.01e+00

Pos logit contributions
undo+0.397
tarian+0.355
 wre+0.318
away+0.317
alogy+0.312
Neg logit contributions
FontSize-0.369
Term-0.341
 EPS-0.332
arsity-0.321
\/-0.321
Token In
Token ID554
Feature activation-1.32e-01

Pos logit contributions
undo+0.433
tarian+0.385
away+0.343
 wre+0.342
alogy+0.336
Neg logit contributions
FontSize-0.458
Term-0.423
 EPS-0.411
arsity-0.400
\/-0.399
Token the
Token ID262
Feature activation+3.93e-01

Pos logit contributions
undo+0.101
tarian+0.090
 wre+0.080
away+0.080
alogy+0.078
Neg logit contributions
FontSize-0.109
Term-0.101
 EPS-0.099
arsity-0.096
\/-0.096
Token face
Token ID1986
Feature activation+4.93e+00

Pos logit contributions
FontSize+0.294
Term+0.270
 EPS+0.263
arsity+0.254
\/+0.254
Neg logit contributions
undo-0.335
tarian-0.301
 wre-0.271
away-0.270
alogy-0.266
Token of
Token ID286
Feature activation-2.28e-01

Pos logit contributions
FontSize+3.195
Term+2.918
 EPS+2.859
\/+2.776
arsity+2.708
Neg logit contributions
undo-4.591
tarian-4.209
 wre-3.790
 flung-3.734
away-3.720
Token these
Token ID777
Feature activation+1.01e+00

Pos logit contributions
undo+0.188
tarian+0.168
 wre+0.151
away+0.150
alogy+0.148
Neg logit contributions
FontSize-0.177
Term-0.163
 EPS-0.159
arsity-0.154
\/-0.154
Token discrim
Token ID6534
Feature activation-7.91e+00

Pos logit contributions
FontSize+0.803
Term+0.746
 EPS+0.730
\/+0.705
arsity+0.704
Neg logit contributions
undo-0.807
tarian-0.718
away-0.636
 wre-0.636
alogy-0.627
Tokeninations
Token ID7352
Feature activation-2.45e+00

Pos logit contributions
undo+4.497
tarian+3.911
away+3.294
alogy+3.240
 wre+3.215
Neg logit contributions
FontSize-8.067
Term-7.384
 EPS-7.248
arsity-6.973
Merit-6.899
Token,
Token ID11
Feature activation-8.66e-01

Pos logit contributions
undo+2.066
tarian+1.869
 wre+1.699
away+1.675
 flung+1.663
Neg logit contributions
FontSize-1.851
Term-1.700
 EPS-1.639
arsity-1.588
\/-1.580
Token it
Token ID340
Feature activation-5.63e-01

Pos logit contributions
undo+0.669
tarian+0.598
 wre+0.534
away+0.528
 flung+0.520
Neg logit contributions
FontSize-0.716
Term-0.664
 EPS-0.644
arsity-0.627
\/-0.625
Token is
Token ID318
Feature activation+8.50e-01

Pos logit contributions
undo+0.460
tarian+0.411
 wre+0.368
away+0.367
alogy+0.361
Neg logit contributions
FontSize-0.441
Term-0.408
 EPS-0.398
arsity-0.385
\/-0.384
Token currently
Token ID3058
Feature activation-2.83e+00

Pos logit contributions
FontSize+0.660
Term+0.610
 EPS+0.592
arsity+0.575
\/+0.573
Neg logit contributions
undo-0.700
tarian-0.627
 wre-0.563
away-0.559
alogy-0.550
Token sites
Token ID5043
Feature activation-8.04e-01

Pos logit contributions
FontSize+1.592
Term+1.467
 EPS+1.425
arsity+1.373
\/+1.369
Neg logit contributions
undo-2.116
tarian-1.920
away-1.730
 wre-1.729
alogy-1.704
Token reported
Token ID2098
Feature activation-1.13e+00

Pos logit contributions
undo+0.733
tarian+0.656
 wre+0.602
away+0.597
 flung+0.589
Neg logit contributions
FontSize-0.564
Term-0.511
 EPS-0.493
\/-0.476
arsity-0.474
Token.
Token ID13
Feature activation-5.15e-01

Pos logit contributions
undo+1.091
tarian+0.990
 wre+0.906
away+0.903
 flung+0.888
Neg logit contributions
FontSize-0.720
Term-0.652
 EPS-0.631
arsity-0.604
\/-0.602
Token
Token ID198
Feature activation-3.18e-02

Pos logit contributions
undo+0.414
tarian+0.370
 wre+0.330
away+0.328
alogy+0.323
Neg logit contributions
FontSize-0.409
Term-0.381
 EPS-0.372
arsity-0.359
\/-0.358
Token
Token ID198
Feature activation-2.52e-02

Pos logit contributions
undo+0.024
tarian+0.021
away+0.019
 wre+0.019
alogy+0.019
Neg logit contributions
FontSize-0.027
Term-0.025
 EPS-0.024
arsity-0.024
\/-0.023
TokenAdvertisement
Token ID4723
Feature activation-7.80e+00

Pos logit contributions
undo+0.020
tarian+0.018
away+0.016
 wre+0.016
alogy+0.016
Neg logit contributions
FontSize-0.020
Term-0.018
 EPS-0.018
arsity-0.017
\/-0.017
Token Continue
Token ID10054
Feature activation-4.79e-02

Pos logit contributions
undo+6.029
tarian+4.999
away+4.666
alogy+4.638
 wre+4.608
Neg logit contributions
FontSize-6.369
Term-5.883
 EPS-5.679
\/-5.506
arsity-5.425
Token reading
Token ID3555
Feature activation-2.01e-01

Pos logit contributions
undo+0.029
tarian+0.025
 wre+0.021
away+0.021
alogy+0.020
Neg logit contributions
FontSize-0.048
Term-0.045
 EPS-0.044
arsity-0.043
\/-0.043
Token the
Token ID262
Feature activation+9.54e-01

Pos logit contributions
undo+0.155
tarian+0.137
 wre+0.123
away+0.123
alogy+0.120
Neg logit contributions
FontSize-0.165
Term-0.153
 EPS-0.149
arsity-0.145
\/-0.144
Token main
Token ID1388
Feature activation+1.17e+00

Pos logit contributions
FontSize+0.508
Term+0.447
 EPS+0.426
arsity+0.408
\/+0.403
Neg logit contributions
undo-1.024
tarian-0.940
 wre-0.870
away-0.869
alogy-0.855
Token story
Token ID1621
Feature activation+2.63e+00

Pos logit contributions
FontSize+0.808
Term+0.736
 EPS+0.710
arsity+0.686
\/+0.684
Neg logit contributions
undo-1.073
tarian-0.972
 wre-0.881
away-0.879
alogy-0.866
Token hard
Token ID1327
Feature activation-1.24e+00

Pos logit contributions
undo+0.426
tarian+0.377
 wre+0.338
away+0.336
 flung+0.327
Neg logit contributions
FontSize-0.479
Term-0.443
 EPS-0.432
arsity-0.420
\/-0.418
Token and
Token ID290
Feature activation+1.19e+00

Pos logit contributions
undo+1.060
tarian+0.952
 wre+0.865
away+0.860
 flung+0.841
Neg logit contributions
FontSize-0.933
Term-0.856
 EPS-0.833
arsity-0.802
\/-0.799
Token we
Token ID356
Feature activation+8.06e-01

Pos logit contributions
FontSize+0.976
Term+0.905
 EPS+0.883
arsity+0.855
\/+0.854
Neg logit contributions
undo-0.934
tarian-0.829
 wre-0.740
away-0.736
alogy-0.721
Token have
Token ID423
Feature activation+1.59e-01

Pos logit contributions
FontSize+0.630
Term+0.583
 EPS+0.568
arsity+0.549
\/+0.548
Neg logit contributions
undo-0.660
tarian-0.590
 wre-0.529
away-0.526
alogy-0.517
Token a
Token ID257
Feature activation-6.53e-01

Pos logit contributions
FontSize+0.129
Term+0.119
 EPS+0.116
arsity+0.113
\/+0.112
Neg logit contributions
undo-0.126
tarian-0.112
 wre-0.101
away-0.100
 flung-0.098
Token physical
Token ID3518
Feature activation-7.72e+00

Pos logit contributions
undo+0.563
tarian+0.506
 wre+0.459
away+0.457
 flung+0.449
Neg logit contributions
FontSize-0.482
Term-0.442
 EPS-0.429
arsity-0.415
\/-0.414
Token group
Token ID1448
Feature activation-8.74e-01

Pos logit contributions
undo+5.735
tarian+5.303
 wre+4.878
away+4.847
 rejuven+4.691
Neg logit contributions
FontSize-6.247
Term-5.743
 EPS-5.397
\/-5.280
arsity-5.219
Token that
Token ID326
Feature activation+9.26e-01

Pos logit contributions
undo+0.753
tarian+0.678
 wre+0.615
away+0.613
alogy+0.599
Neg logit contributions
FontSize-0.646
Term-0.594
 EPS-0.576
arsity-0.556
\/-0.553
Token is
Token ID318
Feature activation+1.45e+00

Pos logit contributions
FontSize+0.730
Term+0.675
 EPS+0.658
arsity+0.637
\/+0.636
Neg logit contributions
undo-0.752
tarian-0.672
 wre-0.601
away-0.599
alogy-0.588
Token setting
Token ID4634
Feature activation-9.18e-01

Pos logit contributions
FontSize+1.151
Term+1.066
 EPS+1.048
\/+1.010
arsity+1.010
Neg logit contributions
undo-1.169
tarian-1.049
 wre-0.919
away-0.917
alogy-0.915
Token the
Token ID262
Feature activation+8.85e-01

Pos logit contributions
undo+0.747
tarian+0.666
 wre+0.604
away+0.600
 flung+0.586
Neg logit contributions
FontSize-0.721
Term-0.666
 EPS-0.645
arsity-0.627
\/-0.625
Token jail
Token ID7356
Feature activation+5.94e+00

Pos logit contributions
undo+0.900
tarian+0.801
 wre+0.724
away+0.716
alogy+0.702
Neg logit contributions
FontSize-0.896
Term-0.826
 EPS-0.804
arsity-0.783
\/-0.780
Token time
Token ID640
Feature activation+6.41e-01

Pos logit contributions
FontSize+3.846
Term+3.675
 EPS+3.466
\/+3.398
arsity+3.329
Neg logit contributions
undo-5.467
tarian-4.984
alogy-4.451
 wre-4.391
away-4.331
Token.
Token ID13
Feature activation-3.09e-01

Pos logit contributions
FontSize+0.444
Term+0.407
 EPS+0.395
arsity+0.380
\/+0.380
Neg logit contributions
undo-0.582
tarian-0.526
 wre-0.476
away-0.475
alogy-0.468
Token
Token ID198
Feature activation+6.42e-01

Pos logit contributions
undo+0.246
tarian+0.219
 wre+0.196
away+0.195
alogy+0.192
Neg logit contributions
FontSize-0.248
Term-0.229
 EPS-0.223
arsity-0.217
\/-0.216
Token
Token ID198
Feature activation+1.55e-01

Pos logit contributions
FontSize+0.521
Term+0.482
 EPS+0.472
arsity+0.458
\/+0.456
Neg logit contributions
undo-0.505
tarian-0.448
away-0.400
 wre-0.399
alogy-0.392
TokenAdvertisement
Token ID4723
Feature activation-7.68e+00

Pos logit contributions
FontSize+0.120
Term+0.111
 EPS+0.109
arsity+0.105
\/+0.105
Neg logit contributions
undo-0.127
tarian-0.113
away-0.102
 wre-0.101
alogy-0.100
Token
Token ID198
Feature activation+1.70e-02

Pos logit contributions
undo+5.586
tarian+4.761
away+4.410
 wre+4.302
alogy+4.301
Neg logit contributions
FontSize-6.452
Term-6.016
 EPS-5.803
arsity-5.702
\/-5.590
Token
Token ID198
Feature activation-1.52e-01

Pos logit contributions
FontSize+0.014
Term+0.013
 EPS+0.013
arsity+0.012
\/+0.012
Neg logit contributions
undo-0.013
tarian-0.011
away-0.010
 wre-0.010
alogy-0.010
TokenAdvertisement
Token ID4723
Feature activation-6.49e+00

Pos logit contributions
undo+0.124
tarian+0.110
away+0.099
 wre+0.098
alogy+0.097
Neg logit contributions
FontSize-0.119
Term-0.109
 EPS-0.108
arsity-0.105
\/-0.104
Token
Token ID198
Feature activation+1.23e-01

Pos logit contributions
undo+4.513
tarian+3.766
away+3.574
alogy+3.465
 wre+3.288
Neg logit contributions
FontSize-5.704
Term-5.287
 EPS-5.148
arsity-5.034
\/-4.899
Token
Token ID198
Feature activation+2.91e-02

Pos logit contributions
FontSize+0.102
Term+0.094
 EPS+0.092
arsity+0.090
\/+0.089
Neg logit contributions
undo-0.094
tarian-0.083
away-0.075
 wre-0.074
alogy-0.073
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
undo+0.297
tarian+0.269
 wre+0.243
away+0.243
alogy+0.239
Neg logit contributions
FontSize-0.224
Term-0.205
 EPS-0.199
\/-0.192
arsity-0.192
Token
Token ID447
Feature activation+6.37e-01

Pos logit contributions
FontSize+0.112
Term+0.103
 EPS+0.100
arsity+0.096
\/+0.096
Neg logit contributions
undo-0.149
tarian-0.135
 wre-0.122
away-0.121
alogy-0.120
Token
Token ID251
Feature activation+1.27e+00

Pos logit contributions
FontSize+0.393
Term+0.356
 EPS+0.343
arsity+0.329
\/+0.328
Neg logit contributions
undo-0.625
tarian-0.572
 wre-0.524
away-0.521
 flung-0.513
Token
Token ID198
Feature activation+8.14e-01

Pos logit contributions
FontSize+1.000
Term+0.929
 EPS+0.909
arsity+0.878
\/+0.876
Neg logit contributions
undo-1.029
tarian-0.922
 wre-0.822
away-0.814
alogy-0.803
Token
Token ID198
Feature activation+1.26e-01

Pos logit contributions
FontSize+0.667
Term+0.617
 EPS+0.603
arsity+0.585
\/+0.581
Neg logit contributions
undo-0.637
tarian-0.564
away-0.505
 wre-0.503
alogy-0.494
TokenAdvertisement
Token ID4723
Feature activation-7.66e+00

Pos logit contributions
FontSize+0.099
Term+0.091
 EPS+0.089
arsity+0.086
\/+0.086
Neg logit contributions
undo-0.103
tarian-0.092
away-0.082
 wre-0.082
alogy-0.081
Token Continue
Token ID10054
Feature activation+1.15e-01

Pos logit contributions
undo+5.971
tarian+4.916
away+4.637
alogy+4.593
 wre+4.547
Neg logit contributions
FontSize-6.158
Term-5.711
 EPS-5.527
\/-5.320
arsity-5.286
Token reading
Token ID3555
Feature activation-6.51e-01

Pos logit contributions
FontSize+0.115
Term+0.108
 EPS+0.106
\/+0.103
arsity+0.103
Neg logit contributions
undo-0.070
tarian-0.059
 wre-0.050
away-0.050
alogy-0.049
Token the
Token ID262
Feature activation+7.94e-01

Pos logit contributions
undo+0.496
tarian+0.440
 wre+0.395
away+0.393
alogy+0.386
Neg logit contributions
FontSize-0.540
Term-0.500
 EPS-0.485
arsity-0.473
\/-0.470
Token main
Token ID1388
Feature activation+1.41e+00

Pos logit contributions
FontSize+0.433
Term+0.381
 EPS+0.362
arsity+0.346
\/+0.342
Neg logit contributions
undo-0.843
tarian-0.773
 wre-0.716
away-0.715
alogy-0.703
Token story
Token ID1621
Feature activation+1.97e+00

Pos logit contributions
FontSize+0.976
Term+0.889
 EPS+0.858
arsity+0.828
\/+0.826
Neg logit contributions
undo-1.293
tarian-1.170
 wre-1.062
away-1.059
alogy-1.043
Token
Token ID198
Feature activation-9.14e-01

Pos logit contributions
undo+0.819
tarian+0.731
 wre+0.653
away+0.649
alogy+0.638
Neg logit contributions
FontSize-0.809
Term-0.752
 EPS-0.734
arsity-0.709
\/-0.708
Token
Token ID198
Feature activation-9.31e-01

Pos logit contributions
undo+0.678
tarian+0.593
away+0.540
 wre+0.526
alogy+0.522
Neg logit contributions
FontSize-0.783
Term-0.719
 EPS-0.707
arsity-0.690
\/-0.676
TokenSome
Token ID4366
Feature activation-2.75e+00

Pos logit contributions
undo+0.762
tarian+0.676
away+0.609
 wre+0.601
alogy+0.592
Neg logit contributions
FontSize-0.727
Term-0.663
 EPS-0.661
arsity-0.640
\/-0.632
Token interviews
Token ID9299
Feature activation+5.43e-02

Pos logit contributions
undo+2.312
tarian+2.072
 wre+1.899
away+1.873
 flung+1.854
Neg logit contributions
FontSize-2.098
Term-1.914
 EPS-1.847
arsity-1.806
\/-1.791
Token had
Token ID550
Feature activation-9.44e-01

Pos logit contributions
FontSize+0.043
Term+0.040
 EPS+0.039
arsity+0.037
\/+0.037
Neg logit contributions
undo-0.044
tarian-0.039
 wre-0.035
away-0.035
 flung-0.034
Token already
Token ID1541
Feature activation-7.64e+00

Pos logit contributions
undo+0.661
tarian+0.578
 wre+0.513
away+0.506
 flung+0.498
Neg logit contributions
FontSize-0.850
Term-0.793
 EPS-0.771
\/-0.755
arsity-0.755
Token been
Token ID587
Feature activation-3.58e+00

Pos logit contributions
undo+4.232
tarian+3.574
 wre+3.293
 que+3.229
 flung+3.198
Neg logit contributions
FontSize-7.823
Term-7.280
arsity-6.893
\/-6.846
 EPS-6.831
Token carried
Token ID5281
Feature activation+5.00e-01

Pos logit contributions
undo+2.729
tarian+2.400
 wre+2.200
away+2.174
 que+2.160
Neg logit contributions
FontSize-3.005
Term-2.764
 EPS-2.645
arsity-2.626
\/-2.570
Token out
Token ID503
Feature activation-2.47e-01

Pos logit contributions
FontSize+0.375
Term+0.345
 EPS+0.337
arsity+0.325
\/+0.325
Neg logit contributions
undo-0.424
tarian-0.381
 wre-0.342
away-0.340
alogy-0.336
Token under
Token ID739
Feature activation+2.17e+00

Pos logit contributions
undo+0.219
tarian+0.198
 wre+0.179
away+0.178
alogy+0.175
Neg logit contributions
FontSize-0.176
Term-0.162
 EPS-0.157
arsity-0.152
\/-0.151
Token the
Token ID262
Feature activation+1.17e+00

Pos logit contributions
FontSize+1.525
Term+1.399
 EPS+1.369
arsity+1.310
\/+1.308
Neg logit contributions
undo-1.950
tarian-1.760
 wre-1.587
away-1.580
alogy-1.566
Token dead
Token ID2636
Feature activation+6.85e-01

Pos logit contributions
FontSize+0.142
Term+0.132
 EPS+0.129
arsity+0.125
\/+0.125
Neg logit contributions
undo-0.136
tarian-0.121
 wre-0.108
away-0.107
alogy-0.105
Token son
Token ID3367
Feature activation+6.67e-01

Pos logit contributions
FontSize+0.510
Term+0.470
 EPS+0.460
arsity+0.444
\/+0.443
Neg logit contributions
undo-0.585
tarian-0.525
 wre-0.470
away-0.468
alogy-0.463
Token.
Token ID13
Feature activation-7.85e-01

Pos logit contributions
FontSize+0.454
Term+0.414
 EPS+0.404
\/+0.388
arsity+0.388
Neg logit contributions
undo-0.613
tarian-0.555
 wre-0.502
away-0.500
alogy-0.493
Token
Token ID198
Feature activation-6.23e-01

Pos logit contributions
undo+0.614
tarian+0.543
away+0.486
 wre+0.485
alogy+0.477
Neg logit contributions
FontSize-0.641
Term-0.593
 EPS-0.578
arsity-0.561
\/-0.560
Token
Token ID198
Feature activation-1.04e+00

Pos logit contributions
undo+0.468
tarian+0.407
away+0.371
 wre+0.363
alogy+0.359
Neg logit contributions
FontSize-0.530
Term-0.488
 EPS-0.478
arsity-0.468
\/-0.459
TokenAdvertisement
Token ID4723
Feature activation-7.62e+00

Pos logit contributions
undo+0.818
tarian+0.719
away+0.651
 wre+0.637
alogy+0.632
Neg logit contributions
FontSize-0.833
Term-0.766
 EPS-0.765
arsity-0.737
\/-0.730
Token
Token ID198
Feature activation-4.06e-01

Pos logit contributions
undo+5.282
tarian+4.449
away+4.143
alogy+4.021
 wre+4.017
Neg logit contributions
FontSize-6.664
Term-6.248
 EPS-6.047
arsity-5.923
\/-5.802
Token
Token ID198
Feature activation-5.97e-01

Pos logit contributions
undo+0.309
tarian+0.271
away+0.245
 wre+0.241
alogy+0.238
Neg logit contributions
FontSize-0.342
Term-0.315
 EPS-0.308
arsity-0.301
\/-0.296
TokenYes
Token ID5297
Feature activation-4.19e+00

Pos logit contributions
undo+0.474
tarian+0.417
away+0.377
 wre+0.370
alogy+0.367
Neg logit contributions
FontSize-0.479
Term-0.442
 EPS-0.438
arsity-0.423
\/-0.420
Token!
Token ID0
Feature activation-1.54e+00

Pos logit contributions
undo+3.572
tarian+3.205
 wre+2.907
away+2.883
alogy+2.832
Neg logit contributions
FontSize-3.108
Term-2.861
 EPS-2.784
arsity-2.694
\/-2.684
Token Just
Token ID2329
Feature activation+2.67e+00

Pos logit contributions
undo+1.200
tarian+1.058
away+0.948
 wre+0.947
alogy+0.930
Neg logit contributions
FontSize-1.273
Term-1.174
 EPS-1.139
arsity-1.111
\/-1.108
Token the
Token ID262
Feature activation-4.09e-01

Pos logit contributions
undo+1.979
tarian+1.758
 wre+1.594
away+1.577
 flung+1.546
Neg logit contributions
FontSize-2.005
Term-1.844
 EPS-1.802
arsity-1.769
\/-1.762
Token day
Token ID1110
Feature activation+2.19e-01

Pos logit contributions
undo+0.335
tarian+0.300
 wre+0.268
away+0.267
alogy+0.263
Neg logit contributions
FontSize-0.319
Term-0.295
 EPS-0.288
arsity-0.278
\/-0.278
Token.
Token ID13
Feature activation-6.49e-01

Pos logit contributions
FontSize+0.147
Term+0.134
 EPS+0.130
arsity+0.125
\/+0.125
Neg logit contributions
undo-0.204
tarian-0.185
 wre-0.168
away-0.168
alogy-0.165
Token
Token ID198
Feature activation-8.34e-01

Pos logit contributions
undo+0.506
tarian+0.448
away+0.400
 wre+0.399
alogy+0.393
Neg logit contributions
FontSize-0.534
Term-0.492
 EPS-0.479
arsity-0.467
\/-0.466
Token
Token ID198
Feature activation-1.12e+00

Pos logit contributions
undo+0.619
tarian+0.538
away+0.492
 wre+0.479
alogy+0.475
Neg logit contributions
FontSize-0.714
Term-0.657
 EPS-0.645
arsity-0.630
\/-0.615
TokenAdvertisement
Token ID4723
Feature activation-7.62e+00

Pos logit contributions
undo+0.865
tarian+0.758
away+0.685
 wre+0.664
alogy+0.663
Neg logit contributions
FontSize-0.922
 EPS-0.848
Term-0.839
arsity-0.820
\/-0.807
Token:
Token ID25
Feature activation-1.10e+00

Pos logit contributions
undo+5.361
tarian+4.538
away+4.214
 wre+4.106
alogy+4.094
Neg logit contributions
FontSize-6.568
Term-6.133
 EPS-5.896
arsity-5.770
\/-5.719
Token
Token ID198
Feature activation+6.67e-01

Pos logit contributions
undo+0.896
tarian+0.801
 wre+0.715
away+0.713
alogy+0.701
Neg logit contributions
FontSize-0.867
Term-0.803
 EPS-0.783
arsity-0.757
\/-0.757
Token
Token ID198
Feature activation+7.99e-02

Pos logit contributions
FontSize+0.550
Term+0.509
 EPS+0.497
arsity+0.483
\/+0.478
Neg logit contributions
undo-0.519
tarian-0.458
away-0.411
 wre-0.408
alogy-0.401
TokenIndeed
Token ID17854
Feature activation-3.23e+00

Pos logit contributions
FontSize+0.065
Term+0.060
 EPS+0.060
arsity+0.058
\/+0.057
Neg logit contributions
undo-0.063
tarian-0.055
away-0.050
 wre-0.049
alogy-0.048
Token,
Token ID11
Feature activation-2.20e+00

Pos logit contributions
undo+2.784
tarian+2.505
 wre+2.266
away+2.249
alogy+2.199
Neg logit contributions
FontSize-2.383
Term-2.174
 EPS-2.120
arsity-2.070
\/-2.051
Token while
Token ID981
Feature activation-2.76e-01

Pos logit contributions
FontSize+0.033
Term+0.031
 EPS+0.030
arsity+0.029
\/+0.029
Neg logit contributions
undo-0.039
tarian-0.035
 wre-0.031
away-0.031
alogy-0.031
Token if
Token ID611
Feature activation+7.92e-02

Pos logit contributions
undo+0.222
tarian+0.198
 wre+0.178
away+0.177
alogy+0.173
Neg logit contributions
FontSize-0.220
Term-0.203
 EPS-0.198
arsity-0.192
\/-0.191
Token you
Token ID345
Feature activation+1.91e-01

Pos logit contributions
FontSize+0.060
Term+0.056
 EPS+0.054
arsity+0.052
\/+0.052
Neg logit contributions
undo-0.066
tarian-0.059
 wre-0.053
away-0.053
alogy-0.052
Token have
Token ID423
Feature activation+3.47e-01

Pos logit contributions
FontSize+0.146
Term+0.135
 EPS+0.131
arsity+0.127
\/+0.126
Neg logit contributions
undo-0.159
tarian-0.142
 wre-0.128
away-0.127
alogy-0.125
Token a
Token ID257
Feature activation-9.55e-01

Pos logit contributions
FontSize+0.273
Term+0.253
 EPS+0.247
arsity+0.238
\/+0.238
Neg logit contributions
undo-0.282
tarian-0.252
 wre-0.225
away-0.224
alogy-0.221
Token physical
Token ID3518
Feature activation-7.61e+00

Pos logit contributions
undo+0.762
tarian+0.677
 wre+0.609
away+0.609
 flung+0.595
Neg logit contributions
FontSize-0.769
Term-0.709
 EPS-0.687
arsity-0.670
\/-0.666
Token manifestation
Token ID30566
Feature activation-1.27e+00

Pos logit contributions
undo+5.945
tarian+5.300
away+4.868
 wre+4.823
 que+4.716
Neg logit contributions
FontSize-6.091
Term-5.547
 EPS-5.181
arsity-5.171
\/-5.070
Token of
Token ID286
Feature activation-8.25e-01

Pos logit contributions
undo+1.111
tarian+0.999
 wre+0.903
away+0.900
alogy+0.886
Neg logit contributions
FontSize-0.930
Term-0.853
 EPS-0.830
arsity-0.801
\/-0.797
Token how
Token ID703
Feature activation-1.10e+00

Pos logit contributions
undo+0.685
tarian+0.613
 wre+0.550
away+0.548
alogy+0.539
Neg logit contributions
FontSize-0.636
Term-0.587
 EPS-0.572
arsity-0.553
\/-0.552
Token much
Token ID881
Feature activation+8.82e-01

Pos logit contributions
undo+0.823
tarian+0.727
 wre+0.644
away+0.641
alogy+0.627
Neg logit contributions
FontSize-0.944
Term-0.879
 EPS-0.857
arsity-0.833
\/-0.830
Token money
Token ID1637
Feature activation-2.08e+00

Pos logit contributions
FontSize+0.627
Term+0.578
 EPS+0.572
\/+0.548
arsity+0.544
Neg logit contributions
undo-0.777
tarian-0.701
 wre-0.629
alogy-0.622
away-0.622
Token only
Token ID691
Feature activation+2.69e+00

Pos logit contributions
FontSize+0.535
Term+0.492
 EPS+0.481
\/+0.464
arsity+0.462
Neg logit contributions
undo-0.657
tarian-0.591
away-0.531
 wre-0.531
alogy-0.524
Token rarely
Token ID8365
Feature activation+1.50e+00

Pos logit contributions
FontSize+1.895
Term+1.754
 EPS+1.723
\/+1.661
arsity+1.643
Neg logit contributions
undo-2.386
tarian-2.137
away-1.920
alogy-1.909
 wre-1.908
Token.
Token ID13
Feature activation+5.13e-01

Pos logit contributions
FontSize+0.953
Term+0.873
 EPS+0.847
\/+0.820
arsity+0.812
Neg logit contributions
undo-1.439
tarian-1.305
 wre-1.186
away-1.181
alogy-1.173
Token
Token ID198
Feature activation+1.81e-01

Pos logit contributions
FontSize+0.402
Term+0.374
 EPS+0.367
arsity+0.354
\/+0.353
Neg logit contributions
undo-0.419
tarian-0.376
 wre-0.335
away-0.331
 flung-0.328
Token
Token ID198
Feature activation-1.57e-01

Pos logit contributions
FontSize+0.151
Term+0.139
 EPS+0.136
arsity+0.132
\/+0.131
Neg logit contributions
undo-0.139
tarian-0.122
away-0.110
 wre-0.108
alogy-0.107
TokenAdvertisement
Token ID4723
Feature activation-7.61e+00

Pos logit contributions
undo+0.131
tarian+0.116
away+0.105
 wre+0.104
alogy+0.103
Neg logit contributions
FontSize-0.120
Term-0.110
 EPS-0.108
arsity-0.105
\/-0.104
Token Continue
Token ID10054
Feature activation-7.57e-02

Pos logit contributions
undo+5.874
tarian+4.793
away+4.561
alogy+4.504
 wre+4.421
Neg logit contributions
FontSize-6.239
Term-5.762
 EPS-5.547
\/-5.378
arsity-5.347
Token reading
Token ID3555
Feature activation-4.89e-01

Pos logit contributions
undo+0.045
tarian+0.039
 wre+0.033
away+0.033
alogy+0.032
Neg logit contributions
FontSize-0.076
Term-0.071
 EPS-0.070
arsity-0.068
\/-0.068
Token the
Token ID262
Feature activation+8.37e-01

Pos logit contributions
undo+0.374
tarian+0.331
 wre+0.297
away+0.296
alogy+0.291
Neg logit contributions
FontSize-0.404
Term-0.374
 EPS-0.363
arsity-0.355
\/-0.352
Token main
Token ID1388
Feature activation+1.18e+00

Pos logit contributions
FontSize+0.449
Term+0.395
 EPS+0.376
arsity+0.360
\/+0.357
Neg logit contributions
undo-0.895
tarian-0.821
 wre-0.760
away-0.759
alogy-0.747
Token story
Token ID1621
Feature activation+2.07e+00

Pos logit contributions
FontSize+0.813
Term+0.741
 EPS+0.714
arsity+0.689
\/+0.688
Neg logit contributions
undo-1.074
tarian-0.973
 wre-0.883
away-0.880
alogy-0.867
Token so
Token ID523
Feature activation-8.05e-01

Pos logit contributions
undo+0.526
tarian+0.465
 wre+0.420
away+0.414
 flung+0.408
Neg logit contributions
FontSize-0.580
Term-0.539
 EPS-0.524
arsity-0.511
\/-0.508
Token far
Token ID1290
Feature activation-2.36e+00

Pos logit contributions
undo+0.657
tarian+0.586
 wre+0.529
away+0.525
 flung+0.516
Neg logit contributions
FontSize-0.631
Term-0.585
 EPS-0.568
arsity-0.551
\/-0.548
Token mostly
Token ID4632
Feature activation-7.02e-01

Pos logit contributions
undo+1.748
tarian+1.547
 wre+1.400
away+1.371
 flung+1.363
Neg logit contributions
FontSize-2.030
Term-1.889
 EPS-1.831
arsity-1.784
\/-1.772
Token involved
Token ID2950
Feature activation+1.29e+00

Pos logit contributions
undo+0.507
tarian+0.447
 wre+0.399
away+0.393
 flung+0.388
Neg logit contributions
FontSize-0.616
Term-0.574
 EPS-0.559
arsity-0.544
\/-0.543
Token ethnic
Token ID9450
Feature activation-1.44e-02

Pos logit contributions
FontSize+1.008
Term+0.933
 EPS+0.911
arsity+0.879
\/+0.878
Neg logit contributions
undo-1.056
tarian-0.943
 wre-0.842
away-0.839
alogy-0.827
Token Korean
Token ID6983
Feature activation+5.04e+00

Pos logit contributions
undo+0.013
tarian+0.012
 wre+0.011
away+0.010
alogy+0.010
Neg logit contributions
FontSize-0.010
Term-0.009
 EPS-0.009
arsity-0.009
\/-0.009
Token citizens
Token ID4290
Feature activation-1.63e+00

Pos logit contributions
FontSize+3.568
Term+3.318
 EPS+3.281
\/+3.144
arsity+3.091
Neg logit contributions
undo-4.405
tarian-3.924
alogy-3.492
away-3.476
 wre-3.469
Token of
Token ID286
Feature activation+1.49e+00

Pos logit contributions
undo+1.390
tarian+1.253
 wre+1.145
away+1.133
 flung+1.119
Neg logit contributions
FontSize-1.217
Term-1.124
 EPS-1.086
arsity-1.055
\/-1.042
Token South
Token ID2520
Feature activation-1.23e+00

Pos logit contributions
FontSize+1.108
Term+1.020
 EPS+0.995
arsity+0.958
\/+0.958
Neg logit contributions
undo-1.278
tarian-1.148
 wre-1.032
away-1.027
alogy-1.013
Token Korea
Token ID4969
Feature activation+1.74e+00

Pos logit contributions
undo+1.023
tarian+0.914
away+0.826
 wre+0.826
alogy+0.808
Neg logit contributions
FontSize-0.956
Term-0.876
 EPS-0.851
arsity-0.824
\/-0.817
Token,
Token ID11
Feature activation+1.07e-01

Pos logit contributions
FontSize+1.224
Term+1.124
 EPS+1.097
\/+1.063
arsity+1.052
Neg logit contributions
undo-1.546
tarian-1.394
 wre-1.248
away-1.247
alogy-1.240
Token large
Token ID1588
Feature activation-7.80e-01

Pos logit contributions
undo+1.320
tarian+1.165
 wre+1.047
away+1.033
 flung+1.020
Neg logit contributions
FontSize-1.477
Term-1.379
 EPS-1.342
arsity-1.303
\/-1.300
Token crowd
Token ID4315
Feature activation+2.54e+00

Pos logit contributions
undo+0.615
tarian+0.546
 wre+0.489
away+0.485
 flung+0.476
Neg logit contributions
FontSize-0.636
Term-0.592
 EPS-0.573
arsity-0.557
\/-0.556
Token of
Token ID286
Feature activation+1.01e+00

Pos logit contributions
FontSize+1.789
Term+1.631
 EPS+1.601
\/+1.553
arsity+1.538
Neg logit contributions
undo-2.256
tarian-2.047
alogy-1.820
away-1.804
 wre-1.804
Token pro
Token ID386
Feature activation-4.74e-01

Pos logit contributions
FontSize+0.754
Term+0.694
 EPS+0.680
\/+0.653
arsity+0.653
Neg logit contributions
undo-0.867
tarian-0.782
 wre-0.697
away-0.696
alogy-0.690
Token-
Token ID12
Feature activation-1.11e+00

Pos logit contributions
undo+0.462
tarian+0.419
away+0.385
 wre+0.384
alogy+0.376
Neg logit contributions
FontSize-0.299
Term-0.270
 EPS-0.262
arsity-0.250
\/-0.247
Tokengovernment
Token ID14480
Feature activation+4.04e+00

Pos logit contributions
undo+1.031
tarian+0.931
away+0.849
 wre+0.845
alogy+0.829
Neg logit contributions
FontSize-0.754
Term-0.685
 EPS-0.671
arsity-0.644
\/-0.638
Token mar
Token ID1667
Feature activation-1.11e+00

Pos logit contributions
FontSize+2.962
Term+2.724
 EPS+2.711
\/+2.603
arsity+2.578
Neg logit contributions
undo-3.405
tarian-3.073
alogy-2.723
away-2.712
 wre-2.683
Tokenchers
Token ID3533
Feature activation-2.48e-01

Pos logit contributions
undo+0.835
tarian+0.738
away+0.652
 wre+0.650
alogy+0.633
Neg logit contributions
FontSize-0.952
Term-0.882
 EPS-0.861
\/-0.826
arsity-0.822
Token chanting
Token ID31138
Feature activation+2.55e+00

Pos logit contributions
undo+0.223
tarian+0.201
 wre+0.182
away+0.181
alogy+0.179
Neg logit contributions
FontSize-0.174
Term-0.160
 EPS-0.155
arsity-0.150
\/-0.149
Token slogans
Token ID31139
Feature activation+4.55e+00

Pos logit contributions
FontSize+1.801
Term+1.663
 EPS+1.619
\/+1.564
arsity+1.553
Neg logit contributions
undo-2.243
tarian-2.040
 wre-1.815
away-1.809
alogy-1.805
Token against
Token ID1028
Feature activation+1.44e+00

Pos logit contributions
FontSize+3.293
Term+3.042
 EPS+2.940
\/+2.872
arsity+2.845
Neg logit contributions
undo-3.868
tarian-3.536
away-3.118
alogy-3.090
 wre-3.086
Token the
Token ID262
Feature activation+1.52e-02

Pos logit contributions
FontSize+1.098
Term+1.017
 EPS+0.995
arsity+0.956
\/+0.955
Neg logit contributions
undo-1.213
tarian-1.088
 wre-0.975
away-0.971
alogy-0.958
Token NDP
Token ID20392
Feature activation+1.12e+00

Pos logit contributions
FontSize+0.011
Term+0.010
 EPS+0.010
arsity+0.010
\/+0.010
Neg logit contributions
undo-0.013
tarian-0.012
 wre-0.011
away-0.011
alogy-0.010
Token
Token ID447
Feature activation-5.46e-01

Pos logit contributions
FontSize+0.853
Term+0.788
 EPS+0.768
arsity+0.742
\/+0.741
Neg logit contributions
undo-0.934
tarian-0.837
 wre-0.751
away-0.748
alogy-0.737
Token
Token ID247
Feature activation-2.47e-01

Pos logit contributions
undo+0.500
tarian+0.451
 wre+0.408
away+0.407
alogy+0.403
Neg logit contributions
FontSize-0.374
Term-0.342
 EPS-0.333
arsity-0.320
\/-0.319
Tokens
Token ID82
Feature activation-2.94e-01

Pos logit contributions
undo+0.225
tarian+0.203
 wre+0.184
away+0.183
alogy+0.181
Neg logit contributions
FontSize-0.171
Term-0.156
 EPS-0.152
arsity-0.146
\/-0.146
Token deputy
Token ID10636
Feature activation+5.53e+00

Pos logit contributions
undo+0.248
tarian+0.223
 wre+0.201
away+0.200
 flung+0.196
Neg logit contributions
FontSize-0.223
Term-0.204
 EPS-0.197
arsity-0.192
\/-0.192
Token critic
Token ID4014
Feature activation-4.68e-01

Pos logit contributions
FontSize+3.116
Term+2.860
 EPS+2.822
arsity+2.623
\/+2.619
Neg logit contributions
undo-5.606
tarian-5.070
alogy-4.636
 wre-4.632
away-4.587
Token for
Token ID329
Feature activation+1.27e+00

Pos logit contributions
undo+0.409
tarian+0.369
 wre+0.334
away+0.332
alogy+0.328
Neg logit contributions
FontSize-0.340
Term-0.312
 EPS-0.302
\/-0.291
arsity-0.291
Token democratic
Token ID10518
Feature activation+1.47e+00

Pos logit contributions
FontSize+0.976
Term+0.902
 EPS+0.881
arsity+0.850
\/+0.849
Neg logit contributions
undo-1.054
tarian-0.944
 wre-0.845
away-0.842
alogy-0.829
Token reform
Token ID4975
Feature activation-2.71e+00

Pos logit contributions
FontSize+1.315
Term+1.231
 EPS+1.210
arsity+1.171
\/+1.169
Neg logit contributions
undo-1.026
tarian-0.897
 wre-0.779
away-0.777
alogy-0.763
Token,
Token ID11
Feature activation-2.90e-01

Pos logit contributions
undo+2.482
tarian+2.249
away+2.040
 wre+2.037
alogy+1.988
Neg logit contributions
FontSize-1.866
Term-1.683
 EPS-1.627
arsity-1.564
\/-1.561
Token potential
Token ID2785
Feature activation+5.93e-01

Pos logit contributions
FontSize+0.631
Term+0.584
 EPS+0.573
\/+0.552
arsity+0.549
Neg logit contributions
undo-0.670
tarian-0.598
 wre-0.533
away-0.533
alogy-0.524
Token side
Token ID1735
Feature activation+9.31e-01

Pos logit contributions
FontSize+0.491
Term+0.457
 EPS+0.450
\/+0.434
arsity+0.432
Neg logit contributions
undo-0.458
tarian-0.406
away-0.357
 wre-0.357
alogy-0.351
Token effects
Token ID3048
Feature activation-1.38e+00

Pos logit contributions
FontSize+0.858
Term+0.806
 EPS+0.789
arsity+0.768
\/+0.766
Neg logit contributions
undo-0.630
tarian-0.549
 wre-0.475
away-0.474
alogy-0.465
Token.
Token ID13
Feature activation+1.57e+00

Pos logit contributions
undo+1.271
tarian+1.141
 wre+1.052
away+1.040
 flung+1.021
Neg logit contributions
FontSize-0.956
Term-0.861
 EPS-0.831
arsity-0.805
\/-0.798
Token The
Token ID383
Feature activation+9.06e-01

Pos logit contributions
FontSize+1.222
Term+1.135
 EPS+1.109
arsity+1.069
\/+1.068
Neg logit contributions
undo-1.286
tarian-1.150
 wre-1.028
away-1.021
alogy-1.007
Token aim
Token ID4031
Feature activation+5.67e+00

Pos logit contributions
FontSize+0.697
Term+0.643
 EPS+0.627
arsity+0.606
\/+0.605
Neg logit contributions
undo-0.754
tarian-0.675
 wre-0.606
away-0.603
alogy-0.594
Token of
Token ID286
Feature activation+3.56e+00

Pos logit contributions
FontSize+3.373
Term+3.110
 EPS+3.088
arsity+2.911
\/+2.852
Neg logit contributions
undo-5.411
tarian-5.016
 wre-4.490
alogy-4.482
away-4.452
Token this
Token ID428
Feature activation+1.82e+00

Pos logit contributions
FontSize+2.413
Term+2.208
 EPS+2.202
\/+2.067
arsity+2.058
Neg logit contributions
undo-3.244
tarian-2.955
 wre-2.645
alogy-2.644
away-2.643
Token study
Token ID2050
Feature activation-1.70e+00

Pos logit contributions
FontSize+1.423
Term+1.323
 EPS+1.301
\/+1.246
arsity+1.245
Neg logit contributions
undo-1.486
tarian-1.328
 wre-1.180
away-1.177
alogy-1.161
Token is
Token ID318
Feature activation+2.23e+00

Pos logit contributions
undo+1.352
tarian+1.202
 wre+1.079
away+1.074
alogy+1.054
Neg logit contributions
FontSize-1.376
Term-1.272
 EPS-1.236
arsity-1.199
\/-1.198
Token to
Token ID284
Feature activation+2.06e+00

Pos logit contributions
FontSize+1.751
Term+1.630
 EPS+1.604
\/+1.534
arsity+1.530
Neg logit contributions
undo-1.793
tarian-1.611
 wre-1.426
away-1.422
alogy-1.407
Token exploded
Token ID18750
Feature activation+2.32e+00

Pos logit contributions
undo+0.610
tarian+0.544
 wre+0.490
away+0.486
 flung+0.478
Neg logit contributions
FontSize-0.612
Term-0.565
 EPS-0.551
arsity-0.536
\/-0.531
Token,
Token ID11
Feature activation+6.31e-01

Pos logit contributions
FontSize+1.612
Term+1.495
 EPS+1.469
\/+1.433
arsity+1.385
Neg logit contributions
undo-2.069
tarian-1.866
alogy-1.664
away-1.654
 wre-1.636
Token
Token ID447
Feature activation-1.76e-01

Pos logit contributions
FontSize+0.502
Term+0.468
 EPS+0.459
\/+0.446
arsity+0.439
Neg logit contributions
undo-0.503
tarian-0.450
away-0.395
alogy-0.394
 wre-0.392
Token
Token ID251
Feature activation+6.47e-01

Pos logit contributions
undo+0.169
tarian+0.154
 wre+0.140
away+0.140
alogy+0.138
Neg logit contributions
FontSize-0.113
Term-0.102
 EPS-0.099
arsity-0.095
\/-0.095
Token said
Token ID531
Feature activation+3.64e-01

Pos logit contributions
FontSize+0.531
Term+0.497
 EPS+0.488
\/+0.471
arsity+0.467
Neg logit contributions
undo-0.497
tarian-0.443
alogy-0.388
away-0.388
 wre-0.386
Token Malcolm
Token ID20002
Feature activation+4.13e+00

Pos logit contributions
FontSize+0.274
Term+0.254
 EPS+0.249
\/+0.239
arsity+0.239
Neg logit contributions
undo-0.307
tarian-0.275
 wre-0.247
away-0.246
alogy-0.243
Token Cr
Token ID3864
Feature activation+1.21e+00

Pos logit contributions
FontSize+2.961
Term+2.768
 EPS+2.722
\/+2.620
arsity+2.611
Neg logit contributions
undo-3.525
tarian-3.222
 wre-2.875
 flung-2.825
away-2.819
Tokenick
Token ID624
Feature activation-1.09e+00

Pos logit contributions
FontSize+0.937
Term+0.870
 EPS+0.847
arsity+0.822
\/+0.820
Neg logit contributions
undo-0.988
tarian-0.884
 wre-0.794
away-0.786
alogy-0.776
Token,
Token ID11
Feature activation+1.42e+00

Pos logit contributions
undo+0.952
tarian+0.854
away+0.774
 wre+0.770
alogy+0.756
Neg logit contributions
FontSize-0.803
Term-0.735
 EPS-0.713
arsity-0.689
\/-0.684
Token Secretary
Token ID4986
Feature activation+7.23e-02

Pos logit contributions
FontSize+1.065
Term+0.991
 EPS+0.972
\/+0.935
arsity+0.931
Neg logit contributions
undo-1.189
tarian-1.065
 wre-0.951
away-0.950
alogy-0.939
Token of
Token ID286
Feature activation+1.07e+00

Pos logit contributions
FontSize+0.039
Term+0.035
 EPS+0.034
arsity+0.032
\/+0.032
Neg logit contributions
undo-0.076
tarian-0.070
 wre-0.064
away-0.064
alogy-0.064
TokenMeanwhile
Token ID10294
Feature activation+2.68e+00

Pos logit contributions
undo+0.317
tarian+0.281
away+0.253
 wre+0.251
alogy+0.247
Neg logit contributions
FontSize-0.305
Term-0.280
 EPS-0.277
arsity-0.268
\/-0.266
Token,
Token ID11
Feature activation-1.94e-01

Pos logit contributions
FontSize+1.784
Term+1.642
 EPS+1.602
\/+1.536
arsity+1.522
Neg logit contributions
undo-2.453
tarian-2.231
 wre-2.001
away-2.001
alogy-1.997
Token in
Token ID287
Feature activation-4.89e-01

Pos logit contributions
undo+0.166
tarian+0.149
 wre+0.135
away+0.134
 flung+0.132
Neg logit contributions
FontSize-0.144
Term-0.133
 EPS-0.129
arsity-0.125
\/-0.125
Token more
Token ID517
Feature activation+1.37e+00

Pos logit contributions
undo+0.394
tarian+0.351
 wre+0.315
away+0.314
 flung+0.307
Neg logit contributions
FontSize-0.390
Term-0.360
 EPS-0.351
arsity-0.341
\/-0.339
Token evidence
Token ID2370
Feature activation-2.52e+00

Pos logit contributions
FontSize+1.099
Term+1.018
 EPS+0.996
\/+0.963
arsity+0.958
Neg logit contributions
undo-1.097
tarian-0.979
 wre-0.868
away-0.865
alogy-0.853
Token that
Token ID326
Feature activation-2.20e-01

Pos logit contributions
undo+2.146
tarian+1.926
 wre+1.763
away+1.745
 flung+1.719
Neg logit contributions
FontSize-1.871
Term-1.732
 EPS-1.681
arsity-1.638
\/-1.597
Token Comey
Token ID12388
Feature activation-3.42e-01

Pos logit contributions
undo+0.191
tarian+0.172
 wre+0.155
away+0.154
alogy+0.152
Neg logit contributions
FontSize-0.161
Term-0.148
 EPS-0.144
arsity-0.138
\/-0.138
Token was
Token ID373
Feature activation+1.40e+00

Pos logit contributions
undo+0.275
tarian+0.245
 wre+0.220
away+0.218
alogy+0.215
Neg logit contributions
FontSize-0.272
Term-0.252
 EPS-0.246
arsity-0.238
\/-0.237
Token canned
Token ID32530
Feature activation+3.04e+00

Pos logit contributions
FontSize+1.158
Term+1.075
 EPS+1.053
\/+1.021
arsity+1.015
Neg logit contributions
undo-1.079
tarian-0.966
away-0.842
alogy-0.839
 wre-0.837
Token in
Token ID287
Feature activation+7.20e-01

Pos logit contributions
FontSize+2.123
Term+1.952
 EPS+1.901
\/+1.861
arsity+1.825
Neg logit contributions
undo-2.705
tarian-2.436
away-2.178
 wre-2.175
alogy-2.169
Token haste
Token ID36797
Feature activation+8.50e-01

Pos logit contributions
FontSize+0.561
Term+0.518
 EPS+0.505
arsity+0.489
\/+0.488
Neg logit contributions
undo-0.592
tarian-0.529
 wre-0.474
away-0.472
alogy-0.464
Token for
Token ID329
Feature activation-4.99e-03

Pos logit contributions
FontSize+0.933
Term+0.863
 EPS+0.838
\/+0.817
arsity+0.810
Neg logit contributions
undo-1.114
tarian-1.002
 wre-0.906
away-0.896
alogy-0.890
Token All
Token ID1439
Feature activation-2.64e-02

Pos logit contributions
undo+0.004
tarian+0.003
 wre+0.003
away+0.003
alogy+0.003
Neg logit contributions
FontSize-0.004
Term-0.004
 EPS-0.004
arsity-0.004
\/-0.004
Token Act
Token ID2191
Feature activation-3.52e+00

Pos logit contributions
undo+0.024
tarian+0.022
 wre+0.020
away+0.020
alogy+0.019
Neg logit contributions
FontSize-0.018
Term-0.017
 EPS-0.016
arsity-0.016
\/-0.016
Token
Token ID447
Feature activation-1.32e+00

Pos logit contributions
undo+3.175
tarian+2.824
 wre+2.581
away+2.571
alogy+2.525
Neg logit contributions
FontSize-2.465
Term-2.246
 EPS-2.175
arsity-2.102
\/-2.064
Token
Token ID251
Feature activation-3.76e-01

Pos logit contributions
undo+1.220
tarian+1.105
 wre+1.003
away+1.000
alogy+0.986
Neg logit contributions
FontSize-0.885
Term-0.808
 EPS-0.785
arsity-0.754
\/-0.753
Token because
Token ID780
Feature activation-1.34e-01

Pos logit contributions
undo+0.323
tarian+0.290
 wre+0.261
away+0.260
alogy+0.256
Neg logit contributions
FontSize-0.279
Term-0.257
 EPS-0.250
arsity-0.241
\/-0.241
Token it
Token ID340
Feature activation-5.05e-02

Pos logit contributions
undo+0.113
tarian+0.101
 wre+0.091
away+0.091
alogy+0.089
Neg logit contributions
FontSize-0.102
Term-0.094
 EPS-0.091
arsity-0.088
\/-0.088
Token will
Token ID481
Feature activation-5.98e-01

Pos logit contributions
undo+0.041
tarian+0.037
away+0.033
 wre+0.033
alogy+0.032
Neg logit contributions
FontSize-0.040
Term-0.037
 EPS-0.036
\/-0.035
arsity-0.035
Token end
Token ID886
Feature activation-3.61e+00

Pos logit contributions
undo+0.480
tarian+0.427
 wre+0.383
away+0.380
alogy+0.374
Neg logit contributions
FontSize-0.477
Term-0.442
 EPS-0.430
arsity-0.417
\/-0.416
Token
Token ID564
Feature activation-6.87e-01

Pos logit contributions
undo+2.997
tarian+2.673
 wre+2.460
away+2.415
 flung+2.364
Neg logit contributions
FontSize-2.733
Term-2.520
 EPS-2.436
arsity-2.363
\/-2.346
Token
Token ID250
Feature activation-5.08e-01

Pos logit contributions
undo+0.631
tarian+0.569
 wre+0.517
away+0.515
alogy+0.508
Neg logit contributions
FontSize-0.470
Term-0.429
 EPS-0.418
arsity-0.401
\/-0.400
Token.
Token ID13
Feature activation-4.65e-01

Pos logit contributions
undo+1.271
tarian+1.149
 wre+1.050
away+1.046
 flung+1.022
Neg logit contributions
FontSize-1.018
Term-0.931
 EPS-0.895
arsity-0.868
\/-0.858
Token It
Token ID632
Feature activation-4.92e-01

Pos logit contributions
undo+0.369
tarian+0.327
 wre+0.293
away+0.293
alogy+0.287
Neg logit contributions
FontSize-0.376
Term-0.347
 EPS-0.337
arsity-0.328
\/-0.327
Token's
Token ID338
Feature activation-5.72e-01

Pos logit contributions
undo+0.393
tarian+0.350
 wre+0.312
away+0.311
alogy+0.305
Neg logit contributions
FontSize-0.395
Term-0.366
 EPS-0.357
arsity-0.346
\/-0.345
Token quite
Token ID2407
Feature activation-1.93e+00

Pos logit contributions
undo+0.448
tarian+0.398
 wre+0.354
away+0.353
alogy+0.346
Neg logit contributions
FontSize-0.469
Term-0.435
 EPS-0.424
arsity-0.411
\/-0.411
Token a
Token ID257
Feature activation-7.08e-01

Pos logit contributions
undo+1.514
tarian+1.344
 wre+1.216
away+1.202
 flung+1.183
Neg logit contributions
FontSize-1.571
Term-1.459
 EPS-1.418
arsity-1.385
\/-1.377
Token small
Token ID1402
Feature activation-2.36e-01

Pos logit contributions
undo+0.552
tarian+0.489
 wre+0.438
away+0.436
 flung+0.427
Neg logit contributions
FontSize-0.584
Term-0.541
 EPS-0.526
arsity-0.512
\/-0.511
Token one
Token ID530
Feature activation+3.13e-02

Pos logit contributions
undo+0.207
tarian+0.186
 wre+0.168
away+0.167
alogy+0.165
Neg logit contributions
FontSize-0.171
Term-0.157
 EPS-0.153
arsity-0.148
\/-0.147
Token
Token ID784
Feature activation-1.59e+00

Pos logit contributions
FontSize+0.022
Term+0.020
 EPS+0.019
arsity+0.019
\/+0.018
Neg logit contributions
undo-0.029
tarian-0.026
 wre-0.023
away-0.023
 flung-0.023
Token a
Token ID257
Feature activation-1.29e-01

Pos logit contributions
undo+1.324
tarian+1.185
 wre+1.077
away+1.073
 flung+1.050
Neg logit contributions
FontSize-1.221
Term-1.125
 EPS-1.084
arsity-1.060
\/-1.051
Token baby
Token ID5156
Feature activation+2.02e+00

Pos logit contributions
undo+0.101
tarian+0.090
 wre+0.081
away+0.080
 flung+0.078
Neg logit contributions
FontSize-0.105
Term-0.098
 EPS-0.095
arsity-0.093
\/-0.092
Token oct
Token ID19318
Feature activation+2.11e+00

Pos logit contributions
FontSize+1.400
Term+1.281
 EPS+1.267
\/+1.205
arsity+1.202
Neg logit contributions
undo-1.829
tarian-1.644
away-1.482
 wre-1.481
alogy-1.470
Token,
Token ID11
Feature activation+2.60e-01

Pos logit contributions
FontSize+0.840
Term+0.771
 EPS+0.752
\/+0.725
arsity+0.723
Neg logit contributions
undo-1.054
tarian-0.952
 wre-0.856
away-0.853
alogy-0.845
Token P
Token ID350
Feature activation+1.21e+00

Pos logit contributions
FontSize+0.189
Term+0.174
 EPS+0.171
arsity+0.163
\/+0.163
Neg logit contributions
undo-0.228
tarian-0.205
 wre-0.184
away-0.184
alogy-0.182
Tokenuro
Token ID1434
Feature activation+1.38e+00

Pos logit contributions
FontSize+0.919
Term+0.844
 EPS+0.822
arsity+0.794
\/+0.793
Neg logit contributions
undo-1.019
tarian-0.911
away-0.818
 wre-0.817
alogy-0.803
Tokenhit
Token ID17945
Feature activation+6.67e-01

Pos logit contributions
FontSize+1.186
Term+1.108
 EPS+1.077
arsity+1.049
\/+1.049
Neg logit contributions
undo-1.019
tarian-0.901
 wre-0.801
away-0.785
 flung-0.779
Token requested
Token ID9167
Feature activation-7.49e-01

Pos logit contributions
FontSize+0.542
Term+0.503
 EPS+0.494
\/+0.477
arsity+0.477
Neg logit contributions
undo-0.525
tarian-0.467
 wre-0.413
away-0.411
alogy-0.406
Token the
Token ID262
Feature activation-1.28e-01

Pos logit contributions
undo+0.632
tarian+0.567
 wre+0.512
away+0.510
alogy+0.501
Neg logit contributions
FontSize-0.567
Term-0.522
 EPS-0.505
arsity-0.491
\/-0.488
Token Prime
Token ID5537
Feature activation+1.70e+00

Pos logit contributions
undo+0.116
tarian+0.105
 wre+0.096
away+0.095
alogy+0.094
Neg logit contributions
FontSize-0.089
Term-0.081
 EPS-0.078
arsity-0.076
\/-0.076
Token Minister
Token ID4139
Feature activation-4.33e-02

Pos logit contributions
FontSize+1.053
Term+0.953
 EPS+0.922
arsity+0.883
\/+0.881
Neg logit contributions
undo-1.670
tarian-1.522
 wre-1.392
away-1.387
alogy-1.369
Token to
Token ID284
Feature activation+7.39e-02

Pos logit contributions
undo+0.034
tarian+0.030
 wre+0.026
away+0.026
alogy+0.026
Neg logit contributions
FontSize-0.036
Term-0.033
 EPS-0.032
arsity-0.031
\/-0.031
Token write
Token ID3551
Feature activation-1.25e+00

Pos logit contributions
FontSize+0.057
Term+0.053
 EPS+0.051
arsity+0.050
\/+0.050
Neg logit contributions
undo-0.061
tarian-0.055
 wre-0.049
away-0.049
alogy-0.048
Token in
Token ID287
Feature activation-1.79e+00

Pos logit contributions
undo+0.960
tarian+0.850
away+0.762
 wre+0.759
 flung+0.742
Neg logit contributions
FontSize-1.035
Term-0.961
 EPS-0.931
arsity-0.908
\/-0.903
Token Pelosi
Token ID29611
Feature activation+9.78e-01

Pos logit contributions
FontSize+0.864
Term+0.793
 EPS+0.773
\/+0.745
arsity+0.742
Neg logit contributions
undo-1.100
tarian-0.996
 wre-0.898
away-0.896
alogy-0.882
Token will
Token ID481
Feature activation-2.16e-01

Pos logit contributions
FontSize+0.715
Term+0.659
 EPS+0.642
\/+0.619
arsity+0.618
Neg logit contributions
undo-0.849
tarian-0.765
 wre-0.686
away-0.684
alogy-0.676
Token be
Token ID307
Feature activation-1.19e+00

Pos logit contributions
undo+0.173
tarian+0.154
 wre+0.139
away+0.138
alogy+0.135
Neg logit contributions
FontSize-0.173
Term-0.160
 EPS-0.156
arsity-0.151
\/-0.150
Token the
Token ID262
Feature activation+9.23e-01

Pos logit contributions
undo+0.947
tarian+0.836
 wre+0.767
away+0.758
 flung+0.748
Neg logit contributions
FontSize-0.954
Term-0.876
 EPS-0.854
arsity-0.838
\/-0.828
Token president
Token ID1893
Feature activation-3.32e-01

Pos logit contributions
FontSize+0.647
Term+0.589
 EPS+0.572
arsity+0.554
\/+0.552
Neg logit contributions
undo-0.833
tarian-0.752
 wre-0.685
away-0.680
 flung-0.669
Token.
Token ID13
Feature activation+3.39e-01

Pos logit contributions
undo+0.312
tarian+0.282
 wre+0.258
away+0.257
alogy+0.252
Neg logit contributions
FontSize-0.220
Term-0.201
 EPS-0.195
arsity-0.187
\/-0.186
Token
Token ID447
Feature activation+9.58e-01

Pos logit contributions
FontSize+0.231
Term+0.212
 EPS+0.205
arsity+0.197
\/+0.197
Neg logit contributions
undo-0.312
tarian-0.282
 wre-0.256
away-0.255
alogy-0.252
Token
Token ID251
Feature activation+1.32e+00

Pos logit contributions
FontSize+0.608
Term+0.552
 EPS+0.533
\/+0.512
arsity+0.512
Neg logit contributions
undo-0.924
tarian-0.842
 wre-0.770
away-0.766
 flung-0.755
TokenMoreover
Token ID24606
Feature activation+1.07e+00

Pos logit contributions
FontSize+0.986
Term+0.912
 EPS+0.888
\/+0.857
arsity+0.854
Neg logit contributions
undo-1.126
tarian-1.013
 wre-0.914
away-0.904
alogy-0.894
Token,
Token ID11
Feature activation-5.82e-01

Pos logit contributions
FontSize+0.744
Term+0.681
 EPS+0.663
\/+0.638
arsity+0.635
Neg logit contributions
undo-0.972
tarian-0.877
 wre-0.795
away-0.792
alogy-0.784
Token Letter
Token ID18121
Feature activation+1.33e+00

Pos logit contributions
undo+0.491
tarian+0.440
 wre+0.396
away+0.394
alogy+0.387
Neg logit contributions
FontSize-0.440
Term-0.406
 EPS-0.394
arsity-0.382
\/-0.381
Token
Token ID198
Feature activation+2.55e+00

Pos logit contributions
FontSize+2.821
Term+2.645
 EPS+2.579
\/+2.508
arsity+2.467
Neg logit contributions
undo-3.177
tarian-2.866
 wre-2.582
 flung-2.531
away-2.499
Token
Token ID198
Feature activation+2.00e+00

Pos logit contributions
FontSize+1.991
Term+1.851
 EPS+1.803
\/+1.749
arsity+1.737
Neg logit contributions
undo-2.083
tarian-1.869
 wre-1.670
away-1.645
 flung-1.637
TokenWE
Token ID8845
Feature activation-1.45e+00

Pos logit contributions
FontSize+1.513
Term+1.402
 EPS+1.354
\/+1.311
arsity+1.308
Neg logit contributions
undo-1.688
tarian-1.516
 wre-1.365
away-1.350
 flung-1.335
TokenP
Token ID47
Feature activation-5.70e-01

Pos logit contributions
undo+1.169
tarian+1.045
away+0.940
 wre+0.932
alogy+0.922
Neg logit contributions
FontSize-1.129
Term-1.032
 EPS-1.005
arsity-0.990
\/-0.969
Token was
Token ID373
Feature activation+1.60e+00

Pos logit contributions
undo+0.462
tarian+0.413
 wre+0.371
away+0.368
alogy+0.362
Neg logit contributions
FontSize-0.449
Term-0.414
 EPS-0.403
arsity-0.391
\/-0.391
Token ratified
Token ID39067
Feature activation-1.87e+00

Pos logit contributions
FontSize+1.117
Term+1.031
 EPS+1.006
\/+0.960
arsity+0.959
Neg logit contributions
undo-1.445
tarian-1.304
away-1.171
 wre-1.169
alogy-1.160
Token as
Token ID355
Feature activation-5.45e-01

Pos logit contributions
undo+1.658
tarian+1.500
 wre+1.359
away+1.352
alogy+1.325
Neg logit contributions
FontSize-1.334
Term-1.217
 EPS-1.174
arsity-1.143
\/-1.133
Token a
Token ID257
Feature activation+3.70e-01

Pos logit contributions
undo+0.485
tarian+0.439
 wre+0.399
away+0.395
 flung+0.389
Neg logit contributions
FontSize-0.385
Term-0.352
 EPS-0.340
arsity-0.330
\/-0.329
Token Wi
Token ID11759
Feature activation-2.90e+00

Pos logit contributions
FontSize+0.276
Term+0.253
 EPS+0.244
arsity+0.238
\/+0.237
Neg logit contributions
undo-0.317
tarian-0.286
 wre-0.258
away-0.256
 flung-0.252
Token-
Token ID12
Feature activation-3.00e-01

Pos logit contributions
undo+2.688
tarian+2.441
away+2.248
 wre+2.202
alogy+2.186
Neg logit contributions
FontSize-1.925
 EPS-1.690
Term-1.657
arsity-1.631
\/-1.592
TokenFi
Token ID10547
Feature activation-3.36e+00

Pos logit contributions
undo+0.219
tarian+0.193
away+0.170
 wre+0.169
alogy+0.166
Neg logit contributions
FontSize-0.261
Term-0.241
 EPS-0.238
arsity-0.231
\/-0.230
Token Car
Token ID1879
Feature activation-5.69e-01

Pos logit contributions
undo+0.487
tarian+0.435
 wre+0.391
away+0.389
 flung+0.381
Neg logit contributions
FontSize-0.472
Term-0.436
 EPS-0.425
arsity-0.412
\/-0.410
Tokenros
Token ID4951
Feature activation+1.31e+00

Pos logit contributions
undo+0.467
tarian+0.416
away+0.373
 wre+0.373
alogy+0.366
Neg logit contributions
FontSize-0.449
Term-0.412
 EPS-0.402
arsity-0.388
\/-0.387
Token,
Token ID11
Feature activation-2.15e-01

Pos logit contributions
FontSize+0.984
Term+0.917
 EPS+0.890
\/+0.861
arsity+0.856
Neg logit contributions
undo-1.105
tarian-0.998
 wre-0.886
away-0.885
alogy-0.879
Token north
Token ID5093
Feature activation-2.23e+00

Pos logit contributions
undo+0.175
tarian+0.157
 wre+0.140
away+0.139
alogy+0.137
Neg logit contributions
FontSize-0.168
Term-0.156
 EPS-0.152
arsity-0.147
\/-0.147
Token of
Token ID286
Feature activation-1.77e+00

Pos logit contributions
undo+2.026
tarian+1.838
 wre+1.682
away+1.682
 flung+1.650
Neg logit contributions
FontSize-1.536
Term-1.396
 EPS-1.352
arsity-1.310
\/-1.293
Token Nice
Token ID18460
Feature activation-2.47e+00

Pos logit contributions
undo+1.444
tarian+1.288
 wre+1.164
away+1.155
 flung+1.133
Neg logit contributions
FontSize-1.388
Term-1.279
 EPS-1.243
arsity-1.211
\/-1.202
Token on
Token ID319
Feature activation+5.32e-01

Pos logit contributions
undo+2.137
tarian+1.931
 wre+1.743
away+1.735
 flung+1.705
Neg logit contributions
FontSize-1.803
Term-1.658
 EPS-1.614
arsity-1.559
\/-1.551
Token the
Token ID262
Feature activation-7.70e-01

Pos logit contributions
FontSize+0.417
Term+0.386
 EPS+0.378
\/+0.364
arsity+0.364
Neg logit contributions
undo-0.435
tarian-0.388
 wre-0.345
away-0.344
alogy-0.341
Token mainland
Token ID22779
Feature activation-1.95e+00

Pos logit contributions
undo+0.582
tarian+0.515
 wre+0.460
away+0.456
 flung+0.447
Neg logit contributions
FontSize-0.650
Term-0.605
 EPS-0.588
arsity-0.573
\/-0.571
Token,
Token ID11
Feature activation-1.23e+00

Pos logit contributions
undo+1.704
tarian+1.536
 wre+1.389
away+1.383
alogy+1.360
Neg logit contributions
FontSize-1.414
Term-1.299
 EPS-1.263
arsity-1.220
\/-1.217
Token locals
Token ID17205
Feature activation-1.23e+00

Pos logit contributions
undo+1.011
tarian+0.906
 wre+0.822
away+0.811
 flung+0.799
Neg logit contributions
FontSize-0.961
Term-0.887
 EPS-0.856
arsity-0.837
\/-0.833
Token comparison
Token ID7208
Feature activation-3.99e+00

Pos logit contributions
FontSize+0.241
Term+0.224
 EPS+0.219
arsity+0.212
\/+0.212
Neg logit contributions
undo-0.232
tarian-0.206
 wre-0.183
away-0.182
alogy-0.179
Token without
Token ID1231
Feature activation+2.58e-01

Pos logit contributions
undo+3.407
tarian+3.059
 wre+2.817
away+2.775
 flung+2.743
Neg logit contributions
FontSize-2.970
Term-2.714
 EPS-2.637
arsity-2.538
\/-2.479
Token being
Token ID852
Feature activation-2.09e+00

Pos logit contributions
FontSize+0.207
Term+0.192
 EPS+0.188
\/+0.181
arsity+0.181
Neg logit contributions
undo-0.206
tarian-0.183
 wre-0.162
away-0.162
alogy-0.160
Token nearly
Token ID3016
Feature activation-1.56e+00

Pos logit contributions
undo+1.621
tarian+1.441
 wre+1.290
away+1.286
 flung+1.268
Neg logit contributions
FontSize-1.717
Term-1.593
 EPS-1.553
arsity-1.516
\/-1.501
Token critical
Token ID4688
Feature activation-2.62e+00

Pos logit contributions
undo+1.348
tarian+1.216
 wre+1.102
away+1.095
 flung+1.088
Neg logit contributions
FontSize-1.150
Term-1.052
 EPS-1.022
arsity-0.996
\/-0.984
Token enough
Token ID1576
Feature activation-4.61e-01

Pos logit contributions
undo+2.151
tarian+1.935
 wre+1.727
away+1.722
 flung+1.693
Neg logit contributions
FontSize-2.044
Term-1.881
 EPS-1.829
arsity-1.777
\/-1.747
Token.
Token ID13
Feature activation-7.73e-01

Pos logit contributions
undo+0.390
tarian+0.350
 wre+0.315
away+0.313
alogy+0.309
Neg logit contributions
FontSize-0.346
Term-0.320
 EPS-0.311
\/-0.301
arsity-0.301
Token The
Token ID383
Feature activation-3.00e-01

Pos logit contributions
undo+0.610
tarian+0.542
 wre+0.483
away+0.482
alogy+0.474
Neg logit contributions
FontSize-0.627
Term-0.580
 EPS-0.565
arsity-0.548
\/-0.547
Token Washington
Token ID2669
Feature activation-2.44e+00

Pos logit contributions
undo+0.255
tarian+0.228
 wre+0.206
away+0.205
alogy+0.201
Neg logit contributions
FontSize-0.225
Term-0.207
 EPS-0.201
arsity-0.195
\/-0.194
Token Post
Token ID2947
Feature activation+1.15e-01

Pos logit contributions
undo+2.310
tarian+2.108
 wre+1.924
away+1.922
alogy+1.882
Neg logit contributions
FontSize-1.583
Term-1.417
 EPS-1.370
arsity-1.328
\/-1.308
Token reported
Token ID2098
Feature activation-1.76e+00

Pos logit contributions
FontSize+0.084
Term+0.078
 EPS+0.076
arsity+0.073
\/+0.073
Neg logit contributions
undo-0.099
tarian-0.089
 wre-0.080
away-0.080
alogy-0.079
Token on
Token ID319
Feature activation-2.75e-01

Pos logit contributions
FontSize+0.182
Term+0.169
 EPS+0.164
arsity+0.159
\/+0.159
Neg logit contributions
undo-0.192
tarian-0.172
 wre-0.154
away-0.153
alogy-0.150
Token social
Token ID1919
Feature activation-2.49e+00

Pos logit contributions
undo+0.224
tarian+0.201
 wre+0.180
away+0.179
alogy+0.176
Neg logit contributions
FontSize-0.215
Term-0.199
 EPS-0.193
arsity-0.188
\/-0.187
Token media
Token ID2056
Feature activation-8.30e-01

Pos logit contributions
undo+2.343
tarian+2.108
 wre+1.941
away+1.921
alogy+1.902
Neg logit contributions
FontSize-1.654
Term-1.499
 EPS-1.449
arsity-1.397
\/-1.379
Token as
Token ID355
Feature activation-5.08e-01

Pos logit contributions
undo+0.727
tarian+0.653
 wre+0.594
away+0.588
alogy+0.579
Neg logit contributions
FontSize-0.606
Term-0.555
 EPS-0.539
arsity-0.523
\/-0.517
Token well
Token ID880
Feature activation-2.01e+00

Pos logit contributions
undo+0.423
tarian+0.377
 wre+0.340
away+0.337
alogy+0.331
Neg logit contributions
FontSize-0.391
Term-0.361
 EPS-0.351
arsity-0.340
\/-0.339
Token,
Token ID11
Feature activation-5.09e-01

Pos logit contributions
undo+1.845
tarian+1.668
 wre+1.524
away+1.512
alogy+1.485
Neg logit contributions
FontSize-1.370
Term-1.252
 EPS-1.211
arsity-1.171
\/-1.156
Token such
Token ID884
Feature activation-1.54e+00

Pos logit contributions
undo+0.401
tarian+0.357
 wre+0.320
away+0.317
 flung+0.311
Neg logit contributions
FontSize-0.414
Term-0.384
 EPS-0.374
arsity-0.363
\/-0.362
Token as
Token ID355
Feature activation-4.73e+00

Pos logit contributions
undo+1.329
tarian+1.203
 wre+1.093
away+1.084
 flung+1.066
Neg logit contributions
FontSize-1.126
Term-1.031
 EPS-0.993
arsity-0.975
\/-0.958
Token when
Token ID618
Feature activation+5.72e-01

Pos logit contributions
undo+3.742
tarian+3.322
 wre+3.046
away+3.003
 que+2.920
Neg logit contributions
FontSize-3.758
Term-3.459
 EPS-3.315
arsity-3.305
\/-3.213
Token he
Token ID339
Feature activation+2.05e-01

Pos logit contributions
FontSize+0.430
Term+0.397
 EPS+0.388
arsity+0.374
\/+0.373
Neg logit contributions
undo-0.485
tarian-0.434
 wre-0.390
away-0.389
alogy-0.384
Token retweet
Token ID38814
Feature activation+2.17e+00

Pos logit contributions
FontSize+0.167
Term+0.155
 EPS+0.151
\/+0.147
arsity+0.146
Neg logit contributions
undo-0.162
tarian-0.144
 wre-0.127
away-0.127
alogy-0.125
TokenThe
Token ID464
Feature activation+8.50e-03

Pos logit contributions
undo+0.141
tarian+0.126
away+0.114
 wre+0.113
alogy+0.112
Neg logit contributions
FontSize-0.125
Term-0.115
 EPS-0.112
arsity-0.108
\/-0.108
Token Doors
Token ID33775
Feature activation-1.63e+00

Pos logit contributions
FontSize+0.006
Term+0.006
 EPS+0.005
arsity+0.005
\/+0.005
Neg logit contributions
undo-0.007
tarian-0.007
 wre-0.006
away-0.006
alogy-0.006
Token"?
Token ID13984
Feature activation-2.50e-01

Pos logit contributions
undo+1.524
tarian+1.375
 wre+1.254
away+1.252
alogy+1.231
Neg logit contributions
FontSize-1.098
Term-1.001
 EPS-0.967
arsity-0.931
\/-0.928
Token Yeah
Token ID9425
Feature activation-7.50e-01

Pos logit contributions
undo+0.200
tarian+0.179
 wre+0.159
away+0.158
alogy+0.156
Neg logit contributions
FontSize-0.200
Term-0.185
 EPS-0.181
arsity-0.175
\/-0.175
Token,
Token ID11
Feature activation-1.69e+00

Pos logit contributions
undo+0.663
tarian+0.598
 wre+0.541
away+0.539
alogy+0.530
Neg logit contributions
FontSize-0.538
Term-0.493
 EPS-0.480
arsity-0.462
\/-0.462
Token dude
Token ID18396
Feature activation-5.17e-01

Pos logit contributions
undo+1.389
tarian+1.243
 wre+1.127
away+1.118
 flung+1.095
Neg logit contributions
FontSize-1.312
Term-1.212
 EPS-1.171
arsity-1.142
\/-1.141
Token,
Token ID11
Feature activation-5.29e-01

Pos logit contributions
undo+0.483
tarian+0.438
 wre+0.398
away+0.397
alogy+0.391
Neg logit contributions
FontSize-0.345
Term-0.314
 EPS-0.305
arsity-0.293
\/-0.292
Token Floyd
Token ID21150
Feature activation-1.03e+00

Pos logit contributions
undo+0.444
tarian+0.398
 wre+0.359
away+0.357
 flung+0.351
Neg logit contributions
FontSize-0.403
Term-0.372
 EPS-0.361
arsity-0.350
\/-0.350
Token and
Token ID290
Feature activation+2.43e-01

Pos logit contributions
undo+0.875
tarian+0.785
 wre+0.710
away+0.706
 flung+0.695
Neg logit contributions
FontSize-0.772
Term-0.710
 EPS-0.692
arsity-0.669
\/-0.665
Token the
Token ID262
Feature activation-2.07e-01

Pos logit contributions
FontSize+0.183
Term+0.169
 EPS+0.165
arsity+0.159
\/+0.159
Neg logit contributions
undo-0.205
tarian-0.184
 wre-0.165
away-0.165
alogy-0.162
Token Lizard
Token ID42452
Feature activation-2.92e-01

Pos logit contributions
undo+0.171
tarian+0.153
 wre+0.138
away+0.136
 flung+0.134
Neg logit contributions
FontSize-0.162
Term-0.149
 EPS-0.145
\/-0.140
arsity-0.140
Token time
Token ID640
Feature activation-2.89e+00

Pos logit contributions
undo+0.632
tarian+0.564
 wre+0.502
away+0.501
alogy+0.493
Neg logit contributions
FontSize-0.621
Term-0.575
 EPS-0.562
arsity-0.542
\/-0.542
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
undo+2.440
tarian+2.180
 wre+1.982
away+1.969
 flung+1.946
Neg logit contributions
FontSize-2.184
Term-2.009
 EPS-1.950
arsity-1.893
\/-1.873
Token
Token ID198
Feature activation-1.31e+00

Pos logit contributions
undo+0.980
tarian+0.875
 wre+0.783
away+0.779
alogy+0.766
Neg logit contributions
FontSize-0.958
Term-0.887
 EPS-0.864
arsity-0.837
\/-0.835
Token
Token ID198
Feature activation-1.58e+00

Pos logit contributions
undo+0.966
tarian+0.833
away+0.768
 wre+0.746
alogy+0.739
Neg logit contributions
FontSize-1.138
Term-1.047
 EPS-1.022
arsity-1.006
\/-0.982
TokenThe
Token ID464
Feature activation-1.99e+00

Pos logit contributions
undo+1.208
tarian+1.065
away+0.948
 wre+0.944
alogy+0.921
Neg logit contributions
FontSize-1.325
Term-1.225
 EPS-1.205
arsity-1.168
\/-1.158
Token oppress
Token ID35035
Feature activation-4.64e+00

Pos logit contributions
undo+1.554
tarian+1.381
 wre+1.248
away+1.228
 flung+1.215
Neg logit contributions
FontSize-1.624
Term-1.507
 EPS-1.461
arsity-1.426
\/-1.417
Tokenor
Token ID273
Feature activation-2.65e+00

Pos logit contributions
undo+3.439
tarian+2.995
 wre+2.719
away+2.691
alogy+2.635
Neg logit contributions
FontSize-3.990
Term-3.677
 EPS-3.573
arsity-3.468
\/-3.438
Token's
Token ID338
Feature activation-2.52e-01

Pos logit contributions
undo+2.243
tarian+2.009
 wre+1.833
away+1.806
 flung+1.793
Neg logit contributions
FontSize-1.994
Term-1.844
 EPS-1.778
arsity-1.730
\/-1.718
Token wrong
Token ID2642
Feature activation+2.17e-01

Pos logit contributions
undo+0.196
tarian+0.174
 wre+0.154
away+0.154
alogy+0.151
Neg logit contributions
FontSize-0.207
Term-0.192
 EPS-0.188
\/-0.182
arsity-0.182
Token,
Token ID11
Feature activation-6.32e-01

Pos logit contributions
FontSize+0.156
Term+0.143
 EPS+0.140
\/+0.135
arsity+0.134
Neg logit contributions
undo-0.191
tarian-0.172
 wre-0.154
away-0.154
alogy-0.153
Token the
Token ID262
Feature activation-1.28e+00

Pos logit contributions
undo+0.517
tarian+0.463
away+0.411
 wre+0.411
alogy+0.405
Neg logit contributions
FontSize-0.494
Term-0.457
 EPS-0.447
\/-0.431
arsity-0.430
Token States
Token ID1829
Feature activation+8.35e-01

Pos logit contributions
undo+0.874
tarian+0.764
away+0.684
 wre+0.673
alogy+0.665
Neg logit contributions
FontSize-1.080
Term-0.991
 EPS-0.975
arsity-0.956
\/-0.934
Token.
Token ID13
Feature activation-1.24e-01

Pos logit contributions
FontSize+0.516
Term+0.468
 EPS+0.453
\/+0.435
arsity+0.432
Neg logit contributions
undo-0.819
tarian-0.746
 wre-0.682
away-0.679
alogy-0.670
Token It
Token ID632
Feature activation-1.25e-01

Pos logit contributions
undo+0.100
tarian+0.090
 wre+0.080
away+0.080
 flung+0.078
Neg logit contributions
FontSize-0.098
Term-0.091
 EPS-0.089
\/-0.086
arsity-0.086
Token was
Token ID373
Feature activation+1.35e+00

Pos logit contributions
undo+0.099
tarian+0.089
 wre+0.079
away+0.079
alogy+0.077
Neg logit contributions
FontSize-0.100
Term-0.093
 EPS-0.091
arsity-0.088
\/-0.087
Token originally
Token ID6198
Feature activation-9.38e-01

Pos logit contributions
FontSize+1.102
Term+1.025
 EPS+1.001
\/+0.967
arsity+0.965
Neg logit contributions
undo-1.065
tarian-0.949
away-0.838
 wre-0.837
alogy-0.826
Token passed
Token ID3804
Feature activation-4.67e+00

Pos logit contributions
undo+0.712
tarian+0.631
 wre+0.561
away+0.557
 flung+0.547
Neg logit contributions
FontSize-0.788
Term-0.732
 EPS-0.715
arsity-0.696
\/-0.693
Token in
Token ID287
Feature activation-7.98e-01

Pos logit contributions
undo+3.936
tarian+3.569
away+3.226
 wre+3.203
alogy+3.120
Neg logit contributions
FontSize-3.476
Term-3.184
 EPS-3.062
arsity-3.011
\/-2.938
Token 1970
Token ID8069
Feature activation+2.48e+00

Pos logit contributions
undo+0.626
tarian+0.557
 wre+0.496
away+0.494
alogy+0.485
Neg logit contributions
FontSize-0.651
Term-0.603
 EPS-0.588
arsity-0.570
\/-0.569
Token,[
Token ID17414
Feature activation+2.46e+00

Pos logit contributions
FontSize+1.678
Term+1.540
 EPS+1.506
\/+1.450
arsity+1.431
Neg logit contributions
undo-2.273
tarian-2.054
 wre-1.862
away-1.853
alogy-1.842
Token2
Token ID17
Feature activation-2.51e+00

Pos logit contributions
FontSize+1.405
Term+1.293
 EPS+1.225
\/+1.173
arsity+1.166
Neg logit contributions
undo-2.480
tarian-2.293
 wre-2.112
 flung-2.082
away-2.076
Token]
Token ID60
Feature activation+1.74e+00

Pos logit contributions
undo+2.877
tarian+2.660
away+2.471
 wre+2.467
alogy+2.439
Neg logit contributions
FontSize-1.145
Term-0.984
 EPS-0.938
arsity-0.879
\/-0.868
Token talks
Token ID6130
Feature activation+2.24e-01

Pos logit contributions
FontSize+0.906
Term+0.836
 EPS+0.832
arsity+0.803
\/+0.793
Neg logit contributions
undo-1.021
tarian-0.913
 wre-0.814
away-0.809
alogy-0.804
Token and
Token ID290
Feature activation+5.82e-01

Pos logit contributions
FontSize+0.157
Term+0.144
 EPS+0.140
\/+0.136
arsity+0.135
Neg logit contributions
undo-0.201
tarian-0.182
 wre-0.164
away-0.163
alogy-0.162
Token other
Token ID584
Feature activation+5.55e-01

Pos logit contributions
FontSize+0.432
Term+0.398
 EPS+0.388
arsity+0.374
\/+0.374
Neg logit contributions
undo-0.499
tarian-0.449
 wre-0.404
away-0.402
alogy-0.396
Token messages
Token ID6218
Feature activation+2.33e-01

Pos logit contributions
FontSize+0.415
Term+0.381
 EPS+0.369
arsity+0.358
\/+0.356
Neg logit contributions
undo-0.475
tarian-0.426
 wre-0.385
away-0.384
alogy-0.377
Token via
Token ID2884
Feature activation-9.56e-01

Pos logit contributions
FontSize+0.177
Term+0.163
 EPS+0.159
arsity+0.153
\/+0.153
Neg logit contributions
undo-0.196
tarian-0.176
 wre-0.158
away-0.157
alogy-0.155
Token Mi
Token ID13756
Feature activation-4.59e+00

Pos logit contributions
undo+0.782
tarian+0.694
 wre+0.629
away+0.624
 flung+0.613
Neg logit contributions
FontSize-0.750
Term-0.690
 EPS-0.670
arsity-0.650
\/-0.648
Tokeniverse
Token ID3997
Feature activation-9.82e-01

Pos logit contributions
undo+2.959
tarian+2.499
 wre+2.207
away+2.196
 flung+2.163
Neg logit contributions
FontSize-4.368
 EPS-4.007
Term-3.951
arsity-3.858
\/-3.851
Token.
Token ID13
Feature activation-2.89e-01

Pos logit contributions
undo+0.885
tarian+0.796
 wre+0.725
away+0.722
alogy+0.710
Neg logit contributions
FontSize-0.692
Term-0.630
 EPS-0.612
arsity-0.591
\/-0.584
Token
Token ID198
Feature activation-5.66e-01

Pos logit contributions
undo+0.229
tarian+0.204
 wre+0.182
away+0.181
alogy+0.178
Neg logit contributions
FontSize-0.233
Term-0.216
 EPS-0.211
arsity-0.204
\/-0.204
Token
Token ID198
Feature activation-7.88e-01

Pos logit contributions
undo+0.428
tarian+0.374
away+0.340
 wre+0.333
alogy+0.329
Neg logit contributions
FontSize-0.479
Term-0.441
 EPS-0.432
arsity-0.421
\/-0.414
TokenThere
Token ID1858
Feature activation+1.57e-01

Pos logit contributions
undo+0.647
tarian+0.571
away+0.519
 wre+0.510
alogy+0.505
Neg logit contributions
FontSize-0.613
Term-0.562
 EPS-0.559
arsity-0.537
\/-0.531
Token and
Token ID290
Feature activation+7.82e-01

Pos logit contributions
FontSize+1.303
Term+1.198
 EPS+1.169
\/+1.133
arsity+1.124
Neg logit contributions
undo-1.731
tarian-1.566
away-1.408
 wre-1.407
alogy-1.400
Token efforts
Token ID4040
Feature activation-1.58e+00

Pos logit contributions
FontSize+0.613
Term+0.567
 EPS+0.555
\/+0.535
arsity+0.535
Neg logit contributions
undo-0.640
tarian-0.572
 wre-0.508
away-0.507
alogy-0.502
Token to
Token ID284
Feature activation+5.76e-01

Pos logit contributions
undo+1.231
tarian+1.090
 wre+0.974
away+0.967
 flung+0.949
Neg logit contributions
FontSize-1.307
Term-1.213
 EPS-1.184
arsity-1.148
\/-1.144
Token tighten
Token ID31833
Feature activation-7.09e-01

Pos logit contributions
FontSize+0.469
Term+0.436
 EPS+0.428
\/+0.414
arsity+0.413
Neg logit contributions
undo-0.448
tarian-0.401
away-0.352
 wre-0.351
alogy-0.349
Token gun
Token ID2485
Feature activation-3.16e-01

Pos logit contributions
undo+0.563
tarian+0.500
 wre+0.444
away+0.443
alogy+0.438
Neg logit contributions
FontSize-0.574
Term-0.531
 EPS-0.521
\/-0.502
arsity-0.501
Token control
Token ID1630
Feature activation-6.55e+00

Pos logit contributions
undo+0.207
tarian+0.178
 wre+0.154
away+0.153
alogy+0.151
Neg logit contributions
FontSize-0.299
Term-0.280
 EPS-0.275
\/-0.268
arsity-0.267
Token.
Token ID13
Feature activation-7.92e-01

Pos logit contributions
undo+4.925
tarian+4.392
 wre+3.980
away+3.898
 flung+3.784
Neg logit contributions
FontSize-5.442
Term-5.029
 EPS-4.877
arsity-4.805
\/-4.672
Token His
Token ID2399
Feature activation+9.13e-01

Pos logit contributions
undo+0.621
tarian+0.550
 wre+0.491
away+0.491
alogy+0.482
Neg logit contributions
FontSize-0.646
Term-0.598
 EPS-0.583
arsity-0.564
\/-0.564
Token family
Token ID1641
Feature activation-6.35e-01

Pos logit contributions
FontSize+0.719
Term+0.665
 EPS+0.648
arsity+0.627
\/+0.626
Neg logit contributions
undo-0.743
tarian-0.664
 wre-0.594
away-0.591
alogy-0.582
Token foundation
Token ID8489
Feature activation-2.61e+00

Pos logit contributions
undo+0.505
tarian+0.450
 wre+0.403
away+0.401
alogy+0.394
Neg logit contributions
FontSize-0.511
Term-0.474
 EPS-0.461
arsity-0.448
\/-0.446
Token has
Token ID468
Feature activation+3.18e-01

Pos logit contributions
undo+2.054
tarian+1.831
 wre+1.672
away+1.632
 flung+1.623
Neg logit contributions
FontSize-2.121
Term-1.964
 EPS-1.900
arsity-1.846
\/-1.827
Token Recently
Token ID23413
Feature activation+4.85e-01

Pos logit contributions
undo+0.094
tarian+0.083
 wre+0.075
away+0.074
alogy+0.073
Neg logit contributions
FontSize-0.094
Term-0.087
 EPS-0.084
arsity-0.082
\/-0.082
Token the
Token ID262
Feature activation+8.89e-03

Pos logit contributions
FontSize+0.361
Term+0.334
 EPS+0.326
\/+0.314
arsity+0.313
Neg logit contributions
undo-0.414
tarian-0.372
 wre-0.334
away-0.333
alogy-0.329
Token European
Token ID3427
Feature activation-1.94e-01

Pos logit contributions
FontSize+0.007
Term+0.006
 EPS+0.006
arsity+0.006
\/+0.006
Neg logit contributions
undo-0.008
tarian-0.007
 wre-0.006
away-0.006
alogy-0.006
Token union
Token ID6441
Feature activation-3.35e+00

Pos logit contributions
undo+0.141
tarian+0.124
 wre+0.110
away+0.109
alogy+0.107
Neg logit contributions
FontSize-0.169
Term-0.157
 EPS-0.154
arsity-0.149
\/-0.149
Token has
Token ID468
Feature activation-3.74e-01

Pos logit contributions
undo+2.731
tarian+2.434
 wre+2.200
away+2.184
 flung+2.150
Neg logit contributions
FontSize-2.638
Term-2.421
 EPS-2.326
arsity-2.275
\/-2.260
Token banned
Token ID9301
Feature activation-7.98e+00

Pos logit contributions
undo+0.297
tarian+0.265
 wre+0.235
away+0.234
alogy+0.231
Neg logit contributions
FontSize-0.301
Term-0.280
 EPS-0.274
\/-0.264
arsity-0.264
Token the
Token ID262
Feature activation-1.21e+00

Pos logit contributions
undo+6.033
tarian+5.431
 wre+5.056
away+4.936
 hammer+4.902
Neg logit contributions
FontSize-6.421
Term-5.735
arsity-5.564
\/-5.341
 EPS-5.334
Token use
Token ID779
Feature activation+2.04e+00

Pos logit contributions
undo+0.993
tarian+0.889
 wre+0.797
away+0.793
alogy+0.780
Neg logit contributions
FontSize-0.937
Term-0.865
 EPS-0.842
arsity-0.815
\/-0.814
Token of
Token ID286
Feature activation+2.11e-02

Pos logit contributions
FontSize+1.344
Term+1.234
 EPS+1.215
\/+1.164
arsity+1.151
Neg logit contributions
undo-1.897
tarian-1.717
 wre-1.545
away-1.542
alogy-1.531
Token the
Token ID262
Feature activation-4.78e-01

Pos logit contributions
FontSize+0.016
Term+0.015
 EPS+0.014
\/+0.014
arsity+0.014
Neg logit contributions
undo-0.018
tarian-0.016
 wre-0.014
away-0.014
alogy-0.014
Token three
Token ID1115
Feature activation+1.01e+00

Pos logit contributions
undo+0.397
tarian+0.355
 wre+0.318
away+0.317
alogy+0.312
Neg logit contributions
FontSize-0.369
Term-0.341
 EPS-0.332
arsity-0.321
\/-0.321