TOP ACTIVATIONS
MAX = 17.574

UR -- With barely a month leading up to her retirement
In juries Bite Time for L aine to Shine
( ap . parse _ args ()) # initialize a rectangular
in pointed out recently in Time , has ironically only increased
driving with more than three minutes left in the game ,
the race , the less time Mrs . Clinton has to
24 With two years to go until the date
est imagination , with the political capital that was involved in
21 )) # loop over the input image paths for image
out with her 6 - month - old , Conan ,
arg parse . Arg ument Parser () ap . add _
Flyers with just over six minutes remaining gave them an opportunity
flags , during the ' Yes ' vote closing campaign in
that roughly two out of every five Americans knows about the
the most potent -- and dangerous -- American energy entrepreneur right
90 % of Yes votes with a
easy despite the amount of time it takes and it bright
Jonathan Warren , Director of Time 4 sleep . co .
their cost . On the other hand , if the sidewalk
nearest big city is an hour away ," says A du

MIN ACTIVATIONS
MIN = -19.013

v 2 . Ga ussian Bl ur ( gray , (
what s past the edge of the Universe
( as opposed to back in the day playing at 640
recovery from the crisis by opening up trade in services in
and coll aps ing .
into a variety of inventive coll ages now making the rounds
atable issue of what it 's like to get in a
, after Hastings maybe it seemed like life could go on
int / violence _ in jury _ pre vention / key
and support tax cuts for small businesses and working families ,
his current NHL team . For the Pens - Fle ury
! Is it something you 'd ever want to
: Ga ussian bl urring is applied on Line
the basic division of virtuous private property and oppressive public interest
. Does it seem fair to you ,
mass , and for the most part ,
html / people / ye ager _ bi ography . html
t . Franklin Del ano Roosevelt was credited as
their ambitions were not always self less ".[ 39 ]
http :// www . ed wards . af . mil /

INTERVAL 8.427 - 17.574
CONTAINS 0.820%

a conservative estimate of the incub ation times for the two
l ono i o Kal an ial i il
ig dem ont and his nationalist heter ogeneous coalition ( lab
for Guard , on the other hand , exits epoch management
he enjoyed his time with the Heat in Summer

INTERVAL -0.720 - 8.427
CONTAINS 58.556%

to lay another allegation on the Pentagon , saying that he
(* take the N Ns from previous ser ached locations which
on the trade front , Ireland s national accounts
Haw aii has several good legislators who
, got crushed under her weight and suffered serious head injuries

INTERVAL -9.866 - -0.720
CONTAINS 40.228%

disagreements in rating may not reflect true disagreement ; they may
co ach ), Richard Irving , Paul Sch oles , Mark
When I joined Netflix in 2014 , we had begun transitioning
things don 't . I 'm sure there 's
touching the ground in co - op ( 6 )

INTERVAL -19.013 - -9.866
CONTAINS 0.396%

Views ." ... Charles Murray just tweeted the following
his prog eny , that he began to turn to writing
.' The comment seemed a rebuke of President -
cared for her , but something of what they had together
adults that had been ignored for the past decade .
TokenUR
Token ID4261
Feature activation+4.60e+00

Pos logit contributions
united+0.086
adders+0.078
ridges+0.078
 acknow+0.072
ards+0.070
Neg logit contributions
ogene-0.158
awan-0.150
vati-0.150
undo-0.149
argument-0.147
Token --
Token ID1377
Feature activation+2.92e+00

Pos logit contributions
ogene+3.233
awan+3.114
vati+3.067
undo+3.029
argument+2.976
Neg logit contributions
united-3.278
ridges-3.123
adders-3.102
 acknow-2.940
ards-2.862
Token With
Token ID2080
Feature activation+4.09e+00

Pos logit contributions
ogene+2.008
awan+1.918
vati+1.890
undo+1.881
 Manip+1.847
Neg logit contributions
united-2.143
ridges-2.027
adders-2.020
 acknow-1.927
ards-1.877
Token barely
Token ID8523
Feature activation+7.29e+00

Pos logit contributions
ogene+2.911
awan+2.791
vati+2.763
undo+2.726
 Manip+2.699
Neg logit contributions
united-2.893
adders-2.731
ridges-2.727
 acknow-2.582
ards-2.526
Token a
Token ID257
Feature activation+5.45e+00

Pos logit contributions
ogene+4.262
awan+3.982
vati+3.937
 Manip+3.864
argument+3.840
Neg logit contributions
united-6.033
ridges-5.787
adders-5.740
 acknow-5.338
aunders-5.317
Token month
Token ID1227
Feature activation+1.76e+01

Pos logit contributions
ogene+3.831
awan+3.651
vati+3.598
 Manip+3.581
undo+3.577
Neg logit contributions
united-3.852
adders-3.676
ridges-3.672
 acknow-3.368
ards-3.331
Token leading
Token ID3756
Feature activation+2.80e+00

Pos logit contributions
ogene+8.298
 circulating+8.282
awan+7.934
undo+7.832
vati+7.425
Neg logit contributions
united-13.631
ridges-13.323
adders-13.152
ategories-12.579
 advoc-12.497
Token up
Token ID510
Feature activation+1.10e+00

Pos logit contributions
ogene+1.809
awan+1.705
vati+1.684
undo+1.671
argument+1.635
Neg logit contributions
united-2.181
adders-2.067
ridges-2.061
 acknow-1.964
ards-1.923
Token to
Token ID284
Feature activation+1.07e+00

Pos logit contributions
ogene+0.818
awan+0.776
vati+0.772
undo+0.766
argument+0.749
Neg logit contributions
united-0.753
ridges-0.706
adders-0.704
 acknow-0.668
ards-0.654
Token her
Token ID607
Feature activation-5.75e-01

Pos logit contributions
ogene+0.798
awan+0.758
vati+0.753
undo+0.748
argument+0.732
Neg logit contributions
united-0.728
ridges-0.682
adders-0.680
 acknow-0.645
ards-0.632
Token retirement
Token ID10737
Feature activation-1.97e+00

Pos logit contributions
united+0.391
ridges+0.365
adders+0.363
 acknow+0.349
ards+0.340
Neg logit contributions
ogene-0.428
undo-0.404
awan-0.403
vati-0.402
argument-0.392
Token In
Token ID554
Feature activation+5.98e+00

Pos logit contributions
ogene+1.845
awan+1.739
vati+1.727
undo+1.721
 Manip+1.683
Neg logit contributions
united-1.902
ridges-1.804
adders-1.787
 acknow-1.719
ards-1.656
Tokenjuries
Token ID47496
Feature activation+6.30e+00

Pos logit contributions
ogene+3.943
awan+3.718
vati+3.648
undo+3.621
argument+3.587
Neg logit contributions
united-4.505
ridges-4.313
adders-4.221
 acknow-4.120
 advoc-3.995
Token Bite
Token ID44540
Feature activation-4.16e+00

Pos logit contributions
ogene+4.089
awan+3.893
 Manip+3.887
vati+3.870
undo+3.851
Neg logit contributions
united-4.712
adders-4.530
ridges-4.521
 acknow-4.321
 advoc-4.223
Token
Token ID198
Feature activation+3.06e+00

Pos logit contributions
united+2.826
adders+2.597
ridges+2.590
ards+2.484
 acknow+2.410
Neg logit contributions
ogene-3.099
vati-2.930
undo-2.912
awan-2.900
argument-2.838
Token
Token ID198
Feature activation+1.97e+00

Pos logit contributions
ogene+2.186
awan+2.086
undo+2.060
vati+2.060
argument+2.020
Neg logit contributions
united-2.147
ridges-2.042
adders-2.030
 acknow-1.984
 advoc-1.951
TokenTime
Token ID7575
Feature activation+1.74e+01

Pos logit contributions
ogene+1.497
awan+1.427
vati+1.415
undo+1.415
argument+1.386
Neg logit contributions
united-1.300
ridges-1.226
adders-1.219
 acknow-1.167
ards-1.126
Token for
Token ID329
Feature activation-2.10e+00

Pos logit contributions
 Manip+9.999
ogene+9.405
 Demand+9.265
undo+9.132
awan+9.072
Neg logit contributions
united-11.481
adders-11.176
ridges-11.156
 encount-10.881
 acknow-10.879
Token L
Token ID406
Feature activation-1.75e+00

Pos logit contributions
united+1.488
adders+1.393
ridges+1.388
 acknow+1.324
ards+1.298
Neg logit contributions
ogene-1.515
awan-1.427
vati-1.427
undo-1.409
argument-1.370
Tokenaine
Token ID5718
Feature activation+9.16e-01

Pos logit contributions
united+1.185
adders+1.125
ridges+1.118
ards+1.041
 acknow+1.025
Neg logit contributions
ogene-1.306
awan-1.233
vati-1.232
undo-1.219
 Manip-1.206
Token to
Token ID284
Feature activation-1.48e+00

Pos logit contributions
ogene+0.677
awan+0.642
vati+0.638
undo+0.634
argument+0.620
Neg logit contributions
united-0.632
ridges-0.593
adders-0.591
 acknow-0.561
ards-0.549
Token Shine
Token ID41249
Feature activation+2.23e+00

Pos logit contributions
united+1.107
adders+1.041
ridges+1.037
 acknow+1.006
ards+0.978
Neg logit contributions
ogene-0.991
vati-0.942
awan-0.935
undo-0.917
argument-0.902
Token(
Token ID7
Feature activation+2.76e+00

Pos logit contributions
ogene+1.425
awan+1.359
vati+1.356
undo+1.342
argument+1.323
Neg logit contributions
united-1.014
ridges-0.946
adders-0.941
 acknow-0.899
 advoc-0.870
Tokenap
Token ID499
Feature activation+2.44e+00

Pos logit contributions
ogene+2.110
awan+2.023
vati+2.006
undo+1.991
argument+1.970
Neg logit contributions
united-1.815
ridges-1.706
adders-1.693
 acknow-1.646
 advoc-1.578
Token.
Token ID13
Feature activation+1.04e-01

Pos logit contributions
ogene+1.906
awan+1.818
vati+1.798
undo+1.793
argument+1.765
Neg logit contributions
united-1.561
ridges-1.470
adders-1.464
 acknow-1.422
 advoc-1.396
Tokenparse
Token ID29572
Feature activation-5.79e-01

Pos logit contributions
ogene+0.050
awan+0.046
vati+0.045
undo+0.045
 Manip+0.043
Neg logit contributions
united-0.100
ridges-0.095
adders-0.095
 acknow-0.091
ards-0.090
Token_
Token ID62
Feature activation+2.65e+00

Pos logit contributions
united+0.224
adders+0.197
ridges+0.195
ards+0.172
 acknow+0.167
Neg logit contributions
ogene-0.607
awan-0.583
vati-0.581
undo-0.579
argument-0.566
Tokenargs
Token ID22046
Feature activation+1.66e+01

Pos logit contributions
ogene+0.914
awan+0.809
vati+0.805
undo+0.799
argument+0.787
Neg logit contributions
united-2.850
ridges-2.732
adders-2.730
 acknow-2.707
 advoc-2.632
Token())
Token ID28955
Feature activation-2.46e-01

Pos logit contributions
ogene+4.706
argument+4.653
vati+4.107
awan+4.065
milo+3.856
Neg logit contributions
 encount-17.187
 advoc-16.538
 acknow-16.454
 therap-15.555
united-15.553
Token #
Token ID1303
Feature activation+1.19e+01

Pos logit contributions
united+0.173
ridges+0.162
adders+0.162
 acknow+0.154
ards+0.151
Neg logit contributions
ogene-0.179
awan-0.169
vati-0.168
undo-0.167
argument-0.164
Token initialize
Token ID41216
Feature activation-1.02e+00

Pos logit contributions
ogene+5.353
argument+4.975
awan+4.856
 Manip+4.729
undo+4.628
Neg logit contributions
united-10.752
ridges-9.868
adders-9.702
ategories-9.683
 advoc-9.599
Token a
Token ID257
Feature activation-2.91e+00

Pos logit contributions
united+0.722
adders+0.682
ridges+0.679
ards+0.638
 acknow+0.632
Neg logit contributions
ogene-0.731
awan-0.689
undo-0.685
vati-0.684
argument-0.662
Token rectangular
Token ID36954
Feature activation+1.08e+00

Pos logit contributions
united+1.558
ridges+1.440
adders+1.427
 acknow+1.331
ards+1.307
Neg logit contributions
ogene-2.570
vati-2.467
awan-2.465
undo-2.451
argument-2.393
Tokenin
Token ID259
Feature activation+4.26e+00

Pos logit contributions
united+2.536
adders+2.291
ridges+2.230
ards+2.087
 acknow+2.055
Neg logit contributions
ogene-5.677
undo-5.563
vati-5.486
awan-5.378
 Manip-5.342
Token pointed
Token ID6235
Feature activation+7.52e+00

Pos logit contributions
ogene+3.310
awan+3.130
vati+3.120
undo+3.117
argument+3.065
Neg logit contributions
united-2.742
ridges-2.555
adders-2.541
 acknow-2.327
ards-2.292
Token out
Token ID503
Feature activation+3.92e+00

Pos logit contributions
ogene+3.359
awan+2.940
argument+2.910
undo+2.900
vati+2.894
Neg logit contributions
united-7.201
adders-6.957
ridges-6.912
 acknow-6.709
 encount-6.631
Token recently
Token ID2904
Feature activation+1.96e+00

Pos logit contributions
ogene+2.687
awan+2.521
vati+2.507
undo+2.503
argument+2.458
Neg logit contributions
united-2.907
adders-2.741
ridges-2.735
 acknow-2.590
ards-2.544
Token in
Token ID287
Feature activation+1.34e+00

Pos logit contributions
ogene+1.363
awan+1.282
vati+1.271
undo+1.269
argument+1.244
Neg logit contributions
united-1.434
ridges-1.352
adders-1.351
 acknow-1.283
ards-1.254
Token Time
Token ID3862
Feature activation+1.66e+01

Pos logit contributions
ogene+0.957
awan+0.904
vati+0.899
undo+0.894
argument+0.875
Neg logit contributions
united-0.953
ridges-0.896
adders-0.893
 acknow-0.850
ards-0.832
Token,
Token ID11
Feature activation-4.76e-01

Pos logit contributions
ogene+9.123
undo+8.818
 Manip+8.386
argument+8.356
awan+8.299
Neg logit contributions
united-11.685
adders-11.377
ridges-11.326
 acknow-10.863
ards-10.762
Token has
Token ID468
Feature activation+3.13e+00

Pos logit contributions
united+0.309
adders+0.288
ridges+0.288
 acknow+0.273
ards+0.268
Neg logit contributions
ogene-0.371
awan-0.353
vati-0.351
undo-0.349
argument-0.340
Token ironically
Token ID32532
Feature activation+5.35e+00

Pos logit contributions
ogene+2.459
awan+2.332
vati+2.328
undo+2.322
argument+2.269
Neg logit contributions
united-1.982
ridges-1.867
adders-1.863
 acknow-1.720
aunders-1.693
Token only
Token ID691
Feature activation+4.52e+00

Pos logit contributions
ogene+4.109
vati+3.893
awan+3.878
undo+3.873
argument+3.789
Neg logit contributions
united-3.391
adders-3.242
ridges-3.241
aunders-2.990
 acknow-2.927
Token increased
Token ID3220
Feature activation+3.16e+00

Pos logit contributions
ogene+3.503
vati+3.319
awan+3.297
undo+3.291
argument+3.224
Neg logit contributions
united-2.881
ridges-2.744
adders-2.734
aunders-2.506
 acknow-2.503
Token driving
Token ID5059
Feature activation+2.37e+00

Pos logit contributions
ogene+2.922
awan+2.781
undo+2.741
vati+2.732
 Manip+2.712
Neg logit contributions
united-3.077
adders-2.917
ridges-2.898
 acknow-2.693
ards-2.677
Token with
Token ID351
Feature activation+2.02e+00

Pos logit contributions
ogene+1.735
awan+1.653
vati+1.640
undo+1.626
argument+1.600
Neg logit contributions
united-1.640
adders-1.532
ridges-1.531
 acknow-1.445
ards-1.407
Token more
Token ID517
Feature activation+7.76e+00

Pos logit contributions
ogene+1.454
awan+1.379
vati+1.368
undo+1.360
argument+1.329
Neg logit contributions
united-1.428
ridges-1.341
adders-1.337
 acknow-1.268
ards-1.243
Token than
Token ID621
Feature activation+5.32e+00

Pos logit contributions
ogene+5.426
undo+5.040
awan+5.036
vati+5.012
 Manip+5.010
Neg logit contributions
united-5.414
ridges-5.111
adders-5.091
 acknow-4.681
aunders-4.649
Token three
Token ID1115
Feature activation+6.73e+00

Pos logit contributions
ogene+2.882
awan+2.684
vati+2.647
undo+2.610
 Manip+2.570
Neg logit contributions
united-4.641
adders-4.443
ridges-4.437
 acknow-4.213
 advoc-4.129
Token minutes
Token ID2431
Feature activation+1.63e+01

Pos logit contributions
ogene+4.436
awan+4.110
 Manip+4.058
undo+4.041
 circulating+4.024
Neg logit contributions
united-5.113
adders-4.766
ridges-4.690
 acknow-4.453
aunders-4.378
Token left
Token ID1364
Feature activation+5.09e+00

Pos logit contributions
 circulating+7.894
ogene+7.838
awan+7.785
undo+7.576
vati+7.448
Neg logit contributions
united-12.684
adders-12.125
ridges-11.958
 advoc-11.783
ategories-11.417
Token in
Token ID287
Feature activation+2.59e+00

Pos logit contributions
ogene+3.360
awan+3.197
undo+3.183
vati+3.118
argument+3.085
Neg logit contributions
united-3.816
adders-3.609
ridges-3.597
 acknow-3.449
 advoc-3.367
Token the
Token ID262
Feature activation+4.01e+00

Pos logit contributions
ogene+1.888
awan+1.806
undo+1.783
vati+1.775
argument+1.742
Neg logit contributions
united-1.804
adders-1.689
ridges-1.684
 acknow-1.602
ards-1.570
Token game
Token ID983
Feature activation+6.01e+00

Pos logit contributions
ogene+3.169
awan+3.028
undo+3.018
vati+2.999
 Manip+2.938
Neg logit contributions
united-2.549
adders-2.373
ridges-2.361
 acknow-2.212
ards-2.157
Token,
Token ID11
Feature activation+6.95e-01

Pos logit contributions
ogene+3.900
awan+3.759
undo+3.700
vati+3.696
 Manip+3.619
Neg logit contributions
united-4.517
ridges-4.307
adders-4.249
 acknow-4.062
 advoc-3.960
Token the
Token ID262
Feature activation+3.53e+00

Pos logit contributions
ogene+0.720
awan+0.684
vati+0.680
undo+0.676
argument+0.662
Neg logit contributions
united-0.600
ridges-0.561
adders-0.559
 acknow-0.528
ards-0.517
Token race
Token ID3234
Feature activation+5.58e+00

Pos logit contributions
ogene+2.627
awan+2.500
vati+2.481
undo+2.470
argument+2.417
Neg logit contributions
united-2.367
ridges-2.243
adders-2.221
 acknow-2.098
ards-2.068
Token,
Token ID11
Feature activation-7.90e-01

Pos logit contributions
ogene+3.659
awan+3.431
vati+3.431
undo+3.367
argument+3.298
Neg logit contributions
united-4.232
ridges-4.073
adders-3.977
 acknow-3.799
ards-3.706
Token the
Token ID262
Feature activation-2.23e+00

Pos logit contributions
united+0.497
ridges+0.463
adders+0.463
 acknow+0.436
ards+0.426
Neg logit contributions
ogene-0.632
awan-0.601
vati-0.598
undo-0.595
argument-0.582
Token less
Token ID1342
Feature activation+7.10e+00

Pos logit contributions
united+1.032
ridges+0.932
adders+0.931
 acknow+0.868
ards+0.833
Neg logit contributions
ogene-2.158
awan-2.072
vati-2.063
undo-2.050
argument-2.010
Token time
Token ID640
Feature activation+1.62e+01

Pos logit contributions
ogene+4.854
awan+4.544
vati+4.538
undo+4.514
argument+4.407
Neg logit contributions
united-5.072
ridges-4.804
adders-4.756
 acknow-4.454
ards-4.428
Token Mrs
Token ID9074
Feature activation-1.88e+00

Pos logit contributions
ogene+9.741
awan+9.326
 circulating+9.162
vati+8.967
 Manip+8.934
Neg logit contributions
united-11.070
ridges-10.408
adders-10.289
 acknow-9.710
aunders-9.612
Token.
Token ID13
Feature activation+2.55e+00

Pos logit contributions
united+1.076
adders+0.980
ridges+0.980
ards+0.908
 acknow+0.889
Neg logit contributions
ogene-1.612
vati-1.536
awan-1.530
undo-1.523
argument-1.484
Token Clinton
Token ID2605
Feature activation+7.01e+00

Pos logit contributions
ogene+1.425
awan+1.329
vati+1.312
undo+1.293
 Manip+1.292
Neg logit contributions
united-2.198
ridges-2.100
adders-2.092
 acknow-2.015
 advoc-1.987
Token has
Token ID468
Feature activation+1.21e+01

Pos logit contributions
ogene+4.945
awan+4.737
vati+4.663
undo+4.590
argument+4.487
Neg logit contributions
united-4.941
ridges-4.680
adders-4.622
 acknow-4.329
aunders-4.222
Token to
Token ID284
Feature activation+4.90e+00

Pos logit contributions
ogene+7.096
awan+6.890
 circulating+6.850
vati+6.823
undo+6.700
Neg logit contributions
united-8.921
ridges-8.570
adders-8.507
aunders-8.182
 advoc-8.076
Token 24
Token ID1987
Feature activation-3.80e+00

Pos logit contributions
ogene+0.147
vati+0.137
awan+0.136
undo+0.135
argument+0.130
Neg logit contributions
united-0.237
adders-0.224
ridges-0.224
ards-0.214
 acknow-0.214
Token
Token ID198
Feature activation-2.39e+00

Pos logit contributions
united+2.512
ridges+2.312
adders+2.300
 acknow+2.180
ards+2.164
Neg logit contributions
ogene-2.923
vati-2.762
awan-2.699
undo-2.697
argument-2.643
Token
Token ID198
Feature activation-2.36e+00

Pos logit contributions
united+1.431
ridges+1.286
adders+1.286
ards+1.220
aunders+1.145
Neg logit contributions
ogene-2.010
vati-1.904
awan-1.881
undo-1.870
argument-1.821
TokenWith
Token ID3152
Feature activation+1.94e+00

Pos logit contributions
united+1.603
ridges+1.461
adders+1.459
ards+1.370
 acknow+1.342
Neg logit contributions
ogene-1.795
awan-1.696
vati-1.694
undo-1.666
 Manip-1.637
Token two
Token ID734
Feature activation+6.16e+00

Pos logit contributions
ogene+1.422
awan+1.348
vati+1.341
undo+1.331
argument+1.299
Neg logit contributions
united-1.344
adders-1.264
ridges-1.262
 acknow-1.199
ards-1.172
Token years
Token ID812
Feature activation+1.62e+01

Pos logit contributions
ogene+4.509
awan+4.272
undo+4.223
 Manip+4.173
vati+4.148
Neg logit contributions
united-4.131
adders-3.897
ridges-3.885
 acknow-3.689
ards-3.597
Token to
Token ID284
Feature activation+3.38e+00

Pos logit contributions
ogene+8.456
 circulating+8.258
undo+7.965
awan+7.946
vati+7.669
Neg logit contributions
united-12.015
ridges-11.541
adders-11.509
ategories-11.098
 advoc-10.742
Token go
Token ID467
Feature activation+6.99e+00

Pos logit contributions
ogene+2.103
awan+1.978
undo+1.964
vati+1.941
argument+1.890
Neg logit contributions
united-2.719
ridges-2.590
adders-2.578
 acknow-2.445
ards-2.415
Token until
Token ID1566
Feature activation-1.35e-02

Pos logit contributions
ogene+4.305
awan+4.053
undo+4.015
vati+3.976
 Manip+3.881
Neg logit contributions
united-5.434
ridges-5.197
adders-5.138
 acknow-4.969
 advoc-4.819
Token the
Token ID262
Feature activation-2.69e+00

Pos logit contributions
united+0.009
ridges+0.008
adders+0.008
 acknow+0.008
ards+0.008
Neg logit contributions
ogene-0.010
awan-0.010
vati-0.010
undo-0.010
argument-0.009
Token date
Token ID3128
Feature activation+4.44e+00

Pos logit contributions
united+2.035
ridges+1.911
adders+1.888
 acknow+1.808
ards+1.792
Neg logit contributions
ogene-1.788
awan-1.688
vati-1.685
undo-1.660
argument-1.621
Tokenest
Token ID395
Feature activation+3.18e+00

Pos logit contributions
ogene+0.993
awan+0.950
vati+0.945
undo+0.940
argument+0.923
Neg logit contributions
united-0.613
ridges-0.564
adders-0.562
 acknow-0.525
ards-0.510
Token imagination
Token ID13843
Feature activation+3.49e+00

Pos logit contributions
ogene+2.416
awan+2.310
vati+2.279
undo+2.276
argument+2.225
Neg logit contributions
united-2.115
ridges-1.980
adders-1.972
 acknow-1.865
ards-1.820
Token,
Token ID11
Feature activation+8.62e-01

Pos logit contributions
ogene+2.682
awan+2.554
vati+2.542
undo+2.541
argument+2.484
Neg logit contributions
united-2.288
ridges-2.125
adders-2.117
 acknow-1.988
ards-1.924
Token with
Token ID351
Feature activation+4.05e+00

Pos logit contributions
ogene+0.632
awan+0.599
vati+0.595
undo+0.591
argument+0.578
Neg logit contributions
united-0.601
ridges-0.564
adders-0.563
 acknow-0.534
ards-0.524
Token the
Token ID262
Feature activation+5.52e+00

Pos logit contributions
ogene+2.913
awan+2.736
vati+2.733
undo+2.699
 Manip+2.653
Neg logit contributions
united-2.856
ridges-2.695
adders-2.680
 acknow-2.546
ards-2.486
Token political
Token ID1964
Feature activation+1.61e+01

Pos logit contributions
ogene+4.145
vati+3.885
awan+3.858
undo+3.849
 Manip+3.803
Neg logit contributions
united-3.666
ridges-3.440
adders-3.424
 acknow-3.231
ards-3.192
Token capital
Token ID3139
Feature activation+4.09e+00

Pos logit contributions
ogene+9.814
 Manip+9.583
 circulating+9.283
argument+9.121
undo+9.020
Neg logit contributions
united-10.812
ridges-10.531
adders-10.254
aunders-9.848
ategories-9.811
Token that
Token ID326
Feature activation+4.83e+00

Pos logit contributions
ogene+2.994
awan+2.835
vati+2.816
undo+2.800
argument+2.740
Neg logit contributions
united-2.832
ridges-2.655
adders-2.639
 acknow-2.509
ards-2.444
Token was
Token ID373
Feature activation+7.91e+00

Pos logit contributions
ogene+3.614
awan+3.398
vati+3.374
undo+3.348
 Manip+3.320
Neg logit contributions
united-3.272
ridges-3.064
adders-3.041
 acknow-2.861
aunders-2.792
Token involved
Token ID2950
Feature activation+1.76e+00

Pos logit contributions
ogene+5.390
awan+5.076
undo+5.051
 circulating+5.044
vati+5.025
Neg logit contributions
united-5.649
ridges-5.348
adders-5.316
aunders-4.972
 acknow-4.960
Token in
Token ID287
Feature activation+5.99e+00

Pos logit contributions
ogene+1.280
awan+1.211
vati+1.205
undo+1.197
argument+1.169
Neg logit contributions
united-1.241
ridges-1.164
adders-1.162
 acknow-1.103
ards-1.081
Token 21
Token ID2310
Feature activation+9.89e+00

Pos logit contributions
ogene+2.147
awan+2.002
vati+1.936
argument+1.909
undo+1.904
Neg logit contributions
united-4.899
ridges-4.730
adders-4.697
 acknow-4.602
 encount-4.574
Token))
Token ID4008
Feature activation+5.59e-01

Pos logit contributions
ogene+4.442
awan+4.112
argument+4.101
undo+4.045
 Manip+3.817
Neg logit contributions
united-8.970
 encount-8.933
 advoc-8.798
adders-8.688
 acknow-8.638
Token #
Token ID1303
Feature activation+1.12e+01

Pos logit contributions
ogene+0.403
awan+0.381
vati+0.379
undo+0.377
argument+0.368
Neg logit contributions
united-0.396
ridges-0.372
adders-0.371
 acknow-0.353
ards-0.345
Token loop
Token ID9052
Feature activation+5.22e+00

Pos logit contributions
ogene+3.989
awan+3.558
undo+3.499
vati+3.415
argument+3.398
Neg logit contributions
united-11.248
adders-10.521
 advoc-10.506
ridges-10.485
 acknow-10.220
Token over
Token ID625
Feature activation+1.87e+00

Pos logit contributions
ogene+2.904
vati+2.735
awan+2.731
undo+2.650
 Manip+2.638
Neg logit contributions
united-4.453
ridges-4.248
adders-4.201
 advoc-4.165
 encount-4.156
Token the
Token ID262
Feature activation+1.60e+01

Pos logit contributions
ogene+1.434
awan+1.360
vati+1.352
undo+1.345
argument+1.320
Neg logit contributions
united-1.232
ridges-1.157
adders-1.153
 acknow-1.106
 advoc-1.075
Token input
Token ID5128
Feature activation+7.18e+00

Pos logit contributions
ogene+3.603
 circulating+3.285
argument+3.079
 Manip+2.823
undo+2.754
Neg logit contributions
united-17.577
ridges-17.088
 encount-17.071
adders-16.901
 advoc-16.860
Token image
Token ID2939
Feature activation+5.11e+00

Pos logit contributions
ogene+4.954
awan+4.654
 circulating+4.604
undo+4.578
 Manip+4.540
Neg logit contributions
united-5.142
ridges-4.961
adders-4.949
 encount-4.691
 acknow-4.672
Token paths
Token ID13532
Feature activation+4.33e+00

Pos logit contributions
ogene+3.737
vati+3.495
 Manip+3.494
awan+3.469
undo+3.449
Neg logit contributions
united-3.440
ridges-3.150
 acknow-3.142
adders-3.127
 encount-3.084
Token for
Token ID329
Feature activation-2.84e+00

Pos logit contributions
ogene+2.484
awan+2.322
vati+2.310
undo+2.263
argument+2.232
Neg logit contributions
united-3.655
ridges-3.488
adders-3.485
 acknow-3.383
 advoc-3.352
Token image
Token ID2939
Feature activation-5.79e-01

Pos logit contributions
united+1.468
adders+1.318
ridges+1.316
 acknow+1.226
ards+1.218
Neg logit contributions
ogene-2.534
vati-2.440
awan-2.432
undo-2.410
argument-2.336
Token out
Token ID503
Feature activation-2.22e+00

Pos logit contributions
ogene+0.264
awan+0.251
vati+0.250
undo+0.249
argument+0.243
Neg logit contributions
united-0.238
ridges-0.221
adders-0.220
 acknow-0.210
ards-0.206
Token with
Token ID351
Feature activation+2.86e+00

Pos logit contributions
united+1.488
ridges+1.356
adders+1.351
 acknow+1.282
ards+1.277
Neg logit contributions
ogene-1.679
awan-1.613
vati-1.612
undo-1.604
argument-1.556
Token her
Token ID607
Feature activation+3.86e+00

Pos logit contributions
ogene+1.848
awan+1.737
vati+1.726
undo+1.708
argument+1.668
Neg logit contributions
united-2.225
ridges-2.110
adders-2.106
 acknow-2.007
ards-1.962
Token 6
Token ID718
Feature activation+1.06e+01

Pos logit contributions
ogene+2.640
awan+2.492
undo+2.465
vati+2.459
argument+2.405
Neg logit contributions
united-2.844
ridges-2.700
adders-2.695
 acknow-2.536
ards-2.481
Token-
Token ID12
Feature activation+8.54e+00

Pos logit contributions
ogene+6.915
awan+6.550
undo+6.480
vati+6.415
opted+6.366
Neg logit contributions
united-7.349
ridges-6.928
 acknow-6.871
adders-6.837
 advoc-6.743
Tokenmonth
Token ID8424
Feature activation+1.60e+01

Pos logit contributions
ogene+3.927
undo+3.845
awan+3.764
argument+3.694
vati+3.680
Neg logit contributions
united-7.605
 acknow-7.315
adders-7.262
ridges-7.251
 advoc-7.047
Token-
Token ID12
Feature activation+1.34e+01

Pos logit contributions
ogene+8.590
awan+8.019
 circulating+7.922
 Manip+7.656
argument+7.623
Neg logit contributions
 encount-11.594
united-11.549
ridges-11.488
adders-11.371
 acknow-11.010
Tokenold
Token ID727
Feature activation+6.82e+00

Pos logit contributions
argument+5.204
ogene+5.185
awan+5.100
undo+4.998
vati+4.718
Neg logit contributions
 encount-13.090
 advoc-12.888
 acknow-12.755
 therap-11.819
 traged-11.785
Token,
Token ID11
Feature activation+5.67e+00

Pos logit contributions
ogene+4.483
awan+4.257
 Manip+4.245
vati+4.152
undo+4.140
Neg logit contributions
united-5.033
adders-4.826
ridges-4.783
aunders-4.456
 acknow-4.440
Token Conan
Token ID31634
Feature activation+2.41e+00

Pos logit contributions
ogene+3.992
vati+3.770
awan+3.769
 Manip+3.718
undo+3.710
Neg logit contributions
united-4.001
ridges-3.795
adders-3.786
 acknow-3.546
ards-3.494
Token,
Token ID11
Feature activation+3.04e+00

Pos logit contributions
ogene+1.821
awan+1.727
vati+1.711
undo+1.706
argument+1.657
Neg logit contributions
united-1.631
adders-1.523
ridges-1.518
 acknow-1.445
ards-1.397
Token arg
Token ID1822
Feature activation+3.84e+00

Pos logit contributions
united+2.325
adders+2.175
ridges+2.156
 acknow+2.044
ards+2.039
Neg logit contributions
ogene-2.195
awan-2.090
vati-2.081
undo-2.057
opted-1.946
Tokenparse
Token ID29572
Feature activation+3.31e+00

Pos logit contributions
ogene+1.591
awan+1.445
undo+1.437
vati+1.427
argument+1.408
Neg logit contributions
united-3.842
ridges-3.683
adders-3.677
 acknow-3.649
 advoc-3.625
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
ogene+2.515
awan+2.397
vati+2.370
undo+2.361
argument+2.332
Neg logit contributions
united-2.177
ridges-2.065
adders-2.050
 acknow-1.996
 advoc-1.986
TokenArg
Token ID28100
Feature activation+7.19e+00

Pos logit contributions
united+0.885
ridges+0.833
adders+0.833
ards+0.792
 acknow+0.770
Neg logit contributions
ogene-0.660
vati-0.621
awan-0.618
undo-0.608
 circulating-0.574
Tokenument
Token ID1713
Feature activation+8.58e+00

Pos logit contributions
ogene+5.171
argument+5.129
awan+4.907
undo+4.902
vati+4.783
Neg logit contributions
united-4.861
 acknow-4.836
ridges-4.668
 encount-4.634
adders-4.590
TokenParser
Token ID46677
Feature activation+1.59e+01

Pos logit contributions
ogene+5.616
argument+5.405
 Manip+5.300
awan+5.262
undo+5.044
Neg logit contributions
united-6.338
 encount-6.228
 acknow-6.202
 advoc-6.144
adders-5.940
Token()
Token ID3419
Feature activation+1.64e+00

Pos logit contributions
argument+4.747
ogene+4.683
vati+4.343
 Manip+4.234
awan+4.062
Neg logit contributions
 encount-16.997
 advoc-16.583
 acknow-15.939
ridges-15.558
 therap-15.529
Token ap
Token ID2471
Feature activation+3.35e+00

Pos logit contributions
ogene+1.346
awan+1.285
vati+1.276
undo+1.263
argument+1.255
Neg logit contributions
united-1.003
ridges-0.933
adders-0.927
 acknow-0.873
ards-0.852
Token.
Token ID13
Feature activation+1.43e+00

Pos logit contributions
ogene+2.555
awan+2.434
vati+2.403
undo+2.397
argument+2.365
Neg logit contributions
united-2.203
ridges-2.083
adders-2.073
 acknow-2.035
 advoc-2.017
Tokenadd
Token ID2860
Feature activation-2.19e+00

Pos logit contributions
ogene+1.012
awan+0.956
vati+0.953
undo+0.951
argument+0.933
Neg logit contributions
united-1.026
ridges-0.963
adders-0.961
 acknow-0.928
 advoc-0.902
Token_
Token ID62
Feature activation+5.13e+00

Pos logit contributions
united+0.779
adders+0.676
ridges+0.661
ards+0.586
aunders+0.540
Neg logit contributions
ogene-2.341
awan-2.261
vati-2.253
undo-2.238
 Manip-2.178
Token Flyers
Token ID30652
Feature activation-3.85e+00

Pos logit contributions
united+2.878
adders+2.678
ridges+2.675
 acknow+2.550
ards+2.536
Neg logit contributions
ogene-2.951
vati-2.780
awan-2.769
undo-2.738
argument-2.695
Token with
Token ID351
Feature activation+4.31e+00

Pos logit contributions
united+2.444
adders+2.240
ridges+2.202
 acknow+2.157
ards+2.102
Neg logit contributions
ogene-3.036
awan-2.854
vati-2.835
undo-2.826
argument-2.766
Token just
Token ID655
Feature activation+6.49e+00

Pos logit contributions
ogene+2.939
awan+2.803
vati+2.775
undo+2.756
 Manip+2.704
Neg logit contributions
united-3.157
ridges-2.995
adders-2.979
 acknow-2.839
ards-2.750
Token over
Token ID625
Feature activation+3.57e+00

Pos logit contributions
ogene+3.864
awan+3.674
vati+3.592
undo+3.560
 Manip+3.536
Neg logit contributions
united-5.295
ridges-5.037
adders-5.014
 acknow-4.785
 advoc-4.611
Token six
Token ID2237
Feature activation+6.33e+00

Pos logit contributions
ogene+2.118
awan+1.983
vati+1.967
undo+1.953
argument+1.907
Neg logit contributions
united-2.966
adders-2.815
ridges-2.814
 acknow-2.695
ards-2.639
Token minutes
Token ID2431
Feature activation+1.57e+01

Pos logit contributions
ogene+3.130
undo+2.896
 Manip+2.842
awan+2.832
 circulating+2.823
Neg logit contributions
united-5.741
ridges-5.423
adders-5.418
 acknow-5.233
aunders-5.113
Token remaining
Token ID5637
Feature activation+1.24e+00

Pos logit contributions
 circulating+7.759
ogene+7.396
awan+7.248
undo+7.142
argument+7.015
Neg logit contributions
united-12.258
adders-11.929
ridges-11.829
 advoc-11.431
ategories-11.345
Token gave
Token ID2921
Feature activation+2.44e+00

Pos logit contributions
ogene+0.946
awan+0.899
vati+0.893
undo+0.888
argument+0.870
Neg logit contributions
united-0.825
ridges-0.775
adders-0.773
 acknow-0.727
ards-0.711
Token them
Token ID606
Feature activation+6.30e+00

Pos logit contributions
ogene+1.883
awan+1.800
undo+1.780
vati+1.776
 Manip+1.738
Neg logit contributions
united-1.589
ridges-1.495
adders-1.487
 acknow-1.407
ards-1.370
Token an
Token ID281
Feature activation+6.79e+00

Pos logit contributions
ogene+4.015
awan+3.878
vati+3.820
undo+3.780
 Manip+3.758
Neg logit contributions
united-4.775
ridges-4.538
adders-4.463
 acknow-4.237
ards-4.128
Token opportunity
Token ID3663
Feature activation-6.06e+00

Pos logit contributions
ogene+4.519
awan+4.318
undo+4.298
vati+4.226
argument+4.192
Neg logit contributions
united-5.016
ridges-4.756
adders-4.693
 acknow-4.396
ards-4.383
Token flags
Token ID9701
Feature activation+7.02e+00

Pos logit contributions
ogene+2.005
awan+1.925
vati+1.924
undo+1.914
argument+1.873
Neg logit contributions
united-1.161
adders-1.065
ridges-1.064
 acknow-0.990
ards-0.952
Token,
Token ID11
Feature activation+3.27e+00

Pos logit contributions
ogene+4.628
awan+4.465
vati+4.381
undo+4.349
argument+4.262
Neg logit contributions
united-5.114
adders-4.878
ridges-4.855
 acknow-4.660
aunders-4.561
Token during
Token ID1141
Feature activation-1.17e-01

Pos logit contributions
ogene+2.389
awan+2.273
vati+2.259
undo+2.234
argument+2.190
Neg logit contributions
united-2.264
ridges-2.135
adders-2.132
 acknow-2.013
ards-1.964
Token the
Token ID262
Feature activation+1.56e+00

Pos logit contributions
united+0.080
ridges+0.075
adders+0.074
 acknow+0.070
ards+0.069
Neg logit contributions
ogene-0.088
awan-0.083
vati-0.083
undo-0.082
argument-0.080
Token '
Token ID705
Feature activation-3.70e+00

Pos logit contributions
ogene+1.102
awan+1.041
vati+1.035
undo+1.028
argument+1.004
Neg logit contributions
united-1.127
ridges-1.060
adders-1.058
 acknow-1.006
ards-0.986
TokenYes
Token ID5297
Feature activation+1.57e+01

Pos logit contributions
united+2.524
ridges+2.263
adders+2.239
ards+2.088
 acknow+2.043
Neg logit contributions
ogene-2.851
awan-2.671
vati-2.646
 Manip-2.594
undo-2.593
Token'
Token ID6
Feature activation+6.41e+00

Pos logit contributions
undo+8.252
ogene+8.244
awan+8.224
 Demand+8.151
 Manip+7.918
Neg logit contributions
adders-11.156
 acknow-11.126
united-10.994
ridges-10.920
aunders-10.463
Token vote
Token ID3015
Feature activation+5.36e+00

Pos logit contributions
ogene+4.185
awan+3.993
undo+3.949
vati+3.946
argument+3.881
Neg logit contributions
united-4.709
adders-4.553
ridges-4.541
 acknow-4.281
aunders-4.171
Token closing
Token ID9605
Feature activation-4.06e+00

Pos logit contributions
ogene+3.592
awan+3.456
undo+3.404
vati+3.401
argument+3.322
Neg logit contributions
united-3.903
ridges-3.765
adders-3.760
 acknow-3.524
aunders-3.451
Token campaign
Token ID1923
Feature activation+4.52e+00

Pos logit contributions
united+2.724
ridges+2.588
adders+2.540
 acknow+2.428
ards+2.369
Neg logit contributions
ogene-3.004
awan-2.864
vati-2.847
undo-2.781
argument-2.709
Token in
Token ID287
Feature activation+2.02e+00

Pos logit contributions
ogene+2.799
awan+2.651
undo+2.611
vati+2.609
argument+2.542
Neg logit contributions
united-3.565
adders-3.432
ridges-3.426
 acknow-3.238
aunders-3.164
Token that
Token ID326
Feature activation+3.91e+00

Pos logit contributions
ogene+0.065
awan+0.062
vati+0.062
undo+0.061
argument+0.060
Neg logit contributions
united-0.043
ridges-0.040
adders-0.040
 acknow-0.037
ards-0.036
Token roughly
Token ID7323
Feature activation+4.56e+00

Pos logit contributions
ogene+2.874
awan+2.729
vati+2.692
undo+2.686
argument+2.642
Neg logit contributions
united-2.693
ridges-2.530
adders-2.524
 acknow-2.368
aunders-2.313
Token two
Token ID734
Feature activation+8.43e+00

Pos logit contributions
ogene+2.612
awan+2.439
vati+2.428
undo+2.379
 Manip+2.350
Neg logit contributions
united-3.863
adders-3.697
ridges-3.688
 acknow-3.511
aunders-3.434
Token out
Token ID503
Feature activation+5.97e+00

Pos logit contributions
ogene+5.938
awan+5.492
 Manip+5.415
undo+5.414
vati+5.386
Neg logit contributions
united-5.772
adders-5.386
ridges-5.380
 acknow-5.213
aunders-5.079
Token of
Token ID286
Feature activation+9.97e+00

Pos logit contributions
ogene+3.930
awan+3.657
undo+3.616
vati+3.613
argument+3.577
Neg logit contributions
united-4.469
adders-4.283
 acknow-4.263
ridges-4.250
 encount-4.198
Token every
Token ID790
Feature activation+1.56e+01

Pos logit contributions
ogene+4.869
awan+4.411
vati+4.355
 Manip+4.272
argument+4.215
Neg logit contributions
united-8.852
adders-8.616
ridges-8.460
 acknow-8.342
 encount-8.099
Token five
Token ID1936
Feature activation+2.64e+00

Pos logit contributions
ogene+6.233
awan+5.200
argument+5.053
 circulating+4.970
vati+4.896
Neg logit contributions
united-14.377
adders-14.318
ridges-13.963
redits-13.762
 acknow-13.527
Token Americans
Token ID3399
Feature activation-2.38e-02

Pos logit contributions
ogene+1.655
awan+1.552
undo+1.535
vati+1.530
argument+1.486
Neg logit contributions
united-2.116
ridges-2.003
adders-1.999
 acknow-1.897
ards-1.867
Token knows
Token ID4206
Feature activation-2.66e+00

Pos logit contributions
united+0.017
ridges+0.016
adders+0.016
 acknow+0.015
ards+0.015
Neg logit contributions
ogene-0.017
awan-0.016
vati-0.016
undo-0.016
argument-0.015
Token about
Token ID546
Feature activation+5.20e-01

Pos logit contributions
united+1.685
ridges+1.566
adders+1.562
 acknow+1.478
ards+1.467
Neg logit contributions
ogene-2.108
awan-2.006
vati-2.005
undo-1.988
argument-1.945
Token the
Token ID262
Feature activation-1.34e+00

Pos logit contributions
ogene+0.408
awan+0.387
vati+0.385
undo+0.383
argument+0.375
Neg logit contributions
united-0.335
ridges-0.313
adders-0.312
 acknow-0.295
ards-0.288
Token the
Token ID262
Feature activation+4.95e+00

Pos logit contributions
ogene+1.887
awan+1.791
vati+1.777
undo+1.767
argument+1.727
Neg logit contributions
united-1.744
ridges-1.633
adders-1.632
 acknow-1.543
ards-1.512
Token most
Token ID749
Feature activation+1.37e+01

Pos logit contributions
ogene+4.313
awan+4.147
vati+4.096
undo+4.096
 Manip+4.043
Neg logit contributions
united-2.702
adders-2.502
ridges-2.479
 acknow-2.248
ards-2.210
Token potent
Token ID16739
Feature activation+1.40e+01

Pos logit contributions
ogene+9.506
 circulating+8.908
 Manip+8.860
undo+8.726
awan+8.619
Neg logit contributions
united-8.710
ridges-8.408
adders-8.350
ategories-7.889
aunders-7.855
Token--
Token ID438
Feature activation+6.00e+00

Pos logit contributions
ogene+9.010
undo+8.444
 circulating+8.396
 Manip+8.361
argument+8.272
Neg logit contributions
united-9.740
adders-9.484
ridges-9.302
aunders-8.893
redits-8.504
Tokenand
Token ID392
Feature activation+4.30e+00

Pos logit contributions
ogene+4.772
undo+4.582
awan+4.576
vati+4.551
argument+4.516
Neg logit contributions
united-3.604
adders-3.429
ridges-3.404
 acknow-3.289
 advoc-3.124
Token dangerous
Token ID4923
Feature activation+1.56e+01

Pos logit contributions
ogene+3.306
awan+3.131
undo+3.117
vati+3.087
 Manip+3.038
Neg logit contributions
united-2.798
adders-2.641
ridges-2.640
 acknow-2.428
aunders-2.392
Token--
Token ID438
Feature activation+9.51e+00

Pos logit contributions
ogene+9.690
awan+9.215
argument+9.042
undo+8.981
 Manip+8.964
Neg logit contributions
united-10.696
adders-10.384
ridges-10.019
 acknow-9.575
aunders-9.454
TokenAmerican
Token ID7437
Feature activation+2.46e+00

Pos logit contributions
ogene+7.039
argument+6.916
undo+6.835
awan+6.759
vati+6.674
Neg logit contributions
united-5.805
adders-5.717
 acknow-5.638
ridges-5.622
aunders-5.306
Token energy
Token ID2568
Feature activation+6.56e+00

Pos logit contributions
ogene+1.837
awan+1.738
vati+1.720
undo+1.716
argument+1.686
Neg logit contributions
united-1.679
ridges-1.576
adders-1.573
 acknow-1.472
ards-1.447
Token entrepreneur
Token ID19500
Feature activation+5.70e+00

Pos logit contributions
ogene+4.499
awan+4.212
undo+4.192
argument+4.119
 Manip+4.110
Neg logit contributions
united-4.817
adders-4.532
ridges-4.523
 acknow-4.226
ards-4.212
Token right
Token ID826
Feature activation+6.49e+00

Pos logit contributions
ogene+4.053
awan+3.827
vati+3.794
undo+3.786
 Manip+3.711
Neg logit contributions
united-4.018
adders-3.832
ridges-3.799
 acknow-3.559
aunders-3.478
Token90
Token ID3829
Feature activation+9.91e-01

Pos logit contributions
united+1.955
ridges+1.797
adders+1.787
ards+1.680
 acknow+1.675
Neg logit contributions
ogene-2.250
awan-2.119
vati-2.116
undo-2.060
 Manip-2.034
Token%
Token ID4
Feature activation+4.53e+00

Pos logit contributions
ogene+0.765
vati+0.724
awan+0.724
undo+0.717
argument+0.703
Neg logit contributions
united-0.653
ridges-0.608
adders-0.606
 acknow-0.575
ards-0.565
Token of
Token ID286
Feature activation+5.95e+00

Pos logit contributions
ogene+3.141
awan+3.001
undo+2.977
vati+2.964
 Manip+2.876
Neg logit contributions
united-3.251
ridges-3.104
adders-3.085
 acknow-2.862
ards-2.856
Token
Token ID564
Feature activation+2.86e+00

Pos logit contributions
ogene+4.072
undo+3.845
awan+3.831
vati+3.782
 Manip+3.727
Neg logit contributions
united-4.296
ridges-4.088
adders-4.080
 acknow-3.871
ards-3.730
Token
Token ID246
Feature activation+3.08e+00

Pos logit contributions
ogene+2.247
vati+2.138
awan+2.132
undo+2.129
argument+2.077
Neg logit contributions
united-1.811
ridges-1.703
adders-1.698
 acknow-1.610
ards-1.556
TokenYes
Token ID5297
Feature activation+1.56e+01

Pos logit contributions
ogene+2.147
awan+2.031
vati+2.021
undo+2.012
argument+1.964
Neg logit contributions
united-2.236
ridges-2.114
adders-2.109
 acknow-2.009
ards-1.957
Token
Token ID447
Feature activation+5.07e+00

Pos logit contributions
ogene+8.321
undo+8.295
awan+8.152
 circulating+8.010
argument+7.916
Neg logit contributions
united-10.943
ridges-10.873
adders-10.766
 acknow-10.385
 encount-10.164
Token
Token ID247
Feature activation+2.22e+00

Pos logit contributions
ogene+3.657
vati+3.508
undo+3.508
awan+3.464
argument+3.408
Neg logit contributions
united-3.509
ridges-3.293
adders-3.257
 acknow-3.155
 advoc-2.997
Token votes
Token ID5690
Feature activation+7.11e+00

Pos logit contributions
ogene+1.794
awan+1.706
vati+1.698
undo+1.688
argument+1.654
Neg logit contributions
united-1.388
ridges-1.291
adders-1.288
 acknow-1.214
ards-1.187
Token with
Token ID351
Feature activation+3.10e+00

Pos logit contributions
ogene+4.928
undo+4.753
awan+4.694
vati+4.647
 Manip+4.554
Neg logit contributions
united-4.885
ridges-4.763
adders-4.710
 acknow-4.387
aunders-4.306
Token a
Token ID257
Feature activation+3.70e+00

Pos logit contributions
ogene+1.884
awan+1.768
vati+1.753
undo+1.743
argument+1.695
Neg logit contributions
united-2.525
ridges-2.405
adders-2.397
 acknow-2.289
ards-2.240
Token easy
Token ID2562
Feature activation+3.36e-01

Pos logit contributions
united+5.105
adders+4.807
ridges+4.615
 acknow+4.611
ards+4.469
Neg logit contributions
ogene-8.300
vati-7.925
awan-7.783
undo-7.628
argument-7.574
Token despite
Token ID3805
Feature activation+2.03e+00

Pos logit contributions
ogene+0.261
awan+0.248
vati+0.247
undo+0.245
argument+0.240
Neg logit contributions
united-0.219
ridges-0.204
adders-0.204
 acknow-0.193
ards-0.189
Token the
Token ID262
Feature activation+5.81e+00

Pos logit contributions
ogene+1.489
awan+1.403
undo+1.393
vati+1.389
argument+1.357
Neg logit contributions
united-1.423
ridges-1.330
adders-1.327
 acknow-1.253
ards-1.231
Token amount
Token ID2033
Feature activation+2.77e+00

Pos logit contributions
ogene+4.376
undo+4.138
awan+4.136
vati+4.068
argument+4.043
Neg logit contributions
united-3.872
ridges-3.564
adders-3.561
 acknow-3.294
ards-3.281
Token of
Token ID286
Feature activation+5.88e+00

Pos logit contributions
ogene+1.890
awan+1.785
undo+1.783
vati+1.770
argument+1.725
Neg logit contributions
united-2.054
adders-1.913
ridges-1.911
 acknow-1.834
ards-1.783
Token time
Token ID640
Feature activation+1.56e+01

Pos logit contributions
ogene+4.317
undo+4.130
awan+4.080
vati+4.051
 Manip+4.015
Neg logit contributions
united-3.986
ridges-3.687
adders-3.677
ards-3.402
 acknow-3.383
Token it
Token ID340
Feature activation+8.40e+00

Pos logit contributions
ogene+8.716
awan+8.240
 Manip+8.192
 circulating+8.180
undo+8.086
Neg logit contributions
united-11.124
adders-10.607
ridges-10.398
aunders-9.871
 advoc-9.857
Token takes
Token ID2753
Feature activation+3.65e+00

Pos logit contributions
ogene+6.038
awan+5.715
undo+5.630
vati+5.602
 Manip+5.514
Neg logit contributions
united-5.716
ridges-5.326
adders-5.275
 acknow-5.012
aunders-4.912
Token and
Token ID290
Feature activation+4.41e+00

Pos logit contributions
ogene+2.707
awan+2.570
vati+2.550
undo+2.547
argument+2.483
Neg logit contributions
united-2.479
ridges-2.333
adders-2.331
 acknow-2.209
ards-2.159
Token it
Token ID340
Feature activation+7.46e+00

Pos logit contributions
ogene+3.218
awan+3.040
undo+3.029
vati+3.007
 Manip+2.958
Neg logit contributions
united-3.062
ridges-2.855
adders-2.847
 acknow-2.656
ards-2.627
Token bright
Token ID6016
Feature activation-1.81e+00

Pos logit contributions
ogene+4.945
awan+4.652
undo+4.610
vati+4.560
argument+4.458
Neg logit contributions
united-5.487
ridges-5.216
adders-5.120
 acknow-4.899
 advoc-4.861
TokenJonathan
Token ID30365
Feature activation-1.20e+01

Pos logit contributions
united+2.017
adders+1.831
ridges+1.830
ards+1.717
 acknow+1.683
Neg logit contributions
ogene-2.435
vati-2.294
awan-2.275
undo-2.232
 Manip-2.206
Token Warren
Token ID11328
Feature activation-1.11e+01

Pos logit contributions
united+7.670
adders+7.248
ridges+7.246
ards+6.848
 acknow+6.764
Neg logit contributions
ogene-8.325
vati-8.163
awan-7.915
undo-7.858
argument-7.754
Token,
Token ID11
Feature activation-5.75e+00

Pos logit contributions
united+6.395
ridges+6.078
adders+5.936
ards+5.853
 acknow+5.731
Neg logit contributions
ogene-8.637
vati-8.191
awan-8.037
undo-7.875
argument-7.757
Token Director
Token ID5890
Feature activation-2.18e+00

Pos logit contributions
united+3.926
ridges+3.695
adders+3.659
 acknow+3.538
ards+3.440
Neg logit contributions
ogene-4.210
vati-3.978
undo-3.907
awan-3.881
argument-3.785
Token of
Token ID286
Feature activation-1.27e+00

Pos logit contributions
united+1.481
adders+1.380
ridges+1.376
 acknow+1.301
ards+1.281
Neg logit contributions
ogene-1.642
awan-1.548
vati-1.545
undo-1.528
argument-1.491
Token Time
Token ID3862
Feature activation+1.54e+01

Pos logit contributions
united+0.888
ridges+0.831
adders+0.831
 acknow+0.786
ards+0.773
Neg logit contributions
ogene-0.939
awan-0.885
vati-0.884
undo-0.875
argument-0.855
Token4
Token ID19
Feature activation+4.40e+00

Pos logit contributions
 Manip+9.411
 Demand+9.168
undo+9.125
ogene+9.104
awan+8.982
Neg logit contributions
united-9.643
adders-9.447
ridges-9.370
 acknow-9.336
 encount-9.205
Tokensleep
Token ID42832
Feature activation+2.06e+00

Pos logit contributions
ogene+3.122
awan+2.989
undo+2.979
vati+2.930
argument+2.918
Neg logit contributions
united-3.005
ridges-2.891
adders-2.884
 acknow-2.763
aunders-2.670
Token.
Token ID13
Feature activation-3.13e+00

Pos logit contributions
ogene+1.514
awan+1.457
undo+1.439
vati+1.431
argument+1.407
Neg logit contributions
united-1.396
adders-1.315
ridges-1.313
 acknow-1.254
ards-1.200
Tokenco
Token ID1073
Feature activation-9.00e+00

Pos logit contributions
united+1.894
ridges+1.741
adders+1.725
 acknow+1.603
ards+1.589
Neg logit contributions
ogene-2.603
vati-2.471
awan-2.456
undo-2.417
argument-2.379
Token.
Token ID13
Feature activation-8.17e+00

Pos logit contributions
united+4.787
ridges+4.364
adders+4.221
ards+3.997
 acknow+3.974
Neg logit contributions
ogene-7.854
vati-7.668
awan-7.373
undo-7.362
 Manip-7.220
Token their
Token ID511
Feature activation+1.68e+00

Pos logit contributions
united+0.180
ridges+0.167
adders+0.167
 acknow+0.156
ards+0.152
Neg logit contributions
ogene-0.261
awan-0.249
vati-0.248
undo-0.247
argument-0.242
Token cost
Token ID1575
Feature activation+4.25e+00

Pos logit contributions
ogene+1.270
awan+1.205
vati+1.197
undo+1.193
argument+1.168
Neg logit contributions
united-1.130
adders-1.060
ridges-1.057
 acknow-1.001
ards-0.979
Token.
Token ID13
Feature activation+3.18e+00

Pos logit contributions
ogene+3.083
awan+2.904
vati+2.892
undo+2.884
argument+2.831
Neg logit contributions
united-2.933
ridges-2.767
adders-2.732
 acknow-2.631
 advoc-2.537
Token On
Token ID1550
Feature activation+1.49e+00

Pos logit contributions
ogene+2.203
awan+2.090
undo+2.073
vati+2.067
 Manip+2.030
Neg logit contributions
united-2.313
adders-2.174
ridges-2.173
 acknow-2.081
 advoc-2.024
Token the
Token ID262
Feature activation+8.15e+00

Pos logit contributions
ogene+1.137
awan+1.078
vati+1.069
undo+1.064
argument+1.044
Neg logit contributions
united-0.999
adders-0.925
ridges-0.925
 acknow-0.879
ards-0.857
Token other
Token ID584
Feature activation+1.52e+01

Pos logit contributions
ogene+5.727
awan+5.368
undo+5.260
vati+5.216
argument+5.104
Neg logit contributions
united-5.780
adders-5.394
ridges-5.270
 acknow-5.066
ards-5.007
Token hand
Token ID1021
Feature activation+4.71e+00

Pos logit contributions
ogene+6.524
vati+6.123
awan+6.113
argument+5.925
 circulating+5.738
Neg logit contributions
united-13.232
adders-11.979
 acknow-11.931
ridges-11.909
aunders-11.476
Token,
Token ID11
Feature activation+3.10e+00

Pos logit contributions
ogene+3.049
awan+2.866
vati+2.819
undo+2.817
argument+2.793
Neg logit contributions
united-3.659
adders-3.438
ridges-3.419
 acknow-3.317
 encount-3.264
Token if
Token ID611
Feature activation-1.16e+00

Pos logit contributions
ogene+2.123
awan+2.004
vati+1.996
undo+1.990
argument+1.940
Neg logit contributions
united-2.293
adders-2.138
ridges-2.130
 acknow-2.023
ards-1.985
Token the
Token ID262
Feature activation-4.99e-01

Pos logit contributions
united+0.762
ridges+0.720
adders+0.714
 acknow+0.678
ards+0.665
Neg logit contributions
ogene-0.892
awan-0.848
vati-0.844
undo-0.836
argument-0.818
Token sidewalk
Token ID24058
Feature activation+2.62e-01

Pos logit contributions
united+0.357
ridges+0.337
adders+0.334
 acknow+0.319
ards+0.312
Neg logit contributions
ogene-0.356
awan-0.337
vati-0.334
undo-0.331
argument-0.324
Token nearest
Token ID16936
Feature activation+3.25e+00

Pos logit contributions
united+1.718
ridges+1.602
adders+1.599
 acknow+1.515
ards+1.497
Neg logit contributions
ogene-1.894
vati-1.794
awan-1.787
undo-1.774
argument-1.723
Token big
Token ID1263
Feature activation-4.17e+00

Pos logit contributions
ogene+2.442
awan+2.318
undo+2.289
vati+2.287
argument+2.251
Neg logit contributions
united-2.188
ridges-2.060
adders-2.048
 acknow-1.925
ards-1.888
Token city
Token ID1748
Feature activation-3.62e+00

Pos logit contributions
united+2.560
ridges+2.405
adders+2.361
 acknow+2.232
ards+2.213
Neg logit contributions
ogene-3.342
awan-3.214
vati-3.206
undo-3.144
argument-3.102
Token is
Token ID318
Feature activation-1.49e+00

Pos logit contributions
united+2.298
ridges+2.068
adders+2.062
 acknow+1.967
ards+1.955
Neg logit contributions
ogene-2.880
vati-2.730
awan-2.723
undo-2.703
argument-2.632
Token an
Token ID281
Feature activation+6.91e+00

Pos logit contributions
united+1.116
ridges+1.042
adders+1.036
 acknow+0.995
ards+0.979
Neg logit contributions
ogene-1.020
vati-0.964
awan-0.959
undo-0.947
argument-0.927
Token hour
Token ID1711
Feature activation+1.52e+01

Pos logit contributions
ogene+4.965
awan+4.712
undo+4.642
vati+4.578
argument+4.517
Neg logit contributions
united-4.797
adders-4.520
ridges-4.491
 acknow-4.259
ards-4.176
Token away
Token ID1497
Feature activation+2.52e+00

Pos logit contributions
ogene+8.782
awan+8.637
undo+8.587
argument+8.290
 Manip+8.085
Neg logit contributions
united-10.829
adders-10.597
ridges-10.423
 acknow-10.042
ategories-9.959
Token,"
Token ID553
Feature activation-3.02e+00

Pos logit contributions
ogene+1.945
awan+1.851
vati+1.832
undo+1.832
argument+1.788
Neg logit contributions
united-1.637
adders-1.539
ridges-1.538
 acknow-1.455
ards-1.414
Token says
Token ID1139
Feature activation-3.08e-02

Pos logit contributions
united+1.907
ridges+1.786
adders+1.783
 acknow+1.711
ards+1.667
Neg logit contributions
ogene-2.370
vati-2.249
undo-2.220
awan-2.220
argument-2.158
Token A
Token ID317
Feature activation-3.74e-01

Pos logit contributions
united+0.023
ridges+0.021
adders+0.021
 acknow+0.020
ards+0.020
Neg logit contributions
ogene-0.021
vati-0.020
awan-0.020
undo-0.020
argument-0.019
Tokendu
Token ID646
Feature activation+7.72e-02

Pos logit contributions
united+0.268
ridges+0.253
adders+0.252
 acknow+0.238
ards+0.236
Neg logit contributions
ogene-0.266
vati-0.250
awan-0.250
undo-0.248
argument-0.242
Tokenv
Token ID85
Feature activation-5.93e+00

Pos logit contributions
united+2.988
ridges+2.623
adders+2.612
ards+2.472
aunders+2.346
Neg logit contributions
ogene-3.881
awan-3.662
 Manip-3.575
vati-3.536
undo-3.531
Token2
Token ID17
Feature activation-5.65e+00

Pos logit contributions
united+3.645
adders+3.194
ridges+3.141
ards+3.011
ategories+2.881
Neg logit contributions
ogene-4.902
awan-4.590
vati-4.560
opted-4.446
undo-4.443
Token.
Token ID764
Feature activation-8.04e+00

Pos logit contributions
united+3.264
adders+2.915
ridges+2.913
ards+2.812
aunders+2.657
Neg logit contributions
ogene-4.706
vati-4.478
awan-4.438
undo-4.385
opted-4.296
Token Ga
Token ID12822
Feature activation-7.75e+00

Pos logit contributions
united+4.252
 acknow+3.989
ridges+3.842
adders+3.835
 darkest+3.755
Neg logit contributions
ogene-6.583
vati-6.422
awan-6.373
undo-6.310
argument-6.186
Tokenussian
Token ID31562
Feature activation-1.05e+01

Pos logit contributions
united+3.913
adders+3.282
ridges+3.183
aunders+2.970
ards+2.921
Neg logit contributions
ogene-6.850
awan-6.632
vati-6.604
 Manip-6.596
 Demand-6.492
TokenBl
Token ID3629
Feature activation-1.90e+01

Pos logit contributions
united+3.558
adders+3.151
ridges+3.052
ards+3.019
 darkest+2.839
Neg logit contributions
ogene-10.565
vati-10.173
undo-10.136
awan-9.999
opted-9.960
Tokenur
Token ID333
Feature activation-3.28e+00

Pos logit contributions
united+8.448
ridges+8.214
adders+7.945
ards+7.326
ights+7.027
Neg logit contributions
ogene-14.741
 Manip-14.501
 circulating-14.278
opted-14.073
�醒-14.016
Token (
Token ID357
Feature activation-3.36e-01

Pos logit contributions
united+1.819
adders+1.655
ridges+1.648
ards+1.586
 acknow+1.485
Neg logit contributions
ogene-2.840
vati-2.690
undo-2.690
awan-2.683
opted-2.600
Token gray
Token ID12768
Feature activation-3.06e+00

Pos logit contributions
united+0.230
ridges+0.215
adders+0.215
 acknow+0.204
ards+0.199
Neg logit contributions
ogene-0.251
awan-0.238
vati-0.237
undo-0.235
argument-0.230
Token,
Token ID837
Feature activation-1.60e+00

Pos logit contributions
united+1.620
adders+1.474
ridges+1.473
ards+1.390
aunders+1.297
Neg logit contributions
ogene-2.728
vati-2.585
awan-2.582
undo-2.564
opted-2.511
Token (
Token ID357
Feature activation+2.98e+00

Pos logit contributions
united+0.948
ridges+0.871
adders+0.870
 acknow+0.821
ards+0.808
Neg logit contributions
ogene-1.322
awan-1.261
vati-1.260
undo-1.251
argument-1.220
Token
Token ID1399
Feature activation-6.87e+00

Pos logit contributions
united+1.833
ridges+1.699
adders+1.697
 acknow+1.568
ards+1.528
Neg logit contributions
ogene-2.966
vati-2.827
awan-2.817
undo-2.814
argument-2.752
Token what
Token ID644
Feature activation-6.61e-01

Pos logit contributions
united+3.783
ridges+3.490
adders+3.448
 acknow+3.290
ards+3.180
Neg logit contributions
ogene-5.777
vati-5.549
undo-5.478
awan-5.466
argument-5.350
Token
Token ID447
Feature activation-1.69e+00

Pos logit contributions
united+0.514
ridges+0.485
adders+0.485
 acknow+0.462
ards+0.454
Neg logit contributions
ogene-0.430
awan-0.405
vati-0.402
undo-0.399
argument-0.388
Token
Token ID247
Feature activation-3.97e+00

Pos logit contributions
united+1.378
ridges+1.306
adders+1.306
ards+1.236
 acknow+1.233
Neg logit contributions
ogene-1.030
awan-0.968
vati-0.964
undo-0.943
 Manip-0.928
Tokens
Token ID82
Feature activation-1.97e+00

Pos logit contributions
united+2.216
adders+2.051
ridges+2.043
ards+1.898
aunders+1.779
Neg logit contributions
ogene-3.456
awan-3.279
vati-3.266
undo-3.200
 Manip-3.184
Token past
Token ID1613
Feature activation-1.74e+01

Pos logit contributions
united+1.657
adders+1.565
ridges+1.558
 acknow+1.520
ards+1.492
Neg logit contributions
ogene-1.140
awan-1.061
vati-1.061
undo-1.051
argument-1.018
Token the
Token ID262
Feature activation-6.75e+00

Pos logit contributions
united+5.659
ards+4.501
adders+4.349
ridges+4.087
 darkest+4.033
Neg logit contributions
ogene-15.980
vati-15.829
undo-15.567
awan-15.533
opted-15.349
Token edge
Token ID5743
Feature activation-1.31e+01

Pos logit contributions
united+3.386
ridges+2.999
adders+2.970
ards+2.831
 acknow+2.773
Neg logit contributions
ogene-6.023
vati-5.841
undo-5.791
awan-5.765
opted-5.637
Token of
Token ID286
Feature activation-1.34e+01

Pos logit contributions
united+5.640
ards+4.749
adders+4.632
ridges+4.416
 darkest+3.961
Neg logit contributions
ogene-11.716
vati-11.449
opted-11.178
undo-11.143
awan-11.135
Token the
Token ID262
Feature activation-2.56e+00

Pos logit contributions
united+5.200
adders+4.338
ridges+3.976
ards+3.964
 Morrow+3.866
Neg logit contributions
ogene-12.542
vati-12.310
awan-12.015
opted-11.887
undo-11.877
Token Universe
Token ID11950
Feature activation+3.91e+00

Pos logit contributions
united+1.524
adders+1.380
ridges+1.368
 acknow+1.287
ards+1.260
Neg logit contributions
ogene-2.147
vati-2.051
awan-2.038
undo-2.009
argument-1.978
Token (
Token ID357
Feature activation-3.61e+00

Pos logit contributions
united+1.505
adders+1.385
ridges+1.382
 acknow+1.295
ards+1.284
Neg logit contributions
ogene-1.953
awan-1.854
vati-1.850
undo-1.836
argument-1.792
Tokenas
Token ID292
Feature activation+3.69e+00

Pos logit contributions
united+1.326
adders+1.120
ridges+1.094
ards+0.937
aunders+0.881
Neg logit contributions
ogene-3.888
vati-3.683
 Manip-3.674
awan-3.668
undo-3.646
Token opposed
Token ID6886
Feature activation-5.28e-01

Pos logit contributions
ogene+2.383
undo+2.300
awan+2.284
vati+2.270
argument+2.236
Neg logit contributions
united-2.748
ridges-2.688
adders-2.640
 acknow-2.592
ards-2.525
Token to
Token ID284
Feature activation+2.79e+00

Pos logit contributions
united+0.352
adders+0.327
ridges+0.327
 acknow+0.306
ards+0.304
Neg logit contributions
ogene-0.404
awan-0.383
vati-0.381
undo-0.379
argument-0.370
Token back
Token ID736
Feature activation-1.19e+01

Pos logit contributions
ogene+1.658
awan+1.574
vati+1.572
undo+1.540
argument+1.508
Neg logit contributions
united-2.315
ridges-2.209
adders-2.188
 acknow-2.095
 advoc-2.058
Token in
Token ID287
Feature activation-1.73e+01

Pos logit contributions
united+5.800
ards+4.929
ridges+4.848
adders+4.784
 darkest+4.528
Neg logit contributions
ogene-10.056
vati-9.665
awan-9.603
undo-9.484
aimon-9.354
Token the
Token ID262
Feature activation-6.85e+00

Pos logit contributions
united+5.173
ards+4.356
ridges+3.898
adders+3.891
 pity+3.844
Neg logit contributions
ogene-16.124
vati-15.608
�醒-15.596
undo-15.546
awan-15.393
Token day
Token ID1110
Feature activation+6.83e+00

Pos logit contributions
united+3.697
ards+3.341
ridges+3.288
adders+3.210
 acknow+3.120
Neg logit contributions
ogene-5.672
vati-5.477
undo-5.457
argument-5.381
awan-5.373
Token playing
Token ID2712
Feature activation-3.32e+00

Pos logit contributions
ogene+2.456
awan+2.357
 circulating+2.355
undo+2.237
vati+2.223
Neg logit contributions
united-7.044
adders-6.830
ridges-6.792
 acknow-6.605
 advoc-6.583
Token at
Token ID379
Feature activation+1.87e+00

Pos logit contributions
united+2.124
adders+1.965
ridges+1.962
ards+1.855
 acknow+1.796
Neg logit contributions
ogene-2.643
undo-2.469
vati-2.465
awan-2.461
argument-2.389
Token 640
Token ID33759
Feature activation-1.94e+00

Pos logit contributions
ogene+1.043
awan+0.967
vati+0.965
undo+0.949
argument+0.926
Neg logit contributions
united-1.628
ridges-1.546
adders-1.544
 acknow-1.485
ards-1.451
Token recovery
Token ID7628
Feature activation-5.90e+00

Pos logit contributions
united+4.745
adders+4.533
ridges+4.528
 advoc+4.351
 acknow+4.323
Neg logit contributions
ogene-6.043
awan-5.666
undo-5.646
vati-5.625
argument-5.508
Token from
Token ID422
Feature activation-5.64e+00

Pos logit contributions
united+3.633
adders+3.422
ridges+3.344
 acknow+3.298
ards+3.258
Neg logit contributions
ogene-4.654
awan-4.377
undo-4.341
vati-4.248
argument-4.186
Token the
Token ID262
Feature activation-4.55e+00

Pos logit contributions
united+3.384
ridges+3.223
adders+3.155
 acknow+3.038
ards+2.950
Neg logit contributions
ogene-4.465
awan-4.237
vati-4.235
undo-4.228
argument-4.114
Token crisis
Token ID4902
Feature activation-3.83e+00

Pos logit contributions
united+2.979
ridges+2.802
adders+2.750
 acknow+2.653
ards+2.602
Neg logit contributions
ogene-3.404
awan-3.243
undo-3.229
vati-3.222
argument-3.141
Token by
Token ID416
Feature activation-2.64e+00

Pos logit contributions
united+2.497
ridges+2.322
adders+2.306
 acknow+2.221
ards+2.208
Neg logit contributions
ogene-2.905
awan-2.792
undo-2.725
vati-2.719
argument-2.658
Token opening
Token ID4756
Feature activation-1.71e+01

Pos logit contributions
united+1.946
adders+1.811
ridges+1.802
 acknow+1.766
ards+1.721
Neg logit contributions
ogene-1.813
awan-1.714
vati-1.711
undo-1.687
argument-1.640
Token up
Token ID510
Feature activation-1.11e+01

Pos logit contributions
adders+8.234
united+8.154
ridges+7.868
 pensions+7.865
 acknow+7.747
Neg logit contributions
ogene-12.795
undo-12.377
vati-12.264
awan-12.217
argument-12.008
Token trade
Token ID3292
Feature activation-6.37e+00

Pos logit contributions
united+6.256
adders+6.035
ridges+5.848
 acknow+5.670
ards+5.630
Neg logit contributions
ogene-8.511
vati-8.205
undo-8.164
awan-8.037
argument-8.027
Token in
Token ID287
Feature activation-4.54e+00

Pos logit contributions
united+4.031
adders+3.759
ridges+3.695
 acknow+3.500
ards+3.500
Neg logit contributions
ogene-5.002
awan-4.751
vati-4.722
undo-4.714
argument-4.657
Token services
Token ID2594
Feature activation-1.95e+00

Pos logit contributions
united+2.587
ridges+2.436
adders+2.428
 acknow+2.275
ards+2.243
Neg logit contributions
ogene-3.826
undo-3.656
vati-3.609
awan-3.593
argument-3.539
Token in
Token ID287
Feature activation-4.45e+00

Pos logit contributions
united+1.276
adders+1.181
ridges+1.181
 acknow+1.114
ards+1.097
Neg logit contributions
ogene-1.524
awan-1.445
vati-1.430
undo-1.428
argument-1.395
Token
Token ID447
Feature activation+2.04e+00

Pos logit contributions
united+1.123
adders+1.056
ridges+1.055
 acknow+1.009
ards+0.998
Neg logit contributions
ogene-1.013
awan-0.954
vati-0.944
undo-0.934
argument-0.911
Token
Token ID251
Feature activation+6.38e-01

Pos logit contributions
ogene+1.472
awan+1.390
vati+1.389
undo+1.379
argument+1.349
Neg logit contributions
united-1.440
ridges-1.349
adders-1.347
 acknow-1.282
ards-1.250
Token and
Token ID290
Feature activation+9.76e-01

Pos logit contributions
ogene+0.467
awan+0.442
vati+0.440
undo+0.437
argument+0.428
Neg logit contributions
united-0.445
ridges-0.417
adders-0.417
 acknow-0.395
ards-0.387
Token
Token ID564
Feature activation+5.46e+00

Pos logit contributions
ogene+0.765
awan+0.727
vati+0.723
undo+0.719
argument+0.704
Neg logit contributions
united-0.631
ridges-0.588
adders-0.587
 acknow-0.554
ards-0.542
Token
Token ID250
Feature activation-1.09e+00

Pos logit contributions
ogene+3.553
undo+3.380
vati+3.365
awan+3.309
argument+3.269
Neg logit contributions
united-4.138
adders-3.896
ridges-3.879
 acknow-3.769
ards-3.628
Tokencoll
Token ID26000
Feature activation-1.71e+01

Pos logit contributions
united+0.753
ridges+0.700
adders+0.698
 acknow+0.655
ards+0.649
Neg logit contributions
ogene-0.819
awan-0.774
vati-0.768
undo-0.760
argument-0.744
Tokenaps
Token ID1686
Feature activation-4.89e+00

Pos logit contributions
united+8.227
adders+7.863
ards+7.465
ridges+7.163
aunders+6.838
Neg logit contributions
ogene-12.900
vati-12.648
undo-12.398
argument-12.383
 Manip-12.359
Tokening
Token ID278
Feature activation-4.58e+00

Pos logit contributions
united+3.146
adders+2.907
ridges+2.840
ards+2.695
 acknow+2.686
Neg logit contributions
ogene-3.778
vati-3.560
awan-3.545
undo-3.522
argument-3.456
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
united+3.176
adders+2.992
ridges+2.947
 acknow+2.874
ards+2.826
Neg logit contributions
ogene-3.277
awan-3.109
vati-3.082
undo-3.038
argument-2.972
Token
Token ID447
Feature activation+1.60e+00

Pos logit contributions
united+0.845
adders+0.791
ridges+0.787
 acknow+0.739
ards+0.738
Neg logit contributions
ogene-0.916
vati-0.863
awan-0.861
undo-0.853
argument-0.827
Token
Token ID251
Feature activation+1.87e+00

Pos logit contributions
ogene+1.174
awan+1.111
vati+1.106
undo+1.099
argument+1.074
Neg logit contributions
united-1.112
ridges-1.043
adders-1.040
 acknow-0.987
ards-0.967
Token into
Token ID656
Feature activation+7.23e+00

Pos logit contributions
ogene+5.467
awan+5.011
undo+4.987
vati+4.970
 circulating+4.897
Neg logit contributions
united-6.261
adders-6.062
ridges-5.954
 advoc-5.669
 acknow-5.640
Token a
Token ID257
Feature activation+7.48e+00

Pos logit contributions
ogene+4.678
undo+4.411
awan+4.343
 Manip+4.312
argument+4.268
Neg logit contributions
united-5.394
adders-5.129
ridges-5.067
 acknow-4.809
aunders-4.723
Token variety
Token ID4996
Feature activation+4.34e-01

Pos logit contributions
ogene+5.397
undo+5.252
 Manip+5.141
 circulating+5.058
awan+5.056
Neg logit contributions
united-4.926
adders-4.791
ridges-4.660
 acknow-4.323
aunders-4.283
Token of
Token ID286
Feature activation+5.35e+00

Pos logit contributions
ogene+0.316
awan+0.300
vati+0.298
undo+0.296
argument+0.289
Neg logit contributions
united-0.304
ridges-0.285
adders-0.285
 acknow-0.270
ards-0.265
Token inventive
Token ID47602
Feature activation+4.15e+00

Pos logit contributions
ogene+3.966
undo+3.729
awan+3.722
vati+3.681
 Manip+3.664
Neg logit contributions
united-3.592
adders-3.374
ridges-3.338
 acknow-3.122
ards-3.068
Token coll
Token ID2927
Feature activation-1.67e+01

Pos logit contributions
ogene+3.024
awan+2.855
undo+2.842
vati+2.824
 Manip+2.790
Neg logit contributions
united-2.868
adders-2.704
ridges-2.664
 acknow-2.520
ards-2.460
Tokenages
Token ID1095
Feature activation-1.57e+00

Pos logit contributions
ards+6.199
adders+5.851
united+5.850
ridges+5.478
ages+5.472
Neg logit contributions
vati-15.422
ogene-14.993
awan-14.883
undo-14.494
 Demand-14.263
Token now
Token ID783
Feature activation+9.22e+00

Pos logit contributions
united+0.992
ridges+0.918
adders+0.911
 acknow+0.866
ards+0.854
Neg logit contributions
ogene-1.260
awan-1.200
vati-1.193
undo-1.181
argument-1.156
Token making
Token ID1642
Feature activation+8.07e+00

Pos logit contributions
ogene+5.686
 circulating+5.487
awan+5.097
undo+5.076
vati+5.052
Neg logit contributions
united-7.145
adders-6.875
ridges-6.813
aunders-6.275
 acknow-6.268
Token the
Token ID262
Feature activation-6.78e-01

Pos logit contributions
ogene+4.952
 circulating+4.654
awan+4.634
vati+4.553
undo+4.543
Neg logit contributions
united-6.347
adders-5.917
ridges-5.908
 acknow-5.522
aunders-5.409
Token rounds
Token ID9196
Feature activation+6.42e-02

Pos logit contributions
united+0.507
ridges+0.478
adders+0.477
 acknow+0.456
ards+0.447
Neg logit contributions
ogene-0.461
awan-0.435
vati-0.433
undo-0.428
argument-0.418
Tokenatable
Token ID21156
Feature activation+2.77e+00

Pos logit contributions
united+6.130
adders+5.813
ridges+5.564
ards+5.464
 acknow+5.286
Neg logit contributions
ogene-7.038
undo-6.768
 Manip-6.635
vati-6.606
argument-6.603
Token issue
Token ID2071
Feature activation+9.18e-02

Pos logit contributions
ogene+2.066
awan+1.961
vati+1.945
undo+1.915
argument+1.902
Neg logit contributions
united-1.896
ridges-1.778
adders-1.770
 acknow-1.669
ards-1.615
Token of
Token ID286
Feature activation+3.74e-01

Pos logit contributions
ogene+0.069
awan+0.065
vati+0.065
undo+0.064
argument+0.063
Neg logit contributions
united-0.062
ridges-0.059
adders-0.058
 acknow-0.055
ards-0.054
Token what
Token ID644
Feature activation-3.65e+00

Pos logit contributions
ogene+0.272
awan+0.257
vati+0.256
undo+0.254
argument+0.248
Neg logit contributions
united-0.263
ridges-0.247
adders-0.247
 acknow-0.233
ards-0.229
Token it
Token ID340
Feature activation-8.71e+00

Pos logit contributions
united+2.344
adders+2.200
ridges+2.187
 acknow+2.112
ards+2.076
Neg logit contributions
ogene-2.801
vati-2.674
awan-2.668
undo-2.640
argument-2.577
Token's
Token ID338
Feature activation-1.65e+01

Pos logit contributions
united+5.424
adders+4.838
 acknow+4.810
ridges+4.752
ards+4.729
Neg logit contributions
ogene-6.669
vati-6.436
awan-6.311
undo-6.187
argument-6.160
Token like
Token ID588
Feature activation-6.69e+00

Pos logit contributions
united+7.022
 darkest+6.078
ards+5.920
 acknow+5.837
adders+5.691
Neg logit contributions
ogene-13.805
vati-13.424
undo-13.350
awan-13.203
argument-13.097
Token to
Token ID284
Feature activation-2.98e+00

Pos logit contributions
united+4.101
adders+3.709
ridges+3.699
ards+3.568
 acknow+3.524
Neg logit contributions
ogene-5.147
vati-5.033
undo-4.998
awan-4.975
argument-4.799
Token get
Token ID651
Feature activation+1.37e+00

Pos logit contributions
united+1.852
adders+1.718
ridges+1.714
 acknow+1.649
ards+1.596
Neg logit contributions
ogene-2.368
vati-2.276
awan-2.250
undo-2.228
argument-2.188
Token in
Token ID287
Feature activation-4.53e+00

Pos logit contributions
ogene+0.966
awan+0.914
vati+0.909
undo+0.899
argument+0.880
Neg logit contributions
united-0.994
ridges-0.935
adders-0.933
 acknow-0.883
ards-0.867
Token a
Token ID257
Feature activation-4.14e+00

Pos logit contributions
united+2.908
adders+2.733
ridges+2.732
 acknow+2.572
ards+2.536
Neg logit contributions
ogene-3.512
undo-3.346
awan-3.309
vati-3.285
argument-3.200
Token,
Token ID11
Feature activation-5.20e-01

Pos logit contributions
ogene+0.798
awan+0.756
vati+0.751
undo+0.744
argument+0.728
Neg logit contributions
united-0.817
adders-0.767
ridges-0.766
 acknow-0.728
ards-0.709
Token after
Token ID706
Feature activation-4.52e+00

Pos logit contributions
united+0.348
ridges+0.326
adders+0.325
 acknow+0.308
ards+0.302
Neg logit contributions
ogene-0.395
awan-0.374
vati-0.373
undo-0.371
argument-0.362
Token Hastings
Token ID38330
Feature activation-2.07e+00

Pos logit contributions
united+3.078
ridges+2.893
adders+2.860
 acknow+2.746
ards+2.682
Neg logit contributions
ogene-3.344
awan-3.165
vati-3.143
undo-3.136
argument-3.023
Token maybe
Token ID3863
Feature activation-4.83e+00

Pos logit contributions
united+1.315
adders+1.218
ridges+1.218
 acknow+1.153
ards+1.133
Neg logit contributions
ogene-1.647
awan-1.560
undo-1.554
vati-1.553
argument-1.511
Token it
Token ID340
Feature activation-8.98e+00

Pos logit contributions
united+3.038
ridges+2.827
adders+2.823
 acknow+2.732
ards+2.682
Neg logit contributions
ogene-3.805
awan-3.617
undo-3.614
vati-3.574
argument-3.455
Token seemed
Token ID3947
Feature activation-1.64e+01

Pos logit contributions
united+4.210
adders+3.743
 acknow+3.670
ards+3.639
ridges+3.544
Neg logit contributions
ogene-8.109
awan-7.824
undo-7.821
vati-7.756
argument-7.513
Token like
Token ID588
Feature activation-1.04e+01

Pos logit contributions
united+8.318
ridges+7.718
ards+7.671
 pity+7.652
adders+7.583
Neg logit contributions
ogene-12.639
undo-11.989
vati-11.868
awan-11.745
milo-11.573
Token life
Token ID1204
Feature activation+1.78e+00

Pos logit contributions
united+6.066
ridges+5.719
adders+5.695
ards+5.560
 acknow+5.511
Neg logit contributions
ogene-8.149
undo-7.738
awan-7.621
vati-7.613
argument-7.304
Token could
Token ID714
Feature activation-5.64e+00

Pos logit contributions
ogene+1.451
awan+1.384
vati+1.377
undo+1.368
argument+1.344
Neg logit contributions
united-1.092
ridges-1.015
adders-1.013
 acknow-0.952
ards-0.928
Token go
Token ID467
Feature activation-2.09e+00

Pos logit contributions
united+3.427
adders+3.163
ridges+3.111
 acknow+3.053
ards+2.981
Neg logit contributions
ogene-4.525
vati-4.314
awan-4.219
undo-4.190
argument-4.126
Token on
Token ID319
Feature activation-4.75e-01

Pos logit contributions
united+1.433
ridges+1.346
adders+1.335
 acknow+1.275
ards+1.253
Neg logit contributions
ogene-1.551
awan-1.468
vati-1.456
undo-1.454
argument-1.417
Tokenint
Token ID600
Feature activation-6.24e+00

Pos logit contributions
united+3.001
adders+2.462
ridges+2.403
ards+2.307
aunders+1.958
Neg logit contributions
ogene-8.348
vati-8.091
awan-7.945
 Manip-7.864
argument-7.832
Token/
Token ID14
Feature activation-1.35e+01

Pos logit contributions
united+3.227
ridges+2.861
adders+2.832
ards+2.647
 acknow+2.639
Neg logit contributions
ogene-5.603
vati-5.330
awan-5.314
undo-5.270
argument-5.262
Tokenviolence
Token ID37502
Feature activation-1.11e+01

Pos logit contributions
united+9.435
ridges+8.514
adders+8.235
ards+8.058
 acknow+7.769
Neg logit contributions
ogene-8.857
vati-8.287
awan-8.208
argument-8.039
 Manip-7.968
Token_
Token ID62
Feature activation-7.90e+00

Pos logit contributions
united+6.199
ridges+5.461
adders+5.423
ards+5.189
 acknow+4.863
Neg logit contributions
ogene-9.178
vati-8.647
awan-8.622
undo-8.345
argument-8.283
Tokenin
Token ID259
Feature activation-7.11e+00

Pos logit contributions
united+5.328
ridges+4.810
adders+4.780
ards+4.530
aunders+4.283
Neg logit contributions
ogene-5.947
awan-5.594
vati-5.515
 Manip-5.384
undo-5.329
Tokenjury
Token ID21871
Feature activation-1.59e+01

Pos logit contributions
united+5.416
ridges+5.081
adders+4.936
ards+4.792
aunders+4.613
Neg logit contributions
ogene-4.558
awan-4.370
undo-4.273
vati-4.256
 Manip-4.151
Token_
Token ID62
Feature activation-7.94e+00

Pos logit contributions
united+8.597
ridges+7.477
adders+7.258
ards+7.192
aunders+6.707
Neg logit contributions
ogene-12.257
awan-11.578
vati-11.491
undo-11.332
aimon-11.063
Tokenpre
Token ID3866
Feature activation-6.86e+00

Pos logit contributions
united+5.464
ridges+4.917
adders+4.856
ards+4.638
aunders+4.381
Neg logit contributions
ogene-5.892
awan-5.509
vati-5.423
 Manip-5.279
undo-5.261
Tokenvention
Token ID4018
Feature activation-8.55e+00

Pos logit contributions
united+3.770
ridges+3.423
adders+3.343
ards+3.265
 acknow+2.994
Neg logit contributions
ogene-5.998
awan-5.676
undo-5.667
vati-5.580
 Manip-5.505
Token/
Token ID14
Feature activation-1.14e+01

Pos logit contributions
united+4.416
ridges+3.929
adders+3.834
ards+3.658
 acknow+3.568
Neg logit contributions
ogene-7.569
vati-7.215
awan-7.179
undo-7.126
argument-6.928
Tokenkey
Token ID2539
Feature activation-2.11e+00

Pos logit contributions
united+8.027
ridges+7.189
adders+7.003
ards+6.821
 acknow+6.611
Neg logit contributions
ogene-7.809
awan-7.287
vati-7.142
argument-6.952
undo-6.927
Token and
Token ID290
Feature activation-2.13e+00

Pos logit contributions
united+0.056
adders+0.052
ridges+0.052
 acknow+0.049
ards+0.049
Neg logit contributions
ogene-0.056
awan-0.053
vati-0.053
undo-0.053
argument-0.051
Token support
Token ID1104
Feature activation+7.24e-01

Pos logit contributions
united+1.485
adders+1.402
ridges+1.390
 acknow+1.340
ards+1.306
Neg logit contributions
ogene-1.545
vati-1.457
awan-1.456
undo-1.427
argument-1.410
Token tax
Token ID1687
Feature activation-5.70e+00

Pos logit contributions
ogene+0.529
awan+0.500
vati+0.498
undo+0.494
argument+0.483
Neg logit contributions
united-0.506
ridges-0.474
adders-0.473
 acknow-0.449
ards-0.440
Token cuts
Token ID6630
Feature activation-4.83e+00

Pos logit contributions
united+3.896
adders+3.752
ridges+3.722
ards+3.511
 acknow+3.460
Neg logit contributions
ogene-4.098
vati-3.892
awan-3.880
undo-3.785
argument-3.727
Token for
Token ID329
Feature activation-1.25e+01

Pos logit contributions
united+3.344
adders+3.126
ridges+3.110
ards+2.960
 acknow+2.943
Neg logit contributions
ogene-3.484
awan-3.289
vati-3.281
undo-3.175
argument-3.174
Token small
Token ID1402
Feature activation-1.58e+01

Pos logit contributions
united+6.670
ridges+6.548
adders+6.467
ards+6.097
 acknow+6.083
Neg logit contributions
vati-9.470
ogene-9.440
argument-9.156
undo-9.104
awan-9.091
Token businesses
Token ID5692
Feature activation-3.92e+00

Pos logit contributions
ridges+4.840
united+4.793
adders+4.501
ards+4.356
 acknow+4.197
Neg logit contributions
ogene-14.809
vati-14.657
awan-14.534
argument-14.480
undo-14.431
Token and
Token ID290
Feature activation-7.40e+00

Pos logit contributions
united+2.331
adders+2.111
ridges+2.100
ards+1.978
 acknow+1.962
Neg logit contributions
ogene-3.256
vati-3.106
awan-3.099
undo-3.056
argument-3.021
Token working
Token ID1762
Feature activation-7.23e+00

Pos logit contributions
united+4.801
adders+4.651
ridges+4.518
 acknow+4.390
ards+4.330
Neg logit contributions
ogene-5.411
vati-5.227
awan-5.101
argument-5.031
undo-5.026
Token families
Token ID4172
Feature activation-4.63e+00

Pos logit contributions
united+4.544
ridges+4.276
adders+4.238
ards+4.231
 acknow+3.973
Neg logit contributions
ogene-5.497
undo-5.284
vati-5.279
awan-5.268
argument-5.164
Token,
Token ID11
Feature activation-2.37e+00

Pos logit contributions
united+2.687
adders+2.425
ridges+2.397
ards+2.277
 acknow+2.223
Neg logit contributions
ogene-3.902
vati-3.737
awan-3.702
undo-3.668
argument-3.636
Token his
Token ID465
Feature activation+2.75e+00

Pos logit contributions
ogene+2.175
awan+2.064
undo+2.044
vati+2.035
argument+1.990
Neg logit contributions
united-2.226
adders-2.106
ridges-2.103
 acknow-1.980
ards-1.942
Token current
Token ID1459
Feature activation+2.11e+00

Pos logit contributions
ogene+2.011
awan+1.913
undo+1.895
vati+1.886
argument+1.844
Neg logit contributions
united-1.910
ridges-1.802
adders-1.796
 acknow-1.689
ards-1.653
Token NHL
Token ID9481
Feature activation-3.51e+00

Pos logit contributions
ogene+1.552
awan+1.472
vati+1.459
undo+1.456
argument+1.420
Neg logit contributions
united-1.466
adders-1.380
ridges-1.379
 acknow-1.298
ards-1.270
Token team
Token ID1074
Feature activation-3.22e+00

Pos logit contributions
united+2.241
ridges+2.066
adders+2.050
 acknow+2.015
ards+1.970
Neg logit contributions
ogene-2.738
vati-2.622
awan-2.572
undo-2.533
argument-2.499
Token.
Token ID13
Feature activation-3.12e+00

Pos logit contributions
united+2.050
adders+1.875
ridges+1.857
 acknow+1.807
ards+1.794
Neg logit contributions
ogene-2.548
vati-2.414
awan-2.387
undo-2.363
argument-2.326
Token For
Token ID1114
Feature activation-1.58e+01

Pos logit contributions
united+2.272
adders+2.114
ridges+2.099
 acknow+2.030
ards+2.002
Neg logit contributions
ogene-2.190
vati-2.064
awan-2.042
undo-2.011
argument-1.973
Token the
Token ID262
Feature activation-7.97e+00

Pos logit contributions
united+8.706
adders+8.104
wards+8.079
ridges+7.993
 acknow+7.960
Neg logit contributions
ogene-11.201
vati-10.810
awan-10.139
opian-10.052
undo-10.041
Token Pens
Token ID38740
Feature activation+2.59e+00

Pos logit contributions
united+5.344
ridges+5.044
adders+4.978
 acknow+4.824
ards+4.698
Neg logit contributions
ogene-5.661
vati-5.442
awan-5.269
undo-5.193
argument-5.152
Token-
Token ID12
Feature activation+2.11e+00

Pos logit contributions
ogene+1.843
awan+1.750
vati+1.727
undo+1.726
argument+1.675
Neg logit contributions
united-1.846
ridges-1.747
adders-1.739
 acknow-1.650
ards-1.612
Token Fle
Token ID12005
Feature activation-8.34e+00

Pos logit contributions
ogene+1.626
awan+1.546
vati+1.539
undo+1.537
argument+1.499
Neg logit contributions
united-1.373
ridges-1.289
adders-1.288
 acknow-1.230
ards-1.194
Tokenury
Token ID1601
Feature activation-2.42e+00

Pos logit contributions
united+2.598
ridges+2.166
adders+2.059
ards+1.946
aunders+1.684
Neg logit contributions
ogene-9.179
 Manip-8.728
vati-8.617
awan-8.589
undo-8.545
Token!
Token ID0
Feature activation-2.04e+00

Pos logit contributions
united+6.538
ridges+5.968
 acknow+5.730
adders+5.694
ards+5.652
Neg logit contributions
ogene-9.791
vati-9.549
undo-9.425
awan-9.301
argument-9.082
Token
Token ID198
Feature activation-1.28e-01

Pos logit contributions
united+1.361
ridges+1.266
adders+1.265
ards+1.188
 acknow+1.179
Neg logit contributions
ogene-1.563
vati-1.472
awan-1.461
undo-1.458
argument-1.424
Token
Token ID198
Feature activation-2.70e+00

Pos logit contributions
united+0.083
adders+0.076
ridges+0.076
ards+0.071
 acknow+0.070
Neg logit contributions
ogene-0.101
vati-0.096
awan-0.096
undo-0.095
argument-0.093
TokenIs
Token ID3792
Feature activation-4.12e+00

Pos logit contributions
united+1.727
adders+1.580
ridges+1.575
ards+1.486
 acknow+1.442
Neg logit contributions
ogene-2.145
vati-2.019
awan-2.007
undo-1.988
argument-1.938
Token it
Token ID340
Feature activation-9.26e+00

Pos logit contributions
united+2.843
ridges+2.663
adders+2.650
ards+2.526
 acknow+2.498
Neg logit contributions
ogene-3.001
vati-2.812
undo-2.790
awan-2.785
argument-2.731
Token something
Token ID1223
Feature activation-1.56e+01

Pos logit contributions
united+5.344
adders+4.881
ridges+4.868
ards+4.812
 acknow+4.710
Neg logit contributions
ogene-7.248
vati-6.915
undo-6.841
awan-6.817
argument-6.767
Token you
Token ID345
Feature activation-9.73e+00

Pos logit contributions
united+7.338
ridges+6.853
ards+6.604
adders+6.395
 darkest+6.247
Neg logit contributions
ogene-12.717
undo-11.902
vati-11.848
argument-11.827
awan-11.640
Token'd
Token ID1549
Feature activation-1.12e+01

Pos logit contributions
united+5.868
 acknow+5.496
ridges+5.432
adders+5.414
ards+5.214
Neg logit contributions
ogene-7.279
argument-6.883
awan-6.854
undo-6.835
vati-6.833
Token ever
Token ID1683
Feature activation-9.65e+00

Pos logit contributions
united+5.850
 acknow+5.601
adders+5.465
ridges+5.349
 pity+5.170
Neg logit contributions
ogene-9.075
vati-8.572
awan-8.536
argument-8.373
undo-8.315
Token want
Token ID765
Feature activation-3.19e+00

Pos logit contributions
united+5.403
 acknow+5.145
adders+5.078
ridges+4.978
ards+4.769
Neg logit contributions
ogene-7.692
awan-7.266
vati-7.213
argument-7.115
undo-7.066
Token to
Token ID284
Feature activation-8.90e+00

Pos logit contributions
united+2.039
ridges+1.896
adders+1.881
 acknow+1.782
ards+1.769
Neg logit contributions
ogene-2.511
awan-2.369
vati-2.356
undo-2.342
argument-2.300
Token:
Token ID25
Feature activation-3.05e-01

Pos logit contributions
united+2.981
ridges+2.761
adders+2.732
ards+2.552
 acknow+2.547
Neg logit contributions
ogene-3.902
vati-3.770
awan-3.719
undo-3.706
argument-3.597
Token
Token ID198
Feature activation-1.35e+00

Pos logit contributions
united+0.202
ridges+0.188
adders+0.188
 acknow+0.178
ards+0.174
Neg logit contributions
ogene-0.234
awan-0.222
vati-0.221
undo-0.219
argument-0.215
Token
Token ID198
Feature activation+1.10e+00

Pos logit contributions
united+0.850
adders+0.780
ridges+0.779
ards+0.728
 acknow+0.706
Neg logit contributions
ogene-1.092
vati-1.031
awan-1.027
undo-1.016
argument-0.990
TokenGa
Token ID35389
Feature activation-5.72e+00

Pos logit contributions
ogene+0.832
awan+0.790
vati+0.785
undo+0.783
argument+0.768
Neg logit contributions
united-0.735
ridges-0.688
adders-0.686
 acknow-0.657
ards-0.638
Tokenussian
Token ID31562
Feature activation-1.72e+00

Pos logit contributions
united+3.067
adders+2.781
ridges+2.719
ards+2.583
 acknow+2.521
Neg logit contributions
ogene-5.020
awan-4.787
vati-4.766
undo-4.684
 Manip-4.616
Token bl
Token ID698
Feature activation-1.55e+01

Pos logit contributions
united+1.158
adders+1.084
ridges+1.082
 acknow+1.024
ards+1.007
Neg logit contributions
ogene-1.296
awan-1.231
vati-1.228
undo-1.217
argument-1.185
Tokenurring
Token ID14924
Feature activation-4.06e+00

Pos logit contributions
united+9.230
adders+9.142
ridges+8.776
ards+8.317
ights+7.808
Neg logit contributions
ogene-10.968
 circulating-10.616
 Manip-10.311
vati-10.198
awan-9.963
Token is
Token ID318
Feature activation-1.43e+00

Pos logit contributions
united+2.643
adders+2.475
ridges+2.463
 acknow+2.355
ards+2.314
Neg logit contributions
ogene-3.104
vati-2.966
awan-2.954
undo-2.920
argument-2.824
Token applied
Token ID5625
Feature activation+5.15e+00

Pos logit contributions
united+0.940
ridges+0.880
adders+0.877
 acknow+0.836
ards+0.823
Neg logit contributions
ogene-1.089
awan-1.037
vati-1.034
undo-1.023
argument-0.999
Token on
Token ID319
Feature activation+1.56e+00

Pos logit contributions
ogene+3.534
awan+3.319
vati+3.293
undo+3.278
 Manip+3.241
Neg logit contributions
united-3.773
ridges-3.562
adders-3.528
 acknow-3.340
 advoc-3.284
Token Line
Token ID6910
Feature activation+7.07e+00

Pos logit contributions
ogene+1.191
awan+1.127
vati+1.118
undo+1.113
argument+1.090
Neg logit contributions
united-1.042
adders-0.968
ridges-0.967
 acknow-0.919
ards-0.896
Token the
Token ID262
Feature activation+5.44e+00

Pos logit contributions
ogene+2.869
awan+2.735
vati+2.692
undo+2.674
 Manip+2.659
Neg logit contributions
united-2.833
adders-2.659
ridges-2.643
 acknow-2.481
ards-2.434
Token basic
Token ID4096
Feature activation+3.52e+00

Pos logit contributions
ogene+3.987
awan+3.783
vati+3.760
 Manip+3.727
undo+3.702
Neg logit contributions
united-3.650
adders-3.464
ridges-3.428
 acknow-3.208
aunders-3.150
Token division
Token ID7297
Feature activation-6.28e+00

Pos logit contributions
ogene+2.580
awan+2.425
vati+2.416
undo+2.388
argument+2.369
Neg logit contributions
united-2.426
adders-2.297
ridges-2.288
 acknow-2.151
ards-2.092
Token of
Token ID286
Feature activation-6.76e+00

Pos logit contributions
united+3.741
ridges+3.482
adders+3.470
ards+3.342
 acknow+3.321
Neg logit contributions
ogene-5.075
awan-4.851
vati-4.821
undo-4.739
argument-4.577
Token virtuous
Token ID41276
Feature activation+2.23e-02

Pos logit contributions
united+3.700
adders+3.445
ridges+3.417
 acknow+3.301
ards+3.180
Neg logit contributions
ogene-5.770
vati-5.510
awan-5.506
undo-5.481
argument-5.265
Token private
Token ID2839
Feature activation-1.53e+01

Pos logit contributions
ogene+0.017
awan+0.016
vati+0.016
undo+0.016
argument+0.015
Neg logit contributions
united-0.015
ridges-0.014
adders-0.014
 acknow-0.013
ards-0.013
Token property
Token ID3119
Feature activation-5.45e+00

Pos logit contributions
united+8.006
ridges+7.678
ards+7.308
adders+7.252
 acknow+7.235
Neg logit contributions
ogene-11.890
undo-11.341
awan-11.320
vati-11.254
argument-10.937
Token and
Token ID290
Feature activation-6.74e+00

Pos logit contributions
united+3.215
ridges+2.996
adders+2.951
 acknow+2.860
ards+2.835
Neg logit contributions
ogene-4.476
awan-4.246
undo-4.243
vati-4.221
argument-4.089
Token oppressive
Token ID30983
Feature activation-1.98e+00

Pos logit contributions
united+4.113
ridges+3.806
adders+3.803
 acknow+3.740
ards+3.592
Neg logit contributions
ogene-5.392
vati-5.094
undo-5.040
awan-5.012
argument-4.908
Token public
Token ID1171
Feature activation-9.90e+00

Pos logit contributions
united+1.232
ridges+1.143
adders+1.141
 acknow+1.083
ards+1.061
Neg logit contributions
ogene-1.588
vati-1.506
awan-1.503
undo-1.498
argument-1.463
Token interest
Token ID1393
Feature activation-3.22e+00

Pos logit contributions
united+5.859
ridges+5.566
adders+5.350
 acknow+5.224
ards+5.198
Neg logit contributions
ogene-7.744
undo-7.350
vati-7.340
awan-7.322
argument-6.987
Token.
Token ID13
Feature activation-2.14e+00

Pos logit contributions
united+1.436
ridges+1.326
adders+1.326
 acknow+1.255
ards+1.234
Neg logit contributions
ogene-1.775
awan-1.689
vati-1.681
undo-1.663
argument-1.622
Token
Token ID564
Feature activation-6.30e-01

Pos logit contributions
united+1.496
adders+1.395
ridges+1.395
 acknow+1.317
ards+1.313
Neg logit contributions
ogene-1.566
vati-1.459
awan-1.458
undo-1.436
argument-1.399
Token
Token ID250
Feature activation-4.69e+00

Pos logit contributions
united+0.462
ridges+0.437
adders+0.435
 acknow+0.412
ards+0.408
Neg logit contributions
ogene-0.437
awan-0.413
vati-0.405
undo-0.401
argument-0.393
TokenDoes
Token ID13921
Feature activation-1.73e+00

Pos logit contributions
united+3.173
ridges+2.898
adders+2.894
ards+2.725
 acknow+2.646
Neg logit contributions
ogene-3.571
awan-3.303
vati-3.300
undo-3.192
opian-3.136
Token it
Token ID340
Feature activation-4.06e+00

Pos logit contributions
united+1.120
adders+1.046
ridges+1.046
 acknow+0.989
ards+0.973
Neg logit contributions
ogene-1.342
vati-1.271
awan-1.262
undo-1.248
argument-1.223
Token seem
Token ID1283
Feature activation-1.53e+01

Pos logit contributions
united+2.634
adders+2.485
ridges+2.435
 acknow+2.383
ards+2.304
Neg logit contributions
ogene-3.077
vati-2.962
awan-2.904
undo-2.818
argument-2.783
Token fair
Token ID3148
Feature activation-3.86e+00

Pos logit contributions
united+8.029
ridges+7.458
adders+7.423
 pity+7.182
 acknow+7.147
Neg logit contributions
ogene-11.489
vati-10.843
awan-10.529
aimon-10.464
undo-10.359
Token to
Token ID284
Feature activation-5.57e+00

Pos logit contributions
united+2.346
adders+2.188
ridges+2.176
 acknow+2.046
ards+2.042
Neg logit contributions
ogene-3.120
vati-2.961
awan-2.933
undo-2.924
argument-2.837
Token you
Token ID345
Feature activation-4.28e+00

Pos logit contributions
united+3.704
adders+3.534
 acknow+3.505
ridges+3.461
ards+3.314
Neg logit contributions
ogene-4.019
vati-3.827
awan-3.764
undo-3.657
opted-3.620
Token,
Token ID11
Feature activation+2.51e+00

Pos logit contributions
united+2.422
adders+2.264
ridges+2.237
 acknow+2.108
ards+2.065
Neg logit contributions
ogene-3.609
vati-3.438
awan-3.399
undo-3.349
argument-3.287
Token
Token ID447
Feature activation-9.06e-02

Pos logit contributions
ogene+1.774
awan+1.678
vati+1.667
undo+1.657
argument+1.618
Neg logit contributions
united-1.809
ridges-1.700
adders-1.694
 acknow-1.612
ards-1.579
Token mass
Token ID2347
Feature activation-8.61e+00

Pos logit contributions
united+0.583
ridges+0.544
adders+0.542
 acknow+0.514
ards+0.502
Neg logit contributions
ogene-0.712
awan-0.675
vati-0.672
undo-0.667
argument-0.654
Token,
Token ID11
Feature activation-1.66e+00

Pos logit contributions
united+5.365
ridges+5.044
adders+5.004
ards+4.887
 acknow+4.730
Neg logit contributions
ogene-6.548
awan-6.204
vati-6.125
undo-5.967
argument-5.889
Token
Token ID198
Feature activation-1.96e+00

Pos logit contributions
united+1.104
adders+1.030
ridges+1.029
 acknow+0.984
ards+0.958
Neg logit contributions
ogene-1.268
vati-1.199
awan-1.198
undo-1.183
argument-1.162
Token
Token ID198
Feature activation-8.50e-01

Pos logit contributions
united+1.202
adders+1.092
ridges+1.092
ards+1.027
 acknow+0.973
Neg logit contributions
ogene-1.616
vati-1.532
awan-1.517
undo-1.504
argument-1.462
Tokenand
Token ID392
Feature activation+5.49e-01

Pos logit contributions
united+0.551
ridges+0.512
adders+0.510
 acknow+0.476
ards+0.472
Neg logit contributions
ogene-0.671
awan-0.634
vati-0.632
undo-0.624
argument-0.611
Token for
Token ID329
Feature activation-1.53e+01

Pos logit contributions
ogene+0.401
awan+0.379
vati+0.377
undo+0.374
argument+0.366
Neg logit contributions
united-0.384
ridges-0.360
adders-0.359
 acknow-0.341
ards-0.334
Token the
Token ID262
Feature activation-8.63e+00

Pos logit contributions
united+8.756
ridges+8.089
 pity+7.953
adders+7.818
ards+7.808
Neg logit contributions
ogene-10.867
awan-10.304
undo-10.100
vati-10.004
argument-9.559
Token most
Token ID749
Feature activation-1.55e+00

Pos logit contributions
united+5.330
ridges+4.962
adders+4.814
 acknow+4.734
ards+4.573
Neg logit contributions
ogene-6.685
awan-6.287
undo-6.261
vati-6.253
argument-6.091
Token part
Token ID636
Feature activation-2.30e+00

Pos logit contributions
united+1.105
ridges+1.037
adders+1.030
 acknow+0.989
ards+0.968
Neg logit contributions
ogene-1.115
awan-1.055
vati-1.046
undo-1.041
argument-1.013
Token,
Token ID11
Feature activation-8.04e-01

Pos logit contributions
united+1.582
ridges+1.480
adders+1.466
 acknow+1.437
ards+1.395
Neg logit contributions
ogene-1.705
awan-1.609
vati-1.607
undo-1.581
argument-1.556
Token
Token ID198
Feature activation-1.60e-01

Pos logit contributions
united+0.539
ridges+0.505
adders+0.502
 acknow+0.481
ards+0.469
Neg logit contributions
ogene-0.611
awan-0.578
vati-0.577
undo-0.570
argument-0.559
Tokenhtml
Token ID6494
Feature activation-7.61e+00

Pos logit contributions
united+6.921
ridges+6.316
adders+6.223
ards+5.946
aunders+5.724
Neg logit contributions
ogene-6.774
 Manip-6.387
vati-6.380
awan-6.253
 Demand-6.072
Token/
Token ID14
Feature activation-1.17e+01

Pos logit contributions
united+4.290
adders+3.892
ridges+3.873
ards+3.628
 acknow+3.525
Neg logit contributions
ogene-6.469
vati-6.120
awan-6.050
undo-6.007
 Manip-5.852
Tokenpeople
Token ID15332
Feature activation-8.96e+00

Pos logit contributions
united+8.344
adders+7.500
ridges+7.468
ards+7.215
aunders+6.900
Neg logit contributions
ogene-7.739
vati-7.257
 Manip-7.206
awan-7.074
undo-6.919
Token/
Token ID14
Feature activation-1.05e+01

Pos logit contributions
united+4.802
adders+4.307
ridges+4.304
ards+4.024
 acknow+3.876
Neg logit contributions
ogene-7.687
vati-7.390
awan-7.308
undo-7.229
 Manip-7.133
Tokenye
Token ID5948
Feature activation-6.56e+00

Pos logit contributions
united+7.550
ridges+6.714
adders+6.714
ards+6.434
 acknow+6.214
Neg logit contributions
ogene-7.098
vati-6.689
 Manip-6.617
awan-6.527
undo-6.382
Tokenager
Token ID3536
Feature activation-1.51e+01

Pos logit contributions
united+2.555
adders+2.295
ridges+2.199
ards+2.102
aunders+1.890
Neg logit contributions
ogene-6.664
 Manip-6.474
vati-6.414
undo-6.371
awan-6.370
Token_
Token ID62
Feature activation-7.44e+00

Pos logit contributions
united+9.262
adders+8.297
ridges+8.057
ards+8.055
aunders+7.679
Neg logit contributions
ogene-10.368
vati-10.014
undo-9.784
 Manip-9.696
awan-9.590
Tokenbi
Token ID8482
Feature activation-4.95e+00

Pos logit contributions
united+5.187
ridges+4.659
adders+4.644
ards+4.422
 acknow+4.318
Neg logit contributions
ogene-5.315
vati-5.061
 Manip-5.046
awan-4.988
undo-4.898
Tokenography
Token ID4867
Feature activation-8.64e+00

Pos logit contributions
united+3.330
ridges+3.123
adders+3.105
ards+2.967
 acknow+2.921
Neg logit contributions
ogene-3.552
vati-3.481
awan-3.433
undo-3.366
 Manip-3.362
Token.
Token ID13
Feature activation-6.60e+00

Pos logit contributions
united+4.649
adders+4.134
ridges+4.085
ards+3.910
 acknow+3.820
Neg logit contributions
ogene-7.383
vati-7.103
awan-6.984
undo-6.951
argument-6.794
Tokenhtml
Token ID6494
Feature activation-3.10e+00

Pos logit contributions
united+3.729
ridges+3.376
adders+3.319
ards+3.154
 acknow+3.069
Neg logit contributions
ogene-5.598
vati-5.300
awan-5.265
undo-5.209
argument-5.098
Token
Token ID447
Feature activation-2.10e+00

Pos logit contributions
ogene+2.428
awan+2.294
undo+2.251
vati+2.240
argument+2.202
Neg logit contributions
united-3.208
ridges-3.084
adders-3.051
 acknow-3.010
 advoc-2.976
Token
Token ID247
Feature activation-3.50e+00

Pos logit contributions
united+1.472
ridges+1.382
adders+1.378
 acknow+1.308
ards+1.295
Neg logit contributions
ogene-1.521
awan-1.452
vati-1.424
undo-1.403
 Manip-1.379
Tokent
Token ID83
Feature activation+3.92e+00

Pos logit contributions
united+2.322
adders+2.159
ridges+2.156
 acknow+2.034
ards+2.032
Neg logit contributions
ogene-2.688
awan-2.539
vati-2.508
undo-2.463
argument-2.435
Token.
Token ID13
Feature activation-1.92e+00

Pos logit contributions
ogene+2.948
awan+2.800
undo+2.787
vati+2.772
argument+2.714
Neg logit contributions
united-2.611
ridges-2.473
adders-2.450
 acknow-2.304
ards-2.260
Token Franklin
Token ID14021
Feature activation-1.10e+01

Pos logit contributions
united+1.298
adders+1.210
ridges+1.199
ards+1.136
 acknow+1.134
Neg logit contributions
ogene-1.457
vati-1.372
awan-1.368
undo-1.346
argument-1.312
Token Del
Token ID4216
Feature activation-1.51e+01

Pos logit contributions
united+6.143
adders+6.137
ridges+5.768
 acknow+5.705
ards+5.677
Neg logit contributions
ogene-8.499
vati-8.318
undo-7.965
argument-7.943
awan-7.936
Tokenano
Token ID5733
Feature activation-6.14e+00

Pos logit contributions
adders+9.951
united+9.732
ridges+9.281
ards+9.063
esse+8.949
Neg logit contributions
ogene-10.355
vati-10.249
argument-9.827
undo-9.746
 circulating-9.665
Token Roosevelt
Token ID19297
Feature activation-6.05e+00

Pos logit contributions
united+3.466
adders+3.173
ridges+3.126
ards+3.058
 acknow+2.934
Neg logit contributions
ogene-5.191
vati-4.918
awan-4.868
argument-4.832
undo-4.714
Token was
Token ID373
Feature activation-4.28e-01

Pos logit contributions
united+3.810
 acknow+3.506
adders+3.495
ridges+3.434
ards+3.346
Neg logit contributions
ogene-4.734
awan-4.499
vati-4.481
undo-4.381
argument-4.270
Token credited
Token ID18141
Feature activation+4.25e+00

Pos logit contributions
united+0.292
ridges+0.271
adders+0.271
 acknow+0.260
ards+0.254
Neg logit contributions
ogene-0.322
awan-0.305
vati-0.305
undo-0.301
argument-0.294
Token as
Token ID355
Feature activation-1.39e+00

Pos logit contributions
ogene+3.371
awan+3.193
vati+3.182
undo+3.175
argument+3.116
Neg logit contributions
united-2.641
ridges-2.479
adders-2.466
 acknow-2.294
ards-2.245
Token their
Token ID511
Feature activation+5.23e-01

Pos logit contributions
united+0.308
ridges+0.287
adders+0.286
 acknow+0.272
ards+0.267
Neg logit contributions
ogene-0.361
awan-0.343
vati-0.342
undo-0.339
argument-0.330
Token ambitions
Token ID22455
Feature activation+3.09e+00

Pos logit contributions
ogene+0.398
awan+0.377
vati+0.375
undo+0.373
argument+0.365
Neg logit contributions
united-0.351
ridges-0.328
adders-0.327
 acknow-0.310
ards-0.304
Token were
Token ID547
Feature activation-3.04e+00

Pos logit contributions
ogene+2.531
awan+2.413
vati+2.408
undo+2.388
argument+2.343
Neg logit contributions
united-1.862
ridges-1.742
adders-1.732
 acknow-1.620
ards-1.581
Token not
Token ID407
Feature activation-6.46e-01

Pos logit contributions
united+1.889
ridges+1.735
adders+1.727
 acknow+1.659
ards+1.617
Neg logit contributions
ogene-2.470
awan-2.348
vati-2.339
undo-2.315
argument-2.267
Token always
Token ID1464
Feature activation-4.54e+00

Pos logit contributions
united+0.394
ridges+0.365
adders+0.364
 acknow+0.345
ards+0.337
Neg logit contributions
ogene-0.530
awan-0.504
vati-0.502
undo-0.499
argument-0.489
Token self
Token ID2116
Feature activation-1.51e+01

Pos logit contributions
united+2.838
ridges+2.576
adders+2.572
 acknow+2.513
ards+2.455
Neg logit contributions
ogene-3.638
vati-3.451
awan-3.449
undo-3.425
argument-3.344
Tokenless
Token ID1203
Feature activation-5.49e+00

Pos logit contributions
united+6.767
ridges+6.279
 acknow+5.745
adders+5.704
ards+5.628
Neg logit contributions
ogene-13.363
awan-12.856
vati-12.830
undo-12.554
pora-12.331
Token".[
Token ID42924
Feature activation+3.05e+00

Pos logit contributions
united+2.966
ridges+2.706
adders+2.684
ards+2.556
 acknow+2.527
Neg logit contributions
ogene-4.823
undo-4.576
vati-4.575
awan-4.568
argument-4.420
Token39
Token ID2670
Feature activation+4.14e+00

Pos logit contributions
ogene+1.723
awan+1.603
vati+1.598
undo+1.586
argument+1.555
Neg logit contributions
united-2.621
adders-2.495
ridges-2.478
 acknow-2.401
ards-2.338
Token]
Token ID60
Feature activation+3.43e+00

Pos logit contributions
ogene+2.188
awan+2.041
vati+2.007
undo+2.000
argument+1.955
Neg logit contributions
united-3.662
ridges-3.534
adders-3.506
 acknow-3.456
 advoc-3.418
Token
Token ID198
Feature activation+2.80e+00

Pos logit contributions
ogene+2.451
awan+2.335
undo+2.307
vati+2.305
 Manip+2.280
Neg logit contributions
united-2.401
ridges-2.264
adders-2.261
 acknow-2.160
 advoc-2.099
Token http
Token ID2638
Feature activation-1.93e+00

Pos logit contributions
united+3.483
ridges+3.141
adders+3.129
 acknow+2.991
ards+2.956
Neg logit contributions
ogene-4.612
vati-4.435
undo-4.372
awan-4.351
argument-4.277
Token://
Token ID1378
Feature activation-7.85e+00

Pos logit contributions
united+0.553
adders+0.451
ridges+0.439
ards+0.375
 acknow+0.354
Neg logit contributions
ogene-2.218
vati-2.141
awan-2.137
undo-2.120
argument-2.081
Tokenwww
Token ID2503
Feature activation-5.69e+00

Pos logit contributions
united+4.989
adders+4.552
ridges+4.442
ards+4.208
 acknow+4.015
Neg logit contributions
ogene-6.097
vati-5.843
 Manip-5.798
awan-5.679
undo-5.619
Token.
Token ID13
Feature activation-3.35e+00

Pos logit contributions
united+2.938
adders+2.579
ridges+2.512
ards+2.403
aunders+2.253
Neg logit contributions
ogene-5.161
vati-4.963
undo-4.861
awan-4.848
 Manip-4.734
Tokened
Token ID276
Feature activation-2.06e+00

Pos logit contributions
united+2.361
adders+2.183
ridges+2.182
ards+2.049
 acknow+2.015
Neg logit contributions
ogene-2.451
vati-2.320
awan-2.305
 Manip-2.280
undo-2.265
Tokenwards
Token ID2017
Feature activation-1.51e+01

Pos logit contributions
united+0.630
ridges+0.539
adders+0.530
 acknow+0.456
ards+0.451
Neg logit contributions
ogene-2.320
vati-2.233
awan-2.231
undo-2.216
argument-2.188
Token.
Token ID13
Feature activation-7.84e+00

Pos logit contributions
united+9.426
adders+8.142
ards+8.094
ridges+7.842
aunders+7.409
Neg logit contributions
ogene-10.626
vati-10.337
 Manip-10.294
undo-9.875
 circulating-9.794
Tokenaf
Token ID1878
Feature activation-8.80e+00

Pos logit contributions
united+4.949
adders+4.494
ridges+4.406
ards+4.214
 acknow+3.983
Neg logit contributions
ogene-6.007
vati-5.745
 Manip-5.706
awan-5.590
 circulating-5.557
Token.
Token ID13
Feature activation-9.61e+00

Pos logit contributions
united+5.353
ridges+4.864
adders+4.687
ards+4.610
 acknow+4.589
Neg logit contributions
ogene-6.864
vati-6.477
 Manip-6.356
argument-6.323
undo-6.285
Tokenmil
Token ID25433
Feature activation-6.53e+00

Pos logit contributions
united+4.931
adders+4.270
ridges+4.242
ards+3.912
 acknow+3.723
Neg logit contributions
ogene-8.512
vati-8.092
 Manip-7.951
 circulating-7.883
awan-7.866
Token/
Token ID14
Feature activation-1.33e+01

Pos logit contributions
united+3.463
adders+3.156
ridges+3.100
 acknow+2.932
ards+2.897
Neg logit contributions
ogene-5.730
vati-5.430
awan-5.398
undo-5.376
argument-5.316
Token a
Token ID257
Feature activation+2.09e+00

Pos logit contributions
ogene+3.323
awan+3.117
undo+3.092
vati+3.064
 Manip+3.037
Neg logit contributions
united-3.494
ridges-3.265
adders-3.241
 acknow-3.093
ards-2.996
Token conservative
Token ID5940
Feature activation+6.93e+00

Pos logit contributions
ogene+1.621
awan+1.535
undo+1.525
vati+1.525
argument+1.485
Neg logit contributions
united-1.366
adders-1.278
ridges-1.278
 acknow-1.208
ards-1.174
Token estimate
Token ID8636
Feature activation+4.30e+00

Pos logit contributions
ogene+4.747
undo+4.450
awan+4.394
vati+4.345
argument+4.324
Neg logit contributions
united-5.088
ridges-4.819
adders-4.798
 acknow-4.490
ards-4.423
Token of
Token ID286
Feature activation+6.29e+00

Pos logit contributions
ogene+2.981
awan+2.806
vati+2.799
undo+2.795
argument+2.732
Neg logit contributions
united-3.130
ridges-2.960
adders-2.941
 acknow-2.814
 advoc-2.734
Token the
Token ID262
Feature activation+6.10e+00

Pos logit contributions
ogene+4.181
awan+3.851
undo+3.841
vati+3.824
 Manip+3.772
Neg logit contributions
united-4.756
ridges-4.468
adders-4.465
 acknow-4.235
ards-4.144
Token incub
Token ID25543
Feature activation+1.03e+01

Pos logit contributions
ogene+4.457
undo+4.153
awan+4.141
vati+4.119
 Manip+4.054
Neg logit contributions
united-4.209
adders-3.970
ridges-3.947
 acknow-3.743
ards-3.654
Tokenation
Token ID341
Feature activation+8.96e+00

Pos logit contributions
ogene+7.050
vati+6.513
awan+6.512
undo+6.431
argument+6.206
Neg logit contributions
united-7.156
ridges-6.963
 acknow-6.709
 advoc-6.671
adders-6.571
Token times
Token ID1661
Feature activation+9.01e+00

Pos logit contributions
ogene+5.619
vati+5.205
awan+5.176
 circulating+5.156
undo+5.124
Neg logit contributions
united-6.849
ridges-6.460
adders-6.374
 acknow-6.093
aunders-5.945
Token for
Token ID329
Feature activation+2.09e+00

Pos logit contributions
ogene+6.025
vati+5.663
undo+5.615
awan+5.614
argument+5.493
Neg logit contributions
united-6.511
ridges-6.146
adders-6.095
 acknow-5.921
 advoc-5.753
Token the
Token ID262
Feature activation+3.91e+00

Pos logit contributions
ogene+1.598
awan+1.512
vati+1.504
undo+1.497
argument+1.464
Neg logit contributions
united-1.394
ridges-1.303
adders-1.301
 acknow-1.232
ards-1.202
Token two
Token ID734
Feature activation+4.06e+00

Pos logit contributions
ogene+2.767
awan+2.570
undo+2.559
vati+2.554
 Manip+2.507
Neg logit contributions
united-2.819
adders-2.657
ridges-2.656
 acknow-2.532
ards-2.458
Token l
Token ID300
Feature activation+1.14e+00

Pos logit contributions
ogene+4.539
awan+4.320
vati+4.290
undo+4.235
 Manip+4.112
Neg logit contributions
united-4.360
ridges-4.151
adders-4.131
 acknow-3.839
ards-3.774
Tokenono
Token ID29941
Feature activation+2.27e+00

Pos logit contributions
ogene+0.707
awan+0.661
vati+0.658
undo+0.652
argument+0.637
Neg logit contributions
united-0.921
adders-0.871
ridges-0.870
 acknow-0.830
ards-0.818
Token i
Token ID1312
Feature activation+2.98e+00

Pos logit contributions
ogene+1.675
awan+1.584
vati+1.577
undo+1.563
argument+1.522
Neg logit contributions
united-1.579
ridges-1.484
adders-1.480
 acknow-1.399
ards-1.367
Token o
Token ID267
Feature activation+8.12e+00

Pos logit contributions
ogene+2.091
awan+1.980
vati+1.966
undo+1.946
 Manip+1.893
Neg logit contributions
united-2.171
ridges-2.048
adders-2.039
 acknow-1.930
ards-1.892
Token Kal
Token ID12612
Feature activation+5.18e+00

Pos logit contributions
ogene+5.721
vati+5.454
awan+5.437
undo+5.362
 Manip+5.194
Neg logit contributions
united-5.621
ridges-5.315
adders-5.296
 acknow-5.005
ards-4.894
Tokenan
Token ID272
Feature activation+8.44e+00

Pos logit contributions
ogene+3.765
awan+3.646
vati+3.564
undo+3.534
argument+3.401
Neg logit contributions
united-3.581
ridges-3.387
adders-3.351
 acknow-3.269
 advoc-3.138
Tokenial
Token ID498
Feature activation+1.08e+01

Pos logit contributions
ogene+5.847
awan+5.650
vati+5.546
undo+5.425
argument+5.188
Neg logit contributions
united-5.962
ridges-5.660
adders-5.568
 acknow-5.504
 advoc-5.310
Tokeni
Token ID72
Feature activation+6.75e+00

Pos logit contributions
ogene+7.620
awan+7.385
vati+7.231
undo+7.129
ova+6.765
Neg logit contributions
united-7.208
ridges-6.878
adders-6.695
 acknow-6.573
 advoc-6.408
Token
Token ID134
Feature activation+3.55e+00

Pos logit contributions
ogene+4.753
awan+4.499
vati+4.431
undo+4.389
 Manip+4.322
Neg logit contributions
united-4.780
ridges-4.551
adders-4.512
 acknow-4.249
aunders-4.145
Token
Token ID119
Feature activation+8.08e+00

Pos logit contributions
ogene+2.275
awan+2.153
vati+2.139
undo+2.131
argument+2.073
Neg logit contributions
united-2.773
ridges-2.635
adders-2.615
 acknow-2.509
ards-2.452
Tokenil
Token ID346
Feature activation+6.06e+00

Pos logit contributions
ogene+5.805
awan+5.511
vati+5.460
undo+5.433
argument+5.159
Neg logit contributions
united-5.538
ridges-5.211
adders-5.137
 acknow-5.046
 advoc-4.904
Tokenig
Token ID328
Feature activation+7.04e+00

Pos logit contributions
ogene+3.536
awan+3.347
undo+3.303
vati+3.277
argument+3.237
Neg logit contributions
united-4.083
adders-3.872
ridges-3.871
 acknow-3.801
 encount-3.791
Tokendem
Token ID9536
Feature activation+5.01e+00

Pos logit contributions
ogene+3.960
undo+3.801
awan+3.751
vati+3.643
argument+3.641
Neg logit contributions
united-5.708
 acknow-5.659
ridges-5.647
 encount-5.633
adders-5.551
Tokenont
Token ID756
Feature activation+3.80e+00

Pos logit contributions
ogene+3.750
awan+3.628
undo+3.581
vati+3.488
argument+3.480
Neg logit contributions
united-3.230
ridges-3.133
 acknow-3.116
 encount-3.087
adders-3.073
Token and
Token ID290
Feature activation+6.47e+00

Pos logit contributions
ogene+2.709
awan+2.566
vati+2.560
undo+2.538
argument+2.487
Neg logit contributions
united-2.698
ridges-2.553
adders-2.538
 acknow-2.392
ards-2.332
Token his
Token ID465
Feature activation+5.24e+00

Pos logit contributions
ogene+4.310
awan+4.105
undo+4.044
vati+4.010
 Manip+3.991
Neg logit contributions
united-4.708
adders-4.528
ridges-4.497
 acknow-4.153
aunders-4.094
Token nationalist
Token ID19848
Feature activation+9.33e+00

Pos logit contributions
ogene+3.500
awan+3.342
undo+3.299
vati+3.286
 Manip+3.224
Neg logit contributions
united-3.875
ridges-3.699
adders-3.680
 acknow-3.472
ards-3.390
Token heter
Token ID14445
Feature activation-2.52e+00

Pos logit contributions
ogene+5.833
awan+5.716
undo+5.548
vati+5.478
 Manip+5.453
Neg logit contributions
united-6.868
ridges-6.709
adders-6.602
 acknow-6.170
aunders-6.105
Tokenogeneous
Token ID32269
Feature activation+4.84e+00

Pos logit contributions
united+1.944
adders+1.816
ridges+1.805
ards+1.734
 acknow+1.725
Neg logit contributions
vati-1.561
ogene-1.557
undo-1.523
awan-1.513
 Manip-1.496
Token coalition
Token ID9906
Feature activation+1.25e+00

Pos logit contributions
ogene+3.253
awan+3.098
vati+3.056
undo+3.050
argument+2.969
Neg logit contributions
united-3.588
ridges-3.431
adders-3.407
 acknow-3.227
aunders-3.133
Token (
Token ID357
Feature activation-6.80e-01

Pos logit contributions
ogene+0.940
awan+0.888
vati+0.885
undo+0.878
argument+0.859
Neg logit contributions
united-0.848
ridges-0.791
adders-0.791
 acknow-0.750
ards-0.736
Tokenlab
Token ID23912
Feature activation-4.96e-01

Pos logit contributions
united+0.422
ridges+0.388
adders+0.388
 acknow+0.362
ards+0.359
Neg logit contributions
ogene-0.555
awan-0.527
vati-0.525
undo-0.519
argument-0.508
Token for
Token ID329
Feature activation-1.10e+00

Pos logit contributions
ogene+2.562
awan+2.432
undo+2.391
 Manip+2.368
argument+2.367
Neg logit contributions
united-2.549
ridges-2.357
adders-2.341
 acknow-2.187
ards-2.145
Token Guard
Token ID4932
Feature activation-8.50e-01

Pos logit contributions
united+0.772
ridges+0.727
adders+0.724
 acknow+0.688
ards+0.674
Neg logit contributions
ogene-0.798
awan-0.754
vati-0.753
undo-0.745
argument-0.728
Token,
Token ID837
Feature activation+1.02e+00

Pos logit contributions
united+0.557
ridges+0.523
adders+0.521
 acknow+0.498
ards+0.486
Neg logit contributions
ogene-0.653
vati-0.622
awan-0.620
undo-0.612
argument-0.598
Token on
Token ID319
Feature activation+2.00e+00

Pos logit contributions
ogene+0.807
awan+0.768
vati+0.761
undo+0.759
argument+0.744
Neg logit contributions
united-0.648
ridges-0.602
adders-0.602
 acknow-0.564
ards-0.554
Token the
Token ID262
Feature activation+7.73e+00

Pos logit contributions
ogene+1.541
awan+1.457
undo+1.445
vati+1.443
argument+1.416
Neg logit contributions
united-1.325
ridges-1.227
adders-1.227
 acknow-1.154
ards-1.133
Token other
Token ID584
Feature activation+1.27e+01

Pos logit contributions
ogene+5.362
awan+5.027
 Manip+4.952
argument+4.913
vati+4.910
Neg logit contributions
united-5.596
adders-5.124
ridges-5.080
ards-4.783
aunders-4.764
Token hand
Token ID1021
Feature activation+3.34e+00

Pos logit contributions
ogene+5.654
awan+5.431
 circulating+5.399
vati+5.374
 Manip+5.328
Neg logit contributions
united-11.194
 encount-10.888
adders-10.458
 advoc-10.421
 acknow-10.417
Token,
Token ID11
Feature activation+5.30e+00

Pos logit contributions
ogene+2.236
awan+2.112
undo+2.069
vati+2.058
argument+2.029
Neg logit contributions
united-2.557
adders-2.368
ridges-2.368
 acknow-2.250
ards-2.187
Token exits
Token ID30151
Feature activation-3.50e-01

Pos logit contributions
ogene+3.853
awan+3.662
undo+3.601
 Manip+3.590
vati+3.539
Neg logit contributions
united-3.721
adders-3.402
ridges-3.390
 acknow-3.104
 advoc-3.095
Token epoch
Token ID36835
Feature activation+5.24e+00

Pos logit contributions
united+0.238
ridges+0.223
adders+0.223
 acknow+0.212
ards+0.208
Neg logit contributions
ogene-0.261
awan-0.248
vati-0.247
undo-0.245
argument-0.239
Token management
Token ID4542
Feature activation+4.44e+00

Pos logit contributions
ogene+3.710
awan+3.538
undo+3.514
 Manip+3.509
vati+3.498
Neg logit contributions
united-3.697
ridges-3.481
adders-3.461
 acknow-3.196
aunders-3.162
Token
Token ID564
Feature activation-1.02e+00

Pos logit contributions
ogene+0.316
awan+0.300
vati+0.298
undo+0.296
argument+0.290
Neg logit contributions
united-0.291
ridges-0.272
adders-0.272
 acknow-0.257
ards-0.252
Token
Token ID250
Feature activation-2.11e+00

Pos logit contributions
united+0.708
ridges+0.664
adders+0.662
 acknow+0.625
ards+0.618
Neg logit contributions
ogene-0.756
awan-0.715
vati-0.707
undo-0.700
 Manip-0.687
Tokenhe
Token ID258
Feature activation-2.31e+00

Pos logit contributions
united+1.442
ridges+1.338
adders+1.327
 acknow+1.246
ards+1.239
Neg logit contributions
ogene-1.590
awan-1.503
vati-1.494
undo-1.459
 Manip-1.449
Token enjoyed
Token ID8359
Feature activation+2.20e+00

Pos logit contributions
united+1.396
adders+1.294
ridges+1.283
 acknow+1.226
ards+1.201
Neg logit contributions
ogene-1.892
awan-1.806
vati-1.794
undo-1.779
argument-1.754
Token his
Token ID465
Feature activation+1.97e+00

Pos logit contributions
ogene+1.457
awan+1.375
vati+1.362
undo+1.355
argument+1.321
Neg logit contributions
united-1.681
ridges-1.584
adders-1.582
 acknow-1.502
ards-1.479
Token time
Token ID640
Feature activation+8.83e+00

Pos logit contributions
ogene+1.494
awan+1.421
vati+1.409
undo+1.404
argument+1.374
Neg logit contributions
united-1.325
adders-1.240
ridges-1.238
 acknow-1.167
ards-1.143
Token with
Token ID351
Feature activation-1.75e+00

Pos logit contributions
ogene+6.044
undo+5.795
awan+5.761
vati+5.726
 Manip+5.662
Neg logit contributions
united-5.909
adders-5.897
ridges-5.720
 acknow-5.425
 advoc-5.344
Token the
Token ID262
Feature activation-3.70e+00

Pos logit contributions
united+1.207
ridges+1.124
adders+1.117
 acknow+1.063
ards+1.046
Neg logit contributions
ogene-1.286
vati-1.223
awan-1.216
undo-1.200
argument-1.183
Token Heat
Token ID12308
Feature activation+2.18e+00

Pos logit contributions
united+2.610
ridges+2.431
adders+2.409
 acknow+2.289
ards+2.276
Neg logit contributions
ogene-2.637
vati-2.509
awan-2.485
undo-2.454
argument-2.413
Token in
Token ID287
Feature activation+1.84e+00

Pos logit contributions
ogene+1.644
awan+1.562
vati+1.552
undo+1.542
argument+1.507
Neg logit contributions
united-1.470
ridges-1.385
adders-1.379
 acknow-1.303
ards-1.274
Token Summer
Token ID10216
Feature activation+1.17e+01

Pos logit contributions
ogene+1.173
awan+1.102
vati+1.088
undo+1.085
argument+1.057
Neg logit contributions
united-1.457
ridges-1.380
adders-1.379
 acknow-1.317
ards-1.293
Token to
Token ID284
Feature activation+3.02e+00

Pos logit contributions
united+0.552
adders+0.516
ridges+0.515
 acknow+0.486
ards+0.479
Neg logit contributions
ogene-0.640
awan-0.603
vati-0.602
undo-0.598
argument-0.584
Token lay
Token ID3830
Feature activation+3.73e+00

Pos logit contributions
ogene+2.122
awan+2.018
undo+1.993
vati+1.988
argument+1.939
Neg logit contributions
united-2.188
ridges-2.068
adders-2.052
 acknow-1.921
ards-1.906
Token another
Token ID1194
Feature activation+3.97e+00

Pos logit contributions
ogene+2.356
awan+2.209
vati+2.187
undo+2.178
argument+2.121
Neg logit contributions
united-2.981
ridges-2.813
adders-2.794
 acknow-2.657
 advoc-2.601
Token allegation
Token ID24389
Feature activation+8.97e-01

Pos logit contributions
ogene+3.035
awan+2.880
undo+2.868
vati+2.850
argument+2.810
Neg logit contributions
united-2.607
ridges-2.484
adders-2.457
 acknow-2.273
ards-2.252
Token on
Token ID319
Feature activation+3.92e+00

Pos logit contributions
ogene+0.685
awan+0.650
vati+0.647
undo+0.642
argument+0.629
Neg logit contributions
united-0.597
ridges-0.558
adders-0.557
 acknow-0.527
ards-0.516
Token the
Token ID262
Feature activation+8.05e+00

Pos logit contributions
ogene+2.560
awan+2.414
vati+2.378
undo+2.368
argument+2.319
Neg logit contributions
united-3.015
ridges-2.851
adders-2.826
 acknow-2.699
ards-2.662
Token Pentagon
Token ID12651
Feature activation+2.54e+00

Pos logit contributions
ogene+5.528
awan+5.212
vati+5.122
undo+5.096
 Manip+5.074
Neg logit contributions
united-5.701
ridges-5.374
adders-5.338
 acknow-5.092
ards-4.972
Token,
Token ID11
Feature activation-1.22e+00

Pos logit contributions
ogene+1.843
awan+1.752
vati+1.734
undo+1.726
argument+1.692
Neg logit contributions
united-1.779
ridges-1.671
adders-1.662
 acknow-1.574
ards-1.541
Token saying
Token ID2282
Feature activation+1.72e+00

Pos logit contributions
united+0.819
ridges+0.763
adders+0.763
 acknow+0.731
ards+0.709
Neg logit contributions
ogene-0.924
awan-0.874
vati-0.874
undo-0.866
argument-0.842
Token that
Token ID326
Feature activation+3.16e+00

Pos logit contributions
ogene+1.335
awan+1.268
vati+1.259
undo+1.254
argument+1.230
Neg logit contributions
united-1.128
ridges-1.052
adders-1.050
 acknow-0.991
ards-0.970
Token he
Token ID339
Feature activation-9.99e-01

Pos logit contributions
ogene+2.292
awan+2.171
vati+2.150
undo+2.142
argument+2.102
Neg logit contributions
united-2.226
ridges-2.086
adders-2.075
 acknow-1.959
ards-1.922
Token (*
Token ID20789
Feature activation-8.48e-01

Pos logit contributions
ogene+0.979
awan+0.923
vati+0.916
undo+0.914
argument+0.893
Neg logit contributions
united-1.024
ridges-0.962
adders-0.959
 acknow-0.914
ards-0.898
Token take
Token ID1011
Feature activation-2.88e+00

Pos logit contributions
united+0.584
ridges+0.547
adders+0.547
 acknow+0.516
ards+0.507
Neg logit contributions
ogene-0.630
awan-0.596
vati-0.594
undo-0.584
argument-0.572
Token the
Token ID262
Feature activation+2.71e+00

Pos logit contributions
united+1.957
adders+1.829
ridges+1.823
 acknow+1.719
ards+1.704
Neg logit contributions
ogene-2.133
awan-2.029
vati-2.016
undo-1.979
 Manip-1.923
Token N
Token ID399
Feature activation+7.14e+00

Pos logit contributions
ogene+2.074
awan+1.962
undo+1.953
vati+1.947
argument+1.918
Neg logit contributions
united-1.799
ridges-1.675
adders-1.673
 acknow-1.577
ards-1.552
TokenNs
Token ID47503
Feature activation+4.21e+00

Pos logit contributions
ogene+5.334
undo+5.083
awan+5.011
argument+4.919
vati+4.875
Neg logit contributions
united-4.631
ridges-4.325
adders-4.183
 acknow-4.156
aunders-3.996
Token from
Token ID422
Feature activation+3.04e+00

Pos logit contributions
ogene+3.039
awan+2.887
undo+2.872
vati+2.853
argument+2.819
Neg logit contributions
united-2.943
ridges-2.742
adders-2.725
 acknow-2.605
 advoc-2.547
Token previous
Token ID2180
Feature activation+3.13e+00

Pos logit contributions
ogene+2.149
awan+2.022
undo+2.012
vati+2.002
argument+1.981
Neg logit contributions
united-2.182
ridges-2.046
adders-2.038
 acknow-1.941
ards-1.913
Token ser
Token ID1055
Feature activation-2.70e+00

Pos logit contributions
ogene+2.163
awan+2.018
undo+2.014
vati+2.013
argument+1.977
Neg logit contributions
united-2.308
ridges-2.157
adders-2.154
 acknow-2.053
ards-2.021
Tokenached
Token ID2317
Feature activation-5.08e-01

Pos logit contributions
united+1.997
ridges+1.890
adders+1.889
 acknow+1.793
ards+1.768
Neg logit contributions
ogene-1.848
awan-1.739
vati-1.734
undo-1.723
 Manip-1.686
Token locations
Token ID7064
Feature activation+1.56e+00

Pos logit contributions
united+0.352
ridges+0.331
adders+0.330
 acknow+0.312
ards+0.307
Neg logit contributions
ogene-0.374
awan-0.355
vati-0.353
undo-0.350
argument-0.342
Token which
Token ID543
Feature activation+1.39e+00

Pos logit contributions
ogene+1.056
awan+0.995
vati+0.989
undo+0.984
argument+0.962
Neg logit contributions
united-1.170
ridges-1.101
adders-1.098
 acknow-1.046
ards-1.027
Token on
Token ID319
Feature activation-3.91e+00

Pos logit contributions
united+2.416
ridges+2.252
adders+2.246
 acknow+2.158
ards+2.116
Neg logit contributions
ogene-2.705
vati-2.543
awan-2.541
undo-2.536
argument-2.450
Token the
Token ID262
Feature activation+1.03e+00

Pos logit contributions
united+2.575
ridges+2.428
adders+2.425
 acknow+2.298
ards+2.266
Neg logit contributions
ogene-2.943
awan-2.781
undo-2.765
vati-2.760
argument-2.670
Token trade
Token ID3292
Feature activation-4.61e+00

Pos logit contributions
ogene+0.786
awan+0.746
vati+0.742
undo+0.737
argument+0.722
Neg logit contributions
united-0.690
ridges-0.645
adders-0.644
 acknow-0.609
ards-0.596
Token front
Token ID2166
Feature activation-5.56e+00

Pos logit contributions
united+3.251
adders+3.039
ridges+2.969
 acknow+2.875
ards+2.871
Neg logit contributions
ogene-3.308
awan-3.138
vati-3.113
undo-3.106
argument-2.992
Token,
Token ID11
Feature activation+2.07e+00

Pos logit contributions
united+3.541
adders+3.249
ridges+3.229
ards+3.076
 acknow+3.016
Neg logit contributions
ogene-4.318
awan-4.094
undo-4.067
vati-4.050
argument-3.908
Token Ireland
Token ID7517
Feature activation+2.74e+00

Pos logit contributions
ogene+1.548
awan+1.474
vati+1.469
undo+1.456
argument+1.431
Neg logit contributions
united-1.403
ridges-1.316
adders-1.311
 acknow-1.241
ards-1.208
Token
Token ID447
Feature activation-2.55e-01

Pos logit contributions
ogene+2.118
awan+2.020
vati+2.008
undo+1.996
argument+1.953
Neg logit contributions
united-1.787
ridges-1.681
adders-1.667
 acknow-1.574
ards-1.533
Token
Token ID247
Feature activation-3.38e-01

Pos logit contributions
united+0.210
adders+0.199
ridges+0.199
 acknow+0.188
ards+0.188
Neg logit contributions
ogene-0.154
awan-0.144
vati-0.143
undo-0.141
argument-0.137
Tokens
Token ID82
Feature activation-1.38e+00

Pos logit contributions
united+0.204
ridges+0.188
adders+0.188
 acknow+0.176
ards+0.174
Neg logit contributions
ogene-0.281
awan-0.267
vati-0.265
undo-0.264
argument-0.259
Token national
Token ID2260
Feature activation-4.54e+00

Pos logit contributions
united+0.896
ridges+0.835
adders+0.833
 acknow+0.793
ards+0.776
Neg logit contributions
ogene-1.071
awan-1.014
vati-1.003
undo-1.001
argument-0.980
Token accounts
Token ID5504
Feature activation+1.70e+00

Pos logit contributions
united+2.521
adders+2.295
ridges+2.288
 acknow+2.171
ards+2.111
Neg logit contributions
ogene-3.961
awan-3.766
undo-3.764
vati-3.736
argument-3.668
Token
Token ID198
Feature activation-8.65e-03

Pos logit contributions
united+2.542
adders+2.316
ridges+2.278
ards+2.198
 acknow+2.144
Neg logit contributions
ogene-3.056
vati-2.895
undo-2.872
awan-2.870
argument-2.813
Token
Token ID198
Feature activation-9.10e-01

Pos logit contributions
united+0.006
ridges+0.005
adders+0.005
ards+0.005
 acknow+0.005
Neg logit contributions
ogene-0.007
vati-0.006
awan-0.006
undo-0.006
argument-0.006
Token
Token ID447
Feature activation-2.12e+00

Pos logit contributions
united+0.609
adders+0.560
ridges+0.558
ards+0.522
 acknow+0.519
Neg logit contributions
ogene-0.703
awan-0.663
vati-0.662
undo-0.654
argument-0.638
Token
Token ID250
Feature activation-5.21e+00

Pos logit contributions
united+1.430
ridges+1.329
adders+1.328
 acknow+1.265
ards+1.239
Neg logit contributions
ogene-1.616
awan-1.522
vati-1.506
undo-1.496
argument-1.466
TokenHaw
Token ID33055
Feature activation+4.32e-01

Pos logit contributions
united+3.589
adders+3.245
ridges+3.231
ards+3.104
 acknow+3.102
Neg logit contributions
ogene-3.837
awan-3.602
vati-3.585
undo-3.529
argument-3.510
Tokenaii
Token ID42648
Feature activation+1.79e+00

Pos logit contributions
ogene+0.292
awan+0.276
vati+0.274
undo+0.272
argument+0.265
Neg logit contributions
united-0.326
ridges-0.307
adders-0.306
 acknow-0.293
ards-0.286
Token has
Token ID468
Feature activation+1.36e+00

Pos logit contributions
ogene+1.128
awan+1.059
vati+1.052
undo+1.041
argument+1.016
Neg logit contributions
united-1.430
ridges-1.357
adders-1.352
 acknow-1.290
ards-1.267
Token several
Token ID1811
Feature activation+1.98e+00

Pos logit contributions
ogene+1.048
awan+0.995
vati+0.990
undo+0.984
argument+0.963
Neg logit contributions
united-0.893
ridges-0.834
adders-0.832
 acknow-0.786
ards-0.769
Token good
Token ID922
Feature activation+3.07e+00

Pos logit contributions
ogene+1.535
awan+1.458
vati+1.449
undo+1.439
argument+1.409
Neg logit contributions
united-1.295
ridges-1.210
adders-1.207
 acknow-1.140
ards-1.116
Token legislators
Token ID19964
Feature activation+2.43e+00

Pos logit contributions
ogene+2.327
awan+2.210
vati+2.201
undo+2.178
argument+2.138
Neg logit contributions
united-2.038
adders-1.909
ridges-1.907
 acknow-1.807
ards-1.753
Token who
Token ID508
Feature activation+2.92e+00

Pos logit contributions
ogene+1.822
awan+1.729
vati+1.717
undo+1.707
argument+1.669
Neg logit contributions
united-1.629
ridges-1.545
adders-1.537
 acknow-1.448
ards-1.414
Token,
Token ID11
Feature activation-1.16e+00

Pos logit contributions
united+3.494
adders+3.266
ridges+3.253
 acknow+3.064
ards+3.032
Neg logit contributions
ogene-4.409
undo-4.224
awan-4.167
vati-4.086
argument-4.014
Token got
Token ID1392
Feature activation+5.44e+00

Pos logit contributions
united+0.705
adders+0.654
ridges+0.652
 acknow+0.622
ards+0.601
Neg logit contributions
ogene-0.952
awan-0.906
vati-0.899
undo-0.898
argument-0.877
Token crushed
Token ID18577
Feature activation+7.07e-01

Pos logit contributions
ogene+3.491
vati+3.326
awan+3.314
undo+3.202
 Manip+3.188
Neg logit contributions
united-4.219
adders-3.971
ridges-3.953
 acknow-3.735
ards-3.703
Token under
Token ID739
Feature activation+6.01e-01

Pos logit contributions
ogene+0.508
awan+0.480
vati+0.478
undo+0.474
argument+0.463
Neg logit contributions
united-0.503
ridges-0.473
adders-0.472
 acknow-0.448
ards-0.439
Token her
Token ID607
Feature activation+6.80e-01

Pos logit contributions
ogene+0.468
awan+0.444
vati+0.441
undo+0.439
argument+0.430
Neg logit contributions
united-0.392
ridges-0.367
adders-0.366
 acknow-0.345
ards-0.338
Token weight
Token ID3463
Feature activation+3.45e-01

Pos logit contributions
ogene+0.520
awan+0.493
vati+0.490
undo+0.488
argument+0.475
Neg logit contributions
united-0.453
ridges-0.423
adders-0.423
 acknow-0.402
ards-0.392
Token and
Token ID290
Feature activation+4.22e+00

Pos logit contributions
ogene+0.265
awan+0.251
vati+0.249
undo+0.248
argument+0.242
Neg logit contributions
united-0.229
ridges-0.213
adders-0.213
 acknow-0.202
ards-0.198
Token suffered
Token ID6989
Feature activation-3.74e+00

Pos logit contributions
ogene+3.267
vati+3.105
awan+3.093
 Manip+3.041
undo+3.033
Neg logit contributions
united-2.729
ridges-2.575
adders-2.552
 acknow-2.361
ards-2.349
Token serious
Token ID2726
Feature activation-2.46e-01

Pos logit contributions
united+2.229
ridges+2.105
adders+2.078
 acknow+2.004
ards+1.920
Neg logit contributions
ogene-3.043
awan-2.918
undo-2.905
vati-2.876
argument-2.821
Token head
Token ID1182
Feature activation-5.50e+00

Pos logit contributions
united+0.130
ridges+0.120
adders+0.119
 acknow+0.111
ards+0.108
Neg logit contributions
ogene-0.221
awan-0.212
vati-0.211
undo-0.210
argument-0.206
Token injuries
Token ID6821
Feature activation-4.92e+00

Pos logit contributions
united+2.517
ridges+2.316
adders+2.275
 acknow+2.068
ards+2.023
Neg logit contributions
ogene-5.260
awan-5.059
vati-5.021
undo-5.000
argument-4.931
Token disagreements
Token ID37990
Feature activation-2.75e+00

Pos logit contributions
united+0.223
ridges+0.210
adders+0.209
 acknow+0.199
ards+0.195
Neg logit contributions
ogene-0.230
awan-0.219
vati-0.217
undo-0.216
argument-0.210
Token in
Token ID287
Feature activation+1.51e+00

Pos logit contributions
united+1.789
ridges+1.659
adders+1.658
 acknow+1.607
ards+1.563
Neg logit contributions
ogene-2.110
awan-2.012
vati-1.996
undo-1.980
argument-1.924
Token rating
Token ID7955
Feature activation+3.23e+00

Pos logit contributions
ogene+1.106
awan+1.047
vati+1.041
undo+1.034
argument+1.015
Neg logit contributions
united-1.058
ridges-0.993
adders-0.991
 acknow-0.939
ards-0.920
Token may
Token ID743
Feature activation-1.17e+00

Pos logit contributions
ogene+2.374
awan+2.244
vati+2.222
undo+2.214
argument+2.180
Neg logit contributions
united-2.229
ridges-2.097
adders-2.078
 acknow-1.961
ards-1.933
Token not
Token ID407
Feature activation+6.24e-01

Pos logit contributions
united+0.717
adders+0.663
ridges+0.659
 acknow+0.629
ards+0.618
Neg logit contributions
ogene-0.957
vati-0.910
awan-0.907
undo-0.896
argument-0.875
Token reflect
Token ID4079
Feature activation-2.29e+00

Pos logit contributions
ogene+0.505
awan+0.481
vati+0.478
undo+0.475
argument+0.466
Neg logit contributions
united-0.387
ridges-0.360
adders-0.359
 acknow-0.338
ards-0.330
Token true
Token ID2081
Feature activation+2.29e+00

Pos logit contributions
united+1.469
ridges+1.368
adders+1.355
 acknow+1.307
ards+1.274
Neg logit contributions
ogene-1.792
awan-1.702
undo-1.688
vati-1.687
argument-1.635
Token disagreement
Token ID25800
Feature activation-5.20e+00

Pos logit contributions
ogene+1.793
awan+1.698
vati+1.691
undo+1.680
argument+1.654
Neg logit contributions
united-1.471
adders-1.380
ridges-1.375
 acknow-1.287
ards-1.266
Token;
Token ID26
Feature activation+1.91e+00

Pos logit contributions
united+3.137
adders+2.883
ridges+2.876
 acknow+2.775
ards+2.746
Neg logit contributions
ogene-4.170
awan-4.010
vati-3.942
undo-3.924
 Manip-3.797
Token they
Token ID484
Feature activation+3.72e+00

Pos logit contributions
ogene+1.479
awan+1.403
vati+1.396
undo+1.386
argument+1.362
Neg logit contributions
united-1.248
ridges-1.166
adders-1.163
 acknow-1.097
ards-1.074
Token may
Token ID743
Feature activation-8.22e-01

Pos logit contributions
ogene+2.496
awan+2.346
vati+2.342
undo+2.322
argument+2.272
Neg logit contributions
united-2.795
ridges-2.655
adders-2.648
 acknow-2.491
ards-2.448
Tokenco
Token ID1073
Feature activation-7.14e+00

Pos logit contributions
united+1.023
ridges+0.949
adders+0.944
 acknow+0.886
ards+0.882
Neg logit contributions
ogene-1.058
awan-0.995
vati-0.990
undo-0.976
 Manip-0.954
Tokenach
Token ID620
Feature activation+3.83e+00

Pos logit contributions
united+4.121
ridges+3.898
adders+3.751
ards+3.566
 acknow+3.474
Neg logit contributions
ogene-5.983
vati-5.662
awan-5.662
undo-5.560
 Manip-5.426
Token),
Token ID828
Feature activation-6.87e-01

Pos logit contributions
ogene+2.472
awan+2.347
undo+2.319
vati+2.301
argument+2.244
Neg logit contributions
united-2.945
ridges-2.815
adders-2.811
 acknow-2.676
 advoc-2.606
Token Richard
Token ID6219
Feature activation-7.31e+00

Pos logit contributions
united+0.485
ridges+0.453
adders+0.452
 acknow+0.430
ards+0.422
Neg logit contributions
ogene-0.498
awan-0.471
vati-0.469
undo-0.465
argument-0.455
Token Irving
Token ID26101
Feature activation-1.69e+00

Pos logit contributions
united+4.617
ridges+4.275
adders+4.235
ards+4.048
aunders+3.903
Neg logit contributions
ogene-5.612
vati-5.368
awan-5.239
undo-5.237
argument-5.212
Token,
Token ID11
Feature activation-8.94e-01

Pos logit contributions
united+1.158
ridges+1.081
adders+1.075
 acknow+1.019
ards+1.008
Neg logit contributions
ogene-1.260
awan-1.193
vati-1.185
undo-1.177
argument-1.153
Token Paul
Token ID3362
Feature activation-6.28e+00

Pos logit contributions
united+0.631
ridges+0.590
adders+0.588
 acknow+0.560
ards+0.550
Neg logit contributions
ogene-0.648
awan-0.612
vati-0.610
undo-0.605
argument-0.593
Token Sch
Token ID3059
Feature activation-4.02e+00

Pos logit contributions
united+3.714
ridges+3.407
adders+3.360
ards+3.144
 acknow+3.042
Neg logit contributions
ogene-5.171
vati-4.983
awan-4.875
undo-4.865
argument-4.777
Tokenoles
Token ID4316
Feature activation-3.00e-01

Pos logit contributions
united+1.849
ridges+1.691
adders+1.671
ards+1.495
aunders+1.445
Neg logit contributions
ogene-3.870
vati-3.705
awan-3.687
undo-3.680
argument-3.630
Token,
Token ID11
Feature activation-7.98e-01

Pos logit contributions
united+0.212
ridges+0.199
adders+0.199
 acknow+0.189
ards+0.185
Neg logit contributions
ogene-0.217
awan-0.205
vati-0.204
undo-0.203
argument-0.198
Token Mark
Token ID2940
Feature activation-5.52e-01

Pos logit contributions
united+0.561
ridges+0.524
adders+0.523
 acknow+0.497
ards+0.488
Neg logit contributions
ogene-0.581
awan-0.549
vati-0.547
undo-0.542
argument-0.531
TokenWhen
Token ID2215
Feature activation+1.61e+00

Pos logit contributions
ogene+1.329
awan+1.264
vati+1.256
undo+1.254
argument+1.229
Neg logit contributions
united-1.270
ridges-1.192
adders-1.191
 acknow-1.144
ards-1.107
Token I
Token ID314
Feature activation-1.65e+00

Pos logit contributions
ogene+1.147
awan+1.084
vati+1.078
undo+1.071
argument+1.049
Neg logit contributions
united-1.153
adders-1.081
ridges-1.081
 acknow-1.024
ards-1.003
Token joined
Token ID5399
Feature activation-4.91e+00

Pos logit contributions
united+0.986
ridges+0.914
adders+0.913
 acknow+0.865
ards+0.842
Neg logit contributions
ogene-1.369
awan-1.300
vati-1.300
undo-1.290
argument-1.265
Token Netflix
Token ID12074
Feature activation+1.55e+00

Pos logit contributions
united+3.258
adders+3.059
ridges+3.049
 acknow+2.896
ards+2.832
Neg logit contributions
ogene-3.694
vati-3.472
awan-3.459
undo-3.432
argument-3.349
Token in
Token ID287
Feature activation-2.48e-01

Pos logit contributions
ogene+1.099
awan+1.039
vati+1.033
undo+1.026
argument+1.003
Neg logit contributions
united-1.115
ridges-1.047
adders-1.044
 acknow-0.994
ards-0.973
Token 2014
Token ID1946
Feature activation-2.71e+00

Pos logit contributions
united+0.207
ridges+0.196
adders+0.196
 acknow+0.188
ards+0.185
Neg logit contributions
ogene-0.147
awan-0.137
vati-0.136
undo-0.135
argument-0.131
Token,
Token ID11
Feature activation-2.36e+00

Pos logit contributions
united+1.811
adders+1.703
ridges+1.700
 acknow+1.602
ards+1.592
Neg logit contributions
ogene-2.029
awan-1.927
vati-1.918
undo-1.910
argument-1.858
Token we
Token ID356
Feature activation-3.28e+00

Pos logit contributions
united+1.667
ridges+1.585
adders+1.582
 acknow+1.502
ards+1.475
Neg logit contributions
ogene-1.683
awan-1.592
vati-1.580
undo-1.575
argument-1.526
Token had
Token ID550
Feature activation+5.59e-01

Pos logit contributions
united+2.040
ridges+1.884
adders+1.879
 acknow+1.801
ards+1.750
Neg logit contributions
ogene-2.642
awan-2.508
vati-2.490
undo-2.466
argument-2.430
Token begun
Token ID9258
Feature activation-4.28e+00

Pos logit contributions
ogene+0.397
awan+0.375
vati+0.373
undo+0.371
argument+0.362
Neg logit contributions
united-0.402
ridges-0.378
adders-0.377
 acknow-0.359
ards-0.352
Token transitioning
Token ID37005
Feature activation-2.14e+00

Pos logit contributions
united+2.949
adders+2.800
ridges+2.790
 acknow+2.732
ards+2.650
Neg logit contributions
ogene-3.045
vati-2.873
awan-2.856
undo-2.835
argument-2.747
Token things
Token ID1243
Feature activation+6.09e+00

Pos logit contributions
ogene+3.078
undo+2.898
awan+2.893
vati+2.862
 Manip+2.810
Neg logit contributions
united-2.549
ridges-2.344
adders-2.313
 acknow-2.188
ards-2.148
Token don
Token ID836
Feature activation+3.89e+00

Pos logit contributions
ogene+3.325
undo+3.137
awan+3.088
vati+3.056
 Manip+2.932
Neg logit contributions
united-5.299
ridges-5.046
adders-5.002
 advoc-4.748
 acknow-4.727
Token't
Token ID470
Feature activation+7.57e+00

Pos logit contributions
ogene+2.342
awan+2.209
undo+2.173
vati+2.148
argument+2.106
Neg logit contributions
united-3.154
ridges-3.019
adders-3.002
 acknow-2.992
 advoc-2.967
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
ogene+5.215
undo+4.973
awan+4.950
vati+4.843
argument+4.718
Neg logit contributions
united-5.336
ridges-5.031
adders-4.977
 acknow-4.735
 advoc-4.691
Token
Token ID198
Feature activation+1.23e+00

Pos logit contributions
united+0.802
adders+0.752
ridges+0.750
 acknow+0.709
ards+0.707
Neg logit contributions
ogene-0.766
vati-0.723
awan-0.718
undo-0.714
argument-0.691
Token
Token ID198
Feature activation-2.54e+00

Pos logit contributions
ogene+0.932
awan+0.884
vati+0.879
undo+0.874
argument+0.855
Neg logit contributions
united-0.828
ridges-0.775
adders-0.773
 acknow-0.734
ards-0.716
TokenI
Token ID40
Feature activation+6.27e-01

Pos logit contributions
united+1.617
ridges+1.473
adders+1.472
ards+1.380
 acknow+1.317
Neg logit contributions
ogene-2.040
vati-1.929
awan-1.909
undo-1.889
 Manip-1.843
Token'm
Token ID1101
Feature activation-1.82e+00

Pos logit contributions
ogene+0.478
awan+0.453
vati+0.451
undo+0.447
argument+0.437
Neg logit contributions
united-0.420
ridges-0.393
adders-0.392
 acknow-0.369
ards-0.361
Token sure
Token ID1654
Feature activation-3.37e+00

Pos logit contributions
united+1.167
ridges+1.069
adders+1.066
 acknow+1.027
ards+1.008
Neg logit contributions
ogene-1.416
vati-1.366
awan-1.360
undo-1.343
argument-1.320
Token there
Token ID612
Feature activation-6.00e+00

Pos logit contributions
united+2.185
ridges+2.034
adders+2.027
 acknow+1.936
ards+1.919
Neg logit contributions
ogene-2.593
vati-2.472
undo-2.455
awan-2.445
argument-2.353
Token's
Token ID338
Feature activation-3.40e+00

Pos logit contributions
united+3.662
adders+3.484
ridges+3.440
ards+3.349
 acknow+3.292
Neg logit contributions
ogene-4.600
vati-4.474
undo-4.430
awan-4.423
argument-4.262
Token touching
Token ID15241
Feature activation-8.82e+00

Pos logit contributions
united+3.105
adders+2.945
ridges+2.933
 acknow+2.814
ards+2.739
Neg logit contributions
ogene-2.942
awan-2.804
vati-2.782
undo-2.745
argument-2.684
Token the
Token ID262
Feature activation-2.17e+00

Pos logit contributions
united+5.124
ridges+5.030
adders+4.931
 acknow+4.559
ards+4.533
Neg logit contributions
ogene-6.836
awan-6.524
vati-6.519
undo-6.443
argument-6.437
Token ground
Token ID2323
Feature activation-8.47e-01

Pos logit contributions
united+1.383
ridges+1.298
adders+1.294
 acknow+1.222
ards+1.187
Neg logit contributions
ogene-1.713
awan-1.629
vati-1.623
undo-1.602
argument-1.580
Token in
Token ID287
Feature activation-2.79e+00

Pos logit contributions
united+0.534
ridges+0.497
adders+0.496
 acknow+0.467
ards+0.459
Neg logit contributions
ogene-0.676
awan-0.643
vati-0.641
undo-0.636
argument-0.623
Token co
Token ID763
Feature activation-7.18e+00

Pos logit contributions
united+2.009
adders+1.899
ridges+1.895
 acknow+1.799
ards+1.767
Neg logit contributions
ogene-1.970
awan-1.860
vati-1.850
undo-1.840
argument-1.801
Token-
Token ID12
Feature activation-9.14e-01

Pos logit contributions
united+3.616
adders+3.212
ridges+3.198
ards+2.907
aunders+2.684
Neg logit contributions
ogene-6.479
vati-6.351
awan-6.305
undo-6.119
 Manip-6.111
Tokenop
Token ID404
Feature activation-6.92e-01

Pos logit contributions
united+0.685
ridges+0.646
adders+0.645
 acknow+0.612
ards+0.602
Neg logit contributions
ogene-0.622
awan-0.586
vati-0.583
undo-0.577
argument-0.563
Token (
Token ID357
Feature activation+8.89e-01

Pos logit contributions
united+0.441
ridges+0.412
adders+0.412
 acknow+0.388
ards+0.380
Neg logit contributions
ogene-0.548
awan-0.521
vati-0.519
undo-0.514
argument-0.504
Token6
Token ID21
Feature activation+3.45e+00

Pos logit contributions
ogene+0.574
awan+0.539
vati+0.536
undo+0.531
argument+0.516
Neg logit contributions
united-0.700
ridges-0.659
adders-0.658
 acknow-0.626
ards-0.618
Token)
Token ID8
Feature activation+2.95e-01

Pos logit contributions
ogene+1.914
awan+1.784
vati+1.776
undo+1.770
 Manip+1.703
Neg logit contributions
united-2.984
ridges-2.819
adders-2.806
 acknow-2.717
ards-2.648
Token
Token ID198
Feature activation+7.38e-01

Pos logit contributions
ogene+0.220
awan+0.209
vati+0.208
undo+0.207
argument+0.202
Neg logit contributions
united-0.201
ridges-0.188
adders-0.187
 acknow-0.178
ards-0.174
Token Views
Token ID29978
Feature activation-3.94e-01

Pos logit contributions
united+3.274
ridges+3.136
adders+3.124
ards+2.970
 acknow+2.881
Neg logit contributions
ogene-3.415
undo-3.217
vati-3.207
awan-3.178
argument-3.076
Token."
Token ID526
Feature activation+1.64e+00

Pos logit contributions
united+0.276
ridges+0.259
adders+0.258
 acknow+0.245
ards+0.241
Neg logit contributions
ogene-0.287
awan-0.271
vati-0.270
undo-0.268
argument-0.262
Token...
Token ID986
Feature activation-1.79e+00

Pos logit contributions
ogene+1.171
awan+1.109
vati+1.102
undo+1.095
argument+1.071
Neg logit contributions
united-1.169
ridges-1.100
adders-1.098
 acknow-1.043
ards-1.019
Token
Token ID198
Feature activation+7.60e-01

Pos logit contributions
united+1.188
ridges+1.107
adders+1.102
 acknow+1.042
ards+1.036
Neg logit contributions
ogene-1.369
awan-1.285
vati-1.284
undo-1.269
argument-1.239
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
ogene+0.585
awan+0.554
vati+0.553
undo+0.548
argument+0.536
Neg logit contributions
united-0.504
ridges-0.469
adders-0.468
 acknow-0.440
ards-0.435
TokenCharles
Token ID28711
Feature activation-1.08e+01

Pos logit contributions
ogene+1.071
awan+1.017
vati+1.011
undo+1.007
argument+0.986
Neg logit contributions
united-0.961
ridges-0.902
adders-0.900
 acknow-0.857
ards-0.833
Token Murray
Token ID12164
Feature activation-1.02e+00

Pos logit contributions
united+6.421
ridges+6.225
adders+5.910
ards+5.889
 acknow+5.833
Neg logit contributions
ogene-8.282
vati-7.787
awan-7.710
undo-7.499
aimon-7.458
Token just
Token ID655
Feature activation+1.13e+00

Pos logit contributions
united+0.708
ridges+0.662
adders+0.661
 acknow+0.634
ards+0.621
Neg logit contributions
ogene-0.756
awan-0.714
vati-0.711
undo-0.707
argument-0.687
Token tweeted
Token ID10830
Feature activation+1.51e+00

Pos logit contributions
ogene+0.866
awan+0.822
vati+0.817
undo+0.812
argument+0.795
Neg logit contributions
united-0.756
ridges-0.707
adders-0.706
 acknow-0.666
ards-0.652
Token the
Token ID262
Feature activation-2.68e+00

Pos logit contributions
ogene+1.111
awan+1.053
vati+1.046
undo+1.041
argument+1.020
Neg logit contributions
united-1.047
adders-0.982
ridges-0.981
 acknow-0.924
ards-0.907
Token following
Token ID1708
Feature activation+9.29e-02

Pos logit contributions
united+1.783
ridges+1.669
adders+1.656
 acknow+1.594
ards+1.552
Neg logit contributions
ogene-2.035
awan-1.925
vati-1.922
undo-1.902
argument-1.849
Token his
Token ID465
Feature activation-1.22e+00

Pos logit contributions
ogene+1.794
awan+1.679
vati+1.665
undo+1.660
 Manip+1.617
Neg logit contributions
united-2.126
adders-2.007
ridges-1.990
 acknow-1.896
ards-1.869
Token prog
Token ID1172
Feature activation+2.90e+00

Pos logit contributions
united+0.838
ridges+0.787
adders+0.782
 acknow+0.750
ards+0.731
Neg logit contributions
ogene-0.899
awan-0.853
vati-0.850
undo-0.845
argument-0.826
Tokeneny
Token ID28558
Feature activation+8.99e-01

Pos logit contributions
ogene+2.106
awan+2.009
undo+1.980
vati+1.977
argument+1.921
Neg logit contributions
united-2.020
ridges-1.917
adders-1.876
 acknow-1.800
 advoc-1.749
Token,
Token ID11
Feature activation-5.04e+00

Pos logit contributions
ogene+0.647
awan+0.612
vati+0.609
undo+0.605
argument+0.590
Neg logit contributions
united-0.636
ridges-0.599
adders-0.598
 acknow-0.567
ards-0.552
Token that
Token ID326
Feature activation-8.65e+00

Pos logit contributions
united+2.600
ridges+2.427
adders+2.370
 acknow+2.244
ards+2.189
Neg logit contributions
ogene-4.514
vati-4.327
awan-4.322
undo-4.259
argument-4.197
Token he
Token ID339
Feature activation-1.15e+01

Pos logit contributions
united+5.045
ridges+4.852
adders+4.630
 acknow+4.595
ards+4.471
Neg logit contributions
ogene-6.660
vati-6.530
awan-6.470
undo-6.405
argument-6.251
Token began
Token ID2540
Feature activation-9.28e+00

Pos logit contributions
united+6.401
 acknow+5.974
ridges+5.570
ards+5.524
adders+5.507
Neg logit contributions
ogene-9.044
awan-8.802
vati-8.761
undo-8.577
argument-8.515
Token to
Token ID284
Feature activation-5.68e+00

Pos logit contributions
united+5.821
 acknow+5.508
ridges+5.473
adders+5.370
 advoc+5.360
Neg logit contributions
ogene-6.638
vati-6.367
awan-6.362
undo-6.214
argument-6.001
Token turn
Token ID1210
Feature activation-2.77e+00

Pos logit contributions
united+3.670
 acknow+3.459
adders+3.410
ridges+3.406
ards+3.184
Neg logit contributions
ogene-4.233
vati-4.117
awan-4.019
undo-3.933
argument-3.884
Token to
Token ID284
Feature activation-5.09e+00

Pos logit contributions
united+1.901
ridges+1.778
adders+1.774
 acknow+1.707
ards+1.675
Neg logit contributions
ogene-2.042
awan-1.931
vati-1.919
undo-1.895
argument-1.861
Token writing
Token ID3597
Feature activation-6.72e+00

Pos logit contributions
united+3.364
ridges+3.188
adders+3.115
 acknow+3.069
 advoc+3.028
Neg logit contributions
ogene-3.721
awan-3.557
vati-3.553
undo-3.480
argument-3.401
Token.'
Token ID2637
Feature activation-4.02e+00

Pos logit contributions
united+1.626
adders+1.508
ridges+1.502
 acknow+1.429
ards+1.425
Neg logit contributions
ogene-1.492
awan-1.411
vati-1.406
undo-1.395
argument-1.362
Token
Token ID198
Feature activation-5.49e+00

Pos logit contributions
united+2.466
ridges+2.294
adders+2.282
ards+2.129
 acknow+2.115
Neg logit contributions
ogene-3.253
awan-3.066
vati-3.062
undo-3.046
argument-2.988
Token
Token ID198
Feature activation-3.69e+00

Pos logit contributions
united+2.996
adders+2.639
ridges+2.614
ards+2.540
aunders+2.293
Neg logit contributions
ogene-4.861
vati-4.592
undo-4.508
awan-4.478
opted-4.373
TokenThe
Token ID464
Feature activation-4.96e+00

Pos logit contributions
united+2.722
adders+2.509
ridges+2.493
ards+2.374
 acknow+2.306
Neg logit contributions
ogene-2.586
vati-2.415
awan-2.394
undo-2.378
argument-2.293
Token comment
Token ID2912
Feature activation-5.43e+00

Pos logit contributions
united+3.392
adders+3.125
ridges+3.122
 acknow+3.023
ards+2.934
Neg logit contributions
ogene-3.605
awan-3.438
undo-3.435
vati-3.430
argument-3.318
Token seemed
Token ID3947
Feature activation-1.09e+01

Pos logit contributions
united+3.174
adders+2.911
 acknow+2.841
ridges+2.805
ards+2.776
Neg logit contributions
ogene-4.453
vati-4.278
undo-4.256
awan-4.217
argument-4.132
Token a
Token ID257
Feature activation-8.56e+00

Pos logit contributions
united+5.863
ridges+5.186
adders+5.154
ards+5.080
 acknow+5.068
Neg logit contributions
ogene-8.969
undo-8.604
vati-8.575
awan-8.386
aimon-8.187
Token rebuke
Token ID45830
Feature activation-1.30e+00

Pos logit contributions
united+5.122
ridges+4.512
 acknow+4.501
adders+4.447
ards+4.357
Neg logit contributions
ogene-6.726
vati-6.469
awan-6.410
undo-6.380
argument-6.248
Token of
Token ID286
Feature activation+3.12e+00

Pos logit contributions
united+0.868
ridges+0.808
adders+0.807
 acknow+0.768
ards+0.755
Neg logit contributions
ogene-0.990
awan-0.941
vati-0.937
undo-0.930
argument-0.908
Token President
Token ID1992
Feature activation+6.89e-01

Pos logit contributions
ogene+2.101
awan+1.982
vati+1.968
undo+1.934
argument+1.904
Neg logit contributions
united-2.350
ridges-2.215
adders-2.208
 acknow-2.096
ards-2.063
Token-
Token ID12
Feature activation+4.78e+00

Pos logit contributions
ogene+0.332
awan+0.306
vati+0.302
undo+0.300
argument+0.290
Neg logit contributions
united-0.652
ridges-0.622
adders-0.621
 acknow-0.598
ards-0.588
Token cared
Token ID19951
Feature activation-5.09e+00

Pos logit contributions
ogene+0.858
vati+0.814
awan+0.812
undo+0.804
argument+0.791
Neg logit contributions
united-0.656
ridges-0.611
adders-0.610
 acknow-0.564
ards-0.554
Token for
Token ID329
Feature activation-2.59e+00

Pos logit contributions
united+3.134
ridges+2.963
adders+2.915
 acknow+2.775
ards+2.757
Neg logit contributions
ogene-4.007
undo-3.865
awan-3.846
vati-3.825
argument-3.689
Token her
Token ID607
Feature activation-1.11e+00

Pos logit contributions
united+1.947
ridges+1.858
adders+1.833
 acknow+1.755
ards+1.733
Neg logit contributions
ogene-1.716
awan-1.628
vati-1.612
undo-1.612
argument-1.569
Token,
Token ID11
Feature activation+3.30e+00

Pos logit contributions
united+0.756
ridges+0.709
adders+0.705
 acknow+0.676
ards+0.662
Neg logit contributions
ogene-0.829
awan-0.788
undo-0.783
vati-0.782
argument-0.760
Token but
Token ID475
Feature activation-6.65e+00

Pos logit contributions
ogene+2.366
vati+2.235
awan+2.216
undo+2.191
 Manip+2.170
Neg logit contributions
united-2.338
adders-2.204
ridges-2.195
 acknow-2.043
ards-2.024
Token something
Token ID1223
Feature activation-1.20e+01

Pos logit contributions
united+3.999
ridges+3.837
 acknow+3.708
adders+3.696
ards+3.538
Neg logit contributions
ogene-5.208
awan-5.029
undo-4.943
vati-4.935
argument-4.784
Token of
Token ID286
Feature activation-6.71e+00

Pos logit contributions
united+6.216
ridges+5.902
 darkest+5.832
 acknow+5.716
adders+5.658
Neg logit contributions
ogene-9.762
vati-9.368
awan-9.259
undo-9.149
argument-8.976
Token what
Token ID644
Feature activation-5.20e+00

Pos logit contributions
united+4.410
ridges+4.345
adders+4.135
 acknow+4.014
 pity+3.889
Neg logit contributions
ogene-4.842
awan-4.614
undo-4.610
vati-4.546
argument-4.437
Token they
Token ID484
Feature activation-4.48e+00

Pos logit contributions
united+2.934
ridges+2.768
adders+2.689
 acknow+2.642
ards+2.584
Neg logit contributions
ogene-4.277
awan-4.147
undo-4.143
vati-4.135
argument-4.006
Token had
Token ID550
Feature activation-3.86e+00

Pos logit contributions
united+2.541
adders+2.303
ridges+2.300
 acknow+2.225
ards+2.155
Neg logit contributions
ogene-3.850
awan-3.663
undo-3.644
vati-3.628
argument-3.550
Token together
Token ID1978
Feature activation-7.89e+00

Pos logit contributions
united+2.344
ridges+2.148
adders+2.141
 acknow+2.063
ards+2.009
Neg logit contributions
ogene-3.180
awan-3.016
undo-2.987
vati-2.983
argument-2.902
Token adults
Token ID6490
Feature activation-2.87e+00

Pos logit contributions
united+4.526
ridges+4.242
adders+4.179
 acknow+4.008
ards+4.005
Neg logit contributions
ogene-5.236
awan-5.003
vati-5.002
undo-4.874
argument-4.802
Token that
Token ID326
Feature activation-2.30e+00

Pos logit contributions
united+1.729
ridges+1.600
adders+1.582
 acknow+1.496
ards+1.482
Neg logit contributions
ogene-2.363
awan-2.253
vati-2.249
undo-2.234
argument-2.184
Token had
Token ID550
Feature activation+8.46e-01

Pos logit contributions
united+1.390
ridges+1.291
adders+1.284
 acknow+1.222
ards+1.192
Neg logit contributions
ogene-1.883
vati-1.792
awan-1.792
undo-1.780
argument-1.743
Token been
Token ID587
Feature activation+4.20e+00

Pos logit contributions
ogene+0.674
awan+0.641
vati+0.638
undo+0.633
argument+0.620
Neg logit contributions
united-0.536
ridges-0.500
adders-0.499
 acknow-0.470
ards-0.459
Token ignored
Token ID9514
Feature activation+7.91e-01

Pos logit contributions
ogene+3.151
awan+2.989
vati+2.964
undo+2.936
 circulating+2.907
Neg logit contributions
united-2.807
adders-2.671
ridges-2.632
 acknow-2.450
ards-2.402
Token for
Token ID329
Feature activation-1.05e+01

Pos logit contributions
ogene+0.560
awan+0.529
vati+0.526
undo+0.523
argument+0.510
Neg logit contributions
united-0.572
ridges-0.537
adders-0.536
 acknow-0.510
ards-0.500
Token the
Token ID262
Feature activation-6.91e+00

Pos logit contributions
united+5.870
ridges+5.540
ards+5.387
adders+5.359
 acknow+5.298
Neg logit contributions
ogene-8.068
awan-7.691
vati-7.674
undo-7.555
argument-7.271
Token past
Token ID1613
Feature activation-3.67e+00

Pos logit contributions
united+4.709
ridges+4.512
adders+4.358
 acknow+4.220
 darkest+4.214
Neg logit contributions
ogene-4.895
vati-4.652
awan-4.584
undo-4.579
argument-4.437
Token decade
Token ID5707
Feature activation+6.77e-01

Pos logit contributions
united+2.875
ridges+2.723
adders+2.667
 acknow+2.600
ards+2.586
Neg logit contributions
ogene-2.307
awan-2.200
vati-2.164
undo-2.163
argument-2.064
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
ogene+0.531
awan+0.505
vati+0.502
undo+0.499
argument+0.489
Neg logit contributions
united-0.437
ridges-0.408
adders-0.407
 acknow-0.384
ards-0.376
Token
Token ID198
Feature activation+8.52e-01

Pos logit contributions
united+0.926
ridges+0.868
adders+0.860
 acknow+0.809
ards+0.807
Neg logit contributions
ogene-1.040
vati-0.985
awan-0.980
undo-0.970
argument-0.950