TOP ACTIVATIONS
MAX = 2.344

Man é , Adam L all ana and Jordan Henderson .
destruction . <|endoftext|> Now playing : Watch this : US Army
Fund chief Domin ique Strauss - K ahn . <|endoftext|> SAN
as Slav oj Ž i ž ek has said
D ahl ia Lith wick has a rundown of the
Scott Ric her t and David Gordon ( IS I Books
a -- holes in American politics ' MORE , told 77
a -- holes in American politics ' MORE as a 68
a -- holes in American politics ' MORE completed a swing
Cant well ( Team Sax o Bank ) s . t
I 'll make the large corporations pay ' fair share of
first specimens in southwestern Arn hem Land . Another common name
- Kim summit Manafort 's attorneys say he should get less
word by Patrick J . Buchanan , and critical essays by
book , Re claim ing the American Right : The Lost
this content : Sylv ie von Du ug las - I
Kim summit Manafort 's attorneys say he should get less than
Hillary Diane Rodham Clinton Sanders : ' I fully expect '
, but coach Joel Qu enne ville has already broken up
Hillary Diane Rodham Clinton Sanders : ' I fully expect '

MIN ACTIVATIONS
MIN = -2.123

tweets are sometimes deliberately provocative , often made in j est
that my tweets are sometimes deliberately provocative , often made in
Get The Times of Israel 's Daily Edition by email and
© 2019 The Washington Times , LLC . Click here for
state for free . Any contribution to help fund our work
more all for only $ 19 . 99 ...
© 2019 The Washington Times , LLC . Click here for
aii Congress woman Tuls i G abb ard is among key
Policy or the London School of Economics . ________________________________
government faces aboriginal opposition to Kinder Morgan ’ s Trans
<|endoftext|> Facebook Twitter Reddit Google + Go to the
<|endoftext|> B ud we iser Cly des d ales make deliveries
president told Fox News host Laura Ing raham that initial funding
adventure set during the Nap ole onic Wars .
Note : This article gives the views of the author ,
before commenting . Note : This article gives the
, New Jersey Republican Governor Chris Christie carried 51 % of
But please note that my tweets are sometimes deliberately provocative ,
here . But please note that my tweets are sometimes deliberately
can check out my Twitter feed by going here . But

INTERVAL 1.228 - 2.344
CONTAINS 0.041%

We rate this Pants on Fire . <|endoftext|> I
second edition of my 1993 book , Re claim ing the
v ald Bo ass on H agen ( Sky Pro cy
handed ' 2016 primary Sanders : ' Damn right ' I
Kim summit Manafort 's attorneys say he should get less than

INTERVAL 0.111 - 1.228
CONTAINS 26.762%

areas over which the Commonwealth does not have legislative power .
after the fact ? Like how wal r uses were some
, departing from Manchester on December 23 ; and the five
roph ysics and his coll agues have found that Kepler -
not find it engaging in the least . For

INTERVAL -1.006 - 0.111
CONTAINS 73.136%

. And that ’ s it ! Hopefully this wasn
itself has seemed relatively quiet of late despite the fact that
We ’ re not acknowledging that we don ’
Catalan pro - independence flag - the est el ada -
who has repeatedly condemned trade deals he believes are bad for

INTERVAL -2.123 - -1.006
CONTAINS 0.060%

- March on a United Launch Alliance Atlas 5 , but
slide Power Point slide PNG larger image
delivered Cleveland its long - s ought NFL Championship in 1945
information . <|endoftext|> Power BI has come a long
and never miss our top stories Free Sign Up
Token Man
Token ID1869
Feature activation-2.81e-01

Pos logit contributions
jri+0.661
urgical+0.613
 disg+0.549
warts+0.538
 Generations+0.529
Neg logit contributions
iaz-0.554
••••-0.470
abase-0.443
paralle-0.416
••-0.398
Tokené
Token ID2634
Feature activation+3.16e-02

Pos logit contributions
jri+0.255
urgical+0.238
 disg+0.213
warts+0.210
 Ended+0.206
Neg logit contributions
iaz-0.196
••••-0.164
abase-0.154
paralle-0.144
••-0.137
Token,
Token ID11
Feature activation-4.03e-01

Pos logit contributions
iaz+0.024
••••+0.020
abase+0.019
paralle+0.018
••+0.017
Neg logit contributions
jri-0.027
urgical-0.025
 disg-0.022
warts-0.022
 Ended-0.022
Token Adam
Token ID7244
Feature activation+4.89e-01

Pos logit contributions
jri+0.308
urgical+0.282
 disg+0.248
warts+0.242
 Generations+0.238
Neg logit contributions
iaz-0.339
••••-0.294
abase-0.280
paralle-0.265
••-0.255
Token L
Token ID406
Feature activation+9.62e-01

Pos logit contributions
iaz+0.401
••••+0.348
abase+0.330
paralle+0.315
••+0.299
Neg logit contributions
jri-0.381
urgical-0.351
 disg-0.309
warts-0.303
 Generations-0.298
Tokenall
Token ID439
Feature activation+2.34e+00

Pos logit contributions
iaz+0.839
••••+0.745
abase+0.705
paralle+0.679
••+0.650
Neg logit contributions
jri-0.691
urgical-0.635
 disg-0.545
warts-0.536
 Ended-0.523
Tokenana
Token ID2271
Feature activation+4.45e-03

Pos logit contributions
iaz+1.378
••••+1.217
abase+1.074
paralle+1.035
••+1.008
Neg logit contributions
jri-2.274
urgical-2.200
 disg-1.942
warts-1.936
 Ended-1.905
Token and
Token ID290
Feature activation-2.93e-01

Pos logit contributions
iaz+0.003
••••+0.003
abase+0.003
paralle+0.003
••+0.002
Neg logit contributions
jri-0.004
urgical-0.004
 disg-0.003
warts-0.003
 Ended-0.003
Token Jordan
Token ID8078
Feature activation+3.99e-02

Pos logit contributions
jri+0.225
urgical+0.206
 disg+0.181
warts+0.177
 Ended+0.174
Neg logit contributions
iaz-0.245
••••-0.213
abase-0.202
paralle-0.191
••-0.184
Token Henderson
Token ID22016
Feature activation+7.59e-02

Pos logit contributions
iaz+0.034
••••+0.029
abase+0.028
paralle+0.026
••+0.025
Neg logit contributions
jri-0.030
urgical-0.028
 disg-0.024
warts-0.024
 Ended-0.024
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
iaz+0.057
••••+0.048
abase+0.045
paralle+0.043
••+0.041
Neg logit contributions
jri-0.065
urgical-0.060
 disg-0.054
warts-0.053
 Ended-0.052
Token destruction
Token ID8166
Feature activation-3.38e-01

Pos logit contributions
iaz+0.104
••••+0.091
abase+0.087
paralle+0.083
••+0.080
Neg logit contributions
jri-0.084
urgical-0.077
 disg-0.067
warts-0.065
 Generations-0.064
Token.
Token ID13
Feature activation-4.03e-02

Pos logit contributions
jri+0.291
urgical+0.270
 disg+0.241
warts+0.237
 Generations+0.233
Neg logit contributions
iaz-0.251
••••-0.213
abase-0.201
paralle-0.189
••-0.180
Token<|endoftext|>
Token ID50256
Feature activation-9.28e-02

Pos logit contributions
jri+0.034
urgical+0.032
 disg+0.028
warts+0.028
 Generations+0.027
Neg logit contributions
iaz-0.030
••••-0.026
abase-0.024
paralle-0.023
••-0.022
TokenNow
Token ID3844
Feature activation+8.87e-02

Pos logit contributions
jri+0.079
urgical+0.073
 disg+0.065
warts+0.064
 Ended+0.063
Neg logit contributions
iaz-0.070
••••-0.059
abase-0.056
paralle-0.052
••-0.050
Token playing
Token ID2712
Feature activation-5.33e-01

Pos logit contributions
iaz+0.070
••••+0.060
abase+0.057
paralle+0.054
••+0.052
Neg logit contributions
jri-0.072
urgical-0.067
 disg-0.059
warts-0.058
 Generations-0.057
Token:
Token ID25
Feature activation+2.32e+00

Pos logit contributions
jri+0.474
urgical+0.440
 disg+0.396
warts+0.388
 Ended+0.381
Neg logit contributions
iaz-0.381
••••-0.321
abase-0.303
paralle-0.282
••-0.270
Token Watch
Token ID6305
Feature activation-6.76e-01

Pos logit contributions
iaz+1.386
••••+1.131
abase+1.058
paralle+1.033
••+0.935
Neg logit contributions
jri-2.322
urgical-2.217
 disg-1.984
 Ended-1.966
 Generations-1.963
Token this
Token ID428
Feature activation-5.08e-02

Pos logit contributions
jri+0.572
urgical+0.528
 disg+0.472
warts+0.462
 Ended+0.453
Neg logit contributions
iaz-0.513
••••-0.437
abase-0.413
paralle-0.388
••-0.373
Token:
Token ID25
Feature activation+1.80e-01

Pos logit contributions
jri+0.045
urgical+0.042
 disg+0.037
warts+0.036
 Generations+0.036
Neg logit contributions
iaz-0.037
••••-0.031
abase-0.029
paralle-0.028
••-0.026
Token US
Token ID1294
Feature activation+1.23e-01

Pos logit contributions
iaz+0.141
••••+0.121
abase+0.115
paralle+0.108
••+0.103
Neg logit contributions
jri-0.147
urgical-0.136
 disg-0.121
warts-0.118
 Generations-0.117
Token Army
Token ID5407
Feature activation+1.10e-01

Pos logit contributions
iaz+0.100
••••+0.086
abase+0.082
paralle+0.077
••+0.074
Neg logit contributions
jri-0.097
urgical-0.089
 disg-0.079
warts-0.077
 Ended-0.076
Token Fund
Token ID7557
Feature activation-2.17e-01

Pos logit contributions
iaz+0.019
••••+0.016
abase+0.016
paralle+0.015
••+0.014
Neg logit contributions
jri-0.015
urgical-0.014
 disg-0.012
warts-0.012
 Ended-0.012
Token chief
Token ID4039
Feature activation+1.60e-01

Pos logit contributions
jri+0.168
urgical+0.155
 disg+0.136
warts+0.133
 Ended+0.131
Neg logit contributions
iaz-0.179
••••-0.155
abase-0.147
paralle-0.139
••-0.134
Token Domin
Token ID11817
Feature activation-2.94e-02

Pos logit contributions
iaz+0.138
••••+0.120
abase+0.115
paralle+0.109
••+0.105
Neg logit contributions
jri-0.118
urgical-0.108
 disg-0.094
warts-0.092
 Ended-0.090
Tokenique
Token ID2350
Feature activation-2.88e-01

Pos logit contributions
jri+0.018
urgical+0.016
 disg+0.014
warts+0.013
 Ended+0.013
Neg logit contributions
iaz-0.029
••••-0.026
abase-0.025
paralle-0.024
••-0.023
Token Strauss
Token ID42971
Feature activation+1.06e+00

Pos logit contributions
jri+0.192
urgical+0.174
 disg+0.150
warts+0.146
 Ended+0.143
Neg logit contributions
iaz-0.269
••••-0.237
abase-0.227
paralle-0.216
••-0.209
Token-
Token ID12
Feature activation+2.32e+00

Pos logit contributions
iaz+0.787
••••+0.671
abase+0.642
paralle+0.605
••+0.572
Neg logit contributions
jri-0.914
urgical-0.844
 disg-0.757
warts-0.741
 Ended-0.737
TokenK
Token ID42
Feature activation+6.75e-01

Pos logit contributions
iaz+1.669
••••+1.443
abase+1.388
paralle+1.318
••+1.223
Neg logit contributions
jri-2.020
urgical-1.868
warts-1.655
 disg-1.637
 Generations-1.615
Tokenahn
Token ID15386
Feature activation+1.57e-01

Pos logit contributions
iaz+0.462
••••+0.396
abase+0.364
paralle+0.345
••+0.330
Neg logit contributions
jri-0.618
urgical-0.572
warts-0.510
 disg-0.510
 Ended-0.495
Token.
Token ID13
Feature activation-9.98e-02

Pos logit contributions
iaz+0.117
••••+0.100
abase+0.094
paralle+0.089
••+0.085
Neg logit contributions
jri-0.135
urgical-0.125
 disg-0.112
warts-0.110
 Generations-0.108
Token<|endoftext|>
Token ID50256
Feature activation-8.94e-02

Pos logit contributions
jri+0.086
urgical+0.080
 disg+0.071
warts+0.070
 Generations+0.069
Neg logit contributions
iaz-0.074
••••-0.063
abase-0.059
paralle-0.056
••-0.053
TokenSAN
Token ID36753
Feature activation-2.97e-01

Pos logit contributions
jri+0.076
urgical+0.071
 disg+0.063
warts+0.062
 Ended+0.061
Neg logit contributions
iaz-0.067
••••-0.057
abase-0.054
paralle-0.051
••-0.048
Token as
Token ID355
Feature activation-9.43e-02

Pos logit contributions
jri+0.090
urgical+0.083
 disg+0.074
warts+0.072
 Generations+0.071
Neg logit contributions
iaz-0.083
••••-0.070
abase-0.067
paralle-0.063
••-0.060
Token Slav
Token ID34314
Feature activation-5.23e-02

Pos logit contributions
jri+0.077
urgical+0.071
 disg+0.063
warts+0.062
 Generations+0.061
Neg logit contributions
iaz-0.074
••••-0.064
abase-0.060
paralle-0.057
••-0.054
Tokenoj
Token ID13210
Feature activation+1.45e+00

Pos logit contributions
jri+0.041
urgical+0.038
 disg+0.034
warts+0.033
 Generations+0.032
Neg logit contributions
iaz-0.042
••••-0.037
abase-0.035
paralle-0.033
••-0.032
Token
Token ID25370
Feature activation+5.89e-01

Pos logit contributions
iaz+1.200
••••+1.048
abase+1.000
paralle+0.947
phant+0.909
Neg logit contributions
jri-1.101
urgical-1.027
 disg-0.901
warts-0.889
 Ended-0.879
Token\xbd
Token ID121
Feature activation+1.38e-01

Pos logit contributions
iaz+0.464
••••+0.403
abase+0.382
paralle+0.358
••+0.346
Neg logit contributions
jri-0.471
urgical-0.441
 disg-0.388
warts-0.383
 Ended-0.377
Tokeni
Token ID72
Feature activation+2.31e+00

Pos logit contributions
iaz+0.099
••••+0.084
abase+0.079
paralle+0.074
••+0.070
Neg logit contributions
jri-0.122
urgical-0.113
 disg-0.101
warts-0.099
 Ended-0.098
Token\xc5
Token ID129
Feature activation+1.31e+00

Pos logit contributions
iaz+1.680
••••+1.446
abase+1.367
paralle+1.329
phant+1.209
Neg logit contributions
jri-2.033
urgical-1.859
warts-1.660
 disg-1.627
 Ended-1.625
Token\xbe
Token ID122
Feature activation+1.43e+00

Pos logit contributions
iaz+0.856
••••+0.712
abase+0.675
paralle+0.609
••+0.579
Neg logit contributions
jri-1.232
urgical-1.165
 disg-1.048
warts-1.044
 Ended-1.033
Tokenek
Token ID988
Feature activation+3.58e-01

Pos logit contributions
iaz+1.013
••••+0.874
abase+0.818
paralle+0.772
••+0.735
Neg logit contributions
jri-1.244
urgical-1.174
 disg-1.043
warts-1.034
 Ended-1.022
Token has
Token ID468
Feature activation-6.01e-02

Pos logit contributions
iaz+0.280
••••+0.240
abase+0.228
paralle+0.214
••+0.205
Neg logit contributions
jri-0.294
urgical-0.272
 disg-0.241
warts-0.237
 Generations-0.232
Token said
Token ID531
Feature activation+3.19e-01

Pos logit contributions
jri+0.046
urgical+0.043
 disg+0.038
warts+0.037
 Ended+0.036
Neg logit contributions
iaz-0.050
••••-0.043
abase-0.041
paralle-0.039
••-0.037
Token\n
Token ID198
Feature activation-1.57e-01

Pos logit contributions
jri+0.040
urgical+0.037
 disg+0.033
warts+0.033
 Ended+0.032
Neg logit contributions
iaz-0.036
••••-0.030
abase-0.029
paralle-0.027
••-0.026
TokenD
Token ID35
Feature activation+3.61e-01

Pos logit contributions
jri+0.131
urgical+0.121
 disg+0.107
warts+0.105
 Ended+0.103
Neg logit contributions
iaz-0.121
••••-0.103
abase-0.098
paralle-0.092
••-0.088
Tokenahl
Token ID15668
Feature activation+6.76e-01

Pos logit contributions
iaz+0.275
••••+0.237
abase+0.224
paralle+0.211
••+0.201
Neg logit contributions
jri-0.303
urgical-0.280
 disg-0.249
warts-0.244
 Ended-0.239
Tokenia
Token ID544
Feature activation+3.37e-03

Pos logit contributions
iaz+0.400
••••+0.328
abase+0.304
paralle+0.281
••+0.262
Neg logit contributions
jri-0.681
urgical-0.641
 disg-0.580
warts-0.576
 Ended-0.566
Token Lith
Token ID19730
Feature activation+2.21e-02

Pos logit contributions
iaz+0.002
••••+0.002
abase+0.002
paralle+0.001
••+0.001
Neg logit contributions
jri-0.003
urgical-0.003
 disg-0.003
warts-0.003
 Generations-0.003
Tokenwick
Token ID16239
Feature activation+2.30e+00

Pos logit contributions
iaz+0.022
••••+0.020
abase+0.019
paralle+0.018
••+0.017
Neg logit contributions
jri-0.013
urgical-0.012
 disg-0.010
warts-0.010
 Generations-0.010
Token has
Token ID468
Feature activation+2.67e-01

Pos logit contributions
iaz+1.878
••••+1.638
abase+1.577
paralle+1.463
phant+1.399
Neg logit contributions
jri-1.801
urgical-1.643
 disg-1.441
 Generations-1.435
warts-1.429
Token a
Token ID257
Feature activation+3.14e-01

Pos logit contributions
iaz+0.213
••••+0.182
abase+0.173
paralle+0.163
••+0.157
Neg logit contributions
jri-0.215
urgical-0.199
 disg-0.177
warts-0.172
 Generations-0.170
Token rundown
Token ID45602
Feature activation+6.80e-01

Pos logit contributions
iaz+0.259
••••+0.223
abase+0.212
paralle+0.201
••+0.192
Neg logit contributions
jri-0.246
urgical-0.226
 disg-0.200
warts-0.195
 Generations-0.192
Token of
Token ID286
Feature activation-3.33e-02

Pos logit contributions
iaz+0.523
••••+0.447
abase+0.427
paralle+0.399
phant+0.381
Neg logit contributions
jri-0.562
urgical-0.523
 disg-0.462
warts-0.456
 Generations-0.449
Token the
Token ID262
Feature activation+6.95e-02

Pos logit contributions
jri+0.027
urgical+0.025
 disg+0.022
warts+0.022
 Generations+0.022
Neg logit contributions
iaz-0.026
••••-0.022
abase-0.021
paralle-0.020
••-0.019
Token Scott
Token ID4746
Feature activation-9.78e-01

Pos logit contributions
jri+0.564
urgical+0.524
 disg+0.471
warts+0.462
 Generations+0.454
Neg logit contributions
iaz-0.438
••••-0.369
abase-0.346
paralle-0.324
••-0.309
Token Ric
Token ID15868
Feature activation-1.58e-01

Pos logit contributions
jri+0.741
urgical+0.671
 disg+0.591
warts+0.578
 Ended+0.563
Neg logit contributions
iaz-0.834
••••-0.728
abase-0.690
paralle-0.656
••-0.632
Tokenher
Token ID372
Feature activation+6.14e-02

Pos logit contributions
jri+0.125
urgical+0.115
 disg+0.101
warts+0.099
 Ended+0.097
Neg logit contributions
iaz-0.129
••••-0.112
abase-0.106
paralle-0.100
••-0.096
Tokent
Token ID83
Feature activation+3.02e-01

Pos logit contributions
iaz+0.048
••••+0.041
abase+0.039
paralle+0.036
••+0.035
Neg logit contributions
jri-0.051
urgical-0.047
 disg-0.042
warts-0.041
 Generations-0.040
Token and
Token ID290
Feature activation+2.92e-01

Pos logit contributions
iaz+0.230
••••+0.197
abase+0.186
paralle+0.175
••+0.168
Neg logit contributions
jri-0.253
urgical-0.235
 disg-0.209
warts-0.206
 Ended-0.202
Token David
Token ID3271
Feature activation+2.21e+00

Pos logit contributions
iaz+0.231
••••+0.197
abase+0.189
paralle+0.177
••+0.170
Neg logit contributions
jri-0.235
urgical-0.219
 disg-0.195
warts-0.191
 Generations-0.189
Token Gordon
Token ID11646
Feature activation-6.22e-02

Pos logit contributions
iaz+1.825
••••+1.570
abase+1.515
paralle+1.406
phant+1.375
Neg logit contributions
jri-1.679
urgical-1.557
 Generations-1.376
warts-1.369
 disg-1.367
Token (
Token ID357
Feature activation+1.57e-01

Pos logit contributions
jri+0.055
urgical+0.051
 disg+0.046
warts+0.045
 Generations+0.044
Neg logit contributions
iaz-0.045
••••-0.038
abase-0.036
paralle-0.033
••-0.032
TokenIS
Token ID1797
Feature activation+7.68e-02

Pos logit contributions
iaz+0.137
••••+0.119
abase+0.114
paralle+0.108
••+0.104
Neg logit contributions
jri-0.114
urgical-0.105
 disg-0.092
warts-0.090
 Ended-0.087
TokenI
Token ID40
Feature activation+6.63e-01

Pos logit contributions
iaz+0.057
••••+0.049
abase+0.046
paralle+0.043
••+0.041
Neg logit contributions
jri-0.065
urgical-0.061
 disg-0.054
warts-0.053
 Generations-0.052
Token Books
Token ID13661
Feature activation-1.47e-01

Pos logit contributions
iaz+0.663
••••+0.589
abase+0.567
paralle+0.546
••+0.526
Neg logit contributions
jri-0.392
urgical-0.360
 disg-0.299
warts-0.293
 Generations-0.287
Token a
Token ID257
Feature activation+1.47e+00

Pos logit contributions
iaz+0.781
••••+0.675
abase+0.653
paralle+0.609
phant+0.575
Neg logit contributions
jri-0.767
urgical-0.701
 disg-0.634
warts-0.624
 Generations-0.607
Token--
Token ID438
Feature activation+8.78e-01

Pos logit contributions
iaz+1.322
••••+1.144
abase+1.118
paralle+1.074
••+1.009
Neg logit contributions
jri-1.018
urgical-0.917
 disg-0.800
warts-0.787
 Generations-0.770
Tokenholes
Token ID28439
Feature activation+4.67e-01

Pos logit contributions
iaz+0.526
••••+0.428
abase+0.398
paralle+0.366
••+0.339
Neg logit contributions
jri-0.887
urgical-0.825
 disg-0.757
warts-0.749
 Generations-0.732
Token in
Token ID287
Feature activation+7.84e-01

Pos logit contributions
iaz+0.307
••••+0.257
abase+0.240
paralle+0.223
phant+0.211
Neg logit contributions
jri-0.441
urgical-0.410
 disg-0.370
warts-0.366
 Ended-0.361
Token American
Token ID1605
Feature activation+1.07e+00

Pos logit contributions
iaz+0.621
••••+0.533
abase+0.510
paralle+0.479
••+0.455
Neg logit contributions
jri-0.636
urgical-0.585
 disg-0.520
warts-0.508
 Ended-0.500
Token politics
Token ID4819
Feature activation+2.21e+00

Pos logit contributions
iaz+0.921
••••+0.802
abase+0.768
paralle+0.731
••+0.700
Neg logit contributions
jri-0.800
urgical-0.729
 disg-0.635
warts-0.623
 Generations-0.611
Token'
Token ID6
Feature activation+1.30e+00

Pos logit contributions
iaz+1.517
••••+1.301
abase+1.249
paralle+1.176
••+1.086
Neg logit contributions
jri-2.002
urgical-1.838
warts-1.650
 disg-1.633
 Ended-1.616
Token MORE
Token ID12011
Feature activation-1.22e-01

Pos logit contributions
iaz+1.021
••••+0.874
abase+0.852
paralle+0.791
phant+0.756
Neg logit contributions
jri-1.037
urgical-0.972
 Ended-0.862
 disg-0.854
warts-0.848
Token,
Token ID11
Feature activation+6.75e-03

Pos logit contributions
jri+0.103
urgical+0.096
 disg+0.085
warts+0.084
 Generations+0.082
Neg logit contributions
iaz-0.092
••••-0.078
abase-0.074
paralle-0.069
••-0.066
Token told
Token ID1297
Feature activation-3.87e-02

Pos logit contributions
iaz+0.005
••••+0.005
abase+0.004
paralle+0.004
••+0.004
Neg logit contributions
jri-0.005
urgical-0.005
 disg-0.004
warts-0.004
 Ended-0.004
Token 77
Token ID8541
Feature activation-4.52e-01

Pos logit contributions
jri+0.030
urgical+0.028
 disg+0.025
warts+0.024
 Generations+0.024
Neg logit contributions
iaz-0.032
••••-0.028
abase-0.026
paralle-0.025
••-0.024
Token a
Token ID257
Feature activation+1.27e+00

Pos logit contributions
iaz+0.742
••••+0.643
abase+0.621
paralle+0.581
phant+0.550
Neg logit contributions
jri-0.706
urgical-0.645
 disg-0.581
warts-0.573
 Generations-0.555
Token--
Token ID438
Feature activation+9.17e-01

Pos logit contributions
iaz+1.193
••••+1.041
abase+1.015
paralle+0.976
••+0.922
Neg logit contributions
jri-0.841
urgical-0.755
 disg-0.653
warts-0.641
 Generations-0.625
Tokenholes
Token ID28439
Feature activation+5.58e-01

Pos logit contributions
iaz+0.546
••••+0.443
abase+0.410
paralle+0.377
••+0.349
Neg logit contributions
jri-0.932
urgical-0.867
 disg-0.797
warts-0.788
 Generations-0.770
Token in
Token ID287
Feature activation+7.87e-01

Pos logit contributions
iaz+0.369
••••+0.309
abase+0.289
paralle+0.269
phant+0.254
Neg logit contributions
jri-0.526
urgical-0.489
 disg-0.442
warts-0.436
 Ended-0.430
Token American
Token ID1605
Feature activation+9.42e-01

Pos logit contributions
iaz+0.618
••••+0.531
abase+0.508
paralle+0.477
••+0.453
Neg logit contributions
jri-0.642
urgical-0.592
 disg-0.527
warts-0.515
 Ended-0.506
Token politics
Token ID4819
Feature activation+2.18e+00

Pos logit contributions
iaz+0.802
••••+0.698
abase+0.668
paralle+0.634
••+0.610
Neg logit contributions
jri-0.707
urgical-0.646
 disg-0.564
warts-0.552
 Generations-0.541
Token'
Token ID6
Feature activation+1.11e+00

Pos logit contributions
iaz+1.500
••••+1.286
abase+1.233
paralle+1.163
••+1.074
Neg logit contributions
jri-1.979
urgical-1.819
warts-1.634
 disg-1.616
 Ended-1.596
Token MORE
Token ID12011
Feature activation-5.90e-02

Pos logit contributions
iaz+0.863
••••+0.742
abase+0.717
paralle+0.666
phant+0.637
Neg logit contributions
jri-0.896
urgical-0.835
 Ended-0.737
 disg-0.736
warts-0.730
Token as
Token ID355
Feature activation+6.54e-02

Pos logit contributions
jri+0.050
urgical+0.047
 disg+0.042
warts+0.041
 Generations+0.040
Neg logit contributions
iaz-0.044
••••-0.038
abase-0.036
paralle-0.033
••-0.032
Token a
Token ID257
Feature activation-1.07e-01

Pos logit contributions
iaz+0.053
••••+0.046
abase+0.043
paralle+0.041
••+0.039
Neg logit contributions
jri-0.052
urgical-0.048
 disg-0.042
warts-0.041
 Generations-0.041
Token 68
Token ID8257
Feature activation-1.19e-02

Pos logit contributions
jri+0.081
urgical+0.074
 disg+0.065
warts+0.063
 Generations+0.062
Neg logit contributions
iaz-0.092
••••-0.080
abase-0.076
paralle-0.072
••-0.069
Token a
Token ID257
Feature activation+1.23e+00

Pos logit contributions
iaz+0.776
••••+0.678
abase+0.656
paralle+0.615
phant+0.584
Neg logit contributions
jri-0.683
urgical-0.621
 disg-0.557
warts-0.548
 Generations-0.530
Token--
Token ID438
Feature activation+8.35e-01

Pos logit contributions
iaz+1.144
••••+0.996
abase+0.970
paralle+0.932
••+0.881
Neg logit contributions
jri-0.819
urgical-0.738
 disg-0.639
warts-0.628
 Generations-0.611
Tokenholes
Token ID28439
Feature activation+4.16e-01

Pos logit contributions
iaz+0.492
••••+0.399
abase+0.369
paralle+0.339
••+0.314
Neg logit contributions
jri-0.852
urgical-0.794
 disg-0.729
warts-0.720
 Generations-0.704
Token in
Token ID287
Feature activation+7.30e-01

Pos logit contributions
iaz+0.274
••••+0.230
abase+0.215
paralle+0.200
phant+0.189
Neg logit contributions
jri-0.392
urgical-0.365
 disg-0.330
warts-0.325
 Ended-0.320
Token American
Token ID1605
Feature activation+9.97e-01

Pos logit contributions
iaz+0.577
••••+0.495
abase+0.472
paralle+0.444
••+0.422
Neg logit contributions
jri-0.594
urgical-0.548
 disg-0.488
warts-0.475
 Ended-0.468
Token politics
Token ID4819
Feature activation+2.15e+00

Pos logit contributions
iaz+0.818
••••+0.706
abase+0.674
paralle+0.640
••+0.612
Neg logit contributions
jri-0.780
urgical-0.714
 disg-0.628
warts-0.617
 Generations-0.606
Token'
Token ID6
Feature activation+1.07e+00

Pos logit contributions
iaz+1.483
••••+1.269
abase+1.223
paralle+1.150
••+1.064
Neg logit contributions
jri-1.944
urgical-1.794
warts-1.607
 disg-1.595
 Ended-1.573
Token MORE
Token ID12011
Feature activation+2.15e-02

Pos logit contributions
iaz+0.834
••••+0.718
abase+0.695
paralle+0.646
phant+0.619
Neg logit contributions
jri-0.862
urgical-0.805
 Ended-0.708
 disg-0.707
warts-0.702
Token completed
Token ID5668
Feature activation-9.87e-02

Pos logit contributions
iaz+0.017
••••+0.014
abase+0.014
paralle+0.013
••+0.012
Neg logit contributions
jri-0.018
urgical-0.016
 disg-0.014
warts-0.014
 Ended-0.014
Token a
Token ID257
Feature activation+8.48e-02

Pos logit contributions
jri+0.081
urgical+0.075
 disg+0.066
warts+0.065
 Generations+0.064
Neg logit contributions
iaz-0.077
••••-0.066
abase-0.063
paralle-0.059
••-0.057
Token swing
Token ID9628
Feature activation-4.80e-01

Pos logit contributions
iaz+0.070
••••+0.061
abase+0.058
paralle+0.054
••+0.052
Neg logit contributions
jri-0.066
urgical-0.061
 disg-0.053
warts-0.052
 Generations-0.051
Token Cant
Token ID16821
Feature activation-3.05e-01

Pos logit contributions
iaz+0.269
••••+0.231
abase+0.219
paralle+0.207
••+0.198
Neg logit contributions
jri-0.283
urgical-0.261
 disg-0.231
warts-0.228
 Ended-0.226
Tokenwell
Token ID4053
Feature activation+3.50e-01

Pos logit contributions
jri+0.241
urgical+0.222
 disg+0.196
warts+0.192
 Ended+0.188
Neg logit contributions
iaz-0.247
••••-0.213
abase-0.203
paralle-0.191
••-0.184
Token (
Token ID357
Feature activation+1.94e-01

Pos logit contributions
iaz+0.254
••••+0.216
abase+0.204
paralle+0.190
••+0.182
Neg logit contributions
jri-0.307
urgical-0.284
 disg-0.254
warts-0.250
 Generations-0.246
TokenTeam
Token ID15592
Feature activation+5.46e-02

Pos logit contributions
iaz+0.145
••••+0.123
abase+0.117
paralle+0.109
••+0.104
Neg logit contributions
jri-0.166
urgical-0.153
 disg-0.137
warts-0.134
 Ended-0.132
Token Sax
Token ID29242
Feature activation+1.19e+00

Pos logit contributions
iaz+0.052
••••+0.046
abase+0.044
paralle+0.042
••+0.041
Neg logit contributions
jri-0.036
urgical-0.032
 disg-0.028
warts-0.027
 Ended-0.026
Tokeno
Token ID78
Feature activation+2.14e+00

Pos logit contributions
iaz+0.992
••••+0.860
abase+0.813
paralle+0.770
••+0.745
Neg logit contributions
jri-0.923
urgical-0.854
 disg-0.745
warts-0.735
 Ended-0.726
Token Bank
Token ID5018
Feature activation+7.10e-01

Pos logit contributions
iaz+1.496
••••+1.253
abase+1.185
paralle+1.111
••+1.070
Neg logit contributions
jri-1.906
urgical-1.773
 disg-1.594
 Generations-1.578
warts-1.577
Token)
Token ID8
Feature activation+4.04e-01

Pos logit contributions
iaz+0.547
••••+0.470
abase+0.442
paralle+0.418
••+0.400
Neg logit contributions
jri-0.592
urgical-0.546
 disg-0.487
warts-0.477
 Generations-0.471
Token s
Token ID264
Feature activation-1.75e-01

Pos logit contributions
iaz+0.341
••••+0.295
abase+0.282
paralle+0.267
••+0.257
Neg logit contributions
jri-0.307
urgical-0.280
 disg-0.247
warts-0.241
 Ended-0.238
Token.
Token ID13
Feature activation+7.20e-02

Pos logit contributions
jri+0.153
urgical+0.142
 disg+0.127
warts+0.125
 Ended+0.123
Neg logit contributions
iaz-0.127
••••-0.108
abase-0.102
paralle-0.095
••-0.091
Tokent
Token ID83
Feature activation+2.51e-02

Pos logit contributions
iaz+0.056
••••+0.048
abase+0.045
paralle+0.042
••+0.041
Neg logit contributions
jri-0.060
urgical-0.055
 disg-0.049
warts-0.048
 Ended-0.047
Token I
Token ID314
Feature activation+4.56e-01

Pos logit contributions
iaz+0.754
••••+0.652
abase+0.631
paralle+0.588
phant+0.572
Neg logit contributions
jri-0.727
urgical-0.666
 disg-0.591
 Generations-0.576
warts-0.574
Token'll
Token ID1183
Feature activation-9.17e-02

Pos logit contributions
iaz+0.386
••••+0.336
abase+0.324
paralle+0.306
••+0.292
Neg logit contributions
jri-0.343
urgical-0.315
 disg-0.277
warts-0.268
 Ended-0.265
Token make
Token ID787
Feature activation+6.30e-01

Pos logit contributions
jri+0.066
urgical+0.060
 disg+0.052
warts+0.051
 Generations+0.050
Neg logit contributions
iaz-0.081
••••-0.071
abase-0.068
paralle-0.064
••-0.062
Token the
Token ID262
Feature activation+7.24e-01

Pos logit contributions
iaz+0.499
••••+0.430
abase+0.409
paralle+0.384
phant+0.370
Neg logit contributions
jri-0.510
urgical-0.472
 disg-0.416
warts-0.408
 Generations-0.401
Token large
Token ID1588
Feature activation+1.03e+00

Pos logit contributions
iaz+0.633
••••+0.554
abase+0.533
paralle+0.504
phant+0.482
Neg logit contributions
jri-0.529
urgical-0.484
 disg-0.419
warts-0.407
 Ended-0.401
Token corporations
Token ID10225
Feature activation+2.04e+00

Pos logit contributions
iaz+0.925
••••+0.814
abase+0.790
paralle+0.751
phant+0.716
Neg logit contributions
jri-0.728
urgical-0.662
 disg-0.578
warts-0.557
 Ended-0.555
Token pay
Token ID1414
Feature activation-2.03e-01

Pos logit contributions
iaz+1.569
••••+1.359
abase+1.332
paralle+1.229
phant+1.181
Neg logit contributions
jri-1.678
urgical-1.526
 disg-1.393
warts-1.335
 Ended-1.331
Token '
Token ID705
Feature activation-1.06e+00

Pos logit contributions
jri+0.173
urgical+0.160
 disg+0.143
warts+0.140
 Ended+0.138
Neg logit contributions
iaz-0.152
••••-0.129
abase-0.123
paralle-0.115
••-0.110
Tokenfair
Token ID22043
Feature activation-1.08e+00

Pos logit contributions
jri+0.829
urgical+0.762
 disg+0.676
warts+0.656
 Generations+0.646
Neg logit contributions
iaz-0.872
••••-0.754
abase-0.717
paralle-0.678
••-0.655
Token share
Token ID2648
Feature activation-1.43e-01

Pos logit contributions
jri+0.895
urgical+0.825
 disg+0.728
warts+0.723
 Generations+0.704
Neg logit contributions
iaz-0.842
••••-0.722
abase-0.675
paralle-0.641
••-0.622
Token of
Token ID286
Feature activation-8.00e-01

Pos logit contributions
jri+0.137
urgical+0.128
 disg+0.116
warts+0.114
 Ended+0.113
Neg logit contributions
iaz-0.093
••••-0.077
abase-0.072
paralle-0.066
••-0.063
Token first
Token ID717
Feature activation-3.63e-02

Pos logit contributions
jri+0.289
urgical+0.266
 disg+0.235
warts+0.230
 Ended+0.226
Neg logit contributions
iaz-0.298
••••-0.257
abase-0.244
paralle-0.231
••-0.222
Token specimens
Token ID27569
Feature activation+2.12e-02

Pos logit contributions
jri+0.025
urgical+0.023
 disg+0.020
warts+0.019
 Generations+0.019
Neg logit contributions
iaz-0.033
••••-0.029
abase-0.028
paralle-0.026
••-0.025
Token in
Token ID287
Feature activation-1.57e-01

Pos logit contributions
iaz+0.016
••••+0.014
abase+0.013
paralle+0.012
••+0.012
Neg logit contributions
jri-0.018
urgical-0.016
 disg-0.015
warts-0.014
 Generations-0.014
Token southwestern
Token ID46065
Feature activation-1.26e-01

Pos logit contributions
jri+0.129
urgical+0.119
 disg+0.106
warts+0.103
 Generations+0.102
Neg logit contributions
iaz-0.123
••••-0.105
abase-0.100
paralle-0.094
••-0.090
Token Arn
Token ID16644
Feature activation+3.18e-01

Pos logit contributions
jri+0.101
urgical+0.094
 disg+0.083
warts+0.081
 Generations+0.080
Neg logit contributions
iaz-0.101
••••-0.087
abase-0.082
paralle-0.077
••-0.074
Tokenhem
Token ID4411
Feature activation+2.01e+00

Pos logit contributions
iaz+0.367
••••+0.333
abase+0.320
paralle+0.309
••+0.303
Neg logit contributions
jri-0.142
urgical-0.123
 disg-0.095
warts-0.092
 Ended-0.088
Token Land
Token ID6379
Feature activation-8.58e-02

Pos logit contributions
iaz+1.404
••••+1.187
abase+1.124
paralle+1.050
••+1.000
Neg logit contributions
jri-1.787
urgical-1.686
 disg-1.512
warts-1.483
 Ended-1.466
Token.
Token ID13
Feature activation-7.41e-02

Pos logit contributions
jri+0.073
urgical+0.068
 disg+0.061
warts+0.060
 Generations+0.059
Neg logit contributions
iaz-0.064
••••-0.055
abase-0.052
paralle-0.048
••-0.046
Token Another
Token ID6023
Feature activation-1.36e-01

Pos logit contributions
jri+0.063
urgical+0.059
 disg+0.052
warts+0.051
 Generations+0.051
Neg logit contributions
iaz-0.055
••••-0.047
abase-0.045
paralle-0.042
••-0.040
Token common
Token ID2219
Feature activation-1.12e-01

Pos logit contributions
jri+0.099
urgical+0.090
 disg+0.078
warts+0.077
 Generations+0.075
Neg logit contributions
iaz-0.120
••••-0.105
abase-0.100
paralle-0.095
••-0.091
Token name
Token ID1438
Feature activation-2.64e-01

Pos logit contributions
jri+0.079
urgical+0.072
 disg+0.063
warts+0.061
 Generations+0.060
Neg logit contributions
iaz-0.101
••••-0.088
abase-0.084
paralle-0.080
••-0.078
Token-
Token ID12
Feature activation-5.60e-01

Pos logit contributions
jri+0.156
urgical+0.144
 disg+0.129
warts+0.127
 Generations+0.125
Neg logit contributions
iaz-0.127
••••-0.108
abase-0.102
paralle-0.095
••-0.091
TokenKim
Token ID26374
Feature activation-8.37e-01

Pos logit contributions
jri+0.352
urgical+0.316
 disg+0.269
warts+0.260
 Generations+0.254
Neg logit contributions
iaz-0.547
••••-0.484
abase-0.464
paralle-0.443
••-0.430
Token summit
Token ID14237
Feature activation+6.10e-01

Pos logit contributions
jri+0.596
urgical+0.543
 disg+0.474
warts+0.462
 Ended+0.451
Neg logit contributions
iaz-0.748
••••-0.653
abase-0.619
paralle-0.591
••-0.572
Token Manafort
Token ID26351
Feature activation+7.39e-01

Pos logit contributions
iaz+0.476
••••+0.412
abase+0.391
paralle+0.370
phant+0.352
Neg logit contributions
jri-0.502
urgical-0.463
 disg-0.408
warts-0.402
 Ended-0.401
Token's
Token ID338
Feature activation+1.26e+00

Pos logit contributions
iaz+0.567
••••+0.493
abase+0.464
paralle+0.433
••+0.423
Neg logit contributions
jri-0.612
urgical-0.566
 disg-0.509
warts-0.492
 Ended-0.489
Token attorneys
Token ID14449
Feature activation+2.00e+00

Pos logit contributions
iaz+1.185
••••+1.045
abase+1.001
paralle+0.956
••+0.930
Neg logit contributions
jri-0.832
urgical-0.754
 disg-0.647
warts-0.620
 Generations-0.618
Token say
Token ID910
Feature activation+1.71e+00

Pos logit contributions
iaz+1.716
••••+1.527
abase+1.446
••+1.334
paralle+1.331
Neg logit contributions
jri-1.459
urgical-1.356
 disg-1.199
 Ended-1.146
 Generations-1.136
Token he
Token ID339
Feature activation+2.61e-01

Pos logit contributions
iaz+1.457
••••+1.283
abase+1.198
paralle+1.148
••+1.119
Neg logit contributions
jri-1.285
urgical-1.180
 disg-1.045
 Generations-1.004
warts-1.000
Token should
Token ID815
Feature activation+4.71e-01

Pos logit contributions
iaz+0.209
••••+0.180
abase+0.171
paralle+0.161
••+0.156
Neg logit contributions
jri-0.210
urgical-0.194
 disg-0.172
warts-0.167
 Generations-0.165
Token get
Token ID651
Feature activation-5.06e-01

Pos logit contributions
iaz+0.422
••••+0.370
abase+0.357
paralle+0.337
••+0.325
Neg logit contributions
jri-0.329
urgical-0.302
 disg-0.264
warts-0.255
 Generations-0.251
Token less
Token ID1342
Feature activation+2.09e-01

Pos logit contributions
jri+0.388
urgical+0.356
 disg+0.313
warts+0.306
 Ended+0.300
Neg logit contributions
iaz-0.425
••••-0.368
abase-0.350
paralle-0.331
••-0.319
Tokenword
Token ID4775
Feature activation+2.97e-01

Pos logit contributions
jri+0.330
urgical+0.309
 disg+0.280
warts+0.276
 Ended+0.272
Neg logit contributions
iaz-0.214
••••-0.176
abase-0.164
paralle-0.151
••-0.143
Token by
Token ID416
Feature activation-1.02e+00

Pos logit contributions
iaz+0.224
••••+0.189
abase+0.180
paralle+0.168
••+0.161
Neg logit contributions
jri-0.252
urgical-0.235
 disg-0.209
warts-0.205
 Generations-0.203
Token Patrick
Token ID9925
Feature activation+1.58e-02

Pos logit contributions
jri+1.073
urgical+1.004
 disg+0.916
warts+0.904
 Generations+0.886
Neg logit contributions
iaz-0.564
••••-0.454
abase-0.413
paralle-0.382
••-0.355
Token J
Token ID449
Feature activation-2.06e-02

Pos logit contributions
iaz+0.012
••••+0.010
abase+0.010
paralle+0.009
••+0.009
Neg logit contributions
jri-0.013
urgical-0.012
 disg-0.011
warts-0.011
 Generations-0.011
Token.
Token ID13
Feature activation+9.49e-01

Pos logit contributions
jri+0.018
urgical+0.017
 disg+0.015
warts+0.015
 Generations+0.014
Neg logit contributions
iaz-0.015
••••-0.013
abase-0.012
paralle-0.011
••-0.011
Token Buchanan
Token ID36358
Feature activation+1.99e+00

Pos logit contributions
iaz+0.705
••••+0.607
abase+0.574
paralle+0.535
••+0.511
Neg logit contributions
jri-0.813
urgical-0.753
 disg-0.669
warts-0.662
 Generations-0.653
Token,
Token ID11
Feature activation-1.13e-01

Pos logit contributions
iaz+1.573
••••+1.349
abase+1.281
paralle+1.213
••+1.156
Neg logit contributions
jri-1.619
urgical-1.496
 disg-1.335
 Generations-1.305
warts-1.297
Token and
Token ID290
Feature activation-7.46e-01

Pos logit contributions
jri+0.096
urgical+0.089
 disg+0.080
warts+0.078
 Generations+0.077
Neg logit contributions
iaz-0.085
••••-0.072
abase-0.069
paralle-0.064
••-0.062
Token critical
Token ID4688
Feature activation+1.17e-01

Pos logit contributions
jri+0.512
urgical+0.463
 disg+0.401
warts+0.389
 Ended+0.379
Neg logit contributions
iaz-0.687
••••-0.604
abase-0.576
paralle-0.549
••-0.532
Token essays
Token ID27126
Feature activation+4.16e-01

Pos logit contributions
iaz+0.102
••••+0.089
abase+0.085
paralle+0.080
••+0.077
Neg logit contributions
jri-0.086
urgical-0.079
 disg-0.069
warts-0.067
 Ended-0.066
Token by
Token ID416
Feature activation-6.25e-01

Pos logit contributions
iaz+0.315
••••+0.266
abase+0.254
paralle+0.237
••+0.227
Neg logit contributions
jri-0.350
urgical-0.325
 disg-0.290
warts-0.286
 Generations-0.281
Token book
Token ID1492
Feature activation+1.57e+00

Pos logit contributions
iaz+0.204
••••+0.176
abase+0.169
paralle+0.159
••+0.153
Neg logit contributions
jri-0.179
urgical-0.165
 disg-0.145
warts-0.142
 Ended-0.139
Token,
Token ID11
Feature activation+7.75e-01

Pos logit contributions
iaz+1.249
••••+1.056
abase+1.004
paralle+0.937
phant+0.895
Neg logit contributions
jri-1.273
urgical-1.186
 disg-1.053
warts-1.032
 Generations-1.028
Token Re
Token ID797
Feature activation+7.27e-01

Pos logit contributions
iaz+0.691
••••+0.595
abase+0.579
paralle+0.542
••+0.517
Neg logit contributions
jri-0.555
urgical-0.510
 disg-0.449
warts-0.435
 Generations-0.431
Tokenclaim
Token ID6604
Feature activation+7.20e-01

Pos logit contributions
iaz+0.514
••••+0.428
abase+0.409
paralle+0.372
••+0.361
Neg logit contributions
jri-0.649
urgical-0.611
 disg-0.550
warts-0.536
 Generations-0.533
Tokening
Token ID278
Feature activation+3.93e-01

Pos logit contributions
iaz+0.525
••••+0.442
abase+0.418
paralle+0.392
••+0.372
Neg logit contributions
jri-0.628
urgical-0.585
 disg-0.521
warts-0.516
 Generations-0.514
Token the
Token ID262
Feature activation+1.99e+00

Pos logit contributions
iaz+0.305
••••+0.260
abase+0.246
paralle+0.232
••+0.222
Neg logit contributions
jri-0.323
urgical-0.301
 disg-0.267
warts-0.263
 Generations-0.260
Token American
Token ID1605
Feature activation+1.06e+00

Pos logit contributions
iaz+1.532
••••+1.281
abase+1.227
paralle+1.185
phant+1.104
Neg logit contributions
jri-1.628
urgical-1.550
 Generations-1.354
warts-1.340
 Ended-1.339
Token Right
Token ID6498
Feature activation+5.89e-01

Pos logit contributions
iaz+0.596
••••+0.471
abase+0.438
paralle+0.407
phant+0.368
Neg logit contributions
jri-1.092
urgical-1.037
 Generations-0.929
 disg-0.928
warts-0.922
Token:
Token ID25
Feature activation-3.77e-02

Pos logit contributions
iaz+0.438
••••+0.370
abase+0.354
paralle+0.332
••+0.311
Neg logit contributions
jri-0.502
urgical-0.470
 disg-0.414
warts-0.410
 Ended-0.409
Token The
Token ID383
Feature activation-9.35e-02

Pos logit contributions
jri+0.031
urgical+0.029
 disg+0.026
warts+0.025
 Generations+0.025
Neg logit contributions
iaz-0.029
••••-0.025
abase-0.024
paralle-0.022
••-0.021
Token Lost
Token ID9164
Feature activation+1.59e+00

Pos logit contributions
jri+0.075
urgical+0.070
 disg+0.061
warts+0.060
 Ended+0.059
Neg logit contributions
iaz-0.075
••••-0.064
abase-0.061
paralle-0.057
••-0.055
Token this
Token ID428
Feature activation-6.18e-02

Pos logit contributions
iaz+0.083
••••+0.071
abase+0.067
paralle+0.063
••+0.060
Neg logit contributions
jri-0.092
urgical-0.085
 disg-0.076
warts-0.074
 Generations-0.073
Token content
Token ID2695
Feature activation-1.71e-01

Pos logit contributions
jri+0.046
urgical+0.042
 disg+0.037
warts+0.036
 Ended+0.036
Neg logit contributions
iaz-0.053
••••-0.046
abase-0.044
paralle-0.041
••-0.040
Token:
Token ID25
Feature activation+1.55e+00

Pos logit contributions
jri+0.146
urgical+0.135
 disg+0.121
warts+0.118
 Ended+0.116
Neg logit contributions
iaz-0.129
••••-0.110
abase-0.104
paralle-0.097
••-0.093
Token Sylv
Token ID24286
Feature activation+1.60e-01

Pos logit contributions
iaz+1.410
••••+1.223
abase+1.194
paralle+1.133
••+1.084
Neg logit contributions
jri-1.074
urgical-0.976
 disg-0.853
warts-0.830
 Ended-0.823
Tokenie
Token ID494
Feature activation+1.37e+00

Pos logit contributions
iaz+0.116
••••+0.098
abase+0.092
paralle+0.086
••+0.083
Neg logit contributions
jri-0.141
urgical-0.131
 disg-0.117
warts-0.115
 Generations-0.113
Token von
Token ID18042
Feature activation+1.99e+00

Pos logit contributions
iaz+1.170
••••+1.005
abase+0.970
paralle+0.906
••+0.868
Neg logit contributions
jri-1.030
urgical-0.951
 disg-0.837
warts-0.830
 Generations-0.815
Token Du
Token ID10343
Feature activation-1.74e-01

Pos logit contributions
iaz+1.559
••••+1.337
abase+1.258
paralle+1.175
••+1.153
Neg logit contributions
jri-1.619
urgical-1.510
 disg-1.353
 Generations-1.319
warts-1.313
Tokenug
Token ID1018
Feature activation+4.10e-01

Pos logit contributions
jri+0.134
urgical+0.123
 disg+0.108
warts+0.106
 Ended+0.103
Neg logit contributions
iaz-0.145
••••-0.126
abase-0.120
paralle-0.113
••-0.109
Tokenlas
Token ID21921
Feature activation+1.22e+00

Pos logit contributions
iaz+0.310
••••+0.265
abase+0.251
paralle+0.236
••+0.226
Neg logit contributions
jri-0.348
urgical-0.322
 disg-0.287
warts-0.282
 Generations-0.278
Token-
Token ID12
Feature activation+1.12e+00

Pos logit contributions
iaz+0.889
••••+0.759
abase+0.733
paralle+0.670
••+0.649
Neg logit contributions
jri-1.062
urgical-0.981
 disg-0.876
warts-0.866
 Ended-0.860
TokenI
Token ID40
Feature activation+1.15e+00

Pos logit contributions
iaz+0.856
••••+0.740
abase+0.712
paralle+0.659
••+0.630
Neg logit contributions
jri-0.926
urgical-0.855
warts-0.753
 disg-0.751
 Generations-0.733
TokenKim
Token ID26374
Feature activation-7.70e-01

Pos logit contributions
jri+0.471
urgical+0.423
 disg+0.361
warts+0.348
 Generations+0.341
Neg logit contributions
iaz-0.730
••••-0.645
abase-0.617
paralle-0.591
••-0.573
Token summit
Token ID14237
Feature activation+5.59e-01

Pos logit contributions
jri+0.548
urgical+0.499
 disg+0.435
warts+0.424
 Ended+0.414
Neg logit contributions
iaz-0.688
••••-0.601
abase-0.571
paralle-0.545
••-0.527
Token Manafort
Token ID26351
Feature activation+7.44e-01

Pos logit contributions
iaz+0.432
••••+0.373
abase+0.353
paralle+0.334
phant+0.317
Neg logit contributions
jri-0.465
urgical-0.430
 disg-0.379
warts-0.375
 Generations-0.371
Token's
Token ID338
Feature activation+1.17e+00

Pos logit contributions
iaz+0.572
••••+0.497
abase+0.467
paralle+0.437
••+0.426
Neg logit contributions
jri-0.619
urgical-0.570
 disg-0.514
warts-0.496
 Ended-0.493
Token attorneys
Token ID14449
Feature activation+1.73e+00

Pos logit contributions
iaz+1.069
••••+0.939
abase+0.899
paralle+0.856
••+0.832
Neg logit contributions
jri-0.799
urgical-0.726
 disg-0.629
warts-0.603
 Generations-0.600
Token say
Token ID910
Feature activation+1.97e+00

Pos logit contributions
iaz+1.565
••••+1.397
abase+1.325
paralle+1.228
••+1.225
Neg logit contributions
jri-1.186
urgical-1.091
 disg-0.957
 Ended-0.910
 Generations-0.900
Token he
Token ID339
Feature activation-4.82e-02

Pos logit contributions
iaz+1.676
••••+1.481
abase+1.377
paralle+1.317
••+1.291
Neg logit contributions
jri-1.476
urgical-1.355
 disg-1.204
 Generations-1.149
warts-1.145
Token should
Token ID815
Feature activation+5.64e-01

Pos logit contributions
jri+0.039
urgical+0.036
 disg+0.032
warts+0.031
 Ended+0.030
Neg logit contributions
iaz-0.038
••••-0.033
abase-0.031
paralle-0.030
••-0.029
Token get
Token ID651
Feature activation-4.52e-01

Pos logit contributions
iaz+0.498
••••+0.437
abase+0.421
paralle+0.396
••+0.383
Neg logit contributions
jri-0.400
urgical-0.369
 disg-0.323
warts-0.312
 Generations-0.307
Token less
Token ID1342
Feature activation+2.40e-01

Pos logit contributions
jri+0.339
urgical+0.310
 disg+0.272
warts+0.266
 Ended+0.260
Neg logit contributions
iaz-0.387
••••-0.336
abase-0.320
paralle-0.303
••-0.292
Token than
Token ID621
Feature activation+2.35e-01

Pos logit contributions
iaz+0.201
••••+0.175
abase+0.168
paralle+0.158
••+0.152
Neg logit contributions
jri-0.183
urgical-0.169
 disg-0.148
warts-0.144
 Ended-0.142
Token Hillary
Token ID5073
Feature activation-1.21e-01

Pos logit contributions
iaz+0.318
••••+0.277
abase+0.264
paralle+0.249
••+0.241
Neg logit contributions
jri-0.275
urgical-0.252
 disg-0.220
warts-0.217
 Generations-0.213
Token Diane
Token ID26542
Feature activation+4.97e-01

Pos logit contributions
jri+0.093
urgical+0.086
 disg+0.075
warts+0.074
 Ended+0.072
Neg logit contributions
iaz-0.101
••••-0.088
abase-0.083
paralle-0.079
••-0.076
Token Rodham
Token ID41848
Feature activation+3.99e-01

Pos logit contributions
iaz+0.383
••••+0.329
abase+0.312
paralle+0.293
••+0.280
Neg logit contributions
jri-0.411
urgical-0.381
 disg-0.338
warts-0.331
 Generations-0.328
Token Clinton
Token ID2605
Feature activation-3.42e-02

Pos logit contributions
iaz+0.347
••••+0.304
abase+0.290
paralle+0.274
••+0.263
Neg logit contributions
jri-0.291
urgical-0.266
 disg-0.231
warts-0.228
 Ended-0.225
TokenSanders
Token ID26747
Feature activation+7.14e-02

Pos logit contributions
jri+0.029
urgical+0.027
 disg+0.024
warts+0.023
 Ended+0.023
Neg logit contributions
iaz-0.026
••••-0.022
abase-0.021
paralle-0.020
••-0.019
Token:
Token ID25
Feature activation+1.94e+00

Pos logit contributions
iaz+0.052
••••+0.044
abase+0.042
paralle+0.039
••+0.037
Neg logit contributions
jri-0.063
urgical-0.058
 disg-0.052
warts-0.051
 Ended-0.050
Token '
Token ID705
Feature activation-1.16e+00

Pos logit contributions
iaz+1.565
••••+1.366
abase+1.287
paralle+1.224
phant+1.162
Neg logit contributions
jri-1.526
urgical-1.413
 disg-1.259
warts-1.244
 Ended-1.240
TokenI
Token ID40
Feature activation+7.01e-01

Pos logit contributions
jri+0.991
urgical+0.920
 disg+0.821
warts+0.799
 Ended+0.789
Neg logit contributions
iaz-0.873
••••-0.743
abase-0.701
paralle-0.661
••-0.636
Token fully
Token ID3938
Feature activation+4.99e-02

Pos logit contributions
iaz+0.581
••••+0.500
abase+0.484
paralle+0.456
••+0.434
Neg logit contributions
jri-0.542
urgical-0.500
 disg-0.441
warts-0.429
 Generations-0.425
Token expect
Token ID1607
Feature activation+4.92e-01

Pos logit contributions
iaz+0.042
••••+0.036
abase+0.035
paralle+0.033
••+0.032
Neg logit contributions
jri-0.038
urgical-0.035
 disg-0.031
warts-0.030
 Generations-0.029
Token'
Token ID6
Feature activation+3.37e-01

Pos logit contributions
iaz+0.325
••••+0.270
abase+0.256
paralle+0.235
phant+0.221
Neg logit contributions
jri-0.463
urgical-0.432
 disg-0.390
warts-0.383
 Generations-0.379
Token,
Token ID11
Feature activation+2.51e-01

Pos logit contributions
iaz+0.078
••••+0.067
abase+0.063
paralle+0.059
••+0.056
Neg logit contributions
jri-0.092
urgical-0.085
 disg-0.076
warts-0.075
 Ended-0.073
Token but
Token ID475
Feature activation-8.79e-02

Pos logit contributions
iaz+0.195
••••+0.167
abase+0.158
paralle+0.149
••+0.143
Neg logit contributions
jri-0.208
urgical-0.192
 disg-0.171
warts-0.167
 Ended-0.165
Token coach
Token ID3985
Feature activation-9.67e-02

Pos logit contributions
jri+0.073
urgical+0.067
 disg+0.060
warts+0.059
 Ended+0.058
Neg logit contributions
iaz-0.068
••••-0.058
abase-0.055
paralle-0.052
••-0.050
Token Joel
Token ID18623
Feature activation-2.20e-01

Pos logit contributions
jri+0.069
urgical+0.063
 disg+0.055
warts+0.053
 Generations+0.053
Neg logit contributions
iaz-0.086
••••-0.075
abase-0.072
paralle-0.068
••-0.066
Token Qu
Token ID2264
Feature activation-6.98e-01

Pos logit contributions
jri+0.212
urgical+0.199
 disg+0.180
warts+0.177
 Ended+0.175
Neg logit contributions
iaz-0.140
••••-0.115
abase-0.108
paralle-0.100
••-0.094
Tokenenne
Token ID29727
Feature activation+1.94e+00

Pos logit contributions
jri+0.464
urgical+0.418
 disg+0.363
warts+0.351
 Generations+0.344
Neg logit contributions
iaz-0.658
••••-0.576
abase-0.553
paralle-0.526
••-0.509
Tokenville
Token ID4244
Feature activation+8.11e-02

Pos logit contributions
iaz+1.804
••••+1.600
abase+1.532
paralle+1.455
••+1.418
Neg logit contributions
jri-1.327
urgical-1.194
warts-0.992
 disg-0.987
 Ended-0.987
Token has
Token ID468
Feature activation-2.20e-01

Pos logit contributions
iaz+0.064
••••+0.055
abase+0.052
paralle+0.049
••+0.047
Neg logit contributions
jri-0.066
urgical-0.061
 disg-0.054
warts-0.053
 Ended-0.052
Token already
Token ID1541
Feature activation-1.28e-01

Pos logit contributions
jri+0.169
urgical+0.155
 disg+0.137
warts+0.134
 Ended+0.131
Neg logit contributions
iaz-0.183
••••-0.158
abase-0.151
paralle-0.142
••-0.137
Token broken
Token ID5445
Feature activation+3.04e-01

Pos logit contributions
jri+0.097
urgical+0.089
 disg+0.078
warts+0.076
 Ended+0.075
Neg logit contributions
iaz-0.108
••••-0.094
abase-0.090
paralle-0.085
••-0.082
Token up
Token ID510
Feature activation-7.88e-02

Pos logit contributions
iaz+0.238
••••+0.204
abase+0.193
paralle+0.182
••+0.174
Neg logit contributions
jri-0.250
urgical-0.230
 disg-0.205
warts-0.200
 Generations-0.197
Token Hillary
Token ID5073
Feature activation-1.42e-01

Pos logit contributions
iaz+0.272
••••+0.237
abase+0.226
paralle+0.213
••+0.206
Neg logit contributions
jri-0.234
urgical-0.214
 disg-0.187
warts-0.183
 Generations-0.180
Token Diane
Token ID26542
Feature activation+4.03e-01

Pos logit contributions
jri+0.109
urgical+0.101
 disg+0.088
warts+0.086
 Ended+0.085
Neg logit contributions
iaz-0.118
••••-0.103
abase-0.098
paralle-0.092
••-0.089
Token Rodham
Token ID41848
Feature activation+3.61e-01

Pos logit contributions
iaz+0.309
••••+0.266
abase+0.252
paralle+0.236
••+0.226
Neg logit contributions
jri-0.334
urgical-0.311
 disg-0.275
warts-0.270
 Ended-0.267
Token Clinton
Token ID2605
Feature activation-1.14e-01

Pos logit contributions
iaz+0.314
••••+0.275
abase+0.263
paralle+0.248
••+0.238
Neg logit contributions
jri-0.264
urgical-0.242
 disg-0.210
warts-0.207
 Ended-0.204
TokenSanders
Token ID26747
Feature activation-7.97e-03

Pos logit contributions
jri+0.096
urgical+0.088
 disg+0.079
warts+0.077
 Ended+0.076
Neg logit contributions
iaz-0.086
••••-0.074
abase-0.070
paralle-0.066
••-0.063
Token:
Token ID25
Feature activation+1.94e+00

Pos logit contributions
jri+0.007
urgical+0.006
 disg+0.006
warts+0.006
 Ended+0.005
Neg logit contributions
iaz-0.006
••••-0.005
abase-0.005
paralle-0.004
••-0.004
Token '
Token ID705
Feature activation-9.18e-01

Pos logit contributions
iaz+1.560
••••+1.361
abase+1.286
paralle+1.221
phant+1.160
Neg logit contributions
jri-1.526
urgical-1.408
 disg-1.258
warts-1.240
 Ended-1.230
TokenI
Token ID40
Feature activation+5.25e-01

Pos logit contributions
jri+0.777
urgical+0.720
 disg+0.641
warts+0.626
 Ended+0.617
Neg logit contributions
iaz-0.696
••••-0.593
abase-0.560
paralle-0.527
••-0.507
Token fully
Token ID3938
Feature activation+1.73e-01

Pos logit contributions
iaz+0.432
••••+0.372
abase+0.359
paralle+0.338
••+0.322
Neg logit contributions
jri-0.409
urgical-0.377
 disg-0.333
warts-0.324
 Generations-0.320
Token expect
Token ID1607
Feature activation+3.55e-01

Pos logit contributions
iaz+0.149
••••+0.130
abase+0.124
paralle+0.118
••+0.114
Neg logit contributions
jri-0.128
urgical-0.117
 disg-0.103
warts-0.099
 Generations-0.097
Token'
Token ID6
Feature activation+1.51e-01

Pos logit contributions
iaz+0.241
••••+0.201
abase+0.190
paralle+0.176
phant+0.166
Neg logit contributions
jri-0.328
urgical-0.306
 disg-0.275
warts-0.270
 Generations-0.267
Token tweets
Token ID12665
Feature activation-1.67e+00

Pos logit contributions
jri+0.843
urgical+0.747
 disg+0.622
warts+0.606
 Ended+0.591
Neg logit contributions
iaz-1.547
••••-1.387
abase-1.329
paralle-1.277
••-1.245
Token are
Token ID389
Feature activation-1.09e+00

Pos logit contributions
jri+1.438
urgical+1.314
 disg+1.186
warts+1.170
 Ended+1.141
Neg logit contributions
iaz-1.251
••••-1.070
abase-1.000
paralle-0.945
••-0.911
Token sometimes
Token ID3360
Feature activation-3.43e-01

Pos logit contributions
jri+0.815
urgical+0.741
 disg+0.649
warts+0.637
 Ended+0.624
Neg logit contributions
iaz-0.933
••••-0.816
abase-0.766
paralle-0.731
••-0.711
Token deliberately
Token ID14593
Feature activation-2.06e+00

Pos logit contributions
jri+0.263
urgical+0.242
 disg+0.212
warts+0.208
 Ended+0.204
Neg logit contributions
iaz-0.287
••••-0.249
abase-0.237
paralle-0.224
••-0.216
Token provocative
Token ID28695
Feature activation+5.92e-01

Pos logit contributions
jri+1.545
urgical+1.385
 disg+1.230
warts+1.205
 Ended+1.170
Neg logit contributions
iaz-1.780
••••-1.550
abase-1.455
paralle-1.408
phant-1.361
Token,
Token ID11
Feature activation-2.12e+00

Pos logit contributions
iaz+0.452
••••+0.385
abase+0.371
paralle+0.344
phant+0.325
Neg logit contributions
jri-0.496
urgical-0.462
 disg-0.409
warts-0.400
 Ended-0.395
Token often
Token ID1690
Feature activation-1.04e+00

Pos logit contributions
jri+1.755
urgical+1.590
 disg+1.405
warts+1.388
 Generations+1.385
Neg logit contributions
iaz-1.652
••••-1.417
abase-1.301
paralle-1.279
phant-1.209
Token made
Token ID925
Feature activation+4.47e-02

Pos logit contributions
jri+0.922
urgical+0.853
 disg+0.765
warts+0.755
 Ended+0.740
Neg logit contributions
iaz-0.741
••••-0.628
abase-0.583
paralle-0.553
••-0.528
Token in
Token ID287
Feature activation+1.89e-01

Pos logit contributions
iaz+0.036
••••+0.031
abase+0.030
paralle+0.028
••+0.027
Neg logit contributions
jri-0.036
urgical-0.033
 disg-0.029
warts-0.028
 Generations-0.028
Token j
Token ID474
Feature activation+1.52e-01

Pos logit contributions
iaz+0.153
••••+0.132
abase+0.127
paralle+0.119
••+0.114
Neg logit contributions
jri-0.150
urgical-0.138
 disg-0.122
warts-0.119
 Ended-0.116
Tokenest
Token ID395
Feature activation+4.23e-01

Pos logit contributions
iaz+0.077
••••+0.060
abase+0.055
paralle+0.049
••+0.045
Neg logit contributions
jri-0.166
urgical-0.157
 disg-0.144
warts-0.141
 Ended-0.140
Token that
Token ID326
Feature activation-1.66e+00

Pos logit contributions
jri+1.066
urgical+0.984
 disg+0.878
warts+0.859
 Ended+0.847
Neg logit contributions
iaz-1.008
••••-0.869
abase-0.812
paralle-0.771
••-0.748
Token my
Token ID616
Feature activation-1.49e+00

Pos logit contributions
jri+1.320
urgical+1.210
 disg+1.063
warts+1.054
 Ended+1.030
Neg logit contributions
iaz-1.350
••••-1.176
abase-1.096
paralle-1.056
••-1.020
Token tweets
Token ID12665
Feature activation-1.67e+00

Pos logit contributions
jri+0.843
urgical+0.747
 disg+0.622
warts+0.606
 Ended+0.591
Neg logit contributions
iaz-1.547
••••-1.387
abase-1.329
paralle-1.277
••-1.245
Token are
Token ID389
Feature activation-1.09e+00

Pos logit contributions
jri+1.438
urgical+1.314
 disg+1.186
warts+1.170
 Ended+1.141
Neg logit contributions
iaz-1.251
••••-1.070
abase-1.000
paralle-0.945
••-0.911
Token sometimes
Token ID3360
Feature activation-3.43e-01

Pos logit contributions
jri+0.815
urgical+0.741
 disg+0.649
warts+0.637
 Ended+0.624
Neg logit contributions
iaz-0.933
••••-0.816
abase-0.766
paralle-0.731
••-0.711
Token deliberately
Token ID14593
Feature activation-2.06e+00

Pos logit contributions
jri+0.263
urgical+0.242
 disg+0.212
warts+0.208
 Ended+0.204
Neg logit contributions
iaz-0.287
••••-0.249
abase-0.237
paralle-0.224
••-0.216
Token provocative
Token ID28695
Feature activation+5.92e-01

Pos logit contributions
jri+1.545
urgical+1.385
 disg+1.230
warts+1.205
 Ended+1.170
Neg logit contributions
iaz-1.780
••••-1.550
abase-1.455
paralle-1.408
phant-1.361
Token,
Token ID11
Feature activation-2.12e+00

Pos logit contributions
iaz+0.452
••••+0.385
abase+0.371
paralle+0.344
phant+0.325
Neg logit contributions
jri-0.496
urgical-0.462
 disg-0.409
warts-0.400
 Ended-0.395
Token often
Token ID1690
Feature activation-1.04e+00

Pos logit contributions
jri+1.755
urgical+1.590
 disg+1.405
warts+1.388
 Generations+1.385
Neg logit contributions
iaz-1.652
••••-1.417
abase-1.301
paralle-1.279
phant-1.209
Token made
Token ID925
Feature activation+4.47e-02

Pos logit contributions
jri+0.922
urgical+0.853
 disg+0.765
warts+0.755
 Ended+0.740
Neg logit contributions
iaz-0.741
••••-0.628
abase-0.583
paralle-0.553
••-0.528
Token in
Token ID287
Feature activation+1.89e-01

Pos logit contributions
iaz+0.036
••••+0.031
abase+0.030
paralle+0.028
••+0.027
Neg logit contributions
jri-0.036
urgical-0.033
 disg-0.029
warts-0.028
 Generations-0.028
TokenGet
Token ID3855
Feature activation+1.04e+00

Pos logit contributions
jri+0.060
urgical+0.055
 disg+0.049
warts+0.048
 Ended+0.047
Neg logit contributions
iaz-0.054
••••-0.046
abase-0.044
paralle-0.041
••-0.040
Token The
Token ID383
Feature activation-3.28e-02

Pos logit contributions
iaz+0.823
••••+0.710
abase+0.677
paralle+0.635
••+0.618
Neg logit contributions
jri-0.835
urgical-0.781
 disg-0.692
warts-0.670
 Generations-0.668
Token Times
Token ID3782
Feature activation+2.39e-02

Pos logit contributions
jri+0.030
urgical+0.028
 disg+0.025
warts+0.025
 Generations+0.024
Neg logit contributions
iaz-0.022
••••-0.019
abase-0.018
paralle-0.016
••-0.016
Token of
Token ID286
Feature activation-1.08e+00

Pos logit contributions
iaz+0.018
••••+0.015
abase+0.015
paralle+0.014
••+0.013
Neg logit contributions
jri-0.020
urgical-0.019
 disg-0.017
warts-0.016
 Generations-0.016
Token Israel
Token ID2692
Feature activation-3.78e-01

Pos logit contributions
jri+0.669
urgical+0.599
 disg+0.516
warts+0.492
 Generations+0.479
Neg logit contributions
iaz-1.061
••••-0.940
abase-0.896
paralle-0.861
••-0.837
Token's
Token ID338
Feature activation-1.98e+00

Pos logit contributions
jri+0.322
urgical+0.299
 disg+0.267
warts+0.261
 Ended+0.257
Neg logit contributions
iaz-0.283
••••-0.241
abase-0.228
paralle-0.213
••-0.204
Token Daily
Token ID6714
Feature activation-4.37e-01

Pos logit contributions
jri+1.950
urgical+1.788
 disg+1.657
warts+1.603
 Ended+1.580
Neg logit contributions
iaz-1.235
••••-1.022
abase-0.937
paralle-0.874
••-0.828
Token Edition
Token ID5061
Feature activation-3.61e-01

Pos logit contributions
jri+0.350
urgical+0.322
 disg+0.285
warts+0.279
 Ended+0.274
Neg logit contributions
iaz-0.351
••••-0.302
abase-0.287
paralle-0.271
••-0.260
Token by
Token ID416
Feature activation-8.56e-01

Pos logit contributions
jri+0.314
urgical+0.291
 disg+0.260
warts+0.255
 Generations+0.251
Neg logit contributions
iaz-0.265
••••-0.225
abase-0.212
paralle-0.199
••-0.190
Token email
Token ID3053
Feature activation-1.76e-01

Pos logit contributions
jri+0.683
urgical+0.627
 disg+0.556
warts+0.543
 Ended+0.534
Neg logit contributions
iaz-0.689
••••-0.596
abase-0.563
paralle-0.533
••-0.513
Token and
Token ID290
Feature activation+3.65e-01

Pos logit contributions
jri+0.150
urgical+0.139
 disg+0.124
warts+0.121
 Ended+0.120
Neg logit contributions
iaz-0.132
••••-0.112
abase-0.107
paralle-0.100
••-0.095
Token ©
Token ID10673
Feature activation+1.34e-02

Pos logit contributions
iaz+0.095
••••+0.080
abase+0.076
paralle+0.071
••+0.068
Neg logit contributions
jri-0.112
urgical-0.104
 disg-0.093
warts-0.091
 Ended-0.090
Token 2019
Token ID13130
Feature activation-7.19e-01

Pos logit contributions
iaz+0.010
••••+0.009
abase+0.008
paralle+0.008
••+0.007
Neg logit contributions
jri-0.011
urgical-0.011
 disg-0.009
warts-0.009
 Generations-0.009
Token The
Token ID383
Feature activation+4.44e-01

Pos logit contributions
jri+0.607
urgical+0.560
 disg+0.501
warts+0.491
 Ended+0.481
Neg logit contributions
iaz-0.546
••••-0.465
abase-0.440
paralle-0.413
••-0.397
Token Washington
Token ID2669
Feature activation-2.50e-01

Pos logit contributions
iaz+0.339
••••+0.290
abase+0.274
paralle+0.258
••+0.246
Neg logit contributions
jri-0.373
urgical-0.345
 disg-0.306
warts-0.301
 Generations-0.299
Token Times
Token ID3782
Feature activation-5.61e-01

Pos logit contributions
jri+0.231
urgical+0.215
 disg+0.194
warts+0.190
 Generations+0.188
Neg logit contributions
iaz-0.171
••••-0.143
abase-0.134
paralle-0.125
••-0.118
Token,
Token ID11
Feature activation-1.96e+00

Pos logit contributions
jri+0.488
urgical+0.453
 disg+0.406
warts+0.397
 Generations+0.390
Neg logit contributions
iaz-0.411
••••-0.349
abase-0.329
paralle-0.308
••-0.295
Token LLC
Token ID11419
Feature activation-7.11e-01

Pos logit contributions
jri+1.295
urgical+1.167
 disg+1.015
warts+0.989
 Generations+0.945
Neg logit contributions
iaz-1.835
••••-1.635
abase-1.541
paralle-1.495
••-1.441
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
jri+0.631
urgical+0.585
 disg+0.527
warts+0.515
 Generations+0.507
Neg logit contributions
iaz-0.510
••••-0.430
abase-0.405
paralle-0.378
••-0.362
Token Click
Token ID6914
Feature activation-7.44e-01

Pos logit contributions
jri+0.378
urgical+0.357
 disg+0.329
warts+0.324
 Ended+0.320
Neg logit contributions
iaz-0.160
••••-0.122
abase-0.110
paralle-0.098
••-0.090
Token here
Token ID994
Feature activation-2.81e-01

Pos logit contributions
jri+0.634
urgical+0.588
 disg+0.525
warts+0.514
 Generations+0.505
Neg logit contributions
iaz-0.559
••••-0.476
abase-0.449
paralle-0.421
••-0.405
Token for
Token ID329
Feature activation-4.39e-01

Pos logit contributions
jri+0.240
urgical+0.222
 disg+0.198
warts+0.194
 Generations+0.191
Neg logit contributions
iaz-0.210
••••-0.179
abase-0.169
paralle-0.159
••-0.152
Token state
Token ID1181
Feature activation-6.87e-01

Pos logit contributions
iaz+0.900
••••+0.788
abase+0.747
paralle+0.707
••+0.694
Neg logit contributions
jri-0.715
urgical-0.657
 disg-0.564
warts-0.560
 Generations-0.545
Token for
Token ID329
Feature activation-6.82e-02

Pos logit contributions
jri+0.595
urgical+0.551
 disg+0.493
warts+0.483
 Generations+0.475
Neg logit contributions
iaz-0.507
••••-0.430
abase-0.406
paralle-0.381
••-0.363
Token free
Token ID1479
Feature activation-5.70e-02

Pos logit contributions
jri+0.051
urgical+0.046
 disg+0.040
warts+0.039
 Generations+0.039
Neg logit contributions
iaz-0.059
••••-0.051
abase-0.049
paralle-0.046
••-0.045
Token.
Token ID13
Feature activation+3.98e-01

Pos logit contributions
jri+0.050
urgical+0.046
 disg+0.041
warts+0.040
 Generations+0.040
Neg logit contributions
iaz-0.042
••••-0.035
abase-0.033
paralle-0.031
••-0.030
Token Any
Token ID4377
Feature activation-1.43e+00

Pos logit contributions
iaz+0.285
••••+0.238
abase+0.226
paralle+0.211
••+0.200
Neg logit contributions
jri-0.353
urgical-0.330
 disg-0.294
warts-0.289
 Generations-0.288
Token contribution
Token ID10156
Feature activation-1.92e+00

Pos logit contributions
jri+1.148
urgical+1.051
 disg+0.936
warts+0.916
 Ended+0.893
Neg logit contributions
iaz-1.139
••••-0.995
abase-0.934
paralle-0.886
••-0.856
Token to
Token ID284
Feature activation-4.81e-02

Pos logit contributions
jri+1.725
urgical+1.601
 disg+1.445
warts+1.408
 Generations+1.387
Neg logit contributions
iaz-1.368
••••-1.138
abase-1.054
paralle-0.996
phant-0.965
Token help
Token ID1037
Feature activation+2.13e-01

Pos logit contributions
jri+0.038
urgical+0.035
 disg+0.031
warts+0.031
 Generations+0.030
Neg logit contributions
iaz-0.039
••••-0.033
abase-0.032
paralle-0.030
••-0.029
Token fund
Token ID1814
Feature activation-2.84e-01

Pos logit contributions
iaz+0.184
••••+0.160
abase+0.153
paralle+0.145
••+0.140
Neg logit contributions
jri-0.158
urgical-0.144
 disg-0.127
warts-0.124
 Generations-0.122
Token our
Token ID674
Feature activation-6.59e-02

Pos logit contributions
jri+0.246
urgical+0.228
 disg+0.204
warts+0.200
 Generations+0.196
Neg logit contributions
iaz-0.209
••••-0.177
abase-0.168
paralle-0.157
••-0.150
Token work
Token ID670
Feature activation+3.29e-02

Pos logit contributions
jri+0.043
urgical+0.039
 disg+0.033
warts+0.033
 Generations+0.032
Neg logit contributions
iaz-0.062
••••-0.055
abase-0.053
paralle-0.050
••-0.049
Token more
Token ID517
Feature activation+5.38e-01

Pos logit contributions
iaz+0.116
••••+0.100
abase+0.095
paralle+0.089
••+0.086
Neg logit contributions
jri-0.117
urgical-0.108
 disg-0.096
warts-0.094
 Generations-0.092
Token
Token ID1587
Feature activation+1.32e-01

Pos logit contributions
iaz+0.466
••••+0.410
abase+0.392
paralle+0.371
••+0.357
Neg logit contributions
jri-0.390
urgical-0.360
 disg-0.311
warts-0.303
 Ended-0.301
Token
Token ID245
Feature activation-2.32e-01

Pos logit contributions
iaz+0.097
••••+0.082
abase+0.078
paralle+0.073
••+0.069
Neg logit contributions
jri-0.114
urgical-0.106
 disg-0.095
warts-0.093
 Ended-0.092
Token all
Token ID477
Feature activation+2.11e-01

Pos logit contributions
jri+0.201
urgical+0.186
 disg+0.166
warts+0.163
 Generations+0.161
Neg logit contributions
iaz-0.171
••••-0.145
abase-0.137
paralle-0.128
••-0.123
Token for
Token ID329
Feature activation-8.96e-01

Pos logit contributions
iaz+0.163
••••+0.140
abase+0.132
paralle+0.124
••+0.120
Neg logit contributions
jri-0.174
urgical-0.162
 disg-0.144
warts-0.142
 Ended-0.139
Token only
Token ID691
Feature activation-1.91e+00

Pos logit contributions
jri+0.671
urgical+0.614
 disg+0.540
warts+0.526
 Generations+0.516
Neg logit contributions
iaz-0.765
••••-0.667
abase-0.632
paralle-0.604
••-0.579
Token $
Token ID720
Feature activation-7.20e-01

Pos logit contributions
jri+1.639
urgical+1.501
 disg+1.370
warts+1.330
 Ended+1.306
Neg logit contributions
iaz-1.439
••••-1.234
abase-1.149
paralle-1.103
••-1.047
Token19
Token ID1129
Feature activation-1.17e+00

Pos logit contributions
jri+0.572
urgical+0.525
 disg+0.466
warts+0.455
 Ended+0.447
Neg logit contributions
iaz-0.585
••••-0.504
abase-0.477
paralle-0.453
••-0.435
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
jri+1.056
urgical+0.982
 disg+0.884
warts+0.871
 Generations+0.855
Neg logit contributions
iaz-0.825
••••-0.694
abase-0.649
paralle-0.609
••-0.581
Token99
Token ID2079
Feature activation-2.62e-01

Pos logit contributions
jri+0.807
urgical+0.736
 disg+0.644
warts+0.625
 Generations+0.613
Neg logit contributions
iaz-0.969
••••-0.850
abase-0.809
paralle-0.770
••-0.741
Token...
Token ID986
Feature activation-3.59e-01

Pos logit contributions
jri+0.217
urgical+0.201
 disg+0.178
warts+0.174
 Ended+0.172
Neg logit contributions
iaz-0.203
••••-0.174
abase-0.165
paralle-0.155
••-0.149
Token ©
Token ID10673
Feature activation+4.45e-01

Pos logit contributions
iaz+0.319
••••+0.269
abase+0.255
paralle+0.238
••+0.227
Neg logit contributions
jri-0.384
urgical-0.356
 disg-0.319
warts-0.312
 Ended-0.309
Token 2019
Token ID13130
Feature activation-1.89e-01

Pos logit contributions
iaz+0.341
••••+0.291
abase+0.277
paralle+0.259
••+0.249
Neg logit contributions
jri-0.373
urgical-0.344
 disg-0.306
warts-0.300
 Ended-0.296
Token The
Token ID383
Feature activation+6.95e-01

Pos logit contributions
jri+0.158
urgical+0.146
 disg+0.130
warts+0.127
 Generations+0.125
Neg logit contributions
iaz-0.146
••••-0.125
abase-0.118
paralle-0.111
••-0.106
Token Washington
Token ID2669
Feature activation-4.00e-02

Pos logit contributions
iaz+0.534
••••+0.458
abase+0.434
paralle+0.408
••+0.388
Neg logit contributions
jri-0.581
urgical-0.536
 disg-0.473
warts-0.467
 Generations-0.465
Token Times
Token ID3782
Feature activation-3.88e-01

Pos logit contributions
jri+0.037
urgical+0.034
 disg+0.031
warts+0.030
 Generations+0.030
Neg logit contributions
iaz-0.027
••••-0.023
abase-0.022
paralle-0.020
••-0.019
Token,
Token ID11
Feature activation-1.90e+00

Pos logit contributions
jri+0.336
urgical+0.312
 disg+0.279
warts+0.273
 Ended+0.269
Neg logit contributions
iaz-0.286
••••-0.243
abase-0.229
paralle-0.215
••-0.205
Token LLC
Token ID11419
Feature activation-5.86e-01

Pos logit contributions
jri+1.261
urgical+1.134
 disg+0.988
warts+0.962
 Generations+0.919
Neg logit contributions
iaz-1.781
••••-1.587
abase-1.496
paralle-1.454
••-1.398
Token.
Token ID13
Feature activation-1.42e-02

Pos logit contributions
jri+0.518
urgical+0.481
 disg+0.432
warts+0.423
 Generations+0.416
Neg logit contributions
iaz-0.421
••••-0.356
abase-0.335
paralle-0.313
••-0.300
Token Click
Token ID6914
Feature activation-7.43e-01

Pos logit contributions
jri+0.016
urgical+0.015
 disg+0.014
warts+0.014
 Ended+0.014
Neg logit contributions
iaz-0.007
••••-0.005
abase-0.005
paralle-0.004
••-0.004
Token here
Token ID994
Feature activation-5.82e-02

Pos logit contributions
jri+0.633
urgical+0.587
 disg+0.524
warts+0.514
 Generations+0.504
Neg logit contributions
iaz-0.558
••••-0.475
abase-0.448
paralle-0.421
••-0.405
Token for
Token ID329
Feature activation-1.97e-01

Pos logit contributions
jri+0.050
urgical+0.046
 disg+0.041
warts+0.040
 Generations+0.039
Neg logit contributions
iaz-0.044
••••-0.037
abase-0.035
paralle-0.033
••-0.032
Tokenaii
Token ID42648
Feature activation-7.34e-02

Pos logit contributions
jri+0.272
urgical+0.252
 disg+0.224
warts+0.220
 Ended+0.216
Neg logit contributions
iaz-0.247
••••-0.211
abase-0.200
paralle-0.188
••-0.180
Token Congress
Token ID3162
Feature activation+2.57e-01

Pos logit contributions
jri+0.060
urgical+0.056
 disg+0.050
warts+0.048
 Ended+0.048
Neg logit contributions
iaz-0.057
••••-0.049
abase-0.047
paralle-0.044
••-0.042
Tokenwoman
Token ID8580
Feature activation-3.04e-02

Pos logit contributions
iaz+0.217
••••+0.189
abase+0.180
paralle+0.170
••+0.164
Neg logit contributions
jri-0.194
urgical-0.178
 disg-0.156
warts-0.153
 Ended-0.150
Token Tuls
Token ID33219
Feature activation+2.27e-01

Pos logit contributions
jri+0.018
urgical+0.016
 disg+0.013
warts+0.013
 Ended+0.013
Neg logit contributions
iaz-0.031
••••-0.027
abase-0.026
paralle-0.025
••-0.024
Tokeni
Token ID72
Feature activation-1.57e-01

Pos logit contributions
iaz+0.163
••••+0.139
abase+0.131
paralle+0.123
••+0.116
Neg logit contributions
jri-0.201
urgical-0.186
 disg-0.166
warts-0.163
 Ended-0.161
Token G
Token ID402
Feature activation-1.89e+00

Pos logit contributions
jri+0.150
urgical+0.141
 disg+0.127
warts+0.125
 Generations+0.123
Neg logit contributions
iaz-0.101
••••-0.084
abase-0.078
paralle-0.073
••-0.069
Tokenabb
Token ID6485
Feature activation+7.57e-01

Pos logit contributions
jri+1.320
urgical+1.177
 disg+1.029
warts+0.998
 Ended+0.974
Neg logit contributions
iaz-1.740
••••-1.506
abase-1.458
paralle-1.370
••-1.329
Tokenard
Token ID446
Feature activation+1.62e-01

Pos logit contributions
iaz+0.520
••••+0.438
abase+0.412
paralle+0.383
••+0.363
Neg logit contributions
jri-0.691
urgical-0.645
 disg-0.578
warts-0.567
 Ended-0.563
Token is
Token ID318
Feature activation-1.41e-01

Pos logit contributions
iaz+0.126
••••+0.108
abase+0.102
paralle+0.096
••+0.092
Neg logit contributions
jri-0.133
urgical-0.123
 disg-0.109
warts-0.107
 Ended-0.105
Token among
Token ID1871
Feature activation-1.27e-01

Pos logit contributions
jri+0.109
urgical+0.100
 disg+0.088
warts+0.086
 Ended+0.085
Neg logit contributions
iaz-0.118
••••-0.102
abase-0.097
paralle-0.091
••-0.088
Token key
Token ID1994
Feature activation-3.68e-02

Pos logit contributions
jri+0.105
urgical+0.097
 disg+0.086
warts+0.084
 Generations+0.083
Neg logit contributions
iaz-0.099
••••-0.085
abase-0.080
paralle-0.076
••-0.073
Token Policy
Token ID7820
Feature activation+2.00e-01

Pos logit contributions
iaz+0.399
••••+0.350
abase+0.338
paralle+0.319
••+0.308
Neg logit contributions
jri-0.297
urgical-0.271
 disg-0.231
warts-0.225
 Generations-0.223
Token or
Token ID393
Feature activation-1.41e-01

Pos logit contributions
iaz+0.150
••••+0.128
abase+0.122
paralle+0.114
••+0.108
Neg logit contributions
jri-0.170
urgical-0.157
 disg-0.140
warts-0.138
 Generations-0.136
Token the
Token ID262
Feature activation+4.52e-01

Pos logit contributions
jri+0.118
urgical+0.109
 disg+0.097
warts+0.095
 Generations+0.094
Neg logit contributions
iaz-0.108
••••-0.093
abase-0.088
paralle-0.082
••-0.079
Token London
Token ID3576
Feature activation+6.60e-01

Pos logit contributions
iaz+0.330
••••+0.280
abase+0.267
paralle+0.246
phant+0.237
Neg logit contributions
jri-0.392
urgical-0.365
 disg-0.327
warts-0.321
 Ended-0.317
Token School
Token ID3961
Feature activation-6.57e-01

Pos logit contributions
iaz+0.572
••••+0.499
abase+0.483
paralle+0.452
phant+0.437
Neg logit contributions
jri-0.481
urgical-0.442
 disg-0.387
warts-0.375
 Generations-0.370
Token of
Token ID286
Feature activation-1.85e+00

Pos logit contributions
jri+0.567
urgical+0.525
 disg+0.470
warts+0.460
 Generations+0.452
Neg logit contributions
iaz-0.487
••••-0.413
abase-0.389
paralle-0.364
••-0.350
Token Economics
Token ID18963
Feature activation-1.76e-01

Pos logit contributions
jri+1.324
urgical+1.184
 disg+1.041
warts+1.006
 Generations+0.980
Neg logit contributions
iaz-1.648
••••-1.434
abase-1.361
paralle-1.307
phant-1.264
Token.
Token ID13
Feature activation-5.16e-01

Pos logit contributions
jri+0.153
urgical+0.142
 disg+0.127
warts+0.124
 Ended+0.122
Neg logit contributions
iaz-0.129
••••-0.110
abase-0.104
paralle-0.097
••-0.093
Token
Token ID198
Feature activation-1.84e-02

Pos logit contributions
jri+0.446
urgical+0.413
 disg+0.370
warts+0.362
 Ended+0.356
Neg logit contributions
iaz-0.381
••••-0.324
abase-0.305
paralle-0.286
••-0.274
Token
Token ID198
Feature activation-5.88e-01

Pos logit contributions
jri+0.016
urgical+0.014
 disg+0.013
warts+0.013
 Ended+0.012
Neg logit contributions
iaz-0.014
••••-0.012
abase-0.011
paralle-0.010
••-0.010
Token________________________________
Token ID10221
Feature activation+1.70e-01

Pos logit contributions
jri+0.618
urgical+0.580
 disg+0.531
warts+0.522
 Generations+0.515
Neg logit contributions
iaz-0.325
••••-0.260
abase-0.239
paralle-0.217
••-0.204
Token government
Token ID1230
Feature activation+5.03e-02

Pos logit contributions
iaz+0.093
••••+0.080
abase+0.076
paralle+0.072
••+0.069
Neg logit contributions
jri-0.090
urgical-0.083
 disg-0.073
warts-0.071
 Generations-0.070
Token faces
Token ID6698
Feature activation+1.21e-01

Pos logit contributions
iaz+0.039
••••+0.033
abase+0.032
paralle+0.030
••+0.029
Neg logit contributions
jri-0.042
urgical-0.038
 disg-0.034
warts-0.033
 Ended-0.033
Token aboriginal
Token ID48328
Feature activation-2.35e-01

Pos logit contributions
iaz+0.098
••••+0.084
abase+0.080
paralle+0.075
••+0.072
Neg logit contributions
jri-0.096
urgical-0.089
 disg-0.079
warts-0.077
 Generations-0.075
Token opposition
Token ID5471
Feature activation-1.84e-01

Pos logit contributions
jri+0.185
urgical+0.170
 disg+0.151
warts+0.147
 Generations+0.145
Neg logit contributions
iaz-0.192
••••-0.166
abase-0.157
paralle-0.149
••-0.143
Token to
Token ID284
Feature activation+2.02e-01

Pos logit contributions
jri+0.153
urgical+0.142
 disg+0.126
warts+0.123
 Generations+0.121
Neg logit contributions
iaz-0.141
••••-0.121
abase-0.115
paralle-0.108
••-0.103
Token Kinder
Token ID46832
Feature activation-1.84e+00

Pos logit contributions
iaz+0.163
••••+0.140
abase+0.133
paralle+0.126
••+0.121
Neg logit contributions
jri-0.161
urgical-0.148
 disg-0.131
warts-0.128
 Generations-0.126
Token Morgan
Token ID10805
Feature activation-4.33e-01

Pos logit contributions
jri+1.251
urgical+1.124
 disg+0.975
warts+0.952
 Ended+0.929
Neg logit contributions
iaz-1.716
••••-1.497
abase-1.444
paralle-1.357
••-1.321
Token
Token ID447
Feature activation+3.22e-02

Pos logit contributions
jri+0.362
urgical+0.335
 disg+0.298
warts+0.292
 Ended+0.287
Neg logit contributions
iaz-0.332
••••-0.284
abase-0.269
paralle-0.253
••-0.242
Token
Token ID247
Feature activation-5.18e-02

Pos logit contributions
iaz+0.024
••••+0.020
abase+0.019
paralle+0.018
••+0.017
Neg logit contributions
jri-0.028
urgical-0.026
 disg-0.023
warts-0.023
 Generations-0.022
Tokens
Token ID82
Feature activation-6.92e-02

Pos logit contributions
jri+0.048
urgical+0.045
 disg+0.040
warts+0.040
 Generations+0.039
Neg logit contributions
iaz-0.035
••••-0.029
abase-0.027
paralle-0.025
••-0.024
Token Trans
Token ID3602
Feature activation-1.48e-01

Pos logit contributions
jri+0.056
urgical+0.052
 disg+0.046
warts+0.045
 Generations+0.044
Neg logit contributions
iaz-0.055
••••-0.047
abase-0.045
paralle-0.042
••-0.040
Token<|endoftext|>
Token ID50256
Feature activation-7.74e-02

Pos logit contributions
iaz+0.007
••••+0.006
abase+0.005
paralle+0.005
••+0.005
Neg logit contributions
jri-0.007
urgical-0.007
 disg-0.006
warts-0.006
 Generations-0.006
TokenFacebook
Token ID12025
Feature activation+6.72e-02

Pos logit contributions
jri+0.066
urgical+0.061
 disg+0.054
warts+0.054
 Ended+0.053
Neg logit contributions
iaz-0.058
••••-0.049
abase-0.047
paralle-0.044
••-0.042
Token Twitter
Token ID3009
Feature activation-5.93e-01

Pos logit contributions
iaz+0.052
••••+0.045
abase+0.043
paralle+0.040
••+0.038
Neg logit contributions
jri-0.055
urgical-0.051
 disg-0.045
warts-0.044
 Ended-0.044
Token Reddit
Token ID10750
Feature activation-1.27e+00

Pos logit contributions
jri+0.509
urgical+0.471
 disg+0.420
warts+0.412
 Generations+0.405
Neg logit contributions
iaz-0.442
••••-0.376
abase-0.355
paralle-0.333
••-0.319
Token Google
Token ID3012
Feature activation-6.16e-01

Pos logit contributions
jri+1.111
urgical+1.027
 disg+0.917
warts+0.904
 Generations+0.881
Neg logit contributions
iaz-0.924
••••-0.792
abase-0.744
paralle-0.697
••-0.671
Token+
Token ID10
Feature activation-1.84e+00

Pos logit contributions
jri+0.530
urgical+0.491
 disg+0.438
warts+0.429
 Generations+0.422
Neg logit contributions
iaz-0.457
••••-0.389
abase-0.368
paralle-0.344
••-0.330
Token
Token ID198
Feature activation-6.84e-02

Pos logit contributions
jri+1.478
urgical+1.354
 disg+1.181
warts+1.160
 Generations+1.133
Neg logit contributions
iaz-1.490
••••-1.289
abase-1.230
paralle-1.154
••-1.125
Token
Token ID198
Feature activation-2.78e-02

Pos logit contributions
jri+0.058
urgical+0.054
 disg+0.048
warts+0.047
 Ended+0.047
Neg logit contributions
iaz-0.051
••••-0.044
abase-0.041
paralle-0.039
••-0.037
TokenGo
Token ID5247
Feature activation-1.94e-01

Pos logit contributions
jri+0.024
urgical+0.022
 disg+0.019
warts+0.019
 Ended+0.019
Neg logit contributions
iaz-0.021
••••-0.018
abase-0.017
paralle-0.016
••-0.015
Token to
Token ID284
Feature activation-1.64e-02

Pos logit contributions
jri+0.159
urgical+0.146
 disg+0.130
warts+0.127
 Generations+0.125
Neg logit contributions
iaz-0.153
••••-0.131
abase-0.124
paralle-0.117
••-0.112
Token the
Token ID262
Feature activation-3.28e-02

Pos logit contributions
jri+0.013
urgical+0.012
 disg+0.011
warts+0.011
 Generations+0.011
Neg logit contributions
iaz-0.013
••••-0.011
abase-0.011
paralle-0.010
••-0.010
Token<|endoftext|>
Token ID50256
Feature activation-9.03e-02

Pos logit contributions
iaz+0.023
••••+0.019
abase+0.018
paralle+0.017
••+0.017
Neg logit contributions
jri-0.025
urgical-0.023
 disg-0.020
warts-0.020
 Generations-0.020
TokenB
Token ID33
Feature activation+1.10e-01

Pos logit contributions
jri+0.077
urgical+0.071
 disg+0.064
warts+0.063
 Ended+0.061
Neg logit contributions
iaz-0.068
••••-0.057
abase-0.055
paralle-0.051
••-0.049
Tokenud
Token ID463
Feature activation-1.59e-01

Pos logit contributions
iaz+0.087
••••+0.074
abase+0.071
paralle+0.067
••+0.064
Neg logit contributions
jri-0.090
urgical-0.083
 disg-0.073
warts-0.072
 Ended-0.071
Tokenwe
Token ID732
Feature activation+7.55e-01

Pos logit contributions
jri+0.141
urgical+0.131
 disg+0.118
warts+0.115
 Ended+0.114
Neg logit contributions
iaz-0.113
••••-0.096
abase-0.090
paralle-0.084
••-0.080
Tokeniser
Token ID5847
Feature activation-9.54e-02

Pos logit contributions
iaz+0.635
••••+0.556
abase+0.522
paralle+0.504
phant+0.480
Neg logit contributions
jri-0.576
urgical-0.524
 disg-0.460
warts-0.448
 Ended-0.442
Token Cly
Token ID29133
Feature activation-1.80e+00

Pos logit contributions
jri+0.079
urgical+0.073
 disg+0.065
warts+0.063
 Ended+0.062
Neg logit contributions
iaz-0.074
••••-0.064
abase-0.060
paralle-0.057
••-0.054
Tokendes
Token ID8906
Feature activation+4.22e-01

Pos logit contributions
jri+0.800
urgical+0.683
 disg+0.550
warts+0.499
 Generations+0.493
Neg logit contributions
iaz-2.086
••••-1.881
abase-1.825
paralle-1.754
phant-1.710
Tokend
Token ID67
Feature activation+5.65e-01

Pos logit contributions
iaz+0.395
••••+0.349
abase+0.335
paralle+0.320
••+0.309
Neg logit contributions
jri-0.279
urgical-0.256
 disg-0.218
warts-0.213
 Ended-0.210
Tokenales
Token ID2040
Feature activation-1.96e-01

Pos logit contributions
iaz+0.570
••••+0.508
abase+0.488
paralle+0.466
••+0.456
Neg logit contributions
jri-0.331
urgical-0.300
 disg-0.251
warts-0.242
 Ended-0.237
Token make
Token ID787
Feature activation+1.02e-01

Pos logit contributions
jri+0.162
urgical+0.150
 disg+0.133
warts+0.130
 Ended+0.128
Neg logit contributions
iaz-0.152
••••-0.130
abase-0.123
paralle-0.116
••-0.111
Token deliveries
Token ID31167
Feature activation+2.07e-02

Pos logit contributions
iaz+0.082
••••+0.070
abase+0.067
paralle+0.063
••+0.061
Neg logit contributions
jri-0.082
urgical-0.076
 disg-0.067
warts-0.066
 Ended-0.064
Token president
Token ID1893
Feature activation+9.75e-02

Pos logit contributions
jri+0.116
urgical+0.107
 disg+0.094
warts+0.092
 Generations+0.090
Neg logit contributions
iaz-0.125
••••-0.108
abase-0.103
paralle-0.098
••-0.094
Token told
Token ID1297
Feature activation-2.39e-01

Pos logit contributions
iaz+0.077
••••+0.067
abase+0.063
paralle+0.059
••+0.057
Neg logit contributions
jri-0.079
urgical-0.073
 disg-0.064
warts-0.063
 Ended-0.062
Token Fox
Token ID5426
Feature activation-3.89e-02

Pos logit contributions
jri+0.193
urgical+0.178
 disg+0.157
warts+0.154
 Generations+0.151
Neg logit contributions
iaz-0.191
••••-0.164
abase-0.156
paralle-0.147
••-0.141
Token News
Token ID3000
Feature activation-1.70e-02

Pos logit contributions
jri+0.030
urgical+0.028
 disg+0.025
warts+0.024
 Generations+0.024
Neg logit contributions
iaz-0.032
••••-0.028
abase-0.026
paralle-0.025
••-0.024
Token host
Token ID2583
Feature activation-8.94e-02

Pos logit contributions
jri+0.014
urgical+0.013
 disg+0.012
warts+0.012
 Generations+0.011
Neg logit contributions
iaz-0.013
••••-0.011
abase-0.010
paralle-0.010
••-0.009
Token Laura
Token ID16753
Feature activation-1.79e+00

Pos logit contributions
jri+0.081
urgical+0.075
 disg+0.068
warts+0.067
 Generations+0.066
Neg logit contributions
iaz-0.062
••••-0.052
abase-0.049
paralle-0.046
••-0.044
Token Ing
Token ID17589
Feature activation-1.06e+00

Pos logit contributions
jri+1.606
urgical+1.487
 disg+1.332
warts+1.299
 Ended+1.280
Neg logit contributions
iaz-1.284
••••-1.075
abase-1.003
paralle-0.947
••-0.898
Tokenraham
Token ID13220
Feature activation+2.50e-02

Pos logit contributions
jri+0.733
urgical+0.660
 disg+0.577
warts+0.554
 Generations+0.549
Neg logit contributions
iaz-0.967
••••-0.844
abase-0.810
paralle-0.770
phant-0.744
Token that
Token ID326
Feature activation-1.81e-01

Pos logit contributions
iaz+0.020
••••+0.017
abase+0.016
paralle+0.015
••+0.015
Neg logit contributions
jri-0.020
urgical-0.019
 disg-0.017
warts-0.016
 Generations-0.016
Token initial
Token ID4238
Feature activation-4.36e-01

Pos logit contributions
jri+0.151
urgical+0.140
 disg+0.124
warts+0.122
 Generations+0.120
Neg logit contributions
iaz-0.139
••••-0.118
abase-0.112
paralle-0.105
••-0.101
Token funding
Token ID4918
Feature activation-3.21e-01

Pos logit contributions
jri+0.324
urgical+0.297
 disg+0.260
warts+0.254
 Ended+0.249
Neg logit contributions
iaz-0.375
••••-0.326
abase-0.311
paralle-0.295
••-0.284
Token adventure
Token ID8855
Feature activation+5.08e-02

Pos logit contributions
jri+0.441
urgical+0.404
 disg+0.355
warts+0.347
 Ended+0.341
Neg logit contributions
iaz-0.484
••••-0.420
abase-0.399
paralle-0.378
••-0.364
Token set
Token ID900
Feature activation+7.70e-02

Pos logit contributions
iaz+0.041
••••+0.035
abase+0.034
paralle+0.032
••+0.031
Neg logit contributions
jri-0.040
urgical-0.037
 disg-0.033
warts-0.032
 Generations-0.032
Token during
Token ID1141
Feature activation-5.65e-02

Pos logit contributions
iaz+0.062
••••+0.053
abase+0.050
paralle+0.047
••+0.045
Neg logit contributions
jri-0.062
urgical-0.057
 disg-0.051
warts-0.050
 Generations-0.049
Token the
Token ID262
Feature activation-2.84e-01

Pos logit contributions
jri+0.047
urgical+0.043
 disg+0.039
warts+0.038
 Generations+0.037
Neg logit contributions
iaz-0.044
••••-0.037
abase-0.035
paralle-0.033
••-0.032
Token Nap
Token ID14332
Feature activation-1.27e+00

Pos logit contributions
jri+0.233
urgical+0.216
 disg+0.192
warts+0.187
 Generations+0.184
Neg logit contributions
iaz-0.222
••••-0.190
abase-0.180
paralle-0.170
••-0.163
Tokenole
Token ID2305
Feature activation-1.79e+00

Pos logit contributions
jri+0.996
urgical+0.909
 disg+0.808
warts+0.787
 Generations+0.771
Neg logit contributions
iaz-1.047
••••-0.903
abase-0.862
paralle-0.814
phant-0.782
Tokenonic
Token ID9229
Feature activation-5.39e-01

Pos logit contributions
jri+1.067
urgical+0.936
 disg+0.805
warts+0.775
 Generations+0.749
Neg logit contributions
iaz-1.804
••••-1.606
abase-1.548
paralle-1.472
phant-1.437
Token Wars
Token ID6176
Feature activation-2.54e-01

Pos logit contributions
jri+0.408
urgical+0.374
 disg+0.328
warts+0.321
 Ended+0.314
Neg logit contributions
iaz-0.456
••••-0.396
abase-0.377
paralle-0.357
••-0.344
Token.
Token ID13
Feature activation-4.27e-02

Pos logit contributions
jri+0.219
urgical+0.203
 disg+0.182
warts+0.178
 Ended+0.175
Neg logit contributions
iaz-0.188
••••-0.160
abase-0.151
paralle-0.141
••-0.135
Token
Token ID198
Feature activation+1.35e-01

Pos logit contributions
jri+0.036
urgical+0.033
 disg+0.029
warts+0.029
 Generations+0.029
Neg logit contributions
iaz-0.033
••••-0.028
abase-0.026
paralle-0.025
••-0.024
Token
Token ID198
Feature activation-6.34e-03

Pos logit contributions
iaz+0.102
••••+0.087
abase+0.082
paralle+0.077
••+0.074
Neg logit contributions
jri-0.114
urgical-0.106
 disg-0.094
warts-0.093
 Ended-0.091
TokenNote
Token ID6425
Feature activation-1.69e+00

Pos logit contributions
jri+0.657
urgical+0.616
 disg+0.562
warts+0.551
 Generations+0.544
Neg logit contributions
iaz-0.402
••••-0.329
abase-0.306
paralle-0.280
••-0.266
Token:
Token ID25
Feature activation-3.27e-01

Pos logit contributions
jri+1.549
urgical+1.446
 disg+1.306
warts+1.281
 Generations+1.253
Neg logit contributions
iaz-1.173
••••-0.976
abase-0.907
paralle-0.848
••-0.823
Token This
Token ID770
Feature activation-2.10e-01

Pos logit contributions
jri+0.274
urgical+0.253
 disg+0.226
warts+0.221
 Ended+0.217
Neg logit contributions
iaz-0.250
••••-0.213
abase-0.202
paralle-0.190
••-0.182
Token article
Token ID2708
Feature activation-3.15e-01

Pos logit contributions
jri+0.178
urgical+0.165
 disg+0.147
warts+0.144
 Ended+0.142
Neg logit contributions
iaz-0.158
••••-0.135
abase-0.128
paralle-0.120
••-0.115
Token gives
Token ID3607
Feature activation-4.17e-01

Pos logit contributions
jri+0.266
urgical+0.246
 disg+0.219
warts+0.215
 Ended+0.211
Neg logit contributions
iaz-0.240
••••-0.205
abase-0.194
paralle-0.182
••-0.174
Token the
Token ID262
Feature activation-1.75e+00

Pos logit contributions
jri+0.333
urgical+0.307
 disg+0.272
warts+0.266
 Ended+0.261
Neg logit contributions
iaz-0.335
••••-0.289
abase-0.274
paralle-0.259
••-0.249
Token views
Token ID5009
Feature activation-4.34e-01

Pos logit contributions
jri+1.462
urgical+1.337
 disg+1.193
warts+1.169
 Ended+1.138
Neg logit contributions
iaz-1.355
••••-1.172
abase-1.094
paralle-1.034
••-1.010
Token of
Token ID286
Feature activation-8.33e-01

Pos logit contributions
jri+0.373
urgical+0.345
 disg+0.309
warts+0.302
 Ended+0.297
Neg logit contributions
iaz-0.324
••••-0.275
abase-0.260
paralle-0.244
••-0.234
Token the
Token ID262
Feature activation-5.74e-01

Pos logit contributions
jri+0.715
urgical+0.663
 disg+0.591
warts+0.580
 Ended+0.570
Neg logit contributions
iaz-0.621
••••-0.528
abase-0.499
paralle-0.467
••-0.449
Token author
Token ID1772
Feature activation+2.38e-01

Pos logit contributions
jri+0.411
urgical+0.375
 disg+0.326
warts+0.317
 Ended+0.311
Neg logit contributions
iaz-0.510
••••-0.446
abase-0.426
paralle-0.405
••-0.391
Token,
Token ID11
Feature activation-4.01e-01

Pos logit contributions
iaz+0.179
••••+0.152
abase+0.144
paralle+0.134
phant+0.129
Neg logit contributions
jri-0.203
urgical-0.188
 disg-0.168
warts-0.165
 Ended-0.162
Token before
Token ID878
Feature activation-5.73e-01

Pos logit contributions
jri+0.671
urgical+0.620
 disg+0.549
warts+0.537
 Generations+0.529
Neg logit contributions
iaz-0.661
••••-0.567
abase-0.538
paralle-0.507
••-0.488
Token commenting
Token ID26387
Feature activation-1.15e-01

Pos logit contributions
jri+0.422
urgical+0.386
 disg+0.338
warts+0.330
 Ended+0.323
Neg logit contributions
iaz-0.497
••••-0.433
abase-0.412
paralle-0.392
••-0.378
Token.
Token ID13
Feature activation-8.97e-01

Pos logit contributions
jri+0.100
urgical+0.093
 disg+0.083
warts+0.082
 Ended+0.081
Neg logit contributions
iaz-0.084
••••-0.071
abase-0.067
paralle-0.063
••-0.060
Token
Token ID198
Feature activation+1.41e-02

Pos logit contributions
jri+0.785
urgical+0.727
 disg+0.655
warts+0.638
 Generations+0.626
Neg logit contributions
iaz-0.653
••••-0.554
abase-0.524
paralle-0.489
••-0.469
Token
Token ID198
Feature activation-6.61e-01

Pos logit contributions
iaz+0.011
••••+0.009
abase+0.009
paralle+0.008
••+0.008
Neg logit contributions
jri-0.012
urgical-0.011
 disg-0.010
warts-0.010
 Ended-0.010
TokenNote
Token ID6425
Feature activation-1.69e+00

Pos logit contributions
jri+0.657
urgical+0.616
 disg+0.562
warts+0.551
 Generations+0.544
Neg logit contributions
iaz-0.402
••••-0.329
abase-0.306
paralle-0.280
••-0.266
Token:
Token ID25
Feature activation-3.27e-01

Pos logit contributions
jri+1.549
urgical+1.446
 disg+1.306
warts+1.281
 Generations+1.253
Neg logit contributions
iaz-1.173
••••-0.976
abase-0.907
paralle-0.848
••-0.823
Token This
Token ID770
Feature activation-2.10e-01

Pos logit contributions
jri+0.274
urgical+0.253
 disg+0.226
warts+0.221
 Ended+0.217
Neg logit contributions
iaz-0.250
••••-0.213
abase-0.202
paralle-0.190
••-0.182
Token article
Token ID2708
Feature activation-3.15e-01

Pos logit contributions
jri+0.178
urgical+0.165
 disg+0.147
warts+0.144
 Ended+0.142
Neg logit contributions
iaz-0.158
••••-0.135
abase-0.128
paralle-0.120
••-0.115
Token gives
Token ID3607
Feature activation-4.17e-01

Pos logit contributions
jri+0.266
urgical+0.246
 disg+0.219
warts+0.215
 Ended+0.211
Neg logit contributions
iaz-0.240
••••-0.205
abase-0.194
paralle-0.182
••-0.174
Token the
Token ID262
Feature activation-1.75e+00

Pos logit contributions
jri+0.333
urgical+0.307
 disg+0.272
warts+0.266
 Ended+0.261
Neg logit contributions
iaz-0.335
••••-0.289
abase-0.274
paralle-0.259
••-0.249
Token,
Token ID11
Feature activation-1.16e-01

Pos logit contributions
iaz+0.104
••••+0.089
abase+0.084
paralle+0.079
••+0.075
Neg logit contributions
jri-0.117
urgical-0.109
 disg-0.097
warts-0.095
 Ended-0.093
Token New
Token ID968
Feature activation+5.09e-02

Pos logit contributions
jri+0.095
urgical+0.088
 disg+0.078
warts+0.076
 Generations+0.075
Neg logit contributions
iaz-0.091
••••-0.078
abase-0.074
paralle-0.070
••-0.067
Token Jersey
Token ID8221
Feature activation+1.17e-01

Pos logit contributions
iaz+0.039
••••+0.034
abase+0.032
paralle+0.030
••+0.029
Neg logit contributions
jri-0.042
urgical-0.039
 disg-0.035
warts-0.034
 Generations-0.034
Token Republican
Token ID3415
Feature activation-3.59e-02

Pos logit contributions
iaz+0.088
••••+0.075
abase+0.071
paralle+0.067
••+0.064
Neg logit contributions
jri-0.099
urgical-0.092
 disg-0.082
warts-0.080
 Generations-0.079
Token Governor
Token ID10807
Feature activation-4.31e-01

Pos logit contributions
jri+0.030
urgical+0.027
 disg+0.024
warts+0.024
 Generations+0.023
Neg logit contributions
iaz-0.028
••••-0.024
abase-0.023
paralle-0.021
••-0.021
Token Chris
Token ID5180
Feature activation-1.68e+00

Pos logit contributions
jri+0.362
urgical+0.335
 disg+0.298
warts+0.292
 Ended+0.287
Neg logit contributions
iaz-0.329
••••-0.281
abase-0.266
paralle-0.250
••-0.239
Token Christie
Token ID16929
Feature activation+1.23e-01

Pos logit contributions
jri+0.787
urgical+0.666
 disg+0.535
warts+0.509
 Ended+0.469
Neg logit contributions
iaz-1.927
••••-1.748
abase-1.666
paralle-1.609
••-1.583
Token carried
Token ID5281
Feature activation-4.66e-01

Pos logit contributions
iaz+0.101
••••+0.087
abase+0.083
paralle+0.078
••+0.075
Neg logit contributions
jri-0.096
urgical-0.088
 disg-0.078
warts-0.076
 Ended-0.075
Token 51
Token ID6885
Feature activation-3.35e-02

Pos logit contributions
jri+0.396
urgical+0.366
 disg+0.327
warts+0.320
 Ended+0.315
Neg logit contributions
iaz-0.351
••••-0.299
abase-0.283
paralle-0.265
••-0.254
Token%
Token ID4
Feature activation-3.97e-01

Pos logit contributions
jri+0.026
urgical+0.024
 disg+0.021
warts+0.021
 Generations+0.020
Neg logit contributions
iaz-0.028
••••-0.024
abase-0.023
paralle-0.022
••-0.021
Token of
Token ID286
Feature activation-4.72e-01

Pos logit contributions
jri+0.340
urgical+0.315
 disg+0.281
warts+0.275
 Ended+0.271
Neg logit contributions
iaz-0.297
••••-0.253
abase-0.239
paralle-0.224
••-0.215
Token But
Token ID887
Feature activation+8.55e-01

Pos logit contributions
iaz+0.378
••••+0.329
abase+0.317
paralle+0.297
••+0.285
Neg logit contributions
jri-0.317
urgical-0.292
 disg-0.255
warts-0.248
 Generations-0.246
Token please
Token ID3387
Feature activation+1.28e+00

Pos logit contributions
iaz+0.611
••••+0.514
abase+0.491
paralle+0.459
phant+0.432
Neg logit contributions
jri-0.759
urgical-0.703
 disg-0.636
warts-0.617
 Generations-0.612
Token note
Token ID3465
Feature activation-1.29e+00

Pos logit contributions
iaz+0.958
••••+0.814
abase+0.793
paralle+0.735
phant+0.691
Neg logit contributions
jri-1.077
urgical-1.015
 disg-0.918
warts-0.892
 Generations-0.878
Token that
Token ID326
Feature activation-1.66e+00

Pos logit contributions
jri+1.066
urgical+0.984
 disg+0.878
warts+0.859
 Ended+0.847
Neg logit contributions
iaz-1.008
••••-0.869
abase-0.812
paralle-0.771
••-0.748
Token my
Token ID616
Feature activation-1.49e+00

Pos logit contributions
jri+1.320
urgical+1.210
 disg+1.063
warts+1.054
 Ended+1.030
Neg logit contributions
iaz-1.350
••••-1.176
abase-1.096
paralle-1.056
••-1.020
Token tweets
Token ID12665
Feature activation-1.67e+00

Pos logit contributions
jri+0.843
urgical+0.747
 disg+0.622
warts+0.606
 Ended+0.591
Neg logit contributions
iaz-1.547
••••-1.387
abase-1.329
paralle-1.277
••-1.245
Token are
Token ID389
Feature activation-1.09e+00

Pos logit contributions
jri+1.438
urgical+1.314
 disg+1.186
warts+1.170
 Ended+1.141
Neg logit contributions
iaz-1.251
••••-1.070
abase-1.000
paralle-0.945
••-0.911
Token sometimes
Token ID3360
Feature activation-3.43e-01

Pos logit contributions
jri+0.815
urgical+0.741
 disg+0.649
warts+0.637
 Ended+0.624
Neg logit contributions
iaz-0.933
••••-0.816
abase-0.766
paralle-0.731
••-0.711
Token deliberately
Token ID14593
Feature activation-2.06e+00

Pos logit contributions
jri+0.263
urgical+0.242
 disg+0.212
warts+0.208
 Ended+0.204
Neg logit contributions
iaz-0.287
••••-0.249
abase-0.237
paralle-0.224
••-0.216
Token provocative
Token ID28695
Feature activation+5.92e-01

Pos logit contributions
jri+1.545
urgical+1.385
 disg+1.230
warts+1.205
 Ended+1.170
Neg logit contributions
iaz-1.780
••••-1.550
abase-1.455
paralle-1.408
phant-1.361
Token,
Token ID11
Feature activation-2.12e+00

Pos logit contributions
iaz+0.452
••••+0.385
abase+0.371
paralle+0.344
phant+0.325
Neg logit contributions
jri-0.496
urgical-0.462
 disg-0.409
warts-0.400
 Ended-0.395
Token here
Token ID994
Feature activation+1.58e-01

Pos logit contributions
iaz+0.093
••••+0.079
abase+0.075
paralle+0.070
••+0.067
Neg logit contributions
jri-0.099
urgical-0.092
 disg-0.082
warts-0.080
 Generations-0.079
Token.
Token ID13
Feature activation+4.35e-01

Pos logit contributions
iaz+0.117
••••+0.099
abase+0.094
paralle+0.088
••+0.083
Neg logit contributions
jri-0.137
urgical-0.127
 disg-0.114
warts-0.111
 Ended-0.110
Token But
Token ID887
Feature activation+8.55e-01

Pos logit contributions
iaz+0.378
••••+0.329
abase+0.317
paralle+0.297
••+0.285
Neg logit contributions
jri-0.317
urgical-0.292
 disg-0.255
warts-0.248
 Generations-0.246
Token please
Token ID3387
Feature activation+1.28e+00

Pos logit contributions
iaz+0.611
••••+0.514
abase+0.491
paralle+0.459
phant+0.432
Neg logit contributions
jri-0.759
urgical-0.703
 disg-0.636
warts-0.617
 Generations-0.612
Token note
Token ID3465
Feature activation-1.29e+00

Pos logit contributions
iaz+0.958
••••+0.814
abase+0.793
paralle+0.735
phant+0.691
Neg logit contributions
jri-1.077
urgical-1.015
 disg-0.918
warts-0.892
 Generations-0.878
Token that
Token ID326
Feature activation-1.66e+00

Pos logit contributions
jri+1.066
urgical+0.984
 disg+0.878
warts+0.859
 Ended+0.847
Neg logit contributions
iaz-1.008
••••-0.869
abase-0.812
paralle-0.771
••-0.748
Token my
Token ID616
Feature activation-1.49e+00

Pos logit contributions
jri+1.320
urgical+1.210
 disg+1.063
warts+1.054
 Ended+1.030
Neg logit contributions
iaz-1.350
••••-1.176
abase-1.096
paralle-1.056
••-1.020
Token tweets
Token ID12665
Feature activation-1.67e+00

Pos logit contributions
jri+0.843
urgical+0.747
 disg+0.622
warts+0.606
 Ended+0.591
Neg logit contributions
iaz-1.547
••••-1.387
abase-1.329
paralle-1.277
••-1.245
Token are
Token ID389
Feature activation-1.09e+00

Pos logit contributions
jri+1.438
urgical+1.314
 disg+1.186
warts+1.170
 Ended+1.141
Neg logit contributions
iaz-1.251
••••-1.070
abase-1.000
paralle-0.945
••-0.911
Token sometimes
Token ID3360
Feature activation-3.43e-01

Pos logit contributions
jri+0.815
urgical+0.741
 disg+0.649
warts+0.637
 Ended+0.624
Neg logit contributions
iaz-0.933
••••-0.816
abase-0.766
paralle-0.731
••-0.711
Token deliberately
Token ID14593
Feature activation-2.06e+00

Pos logit contributions
jri+0.263
urgical+0.242
 disg+0.212
warts+0.208
 Ended+0.204
Neg logit contributions
iaz-0.287
••••-0.249
abase-0.237
paralle-0.224
••-0.216
Token can
Token ID460
Feature activation+4.59e-01

Pos logit contributions
iaz+0.599
••••+0.519
abase+0.499
paralle+0.473
••+0.455
Neg logit contributions
jri-0.524
urgical-0.480
 disg-0.421
warts-0.409
 Ended-0.407
Token check
Token ID2198
Feature activation-8.33e-01

Pos logit contributions
iaz+0.363
••••+0.313
abase+0.297
paralle+0.281
••+0.270
Neg logit contributions
jri-0.372
urgical-0.342
 disg-0.306
warts-0.296
 Ended-0.292
Token out
Token ID503
Feature activation-9.64e-02

Pos logit contributions
jri+0.675
urgical+0.622
 disg+0.551
warts+0.539
 Ended+0.528
Neg logit contributions
iaz-0.662
••••-0.569
abase-0.539
paralle-0.507
••-0.488
Token my
Token ID616
Feature activation+1.26e-01

Pos logit contributions
jri+0.072
urgical+0.066
 disg+0.058
warts+0.057
 Generations+0.056
Neg logit contributions
iaz-0.082
••••-0.072
abase-0.068
paralle-0.064
••-0.062
Token Twitter
Token ID3009
Feature activation-2.27e-01

Pos logit contributions
iaz+0.111
••••+0.097
abase+0.092
paralle+0.087
••+0.084
Neg logit contributions
jri-0.091
urgical-0.083
 disg-0.072
warts-0.071
 Generations-0.070
Token feed
Token ID3745
Feature activation-1.66e+00

Pos logit contributions
jri+0.120
urgical+0.106
 disg+0.087
warts+0.083
 Generations+0.081
Neg logit contributions
iaz-0.244
••••-0.219
abase-0.211
paralle-0.202
••-0.197
Token by
Token ID416
Feature activation-1.04e+00

Pos logit contributions
jri+1.505
urgical+1.404
 disg+1.257
warts+1.239
 Ended+1.211
Neg logit contributions
iaz-1.152
••••-0.974
abase-0.905
paralle-0.858
••-0.813
Token going
Token ID1016
Feature activation+1.20e-01

Pos logit contributions
jri+0.941
urgical+0.876
 disg+0.786
warts+0.770
 Ended+0.761
Neg logit contributions
iaz-0.731
••••-0.615
abase-0.577
paralle-0.542
••-0.515
Token here
Token ID994
Feature activation+1.58e-01

Pos logit contributions
iaz+0.093
••••+0.079
abase+0.075
paralle+0.070
••+0.067
Neg logit contributions
jri-0.099
urgical-0.092
 disg-0.082
warts-0.080
 Generations-0.079
Token.
Token ID13
Feature activation+4.35e-01

Pos logit contributions
iaz+0.117
••••+0.099
abase+0.094
paralle+0.088
••+0.083
Neg logit contributions
jri-0.137
urgical-0.127
 disg-0.114
warts-0.111
 Ended-0.110
Token But
Token ID887
Feature activation+8.55e-01

Pos logit contributions
iaz+0.378
••••+0.329
abase+0.317
paralle+0.297
••+0.285
Neg logit contributions
jri-0.317
urgical-0.292
 disg-0.255
warts-0.248
 Generations-0.246
Token
Token ID198
Feature activation-4.27e-02

Pos logit contributions
iaz+0.031
••••+0.026
abase+0.025
paralle+0.023
••+0.022
Neg logit contributions
jri-0.034
urgical-0.032
 disg-0.028
warts-0.028
 Ended-0.027
TokenWe
Token ID1135
Feature activation+1.07e-01

Pos logit contributions
jri+0.036
urgical+0.033
 disg+0.030
warts+0.029
 Ended+0.029
Neg logit contributions
iaz-0.032
••••-0.027
abase-0.026
paralle-0.024
••-0.023
Token rate
Token ID2494
Feature activation+1.32e-01

Pos logit contributions
iaz+0.089
••••+0.077
abase+0.073
paralle+0.069
••+0.067
Neg logit contributions
jri-0.082
urgical-0.076
 disg-0.067
warts-0.065
 Ended-0.064
Token this
Token ID428
Feature activation+6.40e-02

Pos logit contributions
iaz+0.102
••••+0.087
abase+0.083
paralle+0.078
••+0.074
Neg logit contributions
jri-0.110
urgical-0.101
 disg-0.090
warts-0.088
 Generations-0.087
Token Pants
Token ID41689
Feature activation-7.98e-02

Pos logit contributions
iaz+0.052
••••+0.045
abase+0.043
paralle+0.040
••+0.039
Neg logit contributions
jri-0.050
urgical-0.046
 disg-0.041
warts-0.040
 Ended-0.039
Token on
Token ID319
Feature activation+1.27e+00

Pos logit contributions
jri+0.067
urgical+0.062
 disg+0.056
warts+0.054
 Ended+0.054
Neg logit contributions
iaz-0.060
••••-0.052
abase-0.049
paralle-0.046
••-0.044
Token Fire
Token ID3764
Feature activation-3.93e-01

Pos logit contributions
iaz+1.003
••••+0.857
abase+0.817
paralle+0.771
phant+0.740
Neg logit contributions
jri-1.034
urgical-0.960
 disg-0.858
 Ended-0.830
warts-0.830
Token.
Token ID13
Feature activation-3.04e-01

Pos logit contributions
jri+0.337
urgical+0.312
 disg+0.279
warts+0.273
 Ended+0.269
Neg logit contributions
iaz-0.293
••••-0.249
abase-0.235
paralle-0.220
••-0.211
Token<|endoftext|>
Token ID50256
Feature activation-8.94e-02

Pos logit contributions
jri+0.268
urgical+0.249
 disg+0.223
warts+0.219
 Ended+0.216
Neg logit contributions
iaz-0.219
••••-0.185
abase-0.175
paralle-0.163
••-0.156
TokenI
Token ID40
Feature activation-1.79e-02

Pos logit contributions
jri+0.076
urgical+0.071
 disg+0.063
warts+0.062
 Ended+0.061
Neg logit contributions
iaz-0.067
••••-0.057
abase-0.054
paralle-0.051
••-0.048
Token
Token ID447
Feature activation-1.90e-01

Pos logit contributions
jri+0.014
urgical+0.013
 disg+0.012
warts+0.011
 Ended+0.011
Neg logit contributions
iaz-0.015
••••-0.013
abase-0.012
paralle-0.011
••-0.011
Token second
Token ID1218
Feature activation+3.36e-01

Pos logit contributions
iaz+0.044
••••+0.038
abase+0.036
paralle+0.034
••+0.032
Neg logit contributions
jri-0.047
urgical-0.043
 disg-0.039
warts-0.038
 Generations-0.037
Token edition
Token ID8313
Feature activation-1.81e-01

Pos logit contributions
iaz+0.265
••••+0.228
abase+0.217
paralle+0.204
••+0.195
Neg logit contributions
jri-0.271
urgical-0.252
 disg-0.223
warts-0.218
 Generations-0.217
Token of
Token ID286
Feature activation+9.15e-01

Pos logit contributions
jri+0.154
urgical+0.142
 disg+0.127
warts+0.125
 Generations+0.123
Neg logit contributions
iaz-0.136
••••-0.116
abase-0.110
paralle-0.103
••-0.098
Token my
Token ID616
Feature activation+4.22e-01

Pos logit contributions
iaz+0.811
••••+0.702
abase+0.678
paralle+0.635
••+0.612
Neg logit contributions
jri-0.651
urgical-0.605
 disg-0.531
warts-0.509
 Generations-0.509
Token 1993
Token ID9656
Feature activation+2.39e-01

Pos logit contributions
iaz+0.329
••••+0.282
abase+0.270
paralle+0.252
phant+0.240
Neg logit contributions
jri-0.347
urgical-0.321
 disg-0.286
warts-0.280
 Generations-0.277
Token book
Token ID1492
Feature activation+1.57e+00

Pos logit contributions
iaz+0.204
••••+0.176
abase+0.169
paralle+0.159
••+0.153
Neg logit contributions
jri-0.179
urgical-0.165
 disg-0.145
warts-0.142
 Ended-0.139
Token,
Token ID11
Feature activation+7.75e-01

Pos logit contributions
iaz+1.249
••••+1.056
abase+1.004
paralle+0.937
phant+0.895
Neg logit contributions
jri-1.273
urgical-1.186
 disg-1.053
warts-1.032
 Generations-1.028
Token Re
Token ID797
Feature activation+7.27e-01

Pos logit contributions
iaz+0.691
••••+0.595
abase+0.579
paralle+0.542
••+0.517
Neg logit contributions
jri-0.555
urgical-0.510
 disg-0.449
warts-0.435
 Generations-0.431
Tokenclaim
Token ID6604
Feature activation+7.20e-01

Pos logit contributions
iaz+0.514
••••+0.428
abase+0.409
paralle+0.372
••+0.361
Neg logit contributions
jri-0.649
urgical-0.611
 disg-0.550
warts-0.536
 Generations-0.533
Tokening
Token ID278
Feature activation+3.93e-01

Pos logit contributions
iaz+0.525
••••+0.442
abase+0.418
paralle+0.392
••+0.372
Neg logit contributions
jri-0.628
urgical-0.585
 disg-0.521
warts-0.516
 Generations-0.514
Token the
Token ID262
Feature activation+1.99e+00

Pos logit contributions
iaz+0.305
••••+0.260
abase+0.246
paralle+0.232
••+0.222
Neg logit contributions
jri-0.323
urgical-0.301
 disg-0.267
warts-0.263
 Generations-0.260
Tokenv
Token ID85
Feature activation+1.14e+00

Pos logit contributions
iaz+0.636
••••+0.556
abase+0.526
paralle+0.497
••+0.482
Neg logit contributions
jri-0.588
urgical-0.541
 disg-0.469
warts-0.463
 Ended-0.457
Tokenald
Token ID1940
Feature activation+1.45e+00

Pos logit contributions
iaz+0.794
••••+0.676
abase+0.626
paralle+0.595
••+0.566
Neg logit contributions
jri-1.022
urgical-0.954
 disg-0.851
warts-0.846
 Generations-0.827
Token Bo
Token ID3248
Feature activation-2.45e-01

Pos logit contributions
iaz+1.194
••••+1.043
abase+1.001
paralle+0.941
••+0.904
Neg logit contributions
jri-1.114
urgical-1.016
 disg-0.898
warts-0.891
 Generations-0.886
Tokenass
Token ID562
Feature activation+5.33e-02

Pos logit contributions
jri+0.213
urgical+0.197
 disg+0.176
warts+0.173
 Generations+0.170
Neg logit contributions
iaz-0.180
••••-0.153
abase-0.144
paralle-0.135
••-0.129
Tokenon
Token ID261
Feature activation+1.68e+00

Pos logit contributions
iaz+0.040
••••+0.034
abase+0.032
paralle+0.030
••+0.029
Neg logit contributions
jri-0.046
urgical-0.042
 disg-0.038
warts-0.037
 Generations-0.036
Token H
Token ID367
Feature activation+1.48e+00

Pos logit contributions
iaz+1.007
••••+0.843
abase+0.770
paralle+0.713
••+0.688
Neg logit contributions
jri-1.675
urgical-1.562
 disg-1.419
 Ended-1.404
warts-1.400
Tokenagen
Token ID11286
Feature activation+1.46e-01

Pos logit contributions
iaz+0.974
••••+0.810
abase+0.758
paralle+0.724
••+0.677
Neg logit contributions
jri-1.398
urgical-1.308
 disg-1.173
warts-1.163
 Ended-1.135
Token (
Token ID357
Feature activation+1.98e-01

Pos logit contributions
iaz+0.105
••••+0.089
abase+0.084
paralle+0.079
••+0.075
Neg logit contributions
jri-0.128
urgical-0.119
 disg-0.107
warts-0.105
 Ended-0.103
TokenSky
Token ID22308
Feature activation-2.64e-01

Pos logit contributions
iaz+0.159
••••+0.136
abase+0.129
paralle+0.122
••+0.117
Neg logit contributions
jri-0.159
urgical-0.146
 disg-0.129
warts-0.127
 Ended-0.125
Token Pro
Token ID1041
Feature activation-2.57e-01

Pos logit contributions
jri+0.245
urgical+0.229
 disg+0.206
warts+0.203
 Generations+0.200
Neg logit contributions
iaz-0.178
••••-0.148
abase-0.139
paralle-0.129
••-0.123
Tokency
Token ID948
Feature activation-1.18e-01

Pos logit contributions
jri+0.220
urgical+0.204
 disg+0.182
warts+0.178
 Ended+0.175
Neg logit contributions
iaz-0.193
••••-0.164
abase-0.155
paralle-0.145
••-0.139
Token handed
Token ID10158
Feature activation+6.40e-01

Pos logit contributions
iaz+0.619
••••+0.515
abase+0.491
paralle+0.463
phant+0.420
Neg logit contributions
jri-0.853
urgical-0.807
 disg-0.718
warts-0.706
 Generations-0.706
Token'
Token ID6
Feature activation+1.28e+00

Pos logit contributions
iaz+0.424
••••+0.351
abase+0.333
paralle+0.310
phant+0.289
Neg logit contributions
jri-0.602
urgical-0.564
 disg-0.507
warts-0.497
 Generations-0.493
Token 2016
Token ID1584
Feature activation+9.44e-01

Pos logit contributions
iaz+1.126
••••+0.986
abase+0.949
paralle+0.898
••+0.856
Neg logit contributions
jri-0.919
urgical-0.841
 disg-0.727
 Generations-0.720
 Ended-0.708
Token primary
Token ID4165
Feature activation+4.00e-01

Pos logit contributions
iaz+0.783
••••+0.678
abase+0.660
paralle+0.621
phant+0.590
Neg logit contributions
jri-0.715
urgical-0.662
 disg-0.586
 Generations-0.566
warts-0.564
Token Sanders
Token ID5831
Feature activation+6.07e-01

Pos logit contributions
iaz+0.298
••••+0.255
abase+0.242
paralle+0.228
phant+0.215
Neg logit contributions
jri-0.342
urgical-0.318
 disg-0.283
warts-0.278
 Ended-0.276
Token:
Token ID25
Feature activation+1.56e+00

Pos logit contributions
iaz+0.447
••••+0.383
abase+0.363
paralle+0.339
phant+0.322
Neg logit contributions
jri-0.523
urgical-0.485
 disg-0.434
warts-0.426
 Ended-0.421
Token '
Token ID705
Feature activation-8.19e-01

Pos logit contributions
iaz+1.225
••••+1.060
abase+1.010
paralle+0.955
phant+0.906
Neg logit contributions
jri-1.257
urgical-1.170
 disg-1.043
 Generations-1.024
 Ended-1.020
TokenDamn
Token ID43343
Feature activation+1.89e-01

Pos logit contributions
jri+0.712
urgical+0.662
 disg+0.592
warts+0.580
 Ended+0.570
Neg logit contributions
iaz-0.602
••••-0.510
abase-0.480
paralle-0.451
••-0.433
Token right
Token ID826
Feature activation+1.42e-01

Pos logit contributions
iaz+0.144
••••+0.123
abase+0.117
paralle+0.110
••+0.104
Neg logit contributions
jri-0.158
urgical-0.146
 disg-0.130
warts-0.128
 Generations-0.126
Token'
Token ID6
Feature activation+9.27e-01

Pos logit contributions
iaz+0.103
••••+0.087
abase+0.082
paralle+0.077
••+0.073
Neg logit contributions
jri-0.124
urgical-0.115
 disg-0.103
warts-0.101
 Ended-0.100
Token I
Token ID314
Feature activation+4.56e-01

Pos logit contributions
iaz+0.754
••••+0.652
abase+0.631
paralle+0.588
phant+0.572
Neg logit contributions
jri-0.727
urgical-0.666
 disg-0.591
 Generations-0.576
warts-0.574
TokenKim
Token ID26374
Feature activation-9.24e-01

Pos logit contributions
jri+0.384
urgical+0.345
 disg+0.294
warts+0.284
 Generations+0.278
Neg logit contributions
iaz-0.599
••••-0.530
abase-0.508
paralle-0.486
••-0.471
Token summit
Token ID14237
Feature activation+4.33e-01

Pos logit contributions
jri+0.660
urgical+0.602
 disg+0.525
warts+0.512
 Ended+0.499
Neg logit contributions
iaz-0.823
••••-0.718
abase-0.680
paralle-0.650
••-0.629
Token Manafort
Token ID26351
Feature activation+6.20e-01

Pos logit contributions
iaz+0.338
••••+0.292
abase+0.277
paralle+0.261
phant+0.249
Neg logit contributions
jri-0.357
urgical-0.330
 disg-0.291
warts-0.287
 Ended-0.284
Token's
Token ID338
Feature activation+1.15e+00

Pos logit contributions
iaz+0.474
••••+0.411
abase+0.386
paralle+0.362
••+0.352
Neg logit contributions
jri-0.516
urgical-0.477
 disg-0.429
warts-0.415
 Ended-0.412
Token attorneys
Token ID14449
Feature activation+1.79e+00

Pos logit contributions
iaz+1.061
••••+0.933
abase+0.893
paralle+0.852
••+0.830
Neg logit contributions
jri-0.774
urgical-0.703
 disg-0.607
warts-0.581
 Generations-0.579
Token say
Token ID910
Feature activation+1.92e+00

Pos logit contributions
iaz+1.605
••••+1.433
abase+1.361
paralle+1.262
••+1.259
Neg logit contributions
jri-1.237
urgical-1.140
 disg-1.002
 Ended-0.953
 Generations-0.942
Token he
Token ID339
Feature activation+6.04e-02

Pos logit contributions
iaz+1.646
••••+1.455
abase+1.354
paralle+1.302
••+1.271
Neg logit contributions
jri-1.431
urgical-1.312
 disg-1.165
 Generations-1.112
warts-1.110
Token should
Token ID815
Feature activation+4.38e-01

Pos logit contributions
iaz+0.049
••••+0.042
abase+0.040
paralle+0.037
••+0.036
Neg logit contributions
jri-0.048
urgical-0.045
 disg-0.039
warts-0.038
 Generations-0.038
Token get
Token ID651
Feature activation-5.82e-01

Pos logit contributions
iaz+0.389
••••+0.341
abase+0.328
paralle+0.309
••+0.299
Neg logit contributions
jri-0.310
urgical-0.285
 disg-0.249
warts-0.241
 Generations-0.237
Token less
Token ID1342
Feature activation+1.72e-01

Pos logit contributions
jri+0.446
urgical+0.409
 disg+0.359
warts+0.351
 Ended+0.344
Neg logit contributions
iaz-0.488
••••-0.423
abase-0.402
paralle-0.381
••-0.367
Token than
Token ID621
Feature activation+7.50e-04

Pos logit contributions
iaz+0.144
••••+0.125
abase+0.120
paralle+0.113
••+0.109
Neg logit contributions
jri-0.132
urgical-0.121
 disg-0.107
warts-0.103
 Ended-0.102
Token areas
Token ID3006
Feature activation+8.62e-02

Pos logit contributions
iaz+0.336
••••+0.288
abase+0.273
paralle+0.257
••+0.247
Neg logit contributions
jri-0.345
urgical-0.320
 disg-0.283
warts-0.277
 Generations-0.274
Token over
Token ID625
Feature activation+2.99e-01

Pos logit contributions
iaz+0.069
••••+0.060
abase+0.057
paralle+0.053
••+0.051
Neg logit contributions
jri-0.069
urgical-0.064
 disg-0.056
warts-0.055
 Generations-0.054
Token which
Token ID543
Feature activation+4.88e-01

Pos logit contributions
iaz+0.232
••••+0.199
abase+0.188
paralle+0.177
••+0.170
Neg logit contributions
jri-0.248
urgical-0.230
 disg-0.204
warts-0.200
 Generations-0.197
Token the
Token ID262
Feature activation+2.67e-01

Pos logit contributions
iaz+0.395
••••+0.340
abase+0.323
paralle+0.304
••+0.292
Neg logit contributions
jri-0.388
urgical-0.359
 disg-0.317
warts-0.309
 Generations-0.306
Token Commonwealth
Token ID17165
Feature activation+2.66e-01

Pos logit contributions
iaz+0.182
••••+0.153
abase+0.143
paralle+0.133
••+0.127
Neg logit contributions
jri-0.245
urgical-0.229
 disg-0.207
warts-0.203
 Generations-0.200
Token does
Token ID857
Feature activation+1.97e-01

Pos logit contributions
iaz+0.219
••••+0.189
abase+0.180
paralle+0.169
••+0.163
Neg logit contributions
jri-0.208
urgical-0.192
 disg-0.169
warts-0.166
 Ended-0.163
Token not
Token ID407
Feature activation+1.53e-02

Pos logit contributions
iaz+0.155
••••+0.133
abase+0.126
paralle+0.119
••+0.114
Neg logit contributions
jri-0.160
urgical-0.148
 disg-0.131
warts-0.128
 Generations-0.126
Token have
Token ID423
Feature activation+4.88e-01

Pos logit contributions
iaz+0.012
••••+0.010
abase+0.010
paralle+0.009
••+0.009
Neg logit contributions
jri-0.012
urgical-0.011
 disg-0.010
warts-0.010
 Generations-0.010
Token legislative
Token ID10828
Feature activation+1.99e-01

Pos logit contributions
iaz+0.437
••••+0.382
abase+0.365
paralle+0.347
••+0.334
Neg logit contributions
jri-0.346
urgical-0.317
 disg-0.276
warts-0.267
 Generations-0.263
Token power
Token ID1176
Feature activation+1.46e-01

Pos logit contributions
iaz+0.184
••••+0.162
abase+0.155
paralle+0.147
••+0.142
Neg logit contributions
jri-0.135
urgical-0.123
 disg-0.106
warts-0.103
 Generations-0.101
Token.
Token ID13
Feature activation+1.65e-01

Pos logit contributions
iaz+0.114
••••+0.097
abase+0.092
paralle+0.087
••+0.083
Neg logit contributions
jri-0.120
urgical-0.111
 disg-0.098
warts-0.096
 Ended-0.095
Token after
Token ID706
Feature activation+1.05e-01

Pos logit contributions
jri+0.059
urgical+0.055
 disg+0.049
warts+0.048
 Ended+0.047
Neg logit contributions
iaz-0.052
••••-0.045
abase-0.042
paralle-0.040
••-0.038
Token the
Token ID262
Feature activation+1.96e-02

Pos logit contributions
iaz+0.083
••••+0.071
abase+0.067
paralle+0.063
••+0.061
Neg logit contributions
jri-0.086
urgical-0.079
 disg-0.071
warts-0.069
 Generations-0.068
Token fact
Token ID1109
Feature activation-2.29e-01

Pos logit contributions
iaz+0.016
••••+0.014
abase+0.013
paralle+0.012
••+0.012
Neg logit contributions
jri-0.015
urgical-0.014
 disg-0.012
warts-0.012
 Generations-0.012
Token?
Token ID30
Feature activation-6.36e-02

Pos logit contributions
jri+0.197
urgical+0.183
 disg+0.164
warts+0.160
 Ended+0.158
Neg logit contributions
iaz-0.169
••••-0.144
abase-0.136
paralle-0.127
••-0.122
Token Like
Token ID4525
Feature activation+1.35e-01

Pos logit contributions
jri+0.053
urgical+0.049
 disg+0.043
warts+0.042
 Generations+0.042
Neg logit contributions
iaz-0.049
••••-0.042
abase-0.040
paralle-0.038
••-0.036
Token how
Token ID703
Feature activation+1.50e-01

Pos logit contributions
iaz+0.106
••••+0.091
abase+0.086
paralle+0.081
••+0.078
Neg logit contributions
jri-0.111
urgical-0.102
 disg-0.091
warts-0.089
 Generations-0.088
Token wal
Token ID6514
Feature activation+4.65e-01

Pos logit contributions
iaz+0.119
••••+0.103
abase+0.097
paralle+0.092
••+0.088
Neg logit contributions
jri-0.121
urgical-0.112
 disg-0.099
warts-0.097
 Generations-0.095
Tokenr
Token ID81
Feature activation+2.39e-03

Pos logit contributions
iaz+0.379
••••+0.327
abase+0.311
paralle+0.293
••+0.281
Neg logit contributions
jri-0.368
urgical-0.339
 disg-0.298
warts-0.293
 Generations-0.285
Tokenuses
Token ID2664
Feature activation+1.94e-02

Pos logit contributions
iaz+0.002
••••+0.001
abase+0.001
paralle+0.001
••+0.001
Neg logit contributions
jri-0.002
urgical-0.002
 disg-0.002
warts-0.002
 Ended-0.002
Token were
Token ID547
Feature activation+8.93e-02

Pos logit contributions
iaz+0.016
••••+0.014
abase+0.013
paralle+0.012
••+0.012
Neg logit contributions
jri-0.015
urgical-0.014
 disg-0.013
warts-0.012
 Ended-0.012
Token some
Token ID617
Feature activation+9.23e-02

Pos logit contributions
iaz+0.075
••••+0.065
abase+0.062
paralle+0.058
••+0.056
Neg logit contributions
jri-0.068
urgical-0.063
 disg-0.055
warts-0.054
 Ended-0.053
Token,
Token ID11
Feature activation+1.41e-01

Pos logit contributions
iaz+0.183
••••+0.162
abase+0.155
paralle+0.148
••+0.143
Neg logit contributions
jri-0.126
urgical-0.114
 disg-0.098
warts-0.095
 Ended-0.093
Token departing
Token ID37717
Feature activation+1.39e-01

Pos logit contributions
iaz+0.111
••••+0.095
abase+0.091
paralle+0.085
••+0.082
Neg logit contributions
jri-0.115
urgical-0.106
 disg-0.094
warts-0.092
 Generations-0.091
Token from
Token ID422
Feature activation+1.70e-01

Pos logit contributions
iaz+0.107
••••+0.091
abase+0.086
paralle+0.081
••+0.078
Neg logit contributions
jri-0.116
urgical-0.107
 disg-0.095
warts-0.093
 Ended-0.092
Token Manchester
Token ID9502
Feature activation+8.19e-02

Pos logit contributions
iaz+0.136
••••+0.117
abase+0.112
paralle+0.105
••+0.101
Neg logit contributions
jri-0.136
urgical-0.125
 disg-0.111
warts-0.108
 Ended-0.107
Token on
Token ID319
Feature activation+8.27e-02

Pos logit contributions
iaz+0.063
••••+0.054
abase+0.051
paralle+0.048
••+0.046
Neg logit contributions
jri-0.068
urgical-0.063
 disg-0.056
warts-0.055
 Generations-0.054
Token December
Token ID3426
Feature activation+1.82e-01

Pos logit contributions
iaz+0.068
••••+0.058
abase+0.056
paralle+0.053
••+0.051
Neg logit contributions
jri-0.065
urgical-0.060
 disg-0.053
warts-0.051
 Ended-0.051
Token 23
Token ID2242
Feature activation+2.35e-01

Pos logit contributions
iaz+0.132
••••+0.112
abase+0.106
paralle+0.099
••+0.094
Neg logit contributions
jri-0.159
urgical-0.148
 disg-0.132
warts-0.130
 Ended-0.128
Token;
Token ID26
Feature activation+1.19e-01

Pos logit contributions
iaz+0.173
••••+0.147
abase+0.139
paralle+0.130
••+0.124
Neg logit contributions
jri-0.203
urgical-0.188
 disg-0.169
warts-0.165
 Ended-0.163
Token and
Token ID290
Feature activation+1.13e-01

Pos logit contributions
iaz+0.093
••••+0.079
abase+0.075
paralle+0.071
••+0.068
Neg logit contributions
jri-0.099
urgical-0.091
 disg-0.082
warts-0.080
 Generations-0.078
Token the
Token ID262
Feature activation+9.22e-02

Pos logit contributions
iaz+0.088
••••+0.075
abase+0.071
paralle+0.067
••+0.064
Neg logit contributions
jri-0.093
urgical-0.086
 disg-0.077
warts-0.075
 Generations-0.074
Token five
Token ID1936
Feature activation+8.18e-03

Pos logit contributions
iaz+0.073
••••+0.063
abase+0.060
paralle+0.057
••+0.054
Neg logit contributions
jri-0.074
urgical-0.069
 disg-0.061
warts-0.059
 Generations-0.059
Tokenroph
Token ID10051
Feature activation-8.47e-01

Pos logit contributions
jri+0.118
urgical+0.110
 disg+0.097
warts+0.096
 Generations+0.094
Neg logit contributions
iaz-0.103
••••-0.088
abase-0.083
paralle-0.078
••-0.075
Tokenysics
Token ID23154
Feature activation-1.03e-01

Pos logit contributions
jri+0.560
urgical+0.502
 disg+0.437
warts+0.421
 Ended+0.412
Neg logit contributions
iaz-0.803
••••-0.704
abase-0.677
paralle-0.643
••-0.623
Token and
Token ID290
Feature activation+4.20e-01

Pos logit contributions
jri+0.086
urgical+0.079
 disg+0.071
warts+0.069
 Ended+0.068
Neg logit contributions
iaz-0.079
••••-0.068
abase-0.064
paralle-0.060
••-0.058
Token his
Token ID465
Feature activation+6.85e-01

Pos logit contributions
iaz+0.341
••••+0.294
abase+0.279
paralle+0.264
••+0.254
Neg logit contributions
jri-0.333
urgical-0.307
 disg-0.271
warts-0.265
 Generations-0.262
Token coll
Token ID2927
Feature activation+8.35e-02

Pos logit contributions
iaz+0.570
••••+0.495
abase+0.470
paralle+0.444
••+0.429
Neg logit contributions
jri-0.529
urgical-0.486
 disg-0.427
warts-0.418
 Generations-0.411
Tokenagues
Token ID6120
Feature activation+2.58e-01

Pos logit contributions
iaz+0.065
••••+0.056
abase+0.053
paralle+0.050
••+0.048
Neg logit contributions
jri-0.068
urgical-0.063
 disg-0.056
warts-0.055
 Generations-0.054
Token have
Token ID423
Feature activation-2.75e-02

Pos logit contributions
iaz+0.203
••••+0.175
abase+0.165
paralle+0.156
••+0.150
Neg logit contributions
jri-0.210
urgical-0.194
 disg-0.172
warts-0.168
 Ended-0.166
Token found
Token ID1043
Feature activation+7.20e-02

Pos logit contributions
jri+0.021
urgical+0.019
 disg+0.017
warts+0.016
 Ended+0.016
Neg logit contributions
iaz-0.023
••••-0.020
abase-0.019
paralle-0.018
••-0.017
Token that
Token ID326
Feature activation+9.66e-02

Pos logit contributions
iaz+0.058
••••+0.049
abase+0.047
paralle+0.044
••+0.043
Neg logit contributions
jri-0.058
urgical-0.053
 disg-0.047
warts-0.046
 Generations-0.046
Token Kepler
Token ID40294
Feature activation+6.59e-01

Pos logit contributions
iaz+0.077
••••+0.066
abase+0.063
paralle+0.059
••+0.057
Neg logit contributions
jri-0.078
urgical-0.072
 disg-0.064
warts-0.062
 Generations-0.061
Token-
Token ID12
Feature activation+1.80e+00

Pos logit contributions
iaz+0.507
••••+0.433
abase+0.410
paralle+0.383
••+0.371
Neg logit contributions
jri-0.548
urgical-0.507
 disg-0.455
warts-0.441
 Ended-0.437
Token not
Token ID407
Feature activation-1.81e-01

Pos logit contributions
jri+0.075
urgical+0.068
 disg+0.060
warts+0.059
 Ended+0.057
Neg logit contributions
iaz-0.085
••••-0.074
abase-0.070
paralle-0.067
••-0.064
Token find
Token ID1064
Feature activation+2.03e-02

Pos logit contributions
jri+0.137
urgical+0.125
 disg+0.110
warts+0.107
 Generations+0.105
Neg logit contributions
iaz-0.154
••••-0.133
abase-0.127
paralle-0.120
••-0.116
Token it
Token ID340
Feature activation-1.67e-01

Pos logit contributions
iaz+0.016
••••+0.014
abase+0.013
paralle+0.012
••+0.012
Neg logit contributions
jri-0.016
urgical-0.015
 disg-0.013
warts-0.013
 Generations-0.013
Token engaging
Token ID11932
Feature activation+3.25e-02

Pos logit contributions
jri+0.132
urgical+0.122
 disg+0.108
warts+0.106
 Ended+0.104
Neg logit contributions
iaz-0.135
••••-0.116
abase-0.110
paralle-0.104
••-0.100
Token in
Token ID287
Feature activation+8.12e-02

Pos logit contributions
iaz+0.024
••••+0.021
abase+0.020
paralle+0.018
••+0.018
Neg logit contributions
jri-0.028
urgical-0.026
 disg-0.023
warts-0.022
 Ended-0.022
Token the
Token ID262
Feature activation+2.48e-01

Pos logit contributions
iaz+0.067
••••+0.058
abase+0.055
paralle+0.052
••+0.050
Neg logit contributions
jri-0.063
urgical-0.058
 disg-0.052
warts-0.050
 Generations-0.049
Token least
Token ID1551
Feature activation+2.79e-01

Pos logit contributions
iaz+0.205
••••+0.177
abase+0.169
paralle+0.160
••+0.153
Neg logit contributions
jri-0.193
urgical-0.177
 disg-0.157
warts-0.152
 Generations-0.150
Token.
Token ID13
Feature activation-6.77e-02

Pos logit contributions
iaz+0.216
••••+0.185
abase+0.175
paralle+0.165
••+0.158
Neg logit contributions
jri-0.231
urgical-0.214
 disg-0.191
warts-0.186
 Ended-0.183
Token
Token ID198
Feature activation-3.43e-01

Pos logit contributions
jri+0.057
urgical+0.053
 disg+0.047
warts+0.046
 Generations+0.045
Neg logit contributions
iaz-0.052
••••-0.044
abase-0.042
paralle-0.039
••-0.037
Token
Token ID198
Feature activation-2.59e-01

Pos logit contributions
jri+0.294
urgical+0.273
 disg+0.244
warts+0.239
 Ended+0.235
Neg logit contributions
iaz-0.256
••••-0.217
abase-0.205
paralle-0.192
••-0.184
TokenFor
Token ID1890
Feature activation-1.17e-01

Pos logit contributions
jri+0.221
urgical+0.204
 disg+0.182
warts+0.179
 Ended+0.176
Neg logit contributions
iaz-0.195
••••-0.165
abase-0.157
paralle-0.147
••-0.140
Token.
Token ID13
Feature activation+7.82e-02

Pos logit contributions
jri+0.284
urgical+0.262
 disg+0.234
warts+0.229
 Ended+0.225
Neg logit contributions
iaz-0.262
••••-0.224
abase-0.212
paralle-0.199
••-0.191
Token And
Token ID843
Feature activation-1.09e-02

Pos logit contributions
iaz+0.061
••••+0.052
abase+0.049
paralle+0.047
••+0.044
Neg logit contributions
jri-0.064
urgical-0.060
 disg-0.053
warts-0.052
 Ended-0.051
Token that
Token ID326
Feature activation-1.84e-01

Pos logit contributions
jri+0.009
urgical+0.008
 disg+0.007
warts+0.007
 Generations+0.007
Neg logit contributions
iaz-0.008
••••-0.007
abase-0.007
paralle-0.007
••-0.006
Token
Token ID447
Feature activation-5.97e-02

Pos logit contributions
jri+0.151
urgical+0.139
 disg+0.123
warts+0.121
 Ended+0.119
Neg logit contributions
iaz-0.145
••••-0.124
abase-0.118
paralle-0.111
••-0.106
Token
Token ID247
Feature activation-1.59e-01

Pos logit contributions
jri+0.052
urgical+0.048
 disg+0.043
warts+0.042
 Generations+0.042
Neg logit contributions
iaz-0.044
••••-0.037
abase-0.035
paralle-0.033
••-0.031
Tokens
Token ID82
Feature activation-1.48e-01

Pos logit contributions
jri+0.142
urgical+0.132
 disg+0.118
warts+0.116
 Ended+0.114
Neg logit contributions
iaz-0.113
••••-0.095
abase-0.090
paralle-0.084
••-0.080
Token it
Token ID340
Feature activation-2.82e-01

Pos logit contributions
jri+0.119
urgical+0.110
 disg+0.098
warts+0.095
 Ended+0.094
Neg logit contributions
iaz-0.119
••••-0.102
abase-0.097
paralle-0.091
••-0.087
Token!
Token ID0
Feature activation+7.56e-02

Pos logit contributions
jri+0.240
urgical+0.222
 disg+0.198
warts+0.194
 Ended+0.191
Neg logit contributions
iaz-0.213
••••-0.181
abase-0.171
paralle-0.161
••-0.154
Token Hopefully
Token ID19996
Feature activation+4.82e-02

Pos logit contributions
iaz+0.059
••••+0.050
abase+0.048
paralle+0.045
••+0.043
Neg logit contributions
jri-0.062
urgical-0.058
 disg-0.051
warts-0.050
 Ended-0.049
Token this
Token ID428
Feature activation-1.55e-01

Pos logit contributions
iaz+0.037
••••+0.032
abase+0.030
paralle+0.028
••+0.027
Neg logit contributions
jri-0.040
urgical-0.037
 disg-0.033
warts-0.032
 Generations-0.032
Token wasn
Token ID2492
Feature activation-1.78e-01

Pos logit contributions
jri+0.120
urgical+0.110
 disg+0.097
warts+0.095
 Ended+0.093
Neg logit contributions
iaz-0.129
••••-0.112
abase-0.106
paralle-0.101
••-0.097
Token itself
Token ID2346
Feature activation+8.45e-03

Pos logit contributions
jri+0.013
urgical+0.012
 disg+0.010
warts+0.010
 Ended+0.010
Neg logit contributions
iaz-0.012
••••-0.010
abase-0.010
paralle-0.009
••-0.009
Token has
Token ID468
Feature activation-4.36e-02

Pos logit contributions
iaz+0.006
••••+0.006
abase+0.005
paralle+0.005
••+0.005
Neg logit contributions
jri-0.007
urgical-0.007
 disg-0.006
warts-0.006
 Ended-0.006
Token seemed
Token ID3947
Feature activation+2.46e-01

Pos logit contributions
jri+0.034
urgical+0.031
 disg+0.028
warts+0.027
 Ended+0.027
Neg logit contributions
iaz-0.036
••••-0.031
abase-0.029
paralle-0.028
••-0.027
Token relatively
Token ID5365
Feature activation+9.13e-02

Pos logit contributions
iaz+0.200
••••+0.172
abase+0.164
paralle+0.155
••+0.149
Neg logit contributions
jri-0.195
urgical-0.179
 disg-0.159
warts-0.155
 Ended-0.152
Token quiet
Token ID5897
Feature activation-4.52e-02

Pos logit contributions
iaz+0.080
••••+0.070
abase+0.066
paralle+0.063
••+0.061
Neg logit contributions
jri-0.067
urgical-0.061
 disg-0.053
warts-0.052
 Ended-0.051
Token of
Token ID286
Feature activation+8.86e-02

Pos logit contributions
jri+0.038
urgical+0.035
 disg+0.031
warts+0.031
 Generations+0.030
Neg logit contributions
iaz-0.034
••••-0.029
abase-0.028
paralle-0.026
••-0.025
Token late
Token ID2739
Feature activation+1.20e-02

Pos logit contributions
iaz+0.064
••••+0.054
abase+0.051
paralle+0.048
••+0.046
Neg logit contributions
jri-0.078
urgical-0.072
 disg-0.065
warts-0.064
 Generations-0.063
Token despite
Token ID3805
Feature activation+1.85e-01

Pos logit contributions
iaz+0.009
••••+0.008
abase+0.007
paralle+0.007
••+0.006
Neg logit contributions
jri-0.010
urgical-0.010
 disg-0.009
warts-0.008
 Ended-0.008
Token the
Token ID262
Feature activation+7.71e-02

Pos logit contributions
iaz+0.147
••••+0.127
abase+0.120
paralle+0.113
••+0.109
Neg logit contributions
jri-0.150
urgical-0.138
 disg-0.122
warts-0.120
 Generations-0.118
Token fact
Token ID1109
Feature activation+2.74e-02

Pos logit contributions
iaz+0.064
••••+0.055
abase+0.053
paralle+0.050
••+0.048
Neg logit contributions
jri-0.060
urgical-0.055
 disg-0.048
warts-0.047
 Generations-0.046
Token that
Token ID326
Feature activation+1.04e-01

Pos logit contributions
iaz+0.022
••••+0.019
abase+0.018
paralle+0.017
••+0.016
Neg logit contributions
jri-0.022
urgical-0.020
 disg-0.018
warts-0.017
 Generations-0.017
Token
Token ID564
Feature activation+9.49e-02

Pos logit contributions
iaz+0.018
••••+0.015
abase+0.015
paralle+0.014
••+0.013
Neg logit contributions
jri-0.020
urgical-0.018
 disg-0.016
warts-0.016
 Generations-0.016
Token
Token ID250
Feature activation+1.37e-01

Pos logit contributions
iaz+0.082
••••+0.071
abase+0.068
paralle+0.064
••+0.062
Neg logit contributions
jri-0.070
urgical-0.065
 disg-0.056
warts-0.055
 Ended-0.054
TokenWe
Token ID1135
Feature activation+1.91e-01

Pos logit contributions
iaz+0.106
••••+0.091
abase+0.086
paralle+0.081
••+0.077
Neg logit contributions
jri-0.113
urgical-0.104
 disg-0.092
warts-0.091
 Ended-0.089
Token
Token ID447
Feature activation+2.68e-02

Pos logit contributions
iaz+0.153
••••+0.131
abase+0.125
paralle+0.118
••+0.113
Neg logit contributions
jri-0.153
urgical-0.141
 disg-0.125
warts-0.122
 Ended-0.120
Token
Token ID247
Feature activation+1.01e-01

Pos logit contributions
iaz+0.020
••••+0.017
abase+0.016
paralle+0.015
••+0.014
Neg logit contributions
jri-0.023
urgical-0.021
 disg-0.019
warts-0.019
 Ended-0.018
Tokenre
Token ID260
Feature activation+2.82e-02

Pos logit contributions
iaz+0.081
••••+0.069
abase+0.066
paralle+0.062
••+0.060
Neg logit contributions
jri-0.082
urgical-0.075
 disg-0.067
warts-0.065
 Ended-0.064
Token not
Token ID407
Feature activation-1.39e-02

Pos logit contributions
iaz+0.023
••••+0.020
abase+0.019
paralle+0.018
••+0.017
Neg logit contributions
jri-0.022
urgical-0.021
 disg-0.018
warts-0.018
 Ended-0.018
Token acknowledging
Token ID25937
Feature activation-8.53e-02

Pos logit contributions
jri+0.010
urgical+0.010
 disg+0.008
warts+0.008
 Ended+0.008
Neg logit contributions
iaz-0.012
••••-0.010
abase-0.010
paralle-0.009
••-0.009
Token that
Token ID326
Feature activation-4.11e-02

Pos logit contributions
jri+0.070
urgical+0.064
 disg+0.057
warts+0.056
 Generations+0.055
Neg logit contributions
iaz-0.067
••••-0.058
abase-0.055
paralle-0.051
••-0.049
Token we
Token ID356
Feature activation+1.14e-01

Pos logit contributions
jri+0.034
urgical+0.031
 disg+0.028
warts+0.027
 Generations+0.027
Neg logit contributions
iaz-0.032
••••-0.028
abase-0.026
paralle-0.025
••-0.024
Token don
Token ID836
Feature activation+2.84e-01

Pos logit contributions
iaz+0.090
••••+0.078
abase+0.074
paralle+0.069
••+0.067
Neg logit contributions
jri-0.092
urgical-0.085
 disg-0.075
warts-0.074
 Ended-0.073
Token Catalan
Token ID31066
Feature activation+3.34e-01

Pos logit contributions
iaz+0.181
••••+0.156
abase+0.149
paralle+0.140
••+0.135
Neg logit contributions
jri-0.172
urgical-0.159
 disg-0.140
warts-0.136
 Generations-0.134
Token pro
Token ID386
Feature activation+6.87e-03

Pos logit contributions
iaz+0.259
••••+0.222
abase+0.211
paralle+0.198
••+0.190
Neg logit contributions
jri-0.276
urgical-0.255
 disg-0.226
warts-0.222
 Generations-0.219
Token-
Token ID12
Feature activation-6.47e-03

Pos logit contributions
iaz+0.005
••••+0.004
abase+0.004
paralle+0.004
••+0.003
Neg logit contributions
jri-0.006
urgical-0.006
 disg-0.005
warts-0.005
 Generations-0.005
Tokenindependence
Token ID39894
Feature activation+2.45e-01

Pos logit contributions
jri+0.005
urgical+0.004
 disg+0.004
warts+0.004
 Ended+0.004
Neg logit contributions
iaz-0.006
••••-0.005
abase-0.005
paralle-0.004
••-0.004
Token flag
Token ID6056
Feature activation-2.94e-01

Pos logit contributions
iaz+0.206
••••+0.178
abase+0.170
paralle+0.161
••+0.155
Neg logit contributions
jri-0.187
urgical-0.172
 disg-0.151
warts-0.148
 Generations-0.145
Token -
Token ID532
Feature activation-1.91e-02

Pos logit contributions
jri+0.254
urgical+0.235
 disg+0.210
warts+0.206
 Generations+0.203
Neg logit contributions
iaz-0.218
••••-0.185
abase-0.175
paralle-0.163
••-0.156
Tokenthe
Token ID1169
Feature activation+7.38e-02

Pos logit contributions
jri+0.016
urgical+0.014
 disg+0.013
warts+0.013
 Generations+0.012
Neg logit contributions
iaz-0.015
••••-0.013
abase-0.012
paralle-0.011
••-0.011
Token est
Token ID1556
Feature activation-1.84e-01

Pos logit contributions
iaz+0.054
••••+0.045
abase+0.043
paralle+0.040
••+0.038
Neg logit contributions
jri-0.065
urgical-0.060
 disg-0.054
warts-0.053
 Generations-0.052
Tokenel
Token ID417
Feature activation+2.76e-01

Pos logit contributions
jri+0.146
urgical+0.135
 disg+0.119
warts+0.116
 Ended+0.114
Neg logit contributions
iaz-0.149
••••-0.129
abase-0.122
paralle-0.115
••-0.111
Tokenada
Token ID4763
Feature activation+2.24e-01

Pos logit contributions
iaz+0.199
••••+0.169
abase+0.159
paralle+0.150
••+0.143
Neg logit contributions
jri-0.243
urgical-0.226
 disg-0.201
warts-0.198
 Ended-0.195
Token -
Token ID532
Feature activation+2.94e-02

Pos logit contributions
iaz+0.167
••••+0.142
abase+0.134
paralle+0.126
••+0.120
Neg logit contributions
jri-0.193
urgical-0.178
 disg-0.159
warts-0.156
 Generations-0.154
Token who
Token ID508
Feature activation-3.04e-02

Pos logit contributions
jri+0.047
urgical+0.043
 disg+0.038
warts+0.038
 Generations+0.037
Neg logit contributions
iaz-0.044
••••-0.038
abase-0.036
paralle-0.034
••-0.033
Token has
Token ID468
Feature activation-1.76e-01

Pos logit contributions
jri+0.025
urgical+0.023
 disg+0.021
warts+0.020
 Ended+0.020
Neg logit contributions
iaz-0.024
••••-0.020
abase-0.019
paralle-0.018
••-0.017
Token repeatedly
Token ID7830
Feature activation+2.95e-02

Pos logit contributions
jri+0.133
urgical+0.122
 disg+0.108
warts+0.105
 Ended+0.103
Neg logit contributions
iaz-0.148
••••-0.129
abase-0.123
paralle-0.116
••-0.112
Token condemned
Token ID15006
Feature activation+1.97e-01

Pos logit contributions
iaz+0.026
••••+0.023
abase+0.022
paralle+0.020
••+0.020
Neg logit contributions
jri-0.021
urgical-0.020
 disg-0.017
warts-0.017
 Ended-0.016
Token trade
Token ID3292
Feature activation-2.51e-02

Pos logit contributions
iaz+0.156
••••+0.134
abase+0.128
paralle+0.120
••+0.115
Neg logit contributions
jri-0.160
urgical-0.147
 disg-0.131
warts-0.128
 Generations-0.125
Token deals
Token ID7529
Feature activation-3.28e-02

Pos logit contributions
jri+0.021
urgical+0.019
 disg+0.017
warts+0.017
 Ended+0.016
Neg logit contributions
iaz-0.020
••••-0.017
abase-0.016
paralle-0.015
••-0.014
Token he
Token ID339
Feature activation-1.36e-01

Pos logit contributions
jri+0.027
urgical+0.025
 disg+0.022
warts+0.022
 Ended+0.021
Neg logit contributions
iaz-0.026
••••-0.022
abase-0.021
paralle-0.020
••-0.019
Token believes
Token ID5804
Feature activation-7.14e-02

Pos logit contributions
jri+0.111
urgical+0.103
 disg+0.091
warts+0.089
 Ended+0.088
Neg logit contributions
iaz-0.106
••••-0.091
abase-0.086
paralle-0.081
••-0.078
Token are
Token ID389
Feature activation-8.42e-02

Pos logit contributions
jri+0.053
urgical+0.049
 disg+0.043
warts+0.042
 Ended+0.041
Neg logit contributions
iaz-0.061
••••-0.053
abase-0.051
paralle-0.048
••-0.047
Token bad
Token ID2089
Feature activation-2.85e-01

Pos logit contributions
jri+0.063
urgical+0.058
 disg+0.051
warts+0.050
 Ended+0.049
Neg logit contributions
iaz-0.071
••••-0.062
abase-0.059
paralle-0.056
••-0.054
Token for
Token ID329
Feature activation-8.34e-02

Pos logit contributions
jri+0.243
urgical+0.225
 disg+0.201
warts+0.197
 Ended+0.194
Neg logit contributions
iaz-0.213
••••-0.181
abase-0.172
paralle-0.161
••-0.154
Token-
Token ID12
Feature activation+1.34e-01

Pos logit contributions
iaz+0.083
••••+0.070
abase+0.066
paralle+0.061
••+0.058
Neg logit contributions
jri-0.106
urgical-0.099
 disg-0.089
warts-0.087
 Ended-0.086
TokenMarch
Token ID16192
Feature activation+9.75e-02

Pos logit contributions
iaz+0.097
••••+0.082
abase+0.078
paralle+0.072
••+0.069
Neg logit contributions
jri-0.118
urgical-0.109
 disg-0.098
warts-0.096
 Ended-0.094
Token on
Token ID319
Feature activation+1.44e-01

Pos logit contributions
iaz+0.074
••••+0.063
abase+0.059
paralle+0.056
••+0.053
Neg logit contributions
jri-0.083
urgical-0.077
 disg-0.069
warts-0.067
 Ended-0.066
Token a
Token ID257
Feature activation+3.10e-01

Pos logit contributions
iaz+0.115
••••+0.098
abase+0.093
paralle+0.088
••+0.084
Neg logit contributions
jri-0.117
urgical-0.108
 disg-0.096
warts-0.093
 Generations-0.092
Token United
Token ID1578
Feature activation+8.00e-02

Pos logit contributions
iaz+0.251
••••+0.216
abase+0.205
paralle+0.194
••+0.186
Neg logit contributions
jri-0.246
urgical-0.226
 disg-0.201
warts-0.195
 Generations-0.193
Token Launch
Token ID21225
Feature activation-1.01e+00

Pos logit contributions
iaz+0.062
••••+0.053
abase+0.050
paralle+0.047
••+0.045
Neg logit contributions
jri-0.066
urgical-0.061
 disg-0.055
warts-0.053
 Generations-0.053
Token Alliance
Token ID10302
Feature activation+1.37e-01

Pos logit contributions
jri+0.839
urgical+0.771
 disg+0.689
warts+0.674
 Ended+0.659
Neg logit contributions
iaz-0.782
••••-0.670
abase-0.632
paralle-0.594
••-0.573
Token Atlas
Token ID22494
Feature activation-3.40e-01

Pos logit contributions
iaz+0.105
••••+0.090
abase+0.085
paralle+0.080
••+0.077
Neg logit contributions
jri-0.114
urgical-0.106
 disg-0.094
warts-0.092
 Generations-0.091
Token 5
Token ID642
Feature activation+1.00e-01

Pos logit contributions
jri+0.313
urgical+0.291
 disg+0.263
warts+0.258
 Generations+0.254
Neg logit contributions
iaz-0.232
••••-0.194
abase-0.182
paralle-0.170
••-0.161
Token,
Token ID11
Feature activation+2.02e-01

Pos logit contributions
iaz+0.095
••••+0.084
abase+0.080
paralle+0.077
••+0.074
Neg logit contributions
jri-0.066
urgical-0.060
 disg-0.051
warts-0.050
 Generations-0.049
Token but
Token ID475
Feature activation-2.57e-02

Pos logit contributions
iaz+0.158
••••+0.135
abase+0.128
paralle+0.121
••+0.116
Neg logit contributions
jri-0.165
urgical-0.153
 disg-0.136
warts-0.133
 Generations-0.131
Token slide
Token ID10649
Feature activation-8.22e-02

Pos logit contributions
iaz+0.884
••••+0.786
abase+0.763
paralle+0.727
••+0.706
Neg logit contributions
jri-0.461
urgical-0.406
 disg-0.338
 Generations-0.322
warts-0.321
Token
Token ID198
Feature activation-1.65e-01

Pos logit contributions
jri+0.070
urgical+0.065
 disg+0.058
warts+0.057
 Ended+0.056
Neg logit contributions
iaz-0.062
••••-0.052
abase-0.050
paralle-0.047
••-0.045
Token
Token ID198
Feature activation-7.64e-01

Pos logit contributions
jri+0.141
urgical+0.130
 disg+0.116
warts+0.114
 Ended+0.112
Neg logit contributions
iaz-0.123
••••-0.105
abase-0.099
paralle-0.093
••-0.089
TokenPower
Token ID13434
Feature activation-1.06e+00

Pos logit contributions
jri+0.634
urgical+0.586
 disg+0.522
warts+0.511
 Generations+0.502
Neg logit contributions
iaz-0.591
••••-0.505
abase-0.479
paralle-0.452
••-0.434
TokenPoint
Token ID12727
Feature activation-6.70e-01

Pos logit contributions
jri+0.885
urgical+0.820
 disg+0.728
warts+0.717
 Generations+0.701
Neg logit contributions
iaz-0.812
••••-0.692
abase-0.653
paralle-0.618
••-0.596
Token slide
Token ID10649
Feature activation-1.31e+00

Pos logit contributions
jri+0.397
urgical+0.354
 disg+0.296
warts+0.287
 Ended+0.279
Neg logit contributions
iaz-0.678
••••-0.603
abase-0.579
paralle-0.554
••-0.539
Token PNG
Token ID36182
Feature activation-4.03e-01

Pos logit contributions
jri+0.692
urgical+0.610
 disg+0.493
warts+0.479
 Generations+0.456
Neg logit contributions
iaz-1.415
••••-1.259
abase-1.214
paralle-1.167
••-1.137
Token larger
Token ID4025
Feature activation-4.78e-01

Pos logit contributions
jri+0.391
urgical+0.366
 disg+0.332
warts+0.326
 Ended+0.321
Neg logit contributions
iaz-0.255
••••-0.210
abase-0.196
paralle-0.181
••-0.171
Token image
Token ID2939
Feature activation+3.44e-01

Pos logit contributions
jri+0.331
urgical+0.301
 disg+0.261
warts+0.254
 Ended+0.248
Neg logit contributions
iaz-0.436
••••-0.382
abase-0.366
paralle-0.347
••-0.336
Token
Token ID198
Feature activation+5.53e-02

Pos logit contributions
iaz+0.261
••••+0.223
abase+0.212
paralle+0.199
••+0.189
Neg logit contributions
jri-0.288
urgical-0.268
 disg-0.239
warts-0.234
 Generations-0.232
Token
Token ID198
Feature activation+4.41e-01

Pos logit contributions
iaz+0.042
••••+0.035
abase+0.034
paralle+0.032
••+0.030
Neg logit contributions
jri-0.047
urgical-0.043
 disg-0.039
warts-0.038
 Generations-0.037
Token delivered
Token ID6793
Feature activation+7.84e-02

Pos logit contributions
jri+0.045
urgical+0.041
 disg+0.036
warts+0.035
 Ended+0.035
Neg logit contributions
iaz-0.047
••••-0.041
abase-0.039
paralle-0.037
••-0.035
Token Cleveland
Token ID10306
Feature activation+7.49e-02

Pos logit contributions
iaz+0.060
••••+0.052
abase+0.049
paralle+0.046
••+0.044
Neg logit contributions
jri-0.065
urgical-0.060
 disg-0.054
warts-0.053
 Generations-0.052
Token its
Token ID663
Feature activation-1.62e-01

Pos logit contributions
iaz+0.058
••••+0.050
abase+0.047
paralle+0.044
••+0.042
Neg logit contributions
jri-0.062
urgical-0.057
 disg-0.051
warts-0.050
 Generations-0.049
Token long
Token ID890
Feature activation+3.45e-02

Pos logit contributions
jri+0.125
urgical+0.115
 disg+0.101
warts+0.099
 Generations+0.097
Neg logit contributions
iaz-0.134
••••-0.116
abase-0.110
paralle-0.104
••-0.100
Token-
Token ID12
Feature activation-5.63e-02

Pos logit contributions
iaz+0.026
••••+0.022
abase+0.021
paralle+0.020
••+0.019
Neg logit contributions
jri-0.029
urgical-0.027
 disg-0.024
warts-0.024
 Generations-0.024
Tokens
Token ID82
Feature activation-1.32e+00

Pos logit contributions
jri+0.042
urgical+0.038
 disg+0.033
warts+0.033
 Ended+0.032
Neg logit contributions
iaz-0.049
••••-0.042
abase-0.040
paralle-0.038
••-0.037
Tokenought
Token ID2917
Feature activation+1.26e-02

Pos logit contributions
jri+0.996
urgical+0.907
 disg+0.805
warts+0.784
 Generations+0.772
Neg logit contributions
iaz-1.122
••••-0.969
abase-0.929
paralle-0.878
phant-0.843
Token NFL
Token ID5134
Feature activation-2.52e-01

Pos logit contributions
iaz+0.010
••••+0.009
abase+0.008
paralle+0.008
••+0.008
Neg logit contributions
jri-0.010
urgical-0.009
 disg-0.008
warts-0.008
 Generations-0.008
Token Championship
Token ID10749
Feature activation-2.18e-01

Pos logit contributions
jri+0.194
urgical+0.178
 disg+0.157
warts+0.153
 Generations+0.150
Neg logit contributions
iaz-0.210
••••-0.181
abase-0.173
paralle-0.163
••-0.157
Token in
Token ID287
Feature activation-1.26e-02

Pos logit contributions
jri+0.182
urgical+0.168
 disg+0.150
warts+0.147
 Generations+0.144
Neg logit contributions
iaz-0.167
••••-0.143
abase-0.135
paralle-0.127
••-0.122
Token 1945
Token ID15761
Feature activation-7.78e-02

Pos logit contributions
jri+0.010
urgical+0.009
 disg+0.008
warts+0.008
 Generations+0.008
Neg logit contributions
iaz-0.010
••••-0.009
abase-0.008
paralle-0.008
••-0.008
Token information
Token ID1321
Feature activation-2.95e-01

Pos logit contributions
jri+0.146
urgical+0.134
 disg+0.119
warts+0.116
 Ended+0.114
Neg logit contributions
iaz-0.158
••••-0.136
abase-0.130
paralle-0.123
••-0.118
Token.
Token ID13
Feature activation+6.66e-02

Pos logit contributions
jri+0.255
urgical+0.237
 disg+0.212
warts+0.208
 Ended+0.204
Neg logit contributions
iaz-0.217
••••-0.184
abase-0.174
paralle-0.163
••-0.156
Token
Token ID447
Feature activation+9.33e-03

Pos logit contributions
iaz+0.051
••••+0.043
abase+0.041
paralle+0.039
••+0.037
Neg logit contributions
jri-0.056
urgical-0.052
 disg-0.046
warts-0.045
 Ended-0.045
Token
Token ID251
Feature activation+1.08e-01

Pos logit contributions
iaz+0.006
••••+0.005
abase+0.004
paralle+0.004
••+0.004
Neg logit contributions
jri-0.009
urgical-0.009
 disg-0.008
warts-0.008
 Ended-0.008
Token<|endoftext|>
Token ID50256
Feature activation-1.02e-01

Pos logit contributions
iaz+0.082
••••+0.070
abase+0.066
paralle+0.062
••+0.059
Neg logit contributions
jri-0.091
urgical-0.085
 disg-0.076
warts-0.074
 Ended-0.073
TokenPower
Token ID13434
Feature activation-1.06e+00

Pos logit contributions
jri+0.087
urgical+0.081
 disg+0.072
warts+0.071
 Ended+0.070
Neg logit contributions
iaz-0.077
••••-0.065
abase-0.062
paralle-0.058
••-0.055
Token BI
Token ID20068
Feature activation+8.15e-02

Pos logit contributions
jri+0.854
urgical+0.786
 disg+0.696
warts+0.681
 Generations+0.668
Neg logit contributions
iaz-0.853
••••-0.735
abase-0.696
paralle-0.657
••-0.632
Token has
Token ID468
Feature activation-1.08e-01

Pos logit contributions
iaz+0.064
••••+0.054
abase+0.052
paralle+0.049
••+0.047
Neg logit contributions
jri-0.067
urgical-0.062
 disg-0.055
warts-0.054
 Generations-0.053
Token come
Token ID1282
Feature activation-6.19e-02

Pos logit contributions
jri+0.084
urgical+0.077
 disg+0.068
warts+0.067
 Ended+0.066
Neg logit contributions
iaz-0.089
••••-0.077
abase-0.073
paralle-0.069
••-0.066
Token a
Token ID257
Feature activation-2.95e-01

Pos logit contributions
jri+0.051
urgical+0.047
 disg+0.042
warts+0.041
 Generations+0.040
Neg logit contributions
iaz-0.048
••••-0.041
abase-0.039
paralle-0.037
••-0.035
Token long
Token ID890
Feature activation+6.07e-01

Pos logit contributions
jri+0.238
urgical+0.220
 disg+0.195
warts+0.191
 Generations+0.187
Neg logit contributions
iaz-0.235
••••-0.202
abase-0.192
paralle-0.180
••-0.173
Token and
Token ID290
Feature activation+3.65e-01

Pos logit contributions
jri+0.150
urgical+0.139
 disg+0.124
warts+0.121
 Ended+0.120
Neg logit contributions
iaz-0.132
••••-0.112
abase-0.107
paralle-0.100
••-0.095
Token never
Token ID1239
Feature activation+1.08e-01

Pos logit contributions
iaz+0.279
••••+0.237
abase+0.227
paralle+0.212
••+0.201
Neg logit contributions
jri-0.305
urgical-0.285
 disg-0.251
warts-0.247
 Generations-0.243
Token miss
Token ID2051
Feature activation+2.15e-01

Pos logit contributions
iaz+0.107
••••+0.095
abase+0.092
paralle+0.087
••+0.085
Neg logit contributions
jri-0.065
urgical-0.058
 disg-0.049
warts-0.048
 Generations-0.047
Token our
Token ID674
Feature activation-1.13e-01

Pos logit contributions
iaz+0.161
••••+0.136
abase+0.131
paralle+0.122
••+0.116
Neg logit contributions
jri-0.183
urgical-0.171
 disg-0.151
warts-0.149
 Generations-0.147
Token top
Token ID1353
Feature activation-1.02e+00

Pos logit contributions
jri+0.081
urgical+0.074
 disg+0.064
warts+0.063
 Generations+0.061
Neg logit contributions
iaz-0.101
••••-0.088
abase-0.084
paralle-0.080
••-0.077
Token stories
Token ID3923
Feature activation-1.32e+00

Pos logit contributions
jri+0.820
urgical+0.752
 disg+0.666
warts+0.652
 Ended+0.641
Neg logit contributions
iaz-0.813
••••-0.702
abase-0.659
paralle-0.627
••-0.602
Token Free
Token ID3232
Feature activation+6.26e-01

Pos logit contributions
jri+1.169
urgical+1.088
 disg+0.975
warts+0.955
 Ended+0.937
Neg logit contributions
iaz-0.946
••••-0.798
abase-0.741
paralle-0.703
••-0.671
Token Sign
Token ID5865
Feature activation+5.39e-01

Pos logit contributions
iaz+0.397
••••+0.328
abase+0.308
paralle+0.286
••+0.267
Neg logit contributions
jri-0.604
urgical-0.565
 disg-0.509
warts-0.502
 Generations-0.501
Token Up
Token ID3205
Feature activation-3.85e-01

Pos logit contributions
iaz+0.386
••••+0.325
abase+0.308
paralle+0.287
phant+0.273
Neg logit contributions
jri-0.477
urgical-0.446
 disg-0.395
warts-0.390
 Generations-0.389
Token
Token ID198
Feature activation+5.31e-02

Pos logit contributions
jri+0.331
urgical+0.306
 disg+0.274
warts+0.268
 Ended+0.264
Neg logit contributions
iaz-0.286
••••-0.243
abase-0.230
paralle-0.215
••-0.206
Token
Token ID198
Feature activation-1.53e-01

Pos logit contributions
iaz+0.040
••••+0.034
abase+0.032
paralle+0.030
••+0.029
Neg logit contributions
jri-0.045
urgical-0.042
 disg-0.037
warts-0.037
 Ended-0.036