TOP ACTIVATIONS
MAX = 3.976

probe : report Vietnamese airline takes steps to open flights to
probe : report Vietnamese airline takes steps to open flights to
probe : report Vietnamese airline takes steps to open flights to
Manafort 's attorneys say he should get less than 10 years
at stars such as Robert Down ey Jr . ( for
Scott Ric her t and David Gordon ( IS I Books
this content : Sylv ie von Du ug las - I
Manafort 's attorneys say he should get less than 10 years
right wing - back Paul Ver ha e gh sacrificed and
Manafort 's attorneys say he should get less than 10 years
airline takes steps to open flights to US on sidelines of
Fore word by Patrick J . Buchanan , and critical essays
airline takes steps to open flights to US on sidelines of
Click to create and send a link using your email application
made by Prime Minister Mar iano Raj oy and his ministers
The birth year of Akira Kuro s awa , director of
airline takes steps to open flights to US on sidelines of
zing ; as Slav oj i ek
request stamp . <|endoftext|> ED MON TON Omar Kh adr
showcase . View more <|endoftext|> About R

MIN ACTIVATIONS
MIN = -2.671

country , and DHS Secretary John Kelly recently said that
opinions expressed by column ists are their own and do not
and ESPN analyst Fran Fr as ch illa .[ 94 ]
File ) L ANS ING , MI -- The
an appearance with Treasury Secretary Steven Mn uchin , that the
SAN FR ANC IS CO ( CBS SF )
, according to one of Lois G . Lerner
the Americans ." <|endoftext|> GL AS G OW - Although he
to per use . Here is the link for buying the
. <|endoftext|> SAN FR ANC IS CO - Reddit and Google
the views of Town hall . com . Bernie
after former Finance Minister Jim Fla herty tightened the country 's
Sign Up : Stay on top of the latest breaking
Please verify you 're not a robot by clicking the box
in ( O mega Pharma - Quick step ) s .
Please verify you 're not a robot by clicking the box
ac ans ole il - DC M ) s . t
Please verify you 're not a robot by clicking the box
Advertisement # ad # They d usted off
Tom ar ( Press Trust of India photo )

INTERVAL 2.314 - 3.976
CONTAINS 0.002%

this content : Sylv ie von Du ug las - I
Scott Ric her t and David Gordon ( IS I Books
Manafort 's attorneys say he should get less than 10 years
at stars such as Robert Down ey Jr . ( for
airline takes steps to open flights to US on sidelines of

INTERVAL 0.653 - 2.314
CONTAINS 1.204%

bank did not feel under any pressure from the government and
Stephen Am ell and Robert Patt inson . While
the fund had ended up in Prime Minister Naj ib Raz
. French President Francois Hollande has set out an
tall as he grants freedom to a crou ching slave .

INTERVAL -1.009 - 0.653
CONTAINS 98.649%

stay at the London embassy indefinitely , although he may impose
Earth did stop moving suddenly , there would be an awful
Our grass - roots momentum is unstoppable ,
fact that he viewed and posted a message in this forum
hear from the candidates on our ballot in the # deb

INTERVAL -2.671 - -1.009
CONTAINS 0.144%

Not ed Republican poll ster Frank L unt z
s so much better than b uss ing tables . He
. <|endoftext|> SAN FR ANC IS CO - Reddit and Google
payments to the National Radio Astron omy Observatory in Charlottesville ,
to legend ," U men h ofer says .

Token probe
Token ID12774
Feature activation-2.60e-01

Pos logit contributions
 Dek+0.183
nc+0.177
 Jou+0.172
jug+0.171
�醒+0.169
Neg logit contributions
wrapper-0.168
arity-0.167
istries-0.150
olson-0.146
inski-0.146
Token:
Token ID25
Feature activation-2.31e-01

Pos logit contributions
wrapper+0.195
arity+0.193
istries+0.177
olson+0.172
inski+0.172
Neg logit contributions
 Dek-0.160
nc-0.155
 Jou-0.149
jug-0.148
�醒-0.146
Token report
Token ID989
Feature activation-3.38e-02

Pos logit contributions
wrapper+0.189
arity+0.188
istries+0.173
olson+0.169
inski+0.169
Neg logit contributions
 Dek-0.126
nc-0.122
 Jou-0.117
jug-0.116
�醒-0.114
Token Vietnamese
Token ID23618
Feature activation+1.02e+00

Pos logit contributions
wrapper+0.025
arity+0.025
istries+0.023
olson+0.022
inski+0.022
Neg logit contributions
 Dek-0.021
nc-0.021
 Jou-0.020
jug-0.020
�醒-0.019
Token airline
Token ID18091
Feature activation+1.53e+00

Pos logit contributions
 Dek+0.685
nc+0.654
 Jou+0.631
�醒+0.631
jug+0.630
Neg logit contributions
wrapper-0.711
arity-0.708
istries-0.644
inski-0.629
olson-0.616
Token takes
Token ID2753
Feature activation+3.98e+00

Pos logit contributions
 Dek+1.014
nc+0.979
�醒+0.944
jug+0.934
Ger+0.934
Neg logit contributions
wrapper-1.077
arity-1.067
istries-0.962
inski-0.943
olson-0.926
Token steps
Token ID4831
Feature activation+7.65e-01

Pos logit contributions
nc+2.367
 Dek+2.281
jug+2.232
�醒+2.148
 Jou+2.138
Neg logit contributions
wrapper-3.076
arity-3.065
inski-2.746
olson-2.700
istries-2.694
Token to
Token ID284
Feature activation+1.78e+00

Pos logit contributions
 Dek+0.484
nc+0.471
jug+0.459
�醒+0.448
 Jou+0.448
Neg logit contributions
wrapper-0.558
arity-0.556
istries-0.506
inski-0.493
olson-0.488
Token open
Token ID1280
Feature activation+7.14e-01

Pos logit contributions
 Dek+0.991
nc+0.982
�醒+0.916
jug+0.916
Ger+0.911
Neg logit contributions
wrapper-1.440
arity-1.401
olson-1.299
istries-1.293
inski-1.273
Token flights
Token ID13956
Feature activation+2.21e+00

Pos logit contributions
 Dek+0.491
nc+0.480
jug+0.465
 Jou+0.460
�醒+0.458
Neg logit contributions
wrapper-0.481
arity-0.472
istries-0.433
inski-0.420
olson-0.417
Token to
Token ID284
Feature activation+5.34e-01

Pos logit contributions
 Dek+1.552
nc+1.520
jug+1.482
�醒+1.474
 Muscle+1.469
Neg logit contributions
wrapper-1.442
arity-1.414
istries-1.258
inski-1.244
olson-1.214
Token probe
Token ID12774
Feature activation-2.57e-01

Pos logit contributions
 Dek+0.287
nc+0.278
 Jou+0.269
jug+0.268
�醒+0.266
Neg logit contributions
wrapper-0.283
arity-0.281
istries-0.253
olson-0.248
inski-0.247
Token:
Token ID25
Feature activation+3.46e-01

Pos logit contributions
wrapper+0.176
arity+0.174
istries+0.158
olson+0.153
inski+0.153
Neg logit contributions
 Dek-0.176
nc-0.171
 Jou-0.165
jug-0.164
�醒-0.162
Token report
Token ID989
Feature activation+9.71e-02

Pos logit contributions
 Dek+0.194
nc+0.186
 Jou+0.179
jug+0.178
�醒+0.175
Neg logit contributions
wrapper-0.280
arity-0.277
istries-0.256
inski-0.248
olson-0.247
Token Vietnamese
Token ID23618
Feature activation+8.51e-01

Pos logit contributions
 Dek+0.060
nc+0.058
 Jou+0.056
jug+0.055
�醒+0.055
Neg logit contributions
wrapper-0.073
arity-0.072
istries-0.066
olson-0.064
inski-0.064
Token airline
Token ID18091
Feature activation+1.16e+00

Pos logit contributions
 Dek+0.585
nc+0.560
 Jou+0.541
jug+0.539
�醒+0.539
Neg logit contributions
wrapper-0.581
arity-0.579
istries-0.525
inski-0.511
olson-0.503
Token takes
Token ID2753
Feature activation+3.75e+00

Pos logit contributions
 Dek+0.797
nc+0.769
�醒+0.741
 Jou+0.738
jug+0.735
Neg logit contributions
wrapper-0.784
arity-0.778
istries-0.699
inski-0.683
olson-0.675
Token steps
Token ID4831
Feature activation+6.61e-01

Pos logit contributions
nc+2.186
 Dek+2.104
jug+2.055
�醒+1.995
 Jou+1.975
Neg logit contributions
wrapper-2.955
arity-2.938
inski-2.640
istries-2.604
olson-2.591
Token to
Token ID284
Feature activation+1.70e+00

Pos logit contributions
 Dek+0.417
nc+0.406
jug+0.395
 Jou+0.386
�醒+0.386
Neg logit contributions
wrapper-0.484
arity-0.482
istries-0.438
inski-0.427
olson-0.423
Token open
Token ID1280
Feature activation+5.85e-01

Pos logit contributions
 Dek+0.970
nc+0.953
�醒+0.900
jug+0.896
Ger+0.892
Neg logit contributions
wrapper-1.345
arity-1.313
istries-1.209
olson-1.207
inski-1.189
Token flights
Token ID13956
Feature activation+2.38e+00

Pos logit contributions
 Dek+0.426
nc+0.417
jug+0.404
 Jou+0.401
�醒+0.398
Neg logit contributions
wrapper-0.371
arity-0.364
istries-0.332
inski-0.320
olson-0.318
Token to
Token ID284
Feature activation+3.29e-01

Pos logit contributions
 Dek+1.736
nc+1.705
jug+1.673
�醒+1.663
 Muscle+1.653
Neg logit contributions
wrapper-1.479
arity-1.452
istries-1.288
inski-1.279
olson-1.233
Token probe
Token ID12774
Feature activation-1.07e-01

Pos logit contributions
 Dek+0.433
nc+0.419
 Jou+0.405
jug+0.403
�醒+0.402
Neg logit contributions
wrapper-0.463
arity-0.461
istries-0.415
olson-0.408
inski-0.407
Token:
Token ID25
Feature activation-2.43e-01

Pos logit contributions
wrapper+0.075
arity+0.074
istries+0.067
olson+0.065
inski+0.065
Neg logit contributions
 Dek-0.072
nc-0.070
 Jou-0.068
jug-0.067
�醒-0.066
Token report
Token ID989
Feature activation+1.81e-01

Pos logit contributions
wrapper+0.199
arity+0.197
istries+0.182
olson+0.177
inski+0.177
Neg logit contributions
 Dek-0.133
nc-0.128
 Jou-0.123
jug-0.122
�醒-0.120
Token Vietnamese
Token ID23618
Feature activation+9.03e-01

Pos logit contributions
 Dek+0.115
nc+0.112
 Jou+0.107
jug+0.107
�醒+0.105
Neg logit contributions
wrapper-0.132
arity-0.131
istries-0.119
olson-0.116
inski-0.116
Token airline
Token ID18091
Feature activation+1.34e+00

Pos logit contributions
 Dek+0.606
nc+0.579
 Jou+0.559
�醒+0.558
jug+0.557
Neg logit contributions
wrapper-0.633
arity-0.629
istries-0.572
inski-0.557
olson-0.549
Token takes
Token ID2753
Feature activation+3.60e+00

Pos logit contributions
 Dek+0.936
nc+0.904
�醒+0.872
 Jou+0.868
jug+0.865
Neg logit contributions
wrapper-0.901
arity-0.895
istries-0.801
inski-0.785
olson-0.774
Token steps
Token ID4831
Feature activation+8.37e-01

Pos logit contributions
nc+2.058
 Dek+1.982
jug+1.925
�醒+1.876
 Jou+1.863
Neg logit contributions
wrapper-2.868
arity-2.855
inski-2.567
istries-2.545
olson-2.523
Token to
Token ID284
Feature activation+1.68e+00

Pos logit contributions
 Dek+0.530
nc+0.516
jug+0.504
�醒+0.491
 Jou+0.491
Neg logit contributions
wrapper-0.610
arity-0.608
istries-0.551
inski-0.537
olson-0.531
Token open
Token ID1280
Feature activation+6.35e-01

Pos logit contributions
 Dek+0.956
nc+0.936
�醒+0.885
jug+0.881
Ger+0.880
Neg logit contributions
wrapper-1.341
arity-1.311
istries-1.205
olson-1.204
inski-1.189
Token flights
Token ID13956
Feature activation+2.27e+00

Pos logit contributions
 Dek+0.460
nc+0.450
jug+0.436
 Jou+0.433
�醒+0.430
Neg logit contributions
wrapper-0.405
arity-0.398
istries-0.363
inski-0.350
olson-0.348
Token to
Token ID284
Feature activation+2.32e-01

Pos logit contributions
 Dek+1.633
nc+1.601
jug+1.573
�醒+1.565
 Muscle+1.549
Neg logit contributions
wrapper-1.444
arity-1.415
istries-1.257
inski-1.248
olson-1.210
Token Manafort
Token ID26351
Feature activation-9.35e-01

Pos logit contributions
 Dek+0.148
nc+0.144
 Jou+0.138
jug+0.137
�醒+0.135
Neg logit contributions
wrapper-0.203
arity-0.202
istries-0.185
olson-0.180
inski-0.180
Token's
Token ID338
Feature activation+1.53e+00

Pos logit contributions
wrapper+0.687
arity+0.682
istries+0.625
olson+0.604
inski+0.602
Neg logit contributions
 Dek-0.592
nc-0.573
 Jou-0.551
jug-0.548
Ger-0.539
Token attorneys
Token ID14449
Feature activation+7.72e-01

Pos logit contributions
 Dek+1.092
nc+1.062
 Jou+1.030
jug+1.023
 Muscle+1.011
Neg logit contributions
wrapper-0.995
arity-0.991
istries-0.897
inski-0.872
olson-0.865
Token say
Token ID910
Feature activation+9.58e-01

Pos logit contributions
 Dek+0.485
nc+0.472
jug+0.451
 Jou+0.451
�醒+0.442
Neg logit contributions
arity-0.568
wrapper-0.565
istries-0.516
inski-0.507
olson-0.503
Token he
Token ID339
Feature activation+1.37e+00

Pos logit contributions
 Dek+0.633
nc+0.614
 Jou+0.593
jug+0.593
�醒+0.588
Neg logit contributions
wrapper-0.669
arity-0.668
istries-0.600
inski-0.592
olson-0.579
Token should
Token ID815
Feature activation+2.70e+00

Pos logit contributions
 Dek+0.891
nc+0.871
 Jou+0.839
�醒+0.836
jug+0.833
Neg logit contributions
wrapper-0.969
arity-0.966
istries-0.858
inski-0.854
olson-0.844
Token get
Token ID651
Feature activation+6.73e-01

Pos logit contributions
 Dek+2.021
nc+1.914
 Jou+1.873
jug+1.859
Registered+1.854
Neg logit contributions
wrapper-1.695
arity-1.679
istries-1.496
inski-1.456
olson-1.454
Token less
Token ID1342
Feature activation+1.12e+00

Pos logit contributions
 Dek+0.418
nc+0.410
jug+0.388
 Jou+0.387
Registered+0.382
Neg logit contributions
wrapper-0.499
arity-0.496
istries-0.449
olson-0.440
inski-0.438
Token than
Token ID621
Feature activation+1.37e-01

Pos logit contributions
 Dek+0.741
nc+0.721
 Jou+0.700
jug+0.691
Ger+0.686
Neg logit contributions
wrapper-0.776
arity-0.769
istries-0.690
inski-0.685
olson-0.676
Token 10
Token ID838
Feature activation-4.11e-02

Pos logit contributions
 Dek+0.088
nc+0.086
 Jou+0.082
jug+0.081
 Muscle+0.081
Neg logit contributions
wrapper-0.098
arity-0.098
istries-0.089
olson-0.086
inski-0.086
Token years
Token ID812
Feature activation-5.05e-02

Pos logit contributions
wrapper+0.029
arity+0.029
istries+0.026
olson+0.025
inski+0.025
Neg logit contributions
 Dek-0.027
nc-0.026
 Jou-0.025
jug-0.025
�醒-0.025
Token at
Token ID379
Feature activation+5.38e-01

Pos logit contributions
 Dek+0.166
nc+0.161
 Jou+0.156
jug+0.155
�醒+0.153
Neg logit contributions
wrapper-0.158
arity-0.157
istries-0.142
inski-0.138
olson-0.137
Token stars
Token ID5788
Feature activation+3.69e-01

Pos logit contributions
 Dek+0.341
nc+0.332
 Jou+0.317
jug+0.317
�醒+0.314
Neg logit contributions
wrapper-0.394
arity-0.390
istries-0.355
olson-0.346
inski-0.346
Token such
Token ID884
Feature activation+3.45e-01

Pos logit contributions
 Dek+0.242
nc+0.236
 Jou+0.227
jug+0.226
�醒+0.224
Neg logit contributions
wrapper-0.261
arity-0.259
istries-0.235
inski-0.229
olson-0.229
Token as
Token ID355
Feature activation+3.63e-01

Pos logit contributions
 Dek+0.221
nc+0.214
 Jou+0.206
jug+0.206
�醒+0.205
Neg logit contributions
wrapper-0.249
arity-0.247
istries-0.225
inski-0.219
olson-0.219
Token Robert
Token ID5199
Feature activation+2.43e-01

Pos logit contributions
 Dek+0.234
nc+0.228
 Jou+0.218
jug+0.218
�醒+0.216
Neg logit contributions
wrapper-0.261
arity-0.258
istries-0.235
inski-0.229
olson-0.229
Token Down
Token ID5588
Feature activation+2.63e+00

Pos logit contributions
 Dek+0.152
nc+0.148
 Jou+0.142
jug+0.142
�醒+0.141
Neg logit contributions
wrapper-0.179
arity-0.177
istries-0.162
inski-0.158
olson-0.158
Tokeney
Token ID2959
Feature activation+1.62e+00

Pos logit contributions
 Dek+0.936
nc+0.887
�醒+0.874
jug+0.831
 Jou+0.822
Neg logit contributions
arity-2.641
wrapper-2.635
inski-2.491
olson-2.431
istries-2.422
Token Jr
Token ID7504
Feature activation-3.04e-04

Pos logit contributions
 Dek+1.362
nc+1.342
 Jou+1.309
�醒+1.305
jug+1.303
Neg logit contributions
wrapper-0.833
arity-0.816
istries-0.720
olson-0.703
inski-0.700
Token.
Token ID13
Feature activation+1.35e-01

Pos logit contributions
wrapper+0.000
arity+0.000
istries+0.000
olson+0.000
inski+0.000
Neg logit contributions
 Dek-0.000
nc-0.000
 Jou-0.000
jug-0.000
�醒-0.000
Token (
Token ID357
Feature activation+2.02e-01

Pos logit contributions
 Dek+0.085
nc+0.082
 Jou+0.079
jug+0.078
�醒+0.077
Neg logit contributions
wrapper-0.099
arity-0.098
istries-0.090
inski-0.087
olson-0.087
Tokenfor
Token ID1640
Feature activation-1.39e-01

Pos logit contributions
 Dek+0.128
nc+0.124
 Jou+0.119
jug+0.118
�醒+0.117
Neg logit contributions
wrapper-0.149
arity-0.147
istries-0.134
inski-0.131
olson-0.131
Token Scott
Token ID4746
Feature activation+1.30e+00

Pos logit contributions
wrapper+0.678
arity+0.671
istries+0.608
olson+0.591
inski+0.589
Neg logit contributions
 Dek-0.717
nc-0.693
 Jou-0.676
jug-0.666
Registered-0.661
Token Ric
Token ID15868
Feature activation-2.49e-01

Pos logit contributions
 Dek+0.578
nc+0.571
�醒+0.531
jug+0.525
Registered+0.524
Neg logit contributions
wrapper-1.172
arity-1.170
istries-1.095
inski-1.070
olson-1.070
Tokenher
Token ID372
Feature activation+4.48e-01

Pos logit contributions
wrapper+0.147
arity+0.146
istries+0.130
olson+0.125
inski+0.125
Neg logit contributions
 Dek-0.193
nc-0.188
 Jou-0.183
jug-0.182
�醒-0.180
Tokent
Token ID83
Feature activation+1.56e+00

Pos logit contributions
 Dek+0.242
nc+0.232
jug+0.223
 Jou+0.222
�醒+0.221
Neg logit contributions
wrapper-0.371
arity-0.369
istries-0.339
olson-0.333
inski-0.333
Token and
Token ID290
Feature activation-2.16e-01

Pos logit contributions
 Dek+1.069
nc+1.054
 Jou+0.997
jug+0.995
Registered+0.993
Neg logit contributions
wrapper-1.054
arity-1.046
inski-0.933
istries-0.929
olson-0.919
Token David
Token ID3271
Feature activation+2.55e+00

Pos logit contributions
wrapper+0.161
arity+0.159
istries+0.146
olson+0.142
inski+0.142
Neg logit contributions
 Dek-0.134
nc-0.130
 Jou-0.125
jug-0.124
�醒-0.122
Token Gordon
Token ID11646
Feature activation+1.69e+00

Pos logit contributions
 Dek+1.976
�醒+1.954
nc+1.940
jug+1.891
Registered+1.882
Neg logit contributions
wrapper-1.458
arity-1.457
inski-1.279
olson-1.273
istries-1.265
Token (
Token ID357
Feature activation+1.31e-01

Pos logit contributions
 Dek+1.069
nc+1.055
�醒+1.021
jug+1.021
Registered+1.019
Neg logit contributions
wrapper-1.190
arity-1.188
istries-1.080
olson-1.073
inski-1.061
TokenIS
Token ID1797
Feature activation+7.72e-01

Pos logit contributions
 Dek+0.066
nc+0.064
 Jou+0.061
jug+0.060
�醒+0.059
Neg logit contributions
wrapper-0.114
arity-0.113
istries-0.104
inski-0.102
olson-0.102
TokenI
Token ID40
Feature activation+1.07e+00

Pos logit contributions
 Dek+0.463
nc+0.446
 Jou+0.435
jug+0.429
�醒+0.426
Neg logit contributions
wrapper-0.593
arity-0.587
istries-0.536
olson-0.526
inski-0.521
Token Books
Token ID13661
Feature activation-1.41e+00

Pos logit contributions
 Dek+0.693
nc+0.685
 Jou+0.664
jug+0.659
�醒+0.650
Neg logit contributions
arity-0.761
wrapper-0.756
istries-0.682
inski-0.664
olson-0.661
Token this
Token ID428
Feature activation-2.96e-01

Pos logit contributions
 Dek+0.104
nc+0.101
 Jou+0.097
jug+0.097
�醒+0.096
Neg logit contributions
wrapper-0.128
arity-0.127
istries-0.116
inski-0.113
olson-0.113
Token content
Token ID2695
Feature activation-4.21e-01

Pos logit contributions
wrapper+0.218
arity+0.216
istries+0.198
olson+0.192
inski+0.192
Neg logit contributions
 Dek-0.186
nc-0.180
 Jou-0.173
jug-0.172
�醒-0.170
Token:
Token ID25
Feature activation-5.01e-01

Pos logit contributions
wrapper+0.304
arity+0.301
istries+0.274
olson+0.267
inski+0.266
Neg logit contributions
 Dek-0.272
nc-0.264
 Jou-0.254
jug-0.252
�醒-0.249
Token Sylv
Token ID24286
Feature activation+1.13e+00

Pos logit contributions
wrapper+0.438
arity+0.435
istries+0.404
olson+0.394
inski+0.394
Neg logit contributions
 Dek-0.247
nc-0.237
 Jou-0.226
jug-0.223
 Muscle-0.219
Tokenie
Token ID494
Feature activation+1.66e+00

Pos logit contributions
 Dek+1.069
nc+1.038
 Jou+1.013
�醒+1.009
jug+1.008
Neg logit contributions
arity-0.475
wrapper-0.469
istries-0.398
inski-0.373
olson-0.373
Token von
Token ID18042
Feature activation+2.53e+00

Pos logit contributions
 Dek+0.866
nc+0.840
jug+0.809
�醒+0.801
Registered+0.795
Neg logit contributions
wrapper-1.389
arity-1.369
istries-1.257
olson-1.252
inski-1.239
Token Du
Token ID10343
Feature activation+1.46e+00

Pos logit contributions
 Dek+1.808
nc+1.763
�醒+1.757
jug+1.727
Registered+1.707
Neg logit contributions
wrapper-1.636
arity-1.585
inski-1.408
istries-1.395
olson-1.339
Tokenug
Token ID1018
Feature activation+5.20e-01

Pos logit contributions
 Dek+0.894
nc+0.863
 Jou+0.830
Registered+0.827
�醒+0.825
Neg logit contributions
wrapper-1.097
arity-1.085
istries-0.994
inski-0.973
olson-0.971
Tokenlas
Token ID21921
Feature activation+1.01e-02

Pos logit contributions
 Dek+0.307
nc+0.294
 Jou+0.283
jug+0.281
�醒+0.277
Neg logit contributions
wrapper-0.405
arity-0.400
istries-0.368
inski-0.360
olson-0.359
Token-
Token ID12
Feature activation-4.87e-01

Pos logit contributions
 Dek+0.006
nc+0.006
 Jou+0.006
jug+0.006
�醒+0.005
Neg logit contributions
wrapper-0.008
arity-0.008
istries-0.007
olson-0.007
inski-0.007
TokenI
Token ID40
Feature activation+1.91e+00

Pos logit contributions
wrapper+0.386
arity+0.383
istries+0.352
olson+0.343
inski+0.342
Neg logit contributions
 Dek-0.280
nc-0.270
 Jou-0.259
jug-0.257
�醒-0.253
Token Manafort
Token ID26351
Feature activation-8.58e-01

Pos logit contributions
 Dek+0.106
nc+0.103
 Jou+0.099
jug+0.098
�醒+0.097
Neg logit contributions
wrapper-0.148
arity-0.147
istries-0.135
olson-0.132
inski-0.131
Token's
Token ID338
Feature activation+1.59e+00

Pos logit contributions
wrapper+0.630
arity+0.625
istries+0.573
olson+0.554
inski+0.552
Neg logit contributions
 Dek-0.544
nc-0.526
 Jou-0.507
jug-0.504
Ger-0.495
Token attorneys
Token ID14449
Feature activation+7.45e-01

Pos logit contributions
 Dek+1.167
nc+1.134
 Jou+1.100
jug+1.092
�醒+1.083
Neg logit contributions
wrapper-1.009
arity-1.002
istries-0.905
inski-0.879
olson-0.872
Token say
Token ID910
Feature activation+1.03e+00

Pos logit contributions
 Dek+0.419
nc+0.406
jug+0.386
 Jou+0.386
�醒+0.379
Neg logit contributions
arity-0.596
wrapper-0.595
istries-0.547
inski-0.538
olson-0.535
Token he
Token ID339
Feature activation+1.18e+00

Pos logit contributions
 Dek+0.685
nc+0.666
jug+0.643
 Jou+0.643
�醒+0.636
Neg logit contributions
wrapper-0.710
arity-0.708
istries-0.633
inski-0.628
olson-0.613
Token should
Token ID815
Feature activation+2.50e+00

Pos logit contributions
 Dek+0.766
nc+0.747
 Jou+0.720
�醒+0.714
jug+0.714
Neg logit contributions
wrapper-0.840
arity-0.835
istries-0.745
inski-0.740
olson-0.733
Token get
Token ID651
Feature activation+6.74e-01

Pos logit contributions
 Dek+1.926
nc+1.827
 Jou+1.788
jug+1.779
Registered+1.774
Neg logit contributions
wrapper-1.517
arity-1.507
istries-1.337
olson-1.302
inski-1.300
Token less
Token ID1342
Feature activation+1.12e+00

Pos logit contributions
 Dek+0.430
nc+0.422
jug+0.400
 Jou+0.399
Registered+0.395
Neg logit contributions
wrapper-0.487
arity-0.485
istries-0.436
olson-0.429
inski-0.427
Token than
Token ID621
Feature activation-3.03e-02

Pos logit contributions
 Dek+0.747
nc+0.726
 Jou+0.706
jug+0.697
Ger+0.691
Neg logit contributions
wrapper-0.782
arity-0.775
istries-0.695
inski-0.690
olson-0.682
Token 10
Token ID838
Feature activation-3.98e-02

Pos logit contributions
wrapper+0.022
arity+0.022
istries+0.020
olson+0.019
inski+0.019
Neg logit contributions
 Dek-0.019
nc-0.019
 Jou-0.018
jug-0.018
 Muscle-0.018
Token years
Token ID812
Feature activation+3.20e-02

Pos logit contributions
wrapper+0.028
arity+0.028
istries+0.026
olson+0.025
inski+0.025
Neg logit contributions
 Dek-0.026
nc-0.025
 Jou-0.024
jug-0.024
�醒-0.024
Token right
Token ID826
Feature activation-3.54e-01

Pos logit contributions
wrapper+0.085
arity+0.084
istries+0.076
olson+0.074
inski+0.074
Neg logit contributions
 Dek-0.081
nc-0.078
 Jou-0.075
jug-0.075
�醒-0.074
Token wing
Token ID8539
Feature activation-4.76e-01

Pos logit contributions
wrapper+0.258
arity+0.255
istries+0.233
olson+0.226
inski+0.226
Neg logit contributions
 Dek-0.227
nc-0.220
 Jou-0.212
jug-0.210
�醒-0.207
Token-
Token ID12
Feature activation-3.40e-01

Pos logit contributions
wrapper+0.352
arity+0.349
istries+0.319
olson+0.310
inski+0.309
Neg logit contributions
 Dek-0.299
nc-0.290
 Jou-0.279
jug-0.277
�醒-0.273
Tokenback
Token ID1891
Feature activation-2.33e-01

Pos logit contributions
wrapper+0.303
arity+0.300
istries+0.279
olson+0.273
inski+0.272
Neg logit contributions
 Dek-0.162
nc-0.156
 Jou-0.148
jug-0.146
�醒-0.144
Token Paul
Token ID3362
Feature activation-3.89e-01

Pos logit contributions
wrapper+0.171
arity+0.169
istries+0.155
olson+0.151
inski+0.150
Neg logit contributions
 Dek-0.147
nc-0.143
 Jou-0.137
jug-0.136
�醒-0.135
Token Ver
Token ID4643
Feature activation+2.41e+00

Pos logit contributions
wrapper+0.357
arity+0.355
istries+0.330
olson+0.323
inski+0.323
Neg logit contributions
 Dek-0.175
nc-0.167
 Jou-0.158
jug-0.156
�醒-0.153
Tokenha
Token ID3099
Feature activation+7.26e-01

Pos logit contributions
 Dek+1.622
nc+1.587
Registered+1.559
�醒+1.554
 Jou+1.534
Neg logit contributions
arity-1.671
wrapper-1.654
istries-1.513
inski-1.468
olson-1.445
Tokene
Token ID68
Feature activation-3.05e-01

Pos logit contributions
 Dek+0.369
nc+0.351
�醒+0.333
 Jou+0.332
jug+0.327
Neg logit contributions
wrapper-0.624
arity-0.621
istries-0.575
inski-0.563
olson-0.558
Tokengh
Token ID456
Feature activation-3.66e-01

Pos logit contributions
wrapper+0.179
arity+0.177
istries+0.158
olson+0.152
inski+0.152
Neg logit contributions
 Dek-0.239
nc-0.232
 Jou-0.226
jug-0.224
�醒-0.222
Token sacrificed
Token ID27445
Feature activation-2.90e-01

Pos logit contributions
wrapper+0.268
arity+0.265
istries+0.242
olson+0.235
inski+0.235
Neg logit contributions
 Dek-0.233
nc-0.226
 Jou-0.218
jug-0.216
�醒-0.213
Token and
Token ID290
Feature activation-7.44e-02

Pos logit contributions
wrapper+0.211
arity+0.210
istries+0.191
olson+0.186
inski+0.186
Neg logit contributions
 Dek-0.185
nc-0.179
 Jou-0.173
jug-0.171
�醒-0.169
Token Manafort
Token ID26351
Feature activation-8.88e-01

Pos logit contributions
wrapper+0.027
arity+0.027
istries+0.024
olson+0.024
inski+0.024
Neg logit contributions
 Dek-0.020
nc-0.019
 Jou-0.018
jug-0.018
�醒-0.018
Token's
Token ID338
Feature activation+1.59e+00

Pos logit contributions
wrapper+0.654
arity+0.649
istries+0.595
olson+0.575
inski+0.573
Neg logit contributions
 Dek-0.561
nc-0.543
 Jou-0.523
jug-0.520
Ger-0.510
Token attorneys
Token ID14449
Feature activation+9.19e-01

Pos logit contributions
 Dek+1.141
nc+1.111
 Jou+1.077
jug+1.067
�醒+1.060
Neg logit contributions
wrapper-1.022
arity-1.014
istries-0.918
inski-0.893
olson-0.887
Token say
Token ID910
Feature activation+1.03e+00

Pos logit contributions
 Dek+0.508
nc+0.493
 Jou+0.467
jug+0.467
�醒+0.459
Neg logit contributions
arity-0.744
wrapper-0.743
istries-0.684
inski-0.675
olson-0.670
Token he
Token ID339
Feature activation+1.12e+00

Pos logit contributions
 Dek+0.678
nc+0.660
jug+0.636
 Jou+0.635
Registered+0.629
Neg logit contributions
wrapper-0.727
arity-0.725
istries-0.650
inski-0.646
olson-0.631
Token should
Token ID815
Feature activation+2.40e+00

Pos logit contributions
 Dek+0.713
nc+0.695
 Jou+0.670
jug+0.663
�醒+0.662
Neg logit contributions
wrapper-0.813
arity-0.808
istries-0.726
inski-0.719
olson-0.715
Token get
Token ID651
Feature activation+5.26e-01

Pos logit contributions
 Dek+1.878
nc+1.781
 Jou+1.742
jug+1.735
Registered+1.730
Neg logit contributions
wrapper-1.429
arity-1.418
istries-1.259
olson-1.226
inski-1.221
Token less
Token ID1342
Feature activation+1.04e+00

Pos logit contributions
 Dek+0.326
nc+0.319
jug+0.302
 Jou+0.302
�醒+0.298
Neg logit contributions
wrapper-0.391
arity-0.389
istries-0.352
olson-0.346
inski-0.344
Token than
Token ID621
Feature activation+6.69e-02

Pos logit contributions
 Dek+0.686
nc+0.667
 Jou+0.647
jug+0.640
Ger+0.635
Neg logit contributions
wrapper-0.728
arity-0.721
istries-0.648
inski-0.643
olson-0.636
Token 10
Token ID838
Feature activation+1.02e-01

Pos logit contributions
 Dek+0.042
nc+0.041
 Jou+0.040
jug+0.039
 Muscle+0.039
Neg logit contributions
wrapper-0.049
arity-0.048
istries-0.044
olson-0.043
inski-0.043
Token years
Token ID812
Feature activation+2.75e-03

Pos logit contributions
 Dek+0.068
nc+0.066
 Jou+0.063
jug+0.063
�醒+0.062
Neg logit contributions
wrapper-0.072
arity-0.071
istries-0.065
olson-0.063
inski-0.063
Token airline
Token ID18091
Feature activation+1.16e+00

Pos logit contributions
 Dek+0.585
nc+0.560
 Jou+0.541
jug+0.539
�醒+0.539
Neg logit contributions
wrapper-0.581
arity-0.579
istries-0.525
inski-0.511
olson-0.503
Token takes
Token ID2753
Feature activation+3.75e+00

Pos logit contributions
 Dek+0.797
nc+0.769
�醒+0.741
 Jou+0.738
jug+0.735
Neg logit contributions
wrapper-0.784
arity-0.778
istries-0.699
inski-0.683
olson-0.675
Token steps
Token ID4831
Feature activation+6.61e-01

Pos logit contributions
nc+2.186
 Dek+2.104
jug+2.055
�醒+1.995
 Jou+1.975
Neg logit contributions
wrapper-2.955
arity-2.938
inski-2.640
istries-2.604
olson-2.591
Token to
Token ID284
Feature activation+1.70e+00

Pos logit contributions
 Dek+0.417
nc+0.406
jug+0.395
 Jou+0.386
�醒+0.386
Neg logit contributions
wrapper-0.484
arity-0.482
istries-0.438
inski-0.427
olson-0.423
Token open
Token ID1280
Feature activation+5.85e-01

Pos logit contributions
 Dek+0.970
nc+0.953
�醒+0.900
jug+0.896
Ger+0.892
Neg logit contributions
wrapper-1.345
arity-1.313
istries-1.209
olson-1.207
inski-1.189
Token flights
Token ID13956
Feature activation+2.38e+00

Pos logit contributions
 Dek+0.426
nc+0.417
jug+0.404
 Jou+0.401
�醒+0.398
Neg logit contributions
wrapper-0.371
arity-0.364
istries-0.332
inski-0.320
olson-0.318
Token to
Token ID284
Feature activation+3.29e-01

Pos logit contributions
 Dek+1.736
nc+1.705
jug+1.673
�醒+1.663
 Muscle+1.653
Neg logit contributions
wrapper-1.479
arity-1.452
istries-1.288
inski-1.279
olson-1.233
Token US
Token ID1294
Feature activation+7.73e-01

Pos logit contributions
 Dek+0.231
nc+0.225
 Jou+0.216
jug+0.216
�醒+0.213
Neg logit contributions
wrapper-0.219
arity-0.216
istries-0.194
olson-0.190
inski-0.189
Token on
Token ID319
Feature activation-7.61e-01

Pos logit contributions
 Dek+0.487
nc+0.476
 Jou+0.457
jug+0.456
�醒+0.451
Neg logit contributions
wrapper-0.562
arity-0.560
istries-0.509
olson-0.499
inski-0.495
Token sidelines
Token ID29394
Feature activation-2.43e-02

Pos logit contributions
wrapper+0.598
arity+0.593
istries+0.545
olson+0.533
inski+0.529
Neg logit contributions
 Dek-0.444
nc-0.427
 Jou-0.412
jug-0.406
�醒-0.399
Token of
Token ID286
Feature activation+2.23e-01

Pos logit contributions
wrapper+0.018
arity+0.018
istries+0.016
olson+0.016
inski+0.016
Neg logit contributions
 Dek-0.015
nc-0.015
 Jou-0.014
jug-0.014
�醒-0.014
Token Fore
Token ID4558
Feature activation+7.51e-01

Pos logit contributions
 Dek+1.145
nc+1.127
 Jou+1.094
jug+1.093
�醒+1.088
Neg logit contributions
wrapper-0.703
arity-0.697
istries-0.601
inski-0.584
olson-0.573
Tokenword
Token ID4775
Feature activation+2.62e-01

Pos logit contributions
 Dek+0.469
nc+0.446
 Jou+0.441
�醒+0.436
jug+0.435
Neg logit contributions
wrapper-0.561
arity-0.554
istries-0.505
inski-0.491
olson-0.487
Token by
Token ID416
Feature activation-2.15e-01

Pos logit contributions
 Dek+0.183
nc+0.180
 Jou+0.173
jug+0.172
�醒+0.170
Neg logit contributions
wrapper-0.173
arity-0.171
istries-0.155
inski-0.151
olson-0.150
Token Patrick
Token ID9925
Feature activation+1.14e+00

Pos logit contributions
wrapper+0.133
arity+0.132
istries+0.118
olson+0.114
inski+0.114
Neg logit contributions
 Dek-0.161
nc-0.157
 Jou-0.152
jug-0.151
�醒-0.149
Token J
Token ID449
Feature activation+2.30e-01

Pos logit contributions
 Dek+0.625
nc+0.613
�醒+0.591
Registered+0.580
jug+0.578
Neg logit contributions
wrapper-0.921
arity-0.921
istries-0.852
olson-0.821
inski-0.820
Token.
Token ID13
Feature activation+2.28e+00

Pos logit contributions
 Dek+0.143
nc+0.139
 Jou+0.134
jug+0.133
�醒+0.132
Neg logit contributions
wrapper-0.171
arity-0.170
istries-0.155
olson-0.152
inski-0.151
Token Buchanan
Token ID36358
Feature activation-3.00e-01

Pos logit contributions
 Dek+1.635
nc+1.616
�醒+1.606
Registered+1.592
jug+1.567
Neg logit contributions
wrapper-1.433
arity-1.418
istries-1.289
olson-1.251
inski-1.241
Token,
Token ID11
Feature activation+6.44e-01

Pos logit contributions
wrapper+0.227
arity+0.225
istries+0.206
olson+0.200
inski+0.200
Neg logit contributions
 Dek-0.184
nc-0.178
 Jou-0.171
jug-0.170
�醒-0.167
Token and
Token ID290
Feature activation+2.35e-01

Pos logit contributions
 Dek+0.428
nc+0.417
 Jou+0.403
jug+0.402
�醒+0.395
Neg logit contributions
wrapper-0.451
arity-0.447
istries-0.404
inski-0.396
olson-0.392
Token critical
Token ID4688
Feature activation+1.68e+00

Pos logit contributions
 Dek+0.130
nc+0.126
 Jou+0.121
jug+0.120
�醒+0.118
Neg logit contributions
wrapper-0.191
arity-0.189
istries-0.174
inski-0.170
olson-0.170
Token essays
Token ID27126
Feature activation-9.11e-01

Pos logit contributions
 Dek+1.363
nc+1.334
jug+1.304
 Jou+1.299
�醒+1.288
Neg logit contributions
arity-0.944
wrapper-0.942
istries-0.835
inski-0.780
olson-0.775
Token airline
Token ID18091
Feature activation+1.34e+00

Pos logit contributions
 Dek+0.606
nc+0.579
 Jou+0.559
�醒+0.558
jug+0.557
Neg logit contributions
wrapper-0.633
arity-0.629
istries-0.572
inski-0.557
olson-0.549
Token takes
Token ID2753
Feature activation+3.60e+00

Pos logit contributions
 Dek+0.936
nc+0.904
�醒+0.872
 Jou+0.868
jug+0.865
Neg logit contributions
wrapper-0.901
arity-0.895
istries-0.801
inski-0.785
olson-0.774
Token steps
Token ID4831
Feature activation+8.37e-01

Pos logit contributions
nc+2.058
 Dek+1.982
jug+1.925
�醒+1.876
 Jou+1.863
Neg logit contributions
wrapper-2.868
arity-2.855
inski-2.567
istries-2.545
olson-2.523
Token to
Token ID284
Feature activation+1.68e+00

Pos logit contributions
 Dek+0.530
nc+0.516
jug+0.504
�醒+0.491
 Jou+0.491
Neg logit contributions
wrapper-0.610
arity-0.608
istries-0.551
inski-0.537
olson-0.531
Token open
Token ID1280
Feature activation+6.35e-01

Pos logit contributions
 Dek+0.956
nc+0.936
�醒+0.885
jug+0.881
Ger+0.880
Neg logit contributions
wrapper-1.341
arity-1.311
istries-1.205
olson-1.204
inski-1.189
Token flights
Token ID13956
Feature activation+2.27e+00

Pos logit contributions
 Dek+0.460
nc+0.450
jug+0.436
 Jou+0.433
�醒+0.430
Neg logit contributions
wrapper-0.405
arity-0.398
istries-0.363
inski-0.350
olson-0.348
Token to
Token ID284
Feature activation+2.32e-01

Pos logit contributions
 Dek+1.633
nc+1.601
jug+1.573
�醒+1.565
 Muscle+1.549
Neg logit contributions
wrapper-1.444
arity-1.415
istries-1.257
inski-1.248
olson-1.210
Token US
Token ID1294
Feature activation+1.02e+00

Pos logit contributions
 Dek+0.170
nc+0.166
 Jou+0.160
jug+0.160
�醒+0.158
Neg logit contributions
wrapper-0.147
arity-0.145
istries-0.130
olson-0.127
inski-0.126
Token on
Token ID319
Feature activation-6.46e-01

Pos logit contributions
 Dek+0.642
nc+0.630
jug+0.606
 Jou+0.605
�醒+0.599
Neg logit contributions
arity-0.733
wrapper-0.733
istries-0.664
olson-0.652
inski-0.646
Token sidelines
Token ID29394
Feature activation+5.48e-02

Pos logit contributions
wrapper+0.508
arity+0.503
istries+0.463
olson+0.452
inski+0.449
Neg logit contributions
 Dek-0.376
nc-0.362
 Jou-0.349
jug-0.345
�醒-0.339
Token of
Token ID286
Feature activation+2.55e-01

Pos logit contributions
 Dek+0.034
nc+0.033
 Jou+0.032
jug+0.032
�醒+0.031
Neg logit contributions
wrapper-0.041
arity-0.040
istries-0.037
olson-0.036
inski-0.036
Token Click
Token ID6914
Feature activation+5.60e-01

Pos logit contributions
 Dek+0.056
nc+0.054
 Jou+0.052
jug+0.052
�醒+0.051
Neg logit contributions
wrapper-0.067
arity-0.067
istries-0.061
inski-0.059
olson-0.059
Token to
Token ID284
Feature activation+7.71e-02

Pos logit contributions
 Dek+0.342
nc+0.332
 Jou+0.318
jug+0.316
�醒+0.313
Neg logit contributions
wrapper-0.424
arity-0.419
istries-0.383
inski-0.373
olson-0.372
Token create
Token ID2251
Feature activation+7.43e-01

Pos logit contributions
 Dek+0.051
nc+0.050
 Jou+0.048
jug+0.048
�醒+0.047
Neg logit contributions
wrapper-0.054
arity-0.054
istries-0.049
inski-0.047
olson-0.047
Token and
Token ID290
Feature activation+4.95e-01

Pos logit contributions
 Dek+0.471
nc+0.459
 Jou+0.441
jug+0.439
�醒+0.438
Neg logit contributions
wrapper-0.543
arity-0.533
istries-0.485
inski-0.476
olson-0.472
Token send
Token ID3758
Feature activation+1.54e+00

Pos logit contributions
 Dek+0.259
nc+0.250
 Jou+0.239
jug+0.236
�醒+0.234
Neg logit contributions
wrapper-0.416
arity-0.412
istries-0.382
inski-0.374
olson-0.374
Token a
Token ID257
Feature activation+2.25e+00

Pos logit contributions
 Dek+1.083
nc+1.061
�醒+1.036
jug+1.026
 Jou+1.025
Neg logit contributions
wrapper-1.016
arity-0.994
istries-0.889
inski-0.874
olson-0.869
Token link
Token ID2792
Feature activation+1.20e+00

Pos logit contributions
nc+1.651
 Dek+1.643
�醒+1.621
jug+1.594
 Jou+1.570
Neg logit contributions
wrapper-1.408
arity-1.358
istries-1.197
olson-1.178
inski-1.177
Token using
Token ID1262
Feature activation+5.73e-01

Pos logit contributions
 Dek+0.687
nc+0.671
jug+0.644
 Jou+0.644
�醒+0.643
Neg logit contributions
wrapper-0.941
arity-0.919
istries-0.853
olson-0.837
inski-0.826
Token your
Token ID534
Feature activation+1.38e+00

Pos logit contributions
 Dek+0.424
nc+0.414
 Jou+0.399
jug+0.397
�醒+0.394
Neg logit contributions
wrapper-0.359
arity-0.355
istries-0.316
olson-0.307
inski-0.307
Token email
Token ID3053
Feature activation+8.19e-01

Pos logit contributions
 Dek+0.920
nc+0.893
jug+0.856
�醒+0.853
 Jou+0.852
Neg logit contributions
wrapper-0.968
arity-0.960
istries-0.861
olson-0.848
inski-0.843
Token application
Token ID3586
Feature activation+1.43e+00

Pos logit contributions
 Dek+0.482
nc+0.465
 Jou+0.444
jug+0.444
�醒+0.441
Neg logit contributions
wrapper-0.640
arity-0.629
istries-0.577
inski-0.565
olson-0.564
Token made
Token ID925
Feature activation-3.41e-02

Pos logit contributions
wrapper+0.075
arity+0.074
istries+0.068
olson+0.066
inski+0.066
Neg logit contributions
 Dek-0.063
nc-0.061
 Jou-0.059
jug-0.058
�醒-0.058
Token by
Token ID416
Feature activation+1.38e-01

Pos logit contributions
wrapper+0.024
arity+0.024
istries+0.022
olson+0.021
inski+0.021
Neg logit contributions
 Dek-0.023
nc-0.022
 Jou-0.021
jug-0.021
�醒-0.021
Token Prime
Token ID5537
Feature activation+7.53e-01

Pos logit contributions
 Dek+0.088
nc+0.085
 Jou+0.082
jug+0.082
�醒+0.081
Neg logit contributions
wrapper-0.100
arity-0.099
istries-0.090
olson-0.088
inski-0.088
Token Minister
Token ID4139
Feature activation+1.58e+00

Pos logit contributions
 Dek+0.569
nc+0.556
 Jou+0.537
�醒+0.536
jug+0.536
Neg logit contributions
wrapper-0.457
arity-0.452
istries-0.404
inski-0.392
olson-0.389
Token Mar
Token ID1526
Feature activation+6.74e-01

Pos logit contributions
 Dek+0.887
nc+0.859
�醒+0.828
 Jou+0.820
jug+0.816
Neg logit contributions
wrapper-1.263
arity-1.252
istries-1.155
inski-1.125
olson-1.111
Tokeniano
Token ID10115
Feature activation+2.24e+00

Pos logit contributions
 Dek+0.625
nc+0.608
 Jou+0.596
jug+0.594
�醒+0.591
Neg logit contributions
wrapper-0.297
arity-0.294
istries-0.254
inski-0.243
olson-0.238
Token Raj
Token ID13308
Feature activation+8.78e-01

Pos logit contributions
 Dek+1.420
nc+1.391
�醒+1.370
jug+1.356
 Jou+1.343
Neg logit contributions
wrapper-1.587
arity-1.587
inski-1.436
istries-1.433
olson-1.407
Tokenoy
Token ID726
Feature activation-1.21e-01

Pos logit contributions
 Dek+0.436
nc+0.413
 Jou+0.401
jug+0.393
�醒+0.391
Neg logit contributions
wrapper-0.765
arity-0.761
istries-0.703
inski-0.691
olson-0.686
Token and
Token ID290
Feature activation-1.59e-01

Pos logit contributions
wrapper+0.087
arity+0.086
istries+0.079
olson+0.077
inski+0.077
Neg logit contributions
 Dek-0.078
nc-0.075
 Jou-0.072
jug-0.072
�醒-0.071
Token his
Token ID465
Feature activation+1.05e-01

Pos logit contributions
wrapper+0.114
arity+0.113
istries+0.103
olson+0.100
inski+0.100
Neg logit contributions
 Dek-0.104
nc-0.101
 Jou-0.097
jug-0.097
�醒-0.096
Token ministers
Token ID14138
Feature activation-4.68e-03

Pos logit contributions
 Dek+0.061
nc+0.059
 Jou+0.056
jug+0.056
�醒+0.056
Neg logit contributions
wrapper-0.082
arity-0.082
istries-0.075
inski-0.073
olson-0.073
Token The
Token ID383
Feature activation-1.53e-01

Pos logit contributions
wrapper+0.089
arity+0.088
istries+0.081
olson+0.079
inski+0.079
Neg logit contributions
 Dek-0.071
nc-0.069
 Jou-0.066
jug-0.065
�醒-0.065
Token birth
Token ID4082
Feature activation-2.70e-01

Pos logit contributions
wrapper+0.108
arity+0.106
istries+0.097
olson+0.094
inski+0.094
Neg logit contributions
 Dek-0.102
nc-0.099
 Jou-0.096
jug-0.095
�醒-0.094
Token year
Token ID614
Feature activation-3.16e-01

Pos logit contributions
wrapper+0.201
arity+0.199
istries+0.182
olson+0.177
inski+0.177
Neg logit contributions
 Dek-0.169
nc-0.163
 Jou-0.157
jug-0.156
�醒-0.154
Token of
Token ID286
Feature activation-4.72e-03

Pos logit contributions
wrapper+0.236
arity+0.234
istries+0.214
olson+0.208
inski+0.208
Neg logit contributions
 Dek-0.196
nc-0.189
 Jou-0.182
jug-0.181
�醒-0.178
Token Akira
Token ID46910
Feature activation+5.45e-01

Pos logit contributions
wrapper+0.003
arity+0.003
istries+0.003
olson+0.003
inski+0.003
Neg logit contributions
 Dek-0.003
nc-0.003
 Jou-0.003
jug-0.003
�醒-0.003
Token Kuro
Token ID25796
Feature activation+2.23e+00

Pos logit contributions
 Dek+0.332
nc+0.325
 Jou+0.311
jug+0.310
�醒+0.308
Neg logit contributions
wrapper-0.411
arity-0.405
istries-0.370
olson-0.361
inski-0.360
Tokens
Token ID82
Feature activation+1.44e+00

Pos logit contributions
 Dek+1.588
nc+1.537
�醒+1.528
jug+1.513
 Jou+1.496
Neg logit contributions
wrapper-1.459
arity-1.430
istries-1.299
inski-1.297
olson-1.237
Tokenawa
Token ID6909
Feature activation+1.87e-01

Pos logit contributions
 Dek+1.147
nc+1.113
 Jou+1.089
jug+1.087
Registered+1.080
Neg logit contributions
wrapper-0.817
arity-0.803
istries-0.713
inski-0.712
olson-0.686
Token,
Token ID11
Feature activation-1.53e-03

Pos logit contributions
 Dek+0.120
nc+0.116
 Jou+0.112
jug+0.111
�醒+0.110
Neg logit contributions
wrapper-0.135
arity-0.134
istries-0.122
inski-0.119
olson-0.119
Token director
Token ID3437
Feature activation-6.21e-02

Pos logit contributions
wrapper+0.001
arity+0.001
istries+0.001
olson+0.001
inski+0.001
Neg logit contributions
 Dek-0.001
nc-0.001
 Jou-0.001
jug-0.001
�醒-0.001
Token of
Token ID286
Feature activation+5.05e-01

Pos logit contributions
wrapper+0.046
arity+0.046
istries+0.042
olson+0.041
inski+0.041
Neg logit contributions
 Dek-0.039
nc-0.037
 Jou-0.036
jug-0.036
�醒-0.035
Token airline
Token ID18091
Feature activation+1.53e+00

Pos logit contributions
 Dek+0.685
nc+0.654
 Jou+0.631
�醒+0.631
jug+0.630
Neg logit contributions
wrapper-0.711
arity-0.708
istries-0.644
inski-0.629
olson-0.616
Token takes
Token ID2753
Feature activation+3.98e+00

Pos logit contributions
 Dek+1.014
nc+0.979
�醒+0.944
jug+0.934
Ger+0.934
Neg logit contributions
wrapper-1.077
arity-1.067
istries-0.962
inski-0.943
olson-0.926
Token steps
Token ID4831
Feature activation+7.65e-01

Pos logit contributions
nc+2.367
 Dek+2.281
jug+2.232
�醒+2.148
 Jou+2.138
Neg logit contributions
wrapper-3.076
arity-3.065
inski-2.746
olson-2.700
istries-2.694
Token to
Token ID284
Feature activation+1.78e+00

Pos logit contributions
 Dek+0.484
nc+0.471
jug+0.459
�醒+0.448
 Jou+0.448
Neg logit contributions
wrapper-0.558
arity-0.556
istries-0.506
inski-0.493
olson-0.488
Token open
Token ID1280
Feature activation+7.14e-01

Pos logit contributions
 Dek+0.991
nc+0.982
�醒+0.916
jug+0.916
Ger+0.911
Neg logit contributions
wrapper-1.440
arity-1.401
olson-1.299
istries-1.293
inski-1.273
Token flights
Token ID13956
Feature activation+2.21e+00

Pos logit contributions
 Dek+0.491
nc+0.480
jug+0.465
 Jou+0.460
�醒+0.458
Neg logit contributions
wrapper-0.481
arity-0.472
istries-0.433
inski-0.420
olson-0.417
Token to
Token ID284
Feature activation+5.34e-01

Pos logit contributions
 Dek+1.552
nc+1.520
jug+1.482
�醒+1.474
 Muscle+1.469
Neg logit contributions
wrapper-1.442
arity-1.414
istries-1.258
inski-1.244
olson-1.214
Token US
Token ID1294
Feature activation+6.48e-01

Pos logit contributions
 Dek+0.377
nc+0.368
 Jou+0.354
jug+0.354
�醒+0.350
Neg logit contributions
wrapper-0.351
arity-0.346
istries-0.311
olson-0.305
inski-0.302
Token on
Token ID319
Feature activation-5.94e-01

Pos logit contributions
 Dek+0.404
nc+0.395
 Jou+0.379
jug+0.378
�醒+0.374
Neg logit contributions
wrapper-0.475
arity-0.474
istries-0.431
olson-0.423
inski-0.420
Token sidelines
Token ID29394
Feature activation-2.59e-02

Pos logit contributions
wrapper+0.466
arity+0.462
istries+0.424
olson+0.414
inski+0.412
Neg logit contributions
 Dek-0.348
nc-0.335
 Jou-0.323
jug-0.318
�醒-0.314
Token of
Token ID286
Feature activation+1.51e-01

Pos logit contributions
wrapper+0.019
arity+0.019
istries+0.017
olson+0.017
inski+0.017
Neg logit contributions
 Dek-0.016
nc-0.016
 Jou-0.015
jug-0.015
�醒-0.015
Tokenzing
Token ID9510
Feature activation-1.78e-01

Pos logit contributions
 Dek+0.294
nc+0.283
 Jou+0.276
jug+0.273
�醒+0.273
Neg logit contributions
wrapper-0.285
arity-0.282
istries-0.255
olson-0.248
inski-0.248
Token;
Token ID26
Feature activation-7.39e-02

Pos logit contributions
wrapper+0.130
arity+0.129
istries+0.118
olson+0.114
inski+0.114
Neg logit contributions
 Dek-0.113
nc-0.109
 Jou-0.105
jug-0.105
�醒-0.103
Token as
Token ID355
Feature activation+3.02e-01

Pos logit contributions
wrapper+0.053
arity+0.053
istries+0.048
olson+0.047
inski+0.047
Neg logit contributions
 Dek-0.048
nc-0.046
 Jou-0.045
jug-0.044
�醒-0.044
Token Slav
Token ID34314
Feature activation+1.52e-01

Pos logit contributions
 Dek+0.192
nc+0.187
 Jou+0.180
jug+0.179
�醒+0.177
Neg logit contributions
wrapper-0.219
arity-0.218
istries-0.198
inski-0.193
olson-0.193
Tokenoj
Token ID13210
Feature activation+3.58e-01

Pos logit contributions
 Dek+0.097
nc+0.094
 Jou+0.091
jug+0.090
�醒+0.089
Neg logit contributions
wrapper-0.111
arity-0.110
istries-0.100
inski-0.098
olson-0.097
Token
Token ID25370
Feature activation+2.16e+00

Pos logit contributions
 Dek+0.250
nc+0.244
 Jou+0.235
jug+0.234
�醒+0.233
Neg logit contributions
wrapper-0.238
arity-0.236
istries-0.212
inski-0.208
olson-0.206
Token
Token ID121
Feature activation+5.04e-01

Pos logit contributions
 Dek+1.912
nc+1.884
 Jou+1.826
Registered+1.804
 Muscle+1.797
Neg logit contributions
arity-1.049
wrapper-1.021
inski-0.870
istries-0.861
olson-0.856
Tokeni
Token ID72
Feature activation+6.67e-01

Pos logit contributions
 Dek+0.388
nc+0.378
 Jou+0.365
jug+0.363
�醒+0.361
Neg logit contributions
wrapper-0.300
arity-0.299
istries-0.267
inski-0.258
olson-0.257
Token
Token ID129
Feature activation-1.74e-01

Pos logit contributions
 Dek+0.505
nc+0.494
 Jou+0.475
jug+0.475
�醒+0.474
Neg logit contributions
arity-0.405
wrapper-0.404
istries-0.358
inski-0.350
olson-0.347
Token
Token ID122
Feature activation-6.54e-01

Pos logit contributions
wrapper+0.090
arity+0.088
istries+0.077
inski+0.074
olson+0.074
Neg logit contributions
 Dek-0.148
nc-0.144
 Jou-0.140
jug-0.140
�醒-0.138
Tokenek
Token ID988
Feature activation-1.12e-01

Pos logit contributions
wrapper+0.356
arity+0.351
istries+0.311
olson+0.298
inski+0.295
Neg logit contributions
 Dek-0.539
nc-0.526
 Jou-0.510
jug-0.508
Ger-0.501
Token request
Token ID2581
Feature activation+1.20e-01

Pos logit contributions
 Dek+0.122
nc+0.118
 Jou+0.114
jug+0.114
�醒+0.112
Neg logit contributions
wrapper-0.117
arity-0.116
istries-0.105
olson-0.102
inski-0.102
Token stamp
Token ID17977
Feature activation+1.12e-01

Pos logit contributions
 Dek+0.078
nc+0.076
 Jou+0.073
jug+0.073
�醒+0.072
Neg logit contributions
wrapper-0.086
arity-0.085
istries-0.077
olson-0.075
inski-0.075
Token.
Token ID13
Feature activation-1.01e-01

Pos logit contributions
 Dek+0.068
nc+0.065
 Jou+0.063
jug+0.062
�醒+0.062
Neg logit contributions
wrapper-0.085
arity-0.085
istries-0.078
olson-0.076
inski-0.075
Token<|endoftext|>
Token ID50256
Feature activation-2.80e-03

Pos logit contributions
wrapper+0.077
arity+0.076
istries+0.070
olson+0.068
inski+0.068
Neg logit contributions
 Dek-0.061
nc-0.059
 Jou-0.057
jug-0.057
�醒-0.056
TokenED
Token ID1961
Feature activation+5.81e-02

Pos logit contributions
wrapper+0.002
arity+0.002
istries+0.002
olson+0.002
inski+0.002
Neg logit contributions
 Dek-0.002
nc-0.002
 Jou-0.002
jug-0.002
�醒-0.002
TokenMON
Token ID27857
Feature activation+2.13e+00

Pos logit contributions
 Dek+0.034
nc+0.033
 Jou+0.031
jug+0.031
�醒+0.031
Neg logit contributions
wrapper-0.046
arity-0.045
istries-0.042
olson-0.040
inski-0.040
TokenTON
Token ID11357
Feature activation+1.92e-01

Pos logit contributions
 Dek+0.892
nc+0.845
jug+0.821
�醒+0.811
 Jou+0.802
Neg logit contributions
wrapper-2.001
arity-1.958
istries-1.830
olson-1.826
inski-1.817
Token
Token ID784
Feature activation-3.03e-01

Pos logit contributions
 Dek+0.123
nc+0.119
 Jou+0.114
jug+0.114
�醒+0.112
Neg logit contributions
wrapper-0.139
arity-0.138
istries-0.126
olson-0.123
inski-0.122
Token Omar
Token ID24980
Feature activation+1.66e-01

Pos logit contributions
wrapper+0.225
arity+0.223
istries+0.204
olson+0.199
inski+0.198
Neg logit contributions
 Dek-0.189
nc-0.183
 Jou-0.176
jug-0.175
�醒-0.172
Token Kh
Token ID5311
Feature activation-1.44e-01

Pos logit contributions
 Dek+0.109
nc+0.106
 Jou+0.102
jug+0.101
�醒+0.101
Neg logit contributions
wrapper-0.117
arity-0.116
istries-0.105
olson-0.102
inski-0.102
Tokenadr
Token ID41909
Feature activation-2.51e-01

Pos logit contributions
wrapper+0.087
arity+0.086
istries+0.077
olson+0.074
inski+0.074
Neg logit contributions
 Dek-0.110
nc-0.107
 Jou-0.104
jug-0.103
�醒-0.102
Token showcase
Token ID21742
Feature activation-1.00e-01

Pos logit contributions
 Dek+0.170
nc+0.165
 Jou+0.159
jug+0.158
�醒+0.157
Neg logit contributions
wrapper-0.183
arity-0.181
istries-0.165
olson-0.160
inski-0.160
Token.
Token ID13
Feature activation-5.20e-02

Pos logit contributions
wrapper+0.073
arity+0.072
istries+0.066
olson+0.064
inski+0.064
Neg logit contributions
 Dek-0.064
nc-0.062
 Jou-0.060
jug-0.059
�醒-0.058
Token
Token ID198
Feature activation-2.12e-01

Pos logit contributions
wrapper+0.042
arity+0.041
istries+0.038
olson+0.037
inski+0.037
Neg logit contributions
 Dek-0.030
nc-0.028
 Jou-0.027
jug-0.027
�醒-0.027
Token
Token ID198
Feature activation-6.49e-01

Pos logit contributions
wrapper+0.157
arity+0.156
istries+0.142
olson+0.139
inski+0.138
Neg logit contributions
 Dek-0.133
nc-0.128
 Jou-0.124
jug-0.123
�醒-0.121
TokenView
Token ID7680
Feature activation+1.10e+00

Pos logit contributions
wrapper+0.481
arity+0.476
istries+0.438
olson+0.423
inski+0.423
Neg logit contributions
 Dek-0.406
nc-0.394
 Jou-0.378
jug-0.376
Ger-0.370
Token more
Token ID517
Feature activation+2.09e+00

Pos logit contributions
 Dek+0.755
nc+0.733
�醒+0.707
 Jou+0.707
jug+0.702
Neg logit contributions
wrapper-0.747
arity-0.744
istries-0.658
olson-0.655
inski-0.648
Token<|endoftext|>
Token ID50256
Feature activation-2.00e-02

Pos logit contributions
 Dek+0.785
nc+0.751
 Jou+0.697
�醒+0.691
jug+0.677
Neg logit contributions
arity-2.071
wrapper-2.044
olson-1.893
istries-1.888
inski-1.881
TokenAbout
Token ID8585
Feature activation-8.89e-02

Pos logit contributions
wrapper+0.015
arity+0.015
istries+0.014
olson+0.014
inski+0.014
Neg logit contributions
 Dek-0.012
nc-0.012
 Jou-0.011
jug-0.011
�醒-0.011
Token
Token ID198
Feature activation+3.00e-02

Pos logit contributions
wrapper+0.066
arity+0.066
istries+0.060
olson+0.058
inski+0.058
Neg logit contributions
 Dek-0.055
nc-0.053
 Jou-0.051
jug-0.051
�醒-0.050
Token
Token ID198
Feature activation+6.79e-02

Pos logit contributions
 Dek+0.019
nc+0.018
 Jou+0.018
jug+0.018
�醒+0.017
Neg logit contributions
wrapper-0.022
arity-0.022
istries-0.020
olson-0.019
inski-0.019
TokenR
Token ID49
Feature activation-1.05e-01

Pos logit contributions
 Dek+0.041
nc+0.039
 Jou+0.038
jug+0.037
�醒+0.037
Neg logit contributions
wrapper-0.052
arity-0.052
istries-0.047
olson-0.046
inski-0.046
Token country
Token ID1499
Feature activation-1.70e-02

Pos logit contributions
wrapper+0.013
arity+0.013
istries+0.012
olson+0.012
inski+0.012
Neg logit contributions
 Dek-0.017
nc-0.016
 Jou-0.016
jug-0.016
�醒-0.015
Token,
Token ID11
Feature activation-1.06e-01

Pos logit contributions
wrapper+0.013
arity+0.013
istries+0.011
olson+0.011
inski+0.011
Neg logit contributions
 Dek-0.011
nc-0.010
 Jou-0.010
jug-0.010
�醒-0.010
Token and
Token ID290
Feature activation-8.27e-02

Pos logit contributions
wrapper+0.075
arity+0.074
istries+0.068
olson+0.066
inski+0.066
Neg logit contributions
 Dek-0.070
nc-0.068
 Jou-0.065
jug-0.065
�醒-0.064
Token DHS
Token ID29987
Feature activation-4.69e-01

Pos logit contributions
wrapper+0.059
arity+0.058
istries+0.053
olson+0.051
inski+0.051
Neg logit contributions
 Dek-0.054
nc-0.053
 Jou-0.051
jug-0.050
�醒-0.050
Token Secretary
Token ID4986
Feature activation-9.85e-01

Pos logit contributions
wrapper+0.344
arity+0.341
istries+0.312
olson+0.303
inski+0.303
Neg logit contributions
 Dek-0.296
nc-0.287
 Jou-0.277
jug-0.274
�醒-0.270
Token John
Token ID1757
Feature activation-2.67e+00

Pos logit contributions
wrapper+0.761
arity+0.755
istries+0.694
olson+0.676
inski+0.672
Neg logit contributions
 Dek-0.590
nc-0.566
 Jou-0.549
jug-0.538
 Muscle-0.533
Token Kelly
Token ID9077
Feature activation-3.05e-01

Pos logit contributions
wrapper+1.718
arity+1.702
istries+1.572
inski+1.491
olson+1.489
Neg logit contributions
 Dek-1.977
nc-1.876
 Jou-1.827
jug-1.789
 Muscle-1.788
Token recently
Token ID2904
Feature activation-4.42e-01

Pos logit contributions
wrapper+0.222
arity+0.220
istries+0.201
olson+0.196
inski+0.195
Neg logit contributions
 Dek-0.194
nc-0.188
 Jou-0.181
jug-0.180
�醒-0.177
Token said
Token ID531
Feature activation-1.49e-01

Pos logit contributions
wrapper+0.326
arity+0.323
istries+0.295
olson+0.287
inski+0.286
Neg logit contributions
 Dek-0.279
nc-0.270
 Jou-0.260
jug-0.258
�醒-0.254
Token that
Token ID326
Feature activation-1.79e-01

Pos logit contributions
wrapper+0.106
arity+0.105
istries+0.096
olson+0.093
inski+0.093
Neg logit contributions
 Dek-0.098
nc-0.095
 Jou-0.091
jug-0.091
�醒-0.090
Token
Token ID564
Feature activation-8.77e-02

Pos logit contributions
wrapper+0.126
arity+0.125
istries+0.114
olson+0.111
inski+0.110
Neg logit contributions
 Dek-0.118
nc-0.114
 Jou-0.110
jug-0.110
�醒-0.108
Token opinions
Token ID9317
Feature activation-2.43e-01

Pos logit contributions
 Dek+0.045
nc+0.043
 Jou+0.042
jug+0.042
�醒+0.041
Neg logit contributions
wrapper-0.050
arity-0.050
istries-0.046
olson-0.044
inski-0.044
Token expressed
Token ID6241
Feature activation-2.32e-01

Pos logit contributions
wrapper+0.169
arity+0.167
istries+0.152
olson+0.147
inski+0.147
Neg logit contributions
 Dek-0.163
nc-0.158
 Jou-0.153
jug-0.152
�醒-0.150
Token by
Token ID416
Feature activation-1.17e+00

Pos logit contributions
wrapper+0.158
arity+0.156
istries+0.142
olson+0.138
inski+0.137
Neg logit contributions
 Dek-0.160
nc-0.155
 Jou-0.150
jug-0.148
�醒-0.147
Token column
Token ID5721
Feature activation+4.16e-01

Pos logit contributions
wrapper+0.892
arity+0.888
istries+0.813
inski+0.796
olson+0.796
Neg logit contributions
 Dek-0.706
nc-0.684
 Jou-0.655
jug-0.649
Ger-0.641
Tokenists
Token ID1023
Feature activation-1.81e+00

Pos logit contributions
 Dek+0.201
nc+0.194
 Jou+0.184
jug+0.181
�醒+0.179
Neg logit contributions
wrapper-0.367
arity-0.363
istries-0.339
inski-0.330
olson-0.330
Token are
Token ID389
Feature activation-2.30e+00

Pos logit contributions
wrapper+1.433
arity+1.407
istries+1.296
olson+1.286
inski+1.262
Neg logit contributions
 Dek-1.055
nc-1.017
 Jou-0.980
 Muscle-0.962
jug-0.959
Token their
Token ID511
Feature activation-1.84e+00

Pos logit contributions
wrapper+1.659
arity+1.656
istries+1.531
olson+1.488
inski+1.467
Neg logit contributions
 Dek-1.489
nc-1.436
 Jou-1.390
 Muscle-1.362
jug-1.353
Token own
Token ID898
Feature activation-1.20e+00

Pos logit contributions
wrapper+1.336
arity+1.332
istries+1.217
olson+1.189
inski+1.168
Neg logit contributions
 Dek-1.171
nc-1.128
 Jou-1.104
jug-1.069
 Muscle-1.061
Token and
Token ID290
Feature activation-1.53e+00

Pos logit contributions
wrapper+0.893
arity+0.887
istries+0.807
olson+0.790
inski+0.784
Neg logit contributions
 Dek-0.753
nc-0.727
 Jou-0.702
jug-0.691
 Muscle-0.684
Token do
Token ID466
Feature activation-6.25e-01

Pos logit contributions
wrapper+0.966
arity+0.957
istries+0.854
olson+0.837
inski+0.825
Neg logit contributions
 Dek-1.124
nc-1.094
 Jou-1.057
jug-1.045
 Muscle-1.042
Token not
Token ID407
Feature activation-8.01e-02

Pos logit contributions
wrapper+0.442
arity+0.437
istries+0.398
olson+0.387
inski+0.386
Neg logit contributions
 Dek-0.413
nc-0.401
 Jou-0.387
jug-0.383
 Muscle-0.378
Token and
Token ID290
Feature activation+1.40e-01

Pos logit contributions
 Dek+0.000
nc+0.000
 Jou+0.000
jug+0.000
�醒+0.000
Neg logit contributions
wrapper-0.000
arity-0.000
istries-0.000
olson-0.000
inski-0.000
Token ESPN
Token ID10409
Feature activation+1.13e-01

Pos logit contributions
 Dek+0.090
nc+0.088
 Jou+0.085
jug+0.084
�醒+0.083
Neg logit contributions
wrapper-0.101
arity-0.100
istries-0.091
olson-0.089
inski-0.089
Token analyst
Token ID12499
Feature activation+1.36e-01

Pos logit contributions
 Dek+0.071
nc+0.068
 Jou+0.066
jug+0.065
�醒+0.065
Neg logit contributions
wrapper-0.084
arity-0.083
istries-0.076
olson-0.074
inski-0.074
Token Fran
Token ID12323
Feature activation-4.52e-01

Pos logit contributions
 Dek+0.085
nc+0.083
 Jou+0.079
jug+0.079
�醒+0.078
Neg logit contributions
wrapper-0.101
arity-0.100
istries-0.091
olson-0.089
inski-0.089
Token Fr
Token ID1305
Feature activation-1.44e-01

Pos logit contributions
wrapper+0.337
arity+0.333
istries+0.305
olson+0.297
inski+0.296
Neg logit contributions
 Dek-0.283
nc-0.273
 Jou-0.263
jug-0.261
�醒-0.257
Tokenas
Token ID292
Feature activation-2.19e+00

Pos logit contributions
wrapper+0.119
arity+0.118
istries+0.109
olson+0.106
inski+0.106
Neg logit contributions
 Dek-0.078
nc-0.075
 Jou-0.072
jug-0.071
�醒-0.070
Tokench
Token ID354
Feature activation-4.86e-01

Pos logit contributions
wrapper+1.369
arity+1.353
istries+1.207
olson+1.180
inski+1.155
Neg logit contributions
 Dek-1.630
nc-1.583
 Jou-1.525
jug-1.519
 Muscle-1.500
Tokenilla
Token ID5049
Feature activation-2.06e-01

Pos logit contributions
wrapper+0.495
arity+0.492
istries+0.462
olson+0.452
inski+0.451
Neg logit contributions
 Dek-0.170
nc-0.160
 Jou-0.149
jug-0.147
�醒-0.143
Token.[
Token ID3693
Feature activation+2.31e-01

Pos logit contributions
wrapper+0.154
arity+0.153
istries+0.140
olson+0.136
inski+0.136
Neg logit contributions
 Dek-0.127
nc-0.123
 Jou-0.119
jug-0.118
�醒-0.116
Token94
Token ID5824
Feature activation+2.94e-02

Pos logit contributions
 Dek+0.107
nc+0.101
 Jou+0.097
jug+0.095
�醒+0.094
Neg logit contributions
wrapper-0.210
arity-0.208
istries-0.193
olson-0.189
inski-0.189
Token]
Token ID60
Feature activation-5.26e-01

Pos logit contributions
 Dek+0.015
nc+0.014
 Jou+0.014
jug+0.014
�醒+0.013
Neg logit contributions
wrapper-0.025
arity-0.025
istries-0.023
olson-0.023
inski-0.023
Token File
Token ID9220
Feature activation+6.06e-01

Pos logit contributions
wrapper+0.151
arity+0.150
istries+0.137
olson+0.133
inski+0.133
Neg logit contributions
 Dek-0.123
nc-0.119
 Jou-0.114
jug-0.114
�醒-0.112
Token)
Token ID8
Feature activation+2.05e-01

Pos logit contributions
 Dek+0.366
nc+0.354
 Jou+0.342
jug+0.339
�醒+0.337
Neg logit contributions
wrapper-0.461
arity-0.456
istries-0.418
inski-0.408
olson-0.407
Token
Token ID198
Feature activation-1.42e-01

Pos logit contributions
 Dek+0.128
nc+0.124
 Jou+0.119
jug+0.118
�醒+0.117
Neg logit contributions
wrapper-0.152
arity-0.150
istries-0.137
inski-0.134
olson-0.134
Token
Token ID198
Feature activation-2.69e-01

Pos logit contributions
wrapper+0.105
arity+0.104
istries+0.095
olson+0.093
inski+0.093
Neg logit contributions
 Dek-0.089
nc-0.086
 Jou-0.083
jug-0.083
�醒-0.082
TokenL
Token ID43
Feature activation+2.47e-01

Pos logit contributions
wrapper+0.206
arity+0.205
istries+0.188
olson+0.183
inski+0.182
Neg logit contributions
 Dek-0.162
nc-0.157
 Jou-0.151
jug-0.149
�醒-0.147
TokenANS
Token ID15037
Feature activation-2.16e+00

Pos logit contributions
 Dek+0.154
nc+0.149
 Jou+0.144
jug+0.143
�醒+0.142
Neg logit contributions
wrapper-0.183
arity-0.182
istries-0.166
olson-0.162
inski-0.161
TokenING
Token ID2751
Feature activation-2.83e-01

Pos logit contributions
wrapper+1.695
arity+1.644
istries+1.532
olson+1.490
inski+1.479
Neg logit contributions
 Dek-1.313
nc-1.242
 Jou-1.192
Ger-1.180
 Muscle-1.163
Token,
Token ID11
Feature activation-6.71e-01

Pos logit contributions
wrapper+0.213
arity+0.211
istries+0.193
olson+0.188
inski+0.187
Neg logit contributions
 Dek-0.174
nc-0.168
 Jou-0.162
jug-0.160
�醒-0.158
Token MI
Token ID15789
Feature activation-1.13e+00

Pos logit contributions
wrapper+0.523
arity+0.518
istries+0.475
olson+0.464
inski+0.462
Neg logit contributions
 Dek-0.396
nc-0.382
 Jou-0.367
jug-0.363
�醒-0.358
Token --
Token ID1377
Feature activation-4.22e-01

Pos logit contributions
wrapper+0.845
arity+0.837
istries+0.764
olson+0.745
inski+0.742
Neg logit contributions
 Dek-0.710
nc-0.686
 Jou-0.660
jug-0.654
 Muscle-0.644
Token The
Token ID383
Feature activation-2.29e-01

Pos logit contributions
wrapper+0.312
arity+0.309
istries+0.283
olson+0.275
inski+0.274
Neg logit contributions
 Dek-0.265
nc-0.257
 Jou-0.247
jug-0.245
�醒-0.242
Token an
Token ID281
Feature activation-1.82e-01

Pos logit contributions
wrapper+0.034
arity+0.033
istries+0.030
olson+0.030
inski+0.029
Neg logit contributions
 Dek-0.032
nc-0.031
 Jou-0.030
jug-0.030
�醒-0.029
Token appearance
Token ID5585
Feature activation-2.27e-01

Pos logit contributions
wrapper+0.130
arity+0.129
istries+0.117
olson+0.114
inski+0.114
Neg logit contributions
 Dek-0.119
nc-0.115
 Jou-0.111
jug-0.110
�醒-0.109
Token with
Token ID351
Feature activation-1.93e-01

Pos logit contributions
wrapper+0.161
arity+0.160
istries+0.145
olson+0.141
inski+0.141
Neg logit contributions
 Dek-0.150
nc-0.145
 Jou-0.140
jug-0.139
�醒-0.137
Token Treasury
Token ID13419
Feature activation-2.08e-01

Pos logit contributions
wrapper+0.147
arity+0.145
istries+0.133
olson+0.130
inski+0.130
Neg logit contributions
 Dek-0.117
nc-0.114
 Jou-0.109
jug-0.108
�醒-0.107
Token Secretary
Token ID4986
Feature activation-5.69e-01

Pos logit contributions
wrapper+0.156
arity+0.154
istries+0.141
olson+0.138
inski+0.137
Neg logit contributions
 Dek-0.128
nc-0.124
 Jou-0.119
jug-0.118
�醒-0.117
Token Steven
Token ID8239
Feature activation-2.16e+00

Pos logit contributions
wrapper+0.451
arity+0.447
istries+0.412
olson+0.402
inski+0.401
Neg logit contributions
 Dek-0.327
nc-0.315
 Jou-0.303
jug-0.299
�醒-0.295
Token Mn
Token ID36030
Feature activation-7.53e-01

Pos logit contributions
wrapper+1.180
arity+1.179
istries+1.041
inski+0.991
olson+0.987
Neg logit contributions
 Dek-1.806
nc-1.730
 Jou-1.707
 Muscle-1.667
jug-1.665
Tokenuchin
Token ID43416
Feature activation-6.60e-01

Pos logit contributions
wrapper+0.511
arity+0.504
istries+0.459
olson+0.446
inski+0.443
Neg logit contributions
 Dek-0.517
nc-0.504
 Jou-0.485
jug-0.482
�醒-0.475
Token,
Token ID11
Feature activation-1.19e-01

Pos logit contributions
wrapper+0.493
arity+0.487
istries+0.447
olson+0.435
inski+0.433
Neg logit contributions
 Dek-0.410
nc-0.396
 Jou-0.382
jug-0.378
�醒-0.373
Token that
Token ID326
Feature activation-3.69e-01

Pos logit contributions
wrapper+0.084
arity+0.083
istries+0.076
olson+0.074
inski+0.073
Neg logit contributions
 Dek-0.079
nc-0.076
 Jou-0.074
jug-0.073
�醒-0.072
Token the
Token ID262
Feature activation-3.29e-01

Pos logit contributions
wrapper+0.273
arity+0.270
istries+0.247
olson+0.240
inski+0.240
Neg logit contributions
 Dek-0.231
nc-0.224
 Jou-0.216
jug-0.214
�醒-0.211
Token
Token ID198
Feature activation-7.53e-02

Pos logit contributions
wrapper+0.110
arity+0.109
istries+0.099
olson+0.097
inski+0.097
Neg logit contributions
 Dek-0.092
nc-0.090
 Jou-0.086
jug-0.086
�醒-0.084
Token
Token ID198
Feature activation+2.53e-03

Pos logit contributions
wrapper+0.056
arity+0.055
istries+0.050
olson+0.049
inski+0.049
Neg logit contributions
 Dek-0.047
nc-0.046
 Jou-0.044
jug-0.044
�醒-0.043
TokenSAN
Token ID36753
Feature activation-2.29e-01

Pos logit contributions
 Dek+0.001
nc+0.001
 Jou+0.001
jug+0.001
�醒+0.001
Neg logit contributions
wrapper-0.002
arity-0.002
istries-0.002
olson-0.002
inski-0.002
Token FR
Token ID8782
Feature activation+1.64e-01

Pos logit contributions
wrapper+0.112
arity+0.111
istries+0.096
olson+0.092
inski+0.092
Neg logit contributions
 Dek-0.201
nc-0.196
 Jou-0.191
jug-0.190
�醒-0.188
TokenANC
Token ID20940
Feature activation-8.47e-01

Pos logit contributions
 Dek+0.090
nc+0.087
 Jou+0.083
jug+0.083
�醒+0.082
Neg logit contributions
wrapper-0.134
arity-0.133
istries-0.122
olson-0.120
inski-0.119
TokenIS
Token ID1797
Feature activation-2.15e+00

Pos logit contributions
wrapper+0.752
arity+0.745
istries+0.692
olson+0.676
inski+0.675
Neg logit contributions
 Dek-0.409
nc-0.390
 Jou-0.372
jug-0.365
Ger-0.362
TokenCO
Token ID8220
Feature activation-5.55e-01

Pos logit contributions
wrapper+1.507
arity+1.480
istries+1.379
inski+1.313
olson+1.300
Neg logit contributions
 Dek-1.468
nc-1.409
 Jou-1.363
Ger-1.348
jug-1.320
Token (
Token ID357
Feature activation-6.73e-01

Pos logit contributions
wrapper+0.425
arity+0.421
istries+0.386
olson+0.376
inski+0.375
Neg logit contributions
 Dek-0.335
nc-0.323
 Jou-0.311
jug-0.308
�醒-0.304
TokenCBS
Token ID22923
Feature activation+6.81e-01

Pos logit contributions
wrapper+0.485
arity+0.480
istries+0.439
olson+0.426
inski+0.425
Neg logit contributions
 Dek-0.435
nc-0.422
 Jou-0.407
jug-0.403
Ger-0.398
Token SF
Token ID14362
Feature activation-5.66e-01

Pos logit contributions
 Dek+0.414
nc+0.399
 Jou+0.387
�醒+0.385
jug+0.381
Neg logit contributions
wrapper-0.516
arity-0.511
istries-0.465
olson-0.458
inski-0.456
Token)
Token ID8
Feature activation-5.88e-01

Pos logit contributions
wrapper+0.420
arity+0.417
istries+0.381
olson+0.371
inski+0.370
Neg logit contributions
 Dek-0.354
nc-0.343
 Jou-0.329
jug-0.327
 Muscle-0.322
Token,
Token ID11
Feature activation+1.76e-01

Pos logit contributions
 Dek+0.043
nc+0.042
 Jou+0.040
jug+0.040
�醒+0.039
Neg logit contributions
wrapper-0.050
arity-0.050
istries-0.046
olson-0.044
inski-0.044
Token according
Token ID1864
Feature activation-5.10e-02

Pos logit contributions
 Dek+0.109
nc+0.105
 Jou+0.101
jug+0.101
�醒+0.100
Neg logit contributions
wrapper-0.132
arity-0.131
istries-0.119
olson-0.116
inski-0.116
Token to
Token ID284
Feature activation+1.39e-01

Pos logit contributions
wrapper+0.038
arity+0.038
istries+0.035
olson+0.034
inski+0.034
Neg logit contributions
 Dek-0.031
nc-0.030
 Jou-0.029
jug-0.029
�醒-0.029
Token one
Token ID530
Feature activation-3.04e-01

Pos logit contributions
 Dek+0.089
nc+0.086
 Jou+0.083
jug+0.082
�醒+0.081
Neg logit contributions
wrapper-0.102
arity-0.101
istries-0.092
olson-0.089
inski-0.089
Token of
Token ID286
Feature activation+1.09e-01

Pos logit contributions
wrapper+0.219
arity+0.217
istries+0.198
olson+0.192
inski+0.192
Neg logit contributions
 Dek-0.197
nc-0.191
 Jou-0.184
jug-0.183
�醒-0.181
Token Lois
Token ID42879
Feature activation-2.13e+00

Pos logit contributions
 Dek+0.070
nc+0.068
 Jou+0.065
jug+0.065
�醒+0.064
Neg logit contributions
wrapper-0.079
arity-0.078
istries-0.071
olson-0.069
inski-0.069
Token G
Token ID402
Feature activation-1.24e-01

Pos logit contributions
wrapper+0.868
arity+0.837
istries+0.711
olson+0.676
inski+0.667
Neg logit contributions
 Dek-2.074
nc-2.009
 Jou-1.974
jug-1.956
 Muscle-1.941
Token.
Token ID13
Feature activation-6.73e-01

Pos logit contributions
wrapper+0.093
arity+0.092
istries+0.084
olson+0.082
inski+0.082
Neg logit contributions
 Dek-0.076
nc-0.074
 Jou-0.071
jug-0.070
�醒-0.070
Token Lerner
Token ID49750
Feature activation-4.59e-01

Pos logit contributions
wrapper+0.270
arity+0.264
istries+0.223
inski+0.210
olson+0.210
Neg logit contributions
 Dek-0.652
nc-0.637
 Jou-0.624
jug-0.619
 Muscle-0.614
Token
Token ID447
Feature activation+4.18e-01

Pos logit contributions
wrapper+0.333
arity+0.330
istries+0.301
olson+0.293
inski+0.292
Neg logit contributions
 Dek-0.294
nc-0.285
 Jou-0.274
jug-0.272
�醒-0.268
Token
Token ID247
Feature activation-1.42e-01

Pos logit contributions
 Dek+0.230
nc+0.221
 Jou+0.212
jug+0.210
 Muscle+0.206
Neg logit contributions
wrapper-0.342
arity-0.339
istries-0.312
olson-0.304
inski-0.304
Token the
Token ID262
Feature activation-4.21e-02

Pos logit contributions
 Dek+0.029
nc+0.028
 Jou+0.027
jug+0.027
�醒+0.027
Neg logit contributions
wrapper-0.032
arity-0.032
istries-0.029
olson-0.028
inski-0.028
Token Americans
Token ID3399
Feature activation+8.58e-03

Pos logit contributions
wrapper+0.030
arity+0.030
istries+0.027
olson+0.026
inski+0.026
Neg logit contributions
 Dek-0.028
nc-0.027
 Jou-0.026
jug-0.026
�醒-0.025
Token."
Token ID526
Feature activation+1.95e-02

Pos logit contributions
 Dek+0.005
nc+0.005
 Jou+0.005
jug+0.005
�醒+0.005
Neg logit contributions
wrapper-0.006
arity-0.006
istries-0.006
olson-0.006
inski-0.006
Token<|endoftext|>
Token ID50256
Feature activation-6.01e-02

Pos logit contributions
 Dek+0.009
nc+0.009
 Jou+0.008
jug+0.008
�醒+0.008
Neg logit contributions
wrapper-0.018
arity-0.017
istries-0.016
olson-0.016
inski-0.016
TokenGL
Token ID8763
Feature activation+3.47e-01

Pos logit contributions
wrapper+0.046
arity+0.046
istries+0.042
olson+0.041
inski+0.041
Neg logit contributions
 Dek-0.036
nc-0.035
 Jou-0.033
jug-0.033
�醒-0.033
TokenAS
Token ID1921
Feature activation-2.12e+00

Pos logit contributions
 Dek+0.234
nc+0.227
 Jou+0.219
jug+0.218
�醒+0.216
Neg logit contributions
wrapper-0.240
arity-0.238
istries-0.215
inski-0.210
olson-0.209
TokenG
Token ID38
Feature activation-7.09e-01

Pos logit contributions
wrapper+1.542
arity+1.540
istries+1.415
inski+1.357
olson+1.345
Neg logit contributions
 Dek-1.364
nc-1.320
 Jou-1.262
Ger-1.253
jug-1.253
TokenOW
Token ID3913
Feature activation+1.73e-02

Pos logit contributions
wrapper+0.321
arity+0.315
istries+0.272
olson+0.258
inski+0.257
Neg logit contributions
 Dek-0.648
nc-0.635
 Jou-0.619
jug-0.613
Ger-0.608
Token -
Token ID532
Feature activation-2.39e-01

Pos logit contributions
 Dek+0.011
nc+0.010
 Jou+0.010
jug+0.010
�醒+0.010
Neg logit contributions
wrapper-0.013
arity-0.013
istries-0.012
olson-0.011
inski-0.011
Token Although
Token ID4900
Feature activation-2.01e-01

Pos logit contributions
wrapper+0.180
arity+0.178
istries+0.163
olson+0.159
inski+0.159
Neg logit contributions
 Dek-0.146
nc-0.141
 Jou-0.136
jug-0.135
�醒-0.133
Token he
Token ID339
Feature activation-4.24e-02

Pos logit contributions
wrapper+0.146
arity+0.144
istries+0.132
olson+0.128
inski+0.128
Neg logit contributions
 Dek-0.130
nc-0.126
 Jou-0.121
jug-0.120
�醒-0.119
Token to
Token ID284
Feature activation-1.66e+00

Pos logit contributions
 Dek+0.041
nc+0.039
 Jou+0.038
jug+0.038
�醒+0.037
Neg logit contributions
wrapper-0.049
arity-0.049
istries-0.045
olson-0.044
inski-0.043
Token per
Token ID583
Feature activation-1.48e+00

Pos logit contributions
wrapper+1.304
arity+1.293
istries+1.197
inski+1.155
olson+1.154
Neg logit contributions
 Dek-0.984
nc-0.936
 Jou-0.909
jug-0.899
Ger-0.885
Tokenuse
Token ID1904
Feature activation-3.17e-01

Pos logit contributions
wrapper+1.048
arity+1.037
istries+0.950
olson+0.925
inski+0.911
Neg logit contributions
 Dek-0.971
nc-0.937
 Jou-0.905
jug-0.899
 Muscle-0.892
Token.
Token ID13
Feature activation-9.23e-01

Pos logit contributions
wrapper+0.240
arity+0.238
istries+0.218
olson+0.212
inski+0.212
Neg logit contributions
 Dek-0.194
nc-0.187
 Jou-0.180
jug-0.179
�醒-0.176
Token Here
Token ID3423
Feature activation-3.59e-01

Pos logit contributions
wrapper+0.805
arity+0.798
istries+0.743
olson+0.729
inski+0.724
Neg logit contributions
 Dek-0.456
nc-0.439
 Jou-0.416
 Muscle-0.407
jug-0.406
Token is
Token ID318
Feature activation-2.00e+00

Pos logit contributions
wrapper+0.257
arity+0.254
istries+0.232
olson+0.225
inski+0.225
Neg logit contributions
 Dek-0.234
nc-0.227
 Jou-0.218
jug-0.217
�醒-0.214
Token the
Token ID262
Feature activation-1.53e+00

Pos logit contributions
wrapper+1.512
arity+1.469
istries+1.369
olson+1.332
inski+1.310
Neg logit contributions
 Dek-1.255
nc-1.198
 Jou-1.144
jug-1.141
 Muscle-1.132
Token link
Token ID2792
Feature activation+6.94e-01

Pos logit contributions
wrapper+1.061
arity+1.056
istries+0.962
inski+0.931
olson+0.921
Neg logit contributions
 Dek-1.036
nc-1.004
jug-0.964
 Jou-0.955
Ger-0.948
Token for
Token ID329
Feature activation+6.21e-01

Pos logit contributions
 Dek+0.458
nc+0.442
 Jou+0.434
jug+0.428
�醒+0.423
Neg logit contributions
wrapper-0.493
arity-0.484
istries-0.444
inski-0.428
olson-0.427
Token buying
Token ID7067
Feature activation+2.44e-01

Pos logit contributions
 Dek+0.271
nc+0.264
 Jou+0.250
jug+0.247
�醒+0.241
Neg logit contributions
wrapper-0.576
arity-0.570
istries-0.529
inski-0.520
olson-0.519
Token the
Token ID262
Feature activation+1.04e+00

Pos logit contributions
 Dek+0.161
nc+0.157
 Jou+0.152
jug+0.151
�醒+0.150
Neg logit contributions
wrapper-0.172
arity-0.171
istries-0.155
olson-0.151
inski-0.151
Token.
Token ID13
Feature activation+8.65e-02

Pos logit contributions
wrapper+0.187
arity+0.186
istries+0.170
olson+0.165
inski+0.165
Neg logit contributions
 Dek-0.154
nc-0.149
 Jou-0.144
jug-0.142
�醒-0.141
Token<|endoftext|>
Token ID50256
Feature activation-4.40e-02

Pos logit contributions
 Dek+0.051
nc+0.050
 Jou+0.047
jug+0.047
�醒+0.047
Neg logit contributions
wrapper-0.067
arity-0.066
istries-0.061
olson-0.059
inski-0.059
TokenSAN
Token ID36753
Feature activation-2.63e-01

Pos logit contributions
wrapper+0.034
arity+0.034
istries+0.031
olson+0.030
inski+0.030
Neg logit contributions
 Dek-0.026
nc-0.025
 Jou-0.024
jug-0.024
�醒-0.024
Token FR
Token ID8782
Feature activation+1.05e-01

Pos logit contributions
wrapper+0.155
arity+0.154
istries+0.137
olson+0.132
inski+0.132
Neg logit contributions
 Dek-0.204
nc-0.199
 Jou-0.193
jug-0.192
�醒-0.190
TokenANC
Token ID20940
Feature activation-9.55e-01

Pos logit contributions
 Dek+0.057
nc+0.055
 Jou+0.053
jug+0.053
�醒+0.052
Neg logit contributions
wrapper-0.086
arity-0.085
istries-0.078
olson-0.077
inski-0.077
TokenIS
Token ID1797
Feature activation-1.97e+00

Pos logit contributions
wrapper+0.851
arity+0.843
istries+0.782
olson+0.764
inski+0.763
Neg logit contributions
 Dek-0.460
nc-0.438
 Jou-0.418
jug-0.409
Ger-0.407
TokenCO
Token ID8220
Feature activation-4.35e-01

Pos logit contributions
wrapper+1.363
arity+1.343
istries+1.243
inski+1.188
olson+1.179
Neg logit contributions
 Dek-1.350
nc-1.298
 Jou-1.256
Ger-1.238
jug-1.218
Token -
Token ID532
Feature activation-1.42e-01

Pos logit contributions
wrapper+0.326
arity+0.323
istries+0.296
olson+0.288
inski+0.287
Neg logit contributions
 Dek-0.268
nc-0.260
 Jou-0.250
jug-0.248
�醒-0.244
Token Reddit
Token ID10750
Feature activation-1.96e-01

Pos logit contributions
wrapper+0.108
arity+0.107
istries+0.098
olson+0.095
inski+0.095
Neg logit contributions
 Dek-0.086
nc-0.083
 Jou-0.080
jug-0.079
�醒-0.078
Token and
Token ID290
Feature activation+2.61e-01

Pos logit contributions
wrapper+0.141
arity+0.140
istries+0.127
olson+0.124
inski+0.124
Neg logit contributions
 Dek-0.127
nc-0.123
 Jou-0.119
jug-0.118
�醒-0.116
Token Google
Token ID3012
Feature activation-4.26e-01

Pos logit contributions
 Dek+0.162
nc+0.157
 Jou+0.151
jug+0.150
�醒+0.149
Neg logit contributions
wrapper-0.195
arity-0.194
istries-0.177
olson-0.172
inski-0.172
Token the
Token ID262
Feature activation-8.73e-01

Pos logit contributions
wrapper+0.669
arity+0.663
istries+0.606
olson+0.589
inski+0.588
Neg logit contributions
 Dek-0.585
nc-0.565
 Jou-0.547
jug-0.538
 Muscle-0.533
Token views
Token ID5009
Feature activation-2.46e-01

Pos logit contributions
wrapper+0.515
arity+0.508
istries+0.453
olson+0.441
inski+0.438
Neg logit contributions
 Dek-0.679
nc-0.661
 Jou-0.639
jug-0.634
 Muscle-0.629
Token of
Token ID286
Feature activation-1.66e+00

Pos logit contributions
wrapper+0.184
arity+0.183
istries+0.167
olson+0.163
inski+0.162
Neg logit contributions
 Dek-0.153
nc-0.148
 Jou-0.142
jug-0.141
�醒-0.139
Token Town
Token ID8329
Feature activation-5.69e-01

Pos logit contributions
wrapper+1.293
arity+1.281
istries+1.170
inski+1.145
olson+1.144
Neg logit contributions
 Dek-0.992
nc-0.955
 Jou-0.916
jug-0.899
 Muscle-0.895
Tokenhall
Token ID18323
Feature activation-4.92e-01

Pos logit contributions
wrapper+0.369
arity+0.364
istries+0.328
olson+0.318
inski+0.316
Neg logit contributions
 Dek-0.410
nc-0.399
 Jou-0.386
jug-0.383
 Muscle-0.378
Token.
Token ID13
Feature activation-1.95e+00

Pos logit contributions
wrapper+0.374
arity+0.371
istries+0.340
olson+0.331
inski+0.330
Neg logit contributions
 Dek-0.299
nc-0.290
 Jou-0.278
jug-0.276
 Muscle-0.272
Tokencom
Token ID785
Feature activation-1.28e+00

Pos logit contributions
wrapper+1.232
arity+1.228
istries+1.094
inski+1.081
olson+1.062
Neg logit contributions
 Dek-1.430
nc-1.406
 Jou-1.353
jug-1.325
 Muscle-1.325
Token.
Token ID13
Feature activation-1.77e+00

Pos logit contributions
wrapper+0.998
arity+0.990
istries+0.906
olson+0.885
inski+0.885
Neg logit contributions
 Dek-0.753
nc-0.731
 Jou-0.697
jug-0.697
 Muscle-0.686
Token
Token ID198
Feature activation-4.82e-02

Pos logit contributions
wrapper+1.379
arity+1.366
istries+1.240
olson+1.226
inski+1.222
Neg logit contributions
 Dek-1.063
nc-1.019
 Jou-0.981
jug-0.961
 Muscle-0.952
Token
Token ID198
Feature activation-2.43e-01

Pos logit contributions
wrapper+0.036
arity+0.035
istries+0.032
olson+0.031
inski+0.031
Neg logit contributions
 Dek-0.030
nc-0.029
 Jou-0.028
jug-0.028
�醒-0.028
TokenBernie
Token ID33433
Feature activation-4.70e-01

Pos logit contributions
wrapper+0.194
arity+0.192
istries+0.177
olson+0.173
inski+0.172
Neg logit contributions
 Dek-0.139
nc-0.134
 Jou-0.128
jug-0.127
�醒-0.126
Token after
Token ID706
Feature activation-2.60e-01

Pos logit contributions
wrapper+0.185
arity+0.183
istries+0.168
olson+0.163
inski+0.163
Neg logit contributions
 Dek-0.156
nc-0.151
 Jou-0.145
jug-0.144
�醒-0.142
Token former
Token ID1966
Feature activation+2.45e-01

Pos logit contributions
wrapper+0.185
arity+0.183
istries+0.167
olson+0.162
inski+0.162
Neg logit contributions
 Dek-0.170
nc-0.165
 Jou-0.159
jug-0.158
�醒-0.156
Token Finance
Token ID15007
Feature activation-6.33e-01

Pos logit contributions
 Dek+0.154
nc+0.150
 Jou+0.144
jug+0.143
�醒+0.142
Neg logit contributions
wrapper-0.180
arity-0.179
istries-0.163
inski-0.159
olson-0.158
Token Minister
Token ID4139
Feature activation-4.84e-01

Pos logit contributions
wrapper+0.402
arity+0.397
istries+0.358
olson+0.347
inski+0.345
Neg logit contributions
 Dek-0.463
nc-0.451
 Jou-0.437
jug-0.433
 Muscle-0.428
Token Jim
Token ID5395
Feature activation-4.17e-01

Pos logit contributions
wrapper+0.399
arity+0.395
istries+0.364
olson+0.356
inski+0.355
Neg logit contributions
 Dek-0.264
nc-0.254
 Jou-0.244
jug-0.241
 Muscle-0.237
Token Fla
Token ID22026
Feature activation-1.93e+00

Pos logit contributions
wrapper+0.226
arity+0.223
istries+0.196
olson+0.189
inski+0.188
Neg logit contributions
 Dek-0.345
nc-0.336
 Jou-0.327
jug-0.325
�醒-0.321
Tokenherty
Token ID29029
Feature activation-5.27e-01

Pos logit contributions
wrapper+0.932
arity+0.887
istries+0.812
olson+0.754
inski+0.732
Neg logit contributions
 Dek-1.710
nc-1.675
 Jou-1.640
jug-1.630
 Muscle-1.616
Token tightened
Token ID37257
Feature activation-3.75e-01

Pos logit contributions
wrapper+0.399
arity+0.395
istries+0.362
olson+0.352
inski+0.351
Neg logit contributions
 Dek-0.322
nc-0.311
 Jou-0.299
jug-0.297
 Muscle-0.292
Token the
Token ID262
Feature activation-1.63e-01

Pos logit contributions
wrapper+0.262
arity+0.260
istries+0.236
olson+0.229
inski+0.229
Neg logit contributions
 Dek-0.250
nc-0.242
 Jou-0.234
jug-0.232
�醒-0.229
Token country
Token ID1499
Feature activation+5.89e-02

Pos logit contributions
wrapper+0.112
arity+0.111
istries+0.101
olson+0.098
inski+0.098
Neg logit contributions
 Dek-0.111
nc-0.108
 Jou-0.104
jug-0.104
�醒-0.102
Token's
Token ID338
Feature activation-5.22e-01

Pos logit contributions
 Dek+0.038
nc+0.037
 Jou+0.036
jug+0.035
�醒+0.035
Neg logit contributions
wrapper-0.042
arity-0.042
istries-0.038
olson-0.037
inski-0.037
Token
Token ID198
Feature activation+1.66e-01

Pos logit contributions
 Dek+0.015
nc+0.014
 Jou+0.014
jug+0.014
�醒+0.013
Neg logit contributions
wrapper-0.017
arity-0.017
istries-0.015
olson-0.015
inski-0.015
TokenSign
Token ID11712
Feature activation-1.49e-02

Pos logit contributions
 Dek+0.099
nc+0.095
 Jou+0.092
jug+0.091
�醒+0.090
Neg logit contributions
wrapper-0.128
arity-0.127
istries-0.116
olson-0.113
inski-0.113
Token Up
Token ID3205
Feature activation+1.80e-01

Pos logit contributions
wrapper+0.011
arity+0.011
istries+0.010
olson+0.010
inski+0.010
Neg logit contributions
 Dek-0.009
nc-0.009
 Jou-0.008
jug-0.008
�醒-0.008
Token:
Token ID25
Feature activation+8.99e-01

Pos logit contributions
 Dek+0.117
nc+0.113
 Jou+0.109
jug+0.109
�醒+0.108
Neg logit contributions
wrapper-0.128
arity-0.128
istries-0.116
inski-0.113
olson-0.112
Token Stay
Token ID16160
Feature activation+1.66e-01

Pos logit contributions
 Dek+0.592
nc+0.577
 Jou+0.556
jug+0.553
�醒+0.550
Neg logit contributions
wrapper-0.631
arity-0.627
istries-0.573
inski-0.552
olson-0.549
Token on
Token ID319
Feature activation-1.93e+00

Pos logit contributions
 Dek+0.109
nc+0.106
 Jou+0.102
jug+0.101
�醒+0.101
Neg logit contributions
wrapper-0.117
arity-0.116
istries-0.106
olson-0.103
inski-0.103
Token top
Token ID1353
Feature activation+1.40e-02

Pos logit contributions
wrapper+1.190
arity+1.178
istries+1.061
olson+1.019
inski+1.019
Neg logit contributions
 Dek-1.460
nc-1.412
 Jou-1.370
 Muscle-1.351
jug-1.348
Token of
Token ID286
Feature activation-5.14e-01

Pos logit contributions
 Dek+0.009
nc+0.008
 Jou+0.008
jug+0.008
�醒+0.008
Neg logit contributions
wrapper-0.010
arity-0.010
istries-0.009
olson-0.009
inski-0.009
Token the
Token ID262
Feature activation-4.27e-02

Pos logit contributions
wrapper+0.371
arity+0.367
istries+0.335
olson+0.325
inski+0.325
Neg logit contributions
 Dek-0.332
nc-0.321
 Jou-0.310
jug-0.307
 Muscle-0.303
Token latest
Token ID3452
Feature activation+1.36e+00

Pos logit contributions
wrapper+0.029
arity+0.029
istries+0.026
olson+0.026
inski+0.026
Neg logit contributions
 Dek-0.029
nc-0.028
 Jou-0.027
jug-0.027
�醒-0.027
Token breaking
Token ID7163
Feature activation+1.45e+00

Pos logit contributions
 Dek+0.772
nc+0.757
�醒+0.729
Registered+0.724
jug+0.723
Neg logit contributions
arity-1.059
wrapper-1.059
istries-0.969
inski-0.953
olson-0.947
Token Please
Token ID4222
Feature activation-1.97e-01

Pos logit contributions
wrapper+0.447
arity+0.442
istries+0.404
olson+0.393
inski+0.392
Neg logit contributions
 Dek-0.398
nc-0.386
 Jou-0.372
jug-0.368
 Muscle-0.363
Token verify
Token ID11767
Feature activation-6.13e-02

Pos logit contributions
wrapper+0.136
arity+0.134
istries+0.122
olson+0.118
inski+0.118
Neg logit contributions
 Dek-0.134
nc-0.130
 Jou-0.126
jug-0.125
�醒-0.123
Token you
Token ID345
Feature activation-1.18e-02

Pos logit contributions
wrapper+0.041
arity+0.041
istries+0.037
olson+0.036
inski+0.036
Neg logit contributions
 Dek-0.043
nc-0.041
 Jou-0.040
jug-0.040
�醒-0.039
Token're
Token ID821
Feature activation-2.86e-01

Pos logit contributions
wrapper+0.010
arity+0.010
istries+0.009
olson+0.009
inski+0.009
Neg logit contributions
 Dek-0.006
nc-0.006
 Jou-0.006
jug-0.006
�醒-0.006
Token not
Token ID407
Feature activation-8.78e-01

Pos logit contributions
wrapper+0.200
arity+0.198
istries+0.180
olson+0.175
inski+0.175
Neg logit contributions
 Dek-0.191
nc-0.186
 Jou-0.179
jug-0.178
�醒-0.176
Token a
Token ID257
Feature activation-1.93e+00

Pos logit contributions
wrapper+0.619
arity+0.613
istries+0.557
olson+0.544
inski+0.540
Neg logit contributions
 Dek-0.582
nc-0.565
 Jou-0.544
jug-0.540
 Muscle-0.533
Token robot
Token ID9379
Feature activation-7.49e-01

Pos logit contributions
wrapper+1.442
arity+1.433
istries+1.304
olson+1.300
inski+1.271
Neg logit contributions
 Dek-1.196
nc-1.157
 Jou-1.111
jug-1.100
 Muscle-1.092
Token by
Token ID416
Feature activation-1.90e-01

Pos logit contributions
wrapper+0.514
arity+0.509
istries+0.462
olson+0.449
inski+0.447
Neg logit contributions
 Dek-0.509
nc-0.495
 Jou-0.478
jug-0.474
 Muscle-0.468
Token clicking
Token ID12264
Feature activation+2.59e-01

Pos logit contributions
wrapper+0.136
arity+0.134
istries+0.122
olson+0.119
inski+0.119
Neg logit contributions
 Dek-0.124
nc-0.120
 Jou-0.116
jug-0.115
�醒-0.114
Token the
Token ID262
Feature activation+7.22e-01

Pos logit contributions
 Dek+0.171
nc+0.167
 Jou+0.160
jug+0.160
�醒+0.159
Neg logit contributions
wrapper-0.183
arity-0.181
istries-0.165
inski-0.160
olson-0.160
Token box
Token ID3091
Feature activation+3.45e-01

Pos logit contributions
 Dek+0.352
nc+0.339
 Jou+0.324
�醒+0.322
jug+0.321
Neg logit contributions
wrapper-0.634
arity-0.628
istries-0.581
inski-0.571
olson-0.566
Tokenin
Token ID259
Feature activation-2.55e-01

Pos logit contributions
wrapper+0.492
arity+0.489
istries+0.446
olson+0.434
inski+0.430
Neg logit contributions
 Dek-0.457
nc-0.443
 Jou-0.428
jug-0.423
 Muscle-0.418
Token (
Token ID357
Feature activation-1.14e+00

Pos logit contributions
wrapper+0.193
arity+0.191
istries+0.175
olson+0.170
inski+0.170
Neg logit contributions
 Dek-0.155
nc-0.150
 Jou-0.145
jug-0.143
�醒-0.142
TokenO
Token ID46
Feature activation+4.69e-02

Pos logit contributions
wrapper+0.941
arity+0.930
istries+0.863
olson+0.840
inski+0.838
Neg logit contributions
 Dek-0.621
nc-0.603
 Jou-0.574
Ger-0.569
jug-0.568
Tokenmega
Token ID13731
Feature activation-6.15e-01

Pos logit contributions
 Dek+0.027
nc+0.026
 Jou+0.025
jug+0.025
�醒+0.025
Neg logit contributions
wrapper-0.037
arity-0.036
istries-0.033
olson-0.033
inski-0.033
Token Pharma
Token ID41845
Feature activation-1.78e+00

Pos logit contributions
wrapper+0.435
arity+0.432
istries+0.393
olson+0.382
inski+0.381
Neg logit contributions
 Dek-0.406
nc-0.392
 Jou-0.379
jug-0.375
 Muscle-0.371
Token -
Token ID532
Feature activation-1.93e+00

Pos logit contributions
wrapper+1.417
arity+1.410
istries+1.298
olson+1.269
inski+1.256
Neg logit contributions
 Dek-1.022
nc-0.970
 Jou-0.938
jug-0.923
 Muscle-0.920
Token Quick
Token ID12029
Feature activation-5.20e-01

Pos logit contributions
wrapper+1.325
arity+1.307
istries+1.183
olson+1.161
inski+1.146
Neg logit contributions
 Dek-1.330
nc-1.272
 Jou-1.245
jug-1.236
 Muscle-1.225
Tokenstep
Token ID9662
Feature activation-6.66e-01

Pos logit contributions
wrapper+0.383
arity+0.379
istries+0.347
olson+0.338
inski+0.336
Neg logit contributions
 Dek-0.328
nc-0.318
 Jou-0.307
jug-0.304
 Muscle-0.300
Token)
Token ID8
Feature activation+1.89e-01

Pos logit contributions
wrapper+0.496
arity+0.492
istries+0.450
olson+0.438
inski+0.437
Neg logit contributions
 Dek-0.415
nc-0.401
 Jou-0.386
jug-0.382
 Muscle-0.378
Token s
Token ID264
Feature activation+4.90e-02

Pos logit contributions
 Dek+0.109
nc+0.106
 Jou+0.101
jug+0.100
�醒+0.099
Neg logit contributions
wrapper-0.149
arity-0.148
istries-0.136
olson-0.133
inski-0.133
Token.
Token ID13
Feature activation-2.41e-01

Pos logit contributions
 Dek+0.030
nc+0.029
 Jou+0.028
jug+0.028
�醒+0.028
Neg logit contributions
wrapper-0.037
arity-0.036
istries-0.033
olson-0.032
inski-0.032
Token Please
Token ID4222
Feature activation-2.32e-01

Pos logit contributions
wrapper+0.231
arity+0.229
istries+0.209
olson+0.203
inski+0.203
Neg logit contributions
 Dek-0.209
nc-0.203
 Jou-0.196
jug-0.194
�醒-0.192
Token verify
Token ID11767
Feature activation-1.34e-02

Pos logit contributions
wrapper+0.159
arity+0.158
istries+0.143
olson+0.139
inski+0.139
Neg logit contributions
 Dek-0.157
nc-0.153
 Jou-0.147
jug-0.146
�醒-0.145
Token you
Token ID345
Feature activation+4.23e-02

Pos logit contributions
wrapper+0.009
arity+0.009
istries+0.008
olson+0.008
inski+0.008
Neg logit contributions
 Dek-0.009
nc-0.009
 Jou-0.009
jug-0.009
�醒-0.009
Token're
Token ID821
Feature activation-1.87e-01

Pos logit contributions
 Dek+0.023
nc+0.022
 Jou+0.022
jug+0.021
�醒+0.021
Neg logit contributions
wrapper-0.035
arity-0.034
istries-0.032
olson-0.031
inski-0.031
Token not
Token ID407
Feature activation-7.77e-01

Pos logit contributions
wrapper+0.130
arity+0.129
istries+0.117
olson+0.114
inski+0.114
Neg logit contributions
 Dek-0.125
nc-0.122
 Jou-0.117
jug-0.117
�醒-0.115
Token a
Token ID257
Feature activation-1.91e+00

Pos logit contributions
wrapper+0.545
arity+0.540
istries+0.491
olson+0.478
inski+0.476
Neg logit contributions
 Dek-0.516
nc-0.501
 Jou-0.483
jug-0.479
 Muscle-0.473
Token robot
Token ID9379
Feature activation-7.58e-01

Pos logit contributions
wrapper+1.423
arity+1.414
istries+1.288
olson+1.281
inski+1.254
Neg logit contributions
 Dek-1.181
nc-1.142
 Jou-1.097
jug-1.087
 Muscle-1.075
Token by
Token ID416
Feature activation-1.81e-01

Pos logit contributions
wrapper+0.520
arity+0.515
istries+0.468
olson+0.454
inski+0.452
Neg logit contributions
 Dek-0.515
nc-0.501
 Jou-0.483
jug-0.480
 Muscle-0.473
Token clicking
Token ID12264
Feature activation+2.27e-01

Pos logit contributions
wrapper+0.129
arity+0.128
istries+0.117
olson+0.113
inski+0.113
Neg logit contributions
 Dek-0.119
nc-0.115
 Jou-0.111
jug-0.110
�醒-0.109
Token the
Token ID262
Feature activation+7.87e-01

Pos logit contributions
 Dek+0.150
nc+0.146
 Jou+0.140
jug+0.140
�醒+0.139
Neg logit contributions
wrapper-0.160
arity-0.159
istries-0.145
olson-0.141
inski-0.140
Token box
Token ID3091
Feature activation+3.87e-01

Pos logit contributions
 Dek+0.385
nc+0.370
 Jou+0.354
�醒+0.353
jug+0.350
Neg logit contributions
wrapper-0.691
arity-0.683
istries-0.633
inski-0.622
olson-0.616
Tokenac
Token ID330
Feature activation-4.78e-01

Pos logit contributions
wrapper+0.315
arity+0.312
istries+0.282
olson+0.274
inski+0.273
Neg logit contributions
 Dek-0.324
nc-0.315
 Jou-0.304
jug-0.302
�醒-0.297
Tokenans
Token ID504
Feature activation+4.53e-01

Pos logit contributions
wrapper+0.363
arity+0.360
istries+0.330
olson+0.321
inski+0.320
Neg logit contributions
 Dek-0.291
nc-0.281
 Jou-0.270
jug-0.268
�醒-0.264
Tokenole
Token ID2305
Feature activation-5.09e-01

Pos logit contributions
 Dek+0.217
nc+0.209
 Jou+0.199
jug+0.198
�醒+0.195
Neg logit contributions
wrapper-0.399
arity-0.399
istries-0.368
inski-0.361
olson-0.360
Tokenil
Token ID346
Feature activation-1.03e+00

Pos logit contributions
wrapper+0.418
arity+0.414
istries+0.383
olson+0.373
inski+0.373
Neg logit contributions
 Dek-0.278
nc-0.267
 Jou-0.256
jug-0.253
�醒-0.249
Token-
Token ID12
Feature activation-1.01e+00

Pos logit contributions
wrapper+0.811
arity+0.802
istries+0.737
olson+0.721
inski+0.716
Neg logit contributions
 Dek-0.599
nc-0.573
 Jou-0.555
jug-0.547
 Muscle-0.540
TokenDC
Token ID9697
Feature activation-1.90e+00

Pos logit contributions
wrapper+0.722
arity+0.717
istries+0.655
olson+0.637
inski+0.634
Neg logit contributions
 Dek-0.661
nc-0.640
 Jou-0.614
jug-0.612
Ger-0.610
TokenM
Token ID44
Feature activation-1.48e+00

Pos logit contributions
wrapper+1.456
arity+1.431
istries+1.325
olson+1.275
inski+1.274
Neg logit contributions
 Dek-1.158
nc-1.122
 Jou-1.080
 Muscle-1.073
Ger-1.072
Token)
Token ID8
Feature activation+1.33e-01

Pos logit contributions
wrapper+1.211
arity+1.199
istries+1.110
olson+1.081
inski+1.077
Neg logit contributions
 Dek-0.817
nc-0.776
 Jou-0.753
jug-0.740
 Muscle-0.734
Token s
Token ID264
Feature activation+3.86e-02

Pos logit contributions
 Dek+0.076
nc+0.074
 Jou+0.071
jug+0.070
�醒+0.069
Neg logit contributions
wrapper-0.105
arity-0.104
istries-0.095
olson-0.093
inski-0.093
Token.
Token ID13
Feature activation-2.98e-01

Pos logit contributions
 Dek+0.024
nc+0.023
 Jou+0.022
jug+0.022
�醒+0.022
Neg logit contributions
wrapper-0.029
arity-0.029
istries-0.026
olson-0.026
inski-0.026
Tokent
Token ID83
Feature activation+7.30e-02

Pos logit contributions
wrapper+0.252
arity+0.250
istries+0.231
olson+0.226
inski+0.226
Neg logit contributions
 Dek-0.156
nc-0.150
 Jou-0.143
jug-0.142
�醒-0.139
Token Please
Token ID4222
Feature activation-1.10e-01

Pos logit contributions
wrapper+0.319
arity+0.316
istries+0.289
olson+0.281
inski+0.280
Neg logit contributions
 Dek-0.287
nc-0.278
 Jou-0.268
jug-0.266
�醒-0.263
Token verify
Token ID11767
Feature activation-8.95e-02

Pos logit contributions
wrapper+0.075
arity+0.075
istries+0.068
inski+0.066
olson+0.066
Neg logit contributions
 Dek-0.075
nc-0.073
 Jou-0.070
jug-0.070
�醒-0.069
Token you
Token ID345
Feature activation-9.23e-03

Pos logit contributions
wrapper+0.060
arity+0.060
istries+0.054
olson+0.052
inski+0.052
Neg logit contributions
 Dek-0.062
nc-0.060
 Jou-0.058
jug-0.058
�醒-0.057
Token're
Token ID821
Feature activation-2.40e-01

Pos logit contributions
wrapper+0.008
arity+0.007
istries+0.007
olson+0.007
inski+0.007
Neg logit contributions
 Dek-0.005
nc-0.005
 Jou-0.005
jug-0.005
�醒-0.005
Token not
Token ID407
Feature activation-7.61e-01

Pos logit contributions
wrapper+0.167
arity+0.166
istries+0.151
olson+0.146
inski+0.146
Neg logit contributions
 Dek-0.161
nc-0.156
 Jou-0.150
jug-0.149
�醒-0.148
Token a
Token ID257
Feature activation-1.87e+00

Pos logit contributions
wrapper+0.534
arity+0.529
istries+0.480
olson+0.468
inski+0.466
Neg logit contributions
 Dek-0.506
nc-0.492
 Jou-0.473
jug-0.470
 Muscle-0.463
Token robot
Token ID9379
Feature activation-7.85e-01

Pos logit contributions
wrapper+1.394
arity+1.387
istries+1.263
olson+1.254
inski+1.228
Neg logit contributions
 Dek-1.155
nc-1.119
 Jou-1.076
jug-1.065
 Muscle-1.055
Token by
Token ID416
Feature activation-2.30e-01

Pos logit contributions
wrapper+0.539
arity+0.535
istries+0.485
olson+0.471
inski+0.469
Neg logit contributions
 Dek-0.533
nc-0.518
 Jou-0.500
jug-0.496
 Muscle-0.489
Token clicking
Token ID12264
Feature activation+2.24e-01

Pos logit contributions
wrapper+0.164
arity+0.163
istries+0.148
olson+0.144
inski+0.144
Neg logit contributions
 Dek-0.150
nc-0.146
 Jou-0.140
jug-0.139
�醒-0.138
Token the
Token ID262
Feature activation+7.96e-01

Pos logit contributions
 Dek+0.148
nc+0.143
 Jou+0.138
jug+0.137
�醒+0.137
Neg logit contributions
wrapper-0.158
arity-0.156
istries-0.142
olson-0.138
inski-0.138
Token box
Token ID3091
Feature activation+3.89e-01

Pos logit contributions
 Dek+0.391
nc+0.376
 Jou+0.360
�醒+0.358
jug+0.356
Neg logit contributions
wrapper-0.698
arity-0.691
istries-0.639
inski-0.628
olson-0.621
Token
Token ID198
Feature activation+3.80e-02

Pos logit contributions
 Dek+0.015
nc+0.014
 Jou+0.014
jug+0.014
�醒+0.014
Neg logit contributions
wrapper-0.017
arity-0.017
istries-0.016
olson-0.015
inski-0.015
TokenAdvertisement
Token ID4723
Feature activation-2.58e-01

Pos logit contributions
 Dek+0.021
nc+0.021
 Jou+0.020
jug+0.020
�醒+0.019
Neg logit contributions
wrapper-0.031
arity-0.030
istries-0.028
olson-0.027
inski-0.027
Token
Token ID198
Feature activation+2.28e-03

Pos logit contributions
wrapper+0.192
arity+0.190
istries+0.174
olson+0.169
inski+0.169
Neg logit contributions
 Dek-0.161
nc-0.156
 Jou-0.150
jug-0.149
�醒-0.147
Token
Token ID198
Feature activation-2.99e-02

Pos logit contributions
 Dek+0.001
nc+0.001
 Jou+0.001
jug+0.001
�醒+0.001
Neg logit contributions
wrapper-0.002
arity-0.002
istries-0.002
olson-0.001
inski-0.001
Token#
Token ID2
Feature activation-4.04e-01

Pos logit contributions
wrapper+0.024
arity+0.024
istries+0.022
olson+0.021
inski+0.021
Neg logit contributions
 Dek-0.017
nc-0.016
 Jou-0.016
jug-0.015
�醒-0.015
Tokenad
Token ID324
Feature activation-1.87e+00

Pos logit contributions
wrapper+0.319
arity+0.317
istries+0.292
olson+0.284
inski+0.284
Neg logit contributions
 Dek-0.232
nc-0.225
 Jou-0.215
jug-0.213
�醒-0.210
Token#
Token ID2
Feature activation-2.20e-01

Pos logit contributions
wrapper+1.427
arity+1.418
istries+1.316
olson+1.278
inski+1.278
Neg logit contributions
 Dek-1.111
nc-1.084
 Jou-1.031
jug-1.031
 Muscle-1.010
TokenThey
Token ID2990
Feature activation-1.01e-01

Pos logit contributions
wrapper+0.175
arity+0.174
istries+0.160
olson+0.156
inski+0.155
Neg logit contributions
 Dek-0.126
nc-0.121
 Jou-0.116
jug-0.115
�醒-0.114
Token d
Token ID288
Feature activation-4.08e-01

Pos logit contributions
wrapper+0.078
arity+0.077
istries+0.071
olson+0.069
inski+0.069
Neg logit contributions
 Dek-0.061
nc-0.059
 Jou-0.056
jug-0.056
�醒-0.055
Tokenusted
Token ID8459
Feature activation-2.09e-01

Pos logit contributions
wrapper+0.317
arity+0.314
istries+0.289
olson+0.281
inski+0.281
Neg logit contributions
 Dek-0.240
nc-0.233
 Jou-0.223
jug-0.221
�醒-0.218
Token off
Token ID572
Feature activation-8.55e-02

Pos logit contributions
wrapper+0.126
arity+0.124
istries+0.111
olson+0.107
inski+0.107
Neg logit contributions
 Dek-0.160
nc-0.156
 Jou-0.151
jug-0.150
�醒-0.149
Token Tom
Token ID4186
Feature activation-1.05e-03

Pos logit contributions
wrapper+0.155
arity+0.153
istries+0.140
olson+0.136
inski+0.136
Neg logit contributions
 Dek-0.130
nc-0.126
 Jou-0.122
jug-0.121
�醒-0.119
Tokenar
Token ID283
Feature activation-3.22e-01

Pos logit contributions
wrapper+0.001
arity+0.001
istries+0.001
olson+0.001
inski+0.001
Neg logit contributions
 Dek-0.001
nc-0.001
 Jou-0.001
jug-0.001
�醒-0.001
Token (
Token ID357
Feature activation+2.26e-01

Pos logit contributions
wrapper+0.240
arity+0.238
istries+0.217
olson+0.211
inski+0.211
Neg logit contributions
 Dek-0.201
nc-0.195
 Jou-0.187
jug-0.186
�醒-0.183
TokenPress
Token ID13800
Feature activation+8.24e-01

Pos logit contributions
 Dek+0.145
nc+0.140
 Jou+0.135
jug+0.134
�醒+0.133
Neg logit contributions
wrapper-0.165
arity-0.163
istries-0.149
olson-0.145
inski-0.144
Token Trust
Token ID9870
Feature activation+2.48e-01

Pos logit contributions
 Dek+0.526
nc+0.511
 Jou+0.490
�醒+0.487
jug+0.486
Neg logit contributions
wrapper-0.599
arity-0.593
istries-0.539
inski-0.527
olson-0.525
Token of
Token ID286
Feature activation-1.86e+00

Pos logit contributions
 Dek+0.155
nc+0.151
 Jou+0.145
jug+0.144
�醒+0.143
Neg logit contributions
wrapper-0.183
arity-0.181
istries-0.166
olson-0.161
inski-0.161
Token India
Token ID3794
Feature activation+2.31e-01

Pos logit contributions
wrapper+0.691
arity+0.667
istries+0.550
olson+0.520
inski+0.519
Neg logit contributions
 Dek-1.878
nc-1.828
 Jou-1.783
jug-1.769
 Muscle-1.764
Token photo
Token ID4590
Feature activation+3.13e-01

Pos logit contributions
 Dek+0.139
nc+0.135
 Jou+0.130
jug+0.129
�醒+0.128
Neg logit contributions
wrapper-0.176
arity-0.174
istries-0.160
inski-0.155
olson-0.155
Token)
Token ID8
Feature activation-4.60e-02

Pos logit contributions
 Dek+0.199
nc+0.193
 Jou+0.186
jug+0.184
�醒+0.183
Neg logit contributions
wrapper-0.228
arity-0.226
istries-0.207
olson-0.201
inski-0.200
Token
Token ID198
Feature activation-3.91e-01

Pos logit contributions
wrapper+0.034
arity+0.034
istries+0.031
olson+0.030
inski+0.030
Neg logit contributions
 Dek-0.029
nc-0.028
 Jou-0.027
jug-0.027
�醒-0.027
Token
Token ID198
Feature activation-4.60e-01

Pos logit contributions
wrapper+0.291
arity+0.289
istries+0.264
olson+0.257
inski+0.257
Neg logit contributions
 Dek-0.244
nc-0.236
 Jou-0.227
jug-0.225
�醒-0.222
Token this
Token ID428
Feature activation-2.96e-01

Pos logit contributions
 Dek+0.104
nc+0.101
 Jou+0.097
jug+0.097
�醒+0.096
Neg logit contributions
wrapper-0.128
arity-0.127
istries-0.116
inski-0.113
olson-0.113
Token content
Token ID2695
Feature activation-4.21e-01

Pos logit contributions
wrapper+0.218
arity+0.216
istries+0.198
olson+0.192
inski+0.192
Neg logit contributions
 Dek-0.186
nc-0.180
 Jou-0.173
jug-0.172
�醒-0.170
Token:
Token ID25
Feature activation-5.01e-01

Pos logit contributions
wrapper+0.304
arity+0.301
istries+0.274
olson+0.267
inski+0.266
Neg logit contributions
 Dek-0.272
nc-0.264
 Jou-0.254
jug-0.252
�醒-0.249
Token Sylv
Token ID24286
Feature activation+1.13e+00

Pos logit contributions
wrapper+0.438
arity+0.435
istries+0.404
olson+0.394
inski+0.394
Neg logit contributions
 Dek-0.247
nc-0.237
 Jou-0.226
jug-0.223
 Muscle-0.219
Tokenie
Token ID494
Feature activation+1.66e+00

Pos logit contributions
 Dek+1.069
nc+1.038
 Jou+1.013
�醒+1.009
jug+1.008
Neg logit contributions
arity-0.475
wrapper-0.469
istries-0.398
inski-0.373
olson-0.373
Token von
Token ID18042
Feature activation+2.53e+00

Pos logit contributions
 Dek+0.866
nc+0.840
jug+0.809
�醒+0.801
Registered+0.795
Neg logit contributions
wrapper-1.389
arity-1.369
istries-1.257
olson-1.252
inski-1.239
Token Du
Token ID10343
Feature activation+1.46e+00

Pos logit contributions
 Dek+1.808
nc+1.763
�醒+1.757
jug+1.727
Registered+1.707
Neg logit contributions
wrapper-1.636
arity-1.585
inski-1.408
istries-1.395
olson-1.339
Tokenug
Token ID1018
Feature activation+5.20e-01

Pos logit contributions
 Dek+0.894
nc+0.863
 Jou+0.830
Registered+0.827
�醒+0.825
Neg logit contributions
wrapper-1.097
arity-1.085
istries-0.994
inski-0.973
olson-0.971
Tokenlas
Token ID21921
Feature activation+1.01e-02

Pos logit contributions
 Dek+0.307
nc+0.294
 Jou+0.283
jug+0.281
�醒+0.277
Neg logit contributions
wrapper-0.405
arity-0.400
istries-0.368
inski-0.360
olson-0.359
Token-
Token ID12
Feature activation-4.87e-01

Pos logit contributions
 Dek+0.006
nc+0.006
 Jou+0.006
jug+0.006
�醒+0.005
Neg logit contributions
wrapper-0.008
arity-0.008
istries-0.007
olson-0.007
inski-0.007
TokenI
Token ID40
Feature activation+1.91e+00

Pos logit contributions
wrapper+0.386
arity+0.383
istries+0.352
olson+0.343
inski+0.342
Neg logit contributions
 Dek-0.280
nc-0.270
 Jou-0.259
jug-0.257
�醒-0.253
Token Scott
Token ID4746
Feature activation+1.30e+00

Pos logit contributions
wrapper+0.678
arity+0.671
istries+0.608
olson+0.591
inski+0.589
Neg logit contributions
 Dek-0.717
nc-0.693
 Jou-0.676
jug-0.666
Registered-0.661
Token Ric
Token ID15868
Feature activation-2.49e-01

Pos logit contributions
 Dek+0.578
nc+0.571
�醒+0.531
jug+0.525
Registered+0.524
Neg logit contributions
wrapper-1.172
arity-1.170
istries-1.095
inski-1.070
olson-1.070
Tokenher
Token ID372
Feature activation+4.48e-01

Pos logit contributions
wrapper+0.147
arity+0.146
istries+0.130
olson+0.125
inski+0.125
Neg logit contributions
 Dek-0.193
nc-0.188
 Jou-0.183
jug-0.182
�醒-0.180
Tokent
Token ID83
Feature activation+1.56e+00

Pos logit contributions
 Dek+0.242
nc+0.232
jug+0.223
 Jou+0.222
�醒+0.221
Neg logit contributions
wrapper-0.371
arity-0.369
istries-0.339
olson-0.333
inski-0.333
Token and
Token ID290
Feature activation-2.16e-01

Pos logit contributions
 Dek+1.069
nc+1.054
 Jou+0.997
jug+0.995
Registered+0.993
Neg logit contributions
wrapper-1.054
arity-1.046
inski-0.933
istries-0.929
olson-0.919
Token David
Token ID3271
Feature activation+2.55e+00

Pos logit contributions
wrapper+0.161
arity+0.159
istries+0.146
olson+0.142
inski+0.142
Neg logit contributions
 Dek-0.134
nc-0.130
 Jou-0.125
jug-0.124
�醒-0.122
Token Gordon
Token ID11646
Feature activation+1.69e+00

Pos logit contributions
 Dek+1.976
�醒+1.954
nc+1.940
jug+1.891
Registered+1.882
Neg logit contributions
wrapper-1.458
arity-1.457
inski-1.279
olson-1.273
istries-1.265
Token (
Token ID357
Feature activation+1.31e-01

Pos logit contributions
 Dek+1.069
nc+1.055
�醒+1.021
jug+1.021
Registered+1.019
Neg logit contributions
wrapper-1.190
arity-1.188
istries-1.080
olson-1.073
inski-1.061
TokenIS
Token ID1797
Feature activation+7.72e-01

Pos logit contributions
 Dek+0.066
nc+0.064
 Jou+0.061
jug+0.060
�醒+0.059
Neg logit contributions
wrapper-0.114
arity-0.113
istries-0.104
inski-0.102
olson-0.102
TokenI
Token ID40
Feature activation+1.07e+00

Pos logit contributions
 Dek+0.463
nc+0.446
 Jou+0.435
jug+0.429
�醒+0.426
Neg logit contributions
wrapper-0.593
arity-0.587
istries-0.536
olson-0.526
inski-0.521
Token Books
Token ID13661
Feature activation-1.41e+00

Pos logit contributions
 Dek+0.693
nc+0.685
 Jou+0.664
jug+0.659
�醒+0.650
Neg logit contributions
arity-0.761
wrapper-0.756
istries-0.682
inski-0.664
olson-0.661
Token Manafort
Token ID26351
Feature activation-9.35e-01

Pos logit contributions
 Dek+0.148
nc+0.144
 Jou+0.138
jug+0.137
�醒+0.135
Neg logit contributions
wrapper-0.203
arity-0.202
istries-0.185
olson-0.180
inski-0.180
Token's
Token ID338
Feature activation+1.53e+00

Pos logit contributions
wrapper+0.687
arity+0.682
istries+0.625
olson+0.604
inski+0.602
Neg logit contributions
 Dek-0.592
nc-0.573
 Jou-0.551
jug-0.548
Ger-0.539
Token attorneys
Token ID14449
Feature activation+7.72e-01

Pos logit contributions
 Dek+1.092
nc+1.062
 Jou+1.030
jug+1.023
 Muscle+1.011
Neg logit contributions
wrapper-0.995
arity-0.991
istries-0.897
inski-0.872
olson-0.865
Token say
Token ID910
Feature activation+9.58e-01

Pos logit contributions
 Dek+0.485
nc+0.472
jug+0.451
 Jou+0.451
�醒+0.442
Neg logit contributions
arity-0.568
wrapper-0.565
istries-0.516
inski-0.507
olson-0.503
Token he
Token ID339
Feature activation+1.37e+00

Pos logit contributions
 Dek+0.633
nc+0.614
 Jou+0.593
jug+0.593
�醒+0.588
Neg logit contributions
wrapper-0.669
arity-0.668
istries-0.600
inski-0.592
olson-0.579
Token should
Token ID815
Feature activation+2.70e+00

Pos logit contributions
 Dek+0.891
nc+0.871
 Jou+0.839
�醒+0.836
jug+0.833
Neg logit contributions
wrapper-0.969
arity-0.966
istries-0.858
inski-0.854
olson-0.844
Token get
Token ID651
Feature activation+6.73e-01

Pos logit contributions
 Dek+2.021
nc+1.914
 Jou+1.873
jug+1.859
Registered+1.854
Neg logit contributions
wrapper-1.695
arity-1.679
istries-1.496
inski-1.456
olson-1.454
Token less
Token ID1342
Feature activation+1.12e+00

Pos logit contributions
 Dek+0.418
nc+0.410
jug+0.388
 Jou+0.387
Registered+0.382
Neg logit contributions
wrapper-0.499
arity-0.496
istries-0.449
olson-0.440
inski-0.438
Token than
Token ID621
Feature activation+1.37e-01

Pos logit contributions
 Dek+0.741
nc+0.721
 Jou+0.700
jug+0.691
Ger+0.686
Neg logit contributions
wrapper-0.776
arity-0.769
istries-0.690
inski-0.685
olson-0.676
Token 10
Token ID838
Feature activation-4.11e-02

Pos logit contributions
 Dek+0.088
nc+0.086
 Jou+0.082
jug+0.081
 Muscle+0.081
Neg logit contributions
wrapper-0.098
arity-0.098
istries-0.089
olson-0.086
inski-0.086
Token years
Token ID812
Feature activation-5.05e-02

Pos logit contributions
wrapper+0.029
arity+0.029
istries+0.026
olson+0.025
inski+0.025
Neg logit contributions
 Dek-0.027
nc-0.026
 Jou-0.025
jug-0.025
�醒-0.025
Token at
Token ID379
Feature activation+5.38e-01

Pos logit contributions
 Dek+0.166
nc+0.161
 Jou+0.156
jug+0.155
�醒+0.153
Neg logit contributions
wrapper-0.158
arity-0.157
istries-0.142
inski-0.138
olson-0.137
Token stars
Token ID5788
Feature activation+3.69e-01

Pos logit contributions
 Dek+0.341
nc+0.332
 Jou+0.317
jug+0.317
�醒+0.314
Neg logit contributions
wrapper-0.394
arity-0.390
istries-0.355
olson-0.346
inski-0.346
Token such
Token ID884
Feature activation+3.45e-01

Pos logit contributions
 Dek+0.242
nc+0.236
 Jou+0.227
jug+0.226
�醒+0.224
Neg logit contributions
wrapper-0.261
arity-0.259
istries-0.235
inski-0.229
olson-0.229
Token as
Token ID355
Feature activation+3.63e-01

Pos logit contributions
 Dek+0.221
nc+0.214
 Jou+0.206
jug+0.206
�醒+0.205
Neg logit contributions
wrapper-0.249
arity-0.247
istries-0.225
inski-0.219
olson-0.219
Token Robert
Token ID5199
Feature activation+2.43e-01

Pos logit contributions
 Dek+0.234
nc+0.228
 Jou+0.218
jug+0.218
�醒+0.216
Neg logit contributions
wrapper-0.261
arity-0.258
istries-0.235
inski-0.229
olson-0.229
Token Down
Token ID5588
Feature activation+2.63e+00

Pos logit contributions
 Dek+0.152
nc+0.148
 Jou+0.142
jug+0.142
�醒+0.141
Neg logit contributions
wrapper-0.179
arity-0.177
istries-0.162
inski-0.158
olson-0.158
Tokeney
Token ID2959
Feature activation+1.62e+00

Pos logit contributions
 Dek+0.936
nc+0.887
�醒+0.874
jug+0.831
 Jou+0.822
Neg logit contributions
arity-2.641
wrapper-2.635
inski-2.491
olson-2.431
istries-2.422
Token Jr
Token ID7504
Feature activation-3.04e-04

Pos logit contributions
 Dek+1.362
nc+1.342
 Jou+1.309
�醒+1.305
jug+1.303
Neg logit contributions
wrapper-0.833
arity-0.816
istries-0.720
olson-0.703
inski-0.700
Token.
Token ID13
Feature activation+1.35e-01

Pos logit contributions
wrapper+0.000
arity+0.000
istries+0.000
olson+0.000
inski+0.000
Neg logit contributions
 Dek-0.000
nc-0.000
 Jou-0.000
jug-0.000
�醒-0.000
Token (
Token ID357
Feature activation+2.02e-01

Pos logit contributions
 Dek+0.085
nc+0.082
 Jou+0.079
jug+0.078
�醒+0.077
Neg logit contributions
wrapper-0.099
arity-0.098
istries-0.090
inski-0.087
olson-0.087
Tokenfor
Token ID1640
Feature activation-1.39e-01

Pos logit contributions
 Dek+0.128
nc+0.124
 Jou+0.119
jug+0.118
�醒+0.117
Neg logit contributions
wrapper-0.149
arity-0.147
istries-0.134
inski-0.131
olson-0.131
Token airline
Token ID18091
Feature activation+1.16e+00

Pos logit contributions
 Dek+0.585
nc+0.560
 Jou+0.541
jug+0.539
�醒+0.539
Neg logit contributions
wrapper-0.581
arity-0.579
istries-0.525
inski-0.511
olson-0.503
Token takes
Token ID2753
Feature activation+3.75e+00

Pos logit contributions
 Dek+0.797
nc+0.769
�醒+0.741
 Jou+0.738
jug+0.735
Neg logit contributions
wrapper-0.784
arity-0.778
istries-0.699
inski-0.683
olson-0.675
Token steps
Token ID4831
Feature activation+6.61e-01

Pos logit contributions
nc+2.186
 Dek+2.104
jug+2.055
�醒+1.995
 Jou+1.975
Neg logit contributions
wrapper-2.955
arity-2.938
inski-2.640
istries-2.604
olson-2.591
Token to
Token ID284
Feature activation+1.70e+00

Pos logit contributions
 Dek+0.417
nc+0.406
jug+0.395
 Jou+0.386
�醒+0.386
Neg logit contributions
wrapper-0.484
arity-0.482
istries-0.438
inski-0.427
olson-0.423
Token open
Token ID1280
Feature activation+5.85e-01

Pos logit contributions
 Dek+0.970
nc+0.953
�醒+0.900
jug+0.896
Ger+0.892
Neg logit contributions
wrapper-1.345
arity-1.313
istries-1.209
olson-1.207
inski-1.189
Token flights
Token ID13956
Feature activation+2.38e+00

Pos logit contributions
 Dek+0.426
nc+0.417
jug+0.404
 Jou+0.401
�醒+0.398
Neg logit contributions
wrapper-0.371
arity-0.364
istries-0.332
inski-0.320
olson-0.318
Token to
Token ID284
Feature activation+3.29e-01

Pos logit contributions
 Dek+1.736
nc+1.705
jug+1.673
�醒+1.663
 Muscle+1.653
Neg logit contributions
wrapper-1.479
arity-1.452
istries-1.288
inski-1.279
olson-1.233
Token US
Token ID1294
Feature activation+7.73e-01

Pos logit contributions
 Dek+0.231
nc+0.225
 Jou+0.216
jug+0.216
�醒+0.213
Neg logit contributions
wrapper-0.219
arity-0.216
istries-0.194
olson-0.190
inski-0.189
Token on
Token ID319
Feature activation-7.61e-01

Pos logit contributions
 Dek+0.487
nc+0.476
 Jou+0.457
jug+0.456
�醒+0.451
Neg logit contributions
wrapper-0.562
arity-0.560
istries-0.509
olson-0.499
inski-0.495
Token sidelines
Token ID29394
Feature activation-2.43e-02

Pos logit contributions
wrapper+0.598
arity+0.593
istries+0.545
olson+0.533
inski+0.529
Neg logit contributions
 Dek-0.444
nc-0.427
 Jou-0.412
jug-0.406
�醒-0.399
Token of
Token ID286
Feature activation+2.23e-01

Pos logit contributions
wrapper+0.018
arity+0.018
istries+0.016
olson+0.016
inski+0.016
Neg logit contributions
 Dek-0.015
nc-0.015
 Jou-0.014
jug-0.014
�醒-0.014
Token bank
Token ID3331
Feature activation-2.48e-01

Pos logit contributions
wrapper+0.104
arity+0.103
istries+0.095
olson+0.093
inski+0.093
Neg logit contributions
 Dek-0.075
nc-0.073
 Jou-0.070
jug-0.069
�醒-0.068
Token did
Token ID750
Feature activation-1.41e-01

Pos logit contributions
wrapper+0.187
arity+0.186
istries+0.170
olson+0.165
inski+0.165
Neg logit contributions
 Dek-0.151
nc-0.146
 Jou-0.141
jug-0.140
�醒-0.138
Token not
Token ID407
Feature activation-1.67e-01

Pos logit contributions
wrapper+0.099
arity+0.098
istries+0.089
olson+0.086
inski+0.086
Neg logit contributions
 Dek-0.094
nc-0.091
 Jou-0.088
jug-0.087
�醒-0.086
Token feel
Token ID1254
Feature activation-3.21e-01

Pos logit contributions
wrapper+0.121
arity+0.120
istries+0.110
olson+0.107
inski+0.107
Neg logit contributions
 Dek-0.106
nc-0.103
 Jou-0.099
jug-0.098
�醒-0.097
Token under
Token ID739
Feature activation+5.67e-01

Pos logit contributions
wrapper+0.230
arity+0.228
istries+0.207
olson+0.202
inski+0.201
Neg logit contributions
 Dek-0.209
nc-0.202
 Jou-0.195
jug-0.194
�醒-0.191
Token any
Token ID597
Feature activation+9.25e-01

Pos logit contributions
 Dek+0.271
nc+0.258
 Jou+0.245
jug+0.243
�醒+0.242
Neg logit contributions
wrapper-0.505
arity-0.501
istries-0.465
olson-0.454
inski-0.454
Token pressure
Token ID3833
Feature activation-1.27e-01

Pos logit contributions
 Dek+0.462
nc+0.446
jug+0.423
 Jou+0.423
�醒+0.422
Neg logit contributions
wrapper-0.800
arity-0.793
istries-0.735
inski-0.718
olson-0.717
Token from
Token ID422
Feature activation+4.54e-02

Pos logit contributions
wrapper+0.095
arity+0.094
istries+0.086
olson+0.084
inski+0.084
Neg logit contributions
 Dek-0.079
nc-0.076
 Jou-0.073
jug-0.073
�醒-0.072
Token the
Token ID262
Feature activation-2.02e-01

Pos logit contributions
 Dek+0.029
nc+0.028
 Jou+0.027
jug+0.027
�醒+0.027
Neg logit contributions
wrapper-0.033
arity-0.032
istries-0.030
olson-0.029
inski-0.029
Token government
Token ID1230
Feature activation-3.22e-01

Pos logit contributions
wrapper+0.160
arity+0.158
istries+0.145
olson+0.142
inski+0.142
Neg logit contributions
 Dek-0.117
nc-0.113
 Jou-0.108
jug-0.107
�醒-0.106
Token and
Token ID290
Feature activation+7.13e-02

Pos logit contributions
wrapper+0.242
arity+0.240
istries+0.220
olson+0.214
inski+0.214
Neg logit contributions
 Dek-0.198
nc-0.192
 Jou-0.184
jug-0.183
�醒-0.180
Token Stephen
Token ID7970
Feature activation-3.41e-01

Pos logit contributions
wrapper+0.096
arity+0.095
istries+0.087
olson+0.084
inski+0.084
Neg logit contributions
 Dek-0.087
nc-0.084
 Jou-0.081
jug-0.081
�醒-0.080
Token Am
Token ID1703
Feature activation+4.08e-01

Pos logit contributions
wrapper+0.250
arity+0.248
istries+0.226
olson+0.220
inski+0.220
Neg logit contributions
 Dek-0.216
nc-0.209
 Jou-0.201
jug-0.200
�醒-0.197
Tokenell
Token ID695
Feature activation-2.10e-01

Pos logit contributions
 Dek+0.258
nc+0.249
 Jou+0.240
jug+0.238
�醒+0.237
Neg logit contributions
wrapper-0.299
arity-0.298
istries-0.272
inski-0.266
olson-0.265
Token and
Token ID290
Feature activation-1.26e-01

Pos logit contributions
wrapper+0.157
arity+0.156
istries+0.143
olson+0.139
inski+0.139
Neg logit contributions
 Dek-0.130
nc-0.125
 Jou-0.121
jug-0.120
�醒-0.118
Token Robert
Token ID5199
Feature activation-1.35e-01

Pos logit contributions
wrapper+0.091
arity+0.090
istries+0.082
olson+0.080
inski+0.080
Neg logit contributions
 Dek-0.081
nc-0.079
 Jou-0.076
jug-0.076
�醒-0.075
Token Patt
Token ID24682
Feature activation+1.40e+00

Pos logit contributions
wrapper+0.100
arity+0.099
istries+0.091
olson+0.088
inski+0.088
Neg logit contributions
 Dek-0.084
nc-0.082
 Jou-0.079
jug-0.078
�醒-0.077
Tokeninson
Token ID7899
Feature activation-8.21e-02

Pos logit contributions
 Dek+1.072
nc+1.030
�醒+1.010
 Jou+1.007
jug+1.001
Neg logit contributions
arity-0.840
wrapper-0.840
istries-0.735
inski-0.730
olson-0.726
Token.
Token ID13
Feature activation-8.90e-03

Pos logit contributions
wrapper+0.062
arity+0.061
istries+0.056
olson+0.054
inski+0.054
Neg logit contributions
 Dek-0.051
nc-0.049
 Jou-0.047
jug-0.047
�醒-0.046
Token
Token ID198
Feature activation-2.22e-01

Pos logit contributions
wrapper+0.007
arity+0.007
istries+0.006
olson+0.006
inski+0.006
Neg logit contributions
 Dek-0.005
nc-0.005
 Jou-0.005
jug-0.005
�醒-0.005
Token
Token ID198
Feature activation+1.55e-01

Pos logit contributions
wrapper+0.165
arity+0.163
istries+0.149
olson+0.145
inski+0.145
Neg logit contributions
 Dek-0.139
nc-0.134
 Jou-0.130
jug-0.129
�醒-0.127
TokenWhile
Token ID3633
Feature activation+4.02e-01

Pos logit contributions
 Dek+0.091
nc+0.088
 Jou+0.085
jug+0.084
�醒+0.083
Neg logit contributions
wrapper-0.121
arity-0.120
istries-0.110
olson-0.108
inski-0.107
Token the
Token ID262
Feature activation-7.86e-02

Pos logit contributions
 Dek+0.191
nc+0.186
 Jou+0.179
jug+0.178
�醒+0.177
Neg logit contributions
wrapper-0.198
arity-0.196
istries-0.178
olson-0.173
inski-0.172
Token fund
Token ID1814
Feature activation+3.84e-01

Pos logit contributions
wrapper+0.055
arity+0.055
istries+0.050
olson+0.048
inski+0.048
Neg logit contributions
 Dek-0.052
nc-0.051
 Jou-0.049
jug-0.048
�醒-0.048
Token had
Token ID550
Feature activation+3.50e-01

Pos logit contributions
 Dek+0.239
nc+0.231
 Jou+0.222
jug+0.220
�醒+0.219
Neg logit contributions
wrapper-0.286
arity-0.284
istries-0.260
olson-0.252
inski-0.252
Token ended
Token ID4444
Feature activation+1.70e-01

Pos logit contributions
 Dek+0.240
nc+0.233
 Jou+0.225
jug+0.224
�醒+0.222
Neg logit contributions
wrapper-0.239
arity-0.237
istries-0.214
olson-0.207
inski-0.207
Token up
Token ID510
Feature activation+4.34e-01

Pos logit contributions
 Dek+0.125
nc+0.121
 Jou+0.118
jug+0.117
�醒+0.116
Neg logit contributions
wrapper-0.107
arity-0.106
istries-0.095
inski-0.092
olson-0.092
Token in
Token ID287
Feature activation+6.94e-01

Pos logit contributions
 Dek+0.276
nc+0.268
 Jou+0.257
jug+0.256
�醒+0.254
Neg logit contributions
wrapper-0.317
arity-0.314
istries-0.285
inski-0.278
olson-0.277
Token Prime
Token ID5537
Feature activation+2.98e-01

Pos logit contributions
 Dek+0.477
nc+0.465
 Jou+0.447
jug+0.446
�醒+0.444
Neg logit contributions
wrapper-0.470
arity-0.465
istries-0.420
olson-0.408
inski-0.407
Token Minister
Token ID4139
Feature activation-1.97e-01

Pos logit contributions
 Dek+0.223
nc+0.218
 Jou+0.210
jug+0.210
�醒+0.209
Neg logit contributions
wrapper-0.184
arity-0.182
istries-0.163
inski-0.158
olson-0.157
Token Naj
Token ID37135
Feature activation+5.14e-01

Pos logit contributions
wrapper+0.176
arity+0.175
istries+0.162
olson+0.158
inski+0.158
Neg logit contributions
 Dek-0.093
nc-0.090
 Jou-0.085
jug-0.084
�醒-0.083
Tokenib
Token ID571
Feature activation+8.17e-01

Pos logit contributions
 Dek+0.282
nc+0.272
 Jou+0.261
�醒+0.259
jug+0.258
Neg logit contributions
wrapper-0.418
arity-0.417
istries-0.383
inski-0.375
olson-0.374
Token Raz
Token ID38058
Feature activation+1.69e+00

Pos logit contributions
 Dek+0.664
nc+0.653
jug+0.632
 Jou+0.629
Registered+0.626
Neg logit contributions
wrapper-0.446
arity-0.440
istries-0.386
inski-0.378
olson-0.373
Token.
Token ID13
Feature activation+3.91e-01

Pos logit contributions
wrapper+0.181
arity+0.180
istries+0.164
olson+0.160
inski+0.160
Neg logit contributions
 Dek-0.149
nc-0.144
 Jou-0.139
jug-0.138
�醒-0.136
Token
Token ID198
Feature activation+1.18e-01

Pos logit contributions
 Dek+0.242
nc+0.235
 Jou+0.225
jug+0.225
�醒+0.221
Neg logit contributions
wrapper-0.291
arity-0.289
istries-0.264
olson-0.257
inski-0.257
Token
Token ID198
Feature activation+2.34e-01

Pos logit contributions
 Dek+0.074
nc+0.072
 Jou+0.069
jug+0.069
�醒+0.069
Neg logit contributions
wrapper-0.086
arity-0.086
istries-0.078
olson-0.076
inski-0.076
TokenFrench
Token ID24111
Feature activation+1.03e-01

Pos logit contributions
 Dek+0.147
nc+0.141
 Jou+0.137
jug+0.135
�醒+0.134
Neg logit contributions
wrapper-0.173
arity-0.172
istries-0.157
olson-0.152
inski-0.152
Token President
Token ID1992
Feature activation+7.63e-01

Pos logit contributions
 Dek+0.065
nc+0.063
 Jou+0.060
jug+0.060
�醒+0.059
Neg logit contributions
wrapper-0.076
arity-0.075
istries-0.069
olson-0.067
inski-0.067
Token Francois
Token ID40419
Feature activation+8.09e-01

Pos logit contributions
 Dek+0.517
nc+0.503
jug+0.483
 Jou+0.481
�醒+0.480
Neg logit contributions
wrapper-0.524
arity-0.519
istries-0.472
inski-0.458
olson-0.455
Token Hollande
Token ID33590
Feature activation-3.76e-02

Pos logit contributions
 Dek+0.560
nc+0.549
�醒+0.533
jug+0.532
 Jou+0.527
Neg logit contributions
wrapper-0.538
arity-0.527
istries-0.480
olson-0.470
inski-0.465
Token has
Token ID468
Feature activation+3.12e-02

Pos logit contributions
wrapper+0.027
arity+0.027
istries+0.025
olson+0.024
inski+0.024
Neg logit contributions
 Dek-0.024
nc-0.023
 Jou-0.022
jug-0.022
�醒-0.022
Token set
Token ID900
Feature activation+2.45e-01

Pos logit contributions
 Dek+0.021
nc+0.020
 Jou+0.019
jug+0.019
�醒+0.019
Neg logit contributions
wrapper-0.022
arity-0.022
istries-0.020
olson-0.019
inski-0.019
Token out
Token ID503
Feature activation+4.72e-01

Pos logit contributions
 Dek+0.167
nc+0.163
 Jou+0.157
jug+0.156
�醒+0.155
Neg logit contributions
wrapper-0.167
arity-0.165
istries-0.150
olson-0.146
inski-0.145
Token an
Token ID281
Feature activation+6.55e-01

Pos logit contributions
 Dek+0.303
nc+0.294
 Jou+0.282
jug+0.281
�醒+0.279
Neg logit contributions
wrapper-0.342
arity-0.339
istries-0.310
inski-0.300
olson-0.300
Token tall
Token ID7331
Feature activation-3.50e-02

Pos logit contributions
 Dek+0.071
nc+0.069
 Jou+0.066
jug+0.066
�醒+0.066
Neg logit contributions
wrapper-0.080
arity-0.079
istries-0.072
olson-0.070
inski-0.070
Token as
Token ID355
Feature activation+3.80e-01

Pos logit contributions
wrapper+0.026
arity+0.026
istries+0.024
olson+0.023
inski+0.023
Neg logit contributions
 Dek-0.021
nc-0.021
 Jou-0.020
jug-0.020
�醒-0.020
Token he
Token ID339
Feature activation-1.83e-01

Pos logit contributions
 Dek+0.248
nc+0.242
 Jou+0.233
jug+0.232
�醒+0.229
Neg logit contributions
wrapper-0.271
arity-0.268
istries-0.244
olson-0.237
inski-0.237
Token grants
Token ID11455
Feature activation+2.87e-01

Pos logit contributions
wrapper+0.134
arity+0.133
istries+0.121
olson+0.118
inski+0.118
Neg logit contributions
 Dek-0.116
nc-0.113
 Jou-0.108
jug-0.108
�醒-0.106
Token freedom
Token ID4925
Feature activation+3.08e-01

Pos logit contributions
 Dek+0.177
nc+0.172
 Jou+0.165
jug+0.164
�醒+0.162
Neg logit contributions
wrapper-0.215
arity-0.212
istries-0.195
olson-0.189
inski-0.189
Token to
Token ID284
Feature activation+6.57e-01

Pos logit contributions
 Dek+0.192
nc+0.186
 Jou+0.179
jug+0.177
�醒+0.175
Neg logit contributions
wrapper-0.229
arity-0.227
istries-0.207
olson-0.202
inski-0.202
Token a
Token ID257
Feature activation+6.37e-01

Pos logit contributions
 Dek+0.392
nc+0.382
 Jou+0.366
jug+0.363
�醒+0.362
Neg logit contributions
wrapper-0.503
arity-0.498
istries-0.456
inski-0.445
olson-0.444
Token crou
Token ID33838
Feature activation-5.55e-01

Pos logit contributions
 Dek+0.396
nc+0.387
 Jou+0.370
jug+0.368
�醒+0.365
Neg logit contributions
wrapper-0.472
arity-0.466
istries-0.426
inski-0.415
olson-0.414
Tokenching
Token ID10813
Feature activation+8.43e-01

Pos logit contributions
wrapper+0.294
arity+0.290
istries+0.256
olson+0.245
inski+0.245
Neg logit contributions
 Dek-0.464
nc-0.454
 Jou-0.441
jug-0.438
�醒-0.433
Token slave
Token ID11778
Feature activation+8.03e-01

Pos logit contributions
 Dek+0.513
nc+0.502
 Jou+0.480
jug+0.479
�醒+0.477
Neg logit contributions
wrapper-0.633
arity-0.627
istries-0.571
inski-0.560
olson-0.556
Token.
Token ID13
Feature activation+2.71e-01

Pos logit contributions
 Dek+0.512
nc+0.497
 Jou+0.479
jug+0.474
�醒+0.472
Neg logit contributions
wrapper-0.583
arity-0.578
istries-0.523
inski-0.514
olson-0.512
Token stay
Token ID2652
Feature activation+3.79e-01

Pos logit contributions
wrapper+0.002
arity+0.002
istries+0.002
olson+0.002
inski+0.002
Neg logit contributions
 Dek-0.002
nc-0.002
 Jou-0.002
jug-0.002
�醒-0.002
Token at
Token ID379
Feature activation+1.91e-01

Pos logit contributions
 Dek+0.244
nc+0.237
 Jou+0.228
�醒+0.227
jug+0.227
Neg logit contributions
wrapper-0.273
arity-0.270
istries-0.246
inski-0.240
olson-0.239
Token the
Token ID262
Feature activation-1.23e-01

Pos logit contributions
 Dek+0.127
nc+0.124
 Jou+0.119
jug+0.118
�醒+0.117
Neg logit contributions
wrapper-0.134
arity-0.133
istries-0.121
olson-0.117
inski-0.117
Token London
Token ID3576
Feature activation-5.27e-01

Pos logit contributions
wrapper+0.072
arity+0.071
istries+0.063
olson+0.061
inski+0.061
Neg logit contributions
 Dek-0.097
nc-0.094
 Jou-0.091
jug-0.091
�醒-0.090
Token embassy
Token ID18613
Feature activation-1.62e-01

Pos logit contributions
wrapper+0.315
arity+0.312
istries+0.279
olson+0.269
inski+0.268
Neg logit contributions
 Dek-0.405
nc-0.394
 Jou-0.383
jug-0.380
 Muscle-0.376
Token indefinitely
Token ID24391
Feature activation+4.95e-02

Pos logit contributions
wrapper+0.119
arity+0.118
istries+0.108
olson+0.105
inski+0.105
Neg logit contributions
 Dek-0.102
nc-0.099
 Jou-0.095
jug-0.094
�醒-0.093
Token,
Token ID11
Feature activation-4.10e-02

Pos logit contributions
 Dek+0.030
nc+0.029
 Jou+0.028
jug+0.028
�醒+0.028
Neg logit contributions
wrapper-0.037
arity-0.037
istries-0.034
olson-0.033
inski-0.033
Token although
Token ID3584
Feature activation+6.86e-02

Pos logit contributions
wrapper+0.029
arity+0.029
istries+0.026
olson+0.025
inski+0.025
Neg logit contributions
 Dek-0.027
nc-0.026
 Jou-0.025
jug-0.025
�醒-0.025
Token he
Token ID339
Feature activation+2.38e-01

Pos logit contributions
 Dek+0.044
nc+0.043
 Jou+0.041
jug+0.041
�醒+0.041
Neg logit contributions
wrapper-0.050
arity-0.049
istries-0.045
olson-0.044
inski-0.044
Token may
Token ID743
Feature activation+2.63e-01

Pos logit contributions
 Dek+0.152
nc+0.147
 Jou+0.142
jug+0.141
�醒+0.140
Neg logit contributions
wrapper-0.172
arity-0.171
istries-0.156
inski-0.152
olson-0.152
Token impose
Token ID13551
Feature activation+4.91e-01

Pos logit contributions
 Dek+0.181
nc+0.176
 Jou+0.170
jug+0.169
�醒+0.168
Neg logit contributions
wrapper-0.179
arity-0.177
istries-0.161
olson-0.156
inski-0.156
Token Earth
Token ID3668
Feature activation+5.09e-01

Pos logit contributions
 Dek+0.105
nc+0.102
 Jou+0.097
jug+0.097
�醒+0.096
Neg logit contributions
wrapper-0.139
arity-0.138
istries-0.127
olson-0.123
inski-0.123
Token did
Token ID750
Feature activation+5.67e-01

Pos logit contributions
 Dek+0.216
nc+0.207
 Jou+0.194
�醒+0.192
jug+0.192
Neg logit contributions
wrapper-0.480
arity-0.476
istries-0.444
olson-0.435
inski-0.434
Token stop
Token ID2245
Feature activation+1.71e-01

Pos logit contributions
 Dek+0.269
nc+0.259
 Jou+0.245
jug+0.243
�醒+0.240
Neg logit contributions
wrapper-0.506
arity-0.502
istries-0.466
olson-0.455
inski-0.455
Token moving
Token ID3867
Feature activation-3.12e-02

Pos logit contributions
 Dek+0.089
nc+0.085
 Jou+0.081
jug+0.080
�醒+0.080
Neg logit contributions
wrapper-0.146
arity-0.144
istries-0.134
olson-0.130
inski-0.130
Token suddenly
Token ID6451
Feature activation-1.06e-01

Pos logit contributions
wrapper+0.023
arity+0.023
istries+0.021
olson+0.020
inski+0.020
Neg logit contributions
 Dek-0.020
nc-0.019
 Jou-0.018
jug-0.018
�醒-0.018
Token,
Token ID11
Feature activation+1.13e-03

Pos logit contributions
wrapper+0.078
arity+0.078
istries+0.071
olson+0.069
inski+0.069
Neg logit contributions
 Dek-0.066
nc-0.064
 Jou-0.061
jug-0.061
�醒-0.060
Token there
Token ID612
Feature activation+3.74e-01

Pos logit contributions
 Dek+0.001
nc+0.001
 Jou+0.001
jug+0.001
�醒+0.001
Neg logit contributions
wrapper-0.001
arity-0.001
istries-0.001
olson-0.001
inski-0.001
Token would
Token ID561
Feature activation-6.30e-02

Pos logit contributions
 Dek+0.228
nc+0.220
 Jou+0.211
jug+0.211
�醒+0.209
Neg logit contributions
wrapper-0.284
arity-0.282
istries-0.257
olson-0.251
inski-0.251
Token be
Token ID307
Feature activation+1.18e-01

Pos logit contributions
wrapper+0.039
arity+0.039
istries+0.035
olson+0.034
inski+0.034
Neg logit contributions
 Dek-0.047
nc-0.045
 Jou-0.044
jug-0.044
�醒-0.043
Token an
Token ID281
Feature activation+2.99e-01

Pos logit contributions
 Dek+0.076
nc+0.074
 Jou+0.071
jug+0.071
�醒+0.070
Neg logit contributions
wrapper-0.085
arity-0.084
istries-0.076
olson-0.074
inski-0.074
Token awful
Token ID12659
Feature activation+1.05e-01

Pos logit contributions
 Dek+0.197
nc+0.191
 Jou+0.183
jug+0.183
�醒+0.182
Neg logit contributions
wrapper-0.212
arity-0.210
istries-0.190
olson-0.186
inski-0.186
Token
Token ID198
Feature activation-1.57e-01

Pos logit contributions
 Dek+0.024
nc+0.023
 Jou+0.022
jug+0.022
�醒+0.022
Neg logit contributions
wrapper-0.028
arity-0.028
istries-0.025
olson-0.024
inski-0.024
Token
Token ID447
Feature activation+3.30e-02

Pos logit contributions
wrapper+0.120
arity+0.119
istries+0.109
olson+0.106
inski+0.106
Neg logit contributions
 Dek-0.094
nc-0.091
 Jou-0.088
jug-0.087
�醒-0.086
Token
Token ID250
Feature activation-8.93e-02

Pos logit contributions
 Dek+0.030
nc+0.029
 Jou+0.029
jug+0.029
�醒+0.028
Neg logit contributions
wrapper-0.015
arity-0.015
istries-0.013
olson-0.012
inski-0.012
TokenOur
Token ID5122
Feature activation-3.72e-01

Pos logit contributions
wrapper+0.070
arity+0.070
istries+0.064
olson+0.063
inski+0.063
Neg logit contributions
 Dek-0.052
nc-0.050
 Jou-0.048
jug-0.047
�醒-0.047
Token grass
Token ID8701
Feature activation-3.14e-01

Pos logit contributions
wrapper+0.273
arity+0.271
istries+0.247
olson+0.241
inski+0.240
Neg logit contributions
 Dek-0.235
nc-0.228
 Jou-0.220
jug-0.218
�醒-0.215
Token-
Token ID12
Feature activation-7.69e-01

Pos logit contributions
wrapper+0.240
arity+0.238
istries+0.218
olson+0.212
inski+0.212
Neg logit contributions
 Dek-0.190
nc-0.184
 Jou-0.177
jug-0.175
�醒-0.173
Tokenroots
Token ID19150
Feature activation-2.85e-01

Pos logit contributions
wrapper+0.250
arity+0.245
istries+0.199
olson+0.184
inski+0.183
Neg logit contributions
 Dek-0.799
nc-0.785
 Jou-0.765
jug-0.763
Registered-0.756
Token momentum
Token ID12858
Feature activation-1.82e-01

Pos logit contributions
wrapper+0.203
arity+0.201
istries+0.183
olson+0.178
inski+0.178
Neg logit contributions
 Dek-0.187
nc-0.181
 Jou-0.175
jug-0.173
�醒-0.171
Token is
Token ID318
Feature activation+1.39e-01

Pos logit contributions
wrapper+0.134
arity+0.133
istries+0.121
olson+0.118
inski+0.118
Neg logit contributions
 Dek-0.115
nc-0.112
 Jou-0.107
jug-0.107
�醒-0.105
Token unstoppable
Token ID40181
Feature activation+1.82e-01

Pos logit contributions
 Dek+0.087
nc+0.084
 Jou+0.081
jug+0.080
�醒+0.080
Neg logit contributions
wrapper-0.103
arity-0.102
istries-0.093
inski-0.090
olson-0.090
Token,
Token ID11
Feature activation-1.70e-02

Pos logit contributions
 Dek+0.114
nc+0.110
 Jou+0.106
jug+0.106
�醒+0.104
Neg logit contributions
wrapper-0.134
arity-0.133
istries-0.121
inski-0.118
olson-0.118
Token fact
Token ID1109
Feature activation-3.18e-01

Pos logit contributions
 Dek+0.180
nc+0.175
 Jou+0.168
jug+0.168
�醒+0.165
Neg logit contributions
wrapper-0.206
arity-0.204
istries-0.186
olson-0.180
inski-0.180
Token that
Token ID326
Feature activation+1.98e-01

Pos logit contributions
wrapper+0.226
arity+0.224
istries+0.204
olson+0.198
inski+0.198
Neg logit contributions
 Dek-0.209
nc-0.202
 Jou-0.195
jug-0.194
�醒-0.191
Token he
Token ID339
Feature activation+2.02e-01

Pos logit contributions
 Dek+0.127
nc+0.124
 Jou+0.119
jug+0.118
�醒+0.117
Neg logit contributions
wrapper-0.142
arity-0.141
istries-0.128
inski-0.125
olson-0.125
Token viewed
Token ID9569
Feature activation+3.74e-01

Pos logit contributions
 Dek+0.126
nc+0.122
 Jou+0.118
jug+0.117
�醒+0.116
Neg logit contributions
wrapper-0.150
arity-0.149
istries-0.136
inski-0.132
olson-0.132
Token and
Token ID290
Feature activation+3.81e-01

Pos logit contributions
 Dek+0.249
nc+0.242
 Jou+0.233
jug+0.232
�醒+0.230
Neg logit contributions
wrapper-0.262
arity-0.259
istries-0.235
inski-0.229
olson-0.228
Token posted
Token ID4481
Feature activation+3.29e-01

Pos logit contributions
 Dek+0.231
nc+0.223
 Jou+0.215
jug+0.213
�醒+0.212
Neg logit contributions
wrapper-0.291
arity-0.288
istries-0.263
olson-0.257
inski-0.257
Token a
Token ID257
Feature activation+7.71e-02

Pos logit contributions
 Dek+0.219
nc+0.213
 Jou+0.206
jug+0.204
�醒+0.203
Neg logit contributions
wrapper-0.231
arity-0.228
istries-0.207
inski-0.201
olson-0.201
Token message
Token ID3275
Feature activation+2.79e-01

Pos logit contributions
 Dek+0.051
nc+0.050
 Jou+0.048
jug+0.048
�醒+0.047
Neg logit contributions
wrapper-0.054
arity-0.054
istries-0.049
olson-0.047
inski-0.047
Token in
Token ID287
Feature activation-8.67e-02

Pos logit contributions
 Dek+0.179
nc+0.173
 Jou+0.167
jug+0.166
�醒+0.164
Neg logit contributions
wrapper-0.203
arity-0.201
istries-0.184
inski-0.179
olson-0.178
Token this
Token ID428
Feature activation-1.26e-04

Pos logit contributions
wrapper+0.061
arity+0.060
istries+0.055
olson+0.053
inski+0.053
Neg logit contributions
 Dek-0.058
nc-0.056
 Jou-0.054
jug-0.054
�醒-0.053
Token forum
Token ID10041
Feature activation+3.30e-01

Pos logit contributions
wrapper+0.000
arity+0.000
istries+0.000
olson+0.000
inski+0.000
Neg logit contributions
 Dek-0.000
nc-0.000
 Jou-0.000
jug-0.000
�醒-0.000
Token hear
Token ID3285
Feature activation-3.86e-01

Pos logit contributions
wrapper+0.336
arity+0.332
istries+0.300
olson+0.290
inski+0.289
Neg logit contributions
 Dek-0.375
nc-0.365
 Jou-0.353
jug-0.350
�醒-0.346
Token from
Token ID422
Feature activation-5.27e-02

Pos logit contributions
wrapper+0.286
arity+0.284
istries+0.259
olson+0.252
inski+0.252
Neg logit contributions
 Dek-0.242
nc-0.234
 Jou-0.226
jug-0.224
�醒-0.221
Token the
Token ID262
Feature activation+1.37e-01

Pos logit contributions
wrapper+0.040
arity+0.039
istries+0.036
olson+0.035
inski+0.035
Neg logit contributions
 Dek-0.032
nc-0.031
 Jou-0.030
jug-0.030
�醒-0.030
Token candidates
Token ID5871
Feature activation-1.06e-01

Pos logit contributions
 Dek+0.082
nc+0.080
 Jou+0.077
jug+0.076
�醒+0.075
Neg logit contributions
wrapper-0.105
arity-0.104
istries-0.096
olson-0.093
inski-0.093
Token on
Token ID319
Feature activation-3.09e-01

Pos logit contributions
wrapper+0.080
arity+0.079
istries+0.073
olson+0.071
inski+0.071
Neg logit contributions
 Dek-0.065
nc-0.063
 Jou-0.060
jug-0.060
�醒-0.059
Token our
Token ID674
Feature activation-3.69e-02

Pos logit contributions
wrapper+0.227
arity+0.225
istries+0.205
olson+0.200
inski+0.199
Neg logit contributions
 Dek-0.196
nc-0.190
 Jou-0.183
jug-0.181
�醒-0.179
Token ballot
Token ID11100
Feature activation+3.78e-01

Pos logit contributions
wrapper+0.027
arity+0.026
istries+0.024
olson+0.023
inski+0.023
Neg logit contributions
 Dek-0.024
nc-0.023
 Jou-0.022
jug-0.022
�醒-0.022
Token in
Token ID287
Feature activation+1.31e-01

Pos logit contributions
 Dek+0.230
nc+0.222
 Jou+0.214
jug+0.212
�醒+0.210
Neg logit contributions
wrapper-0.286
arity-0.283
istries-0.260
inski-0.253
olson-0.252
Token the
Token ID262
Feature activation+4.80e-01

Pos logit contributions
 Dek+0.079
nc+0.076
 Jou+0.073
jug+0.073
�醒+0.072
Neg logit contributions
wrapper-0.100
arity-0.099
istries-0.091
olson-0.088
inski-0.088
Token #
Token ID1303
Feature activation+1.81e-01

Pos logit contributions
 Dek+0.285
nc+0.276
 Jou+0.265
jug+0.264
�醒+0.261
Neg logit contributions
wrapper-0.370
arity-0.366
istries-0.334
inski-0.327
olson-0.326
Tokendeb
Token ID11275
Feature activation+6.66e-01

Pos logit contributions
 Dek+0.109
nc+0.105
 Jou+0.102
jug+0.101
�醒+0.100
Neg logit contributions
wrapper-0.138
arity-0.137
istries-0.125
olson-0.122
inski-0.122
Token
Token ID198
Feature activation-2.18e-02

Pos logit contributions
wrapper+0.027
arity+0.027
istries+0.025
olson+0.024
inski+0.024
Neg logit contributions
 Dek-0.022
nc-0.022
 Jou-0.021
jug-0.021
�醒-0.020
Token
Token ID198
Feature activation+3.96e-02

Pos logit contributions
wrapper+0.016
arity+0.016
istries+0.015
olson+0.014
inski+0.014
Neg logit contributions
 Dek-0.014
nc-0.013
 Jou-0.013
jug-0.013
�醒-0.013
TokenNot
Token ID3673
Feature activation+2.98e-01

Pos logit contributions
 Dek+0.023
nc+0.022
 Jou+0.021
jug+0.021
�醒+0.021
Neg logit contributions
wrapper-0.031
arity-0.031
istries-0.028
olson-0.028
inski-0.028
Tokened
Token ID276
Feature activation-1.93e-03

Pos logit contributions
 Dek+0.186
nc+0.180
 Jou+0.173
jug+0.172
�醒+0.170
Neg logit contributions
wrapper-0.221
arity-0.219
istries-0.200
olson-0.195
inski-0.195
Token Republican
Token ID3415
Feature activation+6.03e-02

Pos logit contributions
wrapper+0.001
arity+0.001
istries+0.001
olson+0.001
inski+0.001
Neg logit contributions
 Dek-0.001
nc-0.001
 Jou-0.001
jug-0.001
�醒-0.001
Token poll
Token ID3278
Feature activation-1.14e+00

Pos logit contributions
 Dek+0.036
nc+0.034
 Jou+0.033
jug+0.033
�醒+0.032
Neg logit contributions
wrapper-0.047
arity-0.047
istries-0.043
olson-0.042
inski-0.042
Tokenster
Token ID1706
Feature activation-3.47e-01

Pos logit contributions
wrapper+0.737
arity+0.731
istries+0.658
olson+0.640
inski+0.634
Neg logit contributions
 Dek-0.826
nc-0.804
 Jou-0.779
jug-0.773
�醒-0.764
Token Frank
Token ID5278
Feature activation+3.29e-01

Pos logit contributions
wrapper+0.254
arity+0.251
istries+0.229
olson+0.223
inski+0.223
Neg logit contributions
 Dek-0.221
nc-0.214
 Jou-0.206
jug-0.205
�醒-0.202
Token L
Token ID406
Feature activation-5.27e-02

Pos logit contributions
 Dek+0.209
nc+0.203
 Jou+0.195
jug+0.193
�醒+0.192
Neg logit contributions
wrapper-0.239
arity-0.238
istries-0.217
inski-0.211
olson-0.210
Tokenunt
Token ID2797
Feature activation-3.00e-02

Pos logit contributions
wrapper+0.035
arity+0.035
istries+0.031
inski+0.031
olson+0.031
Neg logit contributions
 Dek-0.037
nc-0.036
 Jou-0.035
jug-0.034
�醒-0.034
Tokenz
Token ID89
Feature activation-1.45e-01

Pos logit contributions
wrapper+0.026
arity+0.025
istries+0.024
inski+0.023
olson+0.023
Neg logit contributions
 Dek-0.015
nc-0.015
 Jou-0.014
jug-0.014
�醒-0.014
Tokens
Token ID82
Feature activation+1.11e-01

Pos logit contributions
 Dek+0.080
nc+0.077
 Jou+0.075
jug+0.074
�醒+0.074
Neg logit contributions
wrapper-0.083
arity-0.083
istries-0.075
olson-0.073
inski-0.073
Token so
Token ID523
Feature activation+2.28e-03

Pos logit contributions
 Dek+0.073
nc+0.071
 Jou+0.068
jug+0.068
�醒+0.067
Neg logit contributions
wrapper-0.079
arity-0.079
istries-0.072
inski-0.070
olson-0.070
Token much
Token ID881
Feature activation-9.22e-02

Pos logit contributions
 Dek+0.002
nc+0.002
 Jou+0.001
jug+0.001
�醒+0.001
Neg logit contributions
wrapper-0.002
arity-0.002
istries-0.001
olson-0.001
inski-0.001
Token better
Token ID1365
Feature activation+3.89e-01

Pos logit contributions
wrapper+0.045
arity+0.044
istries+0.038
olson+0.037
inski+0.037
Neg logit contributions
 Dek-0.081
nc-0.079
 Jou-0.077
jug-0.077
�醒-0.076
Token than
Token ID621
Feature activation+3.15e-02

Pos logit contributions
 Dek+0.246
nc+0.238
 Jou+0.230
jug+0.228
�醒+0.226
Neg logit contributions
wrapper-0.285
arity-0.283
istries-0.258
inski-0.251
olson-0.251
Token b
Token ID275
Feature activation-1.50e+00

Pos logit contributions
 Dek+0.020
nc+0.020
 Jou+0.019
jug+0.019
�醒+0.019
Neg logit contributions
wrapper-0.023
arity-0.023
istries-0.021
inski-0.020
olson-0.020
Tokenuss
Token ID1046
Feature activation-3.62e-02

Pos logit contributions
wrapper+1.016
arity+0.999
istries+0.909
olson+0.880
inski+0.877
Neg logit contributions
 Dek-1.030
nc-1.009
jug-0.967
 Jou-0.966
Ger-0.946
Tokening
Token ID278
Feature activation+2.41e-01

Pos logit contributions
wrapper+0.033
arity+0.033
istries+0.030
olson+0.030
inski+0.030
Neg logit contributions
 Dek-0.017
nc-0.016
 Jou-0.015
jug-0.015
�醒-0.015
Token tables
Token ID8893
Feature activation+4.66e-01

Pos logit contributions
 Dek+0.149
nc+0.144
 Jou+0.139
jug+0.138
�醒+0.137
Neg logit contributions
wrapper-0.180
arity-0.179
istries-0.163
inski-0.159
olson-0.159
Token.
Token ID13
Feature activation+8.41e-02

Pos logit contributions
 Dek+0.295
nc+0.284
 Jou+0.274
jug+0.272
�醒+0.270
Neg logit contributions
wrapper-0.343
arity-0.340
istries-0.310
olson-0.302
inski-0.302
Token He
Token ID679
Feature activation+1.87e-01

Pos logit contributions
 Dek+0.052
nc+0.050
 Jou+0.048
jug+0.048
�醒+0.047
Neg logit contributions
wrapper-0.063
arity-0.063
istries-0.058
inski-0.056
olson-0.056
Token.
Token ID13
Feature activation+8.65e-02

Pos logit contributions
wrapper+0.187
arity+0.186
istries+0.170
olson+0.165
inski+0.165
Neg logit contributions
 Dek-0.154
nc-0.149
 Jou-0.144
jug-0.142
�醒-0.141
Token<|endoftext|>
Token ID50256
Feature activation-4.40e-02

Pos logit contributions
 Dek+0.051
nc+0.050
 Jou+0.047
jug+0.047
�醒+0.047
Neg logit contributions
wrapper-0.067
arity-0.066
istries-0.061
olson-0.059
inski-0.059
TokenSAN
Token ID36753
Feature activation-2.63e-01

Pos logit contributions
wrapper+0.034
arity+0.034
istries+0.031
olson+0.030
inski+0.030
Neg logit contributions
 Dek-0.026
nc-0.025
 Jou-0.024
jug-0.024
�醒-0.024
Token FR
Token ID8782
Feature activation+1.05e-01

Pos logit contributions
wrapper+0.155
arity+0.154
istries+0.137
olson+0.132
inski+0.132
Neg logit contributions
 Dek-0.204
nc-0.199
 Jou-0.193
jug-0.192
�醒-0.190
TokenANC
Token ID20940
Feature activation-9.55e-01

Pos logit contributions
 Dek+0.057
nc+0.055
 Jou+0.053
jug+0.053
�醒+0.052
Neg logit contributions
wrapper-0.086
arity-0.085
istries-0.078
olson-0.077
inski-0.077
TokenIS
Token ID1797
Feature activation-1.97e+00

Pos logit contributions
wrapper+0.851
arity+0.843
istries+0.782
olson+0.764
inski+0.763
Neg logit contributions
 Dek-0.460
nc-0.438
 Jou-0.418
jug-0.409
Ger-0.407
TokenCO
Token ID8220
Feature activation-4.35e-01

Pos logit contributions
wrapper+1.363
arity+1.343
istries+1.243
inski+1.188
olson+1.179
Neg logit contributions
 Dek-1.350
nc-1.298
 Jou-1.256
Ger-1.238
jug-1.218
Token -
Token ID532
Feature activation-1.42e-01

Pos logit contributions
wrapper+0.326
arity+0.323
istries+0.296
olson+0.288
inski+0.287
Neg logit contributions
 Dek-0.268
nc-0.260
 Jou-0.250
jug-0.248
�醒-0.244
Token Reddit
Token ID10750
Feature activation-1.96e-01

Pos logit contributions
wrapper+0.108
arity+0.107
istries+0.098
olson+0.095
inski+0.095
Neg logit contributions
 Dek-0.086
nc-0.083
 Jou-0.080
jug-0.079
�醒-0.078
Token and
Token ID290
Feature activation+2.61e-01

Pos logit contributions
wrapper+0.141
arity+0.140
istries+0.127
olson+0.124
inski+0.124
Neg logit contributions
 Dek-0.127
nc-0.123
 Jou-0.119
jug-0.118
�醒-0.116
Token Google
Token ID3012
Feature activation-4.26e-01

Pos logit contributions
 Dek+0.162
nc+0.157
 Jou+0.151
jug+0.150
�醒+0.149
Neg logit contributions
wrapper-0.195
arity-0.194
istries-0.177
olson-0.172
inski-0.172
Token payments
Token ID7524
Feature activation-5.62e-02

Pos logit contributions
wrapper+0.248
arity+0.246
istries+0.224
olson+0.218
inski+0.218
Neg logit contributions
 Dek-0.222
nc-0.216
 Jou-0.208
jug-0.206
�醒-0.204
Token to
Token ID284
Feature activation+2.38e-01

Pos logit contributions
wrapper+0.042
arity+0.041
istries+0.038
olson+0.037
inski+0.037
Neg logit contributions
 Dek-0.035
nc-0.034
 Jou-0.033
jug-0.033
�醒-0.032
Token the
Token ID262
Feature activation+1.35e-02

Pos logit contributions
 Dek+0.151
nc+0.146
 Jou+0.141
jug+0.140
�醒+0.139
Neg logit contributions
wrapper-0.174
arity-0.173
istries-0.157
olson-0.154
inski-0.153
Token National
Token ID2351
Feature activation+2.22e-02

Pos logit contributions
 Dek+0.009
nc+0.008
 Jou+0.008
jug+0.008
�醒+0.008
Neg logit contributions
wrapper-0.010
arity-0.010
istries-0.009
olson-0.009
inski-0.009
Token Radio
Token ID8829
Feature activation-6.79e-01

Pos logit contributions
 Dek+0.016
nc+0.016
 Jou+0.015
jug+0.015
�醒+0.015
Neg logit contributions
wrapper-0.014
arity-0.014
istries-0.013
olson-0.012
inski-0.012
Token Astron
Token ID25398
Feature activation-1.09e+00

Pos logit contributions
wrapper+0.508
arity+0.502
istries+0.460
olson+0.448
inski+0.447
Neg logit contributions
 Dek-0.422
nc-0.409
 Jou-0.393
jug-0.390
 Muscle-0.384
Tokenomy
Token ID9145
Feature activation-7.74e-01

Pos logit contributions
wrapper+0.922
arity+0.912
istries+0.845
olson+0.827
inski+0.825
Neg logit contributions
 Dek-0.561
nc-0.540
 Jou-0.518
jug-0.510
 Muscle-0.502
Token Observatory
Token ID25758
Feature activation-2.08e-01

Pos logit contributions
wrapper+0.529
arity+0.523
istries+0.475
olson+0.462
inski+0.459
Neg logit contributions
 Dek-0.530
nc-0.514
 Jou-0.498
jug-0.492
 Muscle-0.488
Token in
Token ID287
Feature activation-2.52e-01

Pos logit contributions
wrapper+0.156
arity+0.154
istries+0.141
olson+0.137
inski+0.137
Neg logit contributions
 Dek-0.129
nc-0.125
 Jou-0.120
jug-0.119
�醒-0.117
Token Charlottesville
Token ID24756
Feature activation-7.92e-05

Pos logit contributions
wrapper+0.184
arity+0.182
istries+0.166
olson+0.162
inski+0.161
Neg logit contributions
 Dek-0.161
nc-0.156
 Jou-0.150
jug-0.149
�醒-0.147
Token,
Token ID11
Feature activation-1.93e-01

Pos logit contributions
wrapper+0.000
arity+0.000
istries+0.000
olson+0.000
inski+0.000
Neg logit contributions
 Dek-0.000
nc-0.000
 Jou-0.000
jug-0.000
�醒-0.000
Token to
Token ID284
Feature activation+1.16e-01

Pos logit contributions
wrapper+0.261
arity+0.259
istries+0.237
olson+0.230
inski+0.230
Neg logit contributions
 Dek-0.216
nc-0.209
 Jou-0.201
jug-0.200
�醒-0.197
Token legend
Token ID8177
Feature activation-1.11e-01

Pos logit contributions
 Dek+0.076
nc+0.074
 Jou+0.071
jug+0.071
�醒+0.070
Neg logit contributions
wrapper-0.082
arity-0.081
istries-0.074
olson-0.072
inski-0.072
Token,"
Token ID553
Feature activation-1.40e-01

Pos logit contributions
wrapper+0.076
arity+0.075
istries+0.068
olson+0.066
inski+0.066
Neg logit contributions
 Dek-0.076
nc-0.074
 Jou-0.072
jug-0.071
�醒-0.070
Token U
Token ID471
Feature activation-2.74e-01

Pos logit contributions
wrapper+0.106
arity+0.105
istries+0.096
olson+0.094
inski+0.094
Neg logit contributions
 Dek-0.085
nc-0.082
 Jou-0.079
jug-0.078
�醒-0.077
Tokenmen
Token ID3653
Feature activation-1.97e-01

Pos logit contributions
wrapper+0.198
arity+0.196
istries+0.179
olson+0.174
inski+0.174
Neg logit contributions
 Dek-0.177
nc-0.171
 Jou-0.165
jug-0.164
�醒-0.162
Tokenh
Token ID71
Feature activation-1.48e+00

Pos logit contributions
wrapper+0.143
arity+0.141
istries+0.129
olson+0.125
inski+0.125
Neg logit contributions
 Dek-0.127
nc-0.123
 Jou-0.119
jug-0.118
�醒-0.116
Tokenofer
Token ID30288
Feature activation-1.03e-01

Pos logit contributions
wrapper+0.861
arity+0.848
istries+0.761
olson+0.732
inski+0.721
Neg logit contributions
 Dek-1.162
nc-1.129
 Jou-1.103
jug-1.087
 Muscle-1.073
Token says
Token ID1139
Feature activation-2.46e-02

Pos logit contributions
wrapper+0.081
arity+0.081
istries+0.074
olson+0.072
inski+0.072
Neg logit contributions
 Dek-0.060
nc-0.058
 Jou-0.055
jug-0.055
�醒-0.054
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
wrapper+0.018
arity+0.018
istries+0.017
olson+0.016
inski+0.016
Neg logit contributions
 Dek-0.015
nc-0.015
 Jou-0.014
jug-0.014
�醒-0.014
Token
Token ID198
Feature activation+3.20e-02

Pos logit contributions
 Dek+0.079
nc+0.076
 Jou+0.073
jug+0.073
�醒+0.072
Neg logit contributions
wrapper-0.095
arity-0.094
istries-0.086
olson-0.083
inski-0.083
Token
Token ID198
Feature activation+7.15e-02

Pos logit contributions
 Dek+0.020
nc+0.020
 Jou+0.019
jug+0.019
�醒+0.019
Neg logit contributions
wrapper-0.024
arity-0.023
istries-0.021
olson-0.021
inski-0.021