TOP ACTIVATIONS
MAX = 2.097

Diane Rodham Clinton Sanders : ' I fully expect ' fair
Diane Rodham Clinton Sanders : ' I fully expect ' fair
Diane Rodham Clinton Sanders : ' I fully expect ' fair
not a robot by clicking the box . Invalid email address
not a robot by clicking the box . Invalid email address
Clinton staff are ' big gest a -- holes in American
Clinton staff are ' big gest a -- holes in American
not a robot by clicking the box . Invalid email address
Clinton staff are ' big gest a -- holes in American
not a robot by clicking the box . Invalid email address
. service You should notice network manager drop and
by DNC in 2020 after ' not quite even handed '
by DNC in 2020 after ' not quite even handed '
after three decades at Q ant as . " The cards
<|endoftext|> Image copyright ESA Image caption Go ce uses an electric
football fans quickly turned their heart ache into an unhealthy devotion
( in out app data _ full v ) { //
, Black is saying
August 2011 United States 6 12 Posts # 17 Good to
Caption 4 of 15 Photos : Carly Fiorina 's political career

MIN ACTIVATIONS
MIN = -1.733

served as the US em iss ary to the induction of
Leadership . END OF TRAN SM ISSION /// ---------------------------------------------------------------- ---------------------------------------------------------------- ------------------------
- sw orn - en m ity approach to serious ,
s Ring vs Kar am et ra , God of
- ways - law - enforcement - will - use -
scrib bled things like Ryan Se ac rest , Louis Vu
in . um ich . edu /. <|endoftext|> I remember about
press secretary . <|endoftext|> AL B ION are delighted to announce
media for TV personality Ryan Se ac rest 's production company
eng in . um ich . edu /. <|endoftext|> I remember
uman used to speak with Saur on and through which Sar
42 3 355 ; mart in rand all . com ),
in team history ( Patri k Elias , Claude Lem ie
et rah yd ro - 2 H - p yr an
a large reservoir of M . le p ra e and
neighbours , Dust y Al br itton . "
the comparison instead of , say , listing the readily -
have been designed to dis em power and divide our communities
another . Huge , En orm ous , Titanic . The
34 ) " I can understand people getting critical about them

INTERVAL 1.139 - 2.097
CONTAINS 0.042%

and services . Thank you for subscribing . An error has
your stories ! Do you have a story for The Sun
a robot by clicking the box . Invalid email address .
running , pl anking and lung es . ( Wa ub
The Frenchman set a lap time of 1 : 54 .

INTERVAL 0.182 - 1.139
CONTAINS 26.376%

Bre cher / NBC News The Environmental Protection Agency
ine saying that he 's essentially operating in the party interest
. T oss out everything you can imagine about
now looking for ways to be able to direct more light
ve heard that type of rhetoric before ,

INTERVAL -0.775 - 0.182
CONTAINS 72.876%

Light year in black - and - white . The supporters
. Brian Bab ine au / NBA E / Getty Images
Herod ot us begins the Australian Alan Moore head
and I don t want to feel like I
ballot because the doctrine of " E stop el " did

INTERVAL -1.733 - -0.775
CONTAINS 0.706%

are odd echoes of M . Night Shy am alan
." These sc ammers tend to request payment up
planet , no matter what our backgrounds may be ,
less (@ Real Skip Bay less ) January 16 , 2017
it , embrace it . Like you can joke ,

Token Diane
Token ID26542
Feature activation+9.44e-01

Pos logit contributions
hift+3.651
 revol+3.104
 painfully+2.922
 captives+2.900
 comfortable+2.825
Neg logit contributions
 Niet-5.665
plet-4.985
ById-4.981
otle-4.939
SourceFile-4.907
Token Rodham
Token ID41848
Feature activation+2.22e-01

Pos logit contributions
hift+3.208
 revol+2.754
 captives+2.520
 painfully+2.419
 Syri+2.327
Neg logit contributions
 Niet-6.048
ashington-5.276
plet-5.231
SourceFile-5.231
ById-5.186
Token Clinton
Token ID2605
Feature activation+6.26e-01

Pos logit contributions
hift+1.093
 revol+0.965
 captives+0.917
 painfully+0.903
Sense+0.882
Neg logit contributions
 Niet-1.111
ById-0.943
ashington-0.926
otle-0.917
plet-0.916
TokenSanders
Token ID26747
Feature activation+1.21e+00

Pos logit contributions
hift+2.930
 revol+2.561
 captives+2.428
Sense+2.422
 painfully+2.392
Neg logit contributions
 Niet-3.232
otle-2.730
ById-2.730
ashington-2.718
plet-2.712
Token:
Token ID25
Feature activation+1.05e+00

Pos logit contributions
hift+5.518
 revol+4.908
 captives+4.659
 painfully+4.564
 stances+4.528
Neg logit contributions
 Niet-6.236
ById-5.347
plet-5.298
otle-5.278
ashington-5.245
Token '
Token ID705
Feature activation+2.10e+00

Pos logit contributions
hift+4.475
 revol+4.293
 captives+3.951
 Syri+3.809
 reckoned+3.799
Neg logit contributions
 Niet-5.167
ById-4.774
SourceFile-4.615
ashington-4.598
icro-4.554
TokenI
Token ID40
Feature activation+5.82e-01

Pos logit contributions
hift+6.567
Sense+5.951
 revol+5.483
 captives+5.277
vice+5.016
Neg logit contributions
 Niet-12.614
sterdam-11.660
ashington-11.509
SourceFile-11.499
otle-11.441
Token fully
Token ID3938
Feature activation-4.78e-01

Pos logit contributions
hift+2.991
 revol+2.670
 captives+2.534
 painfully+2.474
Sense+2.453
Neg logit contributions
 Niet-2.749
ById-2.338
ashington-2.304
otle-2.291
plet-2.268
Token expect
Token ID1607
Feature activation+1.14e-01

Pos logit contributions
 Niet+2.133
ashington+1.805
ById+1.804
otle+1.756
plet+1.743
Neg logit contributions
hift-2.589
 revol-2.279
 captives-2.184
 painfully-2.126
Sense-2.124
Token'
Token ID6
Feature activation+5.24e-01

Pos logit contributions
hift+0.557
 revol+0.497
 captives+0.467
 painfully+0.459
Sense+0.449
Neg logit contributions
 Niet-0.570
ById-0.496
ashington-0.494
otle-0.484
icro-0.480
Token fair
Token ID3148
Feature activation-1.54e-01

Pos logit contributions
hift+2.272
 revol+2.010
 captives+1.923
 painfully+1.853
Sense+1.805
Neg logit contributions
 Niet-2.866
ashington-2.488
SourceFile-2.486
ById-2.451
icro-2.413
Token Diane
Token ID26542
Feature activation+7.25e-01

Pos logit contributions
hift+3.391
 revol+2.836
 painfully+2.687
 captives+2.666
 comfortable+2.582
Neg logit contributions
 Niet-5.364
plet-4.811
ById-4.732
otle-4.722
SourceFile-4.706
Token Rodham
Token ID41848
Feature activation+1.81e-01

Pos logit contributions
hift+2.695
 revol+2.330
 captives+2.140
 painfully+2.059
 Syri+1.997
Neg logit contributions
 Niet-4.446
ashington-3.890
plet-3.854
ById-3.821
SourceFile-3.820
Token Clinton
Token ID2605
Feature activation+7.36e-01

Pos logit contributions
hift+0.901
 revol+0.792
 captives+0.756
 painfully+0.742
Sense+0.729
Neg logit contributions
 Niet-0.891
ById-0.757
ashington-0.745
plet-0.736
otle-0.736
TokenSanders
Token ID26747
Feature activation+1.34e+00

Pos logit contributions
hift+3.415
 revol+2.995
 captives+2.838
Sense+2.815
 painfully+2.798
Neg logit contributions
 Niet-3.828
ById-3.240
otle-3.232
ashington-3.213
plet-3.192
Token:
Token ID25
Feature activation+9.90e-01

Pos logit contributions
hift+5.907
 revol+5.249
 captives+5.039
 painfully+4.945
 stances+4.933
Neg logit contributions
 Niet-7.146
ById-6.112
plet-6.050
otle-5.989
ashington-5.914
Token '
Token ID705
Feature activation+2.03e+00

Pos logit contributions
hift+4.031
 revol+3.953
 captives+3.602
 Syri+3.520
 Oath+3.466
Neg logit contributions
 Niet-4.921
ById-4.643
icro-4.477
ashington-4.475
SourceFile-4.474
TokenI
Token ID40
Feature activation+5.57e-01

Pos logit contributions
hift+6.325
Sense+5.640
 revol+5.408
 captives+5.206
 stances+4.759
Neg logit contributions
 Niet-12.240
sterdam-11.229
SourceFile-11.210
otle-11.025
plet-11.024
Token fully
Token ID3938
Feature activation-5.16e-01

Pos logit contributions
hift+2.851
 revol+2.549
 captives+2.419
 painfully+2.361
Sense+2.339
Neg logit contributions
 Niet-2.643
ById-2.248
ashington-2.213
otle-2.204
plet-2.183
Token expect
Token ID1607
Feature activation+5.95e-02

Pos logit contributions
 Niet+2.341
ById+1.989
ashington+1.987
otle+1.937
plet+1.920
Neg logit contributions
hift-2.757
 revol-2.417
 captives-2.314
 painfully-2.254
Sense-2.253
Token'
Token ID6
Feature activation+4.56e-01

Pos logit contributions
hift+0.289
 revol+0.259
 captives+0.244
 painfully+0.240
Sense+0.235
Neg logit contributions
 Niet-0.296
ById-0.258
ashington-0.257
otle-0.251
icro-0.249
Token fair
Token ID3148
Feature activation-1.94e-01

Pos logit contributions
hift+2.035
 revol+1.813
 captives+1.734
 painfully+1.690
 reckoned+1.652
Neg logit contributions
 Niet-2.412
SourceFile-2.100
ashington-2.099
ById-2.080
icro-2.028
Token Diane
Token ID26542
Feature activation+7.18e-01

Pos logit contributions
hift+3.375
 revol+2.934
 painfully+2.760
 captives+2.739
 comfortable+2.648
Neg logit contributions
 Niet-5.540
plet-4.993
otle-4.906
ById-4.889
SourceFile-4.860
Token Rodham
Token ID41848
Feature activation+2.06e-01

Pos logit contributions
hift+2.621
 revol+2.265
 captives+2.082
 painfully+2.001
 Syri+1.933
Neg logit contributions
 Niet-4.444
ashington-3.892
SourceFile-3.847
plet-3.845
otle-3.815
Token Clinton
Token ID2605
Feature activation+6.88e-01

Pos logit contributions
hift+1.018
 revol+0.895
 captives+0.855
 painfully+0.840
Sense+0.824
Neg logit contributions
 Niet-1.024
ById-0.867
ashington-0.854
otle-0.847
plet-0.843
TokenSanders
Token ID26747
Feature activation+1.26e+00

Pos logit contributions
hift+3.158
 revol+2.760
 captives+2.613
Sense+2.598
 painfully+2.581
Neg logit contributions
 Niet-3.603
ById-3.050
otle-3.049
ashington-3.040
plet-3.008
Token:
Token ID25
Feature activation+9.86e-01

Pos logit contributions
hift+5.558
 revol+4.947
 captives+4.766
 painfully+4.634
 stances+4.627
Neg logit contributions
 Niet-6.681
ById-5.670
otle-5.652
plet-5.643
ashington-5.497
Token '
Token ID705
Feature activation+1.84e+00

Pos logit contributions
hift+4.054
 revol+3.951
 captives+3.643
 Syri+3.550
 reckoned+3.476
Neg logit contributions
 Niet-4.893
ById-4.554
ashington-4.458
SourceFile-4.458
icro-4.418
TokenI
Token ID40
Feature activation+5.31e-01

Pos logit contributions
hift+6.094
Sense+5.442
 revol+5.290
 captives+5.130
 stances+4.693
Neg logit contributions
 Niet-10.807
SourceFile-9.897
ashington-9.847
sterdam-9.840
otle-9.839
Token fully
Token ID3938
Feature activation-5.37e-01

Pos logit contributions
hift+2.740
 revol+2.450
 captives+2.327
 painfully+2.271
Sense+2.249
Neg logit contributions
 Niet-2.502
ById-2.123
ashington-2.096
otle-2.085
plet-2.065
Token expect
Token ID1607
Feature activation+3.05e-02

Pos logit contributions
 Niet+2.407
ById+2.040
ashington+2.033
otle+1.984
plet+1.968
Neg logit contributions
hift-2.892
 revol-2.545
 captives-2.435
 painfully-2.374
Sense-2.370
Token'
Token ID6
Feature activation+4.20e-01

Pos logit contributions
hift+0.148
 revol+0.133
 captives+0.124
 painfully+0.122
Sense+0.120
Neg logit contributions
 Niet-0.152
ById-0.133
ashington-0.133
otle-0.130
icro-0.129
Token fair
Token ID3148
Feature activation-1.46e-01

Pos logit contributions
hift+1.849
 revol+1.641
 captives+1.565
 painfully+1.519
 reckoned+1.493
Neg logit contributions
 Niet-2.232
ashington-1.972
SourceFile-1.951
ById-1.929
plet-1.905
Token not
Token ID407
Feature activation+2.19e-01

Pos logit contributions
hift+2.303
 revol+2.009
 captives+1.852
 painfully+1.848
Sense+1.773
Neg logit contributions
 Niet-3.208
otle-2.746
ashington-2.734
ById-2.726
plet-2.681
Token a
Token ID257
Feature activation+3.36e-01

Pos logit contributions
hift+0.887
 revol+0.761
 captives+0.704
 painfully+0.693
Sense+0.666
Neg logit contributions
 Niet-1.316
ashington-1.117
otle-1.107
ById-1.103
plet-1.100
Token robot
Token ID9379
Feature activation+1.94e-01

Pos logit contributions
hift+1.204
 revol+1.030
 captives+0.944
 painfully+0.925
 reckoned+0.876
Neg logit contributions
 Niet-2.136
ById-1.863
ashington-1.857
otle-1.841
SourceFile-1.824
Token by
Token ID416
Feature activation+9.49e-01

Pos logit contributions
hift+0.859
 revol+0.764
 captives+0.717
 painfully+0.703
 reckoned+0.685
Neg logit contributions
 Niet-1.079
plet-0.898
ashington-0.893
ById-0.892
otle-0.890
Token clicking
Token ID12264
Feature activation+1.35e+00

Pos logit contributions
hift+3.075
 revol+2.548
 painfully+2.285
 captives+2.279
Sense+2.197
Neg logit contributions
 Niet-6.301
ById-5.548
ashington-5.526
otle-5.460
SourceFile-5.432
Token the
Token ID262
Feature activation+1.79e+00

Pos logit contributions
hift+4.229
 revol+3.375
 captives+3.162
 painfully+3.103
Sense+3.096
Neg logit contributions
 Niet-8.813
ashington-7.503
ById-7.442
plet-7.404
otle-7.401
Token box
Token ID3091
Feature activation+1.18e+00

Pos logit contributions
hift+5.128
 revol+4.279
 painfully+4.069
 captives+3.995
 comfortable+3.953
Neg logit contributions
 Niet-11.711
otle-10.100
ゼウス-10.057
ovember-9.997
ashington-9.954
Token.
Token ID13
Feature activation+9.98e-01

Pos logit contributions
hift+3.788
 revol+2.931
 Syri+2.767
Sense+2.761
 captives+2.742
Neg logit contributions
 Niet-7.698
ById-6.550
plet-6.535
sterdam-6.534
ovember-6.505
Token Invalid
Token ID17665
Feature activation-8.35e-02

Pos logit contributions
hift+3.057
Sense+2.453
 revol+2.398
 Syri+2.239
 reckoned+2.228
Neg logit contributions
 Niet-6.456
sterdam-5.760
plet-5.707
otle-5.692
ovember-5.687
Token email
Token ID3053
Feature activation+4.60e-02

Pos logit contributions
 Niet+0.366
ById+0.300
ashington+0.300
otle+0.295
plet+0.292
Neg logit contributions
hift-0.464
 revol-0.416
 captives-0.398
 painfully-0.390
Sense-0.384
Token address
Token ID2209
Feature activation+6.03e-01

Pos logit contributions
hift+0.215
 revol+0.188
 captives+0.178
 painfully+0.177
Sense+0.172
Neg logit contributions
 Niet-0.242
ById-0.204
otle-0.203
ashington-0.201
plet-0.198
Token not
Token ID407
Feature activation+2.48e-01

Pos logit contributions
hift+2.320
 revol+2.030
 captives+1.861
 painfully+1.860
Sense+1.789
Neg logit contributions
 Niet-3.232
otle-2.773
ashington-2.763
ById-2.748
plet-2.712
Token a
Token ID257
Feature activation+2.50e-01

Pos logit contributions
hift+1.010
 revol+0.867
 captives+0.801
 painfully+0.789
 reckoned+0.761
Neg logit contributions
 Niet-1.482
ashington-1.259
otle-1.248
ById-1.245
plet-1.240
Token robot
Token ID9379
Feature activation+3.36e-01

Pos logit contributions
hift+0.905
 revol+0.777
 captives+0.708
 painfully+0.694
 reckoned+0.665
Neg logit contributions
 Niet-1.586
ById-1.386
ashington-1.376
otle-1.366
SourceFile-1.348
Token by
Token ID416
Feature activation+8.66e-01

Pos logit contributions
hift+1.460
 revol+1.315
 captives+1.232
 painfully+1.205
 reckoned+1.172
Neg logit contributions
 Niet-1.873
plet-1.564
ById-1.554
ashington-1.550
otle-1.550
Token clicking
Token ID12264
Feature activation+1.33e+00

Pos logit contributions
hift+2.834
 revol+2.371
 painfully+2.102
 captives+2.087
Sense+2.026
Neg logit contributions
 Niet-5.726
ById-5.058
ashington-5.047
otle-4.984
plet-4.953
Token the
Token ID262
Feature activation+1.78e+00

Pos logit contributions
hift+4.154
 revol+3.370
 captives+3.098
 painfully+3.061
Sense+3.046
Neg logit contributions
 Niet-8.718
ashington-7.446
ById-7.345
otle-7.339
plet-7.337
Token box
Token ID3091
Feature activation+1.15e+00

Pos logit contributions
hift+4.988
 revol+4.278
 painfully+3.980
 comfortable+3.878
 captives+3.846
Neg logit contributions
 Niet-11.944
otle-10.215
ゼウス-10.166
ovember-10.125
ashington-10.089
Token.
Token ID13
Feature activation+9.16e-01

Pos logit contributions
hift+3.713
 revol+2.916
 captives+2.703
Sense+2.698
 painfully+2.693
Neg logit contributions
 Niet-7.490
ById-6.346
sterdam-6.315
plet-6.311
ovember-6.287
Token Invalid
Token ID17665
Feature activation-2.16e-01

Pos logit contributions
hift+2.825
 revol+2.274
Sense+2.269
 reckoned+2.092
 Syri+2.063
Neg logit contributions
 Niet-5.928
otle-5.267
sterdam-5.254
plet-5.231
ovember-5.192
Token email
Token ID3053
Feature activation+5.89e-03

Pos logit contributions
 Niet+0.925
ById+0.762
ashington+0.760
otle+0.749
plet+0.738
Neg logit contributions
hift-1.214
 revol-1.091
 captives-1.043
 painfully-1.021
Sense-1.009
Token address
Token ID2209
Feature activation+5.07e-01

Pos logit contributions
hift+0.028
 revol+0.024
 captives+0.023
 painfully+0.023
Sense+0.022
Neg logit contributions
 Niet-0.030
ById-0.026
otle-0.026
ashington-0.026
plet-0.025
Token Clinton
Token ID2605
Feature activation+5.86e-01

Pos logit contributions
hift+2.754
 revol+2.398
 captives+2.384
Sense+2.299
 Syri+2.293
Neg logit contributions
 Niet-2.314
icro-1.973
ashington-1.955
ById-1.945
SourceFile-1.937
Token staff
Token ID3085
Feature activation+5.99e-01

Pos logit contributions
hift+2.644
 revol+2.329
 captives+2.201
 painfully+2.139
 reckoned+2.100
Neg logit contributions
 Niet-3.132
ById-2.717
ashington-2.698
otle-2.672
plet-2.652
Token are
Token ID389
Feature activation+9.81e-01

Pos logit contributions
hift+2.338
 revol+2.072
 captives+1.930
 painfully+1.878
 reckoned+1.828
Neg logit contributions
 Niet-3.546
ById-3.137
plet-3.101
ashington-3.087
icro-3.033
Token '
Token ID705
Feature activation+1.42e+00

Pos logit contributions
hift+4.523
 revol+4.017
 captives+3.832
 painfully+3.743
 reckoned+3.640
Neg logit contributions
 Niet-5.101
ById-4.430
otle-4.379
ashington-4.342
plet-4.327
Tokenbig
Token ID14261
Feature activation+7.87e-01

Pos logit contributions
hift+6.892
 revol+6.186
 captives+5.845
 painfully+5.651
Sense+5.504
Neg logit contributions
 Niet-6.983
otle-6.188
ById-6.178
SourceFile-6.053
icro-5.886
Tokengest
Token ID3495
Feature activation+1.76e+00

Pos logit contributions
hift+3.592
 revol+3.140
 captives+2.976
 painfully+2.881
 reckoned+2.834
Neg logit contributions
 Niet-4.158
ById-3.617
ashington-3.610
otle-3.571
plet-3.516
Token a
Token ID257
Feature activation-1.83e-02

Pos logit contributions
hift+6.884
 revol+5.956
 captives+5.945
 painfully+5.593
 servants+5.345
Neg logit contributions
 Niet-9.702
SourceFile-9.050
ashington-8.852
ゼウス-8.807
sterdam-8.629
Token--
Token ID438
Feature activation-7.23e-02

Pos logit contributions
 Niet+0.087
ById+0.074
ashington+0.074
otle+0.073
plet+0.073
Neg logit contributions
hift-0.093
 revol-0.082
 captives-0.078
 painfully-0.076
Sense-0.076
Tokenholes
Token ID28439
Feature activation+2.39e-01

Pos logit contributions
 Niet+0.339
ById+0.288
ashington+0.287
otle+0.284
plet+0.282
Neg logit contributions
hift-0.373
 revol-0.331
 captives-0.316
 painfully-0.308
Sense-0.305
Token in
Token ID287
Feature activation+1.88e-01

Pos logit contributions
hift+1.203
 revol+1.055
 captives+0.999
 painfully+0.971
Sense+0.958
Neg logit contributions
 Niet-1.189
ashington-1.005
ById-1.005
otle-0.992
plet-0.982
Token American
Token ID1605
Feature activation-7.96e-02

Pos logit contributions
hift+0.828
 revol+0.722
 captives+0.676
 painfully+0.657
Sense+0.651
Neg logit contributions
 Niet-1.031
ById-0.896
ashington-0.893
otle-0.879
plet-0.874
Token Clinton
Token ID2605
Feature activation+5.78e-01

Pos logit contributions
hift+2.881
 revol+2.513
 captives+2.500
Sense+2.409
 Syri+2.403
Neg logit contributions
 Niet-2.421
icro-2.068
SourceFile-2.050
ashington-2.045
ById-2.043
Token staff
Token ID3085
Feature activation+5.76e-01

Pos logit contributions
hift+2.727
 revol+2.384
 captives+2.259
 painfully+2.192
Sense+2.178
Neg logit contributions
 Niet-2.987
ById-2.570
ashington-2.565
otle-2.533
plet-2.510
Token are
Token ID389
Feature activation+9.45e-01

Pos logit contributions
hift+2.327
 revol+2.060
 captives+1.928
 painfully+1.873
 reckoned+1.825
Neg logit contributions
 Niet-3.334
ById-2.939
plet-2.896
ashington-2.890
icro-2.849
Token '
Token ID705
Feature activation+1.39e+00

Pos logit contributions
hift+4.348
 revol+3.872
 captives+3.686
 painfully+3.607
 reckoned+3.496
Neg logit contributions
 Niet-4.905
ById-4.270
otle-4.220
ashington-4.184
plet-4.169
Tokenbig
Token ID14261
Feature activation+7.86e-01

Pos logit contributions
hift+6.711
 revol+6.065
 captives+5.712
 painfully+5.537
Sense+5.371
Neg logit contributions
 Niet-6.821
otle-6.085
ById-6.059
SourceFile-5.957
icro-5.763
Tokengest
Token ID3495
Feature activation+1.75e+00

Pos logit contributions
hift+3.669
 revol+3.224
 captives+3.059
 painfully+2.962
 reckoned+2.915
Neg logit contributions
 Niet-4.069
ById-3.527
ashington-3.524
otle-3.486
plet-3.429
Token a
Token ID257
Feature activation-3.89e-04

Pos logit contributions
hift+6.856
 revol+6.053
 captives+5.977
 painfully+5.647
 comfortable+5.360
Neg logit contributions
 Niet-9.640
SourceFile-8.972
ashington-8.817
ゼウス-8.741
sterdam-8.568
Token--
Token ID438
Feature activation-8.89e-02

Pos logit contributions
 Niet+0.002
ById+0.002
ashington+0.002
otle+0.002
plet+0.002
Neg logit contributions
hift-0.002
 revol-0.002
 captives-0.002
 painfully-0.002
Sense-0.002
Tokenholes
Token ID28439
Feature activation+2.26e-01

Pos logit contributions
 Niet+0.422
ById+0.358
ashington+0.358
otle+0.354
plet+0.351
Neg logit contributions
hift-0.454
 revol-0.402
 captives-0.384
 painfully-0.374
Sense-0.370
Token in
Token ID287
Feature activation+2.33e-01

Pos logit contributions
hift+1.136
 revol+0.999
 captives+0.945
 painfully+0.919
Sense+0.907
Neg logit contributions
 Niet-1.115
ById-0.943
ashington-0.943
otle-0.932
plet-0.923
Token American
Token ID1605
Feature activation-5.44e-02

Pos logit contributions
hift+1.037
 revol+0.908
 captives+0.848
 painfully+0.826
Sense+0.819
Neg logit contributions
 Niet-1.264
ById-1.098
ashington-1.093
otle-1.076
plet-1.069
Token not
Token ID407
Feature activation+2.35e-01

Pos logit contributions
hift+2.533
 revol+2.205
 painfully+2.026
 captives+2.022
 comfortable+1.942
Neg logit contributions
 Niet-3.599
otle-3.091
ashington-3.068
ById-3.062
plet-3.012
Token a
Token ID257
Feature activation+3.13e-01

Pos logit contributions
hift+0.957
 revol+0.821
 captives+0.761
 painfully+0.748
 reckoned+0.721
Neg logit contributions
 Niet-1.403
ashington-1.193
otle-1.182
ById-1.179
plet-1.175
Token robot
Token ID9379
Feature activation+3.23e-01

Pos logit contributions
hift+1.127
 revol+0.964
 captives+0.882
 painfully+0.862
 reckoned+0.821
Neg logit contributions
 Niet-1.990
ById-1.741
ashington-1.729
otle-1.718
SourceFile-1.693
Token by
Token ID416
Feature activation+9.15e-01

Pos logit contributions
hift+1.398
 revol+1.256
 captives+1.178
 painfully+1.152
 reckoned+1.118
Neg logit contributions
 Niet-1.807
plet-1.512
otle-1.497
ashington-1.496
ById-1.495
Token clicking
Token ID12264
Feature activation+1.30e+00

Pos logit contributions
hift+3.003
 revol+2.477
 painfully+2.210
 captives+2.208
Sense+2.133
Neg logit contributions
 Niet-6.039
ById-5.320
ashington-5.308
otle-5.253
plet-5.222
Token the
Token ID262
Feature activation+1.75e+00

Pos logit contributions
hift+4.143
 revol+3.336
 captives+3.113
 painfully+3.053
Sense+3.030
Neg logit contributions
 Niet-8.496
ashington-7.228
ById-7.159
otle-7.155
plet-7.139
Token box
Token ID3091
Feature activation+1.14e+00

Pos logit contributions
hift+5.027
 revol+4.240
 painfully+3.965
 comfortable+3.914
 captives+3.901
Neg logit contributions
 Niet-11.611
otle-9.911
ゼウス-9.873
ovember-9.801
ashington-9.738
Token.
Token ID13
Feature activation+9.25e-01

Pos logit contributions
hift+3.715
 revol+2.896
 Syri+2.708
 captives+2.707
Sense+2.707
Neg logit contributions
 Niet-7.408
ById-6.311
plet-6.274
sterdam-6.264
ovember-6.240
Token Invalid
Token ID17665
Feature activation-1.72e-01

Pos logit contributions
hift+2.890
Sense+2.316
 revol+2.301
 Syri+2.143
 reckoned+2.132
Neg logit contributions
 Niet-5.938
plet-5.264
sterdam-5.262
otle-5.253
ovember-5.218
Token email
Token ID3053
Feature activation+4.73e-02

Pos logit contributions
 Niet+0.742
ById+0.612
ashington+0.610
otle+0.601
plet+0.593
Neg logit contributions
hift-0.966
 revol-0.867
 captives-0.830
 painfully-0.812
Sense-0.801
Token address
Token ID2209
Feature activation+6.38e-01

Pos logit contributions
hift+0.223
 revol+0.195
 captives+0.185
 painfully+0.183
Sense+0.179
Neg logit contributions
 Niet-0.246
ById-0.209
otle-0.207
ashington-0.206
plet-0.203
Token Clinton
Token ID2605
Feature activation+6.48e-01

Pos logit contributions
hift+3.040
 revol+2.639
 captives+2.635
Sense+2.535
 Syri+2.533
Neg logit contributions
 Niet-2.596
icro-2.224
ById-2.195
ashington-2.172
SourceFile-2.159
Token staff
Token ID3085
Feature activation+6.57e-01

Pos logit contributions
hift+3.020
 revol+2.643
 captives+2.502
 painfully+2.428
Sense+2.403
Neg logit contributions
 Niet-3.385
ById-2.919
ashington-2.910
otle-2.876
plet-2.850
Token are
Token ID389
Feature activation+9.83e-01

Pos logit contributions
hift+2.593
 revol+2.312
 captives+2.152
 painfully+2.090
 reckoned+2.033
Neg logit contributions
 Niet-3.851
ById-3.421
plet-3.367
ashington-3.350
icro-3.292
Token '
Token ID705
Feature activation+1.40e+00

Pos logit contributions
hift+4.582
 revol+4.069
 captives+3.881
 painfully+3.790
 reckoned+3.692
Neg logit contributions
 Niet-5.074
ById-4.397
otle-4.342
ashington-4.303
plet-4.291
Tokenbig
Token ID14261
Feature activation+7.88e-01

Pos logit contributions
hift+6.791
 revol+6.080
 captives+5.750
 painfully+5.570
Sense+5.442
Neg logit contributions
 Niet-6.920
ById-6.078
otle-6.075
SourceFile-5.973
icro-5.819
Tokengest
Token ID3495
Feature activation+1.72e+00

Pos logit contributions
hift+3.516
 revol+3.064
 captives+2.894
 painfully+2.802
 reckoned+2.756
Neg logit contributions
 Niet-4.258
ById-3.709
ashington-3.701
otle-3.661
plet-3.607
Token a
Token ID257
Feature activation-2.80e-02

Pos logit contributions
hift+6.926
 revol+5.983
 captives+5.908
 painfully+5.571
 servants+5.354
Neg logit contributions
 Niet-9.413
SourceFile-8.721
ashington-8.489
ゼウス-8.480
sterdam-8.269
Token--
Token ID438
Feature activation-3.31e-02

Pos logit contributions
 Niet+0.131
ById+0.112
ashington+0.111
otle+0.110
plet+0.109
Neg logit contributions
hift-0.145
 revol-0.128
 captives-0.123
 painfully-0.119
Sense-0.118
Tokenholes
Token ID28439
Feature activation+2.27e-01

Pos logit contributions
 Niet+0.154
ById+0.131
ashington+0.131
otle+0.129
plet+0.128
Neg logit contributions
hift-0.172
 revol-0.152
 captives-0.146
 painfully-0.142
Sense-0.140
Token in
Token ID287
Feature activation+1.94e-01

Pos logit contributions
hift+1.135
 revol+0.992
 captives+0.939
 painfully+0.913
 reckoned+0.899
Neg logit contributions
 Niet-1.142
ById-0.965
ashington-0.963
otle-0.951
SourceFile-0.945
Token American
Token ID1605
Feature activation-5.23e-02

Pos logit contributions
hift+0.869
 revol+0.759
 captives+0.713
 painfully+0.692
Sense+0.686
Neg logit contributions
 Niet-1.048
ById-0.909
ashington-0.905
otle-0.892
plet-0.886
Token not
Token ID407
Feature activation+2.79e-01

Pos logit contributions
hift+2.344
 revol+2.040
 captives+1.873
 painfully+1.873
Sense+1.799
Neg logit contributions
 Niet-3.265
otle-2.806
ashington-2.803
ById-2.777
plet-2.741
Token a
Token ID257
Feature activation+3.05e-01

Pos logit contributions
hift+1.132
 revol+0.970
 captives+0.894
 painfully+0.881
 reckoned+0.849
Neg logit contributions
 Niet-1.677
ashington-1.428
otle-1.415
ById-1.408
plet-1.406
Token robot
Token ID9379
Feature activation+3.62e-01

Pos logit contributions
hift+1.089
 revol+0.936
 captives+0.852
 painfully+0.836
 reckoned+0.795
Neg logit contributions
 Niet-1.937
ById-1.700
ashington-1.693
otle-1.677
SourceFile-1.653
Token by
Token ID416
Feature activation+8.69e-01

Pos logit contributions
hift+1.549
 revol+1.398
 captives+1.310
 painfully+1.281
 reckoned+1.244
Neg logit contributions
 Niet-2.039
plet-1.707
otle-1.691
ById-1.686
ashington-1.685
Token clicking
Token ID12264
Feature activation+1.31e+00

Pos logit contributions
hift+2.827
 revol+2.363
 painfully+2.089
 captives+2.065
Sense+2.000
Neg logit contributions
 Niet-5.763
ashington-5.089
ById-5.065
otle-5.038
plet-4.985
Token the
Token ID262
Feature activation+1.70e+00

Pos logit contributions
hift+4.122
 revol+3.329
 captives+3.089
 painfully+3.034
Sense+3.011
Neg logit contributions
 Niet-8.577
ashington-7.320
otle-7.225
ById-7.218
sterdam-7.184
Token box
Token ID3091
Feature activation+1.04e+00

Pos logit contributions
hift+4.873
 revol+4.154
 painfully+3.876
 comfortable+3.827
 captives+3.764
Neg logit contributions
 Niet-11.261
otle-9.616
ゼウス-9.523
ovember-9.515
ashington-9.473
Token.
Token ID13
Feature activation+9.21e-01

Pos logit contributions
hift+3.450
 revol+2.727
 captives+2.547
Sense+2.517
 Syri+2.515
Neg logit contributions
 Niet-6.701
ById-5.706
plet-5.670
sterdam-5.645
ashington-5.631
Token Invalid
Token ID17665
Feature activation-2.14e-01

Pos logit contributions
hift+2.875
Sense+2.325
 revol+2.298
 reckoned+2.145
 Syri+2.142
Neg logit contributions
 Niet-5.915
sterdam-5.246
otle-5.237
plet-5.222
ovember-5.170
Token email
Token ID3053
Feature activation-3.16e-02

Pos logit contributions
 Niet+0.932
ById+0.767
ashington+0.766
otle+0.754
plet+0.744
Neg logit contributions
hift-1.190
 revol-1.069
 captives-1.022
 painfully-1.001
Sense-0.986
Token address
Token ID2209
Feature activation+5.51e-01

Pos logit contributions
 Niet+0.162
ById+0.138
otle+0.137
ashington+0.136
plet+0.134
Neg logit contributions
hift-0.151
 revol-0.132
 captives-0.125
 painfully-0.124
Sense-0.121
Token.
Token ID13
Feature activation-6.24e-01

Pos logit contributions
 Niet+2.287
ashington+1.928
ById+1.926
otle+1.900
plet+1.876
Neg logit contributions
hift-2.592
 revol-2.298
 captives-2.194
 painfully-2.126
Sense-2.116
Tokenservice
Token ID15271
Feature activation-1.05e+00

Pos logit contributions
 Niet+3.755
ashington+3.293
ById+3.285
otle+3.265
plet+3.233
Neg logit contributions
hift-2.400
 revol-2.039
 captives-1.888
 painfully-1.825
Sense-1.794
Token
Token ID198
Feature activation-7.88e-02

Pos logit contributions
 Niet+4.643
ashington+3.866
ById+3.833
otle+3.808
plet+3.766
Neg logit contributions
hift-5.599
 revol-4.937
 captives-4.733
 reckoned-4.555
Sense-4.547
Token
Token ID198
Feature activation-7.56e-02

Pos logit contributions
 Niet+0.382
ashington+0.320
ById+0.319
plet+0.317
otle+0.314
Neg logit contributions
hift-0.401
 revol-0.350
 captives-0.334
Sense-0.330
 painfully-0.328
TokenYou
Token ID1639
Feature activation+1.56e-01

Pos logit contributions
 Niet+0.367
ById+0.311
ashington+0.311
otle+0.307
plet+0.304
Neg logit contributions
hift-0.380
 revol-0.335
 captives-0.319
Sense-0.312
 painfully-0.311
Token should
Token ID815
Feature activation+1.60e+00

Pos logit contributions
hift+0.748
 revol+0.662
 captives+0.622
 painfully+0.610
Sense+0.604
Neg logit contributions
 Niet-0.797
ashington-0.681
ById-0.678
otle-0.670
plet-0.667
Token notice
Token ID4003
Feature activation+1.27e+00

Pos logit contributions
hift+6.081
 revol+5.436
 painfully+5.015
Sense+4.806
 captives+4.724
Neg logit contributions
 Niet-9.209
ById-7.984
ashington-7.882
soType-7.611
plet-7.539
Token network
Token ID3127
Feature activation-1.84e-02

Pos logit contributions
hift+5.044
 revol+4.409
 painfully+4.082
 captives+3.950
Sense+3.914
Neg logit contributions
 Niet-7.165
ashington-6.369
ById-6.282
otle-6.217
plet-6.139
Token manager
Token ID4706
Feature activation+7.70e-02

Pos logit contributions
 Niet+0.102
ById+0.089
ashington+0.089
otle+0.088
plet+0.087
Neg logit contributions
hift-0.079
 revol-0.068
 captives-0.064
 painfully-0.062
Sense-0.062
Token drop
Token ID4268
Feature activation+1.56e-01

Pos logit contributions
hift+0.344
 revol+0.299
 captives+0.282
 painfully+0.274
Sense+0.272
Neg logit contributions
 Niet-0.416
ById-0.361
ashington-0.360
otle-0.356
plet-0.353
Token and
Token ID290
Feature activation+7.44e-02

Pos logit contributions
hift+0.704
 revol+0.612
 captives+0.578
 painfully+0.563
Sense+0.558
Neg logit contributions
 Niet-0.833
ashington-0.721
ById-0.720
otle-0.710
plet-0.701
Token by
Token ID416
Feature activation+4.62e-01

Pos logit contributions
hift+0.940
 revol+0.826
 captives+0.775
 painfully+0.752
Sense+0.749
Neg logit contributions
 Niet-1.075
ashington-0.934
ById-0.932
otle-0.919
plet-0.913
Token DNC
Token ID17824
Feature activation+6.89e-01

Pos logit contributions
hift+2.197
 revol+1.935
 captives+1.840
 painfully+1.781
Sense+1.764
Neg logit contributions
 Niet-2.352
ById-2.036
ashington-2.010
plet-1.977
otle-1.976
Token in
Token ID287
Feature activation+3.11e-01

Pos logit contributions
hift+3.252
 revol+2.851
 captives+2.707
 painfully+2.615
Sense+2.585
Neg logit contributions
 Niet-3.551
ById-3.058
ashington-3.039
otle-3.013
plet-2.999
Token 2020
Token ID12131
Feature activation+3.22e-01

Pos logit contributions
hift+1.171
 revol+0.968
 captives+0.906
 reckoned+0.869
 painfully+0.863
Neg logit contributions
 Niet-1.920
ashington-1.697
ById-1.694
otle-1.671
plet-1.655
Token after
Token ID706
Feature activation+1.15e+00

Pos logit contributions
hift+1.523
 revol+1.363
 captives+1.282
 painfully+1.253
Sense+1.252
Neg logit contributions
 Niet-1.630
ById-1.413
otle-1.393
ashington-1.384
plet-1.370
Token '
Token ID705
Feature activation+1.54e+00

Pos logit contributions
hift+4.562
 revol+4.331
 captives+4.045
 painfully+4.024
 Syri+3.822
Neg logit contributions
 Niet-6.300
plet-5.565
ashington-5.469
otle-5.463
icro-5.423
Tokennot
Token ID1662
Feature activation+1.40e-01

Pos logit contributions
hift+5.927
 revol+5.178
Sense+4.898
 captives+4.878
 painfully+4.709
Neg logit contributions
 Niet-8.962
otle-7.863
ById-7.796
ashington-7.675
icro-7.660
Token quite
Token ID2407
Feature activation-2.24e-01

Pos logit contributions
hift+0.661
 revol+0.578
 captives+0.551
 painfully+0.534
Sense+0.527
Neg logit contributions
 Niet-0.712
ById-0.618
ashington-0.617
otle-0.607
plet-0.605
Token even
Token ID772
Feature activation-5.46e-01

Pos logit contributions
 Niet+1.138
ashington+1.000
ById+0.984
plet+0.982
otle+0.973
Neg logit contributions
hift-1.056
 revol-0.907
 captives-0.858
 reckoned-0.833
 painfully-0.823
Token handed
Token ID10158
Feature activation+9.70e-02

Pos logit contributions
 Niet+2.319
ashington+1.999
ById+1.961
otle+1.954
plet+1.936
Neg logit contributions
hift-3.003
 revol-2.647
 captives-2.556
 reckoned-2.496
Sense-2.474
Token'
Token ID6
Feature activation+1.65e-01

Pos logit contributions
hift+0.515
 revol+0.459
 captives+0.438
 reckoned+0.426
 painfully+0.425
Neg logit contributions
 Niet-0.434
ById-0.370
ashington-0.369
otle-0.367
plet-0.364
Token by
Token ID416
Feature activation+3.68e-01

Pos logit contributions
hift+0.930
 revol+0.820
 captives+0.774
 painfully+0.751
Sense+0.749
Neg logit contributions
 Niet-0.997
ById-0.862
ashington-0.862
otle-0.850
plet-0.843
Token DNC
Token ID17824
Feature activation+7.82e-01

Pos logit contributions
hift+1.719
 revol+1.505
 captives+1.429
 painfully+1.384
Sense+1.372
Neg logit contributions
 Niet-1.911
ById-1.654
ashington-1.642
otle-1.617
plet-1.611
Token in
Token ID287
Feature activation+2.58e-01

Pos logit contributions
hift+3.634
 revol+3.191
 captives+3.045
 painfully+2.919
Sense+2.871
Neg logit contributions
 Niet-4.078
ById-3.518
ashington-3.479
plet-3.460
otle-3.456
Token 2020
Token ID12131
Feature activation+2.21e-01

Pos logit contributions
hift+0.841
 revol+0.669
 captives+0.614
 reckoned+0.587
 painfully+0.580
Neg logit contributions
 Niet-1.728
ashington-1.545
ById-1.540
otle-1.518
plet-1.506
Token after
Token ID706
Feature activation+1.06e+00

Pos logit contributions
hift+1.044
 revol+0.932
 captives+0.878
 painfully+0.858
Sense+0.855
Neg logit contributions
 Niet-1.127
ById-0.974
otle-0.960
ashington-0.956
plet-0.946
Token '
Token ID705
Feature activation+1.52e+00

Pos logit contributions
hift+4.385
 revol+4.124
 captives+3.876
 painfully+3.836
 Syri+3.660
Neg logit contributions
 Niet-5.660
plet-5.023
ashington-4.951
otle-4.896
ById-4.878
Tokennot
Token ID1662
Feature activation+1.06e-01

Pos logit contributions
hift+5.717
 revol+5.043
Sense+4.754
 captives+4.733
 painfully+4.556
Neg logit contributions
 Niet-8.992
otle-7.877
ById-7.842
icro-7.720
ashington-7.714
Token quite
Token ID2407
Feature activation-3.19e-01

Pos logit contributions
hift+0.509
 revol+0.446
 captives+0.425
 painfully+0.412
Sense+0.408
Neg logit contributions
 Niet-0.532
ById-0.460
ashington-0.458
otle-0.452
plet-0.450
Token even
Token ID772
Feature activation-5.51e-01

Pos logit contributions
 Niet+1.585
ashington+1.390
plet+1.367
ById+1.362
otle+1.351
Neg logit contributions
hift-1.532
 revol-1.317
 captives-1.242
 reckoned-1.209
 painfully-1.193
Token handed
Token ID10158
Feature activation+1.17e-01

Pos logit contributions
 Niet+2.382
ashington+2.040
ById+1.996
otle+1.995
plet+1.980
Neg logit contributions
hift-3.014
 revol-2.649
 captives-2.557
 reckoned-2.499
Sense-2.474
Token'
Token ID6
Feature activation+8.21e-02

Pos logit contributions
hift+0.620
 revol+0.553
 captives+0.527
 reckoned+0.513
 painfully+0.513
Neg logit contributions
 Niet-0.522
ById-0.443
ashington-0.443
otle-0.440
plet-0.437
Token after
Token ID706
Feature activation+3.84e-01

Pos logit contributions
hift+1.457
 revol+1.274
 captives+1.200
 painfully+1.174
 reckoned+1.152
Neg logit contributions
 Niet-1.768
ashington-1.539
ById-1.538
otle-1.524
plet-1.508
Token three
Token ID1115
Feature activation+1.05e-01

Pos logit contributions
hift+1.696
 revol+1.498
 captives+1.406
 painfully+1.379
 reckoned+1.340
Neg logit contributions
 Niet-2.078
ById-1.828
ashington-1.819
otle-1.806
plet-1.775
Token decades
Token ID4647
Feature activation+1.89e-02

Pos logit contributions
hift+0.540
 revol+0.482
 captives+0.460
 painfully+0.449
 reckoned+0.440
Neg logit contributions
 Niet-0.493
ById-0.420
ashington-0.418
otle-0.413
plet-0.406
Token at
Token ID379
Feature activation+4.93e-02

Pos logit contributions
hift+0.084
 revol+0.073
 captives+0.069
 painfully+0.067
Sense+0.066
Neg logit contributions
 Niet-0.103
ById-0.089
ashington-0.089
otle-0.088
plet-0.087
Token Q
Token ID1195
Feature activation+1.62e-01

Pos logit contributions
hift+0.228
 revol+0.199
 captives+0.188
 painfully+0.183
Sense+0.180
Neg logit contributions
 Niet-0.259
ById-0.225
ashington-0.224
otle-0.221
plet-0.219
Tokenant
Token ID415
Feature activation+1.51e+00

Pos logit contributions
hift+0.808
 revol+0.712
 captives+0.676
 painfully+0.659
Sense+0.656
Neg logit contributions
 Niet-0.796
ById-0.679
ashington-0.676
otle-0.668
plet-0.661
Tokenas
Token ID292
Feature activation+5.78e-01

Pos logit contributions
hift+5.493
 revol+4.620
Sense+4.068
 painfully+4.008
 captives+3.994
Neg logit contributions
 Niet-9.056
ById-8.216
ashington-7.859
SourceFile-7.824
ゼウス-7.642
Token.
Token ID13
Feature activation-2.73e-01

Pos logit contributions
hift+2.575
 revol+2.239
 captives+2.111
 painfully+2.041
 reckoned+2.018
Neg logit contributions
 Niet-3.131
ById-2.722
ashington-2.709
otle-2.677
plet-2.667
Token "
Token ID366
Feature activation-3.71e-01

Pos logit contributions
 Niet+1.246
ById+1.052
ashington+1.048
otle+1.034
plet+1.027
Neg logit contributions
hift-1.450
 revol-1.287
 captives-1.228
 painfully-1.197
Sense-1.194
TokenThe
Token ID464
Feature activation+9.34e-02

Pos logit contributions
 Niet+1.775
ById+1.518
ashington+1.508
otle+1.485
plet+1.468
Neg logit contributions
hift-1.889
 revol-1.679
 captives-1.595
 painfully-1.550
 reckoned-1.532
Token cards
Token ID4116
Feature activation+1.27e-01

Pos logit contributions
hift+0.416
 revol+0.363
 captives+0.343
 painfully+0.333
Sense+0.327
Neg logit contributions
 Niet-0.505
ById-0.441
ashington-0.441
otle-0.434
plet-0.430
Token<|endoftext|>
Token ID50256
Feature activation-3.12e-01

Pos logit contributions
 Niet+1.813
ById+1.519
ashington+1.516
otle+1.495
plet+1.477
Neg logit contributions
hift-2.205
 revol-1.965
 captives-1.878
 painfully-1.831
Sense-1.818
TokenImage
Token ID5159
Feature activation-1.01e+00

Pos logit contributions
 Niet+1.552
ById+1.334
ashington+1.329
otle+1.311
plet+1.298
Neg logit contributions
hift-1.525
 revol-1.345
 captives-1.275
 painfully-1.239
 reckoned-1.227
Token copyright
Token ID6634
Feature activation+5.44e-01

Pos logit contributions
 Niet+3.717
ById+3.022
ashington+2.950
otle+2.915
/​+2.893
Neg logit contributions
hift-6.101
 revol-5.489
 captives-5.280
 painfully-5.170
 reckoned-5.148
Token ESA
Token ID42212
Feature activation-2.39e-02

Pos logit contributions
hift+2.864
 revol+2.577
 captives+2.454
 painfully+2.385
 reckoned+2.376
Neg logit contributions
 Niet-2.412
ById-2.061
ashington-2.061
plet-2.049
otle-2.016
Token Image
Token ID7412
Feature activation-9.18e-02

Pos logit contributions
 Niet+0.116
ById+0.098
ashington+0.098
otle+0.097
plet+0.096
Neg logit contributions
hift-0.121
 revol-0.107
 captives-0.102
 painfully-0.099
Sense-0.098
Token caption
Token ID8305
Feature activation+1.50e+00

Pos logit contributions
 Niet+0.338
plet+0.260
ById+0.259
ashington+0.259
otle+0.258
Neg logit contributions
hift-0.581
 revol-0.523
 captives-0.512
 painfully-0.499
Sense-0.495
Token Go
Token ID1514
Feature activation+2.32e-02

Pos logit contributions
hift+6.663
 Syri+5.837
 revol+5.710
Sense+5.657
 captives+5.572
Neg logit contributions
 Niet-6.937
ashington-6.321
plet-6.311
ById-6.287
SourceFile-6.153
Tokence
Token ID344
Feature activation+2.54e-01

Pos logit contributions
hift+0.117
 revol+0.102
 captives+0.098
 painfully+0.095
Sense+0.094
Neg logit contributions
 Niet-0.113
ashington-0.096
ById-0.096
otle-0.094
plet-0.094
Token uses
Token ID3544
Feature activation+5.41e-01

Pos logit contributions
hift+1.216
 revol+1.072
 captives+1.015
 painfully+0.981
 reckoned+0.979
Neg logit contributions
 Niet-1.288
ById-1.119
ashington-1.115
otle-1.091
plet-1.077
Token an
Token ID281
Feature activation+7.29e-02

Pos logit contributions
hift+2.409
 revol+2.113
 captives+1.999
 painfully+1.933
 reckoned+1.875
Neg logit contributions
 Niet-2.893
ashington-2.550
ById-2.531
otle-2.508
plet-2.487
Token electric
Token ID5186
Feature activation+1.96e-01

Pos logit contributions
hift+0.332
 revol+0.289
 captives+0.273
 painfully+0.266
 reckoned+0.261
Neg logit contributions
 Niet-0.389
ashington-0.337
ById-0.333
plet-0.333
otle-0.333
Token football
Token ID4346
Feature activation-1.45e-01

Pos logit contributions
hift+0.217
 revol+0.188
 captives+0.176
 painfully+0.171
 reckoned+0.169
Neg logit contributions
 Niet-0.289
ById-0.252
ashington-0.251
otle-0.249
plet-0.247
Token fans
Token ID3296
Feature activation-1.47e-01

Pos logit contributions
 Niet+0.823
ById+0.718
ashington+0.717
otle+0.709
plet+0.702
Neg logit contributions
hift-0.607
 revol-0.520
 captives-0.489
 painfully-0.472
Sense-0.469
Token quickly
Token ID2952
Feature activation-3.70e-01

Pos logit contributions
 Niet+0.839
ById+0.732
ashington+0.731
otle+0.723
plet+0.717
Neg logit contributions
hift-0.616
 revol-0.529
 captives-0.497
 painfully-0.480
Sense-0.477
Token turned
Token ID2900
Feature activation-3.97e-01

Pos logit contributions
 Niet+2.166
ById+1.899
ashington+1.898
otle+1.877
plet+1.861
Neg logit contributions
hift-1.483
 revol-1.253
 captives-1.182
 painfully-1.136
Sense-1.135
Token their
Token ID511
Feature activation+4.69e-01

Pos logit contributions
 Niet+2.201
ById+1.913
ashington+1.910
otle+1.889
plet+1.872
Neg logit contributions
hift-1.722
 revol-1.483
 captives-1.400
 painfully-1.352
Sense-1.347
Token heart
Token ID2612
Feature activation+1.47e+00

Pos logit contributions
hift+2.087
 revol+1.938
 captives+1.799
 painfully+1.754
Sense+1.683
Neg logit contributions
 Niet-2.450
ashington-2.140
ById-2.133
otle-2.127
plet-2.111
Tokenache
Token ID4891
Feature activation+6.50e-01

Pos logit contributions
hift+6.036
 revol+5.672
 captives+5.311
 painfully+5.207
Sense+4.995
Neg logit contributions
 Niet-7.746
ashington-6.697
ById-6.643
plet-6.568
icro-6.523
Token into
Token ID656
Feature activation+1.10e-01

Pos logit contributions
hift+2.550
 revol+2.262
 captives+2.104
 painfully+2.062
 reckoned+1.981
Neg logit contributions
 Niet-3.820
ashington-3.332
ById-3.313
otle-3.270
SourceFile-3.258
Token an
Token ID281
Feature activation+1.10e-01

Pos logit contributions
hift+0.484
 revol+0.427
 captives+0.399
 painfully+0.390
 reckoned+0.381
Neg logit contributions
 Niet-0.604
ashington-0.527
ById-0.524
otle-0.520
plet-0.514
Token unhealthy
Token ID27942
Feature activation-2.92e-01

Pos logit contributions
hift+0.499
 revol+0.441
 captives+0.412
 painfully+0.406
 reckoned+0.398
Neg logit contributions
 Niet-0.586
ashington-0.504
otle-0.501
ById-0.501
plet-0.500
Token devotion
Token ID25808
Feature activation+4.45e-01

Pos logit contributions
 Niet+1.549
ById+1.339
ashington+1.338
otle+1.322
plet+1.311
Neg logit contributions
hift-1.336
 revol-1.166
 captives-1.100
 painfully-1.068
Sense-1.057
Token (
Token ID357
Feature activation+3.73e-01

Pos logit contributions
hift+2.873
 revol+2.505
Sense+2.373
 captives+2.361
 reckoned+2.326
Neg logit contributions
 Niet-3.587
plet-3.062
ashington-3.037
otle-2.984
ById-2.945
Token in
Token ID287
Feature activation+2.62e-01

Pos logit contributions
hift+1.436
 revol+1.242
 captives+1.156
 painfully+1.118
 reckoned+1.089
Neg logit contributions
 Niet-2.262
ById-1.973
ashington-1.960
otle-1.943
plet-1.939
Tokenout
Token ID448
Feature activation+4.19e-01

Pos logit contributions
hift+1.038
 revol+0.871
 captives+0.818
 painfully+0.786
 reckoned+0.779
Neg logit contributions
 Niet-1.583
ById-1.382
ashington-1.354
otle-1.345
plet-1.337
Token app
Token ID598
Feature activation+2.84e-01

Pos logit contributions
hift+1.897
 revol+1.685
 captives+1.569
 painfully+1.563
 reckoned+1.516
Neg logit contributions
 Niet-2.272
ashington-1.965
ById-1.950
plet-1.898
otle-1.891
Tokendata
Token ID7890
Feature activation+5.16e-01

Pos logit contributions
hift+1.372
 revol+1.179
 captives+1.148
 painfully+1.126
 reckoned+1.106
Neg logit contributions
 Niet-1.490
ById-1.200
ashington-1.196
otle-1.166
SourceFile-1.162
Token _
Token ID4808
Feature activation+1.46e+00

Pos logit contributions
hift+1.834
 revol+1.571
 captives+1.455
 painfully+1.392
 reckoned+1.374
Neg logit contributions
 Niet-3.223
ashington-2.862
ById-2.795
plet-2.784
icro-2.754
Token full
Token ID1336
Feature activation+6.27e-01

Pos logit contributions
hift+2.469
 revol+1.879
 captives+1.688
 painfully+1.554
 reckoned+1.397
Neg logit contributions
 Niet-11.553
ashington-10.614
ById-10.468
otle-10.300
SourceFile-10.245
Token v
Token ID410
Feature activation+1.29e+00

Pos logit contributions
hift+2.207
 revol+1.939
 captives+1.764
 painfully+1.694
 reckoned+1.663
Neg logit contributions
 Niet-3.956
ashington-3.489
otle-3.413
ById-3.408
icro-3.399
Token )
Token ID1267
Feature activation+1.26e+00

Pos logit contributions
hift+5.288
 revol+4.676
 captives+4.363
 painfully+4.226
Sense+4.114
Neg logit contributions
 Niet-7.093
ashington-5.912
ById-5.889
SourceFile-5.859
otle-5.844
Token {
Token ID1391
Feature activation+5.55e-01

Pos logit contributions
hift+5.269
 revol+4.600
Sense+4.299
 captives+4.298
 painfully+4.247
Neg logit contributions
 Niet-6.988
plet-5.803
ashington-5.659
ById-5.609
azo-5.591
Token //
Token ID3373
Feature activation+8.38e-01

Pos logit contributions
hift+2.145
 revol+1.826
 captives+1.699
 painfully+1.637
 reckoned+1.619
Neg logit contributions
 Niet-3.339
ById-2.937
ashington-2.923
otle-2.896
plet-2.876
Token,
Token ID11
Feature activation-1.50e-01

Pos logit contributions
 Niet+3.196
ashington+2.740
ById+2.737
otle+2.725
plet+2.680
Neg logit contributions
hift-3.125
 revol-2.722
 captives-2.595
Sense-2.515
 reckoned-2.509
Token Black
Token ID2619
Feature activation-1.72e-02

Pos logit contributions
 Niet+0.786
ById+0.678
ashington+0.676
otle+0.667
plet+0.662
Neg logit contributions
hift-0.700
 revol-0.611
 captives-0.579
 painfully-0.562
Sense-0.557
Token is
Token ID318
Feature activation-1.09e-01

Pos logit contributions
 Niet+0.089
ById+0.077
ashington+0.076
otle+0.075
plet+0.075
Neg logit contributions
hift-0.081
 revol-0.072
 captives-0.068
 painfully-0.066
Sense-0.066
Token saying
Token ID2282
Feature activation-4.73e-01

Pos logit contributions
 Niet+0.615
ById+0.538
ashington+0.537
otle+0.530
plet+0.526
Neg logit contributions
hift-0.459
 revol-0.396
 captives-0.372
 painfully-0.360
Sense-0.355
Token
Token ID10545
Feature activation+4.10e-01

Pos logit contributions
 Niet+2.517
ById+2.166
ashington+2.150
otle+2.143
plet+2.115
Neg logit contributions
hift-2.162
 revol-1.873
 captives-1.775
 painfully-1.710
Sense-1.709
Token
Token ID230
Feature activation+1.46e+00

Pos logit contributions
hift+1.681
 revol+1.431
 captives+1.341
Sense+1.308
 painfully+1.292
Neg logit contributions
 Niet-2.367
ashington-2.075
ById-2.054
otle-2.038
plet-2.027
Token
Token ID239
Feature activation+1.04e+00

Pos logit contributions
hift+5.921
 revol+4.731
 captives+4.437
 painfully+4.339
Sense+4.299
Neg logit contributions
 Niet-8.459
ashington-7.359
ById-7.266
otle-7.172
plet-7.126
Token
Token ID46349
Feature activation+2.30e-01

Pos logit contributions
hift+4.664
 revol+4.054
 captives+3.782
Sense+3.731
 reckoned+3.728
Neg logit contributions
 Niet-5.436
ashington-4.715
ById-4.653
otle-4.608
plet-4.557
Token
Token ID111
Feature activation+7.65e-01

Pos logit contributions
hift+1.009
 revol+0.871
 captives+0.824
 painfully+0.795
Sense+0.790
Neg logit contributions
 Niet-1.267
ashington-1.101
ById-1.096
otle-1.084
plet-1.073
Token
Token ID17358
Feature activation+2.80e-01

Pos logit contributions
hift+3.473
 revol+3.027
 captives+2.859
 painfully+2.802
Sense+2.794
Neg logit contributions
 Niet-3.971
ashington-3.481
otle-3.414
ById-3.409
plet-3.379
Token
Token ID223
Feature activation+2.02e-01

Pos logit contributions
hift+0.998
 revol+0.828
 captives+0.770
 painfully+0.736
Sense+0.733
Neg logit contributions
 Niet-1.772
ashington-1.568
ById-1.562
otle-1.545
plet-1.540
Token August
Token ID2932
Feature activation+6.65e-01

Pos logit contributions
hift+1.345
 revol+1.213
 captives+1.153
 painfully+1.143
Sense+1.134
Neg logit contributions
 Niet-0.971
ById-0.820
ashington-0.820
plet-0.811
otle-0.800
Token 2011
Token ID2813
Feature activation+7.72e-01

Pos logit contributions
hift+2.731
 revol+2.344
 reckoned+2.159
 Syri+2.153
 captives+2.145
Neg logit contributions
 Niet-3.725
plet-3.291
ById-3.277
ashington-3.263
otle-3.244
Token United
Token ID1578
Feature activation+1.13e-01

Pos logit contributions
hift+3.457
 revol+3.000
Sense+2.774
 captives+2.773
 Syri+2.763
Neg logit contributions
 Niet-3.945
ById-3.498
ashington-3.488
plet-3.486
otle-3.439
Token States
Token ID1829
Feature activation+7.23e-01

Pos logit contributions
hift+0.558
 revol+0.491
 captives+0.466
 painfully+0.453
Sense+0.450
Neg logit contributions
 Niet-0.560
ById-0.478
ashington-0.477
otle-0.471
plet-0.466
Token 6
Token ID718
Feature activation+3.25e-01

Pos logit contributions
hift+3.418
 revol+2.958
 captives+2.767
 reckoned+2.745
Sense+2.731
Neg logit contributions
 Niet-3.543
ById-3.149
ashington-3.132
plet-3.131
otle-3.071
Token12
Token ID1065
Feature activation+1.45e+00

Pos logit contributions
hift+1.448
 revol+1.242
 captives+1.172
Sense+1.143
 painfully+1.139
Neg logit contributions
 Niet-1.755
ById-1.517
ashington-1.511
plet-1.501
otle-1.494
Token Posts
Token ID12043
Feature activation-4.41e-01

Pos logit contributions
hift+6.395
Sense+5.334
 revol+5.321
 reckoned+5.170
 Syri+5.065
Neg logit contributions
 Niet-7.230
plet-6.398
ashington-6.247
otle-6.204
sterdam-6.087
Token #
Token ID1303
Feature activation+4.45e-02

Pos logit contributions
 Niet+1.543
ById+1.216
ashington+1.202
otle+1.189
plet+1.165
Neg logit contributions
hift-2.812
 revol-2.548
 captives-2.468
Sense-2.402
 painfully-2.398
Token17
Token ID1558
Feature activation+7.70e-01

Pos logit contributions
hift+0.178
 revol+0.150
 captives+0.140
Sense+0.136
 painfully+0.135
Neg logit contributions
 Niet-0.265
ById-0.230
ashington-0.229
otle-0.228
plet-0.227
Token Good
Token ID4599
Feature activation+3.20e-01

Pos logit contributions
hift+3.816
 revol+3.377
 captives+3.186
 painfully+3.169
 reckoned+3.160
Neg logit contributions
 Niet-3.581
ById-3.170
ashington-3.155
plet-3.083
otle-3.082
Token to
Token ID284
Feature activation+4.81e-01

Pos logit contributions
hift+1.420
 revol+1.230
 captives+1.155
 painfully+1.127
Sense+1.118
Neg logit contributions
 Niet-1.746
ById-1.523
ashington-1.511
otle-1.493
plet-1.479
Token Caption
Token ID11260
Feature activation+5.06e-02

Pos logit contributions
 Niet+4.087
ashington+3.511
ById+3.496
otle+3.419
plet+3.416
Neg logit contributions
hift-3.512
 revol-3.059
 captives-2.897
 painfully-2.802
 reckoned-2.795
Token 4
Token ID604
Feature activation+6.30e-01

Pos logit contributions
hift+0.245
 revol+0.214
 captives+0.201
Sense+0.200
 painfully+0.198
Neg logit contributions
 Niet-0.252
ashington-0.217
ById-0.217
plet-0.216
otle-0.215
Token of
Token ID286
Feature activation+5.78e-01

Pos logit contributions
hift+2.564
 revol+2.088
 captives+2.009
 painfully+1.963
Sense+1.937
Neg logit contributions
 Niet-3.695
ById-3.167
plet-3.147
ashington-3.141
otle-3.115
Token 15
Token ID1315
Feature activation+5.20e-01

Pos logit contributions
hift+2.637
 revol+2.282
 captives+2.152
Sense+2.136
 painfully+2.104
Neg logit contributions
 Niet-2.938
ashington-2.631
plet-2.602
ById-2.596
otle-2.576
Token Photos
Token ID9434
Feature activation-2.74e-02

Pos logit contributions
hift+1.737
 revol+1.443
 captives+1.331
 painfully+1.275
Sense+1.251
Neg logit contributions
 Niet-3.343
ashington-2.990
ById-2.986
otle-2.964
plet-2.955
Token:
Token ID25
Feature activation+1.43e+00

Pos logit contributions
 Niet+0.161
ById+0.139
ashington+0.139
plet+0.138
otle+0.138
Neg logit contributions
hift-0.112
 revol-0.095
 captives-0.089
Sense-0.087
 painfully-0.086
Token Carly
Token ID35982
Feature activation+5.75e-02

Pos logit contributions
hift+6.124
 revol+5.330
 Syri+5.072
Sense+5.043
 captives+4.944
Neg logit contributions
 Niet-7.026
plet-6.375
sterdam-6.350
ById-6.311
icro-6.280
Token Fiorina
Token ID49028
Feature activation-4.68e-01

Pos logit contributions
hift+0.207
 revol+0.174
 captives+0.161
 painfully+0.154
Sense+0.152
Neg logit contributions
 Niet-0.361
ById-0.321
ashington-0.320
otle-0.315
plet-0.314
Token's
Token ID338
Feature activation-9.97e-02

Pos logit contributions
 Niet+2.381
ashington+2.031
ById+2.021
otle+2.016
plet+1.985
Neg logit contributions
hift-2.243
 revol-1.934
 captives-1.839
Sense-1.800
 painfully-1.777
Token political
Token ID1964
Feature activation+2.82e-01

Pos logit contributions
 Niet+0.545
ById+0.472
ashington+0.472
otle+0.466
plet+0.462
Neg logit contributions
hift-0.441
 revol-0.381
 captives-0.360
 painfully-0.348
Sense-0.346
Token career
Token ID3451
Feature activation-1.48e-03

Pos logit contributions
hift+1.289
 revol+1.144
 captives+1.077
 painfully+1.039
Sense+1.023
Neg logit contributions
 Niet-1.502
ById-1.298
ashington-1.284
otle-1.273
plet-1.264
Token served
Token ID4983
Feature activation-4.11e-01

Pos logit contributions
 Niet+0.353
ById+0.309
ashington+0.307
otle+0.303
plet+0.302
Neg logit contributions
hift-0.257
 revol-0.224
 captives-0.209
 painfully-0.204
 reckoned-0.201
Token as
Token ID355
Feature activation-3.65e-01

Pos logit contributions
 Niet+2.274
ashington+1.972
ById+1.970
otle+1.952
plet+1.924
Neg logit contributions
hift-1.769
 revol-1.517
 captives-1.431
Sense-1.394
 painfully-1.378
Token the
Token ID262
Feature activation-3.16e-01

Pos logit contributions
 Niet+1.993
ashington+1.727
ById+1.725
otle+1.706
plet+1.686
Neg logit contributions
hift-1.607
 revol-1.391
 captives-1.313
 painfully-1.268
Sense-1.267
Token US
Token ID1294
Feature activation-2.86e-02

Pos logit contributions
 Niet+1.695
ById+1.468
ashington+1.464
otle+1.447
plet+1.434
Neg logit contributions
hift-1.426
 revol-1.240
 captives-1.171
 painfully-1.135
Sense-1.126
Token em
Token ID795
Feature activation-3.16e-01

Pos logit contributions
 Niet+0.161
ById+0.141
ashington+0.139
otle+0.138
plet+0.137
Neg logit contributions
hift-0.123
 revol-0.106
 captives-0.100
 painfully-0.097
Sense-0.096
Tokeniss
Token ID747
Feature activation-1.73e+00

Pos logit contributions
 Niet+1.603
ById+1.377
ashington+1.357
otle+1.345
plet+1.328
Neg logit contributions
hift-1.531
 revol-1.346
 captives-1.266
 painfully-1.236
Sense-1.229
Tokenary
Token ID560
Feature activation-4.08e-01

Pos logit contributions
 Niet+7.588
ashington+6.702
otle+6.594
icro+6.500
ById+6.399
Neg logit contributions
hift-8.604
 revol-7.600
 captives-7.533
Sense-7.112
 painfully-7.063
Token to
Token ID284
Feature activation-5.44e-02

Pos logit contributions
 Niet+2.153
ashington+1.859
ById+1.856
otle+1.837
plet+1.815
Neg logit contributions
hift-1.871
 revol-1.626
 captives-1.538
 painfully-1.490
Sense-1.489
Token the
Token ID262
Feature activation+1.70e-01

Pos logit contributions
 Niet+0.309
ById+0.271
ashington+0.270
otle+0.267
plet+0.265
Neg logit contributions
hift-0.229
 revol-0.198
 captives-0.186
 painfully-0.179
 reckoned-0.176
Token induction
Token ID28471
Feature activation-3.31e-01

Pos logit contributions
hift+0.786
 revol+0.702
 captives+0.661
 painfully+0.639
Sense+0.620
Neg logit contributions
 Niet-0.882
ById-0.773
ashington-0.771
otle-0.756
plet-0.755
Token of
Token ID286
Feature activation-3.17e-01

Pos logit contributions
 Niet+1.805
ashington+1.562
ById+1.559
otle+1.543
plet+1.527
Neg logit contributions
hift-1.470
 revol-1.266
 captives-1.193
 painfully-1.156
Sense-1.155
Token Leadership
Token ID26935
Feature activation+2.22e-01

Pos logit contributions
hift+2.813
 revol+2.453
 captives+2.328
 painfully+2.233
Sense+2.228
Neg logit contributions
 Niet-3.254
ById-2.801
ashington-2.790
otle-2.768
plet-2.760
Token.
Token ID13
Feature activation+2.51e-01

Pos logit contributions
hift+1.048
 revol+0.919
 captives+0.871
Sense+0.845
 painfully+0.845
Neg logit contributions
 Niet-1.142
ById-0.980
ashington-0.974
otle-0.964
plet-0.962
TokenEND
Token ID10619
Feature activation+1.55e-02

Pos logit contributions
hift+1.262
 revol+1.109
 captives+1.056
Sense+1.044
 painfully+1.027
Neg logit contributions
 Niet-1.213
ById-1.030
ashington-1.024
plet-1.016
otle-1.016
Token OF
Token ID3963
Feature activation-1.04e-01

Pos logit contributions
hift+0.081
 revol+0.071
 captives+0.068
 painfully+0.066
Sense+0.066
Neg logit contributions
 Niet-0.073
ById-0.062
ashington-0.062
otle-0.061
plet-0.060
Token TRAN
Token ID48213
Feature activation+1.35e-01

Pos logit contributions
 Niet+0.526
ById+0.452
ashington+0.451
otle+0.444
plet+0.441
Neg logit contributions
hift-0.504
 revol-0.443
 captives-0.419
 painfully-0.408
Sense-0.406
TokenSM
Token ID12310
Feature activation-1.61e+00

Pos logit contributions
hift+0.627
 revol+0.545
 captives+0.517
Sense+0.506
 painfully+0.504
Neg logit contributions
 Niet-0.709
ById-0.612
ashington-0.605
otle-0.602
plet-0.597
TokenISSION
Token ID40373
Feature activation-1.50e-01

Pos logit contributions
 Niet+5.697
ashington+4.632
otle+4.569
ById+4.565
/​+4.496
Neg logit contributions
hift-9.717
 revol-8.758
 captives-8.510
 painfully-8.228
 reckoned-8.113
Token ///
Token ID34013
Feature activation-9.94e-02

Pos logit contributions
 Niet+0.738
ById+0.630
ashington+0.628
otle+0.620
plet+0.615
Neg logit contributions
hift-0.741
 revol-0.652
 captives-0.619
 painfully-0.602
Sense-0.599
Token----------------------------------------------------------------
Token ID10097
Feature activation-2.13e-01

Pos logit contributions
 Niet+0.483
ById+0.411
ashington+0.410
otle+0.404
plet+0.401
Neg logit contributions
hift-0.500
 revol-0.441
 captives-0.419
 painfully-0.408
Sense-0.407
Token----------------------------------------------------------------
Token ID10097
Feature activation-1.38e-01

Pos logit contributions
 Niet+1.064
ById+0.911
ashington+0.909
otle+0.898
plet+0.889
Neg logit contributions
hift-1.040
 revol-0.914
 captives-0.868
 painfully-0.843
Sense-0.837
Token------------------------
Token ID22369
Feature activation+2.38e-01

Pos logit contributions
 Niet+0.696
ById+0.596
ashington+0.595
otle+0.587
plet+0.581
Neg logit contributions
hift-0.671
 revol-0.589
 captives-0.559
 painfully-0.543
Sense-0.540
Token-
Token ID12
Feature activation+1.05e-02

Pos logit contributions
 Niet+0.319
ById+0.269
ashington+0.268
otle+0.264
plet+0.262
Neg logit contributions
hift-0.373
 revol-0.331
 captives-0.316
 painfully-0.307
Sense-0.307
Tokensw
Token ID2032
Feature activation-3.89e-02

Pos logit contributions
hift+0.047
 revol+0.041
 captives+0.039
 painfully+0.038
Sense+0.038
Neg logit contributions
 Niet-0.056
ById-0.049
ashington-0.048
otle-0.048
plet-0.047
Tokenorn
Token ID1211
Feature activation-3.96e-01

Pos logit contributions
 Niet+0.193
ById+0.165
ashington+0.164
otle+0.162
plet+0.161
Neg logit contributions
hift-0.192
 revol-0.168
 captives-0.160
 painfully-0.155
Sense-0.155
Token-
Token ID12
Feature activation-3.87e-01

Pos logit contributions
 Niet+1.977
ashington+1.694
ById+1.684
otle+1.671
plet+1.646
Neg logit contributions
hift-1.924
 revol-1.691
 captives-1.607
 painfully-1.557
Sense-1.552
Tokenen
Token ID268
Feature activation+9.78e-02

Pos logit contributions
 Niet+2.014
ashington+1.750
ById+1.744
otle+1.717
plet+1.717
Neg logit contributions
hift-1.784
 revol-1.568
 captives-1.473
 painfully-1.434
 reckoned-1.420
Tokenm
Token ID76
Feature activation-1.61e+00

Pos logit contributions
hift+0.410
 revol+0.351
 captives+0.329
 painfully+0.319
Sense+0.316
Neg logit contributions
 Niet-0.559
ById-0.489
ashington-0.486
otle-0.481
plet-0.477
Tokenity
Token ID414
Feature activation-3.44e-01

Pos logit contributions
 Niet+7.220
ashington+6.599
ById+6.502
plet+6.492
otle+6.436
Neg logit contributions
hift-7.622
 revol-6.642
 captives-6.416
 reckoned-6.135
 painfully-6.084
Token approach
Token ID3164
Feature activation+4.85e-02

Pos logit contributions
 Niet+1.888
ashington+1.637
ById+1.635
otle+1.620
plet+1.599
Neg logit contributions
hift-1.509
 revol-1.303
 captives-1.234
 painfully-1.189
Sense-1.183
Token to
Token ID284
Feature activation+2.36e-01

Pos logit contributions
hift+0.212
 revol+0.184
 captives+0.173
 painfully+0.168
Sense+0.165
Neg logit contributions
 Niet-0.268
ById-0.233
ashington-0.232
otle-0.229
plet-0.228
Token serious
Token ID2726
Feature activation-6.51e-02

Pos logit contributions
hift+1.049
 revol+0.912
 captives+0.861
 painfully+0.834
 reckoned+0.818
Neg logit contributions
 Niet-1.293
ById-1.121
ashington-1.114
otle-1.109
plet-1.100
Token,
Token ID11
Feature activation+1.63e-01

Pos logit contributions
 Niet+0.351
ById+0.304
ashington+0.301
otle+0.300
plet+0.297
Neg logit contributions
hift-0.295
 revol-0.257
 captives-0.243
 painfully-0.235
 reckoned-0.231
Token
Token ID247
Feature activation-3.59e-01

Pos logit contributions
 Niet+3.534
ById+3.126
otle+3.094
ashington+3.087
/​+3.047
Neg logit contributions
hift-2.750
 revol-2.388
 captives-2.263
 painfully-2.198
Sense-2.163
Tokens
Token ID82
Feature activation-6.53e-02

Pos logit contributions
 Niet+1.740
ById+1.481
ashington+1.477
otle+1.458
plet+1.443
Neg logit contributions
hift-1.813
 revol-1.600
 captives-1.523
 painfully-1.481
Sense-1.471
Token Ring
Token ID12569
Feature activation-3.65e-01

Pos logit contributions
 Niet+0.334
ById+0.287
ashington+0.287
otle+0.283
plet+0.281
Neg logit contributions
hift-0.312
 revol-0.274
 captives-0.259
 painfully-0.251
Sense-0.250
Token vs
Token ID3691
Feature activation+1.60e-01

Pos logit contributions
 Niet+1.906
ById+1.643
ashington+1.639
otle+1.620
plet+1.605
Neg logit contributions
hift-1.699
 revol-1.484
 captives-1.406
 painfully-1.363
Sense-1.353
Token Kar
Token ID9375
Feature activation+2.12e-01

Pos logit contributions
hift+0.788
 revol+0.691
 captives+0.655
Sense+0.638
 painfully+0.637
Neg logit contributions
 Niet-0.790
ashington-0.680
ById-0.679
otle-0.667
plet-0.666
Tokenam
Token ID321
Feature activation-1.59e+00

Pos logit contributions
hift+1.105
 revol+0.961
 captives+0.918
Sense+0.904
 painfully+0.890
Neg logit contributions
 Niet-0.993
ById-0.853
ashington-0.828
otle-0.813
plet-0.810
Tokenet
Token ID316
Feature activation-8.09e-01

Pos logit contributions
 Niet+7.697
otle+6.679
plet+6.587
ById+6.530
ashington+6.509
Neg logit contributions
hift-7.595
 revol-6.725
 captives-6.393
 reckoned-6.228
 painfully-6.187
Tokenra
Token ID430
Feature activation-5.08e-01

Pos logit contributions
 Niet+4.488
ById+3.944
otle+3.916
ashington+3.913
plet+3.868
Neg logit contributions
hift-3.402
 revol-2.941
 captives-2.778
 painfully-2.670
Sense-2.667
Token,
Token ID11
Feature activation-4.31e-01

Pos logit contributions
 Niet+2.564
ById+2.198
ashington+2.190
otle+2.171
plet+2.148
Neg logit contributions
hift-2.441
 revol-2.143
 captives-2.039
 painfully-1.979
 reckoned-1.959
Token God
Token ID1793
Feature activation-4.83e-01

Pos logit contributions
 Niet+2.194
ById+1.883
ashington+1.878
otle+1.856
plet+1.837
Neg logit contributions
hift-2.066
 revol-1.810
 captives-1.718
 painfully-1.667
Sense-1.655
Token of
Token ID286
Feature activation-3.83e-01

Pos logit contributions
 Niet+2.462
ashington+2.113
ById+2.110
otle+2.095
plet+2.060
Neg logit contributions
hift-2.291
 revol-2.009
 captives-1.909
 painfully-1.855
 reckoned-1.830
Token-
Token ID12
Feature activation-3.89e-01

Pos logit contributions
 Niet+1.313
ashington+1.120
ById+1.119
otle+1.103
plet+1.092
Neg logit contributions
hift-1.379
 revol-1.219
 captives-1.160
 painfully-1.128
 reckoned-1.123
Tokenways
Token ID1322
Feature activation-6.09e-01

Pos logit contributions
 Niet+1.927
ashington+1.661
ById+1.655
otle+1.631
plet+1.621
Neg logit contributions
hift-1.901
 revol-1.677
 captives-1.588
 painfully-1.543
 reckoned-1.534
Token-
Token ID12
Feature activation-6.22e-02

Pos logit contributions
 Niet+2.571
ashington+2.205
ById+2.174
otle+2.161
/​+2.132
Neg logit contributions
hift-3.309
 revol-3.005
 captives-2.871
 reckoned-2.806
 painfully-2.806
Tokenlaw
Token ID6270
Feature activation-3.43e-01

Pos logit contributions
 Niet+0.289
ById+0.244
ashington+0.244
otle+0.240
plet+0.238
Neg logit contributions
hift-0.325
 revol-0.289
 captives-0.275
 painfully-0.268
Sense-0.266
Token-
Token ID12
Feature activation-4.10e-01

Pos logit contributions
 Niet+1.741
ashington+1.499
ById+1.494
otle+1.482
plet+1.461
Neg logit contributions
hift-1.641
 revol-1.432
 captives-1.360
 painfully-1.323
 reckoned-1.313
Tokenenforcement
Token ID44976
Feature activation-1.58e+00

Pos logit contributions
 Niet+1.955
ashington+1.668
ById+1.667
otle+1.644
plet+1.629
Neg logit contributions
hift-2.084
 revol-1.842
 captives-1.749
 painfully-1.705
 reckoned-1.692
Token-
Token ID12
Feature activation-6.46e-01

Pos logit contributions
 Niet+5.919
/​+5.138
otle+5.113
ById+5.088
ashington+5.066
Neg logit contributions
hift-9.054
 revol-7.871
 captives-7.499
 painfully-7.438
 reckoned-7.396
Tokenwill
Token ID10594
Feature activation-5.67e-01

Pos logit contributions
 Niet+3.155
ashington+2.743
ById+2.732
otle+2.694
plet+2.680
Neg logit contributions
hift-3.164
 revol-2.786
 captives-2.632
 painfully-2.566
 reckoned-2.545
Token-
Token ID12
Feature activation-3.86e-01

Pos logit contributions
 Niet+2.472
ashington+2.097
ById+2.092
otle+2.092
plet+2.060
Neg logit contributions
hift-3.062
 revol-2.745
 captives-2.631
 painfully-2.556
 reckoned-2.545
Tokenuse
Token ID1904
Feature activation-5.19e-01

Pos logit contributions
 Niet+1.956
ashington+1.683
ById+1.683
otle+1.661
plet+1.648
Neg logit contributions
hift-1.853
 revol-1.627
 captives-1.541
 painfully-1.496
Sense-1.484
Token-
Token ID12
Feature activation-1.90e-01

Pos logit contributions
 Niet+2.306
ashington+1.950
ById+1.948
otle+1.926
plet+1.901
Neg logit contributions
hift-2.785
 revol-2.487
 captives-2.375
 painfully-2.317
 reckoned-2.305
Token scrib
Token ID44661
Feature activation-4.64e-01

Pos logit contributions
hift+1.678
 revol+1.505
 captives+1.405
 painfully+1.379
 reckoned+1.360
Neg logit contributions
 Niet-2.007
ashington-1.760
ById-1.746
otle-1.717
SourceFile-1.683
Tokenbled
Token ID9342
Feature activation-7.75e-01

Pos logit contributions
 Niet+2.618
ById+2.282
ashington+2.278
otle+2.253
plet+2.233
Neg logit contributions
hift-1.973
 revol-1.699
 captives-1.598
 painfully-1.544
Sense-1.532
Token things
Token ID1243
Feature activation-8.90e-02

Pos logit contributions
 Niet+4.135
ById+3.562
otle+3.506
plet+3.501
ashington+3.476
Neg logit contributions
hift-3.487
 revol-2.942
 captives-2.817
Sense-2.757
 reckoned-2.688
Token like
Token ID588
Feature activation-7.61e-01

Pos logit contributions
 Niet+0.498
ashington+0.435
ById+0.434
otle+0.429
plet+0.425
Neg logit contributions
hift-0.383
 revol-0.332
 captives-0.311
 painfully-0.302
 reckoned-0.297
Token Ryan
Token ID6047
Feature activation+4.91e-01

Pos logit contributions
 Niet+3.762
ById+3.200
otle+3.149
ashington+3.135
plet+3.119
Neg logit contributions
hift-3.730
 revol-3.216
 captives-3.083
 reckoned-2.983
 painfully-2.972
Token Se
Token ID1001
Feature activation-1.53e+00

Pos logit contributions
hift+2.524
 revol+2.275
 captives+2.165
 painfully+2.139
Sense+2.108
Neg logit contributions
 Niet-2.222
ById-1.932
ashington-1.914
otle-1.855
plet-1.849
Tokenac
Token ID330
Feature activation+5.18e-02

Pos logit contributions
 Niet+5.739
ashington+4.901
ById+4.777
plet+4.726
/​+4.705
Neg logit contributions
hift-8.853
 revol-7.862
 captives-7.681
 reckoned-7.433
 Syri-7.379
Tokenrest
Token ID2118
Feature activation+5.38e-01

Pos logit contributions
hift+0.237
 revol+0.208
 captives+0.196
 painfully+0.191
Sense+0.188
Neg logit contributions
 Niet-0.275
ById-0.237
ashington-0.236
otle-0.232
plet-0.231
Token,
Token ID11
Feature activation-2.24e-01

Pos logit contributions
hift+2.456
 revol+2.201
 captives+2.057
 painfully+2.015
Sense+1.982
Neg logit contributions
 Niet-2.777
ashington-2.421
ById-2.381
plet-2.362
otle-2.361
Token Louis
Token ID5593
Feature activation-2.78e-02

Pos logit contributions
 Niet+1.131
ById+0.968
ashington+0.966
otle+0.954
plet+0.945
Neg logit contributions
hift-1.085
 revol-0.951
 captives-0.903
 painfully-0.876
Sense-0.872
Token Vu
Token ID35816
Feature activation+3.56e-01

Pos logit contributions
 Niet+0.135
ById+0.117
ashington+0.117
plet+0.114
otle+0.114
Neg logit contributions
hift-0.139
 revol-0.123
 captives-0.116
 painfully-0.113
 reckoned-0.112
Tokenin
Token ID259
Feature activation-1.48e-01

Pos logit contributions
 Niet+0.630
ById+0.542
ashington+0.538
otle+0.532
plet+0.527
Neg logit contributions
hift-0.571
 revol-0.498
 captives-0.472
 painfully-0.459
Sense-0.457
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
 Niet+0.756
ById+0.649
ashington+0.648
otle+0.640
plet+0.634
Neg logit contributions
hift-0.705
 revol-0.618
 captives-0.586
 painfully-0.568
Sense-0.564
Tokenum
Token ID388
Feature activation-1.19e+00

Pos logit contributions
 Niet+6.148
ashington+5.568
/​+5.295
plet+5.289
icro+5.289
Neg logit contributions
hift-6.461
 revol-5.830
 captives-5.573
 painfully-5.404
Sense-5.211
Tokenich
Token ID488
Feature activation-7.10e-01

Pos logit contributions
 Niet+6.413
ashington+5.952
otle+5.768
icro+5.750
plet+5.725
Neg logit contributions
hift-4.862
 revol-4.204
 captives-3.993
 painfully-3.834
 reckoned-3.787
Token.
Token ID13
Feature activation-1.50e+00

Pos logit contributions
 Niet+3.486
ashington+3.147
/​+3.079
otle+3.076
egg+3.035
Neg logit contributions
hift-3.346
 revol-2.921
 captives-2.811
 painfully-2.712
 reckoned-2.666
Tokenedu
Token ID15532
Feature activation-1.51e+00

Pos logit contributions
 Niet+2.579
ashington+1.760
ById+1.601
plet+1.549
/​+1.520
Neg logit contributions
hift-11.729
 revol-10.855
 captives-10.493
 painfully-10.287
 reckoned-10.271
Token/.
Token ID11757
Feature activation-8.63e-01

Pos logit contributions
 Niet+6.370
/​+5.604
ashington+5.431
otle+5.286
ById+5.237
Neg logit contributions
hift-8.056
 revol-7.187
 captives-6.897
 painfully-6.739
 reckoned-6.663
Token<|endoftext|>
Token ID50256
Feature activation-3.24e-01

Pos logit contributions
 Niet+3.789
ashington+3.214
ById+3.158
otle+3.157
/​+3.131
Neg logit contributions
hift-4.654
 revol-4.162
 captives-3.981
 painfully-3.891
 reckoned-3.817
TokenI
Token ID40
Feature activation+2.45e-01

Pos logit contributions
 Niet+1.610
ById+1.384
ashington+1.379
otle+1.360
plet+1.346
Neg logit contributions
hift-1.583
 revol-1.396
 captives-1.324
 painfully-1.286
 reckoned-1.274
Token remember
Token ID3505
Feature activation-1.09e-01

Pos logit contributions
hift+1.091
 revol+0.952
 captives+0.885
 painfully+0.873
 reckoned+0.865
Neg logit contributions
 Niet-1.336
ById-1.167
ashington-1.151
otle-1.138
plet-1.133
Token about
Token ID546
Feature activation-2.59e-02

Pos logit contributions
 Niet+0.603
ById+0.526
ashington+0.526
otle+0.519
plet+0.514
Neg logit contributions
hift-0.473
 revol-0.410
 captives-0.385
 painfully-0.374
Sense-0.369
Token press
Token ID1803
Feature activation+1.64e-01

Pos logit contributions
hift+1.051
 revol+0.921
 captives+0.870
 painfully+0.841
Sense+0.834
Neg logit contributions
 Niet-1.276
ById-1.102
ashington-1.094
otle-1.087
plet-1.084
Token secretary
Token ID7705
Feature activation+5.54e-01

Pos logit contributions
hift+0.659
 revol+0.572
 captives+0.533
 painfully+0.513
Sense+0.504
Neg logit contributions
 Niet-0.966
ById-0.840
ashington-0.839
otle-0.835
plet-0.827
Token.
Token ID13
Feature activation+3.19e-01

Pos logit contributions
hift+2.369
 revol+2.070
 captives+1.931
 painfully+1.901
 reckoned+1.837
Neg logit contributions
 Niet-3.058
ById-2.666
ashington-2.647
otle-2.641
SourceFile-2.634
Token<|endoftext|>
Token ID50256
Feature activation-2.60e-01

Pos logit contributions
hift+1.611
 revol+1.413
 captives+1.349
Sense+1.344
 painfully+1.305
Neg logit contributions
 Niet-1.519
plet-1.289
ashington-1.281
ById-1.277
otle-1.272
TokenAL
Token ID1847
Feature activation+3.07e-01

Pos logit contributions
 Niet+1.297
ById+1.113
ashington+1.110
otle+1.095
plet+1.084
Neg logit contributions
hift-1.273
 revol-1.121
 captives-1.064
 painfully-1.034
 reckoned-1.024
TokenB
Token ID33
Feature activation-1.51e+00

Pos logit contributions
hift+1.443
 revol+1.267
 captives+1.203
Sense+1.169
 painfully+1.161
Neg logit contributions
 Niet-1.580
ById-1.354
ashington-1.353
otle-1.333
plet-1.333
TokenION
Token ID2849
Feature activation-1.14e-01

Pos logit contributions
 Niet+4.583
ById+3.620
ashington+3.513
otle+3.371
/​+3.361
Neg logit contributions
hift-10.126
 revol-9.046
 captives-8.821
 painfully-8.633
 reckoned-8.501
Token are
Token ID389
Feature activation-1.78e-01

Pos logit contributions
 Niet+0.597
ById+0.515
ashington+0.514
otle+0.508
plet+0.503
Neg logit contributions
hift-0.526
 revol-0.459
 captives-0.435
 painfully-0.422
Sense-0.419
Token delighted
Token ID20707
Feature activation+8.82e-02

Pos logit contributions
 Niet+1.000
ById+0.869
ashington+0.867
otle+0.857
plet+0.850
Neg logit contributions
hift-0.756
 revol-0.645
 captives-0.607
Sense-0.590
 painfully-0.586
Token to
Token ID284
Feature activation+6.96e-01

Pos logit contributions
hift+0.396
 revol+0.344
 captives+0.325
 painfully+0.315
Sense+0.312
Neg logit contributions
 Niet-0.475
ById-0.412
ashington-0.411
otle-0.406
plet-0.403
Token announce
Token ID5453
Feature activation+1.15e+00

Pos logit contributions
hift+3.105
 revol+2.684
 captives+2.547
 painfully+2.487
 reckoned+2.451
Neg logit contributions
 Niet-3.828
ById-3.329
ashington-3.278
otle-3.209
SourceFile-3.172
Token media
Token ID2056
Feature activation-1.38e-01

Pos logit contributions
hift+0.881
 revol+0.771
 captives+0.732
Sense+0.709
 painfully+0.708
Neg logit contributions
 Niet-1.058
ById-0.917
ashington-0.912
otle-0.907
plet-0.904
Token for
Token ID329
Feature activation+2.71e-01

Pos logit contributions
 Niet+0.735
ById+0.635
ashington+0.634
otle+0.626
plet+0.621
Neg logit contributions
hift-0.633
 revol-0.551
 captives-0.521
 painfully-0.505
Sense-0.501
Token TV
Token ID3195
Feature activation-9.70e-02

Pos logit contributions
hift+1.315
 revol+1.165
 captives+1.100
 painfully+1.074
Sense+1.069
Neg logit contributions
 Niet-1.342
ashington-1.165
ById-1.163
plet-1.142
otle-1.139
Token personality
Token ID8806
Feature activation-8.10e-02

Pos logit contributions
 Niet+0.499
ById+0.428
ashington+0.427
otle+0.423
plet+0.420
Neg logit contributions
hift-0.460
 revol-0.404
 captives-0.383
 painfully-0.371
Sense-0.371
Token Ryan
Token ID6047
Feature activation+6.45e-01

Pos logit contributions
 Niet+0.394
ById+0.338
ashington+0.337
otle+0.331
plet+0.330
Neg logit contributions
hift-0.406
 revol-0.359
 captives-0.340
 painfully-0.331
Sense-0.329
Token Se
Token ID1001
Feature activation-1.50e+00

Pos logit contributions
hift+3.129
 revol+2.734
 captives+2.586
 reckoned+2.562
 painfully+2.554
Neg logit contributions
 Niet-3.152
ById-2.806
ashington-2.715
otle-2.677
plet-2.647
Tokenac
Token ID330
Feature activation-8.86e-02

Pos logit contributions
 Niet+5.507
ashington+4.759
ById+4.583
plet+4.502
otle+4.501
Neg logit contributions
hift-8.601
 revol-7.812
 captives-7.643
 Syri-7.314
 reckoned-7.287
Tokenrest
Token ID2118
Feature activation+2.92e-01

Pos logit contributions
 Niet+0.453
ashington+0.390
ById+0.390
otle+0.387
plet+0.382
Neg logit contributions
hift-0.422
 revol-0.368
 captives-0.350
 painfully-0.338
Sense-0.337
Token's
Token ID338
Feature activation-2.89e-01

Pos logit contributions
hift+1.318
 revol+1.157
 captives+1.084
 painfully+1.055
 reckoned+1.050
Neg logit contributions
 Niet-1.557
ashington-1.346
ById-1.346
plet-1.325
otle-1.322
Token production
Token ID3227
Feature activation-2.76e-01

Pos logit contributions
 Niet+1.572
ById+1.360
ashington+1.357
otle+1.343
plet+1.329
Neg logit contributions
hift-1.290
 revol-1.115
 captives-1.055
 painfully-1.019
Sense-1.012
Token company
Token ID1664
Feature activation-6.55e-02

Pos logit contributions
 Niet+1.227
ById+1.030
ashington+1.023
otle+1.011
plet+0.996
Neg logit contributions
hift-1.495
 revol-1.328
 captives-1.271
 painfully-1.240
Sense-1.229
Tokeneng
Token ID1516
Feature activation-1.21e-01

Pos logit contributions
 Niet+1.405
ashington+1.211
ById+1.206
otle+1.191
plet+1.182
Neg logit contributions
hift-1.340
 revol-1.181
 captives-1.121
 painfully-1.087
Sense-1.075
Tokenin
Token ID259
Feature activation-1.48e-01

Pos logit contributions
 Niet+0.630
ById+0.542
ashington+0.538
otle+0.532
plet+0.527
Neg logit contributions
hift-0.571
 revol-0.498
 captives-0.472
 painfully-0.459
Sense-0.457
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
 Niet+0.756
ById+0.649
ashington+0.648
otle+0.640
plet+0.634
Neg logit contributions
hift-0.705
 revol-0.618
 captives-0.586
 painfully-0.568
Sense-0.564
Tokenum
Token ID388
Feature activation-1.19e+00

Pos logit contributions
 Niet+6.148
ashington+5.568
/​+5.295
plet+5.289
icro+5.289
Neg logit contributions
hift-6.461
 revol-5.830
 captives-5.573
 painfully-5.404
Sense-5.211
Tokenich
Token ID488
Feature activation-7.10e-01

Pos logit contributions
 Niet+6.413
ashington+5.952
otle+5.768
icro+5.750
plet+5.725
Neg logit contributions
hift-4.862
 revol-4.204
 captives-3.993
 painfully-3.834
 reckoned-3.787
Token.
Token ID13
Feature activation-1.50e+00

Pos logit contributions
 Niet+3.486
ashington+3.147
/​+3.079
otle+3.076
egg+3.035
Neg logit contributions
hift-3.346
 revol-2.921
 captives-2.811
 painfully-2.712
 reckoned-2.666
Tokenedu
Token ID15532
Feature activation-1.51e+00

Pos logit contributions
 Niet+2.579
ashington+1.760
ById+1.601
plet+1.549
/​+1.520
Neg logit contributions
hift-11.729
 revol-10.855
 captives-10.493
 painfully-10.287
 reckoned-10.271
Token/.
Token ID11757
Feature activation-8.63e-01

Pos logit contributions
 Niet+6.370
/​+5.604
ashington+5.431
otle+5.286
ById+5.237
Neg logit contributions
hift-8.056
 revol-7.187
 captives-6.897
 painfully-6.739
 reckoned-6.663
Token<|endoftext|>
Token ID50256
Feature activation-3.24e-01

Pos logit contributions
 Niet+3.789
ashington+3.214
ById+3.158
otle+3.157
/​+3.131
Neg logit contributions
hift-4.654
 revol-4.162
 captives-3.981
 painfully-3.891
 reckoned-3.817
TokenI
Token ID40
Feature activation+2.45e-01

Pos logit contributions
 Niet+1.610
ById+1.384
ashington+1.379
otle+1.360
plet+1.346
Neg logit contributions
hift-1.583
 revol-1.396
 captives-1.324
 painfully-1.286
 reckoned-1.274
Token remember
Token ID3505
Feature activation-1.09e-01

Pos logit contributions
hift+1.091
 revol+0.952
 captives+0.885
 painfully+0.873
 reckoned+0.865
Neg logit contributions
 Niet-1.336
ById-1.167
ashington-1.151
otle-1.138
plet-1.133
Tokenuman
Token ID3778
Feature activation-4.43e-01

Pos logit contributions
 Niet+1.871
ById+1.584
ashington+1.577
otle+1.554
plet+1.542
Neg logit contributions
hift-2.005
 revol-1.772
 captives-1.686
 painfully-1.643
Sense-1.634
Token used
Token ID973
Feature activation+4.39e-02

Pos logit contributions
 Niet+2.411
ById+2.090
ashington+2.087
otle+2.065
plet+2.045
Neg logit contributions
hift-1.971
 revol-1.704
 captives-1.609
 painfully-1.557
Sense-1.550
Token to
Token ID284
Feature activation+2.31e-02

Pos logit contributions
hift+0.190
 revol+0.166
 captives+0.156
 painfully+0.151
 reckoned+0.148
Neg logit contributions
 Niet-0.245
ById-0.214
ashington-0.213
otle-0.210
plet-0.209
Token speak
Token ID2740
Feature activation-2.92e-01

Pos logit contributions
hift+0.109
 revol+0.096
 captives+0.091
 painfully+0.089
 reckoned+0.087
Neg logit contributions
 Niet-0.120
ById-0.104
ashington-0.103
otle-0.101
plet-0.100
Token with
Token ID351
Feature activation+3.23e-01

Pos logit contributions
 Niet+1.582
ById+1.370
ashington+1.366
otle+1.352
plet+1.339
Neg logit contributions
hift-1.308
 revol-1.135
 captives-1.071
 painfully-1.037
Sense-1.031
Token Saur
Token ID46197
Feature activation-1.46e+00

Pos logit contributions
hift+1.585
 revol+1.400
 captives+1.348
 painfully+1.302
Sense+1.281
Neg logit contributions
 Niet-1.576
ashington-1.378
ById-1.373
plet-1.347
otle-1.340
Tokenon
Token ID261
Feature activation-5.25e-01

Pos logit contributions
 Niet+7.710
ashington+6.882
otle+6.844
ById+6.749
plet+6.714
Neg logit contributions
hift-6.393
 revol-5.592
 captives-5.250
 painfully-5.059
Sense-5.009
Token and
Token ID290
Feature activation-4.18e-01

Pos logit contributions
 Niet+2.805
ById+2.423
ashington+2.417
otle+2.401
plet+2.364
Neg logit contributions
hift-2.373
 revol-2.055
 captives-1.935
Sense-1.882
 painfully-1.876
Token through
Token ID832
Feature activation-5.89e-01

Pos logit contributions
 Niet+2.159
ById+1.852
ashington+1.846
otle+1.830
plet+1.807
Neg logit contributions
hift-1.974
 revol-1.717
 captives-1.627
 painfully-1.577
Sense-1.572
Token which
Token ID543
Feature activation+3.69e-02

Pos logit contributions
 Niet+2.984
ById+2.547
ashington+2.529
otle+2.517
plet+2.479
Neg logit contributions
hift-2.824
 revol-2.465
 captives-2.333
 painfully-2.265
 reckoned-2.255
Token Sar
Token ID6866
Feature activation-2.30e-01

Pos logit contributions
hift+0.185
 revol+0.164
 captives+0.156
 painfully+0.152
Sense+0.150
Neg logit contributions
 Niet-0.179
ById-0.153
ashington-0.153
otle-0.151
plet-0.150
Token42
Token ID3682
Feature activation+1.57e-02

Pos logit contributions
 Niet+0.709
ById+0.611
ashington+0.611
otle+0.603
plet+0.599
Neg logit contributions
hift-0.635
 revol-0.551
 captives-0.521
 painfully-0.506
Sense-0.504
Token 3
Token ID513
Feature activation-3.80e-01

Pos logit contributions
hift+0.079
 revol+0.070
 captives+0.066
 painfully+0.064
Sense+0.064
Neg logit contributions
 Niet-0.076
ById-0.065
ashington-0.065
otle-0.064
plet-0.064
Token355
Token ID28567
Feature activation-1.07e-01

Pos logit contributions
 Niet+1.921
ById+1.648
ashington+1.646
otle+1.624
plet+1.607
Neg logit contributions
hift-1.830
 revol-1.611
 captives-1.528
 painfully-1.484
Sense-1.471
Token;
Token ID26
Feature activation-5.91e-01

Pos logit contributions
 Niet+0.575
ById+0.498
ashington+0.497
otle+0.491
plet+0.487
Neg logit contributions
hift-0.479
 revol-0.416
 captives-0.393
 painfully-0.380
Sense-0.378
Token mart
Token ID11277
Feature activation-6.88e-01

Pos logit contributions
 Niet+2.956
ashington+2.531
ById+2.512
otle+2.495
plet+2.463
Neg logit contributions
hift-2.861
 revol-2.480
 captives-2.353
Sense-2.311
 painfully-2.295
Tokenin
Token ID259
Feature activation-1.46e+00

Pos logit contributions
 Niet+3.580
ashington+3.185
otle+3.151
plet+3.103
ById+3.087
Neg logit contributions
hift-3.110
 revol-2.698
 captives-2.574
 painfully-2.481
Sense-2.479
Tokenrand
Token ID25192
Feature activation-5.11e-01

Pos logit contributions
 Niet+8.075
ashington+7.613
otle+7.427
plet+7.400
ById+7.268
Neg logit contributions
hift-5.507
 revol-4.615
 captives-4.402
 painfully-4.212
Sense-4.198
Tokenall
Token ID439
Feature activation-3.24e-01

Pos logit contributions
 Niet+2.842
ashington+2.520
ById+2.505
otle+2.495
plet+2.463
Neg logit contributions
hift-2.157
 revol-1.853
 captives-1.755
 painfully-1.689
Sense-1.668
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
 Niet+1.724
ashington+1.500
ById+1.497
otle+1.482
plet+1.467
Neg logit contributions
hift-1.468
 revol-1.274
 captives-1.205
 painfully-1.168
Sense-1.158
Tokencom
Token ID785
Feature activation-8.98e-01

Pos logit contributions
 Niet+5.533
ashington+4.845
otle+4.797
plet+4.773
icro+4.741
Neg logit contributions
hift-4.974
 revol-4.419
 captives-4.147
 painfully-4.055
 reckoned-3.976
Token),
Token ID828
Feature activation-1.61e-01

Pos logit contributions
 Niet+3.397
ashington+2.793
otle+2.765
/​+2.738
ById+2.734
Neg logit contributions
hift-5.318
 revol-4.769
 captives-4.597
 painfully-4.502
Sense-4.486
Token in
Token ID287
Feature activation-4.36e-01

Pos logit contributions
hift+1.535
 revol+1.346
 captives+1.263
 painfully+1.223
 reckoned+1.210
Neg logit contributions
 Niet-2.077
ById-1.806
ashington-1.796
otle-1.784
plet-1.773
Token team
Token ID1074
Feature activation-5.80e-01

Pos logit contributions
 Niet+2.170
ashington+1.854
ById+1.846
otle+1.826
plet+1.812
Neg logit contributions
hift-2.133
 revol-1.873
 captives-1.777
 painfully-1.726
Sense-1.723
Token history
Token ID2106
Feature activation-3.31e-01

Pos logit contributions
 Niet+3.232
ashington+2.820
ById+2.817
otle+2.785
plet+2.764
Neg logit contributions
hift-2.495
 revol-2.134
 captives-2.014
 painfully-1.949
Sense-1.940
Token (
Token ID357
Feature activation-1.76e-01

Pos logit contributions
 Niet+1.770
ById+1.532
ashington+1.529
otle+1.509
plet+1.494
Neg logit contributions
hift-1.506
 revol-1.308
 captives-1.236
 painfully-1.197
Sense-1.192
Token Patri
Token ID9606
Feature activation-2.63e-01

Pos logit contributions
 Niet+0.960
ById+0.833
ashington+0.831
otle+0.822
plet+0.814
Neg logit contributions
hift-0.780
 revol-0.676
 captives-0.638
 painfully-0.617
Sense-0.614
Tokenk
Token ID74
Feature activation-1.46e+00

Pos logit contributions
 Niet+0.998
ById+0.814
ashington+0.807
otle+0.790
plet+0.781
Neg logit contributions
hift-1.598
 revol-1.445
 captives-1.386
 painfully-1.359
Sense-1.354
Token Elias
Token ID41462
Feature activation-4.12e-01

Pos logit contributions
 Niet+6.168
/​+4.913
plet+4.907
ashington+4.812
ById+4.792
Neg logit contributions
hift-8.414
 revol-7.208
 captives-6.915
Sense-6.631
 painfully-6.601
Token,
Token ID837
Feature activation-1.61e-01

Pos logit contributions
 Niet+2.100
ById+1.805
ashington+1.803
otle+1.779
plet+1.761
Neg logit contributions
hift-1.966
 revol-1.717
 captives-1.633
 painfully-1.581
Sense-1.575
Token Claude
Token ID40559
Feature activation-3.10e-01

Pos logit contributions
 Niet+0.777
ById+0.661
ashington+0.659
otle+0.651
plet+0.644
Neg logit contributions
hift-0.816
 revol-0.721
 captives-0.686
 painfully-0.668
Sense-0.663
Token Lem
Token ID20607
Feature activation-5.93e-01

Pos logit contributions
 Niet+1.305
ashington+1.116
ById+1.105
plet+1.081
otle+1.077
Neg logit contributions
hift-1.748
 revol-1.560
 captives-1.487
 painfully-1.453
 reckoned-1.450
Tokenie
Token ID494
Feature activation-8.04e-01

Pos logit contributions
 Niet+3.279
ashington+2.841
ById+2.836
plet+2.821
otle+2.814
Neg logit contributions
hift-2.596
 revol-2.201
 captives-2.090
 painfully-2.010
 reckoned-1.975
Tokenet
Token ID316
Feature activation-4.56e-01

Pos logit contributions
 Niet+0.244
ById+0.204
ashington+0.197
otle+0.191
SourceFile+0.190
Neg logit contributions
hift-0.362
 revol-0.325
 captives-0.311
 painfully-0.305
Sense-0.303
Tokenrah
Token ID11392
Feature activation-8.13e-01

Pos logit contributions
 Niet+2.514
ashington+2.182
ById+2.178
otle+2.169
plet+2.150
Neg logit contributions
hift-1.974
 revol-1.716
 captives-1.620
 painfully-1.564
Sense-1.559
Tokenyd
Token ID5173
Feature activation-3.90e-01

Pos logit contributions
 Niet+2.468
ashington+1.899
ById+1.888
otle+1.867
plet+1.830
Neg logit contributions
hift-5.515
 revol-5.055
 captives-4.882
 painfully-4.783
Sense-4.756
Tokenro
Token ID305
Feature activation-7.01e-01

Pos logit contributions
 Niet+1.964
ById+1.683
ashington+1.669
otle+1.648
plet+1.631
Neg logit contributions
hift-1.897
 revol-1.671
 captives-1.579
 painfully-1.533
Sense-1.524
Token-
Token ID12
Feature activation-1.13e+00

Pos logit contributions
 Niet+3.215
ashington+2.709
ById+2.705
otle+2.682
plet+2.656
Neg logit contributions
hift-3.691
 revol-3.280
 captives-3.121
 painfully-3.045
Sense-3.033
Token2
Token ID17
Feature activation-1.44e+00

Pos logit contributions
 Niet+5.510
ById+4.702
ashington+4.671
otle+4.650
/​+4.590
Neg logit contributions
hift-5.449
 revol-4.853
 captives-4.593
 painfully-4.474
 reckoned-4.414
TokenH
Token ID39
Feature activation-1.20e-01

Pos logit contributions
 Niet+6.312
ashington+5.270
ById+5.262
/​+5.216
icro+5.204
Neg logit contributions
hift-7.539
 revol-6.706
 captives-6.434
 reckoned-6.304
 painfully-6.300
Token-
Token ID12
Feature activation-9.22e-01

Pos logit contributions
 Niet+0.553
ById+0.471
ashington+0.468
otle+0.453
plet+0.451
Neg logit contributions
hift-0.631
 revol-0.562
 captives-0.537
Sense-0.523
 painfully-0.520
Tokenp
Token ID79
Feature activation-1.11e+00

Pos logit contributions
 Niet+4.482
ById+3.825
ashington+3.798
otle+3.773
plet+3.758
Neg logit contributions
hift-4.541
 revol-4.031
 captives-3.813
 painfully-3.716
 reckoned-3.692
Tokenyr
Token ID2417
Feature activation-3.13e-01

Pos logit contributions
 Niet+4.454
ashington+3.663
ById+3.622
otle+3.618
icro+3.604
Neg logit contributions
hift-6.410
 revol-5.761
 captives-5.524
 reckoned-5.430
 painfully-5.387
Tokenan
Token ID272
Feature activation-3.96e-01

Pos logit contributions
 Niet+1.628
ById+1.402
ashington+1.397
otle+1.374
plet+1.357
Neg logit contributions
hift-1.477
 revol-1.286
 captives-1.215
 painfully-1.181
Sense-1.180
Token a
Token ID257
Feature activation+2.25e-01

Pos logit contributions
hift+0.056
 revol+0.049
 captives+0.046
 painfully+0.045
 reckoned+0.044
Neg logit contributions
 Niet-0.066
ById-0.057
ashington-0.057
otle-0.056
plet-0.056
Token large
Token ID1588
Feature activation-1.94e-01

Pos logit contributions
hift+0.891
 revol+0.768
 captives+0.714
 painfully+0.700
 reckoned+0.683
Neg logit contributions
 Niet-1.324
ById-1.171
ashington-1.166
otle-1.156
plet-1.147
Token reservoir
Token ID22115
Feature activation+2.11e-01

Pos logit contributions
 Niet+0.993
ById+0.854
ashington+0.852
otle+0.841
plet+0.835
Neg logit contributions
hift-0.929
 revol-0.818
 captives-0.775
 painfully-0.753
 reckoned-0.746
Token of
Token ID286
Feature activation-2.55e-01

Pos logit contributions
hift+0.944
 revol+0.828
 captives+0.780
 painfully+0.759
 reckoned+0.756
Neg logit contributions
 Niet-1.146
ById-0.994
ashington-0.986
otle-0.977
plet-0.962
Token M
Token ID337
Feature activation-2.16e-01

Pos logit contributions
 Niet+1.245
ashington+1.070
ById+1.066
otle+1.050
plet+1.038
Neg logit contributions
hift-1.269
 revol-1.133
 captives-1.073
 painfully-1.046
Sense-1.032
Token.
Token ID13
Feature activation-1.44e+00

Pos logit contributions
 Niet+1.141
ById+0.984
ashington+0.982
otle+0.971
plet+0.961
Neg logit contributions
hift-0.999
 revol-0.871
 captives-0.824
 painfully-0.799
Sense-0.793
Token le
Token ID443
Feature activation-6.85e-01

Pos logit contributions
 Niet+6.403
ById+5.505
otle+5.399
ashington+5.365
plet+5.344
Neg logit contributions
hift-7.231
 revol-6.139
Sense-6.007
 captives-6.004
 reckoned-5.849
Tokenp
Token ID79
Feature activation-8.80e-01

Pos logit contributions
 Niet+3.020
ashington+2.535
ById+2.534
otle+2.510
plet+2.487
Neg logit contributions
hift-3.730
 revol-3.309
 captives-3.178
 painfully-3.088
Sense-3.082
Tokenra
Token ID430
Feature activation-3.39e-01

Pos logit contributions
 Niet+5.216
ashington+4.615
ById+4.604
otle+4.536
plet+4.519
Neg logit contributions
hift-3.441
 revol-2.835
 captives-2.671
 reckoned-2.579
 painfully-2.569
Tokene
Token ID68
Feature activation-8.59e-01

Pos logit contributions
 Niet+1.607
ById+1.367
ashington+1.365
otle+1.352
plet+1.344
Neg logit contributions
hift-1.726
 revol-1.513
 captives-1.454
Sense-1.408
 painfully-1.407
Token and
Token ID290
Feature activation-4.94e-01

Pos logit contributions
 Niet+4.333
ashington+3.732
ById+3.730
otle+3.665
plet+3.662
Neg logit contributions
hift-4.104
 revol-3.547
 captives-3.371
 painfully-3.272
Sense-3.264
Token neighbours
Token ID23788
Feature activation+1.92e-01

Pos logit contributions
 Niet+1.276
ById+1.126
ashington+1.125
otle+1.115
plet+1.106
Neg logit contributions
hift-0.688
 revol-0.570
 captives-0.524
Sense-0.505
 painfully-0.502
Token,
Token ID11
Feature activation+6.83e-02

Pos logit contributions
hift+0.851
 revol+0.739
 captives+0.697
 painfully+0.677
 reckoned+0.668
Neg logit contributions
 Niet-1.048
ashington-0.916
ById-0.914
otle-0.903
plet-0.899
Token Dust
Token ID16240
Feature activation+3.64e-02

Pos logit contributions
hift+0.325
 revol+0.286
 captives+0.271
 painfully+0.264
 reckoned+0.261
Neg logit contributions
 Niet-0.348
ashington-0.301
ById-0.301
otle-0.297
plet-0.294
Tokeny
Token ID88
Feature activation+2.16e-01

Pos logit contributions
hift+0.139
 revol+0.117
 captives+0.109
 painfully+0.105
Sense+0.105
Neg logit contributions
 Niet-0.220
ById-0.195
ashington-0.194
otle-0.191
plet-0.190
Token Al
Token ID978
Feature activation+1.23e-01

Pos logit contributions
hift+1.109
 revol+0.980
 captives+0.931
 painfully+0.908
Sense+0.905
Neg logit contributions
 Niet-1.009
ById-0.869
ashington-0.862
otle-0.845
plet-0.840
Tokenbr
Token ID1671
Feature activation-1.41e+00

Pos logit contributions
hift+0.599
 revol+0.524
 captives+0.497
Sense+0.482
 painfully+0.480
Neg logit contributions
 Niet-0.619
ById-0.534
ashington-0.522
otle-0.520
SourceFile-0.510
Tokenitton
Token ID47304
Feature activation-6.01e-02

Pos logit contributions
 Niet+6.997
ashington+6.378
plet+6.296
icro+6.250
otle+6.192
Neg logit contributions
hift-6.282
 revol-5.540
 captives-5.344
 painfully-5.102
Sense-5.009
Token.
Token ID13
Feature activation-3.72e-01

Pos logit contributions
 Niet+0.324
ById+0.280
ashington+0.280
otle+0.276
plet+0.274
Neg logit contributions
hift-0.270
 revol-0.235
 captives-0.222
 painfully-0.215
Sense-0.213
Token
Token ID198
Feature activation-1.54e-01

Pos logit contributions
 Niet+1.635
ById+1.358
ashington+1.350
otle+1.334
plet+1.305
Neg logit contributions
hift-2.044
 revol-1.836
 captives-1.754
 painfully-1.710
Sense-1.693
Token
Token ID198
Feature activation-2.31e-01

Pos logit contributions
 Niet+0.747
ashington+0.624
ById+0.623
plet+0.618
otle+0.614
Neg logit contributions
hift-0.790
 revol-0.690
 captives-0.660
Sense-0.649
 painfully-0.645
Token"
Token ID1
Feature activation-3.28e-01

Pos logit contributions
 Niet+1.152
ById+0.983
ashington+0.982
otle+0.970
plet+0.961
Neg logit contributions
hift-1.134
 revol-0.996
 captives-0.947
 painfully-0.920
Sense-0.917
Token the
Token ID262
Feature activation-2.53e-02

Pos logit contributions
hift+2.203
 revol+1.927
 captives+1.801
 painfully+1.772
Sense+1.729
Neg logit contributions
 Niet-2.796
ashington-2.439
ById-2.437
otle-2.386
plet-2.385
Token comparison
Token ID7208
Feature activation-2.07e-01

Pos logit contributions
 Niet+0.134
ById+0.116
ashington+0.116
otle+0.114
plet+0.114
Neg logit contributions
hift-0.116
 revol-0.102
 captives-0.096
 painfully-0.093
Sense-0.092
Token instead
Token ID2427
Feature activation-2.19e-01

Pos logit contributions
 Niet+1.157
ById+1.010
ashington+1.007
otle+0.996
plet+0.986
Neg logit contributions
hift-0.887
 revol-0.760
 captives-0.719
Sense-0.689
 painfully-0.688
Token of
Token ID286
Feature activation-1.79e-01

Pos logit contributions
 Niet+1.137
ById+0.984
ashington+0.982
otle+0.972
plet+0.961
Neg logit contributions
hift-1.016
 revol-0.885
 captives-0.843
Sense-0.812
 reckoned-0.806
Token,
Token ID11
Feature activation-6.31e-01

Pos logit contributions
 Niet+0.987
ById+0.857
ashington+0.856
otle+0.846
plet+0.839
Neg logit contributions
hift-0.779
 revol-0.672
 captives-0.634
 painfully-0.612
Sense-0.610
Token say
Token ID910
Feature activation-1.41e+00

Pos logit contributions
 Niet+2.916
ById+2.464
ashington+2.462
otle+2.431
plet+2.400
Neg logit contributions
hift-3.305
 revol-2.915
 captives-2.786
Sense-2.706
 painfully-2.692
Token,
Token ID11
Feature activation-1.93e-02

Pos logit contributions
 Niet+6.598
ById+5.676
otle+5.600
ashington+5.588
plet+5.454
Neg logit contributions
hift-6.946
 revol-6.247
 captives-5.856
 reckoned-5.694
 painfully-5.649
Token listing
Token ID13487
Feature activation+7.55e-02

Pos logit contributions
 Niet+0.103
ById+0.090
ashington+0.089
otle+0.088
plet+0.087
Neg logit contributions
hift-0.087
 revol-0.077
 captives-0.072
 painfully-0.070
Sense-0.069
Token the
Token ID262
Feature activation-7.58e-02

Pos logit contributions
hift+0.333
 revol+0.292
 captives+0.275
 painfully+0.268
 reckoned+0.260
Neg logit contributions
 Niet-0.412
ashington-0.357
ById-0.356
otle-0.351
plet-0.350
Token readily
Token ID14704
Feature activation-1.74e-01

Pos logit contributions
 Niet+0.407
ashington+0.353
ById+0.352
otle+0.348
plet+0.345
Neg logit contributions
hift-0.341
 revol-0.301
 captives-0.283
 painfully-0.276
Sense-0.270
Token-
Token ID12
Feature activation+2.09e-01

Pos logit contributions
 Niet+1.041
ById+0.917
ashington+0.915
otle+0.905
plet+0.898
Neg logit contributions
hift-0.678
 revol-0.573
 captives-0.537
 painfully-0.516
Sense-0.513
Token have
Token ID423
Feature activation-2.76e-01

Pos logit contributions
 Niet+0.300
ById+0.257
ashington+0.256
otle+0.253
plet+0.250
Neg logit contributions
hift-0.303
 revol-0.268
 captives-0.255
 painfully-0.248
 reckoned-0.245
Token been
Token ID587
Feature activation-2.35e-01

Pos logit contributions
 Niet+1.570
ById+1.371
ashington+1.368
otle+1.353
plet+1.342
Neg logit contributions
hift-1.160
 revol-0.996
 captives-0.936
 painfully-0.904
Sense-0.898
Token designed
Token ID3562
Feature activation-2.46e-01

Pos logit contributions
 Niet+1.272
ById+1.103
ashington+1.100
otle+1.088
plet+1.078
Neg logit contributions
hift-1.048
 revol-0.910
 captives-0.859
 painfully-0.832
Sense-0.825
Token to
Token ID284
Feature activation-1.86e-01

Pos logit contributions
 Niet+1.331
ById+1.153
ashington+1.151
otle+1.137
plet+1.127
Neg logit contributions
hift-1.105
 revol-0.959
 captives-0.906
 painfully-0.877
Sense-0.871
Token dis
Token ID595
Feature activation-1.77e-02

Pos logit contributions
 Niet+0.872
ById+0.738
ashington+0.735
otle+0.725
plet+0.716
Neg logit contributions
hift-0.971
 revol-0.863
 captives-0.822
 painfully-0.803
Sense-0.793
Tokenem
Token ID368
Feature activation-1.41e+00

Pos logit contributions
 Niet+0.094
ById+0.081
ashington+0.081
otle+0.080
plet+0.079
Neg logit contributions
hift-0.081
 revol-0.071
 captives-0.067
 painfully-0.065
Sense-0.065
Tokenpower
Token ID6477
Feature activation+8.28e-02

Pos logit contributions
 Niet+7.565
plet+6.915
otle+6.817
ashington+6.815
icro+6.712
Neg logit contributions
hift-5.685
 revol-4.973
 captives-4.738
 painfully-4.582
 reckoned-4.537
Token and
Token ID290
Feature activation-7.92e-01

Pos logit contributions
hift+0.390
 revol+0.342
 captives+0.324
 painfully+0.314
Sense+0.311
Neg logit contributions
 Niet-0.428
ById-0.368
ashington-0.368
otle-0.363
plet-0.360
Token divide
Token ID14083
Feature activation-2.84e-02

Pos logit contributions
 Niet+3.727
ById+3.163
ashington+3.159
otle+3.125
plet+3.103
Neg logit contributions
hift-4.009
 revol-3.474
 captives-3.323
Sense-3.310
 reckoned-3.221
Token our
Token ID674
Feature activation+3.06e-01

Pos logit contributions
 Niet+0.162
ById+0.140
ashington+0.140
otle+0.139
plet+0.138
Neg logit contributions
hift-0.119
 revol-0.101
 captives-0.095
Sense-0.092
 painfully-0.092
Token communities
Token ID5348
Feature activation+5.09e-01

Pos logit contributions
hift+1.437
 revol+1.281
 captives+1.215
 painfully+1.183
 reckoned+1.158
Neg logit contributions
 Niet-1.564
ById-1.363
ashington-1.355
otle-1.342
plet-1.324
Token another
Token ID1194
Feature activation+1.99e-01

Pos logit contributions
hift+0.837
 revol+0.726
 captives+0.684
 painfully+0.661
Sense+0.656
Neg logit contributions
 Niet-1.070
ById-0.932
ashington-0.930
otle-0.919
plet-0.913
Token.
Token ID13
Feature activation-2.69e-01

Pos logit contributions
hift+0.925
 revol+0.808
 captives+0.764
 painfully+0.743
Sense+0.735
Neg logit contributions
 Niet-1.047
ById-0.906
ashington-0.904
otle-0.891
plet-0.883
Token Huge
Token ID35225
Feature activation-1.43e-01

Pos logit contributions
 Niet+1.271
ById+1.077
ashington+1.075
otle+1.060
plet+1.050
Neg logit contributions
hift-1.385
 revol-1.226
 captives-1.167
 painfully-1.136
Sense-1.130
Token,
Token ID11
Feature activation+2.18e-01

Pos logit contributions
 Niet+0.800
ById+0.697
ashington+0.695
otle+0.687
plet+0.681
Neg logit contributions
hift-0.610
 revol-0.525
 captives-0.494
 painfully-0.476
Sense-0.474
Token En
Token ID2039
Feature activation+2.76e-01

Pos logit contributions
hift+0.904
 revol+0.787
 captives+0.736
 painfully+0.717
 reckoned+0.706
Neg logit contributions
 Niet-1.242
ById-1.090
ashington-1.089
otle-1.080
plet-1.069
Tokenorm
Token ID579
Feature activation-1.41e+00

Pos logit contributions
hift+1.433
 revol+1.265
 captives+1.202
 painfully+1.173
Sense+1.168
Neg logit contributions
 Niet-1.297
ById-1.106
ashington-1.099
otle-1.073
plet-1.065
Tokenous
Token ID516
Feature activation-2.68e-01

Pos logit contributions
 Niet+8.252
ashington+7.465
otle+7.447
plet+7.357
icro+7.305
Neg logit contributions
hift-5.049
 revol-4.392
 captives-4.065
 painfully-3.873
Sense-3.845
Token,
Token ID11
Feature activation+2.30e-02

Pos logit contributions
 Niet+1.459
ById+1.268
ashington+1.263
otle+1.247
plet+1.233
Neg logit contributions
hift-1.186
 revol-1.026
 captives-0.968
 painfully-0.934
Sense-0.933
Token Titanic
Token ID41184
Feature activation-1.10e-02

Pos logit contributions
hift+0.105
 revol+0.092
 captives+0.087
 painfully+0.085
 reckoned+0.084
Neg logit contributions
 Niet-0.122
ById-0.106
ashington-0.106
otle-0.105
plet-0.104
Token.
Token ID13
Feature activation-3.25e-01

Pos logit contributions
 Niet+0.060
ById+0.052
ashington+0.052
otle+0.052
plet+0.051
Neg logit contributions
hift-0.049
 revol-0.042
 captives-0.040
 painfully-0.038
Sense-0.038
Token The
Token ID383
Feature activation-5.58e-02

Pos logit contributions
 Niet+1.529
ById+1.293
ashington+1.289
otle+1.272
plet+1.256
Neg logit contributions
hift-1.686
 revol-1.494
 captives-1.424
 painfully-1.386
Sense-1.377
Token34
Token ID2682
Feature activation+3.16e-01

Pos logit contributions
 Niet+0.602
ById+0.516
ashington+0.515
otle+0.508
plet+0.503
Neg logit contributions
hift-0.578
 revol-0.507
 captives-0.481
 painfully-0.467
Sense-0.465
Token)
Token ID8
Feature activation+2.78e-01

Pos logit contributions
hift+1.693
 revol+1.515
 captives+1.445
 painfully+1.401
Sense+1.390
Neg logit contributions
 Niet-1.430
ById-1.208
ashington-1.203
otle-1.184
plet-1.159
Token "
Token ID366
Feature activation+6.44e-02

Pos logit contributions
hift+1.382
 revol+1.218
 captives+1.152
Sense+1.124
 painfully+1.122
Neg logit contributions
 Niet-1.349
ById-1.164
ashington-1.162
plet-1.143
otle-1.138
TokenI
Token ID40
Feature activation+1.10e-02

Pos logit contributions
hift+0.316
 revol+0.277
 captives+0.263
Sense+0.257
 painfully+0.256
Neg logit contributions
 Niet-0.322
ashington-0.274
ById-0.273
otle-0.270
plet-0.268
Token can
Token ID460
Feature activation-4.87e-01

Pos logit contributions
hift+0.050
 revol+0.044
 captives+0.041
 painfully+0.040
 reckoned+0.040
Neg logit contributions
 Niet-0.058
ById-0.051
ashington-0.050
otle-0.050
plet-0.049
Token understand
Token ID1833
Feature activation-1.40e+00

Pos logit contributions
 Niet+2.286
ashington+1.959
ById+1.941
otle+1.935
plet+1.910
Neg logit contributions
hift-2.474
 revol-2.184
 captives-2.093
Sense-2.051
 painfully-2.008
Token people
Token ID661
Feature activation-7.07e-01

Pos logit contributions
 Niet+6.788
otle+5.686
ById+5.613
/​+5.579
plet+5.522
Neg logit contributions
hift-6.488
 revol-5.526
Sense-5.416
 captives-5.392
 reckoned-5.250
Token getting
Token ID1972
Feature activation-6.11e-01

Pos logit contributions
 Niet+3.579
ById+3.085
ashington+3.062
otle+3.047
plet+3.003
Neg logit contributions
hift-3.336
 revol-2.843
 captives-2.736
Sense-2.703
 painfully-2.630
Token critical
Token ID4688
Feature activation+1.39e-01

Pos logit contributions
 Niet+3.337
ById+2.889
ashington+2.863
otle+2.856
plet+2.854
Neg logit contributions
hift-2.632
 revol-2.208
Sense-2.101
 captives-2.098
 reckoned-2.014
Token about
Token ID546
Feature activation+4.39e-02

Pos logit contributions
hift+0.650
 revol+0.578
 captives+0.543
 painfully+0.527
 reckoned+0.518
Neg logit contributions
 Niet-0.732
ById-0.630
ashington-0.630
otle-0.614
plet-0.610
Token them
Token ID606
Feature activation-7.23e-02

Pos logit contributions
hift+0.195
 revol+0.169
 captives+0.159
 painfully+0.155
 reckoned+0.152
Neg logit contributions
 Niet-0.241
ById-0.210
ashington-0.209
otle-0.206
plet-0.204
Token and
Token ID290
Feature activation+5.58e-01

Pos logit contributions
hift+1.364
 revol+1.188
 captives+1.127
Sense+1.112
 painfully+1.111
Neg logit contributions
 Niet-1.380
plet-1.135
ById-1.127
otle-1.104
ashington-1.097
Token services
Token ID2594
Feature activation+2.13e-01

Pos logit contributions
hift+2.160
 revol+1.935
 captives+1.804
 painfully+1.729
Sense+1.712
Neg logit contributions
 Niet-3.315
otle-2.863
ById-2.854
plet-2.827
ashington-2.820
Token.
Token ID13
Feature activation+4.35e-01

Pos logit contributions
hift+0.861
 revol+0.727
 captives+0.681
Sense+0.664
 painfully+0.659
Neg logit contributions
 Niet-1.256
ById-1.078
ashington-1.074
plet-1.070
otle-1.069
Token Thank
Token ID6952
Feature activation+6.75e-01

Pos logit contributions
hift+2.035
 revol+1.784
Sense+1.694
 captives+1.656
 painfully+1.625
Neg logit contributions
 Niet-2.244
plet-1.928
otle-1.904
ashington-1.875
ById-1.867
Token you
Token ID345
Feature activation+1.14e+00

Pos logit contributions
hift+2.608
 revol+2.116
 captives+1.995
 painfully+1.936
Sense+1.905
Neg logit contributions
 Niet-4.183
ById-3.571
ashington-3.570
otle-3.536
SourceFile-3.504
Token for
Token ID329
Feature activation+1.27e+00

Pos logit contributions
hift+3.857
 revol+3.185
 captives+2.889
 painfully+2.856
Sense+2.846
Neg logit contributions
 Niet-7.405
otle-6.324
plet-6.248
ById-6.202
SourceFile-6.147
Token subscribing
Token ID18412
Feature activation+7.83e-01

Pos logit contributions
hift+5.154
 revol+4.660
 captives+4.263
 painfully+4.197
 comfortable+4.109
Neg logit contributions
 Niet-7.241
plet-6.068
otle-6.019
ゼウス-5.923
ById-5.915
Token.
Token ID13
Feature activation+9.45e-01

Pos logit contributions
hift+2.807
 revol+2.329
Sense+2.217
 painfully+2.149
 captives+2.141
Neg logit contributions
 Niet-4.875
plet-4.214
ById-4.183
ashington-4.125
otle-4.104
Token An
Token ID1052
Feature activation+4.21e-01

Pos logit contributions
hift+4.637
Sense+4.002
 revol+3.982
 Syri+3.832
 captives+3.696
Neg logit contributions
 Niet-4.396
plet-3.983
ovember-3.971
sterdam-3.845
icro-3.770
Token error
Token ID4049
Feature activation+7.30e-03

Pos logit contributions
hift+1.552
 revol+1.322
 captives+1.227
 painfully+1.192
Sense+1.182
Neg logit contributions
 Niet-2.597
ById-2.266
otle-2.246
ashington-2.235
plet-2.225
Token has
Token ID468
Feature activation+2.30e-01

Pos logit contributions
hift+0.027
 revol+0.023
 captives+0.022
 painfully+0.021
Sense+0.020
Neg logit contributions
 Niet-0.045
ById-0.039
ashington-0.039
plet-0.039
otle-0.039
Token your
Token ID534
Feature activation+1.08e-01

Pos logit contributions
hift+0.325
 revol+0.284
 captives+0.267
 painfully+0.259
Sense+0.258
Neg logit contributions
 Niet-0.418
ById-0.363
ashington-0.360
plet-0.359
otle-0.357
Token stories
Token ID3923
Feature activation+7.04e-01

Pos logit contributions
hift+0.461
 revol+0.378
 captives+0.356
Sense+0.345
 painfully+0.343
Neg logit contributions
 Niet-0.616
ashington-0.542
ById-0.536
otle-0.530
/​-0.521
Token!
Token ID0
Feature activation+6.92e-01

Pos logit contributions
hift+2.660
 revol+2.317
 painfully+2.143
 captives+2.112
Sense+2.058
Neg logit contributions
 Niet-4.269
ById-3.703
plet-3.663
SourceFile-3.661
ashington-3.633
Token Do
Token ID2141
Feature activation+8.52e-02

Pos logit contributions
hift+3.601
 revol+3.256
 captives+3.100
Sense+3.041
 reckoned+2.999
Neg logit contributions
 Niet-3.128
plet-2.761
ById-2.663
SourceFile-2.657
ashington-2.625
Token you
Token ID345
Feature activation+6.83e-01

Pos logit contributions
hift+0.388
 revol+0.340
 captives+0.321
 reckoned+0.312
Sense+0.312
Neg logit contributions
 Niet-0.448
ById-0.390
ashington-0.386
otle-0.384
plet-0.383
Token have
Token ID423
Feature activation+1.18e+00

Pos logit contributions
hift+2.897
 revol+2.493
 captives+2.345
 painfully+2.265
Sense+2.248
Neg logit contributions
 Niet-3.860
ById-3.367
ashington-3.360
otle-3.324
plet-3.295
Token a
Token ID257
Feature activation+4.83e-01

Pos logit contributions
hift+4.865
 revol+4.247
 captives+3.923
 painfully+3.831
Sense+3.712
Neg logit contributions
 Niet-6.859
ashington-6.033
ById-5.929
plet-5.872
otle-5.842
Token story
Token ID1621
Feature activation+6.19e-01

Pos logit contributions
hift+2.145
 revol+1.847
 captives+1.752
 painfully+1.684
Sense+1.678
Neg logit contributions
 Niet-2.627
ById-2.282
ashington-2.278
otle-2.248
plet-2.227
Token for
Token ID329
Feature activation+3.65e-01

Pos logit contributions
hift+2.532
 revol+2.180
 captives+1.991
 painfully+1.976
Sense+1.926
Neg logit contributions
 Niet-3.617
ById-3.158
ashington-3.141
otle-3.076
plet-3.068
Token The
Token ID383
Feature activation+6.51e-01

Pos logit contributions
hift+1.695
 revol+1.493
 captives+1.415
Sense+1.369
 painfully+1.369
Neg logit contributions
 Niet-1.915
ashington-1.648
ById-1.642
otle-1.617
plet-1.609
Token Sun
Token ID3825
Feature activation-1.53e-01

Pos logit contributions
hift+4.131
 revol+3.777
 captives+3.604
 painfully+3.591
Sense+3.566
Neg logit contributions
 Niet-2.278
otle-1.771
plet-1.742
ById-1.732
/​-1.707
Token a
Token ID257
Feature activation+2.50e-01

Pos logit contributions
hift+1.010
 revol+0.867
 captives+0.801
 painfully+0.789
 reckoned+0.761
Neg logit contributions
 Niet-1.482
ashington-1.259
otle-1.248
ById-1.245
plet-1.240
Token robot
Token ID9379
Feature activation+3.36e-01

Pos logit contributions
hift+0.905
 revol+0.777
 captives+0.708
 painfully+0.694
 reckoned+0.665
Neg logit contributions
 Niet-1.586
ById-1.386
ashington-1.376
otle-1.366
SourceFile-1.348
Token by
Token ID416
Feature activation+8.66e-01

Pos logit contributions
hift+1.460
 revol+1.315
 captives+1.232
 painfully+1.205
 reckoned+1.172
Neg logit contributions
 Niet-1.873
plet-1.564
ById-1.554
ashington-1.550
otle-1.550
Token clicking
Token ID12264
Feature activation+1.33e+00

Pos logit contributions
hift+2.834
 revol+2.371
 painfully+2.102
 captives+2.087
Sense+2.026
Neg logit contributions
 Niet-5.726
ById-5.058
ashington-5.047
otle-4.984
plet-4.953
Token the
Token ID262
Feature activation+1.78e+00

Pos logit contributions
hift+4.154
 revol+3.370
 captives+3.098
 painfully+3.061
Sense+3.046
Neg logit contributions
 Niet-8.718
ashington-7.446
ById-7.345
otle-7.339
plet-7.337
Token box
Token ID3091
Feature activation+1.15e+00

Pos logit contributions
hift+4.988
 revol+4.278
 painfully+3.980
 comfortable+3.878
 captives+3.846
Neg logit contributions
 Niet-11.944
otle-10.215
ゼウス-10.166
ovember-10.125
ashington-10.089
Token.
Token ID13
Feature activation+9.16e-01

Pos logit contributions
hift+3.713
 revol+2.916
 captives+2.703
Sense+2.698
 painfully+2.693
Neg logit contributions
 Niet-7.490
ById-6.346
sterdam-6.315
plet-6.311
ovember-6.287
Token Invalid
Token ID17665
Feature activation-2.16e-01

Pos logit contributions
hift+2.825
 revol+2.274
Sense+2.269
 reckoned+2.092
 Syri+2.063
Neg logit contributions
 Niet-5.928
otle-5.267
sterdam-5.254
plet-5.231
ovember-5.192
Token email
Token ID3053
Feature activation+5.89e-03

Pos logit contributions
 Niet+0.925
ById+0.762
ashington+0.760
otle+0.749
plet+0.738
Neg logit contributions
hift-1.214
 revol-1.091
 captives-1.043
 painfully-1.021
Sense-1.009
Token address
Token ID2209
Feature activation+5.07e-01

Pos logit contributions
hift+0.028
 revol+0.024
 captives+0.023
 painfully+0.023
Sense+0.022
Neg logit contributions
 Niet-0.030
ById-0.026
otle-0.026
ashington-0.026
plet-0.025
Token.
Token ID13
Feature activation+4.62e-01

Pos logit contributions
hift+1.864
 revol+1.538
Sense+1.450
 painfully+1.409
 captives+1.405
Neg logit contributions
 Niet-3.088
ById-2.694
plet-2.685
otle-2.664
ashington-2.657
Token running
Token ID2491
Feature activation+6.49e-01

Pos logit contributions
 Niet+1.169
ById+1.018
ashington+1.017
otle+1.005
plet+0.997
Neg logit contributions
hift-0.901
 revol-0.777
 captives-0.731
 painfully-0.707
Sense-0.701
Token,
Token ID11
Feature activation+1.24e-01

Pos logit contributions
hift+2.852
 revol+2.498
 captives+2.350
 painfully+2.308
 reckoned+2.233
Neg logit contributions
 Niet-3.497
ashington-3.103
ById-3.051
plet-3.037
otle-3.007
Token pl
Token ID458
Feature activation-1.39e-01

Pos logit contributions
hift+0.534
 revol+0.465
 captives+0.435
 painfully+0.424
 reckoned+0.416
Neg logit contributions
 Niet-0.690
ashington-0.606
ById-0.605
otle-0.598
plet-0.591
Tokenanking
Token ID15230
Feature activation-1.37e-01

Pos logit contributions
 Niet+0.924
ById+0.824
ashington+0.818
otle+0.810
plet+0.806
Neg logit contributions
hift-0.452
 revol-0.369
 captives-0.338
 painfully-0.322
Sense-0.317
Token and
Token ID290
Feature activation+1.53e-01

Pos logit contributions
 Niet+0.715
ById+0.617
ashington+0.616
otle+0.608
plet+0.603
Neg logit contributions
hift-0.639
 revol-0.558
 captives-0.528
 painfully-0.513
Sense-0.508
Token lung
Token ID12317
Feature activation+1.20e+00

Pos logit contributions
hift+0.676
 revol+0.589
 captives+0.553
 painfully+0.539
 reckoned+0.528
Neg logit contributions
 Niet-0.838
ashington-0.733
ById-0.732
otle-0.720
plet-0.715
Tokenes
Token ID274
Feature activation+1.43e-01

Pos logit contributions
hift+5.063
 revol+4.450
 captives+4.106
 painfully+4.062
Sense+3.941
Neg logit contributions
 Niet-6.516
ashington-5.769
ById-5.690
otle-5.662
SourceFile-5.627
Token.
Token ID13
Feature activation-9.48e-02

Pos logit contributions
hift+0.630
 revol+0.544
 captives+0.511
 painfully+0.498
 reckoned+0.488
Neg logit contributions
 Niet-0.791
ById-0.687
ashington-0.686
otle-0.678
plet-0.672
Token (
Token ID357
Feature activation+3.83e-01

Pos logit contributions
 Niet+0.445
ById+0.380
ashington+0.379
plet+0.374
otle+0.372
Neg logit contributions
hift-0.489
 revol-0.430
 captives-0.411
Sense-0.400
 painfully-0.399
TokenWa
Token ID33484
Feature activation-4.24e-02

Pos logit contributions
hift+1.758
 revol+1.501
 captives+1.428
Sense+1.426
 painfully+1.385
Neg logit contributions
 Niet-2.034
ashington-1.727
ById-1.726
otle-1.724
plet-1.712
Tokenub
Token ID549
Feature activation-9.50e-02

Pos logit contributions
 Niet+0.292
ById+0.260
otle+0.257
ashington+0.257
plet+0.254
Neg logit contributions
hift-0.131
 revol-0.102
 captives-0.093
 painfully-0.090
Sense-0.089
TokenThe
Token ID464
Feature activation+3.38e-01

Pos logit contributions
hift+1.164
 revol+0.997
Sense+0.964
 captives+0.954
 painfully+0.927
Neg logit contributions
 Niet-1.225
ashington-1.045
ById-1.035
plet-1.032
otle-1.031
Token Frenchman
Token ID46819
Feature activation+4.15e-01

Pos logit contributions
hift+1.576
 revol+1.394
 captives+1.325
 painfully+1.293
 reckoned+1.262
Neg logit contributions
 Niet-1.721
ashington-1.519
ById-1.517
otle-1.484
plet-1.479
Token set
Token ID900
Feature activation+5.12e-01

Pos logit contributions
hift+1.755
 revol+1.544
 captives+1.437
 reckoned+1.428
 painfully+1.421
Neg logit contributions
 Niet-2.307
ById-2.047
ashington-2.041
otle-1.998
plet-1.995
Token a
Token ID257
Feature activation+3.27e-02

Pos logit contributions
hift+2.158
 revol+1.863
 captives+1.754
 painfully+1.728
 reckoned+1.699
Neg logit contributions
 Niet-2.847
ashington-2.523
ById-2.518
otle-2.483
plet-2.448
Token lap
Token ID14779
Feature activation+4.79e-01

Pos logit contributions
hift+0.128
 revol+0.110
 captives+0.103
 painfully+0.099
 reckoned+0.098
Neg logit contributions
 Niet-0.194
ById-0.172
ashington-0.171
otle-0.169
plet-0.168
Token time
Token ID640
Feature activation+1.24e+00

Pos logit contributions
hift+1.806
 revol+1.502
 captives+1.407
 painfully+1.391
 reckoned+1.383
Neg logit contributions
 Niet-2.908
ById-2.588
ashington-2.552
otle-2.532
plet-2.516
Token of
Token ID286
Feature activation+4.18e-01

Pos logit contributions
hift+5.006
 revol+4.191
 reckoned+4.000
 painfully+3.980
 captives+3.964
Neg logit contributions
 Niet-6.967
ById-6.191
plet-6.134
ashington-6.107
SourceFile-6.029
Token 1
Token ID352
Feature activation+1.77e-01

Pos logit contributions
hift+2.044
 revol+1.798
 captives+1.712
 painfully+1.692
 reckoned+1.669
Neg logit contributions
 Niet-2.050
ashington-1.798
ById-1.797
plet-1.756
otle-1.744
Token:
Token ID25
Feature activation+7.29e-01

Pos logit contributions
hift+0.762
 revol+0.657
 captives+0.617
 painfully+0.600
 reckoned+0.592
Neg logit contributions
 Niet-0.990
ById-0.865
ashington-0.864
otle-0.852
plet-0.844
Token54
Token ID4051
Feature activation+4.16e-01

Pos logit contributions
hift+2.709
 revol+2.154
Sense+2.069
 captives+2.031
 painfully+1.976
Neg logit contributions
 Niet-4.506
plet-3.907
ashington-3.871
ById-3.869
otle-3.859
Token.
Token ID13
Feature activation+3.74e-01

Pos logit contributions
hift+1.737
 revol+1.479
 captives+1.394
 painfully+1.358
 reckoned+1.351
Neg logit contributions
 Niet-2.368
ById-2.074
ashington-2.071
otle-2.050
plet-2.034
Token Bre
Token ID3719
Feature activation-5.78e-01

Pos logit contributions
 Niet+0.742
ById+0.631
ashington+0.628
otle+0.618
plet+0.613
Neg logit contributions
hift-0.837
 revol-0.744
 captives-0.709
 painfully-0.690
Sense-0.688
Tokencher
Token ID2044
Feature activation+4.35e-01

Pos logit contributions
 Niet+2.901
ashington+2.510
ById+2.489
otle+2.460
plet+2.452
Neg logit contributions
hift-2.775
 revol-2.455
 captives-2.334
 painfully-2.266
Sense-2.238
Token/
Token ID14
Feature activation+4.47e-01

Pos logit contributions
hift+2.132
 revol+1.882
 captives+1.784
 painfully+1.738
 reckoned+1.728
Neg logit contributions
 Niet-2.134
ashington-1.841
ById-1.840
otle-1.821
plet-1.797
TokenNBC
Token ID13175
Feature activation+4.51e-01

Pos logit contributions
hift+2.435
 revol+2.145
 captives+2.045
Sense+2.004
 reckoned+1.997
Neg logit contributions
 Niet-1.978
ashington-1.645
ById-1.644
otle-1.640
plet-1.628
Token News
Token ID3000
Feature activation+3.41e-01

Pos logit contributions
hift+2.008
 revol+1.712
 captives+1.643
 painfully+1.594
 reckoned+1.576
Neg logit contributions
 Niet-2.421
ById-2.122
ashington-2.085
otle-2.083
plet-2.074
Token
Token ID198
Feature activation+4.13e-01

Pos logit contributions
hift+1.736
 revol+1.524
 captives+1.464
 painfully+1.419
 reckoned+1.401
Neg logit contributions
 Niet-1.611
ById-1.380
ashington-1.360
plet-1.357
otle-1.356
Token
Token ID198
Feature activation+3.58e-01

Pos logit contributions
hift+1.902
 revol+1.594
Sense+1.565
 captives+1.529
 painfully+1.515
Neg logit contributions
 Niet-2.207
plet-1.832
ashington-1.818
ById-1.810
sterdam-1.792
TokenThe
Token ID464
Feature activation+1.17e-01

Pos logit contributions
hift+1.722
 revol+1.484
Sense+1.426
 captives+1.418
 painfully+1.379
Neg logit contributions
 Niet-1.831
ashington-1.533
otle-1.525
plet-1.522
ById-1.521
Token Environmental
Token ID13272
Feature activation+1.30e-01

Pos logit contributions
hift+0.561
 revol+0.496
 captives+0.468
 painfully+0.457
Sense+0.445
Neg logit contributions
 Niet-0.600
ById-0.517
ashington-0.517
otle-0.508
plet-0.507
Token Protection
Token ID9985
Feature activation-3.68e-01

Pos logit contributions
hift+0.516
 revol+0.436
 captives+0.404
Sense+0.393
 painfully+0.391
Neg logit contributions
 Niet-0.776
ashington-0.685
ById-0.679
otle-0.674
plet-0.671
Token Agency
Token ID7732
Feature activation-1.39e-01

Pos logit contributions
 Niet+1.580
ById+1.317
ashington+1.314
otle+1.296
plet+1.275
Neg logit contributions
hift-2.046
 revol-1.829
 captives-1.752
 painfully-1.707
Sense-1.697
Tokenine
Token ID500
Feature activation+1.26e-01

Pos logit contributions
hift+1.086
 revol+0.919
Sense+0.862
 painfully+0.861
 captives+0.854
Neg logit contributions
 Niet-1.446
ashington-1.222
ById-1.202
SourceFile-1.193
plet-1.192
Token saying
Token ID2282
Feature activation-4.79e-02

Pos logit contributions
hift+0.611
 revol+0.546
 captives+0.513
 painfully+0.503
 reckoned+0.496
Neg logit contributions
 Niet-0.634
ashington-0.541
ById-0.541
otle-0.534
plet-0.528
Token that
Token ID326
Feature activation+3.27e-01

Pos logit contributions
 Niet+0.257
ById+0.223
ashington+0.222
otle+0.219
plet+0.218
Neg logit contributions
hift-0.217
 revol-0.189
 captives-0.178
 painfully-0.173
 reckoned-0.171
Token he
Token ID339
Feature activation+1.77e-01

Pos logit contributions
hift+1.429
 revol+1.255
 captives+1.183
 painfully+1.151
 reckoned+1.133
Neg logit contributions
 Niet-1.790
ById-1.574
ashington-1.570
otle-1.541
plet-1.532
Token's
Token ID338
Feature activation-3.04e-02

Pos logit contributions
hift+0.804
 revol+0.723
 captives+0.676
 painfully+0.662
 reckoned+0.662
Neg logit contributions
 Niet-0.946
ById-0.813
ashington-0.809
plet-0.797
otle-0.797
Token essentially
Token ID6986
Feature activation+3.26e-01

Pos logit contributions
 Niet+0.168
ById+0.146
ashington+0.146
otle+0.144
plet+0.143
Neg logit contributions
hift-0.132
 revol-0.114
 captives-0.107
 painfully-0.104
Sense-0.103
Token operating
Token ID5361
Feature activation+1.53e-01

Pos logit contributions
hift+1.743
 revol+1.590
 captives+1.500
 painfully+1.472
 reckoned+1.465
Neg logit contributions
 Niet-1.472
ashington-1.241
ById-1.238
otle-1.230
plet-1.207
Token in
Token ID287
Feature activation-8.01e-02

Pos logit contributions
hift+0.687
 revol+0.603
 captives+0.570
 painfully+0.556
 reckoned+0.544
Neg logit contributions
 Niet-0.825
ashington-0.720
ById-0.715
otle-0.706
plet-0.705
Token the
Token ID262
Feature activation-1.41e-01

Pos logit contributions
 Niet+0.435
ById+0.378
ashington+0.377
otle+0.372
plet+0.368
Neg logit contributions
hift-0.358
 revol-0.311
 captives-0.293
 painfully-0.284
Sense-0.281
Token party
Token ID2151
Feature activation+4.37e-01

Pos logit contributions
 Niet+0.763
ashington+0.664
ById+0.664
otle+0.655
plet+0.650
Neg logit contributions
hift-0.624
 revol-0.545
 captives-0.513
 painfully-0.500
Sense-0.493
Token interest
Token ID1393
Feature activation+4.63e-01

Pos logit contributions
hift+1.892
 revol+1.696
 captives+1.580
 painfully+1.544
 reckoned+1.523
Neg logit contributions
 Niet-2.388
ById-2.095
ashington-2.079
otle-2.050
plet-2.036
Token.
Token ID13
Feature activation+5.17e-02

Pos logit contributions
hift+1.245
 revol+1.091
 captives+1.024
 painfully+1.003
 reckoned+0.987
Neg logit contributions
 Niet-1.502
ashington-1.313
ById-1.309
otle-1.290
plet-1.286
Token
Token ID564
Feature activation+2.17e-01

Pos logit contributions
hift+0.263
 revol+0.233
 captives+0.221
 painfully+0.215
Sense+0.215
Neg logit contributions
 Niet-0.248
ashington-0.211
ById-0.211
otle-0.208
plet-0.206
Token
Token ID250
Feature activation+4.39e-02

Pos logit contributions
hift+0.957
 revol+0.828
 captives+0.781
 painfully+0.756
Sense+0.751
Neg logit contributions
 Niet-1.193
ById-1.035
ashington-1.034
otle-1.022
plet-1.013
TokenT
Token ID51
Feature activation+9.90e-01

Pos logit contributions
hift+0.217
 revol+0.192
 captives+0.182
Sense+0.177
 painfully+0.177
Neg logit contributions
 Niet-0.216
ashington-0.185
ById-0.184
otle-0.181
plet-0.180
Tokenoss
Token ID793
Feature activation+9.69e-01

Pos logit contributions
hift+4.058
 revol+3.562
 painfully+3.269
 captives+3.245
 reckoned+3.238
Neg logit contributions
 Niet-5.459
ById-4.812
ashington-4.767
SourceFile-4.663
plet-4.648
Token out
Token ID503
Feature activation+3.74e-01

Pos logit contributions
hift+3.851
 revol+3.313
 captives+3.066
 painfully+3.014
Sense+2.958
Neg logit contributions
 Niet-5.601
ashington-4.996
ById-4.893
plet-4.799
otle-4.785
Token everything
Token ID2279
Feature activation+3.89e-01

Pos logit contributions
hift+1.594
 revol+1.405
 captives+1.309
 painfully+1.282
Sense+1.254
Neg logit contributions
 Niet-2.068
ashington-1.827
ById-1.809
otle-1.784
plet-1.783
Token you
Token ID345
Feature activation+3.45e-01

Pos logit contributions
hift+1.731
 revol+1.540
 captives+1.426
 painfully+1.411
Sense+1.375
Neg logit contributions
 Niet-2.083
ashington-1.831
ById-1.812
plet-1.783
otle-1.780
Token can
Token ID460
Feature activation+1.18e-01

Pos logit contributions
hift+1.473
 revol+1.299
 captives+1.201
 painfully+1.176
 reckoned+1.171
Neg logit contributions
 Niet-1.929
ashington-1.674
ById-1.666
plet-1.645
otle-1.639
Token imagine
Token ID5967
Feature activation+2.72e-01

Pos logit contributions
hift+0.550
 revol+0.485
 captives+0.455
 painfully+0.445
 reckoned+0.439
Neg logit contributions
 Niet-0.619
ashington-0.531
ById-0.529
otle-0.520
plet-0.517
Token about
Token ID546
Feature activation+2.23e-01

Pos logit contributions
hift+1.266
 revol+1.117
 captives+1.044
 painfully+1.026
Sense+1.011
Neg logit contributions
 Niet-1.415
ashington-1.232
ById-1.223
otle-1.205
plet-1.201
Token now
Token ID783
Feature activation-3.49e-01

Pos logit contributions
 Niet+0.421
ById+0.370
ashington+0.369
otle+0.365
plet+0.362
Neg logit contributions
hift-0.287
 revol-0.246
 captives-0.230
 painfully-0.222
Sense-0.219
Token looking
Token ID2045
Feature activation-6.87e-01

Pos logit contributions
 Niet+1.986
ashington+1.731
ById+1.730
otle+1.709
plet+1.696
Neg logit contributions
hift-1.465
 revol-1.251
 captives-1.176
Sense-1.135
 painfully-1.133
Token for
Token ID329
Feature activation-5.84e-01

Pos logit contributions
 Niet+3.908
ashington+3.443
ById+3.414
otle+3.409
plet+3.376
Neg logit contributions
hift-2.796
 revol-2.362
 captives-2.204
Sense-2.180
 reckoned-2.117
Token ways
Token ID2842
Feature activation-9.34e-01

Pos logit contributions
 Niet+3.226
ById+2.796
ashington+2.784
otle+2.769
plet+2.737
Neg logit contributions
hift-2.549
 revol-2.178
 captives-2.051
Sense-1.985
 painfully-1.982
Token to
Token ID284
Feature activation-3.19e-01

Pos logit contributions
 Niet+4.303
otle+3.718
ashington+3.717
ById+3.655
plet+3.620
Neg logit contributions
hift-4.608
 revol-4.055
 captives-3.838
 reckoned-3.790
Sense-3.719
Token be
Token ID307
Feature activation+2.41e-01

Pos logit contributions
 Niet+1.555
ById+1.324
ashington+1.318
otle+1.297
plet+1.285
Neg logit contributions
hift-1.609
 revol-1.421
 captives-1.351
 painfully-1.317
Sense-1.303
Token able
Token ID1498
Feature activation-2.19e-01

Pos logit contributions
hift+0.907
 revol+0.765
 captives+0.713
 painfully+0.689
 reckoned+0.683
Neg logit contributions
 Niet-1.482
ById-1.300
ashington-1.294
otle-1.281
plet-1.274
Token to
Token ID284
Feature activation-3.95e-01

Pos logit contributions
 Niet+1.210
ById+1.049
ashington+1.045
otle+1.031
plet+1.022
Neg logit contributions
hift-0.962
 revol-0.832
 captives-0.787
 painfully-0.763
 reckoned-0.751
Token direct
Token ID1277
Feature activation-1.76e-01

Pos logit contributions
 Niet+1.913
ById+1.629
ashington+1.627
otle+1.607
plet+1.591
Neg logit contributions
hift-1.986
 revol-1.752
 captives-1.667
 painfully-1.619
Sense-1.612
Token more
Token ID517
Feature activation+1.79e-02

Pos logit contributions
 Niet+0.973
ashington+0.847
ById+0.847
otle+0.834
plet+0.827
Neg logit contributions
hift-0.770
 revol-0.670
 captives-0.631
 painfully-0.612
Sense-0.603
Token light
Token ID1657
Feature activation-5.13e-01

Pos logit contributions
hift+0.080
 revol+0.070
 captives+0.066
 painfully+0.065
Sense+0.063
Neg logit contributions
 Niet-0.097
ById-0.084
ashington-0.084
otle-0.082
plet-0.082
Token
Token ID247
Feature activation+4.38e-02

Pos logit contributions
 Niet+0.504
ashington+0.435
ById+0.433
otle+0.431
plet+0.428
Neg logit contributions
hift-0.381
 revol-0.329
 captives-0.309
 painfully-0.298
Sense-0.297
Tokenve
Token ID303
Feature activation-1.15e-01

Pos logit contributions
hift+0.207
 revol+0.183
 captives+0.172
 painfully+0.168
Sense+0.167
Neg logit contributions
 Niet-0.227
ById-0.194
ashington-0.193
otle-0.191
plet-0.189
Token heard
Token ID2982
Feature activation-4.80e-01

Pos logit contributions
 Niet+0.675
ById+0.593
ashington+0.592
otle+0.585
plet+0.581
Neg logit contributions
hift-0.458
 revol-0.389
 captives-0.365
 painfully-0.351
Sense-0.349
Token that
Token ID326
Feature activation-3.51e-01

Pos logit contributions
 Niet+2.561
ById+2.215
ashington+2.211
otle+2.185
plet+2.162
Neg logit contributions
hift-2.174
 revol-1.877
 captives-1.779
Sense-1.721
 painfully-1.717
Token type
Token ID2099
Feature activation-1.68e-01

Pos logit contributions
 Niet+1.905
ById+1.645
ashington+1.643
otle+1.627
plet+1.611
Neg logit contributions
hift-1.563
 revol-1.344
 captives-1.271
Sense-1.232
 painfully-1.229
Token of
Token ID286
Feature activation+2.06e-01

Pos logit contributions
 Niet+0.854
ById+0.734
ashington+0.733
otle+0.726
plet+0.717
Neg logit contributions
hift-0.805
 revol-0.705
 captives-0.671
 painfully-0.648
Sense-0.646
Token rhetoric
Token ID13214
Feature activation+1.10e-04

Pos logit contributions
hift+0.861
 revol+0.759
 captives+0.702
 painfully+0.685
 reckoned+0.664
Neg logit contributions
 Niet-1.173
ById-1.028
ashington-1.028
otle-1.010
plet-1.001
Token before
Token ID878
Feature activation+6.63e-01

Pos logit contributions
hift+0.000
 revol+0.000
 captives+0.000
 painfully+0.000
 reckoned+0.000
Neg logit contributions
 Niet-0.001
ashington-0.001
ById-0.001
plet-0.001
otle-0.000
Token,
Token ID11
Feature activation-2.39e-01

Pos logit contributions
hift+2.900
 revol+2.565
 captives+2.356
 painfully+2.353
Sense+2.243
Neg logit contributions
 Niet-3.622
ById-3.117
ashington-3.101
plet-3.071
otle-3.057
Token
Token ID447
Feature activation-4.33e-02

Pos logit contributions
 Niet+1.217
ById+1.045
ashington+1.039
otle+1.028
plet+1.018
Neg logit contributions
hift-1.150
 revol-1.013
 captives-0.959
 painfully-0.933
Sense-0.923
Token
Token ID251
Feature activation-2.26e-01

Pos logit contributions
 Niet+0.241
ashington+0.207
ById+0.207
otle+0.206
plet+0.204
Neg logit contributions
hift-0.189
 revol-0.163
 captives-0.154
 painfully-0.148
Sense-0.147
Token Light
Token ID4401
Feature activation-1.59e-01

Pos logit contributions
hift+2.714
 revol+2.381
 captives+2.275
 reckoned+2.218
Sense+2.203
Neg logit contributions
 Niet-2.658
ById-2.268
ashington-2.257
plet-2.209
otle-2.196
Tokenyear
Token ID1941
Feature activation+5.25e-01

Pos logit contributions
 Niet+0.697
ById+0.583
ashington+0.581
otle+0.572
plet+0.566
Neg logit contributions
hift-0.873
 revol-0.779
 captives-0.744
 painfully-0.726
Sense-0.722
Token in
Token ID287
Feature activation+5.05e-01

Pos logit contributions
hift+2.375
 revol+2.102
 captives+1.954
 painfully+1.919
 reckoned+1.899
Neg logit contributions
 Niet-2.789
ashington-2.425
ById-2.409
otle-2.369
plet-2.366
Token black
Token ID2042
Feature activation+4.31e-01

Pos logit contributions
hift+2.313
 revol+2.049
 captives+1.914
 painfully+1.876
Sense+1.836
Neg logit contributions
 Niet-2.638
ById-2.316
ashington-2.312
otle-2.249
plet-2.240
Token-
Token ID12
Feature activation-1.07e-01

Pos logit contributions
hift+1.972
 revol+1.740
 captives+1.633
 painfully+1.593
 reckoned+1.568
Neg logit contributions
 Niet-2.273
ById-1.986
ashington-1.958
otle-1.930
plet-1.921
Tokenand
Token ID392
Feature activation-5.19e-01

Pos logit contributions
 Niet+0.325
ById+0.248
ashington+0.247
otle+0.241
plet+0.236
Neg logit contributions
hift-0.732
 revol-0.669
 captives-0.646
 painfully-0.633
Sense-0.631
Token-
Token ID12
Feature activation+1.30e-01

Pos logit contributions
 Niet+2.384
ById+2.009
ashington+2.005
otle+1.979
plet+1.957
Neg logit contributions
hift-2.744
 revol-2.436
 captives-2.324
 painfully-2.264
Sense-2.252
Tokenwhite
Token ID11186
Feature activation+6.22e-01

Pos logit contributions
hift+0.738
 revol+0.663
 captives+0.634
 painfully+0.619
Sense+0.618
Neg logit contributions
 Niet-0.542
ById-0.450
ashington-0.446
otle-0.436
plet-0.431
Token.
Token ID13
Feature activation-8.09e-02

Pos logit contributions
hift+2.828
 revol+2.521
 captives+2.346
 painfully+2.287
 reckoned+2.250
Neg logit contributions
 Niet-3.260
ById-2.874
ashington-2.860
plet-2.801
otle-2.795
Token The
Token ID383
Feature activation+6.20e-03

Pos logit contributions
 Niet+0.379
ById+0.323
ashington+0.322
otle+0.316
plet+0.316
Neg logit contributions
hift-0.420
 revol-0.370
 captives-0.352
Sense-0.344
 painfully-0.344
Token supporters
Token ID5941
Feature activation-1.26e-01

Pos logit contributions
hift+0.029
 revol+0.025
 captives+0.024
 painfully+0.023
Sense+0.023
Neg logit contributions
 Niet-0.032
ById-0.028
ashington-0.028
otle-0.027
plet-0.027
Token.
Token ID13
Feature activation-2.86e-02

Pos logit contributions
hift+1.903
 revol+1.664
 captives+1.569
 painfully+1.544
Sense+1.502
Neg logit contributions
 Niet-2.500
ById-2.205
otle-2.191
plet-2.161
ashington-2.153
Token Brian
Token ID8403
Feature activation+1.61e-01

Pos logit contributions
 Niet+0.134
ById+0.114
ashington+0.114
otle+0.113
plet+0.112
Neg logit contributions
hift-0.147
 revol-0.131
 captives-0.124
 painfully-0.121
 reckoned-0.120
Token Bab
Token ID12400
Feature activation-1.78e-01

Pos logit contributions
hift+0.800
 revol+0.713
 captives+0.677
 painfully+0.665
Sense+0.657
Neg logit contributions
 Niet-0.761
ById-0.669
ashington-0.668
otle-0.666
plet-0.660
Tokenine
Token ID500
Feature activation+4.97e-01

Pos logit contributions
 Niet+0.909
ById+0.778
ashington+0.776
otle+0.765
plet+0.757
Neg logit contributions
hift-0.858
 revol-0.752
 captives-0.712
 painfully-0.692
Sense-0.688
Tokenau
Token ID559
Feature activation-1.31e-01

Pos logit contributions
hift+2.928
 revol+2.620
 captives+2.496
 painfully+2.455
 reckoned+2.443
Neg logit contributions
 Niet-1.981
ById-1.677
ashington-1.616
otle-1.604
plet-1.588
Token/
Token ID14
Feature activation+1.63e-01

Pos logit contributions
 Niet+0.606
ById+0.512
ashington+0.510
otle+0.504
plet+0.498
Neg logit contributions
hift-0.688
 revol-0.611
 captives-0.582
 painfully-0.567
Sense-0.563
TokenNBA
Token ID32470
Feature activation+4.91e-01

Pos logit contributions
hift+0.864
 revol+0.765
 captives+0.728
 painfully+0.711
Sense+0.710
Neg logit contributions
 Niet-0.746
ById-0.625
ashington-0.624
otle-0.619
plet-0.612
TokenE
Token ID36
Feature activation+9.25e-02

Pos logit contributions
hift+1.988
 revol+1.710
 captives+1.616
Sense+1.556
 painfully+1.555
Neg logit contributions
 Niet-2.842
otle-2.473
ById-2.451
ashington-2.440
plet-2.440
Token/
Token ID14
Feature activation-2.43e-03

Pos logit contributions
hift+0.495
 revol+0.444
 captives+0.424
 painfully+0.413
Sense+0.409
Neg logit contributions
 Niet-0.415
ById-0.349
ashington-0.347
otle-0.345
plet-0.340
TokenGetty
Token ID6633
Feature activation+1.72e-02

Pos logit contributions
 Niet+0.008
ashington+0.006
ById+0.006
otle+0.006
plet+0.006
Neg logit contributions
hift-0.016
 revol-0.014
 captives-0.014
 painfully-0.014
Sense-0.013
Token Images
Token ID5382
Feature activation+2.64e-01

Pos logit contributions
hift+0.069
 revol+0.058
 captives+0.055
 painfully+0.052
 reckoned+0.052
Neg logit contributions
 Niet-0.102
ById-0.089
ashington-0.089
otle-0.088
plet-0.088
Token Herod
Token ID48084
Feature activation-6.26e-01

Pos logit contributions
hift+1.524
 revol+1.363
 captives+1.293
 reckoned+1.254
 painfully+1.243
Neg logit contributions
 Niet-1.466
ashington-1.281
ById-1.279
plet-1.241
SourceFile-1.222
Tokenot
Token ID313
Feature activation+6.01e-01

Pos logit contributions
 Niet+2.954
otle+2.509
ashington+2.501
ById+2.475
plet+2.435
Neg logit contributions
hift-3.239
 revol-2.818
 captives-2.703
Sense-2.634
 painfully-2.631
Tokenus
Token ID385
Feature activation+4.32e-01

Pos logit contributions
hift+2.379
 revol+2.083
 captives+1.868
Sense+1.839
 painfully+1.815
Neg logit contributions
 Niet-3.528
ById-3.235
ashington-3.064
SourceFile-3.061
plet-3.037
Token begins
Token ID6140
Feature activation-2.14e-01

Pos logit contributions
hift+1.973
 revol+1.766
 captives+1.653
 painfully+1.613
 reckoned+1.610
Neg logit contributions
 Niet-2.269
ById-1.982
ashington-1.970
plet-1.938
otle-1.919
Token the
Token ID262
Feature activation-1.53e-03

Pos logit contributions
 Niet+1.152
ById+0.995
ashington+0.991
otle+0.982
plet+0.972
Neg logit contributions
hift-0.966
 revol-0.837
 captives-0.791
 painfully-0.765
Sense-0.763
Token Australian
Token ID6638
Feature activation+1.00e-01

Pos logit contributions
 Niet+0.009
ById+0.008
ashington+0.008
otle+0.007
plet+0.007
Neg logit contributions
hift-0.006
 revol-0.006
 captives-0.005
 painfully-0.005
 reckoned-0.005
Token Alan
Token ID12246
Feature activation+2.73e-01

Pos logit contributions
hift+0.431
 revol+0.376
 captives+0.355
 painfully+0.340
 reckoned+0.337
Neg logit contributions
 Niet-0.561
ById-0.492
ashington-0.489
otle-0.482
plet-0.480
Token Moore
Token ID8877
Feature activation-4.40e-01

Pos logit contributions
hift+1.426
 revol+1.270
 captives+1.212
 painfully+1.177
 reckoned+1.175
Neg logit contributions
 Niet-1.261
ById-1.076
ashington-1.071
otle-1.051
plet-1.035
Tokenhead
Token ID2256
Feature activation-5.41e-02

Pos logit contributions
 Niet+2.315
ById+1.987
ashington+1.984
otle+1.967
plet+1.950
Neg logit contributions
hift-2.029
 revol-1.755
 captives-1.665
 painfully-1.607
Sense-1.607
Token
Token ID447
Feature activation-1.72e-01

Pos logit contributions
 Niet+0.288
ById+0.250
ashington+0.249
otle+0.246
plet+0.244
Neg logit contributions
hift-0.246
 revol-0.215
 captives-0.203
 painfully-0.197
Sense-0.195
Token
Token ID247
Feature activation-6.36e-02

Pos logit contributions
 Niet+0.979
ById+0.855
ashington+0.853
otle+0.845
plet+0.837
Neg logit contributions
hift-0.719
 revol-0.617
 captives-0.580
 painfully-0.560
Sense-0.555
Token and
Token ID290
Feature activation-5.80e-02

Pos logit contributions
hift+0.198
 revol+0.174
 captives+0.163
 painfully+0.159
 reckoned+0.156
Neg logit contributions
 Niet-0.243
ashington-0.211
ById-0.210
otle-0.207
plet-0.205
Token I
Token ID314
Feature activation+3.74e-01

Pos logit contributions
 Niet+0.306
ashington+0.265
ById+0.264
otle+0.260
plet+0.258
Neg logit contributions
hift-0.266
 revol-0.234
 captives-0.220
 painfully-0.214
Sense-0.212
Token don
Token ID836
Feature activation-2.43e-01

Pos logit contributions
hift+1.633
 revol+1.447
 captives+1.335
 reckoned+1.323
 painfully+1.317
Neg logit contributions
 Niet-2.056
ById-1.770
ashington-1.769
plet-1.751
otle-1.740
Token
Token ID447
Feature activation+9.56e-02

Pos logit contributions
 Niet+1.275
ashington+1.081
ById+1.079
plet+1.070
otle+1.068
Neg logit contributions
hift-1.137
 revol-0.985
 captives-0.929
Sense-0.914
 painfully-0.907
Token
Token ID247
Feature activation-3.12e-02

Pos logit contributions
hift+0.398
 revol+0.339
 captives+0.316
Sense+0.306
 painfully+0.306
Neg logit contributions
 Niet-0.562
ashington-0.479
plet-0.473
otle-0.468
ById-0.468
Tokent
Token ID83
Feature activation+3.15e-04

Pos logit contributions
 Niet+0.137
ById+0.115
ashington+0.114
otle+0.112
plet+0.110
Neg logit contributions
hift-0.171
 revol-0.153
 captives-0.145
Sense-0.142
 painfully-0.142
Token want
Token ID765
Feature activation-4.45e-02

Pos logit contributions
hift+0.001
 revol+0.001
 captives+0.001
 painfully+0.001
 reckoned+0.001
Neg logit contributions
 Niet-0.002
ById-0.001
ashington-0.001
otle-0.001
plet-0.001
Token to
Token ID284
Feature activation-3.24e-01

Pos logit contributions
 Niet+0.242
ById+0.210
ashington+0.210
otle+0.207
plet+0.205
Neg logit contributions
hift-0.198
 revol-0.172
 captives-0.162
 painfully-0.157
Sense-0.156
Token feel
Token ID1254
Feature activation+2.70e-04

Pos logit contributions
 Niet+1.544
ById+1.310
ashington+1.308
otle+1.291
plet+1.278
Neg logit contributions
hift-1.654
 revol-1.462
 captives-1.392
 painfully-1.353
Sense-1.349
Token like
Token ID588
Feature activation-5.10e-01

Pos logit contributions
hift+0.001
 revol+0.001
 captives+0.001
 painfully+0.001
Sense+0.001
Neg logit contributions
 Niet-0.002
ById-0.001
ashington-0.001
otle-0.001
plet-0.001
Token I
Token ID314
Feature activation-4.68e-01

Pos logit contributions
 Niet+2.504
ById+2.124
ashington+2.112
otle+2.101
plet+2.077
Neg logit contributions
hift-2.541
 revol-2.225
 captives-2.120
 painfully-2.055
Sense-2.055
Token ballot
Token ID11100
Feature activation+1.36e-01

Pos logit contributions
hift+1.040
 revol+0.942
 captives+0.900
 painfully+0.880
Sense+0.877
Neg logit contributions
 Niet-0.702
ById-0.582
ashington-0.581
otle-0.566
plet-0.566
Token because
Token ID780
Feature activation-2.68e-01

Pos logit contributions
hift+0.598
 revol+0.522
 captives+0.489
 painfully+0.475
Sense+0.470
Neg logit contributions
 Niet-0.747
ById-0.652
ashington-0.646
otle-0.639
plet-0.634
Token the
Token ID262
Feature activation+1.19e-01

Pos logit contributions
 Niet+1.407
ById+1.211
ashington+1.205
otle+1.194
plet+1.180
Neg logit contributions
hift-1.232
 revol-1.073
 captives-1.021
Sense-0.988
 painfully-0.984
Token doctrine
Token ID13115
Feature activation+3.66e-01

Pos logit contributions
hift+0.589
 revol+0.519
 captives+0.493
 painfully+0.481
Sense+0.475
Neg logit contributions
 Niet-0.583
ashington-0.503
ById-0.502
otle-0.495
plet-0.491
Token of
Token ID286
Feature activation-3.28e-02

Pos logit contributions
hift+1.649
 revol+1.453
 captives+1.361
 painfully+1.328
 reckoned+1.308
Neg logit contributions
 Niet-1.974
ashington-1.705
ById-1.697
plet-1.686
otle-1.667
Token "
Token ID366
Feature activation+9.45e-02

Pos logit contributions
 Niet+0.169
ById+0.145
ashington+0.145
otle+0.143
plet+0.142
Neg logit contributions
hift-0.155
 revol-0.136
 captives-0.129
 painfully-0.125
Sense-0.124
TokenE
Token ID36
Feature activation+5.09e-01

Pos logit contributions
hift+0.436
 revol+0.379
 captives+0.359
 painfully+0.348
Sense+0.346
Neg logit contributions
 Niet-0.499
ById-0.430
ashington-0.429
otle-0.424
plet-0.420
Tokenstop
Token ID11338
Feature activation-1.45e-02

Pos logit contributions
hift+2.010
 revol+1.719
 captives+1.585
Sense+1.537
 reckoned+1.532
Neg logit contributions
 Niet-2.994
ById-2.630
ashington-2.611
otle-2.602
plet-2.582
Tokenel
Token ID417
Feature activation-1.97e-01

Pos logit contributions
 Niet+0.078
ById+0.068
ashington+0.067
otle+0.067
plet+0.066
Neg logit contributions
hift-0.065
 revol-0.057
 captives-0.054
 painfully-0.052
Sense-0.052
Token"
Token ID1
Feature activation-6.01e-02

Pos logit contributions
 Niet+1.095
ById+0.953
ashington+0.951
otle+0.940
plet+0.932
Neg logit contributions
hift-0.850
 revol-0.734
 captives-0.690
 painfully-0.668
Sense-0.665
Token did
Token ID750
Feature activation-6.98e-01

Pos logit contributions
 Niet+0.312
ashington+0.269
ById+0.269
otle+0.265
plet+0.265
Neg logit contributions
hift-0.282
 revol-0.248
 captives-0.234
 painfully-0.228
 reckoned-0.225
Token are
Token ID389
Feature activation+3.22e-01

Pos logit contributions
hift+0.262
 revol+0.230
 captives+0.216
 painfully+0.210
Sense+0.208
Neg logit contributions
 Niet-0.301
ById-0.259
ashington-0.259
otle-0.255
plet-0.253
Token odd
Token ID5629
Feature activation-1.51e-01

Pos logit contributions
hift+1.331
 revol+1.159
 captives+1.080
 painfully+1.053
 reckoned+1.029
Neg logit contributions
 Niet-1.845
ashington-1.620
ById-1.616
otle-1.594
plet-1.585
Token echoes
Token ID30346
Feature activation+7.50e-02

Pos logit contributions
 Niet+0.835
ashington+0.727
ById+0.726
otle+0.717
plet+0.711
Neg logit contributions
hift-0.657
 revol-0.570
 captives-0.535
 painfully-0.518
Sense-0.512
Token of
Token ID286
Feature activation-4.25e-01

Pos logit contributions
hift+0.335
 revol+0.293
 captives+0.274
 painfully+0.268
 reckoned+0.264
Neg logit contributions
 Niet-0.408
ashington-0.353
ById-0.352
otle-0.348
plet-0.344
Token M
Token ID337
Feature activation-1.36e-01

Pos logit contributions
 Niet+2.266
ById+1.942
ashington+1.931
otle+1.922
plet+1.900
Neg logit contributions
hift-1.945
 revol-1.675
 captives-1.588
 painfully-1.534
Sense-1.531
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
 Niet+0.722
ashington+0.626
ById+0.625
otle+0.621
plet+0.613
Neg logit contributions
hift-0.615
 revol-0.535
 captives-0.505
 painfully-0.488
Sense-0.485
Token Night
Token ID5265
Feature activation-4.89e-01

Pos logit contributions
 Niet+5.809
otle+4.807
ashington+4.799
ById+4.758
/​+4.652
Neg logit contributions
hift-6.405
 revol-5.556
 captives-5.346
 reckoned-5.174
 painfully-5.167
Token Shy
Token ID36838
Feature activation-5.37e-01

Pos logit contributions
 Niet+2.248
ashington+1.892
ById+1.881
otle+1.859
plet+1.837
Neg logit contributions
hift-2.585
 revol-2.289
 captives-2.185
 painfully-2.128
 reckoned-2.112
Tokenam
Token ID321
Feature activation-1.41e-01

Pos logit contributions
 Niet+2.475
ById+2.090
ashington+2.089
otle+2.071
plet+2.035
Neg logit contributions
hift-2.829
 revol-2.494
 captives-2.396
 reckoned-2.299
 painfully-2.298
Tokenalan
Token ID25786
Feature activation-1.50e-01

Pos logit contributions
 Niet+0.904
ById+0.812
ashington+0.806
otle+0.787
SourceFile+0.785
Neg logit contributions
hift-0.487
 revol-0.414
 captives-0.377
 painfully-0.366
Sense-0.357
Token
Token ID447
Feature activation-3.84e-01

Pos logit contributions
 Niet+0.767
ashington+0.658
ById+0.658
otle+0.649
plet+0.643
Neg logit contributions
hift-0.713
 revol-0.627
 captives-0.593
 painfully-0.577
Sense-0.571
Token."
Token ID526
Feature activation-2.07e-01

Pos logit contributions
hift+0.412
 revol+0.359
 captives+0.340
 painfully+0.330
Sense+0.325
Neg logit contributions
 Niet-0.486
ById-0.421
ashington-0.419
otle-0.413
plet-0.410
Token
Token ID198
Feature activation-1.54e-01

Pos logit contributions
 Niet+0.976
ById+0.828
ashington+0.825
otle+0.815
plet+0.810
Neg logit contributions
hift-1.072
 revol-0.948
 captives-0.904
 painfully-0.878
Sense-0.877
Token
Token ID198
Feature activation-1.14e-01

Pos logit contributions
 Niet+0.749
ById+0.626
ashington+0.624
plet+0.618
otle+0.614
Neg logit contributions
hift-0.785
 revol-0.686
 captives-0.656
Sense-0.645
 painfully-0.642
TokenThese
Token ID4711
Feature activation-2.54e-01

Pos logit contributions
 Niet+0.577
ById+0.491
ashington+0.490
otle+0.486
plet+0.482
Neg logit contributions
hift-0.553
 revol-0.483
 captives-0.460
Sense-0.448
 painfully-0.446
Token sc
Token ID629
Feature activation-1.62e-01

Pos logit contributions
 Niet+1.299
ById+1.115
ashington+1.113
otle+1.101
plet+1.091
Neg logit contributions
hift-1.215
 revol-1.068
 captives-1.012
 painfully-0.981
Sense-0.970
Tokenammers
Token ID36846
Feature activation-8.63e-01

Pos logit contributions
 Niet+0.665
ashington+0.548
ById+0.545
otle+0.539
plet+0.537
Neg logit contributions
hift-0.932
 revol-0.834
 captives-0.801
 painfully-0.783
Sense-0.779
Token tend
Token ID4327
Feature activation-5.16e-01

Pos logit contributions
 Niet+4.182
ashington+3.586
ById+3.563
otle+3.543
plet+3.488
Neg logit contributions
hift-4.259
 revol-3.739
 captives-3.558
Sense-3.495
 painfully-3.440
Token to
Token ID284
Feature activation-2.45e-01

Pos logit contributions
 Niet+2.582
ashington+2.229
ById+2.214
otle+2.206
plet+2.177
Neg logit contributions
hift-2.472
 revol-2.157
 captives-2.044
Sense-1.990
 reckoned-1.983
Token request
Token ID2581
Feature activation-5.93e-01

Pos logit contributions
 Niet+1.212
ById+1.037
ashington+1.027
otle+1.012
plet+1.001
Neg logit contributions
hift-1.222
 revol-1.079
 captives-1.025
 painfully-1.002
 reckoned-0.984
Token payment
Token ID6074
Feature activation-6.94e-01

Pos logit contributions
 Niet+3.144
ById+2.710
ashington+2.701
otle+2.672
plet+2.657
Neg logit contributions
hift-2.685
 revol-2.322
 captives-2.184
Sense-2.151
 painfully-2.125
Token up
Token ID510
Feature activation+4.60e-01

Pos logit contributions
 Niet+3.620
ById+3.115
ashington+3.110
otle+3.074
plet+3.043
Neg logit contributions
hift-3.234
 revol-2.807
 captives-2.652
Sense-2.577
 painfully-2.567
Token planet
Token ID5440
Feature activation-5.92e-02

Pos logit contributions
 Niet+0.743
ById+0.649
ashington+0.647
otle+0.641
plet+0.635
Neg logit contributions
hift-0.570
 revol-0.493
 captives-0.463
 painfully-0.449
Sense-0.443
Token,
Token ID11
Feature activation-7.23e-02

Pos logit contributions
 Niet+0.318
ById+0.275
ashington+0.275
otle+0.272
plet+0.269
Neg logit contributions
hift-0.267
 revol-0.233
 captives-0.219
 painfully-0.213
Sense-0.211
Token no
Token ID645
Feature activation+9.00e-02

Pos logit contributions
 Niet+0.370
ById+0.317
ashington+0.316
otle+0.313
plet+0.310
Neg logit contributions
hift-0.344
 revol-0.303
 captives-0.287
 painfully-0.279
Sense-0.278
Token matter
Token ID2300
Feature activation+1.72e-01

Pos logit contributions
hift+0.341
 revol+0.290
 captives+0.270
 painfully+0.259
Sense+0.255
Neg logit contributions
 Niet-0.549
ashington-0.485
ById-0.484
otle-0.481
plet-0.474
Token what
Token ID644
Feature activation+4.43e-01

Pos logit contributions
hift+0.755
 revol+0.663
 captives+0.624
 painfully+0.607
 reckoned+0.600
Neg logit contributions
 Niet-0.935
ById-0.818
ashington-0.817
otle-0.807
plet-0.796
Token our
Token ID674
Feature activation-9.35e-01

Pos logit contributions
hift+1.889
 revol+1.683
 captives+1.572
 painfully+1.540
 reckoned+1.523
Neg logit contributions
 Niet-2.423
ById-2.171
ashington-2.155
otle-2.125
SourceFile-2.108
Token backgrounds
Token ID19063
Feature activation-7.00e-01

Pos logit contributions
 Niet+5.055
ashington+4.322
ById+4.321
otle+4.224
/​+4.220
Neg logit contributions
hift-4.172
 revol-3.497
 captives-3.303
Sense-3.296
 reckoned-3.184
Token may
Token ID743
Feature activation-3.06e-01

Pos logit contributions
 Niet+3.357
ashington+2.852
ById+2.837
otle+2.835
/​+2.782
Neg logit contributions
hift-3.492
 revol-3.069
 captives-2.933
Sense-2.859
 painfully-2.842
Token be
Token ID307
Feature activation-1.72e-01

Pos logit contributions
 Niet+1.504
ashington+1.284
ById+1.282
otle+1.271
plet+1.255
Neg logit contributions
hift-1.515
 revol-1.330
 captives-1.264
Sense-1.227
 painfully-1.226
Token,
Token ID11
Feature activation-2.75e-01

Pos logit contributions
 Niet+0.929
ById+0.805
ashington+0.803
otle+0.794
plet+0.787
Neg logit contributions
hift-0.770
 revol-0.669
 captives-0.632
 painfully-0.612
Sense-0.607
Token
Token ID447
Feature activation+9.88e-02

Pos logit contributions
 Niet+1.396
ById+1.198
ashington+1.195
otle+1.180
plet+1.169
Neg logit contributions
hift-1.323
 revol-1.161
 captives-1.101
 painfully-1.069
Sense-1.062
Tokenless
Token ID1203
Feature activation-2.96e-01

Pos logit contributions
hift+0.071
 revol+0.055
 captives+0.049
 painfully+0.046
Sense+0.045
Neg logit contributions
 Niet-0.209
ById-0.188
ashington-0.187
otle-0.186
plet-0.185
Token (@
Token ID4275
Feature activation+3.98e-01

Pos logit contributions
 Niet+1.364
ById+1.151
ashington+1.147
otle+1.132
plet+1.118
Neg logit contributions
hift-1.569
 revol-1.394
 captives-1.328
 painfully-1.297
Sense-1.286
TokenReal
Token ID15633
Feature activation-3.03e-01

Pos logit contributions
hift+2.014
 revol+1.736
 captives+1.668
Sense+1.646
 painfully+1.629
Neg logit contributions
 Niet-1.933
ById-1.627
otle-1.605
ashington-1.601
plet-1.576
TokenSkip
Token ID50232
Feature activation-9.65e-01

Pos logit contributions
 Niet+1.360
ById+1.145
ashington+1.144
otle+1.124
plet+1.113
Neg logit contributions
hift-1.627
 revol-1.452
 captives-1.385
 painfully-1.350
Sense-1.338
TokenBay
Token ID15262
Feature activation-4.37e-01

Pos logit contributions
 Niet+5.138
ashington+4.625
ById+4.548
otle+4.503
plet+4.448
Neg logit contributions
hift-4.179
 revol-3.719
 captives-3.492
 painfully-3.338
 reckoned-3.284
Tokenless
Token ID1203
Feature activation-1.01e+00

Pos logit contributions
 Niet+3.128
ById+2.822
ashington+2.820
otle+2.792
plet+2.775
Neg logit contributions
hift-1.182
 revol-0.927
 captives-0.830
 painfully-0.779
Sense-0.761
Token)
Token ID8
Feature activation+3.54e-01

Pos logit contributions
 Niet+4.059
ashington+3.402
ById+3.367
otle+3.315
plet+3.293
Neg logit contributions
hift-5.726
 revol-5.210
 captives-5.026
 painfully-4.863
 reckoned-4.819
Token January
Token ID3269
Feature activation+3.60e-01

Pos logit contributions
hift+1.972
 revol+1.783
 captives+1.704
 painfully+1.681
 reckoned+1.658
Neg logit contributions
 Niet-1.492
ById-1.256
ashington-1.237
otle-1.234
plet-1.229
Token 16
Token ID1467
Feature activation+4.54e-01

Pos logit contributions
hift+1.500
 revol+1.281
 captives+1.195
Sense+1.173
 painfully+1.166
Neg logit contributions
 Niet-2.030
ashington-1.802
ById-1.793
plet-1.779
otle-1.759
Token,
Token ID11
Feature activation+7.70e-01

Pos logit contributions
hift+1.774
 revol+1.463
 captives+1.396
Sense+1.391
 painfully+1.352
Neg logit contributions
 Niet-2.722
plet-2.332
ashington-2.310
ById-2.309
otle-2.295
Token 2017
Token ID2177
Feature activation+4.36e-01

Pos logit contributions
hift+3.540
 revol+3.052
 captives+2.898
Sense+2.861
 Syri+2.839
Neg logit contributions
 Niet-3.913
plet-3.420
ById-3.359
icro-3.339
sterdam-3.317
Token it
Token ID340
Feature activation+1.62e-01

Pos logit contributions
 Niet+0.469
ById+0.405
ashington+0.404
otle+0.400
plet+0.396
Neg logit contributions
hift-0.394
 revol-0.342
 captives-0.323
 painfully-0.313
Sense-0.311
Token,
Token ID11
Feature activation+3.00e-03

Pos logit contributions
hift+0.717
 revol+0.624
 captives+0.585
 painfully+0.573
 reckoned+0.562
Neg logit contributions
 Niet-0.885
ashington-0.767
ById-0.764
otle-0.754
plet-0.752
Token embrace
Token ID12553
Feature activation+3.15e-01

Pos logit contributions
hift+0.014
 revol+0.012
 captives+0.011
 painfully+0.011
Sense+0.011
Neg logit contributions
 Niet-0.016
ashington-0.014
ById-0.014
otle-0.013
plet-0.013
Token it
Token ID340
Feature activation-1.36e-01

Pos logit contributions
hift+1.371
 revol+1.198
 captives+1.124
 painfully+1.097
Sense+1.079
Neg logit contributions
 Niet-1.724
ashington-1.511
ById-1.507
otle-1.485
plet-1.478
Token.
Token ID13
Feature activation-5.72e-02

Pos logit contributions
 Niet+0.720
ById+0.623
ashington+0.620
otle+0.614
plet+0.607
Neg logit contributions
hift-0.625
 revol-0.543
 captives-0.515
 painfully-0.497
Sense-0.496
Token Like
Token ID4525
Feature activation-9.29e-01

Pos logit contributions
 Niet+0.255
ashington+0.216
ById+0.215
plet+0.212
otle+0.210
Neg logit contributions
hift-0.308
 revol-0.272
 captives-0.259
Sense-0.257
 painfully-0.253
Token you
Token ID345
Feature activation-5.21e-02

Pos logit contributions
 Niet+4.566
otle+3.864
ById+3.852
ashington+3.801
plet+3.772
Neg logit contributions
hift-4.477
 revol-3.882
 captives-3.731
 reckoned-3.594
Sense-3.576
Token can
Token ID460
Feature activation-8.01e-02

Pos logit contributions
 Niet+0.274
ById+0.236
ashington+0.235
otle+0.232
plet+0.231
Neg logit contributions
hift-0.242
 revol-0.212
 captives-0.199
 painfully-0.195
 reckoned-0.193
Token joke
Token ID9707
Feature activation-4.47e-01

Pos logit contributions
 Niet+0.409
ById+0.351
ashington+0.350
otle+0.343
plet+0.341
Neg logit contributions
hift-0.385
 revol-0.338
 captives-0.319
 painfully-0.312
Sense-0.308
Token,
Token ID11
Feature activation-4.07e-01

Pos logit contributions
 Niet+2.200
ById+1.875
ashington+1.866
otle+1.855
plet+1.825
Neg logit contributions
hift-2.210
 revol-1.940
 captives-1.850
 painfully-1.787
Sense-1.785
Token
Token ID564
Feature activation+2.37e-01

Pos logit contributions
 Niet+2.054
ById+1.760
ashington+1.756
otle+1.736
plet+1.718
Neg logit contributions
hift-1.969
 revol-1.727
 captives-1.641
 painfully-1.591
Sense-1.583