TOP ACTIVATIONS
MAX = 18.670

ed , liquid state and held in super - ins ulated
s S ieve . Even negative comments will be welcome
grandfather ing provisions . What happens after that is also important
the phone name . What happens is that initially , the
one connections . Even after watching Gary
the get - go . Even knowing that he
. What would happen if Earth stopped spinning ?
this one . Even a progressive outlet like Mic
economic or environmental benefit . Even K I UC , while
selfish request . Even if the God of this
as and brown boots . Even thought it would cost me
? Even among Muslim scholars the principle
. 2 . However , even I can 't deny how
issues . What was it like watching your team compete without
hanging . But even if they never tell you
six years . Even more seriously , in late
. Even by the deb ased standards
But what about the books I bought
stepped up . Even their much - mal igned
to enable these techniques . Even without them I believe we

MIN ACTIVATIONS
MIN = -21.984

for Republican candidates continues to er ode . O
In years past , we er red when observational data led
ama opened its doors -- er , lot ? -- at
that he had impro per ly touched
for him , he " rem ained at the bottom of
While the source of the er rant command was not known
and its government has a good reputation , the instruments that
radios , when you un plug the external antenna connection ,
speakers if a door cre aks make the noise
Hale Williams , who is better known for his dedication of
capabilities to expand upon and better Link to the Past in
to complain the country is badly run , if we don
reactors , but these were badly damaged during refuel ing in
the Civil War , there may not have been a war
.), who was among two critically injured . Democrats
Owner Sylvia Cas ares was critically injured when she was allegedly
institution this year exc ited about the
rushing touchdowns , respectively . Er rict R hett eventually broke
, and Tony Shal h oub , who won three times
programme " Normally children that brake late go off the track

INTERVAL 8.507 - 18.670
CONTAINS 0.865%

you re talking about . Like when I talk
to the contrary , suspect at best . Dean Baker is
. He shares what he s learned
country that ensures equal and full rights for all its citizens
of a besieged city , even if it were innocent of

INTERVAL -1.657 - 8.507
CONTAINS 69.367%

for foreign engagements as has been suggested by officials .
- 3 power play goal against Pittsburgh , they hadn
in May . We built it entirely
Roh t ak city which continued on 19 , 20 ,
is formed of RT prop + these additional sources of noise

INTERVAL -11.820 - -1.657
CONTAINS 29.686%

that it would miss the Moon by about 9 65 kilometers
of sea turtles around the world have ingested debris ," she
2 . 8 points in Real Clear Politics 's North Carolina
Agreement " over printed on ob verse . In addition ,
, and although it was arguably bet tered by the later

INTERVAL -21.984 - -11.820
CONTAINS 0.083%

Continue reading the main story Please verify you 're not a
<|endoftext|> ONE of France 's best - known mail order catalog
Sheikh , Egypt , and cru ises via A q aba
started looking for a new partner . " It
How A New Generation Is Rem aking America . " O
Tokened
Token ID276
Feature activation+5.80e+00

Pos logit contributions
erness+2.354
independence+2.083
eenth+2.057
aways+2.018
meal+1.958
Neg logit contributions
beh-3.048
bered-3.007
dden-2.998
thora-2.961
icably-2.836
Token,
Token ID11
Feature activation+6.90e+00

Pos logit contributions
erness+4.150
independence+3.708
eenth+3.652
aways+3.608
meal+3.591
Neg logit contributions
beh-4.423
dden-4.421
bered-4.360
thora-4.360
untled-4.133
Token liquid
Token ID8122
Feature activation+6.53e+00

Pos logit contributions
erness+5.066
independence+4.638
eenth+4.535
aways+4.508
meal+4.418
Neg logit contributions
dden-4.987
beh-4.965
thora-4.926
bered-4.891
gged-4.618
Token state
Token ID1181
Feature activation+2.58e+00

Pos logit contributions
erness+4.560
independence+4.108
eenth+4.053
meal+4.005
aways+3.961
Neg logit contributions
beh-5.047
dden-5.026
bered-4.986
thora-4.872
gged-4.663
Token and
Token ID290
Feature activation-4.79e-01

Pos logit contributions
erness+1.638
independence+1.474
eenth+1.447
aways+1.428
meal+1.387
Neg logit contributions
beh-2.207
bered-2.176
dden-2.171
thora-2.157
gged-2.062
Token held
Token ID2714
Feature activation+1.87e+01

Pos logit contributions
beh+0.379
bered+0.374
dden+0.370
thora+0.365
icably+0.352
Neg logit contributions
erness-0.339
independence-0.308
eenth-0.302
aways-0.297
meal-0.288
Token in
Token ID287
Feature activation+6.28e+00

Pos logit contributions
erness+9.994
aways+8.601
away+8.244
meal+8.233
eenth+8.133
Neg logit contributions
thora-15.203
untled-14.459
dden-14.442
beh-14.342
bered-13.804
Token super
Token ID2208
Feature activation+1.56e+00

Pos logit contributions
erness+4.418
independence+3.952
aways+3.931
eenth+3.847
meal+3.718
Neg logit contributions
beh-4.812
thora-4.806
bered-4.794
dden-4.779
gged-4.548
Token-
Token ID12
Feature activation+1.87e+00

Pos logit contributions
erness+0.962
independence+0.868
eenth+0.838
aways+0.828
meal+0.801
Neg logit contributions
beh-1.356
bered-1.346
dden-1.339
thora-1.335
icably-1.275
Tokenins
Token ID1040
Feature activation+5.15e+00

Pos logit contributions
erness+1.295
independence+1.189
eenth+1.145
aways+1.132
meal+1.107
Neg logit contributions
beh-1.489
bered-1.484
thora-1.479
dden-1.473
icably-1.401
Tokenulated
Token ID4817
Feature activation+8.95e+00

Pos logit contributions
erness+3.448
independence+3.128
eenth+3.016
aways+2.983
meal+2.871
Neg logit contributions
beh-4.108
bered-4.104
dden-4.094
thora-4.056
icably-3.873
Token
Token ID247
Feature activation+3.64e+00

Pos logit contributions
erness+2.803
independence+2.570
eenth+2.537
aways+2.484
meal+2.399
Neg logit contributions
beh-2.517
bered-2.469
dden-2.444
thora-2.410
icably-2.315
Tokens
Token ID82
Feature activation+5.30e+00

Pos logit contributions
erness+2.393
independence+2.181
eenth+2.133
aways+2.085
meal+2.007
Neg logit contributions
beh-2.973
bered-2.962
dden-2.943
thora-2.919
icably-2.809
Token S
Token ID311
Feature activation+2.10e+00

Pos logit contributions
erness+3.801
independence+3.465
eenth+3.440
aways+3.336
meal+3.290
Neg logit contributions
bered-3.927
beh-3.923
dden-3.918
thora-3.848
icably-3.731
Tokenieve
Token ID12311
Feature activation-3.59e+00

Pos logit contributions
erness+1.524
independence+1.391
eenth+1.368
aways+1.348
meal+1.321
Neg logit contributions
beh-1.604
bered-1.575
dden-1.573
thora-1.545
icably-1.485
Token.
Token ID13
Feature activation+2.16e+00

Pos logit contributions
beh+2.938
bered+2.858
dden+2.852
thora+2.817
gged+2.707
Neg logit contributions
erness-2.429
independence-2.201
eenth-2.151
aways-2.112
meal-2.041
Token Even
Token ID3412
Feature activation+1.84e+01

Pos logit contributions
erness+1.360
independence+1.228
eenth+1.213
aways+1.180
meal+1.144
Neg logit contributions
beh-1.854
bered-1.850
dden-1.827
thora-1.809
icably-1.741
Token negative
Token ID4633
Feature activation+3.37e+00

Pos logit contributions
erness+9.247
 IF+8.346
independence+8.326
eenth+8.285
aways+8.273
Neg logit contributions
thora-14.311
dden-14.074
bered-13.990
beh-13.876
iddled-13.544
Token comments
Token ID3651
Feature activation+4.53e+00

Pos logit contributions
erness+2.415
independence+2.204
eenth+2.153
aways+2.127
meal+2.069
Neg logit contributions
beh-2.592
bered-2.572
dden-2.570
thora-2.536
icably-2.431
Token will
Token ID481
Feature activation-2.33e-01

Pos logit contributions
erness+3.020
independence+2.739
eenth+2.702
aways+2.660
meal+2.587
Neg logit contributions
beh-3.663
dden-3.617
bered-3.605
thora-3.585
icably-3.427
Token be
Token ID307
Feature activation+5.42e+00

Pos logit contributions
beh+0.152
bered+0.151
dden+0.150
thora+0.146
icably+0.140
Neg logit contributions
erness-0.196
independence-0.181
eenth-0.178
aways-0.175
meal-0.171
Token welcome
Token ID7062
Feature activation+1.10e+01

Pos logit contributions
erness+3.632
independence+3.281
eenth+3.260
aways+3.213
meal+3.096
Neg logit contributions
beh-4.346
dden-4.318
thora-4.296
bered-4.248
untled-4.074
Token grandfather
Token ID17695
Feature activation-3.76e+00

Pos logit contributions
beh+1.592
bered+1.564
dden+1.542
thora+1.514
icably+1.459
Neg logit contributions
erness-1.731
independence-1.575
eenth-1.566
aways-1.525
meal-1.489
Tokening
Token ID278
Feature activation-3.17e+00

Pos logit contributions
beh+3.209
bered+3.160
dden+3.142
thora+3.079
gged+3.002
Neg logit contributions
erness-2.365
independence-2.135
eenth-2.074
aways-2.053
meal-1.992
Token provisions
Token ID8617
Feature activation-2.19e+00

Pos logit contributions
beh+2.188
bered+2.136
dden+2.098
thora+2.061
gged+1.994
Neg logit contributions
erness-2.559
independence-2.329
eenth-2.317
aways-2.243
meal-2.181
Token.
Token ID13
Feature activation+2.85e+00

Pos logit contributions
beh+1.690
bered+1.662
dden+1.649
thora+1.620
gged+1.558
Neg logit contributions
erness-1.593
independence-1.448
eenth-1.417
aways-1.388
meal-1.346
Token What
Token ID1867
Feature activation+1.46e+01

Pos logit contributions
erness+1.700
independence+1.516
eenth+1.500
aways+1.472
meal+1.401
Neg logit contributions
beh-2.537
bered-2.531
dden-2.498
thora-2.490
icably-2.385
Token happens
Token ID4325
Feature activation+1.80e+01

Pos logit contributions
erness+8.497
aways+7.902
independence+7.779
eenth+7.477
meal+7.263
Neg logit contributions
dden-11.479
beh-11.268
thora-11.229
bered-11.159
iddled-10.773
Token after
Token ID706
Feature activation+9.82e+00

Pos logit contributions
erness+7.153
meal+6.858
eenth+6.800
aways+6.702
independence+6.361
Neg logit contributions
thora-16.580
dden-15.763
beh-15.412
iddled-15.396
bered-15.275
Token that
Token ID326
Feature activation+8.95e+00

Pos logit contributions
erness+5.586
independence+5.509
aways+5.197
eenth+5.181
meal+5.032
Neg logit contributions
thora-8.244
dden-8.124
bered-8.040
beh-8.019
gged-7.565
Token is
Token ID318
Feature activation+7.11e+00

Pos logit contributions
erness+4.815
independence+4.435
eenth+4.279
aways+4.267
meal+4.254
Neg logit contributions
dden-8.004
thora-7.942
beh-7.911
bered-7.770
icably-7.511
Token also
Token ID635
Feature activation-1.56e+00

Pos logit contributions
erness+5.358
independence+4.862
eenth+4.779
aways+4.758
meal+4.754
Neg logit contributions
thora-5.031
beh-4.969
dden-4.931
bered-4.834
untled-4.760
Token important
Token ID1593
Feature activation+4.40e-01

Pos logit contributions
beh+1.067
bered+1.054
dden+1.032
thora+1.008
icably+0.975
Neg logit contributions
erness-1.262
independence-1.165
eenth-1.144
aways-1.121
meal-1.089
Token the
Token ID262
Feature activation+2.30e+00

Pos logit contributions
erness+2.664
independence+2.394
eenth+2.353
aways+2.346
meal+2.270
Neg logit contributions
beh-3.391
bered-3.352
dden-3.331
thora-3.325
untled-3.196
Token phone
Token ID3072
Feature activation+2.61e+00

Pos logit contributions
erness+1.662
independence+1.507
eenth+1.481
aways+1.466
meal+1.423
Neg logit contributions
beh-1.763
bered-1.743
dden-1.734
thora-1.714
icably-1.642
Token name
Token ID1438
Feature activation-2.63e+00

Pos logit contributions
erness+1.826
independence+1.663
eenth+1.624
aways+1.598
meal+1.551
Neg logit contributions
beh-2.067
bered-2.035
dden-2.025
thora-2.006
icably-1.932
Token.
Token ID13
Feature activation+2.02e+00

Pos logit contributions
beh+2.090
bered+2.054
dden+2.027
thora+1.981
gged+1.920
Neg logit contributions
erness-1.835
independence-1.676
eenth-1.645
aways-1.610
meal-1.576
Token What
Token ID1867
Feature activation+1.53e+01

Pos logit contributions
erness+1.313
independence+1.183
eenth+1.160
aways+1.140
meal+1.102
Neg logit contributions
beh-1.709
bered-1.691
dden-1.679
thora-1.661
icably-1.596
Token happens
Token ID4325
Feature activation+1.79e+01

Pos logit contributions
erness+7.857
aways+7.546
independence+7.306
eenth+7.053
meal+6.881
Neg logit contributions
bered-12.363
dden-12.235
thora-12.002
beh-11.988
untled-11.757
Token is
Token ID318
Feature activation+8.49e+00

Pos logit contributions
erness+7.120
aways+6.679
eenth+6.618
meal+6.502
independence+6.031
Neg logit contributions
thora-16.332
dden-15.605
bered-15.438
iddled-15.383
beh-15.210
Token that
Token ID326
Feature activation+6.13e+00

Pos logit contributions
erness+4.829
independence+4.268
aways+4.261
eenth+4.224
meal+4.115
Neg logit contributions
thora-7.234
beh-7.205
dden-7.131
bered-7.109
untled-6.883
Token initially
Token ID7317
Feature activation-3.18e-01

Pos logit contributions
erness+3.873
independence+3.482
aways+3.429
eenth+3.411
meal+3.296
Neg logit contributions
beh-5.046
dden-5.009
bered-5.004
thora-4.966
untled-4.778
Token,
Token ID11
Feature activation+1.66e+00

Pos logit contributions
beh+0.261
bered+0.257
dden+0.256
thora+0.253
icably+0.243
Neg logit contributions
erness-0.214
independence-0.194
eenth-0.190
aways-0.186
meal-0.181
Token the
Token ID262
Feature activation+8.69e-02

Pos logit contributions
erness+1.122
independence+1.016
eenth+0.996
aways+0.982
meal+0.951
Neg logit contributions
beh-1.354
bered-1.337
dden-1.330
thora-1.314
icably-1.259
Tokenone
Token ID505
Feature activation-1.06e-01

Pos logit contributions
erness+0.945
independence+0.859
aways+0.830
eenth+0.829
meal+0.807
Neg logit contributions
beh-1.163
bered-1.160
dden-1.150
thora-1.145
untled-1.096
Token connections
Token ID8787
Feature activation-1.11e+00

Pos logit contributions
beh+0.081
bered+0.080
dden+0.079
thora+0.078
icably+0.075
Neg logit contributions
erness-0.077
independence-0.070
eenth-0.069
aways-0.068
meal-0.066
Token.
Token ID13
Feature activation+1.92e+00

Pos logit contributions
beh+0.926
bered+0.913
dden+0.905
thora+0.892
icably+0.862
Neg logit contributions
erness-0.730
independence-0.661
eenth-0.644
aways-0.631
meal-0.609
Token
Token ID198
Feature activation+1.45e+00

Pos logit contributions
erness+1.247
independence+1.123
eenth+1.105
aways+1.085
meal+1.049
Neg logit contributions
beh-1.617
bered-1.603
dden-1.586
thora-1.572
icably-1.509
Token
Token ID198
Feature activation+3.06e+00

Pos logit contributions
erness+0.951
independence+0.857
eenth+0.834
aways+0.816
meal+0.788
Neg logit contributions
beh-1.230
dden-1.206
bered-1.202
thora-1.174
gged-1.140
TokenEven
Token ID6104
Feature activation+1.78e+01

Pos logit contributions
erness+2.035
independence+1.864
eenth+1.798
aways+1.787
meal+1.731
Neg logit contributions
beh-2.500
bered-2.499
dden-2.468
thora-2.457
untled-2.347
Token after
Token ID706
Feature activation+9.34e+00

Pos logit contributions
erness+10.450
aways+9.449
independence+9.416
meal+9.068
eenth+9.003
Neg logit contributions
thora-12.991
bered-12.754
beh-12.652
dden-12.607
iddled-12.036
Token watching
Token ID4964
Feature activation-4.14e+00

Pos logit contributions
erness+6.038
independence+5.579
aways+5.450
eenth+5.245
meal+5.189
Neg logit contributions
bered-7.293
thora-7.282
dden-7.258
beh-7.210
gged-6.736
Token Gary
Token ID10936
Feature activation-6.07e+00

Pos logit contributions
beh+3.021
bered+2.969
dden+2.946
thora+2.884
icably+2.814
Neg logit contributions
erness-3.116
independence-2.859
eenth-2.788
aways-2.725
meal-2.668
Token
Token ID447
Feature activation+4.40e-01

Pos logit contributions
beh+4.483
bered+4.412
dden+4.407
thora+4.270
icably+4.129
Neg logit contributions
erness-4.476
independence-4.095
eenth-3.992
aways-3.892
meal-3.794
Token
Token ID247
Feature activation-2.03e+00

Pos logit contributions
erness+0.379
independence+0.350
aways+0.343
eenth+0.343
meal+0.334
Neg logit contributions
beh-0.280
bered-0.276
dden-0.276
thora-0.271
gged-0.257
Token the
Token ID262
Feature activation+2.85e+00

Pos logit contributions
erness+2.923
independence+2.702
aways+2.660
eenth+2.632
meal+2.523
Neg logit contributions
beh-3.534
bered-3.481
dden-3.463
thora-3.442
untled-3.327
Token get
Token ID651
Feature activation+3.82e+00

Pos logit contributions
erness+2.220
independence+2.043
eenth+2.014
aways+1.996
meal+1.937
Neg logit contributions
beh-2.025
bered-1.990
dden-1.986
thora-1.956
untled-1.874
Token-
Token ID12
Feature activation+7.35e-01

Pos logit contributions
erness+2.048
independence+1.834
aways+1.798
eenth+1.784
meal+1.716
Neg logit contributions
beh-3.590
bered-3.547
dden-3.539
thora-3.519
untled-3.382
Tokengo
Token ID2188
Feature activation+1.51e+00

Pos logit contributions
erness+0.790
independence+0.741
eenth+0.736
aways+0.713
meal+0.702
Neg logit contributions
beh-0.320
bered-0.300
dden-0.294
thora-0.285
gged-0.271
Token.
Token ID13
Feature activation+3.94e+00

Pos logit contributions
erness+0.934
independence+0.836
eenth+0.820
aways+0.807
meal+0.778
Neg logit contributions
beh-1.323
bered-1.309
dden-1.302
thora-1.286
icably-1.240
Token Even
Token ID3412
Feature activation+1.77e+01

Pos logit contributions
erness+2.556
eenth+2.295
independence+2.283
aways+2.251
meal+2.157
Neg logit contributions
beh-3.295
bered-3.290
dden-3.231
thora-3.207
untled-3.088
Token knowing
Token ID6970
Feature activation+6.23e-02

Pos logit contributions
erness+9.712
aways+9.089
eenth+8.700
meal+8.649
 IF+8.592
Neg logit contributions
beh-12.855
thora-12.805
bered-12.750
dden-12.520
untled-12.427
Token that
Token ID326
Feature activation+2.38e+00

Pos logit contributions
erness+0.044
independence+0.040
eenth+0.040
aways+0.039
meal+0.038
Neg logit contributions
beh-0.049
bered-0.048
dden-0.048
thora-0.047
icably-0.045
Token he
Token ID339
Feature activation-1.78e-01

Pos logit contributions
erness+1.640
independence+1.485
eenth+1.456
aways+1.443
meal+1.396
Neg logit contributions
beh-1.908
bered-1.881
dden-1.873
thora-1.847
gged-1.771
Token
Token ID447
Feature activation-1.52e+00

Pos logit contributions
beh+0.148
bered+0.146
dden+0.145
thora+0.143
icably+0.138
Neg logit contributions
erness-0.118
independence-0.107
eenth-0.105
aways-0.103
meal-0.100
Token
Token ID247
Feature activation+4.99e-01

Pos logit contributions
beh+0.919
dden+0.905
bered+0.898
thora+0.879
gged+0.844
Neg logit contributions
erness-1.363
independence-1.255
aways-1.241
eenth-1.229
meal-1.207
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
erness+0.284
independence+0.255
eenth+0.249
aways+0.244
meal+0.235
Neg logit contributions
beh-0.409
bered-0.404
dden-0.400
thora-0.395
icably-0.382
Token
Token ID198
Feature activation-1.62e+00

Pos logit contributions
beh+0.877
dden+0.859
bered+0.854
thora+0.843
icably+0.814
Neg logit contributions
erness-0.746
independence-0.678
eenth-0.657
aways-0.642
meal-0.631
Token
Token ID198
Feature activation-1.10e+00

Pos logit contributions
beh+1.286
dden+1.257
bered+1.234
thora+1.194
gged+1.174
Neg logit contributions
erness-1.145
independence-1.042
eenth-1.009
aways-0.982
liga-0.958
TokenWhat
Token ID2061
Feature activation+1.05e+01

Pos logit contributions
beh+0.893
bered+0.872
dden+0.871
thora+0.852
icably+0.824
Neg logit contributions
erness-0.762
independence-0.685
eenth-0.679
aways-0.660
meal-0.643
Token would
Token ID561
Feature activation+8.74e+00

Pos logit contributions
erness+6.383
aways+5.826
independence+5.786
eenth+5.515
meal+5.395
Neg logit contributions
beh-8.667
dden-8.629
bered-8.626
thora-8.390
untled-8.228
Token happen
Token ID1645
Feature activation+1.76e+01

Pos logit contributions
erness+5.432
independence+4.989
eenth+4.760
aways+4.740
meal+4.545
Neg logit contributions
beh-7.054
bered-6.987
thora-6.928
dden-6.845
untled-6.694
Token if
Token ID611
Feature activation+3.67e+00

Pos logit contributions
erness+6.151
eenth+5.491
aways+5.408
meal+5.363
 IF+5.151
Neg logit contributions
thora-17.080
dden-16.607
bered-16.312
beh-16.262
untled-16.086
Token Earth
Token ID3668
Feature activation+1.14e+00

Pos logit contributions
erness+2.501
independence+2.276
aways+2.214
eenth+2.211
meal+2.122
Neg logit contributions
beh-2.938
dden-2.898
bered-2.895
thora-2.863
untled-2.744
Token stopped
Token ID5025
Feature activation-6.07e+00

Pos logit contributions
erness+0.800
independence+0.727
eenth+0.712
aways+0.701
meal+0.680
Neg logit contributions
beh-0.906
bered-0.893
dden-0.887
thora-0.875
icably-0.841
Token spinning
Token ID19493
Feature activation-3.11e+00

Pos logit contributions
beh+4.407
bered+4.258
dden+4.166
icably+4.143
thora+4.026
Neg logit contributions
erness-4.685
independence-4.185
eenth-4.112
aways-3.950
liga-3.854
Token?
Token ID30
Feature activation+1.56e+00

Pos logit contributions
beh+2.755
bered+2.727
dden+2.662
thora+2.610
icably+2.604
Neg logit contributions
erness-1.891
independence-1.676
eenth-1.607
aways-1.537
meal-1.511
Token this
Token ID428
Feature activation-6.21e+00

Pos logit contributions
erness+0.140
independence+0.127
eenth+0.125
aways+0.123
meal+0.119
Neg logit contributions
beh-0.169
bered-0.166
dden-0.165
thora-0.163
icably-0.157
Token one
Token ID530
Feature activation-3.32e+00

Pos logit contributions
beh+4.096
bered+4.042
dden+3.961
thora+3.905
icably+3.712
Neg logit contributions
erness-5.079
independence-4.669
eenth-4.600
aways-4.491
meal-4.327
Token.
Token ID13
Feature activation+1.66e+00

Pos logit contributions
beh+2.802
bered+2.750
dden+2.726
thora+2.688
gged+2.595
Neg logit contributions
erness-2.162
independence-1.942
eenth-1.875
aways-1.830
meal-1.758
Token
Token ID198
Feature activation-1.07e-01

Pos logit contributions
erness+1.037
independence+0.929
eenth+0.913
aways+0.895
meal+0.863
Neg logit contributions
beh-1.450
bered-1.434
dden-1.423
thora-1.409
icably-1.357
Token
Token ID198
Feature activation+5.04e-01

Pos logit contributions
beh+0.088
dden+0.086
bered+0.085
thora+0.083
gged+0.081
Neg logit contributions
erness-0.073
independence-0.066
eenth-0.065
aways-0.063
meal-0.061
TokenEven
Token ID6104
Feature activation+1.76e+01

Pos logit contributions
erness+0.347
independence+0.314
eenth+0.308
aways+0.303
meal+0.294
Neg logit contributions
beh-0.408
bered-0.401
dden-0.399
thora-0.393
icably-0.378
Token a
Token ID257
Feature activation+4.91e+00

Pos logit contributions
erness+10.566
aways+9.394
independence+9.350
meal+9.098
eenth+9.090
Neg logit contributions
thora-12.722
beh-12.584
dden-12.411
bered-12.257
untled-11.781
Token progressive
Token ID10393
Feature activation-3.14e-01

Pos logit contributions
erness+3.595
independence+3.254
eenth+3.203
aways+3.142
meal+3.098
Neg logit contributions
beh-3.645
dden-3.593
bered-3.560
thora-3.545
icably-3.396
Token outlet
Token ID16615
Feature activation+8.18e+00

Pos logit contributions
beh+0.240
bered+0.237
dden+0.235
thora+0.231
icably+0.221
Neg logit contributions
erness-0.229
independence-0.210
eenth-0.206
aways-0.203
meal-0.197
Token like
Token ID588
Feature activation+8.57e+00

Pos logit contributions
erness+4.578
independence+4.156
eenth+4.032
meal+4.025
aways+3.986
Neg logit contributions
beh-7.212
dden-7.181
thora-7.137
bered-6.998
untled-6.742
Token Mic
Token ID7631
Feature activation-5.15e-01

Pos logit contributions
erness+5.689
independence+5.128
eenth+5.038
aways+5.017
meal+4.810
Neg logit contributions
beh-6.684
bered-6.625
dden-6.601
thora-6.570
untled-6.280
Token economic
Token ID3034
Feature activation-7.00e+00

Pos logit contributions
erness+1.074
independence+0.980
eenth+0.964
aways+0.946
meal+0.919
Neg logit contributions
beh-1.179
bered-1.163
dden-1.162
thora-1.142
icably-1.094
Token or
Token ID393
Feature activation+2.18e+00

Pos logit contributions
beh+4.544
bered+4.415
thora+4.176
dden+4.135
edy+4.102
Neg logit contributions
erness-5.742
independence-5.142
eenth-5.058
aways-4.995
meal-4.941
Token environmental
Token ID6142
Feature activation+2.41e+00

Pos logit contributions
erness+1.424
independence+1.296
eenth+1.273
aways+1.239
meal+1.207
Neg logit contributions
beh-1.826
bered-1.808
dden-1.806
thora-1.776
icably-1.705
Token benefit
Token ID4414
Feature activation-1.54e+00

Pos logit contributions
erness+1.723
independence+1.584
eenth+1.545
aways+1.525
meal+1.480
Neg logit contributions
beh-1.854
dden-1.840
bered-1.832
thora-1.806
gged-1.729
Token.
Token ID13
Feature activation+4.33e+00

Pos logit contributions
beh+1.322
bered+1.306
dden+1.294
thora+1.276
icably+1.231
Neg logit contributions
erness-0.977
independence-0.876
eenth-0.857
aways-0.838
meal-0.811
Token Even
Token ID3412
Feature activation+1.72e+01

Pos logit contributions
erness+2.794
independence+2.536
eenth+2.510
aways+2.447
meal+2.359
Neg logit contributions
beh-3.632
bered-3.624
dden-3.578
thora-3.549
icably-3.384
Token K
Token ID509
Feature activation+1.25e+00

Pos logit contributions
erness+9.894
independence+9.213
aways+8.923
eenth+8.806
meal+8.795
Neg logit contributions
beh-12.860
thora-12.828
dden-12.771
bered-12.621
iddled-12.060
TokenI
Token ID40
Feature activation-4.65e-01

Pos logit contributions
erness+0.902
independence+0.823
eenth+0.806
aways+0.794
meal+0.772
Neg logit contributions
beh-0.968
bered-0.956
dden-0.949
thora-0.936
icably-0.897
TokenUC
Token ID9598
Feature activation+2.54e+00

Pos logit contributions
beh+0.392
bered+0.385
dden+0.383
thora+0.377
icably+0.364
Neg logit contributions
erness-0.305
independence-0.274
eenth-0.269
aways-0.263
meal-0.255
Token,
Token ID11
Feature activation+7.07e+00

Pos logit contributions
erness+1.702
independence+1.547
eenth+1.510
aways+1.483
meal+1.438
Neg logit contributions
beh-2.085
bered-2.061
dden-2.049
thora-2.028
icably-1.943
Token while
Token ID981
Feature activation+6.48e+00

Pos logit contributions
erness+4.427
independence+4.147
eenth+3.975
aways+3.885
meal+3.795
Neg logit contributions
bered-5.791
beh-5.785
dden-5.735
thora-5.669
gged-5.334
Token selfish
Token ID20363
Feature activation-6.37e-01

Pos logit contributions
erness+3.933
independence+3.567
eenth+3.523
aways+3.416
meal+3.388
Neg logit contributions
beh-3.811
bered-3.783
dden-3.769
thora-3.718
untled-3.610
Token request
Token ID2581
Feature activation+5.46e+00

Pos logit contributions
beh+0.462
bered+0.452
dden+0.451
thora+0.443
icably+0.424
Neg logit contributions
erness-0.490
independence-0.451
eenth-0.442
aways-0.437
meal-0.423
Token.
Token ID13
Feature activation+4.63e+00

Pos logit contributions
erness+3.103
independence+2.755
eenth+2.737
aways+2.656
meal+2.599
Neg logit contributions
beh-4.890
bered-4.881
thora-4.820
dden-4.816
icably-4.654
Token
Token ID198
Feature activation+3.77e+00

Pos logit contributions
erness+2.751
independence+2.464
eenth+2.434
aways+2.367
meal+2.295
Neg logit contributions
bered-4.105
beh-4.090
dden-4.027
thora-4.016
icably-3.880
Token
Token ID198
Feature activation+2.41e+00

Pos logit contributions
erness+2.254
independence+2.014
eenth+1.970
aways+1.932
meal+1.868
Neg logit contributions
beh-3.365
bered-3.335
dden-3.308
thora-3.281
icably-3.160
TokenEven
Token ID6104
Feature activation+1.70e+01

Pos logit contributions
erness+1.532
independence+1.386
eenth+1.349
aways+1.327
meal+1.285
Neg logit contributions
beh-2.062
bered-2.048
dden-2.029
thora-2.012
icably-1.935
Token if
Token ID611
Feature activation+8.18e-01

Pos logit contributions
erness+9.059
independence+8.016
eenth+7.992
aways+7.860
meal+7.843
Neg logit contributions
thora-13.467
bered-13.231
beh-13.133
dden-13.009
untled-12.748
Token the
Token ID262
Feature activation+1.99e+00

Pos logit contributions
erness+0.567
independence+0.515
eenth+0.504
aways+0.496
meal+0.482
Neg logit contributions
beh-0.656
bered-0.647
dden-0.643
thora-0.635
icably-0.609
Token God
Token ID1793
Feature activation-4.33e+00

Pos logit contributions
erness+1.485
independence+1.357
eenth+1.333
aways+1.311
meal+1.277
Neg logit contributions
beh-1.487
bered-1.470
dden-1.462
thora-1.441
icably-1.378
Token of
Token ID286
Feature activation-6.36e-01

Pos logit contributions
beh+3.372
bered+3.295
dden+3.251
thora+3.194
gged+3.103
Neg logit contributions
erness-3.092
independence-2.821
eenth-2.761
aways-2.677
meal-2.604
Token this
Token ID428
Feature activation+6.47e-01

Pos logit contributions
beh+0.483
bered+0.473
dden+0.471
thora+0.465
icably+0.445
Neg logit contributions
erness-0.469
independence-0.428
eenth-0.420
aways-0.415
meal-0.403
Tokenas
Token ID292
Feature activation-9.25e-01

Pos logit contributions
erness+4.256
independence+3.871
eenth+3.752
aways+3.740
meal+3.720
Neg logit contributions
beh-4.643
dden-4.590
bered-4.575
thora-4.566
untled-4.361
Token and
Token ID290
Feature activation+2.21e-01

Pos logit contributions
beh+0.786
bered+0.775
dden+0.770
thora+0.760
icably+0.732
Neg logit contributions
erness-0.597
independence-0.539
eenth-0.527
aways-0.517
meal-0.500
Token brown
Token ID7586
Feature activation+3.55e+00

Pos logit contributions
erness+0.155
independence+0.141
eenth+0.138
aways+0.135
meal+0.132
Neg logit contributions
beh-0.176
bered-0.173
dden-0.172
thora-0.170
icably-0.164
Token boots
Token ID14412
Feature activation+4.54e-01

Pos logit contributions
erness+2.505
independence+2.260
eenth+2.232
aways+2.194
meal+2.147
Neg logit contributions
beh-2.774
dden-2.757
bered-2.748
thora-2.710
icably-2.606
Token.
Token ID13
Feature activation+1.72e+00

Pos logit contributions
erness+0.302
independence+0.273
eenth+0.267
aways+0.262
meal+0.254
Neg logit contributions
beh-0.378
bered-0.373
dden-0.371
thora-0.366
icably-0.352
Token Even
Token ID3412
Feature activation+1.70e+01

Pos logit contributions
erness+1.121
independence+1.010
eenth+0.991
aways+0.974
meal+0.942
Neg logit contributions
beh-1.452
bered-1.435
dden-1.426
thora-1.406
icably-1.352
Token thought
Token ID1807
Feature activation+3.70e+00

Pos logit contributions
erness+9.689
aways+8.791
meal+8.543
eenth+8.533
independence+8.432
Neg logit contributions
beh-12.581
thora-12.538
bered-12.465
dden-12.460
untled-11.982
Token it
Token ID340
Feature activation+5.31e+00

Pos logit contributions
erness+2.446
independence+2.193
eenth+2.162
aways+2.138
meal+2.088
Neg logit contributions
beh-3.049
bered-3.011
dden-3.003
thora-2.959
untled-2.835
Token would
Token ID561
Feature activation+3.13e-01

Pos logit contributions
erness+3.412
eenth+3.049
independence+3.046
aways+3.003
meal+2.934
Neg logit contributions
beh-4.359
dden-4.332
bered-4.310
thora-4.308
untled-4.152
Token cost
Token ID1575
Feature activation-2.75e+00

Pos logit contributions
erness+0.262
independence+0.242
eenth+0.238
aways+0.235
meal+0.230
Neg logit contributions
beh-0.206
bered-0.203
dden-0.201
thora-0.198
icably-0.188
Token me
Token ID502
Feature activation-8.61e-01

Pos logit contributions
beh+2.207
bered+2.176
dden+2.149
thora+2.129
gged+2.061
Neg logit contributions
erness-1.891
independence-1.711
eenth-1.684
aways-1.634
meal-1.591
Token?
Token ID30
Feature activation+1.44e+00

Pos logit contributions
erness+2.570
independence+2.354
aways+2.252
eenth+2.240
meal+2.189
Neg logit contributions
bered-3.678
dden-3.667
beh-3.664
thora-3.635
gged-3.495
Token
Token ID447
Feature activation-2.76e+00

Pos logit contributions
erness+0.881
independence+0.791
eenth+0.775
aways+0.759
meal+0.734
Neg logit contributions
beh-1.271
bered-1.260
dden-1.251
thora-1.237
icably-1.194
Token
Token ID251
Feature activation-1.36e+00

Pos logit contributions
beh+1.815
bered+1.786
dden+1.777
thora+1.748
icably+1.657
Neg logit contributions
erness-2.298
independence-2.125
eenth-2.085
aways-2.065
meal-2.015
Token
Token ID198
Feature activation+1.28e+00

Pos logit contributions
beh+1.206
bered+1.190
dden+1.184
thora+1.169
icably+1.127
Neg logit contributions
erness-0.833
independence-0.746
eenth-0.729
aways-0.715
meal-0.690
Token
Token ID198
Feature activation+3.28e+00

Pos logit contributions
erness+0.834
independence+0.752
eenth+0.732
aways+0.717
meal+0.692
Neg logit contributions
beh-1.094
dden-1.071
bered-1.069
thora-1.045
gged-1.013
TokenEven
Token ID6104
Feature activation+1.68e+01

Pos logit contributions
erness+2.073
independence+1.898
eenth+1.823
aways+1.798
meal+1.760
Neg logit contributions
beh-2.777
bered-2.771
dden-2.752
thora-2.743
icably-2.634
Token among
Token ID1871
Feature activation+5.88e+00

Pos logit contributions
erness+9.294
independence+8.391
meal+8.211
aways+8.164
eenth+7.944
Neg logit contributions
thora-13.239
dden-12.880
beh-12.763
bered-12.721
iddled-12.282
Token Muslim
Token ID3765
Feature activation-4.40e+00

Pos logit contributions
erness+4.147
independence+3.817
aways+3.716
eenth+3.706
meal+3.577
Neg logit contributions
beh-4.434
thora-4.418
dden-4.383
bered-4.377
icably-4.140
Token scholars
Token ID9123
Feature activation-1.49e+00

Pos logit contributions
beh+3.008
bered+2.958
dden+2.905
thora+2.806
icably+2.740
Neg logit contributions
erness-3.543
independence-3.244
eenth-3.213
aways-3.136
meal-3.066
Token the
Token ID262
Feature activation-1.88e+00

Pos logit contributions
beh+1.220
bered+1.204
dden+1.190
thora+1.167
icably+1.129
Neg logit contributions
erness-1.020
independence-0.922
eenth-0.903
aways-0.882
meal-0.849
Token principle
Token ID7989
Feature activation-8.17e+00

Pos logit contributions
beh+1.387
bered+1.353
dden+1.341
thora+1.322
icably+1.267
Neg logit contributions
erness-1.429
independence-1.307
eenth-1.295
aways-1.266
meal-1.224
Token.
Token ID13
Feature activation-9.08e-01

Pos logit contributions
beh+0.971
bered+0.958
dden+0.957
thora+0.943
gged+0.909
Neg logit contributions
erness-0.702
independence-0.630
eenth-0.612
aways-0.600
meal-0.582
Token 2
Token ID362
Feature activation+1.91e+00

Pos logit contributions
beh+0.729
dden+0.716
bered+0.713
thora+0.700
gged+0.675
Neg logit contributions
erness-0.632
independence-0.574
eenth-0.555
aways-0.547
meal-0.530
Token.
Token ID13
Feature activation-7.51e-01

Pos logit contributions
erness+1.181
independence+1.060
eenth+1.040
aways+1.022
meal+0.986
Neg logit contributions
beh-1.675
bered-1.654
dden-1.647
thora-1.627
icably-1.566
Token However
Token ID2102
Feature activation+3.07e+00

Pos logit contributions
beh+0.615
dden+0.605
bered+0.604
thora+0.594
gged+0.574
Neg logit contributions
erness-0.507
independence-0.462
eenth-0.445
aways-0.438
meal-0.427
Token,
Token ID11
Feature activation+4.81e+00

Pos logit contributions
erness+1.739
independence+1.541
eenth+1.506
aways+1.491
meal+1.448
Neg logit contributions
beh-2.798
dden-2.731
bered-2.728
thora-2.716
untled-2.616
Token even
Token ID772
Feature activation+1.68e+01

Pos logit contributions
erness+3.053
independence+2.745
eenth+2.695
aways+2.688
meal+2.578
Neg logit contributions
beh-4.052
dden-3.981
bered-3.977
thora-3.940
untled-3.764
Token I
Token ID314
Feature activation+5.94e+00

Pos logit contributions
erness+8.644
aways+7.709
independence+7.605
eenth+7.453
meal+7.244
Neg logit contributions
beh-13.724
thora-13.664
dden-13.549
bered-13.254
iddled-12.936
Token can
Token ID460
Feature activation+4.46e+00

Pos logit contributions
erness+3.944
independence+3.534
eenth+3.498
aways+3.489
meal+3.381
Neg logit contributions
beh-4.824
dden-4.749
thora-4.722
bered-4.691
untled-4.480
Token't
Token ID470
Feature activation+4.80e+00

Pos logit contributions
erness+2.867
independence+2.567
eenth+2.518
aways+2.491
meal+2.406
Neg logit contributions
beh-3.790
bered-3.701
dden-3.696
thora-3.675
untled-3.538
Token deny
Token ID10129
Feature activation+6.50e+00

Pos logit contributions
erness+3.738
independence+3.436
eenth+3.371
aways+3.355
meal+3.256
Neg logit contributions
beh-3.399
bered-3.306
dden-3.303
thora-3.277
untled-3.114
Token how
Token ID703
Feature activation+2.58e+00

Pos logit contributions
erness+3.968
independence+3.623
aways+3.522
eenth+3.496
meal+3.380
Neg logit contributions
beh-5.501
dden-5.381
thora-5.336
bered-5.317
untled-5.106
Token issues
Token ID2428
Feature activation+3.82e+00

Pos logit contributions
beh+2.376
bered+2.314
dden+2.288
thora+2.275
icably+2.197
Neg logit contributions
erness-2.290
independence-2.083
eenth-2.074
aways-2.008
meal-1.959
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
erness+2.338
independence+2.089
eenth+2.064
aways+2.039
meal+1.962
Neg logit contributions
beh-3.298
bered-3.288
dden-3.281
thora-3.230
icably-3.105
Token What
Token ID1867
Feature activation+1.17e+01

Pos logit contributions
erness+0.894
independence+0.806
eenth+0.794
aways+0.780
meal+0.754
Neg logit contributions
beh-1.140
bered-1.128
dden-1.122
thora-1.107
icably-1.063
Token was
Token ID373
Feature activation+1.12e+01

Pos logit contributions
erness+6.722
aways+6.172
independence+6.162
eenth+5.926
meal+5.829
Neg logit contributions
beh-9.498
dden-9.437
bered-9.312
thora-9.120
iddled-8.881
Token it
Token ID340
Feature activation+1.31e+01

Pos logit contributions
erness+6.082
aways+5.664
eenth+5.549
meal+5.505
independence+5.480
Neg logit contributions
beh-9.274
bered-9.144
thora-9.046
dden-9.040
untled-8.671
Token like
Token ID588
Feature activation+1.67e+01

Pos logit contributions
erness+6.489
aways+5.680
eenth+5.601
meal+5.464
independence+5.396
Neg logit contributions
beh-11.376
thora-11.098
dden-11.068
bered-10.981
untled-10.812
Token watching
Token ID4964
Feature activation+2.40e+00

Pos logit contributions
erness+9.171
aways+8.270
eenth+7.842
independence+7.767
meal+7.670
Neg logit contributions
thora-12.885
beh-12.617
dden-12.608
bered-12.532
iddled-12.321
Token your
Token ID534
Feature activation+2.28e+00

Pos logit contributions
erness+1.653
independence+1.506
eenth+1.477
aways+1.457
meal+1.414
Neg logit contributions
beh-1.921
bered-1.888
dden-1.884
thora-1.855
untled-1.776
Token team
Token ID1074
Feature activation+1.89e+00

Pos logit contributions
erness+1.609
independence+1.468
eenth+1.441
aways+1.413
meal+1.375
Neg logit contributions
beh-1.794
bered-1.773
dden-1.763
thora-1.732
icably-1.666
Token compete
Token ID9320
Feature activation-2.28e+00

Pos logit contributions
erness+1.413
independence+1.292
eenth+1.270
aways+1.254
meal+1.221
Neg logit contributions
beh-1.412
bered-1.388
dden-1.382
thora-1.365
icably-1.298
Token without
Token ID1231
Feature activation-3.23e+00

Pos logit contributions
beh+1.860
bered+1.840
dden+1.819
thora+1.791
icably+1.740
Neg logit contributions
erness-1.542
independence-1.393
eenth-1.348
aways-1.319
meal-1.284
Token hanging
Token ID10938
Feature activation+1.93e+00

Pos logit contributions
erness+1.585
independence+1.445
eenth+1.418
aways+1.403
meal+1.361
Neg logit contributions
beh-1.762
bered-1.732
dden-1.729
thora-1.715
untled-1.635
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
erness+1.222
independence+1.096
eenth+1.081
aways+1.062
meal+1.029
Neg logit contributions
beh-1.649
bered-1.630
dden-1.624
thora-1.611
icably-1.541
Token
Token ID198
Feature activation-1.43e-01

Pos logit contributions
beh+1.223
dden+1.198
bered+1.196
thora+1.173
icably+1.134
Neg logit contributions
erness-0.914
independence-0.821
eenth-0.798
aways-0.784
meal-0.760
Token
Token ID198
Feature activation-7.47e-01

Pos logit contributions
beh+0.118
dden+0.115
bered+0.114
thora+0.111
gged+0.108
Neg logit contributions
erness-0.098
independence-0.089
eenth-0.086
aways-0.084
meal-0.081
TokenBut
Token ID1537
Feature activation+5.72e+00

Pos logit contributions
beh+0.609
bered+0.594
dden+0.594
thora+0.581
icably+0.561
Neg logit contributions
erness-0.514
independence-0.461
eenth-0.456
aways-0.445
meal-0.432
Token even
Token ID772
Feature activation+1.67e+01

Pos logit contributions
erness+3.757
independence+3.429
eenth+3.343
aways+3.339
meal+3.228
Neg logit contributions
beh-4.577
thora-4.572
bered-4.567
dden-4.522
untled-4.361
Token if
Token ID611
Feature activation+1.21e-01

Pos logit contributions
erness+8.658
aways+7.981
independence+7.967
meal+7.681
eenth+7.621
Neg logit contributions
thora-13.811
dden-13.087
bered-13.059
beh-12.923
iddled-12.862
Token they
Token ID484
Feature activation+7.15e+00

Pos logit contributions
erness+0.082
independence+0.074
eenth+0.073
aways+0.071
meal+0.069
Neg logit contributions
beh-0.100
bered-0.098
dden-0.098
thora-0.097
icably-0.093
Token never
Token ID1239
Feature activation+5.15e+00

Pos logit contributions
erness+4.633
independence+4.158
eenth+4.108
aways+4.017
meal+3.882
Neg logit contributions
beh-5.750
thora-5.747
dden-5.720
bered-5.690
gged-5.444
Token tell
Token ID1560
Feature activation+4.63e+00

Pos logit contributions
erness+3.810
independence+3.465
eenth+3.414
aways+3.391
meal+3.278
Neg logit contributions
beh-3.805
dden-3.741
thora-3.729
bered-3.710
untled-3.530
Token you
Token ID345
Feature activation-2.34e+00

Pos logit contributions
erness+2.522
independence+2.242
eenth+2.159
aways+2.150
meal+2.072
Neg logit contributions
beh-4.296
thora-4.264
dden-4.253
bered-4.235
icably-4.079
Token six
Token ID2237
Feature activation+3.51e+00

Pos logit contributions
erness+2.044
independence+1.884
eenth+1.833
aways+1.798
meal+1.764
Neg logit contributions
beh-2.362
bered-2.328
dden-2.315
thora-2.298
icably-2.200
Token years
Token ID812
Feature activation-1.22e+00

Pos logit contributions
erness+2.398
independence+2.231
eenth+2.182
aways+2.131
meal+2.088
Neg logit contributions
beh-2.797
dden-2.752
thora-2.736
bered-2.733
untled-2.592
Token.
Token ID13
Feature activation-9.79e-01

Pos logit contributions
beh+0.959
bered+0.946
dden+0.937
thora+0.921
icably+0.887
Neg logit contributions
erness-0.873
independence-0.794
eenth-0.779
aways-0.766
meal-0.741
Token
Token ID198
Feature activation+2.62e-01

Pos logit contributions
beh+0.785
bered+0.768
dden+0.767
thora+0.753
gged+0.727
Neg logit contributions
erness-0.680
independence-0.616
eenth-0.600
aways-0.590
meal-0.574
Token
Token ID198
Feature activation+6.60e-01

Pos logit contributions
erness+0.177
independence+0.160
eenth+0.155
aways+0.152
meal+0.147
Neg logit contributions
beh-0.217
dden-0.213
bered-0.211
thora-0.206
gged-0.201
TokenEven
Token ID6104
Feature activation+1.66e+01

Pos logit contributions
erness+0.447
independence+0.406
eenth+0.397
aways+0.390
meal+0.379
Neg logit contributions
beh-0.539
bered-0.532
dden-0.529
thora-0.522
icably-0.502
Token more
Token ID517
Feature activation+3.25e+00

Pos logit contributions
erness+9.719
independence+9.028
aways+8.831
meal+8.708
eenth+8.572
Neg logit contributions
thora-12.455
dden-12.039
beh-11.931
bered-11.828
untled-11.320
Token seriously
Token ID6411
Feature activation-7.28e+00

Pos logit contributions
erness+2.392
independence+2.198
eenth+2.157
aways+2.134
meal+2.076
Neg logit contributions
beh-2.433
dden-2.393
bered-2.372
thora-2.361
untled-2.246
Token,
Token ID11
Feature activation-1.93e-01

Pos logit contributions
beh+5.804
bered+5.729
dden+5.650
thora+5.501
gged+5.447
Neg logit contributions
erness-4.958
independence-4.487
eenth-4.390
aways-4.209
meal-4.066
Token in
Token ID287
Feature activation-8.72e-01

Pos logit contributions
beh+0.149
bered+0.147
dden+0.146
thora+0.144
icably+0.138
Neg logit contributions
erness-0.139
independence-0.127
eenth-0.124
aways-0.122
meal-0.119
Token late
Token ID2739
Feature activation-3.28e-02

Pos logit contributions
beh+0.636
bered+0.625
dden+0.621
thora+0.610
icably+0.585
Neg logit contributions
erness-0.669
independence-0.613
eenth-0.602
aways-0.593
meal-0.577
Token.
Token ID13
Feature activation-1.80e+00

Pos logit contributions
erness+0.997
independence+0.896
eenth+0.874
aways+0.860
meal+0.832
Neg logit contributions
beh-1.457
bered-1.442
dden-1.436
thora-1.420
icably-1.371
Token
Token ID447
Feature activation-2.88e+00

Pos logit contributions
beh+1.522
dden+1.484
bered+1.480
thora+1.459
gged+1.408
Neg logit contributions
erness-1.180
independence-1.059
eenth-1.028
aways-1.014
meal-0.983
Token
Token ID251
Feature activation-6.87e-01

Pos logit contributions
beh+1.612
dden+1.575
bered+1.574
thora+1.539
icably+1.450
Neg logit contributions
erness-2.661
independence-2.494
eenth-2.442
aways-2.426
meal-2.386
Token
Token ID198
Feature activation+4.55e-01

Pos logit contributions
beh+0.601
bered+0.593
dden+0.590
thora+0.583
icably+0.561
Neg logit contributions
erness-0.427
independence-0.383
eenth-0.374
aways-0.367
meal-0.355
Token
Token ID198
Feature activation+1.95e+00

Pos logit contributions
erness+0.305
independence+0.276
eenth+0.269
aways+0.262
meal+0.253
Neg logit contributions
beh-0.378
dden-0.370
bered-0.368
thora-0.359
gged-0.350
TokenEven
Token ID6104
Feature activation+1.65e+01

Pos logit contributions
erness+1.294
independence+1.178
eenth+1.147
aways+1.130
meal+1.098
Neg logit contributions
beh-1.604
bered-1.593
dden-1.581
thora-1.565
icably-1.501
Token by
Token ID416
Feature activation+4.40e+00

Pos logit contributions
erness+9.753
independence+8.962
meal+8.907
aways+8.822
eenth+8.739
Neg logit contributions
thora-12.195
dden-11.785
beh-11.652
bered-11.635
untled-11.156
Token the
Token ID262
Feature activation-1.69e+00

Pos logit contributions
erness+2.976
independence+2.730
eenth+2.675
aways+2.628
meal+2.599
Neg logit contributions
beh-3.493
bered-3.458
dden-3.450
thora-3.439
untled-3.277
Token deb
Token ID1915
Feature activation-6.97e+00

Pos logit contributions
beh+1.173
bered+1.152
dden+1.142
thora+1.113
gged+1.082
Neg logit contributions
erness-1.357
independence-1.242
eenth-1.228
aways-1.215
meal-1.170
Tokenased
Token ID839
Feature activation+2.21e+00

Pos logit contributions
bered+4.333
beh+4.286
dden+4.278
thora+4.150
gged+4.097
Neg logit contributions
erness-5.812
independence-5.386
eenth-5.258
meal-5.216
aways-5.215
Token standards
Token ID5423
Feature activation-1.87e+00

Pos logit contributions
erness+1.816
independence+1.679
eenth+1.648
aways+1.618
meal+1.592
Neg logit contributions
beh-1.473
bered-1.455
dden-1.441
thora-1.426
icably-1.357
Token
Token ID198
Feature activation+4.70e+00

Pos logit contributions
erness+2.686
independence+2.421
eenth+2.367
aways+2.339
meal+2.276
Neg logit contributions
beh-3.621
bered-3.604
dden-3.591
thora-3.536
icably-3.406
Token
Token ID198
Feature activation+2.66e+00

Pos logit contributions
erness+2.731
independence+2.436
eenth+2.384
aways+2.342
meal+2.265
Neg logit contributions
beh-4.253
bered-4.227
dden-4.182
thora-4.169
untled-4.011
Token
Token ID447
Feature activation+6.31e+00

Pos logit contributions
erness+1.739
independence+1.579
eenth+1.536
aways+1.515
meal+1.469
Neg logit contributions
beh-2.226
bered-2.207
dden-2.193
thora-2.168
icably-2.084
Token
Token ID250
Feature activation+1.45e+00

Pos logit contributions
erness+4.546
independence+4.176
eenth+4.142
aways+4.000
meal+3.873
Neg logit contributions
beh-4.755
bered-4.643
dden-4.545
thora-4.406
untled-4.347
TokenBut
Token ID1537
Feature activation+5.33e+00

Pos logit contributions
erness+0.971
independence+0.879
eenth+0.860
aways+0.846
meal+0.820
Neg logit contributions
beh-1.192
bered-1.176
dden-1.169
thora-1.153
icably-1.109
Token what
Token ID644
Feature activation+1.65e+01

Pos logit contributions
erness+3.600
independence+3.256
eenth+3.221
aways+3.187
meal+3.079
Neg logit contributions
beh-4.237
bered-4.201
dden-4.179
thora-4.155
icably-3.996
Token about
Token ID546
Feature activation+9.79e+00

Pos logit contributions
erness+7.784
aways+7.401
independence+7.391
eenth+7.129
meal+7.095
Neg logit contributions
bered-13.765
beh-13.629
dden-13.584
thora-13.292
iddled-13.292
Token the
Token ID262
Feature activation+4.38e+00

Pos logit contributions
erness+6.391
aways+5.883
independence+5.861
eenth+5.706
meal+5.543
Neg logit contributions
dden-7.470
thora-7.404
bered-7.319
beh-7.316
gged-7.033
Token books
Token ID3835
Feature activation+8.72e+00

Pos logit contributions
erness+2.992
independence+2.705
eenth+2.672
aways+2.651
meal+2.544
Neg logit contributions
beh-3.453
dden-3.434
bered-3.430
thora-3.378
icably-3.253
Token I
Token ID314
Feature activation+2.78e+00

Pos logit contributions
erness+4.896
independence+4.444
eenth+4.393
aways+4.378
meal+4.284
Neg logit contributions
thora-7.494
bered-7.494
beh-7.455
dden-7.442
icably-7.147
Token bought
Token ID5839
Feature activation-1.90e-01

Pos logit contributions
erness+1.928
independence+1.744
eenth+1.711
aways+1.690
meal+1.641
Neg logit contributions
beh-2.215
bered-2.184
dden-2.180
thora-2.152
icably-2.065
Token stepped
Token ID10764
Feature activation+2.76e+00

Pos logit contributions
erness+3.059
independence+2.780
aways+2.735
eenth+2.694
meal+2.631
Neg logit contributions
beh-3.618
dden-3.568
bered-3.541
thora-3.528
untled-3.366
Token up
Token ID510
Feature activation-1.06e+00

Pos logit contributions
erness+1.699
independence+1.549
eenth+1.513
aways+1.503
meal+1.466
Neg logit contributions
beh-2.394
dden-2.348
thora-2.347
bered-2.344
untled-2.270
Token.
Token ID13
Feature activation+7.05e-01

Pos logit contributions
beh+0.879
bered+0.866
dden+0.860
thora+0.849
icably+0.816
Neg logit contributions
erness-0.703
independence-0.636
eenth-0.620
aways-0.609
meal-0.591
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
erness+0.474
independence+0.429
eenth+0.420
aways+0.413
meal+0.400
Neg logit contributions
beh-0.581
bered-0.574
dden-0.570
thora-0.563
icably-0.541
Token
Token ID198
Feature activation-8.60e-02

Pos logit contributions
beh+0.845
dden+0.828
bered+0.815
thora+0.794
gged+0.778
Neg logit contributions
erness-0.732
independence-0.663
eenth-0.643
aways-0.628
meal-0.606
TokenEven
Token ID6104
Feature activation+1.63e+01

Pos logit contributions
beh+0.070
bered+0.069
dden+0.069
thora+0.068
icably+0.065
Neg logit contributions
erness-0.058
independence-0.053
eenth-0.052
aways-0.051
meal-0.049
Token their
Token ID511
Feature activation+3.80e-01

Pos logit contributions
erness+9.380
aways+8.738
eenth+8.382
meal+8.074
independence+8.006
Neg logit contributions
bered-12.039
beh-12.003
thora-11.901
dden-11.691
untled-11.542
Token much
Token ID881
Feature activation+4.84e+00

Pos logit contributions
erness+0.280
independence+0.256
eenth+0.250
aways+0.246
meal+0.240
Neg logit contributions
beh-0.289
bered-0.284
dden-0.283
thora-0.278
icably-0.267
Token-
Token ID12
Feature activation+5.07e-01

Pos logit contributions
erness+3.082
independence+2.803
eenth+2.794
aways+2.747
meal+2.689
Neg logit contributions
beh-3.971
dden-3.937
bered-3.929
thora-3.916
icably-3.730
Tokenmal
Token ID7617
Feature activation+5.19e+00

Pos logit contributions
erness+0.390
independence+0.360
eenth+0.352
aways+0.348
meal+0.339
Neg logit contributions
beh-0.363
bered-0.360
dden-0.358
thora-0.354
icably-0.338
Tokenigned
Token ID3916
Feature activation+2.95e+00

Pos logit contributions
erness+3.357
independence+3.044
eenth+2.974
aways+2.926
meal+2.838
Neg logit contributions
beh-4.288
bered-4.237
dden-4.233
thora-4.194
icably-4.005
Token to
Token ID284
Feature activation-1.64e+00

Pos logit contributions
erness+1.695
independence+1.533
eenth+1.493
aways+1.472
meal+1.427
Neg logit contributions
beh-2.343
bered-2.314
dden-2.310
thora-2.291
icably-2.195
Token enable
Token ID7139
Feature activation-1.72e+00

Pos logit contributions
beh+1.237
bered+1.218
dden+1.206
thora+1.189
icably+1.147
Neg logit contributions
erness-1.226
independence-1.110
eenth-1.095
aways-1.077
meal-1.047
Token these
Token ID777
Feature activation-6.54e-01

Pos logit contributions
beh+1.328
bered+1.305
dden+1.290
thora+1.273
icably+1.233
Neg logit contributions
erness-1.246
independence-1.127
eenth-1.114
aways-1.092
meal-1.064
Token techniques
Token ID7605
Feature activation-3.93e+00

Pos logit contributions
beh+0.481
bered+0.472
dden+0.468
thora+0.461
icably+0.444
Neg logit contributions
erness-0.498
independence-0.456
eenth-0.448
aways-0.441
meal-0.430
Token.
Token ID13
Feature activation-6.76e-01

Pos logit contributions
beh+3.227
bered+3.173
dden+3.120
thora+3.089
icably+3.007
Neg logit contributions
erness-2.621
independence-2.346
eenth-2.310
aways-2.281
meal-2.195
Token Even
Token ID3412
Feature activation+1.63e+01

Pos logit contributions
beh+0.548
bered+0.538
dden+0.535
thora+0.528
icably+0.509
Neg logit contributions
erness-0.464
independence-0.420
eenth-0.410
aways-0.403
meal-0.392
Token without
Token ID1231
Feature activation+2.30e-01

Pos logit contributions
erness+9.199
independence+8.485
aways+8.356
eenth+8.154
meal+8.127
Neg logit contributions
dden-12.294
thora-12.265
beh-12.034
bered-11.992
untled-11.530
Token them
Token ID606
Feature activation-1.16e+00

Pos logit contributions
erness+0.165
independence+0.150
eenth+0.147
aways+0.145
meal+0.141
Neg logit contributions
beh-0.179
bered-0.177
dden-0.176
thora-0.173
icably-0.166
Token I
Token ID314
Feature activation-3.16e-02

Pos logit contributions
beh+0.976
bered+0.963
dden+0.957
thora+0.945
icably+0.910
Neg logit contributions
erness-0.760
independence-0.685
eenth-0.670
aways-0.659
meal-0.638
Token believe
Token ID1975
Feature activation-4.17e+00

Pos logit contributions
beh+0.024
bered+0.023
dden+0.023
thora+0.023
icably+0.022
Neg logit contributions
erness-0.023
independence-0.022
eenth-0.021
aways-0.021
meal-0.020
Token we
Token ID356
Feature activation-6.41e+00

Pos logit contributions
beh+3.075
bered+3.013
dden+2.985
thora+2.928
icably+2.864
Neg logit contributions
erness-3.145
independence-2.831
eenth-2.816
aways-2.757
meal-2.679
Token for
Token ID329
Feature activation+6.14e-01

Pos logit contributions
erness+0.212
independence+0.194
eenth+0.190
aways+0.187
meal+0.182
Neg logit contributions
beh-0.211
bered-0.208
dden-0.206
thora-0.204
icably-0.195
Token Republican
Token ID3415
Feature activation-6.24e+00

Pos logit contributions
erness+0.473
independence+0.436
eenth+0.427
aways+0.421
meal+0.408
Neg logit contributions
beh-0.444
bered-0.438
dden-0.435
thora-0.430
icably-0.409
Token candidates
Token ID5871
Feature activation-5.47e+00

Pos logit contributions
beh+3.240
dden+3.203
bered+3.152
thora+2.984
gged+2.883
Neg logit contributions
erness-5.847
eenth-5.433
independence-5.421
meal-5.305
aways-5.294
Token continues
Token ID4477
Feature activation+2.63e+00

Pos logit contributions
beh+4.135
bered+4.026
dden+3.991
thora+3.835
icably+3.790
Neg logit contributions
erness-3.901
independence-3.539
eenth-3.513
aways-3.385
meal-3.257
Token to
Token ID284
Feature activation-3.81e+00

Pos logit contributions
erness+1.616
independence+1.455
eenth+1.395
aways+1.382
meal+1.353
Neg logit contributions
beh-2.314
bered-2.296
dden-2.277
thora-2.261
untled-2.175
Token er
Token ID1931
Feature activation-2.20e+01

Pos logit contributions
bered+2.305
dden+2.298
beh+2.283
thora+2.216
icably+2.172
Neg logit contributions
erness-3.288
independence-3.079
eenth-3.026
aways-2.936
meal-2.880
Tokenode
Token ID1098
Feature activation-2.51e+00

Pos logit contributions
thora+4.391
icably+4.335
dden+4.062
oding+3.682
beh+3.563
Neg logit contributions
erness-25.290
independence-23.337
eenth-22.800
aways-22.450
liga-22.086
Token.
Token ID13
Feature activation+1.68e+00

Pos logit contributions
beh+2.075
bered+2.036
dden+2.024
thora+1.988
icably+1.922
Neg logit contributions
erness-1.673
independence-1.509
eenth-1.484
aways-1.449
meal-1.406
Token
Token ID198
Feature activation+1.46e+00

Pos logit contributions
erness+1.115
independence+1.008
eenth+0.992
aways+0.975
meal+0.942
Neg logit contributions
beh-1.392
bered-1.381
dden-1.365
thora-1.351
icably-1.299
Token
Token ID198
Feature activation-4.10e-01

Pos logit contributions
erness+0.950
independence+0.857
eenth+0.834
aways+0.816
meal+0.788
Neg logit contributions
beh-1.238
dden-1.215
bered-1.210
thora-1.183
gged-1.147
TokenO
Token ID46
Feature activation-5.05e-01

Pos logit contributions
beh+0.335
bered+0.330
dden+0.328
thora+0.323
icably+0.311
Neg logit contributions
erness-0.278
independence-0.252
eenth-0.247
aways-0.243
meal-0.235
TokenIn
Token ID818
Feature activation-3.06e-01

Pos logit contributions
beh+1.041
bered+1.022
dden+1.016
thora+0.996
icably+0.962
Neg logit contributions
erness-0.880
independence-0.791
eenth-0.784
aways-0.767
meal-0.740
Token years
Token ID812
Feature activation+1.85e+00

Pos logit contributions
beh+0.236
bered+0.233
dden+0.231
thora+0.228
icably+0.218
Neg logit contributions
erness-0.222
independence-0.202
eenth-0.198
aways-0.195
meal-0.190
Token past
Token ID1613
Feature activation+1.38e+00

Pos logit contributions
erness+1.595
independence+1.480
eenth+1.456
aways+1.438
meal+1.405
Neg logit contributions
beh-1.160
bered-1.137
dden-1.133
thora-1.116
icably-1.056
Token,
Token ID11
Feature activation+4.93e-01

Pos logit contributions
erness+0.862
independence+0.782
eenth+0.758
aways+0.748
meal+0.725
Neg logit contributions
beh-1.192
dden-1.171
bered-1.171
thora-1.160
icably-1.110
Token we
Token ID356
Feature activation-3.06e+00

Pos logit contributions
erness+0.356
independence+0.326
eenth+0.319
aways+0.314
meal+0.305
Neg logit contributions
beh-0.381
bered-0.375
dden-0.373
thora-0.368
icably-0.352
Token er
Token ID1931
Feature activation-2.00e+01

Pos logit contributions
beh+2.245
bered+2.236
dden+2.188
thora+2.142
gged+2.096
Neg logit contributions
erness-2.299
independence-2.130
eenth-2.060
aways-2.033
meal-1.971
Tokenred
Token ID445
Feature activation-8.22e+00

Pos logit contributions
bered+10.721
beh+10.475
dden+10.185
gged+9.965
icably+9.963
Neg logit contributions
erness-16.543
independence-14.990
eenth-14.277
aways-14.118
meal-13.795
Token when
Token ID618
Feature activation-2.64e+00

Pos logit contributions
beh+6.326
bered+6.280
dden+6.088
icably+5.891
thora+5.859
Neg logit contributions
erness-5.768
independence-5.207
eenth-5.008
aways-4.939
meal-4.779
Token observational
Token ID40118
Feature activation+6.64e-01

Pos logit contributions
beh+1.942
bered+1.919
dden+1.891
thora+1.856
icably+1.794
Neg logit contributions
erness-1.998
independence-1.819
eenth-1.797
aways-1.755
meal-1.710
Token data
Token ID1366
Feature activation+1.36e+00

Pos logit contributions
erness+0.434
independence+0.392
eenth+0.383
aways+0.377
meal+0.365
Neg logit contributions
beh-0.558
bered-0.551
dden-0.548
thora-0.542
icably-0.520
Token led
Token ID2957
Feature activation+1.85e+00

Pos logit contributions
erness+0.997
independence+0.912
eenth+0.894
aways+0.883
meal+0.858
Neg logit contributions
beh-1.039
bered-1.022
dden-1.019
thora-1.006
icably-0.961
Tokenama
Token ID1689
Feature activation+1.88e+00

Pos logit contributions
beh+0.692
bered+0.677
dden+0.676
thora+0.668
gged+0.653
Neg logit contributions
erness-0.369
independence-0.321
eenth-0.310
aways-0.305
meal-0.296
Token opened
Token ID4721
Feature activation-1.63e+00

Pos logit contributions
erness+1.306
independence+1.189
eenth+1.166
aways+1.146
meal+1.109
Neg logit contributions
beh-1.508
bered-1.481
dden-1.478
thora-1.459
untled-1.398
Token its
Token ID663
Feature activation-1.92e+00

Pos logit contributions
beh+1.241
bered+1.226
dden+1.217
thora+1.196
icably+1.149
Neg logit contributions
erness-1.191
independence-1.085
eenth-1.065
aways-1.047
meal-1.018
Token doors
Token ID8215
Feature activation-1.75e+00

Pos logit contributions
beh+1.421
bered+1.409
dden+1.394
thora+1.362
icably+1.315
Neg logit contributions
erness-1.440
independence-1.313
eenth-1.287
aways-1.273
meal-1.231
Token --
Token ID1377
Feature activation+1.93e+00

Pos logit contributions
beh+1.422
bered+1.405
dden+1.389
thora+1.366
icably+1.317
Neg logit contributions
erness-1.196
independence-1.084
eenth-1.059
aways-1.040
meal-1.007
Token er
Token ID1931
Feature activation-1.88e+01

Pos logit contributions
erness+1.293
independence+1.171
eenth+1.147
aways+1.131
meal+1.094
Neg logit contributions
beh-1.587
bered-1.567
dden-1.565
thora-1.545
icably-1.479
Token,
Token ID11
Feature activation-1.41e+00

Pos logit contributions
beh+12.094
bered+11.828
thora+11.632
gged+11.256
icably+11.229
Neg logit contributions
erness-13.438
independence-11.995
eenth-11.606
aways-11.142
liga-10.749
Token lot
Token ID1256
Feature activation+2.88e+00

Pos logit contributions
beh+1.082
bered+1.070
dden+1.057
thora+1.038
icably+1.008
Neg logit contributions
erness-1.030
independence-0.934
eenth-0.918
aways-0.902
meal-0.872
Token?
Token ID30
Feature activation-6.06e-01

Pos logit contributions
erness+1.924
independence+1.739
eenth+1.705
aways+1.681
meal+1.644
Neg logit contributions
beh-2.360
bered-2.326
dden-2.316
thora-2.292
untled-2.201
Token --
Token ID1377
Feature activation+2.90e+00

Pos logit contributions
beh+0.491
bered+0.483
dden+0.480
thora+0.472
icably+0.456
Neg logit contributions
erness-0.416
independence-0.378
eenth-0.369
aways-0.361
meal-0.352
Token at
Token ID379
Feature activation+4.46e+00

Pos logit contributions
erness+1.955
independence+1.774
eenth+1.740
aways+1.721
meal+1.661
Neg logit contributions
beh-2.349
bered-2.321
dden-2.320
thora-2.291
untled-2.190
Token that
Token ID326
Feature activation+1.16e+00

Pos logit contributions
beh+3.303
bered+3.275
dden+3.198
thora+3.118
icably+3.068
Neg logit contributions
erness-2.963
independence-2.706
eenth-2.640
aways-2.537
liga-2.464
Token he
Token ID339
Feature activation-2.36e+00

Pos logit contributions
erness+0.765
independence+0.692
eenth+0.675
aways+0.669
meal+0.647
Neg logit contributions
beh-0.960
bered-0.950
dden-0.946
thora-0.936
untled-0.897
Token had
Token ID550
Feature activation+1.46e+00

Pos logit contributions
beh+1.821
bered+1.789
dden+1.749
thora+1.701
gged+1.670
Neg logit contributions
erness-1.746
independence-1.595
eenth-1.553
aways-1.508
meal-1.470
Token
Token ID564
Feature activation-3.31e+00

Pos logit contributions
erness+1.118
independence+1.025
eenth+1.006
aways+0.994
meal+0.967
Neg logit contributions
beh-1.065
bered-1.048
dden-1.043
thora-1.028
icably-0.981
Token
Token ID250
Feature activation-7.54e-01

Pos logit contributions
beh+2.084
dden+2.057
bered+2.044
thora+1.993
gged+1.914
Neg logit contributions
erness-2.860
independence-2.659
eenth-2.595
aways-2.575
meal-2.518
Tokenimpro
Token ID32077
Feature activation-1.82e+01

Pos logit contributions
beh+0.578
bered+0.565
dden+0.560
thora+0.548
gged+0.526
Neg logit contributions
erness-0.559
independence-0.506
eenth-0.501
aways-0.489
meal-0.475
Tokenper
Token ID525
Feature activation-8.32e+00

Pos logit contributions
bered+8.287
beh+8.080
dden+7.883
gged+7.407
thora+7.067
Neg logit contributions
erness-16.537
eenth-15.992
independence-15.714
liga-14.952
meal-14.894
Tokenly
Token ID306
Feature activation-6.88e+00

Pos logit contributions
beh+5.126
bered+4.918
dden+4.846
thora+4.536
icably+4.507
Neg logit contributions
erness-7.162
independence-6.593
eenth-6.433
aways-6.335
liga-6.288
Token touched
Token ID12615
Feature activation-3.92e+00

Pos logit contributions
beh+4.980
bered+4.922
dden+4.702
gged+4.540
icably+4.410
Neg logit contributions
erness-5.427
independence-4.966
eenth-4.710
aways-4.539
liga-4.536
Token
Token ID447
Feature activation+3.23e+00

Pos logit contributions
beh+3.004
bered+2.954
dden+2.953
thora+2.856
gged+2.751
Neg logit contributions
erness-2.860
independence-2.597
eenth-2.549
aways-2.484
meal-2.426
Token
Token ID251
Feature activation+1.88e+00

Pos logit contributions
erness+2.607
independence+2.399
eenth+2.369
aways+2.325
meal+2.262
Neg logit contributions
beh-2.218
bered-2.181
dden-2.157
thora-2.128
icably-2.036
Token for
Token ID329
Feature activation-5.77e-01

Pos logit contributions
bered+3.305
beh+3.301
dden+3.245
thora+3.187
icably+3.077
Neg logit contributions
erness-3.239
independence-2.971
eenth-2.901
aways-2.804
meal-2.730
Token him
Token ID683
Feature activation-3.68e+00

Pos logit contributions
beh+0.514
bered+0.507
dden+0.505
thora+0.499
icably+0.481
Neg logit contributions
erness-0.349
independence-0.313
eenth-0.305
aways-0.300
meal-0.289
Token,
Token ID11
Feature activation-2.23e+00

Pos logit contributions
beh+2.962
bered+2.933
dden+2.892
thora+2.835
icably+2.750
Neg logit contributions
erness-2.535
independence-2.314
eenth-2.254
aways-2.179
meal-2.120
Token he
Token ID339
Feature activation-7.69e-01

Pos logit contributions
beh+1.576
bered+1.552
dden+1.536
thora+1.510
icably+1.449
Neg logit contributions
erness-1.766
independence-1.623
eenth-1.592
aways-1.565
meal-1.525
Token "
Token ID366
Feature activation-3.65e+00

Pos logit contributions
beh+0.596
bered+0.594
dden+0.579
thora+0.568
icably+0.553
Neg logit contributions
erness-0.558
independence-0.511
eenth-0.496
aways-0.485
meal-0.471
Tokenrem
Token ID2787
Feature activation-1.79e+01

Pos logit contributions
beh+2.769
bered+2.683
dden+2.648
thora+2.579
icably+2.507
Neg logit contributions
erness-2.713
independence-2.453
eenth-2.450
aways-2.372
meal-2.296
Tokenained
Token ID1328
Feature activation-2.96e+00

Pos logit contributions
bered+12.423
beh+11.996
edy+11.520
dden+11.169
icably+11.041
Neg logit contributions
erness-11.375
eenth-11.031
independence-10.897
meal-10.720
aways-10.510
Token at
Token ID379
Feature activation+5.80e-01

Pos logit contributions
beh+2.142
bered+2.115
dden+2.082
thora+2.014
gged+1.981
Neg logit contributions
erness-2.282
independence-2.095
eenth-2.054
aways-1.988
meal-1.955
Token the
Token ID262
Feature activation-7.32e-01

Pos logit contributions
erness+0.396
independence+0.359
eenth+0.352
aways+0.348
meal+0.336
Neg logit contributions
beh-0.470
bered-0.462
dden-0.461
thora-0.455
gged-0.435
Token bottom
Token ID4220
Feature activation+6.27e-01

Pos logit contributions
beh+0.514
bered+0.507
dden+0.503
thora+0.493
icably+0.472
Neg logit contributions
erness-0.581
independence-0.535
eenth-0.526
aways-0.517
meal-0.504
Token of
Token ID286
Feature activation-2.58e+00

Pos logit contributions
erness+0.434
independence+0.390
eenth+0.387
aways+0.382
meal+0.373
Neg logit contributions
beh-0.506
bered-0.495
dden-0.492
thora-0.490
icably-0.469
Token While
Token ID2893
Feature activation-1.36e+00

Pos logit contributions
beh+1.246
dden+1.224
bered+1.218
thora+1.204
icably+1.165
Neg logit contributions
erness-1.008
independence-0.921
eenth-0.891
aways-0.871
meal-0.852
Token the
Token ID262
Feature activation-1.43e+00

Pos logit contributions
beh+1.052
bered+1.037
dden+1.030
thora+1.015
icably+0.975
Neg logit contributions
erness-0.976
independence-0.890
eenth-0.873
aways-0.858
meal-0.834
Token source
Token ID2723
Feature activation-7.99e-01

Pos logit contributions
beh+1.038
bered+1.019
dden+1.012
thora+0.996
icably+0.955
Neg logit contributions
erness-1.107
independence-1.018
eenth-1.001
aways-0.983
meal-0.957
Token of
Token ID286
Feature activation-1.85e+00

Pos logit contributions
beh+0.611
bered+0.602
dden+0.598
thora+0.590
icably+0.565
Neg logit contributions
erness-0.584
independence-0.533
eenth-0.523
aways-0.514
meal-0.500
Token the
Token ID262
Feature activation+7.59e-01

Pos logit contributions
beh+1.444
bered+1.422
dden+1.415
thora+1.396
icably+1.342
Neg logit contributions
erness-1.329
independence-1.211
eenth-1.188
aways-1.166
meal-1.133
Token er
Token ID1931
Feature activation-1.78e+01

Pos logit contributions
erness+0.585
independence+0.536
eenth+0.526
aways+0.520
meal+0.507
Neg logit contributions
beh-0.548
bered-0.541
dden-0.538
thora-0.529
icably-0.505
Tokenrant
Token ID5250
Feature activation-1.06e+00

Pos logit contributions
thora+8.700
beh+8.332
bered+8.073
dden+7.959
untled+7.947
Neg logit contributions
erness-15.877
independence-14.233
eenth-14.129
aways-14.095
liga-13.580
Token command
Token ID3141
Feature activation-4.95e+00

Pos logit contributions
beh+0.741
bered+0.725
dden+0.719
thora+0.706
icably+0.679
Neg logit contributions
erness-0.852
independence-0.787
eenth-0.772
aways-0.758
meal-0.740
Token was
Token ID373
Feature activation+6.16e-01

Pos logit contributions
beh+3.655
bered+3.631
dden+3.486
thora+3.465
gged+3.363
Neg logit contributions
erness-3.664
independence-3.401
eenth-3.296
aways-3.148
meal-3.097
Token not
Token ID407
Feature activation-2.82e+00

Pos logit contributions
erness+0.474
independence+0.434
eenth+0.426
aways+0.420
meal+0.409
Neg logit contributions
beh-0.447
bered-0.440
dden-0.438
thora-0.432
icably-0.411
Token known
Token ID1900
Feature activation-2.82e-01

Pos logit contributions
beh+2.105
bered+2.103
dden+2.051
thora+2.006
icably+2.000
Neg logit contributions
erness-2.080
independence-1.896
eenth-1.859
aways-1.830
meal-1.784
Token and
Token ID290
Feature activation-1.62e+00

Pos logit contributions
beh+1.476
bered+1.450
dden+1.433
thora+1.406
icably+1.367
Neg logit contributions
erness-1.206
independence-1.085
eenth-1.061
aways-1.040
meal-1.004
Token its
Token ID663
Feature activation-1.49e+00

Pos logit contributions
beh+1.096
bered+1.076
dden+1.065
thora+1.050
icably+1.001
Neg logit contributions
erness-1.327
independence-1.219
eenth-1.204
aways-1.178
meal-1.154
Token government
Token ID1230
Feature activation+6.39e-01

Pos logit contributions
beh+0.940
bered+0.922
dden+0.906
thora+0.894
icably+0.847
Neg logit contributions
erness-1.300
independence-1.198
eenth-1.186
aways-1.163
meal-1.137
Token has
Token ID468
Feature activation-2.02e-02

Pos logit contributions
erness+0.471
independence+0.430
eenth+0.422
aways+0.415
meal+0.404
Neg logit contributions
beh-0.485
bered-0.478
dden-0.474
thora-0.467
icably-0.448
Token a
Token ID257
Feature activation-3.80e+00

Pos logit contributions
beh+0.015
bered+0.015
dden+0.015
thora+0.015
icably+0.014
Neg logit contributions
erness-0.015
independence-0.014
eenth-0.014
aways-0.013
meal-0.013
Token good
Token ID922
Feature activation-1.77e+01

Pos logit contributions
beh+2.535
bered+2.499
thora+2.430
dden+2.422
gged+2.310
Neg logit contributions
erness-3.143
independence-2.869
eenth-2.858
aways-2.805
meal-2.712
Token reputation
Token ID8507
Feature activation-6.78e+00

Pos logit contributions
beh+9.949
bered+9.187
thora+8.958
gged+8.471
reci+8.340
Neg logit contributions
erness-14.238
aways-12.449
independence-12.324
eenth-12.314
liga-11.543
Token,
Token ID11
Feature activation-1.90e+00

Pos logit contributions
beh+5.035
bered+4.900
dden+4.814
thora+4.678
gged+4.561
Neg logit contributions
erness-5.044
independence-4.540
eenth-4.482
aways-4.334
meal-4.261
Token the
Token ID262
Feature activation-1.29e+00

Pos logit contributions
beh+1.426
bered+1.401
dden+1.387
thora+1.365
icably+1.314
Neg logit contributions
erness-1.425
independence-1.291
eenth-1.276
aways-1.250
meal-1.218
Token instruments
Token ID12834
Feature activation-5.72e-01

Pos logit contributions
beh+0.885
bered+0.865
dden+0.858
thora+0.847
icably+0.807
Neg logit contributions
erness-1.044
independence-0.956
eenth-0.947
aways-0.930
meal-0.908
Token that
Token ID326
Feature activation+1.43e+00

Pos logit contributions
beh+0.434
bered+0.428
dden+0.425
thora+0.419
icably+0.401
Neg logit contributions
erness-0.421
independence-0.385
eenth-0.378
aways-0.372
meal-0.362
Token radios
Token ID40023
Feature activation+7.76e-01

Pos logit contributions
erness+1.336
independence+1.229
eenth+1.209
aways+1.196
meal+1.161
Neg logit contributions
beh-1.164
bered-1.149
dden-1.142
thora-1.126
icably-1.069
Token,
Token ID11
Feature activation-8.85e-01

Pos logit contributions
erness+0.535
independence+0.486
eenth+0.476
aways+0.469
meal+0.454
Neg logit contributions
beh-0.624
bered-0.615
dden-0.612
thora-0.604
icably-0.579
Token when
Token ID618
Feature activation-2.26e+00

Pos logit contributions
beh+0.698
bered+0.687
dden+0.680
thora+0.671
icably+0.648
Neg logit contributions
erness-0.626
independence-0.570
eenth-0.559
aways-0.548
meal-0.533
Token you
Token ID345
Feature activation+1.39e+00

Pos logit contributions
beh+1.780
bered+1.749
dden+1.726
thora+1.704
icably+1.647
Neg logit contributions
erness-1.601
independence-1.459
eenth-1.432
aways-1.395
meal-1.363
Token un
Token ID555
Feature activation-6.85e+00

Pos logit contributions
erness+1.097
independence+1.009
eenth+0.991
aways+0.976
meal+0.952
Neg logit contributions
beh-0.984
bered-0.970
dden-0.961
thora-0.947
icably-0.906
Tokenplug
Token ID16875
Feature activation-1.77e+01

Pos logit contributions
beh+4.654
dden+4.435
bered+4.371
icably+4.266
thora+4.153
Neg logit contributions
erness-5.381
eenth-5.074
independence-5.029
aways-4.803
meal-4.671
Token the
Token ID262
Feature activation-4.80e+00

Pos logit contributions
bered+10.287
beh+10.120
dden+9.442
rimp+9.371
untled+9.349
Neg logit contributions
erness-12.825
independence-12.202
eenth-11.592
aways-11.130
meal-11.084
Token external
Token ID7097
Feature activation-4.97e+00

Pos logit contributions
beh+3.452
bered+3.378
dden+3.317
thora+3.271
icably+3.142
Neg logit contributions
erness-3.667
independence-3.374
eenth-3.321
aways-3.230
meal-3.129
Token antenna
Token ID20509
Feature activation-1.92e+00

Pos logit contributions
beh+3.139
bered+3.046
dden+2.965
thora+2.915
icably+2.822
Neg logit contributions
erness-4.244
independence-3.929
eenth-3.899
aways-3.753
meal-3.658
Token connection
Token ID4637
Feature activation-2.96e+00

Pos logit contributions
beh+1.569
bered+1.548
dden+1.534
thora+1.511
icably+1.460
Neg logit contributions
erness-1.304
independence-1.189
eenth-1.161
aways-1.129
meal-1.102
Token,
Token ID11
Feature activation-4.00e+00

Pos logit contributions
beh+2.347
bered+2.307
dden+2.276
thora+2.254
icably+2.184
Neg logit contributions
erness-2.060
independence-1.883
eenth-1.841
aways-1.773
meal-1.740
Token speakers
Token ID11636
Feature activation+2.95e-01

Pos logit contributions
erness+1.469
independence+1.340
eenth+1.320
aways+1.300
meal+1.263
Neg logit contributions
beh-1.427
bered-1.411
dden-1.404
thora-1.387
icably-1.327
Token
Token ID784
Feature activation+7.13e-01

Pos logit contributions
erness+0.191
independence+0.172
eenth+0.168
aways+0.165
meal+0.160
Neg logit contributions
beh-0.250
bered-0.247
dden-0.245
thora-0.242
icably-0.233
Token if
Token ID611
Feature activation-6.77e+00

Pos logit contributions
erness+0.509
independence+0.464
eenth+0.455
aways+0.448
meal+0.435
Neg logit contributions
beh-0.557
bered-0.549
dden-0.545
thora-0.538
icably-0.516
Token a
Token ID257
Feature activation-1.68e+00

Pos logit contributions
beh+5.213
bered+5.109
dden+5.089
thora+4.996
icably+4.793
Neg logit contributions
erness-4.806
independence-4.374
eenth-4.320
aways-4.192
meal-4.063
Token door
Token ID3420
Feature activation-9.63e+00

Pos logit contributions
beh+1.253
bered+1.233
dden+1.221
thora+1.202
icably+1.152
Neg logit contributions
erness-1.260
independence-1.157
eenth-1.133
aways-1.117
meal-1.083
Token cre
Token ID1126
Feature activation-1.73e+01

Pos logit contributions
beh+6.566
bered+6.200
dden+6.090
thora+6.080
icably+5.885
Neg logit contributions
erness-7.460
independence-6.819
eenth-6.621
aways-6.524
liga-6.336
Tokenaks
Token ID4730
Feature activation-9.84e+00

Pos logit contributions
beh+9.048
bered+8.473
dden+8.417
thora+8.255
gged+8.108
Neg logit contributions
erness-13.712
independence-12.912
eenth-12.693
DragonMagazine-12.568
liga-12.412
Token
Token ID784
Feature activation-3.87e+00

Pos logit contributions
beh+7.567
bered+7.486
dden+7.107
icably+6.852
thora+6.775
Neg logit contributions
erness-6.739
independence-6.184
eenth-5.922
aways-5.776
meal-5.646
Token make
Token ID787
Feature activation+2.09e+00

Pos logit contributions
beh+2.685
bered+2.636
dden+2.625
thora+2.547
icably+2.464
Neg logit contributions
erness-3.063
independence-2.832
eenth-2.776
aways-2.721
meal-2.626
Token the
Token ID262
Feature activation-3.16e-01

Pos logit contributions
erness+1.493
independence+1.356
eenth+1.330
aways+1.325
meal+1.281
Neg logit contributions
beh-1.618
bered-1.597
dden-1.593
thora-1.571
untled-1.504
Token noise
Token ID7838
Feature activation-1.70e-01

Pos logit contributions
beh+0.221
bered+0.218
dden+0.216
thora+0.213
icably+0.203
Neg logit contributions
erness-0.252
independence-0.231
eenth-0.227
aways-0.224
meal-0.219
Token Hale
Token ID35056
Feature activation+1.92e+00

Pos logit contributions
beh+0.443
bered+0.434
dden+0.432
thora+0.425
icably+0.409
Neg logit contributions
erness-0.440
independence-0.402
eenth-0.393
aways-0.387
meal-0.375
Token Williams
Token ID6484
Feature activation-3.97e+00

Pos logit contributions
erness+1.224
independence+1.098
eenth+1.071
aways+1.067
meal+1.032
Neg logit contributions
beh-1.624
bered-1.615
dden-1.601
thora-1.583
untled-1.525
Token,
Token ID11
Feature activation-5.38e+00

Pos logit contributions
beh+3.353
bered+3.270
dden+3.258
thora+3.194
gged+3.089
Neg logit contributions
erness-2.576
independence-2.345
eenth-2.263
aways-2.228
meal-2.131
Token who
Token ID508
Feature activation+1.63e+00

Pos logit contributions
beh+3.930
bered+3.846
dden+3.759
thora+3.690
icably+3.634
Neg logit contributions
erness-4.043
independence-3.691
eenth-3.598
aways-3.532
meal-3.419
Token is
Token ID318
Feature activation-3.22e+00

Pos logit contributions
erness+1.143
independence+1.038
eenth+1.019
aways+1.003
meal+0.974
Neg logit contributions
beh-1.293
bered-1.272
dden-1.268
thora-1.251
icably-1.197
Token better
Token ID1365
Feature activation-1.67e+01

Pos logit contributions
beh+2.325
bered+2.294
dden+2.235
icably+2.186
thora+2.175
Neg logit contributions
erness-2.490
independence-2.282
eenth-2.253
aways-2.198
meal-2.112
Token known
Token ID1900
Feature activation-1.23e+00

Pos logit contributions
bered+7.774
beh+7.686
icably+7.169
dden+6.455
iddled+6.433
Neg logit contributions
erness-14.960
independence-13.977
eenth-13.780
meal-13.137
aways-13.087
Token for
Token ID329
Feature activation+4.04e-01

Pos logit contributions
beh+0.955
bered+0.941
dden+0.932
thora+0.916
icably+0.885
Neg logit contributions
erness-0.880
independence-0.804
eenth-0.788
aways-0.773
meal-0.750
Token his
Token ID465
Feature activation+5.01e-01

Pos logit contributions
erness+0.275
independence+0.249
eenth+0.244
aways+0.239
meal+0.232
Neg logit contributions
beh-0.331
bered-0.326
dden-0.324
thora-0.319
icably-0.308
Token dedication
Token ID22445
Feature activation-3.28e+00

Pos logit contributions
erness+0.363
independence+0.331
eenth+0.325
aways+0.320
meal+0.311
Neg logit contributions
beh-0.386
bered-0.380
dden-0.378
thora-0.373
icably-0.357
Token of
Token ID286
Feature activation-1.67e+00

Pos logit contributions
beh+2.714
bered+2.678
dden+2.668
thora+2.623
icably+2.529
Neg logit contributions
erness-2.179
independence-1.970
eenth-1.928
aways-1.891
meal-1.830
Token capabilities
Token ID9889
Feature activation-4.43e+00

Pos logit contributions
beh+0.354
bered+0.348
dden+0.346
thora+0.342
icably+0.327
Neg logit contributions
erness-0.352
independence-0.321
eenth-0.316
aways-0.311
meal-0.302
Token to
Token ID284
Feature activation+5.97e-01

Pos logit contributions
beh+3.518
bered+3.479
dden+3.451
thora+3.409
gged+3.258
Neg logit contributions
erness-3.056
independence-2.776
eenth-2.715
aways-2.670
meal-2.588
Token expand
Token ID4292
Feature activation-4.99e+00

Pos logit contributions
erness+0.450
independence+0.412
eenth+0.405
aways+0.398
meal+0.388
Neg logit contributions
beh-0.442
bered-0.436
dden-0.433
thora-0.427
icably-0.409
Token upon
Token ID2402
Feature activation-5.48e+00

Pos logit contributions
beh+3.702
dden+3.690
bered+3.675
thora+3.628
icably+3.503
Neg logit contributions
erness-3.639
independence-3.342
eenth-3.269
aways-3.211
meal-3.106
Token and
Token ID290
Feature activation-4.58e+00

Pos logit contributions
beh+4.074
dden+4.058
bered+3.998
thora+3.963
gged+3.811
Neg logit contributions
erness-3.994
independence-3.674
eenth-3.602
aways-3.540
meal-3.406
Token better
Token ID1365
Feature activation-1.67e+01

Pos logit contributions
bered+3.084
beh+3.037
dden+3.024
thora+3.009
icably+2.902
Neg logit contributions
erness-3.668
independence-3.371
eenth-3.306
aways-3.240
meal-3.150
Token Link
Token ID7502
Feature activation-3.04e+00

Pos logit contributions
dden+10.885
bered+10.676
beh+10.529
icably+10.219
thora+10.138
Neg logit contributions
erness-11.625
independence-10.630
eenth-10.212
aways-10.067
meal-9.651
Token to
Token ID284
Feature activation-1.99e+00

Pos logit contributions
beh+2.337
dden+2.295
bered+2.290
thora+2.260
gged+2.163
Neg logit contributions
erness-2.184
independence-2.002
eenth-1.972
aways-1.936
meal-1.872
Token the
Token ID262
Feature activation+2.34e+00

Pos logit contributions
beh+1.554
bered+1.540
dden+1.533
thora+1.501
icably+1.448
Neg logit contributions
erness-1.416
independence-1.290
eenth-1.263
aways-1.240
meal-1.206
Token Past
Token ID11303
Feature activation+9.59e-01

Pos logit contributions
erness+1.860
independence+1.700
eenth+1.686
aways+1.675
meal+1.615
Neg logit contributions
beh-1.644
bered-1.601
dden-1.580
thora-1.554
untled-1.507
Token in
Token ID287
Feature activation-6.09e+00

Pos logit contributions
erness+0.638
independence+0.576
eenth+0.561
aways+0.552
meal+0.537
Neg logit contributions
beh-0.797
bered-0.787
dden-0.781
thora-0.770
icably-0.745
Token to
Token ID284
Feature activation+5.19e+00

Pos logit contributions
erness+3.288
independence+2.968
eenth+2.852
aways+2.828
meal+2.780
Neg logit contributions
bered-4.539
beh-4.530
thora-4.505
dden-4.486
untled-4.328
Token complain
Token ID13121
Feature activation-2.33e+00

Pos logit contributions
erness+3.685
independence+3.384
eenth+3.304
aways+3.300
meal+3.215
Neg logit contributions
beh-3.949
bered-3.897
dden-3.854
thora-3.807
untled-3.709
Token the
Token ID262
Feature activation-1.54e+00

Pos logit contributions
beh+1.684
dden+1.644
bered+1.642
thora+1.614
icably+1.551
Neg logit contributions
erness-1.788
independence-1.637
eenth-1.614
aways-1.584
meal-1.532
Token country
Token ID1499
Feature activation-3.40e+00

Pos logit contributions
beh+1.077
bered+1.058
dden+1.051
thora+1.034
icably+0.988
Neg logit contributions
erness-1.226
independence-1.127
eenth-1.108
aways-1.092
meal-1.065
Token is
Token ID318
Feature activation-3.46e+00

Pos logit contributions
beh+2.768
bered+2.706
dden+2.680
thora+2.616
icably+2.560
Neg logit contributions
erness-2.309
independence-2.093
eenth-2.052
aways-2.002
meal-1.929
Token badly
Token ID11234
Feature activation-1.67e+01

Pos logit contributions
beh+2.555
bered+2.494
dden+2.466
thora+2.418
icably+2.371
Neg logit contributions
erness-2.624
independence-2.392
eenth-2.367
aways-2.300
meal-2.230
Token run
Token ID1057
Feature activation+1.36e+00

Pos logit contributions
beh+11.081
bered+10.480
dden+9.748
gged+9.738
icably+9.571
Neg logit contributions
erness-12.175
independence-11.011
eenth-10.933
liga-10.347
angel-9.998
Token,
Token ID11
Feature activation-4.49e-02

Pos logit contributions
erness+0.854
independence+0.765
eenth+0.748
aways+0.739
meal+0.714
Neg logit contributions
beh-1.177
bered-1.165
dden-1.158
thora-1.148
icably-1.104
Token if
Token ID611
Feature activation-3.23e+00

Pos logit contributions
beh+0.036
bered+0.035
dden+0.035
thora+0.035
icably+0.033
Neg logit contributions
erness-0.031
independence-0.028
eenth-0.028
aways-0.027
meal-0.027
Token we
Token ID356
Feature activation-2.65e+00

Pos logit contributions
beh+2.408
bered+2.368
dden+2.357
thora+2.303
icably+2.220
Neg logit contributions
erness-2.424
independence-2.204
eenth-2.182
aways-2.127
meal-2.076
Token don
Token ID836
Feature activation+2.82e+00

Pos logit contributions
beh+2.052
bered+2.020
dden+1.999
thora+1.963
icably+1.905
Neg logit contributions
erness-1.904
independence-1.742
eenth-1.700
aways-1.672
meal-1.623
Token reactors
Token ID28502
Feature activation-3.25e+00

Pos logit contributions
erness+0.776
independence+0.708
eenth+0.696
aways+0.685
meal+0.667
Neg logit contributions
beh-0.778
bered-0.767
dden-0.762
thora-0.751
icably-0.719
Token,
Token ID11
Feature activation-2.90e+00

Pos logit contributions
beh+2.538
bered+2.500
dden+2.463
thora+2.419
gged+2.345
Neg logit contributions
erness-2.313
independence-2.119
eenth-2.067
aways-2.011
meal-1.945
Token but
Token ID475
Feature activation-2.49e+00

Pos logit contributions
beh+2.250
bered+2.212
dden+2.165
thora+2.156
icably+2.087
Neg logit contributions
erness-2.080
independence-1.893
eenth-1.855
aways-1.813
meal-1.756
Token these
Token ID777
Feature activation-1.49e+00

Pos logit contributions
beh+1.890
bered+1.872
dden+1.834
thora+1.818
icably+1.762
Neg logit contributions
erness-1.826
independence-1.654
eenth-1.635
aways-1.597
meal-1.545
Token were
Token ID547
Feature activation-1.01e+00

Pos logit contributions
beh+1.112
bered+1.096
dden+1.075
thora+1.066
icably+1.029
Neg logit contributions
erness-1.121
independence-1.023
eenth-1.005
aways-0.988
meal-0.961
Token badly
Token ID11234
Feature activation-1.64e+01

Pos logit contributions
beh+0.745
bered+0.733
dden+0.724
thora+0.715
icably+0.690
Neg logit contributions
erness-0.771
independence-0.708
eenth-0.695
aways-0.680
meal-0.663
Token damaged
Token ID9694
Feature activation-9.24e-01

Pos logit contributions
beh+9.823
bered+9.653
icably+8.812
gged+8.736
dden+8.671
Neg logit contributions
erness-12.882
independence-12.118
eenth-11.385
angel-10.581
liga-10.540
Token during
Token ID1141
Feature activation-3.92e+00

Pos logit contributions
beh+0.757
bered+0.746
dden+0.738
thora+0.728
icably+0.702
Neg logit contributions
erness-0.628
independence-0.568
eenth-0.556
aways-0.543
meal-0.527
Token refuel
Token ID47874
Feature activation-3.06e+00

Pos logit contributions
beh+2.773
bered+2.718
dden+2.694
thora+2.651
icably+2.550
Neg logit contributions
erness-3.061
independence-2.794
eenth-2.771
aways-2.727
meal-2.651
Tokening
Token ID278
Feature activation-3.84e+00

Pos logit contributions
beh+2.759
bered+2.701
dden+2.673
thora+2.628
icably+2.576
Neg logit contributions
erness-1.755
independence-1.586
eenth-1.546
aways-1.502
meal-1.454
Token in
Token ID287
Feature activation-4.76e+00

Pos logit contributions
beh+2.979
bered+2.884
dden+2.844
thora+2.800
icably+2.720
Neg logit contributions
erness-2.765
independence-2.511
eenth-2.485
aways-2.405
meal-2.337
Token the
Token ID262
Feature activation-2.91e+00

Pos logit contributions
beh+2.553
bered+2.534
dden+2.504
thora+2.463
icably+2.364
Neg logit contributions
erness-2.494
independence-2.248
eenth-2.203
aways-2.188
meal-2.141
Token Civil
Token ID7511
Feature activation-2.09e-01

Pos logit contributions
beh+1.687
bered+1.659
dden+1.635
thora+1.602
icably+1.505
Neg logit contributions
erness-2.655
independence-2.452
eenth-2.414
aways-2.400
meal-2.352
Token War
Token ID1810
Feature activation-2.73e+00

Pos logit contributions
beh+0.110
bered+0.107
dden+0.105
thora+0.102
icably+0.097
Neg logit contributions
erness-0.203
independence-0.190
eenth-0.185
aways-0.185
meal-0.181
Token,
Token ID11
Feature activation-5.87e+00

Pos logit contributions
beh+2.246
bered+2.223
dden+2.191
thora+2.161
gged+2.079
Neg logit contributions
erness-1.820
independence-1.648
eenth-1.603
aways-1.583
meal-1.535
Token there
Token ID612
Feature activation-6.04e+00

Pos logit contributions
beh+4.366
bered+4.343
dden+4.221
thora+4.155
icably+4.061
Neg logit contributions
erness-4.289
independence-3.847
eenth-3.792
aways-3.720
meal-3.627
Token may
Token ID743
Feature activation-1.59e+01

Pos logit contributions
beh+4.227
bered+4.180
dden+4.100
thora+4.012
gged+3.886
Neg logit contributions
erness-4.643
independence-4.267
eenth-4.163
aways-4.020
meal-3.985
Token not
Token ID407
Feature activation-1.38e+01

Pos logit contributions
beh+8.609
bered+8.422
dden+8.375
icably+8.099
thora+7.880
Neg logit contributions
erness-12.838
independence-11.748
eenth-11.507
aways-11.354
meal-11.285
Token have
Token ID423
Feature activation-8.70e+00

Pos logit contributions
beh+7.878
dden+7.798
bered+7.782
icably+7.618
thora+7.296
Neg logit contributions
erness-11.291
independence-10.461
aways-10.078
meal-10.054
eenth-9.976
Token been
Token ID587
Feature activation-2.60e+00

Pos logit contributions
bered+4.917
beh+4.887
dden+4.736
icably+4.634
thora+4.519
Neg logit contributions
erness-7.791
independence-7.319
aways-6.981
eenth-6.967
meal-6.948
Token a
Token ID257
Feature activation-1.06e+00

Pos logit contributions
beh+1.870
bered+1.850
dden+1.820
thora+1.805
icably+1.741
Neg logit contributions
erness-2.009
independence-1.840
eenth-1.803
aways-1.774
meal-1.739
Token war
Token ID1175
Feature activation-2.07e-02

Pos logit contributions
beh+0.612
bered+0.604
dden+0.593
thora+0.584
icably+0.553
Neg logit contributions
erness-0.977
independence-0.910
eenth-0.895
aways-0.887
meal-0.866
Token.),
Token ID12179
Feature activation-8.19e-01

Pos logit contributions
erness+2.198
independence+1.870
aways+1.765
eenth+1.763
meal+1.700
Neg logit contributions
bered-4.738
beh-4.730
dden-4.663
thora-4.569
icably-4.498
Token who
Token ID508
Feature activation-4.20e+00

Pos logit contributions
beh+0.630
bered+0.621
dden+0.616
thora+0.607
icably+0.583
Neg logit contributions
erness-0.595
independence-0.543
eenth-0.532
aways-0.523
meal-0.509
Token was
Token ID373
Feature activation-2.59e+00

Pos logit contributions
beh+3.061
bered+3.023
dden+2.969
thora+2.887
icably+2.805
Neg logit contributions
erness-3.143
independence-2.922
eenth-2.831
aways-2.784
meal-2.686
Token among
Token ID1871
Feature activation+2.81e-01

Pos logit contributions
beh+1.879
bered+1.844
dden+1.829
thora+1.777
icably+1.735
Neg logit contributions
erness-1.998
independence-1.845
eenth-1.799
aways-1.776
meal-1.718
Token two
Token ID734
Feature activation-2.77e+00

Pos logit contributions
erness+0.198
independence+0.180
eenth+0.176
aways+0.174
meal+0.169
Neg logit contributions
beh-0.222
bered-0.219
dden-0.217
thora-0.215
icably-0.206
Token critically
Token ID19475
Feature activation-1.59e+01

Pos logit contributions
beh+1.908
bered+1.886
dden+1.858
thora+1.815
icably+1.760
Neg logit contributions
erness-2.213
independence-2.051
eenth-1.995
aways-1.960
meal-1.908
Token injured
Token ID6686
Feature activation-6.63e+00

Pos logit contributions
beh+8.265
bered+7.713
dden+7.651
icably+7.188
gged+7.086
Neg logit contributions
erness-14.123
independence-13.171
liga-12.741
eenth-12.723
meal-12.035
Token.
Token ID13
Feature activation+8.31e-01

Pos logit contributions
beh+5.162
bered+5.061
dden+4.996
thora+4.773
icably+4.690
Neg logit contributions
erness-4.622
independence-4.213
eenth-4.119
aways-3.924
meal-3.893
Token
Token ID198
Feature activation+1.25e+00

Pos logit contributions
erness+0.521
independence+0.468
eenth+0.459
aways+0.449
meal+0.435
Neg logit contributions
beh-0.722
bered-0.715
dden-0.709
thora-0.701
icably-0.676
Token
Token ID198
Feature activation+4.39e+00

Pos logit contributions
erness+0.823
independence+0.741
eenth+0.721
aways+0.705
meal+0.680
Neg logit contributions
beh-1.060
dden-1.038
bered-1.034
thora-1.011
gged-0.981
TokenDemocrats
Token ID29969
Feature activation+2.11e+00

Pos logit contributions
erness+2.734
independence+2.514
eenth+2.404
aways+2.381
meal+2.345
Neg logit contributions
bered-3.739
beh-3.712
thora-3.686
dden-3.674
untled-3.545
Token Owner
Token ID23853
Feature activation-1.15e-01

Pos logit contributions
erness+1.630
independence+1.470
eenth+1.455
aways+1.431
meal+1.387
Neg logit contributions
beh-2.009
bered-1.987
dden-1.965
thora-1.954
icably-1.870
Token Sylvia
Token ID45425
Feature activation+7.29e+00

Pos logit contributions
beh+0.085
bered+0.084
dden+0.084
thora+0.082
icably+0.079
Neg logit contributions
erness-0.086
independence-0.079
eenth-0.078
aways-0.077
meal-0.074
Token Cas
Token ID11294
Feature activation+3.56e+00

Pos logit contributions
erness+5.143
independence+4.648
eenth+4.562
aways+4.561
meal+4.506
Neg logit contributions
beh-5.387
bered-5.350
dden-5.271
thora-5.233
untled-5.094
Tokenares
Token ID3565
Feature activation+2.85e+00

Pos logit contributions
erness+2.645
independence+2.419
eenth+2.366
aways+2.362
meal+2.276
Neg logit contributions
beh-2.607
bered-2.605
dden-2.602
thora-2.550
icably-2.459
Token was
Token ID373
Feature activation+2.80e-01

Pos logit contributions
erness+1.936
independence+1.752
eenth+1.723
aways+1.701
meal+1.651
Neg logit contributions
beh-2.308
bered-2.279
dden-2.277
thora-2.248
untled-2.152
Token critically
Token ID19475
Feature activation-1.59e+01

Pos logit contributions
erness+0.210
independence+0.192
eenth+0.188
aways+0.185
meal+0.180
Neg logit contributions
beh-0.210
bered-0.207
dden-0.205
thora-0.202
icably-0.194
Token injured
Token ID6686
Feature activation-5.09e+00

Pos logit contributions
beh+8.175
bered+7.972
icably+7.681
dden+7.563
gged+7.292
Neg logit contributions
erness-13.893
independence-13.058
eenth-12.546
liga-12.120
aways-11.729
Token when
Token ID618
Feature activation-4.06e+00

Pos logit contributions
beh+3.667
bered+3.617
dden+3.523
thora+3.421
icably+3.360
Neg logit contributions
erness-3.913
independence-3.600
eenth-3.503
aways-3.406
meal-3.309
Token she
Token ID673
Feature activation-6.61e-01

Pos logit contributions
beh+3.172
bered+3.138
dden+3.097
thora+3.061
icably+2.946
Neg logit contributions
erness-2.877
independence-2.615
eenth-2.573
aways-2.509
meal-2.436
Token was
Token ID373
Feature activation-4.05e+00

Pos logit contributions
beh+0.496
bered+0.490
dden+0.484
thora+0.478
icably+0.459
Neg logit contributions
erness-0.494
independence-0.452
eenth-0.443
aways-0.435
meal-0.423
Token allegedly
Token ID7910
Feature activation-6.79e+00

Pos logit contributions
beh+2.523
bered+2.494
dden+2.427
thora+2.390
icably+2.330
Neg logit contributions
erness-3.522
independence-3.270
eenth-3.201
aways-3.136
meal-3.059
Token institution
Token ID9901
Feature activation+5.29e-02

Pos logit contributions
erness+0.888
independence+0.812
eenth+0.795
aways+0.784
meal+0.762
Neg logit contributions
beh-0.899
bered-0.886
dden-0.881
thora-0.869
icably-0.830
Token this
Token ID428
Feature activation-6.37e+00

Pos logit contributions
erness+0.035
independence+0.031
eenth+0.031
aways+0.030
meal+0.029
Neg logit contributions
beh-0.045
bered-0.044
dden-0.044
thora-0.043
icably-0.041
Token year
Token ID614
Feature activation-7.41e-01

Pos logit contributions
beh+4.513
bered+4.395
thora+4.292
dden+4.232
icably+4.189
Neg logit contributions
erness-4.913
independence-4.447
eenth-4.409
aways-4.332
meal-4.166
Token
Token ID564
Feature activation-9.46e-01

Pos logit contributions
beh+0.643
bered+0.633
dden+0.629
thora+0.621
icably+0.599
Neg logit contributions
erness-0.467
independence-0.419
eenth-0.410
aways-0.402
meal-0.388
Token
Token ID250
Feature activation-1.77e+00

Pos logit contributions
beh+0.657
bered+0.646
dden+0.642
thora+0.635
icably+0.604
Neg logit contributions
erness-0.757
independence-0.697
eenth-0.684
aways-0.678
meal-0.659
Tokenexc
Token ID41194
Feature activation-1.58e+01

Pos logit contributions
beh+1.481
bered+1.459
dden+1.444
thora+1.427
icably+1.375
Neg logit contributions
erness-1.165
independence-1.049
eenth-1.029
aways-1.010
meal-0.977
Tokenited
Token ID863
Feature activation-1.21e+00

Pos logit contributions
beh+8.699
bered+8.557
icably+8.216
dden+8.081
gged+7.918
Neg logit contributions
erness-12.989
eenth-12.268
independence-11.874
aways-11.496
liga-11.492
Token
Token ID447
Feature activation+1.86e+00

Pos logit contributions
beh+0.964
bered+0.950
dden+0.944
thora+0.931
icably+0.895
Neg logit contributions
erness-0.841
independence-0.765
eenth-0.751
aways-0.737
meal-0.715
Token
Token ID251
Feature activation+4.24e-01

Pos logit contributions
erness+1.323
independence+1.203
eenth+1.185
aways+1.157
meal+1.122
Neg logit contributions
beh-1.459
bered-1.436
dden-1.420
thora-1.398
untled-1.349
Token about
Token ID546
Feature activation+5.08e+00

Pos logit contributions
erness+0.277
independence+0.250
eenth+0.245
aways+0.240
meal+0.233
Neg logit contributions
beh-0.357
bered-0.352
dden-0.350
thora-0.346
icably-0.333
Token the
Token ID262
Feature activation+8.78e-01

Pos logit contributions
erness+3.323
independence+3.050
eenth+2.918
aways+2.889
meal+2.794
Neg logit contributions
beh-4.146
dden-4.136
bered-4.120
thora-4.084
gged-3.904
Token rushing
Token ID15795
Feature activation+6.35e+00

Pos logit contributions
erness+2.900
eenth+2.627
independence+2.612
aways+2.566
meal+2.452
Neg logit contributions
beh-3.713
dden-3.655
bered-3.642
thora-3.627
untled-3.475
Token touchdowns
Token ID17692
Feature activation+2.04e+00

Pos logit contributions
erness+4.084
aways+3.727
eenth+3.684
independence+3.654
meal+3.426
Neg logit contributions
beh-5.331
dden-5.197
thora-5.191
bered-5.156
untled-4.980
Token,
Token ID11
Feature activation+3.45e+00

Pos logit contributions
erness+1.264
independence+1.136
eenth+1.114
aways+1.097
meal+1.059
Neg logit contributions
beh-1.777
bered-1.751
dden-1.744
thora-1.726
icably-1.664
Token respectively
Token ID8148
Feature activation+3.87e+00

Pos logit contributions
erness+2.335
independence+2.119
eenth+2.077
aways+2.062
meal+1.990
Neg logit contributions
beh-2.787
dden-2.756
bered-2.751
thora-2.711
untled-2.601
Token.
Token ID13
Feature activation+2.10e+00

Pos logit contributions
erness+2.071
independence+1.816
eenth+1.786
aways+1.777
meal+1.696
Neg logit contributions
beh-3.628
bered-3.591
dden-3.573
thora-3.564
untled-3.461
Token Er
Token ID5256
Feature activation-1.58e+01

Pos logit contributions
erness+1.284
independence+1.145
eenth+1.133
aways+1.116
meal+1.072
Neg logit contributions
beh-1.846
bered-1.829
dden-1.809
thora-1.798
icably-1.730
Tokenrict
Token ID2012
Feature activation-1.32e+00

Pos logit contributions
beh+9.677
bered+9.442
thora+9.281
dden+8.996
gged+8.859
Neg logit contributions
erness-12.562
independence-11.585
eenth-11.572
aways-10.716
liga-10.628
Token R
Token ID371
Feature activation+1.26e+00

Pos logit contributions
beh+0.970
bered+0.955
dden+0.948
thora+0.931
icably+0.897
Neg logit contributions
erness-1.003
independence-0.919
eenth-0.896
aways-0.885
meal-0.862
Tokenhett
Token ID17442
Feature activation+3.78e+00

Pos logit contributions
erness+0.971
independence+0.889
eenth+0.872
aways+0.861
meal+0.843
Neg logit contributions
beh-0.916
bered-0.906
dden-0.896
thora-0.884
icably-0.844
Token eventually
Token ID4191
Feature activation+7.81e+00

Pos logit contributions
erness+2.600
independence+2.336
eenth+2.324
aways+2.296
meal+2.210
Neg logit contributions
beh-2.998
bered-2.947
dden-2.937
thora-2.929
untled-2.816
Token broke
Token ID6265
Feature activation-2.75e-01

Pos logit contributions
erness+5.123
aways+4.677
independence+4.674
eenth+4.639
meal+4.440
Neg logit contributions
beh-6.208
dden-6.116
thora-6.068
bered-5.951
untled-5.863
Token,
Token ID11
Feature activation-9.17e-01

Pos logit contributions
beh+3.666
dden+3.514
bered+3.485
thora+3.471
gged+3.381
Neg logit contributions
erness-3.848
independence-3.513
eenth-3.361
aways-3.342
meal-3.224
Token and
Token ID290
Feature activation-3.18e+00

Pos logit contributions
beh+0.725
bered+0.715
dden+0.710
thora+0.701
icably+0.673
Neg logit contributions
erness-0.646
independence-0.587
eenth-0.575
aways-0.566
meal-0.550
Token Tony
Token ID8832
Feature activation-7.23e+00

Pos logit contributions
beh+2.260
bered+2.244
dden+2.224
thora+2.177
icably+2.100
Neg logit contributions
erness-2.470
independence-2.268
eenth-2.225
aways-2.180
meal-2.131
Token Shal
Token ID47974
Feature activation+1.24e+00

Pos logit contributions
beh+5.139
dden+5.065
bered+5.014
thora+4.995
icably+4.823
Neg logit contributions
erness-5.524
independence-5.040
eenth-4.845
aways-4.815
meal-4.668
Tokenh
Token ID71
Feature activation+3.68e-03

Pos logit contributions
erness+0.649
independence+0.569
eenth+0.553
aways+0.535
meal+0.516
Neg logit contributions
beh-1.212
bered-1.189
dden-1.185
thora-1.170
gged-1.133
Tokenoub
Token ID12944
Feature activation-1.58e+01

Pos logit contributions
erness+0.001
independence+0.001
eenth+0.001
aways+0.001
meal+0.001
Neg logit contributions
beh-0.004
bered-0.004
dden-0.004
thora-0.004
icably-0.004
Token,
Token ID11
Feature activation-7.83e+00

Pos logit contributions
bered+8.621
beh+8.458
dden+8.345
gged+7.947
thora+7.685
Neg logit contributions
erness-12.653
independence-11.757
eenth-11.193
aways-11.158
meal-11.049
Token who
Token ID508
Feature activation-1.50e+00

Pos logit contributions
beh+5.046
bered+5.017
dden+4.960
thora+4.802
gged+4.712
Neg logit contributions
erness-6.372
independence-5.914
eenth-5.716
aways-5.648
meal-5.494
Token won
Token ID1839
Feature activation-7.70e+00

Pos logit contributions
beh+1.313
bered+1.312
dden+1.291
thora+1.267
gged+1.239
Neg logit contributions
erness-0.934
independence-0.844
eenth-0.812
aways-0.795
meal-0.770
Token three
Token ID1115
Feature activation-4.72e+00

Pos logit contributions
bered+5.107
beh+5.067
dden+4.914
thora+4.848
icably+4.723
Neg logit contributions
erness-6.055
independence-5.585
eenth-5.447
aways-5.280
meal-5.231
Token times
Token ID1661
Feature activation-3.68e+00

Pos logit contributions
bered+3.749
beh+3.744
dden+3.623
thora+3.580
icably+3.534
Neg logit contributions
erness-3.219
independence-2.932
eenth-2.789
aways-2.710
meal-2.690
Token programme
Token ID11383
Feature activation+1.61e+00

Pos logit contributions
beh+1.659
bered+1.643
dden+1.621
thora+1.589
icably+1.529
Neg logit contributions
erness-1.787
independence-1.649
eenth-1.623
aways-1.586
meal-1.550
Token "
Token ID366
Feature activation-2.26e+00

Pos logit contributions
erness+1.181
independence+1.078
eenth+1.058
aways+1.050
meal+1.019
Neg logit contributions
beh-1.223
bered-1.203
dden-1.202
thora-1.184
icably-1.128
TokenNormally
Token ID43625
Feature activation-8.27e+00

Pos logit contributions
beh+1.888
bered+1.841
dden+1.836
thora+1.808
icably+1.748
Neg logit contributions
erness-1.493
independence-1.334
eenth-1.324
aways-1.284
meal-1.245
Token children
Token ID1751
Feature activation+1.64e+00

Pos logit contributions
beh+6.246
bered+6.089
dden+6.043
thora+5.827
icably+5.815
Neg logit contributions
erness-5.775
independence-5.242
eenth-5.185
aways-4.942
meal-4.917
Token that
Token ID326
Feature activation-1.07e-01

Pos logit contributions
erness+1.173
independence+1.070
eenth+1.050
aways+1.034
meal+1.004
Neg logit contributions
beh-1.283
bered-1.263
dden-1.256
thora-1.241
icably-1.188
Token brake
Token ID20439
Feature activation-1.56e+01

Pos logit contributions
beh+0.080
bered+0.079
dden+0.078
thora+0.077
icably+0.074
Neg logit contributions
erness-0.080
independence-0.073
eenth-0.072
aways-0.071
meal-0.069
Token late
Token ID2739
Feature activation-3.85e+00

Pos logit contributions
beh+9.758
bered+9.636
icably+8.992
edy+8.765
dden+8.676
Neg logit contributions
erness-11.472
independence-10.417
eenth-10.025
meal-9.805
liga-9.736
Token go
Token ID467
Feature activation-2.32e+00

Pos logit contributions
beh+2.861
bered+2.823
dden+2.753
thora+2.657
icably+2.625
Neg logit contributions
erness-2.898
independence-2.649
eenth-2.569
aways-2.513
meal-2.452
Token off
Token ID572
Feature activation-8.45e+00

Pos logit contributions
beh+1.778
bered+1.760
dden+1.741
thora+1.715
gged+1.652
Neg logit contributions
erness-1.690
independence-1.548
eenth-1.516
aways-1.464
meal-1.437
Token the
Token ID262
Feature activation-5.49e+00

Pos logit contributions
beh+6.121
bered+6.005
dden+5.836
thora+5.665
icably+5.638
Neg logit contributions
erness-6.244
independence-5.722
eenth-5.589
aways-5.322
meal-5.209
Token track
Token ID2610
Feature activation-3.49e-01

Pos logit contributions
beh+3.632
bered+3.542
dden+3.502
thora+3.366
gged+3.301
Neg logit contributions
erness-4.501
independence-4.123
eenth-4.082
aways-3.935
meal-3.892
Token you
Token ID345
Feature activation+2.26e+00

Pos logit contributions
erness+4.995
independence+4.580
aways+4.422
eenth+4.330
meal+4.310
Neg logit contributions
beh-7.027
bered-6.961
dden-6.824
thora-6.819
untled-6.688
Token
Token ID447
Feature activation+1.60e+00

Pos logit contributions
erness+1.669
independence+1.508
eenth+1.486
aways+1.464
meal+1.425
Neg logit contributions
beh-1.701
dden-1.676
bered-1.675
thora-1.655
icably-1.586
Token
Token ID247
Feature activation+4.75e-01

Pos logit contributions
erness+1.330
independence+1.227
eenth+1.205
aways+1.193
meal+1.162
Neg logit contributions
beh-1.065
bered-1.047
dden-1.041
thora-1.023
icably-0.973
Tokenre
Token ID260
Feature activation+1.14e+00

Pos logit contributions
erness+0.277
independence+0.247
eenth+0.242
aways+0.236
meal+0.229
Neg logit contributions
beh-0.431
bered-0.427
dden-0.425
thora-0.421
icably-0.406
Token talking
Token ID3375
Feature activation+4.46e+00

Pos logit contributions
erness+0.950
independence+0.875
eenth+0.862
aways+0.851
meal+0.831
Neg logit contributions
beh-0.757
bered-0.744
dden-0.741
thora-0.732
icably-0.693
Token about
Token ID546
Feature activation+9.00e+00

Pos logit contributions
erness+3.443
independence+3.154
eenth+3.117
aways+3.072
meal+3.001
Neg logit contributions
beh-3.181
bered-3.135
dden-3.128
thora-3.067
untled-2.944
Token.
Token ID13
Feature activation+5.82e+00

Pos logit contributions
erness+4.600
independence+3.995
eenth+3.990
aways+3.883
meal+3.764
Neg logit contributions
beh-8.194
thora-8.182
bered-8.094
dden-8.089
untled-7.937
Token Like
Token ID4525
Feature activation+3.55e+00

Pos logit contributions
erness+3.640
eenth+3.266
independence+3.265
aways+3.201
meal+3.065
Neg logit contributions
bered-4.904
beh-4.889
thora-4.796
dden-4.786
untled-4.643
Token when
Token ID618
Feature activation+1.92e+00

Pos logit contributions
erness+2.218
independence+1.993
eenth+1.946
aways+1.924
meal+1.861
Neg logit contributions
beh-3.045
bered-3.010
dden-2.992
thora-2.971
untled-2.862
Token I
Token ID314
Feature activation+1.37e+00

Pos logit contributions
erness+1.228
independence+1.107
eenth+1.080
aways+1.065
meal+1.027
Neg logit contributions
beh-1.634
bered-1.614
dden-1.606
thora-1.588
icably-1.527
Token talk
Token ID1561
Feature activation-3.15e+00

Pos logit contributions
erness+0.955
independence+0.865
eenth+0.851
aways+0.837
meal+0.813
Neg logit contributions
beh-1.093
bered-1.076
dden-1.072
thora-1.060
icably-1.016
Token to
Token ID284
Feature activation+4.87e+00

Pos logit contributions
erness+2.852
independence+2.613
eenth+2.538
aways+2.535
meal+2.478
Neg logit contributions
beh-3.418
bered-3.349
dden-3.349
thora-3.325
untled-3.174
Token the
Token ID262
Feature activation+5.03e+00

Pos logit contributions
erness+3.537
independence+3.327
eenth+3.184
aways+3.175
meal+3.049
Neg logit contributions
beh-3.619
bered-3.552
dden-3.549
thora-3.513
untled-3.347
Token contrary
Token ID10388
Feature activation+8.52e+00

Pos logit contributions
erness+3.233
independence+2.959
eenth+2.846
aways+2.798
meal+2.741
Neg logit contributions
beh-4.150
bered-4.145
dden-4.120
thora-4.058
untled-3.934
Token,
Token ID11
Feature activation+5.06e+00

Pos logit contributions
erness+4.526
independence+4.051
aways+3.960
eenth+3.881
meal+3.844
Neg logit contributions
thora-7.593
beh-7.587
bered-7.518
dden-7.495
untled-7.290
Token suspect
Token ID4099
Feature activation+3.60e+00

Pos logit contributions
erness+3.291
independence+3.002
aways+2.914
eenth+2.912
meal+2.831
Neg logit contributions
beh-4.132
dden-4.098
bered-4.084
thora-4.073
untled-3.880
Token at
Token ID379
Feature activation+9.89e+00

Pos logit contributions
erness+2.117
independence+1.880
eenth+1.826
aways+1.794
meal+1.740
Neg logit contributions
beh-3.244
bered-3.207
dden-3.197
thora-3.187
untled-3.046
Token best
Token ID1266
Feature activation-5.90e-02

Pos logit contributions
erness+4.932
eenth+4.621
independence+4.597
aways+4.311
meal+4.249
Neg logit contributions
bered-8.949
beh-8.934
thora-8.848
dden-8.843
untled-8.649
Token.
Token ID13
Feature activation+3.00e+00

Pos logit contributions
beh+0.052
bered+0.052
dden+0.051
thora+0.051
icably+0.049
Neg logit contributions
erness-0.036
independence-0.032
eenth-0.032
aways-0.031
meal-0.030
TokenDean
Token ID36372
Feature activation+3.78e+00

Pos logit contributions
erness+2.044
independence+1.861
eenth+1.821
aways+1.800
meal+1.735
Neg logit contributions
beh-2.425
bered-2.412
dden-2.383
thora-2.366
icably-2.266
Token Baker
Token ID14372
Feature activation-1.27e+00

Pos logit contributions
erness+2.702
independence+2.458
eenth+2.404
aways+2.384
meal+2.304
Neg logit contributions
beh-2.934
bered-2.910
dden-2.902
thora-2.852
untled-2.746
Token is
Token ID318
Feature activation+1.10e+00

Pos logit contributions
beh+1.018
bered+0.995
dden+0.990
thora+0.972
icably+0.941
Neg logit contributions
erness-0.880
independence-0.802
eenth-0.782
aways-0.765
meal-0.741
Token.
Token ID13
Feature activation+3.08e+00

Pos logit contributions
erness+3.801
independence+3.434
eenth+3.388
aways+3.338
meal+3.255
Neg logit contributions
beh-4.644
dden-4.597
bered-4.597
thora-4.552
gged-4.361
Token
Token ID198
Feature activation-7.54e-01

Pos logit contributions
erness+1.970
independence+1.772
eenth+1.761
aways+1.726
meal+1.652
Neg logit contributions
beh-2.606
bered-2.591
dden-2.562
thora-2.543
untled-2.440
Token
Token ID198
Feature activation-1.95e+00

Pos logit contributions
beh+0.610
dden+0.596
bered+0.587
thora+0.570
gged+0.558
Neg logit contributions
erness-0.527
independence-0.479
eenth-0.463
aways-0.451
meal-0.438
TokenHe
Token ID1544
Feature activation+2.63e-01

Pos logit contributions
beh+1.590
bered+1.548
dden+1.545
thora+1.511
icably+1.462
Neg logit contributions
erness-1.335
independence-1.194
eenth-1.183
aways-1.152
meal-1.123
Token shares
Token ID7303
Feature activation+5.23e+00

Pos logit contributions
erness+0.181
independence+0.165
eenth+0.161
aways+0.158
meal+0.154
Neg logit contributions
beh-0.213
bered-0.210
dden-0.208
thora-0.206
icably-0.198
Token what
Token ID644
Feature activation+1.11e+01

Pos logit contributions
erness+3.457
independence+3.113
aways+3.077
eenth+3.057
meal+2.938
Neg logit contributions
beh-4.222
bered-4.206
dden-4.134
thora-4.129
icably-3.941
Token he
Token ID339
Feature activation+7.08e+00

Pos logit contributions
erness+6.629
independence+6.100
aways+6.016
eenth+5.707
meal+5.600
Neg logit contributions
beh-8.901
bered-8.848
dden-8.742
thora-8.650
icably-8.402
Token
Token ID447
Feature activation+8.28e-01

Pos logit contributions
erness+4.529
eenth+3.999
independence+3.966
aways+3.923
meal+3.747
Neg logit contributions
beh-5.827
dden-5.783
thora-5.729
bered-5.716
untled-5.526
Token
Token ID247
Feature activation+2.86e+00

Pos logit contributions
erness+0.713
independence+0.658
eenth+0.645
aways+0.643
meal+0.625
Neg logit contributions
beh-0.530
dden-0.521
bered-0.521
thora-0.510
gged-0.486
Tokens
Token ID82
Feature activation+8.05e+00

Pos logit contributions
erness+1.442
independence+1.268
eenth+1.241
aways+1.194
meal+1.137
Neg logit contributions
beh-2.799
bered-2.774
dden-2.764
thora-2.752
untled-2.644
Token learned
Token ID4499
Feature activation+5.28e+00

Pos logit contributions
erness+4.928
eenth+4.363
independence+4.309
aways+4.295
meal+4.194
Neg logit contributions
beh-6.699
dden-6.643
thora-6.618
bered-6.534
untled-6.364
Token country
Token ID1499
Feature activation-4.52e+00

Pos logit contributions
beh+2.142
bered+2.103
dden+2.049
thora+1.993
icably+1.961
Neg logit contributions
erness-2.232
independence-2.046
eenth-2.045
aways-2.041
meal-1.922
Token that
Token ID326
Feature activation+1.14e-01

Pos logit contributions
beh+3.451
bered+3.422
dden+3.340
thora+3.246
icably+3.167
Neg logit contributions
erness-3.288
independence-2.964
eenth-2.956
aways-2.875
meal-2.731
Token ensures
Token ID19047
Feature activation+1.38e+00

Pos logit contributions
erness+0.080
independence+0.073
eenth+0.072
aways+0.070
meal+0.068
Neg logit contributions
beh-0.090
bered-0.089
dden-0.088
thora-0.086
icably-0.084
Token equal
Token ID4961
Feature activation+4.98e+00

Pos logit contributions
erness+0.998
independence+0.914
eenth+0.893
aways+0.878
meal+0.855
Neg logit contributions
beh-1.070
bered-1.054
dden-1.049
thora-1.035
icably-0.990
Token and
Token ID290
Feature activation+3.77e+00

Pos logit contributions
erness+2.592
independence+2.297
eenth+2.173
aways+2.151
meal+2.108
Neg logit contributions
beh-4.758
dden-4.749
bered-4.714
thora-4.696
untled-4.573
Token full
Token ID1336
Feature activation+9.10e+00

Pos logit contributions
erness+2.441
independence+2.235
eenth+2.163
aways+2.127
meal+2.066
Neg logit contributions
beh-3.161
dden-3.111
bered-3.107
thora-3.079
untled-2.956
Token rights
Token ID2489
Feature activation+4.76e-01

Pos logit contributions
erness+5.764
independence+5.354
eenth+5.026
aways+4.936
meal+4.913
Neg logit contributions
thora-7.365
dden-7.348
bered-7.307
beh-7.303
untled-7.218
Token for
Token ID329
Feature activation+1.00e+00

Pos logit contributions
erness+0.332
independence+0.302
eenth+0.295
aways+0.291
meal+0.284
Neg logit contributions
beh-0.380
bered-0.374
dden-0.372
thora-0.368
icably-0.354
Token all
Token ID477
Feature activation+1.44e+00

Pos logit contributions
erness+0.657
independence+0.594
eenth+0.581
aways+0.570
meal+0.552
Neg logit contributions
beh-0.843
bered-0.832
dden-0.827
thora-0.817
icably-0.786
Token its
Token ID663
Feature activation-6.22e-01

Pos logit contributions
erness+0.996
independence+0.904
eenth+0.884
aways+0.873
meal+0.847
Neg logit contributions
beh-1.149
bered-1.130
dden-1.122
thora-1.115
icably-1.069
Token citizens
Token ID4290
Feature activation-4.68e-01

Pos logit contributions
beh+0.393
bered+0.387
dden+0.384
thora+0.373
icably+0.357
Neg logit contributions
erness-0.538
independence-0.498
eenth-0.491
aways-0.483
meal-0.472
Token of
Token ID286
Feature activation+3.91e+00

Pos logit contributions
erness+1.442
independence+1.306
eenth+1.284
aways+1.276
meal+1.242
Neg logit contributions
beh-1.774
bered-1.741
thora-1.738
dden-1.718
icably-1.657
Token a
Token ID257
Feature activation-2.62e-01

Pos logit contributions
erness+2.797
independence+2.552
aways+2.490
eenth+2.489
meal+2.388
Neg logit contributions
beh-2.995
bered-2.956
dden-2.930
thora-2.928
untled-2.792
Token besieged
Token ID41680
Feature activation+4.74e+00

Pos logit contributions
beh+0.190
bered+0.187
dden+0.185
thora+0.182
icably+0.174
Neg logit contributions
erness-0.202
independence-0.186
eenth-0.183
aways-0.180
meal-0.175
Token city
Token ID1748
Feature activation+1.87e+00

Pos logit contributions
erness+3.584
independence+3.229
eenth+3.174
aways+3.158
meal+3.064
Neg logit contributions
beh-3.460
dden-3.398
thora-3.386
bered-3.382
untled-3.235
Token,
Token ID11
Feature activation-2.70e-01

Pos logit contributions
erness+1.180
independence+1.057
eenth+1.030
aways+1.018
meal+0.987
Neg logit contributions
beh-1.609
bered-1.583
dden-1.579
thora-1.571
icably-1.506
Token even
Token ID772
Feature activation+1.24e+01

Pos logit contributions
beh+0.216
bered+0.213
dden+0.212
thora+0.209
icably+0.201
Neg logit contributions
erness-0.188
independence-0.171
eenth-0.167
aways-0.164
meal-0.160
Token if
Token ID611
Feature activation-1.85e+00

Pos logit contributions
erness+6.700
aways+5.951
independence+5.876
eenth+5.829
meal+5.723
Neg logit contributions
thora-10.685
beh-10.330
dden-10.325
bered-10.208
untled-10.159
Token it
Token ID340
Feature activation+2.14e+00

Pos logit contributions
beh+1.348
bered+1.329
dden+1.320
thora+1.280
icably+1.240
Neg logit contributions
erness-1.417
independence-1.300
eenth-1.279
aways-1.248
meal-1.217
Token were
Token ID547
Feature activation+3.25e+00

Pos logit contributions
erness+1.482
independence+1.339
eenth+1.322
aways+1.301
meal+1.262
Neg logit contributions
beh-1.712
dden-1.689
bered-1.685
thora-1.677
untled-1.598
Token innocent
Token ID10218
Feature activation-8.32e-01

Pos logit contributions
erness+2.435
independence+2.219
eenth+2.190
aways+2.164
meal+2.101
Neg logit contributions
beh-2.410
bered-2.363
dden-2.361
thora-2.354
untled-2.241
Token of
Token ID286
Feature activation+7.40e-01

Pos logit contributions
beh+0.674
bered+0.665
dden+0.662
thora+0.647
icably+0.625
Neg logit contributions
erness-0.570
independence-0.520
eenth-0.508
aways-0.497
meal-0.482
Token for
Token ID329
Feature activation+2.80e+00

Pos logit contributions
beh+1.601
bered+1.576
dden+1.560
thora+1.524
icably+1.465
Neg logit contributions
erness-1.614
independence-1.484
eenth-1.455
aways-1.421
meal-1.385
Token foreign
Token ID3215
Feature activation-1.34e+00

Pos logit contributions
erness+1.962
independence+1.800
eenth+1.736
aways+1.727
meal+1.658
Neg logit contributions
beh-2.190
dden-2.176
bered-2.168
thora-2.152
icably-2.038
Token engagements
Token ID40588
Feature activation+3.47e-01

Pos logit contributions
beh+0.891
bered+0.873
dden+0.865
thora+0.843
gged+0.808
Neg logit contributions
erness-1.114
independence-1.027
eenth-1.019
aways-1.001
meal-0.980
Token as
Token ID355
Feature activation+4.03e-01

Pos logit contributions
erness+0.219
independence+0.197
eenth+0.193
aways+0.189
meal+0.183
Neg logit contributions
beh-0.299
bered-0.295
dden-0.294
thora-0.290
icably-0.279
Token has
Token ID468
Feature activation+7.02e-01

Pos logit contributions
erness+0.313
independence+0.287
eenth+0.282
aways+0.278
meal+0.271
Neg logit contributions
beh-0.289
bered-0.285
dden-0.284
thora-0.279
icably-0.266
Token been
Token ID587
Feature activation+1.97e+00

Pos logit contributions
erness+0.573
independence+0.528
eenth+0.519
aways+0.512
meal+0.500
Neg logit contributions
beh-0.476
bered-0.467
dden-0.465
thora-0.458
icably-0.435
Token suggested
Token ID5220
Feature activation-1.97e+00

Pos logit contributions
erness+1.485
independence+1.360
eenth+1.338
aways+1.317
meal+1.283
Neg logit contributions
beh-1.455
bered-1.430
dden-1.424
thora-1.406
icably-1.340
Token by
Token ID416
Feature activation-2.60e-02

Pos logit contributions
beh+1.626
bered+1.603
dden+1.592
thora+1.562
icably+1.507
Neg logit contributions
erness-1.318
independence-1.194
eenth-1.165
aways-1.140
meal-1.104
Token officials
Token ID2828
Feature activation+1.01e-01

Pos logit contributions
beh+0.020
bered+0.019
dden+0.019
thora+0.019
icably+0.018
Neg logit contributions
erness-0.019
independence-0.018
eenth-0.017
aways-0.017
meal-0.016
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
erness+0.071
independence+0.065
eenth+0.064
aways+0.063
meal+0.061
Neg logit contributions
beh-0.080
bered-0.079
dden-0.078
thora-0.077
icably-0.074
Token
Token ID198
Feature activation-4.12e-01

Pos logit contributions
beh+1.135
bered+1.111
dden+1.108
thora+1.086
gged+1.051
Neg logit contributions
erness-0.936
independence-0.848
eenth-0.824
aways-0.809
meal-0.790
Token-
Token ID12
Feature activation-1.53e+00

Pos logit contributions
erness+0.290
independence+0.252
eenth+0.244
aways+0.234
meal+0.226
Neg logit contributions
beh-0.543
dden-0.537
bered-0.535
thora-0.522
gged-0.511
Token3
Token ID18
Feature activation-2.54e+00

Pos logit contributions
beh+1.332
bered+1.311
dden+1.304
thora+1.282
icably+1.239
Neg logit contributions
erness-0.964
independence-0.864
eenth-0.846
aways-0.824
meal-0.800
Token power
Token ID1176
Feature activation+3.95e+00

Pos logit contributions
bered+1.823
beh+1.822
dden+1.793
thora+1.759
gged+1.699
Neg logit contributions
erness-1.946
independence-1.792
eenth-1.739
aways-1.694
meal-1.665
Token play
Token ID711
Feature activation+1.86e+00

Pos logit contributions
erness+2.933
eenth+2.673
independence+2.672
aways+2.654
meal+2.542
Neg logit contributions
beh-2.943
dden-2.878
bered-2.877
thora-2.837
icably-2.700
Token goal
Token ID3061
Feature activation-3.16e+00

Pos logit contributions
erness+1.311
independence+1.195
eenth+1.173
aways+1.168
meal+1.128
Neg logit contributions
beh-1.467
dden-1.443
bered-1.438
thora-1.419
untled-1.355
Token against
Token ID1028
Feature activation+2.15e+00

Pos logit contributions
beh+2.415
bered+2.392
dden+2.368
thora+2.324
gged+2.250
Neg logit contributions
erness-2.308
independence-2.078
eenth-2.028
aways-1.984
meal-1.928
Token Pittsburgh
Token ID12767
Feature activation-4.81e+00

Pos logit contributions
erness+1.387
independence+1.253
eenth+1.230
aways+1.219
meal+1.172
Neg logit contributions
beh-1.821
bered-1.794
dden-1.781
thora-1.762
untled-1.693
Token,
Token ID11
Feature activation-2.11e+00

Pos logit contributions
beh+3.757
bered+3.709
dden+3.666
thora+3.580
gged+3.502
Neg logit contributions
erness-3.409
independence-3.114
eenth-2.989
aways-2.917
meal-2.874
Token they
Token ID484
Feature activation-1.35e+00

Pos logit contributions
beh+1.617
bered+1.591
dden+1.582
thora+1.557
icably+1.502
Neg logit contributions
erness-1.542
independence-1.413
eenth-1.376
aways-1.341
meal-1.314
Token hadn
Token ID8020
Feature activation+5.13e+00

Pos logit contributions
beh+1.069
bered+1.057
dden+1.043
thora+1.032
gged+0.994
Neg logit contributions
erness-0.952
independence-0.870
eenth-0.839
aways-0.825
meal-0.809
Token
Token ID447
Feature activation-5.56e-01

Pos logit contributions
erness+2.669
independence+2.346
eenth+2.301
aways+2.246
meal+2.149
Neg logit contributions
beh-4.935
bered-4.915
dden-4.859
thora-4.835
untled-4.664
Token in
Token ID287
Feature activation-4.87e+00

Pos logit contributions
beh+2.670
bered+2.619
dden+2.603
thora+2.549
icably+2.464
Neg logit contributions
erness-2.591
independence-2.319
eenth-2.258
aways-2.254
meal-2.180
Token May
Token ID1737
Feature activation+3.20e+00

Pos logit contributions
beh+2.986
bered+2.851
dden+2.823
thora+2.716
icably+2.651
Neg logit contributions
erness-4.327
independence-3.903
eenth-3.890
aways-3.874
meal-3.750
Token.
Token ID13
Feature activation+2.75e+00

Pos logit contributions
erness+2.027
independence+1.838
eenth+1.808
aways+1.762
meal+1.723
Neg logit contributions
beh-2.700
bered-2.685
dden-2.676
thora-2.633
icably-2.544
Token
Token ID198
Feature activation+1.36e+00

Pos logit contributions
erness+1.686
independence+1.516
eenth+1.497
aways+1.458
meal+1.419
Neg logit contributions
beh-2.397
bered-2.384
dden-2.353
thora-2.330
icably-2.254
Token
Token ID198
Feature activation+1.09e+00

Pos logit contributions
erness+0.884
independence+0.795
eenth+0.775
aways+0.759
meal+0.733
Neg logit contributions
beh-1.161
dden-1.138
bered-1.134
thora-1.110
gged-1.075
Token
Token ID447
Feature activation+2.62e+00

Pos logit contributions
erness+0.691
independence+0.625
eenth+0.608
aways+0.598
meal+0.581
Neg logit contributions
beh-0.931
bered-0.922
dden-0.915
thora-0.905
icably-0.872
Token
Token ID250
Feature activation+2.74e+00

Pos logit contributions
erness+2.071
independence+1.896
eenth+1.880
aways+1.839
meal+1.773
Neg logit contributions
beh-1.847
bered-1.805
dden-1.778
thora-1.732
untled-1.685
TokenWe
Token ID1135
Feature activation-2.17e+00

Pos logit contributions
erness+1.820
independence+1.663
eenth+1.617
aways+1.589
meal+1.549
Neg logit contributions
beh-2.251
bered-2.237
dden-2.216
thora-2.192
icably-2.106
Token built
Token ID3170
Feature activation-4.30e+00

Pos logit contributions
beh+1.609
bered+1.599
dden+1.570
thora+1.542
gged+1.499
Neg logit contributions
erness-1.620
independence-1.498
eenth-1.444
aways-1.430
meal-1.391
Token it
Token ID340
Feature activation-8.75e-03

Pos logit contributions
beh+3.176
bered+3.110
dden+3.074
thora+3.036
icably+2.935
Neg logit contributions
erness-3.252
independence-2.953
eenth-2.891
aways-2.826
meal-2.746
Token entirely
Token ID5000
Feature activation+5.59e-01

Pos logit contributions
beh+0.007
bered+0.007
dden+0.007
thora+0.007
icably+0.007
Neg logit contributions
erness-0.006
independence-0.005
eenth-0.005
aways-0.005
meal-0.005
Token Roh
Token ID32694
Feature activation+1.39e-01

Pos logit contributions
erness+1.732
independence+1.586
eenth+1.553
aways+1.531
meal+1.484
Neg logit contributions
beh-1.953
bered-1.929
dden-1.924
thora-1.892
icably-1.812
Tokent
Token ID83
Feature activation+1.26e+00

Pos logit contributions
erness+0.097
independence+0.088
eenth+0.086
aways+0.085
meal+0.082
Neg logit contributions
beh-0.111
bered-0.110
dden-0.109
thora-0.107
icably-0.103
Tokenak
Token ID461
Feature activation-1.42e+00

Pos logit contributions
erness+0.825
independence+0.746
eenth+0.729
aways+0.711
meal+0.691
Neg logit contributions
beh-1.072
bered-1.053
dden-1.048
thora-1.033
gged-0.993
Token city
Token ID1748
Feature activation+1.91e+00

Pos logit contributions
beh+1.147
bered+1.126
dden+1.112
thora+1.097
icably+1.059
Neg logit contributions
erness-0.986
independence-0.886
eenth-0.875
aways-0.854
meal-0.829
Token which
Token ID543
Feature activation-2.99e-01

Pos logit contributions
erness+1.265
independence+1.147
eenth+1.121
aways+1.104
meal+1.070
Neg logit contributions
beh-1.584
bered-1.562
dden-1.557
thora-1.539
icably-1.476
Token continued
Token ID3767
Feature activation+1.33e+00

Pos logit contributions
beh+0.233
bered+0.230
dden+0.227
thora+0.224
icably+0.216
Neg logit contributions
erness-0.215
independence-0.195
eenth-0.191
aways-0.188
meal-0.183
Token on
Token ID319
Feature activation+1.13e+00

Pos logit contributions
erness+0.933
independence+0.850
eenth+0.835
aways+0.822
meal+0.799
Neg logit contributions
beh-1.052
bered-1.042
dden-1.035
thora-1.021
gged-0.979
Token 19
Token ID678
Feature activation+3.66e-01

Pos logit contributions
erness+0.790
independence+0.721
eenth+0.706
aways+0.696
meal+0.677
Neg logit contributions
beh-0.891
bered-0.880
dden-0.873
thora-0.864
icably-0.828
Token,
Token ID11
Feature activation-1.19e+00

Pos logit contributions
erness+0.214
independence+0.192
eenth+0.188
aways+0.182
meal+0.176
Neg logit contributions
beh-0.331
bered-0.329
dden-0.327
thora-0.323
icably-0.312
Token 20
Token ID1160
Feature activation-1.91e+00

Pos logit contributions
beh+0.965
bered+0.949
dden+0.943
thora+0.928
icably+0.896
Neg logit contributions
erness-0.821
independence-0.740
eenth-0.724
aways-0.713
meal-0.692
Token,
Token ID11
Feature activation-3.48e+00

Pos logit contributions
beh+1.600
bered+1.577
dden+1.566
thora+1.547
icably+1.489
Neg logit contributions
erness-1.255
independence-1.131
eenth-1.105
aways-1.088
meal-1.052
Token is
Token ID318
Feature activation+2.51e+00

Pos logit contributions
erness+1.084
independence+0.988
eenth+0.961
aways+0.949
meal+0.923
Neg logit contributions
beh-1.273
bered-1.264
dden-1.258
thora-1.237
untled-1.190
Token formed
Token ID7042
Feature activation+9.45e+00

Pos logit contributions
erness+1.733
independence+1.589
eenth+1.544
aways+1.532
meal+1.488
Neg logit contributions
beh-1.991
dden-1.970
bered-1.969
thora-1.943
untled-1.856
Token of
Token ID286
Feature activation+6.93e+00

Pos logit contributions
erness+4.921
independence+4.569
eenth+4.479
aways+4.422
meal+4.379
Neg logit contributions
beh-8.358
bered-8.338
dden-8.338
thora-8.335
untled-8.068
Token RT
Token ID11923
Feature activation+7.92e+00

Pos logit contributions
erness+4.564
independence+4.166
aways+4.090
eenth+4.028
meal+3.943
Neg logit contributions
bered-5.450
beh-5.435
dden-5.411
thora-5.313
untled-5.104
Token prop
Token ID2632
Feature activation+7.53e-01

Pos logit contributions
erness+5.169
independence+4.667
aways+4.601
eenth+4.478
meal+4.395
Neg logit contributions
dden-6.173
bered-6.141
beh-6.140
thora-6.032
untled-5.897
Token +
Token ID1343
Feature activation+7.11e+00

Pos logit contributions
erness+0.518
independence+0.470
eenth+0.460
aways+0.453
meal+0.439
Neg logit contributions
beh-0.608
bered-0.599
dden-0.596
thora-0.588
icably-0.565
Token these
Token ID777
Feature activation+2.74e+00

Pos logit contributions
erness+4.632
independence+4.185
eenth+4.068
aways+4.056
meal+3.963
Neg logit contributions
bered-5.674
dden-5.623
beh-5.623
thora-5.464
icably-5.321
Token additional
Token ID3224
Feature activation+4.65e+00

Pos logit contributions
erness+1.959
independence+1.790
eenth+1.741
aways+1.733
meal+1.683
Neg logit contributions
beh-2.109
bered-2.095
dden-2.089
thora-2.058
untled-1.966
Token sources
Token ID4237
Feature activation-2.98e-01

Pos logit contributions
erness+3.328
independence+3.051
aways+2.981
eenth+2.945
meal+2.877
Neg logit contributions
beh-3.506
bered-3.506
dden-3.497
thora-3.458
untled-3.276
Token of
Token ID286
Feature activation-2.90e-02

Pos logit contributions
beh+0.245
bered+0.242
dden+0.240
thora+0.237
icably+0.228
Neg logit contributions
erness-0.200
independence-0.181
eenth-0.178
aways-0.174
meal-0.168
Token noise
Token ID7838
Feature activation+7.57e-01

Pos logit contributions
beh+0.023
bered+0.022
dden+0.022
thora+0.022
icably+0.021
Neg logit contributions
erness-0.021
independence-0.019
eenth-0.018
aways-0.018
meal-0.018
Token that
Token ID326
Feature activation-3.31e+00

Pos logit contributions
beh+1.031
bered+1.016
dden+1.010
thora+0.996
icably+0.955
Neg logit contributions
erness-0.937
independence-0.854
eenth-0.836
aways-0.823
meal-0.800
Token it
Token ID340
Feature activation-2.12e+00

Pos logit contributions
beh+2.440
bered+2.393
dden+2.378
thora+2.345
icably+2.259
Neg logit contributions
erness-2.492
independence-2.288
eenth-2.263
aways-2.204
meal-2.149
Token would
Token ID561
Feature activation-3.82e+00

Pos logit contributions
beh+1.625
bered+1.594
dden+1.565
thora+1.537
icably+1.499
Neg logit contributions
erness-1.553
independence-1.432
eenth-1.391
aways-1.363
meal-1.329
Token miss
Token ID2051
Feature activation-3.74e+00

Pos logit contributions
beh+2.559
dden+2.527
bered+2.506
thora+2.447
icably+2.383
Neg logit contributions
erness-3.103
independence-2.874
eenth-2.826
aways-2.765
meal-2.704
Token the
Token ID262
Feature activation-2.57e+00

Pos logit contributions
beh+2.769
bered+2.717
dden+2.697
thora+2.630
icably+2.573
Neg logit contributions
erness-2.793
independence-2.559
eenth-2.538
aways-2.466
meal-2.405
Token Moon
Token ID6869
Feature activation-2.46e+00

Pos logit contributions
beh+1.902
bered+1.874
dden+1.860
thora+1.818
icably+1.758
Neg logit contributions
erness-1.925
independence-1.765
eenth-1.738
aways-1.707
meal-1.656
Token by
Token ID416
Feature activation-4.51e+00

Pos logit contributions
beh+1.918
bered+1.883
dden+1.875
thora+1.843
icably+1.772
Neg logit contributions
erness-1.742
independence-1.595
eenth-1.555
aways-1.532
meal-1.488
Token about
Token ID546
Feature activation+1.84e+00

Pos logit contributions
beh+3.114
bered+3.029
dden+3.010
thora+2.951
icably+2.882
Neg logit contributions
erness-3.605
independence-3.289
eenth-3.235
aways-3.155
meal-3.090
Token 9
Token ID860
Feature activation+1.85e+00

Pos logit contributions
erness+1.264
independence+1.154
eenth+1.128
aways+1.111
meal+1.080
Neg logit contributions
beh-1.471
bered-1.452
dden-1.443
thora-1.426
icably-1.363
Token65
Token ID2996
Feature activation+1.97e+00

Pos logit contributions
erness+1.106
independence+0.995
eenth+0.971
aways+0.943
meal+0.912
Neg logit contributions
beh-1.652
bered-1.634
dden-1.630
thora-1.605
untled-1.550
Token kilometers
Token ID18212
Feature activation+7.40e-01

Pos logit contributions
erness+1.113
independence+0.995
eenth+0.971
aways+0.940
meal+0.909
Neg logit contributions
beh-1.828
bered-1.808
dden-1.800
thora-1.779
untled-1.717
Token of
Token ID286
Feature activation-6.26e-01

Pos logit contributions
beh+0.844
bered+0.833
dden+0.827
thora+0.816
icably+0.782
Neg logit contributions
erness-0.783
independence-0.714
eenth-0.700
aways-0.688
meal-0.668
Token sea
Token ID5417
Feature activation+7.91e+00

Pos logit contributions
beh+0.490
bered+0.483
dden+0.479
thora+0.472
icably+0.455
Neg logit contributions
erness-0.448
independence-0.407
eenth-0.399
aways-0.390
meal-0.379
Token turtles
Token ID36288
Feature activation+1.84e+00

Pos logit contributions
erness+4.521
aways+4.268
independence+4.081
meal+3.979
eenth+3.825
Neg logit contributions
beh-6.874
dden-6.771
bered-6.725
thora-6.697
iddled-6.389
Token around
Token ID1088
Feature activation+5.05e-01

Pos logit contributions
erness+1.301
independence+1.185
eenth+1.167
aways+1.151
meal+1.127
Neg logit contributions
beh-1.445
bered-1.422
dden-1.420
thora-1.402
untled-1.340
Token the
Token ID262
Feature activation-1.09e+00

Pos logit contributions
erness+0.337
independence+0.307
eenth+0.300
aways+0.298
meal+0.287
Neg logit contributions
beh-0.417
bered-0.411
dden-0.411
thora-0.405
gged-0.388
Token world
Token ID995
Feature activation-7.29e+00

Pos logit contributions
beh+1.000
bered+0.987
dden+0.981
thora+0.970
icably+0.938
Neg logit contributions
erness-0.625
independence-0.555
eenth-0.542
aways-0.528
meal-0.510
Token have
Token ID423
Feature activation-2.50e+00

Pos logit contributions
beh+4.999
bered+4.952
dden+4.765
thora+4.683
icably+4.603
Neg logit contributions
erness-5.673
independence-5.225
eenth-5.032
aways-4.898
liga-4.809
Token ingested
Token ID44694
Feature activation-7.89e+00

Pos logit contributions
beh+1.705
bered+1.685
dden+1.653
thora+1.637
icably+1.566
Neg logit contributions
erness-2.041
independence-1.880
eenth-1.851
aways-1.805
meal-1.748
Token debris
Token ID16468
Feature activation-2.87e+00

Pos logit contributions
beh+5.497
bered+5.440
thora+5.373
dden+5.286
icably+5.193
Neg logit contributions
erness-5.952
independence-5.451
eenth-5.386
aways-5.103
liga-4.966
Token,"
Token ID553
Feature activation-4.64e+00

Pos logit contributions
beh+2.121
bered+2.087
dden+2.053
thora+2.039
icably+1.958
Neg logit contributions
erness-2.159
independence-1.979
eenth-1.936
aways-1.882
meal-1.812
Token she
Token ID673
Feature activation-4.59e+00

Pos logit contributions
beh+3.595
bered+3.544
dden+3.518
thora+3.433
icably+3.305
Neg logit contributions
erness-3.315
independence-3.024
eenth-2.944
aways-2.899
meal-2.805
Token 2
Token ID362
Feature activation-1.97e-01

Pos logit contributions
erness+0.509
independence+0.464
eenth+0.454
aways+0.449
meal+0.435
Neg logit contributions
beh-0.574
bered-0.567
dden-0.564
thora-0.557
icably-0.532
Token.
Token ID13
Feature activation-2.14e-01

Pos logit contributions
beh+0.179
bered+0.177
dden+0.176
thora+0.174
icably+0.168
Neg logit contributions
erness-0.115
independence-0.103
eenth-0.101
aways-0.099
meal-0.095
Token8
Token ID23
Feature activation+8.40e-01

Pos logit contributions
beh+0.189
bered+0.187
dden+0.185
thora+0.183
icably+0.177
Neg logit contributions
erness-0.131
independence-0.117
eenth-0.114
aways-0.112
meal-0.108
Token points
Token ID2173
Feature activation+2.10e-01

Pos logit contributions
erness+0.478
independence+0.425
eenth+0.416
aways+0.407
meal+0.391
Neg logit contributions
beh-0.779
bered-0.770
dden-0.764
thora-0.757
icably-0.730
Token in
Token ID287
Feature activation+2.04e+00

Pos logit contributions
erness+0.150
independence+0.136
eenth+0.134
aways+0.132
meal+0.128
Neg logit contributions
beh-0.165
bered-0.162
dden-0.161
thora-0.159
icably-0.153
Token Real
Token ID6416
Feature activation-3.79e+00

Pos logit contributions
erness+1.407
independence+1.284
eenth+1.267
aways+1.252
meal+1.207
Neg logit contributions
beh-1.621
bered-1.601
dden-1.593
thora-1.578
icably-1.513
TokenClear
Token ID19856
Feature activation+5.69e+00

Pos logit contributions
beh+2.441
dden+2.390
bered+2.380
thora+2.332
gged+2.224
Neg logit contributions
erness-3.215
independence-2.962
eenth-2.918
aways-2.850
meal-2.801
TokenPolitics
Token ID43921
Feature activation+4.82e+00

Pos logit contributions
erness+4.214
independence+4.121
aways+3.937
eenth+3.868
meal+3.747
Neg logit contributions
bered-4.055
beh-4.030
dden-3.984
thora-3.948
gged-3.788
Token's
Token ID338
Feature activation+2.91e+00

Pos logit contributions
erness+2.502
independence+2.268
eenth+2.180
aways+2.165
meal+2.082
Neg logit contributions
beh-4.611
bered-4.567
dden-4.509
thora-4.504
untled-4.349
Token North
Token ID2258
Feature activation+8.70e-02

Pos logit contributions
erness+2.119
independence+1.955
eenth+1.914
aways+1.900
meal+1.839
Neg logit contributions
beh-2.207
bered-2.184
dden-2.163
thora-2.136
icably-2.055
Token Carolina
Token ID5913
Feature activation+4.38e-01

Pos logit contributions
erness+0.043
independence+0.037
eenth+0.036
aways+0.035
meal+0.034
Neg logit contributions
beh-0.088
bered-0.087
dden-0.086
thora-0.085
icably-0.083
Token Agreement
Token ID12729
Feature activation+1.24e+00

Pos logit contributions
erness+1.564
independence+1.451
eenth+1.430
aways+1.409
meal+1.376
Neg logit contributions
beh-1.148
bered-1.137
dden-1.125
thora-1.114
untled-1.052
Token"
Token ID1
Feature activation-5.57e-01

Pos logit contributions
erness+0.880
independence+0.799
eenth+0.785
aways+0.771
meal+0.751
Neg logit contributions
beh-0.973
bered-0.961
dden-0.954
thora-0.946
icably-0.902
Token over
Token ID625
Feature activation+1.75e+00

Pos logit contributions
beh+0.421
bered+0.415
dden+0.412
thora+0.404
gged+0.389
Neg logit contributions
erness-0.411
independence-0.376
eenth-0.369
aways-0.362
meal-0.353
Tokenprinted
Token ID49695
Feature activation+7.66e+00

Pos logit contributions
erness+1.228
independence+1.122
eenth+1.101
aways+1.086
meal+1.063
Neg logit contributions
beh-1.367
bered-1.344
dden-1.338
thora-1.334
icably-1.276
Token on
Token ID319
Feature activation+2.48e+00

Pos logit contributions
erness+4.875
independence+4.379
aways+4.366
eenth+4.304
meal+4.256
Neg logit contributions
beh-6.308
thora-6.214
dden-6.148
bered-6.080
untled-5.890
Token ob
Token ID909
Feature activation-5.59e+00

Pos logit contributions
erness+1.750
independence+1.601
eenth+1.569
aways+1.560
meal+1.506
Neg logit contributions
beh-1.936
dden-1.896
bered-1.893
thora-1.883
untled-1.799
Tokenverse
Token ID4399
Feature activation+9.43e-01

Pos logit contributions
beh+3.755
bered+3.716
dden+3.700
icably+3.494
thora+3.452
Neg logit contributions
erness-4.530
independence-4.124
eenth-4.016
aways-3.889
meal-3.851
Token.
Token ID13
Feature activation+1.65e+00

Pos logit contributions
erness+0.619
independence+0.559
eenth+0.547
aways+0.537
meal+0.520
Neg logit contributions
beh-0.791
bered-0.781
dden-0.776
thora-0.766
icably-0.737
Token In
Token ID554
Feature activation+6.41e-01

Pos logit contributions
erness+1.049
independence+0.943
eenth+0.927
aways+0.907
meal+0.878
Neg logit contributions
beh-1.420
bered-1.406
dden-1.396
thora-1.382
icably-1.326
Token addition
Token ID3090
Feature activation+3.19e+00

Pos logit contributions
erness+0.453
independence+0.412
eenth+0.404
aways+0.397
meal+0.386
Neg logit contributions
beh-0.505
bered-0.498
dden-0.495
thora-0.489
icably-0.469
Token,
Token ID11
Feature activation+3.78e-01

Pos logit contributions
erness+1.839
independence+1.632
eenth+1.583
aways+1.550
meal+1.515
Neg logit contributions
beh-2.902
bered-2.873
dden-2.866
thora-2.824
icably-2.725
Token,
Token ID11
Feature activation+4.03e+00

Pos logit contributions
erness+1.304
independence+1.170
eenth+1.145
aways+1.118
meal+1.086
Neg logit contributions
beh-1.876
bered-1.853
dden-1.841
thora-1.828
untled-1.754
Token and
Token ID290
Feature activation-3.98e-01

Pos logit contributions
erness+2.623
independence+2.389
eenth+2.327
aways+2.298
meal+2.226
Neg logit contributions
beh-3.320
bered-3.312
dden-3.311
thora-3.263
untled-3.136
Token although
Token ID3584
Feature activation-2.88e-01

Pos logit contributions
beh+0.310
bered+0.305
dden+0.303
thora+0.298
icably+0.288
Neg logit contributions
erness-0.286
independence-0.260
eenth-0.256
aways-0.251
meal-0.244
Token it
Token ID340
Feature activation+2.16e+00

Pos logit contributions
beh+0.214
bered+0.210
dden+0.209
thora+0.206
icably+0.197
Neg logit contributions
erness-0.217
independence-0.198
eenth-0.195
aways-0.192
meal-0.186
Token was
Token ID373
Feature activation+3.18e+00

Pos logit contributions
erness+1.477
independence+1.335
eenth+1.317
aways+1.293
meal+1.259
Neg logit contributions
beh-1.739
bered-1.714
dden-1.711
thora-1.696
untled-1.623
Token arguably
Token ID15242
Feature activation-2.77e+00

Pos logit contributions
erness+2.263
independence+2.067
eenth+2.025
aways+1.991
meal+1.951
Neg logit contributions
beh-2.467
bered-2.437
dden-2.428
thora-2.402
untled-2.295
Token bet
Token ID731
Feature activation-1.67e+00

Pos logit contributions
beh+2.045
dden+1.996
bered+1.995
thora+1.954
icably+1.921
Neg logit contributions
erness-2.096
independence-1.900
eenth-1.877
aways-1.862
meal-1.779
Tokentered
Token ID4400
Feature activation-1.84e+00

Pos logit contributions
beh+1.326
bered+1.312
dden+1.305
thora+1.279
gged+1.241
Neg logit contributions
erness-1.165
independence-1.055
eenth-1.030
aways-1.012
meal-0.979
Token by
Token ID416
Feature activation+1.50e-01

Pos logit contributions
beh+1.563
bered+1.541
dden+1.534
thora+1.508
icably+1.460
Neg logit contributions
erness-1.188
independence-1.072
eenth-1.047
aways-1.030
meal-0.994
Token the
Token ID262
Feature activation+5.83e-01

Pos logit contributions
erness+0.110
independence+0.101
eenth+0.099
aways+0.097
meal+0.094
Neg logit contributions
beh-0.114
bered-0.113
dden-0.112
thora-0.110
icably-0.105
Token later
Token ID1568
Feature activation+3.71e+00

Pos logit contributions
erness+0.438
independence+0.401
eenth+0.393
aways+0.387
meal+0.377
Neg logit contributions
beh-0.433
bered-0.427
dden-0.424
thora-0.418
icably-0.400
Token Continue
Token ID10054
Feature activation-8.84e+00

Pos logit contributions
beh+3.457
dden+3.133
bered+3.125
thora+3.004
gged+2.856
Neg logit contributions
erness-10.496
independence-9.880
eenth-9.643
aways-9.579
liga-9.430
Token reading
Token ID3555
Feature activation-5.04e+00

Pos logit contributions
beh+6.537
bered+6.388
dden+6.121
gged+6.030
thora+6.016
Neg logit contributions
erness-6.292
independence-5.609
eenth-5.589
aways-5.309
liga-5.185
Token the
Token ID262
Feature activation-1.10e+01

Pos logit contributions
beh+3.168
dden+3.093
bered+3.064
thora+2.978
gged+2.930
Neg logit contributions
erness-4.330
independence-3.934
eenth-3.798
aways-3.740
meal-3.616
Token main
Token ID1388
Feature activation-1.35e+01

Pos logit contributions
beh+4.866
bered+4.795
thora+4.553
dden+4.418
gged+4.383
Neg logit contributions
erness-10.813
independence-9.988
eenth-9.754
aways-9.596
liga-9.434
Token story
Token ID1621
Feature activation-5.68e+00

Pos logit contributions
beh+4.171
bered+3.959
dden+3.805
gged+3.446
edy+3.395
Neg logit contributions
erness-14.556
independence-13.813
eenth-13.383
aways-13.045
liga-12.569
Token Please
Token ID4222
Feature activation-1.19e+01

Pos logit contributions
beh+3.824
thora+3.737
bered+3.673
dden+3.591
iddled+3.510
Neg logit contributions
erness-4.553
eenth-3.995
independence-3.989
aways-3.867
meal-3.769
Token verify
Token ID11767
Feature activation-3.64e+00

Pos logit contributions
beh+4.085
bered+3.939
icably+3.684
edy+3.549
dden+3.398
Neg logit contributions
erness-12.767
eenth-12.034
aways-11.759
independence-11.575
liga-11.516
Token you
Token ID345
Feature activation-6.93e-01

Pos logit contributions
bered+2.753
beh+2.753
dden+2.686
gged+2.601
thora+2.599
Neg logit contributions
erness-2.698
independence-2.429
eenth-2.360
aways-2.307
meal-2.240
Token're
Token ID821
Feature activation-3.49e+00

Pos logit contributions
beh+0.480
bered+0.478
dden+0.464
thora+0.452
icably+0.441
Neg logit contributions
erness-0.563
independence-0.516
eenth-0.503
aways-0.501
meal-0.486
Token not
Token ID407
Feature activation-5.45e+00

Pos logit contributions
bered+1.911
beh+1.885
dden+1.803
gged+1.724
thora+1.721
Neg logit contributions
erness-3.335
independence-3.063
eenth-2.993
aways-2.989
meal-2.911
Token a
Token ID257
Feature activation-2.67e+00

Pos logit contributions
beh+3.363
bered+3.324
dden+3.181
thora+3.050
gged+3.028
Neg logit contributions
erness-4.723
eenth-4.286
independence-4.279
aways-4.225
meal-4.123
Token<|endoftext|>
Token ID50256
Feature activation-3.40e+00

Pos logit contributions
erness+1.462
independence+1.305
eenth+1.277
aways+1.257
meal+1.216
Neg logit contributions
beh-2.315
bered-2.301
thora-2.271
dden-2.271
icably-2.177
TokenONE
Token ID11651
Feature activation-4.09e+00

Pos logit contributions
beh+2.735
bered+2.664
dden+2.649
thora+2.583
icably+2.517
Neg logit contributions
erness-2.353
eenth-2.099
independence-2.096
aways-2.043
meal-1.971
Token of
Token ID286
Feature activation-5.98e+00

Pos logit contributions
beh+3.207
bered+3.156
dden+3.139
thora+3.084
icably+2.978
Neg logit contributions
erness-2.892
independence-2.631
eenth-2.537
aways-2.511
meal-2.429
Token France
Token ID4881
Feature activation-1.30e+00

Pos logit contributions
beh+4.457
dden+4.393
bered+4.355
thora+4.254
icably+4.102
Neg logit contributions
erness-4.378
independence-3.931
eenth-3.899
aways-3.855
meal-3.739
Token's
Token ID338
Feature activation-3.71e+00

Pos logit contributions
beh+0.987
bered+0.972
dden+0.965
thora+0.948
icably+0.910
Neg logit contributions
erness-0.963
independence-0.878
eenth-0.864
aways-0.849
meal-0.824
Token best
Token ID1266
Feature activation-1.21e+01

Pos logit contributions
beh+2.808
bered+2.759
dden+2.732
thora+2.658
icably+2.597
Neg logit contributions
erness-2.740
independence-2.448
eenth-2.415
aways-2.389
meal-2.308
Token-
Token ID12
Feature activation-1.14e+01

Pos logit contributions
beh+9.461
bered+9.340
dden+9.102
icably+8.715
thora+8.668
Neg logit contributions
erness-7.775
independence-6.742
eenth-6.593
meal-6.267
aways-6.147
Tokenknown
Token ID4002
Feature activation-3.81e+00

Pos logit contributions
beh+6.618
bered+5.912
dden+5.873
reci+5.303
icably+5.263
Neg logit contributions
erness-10.244
eenth-9.227
independence-9.011
meal-8.841
aways-8.803
Token mail
Token ID6920
Feature activation+9.58e+00

Pos logit contributions
beh+2.676
bered+2.635
dden+2.569
thora+2.496
icably+2.442
Neg logit contributions
erness-2.996
independence-2.706
eenth-2.692
aways-2.621
meal-2.577
Token order
Token ID1502
Feature activation+3.75e+00

Pos logit contributions
erness+6.721
aways+6.087
independence+6.051
eenth+5.888
meal+5.631
Neg logit contributions
dden-7.186
bered-7.175
thora-7.092
beh-7.058
untled-6.848
Token catalog
Token ID18388
Feature activation+3.38e+00

Pos logit contributions
erness+3.073
independence+2.836
eenth+2.792
aways+2.790
meal+2.710
Neg logit contributions
beh-2.495
dden-2.458
thora-2.428
bered-2.428
untled-2.308
Token Sheikh
Token ID30843
Feature activation-2.22e+00

Pos logit contributions
erness+0.752
independence+0.669
eenth+0.650
aways+0.641
meal+0.618
Neg logit contributions
beh-1.129
bered-1.121
dden-1.112
thora-1.101
icably-1.061
Token,
Token ID11
Feature activation-3.92e+00

Pos logit contributions
beh+1.765
bered+1.710
dden+1.706
thora+1.674
gged+1.604
Neg logit contributions
erness-1.572
independence-1.439
eenth-1.398
aways-1.371
meal-1.333
Token Egypt
Token ID6365
Feature activation-4.80e+00

Pos logit contributions
beh+3.045
bered+2.953
dden+2.929
thora+2.850
icably+2.789
Neg logit contributions
erness-2.808
independence-2.544
eenth-2.506
aways-2.406
meal-2.375
Token,
Token ID11
Feature activation-4.50e+00

Pos logit contributions
beh+3.760
bered+3.683
dden+3.632
thora+3.539
icably+3.487
Neg logit contributions
erness-3.414
independence-3.100
eenth-3.008
aways-2.930
meal-2.875
Token and
Token ID290
Feature activation-7.56e+00

Pos logit contributions
beh+3.465
bered+3.379
dden+3.332
thora+3.259
icably+3.193
Neg logit contributions
erness-3.264
independence-2.972
eenth-2.886
aways-2.787
meal-2.748
Token cru
Token ID4630
Feature activation-1.38e+01

Pos logit contributions
beh+5.333
bered+5.127
dden+5.034
thora+4.917
icably+4.903
Neg logit contributions
erness-5.678
independence-5.222
eenth-5.167
liga-4.907
aways-4.853
Tokenises
Token ID2696
Feature activation-4.06e+00

Pos logit contributions
ises+0.503
beh-0.240
dden-0.292
ised-0.525
icably-0.698
Neg logit contributions
erness-19.312
independence-18.567
liga-18.502
eenth-18.147
meal-17.777
Token via
Token ID2884
Feature activation-1.07e+00

Pos logit contributions
beh+3.041
bered+3.005
dden+2.990
thora+2.907
icably+2.832
Neg logit contributions
erness-2.969
independence-2.720
eenth-2.667
aways-2.569
meal-2.528
Token A
Token ID317
Feature activation-3.30e+00

Pos logit contributions
beh+0.842
bered+0.829
dden+0.825
thora+0.812
icably+0.781
Neg logit contributions
erness-0.757
independence-0.688
eenth-0.676
aways-0.661
meal-0.645
Tokenq
Token ID80
Feature activation-1.72e+00

Pos logit contributions
beh+2.501
bered+2.448
dden+2.429
thora+2.382
gged+2.298
Neg logit contributions
erness-2.436
independence-2.233
eenth-2.187
aways-2.154
meal-2.077
Tokenaba
Token ID15498
Feature activation-8.67e+00

Pos logit contributions
beh+1.614
bered+1.578
dden+1.569
thora+1.553
icably+1.501
Neg logit contributions
erness-0.968
independence-0.860
eenth-0.841
aways-0.821
meal-0.786
Token started
Token ID2067
Feature activation-4.51e+00

Pos logit contributions
erness+1.322
independence+1.207
eenth+1.184
aways+1.172
meal+1.141
Neg logit contributions
beh-1.383
bered-1.362
dden-1.358
thora-1.340
icably-1.277
Token looking
Token ID2045
Feature activation-1.24e+01

Pos logit contributions
beh+3.384
bered+3.312
dden+3.221
thora+3.181
icably+3.142
Neg logit contributions
erness-3.319
independence-3.001
eenth-2.960
aways-2.870
meal-2.744
Token for
Token ID329
Feature activation-8.37e+00

Pos logit contributions
beh+6.825
bered+6.817
dden+6.672
thora+6.309
icably+6.188
Neg logit contributions
erness-10.334
independence-9.721
eenth-9.515
aways-9.042
chool-8.778
Token a
Token ID257
Feature activation-5.14e+00

Pos logit contributions
beh+5.835
bered+5.669
dden+5.553
thora+5.400
icably+5.300
Neg logit contributions
erness-6.470
independence-5.800
eenth-5.799
aways-5.520
meal-5.294
Token new
Token ID649
Feature activation-6.71e+00

Pos logit contributions
beh+3.466
bered+3.431
dden+3.317
thora+3.253
icably+3.145
Neg logit contributions
erness-4.199
independence-3.832
eenth-3.784
aways-3.754
meal-3.546
Token partner
Token ID5212
Feature activation-1.23e+01

Pos logit contributions
beh+4.449
bered+4.385
dden+4.247
thora+4.108
icably+3.974
Neg logit contributions
erness-5.387
eenth-4.966
independence-4.954
aways-4.896
liga-4.601
Token.
Token ID13
Feature activation-1.92e+00

Pos logit contributions
beh+8.275
bered+8.249
thora+7.777
dden+7.725
untled+7.626
Neg logit contributions
erness-8.895
independence-8.147
eenth-8.135
aways-7.727
angel-7.055
Token
Token ID198
Feature activation-6.42e-01

Pos logit contributions
beh+1.714
bered+1.696
dden+1.686
thora+1.667
icably+1.607
Neg logit contributions
erness-1.163
independence-1.042
eenth-1.018
aways-0.997
meal-0.968
Token
Token ID198
Feature activation+9.88e-01

Pos logit contributions
beh+0.521
dden+0.510
bered+0.502
thora+0.488
gged+0.479
Neg logit contributions
erness-0.446
independence-0.404
eenth-0.393
aways-0.384
liga-0.370
Token"
Token ID1
Feature activation+1.88e+00

Pos logit contributions
erness+0.664
independence+0.608
eenth+0.590
aways+0.580
meal+0.568
Neg logit contributions
beh-0.807
bered-0.800
dden-0.794
thora-0.788
icably-0.755
TokenIt
Token ID1026
Feature activation-4.68e-03

Pos logit contributions
erness+1.229
independence+1.124
eenth+1.092
aways+1.077
meal+1.049
Neg logit contributions
beh-1.557
bered-1.554
dden-1.537
thora-1.524
icably-1.458
Token How
Token ID1374
Feature activation-4.07e-01

Pos logit contributions
erness+1.277
independence+1.156
eenth+1.139
aways+1.119
meal+1.077
Neg logit contributions
beh-1.580
bered-1.572
dden-1.558
thora-1.546
icably-1.485
Token A
Token ID317
Feature activation+2.72e-01

Pos logit contributions
beh+0.306
bered+0.301
dden+0.299
thora+0.295
icably+0.283
Neg logit contributions
erness-0.302
independence-0.276
eenth-0.271
aways-0.267
meal-0.260
Token New
Token ID968
Feature activation-6.42e-01

Pos logit contributions
erness+0.207
independence+0.188
eenth+0.186
aways+0.183
meal+0.178
Neg logit contributions
beh-0.199
bered-0.196
dden-0.195
thora-0.192
icably-0.184
Token Generation
Token ID16588
Feature activation+2.97e+00

Pos logit contributions
beh+0.502
bered+0.494
dden+0.491
thora+0.484
icably+0.465
Neg logit contributions
erness-0.459
independence-0.418
eenth-0.410
aways-0.403
meal-0.392
Token Is
Token ID1148
Feature activation+2.78e+00

Pos logit contributions
erness+2.037
independence+1.837
eenth+1.816
aways+1.803
meal+1.740
Neg logit contributions
beh-2.371
bered-2.350
dden-2.346
thora-2.319
untled-2.211
Token Rem
Token ID3982
Feature activation-1.46e+01

Pos logit contributions
erness+2.153
independence+1.968
eenth+1.947
aways+1.929
meal+1.877
Neg logit contributions
beh-1.969
bered-1.950
dden-1.947
thora-1.925
untled-1.831
Tokenaking
Token ID868
Feature activation-5.55e-01

Pos logit contributions
bered+9.557
beh+9.204
edy+8.907
dden+8.888
icably+8.771
Neg logit contributions
erness-10.768
independence-10.062
eenth-10.020
meal-9.767
liga-9.635
Token America
Token ID2253
Feature activation-5.29e+00

Pos logit contributions
beh+0.423
bered+0.417
dden+0.414
thora+0.408
icably+0.391
Neg logit contributions
erness-0.407
independence-0.372
eenth-0.365
aways-0.359
meal-0.349
Token.
Token ID13
Feature activation+1.07e+00

Pos logit contributions
beh+4.269
bered+4.202
dden+4.154
thora+4.054
gged+3.927
Neg logit contributions
erness-3.546
independence-3.246
eenth-3.166
aways-3.057
meal-2.974
Token "
Token ID366
Feature activation+2.56e-01

Pos logit contributions
erness+0.667
independence+0.598
eenth+0.587
aways+0.577
meal+0.556
Neg logit contributions
beh-0.931
bered-0.921
dden-0.913
thora-0.904
icably-0.871
TokenO
Token ID46
Feature activation-4.02e-01

Pos logit contributions
erness+0.181
independence+0.165
eenth+0.161
aways+0.159
meal+0.154
Neg logit contributions
beh-0.202
bered-0.200
dden-0.199
thora-0.196
icably-0.188