TOP ACTIVATIONS
MAX = 5.283

baby . The baby was thirsty , so the mom my
little boy and he was thirsty . So he went to
little rabbit who was very thirsty . He hopped over to
little girl who was very thirsty . She looked everywhere ,
deaf . She was always thirsty , so one day she
on the island were very thirsty , but no matter how
, the cow was feeling thirsty so it decided to go
three and he was always thirsty . He asked his mom
little boy who was very thirsty . So , he got
antly , Jim felt really thirsty , so he nodded ,
bear . He was very thirsty , but the only thing
sky and Bob was very thirsty . He saw a big
. The bull was very thirsty and wanted to drink some
. The rabbit was so thirsty , he thought he could
old . Emma was very thirsty and went to the kitchen
bird . He was very thirsty . He knew he needed
day . She was so thirsty that she thought she would
. Tim my was very thirsty and wanted some milk .
town until it became very thirsty . Then it
ggy was scared and very thirsty . Suddenly , it heard

MIN ACTIVATIONS
MIN = -7.271

is angry . It his ses . It scratches . It
like that . It his ses and fl aps its wings
it . The cat his ses and scratches him . Ben
is mean . He his ses and runs away . He
snake sees Tom and his ses . It opens its mouth
nose . The cat his ses and scratches his face .
and angry . It his ses and fl aps its wings
claws and bites and his ses at the bars . He
The snake his ses . L ila and Tom
scare it . It his ses and shows its teeth .
is afraid . It his ses and scratches Tom 's hand
is afraid . It his ses and scratches Tom 's hand
under the bed and his sed at them . Lily and
slide . The cat his ses at him . Tom is
cat woke up and his sed . He jumped off the
angry snake . It his ses and snaps at Ben .
she sque aked and his sed at Ben and Lily .
strangers . The cat his ses at Tom and Sue .
a cat . She his ses and scratches at the dog
ked tongue . It his ses and swim s near the

INTERVAL 2.144 - 5.283
CONTAINS 1.468%

The man helped Mike find his way back home
" That 's right ! " he said .
liked to hear them cro ak and jump .
helped restore the <|endoftext|> Once upon a time there was a
, worried . " Hey ,

INTERVAL -0.994 - 2.144
CONTAINS 86.634%

ked tongue . Lily is curious . She touches the snake
The window was opened and the two friends could finally go
small bird who asked for help . The independent
noticed the tall stack of blocks that she had made earlier
was a baby donkey ! The m ule was so surprised

INTERVAL -4.133 - -0.994
CONTAINS 11.886%

umbles and falls . She hurts her knee and cries .
remember her . Now it is gone . We are both
From that day on , Jack always had
they were looking for -- wherever they went , they were
n aps . I like you and the garden . I

INTERVAL -7.271 - -4.133
CONTAINS 0.011%

mean . Wh isk ers knew she had to
and red . It his sed at Lily . Lily screamed
Let 's play !" Wh isk ers said , " No
the trees , the flower bed s , and the patches
cher ries and ch ir ped happily . It trusted them
Token baby
Token ID5156
Feature activation+4.54e-01

Pos logit contributions
ak+0.074
He+0.065
 there+0.060
 upon+0.058
 like+0.058
Neg logit contributions
sed-0.086
opes-0.085
lder-0.077
ses-0.076
 Majesty-0.075
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
ak+0.035
He+0.030
 there+0.029
 like+0.028
 of+0.028
Neg logit contributions
sed-0.053
opes-0.052
 Majesty-0.049
lder-0.048
ses-0.047
Token The
Token ID383
Feature activation+1.77e-01

Pos logit contributions
ak+0.124
He+0.108
 there+0.099
he+0.099
 upon+0.097
Neg logit contributions
sed-0.125
opes-0.117
lder-0.112
ses-0.108
 Majesty-0.107
Token baby
Token ID5156
Feature activation+4.21e-01

Pos logit contributions
ak+0.016
He+0.015
 there+0.015
 of+0.015
m+0.014
Neg logit contributions
opes-0.017
sed-0.016
 Majesty-0.015
 endings-0.015
lder-0.015
Token was
Token ID373
Feature activation-3.44e-01

Pos logit contributions
ak+0.032
He+0.028
 there+0.026
 like+0.024
 of+0.024
Neg logit contributions
sed-0.050
opes-0.048
 Majesty-0.046
ses-0.045
lder-0.044
Token thirsty
Token ID47124
Feature activation+5.28e+00

Pos logit contributions
sed+0.041
opes+0.039
lder+0.038
ses+0.037
 Majesty+0.036
Neg logit contributions
ak-0.026
He-0.022
he-0.019
ill-0.019
 there-0.019
Token,
Token ID11
Feature activation+6.64e-01

Pos logit contributions
ak+0.421
He+0.353
 upon+0.316
ill+0.313
fing+0.311
Neg logit contributions
sed-0.575
opes-0.551
lder-0.538
ses-0.509
 endings-0.490
Token so
Token ID523
Feature activation+1.18e+00

Pos logit contributions
ak+0.040
He+0.032
 there+0.028
he+0.027
 like+0.027
Neg logit contributions
sed-0.090
opes-0.086
lder-0.083
 Majesty-0.082
ses-0.082
Token the
Token ID262
Feature activation+1.15e+00

Pos logit contributions
ak+0.107
He+0.092
 there+0.084
he+0.084
 upon+0.081
Neg logit contributions
sed-0.124
opes-0.117
lder-0.112
ses-0.108
 Majesty-0.108
Token mom
Token ID1995
Feature activation+6.81e-01

Pos logit contributions
ak+0.101
He+0.096
 there+0.094
 of+0.090
he+0.088
Neg logit contributions
opes-0.114
sed-0.112
 Majesty-0.102
lder-0.102
ses-0.099
Tokenmy
Token ID1820
Feature activation+2.00e+00

Pos logit contributions
ak+0.041
He+0.036
fing+0.031
he+0.030
ifying+0.029
Neg logit contributions
sed-0.080
lder-0.078
opes-0.078
 self-0.070
ses-0.070
Token little
Token ID1310
Feature activation-7.49e-01

Pos logit contributions
ak+0.078
He+0.066
ill+0.061
he+0.061
 there+0.060
Neg logit contributions
sed-0.094
lder-0.087
opes-0.086
 Majesty-0.084
ses-0.082
Token boy
Token ID2933
Feature activation+1.57e+00

Pos logit contributions
sed+0.068
lder+0.060
ses+0.059
 Majesty+0.058
opes+0.056
Neg logit contributions
ak-0.077
He-0.064
ill-0.062
he-0.060
 upon-0.058
Token and
Token ID290
Feature activation+9.00e-01

Pos logit contributions
ak+0.139
He+0.120
he+0.111
ill+0.107
 upon+0.106
Neg logit contributions
sed-0.159
opes-0.150
lder-0.145
ses-0.137
 Majesty-0.135
Token he
Token ID339
Feature activation+2.28e+00

Pos logit contributions
ak+0.067
He+0.051
he+0.051
ill+0.049
aky+0.048
Neg logit contributions
sed-0.107
lder-0.100
opes-0.098
 Majesty-0.098
ses-0.095
Token was
Token ID373
Feature activation-3.75e-01

Pos logit contributions
ak+0.206
He+0.182
 there+0.166
 like+0.164
 upon+0.161
Neg logit contributions
sed-0.240
opes-0.228
lder-0.213
ses-0.212
 Majesty-0.207
Token thirsty
Token ID47124
Feature activation+5.25e+00

Pos logit contributions
sed+0.042
opes+0.040
lder+0.038
ses+0.037
 Majesty+0.037
Neg logit contributions
ak-0.031
He-0.027
 there-0.024
he-0.024
 upon-0.023
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
ak+0.479
He+0.401
 upon+0.367
ill+0.366
aky+0.363
Neg logit contributions
sed-0.499
opes-0.482
lder-0.461
ses-0.432
 endings-0.431
Token So
Token ID1406
Feature activation-3.47e-01

Pos logit contributions
ak+0.071
He+0.055
aky+0.050
he+0.050
 there+0.049
Neg logit contributions
sed-0.127
opes-0.122
lder-0.118
 Majesty-0.114
ses-0.114
Token he
Token ID339
Feature activation+2.18e+00

Pos logit contributions
sed+0.035
lder+0.033
opes+0.033
 Majesty+0.032
 endings+0.031
Neg logit contributions
ak-0.033
aky-0.026
He-0.025
he-0.024
ill-0.024
Token went
Token ID1816
Feature activation+1.92e-02

Pos logit contributions
ak+0.182
He+0.155
he+0.143
 there+0.140
ill+0.136
Neg logit contributions
sed-0.241
opes-0.227
lder-0.219
ses-0.211
 Majesty-0.210
Token to
Token ID284
Feature activation+2.49e-01

Pos logit contributions
ak+0.002
He+0.002
he+0.002
aky+0.002
ill+0.002
Neg logit contributions
sed-0.002
opes-0.001
lder-0.001
yard-0.001
ses-0.001
Token little
Token ID1310
Feature activation-1.04e+00

Pos logit contributions
ak+0.061
He+0.053
 there+0.049
he+0.049
ill+0.047
Neg logit contributions
sed-0.066
opes-0.062
lder-0.060
ses-0.058
 Majesty-0.057
Token rabbit
Token ID22746
Feature activation-4.64e-01

Pos logit contributions
sed+0.108
opes+0.099
lder+0.098
ses+0.095
 Majesty+0.093
Neg logit contributions
ak-0.094
He-0.081
ill-0.073
he-0.073
 upon-0.072
Token who
Token ID508
Feature activation+1.30e+00

Pos logit contributions
sed+0.056
opes+0.054
 Majesty+0.051
lder+0.051
ses+0.051
Neg logit contributions
ak-0.035
He-0.029
 there-0.027
 like-0.026
 of-0.026
Token was
Token ID373
Feature activation-4.47e-01

Pos logit contributions
ak+0.138
He+0.123
 there+0.113
he+0.112
 like+0.111
Neg logit contributions
sed-0.118
opes-0.111
lder-0.104
ses-0.101
 Majesty-0.101
Token very
Token ID845
Feature activation+2.08e+00

Pos logit contributions
sed+0.056
opes+0.054
lder+0.052
 Majesty+0.051
ses+0.051
Neg logit contributions
ak-0.031
He-0.025
 there-0.023
 like-0.022
he-0.022
Token thirsty
Token ID47124
Feature activation+5.16e+00

Pos logit contributions
ak+0.164
He+0.139
 there+0.124
he+0.123
 like+0.119
Neg logit contributions
sed-0.244
opes-0.232
lder-0.223
ses-0.217
 Majesty-0.216
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
ak+0.419
He+0.353
 upon+0.311
he+0.311
ill+0.311
Neg logit contributions
sed-0.575
opes-0.547
lder-0.526
ses-0.508
 Majesty-0.492
Token He
Token ID679
Feature activation+1.80e+00

Pos logit contributions
ak+0.087
He+0.070
he+0.064
 there+0.062
aky+0.062
Neg logit contributions
sed-0.152
opes-0.144
lder-0.139
ses-0.136
 Majesty-0.134
Token hopped
Token ID45230
Feature activation-7.96e-01

Pos logit contributions
ak+0.147
He+0.127
 there+0.118
 upon+0.112
 like+0.112
Neg logit contributions
sed-0.205
opes-0.197
 Majesty-0.189
lder-0.187
ses-0.185
Token over
Token ID625
Feature activation-2.53e-01

Pos logit contributions
sed+0.077
opes+0.068
lder+0.066
ses+0.064
zed+0.063
Neg logit contributions
ak-0.066
ill-0.057
fing-0.054
aky-0.053
He-0.052
Token to
Token ID284
Feature activation+1.59e-01

Pos logit contributions
sed+0.027
opes+0.026
lder+0.024
ses+0.024
 Majesty+0.023
Neg logit contributions
ak-0.021
He-0.018
aky-0.016
he-0.016
ill-0.016
Token little
Token ID1310
Feature activation-1.03e+00

Pos logit contributions
ak+0.063
He+0.054
he+0.050
 there+0.050
ill+0.049
Neg logit contributions
sed-0.064
opes-0.060
lder-0.058
ses-0.056
 Majesty-0.056
Token girl
Token ID2576
Feature activation+8.06e-01

Pos logit contributions
sed+0.107
lder+0.097
opes+0.097
ses+0.094
 Majesty+0.091
Neg logit contributions
ak-0.094
He-0.081
ill-0.074
he-0.073
 upon-0.072
Token who
Token ID508
Feature activation+1.38e+00

Pos logit contributions
ak+0.061
He+0.048
 there+0.047
 like+0.046
 of+0.045
Neg logit contributions
sed-0.101
opes-0.097
 Majesty-0.091
ses-0.091
lder-0.089
Token was
Token ID373
Feature activation-4.07e-01

Pos logit contributions
ak+0.144
He+0.128
 there+0.118
he+0.117
 like+0.116
Neg logit contributions
sed-0.129
opes-0.121
lder-0.113
 Majesty-0.110
ses-0.110
Token very
Token ID845
Feature activation+2.10e+00

Pos logit contributions
sed+0.052
opes+0.050
lder+0.048
 Majesty+0.047
ses+0.046
Neg logit contributions
ak-0.027
He-0.022
 there-0.021
 like-0.021
he-0.020
Token thirsty
Token ID47124
Feature activation+5.11e+00

Pos logit contributions
ak+0.155
He+0.130
 there+0.119
he+0.115
 like+0.115
Neg logit contributions
sed-0.255
opes-0.247
lder-0.230
ses-0.228
 Majesty-0.225
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
ak+0.417
He+0.352
 upon+0.311
ill+0.310
he+0.309
Neg logit contributions
sed-0.564
opes-0.537
lder-0.518
ses-0.498
 Majesty-0.484
Token She
Token ID1375
Feature activation+3.77e-01

Pos logit contributions
ak+0.096
He+0.085
 there+0.079
 like+0.076
 upon+0.075
Neg logit contributions
sed-0.143
opes-0.137
lder-0.129
 Majesty-0.128
ses-0.126
Token looked
Token ID3114
Feature activation+4.19e-02

Pos logit contributions
ak+0.031
He+0.027
 there+0.025
 like+0.024
 upon+0.024
Neg logit contributions
sed-0.043
opes-0.041
 Majesty-0.040
ses-0.040
lder-0.038
Token everywhere
Token ID8347
Feature activation-9.31e-01

Pos logit contributions
ak+0.004
He+0.003
ill+0.003
he+0.003
aky+0.003
Neg logit contributions
sed-0.005
opes-0.004
lder-0.004
ses-0.004
 Majesty-0.004
Token,
Token ID11
Feature activation+5.00e-01

Pos logit contributions
sed+0.107
opes+0.100
lder+0.095
ses+0.094
 Majesty+0.090
Neg logit contributions
ak-0.074
He-0.062
he-0.056
aky-0.054
fing-0.053
Token deaf
Token ID21815
Feature activation+7.82e-01

Pos logit contributions
sed+0.049
opes+0.046
lder+0.045
ses+0.043
 Majesty+0.043
Neg logit contributions
ak-0.028
He-0.023
 there-0.020
he-0.020
 like-0.019
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
ak+0.050
He+0.041
 there+0.039
 like+0.038
 of+0.038
Neg logit contributions
sed-0.102
opes-0.098
lder-0.095
 Majesty-0.094
ses-0.092
Token She
Token ID1375
Feature activation+3.54e-01

Pos logit contributions
ak+0.088
He+0.078
 there+0.068
 like+0.067
 upon+0.066
Neg logit contributions
sed-0.150
opes-0.142
lder-0.138
 Majesty-0.134
ses-0.133
Token was
Token ID373
Feature activation-4.28e-01

Pos logit contributions
ak+0.029
He+0.026
 there+0.025
 like+0.025
 upon+0.022
Neg logit contributions
sed-0.040
opes-0.037
 Majesty-0.037
ses-0.036
lder-0.035
Token always
Token ID1464
Feature activation+8.95e-01

Pos logit contributions
sed+0.052
opes+0.050
lder+0.048
 Majesty+0.046
ses+0.046
Neg logit contributions
ak-0.032
He-0.027
 there-0.025
 like-0.024
 upon-0.023
Token thirsty
Token ID47124
Feature activation+5.05e+00

Pos logit contributions
ak+0.081
He+0.072
he+0.067
ill+0.066
aky+0.061
Neg logit contributions
sed-0.092
opes-0.085
lder-0.082
ses-0.079
 Majesty-0.079
Token,
Token ID11
Feature activation+5.84e-01

Pos logit contributions
ak+0.406
He+0.341
ill+0.305
fing+0.301
 upon+0.299
Neg logit contributions
sed-0.545
opes-0.530
lder-0.507
ses-0.481
 endings-0.474
Token so
Token ID523
Feature activation+1.02e+00

Pos logit contributions
ak+0.034
He+0.028
 there+0.028
 like+0.026
 upon+0.024
Neg logit contributions
sed-0.079
opes-0.076
lder-0.074
 Majesty-0.072
ses-0.072
Token one
Token ID530
Feature activation+7.45e-01

Pos logit contributions
ak+0.085
He+0.076
 there+0.073
 like+0.068
 upon+0.068
Neg logit contributions
sed-0.115
opes-0.109
lder-0.104
ses-0.102
 Majesty-0.099
Token day
Token ID1110
Feature activation-5.08e-01

Pos logit contributions
ak+0.094
He+0.084
ill+0.080
he+0.079
fing+0.078
Neg logit contributions
sed-0.046
lder-0.042
opes-0.039
ses-0.037
 Majesty-0.033
Token she
Token ID673
Feature activation+6.08e-01

Pos logit contributions
sed+0.066
opes+0.064
ses+0.062
 Majesty+0.060
lder+0.060
Neg logit contributions
ak-0.032
 there-0.031
He-0.028
 like-0.028
 of-0.027
Token on
Token ID319
Feature activation+1.33e+00

Pos logit contributions
sed+0.065
opes+0.062
lder+0.060
ses+0.057
 Majesty+0.055
Neg logit contributions
ak-0.047
He-0.042
ill-0.037
he-0.036
aky-0.035
Token the
Token ID262
Feature activation+9.72e-01

Pos logit contributions
ak+0.074
He+0.060
 there+0.060
 like+0.051
he+0.050
Neg logit contributions
sed-0.189
opes-0.176
lder-0.173
ses-0.171
 Majesty-0.170
Token island
Token ID7022
Feature activation+5.53e-01

Pos logit contributions
ak+0.090
He+0.078
 there+0.076
 like+0.072
he+0.070
Neg logit contributions
sed-0.100
opes-0.092
ses-0.091
lder-0.086
 Majesty-0.084
Token were
Token ID547
Feature activation+1.00e+00

Pos logit contributions
ak+0.043
He+0.036
 there+0.034
he+0.032
 upon+0.032
Neg logit contributions
sed-0.065
opes-0.062
lder-0.060
ses-0.058
 Majesty-0.058
Token very
Token ID845
Feature activation+2.08e+00

Pos logit contributions
ak+0.076
He+0.063
 there+0.059
 like+0.056
he+0.055
Neg logit contributions
sed-0.121
opes-0.115
lder-0.111
ses-0.108
 Majesty-0.107
Token thirsty
Token ID47124
Feature activation+5.05e+00

Pos logit contributions
ak+0.164
He+0.139
he+0.127
ill+0.126
aky+0.119
Neg logit contributions
sed-0.240
opes-0.220
lder-0.219
 Majesty-0.210
ses-0.210
Token,
Token ID11
Feature activation+5.16e-01

Pos logit contributions
ak+0.404
He+0.340
aky+0.299
he+0.298
 upon+0.294
Neg logit contributions
sed-0.568
opes-0.536
lder-0.518
ses-0.502
 Majesty-0.488
Token but
Token ID475
Feature activation+2.11e+00

Pos logit contributions
ak+0.027
He+0.022
 there+0.020
 like+0.019
 upon+0.017
Neg logit contributions
sed-0.073
opes-0.070
lder-0.069
 Majesty-0.068
ses-0.066
Token no
Token ID645
Feature activation+1.53e+00

Pos logit contributions
ak+0.146
 there+0.132
He+0.129
 like+0.121
 upon+0.112
Neg logit contributions
sed-0.258
opes-0.250
lder-0.243
 Majesty-0.238
ses-0.232
Token matter
Token ID2300
Feature activation+1.66e+00

Pos logit contributions
ak+0.139
He+0.120
 there+0.112
 like+0.108
he+0.108
Neg logit contributions
sed-0.160
opes-0.151
lder-0.145
ses-0.140
 Majesty-0.139
Token how
Token ID703
Feature activation-3.22e-01

Pos logit contributions
ak+0.132
He+0.111
 there+0.109
 of+0.102
 upon+0.102
Neg logit contributions
sed-0.196
opes-0.181
lder-0.179
ses-0.173
 Majesty-0.172
Token,
Token ID11
Feature activation+4.70e-01

Pos logit contributions
sed+0.064
opes+0.060
lder+0.059
ses+0.056
 Majesty+0.056
Neg logit contributions
ak-0.045
He-0.037
aky-0.034
he-0.034
ill-0.033
Token the
Token ID262
Feature activation+9.28e-01

Pos logit contributions
ak+0.031
He+0.026
 there+0.023
he+0.022
 like+0.021
Neg logit contributions
sed-0.061
opes-0.058
lder-0.056
ses-0.055
 Majesty-0.055
Token cow
Token ID9875
Feature activation+1.71e-01

Pos logit contributions
ak+0.097
 there+0.085
He+0.085
 like+0.084
 of+0.079
Neg logit contributions
opes-0.082
sed-0.082
 Majesty-0.076
 endings-0.075
lder-0.071
Token was
Token ID373
Feature activation-5.20e-01

Pos logit contributions
ak+0.012
He+0.010
 there+0.009
 like+0.009
 upon+0.009
Neg logit contributions
sed-0.022
opes-0.021
ses-0.020
 Majesty-0.019
lder-0.019
Token feeling
Token ID4203
Feature activation+4.71e-01

Pos logit contributions
sed+0.055
opes+0.054
lder+0.051
 Majesty+0.050
ses+0.050
Neg logit contributions
ak-0.047
 there-0.039
He-0.038
 like-0.037
 upon-0.036
Token thirsty
Token ID47124
Feature activation+5.04e+00

Pos logit contributions
ak+0.038
He+0.032
he+0.029
ill+0.029
 there+0.028
Neg logit contributions
sed-0.054
opes-0.050
lder-0.049
ses-0.047
 Majesty-0.047
Token so
Token ID523
Feature activation+1.00e+00

Pos logit contributions
ak+0.401
He+0.339
 upon+0.299
fing+0.297
ill+0.295
Neg logit contributions
sed-0.547
opes-0.525
lder-0.509
ses-0.483
 endings-0.469
Token it
Token ID340
Feature activation+7.23e-02

Pos logit contributions
ak+0.070
He+0.061
 there+0.058
 upon+0.054
 like+0.053
Neg logit contributions
sed-0.125
opes-0.122
lder-0.114
ses-0.113
 Majesty-0.113
Token decided
Token ID3066
Feature activation+1.68e+00

Pos logit contributions
ak+0.006
He+0.005
 there+0.005
 upon+0.004
 like+0.004
Neg logit contributions
sed-0.008
opes-0.008
 Majesty-0.008
lder-0.008
ses-0.008
Token to
Token ID284
Feature activation+2.47e-01

Pos logit contributions
ak+0.087
 there+0.086
He+0.077
 like+0.074
 upon+0.069
Neg logit contributions
sed-0.231
opes-0.231
ses-0.218
lder-0.214
 Majesty-0.213
Token go
Token ID467
Feature activation+1.27e+00

Pos logit contributions
ak+0.019
He+0.016
 there+0.014
he+0.014
 like+0.014
Neg logit contributions
sed-0.030
opes-0.028
lder-0.027
ses-0.026
 Majesty-0.026
Token three
Token ID1115
Feature activation-6.67e-01

Pos logit contributions
ak+0.068
He+0.059
he+0.057
ill+0.054
 there+0.050
Neg logit contributions
sed-0.072
ses-0.063
opes-0.062
 Majesty-0.061
lder-0.060
Token and
Token ID290
Feature activation+8.90e-01

Pos logit contributions
sed+0.068
opes+0.063
lder+0.061
ses+0.060
 Majesty+0.059
Neg logit contributions
ak-0.063
He-0.055
 there-0.051
he-0.050
aky-0.049
Token he
Token ID339
Feature activation+1.97e+00

Pos logit contributions
ak+0.085
He+0.072
ill+0.068
he+0.067
 upon+0.066
Neg logit contributions
sed-0.089
opes-0.083
lder-0.080
ses-0.077
 Majesty-0.076
Token was
Token ID373
Feature activation-4.38e-01

Pos logit contributions
ak+0.187
He+0.167
 like+0.152
 there+0.151
he+0.147
Neg logit contributions
sed-0.199
opes-0.189
 Majesty-0.177
ses-0.176
lder-0.175
Token always
Token ID1464
Feature activation+8.62e-01

Pos logit contributions
sed+0.051
opes+0.049
lder+0.047
ses+0.046
 Majesty+0.045
Neg logit contributions
ak-0.035
He-0.029
 there-0.027
he-0.026
 like-0.026
Token thirsty
Token ID47124
Feature activation+5.00e+00

Pos logit contributions
ak+0.084
He+0.075
he+0.071
ill+0.070
 of+0.064
Neg logit contributions
sed-0.082
opes-0.074
lder-0.073
 Majesty-0.070
ses-0.070
Token.
Token ID13
Feature activation+1.07e+00

Pos logit contributions
ak+0.465
He+0.389
ill+0.362
 upon+0.359
fing+0.358
Neg logit contributions
sed-0.456
opes-0.450
lder-0.424
 endings-0.408
der-0.401
Token He
Token ID679
Feature activation+1.81e+00

Pos logit contributions
ak+0.108
aky+0.090
 closer+0.082
fing+0.081
vy+0.080
Neg logit contributions
sed-0.097
opes-0.092
ses-0.084
lder-0.084
 Majesty-0.081
Token asked
Token ID1965
Feature activation+5.20e-01

Pos logit contributions
ak+0.149
He+0.131
 there+0.119
 like+0.117
 upon+0.114
Neg logit contributions
sed-0.205
opes-0.197
 Majesty-0.186
ses-0.185
lder-0.185
Token his
Token ID465
Feature activation+1.72e+00

Pos logit contributions
ak+0.031
He+0.024
he+0.021
ill+0.020
aky+0.020
Neg logit contributions
sed-0.070
opes-0.067
lder-0.066
 Majesty-0.064
ses-0.063
Token mom
Token ID1995
Feature activation+4.66e-01

Pos logit contributions
ak+0.153
 there+0.134
He+0.134
 of+0.128
 upon+0.126
Neg logit contributions
opes-0.175
sed-0.170
lder-0.161
ses-0.152
 endings-0.151
Token little
Token ID1310
Feature activation-1.16e+00

Pos logit contributions
ak+0.044
He+0.038
 there+0.035
he+0.034
 like+0.034
Neg logit contributions
sed-0.050
opes-0.047
lder-0.045
ses-0.044
 Majesty-0.043
Token boy
Token ID2933
Feature activation+1.24e+00

Pos logit contributions
sed+0.123
opes+0.114
lder+0.112
ses+0.109
 Majesty+0.107
Neg logit contributions
ak-0.102
He-0.089
he-0.080
 there-0.079
ill-0.078
Token who
Token ID508
Feature activation+1.20e+00

Pos logit contributions
ak+0.107
He+0.092
 there+0.083
he+0.082
 like+0.080
Neg logit contributions
sed-0.136
opes-0.128
lder-0.123
ses-0.120
 Majesty-0.119
Token was
Token ID373
Feature activation-5.53e-01

Pos logit contributions
ak+0.128
He+0.112
ill+0.109
he+0.106
 there+0.104
Neg logit contributions
sed-0.103
lder-0.095
opes-0.094
ses-0.088
 Majesty-0.085
Token very
Token ID845
Feature activation+1.97e+00

Pos logit contributions
sed+0.071
opes+0.070
lder+0.067
 Majesty+0.065
ses+0.064
Neg logit contributions
ak-0.036
He-0.030
 there-0.028
 like-0.027
 upon-0.025
Token thirsty
Token ID47124
Feature activation+4.98e+00

Pos logit contributions
ak+0.152
He+0.129
 there+0.117
he+0.114
 like+0.112
Neg logit contributions
sed-0.232
opes-0.223
lder-0.211
ses-0.207
 Majesty-0.205
Token.
Token ID13
Feature activation+9.45e-01

Pos logit contributions
ak+0.428
He+0.365
 upon+0.328
ill+0.325
fing+0.321
Neg logit contributions
sed-0.515
opes-0.492
lder-0.475
ses-0.452
 endings-0.436
Token So
Token ID1406
Feature activation-5.00e-01

Pos logit contributions
ak+0.074
He+0.057
aky+0.056
he+0.054
fing+0.052
Neg logit contributions
sed-0.112
opes-0.105
lder-0.102
ses-0.100
 Majesty-0.095
Token,
Token ID11
Feature activation+4.56e-01

Pos logit contributions
sed+0.055
lder+0.050
opes+0.049
ses+0.048
 Majesty+0.047
Neg logit contributions
ak-0.045
He-0.036
he-0.035
aky-0.035
ill-0.034
Token he
Token ID339
Feature activation+1.83e+00

Pos logit contributions
ak+0.042
aky+0.034
ill+0.033
aking+0.031
he+0.030
Neg logit contributions
sed-0.048
lder-0.044
opes-0.043
ses-0.041
 Majesty-0.039
Token got
Token ID1392
Feature activation+6.54e-01

Pos logit contributions
ak+0.148
He+0.126
 there+0.116
 like+0.113
 upon+0.109
Neg logit contributions
sed-0.213
opes-0.204
ses-0.192
 Majesty-0.192
lder-0.190
Tokenantly
Token ID3875
Feature activation-1.93e+00

Pos logit contributions
sed+0.109
opes+0.102
lder+0.097
 Majesty+0.093
ses+0.093
Neg logit contributions
ak-0.133
He-0.117
he-0.109
 there-0.108
 like-0.107
Token,
Token ID11
Feature activation+3.26e-01

Pos logit contributions
sed+0.229
opes+0.215
lder+0.211
 Majesty+0.204
ses+0.201
Neg logit contributions
ak-0.149
He-0.122
he-0.111
aky-0.110
 upon-0.103
Token Jim
Token ID5395
Feature activation+1.01e+00

Pos logit contributions
ak+0.031
aky+0.025
He+0.024
he+0.023
fing+0.022
Neg logit contributions
sed-0.033
opes-0.031
lder-0.030
 Majesty-0.029
ses-0.028
Token felt
Token ID2936
Feature activation-8.06e-02

Pos logit contributions
ak+0.081
He+0.069
he+0.063
 there+0.061
 like+0.059
Neg logit contributions
sed-0.115
opes-0.109
lder-0.105
 Majesty-0.102
ses-0.101
Token really
Token ID1107
Feature activation+1.55e+00

Pos logit contributions
sed+0.008
lder+0.007
opes+0.007
 Majesty+0.007
ses+0.007
Neg logit contributions
ak-0.007
ill-0.006
he-0.006
He-0.006
fing-0.005
Token thirsty
Token ID47124
Feature activation+4.97e+00

Pos logit contributions
ak+0.128
He+0.108
he+0.099
ill+0.097
aky+0.095
Neg logit contributions
sed-0.176
opes-0.163
lder-0.159
ses-0.155
 Majesty-0.154
Token,
Token ID11
Feature activation+2.68e-01

Pos logit contributions
ak+0.407
He+0.334
 upon+0.304
aky+0.299
fing+0.296
Neg logit contributions
sed-0.531
opes-0.506
lder-0.501
ses-0.466
 Majesty-0.459
Token so
Token ID523
Feature activation+8.69e-01

Pos logit contributions
ak+0.017
He+0.013
he+0.012
aky+0.012
 there+0.011
Neg logit contributions
sed-0.035
opes-0.033
lder-0.032
ses-0.031
 Majesty-0.031
Token he
Token ID339
Feature activation+1.92e+00

Pos logit contributions
ak+0.058
He+0.042
aky+0.039
he+0.039
ill+0.037
Neg logit contributions
sed-0.113
opes-0.107
lder-0.105
 Majesty-0.101
ses-0.101
Token nodded
Token ID14464
Feature activation+1.68e+00

Pos logit contributions
ak+0.156
He+0.132
he+0.122
 there+0.118
ill+0.116
Neg logit contributions
sed-0.217
opes-0.205
lder-0.198
 Majesty-0.191
ses-0.190
Token,
Token ID11
Feature activation+2.72e-01

Pos logit contributions
ak+0.127
He+0.099
he+0.096
aky+0.091
fing+0.087
Neg logit contributions
sed-0.194
opes-0.185
lder-0.178
ses-0.167
zed-0.165
Token bear
Token ID6842
Feature activation+6.77e-03

Pos logit contributions
ak+0.290
He+0.256
he+0.234
 there+0.234
ill+0.231
Neg logit contributions
sed-0.273
opes-0.255
lder-0.245
ses-0.234
 Majesty-0.233
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
ak+0.001
He+0.000
 there+0.000
he+0.000
 like+0.000
Neg logit contributions
sed-0.001
opes-0.001
lder-0.001
ses-0.001
 Majesty-0.001
Token He
Token ID679
Feature activation+1.74e+00

Pos logit contributions
ak+0.099
He+0.080
aky+0.076
he+0.074
 there+0.073
Neg logit contributions
sed-0.127
opes-0.121
lder-0.118
ses-0.114
 Majesty-0.110
Token was
Token ID373
Feature activation-5.13e-01

Pos logit contributions
ak+0.158
He+0.137
 there+0.125
 like+0.123
aky+0.119
Neg logit contributions
sed-0.186
opes-0.176
 Majesty-0.167
lder-0.167
ses-0.165
Token very
Token ID845
Feature activation+1.96e+00

Pos logit contributions
sed+0.058
opes+0.052
ses+0.051
lder+0.050
 Majesty+0.048
Neg logit contributions
ak-0.042
He-0.037
ill-0.036
he-0.035
fing-0.032
Token thirsty
Token ID47124
Feature activation+4.96e+00

Pos logit contributions
ak+0.171
ill+0.152
He+0.149
he+0.145
fing+0.130
Neg logit contributions
sed-0.209
lder-0.186
opes-0.182
ses-0.179
 Majesty-0.178
Token,
Token ID11
Feature activation+5.65e-01

Pos logit contributions
ak+0.424
He+0.357
ill+0.327
 upon+0.325
fing+0.325
Neg logit contributions
sed-0.507
opes-0.485
lder-0.465
ses-0.445
 endings-0.431
Token but
Token ID475
Feature activation+2.07e+00

Pos logit contributions
ak+0.036
He+0.028
he+0.025
 there+0.024
ill+0.023
Neg logit contributions
sed-0.075
opes-0.071
lder-0.069
ses-0.067
 Majesty-0.067
Token the
Token ID262
Feature activation+9.68e-01

Pos logit contributions
ak+0.150
ill+0.118
he+0.116
He+0.113
aky+0.105
Neg logit contributions
sed-0.251
opes-0.235
lder-0.228
ses-0.221
 Majesty-0.220
Token only
Token ID691
Feature activation+5.44e-01

Pos logit contributions
ak+0.092
ill+0.077
He+0.077
he+0.076
aky+0.071
Neg logit contributions
sed-0.096
opes-0.087
lder-0.084
 Majesty-0.083
ses-0.083
Token thing
Token ID1517
Feature activation+1.54e+00

Pos logit contributions
ak+0.053
He+0.046
he+0.046
ill+0.045
fing+0.042
Neg logit contributions
sed-0.053
opes-0.045
lder-0.045
ses-0.045
 Majesty-0.044
Token sky
Token ID6766
Feature activation-1.19e+00

Pos logit contributions
ak+0.107
He+0.096
aky+0.088
fing+0.088
ill+0.088
Neg logit contributions
sed-0.075
opes-0.071
lder-0.069
 Majesty-0.064
ses-0.063
Token and
Token ID290
Feature activation+7.28e-01

Pos logit contributions
sed+0.133
opes+0.128
lder+0.123
 Majesty+0.117
ses+0.116
Neg logit contributions
ak-0.094
He-0.079
aky-0.070
he-0.070
fing-0.069
Token Bob
Token ID5811
Feature activation+9.63e-02

Pos logit contributions
ak+0.057
He+0.046
he+0.042
aky+0.041
ill+0.040
Neg logit contributions
sed-0.085
opes-0.081
lder-0.078
 Majesty-0.076
ses-0.076
Token was
Token ID373
Feature activation-7.09e-01

Pos logit contributions
ak+0.008
He+0.007
he+0.007
fing+0.007
 there+0.007
Neg logit contributions
sed-0.010
opes-0.009
lder-0.009
ses-0.009
 Majesty-0.008
Token very
Token ID845
Feature activation+1.88e+00

Pos logit contributions
sed+0.072
opes+0.067
lder+0.064
ses+0.063
 Majesty+0.062
Neg logit contributions
ak-0.065
He-0.057
he-0.052
ill-0.051
aky-0.050
Token thirsty
Token ID47124
Feature activation+4.95e+00

Pos logit contributions
ak+0.156
He+0.130
ill+0.121
he+0.119
aky+0.113
Neg logit contributions
sed-0.208
lder-0.189
opes-0.186
 Majesty-0.184
ses-0.181
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
ak+0.452
He+0.375
aky+0.345
fing+0.342
 upon+0.342
Neg logit contributions
sed-0.475
opes-0.458
lder-0.441
 endings-0.412
der-0.411
Token He
Token ID679
Feature activation+1.61e+00

Pos logit contributions
ak+0.064
He+0.053
 there+0.045
he+0.043
 like+0.042
Neg logit contributions
sed-0.149
opes-0.142
lder-0.138
ses-0.134
 Majesty-0.134
Token saw
Token ID2497
Feature activation+8.35e-01

Pos logit contributions
ak+0.135
He+0.116
 there+0.105
he+0.103
 upon+0.101
Neg logit contributions
sed-0.180
opes-0.170
lder-0.163
ses-0.159
 Majesty-0.158
Token a
Token ID257
Feature activation+3.89e-01

Pos logit contributions
ak+0.066
He+0.054
he+0.051
ill+0.049
aky+0.049
Neg logit contributions
sed-0.095
opes-0.091
lder-0.087
ses-0.084
 Majesty-0.083
Token big
Token ID1263
Feature activation+3.17e-01

Pos logit contributions
ak+0.037
He+0.031
ill+0.030
he+0.029
aky+0.029
Neg logit contributions
sed-0.038
lder-0.035
opes-0.034
ses-0.033
 Majesty-0.033
Token.
Token ID13
Feature activation+9.33e-01

Pos logit contributions
sed+0.082
opes+0.077
lder+0.075
ses+0.072
 Majesty+0.071
Neg logit contributions
ak-0.053
He-0.045
he-0.040
aky-0.039
 there-0.039
Token The
Token ID383
Feature activation-2.75e-01

Pos logit contributions
ak+0.085
He+0.074
 there+0.067
he+0.066
 like+0.065
Neg logit contributions
sed-0.098
opes-0.092
lder-0.088
ses-0.086
 Majesty-0.086
Token bull
Token ID6473
Feature activation-5.56e-01

Pos logit contributions
sed+0.020
opes+0.018
ses+0.017
 Majesty+0.017
lder+0.016
Neg logit contributions
ak-0.034
He-0.032
 there-0.030
he-0.030
 like-0.029
Token was
Token ID373
Feature activation-6.82e-01

Pos logit contributions
sed+0.065
opes+0.062
 Majesty+0.060
lder+0.059
ses+0.059
Neg logit contributions
ak-0.044
He-0.037
 there-0.034
 like-0.033
he-0.032
Token very
Token ID845
Feature activation+1.84e+00

Pos logit contributions
sed+0.080
opes+0.074
lder+0.071
ses+0.071
 Majesty+0.067
Neg logit contributions
ak-0.053
He-0.046
ill-0.043
he-0.042
 upon-0.040
Token thirsty
Token ID47124
Feature activation+4.94e+00

Pos logit contributions
ak+0.148
He+0.126
ill+0.116
he+0.114
 there+0.111
Neg logit contributions
sed-0.211
opes-0.195
lder-0.193
ses-0.185
 Majesty-0.183
Token and
Token ID290
Feature activation+6.04e-01

Pos logit contributions
ak+0.397
He+0.333
 upon+0.296
ill+0.294
aky+0.293
Neg logit contributions
sed-0.545
opes-0.519
lder-0.503
ses-0.482
 endings-0.462
Token wanted
Token ID2227
Feature activation-3.78e-01

Pos logit contributions
ak+0.046
He+0.040
 there+0.038
 upon+0.035
 like+0.035
Neg logit contributions
sed-0.071
opes-0.069
lder-0.065
 Majesty-0.065
ses-0.064
Token to
Token ID284
Feature activation-5.22e-02

Pos logit contributions
sed+0.047
opes+0.046
lder+0.044
 Majesty+0.043
ses+0.043
Neg logit contributions
ak-0.027
He-0.022
 there-0.021
 like-0.020
 of-0.020
Token drink
Token ID4144
Feature activation+2.18e+00

Pos logit contributions
sed+0.006
opes+0.005
lder+0.005
ses+0.005
 Majesty+0.005
Neg logit contributions
ak-0.005
He-0.004
he-0.003
aky-0.003
ill-0.003
Token some
Token ID617
Feature activation-7.75e-01

Pos logit contributions
ak+0.211
He+0.181
aky+0.174
ill+0.174
he+0.171
Neg logit contributions
sed-0.197
opes-0.189
lder-0.174
ses-0.170
 endings-0.166
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
ak+0.217
He+0.184
aky+0.179
he+0.169
ill+0.169
Neg logit contributions
sed-0.205
opes-0.201
lder-0.185
der-0.181
 endings-0.180
Token The
Token ID383
Feature activation-2.37e-01

Pos logit contributions
ak+0.089
aky+0.070
he+0.067
He+0.067
ill+0.064
Neg logit contributions
sed-0.111
opes-0.103
lder-0.099
ses-0.097
 Majesty-0.096
Token rabbit
Token ID22746
Feature activation-7.23e-01

Pos logit contributions
sed+0.023
opes+0.021
ses+0.020
 Majesty+0.020
lder+0.020
Neg logit contributions
ak-0.023
He-0.022
 there-0.020
 like-0.019
 upon-0.019
Token was
Token ID373
Feature activation-6.65e-01

Pos logit contributions
sed+0.091
opes+0.088
ses+0.084
lder+0.083
 Majesty+0.083
Neg logit contributions
ak-0.051
He-0.042
 there-0.040
 upon-0.037
 like-0.036
Token so
Token ID523
Feature activation+9.01e-01

Pos logit contributions
sed+0.073
opes+0.065
ses+0.063
lder+0.062
 Majesty+0.060
Neg logit contributions
ak-0.055
He-0.049
ill-0.047
he-0.046
fing-0.043
Token thirsty
Token ID47124
Feature activation+4.93e+00

Pos logit contributions
ak+0.073
He+0.061
ill+0.058
he+0.056
fing+0.055
Neg logit contributions
sed-0.100
lder-0.093
opes-0.090
ses-0.088
 Majesty-0.085
Token,
Token ID11
Feature activation+3.65e-01

Pos logit contributions
ak+0.455
He+0.375
aky+0.358
ill+0.342
 upon+0.341
Neg logit contributions
sed-0.494
opes-0.453
lder-0.445
ses-0.425
 endings-0.410
Token he
Token ID339
Feature activation+1.77e+00

Pos logit contributions
ak+0.020
He+0.016
 there+0.013
 like+0.012
he+0.012
Neg logit contributions
sed-0.052
opes-0.050
lder-0.048
 Majesty-0.047
ses-0.047
Token thought
Token ID1807
Feature activation-1.28e+00

Pos logit contributions
ak+0.138
He+0.117
 there+0.107
 upon+0.102
 like+0.100
Neg logit contributions
sed-0.209
opes-0.202
lder-0.191
ses-0.190
 Majesty-0.188
Token he
Token ID339
Feature activation+1.75e+00

Pos logit contributions
sed+0.164
opes+0.157
lder+0.152
ses+0.148
 Majesty+0.148
Neg logit contributions
ak-0.087
He-0.072
 there-0.065
he-0.061
 like-0.061
Token could
Token ID714
Feature activation+1.71e-01

Pos logit contributions
ak+0.113
He+0.094
 there+0.085
 upon+0.078
 like+0.078
Neg logit contributions
sed-0.226
opes-0.221
lder-0.211
 Majesty-0.210
ses-0.207
Token old
Token ID1468
Feature activation-1.50e+00

Pos logit contributions
ak+0.007
 of+0.006
 there+0.006
He+0.005
he+0.005
Neg logit contributions
sed-0.009
opes-0.009
ses-0.008
lder-0.008
 Majesty-0.008
Token.
Token ID13
Feature activation+9.48e-01

Pos logit contributions
sed+0.211
opes+0.202
lder+0.195
 Majesty+0.194
ses+0.192
Neg logit contributions
ak-0.080
He-0.066
 like-0.059
 there-0.058
he-0.053
Token Emma
Token ID18966
Feature activation-3.35e-01

Pos logit contributions
ak+0.070
He+0.065
 there+0.061
he+0.060
 like+0.058
Neg logit contributions
sed-0.114
opes-0.105
 Majesty-0.102
lder-0.101
ses-0.101
Token was
Token ID373
Feature activation-5.34e-01

Pos logit contributions
sed+0.039
opes+0.036
 Majesty+0.035
ses+0.035
 endings+0.033
Neg logit contributions
ak-0.027
He-0.023
 like-0.023
 there-0.022
aky-0.020
Token very
Token ID845
Feature activation+1.98e+00

Pos logit contributions
sed+0.066
opes+0.064
lder+0.061
 Majesty+0.059
ses+0.059
Neg logit contributions
ak-0.038
He-0.032
 there-0.029
 like-0.028
he-0.028
Token thirsty
Token ID47124
Feature activation+4.91e+00

Pos logit contributions
ak+0.163
He+0.139
 there+0.124
he+0.124
ill+0.120
Neg logit contributions
sed-0.225
opes-0.212
lder-0.205
ses-0.199
 Majesty-0.198
Token and
Token ID290
Feature activation+7.78e-01

Pos logit contributions
ak+0.406
He+0.343
 upon+0.303
ill+0.302
aky+0.300
Neg logit contributions
sed-0.529
opes-0.505
lder-0.489
ses-0.467
 endings-0.451
Token went
Token ID1816
Feature activation-1.35e-01

Pos logit contributions
ak+0.060
 there+0.054
He+0.053
 upon+0.050
 like+0.050
Neg logit contributions
sed-0.090
opes-0.087
 Majesty-0.082
ses-0.081
lder-0.081
Token to
Token ID284
Feature activation+9.29e-02

Pos logit contributions
sed+0.016
opes+0.015
lder+0.014
ses+0.014
 Majesty+0.014
Neg logit contributions
ak-0.011
He-0.009
 there-0.008
he-0.008
 upon-0.008
Token the
Token ID262
Feature activation+8.47e-01

Pos logit contributions
ak+0.007
He+0.006
 there+0.006
he+0.005
 upon+0.005
Neg logit contributions
sed-0.011
opes-0.011
lder-0.010
ses-0.010
 Majesty-0.010
Token kitchen
Token ID9592
Feature activation+1.64e-01

Pos logit contributions
ak+0.081
He+0.072
 there+0.067
he+0.066
 like+0.064
Neg logit contributions
sed-0.084
opes-0.080
lder-0.074
ses-0.074
 Majesty-0.072
Token bird
Token ID6512
Feature activation-1.85e-01

Pos logit contributions
sed+0.116
opes+0.106
lder+0.106
ses+0.102
 Majesty+0.100
Neg logit contributions
ak-0.099
He-0.085
he-0.077
ill-0.077
 upon-0.075
Token.
Token ID13
Feature activation+7.51e-01

Pos logit contributions
sed+0.020
opes+0.019
lder+0.018
ses+0.018
 Majesty+0.018
Neg logit contributions
ak-0.016
He-0.014
he-0.012
 there-0.012
ill-0.012
Token He
Token ID679
Feature activation+1.60e+00

Pos logit contributions
ak+0.068
He+0.060
 there+0.055
he+0.054
 like+0.053
Neg logit contributions
sed-0.079
opes-0.075
lder-0.070
 Majesty-0.070
ses-0.069
Token was
Token ID373
Feature activation-6.27e-01

Pos logit contributions
ak+0.146
He+0.126
 there+0.117
 like+0.114
aky+0.109
Neg logit contributions
sed-0.172
opes-0.166
 Majesty-0.163
ses-0.156
 endings-0.152
Token very
Token ID845
Feature activation+1.81e+00

Pos logit contributions
sed+0.077
opes+0.072
lder+0.070
ses+0.068
 Majesty+0.068
Neg logit contributions
ak-0.046
He-0.039
he-0.034
 there-0.034
ill-0.033
Token thirsty
Token ID47124
Feature activation+4.91e+00

Pos logit contributions
ak+0.146
He+0.126
he+0.112
ill+0.111
 there+0.111
Neg logit contributions
sed-0.208
opes-0.194
lder-0.190
ses-0.184
 Majesty-0.182
Token.
Token ID13
Feature activation+7.09e-01

Pos logit contributions
ak+0.409
He+0.348
ill+0.317
 upon+0.317
fing+0.313
Neg logit contributions
sed-0.510
opes-0.488
lder-0.474
ses-0.449
 endings-0.431
Token He
Token ID679
Feature activation+1.57e+00

Pos logit contributions
ak+0.059
He+0.047
he+0.045
aky+0.045
fing+0.042
Neg logit contributions
sed-0.081
opes-0.076
lder-0.074
ses-0.072
 Majesty-0.070
Token knew
Token ID2993
Feature activation-1.13e+00

Pos logit contributions
ak+0.131
He+0.113
 there+0.105
 upon+0.100
 like+0.099
Neg logit contributions
sed-0.177
opes-0.171
 Majesty-0.164
ses-0.161
lder-0.160
Token he
Token ID339
Feature activation+1.78e+00

Pos logit contributions
sed+0.148
opes+0.141
lder+0.137
ses+0.133
 Majesty+0.133
Neg logit contributions
ak-0.073
He-0.060
 there-0.052
he-0.051
 upon-0.049
Token needed
Token ID2622
Feature activation-9.27e-01

Pos logit contributions
ak+0.144
He+0.123
 there+0.112
 like+0.109
he+0.108
Neg logit contributions
sed-0.203
opes-0.195
lder-0.185
 Majesty-0.183
ses-0.181
Token day
Token ID1110
Feature activation-6.17e-01

Pos logit contributions
ak+0.021
He+0.018
 there+0.017
he+0.017
 like+0.016
Neg logit contributions
sed-0.020
opes-0.019
lder-0.018
ses-0.018
 Majesty-0.017
Token.
Token ID13
Feature activation+8.52e-01

Pos logit contributions
sed+0.070
opes+0.067
lder+0.065
ses+0.062
 Majesty+0.061
Neg logit contributions
ak-0.049
He-0.042
he-0.036
aky-0.036
 there-0.036
Token She
Token ID1375
Feature activation+1.04e-01

Pos logit contributions
ak+0.062
He+0.057
 there+0.055
 like+0.053
 upon+0.052
Neg logit contributions
sed-0.101
opes-0.097
 Majesty-0.092
lder-0.091
ses-0.090
Token was
Token ID373
Feature activation-6.00e-01

Pos logit contributions
ak+0.008
He+0.007
 there+0.007
 upon+0.006
he+0.006
Neg logit contributions
sed-0.012
opes-0.012
lder-0.011
ses-0.011
 Majesty-0.011
Token so
Token ID523
Feature activation+8.33e-01

Pos logit contributions
sed+0.064
opes+0.059
lder+0.057
ses+0.056
 Majesty+0.055
Neg logit contributions
ak-0.053
He-0.047
he-0.042
ill-0.041
aky-0.040
Token thirsty
Token ID47124
Feature activation+4.90e+00

Pos logit contributions
ak+0.061
He+0.050
he+0.044
 there+0.044
ill+0.043
Neg logit contributions
sed-0.102
opes-0.097
lder-0.094
ses-0.091
 Majesty-0.091
Token that
Token ID326
Feature activation-1.45e+00

Pos logit contributions
ak+0.438
He+0.368
aky+0.342
ill+0.332
 upon+0.328
Neg logit contributions
sed-0.495
opes-0.463
lder-0.457
ses-0.429
 endings-0.416
Token she
Token ID673
Feature activation+4.46e-01

Pos logit contributions
sed+0.161
opes+0.152
lder+0.144
ses+0.142
 Majesty+0.140
Neg logit contributions
ak-0.122
He-0.108
 there-0.103
 like-0.097
he-0.097
Token thought
Token ID1807
Feature activation-1.21e+00

Pos logit contributions
ak+0.038
He+0.034
 there+0.032
 upon+0.030
 like+0.030
Neg logit contributions
sed-0.049
opes-0.047
ses-0.044
 Majesty-0.044
lder-0.043
Token she
Token ID673
Feature activation+4.25e-01

Pos logit contributions
sed+0.149
opes+0.142
 Majesty+0.136
lder+0.135
ses+0.135
Neg logit contributions
ak-0.087
 there-0.074
He-0.074
 of-0.072
 like-0.068
Token would
Token ID561
Feature activation+1.74e-02

Pos logit contributions
ak+0.027
He+0.023
 there+0.021
 like+0.020
he+0.019
Neg logit contributions
sed-0.055
opes-0.053
 Majesty-0.051
ses-0.051
lder-0.050
Token.
Token ID13
Feature activation+1.05e+00

Pos logit contributions
ak+0.117
He+0.097
 there+0.084
 like+0.083
he+0.080
Neg logit contributions
sed-0.233
opes-0.220
 Majesty-0.212
ses-0.210
lder-0.207
Token Tim
Token ID5045
Feature activation+1.89e-01

Pos logit contributions
ak+0.114
He+0.096
aky+0.093
he+0.091
 there+0.090
Neg logit contributions
sed-0.091
opes-0.087
lder-0.080
ses-0.078
 Majesty-0.073
Tokenmy
Token ID1820
Feature activation+1.70e+00

Pos logit contributions
ak+0.009
He+0.007
fing+0.007
 closer+0.007
ill+0.007
Neg logit contributions
opes-0.025
sed-0.025
lder-0.024
ses-0.023
 Majesty-0.022
Token was
Token ID373
Feature activation-6.49e-01

Pos logit contributions
ak+0.164
He+0.144
 there+0.132
 like+0.129
he+0.129
Neg logit contributions
sed-0.172
opes-0.161
 Majesty-0.152
ses-0.150
lder-0.150
Token very
Token ID845
Feature activation+1.83e+00

Pos logit contributions
sed+0.078
opes+0.074
lder+0.071
ses+0.070
 Majesty+0.069
Neg logit contributions
ak-0.049
He-0.041
he-0.036
 there-0.036
ill-0.035
Token thirsty
Token ID47124
Feature activation+4.90e+00

Pos logit contributions
ak+0.151
He+0.128
he+0.115
 there+0.115
ill+0.113
Neg logit contributions
sed-0.209
opes-0.196
lder-0.190
ses-0.184
 Majesty-0.183
Token and
Token ID290
Feature activation+6.99e-01

Pos logit contributions
ak+0.397
He+0.331
 upon+0.302
ill+0.297
fing+0.293
Neg logit contributions
sed-0.528
opes-0.506
lder-0.490
ses-0.468
 endings-0.451
Token wanted
Token ID2227
Feature activation-3.64e-01

Pos logit contributions
ak+0.053
He+0.045
 there+0.042
 upon+0.039
 like+0.039
Neg logit contributions
sed-0.084
opes-0.080
lder-0.076
 Majesty-0.075
ses-0.075
Token some
Token ID617
Feature activation-7.20e-01

Pos logit contributions
sed+0.044
opes+0.042
lder+0.040
 Majesty+0.039
ses+0.039
Neg logit contributions
ak-0.027
He-0.023
 there-0.021
 like-0.020
he-0.020
Token milk
Token ID7545
Feature activation+2.28e-01

Pos logit contributions
sed+0.069
opes+0.065
lder+0.061
ses+0.060
 Majesty+0.058
Neg logit contributions
ak-0.071
He-0.062
he-0.057
ill-0.056
aky-0.055
Token.
Token ID13
Feature activation+9.97e-01

Pos logit contributions
ak+0.022
He+0.019
aky+0.017
he+0.016
ill+0.016
Neg logit contributions
sed-0.022
opes-0.020
lder-0.019
ses-0.019
yard-0.018
Token town
Token ID3240
Feature activation+2.20e-01

Pos logit contributions
sed+0.082
opes+0.077
 Majesty+0.076
lder+0.075
ses+0.075
Neg logit contributions
ak-0.025
 there-0.023
 like-0.021
He-0.019
 of-0.017
Token until
Token ID1566
Feature activation-3.94e-01

Pos logit contributions
ak+0.015
He+0.012
 there+0.011
 like+0.011
he+0.011
Neg logit contributions
sed-0.028
opes-0.027
lder-0.026
ses-0.026
 Majesty-0.025
Token it
Token ID340
Feature activation-2.77e-02

Pos logit contributions
sed+0.050
opes+0.048
lder+0.045
ses+0.045
 Majesty+0.044
Neg logit contributions
ak-0.027
 there-0.024
He-0.023
 like-0.021
 upon-0.021
Token became
Token ID2627
Feature activation-6.29e-01

Pos logit contributions
sed+0.003
opes+0.003
lder+0.003
ses+0.003
 Majesty+0.003
Neg logit contributions
ak-0.002
He-0.002
 there-0.002
he-0.002
 upon-0.002
Token very
Token ID845
Feature activation+1.85e+00

Pos logit contributions
sed+0.060
opes+0.059
 self+0.054
 Majesty+0.053
lder+0.052
Neg logit contributions
ak-0.050
he-0.049
ill-0.044
He-0.043
fing-0.042
Token thirsty
Token ID47124
Feature activation+4.90e+00

Pos logit contributions
ak+0.160
He+0.133
ill+0.129
he+0.128
aky+0.120
Neg logit contributions
sed-0.190
lder-0.177
opes-0.174
 Majesty-0.167
ses-0.162
Token.
Token ID13
Feature activation+7.26e-01

Pos logit contributions
ak+0.437
He+0.369
fing+0.334
aky+0.331
 upon+0.328
Neg logit contributions
sed-0.480
opes-0.463
lder-0.447
 endings-0.422
ses-0.419
Token (space)
Token ID220
Feature activation+4.26e-01

Pos logit contributions
ak+0.054
He+0.052
 there+0.046
 like+0.045
 upon+0.044
Neg logit contributions
sed-0.084
opes-0.078
lder-0.077
 Majesty-0.075
ses-0.075
Token (newline)
Token ID198
Feature activation+6.87e-01

Pos logit contributions
ak+0.031
He+0.025
aky+0.023
 like+0.022
 there+0.022
Neg logit contributions
sed-0.050
opes-0.048
lder-0.046
ses-0.045
 Majesty-0.044
TokenThen
Token ID6423
Feature activation-1.30e+00

Pos logit contributions
ak+0.054
aky+0.041
 like+0.041
He+0.041
 there+0.040
Neg logit contributions
opes-0.070
sed-0.070
lder-0.070
 Majesty-0.063
llers-0.061
Token it
Token ID340
Feature activation-4.22e-02

Pos logit contributions
sed+0.144
opes+0.136
lder+0.130
ses+0.126
 Majesty+0.126
Neg logit contributions
ak-0.111
He-0.094
he-0.085
aky-0.083
 there-0.082
Tokenggy
Token ID19970
Feature activation+4.17e-01

Pos logit contributions
ak+0.075
He+0.067
 there+0.063
he+0.062
 of+0.061
Neg logit contributions
sed-0.057
opes-0.053
ses-0.050
lder-0.048
 Majesty-0.047
Token was
Token ID373
Feature activation-5.25e-01

Pos logit contributions
ak+0.030
He+0.026
 like+0.025
 there+0.025
 of+0.023
Neg logit contributions
sed-0.050
opes-0.049
ses-0.047
 Majesty-0.046
lder-0.046
Token scared
Token ID12008
Feature activation+1.70e+00

Pos logit contributions
sed+0.063
opes+0.059
lder+0.057
ses+0.056
 Majesty+0.056
Neg logit contributions
ak-0.040
He-0.034
he-0.030
 there-0.030
ill-0.029
Token and
Token ID290
Feature activation+8.20e-01

Pos logit contributions
ak+0.128
He+0.104
he+0.094
ill+0.090
aky+0.090
Neg logit contributions
sed-0.204
opes-0.194
lder-0.185
ses-0.181
 Majesty-0.179
Token very
Token ID845
Feature activation+1.88e+00

Pos logit contributions
ak+0.066
He+0.059
 there+0.058
 like+0.055
 upon+0.054
Neg logit contributions
sed-0.091
opes-0.089
lder-0.084
 Majesty-0.082
ses-0.082
Token thirsty
Token ID47124
Feature activation+4.89e+00

Pos logit contributions
ak+0.136
He+0.115
 there+0.104
 upon+0.099
he+0.099
Neg logit contributions
sed-0.230
opes-0.222
lder-0.212
ses-0.207
 Majesty-0.205
Token.
Token ID13
Feature activation+9.46e-01

Pos logit contributions
ak+0.358
He+0.300
 there+0.269
he+0.265
 like+0.258
Neg logit contributions
sed-0.599
opes-0.569
lder-0.549
ses-0.535
 Majesty-0.534
Token Suddenly
Token ID24975
Feature activation-7.58e-01

Pos logit contributions
ak+0.056
He+0.054
 there+0.045
 upon+0.043
he+0.041
Neg logit contributions
sed-0.126
opes-0.122
lder-0.117
 Majesty-0.116
ses-0.114
Token,
Token ID11
Feature activation+3.55e-01

Pos logit contributions
sed+0.096
opes+0.091
lder+0.088
ses+0.086
 Majesty+0.085
Neg logit contributions
ak-0.053
He-0.043
 there-0.038
he-0.038
 upon-0.037
Token it
Token ID340
Feature activation-4.16e-02

Pos logit contributions
ak+0.027
He+0.022
aky+0.020
ill+0.019
he+0.019
Neg logit contributions
sed-0.043
opes-0.040
ses-0.039
lder-0.039
 Majesty-0.037
Token heard
Token ID2982
Feature activation+5.61e-01

Pos logit contributions
sed+0.005
opes+0.005
lder+0.004
ses+0.004
 Majesty+0.004
Neg logit contributions
ak-0.003
He-0.003
he-0.002
 there-0.002
ill-0.002
Token is
Token ID318
Feature activation-1.07e+00

Pos logit contributions
sed+0.046
opes+0.045
lder+0.044
 Majesty+0.041
ses+0.040
Neg logit contributions
ak-0.039
He-0.034
 there-0.032
 like-0.031
 upon-0.030
Token angry
Token ID7954
Feature activation+7.13e-01

Pos logit contributions
sed+0.118
opes+0.105
ses+0.104
lder+0.095
 Majesty+0.094
Neg logit contributions
ak-0.093
He-0.083
he-0.080
ill-0.075
fing-0.072
Token.
Token ID13
Feature activation-5.74e-01

Pos logit contributions
ak+0.049
He+0.040
 there+0.035
he+0.035
 upon+0.034
Neg logit contributions
sed-0.091
opes-0.087
lder-0.084
ses-0.082
 Majesty-0.081
Token It
Token ID632
Feature activation-5.51e-01

Pos logit contributions
sed+0.059
opes+0.054
lder+0.054
ses+0.052
 Majesty+0.049
Neg logit contributions
ak-0.052
He-0.042
aky-0.042
he-0.041
 upon-0.040
Token his
Token ID465
Feature activation+6.52e-01

Pos logit contributions
sed+0.054
opes+0.053
lder+0.053
 Majesty+0.050
 endings+0.048
Neg logit contributions
ak-0.050
He-0.046
 like-0.042
 there-0.042
 upon-0.040
Tokenses
Token ID8448
Feature activation-7.27e+00

Pos logit contributions
ak+0.098
He+0.087
 like+0.086
 there+0.085
 upon+0.084
Neg logit contributions
opes-0.026
lder-0.025
sed-0.023
 Majesty-0.020
 endings-0.019
Token.
Token ID13
Feature activation-4.45e-01

Pos logit contributions
sed+0.962
opes+0.915
lder+0.881
ses+0.867
 Majesty+0.854
Neg logit contributions
ak-0.458
He-0.368
he-0.323
 there-0.316
 upon-0.307
Token It
Token ID632
Feature activation-5.03e-01

Pos logit contributions
sed+0.043
opes+0.038
lder+0.038
ses+0.038
 Majesty+0.035
Neg logit contributions
ak-0.043
aky-0.036
he-0.034
 upon-0.034
He-0.033
Token scratches
Token ID37627
Feature activation-1.46e+00

Pos logit contributions
sed+0.047
opes+0.046
lder+0.045
 Majesty+0.043
 endings+0.041
Neg logit contributions
ak-0.048
He-0.044
 like-0.041
 there-0.041
 upon-0.039
Token.
Token ID13
Feature activation-5.26e-01

Pos logit contributions
sed+0.148
opes+0.146
ses+0.132
lder+0.131
 endings+0.121
Neg logit contributions
ak-0.127
fing-0.103
He-0.102
aky-0.100
 upon-0.097
Token It
Token ID632
Feature activation-5.83e-01

Pos logit contributions
sed+0.054
opes+0.050
ses+0.048
lder+0.048
 Majesty+0.045
Neg logit contributions
ak-0.048
aky-0.039
He-0.039
he-0.037
 upon-0.037
Token like
Token ID588
Feature activation+2.05e+00

Pos logit contributions
sed+0.019
opes+0.018
lder+0.018
ses+0.017
 Majesty+0.017
Neg logit contributions
ak-0.009
He-0.007
he-0.006
ill-0.006
 upon-0.006
Token that
Token ID326
Feature activation-2.12e+00

Pos logit contributions
ak+0.128
 there+0.120
 like+0.113
 of+0.111
He+0.103
Neg logit contributions
sed-0.261
lder-0.254
opes-0.254
ses-0.237
 Majesty-0.236
Token.
Token ID13
Feature activation-2.74e-01

Pos logit contributions
sed+0.255
opes+0.242
lder+0.233
ses+0.227
 Majesty+0.226
Neg logit contributions
ak-0.160
He-0.134
 there-0.119
he-0.119
 upon-0.115
Token It
Token ID632
Feature activation-5.09e-01

Pos logit contributions
sed+0.026
opes+0.024
ses+0.023
lder+0.023
 Majesty+0.022
Neg logit contributions
ak-0.027
he-0.022
aky-0.022
He-0.022
fing-0.021
Token his
Token ID465
Feature activation+8.58e-01

Pos logit contributions
sed+0.054
opes+0.052
lder+0.051
 Majesty+0.049
ses+0.048
Neg logit contributions
ak-0.044
He-0.039
 there-0.035
 like-0.035
 upon-0.034
Tokenses
Token ID8448
Feature activation-7.23e+00

Pos logit contributions
ak+0.094
He+0.079
 like+0.077
 there+0.077
 upon+0.076
Neg logit contributions
opes-0.070
lder-0.069
sed-0.065
 Majesty-0.060
 endings-0.059
Token and
Token ID290
Feature activation+1.28e-03

Pos logit contributions
sed+0.983
opes+0.946
lder+0.922
 Majesty+0.901
ses+0.892
Neg logit contributions
ak-0.428
He-0.343
 there-0.296
 like-0.288
 of-0.274
Token fl
Token ID781
Feature activation-3.18e+00

Pos logit contributions
ak+0.000
He+0.000
 like+0.000
 there+0.000
 closer+0.000
Neg logit contributions
opes-0.000
lder-0.000
sed-0.000
 Majesty-0.000
 endings-0.000
Tokenaps
Token ID1686
Feature activation+2.64e-01

Pos logit contributions
sed+0.302
ses+0.256
opes+0.240
 Majesty+0.210
lder+0.200
Neg logit contributions
ak-0.308
 there-0.288
He-0.286
he-0.282
 like-0.271
Token its
Token ID663
Feature activation-1.28e+00

Pos logit contributions
ak+0.025
fing+0.021
He+0.020
he+0.020
aking+0.020
Neg logit contributions
sed-0.024
opes-0.023
ses-0.021
lder-0.020
 endings-0.020
Token wings
Token ID12098
Feature activation-2.70e+00

Pos logit contributions
sed+0.094
opes+0.090
lder+0.089
 Majesty+0.082
ses+0.079
Neg logit contributions
ak-0.153
He-0.135
 like-0.126
 there-0.126
 upon-0.124
Token it
Token ID340
Feature activation-6.67e-01

Pos logit contributions
sed+0.018
opes+0.017
lder+0.016
ses+0.016
 Majesty+0.015
Neg logit contributions
ak-0.015
He-0.012
aky-0.011
he-0.011
ill-0.011
Token.
Token ID13
Feature activation-3.70e-01

Pos logit contributions
sed+0.083
opes+0.079
lder+0.076
ses+0.074
 Majesty+0.074
Neg logit contributions
ak-0.048
He-0.040
 there-0.035
he-0.035
 like-0.034
Token The
Token ID383
Feature activation-1.36e+00

Pos logit contributions
sed+0.043
opes+0.039
lder+0.038
ses+0.038
 Majesty+0.037
Neg logit contributions
ak-0.031
He-0.024
aky-0.023
he-0.023
ill-0.022
Token cat
Token ID3797
Feature activation-1.94e+00

Pos logit contributions
opes+0.121
sed+0.114
lder+0.112
 Majesty+0.108
llers+0.104
Neg logit contributions
ak-0.142
He-0.123
 there-0.121
 like-0.118
 upon-0.117
Token his
Token ID465
Feature activation+7.40e-01

Pos logit contributions
sed+0.194
opes+0.192
lder+0.187
 Majesty+0.182
 endings+0.175
Neg logit contributions
ak-0.171
He-0.155
 there-0.146
 like-0.145
 closer-0.138
Tokenses
Token ID8448
Feature activation-7.13e+00

Pos logit contributions
ak+0.115
 like+0.104
He+0.101
 there+0.101
 upon+0.101
Neg logit contributions
opes-0.023
lder-0.023
 endings-0.017
 Majesty-0.017
llers-0.015
Token and
Token ID290
Feature activation-2.12e-04

Pos logit contributions
sed+0.923
opes+0.874
lder+0.838
ses+0.829
 Majesty+0.807
Neg logit contributions
ak-0.464
He-0.369
he-0.339
ill-0.320
 there-0.319
Token scratches
Token ID37627
Feature activation-1.48e+00

Pos logit contributions
lder+0.000
opes+0.000
sed+0.000
 endings+0.000
 Majesty+0.000
Neg logit contributions
ak-0.000
He-0.000
 like-0.000
 there-0.000
 closer-0.000
Token him
Token ID683
Feature activation-7.96e-01

Pos logit contributions
sed+0.151
opes+0.144
ses+0.133
lder+0.130
der+0.121
Neg logit contributions
ak-0.131
fing-0.106
He-0.103
aky-0.103
 upon-0.097
Token.
Token ID13
Feature activation-3.49e-01

Pos logit contributions
sed+0.088
opes+0.084
ses+0.079
lder+0.079
 Majesty+0.074
Neg logit contributions
ak-0.065
He-0.053
fing-0.050
he-0.049
ill-0.048
Token Ben
Token ID3932
Feature activation-3.16e-01

Pos logit contributions
sed+0.039
opes+0.036
ses+0.035
lder+0.035
 Majesty+0.034
Neg logit contributions
ak-0.030
He-0.024
aky-0.023
he-0.023
ill-0.022
Token is
Token ID318
Feature activation-1.15e+00

Pos logit contributions
sed+0.023
opes+0.022
lder+0.021
ses+0.021
 Majesty+0.020
Neg logit contributions
ak-0.017
He-0.015
 there-0.013
he-0.013
ill-0.013
Token mean
Token ID1612
Feature activation+7.93e-01

Pos logit contributions
sed+0.119
opes+0.112
ses+0.105
lder+0.104
 Majesty+0.102
Neg logit contributions
ak-0.103
He-0.088
ill-0.084
he-0.084
 of-0.078
Token.
Token ID13
Feature activation-4.20e-01

Pos logit contributions
ak+0.058
He+0.047
he+0.042
aky+0.041
 there+0.040
Neg logit contributions
sed-0.096
opes-0.091
lder-0.088
ses-0.086
 Majesty-0.084
Token He
Token ID679
Feature activation+5.54e-01

Pos logit contributions
sed+0.051
opes+0.047
ses+0.046
lder+0.046
 Majesty+0.044
Neg logit contributions
ak-0.032
he-0.024
aky-0.023
ill-0.022
fing-0.022
Token his
Token ID465
Feature activation+6.19e-01

Pos logit contributions
ak+0.052
He+0.047
 like+0.043
 there+0.042
 upon+0.041
Neg logit contributions
sed-0.054
opes-0.052
lder-0.050
 Majesty-0.047
ses-0.047
Tokenses
Token ID8448
Feature activation-7.09e+00

Pos logit contributions
ak+0.064
 like+0.058
 upon+0.057
 there+0.055
He+0.055
Neg logit contributions
opes-0.050
lder-0.049
 endings-0.043
 Majesty-0.043
sed-0.043
Token and
Token ID290
Feature activation-1.86e-02

Pos logit contributions
sed+0.944
opes+0.900
lder+0.870
ses+0.851
 Majesty+0.845
Neg logit contributions
ak-0.441
He-0.354
he-0.305
 there-0.304
 upon-0.290
Token runs
Token ID4539
Feature activation-1.38e+00

Pos logit contributions
sed+0.002
lder+0.002
opes+0.002
 Majesty+0.002
 endings+0.001
Neg logit contributions
ak-0.002
He-0.002
 like-0.002
 there-0.001
 upon-0.001
Token away
Token ID1497
Feature activation+1.79e+00

Pos logit contributions
sed+0.169
opes+0.157
lder+0.153
ses+0.150
 Majesty+0.146
Neg logit contributions
ak-0.100
He-0.079
ill-0.071
he-0.070
aky-0.070
Token.
Token ID13
Feature activation-4.21e-01

Pos logit contributions
ak+0.139
He+0.111
he+0.100
aky+0.099
 there+0.097
Neg logit contributions
sed-0.211
opes-0.199
lder-0.192
ses-0.188
 Majesty-0.185
Token He
Token ID679
Feature activation+5.78e-01

Pos logit contributions
sed+0.052
opes+0.048
ses+0.046
lder+0.046
 Majesty+0.045
Neg logit contributions
ak-0.032
aky-0.023
He-0.023
he-0.023
ill-0.022
Token snake
Token ID17522
Feature activation-1.74e+00

Pos logit contributions
sed+0.087
opes+0.086
lder+0.083
 Majesty+0.079
 endings+0.072
Neg logit contributions
ak-0.172
He-0.158
 there-0.149
 like-0.147
 of-0.146
Token sees
Token ID7224
Feature activation-4.52e-01

Pos logit contributions
sed+0.170
opes+0.168
lder+0.163
 Majesty+0.157
 endings+0.149
Neg logit contributions
ak-0.159
He-0.141
 there-0.132
 like-0.128
 upon-0.125
Token Tom
Token ID4186
Feature activation+4.00e-01

Pos logit contributions
sed+0.055
opes+0.052
lder+0.050
 Majesty+0.049
ses+0.049
Neg logit contributions
ak-0.034
He-0.029
 there-0.027
he-0.026
 like-0.025
Token and
Token ID290
Feature activation-3.75e-02

Pos logit contributions
ak+0.031
He+0.026
he+0.023
aky+0.023
fing+0.022
Neg logit contributions
sed-0.046
opes-0.044
lder-0.042
ses-0.042
 Majesty-0.039
Token his
Token ID465
Feature activation+6.66e-01

Pos logit contributions
opes+0.004
sed+0.004
lder+0.004
 Majesty+0.004
 endings+0.004
Neg logit contributions
ak-0.003
He-0.002
 there-0.002
 like-0.002
 of-0.002
Tokenses
Token ID8448
Feature activation-7.08e+00

Pos logit contributions
ak+0.071
He+0.062
 like+0.059
 there+0.058
 upon+0.057
Neg logit contributions
opes-0.055
lder-0.055
sed-0.053
 Majesty-0.051
 endings-0.048
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
sed+0.776
opes+0.716
ses+0.683
lder+0.664
 Majesty+0.632
Neg logit contributions
ak-0.581
He-0.488
he-0.485
 upon-0.448
fing-0.448
Token It
Token ID632
Feature activation-4.14e-01

Pos logit contributions
sed+0.038
opes+0.034
ses+0.034
lder+0.034
 Majesty+0.031
Neg logit contributions
ak-0.033
aky-0.027
He-0.026
 upon-0.026
ill-0.025
Token opens
Token ID9808
Feature activation-9.34e-01

Pos logit contributions
sed+0.039
opes+0.038
lder+0.037
 Majesty+0.036
 endings+0.034
Neg logit contributions
ak-0.040
He-0.036
 like-0.033
 there-0.033
 upon-0.032
Token its
Token ID663
Feature activation-1.40e+00

Pos logit contributions
sed+0.097
opes+0.091
lder+0.087
ses+0.085
 Majesty+0.084
Neg logit contributions
ak-0.086
He-0.074
he-0.068
aky-0.067
 there-0.066
Token mouth
Token ID5422
Feature activation+1.53e-01

Pos logit contributions
sed+0.120
opes+0.107
ses+0.101
lder+0.098
 Majesty+0.097
Neg logit contributions
ak-0.157
He-0.141
he-0.137
 there-0.131
ill-0.129
Token nose
Token ID9686
Feature activation-6.49e-01

Pos logit contributions
sed+0.105
opes+0.094
ses+0.090
lder+0.088
 Majesty+0.083
Neg logit contributions
ak-0.129
He-0.114
 there-0.109
he-0.109
fing-0.109
Token.
Token ID13
Feature activation-2.06e-01

Pos logit contributions
sed+0.064
opes+0.060
lder+0.057
ses+0.055
der+0.055
Neg logit contributions
ak-0.058
He-0.047
fing-0.047
 upon-0.046
he-0.045
Token The
Token ID383
Feature activation-1.31e+00

Pos logit contributions
sed+0.025
opes+0.023
lder+0.022
ses+0.022
 Majesty+0.022
Neg logit contributions
ak-0.016
He-0.013
he-0.012
 there-0.012
aky-0.011
Token cat
Token ID3797
Feature activation-1.95e+00

Pos logit contributions
opes+0.118
sed+0.113
lder+0.112
 Majesty+0.110
 endings+0.099
Neg logit contributions
ak-0.132
He-0.120
 of-0.114
 there-0.113
 like-0.111
Token his
Token ID465
Feature activation+7.33e-01

Pos logit contributions
sed+0.199
opes+0.195
lder+0.189
 Majesty+0.181
 endings+0.179
Neg logit contributions
ak-0.168
He-0.155
 like-0.148
 there-0.143
 closer-0.136
Tokenses
Token ID8448
Feature activation-7.03e+00

Pos logit contributions
ak+0.090
He+0.080
 like+0.078
 of+0.074
 there+0.074
Neg logit contributions
lder-0.048
opes-0.047
 Majesty-0.044
sed-0.042
 endings-0.041
Token and
Token ID290
Feature activation+7.29e-02

Pos logit contributions
sed+0.905
opes+0.854
lder+0.822
ses+0.812
 Majesty+0.793
Neg logit contributions
ak-0.463
He-0.367
he-0.339
 there-0.321
 upon-0.321
Token scratches
Token ID37627
Feature activation-1.51e+00

Pos logit contributions
ak+0.006
He+0.006
 like+0.005
 there+0.005
 closer+0.005
Neg logit contributions
lder-0.007
opes-0.007
sed-0.007
 endings-0.007
 Majesty-0.007
Token his
Token ID465
Feature activation+6.78e-01

Pos logit contributions
sed+0.159
opes+0.150
ses+0.140
lder+0.137
 endings+0.128
Neg logit contributions
ak-0.134
He-0.107
fing-0.107
aky-0.102
 upon-0.100
Token face
Token ID1986
Feature activation+5.12e-01

Pos logit contributions
ak+0.075
He+0.066
 there+0.061
he+0.061
 like+0.061
Neg logit contributions
sed-0.057
opes-0.054
lder-0.052
 Majesty-0.049
ses-0.048
Token.
Token ID13
Feature activation-2.87e-01

Pos logit contributions
ak+0.044
He+0.037
fing+0.035
he+0.035
 upon+0.035
Neg logit contributions
sed-0.053
opes-0.050
lder-0.048
ses-0.046
 Majesty-0.044
Token and
Token ID290
Feature activation+6.25e-02

Pos logit contributions
ak+0.079
He+0.062
he+0.061
fing+0.058
aky+0.056
Neg logit contributions
sed-0.123
opes-0.119
ses-0.109
lder-0.109
 endings-0.108
Token angry
Token ID7954
Feature activation+8.93e-01

Pos logit contributions
ak+0.005
He+0.004
 there+0.004
 like+0.004
 upon+0.004
Neg logit contributions
sed-0.007
opes-0.007
lder-0.007
 Majesty-0.006
ses-0.006
Token.
Token ID13
Feature activation-2.19e-01

Pos logit contributions
ak+0.057
He+0.050
 there+0.045
 like+0.044
 of+0.042
Neg logit contributions
sed-0.117
opes-0.112
lder-0.110
 Majesty-0.106
ses-0.105
Token It
Token ID632
Feature activation-3.98e-01

Pos logit contributions
sed+0.022
opes+0.021
lder+0.020
ses+0.020
 Majesty+0.019
Neg logit contributions
ak-0.021
He-0.017
he-0.016
aky-0.016
ill-0.016
Token his
Token ID465
Feature activation+7.08e-01

Pos logit contributions
sed+0.037
opes+0.036
lder+0.035
 Majesty+0.033
ses+0.032
Neg logit contributions
ak-0.039
He-0.035
 like-0.032
 there-0.032
 upon-0.031
Tokenses
Token ID8448
Feature activation-7.01e+00

Pos logit contributions
ak+0.101
He+0.088
 like+0.087
 there+0.087
 upon+0.085
Neg logit contributions
opes-0.034
lder-0.033
 Majesty-0.028
sed-0.026
storms-0.026
Token and
Token ID290
Feature activation+8.16e-02

Pos logit contributions
sed+0.946
opes+0.906
lder+0.879
 Majesty+0.857
ses+0.855
Neg logit contributions
ak-0.424
He-0.341
 there-0.292
he-0.281
 like-0.279
Token fl
Token ID781
Feature activation-3.27e+00

Pos logit contributions
ak+0.007
He+0.007
 like+0.007
 there+0.006
 closer+0.006
Neg logit contributions
lder-0.008
opes-0.008
sed-0.007
 Majesty-0.007
 endings-0.007
Tokenaps
Token ID1686
Feature activation+1.80e-01

Pos logit contributions
sed+0.365
ses+0.326
opes+0.302
 Majesty+0.263
lder+0.259
Neg logit contributions
ak-0.275
He-0.250
 there-0.249
he-0.241
 like-0.233
Token its
Token ID663
Feature activation-1.32e+00

Pos logit contributions
ak+0.019
he+0.016
fing+0.016
He+0.016
m+0.016
Neg logit contributions
sed-0.015
opes-0.014
ses-0.013
 endings-0.012
lder-0.012
Token wings
Token ID12098
Feature activation-2.62e+00

Pos logit contributions
sed+0.099
opes+0.095
lder+0.093
 Majesty+0.087
ses+0.082
Neg logit contributions
ak-0.158
He-0.140
 there-0.130
 like-0.129
aky-0.127
Token claws
Token ID28421
Feature activation-1.41e+00

Pos logit contributions
ak+0.092
He+0.083
 there+0.079
 like+0.077
 upon+0.075
Neg logit contributions
sed-0.103
opes-0.101
lder-0.100
 endings-0.094
 Majesty-0.093
Token and
Token ID290
Feature activation+2.66e-01

Pos logit contributions
sed+0.163
opes+0.152
ses+0.144
lder+0.142
 Majesty+0.135
Neg logit contributions
ak-0.112
He-0.089
aky-0.087
he-0.085
fing-0.085
Token bites
Token ID26081
Feature activation-3.53e+00

Pos logit contributions
ak+0.024
He+0.021
 there+0.020
 like+0.020
 upon+0.019
Neg logit contributions
sed-0.026
opes-0.026
lder-0.025
 Majesty-0.024
ses-0.023
Token and
Token ID290
Feature activation+2.19e-01

Pos logit contributions
sed+0.327
opes+0.316
ses+0.283
lder+0.273
der+0.265
Neg logit contributions
ak-0.338
fing-0.272
aky-0.270
he-0.270
He-0.268
Token his
Token ID465
Feature activation+9.49e-01

Pos logit contributions
ak+0.019
He+0.017
 there+0.016
 like+0.015
 of+0.014
Neg logit contributions
sed-0.023
opes-0.023
lder-0.022
 Majesty-0.021
ses-0.020
Tokenses
Token ID8448
Feature activation-6.99e+00

Pos logit contributions
ak+0.096
 there+0.078
He+0.077
 closer+0.073
 like+0.073
Neg logit contributions
lder-0.084
opes-0.083
 Majesty-0.079
 endings-0.075
sed-0.075
Token at
Token ID379
Feature activation+1.19e-01

Pos logit contributions
sed+0.869
opes+0.827
lder+0.798
ses+0.778
 Majesty+0.774
Neg logit contributions
ak-0.496
He-0.411
 there-0.362
he-0.359
 upon-0.346
Token the
Token ID262
Feature activation+2.29e-01

Pos logit contributions
ak+0.009
He+0.007
 there+0.007
he+0.007
 like+0.007
Neg logit contributions
sed-0.015
opes-0.014
lder-0.013
 Majesty-0.013
ses-0.013
Token bars
Token ID9210
Feature activation-1.17e+00

Pos logit contributions
ak+0.020
 like+0.017
 there+0.017
 of+0.016
He+0.016
Neg logit contributions
opes-0.022
zed-0.021
lder-0.021
llers-0.020
 Majesty-0.020
Token.
Token ID13
Feature activation+1.78e-01

Pos logit contributions
sed+0.135
opes+0.127
lder+0.121
ses+0.120
 Majesty+0.116
Neg logit contributions
ak-0.092
He-0.078
he-0.069
 there-0.069
 upon-0.068
Token He
Token ID679
Feature activation+9.82e-01

Pos logit contributions
ak+0.013
He+0.012
 there+0.011
he+0.011
 like+0.010
Neg logit contributions
sed-0.021
opes-0.020
lder-0.019
 Majesty-0.019
ses-0.019
Token
Token ID198
Feature activation+2.20e-01

Pos logit contributions
ak+0.031
aky+0.026
 closer+0.026
fing+0.025
He+0.025
Neg logit contributions
sed-0.028
opes-0.026
ses-0.025
lder-0.024
 Majesty-0.022
Token
Token ID198
Feature activation+2.49e-01

Pos logit contributions
ak+0.017
He+0.014
aky+0.014
ill+0.014
he+0.013
Neg logit contributions
sed-0.024
opes-0.022
lder-0.022
ses-0.021
 Majesty-0.020
TokenThe
Token ID464
Feature activation-1.92e-01

Pos logit contributions
ak+0.020
ill+0.018
he+0.016
 upon+0.016
 closer+0.016
Neg logit contributions
lder-0.024
opes-0.024
sed-0.023
llers-0.021
 Majesty-0.020
Token snake
Token ID17522
Feature activation-1.52e+00

Pos logit contributions
opes+0.016
sed+0.015
lder+0.015
 Majesty+0.014
llers+0.014
Neg logit contributions
ak-0.020
 there-0.019
He-0.019
 of-0.018
 closer-0.018
Token his
Token ID465
Feature activation+9.66e-01

Pos logit contributions
sed+0.164
opes+0.162
lder+0.159
 Majesty+0.154
ses+0.147
Neg logit contributions
ak-0.123
He-0.108
 there-0.104
 like-0.097
 upon-0.094
Tokenses
Token ID8448
Feature activation-6.97e+00

Pos logit contributions
ak+0.123
 like+0.106
 there+0.106
 upon+0.104
He+0.103
Neg logit contributions
lder-0.061
opes-0.055
llers-0.052
storms-0.052
zed-0.052
Token.
Token ID13
Feature activation+4.07e-01

Pos logit contributions
sed+0.907
opes+0.863
lder+0.831
ses+0.816
 Majesty+0.803
Neg logit contributions
ak-0.453
He-0.367
he-0.325
 there-0.318
 upon-0.309
Token L
Token ID406
Feature activation+4.68e-01

Pos logit contributions
ak+0.038
He+0.031
aky+0.030
he+0.029
 upon+0.029
Neg logit contributions
sed-0.043
opes-0.039
ses-0.038
lder-0.038
 Majesty-0.036
Tokenila
Token ID10102
Feature activation+1.99e+00

Pos logit contributions
ak+0.028
He+0.024
 like+0.020
aky+0.019
 closer+0.019
Neg logit contributions
sed-0.059
opes-0.057
lder-0.056
ses-0.054
 Majesty-0.052
Token and
Token ID290
Feature activation+1.84e-01

Pos logit contributions
ak+0.106
He+0.082
 there+0.071
 like+0.069
 of+0.062
Neg logit contributions
sed-0.279
opes-0.268
lder-0.265
 Majesty-0.264
 endings-0.257
Token Tom
Token ID4186
Feature activation+9.34e-01

Pos logit contributions
ak+0.012
He+0.010
 there+0.009
 like+0.009
he+0.009
Neg logit contributions
sed-0.023
opes-0.023
lder-0.022
 Majesty-0.021
 endings-0.021
Token scare
Token ID19437
Feature activation-1.73e-01

Pos logit contributions
sed+0.034
opes+0.032
lder+0.031
ses+0.030
 Majesty+0.030
Neg logit contributions
ak-0.029
He-0.026
 upon-0.023
he-0.023
 there-0.023
Token it
Token ID340
Feature activation-4.66e-01

Pos logit contributions
sed+0.020
opes+0.019
lder+0.018
ses+0.018
 Majesty+0.018
Neg logit contributions
ak-0.014
He-0.011
he-0.010
 there-0.010
 upon-0.010
Token.
Token ID13
Feature activation+2.81e-01

Pos logit contributions
sed+0.060
opes+0.058
lder+0.055
ses+0.055
 Majesty+0.054
Neg logit contributions
ak-0.030
He-0.024
he-0.021
aky-0.020
ill-0.020
Token It
Token ID632
Feature activation-2.45e-01

Pos logit contributions
ak+0.021
He+0.019
 there+0.017
 like+0.016
 upon+0.016
Neg logit contributions
sed-0.033
opes-0.032
lder-0.031
 Majesty-0.029
ses-0.029
Token his
Token ID465
Feature activation+1.01e+00

Pos logit contributions
sed+0.023
opes+0.022
lder+0.022
 Majesty+0.020
ses+0.020
Neg logit contributions
ak-0.024
He-0.021
 like-0.020
 there-0.020
 upon-0.019
Tokenses
Token ID8448
Feature activation-6.94e+00

Pos logit contributions
ak+0.107
 like+0.093
 there+0.092
 of+0.089
 upon+0.088
Neg logit contributions
lder-0.074
opes-0.071
 Majesty-0.070
zed-0.068
bank-0.066
Token and
Token ID290
Feature activation+3.10e-01

Pos logit contributions
sed+0.924
opes+0.881
lder+0.851
ses+0.833
 Majesty+0.827
Neg logit contributions
ak-0.432
He-0.347
he-0.299
 there-0.298
 upon-0.285
Token shows
Token ID2523
Feature activation-1.54e+00

Pos logit contributions
ak+0.027
He+0.025
 there+0.023
 like+0.023
 closer+0.022
Neg logit contributions
lder-0.030
opes-0.029
sed-0.029
 endings-0.028
 Majesty-0.028
Token its
Token ID663
Feature activation-1.09e+00

Pos logit contributions
sed+0.192
opes+0.182
lder+0.178
ses+0.171
 Majesty+0.171
Neg logit contributions
ak-0.111
He-0.094
 there-0.084
 like-0.081
he-0.080
Token teeth
Token ID9941
Feature activation-1.83e+00

Pos logit contributions
lder+0.098
sed+0.096
 Majesty+0.093
opes+0.093
 endings+0.090
Neg logit contributions
ak-0.104
He-0.088
 there-0.087
 like-0.084
 of-0.083
Token.
Token ID13
Feature activation+1.89e-01

Pos logit contributions
sed+0.187
opes+0.178
ses+0.164
lder+0.162
der+0.162
Neg logit contributions
ak-0.153
He-0.127
he-0.121
fing-0.120
aky-0.119
Token is
Token ID318
Feature activation-8.94e-01

Pos logit contributions
sed+0.177
opes+0.174
lder+0.168
 Majesty+0.160
 endings+0.156
Neg logit contributions
ak-0.153
He-0.138
 there-0.128
 like-0.127
 closer-0.122
Token afraid
Token ID7787
Feature activation+9.38e-01

Pos logit contributions
sed+0.100
ses+0.088
opes+0.086
lder+0.080
 Majesty+0.078
Neg logit contributions
ak-0.072
he-0.065
He-0.064
ill-0.061
fing-0.059
Token.
Token ID13
Feature activation-1.43e-01

Pos logit contributions
ak+0.063
He+0.049
he+0.045
aky+0.042
 there+0.042
Neg logit contributions
sed-0.120
opes-0.115
lder-0.109
ses-0.108
 Majesty-0.106
Token It
Token ID632
Feature activation-4.14e-01

Pos logit contributions
sed+0.014
ses+0.012
opes+0.012
lder+0.012
 Majesty+0.011
Neg logit contributions
ak-0.015
aky-0.012
he-0.012
He-0.011
 upon-0.011
Token his
Token ID465
Feature activation+8.56e-01

Pos logit contributions
sed+0.040
opes+0.039
lder+0.038
 Majesty+0.035
 endings+0.035
Neg logit contributions
ak-0.039
He-0.036
 like-0.033
 there-0.032
 upon-0.032
Tokenses
Token ID8448
Feature activation-6.94e+00

Pos logit contributions
ak+0.120
 like+0.111
 there+0.106
 upon+0.106
 of+0.105
Neg logit contributions
lder-0.039
opes-0.038
 Majesty-0.031
 endings-0.030
zed-0.030
Token and
Token ID290
Feature activation+1.41e-01

Pos logit contributions
sed+0.929
opes+0.889
lder+0.862
ses+0.838
 Majesty+0.838
Neg logit contributions
ak-0.427
He-0.344
 there-0.295
he-0.287
 like-0.281
Token scratches
Token ID37627
Feature activation-1.31e+00

Pos logit contributions
ak+0.012
He+0.011
 like+0.011
 there+0.010
 closer+0.010
Neg logit contributions
lder-0.014
opes-0.014
 endings-0.013
 Majesty-0.013
sed-0.013
Token Tom
Token ID4186
Feature activation+6.44e-01

Pos logit contributions
sed+0.143
opes+0.137
ses+0.127
lder+0.126
 endings+0.118
Neg logit contributions
ak-0.109
He-0.089
fing-0.084
aky-0.083
 upon-0.083
Token's
Token ID338
Feature activation-7.05e-01

Pos logit contributions
ak+0.050
He+0.043
he+0.038
fing+0.038
ill+0.037
Neg logit contributions
sed-0.073
opes-0.070
lder-0.066
ses-0.066
 Majesty-0.063
Token hand
Token ID1021
Feature activation+1.28e+00

Pos logit contributions
opes+0.080
lder+0.079
sed+0.079
 Majesty+0.074
 endings+0.070
Neg logit contributions
ak-0.055
He-0.046
 like-0.041
 there-0.041
 of-0.040
Token is
Token ID318
Feature activation-9.18e-01

Pos logit contributions
sed+0.181
opes+0.176
lder+0.170
 Majesty+0.161
ses+0.157
Neg logit contributions
ak-0.160
He-0.144
 there-0.133
 like-0.132
 closer-0.127
Token afraid
Token ID7787
Feature activation+9.04e-01

Pos logit contributions
sed+0.101
ses+0.088
opes+0.084
 Majesty+0.077
lder+0.077
Neg logit contributions
ak-0.074
he-0.070
He-0.067
ill-0.065
fing-0.062
Token.
Token ID13
Feature activation-1.74e-01

Pos logit contributions
ak+0.062
He+0.050
he+0.045
aky+0.043
 there+0.043
Neg logit contributions
sed-0.114
opes-0.109
lder-0.104
ses-0.102
 Majesty-0.101
Token It
Token ID632
Feature activation-4.10e-01

Pos logit contributions
sed+0.017
ses+0.016
opes+0.015
lder+0.015
 Majesty+0.014
Neg logit contributions
ak-0.017
aky-0.014
He-0.014
he-0.014
 upon-0.013
Token his
Token ID465
Feature activation+8.15e-01

Pos logit contributions
sed+0.039
opes+0.038
lder+0.037
 Majesty+0.035
 endings+0.033
Neg logit contributions
ak-0.039
He-0.036
 like-0.033
 there-0.033
 upon-0.032
Tokenses
Token ID8448
Feature activation-6.91e+00

Pos logit contributions
ak+0.116
 like+0.106
 there+0.101
 closer+0.101
 of+0.100
Neg logit contributions
lder-0.035
opes-0.035
 endings-0.028
zed-0.028
 Majesty-0.027
Token and
Token ID290
Feature activation+1.57e-01

Pos logit contributions
sed+0.910
opes+0.866
lder+0.834
ses+0.820
 Majesty+0.811
Neg logit contributions
ak-0.439
He-0.353
he-0.309
 there-0.304
 upon-0.293
Token scratches
Token ID37627
Feature activation-1.42e+00

Pos logit contributions
ak+0.013
He+0.012
 like+0.012
 there+0.012
 closer+0.011
Neg logit contributions
lder-0.016
opes-0.015
 endings-0.015
sed-0.014
 Majesty-0.014
Token Tom
Token ID4186
Feature activation+5.88e-01

Pos logit contributions
sed+0.153
opes+0.147
ses+0.137
lder+0.135
 endings+0.127
Neg logit contributions
ak-0.118
He-0.096
fing-0.091
aky-0.090
 upon-0.090
Token's
Token ID338
Feature activation-7.45e-01

Pos logit contributions
ak+0.046
He+0.040
he+0.036
fing+0.036
ill+0.035
Neg logit contributions
sed-0.066
opes-0.062
lder-0.059
ses-0.059
 Majesty-0.056
Token hand
Token ID1021
Feature activation+1.22e+00

Pos logit contributions
opes+0.087
sed+0.086
lder+0.086
 Majesty+0.080
 endings+0.076
Neg logit contributions
ak-0.056
He-0.046
 there-0.041
 like-0.041
 of-0.040
Token under
Token ID739
Feature activation-7.06e-01

Pos logit contributions
sed+0.076
opes+0.072
lder+0.068
ses+0.068
 Majesty+0.066
Neg logit contributions
ak-0.045
He-0.036
he-0.032
aky-0.031
ill-0.031
Token the
Token ID262
Feature activation-2.48e-01

Pos logit contributions
sed+0.102
lder+0.102
opes+0.100
 Majesty+0.100
 endings+0.094
Neg logit contributions
ak-0.027
 there-0.026
He-0.026
 like-0.022
 of-0.021
Token bed
Token ID3996
Feature activation-7.09e-01

Pos logit contributions
sed+0.020
opes+0.019
lder+0.018
 Majesty+0.017
ses+0.017
Neg logit contributions
ak-0.028
He-0.026
he-0.024
 there-0.024
 of-0.024
Token and
Token ID290
Feature activation-2.56e-01

Pos logit contributions
sed+0.093
opes+0.088
lder+0.085
ses+0.083
 Majesty+0.083
Neg logit contributions
ak-0.046
He-0.037
 there-0.032
he-0.032
 like-0.031
Token his
Token ID465
Feature activation+5.08e-01

Pos logit contributions
sed+0.029
opes+0.029
lder+0.028
 Majesty+0.027
ses+0.026
Neg logit contributions
ak-0.019
He-0.017
 there-0.016
 like-0.016
 upon-0.014
Tokensed
Token ID36622
Feature activation-6.90e+00

Pos logit contributions
ak+0.046
He+0.039
 there+0.038
 like+0.037
 of+0.036
Neg logit contributions
lder-0.051
opes-0.049
 Majesty-0.046
sed-0.045
 endings-0.044
Token at
Token ID379
Feature activation-2.58e-02

Pos logit contributions
sed+0.860
opes+0.817
lder+0.784
ses+0.768
 Majesty+0.758
Neg logit contributions
ak-0.486
He-0.399
he-0.357
 there-0.349
 upon-0.339
Token them
Token ID606
Feature activation-7.88e-01

Pos logit contributions
sed+0.003
lder+0.003
opes+0.003
 Majesty+0.003
ses+0.003
Neg logit contributions
ak-0.002
He-0.002
 there-0.002
 like-0.002
he-0.002
Token.
Token ID13
Feature activation-5.81e-01

Pos logit contributions
sed+0.098
opes+0.093
lder+0.090
 Majesty+0.088
ses+0.088
Neg logit contributions
ak-0.056
He-0.047
 there-0.042
he-0.041
 like-0.040
Token Lily
Token ID20037
Feature activation-1.04e+00

Pos logit contributions
sed+0.073
opes+0.072
lder+0.070
 Majesty+0.067
ses+0.065
Neg logit contributions
ak-0.036
He-0.036
 there-0.030
 like-0.028
 upon-0.027
Token and
Token ID290
Feature activation-2.65e-01

Pos logit contributions
sed+0.138
opes+0.133
lder+0.128
 Majesty+0.126
ses+0.125
Neg logit contributions
ak-0.066
He-0.054
 there-0.047
he-0.045
 like-0.044
Token slide
Token ID10649
Feature activation+1.49e+00

Pos logit contributions
ak+0.009
He+0.008
he+0.007
 like+0.007
 there+0.007
Neg logit contributions
sed-0.011
opes-0.011
lder-0.011
 Majesty-0.010
ses-0.010
Token.
Token ID13
Feature activation-3.03e-01

Pos logit contributions
ak+0.125
He+0.104
aky+0.095
fing+0.093
 there+0.093
Neg logit contributions
sed-0.162
opes-0.155
lder-0.147
ses-0.143
 Majesty-0.138
Token The
Token ID383
Feature activation-1.17e+00

Pos logit contributions
sed+0.031
ses+0.027
lder+0.027
opes+0.026
 Majesty+0.024
Neg logit contributions
ak-0.028
aky-0.024
he-0.022
 upon-0.022
fing-0.022
Token cat
Token ID3797
Feature activation-1.80e+00

Pos logit contributions
opes+0.101
sed+0.096
lder+0.096
 Majesty+0.095
 endings+0.089
Neg logit contributions
ak-0.122
He-0.107
 there-0.104
 of-0.102
 like-0.100
Token his
Token ID465
Feature activation+8.98e-01

Pos logit contributions
sed+0.182
opes+0.172
lder+0.166
 Majesty+0.160
ses+0.159
Neg logit contributions
ak-0.167
He-0.147
 there-0.135
 like-0.132
he-0.131
Tokenses
Token ID8448
Feature activation-6.89e+00

Pos logit contributions
ak+0.105
He+0.091
 like+0.089
 there+0.087
 upon+0.086
Neg logit contributions
opes-0.064
lder-0.063
sed-0.058
 endings-0.056
 Majesty-0.056
Token at
Token ID379
Feature activation+1.34e-01

Pos logit contributions
sed+0.853
opes+0.800
ses+0.762
lder+0.748
 Majesty+0.720
Neg logit contributions
ak-0.478
He-0.391
he-0.380
fing-0.351
ill-0.350
Token him
Token ID683
Feature activation-6.13e-01

Pos logit contributions
ak+0.011
He+0.009
 there+0.008
he+0.008
 like+0.008
Neg logit contributions
sed-0.016
opes-0.015
lder-0.014
 Majesty-0.014
ses-0.014
Token.
Token ID13
Feature activation-2.40e-01

Pos logit contributions
sed+0.064
opes+0.061
lder+0.057
ses+0.057
 Majesty+0.055
Neg logit contributions
ak-0.054
He-0.045
aky-0.042
he-0.041
fing-0.041
Token Tom
Token ID4186
Feature activation+6.14e-01

Pos logit contributions
sed+0.025
ses+0.023
opes+0.022
lder+0.022
 Majesty+0.020
Neg logit contributions
ak-0.022
aky-0.019
he-0.017
He-0.017
 upon-0.017
Token is
Token ID318
Feature activation-7.42e-01

Pos logit contributions
ak+0.062
He+0.055
 there+0.051
he+0.050
 like+0.050
Neg logit contributions
sed-0.057
opes-0.054
lder-0.051
 Majesty-0.050
ses-0.049
Token cat
Token ID3797
Feature activation-2.07e+00

Pos logit contributions
lder+0.141
opes+0.139
sed+0.133
 Majesty+0.132
zed+0.125
Neg logit contributions
ak-0.160
He-0.139
 like-0.137
 there-0.135
 of-0.134
Token woke
Token ID19092
Feature activation-2.06e+00

Pos logit contributions
sed+0.241
opes+0.232
lder+0.223
 Majesty+0.221
ses+0.216
Neg logit contributions
ak-0.164
He-0.139
 there-0.126
he-0.121
 like-0.121
Token up
Token ID510
Feature activation-1.82e+00

Pos logit contributions
sed+0.239
opes+0.221
lder+0.220
ses+0.212
 Majesty+0.209
Neg logit contributions
ak-0.160
He-0.144
 there-0.134
he-0.130
 like-0.127
Token and
Token ID290
Feature activation-2.41e-01

Pos logit contributions
sed+0.226
opes+0.212
lder+0.205
ses+0.203
 Majesty+0.198
Neg logit contributions
ak-0.129
He-0.103
aky-0.091
he-0.090
ill-0.088
Token his
Token ID465
Feature activation+4.29e-01

Pos logit contributions
sed+0.026
opes+0.026
lder+0.025
 Majesty+0.024
ses+0.023
Neg logit contributions
ak-0.020
He-0.017
 there-0.017
 like-0.016
 upon-0.015
Tokensed
Token ID36622
Feature activation-6.86e+00

Pos logit contributions
ak+0.043
 there+0.037
 of+0.036
 like+0.035
He+0.035
Neg logit contributions
lder-0.038
opes-0.036
 Majesty-0.035
 endings-0.033
storms-0.032
Token.
Token ID13
Feature activation-6.55e-01

Pos logit contributions
sed+0.815
opes+0.777
lder+0.733
ses+0.723
 Majesty+0.702
Neg logit contributions
ak-0.513
He-0.421
he-0.389
fing-0.372
 upon-0.370
Token He
Token ID679
Feature activation+2.84e-01

Pos logit contributions
sed+0.079
opes+0.074
lder+0.072
ses+0.070
 Majesty+0.069
Neg logit contributions
ak-0.049
He-0.039
he-0.036
aky-0.036
 there-0.035
Token jumped
Token ID11687
Feature activation-8.41e-01

Pos logit contributions
ak+0.024
He+0.020
 there+0.018
 like+0.018
he+0.018
Neg logit contributions
sed-0.032
opes-0.030
lder-0.029
 Majesty-0.028
ses-0.028
Token off
Token ID572
Feature activation-9.60e-02

Pos logit contributions
sed+0.105
opes+0.098
ses+0.093
lder+0.093
 Majesty+0.089
Neg logit contributions
ak-0.059
He-0.046
aky-0.042
fing-0.042
ill-0.041
Token the
Token ID262
Feature activation-3.29e-01

Pos logit contributions
sed+0.010
opes+0.010
lder+0.009
ses+0.009
 Majesty+0.009
Neg logit contributions
ak-0.008
He-0.006
aky-0.006
fing-0.006
aking-0.006
Token angry
Token ID7954
Feature activation+9.82e-01

Pos logit contributions
sed+0.023
opes+0.021
ses+0.020
lder+0.019
 Majesty+0.019
Neg logit contributions
ak-0.020
He-0.017
he-0.017
ill-0.016
 upon-0.016
Token snake
Token ID17522
Feature activation-1.52e+00

Pos logit contributions
ak+0.086
He+0.075
 there+0.071
 like+0.069
 of+0.066
Neg logit contributions
sed-0.103
opes-0.100
lder-0.094
ses-0.090
 Majesty-0.090
Token.
Token ID13
Feature activation-4.41e-02

Pos logit contributions
sed+0.188
opes+0.187
lder+0.179
 Majesty+0.177
ses+0.171
Neg logit contributions
ak-0.099
 there-0.081
He-0.080
 like-0.077
 of-0.073
Token It
Token ID632
Feature activation-2.64e-01

Pos logit contributions
sed+0.004
opes+0.004
lder+0.004
ses+0.004
 Majesty+0.004
Neg logit contributions
ak-0.004
He-0.003
aky-0.003
he-0.003
 upon-0.003
Token his
Token ID465
Feature activation+8.24e-01

Pos logit contributions
sed+0.024
opes+0.024
lder+0.024
 Majesty+0.023
 endings+0.022
Neg logit contributions
ak-0.025
He-0.023
 like-0.021
 there-0.021
 upon-0.020
Tokenses
Token ID8448
Feature activation-6.86e+00

Pos logit contributions
ak+0.101
He+0.085
 like+0.083
 there+0.083
 upon+0.081
Neg logit contributions
opes-0.056
lder-0.056
sed-0.051
 Majesty-0.049
 endings-0.048
Token and
Token ID290
Feature activation+1.87e-01

Pos logit contributions
sed+0.931
opes+0.892
lder+0.866
 Majesty+0.845
ses+0.842
Neg logit contributions
ak-0.411
He-0.330
 there-0.282
he-0.271
 like-0.270
Token snaps
Token ID23429
Feature activation-8.62e-01

Pos logit contributions
ak+0.017
He+0.015
 like+0.015
 there+0.014
 closer+0.013
Neg logit contributions
lder-0.018
opes-0.017
 endings-0.017
 Majesty-0.017
sed-0.017
Token at
Token ID379
Feature activation+1.18e-01

Pos logit contributions
sed+0.108
opes+0.101
ses+0.096
lder+0.092
 Majesty+0.090
Neg logit contributions
ak-0.062
He-0.050
he-0.049
aky-0.045
fing-0.044
Token Ben
Token ID3932
Feature activation-1.75e-01

Pos logit contributions
ak+0.010
He+0.008
 there+0.007
he+0.007
 upon+0.007
Neg logit contributions
sed-0.014
opes-0.013
lder-0.012
ses-0.012
 Majesty-0.012
Token.
Token ID13
Feature activation-8.64e-02

Pos logit contributions
sed+0.021
opes+0.020
lder+0.019
ses+0.018
 Majesty+0.018
Neg logit contributions
ak-0.013
He-0.011
he-0.010
 there-0.010
ill-0.010
Token she
Token ID673
Feature activation-3.46e-01

Pos logit contributions
sed+0.122
opes+0.117
lder+0.113
 Majesty+0.109
ses+0.108
Neg logit contributions
ak-0.075
He-0.065
 there-0.059
 like-0.057
he-0.057
Token sque
Token ID35776
Feature activation-1.44e+00

Pos logit contributions
sed+0.039
opes+0.037
lder+0.035
ses+0.034
 Majesty+0.033
Neg logit contributions
ak-0.028
He-0.024
he-0.022
 there-0.022
 upon-0.021
Tokenaked
Token ID4335
Feature activation-1.29e+00

Pos logit contributions
opes+0.127
sed+0.127
lder+0.119
ses+0.117
 endings+0.116
Neg logit contributions
He-0.112
 there-0.103
 closer-0.103
 upon-0.101
 like-0.099
Token and
Token ID290
Feature activation-1.73e-01

Pos logit contributions
sed+0.152
opes+0.145
lder+0.138
ses+0.135
 Majesty+0.134
Neg logit contributions
ak-0.099
He-0.082
he-0.074
aky-0.071
 there-0.070
Token his
Token ID465
Feature activation+6.20e-01

Pos logit contributions
opes+0.019
lder+0.018
sed+0.018
 Majesty+0.017
 endings+0.017
Neg logit contributions
ak-0.014
 there-0.012
He-0.012
 like-0.011
 closer-0.010
Tokensed
Token ID36622
Feature activation-6.83e+00

Pos logit contributions
ak+0.045
 upon+0.039
 there+0.038
 of+0.037
 like+0.037
Neg logit contributions
lder-0.072
zed-0.065
opes-0.064
 endings-0.063
 Majesty-0.061
Token at
Token ID379
Feature activation-3.38e-02

Pos logit contributions
sed+0.792
opes+0.747
lder+0.710
ses+0.698
 Majesty+0.685
Neg logit contributions
ak-0.534
He-0.439
he-0.412
 upon-0.386
 there-0.384
Token Ben
Token ID3932
Feature activation-3.75e-01

Pos logit contributions
sed+0.004
opes+0.004
lder+0.004
 Majesty+0.004
ses+0.004
Neg logit contributions
ak-0.002
He-0.002
he-0.002
 there-0.002
ill-0.002
Token and
Token ID290
Feature activation-1.80e-01

Pos logit contributions
sed+0.044
opes+0.042
lder+0.040
ses+0.039
 Majesty+0.039
Neg logit contributions
ak-0.029
He-0.024
he-0.022
 there-0.022
 upon-0.021
Token Lily
Token ID20037
Feature activation-8.96e-01

Pos logit contributions
opes+0.021
lder+0.020
sed+0.019
 endings+0.019
 Majesty+0.019
Neg logit contributions
ak-0.013
He-0.012
 there-0.011
 like-0.011
 of-0.010
Token.
Token ID13
Feature activation-4.95e-01

Pos logit contributions
sed+0.107
opes+0.102
lder+0.098
ses+0.095
 Majesty+0.095
Neg logit contributions
ak-0.068
He-0.057
 there-0.050
he-0.050
 upon-0.049
Token strangers
Token ID18981
Feature activation+4.30e-01

Pos logit contributions
ak+0.141
 there+0.136
 like+0.130
 of+0.129
He+0.114
Neg logit contributions
sed-0.314
opes-0.309
lder-0.308
 Majesty-0.299
ses-0.281
Token.
Token ID13
Feature activation+9.79e-02

Pos logit contributions
ak+0.037
He+0.031
aky+0.028
ill+0.027
he+0.027
Neg logit contributions
sed-0.046
opes-0.043
lder-0.042
ses-0.041
 Majesty-0.039
Token The
Token ID383
Feature activation-9.59e-01

Pos logit contributions
ak+0.009
He+0.008
aky+0.008
he+0.007
 upon+0.007
Neg logit contributions
sed-0.010
opes-0.009
lder-0.009
ses-0.009
 Majesty-0.008
Token cat
Token ID3797
Feature activation-1.74e+00

Pos logit contributions
opes+0.089
lder+0.087
 Majesty+0.086
sed+0.084
iously+0.083
Neg logit contributions
ak-0.096
 there-0.083
He-0.082
 like-0.079
 of-0.079
Token his
Token ID465
Feature activation+1.06e+00

Pos logit contributions
sed+0.183
opes+0.182
lder+0.178
 Majesty+0.172
 endings+0.167
Neg logit contributions
ak-0.141
He-0.125
 like-0.121
 there-0.118
 closer-0.110
Tokenses
Token ID8448
Feature activation-6.81e+00

Pos logit contributions
ak+0.159
 like+0.141
He+0.137
 there+0.136
 upon+0.134
Neg logit contributions
lder-0.044
opes-0.040
 endings-0.034
 Majesty-0.033
zed-0.030
Token at
Token ID379
Feature activation+2.13e-01

Pos logit contributions
sed+0.868
opes+0.814
ses+0.777
lder+0.775
 Majesty+0.746
Neg logit contributions
ak-0.458
He-0.369
he-0.351
 upon-0.323
ill-0.321
Token Tom
Token ID4186
Feature activation+8.14e-01

Pos logit contributions
ak+0.017
He+0.014
 there+0.013
he+0.013
 like+0.013
Neg logit contributions
sed-0.025
opes-0.023
lder-0.023
 Majesty-0.022
ses-0.022
Token and
Token ID290
Feature activation+2.54e-01

Pos logit contributions
ak+0.063
He+0.053
he+0.048
fing+0.046
ill+0.046
Neg logit contributions
sed-0.094
opes-0.089
lder-0.085
ses-0.084
 Majesty-0.081
Token Sue
Token ID26089
Feature activation+1.12e+00

Pos logit contributions
ak+0.017
He+0.017
 like+0.017
 there+0.016
 closer+0.016
Neg logit contributions
opes-0.028
lder-0.027
 endings-0.027
 Majesty-0.026
sed-0.026
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
ak+0.087
He+0.073
he+0.065
 there+0.064
aky+0.064
Neg logit contributions
sed-0.132
opes-0.124
lder-0.119
ses-0.118
 Majesty-0.115
Token a
Token ID257
Feature activation-9.57e-02

Pos logit contributions
ak+0.157
 there+0.149
 of+0.143
He+0.141
 like+0.140
Neg logit contributions
sed-0.308
opes-0.299
 Majesty-0.285
lder-0.285
ses-0.277
Token cat
Token ID3797
Feature activation-1.51e+00

Pos logit contributions
sed+0.009
opes+0.008
lder+0.008
 Majesty+0.007
ses+0.007
Neg logit contributions
ak-0.010
He-0.009
 there-0.008
 like-0.008
he-0.008
Token.
Token ID13
Feature activation+4.22e-01

Pos logit contributions
sed+0.161
opes+0.147
lder+0.144
ses+0.142
 Majesty+0.135
Neg logit contributions
ak-0.128
He-0.110
 upon-0.098
he-0.097
ill-0.096
Token She
Token ID1375
Feature activation-1.70e-01

Pos logit contributions
ak+0.037
He+0.032
 there+0.029
he+0.029
aky+0.028
Neg logit contributions
sed-0.046
opes-0.043
lder-0.041
ses-0.040
 Majesty-0.040
Token his
Token ID465
Feature activation+1.09e+00

Pos logit contributions
sed+0.016
lder+0.016
opes+0.016
 endings+0.015
 Majesty+0.015
Neg logit contributions
ak-0.015
He-0.014
 upon-0.013
 there-0.013
 like-0.013
Tokenses
Token ID8448
Feature activation-6.80e+00

Pos logit contributions
ak+0.133
 like+0.123
 there+0.119
 close+0.117
 upon+0.115
Neg logit contributions
opes-0.068
lder-0.067
 endings-0.063
 Majesty-0.061
zed-0.058
Token and
Token ID290
Feature activation+3.53e-01

Pos logit contributions
sed+0.900
opes+0.856
lder+0.827
ses+0.810
 Majesty+0.802
Neg logit contributions
ak-0.431
He-0.346
he-0.303
 there-0.297
 upon-0.286
Token scratches
Token ID37627
Feature activation-1.27e+00

Pos logit contributions
ak+0.031
He+0.028
 like+0.026
 there+0.026
 upon+0.024
Neg logit contributions
sed-0.034
lder-0.034
opes-0.034
 endings-0.032
 Majesty-0.032
Token at
Token ID379
Feature activation+2.08e-01

Pos logit contributions
sed+0.136
opes+0.128
lder+0.121
ses+0.120
 Majesty+0.112
Neg logit contributions
ak-0.112
He-0.092
aky-0.086
fing-0.085
he-0.084
Token the
Token ID262
Feature activation+3.09e-01

Pos logit contributions
ak+0.015
He+0.013
 there+0.012
he+0.011
 like+0.011
Neg logit contributions
sed-0.025
opes-0.024
lder-0.023
 Majesty-0.023
ses-0.023
Token dog
Token ID3290
Feature activation-7.13e-01

Pos logit contributions
ak+0.032
He+0.029
 there+0.026
he+0.026
 like+0.026
Neg logit contributions
sed-0.028
opes-0.027
lder-0.026
 Majesty-0.025
ses-0.024
Tokenked
Token ID9091
Feature activation-2.84e+00

Pos logit contributions
lder+0.138
sed+0.135
icol+0.131
 Majesty+0.130
opes+0.130
Neg logit contributions
 there-0.053
ak-0.050
He-0.043
 of-0.043
 like-0.042
Token tongue
Token ID11880
Feature activation-5.58e-01

Pos logit contributions
sed+0.262
opes+0.246
lder+0.239
 Majesty+0.228
ses+0.225
Neg logit contributions
ak-0.293
He-0.257
 there-0.240
 like-0.234
 upon-0.230
Token.
Token ID13
Feature activation+5.50e-01

Pos logit contributions
sed+0.061
opes+0.059
lder+0.055
ses+0.054
 Majesty+0.052
Neg logit contributions
ak-0.046
He-0.039
he-0.036
aky-0.035
 upon-0.035
Token It
Token ID632
Feature activation-6.22e-02

Pos logit contributions
ak+0.042
He+0.039
 there+0.036
 of+0.033
he+0.033
Neg logit contributions
sed-0.064
opes-0.061
lder-0.058
 Majesty-0.058
ses-0.057
Token his
Token ID465
Feature activation+1.18e+00

Pos logit contributions
sed+0.006
opes+0.006
lder+0.006
 Majesty+0.006
 endings+0.006
Neg logit contributions
ak-0.006
He-0.005
 like-0.005
 there-0.005
 upon-0.005
Tokenses
Token ID8448
Feature activation-6.79e+00

Pos logit contributions
ak+0.182
He+0.162
 there+0.158
 like+0.158
 upon+0.156
Neg logit contributions
lder-0.042
opes-0.039
 Majesty-0.034
 endings-0.031
sed-0.029
Token and
Token ID290
Feature activation+4.07e-01

Pos logit contributions
sed+0.935
opes+0.896
lder+0.873
 Majesty+0.857
ses+0.848
Neg logit contributions
ak-0.395
He-0.316
 there-0.271
 like-0.262
 of-0.248
Token swim
Token ID9422
Feature activation+5.87e-01

Pos logit contributions
ak+0.034
He+0.031
 there+0.029
 like+0.029
 upon+0.026
Neg logit contributions
lder-0.041
opes-0.040
 endings-0.039
 Majesty-0.039
sed-0.039
Tokens
Token ID82
Feature activation-8.28e-01

Pos logit contributions
ak+0.046
He+0.038
he+0.036
ill+0.035
aky+0.034
Neg logit contributions
sed-0.070
opes-0.064
ses-0.061
lder-0.061
 Majesty-0.058
Token near
Token ID1474
Feature activation-1.75e-01

Pos logit contributions
sed+0.105
opes+0.099
lder+0.096
ses+0.094
 Majesty+0.092
Neg logit contributions
ak-0.057
He-0.046
he-0.041
aky-0.040
 there-0.039
Token the
Token ID262
Feature activation+4.61e-01

Pos logit contributions
sed+0.023
opes+0.022
lder+0.022
 Majesty+0.021
ses+0.021
Neg logit contributions
ak-0.011
He-0.009
 there-0.008
 like-0.008
he-0.008
Token
Token ID198
Feature activation+2.82e-01

Pos logit contributions
ak+0.004
He+0.003
aky+0.003
 there+0.003
 like+0.003
Neg logit contributions
sed-0.007
opes-0.007
lder-0.006
 Majesty-0.006
ses-0.006
Token
Token ID198
Feature activation+3.38e-01

Pos logit contributions
ak+0.024
aky+0.020
 like+0.019
 there+0.019
ill+0.018
Neg logit contributions
lder-0.026
sed-0.025
opes-0.025
zed-0.024
 Majesty-0.024
TokenThe
Token ID464
Feature activation+1.41e-01

Pos logit contributions
ak+0.026
 like+0.020
aky+0.020
 there+0.020
 seen+0.019
Neg logit contributions
lder-0.033
zed-0.032
opes-0.030
 Majesty-0.028
llers-0.028
Token man
Token ID582
Feature activation+1.83e-01

Pos logit contributions
ak+0.014
He+0.012
ill+0.011
he+0.011
 there+0.011
Neg logit contributions
sed-0.014
opes-0.013
lder-0.012
ses-0.012
 Majesty-0.012
Token helped
Token ID4193
Feature activation-4.85e-01

Pos logit contributions
ak+0.015
He+0.013
he+0.012
aky+0.011
 there+0.011
Neg logit contributions
sed-0.020
opes-0.019
lder-0.018
 Majesty-0.018
ses-0.018
Token Mike
Token ID4995
Feature activation+2.15e+00

Pos logit contributions
sed+0.050
opes+0.048
lder+0.047
 Majesty+0.044
ses+0.043
Neg logit contributions
ak-0.044
He-0.037
aky-0.034
 there-0.032
he-0.031
Token find
Token ID1064
Feature activation+7.11e-01

Pos logit contributions
ak+0.168
He+0.142
aky+0.127
he+0.123
 there+0.122
Neg logit contributions
sed-0.236
opes-0.229
lder-0.224
 Majesty-0.216
ses-0.207
Token his
Token ID465
Feature activation+1.21e+00

Pos logit contributions
ak+0.055
He+0.046
he+0.042
aky+0.041
ill+0.040
Neg logit contributions
sed-0.082
opes-0.079
 Majesty-0.074
lder-0.074
ses-0.073
Token way
Token ID835
Feature activation+1.30e-01

Pos logit contributions
ak+0.131
He+0.115
 there+0.111
 upon+0.108
 like+0.107
Neg logit contributions
sed-0.104
opes-0.099
lder-0.096
ses-0.088
 Majesty-0.088
Token back
Token ID736
Feature activation+1.05e+00

Pos logit contributions
ak+0.013
aky+0.011
He+0.011
he+0.011
fing+0.011
Neg logit contributions
sed-0.011
opes-0.010
yard-0.009
ses-0.009
 Majesty-0.009
Token home
Token ID1363
Feature activation+9.85e-01

Pos logit contributions
ak+0.109
He+0.098
aky+0.091
he+0.089
fing+0.086
Neg logit contributions
sed-0.091
opes-0.084
 Majesty-0.078
ses-0.077
lder-0.076
Token�
Token ID241
Feature activation+1.04e+00

Pos logit contributions
ak+0.002
He+0.001
 like+0.001
he+0.001
 there+0.001
Neg logit contributions
sed-0.001
 endings-0.001
opes-0.001
ses-0.001
 Majesty-0.001
TokenThat
Token ID2504
Feature activation-1.40e+00

Pos logit contributions
ak+0.084
aky+0.065
ill+0.063
 closer+0.060
fing+0.060
Neg logit contributions
sed-0.108
lder-0.106
opes-0.101
ses-0.097
 endings-0.095
Token's
Token ID338
Feature activation+9.22e-02

Pos logit contributions
sed+0.162
opes+0.154
lder+0.145
ses+0.144
 Majesty+0.142
Neg logit contributions
ak-0.113
He-0.095
 there-0.088
 like-0.085
he-0.085
Token right
Token ID826
Feature activation+1.20e+00

Pos logit contributions
ak+0.007
He+0.006
he+0.006
ill+0.005
aky+0.005
Neg logit contributions
sed-0.011
opes-0.010
lder-0.010
ses-0.010
 Majesty-0.010
Token!
Token ID0
Feature activation+4.04e-01

Pos logit contributions
ak+0.102
He+0.092
he+0.085
aky+0.078
ill+0.078
Neg logit contributions
sed-0.133
 Majesty-0.120
opes-0.120
lder-0.120
ses-0.115
Tokenâ
Token ID22940
Feature activation+2.41e+00

Pos logit contributions
ak+0.043
he+0.036
He+0.036
aky+0.035
ill+0.034
Neg logit contributions
sed-0.033
opes-0.031
lder-0.030
 Majesty-0.029
ses-0.028
Token€
Token ID26391
Feature activation+8.49e-01

Pos logit contributions
ak+0.238
He+0.217
 upon+0.203
 there+0.201
ill+0.195
Neg logit contributions
sed-0.219
lder-0.207
opes-0.206
ses-0.191
 Majesty-0.186
Token he
Token ID339
Feature activation+1.77e+00

Pos logit contributions
ak+0.079
He+0.069
 there+0.064
he+0.062
 like+0.061
Neg logit contributions
sed-0.087
opes-0.082
lder-0.077
ses-0.076
 Majesty-0.075
Token said
Token ID531
Feature activation+2.52e-01

Pos logit contributions
ak+0.139
He+0.111
 like+0.111
 there+0.106
 upon+0.100
Neg logit contributions
sed-0.203
opes-0.200
 Majesty-0.187
ses-0.184
lder-0.183
Token.
Token ID13
Feature activation+7.13e-01

Pos logit contributions
ak+0.021
He+0.017
he+0.016
aky+0.016
ill+0.016
Neg logit contributions
sed-0.029
opes-0.027
lder-0.027
 Majesty-0.026
ses-0.025
Token
Token ID198
Feature activation+6.47e-01

Pos logit contributions
ak+0.069
he+0.054
He+0.054
aky+0.054
ill+0.052
Neg logit contributions
sed-0.068
opes-0.064
lder-0.063
 Majesty-0.062
ses-0.058
Token liked
Token ID8288
Feature activation+1.07e-01

Pos logit contributions
sed+0.008
opes+0.008
lder+0.008
 Majesty+0.008
ses+0.007
Neg logit contributions
ak-0.005
He-0.004
 there-0.004
 like-0.004
 upon-0.004
Token to
Token ID284
Feature activation-1.79e-01

Pos logit contributions
 there+0.005
ak+0.005
 like+0.004
 of+0.004
He+0.004
Neg logit contributions
sed-0.015
opes-0.015
lder-0.015
 Majesty-0.014
ses-0.014
Token hear
Token ID3285
Feature activation+1.72e+00

Pos logit contributions
sed+0.021
opes+0.020
lder+0.019
ses+0.019
 Majesty+0.018
Neg logit contributions
ak-0.014
He-0.012
 there-0.011
he-0.011
 upon-0.010
Token them
Token ID606
Feature activation-3.18e-01

Pos logit contributions
ak+0.101
 there+0.097
 like+0.095
 of+0.095
He+0.091
Neg logit contributions
sed-0.229
ses-0.208
opes-0.207
lder-0.207
 Majesty-0.197
Token cro
Token ID6763
Feature activation-1.34e+00

Pos logit contributions
sed+0.037
opes+0.035
lder+0.034
ses+0.033
 Majesty+0.033
Neg logit contributions
ak-0.025
He-0.021
 there-0.019
he-0.019
 like-0.019
Tokenak
Token ID461
Feature activation+3.42e+00

Pos logit contributions
opes+0.166
lder+0.156
sed+0.151
endo+0.146
 endings+0.143
Neg logit contributions
He-0.062
 there-0.050
 like-0.043
 helmet-0.040
 upon-0.040
Token and
Token ID290
Feature activation+3.94e-01

Pos logit contributions
ak+0.204
He+0.164
 there+0.143
 like+0.141
he+0.140
Neg logit contributions
sed-0.467
opes-0.439
lder-0.428
ses-0.423
 Majesty-0.422
Token jump
Token ID4391
Feature activation+9.97e-01

Pos logit contributions
ak+0.028
He+0.024
 there+0.022
 like+0.021
 upon+0.020
Neg logit contributions
sed-0.048
opes-0.046
lder-0.045
ses-0.043
 Majesty-0.043
Token.
Token ID13
Feature activation+4.84e-01

Pos logit contributions
ak+0.057
He+0.045
 there+0.038
he+0.037
 like+0.036
Neg logit contributions
sed-0.138
opes-0.132
lder-0.128
ses-0.126
 Majesty-0.125
Token
Token ID198
Feature activation+4.60e-01

Pos logit contributions
ak+0.029
He+0.027
 there+0.023
 upon+0.023
 like+0.022
Neg logit contributions
sed-0.065
opes-0.062
lder-0.061
 Majesty-0.059
ses-0.058
Token
Token ID198
Feature activation+4.95e-01

Pos logit contributions
ak+0.036
He+0.029
aky+0.028
fing+0.027
ill+0.027
Neg logit contributions
sed-0.048
lder-0.047
opes-0.047
ses-0.042
 Majesty-0.041
Token helped
Token ID4193
Feature activation-9.85e-01

Pos logit contributions
sed+0.030
opes+0.029
lder+0.028
 Majesty+0.027
ses+0.026
Neg logit contributions
ak-0.024
He-0.021
he-0.019
 upon-0.019
ill-0.019
Token restore
Token ID11169
Feature activation-1.10e+00

Pos logit contributions
sed+0.113
opes+0.107
lder+0.103
ses+0.100
 Majesty+0.099
Neg logit contributions
ak-0.080
He-0.068
 there-0.061
he-0.060
 upon-0.059
Token the
Token ID262
Feature activation-8.78e-02

Pos logit contributions
sed+0.133
opes+0.125
lder+0.121
ses+0.118
 Majesty+0.117
Neg logit contributions
ak-0.082
He-0.069
 there-0.062
he-0.061
 like-0.060
Token<|endoftext|>
Token ID50256
Feature activation+1.96e-01

Pos logit contributions
opes+0.009
sed+0.009
lder+0.009
 Majesty+0.008
ses+0.008
Neg logit contributions
ak-0.008
 there-0.007
He-0.006
 upon-0.006
 like-0.006
TokenOnce
Token ID7454
Feature activation+1.49e+00

Pos logit contributions
ak+0.018
He+0.015
aky+0.014
fing+0.014
 like+0.013
Neg logit contributions
sed-0.019
opes-0.018
lder-0.017
 endings-0.016
ses-0.016
Token upon
Token ID2402
Feature activation+3.43e+00

Pos logit contributions
ak+0.059
He+0.038
aky+0.028
fing+0.025
 like+0.023
Neg logit contributions
sed-0.226
opes-0.218
lder-0.210
ses-0.205
 Majesty-0.205
Token a
Token ID257
Feature activation+3.35e-01

Pos logit contributions
ak+0.305
fing+0.243
aky+0.237
ill+0.234
He+0.232
Neg logit contributions
sed-0.326
opes-0.320
lder-0.312
 endings-0.297
zed-0.289
Token time
Token ID640
Feature activation+6.31e-01

Pos logit contributions
ak+0.026
He+0.022
 there+0.020
he+0.019
 like+0.019
Neg logit contributions
sed-0.040
opes-0.039
lder-0.036
ses-0.036
 Majesty-0.036
Token there
Token ID612
Feature activation+1.42e+00

Pos logit contributions
ak+0.048
He+0.039
he+0.034
aky+0.034
ill+0.033
Neg logit contributions
sed-0.076
opes-0.072
lder-0.069
ses-0.067
 Majesty-0.067
Token was
Token ID373
Feature activation-8.18e-01

Pos logit contributions
ak+0.132
He+0.113
aky+0.102
he+0.101
 there+0.101
Neg logit contributions
sed-0.144
opes-0.136
lder-0.131
ses-0.125
 Majesty-0.124
Token a
Token ID257
Feature activation+3.02e-01

Pos logit contributions
sed+0.108
opes+0.103
lder+0.100
ses+0.097
 Majesty+0.097
Neg logit contributions
ak-0.052
He-0.042
 there-0.037
he-0.036
 like-0.035
Token,
Token ID11
Feature activation-6.27e-02

Pos logit contributions
sed+0.112
opes+0.107
lder+0.103
ses+0.100
 Majesty+0.100
Neg logit contributions
ak-0.056
He-0.047
 there-0.042
he-0.040
 like-0.040
Token worried
Token ID7960
Feature activation+2.01e+00

Pos logit contributions
sed+0.008
opes+0.008
lder+0.007
ses+0.007
 Majesty+0.007
Neg logit contributions
ak-0.004
He-0.004
 there-0.003
he-0.003
 like-0.003
Token.
Token ID13
Feature activation+3.03e-01

Pos logit contributions
ak+0.138
He+0.119
 there+0.114
 like+0.112
 of+0.109
Neg logit contributions
sed-0.252
opes-0.235
lder-0.233
ses-0.227
 Majesty-0.226
Token
Token ID198
Feature activation+3.39e-01

Pos logit contributions
He+0.019
ak+0.019
 there+0.019
 upon+0.018
 like+0.016
Neg logit contributions
sed-0.038
opes-0.037
ses-0.034
lder-0.034
 Majesty-0.033
Token
Token ID198
Feature activation+3.93e-01

Pos logit contributions
ak+0.023
He+0.019
 there+0.016
he+0.016
 like+0.016
Neg logit contributions
sed-0.042
opes-0.041
lder-0.039
 Majesty-0.038
ses-0.038
Tokenâ
Token ID22940
Feature activation+2.35e+00

Pos logit contributions
ak+0.031
He+0.026
he+0.023
 there+0.023
 like+0.023
Neg logit contributions
sed-0.044
opes-0.043
lder-0.042
 Majesty-0.040
ses-0.039
Token€
Token ID26391
Feature activation+5.55e-01

Pos logit contributions
ak+0.226
He+0.196
he+0.181
 there+0.179
 like+0.176
Neg logit contributions
sed-0.235
opes-0.221
lder-0.209
ses-0.204
 Majesty-0.203
Token�
Token ID129
Feature activation-1.11e-01

Pos logit contributions
ak+0.051
He+0.043
 there+0.043
 upon+0.042
he+0.040
Neg logit contributions
sed-0.058
opes-0.054
ses-0.052
lder-0.050
 Majesty-0.049
Token�
Token ID241
Feature activation+8.50e-01

Pos logit contributions
sed+0.010
ses+0.008
 endings+0.008
opes+0.008
 Majesty+0.008
Neg logit contributions
ak-0.012
He-0.010
 like-0.010
he-0.010
 there-0.009
TokenHey
Token ID10814
Feature activation+3.07e-01

Pos logit contributions
ak+0.069
He+0.053
ill+0.052
aky+0.051
he+0.050
Neg logit contributions
sed-0.091
lder-0.087
opes-0.087
ses-0.081
 endings-0.080
Token,
Token ID11
Feature activation+1.16e-01

Pos logit contributions
ak+0.024
He+0.020
he+0.018
aky+0.017
ill+0.017
Neg logit contributions
sed-0.036
opes-0.034
lder-0.033
 Majesty-0.033
ses-0.032
Tokenked
Token ID9091
Feature activation-2.93e+00

Pos logit contributions
lder+0.126
sed+0.124
 Majesty+0.115
icol+0.115
opes+0.113
Neg logit contributions
ak-0.093
 there-0.084
 of-0.078
He-0.076
 like-0.075
Token tongue
Token ID11880
Feature activation-7.66e-01

Pos logit contributions
sed+0.284
opes+0.262
lder+0.257
 Majesty+0.246
ses+0.245
Neg logit contributions
ak-0.290
He-0.252
 there-0.236
 like-0.232
 of-0.225
Token.
Token ID13
Feature activation-1.21e-01

Pos logit contributions
sed+0.087
opes+0.084
lder+0.079
ses+0.077
 Majesty+0.076
Neg logit contributions
ak-0.061
He-0.052
he-0.047
aky-0.045
 there-0.045
Token Lily
Token ID20037
Feature activation-6.96e-01

Pos logit contributions
sed+0.014
opes+0.013
lder+0.013
 Majesty+0.012
ses+0.012
Neg logit contributions
ak-0.010
He-0.009
 there-0.008
he-0.008
 like-0.007
Token is
Token ID318
Feature activation-7.90e-01

Pos logit contributions
sed+0.071
opes+0.065
lder+0.064
 endings+0.063
 Majesty+0.062
Neg logit contributions
ak-0.062
He-0.057
 like-0.054
 there-0.051
 upon-0.051
Token curious
Token ID11040
Feature activation+5.80e-01

Pos logit contributions
sed+0.089
opes+0.082
ses+0.078
lder+0.077
 Majesty+0.076
Neg logit contributions
ak-0.067
He-0.057
he-0.054
ill-0.051
 upon-0.050
Token.
Token ID13
Feature activation-1.49e-01

Pos logit contributions
ak+0.045
He+0.037
he+0.034
aky+0.033
 upon+0.032
Neg logit contributions
sed-0.067
opes-0.064
lder-0.061
ses-0.060
 Majesty-0.060
Token She
Token ID1375
Feature activation-4.80e-01

Pos logit contributions
sed+0.017
opes+0.016
lder+0.016
 Majesty+0.015
ses+0.015
Neg logit contributions
ak-0.012
He-0.011
 there-0.010
 like-0.009
he-0.009
Token touches
Token ID18105
Feature activation-1.65e-01

Pos logit contributions
sed+0.045
opes+0.043
lder+0.043
 Majesty+0.040
ses+0.039
Neg logit contributions
ak-0.046
He-0.042
 like-0.039
 there-0.038
 upon-0.038
Token the
Token ID262
Feature activation+1.81e-01

Pos logit contributions
sed+0.019
opes+0.018
lder+0.017
ses+0.017
 Majesty+0.016
Neg logit contributions
ak-0.013
He-0.011
aky-0.010
he-0.010
ill-0.010
Token snake
Token ID17522
Feature activation-1.47e+00

Pos logit contributions
ak+0.018
He+0.016
 there+0.015
he+0.015
 like+0.015
Neg logit contributions
sed-0.016
opes-0.016
lder-0.015
 Majesty-0.015
 endings-0.014
TokenThe
Token ID464
Feature activation-9.95e-02

Pos logit contributions
ak+0.002
 like+0.002
 there+0.001
 closer+0.001
 seen+0.001
Neg logit contributions
opes-0.002
lder-0.002
 endings-0.002
 Majesty-0.002
llers-0.002
Token window
Token ID4324
Feature activation+5.14e-01

Pos logit contributions
opes+0.011
sed+0.011
 endings+0.010
storms+0.010
ors+0.010
Neg logit contributions
ak-0.008
 there-0.007
 of-0.007
He-0.007
 upon-0.007
Token was
Token ID373
Feature activation-1.19e+00

Pos logit contributions
ak+0.042
He+0.036
 there+0.032
he+0.032
 upon+0.031
Neg logit contributions
sed-0.058
opes-0.055
lder-0.053
ses-0.051
 Majesty-0.051
Token opened
Token ID4721
Feature activation+1.45e-01

Pos logit contributions
sed+0.138
opes+0.127
lder+0.121
ses+0.120
 Majesty+0.116
Neg logit contributions
ak-0.092
He-0.079
he-0.070
fing-0.069
ill-0.069
Token and
Token ID290
Feature activation+2.55e-01

Pos logit contributions
ak+0.012
He+0.010
aky+0.010
fing+0.009
he+0.009
Neg logit contributions
sed-0.015
opes-0.015
lder-0.014
 Majesty-0.014
iously-0.013
Token the
Token ID262
Feature activation+2.91e-01

Pos logit contributions
ak+0.023
He+0.019
aky+0.017
he+0.016
fing+0.016
Neg logit contributions
sed-0.027
opes-0.025
lder-0.024
 Majesty-0.024
ses-0.023
Token two
Token ID734
Feature activation-1.69e-01

Pos logit contributions
ak+0.023
 there+0.020
 of+0.020
He+0.020
 upon+0.019
Neg logit contributions
sed-0.032
opes-0.031
lder-0.030
 Majesty-0.029
ses-0.028
Token friends
Token ID2460
Feature activation+3.54e-01

Pos logit contributions
sed+0.022
opes+0.021
lder+0.021
ses+0.020
 Majesty+0.020
Neg logit contributions
ak-0.011
He-0.009
 there-0.008
he-0.008
 of-0.007
Token could
Token ID714
Feature activation-2.94e-01

Pos logit contributions
ak+0.029
He+0.025
he+0.022
 there+0.022
aky+0.021
Neg logit contributions
sed-0.040
opes-0.038
lder-0.037
ses-0.035
 Majesty-0.035
Token finally
Token ID3443
Feature activation-1.97e-01

Pos logit contributions
sed+0.040
opes+0.038
lder+0.037
ses+0.036
 Majesty+0.036
Neg logit contributions
ak-0.018
He-0.014
aky-0.012
he-0.011
fing-0.011
Token go
Token ID467
Feature activation+6.84e-01

Pos logit contributions
sed+0.024
lder+0.023
opes+0.023
 Majesty+0.022
ses+0.022
Neg logit contributions
ak-0.013
He-0.011
aky-0.010
 upon-0.009
m-0.009
Token small
Token ID1402
Feature activation-1.11e+00

Pos logit contributions
ak+0.012
He+0.010
 there+0.009
he+0.009
 like+0.009
Neg logit contributions
sed-0.015
opes-0.014
lder-0.013
ses-0.013
 Majesty-0.013
Token bird
Token ID6512
Feature activation-5.55e-01

Pos logit contributions
sed+0.113
opes+0.106
lder+0.101
ses+0.098
 Majesty+0.098
Neg logit contributions
ak-0.104
He-0.090
 there-0.083
he-0.082
 like-0.080
Token who
Token ID508
Feature activation+9.87e-01

Pos logit contributions
sed+0.064
opes+0.060
lder+0.059
ses+0.056
 Majesty+0.056
Neg logit contributions
ak-0.043
He-0.037
he-0.032
ill-0.032
 upon-0.032
Token asked
Token ID1965
Feature activation+1.28e-01

Pos logit contributions
ak+0.082
He+0.069
 there+0.064
he+0.062
 like+0.061
Neg logit contributions
sed-0.112
opes-0.106
lder-0.101
 Majesty-0.100
ses-0.099
Token for
Token ID329
Feature activation-1.00e+00

Pos logit contributions
ak+0.009
 there+0.008
He+0.008
 like+0.008
 of+0.008
Neg logit contributions
sed-0.015
opes-0.014
lder-0.014
ses-0.014
 Majesty-0.014
Token help
Token ID1037
Feature activation+1.22e-01

Pos logit contributions
sed+0.124
opes+0.118
lder+0.116
ses+0.111
 Majesty+0.109
Neg logit contributions
ak-0.075
He-0.062
 there-0.061
 of-0.057
 like-0.057
Token.
Token ID13
Feature activation-1.37e-01

Pos logit contributions
ak+0.008
He+0.007
 there+0.006
 like+0.006
 of+0.006
Neg logit contributions
sed-0.016
opes-0.015
lder-0.014
ses-0.014
 Majesty-0.014
Token
Token ID198
Feature activation-1.06e-01

Pos logit contributions
sed+0.016
opes+0.015
lder+0.014
ses+0.014
 Majesty+0.014
Neg logit contributions
ak-0.011
He-0.010
 there-0.009
ill-0.008
 upon-0.008
Token
Token ID198
Feature activation-6.26e-02

Pos logit contributions
sed+0.013
opes+0.012
lder+0.012
 Majesty+0.011
ses+0.011
Neg logit contributions
ak-0.007
He-0.006
 like-0.005
 there-0.005
he-0.005
TokenThe
Token ID464
Feature activation+1.45e-01

Pos logit contributions
sed+0.007
lder+0.007
opes+0.007
 Majesty+0.007
ses+0.007
Neg logit contributions
ak-0.004
He-0.003
 like-0.003
 there-0.003
he-0.003
Token independent
Token ID4795
Feature activation+3.11e-01

Pos logit contributions
ak+0.012
He+0.010
 there+0.010
 of+0.009
 closer+0.009
Neg logit contributions
opes-0.016
sed-0.016
lder-0.015
zed-0.014
llers-0.014
Token noticed
Token ID6810
Feature activation+3.88e-01

Pos logit contributions
ak+0.012
He+0.010
he+0.010
fing+0.009
ill+0.009
Neg logit contributions
sed-0.018
opes-0.017
lder-0.016
 Majesty-0.016
ses-0.015
Token the
Token ID262
Feature activation+6.30e-01

Pos logit contributions
ak+0.024
He+0.020
 there+0.019
 like+0.017
 of+0.017
Neg logit contributions
sed-0.052
opes-0.049
lder-0.048
ses-0.047
 Majesty-0.046
Token tall
Token ID7331
Feature activation-6.33e-01

Pos logit contributions
ak+0.068
He+0.060
 there+0.056
he+0.055
 upon+0.054
Neg logit contributions
sed-0.056
opes-0.052
lder-0.049
ses-0.047
 Majesty-0.047
Token stack
Token ID8931
Feature activation+4.37e-01

Pos logit contributions
sed+0.060
opes+0.056
lder+0.054
ses+0.053
 Majesty+0.052
Neg logit contributions
ak-0.064
He-0.056
 there-0.053
 of-0.052
 like-0.052
Token of
Token ID286
Feature activation+1.00e+00

Pos logit contributions
ak+0.016
He+0.011
 there+0.007
he+0.007
aky+0.007
Neg logit contributions
sed-0.068
opes-0.066
lder-0.064
ses-0.062
 Majesty-0.061
Token blocks
Token ID7021
Feature activation-2.22e-01

Pos logit contributions
ak+0.108
He+0.096
he+0.089
ill+0.088
aky+0.087
Neg logit contributions
sed-0.085
opes-0.081
ses-0.073
lder-0.072
 Majesty-0.071
Token that
Token ID326
Feature activation-1.69e+00

Pos logit contributions
sed+0.026
opes+0.025
lder+0.023
ses+0.023
yard+0.022
Neg logit contributions
ak-0.016
He-0.013
aky-0.011
fing-0.011
 upon-0.010
Token she
Token ID673
Feature activation+1.99e-01

Pos logit contributions
sed+0.198
opes+0.188
lder+0.181
ses+0.176
 Majesty+0.175
Neg logit contributions
ak-0.133
He-0.112
 there-0.100
he-0.099
 upon-0.096
Token had
Token ID550
Feature activation+3.14e-01

Pos logit contributions
ak+0.015
He+0.013
 there+0.012
he+0.012
ill+0.011
Neg logit contributions
sed-0.023
opes-0.022
lder-0.021
ses-0.021
 Majesty-0.021
Token made
Token ID925
Feature activation+4.36e-01

Pos logit contributions
ak+0.027
He+0.023
 there+0.021
he+0.020
 like+0.020
Neg logit contributions
sed-0.035
opes-0.033
lder-0.032
ses-0.031
 Majesty-0.031
Token earlier
Token ID2961
Feature activation-1.27e+00

Pos logit contributions
ak+0.029
He+0.024
 there+0.020
he+0.020
aky+0.020
Neg logit contributions
sed-0.056
opes-0.053
lder-0.051
ses-0.050
 Majesty-0.050
Token was
Token ID373
Feature activation-8.47e-01

Pos logit contributions
ak+0.013
He+0.011
he+0.010
 there+0.010
aky+0.010
Neg logit contributions
sed-0.016
opes-0.015
lder-0.014
ses-0.014
 Majesty-0.014
Token a
Token ID257
Feature activation+1.66e-01

Pos logit contributions
sed+0.103
opes+0.096
ses+0.091
lder+0.091
 Majesty+0.088
Neg logit contributions
ak-0.064
He-0.052
he-0.048
aky-0.047
fing-0.045
Token baby
Token ID5156
Feature activation-1.04e-01

Pos logit contributions
ak+0.014
He+0.012
he+0.011
 there+0.011
ill+0.011
Neg logit contributions
sed-0.018
opes-0.017
lder-0.016
ses-0.016
 Majesty-0.016
Token donkey
Token ID50085
Feature activation-8.14e-01

Pos logit contributions
sed+0.010
opes+0.010
lder+0.009
ses+0.009
 Majesty+0.009
Neg logit contributions
ak-0.010
He-0.009
he-0.008
 there-0.008
 upon-0.008
Token!
Token ID0
Feature activation+1.81e-01

Pos logit contributions
sed+0.093
opes+0.087
lder+0.084
ses+0.081
 Majesty+0.080
Neg logit contributions
ak-0.065
He-0.055
he-0.050
aky-0.048
 upon-0.047
Token The
Token ID383
Feature activation-5.17e-01

Pos logit contributions
ak+0.016
He+0.014
 there+0.012
he+0.012
 upon+0.012
Neg logit contributions
sed-0.020
opes-0.019
lder-0.018
ses-0.017
 Majesty-0.017
Token m
Token ID285
Feature activation+3.33e-01

Pos logit contributions
sed+0.048
opes+0.045
lder+0.042
ses+0.041
 Majesty+0.041
Neg logit contributions
ak-0.053
He-0.048
 there-0.044
he-0.043
 like-0.043
Tokenule
Token ID2261
Feature activation+2.42e-01

Pos logit contributions
ak+0.038
He+0.034
he+0.031
 there+0.031
aky+0.031
Neg logit contributions
sed-0.028
opes-0.025
lder-0.024
ses-0.023
 Majesty-0.023
Token was
Token ID373
Feature activation-8.59e-01

Pos logit contributions
ak+0.020
He+0.017
 there+0.015
he+0.015
 like+0.014
Neg logit contributions
sed-0.028
opes-0.026
lder-0.025
ses-0.025
 Majesty-0.024
Token so
Token ID523
Feature activation+6.54e-01

Pos logit contributions
sed+0.099
opes+0.089
ses+0.086
lder+0.085
 Majesty+0.081
Neg logit contributions
ak-0.071
He-0.059
he-0.055
fing-0.053
ill-0.053
Token surprised
Token ID6655
Feature activation+1.13e+00

Pos logit contributions
ak+0.053
He+0.043
fing+0.040
ill+0.039
he+0.039
Neg logit contributions
sed-0.075
lder-0.069
opes-0.068
ses-0.066
 Majesty-0.064
Tokenumbles
Token ID25329
Feature activation+3.18e-02

Pos logit contributions
sed+0.034
opes+0.031
ses+0.029
lder+0.029
 Majesty+0.028
Neg logit contributions
ak-0.040
He-0.036
he-0.034
 there-0.033
 like-0.033
Token and
Token ID290
Feature activation+1.81e-01

Pos logit contributions
ak+0.002
ill+0.002
fing+0.002
aky+0.002
he+0.002
Neg logit contributions
sed-0.004
opes-0.003
ses-0.003
lder-0.003
zed-0.003
Token falls
Token ID8953
Feature activation-9.20e-01

Pos logit contributions
ak+0.015
He+0.013
 there+0.012
he+0.011
 like+0.011
Neg logit contributions
sed-0.020
opes-0.019
lder-0.019
 Majesty-0.018
ses-0.018
Token.
Token ID13
Feature activation+4.14e-01

Pos logit contributions
sed+0.116
opes+0.110
ses+0.104
lder+0.104
 Majesty+0.100
Neg logit contributions
ak-0.063
He-0.048
aky-0.044
he-0.044
ill-0.043
Token She
Token ID1375
Feature activation-2.89e-01

Pos logit contributions
ak+0.025
He+0.024
 there+0.022
 upon+0.021
 of+0.021
Neg logit contributions
opes-0.051
sed-0.051
 Majesty-0.049
lder-0.049
ses-0.046
Token hurts
Token ID20406
Feature activation-1.28e+00

Pos logit contributions
sed+0.024
opes+0.022
lder+0.021
ses+0.020
 Majesty+0.020
Neg logit contributions
ak-0.033
He-0.029
 there-0.027
he-0.027
 upon-0.027
Token her
Token ID607
Feature activation-1.02e+00

Pos logit contributions
sed+0.116
opes+0.110
lder+0.101
ses+0.100
 Majesty+0.093
Neg logit contributions
ak-0.130
He-0.109
he-0.105
fing-0.103
aky-0.101
Token knee
Token ID10329
Feature activation+1.21e+00

Pos logit contributions
sed+0.098
opes+0.094
lder+0.091
 Majesty+0.087
ses+0.085
Neg logit contributions
ak-0.099
He-0.086
 there-0.080
he-0.078
 like-0.078
Token and
Token ID290
Feature activation+2.03e-01

Pos logit contributions
ak+0.085
He+0.069
he+0.061
 there+0.061
ill+0.061
Neg logit contributions
sed-0.149
opes-0.141
lder-0.135
ses-0.133
 Majesty-0.129
Token cries
Token ID24691
Feature activation-3.35e+00

Pos logit contributions
ak+0.019
He+0.017
 there+0.017
 like+0.016
 upon+0.016
Neg logit contributions
sed-0.020
opes-0.019
lder-0.018
 Majesty-0.018
ses-0.017
Token.
Token ID13
Feature activation+2.87e-01

Pos logit contributions
sed+0.398
opes+0.376
lder+0.361
ses+0.354
 Majesty+0.349
Neg logit contributions
ak-0.258
He-0.215
he-0.194
 there-0.190
ill-0.185
Token remember
Token ID3505
Feature activation+1.14e+00

Pos logit contributions
sed+0.014
opes+0.013
lder+0.013
ses+0.012
 Majesty+0.012
Neg logit contributions
ak-0.009
He-0.008
 there-0.007
he-0.007
 like-0.007
Token her
Token ID607
Feature activation-1.28e+00

Pos logit contributions
ak+0.078
 there+0.066
He+0.065
 like+0.062
 of+0.060
Neg logit contributions
sed-0.147
opes-0.137
lder-0.134
ses-0.132
 Majesty-0.128
Token.
Token ID13
Feature activation-5.00e-01

Pos logit contributions
opes+0.130
sed+0.127
lder+0.123
 Majesty+0.117
bank+0.113
Neg logit contributions
ak-0.108
 there-0.099
 like-0.099
 of-0.095
 upon-0.089
Token Now
Token ID2735
Feature activation-2.64e+00

Pos logit contributions
sed+0.047
opes+0.044
lder+0.043
ses+0.041
 Majesty+0.041
Neg logit contributions
ak-0.051
He-0.042
he-0.041
aky-0.040
 there-0.039
Token it
Token ID340
Feature activation-6.72e-01

Pos logit contributions
sed+0.316
opes+0.301
lder+0.290
 Majesty+0.282
ses+0.281
Neg logit contributions
ak-0.200
He-0.167
he-0.147
 there-0.145
aky-0.143
Token is
Token ID318
Feature activation-1.15e+00

Pos logit contributions
sed+0.082
opes+0.079
 Majesty+0.074
ses+0.074
lder+0.074
Neg logit contributions
ak-0.049
 there-0.041
He-0.041
 like-0.038
 of-0.038
Token gone
Token ID3750
Feature activation-5.78e-01

Pos logit contributions
sed+0.127
opes+0.121
lder+0.114
ses+0.112
 Majesty+0.112
Neg logit contributions
ak-0.095
He-0.084
he-0.077
ill-0.073
aky-0.071
Token.
Token ID13
Feature activation-5.74e-01

Pos logit contributions
sed+0.064
opes+0.061
lder+0.058
 Majesty+0.056
ses+0.056
Neg logit contributions
ak-0.047
He-0.041
he-0.036
ill-0.035
fing-0.035
Token We
Token ID775
Feature activation-2.63e-01

Pos logit contributions
sed+0.051
opes+0.047
lder+0.046
ses+0.045
 Majesty+0.044
Neg logit contributions
ak-0.060
He-0.050
aky-0.048
he-0.048
 upon-0.047
Token are
Token ID389
Feature activation+6.81e-01

Pos logit contributions
sed+0.031
lder+0.028
opes+0.028
ses+0.028
 Majesty+0.026
Neg logit contributions
ak-0.019
He-0.017
ill-0.014
aky-0.014
he-0.013
Token both
Token ID1111
Feature activation+2.78e-01

Pos logit contributions
ak+0.050
 there+0.043
 like+0.040
He+0.038
 of+0.037
Neg logit contributions
sed-0.084
opes-0.080
lder-0.077
ses-0.075
 endings-0.074
Token
Token ID220
Feature activation-3.87e-01

Pos logit contributions
sed+0.019
opes+0.017
lder+0.017
 Majesty+0.017
ses+0.016
Neg logit contributions
ak-0.014
He-0.014
 there-0.012
 upon-0.012
he-0.012
Token
Token ID198
Feature activation-1.38e-01

Pos logit contributions
sed+0.053
opes+0.051
lder+0.049
 Majesty+0.048
ses+0.048
Neg logit contributions
ak-0.023
He-0.019
he-0.016
 there-0.016
 upon-0.015
Token
Token ID198
Feature activation-6.12e-02

Pos logit contributions
sed+0.018
opes+0.017
lder+0.016
ses+0.016
 Majesty+0.016
Neg logit contributions
ak-0.009
He-0.007
 there-0.006
he-0.006
 upon-0.006
TokenFrom
Token ID4863
Feature activation-1.95e+00

Pos logit contributions
sed+0.006
opes+0.006
lder+0.006
 Majesty+0.006
 endings+0.006
Neg logit contributions
ak-0.005
He-0.004
 there-0.004
 like-0.004
ill-0.004
Token that
Token ID326
Feature activation-1.97e+00

Pos logit contributions
sed+0.159
opes+0.146
lder+0.139
ses+0.133
 Majesty+0.127
Neg logit contributions
ak-0.219
He-0.193
ill-0.186
aky-0.180
 upon-0.177
Token day
Token ID1110
Feature activation-1.19e+00

Pos logit contributions
sed+0.215
opes+0.206
 Majesty+0.194
ses+0.187
lder+0.185
Neg logit contributions
ak-0.179
He-0.156
 there-0.153
he-0.152
 of-0.151
Token on
Token ID319
Feature activation+6.55e-01

Pos logit contributions
sed+0.171
opes+0.162
lder+0.155
ses+0.154
zed+0.150
Neg logit contributions
ak-0.057
He-0.045
ill-0.038
aky-0.034
 upon-0.034
Token,
Token ID11
Feature activation-1.77e-01

Pos logit contributions
ak+0.027
 there+0.026
 of+0.023
He+0.022
he+0.019
Neg logit contributions
sed-0.099
lder-0.094
opes-0.094
 Majesty-0.092
ses-0.092
Token Jack
Token ID3619
Feature activation-8.45e-01

Pos logit contributions
sed+0.020
opes+0.019
lder+0.018
ses+0.018
 Majesty+0.018
Neg logit contributions
ak-0.015
 there-0.013
He-0.013
 like-0.013
 of-0.013
Token always
Token ID1464
Feature activation+2.43e-01

Pos logit contributions
sed+0.105
opes+0.097
ses+0.095
lder+0.092
 Majesty+0.091
Neg logit contributions
ak-0.064
He-0.054
 like-0.049
 there-0.048
 of-0.047
Token had
Token ID550
Feature activation-1.57e-01

Pos logit contributions
ak+0.020
He+0.018
he+0.016
ill+0.015
 upon+0.015
Neg logit contributions
sed-0.027
opes-0.025
lder-0.025
ses-0.024
 Majesty-0.024
Token they
Token ID484
Feature activation-7.03e-01

Pos logit contributions
ak+0.018
He+0.015
 there+0.015
 of+0.014
 like+0.014
Neg logit contributions
sed-0.031
opes-0.030
ses-0.028
lder-0.028
 Majesty-0.028
Token were
Token ID547
Feature activation-9.63e-02

Pos logit contributions
sed+0.079
opes+0.075
lder+0.073
ses+0.070
 Majesty+0.069
Neg logit contributions
ak-0.057
He-0.048
ill-0.042
he-0.042
 there-0.042
Token looking
Token ID2045
Feature activation-1.81e+00

Pos logit contributions
sed+0.012
opes+0.011
lder+0.011
ses+0.010
 Majesty+0.010
Neg logit contributions
ak-0.007
He-0.006
 there-0.006
he-0.005
 like-0.005
Token for
Token ID329
Feature activation-1.67e+00

Pos logit contributions
sed+0.185
opes+0.174
lder+0.166
 Majesty+0.162
ses+0.160
Neg logit contributions
ak-0.164
He-0.142
he-0.129
ill-0.126
aky-0.125
Token--
Token ID438
Feature activation+2.65e-02

Pos logit contributions
sed+0.201
opes+0.191
lder+0.184
ses+0.179
 Majesty+0.178
Neg logit contributions
ak-0.125
He-0.105
 there-0.093
he-0.093
 upon-0.089
Token wherever
Token ID14530
Feature activation-2.38e+00

Pos logit contributions
ak+0.002
He+0.002
aky+0.001
he+0.001
ill+0.001
Neg logit contributions
sed-0.003
opes-0.003
lder-0.003
 Majesty-0.003
 endings-0.003
Token they
Token ID484
Feature activation-6.14e-01

Pos logit contributions
sed+0.270
opes+0.258
lder+0.246
ses+0.246
 Majesty+0.241
Neg logit contributions
ak-0.191
 there-0.177
He-0.173
 of-0.162
 upon-0.158
Token went
Token ID1816
Feature activation-1.20e+00

Pos logit contributions
sed+0.063
opes+0.060
lder+0.058
ses+0.055
 Majesty+0.054
Neg logit contributions
ak-0.055
He-0.047
aky-0.042
 there-0.042
ill-0.042
Token,
Token ID11
Feature activation-6.89e-01

Pos logit contributions
sed+0.130
opes+0.126
lder+0.120
 endings+0.115
ses+0.114
Neg logit contributions
ak-0.098
He-0.082
aky-0.075
ill-0.075
he-0.074
Token they
Token ID484
Feature activation-6.97e-01

Pos logit contributions
sed+0.082
opes+0.078
lder+0.075
ses+0.073
 Majesty+0.073
Neg logit contributions
ak-0.052
He-0.043
he-0.038
 there-0.038
aky-0.037
Token were
Token ID547
Feature activation-2.06e-01

Pos logit contributions
sed+0.078
opes+0.073
lder+0.071
 Majesty+0.068
ses+0.068
Neg logit contributions
ak-0.057
He-0.049
he-0.043
 there-0.043
aky-0.043
Token n
Token ID299
Feature activation-1.35e+00

Pos logit contributions
sed+0.021
opes+0.020
lder+0.019
ses+0.018
 Majesty+0.017
Neg logit contributions
ak-0.017
He-0.016
 there-0.015
 like-0.015
 of-0.015
Tokenaps
Token ID1686
Feature activation-7.08e-02

Pos logit contributions
sed+0.148
opes+0.141
lder+0.134
 Majesty+0.132
ses+0.131
Neg logit contributions
ak-0.109
He-0.102
 like-0.085
 there-0.084
he-0.083
Token.
Token ID13
Feature activation-5.44e-01

Pos logit contributions
sed+0.007
opes+0.007
lder+0.007
ses+0.006
 Majesty+0.006
Neg logit contributions
ak-0.006
He-0.005
fing-0.005
he-0.005
ill-0.004
Token I
Token ID314
Feature activation+3.45e-01

Pos logit contributions
sed+0.054
opes+0.051
lder+0.049
ses+0.047
 Majesty+0.047
Neg logit contributions
ak-0.052
He-0.045
 there-0.042
he-0.041
 like-0.041
Token like
Token ID588
Feature activation+1.94e+00

Pos logit contributions
ak+0.028
He+0.024
ill+0.022
 there+0.022
he+0.021
Neg logit contributions
sed-0.038
lder-0.035
ses-0.034
 Majesty-0.033
opes-0.033
Token you
Token ID345
Feature activation-1.27e+00

Pos logit contributions
 there+0.114
 of+0.106
 like+0.103
ak+0.099
 upon+0.092
Neg logit contributions
sed-0.262
opes-0.251
ses-0.237
lder-0.235
 Majesty-0.222
Token and
Token ID290
Feature activation-2.72e-01

Pos logit contributions
sed+0.143
opes+0.136
lder+0.131
ses+0.126
 Majesty+0.126
Neg logit contributions
ak-0.104
He-0.090
he-0.079
 there-0.078
ill-0.077
Token the
Token ID262
Feature activation-2.80e-01

Pos logit contributions
sed+0.032
opes+0.032
lder+0.029
ses+0.028
 Majesty+0.028
Neg logit contributions
ak-0.019
 there-0.017
He-0.017
 like-0.017
 upon-0.015
Token garden
Token ID11376
Feature activation-1.11e+00

Pos logit contributions
sed+0.027
opes+0.026
lder+0.024
ses+0.023
 Majesty+0.022
Neg logit contributions
ak-0.027
 there-0.023
He-0.023
 like-0.023
 upon-0.023
Token.
Token ID13
Feature activation-6.04e-01

Pos logit contributions
sed+0.137
opes+0.130
lder+0.125
ses+0.122
 Majesty+0.120
Neg logit contributions
ak-0.082
He-0.068
 there-0.063
 like-0.061
he-0.060
Token I
Token ID314
Feature activation+3.14e-01

Pos logit contributions
sed+0.059
opes+0.055
lder+0.053
ses+0.052
 Majesty+0.051
Neg logit contributions
ak-0.059
He-0.051
he-0.048
 there-0.047
aky-0.046
Token mean
Token ID1612
Feature activation+1.24e+00

Pos logit contributions
ak+0.042
He+0.038
 there+0.036
 like+0.034
 upon+0.033
Neg logit contributions
sed-0.059
opes-0.058
 Majesty-0.056
lder-0.056
ses-0.054
Token.
Token ID13
Feature activation+6.68e-01

Pos logit contributions
ak+0.077
He+0.067
 there+0.061
 like+0.061
 of+0.057
Neg logit contributions
sed-0.161
opes-0.155
lder-0.153
 Majesty-0.150
ses-0.145
Token
Token ID198
Feature activation+5.87e-01

Pos logit contributions
ak+0.052
He+0.046
 there+0.041
he+0.039
 like+0.039
Neg logit contributions
sed-0.078
opes-0.074
lder-0.072
 Majesty-0.070
ses-0.069
Token
Token ID198
Feature activation+6.15e-01

Pos logit contributions
ak+0.046
He+0.037
ill+0.037
aky+0.036
fing+0.036
Neg logit contributions
sed-0.063
opes-0.060
lder-0.059
ses-0.055
 Majesty-0.052
TokenWh
Token ID1199
Feature activation-1.14e+00

Pos logit contributions
ak+0.058
He+0.050
ill+0.048
he+0.047
 there+0.047
Neg logit contributions
sed-0.060
opes-0.057
lder-0.055
ses-0.052
 Majesty-0.051
Tokenisk
Token ID1984
Feature activation-5.61e+00

Pos logit contributions
sed+0.061
opes+0.054
lder+0.049
ses+0.046
 Majesty+0.046
Neg logit contributions
ak-0.163
He-0.149
 there-0.141
he-0.141
 upon-0.139
Tokeners
Token ID364
Feature activation+4.01e-01

Pos logit contributions
sed+0.646
opes+0.613
lder+0.590
ses+0.573
 Majesty+0.570
Neg logit contributions
ak-0.448
He-0.380
 there-0.340
he-0.338
 upon-0.328
Token knew
Token ID2993
Feature activation-1.25e+00

Pos logit contributions
ak+0.028
He+0.024
 there+0.022
 upon+0.020
he+0.020
Neg logit contributions
sed-0.051
opes-0.049
ses-0.048
 Majesty-0.047
 endings-0.044
Token she
Token ID673
Feature activation+2.49e-01

Pos logit contributions
sed+0.157
opes+0.150
lder+0.147
 Majesty+0.142
ses+0.141
Neg logit contributions
ak-0.088
He-0.075
 there-0.071
 like-0.064
 of-0.064
Token had
Token ID550
Feature activation+3.74e-01

Pos logit contributions
ak+0.018
He+0.015
 there+0.014
 like+0.013
he+0.013
Neg logit contributions
sed-0.031
opes-0.029
lder-0.028
 Majesty-0.028
ses-0.028
Token to
Token ID284
Feature activation-1.55e-01

Pos logit contributions
ak+0.021
 there+0.018
 seen+0.015
 like+0.015
 upon+0.014
Neg logit contributions
opes-0.052
sed-0.051
lder-0.049
 Majesty-0.049
ses-0.047
Token and
Token ID290
Feature activation+2.42e-01

Pos logit contributions
sed+0.244
opes+0.233
lder+0.225
ses+0.220
 Majesty+0.218
Neg logit contributions
ak-0.114
He-0.092
 there-0.079
he-0.078
 upon-0.075
Token red
Token ID2266
Feature activation-1.17e+00

Pos logit contributions
ak+0.020
He+0.017
 there+0.016
 like+0.015
 upon+0.015
Neg logit contributions
sed-0.027
opes-0.026
lder-0.026
 Majesty-0.025
 endings-0.024
Token.
Token ID13
Feature activation+5.81e-02

Pos logit contributions
sed+0.157
opes+0.152
lder+0.151
 Majesty+0.149
ses+0.142
Neg logit contributions
ak-0.066
He-0.056
 like-0.052
 there-0.050
 of-0.049
Token It
Token ID632
Feature activation-1.24e-01

Pos logit contributions
ak+0.005
He+0.004
 there+0.004
he+0.004
 like+0.004
Neg logit contributions
sed-0.006
opes-0.006
lder-0.006
 Majesty-0.006
ses-0.005
Token his
Token ID465
Feature activation+9.96e-01

Pos logit contributions
sed+0.015
opes+0.014
 Majesty+0.014
lder+0.014
ses+0.013
Neg logit contributions
ak-0.009
He-0.008
 there-0.007
 like-0.007
 of-0.007
Tokensed
Token ID36622
Feature activation-6.49e+00

Pos logit contributions
ak+0.155
 like+0.137
 there+0.137
He+0.135
 upon+0.134
Neg logit contributions
lder-0.034
opes-0.032
storms-0.025
 endings-0.023
 Majesty-0.022
Token at
Token ID379
Feature activation+2.26e-01

Pos logit contributions
sed+0.825
opes+0.783
lder+0.746
ses+0.738
 Majesty+0.715
Neg logit contributions
ak-0.439
He-0.357
he-0.328
 there-0.306
 upon-0.305
Token Lily
Token ID20037
Feature activation-4.24e-01

Pos logit contributions
ak+0.017
He+0.015
 there+0.014
he+0.014
ill+0.013
Neg logit contributions
sed-0.027
opes-0.025
lder-0.024
 Majesty-0.024
ses-0.024
Token.
Token ID13
Feature activation+4.71e-02

Pos logit contributions
sed+0.050
opes+0.048
lder+0.046
ses+0.045
 Majesty+0.044
Neg logit contributions
ak-0.032
He-0.027
he-0.024
 there-0.024
 upon-0.023
Token Lily
Token ID20037
Feature activation-4.73e-01

Pos logit contributions
ak+0.004
He+0.003
 there+0.003
he+0.003
 like+0.003
Neg logit contributions
sed-0.005
opes-0.005
lder-0.005
 Majesty-0.005
ses-0.005
Token screamed
Token ID25421
Feature activation-1.00e+00

Pos logit contributions
sed+0.056
opes+0.054
 Majesty+0.052
lder+0.051
ses+0.050
Neg logit contributions
ak-0.036
He-0.031
 there-0.028
 like-0.027
he-0.026
TokenLet
Token ID5756
Feature activation+2.68e-01

Pos logit contributions
sed+0.055
opes+0.052
lder+0.052
ses+0.048
 Majesty+0.048
Neg logit contributions
ak-0.043
He-0.034
ill-0.033
 there-0.033
he-0.032
Token's
Token ID338
Feature activation-1.39e-01

Pos logit contributions
ak+0.015
He+0.013
 there+0.012
 of+0.011
 like+0.010
Neg logit contributions
sed-0.039
ses-0.036
opes-0.036
 Majesty-0.034
lder-0.032
Token play
Token ID711
Feature activation+1.31e-01

Pos logit contributions
sed+0.015
opes+0.014
lder+0.013
 Majesty+0.013
ses+0.013
Neg logit contributions
ak-0.012
He-0.011
aky-0.010
he-0.009
 upon-0.009
Token!"
Token ID2474
Feature activation-1.72e-01

Pos logit contributions
ak+0.010
He+0.009
ill+0.008
he+0.008
 upon+0.007
Neg logit contributions
sed-0.015
opes-0.015
lder-0.014
ses-0.014
 Majesty-0.013
Token Wh
Token ID854
Feature activation-1.48e+00

Pos logit contributions
sed+0.017
opes+0.016
 Majesty+0.016
lder+0.015
ses+0.015
Neg logit contributions
ak-0.016
He-0.015
 there-0.013
 like-0.013
 of-0.013
Tokenisk
Token ID1984
Feature activation-5.73e+00

Pos logit contributions
sed+0.087
opes+0.077
lder+0.072
 Majesty+0.071
ses+0.068
Neg logit contributions
ak-0.204
He-0.185
he-0.173
aky-0.172
ill-0.171
Tokeners
Token ID364
Feature activation+2.88e-01

Pos logit contributions
sed+0.682
opes+0.643
lder+0.618
ses+0.607
 Majesty+0.603
Neg logit contributions
ak-0.445
He-0.375
 there-0.333
he-0.333
 like-0.320
Token said
Token ID531
Feature activation+3.68e-03

Pos logit contributions
ak+0.021
He+0.017
 there+0.016
he+0.015
 upon+0.015
Neg logit contributions
sed-0.036
opes-0.034
 Majesty-0.033
ses-0.033
lder-0.032
Token,
Token ID11
Feature activation+1.44e-01

Pos logit contributions
ak+0.000
He+0.000
 there+0.000
 like+0.000
 of+0.000
Neg logit contributions
sed-0.000
opes-0.000
lder-0.000
 Majesty-0.000
ses-0.000
Token "
Token ID366
Feature activation-5.85e-01

Pos logit contributions
ak+0.012
He+0.012
 there+0.011
 like+0.011
 of+0.010
Neg logit contributions
opes-0.015
 endings-0.014
 Majesty-0.014
sed-0.014
lder-0.013
TokenNo
Token ID2949
Feature activation+5.50e-01

Pos logit contributions
sed+0.064
opes+0.060
lder+0.060
ses+0.057
 Majesty+0.056
Neg logit contributions
ak-0.049
He-0.041
 there-0.039
 upon-0.038
ill-0.038
Token the
Token ID262
Feature activation+7.14e-01

Pos logit contributions
sed+0.056
opes+0.054
lder+0.052
ses+0.050
 Majesty+0.050
Neg logit contributions
ak-0.032
He-0.026
he-0.021
ill-0.021
aky-0.021
Token trees
Token ID7150
Feature activation-1.78e+00

Pos logit contributions
ak+0.073
He+0.064
 there+0.064
 upon+0.061
 of+0.061
Neg logit contributions
sed-0.069
lder-0.060
opes-0.060
ses-0.060
 Majesty-0.057
Token,
Token ID11
Feature activation+2.08e-01

Pos logit contributions
sed+0.210
opes+0.200
lder+0.192
ses+0.187
 Majesty+0.182
Neg logit contributions
ak-0.134
He-0.114
he-0.097
aky-0.097
 there-0.096
Token the
Token ID262
Feature activation+6.96e-01

Pos logit contributions
ak+0.020
He+0.018
 there+0.017
he+0.016
 like+0.016
Neg logit contributions
sed-0.020
opes-0.019
lder-0.018
ses-0.017
 Majesty-0.017
Token flower
Token ID15061
Feature activation-5.23e-02

Pos logit contributions
ak+0.060
 there+0.052
He+0.052
 like+0.050
 upon+0.049
Neg logit contributions
sed-0.074
opes-0.070
lder-0.068
ses-0.066
 Majesty-0.065
Tokenbed
Token ID3077
Feature activation-4.25e+00

Pos logit contributions
sed+0.007
opes+0.006
lder+0.006
 Majesty+0.006
ses+0.006
Neg logit contributions
ak-0.004
 there-0.003
 of-0.003
 like-0.003
He-0.003
Tokens
Token ID82
Feature activation-6.57e-01

Pos logit contributions
sed+0.511
opes+0.484
lder+0.471
ses+0.457
 Majesty+0.457
Neg logit contributions
ak-0.317
He-0.264
 there-0.243
 like-0.233
 upon-0.232
Token,
Token ID11
Feature activation+2.12e-01

Pos logit contributions
sed+0.073
opes+0.072
lder+0.071
 Majesty+0.070
ses+0.066
Neg logit contributions
ak-0.052
 like-0.042
 there-0.042
 of-0.041
 upon-0.040
Token and
Token ID290
Feature activation+5.69e-01

Pos logit contributions
ak+0.013
He+0.011
 there+0.010
 like+0.009
he+0.009
Neg logit contributions
sed-0.028
opes-0.027
lder-0.026
 Majesty-0.026
ses-0.026
Token the
Token ID262
Feature activation+6.67e-01

Pos logit contributions
ak+0.039
He+0.033
 there+0.033
 like+0.031
 of+0.030
Neg logit contributions
sed-0.071
opes-0.068
lder-0.066
ses-0.064
 Majesty-0.063
Token patches
Token ID16082
Feature activation+6.46e-01

Pos logit contributions
ak+0.053
 there+0.046
He+0.045
 like+0.044
 upon+0.043
Neg logit contributions
sed-0.074
opes-0.073
lder-0.070
ses-0.066
 Majesty-0.066
Token cher
Token ID21382
Feature activation-6.64e-01

Pos logit contributions
sed+0.021
opes+0.021
 Majesty+0.019
lder+0.019
ses+0.019
Neg logit contributions
ak-0.026
 there-0.023
He-0.023
he-0.022
 like-0.022
Tokenries
Token ID1678
Feature activation-3.07e-01

Pos logit contributions
sed+0.043
opes+0.041
lder+0.038
 Majesty+0.034
ses+0.034
Neg logit contributions
ak-0.083
He-0.077
 like-0.071
 there-0.070
aky-0.070
Token and
Token ID290
Feature activation-2.51e-01

Pos logit contributions
sed+0.036
opes+0.034
lder+0.033
ses+0.032
 Majesty+0.030
Neg logit contributions
ak-0.022
He-0.018
aky-0.016
 upon-0.016
fing-0.016
Token ch
Token ID442
Feature activation-2.70e+00

Pos logit contributions
opes+0.031
sed+0.031
lder+0.030
 Majesty+0.030
ses+0.028
Neg logit contributions
ak-0.016
 there-0.014
 like-0.013
He-0.013
 of-0.011
Tokenir
Token ID343
Feature activation-3.15e+00

Pos logit contributions
sed+0.262
opes+0.245
lder+0.234
ses+0.226
 Majesty+0.225
Neg logit contributions
ak-0.270
He-0.237
 there-0.218
he-0.217
 upon-0.212
Tokenped
Token ID9124
Feature activation-4.33e+00

Pos logit contributions
sed+0.246
lder+0.219
 Majesty+0.217
opes+0.215
ses+0.203
Neg logit contributions
ak-0.382
He-0.328
 like-0.318
 of-0.312
aky-0.303
Token happily
Token ID18177
Feature activation-2.04e+00

Pos logit contributions
sed+0.559
opes+0.542
lder+0.516
ses+0.507
 Majesty+0.504
Neg logit contributions
ak-0.288
He-0.248
 there-0.229
 like-0.224
 closer-0.207
Token.
Token ID13
Feature activation-6.21e-01

Pos logit contributions
sed+0.239
opes+0.225
lder+0.218
ses+0.213
 Majesty+0.209
Neg logit contributions
ak-0.158
He-0.135
he-0.119
 there-0.118
 upon-0.116
Token It
Token ID632
Feature activation-6.81e-01

Pos logit contributions
sed+0.071
opes+0.066
ses+0.063
lder+0.063
 Majesty+0.062
Neg logit contributions
ak-0.051
He-0.041
he-0.039
aky-0.038
ill-0.037
Token trusted
Token ID13467
Feature activation-3.25e-01

Pos logit contributions
sed+0.081
opes+0.080
lder+0.075
 Majesty+0.075
ses+0.073
Neg logit contributions
ak-0.051
He-0.044
 there-0.040
 like-0.039
aky-0.036
Token them
Token ID606
Feature activation-7.33e-01

Pos logit contributions
sed+0.039
opes+0.037
lder+0.037
ses+0.035
 Majesty+0.035
Neg logit contributions
ak-0.024
 there-0.021
He-0.021
 like-0.021
 of-0.020