TOP ACTIVATIONS
MAX = 8.465

swe aters and hats and scar ves from the wool .
you like your hats and scar ves . You are our
. There are hats and scar ves and gloves and m
like to knit hats and scar ves for my friends ."
and tug - of - war . The fox was happy
. They find helmets and scar ves and gloves to wear
They take their hats and scar ves and throw them in
. They wear hats and scar ves and gloves . They
coats , shoes , and scar ves . Tom
They put on hats and scar ves . They look like
. She makes hats and scar ves and dolls with yarn
their mom 's hats and scar ves . They stand on
were soft things , like scar ves and hats . There
They put on hats and scar ves and pretended to have
and blew their hats and scar ves away . They ran
. They find hats and scar ves and glasses . They
makes gloves , hats , scar ves , and swe aters
saw people wearing hats and scar ves and gloves . They
play tug - of - war !" The birds
only found their hats and scar ves , torn and bloody

MIN ACTIVATIONS
MIN = -13.225

. He lived in the tre et ops , climbing from
as he soared over the tre et ops . It felt
soon became Joy 's most tre asured toy , and she
She picks up the re ind eer . She looks at
ran up to the re ind eer , pet ting it
one of Mary 's most tre asured possessions and she never
couldn 't get above the tre et ops . He was
of the house 's most tre asured treasures . <|endoftext|> Once
up to the restless re ind eer and said , "
to work cleaning the re ind eer . With her help
, he spotted another re ind eer .
the razor was the most tre asured thing in all of
he saw the empty r ind and the seeds on the
small bird sitting on the tre et op . His name
walked up to the re ind eer . Suddenly , the
got back the impatient re ind eer was waiting for them
there was a pink re ind eer . Her name was
. It was her most tre asured one , and it
and her puppy developed a tre asured friendship . Together ,
heard a noise in the tre et ops . He looked

INTERVAL 3.043 - 8.465
CONTAINS 6.069%

are you doing ?" asked a voice . James
before eating . Mom my said I should be careful with
with your ball ," Tom says . They agree
time ." They take some paper and go to play .
together . <|endoftext|> Once upon a time , there was a

INTERVAL -2.380 - 3.043
CONTAINS 81.428%

to join her . He wanted to play too , but
said . She hugged them and kissed them . " You
" My name is Lily . Do you want to sit
They find a big rock near the water . They
was tough to put the tent up . L

INTERVAL -7.803 - -2.380
CONTAINS 12.481%

her hand . " Be careful , Lily ,
of the animals to her home . So every morning she
bird whist ling in a tree . Sammy thought it was
usually find help to overcome your fears . <|endoftext|> Once there
back to his nest . His mom loved the ring and

INTERVAL -13.225 - -7.803
CONTAINS 0.022%

a big box of ch ocol ates ! <|endoftext|> Once upon
, Why don â
Once there was a re ind eer . She loved to
, I like seeing you sleep
The re ind eer was so pleased that
Token swe
Token ID3490
Feature activation+1.61e+00

Pos logit contributions
ves+0.116
 replied+0.115
gh+0.105
 story+0.099
 poem+0.099
Neg logit contributions
ind-0.182
 beings-0.168
asured-0.159
pered-0.155
 yourselves-0.154
Tokenaters
Token ID8605
Feature activation+7.81e-01

Pos logit contributions
 replied+0.062
ves+0.057
gh+0.057
enda+0.057
 +0.054
Neg logit contributions
ind-0.036
 beings-0.035
 alone-0.034
 prisoners-0.032
 slaves-0.031
Token and
Token ID290
Feature activation+1.21e+00

Pos logit contributions
ves+0.023
 replied+0.022
gh+0.021
 story+0.020
 poem+0.019
Neg logit contributions
ind-0.028
 beings-0.026
asured-0.024
pered-0.024
 yourselves-0.023
Token hats
Token ID23910
Feature activation+4.82e+00

Pos logit contributions
ves+0.034
 replied+0.034
gh+0.031
 story+0.030
 poem+0.030
Neg logit contributions
ind-0.044
 beings-0.041
asured-0.038
pered-0.037
 yourselves-0.037
Token and
Token ID290
Feature activation+1.35e+00

Pos logit contributions
 replied+0.133
ves+0.132
gh+0.121
 story+0.114
 happen+0.114
Neg logit contributions
ind-0.182
 beings-0.166
asured-0.158
 yourselves-0.152
pered-0.152
Token scar
Token ID10153
Feature activation+8.47e+00

Pos logit contributions
ves+0.039
 replied+0.039
gh+0.037
 story+0.034
 poem+0.034
Neg logit contributions
ind-0.048
 beings-0.044
asured-0.041
pered-0.040
 yourselves-0.040
Tokenves
Token ID1158
Feature activation+4.03e+00

Pos logit contributions
 replied+0.104
 poem+0.092
 story+0.080
 Kelly+0.075
gh+0.066
Neg logit contributions
ind-0.430
 beings-0.400
pered-0.391
 yourselves-0.381
asured-0.378
Token from
Token ID422
Feature activation-1.77e+00

Pos logit contributions
ves+0.106
 replied+0.105
gh+0.095
 story+0.090
 poem+0.089
Neg logit contributions
ind-0.157
 beings-0.144
asured-0.137
pered-0.132
 yourselves-0.132
Token the
Token ID262
Feature activation+2.22e+00

Pos logit contributions
ind+0.080
 beings+0.077
attered+0.077
asured+0.074
pered+0.072
Neg logit contributions
 replied-0.030
ves-0.028
 happen-0.024
gh-0.024
 -0.023
Token wool
Token ID25749
Feature activation+3.04e+00

Pos logit contributions
ves+0.071
 replied+0.071
gh+0.066
 story+0.064
 poem+0.063
Neg logit contributions
ind-0.073
 beings-0.066
asured-0.063
pered-0.060
attered-0.059
Token.
Token ID13
Feature activation-3.90e-01

Pos logit contributions
 replied+0.079
ves+0.079
gh+0.071
 story+0.068
 poem+0.067
Neg logit contributions
ind-0.119
 beings-0.109
asured-0.105
pered-0.101
attered-0.100
Token you
Token ID345
Feature activation+1.41e+00

Pos logit contributions
ves+0.039
 replied+0.039
gh+0.035
 poem+0.033
 story+0.033
Neg logit contributions
ind-0.058
 beings-0.054
asured-0.051
 yourselves-0.050
pered-0.049
Token like
Token ID588
Feature activation-8.97e-01

Pos logit contributions
 replied+0.039
ves+0.039
 happen+0.037
 story+0.036
gh+0.035
Neg logit contributions
ind-0.052
 beings-0.047
attered-0.045
 yourselves-0.044
pered-0.044
Token your
Token ID534
Feature activation-9.22e-01

Pos logit contributions
ind+0.035
 beings+0.032
asured+0.031
pered+0.030
attered+0.030
Neg logit contributions
ves-0.023
 replied-0.023
gh-0.021
 story-0.020
 poem-0.020
Token hats
Token ID23910
Feature activation+3.94e+00

Pos logit contributions
ind+0.033
 beings+0.030
asured+0.029
pered+0.028
attered+0.028
Neg logit contributions
ves-0.027
 replied-0.027
gh-0.025
 story-0.023
 poem-0.023
Token and
Token ID290
Feature activation+1.79e+00

Pos logit contributions
 replied+0.101
ves+0.098
gh+0.089
 story+0.088
 happen+0.086
Neg logit contributions
ind-0.157
 beings-0.143
asured-0.136
pered-0.135
attered-0.134
Token scar
Token ID10153
Feature activation+8.28e+00

Pos logit contributions
ves+0.057
 replied+0.057
gh+0.053
 story+0.051
 poem+0.051
Neg logit contributions
ind-0.059
 beings-0.053
asured-0.050
pered-0.049
attered-0.048
Tokenves
Token ID1158
Feature activation+3.43e+00

Pos logit contributions
 replied+0.143
 Kelly+0.115
 poem+0.112
 +0.106
gh+0.101
Neg logit contributions
ind-0.400
 beings-0.365
 yourselves-0.345
asured-0.343
ishing-0.335
Token.
Token ID13
Feature activation-3.72e-01

Pos logit contributions
ves+0.085
 replied+0.084
gh+0.073
 story+0.073
 happen+0.072
Neg logit contributions
ind-0.141
 beings-0.127
asured-0.124
attered-0.121
pered-0.119
Token You
Token ID921
Feature activation+1.43e+00

Pos logit contributions
ind+0.014
 beings+0.012
asured+0.012
ishing+0.012
attered+0.011
Neg logit contributions
 replied-0.011
ves-0.010
 -0.010
 story-0.009
gh-0.009
Token are
Token ID389
Feature activation+2.31e-01

Pos logit contributions
 replied+0.036
ves+0.036
gh+0.032
 story+0.032
 poem+0.031
Neg logit contributions
ind-0.058
 beings-0.052
asured-0.050
pered-0.049
attered-0.049
Token our
Token ID674
Feature activation-7.01e-01

Pos logit contributions
 replied+0.006
ves+0.005
 story+0.005
gh+0.005
 happen+0.005
Neg logit contributions
ind-0.009
attered-0.008
 beings-0.008
asured-0.008
pered-0.008
Token.
Token ID13
Feature activation-6.79e-01

Pos logit contributions
ves+0.091
 replied+0.088
gh+0.079
 story+0.076
 poem+0.074
Neg logit contributions
ind-0.167
 beings-0.154
asured-0.149
pered-0.144
attered-0.143
Token There
Token ID1318
Feature activation+1.28e+00

Pos logit contributions
ind+0.028
 beings+0.026
asured+0.025
attered+0.024
ishing+0.024
Neg logit contributions
 replied-0.017
ves-0.017
 -0.015
 story-0.014
gh-0.014
Token are
Token ID389
Feature activation-4.23e-01

Pos logit contributions
ves+0.036
 replied+0.036
gh+0.033
 poem+0.031
 story+0.031
Neg logit contributions
ind-0.046
 beings-0.043
asured-0.040
 yourselves-0.039
pered-0.039
Token hats
Token ID23910
Feature activation+4.26e+00

Pos logit contributions
ind+0.016
 beings+0.015
asured+0.014
ishing+0.014
pered+0.014
Neg logit contributions
ves-0.012
 replied-0.011
 story-0.010
gh-0.010
 poem-0.010
Token and
Token ID290
Feature activation+1.29e+00

Pos logit contributions
 replied+0.113
ves+0.112
gh+0.099
 story+0.099
 happen+0.096
Neg logit contributions
ind-0.167
 beings-0.152
asured-0.146
pered-0.141
attered-0.140
Token scar
Token ID10153
Feature activation+8.05e+00

Pos logit contributions
ves+0.038
 replied+0.038
gh+0.035
 story+0.033
 poem+0.033
Neg logit contributions
ind-0.045
 beings-0.041
asured-0.039
 yourselves-0.038
pered-0.038
Tokenves
Token ID1158
Feature activation+3.76e+00

Pos logit contributions
 replied+0.075
 poem+0.053
 Kelly+0.046
 +0.039
 story+0.039
Neg logit contributions
ind-0.437
 beings-0.408
 yourselves-0.392
pered-0.388
ishing-0.379
Token and
Token ID290
Feature activation+1.33e+00

Pos logit contributions
ves+0.108
 replied+0.106
gh+0.095
 story+0.093
 poem+0.092
Neg logit contributions
ind-0.140
 beings-0.126
asured-0.121
attered-0.116
 yourselves-0.115
Token gloves
Token ID18051
Feature activation+5.16e+00

Pos logit contributions
ves+0.037
 replied+0.037
gh+0.034
 poem+0.033
 story+0.033
Neg logit contributions
ind-0.049
 beings-0.044
asured-0.042
attered-0.041
pered-0.041
Token and
Token ID290
Feature activation+1.31e+00

Pos logit contributions
ves+0.140
 replied+0.140
gh+0.126
 story+0.121
 happen+0.120
Neg logit contributions
ind-0.197
 beings-0.180
asured-0.171
pered-0.166
attered-0.164
Token m
Token ID285
Feature activation+2.21e+00

Pos logit contributions
ves+0.036
 replied+0.036
gh+0.032
 poem+0.032
 story+0.032
Neg logit contributions
ind-0.049
 beings-0.045
asured-0.043
pered-0.041
attered-0.041
Token like
Token ID588
Feature activation-7.05e-01

Pos logit contributions
ind+0.000
 beings+0.000
 yourselves+0.000
pered+0.000
asured+0.000
Neg logit contributions
 replied-0.000
ves-0.000
gh-0.000
 story-0.000
 happen-0.000
Token to
Token ID284
Feature activation+1.64e+00

Pos logit contributions
ind+0.028
 beings+0.026
asured+0.025
pered+0.024
 yourselves+0.024
Neg logit contributions
ves-0.018
 replied-0.018
gh-0.016
 story-0.015
 poem-0.015
Token knit
Token ID30495
Feature activation+3.31e+00

Pos logit contributions
ves+0.041
 replied+0.040
gh+0.036
 story+0.034
 poem+0.034
Neg logit contributions
ind-0.065
 beings-0.059
asured-0.057
 yourselves-0.056
pered-0.055
Token hats
Token ID23910
Feature activation+3.83e+00

Pos logit contributions
 replied+0.097
ves+0.095
gh+0.089
 poem+0.085
 story+0.085
Neg logit contributions
ind-0.116
 beings-0.106
 yourselves-0.098
asured-0.098
pered-0.095
Token and
Token ID290
Feature activation+1.16e+00

Pos logit contributions
 replied+0.112
ves+0.112
gh+0.102
 story+0.099
 poem+0.098
Neg logit contributions
ind-0.137
 beings-0.124
asured-0.118
pered-0.114
attered-0.114
Token scar
Token ID10153
Feature activation+8.04e+00

Pos logit contributions
ves+0.036
gh+0.035
 replied+0.035
 story+0.032
enda+0.031
Neg logit contributions
ind-0.038
 beings-0.035
asured-0.033
 yourselves-0.032
pered-0.031
Tokenves
Token ID1158
Feature activation+3.67e+00

Pos logit contributions
 replied+0.083
 poem+0.065
 Kelly+0.060
 story+0.051
oph+0.048
Neg logit contributions
ind-0.420
 beings-0.397
 yourselves-0.389
pered-0.377
ubby-0.368
Token for
Token ID329
Feature activation+1.67e+00

Pos logit contributions
ves+0.104
 replied+0.103
gh+0.093
 story+0.090
 poem+0.089
Neg logit contributions
ind-0.136
 beings-0.123
asured-0.118
attered-0.113
pered-0.113
Token my
Token ID616
Feature activation-1.95e+00

Pos logit contributions
 replied+0.048
ves+0.048
gh+0.044
 story+0.042
 poem+0.041
Neg logit contributions
ind-0.060
 beings-0.055
asured-0.051
 yourselves-0.051
pered-0.050
Token friends
Token ID2460
Feature activation-4.65e-01

Pos logit contributions
ind+0.066
 beings+0.059
asured+0.056
pered+0.054
attered+0.054
Neg logit contributions
ves-0.062
 replied-0.061
gh-0.057
 story-0.055
 poem-0.055
Token."
Token ID526
Feature activation+1.45e+00

Pos logit contributions
ind+0.017
 beings+0.016
asured+0.015
pered+0.015
attered+0.015
Neg logit contributions
 replied-0.013
ves-0.013
gh-0.012
 story-0.011
 poem-0.011
Token and
Token ID290
Feature activation+1.85e-01

Pos logit contributions
ves+0.093
 replied+0.091
gh+0.083
 poem+0.080
 story+0.080
Neg logit contributions
ind-0.121
 beings-0.110
asured-0.106
attered-0.102
pered-0.101
Token tug
Token ID27762
Feature activation+2.54e+00

Pos logit contributions
ves+0.005
 replied+0.005
gh+0.004
 story+0.004
 poem+0.004
Neg logit contributions
ind-0.007
 beings-0.006
asured-0.006
 alone-0.006
pered-0.006
Token-
Token ID12
Feature activation-9.74e-01

Pos logit contributions
ves+0.078
 replied+0.073
gh+0.071
 happen+0.065
 poem+0.064
Neg logit contributions
ind-0.088
asured-0.079
 beings-0.078
attered-0.074
uzz-0.074
Tokenof
Token ID1659
Feature activation+3.23e+00

Pos logit contributions
ind+0.039
 beings+0.035
attered+0.034
asured+0.033
uzz+0.033
Neg logit contributions
 replied-0.025
ves-0.022
 story-0.022
 poem-0.021
gh-0.021
Token-
Token ID12
Feature activation-8.96e-01

Pos logit contributions
ves+0.071
 replied+0.069
gh+0.063
 story+0.058
 poem+0.058
Neg logit contributions
ind-0.140
 beings-0.130
asured-0.125
pered-0.121
attered-0.120
Tokenwar
Token ID5767
Feature activation+8.03e+00

Pos logit contributions
ind+0.037
 beings+0.033
attered+0.032
asured+0.031
uzz+0.031
Neg logit contributions
 replied-0.022
ves-0.020
 story-0.019
 poem-0.018
gh-0.018
Token.
Token ID13
Feature activation-2.01e-01

Pos logit contributions
 replied+0.221
ves+0.218
gh+0.200
 poem+0.192
 story+0.192
Neg logit contributions
ind-0.299
 beings-0.274
asured-0.259
pered-0.254
attered-0.251
Token The
Token ID383
Feature activation+2.13e+00

Pos logit contributions
ind+0.007
 beings+0.006
asured+0.006
attered+0.005
pered+0.005
Neg logit contributions
ves-0.007
 replied-0.007
gh-0.006
 story-0.006
 -0.006
Token fox
Token ID21831
Feature activation+9.09e-01

Pos logit contributions
 replied+0.061
ves+0.059
 poem+0.056
 story+0.056
gh+0.051
Neg logit contributions
ind-0.078
asured-0.073
 beings-0.071
attered-0.071
uzz-0.068
Token was
Token ID373
Feature activation-4.10e-01

Pos logit contributions
ves+0.020
 replied+0.020
gh+0.017
 poem+0.016
 story+0.016
Neg logit contributions
ind-0.041
 beings-0.037
asured-0.035
 yourselves-0.035
attered-0.035
Token happy
Token ID3772
Feature activation-7.08e-01

Pos logit contributions
ind+0.015
 beings+0.014
asured+0.013
attered+0.013
 yourselves+0.012
Neg logit contributions
ves-0.012
 replied-0.011
gh-0.011
 poem-0.010
 story-0.010
Token.
Token ID13
Feature activation-4.28e-01

Pos logit contributions
ves+0.027
 replied+0.027
gh+0.024
 story+0.023
 poem+0.023
Neg logit contributions
ind-0.041
 beings-0.038
asured-0.036
pered-0.035
 yourselves-0.035
Token They
Token ID1119
Feature activation+3.14e-03

Pos logit contributions
ind+0.016
 beings+0.014
asured+0.013
attered+0.012
ishing+0.012
Neg logit contributions
ves-0.013
 replied-0.013
 -0.012
gh-0.012
 story-0.011
Token find
Token ID1064
Feature activation+6.18e-01

Pos logit contributions
 replied+0.000
ves+0.000
gh+0.000
 story+0.000
 happen+0.000
Neg logit contributions
ind-0.000
 beings-0.000
asured-0.000
pered-0.000
attered-0.000
Token helmets
Token ID28359
Feature activation+8.98e-01

Pos logit contributions
ves+0.016
 replied+0.016
gh+0.015
 poem+0.014
 story+0.014
Neg logit contributions
ind-0.024
 beings-0.022
asured-0.021
pered-0.020
attered-0.020
Token and
Token ID290
Feature activation+5.79e-01

Pos logit contributions
ves+0.027
 replied+0.027
gh+0.025
 story+0.024
 poem+0.024
Neg logit contributions
ind-0.031
 beings-0.028
asured-0.027
pered-0.026
attered-0.026
Token scar
Token ID10153
Feature activation+7.89e+00

Pos logit contributions
ves+0.018
 replied+0.018
gh+0.017
 story+0.016
 poem+0.016
Neg logit contributions
ind-0.020
 beings-0.018
asured-0.017
pered-0.016
 yourselves-0.016
Tokenves
Token ID1158
Feature activation+3.52e+00

Pos logit contributions
 replied+0.088
 poem+0.069
 Kelly+0.062
oph+0.062
 Jennifer+0.049
Neg logit contributions
ind-0.407
 beings-0.383
pered-0.370
 yourselves-0.368
ishing-0.358
Token and
Token ID290
Feature activation+8.67e-01

Pos logit contributions
ves+0.105
 replied+0.104
gh+0.096
 story+0.092
 poem+0.092
Neg logit contributions
ind-0.123
 beings-0.112
asured-0.106
pered-0.103
attered-0.101
Token gloves
Token ID18051
Feature activation+4.93e+00

Pos logit contributions
ves+0.026
 replied+0.026
gh+0.024
 story+0.023
 poem+0.023
Neg logit contributions
ind-0.030
 beings-0.028
asured-0.026
pered-0.025
attered-0.025
Token to
Token ID284
Feature activation+1.39e+00

Pos logit contributions
ves+0.140
 replied+0.139
gh+0.127
 story+0.121
 poem+0.121
Neg logit contributions
ind-0.180
 beings-0.164
asured-0.156
pered-0.151
attered-0.150
Token wear
Token ID5806
Feature activation+1.30e+00

Pos logit contributions
 replied+0.033
ves+0.033
gh+0.030
 story+0.028
 poem+0.028
Neg logit contributions
ind-0.057
 beings-0.052
asured-0.050
pered-0.048
 yourselves-0.048
Token They
Token ID1119
Feature activation-1.16e-01

Pos logit contributions
ind+0.019
 beings+0.017
asured+0.016
pered+0.015
attered+0.015
Neg logit contributions
ves-0.023
 replied-0.023
gh-0.022
 story-0.021
 poem-0.021
Token take
Token ID1011
Feature activation+1.75e+00

Pos logit contributions
ind+0.005
 beings+0.005
asured+0.004
attered+0.004
pered+0.004
Neg logit contributions
ves-0.002
 replied-0.002
gh-0.002
 story-0.002
 poem-0.002
Token their
Token ID511
Feature activation-1.74e+00

Pos logit contributions
ves+0.046
 replied+0.045
gh+0.042
 story+0.039
 poem+0.039
Neg logit contributions
ind-0.068
 beings-0.062
asured-0.060
 yourselves-0.057
attered-0.057
Token hats
Token ID23910
Feature activation+3.61e+00

Pos logit contributions
ind+0.059
 beings+0.054
asured+0.050
pered+0.050
 yourselves+0.049
Neg logit contributions
 replied-0.054
ves-0.053
gh-0.049
 story-0.046
 poem-0.046
Token and
Token ID290
Feature activation+8.86e-01

Pos logit contributions
ves+0.111
 replied+0.110
gh+0.102
 story+0.098
 poem+0.097
Neg logit contributions
ind-0.124
 beings-0.112
asured-0.106
pered-0.102
attered-0.101
Token scar
Token ID10153
Feature activation+7.80e+00

Pos logit contributions
ves+0.028
 replied+0.028
gh+0.027
 story+0.025
 poem+0.025
Neg logit contributions
ind-0.029
 beings-0.026
asured-0.025
pered-0.024
 yourselves-0.024
Tokenves
Token ID1158
Feature activation+3.52e+00

Pos logit contributions
 replied+0.083
 poem+0.068
 Kelly+0.062
 Jenna+0.052
 story+0.052
Neg logit contributions
ind-0.408
pered-0.381
 beings-0.380
 yourselves-0.369
ubby-0.364
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
ves+0.097
 replied+0.096
gh+0.088
 story+0.083
 poem+0.083
Neg logit contributions
ind-0.133
 beings-0.121
asured-0.115
 yourselves-0.110
attered-0.110
Token throw
Token ID3714
Feature activation+2.68e+00

Pos logit contributions
ves+0.030
 replied+0.030
gh+0.028
 story+0.027
 poem+0.026
Neg logit contributions
ind-0.039
 beings-0.037
asured-0.035
pered-0.034
 yourselves-0.034
Token them
Token ID606
Feature activation-2.93e-01

Pos logit contributions
ves+0.067
 replied+0.065
gh+0.060
 +0.057
 story+0.055
Neg logit contributions
ind-0.109
 beings-0.099
asured-0.098
attered-0.092
pered-0.091
Token in
Token ID287
Feature activation+4.12e-01

Pos logit contributions
ind+0.011
 beings+0.009
asured+0.009
years+0.009
 yourselves+0.009
Neg logit contributions
ves-0.009
 replied-0.008
gh-0.008
 happen-0.008
 -0.008
Token.
Token ID13
Feature activation-1.16e+00

Pos logit contributions
ves+0.029
 replied+0.028
gh+0.027
 happen+0.025
 +0.025
Neg logit contributions
ind-0.064
asured-0.059
 beings-0.059
lves-0.058
 prisoners-0.058
Token They
Token ID1119
Feature activation-2.85e-01

Pos logit contributions
ind+0.045
 beings+0.041
asured+0.039
attered+0.038
pered+0.037
Neg logit contributions
 replied-0.031
ves-0.031
gh-0.028
 story-0.027
 poem-0.027
Token wear
Token ID5806
Feature activation+1.12e+00

Pos logit contributions
ind+0.013
 beings+0.012
asured+0.011
ishing+0.011
attered+0.011
Neg logit contributions
 replied-0.006
ves-0.006
 happen-0.005
 story-0.005
 -0.005
Token hats
Token ID23910
Feature activation+3.90e+00

Pos logit contributions
ves+0.031
 replied+0.030
gh+0.028
 poem+0.026
 story+0.026
Neg logit contributions
ind-0.043
 beings-0.039
asured-0.037
pered-0.036
attered-0.035
Token and
Token ID290
Feature activation+9.11e-01

Pos logit contributions
 replied+0.110
ves+0.109
gh+0.099
 story+0.094
 poem+0.093
Neg logit contributions
ind-0.146
 beings-0.133
asured-0.126
 yourselves-0.122
pered-0.121
Token scar
Token ID10153
Feature activation+7.79e+00

Pos logit contributions
ves+0.028
 replied+0.027
gh+0.026
 story+0.024
 poem+0.024
Neg logit contributions
ind-0.031
 beings-0.029
asured-0.027
pered-0.026
 yourselves-0.026
Tokenves
Token ID1158
Feature activation+3.58e+00

Pos logit contributions
 replied+0.074
 poem+0.059
 Kelly+0.049
 story+0.047
oph+0.040
Neg logit contributions
ind-0.420
 beings-0.392
pered-0.385
uzz-0.375
 yourselves-0.374
Token and
Token ID290
Feature activation+1.19e+00

Pos logit contributions
ves+0.101
 replied+0.100
gh+0.091
 story+0.087
 poem+0.087
Neg logit contributions
ind-0.133
 beings-0.121
asured-0.115
 yourselves-0.111
pered-0.110
Token gloves
Token ID18051
Feature activation+4.91e+00

Pos logit contributions
ves+0.033
 replied+0.033
gh+0.030
 story+0.028
 poem+0.028
Neg logit contributions
ind-0.045
 beings-0.041
asured-0.039
pered-0.038
attered-0.037
Token.
Token ID13
Feature activation-4.18e-01

Pos logit contributions
ves+0.132
 replied+0.131
gh+0.119
 story+0.113
 poem+0.112
Neg logit contributions
ind-0.187
 beings-0.172
asured-0.163
pered-0.158
 yourselves-0.157
Token They
Token ID1119
Feature activation-3.67e-02

Pos logit contributions
ind+0.014
 beings+0.013
asured+0.012
ishing+0.012
attered+0.011
Neg logit contributions
 replied-0.013
ves-0.013
 -0.012
gh-0.012
 story-0.012
Token coats
Token ID30720
Feature activation+2.10e+00

Pos logit contributions
 replied+0.014
ves+0.014
gh+0.014
 story+0.013
 poem+0.013
Neg logit contributions
ind-0.016
 beings-0.015
asured-0.014
 yourselves-0.013
pered-0.013
Token,
Token ID11
Feature activation-2.84e-02

Pos logit contributions
ves+0.049
 replied+0.049
gh+0.044
 poem+0.041
 story+0.041
Neg logit contributions
ind-0.087
 beings-0.080
asured-0.077
pered-0.075
attered-0.074
Token shoes
Token ID10012
Feature activation+3.02e+00

Pos logit contributions
ind+0.001
 beings+0.001
asured+0.001
pered+0.001
 yourselves+0.001
Neg logit contributions
 replied-0.001
ves-0.001
gh-0.001
 story-0.001
 poem-0.001
Token,
Token ID11
Feature activation+1.81e-01

Pos logit contributions
ves+0.073
 replied+0.072
gh+0.066
 poem+0.063
 story+0.062
Neg logit contributions
ind-0.122
 beings-0.112
asured-0.107
pered-0.104
attered-0.103
Token and
Token ID290
Feature activation+8.75e-01

Pos logit contributions
ves+0.005
 replied+0.005
gh+0.005
 story+0.005
 poem+0.005
Neg logit contributions
ind-0.006
 beings-0.006
asured-0.005
pered-0.005
 yourselves-0.005
Token scar
Token ID10153
Feature activation+7.76e+00

Pos logit contributions
ves+0.025
 replied+0.024
gh+0.023
 story+0.021
 poem+0.021
Neg logit contributions
ind-0.032
 beings-0.029
asured-0.028
pered-0.027
 yourselves-0.027
Tokenves
Token ID1158
Feature activation+3.53e+00

Pos logit contributions
 replied+0.077
 poem+0.060
 Kelly+0.046
 story+0.045
oph+0.044
Neg logit contributions
ind-0.415
 beings-0.385
pered-0.376
 yourselves-0.373
uzz-0.366
Token.
Token ID13
Feature activation-2.44e-01

Pos logit contributions
ves+0.091
 replied+0.090
gh+0.082
 story+0.077
 poem+0.077
Neg logit contributions
ind-0.138
 beings-0.127
asured-0.121
pered-0.117
attered-0.117
Token
Token ID198
Feature activation+1.70e+00

Pos logit contributions
ind+0.009
 beings+0.008
asured+0.008
pered+0.007
 yourselves+0.007
Neg logit contributions
ves-0.007
 replied-0.007
gh-0.007
 story-0.006
 poem-0.006
Token
Token ID198
Feature activation+1.36e+00

Pos logit contributions
 replied+0.043
 poem+0.041
gh+0.039
 happen+0.039
ves+0.038
Neg logit contributions
ind-0.065
uzz-0.057
 beings-0.057
 yourselves-0.056
asured-0.054
TokenTom
Token ID13787
Feature activation+2.11e+00

Pos logit contributions
ves+0.032
 replied+0.028
gh+0.025
 +0.024
 exclaimed+0.022
Neg logit contributions
ind-0.059
 beings-0.057
asured-0.055
pered-0.053
ishing-0.053
Token They
Token ID1119
Feature activation-2.26e-01

Pos logit contributions
ind+0.025
 beings+0.022
asured+0.021
attered+0.020
pered+0.020
Neg logit contributions
ves-0.028
 replied-0.028
gh-0.025
 story-0.025
 -0.024
Token put
Token ID1234
Feature activation+1.18e+00

Pos logit contributions
ind+0.010
 beings+0.009
asured+0.009
pered+0.009
attered+0.009
Neg logit contributions
ves-0.005
 replied-0.005
gh-0.004
 story-0.004
 poem-0.004
Token on
Token ID319
Feature activation+3.85e+00

Pos logit contributions
ves+0.031
 replied+0.030
gh+0.027
 story+0.026
 +0.026
Neg logit contributions
ind-0.046
 beings-0.042
asured-0.041
attered-0.039
pered-0.039
Token hats
Token ID23910
Feature activation+3.68e+00

Pos logit contributions
ves+0.113
 replied+0.111
gh+0.103
 story+0.098
 poem+0.098
Neg logit contributions
ind-0.137
 beings-0.126
asured-0.120
pered-0.115
attered-0.115
Token and
Token ID290
Feature activation+5.87e-01

Pos logit contributions
ves+0.101
 replied+0.101
gh+0.091
 story+0.087
 poem+0.086
Neg logit contributions
ind-0.140
 beings-0.128
asured-0.122
pered-0.117
attered-0.117
Token scar
Token ID10153
Feature activation+7.66e+00

Pos logit contributions
ves+0.018
 replied+0.018
gh+0.018
 story+0.016
enda+0.016
Neg logit contributions
ind-0.019
 beings-0.017
pered-0.016
asured-0.016
 yourselves-0.016
Tokenves
Token ID1158
Feature activation+3.38e+00

Pos logit contributions
 replied+0.087
 poem+0.074
 Kelly+0.064
oph+0.062
 story+0.055
Neg logit contributions
ind-0.392
 beings-0.363
pered-0.360
 yourselves-0.348
ubby-0.345
Token.
Token ID13
Feature activation-6.01e-01

Pos logit contributions
ves+0.093
 replied+0.093
gh+0.085
 story+0.081
 poem+0.081
Neg logit contributions
ind-0.125
 beings-0.115
asured-0.108
pered-0.106
attered-0.104
Token They
Token ID1119
Feature activation-1.35e-01

Pos logit contributions
ind+0.019
 beings+0.017
asured+0.016
attered+0.015
pered+0.015
Neg logit contributions
ves-0.021
 replied-0.021
gh-0.019
 story-0.019
 -0.019
Token look
Token ID804
Feature activation+2.84e+00

Pos logit contributions
ind+0.006
 beings+0.005
asured+0.005
pered+0.005
 yourselves+0.005
Neg logit contributions
ves-0.003
 replied-0.003
gh-0.003
 poem-0.002
 story-0.002
Token like
Token ID588
Feature activation-1.83e+00

Pos logit contributions
 replied+0.106
ves+0.104
gh+0.097
 story+0.094
 poem+0.094
Neg logit contributions
ind-0.080
 beings-0.070
asured-0.066
 yourselves-0.063
pered-0.063
Token.
Token ID13
Feature activation-1.53e+00

Pos logit contributions
ind+0.022
 beings+0.021
asured+0.019
pered+0.019
 yourselves+0.019
Neg logit contributions
ves-0.014
 replied-0.014
gh-0.013
 story-0.012
 poem-0.012
Token She
Token ID1375
Feature activation-3.33e+00

Pos logit contributions
ind+0.064
 beings+0.061
asured+0.056
ishing+0.055
 prisoners+0.055
Neg logit contributions
 -0.040
 replied-0.039
ves-0.036
 story-0.032
 poem-0.030
Token makes
Token ID1838
Feature activation+3.40e+00

Pos logit contributions
ind+0.117
 beings+0.107
pered+0.100
asured+0.099
attered+0.099
Neg logit contributions
 replied-0.102
ves-0.102
 poem-0.089
 story-0.088
 -0.087
Token hats
Token ID23910
Feature activation+3.65e+00

Pos logit contributions
 replied+0.098
ves+0.094
gh+0.092
 story+0.086
 happen+0.082
Neg logit contributions
ind-0.120
 beings-0.111
asured-0.103
 yourselves-0.102
pered-0.101
Token and
Token ID290
Feature activation+9.15e-01

Pos logit contributions
ves+0.106
 replied+0.105
gh+0.098
 poem+0.092
 story+0.092
Neg logit contributions
ind-0.130
 beings-0.119
asured-0.112
pered-0.109
attered-0.108
Token scar
Token ID10153
Feature activation+7.65e+00

Pos logit contributions
ves+0.030
gh+0.030
 replied+0.029
 story+0.027
 happen+0.026
Neg logit contributions
ind-0.028
 beings-0.026
 yourselves-0.024
asured-0.024
ishes-0.024
Tokenves
Token ID1158
Feature activation+3.27e+00

Pos logit contributions
 replied+0.096
 poem+0.078
 Kelly+0.075
iden+0.072
oph+0.072
Neg logit contributions
ind-0.370
 beings-0.347
pered-0.345
 yourselves-0.339
ins-0.333
Token and
Token ID290
Feature activation+9.70e-01

Pos logit contributions
 replied+0.103
ves+0.103
gh+0.097
 poem+0.093
 story+0.092
Neg logit contributions
ind-0.104
 beings-0.096
pered-0.090
asured-0.089
uzz-0.086
Token dolls
Token ID36062
Feature activation+3.29e-01

Pos logit contributions
ves+0.030
gh+0.029
 replied+0.029
 story+0.026
enda+0.025
Neg logit contributions
ind-0.032
 beings-0.030
asured-0.027
pered-0.027
 yourselves-0.027
Token with
Token ID351
Feature activation+3.23e-01

Pos logit contributions
ves+0.009
 replied+0.009
gh+0.008
 story+0.008
 poem+0.008
Neg logit contributions
ind-0.013
 beings-0.011
asured-0.011
 yourselves-0.011
pered-0.010
Token yarn
Token ID21181
Feature activation+3.35e+00

Pos logit contributions
 replied+0.010
ves+0.010
gh+0.009
 story+0.009
 poem+0.009
Neg logit contributions
ind-0.011
 beings-0.010
asured-0.009
pered-0.009
 yourselves-0.009
Token their
Token ID511
Feature activation-2.24e+00

Pos logit contributions
 replied+0.127
ves+0.124
gh+0.117
 happen+0.110
 story+0.110
Neg logit contributions
ind-0.123
 beings-0.109
asured-0.099
 yourselves-0.098
pered-0.096
Token mom
Token ID1995
Feature activation-1.34e+00

Pos logit contributions
ind+0.072
 beings+0.065
asured+0.058
 yourselves+0.058
pered+0.057
Neg logit contributions
 replied-0.072
ves-0.070
gh-0.066
enda-0.062
 -0.062
Token's
Token ID338
Feature activation-1.34e+00

Pos logit contributions
ind+0.062
 beings+0.056
asured+0.054
pered+0.053
attered+0.052
Neg logit contributions
ves-0.026
 replied-0.025
gh-0.023
 happen-0.021
 poem-0.020
Token hats
Token ID23910
Feature activation+3.16e+00

Pos logit contributions
ind+0.048
 beings+0.044
asured+0.042
pered+0.040
attered+0.040
Neg logit contributions
ves-0.039
 replied-0.039
gh-0.035
 story-0.034
 poem-0.034
Token and
Token ID290
Feature activation+1.18e+00

Pos logit contributions
ves+0.097
 replied+0.097
gh+0.089
 story+0.085
 poem+0.085
Neg logit contributions
ind-0.108
 beings-0.098
asured-0.093
pered-0.089
 yourselves-0.089
Token scar
Token ID10153
Feature activation+7.65e+00

Pos logit contributions
ves+0.037
 replied+0.035
gh+0.035
 story+0.032
 happen+0.031
Neg logit contributions
ind-0.039
 beings-0.036
asured-0.033
 yourselves-0.033
pered-0.033
Tokenves
Token ID1158
Feature activation+3.24e+00

Pos logit contributions
 replied+0.083
 poem+0.067
 Kelly+0.062
oph+0.061
 Emily+0.053
Neg logit contributions
ind-0.398
 beings-0.362
pered-0.359
 yourselves-0.351
per-0.349
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
ves+0.098
 replied+0.097
gh+0.091
 story+0.086
 poem+0.086
Neg logit contributions
ind-0.110
 beings-0.101
asured-0.095
pered-0.093
attered-0.091
Token They
Token ID1119
Feature activation-4.82e-02

Pos logit contributions
ind+0.012
 beings+0.011
asured+0.011
ishing+0.010
 prisoners+0.010
Neg logit contributions
 replied-0.012
ves-0.012
 -0.011
 poem-0.011
 story-0.010
Token stand
Token ID1302
Feature activation+6.88e-01

Pos logit contributions
ind+0.002
 beings+0.002
asured+0.002
pered+0.002
attered+0.002
Neg logit contributions
ves-0.001
 replied-0.001
gh-0.001
 story-0.001
 poem-0.001
Token on
Token ID319
Feature activation+3.42e+00

Pos logit contributions
ves+0.017
 replied+0.016
gh+0.014
 story+0.014
 poem+0.014
Neg logit contributions
ind-0.027
 beings-0.025
attered-0.025
asured-0.024
lves-0.024
Token were
Token ID547
Feature activation-6.06e-01

Pos logit contributions
ves+0.029
 replied+0.029
gh+0.027
 poem+0.026
enda+0.025
Neg logit contributions
ind-0.037
 beings-0.035
asured-0.032
 alone-0.032
pered-0.032
Token soft
Token ID2705
Feature activation+1.30e+00

Pos logit contributions
ind+0.023
 beings+0.021
asured+0.021
ishing+0.020
attered+0.020
Neg logit contributions
ves-0.017
 replied-0.015
gh-0.015
 story-0.014
 poem-0.014
Token things
Token ID1243
Feature activation+8.69e-02

Pos logit contributions
ves+0.030
 replied+0.029
 story+0.025
 happen+0.025
gh+0.025
Neg logit contributions
ind-0.054
asured-0.052
 beings-0.049
 slaves-0.049
years-0.048
Token,
Token ID11
Feature activation+6.22e-02

Pos logit contributions
ves+0.002
 replied+0.002
 happen+0.002
gh+0.002
 story+0.002
Neg logit contributions
ind-0.004
 beings-0.003
asured-0.003
 yourselves-0.003
attered-0.003
Token like
Token ID588
Feature activation-1.42e+00

Pos logit contributions
ves+0.002
 replied+0.002
 story+0.002
 happen+0.002
 poem+0.002
Neg logit contributions
ind-0.002
 beings-0.002
asured-0.002
attered-0.002
ishing-0.002
Token scar
Token ID10153
Feature activation+7.65e+00

Pos logit contributions
ind+0.054
 beings+0.049
asured+0.048
attered+0.046
ishing+0.045
Neg logit contributions
ves-0.040
 replied-0.038
gh-0.035
 story-0.034
 poem-0.034
Tokenves
Token ID1158
Feature activation+3.18e+00

Pos logit contributions
 replied+0.074
 poem+0.053
 Kelly+0.044
enda+0.041
 +0.038
Neg logit contributions
ind-0.409
 beings-0.383
 yourselves-0.370
pered-0.357
 alone-0.353
Token and
Token ID290
Feature activation+1.63e+00

Pos logit contributions
ves+0.074
 replied+0.073
gh+0.064
 happen+0.061
 story+0.061
Neg logit contributions
ind-0.136
 beings-0.124
asured-0.120
attered-0.115
 yourselves-0.115
Token hats
Token ID23910
Feature activation+4.96e+00

Pos logit contributions
ves+0.049
 replied+0.049
gh+0.044
 poem+0.044
 story+0.043
Neg logit contributions
ind-0.058
 beings-0.052
asured-0.050
attered-0.048
pered-0.048
Token.
Token ID13
Feature activation-2.44e-01

Pos logit contributions
 replied+0.118
ves+0.118
gh+0.105
 story+0.099
 happen+0.099
Neg logit contributions
ind-0.208
 beings-0.190
asured-0.183
 yourselves-0.176
pered-0.175
Token There
Token ID1318
Feature activation+1.24e+00

Pos logit contributions
ind+0.010
asured+0.009
ishing+0.009
 beings+0.009
isher+0.009
Neg logit contributions
 -0.007
ves-0.006
 replied-0.006
 story-0.005
 poem-0.005
Token They
Token ID1119
Feature activation-1.84e-01

Pos logit contributions
ind+0.008
 beings+0.007
asured+0.006
pered+0.006
attered+0.006
Neg logit contributions
 replied-0.009
ves-0.009
gh-0.008
 story-0.008
 -0.008
Token put
Token ID1234
Feature activation+1.03e+00

Pos logit contributions
ind+0.008
 beings+0.007
 yourselves+0.007
asured+0.007
pered+0.007
Neg logit contributions
 replied-0.005
ves-0.004
gh-0.004
 story-0.004
 -0.004
Token on
Token ID319
Feature activation+3.78e+00

Pos logit contributions
ves+0.025
 replied+0.025
gh+0.022
 story+0.021
 +0.021
Neg logit contributions
ind-0.042
 beings-0.038
asured-0.038
attered-0.036
pered-0.036
Token hats
Token ID23910
Feature activation+3.65e+00

Pos logit contributions
ves+0.112
 replied+0.111
gh+0.103
 story+0.098
 poem+0.098
Neg logit contributions
ind-0.133
 beings-0.122
asured-0.117
pered-0.112
attered-0.111
Token and
Token ID290
Feature activation+7.73e-01

Pos logit contributions
 replied+0.099
ves+0.099
gh+0.089
 story+0.085
 poem+0.084
Neg logit contributions
ind-0.140
 beings-0.128
asured-0.122
 yourselves-0.117
attered-0.117
Token scar
Token ID10153
Feature activation+7.65e+00

Pos logit contributions
ves+0.024
 replied+0.024
gh+0.023
 story+0.022
enda+0.022
Neg logit contributions
ind-0.024
 beings-0.022
asured-0.021
pered-0.020
 prisoners-0.020
Tokenves
Token ID1158
Feature activation+3.38e+00

Pos logit contributions
 replied+0.085
 poem+0.077
 Kelly+0.062
oph+0.060
 story+0.058
Neg logit contributions
ind-0.384
pered-0.362
 beings-0.359
uzz-0.345
per-0.344
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
ves+0.097
 replied+0.096
gh+0.089
 story+0.084
 poem+0.084
Neg logit contributions
ind-0.122
 beings-0.112
asured-0.106
pered-0.103
attered-0.102
Token pretended
Token ID36766
Feature activation+3.43e-01

Pos logit contributions
ves+0.032
 replied+0.032
gh+0.031
 story+0.029
 poem+0.028
Neg logit contributions
ind-0.036
 beings-0.034
asured-0.032
pered-0.031
 prisoners-0.031
Token to
Token ID284
Feature activation+2.13e+00

Pos logit contributions
 replied+0.008
ves+0.007
gh+0.007
 poem+0.007
enda+0.007
Neg logit contributions
ind-0.013
 beings-0.013
asured-0.012
 yourselves-0.012
 alone-0.012
Token have
Token ID423
Feature activation+2.12e+00

Pos logit contributions
ves+0.053
 replied+0.053
gh+0.048
 poem+0.046
 story+0.044
Neg logit contributions
ind-0.083
 beings-0.075
asured-0.074
 yourselves-0.071
pered-0.070
Token and
Token ID290
Feature activation-2.89e-01

Pos logit contributions
ind+0.058
 beings+0.052
asured+0.049
pered+0.048
attered+0.047
Neg logit contributions
ves-0.055
 replied-0.054
gh-0.050
 poem-0.048
 story-0.048
Token blew
Token ID17948
Feature activation+2.84e+00

Pos logit contributions
ind+0.010
 beings+0.010
asured+0.009
 yourselves+0.009
 alone+0.009
Neg logit contributions
ves-0.008
gh-0.008
 replied-0.007
 story-0.007
 poem-0.007
Token their
Token ID511
Feature activation-1.91e+00

Pos logit contributions
ves+0.076
 replied+0.076
gh+0.069
 story+0.066
 poem+0.065
Neg logit contributions
ind-0.109
 beings-0.100
asured-0.095
pered-0.091
 yourselves-0.091
Token hats
Token ID23910
Feature activation+3.40e+00

Pos logit contributions
ind+0.070
 beings+0.064
asured+0.060
pered+0.058
attered+0.058
Neg logit contributions
ves-0.055
 replied-0.055
gh-0.050
 story-0.048
 poem-0.048
Token and
Token ID290
Feature activation+7.92e-01

Pos logit contributions
ves+0.106
 replied+0.106
gh+0.097
 poem+0.094
 story+0.094
Neg logit contributions
ind-0.114
 beings-0.103
asured-0.097
pered-0.095
attered-0.094
Token scar
Token ID10153
Feature activation+7.62e+00

Pos logit contributions
ves+0.023
 replied+0.022
gh+0.021
 story+0.020
 poem+0.020
Neg logit contributions
ind-0.028
 beings-0.026
asured-0.025
 yourselves-0.024
pered-0.024
Tokenves
Token ID1158
Feature activation+3.44e+00

Pos logit contributions
 replied+0.120
 poem+0.110
 story+0.097
 Kelly+0.096
 Chris+0.085
Neg logit contributions
ind-0.358
pered-0.332
 beings-0.327
 alone-0.318
attered-0.317
Token away
Token ID1497
Feature activation+3.26e-01

Pos logit contributions
ves+0.099
 replied+0.098
gh+0.090
 story+0.086
 poem+0.085
Neg logit contributions
ind-0.125
 beings-0.114
asured-0.108
pered-0.104
 yourselves-0.103
Token.
Token ID13
Feature activation-6.02e-01

Pos logit contributions
ves+0.007
 replied+0.007
gh+0.006
 happen+0.006
 story+0.006
Neg logit contributions
ind-0.014
 beings-0.013
asured-0.013
years-0.012
pered-0.012
Token They
Token ID1119
Feature activation-2.05e-01

Pos logit contributions
ind+0.022
 beings+0.020
asured+0.019
pered+0.018
attered+0.018
Neg logit contributions
ves-0.017
 replied-0.017
gh-0.016
 story-0.015
 poem-0.015
Token ran
Token ID4966
Feature activation-1.56e+00

Pos logit contributions
ind+0.008
 beings+0.008
asured+0.007
pered+0.007
attered+0.007
Neg logit contributions
ves-0.005
 replied-0.005
gh-0.005
 story-0.004
 poem-0.004
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
ind+0.024
 beings+0.022
asured+0.021
attered+0.021
pered+0.021
Neg logit contributions
ves-0.013
 replied-0.013
gh-0.012
 happen-0.011
 story-0.011
Token They
Token ID1119
Feature activation-7.49e-01

Pos logit contributions
ind+0.055
 beings+0.050
asured+0.047
pered+0.046
attered+0.045
Neg logit contributions
ves-0.042
 replied-0.042
gh-0.038
 story-0.037
 poem-0.037
Token find
Token ID1064
Feature activation+3.26e-01

Pos logit contributions
ind+0.033
 beings+0.030
attered+0.029
asured+0.028
 yourselves+0.028
Neg logit contributions
 replied-0.018
ves-0.016
 -0.015
 story-0.015
 happen-0.014
Token hats
Token ID23910
Feature activation+3.60e+00

Pos logit contributions
ves+0.008
 replied+0.008
gh+0.007
 story+0.007
 poem+0.007
Neg logit contributions
ind-0.013
 beings-0.012
asured-0.011
pered-0.011
attered-0.011
Token and
Token ID290
Feature activation+5.23e-01

Pos logit contributions
 replied+0.093
ves+0.093
gh+0.083
 story+0.080
 poem+0.079
Neg logit contributions
ind-0.142
 beings-0.130
asured-0.124
pered-0.120
 yourselves-0.119
Token scar
Token ID10153
Feature activation+7.60e+00

Pos logit contributions
ves+0.016
 replied+0.016
gh+0.016
 story+0.014
 happen+0.014
Neg logit contributions
ind-0.017
 beings-0.016
asured-0.015
pered-0.014
 prisoners-0.014
Tokenves
Token ID1158
Feature activation+3.61e+00

Pos logit contributions
 replied+0.081
 poem+0.071
 Kelly+0.061
 story+0.051
oph+0.050
Neg logit contributions
ind-0.393
 beings-0.368
pered-0.359
 yourselves-0.352
uzz-0.350
Token and
Token ID290
Feature activation+9.88e-01

Pos logit contributions
ves+0.107
 replied+0.106
gh+0.098
 story+0.093
 poem+0.093
Neg logit contributions
ind-0.127
 beings-0.116
asured-0.110
pered-0.106
attered-0.105
Token glasses
Token ID15232
Feature activation+5.90e+00

Pos logit contributions
ves+0.028
 replied+0.028
gh+0.027
 story+0.025
 poem+0.024
Neg logit contributions
ind-0.035
 beings-0.032
asured-0.031
pered-0.030
 yourselves-0.029
Token.
Token ID13
Feature activation-6.47e-01

Pos logit contributions
ves+0.152
 replied+0.151
gh+0.136
 story+0.130
 poem+0.128
Neg logit contributions
ind-0.234
 beings-0.214
asured-0.204
pered-0.197
attered-0.196
Token They
Token ID1119
Feature activation-2.41e-01

Pos logit contributions
ind+0.023
 beings+0.020
asured+0.020
ishing+0.018
attered+0.018
Neg logit contributions
 replied-0.020
ves-0.020
 -0.019
 story-0.018
gh-0.018
Token makes
Token ID1838
Feature activation+3.73e+00

Pos logit contributions
ind+0.071
 beings+0.065
attered+0.065
pered+0.064
 yourselves+0.062
Neg logit contributions
 replied-0.046
ves-0.041
 story-0.039
 poem-0.037
 said-0.036
Token gloves
Token ID18051
Feature activation+4.10e+00

Pos logit contributions
ves+0.099
 replied+0.098
gh+0.088
 story+0.087
 poem+0.086
Neg logit contributions
ind-0.143
 beings-0.132
asured-0.126
attered-0.123
pered-0.122
Token,
Token ID11
Feature activation+7.12e-01

Pos logit contributions
ves+0.120
 replied+0.119
gh+0.110
 story+0.105
 poem+0.104
Neg logit contributions
ind-0.146
 beings-0.133
asured-0.126
pered-0.122
attered-0.121
Token hats
Token ID23910
Feature activation+4.33e+00

Pos logit contributions
 replied+0.019
ves+0.019
 poem+0.017
 story+0.017
gh+0.016
Neg logit contributions
ind-0.028
 beings-0.025
asured-0.024
attered-0.024
pered-0.023
Token,
Token ID11
Feature activation+6.26e-01

Pos logit contributions
 replied+0.092
ves+0.092
gh+0.080
 story+0.076
 happen+0.074
Neg logit contributions
ind-0.193
 beings-0.178
asured-0.171
attered-0.164
 yourselves-0.164
Token scar
Token ID10153
Feature activation+7.59e+00

Pos logit contributions
ves+0.019
 replied+0.018
gh+0.017
 poem+0.016
 story+0.016
Neg logit contributions
ind-0.022
 beings-0.020
asured-0.019
attered-0.018
pered-0.018
Tokenves
Token ID1158
Feature activation+3.24e+00

Pos logit contributions
 replied+0.062
 poem+0.044
 Kelly+0.030
 story+0.027
oph+0.026
Neg logit contributions
ind-0.420
 beings-0.392
 yourselves-0.383
pered-0.381
ubby-0.371
Token,
Token ID11
Feature activation+4.79e-01

Pos logit contributions
ves+0.071
 replied+0.070
gh+0.061
 story+0.057
 happen+0.057
Neg logit contributions
ind-0.145
 beings-0.132
asured-0.128
attered-0.124
 yourselves-0.123
Token and
Token ID290
Feature activation+1.07e+00

Pos logit contributions
ves+0.014
 replied+0.014
gh+0.012
 poem+0.012
 story+0.012
Neg logit contributions
ind-0.018
 beings-0.016
asured-0.015
attered-0.015
pered-0.015
Token swe
Token ID3490
Feature activation+2.53e+00

Pos logit contributions
ves+0.030
 replied+0.029
gh+0.027
 story+0.026
 poem+0.025
Neg logit contributions
ind-0.040
 beings-0.036
asured-0.034
pered-0.033
 yourselves-0.033
Tokenaters
Token ID8605
Feature activation+1.17e+00

Pos logit contributions
 replied+0.090
gh+0.085
enda+0.083
ves+0.081
 +0.078
Neg logit contributions
 alone-0.065
 beings-0.059
ind-0.058
 prisoners-0.056
lves-0.056
Token saw
Token ID2497
Feature activation+3.89e-01

Pos logit contributions
ind+0.017
 beings+0.016
asured+0.015
pered+0.015
 yourselves+0.014
Neg logit contributions
 replied-0.010
ves-0.009
gh-0.008
 story-0.008
 poem-0.008
Token people
Token ID661
Feature activation-1.49e+00

Pos logit contributions
ves+0.009
 replied+0.009
gh+0.008
 story+0.008
 poem+0.008
Neg logit contributions
ind-0.016
 beings-0.015
asured-0.014
pered-0.014
attered-0.014
Token wearing
Token ID5762
Feature activation-6.07e-01

Pos logit contributions
ind+0.057
 beings+0.054
asured+0.051
 yourselves+0.050
attered+0.050
Neg logit contributions
 replied-0.039
ves-0.039
 story-0.035
 happen-0.035
 -0.034
Token hats
Token ID23910
Feature activation+3.24e+00

Pos logit contributions
ind+0.024
 beings+0.022
asured+0.022
attered+0.021
pered+0.021
Neg logit contributions
ves-0.015
 replied-0.015
gh-0.014
 story-0.013
 happen-0.013
Token and
Token ID290
Feature activation+9.63e-01

Pos logit contributions
ves+0.101
 replied+0.100
gh+0.092
 story+0.089
 poem+0.088
Neg logit contributions
ind-0.110
 beings-0.099
asured-0.094
pered-0.090
attered-0.090
Token scar
Token ID10153
Feature activation+7.58e+00

Pos logit contributions
ves+0.030
 replied+0.030
gh+0.028
 story+0.026
 poem+0.026
Neg logit contributions
ind-0.032
 beings-0.029
asured-0.027
pered-0.026
 yourselves-0.026
Tokenves
Token ID1158
Feature activation+3.58e+00

Pos logit contributions
 replied+0.073
 poem+0.059
 Kelly+0.047
 story+0.044
 Emily+0.039
Neg logit contributions
ind-0.413
 beings-0.378
pered-0.364
ishing-0.359
uzz-0.356
Token and
Token ID290
Feature activation+1.30e+00

Pos logit contributions
ves+0.103
 replied+0.102
gh+0.094
 story+0.089
 poem+0.089
Neg logit contributions
ind-0.130
 beings-0.118
asured-0.112
pered-0.108
 yourselves-0.108
Token gloves
Token ID18051
Feature activation+5.02e+00

Pos logit contributions
ves+0.038
 replied+0.038
gh+0.035
 story+0.033
 poem+0.033
Neg logit contributions
ind-0.046
 beings-0.042
asured-0.040
pered-0.038
 yourselves-0.038
Token.
Token ID13
Feature activation-2.73e-01

Pos logit contributions
ves+0.140
 replied+0.139
gh+0.127
 story+0.121
 poem+0.121
Neg logit contributions
ind-0.186
 beings-0.170
asured-0.161
pered-0.156
attered-0.155
Token They
Token ID1119
Feature activation-2.24e-02

Pos logit contributions
ind+0.010
 beings+0.009
asured+0.008
ishing+0.008
attered+0.008
Neg logit contributions
 replied-0.008
ves-0.008
 -0.008
 story-0.007
gh-0.007
Token play
Token ID711
Feature activation-2.96e-02

Pos logit contributions
ind+0.032
 beings+0.030
asured+0.029
pered+0.028
attered+0.028
Neg logit contributions
 replied-0.017
ves-0.017
gh-0.015
 story-0.014
 poem-0.014
Token tug
Token ID27762
Feature activation+2.36e+00

Pos logit contributions
ind+0.001
 beings+0.001
asured+0.001
pered+0.001
 yourselves+0.001
Neg logit contributions
 replied-0.001
ves-0.001
gh-0.001
 poem-0.001
 story-0.001
Token-
Token ID12
Feature activation-9.98e-01

Pos logit contributions
ves+0.062
 replied+0.061
gh+0.057
 story+0.053
 happen+0.052
Neg logit contributions
ind-0.090
 beings-0.082
asured-0.081
attered-0.078
pered-0.077
Tokenof
Token ID1659
Feature activation+3.03e+00

Pos logit contributions
ind+0.039
lves+0.033
 beings+0.033
aked+0.033
attered+0.033
Neg logit contributions
 replied-0.027
 story-0.025
 poem-0.023
ves-0.023
 happen-0.022
Token-
Token ID12
Feature activation-9.29e-01

Pos logit contributions
 replied+0.071
ves+0.068
gh+0.062
 story+0.059
 poem+0.058
Neg logit contributions
ind-0.127
 beings-0.116
asured-0.109
 yourselves-0.107
pered-0.107
Tokenwar
Token ID5767
Feature activation+7.57e+00

Pos logit contributions
ind+0.037
lves+0.031
 beings+0.031
aked+0.031
attered+0.031
Neg logit contributions
 replied-0.025
 story-0.022
 poem-0.021
ves-0.021
 happen-0.020
Token!"
Token ID2474
Feature activation+3.35e+00

Pos logit contributions
 replied+0.241
ves+0.239
gh+0.222
 poem+0.216
 story+0.215
Neg logit contributions
ind-0.246
 beings-0.226
asured-0.208
pered-0.206
attered-0.202
Token
Token ID198
Feature activation+1.81e+00

Pos logit contributions
gh+0.109
 happen+0.102
ves+0.100
 poem+0.098
 story+0.093
Neg logit contributions
 alone-0.096
pered-0.093
 beings-0.089
 yourselves-0.086
uzz-0.085
Token
Token ID198
Feature activation+1.53e+00

Pos logit contributions
 poem+0.063
 happen+0.060
gh+0.058
oph+0.057
 som+0.057
Neg logit contributions
ind-0.048
uzz-0.041
 alone-0.040
pered-0.040
ucks-0.039
TokenThe
Token ID464
Feature activation+1.92e+00

Pos logit contributions
 replied+0.042
ves+0.040
gh+0.039
 happen+0.038
 story+0.038
Neg logit contributions
ind-0.057
 beings-0.051
pered-0.048
asured-0.048
 yourselves-0.047
Token birds
Token ID10087
Feature activation+7.01e-01

Pos logit contributions
gh+0.076
 replied+0.074
ves+0.074
enda+0.070
 happen+0.067
Neg logit contributions
ind-0.046
 beings-0.042
ubby-0.036
 yourselves-0.036
pered-0.036
Token only
Token ID691
Feature activation+2.07e-01

Pos logit contributions
ind+0.014
 beings+0.013
asured+0.012
 yourselves+0.012
attered+0.012
Neg logit contributions
 replied-0.007
ves-0.007
 -0.006
gh-0.006
 story-0.005
Token found
Token ID1043
Feature activation+9.98e-01

Pos logit contributions
 replied+0.005
ves+0.005
gh+0.005
 poem+0.004
 story+0.004
Neg logit contributions
ind-0.008
 beings-0.008
asured-0.007
 yourselves-0.007
pered-0.007
Token their
Token ID511
Feature activation-2.47e+00

Pos logit contributions
ves+0.021
 replied+0.020
 +0.017
 happen+0.017
gh+0.017
Neg logit contributions
ind-0.046
asured-0.041
 beings-0.040
attered-0.039
ishing-0.039
Token hats
Token ID23910
Feature activation+3.02e+00

Pos logit contributions
ind+0.095
 beings+0.082
asured+0.082
attered+0.080
years+0.080
Neg logit contributions
 replied-0.065
 story-0.064
ves-0.063
 poem-0.061
gh-0.059
Token and
Token ID290
Feature activation+6.79e-01

Pos logit contributions
 replied+0.076
ves+0.075
gh+0.068
 story+0.064
 poem+0.063
Neg logit contributions
ind-0.121
 beings-0.111
asured-0.106
pered-0.102
 yourselves-0.102
Token scar
Token ID10153
Feature activation+7.49e+00

Pos logit contributions
ves+0.021
 replied+0.021
gh+0.020
 story+0.019
 poem+0.019
Neg logit contributions
ind-0.022
 beings-0.020
asured-0.019
pered-0.019
 yourselves-0.018
Tokenves
Token ID1158
Feature activation+3.26e+00

Pos logit contributions
 replied+0.076
 poem+0.062
 Kelly+0.048
 story+0.047
 +0.043
Neg logit contributions
ind-0.402
 beings-0.373
pered-0.361
 yourselves-0.355
asured-0.352
Token,
Token ID11
Feature activation+4.56e-01

Pos logit contributions
ves+0.081
 replied+0.080
gh+0.071
 happen+0.067
 story+0.066
Neg logit contributions
ind-0.135
 beings-0.123
asured-0.118
 yourselves-0.114
attered-0.113
Token torn
Token ID12445
Feature activation+1.63e+00

Pos logit contributions
ves+0.012
 replied+0.012
 poem+0.010
gh+0.010
 story+0.010
Neg logit contributions
ind-0.018
 beings-0.016
attered-0.015
asured-0.015
ubby-0.015
Token and
Token ID290
Feature activation-5.67e-01

Pos logit contributions
ves+0.045
 replied+0.043
gh+0.040
 happen+0.038
 poem+0.036
Neg logit contributions
ind-0.060
asured-0.056
 beings-0.055
years-0.054
attered-0.052
Token bloody
Token ID15222
Feature activation+2.25e+00

Pos logit contributions
ind+0.021
 beings+0.019
asured+0.018
pered+0.018
attered+0.017
Neg logit contributions
ves-0.016
 replied-0.016
gh-0.014
 story-0.014
 poem-0.014
Token.
Token ID13
Feature activation-3.25e+00

Pos logit contributions
ind+0.157
 beings+0.143
asured+0.138
 yourselves+0.133
attered+0.133
Neg logit contributions
 replied-0.100
ves-0.097
gh-0.089
 story-0.085
 poem-0.083
Token He
Token ID679
Feature activation-4.04e+00

Pos logit contributions
ind+0.124
 beings+0.113
asured+0.107
attered+0.103
ishing+0.102
Neg logit contributions
 replied-0.093
ves-0.087
 -0.082
gh-0.078
 story-0.078
Token lived
Token ID5615
Feature activation-6.31e+00

Pos logit contributions
ind+0.165
 beings+0.153
asured+0.145
pered+0.142
 yourselves+0.142
Neg logit contributions
 replied-0.102
ves-0.098
gh-0.086
 story-0.083
 poem-0.083
Token in
Token ID287
Feature activation-2.75e+00

Pos logit contributions
ind+0.221
attered+0.203
 beings+0.201
lves+0.194
 yourselves+0.191
Neg logit contributions
 replied-0.185
ves-0.172
gh-0.168
 -0.168
 story-0.168
Token the
Token ID262
Feature activation-1.21e-01

Pos logit contributions
ind+0.129
 beings+0.120
attered+0.116
asured+0.115
pered+0.113
Neg logit contributions
 replied-0.048
ves-0.047
 story-0.044
gh-0.042
 -0.042
Token tre
Token ID2054
Feature activation-1.18e+01

Pos logit contributions
ind+0.004
 beings+0.003
asured+0.003
pered+0.003
uzz+0.003
Neg logit contributions
 replied-0.004
ves-0.004
 story-0.004
gh-0.004
 poem-0.004
Tokenet
Token ID316
Feature activation+1.95e+00

Pos logit contributions
 beings+0.324
ind+0.317
attered+0.295
 prisoners+0.287
pered+0.271
Neg logit contributions
 replied-0.395
ves-0.394
gh-0.385
 poem-0.359
 story-0.354
Tokenops
Token ID2840
Feature activation+1.25e+00

Pos logit contributions
ves+0.057
 replied+0.057
gh+0.051
 story+0.050
 poem+0.049
Neg logit contributions
ind-0.070
 beings-0.065
asured-0.061
pered-0.060
attered-0.059
Token,
Token ID11
Feature activation-1.06e+00

Pos logit contributions
ves+0.035
 replied+0.035
gh+0.031
 story+0.030
 poem+0.030
Neg logit contributions
ind-0.046
 beings-0.043
asured-0.040
pered-0.039
attered-0.039
Token climbing
Token ID14281
Feature activation-1.56e+00

Pos logit contributions
ind+0.040
 beings+0.037
attered+0.034
asured+0.034
pered+0.034
Neg logit contributions
 replied-0.030
ves-0.028
 -0.026
 poem-0.026
gh-0.025
Token from
Token ID422
Feature activation-3.64e+00

Pos logit contributions
ind+0.061
 beings+0.056
asured+0.054
attered+0.052
pered+0.051
Neg logit contributions
ves-0.041
 replied-0.040
gh-0.036
 story-0.034
 -0.034
Token as
Token ID355
Feature activation-2.64e+00

Pos logit contributions
ves+0.018
 replied+0.018
gh+0.016
 story+0.015
 poem+0.015
Neg logit contributions
ind-0.025
 beings-0.023
asured-0.022
pered-0.021
 yourselves-0.021
Token he
Token ID339
Feature activation-1.33e+00

Pos logit contributions
ind+0.124
 beings+0.115
asured+0.110
pered+0.108
 yourselves+0.107
Neg logit contributions
ves-0.047
 replied-0.047
gh-0.041
 story-0.038
 poem-0.038
Token soared
Token ID35292
Feature activation-1.54e+00

Pos logit contributions
ind+0.053
 beings+0.048
asured+0.046
pered+0.045
attered+0.044
Neg logit contributions
ves-0.033
 replied-0.033
gh-0.031
 story-0.029
 poem-0.029
Token over
Token ID625
Feature activation-1.14e+00

Pos logit contributions
ind+0.049
 beings+0.044
asured+0.042
pered+0.040
 yourselves+0.040
Neg logit contributions
 replied-0.051
ves-0.051
gh-0.047
 story-0.045
 poem-0.045
Token the
Token ID262
Feature activation-1.00e-01

Pos logit contributions
ind+0.053
 beings+0.050
asured+0.048
attered+0.047
 prisoners+0.046
Neg logit contributions
 replied-0.021
ves-0.019
 -0.017
gh-0.017
 story-0.017
Token tre
Token ID2054
Feature activation-1.15e+01

Pos logit contributions
ind+0.004
 beings+0.003
asured+0.003
pered+0.003
attered+0.003
Neg logit contributions
 replied-0.003
ves-0.003
gh-0.002
 story-0.002
 poem-0.002
Tokenet
Token ID316
Feature activation+1.81e+00

Pos logit contributions
ind+0.345
 beings+0.334
attered+0.299
 prisoners+0.298
uzz+0.289
Neg logit contributions
ves-0.375
 replied-0.374
gh-0.352
 poem-0.336
 happen-0.327
Tokenops
Token ID2840
Feature activation+1.53e+00

Pos logit contributions
ves+0.054
 replied+0.053
gh+0.050
 poem+0.047
 story+0.047
Neg logit contributions
ind-0.062
 beings-0.053
asured-0.051
pered-0.051
attered-0.050
Token.
Token ID13
Feature activation-9.79e-01

Pos logit contributions
 replied+0.044
ves+0.044
gh+0.041
 poem+0.039
 story+0.039
Neg logit contributions
ind-0.053
 beings-0.048
asured-0.046
pered-0.045
attered-0.044
Token It
Token ID632
Feature activation+1.66e+00

Pos logit contributions
ind+0.042
 beings+0.038
asured+0.037
pered+0.036
attered+0.035
Neg logit contributions
 replied-0.022
ves-0.022
gh-0.020
 story-0.019
 poem-0.018
Token felt
Token ID2936
Feature activation+1.08e+00

Pos logit contributions
ves+0.037
 replied+0.035
gh+0.034
 story+0.031
 poem+0.031
Neg logit contributions
ind-0.070
 beings-0.064
asured-0.062
attered-0.060
 alone-0.059
Token soon
Token ID2582
Feature activation-1.11e-02

Pos logit contributions
ves+0.024
 replied+0.023
gh+0.022
 story+0.020
 poem+0.020
Neg logit contributions
ind-0.047
 beings-0.044
asured-0.043
pered-0.041
 alone-0.041
Token became
Token ID2627
Feature activation-5.53e+00

Pos logit contributions
ind+0.000
 beings+0.000
asured+0.000
 alone+0.000
attered+0.000
Neg logit contributions
ves-0.000
gh-0.000
 replied-0.000
 story-0.000
enda-0.000
Token Joy
Token ID14087
Feature activation-1.82e+00

Pos logit contributions
ind+0.214
 beings+0.196
asured+0.184
pered+0.180
attered+0.179
Neg logit contributions
 replied-0.147
ves-0.146
gh-0.134
 story-0.126
 poem-0.126
Token's
Token ID338
Feature activation-2.44e+00

Pos logit contributions
ind+0.081
 beings+0.075
asured+0.072
 yourselves+0.071
pered+0.070
Neg logit contributions
ves-0.037
 replied-0.037
gh-0.033
 poem-0.032
 story-0.032
Token most
Token ID749
Feature activation-1.26e+00

Pos logit contributions
ind+0.088
 beings+0.080
asured+0.076
pered+0.074
attered+0.073
Neg logit contributions
ves-0.071
 replied-0.071
gh-0.065
 story-0.062
 poem-0.062
Token tre
Token ID2054
Feature activation-1.14e+01

Pos logit contributions
ind+0.051
 beings+0.046
pered+0.046
 yourselves+0.045
ishes+0.044
Neg logit contributions
 replied-0.030
ves-0.029
 poem-0.027
gh-0.027
 story-0.027
Tokenasured
Token ID34006
Feature activation-6.86e+00

Pos logit contributions
 beings+0.131
ind+0.129
ishes+0.089
attered+0.087
 prisoners+0.079
Neg logit contributions
ves-0.561
gh-0.559
 replied-0.557
 happen-0.527
 poem-0.514
Token toy
Token ID13373
Feature activation-2.28e+00

Pos logit contributions
ind+0.216
 beings+0.194
pered+0.179
asured+0.178
attered+0.175
Neg logit contributions
ves-0.231
 replied-0.229
gh-0.215
 poem-0.210
 story-0.208
Token,
Token ID11
Feature activation-5.30e-01

Pos logit contributions
ind+0.098
 beings+0.089
asured+0.086
 yourselves+0.084
pered+0.083
Neg logit contributions
 replied-0.052
ves-0.051
gh-0.046
 poem-0.044
 story-0.044
Token and
Token ID290
Feature activation-1.42e-01

Pos logit contributions
ind+0.018
 beings+0.017
asured+0.016
 yourselves+0.015
pered+0.015
Neg logit contributions
ves-0.016
 replied-0.016
gh-0.015
 story-0.014
 poem-0.014
Token she
Token ID673
Feature activation-1.53e+00

Pos logit contributions
ind+0.006
 beings+0.006
asured+0.006
pered+0.006
 yourselves+0.006
Neg logit contributions
ves-0.003
 replied-0.003
gh-0.002
 story-0.002
 poem-0.002
Token She
Token ID1375
Feature activation-4.50e+00

Pos logit contributions
ind+0.128
 beings+0.116
asured+0.110
attered+0.106
ishing+0.106
Neg logit contributions
 replied-0.089
ves-0.083
 -0.078
gh-0.075
 story-0.074
Token picks
Token ID11103
Feature activation+1.52e+00

Pos logit contributions
ind+0.197
 beings+0.182
asured+0.174
pered+0.170
attered+0.169
Neg logit contributions
ves-0.096
 replied-0.096
gh-0.084
 story-0.079
 poem-0.079
Token up
Token ID510
Feature activation-2.20e+00

Pos logit contributions
ves+0.048
 replied+0.047
gh+0.045
 story+0.043
 poem+0.042
Neg logit contributions
ind-0.050
 beings-0.046
asured-0.042
pered-0.041
 yourselves-0.041
Token the
Token ID262
Feature activation-5.94e-01

Pos logit contributions
ind+0.091
 beings+0.084
asured+0.080
pered+0.078
attered+0.078
Neg logit contributions
ves-0.052
 replied-0.052
gh-0.046
 story-0.044
 poem-0.044
Token re
Token ID302
Feature activation-7.23e+00

Pos logit contributions
ind+0.023
 beings+0.021
asured+0.020
pered+0.019
attered+0.019
Neg logit contributions
ves-0.016
 replied-0.016
gh-0.014
 story-0.014
 poem-0.014
Tokenind
Token ID521
Feature activation-1.14e+01

Pos logit contributions
pered+0.083
 beings+0.069
 yourselves+0.057
attered+0.056
 underground+0.053
Neg logit contributions
ves-0.368
 replied-0.351
gh-0.342
 poem-0.332
 story-0.332
Tokeneer
Token ID28153
Feature activation-2.16e+00

Pos logit contributions
pered+0.213
 beings+0.204
 prisoners+0.198
 yourselves+0.191
ind+0.190
Neg logit contributions
ves-0.474
 replied-0.444
 story-0.422
 poem-0.420
 happen-0.405
Token.
Token ID13
Feature activation-2.29e+00

Pos logit contributions
ind+0.087
 beings+0.078
asured+0.076
attered+0.074
 yourselves+0.073
Neg logit contributions
 replied-0.057
ves-0.054
 story-0.050
 poem-0.049
 -0.048
Token She
Token ID1375
Feature activation-3.67e+00

Pos logit contributions
ind+0.077
 beings+0.069
asured+0.066
attered+0.063
pered+0.062
Neg logit contributions
 replied-0.073
ves-0.072
gh-0.066
 story-0.064
 poem-0.064
Token looks
Token ID3073
Feature activation+1.34e+00

Pos logit contributions
ind+0.152
 beings+0.140
asured+0.133
attered+0.131
pered+0.130
Neg logit contributions
 replied-0.090
ves-0.088
gh-0.076
 story-0.075
 poem-0.074
Token at
Token ID379
Feature activation-1.26e+00

Pos logit contributions
 replied+0.053
ves+0.049
 +0.047
gh+0.046
 story+0.045
Neg logit contributions
ind-0.037
 beings-0.032
asured-0.030
 yourselves-0.029
attered-0.029
Token ran
Token ID4966
Feature activation-3.35e+00

Pos logit contributions
ind+0.023
 beings+0.021
asured+0.020
pered+0.020
 yourselves+0.020
Neg logit contributions
 replied-0.014
ves-0.014
gh-0.013
 poem-0.012
 story-0.012
Token up
Token ID510
Feature activation-2.95e+00

Pos logit contributions
ind+0.134
 beings+0.124
asured+0.118
pered+0.115
attered+0.114
Neg logit contributions
ves-0.084
 replied-0.083
gh-0.076
 story-0.072
 poem-0.072
Token to
Token ID284
Feature activation-3.00e-02

Pos logit contributions
ind+0.124
 beings+0.115
asured+0.110
pered+0.108
attered+0.105
Neg logit contributions
ves-0.069
 replied-0.067
gh-0.061
 poem-0.057
 story-0.057
Token the
Token ID262
Feature activation+2.21e-01

Pos logit contributions
ind+0.001
 beings+0.001
asured+0.001
pered+0.001
 yourselves+0.001
Neg logit contributions
ves-0.001
 replied-0.001
gh-0.001
 poem-0.001
 story-0.001
Token re
Token ID302
Feature activation-7.09e+00

Pos logit contributions
 replied+0.008
ves+0.007
 story+0.007
 poem+0.007
 +0.007
Neg logit contributions
ind-0.006
 beings-0.006
asured-0.006
pered-0.006
uzz-0.006
Tokenind
Token ID521
Feature activation-1.13e+01

Pos logit contributions
pered+0.036
 beings+0.029
 yourselves+0.018
attered+0.014
 alone+0.013
Neg logit contributions
ves-0.405
 replied-0.388
gh-0.380
 story-0.375
enda-0.366
Tokeneer
Token ID28153
Feature activation-1.80e+00

Pos logit contributions
pered+0.214
 yourselves+0.210
 beings+0.205
asured+0.200
attered+0.199
Neg logit contributions
ves-0.462
 replied-0.434
 story-0.423
 poem-0.414
 happen-0.403
Token,
Token ID11
Feature activation-3.34e-01

Pos logit contributions
ind+0.076
 beings+0.070
 yourselves+0.066
asured+0.066
attered+0.064
Neg logit contributions
 replied-0.043
ves-0.041
 story-0.037
 -0.036
gh-0.036
Token pet
Token ID4273
Feature activation-1.14e+00

Pos logit contributions
ind+0.014
 beings+0.013
asured+0.012
attered+0.012
pered+0.012
Neg logit contributions
 replied-0.008
ves-0.008
 -0.007
gh-0.007
 poem-0.007
Tokenting
Token ID889
Feature activation-1.85e+00

Pos logit contributions
ind+0.036
 beings+0.033
asured+0.032
pered+0.030
attered+0.029
Neg logit contributions
 replied-0.039
ves-0.038
gh-0.035
 poem-0.033
 story-0.033
Token it
Token ID340
Feature activation+1.27e+00

Pos logit contributions
ind+0.075
 beings+0.070
asured+0.068
attered+0.066
pered+0.065
Neg logit contributions
ves-0.045
 replied-0.044
gh-0.039
 story-0.038
 poem-0.037
Token one
Token ID530
Feature activation-2.30e+00

Pos logit contributions
ind+0.242
 beings+0.221
asured+0.208
attered+0.206
pered+0.205
Neg logit contributions
 replied-0.141
ves-0.140
gh-0.128
 story-0.120
 poem-0.120
Token of
Token ID286
Feature activation-4.18e+00

Pos logit contributions
ind+0.087
 beings+0.080
asured+0.076
pered+0.074
 yourselves+0.073
Neg logit contributions
ves-0.063
 replied-0.062
gh-0.057
 story-0.054
 poem-0.054
Token Mary
Token ID5335
Feature activation-6.17e-01

Pos logit contributions
ind+0.175
 beings+0.162
attered+0.156
asured+0.151
pered+0.150
Neg logit contributions
 replied-0.100
ves-0.096
 poem-0.085
 story-0.085
gh-0.085
Token's
Token ID338
Feature activation-2.81e+00

Pos logit contributions
ind+0.029
 beings+0.027
asured+0.026
pered+0.025
attered+0.025
Neg logit contributions
ves-0.011
 replied-0.011
gh-0.010
 story-0.009
 poem-0.009
Token most
Token ID749
Feature activation-1.12e+00

Pos logit contributions
ind+0.101
 beings+0.092
asured+0.087
pered+0.085
 yourselves+0.084
Neg logit contributions
ves-0.082
 replied-0.082
gh-0.075
 story-0.073
 poem-0.072
Token tre
Token ID2054
Feature activation-1.13e+01

Pos logit contributions
ind+0.045
pered+0.042
 yourselves+0.041
 beings+0.041
uzz+0.040
Neg logit contributions
 replied-0.027
ves-0.026
 poem-0.024
 story-0.024
gh-0.023
Tokenasured
Token ID34006
Feature activation-6.89e+00

Pos logit contributions
 beings+0.111
ind+0.109
attered+0.076
 prisoners+0.075
ishes+0.070
Neg logit contributions
ves-0.570
gh-0.562
 replied-0.558
 poem-0.525
 happen-0.524
Token possessions
Token ID23309
Feature activation-3.92e+00

Pos logit contributions
ind+0.212
 beings+0.189
asured+0.177
pered+0.176
 yourselves+0.173
Neg logit contributions
ves-0.238
 replied-0.234
gh-0.219
 poem-0.215
 story-0.213
Token and
Token ID290
Feature activation-2.27e-01

Pos logit contributions
ind+0.159
 beings+0.145
asured+0.139
pered+0.135
 yourselves+0.134
Neg logit contributions
ves-0.096
 replied-0.096
gh-0.087
 story-0.083
 poem-0.082
Token she
Token ID673
Feature activation-1.84e+00

Pos logit contributions
ind+0.009
 beings+0.008
asured+0.008
 yourselves+0.008
attered+0.008
Neg logit contributions
ves-0.006
 replied-0.005
gh-0.005
 story-0.005
 poem-0.005
Token never
Token ID1239
Feature activation-1.82e+00

Pos logit contributions
ind+0.077
 beings+0.071
pered+0.067
asured+0.066
 yourselves+0.066
Neg logit contributions
 replied-0.044
ves-0.043
gh-0.039
 story-0.037
 poem-0.037
Token couldn
Token ID3521
Feature activation+2.86e+00

Pos logit contributions
ind+0.001
 beings+0.001
asured+0.001
attered+0.001
pered+0.001
Neg logit contributions
ves-0.001
 replied-0.000
gh-0.000
 story-0.000
 poem-0.000
Token't
Token ID470
Feature activation+2.37e+00

Pos logit contributions
 replied+0.063
ves+0.059
 poem+0.058
enda+0.051
gh+0.051
Neg logit contributions
ind-0.113
pered-0.102
 alone-0.102
 yourselves-0.102
 beings-0.098
Token get
Token ID651
Feature activation-8.73e-01

Pos logit contributions
ves+0.065
 replied+0.065
gh+0.059
 poem+0.055
 story+0.055
Neg logit contributions
ind-0.087
 beings-0.079
asured-0.077
pered-0.075
 alone-0.074
Token above
Token ID2029
Feature activation+1.70e+00

Pos logit contributions
ind+0.035
 beings+0.032
attered+0.030
asured+0.030
pered+0.029
Neg logit contributions
 replied-0.022
ves-0.021
gh-0.020
 happen-0.019
 -0.019
Token the
Token ID262
Feature activation+6.59e-01

Pos logit contributions
ves+0.039
 replied+0.038
gh+0.035
 story+0.033
 poem+0.032
Neg logit contributions
ind-0.072
 beings-0.067
asured-0.064
pered-0.062
attered-0.062
Token tre
Token ID2054
Feature activation-1.13e+01

Pos logit contributions
ves+0.020
 replied+0.019
gh+0.018
 happen+0.016
 poem+0.016
Neg logit contributions
ind-0.024
 beings-0.022
asured-0.020
 yourselves-0.020
pered-0.020
Tokenet
Token ID316
Feature activation+2.01e+00

Pos logit contributions
ind+0.320
 beings+0.297
attered+0.267
 prisoners+0.260
asured+0.258
Neg logit contributions
ves-0.395
 replied-0.393
gh-0.373
 poem-0.357
 story-0.351
Tokenops
Token ID2840
Feature activation+1.55e+00

Pos logit contributions
ves+0.060
 replied+0.058
gh+0.056
 Kelly+0.053
 poem+0.053
Neg logit contributions
ind-0.063
 beings-0.057
 alone-0.055
pered-0.054
ter-0.053
Token.
Token ID13
Feature activation-5.95e-01

Pos logit contributions
 replied+0.046
ves+0.046
gh+0.042
enda+0.040
 poem+0.040
Neg logit contributions
ind-0.050
 beings-0.046
 alone-0.046
pered-0.044
asured-0.043
Token He
Token ID679
Feature activation-2.35e+00

Pos logit contributions
ind+0.024
 beings+0.021
asured+0.020
ishing+0.019
ubby+0.019
Neg logit contributions
 replied-0.017
ves-0.016
 -0.016
gh-0.015
 story-0.014
Token was
Token ID373
Feature activation-6.60e-01

Pos logit contributions
ind+0.097
 beings+0.089
asured+0.086
attered+0.082
pered+0.082
Neg logit contributions
ves-0.055
 replied-0.054
gh-0.050
 story-0.047
 poem-0.047
Token of
Token ID286
Feature activation-3.49e+00

Pos logit contributions
ind+0.056
 beings+0.052
asured+0.049
attered+0.046
pered+0.046
Neg logit contributions
ves-0.048
 replied-0.046
gh-0.044
 story-0.040
 happen-0.040
Token the
Token ID262
Feature activation+9.04e-01

Pos logit contributions
ind+0.146
 beings+0.135
attered+0.128
asured+0.128
pered+0.124
Neg logit contributions
 replied-0.081
ves-0.079
gh-0.070
 story-0.069
 poem-0.067
Token house
Token ID2156
Feature activation-2.45e+00

Pos logit contributions
 replied+0.025
ves+0.024
 story+0.023
 poem+0.023
gh+0.023
Neg logit contributions
ind-0.033
 beings-0.030
asured-0.029
pered-0.029
attered-0.028
Token's
Token ID338
Feature activation-3.05e+00

Pos logit contributions
ind+0.094
 beings+0.085
asured+0.081
pered+0.079
 yourselves+0.079
Neg logit contributions
 replied-0.066
ves-0.065
gh-0.060
 story-0.057
 poem-0.057
Token most
Token ID749
Feature activation-9.18e-01

Pos logit contributions
ind+0.116
 beings+0.106
asured+0.101
pered+0.100
 yourselves+0.099
Neg logit contributions
 replied-0.081
ves-0.079
gh-0.074
 story-0.073
 poem-0.072
Token tre
Token ID2054
Feature activation-1.11e+01

Pos logit contributions
ind+0.039
pered+0.035
 beings+0.035
 yourselves+0.034
uzz+0.034
Neg logit contributions
 replied-0.021
ves-0.020
 poem-0.018
 story-0.018
gh-0.018
Tokenasured
Token ID34006
Feature activation-6.45e+00

Pos logit contributions
ind+0.143
 beings+0.142
attered+0.110
 prisoners+0.104
ishes+0.104
Neg logit contributions
ves-0.518
gh-0.511
 replied-0.508
 happen-0.483
 poem-0.474
Token treasures
Token ID29561
Feature activation-2.52e+00

Pos logit contributions
ind+0.211
 beings+0.187
pered+0.180
asured+0.179
 yourselves+0.172
Neg logit contributions
ves-0.208
 replied-0.205
gh-0.191
 poem-0.188
 story-0.188
Token.
Token ID13
Feature activation-8.74e-01

Pos logit contributions
ind+0.096
 beings+0.089
asured+0.084
pered+0.081
attered+0.081
Neg logit contributions
ves-0.066
 replied-0.066
gh-0.060
 story-0.057
 poem-0.057
Token<|endoftext|>
Token ID50256
Feature activation-4.78e+00

Pos logit contributions
ind+0.033
 beings+0.030
asured+0.029
attered+0.029
uzz+0.029
Neg logit contributions
ves-0.024
gh-0.022
 replied-0.022
 poem-0.021
 happen-0.020
TokenOnce
Token ID7454
Feature activation-1.53e+00

Pos logit contributions
ind+0.172
 beings+0.156
 alone+0.152
pered+0.150
uzz+0.147
Neg logit contributions
ves-0.131
 replied-0.124
gh-0.118
 poem-0.118
 happen-0.115
Token up
Token ID510
Feature activation-2.57e+00

Pos logit contributions
ind+0.116
 beings+0.109
 yourselves+0.104
asured+0.101
 prisoners+0.099
Neg logit contributions
 replied-0.086
ves-0.077
 -0.073
gh-0.070
 story-0.069
Token to
Token ID284
Feature activation+3.26e-01

Pos logit contributions
ind+0.105
 beings+0.097
asured+0.093
pered+0.091
uzz+0.089
Neg logit contributions
ves-0.063
 replied-0.060
gh-0.056
 poem-0.052
 story-0.052
Token the
Token ID262
Feature activation+6.92e-01

Pos logit contributions
ves+0.009
 replied+0.008
gh+0.008
 poem+0.008
 story+0.007
Neg logit contributions
ind-0.012
 beings-0.011
asured-0.011
 yourselves-0.010
pered-0.010
Token restless
Token ID42537
Feature activation-3.14e+00

Pos logit contributions
 replied+0.025
ves+0.024
 story+0.023
 poem+0.023
gh+0.022
Neg logit contributions
ind-0.020
 beings-0.018
asured-0.017
pered-0.017
attered-0.017
Token re
Token ID302
Feature activation-6.78e+00

Pos logit contributions
attered+0.094
pered+0.094
years+0.092
asured+0.091
 beings+0.091
Neg logit contributions
 replied-0.097
 story-0.094
ves-0.092
 poem-0.089
 happen-0.082
Tokenind
Token ID521
Feature activation-1.11e+01

Pos logit contributions
pered+0.051
 beings+0.035
attered+0.031
 yourselves+0.028
 alone+0.025
Neg logit contributions
ves-0.376
 replied-0.353
gh-0.349
 story-0.348
 poem-0.335
Tokeneer
Token ID28153
Feature activation-1.31e+00

Pos logit contributions
pered+0.207
attered+0.202
 yourselves+0.200
 beings+0.194
 prisoners+0.189
Neg logit contributions
ves-0.454
 replied-0.436
 story-0.427
 poem-0.418
 happen-0.407
Token and
Token ID290
Feature activation+2.33e-01

Pos logit contributions
ind+0.047
 beings+0.042
asured+0.040
 yourselves+0.039
attered+0.039
Neg logit contributions
 replied-0.039
ves-0.038
gh-0.035
 story-0.035
 poem-0.034
Token said
Token ID531
Feature activation+3.28e+00

Pos logit contributions
ves+0.005
gh+0.004
 story+0.004
 replied+0.004
 happen+0.004
Neg logit contributions
ind-0.010
 beings-0.010
asured-0.009
 alone-0.009
pered-0.009
Token,
Token ID11
Feature activation+8.42e-01

Pos logit contributions
ves+0.073
 replied+0.072
gh+0.064
 poem+0.062
 story+0.061
Neg logit contributions
ind-0.138
 beings-0.129
asured-0.122
pered-0.121
 yourselves-0.119
Token "
Token ID366
Feature activation-2.64e+00

Pos logit contributions
ves+0.022
 replied+0.021
gh+0.020
 poem+0.019
 story+0.019
Neg logit contributions
ind-0.032
 beings-0.030
pered-0.028
asured-0.028
 alone-0.027
Token to
Token ID284
Feature activation+1.18e+00

Pos logit contributions
ind+0.069
 beings+0.063
asured+0.059
pered+0.057
 yourselves+0.057
Neg logit contributions
 replied-0.047
ves-0.046
gh-0.042
 story-0.040
 poem-0.040
Token work
Token ID670
Feature activation+1.28e+00

Pos logit contributions
ves+0.032
 replied+0.030
gh+0.029
 poem+0.026
 story+0.026
Neg logit contributions
ind-0.045
 beings-0.041
asured-0.039
pered-0.038
 yourselves-0.038
Token cleaning
Token ID12724
Feature activation-9.76e-01

Pos logit contributions
ves+0.036
 replied+0.036
gh+0.032
 poem+0.032
 story+0.031
Neg logit contributions
ind-0.046
 beings-0.043
asured-0.040
pered-0.040
 alone-0.039
Token the
Token ID262
Feature activation+5.31e-01

Pos logit contributions
ind+0.037
 beings+0.034
asured+0.032
pered+0.031
attered+0.031
Neg logit contributions
ves-0.026
 replied-0.026
gh-0.024
 story-0.023
 happen-0.022
Token re
Token ID302
Feature activation-6.61e+00

Pos logit contributions
 story+0.018
 replied+0.017
 poem+0.016
 way+0.015
 +0.015
Neg logit contributions
uzz-0.018
attered-0.015
ases-0.015
years-0.015
ateful-0.015
Tokenind
Token ID521
Feature activation-1.10e+01

Pos logit contributions
pered+0.045
 beings+0.031
attered+0.030
 alone+0.022
 chefs+0.021
Neg logit contributions
ves-0.369
 replied-0.346
 story-0.346
gh-0.338
 poem-0.331
Tokeneer
Token ID28153
Feature activation-1.47e+00

Pos logit contributions
pered+0.201
attered+0.193
years+0.184
 beings+0.183
asured+0.177
Neg logit contributions
ves-0.462
 replied-0.436
 story-0.433
 poem-0.416
 happen-0.407
Token.
Token ID13
Feature activation-4.64e-01

Pos logit contributions
ind+0.060
 beings+0.055
asured+0.053
pered+0.051
attered+0.051
Neg logit contributions
 replied-0.036
ves-0.035
gh-0.031
 story-0.031
 happen-0.030
Token With
Token ID2080
Feature activation-2.43e-02

Pos logit contributions
ind+0.017
 beings+0.015
asured+0.015
attered+0.014
pered+0.014
Neg logit contributions
 replied-0.014
ves-0.013
 -0.012
 story-0.012
gh-0.012
Token her
Token ID607
Feature activation-3.10e+00

Pos logit contributions
ind+0.001
 beings+0.001
asured+0.001
pered+0.001
 yourselves+0.001
Neg logit contributions
ves-0.001
 replied-0.001
gh-0.001
 story-0.001
 poem-0.001
Token help
Token ID1037
Feature activation+4.91e-01

Pos logit contributions
ind+0.121
 beings+0.112
asured+0.105
pered+0.102
 yourselves+0.100
Neg logit contributions
ves-0.081
 replied-0.079
gh-0.074
 poem-0.068
 happen-0.067
Token,
Token ID11
Feature activation-2.66e+00

Pos logit contributions
ind+0.072
 beings+0.067
asured+0.063
pered+0.062
attered+0.062
Neg logit contributions
ves-0.041
 replied-0.041
gh-0.037
 poem-0.035
 story-0.035
Token he
Token ID339
Feature activation-2.63e+00

Pos logit contributions
ind+0.119
 beings+0.110
asured+0.106
pered+0.102
attered+0.102
Neg logit contributions
 replied-0.055
ves-0.055
gh-0.048
 story-0.045
 poem-0.045
Token spotted
Token ID13489
Feature activation-2.16e+00

Pos logit contributions
ind+0.100
 beings+0.090
asured+0.086
pered+0.085
 yourselves+0.083
Neg logit contributions
ves-0.070
 replied-0.065
gh-0.062
 happen-0.060
 poem-0.059
Token another
Token ID1194
Feature activation-1.07e+00

Pos logit contributions
ind+0.089
 beings+0.082
asured+0.078
pered+0.077
 yourselves+0.076
Neg logit contributions
ves-0.051
 replied-0.050
gh-0.046
 story-0.043
 poem-0.043
Token re
Token ID302
Feature activation-7.52e+00

Pos logit contributions
ind+0.040
 beings+0.036
asured+0.035
pered+0.034
 yourselves+0.034
Neg logit contributions
 replied-0.030
ves-0.030
gh-0.027
 story-0.026
 poem-0.026
Tokenind
Token ID521
Feature activation-1.10e+01

Pos logit contributions
pered+0.046
 beings+0.040
 yourselves+0.032
attered+0.027
 chefs+0.021
Neg logit contributions
ves-0.404
 replied-0.395
gh-0.392
 story-0.378
 poem-0.372
Tokeneer
Token ID28153
Feature activation-2.08e+00

Pos logit contributions
 beings+0.198
 yourselves+0.196
pered+0.196
ind+0.194
attered+0.187
Neg logit contributions
ves-0.447
 replied-0.443
 story-0.412
 poem-0.408
 happen-0.407
Token.
Token ID13
Feature activation-1.19e+00

Pos logit contributions
ind+0.076
 beings+0.070
asured+0.067
 yourselves+0.064
attered+0.064
Neg logit contributions
 replied-0.060
ves-0.059
gh-0.054
 story-0.053
 poem-0.051
Token
Token ID220
Feature activation+1.74e+00

Pos logit contributions
ind+0.050
 beings+0.046
asured+0.044
attered+0.043
ishing+0.042
Neg logit contributions
 replied-0.029
ves-0.027
 -0.025
gh-0.025
 story-0.024
Token
Token ID198
Feature activation+5.62e-01

Pos logit contributions
ves+0.031
 replied+0.031
gh+0.027
 poem+0.025
 story+0.024
Neg logit contributions
ind-0.081
 beings-0.076
asured-0.073
pered-0.072
 yourselves-0.071
Token
Token ID198
Feature activation+4.85e-01

Pos logit contributions
ves+0.013
 replied+0.013
 happen+0.012
gh+0.011
 poem+0.011
Neg logit contributions
ind-0.023
 beings-0.021
pered-0.020
uzz-0.020
asured-0.020
Token the
Token ID262
Feature activation+1.93e+00

Pos logit contributions
ind+0.009
 beings+0.009
attered+0.008
asured+0.008
pered+0.008
Neg logit contributions
 replied-0.004
ves-0.004
 -0.004
 story-0.003
 happen-0.003
Token razor
Token ID26811
Feature activation+7.87e-01

Pos logit contributions
 replied+0.054
ves+0.054
gh+0.049
 story+0.048
 poem+0.047
Neg logit contributions
ind-0.073
 beings-0.067
asured-0.063
pered-0.062
attered-0.061
Token was
Token ID373
Feature activation-1.21e+00

Pos logit contributions
 replied+0.014
ves+0.014
gh+0.012
 story+0.011
 poem+0.011
Neg logit contributions
ind-0.038
 beings-0.035
pered-0.034
 yourselves-0.034
asured-0.034
Token the
Token ID262
Feature activation+1.37e+00

Pos logit contributions
ind+0.042
 beings+0.038
asured+0.037
pered+0.035
 yourselves+0.034
Neg logit contributions
ves-0.037
 replied-0.037
gh-0.033
 poem-0.032
 story-0.032
Token most
Token ID749
Feature activation-4.78e-01

Pos logit contributions
 replied+0.038
ves+0.036
 story+0.035
gh+0.035
 poem+0.035
Neg logit contributions
ind-0.051
 beings-0.046
asured-0.044
pered-0.044
attered-0.043
Token tre
Token ID2054
Feature activation-1.09e+01

Pos logit contributions
ind+0.019
pered+0.018
 beings+0.017
 yourselves+0.017
uzz+0.017
Neg logit contributions
 replied-0.012
ves-0.011
 poem-0.011
 story-0.010
gh-0.010
Tokenasured
Token ID34006
Feature activation-6.25e+00

Pos logit contributions
 beings+0.131
ind+0.125
 prisoners+0.098
ishes+0.095
attered+0.087
Neg logit contributions
gh-0.520
 replied-0.519
ves-0.514
 happen-0.489
 poem-0.474
Token thing
Token ID1517
Feature activation-8.97e-01

Pos logit contributions
ind+0.201
 beings+0.180
pered+0.175
asured+0.168
 yourselves+0.167
Neg logit contributions
 replied-0.202
ves-0.201
gh-0.190
 poem-0.186
 story-0.183
Token in
Token ID287
Feature activation-2.41e+00

Pos logit contributions
ind+0.037
 beings+0.034
asured+0.033
pered+0.032
attered+0.032
Neg logit contributions
ves-0.021
 replied-0.021
gh-0.019
 story-0.018
 poem-0.018
Token all
Token ID477
Feature activation-4.91e+00

Pos logit contributions
ind+0.102
 beings+0.094
attered+0.092
pered+0.090
asured+0.089
Neg logit contributions
 replied-0.052
ves-0.051
gh-0.048
 story-0.048
 happen-0.046
Token of
Token ID286
Feature activation-3.55e+00

Pos logit contributions
ind+0.209
 beings+0.191
asured+0.184
pered+0.183
attered+0.180
Neg logit contributions
 replied-0.112
ves-0.109
 story-0.098
gh-0.098
 happen-0.097
Token he
Token ID339
Feature activation-5.13e-01

Pos logit contributions
ind+0.005
 beings+0.005
asured+0.004
attered+0.004
ishing+0.004
Neg logit contributions
ves-0.001
 replied-0.001
gh-0.001
 story-0.001
 -0.001
Token saw
Token ID2497
Feature activation+1.68e-02

Pos logit contributions
ind+0.021
 beings+0.020
asured+0.019
pered+0.018
attered+0.018
Neg logit contributions
 replied-0.012
ves-0.012
gh-0.011
 story-0.010
 poem-0.010
Token the
Token ID262
Feature activation+1.71e+00

Pos logit contributions
ves+0.000
 replied+0.000
gh+0.000
 story+0.000
 poem+0.000
Neg logit contributions
ind-0.001
 beings-0.001
asured-0.001
pered-0.001
attered-0.001
Token empty
Token ID6565
Feature activation-2.30e+00

Pos logit contributions
ves+0.042
 replied+0.041
gh+0.037
 happen+0.034
 poem+0.033
Neg logit contributions
ind-0.070
 beings-0.065
asured-0.061
 yourselves-0.060
attered-0.059
Token r
Token ID374
Feature activation-6.73e+00

Pos logit contributions
ind+0.093
 beings+0.087
asured+0.081
 yourselves+0.081
attered+0.079
Neg logit contributions
ves-0.056
 replied-0.054
gh-0.050
 happen-0.046
 poem-0.045
Tokenind
Token ID521
Feature activation-1.09e+01

Pos logit contributions
ind+0.175
 beings+0.160
 prisoners+0.145
lves+0.141
asured+0.137
Neg logit contributions
 replied-0.248
ves-0.246
gh-0.237
 story-0.230
 poem-0.224
Token and
Token ID290
Feature activation-8.74e-02

Pos logit contributions
ind+0.216
 beings+0.207
 prisoners+0.201
pered+0.201
asured+0.196
Neg logit contributions
ves-0.431
 replied-0.427
 poem-0.412
 story-0.411
on-0.403
Token the
Token ID262
Feature activation+1.88e+00

Pos logit contributions
ind+0.003
 beings+0.003
asured+0.003
pered+0.003
attered+0.003
Neg logit contributions
 replied-0.002
ves-0.002
gh-0.002
 happen-0.002
 story-0.002
Token seeds
Token ID11904
Feature activation+2.08e+00

Pos logit contributions
ves+0.051
 replied+0.050
gh+0.046
 story+0.043
 poem+0.043
Neg logit contributions
ind-0.072
 beings-0.066
asured-0.063
pered-0.061
 yourselves-0.060
Token on
Token ID319
Feature activation+2.27e+00

Pos logit contributions
ves+0.054
 replied+0.054
gh+0.049
 story+0.046
 poem+0.046
Neg logit contributions
ind-0.081
 beings-0.075
asured-0.071
pered-0.069
attered-0.068
Token the
Token ID262
Feature activation+1.17e+00

Pos logit contributions
ves+0.051
 replied+0.050
gh+0.046
 story+0.043
 poem+0.043
Neg logit contributions
ind-0.096
 beings-0.090
asured-0.086
attered-0.084
pered-0.084
Token small
Token ID1402
Feature activation-4.77e-01

Pos logit contributions
 replied+0.078
ves+0.075
 story+0.070
 poem+0.070
gh+0.067
Neg logit contributions
ind-0.120
 beings-0.114
asured-0.114
pered-0.111
 yourselves-0.110
Token bird
Token ID6512
Feature activation-1.53e+00

Pos logit contributions
asured+0.017
ind+0.017
pered+0.017
 yourselves+0.017
attered+0.016
Neg logit contributions
 replied-0.013
 story-0.013
 poem-0.012
 picture-0.011
ves-0.010
Token sitting
Token ID5586
Feature activation-4.57e+00

Pos logit contributions
ind+0.045
 beings+0.041
asured+0.038
pered+0.037
attered+0.036
Neg logit contributions
ves-0.054
 replied-0.054
gh-0.050
 story-0.048
 poem-0.048
Token on
Token ID319
Feature activation+2.24e+00

Pos logit contributions
ind+0.161
 beings+0.152
attered+0.150
 prisoners+0.149
asured+0.147
Neg logit contributions
 replied-0.127
 -0.114
ves-0.112
 story-0.111
on-0.105
Token the
Token ID262
Feature activation+7.20e-01

Pos logit contributions
ves+0.058
 replied+0.057
gh+0.052
 poem+0.048
 story+0.047
Neg logit contributions
ind-0.089
 beings-0.080
 yourselves-0.076
asured-0.075
pered-0.074
Token tre
Token ID2054
Feature activation-1.09e+01

Pos logit contributions
ves+0.026
 replied+0.025
gh+0.024
enda+0.022
 Kelly+0.022
Neg logit contributions
ind-0.023
 beings-0.018
 alone-0.017
 yourselves-0.017
 underground-0.017
Tokenet
Token ID316
Feature activation+2.31e+00

Pos logit contributions
ind+0.280
 beings+0.276
attered+0.243
 prisoners+0.240
pered+0.229
Neg logit contributions
ves-0.399
 replied-0.388
gh-0.380
 poem-0.357
 happen-0.353
Tokenop
Token ID404
Feature activation+3.76e-01

Pos logit contributions
ves+0.066
 replied+0.065
gh+0.061
 happen+0.059
 poem+0.058
Neg logit contributions
ind-0.082
 beings-0.071
asured-0.068
 alone-0.067
attered-0.067
Token.
Token ID13
Feature activation-3.39e-01

Pos logit contributions
ves+0.010
 replied+0.010
gh+0.009
 poem+0.009
 story+0.009
Neg logit contributions
ind-0.014
 beings-0.013
asured-0.012
pered-0.012
attered-0.012
Token His
Token ID2399
Feature activation-2.58e+00

Pos logit contributions
ind+0.013
 beings+0.012
asured+0.012
attered+0.011
ishing+0.011
Neg logit contributions
 replied-0.009
ves-0.009
 -0.008
gh-0.008
 story-0.008
Token name
Token ID1438
Feature activation+1.35e+00

Pos logit contributions
ind+0.087
 beings+0.079
asured+0.075
 yourselves+0.072
pered+0.072
Neg logit contributions
 replied-0.082
ves-0.081
gh-0.075
 story-0.074
 poem-0.073
Token walked
Token ID6807
Feature activation-3.34e+00

Pos logit contributions
ind+0.030
 beings+0.028
asured+0.026
 yourselves+0.025
pered+0.025
Neg logit contributions
 replied-0.026
ves-0.025
gh-0.022
 poem-0.022
 story-0.021
Token up
Token ID510
Feature activation-2.52e+00

Pos logit contributions
ind+0.115
 beings+0.105
asured+0.098
 yourselves+0.096
attered+0.095
Neg logit contributions
 replied-0.103
ves-0.101
gh-0.093
 story-0.090
 poem-0.089
Token to
Token ID284
Feature activation+2.99e-01

Pos logit contributions
ind+0.106
 beings+0.098
asured+0.094
pered+0.093
attered+0.090
Neg logit contributions
ves-0.058
 replied-0.056
gh-0.052
 poem-0.048
 story-0.048
Token the
Token ID262
Feature activation+8.12e-01

Pos logit contributions
ves+0.008
gh+0.008
 replied+0.007
 poem+0.007
enda+0.007
Neg logit contributions
ind-0.011
 beings-0.010
 alone-0.010
asured-0.009
 yourselves-0.009
Token re
Token ID302
Feature activation-6.66e+00

Pos logit contributions
 replied+0.031
ves+0.031
gh+0.028
 story+0.028
 poem+0.028
Neg logit contributions
ind-0.021
 beings-0.019
asured-0.018
pered-0.018
attered-0.017
Tokenind
Token ID521
Feature activation-1.08e+01

Pos logit contributions
pered+0.039
 beings+0.032
 yourselves+0.017
attered+0.015
 prisoners+0.014
Neg logit contributions
ves-0.378
 replied-0.364
gh-0.356
 story-0.352
 poem-0.339
Tokeneer
Token ID28153
Feature activation-1.47e+00

Pos logit contributions
 beings+0.183
pered+0.181
ind+0.172
attered+0.171
 yourselves+0.171
Neg logit contributions
ves-0.464
 replied-0.448
 story-0.432
 happen-0.422
 poem-0.417
Token.
Token ID13
Feature activation-5.77e-01

Pos logit contributions
ind+0.061
 beings+0.057
asured+0.054
 yourselves+0.053
attered+0.052
Neg logit contributions
 replied-0.035
ves-0.034
 story-0.030
gh-0.030
 -0.029
Token Suddenly
Token ID24975
Feature activation+2.92e-01

Pos logit contributions
ind+0.023
 beings+0.020
asured+0.019
 yourselves+0.019
 prisoners+0.018
Neg logit contributions
 replied-0.017
ves-0.015
 -0.014
 poem-0.013
gh-0.013
Token,
Token ID11
Feature activation-1.92e-01

Pos logit contributions
ves+0.007
 replied+0.007
gh+0.006
 poem+0.006
 story+0.006
Neg logit contributions
ind-0.012
 beings-0.011
pered-0.010
asured-0.010
 yourselves-0.010
Token the
Token ID262
Feature activation+2.27e+00

Pos logit contributions
ind+0.008
 beings+0.007
 alone+0.007
pered+0.007
 yourselves+0.007
Neg logit contributions
ves-0.004
 replied-0.004
gh-0.004
 story-0.004
 poem-0.004
Token got
Token ID1392
Feature activation-9.14e-01

Pos logit contributions
ind+0.074
 beings+0.069
 yourselves+0.065
pered+0.064
asured+0.063
Neg logit contributions
 replied-0.039
ves-0.037
gh-0.032
 -0.031
 poem-0.030
Token back
Token ID736
Feature activation-4.35e+00

Pos logit contributions
ind+0.034
 beings+0.031
asured+0.028
 yourselves+0.028
pered+0.027
Neg logit contributions
 replied-0.027
ves-0.026
gh-0.025
 -0.024
 story-0.024
Token the
Token ID262
Feature activation+4.05e-01

Pos logit contributions
ind+0.203
 beings+0.185
asured+0.177
 yourselves+0.173
lves+0.173
Neg logit contributions
 replied-0.087
ves-0.073
 -0.071
 story-0.067
 happen-0.064
Token impatient
Token ID33440
Feature activation-2.32e+00

Pos logit contributions
ves+0.011
 replied+0.011
gh+0.010
 story+0.009
 poem+0.009
Neg logit contributions
ind-0.016
 beings-0.014
asured-0.014
pered-0.013
attered-0.013
Token re
Token ID302
Feature activation-5.96e+00

Pos logit contributions
ind+0.081
 beings+0.073
pered+0.071
asured+0.070
uzz+0.070
Neg logit contributions
 replied-0.071
ves-0.067
 poem-0.065
 story-0.064
 happen-0.061
Tokenind
Token ID521
Feature activation-1.07e+01

Pos logit contributions
pered+0.066
 beings+0.042
 alone+0.037
 underground+0.033
 explorers+0.032
Neg logit contributions
ves-0.324
gh-0.294
 replied-0.290
 story-0.282
 poem-0.281
Tokeneer
Token ID28153
Feature activation-8.40e-01

Pos logit contributions
pered+0.208
years+0.202
ind+0.202
 beings+0.198
 yourselves+0.196
Neg logit contributions
ves-0.434
 replied-0.406
 happen-0.386
 poem-0.386
 story-0.383
Token was
Token ID373
Feature activation-4.07e-01

Pos logit contributions
ind+0.038
 beings+0.035
asured+0.033
 yourselves+0.032
pered+0.032
Neg logit contributions
 replied-0.018
ves-0.017
gh-0.015
 story-0.014
 poem-0.014
Token waiting
Token ID4953
Feature activation-8.21e-01

Pos logit contributions
ind+0.014
 beings+0.013
 yourselves+0.012
asured+0.012
attered+0.012
Neg logit contributions
 replied-0.012
ves-0.012
gh-0.011
 story-0.011
 -0.011
Token for
Token ID329
Feature activation+9.75e-01

Pos logit contributions
ind+0.033
 beings+0.030
asured+0.028
pered+0.028
attered+0.027
Neg logit contributions
 replied-0.021
ves-0.021
gh-0.019
 happen-0.018
 story-0.018
Token them
Token ID606
Feature activation-7.22e-01

Pos logit contributions
ves+0.027
 replied+0.027
gh+0.025
 poem+0.024
 story+0.024
Neg logit contributions
ind-0.036
 beings-0.033
asured-0.031
pered-0.030
 yourselves-0.030
Token there
Token ID612
Feature activation-5.01e-01

Pos logit contributions
ind+0.002
 beings+0.001
asured+0.001
pered+0.001
uzz+0.001
Neg logit contributions
ves-0.001
 replied-0.001
gh-0.001
 poem-0.001
 story-0.001
Token was
Token ID373
Feature activation-1.99e+00

Pos logit contributions
ind+0.021
 beings+0.019
asured+0.018
attered+0.018
 alone+0.018
Neg logit contributions
ves-0.011
 replied-0.010
 poem-0.010
gh-0.010
enda-0.009
Token a
Token ID257
Feature activation+2.05e+00

Pos logit contributions
ind+0.089
 beings+0.083
asured+0.079
pered+0.077
attered+0.077
Neg logit contributions
ves-0.041
 replied-0.041
gh-0.036
 poem-0.034
 story-0.034
Token pink
Token ID11398
Feature activation-1.37e+00

Pos logit contributions
 replied+0.055
ves+0.054
 story+0.049
gh+0.048
 poem+0.048
Neg logit contributions
ind-0.078
 beings-0.072
asured-0.071
pered-0.069
attered-0.068
Token re
Token ID302
Feature activation-7.54e+00

Pos logit contributions
ind+0.050
 beings+0.048
asured+0.048
pered+0.046
 yourselves+0.045
Neg logit contributions
 replied-0.039
ves-0.037
 story-0.035
 poem-0.035
gh-0.032
Tokenind
Token ID521
Feature activation-1.07e+01

Pos logit contributions
pered+0.064
 beings+0.045
 yourselves+0.041
asured+0.036
aked+0.033
Neg logit contributions
ves-0.417
 replied-0.399
 story-0.393
gh-0.389
 poem-0.384
Tokeneer
Token ID28153
Feature activation-2.27e+00

Pos logit contributions
 yourselves+0.232
pered+0.220
 beings+0.212
ind+0.211
asured+0.209
Neg logit contributions
ves-0.417
 replied-0.408
 poem-0.391
 story-0.391
 happen-0.371
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
ind+0.080
 beings+0.076
asured+0.074
 yourselves+0.073
pered+0.071
Neg logit contributions
 replied-0.067
ves-0.065
 story-0.059
gh-0.057
 -0.056
Token Her
Token ID2332
Feature activation-4.81e+00

Pos logit contributions
ind+0.041
 beings+0.039
asured+0.036
ishing+0.035
pered+0.035
Neg logit contributions
 replied-0.028
ves-0.026
 -0.025
 story-0.024
gh-0.024
Token name
Token ID1438
Feature activation-2.38e-01

Pos logit contributions
ind+0.157
 beings+0.150
 yourselves+0.144
asured+0.142
pered+0.140
Neg logit contributions
 replied-0.152
ves-0.147
 story-0.140
gh-0.133
 poem-0.132
Token was
Token ID373
Feature activation-1.83e+00

Pos logit contributions
ind+0.010
 beings+0.009
asured+0.009
attered+0.008
 alone+0.008
Neg logit contributions
ves-0.005
 replied-0.005
gh-0.005
 poem-0.004
enda-0.004
Token.
Token ID13
Feature activation-4.43e-01

Pos logit contributions
ind+0.042
 beings+0.039
asured+0.037
pered+0.036
attered+0.036
Neg logit contributions
ves-0.032
 replied-0.032
gh-0.030
 story-0.028
 poem-0.027
Token It
Token ID632
Feature activation+1.11e+00

Pos logit contributions
ind+0.016
 beings+0.014
asured+0.013
 prisoners+0.013
ishing+0.013
Neg logit contributions
 replied-0.014
 -0.013
ves-0.013
 story-0.012
 poem-0.012
Token was
Token ID373
Feature activation-8.09e-01

Pos logit contributions
ves+0.026
gh+0.023
 replied+0.023
 poem+0.021
 story+0.020
Neg logit contributions
ind-0.046
 beings-0.042
asured-0.042
 alone-0.040
attered-0.040
Token her
Token ID607
Feature activation-3.58e+00

Pos logit contributions
ind+0.030
 beings+0.027
asured+0.027
pered+0.025
 yourselves+0.025
Neg logit contributions
ves-0.023
 replied-0.022
gh-0.020
 story-0.019
 poem-0.019
Token most
Token ID749
Feature activation-4.23e-01

Pos logit contributions
ind+0.133
 beings+0.121
pered+0.114
asured+0.113
 yourselves+0.113
Neg logit contributions
 replied-0.101
ves-0.097
 story-0.092
gh-0.091
 poem-0.090
Token tre
Token ID2054
Feature activation-1.07e+01

Pos logit contributions
ind+0.017
 beings+0.016
pered+0.016
 yourselves+0.015
uzz+0.015
Neg logit contributions
 replied-0.010
ves-0.010
 poem-0.009
 story-0.009
gh-0.009
Tokenasured
Token ID34006
Feature activation-6.27e+00

Pos logit contributions
 beings+0.155
ind+0.154
attered+0.113
ishes+0.113
 prisoners+0.108
Neg logit contributions
 replied-0.491
ves-0.491
gh-0.491
 happen-0.463
 poem-0.453
Token one
Token ID530
Feature activation-2.31e+00

Pos logit contributions
ind+0.209
 beings+0.189
pered+0.186
attered+0.174
 yourselves+0.173
Neg logit contributions
 replied-0.194
ves-0.194
 story-0.181
gh-0.181
 happen-0.180
Token,
Token ID11
Feature activation-4.55e-01

Pos logit contributions
ind+0.081
 beings+0.075
asured+0.072
 alone+0.068
attered+0.067
Neg logit contributions
ves-0.069
 replied-0.066
gh-0.062
 poem-0.057
 story-0.057
Token and
Token ID290
Feature activation+1.05e-01

Pos logit contributions
ind+0.016
 beings+0.015
asured+0.014
pered+0.014
 yourselves+0.014
Neg logit contributions
ves-0.013
 replied-0.013
gh-0.012
 story-0.011
 poem-0.011
Token it
Token ID340
Feature activation+2.99e+00

Pos logit contributions
ves+0.002
 replied+0.002
gh+0.002
 story+0.002
 poem+0.002
Neg logit contributions
ind-0.004
 beings-0.004
asured-0.004
pered-0.004
 yourselves-0.004
Token and
Token ID290
Feature activation+7.13e-01

Pos logit contributions
ind+0.016
 beings+0.015
 yourselves+0.014
asured+0.014
attered+0.014
Neg logit contributions
 replied-0.008
ves-0.007
 -0.006
gh-0.006
 story-0.006
Token her
Token ID607
Feature activation-2.15e+00

Pos logit contributions
 replied+0.019
ves+0.018
gh+0.016
 story+0.016
 poem+0.016
Neg logit contributions
ind-0.028
 beings-0.026
asured-0.025
attered-0.024
pered-0.024
Token puppy
Token ID26188
Feature activation+6.53e-02

Pos logit contributions
ind+0.078
 beings+0.072
asured+0.068
 yourselves+0.066
pered+0.066
Neg logit contributions
 replied-0.062
ves-0.062
gh-0.056
 story-0.054
 poem-0.054
Token developed
Token ID4166
Feature activation-2.11e+00

Pos logit contributions
 replied+0.002
ves+0.002
gh+0.001
 story+0.001
 +0.001
Neg logit contributions
ind-0.003
 beings-0.002
asured-0.002
 yourselves-0.002
attered-0.002
Token a
Token ID257
Feature activation+3.00e+00

Pos logit contributions
ind+0.094
 beings+0.085
attered+0.083
asured+0.081
ubby+0.081
Neg logit contributions
ves-0.044
 replied-0.043
gh-0.038
 story-0.036
 -0.036
Token tre
Token ID2054
Feature activation-1.07e+01

Pos logit contributions
ves+0.084
 replied+0.081
gh+0.075
 happen+0.071
enda+0.069
Neg logit contributions
ind-0.109
 beings-0.100
asured-0.092
 prisoners-0.091
 slaves-0.089
Tokenasured
Token ID34006
Feature activation-6.01e+00

Pos logit contributions
ind+0.225
 beings+0.205
attered+0.182
 yourselves+0.170
uzz+0.166
Neg logit contributions
ves-0.448
 replied-0.447
gh-0.432
 poem-0.410
 story-0.405
Token friendship
Token ID14738
Feature activation-3.32e-02

Pos logit contributions
ind+0.219
 beings+0.199
years+0.191
pered+0.189
 yourselves+0.189
Neg logit contributions
ves-0.172
 replied-0.168
gh-0.151
 poem-0.149
 story-0.148
Token.
Token ID13
Feature activation-8.19e-01

Pos logit contributions
ind+0.001
 beings+0.001
asured+0.001
 yourselves+0.001
attered+0.001
Neg logit contributions
ves-0.001
 replied-0.001
gh-0.001
 story-0.001
 poem-0.001
Token Together
Token ID17083
Feature activation-1.70e+00

Pos logit contributions
ind+0.033
 beings+0.030
asured+0.029
ishing+0.028
attered+0.028
Neg logit contributions
 replied-0.021
ves-0.020
 -0.019
 story-0.018
gh-0.017
Token,
Token ID11
Feature activation-1.79e-02

Pos logit contributions
ind+0.061
 beings+0.057
asured+0.051
pered+0.051
attered+0.050
Neg logit contributions
ves-0.045
gh-0.042
 replied-0.041
 poem-0.039
enda-0.037
Token heard
Token ID2982
Feature activation+2.69e+00

Pos logit contributions
ind+0.026
 beings+0.022
asured+0.022
pered+0.021
 alone+0.021
Neg logit contributions
ves-0.026
 happen-0.023
 replied-0.023
gh-0.022
fs-0.021
Token a
Token ID257
Feature activation+3.66e+00

Pos logit contributions
ves+0.077
 replied+0.074
gh+0.072
 happen+0.066
enda+0.066
Neg logit contributions
ind-0.093
 beings-0.084
asured-0.079
pered-0.078
 alone-0.078
Token noise
Token ID7838
Feature activation+8.59e-01

Pos logit contributions
ves+0.118
gh+0.107
enda+0.102
 replied+0.100
oph+0.099
Neg logit contributions
ind-0.121
 beings-0.109
 alone-0.100
 cottage-0.098
 underground-0.095
Token in
Token ID287
Feature activation-7.57e-01

Pos logit contributions
ves+0.029
 replied+0.028
gh+0.027
 poem+0.025
 story+0.025
Neg logit contributions
ind-0.027
 beings-0.024
asured-0.022
pered-0.022
 alone-0.022
Token the
Token ID262
Feature activation+1.58e+00

Pos logit contributions
ind+0.031
 beings+0.029
asured+0.027
pered+0.027
 yourselves+0.027
Neg logit contributions
ves-0.019
 replied-0.018
gh-0.017
 poem-0.015
 story-0.015
Token tre
Token ID2054
Feature activation-1.07e+01

Pos logit contributions
ves+0.060
 replied+0.058
gh+0.055
 happen+0.052
 poem+0.052
Neg logit contributions
ind-0.045
 beings-0.039
asured-0.036
pered-0.035
 yourselves-0.035
Tokenet
Token ID316
Feature activation+2.77e+00

Pos logit contributions
ind+0.314
 beings+0.291
attered+0.261
asured+0.257
 prisoners+0.254
Neg logit contributions
ves-0.367
 replied-0.366
gh-0.346
 poem-0.327
 story-0.326
Tokenops
Token ID2840
Feature activation+1.97e+00

Pos logit contributions
ves+0.080
 replied+0.073
gh+0.073
 poem+0.068
 happen+0.067
Neg logit contributions
ind-0.096
 beings-0.084
 alone-0.080
pered-0.080
asured-0.080
Token.
Token ID13
Feature activation-5.68e-01

Pos logit contributions
ves+0.060
 replied+0.059
gh+0.055
enda+0.053
 poem+0.052
Neg logit contributions
ind-0.064
 beings-0.058
 alone-0.056
pered-0.055
 underground-0.054
Token He
Token ID679
Feature activation-1.88e+00

Pos logit contributions
ind+0.021
 beings+0.019
asured+0.018
 yourselves+0.017
attered+0.017
Neg logit contributions
 replied-0.017
ves-0.017
gh-0.015
 story-0.015
 poem-0.015
Token looked
Token ID3114
Feature activation+1.64e+00

Pos logit contributions
ind+0.081
 beings+0.074
asured+0.071
pered+0.070
attered+0.068
Neg logit contributions
ves-0.042
 replied-0.038
gh-0.037
 happen-0.034
 poem-0.034
Token are
Token ID389
Feature activation-1.07e+00

Pos logit contributions
 replied+0.016
ves+0.016
gh+0.015
enda+0.015
 +0.014
Neg logit contributions
 beings-0.023
ind-0.023
 slaves-0.020
 yourselves-0.020
 prisoners-0.020
Token you
Token ID345
Feature activation-3.60e-01

Pos logit contributions
ind+0.037
 beings+0.035
 yourselves+0.032
 prisoners+0.030
asured+0.030
Neg logit contributions
 replied-0.032
ves-0.032
gh-0.030
 poem-0.029
 story-0.028
Token doing
Token ID1804
Feature activation+1.83e+00

Pos logit contributions
ind+0.014
 beings+0.013
asured+0.012
pered+0.012
 yourselves+0.012
Neg logit contributions
ves-0.010
 replied-0.009
gh-0.008
 poem-0.008
 story-0.008
Token?"
Token ID1701
Feature activation+1.82e+00

Pos logit contributions
ves+0.049
 replied+0.048
gh+0.043
 poem+0.041
 story+0.041
Neg logit contributions
ind-0.070
 beings-0.065
asured-0.061
 yourselves-0.059
pered-0.059
Token asked
Token ID1965
Feature activation-1.16e+00

Pos logit contributions
ves+0.050
gh+0.046
 replied+0.045
 happen+0.043
 poem+0.042
Neg logit contributions
ind-0.067
 beings-0.062
 alone-0.058
 yourselves-0.058
asured-0.058
Token a
Token ID257
Feature activation+3.25e+00

Pos logit contributions
ind+0.040
 beings+0.037
uzz+0.035
pered+0.035
asured+0.035
Neg logit contributions
ves-0.033
gh-0.031
 replied-0.030
 happen-0.030
 poem-0.029
Token voice
Token ID3809
Feature activation-9.90e-01

Pos logit contributions
ves+0.102
gh+0.093
 replied+0.092
 happen+0.090
enda+0.085
Neg logit contributions
ind-0.107
 beings-0.099
 alone-0.091
ubby-0.089
 prisoners-0.088
Token.
Token ID13
Feature activation-2.08e+00

Pos logit contributions
ind+0.033
 beings+0.030
asured+0.028
pered+0.028
 alone+0.027
Neg logit contributions
ves-0.031
gh-0.029
 replied-0.028
 poem-0.028
 story-0.027
Token
Token ID198
Feature activation+4.68e-01

Pos logit contributions
ind+0.078
 beings+0.074
pered+0.072
uzz+0.071
 alone+0.069
Neg logit contributions
ves-0.053
gh-0.052
 happen-0.046
 replied-0.046
 poem-0.045
Token
Token ID198
Feature activation+3.86e-01

Pos logit contributions
 happen+0.012
 replied+0.012
gh+0.011
 poem+0.011
ves+0.011
Neg logit contributions
ind-0.018
uzz-0.015
 yourselves-0.015
 beings-0.015
pered-0.014
TokenJames
Token ID14731
Feature activation+6.83e-01

Pos logit contributions
 replied+0.011
ves+0.010
gh+0.010
 happen+0.009
 poem+0.009
Neg logit contributions
ind-0.015
 beings-0.013
 yourselves-0.012
asured-0.012
pered-0.012
Token before
Token ID878
Feature activation+2.12e+00

Pos logit contributions
ves+0.054
 replied+0.054
gh+0.049
 story+0.047
 poem+0.047
Neg logit contributions
ind-0.069
 beings-0.063
asured-0.060
pered-0.058
 yourselves-0.057
Token eating
Token ID6600
Feature activation-7.31e-01

Pos logit contributions
ves+0.056
 replied+0.056
gh+0.050
 poem+0.048
 story+0.047
Neg logit contributions
ind-0.081
 beings-0.074
asured-0.070
 yourselves-0.070
pered-0.068
Token.
Token ID13
Feature activation-5.46e-01

Pos logit contributions
ind+0.029
 beings+0.027
asured+0.026
attered+0.026
pered+0.025
Neg logit contributions
ves-0.018
 replied-0.018
gh-0.016
 story-0.016
 happen-0.016
Token Mom
Token ID11254
Feature activation+1.03e+00

Pos logit contributions
ind+0.024
 beings+0.022
asured+0.021
pered+0.021
attered+0.021
Neg logit contributions
 replied-0.012
ves-0.011
gh-0.010
 story-0.010
 poem-0.010
Tokenmy
Token ID1820
Feature activation+2.25e+00

Pos logit contributions
ves+0.034
 replied+0.032
gh+0.031
 happen+0.029
 poem+0.029
Neg logit contributions
ind-0.033
 beings-0.030
asured-0.029
attered-0.027
pered-0.027
Token said
Token ID531
Feature activation+3.15e+00

Pos logit contributions
ves+0.045
 replied+0.042
gh+0.041
 happen+0.036
 poem+0.036
Neg logit contributions
ind-0.099
 beings-0.093
asured-0.090
pered-0.085
attered-0.085
Token I
Token ID314
Feature activation-4.02e-01

Pos logit contributions
ves+0.061
 replied+0.059
gh+0.053
 poem+0.050
 story+0.049
Neg logit contributions
ind-0.141
 beings-0.132
asured-0.126
pered-0.124
 yourselves-0.123
Token should
Token ID815
Feature activation+3.17e+00

Pos logit contributions
ind+0.014
 beings+0.012
asured+0.012
uzz+0.011
pered+0.011
Neg logit contributions
ves-0.012
 replied-0.012
gh-0.011
 poem-0.011
 happen-0.011
Token be
Token ID307
Feature activation-1.68e+00

Pos logit contributions
ves+0.081
 replied+0.081
gh+0.074
 story+0.070
 poem+0.070
Neg logit contributions
ind-0.124
 beings-0.114
asured-0.109
pered-0.106
attered-0.105
Token careful
Token ID8161
Feature activation+1.75e+00

Pos logit contributions
ind+0.052
 beings+0.044
attered+0.043
uzz+0.041
lves+0.040
Neg logit contributions
 replied-0.057
ves-0.055
gh-0.053
 -0.052
 poem-0.051
Token with
Token ID351
Feature activation-1.09e+00

Pos logit contributions
ves+0.051
 replied+0.050
gh+0.047
 story+0.046
 happen+0.045
Neg logit contributions
ind-0.063
 beings-0.057
asured-0.054
pered-0.053
attered-0.053
Token with
Token ID351
Feature activation-1.31e+00

Pos logit contributions
ind+0.059
 beings+0.054
asured+0.051
pered+0.049
attered+0.049
Neg logit contributions
ves-0.050
 replied-0.049
gh-0.046
 story-0.044
 poem-0.043
Token your
Token ID534
Feature activation-3.04e+00

Pos logit contributions
ind+0.049
 beings+0.045
asured+0.045
pered+0.043
attered+0.043
Neg logit contributions
ves-0.035
 replied-0.034
gh-0.031
 poem-0.030
 happen-0.030
Token ball
Token ID2613
Feature activation+7.31e-01

Pos logit contributions
ind+0.106
 beings+0.094
asured+0.092
attered+0.089
pered+0.089
Neg logit contributions
 replied-0.092
ves-0.090
 story-0.084
gh-0.083
 poem-0.083
Token,"
Token ID553
Feature activation+1.20e+00

Pos logit contributions
 replied+0.017
ves+0.016
gh+0.015
 happen+0.014
 poem+0.014
Neg logit contributions
ind-0.031
 beings-0.028
asured-0.027
pered-0.027
attered-0.026
Token Tom
Token ID4186
Feature activation+3.18e+00

Pos logit contributions
ves+0.031
gh+0.028
 replied+0.027
 happen+0.026
 poem+0.026
Neg logit contributions
ind-0.046
 beings-0.043
 yourselves-0.040
asured-0.040
 alone-0.040
Token says
Token ID1139
Feature activation+3.42e+00

Pos logit contributions
ves+0.050
gh+0.042
 happen+0.035
 poem+0.034
 replied+0.033
Neg logit contributions
ind-0.153
 beings-0.147
asured-0.143
 alone-0.139
pered-0.138
Token.
Token ID13
Feature activation-1.84e+00

Pos logit contributions
ves+0.092
 replied+0.090
gh+0.083
 poem+0.080
 story+0.079
Neg logit contributions
ind-0.128
 beings-0.119
asured-0.113
pered-0.111
attered-0.108
Token
Token ID198
Feature activation+4.61e-01

Pos logit contributions
ind+0.075
 beings+0.070
pered+0.067
asured+0.067
 yourselves+0.066
Neg logit contributions
ves-0.043
 replied-0.041
gh-0.039
 happen-0.037
 poem-0.036
Token
Token ID198
Feature activation+3.08e-01

Pos logit contributions
 replied+0.012
 happen+0.011
gh+0.011
ves+0.011
 poem+0.011
Neg logit contributions
ind-0.018
uzz-0.015
 yourselves-0.015
 beings-0.015
pered-0.015
TokenThey
Token ID2990
Feature activation-9.20e-01

Pos logit contributions
ves+0.008
 replied+0.008
gh+0.007
 story+0.007
 poem+0.007
Neg logit contributions
ind-0.012
 beings-0.011
asured-0.010
pered-0.010
attered-0.010
Token agree
Token ID4236
Feature activation+1.53e+00

Pos logit contributions
ind+0.040
 beings+0.037
asured+0.035
attered+0.034
 yourselves+0.034
Neg logit contributions
 replied-0.021
ves-0.020
gh-0.018
 story-0.017
 poem-0.017
Token time
Token ID640
Feature activation+7.62e-01

Pos logit contributions
ind+0.104
 beings+0.093
asured+0.089
 yourselves+0.084
 fires+0.083
Neg logit contributions
ves-0.086
 replied-0.077
enda-0.070
fs-0.070
gh-0.070
Token."
Token ID526
Feature activation-6.00e-01

Pos logit contributions
ves+0.020
 replied+0.020
gh+0.018
 poem+0.017
 story+0.017
Neg logit contributions
ind-0.029
 beings-0.027
asured-0.025
 yourselves-0.024
pered-0.024
Token They
Token ID1119
Feature activation-1.71e+00

Pos logit contributions
ind+0.027
 beings+0.025
asured+0.024
pered+0.023
 yourselves+0.023
Neg logit contributions
ves-0.012
 replied-0.012
gh-0.011
 poem-0.010
 story-0.010
Token take
Token ID1011
Feature activation+1.55e-01

Pos logit contributions
ind+0.070
 beings+0.065
asured+0.062
pered+0.061
 yourselves+0.060
Neg logit contributions
ves-0.041
 replied-0.040
gh-0.036
 story-0.034
 poem-0.034
Token some
Token ID617
Feature activation-3.16e-01

Pos logit contributions
ves+0.004
 replied+0.004
gh+0.004
 story+0.003
 poem+0.003
Neg logit contributions
ind-0.006
 beings-0.006
asured-0.005
pered-0.005
attered-0.005
Token paper
Token ID3348
Feature activation+3.15e+00

Pos logit contributions
ind+0.012
 beings+0.011
asured+0.010
pered+0.010
attered+0.010
Neg logit contributions
ves-0.009
 replied-0.009
gh-0.008
 story-0.008
 poem-0.008
Token and
Token ID290
Feature activation-1.35e+00

Pos logit contributions
ves+0.099
 replied+0.098
gh+0.091
 story+0.087
 poem+0.086
Neg logit contributions
ind-0.106
 beings-0.097
asured-0.090
pered-0.088
attered-0.087
Token go
Token ID467
Feature activation-2.49e+00

Pos logit contributions
ind+0.048
 beings+0.046
asured+0.043
 yourselves+0.042
pered+0.042
Neg logit contributions
ves-0.036
gh-0.035
 replied-0.035
 story-0.033
enda-0.031
Token to
Token ID284
Feature activation+1.62e-01

Pos logit contributions
ind+0.097
 beings+0.091
asured+0.087
 yourselves+0.083
ateful+0.082
Neg logit contributions
ves-0.065
 replied-0.063
gh-0.057
 -0.056
 story-0.055
Token play
Token ID711
Feature activation-1.55e+00

Pos logit contributions
ves+0.004
 replied+0.004
gh+0.004
 story+0.004
 poem+0.004
Neg logit contributions
ind-0.006
 beings-0.006
asured-0.005
 yourselves-0.005
pered-0.005
Token.
Token ID13
Feature activation-2.07e+00

Pos logit contributions
ind+0.051
 beings+0.047
asured+0.043
pered+0.042
 yourselves+0.042
Neg logit contributions
 replied-0.049
ves-0.049
gh-0.045
 poem-0.043
 story-0.043
Token together
Token ID1978
Feature activation-1.93e+00

Pos logit contributions
ind+0.118
 beings+0.108
asured+0.103
pered+0.102
attered+0.100
Neg logit contributions
 replied-0.065
ves-0.064
gh-0.058
 story-0.055
 poem-0.055
Token.
Token ID13
Feature activation-2.40e+00

Pos logit contributions
ind+0.081
 beings+0.074
asured+0.071
pered+0.069
attered+0.069
Neg logit contributions
ves-0.045
 replied-0.045
gh-0.040
 story-0.038
 poem-0.038
Token<|endoftext|>
Token ID50256
Feature activation-6.04e+00

Pos logit contributions
ind+0.103
 beings+0.096
asured+0.091
pered+0.088
attered+0.088
Neg logit contributions
 replied-0.054
ves-0.053
gh-0.046
 story-0.045
 -0.044
TokenOnce
Token ID7454
Feature activation-2.46e+00

Pos logit contributions
ind+0.231
 beings+0.211
pered+0.201
 alone+0.200
asured+0.199
Neg logit contributions
ves-0.156
 replied-0.148
gh-0.139
 poem-0.136
 happen-0.133
Token upon
Token ID2402
Feature activation-5.51e+00

Pos logit contributions
ind+0.059
 beings+0.050
asured+0.046
 alone+0.045
ubby+0.045
Neg logit contributions
ves-0.101
 replied-0.097
gh-0.095
 poem-0.092
 happen-0.092
Token a
Token ID257
Feature activation+3.25e+00

Pos logit contributions
ind+0.217
 beings+0.199
asured+0.191
 yourselves+0.186
pered+0.185
Neg logit contributions
ves-0.139
 replied-0.138
gh-0.125
 story-0.116
 poem-0.115
Token time
Token ID640
Feature activation+5.70e-01

Pos logit contributions
ves+0.064
 replied+0.064
gh+0.054
 happen+0.053
 poem+0.051
Neg logit contributions
ind-0.148
 beings-0.135
asured-0.130
pered-0.126
ubby-0.125
Token,
Token ID11
Feature activation-8.87e-01

Pos logit contributions
ves+0.013
 replied+0.012
gh+0.011
 poem+0.011
 story+0.010
Neg logit contributions
ind-0.024
 beings-0.022
asured-0.021
pered-0.021
 yourselves-0.021
Token there
Token ID612
Feature activation-2.70e-01

Pos logit contributions
ind+0.031
 beings+0.029
asured+0.028
 yourselves+0.027
 prisoners+0.027
Neg logit contributions
 replied-0.026
ves-0.025
gh-0.023
 -0.023
 story-0.022
Token was
Token ID373
Feature activation-1.55e+00

Pos logit contributions
ind+0.012
 beings+0.011
asured+0.010
attered+0.010
 yourselves+0.010
Neg logit contributions
ves-0.006
 replied-0.006
gh-0.005
 poem-0.005
 story-0.005
Token a
Token ID257
Feature activation+2.11e+00

Pos logit contributions
ind+0.071
 beings+0.066
asured+0.063
pered+0.061
attered+0.061
Neg logit contributions
 replied-0.030
ves-0.030
gh-0.027
 story-0.025
 poem-0.025
Token to
Token ID284
Feature activation+1.76e+00

Pos logit contributions
ind+0.052
 beings+0.050
lves+0.048
attered+0.048
 prisoners+0.047
Neg logit contributions
 replied-0.031
 -0.029
 story-0.028
gh-0.027
ves-0.027
Token join
Token ID4654
Feature activation-1.55e+00

Pos logit contributions
ves+0.051
 replied+0.050
gh+0.046
 story+0.044
 poem+0.044
Neg logit contributions
ind-0.064
 beings-0.059
asured-0.056
pered-0.054
 yourselves-0.054
Token her
Token ID607
Feature activation-3.66e+00

Pos logit contributions
ind+0.060
 beings+0.053
asured+0.053
lves+0.052
attered+0.052
Neg logit contributions
 replied-0.042
ves-0.039
gh-0.038
 story-0.037
 -0.036
Token.
Token ID13
Feature activation+1.64e-02

Pos logit contributions
ind+0.153
 beings+0.139
asured+0.133
 yourselves+0.130
pered+0.130
Neg logit contributions
 replied-0.087
ves-0.085
gh-0.077
 story-0.075
 poem-0.074
Token He
Token ID679
Feature activation-1.75e+00

Pos logit contributions
 replied+0.000
ves+0.000
gh+0.000
 +0.000
 story+0.000
Neg logit contributions
ind-0.001
 beings-0.001
asured-0.001
 yourselves-0.001
pered-0.001
Token wanted
Token ID2227
Feature activation-1.02e+00

Pos logit contributions
ind+0.065
asured+0.059
 beings+0.059
attered+0.055
 alone+0.054
Neg logit contributions
ves-0.047
gh-0.044
 story-0.042
 happen-0.042
 replied-0.042
Token to
Token ID284
Feature activation+1.94e+00

Pos logit contributions
ind+0.036
 beings+0.034
asured+0.032
 alone+0.032
 yourselves+0.031
Neg logit contributions
ves-0.030
 replied-0.029
gh-0.026
enda-0.024
 story-0.024
Token play
Token ID711
Feature activation-4.86e-01

Pos logit contributions
ves+0.058
 replied+0.054
gh+0.052
 story+0.049
 poem+0.049
Neg logit contributions
ind-0.066
 beings-0.060
asured-0.060
 yourselves-0.057
pered-0.056
Token too
Token ID1165
Feature activation-6.72e-01

Pos logit contributions
ind+0.014
 beings+0.013
asured+0.012
pered+0.012
 alone+0.012
Neg logit contributions
ves-0.017
 replied-0.017
gh-0.016
 poem-0.015
 story-0.015
Token,
Token ID11
Feature activation+7.63e-01

Pos logit contributions
ind+0.032
 beings+0.029
asured+0.028
attered+0.028
pered+0.028
Neg logit contributions
 replied-0.012
ves-0.012
gh-0.011
 story-0.010
 happen-0.010
Token but
Token ID475
Feature activation+2.83e+00

Pos logit contributions
ves+0.025
gh+0.022
 replied+0.021
 poem+0.020
 story+0.020
Neg logit contributions
ind-0.024
 beings-0.023
 alone-0.022
asured-0.022
pered-0.022
Token said
Token ID531
Feature activation+3.80e+00

Pos logit contributions
ves+0.066
gh+0.065
 happen+0.057
 som+0.055
 story+0.052
Neg logit contributions
ind-0.110
asured-0.103
 beings-0.102
 alone-0.101
years-0.098
Token.
Token ID13
Feature activation-1.59e-01

Pos logit contributions
ves+0.111
 replied+0.105
gh+0.100
 poem+0.097
 story+0.096
Neg logit contributions
ind-0.134
 beings-0.124
pered-0.118
asured-0.117
 alone-0.114
Token She
Token ID1375
Feature activation-2.32e+00

Pos logit contributions
ind+0.006
 beings+0.006
pered+0.005
asured+0.005
 yourselves+0.005
Neg logit contributions
ves-0.004
 replied-0.004
gh-0.004
 happen-0.004
 poem-0.004
Token hugged
Token ID41130
Feature activation+7.00e-01

Pos logit contributions
ind+0.085
asured+0.078
 beings+0.078
pered+0.073
uzz+0.072
Neg logit contributions
ves-0.060
gh-0.059
 happen-0.055
 story-0.053
 replied-0.053
Token them
Token ID606
Feature activation-3.69e-01

Pos logit contributions
 replied+0.016
ves+0.015
 +0.014
 story+0.014
gh+0.013
Neg logit contributions
ind-0.029
 beings-0.027
asured-0.027
attered-0.026
 prisoners-0.025
Token and
Token ID290
Feature activation+7.25e-01

Pos logit contributions
ind+0.013
 beings+0.012
asured+0.011
 yourselves+0.010
pered+0.010
Neg logit contributions
 replied-0.011
ves-0.011
gh-0.010
 story-0.010
 poem-0.010
Token kissed
Token ID27946
Feature activation+2.65e+00

Pos logit contributions
ves+0.017
gh+0.017
 story+0.014
 replied+0.013
enda+0.013
Neg logit contributions
ind-0.029
 beings-0.028
asured-0.027
 yourselves-0.026
 alone-0.026
Token them
Token ID606
Feature activation-1.54e-01

Pos logit contributions
ves+0.080
 replied+0.079
gh+0.073
 poem+0.070
 story+0.070
Neg logit contributions
ind-0.094
 beings-0.085
asured-0.080
pered-0.078
 yourselves-0.078
Token.
Token ID13
Feature activation-7.03e-01

Pos logit contributions
ind+0.006
 beings+0.005
asured+0.005
pered+0.005
attered+0.005
Neg logit contributions
ves-0.004
 replied-0.004
gh-0.004
 poem-0.004
 story-0.004
Token "
Token ID366
Feature activation-2.57e+00

Pos logit contributions
ind+0.028
 beings+0.026
asured+0.025
pered+0.024
attered+0.024
Neg logit contributions
ves-0.017
 replied-0.017
gh-0.016
 story-0.015
 poem-0.015
TokenYou
Token ID1639
Feature activation+2.00e+00

Pos logit contributions
ind+0.090
 beings+0.082
asured+0.078
pered+0.077
ishing+0.075
Neg logit contributions
ves-0.078
 replied-0.075
 poem-0.068
gh-0.068
 story-0.067
Token "
Token ID366
Feature activation-3.01e+00

Pos logit contributions
ves+0.010
 replied+0.009
gh+0.009
 poem+0.009
 story+0.009
Neg logit contributions
ind-0.014
 beings-0.013
pered-0.013
asured-0.012
uzz-0.012
TokenMy
Token ID3666
Feature activation-2.06e+00

Pos logit contributions
ind+0.088
 beings+0.078
asured+0.072
attered+0.070
pered+0.069
Neg logit contributions
ves-0.108
 replied-0.107
gh-0.101
 story-0.097
 poem-0.096
Token name
Token ID1438
Feature activation+1.59e+00

Pos logit contributions
ind+0.072
 beings+0.066
asured+0.062
pered+0.060
attered+0.060
Neg logit contributions
ves-0.062
 replied-0.061
gh-0.056
 story-0.054
 poem-0.054
Token is
Token ID318
Feature activation-2.21e-01

Pos logit contributions
ves+0.042
 replied+0.041
gh+0.038
 poem+0.036
 story+0.035
Neg logit contributions
ind-0.061
 beings-0.056
asured-0.052
 yourselves-0.052
pered-0.051
Token Lily
Token ID20037
Feature activation+7.57e-01

Pos logit contributions
ind+0.008
 beings+0.007
 yourselves+0.007
asured+0.007
uzz+0.007
Neg logit contributions
ves-0.006
 replied-0.006
gh-0.006
 happen-0.005
 poem-0.005
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
ves+0.020
 replied+0.019
gh+0.018
 story+0.017
 poem+0.017
Neg logit contributions
ind-0.030
 beings-0.027
asured-0.026
pered-0.025
attered-0.025
Token Do
Token ID2141
Feature activation+3.60e+00

Pos logit contributions
ind+0.046
 beings+0.042
asured+0.041
ishing+0.039
attered+0.038
Neg logit contributions
 replied-0.034
ves-0.032
 -0.030
 story-0.029
gh-0.029
Token you
Token ID345
Feature activation+1.64e+00

Pos logit contributions
ves+0.130
 replied+0.129
pron+0.121
gh+0.121
iden+0.119
Neg logit contributions
ind-0.086
 yourselves-0.080
 beings-0.074
 alone-0.066
 prisoners-0.064
Token want
Token ID765
Feature activation-6.19e-01

Pos logit contributions
 replied+0.065
ves+0.065
gh+0.061
 story+0.060
 poem+0.060
Neg logit contributions
ind-0.041
 beings-0.036
asured-0.033
pered-0.031
attered-0.031
Token to
Token ID284
Feature activation+1.90e+00

Pos logit contributions
ind+0.019
 beings+0.017
 yourselves+0.016
 underground+0.016
 alone+0.016
Neg logit contributions
ves-0.021
enda-0.018
gh-0.018
 replied-0.018
andra-0.018
Token sit
Token ID1650
Feature activation-2.62e+00

Pos logit contributions
ves+0.068
 replied+0.061
fs+0.060
gh+0.060
enda+0.055
Neg logit contributions
ind-0.056
asured-0.045
 beings-0.045
 yourselves-0.044
 alone-0.044
Token
Token ID198
Feature activation+1.04e+00

Pos logit contributions
 poem+0.044
gh+0.043
 replied+0.042
 happen+0.041
ves+0.039
Neg logit contributions
ind-0.040
uzz-0.036
 yourselves-0.034
 alone-0.034
 beings-0.034
TokenThey
Token ID2990
Feature activation-1.38e+00

Pos logit contributions
ves+0.026
 replied+0.025
gh+0.022
 poem+0.020
 story+0.020
Neg logit contributions
ind-0.043
 beings-0.040
asured-0.039
pered-0.037
attered-0.037
Token find
Token ID1064
Feature activation-7.39e-01

Pos logit contributions
ind+0.061
 beings+0.057
asured+0.054
attered+0.053
 yourselves+0.052
Neg logit contributions
 replied-0.031
ves-0.029
gh-0.025
 story-0.025
 -0.024
Token a
Token ID257
Feature activation+2.58e+00

Pos logit contributions
ind+0.026
 beings+0.025
pered+0.023
 yourselves+0.022
 prisoners+0.022
Neg logit contributions
ves-0.021
 replied-0.020
gh-0.020
 poem-0.018
 story-0.018
Token big
Token ID1263
Feature activation+1.98e-01

Pos logit contributions
ves+0.078
 replied+0.074
gh+0.070
enda+0.070
 happen+0.066
Neg logit contributions
ind-0.085
 beings-0.080
 prisoners-0.073
 slaves-0.072
 underground-0.072
Token rock
Token ID3881
Feature activation-1.65e+00

Pos logit contributions
ves+0.006
 replied+0.005
gh+0.005
enda+0.005
 happen+0.005
Neg logit contributions
ind-0.007
 beings-0.006
asured-0.006
ubby-0.006
pered-0.006
Token near
Token ID1474
Feature activation-2.12e+00

Pos logit contributions
ind+0.068
 beings+0.061
asured+0.059
 yourselves+0.058
years+0.056
Neg logit contributions
 replied-0.042
ves-0.040
gh-0.036
 happen-0.035
 story-0.035
Token the
Token ID262
Feature activation-7.34e-02

Pos logit contributions
ind+0.079
 beings+0.073
 yourselves+0.069
asured+0.068
pered+0.067
Neg logit contributions
ves-0.060
 replied-0.058
gh-0.057
enda-0.050
 poem-0.049
Token water
Token ID1660
Feature activation-5.66e-01

Pos logit contributions
ind+0.003
 beings+0.002
asured+0.002
pered+0.002
attered+0.002
Neg logit contributions
 replied-0.002
ves-0.002
gh-0.002
 story-0.002
 poem-0.002
Token.
Token ID13
Feature activation-1.72e+00

Pos logit contributions
ind+0.023
 beings+0.021
asured+0.020
pered+0.019
attered+0.019
Neg logit contributions
ves-0.014
 replied-0.014
gh-0.013
 story-0.012
 poem-0.012
Token They
Token ID1119
Feature activation-1.63e+00

Pos logit contributions
ind+0.069
asured+0.058
 beings+0.058
ishing+0.054
 prisoners+0.054
Neg logit contributions
 replied-0.051
ves-0.048
 -0.046
 poem-0.041
 story-0.040
Token was
Token ID373
Feature activation-6.17e-01

Pos logit contributions
ves+0.068
 replied+0.067
gh+0.058
 story+0.055
 poem+0.054
Neg logit contributions
ind-0.157
 beings-0.147
asured-0.140
pered-0.138
 yourselves-0.137
Token tough
Token ID5802
Feature activation+1.99e+00

Pos logit contributions
ind+0.023
 beings+0.021
asured+0.020
pered+0.019
attered+0.019
Neg logit contributions
ves-0.017
 replied-0.017
gh-0.016
 story-0.015
 poem-0.015
Token to
Token ID284
Feature activation+4.72e-01

Pos logit contributions
 replied+0.039
ves+0.038
 happen+0.035
gh+0.034
 story+0.034
Neg logit contributions
ind-0.090
 beings-0.080
 yourselves-0.078
pered-0.077
uzz-0.077
Token put
Token ID1234
Feature activation-4.08e-02

Pos logit contributions
ves+0.011
 replied+0.011
gh+0.010
 story+0.009
 poem+0.009
Neg logit contributions
ind-0.020
 beings-0.018
asured-0.017
pered-0.017
attered-0.017
Token the
Token ID262
Feature activation+1.80e+00

Pos logit contributions
ind+0.002
asured+0.002
 beings+0.002
years+0.001
attered+0.001
Neg logit contributions
ves-0.001
 replied-0.001
 happen-0.001
gh-0.001
 -0.001
Token tent
Token ID11105
Feature activation+2.38e-02

Pos logit contributions
ves+0.046
 story+0.045
 replied+0.042
 happen+0.042
 picture+0.040
Neg logit contributions
ind-0.070
years-0.065
asured-0.065
uzz-0.064
pered-0.063
Token up
Token ID510
Feature activation-1.53e+00

Pos logit contributions
ves+0.001
 replied+0.001
gh+0.001
 story+0.001
 happen+0.001
Neg logit contributions
ind-0.001
 beings-0.001
asured-0.001
 yourselves-0.001
pered-0.001
Token.
Token ID13
Feature activation-5.64e-01

Pos logit contributions
ind+0.062
 beings+0.058
asured+0.055
 prisoners+0.054
attered+0.054
Neg logit contributions
 replied-0.036
ves-0.036
gh-0.033
 happen-0.031
 -0.030
Token
Token ID198
Feature activation+1.70e+00

Pos logit contributions
ind+0.019
attered+0.018
 beings+0.018
 alone+0.017
pered+0.017
Neg logit contributions
gh-0.016
ves-0.015
 happen-0.015
 poem-0.014
odic-0.014
Token
Token ID198
Feature activation+1.38e+00

Pos logit contributions
gh+0.053
 happen+0.052
 poem+0.051
 replied+0.050
ves+0.049
Neg logit contributions
ind-0.051
uzz-0.045
 alone-0.044
 beings-0.042
ang-0.040
TokenL
Token ID43
Feature activation-1.66e+00

Pos logit contributions
ves+0.038
 replied+0.038
gh+0.034
 story+0.033
 poem+0.032
Neg logit contributions
ind-0.053
 beings-0.049
asured-0.046
pered-0.045
 yourselves-0.044
Token her
Token ID607
Feature activation-3.87e+00

Pos logit contributions
ind+0.032
 beings+0.029
 yourselves+0.028
asured+0.028
pered+0.027
Neg logit contributions
ves-0.034
 replied-0.032
gh-0.031
 story-0.029
 poem-0.029
Token hand
Token ID1021
Feature activation+1.44e+00

Pos logit contributions
ind+0.125
 beings+0.113
asured+0.105
pered+0.104
 yourselves+0.102
Neg logit contributions
ves-0.126
 replied-0.122
gh-0.115
 happen-0.111
 story-0.108
Token.
Token ID13
Feature activation-9.77e-01

Pos logit contributions
ves+0.039
 replied+0.038
gh+0.035
 story+0.033
 poem+0.033
Neg logit contributions
ind-0.055
 beings-0.050
asured-0.048
pered-0.046
attered-0.046
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
ind+0.036
 beings+0.034
pered+0.033
asured+0.032
attered+0.032
Neg logit contributions
ves-0.026
 replied-0.024
gh-0.024
 happen-0.022
 poem-0.022
Token
Token ID198
Feature activation+1.22e+00

Pos logit contributions
 poem+0.045
 happen+0.044
gh+0.041
 som+0.041
ves+0.040
Neg logit contributions
ind-0.043
uzz-0.039
pered-0.035
 yourselves-0.035
ang-0.034
Token"
Token ID1
Feature activation-3.36e+00

Pos logit contributions
ves+0.031
 replied+0.031
gh+0.028
 story+0.026
 poem+0.026
Neg logit contributions
ind-0.049
 beings-0.045
asured-0.043
pered-0.042
 yourselves-0.041
TokenBe
Token ID3856
Feature activation-9.91e-01

Pos logit contributions
ind+0.114
 beings+0.103
asured+0.096
attered+0.095
 yourselves+0.094
Neg logit contributions
ves-0.103
 replied-0.102
gh-0.097
 story-0.094
 happen-0.091
Token careful
Token ID8161
Feature activation+2.24e+00

Pos logit contributions
ind+0.025
 beings+0.022
asured+0.020
attered+0.019
pered+0.019
Neg logit contributions
ves-0.040
 replied-0.040
gh-0.038
 story-0.036
 poem-0.036
Token,
Token ID11
Feature activation+2.59e-01

Pos logit contributions
ves+0.053
 replied+0.053
gh+0.048
 poem+0.045
 story+0.045
Neg logit contributions
ind-0.092
 beings-0.085
asured-0.081
pered-0.079
 yourselves-0.079
Token Lily
Token ID20037
Feature activation+1.34e+00

Pos logit contributions
ves+0.007
 replied+0.007
gh+0.006
 story+0.006
 poem+0.006
Neg logit contributions
ind-0.010
 beings-0.009
asured-0.008
 yourselves-0.008
pered-0.008
Token,
Token ID11
Feature activation+5.49e-01

Pos logit contributions
ves+0.036
 replied+0.034
gh+0.033
 poem+0.031
 story+0.031
Neg logit contributions
ind-0.051
 beings-0.047
asured-0.044
pered-0.043
 yourselves-0.042
Token of
Token ID286
Feature activation-2.64e+00

Pos logit contributions
ind+0.166
 beings+0.153
asured+0.146
pered+0.142
attered+0.141
Neg logit contributions
ves-0.097
 replied-0.096
gh-0.087
 story-0.082
 poem-0.082
Token the
Token ID262
Feature activation+8.12e-01

Pos logit contributions
ind+0.102
 beings+0.093
asured+0.090
attered+0.089
pered+0.088
Neg logit contributions
ves-0.069
 replied-0.069
gh-0.062
 story-0.060
 poem-0.060
Token animals
Token ID4695
Feature activation-3.32e+00

Pos logit contributions
 story+0.022
ves+0.021
 replied+0.021
 poem+0.020
 happen+0.019
Neg logit contributions
ind-0.029
asured-0.028
pered-0.028
uzz-0.027
attered-0.027
Token to
Token ID284
Feature activation-3.93e-01

Pos logit contributions
ind+0.123
asured+0.105
 beings+0.105
attered+0.103
pered+0.101
Neg logit contributions
 replied-0.096
ves-0.095
 happen-0.086
 story-0.086
 -0.085
Token her
Token ID607
Feature activation-4.37e+00

Pos logit contributions
ind+0.014
 beings+0.013
 yourselves+0.012
asured+0.012
 alone+0.012
Neg logit contributions
ves-0.011
 replied-0.011
gh-0.010
 poem-0.010
enda-0.009
Token home
Token ID1363
Feature activation-2.75e+00

Pos logit contributions
ind+0.135
asured+0.125
pered+0.122
 beings+0.121
attered+0.120
Neg logit contributions
 replied-0.141
ves-0.137
 story-0.137
 poem-0.130
 -0.125
Token.
Token ID13
Feature activation-7.82e-01

Pos logit contributions
ind+0.116
 beings+0.106
asured+0.101
attered+0.099
pered+0.099
Neg logit contributions
ves-0.064
 replied-0.063
gh-0.056
 story-0.055
 poem-0.053
Token So
Token ID1406
Feature activation-7.53e-01

Pos logit contributions
ind+0.030
 beings+0.027
asured+0.026
 yourselves+0.026
pered+0.025
Neg logit contributions
ves-0.021
gh-0.020
 replied-0.019
 happen-0.018
 poem-0.017
Token every
Token ID790
Feature activation-1.98e+00

Pos logit contributions
ind+0.026
 beings+0.024
 alone+0.023
 yourselves+0.023
asured+0.022
Neg logit contributions
ves-0.021
gh-0.019
 replied-0.018
enda-0.018
 poem-0.017
Token morning
Token ID3329
Feature activation+1.78e+00

Pos logit contributions
ind+0.071
 beings+0.063
asured+0.061
 alone+0.061
ubby+0.058
Neg logit contributions
ves-0.057
 replied-0.054
gh-0.052
enda-0.049
 happen-0.048
Token she
Token ID673
Feature activation-3.43e+00

Pos logit contributions
ves+0.042
 replied+0.040
gh+0.039
 happen+0.034
 story+0.034
Neg logit contributions
ind-0.070
 beings-0.065
 yourselves-0.061
 alone-0.061
asured-0.059
Token bird
Token ID6512
Feature activation-1.63e+00

Pos logit contributions
ves+0.077
 replied+0.070
gh+0.069
 happen+0.065
enda+0.062
Neg logit contributions
ind-0.092
 beings-0.083
 prisoners-0.074
 alone-0.073
 underground-0.073
Token whist
Token ID32262
Feature activation-1.00e+00

Pos logit contributions
ind+0.058
 beings+0.052
asured+0.049
pered+0.047
attered+0.047
Neg logit contributions
ves-0.049
 replied-0.048
gh-0.045
 poem-0.043
 story-0.042
Tokenling
Token ID1359
Feature activation-2.60e+00

Pos logit contributions
ind+0.024
 beings+0.021
asured+0.020
pered+0.019
 yourselves+0.018
Neg logit contributions
ves-0.041
 replied-0.041
gh-0.038
 story-0.037
 poem-0.037
Token in
Token ID287
Feature activation-1.66e+00

Pos logit contributions
ind+0.095
 beings+0.087
asured+0.083
 yourselves+0.079
attered+0.079
Neg logit contributions
 replied-0.075
ves-0.074
gh-0.068
 story-0.065
 poem-0.065
Token a
Token ID257
Feature activation+1.94e+00

Pos logit contributions
ind+0.067
 beings+0.061
asured+0.058
pered+0.057
 yourselves+0.057
Neg logit contributions
ves-0.043
 replied-0.041
gh-0.038
 poem-0.035
 story-0.035
Token tree
Token ID5509
Feature activation-2.58e+00

Pos logit contributions
ves+0.062
 replied+0.058
gh+0.054
 happen+0.054
enda+0.052
Neg logit contributions
ind-0.067
 beings-0.058
 alone-0.054
lves-0.051
pered-0.051
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
ind+0.102
 beings+0.094
asured+0.089
pered+0.087
attered+0.086
Neg logit contributions
ves-0.065
 replied-0.065
gh-0.059
 story-0.056
 poem-0.056
Token Sammy
Token ID44067
Feature activation+2.75e+00

Pos logit contributions
pered+0.031
ind+0.030
attered+0.029
 alone+0.028
 beings+0.028
Neg logit contributions
ves-0.043
gh-0.041
 happen-0.039
 poem-0.037
 story-0.036
Token thought
Token ID1807
Feature activation+2.76e+00

Pos logit contributions
ves+0.065
gh+0.060
 happen+0.054
 poem+0.054
 story+0.054
Neg logit contributions
ind-0.113
 beings-0.100
asured-0.098
 alone-0.095
pered-0.093
Token it
Token ID340
Feature activation+3.03e+00

Pos logit contributions
 replied+0.061
enda+0.059
ves+0.058
gh+0.056
 poem+0.052
Neg logit contributions
ind-0.103
 beings-0.101
 alone-0.099
pered-0.095
 underground-0.094
Token was
Token ID373
Feature activation-1.24e+00

Pos logit contributions
ves+0.086
gh+0.076
stand+0.073
 poem+0.073
enda+0.070
Neg logit contributions
ind-0.103
 beings-0.093
 alone-0.091
asured-0.087
 underground-0.087
Token usually
Token ID3221
Feature activation-8.97e-01

Pos logit contributions
ves+0.066
 replied+0.063
gh+0.060
 poem+0.057
enda+0.054
Neg logit contributions
ind-0.097
 beings-0.087
asured-0.082
 yourselves-0.080
pered-0.080
Token find
Token ID1064
Feature activation-3.88e-01

Pos logit contributions
ind+0.037
 beings+0.034
asured+0.033
pered+0.032
attered+0.032
Neg logit contributions
ves-0.021
 replied-0.021
gh-0.019
 story-0.018
 poem-0.018
Token help
Token ID1037
Feature activation+6.95e-01

Pos logit contributions
ind+0.015
 beings+0.013
asured+0.013
attered+0.013
pered+0.012
Neg logit contributions
ves-0.010
 replied-0.010
gh-0.009
 story-0.009
 poem-0.009
Token to
Token ID284
Feature activation+1.66e-01

Pos logit contributions
ves+0.021
 replied+0.021
gh+0.019
 poem+0.018
 story+0.018
Neg logit contributions
ind-0.025
 beings-0.023
asured-0.021
 yourselves-0.021
pered-0.020
Token overcome
Token ID10980
Feature activation-1.36e-01

Pos logit contributions
ves+0.004
 replied+0.004
gh+0.004
 poem+0.004
enda+0.004
Neg logit contributions
ind-0.006
 beings-0.006
 yourselves-0.006
asured-0.006
pered-0.005
Token your
Token ID534
Feature activation-4.44e+00

Pos logit contributions
ind+0.005
 beings+0.004
asured+0.004
attered+0.004
pered+0.004
Neg logit contributions
ves-0.004
 replied-0.004
gh-0.004
 poem-0.003
 story-0.003
Token fears
Token ID10251
Feature activation-6.20e-01

Pos logit contributions
ind+0.112
asured+0.096
 beings+0.095
attered+0.094
uzz+0.090
Neg logit contributions
 replied-0.169
 story-0.168
ves-0.166
 poem-0.162
gh-0.157
Token.
Token ID13
Feature activation-9.23e-01

Pos logit contributions
ind+0.024
 beings+0.022
asured+0.021
pered+0.020
attered+0.020
Neg logit contributions
ves-0.016
 replied-0.016
gh-0.015
 story-0.014
 poem-0.014
Token<|endoftext|>
Token ID50256
Feature activation-4.85e+00

Pos logit contributions
ind+0.041
 beings+0.038
asured+0.037
pered+0.036
 yourselves+0.036
Neg logit contributions
ves-0.019
 replied-0.018
gh-0.017
 poem-0.015
 happen-0.015
TokenOnce
Token ID7454
Feature activation-1.36e+00

Pos logit contributions
ind+0.167
 beings+0.152
pered+0.144
 alone+0.142
uzz+0.141
Neg logit contributions
ves-0.142
 replied-0.138
gh-0.129
 poem-0.129
 happen-0.124
Token there
Token ID612
Feature activation+5.90e-01

Pos logit contributions
ind+0.026
 alone+0.022
uzz+0.022
ubby+0.022
 beings+0.022
Neg logit contributions
ves-0.059
gh-0.057
 poem-0.056
 replied-0.056
odic-0.053
Token back
Token ID736
Feature activation-5.55e+00

Pos logit contributions
ves+0.014
 replied+0.014
gh+0.013
 story+0.012
 poem+0.012
Neg logit contributions
ind-0.016
 beings-0.014
asured-0.014
pered-0.013
 yourselves-0.013
Token to
Token ID284
Feature activation+6.34e-01

Pos logit contributions
ind+0.251
 beings+0.231
asured+0.219
lves+0.217
 yourselves+0.216
Neg logit contributions
 replied-0.108
ves-0.107
 -0.093
gh-0.093
 story-0.093
Token his
Token ID465
Feature activation-3.62e+00

Pos logit contributions
ves+0.020
 replied+0.019
gh+0.018
 poem+0.017
 story+0.017
Neg logit contributions
ind-0.022
 beings-0.020
asured-0.019
pered-0.019
 yourselves-0.018
Token nest
Token ID16343
Feature activation-1.86e+00

Pos logit contributions
ind+0.171
 beings+0.160
 yourselves+0.155
attered+0.153
pered+0.153
Neg logit contributions
 replied-0.065
 story-0.062
ves-0.054
 poem-0.054
 picture-0.052
Token.
Token ID13
Feature activation-4.16e-01

Pos logit contributions
ind+0.067
 beings+0.062
asured+0.060
pered+0.058
 alone+0.057
Neg logit contributions
ves-0.052
 replied-0.051
gh-0.048
 poem-0.046
 story-0.045
Token His
Token ID2399
Feature activation-3.01e+00

Pos logit contributions
ind+0.015
 beings+0.014
asured+0.014
pered+0.013
attered+0.013
Neg logit contributions
ves-0.012
 replied-0.011
gh-0.011
 poem-0.010
 story-0.010
Token mom
Token ID1995
Feature activation+2.32e-02

Pos logit contributions
ind+0.134
 beings+0.125
asured+0.118
attered+0.116
pered+0.116
Neg logit contributions
 replied-0.064
ves-0.063
gh-0.056
 story-0.055
 poem-0.053
Token loved
Token ID6151
Feature activation-3.46e+00

Pos logit contributions
ves+0.001
gh+0.001
 happen+0.000
 poem+0.000
 replied+0.000
Neg logit contributions
ind-0.001
 beings-0.001
asured-0.001
years-0.001
 alone-0.001
Token the
Token ID262
Feature activation+2.38e+00

Pos logit contributions
ind+0.125
 beings+0.115
asured+0.110
 yourselves+0.106
pered+0.104
Neg logit contributions
ves-0.101
 replied-0.098
gh-0.090
 poem-0.086
 story-0.084
Token ring
Token ID5858
Feature activation+3.82e-01

Pos logit contributions
ves+0.070
 replied+0.066
gh+0.063
enda+0.061
 Kelly+0.059
Neg logit contributions
ind-0.082
 beings-0.077
 alone-0.074
asured-0.071
 yourselves-0.070
Token and
Token ID290
Feature activation+8.76e-01

Pos logit contributions
ves+0.011
 replied+0.011
gh+0.010
 story+0.010
 poem+0.010
Neg logit contributions
ind-0.014
 beings-0.013
asured-0.012
pered-0.011
attered-0.011
Token a
Token ID257
Feature activation+3.23e+00

Pos logit contributions
ind+0.051
 beings+0.047
asured+0.044
pered+0.043
 yourselves+0.043
Neg logit contributions
ves-0.033
 replied-0.033
gh-0.030
 poem-0.028
 story-0.028
Token big
Token ID1263
Feature activation+1.14e+00

Pos logit contributions
ves+0.094
 replied+0.087
gh+0.081
 happen+0.078
enda+0.076
Neg logit contributions
ind-0.115
 beings-0.107
 prisoners-0.099
 alone-0.099
 slaves-0.097
Token box
Token ID3091
Feature activation+3.42e-02

Pos logit contributions
 replied+0.025
ves+0.024
gh+0.021
 story+0.021
 poem+0.020
Neg logit contributions
ind-0.050
 beings-0.046
asured-0.045
attered-0.043
 yourselves-0.043
Token of
Token ID286
Feature activation-4.30e+00

Pos logit contributions
ves+0.001
 replied+0.001
gh+0.001
 poem+0.001
 story+0.001
Neg logit contributions
ind-0.001
 beings-0.001
asured-0.001
pered-0.001
attered-0.001
Token ch
Token ID442
Feature activation-2.41e+00

Pos logit contributions
ind+0.168
asured+0.149
 beings+0.149
pered+0.147
attered+0.145
Neg logit contributions
 replied-0.117
ves-0.108
 story-0.104
 poem-0.101
 happen-0.098
Tokenocol
Token ID4668
Feature activation-9.39e+00

Pos logit contributions
ind+0.047
lves+0.046
 slaves+0.045
 beings+0.045
asured+0.044
Neg logit contributions
ves-0.093
gh-0.087
 replied-0.087
 story-0.086
on-0.084
Tokenates
Token ID689
Feature activation+7.20e-01

Pos logit contributions
ind+0.193
 beings+0.169
asured+0.152
attered+0.149
lves+0.144
Neg logit contributions
ves-0.410
gh-0.395
 replied-0.392
 story-0.370
 poem-0.364
Token!
Token ID0
Feature activation+6.86e-01

Pos logit contributions
 replied+0.018
ves+0.018
gh+0.016
 story+0.015
 poem+0.015
Neg logit contributions
ind-0.029
 beings-0.026
asured-0.025
pered-0.024
 yourselves-0.024
Token<|endoftext|>
Token ID50256
Feature activation-4.56e+00

Pos logit contributions
ves+0.015
 replied+0.014
gh+0.013
 happen+0.012
 poem+0.012
Neg logit contributions
ind-0.030
 beings-0.028
asured-0.027
pered-0.026
 yourselves-0.026
TokenOnce
Token ID7454
Feature activation-1.03e+00

Pos logit contributions
ind+0.142
 beings+0.128
pered+0.118
asured+0.117
 yourselves+0.116
Neg logit contributions
ves-0.150
 replied-0.147
gh-0.139
 poem-0.135
 happen-0.133
Token upon
Token ID2402
Feature activation-4.18e+00

Pos logit contributions
ind+0.022
 beings+0.018
 alone+0.018
uzz+0.017
ubby+0.017
Neg logit contributions
ves-0.044
gh-0.042
 replied-0.042
 poem-0.041
 happen-0.040
Token,
Token ID11
Feature activation-1.58e+00

Pos logit contributions
ves+0.024
 replied+0.023
gh+0.021
 poem+0.021
 story+0.021
Neg logit contributions
ind-0.035
 beings-0.033
pered-0.031
asured-0.030
uzz-0.030
Token
Token ID6184
Feature activation+2.96e-01

Pos logit contributions
ind+0.062
 beings+0.058
asured+0.055
pered+0.055
 yourselves+0.054
Neg logit contributions
ves-0.039
 replied-0.036
gh-0.035
 happen-0.032
 story-0.032
Token
Token ID95
Feature activation-1.74e+00

Pos logit contributions
ves+0.008
 replied+0.008
gh+0.007
 story+0.007
 poem+0.007
Neg logit contributions
ind-0.011
 beings-0.010
asured-0.010
pered-0.010
attered-0.009
Token
Token ID26391
Feature activation+1.16e+00

Pos logit contributions
ind+0.063
 beings+0.057
asured+0.054
pered+0.053
 yourselves+0.053
Neg logit contributions
ves-0.050
 replied-0.049
gh-0.045
 poem-0.044
 story-0.044
Token
Token ID129
Feature activation-4.87e-01

Pos logit contributions
 happen+0.029
 replied+0.027
 poem+0.027
ves+0.026
gh+0.025
Neg logit contributions
ind-0.042
 yourselves-0.038
 beings-0.037
uzz-0.037
ang-0.037
Token
Token ID241
Feature activation-7.89e+00

Pos logit contributions
ind+0.017
 beings+0.015
asured+0.014
pered+0.014
 yourselves+0.014
Neg logit contributions
ves-0.015
 replied-0.015
gh-0.014
 story-0.013
 poem-0.013
TokenWhy
Token ID5195
Feature activation+4.56e-01

Pos logit contributions
ind+0.280
 beings+0.255
asured+0.241
pered+0.233
 yourselves+0.233
Neg logit contributions
ves-0.231
 replied-0.231
gh-0.211
 story-0.202
 poem-0.202
Token don
Token ID836
Feature activation+2.62e+00

Pos logit contributions
 replied+0.015
ves+0.015
gh+0.014
 poem+0.013
 exclaimed+0.013
Neg logit contributions
ind-0.013
 beings-0.012
 yourselves-0.012
asured-0.011
 alone-0.011
Tokenâ
Token ID22940
Feature activation+1.27e-01

Pos logit contributions
 replied+0.062
gh+0.056
 poem+0.055
ves+0.055
enda+0.048
Neg logit contributions
ind-0.101
 yourselves-0.090
 beings-0.088
ubby-0.088
 alone-0.086
Token
Token ID26391
Feature activation+3.04e+00

Pos logit contributions
ves+0.003
 poem+0.003
 replied+0.003
gh+0.002
 story+0.002
Neg logit contributions
ind-0.005
asured-0.005
 alone-0.004
 yourselves-0.004
 beings-0.004
Token
Token ID8151
Feature activation+4.77e+00

Pos logit contributions
ves+0.073
 happen+0.072
 replied+0.071
gh+0.065
 poem+0.064
Neg logit contributions
ind-0.115
 beings-0.103
 yourselves-0.100
attered-0.098
ang-0.098
TokenOnce
Token ID7454
Feature activation-5.07e-01

Pos logit contributions
ind+0.146
 beings+0.133
asured+0.123
pered+0.122
 yourselves+0.121
Neg logit contributions
ves-0.137
 replied-0.135
gh-0.126
 poem-0.122
 happen-0.120
Token there
Token ID612
Feature activation+1.57e+00

Pos logit contributions
ind+0.011
 beings+0.009
 alone+0.009
uzz+0.008
ubby+0.008
Neg logit contributions
ves-0.022
gh-0.021
 replied-0.021
 poem-0.020
 happen-0.020
Token was
Token ID373
Feature activation-1.87e-02

Pos logit contributions
ves+0.031
 replied+0.029
gh+0.026
 poem+0.025
 story+0.024
Neg logit contributions
ind-0.071
 beings-0.066
asured-0.063
pered-0.061
attered-0.061
Token a
Token ID257
Feature activation+3.62e+00

Pos logit contributions
ind+0.001
 beings+0.001
asured+0.001
pered+0.001
ishing+0.001
Neg logit contributions
 replied-0.000
ves-0.000
gh-0.000
 -0.000
 story-0.000
Token re
Token ID302
Feature activation-5.89e+00

Pos logit contributions
 replied+0.098
ves+0.097
 story+0.087
gh+0.087
 poem+0.087
Neg logit contributions
ind-0.135
 beings-0.126
asured-0.123
pered-0.120
 yourselves-0.118
Tokenind
Token ID521
Feature activation-8.98e+00

Pos logit contributions
pered+0.046
 beings+0.036
 yourselves+0.033
 chefs+0.028
attered+0.026
Neg logit contributions
ves-0.321
 replied-0.315
 story-0.304
gh-0.302
 poem-0.292
Tokeneer
Token ID28153
Feature activation-5.41e-01

Pos logit contributions
 yourselves+0.194
pered+0.179
 beings+0.174
attered+0.172
ind+0.171
Neg logit contributions
ves-0.360
 replied-0.353
 story-0.330
 poem-0.330
 happen-0.314
Token.
Token ID13
Feature activation-8.26e-02

Pos logit contributions
ind+0.017
 beings+0.016
asured+0.015
 yourselves+0.015
pered+0.015
Neg logit contributions
 replied-0.018
ves-0.017
 story-0.016
gh-0.016
 -0.015
Token She
Token ID1375
Feature activation-2.78e+00

Pos logit contributions
ind+0.003
 beings+0.003
asured+0.003
ishing+0.003
attered+0.003
Neg logit contributions
 replied-0.002
ves-0.002
 -0.002
 story-0.002
gh-0.002
Token loved
Token ID6151
Feature activation-3.41e+00

Pos logit contributions
ind+0.110
 beings+0.105
asured+0.099
pered+0.098
 yourselves+0.096
Neg logit contributions
ves-0.074
 replied-0.072
gh-0.065
 story-0.063
 happen-0.060
Token to
Token ID284
Feature activation+8.35e-01

Pos logit contributions
ind+0.151
 beings+0.140
asured+0.134
pered+0.131
attered+0.130
Neg logit contributions
ves-0.071
 replied-0.070
gh-0.062
 story-0.059
 poem-0.059
Token,
Token ID11
Feature activation+4.83e-01

Pos logit contributions
ves+0.059
 replied+0.057
gh+0.052
 poem+0.050
 story+0.049
Neg logit contributions
ind-0.101
 beings-0.095
pered-0.089
asured-0.088
uzz-0.087
Token
Token ID6184
Feature activation-5.22e-02

Pos logit contributions
ves+0.013
gh+0.012
 replied+0.012
 poem+0.011
 happen+0.011
Neg logit contributions
ind-0.018
 beings-0.016
pered-0.016
uzz-0.016
asured-0.015
Token
Token ID95
Feature activation-1.12e+00

Pos logit contributions
ind+0.002
 beings+0.002
asured+0.002
attered+0.002
ishes+0.002
Neg logit contributions
 replied-0.002
ves-0.002
gh-0.001
 happen-0.001
 story-0.001
Token
Token ID26391
Feature activation+2.21e+00

Pos logit contributions
ind+0.037
 beings+0.034
asured+0.032
attered+0.030
pered+0.030
Neg logit contributions
ves-0.036
 replied-0.036
gh-0.033
 story-0.032
 poem-0.031
Token
Token ID129
Feature activation-8.60e-02

Pos logit contributions
 replied+0.060
ves+0.057
gh+0.055
 happen+0.055
 poem+0.054
Neg logit contributions
ind-0.081
 beings-0.072
 yourselves-0.068
 prisoners-0.067
 slaves-0.066
Token
Token ID241
Feature activation-7.95e+00

Pos logit contributions
ind+0.003
 beings+0.003
asured+0.002
pered+0.002
 yourselves+0.002
Neg logit contributions
ves-0.003
 replied-0.003
gh-0.003
 story-0.002
 poem-0.002
TokenI
Token ID40
Feature activation-2.12e+00

Pos logit contributions
ind+0.304
 beings+0.280
asured+0.265
pered+0.259
attered+0.256
Neg logit contributions
ves-0.213
 replied-0.210
gh-0.192
 story-0.182
 poem-0.182
Token like
Token ID588
Feature activation-2.25e+00

Pos logit contributions
ind+0.079
 beings+0.073
asured+0.068
pered+0.067
 yourselves+0.067
Neg logit contributions
 replied-0.059
ves-0.059
gh-0.053
 story-0.051
 poem-0.050
Token seeing
Token ID4379
Feature activation-1.06e+00

Pos logit contributions
ind+0.091
 beings+0.083
asured+0.080
attered+0.078
pered+0.078
Neg logit contributions
ves-0.055
 replied-0.055
gh-0.050
 story-0.048
 poem-0.048
Token you
Token ID345
Feature activation-1.09e+00

Pos logit contributions
ind+0.041
 beings+0.038
asured+0.036
pered+0.035
attered+0.035
Neg logit contributions
ves-0.027
 replied-0.027
gh-0.024
 story-0.023
 poem-0.023
Token sleep
Token ID3993
Feature activation-8.29e-01

Pos logit contributions
ind+0.042
 beings+0.039
attered+0.037
asured+0.036
pered+0.036
Neg logit contributions
 replied-0.029
ves-0.028
 story-0.026
gh-0.026
 poem-0.026
Token
Token ID220
Feature activation+3.10e+00

Pos logit contributions
ves+0.069
gh+0.066
 happen+0.060
 replied+0.059
 poem+0.058
Neg logit contributions
ind-0.122
 beings-0.115
pered-0.111
asured-0.108
attered-0.107
Token
Token ID198
Feature activation+2.03e+00

Pos logit contributions
 replied+0.065
ves+0.065
gh+0.063
 poem+0.061
 happen+0.057
Neg logit contributions
ind-0.129
 beings-0.120
pered-0.116
asured-0.114
uzz-0.114
Token
Token ID198
Feature activation+1.78e+00

Pos logit contributions
 happen+0.060
 poem+0.058
gh+0.055
 replied+0.054
ves+0.052
Neg logit contributions
ind-0.070
uzz-0.060
 beings-0.057
 yourselves-0.057
pered-0.057
TokenThe
Token ID464
Feature activation+1.87e+00

Pos logit contributions
 replied+0.040
ves+0.040
gh+0.036
 story+0.034
 happen+0.034
Neg logit contributions
ind-0.075
 beings-0.069
asured-0.066
pered-0.064
 yourselves-0.064
Token re
Token ID302
Feature activation-4.81e+00

Pos logit contributions
 replied+0.062
 story+0.059
ves+0.056
 poem+0.055
 +0.053
Neg logit contributions
ind-0.056
attered-0.055
 beings-0.054
asured-0.054
pered-0.053
Tokenind
Token ID521
Feature activation-8.57e+00

Pos logit contributions
pered+0.049
attered+0.034
 yourselves+0.034
 thieves+0.034
ags+0.034
Neg logit contributions
ves-0.253
 replied-0.238
 story-0.229
gh-0.225
lf-0.219
Tokeneer
Token ID28153
Feature activation+7.63e-01

Pos logit contributions
 yourselves+0.180
pered+0.176
 beings+0.162
 prisoners+0.158
attered+0.156
Neg logit contributions
ves-0.338
 replied-0.325
 story-0.303
 poem-0.290
 happen-0.283
Token was
Token ID373
Feature activation-4.40e-01

Pos logit contributions
ves+0.015
 replied+0.015
gh+0.013
 story+0.012
 poem+0.012
Neg logit contributions
ind-0.034
 beings-0.032
asured-0.031
pered-0.030
attered-0.030
Token so
Token ID523
Feature activation+2.07e-01

Pos logit contributions
ind+0.017
 beings+0.015
attered+0.014
asured+0.014
 yourselves+0.014
Neg logit contributions
 replied-0.012
ves-0.012
gh-0.011
 story-0.010
 -0.010
Token pleased
Token ID10607
Feature activation-2.04e-01

Pos logit contributions
 replied+0.006
ves+0.006
 +0.006
gh+0.005
 story+0.005
Neg logit contributions
ind-0.007
 beings-0.007
 yourselves-0.006
isher-0.006
asured-0.006
Token that
Token ID326
Feature activation+1.28e+00

Pos logit contributions
ind+0.008
 beings+0.007
asured+0.007
pered+0.007
 yourselves+0.006
Neg logit contributions
ves-0.006
 replied-0.005
gh-0.005
 story-0.005
 poem-0.005