TOP ACTIVATIONS
MAX = 0.189

love ." <|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> Anna and Ben are playing
like love ." <|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> Anna and Ben are
more squash ." <|endoftext|> <|endoftext|> <|endoftext|> L ily and Sam went
. <|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> L ila and Ben are
but like love ." <|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> Anna and Ben
end . <|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> L ila and Ben
<|endoftext|> Once upon a time there was a little girl named
<|endoftext|> Once upon a time there was a bus . The
<|endoftext|> Once upon a time there was a little girl .
<|endoftext|> Once upon a time there was a clever dog .
<|endoftext|> Once upon a time there lived a girl called Ruth
<|endoftext|> Once upon a time there was a beautiful princess called
<|endoftext|> Once upon a time there was a blue ostr ich
<|endoftext|> Once upon a time there was a big , beautiful
<|endoftext|> Once upon a time there was a big house .
<|endoftext|> Once upon a time there was a princess who lived
<|endoftext|> Once upon a time there was a girl who loved
<|endoftext|> Once upon a time there was a brave little boy
<|endoftext|> Once upon a time there was a summer day .
<|endoftext|> Once upon a time there was a little girl named

MIN ACTIVATIONS
MIN = -0.143

garden together and share their m ugs of spicy tea .
. She also gave the m ule a carrot to make
that he had earned his m ama 's permission . <|endoftext|>
not want to keep the m itten . It did not
they could not find their m ittens . They had disappeared
I help you fix your sc ooter ?" The
and reminded herself of the fore b oding darkness she had
't want to share her m op , but her mom
and wished they had their m ittens back . They were
had learned to appreciate the therm ometer . <|endoftext|> John lay
. You can keep your m op . And you can
that are cheap are not very good . But these cookies
them . He held their m ittens gently . He knew
to lose her new gifted m ittens . But then ,
for things that weren 't quite right . <|endoftext|> Once upon
before we sent him the sc ooter !" M andy smiled
boy and give him her sc ooter as a gift .
homes . They put their sc ooters away and say goodbye
that he had done something very helpful and kind . He
and it made him feel very special . The end .

INTERVAL 0.106 - 0.189
CONTAINS 2.007%

people happy . <|endoftext|> Once upon a time , there was
mom . <|endoftext|> Once upon a time there was a depend
. <|endoftext|> Once upon a time , there was a little
ler in the forest . <|endoftext|> M um and Dad had
every day . <|endoftext|> Once upon a time , in a

INTERVAL 0.023 - 0.106
CONTAINS 24.144%

When she got back home , she put the meat
a big bowl of rice on the table . Nancy and
. Ben loved to hop around and explore the world .
time , there was a little girl named Lily . She
joy . Maria c uddled the puppy all

INTERVAL -0.060 - 0.023
CONTAINS 69.771%

past ries . Lily learns to be more careful and patient
on . The mixer made a loud noise and spun very
hat . The small man said , " Hi
Miss Lee says , " You 're welcome , Tom
on the ground . The rob in sees it too .

INTERVAL -0.143 - -0.060
CONTAINS 4.078%

The bug was pin ching his tail ! Sam
. She learned that it 's important to be honest and
. And that 's the sad ending of Tim my and
is that it 's always better to ask for help than
that you 're so happy this morning . Would you like
Token love
Token ID1842
Feature activation+4.42e-03

Pos logit contributions
 there+0.005
 set+0.005
 upon+0.005
 saw+0.005
 found+0.005
Neg logit contributions
 wounds-0.009
rier-0.009
 apology-0.009
othe-0.009
ometer-0.009
Token."
Token ID526
Feature activation-5.40e-03

Pos logit contributions
 there+0.005
 set+0.005
 upon+0.005
 saw+0.005
 found+0.005
Neg logit contributions
 wounds-0.009
rier-0.009
 apology-0.009
othe-0.009
ometer-0.009
Token<|endoftext|>
Token ID50256
Feature activation+1.41e-01

Pos logit contributions
rier+0.014
 wounds+0.014
vern+0.014
 fault+0.013
vere+0.013
Neg logit contributions
 there-0.005
 saw-0.004
 upon-0.004
 a-0.004
 the-0.004
Token<|endoftext|>
Token ID50256
Feature activation+1.61e-01

Pos logit contributions
 set+0.199
 three+0.196
 spent+0.188
ere+0.185
 colored+0.181
Neg logit contributions
othe-0.200
vere-0.200
 wounds-0.198
 Or-0.196
 apology-0.195
Token<|endoftext|>
Token ID50256
Feature activation+1.69e-01

Pos logit contributions
 set+0.235
 spent+0.230
 three+0.227
 colored+0.219
ere+0.217
Neg logit contributions
othe-0.208
vere-0.206
 Or-0.204
 wounds-0.203
 dent-0.199
Token<|endoftext|>
Token ID50256
Feature activation+1.72e-01

Pos logit contributions
 set+0.232
 spent+0.229
 three+0.223
 packed+0.217
 colored+0.216
Neg logit contributions
othe-0.221
vere-0.220
 Or-0.219
 dent-0.215
 wounds-0.215
TokenAnna
Token ID31160
Feature activation+9.32e-02

Pos logit contributions
 set+0.226
 spent+0.223
 three+0.217
 packed+0.214
 colored+0.209
Neg logit contributions
 Or-0.229
vere-0.228
othe-0.227
 dent-0.225
 wounds-0.222
Token and
Token ID290
Feature activation+1.17e-01

Pos logit contributions
neath+0.139
 there+0.139
ere+0.134
where+0.134
 upon+0.133
Neg logit contributions
 wounds-0.132
 apology-0.127
ometer-0.121
othe-0.120
 mistake-0.118
Token Ben
Token ID3932
Feature activation+1.03e-01

Pos logit contributions
ere+0.116
y+0.110
r+0.107
ka+0.106
 colored+0.106
Neg logit contributions
othe-0.240
 sorry-0.233
 bleed-0.218
 apology-0.217
 wounds-0.216
Token are
Token ID389
Feature activation+1.10e-01

Pos logit contributions
 there+0.123
ere+0.117
 three+0.116
 set+0.114
 upon+0.114
Neg logit contributions
 wounds-0.177
 apology-0.174
 fault-0.172
othe-0.171
vern-0.166
Token playing
Token ID2712
Feature activation+1.09e-01

Pos logit contributions
 arrived+0.117
 set+0.117
y+0.114
 saw+0.113
r+0.113
Neg logit contributions
 wounds-0.215
 fault-0.212
vern-0.207
 sorry-0.205
othe-0.205
Token like
Token ID588
Feature activation+4.42e-03

Pos logit contributions
 wounds+0.092
rier+0.090
 apology+0.089
ometer+0.087
vern+0.087
Neg logit contributions
 there-0.063
 set-0.057
 saw-0.056
 upon-0.056
 found-0.055
Token love
Token ID1842
Feature activation+4.42e-03

Pos logit contributions
 there+0.005
 set+0.005
 upon+0.005
 saw+0.005
 found+0.005
Neg logit contributions
 wounds-0.009
rier-0.009
 apology-0.009
othe-0.009
ometer-0.009
Token."
Token ID526
Feature activation-5.40e-03

Pos logit contributions
 there+0.005
 set+0.005
 upon+0.005
 saw+0.005
 found+0.005
Neg logit contributions
 wounds-0.009
rier-0.009
 apology-0.009
othe-0.009
ometer-0.009
Token<|endoftext|>
Token ID50256
Feature activation+1.41e-01

Pos logit contributions
rier+0.014
 wounds+0.014
vern+0.014
 fault+0.013
vere+0.013
Neg logit contributions
 there-0.005
 saw-0.004
 upon-0.004
 a-0.004
 the-0.004
Token<|endoftext|>
Token ID50256
Feature activation+1.61e-01

Pos logit contributions
 set+0.199
 three+0.196
 spent+0.188
ere+0.185
 colored+0.181
Neg logit contributions
othe-0.200
vere-0.200
 wounds-0.198
 Or-0.196
 apology-0.195
Token<|endoftext|>
Token ID50256
Feature activation+1.69e-01

Pos logit contributions
 set+0.235
 spent+0.230
 three+0.227
 colored+0.219
ere+0.217
Neg logit contributions
othe-0.208
vere-0.206
 Or-0.204
 wounds-0.203
 dent-0.199
Token<|endoftext|>
Token ID50256
Feature activation+1.72e-01

Pos logit contributions
 set+0.232
 spent+0.229
 three+0.223
 packed+0.217
 colored+0.216
Neg logit contributions
othe-0.221
vere-0.220
 Or-0.219
 dent-0.215
 wounds-0.215
TokenAnna
Token ID31160
Feature activation+9.32e-02

Pos logit contributions
 set+0.226
 spent+0.223
 three+0.217
 packed+0.214
 colored+0.209
Neg logit contributions
 Or-0.229
vere-0.228
othe-0.227
 dent-0.225
 wounds-0.222
Token and
Token ID290
Feature activation+1.17e-01

Pos logit contributions
neath+0.139
 there+0.139
ere+0.134
where+0.134
 upon+0.133
Neg logit contributions
 wounds-0.132
 apology-0.127
ometer-0.121
othe-0.120
 mistake-0.118
Token Ben
Token ID3932
Feature activation+1.03e-01

Pos logit contributions
ere+0.116
y+0.110
r+0.107
ka+0.106
 colored+0.106
Neg logit contributions
othe-0.240
 sorry-0.233
 bleed-0.218
 apology-0.217
 wounds-0.216
Token are
Token ID389
Feature activation+1.10e-01

Pos logit contributions
 there+0.123
ere+0.117
 three+0.116
 set+0.114
 upon+0.114
Neg logit contributions
 wounds-0.177
 apology-0.174
 fault-0.172
othe-0.171
vern-0.166
Token more
Token ID517
Feature activation-6.31e-02

Pos logit contributions
 there+0.012
 set+0.011
 saw+0.010
 upon+0.010
 found+0.010
Neg logit contributions
rier-0.031
 wounds-0.031
 apology-0.030
ometer-0.029
othe-0.029
Token squash
Token ID34613
Feature activation-3.70e-02

Pos logit contributions
 wounds+0.119
 apology+0.117
rier+0.113
othe+0.111
 fault+0.109
Neg logit contributions
 there-0.083
 upon-0.083
 set-0.082
 saw-0.081
 found-0.079
Token."
Token ID526
Feature activation-2.91e-02

Pos logit contributions
 wounds+0.078
rier+0.076
 apology+0.075
othe+0.072
 fault+0.072
Neg logit contributions
 there-0.041
 upon-0.040
 saw-0.039
 set-0.039
 three-0.037
Token<|endoftext|>
Token ID50256
Feature activation+1.30e-01

Pos logit contributions
rier+0.081
 wounds+0.080
vern+0.076
 apology+0.075
vere+0.075
Neg logit contributions
 there-0.022
 upon-0.019
 saw-0.017
 a-0.016
-0.015
Token<|endoftext|>
Token ID50256
Feature activation+1.53e-01

Pos logit contributions
 set+0.141
 spent+0.134
 explored+0.131
 planted+0.129
 three+0.128
Neg logit contributions
vere-0.234
 wounds-0.229
 Or-0.227
vern-0.222
 apology-0.221
Token<|endoftext|>
Token ID50256
Feature activation+1.62e-01

Pos logit contributions
 set+0.179
 spent+0.178
 packed+0.169
 explored+0.169
 imagined+0.165
Neg logit contributions
vere-0.243
 wounds-0.243
 Or-0.239
othe-0.232
 dent-0.231
TokenL
Token ID43
Feature activation+9.25e-02

Pos logit contributions
 set+0.190
 spent+0.188
 packed+0.185
 explored+0.177
 three+0.176
Neg logit contributions
 wounds-0.249
vere-0.245
 Or-0.243
 dent-0.238
othe-0.236
Tokenily
Token ID813
Feature activation+9.54e-02

Pos logit contributions
 there+0.120
 set+0.116
 three+0.113
 saw+0.112
where+0.111
Neg logit contributions
 apology-0.165
 wounds-0.163
vern-0.156
 fault-0.153
 mistake-0.150
Token and
Token ID290
Feature activation+1.12e-01

Pos logit contributions
 there+0.145
 upon+0.140
ifying+0.137
where+0.135
neath+0.134
Neg logit contributions
 wounds-0.139
 apology-0.130
 mistake-0.125
ometer-0.124
 fault-0.123
Token Sam
Token ID3409
Feature activation+1.08e-01

Pos logit contributions
 upon+0.107
y+0.107
ere+0.106
r+0.099
 camps+0.096
Neg logit contributions
othe-0.237
 sorry-0.230
 bleed-0.224
 apology-0.223
 wounds-0.219
Token went
Token ID1816
Feature activation+1.19e-01

Pos logit contributions
 there+0.121
 three+0.113
 upon+0.110
where+0.108
 set+0.107
Neg logit contributions
 wounds-0.197
 fault-0.196
 apology-0.195
othe-0.187
vern-0.185
Token.
Token ID13
Feature activation-2.51e-02

Pos logit contributions
rier+0.123
 wounds+0.120
 apology+0.114
 fault+0.113
vern+0.112
Neg logit contributions
 there-0.036
 in-0.029
 saw-0.028
 upon-0.028
 a-0.025
Token<|endoftext|>
Token ID50256
Feature activation+1.19e-01

Pos logit contributions
 wounds+0.063
rier+0.062
 apology+0.059
vern+0.058
ometer+0.058
Neg logit contributions
 there-0.023
 upon-0.020
 set-0.019
 saw-0.018
 three-0.018
Token<|endoftext|>
Token ID50256
Feature activation+1.43e-01

Pos logit contributions
 set+0.116
 three+0.110
 saw+0.106
 upon+0.106
 spent+0.104
Neg logit contributions
vere-0.227
vern-0.223
 wounds-0.223
 Or-0.222
ometer-0.221
Token<|endoftext|>
Token ID50256
Feature activation+1.53e-01

Pos logit contributions
 set+0.167
 spent+0.160
 three+0.156
 explored+0.153
 found+0.152
Neg logit contributions
vere-0.236
 wounds-0.234
 Or-0.232
 dent-0.230
vern-0.229
Token<|endoftext|>
Token ID50256
Feature activation+1.59e-01

Pos logit contributions
 set+0.185
 spent+0.178
y+0.174
 three+0.173
 packed+0.169
Neg logit contributions
 wounds-0.242
vere-0.240
 dent-0.237
 Or-0.236
vern-0.232
Token<|endoftext|>
Token ID50256
Feature activation+1.62e-01

Pos logit contributions
 set+0.192
 spent+0.184
 three+0.181
y+0.180
 packed+0.179
Neg logit contributions
 wounds-0.245
vere-0.243
 dent-0.242
 Or-0.240
othe-0.235
TokenL
Token ID43
Feature activation+1.05e-01

Pos logit contributions
 set+0.195
 spent+0.187
 three+0.185
 packed+0.183
y+0.180
Neg logit contributions
 wounds-0.245
 dent-0.244
vere-0.243
 Or-0.242
othe-0.236
Tokenila
Token ID10102
Feature activation+9.48e-02

Pos logit contributions
 there+0.144
 set+0.139
 three+0.136
where+0.132
 saw+0.132
Neg logit contributions
 apology-0.180
 wounds-0.176
vern-0.170
 fault-0.170
 mistake-0.163
Token and
Token ID290
Feature activation+1.22e-01

Pos logit contributions
 there+0.126
 upon+0.120
where+0.113
 set+0.112
 three+0.111
Neg logit contributions
 wounds-0.169
 apology-0.162
othe-0.155
ometer-0.151
 fault-0.149
Token Ben
Token ID3932
Feature activation+1.05e-01

Pos logit contributions
ere+0.114
y+0.111
 upon+0.109
 set+0.105
r+0.104
Neg logit contributions
othe-0.255
 sorry-0.242
 bleed-0.239
 wounds-0.234
 apology-0.234
Token are
Token ID389
Feature activation+1.13e-01

Pos logit contributions
 there+0.123
 upon+0.116
ere+0.115
 three+0.113
 set+0.112
Neg logit contributions
 wounds-0.189
 apology-0.184
 fault-0.181
othe-0.179
vern-0.175
Token but
Token ID475
Feature activation-4.64e-02

Pos logit contributions
 wounds+0.054
vern+0.054
rier+0.053
 apology+0.053
ometer+0.053
Neg logit contributions
 there-0.032
 set-0.027
 upon-0.027
 saw-0.027
 three-0.027
Token like
Token ID588
Feature activation+4.42e-03

Pos logit contributions
 wounds+0.092
rier+0.090
 apology+0.089
ometer+0.087
vern+0.087
Neg logit contributions
 there-0.063
 set-0.057
 saw-0.056
 upon-0.056
 found-0.055
Token love
Token ID1842
Feature activation+4.42e-03

Pos logit contributions
 there+0.005
 set+0.005
 upon+0.005
 saw+0.005
 found+0.005
Neg logit contributions
 wounds-0.009
rier-0.009
 apology-0.009
othe-0.009
ometer-0.009
Token."
Token ID526
Feature activation-5.40e-03

Pos logit contributions
 there+0.005
 set+0.005
 upon+0.005
 saw+0.005
 found+0.005
Neg logit contributions
 wounds-0.009
rier-0.009
 apology-0.009
othe-0.009
ometer-0.009
Token<|endoftext|>
Token ID50256
Feature activation+1.41e-01

Pos logit contributions
rier+0.014
 wounds+0.014
vern+0.014
 fault+0.013
vere+0.013
Neg logit contributions
 there-0.005
 saw-0.004
 upon-0.004
 a-0.004
 the-0.004
Token<|endoftext|>
Token ID50256
Feature activation+1.61e-01

Pos logit contributions
 set+0.199
 three+0.196
 spent+0.188
ere+0.185
 colored+0.181
Neg logit contributions
othe-0.200
vere-0.200
 wounds-0.198
 Or-0.196
 apology-0.195
Token<|endoftext|>
Token ID50256
Feature activation+1.69e-01

Pos logit contributions
 set+0.235
 spent+0.230
 three+0.227
 colored+0.219
ere+0.217
Neg logit contributions
othe-0.208
vere-0.206
 Or-0.204
 wounds-0.203
 dent-0.199
Token<|endoftext|>
Token ID50256
Feature activation+1.72e-01

Pos logit contributions
 set+0.232
 spent+0.229
 three+0.223
 packed+0.217
 colored+0.216
Neg logit contributions
othe-0.221
vere-0.220
 Or-0.219
 dent-0.215
 wounds-0.215
TokenAnna
Token ID31160
Feature activation+9.32e-02

Pos logit contributions
 set+0.226
 spent+0.223
 three+0.217
 packed+0.214
 colored+0.209
Neg logit contributions
 Or-0.229
vere-0.228
othe-0.227
 dent-0.225
 wounds-0.222
Token and
Token ID290
Feature activation+1.17e-01

Pos logit contributions
neath+0.139
 there+0.139
ere+0.134
where+0.134
 upon+0.133
Neg logit contributions
 wounds-0.132
 apology-0.127
ometer-0.121
othe-0.120
 mistake-0.118
Token Ben
Token ID3932
Feature activation+1.03e-01

Pos logit contributions
ere+0.116
y+0.110
r+0.107
ka+0.106
 colored+0.106
Neg logit contributions
othe-0.240
 sorry-0.233
 bleed-0.218
 apology-0.217
 wounds-0.216
Token end
Token ID886
Feature activation-4.52e-02

Pos logit contributions
 wounds+0.048
rier+0.047
 apology+0.045
othe+0.045
ometer+0.045
Neg logit contributions
 there-0.044
 set-0.042
 saw-0.041
 upon-0.041
 three-0.040
Token.
Token ID13
Feature activation-2.51e-02

Pos logit contributions
rier+0.123
 wounds+0.120
 apology+0.114
 fault+0.113
vern+0.112
Neg logit contributions
 there-0.036
 in-0.029
 saw-0.028
 upon-0.028
 a-0.025
Token<|endoftext|>
Token ID50256
Feature activation+1.19e-01

Pos logit contributions
 wounds+0.063
rier+0.062
 apology+0.059
vern+0.058
ometer+0.058
Neg logit contributions
 there-0.023
 upon-0.020
 set-0.019
 saw-0.018
 three-0.018
Token<|endoftext|>
Token ID50256
Feature activation+1.43e-01

Pos logit contributions
 set+0.116
 three+0.110
 saw+0.106
 upon+0.106
 spent+0.104
Neg logit contributions
vere-0.227
vern-0.223
 wounds-0.223
 Or-0.222
ometer-0.221
Token<|endoftext|>
Token ID50256
Feature activation+1.53e-01

Pos logit contributions
 set+0.167
 spent+0.160
 three+0.156
 explored+0.153
 found+0.152
Neg logit contributions
vere-0.236
 wounds-0.234
 Or-0.232
 dent-0.230
vern-0.229
Token<|endoftext|>
Token ID50256
Feature activation+1.59e-01

Pos logit contributions
 set+0.185
 spent+0.178
y+0.174
 three+0.173
 packed+0.169
Neg logit contributions
 wounds-0.242
vere-0.240
 dent-0.237
 Or-0.236
vern-0.232
Token<|endoftext|>
Token ID50256
Feature activation+1.62e-01

Pos logit contributions
 set+0.192
 spent+0.184
 three+0.181
y+0.180
 packed+0.179
Neg logit contributions
 wounds-0.245
vere-0.243
 dent-0.242
 Or-0.240
othe-0.235
TokenL
Token ID43
Feature activation+1.05e-01

Pos logit contributions
 set+0.195
 spent+0.187
 three+0.185
 packed+0.183
y+0.180
Neg logit contributions
 wounds-0.245
 dent-0.244
vere-0.243
 Or-0.242
othe-0.236
Tokenila
Token ID10102
Feature activation+9.48e-02

Pos logit contributions
 there+0.144
 set+0.139
 three+0.136
where+0.132
 saw+0.132
Neg logit contributions
 apology-0.180
 wounds-0.176
vern-0.170
 fault-0.170
 mistake-0.163
Token and
Token ID290
Feature activation+1.22e-01

Pos logit contributions
 there+0.126
 upon+0.120
where+0.113
 set+0.112
 three+0.111
Neg logit contributions
 wounds-0.169
 apology-0.162
othe-0.155
ometer-0.151
 fault-0.149
Token Ben
Token ID3932
Feature activation+1.05e-01

Pos logit contributions
ere+0.114
y+0.111
 upon+0.109
 set+0.105
r+0.104
Neg logit contributions
othe-0.255
 sorry-0.242
 bleed-0.239
 wounds-0.234
 apology-0.234
Token<|endoftext|>
Token ID50256
Feature activation+1.46e-01

Pos logit contributions
 there+0.045
 set+0.045
 saw+0.044
 found+0.043
 upon+0.043
Neg logit contributions
ometer-0.056
 apology-0.055
othe-0.055
 wounds-0.055
rier-0.054
TokenOnce
Token ID7454
Feature activation+1.44e-01

Pos logit contributions
 set+0.223
 three+0.216
 colored+0.213
 spent+0.209
ere+0.207
Neg logit contributions
othe-0.193
ometer-0.192
 Or-0.184
vere-0.181
 fault-0.179
Token upon
Token ID2402
Feature activation+1.57e-01

Pos logit contributions
 set+0.085
 colored+0.063
ere+0.059
 entered+0.058
 explored+0.055
Neg logit contributions
othe-0.377
ometer-0.370
 fault-0.369
rier-0.364
vern-0.358
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.171
cy+0.147
y+0.147
neath+0.142
 arrived+0.139
Neg logit contributions
 wounds-0.317
 worse-0.315
 bleed-0.315
vern-0.308
 apology-0.302
Token time
Token ID640
Feature activation+1.31e-01

Pos logit contributions
 there+0.111
 set+0.099
 upon+0.094
 saw+0.087
 found+0.086
Neg logit contributions
 wounds-0.298
vern-0.294
 apology-0.292
 fault-0.288
othe-0.283
Token there
Token ID612
Feature activation+1.59e-01

Pos logit contributions
 set+0.145
 colored+0.131
ere+0.129
 explored+0.123
 spread+0.120
Neg logit contributions
vern-0.262
othe-0.262
vere-0.257
ometer-0.249
 fault-0.248
Token was
Token ID373
Feature activation+1.16e-01

Pos logit contributions
ere+0.179
 three+0.169
 set+0.168
ka+0.162
r+0.159
Neg logit contributions
othe-0.288
ometer-0.275
 fault-0.274
 dent-0.272
 wounds-0.263
Token a
Token ID257
Feature activation+9.98e-02

Pos logit contributions
 upon+0.089
y+0.083
 arrived+0.082
 saw+0.081
ere+0.079
Neg logit contributions
vern-0.251
 fault-0.248
 wounds-0.243
 Or-0.239
 bleed-0.238
Token little
Token ID1310
Feature activation+8.55e-02

Pos logit contributions
 there+0.070
 set+0.062
 saw+0.061
 upon+0.060
 found+0.056
Neg logit contributions
 wounds-0.258
rier-0.254
 apology-0.251
othe-0.247
ometer-0.245
Token girl
Token ID2576
Feature activation+6.92e-02

Pos logit contributions
 there+0.115
 set+0.109
 upon+0.108
 saw+0.107
 found+0.104
Neg logit contributions
 wounds-0.164
rier-0.162
 apology-0.158
othe-0.154
ometer-0.154
Token named
Token ID3706
Feature activation+1.18e-01

Pos logit contributions
 upon+0.078
 there+0.068
ere+0.066
 set+0.065
neath+0.065
Neg logit contributions
 wounds-0.139
othe-0.135
ometer-0.132
 apology-0.128
ache-0.125
Token<|endoftext|>
Token ID50256
Feature activation+1.46e-01

Pos logit contributions
 there+0.045
 set+0.045
 saw+0.044
 found+0.043
 upon+0.043
Neg logit contributions
ometer-0.056
 apology-0.055
othe-0.055
 wounds-0.055
rier-0.054
TokenOnce
Token ID7454
Feature activation+1.44e-01

Pos logit contributions
 set+0.223
 three+0.216
 colored+0.213
 spent+0.209
ere+0.207
Neg logit contributions
othe-0.193
ometer-0.192
 Or-0.184
vere-0.181
 fault-0.179
Token upon
Token ID2402
Feature activation+1.57e-01

Pos logit contributions
 set+0.085
 colored+0.063
ere+0.059
 entered+0.058
 explored+0.055
Neg logit contributions
othe-0.377
ometer-0.370
 fault-0.369
rier-0.364
vern-0.358
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.171
cy+0.147
y+0.147
neath+0.142
 arrived+0.139
Neg logit contributions
 wounds-0.317
 worse-0.315
 bleed-0.315
vern-0.308
 apology-0.302
Token time
Token ID640
Feature activation+1.31e-01

Pos logit contributions
 there+0.111
 set+0.099
 upon+0.094
 saw+0.087
 found+0.086
Neg logit contributions
 wounds-0.298
vern-0.294
 apology-0.292
 fault-0.288
othe-0.283
Token there
Token ID612
Feature activation+1.59e-01

Pos logit contributions
 set+0.145
 colored+0.131
ere+0.129
 explored+0.123
 spread+0.120
Neg logit contributions
vern-0.262
othe-0.262
vere-0.257
ometer-0.249
 fault-0.248
Token was
Token ID373
Feature activation+1.16e-01

Pos logit contributions
ere+0.179
 three+0.169
 set+0.168
ka+0.162
r+0.159
Neg logit contributions
othe-0.288
ometer-0.275
 fault-0.274
 dent-0.272
 wounds-0.263
Token a
Token ID257
Feature activation+9.98e-02

Pos logit contributions
 upon+0.089
y+0.083
 arrived+0.082
 saw+0.081
ere+0.079
Neg logit contributions
vern-0.251
 fault-0.248
 wounds-0.243
 Or-0.239
 bleed-0.238
Token bus
Token ID1323
Feature activation+7.61e-02

Pos logit contributions
 there+0.070
 set+0.062
 saw+0.061
 upon+0.060
 found+0.056
Neg logit contributions
 wounds-0.258
rier-0.254
 apology-0.251
othe-0.247
ometer-0.245
Token.
Token ID13
Feature activation+4.71e-02

Pos logit contributions
 there+0.075
 set+0.069
 upon+0.068
 saw+0.068
 found+0.065
Neg logit contributions
 wounds-0.173
rier-0.171
 apology-0.168
othe-0.165
ometer-0.164
Token The
Token ID383
Feature activation+2.58e-02

Pos logit contributions
 saw+0.068
 found+0.065
 upon+0.065
ere+0.062
y+0.062
Neg logit contributions
ometer-0.074
othe-0.073
 apology-0.069
 Or-0.068
rier-0.067
Token<|endoftext|>
Token ID50256
Feature activation+1.47e-01

Pos logit contributions
 there+0.018
 set+0.017
 saw+0.017
 upon+0.017
 found+0.017
Neg logit contributions
 wounds-0.023
rier-0.023
 apology-0.022
othe-0.022
ometer-0.022
TokenOnce
Token ID7454
Feature activation+1.47e-01

Pos logit contributions
 set+0.226
 three+0.221
 spent+0.215
 colored+0.212
 spread+0.207
Neg logit contributions
ometer-0.198
othe-0.196
 Or-0.189
 apology-0.183
vere-0.181
Token upon
Token ID2402
Feature activation+1.57e-01

Pos logit contributions
 set+0.085
 colored+0.062
 entered+0.059
ere+0.058
 explored+0.054
Neg logit contributions
othe-0.383
 fault-0.377
ometer-0.374
rier-0.369
vern-0.368
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.168
cy+0.147
y+0.144
neath+0.141
 arrived+0.137
Neg logit contributions
 wounds-0.316
 bleed-0.312
 worse-0.311
vern-0.308
 apology-0.302
Token time
Token ID640
Feature activation+1.32e-01

Pos logit contributions
 there+0.112
 set+0.100
 upon+0.094
 saw+0.088
 found+0.087
Neg logit contributions
 wounds-0.298
vern-0.295
 apology-0.293
 fault-0.290
othe-0.284
Token there
Token ID612
Feature activation+1.59e-01

Pos logit contributions
 set+0.148
ere+0.134
 colored+0.132
 explored+0.126
 spread+0.123
Neg logit contributions
vern-0.261
othe-0.259
vere-0.255
 fault-0.248
 sorry-0.247
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ere+0.181
 set+0.168
 three+0.168
ka+0.166
r+0.161
Neg logit contributions
othe-0.282
ometer-0.269
 dent-0.268
 fault-0.267
 wounds-0.256
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 upon+0.095
y+0.089
 saw+0.086
ere+0.085
 arrived+0.084
Neg logit contributions
vern-0.250
 fault-0.248
 wounds-0.239
 Or-0.237
 bleed-0.236
Token little
Token ID1310
Feature activation+8.63e-02

Pos logit contributions
 there+0.070
 set+0.062
 saw+0.060
 upon+0.060
 found+0.056
Neg logit contributions
 wounds-0.262
rier-0.258
 apology-0.255
othe-0.250
ometer-0.249
Token girl
Token ID2576
Feature activation+6.99e-02

Pos logit contributions
 there+0.115
 set+0.109
 upon+0.108
 saw+0.107
 found+0.104
Neg logit contributions
 wounds-0.166
rier-0.164
 apology-0.160
ometer-0.156
othe-0.156
Token.
Token ID13
Feature activation+4.95e-02

Pos logit contributions
 upon+0.082
 there+0.071
neath+0.069
ere+0.069
 set+0.068
Neg logit contributions
 wounds-0.135
othe-0.132
ometer-0.129
 apology-0.124
 Or-0.123
Token<|endoftext|>
Token ID50256
Feature activation+1.47e-01

Pos logit contributions
 there+0.018
 set+0.017
 saw+0.017
 upon+0.017
 found+0.017
Neg logit contributions
 wounds-0.023
rier-0.023
 apology-0.022
othe-0.022
ometer-0.022
TokenOnce
Token ID7454
Feature activation+1.47e-01

Pos logit contributions
 set+0.226
 three+0.221
 spent+0.215
 colored+0.212
 spread+0.207
Neg logit contributions
ometer-0.198
othe-0.196
 Or-0.189
 apology-0.183
vere-0.181
Token upon
Token ID2402
Feature activation+1.57e-01

Pos logit contributions
 set+0.085
 colored+0.062
 entered+0.059
ere+0.058
 explored+0.054
Neg logit contributions
othe-0.383
 fault-0.377
ometer-0.374
rier-0.369
vern-0.368
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.168
cy+0.147
y+0.144
neath+0.141
 arrived+0.137
Neg logit contributions
 wounds-0.316
 bleed-0.312
 worse-0.311
vern-0.308
 apology-0.302
Token time
Token ID640
Feature activation+1.32e-01

Pos logit contributions
 there+0.112
 set+0.100
 upon+0.094
 saw+0.088
 found+0.087
Neg logit contributions
 wounds-0.298
vern-0.295
 apology-0.293
 fault-0.290
othe-0.284
Token there
Token ID612
Feature activation+1.59e-01

Pos logit contributions
 set+0.148
ere+0.134
 colored+0.132
 explored+0.126
 spread+0.123
Neg logit contributions
vern-0.261
othe-0.259
vere-0.255
 fault-0.248
 sorry-0.247
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ere+0.181
 set+0.168
 three+0.168
ka+0.166
r+0.161
Neg logit contributions
othe-0.282
ometer-0.269
 dent-0.268
 fault-0.267
 wounds-0.256
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 upon+0.095
y+0.089
 saw+0.086
ere+0.085
 arrived+0.084
Neg logit contributions
vern-0.250
 fault-0.248
 wounds-0.239
 Or-0.237
 bleed-0.236
Token clever
Token ID14169
Feature activation+7.18e-02

Pos logit contributions
 there+0.070
 set+0.062
 saw+0.060
 upon+0.060
 found+0.056
Neg logit contributions
 wounds-0.262
rier-0.258
 apology-0.255
othe-0.250
ometer-0.249
Token dog
Token ID3290
Feature activation+6.25e-02

Pos logit contributions
 there+0.080
 upon+0.074
 set+0.073
 saw+0.072
 found+0.070
Neg logit contributions
 wounds-0.153
rier-0.152
 apology-0.150
ometer-0.146
othe-0.144
Token.
Token ID13
Feature activation+4.31e-02

Pos logit contributions
 there+0.047
 upon+0.046
 set+0.044
 saw+0.041
ere+0.040
Neg logit contributions
 wounds-0.150
othe-0.147
 apology-0.145
rier-0.145
ometer-0.145
Token<|endoftext|>
Token ID50256
Feature activation+1.47e-01

Pos logit contributions
 there+0.018
 set+0.017
 saw+0.017
 upon+0.017
 found+0.017
Neg logit contributions
 wounds-0.023
rier-0.023
 apology-0.022
othe-0.022
ometer-0.022
TokenOnce
Token ID7454
Feature activation+1.47e-01

Pos logit contributions
 set+0.226
 three+0.221
 spent+0.215
 colored+0.212
 spread+0.207
Neg logit contributions
ometer-0.198
othe-0.196
 Or-0.189
 apology-0.183
vere-0.181
Token upon
Token ID2402
Feature activation+1.57e-01

Pos logit contributions
 set+0.085
 colored+0.062
 entered+0.059
ere+0.058
 explored+0.054
Neg logit contributions
othe-0.383
 fault-0.377
ometer-0.374
rier-0.369
vern-0.368
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.168
cy+0.147
y+0.144
neath+0.141
 arrived+0.137
Neg logit contributions
 wounds-0.316
 bleed-0.312
 worse-0.311
vern-0.308
 apology-0.302
Token time
Token ID640
Feature activation+1.32e-01

Pos logit contributions
 there+0.112
 set+0.100
 upon+0.094
 saw+0.088
 found+0.087
Neg logit contributions
 wounds-0.298
vern-0.295
 apology-0.293
 fault-0.290
othe-0.284
Token there
Token ID612
Feature activation+1.59e-01

Pos logit contributions
 set+0.148
ere+0.134
 colored+0.132
 explored+0.126
 spread+0.123
Neg logit contributions
vern-0.261
othe-0.259
vere-0.255
 fault-0.248
 sorry-0.247
Token lived
Token ID5615
Feature activation+1.49e-01

Pos logit contributions
ere+0.181
 set+0.168
 three+0.168
ka+0.166
r+0.161
Neg logit contributions
othe-0.282
ometer-0.269
 dent-0.268
 fault-0.267
 wounds-0.256
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
ited+0.164
y+0.154
 arrived+0.149
 saw+0.147
 spotted+0.147
Neg logit contributions
vern-0.282
 sorry-0.273
 wounds-0.268
vere-0.259
 worse-0.250
Token girl
Token ID2576
Feature activation+7.01e-02

Pos logit contributions
 there+0.075
 set+0.067
 upon+0.065
 saw+0.065
 found+0.061
Neg logit contributions
 wounds-0.256
rier-0.252
 apology-0.249
othe-0.244
ometer-0.243
Token called
Token ID1444
Feature activation+1.09e-01

Pos logit contributions
 upon+0.093
neath+0.081
ere+0.079
 there+0.076
 set+0.076
Neg logit contributions
 wounds-0.118
othe-0.116
ometer-0.113
 Or-0.111
pre-0.109
Token Ruth
Token ID22132
Feature activation+6.48e-02

Pos logit contributions
 saw+0.168
 upon+0.166
 set+0.166
 there+0.159
where+0.157
Neg logit contributions
ometer-0.155
 apology-0.154
othe-0.154
vern-0.145
 fault-0.145
Token<|endoftext|>
Token ID50256
Feature activation+1.47e-01

Pos logit contributions
 there+0.018
 set+0.017
 saw+0.017
 upon+0.017
 found+0.017
Neg logit contributions
 wounds-0.023
rier-0.023
 apology-0.022
othe-0.022
ometer-0.022
TokenOnce
Token ID7454
Feature activation+1.47e-01

Pos logit contributions
 set+0.226
 three+0.221
 spent+0.215
 colored+0.212
 spread+0.207
Neg logit contributions
ometer-0.198
othe-0.196
 Or-0.189
 apology-0.183
vere-0.181
Token upon
Token ID2402
Feature activation+1.57e-01

Pos logit contributions
 set+0.085
 colored+0.062
 entered+0.059
ere+0.058
 explored+0.054
Neg logit contributions
othe-0.383
 fault-0.377
ometer-0.374
rier-0.369
vern-0.368
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.168
cy+0.147
y+0.144
neath+0.141
 arrived+0.137
Neg logit contributions
 wounds-0.316
 bleed-0.312
 worse-0.311
vern-0.308
 apology-0.302
Token time
Token ID640
Feature activation+1.32e-01

Pos logit contributions
 there+0.112
 set+0.100
 upon+0.094
 saw+0.088
 found+0.087
Neg logit contributions
 wounds-0.298
vern-0.295
 apology-0.293
 fault-0.290
othe-0.284
Token there
Token ID612
Feature activation+1.59e-01

Pos logit contributions
 set+0.148
ere+0.134
 colored+0.132
 explored+0.126
 spread+0.123
Neg logit contributions
vern-0.261
othe-0.259
vere-0.255
 fault-0.248
 sorry-0.247
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ere+0.181
 set+0.168
 three+0.168
ka+0.166
r+0.161
Neg logit contributions
othe-0.282
ometer-0.269
 dent-0.268
 fault-0.267
 wounds-0.256
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 upon+0.095
y+0.089
 saw+0.086
ere+0.085
 arrived+0.084
Neg logit contributions
vern-0.250
 fault-0.248
 wounds-0.239
 Or-0.237
 bleed-0.236
Token beautiful
Token ID4950
Feature activation+6.91e-02

Pos logit contributions
 there+0.070
 set+0.062
 saw+0.060
 upon+0.060
 found+0.056
Neg logit contributions
 wounds-0.262
rier-0.258
 apology-0.255
othe-0.250
ometer-0.249
Token princess
Token ID21752
Feature activation+7.46e-02

Pos logit contributions
 there+0.088
 upon+0.086
 found+0.082
ere+0.079
 set+0.079
Neg logit contributions
rier-0.128
 apology-0.125
 wounds-0.125
ometer-0.124
othe-0.117
Token called
Token ID1444
Feature activation+1.03e-01

Pos logit contributions
 upon+0.095
neath+0.090
 Inside+0.087
 there+0.087
ere+0.086
Neg logit contributions
ache-0.130
 wounds-0.129
ometer-0.128
othe-0.127
vern-0.120
Token<|endoftext|>
Token ID50256
Feature activation+1.47e-01

Pos logit contributions
 there+0.018
 set+0.017
 saw+0.017
 upon+0.017
 found+0.017
Neg logit contributions
 wounds-0.023
rier-0.023
 apology-0.022
othe-0.022
ometer-0.022
TokenOnce
Token ID7454
Feature activation+1.47e-01

Pos logit contributions
 set+0.226
 three+0.221
 spent+0.215
 colored+0.212
 spread+0.207
Neg logit contributions
ometer-0.198
othe-0.196
 Or-0.189
 apology-0.183
vere-0.181
Token upon
Token ID2402
Feature activation+1.57e-01

Pos logit contributions
 set+0.085
 colored+0.062
 entered+0.059
ere+0.058
 explored+0.054
Neg logit contributions
othe-0.383
 fault-0.377
ometer-0.374
rier-0.369
vern-0.368
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.168
cy+0.147
y+0.144
neath+0.141
 arrived+0.137
Neg logit contributions
 wounds-0.316
 bleed-0.312
 worse-0.311
vern-0.308
 apology-0.302
Token time
Token ID640
Feature activation+1.32e-01

Pos logit contributions
 there+0.112
 set+0.100
 upon+0.094
 saw+0.088
 found+0.087
Neg logit contributions
 wounds-0.298
vern-0.295
 apology-0.293
 fault-0.290
othe-0.284
Token there
Token ID612
Feature activation+1.59e-01

Pos logit contributions
 set+0.148
ere+0.134
 colored+0.132
 explored+0.126
 spread+0.123
Neg logit contributions
vern-0.261
othe-0.259
vere-0.255
 fault-0.248
 sorry-0.247
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ere+0.181
 set+0.168
 three+0.168
ka+0.166
r+0.161
Neg logit contributions
othe-0.282
ometer-0.269
 dent-0.268
 fault-0.267
 wounds-0.256
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 upon+0.095
y+0.089
 saw+0.086
ere+0.085
 arrived+0.084
Neg logit contributions
vern-0.250
 fault-0.248
 wounds-0.239
 Or-0.237
 bleed-0.236
Token blue
Token ID4171
Feature activation+9.72e-02

Pos logit contributions
 there+0.070
 set+0.062
 saw+0.060
 upon+0.060
 found+0.056
Neg logit contributions
 wounds-0.262
rier-0.258
 apology-0.255
othe-0.250
ometer-0.249
Token ostr
Token ID42387
Feature activation+4.99e-02

Pos logit contributions
 there+0.146
 found+0.139
 upon+0.137
 saw+0.136
 set+0.134
Neg logit contributions
rier-0.165
 wounds-0.162
 apology-0.160
ometer-0.159
othe-0.149
Tokenich
Token ID488
Feature activation+7.67e-02

Pos logit contributions
 there+0.118
 set+0.115
 upon+0.113
 saw+0.113
 found+0.112
Neg logit contributions
 wounds-0.045
rier-0.044
ometer-0.040
 apology-0.039
othe-0.039
Token<|endoftext|>
Token ID50256
Feature activation+1.47e-01

Pos logit contributions
 there+0.018
 set+0.017
 saw+0.017
 upon+0.017
 found+0.017
Neg logit contributions
 wounds-0.023
rier-0.023
 apology-0.022
othe-0.022
ometer-0.022
TokenOnce
Token ID7454
Feature activation+1.47e-01

Pos logit contributions
 set+0.226
 three+0.221
 spent+0.215
 colored+0.212
 spread+0.207
Neg logit contributions
ometer-0.198
othe-0.196
 Or-0.189
 apology-0.183
vere-0.181
Token upon
Token ID2402
Feature activation+1.57e-01

Pos logit contributions
 set+0.085
 colored+0.062
 entered+0.059
ere+0.058
 explored+0.054
Neg logit contributions
othe-0.383
 fault-0.377
ometer-0.374
rier-0.369
vern-0.368
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.168
cy+0.147
y+0.144
neath+0.141
 arrived+0.137
Neg logit contributions
 wounds-0.316
 bleed-0.312
 worse-0.311
vern-0.308
 apology-0.302
Token time
Token ID640
Feature activation+1.32e-01

Pos logit contributions
 there+0.112
 set+0.100
 upon+0.094
 saw+0.088
 found+0.087
Neg logit contributions
 wounds-0.298
vern-0.295
 apology-0.293
 fault-0.290
othe-0.284
Token there
Token ID612
Feature activation+1.59e-01

Pos logit contributions
 set+0.148
ere+0.134
 colored+0.132
 explored+0.126
 spread+0.123
Neg logit contributions
vern-0.261
othe-0.259
vere-0.255
 fault-0.248
 sorry-0.247
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ere+0.181
 set+0.168
 three+0.168
ka+0.166
r+0.161
Neg logit contributions
othe-0.282
ometer-0.269
 dent-0.268
 fault-0.267
 wounds-0.256
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 upon+0.095
y+0.089
 saw+0.086
ere+0.085
 arrived+0.084
Neg logit contributions
vern-0.250
 fault-0.248
 wounds-0.239
 Or-0.237
 bleed-0.236
Token big
Token ID1263
Feature activation+7.83e-02

Pos logit contributions
 there+0.070
 set+0.062
 saw+0.060
 upon+0.060
 found+0.056
Neg logit contributions
 wounds-0.262
rier-0.258
 apology-0.255
othe-0.250
ometer-0.249
Token,
Token ID11
Feature activation+8.71e-02

Pos logit contributions
 there+0.097
 upon+0.094
 set+0.089
 found+0.089
 saw+0.087
Neg logit contributions
 wounds-0.151
rier-0.149
 apology-0.149
ometer-0.144
othe-0.142
Token beautiful
Token ID4950
Feature activation+5.97e-02

Pos logit contributions
 upon+0.097
 set+0.096
ere+0.094
 arrived+0.092
 there+0.091
Neg logit contributions
rier-0.168
 wounds-0.163
ometer-0.161
 apology-0.158
othe-0.156
Token<|endoftext|>
Token ID50256
Feature activation+1.47e-01

Pos logit contributions
 there+0.018
 set+0.017
 saw+0.017
 upon+0.017
 found+0.017
Neg logit contributions
 wounds-0.023
rier-0.023
 apology-0.022
othe-0.022
ometer-0.022
TokenOnce
Token ID7454
Feature activation+1.47e-01

Pos logit contributions
 set+0.226
 three+0.221
 spent+0.215
 colored+0.212
 spread+0.207
Neg logit contributions
ometer-0.198
othe-0.196
 Or-0.189
 apology-0.183
vere-0.181
Token upon
Token ID2402
Feature activation+1.57e-01

Pos logit contributions
 set+0.085
 colored+0.062
 entered+0.059
ere+0.058
 explored+0.054
Neg logit contributions
othe-0.383
 fault-0.377
ometer-0.374
rier-0.369
vern-0.368
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.168
cy+0.147
y+0.144
neath+0.141
 arrived+0.137
Neg logit contributions
 wounds-0.316
 bleed-0.312
 worse-0.311
vern-0.308
 apology-0.302
Token time
Token ID640
Feature activation+1.32e-01

Pos logit contributions
 there+0.112
 set+0.100
 upon+0.094
 saw+0.088
 found+0.087
Neg logit contributions
 wounds-0.298
vern-0.295
 apology-0.293
 fault-0.290
othe-0.284
Token there
Token ID612
Feature activation+1.59e-01

Pos logit contributions
 set+0.148
ere+0.134
 colored+0.132
 explored+0.126
 spread+0.123
Neg logit contributions
vern-0.261
othe-0.259
vere-0.255
 fault-0.248
 sorry-0.247
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ere+0.181
 set+0.168
 three+0.168
ka+0.166
r+0.161
Neg logit contributions
othe-0.282
ometer-0.269
 dent-0.268
 fault-0.267
 wounds-0.256
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 upon+0.095
y+0.089
 saw+0.086
ere+0.085
 arrived+0.084
Neg logit contributions
vern-0.250
 fault-0.248
 wounds-0.239
 Or-0.237
 bleed-0.236
Token big
Token ID1263
Feature activation+7.83e-02

Pos logit contributions
 there+0.070
 set+0.062
 saw+0.060
 upon+0.060
 found+0.056
Neg logit contributions
 wounds-0.262
rier-0.258
 apology-0.255
othe-0.250
ometer-0.249
Token house
Token ID2156
Feature activation+8.30e-02

Pos logit contributions
 there+0.097
 upon+0.094
 set+0.089
 found+0.089
 saw+0.087
Neg logit contributions
 wounds-0.151
rier-0.149
 apology-0.149
ometer-0.144
othe-0.142
Token.
Token ID13
Feature activation+4.92e-02

Pos logit contributions
 upon+0.088
 there+0.078
ere+0.078
 saw+0.075
 found+0.075
Neg logit contributions
othe-0.169
rier-0.169
ometer-0.166
 wounds-0.166
 apology-0.163
Token<|endoftext|>
Token ID50256
Feature activation+1.47e-01

Pos logit contributions
 there+0.018
 set+0.017
 saw+0.017
 upon+0.017
 found+0.017
Neg logit contributions
 wounds-0.023
rier-0.023
 apology-0.022
othe-0.022
ometer-0.022
TokenOnce
Token ID7454
Feature activation+1.47e-01

Pos logit contributions
 set+0.226
 three+0.221
 spent+0.215
 colored+0.212
 spread+0.207
Neg logit contributions
ometer-0.198
othe-0.196
 Or-0.189
 apology-0.183
vere-0.181
Token upon
Token ID2402
Feature activation+1.57e-01

Pos logit contributions
 set+0.085
 colored+0.062
 entered+0.059
ere+0.058
 explored+0.054
Neg logit contributions
othe-0.383
 fault-0.377
ometer-0.374
rier-0.369
vern-0.368
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.168
cy+0.147
y+0.144
neath+0.141
 arrived+0.137
Neg logit contributions
 wounds-0.316
 bleed-0.312
 worse-0.311
vern-0.308
 apology-0.302
Token time
Token ID640
Feature activation+1.32e-01

Pos logit contributions
 there+0.112
 set+0.100
 upon+0.094
 saw+0.088
 found+0.087
Neg logit contributions
 wounds-0.298
vern-0.295
 apology-0.293
 fault-0.290
othe-0.284
Token there
Token ID612
Feature activation+1.59e-01

Pos logit contributions
 set+0.148
ere+0.134
 colored+0.132
 explored+0.126
 spread+0.123
Neg logit contributions
vern-0.261
othe-0.259
vere-0.255
 fault-0.248
 sorry-0.247
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ere+0.181
 set+0.168
 three+0.168
ka+0.166
r+0.161
Neg logit contributions
othe-0.282
ometer-0.269
 dent-0.268
 fault-0.267
 wounds-0.256
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 upon+0.095
y+0.089
 saw+0.086
ere+0.085
 arrived+0.084
Neg logit contributions
vern-0.250
 fault-0.248
 wounds-0.239
 Or-0.237
 bleed-0.236
Token princess
Token ID21752
Feature activation+8.12e-02

Pos logit contributions
 there+0.070
 set+0.062
 saw+0.060
 upon+0.060
 found+0.056
Neg logit contributions
 wounds-0.262
rier-0.258
 apology-0.255
othe-0.250
ometer-0.249
Token who
Token ID508
Feature activation+8.10e-02

Pos logit contributions
 upon+0.104
neath+0.100
 Inside+0.097
 there+0.095
ere+0.095
Neg logit contributions
ache-0.141
 wounds-0.139
othe-0.138
ometer-0.137
rier-0.128
Token lived
Token ID5615
Feature activation+1.22e-01

Pos logit contributions
 there+0.077
 upon+0.073
ere+0.069
 set+0.067
 three+0.066
Neg logit contributions
 bleed-0.169
 wounds-0.168
othe-0.168
ometer-0.168
 dent-0.159
Token<|endoftext|>
Token ID50256
Feature activation+1.47e-01

Pos logit contributions
 there+0.018
 set+0.017
 saw+0.017
 upon+0.017
 found+0.017
Neg logit contributions
 wounds-0.023
rier-0.023
 apology-0.022
othe-0.022
ometer-0.022
TokenOnce
Token ID7454
Feature activation+1.47e-01

Pos logit contributions
 set+0.226
 three+0.221
 spent+0.215
 colored+0.212
 spread+0.207
Neg logit contributions
ometer-0.198
othe-0.196
 Or-0.189
 apology-0.183
vere-0.181
Token upon
Token ID2402
Feature activation+1.57e-01

Pos logit contributions
 set+0.085
 colored+0.062
 entered+0.059
ere+0.058
 explored+0.054
Neg logit contributions
othe-0.383
 fault-0.377
ometer-0.374
rier-0.369
vern-0.368
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.168
cy+0.147
y+0.144
neath+0.141
 arrived+0.137
Neg logit contributions
 wounds-0.316
 bleed-0.312
 worse-0.311
vern-0.308
 apology-0.302
Token time
Token ID640
Feature activation+1.32e-01

Pos logit contributions
 there+0.112
 set+0.100
 upon+0.094
 saw+0.088
 found+0.087
Neg logit contributions
 wounds-0.298
vern-0.295
 apology-0.293
 fault-0.290
othe-0.284
Token there
Token ID612
Feature activation+1.59e-01

Pos logit contributions
 set+0.148
ere+0.134
 colored+0.132
 explored+0.126
 spread+0.123
Neg logit contributions
vern-0.261
othe-0.259
vere-0.255
 fault-0.248
 sorry-0.247
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ere+0.181
 set+0.168
 three+0.168
ka+0.166
r+0.161
Neg logit contributions
othe-0.282
ometer-0.269
 dent-0.268
 fault-0.267
 wounds-0.256
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 upon+0.095
y+0.089
 saw+0.086
ere+0.085
 arrived+0.084
Neg logit contributions
vern-0.250
 fault-0.248
 wounds-0.239
 Or-0.237
 bleed-0.236
Token girl
Token ID2576
Feature activation+7.14e-02

Pos logit contributions
 there+0.070
 set+0.062
 saw+0.060
 upon+0.060
 found+0.056
Neg logit contributions
 wounds-0.262
rier-0.258
 apology-0.255
othe-0.250
ometer-0.249
Token who
Token ID508
Feature activation+8.22e-02

Pos logit contributions
 upon+0.081
 there+0.071
ere+0.069
neath+0.068
 set+0.068
Neg logit contributions
 wounds-0.141
othe-0.138
ometer-0.135
 apology-0.130
 Or-0.128
Token loved
Token ID6151
Feature activation+8.95e-02

Pos logit contributions
 there+0.076
 upon+0.072
ere+0.071
 set+0.070
 three+0.065
Neg logit contributions
 wounds-0.171
ometer-0.171
othe-0.170
 bleed-0.170
 apology-0.163
Token<|endoftext|>
Token ID50256
Feature activation+1.47e-01

Pos logit contributions
 there+0.018
 set+0.017
 saw+0.017
 upon+0.017
 found+0.017
Neg logit contributions
 wounds-0.023
rier-0.023
 apology-0.022
othe-0.022
ometer-0.022
TokenOnce
Token ID7454
Feature activation+1.47e-01

Pos logit contributions
 set+0.226
 three+0.221
 spent+0.215
 colored+0.212
 spread+0.207
Neg logit contributions
ometer-0.198
othe-0.196
 Or-0.189
 apology-0.183
vere-0.181
Token upon
Token ID2402
Feature activation+1.57e-01

Pos logit contributions
 set+0.085
 colored+0.062
 entered+0.059
ere+0.058
 explored+0.054
Neg logit contributions
othe-0.383
 fault-0.377
ometer-0.374
rier-0.369
vern-0.368
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.168
cy+0.147
y+0.144
neath+0.141
 arrived+0.137
Neg logit contributions
 wounds-0.316
 bleed-0.312
 worse-0.311
vern-0.308
 apology-0.302
Token time
Token ID640
Feature activation+1.32e-01

Pos logit contributions
 there+0.112
 set+0.100
 upon+0.094
 saw+0.088
 found+0.087
Neg logit contributions
 wounds-0.298
vern-0.295
 apology-0.293
 fault-0.290
othe-0.284
Token there
Token ID612
Feature activation+1.59e-01

Pos logit contributions
 set+0.148
ere+0.134
 colored+0.132
 explored+0.126
 spread+0.123
Neg logit contributions
vern-0.261
othe-0.259
vere-0.255
 fault-0.248
 sorry-0.247
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ere+0.181
 set+0.168
 three+0.168
ka+0.166
r+0.161
Neg logit contributions
othe-0.282
ometer-0.269
 dent-0.268
 fault-0.267
 wounds-0.256
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 upon+0.095
y+0.089
 saw+0.086
ere+0.085
 arrived+0.084
Neg logit contributions
vern-0.250
 fault-0.248
 wounds-0.239
 Or-0.237
 bleed-0.236
Token brave
Token ID14802
Feature activation+7.38e-02

Pos logit contributions
 there+0.070
 set+0.062
 saw+0.060
 upon+0.060
 found+0.056
Neg logit contributions
 wounds-0.262
rier-0.258
 apology-0.255
othe-0.250
ometer-0.249
Token little
Token ID1310
Feature activation+7.67e-02

Pos logit contributions
 there+0.084
 upon+0.079
 set+0.077
 saw+0.075
 found+0.074
Neg logit contributions
rier-0.153
 apology-0.151
 wounds-0.151
ometer-0.147
othe-0.142
Token boy
Token ID2933
Feature activation+6.45e-02

Pos logit contributions
 there+0.098
 set+0.092
 saw+0.090
 upon+0.089
 found+0.086
Neg logit contributions
 wounds-0.156
rier-0.152
 apology-0.150
othe-0.147
vern-0.145
Token<|endoftext|>
Token ID50256
Feature activation+1.47e-01

Pos logit contributions
 there+0.018
 set+0.017
 saw+0.017
 upon+0.017
 found+0.017
Neg logit contributions
 wounds-0.023
rier-0.023
 apology-0.022
othe-0.022
ometer-0.022
TokenOnce
Token ID7454
Feature activation+1.47e-01

Pos logit contributions
 set+0.226
 three+0.221
 spent+0.215
 colored+0.212
 spread+0.207
Neg logit contributions
ometer-0.198
othe-0.196
 Or-0.189
 apology-0.183
vere-0.181
Token upon
Token ID2402
Feature activation+1.57e-01

Pos logit contributions
 set+0.085
 colored+0.062
 entered+0.059
ere+0.058
 explored+0.054
Neg logit contributions
othe-0.383
 fault-0.377
ometer-0.374
rier-0.369
vern-0.368
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.168
cy+0.147
y+0.144
neath+0.141
 arrived+0.137
Neg logit contributions
 wounds-0.316
 bleed-0.312
 worse-0.311
vern-0.308
 apology-0.302
Token time
Token ID640
Feature activation+1.32e-01

Pos logit contributions
 there+0.112
 set+0.100
 upon+0.094
 saw+0.088
 found+0.087
Neg logit contributions
 wounds-0.298
vern-0.295
 apology-0.293
 fault-0.290
othe-0.284
Token there
Token ID612
Feature activation+1.59e-01

Pos logit contributions
 set+0.148
ere+0.134
 colored+0.132
 explored+0.126
 spread+0.123
Neg logit contributions
vern-0.261
othe-0.259
vere-0.255
 fault-0.248
 sorry-0.247
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ere+0.181
 set+0.168
 three+0.168
ka+0.166
r+0.161
Neg logit contributions
othe-0.282
ometer-0.269
 dent-0.268
 fault-0.267
 wounds-0.256
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 upon+0.095
y+0.089
 saw+0.086
ere+0.085
 arrived+0.084
Neg logit contributions
vern-0.250
 fault-0.248
 wounds-0.239
 Or-0.237
 bleed-0.236
Token summer
Token ID3931
Feature activation+7.40e-02

Pos logit contributions
 there+0.070
 set+0.062
 saw+0.060
 upon+0.060
 found+0.056
Neg logit contributions
 wounds-0.262
rier-0.258
 apology-0.255
othe-0.250
ometer-0.249
Token day
Token ID1110
Feature activation+9.30e-02

Pos logit contributions
 there+0.075
 upon+0.072
 saw+0.069
 set+0.069
 found+0.068
Neg logit contributions
 wounds-0.159
rier-0.159
 apology-0.155
ometer-0.154
othe-0.154
Token.
Token ID13
Feature activation+5.09e-02

Pos logit contributions
ere+0.083
 upon+0.081
 found+0.079
 saw+0.079
cy+0.075
Neg logit contributions
othe-0.193
ometer-0.183
rier-0.183
 sorry-0.181
vere-0.179
Token<|endoftext|>
Token ID50256
Feature activation+1.47e-01

Pos logit contributions
 there+0.018
 set+0.017
 saw+0.017
 upon+0.017
 found+0.017
Neg logit contributions
 wounds-0.023
rier-0.023
 apology-0.022
othe-0.022
ometer-0.022
TokenOnce
Token ID7454
Feature activation+1.47e-01

Pos logit contributions
 set+0.226
 three+0.221
 spent+0.215
 colored+0.212
 spread+0.207
Neg logit contributions
ometer-0.198
othe-0.196
 Or-0.189
 apology-0.183
vere-0.181
Token upon
Token ID2402
Feature activation+1.57e-01

Pos logit contributions
 set+0.085
 colored+0.062
 entered+0.059
ere+0.058
 explored+0.054
Neg logit contributions
othe-0.383
 fault-0.377
ometer-0.374
rier-0.369
vern-0.368
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.168
cy+0.147
y+0.144
neath+0.141
 arrived+0.137
Neg logit contributions
 wounds-0.316
 bleed-0.312
 worse-0.311
vern-0.308
 apology-0.302
Token time
Token ID640
Feature activation+1.32e-01

Pos logit contributions
 there+0.112
 set+0.100
 upon+0.094
 saw+0.088
 found+0.087
Neg logit contributions
 wounds-0.298
vern-0.295
 apology-0.293
 fault-0.290
othe-0.284
Token there
Token ID612
Feature activation+1.59e-01

Pos logit contributions
 set+0.148
ere+0.134
 colored+0.132
 explored+0.126
 spread+0.123
Neg logit contributions
vern-0.261
othe-0.259
vere-0.255
 fault-0.248
 sorry-0.247
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ere+0.181
 set+0.168
 three+0.168
ka+0.166
r+0.161
Neg logit contributions
othe-0.282
ometer-0.269
 dent-0.268
 fault-0.267
 wounds-0.256
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 upon+0.095
y+0.089
 saw+0.086
ere+0.085
 arrived+0.084
Neg logit contributions
vern-0.250
 fault-0.248
 wounds-0.239
 Or-0.237
 bleed-0.236
Token little
Token ID1310
Feature activation+8.63e-02

Pos logit contributions
 there+0.070
 set+0.062
 saw+0.060
 upon+0.060
 found+0.056
Neg logit contributions
 wounds-0.262
rier-0.258
 apology-0.255
othe-0.250
ometer-0.249
Token girl
Token ID2576
Feature activation+6.99e-02

Pos logit contributions
 there+0.115
 set+0.109
 upon+0.108
 saw+0.107
 found+0.104
Neg logit contributions
 wounds-0.166
rier-0.164
 apology-0.160
ometer-0.156
othe-0.156
Token named
Token ID3706
Feature activation+1.18e-01

Pos logit contributions
 upon+0.082
 there+0.071
neath+0.069
ere+0.069
 set+0.068
Neg logit contributions
 wounds-0.135
othe-0.132
ometer-0.129
 apology-0.124
 Or-0.123
Token garden
Token ID11376
Feature activation+9.83e-03

Pos logit contributions
 wounds+0.095
ometer+0.085
rier+0.085
 mistake+0.084
 decisions+0.084
Neg logit contributions
 arrived-0.056
 there-0.054
ere-0.053
 upon-0.053
 set-0.053
Token together
Token ID1978
Feature activation+1.30e-02

Pos logit contributions
 there+0.009
 upon+0.009
 saw+0.009
 set+0.009
ere+0.008
Neg logit contributions
 wounds-0.022
rier-0.021
othe-0.021
 apology-0.021
 fault-0.020
Token and
Token ID290
Feature activation-4.57e-02

Pos logit contributions
 there+0.013
 upon+0.013
 set+0.012
 saw+0.012
ere+0.012
Neg logit contributions
 wounds-0.028
rier-0.028
othe-0.027
 apology-0.027
ometer-0.026
Token share
Token ID2648
Feature activation+1.09e-02

Pos logit contributions
 wounds+0.104
rier+0.102
 apology+0.101
othe+0.100
ometer+0.098
Neg logit contributions
 there-0.045
 upon-0.041
 set-0.041
 saw-0.040
 found-0.038
Token their
Token ID511
Feature activation-9.57e-02

Pos logit contributions
 there+0.011
 upon+0.010
 set+0.010
 saw+0.010
 found+0.009
Neg logit contributions
 wounds-0.025
rier-0.024
 apology-0.024
othe-0.023
vern-0.023
Token m
Token ID285
Feature activation-1.34e-01

Pos logit contributions
 wounds+0.169
 apology+0.165
rier+0.152
othe+0.150
vern+0.149
Neg logit contributions
 there-0.145
 upon-0.137
 set-0.135
 saw-0.134
y-0.131
Tokenugs
Token ID10339
Feature activation-4.52e-02

Pos logit contributions
rier+0.247
 wounds+0.231
 fault+0.222
 worse+0.218
 dent+0.218
Neg logit contributions
 upon-0.185
 there-0.184
 set-0.181
ere-0.181
y-0.179
Token of
Token ID286
Feature activation-8.45e-03

Pos logit contributions
 wounds+0.115
 apology+0.113
rier+0.113
 dent+0.111
ometer+0.111
Neg logit contributions
 there-0.038
 in-0.033
 set-0.032
 a-0.029
 found-0.029
Token spicy
Token ID26880
Feature activation-6.70e-02

Pos logit contributions
 wounds+0.016
 apology+0.015
rier+0.015
othe+0.015
ometer+0.014
Neg logit contributions
 there-0.012
 upon-0.011
 set-0.011
 saw-0.011
 found-0.011
Token tea
Token ID8887
Feature activation-5.06e-02

Pos logit contributions
 wounds+0.120
rier+0.116
 apology+0.116
othe+0.110
ometer+0.110
Neg logit contributions
 there-0.100
 upon-0.095
 saw-0.093
 set-0.092
 found-0.091
Token.
Token ID13
Feature activation-4.89e-02

Pos logit contributions
 wounds+0.114
rier+0.111
 apology+0.111
othe+0.108
ometer+0.107
Neg logit contributions
 there-0.053
 set-0.049
 saw-0.047
 upon-0.047
 found-0.045
Token.
Token ID13
Feature activation-5.43e-02

Pos logit contributions
 there+0.034
 set+0.030
 upon+0.029
 saw+0.029
 three+0.027
Neg logit contributions
 wounds-0.098
rier-0.098
 apology-0.095
vern-0.094
othe-0.093
Token She
Token ID1375
Feature activation-1.39e-02

Pos logit contributions
rier+0.135
 wounds+0.134
 apology+0.127
vern+0.124
vere+0.123
Neg logit contributions
 there-0.053
 upon-0.048
 saw-0.045
 a-0.044
 set-0.043
Token also
Token ID635
Feature activation-1.83e-02

Pos logit contributions
rier+0.029
 wounds+0.029
 apology+0.028
othe+0.028
ometer+0.028
Neg logit contributions
 there-0.017
 set-0.016
 saw-0.016
 found-0.015
 upon-0.015
Token gave
Token ID2921
Feature activation+2.30e-02

Pos logit contributions
 wounds+0.037
 apology+0.035
rier+0.035
othe+0.034
ometer+0.034
Neg logit contributions
 there-0.023
 upon-0.022
 set-0.021
 saw-0.020
 three-0.020
Token the
Token ID262
Feature activation-9.18e-02

Pos logit contributions
 there+0.022
 upon+0.020
 set+0.020
 saw+0.019
 found+0.018
Neg logit contributions
 wounds-0.054
rier-0.052
 apology-0.052
othe-0.051
ometer-0.050
Token m
Token ID285
Feature activation-1.33e-01

Pos logit contributions
othe+0.238
rier+0.231
vern+0.223
vere+0.223
 wounds+0.222
Neg logit contributions
 saw-0.078
 three-0.076
 set-0.072
 there-0.069
 upon-0.066
Tokenule
Token ID2261
Feature activation-6.59e-02

Pos logit contributions
rier+0.208
 apology+0.199
 wounds+0.198
 fault+0.198
 worse+0.198
Neg logit contributions
 upon-0.210
y-0.208
 saw-0.205
 three-0.203
 there-0.201
Token a
Token ID257
Feature activation-4.70e-02

Pos logit contributions
 wounds+0.186
rier+0.183
 apology+0.182
othe+0.178
ometer+0.178
Neg logit contributions
 there-0.029
 set-0.024
 upon-0.023
 saw-0.023
 found-0.020
Token carrot
Token ID40562
Feature activation-5.09e-02

Pos logit contributions
 wounds+0.091
rier+0.091
othe+0.083
vere+0.083
 decisions+0.081
Neg logit contributions
 there-0.065
 saw-0.063
 upon-0.063
 set-0.063
 three-0.061
Token to
Token ID284
Feature activation+1.42e-02

Pos logit contributions
 wounds+0.122
rier+0.120
 apology+0.119
vere+0.118
vern+0.116
Neg logit contributions
 there-0.047
 set-0.045
 saw-0.041
 in-0.040
 upon-0.040
Token make
Token ID787
Feature activation+1.48e-02

Pos logit contributions
 there+0.014
 set+0.013
 saw+0.013
 upon+0.013
 found+0.012
Neg logit contributions
 wounds-0.032
rier-0.032
 apology-0.031
ometer-0.031
vern-0.031
Token that
Token ID326
Feature activation-2.11e-02

Pos logit contributions
 wounds+0.014
rier+0.014
 apology+0.014
ometer+0.014
othe+0.014
Neg logit contributions
 there-0.005
 set-0.005
 upon-0.005
 saw-0.005
 found-0.005
Token he
Token ID339
Feature activation-4.38e-02

Pos logit contributions
 wounds+0.044
rier+0.043
 apology+0.042
vern+0.041
ometer+0.041
Neg logit contributions
 there-0.030
 set-0.027
 saw-0.026
 a-0.025
 three-0.025
Token had
Token ID550
Feature activation-5.94e-03

Pos logit contributions
 wounds+0.092
rier+0.088
 apology+0.088
ometer+0.088
othe+0.087
Neg logit contributions
 there-0.046
 upon-0.043
ere-0.040
 set-0.040
 three-0.040
Token earned
Token ID7366
Feature activation-1.64e-02

Pos logit contributions
 wounds+0.013
vern+0.012
 fault+0.012
vere+0.012
othe+0.011
Neg logit contributions
y-0.006
cy-0.006
 upon-0.006
ere-0.005
r-0.005
Token his
Token ID465
Feature activation-8.98e-02

Pos logit contributions
 wounds+0.041
rier+0.040
 apology+0.040
othe+0.039
ometer+0.039
Neg logit contributions
 there-0.013
 set-0.012
 upon-0.011
 saw-0.011
 found-0.011
Token m
Token ID285
Feature activation-1.31e-01

Pos logit contributions
 wounds+0.153
rier+0.150
 apology+0.147
othe+0.143
ometer+0.142
Neg logit contributions
 there-0.142
 set-0.135
 upon-0.133
 saw-0.133
 found-0.129
Tokenama
Token ID1689
Feature activation-7.45e-02

Pos logit contributions
 worse+0.198
 wounds+0.198
rier+0.195
othe+0.192
 bleed+0.189
Neg logit contributions
 upon-0.211
 there-0.204
 three-0.204
 set-0.202
ere-0.201
Token's
Token ID338
Feature activation-8.84e-02

Pos logit contributions
 wounds+0.153
rier+0.149
 apology+0.147
ometer+0.145
 fault+0.143
Neg logit contributions
 there-0.085
 upon-0.081
 set-0.080
 saw-0.078
ere-0.075
Token permission
Token ID7170
Feature activation-6.77e-02

Pos logit contributions
 wounds+0.154
rier+0.152
othe+0.147
 apology+0.145
ometer+0.143
Neg logit contributions
 there-0.137
 set-0.132
 saw-0.128
 upon-0.128
 three-0.126
Token.
Token ID13
Feature activation-5.43e-02

Pos logit contributions
 wounds+0.159
rier+0.154
 apology+0.154
othe+0.151
vern+0.148
Neg logit contributions
 there-0.067
 set-0.061
 upon-0.060
 saw-0.057
 found-0.056
Token<|endoftext|>
Token ID50256
Feature activation+1.26e-01

Pos logit contributions
rier+0.138
 wounds+0.136
 apology+0.127
vere+0.126
vern+0.126
Neg logit contributions
 there-0.048
 upon-0.047
 saw-0.043
 found-0.041
 set-0.041
Token not
Token ID407
Feature activation-2.61e-02

Pos logit contributions
rier+0.017
 wounds+0.016
ometer+0.016
 apology+0.016
othe+0.015
Neg logit contributions
 there-0.023
 set-0.022
 upon-0.022
 saw-0.022
 found-0.021
Token want
Token ID765
Feature activation+2.08e-03

Pos logit contributions
 wounds+0.055
rier+0.055
 apology+0.054
ometer+0.052
othe+0.052
Neg logit contributions
 there-0.031
 set-0.029
 saw-0.028
 upon-0.028
 found-0.027
Token to
Token ID284
Feature activation-2.18e-02

Pos logit contributions
 there+0.002
 set+0.002
 upon+0.002
 saw+0.002
 found+0.002
Neg logit contributions
 wounds-0.005
rier-0.005
 apology-0.005
othe-0.005
ometer-0.005
Token keep
Token ID1394
Feature activation-2.31e-02

Pos logit contributions
 wounds+0.045
 apology+0.044
othe+0.042
rier+0.042
 bleed+0.042
Neg logit contributions
 there-0.026
 upon-0.026
 saw-0.024
ere-0.023
 set-0.023
Token the
Token ID262
Feature activation-8.69e-02

Pos logit contributions
rier+0.058
 wounds+0.057
 apology+0.057
ometer+0.056
vern+0.055
Neg logit contributions
 there-0.019
 set-0.017
 saw-0.016
 upon-0.016
 found-0.015
Token m
Token ID285
Feature activation-1.31e-01

Pos logit contributions
othe+0.185
rier+0.184
vern+0.183
vere+0.180
 wounds+0.179
Neg logit contributions
 there-0.101
 set-0.099
 saw-0.098
 three-0.095
 upon-0.094
Tokenitten
Token ID2621
Feature activation-8.30e-02

Pos logit contributions
 wounds+0.231
rier+0.231
 apology+0.229
 fault+0.226
vern+0.222
Neg logit contributions
 there-0.183
y-0.181
 upon-0.180
 saw-0.175
 set-0.167
Token.
Token ID13
Feature activation-3.94e-02

Pos logit contributions
 wounds+0.210
 apology+0.209
vern+0.201
rier+0.200
ometer+0.199
Neg logit contributions
 there-0.078
 in-0.069
 set-0.063
 a-0.058
 all-0.055
Token It
Token ID632
Feature activation-3.84e-02

Pos logit contributions
 wounds+0.083
rier+0.081
 apology+0.079
vern+0.077
ometer+0.077
Neg logit contributions
 there-0.050
 upon-0.044
 set-0.044
 saw-0.044
 found-0.042
Token did
Token ID750
Feature activation-1.93e-02

Pos logit contributions
 wounds+0.086
 apology+0.081
 bleed+0.080
othe+0.079
rier+0.078
Neg logit contributions
 there-0.038
ere-0.037
 upon-0.036
 three-0.033
where-0.033
Token not
Token ID407
Feature activation-2.75e-02

Pos logit contributions
rier+0.028
 wounds+0.027
ometer+0.027
 apology+0.026
vern+0.025
Neg logit contributions
 there-0.037
 set-0.034
 upon-0.034
 saw-0.034
 found-0.034
Token they
Token ID484
Feature activation-3.13e-03

Pos logit contributions
 wounds+0.098
rier+0.096
 apology+0.094
othe+0.091
ometer+0.091
Neg logit contributions
 there-0.079
 set-0.076
 upon-0.075
 saw-0.074
 found-0.072
Token could
Token ID714
Feature activation-4.77e-04

Pos logit contributions
 wounds+0.007
othe+0.007
rier+0.007
 apology+0.007
ometer+0.006
Neg logit contributions
 there-0.003
 set-0.003
ere-0.003
 upon-0.003
 three-0.003
Token not
Token ID407
Feature activation-1.36e-02

Pos logit contributions
rier+0.001
 wounds+0.001
 apology+0.001
ometer+0.001
vern+0.001
Neg logit contributions
 there-0.001
 saw-0.001
 set-0.001
 upon-0.001
 found-0.001
Token find
Token ID1064
Feature activation+3.15e-03

Pos logit contributions
 wounds+0.031
rier+0.031
 apology+0.030
ometer+0.030
othe+0.030
Neg logit contributions
 there-0.014
 set-0.013
 saw-0.013
 upon-0.012
 found-0.012
Token their
Token ID511
Feature activation-8.76e-02

Pos logit contributions
 there+0.003
 set+0.003
 saw+0.003
 found+0.003
 upon+0.003
Neg logit contributions
 wounds-0.007
rier-0.007
 apology-0.007
vern-0.007
ometer-0.007
Token m
Token ID285
Feature activation-1.30e-01

Pos logit contributions
rier+0.181
othe+0.177
vere+0.175
vern+0.172
 apology+0.170
Neg logit contributions
 there-0.110
 set-0.109
 saw-0.108
 upon-0.103
 three-0.101
Tokenittens
Token ID34978
Feature activation-5.12e-02

Pos logit contributions
rier+0.213
 apology+0.204
othe+0.192
 wounds+0.192
 worse+0.191
Neg logit contributions
 upon-0.202
 saw-0.189
 set-0.189
y-0.189
 there-0.188
Token.
Token ID13
Feature activation-4.47e-02

Pos logit contributions
 wounds+0.118
 apology+0.117
rier+0.116
othe+0.113
vern+0.113
Neg logit contributions
 there-0.052
 set-0.047
 saw-0.045
 upon-0.044
 found-0.043
Token They
Token ID1119
Feature activation+4.75e-03

Pos logit contributions
 wounds+0.096
rier+0.094
vern+0.093
 apology+0.091
vere+0.091
Neg logit contributions
 there-0.059
 upon-0.051
 saw-0.051
 three-0.049
 found-0.049
Token had
Token ID550
Feature activation+1.59e-02

Pos logit contributions
 there+0.004
 upon+0.004
 set+0.004
ere+0.003
y+0.003
Neg logit contributions
 wounds-0.011
 apology-0.011
othe-0.011
rier-0.011
ometer-0.011
Token disappeared
Token ID12120
Feature activation+5.14e-03

Pos logit contributions
 there+0.018
 upon+0.018
 set+0.018
ere+0.017
y+0.017
Neg logit contributions
 wounds-0.033
 apology-0.031
 fault-0.031
rier-0.031
vern-0.030
Token I
Token ID314
Feature activation-1.88e-02

Pos logit contributions
 wounds+0.051
rier+0.050
 apology+0.049
othe+0.048
ometer+0.047
Neg logit contributions
 there-0.044
 set-0.042
 upon-0.041
 saw-0.041
 found-0.040
Token help
Token ID1037
Feature activation+2.85e-03

Pos logit contributions
 wounds+0.043
rier+0.043
 apology+0.041
othe+0.041
ometer+0.040
Neg logit contributions
 there-0.020
 set-0.018
 saw-0.018
 upon-0.018
 found-0.017
Token you
Token ID345
Feature activation-4.88e-02

Pos logit contributions
 there+0.004
 upon+0.004
 set+0.004
 saw+0.004
 found+0.004
Neg logit contributions
 wounds-0.005
rier-0.005
 apology-0.005
othe-0.005
ometer-0.005
Token fix
Token ID4259
Feature activation-3.55e-03

Pos logit contributions
 wounds+0.117
 apology+0.113
rier+0.113
vern+0.111
vere+0.110
Neg logit contributions
 there-0.038
 upon-0.036
 saw-0.035
ere-0.035
 found-0.034
Token your
Token ID534
Feature activation-8.87e-02

Pos logit contributions
 wounds+0.008
rier+0.008
 apology+0.008
othe+0.008
ometer+0.008
Neg logit contributions
 there-0.003
 set-0.003
 upon-0.003
 saw-0.003
 found-0.003
Token sc
Token ID629
Feature activation-1.28e-01

Pos logit contributions
rier+0.202
othe+0.200
ometer+0.190
vern+0.181
vere+0.181
Neg logit contributions
 there-0.099
 set-0.097
 saw-0.095
 upon-0.089
 found-0.086
Tokenooter
Token ID25141
Feature activation-7.52e-02

Pos logit contributions
rier+0.171
 wounds+0.158
othe+0.151
 worse+0.145
 bleed+0.144
Neg logit contributions
 upon-0.256
 there-0.247
 set-0.242
 saw-0.241
ere-0.237
Token?"
Token ID1701
Feature activation-3.96e-02

Pos logit contributions
 wounds+0.207
rier+0.205
 apology+0.202
othe+0.199
vern+0.197
Neg logit contributions
 there-0.043
 set-0.037
 upon-0.034
 saw-0.033
 found-0.030
Token
Token ID198
Feature activation-7.04e-03

Pos logit contributions
rier+0.117
 wounds+0.113
vern+0.110
 fault+0.109
 apology+0.108
Neg logit contributions
 there-0.028
 saw-0.022
 upon-0.022
 the-0.021
 a-0.020
Token
Token ID198
Feature activation-9.47e-03

Pos logit contributions
rier+0.018
 wounds+0.017
 fault+0.017
 apology+0.017
ometer+0.017
Neg logit contributions
 there-0.006
 saw-0.005
 upon-0.005
 set-0.005
 found-0.005
TokenThe
Token ID464
Feature activation-4.57e-02

Pos logit contributions
 fault+0.023
rier+0.023
 wounds+0.022
 apology+0.022
ometer+0.022
Neg logit contributions
 there-0.010
 saw-0.009
 set-0.009
 found-0.009
 upon-0.008
Token and
Token ID290
Feature activation-5.40e-02

Pos logit contributions
rier+0.132
 wounds+0.131
 apology+0.126
othe+0.126
ometer+0.125
Neg logit contributions
 there-0.057
 upon-0.053
 set-0.052
 saw-0.051
 found-0.049
Token reminded
Token ID14516
Feature activation+2.13e-03

Pos logit contributions
 wounds+0.109
 apology+0.105
rier+0.104
othe+0.104
 decisions+0.099
Neg logit contributions
 there-0.063
 upon-0.060
 set-0.059
ere-0.059
y-0.058
Token herself
Token ID5223
Feature activation-5.77e-02

Pos logit contributions
 there+0.002
 set+0.002
 upon+0.002
 saw+0.002
 found+0.002
Neg logit contributions
 wounds-0.005
rier-0.005
 apology-0.005
ometer-0.005
othe-0.005
Token of
Token ID286
Feature activation+4.16e-03

Pos logit contributions
 wounds+0.124
rier+0.122
 apology+0.121
othe+0.119
vern+0.118
Neg logit contributions
 there-0.059
 saw-0.058
 set-0.057
 upon-0.055
 found-0.054
Token the
Token ID262
Feature activation-8.04e-02

Pos logit contributions
 upon+0.003
 there+0.003
 saw+0.003
 set+0.003
y+0.003
Neg logit contributions
 wounds-0.010
 apology-0.010
rier-0.010
othe-0.010
vern-0.009
Token fore
Token ID1674
Feature activation-1.27e-01

Pos logit contributions
 wounds+0.159
rier+0.156
 apology+0.154
othe+0.150
ometer+0.149
Neg logit contributions
 there-0.104
 set-0.098
 upon-0.096
 saw-0.096
 found-0.093
Tokenb
Token ID65
Feature activation-6.78e-02

Pos logit contributions
 wounds+0.151
 apology+0.144
rier+0.143
 fault+0.140
othe+0.139
Neg logit contributions
 there-0.270
 upon-0.261
 set-0.256
 saw-0.253
 found-0.252
Tokenoding
Token ID7656
Feature activation-5.20e-02

Pos logit contributions
rier+0.110
 wounds+0.108
 apology+0.104
ometer+0.102
vern+0.102
Neg logit contributions
 there-0.109
 set-0.106
 saw-0.106
 found-0.102
 upon-0.102
Token darkness
Token ID11854
Feature activation-4.64e-02

Pos logit contributions
rier+0.100
 wounds+0.100
 apology+0.098
 decisions+0.094
othe+0.094
Neg logit contributions
 there-0.063
 set-0.059
 upon-0.059
 saw-0.059
ere-0.058
Token she
Token ID673
Feature activation-5.97e-02

Pos logit contributions
 wounds+0.107
rier+0.104
 apology+0.104
ometer+0.101
othe+0.101
Neg logit contributions
 there-0.047
 set-0.043
 saw-0.042
 upon-0.042
 found-0.040
Token had
Token ID550
Feature activation-1.65e-02

Pos logit contributions
 wounds+0.129
 apology+0.124
vere+0.122
rier+0.122
vern+0.121
Neg logit contributions
 there-0.058
 upon-0.054
 set-0.052
ere-0.052
 three-0.051
Token't
Token ID470
Feature activation-2.32e-03

Pos logit contributions
 there+0.004
 set+0.004
 a+0.004
 in+0.004
 upon+0.004
Neg logit contributions
 wounds-0.010
rier-0.010
ometer-0.009
 apology-0.009
 fault-0.009
Token want
Token ID765
Feature activation+5.53e-03

Pos logit contributions
 wounds+0.005
rier+0.005
ometer+0.005
othe+0.005
vere+0.005
Neg logit contributions
 there-0.003
 saw-0.002
 set-0.002
 found-0.002
 upon-0.002
Token to
Token ID284
Feature activation-1.33e-02

Pos logit contributions
 there+0.004
 saw+0.003
 set+0.003
 found+0.003
 a+0.003
Neg logit contributions
rier-0.015
 wounds-0.015
ometer-0.014
 apology-0.014
vere-0.014
Token share
Token ID2648
Feature activation+2.17e-02

Pos logit contributions
 wounds+0.032
rier+0.032
ometer+0.031
 apology+0.030
vern+0.030
Neg logit contributions
 there-0.013
 set-0.013
 saw-0.012
 found-0.011
 three-0.011
Token her
Token ID607
Feature activation-7.89e-02

Pos logit contributions
 there+0.015
 set+0.013
 saw+0.012
 upon+0.012
 found+0.011
Neg logit contributions
 wounds-0.057
rier-0.057
 apology-0.056
ometer-0.055
othe-0.055
Token m
Token ID285
Feature activation-1.27e-01

Pos logit contributions
 wounds+0.164
rier+0.161
 apology+0.158
othe+0.155
ometer+0.154
Neg logit contributions
 there-0.096
 set-0.090
 upon-0.088
 saw-0.088
 found-0.085
Tokenop
Token ID404
Feature activation-8.05e-02

Pos logit contributions
rier+0.250
 apology+0.230
 wounds+0.229
 fault+0.226
 worse+0.222
Neg logit contributions
y-0.161
 there-0.158
 upon-0.156
ere-0.154
 three-0.151
Token,
Token ID11
Feature activation-2.44e-02

Pos logit contributions
 wounds+0.214
vere+0.211
rier+0.211
 apology+0.208
vern+0.208
Neg logit contributions
 there-0.055
 in-0.051
y-0.046
 all-0.046
 a-0.043
Token but
Token ID475
Feature activation-4.32e-02

Pos logit contributions
vern+0.067
rier+0.067
ometer+0.066
 wounds+0.062
pre+0.062
Neg logit contributions
 there-0.024
 all-0.021
 a-0.020
 the-0.020
 saw-0.020
Token her
Token ID607
Feature activation-4.67e-02

Pos logit contributions
 wounds+0.073
othe+0.072
 apology+0.070
rier+0.069
 worse+0.069
Neg logit contributions
ere-0.062
 upon-0.059
 set-0.058
r-0.058
 there-0.056
Token mom
Token ID1995
Feature activation+9.09e-03

Pos logit contributions
 wounds+0.091
 apology+0.087
ometer+0.084
rier+0.084
 fault+0.083
Neg logit contributions
 there-0.063
 upon-0.060
 set-0.059
 saw-0.057
 found-0.056
Token and
Token ID290
Feature activation-4.62e-02

Pos logit contributions
 in+0.011
 a+0.011
 there+0.011
 three+0.011
 all+0.010
Neg logit contributions
 wounds-0.051
vern-0.050
rier-0.049
 apology-0.049
vere-0.048
Token wished
Token ID16555
Feature activation-3.02e-02

Pos logit contributions
 wounds+0.082
rier+0.081
 apology+0.079
othe+0.077
ometer+0.077
Neg logit contributions
 there-0.069
 set-0.066
 upon-0.065
 saw-0.065
 found-0.063
Token they
Token ID484
Feature activation-3.94e-02

Pos logit contributions
rier+0.063
vern+0.062
 wounds+0.062
ometer+0.061
 bleed+0.061
Neg logit contributions
 there-0.045
 the-0.041
 in-0.041
 a-0.040
 all-0.037
Token had
Token ID550
Feature activation-1.95e-02

Pos logit contributions
 wounds+0.089
othe+0.083
 apology+0.082
ometer+0.081
rier+0.081
Neg logit contributions
ere-0.037
 set-0.035
 upon-0.035
 there-0.034
 three-0.033
Token their
Token ID511
Feature activation-8.76e-02

Pos logit contributions
 wounds+0.039
 apology+0.036
 fault+0.036
vern+0.036
othe+0.035
Neg logit contributions
 upon-0.024
 there-0.023
ere-0.022
y-0.022
 set-0.022
Token m
Token ID285
Feature activation-1.27e-01

Pos logit contributions
rier+0.180
 wounds+0.171
othe+0.169
 apology+0.167
vere+0.165
Neg logit contributions
 there-0.114
 set-0.110
 saw-0.108
 upon-0.106
 found-0.102
Tokenittens
Token ID34978
Feature activation-5.29e-02

Pos logit contributions
rier+0.210
 apology+0.182
 worse+0.181
 wounds+0.180
othe+0.177
Neg logit contributions
 upon-0.210
ere-0.198
 saw-0.195
 there-0.195
 set-0.194
Token back
Token ID736
Feature activation-7.90e-02

Pos logit contributions
 wounds+0.122
rier+0.122
 apology+0.120
othe+0.117
vere+0.116
Neg logit contributions
 there-0.053
 set-0.048
 saw-0.046
 upon-0.046
 found-0.045
Token.
Token ID13
Feature activation-5.39e-02

Pos logit contributions
 wounds+0.158
rier+0.153
 fault+0.151
 apology+0.150
ometer+0.149
Neg logit contributions
 there-0.092
 set-0.091
 saw-0.089
 upon-0.088
ere-0.088
Token They
Token ID1119
Feature activation-3.50e-03

Pos logit contributions
rier+0.111
 wounds+0.111
 apology+0.108
vern+0.107
vere+0.104
Neg logit contributions
 there-0.072
 saw-0.066
 upon-0.064
 found-0.062
 three-0.062
Token were
Token ID547
Feature activation-4.35e-02

Pos logit contributions
 wounds+0.006
rier+0.006
 apology+0.006
ometer+0.006
othe+0.006
Neg logit contributions
 there-0.005
 set-0.005
 saw-0.005
 upon-0.005
 found-0.005
Token had
Token ID550
Feature activation+3.43e-03

Pos logit contributions
 wounds+0.122
rier+0.120
 apology+0.119
othe+0.116
ometer+0.115
Neg logit contributions
 there-0.060
 set-0.055
 upon-0.054
 saw-0.054
 found-0.052
Token learned
Token ID4499
Feature activation-2.45e-02

Pos logit contributions
 there+0.003
 upon+0.003
 set+0.002
ere+0.002
 saw+0.002
Neg logit contributions
 wounds-0.008
 fault-0.008
rier-0.008
 apology-0.008
vern-0.008
Token to
Token ID284
Feature activation-2.50e-02

Pos logit contributions
 wounds+0.063
rier+0.063
 apology+0.061
othe+0.060
ometer+0.060
Neg logit contributions
 there-0.020
 set-0.016
 a-0.016
 upon-0.015
 saw-0.015
Token appreciate
Token ID9144
Feature activation-8.84e-03

Pos logit contributions
 wounds+0.047
 apology+0.046
othe+0.044
rier+0.044
vern+0.043
Neg logit contributions
 there-0.034
 upon-0.033
 saw-0.031
 set-0.030
 found-0.030
Token the
Token ID262
Feature activation-9.01e-02

Pos logit contributions
 wounds+0.022
rier+0.022
 apology+0.022
othe+0.021
ometer+0.021
Neg logit contributions
 there-0.007
 set-0.006
 upon-0.006
 saw-0.006
 found-0.005
Token therm
Token ID21969
Feature activation-1.26e-01

Pos logit contributions
rier+0.164
othe+0.161
 wounds+0.161
vern+0.156
ometer+0.156
Neg logit contributions
 set-0.134
 there-0.134
 three-0.129
 saw-0.128
 upon-0.124
Tokenometer
Token ID15635
Feature activation-9.82e-02

Pos logit contributions
 wounds+0.106
 apology+0.104
vern+0.092
vere+0.081
 fuss+0.080
Neg logit contributions
y-0.286
 upon-0.284
 there-0.278
 three-0.277
 in-0.276
Token.
Token ID13
Feature activation-5.29e-02

Pos logit contributions
 wounds+0.238
 apology+0.235
rier+0.233
vern+0.225
vere+0.225
Neg logit contributions
 there-0.093
 set-0.082
 in-0.078
 saw-0.077
 found-0.075
Token<|endoftext|>
Token ID50256
Feature activation+1.29e-01

Pos logit contributions
rier+0.137
 wounds+0.136
 apology+0.127
vere+0.125
 bleed+0.125
Neg logit contributions
 there-0.047
 upon-0.041
 saw-0.041
 found-0.039
 set-0.038
TokenJohn
Token ID7554
Feature activation+6.63e-02

Pos logit contributions
 set+0.153
 colored+0.150
ere+0.148
 planted+0.145
 three+0.145
Neg logit contributions
ometer-0.209
vere-0.203
othe-0.199
rier-0.195
 fault-0.193
Token lay
Token ID3830
Feature activation+3.99e-02

Pos logit contributions
 there+0.066
 upon+0.064
 three+0.063
where+0.061
ere+0.059
Neg logit contributions
 wounds-0.123
ometer-0.123
 fault-0.119
ache-0.119
othe-0.114
Token.
Token ID13
Feature activation-5.05e-02

Pos logit contributions
 wounds+0.037
rier+0.036
 apology+0.036
othe+0.035
ometer+0.035
Neg logit contributions
 there-0.017
 set-0.016
 upon-0.016
 saw-0.016
 found-0.015
Token You
Token ID921
Feature activation-2.91e-02

Pos logit contributions
 wounds+0.079
rier+0.077
 apology+0.075
othe+0.072
vern+0.072
Neg logit contributions
 there-0.088
 set-0.083
 upon-0.082
 saw-0.082
 found-0.080
Token can
Token ID460
Feature activation-1.44e-02

Pos logit contributions
 apology+0.053
vern+0.052
vere+0.051
 wounds+0.051
 fault+0.051
Neg logit contributions
y-0.036
ere-0.035
 upon-0.034
 there-0.034
 colored-0.033
Token keep
Token ID1394
Feature activation-1.62e-02

Pos logit contributions
 wounds+0.027
vern+0.027
 apology+0.027
othe+0.026
vere+0.026
Neg logit contributions
 upon-0.018
 there-0.018
 saw-0.016
y-0.016
ere-0.016
Token your
Token ID534
Feature activation-8.24e-02

Pos logit contributions
 wounds+0.036
rier+0.035
 apology+0.034
othe+0.034
ometer+0.033
Neg logit contributions
 there-0.018
 set-0.016
 upon-0.016
 saw-0.016
 found-0.015
Token m
Token ID285
Feature activation-1.26e-01

Pos logit contributions
rier+0.181
othe+0.179
vere+0.175
vern+0.173
 wounds+0.168
Neg logit contributions
 there-0.094
 three-0.093
 set-0.091
 upon-0.088
 saw-0.086
Tokenop
Token ID404
Feature activation-7.58e-02

Pos logit contributions
rier+0.239
 apology+0.220
 wounds+0.217
 fault+0.213
 worse+0.210
Neg logit contributions
 there-0.167
 upon-0.166
 three-0.161
ere-0.159
 set-0.159
Token.
Token ID13
Feature activation-5.31e-02

Pos logit contributions
 wounds+0.162
rier+0.160
 apology+0.158
othe+0.154
ometer+0.153
Neg logit contributions
 there-0.087
 set-0.081
 upon-0.078
 saw-0.077
 found-0.075
Token And
Token ID843
Feature activation-6.69e-02

Pos logit contributions
 wounds+0.097
rier+0.094
vern+0.091
 apology+0.088
vere+0.088
Neg logit contributions
 there-0.079
 set-0.076
 three-0.075
 a-0.072
 saw-0.070
Token you
Token ID345
Feature activation-4.09e-02

Pos logit contributions
 wounds+0.131
rier+0.129
 apology+0.127
othe+0.123
ometer+0.123
Neg logit contributions
 there-0.087
 set-0.082
 upon-0.082
 saw-0.082
 found-0.079
Token can
Token ID460
Feature activation-1.64e-02

Pos logit contributions
 wounds+0.079
 apology+0.079
vern+0.076
ometer+0.076
vere+0.075
Neg logit contributions
 there-0.047
ere-0.047
 upon-0.046
y-0.045
 set-0.044
Token that
Token ID326
Feature activation-2.59e-02

Pos logit contributions
 wounds+0.047
rier+0.046
 apology+0.046
othe+0.045
ometer+0.044
Neg logit contributions
 there-0.027
 upon-0.026
 set-0.026
 saw-0.026
 three-0.025
Token are
Token ID389
Feature activation-8.00e-02

Pos logit contributions
 wounds+0.056
rier+0.056
 apology+0.054
vern+0.053
othe+0.053
Neg logit contributions
 there-0.031
 set-0.029
 saw-0.028
 found-0.027
 upon-0.026
Token cheap
Token ID7026
Feature activation-4.41e-02

Pos logit contributions
 wounds+0.138
rier+0.137
 apology+0.134
othe+0.132
ometer+0.131
Neg logit contributions
 there-0.130
 set-0.120
 saw-0.116
 upon-0.115
 three-0.115
Token are
Token ID389
Feature activation-9.14e-02

Pos logit contributions
 wounds+0.103
rier+0.102
 apology+0.100
ometer+0.097
othe+0.097
Neg logit contributions
 there-0.040
 set-0.037
 upon-0.037
 saw-0.036
 found-0.034
Token not
Token ID407
Feature activation-6.74e-02

Pos logit contributions
 wounds+0.155
rier+0.152
 apology+0.150
othe+0.147
ometer+0.146
Neg logit contributions
 there-0.150
 set-0.140
 saw-0.135
 upon-0.135
 found-0.133
Token very
Token ID845
Feature activation-1.26e-01

Pos logit contributions
 wounds+0.123
rier+0.119
 apology+0.118
othe+0.116
ometer+0.115
Neg logit contributions
 there-0.101
 set-0.095
 upon-0.092
 saw-0.091
 found-0.090
Token good
Token ID922
Feature activation-5.38e-02

Pos logit contributions
 wounds+0.201
rier+0.192
 apology+0.186
othe+0.184
ometer+0.183
Neg logit contributions
 there-0.221
 set-0.207
 saw-0.204
 upon-0.204
 found-0.199
Token.
Token ID13
Feature activation-6.00e-02

Pos logit contributions
 wounds+0.114
rier+0.113
 apology+0.111
othe+0.108
ometer+0.107
Neg logit contributions
 there-0.061
 set-0.057
 upon-0.056
 saw-0.056
 found-0.054
Token But
Token ID887
Feature activation-5.44e-02

Pos logit contributions
 wounds+0.118
vern+0.112
rier+0.111
othe+0.111
 apology+0.108
Neg logit contributions
 there-0.083
 three-0.078
 set-0.075
 Inside-0.072
 a-0.072
Token these
Token ID777
Feature activation-3.37e-02

Pos logit contributions
 wounds+0.094
rier+0.092
 apology+0.090
othe+0.088
ometer+0.087
Neg logit contributions
 there-0.082
 set-0.079
 upon-0.078
 saw-0.078
 found-0.076
Token cookies
Token ID14746
Feature activation-2.24e-02

Pos logit contributions
 wounds+0.066
 apology+0.064
rier+0.064
othe+0.061
ometer+0.061
Neg logit contributions
 there-0.043
 upon-0.042
 set-0.041
 saw-0.040
 found-0.039
Token them
Token ID606
Feature activation-2.29e-02

Pos logit contributions
 there+0.020
 set+0.018
 upon+0.018
 saw+0.018
 found+0.017
Neg logit contributions
 wounds-0.045
rier-0.044
 apology-0.044
othe-0.043
ometer-0.042
Token.
Token ID13
Feature activation-4.33e-02

Pos logit contributions
 wounds+0.057
vern+0.056
 dent+0.056
othe+0.056
vere+0.056
Neg logit contributions
 there-0.018
 in-0.017
 set-0.017
 upon-0.016
 three-0.016
Token He
Token ID679
Feature activation-4.55e-03

Pos logit contributions
 wounds+0.088
rier+0.087
vern+0.083
 apology+0.083
 fault+0.082
Neg logit contributions
 there-0.060
 saw-0.054
 set-0.053
 upon-0.053
 found-0.051
Token held
Token ID2714
Feature activation-9.11e-04

Pos logit contributions
 wounds+0.009
 apology+0.009
rier+0.009
ometer+0.008
othe+0.008
Neg logit contributions
 there-0.005
 upon-0.005
ere-0.005
 set-0.005
 three-0.005
Token their
Token ID511
Feature activation-8.28e-02

Pos logit contributions
 wounds+0.002
rier+0.002
 apology+0.002
othe+0.002
ometer+0.002
Neg logit contributions
 there-0.001
 set-0.001
 upon-0.001
 saw-0.001
 found-0.001
Token m
Token ID285
Feature activation-1.25e-01

Pos logit contributions
rier+0.177
othe+0.171
vere+0.170
vern+0.169
ometer+0.165
Neg logit contributions
 there-0.097
 set-0.093
 saw-0.092
 upon-0.092
 three-0.087
Tokenittens
Token ID34978
Feature activation-4.75e-02

Pos logit contributions
rier+0.208
 worse+0.191
 fault+0.186
vere+0.180
 smarter+0.179
Neg logit contributions
 upon-0.191
r-0.182
y-0.181
ere-0.181
 set-0.177
Token gently
Token ID15165
Feature activation-4.21e-02

Pos logit contributions
 apology+0.111
 fault+0.109
 dent+0.109
 wounds+0.109
rier+0.108
Neg logit contributions
 there-0.050
 in-0.047
 set-0.044
 all-0.040
 upon-0.040
Token.
Token ID13
Feature activation-4.42e-02

Pos logit contributions
 wounds+0.099
rier+0.098
 apology+0.097
othe+0.095
ometer+0.094
Neg logit contributions
 there-0.039
 set-0.036
 upon-0.034
 saw-0.034
 found-0.033
Token He
Token ID679
Feature activation-1.03e-02

Pos logit contributions
 wounds+0.090
rier+0.089
 apology+0.086
vern+0.084
ometer+0.084
Neg logit contributions
 there-0.058
 set-0.053
 saw-0.052
 upon-0.052
 found-0.050
Token knew
Token ID2993
Feature activation-4.81e-02

Pos logit contributions
 wounds+0.022
 apology+0.021
rier+0.021
othe+0.020
ometer+0.020
Neg logit contributions
 there-0.012
 upon-0.011
 set-0.011
ere-0.010
 three-0.010
Token to
Token ID284
Feature activation-2.14e-02

Pos logit contributions
rier+0.013
 wounds+0.013
 apology+0.013
vere+0.013
ometer+0.013
Neg logit contributions
 there-0.004
 set-0.003
 a-0.003
 saw-0.003
 three-0.003
Token lose
Token ID4425
Feature activation-1.43e-02

Pos logit contributions
 wounds+0.046
rier+0.044
 apology+0.044
othe+0.044
ometer+0.042
Neg logit contributions
 there-0.024
 upon-0.023
 set-0.022
 saw-0.022
 found-0.021
Token her
Token ID607
Feature activation-8.67e-02

Pos logit contributions
 apology+0.036
 wounds+0.036
rier+0.036
ometer+0.036
vere+0.036
Neg logit contributions
 there-0.011
 set-0.009
 saw-0.009
 in-0.009
 a-0.008
Token new
Token ID649
Feature activation-8.34e-02

Pos logit contributions
 wounds+0.172
rier+0.166
 apology+0.165
ometer+0.159
 fault+0.159
Neg logit contributions
 there-0.113
 set-0.106
 upon-0.105
 saw-0.105
 found-0.102
Token gifted
Token ID22527
Feature activation-4.26e-02

Pos logit contributions
 wounds+0.168
rier+0.160
 apology+0.160
 fault+0.157
ometer+0.156
Neg logit contributions
 there-0.105
 upon-0.096
 set-0.096
 saw-0.095
 found-0.094
Token m
Token ID285
Feature activation-1.25e-01

Pos logit contributions
 wounds+0.086
rier+0.083
 apology+0.083
ometer+0.080
 fault+0.080
Neg logit contributions
 there-0.054
 upon-0.050
 set-0.049
 saw-0.049
 found-0.048
Tokenittens
Token ID34978
Feature activation-5.58e-02

Pos logit contributions
rier+0.198
 apology+0.195
 wounds+0.194
 worse+0.181
vere+0.179
Neg logit contributions
 upon-0.199
 there-0.197
y-0.196
 set-0.189
ere-0.188
Token.
Token ID13
Feature activation-5.20e-02

Pos logit contributions
 wounds+0.131
rier+0.127
 apology+0.127
othe+0.123
vern+0.122
Neg logit contributions
 there-0.056
 set-0.050
 upon-0.048
 saw-0.048
 found-0.047
Token But
Token ID887
Feature activation-3.92e-02

Pos logit contributions
 wounds+0.111
rier+0.110
vern+0.106
 apology+0.106
vere+0.103
Neg logit contributions
 there-0.062
 set-0.059
 saw-0.059
 upon-0.058
 found-0.056
Token then
Token ID788
Feature activation+2.77e-02

Pos logit contributions
 wounds+0.083
othe+0.081
 apology+0.079
rier+0.078
ometer+0.076
Neg logit contributions
ere-0.040
 upon-0.039
 set-0.039
 there-0.038
y-0.037
Token,
Token ID11
Feature activation+8.82e-03

Pos logit contributions
ere+0.028
 set+0.027
 upon+0.027
y+0.025
r+0.025
Neg logit contributions
othe-0.058
 wounds-0.057
 sorry-0.055
 worse-0.055
 apology-0.053
Token for
Token ID329
Feature activation-1.14e-02

Pos logit contributions
 wounds+0.058
rier+0.056
 apology+0.056
ometer+0.055
othe+0.054
Neg logit contributions
 there-0.028
 set-0.025
 upon-0.025
 saw-0.025
 found-0.024
Token things
Token ID1243
Feature activation-5.79e-02

Pos logit contributions
rier+0.027
 wounds+0.027
ometer+0.027
 apology+0.026
othe+0.026
Neg logit contributions
 there-0.011
 set-0.010
 saw-0.009
 found-0.009
 upon-0.009
Token that
Token ID326
Feature activation-6.46e-02

Pos logit contributions
 wounds+0.125
rier+0.122
 apology+0.121
othe+0.118
ometer+0.118
Neg logit contributions
 there-0.066
 set-0.061
 upon-0.060
 saw-0.060
 found-0.058
Token weren
Token ID6304
Feature activation-7.74e-02

Pos logit contributions
 wounds+0.165
rier+0.163
 apology+0.156
vern+0.153
ometer+0.150
Neg logit contributions
 there-0.066
 saw-0.060
 found-0.058
 set-0.058
 upon-0.054
Token't
Token ID470
Feature activation-5.63e-02

Pos logit contributions
 wounds+0.146
rier+0.145
 apology+0.142
othe+0.139
vern+0.137
Neg logit contributions
 there-0.105
 set-0.099
 saw-0.099
 upon-0.098
 found-0.095
Token quite
Token ID2407
Feature activation-1.25e-01

Pos logit contributions
 wounds+0.112
rier+0.110
 apology+0.109
othe+0.106
ometer+0.105
Neg logit contributions
 there-0.072
 set-0.068
 upon-0.067
 saw-0.067
 found-0.064
Token right
Token ID826
Feature activation-7.54e-02

Pos logit contributions
 wounds+0.198
rier+0.194
 apology+0.190
othe+0.184
ometer+0.182
Neg logit contributions
 there-0.210
 set-0.200
 upon-0.198
 saw-0.198
 found-0.193
Token.
Token ID13
Feature activation-6.25e-02

Pos logit contributions
rier+0.155
 wounds+0.154
 fault+0.150
 apology+0.148
ometer+0.145
Neg logit contributions
 there-0.084
 set-0.084
 saw-0.082
ere-0.080
 upon-0.079
Token<|endoftext|>
Token ID50256
Feature activation+1.25e-01

Pos logit contributions
rier+0.180
 wounds+0.179
 bleed+0.166
 apology+0.165
vere+0.163
Neg logit contributions
 there-0.039
 upon-0.035
 the-0.033
 a-0.032
-0.032
TokenOnce
Token ID7454
Feature activation+1.22e-01

Pos logit contributions
 set+0.141
 colored+0.137
ere+0.134
 three+0.133
 planted+0.132
Neg logit contributions
ometer-0.210
vere-0.202
othe-0.200
 fault-0.196
rier-0.195
Token upon
Token ID2402
Feature activation+1.15e-01

Pos logit contributions
 set+0.055
 three+0.036
ere+0.033
 colored+0.032
 entered+0.030
Neg logit contributions
othe-0.338
 fault-0.328
 apology-0.325
ometer-0.325
vern-0.323
Token before
Token ID878
Feature activation-3.91e-02

Pos logit contributions
 wounds+0.096
rier+0.096
 apology+0.093
othe+0.092
ometer+0.091
Neg logit contributions
 there-0.049
 set-0.046
 saw-0.046
 upon-0.045
 found-0.044
Token we
Token ID356
Feature activation-3.90e-02

Pos logit contributions
 wounds+0.073
rier+0.072
 apology+0.071
othe+0.070
vere+0.069
Neg logit contributions
 there-0.050
 saw-0.048
 set-0.048
 upon-0.048
 three-0.047
Token sent
Token ID1908
Feature activation+2.61e-02

Pos logit contributions
 apology+0.077
 wounds+0.077
othe+0.074
rier+0.074
 bleed+0.074
Neg logit contributions
 there-0.044
 upon-0.040
ere-0.039
where-0.039
 three-0.038
Token him
Token ID683
Feature activation-6.73e-02

Pos logit contributions
 there+0.026
 set+0.024
 upon+0.024
 saw+0.023
 found+0.023
Neg logit contributions
 wounds-0.059
rier-0.059
 apology-0.058
othe-0.057
ometer-0.057
Token the
Token ID262
Feature activation-8.52e-02

Pos logit contributions
 wounds+0.134
rier+0.129
 apology+0.128
 fault+0.126
ometer+0.123
Neg logit contributions
 there-0.077
 saw-0.077
 set-0.076
ere-0.075
 upon-0.075
Token sc
Token ID629
Feature activation-1.24e-01

Pos logit contributions
othe+0.178
rier+0.173
 wounds+0.168
ometer+0.168
vern+0.158
Neg logit contributions
 there-0.115
 set-0.114
 saw-0.107
 three-0.106
 upon-0.104
Tokenooter
Token ID25141
Feature activation-6.83e-02

Pos logit contributions
 wounds+0.153
rier+0.152
othe+0.139
 worse+0.137
 bleed+0.133
Neg logit contributions
 upon-0.253
 there-0.250
 saw-0.243
 set-0.242
 three-0.239
Token!"
Token ID2474
Feature activation-3.96e-02

Pos logit contributions
 wounds+0.149
rier+0.147
 apology+0.145
othe+0.142
ometer+0.141
Neg logit contributions
 there-0.074
 set-0.069
 upon-0.067
 saw-0.067
 found-0.064
Token M
Token ID337
Feature activation-3.00e-02

Pos logit contributions
 wounds+0.095
rier+0.094
 apology+0.090
vern+0.089
vere+0.088
Neg logit contributions
 there-0.040
 upon-0.037
 set-0.036
 saw-0.036
 found-0.034
Tokenandy
Token ID10757
Feature activation-9.93e-03

Pos logit contributions
 wounds+0.069
rier+0.068
 apology+0.067
othe+0.066
ometer+0.065
Neg logit contributions
 there-0.030
 set-0.027
 saw-0.027
 upon-0.026
 found-0.025
Token smiled
Token ID13541
Feature activation-2.33e-02

Pos logit contributions
 wounds+0.022
rier+0.022
 apology+0.021
othe+0.021
ometer+0.021
Neg logit contributions
 there-0.011
 set-0.010
 saw-0.010
 upon-0.010
 found-0.009
Token boy
Token ID2933
Feature activation-4.76e-02

Pos logit contributions
 wounds+0.128
rier+0.125
othe+0.125
ometer+0.122
 apology+0.121
Neg logit contributions
 there-0.128
 saw-0.120
 set-0.119
 upon-0.117
 found-0.115
Token and
Token ID290
Feature activation-4.08e-02

Pos logit contributions
rier+0.122
 wounds+0.121
vern+0.119
vere+0.119
 apology+0.116
Neg logit contributions
 there-0.042
 in-0.035
 saw-0.033
 set-0.032
 upon-0.032
Token give
Token ID1577
Feature activation-2.06e-03

Pos logit contributions
rier+0.089
 wounds+0.087
ometer+0.085
vern+0.084
 apology+0.084
Neg logit contributions
 there-0.048
 saw-0.044
 set-0.044
 found-0.043
 upon-0.041
Token him
Token ID683
Feature activation-6.04e-02

Pos logit contributions
 wounds+0.005
rier+0.005
 apology+0.005
othe+0.005
ometer+0.004
Neg logit contributions
 there-0.002
 set-0.002
 upon-0.002
 saw-0.002
ere-0.002
Token her
Token ID607
Feature activation-8.59e-02

Pos logit contributions
 wounds+0.151
 apology+0.145
vern+0.139
rier+0.138
 fault+0.138
Neg logit contributions
 there-0.045
 upon-0.044
 set-0.042
 saw-0.041
y-0.041
Token sc
Token ID629
Feature activation-1.24e-01

Pos logit contributions
 wounds+0.158
rier+0.154
 apology+0.152
othe+0.147
ometer+0.147
Neg logit contributions
 there-0.124
 set-0.117
 upon-0.116
 saw-0.115
 found-0.112
Tokenooter
Token ID25141
Feature activation-7.11e-02

Pos logit contributions
rier+0.146
 wounds+0.134
othe+0.128
 dent+0.121
 worse+0.121
Neg logit contributions
 upon-0.263
 there-0.258
 saw-0.250
 found-0.249
 set-0.247
Token as
Token ID355
Feature activation-3.47e-02

Pos logit contributions
 wounds+0.141
rier+0.140
 apology+0.137
othe+0.136
ometer+0.133
Neg logit contributions
 there-0.093
 set-0.086
 saw-0.084
 upon-0.084
 found-0.082
Token a
Token ID257
Feature activation-2.59e-02

Pos logit contributions
 wounds+0.088
 apology+0.086
rier+0.086
othe+0.084
vern+0.083
Neg logit contributions
 there-0.024
 set-0.023
 saw-0.022
 upon-0.021
 found-0.021
Token gift
Token ID6979
Feature activation-2.35e-02

Pos logit contributions
rier+0.059
 wounds+0.056
othe+0.052
 decisions+0.051
vere+0.051
Neg logit contributions
 saw-0.031
 there-0.030
 set-0.029
 arrived-0.029
 three-0.029
Token.
Token ID13
Feature activation-5.58e-02

Pos logit contributions
 wounds+0.058
rier+0.055
 apology+0.055
othe+0.054
vern+0.054
Neg logit contributions
 there-0.023
 set-0.021
 in-0.019
 saw-0.018
 found-0.018
Token homes
Token ID5682
Feature activation+2.46e-02

Pos logit contributions
rier+0.067
 wounds+0.066
 apology+0.065
othe+0.064
vern+0.063
Neg logit contributions
 there-0.062
 set-0.059
 saw-0.058
 upon-0.058
 found-0.056
Token.
Token ID13
Feature activation-3.98e-02

Pos logit contributions
 there+0.024
 set+0.021
 upon+0.020
 saw+0.020
 in+0.020
Neg logit contributions
 wounds-0.059
 apology-0.058
rier-0.058
vern-0.056
ometer-0.056
Token They
Token ID1119
Feature activation+1.06e-02

Pos logit contributions
 wounds+0.065
rier+0.063
 apology+0.062
othe+0.061
ometer+0.060
Neg logit contributions
 there-0.064
 set-0.062
 saw-0.061
 upon-0.061
 found-0.060
Token put
Token ID1234
Feature activation+2.20e-02

Pos logit contributions
 there+0.012
 upon+0.011
 set+0.011
ere+0.010
 saw+0.010
Neg logit contributions
 wounds-0.023
 apology-0.022
rier-0.021
othe-0.021
vern-0.021
Token their
Token ID511
Feature activation-7.97e-02

Pos logit contributions
 there+0.021
 upon+0.020
 saw+0.019
 set+0.019
 found+0.018
Neg logit contributions
 wounds-0.051
rier-0.049
 apology-0.049
othe-0.048
ometer-0.047
Token sc
Token ID629
Feature activation-1.24e-01

Pos logit contributions
 wounds+0.165
 apology+0.155
rier+0.152
ometer+0.149
 fault+0.148
Neg logit contributions
 there-0.099
 set-0.095
 saw-0.093
 upon-0.092
 found-0.091
Tokenooters
Token ID48316
Feature activation-3.19e-02

Pos logit contributions
rier+0.170
 wounds+0.149
othe+0.143
 worse+0.143
 dent+0.140
Neg logit contributions
 upon-0.248
 there-0.240
 set-0.234
 saw-0.230
 found-0.228
Token away
Token ID1497
Feature activation-3.39e-02

Pos logit contributions
 wounds+0.070
rier+0.068
ometer+0.067
 apology+0.066
othe+0.065
Neg logit contributions
 there-0.031
 saw-0.030
 upon-0.030
 set-0.029
ere-0.028
Token and
Token ID290
Feature activation-4.18e-02

Pos logit contributions
 wounds+0.080
rier+0.079
 apology+0.078
othe+0.076
ometer+0.076
Neg logit contributions
 there-0.031
 set-0.028
 upon-0.028
 saw-0.027
 found-0.026
Token say
Token ID910
Feature activation-3.84e-02

Pos logit contributions
rier+0.094
 wounds+0.092
 apology+0.090
ometer+0.090
vern+0.089
Neg logit contributions
 there-0.045
 set-0.042
 saw-0.041
 upon-0.039
 found-0.039
Token goodbye
Token ID24829
Feature activation-1.19e-02

Pos logit contributions
 wounds+0.078
rier+0.076
 apology+0.075
ometer+0.073
othe+0.073
Neg logit contributions
 there-0.050
 set-0.046
 saw-0.045
 upon-0.044
 three-0.043
Token that
Token ID326
Feature activation-2.94e-02

Pos logit contributions
 wounds+0.128
rier+0.126
 apology+0.124
ometer+0.121
othe+0.120
Neg logit contributions
 there-0.079
 set-0.072
 upon-0.071
 saw-0.070
 found-0.068
Token he
Token ID339
Feature activation-5.32e-02

Pos logit contributions
rier+0.064
 wounds+0.063
 apology+0.061
ometer+0.061
vern+0.060
Neg logit contributions
 there-0.039
 set-0.034
 saw-0.033
 upon-0.033
 found-0.033
Token had
Token ID550
Feature activation-1.71e-02

Pos logit contributions
 wounds+0.112
 apology+0.109
ometer+0.107
rier+0.107
othe+0.106
Neg logit contributions
 there-0.055
 upon-0.051
 set-0.050
ere-0.049
 three-0.049
Token done
Token ID1760
Feature activation-3.26e-02

Pos logit contributions
 wounds+0.034
vern+0.033
 apology+0.033
 fault+0.033
othe+0.032
Neg logit contributions
 upon-0.018
 there-0.018
y-0.018
ere-0.017
cy-0.017
Token something
Token ID1223
Feature activation-8.99e-02

Pos logit contributions
 wounds+0.078
rier+0.076
 apology+0.074
vern+0.073
othe+0.073
Neg logit contributions
 upon-0.026
 set-0.026
 there-0.026
 saw-0.025
y-0.024
Token very
Token ID845
Feature activation-1.24e-01

Pos logit contributions
 wounds+0.186
 apology+0.176
 dent+0.172
rier+0.172
 bleed+0.171
Neg logit contributions
 there-0.130
 in-0.118
 upon-0.112
 a-0.111
 saw-0.110
Token helpful
Token ID7613
Feature activation-4.01e-02

Pos logit contributions
 wounds+0.200
rier+0.188
 apology+0.183
 dent+0.181
ometer+0.180
Neg logit contributions
 there-0.225
 saw-0.208
 upon-0.208
 set-0.206
 found-0.205
Token and
Token ID290
Feature activation-8.84e-02

Pos logit contributions
 wounds+0.092
rier+0.091
 apology+0.090
othe+0.088
ometer+0.087
Neg logit contributions
 there-0.038
 set-0.035
 upon-0.035
 saw-0.034
 found-0.032
Token kind
Token ID1611
Feature activation-2.35e-02

Pos logit contributions
 wounds+0.153
rier+0.151
 apology+0.146
ometer+0.143
 fault+0.141
Neg logit contributions
 there-0.143
 set-0.134
 saw-0.133
 found-0.130
 upon-0.129
Token.
Token ID13
Feature activation-6.80e-02

Pos logit contributions
 wounds+0.055
rier+0.053
 apology+0.053
othe+0.052
ometer+0.052
Neg logit contributions
 there-0.023
 set-0.021
 saw-0.021
 upon-0.021
 found-0.020
Token He
Token ID679
Feature activation-2.96e-02

Pos logit contributions
 wounds+0.164
rier+0.159
 apology+0.151
 bleed+0.151
 dent+0.148
Neg logit contributions
 there-0.071
 upon-0.064
 three-0.059
 a-0.059
 set-0.059
Token and
Token ID290
Feature activation-6.94e-02

Pos logit contributions
 wounds+0.026
rier+0.026
 apology+0.025
othe+0.025
ometer+0.025
Neg logit contributions
 there-0.011
 set-0.010
 upon-0.010
 saw-0.009
 found-0.009
Token it
Token ID340
Feature activation-6.83e-02

Pos logit contributions
 wounds+0.136
rier+0.132
 apology+0.132
othe+0.128
ometer+0.127
Neg logit contributions
 there-0.091
 set-0.085
 upon-0.085
 saw-0.084
 found-0.081
Token made
Token ID925
Feature activation+1.27e-02

Pos logit contributions
 wounds+0.184
rier+0.184
 apology+0.181
ometer+0.175
othe+0.174
Neg logit contributions
 there-0.048
 set-0.045
 saw-0.043
 found-0.042
 in-0.036
Token him
Token ID683
Feature activation-6.39e-02

Pos logit contributions
 there+0.010
 set+0.008
 a+0.008
 the+0.008
 all+0.008
Neg logit contributions
ometer-0.034
 wounds-0.034
rier-0.034
 apology-0.033
vern-0.033
Token feel
Token ID1254
Feature activation-6.67e-02

Pos logit contributions
rier+0.125
 wounds+0.123
 apology+0.120
ometer+0.119
vere+0.116
Neg logit contributions
 there-0.089
 upon-0.082
 saw-0.081
 set-0.081
 a-0.081
Token very
Token ID845
Feature activation-1.24e-01

Pos logit contributions
 wounds+0.090
rier+0.087
 apology+0.085
ometer+0.085
othe+0.081
Neg logit contributions
 there-0.133
 set-0.126
 saw-0.125
 found-0.123
 upon-0.123
Token special
Token ID2041
Feature activation-4.38e-02

Pos logit contributions
 wounds+0.211
ometer+0.202
 apology+0.199
rier+0.199
 fault+0.198
Neg logit contributions
 there-0.212
 saw-0.199
 set-0.196
 upon-0.193
 found-0.193
Token.
Token ID13
Feature activation-4.69e-02

Pos logit contributions
 wounds+0.100
 apology+0.098
rier+0.097
othe+0.096
ometer+0.095
Neg logit contributions
 there-0.046
 set-0.041
 upon-0.040
 saw-0.039
 found-0.038
Token The
Token ID383
Feature activation-5.09e-02

Pos logit contributions
rier+0.127
 wounds+0.124
vere+0.115
 apology+0.114
 bleed+0.112
Neg logit contributions
 there-0.040
 upon-0.037
 saw-0.035
 set-0.034
 found-0.033
Token end
Token ID886
Feature activation-7.08e-02

Pos logit contributions
rier+0.109
ometer+0.104
 wounds+0.104
othe+0.103
 fault+0.098
Neg logit contributions
 there-0.067
 set-0.063
 saw-0.062
 three-0.059
 upon-0.058
Token.
Token ID13
Feature activation-3.60e-02

Pos logit contributions
rier+0.182
 wounds+0.178
 apology+0.172
vere+0.166
 fault+0.165
Neg logit contributions
 there-0.064
 saw-0.054
 upon-0.053
 in-0.053
 set-0.052
Token people
Token ID661
Feature activation-2.07e-02

Pos logit contributions
 there+0.026
 set+0.024
 saw+0.023
 upon+0.023
 found+0.023
Neg logit contributions
 wounds-0.044
rier-0.044
ometer-0.044
 apology-0.043
othe-0.043
Token happy
Token ID3772
Feature activation-3.18e-02

Pos logit contributions
 wounds+0.034
rier+0.033
 apology+0.033
ometer+0.033
othe+0.032
Neg logit contributions
 there-0.035
 set-0.032
 saw-0.032
 upon-0.032
 found-0.031
Token.
Token ID13
Feature activation-3.70e-02

Pos logit contributions
 wounds+0.067
rier+0.067
 apology+0.065
 fault+0.063
othe+0.063
Neg logit contributions
 there-0.034
 upon-0.033
 set-0.033
 saw-0.032
 found-0.031
Token<|endoftext|>
Token ID50256
Feature activation+1.38e-01

Pos logit contributions
rier+0.105
 wounds+0.102
 bleed+0.093
ometer+0.093
 apology+0.093
Neg logit contributions
 there-0.027
 a-0.021
 upon-0.021
 Inside-0.020
 three-0.020
TokenOnce
Token ID7454
Feature activation+1.35e-01

Pos logit contributions
 set+0.197
ere+0.187
 saw+0.185
 colored+0.185
 three+0.184
Neg logit contributions
vere-0.202
ometer-0.202
 wounds-0.198
othe-0.197
vern-0.197
Token upon
Token ID2402
Feature activation+1.26e-01

Pos logit contributions
 set+0.062
 three+0.040
ere+0.038
 colored+0.038
 entered+0.035
Neg logit contributions
othe-0.369
 apology-0.361
ometer-0.360
 fault-0.359
vern-0.354
Token a
Token ID257
Feature activation+1.12e-01

Pos logit contributions
ere+0.111
y+0.089
cy+0.088
 arrived+0.086
 set+0.085
Neg logit contributions
 wounds-0.291
vern-0.281
 apology-0.278
 worse-0.278
 bleed-0.275
Token time
Token ID640
Feature activation+1.14e-01

Pos logit contributions
 there+0.092
 set+0.086
 upon+0.078
 three+0.075
 saw+0.075
Neg logit contributions
vern-0.256
 apology-0.256
 wounds-0.255
 fault-0.253
othe-0.247
Token,
Token ID11
Feature activation+1.01e-01

Pos logit contributions
 set+0.125
 colored+0.107
ere+0.106
 spread+0.105
 explored+0.104
Neg logit contributions
othe-0.239
vern-0.236
vere-0.233
 fault-0.227
 sorry-0.226
Token there
Token ID612
Feature activation+1.27e-01

Pos logit contributions
ere+0.093
 explored+0.086
 set+0.085
 approached+0.076
 arrived+0.076
Neg logit contributions
 sorry-0.218
othe-0.216
ometer-0.212
 worse-0.203
 Or-0.198
Token was
Token ID373
Feature activation+9.06e-02

Pos logit contributions
ere+0.159
 set+0.151
 three+0.148
ka+0.147
r+0.142
Neg logit contributions
othe-0.215
 fault-0.205
ometer-0.203
 dent-0.203
 mistake-0.199
Token mom
Token ID1995
Feature activation+6.30e-02

Pos logit contributions
 there+0.003
 set+0.002
 saw+0.002
 upon+0.002
 found+0.002
Neg logit contributions
 wounds-0.014
rier-0.013
 apology-0.013
ometer-0.013
othe-0.013
Token.
Token ID13
Feature activation+1.27e-03

Pos logit contributions
 there+0.064
ere+0.059
 set+0.059
 upon+0.058
 three+0.056
Neg logit contributions
 wounds-0.135
 apology-0.131
rier-0.129
othe-0.128
 fault-0.125
Token<|endoftext|>
Token ID50256
Feature activation+1.49e-01

Pos logit contributions
 there+0.002
 set+0.002
 saw+0.002
 upon+0.002
 found+0.002
Neg logit contributions
 wounds-0.002
rier-0.002
 apology-0.002
ometer-0.002
othe-0.002
TokenOnce
Token ID7454
Feature activation+1.45e-01

Pos logit contributions
 set+0.234
 three+0.229
 spent+0.223
 colored+0.220
 planted+0.214
Neg logit contributions
othe-0.188
ometer-0.188
 Or-0.184
vere-0.175
 fault-0.173
Token upon
Token ID2402
Feature activation+1.56e-01

Pos logit contributions
 set+0.090
 colored+0.069
 entered+0.063
ere+0.063
 explored+0.061
Neg logit contributions
othe-0.373
 fault-0.365
ometer-0.364
rier-0.358
vern-0.354
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ere+0.171
cy+0.147
neath+0.144
y+0.143
 arrived+0.139
Neg logit contributions
 wounds-0.315
 bleed-0.312
 worse-0.310
vern-0.304
 apology-0.300
Token time
Token ID640
Feature activation+1.32e-01

Pos logit contributions
 there+0.116
 set+0.103
 upon+0.097
 saw+0.090
 entered+0.090
Neg logit contributions
 wounds-0.292
vern-0.291
 apology-0.288
 fault-0.286
othe-0.280
Token there
Token ID612
Feature activation+1.57e-01

Pos logit contributions
 set+0.147
ere+0.131
 colored+0.130
 explored+0.125
 spread+0.121
Neg logit contributions
vern-0.264
othe-0.263
vere-0.257
 fault-0.251
 sorry-0.250
Token was
Token ID373
Feature activation+1.15e-01

Pos logit contributions
ere+0.182
 three+0.172
 set+0.171
ka+0.165
r+0.161
Neg logit contributions
othe-0.278
ometer-0.264
 dent-0.263
 fault-0.262
 wounds-0.249
Token a
Token ID257
Feature activation+9.75e-02

Pos logit contributions
 upon+0.088
 arrived+0.083
 saw+0.082
y+0.082
ere+0.079
Neg logit contributions
vern-0.247
 fault-0.246
 wounds-0.238
 Or-0.237
 bleed-0.236
Token depend
Token ID4745
Feature activation+7.07e-02

Pos logit contributions
 there+0.069
 set+0.061
 saw+0.060
 upon+0.059
 found+0.055
Neg logit contributions
 wounds-0.252
rier-0.249
 apology-0.246
othe-0.241
ometer-0.239
Token.
Token ID13
Feature activation-3.92e-02

Pos logit contributions
 wounds+0.080
rier+0.080
 apology+0.079
ometer+0.077
vern+0.077
Neg logit contributions
 there-0.033
 in-0.028
 saw-0.028
 set-0.026
 upon-0.026
Token<|endoftext|>
Token ID50256
Feature activation+1.36e-01

Pos logit contributions
rier+0.108
 wounds+0.104
vere+0.100
vern+0.099
 apology+0.098
Neg logit contributions
 there-0.032
 saw-0.028
 found-0.026
 a-0.026
 upon-0.026
TokenOnce
Token ID7454
Feature activation+1.28e-01

Pos logit contributions
 set+0.200
ere+0.198
 upon+0.190
 three+0.188
 saw+0.187
Neg logit contributions
 wounds-0.199
vere-0.195
othe-0.195
vern-0.193
 apology-0.192
Token upon
Token ID2402
Feature activation+1.17e-01

Pos logit contributions
 set+0.067
 three+0.049
ere+0.047
 colored+0.045
 entered+0.044
Neg logit contributions
othe-0.343
 apology-0.331
 fault-0.331
ometer-0.328
vern-0.327
Token a
Token ID257
Feature activation+1.05e-01

Pos logit contributions
ere+0.115
cy+0.088
y+0.088
al+0.085
 arrived+0.083
Neg logit contributions
 wounds-0.255
 worse-0.250
vern-0.249
 bleed-0.247
 apology-0.244
Token time
Token ID640
Feature activation+1.07e-01

Pos logit contributions
 there+0.085
 set+0.080
 three+0.073
 upon+0.072
 saw+0.069
Neg logit contributions
vern-0.244
 apology-0.243
 wounds-0.242
 fault-0.241
othe-0.238
Token,
Token ID11
Feature activation+9.31e-02

Pos logit contributions
 set+0.116
ere+0.102
 colored+0.098
 spread+0.096
 explored+0.096
Neg logit contributions
othe-0.221
vern-0.220
vere-0.216
 fault-0.212
 sorry-0.210
Token there
Token ID612
Feature activation+1.19e-01

Pos logit contributions
ere+0.092
 explored+0.080
oline+0.079
al+0.075
 set+0.075
Neg logit contributions
othe-0.199
 sorry-0.199
 worse-0.193
 Or-0.186
ometer-0.181
Token was
Token ID373
Feature activation+8.17e-02

Pos logit contributions
ere+0.152
 three+0.144
 set+0.140
ka+0.138
 upon+0.138
Neg logit contributions
othe-0.195
 dent-0.187
 fault-0.185
ometer-0.184
 mistake-0.180
Token a
Token ID257
Feature activation+7.46e-02

Pos logit contributions
 arrived+0.073
r+0.072
ere+0.072
 upon+0.070
y+0.070
Neg logit contributions
 fault-0.165
vern-0.160
 wounds-0.160
 Or-0.159
 apology-0.158
Token little
Token ID1310
Feature activation+6.05e-02

Pos logit contributions
 there+0.060
 saw+0.056
 set+0.053
 three+0.050
 upon+0.050
Neg logit contributions
 wounds-0.191
rier-0.187
 apology-0.185
 fault-0.184
othe-0.183
Tokenler
Token ID1754
Feature activation-5.57e-02

Pos logit contributions
 wounds+0.088
rier+0.087
 apology+0.086
othe+0.084
ometer+0.083
Neg logit contributions
 there-0.064
 set-0.061
 saw-0.060
 upon-0.060
 found-0.058
Token in
Token ID287
Feature activation+3.90e-03

Pos logit contributions
rier+0.120
 wounds+0.118
 apology+0.116
othe+0.115
ometer+0.113
Neg logit contributions
 there-0.056
 set-0.053
 upon-0.052
 saw-0.052
 three-0.051
Token the
Token ID262
Feature activation-5.90e-02

Pos logit contributions
 upon+0.002
 saw+0.002
 set+0.002
 there+0.002
 found+0.002
Neg logit contributions
 wounds-0.010
 apology-0.010
othe-0.010
rier-0.010
 fault-0.010
Token forest
Token ID8222
Feature activation-1.88e-02

Pos logit contributions
 wounds+0.126
rier+0.125
 apology+0.125
 decisions+0.121
ometer+0.118
Neg logit contributions
 there-0.059
 upon-0.054
 saw-0.051
 set-0.051
 found-0.051
Token.
Token ID13
Feature activation-4.58e-02

Pos logit contributions
 wounds+0.044
rier+0.043
 apology+0.043
othe+0.042
ometer+0.042
Neg logit contributions
 there-0.018
 set-0.016
 saw-0.016
 upon-0.016
 found-0.015
Token<|endoftext|>
Token ID50256
Feature activation+1.32e-01

Pos logit contributions
 wounds+0.116
rier+0.114
 apology+0.108
vern+0.106
 bleed+0.105
Neg logit contributions
 there-0.041
 upon-0.036
 set-0.034
 saw-0.034
 three-0.033
TokenM
Token ID44
Feature activation+6.27e-02

Pos logit contributions
 set+0.177
 colored+0.175
ere+0.172
 planted+0.169
 three+0.165
Neg logit contributions
ometer-0.196
vere-0.193
othe-0.189
 fault-0.182
rier-0.180
Tokenum
Token ID388
Feature activation+8.35e-02

Pos logit contributions
 there+0.087
 set+0.082
 saw+0.081
 upon+0.080
 found+0.078
Neg logit contributions
 wounds-0.118
rier-0.115
 apology-0.115
othe-0.112
ometer-0.111
Token and
Token ID290
Feature activation+5.63e-02

Pos logit contributions
 outer+0.109
where+0.108
 three+0.107
ifying+0.105
 upon+0.102
Neg logit contributions
ache-0.112
ometer-0.106
oke-0.103
othe-0.102
 wounds-0.099
Token Dad
Token ID17415
Feature activation+6.54e-02

Pos logit contributions
r+0.071
ere+0.065
 upon+0.064
y+0.062
 set+0.061
Neg logit contributions
othe-0.096
 apology-0.095
 wounds-0.094
rier-0.092
 sorry-0.090
Token had
Token ID550
Feature activation+7.62e-02

Pos logit contributions
ere+0.059
 there+0.059
where+0.056
 upon+0.055
 three+0.054
Neg logit contributions
ometer-0.121
 apology-0.119
othe-0.119
 wounds-0.115
rier-0.113
Token every
Token ID790
Feature activation-1.49e-02

Pos logit contributions
 wounds+0.052
rier+0.051
 apology+0.050
othe+0.049
ometer+0.048
Neg logit contributions
 there-0.025
 set-0.023
 saw-0.022
 upon-0.022
 found-0.021
Token day
Token ID1110
Feature activation-2.21e-02

Pos logit contributions
 apology+0.023
othe+0.022
 decision+0.021
 worse+0.021
 lesson+0.021
Neg logit contributions
 upon-0.022
ere-0.021
een-0.019
r-0.019
 there-0.019
Token.
Token ID13
Feature activation-3.37e-02

Pos logit contributions
 wounds+0.051
rier+0.050
 apology+0.049
ometer+0.048
vern+0.048
Neg logit contributions
 there-0.023
 set-0.021
 saw-0.020
 upon-0.020
 found-0.020
Token<|endoftext|>
Token ID50256
Feature activation+1.40e-01

Pos logit contributions
rier+0.100
 wounds+0.096
 apology+0.089
vere+0.089
vern+0.087
Neg logit contributions
 there-0.022
 the-0.019
 saw-0.019
 a-0.019
 found-0.018
TokenOnce
Token ID7454
Feature activation+1.28e-01

Pos logit contributions
 set+0.210
ere+0.205
 three+0.200
 upon+0.198
y+0.195
Neg logit contributions
 wounds-0.199
othe-0.198
vere-0.194
vern-0.192
 apology-0.192
Token upon
Token ID2402
Feature activation+1.23e-01

Pos logit contributions
 set+0.059
 three+0.040
ere+0.039
 colored+0.036
 there+0.036
Neg logit contributions
othe-0.351
 fault-0.341
ometer-0.340
 apology-0.340
vern-0.337
Token a
Token ID257
Feature activation+1.10e-01

Pos logit contributions
ere+0.127
cy+0.099
y+0.095
 arrived+0.095
al+0.095
Neg logit contributions
 wounds-0.264
 worse-0.259
vern-0.257
 bleed-0.255
 Or-0.250
Token time
Token ID640
Feature activation+1.10e-01

Pos logit contributions
 there+0.092
 set+0.087
 upon+0.079
 three+0.078
 entered+0.075
Neg logit contributions
vern-0.250
 apology-0.247
 wounds-0.246
 fault-0.246
othe-0.242
Token,
Token ID11
Feature activation+9.85e-02

Pos logit contributions
 set+0.123
ere+0.108
 colored+0.106
 spread+0.100
 explored+0.100
Neg logit contributions
othe-0.226
vern-0.225
vere-0.221
 fault-0.216
 sorry-0.213
Token in
Token ID287
Feature activation+1.39e-01

Pos logit contributions
ere+0.101
 explored+0.091
oline+0.091
 set+0.088
en+0.083
Neg logit contributions
 sorry-0.210
othe-0.207
 worse-0.195
ometer-0.186
 Or-0.185
Token a
Token ID257
Feature activation+1.09e-01

Pos logit contributions
y+0.122
 upon+0.121
ere+0.113
 explored+0.109
 arrived+0.108
Neg logit contributions
 wounds-0.296
 apology-0.279
othe-0.276
rier-0.275
vern-0.274
Token
Token ID198
Feature activation+3.65e-02

Pos logit contributions
 set+0.049
 there+0.048
 saw+0.046
 three+0.045
 found+0.045
Neg logit contributions
othe-0.066
 wounds-0.066
 apology-0.065
ometer-0.064
vern-0.064
TokenWhen
Token ID2215
Feature activation+3.93e-02

Pos logit contributions
 upon+0.045
 there+0.044
 set+0.043
 saw+0.043
 found+0.043
Neg logit contributions
vere-0.061
vern-0.061
 wounds-0.061
othe-0.059
 Or-0.059
Token she
Token ID673
Feature activation+3.01e-02

Pos logit contributions
ere+0.073
r+0.071
 upon+0.070
y+0.070
 found+0.070
Neg logit contributions
othe-0.049
 wounds-0.045
rier-0.043
ometer-0.043
 apology-0.042
Token got
Token ID1392
Feature activation+3.78e-02

Pos logit contributions
 upon+0.033
 there+0.031
ere+0.031
 outdoors+0.029
 three+0.027
Neg logit contributions
othe-0.052
 apology-0.051
ometer-0.051
 wounds-0.049
 worse-0.049
Token back
Token ID736
Feature activation-1.54e-02

Pos logit contributions
ere+0.039
r+0.039
 set+0.037
y+0.037
 upon+0.036
Neg logit contributions
 wounds-0.076
rier-0.073
ometer-0.072
othe-0.072
 worse-0.071
Token home
Token ID1363
Feature activation+2.73e-02

Pos logit contributions
othe+0.031
 wounds+0.031
ometer+0.030
rier+0.029
 apology+0.029
Neg logit contributions
ere-0.014
r-0.014
 set-0.014
 upon-0.014
 saw-0.014
Token,
Token ID11
Feature activation+8.67e-03

Pos logit contributions
 there+0.013
 set+0.010
 upon+0.010
 saw+0.010
 found+0.009
Neg logit contributions
 wounds-0.078
rier-0.077
 apology-0.076
othe-0.074
ometer-0.074
Token she
Token ID673
Feature activation+1.84e-02

Pos logit contributions
ere+0.015
r+0.015
y+0.014
 set+0.014
 upon+0.014
Neg logit contributions
othe-0.012
 wounds-0.012
rier-0.012
ometer-0.011
 worse-0.011
Token put
Token ID1234
Feature activation+3.58e-02

Pos logit contributions
 there+0.015
 upon+0.014
ere+0.013
where+0.013
r+0.012
Neg logit contributions
 wounds-0.042
ometer-0.040
 apology-0.039
othe-0.039
rier-0.038
Token the
Token ID262
Feature activation-5.95e-02

Pos logit contributions
 upon+0.021
 there+0.019
 saw+0.018
 set+0.018
 arrived+0.018
Neg logit contributions
rier-0.091
 wounds-0.091
othe-0.086
 apology-0.086
ometer-0.085
Token meat
Token ID6174
Feature activation-5.05e-02

Pos logit contributions
 wounds+0.103
rier+0.102
 apology+0.097
ometer+0.095
othe+0.093
Neg logit contributions
 there-0.092
 set-0.086
 upon-0.086
 saw-0.085
 found-0.084
Token a
Token ID257
Feature activation+3.73e-02

Pos logit contributions
 set+0.026
ere+0.026
 there+0.026
 upon+0.026
y+0.026
Neg logit contributions
 wounds-0.043
rier-0.042
 fault-0.041
 apology-0.040
othe-0.040
Token big
Token ID1263
Feature activation+3.40e-02

Pos logit contributions
 there+0.043
 saw+0.041
 set+0.040
 three+0.039
 upon+0.038
Neg logit contributions
 wounds-0.083
rier-0.081
othe-0.080
 apology-0.079
vern-0.078
Token bowl
Token ID9396
Feature activation+3.23e-02

Pos logit contributions
 there+0.039
 set+0.036
 upon+0.036
 saw+0.035
 found+0.034
Neg logit contributions
 wounds-0.072
rier-0.071
 apology-0.070
othe-0.068
ometer-0.068
Token of
Token ID286
Feature activation+7.23e-02

Pos logit contributions
 there+0.031
 upon+0.031
 saw+0.030
ere+0.029
 set+0.029
Neg logit contributions
rier-0.070
 wounds-0.067
ometer-0.067
othe-0.067
 apology-0.066
Token rice
Token ID11464
Feature activation+3.10e-02

Pos logit contributions
 there+0.106
 upon+0.103
 saw+0.101
 set+0.101
 found+0.100
Neg logit contributions
 wounds-0.125
rier-0.122
 apology-0.121
othe-0.114
ometer-0.110
Token on
Token ID319
Feature activation+8.27e-02

Pos logit contributions
 there+0.032
 upon+0.032
 saw+0.031
ere+0.030
 three+0.030
Neg logit contributions
rier-0.066
 wounds-0.064
ometer-0.062
 apology-0.061
othe-0.061
Token the
Token ID262
Feature activation+1.19e-02

Pos logit contributions
 upon+0.055
 saw+0.053
 there+0.051
 found+0.049
 met+0.047
Neg logit contributions
rier-0.205
 wounds-0.203
 apology-0.197
othe-0.196
 decisions-0.189
Token table
Token ID3084
Feature activation+5.04e-02

Pos logit contributions
 there+0.019
 set+0.018
 upon+0.018
 saw+0.018
 found+0.017
Neg logit contributions
 wounds-0.020
rier-0.020
 apology-0.020
othe-0.019
ometer-0.019
Token.
Token ID13
Feature activation+4.69e-03

Pos logit contributions
 there+0.059
 upon+0.058
 saw+0.057
ere+0.056
 set+0.055
Neg logit contributions
rier-0.099
 wounds-0.096
ometer-0.092
 apology-0.092
 fault-0.092
Token Nancy
Token ID18496
Feature activation+5.07e-02

Pos logit contributions
 there+0.007
 set+0.006
 saw+0.006
 upon+0.006
 found+0.006
Neg logit contributions
 wounds-0.009
rier-0.008
 apology-0.008
othe-0.008
ometer-0.008
Token and
Token ID290
Feature activation+2.55e-02

Pos logit contributions
 there+0.055
where+0.051
 upon+0.051
ere+0.045
 three+0.045
Neg logit contributions
 wounds-0.099
ometer-0.096
othe-0.093
 apology-0.090
ache-0.089
Token.
Token ID13
Feature activation+3.10e-02

Pos logit contributions
 set+0.069
ere+0.069
 there+0.068
 three+0.068
 upon+0.067
Neg logit contributions
 wounds-0.082
othe-0.079
vern-0.079
 apology-0.077
ometer-0.076
Token Ben
Token ID3932
Feature activation+3.04e-02

Pos logit contributions
 there+0.046
 set+0.043
ere+0.042
 saw+0.042
 arrived+0.042
Neg logit contributions
othe-0.049
 wounds-0.049
 fault-0.049
ometer-0.049
 apology-0.049
Token loved
Token ID6151
Feature activation+6.34e-02

Pos logit contributions
 upon+0.029
 there+0.028
 three+0.028
ere+0.028
 set+0.027
Neg logit contributions
 wounds-0.062
ometer-0.060
othe-0.059
vern-0.058
 apology-0.058
Token to
Token ID284
Feature activation+3.61e-02

Pos logit contributions
 there+0.068
 upon+0.067
 saw+0.064
 set+0.063
 found+0.062
Neg logit contributions
 wounds-0.134
 apology-0.126
othe-0.125
ometer-0.124
 fault-0.123
Token hop
Token ID1725
Feature activation+4.73e-02

Pos logit contributions
 upon+0.049
 there+0.047
y+0.045
ere+0.043
cy+0.041
Neg logit contributions
 apology-0.068
 wounds-0.067
 bleed-0.067
othe-0.065
vern-0.063
Token around
Token ID1088
Feature activation+8.32e-02

Pos logit contributions
ere+0.052
 arrived+0.052
 three+0.050
 upon+0.049
 saw+0.049
Neg logit contributions
othe-0.089
 wounds-0.088
vere-0.086
 apology-0.085
vern-0.084
Token and
Token ID290
Feature activation-1.68e-03

Pos logit contributions
 arrived+0.094
 entered+0.084
ere+0.082
 found+0.078
 met+0.076
Neg logit contributions
 wounds-0.158
othe-0.150
rier-0.143
 apology-0.136
 mistake-0.136
Token explore
Token ID7301
Feature activation+7.25e-02

Pos logit contributions
 wounds+0.003
othe+0.003
 bleed+0.003
ometer+0.003
 cheek+0.003
Neg logit contributions
ere-0.002
 there-0.002
 upon-0.001
oline-0.001
y-0.001
Token the
Token ID262
Feature activation-1.54e-02

Pos logit contributions
 arrived+0.076
 set+0.074
 entered+0.070
 found+0.069
 upon+0.069
Neg logit contributions
 wounds-0.153
othe-0.141
rier-0.141
vere-0.138
 apology-0.137
Token world
Token ID995
Feature activation+1.24e-02

Pos logit contributions
 wounds+0.024
rier+0.023
 decisions+0.022
ometer+0.021
vere+0.021
Neg logit contributions
 there-0.024
 arrived-0.023
 found-0.023
 entered-0.022
 set-0.022
Token.
Token ID13
Feature activation+7.75e-03

Pos logit contributions
 there+0.011
 upon+0.011
 set+0.011
 saw+0.011
 three+0.010
Neg logit contributions
 wounds-0.028
rier-0.028
othe-0.027
 apology-0.027
ometer-0.026
Token time
Token ID640
Feature activation+1.14e-01

Pos logit contributions
 there+0.101
 set+0.092
 upon+0.084
 saw+0.080
 found+0.080
Neg logit contributions
vern-0.259
 apology-0.258
 wounds-0.257
 fault-0.256
othe-0.250
Token,
Token ID11
Feature activation+9.80e-02

Pos logit contributions
 set+0.127
ere+0.115
 colored+0.111
 spread+0.106
 planted+0.104
Neg logit contributions
othe-0.232
vern-0.232
vere-0.226
 fault-0.224
 Or-0.218
Token there
Token ID612
Feature activation+1.31e-01

Pos logit contributions
ere+0.109
oline+0.098
 explored+0.093
ted+0.087
 approached+0.087
Neg logit contributions
 sorry-0.194
 worse-0.186
rier-0.185
othe-0.180
 Or-0.178
Token was
Token ID373
Feature activation+9.10e-02

Pos logit contributions
ka+0.177
ere+0.175
ifying+0.172
 three+0.171
r+0.170
Neg logit contributions
othe-0.196
 dent-0.185
 fault-0.183
 mistake-0.182
ometer-0.177
Token a
Token ID257
Feature activation+8.17e-02

Pos logit contributions
r+0.078
ere+0.077
y+0.076
 upon+0.076
 arrived+0.075
Neg logit contributions
 wounds-0.181
 fault-0.181
vern-0.176
 Or-0.175
 sorry-0.175
Token little
Token ID1310
Feature activation+6.80e-02

Pos logit contributions
 there+0.055
 saw+0.049
 set+0.048
 upon+0.046
 three+0.044
Neg logit contributions
 wounds-0.216
rier-0.213
 apology-0.211
othe-0.208
 fault-0.206
Token girl
Token ID2576
Feature activation+5.57e-02

Pos logit contributions
 there+0.090
 set+0.084
 saw+0.083
 upon+0.082
 found+0.080
Neg logit contributions
 wounds-0.134
rier-0.131
 apology-0.130
othe-0.127
ometer-0.125
Token named
Token ID3706
Feature activation+9.75e-02

Pos logit contributions
 upon+0.060
 set+0.053
 there+0.052
ere+0.052
neath+0.051
Neg logit contributions
 wounds-0.115
othe-0.111
ometer-0.106
 apology-0.105
 mistake-0.102
Token Lily
Token ID20037
Feature activation+5.50e-02

Pos logit contributions
ere+0.163
where+0.149
 set+0.146
 explored+0.146
 colored+0.145
Neg logit contributions
 apology-0.129
othe-0.129
 wounds-0.129
 cheek-0.124
ometer-0.121
Token.
Token ID13
Feature activation+3.52e-02

Pos logit contributions
 there+0.070
 upon+0.067
 set+0.067
ere+0.065
 three+0.063
Neg logit contributions
 wounds-0.103
othe-0.098
 apology-0.097
ometer-0.096
vern-0.095
Token She
Token ID1375
Feature activation+4.69e-02

Pos logit contributions
ere+0.057
 set+0.056
 arrived+0.056
 saw+0.055
 there+0.055
Neg logit contributions
othe-0.050
 Or-0.047
ometer-0.047
 apology-0.046
rier-0.045
Token joy
Token ID8716
Feature activation+6.04e-03

Pos logit contributions
 upon+0.039
 there+0.039
 set+0.037
 saw+0.037
y+0.037
Neg logit contributions
 apology-0.026
 wounds-0.025
othe-0.024
rier-0.024
vern-0.024
Token.
Token ID13
Feature activation-2.92e-02

Pos logit contributions
 there+0.006
 set+0.006
 upon+0.006
 saw+0.006
 found+0.005
Neg logit contributions
rier-0.013
 wounds-0.013
othe-0.013
 apology-0.013
ometer-0.013
Token
Token ID220
Feature activation+2.78e-02

Pos logit contributions
 wounds+0.058
rier+0.057
 apology+0.057
othe+0.056
ometer+0.055
Neg logit contributions
 there-0.036
 set-0.034
 saw-0.034
 upon-0.033
 found-0.032
Token
Token ID198
Feature activation+1.88e-02

Pos logit contributions
 there+0.029
 upon+0.026
 set+0.026
 saw+0.026
 found+0.025
Neg logit contributions
 wounds-0.063
rier-0.062
 apology-0.061
othe-0.059
ometer-0.059
Token
Token ID198
Feature activation+1.75e-02

Pos logit contributions
 there+0.022
 set+0.022
 saw+0.021
 upon+0.020
 found+0.020
Neg logit contributions
 wounds-0.038
 apology-0.037
othe-0.037
rier-0.037
vern-0.036
TokenMaria
Token ID46827
Feature activation+3.11e-02

Pos logit contributions
 there+0.026
 set+0.025
 saw+0.024
 upon+0.024
 found+0.024
Neg logit contributions
 wounds-0.029
 apology-0.029
othe-0.028
vern-0.028
vere-0.028
Token c
Token ID269
Feature activation-7.17e-03

Pos logit contributions
 there+0.038
 upon+0.036
ere+0.032
 three+0.031
 set+0.031
Neg logit contributions
 wounds-0.060
 apology-0.057
othe-0.056
rier-0.055
ometer-0.055
Tokenuddled
Token ID32745
Feature activation+1.14e-02

Pos logit contributions
 wounds+0.009
rier+0.009
 apology+0.009
othe+0.008
ometer+0.008
Neg logit contributions
 there-0.014
 set-0.014
 upon-0.014
 saw-0.014
ere-0.013
Token the
Token ID262
Feature activation-5.25e-02

Pos logit contributions
 there+0.010
 set+0.009
 saw+0.008
 upon+0.008
 found+0.008
Neg logit contributions
 wounds-0.028
rier-0.028
 apology-0.027
ometer-0.027
vern-0.027
Token puppy
Token ID26188
Feature activation-9.16e-03

Pos logit contributions
 wounds+0.096
rier+0.094
 apology+0.093
othe+0.089
ometer+0.089
Neg logit contributions
 there-0.075
 upon-0.071
 set-0.071
 saw-0.070
 found-0.068
Token all
Token ID477
Feature activation-4.80e-03

Pos logit contributions
 wounds+0.022
rier+0.022
 apology+0.021
othe+0.021
vern+0.021
Neg logit contributions
 there-0.008
 set-0.008
 saw-0.007
 upon-0.007
 found-0.007
Token past
Token ID1613
Feature activation+6.77e-03

Pos logit contributions
 wounds+0.117
rier+0.113
 apology+0.112
ometer+0.105
othe+0.105
Neg logit contributions
 there-0.108
 upon-0.102
 set-0.101
 saw-0.101
 found-0.100
Tokenries
Token ID1678
Feature activation-1.03e-02

Pos logit contributions
 upon+0.012
 found+0.011
 saw+0.011
 set+0.011
 there+0.011
Neg logit contributions
 wounds-0.009
rier-0.009
 apology-0.008
othe-0.008
ometer-0.008
Token.
Token ID13
Feature activation-2.94e-02

Pos logit contributions
 wounds+0.025
 apology+0.025
rier+0.025
vern+0.024
vere+0.024
Neg logit contributions
 there-0.010
 set-0.009
 in-0.008
 saw-0.008
 upon-0.008
Token Lily
Token ID20037
Feature activation-1.58e-03

Pos logit contributions
rier+0.066
 wounds+0.065
 apology+0.062
vern+0.062
vere+0.061
Neg logit contributions
 there-0.035
 saw-0.031
 upon-0.031
 set-0.030
 found-0.030
Token learns
Token ID22974
Feature activation-8.71e-03

Pos logit contributions
rier+0.003
 wounds+0.003
 apology+0.003
othe+0.003
ometer+0.003
Neg logit contributions
 there-0.002
 set-0.002
 saw-0.002
 upon-0.002
 found-0.002
Token to
Token ID284
Feature activation-1.59e-02

Pos logit contributions
rier+0.024
ometer+0.023
 wounds+0.023
 dent+0.023
vere+0.022
Neg logit contributions
 there-0.006
 a-0.006
 in-0.005
 the-0.005
 all-0.005
Token be
Token ID307
Feature activation-6.58e-02

Pos logit contributions
 wounds+0.031
rier+0.030
 apology+0.030
othe+0.029
ometer+0.029
Neg logit contributions
 there-0.021
 set-0.020
 upon-0.020
 saw-0.020
 found-0.019
Token more
Token ID517
Feature activation-7.64e-02

Pos logit contributions
 wounds+0.106
rier+0.103
 apology+0.101
othe+0.098
ometer+0.097
Neg logit contributions
 there-0.111
 set-0.106
 upon-0.105
 saw-0.104
 found-0.102
Token careful
Token ID8161
Feature activation-4.91e-02

Pos logit contributions
rier+0.125
 wounds+0.122
 apology+0.120
ometer+0.120
othe+0.116
Neg logit contributions
 there-0.136
 in-0.123
 saw-0.123
 set-0.121
 found-0.120
Token and
Token ID290
Feature activation-7.56e-02

Pos logit contributions
 wounds+0.124
rier+0.123
 apology+0.122
ometer+0.120
vern+0.120
Neg logit contributions
 there-0.041
 in-0.034
 set-0.033
 upon-0.032
 saw-0.030
Token patient
Token ID5827
Feature activation-3.24e-02

Pos logit contributions
rier+0.110
 wounds+0.108
 apology+0.102
ometer+0.102
vern+0.102
Neg logit contributions
 there-0.141
 saw-0.135
 set-0.135
 found-0.132
 upon-0.131
Token on
Token ID319
Feature activation+2.35e-02

Pos logit contributions
 wounds+0.141
 apology+0.138
rier+0.137
vern+0.136
 fault+0.133
Neg logit contributions
 there-0.047
 set-0.043
 saw-0.038
 upon-0.038
 in-0.037
Token.
Token ID13
Feature activation-4.42e-02

Pos logit contributions
 there+0.023
 set+0.021
 saw+0.020
 upon+0.020
 three+0.019
Neg logit contributions
 wounds-0.055
rier-0.054
 apology-0.053
vern-0.052
othe-0.052
Token The
Token ID383
Feature activation-4.54e-02

Pos logit contributions
 wounds+0.115
rier+0.113
vern+0.110
 apology+0.109
vere+0.107
Neg logit contributions
 there-0.041
 upon-0.033
 a-0.033
 saw-0.033
 Inside-0.033
Token mixer
Token ID33938
Feature activation-1.80e-02

Pos logit contributions
othe+0.088
vern+0.086
 wounds+0.086
rier+0.084
ometer+0.083
Neg logit contributions
 there-0.063
 saw-0.061
 three-0.061
 set-0.061
 two-0.058
Token made
Token ID925
Feature activation+2.97e-02

Pos logit contributions
 wounds+0.045
rier+0.045
 apology+0.045
vern+0.044
othe+0.043
Neg logit contributions
 there-0.014
 set-0.014
 saw-0.013
 found-0.012
 upon-0.012
Token a
Token ID257
Feature activation-4.04e-02

Pos logit contributions
 there+0.012
 set+0.010
 saw+0.009
 a+0.009
 three+0.008
Neg logit contributions
 wounds-0.086
rier-0.086
vern-0.085
 apology-0.085
vere-0.084
Token loud
Token ID7812
Feature activation-2.10e-02

Pos logit contributions
 fault+0.091
vern+0.088
uts+0.085
ometers+0.083
 wounds+0.083
Neg logit contributions
 saw-0.049
 three-0.048
 set-0.047
 me-0.044
 a-0.044
Token noise
Token ID7838
Feature activation-2.28e-02

Pos logit contributions
 fault+0.035
vern+0.034
 wounds+0.033
ometer+0.031
rier+0.031
Neg logit contributions
 set-0.038
 there-0.036
 a-0.036
 in-0.036
 the-0.036
Token and
Token ID290
Feature activation-4.60e-02

Pos logit contributions
 wounds+0.061
 apology+0.061
vern+0.060
rier+0.060
ometer+0.060
Neg logit contributions
 there-0.017
 set-0.016
 in-0.015
 saw-0.015
 a-0.013
Token spun
Token ID26843
Feature activation+3.69e-02

Pos logit contributions
 wounds+0.105
rier+0.104
 apology+0.102
othe+0.100
ometer+0.099
Neg logit contributions
 there-0.045
 set-0.042
 upon-0.041
 saw-0.041
 found-0.039
Token very
Token ID845
Feature activation-8.39e-02

Pos logit contributions
 there+0.031
 set+0.029
 upon+0.028
 saw+0.028
 found+0.027
Neg logit contributions
 wounds-0.089
rier-0.088
 apology-0.087
othe-0.085
ometer-0.085
Token hat
Token ID6877
Feature activation-8.73e-03

Pos logit contributions
 wounds+0.019
rier+0.019
 apology+0.019
othe+0.018
ometer+0.018
Neg logit contributions
 there-0.012
 set-0.011
 saw-0.011
 upon-0.011
 found-0.010
Token.
Token ID13
Feature activation-3.47e-02

Pos logit contributions
rier+0.019
 wounds+0.018
othe+0.018
ometer+0.018
 apology+0.018
Neg logit contributions
 there-0.009
 upon-0.009
 saw-0.008
ere-0.008
 set-0.008
Token
Token ID198
Feature activation+1.77e-02

Pos logit contributions
 wounds+0.076
rier+0.075
 apology+0.075
othe+0.073
ometer+0.073
Neg logit contributions
 there-0.036
 set-0.034
 upon-0.033
 saw-0.033
 found-0.031
Token
Token ID198
Feature activation+1.52e-02

Pos logit contributions
 there+0.021
 set+0.021
 upon+0.020
 three+0.020
 saw+0.020
Neg logit contributions
 wounds-0.034
 apology-0.034
othe-0.033
vern-0.033
rier-0.033
TokenThe
Token ID464
Feature activation-3.74e-02

Pos logit contributions
 there+0.018
 set+0.017
 upon+0.017
 three+0.017
 saw+0.016
Neg logit contributions
 wounds-0.028
vern-0.028
 apology-0.028
rier-0.028
othe-0.028
Token small
Token ID1402
Feature activation-1.61e-02

Pos logit contributions
 wounds+0.042
rier+0.042
 apology+0.041
vern+0.036
vere+0.036
Neg logit contributions
 there-0.078
 upon-0.077
y-0.076
 set-0.075
ere-0.075
Token man
Token ID582
Feature activation-4.49e-03

Pos logit contributions
rier+0.018
 wounds+0.017
 apology+0.016
ometer+0.015
 decisions+0.015
Neg logit contributions
 upon-0.033
 there-0.033
 set-0.032
ere-0.032
 found-0.031
Token said
Token ID531
Feature activation-5.37e-03

Pos logit contributions
 wounds+0.010
rier+0.009
 apology+0.009
othe+0.009
ometer+0.009
Neg logit contributions
 there-0.004
 upon-0.004
 set-0.004
ere-0.004
 saw-0.004
Token,
Token ID11
Feature activation-3.31e-02

Pos logit contributions
rier+0.014
 wounds+0.014
 apology+0.014
othe+0.013
ometer+0.013
Neg logit contributions
 there-0.003
 set-0.003
 upon-0.003
 saw-0.003
 found-0.003
Token "
Token ID366
Feature activation-2.07e-02

Pos logit contributions
 wounds+0.072
 fault+0.068
 apology+0.067
vern+0.067
rier+0.067
Neg logit contributions
 there-0.044
 three-0.039
 set-0.039
 saw-0.039
 upon-0.038
TokenHi
Token ID17250
Feature activation+1.11e-02

Pos logit contributions
rier+0.038
 wounds+0.038
 apology+0.037
othe+0.036
ometer+0.036
Neg logit contributions
 there-0.027
 upon-0.025
 saw-0.025
 set-0.025
 found-0.024
Token
Token ID198
Feature activation+6.14e-03

Pos logit contributions
 there+0.006
 upon+0.005
 saw+0.005
 set+0.005
 found+0.005
Neg logit contributions
rier-0.017
 wounds-0.017
 fault-0.017
ometer-0.016
 apology-0.016
TokenMiss
Token ID17140
Feature activation+1.20e-02

Pos logit contributions
 there+0.007
 saw+0.007
 set+0.006
 upon+0.006
 found+0.006
Neg logit contributions
 fault-0.014
rier-0.014
ometer-0.013
 apology-0.013
 wounds-0.013
Token Lee
Token ID5741
Feature activation+2.26e-02

Pos logit contributions
 set+0.025
 there+0.025
 found+0.024
 saw+0.024
where+0.024
Neg logit contributions
rier-0.012
ometer-0.012
 apology-0.012
 fault-0.011
 wounds-0.011
Token says
Token ID1139
Feature activation-8.58e-03

Pos logit contributions
 there+0.035
 set+0.034
 saw+0.034
 upon+0.033
 found+0.032
Neg logit contributions
 wounds-0.039
rier-0.038
 apology-0.037
othe-0.036
ometer-0.036
Token,
Token ID11
Feature activation-2.55e-02

Pos logit contributions
 wounds+0.025
rier+0.024
 apology+0.024
othe+0.023
ometer+0.023
Neg logit contributions
 there-0.004
 set-0.003
 saw-0.003
 upon-0.003
 found-0.003
Token "
Token ID366
Feature activation-1.47e-02

Pos logit contributions
 wounds+0.049
rier+0.047
 apology+0.047
 fault+0.045
vern+0.045
Neg logit contributions
 there-0.036
 set-0.034
 saw-0.034
 upon-0.033
 three-0.032
TokenYou
Token ID1639
Feature activation-2.81e-02

Pos logit contributions
 wounds+0.032
rier+0.031
 apology+0.031
 fault+0.031
othe+0.030
Neg logit contributions
 there-0.018
 set-0.017
 saw-0.016
 three-0.016
 upon-0.016
Token're
Token ID821
Feature activation-4.93e-02

Pos logit contributions
 apology+0.054
vern+0.052
 wounds+0.052
vere+0.051
 fault+0.051
Neg logit contributions
 upon-0.033
 there-0.033
y-0.032
ere-0.031
 set-0.029
Token welcome
Token ID7062
Feature activation-2.66e-02

Pos logit contributions
 wounds+0.107
rier+0.105
 apology+0.103
othe+0.101
ometer+0.101
Neg logit contributions
 there-0.057
 set-0.053
 upon-0.052
 saw-0.052
 found-0.050
Token,
Token ID11
Feature activation-2.42e-02

Pos logit contributions
rier+0.065
 wounds+0.065
 apology+0.063
vere+0.062
vern+0.061
Neg logit contributions
 set-0.019
 there-0.018
 upon-0.018
 saw-0.018
 found-0.017
Token Tom
Token ID4186
Feature activation+8.70e-04

Pos logit contributions
 wounds+0.052
rier+0.050
ometer+0.050
othe+0.049
 apology+0.049
Neg logit contributions
 there-0.029
 set-0.027
 upon-0.026
 saw-0.026
 three-0.025
Token on
Token ID319
Feature activation+6.68e-02

Pos logit contributions
ere+0.007
 upon+0.007
 there+0.007
 three+0.006
 saw+0.006
Neg logit contributions
 wounds-0.014
rier-0.013
othe-0.013
 apology-0.013
ometer-0.013
Token the
Token ID262
Feature activation-2.60e-02

Pos logit contributions
 upon+0.041
 there+0.038
 saw+0.037
 found+0.035
 set+0.034
Neg logit contributions
 wounds-0.171
rier-0.169
 apology-0.167
othe-0.164
 dent-0.160
Token ground
Token ID2323
Feature activation+3.33e-03

Pos logit contributions
othe+0.049
 wounds+0.049
rier+0.049
 apology+0.049
vern+0.049
Neg logit contributions
 there-0.034
 set-0.034
 saw-0.033
 three-0.032
 upon-0.031
Token.
Token ID13
Feature activation-2.06e-02

Pos logit contributions
 there+0.004
 upon+0.004
ere+0.004
 set+0.004
 saw+0.004
Neg logit contributions
 wounds-0.007
rier-0.007
 apology-0.006
othe-0.006
ometer-0.006
Token The
Token ID383
Feature activation-3.05e-02

Pos logit contributions
 wounds+0.048
vern+0.047
rier+0.046
vere+0.046
 apology+0.045
Neg logit contributions
 there-0.022
 set-0.020
 saw-0.020
 upon-0.019
 three-0.019
Token rob
Token ID3857
Feature activation-2.29e-03

Pos logit contributions
 wounds+0.060
rier+0.058
othe+0.057
 apology+0.057
vern+0.057
Neg logit contributions
 there-0.041
 set-0.039
 saw-0.039
 upon-0.038
 three-0.037
Tokenin
Token ID259
Feature activation-2.90e-02

Pos logit contributions
 wounds+0.004
rier+0.004
 apology+0.004
othe+0.004
ometer+0.004
Neg logit contributions
 set-0.003
 there-0.003
 upon-0.003
 three-0.003
 saw-0.003
Token sees
Token ID7224
Feature activation+3.40e-02

Pos logit contributions
 wounds+0.055
 apology+0.053
rier+0.052
ometer+0.052
othe+0.051
Neg logit contributions
 there-0.039
 upon-0.037
 set-0.036
 three-0.035
 saw-0.034
Token it
Token ID340
Feature activation-3.33e-02

Pos logit contributions
 upon+0.035
 set+0.033
 there+0.032
ere+0.031
 saw+0.031
Neg logit contributions
 wounds-0.072
rier-0.072
 apology-0.071
othe-0.069
 fault-0.067
Token too
Token ID1165
Feature activation-6.08e-02

Pos logit contributions
 wounds+0.072
rier+0.071
 apology+0.070
othe+0.069
ometer+0.067
Neg logit contributions
 there-0.034
 upon-0.032
 set-0.032
ere-0.032
 saw-0.031
Token.
Token ID13
Feature activation-1.77e-02

Pos logit contributions
 wounds+0.149
rier+0.145
 apology+0.144
ometer+0.139
othe+0.137
Neg logit contributions
 there-0.059
 saw-0.051
 upon-0.051
 set-0.050
 found-0.048
Token The
Token ID383
Feature activation-2.69e-02

Pos logit contributions
 wounds+0.062
rier+0.060
 apology+0.059
vern+0.057
othe+0.057
Neg logit contributions
 there-0.041
 upon-0.038
 set-0.037
 saw-0.037
 found-0.036
Token bug
Token ID5434
Feature activation-6.30e-03

Pos logit contributions
 apology+0.049
 wounds+0.048
rier+0.047
ometer+0.045
othe+0.044
Neg logit contributions
 there-0.037
 set-0.036
ere-0.035
 upon-0.035
 saw-0.034
Token was
Token ID373
Feature activation-2.99e-02

Pos logit contributions
 wounds+0.013
othe+0.013
 apology+0.012
ometer+0.012
rier+0.012
Neg logit contributions
 upon-0.006
 there-0.006
ere-0.006
 set-0.006
 three-0.006
Token pin
Token ID6757
Feature activation-8.17e-03

Pos logit contributions
 wounds+0.052
rier+0.051
othe+0.049
 apology+0.048
 fault+0.048
Neg logit contributions
 upon-0.042
 there-0.041
 set-0.041
ere-0.040
 saw-0.039
Tokenching
Token ID10813
Feature activation+1.66e-02

Pos logit contributions
rier+0.010
 wounds+0.010
vern+0.009
othe+0.009
vere+0.009
Neg logit contributions
 there-0.016
 set-0.016
 upon-0.015
 saw-0.015
 three-0.015
Token his
Token ID465
Feature activation-6.20e-02

Pos logit contributions
 there+0.014
 set+0.013
 saw+0.012
 upon+0.012
 found+0.012
Neg logit contributions
 wounds-0.041
rier-0.040
 apology-0.040
vern-0.039
ometer-0.039
Token tail
Token ID7894
Feature activation-1.62e-02

Pos logit contributions
rier+0.120
 wounds+0.119
othe+0.116
 apology+0.115
vern+0.114
Neg logit contributions
 there-0.081
 set-0.080
 saw-0.077
 upon-0.076
 three-0.076
Token!
Token ID0
Feature activation-2.84e-02

Pos logit contributions
rier+0.035
 wounds+0.034
othe+0.033
 apology+0.033
ometer+0.033
Neg logit contributions
 there-0.017
 upon-0.016
 saw-0.015
 set-0.015
ere-0.015
Token
Token ID198
Feature activation+2.10e-02

Pos logit contributions
 wounds+0.064
rier+0.062
 apology+0.061
vern+0.060
 fault+0.058
Neg logit contributions
 there-0.033
 upon-0.030
 saw-0.029
 set-0.028
 three-0.028
Token
Token ID198
Feature activation+1.89e-02

Pos logit contributions
 there+0.023
 set+0.021
 upon+0.021
 saw+0.021
 found+0.020
Neg logit contributions
 wounds-0.046
rier-0.045
 apology-0.045
othe-0.044
ometer-0.043
TokenSam
Token ID16305
Feature activation+2.92e-02

Pos logit contributions
 there+0.026
 set+0.025
 upon+0.025
 saw+0.024
 found+0.024
Neg logit contributions
 wounds-0.035
rier-0.034
 apology-0.034
vern-0.033
othe-0.033
Token.
Token ID13
Feature activation-4.20e-02

Pos logit contributions
 wounds+0.006
rier+0.006
 apology+0.005
ometer+0.005
othe+0.005
Neg logit contributions
 there-0.003
 saw-0.003
 set-0.003
 upon-0.003
 three-0.002
Token She
Token ID1375
Feature activation+8.45e-03

Pos logit contributions
 wounds+0.087
rier+0.085
 apology+0.082
vern+0.080
othe+0.080
Neg logit contributions
 there-0.053
 set-0.049
 upon-0.049
 saw-0.049
 found-0.047
Token learned
Token ID4499
Feature activation+6.53e-05

Pos logit contributions
 there+0.010
 upon+0.009
ere+0.008
 three+0.008
y+0.008
Neg logit contributions
 wounds-0.018
 apology-0.017
othe-0.016
rier-0.016
ometer-0.016
Token that
Token ID326
Feature activation-1.32e-02

Pos logit contributions
 there+0.000
 set+0.000
 upon+0.000
 saw+0.000
 found+0.000
Neg logit contributions
 wounds-0.000
rier-0.000
 apology-0.000
ometer-0.000
othe-0.000
Token it
Token ID340
Feature activation-5.38e-02

Pos logit contributions
rier+0.032
 wounds+0.030
ometer+0.030
vern+0.029
 apology+0.029
Neg logit contributions
 there-0.016
 set-0.013
 a-0.012
 in-0.012
 the-0.012
Token's
Token ID338
Feature activation-6.09e-02

Pos logit contributions
 wounds+0.122
rier+0.121
 apology+0.118
othe+0.116
vern+0.115
Neg logit contributions
 there-0.057
 set-0.052
 saw-0.052
 upon-0.051
 found-0.050
Token important
Token ID1593
Feature activation-3.01e-02

Pos logit contributions
 wounds+0.073
rier+0.069
 apology+0.069
othe+0.065
vern+0.063
Neg logit contributions
 there-0.125
 set-0.122
 saw-0.122
 upon-0.122
 found-0.118
Token to
Token ID284
Feature activation-1.24e-02

Pos logit contributions
rier+0.078
 wounds+0.077
 apology+0.073
vere+0.073
othe+0.073
Neg logit contributions
 there-0.024
 set-0.021
 saw-0.021
 found-0.020
 upon-0.020
Token be
Token ID307
Feature activation-5.79e-02

Pos logit contributions
 wounds+0.024
 apology+0.024
rier+0.023
othe+0.023
vern+0.022
Neg logit contributions
 there-0.017
 upon-0.016
 saw-0.015
 set-0.015
 found-0.015
Token honest
Token ID5508
Feature activation-2.35e-02

Pos logit contributions
 wounds+0.091
rier+0.088
 apology+0.086
othe+0.083
ometer+0.082
Neg logit contributions
 there-0.099
 set-0.095
 upon-0.095
 saw-0.093
 found-0.091
Token and
Token ID290
Feature activation-7.32e-02

Pos logit contributions
 wounds+0.049
rier+0.048
 apology+0.047
 fault+0.046
ometer+0.046
Neg logit contributions
 upon-0.025
 set-0.025
 there-0.025
 saw-0.024
ere-0.024
Token.
Token ID13
Feature activation-5.85e-02

Pos logit contributions
 wounds+0.090
rier+0.089
 apology+0.087
ometer+0.086
vern+0.086
Neg logit contributions
 there-0.037
 set-0.033
 saw-0.032
 upon-0.032
 found-0.031
Token And
Token ID843
Feature activation-5.85e-02

Pos logit contributions
rier+0.156
 wounds+0.145
vern+0.141
vere+0.139
 bleed+0.134
Neg logit contributions
 saw-0.058
 there-0.057
 Inside-0.056
 a-0.054
 found-0.053
Token that
Token ID326
Feature activation+7.15e-03

Pos logit contributions
rier+0.147
 wounds+0.139
vern+0.138
vere+0.135
 apology+0.133
Neg logit contributions
 there-0.073
 the-0.062
 found-0.061
 in-0.060
 a-0.060
Token's
Token ID338
Feature activation-5.88e-02

Pos logit contributions
 there+0.008
 set+0.007
 upon+0.007
 saw+0.007
 three+0.007
Neg logit contributions
 wounds-0.015
 apology-0.015
rier-0.015
othe-0.015
ometer-0.014
Token the
Token ID262
Feature activation-7.46e-02

Pos logit contributions
 wounds+0.124
rier+0.121
 apology+0.120
othe+0.118
ometer+0.116
Neg logit contributions
 there-0.068
 set-0.065
 saw-0.064
 upon-0.064
 found-0.061
Token sad
Token ID6507
Feature activation-8.00e-02

Pos logit contributions
rier+0.145
othe+0.145
 wounds+0.144
ometer+0.142
vere+0.133
Neg logit contributions
 there-0.106
 upon-0.102
 set-0.102
 saw-0.101
 three-0.099
Token ending
Token ID7464
Feature activation-2.05e-02

Pos logit contributions
 wounds+0.160
rier+0.159
vere+0.155
vern+0.154
othe+0.152
Neg logit contributions
 there-0.108
 set-0.103
 upon-0.102
 saw-0.102
 found-0.099
Token of
Token ID286
Feature activation+1.44e-02

Pos logit contributions
 wounds+0.053
rier+0.052
 apology+0.050
 dent+0.048
 bleed+0.048
Neg logit contributions
 there-0.020
 in-0.018
 set-0.017
 saw-0.017
 found-0.017
Token Tim
Token ID5045
Feature activation-3.91e-02

Pos logit contributions
 there+0.016
 set+0.014
 upon+0.014
 saw+0.014
 found+0.014
Neg logit contributions
 wounds-0.032
rier-0.031
 apology-0.031
othe-0.030
ometer-0.030
Tokenmy
Token ID1820
Feature activation-4.27e-02

Pos logit contributions
 wounds+0.101
rier+0.100
 apology+0.098
othe+0.097
ometer+0.096
Neg logit contributions
 there-0.028
 set-0.025
 upon-0.025
 saw-0.024
 found-0.023
Token and
Token ID290
Feature activation-3.59e-02

Pos logit contributions
rier+0.104
 wounds+0.103
 apology+0.099
vern+0.098
ometer+0.098
Neg logit contributions
 there-0.041
 saw-0.038
 set-0.038
 found-0.037
 in-0.035
Token is
Token ID318
Feature activation-3.50e-02

Pos logit contributions
 there+0.027
 set+0.024
 upon+0.024
 saw+0.024
 found+0.022
Neg logit contributions
 wounds-0.066
rier-0.065
 apology-0.063
othe-0.062
vern-0.062
Token that
Token ID326
Feature activation+3.97e-04

Pos logit contributions
 wounds+0.090
rier+0.090
 dent+0.085
 bleed+0.084
 apology+0.084
Neg logit contributions
 there-0.031
 in-0.025
 set-0.025
 saw-0.025
 three-0.025
Token it
Token ID340
Feature activation-4.76e-02

Pos logit contributions
 there+0.000
 in+0.000
 set+0.000
 the+0.000
 a+0.000
Neg logit contributions
rier-0.001
 wounds-0.001
ometer-0.001
vern-0.001
 apology-0.001
Token's
Token ID338
Feature activation-5.70e-02

Pos logit contributions
 wounds+0.101
rier+0.100
 apology+0.098
othe+0.096
ometer+0.095
Neg logit contributions
 there-0.055
 set-0.051
 upon-0.050
 saw-0.050
 found-0.049
Token always
Token ID1464
Feature activation-4.08e-02

Pos logit contributions
 wounds+0.063
rier+0.061
 apology+0.059
othe+0.056
ometer+0.055
Neg logit contributions
 there-0.124
 set-0.120
 upon-0.119
 saw-0.119
 found-0.116
Token better
Token ID1365
Feature activation-6.14e-02

Pos logit contributions
 wounds+0.052
rier+0.051
 apology+0.049
othe+0.047
ometer+0.047
Neg logit contributions
 there-0.083
 set-0.079
 saw-0.078
 upon-0.078
 found-0.077
Token to
Token ID284
Feature activation-9.99e-03

Pos logit contributions
 wounds+0.140
rier+0.136
 apology+0.135
othe+0.133
ometer+0.133
Neg logit contributions
 there-0.059
 set-0.057
 upon-0.055
 saw-0.054
 three-0.052
Token ask
Token ID1265
Feature activation-4.06e-03

Pos logit contributions
 apology+0.018
 wounds+0.018
othe+0.017
rier+0.017
vern+0.017
Neg logit contributions
 there-0.014
 upon-0.014
 saw-0.013
y-0.013
ere-0.012
Token for
Token ID329
Feature activation+1.62e-02

Pos logit contributions
 wounds+0.008
rier+0.008
 apology+0.008
ometer+0.008
othe+0.008
Neg logit contributions
 there-0.005
 set-0.005
 upon-0.005
 saw-0.005
 found-0.004
Token help
Token ID1037
Feature activation+3.32e-03

Pos logit contributions
 there+0.022
 set+0.020
 saw+0.020
 found+0.019
 three+0.019
Neg logit contributions
rier-0.032
ometer-0.031
 wounds-0.031
 apology-0.030
othe-0.030
Token than
Token ID621
Feature activation-4.16e-02

Pos logit contributions
 there+0.003
 upon+0.003
 saw+0.003
 set+0.003
ere+0.003
Neg logit contributions
 wounds-0.008
rier-0.007
 apology-0.007
othe-0.007
 decisions-0.007
Token that
Token ID326
Feature activation-9.57e-03

Pos logit contributions
 upon+0.015
 there+0.015
 saw+0.015
 set+0.015
ere+0.014
Neg logit contributions
 wounds-0.028
 apology-0.027
rier-0.027
 fault-0.026
othe-0.026
Token you
Token ID345
Feature activation-4.51e-02

Pos logit contributions
 wounds+0.020
rier+0.019
 apology+0.019
ometer+0.018
othe+0.018
Neg logit contributions
 there-0.011
 set-0.011
 upon-0.011
 saw-0.011
 found-0.010
Token're
Token ID821
Feature activation-4.64e-02

Pos logit contributions
 apology+0.079
 bleed+0.079
 fault+0.078
vere+0.077
vern+0.077
Neg logit contributions
ere-0.054
 there-0.053
 three-0.047
where-0.047
 upon-0.047
Token so
Token ID523
Feature activation-7.73e-02

Pos logit contributions
 wounds+0.088
rier+0.087
 apology+0.084
othe+0.083
 fault+0.082
Neg logit contributions
 there-0.058
 set-0.057
 upon-0.057
 saw-0.057
 found-0.055
Token happy
Token ID3772
Feature activation-2.22e-02

Pos logit contributions
 wounds+0.143
rier+0.141
 apology+0.138
othe+0.135
ometer+0.134
Neg logit contributions
 there-0.109
 set-0.104
 upon-0.102
 saw-0.102
 found-0.099
Token this
Token ID428
Feature activation-7.12e-02

Pos logit contributions
rier+0.042
 wounds+0.041
 fault+0.040
 decisions+0.040
ometer+0.039
Neg logit contributions
ere-0.026
 arrived-0.026
 set-0.025
neath-0.025
 saw-0.025
Token morning
Token ID3329
Feature activation-4.51e-02

Pos logit contributions
rier+0.141
 wounds+0.139
 apology+0.139
ometer+0.135
 fault+0.134
Neg logit contributions
 there-0.082
 upon-0.081
 saw-0.078
ere-0.076
 set-0.076
Token.
Token ID13
Feature activation-3.56e-02

Pos logit contributions
rier+0.093
 wounds+0.092
 apology+0.091
ometer+0.090
othe+0.089
Neg logit contributions
 there-0.047
 upon-0.046
 saw-0.045
 set-0.044
ere-0.044
Token Would
Token ID10928
Feature activation+6.54e-03

Pos logit contributions
 wounds+0.057
 apology+0.057
rier+0.056
ometer+0.055
othe+0.055
Neg logit contributions
 there-0.056
 upon-0.054
 set-0.053
 saw-0.053
 found-0.052
Token you
Token ID345
Feature activation-3.43e-02

Pos logit contributions
 there+0.010
 upon+0.010
 set+0.010
 saw+0.009
 found+0.009
Neg logit contributions
 wounds-0.011
 apology-0.011
vere-0.011
othe-0.011
rier-0.011
Token like
Token ID588
Feature activation+1.03e-02

Pos logit contributions
 apology+0.054
vern+0.053
vere+0.050
 fault+0.049
 smarter+0.048
Neg logit contributions
ere-0.052
und-0.049
neath-0.049
 there-0.045
where-0.043