TOP ACTIVATIONS
MAX = 9.956

named Tim found a big chest in his room . It
. Tim my had a chest that he loved very much
. She had a big chest in her room where she
. Tom had a big chest in his room . The
because she had a huge chest full of toys . Every
and had a special pink journal that she wrote in every
Lily . She had a journal that she loved to write
little rabbit named Tim found a shiny trophy in the forest
. He has a red journal with many pages . He
: he had a bright diary . He would hold it
One day , Tom saw a beautiful sunrise . The sky
, Lily saw a big chest . She asked her friends
a girl named Sue found a broken scarf . She wanted
little girl named Lucy found a big box in her room
swimming , Fin found a journal . The journal had many
boy named Tim found a puzzle in his toy box .
a brown dog found a puzzle . The puzzle was big
green forest , there lived a little leaf . The little
little girl named Lucy found a big book called a dictionary
man who had a big chest . He loved his chest

MIN ACTIVATIONS
MIN = -12.251

treasure was so special that Ab bie decide to keep it
S ully was glad that J ody was safe . He
From that day on , Ab bie never forgot the special
the gate keeper agreed that Ab bie had worked hard and
just watched with admiration as T essa sw am down the
. And that is how L iza always started the day
, when she remembered what Grand ma mouse had said .
Maggie nodded and remembered what Grand ma Mouse said .
. They all listened and Ab bie felt so proud and
They wonder what other things Grand pa used to wear and
had so much fun . K ait lyn was so happy
the day , Bella kissed Aless io on the cheek and
is gone . What if Grand ma needs it ?" Anna
longer sad . Even though T una missed his old friends
he opened the gate . Ab bie was so happy and
have fun without it ." K ait lyn smiled and agreed
children didn 't know what Mam ma meant but they ran
and everyone was happy . T oni was happy to have
not believe how care free T essa was . She just
the sky . And whenever Mitt ens felt hungry , Tweet

INTERVAL 4.404 - 9.956
CONTAINS 5.987%

box . It made a bright light . She carefully put
make it perfect . While Jack was busy building
upon a time , there was a little flower that was
the river on their pink raft , enjoying the sunshine and
with her pink flowers and a silly joke in her heart

INTERVAL -1.148 - 4.404
CONTAINS 60.061%

so excited that she ran all the way to the store
. Although he was forbidden to climb it again , he
The little girl decided to run and jump towards the thing
are happy and friendly . They say sorry for fighting .
said hello to the family and pet ted the bunny .

INTERVAL -6.699 - -1.148
CONTAINS 33.390%

though it was bitter at first , it ended with her
have to listen to us . You have to be careful
their stories and their laughs . They became good friends .
had lots of water . D ucky had many friends who
sky . " You cannot use my stone !" the voice

INTERVAL -12.251 - -6.699
CONTAINS 0.561%

smiled and said , " Of course , Sara . You
€ So the little frog hopped out to the
any way she could . Mar iah found many ways to
. The end . <|endoftext|> L ily and Tom were friends
the strong man and the little girl was happy . She
Token named
Token ID3706
Feature activation-1.03e+00

Pos logit contributions
 long+0.018
 badly+0.015
 far+0.014
 much+0.014
 prow+0.014
Neg logit contributions
elling-0.042
aughter-0.040
ays-0.038
ada-0.037
atted-0.037
Token Tim
Token ID5045
Feature activation-2.47e+00

Pos logit contributions
elling+0.021
aughter+0.019
ays+0.018
ada+0.017
atted+0.017
Neg logit contributions
 long-0.026
 badly-0.024
 prow-0.023
 far-0.023
 much-0.022
Token found
Token ID1043
Feature activation+6.51e-01

Pos logit contributions
elling+0.069
aughter+0.065
ays+0.061
ada+0.060
atted+0.059
Neg logit contributions
 long-0.043
 badly-0.039
 far-0.035
 prow-0.035
 much-0.035
Token a
Token ID257
Feature activation+8.65e+00

Pos logit contributions
 long+0.008
 badly+0.007
 prow+0.006
 gall+0.006
 far+0.006
Neg logit contributions
elling-0.021
aughter-0.020
ays-0.019
ada-0.019
atted-0.019
Token big
Token ID1263
Feature activation+4.44e+00

Pos logit contributions
 prow+0.124
 badly+0.120
One+0.116
 gall+0.116
 miles+0.113
Neg logit contributions
aughter-0.211
elling-0.191
 Daughter-0.185
ison-0.185
ada-0.183
Token chest
Token ID7721
Feature activation+9.96e+00

Pos logit contributions
 long+0.081
 badly+0.078
 prow+0.075
 much+0.073
One+0.072
Neg logit contributions
elling-0.107
aughter-0.104
ison-0.093
ada-0.093
isher-0.092
Token in
Token ID287
Feature activation+1.33e+00

Pos logit contributions
 prow+0.174
 long+0.170
 villains+0.164
 gall+0.162
 badly+0.154
Neg logit contributions
aughter-0.229
elling-0.223
ays-0.208
ener-0.202
isher-0.201
Token his
Token ID465
Feature activation+1.99e+00

Pos logit contributions
 long+0.026
 badly+0.024
 prow+0.023
 gall+0.021
 much+0.021
Neg logit contributions
elling-0.032
aughter-0.031
els-0.029
ays-0.028
ison-0.028
Token room
Token ID2119
Feature activation+5.52e+00

Pos logit contributions
 badly+0.039
 long+0.038
 prow+0.036
 gall+0.035
 much+0.033
Neg logit contributions
elling-0.045
aughter-0.044
ison-0.039
ays-0.039
atted-0.038
Token.
Token ID13
Feature activation+1.92e-01

Pos logit contributions
 long+0.105
 prow+0.098
 gall+0.094
 badly+0.090
 nut+0.088
Neg logit contributions
aughter-0.126
elling-0.124
ays-0.119
atted-0.118
els-0.115
Token It
Token ID632
Feature activation+3.10e+00

Pos logit contributions
 long+0.004
 badly+0.003
 much+0.003
 far+0.003
 prow+0.003
Neg logit contributions
elling-0.005
aughter-0.005
ays-0.004
ada-0.004
atted-0.004
Token.
Token ID13
Feature activation+5.19e-01

Pos logit contributions
 long+0.008
 badly+0.007
 quest+0.007
 prow+0.007
 deserted+0.007
Neg logit contributions
elling-0.009
aughter-0.008
ison-0.008
ada-0.008
els-0.008
Token Tim
Token ID5045
Feature activation-3.01e+00

Pos logit contributions
 long+0.012
 badly+0.012
 prow+0.011
 far+0.011
 much+0.011
Neg logit contributions
elling-0.011
aughter-0.010
ays-0.009
ada-0.009
atted-0.009
Tokenmy
Token ID1820
Feature activation+7.65e-01

Pos logit contributions
elling+0.043
aughter+0.038
ays+0.033
ada+0.032
atted+0.031
Neg logit contributions
 long-0.093
 badly-0.088
 prow-0.083
 far-0.083
 much-0.083
Token had
Token ID550
Feature activation+4.50e+00

Pos logit contributions
 long+0.015
 badly+0.015
 prow+0.014
 far+0.013
 deserted+0.013
Neg logit contributions
elling-0.016
aughter-0.016
ison-0.015
cy-0.014
 means-0.014
Token a
Token ID257
Feature activation+7.85e+00

Pos logit contributions
 quest+0.088
 badly+0.087
 cave+0.083
 prow+0.081
 trail+0.081
Neg logit contributions
elling-0.074
 Daughter-0.070
 Jenny-0.069
ays-0.068
 brushes-0.067
Token chest
Token ID7721
Feature activation+9.73e+00

Pos logit contributions
 badly+0.123
 prow+0.121
 miles+0.117
 gall+0.112
On+0.112
Neg logit contributions
aughter-0.185
elling-0.169
 Daughter-0.163
cy-0.161
ison-0.160
Token that
Token ID326
Feature activation+1.76e+00

Pos logit contributions
 prow+0.158
 villains+0.148
 gall+0.144
 long+0.143
Bl+0.142
Neg logit contributions
elling-0.226
aughter-0.222
atted-0.207
isher-0.204
ener-0.202
Token he
Token ID339
Feature activation+1.77e+00

Pos logit contributions
 long+0.025
 badly+0.024
 prow+0.020
 far+0.019
 much+0.019
Neg logit contributions
elling-0.052
aughter-0.050
ison-0.046
ays-0.046
ada-0.045
Token loved
Token ID6151
Feature activation+2.17e-01

Pos logit contributions
 long+0.035
 badly+0.032
 prow+0.029
 far+0.028
 much+0.028
Neg logit contributions
elling-0.042
aughter-0.039
ison-0.036
isher-0.036
atted-0.036
Token very
Token ID845
Feature activation+2.85e+00

Pos logit contributions
 badly+0.004
 long+0.004
 prow+0.004
 deserted+0.004
ggy+0.004
Neg logit contributions
elling-0.004
aughter-0.004
els-0.004
ison-0.004
 means-0.004
Token much
Token ID881
Feature activation-1.29e-01

Pos logit contributions
 prow+0.027
 gall+0.023
 cave+0.021
 shield+0.021
 hollow+0.021
Neg logit contributions
aughter-0.087
elling-0.087
olds-0.082
ison-0.080
uns-0.080
Token.
Token ID13
Feature activation+6.21e-01

Pos logit contributions
elling+0.011
aughter+0.010
ison+0.010
ada+0.009
els+0.009
Neg logit contributions
 long-0.008
 badly-0.007
 prow-0.007
 deserted-0.007
 quest-0.007
Token She
Token ID1375
Feature activation+2.51e+00

Pos logit contributions
 long+0.015
 badly+0.014
 much+0.013
 far+0.013
 prow+0.013
Neg logit contributions
elling-0.014
aughter-0.013
ays-0.012
ada-0.011
atted-0.011
Token had
Token ID550
Feature activation+4.49e+00

Pos logit contributions
 badly+0.057
 long+0.053
 prow+0.051
 deserted+0.050
 cave+0.050
Neg logit contributions
elling-0.045
aughter-0.041
ison-0.040
 means-0.038
atted-0.037
Token a
Token ID257
Feature activation+7.86e+00

Pos logit contributions
 badly+0.075
 quest+0.074
 cave+0.073
 prow+0.070
 nut+0.066
Neg logit contributions
elling-0.099
aughter-0.093
 Daughter-0.090
ada-0.089
ays-0.088
Token big
Token ID1263
Feature activation+4.40e+00

Pos logit contributions
 prow+0.119
 badly+0.119
 gall+0.111
 deserted+0.109
 miles+0.107
Neg logit contributions
aughter-0.199
elling-0.185
ada-0.175
 Daughter-0.173
cy-0.172
Token chest
Token ID7721
Feature activation+9.64e+00

Pos logit contributions
 long+0.082
 badly+0.078
 prow+0.073
 much+0.071
 far+0.070
Neg logit contributions
elling-0.111
aughter-0.107
ada-0.097
ays-0.096
ison-0.094
Token in
Token ID287
Feature activation+1.83e+00

Pos logit contributions
 prow+0.157
 villains+0.149
 long+0.148
 gall+0.146
 badly+0.138
Neg logit contributions
elling-0.233
aughter-0.230
atted-0.210
ays-0.205
ison-0.202
Token her
Token ID607
Feature activation+1.95e+00

Pos logit contributions
 long+0.047
 badly+0.046
 prow+0.044
 gall+0.043
 cave+0.042
Neg logit contributions
elling-0.032
aughter-0.029
els-0.026
ays-0.025
ison-0.025
Token room
Token ID2119
Feature activation+5.64e+00

Pos logit contributions
 badly+0.040
 long+0.039
 prow+0.038
 gall+0.037
 deserted+0.035
Neg logit contributions
elling-0.041
aughter-0.041
ays-0.035
ison-0.035
atted-0.035
Token where
Token ID810
Feature activation-2.61e-01

Pos logit contributions
 long+0.105
 prow+0.099
 nut+0.097
 gall+0.096
 villains+0.094
Neg logit contributions
elling-0.126
aughter-0.125
atted-0.118
ays-0.114
els-0.110
Token she
Token ID673
Feature activation+1.27e+00

Pos logit contributions
elling+0.006
aughter+0.006
ays+0.006
ada+0.006
atted+0.005
Neg logit contributions
 long-0.005
 badly-0.005
 prow-0.004
 far-0.004
 much-0.004
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
elling+0.045
aughter+0.042
ison+0.040
els+0.040
ays+0.040
Neg logit contributions
 long-0.032
 badly-0.028
 prow-0.027
 gall-0.025
 deserted-0.025
Token Tom
Token ID4186
Feature activation-6.44e-01

Pos logit contributions
elling+0.002
aughter+0.002
ada+0.002
ays+0.002
atted+0.002
Neg logit contributions
 long-0.003
 badly-0.002
 much-0.002
 far-0.002
 prow-0.002
Token had
Token ID550
Feature activation+4.36e+00

Pos logit contributions
elling+0.016
aughter+0.016
ison+0.015
ays+0.014
atted+0.014
Neg logit contributions
 long-0.012
 badly-0.011
 prow-0.010
 gall-0.009
 far-0.009
Token a
Token ID257
Feature activation+7.67e+00

Pos logit contributions
 quest+0.091
 badly+0.086
 cave+0.086
 trail+0.084
 prow+0.083
Neg logit contributions
 brushes-0.066
elling-0.063
 opinions-0.062
olds-0.062
 paints-0.061
Token big
Token ID1263
Feature activation+4.18e+00

Pos logit contributions
 badly+0.115
 prow+0.114
 miles+0.111
 deserted+0.106
 gall+0.105
Neg logit contributions
aughter-0.182
elling-0.163
ison-0.160
 Daughter-0.159
ada-0.156
Token chest
Token ID7721
Feature activation+9.55e+00

Pos logit contributions
 long+0.078
 badly+0.075
 prow+0.071
 much+0.069
 far+0.068
Neg logit contributions
elling-0.101
aughter-0.098
ada-0.088
ays-0.088
ison-0.087
Token in
Token ID287
Feature activation+1.72e+00

Pos logit contributions
 prow+0.161
 long+0.155
 gall+0.149
 villains+0.147
 badly+0.147
Neg logit contributions
elling-0.213
aughter-0.208
atted-0.194
ays-0.190
ison-0.190
Token his
Token ID465
Feature activation+1.89e+00

Pos logit contributions
 long+0.029
 badly+0.029
 prow+0.026
 gall+0.025
 deserted+0.024
Neg logit contributions
elling-0.044
aughter-0.041
els-0.039
ays-0.039
ison-0.039
Token room
Token ID2119
Feature activation+5.44e+00

Pos logit contributions
 badly+0.039
 long+0.038
 prow+0.036
 gall+0.035
 deserted+0.034
Neg logit contributions
elling-0.039
aughter-0.038
ison-0.034
ays-0.034
atted-0.033
Token.
Token ID13
Feature activation+3.00e-01

Pos logit contributions
 long+0.101
 prow+0.093
 badly+0.090
 gall+0.089
 deserted+0.087
Neg logit contributions
elling-0.121
aughter-0.120
atted-0.114
ays-0.111
els-0.109
Token The
Token ID383
Feature activation+2.57e-01

Pos logit contributions
 long+0.006
 badly+0.005
 much+0.005
 far+0.005
 prow+0.005
Neg logit contributions
elling-0.008
aughter-0.007
ada-0.007
ays-0.007
atted-0.007
Token because
Token ID780
Feature activation-1.85e+00

Pos logit contributions
elling+0.041
aughter+0.039
ays+0.037
ada+0.036
atted+0.036
Neg logit contributions
 long-0.018
 badly-0.016
 much-0.014
 far-0.014
 prow-0.013
Token she
Token ID673
Feature activation+2.88e-01

Pos logit contributions
elling+0.055
aughter+0.052
ays+0.049
ada+0.048
atted+0.048
Neg logit contributions
 long-0.029
 badly-0.026
 much-0.023
 far-0.023
 prow-0.023
Token had
Token ID550
Feature activation+3.21e+00

Pos logit contributions
 long+0.005
 badly+0.004
 prow+0.004
 far+0.004
 much+0.004
Neg logit contributions
elling-0.008
aughter-0.008
ays-0.007
ada-0.007
atted-0.007
Token a
Token ID257
Feature activation+6.81e+00

Pos logit contributions
 long+0.039
 badly+0.036
 cave+0.036
 prow+0.036
 quest+0.031
Neg logit contributions
elling-0.093
aughter-0.088
ada-0.085
atted-0.085
ays-0.084
Token huge
Token ID3236
Feature activation+3.39e+00

Pos logit contributions
 prow+0.082
 long+0.081
 badly+0.080
 gall+0.073
On+0.071
Neg logit contributions
elling-0.190
aughter-0.190
ada-0.178
ison-0.175
atted-0.169
Token chest
Token ID7721
Feature activation+9.50e+00

Pos logit contributions
 long+0.060
 badly+0.056
 prow+0.053
 much+0.052
 far+0.052
Neg logit contributions
elling-0.087
aughter-0.084
ada-0.077
ays-0.075
atted-0.075
Token full
Token ID1336
Feature activation-1.09e+00

Pos logit contributions
 prow+0.155
 long+0.152
One+0.139
 gall+0.139
 badly+0.139
Neg logit contributions
elling-0.221
aughter-0.220
ener-0.207
ays-0.202
isher-0.197
Token of
Token ID286
Feature activation-1.75e+00

Pos logit contributions
elling+0.024
aughter+0.022
ada+0.020
ener+0.020
ion+0.019
Neg logit contributions
 long-0.024
 badly-0.022
 prow-0.022
 far-0.020
One-0.020
Token toys
Token ID14958
Feature activation+6.07e+00

Pos logit contributions
elling+0.045
aughter+0.041
els+0.039
ison+0.038
atted+0.038
Neg logit contributions
 long-0.029
 badly-0.029
 prow-0.027
On-0.025
One-0.025
Token.
Token ID13
Feature activation+1.40e-01

Pos logit contributions
 long+0.096
 badly+0.085
 prow+0.083
 much+0.076
 far+0.075
Neg logit contributions
elling-0.171
aughter-0.163
ays-0.155
ada-0.152
els-0.150
Token Every
Token ID3887
Feature activation-6.18e-01

Pos logit contributions
 long+0.003
 far+0.002
One+0.002
 badly+0.002
 much+0.002
Neg logit contributions
elling-0.004
aughter-0.004
isher-0.004
atted-0.004
uns-0.003
Token and
Token ID290
Feature activation+2.11e+00

Pos logit contributions
 long+0.006
 badly+0.005
 much+0.004
 far+0.004
 prow+0.004
Neg logit contributions
elling-0.012
aughter-0.011
ada-0.010
ays-0.010
atted-0.010
Token had
Token ID550
Feature activation+3.81e+00

Pos logit contributions
 long+0.032
 badly+0.026
 much+0.025
 far+0.024
 prow+0.023
Neg logit contributions
elling-0.066
aughter-0.063
ays-0.060
ada-0.059
atted-0.058
Token a
Token ID257
Feature activation+7.46e+00

Pos logit contributions
 badly+0.055
 prow+0.047
 long+0.046
 cave+0.045
 quest+0.043
Neg logit contributions
elling-0.106
aughter-0.099
ada-0.095
ays-0.094
els-0.094
Token special
Token ID2041
Feature activation+3.52e+00

Pos logit contributions
 badly+0.131
 prow+0.128
 long+0.116
 gall+0.116
 deserted+0.113
Neg logit contributions
aughter-0.176
elling-0.174
ada-0.162
 Daughter-0.156
ison-0.154
Token pink
Token ID11398
Feature activation+4.12e+00

Pos logit contributions
 long+0.065
 badly+0.061
 prow+0.057
 far+0.055
 much+0.055
Neg logit contributions
elling-0.091
aughter-0.087
ada-0.079
ays-0.079
atted-0.078
Token journal
Token ID3989
Feature activation+9.45e+00

Pos logit contributions
 long+0.074
 badly+0.071
 prow+0.067
 far+0.065
 much+0.064
Neg logit contributions
elling-0.106
aughter-0.101
ays-0.094
ada-0.094
ener-0.091
Token that
Token ID326
Feature activation+3.59e-01

Pos logit contributions
 long+0.162
 badly+0.160
 prow+0.160
 deserted+0.145
 gall+0.141
Neg logit contributions
elling-0.230
aughter-0.223
ays-0.209
ener-0.203
ison-0.197
Token she
Token ID673
Feature activation+1.06e+00

Pos logit contributions
 long+0.006
 much+0.005
 far+0.005
 badly+0.005
 hidden+0.004
Neg logit contributions
elling-0.011
aughter-0.010
ays-0.010
ada-0.010
atted-0.010
Token wrote
Token ID2630
Feature activation-5.37e-02

Pos logit contributions
 long+0.020
 badly+0.018
 prow+0.016
 much+0.016
 far+0.016
Neg logit contributions
elling-0.028
aughter-0.026
atted-0.024
ada-0.024
ays-0.024
Token in
Token ID287
Feature activation-8.07e-01

Pos logit contributions
elling+0.002
aughter+0.001
ays+0.001
ada+0.001
atted+0.001
Neg logit contributions
 long-0.001
 badly-0.001
 far-0.001
 much-0.001
 prow-0.001
Token every
Token ID790
Feature activation-1.56e+00

Pos logit contributions
elling+0.019
aughter+0.017
ays+0.016
els+0.016
ada+0.016
Neg logit contributions
 long-0.017
 badly-0.016
 prow-0.015
 far-0.014
 gall-0.014
Token Lily
Token ID20037
Feature activation-7.24e-01

Pos logit contributions
elling+0.015
aughter+0.014
ays+0.013
ada+0.012
atted+0.012
Neg logit contributions
 long-0.018
 badly-0.017
 much-0.016
 far-0.016
 prow-0.016
Token.
Token ID13
Feature activation+2.52e-01

Pos logit contributions
elling+0.018
aughter+0.017
ison+0.016
ada+0.016
els+0.016
Neg logit contributions
 long-0.014
 badly-0.012
 prow-0.011
 quest-0.011
 gall-0.011
Token She
Token ID1375
Feature activation+2.27e+00

Pos logit contributions
 long+0.006
 badly+0.006
 much+0.005
 far+0.005
 prow+0.005
Neg logit contributions
elling-0.005
aughter-0.005
ays-0.005
ada-0.005
atted-0.004
Token had
Token ID550
Feature activation+4.46e+00

Pos logit contributions
 badly+0.051
 long+0.048
 prow+0.046
 deserted+0.045
 cave+0.045
Neg logit contributions
elling-0.041
aughter-0.038
ison-0.037
 means-0.035
atted-0.035
Token a
Token ID257
Feature activation+7.81e+00

Pos logit contributions
 badly+0.074
 quest+0.072
 cave+0.071
 prow+0.068
 gall+0.063
Neg logit contributions
elling-0.100
aughter-0.093
 Daughter-0.089
ada-0.089
ison-0.087
Token journal
Token ID3989
Feature activation+9.40e+00

Pos logit contributions
 prow+0.117
 badly+0.116
 gall+0.107
 deserted+0.106
 miles+0.105
Neg logit contributions
aughter-0.200
elling-0.185
ada-0.177
 Daughter-0.174
ison-0.172
Token that
Token ID326
Feature activation+1.33e+00

Pos logit contributions
 long+0.163
 prow+0.162
 badly+0.161
 deserted+0.150
 gall+0.148
Neg logit contributions
elling-0.222
aughter-0.211
ays-0.197
ison-0.190
ener-0.185
Token she
Token ID673
Feature activation+1.57e+00

Pos logit contributions
 long+0.023
 badly+0.020
 much+0.019
 far+0.019
 prow+0.018
Neg logit contributions
elling-0.038
aughter-0.036
ays-0.035
ada-0.034
atted-0.034
Token loved
Token ID6151
Feature activation-1.65e-01

Pos logit contributions
 long+0.030
 badly+0.028
 prow+0.025
 far+0.024
 cave+0.024
Neg logit contributions
elling-0.039
aughter-0.037
atted-0.034
ison-0.034
isher-0.034
Token to
Token ID284
Feature activation+2.08e+00

Pos logit contributions
elling+0.004
aughter+0.004
els+0.003
ays+0.003
ison+0.003
Neg logit contributions
 long-0.003
 badly-0.003
 prow-0.003
 deserted-0.003
 far-0.003
Token write
Token ID3551
Feature activation-3.54e-02

Pos logit contributions
 badly+0.042
 long+0.039
 prow+0.038
 deserted+0.036
 far+0.034
Neg logit contributions
elling-0.046
aughter-0.044
els-0.040
 agrees-0.038
 means-0.038
Token little
Token ID1310
Feature activation+2.62e+00

Pos logit contributions
 badly+0.117
 prow+0.112
 gall+0.101
 deserted+0.099
 cave+0.099
Neg logit contributions
aughter-0.173
elling-0.166
ada-0.157
cy-0.153
ison-0.151
Token rabbit
Token ID22746
Feature activation+3.69e+00

Pos logit contributions
 long+0.065
 badly+0.061
 prow+0.058
 much+0.057
 far+0.056
Neg logit contributions
elling-0.052
aughter-0.049
ada-0.044
ays-0.043
ison-0.042
Token named
Token ID3706
Feature activation-5.41e-01

Pos logit contributions
 long+0.068
 badly+0.063
 much+0.054
 deserted+0.054
 prow+0.053
Neg logit contributions
elling-0.095
aughter-0.087
atted-0.086
ays-0.083
ada-0.083
Token Tim
Token ID5045
Feature activation-1.85e+00

Pos logit contributions
elling+0.010
aughter+0.010
ada+0.009
ays+0.008
ison+0.008
Neg logit contributions
 long-0.014
 badly-0.013
 prow-0.012
 far-0.012
 much-0.012
Token found
Token ID1043
Feature activation+8.67e-01

Pos logit contributions
elling+0.050
aughter+0.047
ays+0.044
ada+0.044
atted+0.043
Neg logit contributions
 long-0.033
 badly-0.030
 prow-0.027
 much-0.027
 far-0.026
Token a
Token ID257
Feature activation+9.31e+00

Pos logit contributions
 long+0.011
 badly+0.010
 prow+0.009
 quest+0.008
 gall+0.008
Neg logit contributions
elling-0.027
aughter-0.025
ays-0.024
atted-0.024
ada-0.024
Token shiny
Token ID22441
Feature activation+6.22e+00

Pos logit contributions
 miles+0.138
 villains+0.134
 badly+0.128
 prow+0.128
vel+0.124
Neg logit contributions
aughter-0.192
 Daughter-0.177
 Girl-0.174
ada-0.168
ison-0.168
Token trophy
Token ID16383
Feature activation+8.49e+00

Pos logit contributions
 prow+0.107
 long+0.104
 much+0.101
On+0.099
One+0.099
Neg logit contributions
elling-0.152
aughter-0.143
isher-0.137
ener-0.136
ada-0.130
Token in
Token ID287
Feature activation+1.06e+00

Pos logit contributions
 long+0.160
 prow+0.146
 badly+0.140
One+0.139
 much+0.138
Neg logit contributions
elling-0.189
aughter-0.187
ays-0.178
isher-0.173
els-0.171
Token the
Token ID262
Feature activation+1.50e+00

Pos logit contributions
 long+0.021
 badly+0.021
 prow+0.018
 gall+0.017
 much+0.017
Neg logit contributions
elling-0.025
aughter-0.023
els-0.022
ison-0.022
ays-0.022
Token forest
Token ID8222
Feature activation+5.62e+00

Pos logit contributions
 badly+0.029
 long+0.027
 prow+0.024
 gall+0.024
 much+0.023
Neg logit contributions
elling-0.038
aughter-0.033
ison-0.031
ada-0.031
ays-0.030
Token.
Token ID13
Feature activation+2.02e-02

Pos logit contributions
 long+0.102
 badly+0.093
 prow+0.087
 much+0.081
 far+0.081
Neg logit contributions
elling-0.178
aughter-0.171
ays-0.159
els-0.154
ada-0.153
Token He
Token ID679
Feature activation+3.31e+00

Pos logit contributions
 long+0.000
 badly+0.000
 much+0.000
 far+0.000
 prow+0.000
Neg logit contributions
elling-0.001
aughter-0.001
atted-0.001
ada-0.001
ays-0.001
Token has
Token ID468
Feature activation+3.81e+00

Pos logit contributions
 long+0.088
 badly+0.084
 prow+0.078
 much+0.077
 far+0.076
Neg logit contributions
elling-0.060
aughter-0.055
ays-0.051
els-0.048
atted-0.048
Token a
Token ID257
Feature activation+7.86e+00

Pos logit contributions
 badly+0.054
 prow+0.048
 long+0.045
 cave+0.042
 gall+0.042
Neg logit contributions
elling-0.104
aughter-0.097
ays-0.096
ison-0.094
els-0.094
Token red
Token ID2266
Feature activation+4.61e+00

Pos logit contributions
 prow+0.111
 badly+0.110
 gall+0.101
One+0.095
 miles+0.092
Neg logit contributions
aughter-0.206
elling-0.194
ison-0.186
isher-0.178
ada-0.177
Token journal
Token ID3989
Feature activation+9.30e+00

Pos logit contributions
 long+0.088
 badly+0.087
 prow+0.084
 much+0.081
 deserted+0.079
Neg logit contributions
elling-0.107
aughter-0.104
ener-0.095
ays-0.094
isher-0.094
Token with
Token ID351
Feature activation-1.84e-01

Pos logit contributions
 badly+0.175
 long+0.173
 prow+0.169
 deserted+0.153
One+0.153
Neg logit contributions
elling-0.203
aughter-0.192
ays-0.186
ener-0.186
isher-0.178
Token many
Token ID867
Feature activation-1.05e+00

Pos logit contributions
elling+0.005
aughter+0.005
ays+0.005
ada+0.005
atted+0.005
Neg logit contributions
 long-0.003
 badly-0.003
 prow-0.002
 far-0.002
 much-0.002
Token pages
Token ID5468
Feature activation+5.97e+00

Pos logit contributions
elling+0.021
aughter+0.018
els+0.017
ays+0.017
atted+0.017
Neg logit contributions
 long-0.025
 badly-0.025
 prow-0.024
 far-0.023
 gall-0.022
Token.
Token ID13
Feature activation-4.26e-01

Pos logit contributions
 long+0.104
 prow+0.103
 badly+0.102
 gall+0.093
 che+0.091
Neg logit contributions
elling-0.146
ays-0.138
aughter-0.135
els-0.131
atted-0.127
Token He
Token ID679
Feature activation+2.64e+00

Pos logit contributions
elling+0.012
aughter+0.012
ada+0.011
ener+0.011
atted+0.011
Neg logit contributions
 long-0.007
 much-0.006
 badly-0.006
 far-0.006
One-0.006
Token:
Token ID25
Feature activation+1.87e+00

Pos logit contributions
 long+0.091
 badly+0.086
 prow+0.085
One+0.082
 gall+0.082
Neg logit contributions
aughter-0.125
elling-0.124
isher-0.116
els-0.115
ion-0.115
Token he
Token ID339
Feature activation+2.50e+00

Pos logit contributions
 long+0.034
 badly+0.033
 prow+0.030
 far+0.029
 cave+0.028
Neg logit contributions
elling-0.046
aughter-0.044
ays-0.040
ada-0.040
ison-0.040
Token had
Token ID550
Feature activation+4.27e+00

Pos logit contributions
 long+0.047
 badly+0.044
 prow+0.041
 far+0.039
 cave+0.038
Neg logit contributions
elling-0.059
aughter-0.055
ison-0.052
atted-0.052
ada-0.050
Token a
Token ID257
Feature activation+8.08e+00

Pos logit contributions
 badly+0.062
 long+0.055
 prow+0.053
 nut+0.049
One+0.048
Neg logit contributions
elling-0.108
aughter-0.100
atted-0.099
ison-0.097
els-0.097
Token bright
Token ID6016
Feature activation+5.60e+00

Pos logit contributions
 badly+0.113
 prow+0.111
One+0.106
On+0.104
 gall+0.102
Neg logit contributions
aughter-0.199
elling-0.184
ison-0.175
ener-0.173
ada-0.172
Token diary
Token ID26339
Feature activation+9.30e+00

Pos logit contributions
 long+0.096
 badly+0.096
 prow+0.091
 much+0.086
 far+0.086
Neg logit contributions
elling-0.141
aughter-0.134
ener-0.123
isher-0.122
ada-0.122
Token.
Token ID13
Feature activation+4.73e-01

Pos logit contributions
 prow+0.168
 badly+0.168
One+0.168
 long+0.168
Bl+0.165
Neg logit contributions
elling-0.168
ener-0.168
aughter-0.163
ays-0.161
isher-0.150
Token He
Token ID679
Feature activation+2.76e+00

Pos logit contributions
 long+0.010
 badly+0.009
 prow+0.008
 far+0.008
 much+0.008
Neg logit contributions
elling-0.012
aughter-0.011
ays-0.010
ada-0.010
atted-0.010
Token would
Token ID561
Feature activation+4.83e+00

Pos logit contributions
 long+0.055
 badly+0.052
 prow+0.046
 much+0.046
 far+0.046
Neg logit contributions
elling-0.063
aughter-0.058
atted-0.056
els-0.055
ison-0.055
Token hold
Token ID1745
Feature activation-1.34e+00

Pos logit contributions
 badly+0.083
 long+0.080
 deserted+0.072
ggy+0.071
 much+0.071
Neg logit contributions
elling-0.110
aughter-0.104
ener-0.099
els-0.099
 means-0.095
Token it
Token ID340
Feature activation+2.50e+00

Pos logit contributions
elling+0.044
aughter+0.040
ada+0.039
ays+0.038
uns+0.038
Neg logit contributions
 long-0.021
 badly-0.017
 far-0.016
 much-0.016
 well-0.014
TokenOne
Token ID3198
Feature activation+2.27e-01

Pos logit contributions
 long+0.009
 badly+0.009
 prow+0.008
 far+0.008
 much+0.008
Neg logit contributions
elling-0.011
aughter-0.011
ays-0.010
ada-0.010
ucks-0.010
Token day
Token ID1110
Feature activation+5.55e-01

Pos logit contributions
 long+0.005
 badly+0.005
 prow+0.004
 gall+0.004
 deserted+0.004
Neg logit contributions
elling-0.005
aughter-0.005
ays-0.004
atted-0.004
ada-0.004
Token,
Token ID11
Feature activation+2.04e+00

Pos logit contributions
 long+0.011
 badly+0.011
 prow+0.011
 cave+0.010
 deserted+0.010
Neg logit contributions
elling-0.012
aughter-0.012
atted-0.011
ays-0.011
ada-0.011
Token Tom
Token ID4186
Feature activation-1.09e+00

Pos logit contributions
 long+0.025
 prow+0.023
 badly+0.023
 gall+0.021
 cave+0.021
Neg logit contributions
elling-0.061
aughter-0.059
ays-0.056
atted-0.055
ada-0.054
Token saw
Token ID2497
Feature activation+7.80e-01

Pos logit contributions
elling+0.030
aughter+0.029
ays+0.027
ada+0.027
atted+0.026
Neg logit contributions
 long-0.019
 badly-0.017
 far-0.016
 much-0.016
 prow-0.016
Token a
Token ID257
Feature activation+9.29e+00

Pos logit contributions
 long+0.011
 badly+0.010
 prow+0.009
 gall+0.008
 quest+0.008
Neg logit contributions
elling-0.024
aughter-0.022
ays-0.021
ada-0.021
atted-0.021
Token beautiful
Token ID4950
Feature activation+5.21e+00

Pos logit contributions
 badly+0.134
 prow+0.130
 gall+0.119
 long+0.117
One+0.116
Neg logit contributions
aughter-0.235
elling-0.223
ison-0.210
ada-0.206
 Daughter-0.203
Token sunrise
Token ID40048
Feature activation+5.61e+00

Pos logit contributions
 long+0.103
 badly+0.101
 prow+0.098
 much+0.095
One+0.094
Neg logit contributions
elling-0.117
aughter-0.113
ada-0.101
 Daughter-0.095
ison-0.095
Token.
Token ID13
Feature activation+5.28e-02

Pos logit contributions
 long+0.111
 badly+0.102
 prow+0.100
 deserted+0.100
 villains+0.096
Neg logit contributions
elling-0.123
ays-0.117
aughter-0.116
ion-0.111
els-0.108
Token The
Token ID383
Feature activation-1.22e-01

Pos logit contributions
 long+0.001
 badly+0.001
 much+0.001
 far+0.001
 prow+0.001
Neg logit contributions
elling-0.001
aughter-0.001
ays-0.001
ada-0.001
atted-0.001
Token sky
Token ID6766
Feature activation+3.27e+00

Pos logit contributions
elling+0.003
aughter+0.003
ays+0.003
ada+0.003
atted+0.003
Neg logit contributions
 long-0.002
 badly-0.002
 prow-0.002
 far-0.002
 much-0.002
Token,
Token ID11
Feature activation+1.46e+00

Pos logit contributions
 long+0.017
 badly+0.015
 far+0.014
 much+0.014
 prow+0.014
Neg logit contributions
elling-0.027
aughter-0.026
ays-0.024
ada-0.024
atted-0.023
Token Lily
Token ID20037
Feature activation-4.22e-01

Pos logit contributions
 long+0.033
 badly+0.032
 prow+0.030
 far+0.029
 much+0.029
Neg logit contributions
elling-0.030
aughter-0.029
ays-0.026
ada-0.026
atted-0.025
Token saw
Token ID2497
Feature activation-9.50e-01

Pos logit contributions
elling+0.009
aughter+0.009
ison+0.009
atted+0.008
ays+0.008
Neg logit contributions
 badly-0.008
 long-0.008
 much-0.007
 prow-0.007
 nut-0.007
Token a
Token ID257
Feature activation+7.67e+00

Pos logit contributions
elling+0.031
aughter+0.029
ays+0.028
ada+0.028
atted+0.027
Neg logit contributions
 long-0.011
 badly-0.010
 prow-0.009
 far-0.008
 cave-0.008
Token big
Token ID1263
Feature activation+4.08e+00

Pos logit contributions
 badly+0.110
 prow+0.102
gh+0.101
One+0.098
 much+0.096
Neg logit contributions
aughter-0.176
ison-0.172
elling-0.171
 Girl-0.169
ada-0.167
Token chest
Token ID7721
Feature activation+9.27e+00

Pos logit contributions
 long+0.071
 badly+0.070
 much+0.065
On+0.063
One+0.063
Neg logit contributions
elling-0.104
aughter-0.096
ison-0.090
ada-0.089
ener-0.088
Token.
Token ID13
Feature activation-1.95e-01

Pos logit contributions
 villains+0.176
 prow+0.168
 long+0.160
 quest+0.152
 gall+0.152
Neg logit contributions
aughter-0.194
elling-0.194
ays-0.183
ener-0.177
ada-0.172
Token She
Token ID1375
Feature activation+2.90e+00

Pos logit contributions
elling+0.005
aughter+0.004
ays+0.004
isher+0.004
atted+0.004
Neg logit contributions
 long-0.004
 badly-0.004
 much-0.004
 far-0.004
 prow-0.004
Token asked
Token ID1965
Feature activation-6.20e+00

Pos logit contributions
 long+0.063
 badly+0.060
 much+0.056
 far+0.055
 prow+0.054
Neg logit contributions
elling-0.055
atted-0.053
aughter-0.051
ison-0.050
els-0.049
Token her
Token ID607
Feature activation-1.66e+00

Pos logit contributions
elling+0.185
ays+0.169
isher+0.163
aughter+0.163
uns+0.158
Neg logit contributions
 long-0.116
 badly-0.102
 much-0.099
 far-0.097
 well-0.084
Token friends
Token ID2460
Feature activation-2.74e-01

Pos logit contributions
elling+0.044
aughter+0.042
ada+0.039
ays+0.039
atted+0.038
Neg logit contributions
 long-0.031
 badly-0.028
 prow-0.026
 far-0.025
 much-0.025
Token a
Token ID257
Feature activation+7.03e+00

Pos logit contributions
 long+0.027
 badly+0.024
 prow+0.024
 cave+0.022
 gall+0.021
Neg logit contributions
elling-0.065
aughter-0.063
ays-0.059
atted-0.058
ada-0.058
Token girl
Token ID2576
Feature activation+1.91e+00

Pos logit contributions
 badly+0.122
 prow+0.117
 long+0.107
 gall+0.106
 deserted+0.105
Neg logit contributions
aughter-0.169
elling-0.165
ada-0.153
cy-0.149
ison-0.147
Token named
Token ID3706
Feature activation-5.25e-01

Pos logit contributions
 long+0.024
 badly+0.020
 far+0.019
 much+0.018
 prow+0.018
Neg logit contributions
elling-0.062
aughter-0.060
ays-0.057
ada-0.056
isher-0.055
Token Sue
Token ID26089
Feature activation-9.16e-01

Pos logit contributions
elling+0.011
aughter+0.010
ays+0.010
isher+0.009
atted+0.009
Neg logit contributions
 long-0.013
 badly-0.012
 much-0.012
 far-0.011
 prow-0.011
Token found
Token ID1043
Feature activation+8.45e-01

Pos logit contributions
elling+0.026
aughter+0.025
ays+0.023
ada+0.023
atted+0.023
Neg logit contributions
 long-0.015
 badly-0.014
 far-0.012
 much-0.012
 prow-0.012
Token a
Token ID257
Feature activation+9.27e+00

Pos logit contributions
 long+0.009
 badly+0.007
 prow+0.006
 far+0.006
 much+0.006
Neg logit contributions
elling-0.029
aughter-0.028
ays-0.027
ada-0.026
atted-0.026
Token broken
Token ID5445
Feature activation+3.14e+00

Pos logit contributions
 prow+0.133
 gall+0.126
 badly+0.122
 villains+0.120
One+0.120
Neg logit contributions
aughter-0.238
elling-0.212
 Daughter-0.206
ada-0.203
ison-0.201
Token scarf
Token ID36953
Feature activation+8.71e+00

Pos logit contributions
 prow+0.058
 long+0.057
 gall+0.055
One+0.055
On+0.055
Neg logit contributions
elling-0.067
aughter-0.065
isher-0.060
ener-0.059
ada-0.058
Token.
Token ID13
Feature activation+6.81e-02

Pos logit contributions
 long+0.160
 prow+0.149
 villains+0.143
 cave+0.142
 badly+0.141
Neg logit contributions
aughter-0.196
elling-0.196
ays-0.188
ion-0.177
ada-0.177
Token She
Token ID1375
Feature activation+3.37e+00

Pos logit contributions
 long+0.002
 badly+0.001
 much+0.001
 far+0.001
 prow+0.001
Neg logit contributions
elling-0.002
aughter-0.002
isher-0.002
ays-0.001
ada-0.001
Token wanted
Token ID2227
Feature activation+2.48e-01

Pos logit contributions
 long+0.065
 badly+0.059
 prow+0.055
 much+0.054
 far+0.053
Neg logit contributions
elling-0.086
aughter-0.081
atted-0.075
ays-0.074
els-0.074
Token little
Token ID1310
Feature activation+2.38e+00

Pos logit contributions
 badly+0.102
 prow+0.094
 long+0.085
 gall+0.084
 cave+0.084
Neg logit contributions
aughter-0.174
elling-0.167
ada-0.159
cy-0.156
ison-0.153
Token girl
Token ID2576
Feature activation+8.41e-01

Pos logit contributions
 long+0.057
 badly+0.053
 prow+0.050
 much+0.049
 far+0.049
Neg logit contributions
elling-0.050
aughter-0.047
ays-0.043
ada-0.043
atted-0.041
Token named
Token ID3706
Feature activation-1.19e+00

Pos logit contributions
 long+0.011
 badly+0.010
 far+0.009
 much+0.009
 prow+0.009
Neg logit contributions
elling-0.027
aughter-0.026
ays-0.024
ada-0.024
atted-0.024
Token Lucy
Token ID22162
Feature activation-1.11e+00

Pos logit contributions
elling+0.025
aughter+0.023
ays+0.022
isher+0.022
atted+0.021
Neg logit contributions
 long-0.030
 badly-0.027
 much-0.026
 far-0.026
 prow-0.025
Token found
Token ID1043
Feature activation+5.87e-01

Pos logit contributions
elling+0.032
aughter+0.031
ays+0.029
ada+0.028
atted+0.028
Neg logit contributions
 long-0.018
 badly-0.016
 far-0.015
 prow-0.015
 much-0.015
Token a
Token ID257
Feature activation+9.26e+00

Pos logit contributions
 long+0.006
 badly+0.005
 far+0.004
 much+0.004
 prow+0.004
Neg logit contributions
elling-0.021
aughter-0.020
ays-0.019
ada-0.019
atted-0.018
Token big
Token ID1263
Feature activation+4.82e+00

Pos logit contributions
 prow+0.132
 villains+0.126
 gall+0.125
 badly+0.123
One+0.122
Neg logit contributions
aughter-0.235
 Daughter-0.205
elling-0.204
ison-0.201
ada-0.199
Token box
Token ID3091
Feature activation+8.57e+00

Pos logit contributions
 long+0.090
 badly+0.086
 prow+0.083
 much+0.081
One+0.079
Neg logit contributions
elling-0.116
aughter-0.113
ada-0.100
ison-0.100
ener-0.098
Token in
Token ID287
Feature activation+1.77e+00

Pos logit contributions
 long+0.163
 prow+0.157
 villains+0.155
 badly+0.150
 gall+0.149
Neg logit contributions
elling-0.189
aughter-0.188
ays-0.175
ion-0.171
atted-0.170
Token her
Token ID607
Feature activation+2.09e+00

Pos logit contributions
 long+0.042
 badly+0.039
 prow+0.036
 far+0.036
 much+0.036
Neg logit contributions
elling-0.039
aughter-0.036
ays-0.033
ada-0.033
atted-0.032
Token room
Token ID2119
Feature activation+5.31e+00

Pos logit contributions
 long+0.037
 badly+0.035
 prow+0.032
 much+0.031
 gall+0.031
Neg logit contributions
elling-0.056
aughter-0.053
ays-0.049
ada-0.048
atted-0.048
Token swimming
Token ID14899
Feature activation+1.20e+00

Pos logit contributions
elling+0.025
atted+0.024
ays+0.023
els+0.022
ries+0.022
Neg logit contributions
 badly-0.016
case-0.015
 prow-0.015
 quest-0.015
 deserted-0.015
Token,
Token ID11
Feature activation+1.13e+00

Pos logit contributions
 long+0.023
 badly+0.021
 prow+0.020
 gall+0.019
 deserted+0.019
Neg logit contributions
elling-0.030
aughter-0.027
atted-0.026
ays-0.026
ada-0.025
Token Fin
Token ID4463
Feature activation-1.44e+00

Pos logit contributions
 badly+0.025
 prow+0.025
 cave+0.025
case+0.024
 long+0.024
Neg logit contributions
elling-0.018
ada-0.016
atted-0.015
aughter-0.015
ays-0.015
Token found
Token ID1043
Feature activation-1.54e-02

Pos logit contributions
elling+0.032
aughter+0.030
atted+0.029
ays+0.029
ada+0.029
Neg logit contributions
 badly-0.028
 long-0.027
 prow-0.024
 much-0.024
 cave-0.023
Token a
Token ID257
Feature activation+7.71e+00

Pos logit contributions
elling+0.000
aughter+0.000
ays+0.000
atted+0.000
ada+0.000
Neg logit contributions
 long-0.000
 badly-0.000
 prow-0.000
 gall-0.000
 quest-0.000
Token journal
Token ID3989
Feature activation+9.26e+00

Pos logit contributions
 prow+0.106
One+0.101
 badly+0.097
 gall+0.096
 villains+0.094
Neg logit contributions
aughter-0.185
elling-0.175
ada-0.166
ison-0.163
ener-0.159
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
 long+0.188
 badly+0.173
 prow+0.173
 much+0.171
One+0.168
Neg logit contributions
ays-0.177
elling-0.176
aughter-0.168
ener-0.166
els-0.155
Token The
Token ID383
Feature activation+8.10e-02

Pos logit contributions
 long+0.003
 badly+0.003
 prow+0.003
 much+0.003
 far+0.003
Neg logit contributions
elling-0.004
aughter-0.004
ays-0.004
ada-0.004
atted-0.004
Token journal
Token ID3989
Feature activation+5.77e+00

Pos logit contributions
 badly+0.002
 prow+0.001
 long+0.001
 gall+0.001
ggy+0.001
Neg logit contributions
elling-0.002
aughter-0.002
ison-0.002
ener-0.002
els-0.002
Token had
Token ID550
Feature activation+4.19e+00

Pos logit contributions
 long+0.120
 badly+0.112
 prow+0.103
 much+0.103
 far+0.100
Neg logit contributions
elling-0.133
aughter-0.124
ays-0.116
atted-0.114
els-0.113
Token many
Token ID867
Feature activation+2.69e-01

Pos logit contributions
 badly+0.067
 prow+0.053
 seek+0.052
ggy+0.052
One+0.049
Neg logit contributions
elling-0.111
aughter-0.095
ays-0.094
 means-0.092
ison-0.092
Token boy
Token ID2933
Feature activation+9.70e-01

Pos logit contributions
 long+0.060
 badly+0.057
 prow+0.055
 much+0.053
 deserted+0.052
Neg logit contributions
elling-0.046
aughter-0.043
ada-0.039
ays-0.037
ison-0.037
Token named
Token ID3706
Feature activation-8.91e-01

Pos logit contributions
 long+0.013
 badly+0.011
 far+0.010
 much+0.010
 prow+0.010
Neg logit contributions
elling-0.031
aughter-0.030
ays-0.028
ada-0.027
atted-0.027
Token Tim
Token ID5045
Feature activation-2.50e+00

Pos logit contributions
elling+0.018
aughter+0.016
ays+0.015
ada+0.015
atted+0.014
Neg logit contributions
 long-0.022
 badly-0.021
 prow-0.020
 far-0.020
 much-0.019
Token found
Token ID1043
Feature activation+7.73e-01

Pos logit contributions
elling+0.069
aughter+0.065
ays+0.061
ada+0.061
atted+0.060
Neg logit contributions
 long-0.044
 badly-0.040
 prow-0.036
 far-0.036
 much-0.036
Token a
Token ID257
Feature activation+8.89e+00

Pos logit contributions
 long+0.010
 badly+0.008
 prow+0.007
 far+0.007
 gall+0.007
Neg logit contributions
elling-0.025
aughter-0.024
ays-0.023
ada-0.022
atted-0.022
Token puzzle
Token ID15027
Feature activation+9.26e+00

Pos logit contributions
 prow+0.124
 badly+0.119
 gall+0.117
 miles+0.116
One+0.116
Neg logit contributions
aughter-0.217
elling-0.193
 Daughter-0.193
ison-0.190
ada-0.188
Token in
Token ID287
Feature activation+1.19e+00

Pos logit contributions
 prow+0.175
 long+0.173
 gall+0.168
 badly+0.155
Bl+0.150
Neg logit contributions
aughter-0.194
elling-0.188
ion-0.183
isher-0.176
ays-0.173
Token his
Token ID465
Feature activation+1.84e+00

Pos logit contributions
 long+0.020
 badly+0.019
 prow+0.017
 far+0.016
 much+0.016
Neg logit contributions
elling-0.032
aughter-0.031
ays-0.029
els-0.028
ison-0.028
Token toy
Token ID13373
Feature activation+4.04e+00

Pos logit contributions
 long+0.034
 badly+0.033
 prow+0.030
 gall+0.029
 much+0.028
Neg logit contributions
elling-0.047
aughter-0.045
ays-0.041
atted-0.040
ison-0.040
Token box
Token ID3091
Feature activation+6.36e+00

Pos logit contributions
 long+0.084
 badly+0.078
 prow+0.073
 much+0.071
 gall+0.070
Neg logit contributions
aughter-0.093
elling-0.093
ays-0.087
ison-0.082
ada-0.081
Token.
Token ID13
Feature activation+4.73e-01

Pos logit contributions
 long+0.122
 prow+0.114
 villains+0.108
 gall+0.108
 badly+0.106
Neg logit contributions
elling-0.144
aughter-0.143
ays-0.139
atted-0.134
els-0.133
Token a
Token ID257
Feature activation+6.77e+00

Pos logit contributions
 long+0.028
 prow+0.027
 badly+0.026
 quest+0.025
 cave+0.024
Neg logit contributions
elling-0.063
aughter-0.062
ays-0.058
ada-0.058
atted-0.057
Token brown
Token ID7586
Feature activation+3.21e+00

Pos logit contributions
 badly+0.121
 prow+0.116
 long+0.107
 gall+0.106
 deserted+0.104
Neg logit contributions
aughter-0.160
elling-0.155
ada-0.146
cy-0.141
ison-0.140
Token dog
Token ID3290
Feature activation+2.68e+00

Pos logit contributions
 long+0.068
 badly+0.063
 much+0.060
 prow+0.059
 far+0.057
Neg logit contributions
elling-0.074
aughter-0.072
ada-0.065
ays-0.063
atted-0.062
Token found
Token ID1043
Feature activation+9.71e-01

Pos logit contributions
 long+0.048
 badly+0.044
 much+0.038
 prow+0.038
 deserted+0.038
Neg logit contributions
elling-0.070
aughter-0.065
atted-0.063
ays-0.062
ada-0.061
Token a
Token ID257
Feature activation+8.75e+00

Pos logit contributions
 long+0.012
 badly+0.010
 prow+0.009
 far+0.008
 cave+0.008
Neg logit contributions
elling-0.032
aughter-0.030
ays-0.029
atted-0.028
ada-0.028
Token puzzle
Token ID15027
Feature activation+9.25e+00

Pos logit contributions
 prow+0.130
 miles+0.129
 badly+0.120
 gall+0.120
vel+0.117
Neg logit contributions
aughter-0.189
 Daughter-0.179
ada-0.170
elling-0.168
 Girl-0.167
Token.
Token ID13
Feature activation+2.31e-01

Pos logit contributions
 long+0.168
 prow+0.164
 gall+0.157
 badly+0.153
One+0.146
Neg logit contributions
aughter-0.206
elling-0.202
ion-0.196
ays-0.186
isher-0.183
Token The
Token ID383
Feature activation-5.99e-01

Pos logit contributions
 long+0.006
 badly+0.006
 prow+0.005
 far+0.005
 much+0.005
Neg logit contributions
elling-0.004
aughter-0.004
ays-0.004
ada-0.004
atted-0.004
Token puzzle
Token ID15027
Feature activation+4.75e+00

Pos logit contributions
elling+0.013
aughter+0.013
ays+0.011
ison+0.011
ada+0.011
Neg logit contributions
 badly-0.013
 long-0.012
 prow-0.012
 gall-0.011
 much-0.011
Token was
Token ID373
Feature activation+6.59e+00

Pos logit contributions
 long+0.083
 badly+0.075
 prow+0.068
 far+0.068
 much+0.067
Neg logit contributions
elling-0.132
aughter-0.125
ays-0.117
ada-0.115
atted-0.114
Token big
Token ID1263
Feature activation+1.78e+00

Pos logit contributions
 prow+0.088
 badly+0.085
 gall+0.081
 long+0.081
ggy+0.075
Neg logit contributions
elling-0.181
aughter-0.170
els-0.158
ays-0.157
atted-0.154
Token green
Token ID4077
Feature activation+5.17e+00

Pos logit contributions
 long+0.058
 badly+0.054
 prow+0.048
 much+0.047
 far+0.046
Neg logit contributions
elling-0.110
aughter-0.104
ays-0.097
ada-0.096
atted-0.095
Token forest
Token ID8222
Feature activation+5.01e+00

Pos logit contributions
 badly+0.099
 long+0.097
 much+0.089
 gall+0.084
 prow+0.083
Neg logit contributions
elling-0.119
aughter-0.115
ays-0.106
cy-0.106
ada-0.104
Token,
Token ID11
Feature activation+2.90e+00

Pos logit contributions
 long+0.108
 badly+0.103
 prow+0.101
 gall+0.100
 deserted+0.098
Neg logit contributions
elling-0.101
aughter-0.095
atted-0.092
ays-0.088
els-0.087
Token there
Token ID612
Feature activation+1.80e+00

Pos logit contributions
 long+0.035
 badly+0.033
 prow+0.031
 gall+0.030
 quest+0.030
Neg logit contributions
elling-0.085
aughter-0.083
cy-0.076
ison-0.076
 means-0.075
Token lived
Token ID5615
Feature activation+5.00e+00

Pos logit contributions
 long+0.036
 badly+0.033
 prow+0.030
 far+0.030
 much+0.030
Neg logit contributions
elling-0.044
aughter-0.041
ays-0.039
ada-0.038
atted-0.038
Token a
Token ID257
Feature activation+9.24e+00

Pos logit contributions
 badly+0.086
 long+0.082
 quest+0.080
 gall+0.073
gh+0.073
Neg logit contributions
elling-0.116
aughter-0.113
ucks-0.104
ada-0.102
ion-0.100
Token little
Token ID1310
Feature activation+3.03e+00

Pos logit contributions
gh+0.159
 badly+0.157
 prow+0.141
 quest+0.134
 gall+0.131
Neg logit contributions
aughter-0.191
cy-0.188
elling-0.182
ada-0.181
 Girl-0.176
Token leaf
Token ID12835
Feature activation+6.84e+00

Pos logit contributions
 long+0.064
 badly+0.061
 prow+0.057
 much+0.056
 far+0.054
Neg logit contributions
elling-0.070
aughter-0.065
ada-0.060
ison-0.058
ays-0.058
Token.
Token ID13
Feature activation+6.10e-01

Pos logit contributions
 long+0.136
 deserted+0.129
 prow+0.122
 quest+0.120
 badly+0.120
Neg logit contributions
elling-0.134
atted-0.128
aughter-0.127
ion-0.126
ays-0.126
Token The
Token ID383
Feature activation+3.13e-01

Pos logit contributions
 long+0.016
 badly+0.014
 much+0.014
 prow+0.014
 far+0.014
Neg logit contributions
elling-0.012
aughter-0.011
ays-0.010
ada-0.010
atted-0.010
Token little
Token ID1310
Feature activation-3.46e+00

Pos logit contributions
 badly+0.006
 long+0.006
 prow+0.005
 deserted+0.005
 gall+0.005
Neg logit contributions
elling-0.008
aughter-0.007
ada-0.007
ays-0.007
atted-0.007
Token little
Token ID1310
Feature activation+2.42e+00

Pos logit contributions
 badly+0.116
 prow+0.109
 gall+0.100
 long+0.100
 deserted+0.097
Neg logit contributions
aughter-0.175
elling-0.170
ada-0.160
cy-0.155
ison-0.155
Token girl
Token ID2576
Feature activation+1.60e+00

Pos logit contributions
 long+0.061
 badly+0.058
 prow+0.055
 much+0.054
 far+0.053
Neg logit contributions
elling-0.047
aughter-0.044
ada-0.040
ays-0.039
ison-0.038
Token named
Token ID3706
Feature activation-1.06e+00

Pos logit contributions
 long+0.021
 badly+0.018
 far+0.016
 much+0.016
 prow+0.016
Neg logit contributions
elling-0.052
aughter-0.050
ays-0.047
ada-0.046
atted-0.045
Token Lucy
Token ID22162
Feature activation-6.83e-01

Pos logit contributions
elling+0.023
aughter+0.021
ays+0.021
isher+0.020
atted+0.020
Neg logit contributions
 long-0.026
 badly-0.023
 much-0.023
 far-0.022
 prow-0.021
Token found
Token ID1043
Feature activation+7.64e-01

Pos logit contributions
elling+0.019
aughter+0.018
ays+0.017
ada+0.017
atted+0.016
Neg logit contributions
 long-0.012
 badly-0.011
 prow-0.010
 much-0.009
 far-0.009
Token a
Token ID257
Feature activation+9.22e+00

Pos logit contributions
 long+0.009
 badly+0.008
 prow+0.007
 far+0.006
 much+0.006
Neg logit contributions
elling-0.026
aughter-0.024
ays-0.023
ada-0.023
atted-0.023
Token big
Token ID1263
Feature activation+4.81e+00

Pos logit contributions
 prow+0.131
 gall+0.126
 badly+0.124
 villains+0.123
 miles+0.122
Neg logit contributions
aughter-0.229
 Daughter-0.202
elling-0.201
 Girl-0.196
ada-0.196
Token book
Token ID1492
Feature activation+8.30e+00

Pos logit contributions
 long+0.091
 badly+0.087
 prow+0.084
 much+0.081
One+0.080
Neg logit contributions
elling-0.114
aughter-0.111
ison-0.098
ada-0.098
ener-0.096
Token called
Token ID1444
Feature activation-2.19e-01

Pos logit contributions
 prow+0.163
 long+0.158
 gall+0.158
 che+0.151
 villains+0.149
Neg logit contributions
aughter-0.174
elling-0.171
ion-0.163
ays-0.160
ison-0.143
Token a
Token ID257
Feature activation+7.85e+00

Pos logit contributions
elling+0.005
aughter+0.005
ays+0.005
ada+0.005
atted+0.005
Neg logit contributions
 long-0.005
 badly-0.004
 much-0.004
 far-0.004
 prow-0.004
Token dictionary
Token ID22155
Feature activation+7.42e+00

Pos logit contributions
 badly+0.144
 prow+0.134
 miles+0.125
One+0.123
 gall+0.119
Neg logit contributions
aughter-0.177
elling-0.156
ison-0.155
 Girl-0.151
ada-0.148
Token man
Token ID582
Feature activation+1.42e+00

Pos logit contributions
 long+0.076
 badly+0.074
 prow+0.072
 much+0.071
One+0.068
Neg logit contributions
aughter-0.065
elling-0.064
ada-0.057
ison-0.056
ener-0.054
Token who
Token ID508
Feature activation+1.63e+00

Pos logit contributions
 long+0.027
 badly+0.024
 prow+0.022
 deserted+0.022
 much+0.022
Neg logit contributions
elling-0.034
aughter-0.032
els-0.031
atted-0.031
ison-0.031
Token had
Token ID550
Feature activation+4.69e+00

Pos logit contributions
 long+0.034
 badly+0.032
 prow+0.029
 soaring+0.029
 deserted+0.029
Neg logit contributions
elling-0.031
ison-0.030
 means-0.029
aughter-0.029
cy-0.028
Token a
Token ID257
Feature activation+7.85e+00

Pos logit contributions
 badly+0.088
 quest+0.085
 cave+0.081
 prow+0.081
 trail+0.080
Neg logit contributions
 Dad-0.073
 smiles-0.073
elling-0.073
 Daughter-0.072
 opinions-0.071
Token big
Token ID1263
Feature activation+3.84e+00

Pos logit contributions
 badly+0.111
 prow+0.109
 miles+0.105
 gall+0.099
On+0.099
Neg logit contributions
aughter-0.191
 Daughter-0.172
ison-0.169
ada-0.166
elling-0.164
Token chest
Token ID7721
Feature activation+9.22e+00

Pos logit contributions
 long+0.069
 badly+0.067
 prow+0.062
 much+0.060
 far+0.058
Neg logit contributions
elling-0.096
aughter-0.093
ada-0.084
ison-0.083
ays-0.083
Token.
Token ID13
Feature activation+7.36e-01

Pos logit contributions
 long+0.164
 prow+0.163
 villains+0.154
Bl+0.151
One+0.150
Neg logit contributions
elling-0.190
aughter-0.188
atted-0.181
ener-0.175
ays-0.175
Token He
Token ID679
Feature activation+2.94e+00

Pos logit contributions
 long+0.014
 badly+0.013
 prow+0.012
 far+0.012
 much+0.012
Neg logit contributions
elling-0.019
aughter-0.018
ays-0.017
ada-0.016
atted-0.016
Token loved
Token ID6151
Feature activation-2.61e-01

Pos logit contributions
 long+0.060
 badly+0.057
 prow+0.051
 much+0.049
 deserted+0.049
Neg logit contributions
elling-0.062
aughter-0.059
ison-0.057
atted-0.055
 means-0.055
Token his
Token ID465
Feature activation-2.98e-01

Pos logit contributions
elling+0.006
aughter+0.005
els+0.005
atted+0.005
ison+0.005
Neg logit contributions
 badly-0.005
 long-0.005
ggy-0.004
 prow-0.004
 deserted-0.004
Token chest
Token ID7721
Feature activation+6.61e+00

Pos logit contributions
aughter+0.007
elling+0.007
ener+0.006
 means+0.006
 smiles+0.006
Neg logit contributions
 badly-0.005
 prow-0.004
One-0.004
 deserted-0.004
ggy-0.004
Token treasure
Token ID14068
Feature activation+5.66e-01

Pos logit contributions
elling+0.078
aughter+0.076
ener+0.066
ada+0.066
ison+0.066
Neg logit contributions
 badly-0.062
 long-0.061
 prow-0.056
 gall-0.053
 much-0.052
Token was
Token ID373
Feature activation+3.89e+00

Pos logit contributions
 long+0.010
 badly+0.009
 far+0.008
 prow+0.008
 much+0.008
Neg logit contributions
elling-0.016
aughter-0.015
ays-0.014
ada-0.014
atted-0.013
Token so
Token ID523
Feature activation+3.06e+00

Pos logit contributions
 long+0.052
 badly+0.045
 much+0.039
 far+0.039
 prow+0.039
Neg logit contributions
elling-0.126
aughter-0.120
ays-0.114
ada-0.113
atted-0.111
Token special
Token ID2041
Feature activation-3.72e-01

Pos logit contributions
 long+0.036
 badly+0.030
 much+0.027
 far+0.026
 prow+0.023
Neg logit contributions
elling-0.106
aughter-0.101
ays-0.098
ada-0.095
els-0.094
Token that
Token ID326
Feature activation-2.30e+00

Pos logit contributions
elling+0.011
aughter+0.011
ays+0.010
ada+0.010
atted+0.010
Neg logit contributions
 long-0.006
 badly-0.005
 far-0.005
 much-0.005
 prow-0.005
Token Ab
Token ID2275
Feature activation-1.23e+01

Pos logit contributions
elling+0.050
aughter+0.046
ays+0.043
ada+0.042
atted+0.041
Neg logit contributions
 long-0.056
 badly-0.051
 much-0.049
 far-0.049
 prow-0.047
Tokenbie
Token ID12590
Feature activation-4.74e+00

Pos logit contributions
elling+0.229
ion+0.191
aughter+0.190
ada+0.186
olds+0.184
Neg logit contributions
 long-0.293
 badly-0.261
 far-0.257
 cave-0.252
 prow-0.251
Token decide
Token ID5409
Feature activation+2.62e-01

Pos logit contributions
elling+0.138
aughter+0.131
ays+0.124
ada+0.122
atted+0.118
Neg logit contributions
 long-0.077
 badly-0.069
 far-0.062
 much-0.062
 prow-0.062
Token to
Token ID284
Feature activation+3.41e-01

Pos logit contributions
 long+0.005
 badly+0.004
 far+0.004
 much+0.004
 prow+0.004
Neg logit contributions
elling-0.007
aughter-0.007
ays-0.007
ada-0.007
atted-0.006
Token keep
Token ID1394
Feature activation-1.62e+00

Pos logit contributions
 long+0.007
 badly+0.006
 much+0.006
 prow+0.006
 deserted+0.005
Neg logit contributions
elling-0.008
aughter-0.008
ada-0.007
els-0.007
isher-0.007
Token it
Token ID340
Feature activation+2.34e+00

Pos logit contributions
elling+0.044
aughter+0.041
ays+0.039
ada+0.038
atted+0.038
Neg logit contributions
 long-0.029
 badly-0.027
 prow-0.024
 far-0.024
 much-0.024
TokenS
Token ID50
Feature activation-9.65e+00

Pos logit contributions
 long+0.049
 badly+0.043
 much+0.040
 far+0.039
 prow+0.039
Neg logit contributions
elling-0.082
aughter-0.078
ays-0.073
ada-0.072
atted-0.071
Tokenully
Token ID2132
Feature activation-3.05e+00

Pos logit contributions
elling+0.179
aughter+0.167
ays+0.163
ion+0.161
ours+0.156
Neg logit contributions
 long-0.232
 badly-0.207
One-0.206
 far-0.204
On-0.200
Token was
Token ID373
Feature activation+3.85e+00

Pos logit contributions
elling+0.090
aughter+0.086
ays+0.083
ada+0.080
uns+0.080
Neg logit contributions
 long-0.049
 badly-0.044
 prow-0.041
 much-0.040
 far-0.040
Token glad
Token ID9675
Feature activation-3.05e+00

Pos logit contributions
 long+0.052
 badly+0.044
 prow+0.041
 far+0.039
 cave+0.038
Neg logit contributions
elling-0.119
aughter-0.113
ada-0.107
ays-0.107
atted-0.106
Token that
Token ID326
Feature activation-4.77e+00

Pos logit contributions
elling+0.092
aughter+0.087
ays+0.082
ada+0.081
atted+0.080
Neg logit contributions
 long-0.047
 badly-0.041
 far-0.037
 prow-0.037
 much-0.037
Token J
Token ID449
Feature activation-1.21e+01

Pos logit contributions
elling+0.141
aughter+0.133
ays+0.126
ada+0.122
atted+0.121
Neg logit contributions
 long-0.081
 badly-0.070
 much-0.066
 far-0.065
 prow-0.062
Tokenody
Token ID1118
Feature activation-5.76e+00

Pos logit contributions
elling+0.232
ours+0.217
aughter+0.215
 roles+0.208
ion+0.207
Neg logit contributions
 long-0.256
 badly-0.243
 far-0.226
 prow-0.219
 hidden-0.217
Token was
Token ID373
Feature activation+2.79e+00

Pos logit contributions
elling+0.173
aughter+0.165
ays+0.156
ada+0.153
uns+0.152
Neg logit contributions
 long-0.086
 badly-0.078
 far-0.070
 prow-0.070
 much-0.070
Token safe
Token ID3338
Feature activation-8.32e-01

Pos logit contributions
 long+0.043
 prow+0.039
 cave+0.037
 badly+0.036
 quest+0.035
Neg logit contributions
elling-0.073
aughter-0.068
atted-0.067
ada-0.066
ener-0.062
Token.
Token ID13
Feature activation-1.72e+00

Pos logit contributions
elling+0.022
aughter+0.021
ays+0.020
ada+0.020
atted+0.019
Neg logit contributions
 long-0.015
 badly-0.014
 prow-0.013
 far-0.012
 much-0.012
Token He
Token ID679
Feature activation+7.12e-01

Pos logit contributions
elling+0.054
aughter+0.051
els+0.050
isher+0.049
ries+0.048
Neg logit contributions
 long-0.024
 much-0.022
 badly-0.020
 far-0.018
 prow-0.016
Token From
Token ID3574
Feature activation-1.72e+00

Pos logit contributions
elling+0.019
aughter+0.017
isher+0.017
ays+0.017
els+0.017
Neg logit contributions
 long-0.015
 much-0.014
 badly-0.014
 far-0.013
On-0.012
Token that
Token ID326
Feature activation-3.73e+00

Pos logit contributions
elling+0.048
aughter+0.045
ays+0.043
ada+0.042
uns+0.042
Neg logit contributions
 long-0.031
 badly-0.029
 far-0.028
 much-0.025
 prow-0.024
Token day
Token ID1110
Feature activation-2.97e+00

Pos logit contributions
elling+0.100
aughter+0.094
ays+0.088
ada+0.087
atted+0.086
Neg logit contributions
 long-0.070
 badly-0.063
 far-0.057
 prow-0.057
 much-0.057
Token on
Token ID319
Feature activation-4.22e+00

Pos logit contributions
elling+0.083
aughter+0.080
ays+0.074
ada+0.074
atted+0.073
Neg logit contributions
 long-0.050
 badly-0.045
 prow-0.042
 far-0.040
 much-0.040
Token,
Token ID11
Feature activation-1.49e+00

Pos logit contributions
elling+0.127
aughter+0.117
ays+0.112
ada+0.110
isher+0.109
Neg logit contributions
 long-0.072
 badly-0.063
 much-0.057
 far-0.056
 prow-0.053
Token Ab
Token ID2275
Feature activation-1.19e+01

Pos logit contributions
elling+0.035
aughter+0.031
ays+0.031
isher+0.029
els+0.029
Neg logit contributions
 long-0.035
 badly-0.032
 much-0.031
 far-0.031
 prow-0.028
Tokenbie
Token ID12590
Feature activation-4.10e+00

Pos logit contributions
elling+0.227
ion+0.192
aughter+0.188
isher+0.188
ada+0.187
Neg logit contributions
 long-0.280
 badly-0.253
 much-0.248
 far-0.248
One-0.239
Token never
Token ID1239
Feature activation-1.91e+00

Pos logit contributions
elling+0.127
aughter+0.122
ays+0.116
ada+0.114
ucks+0.111
Neg logit contributions
 long-0.059
 badly-0.050
 far-0.047
 prow-0.047
 much-0.047
Token forgot
Token ID16453
Feature activation-1.26e+00

Pos logit contributions
elling+0.050
aughter+0.047
ada+0.043
ays+0.043
atted+0.043
Neg logit contributions
 long-0.036
 badly-0.033
 prow-0.030
 much-0.030
 far-0.030
Token the
Token ID262
Feature activation-2.59e+00

Pos logit contributions
elling+0.035
aughter+0.033
ays+0.032
ada+0.031
isher+0.030
Neg logit contributions
 long-0.023
 badly-0.020
 far-0.019
 much-0.019
 prow-0.018
Token special
Token ID2041
Feature activation-5.38e+00

Pos logit contributions
elling+0.074
aughter+0.072
ener+0.066
ada+0.065
isher+0.065
Neg logit contributions
 badly-0.037
 long-0.036
 prow-0.031
 much-0.030
 gall-0.029
Token the
Token ID262
Feature activation-3.56e+00

Pos logit contributions
elling+0.008
aughter+0.007
ays+0.007
isher+0.007
atted+0.007
Neg logit contributions
 long-0.008
 badly-0.007
 far-0.007
 much-0.007
 prow-0.006
Token gate
Token ID8946
Feature activation+1.39e+00

Pos logit contributions
elling+0.089
aughter+0.087
ada+0.077
atted+0.075
ison+0.074
Neg logit contributions
 long-0.067
 badly-0.067
 prow-0.061
 much-0.057
 gall-0.057
Tokenkeeper
Token ID13884
Feature activation-1.10e+00

Pos logit contributions
 long+0.025
 badly+0.022
 far+0.021
 much+0.020
 prow+0.020
Neg logit contributions
elling-0.039
aughter-0.037
ays-0.035
ada-0.034
uns-0.034
Token agreed
Token ID4987
Feature activation-1.00e+00

Pos logit contributions
elling+0.030
aughter+0.028
ays+0.027
ada+0.026
uns+0.025
Neg logit contributions
 long-0.020
 badly-0.018
 far-0.017
 prow-0.017
 che-0.017
Token that
Token ID326
Feature activation-3.60e+00

Pos logit contributions
elling+0.028
aughter+0.026
ays+0.024
ada+0.024
atted+0.024
Neg logit contributions
 long-0.017
 badly-0.016
 prow-0.014
 far-0.014
 much-0.014
Token Ab
Token ID2275
Feature activation-1.18e+01

Pos logit contributions
elling+0.079
aughter+0.073
ays+0.068
ada+0.066
atted+0.065
Neg logit contributions
 long-0.086
 badly-0.080
 much-0.075
 far-0.075
 prow-0.074
Tokenbie
Token ID12590
Feature activation-5.09e+00

Pos logit contributions
elling+0.227
aughter+0.190
ada+0.185
ion+0.183
olds+0.181
Neg logit contributions
 long-0.269
 badly-0.247
 che-0.239
 far-0.239
 cave-0.237
Token had
Token ID550
Feature activation+1.65e+00

Pos logit contributions
elling+0.149
aughter+0.141
ays+0.134
ada+0.133
ucks+0.129
Neg logit contributions
 long-0.081
 badly-0.072
 much-0.066
 far-0.065
 prow-0.065
Token worked
Token ID3111
Feature activation+7.96e-01

Pos logit contributions
 long+0.028
 badly+0.026
 prow+0.023
 far+0.023
 much+0.023
Neg logit contributions
elling-0.046
aughter-0.044
ays-0.040
ada-0.040
atted-0.040
Token hard
Token ID1327
Feature activation-1.04e+00

Pos logit contributions
 long+0.012
 badly+0.010
 far+0.009
 prow+0.009
 much+0.009
Neg logit contributions
elling-0.024
aughter-0.023
ays-0.022
ada-0.022
atted-0.021
Token and
Token ID290
Feature activation+1.89e+00

Pos logit contributions
elling+0.027
aughter+0.026
ays+0.024
ada+0.024
atted+0.024
Neg logit contributions
 long-0.019
 badly-0.017
 prow-0.016
 far-0.016
 much-0.016
Token just
Token ID655
Feature activation-1.53e+00

Pos logit contributions
 long+0.027
 badly+0.025
 far+0.024
 much+0.023
 prow+0.023
Neg logit contributions
elling-0.038
aughter-0.035
ays-0.033
ada-0.032
uns-0.032
Token watched
Token ID7342
Feature activation-5.38e+00

Pos logit contributions
elling+0.037
aughter+0.035
ays+0.033
ada+0.032
atted+0.031
Neg logit contributions
 long-0.032
 badly-0.030
 far-0.027
 prow-0.027
 much-0.027
Token with
Token ID351
Feature activation-4.47e+00

Pos logit contributions
elling+0.145
aughter+0.134
ays+0.129
ison+0.123
isher+0.123
Neg logit contributions
 long-0.109
 badly-0.096
 far-0.087
 much-0.087
 prow-0.081
Token admiration
Token ID28506
Feature activation-4.14e+00

Pos logit contributions
elling+0.142
aughter+0.131
ays+0.127
uns+0.124
atted+0.123
Neg logit contributions
 long-0.070
 badly-0.057
 far-0.056
 much-0.055
 prow-0.050
Token as
Token ID355
Feature activation-1.30e+00

Pos logit contributions
elling+0.135
aughter+0.129
ays+0.124
ucks+0.120
ada+0.120
Neg logit contributions
 long-0.058
 badly-0.048
 much-0.043
 far-0.042
 prow-0.036
Token T
Token ID309
Feature activation-1.18e+01

Pos logit contributions
elling+0.032
aughter+0.030
ays+0.028
ada+0.027
atted+0.027
Neg logit contributions
 long-0.028
 badly-0.025
 much-0.024
 far-0.024
 prow-0.023
Tokenessa
Token ID21411
Feature activation-5.10e+00

Pos logit contributions
elling+0.212
ays+0.185
aughter+0.173
ucks+0.173
cy+0.171
Neg logit contributions
 long-0.310
 badly-0.289
 far-0.264
 much-0.260
ggy-0.257
Token sw
Token ID1509
Feature activation-3.11e+00

Pos logit contributions
elling+0.137
aughter+0.129
ays+0.120
ada+0.118
ucks+0.115
Neg logit contributions
 long-0.096
 badly-0.086
 far-0.080
 prow-0.079
 much-0.079
Tokenam
Token ID321
Feature activation+4.75e-02

Pos logit contributions
elling+0.060
aughter+0.057
ison+0.050
ada+0.049
ays+0.048
Neg logit contributions
 long-0.077
 badly-0.074
 prow-0.070
 far-0.068
 gall-0.067
Token down
Token ID866
Feature activation-5.77e-01

Pos logit contributions
 long+0.001
 badly+0.001
 far+0.001
 much+0.001
 prow+0.001
Neg logit contributions
elling-0.001
aughter-0.001
ays-0.001
ada-0.001
uns-0.001
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
elling+0.018
aughter+0.017
ays+0.015
ada+0.015
uns+0.015
Neg logit contributions
 long-0.010
 badly-0.008
 far-0.008
 much-0.008
 prow-0.007
Token.
Token ID13
Feature activation-1.39e+00

Pos logit contributions
 long+0.006
 badly+0.005
 prow+0.005
 far+0.005
 much+0.005
Neg logit contributions
elling-0.011
aughter-0.010
ays-0.010
ada-0.010
atted-0.010
Token And
Token ID843
Feature activation-5.01e-01

Pos logit contributions
elling+0.042
isher+0.039
aughter+0.039
ener+0.038
els+0.037
Neg logit contributions
 long-0.022
 much-0.021
 badly-0.018
One-0.018
 far-0.017
Token that
Token ID326
Feature activation-3.45e+00

Pos logit contributions
elling+0.016
aughter+0.015
ays+0.014
isher+0.014
uns+0.014
Neg logit contributions
 long-0.008
 badly-0.007
 much-0.007
 far-0.006
 well-0.006
Token is
Token ID318
Feature activation+2.95e+00

Pos logit contributions
elling+0.093
aughter+0.088
ays+0.082
ada+0.082
atted+0.080
Neg logit contributions
 long-0.065
 badly-0.058
 much-0.055
 far-0.054
 prow-0.053
Token how
Token ID703
Feature activation-7.53e-01

Pos logit contributions
 long+0.060
 badly+0.055
 much+0.054
 far+0.052
 prow+0.047
Neg logit contributions
elling-0.077
aughter-0.073
ays-0.069
ada-0.067
ucks-0.066
Token L
Token ID406
Feature activation-1.18e+01

Pos logit contributions
elling+0.017
aughter+0.016
ays+0.015
ada+0.014
isher+0.014
Neg logit contributions
 long-0.018
 badly-0.016
 much-0.015
 far-0.015
 prow-0.014
Tokeniza
Token ID23638
Feature activation-5.79e+00

Pos logit contributions
elling+0.214
isher+0.175
aughter+0.173
atted+0.170
 supervision+0.170
Neg logit contributions
 long-0.297
 badly-0.289
 far-0.272
 much-0.268
 well-0.265
Token always
Token ID1464
Feature activation-8.18e-01

Pos logit contributions
elling+0.161
aughter+0.151
ays+0.144
ucks+0.142
uns+0.141
Neg logit contributions
 long-0.096
 badly-0.087
 far-0.082
 much-0.080
 che-0.080
Token started
Token ID2067
Feature activation+2.99e-01

Pos logit contributions
elling+0.021
aughter+0.020
ays+0.019
ada+0.019
atted+0.018
Neg logit contributions
 long-0.016
 badly-0.014
 far-0.013
 prow-0.013
 much-0.013
Token the
Token ID262
Feature activation-1.45e+00

Pos logit contributions
 long+0.005
 badly+0.005
 much+0.004
 far+0.004
 prow+0.004
Neg logit contributions
elling-0.009
aughter-0.008
ays-0.008
ada-0.008
isher-0.007
Token day
Token ID1110
Feature activation-2.14e+00

Pos logit contributions
elling+0.040
ays+0.036
els+0.036
atted+0.036
aughter+0.035
Neg logit contributions
 long-0.030
 far-0.023
 much-0.023
 badly-0.022
 hidden-0.022
Token,
Token ID11
Feature activation+1.47e+00

Pos logit contributions
elling+0.066
aughter+0.062
ada+0.058
ays+0.058
isher+0.056
Neg logit contributions
 long-0.042
 badly-0.037
 far-0.035
 much-0.033
 prow-0.032
Token when
Token ID618
Feature activation-1.65e+00

Pos logit contributions
 long+0.025
 badly+0.022
 far+0.021
 much+0.020
 prow+0.020
Neg logit contributions
elling-0.043
aughter-0.041
ays-0.039
ada-0.038
isher-0.037
Token she
Token ID673
Feature activation-8.41e-01

Pos logit contributions
elling+0.040
aughter+0.039
ays+0.035
ada+0.035
atted+0.034
Neg logit contributions
 long-0.032
 badly-0.031
 prow-0.028
 far-0.027
 much-0.027
Token remembered
Token ID12086
Feature activation-1.51e+00

Pos logit contributions
elling+0.021
aughter+0.020
atted+0.019
ays+0.019
ener+0.018
Neg logit contributions
 long-0.016
 badly-0.015
 much-0.013
 prow-0.013
 far-0.013
Token what
Token ID644
Feature activation-3.72e+00

Pos logit contributions
elling+0.038
aughter+0.035
ays+0.033
ada+0.032
atted+0.032
Neg logit contributions
 long-0.031
 badly-0.028
 prow-0.026
 far-0.026
 much-0.026
Token Grand
Token ID5675
Feature activation-1.18e+01

Pos logit contributions
elling+0.103
aughter+0.096
ays+0.092
ada+0.090
isher+0.088
Neg logit contributions
 long-0.069
 badly-0.062
 much-0.057
 far-0.057
 prow-0.055
Tokenma
Token ID2611
Feature activation-4.95e+00

Pos logit contributions
elling+0.144
aughter+0.129
ada+0.121
ays+0.117
atted+0.108
Neg logit contributions
 long-0.362
 badly-0.348
 prow-0.340
 cave-0.335
ggy-0.326
Token mouse
Token ID10211
Feature activation-1.74e+00

Pos logit contributions
elling+0.146
aughter+0.139
ays+0.133
ucks+0.131
ada+0.129
Neg logit contributions
 long-0.079
 badly-0.071
 far-0.065
 much-0.063
 prow-0.063
Token had
Token ID550
Feature activation+5.24e-01

Pos logit contributions
elling+0.049
aughter+0.047
ays+0.044
ada+0.043
ucks+0.042
Neg logit contributions
 long-0.030
 badly-0.027
 far-0.024
 prow-0.024
 much-0.024
Token said
Token ID531
Feature activation-3.42e+00

Pos logit contributions
 long+0.010
 badly+0.009
 prow+0.009
 much+0.008
 far+0.008
Neg logit contributions
elling-0.013
aughter-0.013
ays-0.012
ada-0.012
atted-0.012
Token.
Token ID13
Feature activation-8.49e-01

Pos logit contributions
elling+0.101
aughter+0.095
ays+0.091
ada+0.088
ucks+0.087
Neg logit contributions
 long-0.058
 badly-0.052
 much-0.047
 far-0.047
 prow-0.044
Token Maggie
Token ID33014
Feature activation-2.53e+00

Pos logit contributions
elling+0.008
ays+0.007
aughter+0.007
isher+0.007
els+0.007
Neg logit contributions
 long-0.004
 badly-0.003
 much-0.003
 far-0.003
 che-0.003
Token nodded
Token ID14464
Feature activation-2.30e+00

Pos logit contributions
elling+0.067
aughter+0.062
ays+0.059
ada+0.058
uns+0.058
Neg logit contributions
 long-0.047
 badly-0.044
 far-0.042
 prow-0.042
 che-0.041
Token and
Token ID290
Feature activation+1.39e+00

Pos logit contributions
elling+0.060
aughter+0.056
ays+0.053
ada+0.052
atted+0.051
Neg logit contributions
 long-0.045
 badly-0.041
 far-0.038
 much-0.037
 prow-0.037
Token remembered
Token ID12086
Feature activation-1.66e+00

Pos logit contributions
 long+0.029
 badly+0.026
 far+0.024
 prow+0.024
 much+0.024
Neg logit contributions
elling-0.034
aughter-0.032
ays-0.030
ada-0.030
atted-0.029
Token what
Token ID644
Feature activation-4.03e+00

Pos logit contributions
elling+0.035
aughter+0.033
atted+0.031
ada+0.030
ion+0.029
Neg logit contributions
 long-0.037
 badly-0.034
 prow-0.033
 cave-0.032
 much-0.031
Token Grand
Token ID5675
Feature activation-1.18e+01

Pos logit contributions
elling+0.104
aughter+0.096
ays+0.091
ada+0.089
atted+0.087
Neg logit contributions
 long-0.083
 badly-0.075
 much-0.070
 far-0.069
 prow-0.068
Tokenma
Token ID2611
Feature activation-4.63e+00

Pos logit contributions
elling+0.144
aughter+0.126
ada+0.118
ays+0.116
atted+0.111
Neg logit contributions
 long-0.359
 badly-0.351
 cave-0.337
 prow-0.337
One-0.326
Token Mouse
Token ID21839
Feature activation-4.35e+00

Pos logit contributions
elling+0.137
aughter+0.129
ays+0.125
ada+0.121
ucks+0.121
Neg logit contributions
 long-0.073
 badly-0.068
 far-0.061
 much-0.060
 prow-0.060
Token said
Token ID531
Feature activation-3.23e+00

Pos logit contributions
elling+0.129
aughter+0.122
ays+0.118
ucks+0.116
ada+0.113
Neg logit contributions
 long-0.069
 badly-0.064
 far-0.056
 much-0.056
 prow-0.055
Token.
Token ID13
Feature activation-8.54e-01

Pos logit contributions
elling+0.097
aughter+0.092
ays+0.087
ada+0.086
atted+0.084
Neg logit contributions
 long-0.050
 badly-0.045
 much-0.040
 far-0.040
 prow-0.039
Token
Token ID220
Feature activation-1.38e+00

Pos logit contributions
elling+0.024
aughter+0.022
isher+0.022
els+0.022
ays+0.021
Neg logit contributions
 long-0.015
 much-0.013
 badly-0.013
 far-0.012
 prow-0.012
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
 long+0.010
 badly+0.008
 much+0.008
 far+0.008
 prow+0.007
Neg logit contributions
elling-0.019
aughter-0.017
ada-0.017
ays-0.017
atted-0.016
Token They
Token ID1119
Feature activation+5.47e-01

Pos logit contributions
elling+0.029
aughter+0.027
isher+0.027
ener+0.026
ison+0.026
Neg logit contributions
 long-0.025
 much-0.024
 badly-0.023
 far-0.021
 prow-0.020
Token all
Token ID477
Feature activation-8.74e-01

Pos logit contributions
 long+0.011
 badly+0.010
 much+0.009
 far+0.009
 prow+0.009
Neg logit contributions
elling-0.014
aughter-0.013
ays-0.013
ada-0.012
atted-0.012
Token listened
Token ID16399
Feature activation-1.68e+00

Pos logit contributions
elling+0.020
aughter+0.019
ays+0.018
ada+0.017
els+0.017
Neg logit contributions
 long-0.019
 badly-0.018
 much-0.017
 far-0.017
 prow-0.017
Token and
Token ID290
Feature activation+5.70e-01

Pos logit contributions
elling+0.048
aughter+0.045
ays+0.042
ada+0.042
isher+0.041
Neg logit contributions
 long-0.030
 badly-0.026
 much-0.025
 far-0.024
 prow-0.023
Token Ab
Token ID2275
Feature activation-1.18e+01

Pos logit contributions
 long+0.011
 badly+0.010
 far+0.010
 much+0.010
 prow+0.009
Neg logit contributions
elling-0.015
aughter-0.014
ays-0.014
ada-0.013
uns-0.013
Tokenbie
Token ID12590
Feature activation-5.12e+00

Pos logit contributions
elling+0.239
 roles+0.199
aughter+0.198
ion+0.198
ays+0.195
Neg logit contributions
 long-0.252
 badly-0.234
 far-0.224
 che-0.223
 much-0.219
Token felt
Token ID2936
Feature activation+2.71e+00

Pos logit contributions
elling+0.137
aughter+0.130
ays+0.122
ada+0.121
isher+0.117
Neg logit contributions
 long-0.096
 badly-0.086
 much-0.082
 far-0.080
 prow-0.080
Token so
Token ID523
Feature activation+2.38e+00

Pos logit contributions
 long+0.031
 badly+0.026
 prow+0.022
 far+0.022
 much+0.021
Neg logit contributions
elling-0.092
aughter-0.088
ays-0.084
ada-0.083
atted-0.082
Token proud
Token ID6613
Feature activation-1.08e+00

Pos logit contributions
 long+0.036
 prow+0.034
 badly+0.033
 cave+0.032
 gall+0.031
Neg logit contributions
elling-0.068
aughter-0.063
atted-0.059
ada-0.058
olds-0.056
Token and
Token ID290
Feature activation+9.53e-01

Pos logit contributions
elling+0.029
aughter+0.028
ays+0.026
ada+0.026
atted+0.025
Neg logit contributions
 long-0.020
 badly-0.018
 much-0.016
 far-0.016
 prow-0.016
Token They
Token ID1119
Feature activation+5.77e-01

Pos logit contributions
elling+0.030
aughter+0.029
isher+0.026
els+0.026
atted+0.026
Neg logit contributions
 long-0.027
 much-0.024
 badly-0.023
 far-0.022
One-0.022
Token wonder
Token ID4240
Feature activation-2.61e+00

Pos logit contributions
 long+0.012
 badly+0.012
 much+0.011
 prow+0.010
 cave+0.010
Neg logit contributions
elling-0.013
aughter-0.013
ada-0.012
 agrees-0.011
ison-0.011
Token what
Token ID644
Feature activation-3.99e+00

Pos logit contributions
elling+0.067
aughter+0.063
ays+0.059
ada+0.057
isher+0.055
Neg logit contributions
 long-0.054
 badly-0.048
 far-0.046
 much-0.045
 prow-0.044
Token other
Token ID584
Feature activation-7.33e+00

Pos logit contributions
elling+0.106
aughter+0.099
ada+0.094
ays+0.093
atted+0.091
Neg logit contributions
 long-0.080
 badly-0.069
 much-0.067
 far-0.067
 prow-0.064
Token things
Token ID1243
Feature activation-1.49e+00

Pos logit contributions
elling+0.191
aughter+0.182
ays+0.170
ada+0.167
atted+0.164
Neg logit contributions
 long-0.144
 badly-0.129
 much-0.121
 far-0.120
 prow-0.117
Token Grand
Token ID5675
Feature activation-1.18e+01

Pos logit contributions
elling+0.039
aughter+0.037
ays+0.034
ada+0.034
atted+0.034
Neg logit contributions
 long-0.029
 badly-0.026
 far-0.026
 much-0.025
 prow-0.024
Tokenpa
Token ID8957
Feature activation-3.79e+00

Pos logit contributions
elling+0.149
aughter+0.136
ada+0.126
atted+0.120
ays+0.118
Neg logit contributions
 long-0.347
 badly-0.334
 che-0.323
One-0.319
 much-0.317
Token used
Token ID973
Feature activation-2.04e+00

Pos logit contributions
elling+0.094
aughter+0.089
ada+0.085
ays+0.083
isher+0.081
Neg logit contributions
 long-0.077
 far-0.069
 badly-0.067
 much-0.066
 che-0.065
Token to
Token ID284
Feature activation-1.82e+00

Pos logit contributions
elling+0.065
aughter+0.061
ays+0.058
ada+0.058
ucks+0.057
Neg logit contributions
 long-0.031
 badly-0.026
 far-0.025
 much-0.024
 hidden-0.021
Token wear
Token ID5806
Feature activation-1.44e-01

Pos logit contributions
elling+0.053
aughter+0.051
ays+0.048
ada+0.047
atted+0.046
Neg logit contributions
 long-0.029
 badly-0.026
 far-0.023
 much-0.023
 prow-0.023
Token and
Token ID290
Feature activation+3.28e-01

Pos logit contributions
elling+0.005
aughter+0.004
ada+0.004
ays+0.004
atted+0.004
Neg logit contributions
 long-0.002
 badly-0.002
 far-0.002
 much-0.002
 prow-0.001
Token had
Token ID550
Feature activation+1.80e+00

Pos logit contributions
 long+0.002
 badly+0.002
 far+0.002
 much+0.002
 prow+0.002
Neg logit contributions
elling-0.005
aughter-0.005
ays-0.005
ada-0.005
atted-0.005
Token so
Token ID523
Feature activation+2.06e+00

Pos logit contributions
 long+0.021
 badly+0.018
 prow+0.015
 far+0.015
 much+0.015
Neg logit contributions
elling-0.061
aughter-0.058
ays-0.055
ada-0.055
atted-0.054
Token much
Token ID881
Feature activation+5.68e-01

Pos logit contributions
 long+0.010
 badly+0.007
 prow+0.006
 cave+0.005
 deserted+0.004
Neg logit contributions
elling-0.081
aughter-0.078
atted-0.075
ays-0.074
ada-0.074
Token fun
Token ID1257
Feature activation+1.03e+00

Pos logit contributions
 badly+0.007
 prow+0.006
 long+0.006
 deserted+0.006
 cave+0.005
Neg logit contributions
elling-0.018
aughter-0.016
olds-0.016
 agrees-0.016
atted-0.016
Token.
Token ID13
Feature activation-5.30e-01

Pos logit contributions
 long+0.019
 badly+0.018
 prow+0.017
 deserted+0.016
 far+0.016
Neg logit contributions
elling-0.025
aughter-0.024
ays-0.023
atted-0.022
els-0.022
Token K
Token ID509
Feature activation-1.17e+01

Pos logit contributions
elling+0.013
aughter+0.013
isher+0.012
ener+0.012
ays+0.012
Neg logit contributions
 long-0.011
 much-0.010
 badly-0.010
 prow-0.009
 far-0.009
Tokenait
Token ID4548
Feature activation-9.94e+00

Pos logit contributions
elling+0.199
aughter+0.176
ays+0.170
 supervision+0.168
isher+0.168
Neg logit contributions
 long-0.295
 badly-0.292
 much-0.268
One-0.262
 far-0.262
Tokenlyn
Token ID6213
Feature activation-1.84e+00

Pos logit contributions
elling+0.230
ays+0.211
aughter+0.206
 supervision+0.205
ours+0.199
Neg logit contributions
 badly-0.189
 prow-0.178
 long-0.176
 cave-0.176
 never-0.175
Token was
Token ID373
Feature activation+3.40e+00

Pos logit contributions
elling+0.055
aughter+0.052
ays+0.049
uns+0.048
isher+0.048
Neg logit contributions
 long-0.030
 badly-0.026
 far-0.025
 much-0.025
 prow-0.024
Token so
Token ID523
Feature activation+3.48e+00

Pos logit contributions
 long+0.047
 badly+0.040
 prow+0.038
 cave+0.036
 far+0.035
Neg logit contributions
elling-0.103
aughter-0.098
ada-0.093
atted-0.092
ays-0.091
Token happy
Token ID3772
Feature activation+1.71e-01

Pos logit contributions
 long+0.056
 badly+0.051
 prow+0.048
 cave+0.046
 far+0.044
Neg logit contributions
elling-0.099
aughter-0.095
ada-0.088
atted-0.087
ays-0.086
Token the
Token ID262
Feature activation-1.90e+00

Pos logit contributions
elling+0.113
aughter+0.103
isher+0.096
uns+0.096
ada+0.095
Neg logit contributions
 long-0.088
 badly-0.075
 far-0.074
 much-0.072
 prow-0.069
Token day
Token ID1110
Feature activation-1.57e+00

Pos logit contributions
elling+0.049
aughter+0.046
ada+0.042
ays+0.042
atted+0.041
Neg logit contributions
 long-0.036
 badly-0.035
 prow-0.032
 far-0.031
 much-0.031
Token,
Token ID11
Feature activation-2.96e-01

Pos logit contributions
elling+0.050
aughter+0.046
isher+0.045
ays+0.044
ucks+0.043
Neg logit contributions
 long-0.026
 much-0.020
 far-0.019
 badly-0.019
 che-0.017
Token Bella
Token ID39304
Feature activation-2.22e+00

Pos logit contributions
elling+0.007
aughter+0.007
ays+0.006
isher+0.006
atted+0.006
Neg logit contributions
 long-0.007
 badly-0.006
 far-0.006
 much-0.006
 prow-0.006
Token kissed
Token ID27946
Feature activation-3.12e+00

Pos logit contributions
elling+0.060
aughter+0.056
ays+0.053
isher+0.051
ada+0.051
Neg logit contributions
 long-0.042
 badly-0.037
 far-0.035
 che-0.035
 much-0.035
Token Aless
Token ID47319
Feature activation-1.17e+01

Pos logit contributions
elling+0.090
aughter+0.081
ays+0.079
ucks+0.076
oke+0.075
Neg logit contributions
 long-0.061
 badly-0.052
 far-0.050
 much-0.047
 gall-0.047
Tokenio
Token ID952
Feature activation-4.02e+00

Pos logit contributions
elling+0.178
aughter+0.153
 supervision+0.142
ion+0.137
ays+0.136
Neg logit contributions
 badly-0.301
ggy-0.295
 long-0.292
gh-0.287
 che-0.270
Token on
Token ID319
Feature activation-3.53e+00

Pos logit contributions
elling+0.094
aughter+0.088
ays+0.082
ada+0.080
isher+0.077
Neg logit contributions
 long-0.090
 badly-0.081
 much-0.077
 far-0.076
 prow-0.076
Token the
Token ID262
Feature activation-2.03e+00

Pos logit contributions
elling+0.081
aughter+0.076
ays+0.070
ada+0.069
atted+0.068
Neg logit contributions
 long-0.079
 badly-0.073
 prow-0.068
 far-0.068
 much-0.067
Token cheek
Token ID19353
Feature activation-4.36e-01

Pos logit contributions
elling+0.058
aughter+0.055
ada+0.051
ays+0.051
atted+0.050
Neg logit contributions
 long-0.033
 badly-0.030
 prow-0.027
 much-0.027
 far-0.027
Token and
Token ID290
Feature activation-6.22e-01

Pos logit contributions
elling+0.011
aughter+0.010
ays+0.010
ada+0.010
atted+0.009
Neg logit contributions
 long-0.008
 badly-0.008
 prow-0.007
 far-0.007
 much-0.007
Token is
Token ID318
Feature activation+4.15e+00

Pos logit contributions
elling+0.005
aughter+0.005
ada+0.005
uns+0.005
ays+0.005
Neg logit contributions
 long-0.006
 badly-0.005
 far-0.005
 much-0.005
 prow-0.005
Token gone
Token ID3750
Feature activation+1.08e+00

Pos logit contributions
 prow+0.057
 long+0.055
 badly+0.054
 gall+0.053
One+0.053
Neg logit contributions
elling-0.110
ays-0.096
aughter-0.096
els-0.095
ion-0.093
Token.
Token ID13
Feature activation-1.51e+00

Pos logit contributions
 prow+0.020
 long+0.020
 gall+0.018
 badly+0.018
 quest+0.017
Neg logit contributions
elling-0.026
aughter-0.024
ays-0.023
ener-0.022
ada-0.021
Token What
Token ID1867
Feature activation-3.32e+00

Pos logit contributions
elling+0.034
aughter+0.033
atted+0.031
els+0.031
uns+0.030
Neg logit contributions
 long-0.032
 badly-0.029
 much-0.028
 far-0.027
 prow-0.026
Token if
Token ID611
Feature activation-4.31e+00

Pos logit contributions
elling+0.071
aughter+0.068
ays+0.062
 means+0.059
ada+0.059
Neg logit contributions
 long-0.076
 badly-0.071
 prow-0.067
 far-0.065
 cave-0.064
Token Grand
Token ID5675
Feature activation-1.17e+01

Pos logit contributions
elling+0.115
aughter+0.108
ays+0.101
ada+0.101
atted+0.098
Neg logit contributions
 long-0.083
 badly-0.074
 far-0.069
 much-0.068
 prow-0.067
Tokenma
Token ID2611
Feature activation-5.20e+00

Pos logit contributions
elling+0.163
atted+0.147
arted+0.146
ener+0.145
aughter+0.141
Neg logit contributions
 badly-0.300
 long-0.290
ggy-0.283
One-0.280
 hidden-0.274
Token needs
Token ID2476
Feature activation-1.36e+00

Pos logit contributions
elling+0.110
aughter+0.106
ada+0.097
isher+0.094
uns+0.094
Neg logit contributions
 long-0.118
 badly-0.109
 far-0.109
 che-0.103
 prow-0.102
Token it
Token ID340
Feature activation+3.04e-01

Pos logit contributions
elling+0.038
aughter+0.036
ays+0.034
ada+0.033
atted+0.033
Neg logit contributions
 long-0.024
 badly-0.022
 far-0.020
 much-0.020
 prow-0.019
Token?"
Token ID1701
Feature activation-2.61e+00

Pos logit contributions
 long+0.006
 badly+0.005
 prow+0.005
 much+0.005
 far+0.005
Neg logit contributions
elling-0.008
aughter-0.008
ays-0.007
ada-0.007
atted-0.007
Token Anna
Token ID11735
Feature activation-2.51e+00

Pos logit contributions
elling+0.057
aughter+0.052
ada+0.047
isher+0.047
ays+0.047
Neg logit contributions
 long-0.066
 badly-0.059
 much-0.057
 far-0.057
 prow-0.055
Token longer
Token ID2392
Feature activation+1.48e+00

Pos logit contributions
elling+0.042
aughter+0.039
ada+0.037
atted+0.037
ays+0.037
Neg logit contributions
 long-0.020
 badly-0.018
 prow-0.017
 far-0.016
 much-0.016
Token sad
Token ID6507
Feature activation-1.87e+00

Pos logit contributions
 long+0.023
 badly+0.020
 prow+0.020
 far+0.018
 cave+0.018
Neg logit contributions
elling-0.042
aughter-0.040
ada-0.037
atted-0.037
ays-0.037
Token.
Token ID13
Feature activation-3.44e-01

Pos logit contributions
elling+0.054
aughter+0.051
ays+0.049
ada+0.048
atted+0.048
Neg logit contributions
 long-0.029
 badly-0.026
 prow-0.024
 far-0.023
 much-0.023
Token Even
Token ID3412
Feature activation-1.97e+00

Pos logit contributions
elling+0.011
aughter+0.010
isher+0.010
els+0.009
ener+0.009
Neg logit contributions
 long-0.006
 much-0.005
 badly-0.004
 far-0.004
One-0.004
Token though
Token ID996
Feature activation-2.69e+00

Pos logit contributions
elling+0.050
aughter+0.047
ays+0.043
ada+0.043
atted+0.041
Neg logit contributions
 long-0.042
 badly-0.038
 far-0.036
 much-0.036
 prow-0.034
Token T
Token ID309
Feature activation-1.16e+01

Pos logit contributions
elling+0.062
aughter+0.058
ays+0.054
ada+0.053
atted+0.053
Neg logit contributions
 long-0.056
 badly-0.053
 prow-0.050
 far-0.048
 much-0.048
Tokenuna
Token ID9613
Feature activation-3.25e+00

Pos logit contributions
elling+0.289
aughter+0.269
isher+0.259
ays+0.256
atted+0.251
Neg logit contributions
 long-0.223
 badly-0.206
ggy-0.189
 much-0.180
 prow-0.180
Token missed
Token ID6825
Feature activation-2.84e+00

Pos logit contributions
elling+0.091
aughter+0.087
ays+0.081
ada+0.080
uns+0.078
Neg logit contributions
 long-0.057
 badly-0.052
 prow-0.047
 far-0.047
 much-0.046
Token his
Token ID465
Feature activation-3.63e+00

Pos logit contributions
elling+0.081
aughter+0.076
ays+0.072
ada+0.070
isher+0.068
Neg logit contributions
 long-0.054
 badly-0.047
 much-0.044
 far-0.044
 prow-0.042
Token old
Token ID1468
Feature activation-2.51e+00

Pos logit contributions
elling+0.109
aughter+0.103
ays+0.098
ada+0.096
atted+0.094
Neg logit contributions
 long-0.059
 badly-0.051
 far-0.046
 much-0.045
 prow-0.045
Token friends
Token ID2460
Feature activation-1.55e+00

Pos logit contributions
elling+0.073
aughter+0.068
ays+0.065
ada+0.064
atted+0.063
Neg logit contributions
 long-0.043
 badly-0.038
 far-0.034
 prow-0.034
 much-0.034
Token he
Token ID339
Feature activation+1.23e+00

Pos logit contributions
 long+0.033
 badly+0.029
 much+0.027
 far+0.027
 prow+0.026
Neg logit contributions
elling-0.055
aughter-0.052
ays-0.049
ada-0.048
els-0.047
Token opened
Token ID4721
Feature activation-2.78e-01

Pos logit contributions
 long+0.021
 badly+0.019
 much+0.018
 far+0.018
 prow+0.017
Neg logit contributions
elling-0.035
aughter-0.033
ays-0.031
ada-0.031
uns-0.030
Token the
Token ID262
Feature activation-1.87e+00

Pos logit contributions
elling+0.007
aughter+0.007
ada+0.006
ays+0.006
atted+0.006
Neg logit contributions
 long-0.006
 badly-0.005
 far-0.005
 much-0.005
 prow-0.005
Token gate
Token ID8946
Feature activation+2.90e+00

Pos logit contributions
elling+0.048
aughter+0.046
ays+0.042
ada+0.041
atted+0.041
Neg logit contributions
 long-0.036
 badly-0.033
 prow-0.031
 much-0.030
 far-0.029
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
 long+0.047
 badly+0.042
 prow+0.038
 much+0.037
 far+0.037
Neg logit contributions
elling-0.084
aughter-0.080
ays-0.075
ada-0.074
atted-0.073
Token Ab
Token ID2275
Feature activation-1.16e+01

Pos logit contributions
elling+0.024
aughter+0.022
ays+0.021
els+0.021
isher+0.021
Neg logit contributions
 long-0.022
 badly-0.019
 much-0.019
 far-0.018
 prow-0.018
Tokenbie
Token ID12590
Feature activation-3.24e+00

Pos logit contributions
elling+0.229
ion+0.193
 roles+0.193
 supervision+0.193
aughter+0.189
Neg logit contributions
 long-0.240
 badly-0.229
 far-0.219
One-0.214
 che-0.214
Token was
Token ID373
Feature activation+3.58e+00

Pos logit contributions
elling+0.090
aughter+0.085
ays+0.080
ada+0.079
uns+0.077
Neg logit contributions
 long-0.058
 badly-0.052
 prow-0.048
 far-0.048
 much-0.048
Token so
Token ID523
Feature activation+3.38e+00

Pos logit contributions
 long+0.053
 badly+0.047
 prow+0.042
 far+0.041
 cave+0.040
Neg logit contributions
elling-0.107
aughter-0.102
ada-0.095
ays-0.095
atted-0.095
Token happy
Token ID3772
Feature activation+3.44e-01

Pos logit contributions
 long+0.056
 badly+0.050
 prow+0.045
 far+0.044
 much+0.044
Neg logit contributions
elling-0.098
aughter-0.093
ays-0.087
ada-0.087
atted-0.085
Token and
Token ID290
Feature activation+7.94e-01

Pos logit contributions
 long+0.006
 badly+0.006
 prow+0.005
 far+0.005
 much+0.005
Neg logit contributions
elling-0.009
aughter-0.009
ays-0.008
ada-0.008
atted-0.008
Token have
Token ID423
Feature activation+3.19e-02

Pos logit contributions
elling+0.053
aughter+0.049
ada+0.045
ays+0.045
atted+0.044
Neg logit contributions
 long-0.051
 badly-0.048
 prow-0.044
 much-0.044
 far-0.044
Token fun
Token ID1257
Feature activation+2.10e-01

Pos logit contributions
 long+0.001
 badly+0.000
 prow+0.000
 far+0.000
 cave+0.000
Neg logit contributions
elling-0.001
aughter-0.001
ays-0.001
atted-0.001
ada-0.001
Token without
Token ID1231
Feature activation-4.85e+00

Pos logit contributions
 long+0.004
 badly+0.003
 prow+0.003
 far+0.003
 much+0.003
Neg logit contributions
elling-0.006
aughter-0.005
ays-0.005
atted-0.005
ada-0.005
Token it
Token ID340
Feature activation-1.10e+00

Pos logit contributions
elling+0.113
aughter+0.105
ada+0.097
atted+0.096
ays+0.096
Neg logit contributions
 long-0.102
 badly-0.094
 prow-0.089
 far-0.087
 much-0.087
Token."
Token ID526
Feature activation-1.07e+00

Pos logit contributions
elling+0.031
aughter+0.029
ays+0.027
ada+0.027
atted+0.026
Neg logit contributions
 long-0.019
 badly-0.017
 prow-0.015
 far-0.015
 much-0.015
Token K
Token ID509
Feature activation-1.16e+01

Pos logit contributions
elling+0.035
aughter+0.032
isher+0.030
ays+0.030
ener+0.030
Neg logit contributions
 long-0.015
 badly-0.014
 much-0.013
 prow-0.012
 gall-0.012
Tokenait
Token ID4548
Feature activation-9.59e+00

Pos logit contributions
elling+0.207
aughter+0.178
ays+0.174
 roles+0.169
olds+0.164
Neg logit contributions
 badly-0.279
 long-0.270
 prow-0.247
One-0.243
 gall-0.242
Tokenlyn
Token ID6213
Feature activation-1.65e+00

Pos logit contributions
elling+0.221
ays+0.200
aughter+0.189
ucks+0.186
ours+0.184
Neg logit contributions
 badly-0.196
 long-0.181
 gall-0.179
 prow-0.175
 far-0.175
Token smiled
Token ID13541
Feature activation-1.07e+00

Pos logit contributions
elling+0.039
aughter+0.037
ays+0.034
ada+0.033
uns+0.033
Neg logit contributions
 long-0.036
 badly-0.033
 much-0.031
 far-0.031
 prow-0.031
Token and
Token ID290
Feature activation-9.88e-02

Pos logit contributions
elling+0.025
aughter+0.023
ada+0.022
atted+0.022
ays+0.022
Neg logit contributions
 long-0.022
 badly-0.020
 prow-0.020
 far-0.019
 cave-0.019
Token agreed
Token ID4987
Feature activation-1.72e+00

Pos logit contributions
elling+0.002
aughter+0.002
ays+0.002
ada+0.002
els+0.002
Neg logit contributions
 long-0.002
 badly-0.002
 far-0.002
 much-0.002
 prow-0.002
Token children
Token ID1751
Feature activation-1.21e+00

Pos logit contributions
elling+0.136
ays+0.134
ucks+0.123
aughter+0.122
atted+0.121
Neg logit contributions
 long-0.073
 much-0.054
 big-0.054
 badly-0.053
 cave-0.053
Token didn
Token ID1422
Feature activation+2.15e+00

Pos logit contributions
elling+0.032
aughter+0.029
ays+0.028
ucks+0.027
uns+0.027
Neg logit contributions
 long-0.023
 badly-0.022
 far-0.021
 much-0.020
 prow-0.020
Token't
Token ID470
Feature activation+9.36e-01

Pos logit contributions
 long+0.035
 badly+0.032
 prow+0.030
 far+0.029
 cave+0.029
Neg logit contributions
elling-0.060
aughter-0.057
ays-0.054
ada-0.054
ion-0.052
Token know
Token ID760
Feature activation-2.44e+00

Pos logit contributions
 long+0.019
 badly+0.018
 prow+0.017
 far+0.016
 much+0.016
Neg logit contributions
elling-0.023
aughter-0.021
ada-0.020
ays-0.019
atted-0.019
Token what
Token ID644
Feature activation-4.23e+00

Pos logit contributions
elling+0.064
aughter+0.061
ays+0.056
ada+0.055
ucks+0.054
Neg logit contributions
 long-0.049
 badly-0.045
 much-0.042
 far-0.041
 prow-0.040
Token Mam
Token ID29926
Feature activation-1.16e+01

Pos logit contributions
elling+0.123
aughter+0.113
ays+0.110
ada+0.107
ucks+0.105
Neg logit contributions
 long-0.078
 badly-0.068
 much-0.064
 far-0.063
 prow-0.060
Tokenma
Token ID2611
Feature activation-5.70e+00

Pos logit contributions
elling+0.200
ucks+0.182
ays+0.169
arted+0.152
aughter+0.148
Neg logit contributions
 long-0.288
 badly-0.277
 che-0.274
 much-0.263
 far-0.262
Token meant
Token ID4001
Feature activation-2.04e+00

Pos logit contributions
elling+0.169
aughter+0.157
ucks+0.156
ays+0.154
isher+0.147
Neg logit contributions
 long-0.089
 badly-0.083
 much-0.074
 che-0.074
 far-0.072
Token but
Token ID475
Feature activation-1.29e+00

Pos logit contributions
elling+0.062
aughter+0.057
ays+0.055
ada+0.053
ucks+0.053
Neg logit contributions
 long-0.034
 badly-0.030
 much-0.027
 far-0.026
 prow-0.024
Token they
Token ID484
Feature activation+6.93e-02

Pos logit contributions
elling+0.040
aughter+0.037
ays+0.036
ucks+0.035
uns+0.034
Neg logit contributions
 long-0.021
 badly-0.019
 much-0.018
 far-0.018
 hidden-0.016
Token ran
Token ID4966
Feature activation-4.05e-01

Pos logit contributions
 long+0.001
 badly+0.001
 far+0.001
 prow+0.001
 much+0.001
Neg logit contributions
elling-0.002
aughter-0.002
ays-0.002
ada-0.002
atted-0.001
Token and
Token ID290
Feature activation+4.55e-01

Pos logit contributions
 long+0.002
 badly+0.002
 prow+0.002
 far+0.002
 much+0.002
Neg logit contributions
elling-0.002
aughter-0.002
ays-0.002
ada-0.002
atted-0.002
Token everyone
Token ID2506
Feature activation-1.29e+00

Pos logit contributions
 long+0.011
 badly+0.010
 much+0.009
 far+0.009
 prow+0.009
Neg logit contributions
elling-0.010
aughter-0.009
ays-0.009
ada-0.009
els-0.008
Token was
Token ID373
Feature activation+3.44e+00

Pos logit contributions
elling+0.037
aughter+0.035
ays+0.033
ada+0.032
uns+0.032
Neg logit contributions
 long-0.022
 badly-0.020
 far-0.019
 much-0.019
 prow-0.018
Token happy
Token ID3772
Feature activation-1.25e+00

Pos logit contributions
 long+0.047
 badly+0.040
 prow+0.039
 cave+0.037
 quest+0.035
Neg logit contributions
elling-0.101
aughter-0.098
ada-0.093
atted-0.093
ays-0.092
Token.
Token ID13
Feature activation-1.81e+00

Pos logit contributions
elling+0.033
aughter+0.032
ays+0.030
atted+0.030
ada+0.029
Neg logit contributions
 long-0.022
 badly-0.020
 prow-0.019
 far-0.018
 much-0.017
Token T
Token ID309
Feature activation-1.15e+01

Pos logit contributions
elling+0.045
aughter+0.042
isher+0.039
els+0.039
atted+0.038
Neg logit contributions
 long-0.039
 much-0.034
 badly-0.033
 far-0.032
 prow-0.031
Tokenoni
Token ID14651
Feature activation-2.67e+00

Pos logit contributions
elling+0.187
aughter+0.160
ays+0.150
isher+0.149
atted+0.149
Neg logit contributions
 long-0.317
 badly-0.307
 much-0.285
 che-0.283
ggy-0.279
Token was
Token ID373
Feature activation+2.73e+00

Pos logit contributions
elling+0.071
aughter+0.067
ays+0.064
ada+0.061
uns+0.060
Neg logit contributions
 long-0.050
 badly-0.046
 far-0.043
 much-0.043
 prow-0.043
Token happy
Token ID3772
Feature activation-1.27e+00

Pos logit contributions
 long+0.041
 badly+0.036
 prow+0.033
 cave+0.031
 far+0.031
Neg logit contributions
elling-0.080
aughter-0.076
ada-0.072
atted-0.071
ays-0.071
Token to
Token ID284
Feature activation-2.20e+00

Pos logit contributions
elling+0.034
aughter+0.032
ays+0.030
atted+0.030
ada+0.030
Neg logit contributions
 long-0.022
 badly-0.020
 prow-0.019
 far-0.018
 much-0.018
Token have
Token ID423
Feature activation-9.76e-01

Pos logit contributions
elling+0.059
aughter+0.056
ada+0.053
ays+0.052
atted+0.051
Neg logit contributions
 long-0.038
 badly-0.036
 prow-0.032
 far-0.032
 much-0.031
Token not
Token ID407
Feature activation+1.99e+00

Pos logit contributions
 long+0.065
 badly+0.056
 prow+0.054
 cave+0.051
 much+0.050
Neg logit contributions
elling-0.155
aughter-0.153
ada-0.146
ays-0.141
olds-0.139
Token believe
Token ID1975
Feature activation-3.06e+00

Pos logit contributions
 long+0.041
 much+0.037
 badly+0.037
 prow+0.036
 cave+0.035
Neg logit contributions
elling-0.039
ada-0.038
aughter-0.037
ener-0.037
cy-0.034
Token how
Token ID703
Feature activation-7.67e-01

Pos logit contributions
elling+0.081
aughter+0.076
ays+0.070
ada+0.069
uns+0.068
Neg logit contributions
 long-0.060
 badly-0.056
 far-0.051
 much-0.050
 prow-0.048
Token care
Token ID1337
Feature activation-2.62e+00

Pos logit contributions
elling+0.023
aughter+0.022
atted+0.021
ada+0.021
ays+0.020
Neg logit contributions
 long-0.009
 badly-0.009
 prow-0.008
 cave-0.008
 gall-0.008
Tokenfree
Token ID5787
Feature activation-5.64e+00

Pos logit contributions
elling+0.078
aughter+0.075
ays+0.070
ada+0.070
atted+0.069
Neg logit contributions
 long-0.042
 badly-0.037
 much-0.034
 far-0.033
 prow-0.032
Token T
Token ID309
Feature activation-1.15e+01

Pos logit contributions
elling+0.167
aughter+0.159
ays+0.147
ada+0.147
atted+0.145
Neg logit contributions
 long-0.091
 badly-0.082
 far-0.072
 much-0.072
 prow-0.071
Tokenessa
Token ID21411
Feature activation-4.22e+00

Pos logit contributions
elling+0.194
aughter+0.170
ays+0.164
atted+0.158
isher+0.155
Neg logit contributions
 long-0.312
 badly-0.298
 far-0.276
 much-0.275
 prow-0.267
Token was
Token ID373
Feature activation+2.54e+00

Pos logit contributions
elling+0.118
aughter+0.111
ays+0.103
ada+0.103
uns+0.100
Neg logit contributions
 long-0.073
 badly-0.068
 far-0.061
 much-0.061
 prow-0.060
Token.
Token ID13
Feature activation-3.55e-01

Pos logit contributions
 long+0.040
 badly+0.035
 prow+0.034
One+0.033
 cave+0.033
Neg logit contributions
elling-0.068
aughter-0.065
ada-0.061
atted-0.060
ays-0.060
Token She
Token ID1375
Feature activation+2.17e+00

Pos logit contributions
elling+0.008
aughter+0.008
isher+0.007
els+0.007
uns+0.007
Neg logit contributions
 long-0.008
 badly-0.007
 much-0.007
 far-0.007
 prow-0.006
Token just
Token ID655
Feature activation-1.14e+00

Pos logit contributions
 long+0.043
 badly+0.040
 much+0.036
 prow+0.036
 far+0.036
Neg logit contributions
elling-0.053
aughter-0.050
atted-0.047
ada-0.047
ays-0.046
Token the
Token ID262
Feature activation-1.82e+00

Pos logit contributions
elling+0.069
aughter+0.065
ays+0.060
atted+0.059
ada+0.058
Neg logit contributions
 long-0.059
 badly-0.052
 much-0.050
 far-0.050
 prow-0.047
Token sky
Token ID6766
Feature activation+1.14e+00

Pos logit contributions
elling+0.048
aughter+0.044
ada+0.040
ays+0.040
ison+0.040
Neg logit contributions
 badly-0.034
 long-0.034
 prow-0.030
 much-0.029
 gall-0.028
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
 long+0.019
 badly+0.017
 prow+0.016
 far+0.015
 much+0.015
Neg logit contributions
elling-0.032
aughter-0.031
ays-0.029
ada-0.028
atted-0.028
Token And
Token ID843
Feature activation-2.20e-02

Pos logit contributions
elling+0.029
aughter+0.029
isher+0.028
ener+0.027
ays+0.027
Neg logit contributions
 long-0.017
 much-0.016
 badly-0.014
One-0.014
 prow-0.013
Token whenever
Token ID8797
Feature activation-2.86e+00

Pos logit contributions
elling+0.001
aughter+0.001
ays+0.001
isher+0.001
ada+0.001
Neg logit contributions
 long-0.000
 badly-0.000
 much-0.000
 far-0.000
 prow-0.000
Token Mitt
Token ID16627
Feature activation-1.15e+01

Pos logit contributions
elling+0.055
aughter+0.050
 means+0.046
ada+0.046
els+0.046
Neg logit contributions
 long-0.064
 badly-0.062
 prow-0.060
 deserted-0.060
 far-0.058
Tokenens
Token ID641
Feature activation-5.43e+00

Pos logit contributions
elling+0.140
aughter+0.129
ays+0.118
isher+0.112
els+0.105
Neg logit contributions
 long-0.370
 badly-0.356
 much-0.331
 far-0.327
 prow-0.327
Token felt
Token ID2936
Feature activation+1.70e+00

Pos logit contributions
elling+0.148
aughter+0.140
ays+0.131
ada+0.130
atted+0.127
Neg logit contributions
 long-0.098
 badly-0.088
 far-0.081
 prow-0.081
 much-0.080
Token hungry
Token ID14720
Feature activation-1.86e+00

Pos logit contributions
 long+0.024
 badly+0.021
 prow+0.020
 cave+0.018
 far+0.018
Neg logit contributions
elling-0.052
aughter-0.048
ays-0.046
ada-0.045
atted-0.045
Token,
Token ID11
Feature activation-2.45e-01

Pos logit contributions
elling+0.048
aughter+0.045
ada+0.041
ays+0.041
atted+0.041
Neg logit contributions
 long-0.035
 badly-0.032
 prow-0.030
 far-0.029
 much-0.029
Token Tweet
Token ID18752
Feature activation-4.32e+00

Pos logit contributions
elling+0.006
aughter+0.006
ays+0.005
ada+0.005
atted+0.005
Neg logit contributions
 long-0.005
 badly-0.005
 prow-0.004
 far-0.004
 much-0.004
Token box
Token ID3091
Feature activation+3.27e+00

Pos logit contributions
elling+0.041
aughter+0.039
ada+0.035
ener+0.035
isher+0.035
Neg logit contributions
 long-0.031
 badly-0.029
 prow-0.027
 much-0.027
 far-0.026
Token.
Token ID13
Feature activation-7.90e-01

Pos logit contributions
 long+0.050
 badly+0.046
 prow+0.043
 much+0.040
 cave+0.040
Neg logit contributions
elling-0.093
aughter-0.088
ays-0.083
ada-0.082
atted-0.082
Token It
Token ID632
Feature activation+1.45e+00

Pos logit contributions
elling+0.020
aughter+0.019
isher+0.019
atted+0.018
ays+0.018
Neg logit contributions
 long-0.017
 much-0.014
 badly-0.014
 far-0.013
 prow-0.013
Token made
Token ID925
Feature activation-6.82e-01

Pos logit contributions
 long+0.025
 badly+0.022
 prow+0.020
 far+0.020
 much+0.020
Neg logit contributions
elling-0.042
aughter-0.040
ays-0.037
ada-0.037
atted-0.036
Token a
Token ID257
Feature activation+6.55e+00

Pos logit contributions
elling+0.021
aughter+0.020
ays+0.019
isher+0.018
atted+0.018
Neg logit contributions
 long-0.012
 much-0.009
 badly-0.009
 far-0.009
 prow-0.008
Token bright
Token ID6016
Feature activation+4.62e+00

Pos logit contributions
 badly+0.079
 prow+0.074
 long+0.073
One+0.070
 wandering+0.068
Neg logit contributions
elling-0.179
aughter-0.175
ada-0.161
oke-0.156
ison-0.155
Token light
Token ID1657
Feature activation+3.23e+00

Pos logit contributions
 long+0.078
 badly+0.072
 prow+0.067
 far+0.065
 much+0.063
Neg logit contributions
elling-0.128
aughter-0.120
ays-0.112
ada-0.111
atted-0.110
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
 long+0.059
 badly+0.054
 prow+0.049
 far+0.049
 much+0.049
Neg logit contributions
elling-0.086
aughter-0.081
ays-0.076
ada-0.075
atted-0.074
Token She
Token ID1375
Feature activation+1.93e+00

Pos logit contributions
elling+0.025
isher+0.023
aughter+0.023
els+0.022
ays+0.022
Neg logit contributions
 long-0.023
 much-0.019
 badly-0.018
 far-0.018
On-0.017
Token carefully
Token ID7773
Feature activation+5.54e-01

Pos logit contributions
 long+0.037
 badly+0.034
 far+0.031
 prow+0.031
 much+0.031
Neg logit contributions
elling-0.051
aughter-0.048
ays-0.044
ada-0.044
atted-0.043
Token put
Token ID1234
Feature activation-1.63e+00

Pos logit contributions
 long+0.010
 badly+0.009
 far+0.009
 prow+0.009
 much+0.008
Neg logit contributions
elling-0.015
aughter-0.014
ays-0.013
ada-0.013
isher-0.013
Token make
Token ID787
Feature activation-1.62e+00

Pos logit contributions
 long+0.009
 badly+0.009
 much+0.008
 prow+0.008
 far+0.007
Neg logit contributions
elling-0.009
aughter-0.009
ada-0.008
isher-0.008
els-0.008
Token it
Token ID340
Feature activation+3.32e+00

Pos logit contributions
elling+0.044
aughter+0.041
ays+0.039
ada+0.039
atted+0.038
Neg logit contributions
 long-0.027
 badly-0.026
 prow-0.024
 far-0.022
 much-0.022
Token perfect
Token ID2818
Feature activation-1.12e+00

Pos logit contributions
 badly+0.052
 long+0.050
 prow+0.049
One+0.048
ggy+0.045
Neg logit contributions
elling-0.080
ays-0.074
aughter-0.072
 agrees-0.070
els-0.070
Token.
Token ID13
Feature activation+3.50e-02

Pos logit contributions
elling+0.032
aughter+0.031
ays+0.029
ada+0.028
els+0.028
Neg logit contributions
 long-0.018
 badly-0.016
 prow-0.015
 much-0.014
 far-0.014
Token
Token ID198
Feature activation+5.46e+00

Pos logit contributions
 long+0.001
 much+0.001
 badly+0.001
 far+0.001
 prow+0.001
Neg logit contributions
elling-0.001
aughter-0.001
ays-0.001
els-0.001
ada-0.001
Token
Token ID198
Feature activation+4.63e+00

Pos logit contributions
 long+0.096
 badly+0.087
 prow+0.079
 far+0.078
 much+0.078
Neg logit contributions
elling-0.150
aughter-0.142
ays-0.133
ada-0.132
atted-0.129
TokenWhile
Token ID3633
Feature activation-8.17e-01

Pos logit contributions
 badly+0.078
 long+0.077
 prow+0.074
 gall+0.071
 cave+0.066
Neg logit contributions
elling-0.118
aughter-0.112
ada-0.102
ison-0.100
uns-0.098
Token Jack
Token ID3619
Feature activation-3.67e+00

Pos logit contributions
elling+0.022
aughter+0.021
ays+0.020
atted+0.020
ada+0.019
Neg logit contributions
 long-0.013
 badly-0.013
 prow-0.012
 gall-0.011
 far-0.011
Token was
Token ID373
Feature activation+2.62e+00

Pos logit contributions
elling+0.095
aughter+0.089
ays+0.084
ada+0.082
atted+0.082
Neg logit contributions
 long-0.070
 badly-0.064
 prow-0.058
 much-0.058
 far-0.058
Token busy
Token ID8179
Feature activation+1.14e+00

Pos logit contributions
 badly+0.047
 long+0.045
One+0.044
 prow+0.044
 nut+0.043
Neg logit contributions
elling-0.058
atted-0.058
ays-0.055
aughter-0.055
ours-0.053
Token building
Token ID2615
Feature activation+3.58e-01

Pos logit contributions
 long+0.020
 badly+0.019
 prow+0.017
 much+0.017
 far+0.017
Neg logit contributions
elling-0.029
aughter-0.027
atted-0.027
ays-0.026
ada-0.026
Token upon
Token ID2402
Feature activation+3.71e+00

Pos logit contributions
elling+0.032
aughter+0.029
isher+0.028
ada+0.027
ays+0.027
Neg logit contributions
 long-0.019
 badly-0.016
 much-0.016
 far-0.016
 cave-0.014
Token a
Token ID257
Feature activation+4.71e+00

Pos logit contributions
 quest+0.074
 cave+0.071
 badly+0.070
 prow+0.070
 trail+0.068
Neg logit contributions
elling-0.060
aughter-0.057
 smiles-0.055
 Jill-0.054
 means-0.053
Token time
Token ID640
Feature activation+1.41e+00

Pos logit contributions
 badly+0.074
 prow+0.072
 gall+0.072
 cave+0.071
 long+0.069
Neg logit contributions
aughter-0.113
elling-0.110
cy-0.101
atted-0.098
ada-0.096
Token,
Token ID11
Feature activation+2.20e+00

Pos logit contributions
 long+0.026
 badly+0.024
 prow+0.022
 far+0.022
 much+0.022
Neg logit contributions
elling-0.037
aughter-0.035
ays-0.033
ada-0.032
atted-0.032
Token there
Token ID612
Feature activation+3.48e-01

Pos logit contributions
 long+0.025
 badly+0.021
 prow+0.021
 gall+0.019
 quest+0.019
Neg logit contributions
elling-0.069
aughter-0.067
ays-0.064
ada-0.062
atted-0.062
Token was
Token ID373
Feature activation+4.65e+00

Pos logit contributions
 long+0.006
 badly+0.005
 much+0.005
 far+0.005
 prow+0.005
Neg logit contributions
elling-0.010
aughter-0.010
ays-0.009
ada-0.009
uns-0.009
Token a
Token ID257
Feature activation+8.45e+00

Pos logit contributions
 badly+0.064
 long+0.060
 prow+0.060
 quest+0.056
 cave+0.056
Neg logit contributions
elling-0.131
aughter-0.125
ada-0.118
ays-0.116
ion-0.113
Token little
Token ID1310
Feature activation+2.17e+00

Pos logit contributions
 badly+0.153
 prow+0.143
 long+0.136
gh+0.132
 gall+0.130
Neg logit contributions
aughter-0.195
elling-0.185
ada-0.180
 Daughter-0.173
ison-0.167
Token flower
Token ID15061
Feature activation+4.79e+00

Pos logit contributions
 long+0.057
 badly+0.054
 prow+0.051
 much+0.050
 far+0.050
Neg logit contributions
elling-0.040
aughter-0.037
ada-0.033
ays-0.033
atted-0.032
Token that
Token ID326
Feature activation+1.24e+00

Pos logit contributions
 long+0.093
 deserted+0.086
 badly+0.083
 quest+0.082
 prow+0.082
Neg logit contributions
elling-0.107
ays-0.099
atted-0.098
aughter-0.097
els-0.095
Token was
Token ID373
Feature activation+5.40e+00

Pos logit contributions
 long+0.022
 badly+0.021
 prow+0.018
 far+0.017
 much+0.017
Neg logit contributions
elling-0.033
aughter-0.031
ays-0.029
atted-0.029
ada-0.029
Token the
Token ID262
Feature activation-6.94e-01

Pos logit contributions
elling+0.006
aughter+0.006
ada+0.005
ays+0.005
isher+0.005
Neg logit contributions
 long-0.004
 badly-0.004
 far-0.004
 much-0.003
 prow-0.003
Token river
Token ID7850
Feature activation+3.24e+00

Pos logit contributions
elling+0.018
aughter+0.017
ada+0.016
ays+0.016
ison+0.015
Neg logit contributions
 badly-0.012
 long-0.012
 prow-0.011
 much-0.010
 gall-0.010
Token on
Token ID319
Feature activation-3.25e+00

Pos logit contributions
 long+0.059
 badly+0.056
 prow+0.051
 much+0.049
One+0.048
Neg logit contributions
elling-0.078
aughter-0.075
atted-0.071
ays-0.070
ada-0.069
Token their
Token ID511
Feature activation-2.92e-01

Pos logit contributions
elling+0.087
aughter+0.082
atted+0.077
ays+0.075
ada+0.075
Neg logit contributions
 long-0.067
 badly-0.058
 far-0.057
 much-0.056
 prow-0.052
Token pink
Token ID11398
Feature activation-4.41e-01

Pos logit contributions
elling+0.008
aughter+0.007
atted+0.007
ada+0.007
ays+0.007
Neg logit contributions
 long-0.005
 badly-0.005
 prow-0.004
 much-0.004
 far-0.004
Token raft
Token ID31812
Feature activation+5.18e+00

Pos logit contributions
elling+0.012
aughter+0.011
ays+0.010
atted+0.010
ada+0.010
Neg logit contributions
 long-0.008
 badly-0.007
 much-0.007
 prow-0.007
 far-0.007
Token,
Token ID11
Feature activation+2.41e+00

Pos logit contributions
 long+0.080
 badly+0.072
 prow+0.064
 much+0.063
 far+0.063
Neg logit contributions
elling-0.152
aughter-0.145
ays-0.136
ada-0.135
atted-0.133
Token enjoying
Token ID13226
Feature activation-1.34e+00

Pos logit contributions
 long+0.043
 badly+0.040
 much+0.036
 prow+0.036
 cave+0.035
Neg logit contributions
elling-0.062
aughter-0.058
atted-0.055
ays-0.055
ada-0.054
Token the
Token ID262
Feature activation+1.66e-01

Pos logit contributions
elling+0.025
aughter+0.023
atted+0.022
els+0.021
ays+0.021
Neg logit contributions
 long-0.032
 badly-0.031
 prow-0.029
 cave-0.028
 much-0.028
Token sunshine
Token ID34488
Feature activation+1.51e+00

Pos logit contributions
 badly+0.003
 long+0.003
 prow+0.002
ggy+0.002
 much+0.002
Neg logit contributions
elling-0.005
aughter-0.004
ada-0.004
atted-0.004
isher-0.004
Token and
Token ID290
Feature activation+1.12e+00

Pos logit contributions
 long+0.027
 badly+0.026
 prow+0.024
 far+0.023
 much+0.023
Neg logit contributions
elling-0.039
aughter-0.036
ays-0.034
ada-0.034
atted-0.033
Token with
Token ID351
Feature activation-2.25e+00

Pos logit contributions
 long+0.028
 badly+0.026
 prow+0.024
 much+0.023
 far+0.023
Neg logit contributions
elling-0.043
aughter-0.041
atted-0.039
ada-0.038
ays-0.038
Token her
Token ID607
Feature activation-2.30e+00

Pos logit contributions
elling+0.072
aughter+0.068
ays+0.065
ada+0.064
uns+0.063
Neg logit contributions
 long-0.032
 badly-0.027
 much-0.026
 far-0.025
 prow-0.023
Token pink
Token ID11398
Feature activation-7.99e-01

Pos logit contributions
elling+0.063
aughter+0.059
ays+0.056
ada+0.055
atted+0.053
Neg logit contributions
 long-0.044
 badly-0.039
 far-0.036
 much-0.035
 prow-0.034
Token flowers
Token ID12734
Feature activation+4.09e+00

Pos logit contributions
elling+0.023
aughter+0.022
ays+0.021
ada+0.021
atted+0.020
Neg logit contributions
 long-0.013
 badly-0.012
 far-0.010
 much-0.010
 prow-0.010
Token and
Token ID290
Feature activation+5.79e-01

Pos logit contributions
 long+0.079
 badly+0.074
 prow+0.071
 much+0.067
One+0.067
Neg logit contributions
elling-0.097
aughter-0.092
atted-0.087
ays-0.086
ada-0.085
Token a
Token ID257
Feature activation+4.98e+00

Pos logit contributions
 long+0.010
 badly+0.009
 prow+0.008
 far+0.008
 much+0.008
Neg logit contributions
elling-0.016
aughter-0.015
ays-0.014
ada-0.014
atted-0.014
Token silly
Token ID14397
Feature activation+1.63e+00

Pos logit contributions
 prow+0.075
 badly+0.074
 long+0.067
One+0.065
 deserted+0.063
Neg logit contributions
aughter-0.121
elling-0.119
ada-0.113
 Daughter-0.110
atted-0.107
Token joke
Token ID9707
Feature activation+8.40e-01

Pos logit contributions
 long+0.030
 badly+0.028
 prow+0.027
 far+0.026
 much+0.026
Neg logit contributions
elling-0.040
aughter-0.038
ada-0.035
atted-0.035
ener-0.034
Token in
Token ID287
Feature activation-2.35e+00

Pos logit contributions
 long+0.013
 badly+0.011
 prow+0.010
 far+0.010
 much+0.010
Neg logit contributions
elling-0.025
aughter-0.024
ays-0.022
ada-0.022
atted-0.022
Token her
Token ID607
Feature activation-1.65e+00

Pos logit contributions
elling+0.055
aughter+0.051
ays+0.047
ada+0.047
atted+0.046
Neg logit contributions
 long-0.052
 badly-0.048
 far-0.044
 much-0.044
 prow-0.044
Token heart
Token ID2612
Feature activation+1.19e+00

Pos logit contributions
elling+0.045
aughter+0.043
ays+0.040
ada+0.040
atted+0.039
Neg logit contributions
 long-0.029
 badly-0.027
 far-0.024
 prow-0.024
 much-0.024
Token so
Token ID523
Feature activation+3.37e+00

Pos logit contributions
 long+0.064
 badly+0.055
 prow+0.053
 cave+0.051
 far+0.050
Neg logit contributions
elling-0.108
aughter-0.103
atted-0.098
ays-0.096
ada-0.096
Token excited
Token ID6568
Feature activation+6.74e-01

Pos logit contributions
 long+0.054
 badly+0.048
 much+0.043
 far+0.043
 prow+0.042
Neg logit contributions
elling-0.101
aughter-0.095
ays-0.090
ada-0.088
atted-0.087
Token that
Token ID326
Feature activation-2.51e+00

Pos logit contributions
 long+0.011
 badly+0.010
 far+0.009
 much+0.009
 prow+0.009
Neg logit contributions
elling-0.019
aughter-0.018
ays-0.017
ada-0.017
atted-0.017
Token she
Token ID673
Feature activation+4.67e-01

Pos logit contributions
elling+0.073
aughter+0.067
ays+0.065
isher+0.063
ada+0.063
Neg logit contributions
 long-0.046
 badly-0.039
 much-0.038
 far-0.037
 prow-0.034
Token ran
Token ID4966
Feature activation-3.72e-01

Pos logit contributions
 long+0.009
 badly+0.008
 prow+0.008
 far+0.008
 much+0.008
Neg logit contributions
elling-0.012
aughter-0.011
ada-0.010
atted-0.010
ays-0.010
Token all
Token ID477
Feature activation-5.31e-01

Pos logit contributions
elling+0.009
aughter+0.009
ada+0.008
ays+0.008
atted+0.008
Neg logit contributions
 long-0.007
 badly-0.007
 prow-0.007
 cave-0.006
 much-0.006
Token the
Token ID262
Feature activation-1.91e+00

Pos logit contributions
elling+0.015
aughter+0.015
ays+0.014
ener+0.013
ada+0.013
Neg logit contributions
 long-0.010
 far-0.009
 much-0.008
 badly-0.008
 safe-0.007
Token way
Token ID835
Feature activation-3.22e-01

Pos logit contributions
elling+0.057
aughter+0.054
ays+0.052
ada+0.050
atted+0.050
Neg logit contributions
 long-0.032
 badly-0.026
 far-0.025
 prow-0.024
 much-0.024
Token to
Token ID284
Feature activation-8.19e-02

Pos logit contributions
elling+0.008
aughter+0.008
ays+0.007
ada+0.007
atted+0.007
Neg logit contributions
 long-0.006
 badly-0.006
 far-0.005
 much-0.005
 prow-0.005
Token the
Token ID262
Feature activation-1.98e+00

Pos logit contributions
elling+0.002
aughter+0.002
ays+0.002
ada+0.002
atted+0.002
Neg logit contributions
 long-0.002
 badly-0.002
 far-0.001
 much-0.001
 prow-0.001
Token store
Token ID3650
Feature activation-9.26e-02

Pos logit contributions
elling+0.052
aughter+0.049
ada+0.046
ays+0.044
isher+0.044
Neg logit contributions
 long-0.036
 badly-0.033
 prow-0.031
 much-0.030
 gall-0.029
Token.
Token ID13
Feature activation-6.34e-01

Pos logit contributions
 long+0.034
 badly+0.030
 prow+0.028
 much+0.027
One+0.027
Neg logit contributions
elling-0.057
aughter-0.053
atted-0.051
ays-0.051
els-0.050
Token Although
Token ID4900
Feature activation-2.95e+00

Pos logit contributions
elling+0.018
aughter+0.017
els+0.016
ays+0.016
isher+0.016
Neg logit contributions
 long-0.011
 badly-0.010
 much-0.010
 far-0.009
 prow-0.009
Token he
Token ID339
Feature activation+3.54e-01

Pos logit contributions
elling+0.085
aughter+0.081
ays+0.075
atted+0.075
ada+0.075
Neg logit contributions
 long-0.047
 badly-0.041
 prow-0.038
 far-0.037
 much-0.037
Token was
Token ID373
Feature activation+3.08e+00

Pos logit contributions
 long+0.007
 badly+0.006
 prow+0.006
 far+0.006
 much+0.006
Neg logit contributions
elling-0.009
aughter-0.009
ays-0.008
ada-0.008
atted-0.008
Token forbidden
Token ID19467
Feature activation-3.13e+00

Pos logit contributions
 long+0.045
 badly+0.039
 prow+0.039
 cave+0.036
 gall+0.036
Neg logit contributions
elling-0.087
aughter-0.079
atted-0.078
ays-0.074
ener-0.074
Token to
Token ID284
Feature activation-2.58e-01

Pos logit contributions
elling+0.090
aughter+0.086
ays+0.081
ada+0.080
uns+0.077
Neg logit contributions
 long-0.055
 badly-0.049
 far-0.044
 much-0.044
 prow-0.043
Token climb
Token ID12080
Feature activation-1.80e+00

Pos logit contributions
elling+0.006
aughter+0.006
ada+0.006
atted+0.006
els+0.005
Neg logit contributions
 long-0.005
 badly-0.004
 prow-0.004
 much-0.004
 cave-0.004
Token it
Token ID340
Feature activation+2.79e-01

Pos logit contributions
elling+0.049
aughter+0.047
ada+0.043
ays+0.043
atted+0.042
Neg logit contributions
 long-0.034
 badly-0.031
 far-0.029
 much-0.028
 prow-0.027
Token again
Token ID757
Feature activation-1.36e+00

Pos logit contributions
 long+0.005
 badly+0.005
 prow+0.004
 far+0.004
 much+0.004
Neg logit contributions
elling-0.008
aughter-0.007
ays-0.007
ada-0.007
atted-0.006
Token,
Token ID11
Feature activation-6.88e-02

Pos logit contributions
elling+0.035
aughter+0.033
ays+0.030
ada+0.030
atted+0.030
Neg logit contributions
 long-0.025
 badly-0.023
 prow-0.022
 much-0.021
 far-0.021
Token he
Token ID339
Feature activation+1.03e+00

Pos logit contributions
elling+0.002
aughter+0.002
ays+0.002
ada+0.002
atted+0.002
Neg logit contributions
 long-0.001
 badly-0.001
 far-0.001
 prow-0.001
 much-0.001
TokenThe
Token ID464
Feature activation-1.25e+00

Pos logit contributions
 long+0.072
 badly+0.065
 prow+0.059
 far+0.059
 much+0.058
Neg logit contributions
elling-0.113
aughter-0.108
ada-0.100
ays-0.100
atted-0.098
Token little
Token ID1310
Feature activation-4.39e+00

Pos logit contributions
elling+0.033
aughter+0.031
ada+0.029
ays+0.029
atted+0.028
Neg logit contributions
 long-0.023
 badly-0.021
 prow-0.019
 much-0.019
 far-0.019
Token girl
Token ID2576
Feature activation-8.34e-01

Pos logit contributions
elling+0.094
aughter+0.087
ays+0.083
ada+0.077
ucks+0.076
Neg logit contributions
 long-0.109
 badly-0.098
 far-0.094
 prow-0.091
 much-0.091
Token decided
Token ID3066
Feature activation+1.34e+00

Pos logit contributions
elling+0.023
aughter+0.022
ays+0.021
ada+0.020
uns+0.020
Neg logit contributions
 long-0.015
 badly-0.013
 far-0.013
 prow-0.012
 much-0.012
Token to
Token ID284
Feature activation+1.49e+00

Pos logit contributions
 long+0.024
 badly+0.022
 prow+0.021
 much+0.020
 cave+0.020
Neg logit contributions
elling-0.035
aughter-0.033
ada-0.031
ays-0.031
atted-0.030
Token run
Token ID1057
Feature activation+2.86e-02

Pos logit contributions
 long+0.027
 badly+0.025
 much+0.023
 prow+0.022
 far+0.022
Neg logit contributions
elling-0.039
aughter-0.037
ada-0.035
ays-0.033
els-0.033
Token and
Token ID290
Feature activation+2.77e+00

Pos logit contributions
 long+0.001
 badly+0.001
 far+0.000
 much+0.000
 prow+0.000
Neg logit contributions
elling-0.001
aughter-0.001
ays-0.001
ada-0.001
atted-0.001
Token jump
Token ID4391
Feature activation+9.58e-01

Pos logit contributions
 long+0.042
 badly+0.037
 prow+0.033
 much+0.033
 far+0.033
Neg logit contributions
elling-0.083
aughter-0.079
ays-0.074
ada-0.074
atted-0.073
Token towards
Token ID3371
Feature activation-4.20e+00

Pos logit contributions
 long+0.018
 badly+0.016
 prow+0.015
 cave+0.015
 much+0.015
Neg logit contributions
elling-0.024
aughter-0.023
ada-0.021
ays-0.021
atted-0.021
Token the
Token ID262
Feature activation-4.83e-01

Pos logit contributions
elling+0.118
aughter+0.110
ays+0.105
uns+0.100
isher+0.100
Neg logit contributions
 long-0.084
 far-0.072
 badly-0.070
 much-0.068
 prow-0.065
Token thing
Token ID1517
Feature activation+8.48e-01

Pos logit contributions
elling+0.013
aughter+0.013
ada+0.012
ays+0.012
isher+0.011
Neg logit contributions
 long-0.008
 badly-0.007
 prow-0.007
 much-0.007
 far-0.006
Token are
Token ID389
Feature activation+3.53e+00

Pos logit contributions
 long+0.008
 badly+0.007
 prow+0.007
 much+0.007
 far+0.007
Neg logit contributions
elling-0.010
aughter-0.009
ays-0.009
ada-0.009
atted-0.008
Token happy
Token ID3772
Feature activation-8.18e-01

Pos logit contributions
 long+0.051
 badly+0.048
 prow+0.043
One+0.040
 cave+0.040
Neg logit contributions
elling-0.103
aughter-0.098
ays-0.093
atted-0.092
ada-0.091
Token and
Token ID290
Feature activation+9.79e-01

Pos logit contributions
elling+0.022
aughter+0.020
ays+0.019
atted+0.019
ada+0.019
Neg logit contributions
 long-0.015
 badly-0.014
 prow-0.013
 far-0.012
 much-0.012
Token friendly
Token ID8030
Feature activation-2.85e+00

Pos logit contributions
 long+0.015
 badly+0.014
 prow+0.012
 much+0.012
 far+0.012
Neg logit contributions
elling-0.029
aughter-0.028
ays-0.026
ada-0.026
atted-0.025
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
elling+0.092
aughter+0.088
ays+0.083
ada+0.082
atted+0.081
Neg logit contributions
 long-0.038
 badly-0.033
 far-0.029
 much-0.028
 prow-0.028
Token They
Token ID1119
Feature activation-9.65e-02

Pos logit contributions
elling+0.035
aughter+0.033
isher+0.031
ener+0.031
els+0.030
Neg logit contributions
 long-0.028
 much-0.023
 badly-0.022
 far-0.022
One-0.021
Token say
Token ID910
Feature activation-4.90e+00

Pos logit contributions
elling+0.002
aughter+0.002
ada+0.002
ays+0.002
atted+0.002
Neg logit contributions
 long-0.002
 badly-0.002
 prow-0.002
 much-0.002
 far-0.002
Token sorry
Token ID7926
Feature activation-4.15e+00

Pos logit contributions
elling+0.130
aughter+0.123
ays+0.115
ada+0.113
isher+0.111
Neg logit contributions
 long-0.097
 badly-0.086
 much-0.081
 far-0.079
 prow-0.077
Token for
Token ID329
Feature activation-5.03e+00

Pos logit contributions
elling+0.118
aughter+0.113
ays+0.105
ada+0.105
atted+0.102
Neg logit contributions
 long-0.073
 badly-0.064
 much-0.059
 far-0.059
 prow-0.057
Token fighting
Token ID4330
Feature activation-3.32e+00

Pos logit contributions
elling+0.118
aughter+0.111
ays+0.101
ada+0.100
atted+0.099
Neg logit contributions
 long-0.108
 badly-0.100
 prow-0.093
 much-0.091
 far-0.091
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
elling+0.114
aughter+0.110
ada+0.105
uns+0.104
isher+0.103
Neg logit contributions
 long-0.041
 badly-0.036
 much-0.035
 far-0.035
 well-0.031
Token said
Token ID531
Feature activation-2.64e+00

Pos logit contributions
 long+0.003
 badly+0.003
 far+0.002
 prow+0.002
 much+0.002
Neg logit contributions
elling-0.003
aughter-0.003
atted-0.003
ada-0.003
ays-0.003
Token hello
Token ID23748
Feature activation-2.22e+00

Pos logit contributions
elling+0.074
aughter+0.070
ays+0.066
ada+0.065
isher+0.064
Neg logit contributions
 long-0.047
 badly-0.042
 much-0.039
 far-0.038
 prow-0.038
Token to
Token ID284
Feature activation-2.76e+00

Pos logit contributions
elling+0.062
aughter+0.059
ays+0.055
ada+0.054
atted+0.053
Neg logit contributions
 long-0.040
 badly-0.036
 far-0.033
 much-0.032
 prow-0.032
Token the
Token ID262
Feature activation-2.32e+00

Pos logit contributions
elling+0.071
aughter+0.067
ays+0.065
isher+0.064
uns+0.062
Neg logit contributions
 long-0.058
 far-0.050
 badly-0.049
 much-0.049
 prow-0.047
Token family
Token ID1641
Feature activation-9.55e-01

Pos logit contributions
elling+0.058
aughter+0.055
ada+0.051
ays+0.049
ener+0.048
Neg logit contributions
 long-0.042
 badly-0.041
 prow-0.040
 much-0.037
 gall-0.036
Token and
Token ID290
Feature activation+6.03e-01

Pos logit contributions
elling+0.029
aughter+0.027
ays+0.026
ada+0.025
isher+0.025
Neg logit contributions
 long-0.015
 badly-0.014
 much-0.013
 far-0.013
 prow-0.012
Token pet
Token ID4273
Feature activation-2.98e+00

Pos logit contributions
 long+0.012
 badly+0.011
 far+0.010
 prow+0.010
 much+0.010
Neg logit contributions
elling-0.016
aughter-0.015
ays-0.014
ada-0.014
isher-0.013
Tokented
Token ID1513
Feature activation-3.65e+00

Pos logit contributions
elling+0.076
aughter+0.068
uns+0.065
oke+0.065
isher+0.064
Neg logit contributions
 long-0.062
 far-0.057
 badly-0.055
 much-0.051
 gall-0.049
Token the
Token ID262
Feature activation-2.08e+00

Pos logit contributions
elling+0.098
aughter+0.092
isher+0.086
ays+0.085
oke+0.083
Neg logit contributions
 long-0.074
 badly-0.065
 far-0.063
 much-0.061
 prow-0.059
Token bunny
Token ID44915
Feature activation+1.87e+00

Pos logit contributions
elling+0.052
aughter+0.049
ada+0.046
ays+0.044
ener+0.044
Neg logit contributions
 long-0.036
 badly-0.036
 prow-0.035
 much-0.033
 far-0.032
Token.
Token ID13
Feature activation+1.89e-01

Pos logit contributions
 long+0.032
 badly+0.028
 prow+0.027
 deserted+0.026
 far+0.026
Neg logit contributions
elling-0.048
aughter-0.046
atted-0.044
ays-0.043
ada-0.042
Token though
Token ID996
Feature activation-3.19e+00

Pos logit contributions
elling+0.055
aughter+0.052
ays+0.047
ada+0.047
atted+0.046
Neg logit contributions
 long-0.049
 badly-0.045
 far-0.042
 much-0.042
 prow-0.041
Token it
Token ID340
Feature activation-1.99e+00

Pos logit contributions
elling+0.082
aughter+0.078
ays+0.073
atted+0.072
ada+0.072
Neg logit contributions
 long-0.056
 badly-0.052
 prow-0.048
 far-0.047
 much-0.046
Token was
Token ID373
Feature activation+2.90e+00

Pos logit contributions
elling+0.050
aughter+0.047
ays+0.043
ada+0.043
atted+0.042
Neg logit contributions
 long-0.040
 badly-0.036
 much-0.033
 far-0.033
 prow-0.033
Token bitter
Token ID12922
Feature activation-1.59e+00

Pos logit contributions
 long+0.036
 badly+0.033
 prow+0.032
 cave+0.030
 gall+0.029
Neg logit contributions
elling-0.085
aughter-0.078
atted-0.074
ays-0.073
ada-0.073
Token at
Token ID379
Feature activation-4.97e+00

Pos logit contributions
elling+0.043
aughter+0.041
ays+0.038
ada+0.038
atted+0.037
Neg logit contributions
 long-0.029
 badly-0.026
 far-0.024
 much-0.024
 prow-0.023
Token first
Token ID717
Feature activation-3.61e+00

Pos logit contributions
elling+0.145
aughter+0.137
ays+0.130
ada+0.128
atted+0.126
Neg logit contributions
 long-0.085
 badly-0.074
 far-0.069
 much-0.068
 prow-0.066
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
elling+0.098
aughter+0.093
ays+0.087
ada+0.086
atted+0.084
Neg logit contributions
 long-0.065
 badly-0.059
 far-0.053
 prow-0.053
 much-0.053
Token it
Token ID340
Feature activation-7.32e-02

Pos logit contributions
elling+0.027
aughter+0.026
ays+0.024
ada+0.024
atted+0.023
Neg logit contributions
 long-0.022
 badly-0.020
 prow-0.018
 far-0.018
 much-0.018
Token ended
Token ID4444
Feature activation-8.27e-01

Pos logit contributions
elling+0.002
aughter+0.002
ays+0.002
atted+0.002
ada+0.002
Neg logit contributions
 long-0.001
 badly-0.001
 prow-0.001
 much-0.001
 far-0.001
Token with
Token ID351
Feature activation-2.38e+00

Pos logit contributions
elling+0.019
aughter+0.017
ays+0.016
atted+0.016
els+0.016
Neg logit contributions
 long-0.017
 prow-0.015
 badly-0.015
 much-0.014
 cave-0.014
Token her
Token ID607
Feature activation-2.28e+00

Pos logit contributions
elling+0.069
aughter+0.065
ays+0.061
ada+0.061
atted+0.060
Neg logit contributions
 long-0.040
 badly-0.035
 far-0.032
 much-0.032
 prow-0.032
Token have
Token ID423
Feature activation+7.36e-01

Pos logit contributions
elling+0.024
aughter+0.023
ays+0.021
ada+0.021
atted+0.021
Neg logit contributions
 long-0.017
 badly-0.015
 prow-0.014
 far-0.013
 much-0.013
Token to
Token ID284
Feature activation+2.68e-01

Pos logit contributions
 long+0.012
 badly+0.011
 much+0.010
 far+0.010
 prow+0.009
Neg logit contributions
elling-0.022
aughter-0.021
ays-0.019
ada-0.019
els-0.019
Token listen
Token ID6004
Feature activation-3.09e+00

Pos logit contributions
 long+0.006
 badly+0.006
 prow+0.005
 cave+0.005
 deserted+0.005
Neg logit contributions
elling-0.006
aughter-0.005
ada-0.005
uns-0.005
olds-0.005
Token to
Token ID284
Feature activation-4.23e+00

Pos logit contributions
elling+0.093
aughter+0.090
ada+0.085
ays+0.084
els+0.082
Neg logit contributions
 long-0.050
 badly-0.044
 much-0.041
 far-0.040
 well-0.035
Token us
Token ID514
Feature activation-3.71e+00

Pos logit contributions
elling+0.113
aughter+0.107
ays+0.104
atted+0.100
ada+0.099
Neg logit contributions
 long-0.083
 far-0.072
 badly-0.071
 much-0.068
 prow-0.063
Token.
Token ID13
Feature activation-1.24e+00

Pos logit contributions
elling+0.121
aughter+0.113
ays+0.110
ada+0.108
atted+0.107
Neg logit contributions
 long-0.051
 badly-0.046
 far-0.040
 much-0.040
 well-0.036
Token You
Token ID921
Feature activation-9.23e-01

Pos logit contributions
elling+0.026
aughter+0.026
atted+0.024
ener+0.023
isher+0.023
Neg logit contributions
 long-0.028
 badly-0.026
 much-0.026
 far-0.024
One-0.023
Token have
Token ID423
Feature activation+6.72e-01

Pos logit contributions
elling+0.024
aughter+0.023
ada+0.021
ays+0.021
atted+0.021
Neg logit contributions
 long-0.017
 badly-0.016
 prow-0.014
 far-0.014
 much-0.014
Token to
Token ID284
Feature activation+2.48e-01

Pos logit contributions
 long+0.011
 badly+0.010
 much+0.009
 far+0.009
 prow+0.009
Neg logit contributions
elling-0.020
aughter-0.019
ays-0.018
ada-0.018
els-0.017
Token be
Token ID307
Feature activation+1.76e+00

Pos logit contributions
 long+0.005
 badly+0.005
 prow+0.005
 cave+0.005
 deserted+0.005
Neg logit contributions
elling-0.005
aughter-0.005
ada-0.005
uns-0.004
olds-0.004
Token careful
Token ID8161
Feature activation-3.57e+00

Pos logit contributions
 long+0.026
 badly+0.023
 prow+0.021
 cave+0.019
 far+0.019
Neg logit contributions
elling-0.053
aughter-0.050
ays-0.047
atted-0.047
ada-0.047
Token their
Token ID511
Feature activation-2.60e+00

Pos logit contributions
elling+0.055
aughter+0.051
isher+0.047
ays+0.047
ada+0.047
Neg logit contributions
 long-0.048
 badly-0.042
 much-0.041
 far-0.040
 che-0.037
Token stories
Token ID3923
Feature activation+1.82e+00

Pos logit contributions
elling+0.074
aughter+0.071
ays+0.068
ada+0.067
ison+0.064
Neg logit contributions
 long-0.048
 badly-0.039
 much-0.038
 far-0.037
 nap-0.035
Token and
Token ID290
Feature activation-2.52e-01

Pos logit contributions
 long+0.034
 badly+0.031
 prow+0.030
 far+0.028
 cave+0.028
Neg logit contributions
elling-0.046
aughter-0.044
ays-0.041
ada-0.040
atted-0.040
Token their
Token ID511
Feature activation-3.18e+00

Pos logit contributions
elling+0.007
aughter+0.006
ays+0.006
ada+0.006
isher+0.006
Neg logit contributions
 long-0.005
 badly-0.004
 much-0.004
 far-0.004
 prow-0.004
Token laughs
Token ID22051
Feature activation-1.28e+00

Pos logit contributions
elling+0.090
aughter+0.085
ays+0.082
ada+0.080
ison+0.077
Neg logit contributions
 long-0.060
 badly-0.051
 far-0.047
 much-0.047
 che-0.044
Token.
Token ID13
Feature activation-1.60e+00

Pos logit contributions
elling+0.035
aughter+0.032
ays+0.031
ada+0.030
atted+0.030
Neg logit contributions
 long-0.021
 badly-0.019
 prow-0.019
 cave-0.018
 deserted-0.017
Token They
Token ID1119
Feature activation+2.15e-01

Pos logit contributions
elling+0.044
aughter+0.042
isher+0.041
ener+0.039
els+0.038
Neg logit contributions
 long-0.032
 much-0.027
 badly-0.025
 far-0.025
One-0.024
Token became
Token ID2627
Feature activation+3.13e+00

Pos logit contributions
 long+0.005
 badly+0.004
 prow+0.004
 cave+0.004
 much+0.004
Neg logit contributions
elling-0.005
aughter-0.005
ada-0.004
ays-0.004
atted-0.004
Token good
Token ID922
Feature activation-2.46e+00

Pos logit contributions
 long+0.041
 badly+0.033
 much+0.032
 far+0.031
 well+0.026
Neg logit contributions
elling-0.108
aughter-0.102
ays-0.097
atted-0.094
isher-0.094
Token friends
Token ID2460
Feature activation-1.68e+00

Pos logit contributions
elling+0.068
aughter+0.065
ays+0.060
ada+0.060
ison+0.058
Neg logit contributions
 long-0.044
 badly-0.039
 far-0.036
 much-0.036
 prow-0.035
Token.
Token ID13
Feature activation-1.33e+00

Pos logit contributions
elling+0.045
aughter+0.043
ays+0.040
ada+0.039
atted+0.039
Neg logit contributions
 long-0.028
 prow-0.025
 badly-0.025
 far-0.023
 gall-0.023
Token had
Token ID550
Feature activation+3.27e+00

Pos logit contributions
 long+0.058
 badly+0.053
 prow+0.050
 far+0.048
 much+0.048
Neg logit contributions
elling-0.078
aughter-0.073
ada-0.068
ays-0.068
atted-0.066
Token lots
Token ID6041
Feature activation+2.08e-01

Pos logit contributions
 long+0.046
 badly+0.041
 prow+0.037
 far+0.036
 much+0.035
Neg logit contributions
elling-0.100
aughter-0.095
ays-0.089
ada-0.088
atted-0.088
Token of
Token ID286
Feature activation-7.43e-01

Pos logit contributions
 long+0.004
 badly+0.004
 much+0.003
 far+0.003
 che+0.003
Neg logit contributions
elling-0.006
aughter-0.005
ays-0.005
ada-0.005
ucks-0.005
Token water
Token ID1660
Feature activation+2.55e+00

Pos logit contributions
elling+0.023
aughter+0.022
ays+0.021
ada+0.021
atted+0.021
Neg logit contributions
 long-0.010
 badly-0.009
 much-0.008
 far-0.008
 prow-0.008
Token.
Token ID13
Feature activation+7.04e-01

Pos logit contributions
 long+0.041
 badly+0.036
 far+0.034
 much+0.032
 prow+0.031
Neg logit contributions
elling-0.076
aughter-0.073
ada-0.067
ays-0.066
ucks-0.065
Token D
Token ID360
Feature activation-6.14e+00

Pos logit contributions
 long+0.014
 badly+0.013
 much+0.012
 far+0.012
 prow+0.012
Neg logit contributions
elling-0.018
aughter-0.017
isher-0.016
ays-0.015
els-0.015
Tokenucky
Token ID5309
Feature activation-9.19e-01

Pos logit contributions
elling+0.130
aughter+0.123
ays+0.114
els+0.112
isher+0.108
Neg logit contributions
 long-0.144
 badly-0.138
 much-0.125
 far-0.124
 prow-0.121
Token had
Token ID550
Feature activation+3.62e+00

Pos logit contributions
elling+0.025
aughter+0.024
ays+0.022
ada+0.022
isher+0.021
Neg logit contributions
 long-0.017
 badly-0.015
 prow-0.014
 far-0.014
 much-0.014
Token many
Token ID867
Feature activation-6.44e-02

Pos logit contributions
 badly+0.044
 long+0.042
 prow+0.038
 cave+0.037
 far+0.033
Neg logit contributions
elling-0.108
aughter-0.101
atted-0.097
ada-0.096
ays-0.096
Token friends
Token ID2460
Feature activation+1.18e+00

Pos logit contributions
elling+0.002
aughter+0.002
ays+0.002
ada+0.002
atted+0.002
Neg logit contributions
 long-0.001
 badly-0.001
 prow-0.001
 far-0.001
 much-0.001
Token who
Token ID508
Feature activation-3.54e-01

Pos logit contributions
 long+0.018
 badly+0.016
 far+0.015
 much+0.015
 prow+0.014
Neg logit contributions
elling-0.036
aughter-0.034
ays-0.032
ada-0.031
isher-0.031
Token sky
Token ID6766
Feature activation+1.57e+00

Pos logit contributions
elling+0.048
aughter+0.046
ener+0.042
isher+0.041
ada+0.041
Neg logit contributions
 badly-0.035
 long-0.034
 prow-0.032
 much-0.031
 gall-0.030
Token.
Token ID13
Feature activation-9.94e-01

Pos logit contributions
 long+0.023
 badly+0.020
 prow+0.018
 much+0.018
 far+0.018
Neg logit contributions
elling-0.048
aughter-0.045
ays-0.043
ada-0.042
atted-0.042
Token "
Token ID366
Feature activation-1.34e+00

Pos logit contributions
elling+0.028
isher+0.026
aughter+0.026
ays+0.025
els+0.025
Neg logit contributions
 long-0.019
 much-0.016
 badly-0.015
 far-0.015
 prow-0.014
TokenYou
Token ID1639
Feature activation-7.85e-01

Pos logit contributions
elling+0.033
aughter+0.031
ays+0.029
ada+0.028
atted+0.028
Neg logit contributions
 long-0.028
 badly-0.026
 far-0.024
 prow-0.024
 much-0.024
Token cannot
Token ID2314
Feature activation+1.77e+00

Pos logit contributions
elling+0.019
aughter+0.017
ada+0.016
ays+0.016
ison+0.016
Neg logit contributions
 long-0.017
 badly-0.015
 much-0.014
 prow-0.014
 far-0.013
Token use
Token ID779
Feature activation-1.97e+00

Pos logit contributions
 long+0.037
 badly+0.034
 much+0.033
 prow+0.033
 cave+0.032
Neg logit contributions
elling-0.037
ada-0.034
aughter-0.032
els-0.031
ener-0.031
Token my
Token ID616
Feature activation-3.36e+00

Pos logit contributions
elling+0.050
aughter+0.047
ays+0.044
ada+0.043
atted+0.042
Neg logit contributions
 long-0.040
 badly-0.036
 far-0.034
 much-0.034
 prow-0.033
Token stone
Token ID7815
Feature activation+4.70e+00

Pos logit contributions
elling+0.093
aughter+0.089
els+0.082
ener+0.082
isher+0.081
Neg logit contributions
 long-0.052
 badly-0.051
 prow-0.046
 much-0.045
 far-0.043
Token!"
Token ID2474
Feature activation-1.27e+00

Pos logit contributions
 long+0.085
 badly+0.076
 prow+0.074
 much+0.071
 far+0.070
Neg logit contributions
elling-0.124
aughter-0.116
ays-0.109
ada-0.107
ener-0.106
Token the
Token ID262
Feature activation-4.15e+00

Pos logit contributions
elling+0.037
aughter+0.033
isher+0.032
ays+0.032
uns+0.032
Neg logit contributions
 long-0.024
 badly-0.020
 far-0.020
 much-0.019
 prow-0.017
Token voice
Token ID3809
Feature activation-1.05e+00

Pos logit contributions
elling+0.078
aughter+0.072
ays+0.064
ada+0.064
atted+0.062
Neg logit contributions
 long-0.110
 badly-0.103
 prow-0.097
 much-0.097
 far-0.097
Token smiled
Token ID13541
Feature activation-1.44e+00

Pos logit contributions
elling+0.072
aughter+0.066
uns+0.062
ays+0.062
ada+0.061
Neg logit contributions
 long-0.056
 che-0.052
 badly-0.051
 far-0.050
 prow-0.049
Token and
Token ID290
Feature activation-2.22e-01

Pos logit contributions
elling+0.033
aughter+0.030
ays+0.029
atted+0.028
ada+0.028
Neg logit contributions
 long-0.031
 badly-0.029
 prow-0.028
 gall-0.026
 far-0.026
Token said
Token ID531
Feature activation-5.23e+00

Pos logit contributions
elling+0.005
aughter+0.005
ays+0.005
els+0.004
ada+0.004
Neg logit contributions
 long-0.005
 badly-0.004
 far-0.004
 much-0.004
 che-0.004
Token,
Token ID11
Feature activation-1.20e+00

Pos logit contributions
elling+0.153
aughter+0.144
ays+0.135
ada+0.133
isher+0.131
Neg logit contributions
 long-0.089
 badly-0.080
 much-0.073
 far-0.070
 prow-0.066
Token "
Token ID366
Feature activation-2.73e+00

Pos logit contributions
elling+0.027
aughter+0.025
ays+0.023
ada+0.022
atted+0.022
Neg logit contributions
 long-0.028
 badly-0.026
 prow-0.024
 far-0.024
 much-0.024
TokenOf
Token ID5189
Feature activation-9.02e+00

Pos logit contributions
elling+0.062
aughter+0.059
ays+0.056
els+0.056
atted+0.054
Neg logit contributions
 long-0.062
 badly-0.055
 far-0.054
 much-0.054
One-0.053
Token course
Token ID1781
Feature activation-5.12e+00

Pos logit contributions
elling+0.227
ays+0.217
aughter+0.217
ada+0.201
atted+0.201
Neg logit contributions
 long-0.176
 badly-0.157
 much-0.154
One-0.146
 far-0.146
Token,
Token ID11
Feature activation-4.12e+00

Pos logit contributions
elling+0.164
aughter+0.155
ays+0.152
isher+0.148
els+0.145
Neg logit contributions
 long-0.079
 badly-0.065
 much-0.062
 far-0.059
 che-0.050
Token Sara
Token ID24799
Feature activation-5.20e+00

Pos logit contributions
elling+0.090
aughter+0.083
ays+0.077
atted+0.075
isher+0.075
Neg logit contributions
 long-0.101
 badly-0.091
 much-0.086
 far-0.086
 che-0.083
Token.
Token ID13
Feature activation-2.58e+00

Pos logit contributions
elling+0.184
aughter+0.175
ays+0.170
ada+0.167
atted+0.165
Neg logit contributions
 long-0.054
 badly-0.043
 much-0.038
 far-0.037
 cave-0.032
Token You
Token ID921
Feature activation-1.06e+00

Pos logit contributions
elling+0.062
els+0.060
aughter+0.060
atted+0.056
ada+0.056
Neg logit contributions
 long-0.049
 much-0.042
 badly-0.040
 far-0.040
 well-0.039
Token€
Token ID26391
Feature activation-2.54e+00

Pos logit contributions
elling+0.095
aughter+0.087
ays+0.086
atted+0.083
els+0.082
Neg logit contributions
 long-0.106
 badly-0.100
 far-0.095
 prow-0.093
 much-0.093
Token
Token ID198
Feature activation+3.82e+00

Pos logit contributions
elling+0.058
aughter+0.057
ays+0.054
ada+0.052
atted+0.051
Neg logit contributions
 long-0.051
 badly-0.047
 prow-0.044
 far-0.043
 deserted-0.043
Token
Token ID198
Feature activation+2.91e+00

Pos logit contributions
 long+0.066
 badly+0.060
 prow+0.055
 far+0.053
 much+0.053
Neg logit contributions
elling-0.106
aughter-0.101
ays-0.093
ada-0.093
atted-0.091
TokenSo
Token ID2396
Feature activation-3.37e+00

Pos logit contributions
 long+0.053
 badly+0.047
 prow+0.045
 cave+0.044
 far+0.043
Neg logit contributions
elling-0.075
aughter-0.072
ada-0.068
ion-0.067
ison-0.065
Token the
Token ID262
Feature activation-4.10e+00

Pos logit contributions
elling+0.084
aughter+0.077
ays+0.073
ada+0.070
els+0.070
Neg logit contributions
 long-0.072
 badly-0.065
 far-0.061
 much-0.061
 prow-0.058
Token little
Token ID1310
Feature activation-7.42e+00

Pos logit contributions
elling+0.107
aughter+0.100
atted+0.090
ada+0.090
ion+0.089
Neg logit contributions
 badly-0.070
 long-0.068
 prow-0.061
 deserted-0.060
 far-0.057
Token frog
Token ID21264
Feature activation-6.85e-01

Pos logit contributions
elling+0.189
aughter+0.177
ays+0.165
ada+0.163
atted+0.160
Neg logit contributions
 long-0.147
 badly-0.136
 prow-0.125
 much-0.124
 far-0.123
Token hopped
Token ID45230
Feature activation+1.88e-01

Pos logit contributions
elling+0.019
aughter+0.018
ays+0.017
ada+0.017
isher+0.016
Neg logit contributions
 long-0.012
 badly-0.011
 far-0.010
 prow-0.010
 much-0.010
Token out
Token ID503
Feature activation-1.11e+00

Pos logit contributions
 long+0.004
 badly+0.004
 far+0.003
 much+0.003
 prow+0.003
Neg logit contributions
elling-0.005
aughter-0.004
ays-0.004
ada-0.004
atted-0.004
Token to
Token ID284
Feature activation-2.10e+00

Pos logit contributions
elling+0.031
aughter+0.030
ada+0.027
ays+0.027
isher+0.026
Neg logit contributions
 long-0.022
 badly-0.019
 far-0.018
 much-0.018
 prow-0.017
Token the
Token ID262
Feature activation-8.02e-01

Pos logit contributions
elling+0.054
aughter+0.052
ays+0.048
ada+0.047
atted+0.046
Neg logit contributions
 long-0.040
 badly-0.037
 prow-0.034
 much-0.034
 far-0.033
Token any
Token ID597
Feature activation-1.81e+00

Pos logit contributions
elling+0.073
aughter+0.069
ays+0.065
ada+0.064
atted+0.063
Neg logit contributions
 long-0.049
 badly-0.044
 far-0.041
 much-0.041
 prow-0.039
Token way
Token ID835
Feature activation-2.45e+00

Pos logit contributions
elling+0.047
aughter+0.044
ays+0.040
isher+0.039
els+0.039
Neg logit contributions
 long-0.032
 badly-0.030
 prow-0.029
 gall-0.027
 deserted-0.026
Token she
Token ID673
Feature activation-3.60e-01

Pos logit contributions
elling+0.079
aughter+0.075
ays+0.071
isher+0.070
atted+0.070
Neg logit contributions
 long-0.034
 badly-0.029
 much-0.026
 far-0.026
 prow-0.022
Token could
Token ID714
Feature activation+3.27e+00

Pos logit contributions
elling+0.010
aughter+0.009
ays+0.008
ada+0.008
atted+0.008
Neg logit contributions
 long-0.007
 badly-0.006
 far-0.006
 prow-0.006
 much-0.006
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
 long+0.064
 badly+0.058
 prow+0.054
 much+0.054
 far+0.052
Neg logit contributions
elling-0.078
aughter-0.074
ada-0.070
ays-0.067
els-0.066
Token Mar
Token ID1526
Feature activation-1.01e+01

Pos logit contributions
elling+0.037
aughter+0.034
isher+0.033
ays+0.033
ener+0.032
Neg logit contributions
 long-0.019
 badly-0.016
 much-0.016
 far-0.014
 prow-0.014
Tokeniah
Token ID9520
Feature activation-2.39e+00

Pos logit contributions
elling+0.174
ays+0.154
ion+0.152
 supervision+0.147
 agrees+0.145
Neg logit contributions
 long-0.246
 badly-0.232
 far-0.229
 much-0.216
 cave-0.213
Token found
Token ID1043
Feature activation-1.91e+00

Pos logit contributions
elling+0.069
aughter+0.064
ays+0.062
ada+0.060
ucks+0.059
Neg logit contributions
 long-0.039
 badly-0.036
 far-0.033
 prow-0.033
 much-0.032
Token many
Token ID867
Feature activation-1.59e+00

Pos logit contributions
elling+0.066
aughter+0.061
ays+0.061
uns+0.059
ucks+0.059
Neg logit contributions
 long-0.024
 badly-0.019
 far-0.019
 much-0.019
 hidden-0.016
Token ways
Token ID2842
Feature activation+8.45e-01

Pos logit contributions
elling+0.042
aughter+0.040
ays+0.038
ada+0.037
atted+0.036
Neg logit contributions
 long-0.031
 badly-0.028
 much-0.025
 far-0.025
 prow-0.025
Token to
Token ID284
Feature activation+7.47e-01

Pos logit contributions
 long+0.014
 badly+0.013
 far+0.012
 much+0.012
 prow+0.011
Neg logit contributions
elling-0.025
aughter-0.023
ays-0.022
ada-0.022
ucks-0.021
Token.
Token ID13
Feature activation-9.57e-01

Pos logit contributions
elling+0.013
aughter+0.012
ays+0.012
atted+0.011
ada+0.011
Neg logit contributions
 long-0.008
 badly-0.007
 prow-0.007
 far-0.006
 gall-0.006
Token The
Token ID383
Feature activation-2.15e+00

Pos logit contributions
elling+0.029
aughter+0.028
ays+0.026
ada+0.026
isher+0.026
Neg logit contributions
 long-0.015
 badly-0.013
 much-0.012
 far-0.012
 prow-0.011
Token end
Token ID886
Feature activation-8.84e-01

Pos logit contributions
elling+0.050
aughter+0.048
ays+0.044
ada+0.043
isher+0.042
Neg logit contributions
 long-0.045
 badly-0.043
 prow-0.040
 much-0.039
 far-0.038
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
elling+0.026
aughter+0.024
ays+0.023
atted+0.022
els+0.022
Neg logit contributions
 long-0.014
 badly-0.012
 prow-0.011
 far-0.011
 deserted-0.010
Token<|endoftext|>
Token ID50256
Feature activation-2.29e-02

Pos logit contributions
elling+0.033
aughter+0.032
ada+0.029
ays+0.029
isher+0.029
Neg logit contributions
 long-0.017
 badly-0.014
 much-0.014
 far-0.013
 prow-0.012
TokenL
Token ID43
Feature activation-7.74e+00

Pos logit contributions
elling+0.000
aughter+0.000
ays+0.000
ada+0.000
atted+0.000
Neg logit contributions
 long-0.001
 badly-0.000
 prow-0.000
 far-0.000
 much-0.000
Tokenily
Token ID813
Feature activation-2.82e+00

Pos logit contributions
elling+0.161
aughter+0.137
atted+0.132
ays+0.131
ada+0.131
Neg logit contributions
 long-0.188
 far-0.169
 much-0.168
 cave-0.166
 badly-0.166
Token and
Token ID290
Feature activation-5.89e-01

Pos logit contributions
elling+0.082
aughter+0.078
ada+0.072
ays+0.072
uns+0.071
Neg logit contributions
 long-0.046
 badly-0.040
 far-0.040
 prow-0.038
 che-0.037
Token Tom
Token ID4186
Feature activation-2.77e+00

Pos logit contributions
elling+0.014
aughter+0.012
ays+0.012
uns+0.011
atted+0.011
Neg logit contributions
 long-0.014
 badly-0.013
 far-0.012
 much-0.012
 prow-0.011
Token were
Token ID547
Feature activation+4.25e+00

Pos logit contributions
elling+0.078
aughter+0.074
ays+0.069
ada+0.067
atted+0.067
Neg logit contributions
 long-0.048
 badly-0.043
 far-0.041
 much-0.040
 prow-0.040
Token friends
Token ID2460
Feature activation+9.29e-01

Pos logit contributions
 long+0.074
 badly+0.073
 prow+0.068
 deserted+0.064
 gall+0.064
Neg logit contributions
elling-0.111
aughter-0.109
ays-0.101
atted-0.099
ada-0.097
Token the
Token ID262
Feature activation-2.47e+00

Pos logit contributions
elling+0.021
aughter+0.020
ada+0.018
ion+0.018
atted+0.018
Neg logit contributions
 long-0.021
 badly-0.020
 prow-0.018
 much-0.018
 deserted-0.018
Token strong
Token ID1913
Feature activation-4.43e+00

Pos logit contributions
elling+0.063
aughter+0.060
ison+0.053
ada+0.053
ener+0.053
Neg logit contributions
 badly-0.042
 long-0.040
 prow-0.037
 much-0.036
 forever-0.033
Token man
Token ID582
Feature activation-5.16e-01

Pos logit contributions
elling+0.117
aughter+0.111
ays+0.104
ada+0.102
atted+0.101
Neg logit contributions
 long-0.086
 badly-0.077
 far-0.071
 prow-0.070
 much-0.070
Token and
Token ID290
Feature activation+1.29e+00

Pos logit contributions
elling+0.013
aughter+0.012
ays+0.012
ada+0.012
atted+0.011
Neg logit contributions
 long-0.010
 badly-0.009
 prow-0.008
 much-0.008
 far-0.008
Token the
Token ID262
Feature activation-3.11e+00

Pos logit contributions
 long+0.024
 badly+0.022
 far+0.020
 prow+0.020
 much+0.020
Neg logit contributions
elling-0.034
aughter-0.032
ays-0.030
ada-0.030
atted-0.029
Token little
Token ID1310
Feature activation-6.93e+00

Pos logit contributions
elling+0.079
aughter+0.077
ada+0.069
ener+0.068
ison+0.068
Neg logit contributions
 badly-0.053
 long-0.051
 prow-0.047
 much-0.046
 deserted-0.044
Token girl
Token ID2576
Feature activation-7.13e-01

Pos logit contributions
elling+0.141
aughter+0.131
ays+0.119
ada+0.117
atted+0.114
Neg logit contributions
 long-0.173
 badly-0.162
 prow-0.151
 much-0.151
 far-0.151
Token was
Token ID373
Feature activation+4.78e+00

Pos logit contributions
elling+0.018
aughter+0.017
atted+0.016
ada+0.016
ays+0.016
Neg logit contributions
 long-0.014
 badly-0.012
 prow-0.011
 much-0.011
 far-0.011
Token happy
Token ID3772
Feature activation-9.04e-02

Pos logit contributions
 long+0.071
 cave+0.060
 quest+0.059
 prow+0.057
 trail+0.057
Neg logit contributions
elling-0.122
atted-0.120
aughter-0.115
ada-0.112
ion-0.112
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
elling+0.002
aughter+0.002
atted+0.002
ays+0.002
ada+0.002
Neg logit contributions
 long-0.002
 badly-0.001
 prow-0.001
 much-0.001
 far-0.001
Token She
Token ID1375
Feature activation+1.78e+00

Pos logit contributions
elling+0.007
isher+0.006
els+0.006
aughter+0.006
ener+0.006
Neg logit contributions
 long-0.004
 much-0.003
 badly-0.003
 far-0.003
On-0.003