TOP ACTIVATIONS
MAX = 9.618

were two beds in a bedroom . One was a big
a time , in a wide , green park , there
was walking in a me adow when he saw something strange
and wandered into a big garden . It was a special
named Ben found a big chamber in his house . He
was walking through a me adow . It was so sunny
Jackie was playing in her garden , among the flowers and
a time , in a wide country , there was a
He lived in a big chamber far away . Every day
Chloe was walking through a garden and she saw a big
Jess lived in a big garden with lots of flowers and
big forest , there was a log . The log was
the forest , there was a bald lion . He was
Ben was playing in his garden . He saw a big
was playing outside in his garden . He saw a big
. He lived in a chamber underground . He was very
my was playing in his garden one day . He was
time , little Jack found a blue bath tub in the
my was playing in his garden with his toy cars .
a time , in a peaceful town , there was a

MIN ACTIVATIONS
MIN = -13.326

't have all these ch ocol ates ! Now , go
and he didn 't want L yl a to be unhappy
ff ron and vanilla and lic or ice and star an
treasure was so special that Ab bie decide to keep it
and laughed and ate ch ocol ates . They had a
learn more about rh in oc er oses and respect them
to see the rh in oc eros anymore . He is
not eat too many ch ocol ates and his friend learned
bled up all the ch ocol ates , and she was
the gate keeper agreed that Ab bie had worked hard and
happy for the rh in oc eros . She said ,
From that day on , Ab bie never forgot the special
you like the rh in oc eros , but it is
had never seen the av oc ados . They wished they
He did not care if L ila was hurt or sad
you like the rh in oc eros . But remember ,
L ily , these ch ocol ates are bitter ," he
agreement and hopped away . Br idget watched it go ,
okay ? And maybe not pr unes . They are too
that boss y rh in oc eros . <|endoftext|> D ee

INTERVAL 3.882 - 9.618
CONTAINS 6.231%

upon a time , there was a little girl named Lily
upon a time there was a happy family . Every day
One day , he found a big , shiny rock in
upon a time , there was a little bunny named Benny
She ran to the beach hut , waiting for her mummy

INTERVAL -1.854 - 3.882
CONTAINS 71.753%

and went inside her house . She knew she 'd always
?" she asked John . John looked at the
axe . It is old and rusty and has a long
boats and many of other things . The little boy was
with the rocket and make it fly . They save the

INTERVAL -7.590 - -1.854
CONTAINS 21.709%

P andy smiled . He said , " Of course I
up something he wanted . <|endoftext|> Once upon a time ,
jealous . I wanted to make the tallest tower and I
she asked . Lily told her about Tim my
the dirt . " But it would be fun

INTERVAL -13.326 - -7.590
CONTAINS 0.307%

sorry . They hoped their k ite was happy too .
voice said . Lu cy felt generous and quickly
he didn 't want L yl a to be unhappy .
and thanked her for not punishing them . <|endoftext|> Once upon
little longer ." L ily and Max gigg led
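
The four INTERVAL ranges above appear to be equal-width slices of the span from MIN to MAX. The sketch below reproduces the listed boundaries under that assumption. The function name `interval_edges` and the bin count of 4 are illustrative choices, not taken from the tool that produced this dump.

```python
# Sketch: rebuild the INTERVAL boundaries from MIN and MAX,
# assuming the dashboard uses equal-width bins (an assumption).

def interval_edges(lo, hi, n_bins=4):
    """Return bin edges from hi down to lo, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(hi - i * width, 3) for i in range(n_bins + 1)]

# MIN and MAX as listed in this dump.
edges = interval_edges(-13.326, 9.618)

# CONTAINS percentages as listed, from the highest interval to the lowest.
shares = [6.231, 71.753, 21.709, 0.307]

for (upper, lower), share in zip(zip(edges, edges[1:]), shares):
    print(f"INTERVAL {lower:.3f} - {upper:.3f}  CONTAINS {share:.3f}%")
print(f"total share: {sum(shares):.3f}%")
```

Each interval is 5.736 wide, and the four CONTAINS values add up to 100%, so the intervals cover every recorded activation.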

PER-TOKEN DETAILS

Token were
Token ID547
Feature activation+3.63e+00

Pos logit contributions
 safe+0.005
 cozy+0.004
 curled+0.004
 curious+0.004
 scary+0.004
Neg logit contributions
parents-0.011
father-0.011
ums-0.010
ases-0.010
emaker-0.010
Token two
Token ID734
Feature activation+4.81e+00

Pos logit contributions
 safe+0.054
 cozy+0.053
 a+0.051
 curled+0.049
 curious+0.048
Neg logit contributions
parents-0.107
father-0.102
asers-0.100
uns-0.099
ases-0.098
Token beds
Token ID20237
Feature activation+5.53e+00

Pos logit contributions
 safe+0.091
 cozy+0.090
 curious+0.086
 a+0.083
 hidden+0.083
Neg logit contributions
parents-0.122
father-0.115
ips-0.113
ases-0.112
emaker-0.112
Token in
Token ID287
Feature activation+4.55e+00

Pos logit contributions
 cozy+0.047
 safe+0.045
 curled+0.040
 scary+0.036
 curious+0.035
Neg logit contributions
parents-0.193
father-0.187
ums-0.185
asers-0.182
ips-0.181
Token a
Token ID257
Feature activation+7.57e+00

Pos logit contributions
 curled+0.029
 safe+0.027
 cozy+0.023
 curious+0.019
 shady+0.018
Neg logit contributions
parents-0.162
father-0.159
els-0.149
ases-0.149
ums-0.149
Token bedroom
Token ID14043
Feature activation+9.62e+00

Pos logit contributions
 curled+0.086
 perched+0.085
 but+0.071
 like+0.068
 filled+0.067
Neg logit contributions
parents-0.200
ma-0.196
father-0.196
oral-0.190
 narrative-0.181
Token.
Token ID13
Feature activation+2.60e+00

Pos logit contributions
 safe+0.115
 cozy+0.114
 curled+0.105
 scary+0.102
 hollow+0.091
Neg logit contributions
parents-0.295
ums-0.286
father-0.283
uns-0.270
olds-0.268
Token One
Token ID1881
Feature activation+5.47e-01

Pos logit contributions
 safe+0.043
 cozy+0.039
 curled+0.039
 hidden+0.037
 scary+0.036
Neg logit contributions
parents-0.070
father-0.067
uns-0.065
ums-0.064
ases-0.064
Token was
Token ID373
Feature activation+5.62e+00

Pos logit contributions
 perched+0.011
 abandoned+0.010
 hollow+0.010
 safe+0.010
 curled+0.010
Neg logit contributions
parents-0.011
father-0.011
ma-0.011
oral-0.011
ugs-0.010
Token a
Token ID257
Feature activation+7.47e+00

Pos logit contributions
 curled+0.049
 hollow+0.049
 safe+0.049
 perched+0.048
 clearing+0.048
Neg logit contributions
father-0.159
ums-0.159
parents-0.158
pa-0.156
ma-0.156
Token big
Token ID1263
Feature activation+5.00e+00

Pos logit contributions
 perched+0.107
 curled+0.096
 abandoned+0.091
 its+0.089
 hollow+0.083
Neg logit contributions
oral-0.180
father-0.177
ma-0.176
parents-0.173
omm-0.170
Token a
Token ID257
Feature activation+4.85e+00

Pos logit contributions
 curled+0.022
 hollow+0.021
 abandoned+0.020
 safe+0.019
 perched+0.018
Neg logit contributions
parents-0.079
father-0.075
ases-0.075
olds-0.073
oral-0.072
Token time
Token ID640
Feature activation+2.25e+00

Pos logit contributions
 safe+0.087
 cozy+0.084
 curled+0.083
 curious+0.078
 hidden+0.078
Neg logit contributions
parents-0.122
father-0.116
ums-0.111
ases-0.111
ips-0.111
Token,
Token ID11
Feature activation+2.17e+00

Pos logit contributions
 safe+0.011
 cozy+0.010
 curled+0.008
 curious+0.007
 hidden+0.006
Neg logit contributions
parents-0.088
father-0.085
asers-0.083
ases-0.083
ums-0.082
Token in
Token ID287
Feature activation+3.80e+00

Pos logit contributions
 curled+0.019
 safe+0.018
 cozy+0.017
 abandoned+0.015
 shady+0.015
Neg logit contributions
ases-0.067
els-0.067
father-0.067
parents-0.067
ips-0.067
Token a
Token ID257
Feature activation+7.87e+00

Pos logit contributions
 curled+0.029
 safe+0.027
 abandoned+0.024
 cozy+0.024
 hollow+0.022
Neg logit contributions
parents-0.120
father-0.120
ases-0.115
ums-0.114
els-0.112
Token wide
Token ID3094
Feature activation+9.54e+00

Pos logit contributions
 curled+0.087
 but+0.069
 perched+0.069
 safe+0.065
 filled+0.059
Neg logit contributions
father-0.229
parents-0.229
ases-0.215
pa-0.213
oral-0.211
Token,
Token ID11
Feature activation+4.12e+00

Pos logit contributions
 curled+0.160
 safe+0.151
 curious+0.145
 perched+0.144
 cozy+0.138
Neg logit contributions
father-0.217
pa-0.211
parents-0.209
ma-0.202
ums-0.199
Token green
Token ID4077
Feature activation+7.29e+00

Pos logit contributions
 curled+0.041
 safe+0.032
 shady+0.031
 bark+0.030
 nib+0.030
Neg logit contributions
 narrative-0.118
father-0.115
els-0.115
parents-0.113
itt-0.113
Token park
Token ID3952
Feature activation+6.43e+00

Pos logit contributions
 safe+0.126
 curled+0.123
 cozy+0.122
 curious+0.115
 scary+0.114
Neg logit contributions
parents-0.174
father-0.173
ums-0.167
oral-0.162
els-0.161
Token,
Token ID11
Feature activation+3.15e+00

Pos logit contributions
 curled+0.073
 cozy+0.073
 safe+0.069
 hollow+0.062
 perched+0.062
Neg logit contributions
ums-0.188
parents-0.187
ips-0.180
ases-0.179
olds-0.179
Token there
Token ID612
Feature activation+1.50e+00

Pos logit contributions
 curled+0.032
 cozy+0.031
 safe+0.030
 shady+0.028
 nib+0.028
Neg logit contributions
 narrative-0.086
parents-0.086
ases-0.086
els-0.085
ips-0.084
Token was
Token ID373
Feature activation+5.35e+00

Pos logit contributions
parents+0.043
father+0.042
ases+0.041
els+0.040
pa+0.040
Neg logit contributions
 safe-0.015
 cozy-0.014
 curled-0.013
 curious-0.012
 scary-0.012
Token walking
Token ID6155
Feature activation+6.07e+00

Pos logit contributions
 curled+0.050
 safe+0.049
 cozy+0.046
 shady+0.045
 abandoned+0.044
Neg logit contributions
parents-0.165
ips-0.162
ums-0.161
father-0.161
ases-0.161
Token in
Token ID287
Feature activation+5.77e+00

Pos logit contributions
 cozy+0.079
 safe+0.076
 curled+0.073
 scary+0.073
 abandoned+0.072
Neg logit contributions
parents-0.172
ases-0.171
ips-0.170
ums-0.168
father-0.168
Token a
Token ID257
Feature activation+8.52e+00

Pos logit contributions
 curled+0.083
 safe+0.078
 cozy+0.070
 abandoned+0.069
 scary+0.069
Neg logit contributions
father-0.143
parents-0.140
pa-0.134
ases-0.132
ips-0.131
Token me
Token ID502
Feature activation+1.81e+00

Pos logit contributions
 curled+0.115
 but+0.095
 perched+0.094
 like+0.086
 as+0.085
Neg logit contributions
father-0.220
parents-0.220
ma-0.211
oral-0.210
ups-0.202
Tokenadow
Token ID4584
Feature activation+9.45e+00

Pos logit contributions
 safe+0.026
 cozy+0.025
 curled+0.024
 curious+0.023
 hidden+0.023
Neg logit contributions
parents-0.054
father-0.051
asers-0.051
ases-0.050
ums-0.050
Token when
Token ID618
Feature activation+4.43e+00

Pos logit contributions
 cozy+0.113
 scary+0.107
 safe+0.105
 curled+0.100
 creepy+0.100
Neg logit contributions
ums-0.261
parents-0.246
father-0.243
olds-0.237
ips-0.232
Token he
Token ID339
Feature activation+1.37e+00

Pos logit contributions
 safe+0.065
 curled+0.063
 cozy+0.059
 open+0.056
 stable+0.054
Neg logit contributions
parents-0.103
father-0.102
 End-0.099
pa-0.095
ips-0.095
Token saw
Token ID2497
Feature activation+2.67e+00

Pos logit contributions
 safe+0.012
 cozy+0.012
 curled+0.011
 curious+0.010
 hiding+0.010
Neg logit contributions
parents-0.048
father-0.046
asers-0.044
ips-0.044
ums-0.044
Token something
Token ID1223
Feature activation+5.17e+00

Pos logit contributions
 safe+0.019
 cozy+0.018
 curled+0.017
 hiding+0.014
 open+0.013
Neg logit contributions
parents-0.092
father-0.091
ums-0.087
ases-0.086
olds-0.085
Token strange
Token ID6283
Feature activation+3.35e+00

Pos logit contributions
 curled+0.054
 stable+0.049
 cozy+0.049
 safe+0.048
 hut+0.048
Neg logit contributions
asers-0.137
els-0.136
parents-0.135
olds-0.133
ums-0.132
Token and
Token ID290
Feature activation+4.66e+00

Pos logit contributions
 safe+0.072
 cozy+0.068
 curled+0.062
 scary+0.060
 hollow+0.057
Neg logit contributions
parents-0.187
father-0.185
ips-0.181
ases-0.180
olds-0.176
Token wandered
Token ID36036
Feature activation+4.59e+00

Pos logit contributions
 bone+0.068
 safe+0.065
 shady+0.065
 cozy+0.064
 boun+0.063
Neg logit contributions
parents-0.118
father-0.107
pa-0.106
ople-0.104
ups-0.102
Token into
Token ID656
Feature activation+5.04e+00

Pos logit contributions
 cozy+0.060
 curled+0.060
 abandoned+0.058
 curious+0.056
 safe+0.055
Neg logit contributions
ips-0.112
parents-0.108
olds-0.107
ases-0.106
father-0.105
Token a
Token ID257
Feature activation+8.02e+00

Pos logit contributions
 safe+0.072
 curled+0.070
 curious+0.062
 cozy+0.061
 filled+0.060
Neg logit contributions
parents-0.128
father-0.124
els-0.120
pa-0.118
ips-0.116
Token big
Token ID1263
Feature activation+7.82e+00

Pos logit contributions
 curled+0.118
 as+0.100
 but+0.100
 perched+0.099
 like+0.096
Neg logit contributions
ups-0.176
oral-0.175
parents-0.171
ma-0.171
father-0.171
Token garden
Token ID11376
Feature activation+9.40e+00

Pos logit contributions
 curled+0.121
 perched+0.108
 curious+0.103
 a+0.102
 safe+0.101
Neg logit contributions
pa-0.186
father-0.184
ma-0.180
parents-0.175
els-0.170
Token.
Token ID13
Feature activation+2.85e+00

Pos logit contributions
 cozy+0.131
 scary+0.129
 safe+0.125
 curled+0.120
 deer+0.117
Neg logit contributions
ums-0.239
parents-0.233
ips-0.230
olds-0.228
ups-0.227
Token It
Token ID632
Feature activation+1.21e+00

Pos logit contributions
 safe+0.046
 hiding+0.042
 curled+0.042
 cozy+0.042
 scary+0.042
Neg logit contributions
parents-0.070
ums-0.065
father-0.065
olds-0.065
uns-0.063
Token was
Token ID373
Feature activation+5.22e+00

Pos logit contributions
 safe+0.019
 cozy+0.017
 hello+0.017
 hut+0.016
 stable+0.016
Neg logit contributions
parents-0.028
ups-0.027
father-0.026
ma-0.024
ips-0.024
Token a
Token ID257
Feature activation+7.85e+00

Pos logit contributions
 hunter+0.047
 deer+0.046
 hello+0.046
 hut+0.045
 grave+0.043
Neg logit contributions
ums-0.129
parents-0.128
father-0.127
ips-0.125
 End-0.123
Token special
Token ID2041
Feature activation+4.07e+00

Pos logit contributions
 curled+0.097
 perched+0.095
 its+0.086
 a+0.075
 there+0.072
Neg logit contributions
oral-0.186
 narrative-0.175
parents-0.173
orter-0.171
 End-0.169
Token named
Token ID3706
Feature activation+1.34e+00

Pos logit contributions
 safe+0.030
 cozy+0.029
 curled+0.027
 curious+0.025
 hidden+0.024
Neg logit contributions
parents-0.094
father-0.090
asers-0.087
ases-0.087
ums-0.086
Token Ben
Token ID3932
Feature activation+3.57e-01

Pos logit contributions
 curled+0.019
 safe+0.019
 cozy+0.019
 perched+0.018
 curious+0.017
Neg logit contributions
parents-0.037
father-0.035
ases-0.035
uns-0.034
olds-0.034
Token found
Token ID1043
Feature activation+2.19e+00

Pos logit contributions
 safe+0.005
 cozy+0.005
 curled+0.005
 shady+0.005
 curious+0.004
Neg logit contributions
parents-0.010
father-0.010
pa-0.009
ases-0.009
els-0.009
Token a
Token ID257
Feature activation+8.28e+00

Pos logit contributions
 safe+0.005
 cozy+0.004
 curled+0.003
 curious+0.001
 hidden+0.001
Neg logit contributions
parents-0.091
father-0.088
asers-0.087
ases-0.086
ums-0.086
Token big
Token ID1263
Feature activation+6.92e+00

Pos logit contributions
 curled+0.123
 perched+0.110
 filled+0.097
 behind+0.096
 like+0.094
Neg logit contributions
oral-0.196
father-0.194
parents-0.192
orter-0.178
omm-0.171
Token chamber
Token ID11847
Feature activation+9.40e+00

Pos logit contributions
 curled+0.114
 perched+0.107
 safe+0.107
 cozy+0.104
 curious+0.099
Neg logit contributions
father-0.161
pa-0.154
parents-0.150
els-0.146
ma-0.143
Token in
Token ID287
Feature activation+3.28e+00

Pos logit contributions
 cozy+0.116
 curled+0.106
 safe+0.106
 sandy+0.095
 deer+0.093
Neg logit contributions
father-0.256
parents-0.256
ases-0.252
olds-0.252
ips-0.247
Token his
Token ID465
Feature activation+2.51e+00

Pos logit contributions
 safe+0.032
 cozy+0.030
 curled+0.029
 curious+0.026
 hidden+0.025
Neg logit contributions
parents-0.113
father-0.109
asers-0.105
ases-0.105
ips-0.104
Token house
Token ID2156
Feature activation+5.91e+00

Pos logit contributions
 curled+0.039
 perched+0.036
 safe+0.036
 filled+0.034
 but+0.033
Neg logit contributions
parents-0.068
father-0.067
ums-0.063
ips-0.060
olds-0.060
Token.
Token ID13
Feature activation+2.58e+00

Pos logit contributions
 cozy+0.056
 safe+0.056
 curled+0.052
 scary+0.045
 curious+0.045
Neg logit contributions
parents-0.201
father-0.193
ums-0.189
ases-0.188
ips-0.187
Token He
Token ID679
Feature activation+2.24e+00

Pos logit contributions
 safe+0.032
 cozy+0.029
 curled+0.028
 hidden+0.027
 hiding+0.026
Neg logit contributions
parents-0.078
ips-0.075
ums-0.075
ases-0.075
father-0.075
Token was
Token ID373
Feature activation+5.61e+00

Pos logit contributions
 cozy+0.067
 safe+0.066
 shady+0.061
 scary+0.059
 sandy+0.057
Neg logit contributions
parents-0.093
father-0.092
pa-0.086
emaker-0.084
ases-0.083
Token walking
Token ID6155
Feature activation+6.06e+00

Pos logit contributions
 curled+0.058
 safe+0.055
 cozy+0.054
 shady+0.051
 hut+0.050
Neg logit contributions
parents-0.170
ips-0.169
father-0.168
ums-0.167
ases-0.162
Token through
Token ID832
Feature activation+5.62e+00

Pos logit contributions
 cozy+0.081
 curled+0.080
 safe+0.079
 abandoned+0.076
 scary+0.073
Neg logit contributions
ases-0.166
parents-0.165
father-0.163
ips-0.163
ums-0.157
Token a
Token ID257
Feature activation+8.31e+00

Pos logit contributions
 curled+0.088
 safe+0.085
 curious+0.074
 but+0.074
 cozy+0.074
Neg logit contributions
father-0.131
parents-0.130
els-0.127
pa-0.120
ons-0.120
Token me
Token ID502
Feature activation+1.09e+00

Pos logit contributions
 curled+0.123
 but+0.102
 like+0.102
 perched+0.098
 hiding+0.097
Neg logit contributions
father-0.205
parents-0.202
ups-0.198
ma-0.193
oral-0.191
Tokenadow
Token ID4584
Feature activation+9.39e+00

Pos logit contributions
 safe+0.015
 cozy+0.014
 curled+0.014
 curious+0.013
 hidden+0.013
Neg logit contributions
parents-0.033
father-0.032
asers-0.032
emaker-0.031
ums-0.031
Token.
Token ID13
Feature activation+3.36e+00

Pos logit contributions
 cozy+0.129
 scary+0.124
 safe+0.120
 curled+0.115
 creepy+0.115
Neg logit contributions
ums-0.247
parents-0.235
father-0.234
ips-0.230
olds-0.222
Token It
Token ID632
Feature activation+1.52e+00

Pos logit contributions
 safe+0.062
 dangerous+0.057
 hidden+0.056
 fl+0.055
 scary+0.054
Neg logit contributions
parents-0.068
ums-0.068
anes-0.068
orter-0.067
father-0.066
Token was
Token ID373
Feature activation+5.51e+00

Pos logit contributions
 safe+0.024
 hut+0.022
 cozy+0.021
 badly+0.020
 abandoned+0.020
Neg logit contributions
parents-0.035
father-0.033
ma-0.031
ups-0.031
emaker-0.031
Token so
Token ID523
Feature activation+5.21e+00

Pos logit contributions
 hut+0.055
 deer+0.045
 grave+0.045
 curled+0.044
 hunter+0.043
Neg logit contributions
ums-0.141
parents-0.140
father-0.140
ips-0.135
ams-0.135
Token sunny
Token ID27737
Feature activation+4.22e+00

Pos logit contributions
 curled+0.044
 hiding+0.041
 safe+0.039
 perched+0.036
 but+0.036
Neg logit contributions
parents-0.162
ums-0.159
orter-0.155
olds-0.154
ugs-0.154
Token Jackie
Token ID24766
Feature activation+2.08e+00

Pos logit contributions
 safe+0.008
 cozy+0.008
 curled+0.007
 curious+0.007
 hidden+0.007
Neg logit contributions
parents-0.020
father-0.019
asers-0.019
emaker-0.019
ips-0.019
Token was
Token ID373
Feature activation+4.99e+00

Pos logit contributions
 safe+0.030
 cozy+0.028
 hollow+0.026
 shady+0.026
 scary+0.026
Neg logit contributions
parents-0.058
father-0.055
pa-0.055
els-0.054
emaker-0.053
Token playing
Token ID2712
Feature activation+5.03e+00

Pos logit contributions
 safe+0.062
 curled+0.061
 cozy+0.060
 shady+0.055
 scary+0.053
Neg logit contributions
parents-0.150
father-0.145
ips-0.142
ums-0.142
asers-0.141
Token in
Token ID287
Feature activation+5.64e+00

Pos logit contributions
 curled+0.059
 cozy+0.058
 hollow+0.058
 steep+0.057
 safe+0.056
Neg logit contributions
ases-0.142
ips-0.140
parents-0.140
father-0.136
els-0.132
Token her
Token ID607
Feature activation+3.06e+00

Pos logit contributions
 safe+0.082
 curled+0.079
 cozy+0.075
 curious+0.071
 scary+0.069
Neg logit contributions
parents-0.157
father-0.152
ips-0.146
ases-0.146
els-0.145
Token garden
Token ID11376
Feature activation+9.38e+00

Pos logit contributions
 curled+0.046
 safe+0.042
 perched+0.041
 but+0.039
 scary+0.039
Neg logit contributions
parents-0.082
father-0.080
ips-0.077
ums-0.076
els-0.074
Token,
Token ID11
Feature activation+5.43e+00

Pos logit contributions
 scary+0.122
 curled+0.120
 safe+0.118
 cozy+0.114
 deer+0.112
Neg logit contributions
ums-0.253
parents-0.245
ips-0.242
father-0.241
els-0.238
Token among
Token ID1871
Feature activation+1.69e+00

Pos logit contributions
 safe+0.083
 stable+0.077
 hut+0.077
 cozy+0.076
 shady+0.076
Neg logit contributions
parents-0.118
father-0.115
ma-0.113
emaker-0.113
pa-0.105
Token the
Token ID262
Feature activation+3.82e+00

Pos logit contributions
 safe+0.017
 cozy+0.017
 curled+0.016
 a+0.015
 curious+0.015
Neg logit contributions
parents-0.057
father-0.054
asers-0.053
ases-0.053
ums-0.053
Token flowers
Token ID12734
Feature activation+4.46e+00

Pos logit contributions
 curled+0.059
 safe+0.059
 cozy+0.055
 filled+0.052
 hiding+0.051
Neg logit contributions
parents-0.101
father-0.096
asers-0.095
ums-0.094
els-0.094
Token and
Token ID290
Feature activation+3.92e+00

Pos logit contributions
 safe+0.041
 cozy+0.040
 curled+0.037
 scary+0.035
 shady+0.031
Neg logit contributions
parents-0.146
ips-0.143
father-0.143
els-0.141
ums-0.141
Token a
Token ID257
Feature activation+4.81e+00

Pos logit contributions
 curled+0.022
 hollow+0.022
 abandoned+0.020
 safe+0.020
 cozy+0.018
Neg logit contributions
parents-0.079
father-0.076
ases-0.075
olds-0.073
emaker-0.072
Token time
Token ID640
Feature activation+2.34e+00

Pos logit contributions
 safe+0.085
 cozy+0.083
 curled+0.082
 curious+0.077
 hidden+0.077
Neg logit contributions
parents-0.122
father-0.116
ums-0.111
ases-0.111
ips-0.110
Token,
Token ID11
Feature activation+2.36e+00

Pos logit contributions
 safe+0.014
 cozy+0.013
 curled+0.012
 curious+0.009
 hidden+0.009
Neg logit contributions
parents-0.088
father-0.085
ums-0.083
ips-0.083
ases-0.083
Token in
Token ID287
Feature activation+3.62e+00

Pos logit contributions
 curled+0.020
 safe+0.018
 cozy+0.018
 abandoned+0.016
 shady+0.015
Neg logit contributions
parents-0.074
ips-0.073
ases-0.073
father-0.073
els-0.073
Token a
Token ID257
Feature activation+7.76e+00

Pos logit contributions
 curled+0.027
 safe+0.026
 abandoned+0.025
 cozy+0.023
 hollow+0.022
Neg logit contributions
parents-0.113
father-0.113
ums-0.107
ases-0.107
emaker-0.106
Token wide
Token ID3094
Feature activation+9.30e+00

Pos logit contributions
 curled+0.087
 perched+0.068
 but+0.067
 safe+0.062
 hollow+0.060
Neg logit contributions
parents-0.226
father-0.225
oral-0.211
ases-0.210
pa-0.209
Token country
Token ID1499
Feature activation+7.06e+00

Pos logit contributions
 curled+0.157
 safe+0.149
 perched+0.142
 curious+0.139
 cozy+0.138
Neg logit contributions
father-0.216
pa-0.207
parents-0.207
ums-0.199
ma-0.197
Token,
Token ID11
Feature activation+3.63e+00

Pos logit contributions
 curled+0.079
 cozy+0.074
 safe+0.068
 hollow+0.064
 scary+0.064
Neg logit contributions
parents-0.214
father-0.209
ums-0.208
olds-0.202
ma-0.202
Token there
Token ID612
Feature activation+1.24e+00

Pos logit contributions
 curled+0.043
 shady+0.039
 nib+0.039
 stable+0.037
 safe+0.036
Neg logit contributions
 narrative-0.091
parents-0.089
ma-0.088
els-0.087
father-0.087
Token was
Token ID373
Feature activation+5.20e+00

Pos logit contributions
 safe+0.017
 cozy+0.016
 curled+0.016
 scary+0.015
 curious+0.015
Neg logit contributions
parents-0.036
father-0.034
ums-0.033
ases-0.033
ips-0.033
Token a
Token ID257
Feature activation+8.55e+00

Pos logit contributions
 hollow+0.056
 safe+0.056
 curled+0.055
 hut+0.052
 perched+0.050
Neg logit contributions
parents-0.137
father-0.131
oral-0.127
 narrative-0.125
ases-0.123
Token He
Token ID679
Feature activation+1.66e+00

Pos logit contributions
 safe+0.039
 hidden+0.035
 cozy+0.034
 fl+0.034
 curled+0.034
Neg logit contributions
ums-0.060
father-0.059
parents-0.059
orter-0.056
ases-0.056
Token lived
Token ID5615
Feature activation+4.57e+00

Pos logit contributions
 safe+0.027
 shady+0.026
 cozy+0.025
 hollow+0.025
 hut+0.024
Neg logit contributions
parents-0.041
father-0.040
pa-0.038
affles-0.037
oral-0.036
Token in
Token ID287
Feature activation+3.73e+00

Pos logit contributions
 curled+0.048
 cozy+0.046
 safe+0.044
 scary+0.041
 shady+0.041
Neg logit contributions
ips-0.137
ases-0.132
father-0.131
parents-0.129
ums-0.128
Token a
Token ID257
Feature activation+7.33e+00

Pos logit contributions
 safe+0.020
 curled+0.020
 cozy+0.017
 curious+0.013
 hiding+0.013
Neg logit contributions
parents-0.135
father-0.134
ases-0.128
ums-0.126
emaker-0.125
Token big
Token ID1263
Feature activation+6.88e+00

Pos logit contributions
 curled+0.096
 but+0.085
 like+0.083
 perched+0.082
 as+0.077
Neg logit contributions
father-0.192
parents-0.185
oral-0.177
ma-0.176
ums-0.173
Token chamber
Token ID11847
Feature activation+9.29e+00

Pos logit contributions
 curled+0.099
 perched+0.090
 safe+0.088
 a+0.084
 curious+0.084
Neg logit contributions
father-0.187
pa-0.178
parents-0.177
ma-0.167
ums-0.166
Token far
Token ID1290
Feature activation+1.66e+00

Pos logit contributions
 curled+0.115
 cozy+0.115
 safe+0.105
 hollow+0.098
 scary+0.098
Neg logit contributions
father-0.250
ases-0.239
ips-0.238
ums-0.237
parents-0.237
Token away
Token ID1497
Feature activation+4.56e+00

Pos logit contributions
 cozy+0.031
 curled+0.030
 safe+0.027
 curious+0.027
 hiding+0.026
Neg logit contributions
ases-0.036
els-0.035
ople-0.034
emaker-0.034
father-0.034
Token.
Token ID13
Feature activation+2.54e+00

Pos logit contributions
 cozy+0.063
 curled+0.060
 safe+0.059
 curious+0.055
 boun+0.055
Neg logit contributions
iness-0.106
ases-0.106
ums-0.105
father-0.105
ips-0.105
Token Every
Token ID3887
Feature activation+7.23e-01

Pos logit contributions
 safe+0.038
 cozy+0.035
 curled+0.035
 hidden+0.034
 scary+0.033
Neg logit contributions
parents-0.072
father-0.069
ums-0.067
olds-0.065
uns-0.065
Token day
Token ID1110
Feature activation+2.21e+00

Pos logit contributions
 safe+0.018
 cozy+0.018
 curious+0.017
 abandoned+0.017
 curled+0.017
Neg logit contributions
parents-0.012
father-0.012
oral-0.012
emaker-0.010
iness-0.010
Token Chloe
Token ID29476
Feature activation+9.78e-01

Pos logit contributions
 safe+0.033
 hollow+0.033
 abandoned+0.031
 cozy+0.031
 shady+0.030
Neg logit contributions
parents-0.087
ips-0.084
ugs-0.083
father-0.083
ums-0.081
Token was
Token ID373
Feature activation+5.45e+00

Pos logit contributions
 cozy+0.017
 safe+0.016
 shady+0.016
 high+0.015
 boun+0.015
Neg logit contributions
parents-0.025
pa-0.023
father-0.023
ople-0.022
ups-0.022
Token walking
Token ID6155
Feature activation+5.04e+00

Pos logit contributions
 curled+0.084
 shady+0.082
 cozy+0.081
 hut+0.079
 deer+0.078
Neg logit contributions
parents-0.131
ums-0.130
ips-0.130
father-0.129
olds-0.123
Token through
Token ID832
Feature activation+4.74e+00

Pos logit contributions
 cozy+0.070
 curled+0.067
 safe+0.063
 creepy+0.059
 shady+0.058
Neg logit contributions
father-0.129
parents-0.128
ips-0.126
ases-0.124
ums-0.121
Token a
Token ID257
Feature activation+7.73e+00

Pos logit contributions
 safe+0.064
 curled+0.063
 cozy+0.060
 curious+0.053
 shady+0.051
Neg logit contributions
parents-0.133
father-0.130
els-0.123
ips-0.121
ases-0.120
Token garden
Token ID11376
Feature activation+9.28e+00

Pos logit contributions
 curled+0.121
 perched+0.095
 hello+0.093
 safe+0.092
 hiding+0.092
Neg logit contributions
father-0.186
ups-0.183
parents-0.180
orter-0.175
ma-0.172
Token and
Token ID290
Feature activation+5.09e+00

Pos logit contributions
 cozy+0.119
 safe+0.109
 scary+0.106
 curled+0.103
 cute+0.099
Neg logit contributions
ums-0.254
parents-0.251
father-0.250
ips-0.245
olds-0.236
Token she
Token ID673
Feature activation+2.75e+00

Pos logit contributions
 safe+0.068
 cozy+0.067
 shady+0.064
 hut+0.063
 cooler+0.062
Neg logit contributions
parents-0.142
father-0.132
olds-0.123
ups-0.122
pa-0.121
Token saw
Token ID2497
Feature activation+2.14e+00

Pos logit contributions
 safe+0.036
 cozy+0.034
 hut+0.030
 shady+0.030
 fluffy+0.030
Neg logit contributions
parents-0.081
father-0.076
ups-0.072
ums-0.071
ips-0.070
Token a
Token ID257
Feature activation+7.99e+00

Pos logit contributions
 safe+0.015
 cozy+0.013
 curled+0.013
 hiding+0.011
 open+0.010
Neg logit contributions
parents-0.076
father-0.074
asers-0.071
els-0.071
olds-0.071
Token big
Token ID1263
Feature activation+6.45e+00

Pos logit contributions
 curled+0.107
 safe+0.087
 its+0.083
 hiding+0.082
 filled+0.080
Neg logit contributions
parents-0.193
oral-0.193
orter-0.189
father-0.188
ums-0.184
Token Jess
Token ID12707
Feature activation-1.52e-01

Pos logit contributions
 safe+0.030
 cozy+0.028
 curled+0.027
 hidden+0.026
 scary+0.026
Neg logit contributions
parents-0.052
father-0.050
ums-0.048
ases-0.047
uns-0.047
Token lived
Token ID5615
Feature activation+4.99e+00

Pos logit contributions
parents+0.005
father+0.004
pa+0.004
els+0.004
ums+0.004
Neg logit contributions
 safe-0.002
 cozy-0.002
 shady-0.002
 curled-0.002
 hiding-0.002
Token in
Token ID287
Feature activation+4.32e+00

Pos logit contributions
 curled+0.050
 safe+0.046
 cozy+0.045
 scary+0.045
 abandoned+0.041
Neg logit contributions
ips-0.157
parents-0.151
father-0.150
ums-0.150
ases-0.149
Token a
Token ID257
Feature activation+7.42e+00

Pos logit contributions
 curled+0.029
 safe+0.027
 cozy+0.022
 perched+0.021
 shady+0.020
Neg logit contributions
parents-0.151
father-0.148
ases-0.141
ums-0.140
ips-0.138
Token big
Token ID1263
Feature activation+7.60e+00

Pos logit contributions
 curled+0.113
 perched+0.099
 but+0.097
 like+0.090
 as+0.088
Neg logit contributions
father-0.178
parents-0.174
oral-0.171
ma-0.170
ums-0.166
Token garden
Token ID11376
Feature activation+9.28e+00

Pos logit contributions
 curled+0.137
 perched+0.128
 safe+0.117
 tag+0.111
 filled+0.108
Neg logit contributions
father-0.182
pa-0.174
parents-0.173
ma-0.165
ums-0.160
Token with
Token ID351
Feature activation+2.69e+00

Pos logit contributions
 scary+0.110
 curled+0.104
 cozy+0.101
 safe+0.100
 creepy+0.094
Neg logit contributions
ums-0.268
parents-0.252
father-0.251
ips-0.250
olds-0.243
Token lots
Token ID6041
Feature activation+3.36e+00

Pos logit contributions
 safe+0.029
 cozy+0.029
 a+0.026
 curled+0.025
 curious+0.025
Neg logit contributions
parents-0.091
father-0.087
asers-0.086
ips-0.085
emaker-0.085
Token of
Token ID286
Feature activation+1.75e+00

Pos logit contributions
 safe+0.070
 cozy+0.068
 curled+0.068
 hiding+0.066
 hidden+0.064
Neg logit contributions
parents-0.074
father-0.070
asers-0.069
ums-0.068
olds-0.067
Token flowers
Token ID12734
Feature activation+4.50e+00

Pos logit contributions
 curled+0.031
 safe+0.030
 cozy+0.028
 filled+0.028
 hiding+0.028
Neg logit contributions
parents-0.043
asers-0.041
father-0.040
els-0.040
ums-0.040
Token and
Token ID290
Feature activation+3.36e+00

Pos logit contributions
 cozy+0.059
 safe+0.058
 curled+0.058
 scary+0.056
 abandoned+0.054
Neg logit contributions
els-0.118
ips-0.117
ums-0.117
asers-0.115
parents-0.114
Token big
Token ID1263
Feature activation+8.35e+00

Pos logit contributions
 curled+0.096
 perched+0.076
 but+0.073
 abandoned+0.066
 hollow+0.066
Neg logit contributions
father-0.226
parents-0.225
oral-0.212
ma-0.211
ases-0.211
Token forest
Token ID8222
Feature activation+7.73e+00

Pos logit contributions
 curled+0.142
 perched+0.123
 safe+0.120
 cozy+0.112
 abandoned+0.106
Neg logit contributions
father-0.213
pa-0.204
parents-0.203
ma-0.194
els-0.191
Token,
Token ID11
Feature activation+3.66e+00

Pos logit contributions
 cozy+0.085
 curled+0.083
 safe+0.079
 hollow+0.068
 shady+0.068
Neg logit contributions
ums-0.232
parents-0.231
father-0.228
ases-0.225
olds-0.224
Token there
Token ID612
Feature activation+1.64e+00

Pos logit contributions
 curled+0.039
 shady+0.035
 cozy+0.034
 safe+0.034
 stable+0.033
Neg logit contributions
 narrative-0.097
els-0.097
parents-0.096
father-0.096
ma-0.096
Token was
Token ID373
Feature activation+5.72e+00

Pos logit contributions
 safe+0.022
 cozy+0.022
 curled+0.020
 scary+0.019
 shady+0.019
Neg logit contributions
parents-0.048
father-0.046
ases-0.045
ums-0.044
ips-0.044
Token a
Token ID257
Feature activation+9.26e+00

Pos logit contributions
 curled+0.063
 safe+0.062
 hut+0.059
 hollow+0.059
 deer+0.059
Neg logit contributions
parents-0.139
father-0.138
ases-0.132
 End-0.130
oral-0.129
Token log
Token ID2604
Feature activation+7.17e+00

Pos logit contributions
 curled+0.141
 perched+0.124
 filled+0.104
 abandoned+0.101
 hiding+0.099
Neg logit contributions
oral-0.210
father-0.195
 End-0.191
parents-0.188
 narrative-0.188
Token.
Token ID13
Feature activation+1.96e+00

Pos logit contributions
 cozy+0.099
 safe+0.086
 curled+0.083
 scary+0.081
 shady+0.077
Neg logit contributions
parents-0.196
ums-0.194
father-0.193
ases-0.185
olds-0.185
Token The
Token ID383
Feature activation+3.16e-01

Pos logit contributions
 safe+0.031
 cozy+0.030
 curled+0.029
 hidden+0.028
 hiding+0.027
Neg logit contributions
parents-0.052
ases-0.051
olds-0.050
father-0.050
ums-0.049
Token log
Token ID2604
Feature activation+3.00e+00

Pos logit contributions
 safe+0.003
 cozy+0.003
 curled+0.003
 hiding+0.003
 filled+0.003
Neg logit contributions
parents-0.010
father-0.010
asers-0.010
ums-0.010
ases-0.010
Token was
Token ID373
Feature activation+5.06e+00

Pos logit contributions
 cozy+0.041
 safe+0.040
 curled+0.036
 shady+0.035
 scary+0.034
Neg logit contributions
parents-0.086
father-0.082
ums-0.079
olds-0.078
els-0.078
Token the
Token ID262
Feature activation+5.13e+00

Pos logit contributions
 safe+0.037
 cozy+0.036
 curled+0.035
 scary+0.032
 hiding+0.031
Neg logit contributions
parents-0.118
father-0.117
ips-0.111
ums-0.111
els-0.110
Token forest
Token ID8222
Feature activation+5.76e+00

Pos logit contributions
 curled+0.081
 safe+0.078
 cozy+0.073
 hiding+0.070
 curious+0.070
Neg logit contributions
parents-0.140
father-0.136
ums-0.126
pa-0.124
ips-0.123
Token,
Token ID11
Feature activation+3.59e+00

Pos logit contributions
 cozy+0.057
 safe+0.056
 curled+0.049
 shady+0.045
 scary+0.044
Neg logit contributions
parents-0.184
ums-0.177
father-0.175
olds-0.174
els-0.170
Token there
Token ID612
Feature activation+1.35e+00

Pos logit contributions
 stable+0.045
 cozy+0.041
 curled+0.041
 safe+0.041
 shady+0.039
Neg logit contributions
parents-0.087
ums-0.085
emaker-0.082
 End-0.082
father-0.081
Token was
Token ID373
Feature activation+6.04e+00

Pos logit contributions
 safe+0.019
 cozy+0.019
 scary+0.017
 curled+0.017
 curious+0.016
Neg logit contributions
parents-0.039
father-0.036
ums-0.036
ases-0.035
ips-0.035
Token a
Token ID257
Feature activation+9.25e+00

Pos logit contributions
 safe+0.058
 hut+0.057
 curled+0.056
 deer+0.053
 cozy+0.052
Neg logit contributions
parents-0.165
father-0.162
ums-0.156
ips-0.150
olds-0.149
Token bald
Token ID28273
Feature activation+4.21e+00

Pos logit contributions
 curled+0.130
 perched+0.114
 hiding+0.097
 its+0.096
 filled+0.091
Neg logit contributions
oral-0.230
father-0.222
parents-0.220
orter-0.217
 narrative-0.214
Token lion
Token ID18744
Feature activation+5.71e+00

Pos logit contributions
 cozy+0.068
 safe+0.067
 curled+0.063
 hiding+0.060
 hidden+0.060
Neg logit contributions
parents-0.109
father-0.108
emaker-0.106
ums-0.104
orter-0.103
Token.
Token ID13
Feature activation+2.30e+00

Pos logit contributions
 cozy+0.093
 safe+0.084
 shady+0.081
 comfortable+0.073
 scary+0.072
Neg logit contributions
father-0.152
parents-0.148
ases-0.144
ums-0.143
els-0.141
Token He
Token ID679
Feature activation+1.72e+00

Pos logit contributions
 safe+0.033
 cozy+0.032
 curled+0.031
 hiding+0.030
 hidden+0.030
Neg logit contributions
parents-0.065
father-0.063
ases-0.062
ums-0.062
orter-0.061
Token was
Token ID373
Feature activation+4.82e+00

Pos logit contributions
 safe+0.028
 cozy+0.027
 shady+0.025
 hut+0.024
 hollow+0.024
Neg logit contributions
parents-0.043
father-0.041
ases-0.039
pa-0.039
ons-0.038
Token Ben
Token ID3932
Feature activation+1.30e+00

Pos logit contributions
 stable+0.047
 cozy+0.046
 curled+0.046
 safe+0.044
 shady+0.044
Neg logit contributions
parents-0.100
ases-0.098
ums-0.098
emaker-0.098
ips-0.097
Token was
Token ID373
Feature activation+6.18e+00

Pos logit contributions
 safe+0.016
 cozy+0.016
 hollow+0.014
 curled+0.014
 scary+0.014
Neg logit contributions
parents-0.039
pa-0.037
father-0.037
ases-0.035
olds-0.034
Token playing
Token ID2712
Feature activation+5.29e+00

Pos logit contributions
 curled+0.077
 safe+0.077
 cozy+0.076
 shady+0.072
 scary+0.068
Neg logit contributions
parents-0.178
ums-0.174
ips-0.172
father-0.172
ases-0.168
Token in
Token ID287
Feature activation+5.53e+00

Pos logit contributions
 cozy+0.062
 curled+0.061
 hollow+0.061
 shady+0.057
 safe+0.056
Neg logit contributions
ases-0.151
parents-0.150
ips-0.146
father-0.145
emaker-0.142
Token his
Token ID465
Feature activation+2.73e+00

Pos logit contributions
 safe+0.075
 curled+0.070
 cozy+0.069
 curious+0.067
 scary+0.061
Neg logit contributions
parents-0.158
father-0.156
ases-0.150
ips-0.149
els-0.148
Token garden
Token ID11376
Feature activation+9.22e+00

Pos logit contributions
 curled+0.041
 safe+0.037
 perched+0.036
 but+0.035
 scary+0.035
Neg logit contributions
parents-0.072
father-0.071
ums-0.068
ips-0.067
pa-0.065
Token.
Token ID13
Feature activation+3.27e+00

Pos logit contributions
 cozy+0.109
 safe+0.108
 scary+0.107
 curled+0.107
 deer+0.095
Neg logit contributions
ums-0.264
parents-0.261
father-0.256
ips-0.251
ases-0.248
Token He
Token ID679
Feature activation+2.80e+00

Pos logit contributions
 safe+0.051
 perched+0.046
 cozy+0.045
 hidden+0.044
 abandoned+0.043
Neg logit contributions
ums-0.086
ips-0.084
parents-0.084
ases-0.081
uns-0.081
Token saw
Token ID2497
Feature activation+2.76e+00

Pos logit contributions
 safe+0.036
 cozy+0.033
 hollow+0.030
 curled+0.029
 hut+0.029
Neg logit contributions
parents-0.082
father-0.078
emaker-0.075
ases-0.075
els-0.074
Token a
Token ID257
Feature activation+8.61e+00

Pos logit contributions
 safe+0.015
 cozy+0.014
 curled+0.013
 hiding+0.009
 curious+0.009
Neg logit contributions
parents-0.103
father-0.101
ases-0.097
asers-0.097
emaker-0.096
Token big
Token ID1263
Feature activation+6.94e+00

Pos logit contributions
 curled+0.120
 perched+0.098
 safe+0.094
 its+0.092
 filled+0.090
Neg logit contributions
oral-0.209
parents-0.202
orter-0.200
father-0.197
asers-0.192
Token was
Token ID373
Feature activation+6.44e+00

Pos logit contributions
 safe+0.020
 cozy+0.019
 scary+0.017
 curious+0.017
 shady+0.017
Neg logit contributions
parents-0.045
pa-0.043
father-0.043
ases-0.041
els-0.040
Token playing
Token ID2712
Feature activation+5.39e+00

Pos logit contributions
 safe+0.070
 curled+0.070
 cozy+0.068
 shady+0.063
 scary+0.060
Neg logit contributions
parents-0.199
ums-0.192
father-0.192
ips-0.189
ases-0.186
Token outside
Token ID2354
Feature activation+7.04e+00

Pos logit contributions
 cozy+0.064
 curled+0.063
 hollow+0.062
 shady+0.059
 safe+0.059
Neg logit contributions
ases-0.151
parents-0.151
father-0.147
ips-0.147
emaker-0.141
Token in
Token ID287
Feature activation+4.53e+00

Pos logit contributions
 cozy+0.086
 safe+0.086
 curled+0.080
 scary+0.076
 hollow+0.074
Neg logit contributions
father-0.198
ases-0.198
parents-0.197
ums-0.195
ips-0.193
Token his
Token ID465
Feature activation+2.75e+00

Pos logit contributions
 curled+0.053
 safe+0.053
 cozy+0.049
 curious+0.046
 scary+0.045
Neg logit contributions
parents-0.137
father-0.134
ases-0.130
els-0.128
ips-0.127
Token garden
Token ID11376
Feature activation+9.21e+00

Pos logit contributions
 curled+0.040
 perched+0.036
 safe+0.036
 but+0.035
 scary+0.034
Neg logit contributions
parents-0.073
father-0.072
ums-0.070
ips-0.068
ases-0.068
Token.
Token ID13
Feature activation+3.50e+00

Pos logit contributions
 scary+0.121
 curled+0.120
 safe+0.119
 cozy+0.118
 creepy+0.105
Neg logit contributions
ums-0.257
parents-0.250
father-0.244
ips-0.238
ups-0.235
Token He
Token ID679
Feature activation+2.54e+00

Pos logit contributions
 safe+0.055
 perched+0.052
 fl+0.051
 curled+0.051
 cozy+0.049
Neg logit contributions
parents-0.089
ums-0.085
orter-0.085
father-0.083
ips-0.082
Token saw
Token ID2497
Feature activation+2.56e+00

Pos logit contributions
 safe+0.034
 cozy+0.030
 hut+0.028
 hollow+0.027
 curled+0.027
Neg logit contributions
parents-0.072
father-0.068
emaker-0.065
ases-0.065
orter-0.065
Token a
Token ID257
Feature activation+8.55e+00

Pos logit contributions
 safe+0.016
 cozy+0.015
 curled+0.014
 hiding+0.011
 curious+0.010
Neg logit contributions
parents-0.091
father-0.089
ases-0.086
asers-0.085
els-0.085
Token big
Token ID1263
Feature activation+6.88e+00

Pos logit contributions
 curled+0.118
 perched+0.095
 safe+0.093
 its+0.092
 filled+0.089
Neg logit contributions
oral-0.206
parents-0.199
orter-0.197
father-0.191
 narrative-0.189
Token.
Token ID13
Feature activation+1.97e+00

Pos logit contributions
 cozy+0.039
 safe+0.038
 curled+0.034
 scary+0.034
 shady+0.032
Neg logit contributions
parents-0.110
father-0.109
ums-0.104
els-0.102
ases-0.102
Token He
Token ID679
Feature activation+1.71e+00

Pos logit contributions
 safe+0.027
 curled+0.025
 cozy+0.025
 hidden+0.024
 hiding+0.024
Neg logit contributions
parents-0.057
father-0.056
ums-0.054
ases-0.054
orter-0.053
Token lived
Token ID5615
Feature activation+4.87e+00

Pos logit contributions
 safe+0.027
 cozy+0.025
 shady+0.025
 hollow+0.024
 hut+0.024
Neg logit contributions
parents-0.044
father-0.043
pa-0.041
oral-0.039
orter-0.039
Token in
Token ID287
Feature activation+3.83e+00

Pos logit contributions
 curled+0.049
 cozy+0.045
 safe+0.044
 scary+0.041
 shady+0.038
Neg logit contributions
ips-0.152
father-0.149
parents-0.149
ases-0.146
ums-0.146
Token a
Token ID257
Feature activation+7.05e+00

Pos logit contributions
 safe+0.016
 curled+0.015
 cozy+0.014
 hiding+0.010
 curious+0.009
Neg logit contributions
parents-0.148
father-0.145
ases-0.139
ums-0.139
asers-0.138
Token chamber
Token ID11847
Feature activation+9.20e+00

Pos logit contributions
 curled+0.098
 perched+0.080
 but+0.078
 like+0.072
 filled+0.067
Neg logit contributions
father-0.193
parents-0.190
oral-0.181
orter-0.178
ums-0.178
Token underground
Token ID11447
Feature activation+7.97e+00

Pos logit contributions
 curled+0.117
 cozy+0.109
 safe+0.102
 deer+0.097
 scary+0.097
Neg logit contributions
father-0.256
ums-0.247
parents-0.245
ma-0.242
ips-0.239
Token.
Token ID13
Feature activation+2.41e+00

Pos logit contributions
 curled+0.125
 cozy+0.103
 safe+0.101
 nib+0.101
 perched+0.097
Neg logit contributions
ups-0.192
els-0.187
ums-0.185
ases-0.184
olds-0.184
Token He
Token ID679
Feature activation+1.81e+00

Pos logit contributions
 safe+0.033
 cozy+0.031
 curled+0.031
 hidden+0.029
 hiding+0.029
Neg logit contributions
parents-0.071
father-0.068
ums-0.066
uns-0.066
orter-0.065
Token was
Token ID373
Feature activation+4.58e+00

Pos logit contributions
 safe+0.028
 cozy+0.026
 shady+0.024
 hollow+0.023
 fluffy+0.023
Neg logit contributions
parents-0.048
father-0.045
pa-0.043
oral-0.042
 narrative-0.041
Token very
Token ID845
Feature activation+5.62e+00

Pos logit contributions
 curled+0.053
 shady+0.051
 hut+0.048
 bone+0.048
 tent+0.046
Neg logit contributions
ums-0.114
father-0.113
parents-0.111
olds-0.106
ases-0.104
Tokenmy
Token ID1820
Feature activation+9.85e-01

Pos logit contributions
parents+0.050
father+0.047
ases+0.043
asers+0.043
ums+0.043
Neg logit contributions
 safe-0.074
 cozy-0.072
 curled-0.070
 curious-0.069
 scary-0.068
Token was
Token ID373
Feature activation+5.57e+00

Pos logit contributions
 safe+0.014
 cozy+0.013
 scary+0.012
 shady+0.012
 curious+0.012
Neg logit contributions
parents-0.028
father-0.027
pa-0.027
ases-0.025
ips-0.025
Token playing
Token ID2712
Feature activation+4.98e+00

Pos logit contributions
 curled+0.050
 shady+0.048
 safe+0.048
 cozy+0.046
 hut+0.046
Neg logit contributions
ips-0.168
ums-0.168
father-0.167
parents-0.167
ases-0.161
Token in
Token ID287
Feature activation+5.17e+00

Pos logit contributions
 cozy+0.063
 hollow+0.061
 scary+0.060
 curled+0.059
 abandoned+0.058
Neg logit contributions
parents-0.136
ips-0.135
ases-0.135
father-0.135
ums-0.130
Token his
Token ID465
Feature activation+2.86e+00

Pos logit contributions
 safe+0.065
 curled+0.063
 cozy+0.060
 curious+0.056
 scary+0.054
Neg logit contributions
parents-0.155
father-0.152
ases-0.144
els-0.143
pa-0.143
Token garden
Token ID11376
Feature activation+9.20e+00

Pos logit contributions
 curled+0.044
 safe+0.039
 perched+0.038
 scary+0.037
 but+0.036
Neg logit contributions
parents-0.076
father-0.074
ums-0.070
ips-0.069
pa-0.069
Token one
Token ID530
Feature activation+1.45e+00

Pos logit contributions
 scary+0.114
 curled+0.109
 cozy+0.107
 safe+0.106
 creepy+0.100
Neg logit contributions
ums-0.260
father-0.251
parents-0.251
ips-0.242
els-0.240
Token day
Token ID1110
Feature activation+3.75e+00

Pos logit contributions
 safe+0.022
 cozy+0.022
 curled+0.022
 curious+0.020
 hiding+0.020
Neg logit contributions
parents-0.040
father-0.039
asers-0.037
ums-0.037
ases-0.037
Token.
Token ID13
Feature activation+2.39e+00

Pos logit contributions
 curled+0.034
 cozy+0.034
 safe+0.034
 scary+0.028
 hiding+0.027
Neg logit contributions
parents-0.122
ums-0.120
father-0.119
uns-0.116
asers-0.116
Token He
Token ID679
Feature activation+2.38e+00

Pos logit contributions
 safe+0.033
 cozy+0.032
 curled+0.031
 hidden+0.029
 hiding+0.028
Neg logit contributions
parents-0.068
ums-0.065
ips-0.065
orter-0.065
uns-0.064
Token was
Token ID373
Feature activation+4.83e+00

Pos logit contributions
 safe+0.030
 cozy+0.028
 curled+0.024
 scary+0.024
 hut+0.023
Neg logit contributions
parents-0.072
father-0.068
els-0.066
ases-0.066
ips-0.065
Token time
Token ID640
Feature activation+2.24e+00

Pos logit contributions
 safe+0.089
 cozy+0.086
 curled+0.085
 curious+0.080
 hidden+0.079
Neg logit contributions
parents-0.122
father-0.116
ums-0.111
ases-0.111
ips-0.110
Token,
Token ID11
Feature activation+2.74e+00

Pos logit contributions
 safe+0.017
 cozy+0.017
 curled+0.016
 shady+0.012
 curious+0.012
Neg logit contributions
parents-0.079
father-0.077
ums-0.075
ips-0.075
ases-0.074
Token little
Token ID1310
Feature activation+4.01e+00

Pos logit contributions
 curled+0.030
 safe+0.029
 cozy+0.027
 shady+0.027
 bark+0.026
Neg logit contributions
els-0.076
father-0.074
ma-0.074
emaker-0.074
ips-0.073
Token Jack
Token ID3619
Feature activation-4.39e-01

Pos logit contributions
 safe+0.051
 cozy+0.050
 curled+0.049
 curious+0.044
 hidden+0.042
Neg logit contributions
parents-0.123
father-0.120
asers-0.115
emaker-0.115
ums-0.115
Token found
Token ID1043
Feature activation+3.02e+00

Pos logit contributions
parents+0.013
father+0.013
ases+0.013
pa+0.012
els+0.012
Neg logit contributions
 safe-0.006
 cozy-0.005
 curious-0.005
 curled-0.005
 scary-0.005
Token a
Token ID257
Feature activation+9.17e+00

Pos logit contributions
 safe+0.013
 cozy+0.012
 curled+0.011
 curious+0.008
 scary+0.007
Neg logit contributions
parents-0.118
father-0.115
asers-0.111
ums-0.111
ases-0.110
Token blue
Token ID4171
Feature activation+5.61e+00

Pos logit contributions
 curled+0.129
 perched+0.111
 behind+0.102
 like+0.099
 filled+0.096
Neg logit contributions
oral-0.216
parents-0.208
father-0.207
ma-0.202
orter-0.196
Token bath
Token ID7837
Feature activation+4.09e+00

Pos logit contributions
 safe+0.097
 cozy+0.093
 curious+0.089
 curled+0.087
 scary+0.084
Neg logit contributions
father-0.134
parents-0.134
els-0.132
ums-0.131
oral-0.127
Tokentub
Token ID37995
Feature activation+7.55e+00

Pos logit contributions
 cozy+0.075
 safe+0.073
 curled+0.069
 curious+0.069
 scary+0.068
Neg logit contributions
parents-0.102
father-0.096
ums-0.096
asers-0.095
ips-0.094
Token in
Token ID287
Feature activation+3.89e+00

Pos logit contributions
 cozy+0.089
 safe+0.082
 scary+0.080
 curled+0.074
 curious+0.070
Neg logit contributions
parents-0.225
ums-0.220
father-0.217
ases-0.213
els-0.212
Token the
Token ID262
Feature activation+3.69e+00

Pos logit contributions
 safe+0.044
 cozy+0.040
 curled+0.040
 curious+0.037
 scary+0.035
Neg logit contributions
parents-0.127
father-0.122
els-0.116
ases-0.116
asers-0.116
Tokenmy
Token ID1820
Feature activation+2.32e+00

Pos logit contributions
parents+0.017
father+0.016
ases+0.015
ips+0.015
asers+0.015
Neg logit contributions
 safe-0.029
 cozy-0.029
 curled-0.028
 curious-0.027
 scary-0.027
Token was
Token ID373
Feature activation+6.08e+00

Pos logit contributions
 safe+0.029
 cozy+0.028
 curled+0.026
 shady+0.025
 scary+0.025
Neg logit contributions
parents-0.071
father-0.068
ases-0.066
pa-0.065
emaker-0.065
Token playing
Token ID2712
Feature activation+5.15e+00

Pos logit contributions
 safe+0.067
 curled+0.066
 cozy+0.065
 shady+0.059
 scary+0.057
Neg logit contributions
parents-0.188
father-0.181
ums-0.181
ips-0.179
ases-0.177
Token in
Token ID287
Feature activation+5.61e+00

Pos logit contributions
 curled+0.057
 cozy+0.056
 hollow+0.055
 safe+0.052
 shady+0.051
Neg logit contributions
parents-0.151
ases-0.151
ips-0.148
father-0.147
emaker-0.144
Token his
Token ID465
Feature activation+2.66e+00

Pos logit contributions
 safe+0.070
 curled+0.066
 cozy+0.065
 curious+0.062
 scary+0.057
Neg logit contributions
parents-0.169
father-0.165
ases-0.159
ips-0.159
els-0.158
Token garden
Token ID11376
Feature activation+9.17e+00

Pos logit contributions
 curled+0.038
 safe+0.035
 perched+0.034
 but+0.032
 scary+0.032
Neg logit contributions
parents-0.073
father-0.072
ums-0.069
ips-0.068
ases-0.067
Token with
Token ID351
Feature activation+3.06e+00

Pos logit contributions
 cozy+0.105
 curled+0.105
 safe+0.104
 scary+0.103
 deer+0.090
Neg logit contributions
ums-0.265
parents-0.263
father-0.259
ips-0.254
ases-0.251
Token his
Token ID465
Feature activation+2.51e+00

Pos logit contributions
 safe+0.029
 cozy+0.028
 curled+0.026
 curious+0.024
 a+0.024
Neg logit contributions
parents-0.106
father-0.102
asers-0.100
ases-0.099
ums-0.099
Token toy
Token ID13373
Feature activation+4.46e+00

Pos logit contributions
 safe+0.037
 curled+0.037
 cozy+0.035
 filled+0.032
 curious+0.031
Neg logit contributions
parents-0.071
father-0.068
asers-0.067
ums-0.066
ases-0.065
Token cars
Token ID5006
Feature activation+3.68e+00

Pos logit contributions
 safe+0.072
 cozy+0.070
 curled+0.068
 curious+0.063
 scary+0.062
Neg logit contributions
parents-0.122
father-0.119
asers-0.117
els-0.116
ums-0.115
Token.
Token ID13
Feature activation+3.21e+00

Pos logit contributions
 cozy+0.036
 safe+0.034
 curled+0.032
 curious+0.029
 scary+0.028
Neg logit contributions
parents-0.125
father-0.121
ases-0.119
ips-0.119
asers-0.119
Token a
Token ID257
Feature activation+4.81e+00

Pos logit contributions
 abandoned+0.025
 hollow+0.025
 curled+0.025
 safe+0.022
 hut+0.021
Neg logit contributions
parents-0.084
father-0.081
ases-0.081
ips-0.077
emaker-0.077
Token time
Token ID640
Feature activation+2.44e+00

Pos logit contributions
 safe+0.089
 cozy+0.087
 curled+0.086
 curious+0.081
 hidden+0.080
Neg logit contributions
parents-0.116
father-0.110
ases-0.106
ums-0.106
ips-0.105
Token,
Token ID11
Feature activation+2.51e+00

Pos logit contributions
 safe+0.014
 cozy+0.013
 curled+0.012
 curious+0.009
 hidden+0.009
Neg logit contributions
parents-0.091
father-0.089
ums-0.087
ips-0.087
ases-0.087
Token in
Token ID287
Feature activation+4.02e+00

Pos logit contributions
 curled+0.024
 safe+0.021
 cozy+0.021
 abandoned+0.019
 shady+0.018
Neg logit contributions
ips-0.076
ases-0.076
els-0.076
father-0.075
emaker-0.075
Token a
Token ID257
Feature activation+8.05e+00

Pos logit contributions
 curled+0.031
 safe+0.029
 abandoned+0.029
 cozy+0.027
 hollow+0.025
Neg logit contributions
parents-0.123
father-0.123
ases-0.118
ips-0.117
ums-0.117
Token peaceful
Token ID12309
Feature activation+9.17e+00

Pos logit contributions
 curled+0.091
 perched+0.075
 but+0.072
 abandoned+0.066
 safe+0.065
Neg logit contributions
father-0.227
parents-0.226
ma-0.212
pa-0.212
ases-0.212
Token town
Token ID3240
Feature activation+6.32e+00

Pos logit contributions
 curled+0.146
 hollow+0.139
 curious+0.139
 safe+0.136
 perched+0.136
Neg logit contributions
pa-0.193
father-0.187
parents-0.187
ma-0.182
ople-0.179
Token,
Token ID11
Feature activation+3.45e+00

Pos logit contributions
 curled+0.064
 cozy+0.062
 safe+0.059
 hollow+0.052
 scary+0.050
Neg logit contributions
parents-0.201
ums-0.197
father-0.194
olds-0.189
ases-0.187
Token there
Token ID612
Feature activation+1.94e+00

Pos logit contributions
 curled+0.040
 nib+0.035
 shady+0.033
 safe+0.033
 deserted+0.033
Neg logit contributions
ma-0.088
 narrative-0.085
parents-0.085
father-0.084
els-0.083
Token was
Token ID373
Feature activation+5.34e+00

Pos logit contributions
 safe+0.027
 cozy+0.026
 curled+0.024
 scary+0.024
 curious+0.023
Neg logit contributions
parents-0.056
father-0.054
ums-0.052
ases-0.052
ips-0.051
Token a
Token ID257
Feature activation+8.73e+00

Pos logit contributions
 curled+0.057
 hollow+0.055
 safe+0.054
 hut+0.053
 deer+0.049
Neg logit contributions
parents-0.146
father-0.142
ips-0.135
ases-0.134
oral-0.134
Token't
Token ID470
Feature activation-2.80e+00

Pos logit contributions
parents+0.073
asers+0.072
father+0.070
ips+0.069
uns+0.069
Neg logit contributions
 safe-0.044
 open-0.041
 cozy-0.041
 a-0.040
 curled-0.040
Token have
Token ID423
Feature activation-4.87e+00

Pos logit contributions
parents+0.083
father+0.079
asers+0.078
ips+0.077
ases+0.076
Neg logit contributions
 safe-0.041
 cozy-0.038
 curled-0.037
 a-0.035
 open-0.035
Token all
Token ID477
Feature activation-4.10e+00

Pos logit contributions
parents+0.151
father+0.144
asers+0.141
ums+0.139
ases+0.139
Neg logit contributions
 safe-0.064
 cozy-0.061
 curled-0.058
 curious-0.055
 a-0.055
Token these
Token ID777
Feature activation-4.62e+00

Pos logit contributions
parents+0.131
father+0.125
asers+0.122
ases+0.121
ums+0.121
Neg logit contributions
 safe-0.049
 cozy-0.048
 curled-0.045
 curious-0.042
 hidden-0.042
Token ch
Token ID442
Feature activation-8.38e+00

Pos logit contributions
parents+0.133
father+0.128
asers+0.123
ases+0.123
ips+0.123
Neg logit contributions
 safe-0.071
 cozy-0.068
 curled-0.064
 a-0.062
 curious-0.062
Tokenocol
Token ID4668
Feature activation-1.33e+01

Pos logit contributions
parents+0.144
 narrative+0.136
father+0.132
oral+0.127
emaker+0.126
Neg logit contributions
 safe-0.197
 hiding-0.196
 and-0.194
 a-0.190
 curious-0.189
Tokenates
Token ID689
Feature activation-3.07e+00

Pos logit contributions
parents+0.217
oral+0.213
emaker+0.206
 narrative+0.206
orter+0.196
Neg logit contributions
 a-0.299
 in-0.298
 hiding-0.288
 perched-0.285
 inside-0.279
Token!
Token ID0
Feature activation-1.70e+00

Pos logit contributions
parents+0.098
father+0.093
asers+0.093
ases+0.091
ips+0.091
Neg logit contributions
 cozy-0.035
 safe-0.033
 curled-0.032
 curious-0.030
 hiding-0.029
Token Now
Token ID2735
Feature activation-1.82e-01

Pos logit contributions
parents+0.048
father+0.046
asers+0.045
emaker+0.044
els+0.044
Neg logit contributions
 safe-0.026
 cozy-0.025
 a-0.025
 curious-0.024
 curled-0.024
Token,
Token ID11
Feature activation+6.91e-01

Pos logit contributions
parents+0.005
father+0.005
ums+0.004
ases+0.004
olds+0.004
Neg logit contributions
 cozy-0.003
 curled-0.003
 safe-0.003
 curious-0.003
 abandoned-0.002
Token go
Token ID467
Feature activation-1.02e+00

Pos logit contributions
 cozy+0.011
 safe+0.011
 curled+0.011
 curious+0.010
 hidden+0.010
Neg logit contributions
parents-0.019
father-0.017
asers-0.017
ums-0.017
ases-0.017
Token and
Token ID290
Feature activation+1.62e+00

Pos logit contributions
parents+0.034
father+0.032
asers+0.032
ips+0.031
els+0.031
Neg logit contributions
 cozy-0.012
 curled-0.011
 safe-0.011
 scary-0.011
 abandoned-0.010
Token he
Token ID339
Feature activation-1.12e+00

Pos logit contributions
 cozy+0.022
 safe+0.022
 curled+0.020
 hut+0.019
 hiding+0.019
Neg logit contributions
parents-0.047
father-0.043
olds-0.042
asers-0.042
ases-0.042
Token didn
Token ID1422
Feature activation-4.72e+00

Pos logit contributions
parents+0.033
father+0.031
asers+0.030
ases+0.029
olds+0.029
Neg logit contributions
 safe-0.016
 cozy-0.016
 hiding-0.014
 curled-0.014
 shady-0.014
Token't
Token ID470
Feature activation-2.31e+00

Pos logit contributions
parents+0.112
father+0.105
asers+0.104
uns+0.102
ums+0.098
Neg logit contributions
 safe-0.098
 a-0.095
 cozy-0.094
 open-0.092
 curled-0.091
Token want
Token ID765
Feature activation-4.42e+00

Pos logit contributions
parents+0.061
father+0.058
asers+0.056
ases+0.056
ums+0.055
Neg logit contributions
 safe-0.040
 cozy-0.040
 curled-0.038
 curious-0.036
 hidden-0.036
Token L
Token ID406
Feature activation-1.33e+01

Pos logit contributions
parents+0.168
father+0.163
asers+0.160
ums+0.158
emaker+0.158
Neg logit contributions
 safe-0.027
 cozy-0.024
 a-0.023
 curled-0.022
 hidden-0.020
Tokenyl
Token ID2645
Feature activation-9.22e+00

Pos logit contributions
father+0.309
parents+0.306
emaker+0.289
 narrative+0.288
uns+0.288
Neg logit contributions
 hidden-0.238
 there-0.235
 a-0.234
 hiding-0.228
 open-0.225
Tokena
Token ID64
Feature activation-3.91e+00

Pos logit contributions
parents+0.215
father+0.204
ums+0.203
ips+0.203
 narrative+0.201
Neg logit contributions
 safe-0.184
 a-0.175
 curled-0.168
 cozy-0.167
 hidden-0.166
Token to
Token ID284
Feature activation-2.80e+00

Pos logit contributions
parents+0.125
father+0.119
asers+0.118
olds+0.117
ases+0.117
Neg logit contributions
 cozy-0.043
 safe-0.041
 curled-0.040
 hidden-0.036
 hiding-0.035
Token be
Token ID307
Feature activation-2.82e+00

Pos logit contributions
parents+0.075
father+0.070
ases+0.069
ums+0.068
olds+0.067
Neg logit contributions
 cozy-0.045
 safe-0.044
 curled-0.043
 curious-0.041
 hidden-0.040
Token unhappy
Token ID19283
Feature activation-1.19e+00

Pos logit contributions
parents+0.084
father+0.082
ases+0.081
asers+0.080
ips+0.080
Neg logit contributions
 curled-0.030
 cozy-0.029
 hut-0.027
 shady-0.027
 safe-0.025
Tokenff
Token ID487
Feature activation-3.65e+00

Pos logit contributions
parents+0.137
 narrative+0.131
father+0.122
ases+0.122
asers+0.121
Neg logit contributions
 safe-0.183
 curled-0.177
 a-0.175
 filled-0.173
,-0.173
Tokenron
Token ID1313
Feature activation-1.54e+00

Pos logit contributions
parents+0.093
father+0.088
asers+0.086
ips+0.085
ases+0.084
Neg logit contributions
 safe-0.067
 cozy-0.064
 curled-0.063
 curious-0.061
 a-0.060
Token and
Token ID290
Feature activation+9.44e-04

Pos logit contributions
parents+0.053
father+0.051
asers+0.050
ums+0.049
uns+0.049
Neg logit contributions
 safe-0.014
 cozy-0.014
 curled-0.012
 hiding-0.012
 hidden-0.011
Token vanilla
Token ID16858
Feature activation-1.38e+00

Pos logit contributions
 safe+0.000
 cozy+0.000
 curled+0.000
 curious+0.000
 hidden+0.000
Neg logit contributions
parents-0.000
father-0.000
asers-0.000
ases-0.000
ums-0.000
Token and
Token ID290
Feature activation+3.51e-01

Pos logit contributions
parents+0.050
father+0.048
asers+0.047
ums+0.047
ases+0.047
Neg logit contributions
 safe-0.010
 cozy-0.010
 curled-0.009
 curious-0.008
 hidden-0.008
Token lic
Token ID3476
Feature activation-1.32e+01

Pos logit contributions
 safe+0.005
 cozy+0.005
 curled+0.005
 hiding+0.004
 curious+0.004
Neg logit contributions
parents-0.010
father-0.010
asers-0.010
uns-0.009
ases-0.009
Tokenor
Token ID273
Feature activation-8.40e+00

Pos logit contributions
parents+0.153
father+0.136
 narrative+0.127
ums+0.126
orter+0.125
Neg logit contributions
 curled-0.390
 safe-0.390
 there-0.388
 a-0.382
 perched-0.379
Tokenice
Token ID501
Feature activation-3.12e+00

Pos logit contributions
parents+0.197
father+0.186
asers+0.178
ips+0.177
ases+0.177
Neg logit contributions
 safe-0.171
 cozy-0.166
 curled-0.162
 a-0.158
 curious-0.156
Token and
Token ID290
Feature activation+5.78e-01

Pos logit contributions
parents+0.108
father+0.104
asers+0.102
ums+0.101
uns+0.101
Neg logit contributions
 safe-0.027
 cozy-0.027
 curled-0.025
 curious-0.022
 hiding-0.022
Token star
Token ID3491
Feature activation-3.54e+00

Pos logit contributions
 safe+0.009
 cozy+0.008
 curled+0.008
 curious+0.008
 hidden+0.008
Neg logit contributions
parents-0.016
father-0.016
asers-0.015
ums-0.015
ases-0.015
Token an
Token ID281
Feature activation-1.06e+00

Pos logit contributions
parents+0.103
father+0.099
asers+0.096
ases+0.095
ums+0.095
Neg logit contributions
 safe-0.052
 cozy-0.050
 curled-0.048
 curious-0.045
 hidden-0.045
Token treasure
Token ID14068
Feature activation-1.44e+00

Pos logit contributions
parents+0.056
father+0.054
asers+0.052
ases+0.052
ips+0.052
Neg logit contributions
 safe-0.038
 cozy-0.037
 curled-0.035
 curious-0.034
 hidden-0.034
Token was
Token ID373
Feature activation+2.12e+00

Pos logit contributions
parents+0.045
father+0.042
asers+0.040
ums+0.040
els+0.040
Neg logit contributions
 cozy-0.019
 safe-0.018
 curled-0.016
 curious-0.016
 scary-0.016
Token so
Token ID523
Feature activation+3.22e+00

Pos logit contributions
 cozy+0.019
 curled+0.018
 safe+0.018
 hut+0.018
 curious+0.016
Neg logit contributions
parents-0.068
father-0.066
asers-0.063
ums-0.063
els-0.063
Token special
Token ID2041
Feature activation+3.06e-01

Pos logit contributions
 safe+0.031
 curled+0.031
 cozy+0.029
 hiding+0.028
 hut+0.025
Neg logit contributions
parents-0.106
father-0.100
ums-0.100
olds-0.100
asers-0.099
Token that
Token ID326
Feature activation-1.03e+00

Pos logit contributions
 cozy+0.004
 curled+0.004
 safe+0.004
 curious+0.003
 hollow+0.003
Neg logit contributions
parents-0.009
father-0.008
asers-0.008
ums-0.008
orter-0.008
Token Ab
Token ID2275
Feature activation-1.29e+01

Pos logit contributions
parents+0.029
father+0.028
ums+0.027
asers+0.026
uns+0.026
Neg logit contributions
 safe-0.016
 cozy-0.015
 curled-0.015
 curious-0.014
 hiding-0.014
Tokenbie
Token ID12590
Feature activation-5.13e+00

Pos logit contributions
parents+0.260
asers+0.245
emaker+0.243
ases+0.242
orter+0.241
Neg logit contributions
 curled-0.277
 hiding-0.272
 cozy-0.269
 hidden-0.267
 curious-0.267
Token decide
Token ID5409
Feature activation+9.45e-02

Pos logit contributions
parents+0.165
father+0.158
asers+0.153
ums+0.151
ases+0.150
Neg logit contributions
 safe-0.061
 cozy-0.060
 curled-0.054
 scary-0.051
 curious-0.051
Token to
Token ID284
Feature activation-1.92e+00

Pos logit contributions
 cozy+0.001
 safe+0.001
 curled+0.001
 scary+0.001
 hut+0.001
Neg logit contributions
parents-0.003
father-0.003
asers-0.003
olds-0.003
ums-0.003
Token keep
Token ID1394
Feature activation-2.76e+00

Pos logit contributions
parents+0.054
father+0.052
asers+0.049
ases+0.049
ums+0.049
Neg logit contributions
 safe-0.030
 cozy-0.029
 curled-0.028
 curious-0.027
 scary-0.026
Token it
Token ID340
Feature activation-2.34e-01

Pos logit contributions
parents+0.090
father+0.086
els+0.083
asers+0.082
ips+0.082
Neg logit contributions
 curled-0.030
 cozy-0.029
 safe-0.029
 shady-0.025
 curious-0.025
Token and
Token ID290
Feature activation+2.16e+00

Pos logit contributions
parents+0.029
father+0.028
ases+0.027
asers+0.027
ums+0.027
Neg logit contributions
 safe-0.013
 cozy-0.012
 curled-0.012
 abandoned-0.012
 hidden-0.011
Token laughed
Token ID13818
Feature activation+7.12e-01

Pos logit contributions
 safe+0.041
 cozy+0.039
 curled+0.036
 hidden+0.036
 hut+0.036
Neg logit contributions
parents-0.051
father-0.048
ases-0.045
ips-0.045
els-0.045
Token and
Token ID290
Feature activation+2.13e+00

Pos logit contributions
 abandoned+0.013
 hollow+0.012
 curled+0.012
 locked+0.011
 clearing+0.011
Neg logit contributions
ums-0.016
ases-0.015
ips-0.015
els-0.014
father-0.014
Token ate
Token ID15063
Feature activation-1.12e+00

Pos logit contributions
 safe+0.038
 cozy+0.036
 hut+0.036
 hidden+0.035
 curled+0.035
Neg logit contributions
parents-0.049
father-0.045
ases-0.044
ips-0.044
pa-0.043
Token ch
Token ID442
Feature activation-6.39e+00

Pos logit contributions
parents+0.030
els+0.030
ases+0.030
ips+0.029
ums+0.029
Neg logit contributions
 abandoned-0.015
 curled-0.014
 safe-0.013
 hollow-0.013
 hidden-0.013
Tokenocol
Token ID4668
Feature activation-1.28e+01

Pos logit contributions
parents+0.114
father+0.095
 narrative+0.091
oral+0.090
emaker+0.079
Neg logit contributions
 high-0.167
 cozy-0.166
 safe-0.165
 a-0.164
 curious-0.163
Tokenates
Token ID689
Feature activation-1.64e+00

Pos logit contributions
parents+0.217
oral+0.210
 narrative+0.201
orter+0.200
emaker+0.191
Neg logit contributions
 a-0.308
 in-0.290
 hiding-0.288
 perched-0.284
 inside-0.282
Token.
Token ID13
Feature activation+3.80e-01

Pos logit contributions
parents+0.051
asers+0.049
els+0.049
uns+0.049
father+0.049
Neg logit contributions
 cozy-0.017
 safe-0.017
 abandoned-0.016
 curled-0.016
 scary-0.015
Token They
Token ID1119
Feature activation-3.31e-01

Pos logit contributions
 safe+0.006
 curled+0.006
 cozy+0.005
 hidden+0.005
 hiding+0.005
Neg logit contributions
parents-0.010
ums-0.010
ips-0.010
ases-0.010
olds-0.010
Token had
Token ID550
Feature activation-2.90e-01

Pos logit contributions
parents+0.009
father+0.008
asers+0.008
ases+0.008
emaker+0.008
Neg logit contributions
 safe-0.005
 cozy-0.005
 hiding-0.005
 curled-0.005
 shady-0.005
Token a
Token ID257
Feature activation+5.12e+00

Pos logit contributions
parents+0.010
father+0.010
ases+0.010
ips+0.009
asers+0.009
Neg logit contributions
 safe-0.002
 curled-0.002
 cozy-0.002
 hut-0.002
 hollow-0.001
Token learn
Token ID2193
Feature activation-2.37e+00

Pos logit contributions
parents+0.040
father+0.038
ases+0.036
ums+0.036
asers+0.036
Neg logit contributions
 cozy-0.022
 safe-0.022
 curled-0.021
 curious-0.020
 hidden-0.019
Token more
Token ID517
Feature activation-3.04e+00

Pos logit contributions
parents+0.073
olds+0.071
father+0.070
asers+0.069
ases+0.069
Neg logit contributions
 cozy-0.027
 curled-0.027
 safe-0.025
 curious-0.024
 hiding-0.023
Token about
Token ID546
Feature activation-3.90e+00

Pos logit contributions
parents+0.086
olds+0.086
asers+0.082
father+0.082
oral+0.082
Neg logit contributions
 curled-0.042
 cozy-0.040
 filled-0.038
 safe-0.038
 hollow-0.035
Token rh
Token ID9529
Feature activation-7.61e+00

Pos logit contributions
parents+0.119
father+0.114
asers+0.110
ases+0.109
ums+0.108
Neg logit contributions
 safe-0.051
 cozy-0.050
 curled-0.049
 curious-0.045
 filled-0.044
Tokenin
Token ID259
Feature activation-3.56e+00

Pos logit contributions
parents+0.137
uns+0.123
ums+0.122
omm+0.121
father+0.121
Neg logit contributions
 safe-0.182
 far-0.181
 hidden-0.177
 there-0.177
 hiding-0.173
Tokenoc
Token ID420
Feature activation-1.27e+01

Pos logit contributions
parents+0.106
father+0.101
asers+0.099
ases+0.097
ums+0.097
Neg logit contributions
 safe-0.050
 cozy-0.050
 curled-0.047
 curious-0.044
 hidden-0.044
Tokener
Token ID263
Feature activation-3.90e+00

Pos logit contributions
 narrative+0.253
parents+0.250
emaker+0.236
iness+0.235
oral+0.235
Neg logit contributions
 hiding-0.234
,-0.232
 a-0.232
 safe-0.231
 and-0.219
Tokenoses
Token ID4629
Feature activation-3.08e+00

Pos logit contributions
parents+0.140
father+0.134
asers+0.132
ums+0.130
ases+0.130
Neg logit contributions
 safe-0.031
 cozy-0.030
 curled-0.028
 curious-0.025
 hidden-0.024
Token and
Token ID290
Feature activation-2.07e-02

Pos logit contributions
parents+0.110
father+0.105
asers+0.103
ases+0.102
ips+0.102
Neg logit contributions
 cozy-0.025
 safe-0.024
 curled-0.022
 curious-0.019
 hidden-0.018
Token respect
Token ID2461
Feature activation-4.95e+00

Pos logit contributions
parents+0.001
father+0.001
asers+0.000
ases+0.000
ums+0.000
Neg logit contributions
 safe-0.000
 cozy-0.000
 curled-0.000
 curious-0.000
 hidden-0.000
Token them
Token ID606
Feature activation-3.82e+00

Pos logit contributions
parents+0.157
father+0.150
asers+0.148
ips+0.147
ums+0.146
Neg logit contributions
 safe-0.062
 cozy-0.058
 curled-0.054
 a-0.053
 hidden-0.051
Token to
Token ID284
Feature activation-3.34e+00

Pos logit contributions
parents+0.166
father+0.160
asers+0.157
ases+0.155
ums+0.155
Neg logit contributions
 safe-0.034
 cozy-0.032
 curled-0.029
 curious-0.026
 hidden-0.025
Token see
Token ID766
Feature activation-3.19e+00

Pos logit contributions
parents+0.087
father+0.083
ases+0.081
ums+0.080
olds+0.078
Neg logit contributions
 cozy-0.054
 safe-0.054
 curled-0.052
 curious-0.051
 hidden-0.050
Token the
Token ID262
Feature activation-2.41e+00

Pos logit contributions
parents+0.094
father+0.091
els+0.088
ases+0.088
asers+0.086
Neg logit contributions
 curled-0.042
 safe-0.041
 cozy-0.041
 filled-0.037
 curious-0.037
Token rh
Token ID9529
Feature activation-8.99e+00

Pos logit contributions
parents+0.074
father+0.071
asers+0.069
ases+0.068
ums+0.068
Neg logit contributions
 safe-0.032
 cozy-0.031
 curled-0.030
 curious-0.028
 hidden-0.027
Tokenin
Token ID259
Feature activation-4.71e+00

Pos logit contributions
parents+0.181
uns+0.167
ums+0.165
father+0.161
olds+0.160
Neg logit contributions
 safe-0.202
 hidden-0.197
 there-0.196
 far-0.193
 a-0.189
Tokenoc
Token ID420
Feature activation-1.26e+01

Pos logit contributions
parents+0.125
father+0.119
asers+0.116
ases+0.114
ums+0.114
Neg logit contributions
 safe-0.082
 cozy-0.080
 curled-0.077
 curious-0.073
 hidden-0.073
Tokeneros
Token ID27498
Feature activation-2.07e+00

Pos logit contributions
parents+0.310
asers+0.306
ips+0.296
 narrative+0.296
uns+0.294
Neg logit contributions
 safe-0.196
 hiding-0.190
 furry-0.182
 bark-0.181
 hut-0.180
Token anymore
Token ID7471
Feature activation-2.16e+00

Pos logit contributions
parents+0.055
ases+0.054
father+0.053
els+0.052
ips+0.052
Neg logit contributions
 cozy-0.030
 curled-0.029
 abandoned-0.028
 safe-0.028
 hollow-0.027
Token.
Token ID13
Feature activation-6.87e-01

Pos logit contributions
parents+0.060
ases+0.058
ips+0.057
father+0.057
asers+0.056
Neg logit contributions
 cozy-0.030
 curled-0.029
 safe-0.029
 hollow-0.028
 abandoned-0.027
Token He
Token ID679
Feature activation-1.49e+00

Pos logit contributions
parents+0.017
ases+0.016
father+0.016
ums+0.016
olds+0.016
Neg logit contributions
 safe-0.012
 hiding-0.012
 hidden-0.012
 cozy-0.011
 curled-0.011
Token is
Token ID318
Feature activation+5.72e-01

Pos logit contributions
parents+0.036
father+0.033
olds+0.032
ases+0.032
asers+0.032
Neg logit contributions
 cozy-0.027
 safe-0.027
 shady-0.026
 abandoned-0.025
 curled-0.025
Token not
Token ID407
Feature activation-3.05e+00

Pos logit contributions
parents+0.031
father+0.030
asers+0.029
ases+0.029
ips+0.029
Neg logit contributions
 safe-0.014
 cozy-0.014
 curled-0.013
 a-0.012
 curious-0.012
Token eat
Token ID4483
Feature activation-4.35e+00

Pos logit contributions
parents+0.088
father+0.084
asers+0.082
ases+0.081
ums+0.081
Neg logit contributions
 safe-0.045
 cozy-0.044
 curled-0.042
 curious-0.040
 hidden-0.039
Token too
Token ID1165
Feature activation-2.38e+00

Pos logit contributions
parents+0.120
els+0.115
asers+0.115
father+0.114
ases+0.113
Neg logit contributions
 curled-0.063
 safe-0.062
 cozy-0.061
 curious-0.056
 filled-0.056
Token many
Token ID867
Feature activation-3.53e+00

Pos logit contributions
parents+0.074
father+0.071
asers+0.070
ases+0.069
ums+0.069
Neg logit contributions
 safe-0.029
 cozy-0.028
 curled-0.028
 hiding-0.025
 curious-0.025
Token ch
Token ID442
Feature activation-6.70e+00

Pos logit contributions
parents+0.097
father+0.093
asers+0.091
ases+0.089
ums+0.088
Neg logit contributions
 safe-0.057
 cozy-0.056
 curled-0.054
 curious-0.051
 hidden-0.050
Tokenocol
Token ID4668
Feature activation-1.26e+01

Pos logit contributions
parents+0.111
father+0.091
 narrative+0.090
oral+0.088
orter+0.078
Neg logit contributions
 high-0.185
 safe-0.183
 far-0.182
 hiding-0.179
 curious-0.178
Tokenates
Token ID689
Feature activation-2.88e+00

Pos logit contributions
parents+0.195
oral+0.179
orter+0.174
 narrative+0.166
olds+0.163
Neg logit contributions
 a-0.314
 in-0.309
 perched-0.304
 hiding-0.302
 there-0.291
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
parents+0.096
father+0.092
asers+0.092
ums+0.090
uns+0.090
Neg logit contributions
 cozy-0.028
 safe-0.027
 curled-0.026
 curious-0.024
 hiding-0.023
Token his
Token ID465
Feature activation-7.22e-01

Pos logit contributions
 safe+0.013
 cozy+0.013
 curled+0.013
 curious+0.012
 hidden+0.012
Neg logit contributions
parents-0.030
father-0.029
asers-0.028
ases-0.028
olds-0.028
Token friend
Token ID1545
Feature activation-6.67e-01

Pos logit contributions
parents+0.020
father+0.019
asers+0.018
ips+0.018
ases+0.018
Neg logit contributions
 safe-0.012
 cozy-0.012
 curled-0.011
 curious-0.011
 a-0.011
Token learned
Token ID4499
Feature activation-1.71e+00

Pos logit contributions
parents+0.021
father+0.020
asers+0.019
ums+0.019
ases+0.019
Neg logit contributions
 safe-0.008
 cozy-0.008
 curled-0.008
 hidden-0.007
 hiding-0.007
Tokenbled
Token ID9342
Feature activation-2.82e+00

Pos logit contributions
parents+0.054
father+0.048
asers+0.044
ases+0.042
uns+0.042
Neg logit contributions
 safe-0.149
 cozy-0.147
 curled-0.144
 a-0.141
 curious-0.141
Token up
Token ID510
Feature activation-3.22e-01

Pos logit contributions
parents+0.094
father+0.090
asers+0.088
ases+0.087
ums+0.087
Neg logit contributions
 safe-0.030
 cozy-0.029
 curled-0.027
 a-0.026
 hidden-0.025
Token all
Token ID477
Feature activation-2.30e+00

Pos logit contributions
parents+0.009
els+0.009
ums+0.009
father+0.009
ips+0.009
Neg logit contributions
 safe-0.004
 cozy-0.004
 curled-0.004
 abandoned-0.004
 hiding-0.003
Token the
Token ID262
Feature activation-3.11e+00

Pos logit contributions
parents+0.083
father+0.080
asers+0.079
ums+0.078
ases+0.077
Neg logit contributions
 safe-0.019
 cozy-0.018
 curled-0.016
 a-0.016
 hidden-0.015
Token ch
Token ID442
Feature activation-7.42e+00

Pos logit contributions
parents+0.088
father+0.084
asers+0.082
ases+0.081
ums+0.081
Neg logit contributions
 safe-0.049
 cozy-0.048
 curled-0.046
 curious-0.043
 hidden-0.043
Tokenocol
Token ID4668
Feature activation-1.26e+01

Pos logit contributions
parents+0.130
father+0.110
 narrative+0.109
oral+0.102
emaker+0.097
Neg logit contributions
 safe-0.182
 open-0.180
 hidden-0.176
 hiding-0.174
 high-0.173
Tokenates
Token ID689
Feature activation-2.03e+00

Pos logit contributions
parents+0.190
emaker+0.172
orter+0.171
olds+0.165
oral+0.164
Neg logit contributions
 in-0.298
 a-0.297
 perched-0.289
 hiding-0.288
 inside-0.286
Token,
Token ID11
Feature activation+3.24e+00

Pos logit contributions
parents+0.065
ums+0.063
ips+0.063
father+0.063
els+0.063
Neg logit contributions
 cozy-0.019
 scary-0.017
 safe-0.017
 curled-0.016
 curious-0.016
Token and
Token ID290
Feature activation+2.34e+00

Pos logit contributions
 hut+0.031
 safe+0.031
 scary+0.030
 cozy+0.030
 curled+0.028
Neg logit contributions
parents-0.096
father-0.091
ums-0.088
ips-0.088
olds-0.087
Token she
Token ID673
Feature activation+1.99e-01

Pos logit contributions
 safe+0.031
 cozy+0.029
 scary+0.028
 abandoned+0.027
 curious+0.027
Neg logit contributions
parents-0.068
father-0.063
ums-0.061
asers-0.060
ips-0.060
Token was
Token ID373
Feature activation+1.82e+00

Pos logit contributions
 safe+0.003
 cozy+0.002
 scary+0.002
 curious+0.002
 curled+0.002
Neg logit contributions
parents-0.006
father-0.006
ums-0.006
ases-0.006
asers-0.005
Token the
Token ID262
Feature activation-2.38e+00

Pos logit contributions
 cozy+0.043
 curled+0.042
 safe+0.042
 hiding+0.041
 shady+0.039
Neg logit contributions
parents-0.069
ums-0.065
father-0.064
olds-0.061
els-0.061
Token gate
Token ID8946
Feature activation+1.80e-01

Pos logit contributions
parents+0.073
father+0.069
asers+0.067
ums+0.066
ases+0.066
Neg logit contributions
 safe-0.031
 cozy-0.031
 curled-0.030
 hiding-0.027
 curious-0.027
Tokenkeeper
Token ID13884
Feature activation-7.20e-01

Pos logit contributions
 cozy+0.002
 safe+0.002
 curled+0.002
 curious+0.002
 scary+0.002
Neg logit contributions
parents-0.005
father-0.005
ums-0.005
asers-0.005
olds-0.005
Token agreed
Token ID4987
Feature activation+1.63e+00

Pos logit contributions
parents+0.021
father+0.020
asers+0.019
emaker+0.018
ums+0.018
Neg logit contributions
 cozy-0.011
 safe-0.011
 scary-0.009
 curious-0.009
 curled-0.009
Token that
Token ID326
Feature activation-1.04e+00

Pos logit contributions
 cozy+0.017
 safe+0.015
 curled+0.014
 hut+0.014
 scary+0.014
Neg logit contributions
parents-0.053
father-0.050
ums-0.049
olds-0.049
uns-0.048
Token Ab
Token ID2275
Feature activation-1.26e+01

Pos logit contributions
parents+0.029
father+0.028
ums+0.027
asers+0.026
uns+0.026
Neg logit contributions
 safe-0.016
 cozy-0.016
 curled-0.015
 curious-0.014
 scary-0.014
Tokenbie
Token ID12590
Feature activation-5.27e+00

Pos logit contributions
parents+0.260
asers+0.245
emaker+0.243
anes+0.242
orter+0.242
Neg logit contributions
 curled-0.257
 hiding-0.257
 hidden-0.248
 there-0.247
 a-0.246
Token had
Token ID550
Feature activation-1.31e+00

Pos logit contributions
parents+0.169
father+0.162
asers+0.158
ases+0.156
ums+0.156
Neg logit contributions
 safe-0.062
 cozy-0.061
 curled-0.056
 curious-0.052
 hidden-0.052
Token worked
Token ID3111
Feature activation-6.21e-01

Pos logit contributions
parents+0.036
father+0.034
asers+0.032
ips+0.032
ases+0.032
Neg logit contributions
 cozy-0.021
 safe-0.020
 curled-0.020
 curious-0.019
 hut-0.019
Token hard
Token ID1327
Feature activation-1.15e+00

Pos logit contributions
parents+0.020
father+0.020
asers+0.019
ips+0.019
els+0.019
Neg logit contributions
 cozy-0.006
 curled-0.006
 safe-0.006
 curious-0.005
 hiding-0.005
Token and
Token ID290
Feature activation+1.24e+00

Pos logit contributions
parents+0.038
father+0.037
asers+0.037
ips+0.036
ases+0.036
Neg logit contributions
 cozy-0.012
 safe-0.012
 curled-0.011
 curious-0.010
 shady-0.009
Token happy
Token ID3772
Feature activation-1.29e+00

Pos logit contributions
 safe+0.019
 cozy+0.019
 curled+0.018
 scary+0.017
 open+0.017
Neg logit contributions
parents-0.035
father-0.033
asers-0.032
ums-0.032
ases-0.032
Token for
Token ID329
Feature activation-5.18e+00

Pos logit contributions
parents+0.041
father+0.039
asers+0.038
ases+0.038
ums+0.038
Neg logit contributions
 safe-0.016
 cozy-0.016
 curled-0.015
 hidden-0.014
 curious-0.014
Token the
Token ID262
Feature activation-2.47e+00

Pos logit contributions
parents+0.141
father+0.133
asers+0.132
ums+0.130
emaker+0.130
Neg logit contributions
 safe-0.089
 cozy-0.085
 a-0.082
 hiding-0.081
 curled-0.080
Token rh
Token ID9529
Feature activation-8.59e+00

Pos logit contributions
parents+0.077
father+0.074
asers+0.073
ases+0.072
ips+0.072
Neg logit contributions
 safe-0.032
 cozy-0.031
 curled-0.029
 curious-0.027
 hidden-0.027
Tokenin
Token ID259
Feature activation-4.63e+00

Pos logit contributions
parents+0.182
uns+0.165
ums+0.164
 narrative+0.160
ips+0.159
Neg logit contributions
 safe-0.184
 hidden-0.178
 far-0.176
 there-0.172
 a-0.171
Tokenoc
Token ID420
Feature activation-1.26e+01

Pos logit contributions
parents+0.133
ips+0.127
asers+0.125
father+0.125
ases+0.124
Neg logit contributions
 safe-0.070
 cozy-0.066
 a-0.064
 curled-0.064
 curious-0.062
Tokeneros
Token ID27498
Feature activation-1.97e+00

Pos logit contributions
parents+0.305
 narrative+0.300
asers+0.299
emaker+0.290
iness+0.288
Neg logit contributions
 hiding-0.189
 safe-0.186
 furry-0.180
 bark-0.180
 deer-0.176
Token.
Token ID13
Feature activation-1.72e-02

Pos logit contributions
parents+0.063
father+0.061
asers+0.058
olds+0.058
ums+0.058
Neg logit contributions
 cozy-0.022
 safe-0.020
 curled-0.019
 abandoned-0.018
 shady-0.018
Token She
Token ID1375
Feature activation-8.27e-01

Pos logit contributions
parents+0.001
father+0.001
ums+0.001
ips+0.001
olds+0.001
Neg logit contributions
 safe-0.000
 cozy-0.000
 curled-0.000
 hidden-0.000
 hiding-0.000
Token said
Token ID531
Feature activation-2.18e+00

Pos logit contributions
parents+0.021
father+0.019
asers+0.018
olds+0.018
oral+0.018
Neg logit contributions
 safe-0.015
 cozy-0.014
 hut-0.013
 high-0.013
 hiding-0.013
Token,
Token ID11
Feature activation+4.78e-01

Pos logit contributions
parents+0.075
father+0.072
asers+0.071
ums+0.071
ases+0.070
Neg logit contributions
 safe-0.020
 cozy-0.019
 curled-0.018
 hidden-0.016
 hiding-0.016
Token From
Token ID3574
Feature activation-2.50e-01

Pos logit contributions
parents+0.009
father+0.009
asers+0.009
ases+0.009
emaker+0.009
Neg logit contributions
 safe-0.003
 cozy-0.003
 curled-0.003
 a-0.003
 curious-0.003
Token that
Token ID326
Feature activation-2.00e+00

Pos logit contributions
parents+0.007
father+0.007
els+0.006
pa+0.006
asers+0.006
Neg logit contributions
 cozy-0.004
 safe-0.004
 curled-0.004
 shady-0.004
 hiding-0.003
Token day
Token ID1110
Feature activation-2.10e+00

Pos logit contributions
parents+0.048
father+0.046
asers+0.044
ums+0.043
ases+0.043
Neg logit contributions
 safe-0.040
 cozy-0.039
 curled-0.038
 curious-0.036
 hidden-0.036
Token on
Token ID319
Feature activation-2.47e+00

Pos logit contributions
parents+0.062
father+0.059
asers+0.058
ases+0.057
ums+0.057
Neg logit contributions
 safe-0.030
 cozy-0.029
 curled-0.028
 curious-0.027
 a-0.026
Token,
Token ID11
Feature activation+9.74e-01

Pos logit contributions
parents+0.084
father+0.081
ums+0.080
els+0.078
ips+0.078
Neg logit contributions
 cozy-0.022
 safe-0.021
 curled-0.020
 curious-0.018
 scary-0.018
Token Ab
Token ID2275
Feature activation-1.25e+01

Pos logit contributions
 safe+0.015
 cozy+0.014
 curled+0.014
 curious+0.013
 hidden+0.013
Neg logit contributions
parents-0.028
father-0.027
asers-0.026
ases-0.026
ums-0.026
Tokenbie
Token ID12590
Feature activation-3.27e+00

Pos logit contributions
parents+0.272
asers+0.266
ases+0.262
orter+0.261
emaker+0.259
Neg logit contributions
 cozy-0.245
 curled-0.244
 hidden-0.239
 curious-0.237
 a-0.236
Token never
Token ID1239
Feature activation-3.30e+00

Pos logit contributions
parents+0.106
father+0.101
asers+0.099
ases+0.098
ums+0.098
Neg logit contributions
 safe-0.038
 cozy-0.036
 curled-0.034
 curious-0.032
 hidden-0.031
Token forgot
Token ID16453
Feature activation-1.66e+00

Pos logit contributions
parents+0.084
father+0.079
orter+0.076
emaker+0.076
asers+0.076
Neg logit contributions
 cozy-0.056
 safe-0.055
 hut-0.052
 curled-0.051
 high-0.049
Token the
Token ID262
Feature activation-2.57e+00

Pos logit contributions
parents+0.048
father+0.046
olds+0.045
ups+0.044
asers+0.044
Neg logit contributions
 cozy-0.021
 curled-0.020
 safe-0.019
 hiding-0.018
 furry-0.018
Token special
Token ID2041
Feature activation-5.31e+00

Pos logit contributions
parents+0.080
father+0.076
asers+0.073
ases+0.072
ums+0.071
Neg logit contributions
 cozy-0.032
 safe-0.032
 curled-0.032
 filled-0.028
 curious-0.027
Token you
Token ID345
Feature activation-6.50e+00

Pos logit contributions
parents+0.078
father+0.072
els+0.070
olds+0.070
asers+0.069
Neg logit contributions
 cozy-0.069
 curled-0.068
 safe-0.067
 abandoned-0.064
 hollow-0.064
Token like
Token ID588
Feature activation-3.19e+00

Pos logit contributions
parents+0.168
father+0.155
asers+0.154
olds+0.151
ases+0.150
Neg logit contributions
 cozy-0.113
 safe-0.111
 curled-0.104
 shady-0.104
 hut-0.103
Token the
Token ID262
Feature activation-2.23e+00

Pos logit contributions
parents+0.098
father+0.093
els+0.091
asers+0.091
ases+0.091
Neg logit contributions
 safe-0.037
 cozy-0.037
 curled-0.036
 curious-0.032
 filled-0.032
Token rh
Token ID9529
Feature activation-8.70e+00

Pos logit contributions
parents+0.072
father+0.069
asers+0.067
ums+0.066
ases+0.066
Neg logit contributions
 safe-0.026
 cozy-0.025
 curled-0.024
 curious-0.022
 hiding-0.022
Tokenin
Token ID259
Feature activation-4.79e+00

Pos logit contributions
parents+0.185
uns+0.167
ums+0.163
olds+0.161
father+0.160
Neg logit contributions
 safe-0.186
 there-0.182
 hidden-0.182
 far-0.179
 a-0.178
Tokenoc
Token ID420
Feature activation-1.25e+01

Pos logit contributions
parents+0.130
father+0.123
asers+0.120
ips+0.119
ases+0.118
Neg logit contributions
 safe-0.081
 cozy-0.078
 curled-0.075
 curious-0.072
 a-0.072
Tokeneros
Token ID27498
Feature activation-2.41e+00

Pos logit contributions
parents+0.320
 narrative+0.312
asers+0.307
ips+0.300
iness+0.299
Neg logit contributions
 hiding-0.170
 furry-0.169
 safe-0.168
 deer-0.168
 bark-0.167
Token,
Token ID11
Feature activation+1.63e-01

Pos logit contributions
parents+0.075
father+0.073
asers+0.072
ases+0.071
olds+0.070
Neg logit contributions
 cozy-0.027
 curled-0.024
 safe-0.024
 shady-0.023
 abandoned-0.023
Token but
Token ID475
Feature activation-2.14e+00

Pos logit contributions
 cozy+0.001
 safe+0.001
 curled+0.001
 abandoned+0.001
 hidden+0.001
Neg logit contributions
parents-0.006
father-0.005
ums-0.005
asers-0.005
olds-0.005
Token it
Token ID340
Feature activation-3.95e+00

Pos logit contributions
parents+0.062
father+0.059
asers+0.058
ums+0.057
ases+0.057
Neg logit contributions
 safe-0.030
 cozy-0.030
 curled-0.029
 curious-0.027
 hidden-0.026
Token is
Token ID318
Feature activation+2.10e-01

Pos logit contributions
parents+0.103
father+0.099
asers+0.095
els+0.095
olds+0.093
Neg logit contributions
 safe-0.065
 cozy-0.065
 shady-0.061
 curled-0.060
 abandoned-0.059
Token had
Token ID550
Feature activation-3.43e+00

Pos logit contributions
parents+0.123
father+0.115
iness+0.110
asers+0.109
emaker+0.109
Neg logit contributions
 cozy-0.063
 safe-0.059
 hiding-0.057
 curled-0.057
 shady-0.057
Token never
Token ID1239
Feature activation-5.61e+00

Pos logit contributions
parents+0.096
father+0.090
asers+0.087
ases+0.087
ips+0.086
Neg logit contributions
 safe-0.054
 cozy-0.053
 curled-0.052
 curious-0.049
 hidden-0.048
Token seen
Token ID1775
Feature activation-3.92e+00

Pos logit contributions
parents+0.152
father+0.142
asers+0.137
ases+0.137
ips+0.136
Neg logit contributions
 safe-0.093
 cozy-0.093
 curled-0.088
 curious-0.085
 high-0.081
Token the
Token ID262
Feature activation-3.40e+00

Pos logit contributions
parents+0.129
father+0.123
asers+0.121
ums+0.120
ips+0.120
Neg logit contributions
 safe-0.044
 cozy-0.042
 a-0.039
 curled-0.038
 hidden-0.037
Token av
Token ID1196
Feature activation-5.89e+00

Pos logit contributions
parents+0.097
father+0.093
asers+0.088
ases+0.086
emaker+0.086
Neg logit contributions
 curled-0.051
 cozy-0.051
 safe-0.051
 filled-0.045
 curious-0.045
Tokenoc
Token ID420
Feature activation-1.25e+01

Pos logit contributions
parents+0.173
ases+0.162
asers+0.161
father+0.160
ips+0.160
Neg logit contributions
 safe-0.090
 cozy-0.085
 hidden-0.077
 open-0.077
 high-0.076
Tokenados
Token ID22484
Feature activation-3.28e+00

Pos logit contributions
parents+0.243
asers+0.233
uns+0.230
ips+0.226
ases+0.221
Neg logit contributions
 safe-0.252
,-0.246
 hut-0.241
 open-0.240
 deer-0.237
Token.
Token ID13
Feature activation+7.19e-02

Pos logit contributions
parents+0.091
father+0.088
emaker+0.086
ips+0.085
asers+0.085
Neg logit contributions
 cozy-0.047
 curled-0.043
 safe-0.042
 curious-0.041
 fluffy-0.040
Token They
Token ID1119
Feature activation-2.88e-01

Pos logit contributions
 safe+0.001
 curled+0.001
 cozy+0.001
 hidden+0.001
 hiding+0.001
Neg logit contributions
parents-0.002
father-0.002
olds-0.002
ums-0.002
ases-0.002
Token wished
Token ID16555
Feature activation-1.05e+00

Pos logit contributions
parents+0.007
father+0.007
 narrative+0.007
olds+0.007
els+0.007
Neg logit contributions
 safe-0.005
 cozy-0.005
 shady-0.004
 hiding-0.004
 high-0.004
Token they
Token ID484
Feature activation-4.51e+00

Pos logit contributions
parents+0.035
father+0.033
asers+0.033
ases+0.033
ips+0.032
Neg logit contributions
 safe-0.011
 cozy-0.011
 curled-0.010
 a-0.010
 curious-0.010
Token He
Token ID679
Feature activation-1.06e+00

Pos logit contributions
parents+0.049
father+0.047
asers+0.046
uns+0.045
els+0.045
Neg logit contributions
 a-0.007
 safe-0.006
 cozy-0.006
 curious-0.005
 but-0.005
Token did
Token ID750
Feature activation-8.63e-01

Pos logit contributions
parents+0.034
father+0.032
asers+0.032
ases+0.032
ums+0.031
Neg logit contributions
 safe-0.012
 cozy-0.012
 curled-0.011
 curious-0.010
 hiding-0.010
Token not
Token ID407
Feature activation-2.44e+00

Pos logit contributions
parents+0.025
father+0.024
ases+0.023
olds+0.023
ips+0.023
Neg logit contributions
 cozy-0.012
 safe-0.012
 curled-0.012
 hut-0.011
 curious-0.011
Token care
Token ID1337
Feature activation-4.09e+00

Pos logit contributions
parents+0.071
father+0.068
asers+0.066
ums+0.066
ips+0.065
Neg logit contributions
 safe-0.038
 cozy-0.036
 curled-0.034
 a-0.033
 curious-0.033
Token if
Token ID611
Feature activation-5.35e+00

Pos logit contributions
parents+0.145
father+0.139
asers+0.135
ases+0.134
ums+0.134
Neg logit contributions
 safe-0.036
 cozy-0.034
 curled-0.031
 a-0.029
 curious-0.029
Token L
Token ID406
Feature activation-1.24e+01

Pos logit contributions
parents+0.181
father+0.174
asers+0.170
ases+0.168
ums+0.168
Neg logit contributions
 safe-0.053
 cozy-0.051
 curled-0.048
 curious-0.043
 hidden-0.042
Tokenila
Token ID10102
Feature activation-7.22e+00

Pos logit contributions
parents+0.204
father+0.193
asers+0.171
ums+0.170
uns+0.168
Neg logit contributions
 curled-0.323
 safe-0.319
 there-0.318
 hiding-0.318
 a-0.318
Token was
Token ID373
Feature activation-2.45e+00

Pos logit contributions
parents+0.226
father+0.215
ases+0.211
asers+0.210
els+0.207
Neg logit contributions
 cozy-0.090
 safe-0.089
 curled-0.076
 curious-0.074
 shady-0.074
Token hurt
Token ID5938
Feature activation-4.32e+00

Pos logit contributions
parents+0.072
asers+0.069
ases+0.068
father+0.068
ips+0.068
Neg logit contributions
 curled-0.030
 cozy-0.029
 hut-0.029
 safe-0.026
 clearing-0.025
Token or
Token ID393
Feature activation-8.04e-01

Pos logit contributions
parents+0.166
father+0.159
asers+0.156
uns+0.155
orter+0.153
Neg logit contributions
 safe-0.027
 a-0.022
 cozy-0.022
 curled-0.020
 curious-0.019
Token sad
Token ID6507
Feature activation-3.34e+00

Pos logit contributions
parents+0.023
asers+0.022
ases+0.022
olds+0.022
father+0.022
Neg logit contributions
 curled-0.010
 safe-0.010
 cozy-0.010
 hut-0.009
 clearing-0.009
Token you
Token ID345
Feature activation-6.63e+00

Pos logit contributions
parents+0.040
father+0.037
ums+0.036
asers+0.036
ases+0.035
Neg logit contributions
 safe-0.034
 cozy-0.033
 curled-0.032
 hidden-0.031
 curious-0.031
Token like
Token ID588
Feature activation-3.20e+00

Pos logit contributions
parents+0.172
father+0.159
ases+0.156
asers+0.154
ips+0.152
Neg logit contributions
 cozy-0.111
 safe-0.107
 hiding-0.104
 hut-0.103
 curled-0.102
Token the
Token ID262
Feature activation-2.23e+00

Pos logit contributions
parents+0.099
father+0.094
ases+0.092
els+0.091
olds+0.090
Neg logit contributions
 curled-0.037
 safe-0.037
 cozy-0.037
 curious-0.033
 filled-0.032
Token rh
Token ID9529
Feature activation-8.58e+00

Pos logit contributions
parents+0.071
father+0.068
asers+0.065
ums+0.065
ases+0.064
Neg logit contributions
 safe-0.026
 cozy-0.026
 curled-0.026
 curious-0.023
 filled-0.022
Tokenin
Token ID259
Feature activation-4.73e+00

Pos logit contributions
parents+0.176
uns+0.160
ums+0.155
olds+0.153
father+0.152
Neg logit contributions
 safe-0.192
 hidden-0.186
 there-0.185
 far-0.184
 a-0.183
Tokenoc
Token ID420
Feature activation-1.24e+01

Pos logit contributions
parents+0.121
father+0.115
asers+0.112
ases+0.110
ums+0.109
Neg logit contributions
 safe-0.087
 cozy-0.085
 curled-0.082
 curious-0.078
 hiding-0.078
Tokeneros
Token ID27498
Feature activation-2.24e+00

Pos logit contributions
parents+0.315
asers+0.303
 narrative+0.301
ips+0.298
uns+0.296
Neg logit contributions
 safe-0.176
 deer-0.173
 furry-0.172
 hiding-0.172
 bark-0.167
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
parents+0.065
father+0.063
ases+0.062
asers+0.062
emaker+0.061
Neg logit contributions
 cozy-0.028
 abandoned-0.026
 curled-0.026
 safe-0.025
 shady-0.025
Token But
Token ID887
Feature activation-1.29e+00

Pos logit contributions
parents+0.029
father+0.028
ums+0.028
ases+0.027
olds+0.027
Neg logit contributions
 safe-0.016
 cozy-0.015
 curled-0.015
 hidden-0.014
 hiding-0.014
Token remember
Token ID3505
Feature activation-2.68e+00

Pos logit contributions
parents+0.036
father+0.034
asers+0.034
ases+0.033
ums+0.033
Neg logit contributions
 safe-0.020
 cozy-0.020
 curled-0.019
 curious-0.018
 hidden-0.018
Token,
Token ID11
Feature activation+1.10e+00

Pos logit contributions
parents+0.085
father+0.082
olds+0.080
els+0.079
pa+0.079
Neg logit contributions
 cozy-0.029
 safe-0.027
 curled-0.026
 curious-0.025
 wide-0.023
TokenL
Token ID43
Feature activation-9.78e+00

Pos logit contributions
parents+0.109
father+0.106
asers+0.101
ums+0.100
ips+0.099
Neg logit contributions
 a-0.067
 cozy-0.066
 curled-0.064
 safe-0.064
 furry-0.064
Tokenily
Token ID813
Feature activation-2.75e+00

Pos logit contributions
parents+0.204
father+0.192
asers+0.184
ums+0.179
els+0.178
Neg logit contributions
 safe-0.215
 curled-0.210
 a-0.210
 hiding-0.207
 hidden-0.204
Token,
Token ID11
Feature activation-7.05e-01

Pos logit contributions
parents+0.094
father+0.090
ums+0.089
ases+0.088
emaker+0.087
Neg logit contributions
 cozy-0.025
 safe-0.024
 curled-0.021
 shady-0.019
 curious-0.018
Token these
Token ID777
Feature activation-5.83e+00

Pos logit contributions
parents+0.019
father+0.018
ases+0.018
asers+0.017
emaker+0.017
Neg logit contributions
 safe-0.011
 cozy-0.011
 curled-0.011
 curious-0.010
 wide-0.010
Token ch
Token ID442
Feature activation-8.31e+00

Pos logit contributions
parents+0.134
father+0.125
asers+0.123
ases+0.120
ums+0.120
Neg logit contributions
 safe-0.121
 cozy-0.119
 curled-0.116
 curious-0.111
 hiding-0.110
Tokenocol
Token ID4668
Feature activation-1.24e+01

Pos logit contributions
parents+0.163
 narrative+0.147
oral+0.139
father+0.134
ups+0.130
Neg logit contributions
 hiding-0.180
 furry-0.175
 and-0.175
 a-0.174
 fluffy-0.173
Tokenates
Token ID689
Feature activation-2.41e+00

Pos logit contributions
parents+0.197
oral+0.192
 narrative+0.181
orter+0.177
olds+0.168
Neg logit contributions
 a-0.295
 in-0.286
 hiding-0.285
 perched-0.274
 and-0.272
Token are
Token ID389
Feature activation-5.83e-01

Pos logit contributions
parents+0.069
father+0.065
ases+0.065
ips+0.064
asers+0.064
Neg logit contributions
 cozy-0.035
 safe-0.034
 curled-0.032
 curious-0.031
 hidden-0.029
Token bitter
Token ID12922
Feature activation-2.00e+00

Pos logit contributions
parents+0.017
asers+0.017
father+0.017
els+0.017
ums+0.016
Neg logit contributions
 curled-0.006
 safe-0.006
 cozy-0.006
 hut-0.006
 abandoned-0.006
Token,"
Token ID553
Feature activation-1.80e+00

Pos logit contributions
parents+0.063
asers+0.061
ases+0.061
ums+0.061
uns+0.061
Neg logit contributions
 cozy-0.022
 safe-0.022
 curled-0.021
 curious-0.019
 hidden-0.018
Token he
Token ID339
Feature activation-2.02e+00

Pos logit contributions
parents+0.051
father+0.049
olds+0.048
ums+0.047
asers+0.047
Neg logit contributions
 safe-0.027
 cozy-0.025
 curled-0.024
 hidden-0.022
 shady-0.022
Token agreement
Token ID4381
Feature activation-6.26e-01

Pos logit contributions
parents+0.022
father+0.021
asers+0.020
uns+0.019
ips+0.019
Neg logit contributions
 safe-0.026
 cozy-0.025
 curled-0.025
 a-0.024
 curious-0.024
Token and
Token ID290
Feature activation+1.28e+00

Pos logit contributions
parents+0.021
father+0.021
ums+0.020
olds+0.020
asers+0.020
Neg logit contributions
 cozy-0.006
 safe-0.005
 curled-0.005
 scary-0.004
 shady-0.004
Token hopped
Token ID45230
Feature activation+5.01e-01

Pos logit contributions
 cozy+0.018
 safe+0.018
 scary+0.017
 curled+0.017
 curious+0.016
Neg logit contributions
parents-0.037
father-0.035
ums-0.034
asers-0.034
ases-0.033
Token away
Token ID1497
Feature activation+1.64e+00

Pos logit contributions
 cozy+0.008
 safe+0.007
 curled+0.007
 scary+0.007
 abandoned+0.007
Neg logit contributions
parents-0.014
ips-0.013
ases-0.013
father-0.013
asers-0.013
Token.
Token ID13
Feature activation-1.29e+00

Pos logit contributions
 cozy+0.019
 curled+0.018
 hut+0.017
 shady+0.017
 stable+0.016
Neg logit contributions
ums-0.045
parents-0.044
olds-0.044
iness-0.042
father-0.042
Token Br
Token ID1709
Feature activation-1.23e+01

Pos logit contributions
parents+0.040
father+0.038
asers+0.037
ases+0.036
ips+0.036
Neg logit contributions
 safe-0.017
 cozy-0.017
 curled-0.016
 curious-0.015
 a-0.015
Tokenidget
Token ID17484
Feature activation-5.44e+00

Pos logit contributions
parents+0.228
olds+0.204
father+0.195
ases+0.185
ups+0.184
Neg logit contributions
 curled-0.277
 safe-0.277
 and-0.276
 hidden-0.271
 but-0.271
Token watched
Token ID7342
Feature activation-1.53e+00

Pos logit contributions
parents+0.171
father+0.164
ums+0.159
asers+0.159
ases+0.157
Neg logit contributions
 safe-0.067
 cozy-0.066
 curled-0.058
 hidden-0.057
 hiding-0.056
Token it
Token ID340
Feature activation+4.53e-02

Pos logit contributions
parents+0.048
father+0.047
ums+0.045
ases+0.045
emaker+0.045
Neg logit contributions
 cozy-0.017
 safe-0.016
 curled-0.016
 abandoned-0.015
 shady-0.015
Token go
Token ID467
Feature activation-6.67e-01

Pos logit contributions
 cozy+0.001
 abandoned+0.001
 safe+0.001
 curled+0.001
 scary+0.000
Neg logit contributions
parents-0.001
father-0.001
ums-0.001
ips-0.001
ases-0.001
Token,
Token ID11
Feature activation+2.66e+00

Pos logit contributions
parents+0.021
father+0.020
ips+0.020
ums+0.020
orter+0.019
Neg logit contributions
 cozy-0.007
 curled-0.007
 curious-0.006
 safe-0.006
 scary-0.006
Token okay
Token ID8788
Feature activation-3.52e+00

Pos logit contributions
parents+0.008
father+0.007
asers+0.007
ums+0.007
ases+0.007
Neg logit contributions
 safe-0.003
 cozy-0.003
 curled-0.003
 hidden-0.002
 hiding-0.002
Token?
Token ID30
Feature activation-2.27e+00

Pos logit contributions
parents+0.094
father+0.090
els+0.087
ases+0.087
ums+0.086
Neg logit contributions
 safe-0.058
 cozy-0.057
 curled-0.057
 curious-0.052
 hidden-0.051
Token And
Token ID843
Feature activation-8.53e-01

Pos logit contributions
parents+0.066
father+0.063
asers+0.061
emaker+0.061
ases+0.060
Neg logit contributions
 safe-0.033
 cozy-0.032
 a-0.031
 curled-0.031
 curious-0.030
Token maybe
Token ID3863
Feature activation-4.00e+00

Pos logit contributions
parents+0.024
father+0.023
asers+0.022
ases+0.022
ums+0.022
Neg logit contributions
 safe-0.013
 cozy-0.013
 curled-0.012
 curious-0.012
 hidden-0.012
Token not
Token ID407
Feature activation-3.93e+00

Pos logit contributions
parents+0.119
father+0.114
asers+0.110
ums+0.109
ases+0.109
Neg logit contributions
 safe-0.056
 cozy-0.054
 curled-0.053
 curious-0.049
 hidden-0.048
Token pr
Token ID778
Feature activation-1.23e+01

Pos logit contributions
parents+0.116
father+0.111
ases+0.110
asers+0.110
els+0.109
Neg logit contributions
 curled-0.047
 safe-0.047
 cozy-0.046
 abandoned-0.040
 hidden-0.040
Tokenunes
Token ID4015
Feature activation-6.25e+00

Pos logit contributions
parents+0.188
father+0.162
 narrative+0.149
emaker+0.148
oral+0.146
Neg logit contributions
 a-0.345
 open-0.334
 safe-0.331
 hiding-0.328
 high-0.326
Token.
Token ID13
Feature activation-2.55e+00

Pos logit contributions
parents+0.195
father+0.187
asers+0.182
ums+0.180
ases+0.179
Neg logit contributions
 safe-0.080
 cozy-0.077
 curled-0.073
 a-0.069
 hidden-0.068
Token They
Token ID1119
Feature activation-3.11e+00

Pos logit contributions
parents+0.077
father+0.074
emaker+0.072
asers+0.072
ips+0.070
Neg logit contributions
 safe-0.034
 cozy-0.034
 a-0.032
 curious-0.031
 curled-0.030
Token are
Token ID389
Feature activation-6.19e-01

Pos logit contributions
parents+0.082
father+0.079
asers+0.076
ums+0.076
ips+0.075
Neg logit contributions
 safe-0.055
 cozy-0.053
 curled-0.051
 a-0.049
 hidden-0.049
Token too
Token ID1165
Feature activation-1.16e+00

Pos logit contributions
parents+0.019
father+0.019
asers+0.019
els+0.018
ums+0.018
Neg logit contributions
 curled-0.007
 safe-0.007
 cozy-0.007
 curious-0.006
 hiding-0.006
Token that
Token ID326
Feature activation-1.68e+00

Pos logit contributions
parents+0.088
father+0.086
ases+0.080
olds+0.080
pa+0.080
Neg logit contributions
 cozy-0.031
 safe-0.030
 curled-0.030
 curious-0.028
 scary-0.026
Token boss
Token ID6478
Feature activation-6.72e+00

Pos logit contributions
parents+0.050
father+0.047
ums+0.045
emaker+0.044
pa+0.044
Neg logit contributions
 curled-0.020
 cozy-0.019
 safe-0.019
 curious-0.018
 open-0.018
Tokeny
Token ID88
Feature activation-3.32e+00

Pos logit contributions
parents+0.170
father+0.161
asers+0.156
ums+0.154
ases+0.154
Neg logit contributions
 safe-0.125
 cozy-0.121
 curled-0.118
 curious-0.112
 a-0.112
Token rh
Token ID9529
Feature activation-7.81e+00

Pos logit contributions
parents+0.099
father+0.095
asers+0.091
emaker+0.090
ases+0.090
Neg logit contributions
 cozy-0.044
 safe-0.043
 curled-0.041
 curious-0.039
 wide-0.038
Tokenin
Token ID259
Feature activation-4.47e+00

Pos logit contributions
parents+0.157
uns+0.141
ums+0.139
father+0.137
ips+0.136
Neg logit contributions
 safe-0.180
 hidden-0.174
 far-0.173
 a-0.172
 there-0.171
Tokenoc
Token ID420
Feature activation-1.23e+01

Pos logit contributions
parents+0.133
father+0.128
asers+0.123
ases+0.122
ums+0.121
Neg logit contributions
 safe-0.062
 cozy-0.062
 curled-0.059
 curious-0.055
 hiding-0.055
Tokeneros
Token ID27498
Feature activation-3.76e+00

Pos logit contributions
parents+0.290
 narrative+0.274
father+0.266
oral+0.266
asers+0.264
Neg logit contributions
 safe-0.214
 hiding-0.207
 a-0.207
 deer-0.201
 furry-0.200
Token.
Token ID13
Feature activation-9.78e-01

Pos logit contributions
parents+0.116
father+0.113
asers+0.109
ases+0.107
emaker+0.107
Neg logit contributions
 cozy-0.047
 safe-0.044
 curled-0.042
 shady-0.040
 curious-0.038
Token<|endoftext|>
Token ID50256
Feature activation-2.29e+00

Pos logit contributions
parents+0.036
asers+0.036
els+0.035
affles+0.034
ases+0.034
Neg logit contributions
 safe-0.007
 a-0.007
 cozy-0.007
 but-0.005
 curious-0.005
TokenD
Token ID35
Feature activation-7.56e+00

Pos logit contributions
parents+0.058
father+0.055
ips+0.054
ases+0.053
ums+0.053
Neg logit contributions
 safe-0.041
 cozy-0.041
 curled-0.039
 curious-0.038
 scary-0.037
Tokenee
Token ID1453
Feature activation-4.85e+00

Pos logit contributions
parents+0.126
asers+0.118
father+0.114
ases+0.112
emaker+0.107
Neg logit contributions
 safe-0.200
 cozy-0.196
 curled-0.193
 a-0.190
 hidden-0.189
Token upon
Token ID2402
Feature activation+2.30e+00

Pos logit contributions
parents+0.072
asers+0.069
ases+0.068
father+0.068
uns+0.067
Neg logit contributions
 safe-0.019
 cozy-0.019
 there-0.019
 a-0.018
 high-0.017
Token a
Token ID257
Feature activation+4.14e+00

Pos logit contributions
 curled+0.019
 hollow+0.018
 abandoned+0.018
 safe+0.018
 hut+0.016
Neg logit contributions
parents-0.070
father-0.067
ases-0.067
olds-0.065
emaker-0.065
Token time
Token ID640
Feature activation+1.84e+00

Pos logit contributions
 safe+0.071
 cozy+0.069
 curled+0.066
 curious+0.063
 hidden+0.063
Neg logit contributions
parents-0.111
father-0.105
asers-0.102
ases-0.101
ums-0.101
Token,
Token ID11
Feature activation+2.25e+00

Pos logit contributions
 safe+0.011
 cozy+0.011
 curled+0.010
 curious+0.008
 hidden+0.008
Neg logit contributions
parents-0.068
father-0.066
ums-0.065
ips-0.065
ases-0.064
Token there
Token ID612
Feature activation-1.73e-02

Pos logit contributions
 curled+0.022
 safe+0.021
 cozy+0.019
 bark+0.019
 shady+0.018
Neg logit contributions
ips-0.065
els-0.065
father-0.065
emaker-0.065
parents-0.064
Token was
Token ID373
Feature activation+4.97e+00

Pos logit contributions
parents+0.001
father+0.001
ums+0.001
ases+0.001
ips+0.001
Neg logit contributions
 safe-0.000
 cozy-0.000
 curled-0.000
 curious-0.000
 scary-0.000
Token a
Token ID257
Feature activation+8.17e+00

Pos logit contributions
 safe+0.032
 curled+0.031
 cozy+0.025
 abandoned+0.023
 hiding+0.022
Neg logit contributions
parents-0.172
father-0.169
ases-0.164
ums-0.163
ips-0.163
Token little
Token ID1310
Feature activation+5.62e+00

Pos logit contributions
 curled+0.105
 perched+0.094
 abandoned+0.081
 safe+0.081
 hiding+0.081
Neg logit contributions
father-0.217
parents-0.216
oral-0.214
orter-0.212
ums-0.209
Token girl
Token ID2576
Feature activation+3.98e+00

Pos logit contributions
 curled+0.096
 safe+0.090
 perched+0.087
 cozy+0.086
 hiding+0.085
Neg logit contributions
father-0.133
asers-0.131
parents-0.131
orter-0.129
emaker-0.128
Token named
Token ID3706
Feature activation+1.22e+00

Pos logit contributions
 safe+0.047
 cozy+0.046
 curled+0.043
 scary+0.040
 hidden+0.039
Neg logit contributions
parents-0.125
father-0.121
ums-0.116
ases-0.115
els-0.115
Token Lily
Token ID20037
Feature activation+1.58e+00

Pos logit contributions
 safe+0.014
 curled+0.014
 cozy+0.014
 curious+0.012
 hidden+0.012
Neg logit contributions
parents-0.038
father-0.036
ips-0.035
asers-0.035
uns-0.035
Token upon
Token ID2402
Feature activation+2.55e+00

Pos logit contributions
parents+0.076
father+0.073
asers+0.073
ases+0.072
uns+0.071
Neg logit contributions
 safe-0.023
 cozy-0.022
 there-0.021
 a-0.020
 high-0.020
Token a
Token ID257
Feature activation+4.75e+00

Pos logit contributions
 hollow+0.027
 abandoned+0.026
 curled+0.024
 hut+0.024
 safe+0.024
Neg logit contributions
parents-0.070
father-0.065
ases-0.064
olds-0.063
ons-0.063
Token time
Token ID640
Feature activation+2.50e+00

Pos logit contributions
 safe+0.087
 cozy+0.084
 curled+0.084
 curious+0.079
 hidden+0.078
Neg logit contributions
parents-0.117
father-0.111
ums-0.107
ases-0.106
ips-0.106
Token there
Token ID612
Feature activation-2.27e-01

Pos logit contributions
 safe+0.021
 cozy+0.021
 curled+0.020
 shady+0.015
 curious+0.015
Neg logit contributions
parents-0.085
father-0.083
ums-0.082
ips-0.081
ases-0.080
Token was
Token ID373
Feature activation+5.14e+00

Pos logit contributions
parents+0.007
father+0.007
ums+0.006
emaker+0.006
ases+0.006
Neg logit contributions
 safe-0.003
 cozy-0.003
 curled-0.003
 curious-0.003
 scary-0.002
Token a
Token ID257
Feature activation+8.47e+00

Pos logit contributions
 safe+0.041
 curled+0.040
 hollow+0.034
 cozy+0.033
 abandoned+0.033
Neg logit contributions
parents-0.164
father-0.161
ases-0.156
ips-0.154
ums-0.154
Token happy
Token ID3772
Feature activation+7.01e+00

Pos logit contributions
 curled+0.111
 perched+0.102
 abandoned+0.087
 hiding+0.085
 hollow+0.084
Neg logit contributions
oral-0.218
parents-0.215
orter-0.214
father-0.214
ums-0.204
Token family
Token ID1641
Feature activation+3.02e+00

Pos logit contributions
 curled+0.111
 perched+0.108
 safe+0.107
 cozy+0.101
 hollow+0.101
Neg logit contributions
parents-0.166
father-0.163
pa-0.160
ums-0.157
oral-0.153
Token.
Token ID13
Feature activation+2.53e+00

Pos logit contributions
 cozy+0.030
 safe+0.029
 curled+0.027
 scary+0.024
 curious+0.024
Neg logit contributions
parents-0.102
father-0.099
ums-0.095
asers-0.095
ases-0.095
Token Every
Token ID3887
Feature activation+1.19e+00

Pos logit contributions
 safe+0.043
 curled+0.040
 cozy+0.040
 hidden+0.038
 hiding+0.037
Neg logit contributions
parents-0.065
father-0.061
ums-0.061
ases-0.061
ips-0.060
Token day
Token ID1110
Feature activation+2.37e+00

Pos logit contributions
 safe+0.026
 cozy+0.026
 curled+0.025
 curious+0.025
 hiding+0.024
Neg logit contributions
parents-0.024
father-0.023
oral-0.022
ips-0.022
ases-0.022
Token One
Token ID1881
Feature activation-1.56e-02

Pos logit contributions
 safe+0.029
 cozy+0.028
 curled+0.026
 a+0.025
 curious+0.025
Neg logit contributions
parents-0.073
father-0.070
asers-0.069
ases-0.068
ips-0.067
Token day
Token ID1110
Feature activation+2.74e+00

Pos logit contributions
parents+0.000
father+0.000
pa+0.000
oral+0.000
emaker+0.000
Neg logit contributions
 curled-0.000
 perched-0.000
 safe-0.000
 hiding-0.000
 cozy-0.000
Token,
Token ID11
Feature activation+4.31e+00

Pos logit contributions
 cozy+0.027
 safe+0.027
 curled+0.026
 shady+0.022
 curious+0.022
Neg logit contributions
parents-0.087
olds-0.084
ums-0.084
ips-0.084
father-0.084
Token he
Token ID339
Feature activation+1.41e+00

Pos logit contributions
 curled+0.063
 safe+0.061
 cozy+0.059
 shady+0.053
 hollow+0.053
Neg logit contributions
parents-0.115
father-0.110
emaker-0.108
asers-0.108
els-0.107
Token found
Token ID1043
Feature activation+1.84e+00

Pos logit contributions
 safe+0.021
 cozy+0.020
 hollow+0.018
 fluffy+0.018
 high+0.018
Neg logit contributions
parents-0.038
father-0.035
pa-0.034
ups-0.033
ases-0.033
Token a
Token ID257
Feature activation+7.64e+00

Pos logit contributions
 safe+0.006
 cozy+0.005
 curled+0.004
 curious+0.003
 hiding+0.003
Neg logit contributions
parents-0.074
father-0.072
asers-0.071
ases-0.070
ums-0.070
Token big
Token ID1263
Feature activation+6.49e+00

Pos logit contributions
 curled+0.108
 perched+0.088
 filled+0.083
 safe+0.081
 its+0.079
Neg logit contributions
parents-0.181
father-0.179
oral-0.176
asers-0.170
orter-0.170
Token,
Token ID11
Feature activation+4.91e+00

Pos logit contributions
 curled+0.099
 cozy+0.093
 safe+0.091
 perched+0.088
 curious+0.086
Neg logit contributions
father-0.153
pa-0.150
els-0.149
parents-0.149
orter-0.144
Token shiny
Token ID22441
Feature activation+5.35e+00

Pos logit contributions
 curled+0.060
 perched+0.053
 hiding+0.051
 where+0.049
 shelter+0.049
Neg logit contributions
 narrative-0.119
asers-0.116
els-0.113
orter-0.111
father-0.109
Token rock
Token ID3881
Feature activation+4.93e+00

Pos logit contributions
 cozy+0.097
 curled+0.096
 safe+0.094
 curious+0.087
 scared+0.080
Neg logit contributions
els-0.122
father-0.112
parents-0.109
asers-0.107
ums-0.107
Token in
Token ID287
Feature activation+2.43e+00

Pos logit contributions
 cozy+0.071
 safe+0.061
 shady+0.054
 curled+0.053
 fluffy+0.052
Neg logit contributions
els-0.139
ases-0.135
father-0.134
ips-0.133
asers-0.133
Token upon
Token ID2402
Feature activation+2.17e+00

Pos logit contributions
parents+0.070
asers+0.067
father+0.066
ases+0.066
uns+0.065
Neg logit contributions
 safe-0.019
 cozy-0.019
 there-0.018
 high-0.017
 a-0.017
Token a
Token ID257
Feature activation+3.59e+00

Pos logit contributions
 curled+0.018
 hollow+0.018
 abandoned+0.017
 safe+0.016
 hut+0.015
Neg logit contributions
parents-0.066
father-0.063
ases-0.063
emaker-0.061
olds-0.061
Token time
Token ID640
Feature activation+1.68e+00

Pos logit contributions
 safe+0.062
 cozy+0.060
 curled+0.058
 curious+0.055
 hidden+0.055
Neg logit contributions
parents-0.095
father-0.090
asers-0.087
ases-0.087
ums-0.086
Token,
Token ID11
Feature activation+1.91e+00

Pos logit contributions
 safe+0.012
 cozy+0.011
 curled+0.011
 curious+0.008
 hidden+0.008
Neg logit contributions
parents-0.060
father-0.058
ums-0.057
ips-0.057
ases-0.057
Token there
Token ID612
Feature activation-5.75e-01

Pos logit contributions
 curled+0.020
 safe+0.017
 cozy+0.017
 bark+0.016
 abandoned+0.016
Neg logit contributions
ips-0.054
emaker-0.054
ases-0.054
father-0.053
els-0.053
Token was
Token ID373
Feature activation+4.83e+00

Pos logit contributions
parents+0.018
father+0.017
ums+0.017
ases+0.016
ips+0.016
Neg logit contributions
 safe-0.007
 cozy-0.007
 curled-0.006
 curious-0.006
 scary-0.006
Token a
Token ID257
Feature activation+7.93e+00

Pos logit contributions
 curled+0.030
 safe+0.030
 cozy+0.024
 abandoned+0.023
 hollow+0.023
Neg logit contributions
parents-0.167
father-0.163
ases-0.160
ips-0.158
ums-0.158
Token little
Token ID1310
Feature activation+5.24e+00

Pos logit contributions
 curled+0.102
 perched+0.093
 abandoned+0.081
 hiding+0.081
 hollow+0.080
Neg logit contributions
parents-0.209
father-0.208
oral-0.207
orter-0.204
ums-0.200
Token bunny
Token ID44915
Feature activation+5.59e+00

Pos logit contributions
 curled+0.093
 safe+0.085
 perched+0.085
 cozy+0.082
 hiding+0.082
Neg logit contributions
father-0.119
parents-0.117
emaker-0.116
asers-0.116
orter-0.116
Token named
Token ID3706
Feature activation+1.08e+00

Pos logit contributions
 cozy+0.092
 safe+0.089
 scary+0.085
 shady+0.084
 hollow+0.083
Neg logit contributions
father-0.135
parents-0.132
ums-0.128
emaker-0.127
ases-0.126
Token Benny
Token ID44275
Feature activation+9.20e-01

Pos logit contributions
 safe+0.016
 curious+0.015
 curled+0.015
 filled+0.015
 abandoned+0.015
Neg logit contributions
parents-0.027
father-0.026
emaker-0.025
uns-0.025
ums-0.025
Token She
Token ID1375
Feature activation+7.62e-01

Pos logit contributions
 safe+0.019
 cozy+0.018
 curled+0.017
 hiding+0.017
 hidden+0.016
Neg logit contributions
parents-0.043
ums-0.042
father-0.041
asers-0.041
ips-0.041
Token ran
Token ID4966
Feature activation+2.04e+00

Pos logit contributions
 safe+0.011
 cozy+0.011
 hut+0.010
 scary+0.010
 hiding+0.010
Neg logit contributions
parents-0.021
father-0.019
asers-0.018
uns-0.018
pa-0.018
Token to
Token ID284
Feature activation+1.83e+00

Pos logit contributions
 cozy+0.031
 curled+0.028
 safe+0.027
 hut+0.027
 abandoned+0.027
Neg logit contributions
parents-0.053
ips-0.051
olds-0.049
ases-0.049
uns-0.049
Token the
Token ID262
Feature activation+2.09e+00

Pos logit contributions
 safe+0.024
 cozy+0.023
 curled+0.022
 scary+0.022
 abandoned+0.021
Neg logit contributions
parents-0.055
father-0.051
ases-0.050
ips-0.050
olds-0.049
Token beach
Token ID10481
Feature activation+4.41e+00

Pos logit contributions
 curled+0.036
 safe+0.034
 cozy+0.034
 hiding+0.033
 but+0.032
Neg logit contributions
parents-0.050
father-0.046
ups-0.044
ums-0.044
asers-0.044
Token hut
Token ID40812
Feature activation+5.82e+00

Pos logit contributions
 cozy+0.043
 safe+0.040
 scary+0.039
 curled+0.038
 creepy+0.034
Neg logit contributions
parents-0.140
ums-0.134
father-0.131
pa-0.130
ips-0.129
Token,
Token ID11
Feature activation+3.46e+00

Pos logit contributions
 cozy+0.069
 scary+0.064
 creepy+0.059
 safe+0.058
 cute+0.058
Neg logit contributions
parents-0.164
father-0.160
ips-0.157
ma-0.157
ums-0.154
Token waiting
Token ID4953
Feature activation+2.13e+00

Pos logit contributions
 hut+0.039
 scary+0.039
 cozy+0.038
 safe+0.036
 chamber+0.034
Neg logit contributions
parents-0.095
ips-0.090
olds-0.088
uns-0.087
ums-0.087
Token for
Token ID329
Feature activation-2.06e+00

Pos logit contributions
 curled+0.030
 cozy+0.030
 deer+0.028
 safe+0.028
 bark+0.028
Neg logit contributions
parents-0.053
ips-0.053
olds-0.051
iness-0.051
uns-0.050
Token her
Token ID607
Feature activation+3.66e-01

Pos logit contributions
parents+0.063
father+0.060
asers+0.058
ums+0.058
ips+0.058
Neg logit contributions
 safe-0.027
 cozy-0.026
 curled-0.025
 curious-0.024
 hiding-0.023
Token mummy
Token ID38567
Feature activation-4.00e-01

Pos logit contributions
 curled+0.005
 safe+0.005
 cozy+0.005
 abandoned+0.004
 curious+0.004
Neg logit contributions
parents-0.010
father-0.010
asers-0.010
ums-0.009
uns-0.009
Token and
Token ID290
Feature activation+1.36e+00

Pos logit contributions
parents+0.041
father+0.039
ums+0.037
ases+0.037
asers+0.037
Neg logit contributions
 safe-0.017
 curled-0.017
 cozy-0.017
 hidden-0.015
 scary-0.015
Token went
Token ID1816
Feature activation+7.40e-01

Pos logit contributions
 cozy+0.019
 safe+0.019
 scary+0.018
 curled+0.018
 curious+0.017
Neg logit contributions
parents-0.039
father-0.036
ums-0.034
asers-0.034
olds-0.034
Token inside
Token ID2641
Feature activation+2.30e+00

Pos logit contributions
 cozy+0.011
 curled+0.011
 curious+0.011
 scary+0.011
 safe+0.011
Neg logit contributions
parents-0.020
ips-0.019
father-0.019
ases-0.018
ums-0.018
Token her
Token ID607
Feature activation-1.11e-01

Pos logit contributions
 curled+0.026
 safe+0.025
 deer+0.025
 cozy+0.025
 scary+0.025
Neg logit contributions
parents-0.068
father-0.065
olds-0.063
ips-0.062
ums-0.062
Token house
Token ID2156
Feature activation+2.95e+00

Pos logit contributions
parents+0.003
father+0.003
ums+0.003
ips+0.003
els+0.003
Neg logit contributions
 curled-0.002
 safe-0.002
 cozy-0.002
 filled-0.002
 curious-0.001
Token.
Token ID13
Feature activation+2.75e-01

Pos logit contributions
 cozy+0.031
 curled+0.030
 scary+0.029
 safe+0.029
 curious+0.028
Neg logit contributions
parents-0.093
father-0.089
ips-0.086
ums-0.085
asers-0.084
Token She
Token ID1375
Feature activation+5.28e-01

Pos logit contributions
 safe+0.003
 cozy+0.003
 curled+0.003
 curious+0.003
 a+0.003
Neg logit contributions
parents-0.009
father-0.008
asers-0.008
ases-0.008
ips-0.008
Token knew
Token ID2993
Feature activation-1.08e+00

Pos logit contributions
 safe+0.007
 cozy+0.007
 scary+0.007
 hiding+0.006
 curious+0.006
Neg logit contributions
parents-0.015
father-0.015
asers-0.014
ums-0.014
ips-0.014
Token she
Token ID673
Feature activation-3.52e+00

Pos logit contributions
parents+0.034
father+0.032
asers+0.031
olds+0.030
ums+0.030
Neg logit contributions
 cozy-0.013
 safe-0.012
 curled-0.012
 curious-0.011
 shady-0.011
Token'd
Token ID1549
Feature activation-2.21e+00

Pos logit contributions
parents+0.099
father+0.094
asers+0.087
pa+0.086
emaker+0.086
Neg logit contributions
 cozy-0.053
 safe-0.051
 shady-0.049
 hiding-0.049
 curled-0.048
Token always
Token ID1464
Feature activation-2.21e+00

Pos logit contributions
parents+0.059
father+0.055
ases+0.052
ips+0.051
olds+0.051
Neg logit contributions
 cozy-0.036
 safe-0.036
 curled-0.034
 curious-0.034
 furry-0.032
Token?"
Token ID1701
Feature activation-2.73e-02

Pos logit contributions
parents+0.065
father+0.063
ases+0.061
asers+0.061
ips+0.060
Neg logit contributions
 cozy-0.033
 safe-0.032
 curled-0.032
 curious-0.029
 hiding-0.028
Token she
Token ID673
Feature activation+1.07e+00

Pos logit contributions
parents+0.001
father+0.001
pa+0.001
olds+0.001
ums+0.001
Neg logit contributions
 safe-0.000
 cozy-0.000
 curled-0.000
 shady-0.000
 hiding-0.000
Token asked
Token ID1965
Feature activation-1.26e+00

Pos logit contributions
 safe+0.017
 cozy+0.017
 curled+0.015
 shady+0.015
 hidden+0.015
Neg logit contributions
parents-0.029
father-0.028
ums-0.026
asers-0.026
els-0.026
Token John
Token ID1757
Feature activation-3.55e-01

Pos logit contributions
parents+0.042
father+0.041
ums+0.039
asers+0.039
emaker+0.039
Neg logit contributions
 cozy-0.013
 safe-0.012
 curled-0.012
 hidden-0.011
 scary-0.011
Token.
Token ID13
Feature activation+2.08e+00

Pos logit contributions
parents+0.012
father+0.012
ums+0.011
olds+0.011
asers+0.011
Neg logit contributions
 safe-0.003
 cozy-0.003
 curled-0.003
 hidden-0.003
 scary-0.003
Token\n
Token ID198
Feature activation-3.85e-01

Pos logit contributions
 safe+0.033
 cozy+0.032
 curled+0.031
 hidden+0.029
 scary+0.029
Neg logit contributions
parents-0.058
father-0.055
ums-0.053
olds-0.052
ips-0.052
Token\n
Token ID198
Feature activation-2.09e-01

Pos logit contributions
parents+0.013
asers+0.013
ases+0.012
ips+0.012
father+0.012
Neg logit contributions
 safe-0.004
 a-0.004
 cozy-0.004
 wide-0.003
 curious-0.003
TokenJohn
Token ID7554
Feature activation-1.01e+00

Pos logit contributions
parents+0.004
father+0.004
asers+0.004
ases+0.004
ums+0.004
Neg logit contributions
 safe-0.005
 cozy-0.005
 curled-0.005
 hiding-0.005
 hidden-0.005
Token looked
Token ID3114
Feature activation+3.01e+00

Pos logit contributions
parents+0.028
father+0.027
els+0.026
ums+0.025
pa+0.025
Neg logit contributions
 safe-0.016
 cozy-0.016
 shady-0.014
 curled-0.014
 scary-0.014
Token at
Token ID379
Feature activation-2.05e+00

Pos logit contributions
 shady+0.043
 curled+0.043
 tag+0.043
 cozy+0.042
 seek+0.041
Neg logit contributions
els-0.070
father-0.068
parents-0.068
olds-0.067
asers-0.067
Token the
Token ID262
Feature activation-7.35e-01

Pos logit contributions
parents+0.070
father+0.067
asers+0.066
ums+0.065
ases+0.065
Neg logit contributions
 safe-0.021
 cozy-0.020
 curled-0.018
 curious-0.017
 a-0.017
Token axe
Token ID23272
Feature activation+3.71e+00

Pos logit contributions
 cozy+0.038
 safe+0.038
 curled+0.037
 there+0.036
 shady+0.035
Neg logit contributions
father-0.069
parents-0.068
pa-0.067
els-0.066
ums-0.065
Token.
Token ID13
Feature activation+1.59e+00

Pos logit contributions
 cozy+0.055
 curled+0.046
 safe+0.045
 curious+0.045
 shady+0.043
Neg logit contributions
els-0.097
father-0.095
parents-0.093
emaker-0.093
ases-0.092
Token It
Token ID632
Feature activation-3.27e-01

Pos logit contributions
 safe+0.018
 cozy+0.017
 curled+0.016
 curious+0.015
 a+0.015
Neg logit contributions
parents-0.052
father-0.050
asers-0.049
ases-0.048
ums-0.048
Token is
Token ID318
Feature activation+3.53e+00

Pos logit contributions
parents+0.007
father+0.007
els+0.007
ases+0.007
asers+0.007
Neg logit contributions
 safe-0.006
 cozy-0.006
 curious-0.006
 abandoned-0.006
 curled-0.006
Token old
Token ID1468
Feature activation+3.37e+00

Pos logit contributions
 curled+0.034
 hut+0.031
 safe+0.031
 cozy+0.028
 peek+0.026
Neg logit contributions
els-0.102
father-0.096
ums-0.095
asers-0.094
parents-0.094
Token and
Token ID290
Feature activation+2.19e+00

Pos logit contributions
 curled+0.044
 cozy+0.043
 safe+0.042
 low+0.037
 clearing+0.035
Neg logit contributions
els-0.090
pa-0.085
ases-0.084
asers-0.084
father-0.083
Token rusty
Token ID47855
Feature activation+1.56e+00

Pos logit contributions
 curled+0.023
 safe+0.021
 hut+0.019
 hiding+0.019
 clearing+0.019
Neg logit contributions
els-0.063
asers-0.062
parents-0.061
father-0.060
olds-0.059
Token and
Token ID290
Feature activation+1.51e+00

Pos logit contributions
 cozy+0.023
 curled+0.023
 safe+0.020
 shady+0.020
 perched+0.019
Neg logit contributions
els-0.041
ums-0.038
asers-0.037
father-0.037
pa-0.036
Token has
Token ID468
Feature activation+6.32e-01

Pos logit contributions
 curled+0.017
 safe+0.016
 hiding+0.015
 cozy+0.015
 clearing+0.015
Neg logit contributions
parents-0.042
els-0.041
asers-0.041
father-0.040
ases-0.040
Token a
Token ID257
Feature activation+4.84e+00

Pos logit contributions
 safe+0.006
 cozy+0.006
 curled+0.006
 hiding+0.005
 curious+0.005
Neg logit contributions
parents-0.020
father-0.019
ases-0.019
asers-0.019
els-0.019
Token long
Token ID890
Feature activation+4.45e+00

Pos logit contributions
 curled+0.060
 safe+0.051
 filled+0.050
 cozy+0.047
 there+0.046
Neg logit contributions
parents-0.125
father-0.125
ums-0.124
els-0.121
oral-0.120
Token boats
Token ID16659
Feature activation-8.08e-01

Pos logit contributions
parents+0.118
father+0.111
ums+0.111
asers+0.111
uns+0.110
Neg logit contributions
 safe-0.087
 a-0.084
 cozy-0.083
 hidden-0.079
 open-0.079
Token and
Token ID290
Feature activation+2.73e-01

Pos logit contributions
parents+0.026
father+0.025
asers+0.024
ases+0.024
ips+0.024
Neg logit contributions
 cozy-0.009
 safe-0.009
 curled-0.008
 curious-0.008
 shady-0.007
Token many
Token ID867
Feature activation-1.85e+00

Pos logit contributions
 safe+0.005
 cozy+0.005
 curled+0.005
 curious+0.004
 hidden+0.004
Neg logit contributions
parents-0.007
father-0.007
asers-0.007
ases-0.006
ums-0.006
Token of
Token ID286
Feature activation-3.51e+00

Pos logit contributions
parents+0.049
father+0.047
ums+0.045
ips+0.045
asers+0.045
Neg logit contributions
 safe-0.033
 cozy-0.032
 curled-0.030
 hidden-0.029
 a-0.029
Token other
Token ID584
Feature activation-3.81e+00

Pos logit contributions
parents+0.085
father+0.081
ums+0.081
ips+0.080
asers+0.079
Neg logit contributions
 safe-0.069
 cozy-0.067
 a-0.065
 hidden-0.063
 curious-0.063
Token things
Token ID1243
Feature activation-1.26e+00

Pos logit contributions
parents+0.090
emaker+0.086
ips+0.085
father+0.085
ums+0.085
Neg logit contributions
 safe-0.078
 cozy-0.074
 a-0.072
 curious-0.072
 scary-0.072
Token.
Token ID13
Feature activation-8.54e-01

Pos logit contributions
parents+0.039
asers+0.037
father+0.037
ips+0.036
els+0.036
Neg logit contributions
 cozy-0.014
 curled-0.014
 safe-0.013
 curious-0.012
 shady-0.012
Token The
Token ID383
Feature activation-1.68e+00

Pos logit contributions
parents+0.028
father+0.027
asers+0.027
iness+0.027
ases+0.027
Neg logit contributions
 a-0.009
 cozy-0.009
 safe-0.009
 but-0.008
 curious-0.007
Token little
Token ID1310
Feature activation-3.99e+00

Pos logit contributions
ips+0.065
parents+0.065
ases+0.065
father+0.063
ums+0.063
Neg logit contributions
 safe-0.010
 cozy-0.008
 curious-0.008
 a-0.008
 wide-0.007
Token boy
Token ID2933
Feature activation-9.16e-01

Pos logit contributions
parents+0.123
father+0.118
asers+0.115
ases+0.114
ums+0.113
Neg logit contributions
 safe-0.052
 cozy-0.051
 curled-0.049
 hidden-0.044
 curious-0.044
Token was
Token ID373
Feature activation+2.11e+00

Pos logit contributions
parents+0.029
father+0.028
asers+0.027
ums+0.027
ases+0.027
Neg logit contributions
 safe-0.011
 cozy-0.010
 curled-0.009
 hiding-0.009
 scary-0.009
Token with
Token ID351
Feature activation-3.16e+00

Pos logit contributions
parents+0.051
ases+0.051
ips+0.051
father+0.049
orter+0.048
Neg logit contributions
 curled-0.022
 hollow-0.020
 cozy-0.019
 perched-0.019
 abandoned-0.018
Token the
Token ID262
Feature activation-2.14e+00

Pos logit contributions
parents+0.110
father+0.105
asers+0.103
ums+0.102
ases+0.102
Neg logit contributions
 safe-0.031
 cozy-0.029
 curled-0.026
 a-0.025
 curious-0.024
Token rocket
Token ID10701
Feature activation+9.86e-01

Pos logit contributions
parents+0.061
father+0.060
asers+0.056
emaker+0.056
ips+0.056
Neg logit contributions
 curled-0.031
 safe-0.030
 cozy-0.029
 curious-0.027
 filled-0.027
Token and
Token ID290
Feature activation+9.56e-01

Pos logit contributions
 cozy+0.010
 curled+0.010
 safe+0.009
 curious+0.009
 abandoned+0.008
Neg logit contributions
parents-0.032
father-0.031
ases-0.030
emaker-0.030
asers-0.030
Token make
Token ID787
Feature activation-3.78e+00

Pos logit contributions
 safe+0.012
 curled+0.012
 cozy+0.012
 curious+0.011
 hidden+0.011
Neg logit contributions
parents-0.028
father-0.027
ases-0.026
asers-0.026
olds-0.026
Token it
Token ID340
Feature activation+1.61e-01

Pos logit contributions
parents+0.136
father+0.131
ases+0.128
asers+0.128
ums+0.127
Neg logit contributions
 safe-0.028
 curled-0.026
 cozy-0.026
 curious-0.021
 hidden-0.021
Token fly
Token ID6129
Feature activation-1.08e+00

Pos logit contributions
 abandoned+0.002
 curled+0.002
 safe+0.002
 clearing+0.002
 curious+0.002
Neg logit contributions
parents-0.004
ases-0.004
ips-0.004
olds-0.004
father-0.004
Token.
Token ID13
Feature activation-5.37e-01

Pos logit contributions
ases+0.029
parents+0.028
ips+0.028
angles+0.027
els+0.027
Neg logit contributions
 curled-0.015
 cozy-0.015
 abandoned-0.014
 curious-0.014
 hollow-0.013
Token They
Token ID1119
Feature activation-5.37e-01

Pos logit contributions
parents+0.018
father+0.017
asers+0.017
emaker+0.016
ases+0.016
Neg logit contributions
 safe-0.006
 cozy-0.006
 curled-0.006
 a-0.005
 curious-0.005
Token save
Token ID3613
Feature activation-4.85e+00

Pos logit contributions
parents+0.015
father+0.014
asers+0.014
ums+0.014
uns+0.014
Neg logit contributions
 safe-0.009
 cozy-0.009
 curled-0.008
 a-0.008
 open-0.008
Token the
Token ID262
Feature activation-2.09e+00

Pos logit contributions
parents+0.179
father+0.173
asers+0.170
ums+0.169
uns+0.168
Neg logit contributions
 safe-0.040
 cozy-0.036
 a-0.035
 open-0.029
 hidden-0.029
TokenP
Token ID47
Feature activation-7.28e+00

Pos logit contributions
parents+0.038
father+0.037
asers+0.036
iness+0.035
emaker+0.035
Neg logit contributions
 cozy-0.022
 safe-0.022
 a-0.021
 wide-0.021
 curious-0.021
Tokenandy
Token ID10757
Feature activation-2.38e+00

Pos logit contributions
parents+0.139
emaker+0.130
father+0.128
iness+0.123
 narrative+0.121
Neg logit contributions
,-0.159
 a-0.155
 high-0.155
 curled-0.154
 safe-0.154
Token smiled
Token ID13541
Feature activation-2.82e-01

Pos logit contributions
parents+0.064
father+0.060
olds+0.058
ums+0.058
els+0.057
Neg logit contributions
 safe-0.039
 cozy-0.039
 shady-0.034
 scary-0.034
 hidden-0.034
Token.
Token ID13
Feature activation+9.79e-02

Pos logit contributions
ums+0.008
father+0.008
parents+0.008
ips+0.008
ases+0.008
Neg logit contributions
 curled-0.003
 abandoned-0.003
 cozy-0.003
 safe-0.003
 hiding-0.003
Token He
Token ID679
Feature activation-9.65e-01

Pos logit contributions
 safe+0.001
 cozy+0.001
 curled+0.001
 hidden+0.001
 scary+0.001
Neg logit contributions
parents-0.003
ums-0.003
father-0.003
ips-0.003
ases-0.003
Token said
Token ID531
Feature activation-2.45e+00

Pos logit contributions
parents+0.022
father+0.020
oral+0.020
asers+0.020
olds+0.019
Neg logit contributions
 safe-0.018
 cozy-0.018
 hut-0.017
 hiding-0.016
 underground-0.016
Token,
Token ID11
Feature activation+1.48e+00

Pos logit contributions
parents+0.086
ums+0.083
father+0.083
asers+0.082
olds+0.082
Neg logit contributions
 cozy-0.019
 safe-0.018
 curled-0.017
 abandoned-0.015
 hidden-0.015
Token "
Token ID366
Feature activation-2.50e+00

Pos logit contributions
 safe+0.014
 cozy+0.014
 curled+0.013
 hidden+0.012
 scary+0.012
Neg logit contributions
parents-0.048
father-0.047
orter-0.046
asers-0.046
ums-0.046
TokenOf
Token ID5189
Feature activation-8.12e+00

Pos logit contributions
parents+0.068
father+0.066
els+0.065
asers+0.064
ases+0.064
Neg logit contributions
 safe-0.043
 cozy-0.041
 curious-0.039
 a-0.038
 wide-0.038
Token course
Token ID1781
Feature activation-2.41e+00

Pos logit contributions
asers+0.190
ips+0.187
ums+0.184
uns+0.180
ugs+0.179
Neg logit contributions
 a-0.171
 safe-0.171
,-0.161
 scary-0.153
 furry-0.153
Token I
Token ID314
Feature activation-6.19e+00

Pos logit contributions
parents+0.075
father+0.071
olds+0.071
ums+0.070
asers+0.070
Neg logit contributions
 cozy-0.026
 curled-0.026
 safe-0.023
 abandoned-0.022
 hut-0.022
Token up
Token ID510
Feature activation-1.79e+00

Pos logit contributions
parents+0.163
father+0.155
asers+0.150
ases+0.149
ums+0.148
Neg logit contributions
 safe-0.086
 cozy-0.086
 curled-0.084
 curious-0.078
 hiding-0.078
Token something
Token ID1223
Feature activation-1.04e+00

Pos logit contributions
parents+0.056
father+0.054
asers+0.052
ases+0.052
olds+0.052
Neg logit contributions
 curled-0.020
 cozy-0.019
 safe-0.018
 hiding-0.017
 filled-0.017
Token he
Token ID339
Feature activation-5.08e+00

Pos logit contributions
parents+0.030
olds+0.029
ups+0.029
asers+0.029
father+0.028
Neg logit contributions
 curled-0.012
 cozy-0.011
 clearing-0.011
 abandoned-0.010
 shady-0.010
Token wanted
Token ID2227
Feature activation-4.27e+00

Pos logit contributions
parents+0.152
father+0.144
asers+0.140
ases+0.137
olds+0.135
Neg logit contributions
 safe-0.069
 cozy-0.069
 curled-0.065
 hiding-0.060
 shady-0.060
Token.
Token ID13
Feature activation+5.90e-02

Pos logit contributions
parents+0.146
father+0.140
asers+0.136
ases+0.136
olds+0.134
Neg logit contributions
 cozy-0.040
 safe-0.038
 curled-0.036
 curious-0.033
 scary-0.030
Token<|endoftext|>
Token ID50256
Feature activation-2.51e+00

Pos logit contributions
 safe+0.001
 cozy+0.001
 curled+0.001
 a+0.001
 curious+0.001
Neg logit contributions
parents-0.002
father-0.002
asers-0.002
ases-0.002
ips-0.002
TokenOnce
Token ID7454
Feature activation-1.87e+00

Pos logit contributions
parents+0.078
father+0.075
asers+0.073
ases+0.072
ums+0.072
Neg logit contributions
 safe-0.032
 cozy-0.031
 curled-0.029
 a-0.027
 curious-0.027
Token upon
Token ID2402
Feature activation+2.16e+00

Pos logit contributions
parents+0.066
ases+0.063
asers+0.063
father+0.062
uns+0.061
Neg logit contributions
 safe-0.018
 cozy-0.017
 there-0.017
 a-0.016
 high-0.016
Token a
Token ID257
Feature activation+4.04e+00

Pos logit contributions
 curled+0.019
 safe+0.018
 hollow+0.018
 abandoned+0.018
 hut+0.017
Neg logit contributions
parents-0.064
father-0.062
ases-0.060
olds-0.060
ons-0.058
Token time
Token ID640
Feature activation+2.02e+00

Pos logit contributions
 safe+0.068
 cozy+0.066
 curled+0.063
 curious+0.060
 hidden+0.060
Neg logit contributions
parents-0.110
father-0.105
asers-0.102
ases-0.101
ums-0.100
Token,
Token ID11
Feature activation+2.61e+00

Pos logit contributions
 safe+0.017
 cozy+0.016
 curled+0.016
 curious+0.012
 filled+0.012
Neg logit contributions
parents-0.069
father-0.067
ums-0.066
ips-0.066
olds-0.065
Token jealous
Token ID19354
Feature activation-1.75e-01

Pos logit contributions
 curled+0.009
 hut+0.009
 cozy+0.009
 shady+0.008
 safe+0.008
Neg logit contributions
parents-0.022
father-0.021
ums-0.021
ips-0.021
ases-0.020
Token.
Token ID13
Feature activation+2.99e-01

Pos logit contributions
parents+0.004
asers+0.004
ases+0.004
father+0.004
ums+0.004
Neg logit contributions
 cozy-0.003
 safe-0.003
 curled-0.003
 curious-0.003
 hidden-0.003
Token I
Token ID314
Feature activation-2.68e+00

Pos logit contributions
 safe+0.004
 cozy+0.004
 curled+0.004
 hidden+0.004
 hiding+0.004
Neg logit contributions
parents-0.009
father-0.009
asers-0.008
ums-0.008
ases-0.008
Token wanted
Token ID2227
Feature activation-2.38e+00

Pos logit contributions
parents+0.076
ases+0.071
father+0.071
olds+0.071
asers+0.070
Neg logit contributions
 safe-0.040
 cozy-0.039
 curled-0.035
 hut-0.034
 shady-0.034
Token to
Token ID284
Feature activation-1.34e+00

Pos logit contributions
parents+0.085
father+0.082
asers+0.081
ases+0.080
ums+0.080
Neg logit contributions
 safe-0.019
 cozy-0.018
 curled-0.017
 curious-0.015
 hidden-0.015
Token make
Token ID787
Feature activation-3.28e+00

Pos logit contributions
parents+0.038
father+0.036
asers+0.035
ases+0.035
ums+0.035
Neg logit contributions
 safe-0.020
 cozy-0.020
 curled-0.019
 curious-0.018
 hidden-0.018
Token the
Token ID262
Feature activation+9.77e-01

Pos logit contributions
parents+0.107
father+0.103
asers+0.100
ases+0.099
ums+0.099
Neg logit contributions
 safe-0.037
 cozy-0.036
 curled-0.034
 curious-0.031
 a-0.031
Token tallest
Token ID38760
Feature activation+2.92e-01

Pos logit contributions
 safe+0.016
 cozy+0.016
 curled+0.015
 curious+0.014
 hidden+0.014
Neg logit contributions
parents-0.027
father-0.026
asers-0.025
ases-0.024
ums-0.024
Token tower
Token ID10580
Feature activation-1.01e+00

Pos logit contributions
 curled+0.004
 cozy+0.004
 safe+0.004
 hiding+0.004
 abandoned+0.003
Neg logit contributions
parents-0.008
father-0.008
asers-0.008
ases-0.008
ums-0.008
Token and
Token ID290
Feature activation+9.92e-01

Pos logit contributions
parents+0.030
father+0.029
ases+0.029
asers+0.028
emaker+0.028
Neg logit contributions
 cozy-0.013
 safe-0.013
 curled-0.012
 curious-0.011
 hidden-0.011
Token I
Token ID314
Feature activation-4.23e+00

Pos logit contributions
 curled+0.012
 safe+0.011
 cozy+0.011
 hut+0.010
 hiding+0.010
Neg logit contributions
parents-0.029
asers-0.027
father-0.027
orter-0.027
ases-0.027
Token she
Token ID673
Feature activation+4.32e-02

Pos logit contributions
parents+0.077
father+0.073
asers+0.071
ums+0.070
ips+0.070
Neg logit contributions
 safe-0.040
 cozy-0.039
 curled-0.038
 hidden-0.035
 curious-0.035
Token asked
Token ID1965
Feature activation-2.76e+00

Pos logit contributions
 safe+0.001
 cozy+0.001
 shady+0.001
 curled+0.001
 hut+0.001
Neg logit contributions
parents-0.001
father-0.001
pa-0.001
asers-0.001
olds-0.001
Token.
Token ID13
Feature activation+4.36e-01

Pos logit contributions
parents+0.085
father+0.081
asers+0.079
ums+0.078
ases+0.078
Neg logit contributions
 safe-0.036
 cozy-0.035
 curled-0.033
 curious-0.031
 hidden-0.031
Token Lily
Token ID20037
Feature activation-1.93e+00

Pos logit contributions
 safe+0.007
 cozy+0.006
 curled+0.006
 hidden+0.006
 hiding+0.006
Neg logit contributions
parents-0.012
father-0.012
ums-0.012
asers-0.012
ips-0.011
Token told
Token ID1297
Feature activation-3.85e+00

Pos logit contributions
parents+0.050
father+0.048
pa+0.045
ums+0.044
olds+0.044
Neg logit contributions
 safe-0.034
 cozy-0.033
 shady-0.031
 high-0.030
 hut-0.030
Token her
Token ID607
Feature activation-2.02e+00

Pos logit contributions
parents+0.113
asers+0.110
father+0.109
ips+0.108
ases+0.108
Neg logit contributions
 safe-0.056
 a-0.053
 cozy-0.052
 curled-0.051
 open-0.048
Token about
Token ID546
Feature activation-1.78e+00

Pos logit contributions
parents+0.058
father+0.055
asers+0.054
ums+0.053
olds+0.053
Neg logit contributions
 safe-0.030
 cozy-0.030
 curled-0.028
 hidden-0.027
 hiding-0.027
Token Tim
Token ID5045
Feature activation-3.72e+00

Pos logit contributions
parents+0.056
father+0.053
asers+0.051
els+0.050
ums+0.050
Neg logit contributions
 safe-0.022
 cozy-0.021
 curled-0.020
 curious-0.019
 hut-0.018
Tokenmy
Token ID1820
Feature activation-1.70e+00

Pos logit contributions
parents+0.059
father+0.055
asers+0.051
ums+0.049
emaker+0.049
Neg logit contributions
 safe-0.104
 cozy-0.104
 curled-0.101
 curious-0.098
 hidden-0.097
Tokenâ
Token ID22940
Feature activation-5.97e+00

Pos logit contributions
parents+0.055
father+0.053
asers+0.050
olds+0.050
ums+0.050
Neg logit contributions
 cozy-0.018
 safe-0.017
 curled-0.017
 shady-0.016
 hut-0.015
Token€
Token ID26391
Feature activation-3.03e+00

Pos logit contributions
parents+0.120
father+0.115
asers+0.114
els+0.110
ases+0.110
Neg logit contributions
 safe-0.141
 cozy-0.132
 curled-0.132
 hidden-0.126
 hiding-0.126
Token the
Token ID262
Feature activation-9.77e-01

Pos logit contributions
parents+0.012
father+0.011
asers+0.011
ums+0.011
uns+0.011
Neg logit contributions
 safe-0.003
 cozy-0.003
 curled-0.002
 a-0.002
 hidden-0.002
Token dirt
Token ID13647
Feature activation+1.82e+00

Pos logit contributions
parents+0.029
father+0.027
ases+0.026
ums+0.026
uns+0.026
Neg logit contributions
 curled-0.013
 safe-0.013
 cozy-0.013
 curious-0.012
 filled-0.011
Token.
Token ID13
Feature activation+8.04e-01

Pos logit contributions
 cozy+0.015
 safe+0.015
 curled+0.013
 curious+0.012
 hidden+0.011
Neg logit contributions
parents-0.063
father-0.061
ases-0.060
asers-0.060
ips-0.060
Token\n
Token ID198
Feature activation-1.67e+00

Pos logit contributions
 safe+0.008
 cozy+0.008
 a+0.007
 curious+0.007
 curled+0.007
Neg logit contributions
parents-0.027
father-0.026
asers-0.026
emaker-0.025
uns-0.025
Token\n
Token ID198
Feature activation-1.33e+00

Pos logit contributions
asers+0.066
parents+0.065
iness+0.063
father+0.063
emaker+0.062
Neg logit contributions
 a-0.011
 safe-0.010
 far-0.008
 high-0.008
 cozy-0.008
Token"
Token ID1
Feature activation-2.78e+00

Pos logit contributions
parents+0.039
father+0.037
asers+0.037
ums+0.036
uns+0.036
Neg logit contributions
 safe-0.020
 cozy-0.019
 a-0.018
 curious-0.018
 curled-0.018
TokenBut
Token ID1537
Feature activation-1.04e+00

Pos logit contributions
parents+0.074
father+0.073
asers+0.069
ums+0.069
uns+0.067
Neg logit contributions
 safe-0.049
 cozy-0.048
 a-0.047
 curious-0.046
 curled-0.045
Token it
Token ID340
Feature activation-3.63e+00

Pos logit contributions
parents+0.029
father+0.028
els+0.027
ases+0.026
ums+0.026
Neg logit contributions
 cozy-0.016
 safe-0.015
 curled-0.015
 shady-0.014
 abandoned-0.014
Token would
Token ID561
Feature activation-6.69e-01

Pos logit contributions
parents+0.084
els+0.082
father+0.080
ases+0.079
asers+0.079
Neg logit contributions
 safe-0.065
 cozy-0.065
 abandoned-0.062
 curled-0.062
 shady-0.060
Token be
Token ID307
Feature activation-4.80e-01

Pos logit contributions
parents+0.019
father+0.018
ases+0.018
asers+0.018
ums+0.018
Neg logit contributions
 safe-0.009
 cozy-0.009
 curled-0.009
 curious-0.008
 hidden-0.008
Token fun
Token ID1257
Feature activation+1.84e-01

Pos logit contributions
parents+0.014
father+0.014
asers+0.014
els+0.013
uns+0.013
Neg logit contributions
 curled-0.005
 safe-0.005
 cozy-0.005
 hut-0.005
 deer-0.004
Token sorry
Token ID7926
Feature activation-3.74e+00

Pos logit contributions
parents+0.134
father+0.128
asers+0.127
ums+0.126
ases+0.126
Neg logit contributions
 curled-0.074
 cozy-0.072
 safe-0.071
 abandoned-0.067
 hidden-0.067
Token.
Token ID13
Feature activation-3.15e-01

Pos logit contributions
parents+0.116
ases+0.113
olds+0.113
ums+0.111
asers+0.111
Neg logit contributions
 cozy-0.041
 curled-0.040
 safe-0.040
 abandoned-0.038
 hollow-0.035
Token They
Token ID1119
Feature activation-1.37e+00

Pos logit contributions
parents+0.009
father+0.008
ums+0.008
ips+0.008
ases+0.008
Neg logit contributions
 safe-0.005
 curled-0.005
 cozy-0.004
 hidden-0.004
 hiding-0.004
Token hoped
Token ID10719
Feature activation-3.02e+00

Pos logit contributions
parents+0.036
father+0.034
asers+0.033
ases+0.033
ums+0.032
Neg logit contributions
 safe-0.024
 cozy-0.023
 curled-0.022
 hiding-0.022
 curious-0.021
Token their
Token ID511
Feature activation-3.77e+00

Pos logit contributions
parents+0.100
father+0.095
asers+0.093
ases+0.092
ums+0.092
Neg logit contributions
 safe-0.031
 cozy-0.031
 curled-0.029
 curious-0.027
 hidden-0.026
Token k
Token ID479
Feature activation-7.69e+00

Pos logit contributions
parents+0.110
father+0.105
asers+0.103
ums+0.101
ases+0.101
Neg logit contributions
 safe-0.057
 cozy-0.055
 curled-0.052
 curious-0.050
 hidden-0.049
Tokenite
Token ID578
Feature activation-3.28e+00

Pos logit contributions
parents+0.145
father+0.130
 narrative+0.120
oral+0.119
iness+0.119
Neg logit contributions
 safe-0.192
 high-0.185
 furry-0.183
 far-0.182
 deer-0.182
Token was
Token ID373
Feature activation-9.27e-01

Pos logit contributions
parents+0.092
father+0.086
ases+0.085
asers+0.084
ips+0.084
Neg logit contributions
 cozy-0.048
 safe-0.047
 curled-0.046
 curious-0.044
 hiding-0.043
Token happy
Token ID3772
Feature activation-1.12e+00

Pos logit contributions
parents+0.025
father+0.023
els+0.023
ases+0.023
ips+0.022
Neg logit contributions
 curled-0.012
 abandoned-0.011
 clearing-0.011
 bark-0.011
 Under-0.011
Token too
Token ID1165
Feature activation-3.26e+00

Pos logit contributions
parents+0.034
father+0.033
ases+0.033
ips+0.032
asers+0.032
Neg logit contributions
 cozy-0.013
 safe-0.012
 curled-0.012
 abandoned-0.011
 hollow-0.011
Token.
Token ID13
Feature activation-5.94e-01

Pos logit contributions
parents+0.079
father+0.077
iness+0.077
ums+0.077
pa+0.076
Neg logit contributions
 curled-0.048
 clearing-0.046
 hiding-0.046
 perched-0.045
 hollow-0.044
Token voice
Token ID3809
Feature activation+3.72e-01

Pos logit contributions
parents+0.038
father+0.038
ums+0.036
asers+0.035
oral+0.035
Neg logit contributions
 curled-0.016
 safe-0.015
 cozy-0.015
 shady-0.014
 filled-0.014
Token said
Token ID531
Feature activation-4.93e-01

Pos logit contributions
 cozy+0.006
 safe+0.006
 shady+0.005
 curled+0.005
 hiding+0.005
Neg logit contributions
parents-0.010
father-0.009
asers-0.009
emaker-0.009
els-0.009
Token.
Token ID13
Feature activation+3.76e-02

Pos logit contributions
parents+0.013
ums+0.013
father+0.013
emaker+0.013
olds+0.013
Neg logit contributions
 cozy-0.006
 shady-0.006
 sandy-0.006
 safe-0.006
 hidden-0.006
Token\n
Token ID198
Feature activation-1.36e+00

Pos logit contributions
 safe+0.001
 cozy+0.001
 curled+0.001
 hidden+0.001
 hiding+0.001
Neg logit contributions
parents-0.001
olds-0.001
father-0.001
ips-0.001
oral-0.001
Token\n
Token ID198
Feature activation-8.68e-01

Pos logit contributions
parents+0.037
father+0.034
olds+0.034
ips+0.033
ases+0.033
Neg logit contributions
 safe-0.021
 curled-0.021
 cozy-0.021
 hiding-0.020
 hidden-0.019
TokenLu
Token ID25596
Feature activation-8.20e+00

Pos logit contributions
parents+0.016
oral+0.015
olds+0.015
father+0.015
ups+0.014
Neg logit contributions
 curled-0.019
 cozy-0.019
 hiding-0.019
 safe-0.018
 hidden-0.018
Tokency
Token ID948
Feature activation-4.10e-01

Pos logit contributions
parents+0.129
father+0.118
ips+0.115
els+0.112
ases+0.111
Neg logit contributions
 safe-0.230
 cozy-0.227
 scary-0.218
 curled-0.214
 curious-0.213
Token felt
Token ID2936
Feature activation+3.28e+00

Pos logit contributions
parents+0.011
father+0.010
ums+0.010
ople+0.010
io+0.010
Neg logit contributions
 safe-0.007
 cozy-0.007
 scary-0.006
 shady-0.006
 hut-0.006
Token generous
Token ID14431
Feature activation+5.55e-01

Pos logit contributions
 hiding+0.037
 hut+0.035
 shady+0.034
 stable+0.032
 abandoned+0.032
Neg logit contributions
olds-0.092
parents-0.090
ums-0.088
ases-0.086
father-0.084
Token and
Token ID290
Feature activation+2.34e+00

Pos logit contributions
 cozy+0.007
 hut+0.006
 scary+0.006
 safe+0.006
 dangerous+0.006
Neg logit contributions
ums-0.016
iness-0.015
parents-0.015
olds-0.015
father-0.015
Token quickly
Token ID2952
Feature activation+1.04e+00

Pos logit contributions
 hiding+0.035
 hut+0.034
 clearing+0.033
 abandoned+0.033
 cabin+0.032
Neg logit contributions
parents-0.055
olds-0.050
ups-0.048
ums-0.047
father-0.047
Token he
Token ID339
Feature activation-1.12e+00

Pos logit contributions
 cozy+0.022
 safe+0.022
 curled+0.020
 hut+0.019
 hiding+0.019
Neg logit contributions
parents-0.047
father-0.043
olds-0.042
asers-0.042
ases-0.042
Token didn
Token ID1422
Feature activation-4.72e+00

Pos logit contributions
parents+0.033
father+0.031
asers+0.030
ases+0.029
olds+0.029
Neg logit contributions
 safe-0.016
 cozy-0.016
 hiding-0.014
 curled-0.014
 shady-0.014
Token't
Token ID470
Feature activation-2.31e+00

Pos logit contributions
parents+0.112
father+0.105
asers+0.104
uns+0.102
ums+0.098
Neg logit contributions
 safe-0.098
 a-0.095
 cozy-0.094
 open-0.092
 curled-0.091
Token want
Token ID765
Feature activation-4.42e+00

Pos logit contributions
parents+0.061
father+0.058
asers+0.056
ases+0.056
ums+0.055
Neg logit contributions
 safe-0.040
 cozy-0.040
 curled-0.038
 curious-0.036
 hidden-0.036
Token L
Token ID406
Feature activation-1.33e+01

Pos logit contributions
parents+0.168
father+0.163
asers+0.160
ums+0.158
emaker+0.158
Neg logit contributions
 safe-0.027
 cozy-0.024
 a-0.023
 curled-0.022
 hidden-0.020
Tokenyl
Token ID2645
Feature activation-9.22e+00

Pos logit contributions
father+0.309
parents+0.306
emaker+0.289
 narrative+0.288
uns+0.288
Neg logit contributions
 hidden-0.238
 there-0.235
 a-0.234
 hiding-0.228
 open-0.225
Tokena
Token ID64
Feature activation-3.91e+00

Pos logit contributions
parents+0.215
father+0.204
ums+0.203
ips+0.203
 narrative+0.201
Neg logit contributions
 safe-0.184
 a-0.175
 curled-0.168
 cozy-0.167
 hidden-0.166
Token to
Token ID284
Feature activation-2.80e+00

Pos logit contributions
parents+0.125
father+0.119
asers+0.118
olds+0.117
ases+0.117
Neg logit contributions
 cozy-0.043
 safe-0.041
 curled-0.040
 hidden-0.036
 hiding-0.035
Token be
Token ID307
Feature activation-2.82e+00

Pos logit contributions
parents+0.075
father+0.070
ases+0.069
ums+0.068
olds+0.067
Neg logit contributions
 cozy-0.045
 safe-0.044
 curled-0.043
 curious-0.041
 hidden-0.040
Token unhappy
Token ID19283
Feature activation-1.19e+00

Pos logit contributions
parents+0.084
father+0.082
ases+0.081
asers+0.080
ips+0.080
Neg logit contributions
 curled-0.030
 cozy-0.029
 hut-0.027
 shady-0.027
 safe-0.025
Token.
Token ID13
Feature activation+7.11e-01

Pos logit contributions
parents+0.037
ases+0.036
father+0.036
ums+0.035
asers+0.035
Neg logit contributions
 cozy-0.013
 safe-0.013
 curled-0.013
 shady-0.011
 abandoned-0.011
Token and
Token ID290
Feature activation+1.05e+00

Pos logit contributions
parents+0.044
father+0.042
ums+0.041
uns+0.041
asers+0.040
Neg logit contributions
 cozy-0.012
 safe-0.012
 curled-0.011
 scary-0.011
 hidden-0.011
Token thanked
Token ID26280
Feature activation-3.43e+00

Pos logit contributions
 safe+0.021
 cozy+0.020
 curled+0.020
 hidden+0.019
 hiding+0.019
Neg logit contributions
parents-0.025
father-0.024
asers-0.022
ums-0.022
ases-0.022
Token her
Token ID607
Feature activation-1.53e+00

Pos logit contributions
parents+0.100
father+0.095
asers+0.093
ases+0.092
ums+0.092
Neg logit contributions
 safe-0.051
 cozy-0.049
 curled-0.047
 curious-0.045
 hidden-0.044
Token for
Token ID329
Feature activation-4.05e+00

Pos logit contributions
parents+0.042
ums+0.041
father+0.040
oral+0.039
asers+0.039
Neg logit contributions
 curled-0.021
 safe-0.021
 abandoned-0.021
 cozy-0.021
 scary-0.020
Token not
Token ID407
Feature activation-3.96e+00

Pos logit contributions
parents+0.110
father+0.105
asers+0.100
ases+0.099
ips+0.099
Neg logit contributions
 cozy-0.065
 safe-0.065
 curled-0.064
 curious-0.060
 scary-0.059
Token punishing
Token ID28641
Feature activation-8.10e+00

Pos logit contributions
parents+0.097
father+0.091
olds+0.088
asers+0.088
ips+0.087
Neg logit contributions
 cozy-0.071
 curled-0.070
 safe-0.068
 curious-0.063
 furry-0.063
Token them
Token ID606
Feature activation-3.84e+00

Pos logit contributions
parents+0.212
father+0.201
asers+0.197
ases+0.194
ips+0.193
Neg logit contributions
 safe-0.145
 cozy-0.140
 curled-0.134
 a-0.131
 curious-0.130
Token.
Token ID13
Feature activation-3.29e-01

Pos logit contributions
parents+0.124
father+0.119
asers+0.115
ums+0.115
olds+0.115
Neg logit contributions
 cozy-0.040
 curled-0.038
 safe-0.037
 abandoned-0.034
 curious-0.034
Token<|endoftext|>
Token ID50256
Feature activation-2.33e+00

Pos logit contributions
parents+0.010
father+0.010
asers+0.009
ases+0.009
ums+0.009
Neg logit contributions
 safe-0.004
 cozy-0.004
 curled-0.004
 curious-0.004
 hidden-0.004
TokenOnce
Token ID7454
Feature activation-2.15e+00

Pos logit contributions
parents+0.059
father+0.056
ips+0.054
ases+0.053
asers+0.053
Neg logit contributions
 safe-0.043
 cozy-0.042
 curled-0.041
 curious-0.039
 scary-0.038
Token upon
Token ID2402
Feature activation+2.29e+00

Pos logit contributions
parents+0.073
asers+0.070
ases+0.070
father+0.069
uns+0.068
Neg logit contributions
 safe-0.023
 cozy-0.022
 there-0.020
 a-0.020
 high-0.019
Token little
Token ID1310
Feature activation-3.23e-01

Pos logit contributions
 curled+0.033
 safe+0.033
 cozy+0.033
 filled+0.029
 hiding+0.029
Neg logit contributions
parents-0.089
father-0.084
oral-0.082
asers-0.082
uns-0.081
Token longer
Token ID2392
Feature activation-1.29e+00

Pos logit contributions
parents+0.009
father+0.009
asers+0.009
emaker+0.008
ases+0.008
Neg logit contributions
 cozy-0.004
 curled-0.004
 safe-0.004
 perched-0.004
 filled-0.004
Token."
Token ID526
Feature activation+2.40e-01

Pos logit contributions
parents+0.040
father+0.038
ases+0.038
asers+0.037
uns+0.037
Neg logit contributions
 cozy-0.014
 curled-0.014
 safe-0.012
 hollow-0.011
 hidden-0.011
Token\n
Token ID198
Feature activation-1.38e+00

Pos logit contributions
 safe+0.003
 cozy+0.003
 curled+0.003
 curious+0.003
 hidden+0.003
Neg logit contributions
parents-0.007
father-0.007
asers-0.007
ums-0.007
ases-0.007
Token\n
Token ID198
Feature activation-8.24e-01

Pos logit contributions
parents+0.049
asers+0.048
father+0.047
ips+0.047
iness+0.046
Neg logit contributions
 safe-0.012
 cozy-0.012
 a-0.011
 curious-0.010
 curled-0.009
TokenL
Token ID43
Feature activation-8.25e+00

Pos logit contributions
parents+0.021
father+0.020
asers+0.020
ips+0.019
ases+0.019
Neg logit contributions
 safe-0.015
 cozy-0.015
 curled-0.014
 curious-0.014
 a-0.014
Tokenily
Token ID813
Feature activation-1.68e+00

Pos logit contributions
parents+0.186
father+0.175
orter+0.171
uns+0.170
ums+0.169
Neg logit contributions
 a-0.161
 hiding-0.157
 curled-0.155
 safe-0.153
 hidden-0.151
Token and
Token ID290
Feature activation-9.81e-01

Pos logit contributions
parents+0.049
father+0.047
olds+0.045
ums+0.045
ases+0.045
Neg logit contributions
 safe-0.025
 cozy-0.024
 scary-0.021
 shady-0.021
 hidden-0.021
Token Max
Token ID5436
Feature activation-1.69e+00

Pos logit contributions
parents+0.029
asers+0.027
father+0.027
ips+0.027
uns+0.027
Neg logit contributions
 safe-0.014
 cozy-0.014
 a-0.013
 curled-0.013
 hiding-0.013
Token gigg
Token ID30442
Feature activation-4.56e+00

Pos logit contributions
parents+0.041
ases+0.037
father+0.037
olds+0.037
ople+0.036
Neg logit contributions
 cozy-0.031
 safe-0.029
 shady-0.028
 hiding-0.028
 scary-0.028
Tokenled
Token ID992
Feature activation+1.58e+00

Pos logit contributions
parents+0.108
father+0.102
uns+0.094
ums+0.094
asers+0.092
Neg logit contributions
 safe-0.091
 cozy-0.088
 curled-0.088
 curious-0.087
 a-0.087