TOP ACTIVATIONS
MAX = 10.103

and his mom my went to the park . Johnny saw
day , two kids went to the park . They saw
One day , Lucy went to the playground . She saw
day , two friends went to the river . It was
One day , Jack went to the church . He was
day , Mom my went to the garden and picked a
, Ben and Lily went to the park with their mom
ily and Sam went on a saf ari with their mom
Anna and her mom went to the beach . Anna loved
One day , John went to the mall with his mom
, Lily and Max went to the park with their mom
<|endoftext|> One day John went to visit his grand pa .
, Ben and Lily went to the park with their mom
. She loved going outside to play in the field near
One day , Johnny went to the beach . He took
One day , Sarah went to the store with her mom
One day , Bunny went to a festival . There were
. Every day they went to the park . Today they
Bobby and his mom went to the park . He saw
Zack and his mom went to the island . It was

MIN ACTIVATIONS
MIN = -8.378

friends . Everyone praised her cle vern ess and eventually Jan
kept getting ang rier and ang rier . Finally , Tim
noise . Lily got even ang rier and threw her doll
to make her mom any ang rier , so she stopped
so the fox got even ang rier . "
, but Johnny kept getting ang rier and ang rier .
. Everyone praised her cle vern ess and eventually Jan ie
and listen ," her mom said . " I
, getting ang rier and ang rier . The two friends
for his obedience and cle vern ess . <|endoftext|> Once upon
io was thankful and thanked T ummy before they each continued
praised for his obedience and cle vern ess . <|endoftext|> Once
tr ay again ," Tim says . " You
The giant got ang rier and began to stomp
This made the boy even ang rier , and he began
something ," the bald man says . " OK
I forgive you ," she says . They go
my love ," said M omm a . " Let 's
my best friend ," Tom says . L ily
to yourself and not be mad anymore . Nem o thought

INTERVAL 5.482 - 10.103
CONTAINS 0.876%

a time , in a small village , there was a
Lily . Lily lived in a small pond with her family
named Lily . She loved to explore the garden behind her
he had ever seen . <|endoftext|> Once upon a time ,
. They like to play in the park . Today ,

INTERVAL 0.862 - 5.482
CONTAINS 31.227%

day . <|endoftext|> One day it was raining outside . It
's get you out of that dress and clean up this
love it !" <|endoftext|> Once upon a time , there was
Her mom asked , " What does it say ,
to sail on his boat with his friends . They would

INTERVAL -3.758 - 0.862
CONTAINS 65.284%

day , an elephant saw the rh in oc eros z
. From that day on , they never forgot about the
," suggested Jack . Jill replied , " But what if
Thank you for rescuing my puppy ". The knight smiled and
" What have you done ?" Mom shouted . "

INTERVAL -8.378 - -3.758
CONTAINS 2.613%

!" Grand ma was still a bit worried ,
he was determined to be very naughty . He went to
L ily smiled . She forg ave her brother . She
proud . He had been clever and done something amazing .
The little girl was happy . She now had something
Token and
Token ID290
Feature activation+3.11e+00

Pos logit contributions
 hunt+0.152
 hide+0.139
 library+0.131
 hall+0.130
tail+0.129
Neg logit contributions
ously-0.127
lessly-0.126
der-0.119
lessness-0.108
oke-0.103
Token his
Token ID465
Feature activation+2.40e+00

Pos logit contributions
 hunt+0.075
 hide+0.071
 pretend+0.068
 draw+0.067
 hall+0.067
Neg logit contributions
lessly-0.099
der-0.096
ously-0.088
rier-0.088
lessness-0.087
Token mom
Token ID1995
Feature activation+5.16e+00

Pos logit contributions
 hunt+0.075
 hide+0.074
 pretend+0.072
 draw+0.071
 play+0.070
Neg logit contributions
lessly-0.059
der-0.055
ously-0.052
rier-0.050
lessness-0.049
Tokenmy
Token ID1820
Feature activation+5.04e+00

Pos logit contributions
 hunt+0.125
 hide+0.112
 pretend+0.103
 hall+0.103
c+0.101
Neg logit contributions
lessly-0.152
der-0.143
ously-0.140
lessness-0.137
arl-0.127
Token went
Token ID1816
Feature activation+4.71e+00

Pos logit contributions
 hunt+0.142
 hide+0.126
 hall+0.123
c+0.119
 draw+0.118
Neg logit contributions
lessly-0.126
ously-0.119
der-0.118
lessness-0.111
 understanding-0.100
Token to
Token ID284
Feature activation+8.55e+00

Pos logit contributions
 hunt+0.127
 hide+0.118
c+0.116
 draw+0.115
 hall+0.114
Neg logit contributions
lessly-0.130
der-0.119
ously-0.114
lessness-0.107
arl-0.105
Token the
Token ID262
Feature activation+5.84e+00

Pos logit contributions
c+0.230
 hunt+0.228
asis+0.226
 grounds+0.209
 hall+0.209
Neg logit contributions
 understanding-0.158
lessly-0.138
 Dad-0.129
 much-0.128
 ï¿½-0.128
Token park
Token ID3952
Feature activation+4.35e+00

Pos logit contributions
c+0.072
 hunt+0.071
e+0.070
 hide+0.067
tail+0.060
Neg logit contributions
der-0.198
lessly-0.189
oral-0.189
arl-0.187
rier-0.181
Token.
Token ID13
Feature activation+2.47e+00

Pos logit contributions
 hunt+0.129
 hide+0.119
 draw+0.119
 hall+0.118
 library+0.116
Neg logit contributions
lessly-0.094
der-0.091
ously-0.085
lessness-0.078
 understanding-0.076
Token Johnny
Token ID15470
Feature activation+4.67e+00

Pos logit contributions
 hunt+0.071
 hide+0.067
 hall+0.066
 draw+0.066
 pretend+0.065
Neg logit contributions
lessly-0.066
der-0.063
ously-0.059
rier-0.058
lessness-0.056
Token saw
Token ID2497
Feature activation+3.29e+00

Pos logit contributions
 hunt+0.115
 hide+0.105
 draw+0.105
tail+0.101
 hall+0.100
Neg logit contributions
lessly-0.115
ously-0.110
der-0.107
oke-0.097
lessness-0.096
Token day
Token ID1110
Feature activation+1.49e+00

Pos logit contributions
 hunt+0.071
 hide+0.065
 hall+0.061
 draw+0.061
 pretend+0.060
Neg logit contributions
lessly-0.100
der-0.093
ously-0.089
lessness-0.089
rier-0.084
Token,
Token ID11
Feature activation+3.11e+00

Pos logit contributions
 hunt+0.035
 hide+0.033
 draw+0.031
 hall+0.031
 pretend+0.031
Neg logit contributions
lessly-0.048
der-0.046
ously-0.043
rier-0.043
lessness-0.042
Token two
Token ID734
Feature activation+2.20e+00

Pos logit contributions
 hunt+0.068
 hide+0.062
 hall+0.060
 draw+0.059
 pretend+0.058
Neg logit contributions
lessly-0.105
der-0.099
ously-0.095
lessness-0.092
rier-0.092
Token kids
Token ID3988
Feature activation+4.65e+00

Pos logit contributions
 hide+0.055
 hunt+0.055
 draw+0.051
 hall+0.050
 pretend+0.050
Neg logit contributions
lessly-0.070
der-0.065
rier-0.062
ously-0.059
lessness-0.059
Token went
Token ID1816
Feature activation+4.41e+00

Pos logit contributions
 hunt+0.121
 hide+0.108
 hall+0.105
 draw+0.105
 land+0.104
Neg logit contributions
lessly-0.133
der-0.125
ously-0.124
lessness-0.117
 understanding-0.108
Token to
Token ID284
Feature activation+8.54e+00

Pos logit contributions
 hunt+0.121
c+0.116
 land+0.113
 hide+0.112
 draw+0.111
Neg logit contributions
lessly-0.111
der-0.098
ously-0.094
lessness-0.089
arl-0.088
Token the
Token ID262
Feature activation+5.35e+00

Pos logit contributions
asis+0.222
c+0.214
 hunt+0.202
atmeal+0.188
ci+0.188
Neg logit contributions
 understanding-0.173
lessly-0.156
ously-0.148
 ï¿½-0.147
 sons-0.144
Token park
Token ID3952
Feature activation+4.31e+00

Pos logit contributions
c+0.064
 hunt+0.059
e+0.057
 hide+0.052
tail+0.051
Neg logit contributions
der-0.191
lessly-0.184
oral-0.181
rier-0.172
 understanding-0.171
Token.
Token ID13
Feature activation+3.17e+00

Pos logit contributions
 hunt+0.121
 hide+0.112
 draw+0.110
 hall+0.110
 pretend+0.108
Neg logit contributions
lessly-0.110
der-0.104
ously-0.099
lessness-0.093
rier-0.086
Token They
Token ID1119
Feature activation+4.12e+00

Pos logit contributions
 hunt+0.108
 hide+0.100
c+0.099
 land+0.098
 hall+0.096
Neg logit contributions
lessly-0.064
der-0.060
lessness-0.052
ously-0.051
arl-0.047
Token saw
Token ID2497
Feature activation+3.29e+00

Pos logit contributions
 hunt+0.089
c+0.084
 exhibit+0.077
tail+0.076
asis+0.076
Neg logit contributions
lessly-0.124
ously-0.113
lessness-0.109
der-0.107
 understanding-0.103
TokenOne
Token ID3198
Feature activation+3.19e+00

Pos logit contributions
 hunt+0.199
 draw+0.184
 hide+0.184
 pretend+0.183
 hall+0.179
Neg logit contributions
lessly-0.105
der-0.088
lessness-0.086
ously-0.082
oral-0.072
Token day
Token ID1110
Feature activation+1.35e+00

Pos logit contributions
 hunt+0.077
 hide+0.071
 hall+0.066
 draw+0.066
 pretend+0.066
Neg logit contributions
lessly-0.098
der-0.090
lessness-0.088
ously-0.085
oral-0.081
Token,
Token ID11
Feature activation+3.24e+00

Pos logit contributions
 hunt+0.030
 hide+0.029
 pretend+0.027
 draw+0.027
 hall+0.027
Neg logit contributions
lessly-0.045
der-0.043
ously-0.040
rier-0.040
lessness-0.040
Token Lucy
Token ID22162
Feature activation+5.86e+00

Pos logit contributions
 hunt+0.075
 hide+0.067
 hall+0.066
 draw+0.066
c+0.063
Neg logit contributions
lessly-0.105
der-0.099
ously-0.094
lessness-0.092
rier-0.090
Token went
Token ID1816
Feature activation+4.61e+00

Pos logit contributions
 hunt+0.175
 hall+0.161
 hide+0.156
 library+0.153
 swamp+0.151
Neg logit contributions
ously-0.117
lessly-0.113
der-0.100
lessness-0.097
 understanding-0.093
Token to
Token ID284
Feature activation+8.53e+00

Pos logit contributions
 hunt+0.124
 hide+0.117
c+0.116
 hall+0.115
 draw+0.113
Neg logit contributions
lessly-0.121
der-0.111
ously-0.104
arl-0.099
lessness-0.098
Token the
Token ID262
Feature activation+5.27e+00

Pos logit contributions
asis+0.268
c+0.252
ci+0.248
rob+0.241
 hunt+0.236
Neg logit contributions
 understanding-0.135
 ï¿½-0.101
 dad-0.101
 much-0.100
 Dad-0.096
Token playground
Token ID24817
Feature activation+4.02e+00

Pos logit contributions
 hide+0.064
c+0.064
 hunt+0.064
e+0.062
tail+0.055
Neg logit contributions
der-0.176
oral-0.171
lessly-0.160
arl-0.160
 understanding-0.159
Token.
Token ID13
Feature activation+3.08e+00

Pos logit contributions
 hunt+0.106
 hide+0.096
 pretend+0.094
 draw+0.094
 hall+0.093
Neg logit contributions
lessly-0.111
der-0.107
ously-0.099
lessness-0.094
rier-0.091
Token She
Token ID1375
Feature activation+4.14e+00

Pos logit contributions
 hunt+0.091
 hide+0.082
c+0.079
 draw+0.079
 land+0.079
Neg logit contributions
lessly-0.081
der-0.075
ously-0.070
lessness-0.069
rier-0.065
Token saw
Token ID2497
Feature activation+3.44e+00

Pos logit contributions
 hunt+0.102
c+0.091
tail+0.091
 exhibit+0.087
 hide+0.084
Neg logit contributions
lessly-0.108
ously-0.098
der-0.093
lessness-0.092
 understanding-0.086
Token day
Token ID1110
Feature activation+9.94e-01

Pos logit contributions
 hunt+0.067
 hide+0.061
 hall+0.057
 pretend+0.057
 draw+0.056
Neg logit contributions
lessly-0.082
der-0.075
lessness-0.074
ously-0.071
 understanding-0.068
Token,
Token ID11
Feature activation+2.99e+00

Pos logit contributions
 hunt+0.024
 hide+0.023
 hall+0.022
 draw+0.022
 pretend+0.021
Neg logit contributions
lessly-0.031
der-0.029
ously-0.028
lessness-0.027
rier-0.027
Token two
Token ID734
Feature activation+1.77e+00

Pos logit contributions
 hunt+0.069
 hide+0.061
 hall+0.060
 draw+0.060
c+0.058
Neg logit contributions
lessly-0.098
der-0.091
ously-0.088
lessness-0.085
rier-0.082
Token friends
Token ID2460
Feature activation+4.19e+00

Pos logit contributions
 hunt+0.045
 hide+0.045
 draw+0.042
 hall+0.041
 pretend+0.041
Neg logit contributions
lessly-0.055
der-0.052
rier-0.050
lessness-0.047
ously-0.047
Token went
Token ID1816
Feature activation+4.11e+00

Pos logit contributions
 hunt+0.107
 hide+0.097
 hall+0.095
 draw+0.094
 land+0.093
Neg logit contributions
lessly-0.122
der-0.116
ously-0.113
lessness-0.107
rier-0.102
Token to
Token ID284
Feature activation+8.49e+00

Pos logit contributions
 hunt+0.104
c+0.100
 land+0.097
 hide+0.096
 draw+0.095
Neg logit contributions
lessly-0.112
der-0.100
ously-0.097
arl-0.092
lessness-0.091
Token the
Token ID262
Feature activation+5.14e+00

Pos logit contributions
asis+0.250
c+0.227
atmeal+0.215
 hunt+0.210
ci+0.208
Neg logit contributions
 understanding-0.142
 ï¿½-0.126
lessly-0.117
 sons-0.116
 much-0.116
Token river
Token ID7850
Feature activation+4.38e+00

Pos logit contributions
e+0.072
c+0.071
tail+0.061
 hunt+0.058
 hide+0.057
Neg logit contributions
der-0.167
oral-0.157
lessly-0.151
Days-0.145
arl-0.145
Token.
Token ID13
Feature activation+2.61e+00

Pos logit contributions
 hunt+0.114
 hide+0.107
 draw+0.104
 hall+0.103
 pretend+0.102
Neg logit contributions
lessly-0.125
der-0.119
ously-0.113
lessness-0.108
rier-0.106
Token It
Token ID632
Feature activation+3.29e+00

Pos logit contributions
 hunt+0.094
c+0.088
 hide+0.087
 land+0.086
 hall+0.084
Neg logit contributions
lessly-0.043
der-0.041
lessness-0.034
ously-0.034
arl-0.031
Token was
Token ID373
Feature activation+1.24e+00

Pos logit contributions
 hunt+0.081
 library+0.075
 hide+0.074
 draw+0.074
 hall+0.073
Neg logit contributions
lessly-0.094
der-0.087
lessness-0.081
ously-0.081
rier-0.073
TokenOne
Token ID3198
Feature activation+3.10e+00

Pos logit contributions
 hunt+0.205
 hide+0.191
 draw+0.188
 pretend+0.188
 hall+0.181
Neg logit contributions
lessly-0.122
der-0.103
lessness-0.101
ously-0.098
oral-0.086
Token day
Token ID1110
Feature activation+1.42e+00

Pos logit contributions
 hunt+0.072
 hide+0.066
 hall+0.062
 draw+0.062
 pretend+0.062
Neg logit contributions
lessly-0.099
der-0.092
lessness-0.088
ously-0.087
rier-0.083
Token,
Token ID11
Feature activation+3.41e+00

Pos logit contributions
 hunt+0.031
 hide+0.030
 pretend+0.027
 draw+0.027
 hall+0.027
Neg logit contributions
lessly-0.048
der-0.046
rier-0.044
ously-0.043
lessness-0.043
Token Jack
Token ID3619
Feature activation+5.50e+00

Pos logit contributions
 hunt+0.081
 hide+0.071
 hall+0.070
 draw+0.070
c+0.068
Neg logit contributions
lessly-0.110
der-0.102
ously-0.098
lessness-0.096
rier-0.092
Token went
Token ID1816
Feature activation+4.53e+00

Pos logit contributions
 hunt+0.147
 hall+0.138
tail+0.133
 hide+0.133
 library+0.133
Neg logit contributions
ously-0.122
lessly-0.114
der-0.102
lessness-0.100
 understanding-0.097
Token to
Token ID284
Feature activation+8.49e+00

Pos logit contributions
 hunt+0.113
c+0.106
 hide+0.105
 hall+0.103
 draw+0.102
Neg logit contributions
lessly-0.128
der-0.117
ously-0.112
arl-0.106
lessness-0.105
Token the
Token ID262
Feature activation+5.51e+00

Pos logit contributions
asis+0.247
c+0.238
ci+0.226
 grounds+0.226
 hunt+0.218
Neg logit contributions
 understanding-0.138
 ï¿½-0.113
 Dad-0.113
 dad-0.112
 much-0.107
Token church
Token ID4928
Feature activation+3.60e+00

Pos logit contributions
e+0.070
c+0.070
 hunt+0.068
 hide+0.067
 counts+0.060
Neg logit contributions
der-0.179
oral-0.170
 understanding-0.161
lessly-0.160
arl-0.160
Token.
Token ID13
Feature activation+2.77e+00

Pos logit contributions
 hunt+0.099
 hide+0.093
 draw+0.089
 pretend+0.088
 hall+0.087
Neg logit contributions
lessly-0.089
der-0.084
ously-0.076
lessness-0.075
rier-0.073
Token He
Token ID679
Feature activation+3.39e+00

Pos logit contributions
 hunt+0.102
 hide+0.097
c+0.095
 hall+0.094
 land+0.093
Neg logit contributions
lessly-0.051
der-0.046
ously-0.041
lessness-0.040
rier-0.037
Token was
Token ID373
Feature activation+1.33e+00

Pos logit contributions
 hunt+0.080
 hide+0.070
tail+0.069
 draw+0.068
 exhibit+0.068
Neg logit contributions
lessly-0.083
ously-0.078
 understanding-0.078
der-0.077
lessness-0.073
Token day
Token ID1110
Feature activation+1.56e+00

Pos logit contributions
 hunt+0.077
 hide+0.071
 hall+0.067
 pretend+0.066
 draw+0.066
Neg logit contributions
lessly-0.104
der-0.096
lessness-0.092
ously-0.092
rier-0.086
Token,
Token ID11
Feature activation+3.50e+00

Pos logit contributions
 hunt+0.036
 hide+0.034
 draw+0.032
 hall+0.032
 pretend+0.032
Neg logit contributions
lessly-0.051
der-0.049
ously-0.046
rier-0.045
lessness-0.045
Token Mom
Token ID11254
Feature activation+4.72e+00

Pos logit contributions
 hunt+0.083
 hide+0.073
 hall+0.073
 draw+0.072
 land+0.070
Neg logit contributions
lessly-0.112
der-0.104
ously-0.100
lessness-0.098
rier-0.094
Tokenmy
Token ID1820
Feature activation+4.52e+00

Pos logit contributions
 hunt+0.106
 hide+0.098
 hall+0.090
 pretend+0.090
 land+0.089
Neg logit contributions
lessly-0.151
der-0.142
ously-0.138
lessness-0.135
arl-0.125
Token went
Token ID1816
Feature activation+4.21e+00

Pos logit contributions
 hunt+0.114
 hide+0.106
 hall+0.101
 pretend+0.099
 draw+0.099
Neg logit contributions
lessly-0.129
der-0.120
ously-0.119
lessness-0.113
 understanding-0.104
Token to
Token ID284
Feature activation+8.41e+00

Pos logit contributions
 hunt+0.112
 hide+0.106
 hall+0.101
 draw+0.101
c+0.100
Neg logit contributions
lessly-0.118
der-0.110
ously-0.103
lessness-0.099
rier-0.099
Token the
Token ID262
Feature activation+5.32e+00

Pos logit contributions
 hunt+0.228
asis+0.226
c+0.225
 hall+0.211
ci+0.208
Neg logit contributions
 understanding-0.162
lessly-0.147
der-0.133
ously-0.128
 ï¿½-0.123
Token garden
Token ID11376
Feature activation+5.04e+00

Pos logit contributions
e+0.072
c+0.072
 hunt+0.071
 hide+0.071
 drew+0.060
Neg logit contributions
der-0.173
lessly-0.164
oral-0.164
arl-0.161
 understanding-0.160
Token and
Token ID290
Feature activation+3.82e+00

Pos logit contributions
 hunt+0.142
 hide+0.131
 hall+0.130
 pretend+0.129
 library+0.129
Neg logit contributions
lessly-0.120
der-0.117
ously-0.111
lessness-0.102
 understanding-0.096
Token picked
Token ID6497
Feature activation+4.00e+00

Pos logit contributions
 hunt+0.088
c+0.080
 library+0.077
asis+0.077
tail+0.076
Neg logit contributions
lessly-0.104
 understanding-0.095
ously-0.095
lessness-0.093
der-0.088
Token a
Token ID257
Feature activation+3.88e+00

Pos logit contributions
 hunt+0.118
 hall+0.111
 library+0.105
 draw+0.102
ital+0.100
Neg logit contributions
lessly-0.097
der-0.095
ously-0.082
rier-0.080
lessness-0.075
Token,
Token ID11
Feature activation+3.02e+00

Pos logit contributions
 hunt+0.024
 hide+0.023
 pretend+0.022
 draw+0.021
 hall+0.021
Neg logit contributions
lessly-0.040
der-0.039
rier-0.036
ously-0.036
lessness-0.036
Token Ben
Token ID3932
Feature activation+5.05e+00

Pos logit contributions
 hunt+0.068
 hide+0.060
 hall+0.059
 draw+0.058
c+0.057
Neg logit contributions
lessly-0.100
der-0.094
ously-0.090
lessness-0.088
rier-0.086
Token and
Token ID290
Feature activation+2.90e+00

Pos logit contributions
 hunt+0.155
 hall+0.148
c+0.147
tail+0.144
 library+0.141
Neg logit contributions
lessly-0.095
ously-0.089
der-0.083
lessness-0.082
 understanding-0.076
Token Lily
Token ID20037
Feature activation+6.08e+00

Pos logit contributions
 hunt+0.067
 hide+0.063
 draw+0.060
 pretend+0.059
 hall+0.058
Neg logit contributions
lessly-0.096
der-0.092
ously-0.086
rier-0.085
lessness-0.084
Token went
Token ID1816
Feature activation+4.71e+00

Pos logit contributions
 hunt+0.156
 hall+0.139
 exhibit+0.137
c+0.136
 library+0.133
Neg logit contributions
lessly-0.160
ously-0.145
lessness-0.144
der-0.141
 understanding-0.129
Token to
Token ID284
Feature activation+8.36e+00

Pos logit contributions
 hunt+0.133
c+0.128
 hide+0.125
 hall+0.123
 draw+0.123
Neg logit contributions
lessly-0.117
der-0.105
ously-0.100
lessness-0.096
arl-0.095
Token the
Token ID262
Feature activation+5.56e+00

Pos logit contributions
asis+0.230
c+0.228
 hunt+0.217
 hall+0.208
ci+0.206
Neg logit contributions
 understanding-0.152
lessly-0.143
der-0.130
arl-0.129
 much-0.128
Token park
Token ID3952
Feature activation+3.97e+00

Pos logit contributions
e+0.083
c+0.081
 hunt+0.069
 hide+0.067
 drew+0.066
Neg logit contributions
der-0.176
oral-0.169
arl-0.168
lessly-0.158
 understanding-0.157
Token with
Token ID351
Feature activation+3.97e+00

Pos logit contributions
 hunt+0.103
 hide+0.094
 hall+0.093
 draw+0.092
 pretend+0.091
Neg logit contributions
lessly-0.109
der-0.103
ously-0.098
lessness-0.094
rier-0.090
Token their
Token ID511
Feature activation+2.74e+00

Pos logit contributions
 hunt+0.118
 hide+0.108
 hall+0.107
c+0.104
 draw+0.102
Neg logit contributions
lessly-0.096
der-0.089
ously-0.084
lessness-0.082
rier-0.078
Token mom
Token ID1995
Feature activation+3.51e+00

Pos logit contributions
 hunt+0.087
 hide+0.082
 draw+0.079
 hall+0.079
 pretend+0.078
Neg logit contributions
lessly-0.065
der-0.061
ously-0.057
lessness-0.055
rier-0.054
Tokenily
Token ID813
Feature activation+6.24e+00

Pos logit contributions
 hunt+0.061
 hide+0.057
 hall+0.055
 pretend+0.054
 draw+0.054
Neg logit contributions
lessly-0.058
der-0.054
ously-0.052
lessness-0.049
rier-0.048
Token and
Token ID290
Feature activation+4.99e+00

Pos logit contributions
 hunt+0.193
 hide+0.181
 hall+0.176
 pretend+0.173
 draw+0.171
Neg logit contributions
lessly-0.143
der-0.138
ously-0.133
lessness-0.120
 understanding-0.109
Token Sam
Token ID3409
Feature activation+5.74e+00

Pos logit contributions
 hunt+0.129
 hall+0.117
 exhibit+0.110
 library+0.108
c+0.108
Neg logit contributions
lessly-0.135
ously-0.126
 understanding-0.123
der-0.122
lessness-0.117
Token went
Token ID1816
Feature activation+5.43e+00

Pos logit contributions
 hunt+0.155
 hide+0.142
 hall+0.140
 draw+0.133
 pretend+0.133
Neg logit contributions
lessly-0.158
der-0.151
ously-0.147
lessness-0.139
rier-0.126
Token on
Token ID319
Feature activation+6.25e+00

Pos logit contributions
 hunt+0.145
 hide+0.136
 hall+0.131
 draw+0.130
 pretend+0.129
Neg logit contributions
lessly-0.153
der-0.143
ously-0.135
lessness-0.129
rier-0.126
Token a
Token ID257
Feature activation+8.35e+00

Pos logit contributions
 hunt+0.175
 draw+0.161
 hall+0.159
 hide+0.156
c+0.156
Neg logit contributions
der-0.138
lessly-0.136
ously-0.124
 badly-0.118
 understanding-0.116
Token saf
Token ID1932
Feature activation+3.72e+00

Pos logit contributions
 hunt+0.199
 hide+0.196
 draw+0.185
c+0.178
 drew+0.177
Neg logit contributions
lessly-0.205
der-0.185
oral-0.180
ously-0.177
arl-0.173
Tokenari
Token ID2743
Feature activation+5.10e+00

Pos logit contributions
 hunt+0.106
 hide+0.100
 hall+0.096
 draw+0.096
 pretend+0.095
Neg logit contributions
lessly-0.101
der-0.095
ously-0.089
lessness-0.086
rier-0.086
Token with
Token ID351
Feature activation+3.76e+00

Pos logit contributions
 hunt+0.134
 hide+0.129
 draw+0.126
 pretend+0.122
 hall+0.122
Neg logit contributions
lessly-0.133
der-0.123
ously-0.119
lessness-0.113
rier-0.112
Token their
Token ID511
Feature activation+3.20e+00

Pos logit contributions
 hunt+0.088
 hide+0.085
 pretend+0.080
 draw+0.079
 land+0.078
Neg logit contributions
lessly-0.123
der-0.121
ously-0.111
rier-0.109
lessness-0.108
Token mom
Token ID1995
Feature activation+3.56e+00

Pos logit contributions
 hunt+0.105
 hide+0.101
 hall+0.097
 pretend+0.097
 draw+0.096
Neg logit contributions
lessly-0.076
der-0.071
ously-0.065
rier-0.063
lessness-0.062
Token Anna
Token ID11735
Feature activation+5.41e+00

Pos logit contributions
 hunt+0.072
 hide+0.065
 hall+0.063
 draw+0.062
 pretend+0.060
Neg logit contributions
lessly-0.110
der-0.104
ously-0.099
lessness-0.097
rier-0.096
Token and
Token ID290
Feature activation+2.97e+00

Pos logit contributions
 hunt+0.177
 hall+0.165
 fro+0.163
 exhibit+0.163
c+0.163
Neg logit contributions
lessly-0.077
ously-0.075
der-0.069
lessness-0.065
 understanding-0.059
Token her
Token ID607
Feature activation+2.31e+00

Pos logit contributions
 hunt+0.066
 hall+0.059
 hide+0.058
c+0.057
 draw+0.055
Neg logit contributions
lessly-0.095
der-0.091
ously-0.088
lessness-0.084
rier-0.081
Token mom
Token ID1995
Feature activation+5.19e+00

Pos logit contributions
 hunt+0.075
 hide+0.071
 draw+0.069
 hall+0.069
 pretend+0.068
Neg logit contributions
lessly-0.054
der-0.050
ously-0.046
lessness-0.044
rier-0.044
Token went
Token ID1816
Feature activation+4.46e+00

Pos logit contributions
 hunt+0.133
 hide+0.120
c+0.117
 hall+0.114
 pretend+0.112
Neg logit contributions
lessly-0.138
der-0.130
ously-0.125
lessness-0.123
arl-0.119
Token to
Token ID284
Feature activation+8.34e+00

Pos logit contributions
 hunt+0.131
c+0.125
 hide+0.124
 draw+0.121
 hall+0.120
Neg logit contributions
lessly-0.105
der-0.094
ously-0.089
arl-0.084
lessness-0.084
Token the
Token ID262
Feature activation+5.52e+00

Pos logit contributions
asis+0.244
 hunt+0.229
c+0.228
 hall+0.214
nob+0.214
Neg logit contributions
 understanding-0.140
lessly-0.121
 much-0.117
 Dad-0.116
 dad-0.115
Token beach
Token ID10481
Feature activation+4.40e+00

Pos logit contributions
e+0.081
c+0.077
 hide+0.066
 hunt+0.065
 drew+0.064
Neg logit contributions
der-0.176
oral-0.171
arl-0.166
lessly-0.160
rier-0.157
Token.
Token ID13
Feature activation+3.06e+00

Pos logit contributions
 hunt+0.113
 hide+0.108
 draw+0.105
 hall+0.105
 library+0.104
Neg logit contributions
lessly-0.109
der-0.105
ously-0.099
lessness-0.095
rier-0.094
Token Anna
Token ID11735
Feature activation+4.04e+00

Pos logit contributions
 hunt+0.101
c+0.092
 hide+0.092
 draw+0.089
 land+0.089
Neg logit contributions
lessly-0.066
der-0.062
ously-0.055
lessness-0.054
rier-0.051
Token loved
Token ID6151
Feature activation+2.99e+00

Pos logit contributions
 hunt+0.095
tail+0.092
 hall+0.086
c+0.086
 restaurants+0.084
Neg logit contributions
lessly-0.098
ously-0.091
der-0.090
lessness-0.087
oke-0.082
TokenOne
Token ID3198
Feature activation+4.18e+00

Pos logit contributions
 hunt+0.205
 hide+0.184
 pretend+0.183
 draw+0.182
 imagine+0.179
Neg logit contributions
lessly-0.121
lessness-0.104
der-0.100
arl-0.097
ously-0.096
Token day
Token ID1110
Feature activation+2.32e+00

Pos logit contributions
 hunt+0.106
 hide+0.096
 draw+0.089
 hall+0.088
 pretend+0.088
Neg logit contributions
lessly-0.126
der-0.114
lessness-0.112
ously-0.108
oral-0.102
Token,
Token ID11
Feature activation+3.85e+00

Pos logit contributions
 hunt+0.053
 hide+0.051
 hall+0.048
 draw+0.048
 pretend+0.047
Neg logit contributions
lessly-0.075
der-0.072
ously-0.067
rier-0.067
lessness-0.066
Token John
Token ID1757
Feature activation+5.71e+00

Pos logit contributions
 hunt+0.080
 hide+0.072
 draw+0.068
 hall+0.068
 pretend+0.066
Neg logit contributions
lessly-0.136
der-0.129
ously-0.123
lessness-0.120
rier-0.120
Token went
Token ID1816
Feature activation+4.92e+00

Pos logit contributions
 hunt+0.154
 hide+0.144
 hall+0.140
 library+0.132
 draw+0.129
Neg logit contributions
lessly-0.140
ously-0.138
der-0.126
lessness-0.120
 understanding-0.113
Token to
Token ID284
Feature activation+8.29e+00

Pos logit contributions
 hunt+0.130
 hide+0.120
c+0.120
 hall+0.118
 draw+0.117
Neg logit contributions
lessly-0.138
der-0.126
ously-0.121
lessness-0.113
arl-0.112
Token the
Token ID262
Feature activation+5.44e+00

Pos logit contributions
c+0.231
asis+0.219
 hunt+0.219
ci+0.218
 hall+0.205
Neg logit contributions
 understanding-0.150
lessly-0.135
der-0.122
 much-0.120
ously-0.118
Token mall
Token ID17374
Feature activation+4.13e+00

Pos logit contributions
 hunt+0.067
 hide+0.062
c+0.061
e+0.054
 drew+0.052
Neg logit contributions
der-0.199
lessly-0.191
rier-0.189
arl-0.186
oral-0.183
Token with
Token ID351
Feature activation+4.19e+00

Pos logit contributions
 hunt+0.096
 hide+0.089
 draw+0.085
 pretend+0.084
 hall+0.083
Neg logit contributions
lessly-0.130
der-0.124
ously-0.116
lessness-0.114
rier-0.113
Token his
Token ID465
Feature activation+3.55e+00

Pos logit contributions
 hunt+0.117
 hide+0.109
 hall+0.105
 draw+0.104
c+0.103
Neg logit contributions
lessly-0.115
der-0.107
ously-0.101
lessness-0.099
rier-0.098
Token mom
Token ID1995
Feature activation+4.17e+00

Pos logit contributions
 hunt+0.114
 hide+0.111
 hall+0.108
 pretend+0.108
 draw+0.106
Neg logit contributions
lessly-0.088
der-0.082
ously-0.076
rier-0.072
lessness-0.071
Token,
Token ID11
Feature activation+3.15e+00

Pos logit contributions
 hunt+0.023
 hide+0.022
 pretend+0.020
 draw+0.020
 hall+0.020
Neg logit contributions
lessly-0.036
der-0.034
rier-0.032
ously-0.032
lessness-0.032
Token Lily
Token ID20037
Feature activation+5.69e+00

Pos logit contributions
 hunt+0.065
 hide+0.060
 draw+0.056
 hall+0.056
 pretend+0.056
Neg logit contributions
lessly-0.110
der-0.105
ously-0.100
rier-0.098
lessness-0.098
Token and
Token ID290
Feature activation+2.73e+00

Pos logit contributions
 hunt+0.176
tail+0.161
 hall+0.157
c+0.155
 hide+0.155
Neg logit contributions
ously-0.101
lessly-0.095
der-0.086
lessness-0.083
 understanding-0.080
Token Max
Token ID5436
Feature activation+5.52e+00

Pos logit contributions
 hunt+0.055
 hide+0.050
 hall+0.048
 draw+0.047
 pretend+0.046
Neg logit contributions
lessly-0.096
der-0.091
ously-0.088
lessness-0.085
rier-0.084
Token went
Token ID1816
Feature activation+4.39e+00

Pos logit contributions
 hunt+0.150
 hall+0.139
c+0.135
 hide+0.130
 draw+0.129
Neg logit contributions
lessly-0.126
ously-0.124
der-0.119
lessness-0.113
 understanding-0.104
Token to
Token ID284
Feature activation+8.29e+00

Pos logit contributions
 hunt+0.128
c+0.124
 hide+0.120
 draw+0.118
 hall+0.118
Neg logit contributions
lessly-0.105
der-0.094
ously-0.089
lessness-0.085
arl-0.084
Token the
Token ID262
Feature activation+5.67e+00

Pos logit contributions
asis+0.221
c+0.217
 hunt+0.208
ci+0.200
 hall+0.197
Neg logit contributions
 understanding-0.156
lessly-0.152
der-0.137
 much-0.133
arl-0.131
Token park
Token ID3952
Feature activation+4.19e+00

Pos logit contributions
e+0.084
c+0.082
 drew+0.068
 hunt+0.067
 hide+0.065
Neg logit contributions
der-0.178
oral-0.169
arl-0.165
omm-0.153
lessly-0.152
Token with
Token ID351
Feature activation+3.92e+00

Pos logit contributions
 hunt+0.104
 hide+0.095
 draw+0.094
 hall+0.094
c+0.092
Neg logit contributions
lessly-0.116
der-0.110
ously-0.106
lessness-0.101
rier-0.096
Token their
Token ID511
Feature activation+2.71e+00

Pos logit contributions
 hunt+0.104
 hide+0.095
 hall+0.093
c+0.091
 land+0.090
Neg logit contributions
lessly-0.109
der-0.102
ously-0.097
lessness-0.095
rier-0.091
Token mom
Token ID1995
Feature activation+3.32e+00

Pos logit contributions
 hunt+0.089
 hide+0.084
c+0.081
 draw+0.081
 hall+0.080
Neg logit contributions
lessly-0.056
der-0.053
ously-0.049
lessness-0.047
rier-0.046
Token<|endoftext|>
Token ID50256
Feature activation+6.34e+00

Pos logit contributions
 hunt+0.076
 hide+0.071
 draw+0.068
 hall+0.067
 pretend+0.067
Neg logit contributions
lessly-0.068
der-0.064
ously-0.059
lessness-0.058
rier-0.056
TokenOne
Token ID3198
Feature activation+4.27e+00

Pos logit contributions
 hunt+0.227
 hide+0.206
 pretend+0.205
 draw+0.205
 imagine+0.199
Neg logit contributions
lessly-0.116
lessness-0.097
der-0.094
ously-0.090
arl-0.087
Token day
Token ID1110
Feature activation+2.49e+00

Pos logit contributions
 hunt+0.100
 hide+0.092
 draw+0.086
 hall+0.086
 pretend+0.085
Neg logit contributions
lessly-0.137
der-0.127
lessness-0.121
ously-0.121
rier-0.114
Token John
Token ID1757
Feature activation+6.08e+00

Pos logit contributions
 hunt+0.054
 hide+0.052
 hall+0.049
 pretend+0.048
 draw+0.048
Neg logit contributions
lessly-0.084
der-0.081
rier-0.075
ously-0.075
lessness-0.074
Token went
Token ID1816
Feature activation+4.65e+00

Pos logit contributions
 hunt+0.161
 hide+0.150
 hall+0.145
 draw+0.138
 library+0.135
Neg logit contributions
lessly-0.154
ously-0.150
der-0.143
lessness-0.132
oke-0.120
Token to
Token ID284
Feature activation+8.27e+00

Pos logit contributions
 hunt+0.118
 hide+0.109
 hall+0.106
 draw+0.106
c+0.106
Neg logit contributions
lessly-0.137
der-0.127
ously-0.120
lessness-0.114
arl-0.111
Token visit
Token ID3187
Feature activation+3.47e+00

Pos logit contributions
c+0.234
 hunt+0.222
ci+0.221
asis+0.219
 drew+0.206
Neg logit contributions
 understanding-0.139
der-0.115
lessly-0.114
 badly-0.108
 much-0.107
Token his
Token ID465
Feature activation+3.95e+00

Pos logit contributions
 hunt+0.100
 hall+0.096
c+0.096
 hide+0.094
 draw+0.091
Neg logit contributions
lessly-0.083
der-0.079
ously-0.072
 understanding-0.070
rier-0.069
Token grand
Token ID4490
Feature activation+3.84e+00

Pos logit contributions
 hunt+0.127
 hide+0.120
c+0.115
 draw+0.115
 hall+0.114
Neg logit contributions
lessly-0.082
der-0.081
rier-0.072
ously-0.071
lessness-0.067
Tokenpa
Token ID8957
Feature activation+4.96e+00

Pos logit contributions
 hide+0.082
 hunt+0.080
c+0.074
 pretend+0.074
 draw+0.073
Neg logit contributions
der-0.108
lessly-0.102
ously-0.102
oral-0.098
rier-0.096
Token.
Token ID13
Feature activation+2.13e+00

Pos logit contributions
 hide+0.115
 hunt+0.115
 hall+0.112
 land+0.106
 swings+0.106
Neg logit contributions
lessly-0.130
der-0.128
ously-0.120
rier-0.113
lessness-0.112
Token,
Token ID11
Feature activation+3.66e+00

Pos logit contributions
 hunt+0.040
 hide+0.039
 pretend+0.036
 hall+0.036
 draw+0.036
Neg logit contributions
lessly-0.068
der-0.066
rier-0.062
ously-0.061
lessness-0.060
Token Ben
Token ID3932
Feature activation+5.14e+00

Pos logit contributions
 hunt+0.073
 hide+0.068
 hall+0.063
 draw+0.063
 pretend+0.062
Neg logit contributions
lessly-0.131
der-0.125
ously-0.119
lessness-0.116
rier-0.116
Token and
Token ID290
Feature activation+2.86e+00

Pos logit contributions
 hunt+0.140
 hall+0.128
tail+0.128
c+0.126
 exhibit+0.125
Neg logit contributions
lessly-0.108
ously-0.106
lessness-0.099
der-0.096
 understanding-0.091
Token Lily
Token ID20037
Feature activation+5.76e+00

Pos logit contributions
 hunt+0.064
 hide+0.063
 pretend+0.059
 draw+0.058
 play+0.058
Neg logit contributions
lessly-0.097
der-0.096
rier-0.088
ously-0.087
lessness-0.085
Token went
Token ID1816
Feature activation+4.86e+00

Pos logit contributions
 hunt+0.156
 hall+0.135
 exhibit+0.135
c+0.133
 library+0.132
Neg logit contributions
lessly-0.151
ously-0.137
lessness-0.137
der-0.135
 understanding-0.120
Token to
Token ID284
Feature activation+8.26e+00

Pos logit contributions
 hunt+0.138
 hide+0.128
c+0.128
 draw+0.127
 hall+0.126
Neg logit contributions
lessly-0.127
der-0.117
ously-0.110
lessness-0.105
arl-0.102
Token the
Token ID262
Feature activation+5.75e+00

Pos logit contributions
c+0.223
asis+0.220
 hunt+0.216
ci+0.208
 hall+0.205
Neg logit contributions
 understanding-0.150
lessly-0.143
arl-0.132
 much-0.130
der-0.127
Token park
Token ID3952
Feature activation+4.44e+00

Pos logit contributions
e+0.080
c+0.078
 hunt+0.071
 hide+0.067
 drew+0.066
Neg logit contributions
der-0.192
arl-0.189
oral-0.183
lessly-0.180
rier-0.179
Token with
Token ID351
Feature activation+4.32e+00

Pos logit contributions
 hunt+0.118
 hide+0.109
 draw+0.108
 hall+0.108
c+0.105
Neg logit contributions
lessly-0.114
der-0.109
ously-0.103
lessness-0.098
 understanding-0.092
Token their
Token ID511
Feature activation+3.19e+00

Pos logit contributions
 hunt+0.133
 hide+0.121
 hall+0.120
c+0.119
 draw+0.115
Neg logit contributions
lessly-0.099
der-0.091
ously-0.086
lessness-0.084
rier-0.081
Token mom
Token ID1995
Feature activation+3.84e+00

Pos logit contributions
 hunt+0.112
 hide+0.106
 draw+0.103
 hall+0.102
c+0.102
Neg logit contributions
lessly-0.062
der-0.057
ously-0.052
lessness-0.051
rier-0.050
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
 hunt+0.105
 hide+0.098
 hall+0.095
 pretend+0.092
 draw+0.092
Neg logit contributions
lessly-0.103
der-0.097
ously-0.092
lessness-0.089
rier-0.083
Token She
Token ID1375
Feature activation+2.64e+00

Pos logit contributions
 hunt+0.044
 hide+0.039
 land+0.038
c+0.037
 pretend+0.037
Neg logit contributions
lessly-0.034
der-0.032
lessness-0.029
ously-0.028
arl-0.027
Token loved
Token ID6151
Feature activation+3.05e+00

Pos logit contributions
 hunt+0.076
 hide+0.068
 exhibit+0.068
c+0.067
 hall+0.066
Neg logit contributions
lessly-0.066
der-0.060
lessness-0.057
ously-0.055
 importantly-0.052
Token going
Token ID1016
Feature activation+3.56e+00

Pos logit contributions
 hunt+0.094
 hide+0.089
c+0.089
 hall+0.085
 pretend+0.084
Neg logit contributions
der-0.062
lessly-0.061
ously-0.053
 understanding-0.053
lessness-0.050
Token outside
Token ID2354
Feature activation+5.48e+00

Pos logit contributions
 hunt+0.105
 hide+0.101
c+0.099
 draw+0.098
 drew+0.097
Neg logit contributions
lessly-0.079
der-0.067
lessness-0.063
ously-0.062
arl-0.061
Token to
Token ID284
Feature activation+8.26e+00

Pos logit contributions
 hunt+0.159
 hide+0.143
 draw+0.141
 hall+0.137
c+0.137
Neg logit contributions
lessly-0.126
der-0.122
ously-0.110
lessness-0.107
rier-0.100
Token play
Token ID711
Feature activation+4.45e+00

Pos logit contributions
 drew+0.125
ci+0.124
c+0.124
house+0.123
e+0.121
Neg logit contributions
 understanding-0.222
lessly-0.211
arl-0.201
der-0.199
 much-0.198
Token in
Token ID287
Feature activation+6.74e+00

Pos logit contributions
 library+0.103
 hunt+0.102
 exhibit+0.099
c+0.099
 draws+0.097
Neg logit contributions
der-0.104
lessly-0.103
 badly-0.087
ously-0.086
He-0.084
Token the
Token ID262
Feature activation+5.58e+00

Pos logit contributions
 hunt+0.198
 met+0.181
 drew+0.179
asis+0.178
ci+0.177
Neg logit contributions
 understanding-0.117
lessly-0.109
 much-0.104
ful-0.097
He-0.095
Token field
Token ID2214
Feature activation+4.08e+00

Pos logit contributions
 hunt+0.130
 hide+0.111
e+0.106
 drew+0.102
ci+0.100
Neg logit contributions
lessly-0.159
der-0.146
arl-0.145
He-0.141
 much-0.136
Token near
Token ID1474
Feature activation+4.54e+00

Pos logit contributions
 hunt+0.113
 hide+0.101
 draw+0.100
 pretend+0.100
 hall+0.100
Neg logit contributions
lessly-0.100
der-0.092
lessness-0.084
ously-0.084
rier-0.080
TokenOne
Token ID3198
Feature activation+3.00e+00

Pos logit contributions
 hunt+0.166
 hide+0.152
 draw+0.151
 pretend+0.149
 imagine+0.142
Neg logit contributions
lessly-0.137
der-0.121
lessness-0.119
ously-0.116
arl-0.104
Token day
Token ID1110
Feature activation+1.21e+00

Pos logit contributions
 hunt+0.072
 hide+0.066
 hall+0.062
 pretend+0.061
 draw+0.061
Neg logit contributions
lessly-0.094
der-0.086
lessness-0.084
ously-0.082
 understanding-0.077
Token,
Token ID11
Feature activation+3.19e+00

Pos logit contributions
 hunt+0.027
 hide+0.026
 pretend+0.024
 draw+0.024
 hall+0.024
Neg logit contributions
lessly-0.040
der-0.038
rier-0.035
ously-0.035
lessness-0.035
Token Johnny
Token ID15470
Feature activation+5.84e+00

Pos logit contributions
 hunt+0.071
 hide+0.065
 hall+0.062
 draw+0.062
 pretend+0.060
Neg logit contributions
lessly-0.106
der-0.100
ously-0.096
lessness-0.093
rier-0.092
Token went
Token ID1816
Feature activation+4.27e+00

Pos logit contributions
 hunt+0.159
 hide+0.145
 hall+0.141
 library+0.141
tail+0.140
Neg logit contributions
ously-0.137
lessly-0.135
der-0.125
lessness-0.116
 understanding-0.112
Token to
Token ID284
Feature activation+8.26e+00

Pos logit contributions
 hunt+0.094
c+0.088
 hide+0.086
 hall+0.084
 land+0.083
Neg logit contributions
lessly-0.133
der-0.122
ously-0.118
arl-0.112
lessness-0.112
Token the
Token ID262
Feature activation+5.24e+00

Pos logit contributions
asis+0.226
c+0.222
 hunt+0.209
ci+0.203
 grounds+0.200
Neg logit contributions
 understanding-0.157
lessly-0.134
 ï¿½-0.127
He-0.125
His-0.124
Token beach
Token ID10481
Feature activation+4.18e+00

Pos logit contributions
c+0.068
 hunt+0.064
e+0.064
 hide+0.062
 drew+0.055
Neg logit contributions
der-0.173
oral-0.162
lessly-0.157
 understanding-0.156
arl-0.156
Token.
Token ID13
Feature activation+2.98e+00

Pos logit contributions
 hunt+0.104
 hide+0.098
 draw+0.096
 hall+0.096
c+0.096
Neg logit contributions
lessly-0.109
der-0.105
ously-0.099
lessness-0.093
rier-0.091
Token He
Token ID679
Feature activation+3.81e+00

Pos logit contributions
 hunt+0.101
 hide+0.097
 draw+0.094
 hall+0.094
 pretend+0.093
Neg logit contributions
lessly-0.064
der-0.060
ously-0.055
rier-0.053
lessness-0.052
Token took
Token ID1718
Feature activation+3.01e+00

Pos logit contributions
 hunt+0.079
tail+0.076
c+0.071
 library+0.070
 hall+0.068
Neg logit contributions
lessly-0.113
ously-0.103
der-0.099
lessness-0.098
 understanding-0.093
TokenOne
Token ID3198
Feature activation+3.06e+00

Pos logit contributions
 hunt+0.151
 hide+0.138
 draw+0.136
 pretend+0.135
 hall+0.128
Neg logit contributions
lessly-0.144
der-0.133
lessness-0.127
ously-0.125
oral-0.115
Token day
Token ID1110
Feature activation+1.31e+00

Pos logit contributions
 hunt+0.073
 hide+0.067
 hall+0.063
 pretend+0.062
 draw+0.062
Neg logit contributions
lessly-0.095
der-0.088
lessness-0.085
ously-0.084
 understanding-0.079
Token,
Token ID11
Feature activation+3.38e+00

Pos logit contributions
 hunt+0.029
 hide+0.028
 pretend+0.026
 draw+0.026
 hall+0.026
Neg logit contributions
lessly-0.043
der-0.041
ously-0.039
rier-0.039
lessness-0.038
Token Sarah
Token ID10490
Feature activation+5.77e+00

Pos logit contributions
 hunt+0.078
 hide+0.069
 hall+0.069
 draw+0.068
 land+0.066
Neg logit contributions
lessly-0.110
der-0.103
ously-0.098
lessness-0.096
rier-0.093
Token went
Token ID1816
Feature activation+4.24e+00

Pos logit contributions
 hunt+0.175
 hide+0.154
 fro+0.151
 hall+0.150
 exhibit+0.147
Neg logit contributions
ously-0.120
lessly-0.112
der-0.109
lessness-0.095
 understanding-0.088
Token to
Token ID284
Feature activation+8.26e+00

Pos logit contributions
 hunt+0.107
c+0.101
 hide+0.099
 land+0.096
 pretend+0.096
Neg logit contributions
lessly-0.118
der-0.106
ously-0.102
arl-0.098
lessness-0.097
Token the
Token ID262
Feature activation+5.01e+00

Pos logit contributions
asis+0.262
c+0.239
 hunt+0.231
ci+0.231
at+0.223
Neg logit contributions
 understanding-0.141
 ï¿½-0.111
 much-0.107
lessly-0.104
 dad-0.100
Token store
Token ID3650
Feature activation+3.03e+00

Pos logit contributions
c+0.066
 hunt+0.063
e+0.061
 hide+0.059
 drew+0.054
Neg logit contributions
der-0.170
oral-0.158
arl-0.152
lessly-0.152
 understanding-0.151
Token with
Token ID351
Feature activation+4.47e+00

Pos logit contributions
 hunt+0.063
 hide+0.058
 hall+0.055
 draw+0.055
 pretend+0.054
Neg logit contributions
lessly-0.107
der-0.102
ously-0.096
lessness-0.095
rier-0.094
Token her
Token ID607
Feature activation+3.25e+00

Pos logit contributions
 hunt+0.094
 hide+0.084
 hall+0.082
c+0.080
 land+0.079
Neg logit contributions
lessly-0.147
der-0.138
ously-0.134
lessness-0.131
rier-0.127
Token mom
Token ID1995
Feature activation+3.25e+00

Pos logit contributions
 hunt+0.096
 hide+0.093
 pretend+0.091
 hall+0.090
 draw+0.090
Neg logit contributions
lessly-0.091
der-0.086
ously-0.078
lessness-0.076
rier-0.075
TokenOne
Token ID3198
Feature activation+3.32e+00

Pos logit contributions
 hunt+0.195
 draw+0.181
 hide+0.181
 pretend+0.177
 hall+0.174
Neg logit contributions
lessly-0.114
der-0.099
ously-0.093
lessness-0.093
oral-0.081
Token day
Token ID1110
Feature activation+1.37e+00

Pos logit contributions
 hunt+0.078
 hide+0.072
 hall+0.068
 draw+0.068
 pretend+0.067
Neg logit contributions
lessly-0.105
der-0.097
lessness-0.093
ously-0.092
rier-0.086
Token,
Token ID11
Feature activation+3.49e+00

Pos logit contributions
 hunt+0.028
 hide+0.027
 pretend+0.025
 draw+0.025
 hall+0.024
Neg logit contributions
lessly-0.048
der-0.046
rier-0.044
lessness-0.043
ously-0.043
Token Bunny
Token ID28683
Feature activation+6.05e+00

Pos logit contributions
 hunt+0.070
 hide+0.066
 draw+0.061
 pretend+0.061
 hall+0.061
Neg logit contributions
lessly-0.124
der-0.119
ously-0.113
rier-0.111
lessness-0.110
Token went
Token ID1816
Feature activation+4.33e+00

Pos logit contributions
 hunt+0.164
 library+0.155
 hall+0.153
 swings+0.149
 draw+0.149
Neg logit contributions
lessly-0.137
ously-0.135
lessness-0.119
der-0.116
 understanding-0.109
Token to
Token ID284
Feature activation+8.26e+00

Pos logit contributions
 hunt+0.111
c+0.104
 hide+0.102
 hall+0.101
 draw+0.101
Neg logit contributions
lessly-0.121
der-0.111
ously-0.105
lessness-0.099
arl-0.099
Token a
Token ID257
Feature activation+5.84e+00

Pos logit contributions
asis+0.229
ci+0.226
 grounds+0.219
c+0.216
 hunt+0.213
Neg logit contributions
 understanding-0.130
 much-0.116
lessly-0.113
 badly-0.110
He-0.109
Token festival
Token ID10876
Feature activation+3.24e+00

Pos logit contributions
 hunt+0.140
c+0.136
e+0.136
 hide+0.135
 drew+0.133
Neg logit contributions
lessly-0.135
der-0.131
oral-0.129
 understanding-0.123
arl-0.120
Token.
Token ID13
Feature activation+2.39e+00

Pos logit contributions
 hunt+0.086
c+0.083
 draw+0.082
 hide+0.081
 pretend+0.079
Neg logit contributions
lessly-0.075
der-0.069
ously-0.064
lessness-0.064
 understanding-0.060
Token There
Token ID1318
Feature activation+3.93e+00

Pos logit contributions
 hunt+0.069
 hide+0.066
 pretend+0.062
 draw+0.062
 hall+0.062
Neg logit contributions
lessly-0.064
der-0.061
ously-0.057
rier-0.056
lessness-0.055
Token were
Token ID547
Feature activation+2.17e+00

Pos logit contributions
 hunt+0.109
 draw+0.100
 hall+0.097
 library+0.096
 hide+0.095
Neg logit contributions
lessly-0.092
ously-0.091
lessness-0.085
der-0.085
oral-0.076
Token.
Token ID13
Feature activation+2.34e+00

Pos logit contributions
 hunt+0.018
 hide+0.018
c+0.016
 play+0.016
 pretend+0.016
Neg logit contributions
lessly-0.036
rier-0.033
der-0.033
arl-0.032
oral-0.031
Token Every
Token ID3887
Feature activation+1.66e+00

Pos logit contributions
 hunt+0.070
 hide+0.066
 draw+0.063
 hall+0.063
 pretend+0.063
Neg logit contributions
lessly-0.060
der-0.056
ously-0.052
lessness-0.051
rier-0.050
Token day
Token ID1110
Feature activation-4.48e-01

Pos logit contributions
 hunt+0.036
 hide+0.034
 draw+0.032
 hall+0.031
 pretend+0.031
Neg logit contributions
lessly-0.055
der-0.053
ously-0.050
lessness-0.049
rier-0.048
Token they
Token ID484
Feature activation+3.66e+00

Pos logit contributions
lessly+0.014
der+0.013
ously+0.012
rier+0.012
lessness+0.012
Neg logit contributions
 hunt-0.011
 hide-0.011
 pretend-0.010
 draw-0.010
 hall-0.010
Token went
Token ID1816
Feature activation+4.05e+00

Pos logit contributions
 hunt+0.094
 exhibit+0.089
 hall+0.088
c+0.087
 hide+0.086
Neg logit contributions
lessly-0.088
der-0.083
ously-0.080
lessness-0.078
 understanding-0.069
Token to
Token ID284
Feature activation+8.25e+00

Pos logit contributions
c+0.127
 hunt+0.125
e+0.122
 hide+0.121
 draw+0.120
Neg logit contributions
lessly-0.075
der-0.062
ously-0.060
arl-0.056
lessness-0.054
Token the
Token ID262
Feature activation+4.68e+00

Pos logit contributions
asis+0.249
c+0.235
nob+0.223
 hunt+0.217
orns+0.216
Neg logit contributions
 understanding-0.125
 ï¿½-0.110
 much-0.105
der-0.103
 dad-0.102
Token park
Token ID3952
Feature activation+3.28e+00

Pos logit contributions
e+0.065
 hunt+0.060
c+0.060
 hide+0.055
 drew+0.052
Neg logit contributions
der-0.157
oral-0.151
lessly-0.141
rier-0.137
 understanding-0.137
Token.
Token ID13
Feature activation+2.45e+00

Pos logit contributions
 hunt+0.099
 draw+0.094
 hide+0.093
 hall+0.091
 pretend+0.091
Neg logit contributions
der-0.066
lessly-0.065
ously-0.060
lessness-0.054
 understanding-0.050
Token Today
Token ID6288
Feature activation+2.03e+00

Pos logit contributions
 hunt+0.076
 hide+0.071
 draw+0.068
 land+0.067
 hall+0.067
Neg logit contributions
lessly-0.059
der-0.055
ously-0.050
lessness-0.050
rier-0.045
Token they
Token ID484
Feature activation+2.72e+00

Pos logit contributions
 hunt+0.064
 draw+0.057
 hide+0.057
c+0.055
 hall+0.054
Neg logit contributions
lessly-0.047
der-0.044
ously-0.042
lessness-0.040
rier-0.036
Token Bobby
Token ID17155
Feature activation+5.63e+00

Pos logit contributions
 hunt+0.072
 hide+0.062
 hall+0.062
 draw+0.062
c+0.059
Neg logit contributions
lessly-0.093
der-0.086
ously-0.082
lessness-0.080
rier-0.078
Token and
Token ID290
Feature activation+2.58e+00

Pos logit contributions
 hunt+0.161
 hide+0.146
 hall+0.144
 library+0.144
tail+0.138
Neg logit contributions
lessly-0.130
ously-0.127
der-0.112
lessness-0.112
 understanding-0.102
Token his
Token ID465
Feature activation+1.99e+00

Pos logit contributions
 hunt+0.063
 hide+0.060
 pretend+0.057
 draw+0.057
 hall+0.056
Neg logit contributions
lessly-0.081
der-0.079
rier-0.073
ously-0.072
lessness-0.071
Token mom
Token ID1995
Feature activation+4.68e+00

Pos logit contributions
 hunt+0.063
 hide+0.060
 draw+0.058
 pretend+0.058
 hall+0.057
Neg logit contributions
lessly-0.049
der-0.046
ously-0.043
rier-0.041
lessness-0.041
Token went
Token ID1816
Feature activation+4.44e+00

Pos logit contributions
 hunt+0.107
 hide+0.096
 hall+0.089
 pretend+0.089
 draw+0.086
Neg logit contributions
lessly-0.149
der-0.140
ously-0.137
lessness-0.135
arl-0.125
Token to
Token ID284
Feature activation+8.25e+00

Pos logit contributions
 hunt+0.126
 hide+0.118
c+0.118
 draw+0.115
 hall+0.115
Neg logit contributions
lessly-0.114
der-0.103
ously-0.099
lessness-0.093
arl-0.091
Token the
Token ID262
Feature activation+5.46e+00

Pos logit contributions
asis+0.225
c+0.222
 hunt+0.221
 hall+0.207
 grounds+0.203
Neg logit contributions
 understanding-0.152
lessly-0.135
 much-0.124
 Dad-0.123
 ï¿½-0.121
Token park
Token ID3952
Feature activation+4.15e+00

Pos logit contributions
e+0.074
c+0.073
 hunt+0.068
 hide+0.066
 drew+0.061
Neg logit contributions
der-0.181
oral-0.174
lessly-0.171
arl-0.171
rier-0.165
Token.
Token ID13
Feature activation+2.68e+00

Pos logit contributions
 hunt+0.121
 hide+0.112
 draw+0.112
 hall+0.111
 library+0.109
Neg logit contributions
lessly-0.095
der-0.090
ously-0.086
lessness-0.080
 understanding-0.075
Token He
Token ID679
Feature activation+3.85e+00

Pos logit contributions
 hunt+0.080
 hide+0.076
 pretend+0.073
 hall+0.073
 draw+0.073
Neg logit contributions
lessly-0.069
der-0.066
ously-0.061
rier-0.060
lessness-0.059
Token saw
Token ID2497
Feature activation+3.07e+00

Pos logit contributions
 hunt+0.079
 draw+0.068
tail+0.068
 hide+0.067
 hall+0.067
Neg logit contributions
lessly-0.111
ously-0.104
der-0.097
lessness-0.097
 understanding-0.095
Token Zack
Token ID30718
Feature activation+4.32e+00

Pos logit contributions
 hunt+0.075
 hide+0.067
 hall+0.065
 draw+0.065
 pretend+0.063
Neg logit contributions
lessly-0.112
der-0.105
ously-0.101
lessness-0.099
rier-0.097
Token and
Token ID290
Feature activation+3.05e+00

Pos logit contributions
 hunt+0.121
 hall+0.112
 library+0.109
c+0.109
 hide+0.108
Neg logit contributions
lessly-0.102
ously-0.098
der-0.091
lessness-0.089
 understanding-0.081
Token his
Token ID465
Feature activation+1.95e+00

Pos logit contributions
 hunt+0.076
 hide+0.072
 pretend+0.068
 draw+0.067
 hall+0.067
Neg logit contributions
lessly-0.095
der-0.092
rier-0.085
ously-0.084
lessness-0.083
Token mom
Token ID1995
Feature activation+4.78e+00

Pos logit contributions
 hunt+0.055
 hide+0.053
 draw+0.050
 pretend+0.050
 hall+0.050
Neg logit contributions
lessly-0.054
der-0.050
ously-0.047
rier-0.046
lessness-0.046
Token went
Token ID1816
Feature activation+4.22e+00

Pos logit contributions
 hunt+0.107
 hide+0.098
 hall+0.091
 pretend+0.091
 draw+0.090
Neg logit contributions
lessly-0.155
der-0.146
ously-0.141
lessness-0.138
rier-0.131
Token to
Token ID284
Feature activation+8.25e+00

Pos logit contributions
 hunt+0.119
 hide+0.112
c+0.111
 draw+0.109
 hall+0.109
Neg logit contributions
lessly-0.109
der-0.098
ously-0.094
lessness-0.089
arl-0.087
Token the
Token ID262
Feature activation+5.33e+00

Pos logit contributions
asis+0.226
c+0.223
 hunt+0.219
 hall+0.209
ci+0.203
Neg logit contributions
 understanding-0.151
lessly-0.140
 much-0.125
 ï¿½-0.122
ously-0.122
Token island
Token ID7022
Feature activation+4.19e+00

Pos logit contributions
e+0.073
c+0.072
 hunt+0.065
 hide+0.064
 draw+0.060
Neg logit contributions
der-0.175
oral-0.168
lessly-0.164
arl-0.162
rier-0.156
Token.
Token ID13
Feature activation+2.37e+00

Pos logit contributions
 hunt+0.116
 draw+0.112
 hide+0.112
 pretend+0.109
 hall+0.108
Neg logit contributions
lessly-0.104
der-0.097
ously-0.093
lessness-0.087
rier-0.085
Token It
Token ID632
Feature activation+3.49e+00

Pos logit contributions
 hunt+0.071
 hide+0.067
 hall+0.064
 draw+0.064
 pretend+0.064
Neg logit contributions
lessly-0.061
der-0.057
ously-0.053
rier-0.051
lessness-0.051
Token was
Token ID373
Feature activation+1.09e+00

Pos logit contributions
 hunt+0.096
 draw+0.095
tail+0.091
 library+0.090
 hide+0.089
Neg logit contributions
lessly-0.077
der-0.067
lessness-0.066
ously-0.063
oral-0.057
Token friends
Token ID2460
Feature activation-7.47e-01

Pos logit contributions
lessly+0.035
der+0.033
ously+0.031
lessness+0.030
rier+0.030
Neg logit contributions
 hunt-0.034
 hide-0.032
 hall-0.031
 draw-0.031
 pretend-0.031
Token.
Token ID13
Feature activation-9.74e-01

Pos logit contributions
lessly+0.023
der+0.022
ously+0.021
rier+0.021
lessness+0.020
Neg logit contributions
 hunt-0.018
 hide-0.017
 draw-0.016
 pretend-0.016
 hall-0.016
Token Everyone
Token ID11075
Feature activation-1.82e+00

Pos logit contributions
lessly+0.029
der+0.028
rier+0.027
ously+0.026
lessness+0.025
Neg logit contributions
 hunt-0.024
 hide-0.023
 hall-0.022
 draw-0.022
c-0.022
Token praised
Token ID15342
Feature activation-5.77e+00

Pos logit contributions
lessly+0.061
der+0.058
ously+0.055
lessness+0.054
rier+0.053
Neg logit contributions
 hunt-0.040
 hide-0.037
 hall-0.035
 draw-0.035
 pretend-0.035
Token her
Token ID607
Feature activation-5.52e+00

Pos logit contributions
lessly+0.200
der+0.198
rier+0.185
ously+0.182
lessness+0.179
Neg logit contributions
 hunt-0.110
 hide-0.109
 draw-0.101
 pretend-0.100
 hall-0.097
Token cle
Token ID1190
Feature activation-8.06e+00

Pos logit contributions
lessly+0.153
der+0.144
ously+0.136
rier+0.134
lessness+0.131
Neg logit contributions
 hunt-0.149
 hide-0.144
 draw-0.139
 pretend-0.139
 hall-0.135
Tokenvern
Token ID933
Feature activation-7.78e+00

Pos logit contributions
der+0.190
lessly+0.163
ously+0.149
lessness+0.141
rier+0.138
Neg logit contributions
 hunt-0.253
 hide-0.245
 draw-0.229
 collect-0.227
e-0.225
Tokeness
Token ID408
Feature activation-5.63e+00

Pos logit contributions
lessly+0.228
der+0.227
 importantly+0.223
oral+0.217
ously+0.216
Neg logit contributions
 hunt-0.145
 hide-0.135
 draw-0.129
ci-0.129
c-0.124
Token and
Token ID290
Feature activation-3.44e+00

Pos logit contributions
lessly+0.156
der+0.147
ously+0.138
rier+0.134
lessness+0.134
Neg logit contributions
 hunt-0.154
 hide-0.146
 draw-0.140
 hall-0.139
 pretend-0.138
Token eventually
Token ID4191
Feature activation-1.58e+00

Pos logit contributions
lessly+0.104
der+0.099
rier+0.093
ously+0.092
lessness+0.090
Neg logit contributions
 hunt-0.086
 hide-0.082
 draw-0.078
 pretend-0.077
 hall-0.077
Token Jan
Token ID2365
Feature activation-3.07e+00

Pos logit contributions
lessly+0.049
der+0.046
ously+0.044
lessness+0.043
rier+0.042
Neg logit contributions
 hunt-0.039
 hide-0.036
 hall-0.034
 draw-0.034
 pretend-0.034
Token kept
Token ID4030
Feature activation-6.12e-01

Pos logit contributions
 hunt+0.005
 hide+0.004
 hall+0.004
 draw+0.004
 land+0.004
Neg logit contributions
lessly-0.005
der-0.005
ously-0.005
lessness-0.004
rier-0.004
Token getting
Token ID1972
Feature activation-3.10e+00

Pos logit contributions
lessly+0.019
der+0.018
ously+0.017
rier+0.016
lessness+0.016
Neg logit contributions
 hunt-0.015
 hide-0.014
 hall-0.013
 draw-0.013
 pretend-0.013
Token ang
Token ID3550
Feature activation-7.79e+00

Pos logit contributions
lessly+0.084
der+0.081
ously+0.076
lessness+0.072
rier+0.070
Neg logit contributions
 hunt-0.087
 hide-0.086
 draw-0.082
 pretend-0.081
 land-0.080
Tokenrier
Token ID5277
Feature activation-5.94e+00

Pos logit contributions
lessly+0.092
ously+0.090
oral+0.088
lessness+0.087
 importantly+0.077
Neg logit contributions
 hide-0.292
 hunt-0.282
 pretend-0.274
 land-0.266
 draw-0.265
Token and
Token ID290
Feature activation-1.46e+00

Pos logit contributions
lessly+0.215
oral+0.203
 importantly+0.198
arl+0.193
rier+0.191
Neg logit contributions
 hunt-0.104
 hide-0.096
 pretend-0.087
 land-0.084
 hall-0.079
Token ang
Token ID3550
Feature activation-8.01e+00

Pos logit contributions
lessly+0.038
der+0.036
ously+0.034
lessness+0.033
rier+0.032
Neg logit contributions
 hunt-0.043
 hide-0.041
 draw-0.039
 pretend-0.039
 hall-0.039
Tokenrier
Token ID5277
Feature activation-5.80e+00

Pos logit contributions
ously+0.098
lessly+0.095
 importantly+0.091
lessness+0.090
oral+0.084
Neg logit contributions
 hide-0.292
 hunt-0.283
 pretend-0.280
 land-0.274
 collect-0.260
Token.
Token ID13
Feature activation+2.56e-01

Pos logit contributions
lessly+0.211
 importantly+0.195
rier+0.192
oral+0.190
der+0.186
Neg logit contributions
 hunt-0.099
 hide-0.092
 pretend-0.086
 land-0.080
 play-0.077
Token Finally
Token ID9461
Feature activation-1.58e+00

Pos logit contributions
 hunt+0.006
 hide+0.006
 hall+0.006
 draw+0.006
 pretend+0.006
Neg logit contributions
lessly-0.008
der-0.007
rier-0.007
ously-0.007
lessness-0.007
Token,
Token ID11
Feature activation-7.60e-01

Pos logit contributions
lessly+0.063
der+0.060
rier+0.058
ously+0.055
lessness+0.055
Neg logit contributions
 hide-0.026
 hunt-0.026
 pretend-0.025
 play-0.024
 draw-0.024
Token Tim
Token ID5045
Feature activation+4.92e-01

Pos logit contributions
lessly+0.025
der+0.023
ously+0.022
rier+0.022
lessness+0.022
Neg logit contributions
 hunt-0.018
 hide-0.017
 draw-0.016
 pretend-0.016
 hall-0.016
Token noise
Token ID7838
Feature activation-4.99e+00

Pos logit contributions
lessly+0.070
der+0.067
ously+0.063
rier+0.062
lessness+0.061
Neg logit contributions
 hunt-0.050
 hide-0.046
 hall-0.044
 draw-0.044
 pretend-0.043
Token.
Token ID13
Feature activation-9.22e-01

Pos logit contributions
lessly+0.159
ously+0.152
der+0.150
oral+0.148
arl+0.142
Neg logit contributions
 hide-0.105
 hunt-0.104
 pretend-0.101
 draw-0.099
 play-0.096
Token Lily
Token ID20037
Feature activation-5.43e-01

Pos logit contributions
lessly+0.028
ously+0.027
der+0.026
rier+0.026
lessness+0.026
Neg logit contributions
 hide-0.021
 hunt-0.021
 hall-0.020
 pretend-0.020
 draw-0.020
Token got
Token ID1392
Feature activation-3.04e+00

Pos logit contributions
lessly+0.017
der+0.016
ously+0.015
lessness+0.015
rier+0.015
Neg logit contributions
 hunt-0.013
 hide-0.012
 hall-0.012
 draw-0.012
 pretend-0.011
Token even
Token ID772
Feature activation-2.11e+00

Pos logit contributions
lessly+0.088
der+0.084
ously+0.079
lessness+0.076
rier+0.076
Neg logit contributions
 hunt-0.080
 hide-0.076
 hall-0.072
 draw-0.072
 pretend-0.071
Token ang
Token ID3550
Feature activation-7.96e+00

Pos logit contributions
lessly+0.045
der+0.041
ously+0.038
lessness+0.036
rier+0.036
Neg logit contributions
 hunt-0.072
 hide-0.068
 hall-0.066
 draw-0.066
 land-0.065
Tokenrier
Token ID5277
Feature activation-6.56e+00

Pos logit contributions
oral+0.089
lessly+0.082
lessness+0.081
 importantly+0.081
ously+0.078
Neg logit contributions
 hide-0.300
 pretend-0.284
 hunt-0.283
 land-0.280
 draw-0.272
Token and
Token ID290
Feature activation-2.34e+00

Pos logit contributions
lessly+0.230
oral+0.218
rier+0.211
 importantly+0.209
arl+0.206
Neg logit contributions
 hunt-0.118
 hide-0.112
 pretend-0.101
 hall-0.099
 land-0.095
Token threw
Token ID9617
Feature activation-1.66e-01

Pos logit contributions
lessly+0.063
der+0.060
ously+0.056
rier+0.054
lessness+0.054
Neg logit contributions
 hunt-0.067
 hide-0.063
 hall-0.061
 draw-0.061
 pretend-0.060
Token her
Token ID607
Feature activation-1.62e+00

Pos logit contributions
lessly+0.006
der+0.006
ously+0.005
lessness+0.005
rier+0.005
Neg logit contributions
 hunt-0.003
 hide-0.003
 draw-0.003
 pretend-0.003
 hall-0.003
Token doll
Token ID3654
Feature activation-5.55e-01

Pos logit contributions
lessly+0.057
der+0.054
ously+0.051
lessness+0.051
rier+0.050
Neg logit contributions
 hunt-0.033
 hide-0.031
 hall-0.029
 draw-0.028
 land-0.028
Token to
Token ID284
Feature activation+1.82e+00

Pos logit contributions
lessly+0.020
der+0.019
ously+0.018
rier+0.018
lessness+0.018
Neg logit contributions
 hunt-0.017
 hide-0.016
 draw-0.015
 pretend-0.015
 hall-0.015
Token make
Token ID787
Feature activation-6.12e-01

Pos logit contributions
 hunt+0.040
 hall+0.036
c+0.036
 hide+0.036
 land+0.034
Neg logit contributions
lessly-0.059
der-0.056
ously-0.053
rier-0.051
lessness-0.051
Token her
Token ID607
Feature activation-2.15e+00

Pos logit contributions
lessly+0.020
der+0.019
ously+0.018
rier+0.018
lessness+0.017
Neg logit contributions
 hunt-0.014
 hide-0.012
 hall-0.012
 draw-0.012
 pretend-0.012
Token mom
Token ID1995
Feature activation-4.81e-01

Pos logit contributions
lessly+0.061
der+0.058
ously+0.054
lessness+0.053
rier+0.053
Neg logit contributions
 hunt-0.058
 hide-0.054
 hall-0.052
 draw-0.052
 pretend-0.051
Token any
Token ID597
Feature activation-3.21e+00

Pos logit contributions
lessly+0.012
der+0.011
ously+0.011
lessness+0.010
rier+0.010
Neg logit contributions
 hunt-0.014
 hide-0.014
 hall-0.013
 draw-0.013
 land-0.013
Token ang
Token ID3550
Feature activation-7.82e+00

Pos logit contributions
lessly+0.091
der+0.085
ously+0.081
lessness+0.079
rier+0.077
Neg logit contributions
 hunt-0.088
 hide-0.086
 draw-0.082
 pretend-0.081
 hall-0.079
Tokenrier
Token ID5277
Feature activation-5.10e+00

Pos logit contributions
lessly+0.090
 importantly+0.087
lessness+0.082
ously+0.080
oral+0.076
Neg logit contributions
 hide-0.287
 land-0.276
 hunt-0.271
 pretend-0.269
 collect-0.258
Token,
Token ID11
Feature activation-3.29e+00

Pos logit contributions
lessly+0.174
der+0.160
rier+0.157
 importantly+0.157
lessness+0.154
Neg logit contributions
 hunt-0.101
 hide-0.095
 pretend-0.088
 land-0.084
 hall-0.083
Token so
Token ID523
Feature activation-4.88e+00

Pos logit contributions
lessly+0.100
der+0.097
rier+0.095
oral+0.093
ously+0.092
Neg logit contributions
 hunt-0.070
 hide-0.070
 hall-0.066
 pretend-0.065
 draw-0.065
Token she
Token ID673
Feature activation-8.57e-01

Pos logit contributions
lessly+0.166
der+0.161
ously+0.155
rier+0.154
lessness+0.149
Neg logit contributions
 hide-0.088
 hunt-0.085
 land-0.084
 hall-0.082
 park-0.079
Token stopped
Token ID5025
Feature activation-1.71e+00

Pos logit contributions
lessly+0.027
der+0.026
ously+0.025
lessness+0.024
rier+0.024
Neg logit contributions
 hunt-0.020
 hide-0.019
 draw-0.018
 pretend-0.018
 hall-0.018
Token so
Token ID523
Feature activation-4.35e+00

Pos logit contributions
lessly+0.098
der+0.091
ously+0.088
rier+0.087
lessness+0.086
Neg logit contributions
 hunt-0.079
 hide-0.074
 pretend-0.070
 land-0.070
 hall-0.069
Token the
Token ID262
Feature activation-3.51e+00

Pos logit contributions
lessly+0.148
der+0.138
ously+0.135
lessness+0.131
rier+0.128
Neg logit contributions
 hunt-0.095
 hide-0.089
 land-0.087
 pretend-0.083
 draw-0.081
Token fox
Token ID21831
Feature activation-1.53e+00

Pos logit contributions
lessly+0.111
der+0.106
rier+0.099
ously+0.099
lessness+0.096
Neg logit contributions
 hunt-0.083
 hide-0.078
 hall-0.075
 draw-0.075
c-0.072
Token got
Token ID1392
Feature activation-2.98e+00

Pos logit contributions
lessly+0.047
der+0.045
ously+0.043
lessness+0.040
rier+0.040
Neg logit contributions
 hunt-0.037
 hide-0.035
 hall-0.034
 draw-0.033
 pretend-0.033
Token even
Token ID772
Feature activation-1.86e+00

Pos logit contributions
lessly+0.095
der+0.092
ously+0.086
lessness+0.084
rier+0.082
Neg logit contributions
 hunt-0.071
 hide-0.069
 pretend-0.063
 draw-0.063
 land-0.062
Token ang
Token ID3550
Feature activation-7.79e+00

Pos logit contributions
lessly+0.047
der+0.044
ously+0.041
lessness+0.040
rier+0.040
Neg logit contributions
 hunt-0.056
 hide-0.053
 hall-0.051
 draw-0.051
 pretend-0.050
Tokenrier
Token ID5277
Feature activation-6.16e+00

Pos logit contributions
lessly+0.094
lessness+0.083
ously+0.081
 importantly+0.073
oral+0.070
Neg logit contributions
 hunt-0.312
 hide-0.307
 land-0.304
 pretend-0.290
 collect-0.284
Token.
Token ID13
Feature activation-3.55e-01

Pos logit contributions
lessly+0.218
oral+0.202
 importantly+0.198
arl+0.198
rier+0.197
Neg logit contributions
 hunt-0.116
 hide-0.104
 pretend-0.095
 land-0.094
 hall-0.085
Token
Token ID198
Feature activation-2.73e+00

Pos logit contributions
lessly+0.010
ously+0.009
der+0.009
lessness+0.009
rier+0.009
Neg logit contributions
 hunt-0.010
 hide-0.009
 hall-0.009
 draw-0.008
 pretend-0.008
Token
Token ID198
Feature activation-2.77e+00

Pos logit contributions
lessly+0.087
der+0.085
rier+0.083
ously+0.079
lessness+0.078
Neg logit contributions
 hunt-0.065
 hide-0.061
 hall-0.056
 draw-0.056
 park-0.055
Token"
Token ID1
Feature activation+3.07e-01

Pos logit contributions
lessly+0.065
der+0.062
ously+0.058
lessness+0.057
rier+0.056
Neg logit contributions
 hunt-0.090
 hide-0.085
 pretend-0.081
 draw-0.081
 hall-0.081
Token,
Token ID11
Feature activation-2.64e+00

Pos logit contributions
lessly+0.084
der+0.082
rier+0.078
lessness+0.076
oral+0.076
Neg logit contributions
 hunt-0.041
 hide-0.041
 pretend-0.037
 play-0.036
 draw-0.035
Token but
Token ID475
Feature activation-1.75e+00

Pos logit contributions
lessly+0.087
oral+0.084
der+0.084
rier+0.083
arl+0.083
Neg logit contributions
 hunt-0.054
 hide-0.052
 pretend-0.050
 hall-0.048
 park-0.047
Token Johnny
Token ID15470
Feature activation+1.82e-01

Pos logit contributions
lessly+0.055
der+0.053
rier+0.051
ously+0.050
lessness+0.048
Neg logit contributions
 hunt-0.041
 hide-0.040
 draw-0.038
 pretend-0.038
 hall-0.037
Token kept
Token ID4030
Feature activation-6.12e-01

Pos logit contributions
 hunt+0.005
 hide+0.004
 hall+0.004
 draw+0.004
 land+0.004
Neg logit contributions
lessly-0.005
der-0.005
ously-0.005
lessness-0.004
rier-0.004
Token getting
Token ID1972
Feature activation-3.10e+00

Pos logit contributions
lessly+0.019
der+0.018
ously+0.017
rier+0.016
lessness+0.016
Neg logit contributions
 hunt-0.015
 hide-0.014
 hall-0.013
 draw-0.013
 pretend-0.013
Token ang
Token ID3550
Feature activation-7.79e+00

Pos logit contributions
lessly+0.084
der+0.081
ously+0.076
lessness+0.072
rier+0.070
Neg logit contributions
 hunt-0.087
 hide-0.086
 draw-0.082
 pretend-0.081
 land-0.080
Tokenrier
Token ID5277
Feature activation-5.94e+00

Pos logit contributions
lessly+0.092
ously+0.090
oral+0.088
lessness+0.087
 importantly+0.077
Neg logit contributions
 hide-0.292
 hunt-0.282
 pretend-0.274
 land-0.266
 draw-0.265
Token and
Token ID290
Feature activation-1.46e+00

Pos logit contributions
lessly+0.215
oral+0.203
 importantly+0.198
arl+0.193
rier+0.191
Neg logit contributions
 hunt-0.104
 hide-0.096
 pretend-0.087
 land-0.084
 hall-0.079
Token ang
Token ID3550
Feature activation-8.01e+00

Pos logit contributions
lessly+0.038
der+0.036
ously+0.034
lessness+0.033
rier+0.032
Neg logit contributions
 hunt-0.043
 hide-0.041
 draw-0.039
 pretend-0.039
 hall-0.039
Tokenrier
Token ID5277
Feature activation-5.80e+00

Pos logit contributions
ously+0.098
lessly+0.095
 importantly+0.091
lessness+0.090
oral+0.084
Neg logit contributions
 hide-0.292
 hunt-0.283
 pretend-0.280
 land-0.274
 collect-0.260
Token.
Token ID13
Feature activation+2.56e-01

Pos logit contributions
lessly+0.211
 importantly+0.195
rier+0.192
oral+0.190
der+0.186
Neg logit contributions
 hunt-0.099
 hide-0.092
 pretend-0.086
 land-0.080
 play-0.077
Token.
Token ID13
Feature activation-9.74e-01

Pos logit contributions
lessly+0.023
der+0.022
ously+0.021
rier+0.021
lessness+0.020
Neg logit contributions
 hunt-0.018
 hide-0.017
 draw-0.016
 pretend-0.016
 hall-0.016
Token Everyone
Token ID11075
Feature activation-1.82e+00

Pos logit contributions
lessly+0.029
der+0.028
rier+0.027
ously+0.026
lessness+0.025
Neg logit contributions
 hunt-0.024
 hide-0.023
 hall-0.022
 draw-0.022
c-0.022
Token praised
Token ID15342
Feature activation-5.77e+00

Pos logit contributions
lessly+0.061
der+0.058
ously+0.055
lessness+0.054
rier+0.053
Neg logit contributions
 hunt-0.040
 hide-0.037
 hall-0.035
 draw-0.035
 pretend-0.035
Token her
Token ID607
Feature activation-5.52e+00

Pos logit contributions
lessly+0.200
der+0.198
rier+0.185
ously+0.182
lessness+0.179
Neg logit contributions
 hunt-0.110
 hide-0.109
 draw-0.101
 pretend-0.100
 hall-0.097
Token cle
Token ID1190
Feature activation-8.06e+00

Pos logit contributions
lessly+0.153
der+0.144
ously+0.136
rier+0.134
lessness+0.131
Neg logit contributions
 hunt-0.149
 hide-0.144
 draw-0.139
 pretend-0.139
 hall-0.135
Tokenvern
Token ID933
Feature activation-7.78e+00

Pos logit contributions
der+0.190
lessly+0.163
ously+0.149
lessness+0.141
rier+0.138
Neg logit contributions
 hunt-0.253
 hide-0.245
 draw-0.229
 collect-0.227
e-0.225
Tokeness
Token ID408
Feature activation-5.63e+00

Pos logit contributions
lessly+0.228
der+0.227
 importantly+0.223
oral+0.217
ously+0.216
Neg logit contributions
 hunt-0.145
 hide-0.135
 draw-0.129
ci-0.129
c-0.124
Token and
Token ID290
Feature activation-3.44e+00

Pos logit contributions
lessly+0.156
der+0.147
ously+0.138
rier+0.134
lessness+0.134
Neg logit contributions
 hunt-0.154
 hide-0.146
 draw-0.140
 hall-0.139
 pretend-0.138
Token eventually
Token ID4191
Feature activation-1.58e+00

Pos logit contributions
lessly+0.104
der+0.099
rier+0.093
ously+0.092
lessness+0.090
Neg logit contributions
 hunt-0.086
 hide-0.082
 draw-0.078
 pretend-0.077
 hall-0.077
Token Jan
Token ID2365
Feature activation-3.07e+00

Pos logit contributions
lessly+0.049
der+0.046
ously+0.044
lessness+0.043
rier+0.042
Neg logit contributions
 hunt-0.039
 hide-0.036
 hall-0.034
 draw-0.034
 pretend-0.034
Tokenie
Token ID494
Feature activation-1.79e+00

Pos logit contributions
lessly+0.106
der+0.101
ously+0.096
lessness+0.094
rier+0.093
Neg logit contributions
 hunt-0.065
 hide-0.061
 hall-0.057
 draw-0.057
 pretend-0.056
Token and
Token ID290
Feature activation-4.51e-02

Pos logit contributions
lessly+0.070
der+0.067
ously+0.062
rier+0.061
lessness+0.061
Neg logit contributions
 hunt-0.057
 hide-0.054
 draw-0.051
 hall-0.051
 pretend-0.051
Token listen
Token ID6004
Feature activation-2.35e+00

Pos logit contributions
lessly+0.001
der+0.001
 understanding+0.001
lessness+0.001
ously+0.001
Neg logit contributions
 hunt-0.001
 hall-0.001
c-0.001
 exhibit-0.001
 hide-0.001
Token,"
Token ID553
Feature activation-3.06e+00

Pos logit contributions
lessly+0.072
der+0.069
ously+0.066
rier+0.065
lessness+0.063
Neg logit contributions
 hunt-0.056
 hide-0.054
 draw-0.051
 pretend-0.051
 land-0.049
Token her
Token ID607
Feature activation-5.24e+00

Pos logit contributions
lessly+0.120
der+0.116
rier+0.112
ously+0.108
lessness+0.105
Neg logit contributions
 draw-0.051
 hide-0.049
 hunt-0.048
 play-0.048
 pretend-0.046
Token mom
Token ID1995
Feature activation-3.28e+00

Pos logit contributions
lessly+0.149
der+0.134
ously+0.131
rier+0.128
lessness+0.124
Neg logit contributions
 hide-0.143
 draw-0.142
 hall-0.140
 hunt-0.137
 pretend-0.135
Token said
Token ID531
Feature activation-7.50e+00

Pos logit contributions
lessly+0.117
der+0.109
rier+0.107
lessness+0.103
ously+0.103
Neg logit contributions
 hunt-0.064
 hide-0.062
 draw-0.059
c-0.058
 pretend-0.056
Token.
Token ID13
Feature activation-2.12e+00

Pos logit contributions
oral+0.283
lessly+0.281
der+0.277
arl+0.261
 importantly+0.260
Neg logit contributions
 hide-0.097
 hunt-0.087
 draw-0.084
 play-0.082
 outside-0.081
Token
Token ID198
Feature activation-5.43e+00

Pos logit contributions
lessly+0.072
rier+0.070
der+0.069
ously+0.064
lessness+0.063
Neg logit contributions
 hide-0.041
 hunt-0.041
 draw-0.040
 hall-0.039
 park-0.039
Token
Token ID198
Feature activation-4.92e+00

Pos logit contributions
lessly+0.176
der+0.174
rier+0.174
ously+0.159
lessness+0.154
Neg logit contributions
 hunt-0.119
 hide-0.116
 draw-0.110
 park-0.109
 play-0.109
Token"
Token ID1
Feature activation-2.04e+00

Pos logit contributions
rier+0.147
der+0.146
lessly+0.146
ously+0.127
lessness+0.126
Neg logit contributions
 hunt-0.119
 hide-0.116
c-0.112
 draw-0.112
 park-0.111
TokenI
Token ID40
Feature activation+1.80e-01

Pos logit contributions
lessly+0.066
der+0.061
rier+0.060
ously+0.060
lessness+0.057
Neg logit contributions
 park-0.042
 draw-0.041
 hunt-0.041
 hide-0.041
 pretend-0.039
Token,
Token ID11
Feature activation-1.23e+00

Pos logit contributions
 hunt+0.026
 draw+0.023
 hide+0.023
 hall+0.023
c+0.023
Neg logit contributions
lessly-0.031
der-0.030
ously-0.028
lessness-0.027
rier-0.027
Token getting
Token ID1972
Feature activation-2.41e+00

Pos logit contributions
lessly+0.037
der+0.035
ously+0.033
rier+0.032
lessness+0.032
Neg logit contributions
 hunt-0.031
 hide-0.028
 draw-0.028
 hall-0.027
c-0.027
Token ang
Token ID3550
Feature activation-6.75e+00

Pos logit contributions
lessly+0.059
der+0.056
ously+0.052
lessness+0.050
rier+0.050
Neg logit contributions
 hunt-0.074
 hide-0.071
 draw-0.068
 hall-0.068
 pretend-0.068
Tokenrier
Token ID5277
Feature activation-5.19e+00

Pos logit contributions
lessly+0.070
lessness+0.061
ously+0.056
 importantly+0.054
arl+0.048
Neg logit contributions
 hide-0.280
 land-0.279
 hunt-0.277
 pretend-0.268
 garden-0.256
Token and
Token ID290
Feature activation-1.82e+00

Pos logit contributions
lessly+0.174
oral+0.159
rier+0.156
arl+0.155
der+0.155
Neg logit contributions
 hunt-0.105
 hide-0.099
 land-0.093
 pretend-0.092
 hall-0.088
Token ang
Token ID3550
Feature activation-7.47e+00

Pos logit contributions
lessly+0.041
der+0.038
ously+0.035
lessness+0.034
rier+0.034
Neg logit contributions
 hunt-0.060
 hide-0.057
 hall-0.055
 draw-0.055
 pretend-0.055
Tokenrier
Token ID5277
Feature activation-5.90e+00

Pos logit contributions
lessly+0.080
lessness+0.074
 importantly+0.072
ously+0.070
arl+0.060
Neg logit contributions
 land-0.299
 hide-0.299
 hunt-0.293
 pretend-0.285
 collect-0.268
Token.
Token ID13
Feature activation-4.67e-01

Pos logit contributions
lessly+0.200
oral+0.182
rier+0.182
 importantly+0.181
arl+0.180
Neg logit contributions
 hunt-0.111
 hide-0.105
 land-0.098
 pretend-0.098
 hall-0.093
Token The
Token ID383
Feature activation-1.71e+00

Pos logit contributions
lessly+0.014
der+0.013
ously+0.012
lessness+0.012
rier+0.012
Neg logit contributions
 hunt-0.012
 hide-0.012
 hall-0.011
 draw-0.011
 pretend-0.011
Token two
Token ID734
Feature activation-1.99e+00

Pos logit contributions
lessly+0.049
der+0.047
rier+0.044
ously+0.042
lessness+0.042
Neg logit contributions
 hunt-0.045
 hide-0.042
 draw-0.040
 hall-0.040
c-0.040
Token friends
Token ID2460
Feature activation+1.61e-01

Pos logit contributions
lessly+0.052
der+0.049
ously+0.045
arl+0.044
lessness+0.044
Neg logit contributions
 hunt-0.058
 hide-0.057
 land-0.054
 pretend-0.054
 draw-0.053
Token for
Token ID329
Feature activation-2.20e+00

Pos logit contributions
lessly+0.184
der+0.180
ously+0.170
lessness+0.169
rier+0.164
Neg logit contributions
 hunt-0.111
 hide-0.110
 pretend-0.099
 draw-0.096
 land-0.096
Token his
Token ID465
Feature activation-5.25e+00

Pos logit contributions
lessly+0.069
der+0.066
ously+0.062
rier+0.062
lessness+0.061
Neg logit contributions
 hunt-0.053
 hide-0.051
 pretend-0.047
 draw-0.047
 hall-0.046
Token obedience
Token ID28777
Feature activation-4.26e+00

Pos logit contributions
lessly+0.147
der+0.140
rier+0.133
ously+0.126
lessness+0.125
Neg logit contributions
 hunt-0.150
 hide-0.140
 pretend-0.136
 play-0.134
 land-0.134
Token and
Token ID290
Feature activation-3.32e+00

Pos logit contributions
lessly+0.126
der+0.119
ously+0.112
rier+0.110
lessness+0.110
Neg logit contributions
 hunt-0.110
 hide-0.104
 draw-0.098
 hall-0.098
 pretend-0.098
Token cle
Token ID1190
Feature activation-7.37e+00

Pos logit contributions
lessly+0.094
der+0.089
ously+0.083
rier+0.082
lessness+0.081
Neg logit contributions
 hunt-0.090
 hide-0.085
 draw-0.081
 pretend-0.081
 hall-0.080
Tokenvern
Token ID933
Feature activation-7.45e+00

Pos logit contributions
der+0.179
lessly+0.137
ously+0.131
lessness+0.125
rier+0.124
Neg logit contributions
 hunt-0.224
 hide-0.224
e-0.206
 explore-0.200
 collect-0.198
Tokeness
Token ID408
Feature activation-4.79e+00

Pos logit contributions
der+0.225
lessly+0.222
 importantly+0.221
oral+0.221
lessness+0.213
Neg logit contributions
 hunt-0.134
 hide-0.119
ci-0.119
 play-0.110
 explore-0.106
Token.
Token ID13
Feature activation-1.81e+00

Pos logit contributions
lessly+0.137
der+0.130
ously+0.122
lessness+0.118
rier+0.118
Neg logit contributions
 hunt-0.128
 hide-0.121
 draw-0.115
 hall-0.115
 pretend-0.114
Token<|endoftext|>
Token ID50256
Feature activation+5.52e+00

Pos logit contributions
lessly+0.058
rier+0.058
der+0.056
ously+0.053
arl+0.051
Neg logit contributions
 hunt-0.037
c-0.036
 forest-0.035
 hall-0.035
 park-0.035
TokenOnce
Token ID7454
Feature activation+4.28e+00

Pos logit contributions
 hunt+0.189
 draw+0.176
 hide+0.176
 pretend+0.173
 hall+0.168
Neg logit contributions
lessly-0.114
der-0.098
lessness-0.094
ously-0.093
oral-0.080
Token upon
Token ID2402
Feature activation+3.75e+00

Pos logit contributions
 hunt+0.078
 hide+0.074
 hall+0.073
 land+0.069
 draw+0.069
Neg logit contributions
lessly-0.153
der-0.152
rier-0.144
ously-0.138
lessness-0.132
Tokenio
Token ID952
Feature activation-2.01e+00

Pos logit contributions
lessly+0.116
der+0.110
ously+0.103
rier+0.100
lessness+0.098
Neg logit contributions
 hunt-0.117
 hide-0.111
 hall-0.106
 draw-0.106
 pretend-0.105
Token was
Token ID373
Feature activation-3.52e+00

Pos logit contributions
lessly+0.065
der+0.061
rier+0.059
ously+0.058
lessness+0.057
Neg logit contributions
 hunt-0.046
 hide-0.043
 draw-0.041
 pretend-0.041
c-0.040
Token thankful
Token ID25535
Feature activation-5.68e+00

Pos logit contributions
lessly+0.091
der+0.090
ously+0.082
rier+0.079
lessness+0.079
Neg logit contributions
 hunt-0.099
 hide-0.099
 pretend-0.094
 draw-0.092
 land-0.092
Token and
Token ID290
Feature activation-3.50e+00

Pos logit contributions
lessly+0.184
rier+0.178
der+0.174
arl+0.167
oral+0.162
Neg logit contributions
 hunt-0.120
 hide-0.114
 pretend-0.104
 play-0.104
 draw-0.103
Token thanked
Token ID26280
Feature activation-4.07e+00

Pos logit contributions
lessly+0.096
der+0.093
rier+0.086
ously+0.086
lessness+0.083
Neg logit contributions
 hunt-0.095
 hide-0.092
 pretend-0.087
 draw-0.087
 land-0.086
Token T
Token ID309
Feature activation-7.39e+00

Pos logit contributions
lessly+0.138
der+0.131
ously+0.124
lessness+0.122
rier+0.121
Neg logit contributions
 hunt-0.088
 hide-0.083
 draw-0.079
 pretend-0.078
 hall-0.078
Tokenummy
Token ID13513
Feature activation-2.94e+00

Pos logit contributions
lessly+0.214
der+0.191
lessness+0.190
ously+0.184
 importantly+0.177
Neg logit contributions
 hunt-0.189
 hide-0.175
 draw-0.170
 pretend-0.167
 land-0.166
Token before
Token ID878
Feature activation-2.54e+00

Pos logit contributions
lessly+0.077
der+0.072
ously+0.068
lessness+0.066
rier+0.066
Neg logit contributions
 hunt-0.087
 hide-0.082
 draw-0.079
 pretend-0.079
 hall-0.077
Token they
Token ID484
Feature activation-1.01e+00

Pos logit contributions
lessly+0.080
der+0.077
rier+0.073
ously+0.071
lessness+0.070
Neg logit contributions
 hunt-0.060
 hide-0.056
 hall-0.054
 draw-0.054
 land-0.054
Token each
Token ID1123
Feature activation-1.11e+00

Pos logit contributions
lessly+0.033
der+0.031
ously+0.030
lessness+0.029
rier+0.028
Neg logit contributions
 hunt-0.022
 hide-0.021
 hall-0.020
 draw-0.020
 pretend-0.020
Token continued
Token ID3767
Feature activation-3.54e-01

Pos logit contributions
lessly+0.037
der+0.034
ously+0.033
lessness+0.032
rier+0.031
Neg logit contributions
 hunt-0.024
 hide-0.022
 hall-0.022
 draw-0.022
 pretend-0.021
Token praised
Token ID15342
Feature activation-5.47e+00

Pos logit contributions
lessly+0.056
der+0.054
ously+0.050
rier+0.048
lessness+0.048
Neg logit contributions
 hunt-0.064
 hide-0.061
 draw-0.058
 hall-0.058
 pretend-0.058
Token for
Token ID329
Feature activation-2.20e+00

Pos logit contributions
lessly+0.184
der+0.180
ously+0.170
lessness+0.169
rier+0.164
Neg logit contributions
 hunt-0.111
 hide-0.110
 pretend-0.099
 draw-0.096
 land-0.096
Token his
Token ID465
Feature activation-5.25e+00

Pos logit contributions
lessly+0.069
der+0.066
ously+0.062
rier+0.062
lessness+0.061
Neg logit contributions
 hunt-0.053
 hide-0.051
 pretend-0.047
 draw-0.047
 hall-0.046
Token obedience
Token ID28777
Feature activation-4.26e+00

Pos logit contributions
lessly+0.147
der+0.140
rier+0.133
ously+0.126
lessness+0.125
Neg logit contributions
 hunt-0.150
 hide-0.140
 pretend-0.136
 play-0.134
 land-0.134
Token and
Token ID290
Feature activation-3.32e+00

Pos logit contributions
lessly+0.126
der+0.119
ously+0.112
rier+0.110
lessness+0.110
Neg logit contributions
 hunt-0.110
 hide-0.104
 draw-0.098
 hall-0.098
 pretend-0.098
Token cle
Token ID1190
Feature activation-7.37e+00

Pos logit contributions
lessly+0.094
der+0.089
ously+0.083
rier+0.082
lessness+0.081
Neg logit contributions
 hunt-0.090
 hide-0.085
 draw-0.081
 pretend-0.081
 hall-0.080
Tokenvern
Token ID933
Feature activation-7.45e+00

Pos logit contributions
der+0.179
lessly+0.137
ously+0.131
lessness+0.125
rier+0.124
Neg logit contributions
 hunt-0.224
 hide-0.224
e-0.206
 explore-0.200
 collect-0.198
Tokeness
Token ID408
Feature activation-4.79e+00

Pos logit contributions
der+0.225
lessly+0.222
 importantly+0.221
oral+0.221
lessness+0.213
Neg logit contributions
 hunt-0.134
 hide-0.119
ci-0.119
 play-0.110
 explore-0.106
Token.
Token ID13
Feature activation-1.81e+00

Pos logit contributions
lessly+0.137
der+0.130
ously+0.122
lessness+0.118
rier+0.118
Neg logit contributions
 hunt-0.128
 hide-0.121
 draw-0.115
 hall-0.115
 pretend-0.114
Token<|endoftext|>
Token ID50256
Feature activation+5.52e+00

Pos logit contributions
lessly+0.058
rier+0.058
der+0.056
ously+0.053
arl+0.051
Neg logit contributions
 hunt-0.037
c-0.036
 forest-0.035
 hall-0.035
 park-0.035
TokenOnce
Token ID7454
Feature activation+4.28e+00

Pos logit contributions
 hunt+0.189
 draw+0.176
 hide+0.176
 pretend+0.173
 hall+0.168
Neg logit contributions
lessly-0.114
der-0.098
lessness-0.094
ously-0.093
oral-0.080
Tokentr
Token ID2213
Feature activation-1.63e+00

Pos logit contributions
lessly+0.011
der+0.011
rier+0.010
ously+0.010
lessness+0.010
Neg logit contributions
 hunt-0.009
 hide-0.008
 draw-0.008
 hall-0.008
 pretend-0.008
Tokenay
Token ID323
Feature activation-1.22e+00

Pos logit contributions
lessly+0.038
ously+0.035
der+0.034
rier+0.031
lessness+0.031
Neg logit contributions
 hunt-0.050
c-0.049
 hide-0.048
 draw-0.048
 hall-0.048
Token again
Token ID757
Feature activation-1.02e+00

Pos logit contributions
lessly+0.035
der+0.034
rier+0.031
ously+0.031
lessness+0.031
Neg logit contributions
 hunt-0.032
 hide-0.030
 hall-0.029
 draw-0.028
 land-0.028
Token,"
Token ID553
Feature activation-2.90e+00

Pos logit contributions
lessly+0.030
der+0.028
ously+0.026
rier+0.026
lessness+0.026
Neg logit contributions
 hunt-0.026
 hide-0.025
 hall-0.024
 draw-0.024
 pretend-0.023
Token Tim
Token ID5045
Feature activation-1.83e+00

Pos logit contributions
lessly+0.082
der+0.079
ously+0.073
lessness+0.071
rier+0.070
Neg logit contributions
 hunt-0.079
 hide-0.072
 hall-0.070
 draw-0.069
 pretend-0.068
Token says
Token ID1139
Feature activation-7.30e+00

Pos logit contributions
lessly+0.051
der+0.050
ously+0.046
rier+0.043
lessness+0.042
Neg logit contributions
 hunt-0.049
 hide-0.045
 hall-0.045
 land-0.043
 draw-0.043
Token.
Token ID13
Feature activation-1.72e+00

Pos logit contributions
lessly+0.276
der+0.267
oral+0.266
 importantly+0.253
arl+0.251
Neg logit contributions
 hide-0.105
 hunt-0.095
 draw-0.093
 play-0.089
 pretend-0.086
Token
Token ID198
Feature activation-5.12e+00

Pos logit contributions
lessly+0.062
rier+0.060
der+0.060
ously+0.057
lessness+0.057
Neg logit contributions
 hide-0.031
 hunt-0.029
 park-0.028
 play-0.028
 pretend-0.027
Token
Token ID198
Feature activation-4.60e+00

Pos logit contributions
lessly+0.171
der+0.167
rier+0.161
ously+0.156
lessness+0.150
Neg logit contributions
 hunt-0.111
 hide-0.110
 play-0.103
 park-0.102
 draw-0.102
Token"
Token ID1
Feature activation-1.45e+00

Pos logit contributions
der+0.125
lessly+0.124
rier+0.116
ously+0.109
fulness+0.107
Neg logit contributions
c-0.123
 hide-0.123
 hunt-0.122
 park-0.119
 play-0.118
TokenYou
Token ID1639
Feature activation-8.12e-01

Pos logit contributions
lessly+0.046
ously+0.043
der+0.043
rier+0.041
lessness+0.040
Neg logit contributions
 hunt-0.032
 hide-0.031
 park-0.030
 pretend-0.030
 draw-0.030
Token
Token ID198
Feature activation-2.44e+00

Pos logit contributions
lessly+0.009
der+0.009
ously+0.008
lessness+0.008
rier+0.008
Neg logit contributions
 hunt-0.007
 hide-0.006
 hall-0.006
 pretend-0.006
 park-0.006
Token
Token ID198
Feature activation-2.45e+00

Pos logit contributions
der+0.087
lessly+0.084
rier+0.084
ously+0.078
lessness+0.078
Neg logit contributions
 hunt-0.047
 hide-0.046
 park-0.042
 play-0.042
c-0.041
TokenThe
Token ID464
Feature activation-2.66e+00

Pos logit contributions
der+0.076
lessly+0.074
rier+0.072
ously+0.070
lessness+0.069
Neg logit contributions
 hunt-0.057
 hide-0.055
 park-0.053
c-0.052
 land-0.052
Token giant
Token ID6175
Feature activation-1.14e+00

Pos logit contributions
lessly+0.085
der+0.082
rier+0.076
ously+0.075
lessness+0.075
Neg logit contributions
 hunt-0.061
 hide-0.056
 draw-0.054
 hall-0.054
c-0.053
Token got
Token ID1392
Feature activation-3.55e+00

Pos logit contributions
lessly+0.036
der+0.035
ously+0.033
lessness+0.032
rier+0.032
Neg logit contributions
 hunt-0.026
 hide-0.025
 hall-0.023
 draw-0.023
 pretend-0.023
Token ang
Token ID3550
Feature activation-7.28e+00

Pos logit contributions
der+0.118
lessly+0.115
ously+0.107
lessness+0.106
arl+0.104
Neg logit contributions
 hide-0.080
 hunt-0.075
 pretend-0.071
 play-0.071
 land-0.068
Tokenrier
Token ID5277
Feature activation-5.89e+00

Pos logit contributions
lessly+0.090
lessness+0.085
 importantly+0.078
ously+0.073
arl+0.072
Neg logit contributions
 hide-0.282
 hunt-0.278
 land-0.274
 pretend-0.263
 collect-0.257
Token and
Token ID290
Feature activation-2.14e+00

Pos logit contributions
lessly+0.209
oral+0.198
arl+0.195
rier+0.194
 importantly+0.193
Neg logit contributions
 hunt-0.102
 hide-0.094
 pretend-0.085
 land-0.083
 hall-0.076
Token began
Token ID2540
Feature activation-2.36e+00

Pos logit contributions
lessly+0.062
der+0.058
ously+0.056
lessness+0.055
rier+0.053
Neg logit contributions
 hunt-0.057
 hide-0.054
 land-0.051
 pretend-0.050
 hall-0.050
Token to
Token ID284
Feature activation+1.82e+00

Pos logit contributions
lessly+0.076
der+0.073
rier+0.070
lessness+0.069
ously+0.066
Neg logit contributions
 hunt-0.055
 hide-0.053
 pretend-0.048
 draw-0.048
 land-0.047
Token stomp
Token ID32221
Feature activation-1.14e+00

Pos logit contributions
 hunt+0.040
 hall+0.039
c+0.039
 library+0.038
 drew+0.038
Neg logit contributions
lessly-0.053
der-0.050
ously-0.049
 understanding-0.046
rier-0.045
TokenThis
Token ID1212
Feature activation-1.18e+00

Pos logit contributions
lessly+0.064
der+0.062
rier+0.058
lessness+0.055
ously+0.054
Neg logit contributions
 hunt-0.073
 hide-0.070
 draw-0.069
 pretend-0.068
 hall-0.068
Token made
Token ID925
Feature activation-1.50e+00

Pos logit contributions
lessly+0.032
der+0.031
ously+0.029
lessness+0.028
oral+0.028
Neg logit contributions
 hunt-0.031
 hide-0.029
c-0.027
 draw-0.027
 collect-0.027
Token the
Token ID262
Feature activation-9.02e-01

Pos logit contributions
lessly+0.052
der+0.049
ously+0.047
lessness+0.046
rier+0.046
Neg logit contributions
 hunt-0.032
 hide-0.029
 hall-0.028
 draw-0.027
 pretend-0.027
Token boy
Token ID2933
Feature activation-8.03e-01

Pos logit contributions
lessly+0.029
der+0.027
ously+0.025
lessness+0.025
rier+0.025
Neg logit contributions
 hunt-0.022
 hide-0.020
 draw-0.019
 pretend-0.018
c-0.018
Token even
Token ID772
Feature activation-2.18e+00

Pos logit contributions
lessly+0.023
der+0.022
ously+0.021
rier+0.020
lessness+0.020
Neg logit contributions
 hunt-0.021
 hide-0.020
 draw-0.019
 pretend-0.019
 hall-0.019
Token ang
Token ID3550
Feature activation-7.25e+00

Pos logit contributions
lessly+0.057
der+0.054
ously+0.050
lessness+0.049
rier+0.049
Neg logit contributions
 hunt-0.064
 hide-0.061
 draw-0.058
 hall-0.058
 pretend-0.058
Tokenrier
Token ID5277
Feature activation-5.51e+00

Pos logit contributions
lessly+0.084
lessness+0.075
ously+0.071
 importantly+0.069
oral+0.061
Neg logit contributions
 hide-0.282
 hunt-0.277
 land-0.270
 pretend-0.270
 collect-0.261
Token,
Token ID11
Feature activation-4.26e+00

Pos logit contributions
lessly+0.189
arl+0.171
der+0.170
rier+0.169
oral+0.168
Neg logit contributions
 hunt-0.107
 hide-0.101
 pretend-0.095
 land-0.091
 hall-0.091
Token and
Token ID290
Feature activation-2.07e+00

Pos logit contributions
lessly+0.132
der+0.124
ously+0.121
oral+0.120
arl+0.120
Neg logit contributions
 hunt-0.095
 hide-0.092
 pretend-0.086
 hall-0.086
 draw-0.086
Token he
Token ID339
Feature activation-7.18e-01

Pos logit contributions
lessly+0.057
der+0.054
ously+0.050
rier+0.049
lessness+0.049
Neg logit contributions
 hunt-0.058
 hide-0.054
 hall-0.052
 draw-0.052
 pretend-0.052
Token began
Token ID2540
Feature activation-9.37e-01

Pos logit contributions
lessly+0.021
der+0.020
ously+0.019
rier+0.017
lessness+0.017
Neg logit contributions
 hunt-0.019
 hall-0.017
 hide-0.017
 draw-0.016
 library-0.016
Token something
Token ID1223
Feature activation-3.38e+00

Pos logit contributions
 hunt+0.018
 hall+0.016
 hide+0.016
 draw+0.015
 land+0.015
Neg logit contributions
lessly-0.024
der-0.023
rier-0.022
ously-0.021
lessness-0.021
Token,"
Token ID553
Feature activation-2.50e+00

Pos logit contributions
lessly+0.110
ously+0.103
der+0.100
lessness+0.096
rier+0.095
Neg logit contributions
 hunt-0.078
 hide-0.076
 pretend-0.070
 draw-0.069
 land-0.068
Token the
Token ID262
Feature activation-2.84e+00

Pos logit contributions
lessly+0.094
der+0.088
ously+0.087
rier+0.084
lessness+0.084
Neg logit contributions
 hunt-0.046
 draw-0.045
 hide-0.045
 play-0.043
 park-0.042
Token bald
Token ID28273
Feature activation-3.14e+00

Pos logit contributions
lessly+0.081
der+0.075
ously+0.072
lessness+0.069
rier+0.068
Neg logit contributions
 hunt-0.079
 hide-0.075
 draw-0.072
 hall-0.072
 pretend-0.071
Token man
Token ID582
Feature activation-3.00e+00

Pos logit contributions
lessly+0.105
lessness+0.095
 importantly+0.094
ously+0.091
der+0.088
Neg logit contributions
 hunt-0.069
 draw-0.062
 land-0.062
 hide-0.061
 park-0.060
Token says
Token ID1139
Feature activation-7.24e+00

Pos logit contributions
lessly+0.094
der+0.091
ously+0.085
rier+0.083
lessness+0.082
Neg logit contributions
 hunt-0.071
 hide-0.066
 hall-0.064
 draw-0.062
 pretend-0.062
Token.
Token ID13
Feature activation-1.47e+00

Pos logit contributions
lessly+0.274
oral+0.265
der+0.258
 importantly+0.251
arl+0.245
Neg logit contributions
 hide-0.097
 draw-0.090
 hunt-0.090
 pretend-0.083
 play-0.082
Token
Token ID198
Feature activation-4.81e+00

Pos logit contributions
lessly+0.055
ously+0.052
rier+0.052
der+0.052
lessness+0.050
Neg logit contributions
 park-0.023
 hunt-0.022
 hide-0.022
 draw-0.022
 play-0.022
Token
Token ID198
Feature activation-4.41e+00

Pos logit contributions
lessly+0.168
der+0.163
rier+0.162
ously+0.158
lessness+0.151
Neg logit contributions
 hide-0.092
 hunt-0.091
 park-0.088
 play-0.087
 draw-0.087
Token"
Token ID1
Feature activation-1.13e+00

Pos logit contributions
lessly+0.131
der+0.129
rier+0.124
ously+0.121
lessness+0.119
Neg logit contributions
 hunt-0.101
 park-0.101
 hide-0.100
c-0.100
 draw-0.099
TokenOK
Token ID11380
Feature activation-2.47e+00

Pos logit contributions
lessly+0.035
ously+0.033
der+0.032
lessness+0.031
rier+0.030
Neg logit contributions
 hunt-0.025
 park-0.025
 draw-0.024
 pretend-0.024
 hide-0.024
Token I
Token ID314
Feature activation+1.48e+00

Pos logit contributions
lessly+0.023
der+0.021
ously+0.020
lessness+0.020
rier+0.019
Neg logit contributions
 hunt-0.021
 hide-0.020
 hall-0.019
 draw-0.019
 pretend-0.019
Token forgive
Token ID20927
Feature activation-1.41e+00

Pos logit contributions
 hunt+0.037
 hall+0.035
 hide+0.034
 draw+0.032
 library+0.032
Neg logit contributions
lessly-0.044
der-0.041
ously-0.038
lessness-0.038
rier-0.036
Token you
Token ID345
Feature activation-2.79e+00

Pos logit contributions
lessly+0.044
der+0.043
ously+0.039
rier+0.039
lessness+0.038
Neg logit contributions
 hunt-0.033
 hall-0.030
 hide-0.029
 draw-0.028
 pretend-0.028
Token,"
Token ID553
Feature activation-3.28e+00

Pos logit contributions
lessly+0.096
der+0.089
ously+0.088
rier+0.084
lessness+0.084
Neg logit contributions
 hunt-0.060
 hide-0.058
 draw-0.053
 pretend-0.053
 play-0.052
Token she
Token ID673
Feature activation-3.42e+00

Pos logit contributions
lessly+0.115
der+0.109
rier+0.105
ously+0.104
lessness+0.103
Neg logit contributions
 hunt-0.067
 hide-0.066
 draw-0.062
 park-0.061
 play-0.061
Token says
Token ID1139
Feature activation-7.22e+00

Pos logit contributions
lessly+0.093
der+0.091
ously+0.081
rier+0.079
lessness+0.077
Neg logit contributions
 hunt-0.092
 hall-0.086
 hide-0.085
 land-0.082
 draw-0.080
Token.
Token ID13
Feature activation-2.11e+00

Pos logit contributions
lessly+0.275
der+0.270
oral+0.268
 importantly+0.251
 virtue+0.250
Neg logit contributions
 hide-0.095
 hunt-0.086
 draw-0.086
 play-0.083
 pretend-0.074
Token
Token ID198
Feature activation-5.17e+00

Pos logit contributions
lessly+0.071
rier+0.067
der+0.065
lessness+0.065
ously+0.062
Neg logit contributions
 hunt-0.043
 hide-0.043
 park-0.041
 garden-0.039
 play-0.039
Token
Token ID198
Feature activation-4.64e+00

Pos logit contributions
lessly+0.166
der+0.160
rier+0.157
ously+0.150
lessness+0.147
Neg logit contributions
 hunt-0.117
 hide-0.113
 park-0.108
 draw-0.106
 play-0.105
TokenThey
Token ID2990
Feature activation-4.02e-01

Pos logit contributions
lessly+0.120
der+0.116
rier+0.113
lessness+0.105
ously+0.102
Neg logit contributions
 hunt-0.136
 hide-0.131
 park-0.129
c-0.125
 play-0.124
Token go
Token ID467
Feature activation-7.69e-01

Pos logit contributions
lessly+0.012
der+0.011
ously+0.011
lessness+0.010
arl+0.010
Neg logit contributions
 hunt-0.009
c-0.009
 hall-0.009
 hide-0.008
 exhibit-0.008
Token my
Token ID616
Feature activation-3.83e+00

Pos logit contributions
lessly+0.101
der+0.094
ously+0.094
lessness+0.087
rier+0.086
Neg logit contributions
 hunt-0.084
 hide-0.083
 pretend-0.078
 land-0.077
 hall-0.076
Token love
Token ID1842
Feature activation-2.52e+00

Pos logit contributions
lessly+0.092
der+0.082
ously+0.081
rier+0.079
lessness+0.074
Neg logit contributions
 hunt-0.116
 hide-0.116
 draw-0.111
 pretend-0.111
 land-0.110
Token,"
Token ID553
Feature activation-3.20e+00

Pos logit contributions
lessly+0.083
der+0.078
ously+0.075
rier+0.073
lessness+0.072
Neg logit contributions
 hunt-0.057
 hide-0.054
 draw-0.050
 pretend-0.050
 land-0.050
Token said
Token ID531
Feature activation-5.56e+00

Pos logit contributions
lessly+0.104
der+0.102
rier+0.096
ously+0.095
arl+0.091
Neg logit contributions
 hide-0.068
 hunt-0.068
 draw-0.066
 play-0.065
 pretend-0.065
Token M
Token ID337
Feature activation-4.22e+00

Pos logit contributions
lessly+0.185
der+0.178
ously+0.165
lessness+0.161
rier+0.157
Neg logit contributions
 hide-0.122
 hunt-0.122
 draw-0.113
 pretend-0.113
 land-0.111
Tokenomm
Token ID2002
Feature activation-7.21e+00

Pos logit contributions
lessly+0.080
ously+0.061
lessness+0.058
der+0.056
rier+0.056
Neg logit contributions
 hide-0.139
 hunt-0.139
 hall-0.137
 exhibit-0.135
c-0.133
Tokena
Token ID64
Feature activation-4.13e+00

Pos logit contributions
der+0.220
arl+0.216
lessly+0.214
rier+0.194
 importantly+0.191
Neg logit contributions
c-0.143
 hide-0.133
 hunt-0.119
 draw-0.117
e-0.116
Token.
Token ID13
Feature activation-1.95e+00

Pos logit contributions
lessly+0.146
der+0.136
arl+0.132
lessness+0.130
rier+0.129
Neg logit contributions
 hide-0.080
 hunt-0.080
c-0.076
 draw-0.074
 pretend-0.073
Token "
Token ID366
Feature activation-2.30e-01

Pos logit contributions
lessly+0.055
der+0.051
rier+0.049
lessness+0.049
oral+0.048
Neg logit contributions
 hunt-0.048
 hide-0.047
 garden-0.045
 hall-0.044
 draw-0.044
TokenLet
Token ID5756
Feature activation+9.08e-01

Pos logit contributions
lessly+0.006
der+0.006
ously+0.006
rier+0.006
lessness+0.005
Neg logit contributions
 hunt-0.006
 hide-0.006
 draw-0.006
 pretend-0.006
 hall-0.006
Token's
Token ID338
Feature activation+3.05e+00

Pos logit contributions
 hunt+0.015
 hide+0.013
 hall+0.013
 draw+0.012
 pretend+0.012
Neg logit contributions
lessly-0.035
der-0.033
ously-0.032
rier-0.031
lessness-0.031
Token my
Token ID616
Feature activation-3.74e+00

Pos logit contributions
lessly+0.087
der+0.082
rier+0.075
lessness+0.073
ously+0.070
Neg logit contributions
 hunt-0.105
 hall-0.096
 hide-0.094
c-0.093
 land-0.091
Token best
Token ID1266
Feature activation-3.30e+00

Pos logit contributions
lessly+0.105
der+0.095
ously+0.093
lessness+0.089
rier+0.085
Neg logit contributions
 hunt-0.105
 hide-0.103
 pretend-0.098
 draw-0.096
 hall-0.095
Token friend
Token ID1545
Feature activation-3.00e+00

Pos logit contributions
lessly+0.107
ously+0.093
lessness+0.093
der+0.092
rier+0.086
Neg logit contributions
 hide-0.081
 hunt-0.079
 pretend-0.078
 draw-0.076
 play-0.074
Token,"
Token ID553
Feature activation-2.46e+00

Pos logit contributions
lessly+0.096
der+0.091
ously+0.086
rier+0.084
lessness+0.083
Neg logit contributions
 hunt-0.071
 hide-0.069
 pretend-0.063
 draw-0.063
 hall-0.062
Token Tom
Token ID4186
Feature activation-1.94e+00

Pos logit contributions
lessly+0.073
der+0.070
ously+0.065
lessness+0.064
rier+0.063
Neg logit contributions
 hunt-0.064
 hide-0.060
 hall-0.057
 draw-0.057
 pretend-0.056
Token says
Token ID1139
Feature activation-7.20e+00

Pos logit contributions
lessly+0.053
der+0.053
ously+0.048
rier+0.046
lessness+0.045
Neg logit contributions
 hunt-0.052
 hall-0.048
 hide-0.047
 land-0.046
 draw-0.045
Token.
Token ID13
Feature activation-1.36e+00

Pos logit contributions
lessly+0.274
der+0.268
oral+0.262
 importantly+0.254
ously+0.250
Neg logit contributions
 hide-0.102
 draw-0.094
 play-0.093
 hunt-0.092
 pretend-0.086
Token
Token ID198
Feature activation-4.90e+00

Pos logit contributions
lessly+0.051
der+0.049
rier+0.049
ously+0.049
lessness+0.046
Neg logit contributions
 play-0.022
 hide-0.022
 draw-0.021
 hunt-0.021
 pretend-0.020
Token
Token ID198
Feature activation-4.43e+00

Pos logit contributions
lessly+0.171
der+0.170
rier+0.165
ously+0.162
lessness+0.151
Neg logit contributions
 hide-0.095
 hunt-0.093
 play-0.093
 draw-0.089
 park-0.089
TokenL
Token ID43
Feature activation-4.53e+00

Pos logit contributions
der+0.143
lessly+0.135
rier+0.135
ously+0.133
lessness+0.124
Neg logit contributions
c-0.090
 park-0.089
 draw-0.089
 play-0.088
 hide-0.088
Tokenily
Token ID813
Feature activation-7.06e-01

Pos logit contributions
lessly+0.123
der+0.114
rier+0.109
ously+0.108
lessness+0.106
Neg logit contributions
 hunt-0.124
 hide-0.122
 draw-0.116
 hall-0.114
 park-0.112
Token to
Token ID284
Feature activation-2.15e+00

Pos logit contributions
lessly+0.166
der+0.152
rier+0.151
arl+0.150
oral+0.149
Neg logit contributions
 hunt-0.097
 hide-0.096
c-0.085
 play-0.085
 draw-0.085
Token yourself
Token ID3511
Feature activation-3.52e+00

Pos logit contributions
lessly+0.073
der+0.070
ously+0.068
lessness+0.066
rier+0.065
Neg logit contributions
 hunt-0.045
 hide-0.043
 draw-0.040
 land-0.040
 pretend-0.040
Token and
Token ID290
Feature activation-3.03e+00

Pos logit contributions
lessly+0.110
der+0.104
ously+0.099
lessness+0.097
rier+0.096
Neg logit contributions
 hunt-0.082
 hide-0.077
 pretend-0.073
 draw-0.073
 land-0.072
Token not
Token ID407
Feature activation-2.33e+00

Pos logit contributions
lessly+0.088
der+0.083
ously+0.079
rier+0.077
lessness+0.076
Neg logit contributions
 hunt-0.080
 hide-0.076
 draw-0.072
 pretend-0.071
 land-0.070
Token be
Token ID307
Feature activation-3.85e+00

Pos logit contributions
lessly+0.075
der+0.071
ously+0.068
rier+0.067
lessness+0.066
Neg logit contributions
 hunt-0.055
 hide-0.052
 draw-0.050
 pretend-0.048
 land-0.048
Token mad
Token ID8805
Feature activation-7.19e+00

Pos logit contributions
lessly+0.101
der+0.097
ously+0.092
lessness+0.088
oral+0.087
Neg logit contributions
 hide-0.108
 hunt-0.107
c-0.102
 draw-0.100
 pretend-0.100
Token anymore
Token ID7471
Feature activation-4.05e+00

Pos logit contributions
lessly+0.248
oral+0.243
ously+0.233
lessness+0.226
arl+0.225
Neg logit contributions
 hide-0.109
 hunt-0.105
 pretend-0.102
 play-0.098
 land-0.097
Token.
Token ID13
Feature activation-9.63e-01

Pos logit contributions
lessly+0.139
rier+0.130
der+0.128
ously+0.126
arl+0.125
Neg logit contributions
 hunt-0.080
 hide-0.075
 pretend-0.069
 draw-0.068
 play-0.067
Token Nem
Token ID22547
Feature activation-2.14e+00

Pos logit contributions
lessly+0.031
ously+0.029
der+0.029
rier+0.028
lessness+0.027
Neg logit contributions
 hunt-0.019
 hide-0.019
 park-0.018
c-0.018
 garden-0.018
Tokeno
Token ID78
Feature activation-2.65e+00

Pos logit contributions
lessly+0.074
der+0.071
ously+0.066
rier+0.066
lessness+0.065
Neg logit contributions
 hunt-0.045
 hide-0.041
c-0.039
 draw-0.039
 hall-0.038
Token thought
Token ID1807
Feature activation-3.59e+00

Pos logit contributions
lessly+0.089
der+0.082
rier+0.081
ously+0.079
arl+0.078
Neg logit contributions
 hunt-0.056
 hide-0.054
c-0.051
 draw-0.050
 pretend-0.050
Token a
Token ID257
Feature activation+3.38e+00

Pos logit contributions
 hunt+0.105
 hall+0.095
 hide+0.092
c+0.092
 draw+0.090
Neg logit contributions
lessly-0.115
der-0.107
ously-0.099
lessness-0.099
 understanding-0.093
Token time
Token ID640
Feature activation+2.41e+00

Pos logit contributions
 hunt+0.079
 hide+0.074
 pretend+0.070
 hall+0.069
 draw+0.069
Neg logit contributions
lessly-0.110
der-0.105
ously-0.099
rier-0.099
lessness-0.097
Token,
Token ID11
Feature activation+4.43e+00

Pos logit contributions
 hunt+0.052
 hide+0.051
 hall+0.047
 pretend+0.046
 draw+0.046
Neg logit contributions
lessly-0.082
der-0.079
rier-0.074
ously-0.074
lessness-0.072
Token in
Token ID287
Feature activation+5.64e+00

Pos logit contributions
 hunt+0.087
 hide+0.075
 hall+0.072
 draw+0.072
 land+0.069
Neg logit contributions
lessly-0.162
der-0.152
ously-0.145
lessness-0.145
rier-0.141
Token a
Token ID257
Feature activation+7.02e+00

Pos logit contributions
 hunt+0.143
 hall+0.127
 hide+0.126
 draw+0.124
 land+0.119
Neg logit contributions
lessly-0.165
der-0.152
ously-0.143
lessness-0.141
rier-0.139
Token small
Token ID1402
Feature activation+5.89e+00

Pos logit contributions
 hunt+0.237
 hide+0.214
ci+0.213
 draw+0.205
 drew+0.203
Neg logit contributions
lessly-0.112
 understanding-0.098
 much-0.092
Days-0.091
oral-0.090
Token village
Token ID7404
Feature activation+4.18e+00

Pos logit contributions
 hunt+0.125
 hide+0.108
nob+0.107
 drew+0.103
 draw+0.101
Neg logit contributions
lessly-0.156
der-0.151
rier-0.147
lessness-0.145
 understanding-0.144
Token,
Token ID11
Feature activation+3.43e+00

Pos logit contributions
 hunt+0.122
 hide+0.110
 pretend+0.108
 draw+0.107
 hall+0.103
Neg logit contributions
lessly-0.104
der-0.094
lessness-0.092
ously-0.090
 importantly-0.086
Token there
Token ID612
Feature activation+4.39e+00

Pos logit contributions
 hunt+0.079
 draw+0.067
 hall+0.064
c+0.064
 hide+0.063
Neg logit contributions
lessly-0.113
lessness-0.100
der-0.100
ously-0.098
 importantly-0.094
Token was
Token ID373
Feature activation+1.89e+00

Pos logit contributions
 hunt+0.120
 hall+0.101
 draw+0.099
 hide+0.098
c+0.098
Neg logit contributions
lessly-0.108
ously-0.107
der-0.104
lessness-0.099
 importantly-0.090
Token a
Token ID257
Feature activation+2.13e+00

Pos logit contributions
 hunt+0.055
c+0.048
 draw+0.046
 hide+0.046
 hall+0.046
Neg logit contributions
lessly-0.048
der-0.045
lessness-0.040
 importantly-0.040
 understanding-0.039
Token Lily
Token ID20037
Feature activation+3.94e+00

Pos logit contributions
 hunt+0.107
 hall+0.103
 land+0.101
c+0.100
 draw+0.100
Neg logit contributions
lessly-0.090
der-0.088
arl-0.074
rier-0.073
ously-0.072
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
 hunt+0.110
 hide+0.103
 hall+0.101
 draw+0.099
 pretend+0.098
Neg logit contributions
lessly-0.105
der-0.099
ously-0.095
lessness-0.091
rier-0.084
Token Lily
Token ID20037
Feature activation+3.50e+00

Pos logit contributions
 hunt+0.047
 hide+0.042
 land+0.042
 hall+0.040
 pretend+0.040
Neg logit contributions
lessly-0.039
der-0.037
lessness-0.033
ously-0.032
arl-0.031
Token lived
Token ID5615
Feature activation+2.56e+00

Pos logit contributions
 hunt+0.102
 exhibit+0.091
 restaurants+0.091
 hall+0.090
 hide+0.089
Neg logit contributions
lessly-0.077
der-0.071
lessness-0.069
ously-0.067
 badly-0.059
Token in
Token ID287
Feature activation+5.65e+00

Pos logit contributions
 hunt+0.064
 hall+0.056
 draw+0.054
 pretend+0.053
 library+0.053
Neg logit contributions
lessly-0.079
der-0.072
lessness-0.068
ously-0.068
rier-0.067
Token a
Token ID257
Feature activation+6.39e+00

Pos logit contributions
 hunt+0.162
 hall+0.142
 exhibit+0.140
ci+0.139
 draw+0.136
Neg logit contributions
lessly-0.137
der-0.120
lessness-0.117
 understanding-0.116
 importantly-0.110
Token small
Token ID1402
Feature activation+5.05e+00

Pos logit contributions
 hunt+0.197
 draw+0.179
 hide+0.176
 drew+0.176
ci+0.174
Neg logit contributions
lessly-0.123
arl-0.105
der-0.105
lessness-0.102
 understanding-0.099
Token pond
Token ID16723
Feature activation+3.90e+00

Pos logit contributions
 hunt+0.091
 draw+0.079
 drew+0.079
 hide+0.078
nob+0.076
Neg logit contributions
lessly-0.159
der-0.158
rier-0.147
lessness-0.146
ously-0.146
Token with
Token ID351
Feature activation+2.56e+00

Pos logit contributions
 hunt+0.098
 hall+0.086
 draw+0.086
 pretend+0.085
 hide+0.085
Neg logit contributions
lessly-0.113
der-0.103
lessness-0.099
ously-0.097
 understanding-0.092
Token her
Token ID607
Feature activation+2.13e+00

Pos logit contributions
 hunt+0.057
 hide+0.054
 draw+0.051
 hall+0.050
 pretend+0.050
Neg logit contributions
lessly-0.085
der-0.081
ously-0.076
lessness-0.075
rier-0.075
Token family
Token ID1641
Feature activation+2.31e+00

Pos logit contributions
 hunt+0.063
 hide+0.060
 draw+0.057
 hall+0.057
 pretend+0.057
Neg logit contributions
lessly-0.055
der-0.052
ously-0.049
lessness-0.047
rier-0.047
Token named
Token ID3706
Feature activation+4.17e+00

Pos logit contributions
 hunt+0.071
 hide+0.061
 hall+0.059
 pretend+0.057
 library+0.057
Neg logit contributions
lessly-0.084
der-0.081
ously-0.078
lessness-0.076
 importantly-0.074
Token Lily
Token ID20037
Feature activation+3.89e+00

Pos logit contributions
 hunt+0.123
 land+0.105
 hide+0.103
c+0.103
 pretend+0.102
Neg logit contributions
lessly-0.107
der-0.096
arl-0.092
lessness-0.091
ously-0.087
Token.
Token ID13
Feature activation+1.66e+00

Pos logit contributions
 hunt+0.108
 hide+0.101
 hall+0.098
 pretend+0.095
 draw+0.095
Neg logit contributions
lessly-0.106
der-0.100
ously-0.095
lessness-0.092
rier-0.085
Token She
Token ID1375
Feature activation+2.78e+00

Pos logit contributions
 hunt+0.054
 hide+0.047
 land+0.047
 pretend+0.046
c+0.045
Neg logit contributions
lessly-0.039
der-0.037
lessness-0.034
ously-0.032
arl-0.032
Token loved
Token ID6151
Feature activation+3.06e+00

Pos logit contributions
 hunt+0.081
 hide+0.072
 exhibit+0.072
 hall+0.071
c+0.070
Neg logit contributions
lessly-0.068
der-0.062
lessness-0.059
ously-0.057
 importantly-0.054
Token to
Token ID284
Feature activation+6.60e+00

Pos logit contributions
 hunt+0.094
 hide+0.089
c+0.088
 hall+0.085
 pretend+0.084
Neg logit contributions
lessly-0.063
der-0.063
ously-0.055
 understanding-0.055
lessness-0.051
Token explore
Token ID7301
Feature activation+5.01e+00

Pos logit contributions
c+0.091
 exhibit+0.083
 hunt+0.082
 met+0.079
 hall+0.079
Neg logit contributions
 understanding-0.214
lessly-0.213
der-0.210
arl-0.200
ously-0.199
Token the
Token ID262
Feature activation+3.64e+00

Pos logit contributions
 hunt+0.138
 drew+0.130
c+0.127
e+0.126
 met+0.125
Neg logit contributions
lessly-0.122
der-0.113
ously-0.099
lessness-0.098
 understanding-0.097
Token garden
Token ID11376
Feature activation+3.75e+00

Pos logit contributions
tail+0.090
e+0.090
 hunt+0.088
 drew+0.081
c+0.080
Neg logit contributions
lessly-0.091
arl-0.083
der-0.081
oral-0.079
rier-0.077
Token behind
Token ID2157
Feature activation+4.81e+00

Pos logit contributions
 hunt+0.104
 drew+0.096
 draw+0.095
 met+0.094
c+0.093
Neg logit contributions
lessly-0.087
der-0.082
lessness-0.075
rier-0.071
ously-0.070
Token her
Token ID607
Feature activation+4.01e+00

Pos logit contributions
 hunt+0.144
ci+0.128
asis+0.127
ital+0.124
c+0.120
Neg logit contributions
der-0.100
lessly-0.097
 much-0.086
ously-0.086
 understanding-0.086
Token he
Token ID339
Feature activation-4.98e-01

Pos logit contributions
lessly+0.101
der+0.094
ously+0.088
lessness+0.086
rier+0.085
Neg logit contributions
 hunt-0.112
 hide-0.106
 hall-0.102
 draw-0.102
 pretend-0.101
Token had
Token ID550
Feature activation+1.29e+00

Pos logit contributions
lessly+0.013
der+0.012
lessness+0.011
ously+0.011
 importantly+0.010
Neg logit contributions
 hall-0.013
 hunt-0.013
 hide-0.013
c-0.012
 draw-0.012
Token ever
Token ID1683
Feature activation+5.58e-01

Pos logit contributions
 hunt+0.044
 hall+0.043
 hide+0.042
c+0.042
 draw+0.041
Neg logit contributions
lessly-0.024
der-0.022
ously-0.021
lessness-0.019
rier-0.018
Token seen
Token ID1775
Feature activation+1.47e-01

Pos logit contributions
 hunt+0.009
 hide+0.009
 hall+0.009
 pretend+0.009
 draw+0.008
Neg logit contributions
lessly-0.020
der-0.019
ously-0.018
lessness-0.018
 understanding-0.017
Token.
Token ID13
Feature activation+1.35e-01

Pos logit contributions
 hunt+0.004
 hall+0.004
 draw+0.004
 hide+0.004
 pretend+0.004
Neg logit contributions
lessly-0.004
der-0.003
rier-0.003
lessness-0.003
ously-0.003
Token<|endoftext|>
Token ID50256
Feature activation+5.51e+00

Pos logit contributions
 hunt+0.003
 hide+0.003
 hall+0.003
 play+0.003
 draw+0.003
Neg logit contributions
lessly-0.004
der-0.004
rier-0.004
ously-0.004
lessness-0.004
TokenOnce
Token ID7454
Feature activation+3.79e+00

Pos logit contributions
 hunt+0.186
 hide+0.173
 draw+0.172
 pretend+0.169
 hall+0.168
Neg logit contributions
lessly-0.115
der-0.101
ously-0.096
lessness-0.094
rier-0.083
Token upon
Token ID2402
Feature activation+3.90e+00

Pos logit contributions
 hunt+0.062
 hide+0.060
 hall+0.059
 play+0.058
 draw+0.058
Neg logit contributions
der-0.140
lessly-0.140
rier-0.135
ously-0.127
lessness-0.121
Token a
Token ID257
Feature activation+3.01e+00

Pos logit contributions
 hunt+0.098
 hall+0.090
 hide+0.087
c+0.086
 draw+0.085
Neg logit contributions
lessly-0.113
der-0.105
ously-0.098
lessness-0.097
 understanding-0.091
Token time
Token ID640
Feature activation+2.20e+00

Pos logit contributions
 hunt+0.067
 hide+0.063
 pretend+0.059
 draw+0.058
 hall+0.058
Neg logit contributions
lessly-0.102
der-0.097
rier-0.093
ously-0.092
lessness-0.090
Token,
Token ID11
Feature activation+4.44e+00

Pos logit contributions
 hunt+0.052
 hide+0.049
 hall+0.046
 draw+0.046
 pretend+0.045
Neg logit contributions
lessly-0.071
der-0.067
ously-0.063
rier-0.062
lessness-0.062
Token.
Token ID13
Feature activation+2.40e+00

Pos logit contributions
 hunt+0.032
 hide+0.031
 pretend+0.029
 draw+0.029
 land+0.028
Neg logit contributions
lessly-0.053
der-0.050
ously-0.048
rier-0.047
lessness-0.047
Token They
Token ID1119
Feature activation+3.68e+00

Pos logit contributions
 hunt+0.070
 hide+0.067
 draw+0.064
 pretend+0.064
 hall+0.064
Neg logit contributions
lessly-0.064
der-0.060
ously-0.056
rier-0.056
lessness-0.055
Token like
Token ID588
Feature activation+3.93e+00

Pos logit contributions
 hunt+0.089
 hall+0.081
c+0.080
 hide+0.078
 exhibit+0.078
Neg logit contributions
lessly-0.113
der-0.102
lessness-0.098
ously-0.097
arl-0.090
Token to
Token ID284
Feature activation+6.21e+00

Pos logit contributions
 hunt+0.118
 hide+0.112
 hall+0.111
c+0.111
 pretend+0.106
Neg logit contributions
lessly-0.090
der-0.087
ously-0.079
lessness-0.075
rier-0.072
Token play
Token ID711
Feature activation+3.74e+00

Pos logit contributions
 hall+0.075
c+0.073
 hunt+0.072
 exhibit+0.071
 library+0.069
Neg logit contributions
 understanding-0.185
ously-0.183
der-0.182
lessly-0.181
oral-0.177
Token in
Token ID287
Feature activation+7.20e+00

Pos logit contributions
 hunt+0.101
 hall+0.101
 library+0.099
c+0.098
 restaurants+0.097
Neg logit contributions
der-0.085
lessly-0.072
rier-0.063
ously-0.063
 badly-0.062
Token the
Token ID262
Feature activation+6.06e+00

Pos logit contributions
 hunt+0.220
asis+0.219
 met+0.210
c+0.208
atmeal+0.207
Neg logit contributions
 understanding-0.091
 much-0.090
lessly-0.086
Their-0.084
 importantly-0.071
Token park
Token ID3952
Feature activation+3.80e+00

Pos logit contributions
 hunt+0.096
e+0.094
 hide+0.087
tail+0.081
nob+0.079
Neg logit contributions
arl-0.180
oral-0.175
lessly-0.168
der-0.166
 much-0.162
Token.
Token ID13
Feature activation+2.87e+00

Pos logit contributions
 hunt+0.114
 draw+0.106
 hide+0.104
 hall+0.104
 pretend+0.101
Neg logit contributions
der-0.086
lessly-0.086
ously-0.072
lessness-0.071
rier-0.069
Token Today
Token ID6288
Feature activation+2.40e+00

Pos logit contributions
 hunt+0.078
 hide+0.073
 hall+0.069
 land+0.069
 draw+0.068
Neg logit contributions
lessly-0.078
der-0.075
ously-0.068
lessness-0.067
rier-0.064
Token,
Token ID11
Feature activation+1.48e-01

Pos logit contributions
 hunt+0.067
 draw+0.060
 hide+0.060
c+0.059
 hall+0.058
Neg logit contributions
lessly-0.062
der-0.060
ously-0.055
lessness-0.054
rier-0.051
Token day
Token ID1110
Feature activation-7.97e-01

Pos logit contributions
lessly+0.022
der+0.021
ously+0.020
lessness+0.020
rier+0.019
Neg logit contributions
 hunt-0.015
 hide-0.014
 draw-0.014
 hall-0.013
c-0.013
Token.
Token ID13
Feature activation-1.47e-01

Pos logit contributions
lessly+0.021
der+0.020
ously+0.019
lessness+0.018
rier+0.018
Neg logit contributions
 hunt-0.022
 hide-0.021
 hall-0.020
 draw-0.020
c-0.020
Token<|endoftext|>
Token ID50256
Feature activation+5.42e+00

Pos logit contributions
lessly+0.005
der+0.005
rier+0.005
ously+0.004
lessness+0.004
Neg logit contributions
 hunt-0.003
 hide-0.003
 play-0.003
c-0.003
 hall-0.003
TokenOne
Token ID3198
Feature activation+3.34e+00

Pos logit contributions
 hunt+0.161
 draw+0.148
 hide+0.146
 pretend+0.145
 imagine+0.139
Neg logit contributions
lessly-0.137
der-0.120
lessness-0.118
ously-0.116
oral-0.104
Token day
Token ID1110
Feature activation+1.41e+00

Pos logit contributions
 hunt+0.081
 hide+0.074
 draw+0.070
 hall+0.069
 pretend+0.069
Neg logit contributions
lessly-0.103
der-0.094
lessness-0.092
ously-0.090
oral-0.084
Token it
Token ID340
Feature activation+3.04e+00

Pos logit contributions
 hunt+0.029
 hide+0.028
 pretend+0.026
 draw+0.025
 hall+0.025
Neg logit contributions
lessly-0.049
der-0.047
rier-0.045
lessness-0.044
ously-0.043
Token was
Token ID373
Feature activation+1.59e+00

Pos logit contributions
 hunt+0.082
 hide+0.073
 draw+0.071
 pretend+0.070
 library+0.069
Neg logit contributions
lessly-0.085
der-0.081
ously-0.073
lessness-0.072
oral-0.068
Token raining
Token ID43079
Feature activation+3.45e+00

Pos logit contributions
 hunt+0.037
 hide+0.034
 draw+0.032
 hall+0.032
 pretend+0.032
Neg logit contributions
lessly-0.051
der-0.048
ously-0.046
lessness-0.045
rier-0.044
Token outside
Token ID2354
Feature activation+4.23e+00

Pos logit contributions
 hunt+0.070
 hide+0.063
 draw+0.062
c+0.060
 pretend+0.059
Neg logit contributions
lessly-0.121
der-0.115
ously-0.107
lessness-0.106
rier-0.103
Token.
Token ID13
Feature activation+2.25e+00

Pos logit contributions
 hunt+0.113
 hide+0.104
 draw+0.101
 hall+0.099
c+0.099
Neg logit contributions
lessly-0.120
der-0.114
ously-0.106
lessness-0.103
rier-0.103
Token It
Token ID632
Feature activation+2.73e+00

Pos logit contributions
 hunt+0.067
 hide+0.062
 draw+0.059
 hall+0.059
 pretend+0.059
Neg logit contributions
lessly-0.058
der-0.054
ously-0.050
lessness-0.049
rier-0.048
Token's
Token ID338
Feature activation+2.85e+00

Pos logit contributions
 hunt+0.040
 hall+0.035
 hide+0.033
 draw+0.033
 collect+0.033
Neg logit contributions
lessly-0.055
der-0.054
ously-0.052
rier-0.050
oral-0.049
Token get
Token ID651
Feature activation-4.17e-01

Pos logit contributions
 hunt+0.068
c+0.063
 hall+0.062
 library+0.061
 exhibit+0.060
Neg logit contributions
lessly-0.077
der-0.072
 understanding-0.068
rier-0.068
ously-0.067
Token you
Token ID345
Feature activation+9.26e-01

Pos logit contributions
lessly+0.013
der+0.012
lessness+0.011
rier+0.011
ously+0.011
Neg logit contributions
 hunt-0.010
 hall-0.009
 hide-0.009
 library-0.008
c-0.008
Token out
Token ID503
Feature activation+4.93e-03

Pos logit contributions
 hunt+0.023
 hall+0.022
 hide+0.021
 library+0.021
 forest+0.020
Neg logit contributions
lessly-0.026
der-0.024
lessness-0.023
ously-0.022
arl-0.022
Token of
Token ID286
Feature activation+2.16e+00

Pos logit contributions
 hunt+0.000
 hide+0.000
 draw+0.000
 hall+0.000
 pretend+0.000
Neg logit contributions
lessly-0.000
der-0.000
ously-0.000
rier-0.000
lessness-0.000
Token that
Token ID326
Feature activation+1.36e+00

Pos logit contributions
 hunt+0.049
 hide+0.044
c+0.043
 hall+0.042
 draw+0.042
Neg logit contributions
lessly-0.068
der-0.065
ously-0.061
rier-0.060
lessness-0.059
Token dress
Token ID6576
Feature activation+2.76e-03

Pos logit contributions
 hunt+0.031
 hide+0.028
 draw+0.026
 drew+0.026
c+0.026
Neg logit contributions
lessly-0.040
der-0.040
rier-0.039
lessness-0.038
ously-0.035
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
 hunt+0.000
 hide+0.000
 hall+0.000
 draw+0.000
 land+0.000
Neg logit contributions
lessly-0.000
der-0.000
rier-0.000
lessness-0.000
ously-0.000
Token clean
Token ID3424
Feature activation+5.64e-01

Pos logit contributions
 hunt+0.021
 hall+0.019
c+0.019
 exhibit+0.019
 hide+0.018
Neg logit contributions
lessly-0.033
der-0.032
ously-0.029
lessness-0.029
 understanding-0.029
Token up
Token ID510
Feature activation+1.56e-01

Pos logit contributions
 hunt+0.014
 hide+0.012
 hall+0.012
 draw+0.012
 land+0.011
Neg logit contributions
lessly-0.018
der-0.017
rier-0.017
lessness-0.015
ously-0.015
Token this
Token ID428
Feature activation-5.29e-01

Pos logit contributions
 hunt+0.004
 hide+0.004
 hall+0.003
 draw+0.003
 pretend+0.003
Neg logit contributions
lessly-0.005
der-0.005
ously-0.004
lessness-0.004
rier-0.004
Token love
Token ID1842
Feature activation+5.57e-01

Pos logit contributions
 hunt+0.077
 hide+0.071
 hall+0.069
 draw+0.067
 pretend+0.066
Neg logit contributions
lessly-0.105
der-0.099
ously-0.094
lessness-0.092
rier-0.090
Token it
Token ID340
Feature activation-7.59e-01

Pos logit contributions
 hunt+0.012
 hide+0.011
 hall+0.011
 draw+0.011
 pretend+0.011
Neg logit contributions
lessly-0.018
der-0.018
ously-0.017
rier-0.016
lessness-0.016
Token!"
Token ID2474
Feature activation+3.52e-01

Pos logit contributions
lessly+0.023
der+0.022
ously+0.021
lessness+0.020
rier+0.020
Neg logit contributions
 hunt-0.019
 hide-0.018
 draw-0.017
 pretend-0.017
 hall-0.017
Token<|endoftext|>
Token ID50256
Feature activation+4.82e+00

Pos logit contributions
 hunt+0.008
 hide+0.008
 draw+0.008
 hall+0.008
 pretend+0.008
Neg logit contributions
lessly-0.011
der-0.010
rier-0.010
ously-0.010
lessness-0.009
TokenOnce
Token ID7454
Feature activation+3.04e+00

Pos logit contributions
 hunt+0.172
 hide+0.159
 pretend+0.157
 draw+0.157
 hall+0.152
Neg logit contributions
lessly-0.089
der-0.075
lessness-0.073
ously-0.071
arl-0.066
Token upon
Token ID2402
Feature activation+3.73e+00

Pos logit contributions
 hunt+0.047
 hall+0.046
 hide+0.045
 draw+0.045
 land+0.045
Neg logit contributions
lessly-0.111
der-0.110
rier-0.105
ously-0.103
lessness-0.096
Token a
Token ID257
Feature activation+2.48e+00

Pos logit contributions
 hunt+0.096
 hall+0.086
 hide+0.085
c+0.082
 draw+0.081
Neg logit contributions
lessly-0.110
der-0.105
lessness-0.094
ously-0.094
rier-0.090
Token time
Token ID640
Feature activation+2.14e+00

Pos logit contributions
 hunt+0.050
 hide+0.047
 pretend+0.045
 draw+0.044
 park+0.042
Neg logit contributions
lessly-0.091
der-0.087
rier-0.084
ously-0.081
lessness-0.081
Token,
Token ID11
Feature activation+3.73e+00

Pos logit contributions
 hunt+0.048
 hide+0.046
 hall+0.043
 draw+0.043
 pretend+0.043
Neg logit contributions
lessly-0.070
der-0.067
ously-0.063
rier-0.063
lessness-0.061
Token there
Token ID612
Feature activation+4.30e+00

Pos logit contributions
 hunt+0.065
 hide+0.058
 draw+0.054
 hall+0.054
 pretend+0.053
Neg logit contributions
lessly-0.143
der-0.137
ously-0.131
lessness-0.128
rier-0.128
Token was
Token ID373
Feature activation+2.32e+00

Pos logit contributions
 hunt+0.123
 draw+0.106
 hide+0.105
 hall+0.104
 pretend+0.100
Neg logit contributions
lessly-0.105
der-0.100
ously-0.099
lessness-0.096
 importantly-0.086
Token
Token ID198
Feature activation-1.48e+00

Pos logit contributions
lessly+0.048
der+0.046
rier+0.043
ously+0.043
lessness+0.042
Neg logit contributions
 hunt-0.042
 hide-0.041
 draw-0.039
 hall-0.038
 pretend-0.038
TokenHer
Token ID9360
Feature activation-2.17e+00

Pos logit contributions
lessly+0.030
der+0.027
ously+0.025
lessness+0.024
rier+0.023
Neg logit contributions
 hunt-0.052
 hide-0.050
 hall-0.048
 draw-0.047
 land-0.047
Token mom
Token ID1995
Feature activation-2.78e-01

Pos logit contributions
lessly+0.073
der+0.065
ously+0.063
rier+0.062
lessness+0.061
Neg logit contributions
 pretend-0.053
 draw-0.053
 play-0.051
 hide-0.051
 hunt-0.050
Token asked
Token ID1965
Feature activation-7.75e-01

Pos logit contributions
lessly+0.009
der+0.008
rier+0.008
ously+0.008
lessness+0.008
Neg logit contributions
 hunt-0.007
 hide-0.006
 draw-0.006
 hall-0.006
 pretend-0.006
Token,
Token ID11
Feature activation-2.01e+00

Pos logit contributions
lessly+0.025
der+0.024
ously+0.022
lessness+0.022
rier+0.022
Neg logit contributions
 hunt-0.018
 hide-0.017
 hall-0.016
 draw-0.016
 pretend-0.016
Token "
Token ID366
Feature activation+1.38e+00

Pos logit contributions
lessly+0.052
der+0.049
ously+0.046
lessness+0.044
rier+0.044
Neg logit contributions
 hide-0.057
 hunt-0.056
 draw-0.056
 pretend-0.055
 hall-0.054
TokenWhat
Token ID2061
Feature activation+6.44e-01

Pos logit contributions
 hunt+0.034
 hide+0.031
 hall+0.030
 draw+0.030
 pretend+0.030
Neg logit contributions
lessly-0.043
der-0.041
ously-0.039
rier-0.038
lessness-0.037
Token does
Token ID857
Feature activation-1.21e+00

Pos logit contributions
 hunt+0.016
 hall+0.014
 land+0.013
c+0.013
asis+0.013
Neg logit contributions
der-0.017
lessly-0.017
 importantly-0.016
lessness-0.016
 understanding-0.016
Token it
Token ID340
Feature activation+1.73e+00

Pos logit contributions
lessly+0.038
der+0.036
ously+0.034
lessness+0.033
rier+0.033
Neg logit contributions
 hunt-0.029
 hide-0.027
 hall-0.026
 draw-0.026
 pretend-0.026
Token say
Token ID910
Feature activation+5.00e-02

Pos logit contributions
 hunt+0.038
 hall+0.036
 hide+0.035
c+0.033
 library+0.033
Neg logit contributions
lessly-0.053
der-0.050
rier-0.047
lessness-0.047
 understanding-0.046
Token,
Token ID11
Feature activation-2.10e+00

Pos logit contributions
 hunt+0.001
 hide+0.001
 hall+0.001
c+0.001
 draw+0.001
Neg logit contributions
lessly-0.002
der-0.002
rier-0.001
lessness-0.001
ously-0.001
Token to
Token ID284
Feature activation+6.20e+00

Pos logit contributions
 hunt+0.076
 hide+0.073
 hall+0.072
c+0.072
 draw+0.070
Neg logit contributions
lessly-0.060
der-0.058
ously-0.053
lessness-0.049
 understanding-0.049
Token sail
Token ID14936
Feature activation+3.93e+00

Pos logit contributions
c+0.089
 hall+0.083
 exhibit+0.082
 gardens+0.078
 met+0.077
Neg logit contributions
lessly-0.193
 understanding-0.192
der-0.187
arl-0.186
ously-0.185
Token on
Token ID319
Feature activation+4.61e+00

Pos logit contributions
 hunt+0.099
 hall+0.092
 draw+0.091
 hide+0.089
 pretend+0.086
Neg logit contributions
lessly-0.117
der-0.105
lessness-0.100
ously-0.098
rier-0.096
Token his
Token ID465
Feature activation+4.90e+00

Pos logit contributions
 hunt+0.109
 drew+0.102
 hide+0.101
c+0.100
 draw+0.099
Neg logit contributions
lessly-0.130
der-0.128
ously-0.113
rier-0.108
lessness-0.107
Token boat
Token ID8848
Feature activation+4.09e+00

Pos logit contributions
 hunt+0.104
 hide+0.096
 hall+0.091
 draw+0.091
 pretend+0.090
Neg logit contributions
lessly-0.169
der-0.161
ously-0.153
lessness-0.150
rier-0.150
Token with
Token ID351
Feature activation+2.50e+00

Pos logit contributions
 hunt+0.102
 hide+0.100
 hall+0.100
 draw+0.098
 library+0.096
Neg logit contributions
lessly-0.106
der-0.097
ously-0.091
lessness-0.089
rier-0.088
Token his
Token ID465
Feature activation+2.13e+00

Pos logit contributions
 hunt+0.068
 hide+0.064
 hall+0.062
 draw+0.061
 pretend+0.060
Neg logit contributions
lessly-0.071
der-0.067
ously-0.063
lessness-0.061
rier-0.061
Token friends
Token ID2460
Feature activation+2.83e+00

Pos logit contributions
 hunt+0.056
 hide+0.052
 pretend+0.051
 land+0.051
 play+0.050
Neg logit contributions
lessly-0.068
der-0.064
ously-0.059
rier-0.058
lessness-0.057
Token.
Token ID13
Feature activation+1.72e+00

Pos logit contributions
 hunt+0.072
 hide+0.068
 hall+0.065
 draw+0.065
 pretend+0.064
Neg logit contributions
lessly-0.085
der-0.080
ously-0.076
lessness-0.074
rier-0.073
Token They
Token ID1119
Feature activation+2.74e+00

Pos logit contributions
 hunt+0.047
 hide+0.044
 draw+0.042
 hall+0.042
 pretend+0.042
Neg logit contributions
lessly-0.049
der-0.046
ously-0.044
lessness-0.043
rier-0.043
Token would
Token ID561
Feature activation+3.47e+00

Pos logit contributions
 hunt+0.062
 hide+0.058
 hall+0.057
 draw+0.055
c+0.054
Neg logit contributions
lessly-0.087
der-0.081
ously-0.078
lessness-0.076
rier-0.073
Token day
Token ID1110
Feature activation-1.31e-01

Pos logit contributions
 hunt+0.032
 hide+0.030
 draw+0.028
 hall+0.028
 pretend+0.027
Neg logit contributions
lessly-0.047
der-0.045
ously-0.042
lessness-0.042
rier-0.041
Token,
Token ID11
Feature activation+3.76e-01

Pos logit contributions
lessly+0.004
der+0.004
ously+0.004
lessness+0.004
rier+0.004
Neg logit contributions
 hunt-0.003
 hide-0.003
 draw-0.003
 hall-0.003
 pretend-0.003
Token an
Token ID281
Feature activation+4.59e-01

Pos logit contributions
 hunt+0.009
 hide+0.008
 hall+0.008
 draw+0.008
c+0.008
Neg logit contributions
lessly-0.012
der-0.011
ously-0.011
rier-0.010
lessness-0.010
Token elephant
Token ID20950
Feature activation+3.39e+00

Pos logit contributions
 hunt+0.010
 hide+0.009
 pretend+0.009
 land+0.009
 hall+0.009
Neg logit contributions
lessly-0.016
der-0.015
ously-0.015
lessness-0.014
rier-0.014
Token saw
Token ID2497
Feature activation+1.97e+00

Pos logit contributions
 hunt+0.076
 hide+0.071
 draw+0.071
 hall+0.070
 pretend+0.070
Neg logit contributions
der-0.090
ously-0.090
lessly-0.087
oke-0.079
lessness-0.076
Token the
Token ID262
Feature activation-1.92e-01

Pos logit contributions
 hunt+0.047
 hide+0.043
 draw+0.043
 hall+0.042
 pretend+0.041
Neg logit contributions
lessly-0.060
der-0.059
ously-0.055
lessness-0.053
rier-0.052
Token rh
Token ID9529
Feature activation-2.16e+00

Pos logit contributions
lessly+0.006
der+0.005
rier+0.005
lessness+0.005
ously+0.005
Neg logit contributions
 hunt-0.005
 hide-0.004
 draw-0.004
 hall-0.004
c-0.004
Tokenin
Token ID259
Feature activation+7.02e-01

Pos logit contributions
lessly+0.051
der+0.047
ously+0.044
lessness+0.042
rier+0.041
Neg logit contributions
 hunt-0.070
 hide-0.066
 hall-0.063
 draw-0.063
 pretend-0.063
Tokenoc
Token ID420
Feature activation-3.01e+00

Pos logit contributions
 hunt+0.019
 hide+0.018
 draw+0.018
 hall+0.017
 pretend+0.017
Neg logit contributions
lessly-0.019
der-0.018
ously-0.017
rier-0.016
lessness-0.016
Tokeneros
Token ID27498
Feature activation+8.87e-01

Pos logit contributions
lessly+0.090
der+0.083
ously+0.079
lessness+0.078
rier+0.075
Neg logit contributions
 hunt-0.079
 hide-0.073
 hall-0.070
 draw-0.070
 pretend-0.069
Token z
Token ID1976
Feature activation-6.11e-01

Pos logit contributions
 hunt+0.024
 hall+0.024
 draw+0.024
 hide+0.024
 drew+0.023
Neg logit contributions
der-0.021
lessly-0.020
ously-0.018
lessness-0.017
rier-0.017
Token.
Token ID13
Feature activation-8.92e-01

Pos logit contributions
lessly+0.121
der+0.114
ously+0.108
lessness+0.105
rier+0.104
Neg logit contributions
 hunt-0.092
 hide-0.087
 land-0.082
 pretend-0.082
 hall-0.081
Token From
Token ID3574
Feature activation-2.01e+00

Pos logit contributions
lessly+0.027
rier+0.025
der+0.025
ously+0.025
lessness+0.023
Neg logit contributions
 hunt-0.021
 hide-0.020
 pretend-0.020
 hall-0.020
 draw-0.019
Token that
Token ID326
Feature activation+8.38e-01

Pos logit contributions
lessly+0.051
der+0.048
ously+0.044
lessness+0.043
rier+0.043
Neg logit contributions
 hunt-0.061
 hide-0.058
 hall-0.056
 draw-0.056
 pretend-0.055
Token day
Token ID1110
Feature activation-2.63e+00

Pos logit contributions
 hunt+0.016
 hide+0.015
 hall+0.014
 draw+0.014
 pretend+0.014
Neg logit contributions
lessly-0.031
der-0.030
ously-0.028
rier-0.028
lessness-0.027
Token on
Token ID319
Feature activation-5.42e-02

Pos logit contributions
der+0.105
lessly+0.099
rier+0.097
ously+0.094
lessness+0.090
Neg logit contributions
 hide-0.040
 hunt-0.037
 land-0.035
 pretend-0.034
 hall-0.033
Token,
Token ID11
Feature activation-2.03e+00

Pos logit contributions
lessly+0.002
der+0.002
ously+0.002
rier+0.002
lessness+0.002
Neg logit contributions
 hunt-0.001
 hide-0.001
 hall-0.001
 pretend-0.001
 draw-0.001
Token they
Token ID484
Feature activation-8.73e-01

Pos logit contributions
lessly+0.061
der+0.057
lessness+0.053
ously+0.053
rier+0.053
Neg logit contributions
 hunt-0.052
 hide-0.048
 draw-0.046
 hall-0.046
 pretend-0.044
Token never
Token ID1239
Feature activation-8.18e-01

Pos logit contributions
lessly+0.026
der+0.024
ously+0.023
lessness+0.023
rier+0.021
Neg logit contributions
 hunt-0.022
 hide-0.021
 hall-0.020
 draw-0.020
 pretend-0.020
Token forgot
Token ID16453
Feature activation-2.66e+00

Pos logit contributions
lessly+0.024
der+0.023
ously+0.022
lessness+0.021
rier+0.020
Neg logit contributions
 hunt-0.021
 hide-0.020
 hall-0.019
 pretend-0.019
 draw-0.018
Token about
Token ID546
Feature activation-1.55e-01

Pos logit contributions
lessly+0.085
der+0.081
ously+0.077
rier+0.075
lessness+0.075
Neg logit contributions
 hunt-0.062
 hide-0.058
 draw-0.055
 hall-0.055
 pretend-0.055
Token the
Token ID262
Feature activation-1.81e+00

Pos logit contributions
lessly+0.005
der+0.005
ously+0.005
lessness+0.005
rier+0.005
Neg logit contributions
 hunt-0.003
 hide-0.003
 hall-0.003
 draw-0.003
c-0.003
Token,"
Token ID553
Feature activation-4.33e-01

Pos logit contributions
 hunt+0.017
 hide+0.016
 hall+0.015
 draw+0.015
 pretend+0.015
Neg logit contributions
lessly-0.021
der-0.020
rier-0.018
ously-0.018
lessness-0.018
Token suggested
Token ID5220
Feature activation-2.55e+00

Pos logit contributions
lessly+0.013
der+0.013
ously+0.012
lessness+0.012
rier+0.012
Neg logit contributions
 hunt-0.011
 hide-0.010
 draw-0.010
 hall-0.010
 pretend-0.009
Token Jack
Token ID3619
Feature activation+1.67e+00

Pos logit contributions
lessly+0.087
der+0.082
ously+0.077
lessness+0.076
rier+0.076
Neg logit contributions
 hunt-0.053
 hide-0.053
 draw-0.049
 pretend-0.048
 hall-0.047
Token.
Token ID13
Feature activation+2.76e-01

Pos logit contributions
 hunt+0.043
 hide+0.040
 hall+0.039
 draw+0.038
 land+0.038
Neg logit contributions
lessly-0.049
der-0.046
ously-0.044
rier-0.043
lessness-0.042
Token Jill
Token ID22772
Feature activation+8.76e-01

Pos logit contributions
 hunt+0.006
 hide+0.006
 draw+0.005
 hall+0.005
 pretend+0.005
Neg logit contributions
lessly-0.009
der-0.009
rier-0.008
ously-0.008
lessness-0.008
Token replied
Token ID8712
Feature activation-2.95e+00

Pos logit contributions
 hunt+0.022
 hide+0.020
 hall+0.020
 draw+0.020
 land+0.019
Neg logit contributions
lessly-0.026
der-0.025
ously-0.024
lessness-0.023
rier-0.022
Token,
Token ID11
Feature activation-2.39e+00

Pos logit contributions
lessly+0.102
der+0.095
arl+0.092
oral+0.092
lessness+0.092
Neg logit contributions
 hide-0.059
 hunt-0.056
 pretend-0.053
 draw-0.052
 play-0.050
Token "
Token ID366
Feature activation+2.32e+00

Pos logit contributions
lessly+0.060
der+0.055
lessness+0.053
ously+0.052
oral+0.052
Neg logit contributions
 hunt-0.068
 hide-0.067
 pretend-0.065
 draw-0.065
 hall-0.063
TokenBut
Token ID1537
Feature activation-9.39e-01

Pos logit contributions
 hunt+0.064
 hide+0.061
 draw+0.060
 pretend+0.059
 play+0.057
Neg logit contributions
lessly-0.063
der-0.061
ously-0.056
lessness-0.055
rier-0.055
Token what
Token ID644
Feature activation-3.87e-01

Pos logit contributions
der+0.023
lessly+0.022
rier+0.020
ously+0.019
lessness+0.019
Neg logit contributions
 hunt-0.028
c-0.026
 hall-0.026
 hide-0.025
 land-0.025
Token if
Token ID611
Feature activation-5.99e-01

Pos logit contributions
der+0.012
lessly+0.011
ously+0.010
lessness+0.010
 importantly+0.010
Neg logit contributions
 hunt-0.009
 hall-0.008
c-0.008
 hide-0.008
 land-0.007
TokenThank
Token ID10449
Feature activation-3.44e+00

Pos logit contributions
 hunt+0.015
 hide+0.015
 draw+0.014
 pretend+0.014
 hall+0.014
Neg logit contributions
lessly-0.017
der-0.016
ously-0.015
rier-0.014
lessness-0.014
Token you
Token ID345
Feature activation-3.80e+00

Pos logit contributions
lessly+0.130
der+0.125
ously+0.120
lessness+0.116
rier+0.115
Neg logit contributions
 hunt-0.062
 hide-0.058
 pretend-0.052
 draw-0.052
 land-0.051
Token for
Token ID329
Feature activation+4.48e-02

Pos logit contributions
lessly+0.124
der+0.121
ously+0.117
lessness+0.108
rier+0.108
Neg logit contributions
 hunt-0.081
 hide-0.080
 hall-0.073
 draw-0.070
 pretend-0.070
Token rescuing
Token ID48329
Feature activation-8.47e-01

Pos logit contributions
 hunt+0.001
 hide+0.001
 hall+0.001
 draw+0.001
 pretend+0.001
Neg logit contributions
lessly-0.001
der-0.001
ously-0.001
lessness-0.001
rier-0.001
Token my
Token ID616
Feature activation-2.16e+00

Pos logit contributions
lessly+0.030
der+0.028
rier+0.027
ously+0.027
lessness+0.026
Neg logit contributions
 hunt-0.017
 hide-0.015
 hall-0.015
 draw-0.015
 pretend-0.014
Token puppy
Token ID26188
Feature activation-1.30e+00

Pos logit contributions
lessly+0.061
der+0.059
ously+0.055
rier+0.054
lessness+0.054
Neg logit contributions
 hunt-0.056
 hide-0.053
 hall-0.051
 draw-0.051
 pretend-0.049
Token".
Token ID1911
Feature activation-1.64e+00

Pos logit contributions
lessly+0.038
der+0.036
rier+0.033
ously+0.033
lessness+0.033
Neg logit contributions
 hunt-0.033
 hide-0.031
 hall-0.030
 draw-0.030
 pretend-0.029
Token The
Token ID383
Feature activation-3.36e+00

Pos logit contributions
lessly+0.053
der+0.051
rier+0.049
ously+0.048
lessness+0.047
Neg logit contributions
 hunt-0.037
 hide-0.035
 draw-0.033
 play-0.032
 hall-0.032
Token knight
Token ID22062
Feature activation-2.33e+00

Pos logit contributions
lessly+0.096
der+0.090
ously+0.084
lessness+0.083
rier+0.083
Neg logit contributions
 hunt-0.091
 hide-0.086
 draw-0.082
 hall-0.081
 pretend-0.081
Token smiled
Token ID13541
Feature activation-5.09e+00

Pos logit contributions
lessly+0.069
der+0.066
ously+0.062
lessness+0.060
rier+0.060
Neg logit contributions
 hunt-0.060
 hide-0.056
 draw-0.053
 hall-0.053
 pretend-0.053
Token and
Token ID290
Feature activation-2.94e+00

Pos logit contributions
oral+0.190
lessly+0.189
arl+0.187
oke+0.186
lessness+0.176
Neg logit contributions
 pretend-0.067
 play-0.065
 land-0.064
 hide-0.062
 outside-0.060
Token
Token ID198
Feature activation-3.23e-01

Pos logit contributions
der+0.009
lessly+0.009
rier+0.009
ously+0.009
lessness+0.008
Neg logit contributions
 hunt-0.007
 hide-0.007
 hall-0.007
 draw-0.007
 pretend-0.006
Token"
Token ID1
Feature activation+1.46e+00

Pos logit contributions
der+0.008
lessly+0.008
ously+0.007
rier+0.007
lessness+0.006
Neg logit contributions
 hunt-0.010
 hide-0.010
 hall-0.010
 draw-0.009
 pretend-0.009
TokenWhat
Token ID2061
Feature activation+5.55e-01

Pos logit contributions
 hunt+0.042
 hide+0.040
 hall+0.039
 draw+0.039
 pretend+0.038
Neg logit contributions
lessly-0.039
der-0.037
ously-0.034
rier-0.033
lessness-0.033
Token have
Token ID423
Feature activation+5.94e-01

Pos logit contributions
 hunt+0.014
 hall+0.012
c+0.012
 land+0.012
 forest+0.011
Neg logit contributions
der-0.015
lessly-0.014
lessness-0.014
ously-0.014
oke-0.013
Token you
Token ID345
Feature activation+1.81e+00

Pos logit contributions
 hunt+0.015
c+0.014
 hide+0.014
 draw+0.014
 hall+0.014
Neg logit contributions
der-0.016
lessly-0.016
lessness-0.014
ously-0.014
 importantly-0.014
Token done
Token ID1760
Feature activation-4.82e-01

Pos logit contributions
 hunt+0.054
c+0.050
 hall+0.049
 hide+0.049
 library+0.049
Neg logit contributions
lessly-0.042
der-0.037
lessness-0.036
arl-0.034
ously-0.034
Token?"
Token ID1701
Feature activation-6.34e-01

Pos logit contributions
lessly+0.015
der+0.014
rier+0.013
ously+0.013
lessness+0.013
Neg logit contributions
 hunt-0.011
c-0.010
 hide-0.010
 hall-0.010
 draw-0.010
Token Mom
Token ID11254
Feature activation+9.57e-01

Pos logit contributions
lessly+0.015
der+0.014
ously+0.013
rier+0.013
lessness+0.013
Neg logit contributions
 hunt-0.020
 hide-0.019
 draw-0.018
 hall-0.018
 pretend-0.018
Token shouted
Token ID17293
Feature activation-4.17e+00

Pos logit contributions
 hunt+0.023
 hide+0.021
 hall+0.020
 pretend+0.020
 land+0.020
Neg logit contributions
lessly-0.030
der-0.028
ously-0.027
lessness-0.026
rier-0.026
Token.
Token ID13
Feature activation+1.92e-01

Pos logit contributions
lessly+0.148
der+0.138
ously+0.132
oral+0.132
lessness+0.131
Neg logit contributions
 hide-0.081
 hunt-0.076
 draw-0.074
 pretend-0.073
 play-0.070
Token "
Token ID366
Feature activation+1.64e+00

Pos logit contributions
 hall+0.004
 hide+0.004
 garden+0.004
 draw+0.004
 park+0.004
Neg logit contributions
lessly-0.006
ously-0.006
rier-0.006
der-0.006
lessness-0.005
Token!"
Token ID2474
Feature activation-1.28e+00

Pos logit contributions
lessly+0.041
der+0.039
ously+0.036
lessness+0.036
rier+0.036
Neg logit contributions
 hunt-0.032
 hide-0.031
 draw-0.029
 pretend-0.029
 hall-0.028
Token
Token ID198
Feature activation-3.60e+00

Pos logit contributions
lessly+0.038
der+0.036
rier+0.034
ously+0.034
lessness+0.033
Neg logit contributions
 hunt-0.032
 hide-0.031
 draw-0.030
 hall-0.030
 pretend-0.029
Token
Token ID198
Feature activation-3.24e+00

Pos logit contributions
lessly+0.112
der+0.108
rier+0.102
ously+0.098
lessness+0.098
Neg logit contributions
 hunt-0.088
 hide-0.085
 draw-0.082
 hall-0.079
 park-0.078
TokenGrand
Token ID23581
Feature activation-2.24e+00

Pos logit contributions
lessly+0.090
der+0.087
rier+0.083
lessness+0.078
ously+0.077
Neg logit contributions
 hunt-0.089
 draw-0.086
 hide-0.086
 hall-0.083
 park-0.082
Tokenma
Token ID2611
Feature activation-1.29e+00

Pos logit contributions
lessly+0.076
der+0.072
ously+0.069
rier+0.067
lessness+0.067
Neg logit contributions
 hunt-0.049
 hide-0.045
 draw-0.043
 hall-0.043
 pretend-0.042
Token was
Token ID373
Feature activation-4.38e+00

Pos logit contributions
lessly+0.039
der+0.037
rier+0.035
ously+0.034
lessness+0.034
Neg logit contributions
 hunt-0.032
 hide-0.031
 draw-0.029
 pretend-0.028
 hall-0.028
Token still
Token ID991
Feature activation-2.87e+00

Pos logit contributions
der+0.118
lessly+0.115
ously+0.105
lessness+0.101
rier+0.099
Neg logit contributions
 hide-0.125
 hunt-0.117
 pretend-0.115
 draw-0.113
 park-0.113
Token a
Token ID257
Feature activation-3.22e+00

Pos logit contributions
lessly+0.078
der+0.075
ously+0.069
rier+0.067
lessness+0.067
Neg logit contributions
 hunt-0.080
 hide-0.078
 draw-0.074
 hall-0.073
 pretend-0.073
Token bit
Token ID1643
Feature activation-5.31e+00

Pos logit contributions
lessly+0.094
der+0.093
rier+0.085
ously+0.083
lessness+0.079
Neg logit contributions
 hunt-0.083
 hide-0.082
 pretend-0.082
 hall-0.081
 draw-0.079
Token worried
Token ID7960
Feature activation-4.12e+00

Pos logit contributions
lessly+0.142
der+0.142
ously+0.135
oral+0.134
fulness+0.127
Neg logit contributions
 hide-0.132
 pretend-0.126
c-0.123
 play-0.121
 draw-0.120
Token,
Token ID11
Feature activation-4.01e+00

Pos logit contributions
lessly+0.136
der+0.128
rier+0.126
oral+0.125
arl+0.125
Neg logit contributions
 hide-0.079
 hunt-0.078
 draw-0.074
 pretend-0.073
 play-0.070
Token he
Token ID339
Feature activation-1.04e+00

Pos logit contributions
lessly+0.054
der+0.051
ously+0.048
lessness+0.046
rier+0.046
Neg logit contributions
 hunt-0.063
 hide-0.060
 draw-0.058
 hall-0.058
 pretend-0.057
Token was
Token ID373
Feature activation-2.68e+00

Pos logit contributions
lessly+0.031
der+0.030
ously+0.028
lessness+0.027
rier+0.027
Neg logit contributions
 hunt-0.026
 hide-0.024
 hall-0.023
 draw-0.023
 pretend-0.023
Token determined
Token ID5295
Feature activation-2.28e+00

Pos logit contributions
lessly+0.070
der+0.067
ously+0.063
rier+0.061
lessness+0.061
Neg logit contributions
 hunt-0.077
 hide-0.075
 draw-0.071
 pretend-0.070
 hall-0.070
Token to
Token ID284
Feature activation+3.71e+00

Pos logit contributions
lessly+0.079
rier+0.075
der+0.074
lessness+0.072
arl+0.071
Neg logit contributions
 hunt-0.048
 hide-0.044
 draw-0.041
 play-0.041
 park-0.040
Token be
Token ID307
Feature activation-1.06e+00

Pos logit contributions
c+0.079
 hall+0.076
 library+0.071
 hunt+0.070
 gardens+0.068
Neg logit contributions
lessly-0.109
der-0.107
ously-0.102
 understanding-0.101
rier-0.099
Token very
Token ID845
Feature activation-4.51e+00

Pos logit contributions
lessly+0.026
der+0.024
ously+0.022
rier+0.022
lessness+0.022
Neg logit contributions
 hunt-0.032
 hall-0.030
 hide-0.029
 draw-0.029
c-0.029
Token naughty
Token ID45128
Feature activation-3.75e+00

Pos logit contributions
der+0.120
lessly+0.118
ously+0.116
lessness+0.107
oral+0.101
Neg logit contributions
 hide-0.127
 hunt-0.123
 pretend-0.118
 play-0.117
 land-0.116
Token.
Token ID13
Feature activation+2.03e-02

Pos logit contributions
lessly+0.125
der+0.115
ously+0.113
oral+0.112
lessness+0.111
Neg logit contributions
 hunt-0.078
 hide-0.077
 pretend-0.070
 hall-0.068
 draw-0.068
Token He
Token ID679
Feature activation+6.35e-02

Pos logit contributions
 hunt+0.001
 hide+0.001
 hall+0.001
 draw+0.001
 pretend+0.001
Neg logit contributions
lessly-0.001
der-0.000
ously-0.000
rier-0.000
lessness-0.000
Token went
Token ID1816
Feature activation+9.90e-01

Pos logit contributions
 hunt+0.002
 hide+0.001
 hall+0.001
 draw+0.001
 land+0.001
Neg logit contributions
lessly-0.002
der-0.002
ously-0.002
lessness-0.002
rier-0.002
Token to
Token ID284
Feature activation+4.71e+00

Pos logit contributions
 hunt+0.026
 hide+0.024
 draw+0.024
c+0.024
 pretend+0.023
Neg logit contributions
lessly-0.028
der-0.026
ously-0.025
lessness-0.023
rier-0.023
TokenL
Token ID43
Feature activation-2.50e+00

Pos logit contributions
der+0.070
lessly+0.069
rier+0.062
ously+0.061
lessness+0.060
Neg logit contributions
 hunt-0.069
 hide-0.069
 draw-0.066
 pretend-0.066
 play-0.066
Tokenily
Token ID813
Feature activation+4.24e-02

Pos logit contributions
lessly+0.063
der+0.059
ously+0.055
rier+0.053
lessness+0.053
Neg logit contributions
 hunt-0.076
 hide-0.073
 draw-0.070
 hall-0.069
 pretend-0.069
Token smiled
Token ID13541
Feature activation-3.52e+00

Pos logit contributions
 hunt+0.001
 hide+0.001
 hall+0.001
 draw+0.001
 land+0.001
Neg logit contributions
lessly-0.001
der-0.001
ously-0.001
lessness-0.001
rier-0.001
Token.
Token ID13
Feature activation+1.30e-03

Pos logit contributions
lessly+0.134
oral+0.124
oke+0.123
lessness+0.123
ously+0.122
Neg logit contributions
 play-0.048
 hide-0.047
 pretend-0.047
 land-0.045
 draw-0.044
Token She
Token ID1375
Feature activation-1.12e+00

Pos logit contributions
 hunt+0.000
 hide+0.000
 draw+0.000
 hall+0.000
 pretend+0.000
Neg logit contributions
lessly-0.000
der-0.000
ously-0.000
rier-0.000
lessness-0.000
Token forg
Token ID5784
Feature activation-4.02e+00

Pos logit contributions
lessly+0.037
der+0.035
ously+0.033
lessness+0.033
rier+0.032
Neg logit contributions
 hunt-0.025
 hide-0.023
 hall-0.022
 draw-0.022
 pretend-0.022
Tokenave
Token ID1015
Feature activation-1.91e+00

Pos logit contributions
lessly+0.098
rier+0.097
der+0.092
ously+0.086
oral+0.085
Neg logit contributions
 hunt-0.112
 drew-0.109
 hide-0.108
 draw-0.105
c-0.104
Token her
Token ID607
Feature activation-3.33e+00

Pos logit contributions
lessly+0.054
der+0.052
ously+0.050
lessness+0.047
rier+0.046
Neg logit contributions
 hunt-0.049
 hall-0.044
 hide-0.043
 land-0.043
 pretend-0.042
Token brother
Token ID3956
Feature activation-5.04e-01

Pos logit contributions
lessly+0.103
der+0.096
ously+0.091
rier+0.089
lessness+0.089
Neg logit contributions
 hunt-0.083
 hide-0.079
 draw-0.075
 hall-0.075
 pretend-0.074
Token.
Token ID13
Feature activation-1.40e-01

Pos logit contributions
lessly+0.014
der+0.013
ously+0.012
lessness+0.012
rier+0.012
Neg logit contributions
 hunt-0.014
 hide-0.013
 hall-0.012
 draw-0.012
 land-0.012
Token She
Token ID1375
Feature activation-9.52e-01

Pos logit contributions
lessly+0.004
der+0.004
ously+0.004
rier+0.004
lessness+0.004
Neg logit contributions
 hunt-0.003
 hide-0.003
 draw-0.003
 hall-0.003
 pretend-0.003
Token proud
Token ID6613
Feature activation-4.37e+00

Pos logit contributions
der+0.138
lessly+0.129
ously+0.126
oral+0.114
arl+0.114
Neg logit contributions
 hide-0.144
 pretend-0.141
 land-0.140
 draw-0.137
 play-0.137
Token.
Token ID13
Feature activation-9.86e-01

Pos logit contributions
lessly+0.110
arl+0.099
rier+0.099
der+0.096
ously+0.093
Neg logit contributions
 hunt-0.125
 hide-0.123
 draw-0.117
 pretend-0.117
 land-0.116
Token He
Token ID679
Feature activation-2.09e+00

Pos logit contributions
lessly+0.026
rier+0.026
der+0.026
ously+0.024
arl+0.023
Neg logit contributions
 hunt-0.025
 hide-0.024
 draw-0.024
 play-0.024
 park-0.023
Token had
Token ID550
Feature activation+1.26e-01

Pos logit contributions
lessly+0.063
der+0.059
ously+0.056
lessness+0.054
rier+0.054
Neg logit contributions
 hunt-0.054
 hide-0.050
 hall-0.048
 draw-0.048
 pretend-0.047
Token been
Token ID587
Feature activation-2.16e+00

Pos logit contributions
 hunt+0.003
 hide+0.003
 hall+0.003
 pretend+0.003
 draw+0.003
Neg logit contributions
lessly-0.004
der-0.004
ously-0.004
lessness-0.003
rier-0.003
Token clever
Token ID14169
Feature activation-3.98e+00

Pos logit contributions
lessly+0.055
der+0.052
ously+0.048
lessness+0.047
rier+0.047
Neg logit contributions
 hunt-0.065
 hide-0.061
 hall-0.059
 draw-0.059
 pretend-0.058
Token and
Token ID290
Feature activation-2.35e+00

Pos logit contributions
lessly+0.117
der+0.109
rier+0.102
arl+0.101
lessness+0.101
Neg logit contributions
 hunt-0.101
 hide-0.099
 pretend-0.093
 draw-0.091
 land-0.089
Token done
Token ID1760
Feature activation-2.78e+00

Pos logit contributions
lessly+0.065
der+0.059
lessness+0.056
ously+0.056
rier+0.053
Neg logit contributions
 hunt-0.066
 hide-0.060
 hall-0.060
 draw-0.058
c-0.058
Token something
Token ID1223
Feature activation-4.19e+00

Pos logit contributions
lessly+0.089
der+0.084
ously+0.080
lessness+0.078
rier+0.078
Neg logit contributions
 hunt-0.066
 hide-0.061
 hall-0.058
 draw-0.058
 pretend-0.057
Token amazing
Token ID4998
Feature activation-2.23e+00

Pos logit contributions
lessly+0.121
der+0.116
ously+0.113
lessness+0.106
rier+0.106
Neg logit contributions
 hunt-0.108
 hide-0.103
 pretend-0.099
 draw-0.097
 land-0.096
Token.
Token ID13
Feature activation-7.68e-01

Pos logit contributions
lessly+0.067
der+0.064
ously+0.060
lessness+0.059
rier+0.059
Neg logit contributions
 hunt-0.056
 hide-0.053
 draw-0.050
 hall-0.050
 pretend-0.050
Token
Token ID198
Feature activation-3.28e+00

Pos logit contributions
der+0.099
lessly+0.099
rier+0.095
ously+0.090
lessness+0.086
Neg logit contributions
 hunt-0.077
 hide-0.075
 draw-0.071
 hall-0.070
 play-0.070
TokenThe
Token ID464
Feature activation-3.78e+00

Pos logit contributions
der+0.088
lessly+0.088
rier+0.084
ously+0.080
lessness+0.077
Neg logit contributions
 hunt-0.088
 hide-0.087
 draw-0.084
c-0.083
 hall-0.083
Token little
Token ID1310
Feature activation-3.42e+00

Pos logit contributions
lessly+0.087
der+0.081
lessness+0.073
ously+0.073
rier+0.072
Neg logit contributions
 hunt-0.124
 hide-0.116
 draw-0.111
 hall-0.110
c-0.109
Token girl
Token ID2576
Feature activation-1.36e+00

Pos logit contributions
lessly+0.070
der+0.065
ously+0.062
rier+0.060
lessness+0.059
Neg logit contributions
 hunt-0.118
 hide-0.106
 hall-0.103
 draw-0.103
c-0.102
Token was
Token ID373
Feature activation-3.58e+00

Pos logit contributions
lessly+0.043
der+0.041
ously+0.039
lessness+0.038
rier+0.037
Neg logit contributions
 hunt-0.032
 hide-0.029
 hall-0.028
 draw-0.028
 pretend-0.027
Token happy
Token ID3772
Feature activation-3.82e+00

Pos logit contributions
lessly+0.094
der+0.091
ously+0.084
rier+0.081
lessness+0.080
Neg logit contributions
 hunt-0.102
 hide-0.100
 pretend-0.095
 draw-0.095
 hall-0.095
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
lessly+0.122
der+0.111
rier+0.109
arl+0.108
ously+0.106
Neg logit contributions
 hunt-0.088
 hide-0.086
 draw-0.081
 pretend-0.080
 land-0.079
Token She
Token ID1375
Feature activation-2.17e+00

Pos logit contributions
lessly+0.040
der+0.037
rier+0.037
ously+0.036
lessness+0.035
Neg logit contributions
 hunt-0.024
 draw-0.023
 hide-0.023
 hall-0.023
 library-0.022
Token now
Token ID783
Feature activation-1.68e+00

Pos logit contributions
lessly+0.064
der+0.060
ously+0.057
lessness+0.055
rier+0.055
Neg logit contributions
 hunt-0.057
 hide-0.053
 hall-0.051
 draw-0.051
 pretend-0.050
Token had
Token ID550
Feature activation-1.37e-01

Pos logit contributions
lessly+0.051
der+0.048
ously+0.045
lessness+0.044
rier+0.043
Neg logit contributions
 hunt-0.043
 hide-0.040
 hall-0.038
 draw-0.038
 pretend-0.038
Token something
Token ID1223
Feature activation-3.39e+00

Pos logit contributions
lessly+0.004
der+0.004
ously+0.004
lessness+0.004
 understanding+0.004
Neg logit contributions
 hunt-0.003
 hall-0.003
 hide-0.003
 pretend-0.003
 draw-0.003