TOP ACTIVATIONS
MAX = 7.268

good . They learned a lot . <|endoftext|> Anna and Ben
Dad . They learned a lot from the cord . <|endoftext|>
careful . They learned a lot . <|endoftext|> S ara and
and tries to lick the lot ion off . He does
smart . You learned a lot today . And you learned
nice . We learned a lot today . We learned how
a lot , learn a lot , and care a lot
him . They learned a lot from him . They learned
rewarding . They learned a lot from the mixer . <|endoftext|>
They decide to share the lot ion . They rub it
mom . They learned a lot . <|endoftext|> L ily and
. They hoped for a lot of things . But they
sea . They learned a lot about the reef and the
obedient . He learned a lot that day . <|endoftext|> Sam
. She had learned a lot from the novel .
respectful . They learned a lot from the spider and the
have fun and laugh a lot . They do not miss
. We have learned a lot today . We will always
day . They learned a lot and made a new friend
He does not laugh a lot . Tom wants

MIN ACTIVATIONS
MIN = -9.428

and slides down . " Whe e !" she says .
But then Joe noticed something else . He saw a box
But then Mia noticed something else that made her gasp .
Baby Bear looked up and said , " But
F uzz y saw something up ahead and started running .
â The bunny looked up and shook his ears ,
The younger sister looked up and smiled . " I
cat ?" The cat looked up to see a rabbit spring
of wheat . Paul looked up at the wheat and found
The girl looked up and saw a lady with
but then Jack spotted something up ahead . Around a bend
away but he noticed something else on the island . It
at him . Bill looked up and he saw a huge
a sign that said " Whe at for Sale ". Lily
noise . " Whe e !" Anna and Ben
â Jim looked up and around the forest .
that it made everyone look up in wonder . It was
The old man looked up and saw a big ,
eyes and yelled , " Whe ee e !" as the
Tom and Anna look up . They see a boy

INTERVAL 3.094 - 7.268
CONTAINS 0.885%

the veterinarian , organized and neat like she always was .
It 's not scary or mean . It 's just doing
bus and walk into the library . One day
wished he had not run away . She wondered if he
. He laughed and ran away with Anna .

INTERVAL -1.080 - 3.094
CONTAINS 81.219%

friends , Ben and Dave . They were jumping and running
bed . She hugged them both and said , " Good
. <|endoftext|> Once there was a t eddy bear . He
. Max heard her voice from under the bed
He gave her a small box . She opened it and

INTERVAL -5.254 - -1.080
CONTAINS 17.817%

Joe was so proud of himself for learning how
. The boy helped the pony with its hurt
circle block to a green triangle block . She makes a
. He would never forget how to do it . <|endoftext|>
, some mint tea for you ," Ben said , giving

INTERVAL -9.428 - -5.254
CONTAINS 0.079%

M olly looked up with confusion and said ,
. Just then , something else happened . He heard a
the ground and picked it up . She brought it back
it . He picked it up , but it was very
a noise . She looked up to see her friend Bob
Token good
Token ID922
Feature activation+1.45e+00

Pos logit contributions
opl+0.061
agons+0.061
akers+0.060
ines+0.060
eds+0.059
Neg logit contributions
 hes-0.091
 curiously-0.085
 toddler-0.083
 hopefully-0.081
 intrigued-0.081
Token.
Token ID13
Feature activation+1.47e+00

Pos logit contributions
akers+0.035
opl+0.035
agons+0.035
eds+0.034
ines+0.034
Neg logit contributions
 hes-0.064
 toddler-0.061
 curiously-0.060
 hopefully-0.057
 intrigued-0.056
Token They
Token ID1119
Feature activation+2.35e+00

Pos logit contributions
akers+0.051
agons+0.050
ines+0.049
eds+0.049
opl+0.048
Neg logit contributions
 hes-0.050
 toddler-0.048
 curiously-0.048
 Int-0.046
 hopefully-0.044
Token learned
Token ID4499
Feature activation+5.02e+00

Pos logit contributions
akers+0.077
agons+0.075
ines+0.075
opl+0.074
eds+0.073
Neg logit contributions
 hes-0.082
 curiously-0.078
 toddler-0.078
 Int-0.072
 intrigued-0.071
Token a
Token ID257
Feature activation+1.70e+00

Pos logit contributions
akers+0.138
agons+0.132
opl+0.131
ines+0.129
eds+0.126
Neg logit contributions
 hes-0.215
 curiously-0.205
 toddler-0.199
 hopefully-0.193
 marvel-0.187
Token lot
Token ID1256
Feature activation+7.27e+00

Pos logit contributions
akers+0.069
opl+0.069
agons+0.066
ons+0.065
ards+0.065
Neg logit contributions
 curiously-0.048
 toddler-0.048
 hes-0.047
 marvel-0.042
 intrigued-0.042
Token.
Token ID13
Feature activation+1.74e+00

Pos logit contributions
akers+0.275
eds+0.271
agons+0.271
opl+0.260
arter+0.256
Neg logit contributions
 hes-0.209
 curiously-0.199
 hopefully-0.185
 toddler-0.181
 nervously-0.178
Token<|endoftext|>
Token ID50256
Feature activation+3.95e-01

Pos logit contributions
akers+0.060
agons+0.059
ines+0.058
eds+0.058
opl+0.056
Neg logit contributions
 hes-0.059
 toddler-0.057
 curiously-0.056
 Int-0.055
 Exc-0.053
TokenAnna
Token ID31160
Feature activation+1.36e+00

Pos logit contributions
akers+0.016
agons+0.016
opl+0.015
ines+0.015
eds+0.015
Neg logit contributions
 hes-0.011
 toddler-0.010
 curiously-0.010
 Int-0.009
 marvel-0.009
Token and
Token ID290
Feature activation+6.24e-01

Pos logit contributions
akers+0.043
agons+0.042
opl+0.042
ines+0.041
eds+0.041
Neg logit contributions
 hes-0.049
 toddler-0.046
 curiously-0.046
 marvel-0.043
 Int-0.043
Token Ben
Token ID3932
Feature activation+5.25e-01

Pos logit contributions
 Kim+0.012
 ow+0.012
arl+0.011
ards+0.011
icks+0.011
Neg logit contributions
 hes-0.028
 curiously-0.028
 toddler-0.027
 Int-0.026
 intrigued-0.026
Token Dad
Token ID17415
Feature activation+3.15e-01

Pos logit contributions
akers+0.042
ards+0.039
arl+0.039
eds+0.039
smart+0.039
Neg logit contributions
 hes-0.045
 curiously-0.045
 Int-0.044
 toddler-0.042
 Exc-0.041
Token.
Token ID13
Feature activation+1.46e+00

Pos logit contributions
agons+0.010
akers+0.010
opl+0.010
arter+0.009
eds+0.009
Neg logit contributions
 hes-0.012
 curiously-0.011
 toddler-0.011
 hopefully-0.011
 marvel-0.010
Token They
Token ID1119
Feature activation+2.30e+00

Pos logit contributions
akers+0.045
agons+0.044
ines+0.043
eds+0.043
opl+0.043
Neg logit contributions
 hes-0.055
 curiously-0.053
 toddler-0.053
 Int-0.050
 hopefully-0.049
Token learned
Token ID4499
Feature activation+4.97e+00

Pos logit contributions
akers+0.071
agons+0.070
ines+0.070
opl+0.069
eds+0.067
Neg logit contributions
 hes-0.083
 toddler-0.080
 curiously-0.079
 Int-0.075
 intrigued-0.073
Token a
Token ID257
Feature activation+1.81e+00

Pos logit contributions
akers+0.148
agons+0.142
opl+0.141
ines+0.138
eds+0.138
Neg logit contributions
 hes-0.200
 curiously-0.192
 toddler-0.186
 hopefully-0.179
 marvel-0.174
Token lot
Token ID1256
Feature activation+7.24e+00

Pos logit contributions
akers+0.076
opl+0.074
agons+0.071
ards+0.071
ons+0.070
Neg logit contributions
 curiously-0.051
 hes-0.049
 toddler-0.049
 marvel-0.043
 intrigued-0.043
Token from
Token ID422
Feature activation+1.12e+00

Pos logit contributions
akers+0.281
eds+0.279
agons+0.276
arter+0.265
opl+0.265
Neg logit contributions
 hes-0.206
 curiously-0.195
 hopefully-0.180
 toddler-0.177
 nervously-0.172
Token the
Token ID262
Feature activation+1.79e+00

Pos logit contributions
agons+0.041
akers+0.040
opl+0.040
ines+0.040
arl+0.039
Neg logit contributions
 hes-0.037
 curiously-0.035
 toddler-0.034
 hopefully-0.031
 marvel-0.031
Token cord
Token ID15050
Feature activation+2.60e+00

Pos logit contributions
akers+0.055
ines+0.054
agons+0.053
opl+0.053
eds+0.053
Neg logit contributions
 hes-0.066
 toddler-0.064
 curiously-0.063
 marvel-0.059
 hopefully-0.058
Token.
Token ID13
Feature activation+1.59e+00

Pos logit contributions
agons+0.080
akers+0.078
opl+0.075
eds+0.075
arl+0.074
Neg logit contributions
 hes-0.098
 curiously-0.095
 toddler-0.093
 hopefully-0.089
 nervously-0.086
Token<|endoftext|>
Token ID50256
Feature activation+1.36e-01

Pos logit contributions
akers+0.049
agons+0.048
ines+0.047
eds+0.047
opl+0.046
Neg logit contributions
 hes-0.060
 toddler-0.057
 curiously-0.057
 Int-0.055
 intrigued-0.054
Token careful
Token ID8161
Feature activation-2.34e-01

Pos logit contributions
agons+0.078
opl+0.078
ines+0.078
arter+0.075
eds+0.074
Neg logit contributions
 hes-0.073
 intrigued-0.068
 toddler-0.067
 curiously-0.066
 hopefully-0.065
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
 hes+0.008
 curiously+0.008
 toddler+0.007
 hopefully+0.007
 intrigued+0.007
Neg logit contributions
agons-0.008
akers-0.008
opl-0.008
arter-0.007
arl-0.007
Token They
Token ID1119
Feature activation+2.24e+00

Pos logit contributions
akers+0.047
agons+0.046
ines+0.045
opl+0.045
eds+0.044
Neg logit contributions
 hes-0.047
 curiously-0.044
 toddler-0.044
 Int-0.041
 hopefully-0.041
Token learned
Token ID4499
Feature activation+5.13e+00

Pos logit contributions
akers+0.076
agons+0.074
opl+0.074
ines+0.073
eds+0.073
Neg logit contributions
 hes-0.076
 curiously-0.072
 toddler-0.072
 marvel-0.067
 Int-0.066
Token a
Token ID257
Feature activation+1.80e+00

Pos logit contributions
akers+0.180
agons+0.174
opl+0.174
ines+0.169
eds+0.167
Neg logit contributions
 hes-0.178
 curiously-0.168
 toddler-0.162
 hopefully-0.155
 intrigued-0.152
Token lot
Token ID1256
Feature activation+7.22e+00

Pos logit contributions
akers+0.076
opl+0.075
ons+0.073
cks+0.070
ards+0.070
Neg logit contributions
 curiously-0.048
 toddler-0.047
 hes-0.046
 intrigued-0.042
 marvel-0.041
Token.
Token ID13
Feature activation+1.68e+00

Pos logit contributions
akers+0.283
eds+0.273
agons+0.270
opl+0.266
arter+0.261
Neg logit contributions
 hes-0.203
 curiously-0.197
 hopefully-0.174
 nervously-0.173
 toddler-0.173
Token<|endoftext|>
Token ID50256
Feature activation+3.24e-01

Pos logit contributions
akers+0.058
agons+0.057
ines+0.056
eds+0.055
opl+0.055
Neg logit contributions
 hes-0.057
 toddler-0.054
 curiously-0.054
 Int-0.052
 hopefully-0.051
TokenS
Token ID50
Feature activation+3.59e+00

Pos logit contributions
akers+0.013
agons+0.013
ines+0.012
opl+0.012
arter+0.012
Neg logit contributions
 hes-0.009
 toddler-0.008
 curiously-0.008
 Int-0.008
 marvel-0.007
Tokenara
Token ID3301
Feature activation+2.37e+00

Pos logit contributions
akers+0.133
agons+0.130
arter+0.128
opl+0.127
smart+0.126
Neg logit contributions
 hes-0.113
 curiously-0.106
 toddler-0.103
 marvel-0.098
 Int-0.093
Token and
Token ID290
Feature activation+4.62e-01

Pos logit contributions
agons+0.073
akers+0.073
opl+0.071
ines+0.070
arl+0.070
Neg logit contributions
 hes-0.087
 toddler-0.082
 curiously-0.082
 marvel-0.077
 intrigued-0.076
Token and
Token ID290
Feature activation-2.20e-02

Pos logit contributions
akers+0.024
agons+0.024
opl+0.022
arter+0.022
eds+0.022
Neg logit contributions
 hes-0.027
 curiously-0.027
 toddler-0.025
 hopefully-0.024
 intrigued-0.024
Token tries
Token ID8404
Feature activation-9.25e-01

Pos logit contributions
 toddler+0.001
 hes+0.001
 curiously+0.001
 Int+0.001
 hopefully+0.001
Neg logit contributions
opl-0.001
ines-0.001
ards-0.001
agons-0.001
eds-0.001
Token to
Token ID284
Feature activation+7.19e-01

Pos logit contributions
 hes+0.028
 curiously+0.026
 toddler+0.025
 hopefully+0.023
 marvel+0.023
Neg logit contributions
akers-0.037
agons-0.036
opl-0.036
ines-0.035
arter-0.034
Token lick
Token ID36728
Feature activation+4.25e-01

Pos logit contributions
akers+0.023
agons+0.023
opl+0.023
ines+0.023
eds+0.022
Neg logit contributions
 hes-0.025
 toddler-0.024
 curiously-0.023
 Int-0.022
 intrigued-0.022
Token the
Token ID262
Feature activation+1.08e+00

Pos logit contributions
akers+0.018
opl+0.017
agons+0.017
ines+0.016
arter+0.016
Neg logit contributions
 hes-0.012
 curiously-0.012
 toddler-0.011
 hopefully-0.011
 nervously-0.010
Token lot
Token ID1256
Feature activation+7.19e+00

Pos logit contributions
arter+0.032
akers+0.032
opl+0.031
smart+0.030
ines+0.029
Neg logit contributions
 toddler-0.041
 hes-0.041
 curiously-0.040
 daughter-0.037
 Int-0.037
Tokenion
Token ID295
Feature activation+2.47e-01

Pos logit contributions
akers+0.230
arter+0.218
agons+0.217
opl+0.205
eds+0.196
Neg logit contributions
 curiously-0.260
 hes-0.251
 toddler-0.230
 nervously-0.230
 hopefully-0.226
Token off
Token ID572
Feature activation+1.88e+00

Pos logit contributions
akers+0.007
agons+0.007
opl+0.007
eds+0.007
arl+0.006
Neg logit contributions
 curiously-0.010
 hes-0.010
 toddler-0.009
 hopefully-0.009
 nervously-0.009
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
akers+0.077
opl+0.074
agons+0.073
ines+0.070
arter+0.069
Neg logit contributions
 curiously-0.057
 hes-0.054
 toddler-0.051
 nervously-0.050
 hopefully-0.048
Token He
Token ID679
Feature activation+1.01e+00

Pos logit contributions
akers+0.040
agons+0.039
opl+0.039
ines+0.039
eds+0.038
Neg logit contributions
 hes-0.037
 curiously-0.035
 toddler-0.035
 Int-0.032
 marvel-0.032
Token does
Token ID857
Feature activation+1.94e+00

Pos logit contributions
ines+0.027
agons+0.027
opl+0.026
akers+0.026
eds+0.026
Neg logit contributions
 hes-0.040
 toddler-0.040
 curiously-0.039
 hopefully-0.037
 marvel-0.037
Token smart
Token ID4451
Feature activation+3.61e-01

Pos logit contributions
opl+0.080
eds+0.078
ines+0.074
orns+0.074
agons+0.073
Neg logit contributions
 intrigued-0.054
 toddler-0.053
 marvel-0.052
 hes-0.050
 curiously-0.049
Token.
Token ID13
Feature activation+2.60e+00

Pos logit contributions
akers+0.009
agons+0.008
opl+0.008
ines+0.008
eds+0.008
Neg logit contributions
 hes-0.016
 toddler-0.015
 curiously-0.015
 Int-0.014
 marvel-0.014
Token You
Token ID921
Feature activation+8.29e-01

Pos logit contributions
akers+0.083
agons+0.083
ines+0.083
eds+0.081
opl+0.080
Neg logit contributions
 hes-0.092
 toddler-0.089
 curiously-0.086
 Int-0.083
 hopefully-0.082
Token learned
Token ID4499
Feature activation+4.98e+00

Pos logit contributions
eds+0.026
opl+0.026
arter+0.025
arl+0.025
akers+0.024
Neg logit contributions
 hes-0.032
 marvel-0.030
 toddler-0.029
 intrigued-0.028
 curiously-0.028
Token a
Token ID257
Feature activation+1.32e+00

Pos logit contributions
akers+0.180
opl+0.172
agons+0.171
ines+0.170
arter+0.164
Neg logit contributions
 hes-0.171
 curiously-0.160
 toddler-0.156
 hopefully-0.147
 Int-0.143
Token lot
Token ID1256
Feature activation+7.17e+00

Pos logit contributions
akers+0.056
opl+0.054
ards+0.052
agons+0.052
ons+0.051
Neg logit contributions
 curiously-0.037
 hes-0.036
 toddler-0.036
 marvel-0.033
 intrigued-0.032
Token today
Token ID1909
Feature activation+6.12e-01

Pos logit contributions
akers+0.258
agons+0.258
eds+0.255
arter+0.248
opl+0.241
Neg logit contributions
 hes-0.224
 curiously-0.209
 toddler-0.192
 hopefully-0.189
 nervously-0.182
Token.
Token ID13
Feature activation+2.62e+00

Pos logit contributions
akers+0.015
agons+0.015
opl+0.015
eds+0.014
arter+0.014
Neg logit contributions
 hes-0.026
 curiously-0.025
 toddler-0.025
 hopefully-0.024
 marvel-0.023
Token And
Token ID843
Feature activation+1.74e+00

Pos logit contributions
ines+0.084
akers+0.084
agons+0.083
eds+0.082
opl+0.078
Neg logit contributions
 hes-0.094
 toddler-0.091
 curiously-0.087
 Int-0.086
 hopefully-0.084
Token you
Token ID345
Feature activation-8.81e-01

Pos logit contributions
akers+0.044
agons+0.042
opl+0.042
ines+0.041
arl+0.040
Neg logit contributions
 hes-0.073
 curiously-0.070
 toddler-0.070
 Int-0.067
 marvel-0.065
Token learned
Token ID4499
Feature activation+4.82e+00

Pos logit contributions
 hes+0.031
 marvel+0.029
 toddler+0.027
 curiously+0.027
 intrigued+0.027
Neg logit contributions
opl-0.030
arter-0.030
agons-0.030
eds-0.029
arl-0.029
Token nice
Token ID3621
Feature activation+8.98e-01

Pos logit contributions
eds+0.076
opl+0.075
agons+0.074
ines+0.071
py+0.070
Neg logit contributions
 intrigued-0.063
 hes-0.062
 toddler-0.062
 marvel-0.061
 curiously-0.060
Token.
Token ID13
Feature activation+2.11e+00

Pos logit contributions
opl+0.022
agons+0.022
akers+0.021
eds+0.021
arter+0.021
Neg logit contributions
 hes-0.040
 toddler-0.038
 curiously-0.037
 hopefully-0.035
 intrigued-0.035
Token We
Token ID775
Feature activation-1.16e+00

Pos logit contributions
ines+0.068
akers+0.068
eds+0.067
agons+0.067
opl+0.064
Neg logit contributions
 hes-0.075
 toddler-0.073
 curiously-0.070
 Int-0.069
 hopefully-0.068
Token learned
Token ID4499
Feature activation+5.09e+00

Pos logit contributions
 hes+0.044
 marvel+0.041
 toddler+0.040
 curiously+0.040
 und+0.038
Neg logit contributions
opl-0.035
agons-0.035
eds-0.035
arter-0.034
akers-0.033
Token a
Token ID257
Feature activation+1.43e+00

Pos logit contributions
akers+0.167
agons+0.161
ines+0.159
opl+0.159
eds+0.154
Neg logit contributions
 hes-0.187
 curiously-0.176
 toddler-0.174
 hopefully-0.162
 marvel-0.161
Token lot
Token ID1256
Feature activation+7.16e+00

Pos logit contributions
akers+0.060
opl+0.058
ards+0.056
ines+0.055
agons+0.055
Neg logit contributions
 hes-0.040
 toddler-0.040
 curiously-0.039
 marvel-0.036
 daughter-0.034
Token today
Token ID1909
Feature activation+7.65e-01

Pos logit contributions
agons+0.252
akers+0.250
eds+0.249
arter+0.245
opl+0.244
Neg logit contributions
 hes-0.230
 curiously-0.212
 toddler-0.198
 hopefully-0.188
 further-0.184
Token.
Token ID13
Feature activation+2.11e+00

Pos logit contributions
agons+0.020
opl+0.020
akers+0.019
arter+0.019
eds+0.019
Neg logit contributions
 hes-0.033
 curiously-0.031
 toddler-0.031
 hopefully-0.029
 marvel-0.029
Token We
Token ID775
Feature activation-8.05e-01

Pos logit contributions
akers+0.067
agons+0.066
ines+0.066
eds+0.065
opl+0.065
Neg logit contributions
 hes-0.077
 toddler-0.073
 curiously-0.072
 Int-0.069
 hopefully-0.068
Token learned
Token ID4499
Feature activation+5.17e+00

Pos logit contributions
 hes+0.031
 marvel+0.028
 toddler+0.028
 curiously+0.028
 und+0.027
Neg logit contributions
eds-0.024
opl-0.024
agons-0.024
arter-0.024
akers-0.023
Token how
Token ID703
Feature activation-2.07e+00

Pos logit contributions
akers+0.181
agons+0.173
opl+0.171
ines+0.170
eds+0.166
Neg logit contributions
 hes-0.182
 curiously-0.171
 toddler-0.167
 hopefully-0.156
 marvel-0.155
Token a
Token ID257
Feature activation+1.05e+00

Pos logit contributions
akers+0.060
agons+0.057
arter+0.057
eds+0.056
opl+0.056
Neg logit contributions
 hes-0.073
 curiously-0.070
 toddler-0.067
 hopefully-0.064
 Int-0.063
Token lot
Token ID1256
Feature activation+6.72e+00

Pos logit contributions
akers+0.020
agons+0.019
opl+0.019
ines+0.019
eds+0.018
Neg logit contributions
 hes-0.051
 toddler-0.050
 curiously-0.049
 marvel-0.047
 Int-0.047
Token,
Token ID11
Feature activation+1.04e+00

Pos logit contributions
arter+0.218
agons+0.212
eds+0.210
akers+0.209
opl+0.199
Neg logit contributions
 hes-0.236
 curiously-0.233
 toddler-0.217
 hopefully-0.214
 nervously-0.205
Token learn
Token ID2193
Feature activation+4.39e+00

Pos logit contributions
akers+0.031
agons+0.030
opl+0.030
smart+0.030
ards+0.029
Neg logit contributions
 hes-0.036
 toddler-0.036
 curiously-0.034
 Int-0.033
 marvel-0.032
Token a
Token ID257
Feature activation+1.45e+00

Pos logit contributions
akers+0.136
agons+0.130
opl+0.127
eds+0.127
arter+0.127
Neg logit contributions
 hes-0.166
 curiously-0.159
 toddler-0.154
 hopefully-0.146
 Int-0.145
Token lot
Token ID1256
Feature activation+7.13e+00

Pos logit contributions
akers+0.043
opl+0.040
ards+0.040
eds+0.040
agons+0.039
Neg logit contributions
 curiously-0.058
 toddler-0.058
 hes-0.058
 marvel-0.055
 3-0.053
Token,
Token ID11
Feature activation+8.82e-01

Pos logit contributions
eds+0.240
arter+0.235
akers+0.235
agons+0.231
opl+0.218
Neg logit contributions
 hes-0.243
 curiously-0.239
 toddler-0.219
 hopefully-0.215
 nervously-0.205
Token and
Token ID290
Feature activation+8.22e-01

Pos logit contributions
akers+0.026
opl+0.026
agons+0.026
ines+0.025
smart+0.025
Neg logit contributions
 hes-0.031
 toddler-0.031
 curiously-0.030
 Int-0.028
 marvel-0.028
Token care
Token ID1337
Feature activation+8.60e-01

Pos logit contributions
akers+0.022
agons+0.022
opl+0.022
ines+0.021
eds+0.021
Neg logit contributions
 hes-0.033
 toddler-0.031
 curiously-0.031
 Int-0.030
 marvel-0.029
Token a
Token ID257
Feature activation+6.53e-02

Pos logit contributions
akers+0.027
arter+0.025
agons+0.025
eds+0.025
opl+0.025
Neg logit contributions
 curiously-0.032
 hes-0.031
 toddler-0.029
 hopefully-0.028
 intrigued-0.028
Token lot
Token ID1256
Feature activation+6.36e+00

Pos logit contributions
akers+0.002
agons+0.002
ines+0.002
eds+0.002
ards+0.002
Neg logit contributions
 toddler-0.002
 hes-0.002
 curiously-0.002
 marvel-0.002
 daughter-0.002
Token him
Token ID683
Feature activation-1.07e+00

Pos logit contributions
akers+0.005
opl+0.004
agons+0.004
ines+0.004
eds+0.004
Neg logit contributions
 hes-0.009
 toddler-0.009
 curiously-0.009
 Int-0.009
 intrigued-0.008
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
 hes+0.044
 curiously+0.042
 toddler+0.040
 hopefully+0.039
 marvel+0.038
Neg logit contributions
akers-0.030
agons-0.030
opl-0.029
eds-0.028
arl-0.028
Token They
Token ID1119
Feature activation+2.26e+00

Pos logit contributions
akers+0.041
agons+0.040
eds+0.039
ines+0.039
opl+0.038
Neg logit contributions
 hes-0.038
 toddler-0.037
 curiously-0.037
 Int-0.036
 Exc-0.035
Token learned
Token ID4499
Feature activation+4.99e+00

Pos logit contributions
akers+0.074
eds+0.072
agons+0.071
opl+0.071
ines+0.070
Neg logit contributions
 hes-0.082
 curiously-0.076
 toddler-0.075
 marvel-0.073
 intrigued-0.070
Token a
Token ID257
Feature activation+1.73e+00

Pos logit contributions
akers+0.148
agons+0.142
opl+0.140
ines+0.140
eds+0.138
Neg logit contributions
 hes-0.200
 curiously-0.188
 toddler-0.187
 hopefully-0.175
 marvel-0.174
Token lot
Token ID1256
Feature activation+7.13e+00

Pos logit contributions
akers+0.074
opl+0.072
agons+0.068
ons+0.068
eds+0.068
Neg logit contributions
 toddler-0.047
 curiously-0.047
 hes-0.046
 marvel-0.042
 intrigued-0.041
Token from
Token ID422
Feature activation+1.24e+00

Pos logit contributions
eds+0.279
akers+0.278
agons+0.272
arter+0.256
cks+0.255
Neg logit contributions
 hes-0.202
 curiously-0.190
 toddler-0.171
 hopefully-0.169
 nervously-0.168
Token him
Token ID683
Feature activation+6.58e-01

Pos logit contributions
akers+0.044
agons+0.044
arl+0.043
ines+0.042
opl+0.042
Neg logit contributions
 hes-0.042
 curiously-0.040
 toddler-0.039
 marvel-0.036
 hopefully-0.036
Token.
Token ID13
Feature activation+1.45e+00

Pos logit contributions
agons+0.020
akers+0.020
opl+0.019
arl+0.019
arter+0.019
Neg logit contributions
 hes-0.026
 curiously-0.025
 hopefully-0.023
 toddler-0.023
 marvel-0.022
Token They
Token ID1119
Feature activation+2.30e+00

Pos logit contributions
akers+0.052
agons+0.051
eds+0.051
ines+0.050
opl+0.047
Neg logit contributions
 hes-0.047
 toddler-0.045
 Int-0.045
 curiously-0.045
 Exc-0.043
Token learned
Token ID4499
Feature activation+5.13e+00

Pos logit contributions
akers+0.081
agons+0.079
eds+0.078
opl+0.078
ines+0.077
Neg logit contributions
 hes-0.076
 curiously-0.071
 toddler-0.071
 marvel-0.067
 intrigued-0.065
Token rewarding
Token ID23404
Feature activation+1.37e+00

Pos logit contributions
opl+0.008
agons+0.008
ines+0.008
eds+0.008
akers+0.008
Neg logit contributions
 hes-0.010
 toddler-0.009
 curiously-0.009
 intrigued-0.009
 hopefully-0.009
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
opl+0.042
eds+0.041
agons+0.040
akers+0.039
arl+0.038
Neg logit contributions
 hes-0.053
 toddler-0.050
 curiously-0.050
 hopefully-0.047
 nervously-0.045
Token They
Token ID1119
Feature activation+2.16e+00

Pos logit contributions
akers+0.047
agons+0.046
ines+0.045
eds+0.045
opl+0.043
Neg logit contributions
 hes-0.039
 toddler-0.037
 curiously-0.037
 Int-0.036
 Exc-0.035
Token learned
Token ID4499
Feature activation+4.86e+00

Pos logit contributions
akers+0.071
agons+0.070
ines+0.069
opl+0.069
eds+0.067
Neg logit contributions
 hes-0.075
 curiously-0.071
 toddler-0.071
 Int-0.066
 intrigued-0.065
Token a
Token ID257
Feature activation+1.63e+00

Pos logit contributions
akers+0.140
opl+0.133
agons+0.132
ines+0.128
eds+0.128
Neg logit contributions
 hes-0.203
 curiously-0.195
 toddler-0.188
 hopefully-0.182
 intrigued-0.177
Token lot
Token ID1256
Feature activation+7.13e+00

Pos logit contributions
akers+0.066
opl+0.065
ards+0.061
ons+0.061
eds+0.061
Neg logit contributions
 curiously-0.050
 toddler-0.048
 hes-0.047
 intrigued-0.043
 marvel-0.043
Token from
Token ID422
Feature activation+1.05e+00

Pos logit contributions
eds+0.286
akers+0.279
agons+0.279
opl+0.269
cks+0.269
Neg logit contributions
 hes-0.191
 curiously-0.183
 toddler-0.170
 hopefully-0.168
 nervously-0.162
Token the
Token ID262
Feature activation+1.57e+00

Pos logit contributions
agons+0.039
opl+0.039
akers+0.039
ines+0.038
arl+0.038
Neg logit contributions
 hes-0.033
 curiously-0.032
 toddler-0.031
 hopefully-0.028
 daughter-0.028
Token mixer
Token ID33938
Feature activation+1.69e+00

Pos logit contributions
akers+0.047
agons+0.045
opl+0.045
ines+0.045
eds+0.044
Neg logit contributions
 hes-0.060
 toddler-0.058
 curiously-0.058
 marvel-0.054
 Int-0.054
Token.
Token ID13
Feature activation+1.49e+00

Pos logit contributions
agons+0.053
akers+0.052
eds+0.050
arl+0.050
opl+0.049
Neg logit contributions
 hes-0.063
 curiously-0.060
 toddler-0.059
 hopefully-0.057
 marvel-0.055
Token<|endoftext|>
Token ID50256
Feature activation+1.83e-01

Pos logit contributions
akers+0.056
agons+0.055
ines+0.054
eds+0.053
opl+0.051
Neg logit contributions
 hes-0.047
 toddler-0.045
 Int-0.044
 curiously-0.044
 Exc-0.043
Token They
Token ID1119
Feature activation+2.27e+00

Pos logit contributions
akers+0.046
agons+0.045
ines+0.045
opl+0.044
eds+0.043
Neg logit contributions
 hes-0.070
 curiously-0.066
 toddler-0.066
 marvel-0.062
 Int-0.062
Token decide
Token ID5409
Feature activation+8.64e-01

Pos logit contributions
akers+0.078
agons+0.077
opl+0.076
eds+0.075
ines+0.075
Neg logit contributions
 hes-0.077
 curiously-0.073
 toddler-0.072
 marvel-0.068
 intrigued-0.066
Token to
Token ID284
Feature activation+7.35e-01

Pos logit contributions
akers+0.030
agons+0.030
ines+0.029
opl+0.029
eds+0.029
Neg logit contributions
 hes-0.029
 toddler-0.027
 curiously-0.027
 Int-0.025
 marvel-0.025
Token share
Token ID2648
Feature activation+1.88e+00

Pos logit contributions
opl+0.029
akers+0.029
agons+0.028
eds+0.028
ines+0.028
Neg logit contributions
 hes-0.020
 curiously-0.020
 marvel-0.020
 toddler-0.019
 convince-0.018
Token the
Token ID262
Feature activation+1.19e+00

Pos logit contributions
agons+0.057
akers+0.057
opl+0.055
ines+0.054
arter+0.053
Neg logit contributions
 hes-0.074
 curiously-0.070
 toddler-0.068
 marvel-0.065
 hopefully-0.065
Token lot
Token ID1256
Feature activation+7.11e+00

Pos logit contributions
opl+0.033
akers+0.033
ines+0.033
smart+0.032
eds+0.032
Neg logit contributions
 hes-0.047
 toddler-0.046
 curiously-0.044
 marvel-0.043
 intrigued-0.042
Tokenion
Token ID295
Feature activation-1.09e-01

Pos logit contributions
agons+0.248
opl+0.230
arter+0.229
py+0.213
akers+0.212
Neg logit contributions
 hes-0.245
 curiously-0.240
 toddler-0.223
 hopefully-0.222
 further-0.216
Token.
Token ID13
Feature activation+1.39e+00

Pos logit contributions
 hes+0.004
 curiously+0.004
 toddler+0.004
 hopefully+0.004
 eagerly+0.004
Neg logit contributions
opl-0.003
agons-0.003
arl-0.003
akers-0.003
arter-0.003
Token They
Token ID1119
Feature activation+2.13e+00

Pos logit contributions
akers+0.037
agons+0.036
ines+0.036
opl+0.036
eds+0.035
Neg logit contributions
 hes-0.057
 toddler-0.054
 curiously-0.054
 marvel-0.051
 Int-0.050
Token rub
Token ID6437
Feature activation-1.34e+00

Pos logit contributions
akers+0.070
agons+0.069
opl+0.068
ines+0.068
eds+0.067
Neg logit contributions
 hes-0.074
 curiously-0.070
 toddler-0.070
 marvel-0.065
 Int-0.065
Token it
Token ID340
Feature activation-3.75e-02

Pos logit contributions
 hes+0.037
 curiously+0.036
 toddler+0.035
 hopefully+0.033
 nervously+0.031
Neg logit contributions
akers-0.054
opl-0.054
arter-0.053
agons-0.053
ines-0.050
Token mom
Token ID1995
Feature activation+1.35e+00

Pos logit contributions
 hes+0.011
 toddler+0.010
 curiously+0.010
 marvel+0.009
 hopefully+0.009
Neg logit contributions
opl-0.009
akers-0.009
agons-0.009
ines-0.009
eds-0.009
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
agons+0.046
arter+0.044
akers+0.044
arl+0.044
opl+0.043
Neg logit contributions
 hes-0.047
 toddler-0.044
 curiously-0.043
 marvel-0.041
 hopefully-0.041
Token They
Token ID1119
Feature activation+2.16e+00

Pos logit contributions
akers+0.050
agons+0.050
eds+0.048
ines+0.047
ons+0.046
Neg logit contributions
 hes-0.035
 toddler-0.032
 curiously-0.032
 Int-0.031
 Exc-0.031
Token learned
Token ID4499
Feature activation+4.73e+00

Pos logit contributions
akers+0.068
ines+0.067
agons+0.067
opl+0.066
eds+0.063
Neg logit contributions
 hes-0.076
 curiously-0.073
 toddler-0.073
 Int-0.069
 intrigued-0.067
Token a
Token ID257
Feature activation+1.48e+00

Pos logit contributions
akers+0.120
opl+0.114
agons+0.114
eds+0.110
ines+0.109
Neg logit contributions
 hes-0.213
 curiously-0.205
 toddler-0.199
 hopefully-0.191
 marvel-0.188
Token lot
Token ID1256
Feature activation+7.05e+00

Pos logit contributions
akers+0.055
opl+0.054
eds+0.052
agons+0.051
ards+0.051
Neg logit contributions
 curiously-0.049
 toddler-0.048
 hes-0.048
 marvel-0.044
 intrigued-0.043
Token.
Token ID13
Feature activation+1.62e+00

Pos logit contributions
eds+0.275
akers+0.270
agons+0.269
opl+0.258
arter+0.256
Neg logit contributions
 hes-0.199
 curiously-0.191
 toddler-0.180
 hopefully-0.173
 nervously-0.169
Token<|endoftext|>
Token ID50256
Feature activation-6.25e-02

Pos logit contributions
akers+0.071
agons+0.071
eds+0.069
ines+0.068
opl+0.066
Neg logit contributions
 hes-0.039
 toddler-0.037
 curiously-0.036
 Int-0.036
 Exc-0.034
TokenL
Token ID43
Feature activation+3.87e+00

Pos logit contributions
 hes+0.002
 curiously+0.002
 toddler+0.002
 marvel+0.002
 intrigued+0.002
Neg logit contributions
akers-0.002
agons-0.002
opl-0.002
ines-0.002
eds-0.002
Tokenily
Token ID813
Feature activation+2.11e+00

Pos logit contributions
opl+0.110
agons+0.105
akers+0.105
arter+0.103
 lava+0.100
Neg logit contributions
 hes-0.159
 curiously-0.148
 toddler-0.142
 und-0.135
 daughter-0.135
Token and
Token ID290
Feature activation-3.22e-02

Pos logit contributions
akers+0.066
opl+0.064
ines+0.064
agons+0.064
eds+0.063
Neg logit contributions
 hes-0.076
 curiously-0.072
 toddler-0.072
 Int-0.068
 hopefully-0.067
Token.
Token ID13
Feature activation+5.61e-01

Pos logit contributions
akers+0.061
agons+0.059
opl+0.058
ines+0.058
eds+0.057
Neg logit contributions
 hes-0.111
 toddler-0.106
 curiously-0.106
 Int-0.100
 marvel-0.100
Token They
Token ID1119
Feature activation+1.78e+00

Pos logit contributions
akers+0.014
opl+0.013
ines+0.013
arl+0.013
agons+0.013
Neg logit contributions
 hes-0.024
 toddler-0.023
 curiously-0.023
 marvel-0.021
 intrigued-0.021
Token hoped
Token ID10719
Feature activation+6.20e-01

Pos logit contributions
ines+0.054
ards+0.053
akers+0.053
agons+0.051
 years+0.049
Neg logit contributions
 toddler-0.060
 hes-0.060
 curiously-0.058
 Int-0.057
 intrigued-0.054
Token for
Token ID329
Feature activation+1.41e+00

Pos logit contributions
akers+0.017
ines+0.017
opl+0.016
ards+0.016
eds+0.016
Neg logit contributions
 hes-0.024
 toddler-0.023
 curiously-0.023
 Int-0.022
 intrigued-0.022
Token a
Token ID257
Feature activation+9.58e-01

Pos logit contributions
agons+0.049
akers+0.049
opl+0.048
ines+0.047
eds+0.046
Neg logit contributions
 hes-0.049
 curiously-0.046
 toddler-0.045
 hopefully-0.043
 marvel-0.042
Token lot
Token ID1256
Feature activation+7.04e+00

Pos logit contributions
akers+0.030
agons+0.029
ines+0.028
opl+0.028
eds+0.028
Neg logit contributions
 hes-0.035
 curiously-0.033
 toddler-0.033
 Int-0.032
 intrigued-0.031
Token of
Token ID286
Feature activation+5.41e-01

Pos logit contributions
agons+0.247
arter+0.232
eds+0.232
akers+0.230
opl+0.227
Neg logit contributions
 hes-0.251
 curiously-0.227
 hopefully-0.219
 toddler-0.212
 nervously-0.209
Token things
Token ID1243
Feature activation+4.05e-01

Pos logit contributions
akers+0.015
opl+0.015
agons+0.015
eds+0.015
ines+0.014
Neg logit contributions
 hes-0.022
 curiously-0.021
 toddler-0.020
 marvel-0.020
 intrigued-0.019
Token.
Token ID13
Feature activation+9.11e-01

Pos logit contributions
akers+0.011
agons+0.010
opl+0.010
ines+0.010
eds+0.010
Neg logit contributions
 hes-0.017
 curiously-0.016
 toddler-0.016
 marvel-0.015
 Int-0.015
Token But
Token ID887
Feature activation-1.18e+00

Pos logit contributions
akers+0.023
arl+0.022
opl+0.022
ines+0.022
agons+0.021
Neg logit contributions
 hes-0.038
 curiously-0.036
 toddler-0.036
 marvel-0.034
 intrigued-0.033
Token they
Token ID484
Feature activation-9.65e-01

Pos logit contributions
 hes+0.045
 toddler+0.045
 Int+0.045
 curiously+0.044
 Exc+0.043
Neg logit contributions
akers-0.030
ards-0.030
arter-0.029
 lava-0.029
ines-0.028
Token sea
Token ID5417
Feature activation+2.25e+00

Pos logit contributions
agons+0.065
ines+0.065
opl+0.065
 lava+0.064
eds+0.064
Neg logit contributions
 hes-0.063
 Int-0.056
 curiously-0.056
 toddler-0.056
 hopefully-0.054
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
agons+0.054
akers+0.053
opl+0.053
arl+0.051
arter+0.050
Neg logit contributions
 hes-0.098
 curiously-0.095
 toddler-0.094
 marvel-0.090
 hopefully-0.088
Token They
Token ID1119
Feature activation+2.11e+00

Pos logit contributions
arl+0.033
ines+0.032
akers+0.032
arter+0.032
opl+0.032
Neg logit contributions
 hes-0.049
 curiously-0.047
 toddler-0.046
 marvel-0.043
 hopefully-0.043
Token learned
Token ID4499
Feature activation+4.73e+00

Pos logit contributions
akers+0.061
ards+0.058
ines+0.058
agons+0.056
eds+0.054
Neg logit contributions
 hes-0.078
 curiously-0.077
 toddler-0.076
 Int-0.075
 hopefully-0.071
Token a
Token ID257
Feature activation+1.53e+00

Pos logit contributions
akers+0.129
agons+0.124
opl+0.123
ines+0.121
eds+0.119
Neg logit contributions
 hes-0.197
 curiously-0.188
 toddler-0.187
 marvel-0.176
 Int-0.175
Token lot
Token ID1256
Feature activation+7.02e+00

Pos logit contributions
akers+0.059
opl+0.058
agons+0.055
ards+0.054
ons+0.054
Neg logit contributions
 curiously-0.049
 toddler-0.049
 hes-0.048
 marvel-0.045
 intrigued-0.043
Token about
Token ID546
Feature activation+4.22e-01

Pos logit contributions
akers+0.279
agons+0.272
eds+0.267
opl+0.258
arter+0.254
Neg logit contributions
 hes-0.201
 curiously-0.189
 toddler-0.177
 nervously-0.168
 hopefully-0.166
Token the
Token ID262
Feature activation+7.91e-01

Pos logit contributions
akers+0.011
opl+0.011
ines+0.011
eds+0.011
agons+0.011
Neg logit contributions
 hes-0.017
 toddler-0.016
 curiously-0.016
 Int-0.016
 hopefully-0.015
Token reef
Token ID25088
Feature activation+5.47e-01

Pos logit contributions
akers+0.027
agons+0.027
ines+0.026
opl+0.026
eds+0.026
Neg logit contributions
 hes-0.026
 curiously-0.025
 toddler-0.024
 Int-0.024
 hopefully-0.023
Token and
Token ID290
Feature activation+3.11e-01

Pos logit contributions
agons+0.019
akers+0.018
arl+0.018
opl+0.018
eds+0.017
Neg logit contributions
 hes-0.019
 curiously-0.018
 toddler-0.018
 marvel-0.017
 hopefully-0.017
Token the
Token ID262
Feature activation+4.15e-01

Pos logit contributions
akers+0.008
eds+0.007
opl+0.007
ines+0.007
ards+0.007
Neg logit contributions
 hes-0.013
 curiously-0.013
 toddler-0.012
 Int-0.012
 Exc-0.011
Token obedient
Token ID41652
Feature activation-2.00e-01

Pos logit contributions
akers+0.034
opl+0.032
agons+0.032
ines+0.032
smart+0.032
Neg logit contributions
 hes-0.052
 toddler-0.051
 curiously-0.051
 Int-0.049
 Exc-0.047
Token.
Token ID13
Feature activation+1.14e+00

Pos logit contributions
 hes+0.009
 toddler+0.008
 curiously+0.008
 hopefully+0.008
 marvel+0.008
Neg logit contributions
akers-0.005
agons-0.005
opl-0.005
ines-0.005
eds-0.005
Token He
Token ID679
Feature activation+1.29e+00

Pos logit contributions
opl+0.037
akers+0.037
agons+0.037
ines+0.036
arl+0.036
Neg logit contributions
 hes-0.039
 curiously-0.037
 toddler-0.037
 marvel-0.034
 intrigued-0.034
Token learned
Token ID4499
Feature activation+4.91e+00

Pos logit contributions
ines+0.039
akers+0.038
opl+0.038
agons+0.038
ards+0.037
Neg logit contributions
 toddler-0.045
 hes-0.045
 curiously-0.044
 Int-0.042
 hopefully-0.041
Token a
Token ID257
Feature activation+1.58e+00

Pos logit contributions
akers+0.133
agons+0.128
opl+0.126
ines+0.125
eds+0.123
Neg logit contributions
 hes-0.206
 curiously-0.197
 toddler-0.194
 hopefully-0.184
 Int-0.183
Token lot
Token ID1256
Feature activation+7.00e+00

Pos logit contributions
akers+0.061
opl+0.057
ards+0.057
agons+0.056
ines+0.056
Neg logit contributions
 curiously-0.049
 hes-0.049
 toddler-0.048
 marvel-0.044
 intrigued-0.043
Token that
Token ID326
Feature activation+1.01e+00

Pos logit contributions
akers+0.266
eds+0.258
agons+0.255
arter+0.248
cks+0.241
Neg logit contributions
 hes-0.212
 curiously-0.205
 toddler-0.189
 hopefully-0.189
 nervously-0.180
Token day
Token ID1110
Feature activation+1.47e-01

Pos logit contributions
opl+0.028
akers+0.026
ards+0.026
smart+0.025
agons+0.025
Neg logit contributions
 hes-0.040
 curiously-0.039
 toddler-0.039
 Int-0.037
 intrigued-0.036
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
akers+0.004
agons+0.004
eds+0.004
arter+0.004
 lava+0.004
Neg logit contributions
 hes-0.006
 curiously-0.005
 hopefully-0.005
 toddler-0.005
 nervously-0.005
Token<|endoftext|>
Token ID50256
Feature activation+4.84e-02

Pos logit contributions
akers+0.044
opl+0.044
agons+0.043
ines+0.043
eds+0.042
Neg logit contributions
 hes-0.043
 curiously-0.041
 toddler-0.041
 marvel-0.038
 Int-0.037
TokenSam
Token ID16305
Feature activation+2.39e+00

Pos logit contributions
akers+0.002
agons+0.002
opl+0.002
ines+0.002
eds+0.002
Neg logit contributions
 hes-0.001
 toddler-0.001
 curiously-0.001
 Int-0.001
 marvel-0.001
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
akers+0.011
agons+0.010
opl+0.010
ines+0.010
eds+0.010
Neg logit contributions
 hes-0.018
 toddler-0.018
 curiously-0.018
 Int-0.017
 marvel-0.016
Token She
Token ID1375
Feature activation+2.45e+00

Pos logit contributions
akers+0.046
agons+0.045
ines+0.044
eds+0.044
opl+0.043
Neg logit contributions
 hes-0.033
 toddler-0.032
 curiously-0.031
 Int-0.030
 hopefully-0.029
Token had
Token ID550
Feature activation+3.62e+00

Pos logit contributions
ines+0.076
akers+0.075
opl+0.074
agons+0.074
eds+0.072
Neg logit contributions
 hes-0.086
 toddler-0.084
 curiously-0.083
 Int-0.079
 hopefully-0.077
Token learned
Token ID4499
Feature activation+4.92e+00

Pos logit contributions
akers+0.114
agons+0.111
opl+0.110
ines+0.110
eds+0.108
Neg logit contributions
 hes-0.133
 curiously-0.126
 toddler-0.126
 Int-0.118
 marvel-0.117
Token a
Token ID257
Feature activation+1.42e+00

Pos logit contributions
akers+0.165
agons+0.154
eds+0.151
arter+0.150
opl+0.148
Neg logit contributions
 hes-0.185
 curiously-0.178
 toddler-0.168
 hopefully-0.165
 intrigued-0.158
Token lot
Token ID1256
Feature activation+6.99e+00

Pos logit contributions
akers+0.048
opl+0.045
ards+0.045
eds+0.044
ines+0.044
Neg logit contributions
 curiously-0.052
 toddler-0.051
 hes-0.051
 marvel-0.048
 intrigued-0.047
Token from
Token ID422
Feature activation+1.09e+00

Pos logit contributions
akers+0.249
eds+0.242
agons+0.241
arter+0.230
cks+0.222
Neg logit contributions
 hes-0.231
 curiously-0.223
 hopefully-0.207
 toddler-0.204
 nervously-0.197
Token the
Token ID262
Feature activation+1.61e+00

Pos logit contributions
akers+0.038
agons+0.038
arl+0.036
ines+0.036
opl+0.036
Neg logit contributions
 hes-0.037
 curiously-0.036
 toddler-0.034
 hopefully-0.032
 marvel-0.032
Token novel
Token ID5337
Feature activation+2.30e+00

Pos logit contributions
akers+0.049
agons+0.048
opl+0.047
ines+0.047
eds+0.046
Neg logit contributions
 hes-0.060
 curiously-0.057
 toddler-0.057
 Int-0.053
 marvel-0.053
Token.
Token ID13
Feature activation+1.25e+00

Pos logit contributions
agons+0.063
akers+0.063
eds+0.059
opl+0.058
arter+0.057
Neg logit contributions
 hes-0.093
 curiously-0.091
 toddler-0.090
 hopefully-0.086
 marvel-0.084
Token
Token ID198
Feature activation+8.89e-01

Pos logit contributions
akers+0.050
agons+0.049
eds+0.048
ines+0.048
ards+0.046
Neg logit contributions
 hes-0.037
 toddler-0.035
 curiously-0.035
 Int-0.034
 Exc-0.033
Token respectful
Token ID25923
Feature activation+3.85e-01

Pos logit contributions
akers+0.043
agons+0.042
ines+0.041
opl+0.041
eds+0.040
Neg logit contributions
 hes-0.060
 curiously-0.057
 toddler-0.057
 Int-0.054
 marvel-0.053
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
akers+0.010
agons+0.010
opl+0.010
eds+0.009
arter+0.009
Neg logit contributions
 hes-0.016
 curiously-0.016
 toddler-0.015
 hopefully-0.015
 intrigued-0.014
Token They
Token ID1119
Feature activation+2.28e+00

Pos logit contributions
akers+0.046
agons+0.045
ines+0.045
eds+0.044
opl+0.043
Neg logit contributions
 hes-0.047
 toddler-0.045
 curiously-0.044
 Int-0.043
 marvel-0.042
Token learned
Token ID4499
Feature activation+4.72e+00

Pos logit contributions
akers+0.066
ines+0.065
agons+0.064
ards+0.062
opl+0.062
Neg logit contributions
 hes-0.083
 curiously-0.081
 toddler-0.081
 Int-0.078
 hopefully-0.075
Token a
Token ID257
Feature activation+1.33e+00

Pos logit contributions
akers+0.118
opl+0.111
agons+0.110
eds+0.106
ines+0.106
Neg logit contributions
 hes-0.215
 curiously-0.206
 toddler-0.201
 hopefully-0.192
 intrigued-0.191
Token lot
Token ID1256
Feature activation+6.98e+00

Pos logit contributions
akers+0.050
opl+0.048
eds+0.046
ons+0.046
agons+0.046
Neg logit contributions
 curiously-0.045
 hes-0.043
 toddler-0.043
 marvel-0.040
 intrigued-0.040
Token from
Token ID422
Feature activation+1.00e+00

Pos logit contributions
akers+0.276
eds+0.272
agons+0.269
opl+0.257
arter+0.254
Neg logit contributions
 hes-0.201
 curiously-0.190
 toddler-0.171
 hopefully-0.169
 nervously-0.168
Token the
Token ID262
Feature activation+1.62e+00

Pos logit contributions
akers+0.037
agons+0.037
opl+0.036
arl+0.035
ines+0.035
Neg logit contributions
 hes-0.032
 curiously-0.030
 toddler-0.030
 marvel-0.028
 hopefully-0.027
Token spider
Token ID19230
Feature activation+1.17e+00

Pos logit contributions
akers+0.048
agons+0.046
opl+0.046
ines+0.046
eds+0.045
Neg logit contributions
 hes-0.063
 toddler-0.061
 curiously-0.060
 marvel-0.056
 Int-0.056
Token and
Token ID290
Feature activation+5.19e-01

Pos logit contributions
akers+0.034
agons+0.034
opl+0.033
arl+0.032
arter+0.032
Neg logit contributions
 hes-0.046
 curiously-0.043
 toddler-0.043
 marvel-0.041
 hopefully-0.040
Token the
Token ID262
Feature activation+7.43e-01

Pos logit contributions
akers+0.013
opl+0.013
eds+0.013
ards+0.013
agons+0.013
Neg logit contributions
 hes-0.021
 curiously-0.020
 toddler-0.020
 Int-0.019
 intrigued-0.019
Token have
Token ID423
Feature activation+2.08e+00

Pos logit contributions
akers+0.072
eds+0.071
agons+0.068
opl+0.068
arter+0.067
Neg logit contributions
 hes-0.084
 curiously-0.080
 marvel-0.079
 toddler-0.077
 hopefully-0.073
Token fun
Token ID1257
Feature activation-2.85e-01

Pos logit contributions
opl+0.062
arter+0.062
akers+0.061
agons+0.061
eds+0.060
Neg logit contributions
 hes-0.082
 toddler-0.074
 curiously-0.073
 marvel-0.073
 Int-0.071
Token and
Token ID290
Feature activation+4.60e-01

Pos logit contributions
 hes+0.011
 curiously+0.011
 toddler+0.010
 hopefully+0.010
 marvel+0.010
Neg logit contributions
agons-0.008
opl-0.008
arl-0.008
arter-0.008
akers-0.008
Token laugh
Token ID6487
Feature activation+1.11e+00

Pos logit contributions
agons+0.015
akers+0.015
opl+0.015
eds+0.015
ines+0.015
Neg logit contributions
 hes-0.016
 toddler-0.015
 curiously-0.015
 marvel-0.014
 hopefully-0.014
Token a
Token ID257
Feature activation+1.03e+00

Pos logit contributions
akers+0.028
agons+0.027
opl+0.027
eds+0.026
ines+0.026
Neg logit contributions
 hes-0.048
 curiously-0.047
 toddler-0.046
 hopefully-0.043
 marvel-0.043
Token lot
Token ID1256
Feature activation+6.97e+00

Pos logit contributions
opl+0.028
eds+0.028
akers+0.028
ards+0.027
agons+0.027
Neg logit contributions
 toddler-0.040
 marvel-0.040
 hes-0.040
 curiously-0.039
 couple-0.037
Token.
Token ID13
Feature activation+1.80e+00

Pos logit contributions
agons+0.314
eds+0.292
opl+0.290
arter+0.286
ons+0.278
Neg logit contributions
 hes-0.146
 happily-0.138
 again-0.137
 curiously-0.136
 hopefully-0.129
Token They
Token ID1119
Feature activation+2.35e+00

Pos logit contributions
akers+0.073
agons+0.072
eds+0.071
opl+0.070
ines+0.070
Neg logit contributions
 hes-0.049
 toddler-0.046
 curiously-0.046
 Int-0.044
 intrigued-0.043
Token do
Token ID466
Feature activation+6.47e-01

Pos logit contributions
akers+0.068
eds+0.067
agons+0.064
arter+0.064
opl+0.063
Neg logit contributions
 hes-0.093
 marvel-0.089
 curiously-0.088
 toddler-0.086
 hopefully-0.082
Token not
Token ID407
Feature activation+2.77e+00

Pos logit contributions
akers+0.019
agons+0.019
opl+0.018
arl+0.018
arter+0.018
Neg logit contributions
 hes-0.026
 curiously-0.024
 marvel-0.024
 intrigued-0.023
 hopefully-0.022
Token miss
Token ID2051
Feature activation+1.80e+00

Pos logit contributions
agons+0.105
opl+0.105
akers+0.100
ines+0.100
eds+0.100
Neg logit contributions
 hes-0.085
 toddler-0.079
 curiously-0.078
 marvel-0.077
 hopefully-0.073
Token.
Token ID13
Feature activation+1.96e+00

Pos logit contributions
agons+0.067
arter+0.063
opl+0.060
eds+0.059
arl+0.057
Neg logit contributions
 hes-0.100
 toddler-0.098
 daughter-0.092
 curiously-0.091
 marvel-0.090
Token We
Token ID775
Feature activation-1.00e+00

Pos logit contributions
ines+0.073
eds+0.072
akers+0.072
agons+0.071
ons+0.067
Neg logit contributions
 toddler-0.060
 hes-0.059
 Int-0.055
 Exc-0.054
 curiously-0.054
Token have
Token ID423
Feature activation+1.88e+00

Pos logit contributions
 hes+0.036
 marvel+0.034
 toddler+0.034
 curiously+0.033
 und+0.032
Neg logit contributions
eds-0.032
arter-0.031
opl-0.031
agons-0.030
ines-0.030
Token learned
Token ID4499
Feature activation+4.99e+00

Pos logit contributions
akers+0.059
agons+0.058
opl+0.057
ines+0.057
eds+0.056
Neg logit contributions
 hes-0.069
 toddler-0.065
 curiously-0.065
 Int-0.060
 marvel-0.060
Token a
Token ID257
Feature activation+1.53e+00

Pos logit contributions
akers+0.165
agons+0.159
ines+0.155
opl+0.154
arter+0.152
Neg logit contributions
 hes-0.185
 toddler-0.171
 curiously-0.171
 hopefully-0.160
 marvel-0.157
Token lot
Token ID1256
Feature activation+6.97e+00

Pos logit contributions
akers+0.058
opl+0.057
ards+0.055
ines+0.055
agons+0.055
Neg logit contributions
 toddler-0.048
 hes-0.047
 curiously-0.046
 marvel-0.044
 3-0.042
Token today
Token ID1909
Feature activation+7.62e-01

Pos logit contributions
eds+0.242
agons+0.241
akers+0.239
arter+0.233
cks+0.226
Neg logit contributions
 hes-0.220
 curiously-0.201
 toddler-0.192
 hopefully-0.186
 further-0.181
Token.
Token ID13
Feature activation+1.81e+00

Pos logit contributions
agons+0.023
opl+0.022
akers+0.022
arter+0.021
eds+0.021
Neg logit contributions
 hes-0.029
 toddler-0.028
 curiously-0.028
 hopefully-0.026
 eagerly-0.025
Token We
Token ID775
Feature activation-8.97e-01

Pos logit contributions
akers+0.063
agons+0.062
ines+0.062
eds+0.061
opl+0.059
Neg logit contributions
 hes-0.061
 toddler-0.059
 curiously-0.056
 Int-0.055
 hopefully-0.054
Token will
Token ID481
Feature activation+5.03e-01

Pos logit contributions
 hes+0.034
 marvel+0.031
 toddler+0.031
 curiously+0.030
 und+0.030
Neg logit contributions
eds-0.028
arter-0.027
agons-0.027
opl-0.027
akers-0.026
Token always
Token ID1464
Feature activation+8.59e-01

Pos logit contributions
akers+0.019
eds+0.018
agons+0.018
opl+0.018
ines+0.018
Neg logit contributions
 hes-0.016
 curiously-0.015
 marvel-0.015
 hopefully-0.014
 toddler-0.014
Token day
Token ID1110
Feature activation+4.89e-02

Pos logit contributions
 hes+0.021
 curiously+0.020
 toddler+0.020
 Int+0.020
 intrigued+0.019
Neg logit contributions
akers-0.014
agons-0.013
ines-0.013
opl-0.013
eds-0.013
Token.
Token ID13
Feature activation+1.88e+00

Pos logit contributions
akers+0.001
agons+0.001
opl+0.001
ines+0.001
eds+0.001
Neg logit contributions
 hes-0.002
 curiously-0.002
 toddler-0.002
 marvel-0.002
 Int-0.002
Token They
Token ID1119
Feature activation+2.40e+00

Pos logit contributions
agons+0.072
akers+0.072
eds+0.070
opl+0.070
ons+0.069
Neg logit contributions
 hes-0.056
 toddler-0.054
 curiously-0.052
 Int-0.051
 intrigued-0.050
Token learned
Token ID4499
Feature activation+4.82e+00

Pos logit contributions
akers+0.069
ines+0.066
opl+0.064
ards+0.064
agons+0.063
Neg logit contributions
 hes-0.091
 curiously-0.088
 toddler-0.087
 Int-0.084
 hopefully-0.081
Token a
Token ID257
Feature activation+1.49e+00

Pos logit contributions
akers+0.123
agons+0.118
opl+0.117
ines+0.115
eds+0.113
Neg logit contributions
 hes-0.211
 curiously-0.201
 toddler-0.200
 marvel-0.188
 Int-0.188
Token lot
Token ID1256
Feature activation+6.97e+00

Pos logit contributions
akers+0.058
opl+0.055
agons+0.053
ards+0.053
eds+0.052
Neg logit contributions
 toddler-0.048
 curiously-0.048
 hes-0.047
 marvel-0.044
 Int-0.042
Token and
Token ID290
Feature activation+5.53e-01

Pos logit contributions
akers+0.271
eds+0.266
agons+0.263
arter+0.249
opl+0.247
Neg logit contributions
 hes-0.208
 curiously-0.196
 toddler-0.184
 hopefully-0.173
 nervously-0.173
Token made
Token ID925
Feature activation+2.05e+00

Pos logit contributions
akers+0.017
agons+0.017
opl+0.017
ines+0.017
eds+0.016
Neg logit contributions
 hes-0.020
 curiously-0.019
 toddler-0.019
 Int-0.018
 hopefully-0.018
Token a
Token ID257
Feature activation+2.71e-01

Pos logit contributions
akers+0.053
eds+0.052
ons+0.050
ines+0.050
agons+0.050
Neg logit contributions
 hes-0.080
 toddler-0.079
 Int-0.077
 curiously-0.076
 hopefully-0.074
Token new
Token ID649
Feature activation-1.33e+00

Pos logit contributions
akers+0.009
agons+0.009
opl+0.009
ines+0.009
eds+0.009
Neg logit contributions
 hes-0.009
 curiously-0.009
 toddler-0.009
 marvel-0.008
 hopefully-0.008
Token friend
Token ID1545
Feature activation-2.69e-01

Pos logit contributions
 hes+0.049
 Int+0.047
 curiously+0.045
 toddler+0.045
 Exc+0.043
Neg logit contributions
opl-0.038
akers-0.038
ines-0.038
eds-0.037
agons-0.037
Token He
Token ID679
Feature activation+9.96e-01

Pos logit contributions
akers+0.054
agons+0.054
eds+0.053
ines+0.052
opl+0.052
Neg logit contributions
 hes-0.040
 curiously-0.038
 toddler-0.038
 Int-0.036
 hopefully-0.035
Token does
Token ID857
Feature activation+2.06e+00

Pos logit contributions
akers+0.025
agons+0.024
opl+0.024
eds+0.023
ines+0.023
Neg logit contributions
 hes-0.043
 curiously-0.041
 toddler-0.041
 Int-0.038
 marvel-0.038
Token not
Token ID407
Feature activation+2.99e+00

Pos logit contributions
akers+0.063
agons+0.061
eds+0.058
opl+0.056
arter+0.055
Neg logit contributions
 hes-0.082
 curiously-0.079
 marvel-0.073
 hopefully-0.073
 intrigued-0.072
Token laugh
Token ID6487
Feature activation+8.11e-01

Pos logit contributions
agons+0.116
opl+0.115
eds+0.112
akers+0.110
ines+0.108
Neg logit contributions
 hes-0.086
 marvel-0.079
 curiously-0.078
 toddler-0.075
 hopefully-0.073
Token a
Token ID257
Feature activation+1.08e+00

Pos logit contributions
akers+0.025
agons+0.025
opl+0.024
eds+0.024
ines+0.023
Neg logit contributions
 hes-0.030
 curiously-0.030
 toddler-0.028
 hopefully-0.027
 intrigued-0.026
Token lot
Token ID1256
Feature activation+6.95e+00

Pos logit contributions
akers+0.022
agons+0.021
opl+0.020
ines+0.020
eds+0.020
Neg logit contributions
 hes-0.052
 toddler-0.051
 curiously-0.050
 marvel-0.048
 hopefully-0.047
Token.
Token ID13
Feature activation+1.56e+00

Pos logit contributions
agons+0.281
eds+0.273
akers+0.269
arter+0.255
opl+0.250
Neg logit contributions
 curiously-0.181
 hes-0.180
 hopefully-0.178
 again-0.169
 nervously-0.169
Token
Token ID198
Feature activation+9.40e-01

Pos logit contributions
akers+0.062
agons+0.061
eds+0.060
ines+0.060
opl+0.059
Neg logit contributions
 hes-0.044
 curiously-0.042
 toddler-0.042
 Int-0.040
 hopefully-0.039
Token
Token ID198
Feature activation+1.13e+00

Pos logit contributions
akers+0.043
agons+0.042
opl+0.040
arter+0.040
eds+0.040
Neg logit contributions
 hes-0.022
 curiously-0.021
 toddler-0.019
 Int-0.019
 intrigued-0.018
TokenTom
Token ID13787
Feature activation+4.06e-01

Pos logit contributions
arter+0.040
opl+0.039
agons+0.039
akers+0.039
eds+0.038
Neg logit contributions
 curiously-0.034
 hes-0.034
 toddler-0.033
 Int-0.032
 hopefully-0.031
Token wants
Token ID3382
Feature activation-1.20e-01

Pos logit contributions
akers+0.013
agons+0.012
opl+0.012
arter+0.012
arl+0.012
Neg logit contributions
 hes-0.015
 curiously-0.014
 toddler-0.014
 Int-0.013
 marvel-0.013
Token and
Token ID290
Feature activation-2.73e+00

Pos logit contributions
 hes+0.184
 toddler+0.177
 curiously+0.173
 intrigued+0.166
 marvel+0.165
Neg logit contributions
opl-0.116
ines-0.115
akers-0.114
eds-0.112
ards-0.111
Token slides
Token ID19392
Feature activation-1.48e+00

Pos logit contributions
 hes+0.090
 toddler+0.088
 curiously+0.085
 Int+0.082
 hopefully+0.079
Neg logit contributions
akers-0.089
ards-0.087
ines-0.086
agons-0.085
opl-0.085
Token down
Token ID866
Feature activation-2.03e+00

Pos logit contributions
 hes+0.049
 curiously+0.046
 toddler+0.046
 marvel+0.042
 Int+0.042
Neg logit contributions
akers-0.052
agons-0.051
opl-0.050
ines-0.050
eds-0.049
Token.
Token ID13
Feature activation-8.43e-01

Pos logit contributions
 hes+0.074
 curiously+0.071
 toddler+0.069
 marvel+0.065
 Int+0.065
Neg logit contributions
akers-0.066
agons-0.065
opl-0.062
eds-0.062
arter-0.061
Token "
Token ID366
Feature activation-1.54e+00

Pos logit contributions
 hes+0.032
 toddler+0.030
 curiously+0.030
 marvel+0.028
 intrigued+0.028
Neg logit contributions
opl-0.024
akers-0.024
agons-0.023
arl-0.023
ines-0.023
TokenWhe
Token ID10842
Feature activation-7.63e+00

Pos logit contributions
 hes+0.050
 curiously+0.050
 toddler+0.045
 hopefully+0.043
 intrigued+0.043
Neg logit contributions
akers-0.055
opl-0.053
arl-0.052
agons-0.050
eds-0.049
Tokene
Token ID68
Feature activation+2.13e+00

Pos logit contributions
 hes+0.382
 toddler+0.367
 curiously+0.367
 Int+0.350
 marvel+0.349
Neg logit contributions
akers-0.134
agons-0.131
opl-0.127
ines-0.124
eds-0.122
Token!"
Token ID2474
Feature activation-1.95e+00

Pos logit contributions
agons+0.029
akers+0.029
opl+0.027
arter+0.027
ines+0.026
Neg logit contributions
 hes-0.115
 curiously-0.112
 toddler-0.110
 marvel-0.106
 Int-0.105
Token she
Token ID673
Feature activation-8.90e-01

Pos logit contributions
 hes+0.073
 toddler+0.072
 curiously+0.069
 marvel+0.066
 intrigued+0.064
Neg logit contributions
akers-0.053
ards-0.052
opl-0.052
agons-0.052
ines-0.051
Token says
Token ID1139
Feature activation-1.25e+00

Pos logit contributions
 toddler+0.031
 hes+0.029
 hopefully+0.028
 curiously+0.028
 Nathan+0.026
Neg logit contributions
!"-0.027
ards-0.026
ines-0.026
ons-0.025
opl-0.024
Token.
Token ID13
Feature activation+7.69e-01

Pos logit contributions
 toddler+0.066
 hes+0.065
 curiously+0.063
 Int+0.062
 marvel+0.061
Neg logit contributions
akers-0.016
ines-0.015
smart-0.014
arl-0.013
ards-0.013
Token But
Token ID887
Feature activation-4.40e+00

Pos logit contributions
 hes+0.074
 toddler+0.071
 curiously+0.069
 Int+0.066
 intrigued+0.064
Neg logit contributions
akers-0.064
ines-0.062
agons-0.061
eds-0.061
opl-0.060
Token then
Token ID788
Feature activation-3.01e+00

Pos logit contributions
 hes+0.163
 toddler+0.157
 curiously+0.154
 Int+0.152
 intrigued+0.148
Neg logit contributions
akers-0.122
opl-0.119
ards-0.119
ines-0.117
agons-0.116
Token Joe
Token ID5689
Feature activation-3.24e+00

Pos logit contributions
 hes+0.121
 toddler+0.116
 curiously+0.116
 Int+0.111
 intrigued+0.109
Neg logit contributions
akers-0.079
agons-0.073
ines-0.073
opl-0.073
ons-0.071
Token noticed
Token ID6810
Feature activation-1.43e+00

Pos logit contributions
 hes+0.094
 toddler+0.093
 curiously+0.089
 Int+0.087
 hopefully+0.084
Neg logit contributions
ines-0.116
akers-0.112
opl-0.111
eds-0.111
ards-0.110
Token something
Token ID1223
Feature activation-3.65e+00

Pos logit contributions
 hes+0.055
 toddler+0.054
 curiously+0.052
 Int+0.051
 intrigued+0.050
Neg logit contributions
akers-0.038
ines-0.037
ards-0.037
 lava-0.036
ons-0.036
Token else
Token ID2073
Feature activation-6.94e+00

Pos logit contributions
 hes+0.153
 toddler+0.148
 curiously+0.143
 Int+0.140
 hopefully+0.137
Neg logit contributions
ines-0.090
akers-0.090
agons-0.087
opl-0.085
ards-0.083
Token.
Token ID13
Feature activation-1.68e+00

Pos logit contributions
 hes+0.294
 toddler+0.285
 curiously+0.278
 Int+0.272
 hopefully+0.263
Neg logit contributions
akers-0.170
ines-0.166
opl-0.158
eds-0.158
agons-0.157
Token He
Token ID679
Feature activation-5.87e-01

Pos logit contributions
 hes+0.064
 curiously+0.060
 toddler+0.060
 convince+0.057
 marvel+0.057
Neg logit contributions
opl-0.044
smart-0.043
akers-0.043
 lava-0.043
arter-0.043
Token saw
Token ID2497
Feature activation-4.11e-01

Pos logit contributions
 toddler+0.019
 hes+0.018
 hopefully+0.017
 Int+0.017
 Und+0.017
Neg logit contributions
 years-0.018
ines-0.017
akers-0.017
ards-0.017
eds-0.017
Token a
Token ID257
Feature activation-1.50e+00

Pos logit contributions
 hes+0.015
 toddler+0.015
 curiously+0.015
 Int+0.014
 marvel+0.014
Neg logit contributions
akers-0.012
ines-0.012
opl-0.012
agons-0.012
eds-0.011
Token box
Token ID3091
Feature activation+6.59e-01

Pos logit contributions
 hes+0.057
 toddler+0.056
 curiously+0.055
 marvel+0.052
 intrigued+0.051
Neg logit contributions
akers-0.044
agons-0.044
opl-0.043
ines-0.043
eds-0.042
Token But
Token ID887
Feature activation-1.93e+00

Pos logit contributions
 hes+0.012
 toddler+0.012
 curiously+0.012
 marvel+0.011
 Int+0.011
Neg logit contributions
akers-0.009
ines-0.009
agons-0.009
opl-0.009
eds-0.008
Token then
Token ID788
Feature activation-1.69e+00

Pos logit contributions
 hes+0.073
 toddler+0.073
 curiously+0.069
 Int+0.068
 marvel+0.066
Neg logit contributions
akers-0.053
ines-0.052
eds-0.050
arter-0.049
ards-0.049
Token Mia
Token ID32189
Feature activation-1.26e+00

Pos logit contributions
 hes+0.075
 curiously+0.072
 toddler+0.070
 marvel+0.067
 hopefully+0.067
Neg logit contributions
agons-0.045
akers-0.044
opl-0.044
arl-0.042
eds-0.042
Token noticed
Token ID6810
Feature activation-1.22e+00

Pos logit contributions
 toddler+0.037
 hes+0.036
 curiously+0.034
 Int+0.033
 hopefully+0.032
Neg logit contributions
akers-0.044
ines-0.043
eds-0.043
ards-0.042
ons-0.042
Token something
Token ID1223
Feature activation-3.59e+00

Pos logit contributions
 hes+0.051
 curiously+0.048
 toddler+0.047
 hopefully+0.044
 Int+0.044
Neg logit contributions
akers-0.035
agons-0.035
opl-0.033
arl-0.033
ines-0.033
Token else
Token ID2073
Feature activation-6.74e+00

Pos logit contributions
 hes+0.143
 toddler+0.137
 curiously+0.136
 Int+0.128
 marvel+0.127
Neg logit contributions
akers-0.099
agons-0.097
ines-0.096
opl-0.095
eds-0.093
Token that
Token ID326
Feature activation-1.99e+00

Pos logit contributions
 hes+0.279
 curiously+0.266
 toddler+0.261
 marvel+0.249
 intrigued+0.245
Neg logit contributions
agons-0.185
akers-0.185
opl-0.183
arl-0.176
eds-0.175
Token made
Token ID925
Feature activation-3.39e-01

Pos logit contributions
 hes+0.080
 toddler+0.078
 curiously+0.076
 hopefully+0.071
 Int+0.071
Neg logit contributions
ines-0.051
akers-0.050
ards-0.050
ons-0.050
opl-0.049
Token her
Token ID607
Feature activation-1.95e+00

Pos logit contributions
 hes+0.011
 curiously+0.010
 toddler+0.009
 marvel+0.009
 hopefully+0.009
Neg logit contributions
akers-0.013
opl-0.013
agons-0.013
eds-0.013
arl-0.012
Token gasp
Token ID47440
Feature activation-2.61e+00

Pos logit contributions
 hes+0.070
 toddler+0.065
 curiously+0.065
 Int+0.062
 intrigued+0.060
Neg logit contributions
akers-0.061
ines-0.061
opl-0.060
eds-0.060
agons-0.059
Token.
Token ID13
Feature activation-7.13e-01

Pos logit contributions
 hes+0.106
 curiously+0.105
 toddler+0.098
 hopefully+0.095
 marvel+0.094
Neg logit contributions
opl-0.074
agons-0.073
akers-0.073
arl-0.071
eds-0.068
Token
Token ID198
Feature activation-6.77e-01

Pos logit contributions
 hes+0.074
 marvel+0.067
 curiously+0.066
 toddler+0.063
 intrigued+0.062
Neg logit contributions
akers-0.067
agons-0.065
 lava-0.064
arl-0.063
ards-0.063
Token
Token ID198
Feature activation-3.60e-01

Pos logit contributions
 hes+0.024
 curiously+0.021
 toddler+0.021
 marvel+0.021
 intrigued+0.020
Neg logit contributions
akers-0.022
agons-0.022
ines-0.021
arl-0.021
eds-0.021
TokenBaby
Token ID36534
Feature activation-1.65e-01

Pos logit contributions
 hes+0.013
 curiously+0.012
 toddler+0.012
 marvel+0.012
 Int+0.011
Neg logit contributions
akers-0.012
agons-0.011
eds-0.011
ines-0.011
arl-0.011
Token Bear
Token ID14732
Feature activation-1.43e+00

Pos logit contributions
 hes+0.005
 curiously+0.005
 toddler+0.005
 Int+0.005
 marvel+0.005
Neg logit contributions
akers-0.006
agons-0.005
ines-0.005
eds-0.005
opl-0.005
Token looked
Token ID3114
Feature activation-1.35e+00

Pos logit contributions
 hes+0.042
 toddler+0.040
 Int+0.039
 curiously+0.038
 hopefully+0.038
Neg logit contributions
akers-0.048
ards-0.048
ines-0.047
agons-0.045
eds-0.045
Token up
Token ID510
Feature activation-6.73e+00

Pos logit contributions
 hes+0.046
 toddler+0.043
 marvel+0.042
 Int+0.041
 mirac+0.040
Neg logit contributions
ines-0.042
akers-0.039
agons-0.039
 fires-0.039
ards-0.038
Token and
Token ID290
Feature activation-2.12e+00

Pos logit contributions
 hes+0.329
 Int+0.321
 toddler+0.318
Whoever+0.313
Whe+0.312
Neg logit contributions
 years-0.089
ines-0.083
,-0.080
!-0.080
!"-0.074
Token said
Token ID531
Feature activation-1.41e+00

Pos logit contributions
Whoever+0.071
 toddler+0.068
 hes+0.068
yle+0.066
 athlete+0.065
Neg logit contributions
ards-0.055
!"-0.055
akers-0.055
ons-0.052
 lava-0.051
Token,
Token ID11
Feature activation-5.69e-01

Pos logit contributions
 hes+0.071
 toddler+0.069
 curiously+0.068
 marvel+0.066
 Int+0.066
Neg logit contributions
akers-0.024
agons-0.022
ines-0.022
opl-0.021
eds-0.021
Token "
Token ID366
Feature activation-6.66e-01

Pos logit contributions
 hes+0.020
 toddler+0.019
 curiously+0.019
 marvel+0.018
 Int+0.018
Neg logit contributions
akers-0.017
agons-0.016
arl-0.016
ines-0.016
ards-0.016
TokenBut
Token ID1537
Feature activation-7.48e-01

Pos logit contributions
 hes+0.021
 curiously+0.020
 toddler+0.018
 Exc+0.018
 nervously+0.017
Neg logit contributions
akers-0.024
ww-0.021
eds-0.021
ards-0.020
arl-0.020
Token F
Token ID376
Feature activation-4.14e-01

Pos logit contributions
akers+0.022
ines+0.021
agons+0.021
opl+0.020
eds+0.020
Neg logit contributions
 hes-0.023
 toddler-0.021
 curiously-0.021
 marvel-0.020
 hopefully-0.020
Tokenuzz
Token ID4715
Feature activation-3.42e-01

Pos logit contributions
 hes+0.015
 curiously+0.014
 toddler+0.014
 marvel+0.013
 Int+0.013
Neg logit contributions
akers-0.013
agons-0.013
opl-0.013
eds-0.013
ines-0.012
Tokeny
Token ID88
Feature activation+8.95e-01

Pos logit contributions
 hes+0.014
 toddler+0.013
 curiously+0.013
 hopefully+0.013
 Int+0.013
Neg logit contributions
akers-0.009
ines-0.009
eds-0.009
agons-0.009
opl-0.008
Token saw
Token ID2497
Feature activation-3.48e-01

Pos logit contributions
akers+0.032
eds+0.030
ines+0.030
 years+0.030
agons+0.030
Neg logit contributions
 hes-0.027
 hopefully-0.025
 curiously-0.024
 toddler-0.024
 Int-0.023
Token something
Token ID1223
Feature activation-3.49e+00

Pos logit contributions
 hes+0.013
 curiously+0.013
 toddler+0.013
 hopefully+0.012
 marvel+0.012
Neg logit contributions
akers-0.010
agons-0.009
ines-0.009
opl-0.009
ards-0.009
Token up
Token ID510
Feature activation-6.69e+00

Pos logit contributions
 hes+0.149
 toddler+0.143
 curiously+0.140
 Int+0.136
 hopefully+0.135
Neg logit contributions
akers-0.083
ines-0.083
agons-0.082
opl-0.078
eds-0.077
Token ahead
Token ID4058
Feature activation-3.29e+00

Pos logit contributions
 hes+0.243
 toddler+0.232
 Int+0.223
 curiously+0.221
 Exc+0.212
Neg logit contributions
ines-0.201
akers-0.197
arl-0.191
eds-0.189
smart-0.181
Token and
Token ID290
Feature activation-1.17e+00

Pos logit contributions
 hes+0.146
 curiously+0.140
 toddler+0.138
 marvel+0.131
 Int+0.129
Neg logit contributions
akers-0.079
agons-0.078
opl-0.076
eds-0.074
ines-0.074
Token started
Token ID2067
Feature activation-1.27e+00

Pos logit contributions
 toddler+0.036
 hes+0.035
 curiously+0.033
 hopefully+0.032
 Int+0.030
Neg logit contributions
akers-0.038
ards-0.037
ines-0.037
smart-0.036
eds-0.036
Token running
Token ID2491
Feature activation-2.74e-01

Pos logit contributions
 toddler+0.049
 hes+0.047
 curiously+0.046
 hopefully+0.046
 Int+0.044
Neg logit contributions
eds-0.032
smart-0.032
ines-0.031
ards-0.031
akers-0.031
Token.
Token ID13
Feature activation-3.42e-01

Pos logit contributions
 hes+0.008
 toddler+0.008
 curiously+0.008
 Int+0.007
 hopefully+0.007
Neg logit contributions
akers-0.010
ines-0.010
agons-0.010
opl-0.010
eds-0.010
Tokenâ
Token ID22940
Feature activation-9.34e-01

Pos logit contributions
 hes+0.071
 curiously+0.067
 toddler+0.066
 marvel+0.064
 intrigued+0.062
Neg logit contributions
akers-0.062
agons-0.061
opl-0.061
ines-0.059
arl-0.059
Token
Token ID26391
Feature activation-2.05e+00

Pos logit contributions
 hes+0.024
 toddler+0.021
 Int+0.021
 curiously+0.021
 marvel+0.021
Neg logit contributions
akers-0.039
eds-0.039
ines-0.038
agons-0.038
arter-0.037
Token The
Token ID383
Feature activation+3.09e-01

Pos logit contributions
 hes+0.069
 toddler+0.065
 curiously+0.065
 Int+0.062
 marvel+0.061
Neg logit contributions
akers-0.069
agons-0.069
ines-0.068
opl-0.067
eds-0.066
Token bunny
Token ID44915
Feature activation-1.93e+00

Pos logit contributions
akers+0.011
agons+0.011
ines+0.011
opl+0.011
eds+0.011
Neg logit contributions
 hes-0.010
 toddler-0.010
 curiously-0.010
 Int-0.009
 marvel-0.009
Token looked
Token ID3114
Feature activation-1.10e+00

Pos logit contributions
 hes+0.059
 toddler+0.056
 curiously+0.055
 Int+0.052
 hopefully+0.051
Neg logit contributions
akers-0.069
ines-0.068
opl-0.066
agons-0.066
eds-0.066
Token up
Token ID510
Feature activation-6.68e+00

Pos logit contributions
 hes+0.034
 toddler+0.032
 curiously+0.031
 Int+0.029
 marvel+0.029
Neg logit contributions
akers-0.040
ines-0.039
agons-0.039
opl-0.038
eds-0.038
Token and
Token ID290
Feature activation-1.79e+00

Pos logit contributions
 toddler+0.281
 hes+0.278
 Int+0.265
 Exc+0.254
 convince+0.252
Neg logit contributions
ines-0.147
!"-0.146
arl-0.139
 years-0.136
akers-0.134
Token shook
Token ID14682
Feature activation-1.69e+00

Pos logit contributions
 toddler+0.060
 hes+0.056
Whoever+0.053
 aid+0.053
 convince+0.052
Neg logit contributions
!"-0.055
ards-0.051
 lava-0.050
smart-0.049
ons-0.049
Token his
Token ID465
Feature activation-1.19e+00

Pos logit contributions
 hes+0.064
 toddler+0.064
 curiously+0.060
 Int+0.059
 intrigued+0.058
Neg logit contributions
ines-0.046
akers-0.046
eds-0.046
opl-0.046
agons-0.045
Token ears
Token ID11368
Feature activation+1.45e+00

Pos logit contributions
 hes+0.037
 toddler+0.035
 curiously+0.035
 Int+0.033
 marvel+0.032
Neg logit contributions
akers-0.043
agons-0.042
opl-0.042
ines-0.041
eds-0.041
Token,
Token ID11
Feature activation-1.26e+00

Pos logit contributions
akers+0.029
agons+0.028
opl+0.028
ines+0.028
eds+0.027
Neg logit contributions
 hes-0.069
 toddler-0.066
 curiously-0.066
 Int-0.063
 marvel-0.062
Token
Token ID198
Feature activation-6.79e-01

Pos logit contributions
 hes+0.035
 toddler+0.033
 curiously+0.032
 marvel+0.030
 intrigued+0.030
Neg logit contributions
ines-0.034
akers-0.034
opl-0.033
agons-0.033
eds-0.033
TokenThe
Token ID464
Feature activation+6.82e-01

Pos logit contributions
 toddler+0.027
 curiously+0.027
 hes+0.027
 Int+0.026
 intrigued+0.026
Neg logit contributions
opl-0.019
arter-0.019
ines-0.019
agons-0.019
akers-0.018
Token younger
Token ID7099
Feature activation-1.83e+00

Pos logit contributions
akers+0.026
agons+0.025
ines+0.025
arl+0.025
opl+0.025
Neg logit contributions
 hes-0.020
 curiously-0.018
 toddler-0.018
 marvel-0.017
 Int-0.017
Token sister
Token ID6621
Feature activation+8.22e-01

Pos logit contributions
 hes+0.065
 curiously+0.062
 toddler+0.060
 Int+0.060
 marvel+0.059
Neg logit contributions
akers-0.058
ines-0.055
eds-0.053
agons-0.053
smart-0.053
Token looked
Token ID3114
Feature activation-1.14e+00

Pos logit contributions
akers+0.026
ons+0.026
ards+0.026
ines+0.026
smart+0.025
Neg logit contributions
 hes-0.024
 toddler-0.024
 curiously-0.023
 hopefully-0.022
 Int-0.022
Token up
Token ID510
Feature activation-6.68e+00

Pos logit contributions
 hes+0.036
 toddler+0.035
 curiously+0.032
 Int+0.032
 hopefully+0.031
Neg logit contributions
ines-0.039
akers-0.038
ards-0.037
agons-0.036
eds-0.036
Token and
Token ID290
Feature activation-1.80e+00

Pos logit contributions
 hes+0.320
 toddler+0.316
 Int+0.312
Whoever+0.298
 Exc+0.297
Neg logit contributions
ines-0.103
 years-0.100
!"-0.096
akers-0.093
arl-0.092
Token smiled
Token ID13541
Feature activation-2.01e+00

Pos logit contributions
 toddler+0.065
 hes+0.061
Whoever+0.061
 aid+0.061
 athlete+0.058
Neg logit contributions
ons-0.044
!"-0.044
ards-0.043
smart-0.043
akers-0.043
Token.
Token ID13
Feature activation-1.54e-01

Pos logit contributions
 hes+0.111
 toddler+0.110
 Int+0.107
 Exc+0.106
Whoever+0.105
Neg logit contributions
akers-0.014
ines-0.014
!"-0.013
arl-0.012
smart-0.011
Token "
Token ID366
Feature activation-1.01e+00

Pos logit contributions
 hes+0.006
 toddler+0.005
 curiously+0.005
 marvel+0.005
 hopefully+0.005
Neg logit contributions
akers-0.005
ines-0.004
agons-0.004
arl-0.004
opl-0.004
TokenI
Token ID40
Feature activation-1.85e+00

Pos logit contributions
 hes+0.033
 curiously+0.031
 toddler+0.030
 hopefully+0.028
 nervously+0.027
Neg logit contributions
akers-0.036
opl-0.035
ines-0.034
smart-0.034
agons-0.033
Token cat
Token ID3797
Feature activation+8.94e-01

Pos logit contributions
akers+0.002
agons+0.002
opl+0.002
ines+0.002
!"+0.002
Neg logit contributions
 hes-0.003
 curiously-0.003
 Int-0.003
 intrigued-0.003
 toddler-0.002
Token?"
Token ID1701
Feature activation-2.99e+00

Pos logit contributions
akers+0.020
agons+0.020
ines+0.019
opl+0.019
eds+0.019
Neg logit contributions
 hes-0.040
 toddler-0.038
 curiously-0.038
 intrigued-0.036
 marvel-0.036
Token The
Token ID383
Feature activation+5.21e-01

Pos logit contributions
 hes+0.108
 toddler+0.102
 curiously+0.102
 marvel+0.094
 hopefully+0.094
Neg logit contributions
akers-0.093
agons-0.091
opl-0.090
ines-0.090
eds-0.089
Token cat
Token ID3797
Feature activation+6.80e-01

Pos logit contributions
akers+0.021
ines+0.021
agons+0.020
opl+0.020
eds+0.020
Neg logit contributions
 toddler-0.015
 hes-0.014
 curiously-0.014
 Int-0.014
 3-0.013
Token looked
Token ID3114
Feature activation-1.11e+00

Pos logit contributions
ines+0.023
akers+0.023
agons+0.023
opl+0.023
eds+0.023
Neg logit contributions
 hes-0.022
 toddler-0.021
 curiously-0.020
 hopefully-0.019
 Int-0.019
Token up
Token ID510
Feature activation-6.68e+00

Pos logit contributions
 hes+0.032
 toddler+0.031
 hopefully+0.028
 Int+0.028
 curiously+0.028
Neg logit contributions
ines-0.041
akers-0.039
eds-0.039
agons-0.038
ards-0.038
Token to
Token ID284
Feature activation-1.85e+00

Pos logit contributions
 hes+0.335
 toddler+0.332
 Int+0.325
Whoever+0.321
 convince+0.316
Neg logit contributions
ines-0.083
arl-0.078
!"-0.077
!-0.074
 years-0.071
Token see
Token ID766
Feature activation-2.50e-01

Pos logit contributions
 hes+0.069
 toddler+0.069
 curiously+0.067
 intrigued+0.065
 Int+0.063
Neg logit contributions
opl-0.050
py-0.050
arter-0.050
akers-0.050
eds-0.050
Token a
Token ID257
Feature activation-2.12e+00

Pos logit contributions
 hes+0.010
 curiously+0.010
 toddler+0.009
 Int+0.009
 hopefully+0.009
Neg logit contributions
akers-0.008
agons-0.008
ines-0.007
eds-0.007
opl-0.007
Token rabbit
Token ID22746
Feature activation-1.82e+00

Pos logit contributions
 hes+0.070
 toddler+0.068
 curiously+0.067
 marvel+0.062
 intrigued+0.061
Neg logit contributions
agons-0.075
akers-0.074
opl-0.074
ines-0.073
eds-0.071
Token spring
Token ID6076
Feature activation-1.71e+00

Pos logit contributions
 hes+0.076
 curiously+0.073
 toddler+0.073
 Int+0.068
 marvel+0.068
Neg logit contributions
akers-0.047
agons-0.046
opl-0.046
ines-0.045
eds-0.045
Token of
Token ID286
Feature activation-8.34e-01

Pos logit contributions
 hes+0.004
 curiously+0.004
 toddler+0.004
 Int+0.003
 marvel+0.003
Neg logit contributions
akers-0.002
agons-0.002
opl-0.002
eds-0.002
ines-0.002
Token wheat
Token ID17135
Feature activation-1.14e+00

Pos logit contributions
 hes+0.035
 curiously+0.035
 toddler+0.035
 marvel+0.033
 Int+0.032
Neg logit contributions
opl-0.021
akers-0.021
eds-0.020
arl-0.020
agons-0.020
Token.
Token ID13
Feature activation-6.90e-01

Pos logit contributions
 hes+0.050
 curiously+0.048
 toddler+0.048
 marvel+0.045
 Int+0.045
Neg logit contributions
akers-0.027
agons-0.026
opl-0.026
ines-0.026
eds-0.025
Token Paul
Token ID3362
Feature activation-2.01e-01

Pos logit contributions
 hes+0.029
 toddler+0.026
 curiously+0.025
 hopefully+0.024
 convince+0.024
Neg logit contributions
opl-0.018
iren-0.016
agons-0.016
ay-0.016
ines-0.016
Token looked
Token ID3114
Feature activation-1.13e+00

Pos logit contributions
 hes+0.007
 toddler+0.006
 hopefully+0.006
 Int+0.006
 curiously+0.006
Neg logit contributions
akers-0.006
ards-0.006
ons-0.006
ines-0.006
 years-0.006
Token up
Token ID510
Feature activation-6.67e+00

Pos logit contributions
 hes+0.036
 toddler+0.035
 Int+0.031
 curiously+0.031
 hopefully+0.031
Neg logit contributions
ines-0.038
akers-0.037
 lava-0.036
agons-0.035
ards-0.035
Token at
Token ID379
Feature activation-3.60e+00

Pos logit contributions
 hes+0.322
 toddler+0.320
 Int+0.310
 convince+0.306
 persuade+0.303
Neg logit contributions
ines-0.091
!"-0.090
!-0.088
 years-0.087
 lava-0.084
Token the
Token ID262
Feature activation-5.16e-01

Pos logit contributions
 hes+0.132
 curiously+0.126
 toddler+0.126
 marvel+0.117
 Int+0.116
Neg logit contributions
akers-0.116
agons-0.113
ines-0.111
opl-0.110
eds-0.108
Token wheat
Token ID17135
Feature activation-5.52e-01

Pos logit contributions
 hes+0.020
 toddler+0.019
 curiously+0.019
 marvel+0.018
 intrigued+0.018
Neg logit contributions
akers-0.015
agons-0.015
ines-0.015
opl-0.015
eds-0.014
Token and
Token ID290
Feature activation-1.73e+00

Pos logit contributions
 hes+0.022
 curiously+0.021
 toddler+0.021
 marvel+0.020
 Int+0.020
Neg logit contributions
akers-0.015
agons-0.015
opl-0.015
ines-0.015
eds-0.014
Token found
Token ID1043
Feature activation-1.47e-01

Pos logit contributions
 toddler+0.054
 hes+0.052
 aid+0.049
 convince+0.048
 Und+0.047
Neg logit contributions
 lava-0.051
ards-0.050
akers-0.049
 years-0.049
!"-0.049
Token
Token ID198
Feature activation-8.20e-01

Pos logit contributions
 hes+0.066
 toddler+0.063
 curiously+0.062
 Int+0.060
 marvel+0.059
Neg logit contributions
agons-0.078
akers-0.077
ines-0.075
opl-0.075
eds-0.074
Token
Token ID198
Feature activation-4.86e-01

Pos logit contributions
 hes+0.023
 curiously+0.021
 toddler+0.021
 Int+0.019
 marvel+0.019
Neg logit contributions
akers-0.033
agons-0.033
opl-0.032
ines-0.032
eds-0.032
TokenThe
Token ID464
Feature activation+3.54e-01

Pos logit contributions
 hes+0.017
 toddler+0.017
 curiously+0.016
 Int+0.016
 intrigued+0.016
Neg logit contributions
akers-0.015
agons-0.015
ines-0.015
arter-0.015
opl-0.015
Token girl
Token ID2576
Feature activation-9.46e-01

Pos logit contributions
akers+0.012
agons+0.012
opl+0.011
arl+0.011
ines+0.011
Neg logit contributions
 hes-0.012
 curiously-0.011
 toddler-0.011
 Int-0.011
 marvel-0.011
Token looked
Token ID3114
Feature activation-1.10e+00

Pos logit contributions
 hes+0.029
 toddler+0.029
 hopefully+0.027
 curiously+0.027
 Int+0.026
Neg logit contributions
ines-0.029
 years-0.029
akers-0.029
eds-0.028
ards-0.028
Token up
Token ID510
Feature activation-6.67e+00

Pos logit contributions
 hes+0.034
 toddler+0.033
 hopefully+0.030
 Int+0.030
 curiously+0.030
Neg logit contributions
ines-0.039
akers-0.036
eds-0.036
agons-0.035
opl-0.035
Token and
Token ID290
Feature activation-2.08e+00

Pos logit contributions
 hes+0.308
 toddler+0.305
 Int+0.296
Whoever+0.291
 convince+0.284
Neg logit contributions
!"-0.117
ines-0.115
 years-0.108
arl-0.106
!-0.105
Token saw
Token ID2497
Feature activation-6.74e-01

Pos logit contributions
 toddler+0.072
Whoever+0.070
 hes+0.069
 convince+0.065
 athlete+0.063
Neg logit contributions
!"-0.058
eds-0.053
smart-0.053
 boom-0.053
ards-0.053
Token a
Token ID257
Feature activation-1.95e+00

Pos logit contributions
 hes+0.025
 curiously+0.024
 toddler+0.024
 Int+0.023
 marvel+0.022
Neg logit contributions
akers-0.020
opl-0.020
agons-0.019
ines-0.019
eds-0.019
Token lady
Token ID10846
Feature activation-1.91e+00

Pos logit contributions
 hes+0.070
 toddler+0.067
 curiously+0.066
 marvel+0.062
 Int+0.062
Neg logit contributions
akers-0.062
agons-0.061
opl-0.060
ines-0.060
eds-0.059
Token with
Token ID351
Feature activation-3.29e+00

Pos logit contributions
 hes+0.074
 toddler+0.071
 curiously+0.069
 Int+0.067
 intrigued+0.065
Neg logit contributions
akers-0.054
ines-0.053
opl-0.052
eds-0.051
agons-0.050
Token but
Token ID475
Feature activation-2.32e+00

Pos logit contributions
akers+0.007
ards+0.006
opl+0.006
ines+0.006
 lava+0.006
Neg logit contributions
 hes-0.008
 toddler-0.008
 curiously-0.008
 Int-0.008
 hopefully-0.007
Token then
Token ID788
Feature activation-1.82e+00

Pos logit contributions
 hes+0.083
 toddler+0.081
 Int+0.080
 curiously+0.080
 Exc+0.076
Neg logit contributions
 lava-0.064
akers-0.063
ards-0.062
 years-0.062
arter-0.062
Token Jack
Token ID3619
Feature activation-2.15e+00

Pos logit contributions
 hes+0.065
 toddler+0.062
 curiously+0.062
 Int+0.059
 hopefully+0.057
Neg logit contributions
akers-0.054
 lava-0.052
agons-0.051
ines-0.051
ons-0.050
Token spotted
Token ID13489
Feature activation-1.52e+00

Pos logit contributions
 hes+0.062
 toddler+0.060
 curiously+0.060
 Int+0.058
 hopefully+0.058
Neg logit contributions
akers-0.074
eds-0.072
ines-0.071
ards-0.070
 years-0.069
Token something
Token ID1223
Feature activation-3.45e+00

Pos logit contributions
 hes+0.057
 toddler+0.055
 curiously+0.054
 Int+0.052
 intrigued+0.052
Neg logit contributions
akers-0.042
 lava-0.041
opl-0.041
agons-0.041
ards-0.040
Token up
Token ID510
Feature activation-6.66e+00

Pos logit contributions
 hes+0.144
 toddler+0.140
 Int+0.134
 curiously+0.134
 hopefully+0.131
Neg logit contributions
ines-0.083
akers-0.082
agons-0.080
arter-0.077
opl-0.076
Token ahead
Token ID4058
Feature activation-3.36e+00

Pos logit contributions
 hes+0.241
 toddler+0.227
 Int+0.223
 curiously+0.218
 Exc+0.212
Neg logit contributions
ines-0.200
akers-0.197
arl-0.193
eds-0.191
agons-0.184
Token.
Token ID13
Feature activation-9.85e-01

Pos logit contributions
 hes+0.152
 curiously+0.147
 toddler+0.144
 marvel+0.137
 hopefully+0.136
Neg logit contributions
akers-0.078
agons-0.076
opl-0.073
eds-0.072
ines-0.072
Token Around
Token ID16824
Feature activation-2.19e+00

Pos logit contributions
 hes+0.042
 curiously+0.039
 toddler+0.038
 convince+0.037
 marvel+0.037
Neg logit contributions
opl-0.023
agons-0.022
akers-0.022
 lava-0.022
arl-0.022
Token a
Token ID257
Feature activation+1.07e+00

Pos logit contributions
 hes+0.094
 toddler+0.090
 curiously+0.088
 intrigued+0.086
 Int+0.085
Neg logit contributions
opl-0.050
agons-0.049
eds-0.048
ines-0.048
akers-0.048
Token bend
Token ID19396
Feature activation-9.51e-01

Pos logit contributions
akers+0.033
agons+0.031
opl+0.030
arter+0.030
arl+0.030
Neg logit contributions
 hes-0.042
 curiously-0.041
 toddler-0.041
 Int-0.038
 marvel-0.038
Token away
Token ID1497
Feature activation+2.28e+00

Pos logit contributions
 hes+0.011
 toddler+0.010
 curiously+0.010
 Int+0.009
 marvel+0.009
Neg logit contributions
akers-0.011
agons-0.011
ines-0.011
opl-0.011
eds-0.011
Token but
Token ID475
Feature activation-2.73e+00

Pos logit contributions
akers+0.063
agons+0.062
opl+0.060
ines+0.059
eds+0.059
Neg logit contributions
 hes-0.094
 curiously-0.091
 toddler-0.089
 marvel-0.084
 hopefully-0.084
Token he
Token ID339
Feature activation-3.09e+00

Pos logit contributions
 toddler+0.111
 hes+0.107
 Int+0.107
 Exc+0.104
 curiously+0.101
Neg logit contributions
ards-0.058
 lava-0.057
 years-0.055
smart-0.055
 that-0.053
Token noticed
Token ID6810
Feature activation-1.24e+00

Pos logit contributions
 toddler+0.106
 hes+0.104
 Int+0.100
 curiously+0.098
 Exc+0.097
Neg logit contributions
ines-0.093
akers-0.093
eds-0.092
ards-0.089
 years-0.089
Token something
Token ID1223
Feature activation-3.63e+00

Pos logit contributions
 hes+0.052
 toddler+0.050
 curiously+0.049
 Int+0.047
 marvel+0.047
Neg logit contributions
akers-0.031
agons-0.030
opl-0.030
ines-0.030
eds-0.030
Token else
Token ID2073
Feature activation-6.64e+00

Pos logit contributions
 hes+0.148
 toddler+0.144
 curiously+0.140
 Int+0.134
 hopefully+0.133
Neg logit contributions
akers-0.093
ines-0.093
agons-0.092
opl-0.089
eds-0.088
Token on
Token ID319
Feature activation-3.05e-01

Pos logit contributions
 hes+0.287
 toddler+0.277
 curiously+0.274
 Int+0.261
 marvel+0.258
Neg logit contributions
akers-0.161
ines-0.155
agons-0.154
opl-0.153
eds-0.150
Token the
Token ID262
Feature activation-2.11e-01

Pos logit contributions
 hes+0.009
 curiously+0.009
 toddler+0.009
 hopefully+0.008
 marvel+0.008
Neg logit contributions
akers-0.013
ines-0.012
agons-0.012
eds-0.011
arl-0.011
Token island
Token ID7022
Feature activation-1.40e-01

Pos logit contributions
 hes+0.007
 toddler+0.006
 curiously+0.006
 marvel+0.006
 Int+0.006
Neg logit contributions
akers-0.008
opl-0.008
agons-0.008
ines-0.008
arl-0.007
Token.
Token ID13
Feature activation-7.84e-01

Pos logit contributions
 hes+0.006
 curiously+0.006
 toddler+0.006
 marvel+0.006
 intrigued+0.006
Neg logit contributions
akers-0.003
agons-0.003
opl-0.003
arl-0.003
ines-0.003
Token It
Token ID632
Feature activation+1.00e-01

Pos logit contributions
 hes+0.032
 toddler+0.029
 curiously+0.029
 marvel+0.029
 convince+0.029
Neg logit contributions
 lava-0.019
agons-0.018
akers-0.018
arter-0.018
arl-0.018
Token at
Token ID379
Feature activation-2.73e+00

Pos logit contributions
 hes+0.051
 curiously+0.049
 toddler+0.048
 marvel+0.046
 Int+0.046
Neg logit contributions
akers-0.027
agons-0.026
opl-0.025
ines-0.025
eds-0.025
Token him
Token ID683
Feature activation-1.32e+00

Pos logit contributions
 hes+0.113
 toddler+0.108
 curiously+0.108
 intrigued+0.104
 Int+0.104
Neg logit contributions
opl-0.071
eds-0.067
ines-0.066
akers-0.065
agons-0.063
Token.
Token ID13
Feature activation-5.86e-01

Pos logit contributions
 hes+0.059
 curiously+0.058
 toddler+0.056
 marvel+0.054
 hopefully+0.053
Neg logit contributions
akers-0.032
agons-0.031
opl-0.029
arl-0.029
eds-0.029
Token Bill
Token ID3941
Feature activation-5.32e-01

Pos logit contributions
 hes+0.021
 toddler+0.020
 curiously+0.020
 hopefully+0.018
 marvel+0.017
Neg logit contributions
opl-0.019
agons-0.017
akers-0.017
ines-0.017
 lava-0.017
Token looked
Token ID3114
Feature activation-1.05e+00

Pos logit contributions
 toddler+0.016
 hes+0.016
 curiously+0.015
 hopefully+0.014
 Int+0.014
Neg logit contributions
ines-0.019
eds-0.018
ards-0.018
akers-0.018
ons-0.018
Token up
Token ID510
Feature activation-6.63e+00

Pos logit contributions
 toddler+0.033
 hes+0.033
 hopefully+0.030
 mirac+0.030
Whoever+0.029
Neg logit contributions
ines-0.034
ards-0.032
!"-0.031
 lava-0.031
opl-0.031
Token and
Token ID290
Feature activation-1.86e+00

Pos logit contributions
 toddler+0.338
 hes+0.334
Whoever+0.326
 convince+0.324
 Int+0.322
Neg logit contributions
,-0.077
!-0.077
 years-0.070
!"-0.069
ines-0.067
Token he
Token ID339
Feature activation-2.74e+00

Pos logit contributions
 toddler+0.065
 hes+0.061
Whoever+0.059
 convince+0.058
 aid+0.057
Neg logit contributions
!"-0.051
ards-0.050
eds-0.050
 boom-0.048
ons-0.048
Token saw
Token ID2497
Feature activation-1.02e+00

Pos logit contributions
 toddler+0.094
 hes+0.088
 hopefully+0.083
Whoever+0.081
 persuade+0.081
Neg logit contributions
eds-0.083
ines-0.082
 years-0.078
ards-0.078
ons-0.078
Token a
Token ID257
Feature activation-2.14e+00

Pos logit contributions
 hes+0.038
 curiously+0.036
 toddler+0.036
 Int+0.034
 marvel+0.034
Neg logit contributions
akers-0.031
agons-0.031
opl-0.030
ines-0.030
eds-0.030
Token huge
Token ID3236
Feature activation-5.20e-01

Pos logit contributions
 hes+0.075
 toddler+0.072
 curiously+0.071
 marvel+0.066
 Int+0.066
Neg logit contributions
akers-0.070
agons-0.069
opl-0.068
ines-0.068
eds-0.067
Token a
Token ID257
Feature activation-1.63e+00

Pos logit contributions
akers+0.086
agons+0.085
ines+0.085
opl+0.084
arter+0.082
Neg logit contributions
 hes-0.095
 toddler-0.089
 curiously-0.088
 marvel-0.083
 intrigued-0.083
Token sign
Token ID1051
Feature activation+6.18e-01

Pos logit contributions
 hes+0.068
 curiously+0.065
 toddler+0.065
 marvel+0.061
 intrigued+0.061
Neg logit contributions
akers-0.042
agons-0.041
opl-0.041
ines-0.041
eds-0.040
Token that
Token ID326
Feature activation-1.52e-01

Pos logit contributions
akers+0.014
agons+0.013
eds+0.013
opl+0.013
arter+0.012
Neg logit contributions
 hes-0.028
 curiously-0.028
 toddler-0.027
 intrigued-0.026
 marvel-0.026
Token said
Token ID531
Feature activation-5.92e-01

Pos logit contributions
 hes+0.004
 toddler+0.004
 curiously+0.004
 Int+0.004
 hopefully+0.004
Neg logit contributions
akers-0.006
agons-0.006
ines-0.006
opl-0.006
eds-0.006
Token "
Token ID366
Feature activation+4.27e-02

Pos logit contributions
 hes+0.014
 curiously+0.012
 toddler+0.011
 intrigued+0.011
 hopefully+0.011
Neg logit contributions
agons-0.028
ines-0.028
eds-0.028
akers-0.028
opl-0.027
TokenWhe
Token ID10842
Feature activation-6.62e+00

Pos logit contributions
akers+0.002
agons+0.001
opl+0.001
ines+0.001
eds+0.001
Neg logit contributions
 hes-0.001
 curiously-0.001
 toddler-0.001
 Int-0.001
 marvel-0.001
Tokenat
Token ID265
Feature activation-7.06e-02

Pos logit contributions
 hes+0.288
 toddler+0.278
 curiously+0.272
 marvel+0.261
 Int+0.260
Neg logit contributions
agons-0.158
opl-0.151
akers-0.147
arl-0.144
arter-0.142
Token for
Token ID329
Feature activation+8.24e-01

Pos logit contributions
 toddler+0.003
 curiously+0.003
 hes+0.003
 intrigued+0.003
 eagerly+0.003
Neg logit contributions
ines-0.002
agons-0.002
opl-0.002
arter-0.002
ands-0.002
Token Sale
Token ID16467
Feature activation+2.63e+00

Pos logit contributions
ines+0.031
opl+0.030
arter+0.029
akers+0.028
agons+0.028
Neg logit contributions
 hes-0.026
 toddler-0.023
 3-0.023
 curiously-0.023
 daughter-0.021
Token".
Token ID1911
Feature activation-1.12e+00

Pos logit contributions
opl+0.061
arter+0.060
py+0.058
agons+0.057
ines+0.056
Neg logit contributions
 toddler-0.115
 curiously-0.114
 hes-0.112
 3-0.104
 intrigued-0.104
Token Lily
Token ID20037
Feature activation-3.68e-01

Pos logit contributions
 hes+0.043
 toddler+0.041
 curiously+0.041
 Int+0.039
 marvel+0.038
Neg logit contributions
akers-0.033
agons-0.033
opl-0.033
ines-0.032
eds-0.032
Token noise
Token ID7838
Feature activation-1.31e-01

Pos logit contributions
 hes+0.023
 curiously+0.023
 toddler+0.023
 marvel+0.022
 intrigued+0.021
Neg logit contributions
akers-0.014
agons-0.014
ines-0.013
opl-0.013
ons-0.013
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
 hes+0.005
 curiously+0.005
 hopefully+0.004
 toddler+0.004
 marvel+0.004
Neg logit contributions
agons-0.004
akers-0.004
opl-0.004
eds-0.004
arter-0.004
Token
Token ID198
Feature activation+2.22e-02

Pos logit contributions
opl+0.005
akers+0.005
agons+0.005
ines+0.005
arl+0.005
Neg logit contributions
 hes-0.006
 toddler-0.006
 curiously-0.006
 marvel-0.005
 intrigued-0.005
Token
Token ID198
Feature activation+7.55e-02

Pos logit contributions
akers+0.001
agons+0.001
opl+0.001
ines+0.001
eds+0.001
Neg logit contributions
 hes-0.001
 curiously-0.001
 toddler-0.001
 Int-0.001
 marvel-0.001
Token"
Token ID1
Feature activation-4.18e-01

Pos logit contributions
akers+0.002
agons+0.002
opl+0.002
ines+0.002
eds+0.002
Neg logit contributions
 hes-0.003
 curiously-0.003
 toddler-0.003
 Int-0.003
 marvel-0.003
TokenWhe
Token ID10842
Feature activation-6.62e+00

Pos logit contributions
 curiously+0.016
 hes+0.016
 toddler+0.015
 intrigued+0.014
 anx+0.014
Neg logit contributions
arl-0.011
akers-0.011
 lava-0.011
ards-0.011
iren-0.011
Tokene
Token ID68
Feature activation+2.82e+00

Pos logit contributions
 hes+0.323
 toddler+0.308
 curiously+0.306
 marvel+0.294
 Int+0.291
Neg logit contributions
agons-0.131
akers-0.127
opl-0.121
arl-0.117
arter-0.115
Token!"
Token ID2474
Feature activation-1.50e+00

Pos logit contributions
agons+0.056
akers+0.055
opl+0.053
ines+0.053
arter+0.052
Neg logit contributions
 hes-0.136
 curiously-0.131
 toddler-0.127
 marvel-0.123
 intrigued-0.122
Token Anna
Token ID11735
Feature activation-1.29e+00

Pos logit contributions
 hes+0.057
 toddler+0.057
 curiously+0.055
 marvel+0.052
 intrigued+0.052
Neg logit contributions
ards-0.039
akers-0.038
ines-0.038
agons-0.038
opl-0.038
Token and
Token ID290
Feature activation-1.86e+00

Pos logit contributions
 toddler+0.051
 hes+0.049
 curiously+0.049
 hopefully+0.048
 mirac+0.046
Neg logit contributions
!"-0.032
ines-0.032
ards-0.030
opl-0.029
ocus-0.028
Token Ben
Token ID3932
Feature activation-1.80e+00

Pos logit contributions
 curiously+0.077
 toddler+0.077
 intrigued+0.077
 hes+0.077
 marvel+0.075
Neg logit contributions
 ow-0.040
 snakes-0.038
arl-0.038
ards-0.036
 Kim-0.036
Tokenâ
Token ID22940
Feature activation-5.64e-01

Pos logit contributions
 hes+0.063
 curiously+0.060
 toddler+0.058
 marvel+0.057
 intrigued+0.054
Neg logit contributions
opl-0.054
agons-0.053
akers-0.053
smart-0.053
arl-0.053
Token
Token ID26391
Feature activation-1.67e+00

Pos logit contributions
 hes+0.016
 toddler+0.014
 curiously+0.014
 Int+0.014
 marvel+0.013
Neg logit contributions
akers-0.023
eds-0.022
agons-0.022
ines-0.022
arter-0.022
Token
Token ID628
Feature activation-1.50e+00

Pos logit contributions
 hes+0.052
 curiously+0.050
 toddler+0.049
 hopefully+0.045
 marvel+0.045
Neg logit contributions
akers-0.060
opl-0.059
agons-0.059
ines-0.059
eds-0.058
Token Jim
Token ID5395
Feature activation-2.60e+00

Pos logit contributions
 hes+0.048
 curiously+0.046
 toddler+0.045
 marvel+0.042
 hopefully+0.042
Neg logit contributions
akers-0.051
ines-0.051
opl-0.051
agons-0.049
arl-0.049
Token looked
Token ID3114
Feature activation-8.26e-01

Pos logit contributions
 hes+0.069
 toddler+0.065
 curiously+0.064
 Int+0.059
 hopefully+0.059
Neg logit contributions
akers-0.104
ines-0.102
opl-0.101
agons-0.101
eds-0.100
Token up
Token ID510
Feature activation-6.62e+00

Pos logit contributions
 hes+0.027
 toddler+0.026
 Int+0.024
 convince+0.024
 mirac+0.023
Neg logit contributions
ines-0.027
akers-0.025
smart-0.025
ards-0.025
eds-0.024
Token and
Token ID290
Feature activation-2.17e+00

Pos logit contributions
 toddler+0.295
 hes+0.294
 Int+0.291
Whoever+0.287
Whe+0.280
Neg logit contributions
!"-0.111
ines-0.111
!-0.109
 years-0.106
arl-0.099
Token around
Token ID1088
Feature activation-3.47e+00

Pos logit contributions
Whoever+0.079
 toddler+0.079
 hes+0.075
 convince+0.075
 aid+0.074
Neg logit contributions
!"-0.048
ards-0.047
!-0.045
 that-0.044
smart-0.043
Token the
Token ID262
Feature activation+2.76e-01

Pos logit contributions
 toddler+0.175
 Int+0.172
 hes+0.171
 Exc+0.170
 curiously+0.167
Neg logit contributions
ines-0.044
 years-0.037
smart-0.037
akers-0.036
ards-0.036
Token forest
Token ID8222
Feature activation+1.55e+00

Pos logit contributions
akers+0.010
agons+0.009
opl+0.009
ines+0.009
arl+0.009
Neg logit contributions
 toddler-0.009
 curiously-0.009
 hes-0.009
 marvel-0.008
 intrigued-0.008
Token.
Token ID13
Feature activation-8.01e-01

Pos logit contributions
akers+0.031
ines+0.031
ards+0.029
opl+0.028
eds+0.028
Neg logit contributions
 hes-0.072
 toddler-0.070
 curiously-0.068
 Int-0.067
 Exc-0.065
Token that
Token ID326
Feature activation+1.59e-01

Pos logit contributions
arl+0.004
agons+0.004
akers+0.004
opl+0.004
eds+0.004
Neg logit contributions
 curiously-0.005
 hes-0.005
 toddler-0.005
 marvel-0.004
 eagerly-0.004
Token it
Token ID340
Feature activation-1.43e+00

Pos logit contributions
 lava+0.004
opl+0.004
akers+0.004
ards+0.004
ons+0.004
Neg logit contributions
 hes-0.006
 curiously-0.006
 toddler-0.006
 Int-0.006
 hopefully-0.006
Token made
Token ID925
Feature activation+4.78e-01

Pos logit contributions
 hes+0.058
 toddler+0.057
 curiously+0.055
 hopefully+0.053
 Int+0.051
Neg logit contributions
ines-0.037
ards-0.037
 lava-0.036
eds-0.036
akers-0.036
Token everyone
Token ID2506
Feature activation-2.54e+00

Pos logit contributions
akers+0.016
agons+0.016
opl+0.015
ines+0.015
eds+0.015
Neg logit contributions
 hes-0.017
 curiously-0.016
 toddler-0.016
 marvel-0.015
 Int-0.015
Token look
Token ID804
Feature activation-7.56e-01

Pos logit contributions
 hes+0.085
 toddler+0.081
 curiously+0.080
 Int+0.074
 hopefully+0.073
Neg logit contributions
akers-0.086
ines-0.084
opl-0.084
agons-0.083
eds-0.083
Token up
Token ID510
Feature activation-6.62e+00

Pos logit contributions
 hes+0.030
 toddler+0.028
 curiously+0.028
 Int+0.026
 marvel+0.026
Neg logit contributions
akers-0.022
agons-0.021
opl-0.021
ines-0.021
eds-0.020
Token in
Token ID287
Feature activation-2.18e+00

Pos logit contributions
 toddler+0.278
 hes+0.277
 Int+0.271
 Exc+0.265
 curiously+0.245
Neg logit contributions
ines-0.153
!"-0.148
akers-0.145
 years-0.141
eds-0.137
Token wonder
Token ID4240
Feature activation+1.98e+00

Pos logit contributions
 hes+0.077
 curiously+0.073
 toddler+0.072
 marvel+0.068
 Int+0.067
Neg logit contributions
akers-0.072
agons-0.071
opl-0.070
ines-0.070
eds-0.069
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
akers+0.047
agons+0.047
opl+0.045
eds+0.044
arter+0.043
Neg logit contributions
 hes-0.089
 curiously-0.086
 toddler-0.082
 marvel-0.080
 hopefully-0.080
Token It
Token ID632
Feature activation+7.14e-01

Pos logit contributions
 hes+0.001
 toddler+0.001
 curiously+0.001
 convince+0.001
 persuade+0.001
Neg logit contributions
 lava-0.000
 Glass-0.000
opl-0.000
ons-0.000
 Cookies-0.000
Token was
Token ID373
Feature activation-4.71e-01

Pos logit contributions
ines+0.016
ards+0.015
py+0.015
 years+0.015
 lava+0.014
Neg logit contributions
 toddler-0.031
 hes-0.030
 hopefully-0.028
 curiously-0.028
 daughter-0.027
Token
Token ID198
Feature activation-6.92e-01

Pos logit contributions
 hes+0.026
 toddler+0.024
 curiously+0.024
 marvel+0.022
 intrigued+0.022
Neg logit contributions
akers-0.025
opl-0.025
eds-0.025
agons-0.025
ines-0.025
TokenThe
Token ID464
Feature activation+1.04e+00

Pos logit contributions
 hes+0.023
 toddler+0.022
 curiously+0.022
 Int+0.021
 intrigued+0.020
Neg logit contributions
akers-0.024
ines-0.024
agons-0.024
opl-0.024
arter-0.024
Token old
Token ID1468
Feature activation+1.18e+00

Pos logit contributions
opl+0.030
akers+0.030
agons+0.030
arl+0.029
 lava+0.028
Neg logit contributions
 hes-0.039
 curiously-0.038
 toddler-0.035
 Int-0.035
 marvel-0.034
Token man
Token ID582
Feature activation-1.25e+00

Pos logit contributions
opl+0.033
iren+0.032
akers+0.032
ines+0.031
agons+0.030
Neg logit contributions
 hes-0.043
 curiously-0.043
 toddler-0.041
 Int-0.041
 hopefully-0.040
Token looked
Token ID3114
Feature activation-9.13e-01

Pos logit contributions
 hes+0.039
 toddler+0.039
 curiously+0.036
 Nathan+0.035
 marvel+0.035
Neg logit contributions
akers-0.041
ines-0.041
ards-0.041
smart-0.040
eds-0.040
Token up
Token ID510
Feature activation-6.61e+00

Pos logit contributions
 toddler+0.029
 hes+0.029
 mirac+0.027
 marvel+0.027
 Int+0.026
Neg logit contributions
ines-0.031
ards-0.029
eds-0.028
agons-0.028
akers-0.027
Token and
Token ID290
Feature activation-2.06e+00

Pos logit contributions
 toddler+0.330
 hes+0.321
 Int+0.315
Whoever+0.308
 nic+0.306
Neg logit contributions
ines-0.085
 years-0.082
!-0.079
!"-0.078
 lava-0.074
Token saw
Token ID2497
Feature activation-6.82e-01

Pos logit contributions
 toddler+0.069
 hes+0.064
Whoever+0.062
 athlete+0.060
 convince+0.059
Neg logit contributions
ards-0.062
eds-0.059
smart-0.059
akers-0.058
 lava-0.058
Token a
Token ID257
Feature activation-2.04e+00

Pos logit contributions
 hes+0.026
 toddler+0.025
 curiously+0.025
 Int+0.023
 marvel+0.023
Neg logit contributions
akers-0.020
agons-0.020
opl-0.020
ines-0.020
eds-0.019
Token big
Token ID1263
Feature activation+1.00e-01

Pos logit contributions
 hes+0.067
 toddler+0.064
 curiously+0.064
 marvel+0.059
 Int+0.058
Neg logit contributions
akers-0.071
agons-0.070
opl-0.070
ines-0.069
eds-0.068
Token,
Token ID11
Feature activation-1.50e+00

Pos logit contributions
akers+0.003
agons+0.003
ines+0.003
eds+0.003
opl+0.003
Neg logit contributions
 hes-0.003
 toddler-0.003
 curiously-0.003
 3-0.003
 marvel-0.003
Token eyes
Token ID2951
Feature activation+1.84e+00

Pos logit contributions
 hes+0.038
 toddler+0.036
 Int+0.035
 marvel+0.035
 curiously+0.035
Neg logit contributions
agons-0.024
ines-0.024
eds-0.023
opl-0.023
akers-0.023
Token and
Token ID290
Feature activation-1.67e+00

Pos logit contributions
akers+0.049
agons+0.048
opl+0.047
ines+0.047
eds+0.046
Neg logit contributions
 hes-0.075
 curiously-0.072
 toddler-0.072
 Int-0.067
 marvel-0.067
Token yelled
Token ID22760
Feature activation+3.45e-01

Pos logit contributions
 toddler+0.059
 hes+0.056
 Int+0.053
 curiously+0.053
 intrigued+0.051
Neg logit contributions
ons-0.047
ards-0.046
ines-0.046
akers-0.045
smart-0.045
Token,
Token ID11
Feature activation+5.75e-01

Pos logit contributions
akers+0.009
ines+0.009
agons+0.008
eds+0.008
opl+0.008
Neg logit contributions
 hes-0.014
 toddler-0.014
 curiously-0.013
 Int-0.013
 marvel-0.013
Token "
Token ID366
Feature activation-4.65e-01

Pos logit contributions
akers+0.019
opl+0.019
ines+0.019
agons+0.019
arl+0.018
Neg logit contributions
 hes-0.019
 toddler-0.018
 curiously-0.018
 Int-0.017
 marvel-0.016
TokenWhe
Token ID10842
Feature activation-6.60e+00

Pos logit contributions
 curiously+0.018
 hes+0.018
 intrigued+0.017
 toddler+0.017
 nervously+0.016
Neg logit contributions
akers-0.012
 yourself-0.012
opl-0.011
eds-0.011
ons-0.011
Tokenee
Token ID1453
Feature activation-3.39e-01

Pos logit contributions
 hes+0.328
 toddler+0.316
 curiously+0.313
 marvel+0.301
 Int+0.298
Neg logit contributions
agons-0.121
akers-0.117
opl-0.114
ines-0.108
arl-0.107
Tokene
Token ID68
Feature activation+2.34e+00

Pos logit contributions
 hes+0.016
 toddler+0.015
 curiously+0.015
 marvel+0.015
 intrigued+0.014
Neg logit contributions
agons-0.007
akers-0.007
arl-0.007
opl-0.007
arter-0.007
Token!"
Token ID2474
Feature activation-1.56e+00

Pos logit contributions
agons+0.051
akers+0.049
ines+0.048
opl+0.047
arter+0.046
Neg logit contributions
 hes-0.108
 curiously-0.103
 toddler-0.102
 marvel-0.098
 intrigued-0.097
Token as
Token ID355
Feature activation-3.15e+00

Pos logit contributions
 hes+0.053
 toddler+0.051
 curiously+0.050
 marvel+0.047
 intrigued+0.047
Neg logit contributions
akers-0.048
opl-0.048
ines-0.048
agons-0.047
arl-0.047
Token the
Token ID262
Feature activation-1.21e-01

Pos logit contributions
 hes+0.108
 toddler+0.100
 curiously+0.099
 Int+0.099
 intrigued+0.093
Neg logit contributions
ines-0.101
akers-0.100
ards-0.095
opl-0.095
arter-0.094
Token
Token ID198
Feature activation+4.56e-03

Pos logit contributions
 hes+0.006
 curiously+0.006
 toddler+0.006
 Int+0.005
 marvel+0.005
Neg logit contributions
akers-0.011
agons-0.011
opl-0.010
ines-0.010
eds-0.010
TokenTom
Token ID13787
Feature activation-5.47e-01

Pos logit contributions
arter+0.000
agons+0.000
opl+0.000
akers+0.000
py+0.000
Neg logit contributions
 hes-0.000
 toddler-0.000
 curiously-0.000
 Int-0.000
 intrigued-0.000
Token and
Token ID290
Feature activation-2.12e+00

Pos logit contributions
 hes+0.021
 curiously+0.020
 toddler+0.020
 Int+0.019
 marvel+0.019
Neg logit contributions
akers-0.016
agons-0.016
opl-0.016
ines-0.015
eds-0.015
Token Anna
Token ID11735
Feature activation-1.69e+00

Pos logit contributions
 hes+0.089
 curiously+0.087
 marvel+0.086
 toddler+0.084
 intrigued+0.084
Neg logit contributions
smart-0.046
arl-0.045
 snakes-0.044
ines-0.044
py-0.043
Token look
Token ID804
Feature activation-7.74e-01

Pos logit contributions
 hes+0.070
 toddler+0.069
 curiously+0.065
 hopefully+0.064
 Int+0.063
Neg logit contributions
ines-0.040
agons-0.037
arl-0.037
akers-0.036
eds-0.036
Token up
Token ID510
Feature activation-6.59e+00

Pos logit contributions
 hes+0.023
 toddler+0.022
 curiously+0.021
 hopefully+0.020
 Int+0.019
Neg logit contributions
ines-0.029
akers-0.028
opl-0.028
agons-0.028
eds-0.028
Token.
Token ID13
Feature activation+4.06e-01

Pos logit contributions
 hes+0.345
 toddler+0.344
 Int+0.337
 Exc+0.327
 intrigued+0.322
Neg logit contributions
ines-0.086
arl-0.077
akers-0.071
arter-0.069
eds-0.068
Token They
Token ID1119
Feature activation+1.31e+00

Pos logit contributions
arl+0.007
akers+0.007
ines+0.007
agons+0.007
opl+0.007
Neg logit contributions
 hes-0.019
 curiously-0.019
 toddler-0.018
 marvel-0.018
 hopefully-0.018
Token see
Token ID766
Feature activation-2.73e-01

Pos logit contributions
ines+0.035
agons+0.035
akers+0.034
opl+0.034
arl+0.034
Neg logit contributions
 hes-0.050
 toddler-0.050
 curiously-0.047
 hopefully-0.045
 Int-0.045
Token a
Token ID257
Feature activation-1.85e+00

Pos logit contributions
 hes+0.010
 curiously+0.009
 toddler+0.009
 marvel+0.008
 Int+0.008
Neg logit contributions
akers-0.009
agons-0.009
ines-0.009
opl-0.009
eds-0.009
Token boy
Token ID2933
Feature activation-2.68e+00

Pos logit contributions
 toddler+0.059
 hes+0.058
 curiously+0.057
 marvel+0.053
 intrigued+0.052
Neg logit contributions
agons-0.067
opl-0.066
akers-0.066
ines-0.064
eds-0.063
Token the
Token ID262
Feature activation+1.45e+00

Pos logit contributions
akers+0.000
agons+0.000
opl+0.000
ines+0.000
eds+0.000
Neg logit contributions
 hes-0.000
 curiously-0.000
 toddler-0.000
 marvel-0.000
 hopefully-0.000
Token veterinarian
Token ID41404
Feature activation+1.63e+00

Pos logit contributions
akers+0.055
agons+0.054
opl+0.053
ines+0.053
eds+0.053
Neg logit contributions
 hes-0.044
 toddler-0.042
 curiously-0.041
 Int-0.038
 marvel-0.038
Token,
Token ID11
Feature activation-1.20e-01

Pos logit contributions
akers+0.056
ines+0.055
agons+0.055
opl+0.054
eds+0.054
Neg logit contributions
 hes-0.054
 toddler-0.051
 curiously-0.049
 Int-0.047
 intrigued-0.045
Token organized
Token ID8389
Feature activation+2.20e+00

Pos logit contributions
 hes+0.004
 toddler+0.004
 curiously+0.004
 Int+0.004
 intrigued+0.004
Neg logit contributions
akers-0.004
opl-0.004
ines-0.004
agons-0.004
eds-0.003
Token and
Token ID290
Feature activation+1.47e-01

Pos logit contributions
akers+0.074
agons+0.072
opl+0.070
arl+0.070
arter+0.070
Neg logit contributions
 hes-0.078
 curiously-0.074
 toddler-0.073
 marvel-0.067
 hopefully-0.066
Token neat
Token ID15049
Feature activation+3.21e+00

Pos logit contributions
akers+0.005
ines+0.004
opl+0.004
agons+0.004
eds+0.004
Neg logit contributions
 hes-0.005
 toddler-0.005
 curiously-0.005
 Int-0.005
 hopefully-0.005
Token like
Token ID588
Feature activation+1.53e+00

Pos logit contributions
akers+0.092
opl+0.091
agons+0.090
arter+0.089
ines+0.088
Neg logit contributions
 hes-0.126
 curiously-0.122
 toddler-0.121
 marvel-0.112
 intrigued-0.111
Token she
Token ID673
Feature activation-3.39e-01

Pos logit contributions
akers+0.049
agons+0.048
ines+0.048
opl+0.048
eds+0.047
Neg logit contributions
 hes-0.056
 curiously-0.052
 toddler-0.052
 marvel-0.049
 intrigued-0.048
Token always
Token ID1464
Feature activation+3.57e-01

Pos logit contributions
 hes+0.011
 curiously+0.011
 toddler+0.011
 Int+0.010
 marvel+0.010
Neg logit contributions
akers-0.012
agons-0.012
opl-0.012
ines-0.011
eds-0.011
Token was
Token ID373
Feature activation-3.71e-01

Pos logit contributions
akers+0.013
agons+0.012
opl+0.012
ines+0.012
eds+0.012
Neg logit contributions
 hes-0.012
 curiously-0.011
 toddler-0.011
 Int-0.010
 marvel-0.010
Token.
Token ID13
Feature activation-4.20e-01

Pos logit contributions
 hes+0.013
 curiously+0.012
 toddler+0.012
 marvel+0.012
 intrigued+0.011
Neg logit contributions
akers-0.012
agons-0.012
opl-0.012
ines-0.012
eds-0.012
Token It
Token ID632
Feature activation+9.83e-01

Pos logit contributions
akers+0.035
agons+0.034
ines+0.034
eds+0.033
opl+0.033
Neg logit contributions
 hes-0.032
 toddler-0.031
 curiously-0.031
 Int-0.029
 hopefully-0.029
Token's
Token ID338
Feature activation-1.25e+00

Pos logit contributions
akers+0.031
opl+0.030
ines+0.030
agons+0.030
eds+0.030
Neg logit contributions
 hes-0.035
 toddler-0.034
 curiously-0.033
 marvel-0.031
 hopefully-0.031
Token not
Token ID407
Feature activation+1.59e+00

Pos logit contributions
 hes+0.046
 curiously+0.043
 toddler+0.041
 hopefully+0.040
 intrigued+0.040
Neg logit contributions
akers-0.039
agons-0.039
ines-0.038
opl-0.038
eds-0.038
Token scary
Token ID14343
Feature activation+9.76e-01

Pos logit contributions
agons+0.043
opl+0.043
ines+0.043
akers+0.042
eds+0.041
Neg logit contributions
 hes-0.065
 curiously-0.061
 toddler-0.060
 marvel-0.058
 intrigued-0.057
Token or
Token ID393
Feature activation+8.90e-01

Pos logit contributions
akers+0.025
agons+0.025
opl+0.025
ines+0.024
eds+0.024
Neg logit contributions
 hes-0.041
 curiously-0.039
 toddler-0.039
 hopefully-0.037
 intrigued-0.036
Token mean
Token ID1612
Feature activation+3.11e+00

Pos logit contributions
akers+0.022
agons+0.022
opl+0.021
ines+0.021
eds+0.020
Neg logit contributions
 hes-0.039
 curiously-0.037
 toddler-0.037
 marvel-0.035
 hopefully-0.035
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
akers+0.091
agons+0.090
opl+0.088
arter+0.086
ines+0.085
Neg logit contributions
 hes-0.122
 curiously-0.114
 toddler-0.113
 hopefully-0.107
 Int-0.103
Token It
Token ID632
Feature activation+9.65e-01

Pos logit contributions
ines+0.048
akers+0.047
agons+0.047
eds+0.046
opl+0.045
Neg logit contributions
 hes-0.034
 toddler-0.033
 Int-0.032
 curiously-0.032
 hopefully-0.030
Token's
Token ID338
Feature activation-1.08e+00

Pos logit contributions
akers+0.033
agons+0.032
opl+0.032
ines+0.032
eds+0.031
Neg logit contributions
 hes-0.033
 curiously-0.031
 toddler-0.031
 Int-0.029
 marvel-0.028
Token just
Token ID655
Feature activation-1.16e+00

Pos logit contributions
 hes+0.040
 curiously+0.037
 toddler+0.036
 hopefully+0.035
 intrigued+0.034
Neg logit contributions
agons-0.033
ines-0.033
akers-0.033
opl-0.032
eds-0.032
Token doing
Token ID1804
Feature activation+7.71e-01

Pos logit contributions
 hes+0.044
 curiously+0.041
 toddler+0.041
 marvel+0.039
 Int+0.038
Neg logit contributions
akers-0.035
agons-0.035
ines-0.034
opl-0.034
eds-0.034
Token bus
Token ID1323
Feature activation+1.47e+00

Pos logit contributions
akers+0.021
arter+0.020
agons+0.020
arl+0.019
eds+0.019
Neg logit contributions
 curiously-0.028
 hes-0.028
 toddler-0.027
 marvel-0.026
 Int-0.025
Token and
Token ID290
Feature activation-6.44e-01

Pos logit contributions
akers+0.046
agons+0.046
eds+0.044
arter+0.044
ines+0.043
Neg logit contributions
 hes-0.053
 curiously-0.051
 toddler-0.050
 marvel-0.048
 hopefully-0.047
Token walk
Token ID2513
Feature activation-1.59e+00

Pos logit contributions
 hes+0.024
 toddler+0.023
 curiously+0.022
 Int+0.022
 intrigued+0.021
Neg logit contributions
akers-0.018
opl-0.018
ines-0.018
ards-0.018
agons-0.018
Token into
Token ID656
Feature activation-1.56e-01

Pos logit contributions
 hes+0.054
 toddler+0.050
 curiously+0.050
 Int+0.047
 marvel+0.046
Neg logit contributions
akers-0.054
ines-0.053
agons-0.053
opl-0.052
eds-0.052
Token the
Token ID262
Feature activation+5.34e-01

Pos logit contributions
 hes+0.006
 curiously+0.006
 toddler+0.006
 marvel+0.005
 Int+0.005
Neg logit contributions
akers-0.005
agons-0.005
opl-0.005
ines-0.005
eds-0.005
Token library
Token ID5888
Feature activation+3.12e+00

Pos logit contributions
akers+0.017
agons+0.017
arter+0.016
arl+0.016
eds+0.016
Neg logit contributions
 hes-0.019
 toddler-0.019
 curiously-0.018
 marvel-0.018
 Int-0.017
Token.
Token ID13
Feature activation-4.20e-01

Pos logit contributions
akers+0.066
ines+0.062
agons+0.061
opl+0.061
eds+0.060
Neg logit contributions
 hes-0.146
 toddler-0.140
 curiously-0.139
 Int-0.133
 marvel-0.131
Token
Token ID198
Feature activation-7.77e-02

Pos logit contributions
 hes+0.015
 curiously+0.014
 toddler+0.014
 intrigued+0.013
 hopefully+0.013
Neg logit contributions
opl-0.014
akers-0.013
agons-0.013
ines-0.013
arl-0.013
Token
Token ID198
Feature activation+2.01e-01

Pos logit contributions
 hes+0.003
 toddler+0.003
 curiously+0.002
 hopefully+0.002
 intrigued+0.002
Neg logit contributions
ines-0.003
opl-0.002
akers-0.002
eds-0.002
agons-0.002
TokenOne
Token ID3198
Feature activation+5.24e-01

Pos logit contributions
akers+0.006
agons+0.006
opl+0.006
ines+0.006
eds+0.006
Neg logit contributions
 hes-0.007
 curiously-0.007
 toddler-0.007
 Int-0.006
 marvel-0.006
Token day
Token ID1110
Feature activation-8.34e-01

Pos logit contributions
agons+0.018
eds+0.017
akers+0.017
opl+0.017
ines+0.017
Neg logit contributions
 hes-0.018
 toddler-0.018
 curiously-0.017
 marvel-0.016
 intrigued-0.016
Token wished
Token ID16555
Feature activation+6.47e-01

Pos logit contributions
 years+0.077
ines+0.074
ards+0.072
akers+0.071
eds+0.071
Neg logit contributions
 toddler-0.077
 hes-0.072
 curiously-0.072
 hopefully-0.069
 Int-0.068
Token he
Token ID339
Feature activation-9.80e-01

Pos logit contributions
akers+0.022
ines+0.022
opl+0.022
agons+0.021
arl+0.021
Neg logit contributions
 hes-0.020
 toddler-0.020
 curiously-0.019
 Int-0.019
 intrigued-0.019
Token had
Token ID550
Feature activation+3.74e+00

Pos logit contributions
 hes+0.039
 curiously+0.037
 toddler+0.037
 Int+0.035
 marvel+0.034
Neg logit contributions
akers-0.028
ines-0.026
agons-0.026
eds-0.026
opl-0.026
Token not
Token ID407
Feature activation+3.16e+00

Pos logit contributions
akers+0.094
eds+0.089
ards+0.085
smart+0.085
ons+0.085
Neg logit contributions
 curiously-0.155
 toddler-0.153
 hes-0.153
 Int-0.144
 Exc-0.142
Token run
Token ID1057
Feature activation+1.21e+00

Pos logit contributions
agons+0.117
opl+0.116
akers+0.115
ines+0.114
arl+0.111
Neg logit contributions
 hes-0.101
 toddler-0.091
 curiously-0.090
 marvel-0.087
 intrigued-0.085
Token away
Token ID1497
Feature activation+3.34e+00

Pos logit contributions
agons+0.046
opl+0.046
akers+0.046
arter+0.044
ines+0.044
Neg logit contributions
 hes-0.037
 curiously-0.035
 toddler-0.034
 hopefully-0.031
 nervously-0.030
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
agons+0.101
akers+0.098
opl+0.095
eds+0.091
ines+0.091
Neg logit contributions
 hes-0.133
 curiously-0.129
 toddler-0.125
 hopefully-0.120
 nervously-0.116
Token She
Token ID1375
Feature activation+2.52e+00

Pos logit contributions
akers+0.036
agons+0.035
opl+0.035
ines+0.035
eds+0.034
Neg logit contributions
 hes-0.046
 toddler-0.043
 curiously-0.043
 marvel-0.040
 Int-0.040
Token wondered
Token ID14028
Feature activation+4.89e-01

Pos logit contributions
 years+0.087
ines+0.085
ards+0.082
akers+0.082
eds+0.082
Neg logit contributions
 toddler-0.076
 hes-0.074
 curiously-0.073
 hopefully-0.068
 intrigued-0.068
Token if
Token ID611
Feature activation-2.74e+00

Pos logit contributions
akers+0.014
ines+0.014
agons+0.014
opl+0.014
eds+0.013
Neg logit contributions
 hes-0.018
 toddler-0.018
 curiously-0.017
 Int-0.017
 intrigued-0.016
Token he
Token ID339
Feature activation-1.42e+00

Pos logit contributions
 hes+0.102
 curiously+0.102
 marvel+0.099
 toddler+0.098
 intrigued+0.097
Neg logit contributions
ards-0.075
ons-0.072
 snakes-0.071
akers-0.070
agons-0.069
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
akers+0.066
agons+0.063
opl+0.063
arter+0.061
ons+0.060
Neg logit contributions
 curiously-0.051
 hes-0.050
 hopefully-0.044
 nervously-0.043
 toddler-0.043
Token He
Token ID679
Feature activation+8.68e-01

Pos logit contributions
ines+0.036
arl+0.036
akers+0.036
opl+0.036
agons+0.036
Neg logit contributions
 hes-0.041
 toddler-0.039
 curiously-0.038
 marvel-0.036
 intrigued-0.035
Token laughed
Token ID13818
Feature activation-2.00e-01

Pos logit contributions
ines+0.027
ards+0.027
akers+0.027
eds+0.027
agons+0.026
Neg logit contributions
 toddler-0.029
 hes-0.029
 curiously-0.028
 hopefully-0.027
 Int-0.026
Token and
Token ID290
Feature activation+1.30e+00

Pos logit contributions
 hes+0.008
 toddler+0.008
 curiously+0.008
 Int+0.007
 marvel+0.007
Neg logit contributions
akers-0.005
agons-0.005
opl-0.005
ines-0.005
eds-0.005
Token ran
Token ID4966
Feature activation+1.96e+00

Pos logit contributions
akers+0.046
agons+0.045
opl+0.045
ines+0.045
eds+0.045
Neg logit contributions
 hes-0.040
 curiously-0.039
 toddler-0.039
 Int-0.035
 hopefully-0.035
Token away
Token ID1497
Feature activation+3.72e+00

Pos logit contributions
akers+0.074
agons+0.074
opl+0.074
arter+0.072
ines+0.071
Neg logit contributions
 hes-0.061
 curiously-0.057
 toddler-0.056
 Int-0.052
 marvel-0.051
Token with
Token ID351
Feature activation-2.79e-01

Pos logit contributions
agons+0.093
akers+0.091
opl+0.090
eds+0.085
ines+0.085
Neg logit contributions
 hes-0.165
 curiously-0.160
 toddler-0.156
 marvel-0.149
 hopefully-0.149
Token Anna
Token ID11735
Feature activation-4.71e-01

Pos logit contributions
 hes+0.010
 toddler+0.009
 curiously+0.009
 Int+0.009
 marvel+0.009
Neg logit contributions
akers-0.009
agons-0.009
ines-0.009
opl-0.009
eds-0.009
Token.
Token ID13
Feature activation+1.44e+00

Pos logit contributions
 hes+0.021
 curiously+0.020
 toddler+0.020
 hopefully+0.019
 marvel+0.019
Neg logit contributions
akers-0.011
agons-0.011
opl-0.011
arl-0.011
arter-0.011
Token
Token ID198
Feature activation+7.38e-01

Pos logit contributions
ines+0.048
akers+0.048
arl+0.048
opl+0.047
agons+0.047
Neg logit contributions
 hes-0.048
 toddler-0.045
 curiously-0.044
 marvel-0.042
 intrigued-0.041
Token
Token ID198
Feature activation+9.86e-01

Pos logit contributions
akers+0.027
ines+0.027
agons+0.027
opl+0.027
eds+0.027
Neg logit contributions
 hes-0.022
 toddler-0.021
 curiously-0.021
 marvel-0.019
 Int-0.019
Token friends
Token ID2460
Feature activation+6.78e-01

Pos logit contributions
 hes+0.013
 curiously+0.012
 toddler+0.012
 marvel+0.011
 intrigued+0.011
Neg logit contributions
akers-0.011
opl-0.011
agons-0.011
ines-0.011
eds-0.011
Token,
Token ID11
Feature activation+2.00e-01

Pos logit contributions
agons+0.017
akers+0.017
opl+0.017
arl+0.016
ines+0.016
Neg logit contributions
 hes-0.029
 curiously-0.028
 toddler-0.028
 marvel-0.026
 hopefully-0.026
Token Ben
Token ID3932
Feature activation-6.04e-01

Pos logit contributions
akers+0.005
agons+0.005
ines+0.005
ards+0.005
opl+0.005
Neg logit contributions
 hes-0.008
 curiously-0.008
 toddler-0.008
 intrigued-0.007
 hopefully-0.007
Token and
Token ID290
Feature activation-1.55e+00

Pos logit contributions
 hes+0.025
 curiously+0.024
 toddler+0.024
 Int+0.022
 marvel+0.022
Neg logit contributions
akers-0.016
agons-0.016
ines-0.016
opl-0.016
eds-0.015
Token Dave
Token ID9935
Feature activation-2.33e+00

Pos logit contributions
 hes+0.060
 curiously+0.058
 toddler+0.056
 Int+0.053
 intrigued+0.053
Neg logit contributions
akers-0.045
agons-0.042
opl-0.042
arter-0.041
ines-0.041
Token.
Token ID13
Feature activation+3.73e-01

Pos logit contributions
 hes+0.107
 curiously+0.103
 toddler+0.102
 marvel+0.097
 hopefully+0.096
Neg logit contributions
opl-0.051
agons-0.051
akers-0.050
ines-0.048
eds-0.047
Token They
Token ID1119
Feature activation+1.50e+00

Pos logit contributions
agons+0.006
ines+0.006
arl+0.006
opl+0.006
akers+0.006
Neg logit contributions
 hes-0.018
 toddler-0.018
 curiously-0.018
 hopefully-0.017
 Int-0.017
Token were
Token ID547
Feature activation+1.68e-01

Pos logit contributions
akers+0.055
opl+0.055
agons+0.054
eds+0.054
ines+0.053
Neg logit contributions
 hes-0.046
 curiously-0.044
 toddler-0.043
 marvel-0.041
 intrigued-0.040
Token jumping
Token ID14284
Feature activation-1.02e+00

Pos logit contributions
akers+0.005
agons+0.005
opl+0.005
ines+0.005
eds+0.005
Neg logit contributions
 hes-0.006
 curiously-0.006
 toddler-0.006
 marvel-0.006
 intrigued-0.006
Token and
Token ID290
Feature activation+6.49e-01

Pos logit contributions
 hes+0.036
 toddler+0.034
 curiously+0.034
 Int+0.031
 hopefully+0.031
Neg logit contributions
akers-0.033
agons-0.033
ines-0.033
opl-0.032
eds-0.032
Token running
Token ID2491
Feature activation+5.36e-01

Pos logit contributions
akers+0.021
agons+0.021
opl+0.020
ines+0.020
eds+0.020
Neg logit contributions
 hes-0.023
 toddler-0.022
 curiously-0.022
 Int-0.020
 marvel-0.020
Token bed
Token ID3996
Feature activation+2.61e+00

Pos logit contributions
opl+0.021
agons+0.021
akers+0.020
ines+0.020
eds+0.020
Neg logit contributions
 hes-0.027
 curiously-0.026
 toddler-0.025
 Int-0.024
 intrigued-0.024
Token.
Token ID13
Feature activation-8.41e-03

Pos logit contributions
agons+0.062
akers+0.062
opl+0.060
eds+0.059
ines+0.057
Neg logit contributions
 hes-0.115
 curiously-0.111
 toddler-0.110
 marvel-0.104
 hopefully-0.104
Token She
Token ID1375
Feature activation+1.95e+00

Pos logit contributions
 hes+0.000
 curiously+0.000
 toddler+0.000
 Int+0.000
 marvel+0.000
Neg logit contributions
akers-0.000
agons-0.000
opl-0.000
ines-0.000
eds-0.000
Token hugged
Token ID41130
Feature activation+6.96e-01

Pos logit contributions
ines+0.058
akers+0.056
eds+0.055
 years+0.054
ards+0.054
Neg logit contributions
 hes-0.066
 toddler-0.066
 curiously-0.065
 Int-0.064
 hopefully-0.062
Token them
Token ID606
Feature activation-4.92e-01

Pos logit contributions
akers+0.021
agons+0.021
opl+0.021
ines+0.021
eds+0.021
Neg logit contributions
 hes-0.025
 toddler-0.024
 curiously-0.024
 Int-0.022
 intrigued-0.022
Token both
Token ID1111
Feature activation+6.65e-01

Pos logit contributions
 hes+0.020
 toddler+0.019
 curiously+0.018
 Int+0.018
 intrigued+0.017
Neg logit contributions
akers-0.013
agons-0.013
ines-0.013
opl-0.013
eds-0.013
Token and
Token ID290
Feature activation-5.40e-01

Pos logit contributions
akers+0.017
agons+0.017
ines+0.017
opl+0.017
eds+0.016
Neg logit contributions
 hes-0.028
 curiously-0.026
 toddler-0.026
 Int-0.025
 marvel-0.025
Token said
Token ID531
Feature activation-3.86e-01

Pos logit contributions
 hes+0.018
 Int+0.018
 curiously+0.017
 toddler+0.017
 intrigued+0.017
Neg logit contributions
ards-0.016
akers-0.016
ons-0.016
 years-0.016
ines-0.015
Token,
Token ID11
Feature activation+5.89e-01

Pos logit contributions
 hes+0.017
 curiously+0.016
 toddler+0.016
 hopefully+0.015
 marvel+0.015
Neg logit contributions
akers-0.010
agons-0.010
opl-0.009
eds-0.009
ines-0.009
Token "
Token ID366
Feature activation+1.91e-01

Pos logit contributions
akers+0.019
opl+0.019
agons+0.018
ines+0.018
arl+0.018
Neg logit contributions
 hes-0.020
 toddler-0.019
 curiously-0.019
 Int-0.017
 marvel-0.017
TokenGood
Token ID10248
Feature activation+1.22e+00

Pos logit contributions
akers+0.006
opl+0.006
agons+0.006
eds+0.006
arl+0.006
Neg logit contributions
 hes-0.007
 curiously-0.006
 toddler-0.006
 intrigued-0.006
 hopefully-0.006
Token.
Token ID13
Feature activation+1.16e-01

Pos logit contributions
 hes+0.045
 curiously+0.044
 toddler+0.043
 marvel+0.042
 hopefully+0.042
Neg logit contributions
agons-0.026
akers-0.025
arl-0.024
opl-0.024
ines-0.024
Token<|endoftext|>
Token ID50256
Feature activation-6.50e-01

Pos logit contributions
opl+0.004
akers+0.004
ines+0.004
agons+0.004
smart+0.004
Neg logit contributions
 hes-0.004
 curiously-0.004
 toddler-0.004
 marvel-0.003
 intrigued-0.003
TokenOnce
Token ID7454
Feature activation+1.47e+00

Pos logit contributions
 hes+0.021
 curiously+0.020
 toddler+0.020
 intrigued+0.018
 marvel+0.018
Neg logit contributions
ines-0.022
akers-0.022
opl-0.022
agons-0.022
eds-0.022
Token there
Token ID612
Feature activation-1.43e+00

Pos logit contributions
ines+0.063
arter+0.057
arl+0.057
akers+0.057
eds+0.057
Neg logit contributions
 hes-0.039
 toddler-0.037
 curiously-0.033
 Int-0.032
 intrigued-0.032
Token was
Token ID373
Feature activation+2.00e+00

Pos logit contributions
 hes+0.058
 toddler+0.056
 curiously+0.054
 intrigued+0.052
 Int+0.052
Neg logit contributions
ines-0.035
opl-0.033
ons-0.033
akers-0.031
ards-0.031
Token a
Token ID257
Feature activation-4.50e-01

Pos logit contributions
akers+0.052
opl+0.050
agons+0.050
ines+0.048
 lava+0.047
Neg logit contributions
 hes-0.082
 curiously-0.077
 toddler-0.077
 hopefully-0.074
 Int-0.073
Token t
Token ID256
Feature activation-1.61e+00

Pos logit contributions
 hes+0.013
 toddler+0.013
 curiously+0.013
 marvel+0.012
 Int+0.011
Neg logit contributions
akers-0.017
agons-0.017
opl-0.017
ines-0.017
eds-0.016
Tokeneddy
Token ID21874
Feature activation-4.95e-01

Pos logit contributions
 toddler+0.064
 hes+0.064
 curiously+0.064
 marvel+0.058
 hopefully+0.058
Neg logit contributions
opl-0.042
agons-0.042
akers-0.042
ines-0.040
eds-0.039
Token bear
Token ID6842
Feature activation-4.82e-01

Pos logit contributions
 hes+0.015
 toddler+0.015
 curiously+0.015
 daughter+0.014
 hopefully+0.013
Neg logit contributions
arl-0.019
agons-0.019
akers-0.018
opl-0.018
ines-0.018
Token.
Token ID13
Feature activation+7.87e-01

Pos logit contributions
 curiously+0.016
 toddler+0.016
 hes+0.016
 marvel+0.015
 3+0.015
Neg logit contributions
agons-0.017
arl-0.017
opl-0.015
akers-0.015
ands-0.015
Token He
Token ID679
Feature activation+1.30e+00

Pos logit contributions
eds+0.038
akers+0.038
ines+0.037
agons+0.036
ands+0.035
Neg logit contributions
 Int-0.018
 toddler-0.018
 curiously-0.017
 Exc-0.017
 hes-0.017
Token.
Token ID13
Feature activation+8.76e-01

Pos logit contributions
akers+0.109
agons+0.107
opl+0.104
eds+0.102
ines+0.102
Neg logit contributions
 hes-0.154
 curiously-0.146
 toddler-0.141
 Int-0.133
 marvel-0.132
Token
Token ID198
Feature activation+3.23e-02

Pos logit contributions
arl+0.025
smart+0.025
ines+0.024
opl+0.024
akers+0.024
Neg logit contributions
 hes-0.032
 toddler-0.031
 curiously-0.029
 marvel-0.029
 und-0.028
Token
Token ID198
Feature activation+2.48e-01

Pos logit contributions
akers+0.001
agons+0.001
ines+0.001
opl+0.001
eds+0.001
Neg logit contributions
 hes-0.001
 toddler-0.001
 curiously-0.001
 marvel-0.001
 intrigued-0.001
TokenMax
Token ID11518
Feature activation-2.73e-01

Pos logit contributions
akers+0.008
agons+0.007
opl+0.007
ines+0.007
eds+0.007
Neg logit contributions
 hes-0.009
 curiously-0.009
 toddler-0.009
 Int-0.008
 marvel-0.008
Token heard
Token ID2982
Feature activation+1.47e+00

Pos logit contributions
 hes+0.008
 toddler+0.008
 curiously+0.008
 hopefully+0.008
 Int+0.007
Neg logit contributions
akers-0.009
ines-0.009
ards-0.009
eds-0.009
 years-0.009
Token her
Token ID607
Feature activation-3.70e-01

Pos logit contributions
akers+0.047
agons+0.046
opl+0.045
ines+0.045
eds+0.045
Neg logit contributions
 hes-0.053
 curiously-0.051
 toddler-0.051
 Int-0.047
 marvel-0.047
Token voice
Token ID3809
Feature activation+5.51e-01

Pos logit contributions
 hes+0.016
 toddler+0.016
 Int+0.016
 marvel+0.015
 Exc+0.015
Neg logit contributions
ines-0.008
agons-0.007
akers-0.007
arter-0.007
opl-0.007
Token from
Token ID422
Feature activation-2.27e-01

Pos logit contributions
ines+0.012
akers+0.012
opl+0.012
agons+0.012
smart+0.011
Neg logit contributions
 hes-0.024
 toddler-0.024
 curiously-0.023
 Int-0.022
 intrigued-0.022
Token under
Token ID739
Feature activation+8.89e-01

Pos logit contributions
 hes+0.008
 toddler+0.007
 Int+0.007
 curiously+0.007
 intrigued+0.007
Neg logit contributions
ines-0.007
akers-0.007
agons-0.007
opl-0.007
arter-0.007
Token the
Token ID262
Feature activation+1.28e+00

Pos logit contributions
akers+0.027
agons+0.026
opl+0.026
ines+0.026
eds+0.025
Neg logit contributions
 hes-0.034
 curiously-0.032
 toddler-0.032
 Int-0.030
 marvel-0.030
Token bed
Token ID3996
Feature activation+3.06e+00

Pos logit contributions
akers+0.058
cks+0.054
opl+0.054
mm+0.052
agons+0.052
Neg logit contributions
 curiously-0.033
 toddler-0.032
 hes-0.028
 nervously-0.028
 anx-0.028
TokenHe
Token ID1544
Feature activation+1.10e+00

Pos logit contributions
akers+0.017
agons+0.016
ines+0.016
opl+0.016
eds+0.016
Neg logit contributions
 hes-0.023
 toddler-0.022
 curiously-0.022
 Int-0.020
 marvel-0.020
Token gave
Token ID2921
Feature activation-8.60e-02

Pos logit contributions
ons+0.027
 cho+0.027
akers+0.027
ards+0.026
 lava+0.025
Neg logit contributions
 curiously-0.039
 toddler-0.039
 hopefully-0.038
 hes-0.037
 Int-0.037
Token her
Token ID607
Feature activation+1.03e-01

Pos logit contributions
 hes+0.004
 intrigued+0.004
 toddler+0.004
 Int+0.003
 curiously+0.003
Neg logit contributions
eds-0.002
ons-0.002
ines-0.002
akers-0.002
py-0.002
Token a
Token ID257
Feature activation-8.47e-01

Pos logit contributions
akers+0.003
ons+0.002
eds+0.002
agons+0.002
opl+0.002
Neg logit contributions
 hes-0.004
 curiously-0.004
 toddler-0.004
 Int-0.004
 intrigued-0.004
Token small
Token ID1402
Feature activation-3.40e-02

Pos logit contributions
 hes+0.035
 curiously+0.033
 toddler+0.032
 hopefully+0.031
 Int+0.031
Neg logit contributions
akers-0.022
agons-0.022
arl-0.022
ines-0.021
opl-0.021
Token box
Token ID3091
Feature activation+1.65e+00

Pos logit contributions
 hes+0.001
 curiously+0.001
 toddler+0.001
 Int+0.001
 intrigued+0.001
Neg logit contributions
akers-0.001
agons-0.001
ines-0.001
eds-0.001
opl-0.001
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
akers+0.045
agons+0.045
opl+0.044
ines+0.043
eds+0.043
Neg logit contributions
 hes-0.066
 curiously-0.064
 toddler-0.063
 marvel-0.059
 intrigued-0.059
Token She
Token ID1375
Feature activation+2.17e+00

Pos logit contributions
akers+0.033
agons+0.032
opl+0.032
ines+0.032
arl+0.031
Neg logit contributions
 hes-0.040
 toddler-0.038
 curiously-0.038
 marvel-0.035
 Int-0.035
Token opened
Token ID4721
Feature activation+1.11e+00

Pos logit contributions
ons+0.059
akers+0.058
ines+0.054
ards+0.053
 years+0.051
Neg logit contributions
 toddler-0.079
 hes-0.078
 curiously-0.076
 Int-0.076
 hopefully-0.074
Token it
Token ID340
Feature activation-6.48e-02

Pos logit contributions
ines+0.030
akers+0.030
eds+0.029
opl+0.029
agons+0.029
Neg logit contributions
 hes-0.043
 toddler-0.042
 curiously-0.040
 Int-0.039
 intrigued-0.039
Token and
Token ID290
Feature activation-3.43e-01

Pos logit contributions
 hes+0.003
 toddler+0.003
 curiously+0.003
 Int+0.003
 marvel+0.003
Neg logit contributions
ines-0.002
akers-0.001
eds-0.001
agons-0.001
opl-0.001
Token
Token ID198
Feature activation-2.62e-01

Pos logit contributions
opl+0.013
ines+0.013
akers+0.012
agons+0.012
arl+0.012
Neg logit contributions
 hes-0.013
 toddler-0.013
 curiously-0.012
 marvel-0.011
 intrigued-0.011
Token
Token ID198
Feature activation-2.42e-02

Pos logit contributions
 hes+0.008
 toddler+0.007
 curiously+0.007
 Int+0.007
 hopefully+0.007
Neg logit contributions
akers-0.010
ines-0.010
agons-0.010
opl-0.010
eds-0.010
TokenJoe
Token ID19585
Feature activation-4.95e-01

Pos logit contributions
 hes+0.001
 curiously+0.001
 toddler+0.001
 marvel+0.001
 intrigued+0.001
Neg logit contributions
akers-0.001
agons-0.001
arl-0.001
arter-0.001
opl-0.001
Token was
Token ID373
Feature activation-7.96e-01

Pos logit contributions
 hes+0.015
 toddler+0.014
 curiously+0.014
 Int+0.013
 hopefully+0.013
Neg logit contributions
akers-0.018
ines-0.018
eds-0.018
agons-0.018
opl-0.018
Token so
Token ID523
Feature activation-7.89e-01

Pos logit contributions
 hes+0.034
 toddler+0.033
 curiously+0.033
 Int+0.031
 Exc+0.031
Neg logit contributions
akers-0.019
opl-0.017
ines-0.017
ards-0.016
smart-0.016
Token proud
Token ID6613
Feature activation-1.11e+00

Pos logit contributions
 hes+0.026
 toddler+0.024
 curiously+0.023
 Int+0.022
 hopefully+0.021
Neg logit contributions
akers-0.028
ines-0.027
opl-0.026
smart-0.025
arl-0.025
Token of
Token ID286
Feature activation+6.03e-01

Pos logit contributions
 hes+0.048
 toddler+0.046
 curiously+0.046
 marvel+0.044
 hopefully+0.043
Neg logit contributions
akers-0.027
agons-0.027
opl-0.026
eds-0.026
ines-0.025
Token himself
Token ID2241
Feature activation+7.12e-01

Pos logit contributions
akers+0.028
arl+0.026
agons+0.026
eds+0.026
opl+0.025
Neg logit contributions
 hes-0.017
 curiously-0.016
 hopefully-0.014
 marvel-0.014
 nervously-0.013
Token for
Token ID329
Feature activation+1.15e+00

Pos logit contributions
akers+0.020
agons+0.020
arl+0.020
opl+0.020
eds+0.020
Neg logit contributions
 hes-0.029
 curiously-0.028
 toddler-0.027
 hopefully-0.026
 marvel-0.026
Token learning
Token ID4673
Feature activation+2.78e+00

Pos logit contributions
agons+0.038
akers+0.038
opl+0.037
arl+0.037
ines+0.036
Neg logit contributions
 hes-0.043
 curiously-0.041
 toddler-0.039
 marvel-0.038
 hopefully-0.038
Token how
Token ID703
Feature activation-2.47e+00

Pos logit contributions
akers+0.085
agons+0.081
arter+0.079
eds+0.079
arl+0.079
Neg logit contributions
 hes-0.110
 curiously-0.104
 toddler-0.101
 marvel-0.096
 hopefully-0.096
Token.
Token ID13
Feature activation+2.14e-01

Pos logit contributions
akers+0.009
ines+0.009
opl+0.009
agons+0.008
eds+0.008
Neg logit contributions
 hes-0.017
 toddler-0.016
 curiously-0.016
 Int-0.015
 intrigued-0.015
Token
Token ID198
Feature activation+1.71e-01

Pos logit contributions
opl+0.006
arl+0.006
agons+0.006
akers+0.006
ines+0.006
Neg logit contributions
 hes-0.008
 toddler-0.008
 curiously-0.008
 marvel-0.007
 hopefully-0.007
Token
Token ID198
Feature activation+4.29e-01

Pos logit contributions
ines+0.006
akers+0.006
opl+0.006
agons+0.006
eds+0.006
Neg logit contributions
 hes-0.005
 toddler-0.005
 curiously-0.005
 marvel-0.005
 intrigued-0.005
TokenThe
Token ID464
Feature activation+8.49e-01

Pos logit contributions
arter+0.018
akers+0.017
ines+0.017
opl+0.017
agons+0.017
Neg logit contributions
 hes-0.012
 curiously-0.012
 toddler-0.011
 Int-0.011
 intrigued-0.011
Token boy
Token ID2933
Feature activation-1.11e+00

Pos logit contributions
opl+0.028
agons+0.028
arl+0.028
 ow+0.027
py+0.026
Neg logit contributions
 hes-0.028
 curiously-0.027
 marvel-0.026
 Int-0.025
 Exc-0.025
Token helped
Token ID4193
Feature activation-1.13e+00

Pos logit contributions
 toddler+0.037
 hes+0.037
 curiously+0.036
 Int+0.034
 hopefully+0.034
Neg logit contributions
ines-0.033
akers-0.033
ards-0.032
eds-0.032
ons-0.032
Token the
Token ID262
Feature activation+5.16e-01

Pos logit contributions
 hes+0.049
 toddler+0.048
 curiously+0.046
 intrigued+0.046
 Int+0.046
Neg logit contributions
akers-0.025
eds-0.025
agons-0.024
opl-0.024
ards-0.024
Token pony
Token ID26902
Feature activation+7.38e-01

Pos logit contributions
opl+0.017
arl+0.016
 ow+0.016
agons+0.016
eds+0.015
Neg logit contributions
 hes-0.017
 curiously-0.016
 marvel-0.015
 anx-0.015
 intrigued-0.015
Token with
Token ID351
Feature activation-2.00e+00

Pos logit contributions
ines+0.025
akers+0.024
opl+0.024
eds+0.024
agons+0.024
Neg logit contributions
 hes-0.024
 toddler-0.024
 curiously-0.022
 Int-0.021
 intrigued-0.021
Token its
Token ID663
Feature activation+6.55e-01

Pos logit contributions
 hes+0.075
 toddler+0.072
 curiously+0.071
 Int+0.067
 marvel+0.066
Neg logit contributions
akers-0.060
agons-0.059
opl-0.059
ines-0.058
eds-0.058
Token hurt
Token ID5938
Feature activation+1.31e+00

Pos logit contributions
akers+0.018
agons+0.018
opl+0.017
ines+0.017
eds+0.017
Neg logit contributions
 hes-0.026
 toddler-0.025
 curiously-0.025
 Int-0.024
 marvel-0.024
Token circle
Token ID9197
Feature activation-1.28e+00

Pos logit contributions
 hes+0.027
 curiously+0.026
 toddler+0.026
 Int+0.024
 marvel+0.024
Neg logit contributions
akers-0.026
agons-0.026
opl-0.025
ines-0.025
eds-0.025
Token block
Token ID2512
Feature activation-1.01e+00

Pos logit contributions
 hes+0.047
 curiously+0.045
 toddler+0.043
 marvel+0.042
 intrigued+0.041
Neg logit contributions
akers-0.040
agons-0.040
opl-0.039
eds-0.038
arter-0.038
Token to
Token ID284
Feature activation-1.84e-02

Pos logit contributions
 hes+0.033
 curiously+0.032
 hopefully+0.029
 eagerly+0.028
 toddler+0.028
Neg logit contributions
akers-0.037
agons-0.036
arter-0.035
eds-0.035
opl-0.035
Token a
Token ID257
Feature activation-4.56e-01

Pos logit contributions
 hes+0.001
 curiously+0.001
 toddler+0.001
 marvel+0.001
 Int+0.001
Neg logit contributions
akers-0.001
agons-0.001
opl-0.001
ines-0.001
eds-0.001
Token green
Token ID4077
Feature activation-9.51e-02

Pos logit contributions
 hes+0.017
 toddler+0.016
 curiously+0.016
 marvel+0.015
 hopefully+0.015
Neg logit contributions
akers-0.014
agons-0.014
ines-0.014
opl-0.014
eds-0.013
Token triangle
Token ID22950
Feature activation-1.79e+00

Pos logit contributions
 hes+0.003
 curiously+0.003
 toddler+0.003
 marvel+0.003
 hopefully+0.003
Neg logit contributions
akers-0.003
agons-0.003
ines-0.003
opl-0.003
eds-0.003
Token block
Token ID2512
Feature activation-9.76e-01

Pos logit contributions
 hes+0.072
 curiously+0.070
 toddler+0.068
 marvel+0.064
 hopefully+0.064
Neg logit contributions
akers-0.049
agons-0.048
opl-0.047
eds-0.046
arl-0.046
Token.
Token ID13
Feature activation+9.20e-01

Pos logit contributions
 hes+0.035
 curiously+0.035
 hopefully+0.032
 eagerly+0.032
 toddler+0.031
Neg logit contributions
akers-0.031
agons-0.030
eds-0.030
arter-0.029
opl-0.029
Token She
Token ID1375
Feature activation+1.92e+00

Pos logit contributions
akers+0.027
ines+0.027
opl+0.027
arl+0.027
agons+0.026
Neg logit contributions
 hes-0.034
 toddler-0.032
 curiously-0.032
 marvel-0.031
 intrigued-0.030
Token makes
Token ID1838
Feature activation-7.02e-01

Pos logit contributions
ines+0.051
opl+0.049
agons+0.049
akers+0.046
ocus+0.046
Neg logit contributions
 toddler-0.078
 hes-0.076
 curiously-0.073
 hopefully-0.071
 Int-0.069
Token a
Token ID257
Feature activation-1.15e+00

Pos logit contributions
 hes+0.026
 toddler+0.026
 curiously+0.025
 Int+0.024
 hopefully+0.024
Neg logit contributions
akers-0.020
opl-0.020
ines-0.020
agons-0.020
eds-0.019
Token.
Token ID13
Feature activation+3.17e-01

Pos logit contributions
 hes+0.020
 curiously+0.020
 toddler+0.019
 hopefully+0.019
 marvel+0.018
Neg logit contributions
akers-0.010
agons-0.010
arl-0.010
ines-0.010
opl-0.009
Token He
Token ID679
Feature activation+8.16e-01

Pos logit contributions
akers+0.012
opl+0.011
agons+0.011
ines+0.011
eds+0.011
Neg logit contributions
 hes-0.010
 curiously-0.009
 toddler-0.009
 marvel-0.008
 Int-0.008
Token would
Token ID561
Feature activation-6.17e-01

Pos logit contributions
akers+0.027
ines+0.027
opl+0.027
agons+0.027
eds+0.026
Neg logit contributions
 hes-0.027
 toddler-0.026
 curiously-0.026
 Int-0.024
 intrigued-0.024
Token never
Token ID1239
Feature activation+4.07e+00

Pos logit contributions
 hes+0.022
 curiously+0.022
 marvel+0.021
 toddler+0.020
 hopefully+0.020
Neg logit contributions
akers-0.021
agons-0.020
eds-0.020
ines-0.019
opl-0.019
Token forget
Token ID6044
Feature activation+2.28e+00

Pos logit contributions
akers+0.113
arter+0.112
smart+0.111
ines+0.110
opl+0.109
Neg logit contributions
 hes-0.154
 toddler-0.151
 curiously-0.147
 Int-0.141
 intrigued-0.136
Token how
Token ID703
Feature activation-2.56e+00

Pos logit contributions
akers+0.067
agons+0.065
ines+0.064
opl+0.064
eds+0.063
Neg logit contributions
 hes-0.089
 curiously-0.085
 toddler-0.084
 marvel-0.079
 Int-0.079
Token to
Token ID284
Feature activation+3.48e-01

Pos logit contributions
 hes+0.102
 curiously+0.099
 toddler+0.098
 intrigued+0.092
 hopefully+0.092
Neg logit contributions
agons-0.073
akers-0.072
opl-0.072
ines-0.070
eds-0.068
Token do
Token ID466
Feature activation+1.49e+00

Pos logit contributions
akers+0.014
agons+0.013
ines+0.013
eds+0.013
opl+0.013
Neg logit contributions
 curiously-0.010
 hes-0.010
 marvel-0.010
 hopefully-0.009
 toddler-0.009
Token it
Token ID340
Feature activation-4.74e-02

Pos logit contributions
akers+0.055
agons+0.054
arl+0.053
ines+0.052
opl+0.052
Neg logit contributions
 hes-0.048
 curiously-0.046
 toddler-0.044
 marvel-0.042
 hopefully-0.042
Token.
Token ID13
Feature activation+1.86e-01

Pos logit contributions
 hes+0.001
 toddler+0.001
 curiously+0.001
 Int+0.001
 intrigued+0.001
Neg logit contributions
akers-0.002
opl-0.002
ines-0.002
agons-0.002
eds-0.002
Token<|endoftext|>
Token ID50256
Feature activation-4.47e-01

Pos logit contributions
akers+0.007
agons+0.007
ines+0.007
opl+0.007
eds+0.007
Neg logit contributions
 hes-0.005
 toddler-0.005
 curiously-0.005
 Int-0.005
 marvel-0.005
Token,
Token ID11
Feature activation-3.62e-01

Pos logit contributions
agons+0.043
opl+0.042
arter+0.041
arl+0.039
eds+0.037
Neg logit contributions
 hes-0.049
 toddler-0.048
 curiously-0.047
 daughter-0.046
 marvel-0.044
Token some
Token ID617
Feature activation-1.59e+00

Pos logit contributions
 hes+0.015
 toddler+0.014
 curiously+0.014
 hopefully+0.013
 marvel+0.013
Neg logit contributions
akers-0.010
agons-0.010
opl-0.010
ines-0.010
eds-0.010
Token mint
Token ID21043
Feature activation+7.65e-01

Pos logit contributions
 hes+0.063
 toddler+0.061
 curiously+0.059
 daughter+0.056
 hopefully+0.056
Neg logit contributions
opl-0.046
akers-0.043
eds-0.043
arter-0.042
ines-0.041
Token tea
Token ID8887
Feature activation+9.61e-01

Pos logit contributions
opl+0.027
arter+0.025
agons+0.023
smart+0.023
arl+0.023
Neg logit contributions
 hes-0.025
 toddler-0.024
 curiously-0.023
 daughter-0.022
 favourite-0.021
Token for
Token ID329
Feature activation+5.66e-01

Pos logit contributions
opl+0.025
agons+0.024
akers+0.024
arter+0.024
eds+0.024
Neg logit contributions
 hes-0.039
 toddler-0.037
 curiously-0.037
 hopefully-0.035
 marvel-0.034
Token you
Token ID345
Feature activation-1.53e+00

Pos logit contributions
opl+0.017
ines+0.017
akers+0.017
agons+0.016
eds+0.016
Neg logit contributions
 hes-0.022
 curiously-0.020
 toddler-0.020
 hopefully-0.019
 marvel-0.019
Token,"
Token ID553
Feature activation-2.05e+00

Pos logit contributions
 hes+0.066
 curiously+0.064
 toddler+0.062
 marvel+0.060
 hopefully+0.060
Neg logit contributions
opl-0.039
agons-0.039
arter-0.035
akers-0.034
eds-0.034
Token Ben
Token ID3932
Feature activation-1.20e+00

Pos logit contributions
 hes+0.079
ounded+0.077
 toddler+0.077
 intrigued+0.077
 Int+0.076
Neg logit contributions
ards-0.048
ons-0.046
mm-0.044
opl-0.043
 cheese-0.043
Token said
Token ID531
Feature activation-1.47e+00

Pos logit contributions
 hes+0.037
 toddler+0.037
 curiously+0.036
 Int+0.035
 hopefully+0.034
Neg logit contributions
akers-0.038
ards-0.038
ines-0.037
ons-0.036
py-0.035
Token,
Token ID11
Feature activation-8.46e-01

Pos logit contributions
 hes+0.078
 toddler+0.076
 Int+0.074
 curiously+0.073
 marvel+0.072
Neg logit contributions
akers-0.019
smart-0.017
ines-0.017
ards-0.016
agons-0.015
Token giving
Token ID3501
Feature activation-7.26e-01

Pos logit contributions
 hes+0.030
 toddler+0.029
 curiously+0.028
 Int+0.027
 marvel+0.026
Neg logit contributions
akers-0.025
opl-0.024
arl-0.024
ards-0.024
agons-0.024
Token
Token ID198
Feature activation-4.68e-01

Pos logit contributions
 hes+0.065
 toddler+0.059
 curiously+0.058
 marvel+0.056
 intrigued+0.056
Neg logit contributions
ines-0.061
arl-0.060
eds-0.059
akers-0.059
!"-0.058
Token
Token ID198
Feature activation-2.34e-01

Pos logit contributions
 hes+0.018
 toddler+0.016
 curiously+0.015
 marvel+0.015
 hopefully+0.015
Neg logit contributions
ines-0.014
eds-0.013
akers-0.013
arl-0.013
agons-0.013
TokenM
Token ID44
Feature activation+1.15e-01

Pos logit contributions
 hes+0.010
 toddler+0.009
 curiously+0.009
 Int+0.009
 marvel+0.009
Neg logit contributions
akers-0.006
ines-0.005
eds-0.005
agons-0.005
 years-0.005
Tokenolly
Token ID5098
Feature activation+2.60e-01

Pos logit contributions
akers+0.004
agons+0.004
ines+0.004
opl+0.004
eds+0.004
Neg logit contributions
 hes-0.004
 toddler-0.004
 curiously-0.004
 Int-0.004
 marvel-0.004
Token looked
Token ID3114
Feature activation-8.34e-02

Pos logit contributions
 years+0.007
ards+0.007
akers+0.007
arts+0.007
ines+0.007
Neg logit contributions
 toddler-0.008
 hes-0.008
 hopefully-0.008
 Int-0.008
ancer-0.008
Token up
Token ID510
Feature activation-5.49e+00

Pos logit contributions
 hes+0.003
 toddler+0.003
Isa+0.003
 convince+0.003
 mirac+0.003
Neg logit contributions
ines-0.002
ards-0.002
akers-0.002
 years-0.002
!"-0.002
Token with
Token ID351
Feature activation-2.17e+00

Pos logit contributions
 Int+0.277
Whoever+0.274
 hes+0.273
Isa+0.273
 toddler+0.269
Neg logit contributions
,-0.064
 years-0.061
!-0.060
.-0.052
 that-0.050
Token confusion
Token ID10802
Feature activation-8.38e-01

Pos logit contributions
 hes+0.069
 curiously+0.066
 toddler+0.065
 Int+0.063
 mirac+0.061
Neg logit contributions
opl-0.065
 lava-0.064
 years-0.064
smart-0.063
agons-0.063
Token and
Token ID290
Feature activation-1.17e+00

Pos logit contributions
 hes+0.040
 toddler+0.040
 Int+0.039
 curiously+0.038
 Exc+0.038
Neg logit contributions
ines-0.013
akers-0.013
smart-0.012
arl-0.012
agons-0.012
Token said
Token ID531
Feature activation-6.41e-01

Pos logit contributions
 toddler+0.041
Whoever+0.041
Isa+0.039
 athlete+0.039
ancer+0.039
Neg logit contributions
 years-0.027
 Kim-0.026
 that-0.026
ards-0.025
 ow-0.024
Token,
Token ID11
Feature activation+1.23e-01

Pos logit contributions
 hes+0.033
 toddler+0.033
 curiously+0.032
 Int+0.031
 marvel+0.031
Neg logit contributions
akers-0.009
ines-0.009
agons-0.008
smart-0.008
arl-0.008
Token.
Token ID13
Feature activation+1.23e-03

Pos logit contributions
akers+0.008
agons+0.007
opl+0.007
ines+0.007
eds+0.007
Neg logit contributions
 hes-0.015
 curiously-0.014
 toddler-0.014
 hopefully-0.013
 marvel-0.013
Token Just
Token ID2329
Feature activation-5.70e-01

Pos logit contributions
akers+0.000
opl+0.000
agons+0.000
ines+0.000
smart+0.000
Neg logit contributions
 hes-0.000
 toddler-0.000
 curiously-0.000
 marvel-0.000
 hopefully-0.000
Token then
Token ID788
Feature activation-9.40e-01

Pos logit contributions
 toddler+0.020
 hes+0.019
 Int+0.019
 curiously+0.018
 Exc+0.018
Neg logit contributions
akers-0.017
ines-0.016
opl-0.016
py-0.016
ards-0.016
Token,
Token ID11
Feature activation+2.36e-01

Pos logit contributions
 toddler+0.050
 hes+0.049
 curiously+0.048
 Int+0.047
 marvel+0.046
Neg logit contributions
akers-0.010
smart-0.010
ines-0.010
ards-0.009
opl-0.009
Token something
Token ID1223
Feature activation-2.30e+00

Pos logit contributions
 lava+0.006
akers+0.006
smart+0.006
ards+0.006
 fires+0.005
Neg logit contributions
 hes-0.009
 toddler-0.009
 curiously-0.008
 Int-0.008
 marvel-0.008
Token else
Token ID2073
Feature activation-5.48e+00

Pos logit contributions
 toddler+0.096
 hes+0.091
 hopefully+0.089
 curiously+0.089
 Int+0.086
Neg logit contributions
ines-0.054
 lava-0.052
akers-0.052
smart-0.052
ards-0.051
Token happened
Token ID3022
Feature activation-3.49e+00

Pos logit contributions
 hes+0.176
 toddler+0.174
 curiously+0.167
 Int+0.161
 hopefully+0.158
Neg logit contributions
akers-0.185
ines-0.182
eds-0.178
smart-0.174
agons-0.174
Token.
Token ID13
Feature activation+1.34e-01

Pos logit contributions
 hes+0.175
 toddler+0.172
 curiously+0.169
 Int+0.162
 marvel+0.161
Neg logit contributions
akers-0.058
ines-0.055
agons-0.055
opl-0.055
eds-0.053
Token He
Token ID679
Feature activation+5.63e-01

Pos logit contributions
 lava+0.003
smart+0.003
arl+0.003
 Glass+0.003
akers+0.003
Neg logit contributions
 hes-0.005
 curiously-0.005
 marvel-0.005
 convince-0.005
 toddler-0.005
Token heard
Token ID2982
Feature activation+1.60e+00

Pos logit contributions
eds+0.017
akers+0.017
ines+0.017
 years+0.017
ards+0.016
Neg logit contributions
 toddler-0.018
 hes-0.017
 hopefully-0.017
 Int-0.017
 curiously-0.016
Token a
Token ID257
Feature activation-5.63e-01

Pos logit contributions
akers+0.044
opl+0.043
ines+0.043
agons+0.042
eds+0.042
Neg logit contributions
 hes-0.060
 toddler-0.059
 curiously-0.057
 hopefully-0.054
 Int-0.054
Token the
Token ID262
Feature activation-1.82e-01

Pos logit contributions
akers+0.022
ines+0.020
agons+0.020
eds+0.020
arter+0.019
Neg logit contributions
 hes-0.017
 curiously-0.017
 toddler-0.017
 nervously-0.015
 hopefully-0.015
Token ground
Token ID2323
Feature activation+9.01e-01

Pos logit contributions
 hes+0.007
 toddler+0.007
 curiously+0.007
 marvel+0.006
 Int+0.006
Neg logit contributions
akers-0.006
ines-0.005
agons-0.005
opl-0.005
eds-0.005
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
agons+0.026
akers+0.026
opl+0.024
eds+0.024
arl+0.024
Neg logit contributions
 hes-0.036
 curiously-0.035
 toddler-0.034
 marvel-0.033
 hopefully-0.032
Token picked
Token ID6497
Feature activation-3.41e-01

Pos logit contributions
 hes+0.034
 toddler+0.033
 curiously+0.032
 Int+0.031
 hopefully+0.030
Neg logit contributions
akers-0.034
ards-0.034
eds-0.033
ines-0.033
smart-0.033
Token it
Token ID340
Feature activation-4.99e-01

Pos logit contributions
 hes+0.013
 toddler+0.013
 curiously+0.012
 Int+0.012
 intrigued+0.012
Neg logit contributions
opl-0.008
ines-0.008
ons-0.008
eds-0.008
akers-0.008
Token up
Token ID510
Feature activation-5.30e+00

Pos logit contributions
 hes+0.014
 toddler+0.014
 Int+0.012
 intrigued+0.012
 marvel+0.012
Neg logit contributions
ines-0.019
eds-0.018
akers-0.018
agons-0.018
!"-0.018
Token.
Token ID13
Feature activation-1.58e-01

Pos logit contributions
 hes+0.267
 toddler+0.261
 curiously+0.253
 Int+0.250
 intrigued+0.244
Neg logit contributions
akers-0.088
ines-0.083
agons-0.079
eds-0.078
opl-0.077
Token She
Token ID1375
Feature activation+1.79e+00

Pos logit contributions
 hes+0.006
 toddler+0.006
 curiously+0.006
 marvel+0.005
 hopefully+0.005
Neg logit contributions
agons-0.004
opl-0.004
arl-0.004
ines-0.004
akers-0.004
Token brought
Token ID3181
Feature activation-6.49e-01

Pos logit contributions
akers+0.052
ons+0.051
ards+0.050
ines+0.050
eds+0.050
Neg logit contributions
 hes-0.060
 toddler-0.060
 Int-0.057
 curiously-0.057
 hopefully-0.057
Token it
Token ID340
Feature activation-1.20e+00

Pos logit contributions
 hes+0.025
 toddler+0.024
 curiously+0.024
 intrigued+0.023
 Int+0.023
Neg logit contributions
akers-0.017
ines-0.017
eds-0.017
ons-0.017
agons-0.017
Token back
Token ID736
Feature activation-1.20e+00

Pos logit contributions
 hes+0.035
 curiously+0.033
 toddler+0.033
 Int+0.030
 marvel+0.030
Neg logit contributions
akers-0.046
agons-0.045
opl-0.045
ines-0.045
eds-0.044
Token it
Token ID340
Feature activation-6.76e-01

Pos logit contributions
akers+0.034
agons+0.033
opl+0.032
ines+0.032
arl+0.032
Neg logit contributions
 hes-0.028
 curiously-0.027
 toddler-0.026
 marvel-0.024
 hopefully-0.024
Token.
Token ID13
Feature activation-6.16e-01

Pos logit contributions
 hes+0.030
 curiously+0.029
 toddler+0.028
 hopefully+0.027
 marvel+0.027
Neg logit contributions
akers-0.017
agons-0.017
opl-0.016
eds-0.016
arl-0.016
Token He
Token ID679
Feature activation+5.33e-01

Pos logit contributions
 hes+0.029
 toddler+0.027
 curiously+0.026
 marvel+0.025
 convince+0.025
Neg logit contributions
opl-0.010
!"-0.010
 Kim-0.010
ards-0.010
 Glass-0.009
Token picked
Token ID6497
Feature activation-1.10e+00

Pos logit contributions
akers+0.017
ards+0.017
eds+0.016
ines+0.016
py+0.016
Neg logit contributions
 toddler-0.016
 hes-0.016
 hopefully-0.015
 curiously-0.015
 Int-0.015
Token it
Token ID340
Feature activation-7.08e-01

Pos logit contributions
 hes+0.042
 toddler+0.042
 curiously+0.040
 Int+0.039
 intrigued+0.039
Neg logit contributions
ons-0.028
eds-0.028
 lava-0.028
ines-0.028
opl-0.027
Token up
Token ID510
Feature activation-5.50e+00

Pos logit contributions
 hes+0.017
 toddler+0.015
 curiously+0.014
 Int+0.014
 marvel+0.013
Neg logit contributions
akers-0.031
ines-0.031
agons-0.030
eds-0.030
opl-0.030
Token,
Token ID11
Feature activation-5.33e-01

Pos logit contributions
 hes+0.223
 toddler+0.212
 curiously+0.212
 Int+0.199
 marvel+0.199
Neg logit contributions
akers-0.151
agons-0.147
opl-0.146
ines-0.145
eds-0.142
Token but
Token ID475
Feature activation-2.88e+00

Pos logit contributions
 toddler+0.022
 hes+0.021
 curiously+0.020
 Int+0.020
 intrigued+0.020
Neg logit contributions
ards-0.012
akers-0.012
ons-0.011
eds-0.011
smart-0.011
Token it
Token ID340
Feature activation-2.31e+00

Pos logit contributions
 hes+0.115
 toddler+0.114
 Int+0.109
 curiously+0.108
 marvel+0.107
Neg logit contributions
ards-0.066
 lava-0.066
ons-0.064
akers-0.063
 years-0.062
Token was
Token ID373
Feature activation-1.68e+00

Pos logit contributions
 hes+0.089
 toddler+0.087
 curiously+0.082
 marvel+0.081
 Clare+0.081
Neg logit contributions
ards-0.062
eds-0.060
 lava-0.060
ines-0.059
akers-0.059
Token very
Token ID845
Feature activation-2.78e-01

Pos logit contributions
 hes+0.061
 curiously+0.057
 toddler+0.055
 intrigued+0.052
 und+0.052
Neg logit contributions
agons-0.057
akers-0.057
opl-0.057
ines-0.054
eds-0.053
Token a
Token ID257
Feature activation-7.38e-01

Pos logit contributions
akers+0.044
agons+0.043
opl+0.043
ines+0.043
eds+0.042
Neg logit contributions
 hes-0.047
 curiously-0.045
 toddler-0.045
 Int-0.041
 marvel-0.041
Token noise
Token ID7838
Feature activation-3.29e-01

Pos logit contributions
 hes+0.030
 curiously+0.029
 toddler+0.029
 Int+0.027
 marvel+0.027
Neg logit contributions
akers-0.022
opl-0.021
agons-0.021
ines-0.020
eds-0.020
Token.
Token ID13
Feature activation-1.98e-01

Pos logit contributions
 hes+0.013
 curiously+0.013
 toddler+0.012
 Int+0.012
 marvel+0.012
Neg logit contributions
akers-0.009
agons-0.009
opl-0.009
ines-0.009
eds-0.009
Token She
Token ID1375
Feature activation+1.53e+00

Pos logit contributions
 hes+0.008
 toddler+0.007
 curiously+0.007
 marvel+0.007
 hopefully+0.007
Neg logit contributions
akers-0.006
agons-0.005
opl-0.005
ines-0.005
arl-0.005
Token looked
Token ID3114
Feature activation-1.11e-01

Pos logit contributions
eds+0.048
ines+0.048
akers+0.047
agons+0.047
py+0.046
Neg logit contributions
 hes-0.049
 toddler-0.049
 hopefully-0.045
 curiously-0.045
 intrigued-0.043
Token up
Token ID510
Feature activation-5.56e+00

Pos logit contributions
 hes+0.003
 toddler+0.003
 curiously+0.003
 hopefully+0.003
 Int+0.003
Neg logit contributions
ines-0.004
akers-0.004
agons-0.004
eds-0.004
ards-0.004
Token to
Token ID284
Feature activation-9.83e-01

Pos logit contributions
 toddler+0.280
 hes+0.276
 Int+0.270
Isa+0.266
 Evan+0.264
Neg logit contributions
ines-0.073
 years-0.065
!-0.059
!"-0.056
 lava-0.054
Token see
Token ID766
Feature activation+3.62e-01

Pos logit contributions
 hes+0.038
 toddler+0.037
 curiously+0.035
 intrigued+0.035
 Int+0.034
Neg logit contributions
opl-0.024
arter-0.024
eds-0.024
py-0.024
ines-0.024
Token her
Token ID607
Feature activation-1.01e+00

Pos logit contributions
akers+0.011
agons+0.011
opl+0.011
eds+0.011
ines+0.011
Neg logit contributions
 hes-0.014
 curiously-0.013
 toddler-0.013
 hopefully-0.012
 marvel-0.012
Token friend
Token ID1545
Feature activation-7.72e-01

Pos logit contributions
 hes+0.036
 toddler+0.033
 Int+0.032
 curiously+0.032
 marvel+0.030
Neg logit contributions
akers-0.032
ines-0.032
opl-0.032
agons-0.031
eds-0.031
Token Bob
Token ID5811
Feature activation-2.04e+00

Pos logit contributions
 hes+0.037
 toddler+0.036
 curiously+0.035
 Int+0.034
 marvel+0.034
Neg logit contributions
akers-0.015
ines-0.014
opl-0.014
agons-0.014
eds-0.013