TOP ACTIVATIONS
MAX = 0.230

her mom my . L ily ran inside
and dad every day . One day , Lily
with a big ball . Tim said , "
the parade a lot . Tim saw a man
her sweater very much . One day , Lily
tables , and beds . One day , Tim
play and have fun . One day , Spot
Lily was very happy . L ily wanted to
and c udd ly . One day , Tim
ran to her mom . " Mom my ,
went inside the house . In the house ,
and got very dirty . Tom said , "
were very good friends . One day , Sue
and felt very happy . But then , her
in the warm sun . One day , Max
when she j ogged . While j ogging ,
Sue looked sad . Tim asked , "
little dog , Max . One day , Sue
it to his mom . Tim said , "
ran to the tree . " Hi , Tree

MIN ACTIVATIONS
MIN = -0.265

so loud with all the furniture talking , it was hard
one of the p udd les and watched as the rain
could hide in the little cracks and cre v ices of
Spr inkle it on any area and you will be granted
for getting the wild new carpet . <|endoftext|> Once there was
in its ears , the switch knows that it will never
all of the family 's possessions had been taken away ,
given you . Feel the air and let it take you
they could now use the glue . The man was happy
the fairy of the magic wand . You can make anything
had their special pe b ble which united them in friendship
something else - a beautiful wand with a star on the
his dinner with his cob web . <|endoftext|> Once upon a
kind and looked after her body . They gave her some
day and the way her body felt with each step .
the answers to your exam paper ." M ia
therapy of being in the pool had worked its magic .
tried one of the unknown machines , but she knew it
disagreements and show people the path of kindness and understanding .
were taking care of the air . From that moment ,

INTERVAL 0.106 - 0.230
CONTAINS 6.541%

and hide behind her . " Mom , Mom
!" she said . <|endoftext|> Once upon a time , there
on the board every day . One day , she drew
a little girl named Lily . She loved to play outside
happy and laugh a lot . But then they

INTERVAL -0.018 - 0.106
CONTAINS 52.278%

slow down her slide . When she reached the bottom ,
her in the cart . Maybe she forgot to take her
Max thought about it for a moment and then said ,
'm sorry , Lily . Can I share your colors ,
catch with this different ball I found . It 's really

INTERVAL -0.142 - -0.018
CONTAINS 40.410%

, and they put out the fire . From that day
flew into a hole in the ground . She couldn 't
ma was so happy and proud , and off she went
wanted to know what it was , so he went to
my calls . " The slide is high and fast .

INTERVAL -0.265 - -0.142
CONTAINS 0.771%

real estate and started a business together . Soon , they
� You cl ap your hands as fast as you can
wide would come to the well to take a peek inside
were papers and books and toys all over the floor .
, dreaming of all the music he will dance to in
Token her
Token ID607
Feature activation+3.80e-02

Pos logit contributions
V+0.076
E+0.071
+0.068
Look+0.066
 Many+0.065
Neg logit contributions
ier-0.053
 size-0.051
 perfectly-0.047
 wouldn-0.047
 away-0.046
Token mom
Token ID1995
Feature activation+3.41e-02

Pos logit contributions
V+0.056
+0.054
E+0.053
 Many+0.049
Look+0.049
Neg logit contributions
ier-0.056
 size-0.049
 away-0.049
ist-0.049
 wouldn-0.048
Tokenmy
Token ID1820
Feature activation-5.34e-03

Pos logit contributions
V+0.057
+0.056
E+0.054
Look+0.050
Mom+0.049
Neg logit contributions
ier-0.041
 size-0.039
 perfectly-0.038
 wouldn-0.038
 wand-0.037
Token.
Token ID13
Feature activation+1.51e-01

Pos logit contributions
ier+0.008
 wouldn+0.007
 size+0.007
 perfectly+0.007
 inches+0.007
Neg logit contributions
-0.009
V-0.007
 The-0.007
E-0.007
M-0.006
Token
Token ID220
Feature activation+2.06e-01

Pos logit contributions
Ta+0.263
Mic+0.251
V+0.250
Mac+0.241
Ah+0.241
Neg logit contributions
 and-0.217
,-0.211
.-0.210
 to-0.206
 as-0.184
Token
Token ID198
Feature activation+2.30e-01

Pos logit contributions
V+0.319
Grab+0.316
Ta+0.315
Quick+0.314
Uh+0.314
Neg logit contributions
.-0.315
 for-0.276
 in-0.271
 and-0.254
 all-0.248
Token
Token ID198
Feature activation+1.09e-01

Pos logit contributions
 Deal+0.440
Grab+0.435
Ta+0.417
Uh+0.410
Really+0.401
Neg logit contributions
.-0.310
 and-0.292
,-0.262
 to-0.255
 for-0.252
TokenL
Token ID43
Feature activation+9.59e-02

Pos logit contributions
 Deal+0.126
 Ow+0.125
 Glass+0.110
 Some+0.109
 Next+0.108
Neg logit contributions
 in-0.164
 away-0.163
 and-0.156
 made-0.156
 when-0.154
Tokenily
Token ID813
Feature activation+2.53e-02

Pos logit contributions
V+0.159
Look+0.145
E+0.145
Br+0.141
Wow+0.141
Neg logit contributions
ier-0.139
 for-0.124
 enough-0.119
ies-0.118
ils-0.116
Token ran
Token ID4966
Feature activation+7.64e-02

Pos logit contributions
V+0.043
+0.041
E+0.039
 While+0.038
Look+0.037
Neg logit contributions
ier-0.036
iness-0.030
 luck-0.030
 perfectly-0.029
 size-0.029
Token inside
Token ID2641
Feature activation+5.96e-02

Pos logit contributions
V+0.202
E+0.200
Look+0.186
O+0.183
Wow+0.182
Neg logit contributions
 away-0.032
 for-0.028
 enough-0.021
 in-0.017
 as-0.016
Token and
Token ID290
Feature activation+7.08e-03

Pos logit contributions
ier+0.019
 powers+0.018
eking+0.017
 coins+0.017
 happen+0.017
Neg logit contributions
-0.021
Mom-0.017
 The-0.016
M-0.016
 While-0.016
Token dad
Token ID9955
Feature activation-6.80e-03

Pos logit contributions
+0.008
Mom+0.007
Dad+0.007
V+0.007
 While+0.007
Neg logit contributions
ier-0.015
 perfectly-0.013
ist-0.013
et-0.013
ening-0.012
Token every
Token ID790
Feature activation+4.62e-02

Pos logit contributions
ier+0.009
 airplane+0.008
 powers+0.008
castle+0.008
 thieves+0.008
Neg logit contributions
-0.012
 -0.010
 While-0.010
V-0.010
 The-0.009
Token day
Token ID1110
Feature activation+7.40e-03

Pos logit contributions
V+0.084
E+0.078
Look+0.075
+0.075
 Many+0.072
Neg logit contributions
ier-0.052
 size-0.047
 perfectly-0.047
 enough-0.047
 wouldn-0.046
Token.
Token ID13
Feature activation+1.64e-01

Pos logit contributions
+0.014
Mom+0.011
 The+0.011
E+0.011
V+0.011
Neg logit contributions
ier-0.009
 wand-0.009
ener-0.008
et-0.008
ps-0.008
Token
Token ID198
Feature activation+2.29e-01

Pos logit contributions
Quick+0.294
Ta+0.282
Grab+0.269
AAAA+0.269
Cur+0.263
Neg logit contributions
.-0.219
 for-0.212
 and-0.201
,-0.196
 to-0.186
Token
Token ID198
Feature activation+1.25e-01

Pos logit contributions
Grab+0.417
 Deal+0.414
Ta+0.390
Really+0.360
Uh+0.359
Neg logit contributions
.-0.339
 for-0.303
 and-0.299
 in-0.297
 to-0.285
TokenOne
Token ID3198
Feature activation+8.25e-02

Pos logit contributions
 Deal+0.198
 Int+0.184
 Here+0.179
 Ow+0.177
Ta+0.177
Neg logit contributions
 would-0.150
 for-0.149
 in-0.139
 often-0.135
 game-0.132
Token day
Token ID1110
Feature activation+9.33e-02

Pos logit contributions
V+0.201
E+0.193
Look+0.183
Wow+0.174
 Many+0.170
Neg logit contributions
 for-0.053
 in-0.042
 away-0.041
 size-0.040
 left-0.040
Token,
Token ID11
Feature activation+7.68e-02

Pos logit contributions
V+0.281
E+0.271
Look+0.259
Wow+0.250
Jen+0.249
Neg logit contributions
 for-0.006
 in-0.001
,-0.000
 enough+0.002
 as+0.005
Token Lily
Token ID20037
Feature activation+3.08e-02

Pos logit contributions
V+0.109
E+0.100
Look+0.090
 Many+0.088
O+0.086
Neg logit contributions
 for-0.114
 away-0.110
 enough-0.110
 as-0.109
 size-0.107
Token with
Token ID351
Feature activation+3.50e-02

Pos logit contributions
V+0.078
+0.073
E+0.072
Look+0.071
 Many+0.069
Neg logit contributions
ier-0.038
 size-0.035
 enough-0.034
 wouldn-0.033
 airplane-0.033
Token a
Token ID257
Feature activation+2.28e-02

Pos logit contributions
V+0.054
E+0.051
+0.049
 Many+0.047
Look+0.047
Neg logit contributions
ier-0.052
 wouldn-0.048
 size-0.047
 perfectly-0.046
 enough-0.046
Token big
Token ID1263
Feature activation+1.81e-02

Pos logit contributions
V+0.038
Look+0.035
E+0.035
+0.034
L+0.033
Neg logit contributions
ier-0.032
 enough-0.029
 wouldn-0.028
 perfectly-0.028
iness-0.027
Token ball
Token ID2613
Feature activation+1.01e-02

Pos logit contributions
V+0.028
Look+0.025
+0.025
 Many+0.024
 While+0.023
Neg logit contributions
ier-0.028
 wouldn-0.025
 perfectly-0.024
 enough-0.023
ist-0.023
Token.
Token ID13
Feature activation+1.51e-01

Pos logit contributions
+0.019
V+0.018
 While+0.017
L+0.017
 Many+0.016
Neg logit contributions
ier-0.011
 luck-0.010
ener-0.010
 wouldn-0.010
 size-0.010
Token
Token ID198
Feature activation+2.28e-01

Pos logit contributions
E+0.256
Isa+0.234
Br+0.230
V+0.225
Ta+0.222
Neg logit contributions
,-0.225
 and-0.214
 to-0.206
.-0.188
 for-0.181
Token
Token ID198
Feature activation+1.22e-01

Pos logit contributions
 Deal+0.454
Really+0.418
Ta+0.409
Grab+0.406
omm+0.403
Neg logit contributions
 and-0.328
.-0.325
,-0.293
 in-0.289
 to-0.286
TokenTim
Token ID14967
Feature activation+2.63e-02

Pos logit contributions
 Deal+0.180
omm+0.141
Isa+0.136
 Ow+0.134
ucks+0.131
Neg logit contributions
 and-0.199
 made-0.197
 in-0.194
!-0.189
,-0.188
Token said
Token ID531
Feature activation+4.95e-02

Pos logit contributions
V+0.045
+0.043
 While+0.039
E+0.039
Look+0.039
Neg logit contributions
ier-0.036
 luck-0.033
 size-0.032
 perfectly-0.031
iness-0.031
Token,
Token ID11
Feature activation+5.78e-02

Pos logit contributions
V+0.115
E+0.110
+0.105
Look+0.104
 Many+0.103
Neg logit contributions
ier-0.028
 size-0.027
 away-0.025
 enough-0.025
 perfectly-0.024
Token "
Token ID366
Feature activation+1.21e-01

Pos logit contributions
V+0.073
E+0.067
Look+0.060
 Many+0.057
+0.057
Neg logit contributions
ier-0.093
 enough-0.091
 away-0.091
 size-0.091
 for-0.089
Token the
Token ID262
Feature activation+1.96e-02

Pos logit contributions
V+0.019
+0.018
 Many+0.016
Look+0.015
 Whe+0.015
Neg logit contributions
ier-0.017
ener-0.015
ist-0.015
 perfectly-0.015
iness-0.015
Token parade
Token ID16134
Feature activation-5.51e-02

Pos logit contributions
V+0.035
+0.032
Look+0.032
 Many+0.031
E+0.030
Neg logit contributions
ier-0.024
 wouldn-0.024
ener-0.023
 perfectly-0.023
 enough-0.022
Token a
Token ID257
Feature activation+3.08e-02

Pos logit contributions
eking+0.110
ener+0.108
pan+0.105
 breaths+0.103
 permission+0.103
Neg logit contributions
-0.076
 They-0.055
 The-0.054
 And-0.049
 Everyone-0.045
Token lot
Token ID1256
Feature activation-1.88e-02

Pos logit contributions
V+0.043
E+0.039
+0.038
Look+0.036
Wow+0.034
Neg logit contributions
ier-0.049
 enough-0.048
 away-0.046
 wouldn-0.045
 perfectly-0.045
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
 wouldn+0.028
ener+0.027
 size+0.027
eking+0.027
ened+0.027
Neg logit contributions
-0.030
L-0.022
 The-0.022
Tom-0.021
 And-0.021
Token
Token ID198
Feature activation+2.27e-01

Pos logit contributions
E+0.260
Ta+0.259
Mic+0.247
Ba+0.247
 Deal+0.243
Neg logit contributions
.-0.193
 and-0.189
,-0.182
 to-0.171
 for-0.160
Token
Token ID198
Feature activation+1.15e-01

Pos logit contributions
 Deal+0.394
Grab+0.380
Ta+0.376
Really+0.361
Uh+0.354
Neg logit contributions
.-0.349
 and-0.331
 in-0.309
 to-0.302
,-0.297
TokenTim
Token ID14967
Feature activation+2.20e-02

Pos logit contributions
 Deal+0.167
 Here+0.125
 Ow+0.119
V+0.119
Ta+0.116
Neg logit contributions
 made-0.183
 in-0.178
 and-0.175
 would-0.173
 like-0.172
Token saw
Token ID2497
Feature activation+4.94e-02

Pos logit contributions
V+0.037
+0.035
 While+0.031
Look+0.031
E+0.031
Neg logit contributions
ier-0.032
 luck-0.029
 size-0.028
 wand-0.028
rier-0.028
Token a
Token ID257
Feature activation+1.48e-02

Pos logit contributions
V+0.090
E+0.084
Look+0.079
+0.079
 Many+0.078
Neg logit contributions
ier-0.055
 size-0.053
 away-0.052
 enough-0.051
 perfectly-0.051
Token man
Token ID582
Feature activation-6.13e-03

Pos logit contributions
V+0.022
+0.019
Look+0.019
E+0.019
Wow+0.018
Neg logit contributions
ier-0.023
 enough-0.022
et-0.022
 perfectly-0.022
 wouldn-0.022
Token her
Token ID607
Feature activation+6.95e-02

Pos logit contributions
V+0.104
E+0.098
Look+0.092
+0.092
 Many+0.090
Neg logit contributions
ier-0.048
 size-0.045
 away-0.045
 enough-0.045
 perfectly-0.043
Token sweater
Token ID35685
Feature activation-3.23e-02

Pos logit contributions
V+0.139
E+0.133
Look+0.126
 Many+0.124
Wow+0.121
Neg logit contributions
 away-0.060
 size-0.059
 enough-0.058
 for-0.057
 perfectly-0.056
Token very
Token ID845
Feature activation+8.44e-02

Pos logit contributions
ener+0.055
ening+0.052
eking+0.051
rying+0.050
lling+0.049
Neg logit contributions
-0.048
 While-0.038
 The-0.037
 And-0.035
 But-0.035
Token much
Token ID881
Feature activation+2.30e-03

Pos logit contributions
V+0.169
Look+0.157
E+0.146
Wow+0.144
Kids+0.138
Neg logit contributions
 perfectly-0.080
ier-0.080
 for-0.078
 enough-0.078
 size-0.076
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
+0.004
E+0.003
V+0.003
 Now+0.003
L+0.003
Neg logit contributions
eking-0.003
 size-0.003
ener-0.002
 falling-0.002
 wand-0.002
Token
Token ID198
Feature activation+2.27e-01

Pos logit contributions
Quick+0.316
Uh+0.289
Pe+0.274
Ta+0.272
Phill+0.271
Neg logit contributions
.-0.215
,-0.205
 and-0.200
 for-0.171
 to-0.170
Token
Token ID198
Feature activation+1.20e-01

Pos logit contributions
 Deal+0.425
Grab+0.416
Ta+0.386
Uh+0.374
Quick+0.369
Neg logit contributions
.-0.335
 and-0.302
 in-0.287
 to-0.283
 for-0.280
TokenOne
Token ID3198
Feature activation+7.75e-02

Pos logit contributions
 Deal+0.202
 Ow+0.176
 Glass+0.167
 Here+0.166
ucks+0.165
Neg logit contributions
 and-0.148
 would-0.147
 in-0.145
 made-0.144
.-0.143
Token day
Token ID1110
Feature activation+8.95e-02

Pos logit contributions
V+0.182
E+0.176
Look+0.167
Wow+0.158
 Many+0.156
Neg logit contributions
 for-0.050
 size-0.043
 away-0.043
 in-0.041
 left-0.040
Token,
Token ID11
Feature activation+8.11e-02

Pos logit contributions
V+0.265
E+0.258
Look+0.245
 Many+0.237
Wow+0.236
Neg logit contributions
 for-0.005
 in-0.001
,-0.000
 enough+0.002
 away+0.004
Token Lily
Token ID20037
Feature activation+3.51e-02

Pos logit contributions
V+0.117
E+0.109
Look+0.098
 Many+0.095
Kids+0.094
Neg logit contributions
 for-0.120
 as-0.116
 in-0.113
 away-0.113
 enough-0.110
Token tables
Token ID8893
Feature activation-3.93e-02

Pos logit contributions
V+0.011
+0.011
 Many+0.010
E+0.010
Tim+0.010
Neg logit contributions
ier-0.009
 luck-0.008
 away-0.008
ener-0.008
iness-0.008
Token,
Token ID11
Feature activation+1.73e-02

Pos logit contributions
 powers+0.062
 luck+0.062
ener+0.061
 reality+0.060
ience+0.059
Neg logit contributions
-0.059
 The-0.046
 They-0.043
 In-0.042
 At-0.041
Token and
Token ID290
Feature activation-1.23e-02

Pos logit contributions
V+0.032
+0.031
Look+0.030
E+0.029
Mom+0.029
Neg logit contributions
ier-0.018
 luck-0.016
 perfectly-0.016
 powers-0.016
 size-0.015
Token beds
Token ID20237
Feature activation-6.54e-02

Pos logit contributions
ier+0.019
ist+0.019
ener+0.018
iness+0.018
 perfectly+0.018
Neg logit contributions
 Many-0.019
V-0.018
E-0.017
-0.017
Look-0.017
Token.
Token ID13
Feature activation+1.53e-01

Pos logit contributions
 obstacle+0.118
ience+0.117
ener+0.116
 pearl+0.116
 worries+0.116
Neg logit contributions
-0.087
 They-0.063
-0.054
 The-0.052
 In-0.049
Token
Token ID198
Feature activation+2.27e-01

Pos logit contributions
Ba+0.293
Isa+0.279
Ta+0.276
Har+0.264
V+0.264
Neg logit contributions
.-0.147
 for-0.142
,-0.138
 and-0.132
 too-0.125
Token
Token ID198
Feature activation+1.31e-01

Pos logit contributions
 Deal+0.446
Ta+0.445
Grab+0.430
Whe+0.391
Really+0.386
Neg logit contributions
.-0.301
 and-0.270
,-0.253
 in-0.249
 to-0.249
TokenOne
Token ID3198
Feature activation+7.93e-02

Pos logit contributions
 Deal+0.197
Isa+0.173
Ta+0.167
V+0.164
Ba+0.160
Neg logit contributions
 in-0.174
 for-0.169
,-0.162
 made-0.161
 would-0.161
Token day
Token ID1110
Feature activation+9.90e-02

Pos logit contributions
V+0.187
E+0.182
Look+0.172
Wow+0.163
O+0.160
Neg logit contributions
 for-0.055
 in-0.044
 size-0.044
 away-0.042
 left-0.042
Token,
Token ID11
Feature activation+8.29e-02

Pos logit contributions
V+0.313
E+0.303
Look+0.281
Kids+0.280
Ta+0.275
Neg logit contributions
 for-0.003
,-0.000
 in+0.002
 as+0.006
 enough+0.010
Token Tim
Token ID5045
Feature activation+4.41e-02

Pos logit contributions
V+0.130
E+0.120
Kids+0.106
Look+0.102
 Deal+0.101
Neg logit contributions
 for-0.118
 as-0.117
 in-0.113
 enough-0.110
 away-0.110
Token play
Token ID711
Feature activation+7.03e-02

Pos logit contributions
V+0.046
+0.043
 Many+0.042
 While+0.041
E+0.040
Neg logit contributions
ier-0.053
 enough-0.047
 perfectly-0.046
 wouldn-0.044
ener-0.044
Token and
Token ID290
Feature activation+4.18e-02

Pos logit contributions
V+0.172
E+0.168
Look+0.156
Wow+0.154
 Many+0.152
Neg logit contributions
 away-0.034
 for-0.031
ier-0.028
 enough-0.026
 size-0.026
Token have
Token ID423
Feature activation+7.20e-02

Pos logit contributions
V+0.071
E+0.065
+0.064
Look+0.063
 While+0.063
Neg logit contributions
ier-0.055
 enough-0.049
 size-0.048
 perfectly-0.047
 away-0.046
Token fun
Token ID1257
Feature activation+5.87e-02

Pos logit contributions
V+0.157
E+0.152
Look+0.141
Mom+0.138
O+0.138
Neg logit contributions
 enough-0.053
 away-0.047
 for-0.046
 coins-0.045
 luck-0.045
Token.
Token ID13
Feature activation+1.56e-01

Pos logit contributions
V+0.143
E+0.137
Look+0.130
 Many+0.127
+0.126
Neg logit contributions
ier-0.028
 size-0.026
 enough-0.026
 away-0.026
 for-0.024
Token
Token ID198
Feature activation+2.27e-01

Pos logit contributions
Ta+0.281
E+0.270
P+0.249
O+0.245
J+0.243
Neg logit contributions
 for-0.228
 and-0.212
.-0.207
,-0.200
 anymore-0.184
Token
Token ID198
Feature activation+1.35e-01

Pos logit contributions
 Deal+0.388
Grab+0.388
Ta+0.387
Really+0.350
Gu+0.347
Neg logit contributions
.-0.349
 for-0.320
 and-0.314
 in-0.307
 on-0.292
TokenOne
Token ID3198
Feature activation+9.27e-02

Pos logit contributions
 Deal+0.228
Ta+0.219
 Here+0.200
 Glass+0.191
omm+0.188
Neg logit contributions
 for-0.171
 would-0.167
 on-0.160
 in-0.158
 made-0.156
Token day
Token ID1110
Feature activation+1.07e-01

Pos logit contributions
E+0.232
V+0.232
Look+0.216
Wow+0.203
 Deal+0.203
Neg logit contributions
 for-0.057
 in-0.043
 left-0.040
 away-0.038
 size-0.034
Token,
Token ID11
Feature activation+9.99e-02

Pos logit contributions
V+0.328
E+0.322
Look+0.303
Ta+0.301
Kids+0.297
Neg logit contributions
,-0.000
 for+0.000
 in+0.003
 enough+0.005
 as+0.007
Token Spot
Token ID15899
Feature activation+5.07e-02

Pos logit contributions
V+0.185
E+0.179
Ta+0.166
Kids+0.165
 Deal+0.164
Neg logit contributions
 as-0.111
 for-0.105
 in-0.104
 enough-0.099
 on-0.095
Token Lily
Token ID20037
Feature activation+4.18e-02

Pos logit contributions
Phill+0.231
Phil+0.228
Josh+0.225
Dan+0.218
Rick+0.218
Neg logit contributions
 for-0.148
.-0.143
 as-0.137
,-0.136
 and-0.134
Token was
Token ID373
Feature activation+6.20e-02

Pos logit contributions
V+0.076
E+0.071
+0.068
Look+0.066
 While+0.065
Neg logit contributions
ier-0.048
 enough-0.044
 away-0.043
 size-0.043
 luck-0.043
Token very
Token ID845
Feature activation+6.85e-02

Pos logit contributions
V+0.142
E+0.137
Look+0.129
Wow+0.126
O+0.125
Neg logit contributions
 enough-0.037
ier-0.036
 size-0.036
 away-0.035
 for-0.035
Token happy
Token ID3772
Feature activation+2.45e-03

Pos logit contributions
V+0.146
E+0.136
Look+0.132
Wow+0.128
O+0.125
Neg logit contributions
ier-0.054
 for-0.053
 size-0.052
 away-0.051
 perfectly-0.049
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
+0.004
 Now+0.003
L+0.003
 While+0.003
M+0.003
Neg logit contributions
ier-0.004
iful-0.003
ist-0.003
ience-0.003
rier-0.003
Token
Token ID198
Feature activation+2.27e-01

Pos logit contributions
Ta+0.315
Ar+0.315
Quick+0.313
AAAA+0.310
Isa+0.302
Neg logit contributions
.-0.191
,-0.178
 and-0.177
 to-0.164
 for-0.150
Token
Token ID198
Feature activation+1.23e-01

Pos logit contributions
 Deal+0.434
Grab+0.431
Ta+0.414
Uh+0.393
Really+0.389
Neg logit contributions
.-0.316
 and-0.284
 for-0.268
 to-0.265
 in-0.265
TokenL
Token ID43
Feature activation+1.04e-01

Pos logit contributions
 Deal+0.187
 Ow+0.168
omm+0.158
 Glass+0.150
 Acc+0.149
Neg logit contributions
 in-0.167
 made-0.163
 and-0.162
 when-0.160
 on-0.151
Tokenily
Token ID813
Feature activation+3.00e-02

Pos logit contributions
V+0.187
Look+0.173
Br+0.171
E+0.168
Wow+0.167
Neg logit contributions
ier-0.143
 for-0.126
 enough-0.117
ies-0.113
 in-0.112
Token wanted
Token ID2227
Feature activation+2.98e-02

Pos logit contributions
V+0.053
+0.049
E+0.048
Look+0.046
 While+0.046
Neg logit contributions
ier-0.038
 luck-0.033
 perfectly-0.033
 size-0.033
 away-0.032
Token to
Token ID284
Feature activation+4.50e-02

Pos logit contributions
V+0.058
+0.057
E+0.056
Look+0.053
 Many+0.053
Neg logit contributions
ier-0.029
ist-0.026
 perfectly-0.026
iness-0.026
 wouldn-0.025
Token and
Token ID290
Feature activation+5.33e-02

Pos logit contributions
+0.008
 While+0.007
E+0.007
V+0.007
 Now+0.007
Neg logit contributions
 possible-0.006
ier-0.006
 happen-0.006
eking-0.006
 wouldn-0.006
Token c
Token ID269
Feature activation+7.99e-02

Pos logit contributions
V+0.102
E+0.096
Look+0.090
+0.089
 Many+0.088
Neg logit contributions
ier-0.053
 size-0.050
 enough-0.050
 away-0.050
 perfectly-0.048
Tokenudd
Token ID4185
Feature activation+6.83e-02

Pos logit contributions
V+0.163
Look+0.149
E+0.147
Wow+0.143
 Many+0.142
Neg logit contributions
ier-0.081
 wouldn-0.068
 perfectly-0.068
ies-0.067
 away-0.067
Tokenly
Token ID306
Feature activation-7.30e-02

Pos logit contributions
V+0.188
E+0.176
Look+0.174
 Many+0.170
+0.170
Neg logit contributions
ier-0.018
 away-0.013
 enough-0.010
 for-0.009
 size-0.008
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
castle+0.150
isher+0.149
ist+0.145
ener+0.143
 airplane+0.143
Neg logit contributions
-0.087
 And-0.059
 They-0.055
 While-0.054
 One-0.052
Token
Token ID198
Feature activation+2.27e-01

Pos logit contributions
Ta+0.248
V+0.220
Ah+0.194
Luc+0.194
Click+0.194
Neg logit contributions
 for-0.192
,-0.172
.-0.171
 and-0.170
 because-0.168
Token
Token ID198
Feature activation+1.32e-01

Pos logit contributions
Ta+0.457
 Deal+0.431
Grab+0.421
Whe+0.368
Really+0.368
Neg logit contributions
.-0.303
 and-0.271
 for-0.248
,-0.247
 in-0.246
TokenOne
Token ID3198
Feature activation+1.01e-01

Pos logit contributions
 Deal+0.209
Ta+0.208
 Here+0.172
 Boom+0.169
 Int+0.168
Neg logit contributions
 for-0.162
 and-0.154
 would-0.146
!-0.145
 in-0.145
Token day
Token ID1110
Feature activation+9.97e-02

Pos logit contributions
V+0.274
E+0.269
Look+0.256
 Deal+0.244
 Int+0.243
Neg logit contributions
 for-0.048
 in-0.028
 left-0.027
 away-0.020
ies-0.017
Token,
Token ID11
Feature activation+9.33e-02

Pos logit contributions
V+0.320
E+0.306
Look+0.288
Ta+0.287
Kids+0.285
Neg logit contributions
,-0.000
 for+0.004
 as+0.008
 in+0.008
 on+0.013
Token Tim
Token ID5045
Feature activation+7.80e-02

Pos logit contributions
V+0.164
E+0.142
Ta+0.135
 Many+0.132
Kids+0.131
Neg logit contributions
 as-0.127
 for-0.124
 in-0.114
 away-0.112
 enough-0.109
Token ran
Token ID4966
Feature activation+6.51e-02

Pos logit contributions
V+0.005
 While+0.004
+0.004
E+0.004
Look+0.004
Neg logit contributions
ier-0.006
work-0.005
iest-0.005
 size-0.005
 luck-0.005
Token to
Token ID284
Feature activation+6.02e-02

Pos logit contributions
V+0.170
E+0.165
Look+0.157
 Many+0.155
Wow+0.152
Neg logit contributions
 away-0.024
 enough-0.018
 for-0.017
ier-0.016
 size-0.014
Token her
Token ID607
Feature activation+6.00e-02

Pos logit contributions
V+0.119
E+0.113
Look+0.105
Wow+0.103
O+0.102
Neg logit contributions
 away-0.056
 size-0.053
 enough-0.053
ier-0.052
 for-0.051
Token mom
Token ID1995
Feature activation+4.33e-02

Pos logit contributions
V+0.113
E+0.108
Look+0.100
 Many+0.098
Wow+0.098
Neg logit contributions
 away-0.063
 size-0.062
 enough-0.061
ier-0.060
 for-0.058
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
V+0.071
E+0.066
+0.066
Look+0.061
 While+0.061
Neg logit contributions
ier-0.055
 size-0.052
 perfectly-0.050
 enough-0.050
 luck-0.050
Token
Token ID198
Feature activation+2.27e-01

Pos logit contributions
V+0.211
Ta+0.203
E+0.196
Whe+0.180
Three+0.175
Neg logit contributions
,-0.224
 to-0.223
 and-0.214
.-0.206
 as-0.196
Token
Token ID198
Feature activation+1.05e-01

Pos logit contributions
Really+0.452
Ta+0.450
Whe+0.449
Grab+0.448
 Deal+0.426
Neg logit contributions
.-0.292
 and-0.279
 to-0.274
,-0.264
 for-0.231
Token"
Token ID1
Feature activation+1.00e-01

Pos logit contributions
 Ow+0.161
V+0.159
Wow+0.150
 Some+0.148
E+0.146
Neg logit contributions
 away-0.126
 in-0.118
 bit-0.109
 left-0.106
 and-0.104
TokenMom
Token ID29252
Feature activation+1.86e-02

Pos logit contributions
 Many+0.087
 Soon+0.086
V+0.084
 Later+0.083
 From+0.081
Neg logit contributions
 away-0.188
 ring-0.183
 bit-0.182
 veil-0.175
 airplane-0.175
Tokenmy
Token ID1820
Feature activation-8.95e-04

Pos logit contributions
V+0.029
+0.029
E+0.026
Mom+0.025
Look+0.025
Neg logit contributions
ier-0.024
 wand-0.023
 coins-0.023
 perfectly-0.022
 powers-0.022
Token,
Token ID11
Feature activation+4.04e-02

Pos logit contributions
ier+0.001
 size+0.001
 perfectly+0.001
 properly+0.001
 coins+0.001
Neg logit contributions
-0.002
V-0.001
-0.001
 The-0.001
 Now-0.001
Token went
Token ID1816
Feature activation+5.44e-02

Pos logit contributions
V+0.079
E+0.074
Look+0.070
+0.070
 While+0.068
Neg logit contributions
ier-0.044
 size-0.041
 enough-0.041
 away-0.039
 perfectly-0.039
Token inside
Token ID2641
Feature activation+4.88e-02

Pos logit contributions
V+0.131
E+0.125
Look+0.119
+0.118
 Many+0.117
Neg logit contributions
ier-0.027
 size-0.025
 away-0.024
 enough-0.024
 perfectly-0.022
Token the
Token ID262
Feature activation+7.52e-02

Pos logit contributions
V+0.106
E+0.101
+0.098
Look+0.095
 Many+0.094
Neg logit contributions
ier-0.036
 size-0.033
 enough-0.032
 away-0.032
 perfectly-0.031
Token house
Token ID2156
Feature activation-2.03e-02

Pos logit contributions
V+0.185
E+0.177
O+0.164
Wow+0.161
 Many+0.161
Neg logit contributions
 away-0.044
 size-0.038
 for-0.037
 enough-0.034
 airplane-0.032
Token.
Token ID13
Feature activation+1.75e-01

Pos logit contributions
ier+0.033
ience+0.032
 luck+0.031
ache+0.031
iness+0.031
Neg logit contributions
-0.033
 The-0.024
 While-0.024
 On-0.023
 -0.023
Token
Token ID198
Feature activation+2.26e-01

Pos logit contributions
Mic+0.348
E+0.319
Red+0.303
Mad+0.300
Twe+0.298
Neg logit contributions
 to-0.245
.-0.235
,-0.235
 and-0.221
 in-0.211
Token
Token ID198
Feature activation+1.49e-01

Pos logit contributions
 Deal+0.417
Really+0.408
omm+0.389
Grab+0.384
Uh+0.377
Neg logit contributions
.-0.326
 and-0.321
 in-0.302
 to-0.293
,-0.286
TokenIn
Token ID818
Feature activation+5.47e-02

Pos logit contributions
 Deal+0.186
omm+0.167
innie+0.164
ollo+0.161
Mic+0.160
Neg logit contributions
 made-0.246
 in-0.235
 like-0.229
 ever-0.218
 and-0.215
Token the
Token ID262
Feature activation+6.81e-02

Pos logit contributions
V+0.107
E+0.102
Look+0.095
+0.094
 Many+0.093
Neg logit contributions
ier-0.051
 size-0.049
 enough-0.049
 away-0.049
 perfectly-0.047
Token house
Token ID2156
Feature activation+2.02e-02

Pos logit contributions
V+0.147
E+0.143
O+0.128
Look+0.126
 Many+0.126
Neg logit contributions
 away-0.056
 size-0.053
 for-0.050
 enough-0.049
 falling-0.047
Token,
Token ID11
Feature activation+5.21e-02

Pos logit contributions
+0.038
V+0.036
 Many+0.034
 While+0.033
Look+0.033
Neg logit contributions
ier-0.021
 wand-0.020
 airplane-0.020
 powers-0.020
ache-0.019
Token and
Token ID290
Feature activation+5.86e-03

Pos logit contributions
 powers+0.018
 luck+0.018
ience+0.017
 secure+0.017
 perfectly+0.017
Neg logit contributions
-0.027
 While-0.023
V-0.023
L-0.023
Look-0.023
Token got
Token ID1392
Feature activation+3.96e-02

Pos logit contributions
+0.008
V+0.008
Look+0.008
 While+0.007
Wow+0.007
Neg logit contributions
ier-0.011
ist-0.010
eness-0.010
 luck-0.009
iness-0.009
Token very
Token ID845
Feature activation+5.49e-02

Pos logit contributions
V+0.078
E+0.073
Look+0.070
+0.069
 Many+0.067
Neg logit contributions
ier-0.041
 size-0.037
ist-0.035
 enough-0.034
 airplane-0.034
Token dirty
Token ID11841
Feature activation-8.80e-03

Pos logit contributions
V+0.114
E+0.108
Look+0.102
+0.101
 Many+0.100
Neg logit contributions
ier-0.046
 size-0.043
 away-0.043
 enough-0.043
 perfectly-0.040
Token.
Token ID13
Feature activation+1.53e-01

Pos logit contributions
 wand+0.012
ience+0.011
 powers+0.011
 crown+0.011
 transformed+0.011
Neg logit contributions
-0.013
E-0.011
L-0.010
V-0.010
Wow-0.010
Token
Token ID198
Feature activation+2.26e-01

Pos logit contributions
E+0.266
Gr+0.255
Ta+0.254
Pr+0.235
Beck+0.233
Neg logit contributions
 to-0.188
,-0.187
 and-0.186
.-0.173
 he-0.158
Token
Token ID198
Feature activation+1.22e-01

Pos logit contributions
 Deal+0.450
Ta+0.439
Really+0.422
Grab+0.421
Uh+0.399
Neg logit contributions
.-0.292
 and-0.279
 to-0.270
,-0.253
 for-0.245
TokenTom
Token ID13787
Feature activation+2.98e-02

Pos logit contributions
 Deal+0.175
E+0.151
 Ow+0.145
Ta+0.134
Isa+0.133
Neg logit contributions
!-0.175
 and-0.175
 made-0.174
 than-0.165
 left-0.164
Token said
Token ID531
Feature activation+3.71e-02

Pos logit contributions
V+0.049
+0.046
E+0.043
L+0.042
Look+0.042
Neg logit contributions
ier-0.042
 luck-0.038
 perfectly-0.037
 size-0.037
iness-0.035
Token,
Token ID11
Feature activation+4.41e-02

Pos logit contributions
V+0.078
+0.074
E+0.074
 Many+0.071
Look+0.070
Neg logit contributions
 size-0.029
ier-0.028
 perfectly-0.025
 airplane-0.025
ist-0.024
Token "
Token ID366
Feature activation+1.03e-01

Pos logit contributions
V+0.047
E+0.042
+0.040
Look+0.038
 Many+0.036
Neg logit contributions
ier-0.083
 size-0.081
 away-0.079
 enough-0.079
 perfectly-0.077
Token were
Token ID547
Feature activation+7.02e-02

Pos logit contributions
V+0.111
E+0.105
Look+0.099
+0.098
 Many+0.097
Neg logit contributions
ier-0.039
 size-0.036
 enough-0.036
 away-0.036
 perfectly-0.034
Token very
Token ID845
Feature activation+7.05e-02

Pos logit contributions
V+0.161
E+0.159
Look+0.148
O+0.146
 Many+0.145
Neg logit contributions
 enough-0.041
 for-0.040
 size-0.039
 away-0.038
 perfectly-0.035
Token good
Token ID922
Feature activation+2.97e-02

Pos logit contributions
V+0.145
E+0.138
Look+0.134
Wow+0.129
 While+0.128
Neg logit contributions
 for-0.056
 size-0.056
ier-0.055
 away-0.053
 perfectly-0.052
Token friends
Token ID2460
Feature activation+3.67e-02

Pos logit contributions
V+0.052
+0.050
E+0.047
 Many+0.047
 While+0.046
Neg logit contributions
 away-0.031
 perfectly-0.030
ier-0.030
 size-0.030
 wouldn-0.029
Token.
Token ID13
Feature activation+1.46e-01

Pos logit contributions
V+0.079
+0.077
E+0.075
Look+0.071
 While+0.070
Neg logit contributions
ier-0.026
 size-0.024
 coins-0.023
 away-0.022
 airplane-0.022
Token
Token ID198
Feature activation+2.26e-01

Pos logit contributions
E+0.220
Ta+0.195
V+0.189
Isa+0.184
Quick+0.182
Neg logit contributions
.-0.215
 and-0.213
 for-0.212
,-0.211
 like-0.196
Token
Token ID198
Feature activation+1.28e-01

Pos logit contributions
 Deal+0.398
Grab+0.393
Ta+0.380
Really+0.367
omm+0.361
Neg logit contributions
.-0.338
 and-0.307
 in-0.294
 for-0.286
 to-0.284
TokenOne
Token ID3198
Feature activation+7.43e-02

Pos logit contributions
 Deal+0.183
E+0.175
 Glass+0.171
V+0.171
Ta+0.167
Neg logit contributions
 for-0.171
 made-0.165
 would-0.165
 like-0.161
 than-0.160
Token day
Token ID1110
Feature activation+9.18e-02

Pos logit contributions
V+0.166
E+0.162
Look+0.152
Wow+0.144
 Many+0.141
Neg logit contributions
 for-0.055
 size-0.049
 away-0.049
 in-0.045
 enough-0.044
Token,
Token ID11
Feature activation+7.45e-02

Pos logit contributions
V+0.274
E+0.266
Look+0.249
Kids+0.241
 Many+0.240
Neg logit contributions
 for-0.009
 enough-0.005
 in-0.002
 as-0.000
 away-0.000
Token Sue
Token ID26089
Feature activation-3.00e-03

Pos logit contributions
V+0.108
E+0.100
Look+0.088
 Many+0.084
O+0.083
Neg logit contributions
 for-0.108
 enough-0.107
 away-0.106
 as-0.103
 size-0.102
Token and
Token ID290
Feature activation+2.08e-02

Pos logit contributions
 appearance+0.036
ier+0.035
 permission+0.035
eking+0.035
 reality+0.035
Neg logit contributions
-0.046
M-0.040
 While-0.037
E-0.035
L-0.035
Token felt
Token ID2936
Feature activation+4.71e-02

Pos logit contributions
V+0.035
+0.035
Look+0.034
E+0.034
 While+0.032
Neg logit contributions
ier-0.026
ist-0.024
 away-0.022
 wand-0.022
 luck-0.022
Token very
Token ID845
Feature activation+4.72e-02

Pos logit contributions
V+0.090
E+0.086
+0.081
Look+0.080
 Many+0.079
Neg logit contributions
ier-0.046
 size-0.045
 enough-0.042
 away-0.042
 perfectly-0.041
Token happy
Token ID3772
Feature activation-6.81e-03

Pos logit contributions
V+0.085
E+0.081
+0.076
Look+0.075
 Many+0.074
Neg logit contributions
ier-0.050
 size-0.048
 enough-0.048
 away-0.047
 perfectly-0.045
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
ener+0.010
ist+0.010
ience+0.010
 wouldn+0.010
et+0.010
Neg logit contributions
-0.011
 Now-0.007
 While-0.007
M-0.007
 The-0.007
Token
Token ID198
Feature activation+2.26e-01

Pos logit contributions
Uh+0.302
Quick+0.301
Ar+0.298
AAAA+0.292
Spe+0.292
Neg logit contributions
.-0.200
 and-0.186
 to-0.183
,-0.180
 for-0.175
Token
Token ID198
Feature activation+1.16e-01

Pos logit contributions
Grab+0.435
 Deal+0.432
Ta+0.408
Uh+0.400
AAAA+0.386
Neg logit contributions
.-0.310
 and-0.278
 for-0.268
 in-0.266
 to-0.255
TokenBut
Token ID1537
Feature activation+3.40e-02

Pos logit contributions
 Deal+0.161
 Ow+0.150
ucks+0.139
 Glass+0.135
 Here+0.131
Neg logit contributions
 when-0.162
 in-0.162
 made-0.151
 even-0.148
 and-0.145
Token then
Token ID788
Feature activation+4.55e-02

Pos logit contributions
V+0.065
E+0.061
+0.060
Look+0.059
 While+0.059
Neg logit contributions
ier-0.036
 size-0.030
iness-0.029
 luck-0.029
 powers-0.029
Token,
Token ID11
Feature activation+3.88e-02

Pos logit contributions
V+0.102
E+0.097
+0.093
Look+0.092
 Many+0.090
Neg logit contributions
ier-0.031
 size-0.028
 away-0.027
 perfectly-0.026
 enough-0.025
Token her
Token ID607
Feature activation+2.53e-02

Pos logit contributions
V+0.053
E+0.049
+0.047
Look+0.045
 While+0.044
Neg logit contributions
ier-0.063
 size-0.059
ist-0.056
 away-0.056
et-0.055
Token in
Token ID287
Feature activation+9.42e-02

Pos logit contributions
+0.064
V+0.064
E+0.060
 While+0.059
Look+0.058
Neg logit contributions
ier-0.026
 luck-0.021
ener-0.021
 wouldn-0.021
 enough-0.021
Token the
Token ID262
Feature activation+9.29e-02

Pos logit contributions
V+0.240
E+0.239
Finally+0.222
Pe+0.222
Br+0.221
Neg logit contributions
 away-0.034
 in-0.031
 for-0.029
.-0.028
!-0.027
Token warm
Token ID5814
Feature activation+6.15e-02

Pos logit contributions
E+0.201
V+0.201
 Many+0.179
 While+0.174
 Deal+0.173
Neg logit contributions
 away-0.072
 for-0.058
 left-0.057
 back-0.056
 luck-0.055
Token sun
Token ID4252
Feature activation+7.18e-03

Pos logit contributions
V+0.119
E+0.113
Look+0.104
 Many+0.102
+0.100
Neg logit contributions
 away-0.059
 size-0.059
ier-0.058
 enough-0.057
 for-0.055
Token.
Token ID13
Feature activation+1.54e-01

Pos logit contributions
+0.014
V+0.012
Look+0.011
 While+0.011
Mom+0.011
Neg logit contributions
ier-0.008
 luck-0.007
 powers-0.007
 coins-0.007
ener-0.007
Token
Token ID198
Feature activation+2.25e-01

Pos logit contributions
Ta+0.250
E+0.244
Pe+0.230
Phill+0.221
O+0.218
Neg logit contributions
 for-0.205
,-0.205
.-0.204
 and-0.201
 but-0.177
Token
Token ID198
Feature activation+1.29e-01

Pos logit contributions
 Deal+0.425
Ta+0.416
Grab+0.413
Really+0.369
Uh+0.359
Neg logit contributions
.-0.329
 and-0.293
 in-0.279
 to-0.274
,-0.273
TokenOne
Token ID3198
Feature activation+9.35e-02

Pos logit contributions
 Deal+0.215
Ta+0.203
 Ow+0.185
ucks+0.184
E+0.183
Neg logit contributions
 would-0.157
 for-0.146
 made-0.145
 and-0.143
 game-0.141
Token day
Token ID1110
Feature activation+1.02e-01

Pos logit contributions
V+0.237
E+0.234
Look+0.218
 Deal+0.207
Mic+0.206
Neg logit contributions
 for-0.055
 in-0.043
 left-0.040
 away-0.038
 size-0.035
Token,
Token ID11
Feature activation+8.35e-02

Pos logit contributions
V+0.322
E+0.318
Look+0.295
Kids+0.291
Ta+0.290
Neg logit contributions
,-0.000
 for+0.001
 in+0.007
 as+0.010
 enough+0.013
Token Max
Token ID5436
Feature activation+3.97e-02

Pos logit contributions
V+0.128
E+0.123
Kids+0.108
Look+0.108
Pe+0.102
Neg logit contributions
 for-0.117
 as-0.116
 in-0.111
 enough-0.107
 away-0.107
Token when
Token ID618
Feature activation+3.87e-02

Pos logit contributions
V+0.097
E+0.092
+0.090
Look+0.087
 Many+0.086
Neg logit contributions
ier-0.028
 size-0.025
 away-0.025
 enough-0.025
 wouldn-0.024
Token she
Token ID673
Feature activation+4.28e-02

Pos logit contributions
V+0.066
+0.062
E+0.062
Look+0.059
 Many+0.056
Neg logit contributions
ier-0.049
 away-0.045
 size-0.044
ist-0.044
 enough-0.043
Token j
Token ID474
Feature activation+7.14e-02

Pos logit contributions
V+0.079
E+0.072
+0.070
Look+0.068
 Many+0.066
Neg logit contributions
ier-0.048
 size-0.046
 enough-0.044
 away-0.044
 perfectly-0.043
Tokenogged
Token ID42545
Feature activation+1.65e-02

Pos logit contributions
V+0.110
E+0.103
Look+0.098
Wow+0.091
+0.088
Neg logit contributions
ier-0.101
 away-0.099
 enough-0.099
 for-0.095
 luck-0.094
Token.
Token ID13
Feature activation+1.70e-01

Pos logit contributions
+0.033
V+0.031
 While+0.029
L+0.028
Look+0.028
Neg logit contributions
ier-0.017
ener-0.017
 size-0.016
 inches-0.016
iness-0.016
Token
Token ID198
Feature activation+2.25e-01

Pos logit contributions
Uh+0.316
E+0.314
Quick+0.301
Ta+0.293
roc+0.286
Neg logit contributions
 for-0.210
 and-0.204
.-0.195
 to-0.193
,-0.189
Token
Token ID198
Feature activation+1.29e-01

Pos logit contributions
 Deal+0.411
Grab+0.397
Ta+0.378
Really+0.376
Uh+0.368
Neg logit contributions
.-0.331
 and-0.319
 in-0.298
 for-0.295
 to-0.289
TokenWhile
Token ID3633
Feature activation+8.40e-02

Pos logit contributions
 Deal+0.194
ucks+0.177
 Here+0.165
ipt+0.164
 Ow+0.159
Neg logit contributions
 made-0.176
 in-0.171
 when-0.169
 and-0.168
 would-0.167
Token j
Token ID474
Feature activation+7.79e-02

Pos logit contributions
E+0.194
V+0.190
Look+0.173
Wow+0.170
 Deal+0.168
Neg logit contributions
 for-0.053
 in-0.047
 enough-0.046
 falling-0.046
 on-0.045
Tokenogging
Token ID30853
Feature activation+2.40e-02

Pos logit contributions
V+0.129
E+0.123
Look+0.119
Wow+0.111
Mom+0.107
Neg logit contributions
ier-0.098
 for-0.098
 away-0.097
 enough-0.097
 luck-0.091
Token,
Token ID11
Feature activation+6.76e-02

Pos logit contributions
V+0.047
+0.046
 While+0.044
 Many+0.043
E+0.042
Neg logit contributions
 size-0.023
ier-0.022
 inches-0.020
 airplane-0.020
work-0.020
Token Sue
Token ID26089
Feature activation+2.27e-02

Pos logit contributions
V+0.246
E+0.236
Ba+0.227
Mic+0.220
Isa+0.215
Neg logit contributions
,-0.139
 for-0.136
 in-0.130
.-0.129
 and-0.128
Token looked
Token ID3114
Feature activation+3.50e-02

Pos logit contributions
+0.035
V+0.035
E+0.032
Look+0.031
 While+0.030
Neg logit contributions
ier-0.035
 luck-0.031
 size-0.031
iness-0.030
 enough-0.030
Token sad
Token ID6507
Feature activation-4.79e-03

Pos logit contributions
V+0.066
+0.065
E+0.064
Look+0.062
 While+0.062
Neg logit contributions
ier-0.035
 size-0.034
 enough-0.028
 powers-0.028
 perfectly-0.028
Token.
Token ID13
Feature activation+1.35e-01

Pos logit contributions
ils+0.006
et+0.006
 powers+0.006
cards+0.006
 trace+0.006
Neg logit contributions
-0.008
 They-0.006
 She-0.006
 So-0.006
 The-0.006
Token
Token ID220
Feature activation+1.85e-01

Pos logit contributions
V+0.230
E+0.220
Ba+0.206
Ta+0.205
Isa+0.195
Neg logit contributions
,-0.193
.-0.179
 and-0.178
 to-0.176
 in-0.175
Token
Token ID198
Feature activation+2.25e-01

Pos logit contributions
Ta+0.278
V+0.271
Grab+0.251
Quick+0.249
E+0.248
Neg logit contributions
.-0.306
 for-0.277
 in-0.275
 and-0.268
 on-0.260
Token
Token ID198
Feature activation+1.14e-01

Pos logit contributions
Ta+0.417
 Deal+0.397
Grab+0.390
Uh+0.387
Really+0.385
Neg logit contributions
.-0.327
 and-0.311
,-0.308
!-0.272
 to-0.264
TokenTim
Token ID14967
Feature activation+3.08e-02

Pos logit contributions
 Deal+0.135
V+0.133
E+0.113
Isa+0.113
 Ow+0.109
Neg logit contributions
 in-0.192
!-0.188
,-0.186
 and-0.184
 than-0.181
Token asked
Token ID1965
Feature activation+7.63e-02

Pos logit contributions
V+0.049
+0.047
E+0.043
Look+0.043
 While+0.042
Neg logit contributions
ier-0.045
 size-0.040
 perfectly-0.040
 luck-0.040
iness-0.038
Token,
Token ID11
Feature activation+6.90e-02

Pos logit contributions
V+0.125
E+0.117
Look+0.102
 While+0.098
 Many+0.097
Neg logit contributions
 for-0.110
 enough-0.106
 away-0.104
ier-0.098
 luck-0.098
Token "
Token ID366
Feature activation+1.11e-01

Pos logit contributions
V+0.097
E+0.091
Look+0.080
 Many+0.080
 While+0.077
Neg logit contributions
 for-0.101
 enough-0.099
ier-0.099
 away-0.098
 size-0.096
Token little
Token ID1310
Feature activation-2.83e-02

Pos logit contributions
+0.033
V+0.030
 Blood+0.027
 Many+0.027
Mom+0.026
Neg logit contributions
ier-0.033
 enough-0.030
 away-0.028
ist-0.028
ening-0.027
Token dog
Token ID3290
Feature activation-3.42e-03

Pos logit contributions
ier+0.054
 happening+0.053
ist+0.049
 wouldn+0.049
 happen+0.049
Neg logit contributions
-0.032
V-0.032
 Many-0.031
Mom-0.028
Dad-0.027
Token,
Token ID11
Feature activation-1.10e-02

Pos logit contributions
ier+0.005
iness+0.004
 luck+0.004
 size+0.004
rier+0.004
Neg logit contributions
-0.006
V-0.006
 The-0.006
 While-0.005
 Many-0.005
Token Max
Token ID5436
Feature activation+1.22e-02

Pos logit contributions
ier+0.025
iness+0.021
ering+0.021
ening+0.021
ener+0.021
Neg logit contributions
-0.015
 The-0.012
V-0.011
 Many-0.011
 Whe-0.011
Token.
Token ID13
Feature activation+1.40e-01

Pos logit contributions
+0.024
V+0.022
E+0.020
L+0.020
 The+0.019
Neg logit contributions
ier-0.014
 luck-0.013
iness-0.012
 size-0.012
 inches-0.012
Token
Token ID198
Feature activation+2.25e-01

Pos logit contributions
E+0.192
Ta+0.191
Quick+0.170
Phill+0.168
Isa+0.166
Neg logit contributions
 and-0.200
.-0.198
 for-0.194
,-0.194
 too-0.174
Token
Token ID198
Feature activation+1.21e-01

Pos logit contributions
 Deal+0.410
Ta+0.390
Grab+0.390
Really+0.367
Uh+0.351
Neg logit contributions
.-0.327
 and-0.300
 in-0.282
 to-0.279
 for-0.277
TokenOne
Token ID3198
Feature activation+8.14e-02

Pos logit contributions
 Deal+0.182
Ta+0.158
 Ow+0.158
E+0.157
Brother+0.155
Neg logit contributions
 for-0.162
 would-0.152
 in-0.151
 and-0.151
 on-0.150
Token day
Token ID1110
Feature activation+9.46e-02

Pos logit contributions
V+0.194
E+0.191
Look+0.181
Wow+0.170
O+0.166
Neg logit contributions
 for-0.053
 in-0.042
 size-0.041
 away-0.040
 left-0.039
Token,
Token ID11
Feature activation+7.35e-02

Pos logit contributions
V+0.289
E+0.281
Look+0.263
Kids+0.260
Ta+0.255
Neg logit contributions
 for-0.006
 enough-0.000
,-0.000
 in+0.000
 as+0.003
Token Sue
Token ID26089
Feature activation+5.46e-03

Pos logit contributions
V+0.104
E+0.096
Look+0.085
Kids+0.081
 Many+0.079
Neg logit contributions
 for-0.109
 enough-0.109
 away-0.107
 as-0.104
 size-0.102
Token it
Token ID340
Feature activation+3.40e-02

Pos logit contributions
V+0.101
E+0.096
Look+0.090
+0.088
 Many+0.087
Neg logit contributions
ier-0.055
 size-0.053
 enough-0.052
 away-0.052
 perfectly-0.050
Token to
Token ID284
Feature activation+6.99e-02

Pos logit contributions
V+0.068
+0.067
E+0.062
 Many+0.061
 While+0.061
Neg logit contributions
ier-0.031
 size-0.028
 perfectly-0.025
ist-0.025
 wouldn-0.025
Token his
Token ID465
Feature activation+3.64e-02

Pos logit contributions
V+0.148
E+0.144
Look+0.133
Wow+0.130
O+0.128
Neg logit contributions
 away-0.055
 for-0.055
 size-0.052
 enough-0.050
ier-0.049
Token mom
Token ID1995
Feature activation+4.75e-03

Pos logit contributions
V+0.061
+0.057
E+0.055
Look+0.053
Mom+0.053
Neg logit contributions
ier-0.047
 wouldn-0.041
 perfectly-0.040
 enough-0.040
ist-0.040
Token.
Token ID13
Feature activation+1.47e-01

Pos logit contributions
+0.009
 The+0.007
V+0.007
 +0.007
 One+0.007
Neg logit contributions
ier-0.006
 powers-0.006
 perfectly-0.006
 size-0.006
 wand-0.006
Token
Token ID198
Feature activation+2.25e-01

Pos logit contributions
Ta+0.269
V+0.258
Ba+0.255
 Deal+0.252
E+0.251
Neg logit contributions
,-0.223
 to-0.220
.-0.211
 and-0.208
 with-0.175
Token
Token ID198
Feature activation+1.19e-01

Pos logit contributions
 Deal+0.410
Ta+0.406
Really+0.398
Grab+0.396
Uh+0.382
Neg logit contributions
.-0.335
 and-0.307
 to-0.293
 in-0.286
,-0.284
TokenTim
Token ID14967
Feature activation+3.05e-02

Pos logit contributions
 Deal+0.168
V+0.137
Ta+0.130
Isa+0.126
omm+0.126
Neg logit contributions
 made-0.199
 in-0.191
 bit-0.184
 and-0.182
.-0.181
Token said
Token ID531
Feature activation+4.33e-02

Pos logit contributions
V+0.051
+0.048
E+0.045
 While+0.044
Look+0.044
Neg logit contributions
ier-0.042
 luck-0.037
 size-0.037
 perfectly-0.037
iness-0.036
Token,
Token ID11
Feature activation+5.96e-02

Pos logit contributions
V+0.096
E+0.091
+0.089
 Many+0.087
Look+0.087
Neg logit contributions
ier-0.028
 size-0.028
 away-0.025
 perfectly-0.025
 enough-0.025
Token "
Token ID366
Feature activation+1.22e-01

Pos logit contributions
V+0.077
E+0.072
Look+0.064
 Many+0.061
+0.060
Neg logit contributions
ier-0.094
 enough-0.092
 away-0.092
 size-0.092
 for-0.091
Token ran
Token ID4966
Feature activation+7.22e-02

Pos logit contributions
V+0.081
E+0.075
+0.071
Look+0.071
O+0.069
Neg logit contributions
ier-0.045
 size-0.043
 enough-0.041
 away-0.041
 luck-0.040
Token to
Token ID284
Feature activation+7.72e-02

Pos logit contributions
V+0.182
E+0.179
Look+0.167
 Many+0.165
O+0.163
Neg logit contributions
 away-0.034
 for-0.028
 enough-0.026
 perfectly-0.023
ier-0.022
Token the
Token ID262
Feature activation+5.81e-02

Pos logit contributions
E+0.173
V+0.171
Wow+0.156
O+0.154
Kids+0.154
Neg logit contributions
 away-0.058
 for-0.053
 ring-0.047
 enough-0.047
 as-0.045
Token tree
Token ID5509
Feature activation+1.25e-02

Pos logit contributions
V+0.122
E+0.116
Look+0.109
+0.107
 Many+0.107
Neg logit contributions
ier-0.046
 away-0.044
 size-0.044
 enough-0.044
 wouldn-0.041
Token.
Token ID13
Feature activation+1.59e-01

Pos logit contributions
+0.024
 While+0.022
V+0.022
 On+0.021
 Later+0.020
Neg logit contributions
ier-0.014
 luck-0.012
iness-0.012
 perfectly-0.012
 wouldn-0.011
Token
Token ID198
Feature activation+2.25e-01

Pos logit contributions
E+0.273
Uh+0.260
Ta+0.257
Isa+0.248
Ar+0.246
Neg logit contributions
 to-0.234
 and-0.215
,-0.210
.-0.202
 for-0.195
Token
Token ID198
Feature activation+1.18e-01

Pos logit contributions
 Deal+0.426
Really+0.426
Grab+0.414
Ta+0.406
Uh+0.402
Neg logit contributions
.-0.306
 and-0.300
 to-0.278
,-0.269
 for-0.260
Token"
Token ID1
Feature activation+1.33e-01

Pos logit contributions
 Deal+0.156
 Ow+0.141
ucks+0.137
 Next+0.136
 Some+0.131
Neg logit contributions
 in-0.161
 when-0.153
 made-0.151
 and-0.150
 set-0.150
TokenHi
Token ID17250
Feature activation+7.86e-02

Pos logit contributions
 From+0.155
azes+0.148
 Deal+0.148
 Their+0.146
innie+0.145
Neg logit contributions
 set-0.202
 made-0.192
 bit-0.179
 spread-0.176
 kept-0.173
Token,
Token ID11
Feature activation+7.73e-02

Pos logit contributions
V+0.198
E+0.190
Look+0.185
 Many+0.180
 While+0.178
Neg logit contributions
 enough-0.036
 for-0.029
 away-0.028
 luck-0.024
 airplane-0.024
Token Tree
Token ID12200
Feature activation-6.01e-03

Pos logit contributions
V+0.150
E+0.141
Look+0.136
 While+0.131
Br+0.127
Neg logit contributions
 for-0.076
 enough-0.071
 airplane-0.067
 ring-0.067
 away-0.067
Token so
Token ID523
Feature activation-2.90e-03

Pos logit contributions
ener+0.115
sters+0.109
iness+0.106
ome+0.104
lers+0.104
Neg logit contributions
-0.094
V-0.075
Mom-0.074
 -0.068
 The-0.068
Token loud
Token ID7812
Feature activation-9.41e-02

Pos logit contributions
ener+0.005
ier+0.005
 away+0.004
et+0.004
 airplane+0.004
Neg logit contributions
-0.004
E-0.004
V-0.004
 Many-0.004
L-0.003
Token with
Token ID351
Feature activation-6.96e-02

Pos logit contributions
 permission+0.154
 collection+0.152
 size+0.146
 footing+0.144
 crown+0.144
Neg logit contributions
-0.146
 Now-0.139
 It-0.122
V-0.113
 But-0.110
Token all
Token ID477
Feature activation-4.47e-02

Pos logit contributions
ier+0.137
castle+0.132
ener+0.130
comb+0.126
ache+0.125
Neg logit contributions
-0.084
E-0.074
V-0.072
Look-0.068
 Many-0.068
Token the
Token ID262
Feature activation-3.17e-02

Pos logit contributions
ache+0.088
ener+0.086
ier+0.085
comb+0.085
 wouldn+0.079
Neg logit contributions
-0.054
 The-0.044
 Now-0.043
 Everyone-0.041
 -0.040
Token furniture
Token ID13091
Feature activation-2.28e-01

Pos logit contributions
ier+0.055
castle+0.054
 able+0.052
ache+0.052
owing+0.051
Neg logit contributions
V-0.041
-0.040
E-0.039
L-0.034
 Many-0.034
Token talking
Token ID3375
Feature activation-1.42e-01

Pos logit contributions
urger+0.320
ener+0.313
rier+0.307
 luck+0.306
ishing+0.302
Neg logit contributions
-0.291
 It-0.222
 They-0.214
 She-0.211
 The-0.200
Token,
Token ID11
Feature activation-9.28e-02

Pos logit contributions
eking+0.205
eling+0.204
ener+0.198
cker+0.195
 size+0.192
Neg logit contributions
-0.225
 It-0.191
 Everyone-0.189
 They-0.177
 She-0.174
Token it
Token ID340
Feature activation-1.09e-01

Pos logit contributions
ier+0.170
ils+0.159
ener+0.154
lers+0.150
 problems+0.149
Neg logit contributions
-0.147
 It-0.132
 Now-0.122
 The-0.116
It-0.115
Token was
Token ID373
Feature activation-7.08e-02

Pos logit contributions
 permission+0.245
ier+0.229
 coins+0.227
rier+0.225
 dollars+0.222
Neg logit contributions
-0.106
V-0.079
L-0.078
It-0.073
M-0.066
Token hard
Token ID1327
Feature activation-1.09e-01

Pos logit contributions
ener+0.134
sters+0.132
ly+0.127
ome+0.126
iness+0.125
Neg logit contributions
-0.099
V-0.076
 The-0.073
 In-0.071
 When-0.069
Token one
Token ID530
Feature activation-1.10e-01

Pos logit contributions
ier+0.157
ache+0.142
ets+0.138
 wouldn+0.138
 collection+0.136
Neg logit contributions
-0.093
 The-0.073
E-0.065
 -0.064
 It-0.063
Token of
Token ID286
Feature activation-5.28e-02

Pos logit contributions
ils+0.200
bags+0.185
ier+0.178
ps+0.172
 bothering+0.171
Neg logit contributions
-0.147
Mom-0.107
<|endoftext|>-0.106
 Mom-0.097
M-0.095
Token the
Token ID262
Feature activation-1.40e-01

Pos logit contributions
ier+0.107
ache+0.102
 wouldn+0.101
 compared+0.098
ener+0.096
Neg logit contributions
-0.067
Look-0.052
V-0.050
 -0.049
E-0.048
Token p
Token ID279
Feature activation-2.47e-02

Pos logit contributions
 able+0.215
ier+0.214
 wouldn+0.200
ils+0.195
iness+0.194
Neg logit contributions
-0.190
Mom-0.178
Look-0.175
E-0.169
V-0.158
Tokenudd
Token ID4185
Feature activation-4.32e-02

Pos logit contributions
 happen+0.049
 battle+0.047
 monsters+0.047
 bothering+0.046
 shield+0.046
Neg logit contributions
-0.019
E-0.018
Jen-0.018
 Later-0.017
V-0.017
Tokenles
Token ID829
Feature activation-2.27e-01

Pos logit contributions
 lair+0.071
 prisoner+0.071
boys+0.070
 size+0.070
 hunt+0.070
Neg logit contributions
E-0.042
 Now-0.042
-0.036
O-0.035
Kids-0.035
Token and
Token ID290
Feature activation-1.14e-01

Pos logit contributions
 property+0.343
 choices+0.343
 luck+0.343
 appearance+0.341
 thieves+0.337
Neg logit contributions
-0.329
 -0.231
 The-0.200
 She-0.196
 It-0.183
Token watched
Token ID7342
Feature activation-6.98e-02

Pos logit contributions
ier+0.211
iness+0.194
 anymore+0.187
owing+0.181
eness+0.175
Neg logit contributions
-0.153
Look-0.130
 -0.130
 The-0.129
 Then-0.126
Token as
Token ID355
Feature activation-6.92e-02

Pos logit contributions
 property+0.123
 stolen+0.117
ache+0.117
 stash+0.115
 size+0.114
Neg logit contributions
-0.100
 The-0.087
 Everyone-0.079
 It-0.077
 Mom-0.068
Token the
Token ID262
Feature activation-1.03e-01

Pos logit contributions
ode+0.139
et+0.137
ets+0.137
ils+0.135
lers+0.135
Neg logit contributions
-0.082
V-0.063
 The-0.062
E-0.058
 Mom-0.056
Token rain
Token ID6290
Feature activation-1.09e-01

Pos logit contributions
 able+0.169
ier+0.162
 enough+0.154
 wouldn+0.152
 anywhere+0.150
Neg logit contributions
E-0.131
V-0.130
Mom-0.129
Look-0.123
-0.122
Token could
Token ID714
Feature activation-2.23e-02

Pos logit contributions
 recipe+0.200
 amount+0.199
rier+0.199
 size+0.199
 dollars+0.196
Neg logit contributions
-0.062
 could-0.033
 It-0.032
 They-0.030
 -0.030
Token hide
Token ID7808
Feature activation-8.62e-02

Pos logit contributions
 dollars+0.037
ier+0.036
 compared+0.034
ening+0.034
ener+0.034
Neg logit contributions
-0.034
E-0.026
V-0.026
Mom-0.026
 Many-0.026
Token in
Token ID287
Feature activation-6.49e-02

Pos logit contributions
 wedding+0.130
ache+0.130
 ceremony+0.130
 mustache+0.129
 recipe+0.124
Neg logit contributions
-0.111
E-0.102
L-0.091
Tim-0.083
 The-0.080
Token the
Token ID262
Feature activation-1.31e-02

Pos logit contributions
ache+0.154
ier+0.154
 wouldn+0.143
ly+0.138
ener+0.138
Neg logit contributions
-0.073
 The-0.044
E-0.043
 Sometimes-0.041
-0.039
Token little
Token ID1310
Feature activation-5.57e-02

Pos logit contributions
ier+0.027
ist+0.026
iness+0.025
 wouldn+0.024
 able+0.024
Neg logit contributions
Look-0.015
-0.015
-0.013
Wow-0.013
O-0.012
Token cracks
Token ID23217
Feature activation-2.27e-01

Pos logit contributions
ier+0.110
 wouldn+0.108
ps+0.104
iness+0.100
cher+0.098
Neg logit contributions
-0.061
-0.051
Wow-0.050
 The-0.048
Look-0.046
Token and
Token ID290
Feature activation-9.80e-02

Pos logit contributions
 groom+0.388
ache+0.381
urger+0.380
 appearance+0.379
ener+0.378
Neg logit contributions
-0.242
 The-0.142
 They-0.136
The-0.122
 It-0.112
Token cre
Token ID1126
Feature activation+4.04e-02

Pos logit contributions
ier+0.184
iness+0.158
eness+0.146
et+0.144
ler+0.144
Neg logit contributions
-0.132
What-0.100
Look-0.100
 -0.100
 The-0.098
Tokenv
Token ID85
Feature activation-1.73e-02

Pos logit contributions
V+0.060
E+0.057
Look+0.052
+0.050
 Many+0.049
Neg logit contributions
 wouldn-0.054
 size-0.054
ier-0.053
 enough-0.053
 for-0.053
Tokenices
Token ID1063
Feature activation-2.04e-01

Pos logit contributions
 ring+0.029
 ringing+0.029
 airplane+0.029
 bothering+0.029
 compared+0.028
Neg logit contributions
-0.020
Jen-0.017
M-0.017
E-0.017
ipt-0.016
Token of
Token ID286
Feature activation-5.66e-02

Pos logit contributions
 crown+0.320
 forth+0.318
 veil+0.315
 pearl+0.312
 breaths+0.312
Neg logit contributions
-0.277
 They-0.182
 -0.169
 It-0.164
 The-0.162
Token Spr
Token ID5522
Feature activation+1.32e-02

Pos logit contributions
V+0.063
E+0.054
Look+0.048
O+0.043
Wow+0.042
Neg logit contributions
 enough-0.139
ier-0.138
 size-0.135
 perfectly-0.135
 away-0.134
Tokeninkle
Token ID19894
Feature activation-7.88e-02

Pos logit contributions
V+0.017
+0.016
E+0.016
+0.015
O+0.015
Neg logit contributions
 size-0.020
 away-0.019
 enough-0.019
 airplane-0.018
 forever-0.018
Token it
Token ID340
Feature activation-1.29e-01

Pos logit contributions
 lair+0.131
epad+0.127
 destination+0.123
 fault+0.120
castle+0.120
Neg logit contributions
-0.118
E-0.114
O-0.111
M-0.108
 The-0.107
Token on
Token ID319
Feature activation-3.68e-02

Pos logit contributions
ault+0.239
 permission+0.234
rier+0.232
reating+0.231
 dollars+0.230
Neg logit contributions
-0.163
-0.113
 It-0.106
 -0.104
 Next-0.102
Token any
Token ID597
Feature activation-7.62e-02

Pos logit contributions
ier+0.075
rying+0.071
 able+0.068
owing+0.068
eking+0.066
Neg logit contributions
-0.048
E-0.043
O-0.039
 The-0.038
L-0.037
Token area
Token ID1989
Feature activation-2.25e-01

Pos logit contributions
ier+0.119
 wouldn+0.118
bags+0.116
bows+0.113
ried+0.110
Neg logit contributions
-0.088
Mom-0.076
 Many-0.071
Ah-0.069
E-0.069
Token and
Token ID290
Feature activation-9.70e-02

Pos logit contributions
ache+0.334
ault+0.320
urger+0.312
 prisoner+0.310
 tast+0.307
Neg logit contributions
-0.328
 It-0.255
 The-0.243
 She-0.239
 -0.223
Token you
Token ID345
Feature activation-6.90e-02

Pos logit contributions
ier+0.179
owing+0.172
iness+0.167
 affect+0.162
ening+0.162
Neg logit contributions
-0.121
L-0.097
Wow-0.097
V-0.096
Look-0.096
Token will
Token ID481
Feature activation-6.97e-02

Pos logit contributions
rier+0.108
ier+0.107
ering+0.103
 falling+0.101
 attention+0.100
Neg logit contributions
-0.103
V-0.086
E-0.085
 On-0.082
 In-0.080
Token be
Token ID307
Feature activation-6.49e-02

Pos logit contributions
led+0.137
ier+0.136
ishing+0.132
lers+0.130
 dollars+0.130
Neg logit contributions
-0.098
 One-0.072
 -0.065
 The-0.065
 On-0.064
Token granted
Token ID7520
Feature activation-7.44e-02

Pos logit contributions
iness+0.124
ly+0.115
bars+0.112
cases+0.110
ware+0.107
Neg logit contributions
-0.085
 -0.062
 The-0.059
M-0.057
The-0.052
Token for
Token ID329
Feature activation-5.33e-02

Pos logit contributions
ener+0.005
ier+0.005
 wouldn+0.005
rier+0.005
ache+0.005
Neg logit contributions
-0.004
 Now-0.004
V-0.004
Look-0.004
 The-0.004
Token getting
Token ID1972
Feature activation-7.55e-02

Pos logit contributions
ener+0.100
ist+0.099
iness+0.098
ier+0.097
 wouldn+0.095
Neg logit contributions
-0.075
E-0.057
 Now-0.057
Tom-0.056
Tim-0.056
Token the
Token ID262
Feature activation-1.13e-01

Pos logit contributions
comb+0.133
itive+0.131
ist+0.130
bee+0.129
iness+0.128
Neg logit contributions
-0.099
 late-0.083
Mom-0.077
-0.077
E-0.070
Token wild
Token ID4295
Feature activation-1.35e-01

Pos logit contributions
ils+0.174
es+0.168
owed+0.160
 imagined+0.159
ier+0.158
Neg logit contributions
-0.132
Look-0.125
-0.113
E-0.110
Mom-0.107
Token new
Token ID649
Feature activation-1.34e-01

Pos logit contributions
cher+0.222
eking+0.218
ier+0.215
 able+0.214
 rewarded+0.206
Neg logit contributions
-0.172
-0.146
 The-0.127
 Sometimes-0.118
 Today-0.117
Token carpet
Token ID20710
Feature activation-2.25e-01

Pos logit contributions
ier+0.236
eking+0.224
ils+0.211
led+0.206
 able+0.200
Neg logit contributions
-0.171
-0.128
Look-0.126
Tim-0.117
 Whe-0.116
Token.
Token ID13
Feature activation+5.63e-02

Pos logit contributions
 appearance+0.373
ening+0.372
 worries+0.367
 hesitation+0.367
eking+0.365
Neg logit contributions
-0.259
 She-0.148
 The-0.139
-0.132
 They-0.126
Token<|endoftext|>
Token ID50256
Feature activation+1.09e-01

Pos logit contributions
V+0.031
E+0.025
Look+0.019
+0.017
 Many+0.016
Neg logit contributions
ier-0.131
 size-0.129
 enough-0.128
 away-0.128
 perfectly-0.126
TokenOnce
Token ID7454
Feature activation+9.13e-02

Pos logit contributions
V+0.082
E+0.078
 Many+0.063
Br+0.057
 While+0.057
Neg logit contributions
 in-0.222
 on-0.220
 away-0.219
!-0.214
 for-0.213
Token there
Token ID612
Feature activation+4.86e-02

Pos logit contributions
V+0.182
E+0.170
Look+0.161
O+0.152
 Many+0.152
Neg logit contributions
 for-0.098
 enough-0.090
ier-0.089
 away-0.088
ies-0.084
Token was
Token ID373
Feature activation+8.49e-02

Pos logit contributions
V+0.092
E+0.088
+0.084
Look+0.082
 Many+0.080
Neg logit contributions
ier-0.049
 size-0.046
 enough-0.045
 away-0.045
 perfectly-0.043
Token in
Token ID287
Feature activation-7.38e-02

Pos logit contributions
eking+0.257
rying+0.252
 size+0.248
 footing+0.248
pan+0.246
Neg logit contributions
-0.220
 The-0.190
 Everyone-0.182
 From-0.167
 It-0.164
Token its
Token ID663
Feature activation-3.60e-02

Pos logit contributions
drops+0.134
ier+0.132
ache+0.130
ies+0.127
led+0.126
Neg logit contributions
-0.119
 The-0.088
 -0.082
 Now-0.081
 It-0.080
Token ears
Token ID11368
Feature activation-1.91e-01

Pos logit contributions
ier+0.074
ache+0.065
iness+0.060
 kernels+0.059
fish+0.059
Neg logit contributions
-0.045
 The-0.039
 -0.036
 From-0.035
 One-0.034
Token,
Token ID11
Feature activation-9.21e-02

Pos logit contributions
ils+0.330
ishes+0.309
 wounds+0.300
 thieves+0.300
isher+0.297
Neg logit contributions
-0.288
 -0.216
 The-0.206
 It-0.201
 She-0.199
Token the
Token ID262
Feature activation-2.22e-02

Pos logit contributions
ier+0.192
ils+0.183
ache+0.182
et+0.176
le+0.175
Neg logit contributions
-0.116
 The-0.089
 It-0.084
 -0.083
 Everyone-0.074
Token switch
Token ID5078
Feature activation-2.24e-01

Pos logit contributions
ier+0.037
 enough+0.036
 able+0.035
i+0.035
ils+0.034
Neg logit contributions
V-0.031
-0.031
Look-0.030
Wow-0.028
Mom-0.028
Token knows
Token ID4206
Feature activation-1.22e-01

Pos logit contributions
rying+0.338
 thieves+0.338
urger+0.337
 pounds+0.332
 oats+0.332
Neg logit contributions
-0.302
 It-0.227
 The-0.224
 -0.218
 She-0.206
Token that
Token ID326
Feature activation-9.97e-02

Pos logit contributions
ener+0.213
drops+0.208
ited+0.205
 breaths+0.197
 aroma+0.195
Neg logit contributions
-0.195
 It-0.126
 The-0.123
 -0.121
 From-0.120
Token it
Token ID340
Feature activation-8.98e-02

Pos logit contributions
ener+0.179
ier+0.179
ache+0.175
drops+0.171
ly+0.170
Neg logit contributions
-0.141
 -0.101
 It-0.096
 Next-0.094
 The-0.093
Token will
Token ID481
Feature activation-6.68e-02

Pos logit contributions
rier+0.174
 unatt+0.168
 injuries+0.166
 grip+0.165
 footing+0.163
Neg logit contributions
-0.109
-0.074
 Next-0.071
 It-0.070
 From-0.069
Token never
Token ID1239
Feature activation-7.05e-02

Pos logit contributions
lers+0.135
ishes+0.119
led+0.119
 dollars+0.119
ener+0.117
Neg logit contributions
-0.097
One-0.071
The-0.066
Even-0.066
In-0.065
Token all
Token ID477
Feature activation-9.62e-02

Pos logit contributions
rying+0.213
owing+0.212
ier+0.208
ist+0.207
ener+0.206
Neg logit contributions
-0.110
Tom-0.091
 Tim-0.086
Tim-0.083
 Tom-0.083
Token of
Token ID286
Feature activation-3.99e-02

Pos logit contributions
comb+0.202
 wand+0.185
ache+0.184
ppers+0.180
king+0.179
Neg logit contributions
-0.094
 The-0.085
 Everyone-0.073
 They-0.070
 It-0.070
Token the
Token ID262
Feature activation-5.63e-02

Pos logit contributions
ener+0.079
ier+0.078
 wouldn+0.075
ache+0.074
drops+0.072
Neg logit contributions
-0.047
E-0.042
Tim-0.037
 Many-0.037
Tom-0.034
Token family
Token ID1641
Feature activation-1.48e-01

Pos logit contributions
ier+0.103
owed+0.103
 able+0.102
ring+0.102
comb+0.099
Neg logit contributions
Look-0.059
E-0.058
Mom-0.054
Wow-0.053
-0.052
Token's
Token ID338
Feature activation-9.79e-02

Pos logit contributions
rier+0.277
eking+0.261
 potion+0.253
 breaths+0.251
 former+0.250
Neg logit contributions
-0.187
 The-0.114
 -0.112
 Now-0.107
-0.099
Token possessions
Token ID23309
Feature activation-2.24e-01

Pos logit contributions
ier+0.172
ist+0.148
cher+0.146
pan+0.145
comb+0.143
Neg logit contributions
-0.127
<|endoftext|>-0.093
 Many-0.092
 Now-0.089
 From-0.087
Token had
Token ID550
Feature activation-1.29e-01

Pos logit contributions
ener+0.438
urger+0.407
ishing+0.406
rying+0.405
enery+0.400
Neg logit contributions
-0.284
 It-0.172
 -0.161
 The-0.152
 They-0.151
Token been
Token ID587
Feature activation-1.00e-01

Pos logit contributions
 forth+0.246
rier+0.236
ener+0.233
ish+0.217
ci+0.213
Neg logit contributions
-0.166
In-0.104
 In-0.102
Tom-0.101
A-0.100
Token taken
Token ID2077
Feature activation-9.26e-02

Pos logit contributions
comb+0.146
als+0.141
iness+0.140
 forth+0.136
flies+0.136
Neg logit contributions
-0.142
-0.097
L-0.097
A-0.094
J-0.088
Token away
Token ID1497
Feature activation-1.54e-01

Pos logit contributions
castle+0.164
 veil+0.146
 airplane+0.143
 salon+0.139
 crown+0.137
Neg logit contributions
-0.150
 In-0.111
 -0.109
 The-0.108
 It-0.104
Token,
Token ID11
Feature activation-1.16e-01

Pos logit contributions
 airplane+0.288
 veil+0.287
 comb+0.287
 crown+0.284
 cape+0.283
Neg logit contributions
-0.190
 They-0.123
 The-0.105
 He-0.091
-0.086
Token given
Token ID1813
Feature activation-5.01e-02

Pos logit contributions
rier+0.184
 forth+0.178
ishly+0.177
ener+0.176
work+0.175
Neg logit contributions
-0.137
A-0.103
-0.090
 The-0.085
There-0.082
Token you
Token ID345
Feature activation-1.28e-01

Pos logit contributions
castle+0.087
pan+0.085
ier+0.085
iness+0.083
 size+0.082
Neg logit contributions
-0.068
E-0.061
V-0.057
O-0.052
Look-0.051
Token.
Token ID13
Feature activation+7.04e-02

Pos logit contributions
ier+0.252
drops+0.240
castle+0.232
 forth+0.232
iness+0.229
Neg logit contributions
-0.158
 The-0.123
A-0.103
-0.099
The-0.098
Token Feel
Token ID18571
Feature activation-7.66e-02

Pos logit contributions
V+0.067
E+0.059
Look+0.053
O+0.048
Wow+0.047
Neg logit contributions
 enough-0.131
ier-0.130
 away-0.127
 size-0.126
 perfectly-0.126
Token the
Token ID262
Feature activation-7.63e-02

Pos logit contributions
castle+0.139
pit+0.121
work+0.118
icking+0.115
ies+0.115
Neg logit contributions
-0.125
 -0.097
 The-0.089
V-0.084
 Now-0.083
Token air
Token ID1633
Feature activation-2.23e-01

Pos logit contributions
led+0.125
ils+0.123
cases+0.121
rying+0.114
ier+0.109
Neg logit contributions
-0.086
Look-0.085
V-0.074
Mom-0.074
Ah-0.073
Token and
Token ID290
Feature activation-1.18e-01

Pos logit contributions
ils+0.267
 coins+0.260
 artifacts+0.257
 jewels+0.240
 treasures+0.239
Neg logit contributions
-0.322
 She-0.251
 The-0.246
 -0.225
 It-0.224
Token let
Token ID1309
Feature activation-1.54e-02

Pos logit contributions
ier+0.202
rier+0.179
ener+0.175
ist+0.173
owing+0.167
Neg logit contributions
-0.168
 The-0.137
 Now-0.137
 It-0.131
 Next-0.126
Token it
Token ID340
Feature activation-1.09e-01

Pos logit contributions
 size+0.024
ier+0.024
work+0.023
 compared+0.023
 wand+0.022
Neg logit contributions
-0.023
V-0.023
E-0.022
O-0.020
L-0.019
Token take
Token ID1011
Feature activation-4.86e-02

Pos logit contributions
 compared+0.192
pled+0.187
 thieves+0.185
 dollars+0.183
ening+0.182
Neg logit contributions
-0.152
 It-0.103
 The-0.101
 Everyone-0.098
 -0.095
Token you
Token ID345
Feature activation-1.23e-01

Pos logit contributions
castle+0.087
ier+0.080
flies+0.077
rying+0.076
 size+0.076
Neg logit contributions
-0.075
E-0.068
V-0.062
 Now-0.061
O-0.060
Token they
Token ID484
Feature activation-6.59e-02

Pos logit contributions
ier+0.145
ishing+0.143
ache+0.142
eking+0.138
ener+0.136
Neg logit contributions
-0.072
 The-0.049
Sam-0.045
 -0.045
 It-0.045
Token could
Token ID714
Feature activation-4.98e-02

Pos logit contributions
 size+0.121
 decisions+0.119
 dollars+0.117
 inches+0.117
 fault+0.116
Neg logit contributions
-0.075
Wow-0.061
It-0.058
 -0.055
Look-0.055
Token now
Token ID783
Feature activation-8.79e-02

Pos logit contributions
 dollars+0.098
ier+0.097
ener+0.097
 pounds+0.097
 compared+0.096
Neg logit contributions
-0.071
Mom-0.050
Her-0.049
 The-0.047
The-0.047
Token use
Token ID779
Feature activation-1.20e-01

Pos logit contributions
 coins+0.159
ener+0.158
isher+0.156
cards+0.152
 pounds+0.151
Neg logit contributions
-0.126
 The-0.085
 It-0.083
 Everyone-0.080
 -0.079
Token the
Token ID262
Feature activation-1.18e-01

Pos logit contributions
ier+0.258
castle+0.245
ener+0.243
rier+0.235
rated+0.234
Neg logit contributions
-0.142
 The-0.101
 -0.100
 It-0.091
 Lily-0.074
Token glue
Token ID22749
Feature activation-2.23e-01

Pos logit contributions
ier+0.203
 able+0.189
ils+0.180
rying+0.172
 compared+0.172
Neg logit contributions
-0.139
Look-0.131
<-0.125
<|endoftext|>-0.122
Here-0.116
Token.
Token ID13
Feature activation+7.58e-02

Pos logit contributions
eling+0.346
eking+0.331
 centimeters+0.322
 inches+0.319
rier+0.318
Neg logit contributions
-0.274
 -0.176
 They-0.160
 The-0.142
 She-0.140
Token The
Token ID383
Feature activation-5.59e-02

Pos logit contributions
V+0.050
E+0.044
Look+0.031
O+0.030
Wow+0.026
Neg logit contributions
 enough-0.163
 for-0.161
 away-0.160
 wouldn-0.159
 size-0.158
Token man
Token ID582
Feature activation-1.13e-01

Pos logit contributions
ier+0.104
 able+0.099
rying+0.096
 enough+0.095
owing+0.094
Neg logit contributions
Mom-0.071
-0.066
V-0.065
Look-0.062
<|endoftext|>-0.059
Token was
Token ID373
Feature activation-3.79e-02

Pos logit contributions
 attention+0.240
rier+0.237
ier+0.229
 anywhere+0.226
 situations+0.226
Neg logit contributions
-0.140
 From-0.104
 The-0.094
 -0.083
The-0.075
Token happy
Token ID3772
Feature activation-4.96e-02

Pos logit contributions
ly+0.074
iness+0.072
ener+0.070
ies+0.070
lers+0.069
Neg logit contributions
-0.051
 -0.038
 The-0.036
V-0.036
Mom-0.034
Token the
Token ID262
Feature activation-6.22e-02

Pos logit contributions
iness+0.060
ly+0.052
eness+0.052
ies+0.051
led+0.050
Neg logit contributions
-0.050
 The-0.039
V-0.037
-0.037
E-0.036
Token fairy
Token ID25607
Feature activation-1.25e-01

Pos logit contributions
led+0.112
ened+0.109
ering+0.108
ening+0.107
ils+0.102
Neg logit contributions
Look-0.066
V-0.064
E-0.063
Mom-0.063
L-0.063
Token of
Token ID286
Feature activation-5.62e-02

Pos logit contributions
 levers+0.191
 castles+0.189
 heels+0.188
eking+0.186
 argued+0.186
Neg logit contributions
-0.182
 The-0.130
"-0.125
 And-0.122
-0.115
Token the
Token ID262
Feature activation-8.44e-02

Pos logit contributions
ier+0.125
 wouldn+0.124
cases+0.121
ist+0.115
drops+0.113
Neg logit contributions
-0.051
Look-0.042
E-0.041
 Many-0.040
V-0.039
Token magic
Token ID5536
Feature activation-1.25e-01

Pos logit contributions
ier+0.153
 able+0.141
 wouldn+0.139
iness+0.136
ering+0.135
Neg logit contributions
Look-0.091
-0.091
-0.085
Mom-0.084
L-0.081
Token wand
Token ID11569
Feature activation-2.23e-01

Pos logit contributions
 argued+0.241
eking+0.225
ering+0.222
cking+0.219
lling+0.217
Neg logit contributions
-0.132
-0.103
L-0.093
 Now-0.092
 She-0.083
Token.
Token ID13
Feature activation+5.70e-02

Pos logit contributions
 injuries+0.356
ened+0.352
 possessions+0.351
 coins+0.348
ishing+0.348
Neg logit contributions
-0.276
 She-0.179
-0.166
"-0.164
 And-0.156
Token You
Token ID921
Feature activation-3.10e-02

Pos logit contributions
V+0.060
E+0.054
Look+0.048
+0.046
 Many+0.045
Neg logit contributions
ier-0.103
 enough-0.101
 size-0.101
 away-0.101
 perfectly-0.099
Token can
Token ID460
Feature activation-1.36e-02

Pos logit contributions
rier+0.053
ier+0.053
 size+0.052
ering+0.052
ening+0.052
Neg logit contributions
-0.038
V-0.033
 Many-0.032
 It-0.030
 In-0.030
Token make
Token ID787
Feature activation-3.24e-02

Pos logit contributions
ier+0.022
iness+0.021
 compared+0.020
ening+0.020
 enough+0.020
Neg logit contributions
-0.020
V-0.019
E-0.018
Mom-0.017
 Many-0.017
Token anything
Token ID1997
Feature activation-1.12e-01

Pos logit contributions
iness+0.061
ier+0.061
 wouldn+0.057
ener+0.054
belt+0.053
Neg logit contributions
-0.041
E-0.040
V-0.037
 Many-0.032
 Everyone-0.029
Token had
Token ID550
Feature activation-9.43e-02

Pos logit contributions
ier+0.292
rier+0.282
stones+0.279
 pounds+0.268
vell+0.264
Neg logit contributions
-0.233
 -0.173
 They-0.171
 The-0.152
 Their-0.144
Token their
Token ID511
Feature activation-1.12e-01

Pos logit contributions
ener+0.198
comb+0.177
iness+0.176
ist+0.175
rier+0.173
Neg logit contributions
-0.123
 -0.086
 The-0.082
 Their-0.079
Tom-0.077
Token special
Token ID2041
Feature activation-1.48e-01

Pos logit contributions
ier+0.242
ist+0.191
 able+0.186
rying+0.181
lling+0.178
Neg logit contributions
-0.129
 Mom-0.109
ossible-0.101
Mom-0.101
 Their-0.100
Token pe
Token ID613
Feature activation-6.44e-02

Pos logit contributions
ier+0.289
lling+0.255
 able+0.245
eking+0.244
pping+0.231
Neg logit contributions
-0.185
<|endoftext|>-0.134
ossible-0.133
 Today-0.127
Look-0.124
Tokenb
Token ID65
Feature activation-7.08e-02

Pos logit contributions
 compared+0.105
storm+0.103
 party+0.103
 bothering+0.102
 hurting+0.101
Neg logit contributions
E-0.050
-0.050
Finally-0.048
Near-0.045
Jen-0.045
Tokenble
Token ID903
Feature activation-2.20e-01

Pos logit contributions
 bit+0.137
 compared+0.135
 bothering+0.131
 hurting+0.128
 ruining+0.126
Neg logit contributions
-0.067
Jen-0.047
M-0.044
E-0.042
Mom-0.042
Token which
Token ID543
Feature activation-1.28e-01

Pos logit contributions
 aroma+0.358
lling+0.345
 breaths+0.345
ening+0.342
 matter+0.333
Neg logit contributions
-0.273
 The-0.156
 And-0.150
-0.145
 They-0.143
Token united
Token ID16503
Feature activation-1.31e-01

Pos logit contributions
rying+0.247
eking+0.246
iously+0.246
ishing+0.244
king+0.242
Neg logit contributions
-0.162
 -0.119
-0.101
Look-0.101
 The-0.094
Token them
Token ID606
Feature activation-1.61e-01

Pos logit contributions
eking+0.233
ache+0.229
ider+0.222
 breaths+0.220
iously+0.217
Neg logit contributions
-0.194
 They-0.136
 -0.114
Look-0.111
 It-0.109
Token in
Token ID287
Feature activation-5.29e-02

Pos logit contributions
ener+0.260
 thieves+0.254
 aroma+0.249
 breaths+0.245
pan+0.241
Neg logit contributions
-0.228
 -0.136
 They-0.117
 The-0.115
<|endoftext|>-0.111
Token friendship
Token ID14738
Feature activation-1.43e-01

Pos logit contributions
ier+0.113
ache+0.110
ly+0.108
drops+0.107
 wouldn+0.106
Neg logit contributions
-0.074
Look-0.052
 -0.049
 The-0.047
E-0.046
Token something
Token ID1223
Feature activation-1.24e-01

Pos logit contributions
ier+0.173
iness+0.141
ious+0.137
 compared+0.135
ener+0.135
Neg logit contributions
-0.099
 Many-0.080
 The-0.077
 -0.072
 One-0.070
Token else
Token ID2073
Feature activation-1.58e-01

Pos logit contributions
ener+0.241
stick+0.231
ps+0.227
ring+0.224
ked+0.223
Neg logit contributions
-0.170
<|endoftext|>-0.105
-0.099
 -0.091
It-0.089
Token -
Token ID532
Feature activation-1.11e-01

Pos logit contributions
ier+0.258
eness+0.251
ps+0.251
ist+0.247
 breaths+0.246
Neg logit contributions
-0.249
 -0.165
-0.161
 They-0.141
 The-0.141
Token a
Token ID257
Feature activation-9.87e-02

Pos logit contributions
ier+0.256
ishing+0.238
ist+0.230
iness+0.230
et+0.229
Neg logit contributions
-0.144
 It-0.116
 The-0.106
 -0.097
 What-0.090
Token beautiful
Token ID4950
Feature activation-1.13e-01

Pos logit contributions
ier+0.188
led+0.174
ils+0.173
ps+0.172
iness+0.166
Neg logit contributions
-0.124
Look-0.122
E-0.117
V-0.116
O-0.111
Token wand
Token ID11569
Feature activation-2.20e-01

Pos logit contributions
ier+0.220
ils+0.204
ps+0.193
ried+0.191
cases+0.190
Neg logit contributions
-0.145
Look-0.131
Wow-0.112
 -0.111
 It-0.105
Token with
Token ID351
Feature activation-1.10e-01

Pos logit contributions
 choices+0.343
ishes+0.329
 decisions+0.327
 forth+0.326
 unnecessary+0.322
Neg logit contributions
-0.269
 It-0.186
 She-0.169
-0.159
It-0.147
Token a
Token ID257
Feature activation-1.09e-01

Pos logit contributions
ier+0.211
rying+0.209
castle+0.205
icking+0.202
ener+0.195
Neg logit contributions
-0.129
 Many-0.115
 It-0.113
Look-0.110
 The-0.104
Token star
Token ID3491
Feature activation-1.49e-01

Pos logit contributions
ils+0.187
ier+0.184
rying+0.182
iness+0.176
icking+0.175
Neg logit contributions
-0.152
Look-0.136
 -0.128
It-0.126
V-0.119
Token on
Token ID319
Feature activation-4.95e-02

Pos logit contributions
 permission+0.221
 punished+0.221
 bills+0.210
 jobs+0.209
 argued+0.206
Neg logit contributions
-0.177
 It-0.130
-0.117
It-0.116
"-0.105
Token the
Token ID262
Feature activation-7.74e-02

Pos logit contributions
ier+0.109
rying+0.107
icking+0.105
owing+0.105
cher+0.101
Neg logit contributions
-0.062
 It-0.050
 The-0.048
E-0.046
 One-0.044
Token his
Token ID465
Feature activation-1.15e-01

Pos logit contributions
ier+0.020
ist+0.017
work+0.017
pan+0.017
cher+0.016
Neg logit contributions
-0.017
 Next-0.016
E-0.015
 Many-0.015
Tim-0.014
Token dinner
Token ID8073
Feature activation-1.54e-01

Pos logit contributions
ier+0.247
lling+0.200
ist+0.188
iness+0.187
ils+0.184
Neg logit contributions
-0.155
<|endoftext|>-0.118
 Mom-0.112
 His-0.112
 -0.110
Token with
Token ID351
Feature activation-5.69e-02

Pos logit contributions
ener+0.251
ening+0.249
 breaths+0.246
ked+0.244
ened+0.241
Neg logit contributions
-0.249
M-0.153
 Now-0.153
 -0.152
-0.152
Token his
Token ID465
Feature activation-7.62e-02

Pos logit contributions
ier+0.133
ist+0.130
rying+0.122
ener+0.121
drops+0.120
Neg logit contributions
-0.061
 Mom-0.047
Mom-0.044
Look-0.042
E-0.041
Token cob
Token ID22843
Feature activation-5.74e-02

Pos logit contributions
ier+0.187
ist+0.158
lling+0.156
rying+0.155
en+0.148
Neg logit contributions
-0.085
 Mom-0.064
 Next-0.056
<|endoftext|>-0.055
Mom-0.054
Tokenweb
Token ID12384
Feature activation-2.20e-01

Pos logit contributions
 destination+0.091
 impossible+0.087
 apart+0.085
 leave+0.085
 compared+0.085
Neg logit contributions
-0.059
M-0.055
Jen-0.055
 Now-0.051
itt-0.050
Token.
Token ID13
Feature activation+7.00e-02

Pos logit contributions
 expert+0.348
lling+0.346
ier+0.337
ault+0.328
rier+0.324
Neg logit contributions
-0.259
 And-0.161
 The-0.160
 -0.158
 She-0.153
Token<|endoftext|>
Token ID50256
Feature activation+1.17e-01

Pos logit contributions
V+0.051
E+0.043
Look+0.035
O+0.032
Wow+0.031
Neg logit contributions
 enough-0.144
 size-0.142
 away-0.142
ier-0.141
 perfectly-0.141
TokenOnce
Token ID7454
Feature activation+9.19e-02

Pos logit contributions
V+0.099
E+0.094
 Many+0.078
Br+0.076
 While+0.076
Neg logit contributions
 in-0.227
 on-0.225
.-0.220
 away-0.218
!-0.218
Token upon
Token ID2402
Feature activation+8.05e-02

Pos logit contributions
V+0.184
E+0.171
Look+0.163
Wow+0.155
O+0.155
Neg logit contributions
 for-0.096
 enough-0.089
 away-0.088
ier-0.087
ies-0.084
Token a
Token ID257
Feature activation+1.08e-01

Pos logit contributions
V+0.173
E+0.163
Look+0.154
Kids+0.153
 While+0.153
Neg logit contributions
 enough-0.054
ier-0.049
 size-0.049
 away-0.049
 for-0.048
Token kind
Token ID1611
Feature activation-9.27e-02

Pos logit contributions
et+0.094
 powers+0.089
ode+0.088
eness+0.087
stone+0.087
Neg logit contributions
Mom-0.053
-0.052
E-0.050
 Their-0.047
L-0.047
Token and
Token ID290
Feature activation-8.13e-02

Pos logit contributions
eking+0.206
aches+0.191
eling+0.183
cking+0.182
 footing+0.177
Neg logit contributions
-0.111
 They-0.101
 She-0.091
 Now-0.080
 And-0.076
Token looked
Token ID3114
Feature activation-9.12e-02

Pos logit contributions
et+0.148
castle+0.142
ier+0.139
ener+0.139
le+0.138
Neg logit contributions
-0.103
 The-0.088
 From-0.084
 They-0.083
 Their-0.082
Token after
Token ID706
Feature activation-8.19e-02

Pos logit contributions
comb+0.198
castle+0.187
ode+0.183
eman+0.179
fer+0.178
Neg logit contributions
-0.101
 The-0.090
 When-0.083
 From-0.075
 They-0.073
Token her
Token ID607
Feature activation-9.25e-02

Pos logit contributions
 sword+0.165
castle+0.165
ode+0.164
 ship+0.163
ener+0.160
Neg logit contributions
-0.085
E-0.079
 The-0.065
 Everyone-0.060
 Lily-0.057
Token body
Token ID1767
Feature activation-2.20e-01

Pos logit contributions
castle+0.157
ier+0.153
boats+0.145
igator+0.139
boat+0.139
Neg logit contributions
-0.130
 And-0.099
 The-0.098
 In-0.089
 When-0.085
Token.
Token ID13
Feature activation+5.71e-02

Pos logit contributions
 explorer+0.385
 pearl+0.384
 adventurer+0.373
 thieves+0.358
ener+0.353
Neg logit contributions
-0.274
 They-0.189
 She-0.171
 -0.170
 The-0.168
Token They
Token ID1119
Feature activation-7.77e-02

Pos logit contributions
V+0.046
E+0.039
Look+0.033
+0.030
 Many+0.030
Neg logit contributions
ier-0.118
 enough-0.117
 away-0.116
 size-0.116
 perfectly-0.114
Token gave
Token ID2921
Feature activation-2.11e-02

Pos logit contributions
rier+0.171
ier+0.154
 decisions+0.149
 pounds+0.147
 powers+0.146
Neg logit contributions
-0.087
 -0.071
 From-0.059
 The-0.058
Mom-0.056
Token her
Token ID607
Feature activation-4.89e-02

Pos logit contributions
castle+0.046
ier+0.044
 ship+0.044
 size+0.042
fish+0.042
Neg logit contributions
E-0.023
-0.020
Look-0.017
V-0.017
 The-0.016
Token some
Token ID617
Feature activation-1.11e-01

Pos logit contributions
ier+0.116
eking+0.103
castle+0.102
ist+0.099
ering+0.099
Neg logit contributions
-0.050
 The-0.044
<|endoftext|>-0.034
E-0.033
M-0.032
Token day
Token ID1110
Feature activation-1.36e-01

Pos logit contributions
ier+0.202
ppers+0.181
ener+0.177
asy+0.174
urger+0.174
Neg logit contributions
-0.148
 -0.117
 The-0.103
 She-0.099
 It-0.099
Token and
Token ID290
Feature activation-1.07e-01

Pos logit contributions
isher+0.287
ener+0.282
ode+0.262
ishes+0.259
ologist+0.255
Neg logit contributions
-0.181
 The-0.124
 She-0.113
 -0.104
 It-0.101
Token the
Token ID262
Feature activation-1.36e-02

Pos logit contributions
ier+0.201
ler+0.181
 dollars+0.170
ache+0.169
ist+0.168
Neg logit contributions
-0.144
 The-0.114
 -0.112
 She-0.111
 From-0.110
Token way
Token ID835
Feature activation-8.76e-02

Pos logit contributions
ier+0.025
work+0.024
ist+0.023
fish+0.022
iness+0.021
Neg logit contributions
-0.017
Wow-0.016
V-0.016
Me-0.015
Look-0.015
Token her
Token ID607
Feature activation-6.21e-02

Pos logit contributions
urger+0.174
 scales+0.173
ache+0.173
 topp+0.172
ils+0.172
Neg logit contributions
-0.106
 She-0.088
 The-0.077
 -0.074
 It-0.068
Token body
Token ID1767
Feature activation-2.19e-01

Pos logit contributions
ier+0.130
cher+0.109
ist+0.108
ler+0.105
pan+0.103
Neg logit contributions
-0.085
 The-0.061
-0.059
 Next-0.059
 In-0.058
Token felt
Token ID2936
Feature activation-8.40e-02

Pos logit contributions
 expert+0.326
ener+0.324
 thieves+0.323
urger+0.321
 pearl+0.312
Neg logit contributions
-0.318
 -0.218
-0.199
 It-0.198
 She-0.197
Token with
Token ID351
Feature activation-6.93e-02

Pos logit contributions
sters+0.155
ener+0.155
cles+0.154
eman+0.153
castle+0.152
Neg logit contributions
-0.132
 The-0.096
 -0.090
The-0.085
 When-0.077
Token each
Token ID1123
Feature activation-1.43e-02

Pos logit contributions
castle+0.156
ier+0.150
ache+0.147
cher+0.145
ener+0.142
Neg logit contributions
-0.071
 -0.055
 The-0.050
 It-0.048
E-0.047
Token step
Token ID2239
Feature activation-1.58e-01

Pos logit contributions
castle+0.030
work+0.029
flies+0.028
ier+0.027
nuts+0.026
Neg logit contributions
-0.015
 -0.011
 Many-0.011
M-0.010
Me-0.010
Token.
Token ID13
Feature activation+9.77e-02

Pos logit contributions
 island+0.256
 size+0.253
castle+0.252
 lair+0.248
 enemy+0.241
Neg logit contributions
-0.205
 She-0.174
 The-0.151
 -0.146
 Her-0.144
Token the
Token ID262
Feature activation-1.45e-01

Pos logit contributions
itive+0.253
 echoing+0.231
ache+0.229
ault+0.223
comb+0.221
Neg logit contributions
-0.198
 It-0.140
 The-0.133
-0.133
Look-0.119
Token answers
Token ID7429
Feature activation-2.22e-01

Pos logit contributions
ier+0.253
iness+0.231
 able+0.228
owing+0.227
rying+0.215
Neg logit contributions
-0.170
Look-0.166
E-0.155
Mom-0.149
V-0.149
Token to
Token ID284
Feature activation-7.66e-02

Pos logit contributions
eking+0.330
 mess+0.320
 state+0.319
 island+0.310
 size+0.310
Neg logit contributions
-0.356
 It-0.274
 The-0.261
 -0.247
 She-0.246
Token your
Token ID534
Feature activation-1.05e-01

Pos logit contributions
ier+0.160
led+0.142
iness+0.139
lers+0.135
ener+0.133
Neg logit contributions
-0.103
 Many-0.081
 The-0.080
V-0.080
 Everyone-0.074
Token exam
Token ID2814
Feature activation-1.88e-01

Pos logit contributions
ier+0.222
iness+0.203
lling+0.189
led+0.186
eking+0.170
Neg logit contributions
-0.134
 Mom-0.114
 The-0.105
V-0.096
Mom-0.096
Token paper
Token ID3348
Feature activation-2.19e-01

Pos logit contributions
lling+0.271
 pearl+0.269
 wil+0.256
ening+0.253
rying+0.252
Neg logit contributions
-0.315
 -0.226
 Now-0.215
 She-0.210
 The-0.208
Token."
Token ID526
Feature activation+5.10e-02

Pos logit contributions
lling+0.321
eking+0.315
rying+0.309
eling+0.308
ier+0.297
Neg logit contributions
-0.326
 She-0.223
 -0.218
 They-0.216
 It-0.210
Token
Token ID198
Feature activation+1.29e-01

Pos logit contributions
V+0.022
E+0.017
+0.011
Look+0.011
 Many+0.009
Neg logit contributions
ier-0.127
 size-0.124
 enough-0.124
 away-0.123
 perfectly-0.122
Token
Token ID198
Feature activation+9.27e-02

Pos logit contributions
V+0.106
Wow+0.101
 While+0.095
Ta+0.090
Gu+0.090
Neg logit contributions
.-0.214
!-0.207
 and-0.203
 for-0.203
 away-0.199
TokenM
Token ID44
Feature activation+3.01e-02

Pos logit contributions
V+0.069
 Many+0.055
 While+0.055
E+0.054
Look+0.052
Neg logit contributions
 away-0.169
 size-0.161
 perfectly-0.157
 bit-0.155
 forever-0.153
Tokenia
Token ID544
Feature activation-5.84e-02

Pos logit contributions
E+0.027
V+0.026
+0.022
Mom+0.021
O+0.020
Neg logit contributions
 away-0.062
ier-0.058
 anywhere-0.058
 wouldn-0.058
 compared-0.057
Token therapy
Token ID9102
Feature activation-9.66e-02

Pos logit contributions
ier+0.071
ache+0.069
 able+0.065
eking+0.064
 unatt+0.063
Neg logit contributions
-0.043
Look-0.043
Wow-0.041
Mom-0.041
Kids-0.040
Token of
Token ID286
Feature activation-4.25e-03

Pos logit contributions
eking+0.219
ier+0.206
rying+0.198
iful+0.194
ist+0.193
Neg logit contributions
-0.104
L-0.092
 Lily-0.090
 The-0.086
V-0.080
Token being
Token ID852
Feature activation-7.16e-02

Pos logit contributions
ache+0.008
 wouldn+0.008
ier+0.008
drops+0.008
ener+0.007
Neg logit contributions
-0.005
Look-0.005
E-0.005
O-0.004
V-0.004
Token in
Token ID287
Feature activation-7.04e-02

Pos logit contributions
 decision+0.114
 appearance+0.109
 happen+0.108
 destination+0.106
iness+0.105
Neg logit contributions
-0.088
V-0.081
L-0.079
It-0.078
M-0.078
Token the
Token ID262
Feature activation-3.52e-02

Pos logit contributions
ier+0.150
ly+0.139
sters+0.137
ener+0.136
cases+0.133
Neg logit contributions
-0.094
 The-0.067
 Now-0.059
Once-0.059
Look-0.058
Token pool
Token ID5933
Feature activation-2.19e-01

Pos logit contributions
ier+0.069
 able+0.064
ache+0.063
ist+0.062
 wouldn+0.062
Neg logit contributions
Look-0.044
V-0.043
-0.042
Mom-0.040
-0.034
Token had
Token ID550
Feature activation-1.08e-01

Pos logit contributions
eking+0.332
 breaths+0.321
ps+0.307
ache+0.305
 attention+0.301
Neg logit contributions
-0.331
 The-0.251
 She-0.243
 Now-0.237
 -0.227
Token worked
Token ID3111
Feature activation-1.17e-01

Pos logit contributions
rier+0.206
ener+0.201
ish+0.180
 forth+0.179
 airplane+0.168
Neg logit contributions
-0.149
O-0.104
Tom-0.103
It-0.100
A-0.100
Token its
Token ID663
Feature activation-9.50e-02

Pos logit contributions
 groom+0.185
flies+0.185
ode+0.184
sters+0.184
ener+0.183
Neg logit contributions
-0.157
 -0.102
L-0.096
 Soon-0.089
E-0.084
Token magic
Token ID5536
Feature activation-1.48e-01

Pos logit contributions
ier+0.194
pan+0.170
ils+0.162
led+0.158
ener+0.156
Neg logit contributions
-0.107
 One-0.080
 Everywhere-0.075
 Sometimes-0.071
One-0.070
Token.
Token ID13
Feature activation+9.93e-02

Pos logit contributions
eling+0.205
ier+0.205
nam+0.203
 thieves+0.199
eking+0.195
Neg logit contributions
-0.222
 But-0.136
<|endoftext|>-0.131
 -0.131
 Everywhere-0.126
Token tried
Token ID3088
Feature activation-8.32e-02

Pos logit contributions
rier+0.191
pan+0.164
ier+0.161
 diamonds+0.152
iness+0.151
Neg logit contributions
-0.103
-0.082
I-0.073
 Now-0.070
 From-0.069
Token one
Token ID530
Feature activation-6.25e-02

Pos logit contributions
ener+0.163
mer+0.157
cher+0.156
isher+0.155
led+0.149
Neg logit contributions
-0.129
 -0.088
 She-0.074
 It-0.074
 The-0.070
Token of
Token ID286
Feature activation-4.66e-02

Pos logit contributions
ps+0.104
eking+0.097
ier+0.094
ping+0.093
bags+0.092
Neg logit contributions
-0.103
Sure-0.071
 But-0.070
M-0.066
 On-0.066
Token the
Token ID262
Feature activation-1.69e-02

Pos logit contributions
ier+0.097
castle+0.095
ener+0.093
ache+0.090
ly+0.090
Neg logit contributions
-0.053
V-0.039
E-0.037
 -0.035
 The-0.035
Token unknown
Token ID6439
Feature activation-7.65e-02

Pos logit contributions
flies+0.028
castle+0.027
fly+0.027
ier+0.026
 able+0.026
Neg logit contributions
V-0.024
-0.022
E-0.019
 Whe-0.019
Look-0.018
Token machines
Token ID8217
Feature activation-2.18e-01

Pos logit contributions
eking+0.131
nuts+0.120
castle+0.120
 able+0.118
 hatched+0.118
Neg logit contributions
-0.106
 But-0.078
-0.075
 Everyone-0.071
 It-0.071
Token,
Token ID11
Feature activation-1.05e-01

Pos logit contributions
urger+0.369
ils+0.348
ishes+0.344
ies+0.339
 jaws+0.337
Neg logit contributions
-0.294
 She-0.207
 -0.206
 It-0.177
 The-0.176
Token but
Token ID475
Feature activation-7.13e-02

Pos logit contributions
ier+0.199
ils+0.189
fish+0.178
led+0.172
le+0.171
Neg logit contributions
-0.136
 It-0.113
 The-0.111
 But-0.103
 -0.102
Token she
Token ID673
Feature activation-8.18e-02

Pos logit contributions
urger+0.141
ier+0.141
immers+0.136
et+0.134
ets+0.134
Neg logit contributions
-0.080
 -0.062
It-0.055
 She-0.052
Me-0.051
Token knew
Token ID2993
Feature activation-6.31e-02

Pos logit contributions
 attention+0.172
rier+0.167
 diamonds+0.157
ier+0.153
 rising+0.152
Neg logit contributions
-0.088
M-0.069
 From-0.062
Even-0.058
Well-0.055
Token it
Token ID340
Feature activation-8.75e-02

Pos logit contributions
mer+0.117
ener+0.116
 ship+0.116
castle+0.115
drops+0.112
Neg logit contributions
-0.084
It-0.055
 It-0.052
 She-0.052
 -0.051
Token disagreements
Token ID37990
Feature activation-1.86e-01

Pos logit contributions
ishly+0.130
bows+0.119
drops+0.114
flies+0.113
work+0.111
Neg logit contributions
-0.095
Mom-0.083
One-0.081
E-0.074
A-0.072
Token and
Token ID290
Feature activation-1.17e-01

Pos logit contributions
itive+0.276
ener+0.255
 wil+0.252
 crown+0.250
eking+0.249
Neg logit contributions
-0.301
-0.195
 -0.191
"-0.186
 They-0.182
Token show
Token ID905
Feature activation-9.12e-02

Pos logit contributions
ier+0.233
iness+0.218
ening+0.209
et+0.205
led+0.202
Neg logit contributions
-0.154
 -0.110
 But-0.108
Mom-0.100
 The-0.099
Token people
Token ID661
Feature activation-1.12e-01

Pos logit contributions
castle+0.158
 falling+0.152
 melting+0.150
comb+0.150
ier+0.148
Neg logit contributions
-0.122
 Everyone-0.113
E-0.092
 Now-0.090
 It-0.088
Token the
Token ID262
Feature activation-5.50e-02

Pos logit contributions
 size+0.180
sed+0.177
 breaths+0.172
 falling+0.169
sing+0.168
Neg logit contributions
-0.170
The-0.119
 Everyone-0.110
 -0.110
 The-0.109
Token path
Token ID3108
Feature activation-2.18e-01

Pos logit contributions
ishly+0.081
ier+0.080
owed+0.077
pan+0.076
owing+0.075
Neg logit contributions
-0.073
V-0.073
Look-0.068
E-0.064
King-0.061
Token of
Token ID286
Feature activation-7.95e-02

Pos logit contributions
itive+0.299
 breaths+0.299
ache+0.294
 veil+0.290
 footing+0.285
Neg logit contributions
-0.305
 The-0.240
 They-0.226
 Everyone-0.221
 It-0.220
Token kindness
Token ID23887
Feature activation-1.42e-01

Pos logit contributions
ier+0.165
drops+0.160
led+0.153
ener+0.151
 wouldn+0.151
Neg logit contributions
-0.094
E-0.065
 The-0.060
-0.058
 Many-0.056
Token and
Token ID290
Feature activation-1.23e-01

Pos logit contributions
eking+0.258
 falling+0.229
 wil+0.226
cking+0.226
sing+0.225
Neg logit contributions
-0.213
-0.153
 Everyone-0.136
 Now-0.135
 They-0.134
Token understanding
Token ID4547
Feature activation-9.24e-02

Pos logit contributions
ier+0.212
et+0.198
le+0.187
ist+0.186
ler+0.181
Neg logit contributions
-0.174
 Next-0.136
 -0.129
 Many-0.129
 Everyone-0.122
Token.
Token ID13
Feature activation+8.07e-02

Pos logit contributions
castle+0.155
comb+0.153
 airplane+0.143
 buried+0.142
storm+0.141
Neg logit contributions
-0.149
 Everyone-0.107
 The-0.106
E-0.099
-0.098
Token were
Token ID547
Feature activation-1.14e-01

Pos logit contributions
urger+0.268
 permission+0.263
 luck+0.263
 Majesty+0.262
 fault+0.260
Neg logit contributions
-0.134
It-0.092
 It-0.088
 The-0.085
 She-0.083
Token taking
Token ID2263
Feature activation-8.60e-02

Pos logit contributions
isher+0.203
ener+0.190
 elbow+0.188
ci+0.178
ome+0.170
Neg logit contributions
-0.163
 -0.101
 The-0.094
 One-0.091
 In-0.089
Token care
Token ID1337
Feature activation-9.71e-02

Pos logit contributions
castle+0.187
ier+0.158
pit+0.158
 salon+0.157
fish+0.148
Neg logit contributions
-0.119
 The-0.097
E-0.087
 It-0.086
 -0.084
Token of
Token ID286
Feature activation-7.92e-02

Pos logit contributions
 forth+0.241
ener+0.232
 thieves+0.231
castle+0.229
 argued+0.228
Neg logit contributions
-0.087
 The-0.046
 They-0.035
 It-0.033
 Everyone-0.027
Token the
Token ID262
Feature activation-3.86e-02

Pos logit contributions
ener+0.173
ier+0.169
drops+0.166
castle+0.162
aser+0.161
Neg logit contributions
-0.098
 The-0.061
 -0.057
 They-0.049
 It-0.048
Token air
Token ID1633
Feature activation-2.18e-01

Pos logit contributions
ier+0.066
castle+0.065
et+0.063
 able+0.063
 perfectly+0.061
Neg logit contributions
-0.053
 Next-0.046
-0.044
Look-0.042
Wow-0.040
Token.
Token ID13
Feature activation+7.59e-02

Pos logit contributions
isher+0.373
 prisoner+0.370
 prisoners+0.359
eking+0.358
ener+0.354
Neg logit contributions
-0.263
 They-0.194
 She-0.177
 The-0.170
 It-0.169
Token From
Token ID3574
Feature activation-2.32e-02

Pos logit contributions
V+0.072
E+0.062
O+0.051
Look+0.051
Wow+0.047
Neg logit contributions
 enough-0.144
 for-0.141
 away-0.138
 wouldn-0.138
 perfectly-0.137
Token that
Token ID326
Feature activation+1.49e-03

Pos logit contributions
ier+0.042
iness+0.042
epad+0.041
 affect+0.040
 size+0.039
Neg logit contributions
-0.030
Mom-0.024
A-0.023
 Now-0.022
 The-0.022
Token moment
Token ID2589
Feature activation-4.59e-03

Pos logit contributions
+0.002
V+0.001
 +0.001
 Now+0.001
Look+0.001
Neg logit contributions
ier-0.003
ist-0.003
ps-0.003
drops-0.003
 coins-0.003
Token,
Token ID11
Feature activation-2.84e-02

Pos logit contributions
 size+0.006
castle+0.006
flies+0.005
 island+0.005
work+0.005
Neg logit contributions
-0.008
V-0.007
 Now-0.007
M-0.007
Mom-0.007
Token and
Token ID290
Feature activation-3.66e-03

Pos logit contributions
castle+0.046
ier+0.045
 reality+0.045
iest+0.045
 happen+0.044
Neg logit contributions
-0.041
Mom-0.032
 The-0.032
 So-0.029
V-0.029
Token hide
Token ID7808
Feature activation+4.92e-03

Pos logit contributions
ier+0.007
iest+0.007
iness+0.006
 perfectly+0.006
iful+0.006
Neg logit contributions
-0.005
V-0.005
 The-0.004
 While-0.004
Look-0.004
Token behind
Token ID2157
Feature activation+6.26e-03

Pos logit contributions
E+0.009
+0.008
V+0.008
Mom+0.008
L+0.008
Neg logit contributions
 collection-0.005
castle-0.005
work-0.005
ier-0.005
 powers-0.005
Token her
Token ID607
Feature activation-1.59e-02

Pos logit contributions
+0.009
V+0.008
E+0.008
 The+0.008
Look+0.007
Neg logit contributions
ier-0.012
ist-0.011
 wouldn-0.011
iness-0.010
ener-0.010
Token.
Token ID13
Feature activation+1.26e-01

Pos logit contributions
ier+0.026
 wouldn+0.024
iest+0.023
 created+0.022
ist+0.022
Neg logit contributions
-0.025
 The-0.019
Mom-0.018
 Her-0.017
 -0.017
Token
Token ID198
Feature activation+1.73e-01

Pos logit contributions
Mic+0.165
innie+0.161
Ta+0.159
E+0.156
V+0.148
Neg logit contributions
 to-0.182
 in-0.176
.-0.170
 for-0.169
 and-0.167
Token
Token ID198
Feature activation+1.01e-01

Pos logit contributions
Ta+0.311
Whe+0.285
Mic+0.285
Grab+0.280
Uh+0.277
Neg logit contributions
.-0.215
 and-0.210
,-0.187
 to-0.181
 soon-0.172
Token"
Token ID1
Feature activation+9.01e-02

Pos logit contributions
 Ow+0.132
V+0.129
 Many+0.125
E+0.123
 Whe+0.121
Neg logit contributions
 away-0.131
 in-0.127
 soon-0.122
 bit-0.122
 left-0.118
TokenMom
Token ID29252
Feature activation-4.31e-03

Pos logit contributions
 While+0.094
 Many+0.093
 Later+0.089
 On+0.086
 From+0.086
Neg logit contributions
 away-0.161
 bit-0.149
 size-0.141
 left-0.141
 melted-0.140
Token,
Token ID11
Feature activation+5.68e-02

Pos logit contributions
ier+0.005
 powers+0.005
 wand+0.005
 properly+0.005
iness+0.005
Neg logit contributions
-0.007
V-0.007
-0.006
Tom-0.006
Look-0.006
Token Mom
Token ID11254
Feature activation-1.58e-02

Pos logit contributions
V+0.113
E+0.107
Look+0.101
+0.099
 Many+0.098
Neg logit contributions
ier-0.051
 enough-0.050
 away-0.049
 size-0.049
 for-0.046
Token!"
Token ID2474
Feature activation+5.56e-02

Pos logit contributions
+0.051
V+0.050
E+0.048
Look+0.046
Wow+0.046
Neg logit contributions
 wand-0.032
 size-0.031
 airplane-0.030
 wouldn-0.030
 coins-0.029
Token she
Token ID673
Feature activation-1.11e-02

Pos logit contributions
V+0.073
E+0.067
Look+0.060
+0.058
 Many+0.058
Neg logit contributions
ier-0.088
 away-0.086
 enough-0.086
 size-0.086
 perfectly-0.084
Token said
Token ID531
Feature activation-1.45e-02

Pos logit contributions
ier+0.023
 size+0.022
 falling+0.022
ening+0.021
 anywhere+0.021
Neg logit contributions
-0.013
V-0.011
 From-0.010
 -0.010
L-0.010
Token.
Token ID13
Feature activation+1.18e-01

Pos logit contributions
 island+0.020
 monsters+0.020
 size+0.020
et+0.019
flies+0.019
Neg logit contributions
-0.026
 -0.021
 The-0.021
 Her-0.020
 From-0.020
Token<|endoftext|>
Token ID50256
Feature activation+1.24e-01

Pos logit contributions
V+0.126
E+0.118
Ta+0.106
O+0.104
Three+0.096
Neg logit contributions
 for-0.194
 as-0.187
.-0.180
 and-0.178
 again-0.176
TokenOnce
Token ID7454
Feature activation+1.16e-01

Pos logit contributions
V+0.122
E+0.113
Br+0.104
 Many+0.099
 While+0.098
Neg logit contributions
.-0.239
 in-0.233
 on-0.232
!-0.230
 again-0.224
Token upon
Token ID2402
Feature activation+8.74e-02

Pos logit contributions
V+0.259
E+0.243
Look+0.236
Cat+0.223
Wow+0.222
Neg logit contributions
 for-0.122
ies-0.102
 enough-0.098
 in-0.093
 away-0.092
Token a
Token ID257
Feature activation+1.19e-01

Pos logit contributions
V+0.198
E+0.185
Kids+0.177
Look+0.175
 Later+0.173
Neg logit contributions
 enough-0.050
 for-0.047
 in-0.045
 away-0.045
 size-0.043
Token time
Token ID640
Feature activation+1.00e-01

Pos logit contributions
V+0.294
E+0.277
Look+0.277
Wow+0.275
 Where+0.263
Neg logit contributions
 enough-0.051
 for-0.049
 forever-0.048
 bit-0.048
 size-0.046
Token,
Token ID11
Feature activation+9.97e-02

Pos logit contributions
V+0.316
E+0.300
Look+0.295
Kids+0.286
Wow+0.285
Neg logit contributions
,-0.000
 for+0.004
 in+0.009
 enough+0.010
!+0.015
Token there
Token ID612
Feature activation+8.92e-02

Pos logit contributions
V+0.266
Kids+0.253
E+0.250
 Many+0.248
Cat+0.246
Neg logit contributions
 in-0.052
 for-0.037
 on-0.036
 away-0.030
 enough-0.024
Token on
Token ID319
Feature activation+9.18e-03

Pos logit contributions
V+0.015
+0.015
E+0.013
 Many+0.013
Look+0.013
Neg logit contributions
 wouldn-0.009
 size-0.008
iness-0.008
ier-0.008
 bit-0.008
Token the
Token ID262
Feature activation+4.83e-02

Pos logit contributions
V+0.014
+0.013
E+0.013
Look+0.012
 Many+0.012
Neg logit contributions
ier-0.015
iness-0.013
 wouldn-0.013
ener-0.013
ache-0.013
Token board
Token ID3096
Feature activation-8.92e-02

Pos logit contributions
V+0.093
E+0.087
Look+0.083
+0.083
 Many+0.081
Neg logit contributions
ier-0.047
 enough-0.045
 size-0.044
 away-0.044
 wouldn-0.042
Token every
Token ID790
Feature activation+1.70e-02

Pos logit contributions
ener+0.164
 permission+0.164
 thieves+0.162
 luck+0.156
ier+0.156
Neg logit contributions
-0.136
 One-0.105
 She-0.096
 On-0.096
 The-0.091
Token day
Token ID1110
Feature activation-3.78e-02

Pos logit contributions
V+0.023
+0.021
Look+0.020
 While+0.020
Wow+0.019
Neg logit contributions
ier-0.029
ist-0.026
 wouldn-0.025
 enough-0.024
 coins-0.024
Token.
Token ID13
Feature activation+1.55e-01

Pos logit contributions
 luck+0.061
ener+0.060
ience+0.059
bags+0.059
 sword+0.058
Neg logit contributions
-0.062
 The-0.047
 She-0.045
 Her-0.044
 It-0.042
Token One
Token ID1881
Feature activation+6.56e-02

Pos logit contributions
Mic+0.251
Quick+0.246
E+0.239
Ta+0.238
Pe+0.238
Neg logit contributions
.-0.217
 for-0.206
 and-0.201
,-0.198
 to-0.186
Token day
Token ID1110
Feature activation+7.13e-02

Pos logit contributions
V+0.142
E+0.138
Look+0.128
 Many+0.123
Wow+0.123
Neg logit contributions
 size-0.049
 for-0.047
 away-0.046
ier-0.045
 enough-0.045
Token,
Token ID11
Feature activation+7.17e-02

Pos logit contributions
V+0.190
E+0.184
Look+0.173
 Many+0.168
Wow+0.168
Neg logit contributions
 enough-0.019
 for-0.019
ier-0.016
 away-0.016
 size-0.014
Token she
Token ID673
Feature activation+1.44e-02

Pos logit contributions
V+0.110
E+0.103
Look+0.092
 Many+0.087
O+0.086
Neg logit contributions
 for-0.099
 away-0.097
 enough-0.097
 size-0.095
 perfectly-0.092
Token drew
Token ID9859
Feature activation+1.37e-02

Pos logit contributions
V+0.022
+0.020
Look+0.020
E+0.019
L+0.019
Neg logit contributions
ier-0.022
rier-0.020
 luck-0.020
 attention-0.020
iness-0.019
Token a
Token ID257
Feature activation+8.13e-02

Pos logit contributions
V+0.254
Kids+0.240
E+0.235
Jen+0.231
Cat+0.231
Neg logit contributions
 enough-0.044
 in-0.042
 for-0.032
 as-0.031
 on-0.026
Token little
Token ID1310
Feature activation+2.70e-02

Pos logit contributions
V+0.158
E+0.147
Look+0.143
 Many+0.142
 While+0.141
Neg logit contributions
 size-0.070
 airplane-0.067
 for-0.066
 crown-0.066
 wand-0.065
Token girl
Token ID2576
Feature activation+5.30e-02

Pos logit contributions
V+0.037
E+0.036
+0.031
Wow+0.031
Look+0.031
Neg logit contributions
ier-0.042
 wouldn-0.041
 perfectly-0.040
 enough-0.040
 size-0.039
Token named
Token ID3706
Feature activation+1.01e-01

Pos logit contributions
V+0.080
E+0.074
Look+0.068
+0.068
 Many+0.066
Neg logit contributions
ier-0.074
 size-0.072
 enough-0.071
 away-0.071
 perfectly-0.069
Token Lily
Token ID20037
Feature activation+8.06e-02

Pos logit contributions
V+0.169
Nice+0.160
His+0.159
E+0.158
Br+0.157
Neg logit contributions
 for-0.138
 in-0.123
 away-0.120
 inside-0.113
 all-0.111
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
V+0.232
E+0.221
 Many+0.215
Look+0.210
O+0.208
Neg logit contributions
 for-0.008
 away-0.007
 size-0.006
 in-0.005
.-0.002
Token She
Token ID1375
Feature activation+8.66e-02

Pos logit contributions
Isa+0.185
Ge+0.183
Ta+0.182
Quick+0.181
Ah+0.179
Neg logit contributions
 for-0.156
,-0.143
.-0.139
 like-0.139
 in-0.138
Token loved
Token ID6151
Feature activation+3.53e-02

Pos logit contributions
V+0.197
 Deal+0.191
E+0.190
 Many+0.184
Wow+0.178
Neg logit contributions
 in-0.054
 made-0.049
 kept-0.047
 left-0.046
 wouldn-0.045
Token to
Token ID284
Feature activation+3.55e-02

Pos logit contributions
V+0.073
E+0.070
+0.068
Look+0.067
 Many+0.066
Neg logit contributions
ier-0.033
 perfectly-0.027
 away-0.026
ist-0.026
 size-0.026
Token play
Token ID711
Feature activation+6.64e-02

Pos logit contributions
V+0.060
E+0.055
 Many+0.054
+0.054
 While+0.052
Neg logit contributions
ier-0.049
 enough-0.043
 away-0.042
 perfectly-0.042
 wouldn-0.042
Token outside
Token ID2354
Feature activation+1.35e-02

Pos logit contributions
V+0.159
E+0.152
Look+0.144
 Many+0.142
Wow+0.141
Neg logit contributions
 away-0.033
 for-0.030
ier-0.029
 enough-0.028
 size-0.027
Token happy
Token ID3772
Feature activation-8.26e-03

Pos logit contributions
V+0.035
+0.034
Look+0.031
E+0.030
Mom+0.030
Neg logit contributions
iness-0.025
ier-0.024
ist-0.023
 wouldn-0.021
ly-0.021
Token and
Token ID290
Feature activation+3.81e-03

Pos logit contributions
ience+0.013
ier+0.013
 wouldn+0.013
ist+0.012
iness+0.012
Neg logit contributions
-0.014
-0.010
 They-0.009
M-0.009
 The-0.009
Token laugh
Token ID6487
Feature activation+9.14e-03

Pos logit contributions
+0.005
V+0.005
 While+0.005
 Now+0.005
 The+0.005
Neg logit contributions
ier-0.006
ist-0.006
iness-0.005
ener-0.005
 anywhere-0.005
Token a
Token ID257
Feature activation+7.14e-02

Pos logit contributions
+0.018
V+0.017
E+0.016
M+0.015
 While+0.015
Neg logit contributions
 powers-0.009
 wouldn-0.009
eking-0.009
 falling-0.009
 size-0.009
Token lot
Token ID1256
Feature activation-2.68e-02

Pos logit contributions
V+0.116
E+0.106
Look+0.101
 Many+0.097
 While+0.096
Neg logit contributions
 size-0.089
ier-0.087
 away-0.084
 for-0.084
 enough-0.083
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
 wouldn+0.041
 forth+0.038
eking+0.037
 disappeared+0.037
 vanished+0.036
Neg logit contributions
-0.038
 The-0.029
E-0.028
 And-0.028
L-0.027
Token
Token ID198
Feature activation+1.59e-01

Pos logit contributions
Mic+0.202
Ta+0.186
Mat+0.178
Jan+0.173
Wh+0.172
Neg logit contributions
.-0.194
 for-0.193
 in-0.192
 and-0.186
,-0.184
Token
Token ID198
Feature activation+9.30e-02

Pos logit contributions
Mic+0.258
Ta+0.250
 Deal+0.227
Wh+0.219
roc+0.216
Neg logit contributions
.-0.211
 and-0.208
 in-0.199
 to-0.189
,-0.189
TokenBut
Token ID1537
Feature activation+4.93e-03

Pos logit contributions
V+0.106
 Many+0.100
 Ow+0.099
E+0.098
 From+0.092
Neg logit contributions
 away-0.137
 in-0.132
 bit-0.129
 size-0.128
 for-0.125
Token then
Token ID788
Feature activation+1.32e-02

Pos logit contributions
V+0.007
+0.007
Mom+0.007
Look+0.007
E+0.007
Neg logit contributions
ier-0.008
iness-0.007
 powers-0.007
ist-0.007
ache-0.007
Token they
Token ID484
Feature activation+6.94e-03

Pos logit contributions
V+0.023
+0.023
E+0.022
M+0.020
+0.020
Neg logit contributions
ier-0.016
ist-0.015
et-0.015
 size-0.014
 airplane-0.014
Token slow
Token ID3105
Feature activation-7.42e-02

Pos logit contributions
ier+0.203
iness+0.174
ist+0.164
iful+0.159
iest+0.159
Neg logit contributions
-0.122
 -0.114
Look-0.113
 Whe-0.112
V-0.107
Token down
Token ID866
Feature activation-8.62e-02

Pos logit contributions
work+0.108
vell+0.105
inations+0.105
 island+0.105
castle+0.105
Neg logit contributions
-0.101
 The-0.087
 They-0.085
 She-0.085
 It-0.083
Token her
Token ID607
Feature activation-5.52e-02

Pos logit contributions
urger+0.184
ache+0.170
 mustache+0.162
ils+0.161
ode+0.158
Neg logit contributions
-0.127
 She-0.094
 The-0.093
-0.088
 -0.081
Token slide
Token ID10649
Feature activation-9.52e-02

Pos logit contributions
ier+0.097
ils+0.090
work+0.088
 rewarded+0.081
 compared+0.079
Neg logit contributions
-0.082
 -0.063
 The-0.061
 But-0.061
 His-0.057
Token.
Token ID13
Feature activation+7.05e-02

Pos logit contributions
ils+0.170
 generosity+0.165
 tale+0.163
 stew+0.163
 potions+0.161
Neg logit contributions
-0.121
L-0.099
Tim-0.096
 She-0.092
 -0.090
Token When
Token ID1649
Feature activation+6.06e-03

Pos logit contributions
V+0.059
E+0.049
Look+0.044
Wow+0.040
 Many+0.039
Neg logit contributions
 enough-0.138
ier-0.138
 perfectly-0.135
 away-0.135
 size-0.134
Token she
Token ID673
Feature activation+1.21e-02

Pos logit contributions
E+0.008
V+0.008
+0.008
O+0.007
L+0.007
Neg logit contributions
ist-0.010
ier-0.010
et-0.010
ener-0.010
work-0.010
Token reached
Token ID4251
Feature activation-3.62e-02

Pos logit contributions
+0.017
V+0.017
O+0.016
E+0.016
M+0.015
Neg logit contributions
rier-0.019
ier-0.019
 size-0.019
work-0.018
 airplane-0.018
Token the
Token ID262
Feature activation-2.20e-02

Pos logit contributions
ache+0.080
ious+0.074
iness+0.074
eness+0.072
ier+0.072
Neg logit contributions
-0.041
-0.029
 The-0.029
 -0.027
V-0.027
Token bottom
Token ID4220
Feature activation-8.47e-02

Pos logit contributions
ist+0.053
ies+0.052
ci+0.051
iness+0.050
 able+0.050
Neg logit contributions
V-0.014
Look-0.013
E-0.013
Wow-0.012
Me-0.012
Token,
Token ID11
Feature activation-2.13e-02

Pos logit contributions
ils+0.168
ps+0.158
 property+0.155
 powers+0.154
 privacy+0.153
Neg logit contributions
-0.116
Jen-0.088
 She-0.088
Tom-0.082
Tim-0.081
Token her
Token ID607
Feature activation-5.92e-02

Pos logit contributions
ier+0.034
castle+0.033
cest+0.033
ious+0.032
 size+0.032
Neg logit contributions
-0.020
V-0.019
Look-0.017
E-0.016
 -0.016
Token in
Token ID287
Feature activation+8.46e-03

Pos logit contributions
ier+0.102
lling+0.091
 compared+0.088
ious+0.087
ising+0.087
Neg logit contributions
-0.083
L-0.072
Mom-0.068
 The-0.067
V-0.063
Token the
Token ID262
Feature activation-1.83e-03

Pos logit contributions
+0.012
V+0.012
E+0.011
Look+0.010
Mom+0.010
Neg logit contributions
ier-0.015
 wouldn-0.014
flies-0.013
iness-0.013
ist-0.013
Token cart
Token ID6383
Feature activation-2.10e-02

Pos logit contributions
ier+0.003
ist+0.003
flies+0.003
 able+0.003
work+0.003
Neg logit contributions
V-0.003
-0.003
Look-0.003
E-0.002
 While-0.002
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
 luck+0.029
flies+0.029
 powers+0.028
 wand+0.028
 size+0.028
Neg logit contributions
-0.038
L-0.031
V-0.030
Mom-0.030
 Later-0.029
Token Maybe
Token ID6674
Feature activation+9.46e-03

Pos logit contributions
V+0.126
E+0.124
O+0.117
Jen+0.110
Br+0.109
Neg logit contributions
 for-0.150
 enough-0.142
 as-0.141
 before-0.138
 or-0.137
Token she
Token ID673
Feature activation-3.12e-02

Pos logit contributions
+0.013
V+0.011
Mom+0.011
E+0.010
L+0.010
Neg logit contributions
ier-0.016
iness-0.015
et-0.015
ist-0.015
ening-0.015
Token forgot
Token ID16453
Feature activation-8.42e-02

Pos logit contributions
ier+0.049
rier+0.048
 throughout+0.047
ening+0.047
 rising+0.046
Neg logit contributions
-0.044
V-0.039
E-0.035
 From-0.034
I-0.034
Token to
Token ID284
Feature activation-2.94e-02

Pos logit contributions
mer+0.185
ener+0.180
ising+0.173
ler+0.172
 size+0.171
Neg logit contributions
-0.108
 Lily-0.079
 The-0.069
 Mom-0.068
 She-0.067
Token take
Token ID1011
Feature activation-4.04e-03

Pos logit contributions
ier+0.058
work+0.054
lers+0.054
led+0.053
flies+0.051
Neg logit contributions
-0.036
Tom-0.035
V-0.034
L-0.033
Mom-0.032
Token her
Token ID607
Feature activation-4.68e-02

Pos logit contributions
ier+0.008
castle+0.008
iness+0.008
ist+0.008
fish+0.008
Neg logit contributions
V-0.004
-0.004
E-0.004
Look-0.003
 Many-0.003
TokenMax
Token ID11518
Feature activation-9.94e-03

Pos logit contributions
E+0.079
V+0.078
 Many+0.076
 Ow+0.075
Look+0.073
Neg logit contributions
 away-0.139
 airplane-0.126
 perfectly-0.126
 bit-0.125
 size-0.125
Token thought
Token ID1807
Feature activation-9.32e-04

Pos logit contributions
ier+0.018
iness+0.016
iest+0.015
 anywhere+0.015
rier+0.015
Neg logit contributions
-0.015
V-0.014
M-0.014
 While-0.012
Sam-0.012
Token about
Token ID546
Feature activation+4.80e-02

Pos logit contributions
 size+0.001
work+0.001
flies+0.001
 state+0.001
castle+0.001
Neg logit contributions
-0.002
V-0.002
 Many-0.002
 On-0.001
 It-0.001
Token it
Token ID340
Feature activation-2.25e-02

Pos logit contributions
V+0.088
E+0.084
+0.080
Look+0.078
O+0.075
Neg logit contributions
ier-0.053
 size-0.050
 away-0.049
 enough-0.048
 perfectly-0.047
Token for
Token ID329
Feature activation+2.94e-02

Pos logit contributions
pan+0.034
ier+0.034
castle+0.034
 wil+0.034
eking+0.033
Neg logit contributions
-0.040
 So-0.029
 The-0.028
 When-0.028
 -0.028
Token a
Token ID257
Feature activation+4.73e-02

Pos logit contributions
V+0.047
E+0.044
+0.042
 Many+0.039
O+0.039
Neg logit contributions
ier-0.041
 wouldn-0.038
 airplane-0.038
 size-0.038
iness-0.037
Token moment
Token ID2589
Feature activation-4.74e-02

Pos logit contributions
V+0.078
E+0.074
+0.068
Look+0.068
 Many+0.066
Neg logit contributions
ier-0.061
 enough-0.056
 size-0.056
 away-0.056
 perfectly-0.055
Token and
Token ID290
Feature activation+5.80e-04

Pos logit contributions
 stays+0.087
ps+0.085
 punished+0.084
bags+0.084
ishes+0.084
Neg logit contributions
-0.074
M-0.044
 The-0.042
Once-0.041
 On-0.040
Token then
Token ID788
Feature activation-3.53e-02

Pos logit contributions
+0.001
V+0.001
L+0.001
E+0.001
M+0.001
Neg logit contributions
ier-0.001
iness-0.001
 size-0.001
 falling-0.001
 anywhere-0.001
Token said
Token ID531
Feature activation+1.28e-02

Pos logit contributions
 size+0.070
 falling+0.070
iful+0.069
iness+0.069
ier+0.068
Neg logit contributions
-0.037
V-0.030
 From-0.029
L-0.027
 -0.027
Token,
Token ID11
Feature activation+3.72e-02

Pos logit contributions
+0.023
V+0.022
E+0.021
 Many+0.019
 The+0.019
Neg logit contributions
 size-0.015
 airplane-0.014
et-0.013
 falling-0.013
 island-0.013
Token'm
Token ID1101
Feature activation+3.89e-02

Pos logit contributions
V+0.024
+0.023
E+0.021
Look+0.020
L+0.020
Neg logit contributions
 airplane-0.020
 falling-0.020
ier-0.019
eking-0.019
 happening-0.018
Token sorry
Token ID7926
Feature activation+3.47e-02

Pos logit contributions
V+0.056
+0.052
E+0.051
Look+0.047
 Many+0.047
Neg logit contributions
 size-0.055
ier-0.055
 wouldn-0.053
 away-0.053
iness-0.052
Token,
Token ID11
Feature activation-1.19e-02

Pos logit contributions
V+0.066
+0.066
E+0.064
 Many+0.058
Mom+0.058
Neg logit contributions
ier-0.033
 away-0.031
 size-0.031
et-0.030
 coins-0.029
Token Lily
Token ID20037
Feature activation-1.89e-02

Pos logit contributions
ier+0.026
ils+0.026
led+0.026
 powers+0.025
lers+0.025
Neg logit contributions
-0.011
V-0.009
E-0.009
L-0.008
 Many-0.008
Token.
Token ID13
Feature activation+9.07e-02

Pos logit contributions
eking+0.028
 powers+0.027
 footing+0.027
 falling+0.026
 size+0.026
Neg logit contributions
-0.032
 The-0.023
V-0.022
-0.022
 They-0.021
Token Can
Token ID1680
Feature activation+8.74e-02

Pos logit contributions
V+0.159
E+0.157
O+0.143
Look+0.140
Jen+0.136
Neg logit contributions
 for-0.092
 enough-0.087
 wouldn-0.087
 perfectly-0.085
 wand-0.085
Token I
Token ID314
Feature activation+2.85e-02

Pos logit contributions
V+0.194
E+0.184
Look+0.176
 While+0.170
Wow+0.168
Neg logit contributions
 wouldn-0.056
 for-0.055
 away-0.052
 enough-0.052
 perfectly-0.051
Token share
Token ID2648
Feature activation+2.24e-02

Pos logit contributions
V+0.044
+0.044
E+0.040
Mom+0.039
Look+0.039
Neg logit contributions
ier-0.040
 enough-0.037
 size-0.036
 coins-0.035
ening-0.035
Token your
Token ID534
Feature activation+1.79e-02

Pos logit contributions
+0.034
V+0.034
E+0.032
Look+0.030
Mom+0.030
Neg logit contributions
ier-0.033
 wouldn-0.031
 size-0.030
iness-0.030
 falling-0.029
Token colors
Token ID7577
Feature activation-9.95e-03

Pos logit contributions
V+0.033
+0.032
 Many+0.030
Look+0.030
E+0.029
Neg logit contributions
ier-0.022
 enough-0.019
 wouldn-0.018
 away-0.017
 compared-0.017
Token,
Token ID11
Feature activation-1.05e-02

Pos logit contributions
 wouldn+0.013
 falling+0.013
 size+0.013
ils+0.013
iously+0.013
Neg logit contributions
-0.016
Mom-0.014
V-0.014
L-0.013
E-0.013
Token catch
Token ID4929
Feature activation+1.92e-02

Pos logit contributions
V+0.045
+0.041
E+0.039
Look+0.038
 Many+0.038
Neg logit contributions
ier-0.029
 size-0.028
iness-0.027
ener-0.026
 wouldn-0.026
Token with
Token ID351
Feature activation+1.16e-02

Pos logit contributions
V+0.035
+0.033
E+0.032
Look+0.030
 Many+0.030
Neg logit contributions
ier-0.025
 size-0.024
 wouldn-0.023
ener-0.022
iness-0.022
Token this
Token ID428
Feature activation-3.60e-02

Pos logit contributions
+0.014
V+0.014
E+0.014
 Many+0.013
Tim+0.013
Neg logit contributions
ier-0.022
 wouldn-0.020
 size-0.020
ener-0.019
ache-0.019
Token different
Token ID1180
Feature activation-3.93e-02

Pos logit contributions
ier+0.071
ened+0.067
iously+0.067
ils+0.067
eness+0.066
Neg logit contributions
-0.044
V-0.034
Look-0.032
-0.031
Tim-0.030
Token ball
Token ID2613
Feature activation-7.41e-02

Pos logit contributions
ier+0.074
ache+0.071
 wouldn+0.068
 able+0.067
 unable+0.063
Neg logit contributions
 Many-0.054
-0.053
Wow-0.049
Look-0.048
V-0.048
Token I
Token ID314
Feature activation-1.13e-02

Pos logit contributions
eking+0.141
eling+0.137
 aroma+0.134
 secure+0.133
 trace+0.130
Neg logit contributions
-0.101
 They-0.070
 And-0.067
 Now-0.067
Tim-0.066
Token found
Token ID1043
Feature activation-6.31e-02

Pos logit contributions
rier+0.020
ier+0.020
 airplane+0.018
 falling+0.018
 happening+0.018
Neg logit contributions
-0.016
V-0.015
 While-0.013
 Many-0.013
 On-0.012
Token.
Token ID13
Feature activation+8.32e-02

Pos logit contributions
 size+0.101
ache+0.099
 decision+0.099
 state+0.097
iness+0.097
Neg logit contributions
-0.099
-0.082
 Now-0.070
Look-0.068
 The-0.066
Token It
Token ID632
Feature activation-2.11e-02

Pos logit contributions
V+0.088
E+0.087
O+0.075
Look+0.074
Jen+0.068
Neg logit contributions
 for-0.138
 enough-0.137
 perfectly-0.134
 wouldn-0.133
 away-0.133
Token's
Token ID338
Feature activation-2.06e-03

Pos logit contributions
ier+0.039
iness+0.037
 permission+0.035
 hesitation+0.034
epad+0.034
Neg logit contributions
-0.026
V-0.025
Look-0.020
-0.019
E-0.019
Token really
Token ID1107
Feature activation+8.06e-03

Pos logit contributions
ier+0.003
iness+0.003
ener+0.003
work+0.003
ly+0.003
Neg logit contributions
V-0.003
-0.003
Look-0.003
E-0.002
 While-0.002
Token,
Token ID11
Feature activation-7.57e-02

Pos logit contributions
 powers+0.119
ier+0.116
 possible+0.114
iest+0.113
 luck+0.111
Neg logit contributions
-0.108
 -0.096
 The-0.074
 Her-0.069
 She-0.063
Token and
Token ID290
Feature activation-8.36e-02

Pos logit contributions
ier+0.155
ils+0.150
 powers+0.145
iest+0.143
iful+0.137
Neg logit contributions
-0.123
 -0.093
 The-0.084
Tim-0.080
Tom-0.077
Token they
Token ID484
Feature activation-3.74e-02

Pos logit contributions
iful+0.172
ier+0.167
ets+0.163
ist+0.163
ising+0.162
Neg logit contributions
-0.093
 Tim-0.077
 Lily-0.077
 Mom-0.069
 The-0.067
Token put
Token ID1234
Feature activation-2.37e-02

Pos logit contributions
 powers+0.072
rier+0.067
 decisions+0.066
 size+0.065
 possible+0.065
Neg logit contributions
-0.046
 -0.041
Mom-0.040
M-0.036
L-0.036
Token out
Token ID503
Feature activation-4.96e-02

Pos logit contributions
ier+0.041
 affect+0.040
ache+0.040
 size+0.040
 appearance+0.039
Neg logit contributions
-0.031
Look-0.028
V-0.026
 -0.026
Wow-0.025
Token the
Token ID262
Feature activation-3.99e-02

Pos logit contributions
ising+0.107
 size+0.107
ier+0.104
 appearance+0.104
 affect+0.103
Neg logit contributions
 The-0.044
-0.044
 Their-0.032
 Some-0.031
E-0.029
Token fire
Token ID2046
Feature activation-8.36e-02

Pos logit contributions
ier+0.078
ils+0.072
 able+0.070
ets+0.069
 imagined+0.068
Neg logit contributions
-0.045
Mom-0.042
E-0.039
Look-0.037
-0.036
Token.
Token ID13
Feature activation+7.95e-02

Pos logit contributions
eling+0.138
eking+0.135
 som+0.135
 succeeding+0.135
 evil+0.135
Neg logit contributions
-0.124
-0.098
 They-0.087
 The-0.087
Mom-0.084
Token From
Token ID3574
Feature activation-1.30e-02

Pos logit contributions
V+0.073
E+0.062
O+0.051
Look+0.051
Wow+0.046
Neg logit contributions
 enough-0.149
 away-0.145
 for-0.145
 wouldn-0.144
 size-0.143
Token that
Token ID326
Feature activation+1.19e-02

Pos logit contributions
epad+0.021
ier+0.021
iness+0.020
ets+0.019
 size+0.019
Neg logit contributions
-0.019
Mom-0.018
-0.016
Look-0.016
A-0.016
Token day
Token ID1110
Feature activation+2.99e-02

Pos logit contributions
+0.015
V+0.013
Look+0.012
 While+0.011
 +0.011
Neg logit contributions
ier-0.024
 coins-0.021
ener-0.020
rier-0.020
eking-0.020
Token flew
Token ID13112
Feature activation-3.32e-02

Pos logit contributions
 possible+0.048
 powers+0.047
ier+0.045
 size+0.045
rier+0.045
Neg logit contributions
-0.033
 -0.028
 While-0.027
 From-0.025
 The-0.024
Token into
Token ID656
Feature activation-2.76e-02

Pos logit contributions
ache+0.042
 size+0.040
 dollars+0.038
 recipe+0.038
 pounds+0.038
Neg logit contributions
-0.065
Look-0.054
 The-0.053
-0.051
Wow-0.051
Token a
Token ID257
Feature activation-2.47e-02

Pos logit contributions
ier+0.061
 wouldn+0.058
ache+0.058
ist+0.056
iness+0.055
Neg logit contributions
-0.033
V-0.029
E-0.028
 The-0.025
Look-0.025
Token hole
Token ID7604
Feature activation-1.32e-01

Pos logit contributions
ier+0.046
iness+0.042
 wouldn+0.041
ps+0.041
 permission+0.040
Neg logit contributions
E-0.033
-0.033
V-0.033
Look-0.030
L-0.030
Token in
Token ID287
Feature activation-2.58e-02

Pos logit contributions
 decisions+0.236
 passports+0.231
 generosity+0.229
 bills+0.229
 fortune+0.226
Neg logit contributions
-0.173
-0.111
 She-0.094
 The-0.093
M-0.090
Token the
Token ID262
Feature activation-6.17e-02

Pos logit contributions
ier+0.053
ache+0.050
owing+0.049
 wouldn+0.049
ils+0.049
Neg logit contributions
-0.034
Look-0.029
E-0.028
Wow-0.026
Tim-0.024
Token ground
Token ID2323
Feature activation-1.36e-01

Pos logit contributions
ier+0.116
ering+0.115
iness+0.110
 able+0.107
ies+0.107
Neg logit contributions
Look-0.073
-0.073
Mom-0.068
E-0.060
Wow-0.060
Token.
Token ID13
Feature activation+6.06e-02

Pos logit contributions
ils+0.285
 groom+0.284
 passports+0.284
 married+0.278
 performance+0.274
Neg logit contributions
-0.179
 -0.104
 The-0.096
 She-0.094
-0.092
Token She
Token ID1375
Feature activation-3.78e-02

Pos logit contributions
V+0.045
E+0.039
Look+0.032
 Many+0.028
+0.028
Neg logit contributions
ier-0.127
 enough-0.126
 size-0.125
 away-0.125
 perfectly-0.123
Token couldn
Token ID3521
Feature activation+5.62e-02

Pos logit contributions
ier+0.084
rier+0.077
iness+0.076
 attention+0.072
 possible+0.071
Neg logit contributions
-0.046
V-0.042
 From-0.039
E-0.038
M-0.037
Token't
Token ID470
Feature activation+3.04e-02

Pos logit contributions
V+0.060
E+0.055
Look+0.048
+0.046
 Many+0.045
Neg logit contributions
ier-0.102
 enough-0.099
 size-0.099
 away-0.099
 perfectly-0.097
Tokenma
Token ID2611
Feature activation-5.03e-02

Pos logit contributions
V+0.085
E+0.081
Look+0.076
+0.074
O+0.072
Neg logit contributions
ier-0.052
 size-0.049
 enough-0.049
 away-0.049
 wouldn-0.047
Token was
Token ID373
Feature activation-2.27e-02

Pos logit contributions
 possible+0.093
 anywhere+0.092
 attention+0.092
ier+0.090
rier+0.089
Neg logit contributions
-0.064
 From-0.057
V-0.054
M-0.054
Mom-0.053
Token so
Token ID523
Feature activation+1.31e-02

Pos logit contributions
ies+0.040
fish+0.040
iness+0.040
flies+0.039
ly+0.038
Neg logit contributions
-0.033
 -0.028
V-0.027
 The-0.025
Once-0.025
Token happy
Token ID3772
Feature activation-4.31e-02

Pos logit contributions
+0.016
V+0.014
E+0.013
 Many+0.013
 From+0.012
Neg logit contributions
 airplane-0.024
 size-0.024
et-0.024
ist-0.024
 powers-0.024
Token and
Token ID290
Feature activation-2.76e-02

Pos logit contributions
 heaviest+0.083
iful+0.082
wy+0.081
et+0.078
ier+0.077
Neg logit contributions
-0.068
 She-0.047
 Now-0.041
 But-0.041
 The-0.039
Token proud
Token ID6613
Feature activation-4.54e-02

Pos logit contributions
ier+0.054
ist+0.054
iful+0.052
iness+0.050
et+0.050
Neg logit contributions
-0.035
 From-0.035
 The-0.030
 Now-0.029
 -0.028
Token,
Token ID11
Feature activation-7.10e-02

Pos logit contributions
et+0.092
ps+0.091
wy+0.090
 bank+0.090
rated+0.089
Neg logit contributions
-0.058
 Now-0.033
 She-0.033
L-0.030
 Everyone-0.030
Token and
Token ID290
Feature activation-5.44e-02

Pos logit contributions
ier+0.155
iful+0.145
ils+0.142
ist+0.135
ps+0.135
Neg logit contributions
-0.101
 From-0.080
 She-0.077
 -0.076
 The-0.075
Token off
Token ID572
Feature activation-6.00e-02

Pos logit contributions
ist+0.114
ier+0.111
ologist+0.109
et+0.105
iful+0.105
Neg logit contributions
 From-0.065
-0.064
 The-0.053
 -0.052
 Now-0.051
Token she
Token ID673
Feature activation-5.67e-02

Pos logit contributions
ode+0.140
ologist+0.130
 heaviest+0.130
 property+0.128
oscope+0.125
Neg logit contributions
-0.068
 She-0.052
 The-0.048
 From-0.046
 Her-0.044
Token went
Token ID1816
Feature activation-2.79e-02

Pos logit contributions
rier+0.130
pan+0.118
ier+0.116
 attention+0.110
 recipe+0.108
Neg logit contributions
-0.071
 From-0.059
 Now-0.047
 -0.046
 The-0.045
Token wanted
Token ID2227
Feature activation+3.44e-02

Pos logit contributions
V+0.180
E+0.169
 Many+0.162
Look+0.159
 Deal+0.158
Neg logit contributions
 wouldn-0.050
 kept-0.048
 size-0.047
 away-0.044
 left-0.043
Token to
Token ID284
Feature activation+7.26e-02

Pos logit contributions
V+0.067
+0.066
Look+0.062
E+0.061
 Many+0.061
Neg logit contributions
ier-0.035
 size-0.031
ist-0.031
iness-0.030
 perfectly-0.029
Token know
Token ID760
Feature activation+8.22e-03

Pos logit contributions
V+0.152
E+0.146
Look+0.135
Wow+0.132
Kids+0.132
Neg logit contributions
 away-0.060
 ring-0.058
 size-0.055
 crown-0.053
 for-0.053
Token what
Token ID644
Feature activation+4.87e-02

Pos logit contributions
+0.012
V+0.011
E+0.010
 Many+0.010
Look+0.009
Neg logit contributions
 size-0.012
ier-0.011
work-0.011
iness-0.011
ener-0.011
Token it
Token ID340
Feature activation+3.94e-02

Pos logit contributions
V+0.091
E+0.086
+0.082
Look+0.081
 Many+0.079
Neg logit contributions
ier-0.051
 size-0.048
 away-0.047
 enough-0.047
 perfectly-0.045
Token was
Token ID373
Feature activation-2.45e-02

Pos logit contributions
V+0.068
+0.064
E+0.064
Look+0.060
Wow+0.058
Neg logit contributions
ier-0.047
 size-0.043
 enough-0.043
 away-0.042
 perfectly-0.041
Token,
Token ID11
Feature activation+3.40e-02

Pos logit contributions
iness+0.033
ience+0.032
 breaths+0.032
isher+0.031
ened+0.031
Neg logit contributions
-0.040
 On-0.029
 One-0.029
 In-0.028
 While-0.027
Token so
Token ID523
Feature activation+6.69e-02

Pos logit contributions
V+0.064
+0.063
E+0.061
 Many+0.058
 While+0.058
Neg logit contributions
ier-0.037
 size-0.034
work-0.031
 perfectly-0.031
 luck-0.030
Token he
Token ID339
Feature activation+6.31e-02

Pos logit contributions
V+0.148
E+0.143
Look+0.135
 Many+0.130
O+0.129
Neg logit contributions
 enough-0.046
 away-0.046
 for-0.045
 wouldn-0.043
 size-0.043
Token went
Token ID1816
Feature activation+5.64e-02

Pos logit contributions
V+0.129
E+0.123
Look+0.116
 Many+0.115
+0.114
Neg logit contributions
 away-0.051
 enough-0.050
ier-0.050
 size-0.049
 wouldn-0.049
Token to
Token ID284
Feature activation+6.78e-02

Pos logit contributions
V+0.133
E+0.128
Look+0.121
+0.119
 Many+0.119
Neg logit contributions
ier-0.030
 away-0.028
 size-0.027
 enough-0.027
 perfectly-0.025
Tokenmy
Token ID1820
Feature activation+2.32e-02

Pos logit contributions
V+0.075
E+0.069
+0.064
Look+0.064
 Many+0.061
Neg logit contributions
ier-0.067
 size-0.065
 enough-0.064
 away-0.064
 perfectly-0.064
Token calls
Token ID3848
Feature activation-1.55e-02

Pos logit contributions
V+0.027
+0.024
E+0.021
Look+0.020
 Many+0.019
Neg logit contributions
 size-0.043
 perfectly-0.043
ier-0.043
 forever-0.042
iness-0.041
Token.
Token ID13
Feature activation+7.38e-02

Pos logit contributions
 powers+0.022
fish+0.021
lers+0.021
 collection+0.021
castle+0.020
Neg logit contributions
-0.027
Mom-0.022
V-0.022
 The-0.022
E-0.021
Token "
Token ID366
Feature activation+9.46e-02

Pos logit contributions
V+0.094
E+0.087
Look+0.076
O+0.074
Wow+0.072
Neg logit contributions
 for-0.118
 enough-0.115
 away-0.113
 size-0.113
ier-0.109
TokenThe
Token ID464
Feature activation-1.05e-02

Pos logit contributions
V+0.090
 While+0.084
E+0.080
 Later+0.080
 On+0.076
Neg logit contributions
 away-0.154
 size-0.151
ier-0.148
 falling-0.146
 ever-0.144
Token slide
Token ID10649
Feature activation-8.08e-02

Pos logit contributions
ist+0.019
ils+0.018
le+0.018
ier+0.018
ening+0.017
Neg logit contributions
V-0.013
Mom-0.013
Look-0.013
-0.013
O-0.012
Token is
Token ID318
Feature activation-4.21e-02

Pos logit contributions
 powers+0.178
 veil+0.175
 permission+0.170
epad+0.170
comb+0.170
Neg logit contributions
-0.074
V-0.063
 Sometimes-0.060
-0.059
 The-0.052
Token high
Token ID1029
Feature activation-5.96e-02

Pos logit contributions
iness+0.083
work+0.082
le+0.082
fish+0.081
ener+0.080
Neg logit contributions
-0.046
V-0.046
Mom-0.041
O-0.040
Tim-0.039
Token and
Token ID290
Feature activation-3.26e-02

Pos logit contributions
 powers+0.096
 collection+0.091
 reapp+0.091
 possessions+0.090
 generosity+0.089
Neg logit contributions
-0.078
-0.068
E-0.061
M-0.060
Mom-0.059
Token fast
Token ID3049
Feature activation-8.77e-02

Pos logit contributions
ier+0.059
ils+0.057
work+0.056
eness+0.056
 powers+0.054
Neg logit contributions
-0.043
V-0.041
It-0.036
 Sometimes-0.035
 The-0.035
Token.
Token ID13
Feature activation+1.01e-01

Pos logit contributions
ci+0.187
work+0.181
cards+0.175
 lit+0.168
 crown+0.168
Neg logit contributions
-0.105
-0.072
Tom-0.062
 Now-0.062
 They-0.062
Token real
Token ID1103
Feature activation-8.67e-02

Pos logit contributions
ier+0.174
ener+0.153
ly+0.153
led+0.152
flies+0.152
Neg logit contributions
-0.107
 -0.083
 The-0.080
 Some-0.071
 One-0.070
Token estate
Token ID7964
Feature activation-9.05e-02

Pos logit contributions
flies+0.157
ier+0.156
fly+0.154
ist+0.148
fish+0.145
Neg logit contributions
-0.113
 Many-0.092
 One-0.087
 -0.085
E-0.084
Token and
Token ID290
Feature activation-1.24e-01

Pos logit contributions
 veil+0.149
fish+0.137
 wil+0.133
chy+0.128
 blurry+0.127
Neg logit contributions
-0.147
 Now-0.123
 They-0.118
 And-0.114
 Soon-0.110
Token started
Token ID2067
Feature activation-9.14e-02

Pos logit contributions
ier+0.214
rier+0.204
eking+0.197
ist+0.194
le+0.193
Neg logit contributions
-0.149
 They-0.133
 From-0.127
 -0.127
 The-0.123
Token a
Token ID257
Feature activation-7.11e-02

Pos logit contributions
ier+0.180
ist+0.176
iness+0.175
fish+0.174
le+0.168
Neg logit contributions
-0.128
 The-0.107
 They-0.103
 -0.099
 Their-0.092
Token business
Token ID1597
Feature activation-1.58e-01

Pos logit contributions
ps+0.151
aces+0.145
ier+0.142
et+0.142
iness+0.139
Neg logit contributions
-0.077
E-0.072
L-0.061
Finally-0.059
 -0.058
Token together
Token ID1978
Feature activation-1.31e-01

Pos logit contributions
eking+0.248
 veil+0.246
ils+0.242
et+0.238
ps+0.231
Neg logit contributions
-0.207
 They-0.155
 He-0.144
 -0.125
 The-0.122
Token.
Token ID13
Feature activation+6.69e-02

Pos logit contributions
ener+0.243
 airplane+0.239
eking+0.235
ache+0.233
ier+0.232
Neg logit contributions
-0.179
 -0.110
 They-0.104
 The-0.097
 In-0.095
Token Soon
Token ID15894
Feature activation-4.99e-02

Pos logit contributions
V+0.061
E+0.055
Look+0.045
O+0.043
Wow+0.040
Neg logit contributions
 enough-0.128
ier-0.127
 away-0.126
 size-0.126
 perfectly-0.126
Token,
Token ID11
Feature activation-6.30e-02

Pos logit contributions
ist+0.090
ache+0.088
eking+0.088
rying+0.087
 ring+0.085
Neg logit contributions
-0.063
 Everyone-0.051
 -0.050
Tom-0.050
 Tom-0.049
Token they
Token ID484
Feature activation-6.46e-02

Pos logit contributions
ache+0.144
ener+0.142
ier+0.139
et+0.138
owing+0.136
Neg logit contributions
-0.064
Jack-0.045
 Everyone-0.045
Wow-0.042
 The-0.042
Token�
Token ID241
Feature activation+5.97e-02

Pos logit contributions
V+0.086
E+0.082
Look+0.074
+0.071
Wow+0.071
Neg logit contributions
ier-0.090
 away-0.089
 enough-0.089
 size-0.089
 perfectly-0.087
TokenYou
Token ID1639
Feature activation-4.97e-02

Pos logit contributions
V+0.050
E+0.044
Look+0.037
+0.035
 Many+0.035
Neg logit contributions
ier-0.120
 size-0.118
 enough-0.118
 away-0.117
 perfectly-0.115
Token cl
Token ID537
Feature activation+3.69e-02

Pos logit contributions
ier+0.090
rier+0.089
 unatt+0.088
ering+0.088
ies+0.085
Neg logit contributions
-0.064
V-0.060
 It-0.055
 Many-0.054
 While-0.049
Tokenap
Token ID499
Feature activation-8.77e-02

Pos logit contributions
V+0.053
E+0.047
+0.046
 Many+0.044
Look+0.043
Neg logit contributions
 size-0.055
ier-0.055
 coins-0.054
 away-0.054
 enough-0.053
Token your
Token ID534
Feature activation-5.22e-02

Pos logit contributions
 coins+0.139
 jewels+0.135
 airplane+0.134
 diamonds+0.132
ier+0.129
Neg logit contributions
-0.135
 It-0.115
 Now-0.103
 -0.101
V-0.100
Token hands
Token ID2832
Feature activation-1.71e-01

Pos logit contributions
ier+0.107
 happening+0.093
 compared+0.090
work+0.090
ache+0.089
Neg logit contributions
-0.059
 Many-0.051
V-0.051
 Now-0.046
 The-0.045
Token as
Token ID355
Feature activation-5.73e-02

Pos logit contributions
 coins+0.275
 possessions+0.273
ils+0.273
 injuries+0.268
 jewels+0.266
Neg logit contributions
-0.271
 The-0.185
 -0.182
 It-0.178
 She-0.172
Token fast
Token ID3049
Feature activation-6.25e-02

Pos logit contributions
ener+0.104
ache+0.094
lers+0.094
et+0.093
oscope+0.092
Neg logit contributions
-0.079
V-0.069
 Everyone-0.067
 Now-0.064
 Many-0.063
Token as
Token ID355
Feature activation-7.06e-02

Pos logit contributions
work+0.081
flies+0.074
 size+0.073
cards+0.073
smanship+0.073
Neg logit contributions
 Now-0.102
-0.101
V-0.100
O-0.091
 But-0.090
Token you
Token ID345
Feature activation-6.68e-02

Pos logit contributions
ener+0.140
ache+0.138
ening+0.136
lers+0.136
eness+0.134
Neg logit contributions
-0.076
V-0.061
A-0.055
L-0.054
 Now-0.054
Token can
Token ID460
Feature activation-3.43e-02

Pos logit contributions
rier+0.123
ering+0.113
ier+0.113
 possessions+0.110
cards+0.110
Neg logit contributions
-0.091
V-0.088
 Many-0.081
 On-0.074
-0.074
Token wide
Token ID3094
Feature activation-1.31e-01

Pos logit contributions
ier+0.181
ist+0.168
work+0.161
iness+0.159
icking+0.158
Neg logit contributions
-0.146
 Many-0.122
 The-0.121
 -0.115
Look-0.113
Token would
Token ID561
Feature activation-1.26e-01

Pos logit contributions
cher+0.229
ils+0.221
ault+0.216
vell+0.216
ppers+0.215
Neg logit contributions
-0.160
-0.118
 She-0.116
 One-0.114
 The-0.113
Token come
Token ID1282
Feature activation-6.09e-02

Pos logit contributions
ier+0.271
ishing+0.264
led+0.262
ishes+0.252
ils+0.252
Neg logit contributions
-0.151
 One-0.110
One-0.103
 Many-0.101
 -0.099
Token to
Token ID284
Feature activation-4.11e-02

Pos logit contributions
ils+0.110
iness+0.095
 cand+0.088
 aroma+0.087
ier+0.087
Neg logit contributions
-0.109
 The-0.077
-0.071
 She-0.069
One-0.069
Token the
Token ID262
Feature activation-8.35e-02

Pos logit contributions
ier+0.089
led+0.084
iness+0.084
ener+0.082
 wouldn+0.081
Neg logit contributions
-0.056
 The-0.042
 Many-0.040
 One-0.034
V-0.034
Token well
Token ID880
Feature activation-1.58e-01

Pos logit contributions
iness+0.175
ier+0.162
led+0.161
ened+0.159
owing+0.155
Neg logit contributions
-0.095
Look-0.095
Mom-0.091
V-0.090
E-0.081
Token to
Token ID284
Feature activation-3.34e-02

Pos logit contributions
ps+0.272
 streaming+0.252
ened+0.247
urger+0.247
 permission+0.242
Neg logit contributions
-0.244
-0.169
One-0.154
 One-0.142
 -0.138
Token take
Token ID1011
Feature activation-4.44e-02

Pos logit contributions
ier+0.062
lers+0.060
ener+0.059
ies+0.058
led+0.057
Neg logit contributions
-0.050
 The-0.038
 Many-0.038
V-0.037
Look-0.036
Token a
Token ID257
Feature activation-1.77e-02

Pos logit contributions
castle+0.085
ier+0.078
 size+0.075
fish+0.075
work+0.074
Neg logit contributions
-0.064
 Everyone-0.053
 Now-0.053
 The-0.051
E-0.051
Token peek
Token ID27185
Feature activation-1.14e-01

Pos logit contributions
ier+0.028
 size+0.027
et+0.026
ils+0.025
ps+0.025
Neg logit contributions
-0.026
V-0.024
E-0.024
 Many-0.022
 -0.021
Token inside
Token ID2641
Feature activation-1.21e-01

Pos logit contributions
 size+0.182
 ruining+0.168
 fight+0.162
 balance+0.162
 tongues+0.162
Neg logit contributions
-0.160
-0.118
 The-0.117
 It-0.109
"-0.106
Token were
Token ID547
Feature activation-8.89e-02

Pos logit contributions
ier+0.027
rying+0.025
ist+0.025
et+0.025
 size+0.025
Neg logit contributions
-0.020
 -0.015
 It-0.015
 She-0.015
 The-0.015
Token papers
Token ID9473
Feature activation-1.36e-01

Pos logit contributions
le+0.162
ist+0.154
owing+0.152
ener+0.149
ier+0.145
Neg logit contributions
 Some-0.114
E-0.101
 Glass-0.099
-0.097
Mom-0.096
Token and
Token ID290
Feature activation-7.16e-02

Pos logit contributions
iously+0.221
eers+0.221
rated+0.220
 melodies+0.219
iful+0.219
Neg logit contributions
-0.152
There-0.125
 She-0.118
Once-0.113
Inside-0.112
Token books
Token ID3835
Feature activation-1.09e-01

Pos logit contributions
owing+0.137
ier+0.131
rying+0.127
icking+0.120
ering+0.119
Neg logit contributions
 Some-0.093
 Glass-0.088
E-0.087
V-0.083
L-0.083
Token and
Token ID290
Feature activation-9.66e-02

Pos logit contributions
 Majesty+0.210
 decisions+0.205
 instincts+0.201
 permission+0.200
 crew+0.196
Neg logit contributions
-0.146
 She-0.106
 It-0.099
There-0.098
It-0.098
Token toys
Token ID14958
Feature activation-1.51e-01

Pos logit contributions
owing+0.187
rying+0.180
ier+0.176
ering+0.168
ist+0.168
Neg logit contributions
 Some-0.124
 Many-0.111
 Glass-0.105
-0.104
L-0.103
Token all
Token ID477
Feature activation-4.80e-02

Pos logit contributions
rier+0.293
 tune+0.271
 crew+0.267
 former+0.267
 Majesty+0.267
Neg logit contributions
-0.179
 She-0.120
 It-0.108
 They-0.107
-0.104
Token over
Token ID625
Feature activation-4.85e-02

Pos logit contributions
sters+0.084
rier+0.082
ault+0.081
ster+0.080
pper+0.080
Neg logit contributions
-0.068
 The-0.053
 They-0.052
 -0.051
 It-0.050
Token the
Token ID262
Feature activation+6.14e-03

Pos logit contributions
ache+0.083
flies+0.082
work+0.082
 wand+0.080
ishly+0.080
Neg logit contributions
-0.072
Mom-0.054
E-0.053
L-0.049
Tim-0.048
Token floor
Token ID4314
Feature activation-1.71e-01

Pos logit contributions
V+0.008
+0.008
Look+0.007
L+0.007
Mom+0.007
Neg logit contributions
ier-0.011
 able-0.010
iness-0.010
 enough-0.010
 crown-0.010
Token.
Token ID13
Feature activation+8.83e-02

Pos logit contributions
 eagle+0.363
 groom+0.355
 forth+0.351
 medal+0.350
 adventurer+0.344
Neg logit contributions
-0.169
 She-0.094
 They-0.090
 The-0.072
 -0.068
Token,
Token ID11
Feature activation-7.62e-03

Pos logit contributions
 pests+0.204
 thieves+0.204
 prisoner+0.194
 slaves+0.191
 plank+0.191
Neg logit contributions
-0.152
 The-0.109
 His-0.106
 -0.105
 It-0.102
Token dreaming
Token ID28368
Feature activation-7.42e-02

Pos logit contributions
ier+0.013
ils+0.012
le+0.011
iness+0.011
ener+0.011
Neg logit contributions
-0.013
 From-0.011
 -0.011
Look-0.011
 The-0.010
Token of
Token ID286
Feature activation-4.62e-02

Pos logit contributions
ci+0.176
 forth+0.174
boys+0.174
 wrists+0.172
flies+0.170
Neg logit contributions
-0.069
 Many-0.046
 Everyone-0.039
 The-0.038
 In-0.037
Token all
Token ID477
Feature activation-1.16e-02

Pos logit contributions
ener+0.099
ly+0.096
ist+0.092
mer+0.091
ets+0.088
Neg logit contributions
-0.056
<|endoftext|>-0.044
V-0.040
 Glass-0.038
 -0.038
Token the
Token ID262
Feature activation-2.01e-02

Pos logit contributions
ener+0.022
ier+0.022
 perfectly+0.021
sters+0.021
ache+0.021
Neg logit contributions
-0.012
 The-0.010
 When-0.009
 Now-0.009
 On-0.008
Token music
Token ID2647
Feature activation-1.47e-01

Pos logit contributions
ier+0.041
ener+0.038
comb+0.038
fly+0.038
 able+0.038
Neg logit contributions
V-0.021
-0.019
E-0.018
Look-0.017
 Many-0.016
Token he
Token ID339
Feature activation+4.91e-03

Pos logit contributions
eking+0.302
rying+0.282
iously+0.274
ener+0.274
asy+0.270
Neg logit contributions
-0.198
 It-0.131
 Everyone-0.130
 He-0.128
 Now-0.126
Token will
Token ID481
Feature activation-4.32e-02

Pos logit contributions
+0.008
V+0.008
+0.007
L+0.006
E+0.006
Neg logit contributions
rier-0.007
ier-0.007
pan-0.006
 enough-0.006
 size-0.006
Token dance
Token ID9280
Feature activation-1.01e-01

Pos logit contributions
lers+0.077
ener+0.074
 coins+0.072
ishes+0.071
rier+0.070
Neg logit contributions
-0.069
Once-0.049
One-0.048
V-0.047
It-0.047
Token to
Token ID284
Feature activation-1.55e-02

Pos logit contributions
 thieves+0.174
 supervision+0.174
ener+0.172
 pulp+0.166
iously+0.165
Neg logit contributions
-0.159
L-0.115
V-0.108
Max-0.099
 The-0.098
Token in
Token ID287
Feature activation-4.70e-02

Pos logit contributions
ener+0.025
ier+0.025
 wouldn+0.023
 coins+0.023
led+0.023
Neg logit contributions
-0.024
 Many-0.020
V-0.020
 The-0.019
 Everyone-0.018