TOP ACTIVATIONS
MAX = 9.750

her mom my . L ily ran inside
and dad every day . One day , Lily
the parade a lot . Tim saw a man
tables , and beds . One day , Tim
ran to her mom . " Mom my ,
her sweater very much . One day , Lily
Sue looked sad . Tim asked , "
Lily was very happy . L ily wanted to
and c udd ly . One day , Tim
with a big ball . Tim said , "
play and have fun . One day , Spot
and got very dirty . Tom said , "
little dog , Max . One day , Sue
were very good friends . One day , Sue
and felt very happy . But then , her
ran to the tree . " Hi , Tree
it to his mom . Tim said , "
in the warm sun . One day , Max
went inside the house . In the house ,
He was so happy . Tim took the tape

MIN ACTIVATIONS
MIN = -10.928

so loud with all the furniture talking , it was hard
could hide in the little cracks and cre v ices of
for getting the wild new carpet . <|endoftext|> Once there was
one of the p udd les and watched as the rain
all of the family 's possessions had been taken away ,
in its ears , the switch knows that it will never
they could now use the glue . The man was happy
given you . Feel the air and let it take you
something else - a beautiful wand with a star on the
Spr inkle it on any area and you will be granted
the fairy of the magic wand . You can make anything
his dinner with his cob web . <|endoftext|> Once upon a
kind and looked after her body . They gave her some
had their special pe b ble which united them in friendship
day and the way her body felt with each step .
the man was wearing a sock on his head , but
tried one of the unknown machines , but she knew it
were taking care of the air . From that moment ,
their feet were the heavy space boots . They pretended the
subtle tranqu ility filled the air and the rabbit felt his

INTERVAL 4.581 - 9.750
CONTAINS 6.124%

friends won the day . Sam was so happy
?" Lily asked Mom . " Yes , we
ever said " yes ". One day , the
lion !" she exclaimed . They watched the film
difficult ," Jill says . " Yes , it

INTERVAL -0.589 - 4.581
CONTAINS 49.584%

woman , smiled and ran to pick up the l ily
Once upon a time , there was a little boy named
called Sam . He was three years old and loved to
anything . <|endoftext|> Once upon a time , there was a
You can dance there too !" Sammy was so excited ,

INTERVAL -5.758 - -0.589
CONTAINS 43.401%

so excited for her special day . She was wearing the
he found it . And he would always love bananas .
went home , never to play together again . <|endoftext|> Sam
about colors and shapes , and Lily loved colors and shapes
but you should not use tape on the blankets . It

INTERVAL -10.928 - -5.758
CONTAINS 0.891%

shouted . " The red team is stronger and smarter .
to put lights around the tank . When they turned them
to either side of the bridge , so it formed a
and bringing justice to different parts of the world . Whenever
around in the spot of sunshine . Suddenly , he wasn
Token her
Token ID607
Feature activation+1.51e+00

Pos logit contributions
V+0.075
E+0.071
+0.069
 Many+0.066
Look+0.065
Neg logit contributions
ier-0.053
 size-0.051
 away-0.047
 perfectly-0.047
 wouldn-0.047
Token mom
Token ID1995
Feature activation+1.41e+00

Pos logit contributions
V+0.054
+0.052
E+0.051
 Many+0.047
Look+0.047
Neg logit contributions
ier-0.054
 size-0.048
 away-0.047
ist-0.047
et-0.046
Tokenmy
Token ID1820
Feature activation-2.17e-01

Pos logit contributions
+0.057
V+0.057
E+0.054
Look+0.050
 While+0.049
Neg logit contributions
ier-0.041
 size-0.039
 perfectly-0.037
 wouldn-0.037
 wand-0.037
Token.
Token ID13
Feature activation+6.39e+00

Pos logit contributions
ier+0.008
 wouldn+0.007
 size+0.007
 perfectly+0.007
 inches+0.007
Neg logit contributions
-0.008
 The-0.007
V-0.007
E-0.006
 -0.006
Token
Token ID220
Feature activation+8.84e+00

Pos logit contributions
Ta+0.262
Mic+0.252
V+0.250
Ah+0.244
 Deal+0.243
Neg logit contributions
 and-0.223
,-0.217
.-0.215
 to-0.211
 as-0.188
Token
Token ID198
Feature activation+9.75e+00

Pos logit contributions
V+0.324
Grab+0.322
Quick+0.319
Ta+0.319
Uh+0.318
Neg logit contributions
.-0.326
 for-0.286
 in-0.283
 and-0.264
 all-0.259
Token
Token ID198
Feature activation+4.41e+00

Pos logit contributions
 Deal+0.445
Grab+0.436
Ta+0.415
Uh+0.409
Really+0.406
Neg logit contributions
.-0.319
 and-0.303
,-0.271
 to-0.263
 for-0.261
TokenL
Token ID43
Feature activation+3.77e+00

Pos logit contributions
 Deal+0.121
 Ow+0.120
 Glass+0.107
 Some+0.107
 Next+0.106
Neg logit contributions
 away-0.161
 in-0.160
 made-0.154
 fl-0.153
 and-0.152
Tokenily
Token ID813
Feature activation+9.64e-01

Pos logit contributions
V+0.147
E+0.135
Look+0.135
Wow+0.131
Br+0.130
Neg logit contributions
ier-0.133
 for-0.119
ies-0.114
 enough-0.114
ils-0.112
Token ran
Token ID4966
Feature activation+3.02e+00

Pos logit contributions
V+0.039
+0.038
E+0.036
 While+0.035
Look+0.034
Neg logit contributions
ier-0.033
iness-0.028
 luck-0.027
 perfectly-0.027
 size-0.027
Token inside
Token ID2641
Feature activation+2.52e+00

Pos logit contributions
V+0.189
E+0.187
Look+0.174
 Many+0.172
Wow+0.171
Neg logit contributions
 away-0.032
 for-0.027
 enough-0.021
 in-0.018
 as-0.016
Token and
Token ID290
Feature activation+1.84e-01

Pos logit contributions
ier+0.020
 powers+0.018
eking+0.018
 coins+0.018
 wand+0.018
Neg logit contributions
-0.022
Mom-0.017
 The-0.017
M-0.017
 -0.016
Token dad
Token ID9955
Feature activation-3.89e-01

Pos logit contributions
+0.005
Mom+0.004
Dad+0.004
 While+0.004
V+0.004
Neg logit contributions
ier-0.009
 perfectly-0.008
ist-0.008
et-0.008
ening-0.008
Token every
Token ID790
Feature activation+1.76e+00

Pos logit contributions
ier+0.013
 airplane+0.011
castle+0.011
 powers+0.011
eking+0.011
Neg logit contributions
-0.017
 -0.014
 While-0.014
 The-0.013
V-0.013
Token day
Token ID1110
Feature activation+3.18e-01

Pos logit contributions
V+0.076
E+0.071
+0.070
Look+0.068
 Many+0.066
Neg logit contributions
ier-0.048
 size-0.044
 perfectly-0.043
 enough-0.043
 wouldn-0.042
Token.
Token ID13
Feature activation+6.95e+00

Pos logit contributions
+0.014
 The+0.012
Mom+0.012
E+0.011
 Her+0.011
Neg logit contributions
ier-0.009
 wand-0.009
ener-0.009
et-0.008
ps-0.008
Token
Token ID198
Feature activation+9.68e+00

Pos logit contributions
Quick+0.296
Ta+0.283
Grab+0.271
AAAA+0.270
Cur+0.265
Neg logit contributions
.-0.224
 for-0.218
 and-0.207
,-0.202
 to-0.190
Token
Token ID198
Feature activation+5.14e+00

Pos logit contributions
 Deal+0.415
Grab+0.413
Ta+0.384
Really+0.360
Uh+0.353
Neg logit contributions
.-0.348
 for-0.313
 and-0.309
 in-0.308
 to-0.293
TokenOne
Token ID3198
Feature activation+3.31e+00

Pos logit contributions
 Deal+0.194
 Int+0.181
 Here+0.178
 Now+0.174
 Ow+0.174
Neg logit contributions
 would-0.149
 for-0.147
 in-0.138
 game-0.133
 often-0.133
Token day
Token ID1110
Feature activation+3.76e+00

Pos logit contributions
V+0.191
E+0.185
Look+0.175
Wow+0.166
 Many+0.164
Neg logit contributions
 for-0.051
 away-0.042
 size-0.041
 in-0.041
 left-0.040
Token,
Token ID11
Feature activation+3.17e+00

Pos logit contributions
V+0.269
E+0.259
Look+0.248
Wow+0.239
 Many+0.239
Neg logit contributions
 for-0.007
 in-0.002
,-0.000
 enough+0.000
 away+0.002
Token Lily
Token ID20037
Feature activation+1.18e+00

Pos logit contributions
V+0.106
E+0.097
Look+0.088
 Many+0.086
+0.082
Neg logit contributions
 for-0.115
 away-0.112
 enough-0.110
 size-0.109
 as-0.109
Token the
Token ID262
Feature activation+6.61e-01

Pos logit contributions
V+0.019
+0.018
 Many+0.016
 Whe+0.016
Look+0.016
Neg logit contributions
ier-0.017
ener-0.016
ist-0.015
 perfectly-0.015
iness-0.015
Token parade
Token ID16134
Feature activation-2.34e+00

Pos logit contributions
V+0.028
+0.027
Look+0.026
 Many+0.025
E+0.025
Neg logit contributions
ier-0.020
 wouldn-0.019
ener-0.019
 perfectly-0.018
ist-0.018
Token a
Token ID257
Feature activation+1.17e+00

Pos logit contributions
eking+0.112
ener+0.110
pan+0.106
 breaths+0.105
 wand+0.104
Neg logit contributions
-0.080
 They-0.057
 The-0.057
 And-0.051
 Everyone-0.048
Token lot
Token ID1256
Feature activation-8.32e-01

Pos logit contributions
V+0.039
E+0.036
+0.035
Look+0.033
 Many+0.031
Neg logit contributions
ier-0.044
 enough-0.043
 away-0.042
 wouldn-0.041
et-0.041
Token.
Token ID13
Feature activation+6.28e+00

Pos logit contributions
 wouldn+0.030
 size+0.029
ener+0.029
eking+0.029
 wand+0.028
Neg logit contributions
-0.033
 The-0.024
L-0.023
 And-0.023
 Now-0.022
Token
Token ID198
Feature activation+9.64e+00

Pos logit contributions
E+0.264
Ta+0.260
Ba+0.250
Mic+0.250
 Deal+0.248
Neg logit contributions
.-0.198
 and-0.195
,-0.187
 to-0.175
 for-0.164
Token
Token ID198
Feature activation+4.67e+00

Pos logit contributions
 Deal+0.396
Grab+0.378
Ta+0.372
Really+0.363
Uh+0.350
Neg logit contributions
.-0.359
 and-0.343
 in-0.322
 to-0.311
,-0.308
TokenTim
Token ID14967
Feature activation+8.82e-01

Pos logit contributions
 Deal+0.162
 Here+0.124
 Ow+0.116
V+0.115
 Next+0.112
Neg logit contributions
 made-0.181
 in-0.175
 would-0.171
 and-0.171
 like-0.169
Token saw
Token ID2497
Feature activation+2.08e+00

Pos logit contributions
V+0.035
+0.034
 While+0.030
Look+0.030
E+0.030
Neg logit contributions
ier-0.031
 luck-0.028
 size-0.027
 wand-0.027
iness-0.026
Token a
Token ID257
Feature activation+5.05e-01

Pos logit contributions
V+0.090
E+0.085
+0.081
Look+0.080
 Many+0.079
Neg logit contributions
ier-0.056
 size-0.054
 away-0.053
 enough-0.052
 perfectly-0.051
Token man
Token ID582
Feature activation-3.73e-01

Pos logit contributions
V+0.018
+0.016
Look+0.016
E+0.016
Wow+0.015
Neg logit contributions
ier-0.019
 enough-0.018
et-0.018
 perfectly-0.018
 wouldn-0.018
Token tables
Token ID8893
Feature activation-1.77e+00

Pos logit contributions
+0.007
V+0.007
 Many+0.007
E+0.006
Tim+0.006
Neg logit contributions
ier-0.006
 luck-0.005
 away-0.005
ener-0.005
iness-0.005
Token,
Token ID11
Feature activation+5.42e-01

Pos logit contributions
 luck+0.067
 powers+0.067
ener+0.066
 reality+0.064
ience+0.064
Neg logit contributions
-0.065
 The-0.051
 They-0.048
 In-0.046
 At-0.045
Token and
Token ID290
Feature activation-5.98e-01

Pos logit contributions
+0.024
V+0.024
Look+0.022
E+0.022
 Many+0.022
Neg logit contributions
ier-0.014
 luck-0.013
 perfectly-0.012
 size-0.012
 powers-0.012
Token beds
Token ID20237
Feature activation-2.82e+00

Pos logit contributions
ier+0.023
ist+0.022
ener+0.021
iness+0.021
et+0.020
Neg logit contributions
 Many-0.022
V-0.021
-0.021
E-0.020
Look-0.020
Token.
Token ID13
Feature activation+6.52e+00

Pos logit contributions
 obstacle+0.123
ience+0.120
ener+0.120
 pearl+0.120
 worries+0.119
Neg logit contributions
-0.092
 They-0.067
-0.057
 The-0.057
 In-0.052
Token
Token ID198
Feature activation+9.64e+00

Pos logit contributions
Ba+0.298
Isa+0.283
Ta+0.277
Har+0.269
Elsa+0.268
Neg logit contributions
.-0.151
 for-0.147
,-0.143
 and-0.137
 too-0.129
Token
Token ID198
Feature activation+5.37e+00

Pos logit contributions
 Deal+0.451
Ta+0.442
Grab+0.429
Whe+0.393
Really+0.388
Neg logit contributions
.-0.310
 and-0.280
,-0.263
 in-0.261
 to-0.257
TokenOne
Token ID3198
Feature activation+3.19e+00

Pos logit contributions
 Deal+0.193
Isa+0.167
Ta+0.159
V+0.159
Ba+0.155
Neg logit contributions
 in-0.171
 for-0.166
 made-0.161
 would-0.160
,-0.159
Token day
Token ID1110
Feature activation+3.98e+00

Pos logit contributions
V+0.179
E+0.174
Look+0.165
Wow+0.157
O+0.153
Neg logit contributions
 for-0.052
 size-0.044
 away-0.043
 in-0.043
 left-0.042
Token,
Token ID11
Feature activation+3.43e+00

Pos logit contributions
V+0.299
E+0.290
Look+0.269
Kids+0.268
 Deal+0.262
Neg logit contributions
 for-0.004
,-0.000
 in+0.000
 as+0.006
 enough+0.008
Token Tim
Token ID5045
Feature activation+1.74e+00

Pos logit contributions
V+0.126
E+0.116
Kids+0.103
Look+0.099
 Deal+0.098
Neg logit contributions
 for-0.119
 as-0.117
 in-0.115
 away-0.112
 enough-0.111
Token ran
Token ID4966
Feature activation+2.53e+00

Pos logit contributions
V+0.003
+0.003
 While+0.003
E+0.002
Look+0.002
Neg logit contributions
ier-0.004
work-0.003
iest-0.003
 size-0.003
 luck-0.003
Token to
Token ID284
Feature activation+2.47e+00

Pos logit contributions
V+0.157
E+0.153
Look+0.145
 Many+0.144
+0.142
Neg logit contributions
 away-0.024
ier-0.017
 enough-0.017
 for-0.016
 size-0.016
Token her
Token ID607
Feature activation+2.41e+00

Pos logit contributions
V+0.117
E+0.112
Look+0.104
+0.102
Wow+0.102
Neg logit contributions
 away-0.056
 size-0.053
ier-0.052
 enough-0.052
 for-0.050
Token mom
Token ID1995
Feature activation+1.75e+00

Pos logit contributions
V+0.108
E+0.103
Look+0.096
 Many+0.095
Wow+0.094
Neg logit contributions
 away-0.061
 size-0.061
 enough-0.059
ier-0.058
 airplane-0.056
Token.
Token ID13
Feature activation+6.04e+00

Pos logit contributions
V+0.069
+0.066
E+0.065
Look+0.060
 While+0.060
Neg logit contributions
ier-0.053
 size-0.050
 luck-0.048
 perfectly-0.048
 enough-0.048
Token
Token ID198
Feature activation+9.63e+00

Pos logit contributions
V+0.212
Ta+0.203
E+0.199
Whe+0.183
Nice+0.177
Neg logit contributions
,-0.232
 to-0.230
 and-0.222
.-0.212
 as-0.201
Token
Token ID198
Feature activation+4.30e+00

Pos logit contributions
Really+0.458
Whe+0.454
Grab+0.451
Ta+0.450
 Deal+0.432
Neg logit contributions
.-0.301
 and-0.289
 to-0.283
,-0.274
 for-0.240
Token"
Token ID1
Feature activation+4.06e+00

Pos logit contributions
 Ow+0.156
V+0.154
Wow+0.146
 Some+0.145
E+0.143
Neg logit contributions
 away-0.126
 in-0.117
 bit-0.111
 left-0.107
 and-0.102
TokenMom
Token ID29252
Feature activation+7.13e-01

Pos logit contributions
 Soon+0.086
 Many+0.085
 Later+0.082
 From+0.081
V+0.081
Neg logit contributions
 away-0.184
 ring-0.180
 bit-0.180
 veil-0.172
 airplane-0.172
Tokenmy
Token ID1820
Feature activation-1.07e-01

Pos logit contributions
+0.027
V+0.027
E+0.024
Mom+0.023
Look+0.023
Neg logit contributions
ier-0.023
 wand-0.021
 coins-0.021
 size-0.020
 perfectly-0.020
Token,
Token ID11
Feature activation+1.58e+00

Pos logit contributions
ier+0.003
 size+0.003
 perfectly+0.003
 coins+0.003
 properly+0.003
Neg logit contributions
-0.004
V-0.004
 The-0.004
 Now-0.004
-0.004
Token her
Token ID607
Feature activation+2.76e+00

Pos logit contributions
V+0.101
E+0.096
+0.091
Look+0.090
 Many+0.088
Neg logit contributions
ier-0.048
 size-0.046
 away-0.045
 enough-0.044
 perfectly-0.042
Token sweater
Token ID35685
Feature activation-1.44e+00

Pos logit contributions
V+0.131
E+0.127
Look+0.120
 Many+0.119
+0.116
Neg logit contributions
 size-0.058
 away-0.057
 enough-0.055
 for-0.054
ier-0.054
Token very
Token ID845
Feature activation+3.42e+00

Pos logit contributions
ener+0.059
ening+0.055
eking+0.055
rying+0.054
lling+0.052
Neg logit contributions
-0.053
 The-0.041
 While-0.041
 And-0.038
 She-0.038
Token much
Token ID881
Feature activation+3.21e-02

Pos logit contributions
V+0.162
Look+0.151
E+0.141
Wow+0.139
Kids+0.133
Neg logit contributions
ier-0.080
 perfectly-0.079
 for-0.077
 size-0.077
 enough-0.077
Token.
Token ID13
Feature activation+7.01e+00

Pos logit contributions
+0.001
E+0.001
V+0.001
 Now+0.001
L+0.001
Neg logit contributions
eking-0.001
 size-0.001
ener-0.001
 falling-0.001
 wand-0.001
Token
Token ID198
Feature activation+9.63e+00

Pos logit contributions
Quick+0.316
Uh+0.288
Pe+0.275
Phill+0.272
Spe+0.270
Neg logit contributions
.-0.220
,-0.211
 and-0.206
 for-0.176
 to-0.174
Token
Token ID198
Feature activation+4.89e+00

Pos logit contributions
 Deal+0.427
Grab+0.414
Ta+0.381
Uh+0.369
Quick+0.365
Neg logit contributions
.-0.344
 and-0.312
 in-0.299
 to-0.291
 for-0.290
TokenOne
Token ID3198
Feature activation+3.11e+00

Pos logit contributions
 Deal+0.197
 Ow+0.171
 Here+0.163
 Glass+0.163
ucks+0.159
Neg logit contributions
 would-0.146
 and-0.145
 made-0.143
 in-0.143
 game-0.140
Token day
Token ID1110
Feature activation+3.61e+00

Pos logit contributions
V+0.174
E+0.169
Look+0.159
Wow+0.152
 Many+0.151
Neg logit contributions
 for-0.048
 size-0.044
 away-0.043
 left-0.041
 in-0.040
Token,
Token ID11
Feature activation+3.36e+00

Pos logit contributions
V+0.254
E+0.248
Look+0.236
 Many+0.229
Wow+0.227
Neg logit contributions
 for-0.006
 in-0.002
,-0.000
 enough+0.000
 away+0.001
Token Lily
Token ID20037
Feature activation+1.36e+00

Pos logit contributions
V+0.113
E+0.106
Look+0.095
 Many+0.094
Kids+0.092
Neg logit contributions
 for-0.121
 as-0.116
 away-0.116
 in-0.115
 enough-0.112
Token Sue
Token ID26089
Feature activation+9.50e-01

Pos logit contributions
V+0.249
E+0.239
Ba+0.231
Mic+0.223
Isa+0.218
Neg logit contributions
,-0.144
 for-0.140
 in-0.135
 and-0.133
.-0.132
Token looked
Token ID3114
Feature activation+1.32e+00

Pos logit contributions
+0.036
V+0.035
E+0.032
Look+0.031
 While+0.030
Neg logit contributions
ier-0.035
 size-0.031
 luck-0.031
iness-0.030
 enough-0.030
Token sad
Token ID6507
Feature activation-2.56e-01

Pos logit contributions
V+0.060
+0.060
E+0.057
 While+0.056
Look+0.056
Neg logit contributions
ier-0.033
 size-0.031
 powers-0.026
 enough-0.026
iness-0.025
Token.
Token ID13
Feature activation+5.76e+00

Pos logit contributions
ils+0.008
et+0.008
 powers+0.008
 trace+0.007
 size+0.007
Neg logit contributions
-0.011
 They-0.008
 She-0.008
 The-0.008
 So-0.008
Token
Token ID220
Feature activation+7.98e+00

Pos logit contributions
V+0.233
E+0.223
Ba+0.210
Ta+0.206
Isa+0.198
Neg logit contributions
,-0.200
 and-0.184
.-0.184
 in-0.182
 to-0.182
Token
Token ID198
Feature activation+9.62e+00

Pos logit contributions
Ta+0.284
V+0.277
Grab+0.258
Quick+0.255
E+0.255
Neg logit contributions
.-0.318
 for-0.288
 in-0.287
 and-0.280
 on-0.272
Token
Token ID198
Feature activation+4.66e+00

Pos logit contributions
Ta+0.418
 Deal+0.404
Grab+0.393
Really+0.392
Uh+0.388
Neg logit contributions
.-0.338
 and-0.323
,-0.319
!-0.283
 in-0.276
TokenTim
Token ID14967
Feature activation+1.27e+00

Pos logit contributions
 Deal+0.131
V+0.128
E+0.110
Isa+0.108
 Ow+0.107
Neg logit contributions
 in-0.189
!-0.184
,-0.182
 and-0.181
 made-0.179
Token asked
Token ID1965
Feature activation+3.19e+00

Pos logit contributions
V+0.048
+0.048
E+0.043
Look+0.042
 While+0.042
Neg logit contributions
ier-0.044
 size-0.040
 luck-0.039
 perfectly-0.039
iness-0.038
Token,
Token ID11
Feature activation+3.04e+00

Pos logit contributions
V+0.123
E+0.116
Look+0.101
 Many+0.097
 While+0.097
Neg logit contributions
 for-0.112
 enough-0.107
 away-0.106
ier-0.100
 luck-0.100
Token "
Token ID366
Feature activation+4.51e+00

Pos logit contributions
V+0.101
E+0.096
 Many+0.085
Look+0.083
 While+0.081
Neg logit contributions
 for-0.108
ier-0.105
 enough-0.105
 away-0.104
 size-0.103
Token Lily
Token ID20037
Feature activation+1.60e+00

Pos logit contributions
Phill+0.233
Phil+0.232
Josh+0.230
Rick+0.222
Dan+0.222
Neg logit contributions
 for-0.152
.-0.146
 as-0.140
,-0.140
 and-0.139
Token was
Token ID373
Feature activation+2.47e+00

Pos logit contributions
V+0.069
E+0.065
+0.063
Look+0.061
 While+0.060
Neg logit contributions
ier-0.045
 away-0.041
 size-0.041
 luck-0.040
 enough-0.040
Token very
Token ID845
Feature activation+2.71e+00

Pos logit contributions
V+0.135
E+0.130
Look+0.123
+0.121
Wow+0.120
Neg logit contributions
ier-0.036
 size-0.035
 enough-0.035
 away-0.035
 for-0.033
Token happy
Token ID3772
Feature activation+3.50e-02

Pos logit contributions
V+0.138
E+0.129
Look+0.125
Wow+0.121
+0.120
Neg logit contributions
ier-0.052
 size-0.051
 for-0.050
 away-0.049
 luck-0.047
Token.
Token ID13
Feature activation+7.19e+00

Pos logit contributions
+0.001
 Now+0.001
 While+0.001
L+0.001
 The+0.001
Neg logit contributions
ier-0.001
iful-0.001
ist-0.001
ience-0.001
ps-0.001
Token
Token ID198
Feature activation+9.62e+00

Pos logit contributions
Ar+0.317
Ta+0.316
Quick+0.314
AAAA+0.312
Isa+0.304
Neg logit contributions
.-0.196
,-0.184
 and-0.183
 to-0.168
 for-0.155
Token
Token ID198
Feature activation+5.01e+00

Pos logit contributions
 Deal+0.437
Grab+0.430
Ta+0.410
Really+0.391
Uh+0.389
Neg logit contributions
.-0.325
 and-0.294
 for-0.278
 in-0.276
 to-0.273
TokenL
Token ID43
Feature activation+4.14e+00

Pos logit contributions
 Deal+0.182
 Ow+0.164
omm+0.154
 Glass+0.148
 Cr+0.146
Neg logit contributions
 in-0.164
 made-0.162
 and-0.159
 when-0.156
 on-0.148
Tokenily
Token ID813
Feature activation+1.13e+00

Pos logit contributions
V+0.176
Look+0.163
Br+0.161
E+0.159
Wow+0.158
Neg logit contributions
ier-0.139
 for-0.121
 enough-0.113
ies-0.111
 in-0.109
Token wanted
Token ID2227
Feature activation+1.26e+00

Pos logit contributions
V+0.047
+0.045
E+0.043
 While+0.042
Look+0.042
Neg logit contributions
ier-0.035
 luck-0.031
 size-0.030
 perfectly-0.030
 away-0.029
Token to
Token ID284
Feature activation+1.79e+00

Pos logit contributions
V+0.059
+0.059
E+0.057
Look+0.054
 Many+0.054
Neg logit contributions
ier-0.030
ist-0.026
iness-0.026
 size-0.026
 perfectly-0.026
Token and
Token ID290
Feature activation+2.19e+00

Pos logit contributions
+0.004
 While+0.003
E+0.003
V+0.003
 Now+0.003
Neg logit contributions
ier-0.003
 happen-0.003
eking-0.003
 possible-0.003
 wouldn-0.002
Token c
Token ID269
Feature activation+3.10e+00

Pos logit contributions
V+0.100
E+0.095
+0.090
Look+0.089
 Many+0.087
Neg logit contributions
ier-0.052
 size-0.050
 away-0.049
 enough-0.049
 perfectly-0.046
Tokenudd
Token ID4185
Feature activation+2.62e+00

Pos logit contributions
V+0.152
Look+0.139
E+0.138
Wow+0.134
 Many+0.133
Neg logit contributions
ier-0.075
ies-0.064
 away-0.063
 wouldn-0.063
 perfectly-0.063
Tokenly
Token ID306
Feature activation-3.12e+00

Pos logit contributions
V+0.173
E+0.162
Look+0.160
+0.158
 Many+0.157
Neg logit contributions
ier-0.016
 away-0.012
 enough-0.008
 size-0.008
 for-0.008
Token.
Token ID13
Feature activation+6.06e+00

Pos logit contributions
castle+0.154
isher+0.153
ist+0.148
 airplane+0.148
eking+0.148
Neg logit contributions
-0.092
 And-0.062
 They-0.059
 While-0.056
 The-0.056
Token
Token ID198
Feature activation+9.62e+00

Pos logit contributions
Ta+0.252
V+0.223
Ah+0.200
Luc+0.199
Click+0.197
Neg logit contributions
 for-0.198
,-0.177
 and-0.176
.-0.175
 because-0.173
Token
Token ID198
Feature activation+5.45e+00

Pos logit contributions
Ta+0.455
 Deal+0.434
Grab+0.420
Really+0.370
Whe+0.370
Neg logit contributions
.-0.312
 and-0.281
 for-0.257
 in-0.257
,-0.257
TokenOne
Token ID3198
Feature activation+4.07e+00

Pos logit contributions
 Deal+0.208
Ta+0.203
 Here+0.174
 Someone+0.169
 Boom+0.169
Neg logit contributions
 for-0.160
 and-0.151
 would-0.145
 in-0.144
!-0.143
Token day
Token ID1110
Feature activation+4.06e+00

Pos logit contributions
V+0.265
E+0.259
Look+0.248
 Deal+0.236
 Int+0.235
Neg logit contributions
 for-0.046
 left-0.029
 in-0.028
 away-0.022
ies-0.019
Token,
Token ID11
Feature activation+3.89e+00

Pos logit contributions
V+0.309
E+0.297
Look+0.279
Kids+0.277
Ta+0.276
Neg logit contributions
,-0.000
 for+0.003
 in+0.007
 as+0.008
 on+0.012
Token Tim
Token ID5045
Feature activation+3.12e+00

Pos logit contributions
V+0.160
E+0.140
 Many+0.130
Ta+0.130
Kids+0.128
Neg logit contributions
 as-0.128
 for-0.126
 in-0.117
 away-0.115
 bit-0.112
Token with
Token ID351
Feature activation+1.57e+00

Pos logit contributions
V+0.077
+0.072
E+0.071
Look+0.070
 Many+0.068
Neg logit contributions
ier-0.038
 size-0.035
 enough-0.034
 wouldn-0.033
 airplane-0.033
Token a
Token ID257
Feature activation+8.37e-01

Pos logit contributions
V+0.058
E+0.055
+0.054
 Many+0.051
Look+0.051
Neg logit contributions
ier-0.056
 wouldn-0.051
 size-0.051
 away-0.049
 perfectly-0.049
Token big
Token ID1263
Feature activation+5.93e-01

Pos logit contributions
V+0.034
Look+0.031
E+0.031
+0.031
L+0.029
Neg logit contributions
ier-0.028
 enough-0.025
 wouldn-0.024
 perfectly-0.024
et-0.024
Token ball
Token ID2613
Feature activation+3.07e-01

Pos logit contributions
V+0.022
+0.020
Look+0.020
 Many+0.019
 While+0.018
Neg logit contributions
ier-0.022
 wouldn-0.020
 perfectly-0.019
ist-0.018
 enough-0.018
Token.
Token ID13
Feature activation+6.40e+00

Pos logit contributions
+0.014
V+0.013
 While+0.012
L+0.012
 Many+0.012
Neg logit contributions
ier-0.008
 luck-0.008
ener-0.007
 wouldn-0.007
 size-0.007
Token
Token ID198
Feature activation+9.60e+00

Pos logit contributions
E+0.259
Isa+0.236
Br+0.232
V+0.225
Ta+0.222
Neg logit contributions
,-0.233
 and-0.221
 to-0.211
.-0.193
 for-0.187
Token
Token ID198
Feature activation+4.97e+00

Pos logit contributions
 Deal+0.452
Really+0.416
omm+0.401
Ta+0.400
Grab+0.399
Neg logit contributions
 and-0.338
.-0.333
,-0.302
 in-0.300
 to-0.293
TokenTim
Token ID14967
Feature activation+1.08e+00

Pos logit contributions
 Deal+0.175
omm+0.136
Isa+0.130
 Ow+0.129
ucks+0.125
Neg logit contributions
 made-0.195
 and-0.194
 in-0.191
!-0.185
,-0.183
Token said
Token ID531
Feature activation+1.98e+00

Pos logit contributions
V+0.044
+0.044
 While+0.039
E+0.039
Look+0.039
Neg logit contributions
ier-0.036
 luck-0.032
 size-0.032
 perfectly-0.030
iness-0.030
Token,
Token ID11
Feature activation+2.56e+00

Pos logit contributions
V+0.110
E+0.105
+0.102
Look+0.100
 Many+0.099
Neg logit contributions
ier-0.028
 size-0.027
 away-0.025
 enough-0.024
 perfectly-0.023
Token "
Token ID366
Feature activation+4.96e+00

Pos logit contributions
V+0.076
E+0.071
Look+0.063
 Many+0.061
+0.061
Neg logit contributions
ier-0.099
 away-0.097
 size-0.097
 enough-0.097
 for-0.095
Token play
Token ID711
Feature activation+2.79e+00

Pos logit contributions
V+0.043
+0.040
 Many+0.039
 While+0.038
E+0.037
Neg logit contributions
ier-0.048
 enough-0.043
 perfectly-0.042
ener-0.041
 wouldn-0.040
Token and
Token ID290
Feature activation+1.58e+00

Pos logit contributions
V+0.162
E+0.159
Look+0.148
Wow+0.145
 Many+0.145
Neg logit contributions
 away-0.033
 for-0.030
ier-0.029
 size-0.027
 enough-0.026
Token have
Token ID423
Feature activation+2.91e+00

Pos logit contributions
V+0.064
+0.059
E+0.058
 While+0.057
 Many+0.057
Neg logit contributions
ier-0.050
 enough-0.044
 size-0.044
 away-0.042
 luck-0.042
Token fun
Token ID1257
Feature activation+2.26e+00

Pos logit contributions
V+0.152
E+0.147
Look+0.137
Mom+0.134
O+0.133
Neg logit contributions
 enough-0.050
 away-0.046
 luck-0.044
 for-0.044
 coins-0.044
Token.
Token ID13
Feature activation+6.59e+00

Pos logit contributions
V+0.131
E+0.126
Look+0.119
+0.118
 Many+0.117
Neg logit contributions
ier-0.028
 size-0.026
 away-0.025
 enough-0.025
 for-0.022
Token
Token ID198
Feature activation+9.60e+00

Pos logit contributions
Ta+0.279
E+0.271
P+0.247
O+0.244
J+0.244
Neg logit contributions
 for-0.234
 and-0.218
.-0.211
,-0.205
 anymore-0.189
Token
Token ID198
Feature activation+5.53e+00

Pos logit contributions
 Deal+0.388
Grab+0.384
Ta+0.381
Really+0.350
Gu+0.343
Neg logit contributions
.-0.358
 for-0.330
 and-0.325
 in-0.319
 on-0.303
TokenOne
Token ID3198
Feature activation+3.75e+00

Pos logit contributions
 Deal+0.224
Ta+0.211
 Here+0.199
 Glass+0.189
 Someone+0.187
Neg logit contributions
 for-0.169
 would-0.166
 on-0.158
 made-0.157
 in-0.156
Token day
Token ID1110
Feature activation+4.33e+00

Pos logit contributions
E+0.224
V+0.223
Look+0.209
Wow+0.196
 Deal+0.196
Neg logit contributions
 for-0.055
 in-0.042
 left-0.041
 away-0.039
 size-0.035
Token,
Token ID11
Feature activation+4.14e+00

Pos logit contributions
V+0.315
E+0.310
Look+0.292
Ta+0.287
Kids+0.286
Neg logit contributions
 for-0.001
,-0.000
 in+0.002
 enough+0.004
 as+0.007
Token Spot
Token ID15899
Feature activation+2.06e+00

Pos logit contributions
V+0.181
E+0.175
Kids+0.162
 Deal+0.162
Ta+0.161
Neg logit contributions
 as-0.112
 for-0.107
 in-0.106
 enough-0.100
 on-0.097
Token and
Token ID290
Feature activation+2.48e-01

Pos logit contributions
 luck+0.023
 powers+0.022
ience+0.022
 secure+0.022
 perfectly+0.021
Neg logit contributions
-0.035
 While-0.029
V-0.028
L-0.028
Look-0.028
Token got
Token ID1392
Feature activation+1.54e+00

Pos logit contributions
+0.009
V+0.008
Look+0.008
 While+0.007
Wow+0.007
Neg logit contributions
ier-0.011
ist-0.010
 luck-0.010
eness-0.010
iness-0.009
Token very
Token ID845
Feature activation+2.13e+00

Pos logit contributions
V+0.072
E+0.068
+0.066
Look+0.066
 Many+0.063
Neg logit contributions
ier-0.038
 size-0.035
ist-0.033
 airplane-0.032
 enough-0.032
Token dirty
Token ID11841
Feature activation-5.45e-01

Pos logit contributions
V+0.106
E+0.102
+0.096
Look+0.096
 Many+0.094
Neg logit contributions
ier-0.043
 size-0.041
 away-0.040
 enough-0.039
 perfectly-0.037
Token.
Token ID13
Feature activation+6.51e+00

Pos logit contributions
 wand+0.018
ience+0.017
 crown+0.017
 powers+0.017
 transformed+0.017
Neg logit contributions
-0.019
E-0.016
 They-0.015
L-0.015
Wow-0.015
Token
Token ID198
Feature activation+9.58e+00

Pos logit contributions
E+0.270
Gr+0.261
Ta+0.255
Beck+0.238
Pr+0.238
Neg logit contributions
,-0.193
 to-0.193
 and-0.192
.-0.178
!-0.163
Token
Token ID198
Feature activation+5.04e+00

Pos logit contributions
 Deal+0.453
Ta+0.435
Really+0.425
Grab+0.420
Uh+0.395
Neg logit contributions
.-0.300
 and-0.289
 to-0.278
,-0.262
 for-0.254
TokenTom
Token ID13787
Feature activation+1.19e+00

Pos logit contributions
 Deal+0.172
E+0.148
 Ow+0.143
Isa+0.129
Ta+0.128
Neg logit contributions
 made-0.175
!-0.173
 and-0.173
 left-0.165
 than-0.164
Token said
Token ID531
Feature activation+1.48e+00

Pos logit contributions
V+0.047
+0.045
E+0.042
 While+0.040
L+0.040
Neg logit contributions
ier-0.041
 luck-0.036
 size-0.036
 perfectly-0.035
iness-0.034
Token,
Token ID11
Feature activation+2.00e+00

Pos logit contributions
V+0.075
+0.072
E+0.071
 Many+0.069
Look+0.067
Neg logit contributions
 size-0.029
ier-0.028
 airplane-0.024
 perfectly-0.024
ist-0.024
Token "
Token ID366
Feature activation+4.23e+00

Pos logit contributions
V+0.051
E+0.046
+0.044
Look+0.041
 Many+0.040
Neg logit contributions
ier-0.090
 size-0.088
 away-0.086
 enough-0.085
 luck-0.084
Token little
Token ID1310
Feature activation-1.23e+00

Pos logit contributions
+0.033
V+0.030
 Many+0.026
 Blood+0.026
Mom+0.026
Neg logit contributions
ier-0.032
 enough-0.029
 away-0.027
ist-0.027
le-0.026
Token dog
Token ID3290
Feature activation-1.25e-01

Pos logit contributions
ier+0.056
 happening+0.055
ist+0.051
 wouldn+0.051
 happen+0.051
Neg logit contributions
-0.035
 Many-0.033
V-0.033
Mom-0.030
 The-0.029
Token,
Token ID11
Feature activation-3.41e-01

Pos logit contributions
ier+0.004
iness+0.003
 luck+0.003
 size+0.003
eking+0.003
Neg logit contributions
-0.006
V-0.005
 The-0.005
 While-0.005
 Many-0.004
Token Max
Token ID5436
Feature activation+5.36e-01

Pos logit contributions
ier+0.019
iness+0.016
ering+0.016
ening+0.016
 luck+0.015
Neg logit contributions
-0.011
 The-0.009
 Many-0.008
V-0.008
 Whe-0.008
Token.
Token ID13
Feature activation+5.98e+00

Pos logit contributions
+0.026
V+0.024
E+0.022
L+0.021
 The+0.021
Neg logit contributions
ier-0.015
 luck-0.013
 size-0.013
iness-0.013
 inches-0.012
Token
Token ID198
Feature activation+9.56e+00

Pos logit contributions
E+0.197
Ta+0.194
Quick+0.173
Phill+0.172
Isa+0.170
Neg logit contributions
 and-0.207
.-0.204
,-0.201
 for-0.200
 too-0.179
Token
Token ID198
Feature activation+4.96e+00

Pos logit contributions
 Deal+0.413
Grab+0.388
Ta+0.386
Really+0.370
omm+0.352
Neg logit contributions
.-0.337
 and-0.311
 in-0.294
 to-0.287
 for-0.287
TokenOne
Token ID3198
Feature activation+3.28e+00

Pos logit contributions
 Deal+0.178
 Ow+0.155
E+0.154
 Glass+0.153
Ta+0.152
Neg logit contributions
 for-0.159
 would-0.150
 in-0.149
 made-0.148
 and-0.147
Token day
Token ID1110
Feature activation+3.80e+00

Pos logit contributions
V+0.186
E+0.183
Look+0.174
Wow+0.164
O+0.159
Neg logit contributions
 for-0.051
 size-0.042
 in-0.041
 away-0.040
 left-0.040
Token,
Token ID11
Feature activation+3.08e+00

Pos logit contributions
V+0.276
E+0.269
Look+0.252
Kids+0.249
Mic+0.243
Neg logit contributions
 for-0.007
 enough-0.001
 in-0.001
,-0.000
 away+0.001
Token Sue
Token ID26089
Feature activation+1.60e-01

Pos logit contributions
V+0.103
E+0.095
Look+0.084
Kids+0.080
 Many+0.079
Neg logit contributions
 for-0.111
 enough-0.111
 away-0.110
 as-0.106
 size-0.105
Token were
Token ID547
Feature activation+2.82e+00

Pos logit contributions
V+0.100
E+0.095
+0.091
Look+0.090
 Many+0.088
Neg logit contributions
ier-0.037
 size-0.034
 away-0.033
 enough-0.033
 luck-0.031
Token very
Token ID845
Feature activation+2.84e+00

Pos logit contributions
V+0.155
E+0.152
Look+0.142
 Many+0.140
O+0.140
Neg logit contributions
 enough-0.040
 size-0.039
 for-0.039
 away-0.038
 falling-0.034
Token good
Token ID922
Feature activation+1.04e+00

Pos logit contributions
V+0.140
E+0.134
Look+0.130
Wow+0.125
 While+0.124
Neg logit contributions
 size-0.054
ier-0.053
 for-0.053
 away-0.052
ies-0.050
Token friends
Token ID2460
Feature activation+1.44e+00

Pos logit contributions
V+0.043
+0.042
 Many+0.039
E+0.039
 While+0.038
Neg logit contributions
 away-0.026
 size-0.026
ier-0.025
 perfectly-0.025
 wouldn-0.025
Token.
Token ID13
Feature activation+6.18e+00

Pos logit contributions
V+0.074
+0.073
E+0.070
Look+0.067
 While+0.065
Neg logit contributions
ier-0.025
 size-0.024
 coins-0.022
 airplane-0.022
 away-0.022
Token
Token ID198
Feature activation+9.56e+00

Pos logit contributions
E+0.221
Ta+0.193
V+0.188
Isa+0.185
Phill+0.182
Neg logit contributions
.-0.220
 and-0.219
 for-0.217
,-0.217
 like-0.203
Token
Token ID198
Feature activation+5.23e+00

Pos logit contributions
 Deal+0.399
Grab+0.390
Ta+0.375
Really+0.368
omm+0.362
Neg logit contributions
.-0.347
 and-0.317
 in-0.306
 for-0.297
,-0.292
TokenOne
Token ID3198
Feature activation+2.99e+00

Pos logit contributions
 Deal+0.178
E+0.171
 Glass+0.168
V+0.166
 Here+0.164
Neg logit contributions
 for-0.167
 made-0.164
 would-0.163
 like-0.158
 than-0.157
Token day
Token ID1110
Feature activation+3.68e+00

Pos logit contributions
V+0.158
E+0.155
Look+0.145
Wow+0.138
 Many+0.137
Neg logit contributions
 for-0.053
 size-0.049
 away-0.049
ier-0.045
 left-0.044
Token,
Token ID11
Feature activation+3.10e+00

Pos logit contributions
V+0.261
E+0.254
Look+0.238
 Many+0.231
Kids+0.230
Neg logit contributions
 for-0.010
 enough-0.006
 in-0.003
 away-0.003
 as-0.000
Token Sue
Token ID26089
Feature activation-1.57e-01

Pos logit contributions
V+0.106
E+0.098
Look+0.087
 Many+0.083
+0.082
Neg logit contributions
 for-0.109
 away-0.109
 enough-0.108
 size-0.105
 as-0.104
Token and
Token ID290
Feature activation+8.00e-01

Pos logit contributions
 appearance+0.038
ier+0.038
eking+0.038
 permission+0.037
 luck+0.037
Neg logit contributions
-0.050
M-0.043
 While-0.040
 -0.038
E-0.038
Token felt
Token ID2936
Feature activation+1.86e+00

Pos logit contributions
+0.033
V+0.033
Look+0.031
E+0.031
 While+0.030
Neg logit contributions
ier-0.024
ist-0.022
 away-0.021
 wand-0.021
 luck-0.020
Token very
Token ID845
Feature activation+1.80e+00

Pos logit contributions
V+0.085
E+0.082
+0.078
Look+0.076
 Many+0.076
Neg logit contributions
ier-0.044
 size-0.043
 away-0.040
 enough-0.040
 perfectly-0.038
Token happy
Token ID3772
Feature activation-3.61e-01

Pos logit contributions
V+0.077
E+0.074
+0.071
Look+0.068
 Many+0.068
Neg logit contributions
ier-0.046
 size-0.045
 enough-0.044
 away-0.044
 airplane-0.042
Token.
Token ID13
Feature activation+7.11e+00

Pos logit contributions
ener+0.013
ist+0.013
ience+0.013
 wouldn+0.012
et+0.012
Neg logit contributions
-0.014
 Now-0.010
 While-0.009
 The-0.009
 When-0.009
Token
Token ID198
Feature activation+9.54e+00

Pos logit contributions
Uh+0.299
Quick+0.298
Ar+0.296
Spe+0.291
AAAA+0.290
Neg logit contributions
.-0.204
 and-0.192
 to-0.187
,-0.186
 for-0.180
Token
Token ID198
Feature activation+4.75e+00

Pos logit contributions
 Deal+0.434
Grab+0.432
Ta+0.403
Uh+0.395
AAAA+0.382
Neg logit contributions
.-0.318
 and-0.287
 for-0.278
 in-0.277
 to-0.262
TokenBut
Token ID1537
Feature activation+1.39e+00

Pos logit contributions
 Deal+0.157
 Ow+0.146
ucks+0.134
 Glass+0.132
 Here+0.129
Neg logit contributions
 in-0.160
 when-0.158
 made-0.150
 even-0.146
 fl-0.145
Token then
Token ID788
Feature activation+1.83e+00

Pos logit contributions
V+0.064
E+0.060
+0.060
Look+0.058
 While+0.058
Neg logit contributions
ier-0.035
 size-0.030
 luck-0.029
iness-0.029
 wand-0.029
Token,
Token ID11
Feature activation+1.62e+00

Pos logit contributions
V+0.098
E+0.093
+0.091
Look+0.088
 Many+0.087
Neg logit contributions
ier-0.031
 size-0.028
 away-0.027
 perfectly-0.025
 enough-0.024
Token her
Token ID607
Feature activation+9.39e-01

Pos logit contributions
V+0.052
E+0.049
+0.048
Look+0.045
 While+0.044
Neg logit contributions
ier-0.063
 size-0.060
 away-0.057
 luck-0.056
ist-0.056
Token ran
Token ID4966
Feature activation+2.85e+00

Pos logit contributions
V+0.074
E+0.068
+0.066
Look+0.065
 While+0.063
Neg logit contributions
ier-0.042
 size-0.040
 away-0.038
 enough-0.038
 luck-0.037
Token to
Token ID284
Feature activation+3.16e+00

Pos logit contributions
V+0.170
E+0.167
Look+0.157
 Many+0.156
Wow+0.152
Neg logit contributions
 away-0.034
 for-0.027
 enough-0.025
ier-0.024
 perfectly-0.022
Token the
Token ID262
Feature activation+2.29e+00

Pos logit contributions
E+0.169
V+0.167
Wow+0.152
Kids+0.151
Look+0.150
Neg logit contributions
 away-0.058
 for-0.052
 ring-0.048
 enough-0.046
 falling-0.044
Token tree
Token ID5509
Feature activation+4.13e-01

Pos logit contributions
V+0.115
E+0.110
+0.103
Look+0.103
 Many+0.102
Neg logit contributions
ier-0.044
 size-0.043
 away-0.043
 enough-0.041
 wouldn-0.039
Token.
Token ID13
Feature activation+6.72e+00

Pos logit contributions
+0.019
 While+0.017
V+0.017
 On+0.016
 Later+0.016
Neg logit contributions
ier-0.012
 luck-0.010
iness-0.010
ils-0.009
 perfectly-0.009
Token
Token ID198
Feature activation+9.54e+00

Pos logit contributions
E+0.274
Uh+0.258
Ta+0.255
Isa+0.247
Ar+0.245
Neg logit contributions
 to-0.240
 and-0.221
,-0.216
.-0.207
 for-0.200
Token
Token ID198
Feature activation+4.81e+00

Pos logit contributions
 Deal+0.430
Really+0.429
Grab+0.413
Ta+0.403
Uh+0.399
Neg logit contributions
.-0.315
 and-0.311
 to-0.287
,-0.279
 for-0.269
Token"
Token ID1
Feature activation+5.43e+00

Pos logit contributions
 Deal+0.152
 Ow+0.137
 Next+0.134
ucks+0.132
 Some+0.129
Neg logit contributions
 in-0.158
 made-0.150
 when-0.149
 set-0.149
 bit-0.148
TokenHi
Token ID17250
Feature activation+3.17e+00

Pos logit contributions
 From+0.153
 Their+0.144
 Deal+0.144
azes+0.143
 Sometimes+0.141
Neg logit contributions
 set-0.200
 made-0.190
 bit-0.180
 spread-0.175
 kept-0.171
Token,
Token ID11
Feature activation+3.16e+00

Pos logit contributions
V+0.191
E+0.183
Look+0.179
 Many+0.174
 While+0.172
Neg logit contributions
 enough-0.035
 away-0.029
 for-0.029
 luck-0.025
 airplane-0.025
Token Tree
Token ID12200
Feature activation-3.04e-01

Pos logit contributions
V+0.146
E+0.138
Look+0.133
 While+0.129
 Many+0.125
Neg logit contributions
 for-0.075
 enough-0.070
 ring-0.067
 airplane-0.067
 away-0.067
Token it
Token ID340
Feature activation+1.30e+00

Pos logit contributions
V+0.099
E+0.094
+0.088
Look+0.088
 Many+0.086
Neg logit contributions
ier-0.055
 size-0.053
 away-0.053
 enough-0.052
 perfectly-0.050
Token to
Token ID284
Feature activation+2.91e+00

Pos logit contributions
V+0.062
+0.062
E+0.056
 Many+0.056
 While+0.056
Neg logit contributions
ier-0.029
 size-0.026
ist-0.024
 airplane-0.024
 perfectly-0.024
Token his
Token ID465
Feature activation+1.42e+00

Pos logit contributions
V+0.147
E+0.144
Look+0.132
Wow+0.129
O+0.127
Neg logit contributions
 away-0.057
 for-0.056
 size-0.053
 crown-0.051
 enough-0.050
Token mom
Token ID1995
Feature activation+2.27e-01

Pos logit contributions
V+0.056
+0.054
E+0.051
Look+0.049
Mom+0.049
Neg logit contributions
ier-0.044
 wouldn-0.039
et-0.038
 away-0.038
ist-0.038
Token.
Token ID13
Feature activation+6.23e+00

Pos logit contributions
+0.010
 The+0.008
 +0.008
V+0.008
 One+0.008
Neg logit contributions
ier-0.007
 powers-0.007
 perfectly-0.007
 size-0.007
 wand-0.006
Token
Token ID198
Feature activation+9.54e+00

Pos logit contributions
Ta+0.267
V+0.257
Ba+0.255
 Deal+0.255
E+0.252
Neg logit contributions
,-0.229
 to-0.225
.-0.215
 and-0.214
 in-0.179
Token
Token ID198
Feature activation+4.82e+00

Pos logit contributions
 Deal+0.412
Ta+0.401
Really+0.399
Grab+0.393
Uh+0.377
Neg logit contributions
.-0.344
 and-0.317
 to-0.301
 in-0.298
,-0.294
TokenTim
Token ID14967
Feature activation+1.19e+00

Pos logit contributions
 Deal+0.161
V+0.131
Ta+0.121
omm+0.120
E+0.119
Neg logit contributions
 made-0.197
 in-0.187
 bit-0.183
 and-0.177
 left-0.176
Token said
Token ID531
Feature activation+1.69e+00

Pos logit contributions
V+0.048
+0.046
E+0.042
 While+0.042
Look+0.041
Neg logit contributions
ier-0.040
 luck-0.035
 size-0.035
 perfectly-0.035
iness-0.034
Token,
Token ID11
Feature activation+2.58e+00

Pos logit contributions
V+0.090
E+0.085
+0.085
 Many+0.082
Look+0.081
Neg logit contributions
ier-0.027
 size-0.027
 away-0.024
 perfectly-0.023
 airplane-0.023
Token "
Token ID366
Feature activation+4.95e+00

Pos logit contributions
V+0.079
E+0.074
Look+0.066
 Many+0.064
+0.063
Neg logit contributions
ier-0.099
 away-0.097
 size-0.096
 enough-0.096
 for-0.096
Token in
Token ID287
Feature activation+3.91e+00

Pos logit contributions
+0.061
V+0.060
E+0.056
 While+0.055
Look+0.054
Neg logit contributions
ier-0.024
 luck-0.020
ener-0.020
 wouldn-0.020
 coins-0.019
Token the
Token ID262
Feature activation+3.79e+00

Pos logit contributions
E+0.236
V+0.236
Finally+0.218
Pe+0.218
 While+0.217
Neg logit contributions
 away-0.037
 in-0.033
 for-0.030
 size-0.030
 more-0.029
Token warm
Token ID5814
Feature activation+2.46e+00

Pos logit contributions
E+0.197
V+0.196
 Many+0.176
 While+0.171
 Deal+0.170
Neg logit contributions
 away-0.072
 left-0.058
 for-0.056
 luck-0.055
 size-0.055
Token sun
Token ID4252
Feature activation+2.01e-01

Pos logit contributions
V+0.114
E+0.108
Look+0.100
+0.098
 Many+0.098
Neg logit contributions
 away-0.057
 size-0.057
ier-0.057
 enough-0.054
 for-0.052
Token.
Token ID13
Feature activation+6.53e+00

Pos logit contributions
+0.009
V+0.008
Look+0.008
 While+0.008
 The+0.007
Neg logit contributions
ier-0.005
 luck-0.005
 powers-0.005
 coins-0.005
ener-0.005
Token
Token ID198
Feature activation+9.54e+00

Pos logit contributions
Ta+0.249
E+0.247
Pe+0.232
Phill+0.223
O+0.219
Neg logit contributions
 for-0.211
,-0.211
.-0.209
 and-0.207
 but-0.181
Token
Token ID198
Feature activation+5.29e+00

Pos logit contributions
 Deal+0.427
Ta+0.411
Grab+0.411
Really+0.370
Ha+0.355
Neg logit contributions
.-0.338
 and-0.303
 in-0.291
,-0.283
 to-0.282
TokenOne
Token ID3198
Feature activation+3.78e+00

Pos logit contributions
 Deal+0.212
Ta+0.195
 Ow+0.181
 Here+0.181
E+0.180
Neg logit contributions
 would-0.156
 made-0.146
 for-0.144
 game-0.142
 and-0.140
Token day
Token ID1110
Feature activation+4.11e+00

Pos logit contributions
V+0.228
E+0.225
Look+0.210
 Deal+0.200
Wow+0.198
Neg logit contributions
 for-0.053
 in-0.042
 left-0.041
 away-0.039
 size-0.036
Token,
Token ID11
Feature activation+3.47e+00

Pos logit contributions
V+0.307
E+0.304
Look+0.283
Kids+0.279
Ta+0.275
Neg logit contributions
,-0.000
 for+0.000
 in+0.005
 as+0.010
 enough+0.011
Token Max
Token ID5436
Feature activation+1.60e+00

Pos logit contributions
V+0.125
E+0.121
Kids+0.105
Look+0.105
 Many+0.100
Neg logit contributions
 for-0.118
 as-0.117
 in-0.113
 away-0.110
 enough-0.109
Token went
Token ID1816
Feature activation+2.07e+00

Pos logit contributions
V+0.071
E+0.067
+0.064
Look+0.063
 While+0.061
Neg logit contributions
ier-0.040
 size-0.038
 enough-0.037
 away-0.036
 luck-0.035
Token inside
Token ID2641
Feature activation+2.09e+00

Pos logit contributions
V+0.118
E+0.113
+0.108
Look+0.107
 Many+0.106
Neg logit contributions
ier-0.026
 size-0.024
 away-0.023
 enough-0.023
 perfectly-0.021
Token the
Token ID262
Feature activation+2.96e+00

Pos logit contributions
V+0.108
E+0.104
+0.101
Look+0.098
 Many+0.097
Neg logit contributions
ier-0.037
 size-0.035
 away-0.033
 enough-0.033
 perfectly-0.031
Token house
Token ID2156
Feature activation-9.08e-01

Pos logit contributions
V+0.173
E+0.166
O+0.153
 Many+0.152
Wow+0.152
Neg logit contributions
 away-0.042
 size-0.038
 for-0.035
 enough-0.033
 airplane-0.032
Token.
Token ID13
Feature activation+7.35e+00

Pos logit contributions
ier+0.035
ience+0.034
 luck+0.034
ache+0.033
iness+0.033
Neg logit contributions
-0.036
 The-0.027
 While-0.026
 -0.025
 On-0.025
Token
Token ID198
Feature activation+9.54e+00

Pos logit contributions
Mic+0.346
E+0.318
Red+0.301
Mad+0.298
Twe+0.297
Neg logit contributions
 to-0.251
,-0.241
.-0.240
 and-0.228
 in-0.218
Token
Token ID198
Feature activation+6.06e+00

Pos logit contributions
 Deal+0.416
Really+0.406
omm+0.386
Grab+0.378
V+0.371
Neg logit contributions
.-0.334
 and-0.331
 in-0.313
 to-0.301
,-0.296
TokenIn
Token ID818
Feature activation+2.18e+00

Pos logit contributions
 Deal+0.180
omm+0.161
innie+0.157
ollo+0.154
Mic+0.153
Neg logit contributions
 made-0.243
 in-0.231
 like-0.225
 ever-0.214
 bit-0.212
Token the
Token ID262
Feature activation+2.66e+00

Pos logit contributions
V+0.101
E+0.097
+0.091
Look+0.090
 Many+0.089
Neg logit contributions
ier-0.051
 size-0.049
 away-0.049
 enough-0.048
 perfectly-0.045
Token house
Token ID2156
Feature activation+7.72e-01

Pos logit contributions
V+0.136
E+0.133
 Many+0.118
O+0.118
Look+0.118
Neg logit contributions
 away-0.053
 size-0.051
 enough-0.046
 for-0.046
 falling-0.045
Token,
Token ID11
Feature activation+2.20e+00

Pos logit contributions
+0.035
V+0.033
 Many+0.031
 While+0.031
Look+0.030
Neg logit contributions
ier-0.020
 wand-0.018
 airplane-0.018
ache-0.018
 powers-0.018
Token He
Token ID679
Feature activation+2.92e+00

Pos logit contributions
Kids+0.232
Ba+0.230
Gr+0.229
E+0.226
 Deal+0.218
Neg logit contributions
 and-0.171
,-0.170
.-0.168
 to-0.167
 as-0.154
Token was
Token ID373
Feature activation+2.01e+00

Pos logit contributions
V+0.149
E+0.144
 Many+0.137
Look+0.136
+0.134
Neg logit contributions
 size-0.049
 wouldn-0.048
 away-0.045
 kept-0.044
 perfectly-0.044
Token so
Token ID523
Feature activation+3.44e+00

Pos logit contributions
V+0.104
E+0.098
+0.094
Look+0.093
 Many+0.092
Neg logit contributions
ier-0.038
 size-0.035
 away-0.034
 enough-0.033
 wouldn-0.031
Token happy
Token ID3772
Feature activation+4.28e-01

Pos logit contributions
V+0.191
E+0.183
Kids+0.179
Look+0.177
Wow+0.170
Neg logit contributions
 for-0.047
 in-0.044
ier-0.043
 perfectly-0.043
 size-0.043
Token.
Token ID13
Feature activation+6.30e+00

Pos logit contributions
+0.020
 While+0.016
 Now+0.015
+0.015
 The+0.015
Neg logit contributions
ier-0.012
 wand-0.012
ener-0.011
iness-0.011
ist-0.011
Token
Token ID198
Feature activation+9.53e+00

Pos logit contributions
Ta+0.277
Ba+0.261
E+0.259
 Deal+0.256
Elsa+0.250
Neg logit contributions
.-0.190
,-0.190
 and-0.185
 to-0.174
 with-0.159
Token
Token ID198
Feature activation+4.65e+00

Pos logit contributions
 Deal+0.437
Ta+0.417
Grab+0.410
Really+0.402
Uh+0.384
Neg logit contributions
.-0.326
 and-0.306
 in-0.279
 to-0.276
,-0.275
TokenTim
Token ID14967
Feature activation+1.43e+00

Pos logit contributions
 Deal+0.175
omm+0.135
 Ow+0.133
Ta+0.131
Isa+0.125
Neg logit contributions
 in-0.163
 made-0.163
 and-0.157
!-0.153
 sound-0.153
Token took
Token ID1718
Feature activation+3.02e+00

Pos logit contributions
V+0.062
+0.058
E+0.056
 While+0.055
Look+0.054
Neg logit contributions
ier-0.042
 luck-0.037
 size-0.037
 perfectly-0.036
 enough-0.035
Token the
Token ID262
Feature activation+2.92e+00

Pos logit contributions
V+0.155
E+0.155
Look+0.145
Kids+0.143
Wow+0.140
Neg logit contributions
 away-0.058
 enough-0.055
 for-0.050
 coins-0.048
ier-0.048
Token tape
Token ID9154
Feature activation+1.46e-01

Pos logit contributions
E+0.151
V+0.149
Kids+0.138
Look+0.137
 Many+0.135
Neg logit contributions
 away-0.049
 size-0.048
 for-0.048
ier-0.048
 enough-0.045
Token so
Token ID523
Feature activation-4.67e-02

Pos logit contributions
ener+0.110
sters+0.103
iness+0.102
lers+0.100
ome+0.099
Neg logit contributions
-0.093
V-0.074
Mom-0.073
 -0.069
 The-0.068
Token loud
Token ID7812
Feature activation-3.84e+00

Pos logit contributions
ener+0.002
ier+0.002
 away+0.002
et+0.002
 size+0.002
Neg logit contributions
-0.002
E-0.002
V-0.002
 Many-0.001
L-0.001
Token with
Token ID351
Feature activation-2.78e+00

Pos logit contributions
 permission+0.148
 collection+0.147
 size+0.143
 crown+0.141
 footing+0.139
Neg logit contributions
-0.146
 Now-0.139
 It-0.123
V-0.112
 But-0.110
Token all
Token ID477
Feature activation-1.79e+00

Pos logit contributions
ier+0.131
castle+0.125
ener+0.123
ache+0.119
comb+0.119
Neg logit contributions
-0.083
E-0.072
V-0.069
 Many-0.067
 The-0.067
Token the
Token ID262
Feature activation-1.37e+00

Pos logit contributions
ache+0.084
ener+0.082
ier+0.082
comb+0.080
 wouldn+0.075
Neg logit contributions
-0.053
 The-0.044
 Now-0.043
 Everyone-0.041
 -0.040
Token furniture
Token ID13091
Feature activation-9.39e+00

Pos logit contributions
ier+0.057
castle+0.055
ache+0.053
 able+0.053
owing+0.053
Neg logit contributions
V-0.043
-0.043
E-0.041
 Many-0.037
L-0.036
Token talking
Token ID3375
Feature activation-5.78e+00

Pos logit contributions
urger+0.314
ener+0.308
 luck+0.304
ishing+0.300
rier+0.299
Neg logit contributions
-0.297
 It-0.230
 They-0.220
 She-0.218
 The-0.208
Token,
Token ID11
Feature activation-3.72e+00

Pos logit contributions
eking+0.200
eling+0.198
ener+0.193
cker+0.189
 size+0.188
Neg logit contributions
-0.226
 It-0.192
 Everyone-0.191
 They-0.178
 She-0.175
Token it
Token ID340
Feature activation-4.55e+00

Pos logit contributions
ier+0.162
ils+0.152
ener+0.146
lers+0.143
 problems+0.141
Neg logit contributions
-0.144
 It-0.130
 Now-0.120
 The-0.115
It-0.111
Token was
Token ID373
Feature activation-2.86e+00

Pos logit contributions
 permission+0.241
ier+0.227
 coins+0.225
rier+0.221
 luck+0.220
Neg logit contributions
-0.111
V-0.081
L-0.079
It-0.075
 It-0.070
Token hard
Token ID1327
Feature activation-4.65e+00

Pos logit contributions
ener+0.128
sters+0.125
ly+0.122
ome+0.120
iness+0.120
Neg logit contributions
-0.099
V-0.075
 The-0.074
 In-0.071
 When-0.070
Token could
Token ID714
Feature activation-9.93e-01

Pos logit contributions
 amount+0.204
 size+0.203
 recipe+0.203
rier+0.201
 haircut+0.199
Neg logit contributions
-0.067
 It-0.037
 -0.035
 They-0.035
 could-0.034
Token hide
Token ID7808
Feature activation-3.59e+00

Pos logit contributions
 dollars+0.039
ier+0.038
 compared+0.036
ening+0.036
ener+0.036
Neg logit contributions
-0.038
E-0.029
V-0.029
Mom-0.028
 Many-0.028
Token in
Token ID287
Feature activation-2.65e+00

Pos logit contributions
ache+0.129
 wedding+0.129
 ceremony+0.128
 mustache+0.128
 performance+0.122
Neg logit contributions
-0.114
E-0.103
L-0.092
Tim-0.085
 The-0.084
Token the
Token ID262
Feature activation-6.05e-01

Pos logit contributions
ache+0.150
ier+0.150
 wouldn+0.138
ly+0.135
ener+0.134
Neg logit contributions
-0.073
 The-0.045
E-0.043
 Sometimes-0.042
 -0.040
Token little
Token ID1310
Feature activation-2.37e+00

Pos logit contributions
ier+0.030
ist+0.028
iness+0.027
 wouldn+0.027
 able+0.026
Neg logit contributions
-0.017
Look-0.017
-0.015
Wow-0.014
E-0.014
Token cracks
Token ID23217
Feature activation-9.36e+00

Pos logit contributions
ier+0.111
 wouldn+0.109
ps+0.105
iness+0.101
eking+0.099
Neg logit contributions
-0.065
-0.054
Wow-0.052
 The-0.052
Look-0.048
Token and
Token ID290
Feature activation-4.05e+00

Pos logit contributions
 groom+0.381
ache+0.376
urger+0.372
ener+0.372
 appearance+0.372
Neg logit contributions
-0.250
 The-0.152
 They-0.145
The-0.126
 It-0.122
Token cre
Token ID1126
Feature activation+1.45e+00

Pos logit contributions
ier+0.181
iness+0.155
ler+0.142
eness+0.142
et+0.142
Neg logit contributions
-0.136
 -0.104
 The-0.102
What-0.102
Look-0.101
Tokenv
Token ID85
Feature activation-7.62e-01

Pos logit contributions
V+0.051
E+0.049
Look+0.046
+0.044
 Many+0.043
Neg logit contributions
 size-0.047
 wouldn-0.046
ier-0.046
 wand-0.046
 airplane-0.046
Tokenices
Token ID1063
Feature activation-8.41e+00

Pos logit contributions
 ring+0.031
 airplane+0.030
 ringing+0.030
 bothering+0.030
 compared+0.030
Neg logit contributions
-0.022
Jen-0.018
M-0.018
E-0.018
 Everyone-0.017
Token of
Token ID286
Feature activation-2.29e+00

Pos logit contributions
 crown+0.318
 forth+0.315
 veil+0.312
 pearl+0.307
 breaths+0.306
Neg logit contributions
-0.282
 They-0.188
 -0.176
 It-0.170
 The-0.169
Token for
Token ID329
Feature activation-2.08e+00

Pos logit contributions
ier+0.006
ener+0.006
 wouldn+0.005
ache+0.005
iness+0.005
Neg logit contributions
-0.005
 Now-0.005
V-0.004
Look-0.004
 The-0.004
Token getting
Token ID1972
Feature activation-3.15e+00

Pos logit contributions
ener+0.092
ist+0.092
iness+0.091
ier+0.091
 wouldn+0.088
Neg logit contributions
-0.072
 Now-0.054
E-0.054
Tom-0.053
Tim-0.053
Token the
Token ID262
Feature activation-4.75e+00

Pos logit contributions
comb+0.131
itive+0.130
ist+0.128
bee+0.128
iness+0.127
Neg logit contributions
-0.102
 late-0.083
-0.079
Mom-0.077
 The-0.073
Token wild
Token ID4295
Feature activation-5.61e+00

Pos logit contributions
ils+0.175
es+0.164
 imagined+0.159
ier+0.159
owed+0.158
Neg logit contributions
-0.140
Look-0.129
-0.118
E-0.114
Mom-0.110
Token new
Token ID649
Feature activation-5.57e+00

Pos logit contributions
cher+0.218
eking+0.217
ier+0.215
 able+0.210
ils+0.204
Neg logit contributions
-0.178
-0.150
 The-0.133
 Sometimes-0.124
 They-0.122
Token carpet
Token ID20710
Feature activation-9.33e+00

Pos logit contributions
ier+0.234
eking+0.222
ils+0.209
led+0.203
 able+0.196
Neg logit contributions
-0.178
-0.133
Look-0.129
 Whe-0.121
Tim-0.119
Token.
Token ID13
Feature activation+2.78e+00

Pos logit contributions
 appearance+0.368
ening+0.368
eking+0.364
 worries+0.364
eling+0.362
Neg logit contributions
-0.267
 She-0.156
 The-0.149
-0.139
 They-0.135
Token<|endoftext|>
Token ID50256
Feature activation+4.82e+00

Pos logit contributions
V+0.037
E+0.030
Look+0.022
+0.020
 Many+0.019
Neg logit contributions
ier-0.152
 size-0.151
 away-0.151
 enough-0.150
 wouldn-0.148
TokenOnce
Token ID7454
Feature activation+3.88e+00

Pos logit contributions
V+0.086
E+0.083
 Many+0.069
 While+0.061
Br+0.061
Neg logit contributions
 in-0.237
 on-0.235
 away-0.234
!-0.228
 for-0.226
Token there
Token ID612
Feature activation+2.15e+00

Pos logit contributions
V+0.185
E+0.174
Look+0.165
 Many+0.157
Wow+0.155
Neg logit contributions
 for-0.099
 away-0.091
ier-0.091
 enough-0.090
ies-0.088
Token was
Token ID373
Feature activation+3.52e+00

Pos logit contributions
V+0.099
E+0.094
+0.090
Look+0.088
 Many+0.086
Neg logit contributions
ier-0.051
 size-0.049
 away-0.048
 enough-0.047
 luck-0.045
Token one
Token ID530
Feature activation-4.57e+00

Pos logit contributions
ier+0.141
ache+0.128
ets+0.124
 wouldn+0.124
 collection+0.123
Neg logit contributions
-0.086
 The-0.068
E-0.060
 -0.060
 It-0.058
Token of
Token ID286
Feature activation-1.97e+00

Pos logit contributions
ils+0.198
bags+0.183
ier+0.176
ps+0.169
ts+0.169
Neg logit contributions
-0.150
Mom-0.107
<|endoftext|>-0.106
 Mom-0.099
M-0.096
Token the
Token ID262
Feature activation-5.74e+00

Pos logit contributions
ier+0.095
ache+0.091
 wouldn+0.090
 compared+0.087
ies+0.086
Neg logit contributions
-0.061
Look-0.047
V-0.045
 -0.045
E-0.044
Token p
Token ID279
Feature activation-1.11e+00

Pos logit contributions
ier+0.209
 able+0.207
 wouldn+0.194
ils+0.190
iness+0.188
Neg logit contributions
-0.193
Mom-0.176
Look-0.174
E-0.169
V-0.158
Tokenudd
Token ID4185
Feature activation-1.92e+00

Pos logit contributions
 happen+0.053
 monsters+0.050
 battle+0.050
 bothering+0.049
 size+0.049
Neg logit contributions
-0.022
E-0.020
 Later-0.020
Jen-0.020
V-0.019
Tokenles
Token ID829
Feature activation-9.31e+00

Pos logit contributions
 size+0.074
 prisoner+0.074
 lair+0.074
boys+0.073
 hunt+0.073
Neg logit contributions
 Now-0.047
E-0.047
-0.041
 Everyone-0.040
Kids-0.039
Token and
Token ID290
Feature activation-4.65e+00

Pos logit contributions
 luck+0.336
 property+0.335
 choices+0.332
 appearance+0.331
 amount+0.330
Neg logit contributions
-0.332
 -0.236
 The-0.207
 She-0.201
 It-0.190
Token watched
Token ID7342
Feature activation-2.73e+00

Pos logit contributions
ier+0.205
iness+0.187
 anymore+0.178
owing+0.174
eness+0.168
Neg logit contributions
-0.154
 -0.132
 The-0.131
Look-0.129
 Then-0.128
Token as
Token ID355
Feature activation-2.70e+00

Pos logit contributions
 property+0.113
 stolen+0.109
ache+0.109
 size+0.107
 stash+0.107
Neg logit contributions
-0.096
 The-0.083
 Everyone-0.076
 It-0.074
 When-0.066
Token the
Token ID262
Feature activation-4.25e+00

Pos logit contributions
ode+0.129
et+0.128
ets+0.127
ils+0.126
lers+0.125
Neg logit contributions
-0.079
 The-0.060
V-0.059
E-0.055
 Mom-0.053
Token rain
Token ID6290
Feature activation-4.60e+00

Pos logit contributions
 able+0.164
ier+0.158
 enough+0.149
 wouldn+0.148
 anywhere+0.146
Neg logit contributions
E-0.131
V-0.130
Mom-0.128
-0.125
Look-0.123
Token all
Token ID477
Feature activation-3.97e+00

Pos logit contributions
rying+0.203
owing+0.202
ier+0.200
ist+0.198
ener+0.197
Neg logit contributions
-0.109
Tom-0.087
 Tim-0.084
 It-0.082
 Tom-0.081
Token of
Token ID286
Feature activation-1.53e+00

Pos logit contributions
comb+0.198
 wand+0.183
ache+0.182
ppers+0.178
eking+0.177
Neg logit contributions
-0.096
 The-0.087
 Everyone-0.076
 It-0.072
 They-0.072
Token the
Token ID262
Feature activation-2.42e+00

Pos logit contributions
ener+0.072
ier+0.072
 wouldn+0.069
ache+0.067
 wand+0.065
Neg logit contributions
-0.044
E-0.039
Tim-0.034
 Many-0.034
Tom-0.031
Token family
Token ID1641
Feature activation-6.13e+00

Pos logit contributions
ier+0.106
owed+0.105
 able+0.104
ring+0.104
comb+0.101
Neg logit contributions
Look-0.062
E-0.061
-0.056
Mom-0.056
Wow-0.056
Token's
Token ID338
Feature activation-4.10e+00

Pos logit contributions
rier+0.270
eking+0.258
 potion+0.250
 breaths+0.246
rying+0.245
Neg logit contributions
-0.192
 The-0.120
 -0.118
 Now-0.112
 Everyone-0.105
Token possessions
Token ID23309
Feature activation-9.30e+00

Pos logit contributions
ier+0.172
ist+0.148
cher+0.145
pan+0.144
fish+0.143
Neg logit contributions
-0.132
 Many-0.096
<|endoftext|>-0.096
 Now-0.093
 From-0.092
Token had
Token ID550
Feature activation-5.37e+00

Pos logit contributions
ener+0.435
ishing+0.405
urger+0.403
rying+0.402
enery+0.395
Neg logit contributions
-0.292
 It-0.181
 -0.171
 The-0.161
 They-0.160
Token been
Token ID587
Feature activation-4.19e+00

Pos logit contributions
 forth+0.244
rier+0.231
ener+0.230
ish+0.216
ci+0.210
Neg logit contributions
-0.172
 In-0.108
In-0.106
Tom-0.104
A-0.102
Token taken
Token ID2077
Feature activation-3.82e+00

Pos logit contributions
comb+0.144
iness+0.139
als+0.138
 forth+0.137
 angel+0.135
Neg logit contributions
-0.148
-0.102
L-0.098
A-0.096
 -0.094
Token away
Token ID1497
Feature activation-6.33e+00

Pos logit contributions
castle+0.160
 veil+0.144
 airplane+0.140
 crown+0.135
fish+0.134
Neg logit contributions
-0.153
 In-0.113
 -0.112
 The-0.112
 It-0.107
Token,
Token ID11
Feature activation-4.75e+00

Pos logit contributions
 airplane+0.283
 veil+0.282
 comb+0.281
 crown+0.280
 cape+0.277
Neg logit contributions
-0.194
 They-0.128
 The-0.112
 He-0.096
 In-0.092
Token in
Token ID287
Feature activation-2.99e+00

Pos logit contributions
eking+0.251
rying+0.245
 size+0.243
 footing+0.241
 grip+0.238
Neg logit contributions
-0.220
 The-0.191
 Everyone-0.184
 From-0.169
 It-0.166
Token its
Token ID663
Feature activation-1.53e+00

Pos logit contributions
drops+0.129
ier+0.128
ache+0.126
ies+0.124
led+0.122
Neg logit contributions
-0.118
 The-0.088
 -0.083
 Now-0.081
 It-0.080
Token ears
Token ID11368
Feature activation-7.80e+00

Pos logit contributions
ier+0.075
ache+0.066
iness+0.061
fish+0.060
 kernels+0.060
Neg logit contributions
-0.048
 The-0.041
 -0.039
 From-0.038
 Everyone-0.036
Token,
Token ID11
Feature activation-3.69e+00

Pos logit contributions
ils+0.321
ishes+0.298
 thieves+0.292
 crown+0.290
 wounds+0.289
Neg logit contributions
-0.290
 -0.220
 The-0.210
 It-0.205
 She-0.203
Token the
Token ID262
Feature activation-9.82e-01

Pos logit contributions
ier+0.184
ils+0.175
ache+0.174
et+0.169
le+0.168
Neg logit contributions
-0.114
 The-0.088
 It-0.083
 -0.082
 Everyone-0.074
Token switch
Token ID5078
Feature activation-9.28e+00

Pos logit contributions
ier+0.039
 enough+0.038
i+0.037
 able+0.037
ils+0.036
Neg logit contributions
-0.034
V-0.033
Look-0.032
Wow-0.030
Mom-0.030
Token knows
Token ID4206
Feature activation-4.91e+00

Pos logit contributions
 thieves+0.333
rying+0.333
urger+0.330
 oats+0.324
 pounds+0.324
Neg logit contributions
-0.310
 It-0.235
 The-0.233
 -0.227
 She-0.214
Token that
Token ID326
Feature activation-4.06e+00

Pos logit contributions
ener+0.204
drops+0.198
ited+0.197
 breaths+0.188
 aroma+0.186
Neg logit contributions
-0.191
 It-0.125
 The-0.123
 -0.121
 From-0.120
Token it
Token ID340
Feature activation-3.76e+00

Pos logit contributions
ier+0.174
ener+0.174
ache+0.171
ly+0.166
drops+0.166
Neg logit contributions
-0.140
 -0.101
 It-0.097
 The-0.094
 Next-0.094
Token will
Token ID481
Feature activation-2.84e+00

Pos logit contributions
rier+0.172
 unatt+0.167
 grip+0.165
 injuries+0.164
 footing+0.162
Neg logit contributions
-0.114
-0.077
 Next-0.075
 It-0.074
 From-0.074
Token never
Token ID1239
Feature activation-3.05e+00

Pos logit contributions
lers+0.136
led+0.120
ishes+0.120
 dollars+0.119
ener+0.118
Neg logit contributions
-0.102
One-0.073
The-0.069
Even-0.068
It-0.068
Token they
Token ID484
Feature activation-2.81e+00

Pos logit contributions
ier+0.137
ishing+0.135
ache+0.133
eking+0.130
ener+0.128
Neg logit contributions
-0.069
 The-0.047
 -0.044
 It-0.043
 Everyone-0.042
Token could
Token ID714
Feature activation-2.06e+00

Pos logit contributions
 size+0.124
 decisions+0.119
 inches+0.119
 dollars+0.118
 coins+0.117
Neg logit contributions
-0.080
Wow-0.064
It-0.061
 -0.060
Look-0.058
Token now
Token ID783
Feature activation-3.65e+00

Pos logit contributions
ier+0.096
 dollars+0.096
ener+0.096
 compared+0.094
 pounds+0.094
Neg logit contributions
-0.073
Her-0.050
Mom-0.050
 The-0.049
 Her-0.048
Token use
Token ID779
Feature activation-4.83e+00

Pos logit contributions
 coins+0.157
ener+0.156
isher+0.153
ier+0.149
cards+0.148
Neg logit contributions
-0.130
 The-0.089
 It-0.087
 Everyone-0.084
 -0.083
Token the
Token ID262
Feature activation-4.92e+00

Pos logit contributions
ier+0.248
castle+0.235
ener+0.233
rated+0.224
rier+0.223
Neg logit contributions
-0.141
 The-0.101
 -0.101
 It-0.092
 Lily-0.074
Token glue
Token ID22749
Feature activation-9.25e+00

Pos logit contributions
ier+0.202
 able+0.186
ils+0.179
rying+0.171
led+0.170
Neg logit contributions
-0.145
Look-0.134
<-0.126
<|endoftext|>-0.125
Here-0.119
Token.
Token ID13
Feature activation+3.44e+00

Pos logit contributions
eling+0.344
eking+0.330
 centimeters+0.319
 inches+0.317
 aroma+0.312
Neg logit contributions
-0.282
 -0.185
 They-0.169
 The-0.152
 She-0.149
Token The
Token ID383
Feature activation-2.34e+00

Pos logit contributions
V+0.054
E+0.048
Look+0.034
O+0.033
Wow+0.028
Neg logit contributions
 enough-0.175
 for-0.174
 away-0.173
 wouldn-0.172
 size-0.172
Token man
Token ID582
Feature activation-4.66e+00

Pos logit contributions
ier+0.103
 able+0.097
rying+0.095
 enough+0.094
owing+0.093
Neg logit contributions
Mom-0.072
-0.069
V-0.066
Look-0.064
<|endoftext|>-0.060
Token was
Token ID373
Feature activation-1.54e+00

Pos logit contributions
 attention+0.234
rier+0.230
ier+0.225
 anywhere+0.220
 situations+0.218
Neg logit contributions
-0.143
 From-0.108
 The-0.098
 -0.087
 Now-0.077
Token happy
Token ID3772
Feature activation-2.02e+00

Pos logit contributions
ly+0.071
iness+0.070
ener+0.068
ies+0.068
lers+0.066
Neg logit contributions
-0.051
 -0.039
 The-0.037
V-0.035
Mom-0.034
Token given
Token ID1813
Feature activation-1.92e+00

Pos logit contributions
rier+0.176
 forth+0.174
ener+0.171
work+0.169
ishly+0.169
Neg logit contributions
-0.137
A-0.101
-0.090
 The-0.087
 It-0.083
Token you
Token ID345
Feature activation-5.19e+00

Pos logit contributions
castle+0.079
ier+0.078
pan+0.077
iness+0.076
 size+0.076
Neg logit contributions
-0.064
E-0.056
V-0.053
 The-0.048
O-0.048
Token.
Token ID13
Feature activation+3.19e+00

Pos logit contributions
ier+0.243
drops+0.230
 forth+0.225
castle+0.225
iness+0.221
Neg logit contributions
-0.158
 The-0.123
A-0.100
-0.100
 A-0.097
Token Feel
Token ID18571
Feature activation-3.12e+00

Pos logit contributions
V+0.072
E+0.064
Look+0.057
Wow+0.051
O+0.051
Neg logit contributions
 enough-0.143
ier-0.141
 away-0.139
 size-0.138
 for-0.137
Token the
Token ID262
Feature activation-3.25e+00

Pos logit contributions
castle+0.134
pit+0.116
work+0.114
ies+0.112
 creations+0.111
Neg logit contributions
-0.124
 -0.097
 The-0.090
 Now-0.084
 From-0.083
Token air
Token ID1633
Feature activation-9.23e+00

Pos logit contributions
led+0.127
ils+0.125
cases+0.121
rying+0.115
ier+0.111
Neg logit contributions
-0.092
Look-0.089
V-0.077
Mom-0.076
E-0.076
Token and
Token ID290
Feature activation-4.80e+00

Pos logit contributions
ils+0.265
 coins+0.257
 artifacts+0.249
 jewels+0.236
 treasures+0.236
Neg logit contributions
-0.329
 She-0.258
 The-0.254
 -0.234
 It-0.232
Token let
Token ID1309
Feature activation-5.03e-01

Pos logit contributions
ier+0.197
rier+0.172
ener+0.171
ist+0.169
owing+0.162
Neg logit contributions
-0.169
 The-0.138
 Now-0.138
 It-0.133
 From-0.127
Token it
Token ID340
Feature activation-4.53e+00

Pos logit contributions
 size+0.019
ier+0.018
work+0.018
 compared+0.018
 wand+0.017
Neg logit contributions
-0.019
V-0.018
E-0.017
O-0.016
 Many-0.015
Token take
Token ID1011
Feature activation-1.88e+00

Pos logit contributions
 compared+0.188
pled+0.182
 thieves+0.181
 dollars+0.178
ening+0.177
Neg logit contributions
-0.158
 It-0.109
 The-0.107
 Everyone-0.104
 -0.102
Token you
Token ID345
Feature activation-5.04e+00

Pos logit contributions
castle+0.080
ier+0.074
 size+0.071
rying+0.070
flies+0.070
Neg logit contributions
-0.071
E-0.064
V-0.058
 Now-0.058
 The-0.056
Token something
Token ID1223
Feature activation-5.08e+00

Pos logit contributions
ier+0.161
iness+0.131
ious+0.128
 compared+0.126
ener+0.126
Neg logit contributions
-0.097
 Many-0.078
 The-0.076
 -0.072
 Everyone-0.069
Token else
Token ID2073
Feature activation-6.42e+00

Pos logit contributions
ener+0.235
stick+0.222
ps+0.221
ring+0.218
ked+0.216
Neg logit contributions
-0.173
<|endoftext|>-0.106
-0.102
 -0.097
It-0.091
Token -
Token ID532
Feature activation-4.35e+00

Pos logit contributions
ier+0.251
ps+0.243
eness+0.241
ist+0.239
 breaths+0.237
Neg logit contributions
-0.248
 -0.167
-0.161
 The-0.144
 They-0.143
Token a
Token ID257
Feature activation-4.04e+00

Pos logit contributions
ier+0.239
ishing+0.221
et+0.214
iness+0.213
ist+0.213
Neg logit contributions
-0.138
 It-0.111
 The-0.102
 -0.094
 What-0.087
Token beautiful
Token ID4950
Feature activation-4.69e+00

Pos logit contributions
ier+0.183
led+0.169
ils+0.169
ps+0.166
iness+0.161
Neg logit contributions
-0.127
Look-0.122
E-0.117
V-0.116
O-0.110
Token wand
Token ID11569
Feature activation-9.21e+00

Pos logit contributions
ier+0.216
ils+0.200
ps+0.189
ried+0.186
cases+0.184
Neg logit contributions
-0.149
Look-0.132
 -0.116
Wow-0.114
 It-0.110
Token with
Token ID351
Feature activation-4.42e+00

Pos logit contributions
 choices+0.340
 forth+0.328
ishes+0.327
 decisions+0.322
 luck+0.318
Neg logit contributions
-0.279
 It-0.195
 She-0.179
-0.167
 -0.151
Token a
Token ID257
Feature activation-4.53e+00

Pos logit contributions
ier+0.203
rying+0.200
castle+0.196
icking+0.193
 forth+0.187
Neg logit contributions
-0.129
 Many-0.113
 It-0.113
Look-0.108
 The-0.105
Token star
Token ID3491
Feature activation-6.29e+00

Pos logit contributions
ils+0.185
ier+0.181
rying+0.180
iness+0.174
icking+0.172
Neg logit contributions
-0.158
Look-0.139
 -0.133
It-0.128
V-0.122
Token on
Token ID319
Feature activation-1.91e+00

Pos logit contributions
 punished+0.223
 permission+0.221
 jobs+0.209
 bills+0.209
 argued+0.207
Neg logit contributions
-0.185
 It-0.138
-0.123
It-0.119
"-0.108
Token the
Token ID262
Feature activation-3.27e+00

Pos logit contributions
ier+0.100
rying+0.097
icking+0.096
owing+0.096
ener+0.092
Neg logit contributions
-0.059
 It-0.047
 The-0.046
E-0.043
 One-0.042
Token Spr
Token ID5522
Feature activation+4.44e-01

Pos logit contributions
V+0.069
E+0.058
Look+0.053
Wow+0.046
O+0.046
Neg logit contributions
 enough-0.153
ier-0.152
 size-0.149
 perfectly-0.148
 away-0.148
Tokeninkle
Token ID19894
Feature activation-3.23e+00

Pos logit contributions
V+0.014
+0.014
E+0.013
+0.013
 Now+0.012
Neg logit contributions
 size-0.016
 away-0.016
 enough-0.015
 airplane-0.015
 compared-0.014
Token it
Token ID340
Feature activation-5.35e+00

Pos logit contributions
 lair+0.127
epad+0.122
 destination+0.119
castle+0.117
rying+0.115
Neg logit contributions
-0.119
E-0.113
O-0.109
 The-0.108
M-0.106
Token on
Token ID319
Feature activation-1.35e+00

Pos logit contributions
ault+0.237
 permission+0.230
rier+0.227
 dollars+0.226
reating+0.225
Neg logit contributions
-0.168
-0.117
 It-0.112
 -0.110
 Next-0.107
Token any
Token ID597
Feature activation-3.26e+00

Pos logit contributions
ier+0.066
rying+0.062
 able+0.059
owing+0.059
eking+0.058
Neg logit contributions
-0.043
E-0.038
 The-0.034
O-0.034
 It-0.033
Token area
Token ID1989
Feature activation-9.20e+00

Pos logit contributions
ier+0.121
 wouldn+0.120
bags+0.118
bows+0.114
castle+0.112
Neg logit contributions
-0.094
Mom-0.078
 Many-0.076
Ah-0.073
E-0.072
Token and
Token ID290
Feature activation-3.89e+00

Pos logit contributions
ache+0.327
ault+0.314
 prisoner+0.304
urger+0.303
 tast+0.300
Neg logit contributions
-0.330
 It-0.259
 The-0.248
 She-0.243
 -0.228
Token you
Token ID345
Feature activation-2.91e+00

Pos logit contributions
ier+0.172
owing+0.164
iness+0.159
 affect+0.155
ening+0.154
Neg logit contributions
-0.119
Wow-0.094
 Now-0.093
L-0.093
V-0.093
Token will
Token ID481
Feature activation-2.90e+00

Pos logit contributions
rier+0.108
ier+0.108
ering+0.104
 falling+0.102
 attention+0.100
Neg logit contributions
-0.107
V-0.088
E-0.088
 On-0.085
 In-0.083
Token be
Token ID307
Feature activation-2.69e+00

Pos logit contributions
led+0.135
ier+0.134
ishing+0.130
lers+0.128
 dollars+0.127
Neg logit contributions
-0.101
 One-0.074
 -0.068
 The-0.068
 On-0.066
Token granted
Token ID7520
Feature activation-2.90e+00

Pos logit contributions
iness+0.122
ly+0.113
bars+0.109
cases+0.107
ware+0.105
Neg logit contributions
-0.087
 -0.065
 The-0.062
M-0.057
The-0.053
Token the
Token ID262
Feature activation-2.66e+00

Pos logit contributions
iness+0.056
ly+0.049
ies+0.048
eness+0.048
led+0.048
Neg logit contributions
-0.049
 The-0.039
-0.036
V-0.036
 -0.036
Token fairy
Token ID25607
Feature activation-5.18e+00

Pos logit contributions
led+0.114
ened+0.110
ering+0.110
ening+0.108
ils+0.104
Neg logit contributions
Look-0.069
E-0.067
-0.067
V-0.066
Mom-0.066
Token of
Token ID286
Feature activation-2.24e+00

Pos logit contributions
 levers+0.186
 castles+0.186
 heels+0.185
eking+0.185
 size+0.184
Neg logit contributions
-0.185
 The-0.135
 And-0.125
"-0.125
-0.118
Token the
Token ID262
Feature activation-3.54e+00

Pos logit contributions
ier+0.119
 wouldn+0.118
cases+0.113
ist+0.109
ache+0.107
Neg logit contributions
-0.051
Look-0.041
E-0.040
 Many-0.039
V-0.038
Token magic
Token ID5536
Feature activation-5.24e+00

Pos logit contributions
ier+0.153
 able+0.139
 wouldn+0.138
iness+0.135
ering+0.134
Neg logit contributions
-0.096
Look-0.094
-0.089
Mom-0.085
L-0.082
Token wand
Token ID11569
Feature activation-9.19e+00

Pos logit contributions
 argued+0.238
eking+0.224
ering+0.220
cking+0.218
lling+0.215
Neg logit contributions
-0.140
-0.109
 Now-0.099
L-0.096
 She-0.091
Token.
Token ID13
Feature activation+2.59e+00

Pos logit contributions
 injuries+0.349
ishing+0.344
ened+0.344
 coins+0.344
 possessions+0.343
Neg logit contributions
-0.282
 She-0.186
-0.171
 It-0.165
"-0.165
Token You
Token ID921
Feature activation-1.40e+00

Pos logit contributions
V+0.065
E+0.059
Look+0.052
+0.051
 Many+0.049
Neg logit contributions
ier-0.112
 size-0.110
 away-0.110
 enough-0.110
 perfectly-0.107
Token can
Token ID460
Feature activation-6.19e-01

Pos logit contributions
ier+0.057
 size+0.057
rier+0.057
ering+0.056
ening+0.056
Neg logit contributions
-0.043
V-0.036
 Many-0.036
 It-0.034
 In-0.034
Token make
Token ID787
Feature activation-1.26e+00

Pos logit contributions
ier+0.024
iness+0.023
 compared+0.022
ening+0.022
 enough+0.022
Neg logit contributions
-0.022
V-0.021
E-0.020
Mom-0.018
 Many-0.018
Token anything
Token ID1997
Feature activation-4.70e+00

Pos logit contributions
ier+0.056
iness+0.056
 wouldn+0.053
ener+0.050
castle+0.049
Neg logit contributions
-0.039
E-0.037
V-0.034
 Many-0.031
 Everyone-0.028
Token his
Token ID465
Feature activation-4.67e+00

Pos logit contributions
ier+0.024
ist+0.021
work+0.020
pan+0.019
cher+0.019
Neg logit contributions
-0.021
 Next-0.019
E-0.018
 Many-0.018
Tim-0.017
Token dinner
Token ID8073
Feature activation-6.35e+00

Pos logit contributions
ier+0.239
lling+0.193
ist+0.182
iness+0.181
ils+0.179
Neg logit contributions
-0.157
<|endoftext|>-0.117
 His-0.114
 Mom-0.114
 -0.113
Token with
Token ID351
Feature activation-2.24e+00

Pos logit contributions
ener+0.246
ening+0.243
 breaths+0.241
ked+0.237
ened+0.235
Neg logit contributions
-0.251
 -0.157
 Now-0.156
-0.154
M-0.153
Token his
Token ID465
Feature activation-3.07e+00

Pos logit contributions
ier+0.125
ist+0.122
rying+0.114
ener+0.113
drops+0.112
Neg logit contributions
-0.059
 Mom-0.045
Mom-0.042
 -0.041
Look-0.040
Token cob
Token ID22843
Feature activation-2.49e+00

Pos logit contributions
ier+0.179
ist+0.151
lling+0.149
rying+0.148
iness+0.141
Neg logit contributions
-0.085
 Mom-0.064
 Next-0.056
 The-0.055
<|endoftext|>-0.054
Tokenweb
Token ID12384
Feature activation-9.14e+00

Pos logit contributions
 destination+0.093
 impossible+0.089
 matter+0.088
 leave+0.088
 apart+0.088
Neg logit contributions
-0.064
M-0.058
Jen-0.058
 Now-0.056
itt-0.053
Token.
Token ID13
Feature activation+3.31e+00

Pos logit contributions
 expert+0.345
lling+0.344
ier+0.336
ault+0.326
rier+0.318
Neg logit contributions
-0.266
 The-0.169
 And-0.168
 -0.167
 She-0.161
Token<|endoftext|>
Token ID50256
Feature activation+5.13e+00

Pos logit contributions
V+0.058
E+0.049
Look+0.040
O+0.036
 Many+0.035
Neg logit contributions
 enough-0.162
 size-0.160
 away-0.159
 for-0.158
ier-0.158
TokenOnce
Token ID7454
Feature activation+3.91e+00

Pos logit contributions
V+0.103
E+0.100
 Many+0.084
 While+0.081
Br+0.081
Neg logit contributions
 in-0.241
 on-0.239
.-0.232
 away-0.232
!-0.230
Token upon
Token ID2402
Feature activation+3.44e+00

Pos logit contributions
V+0.188
E+0.176
Look+0.167
 Many+0.159
Wow+0.159
Neg logit contributions
 for-0.097
 away-0.090
 enough-0.090
ier-0.089
ies-0.087
Token a
Token ID257
Feature activation+4.40e+00

Pos logit contributions
V+0.176
E+0.166
Look+0.157
 While+0.157
Kids+0.157
Neg logit contributions
 enough-0.056
 size-0.052
 away-0.052
ier-0.052
 for-0.050
Token kind
Token ID1611
Feature activation-3.86e+00

Pos logit contributions
et+0.095
 powers+0.089
ode+0.089
stone+0.087
eness+0.087
Neg logit contributions
-0.056
Mom-0.055
E-0.052
 Their-0.050
V-0.048
Token and
Token ID290
Feature activation-3.34e+00

Pos logit contributions
eking+0.204
aches+0.189
eling+0.181
cking+0.180
 footing+0.175
Neg logit contributions
-0.115
 They-0.104
 She-0.094
 Now-0.083
 And-0.079
Token looked
Token ID3114
Feature activation-3.81e+00

Pos logit contributions
et+0.145
castle+0.139
ier+0.136
ener+0.135
le+0.135
Neg logit contributions
-0.105
 The-0.090
 From-0.086
 They-0.085
 Their-0.084
Token after
Token ID706
Feature activation-3.25e+00

Pos logit contributions
comb+0.195
castle+0.186
ode+0.183
eman+0.177
fer+0.175
Neg logit contributions
-0.106
 The-0.095
 When-0.088
 From-0.079
 They-0.078
Token her
Token ID607
Feature activation-3.85e+00

Pos logit contributions
 sword+0.157
castle+0.156
 ship+0.155
ode+0.155
ener+0.151
Neg logit contributions
-0.083
E-0.076
 The-0.064
 Everyone-0.059
 They-0.056
Token body
Token ID1767
Feature activation-9.09e+00

Pos logit contributions
castle+0.156
ier+0.152
boats+0.143
igator+0.138
boat+0.137
Neg logit contributions
-0.134
 And-0.102
 The-0.102
 In-0.092
 When-0.089
Token.
Token ID13
Feature activation+2.64e+00

Pos logit contributions
 explorer+0.380
 pearl+0.379
 adventurer+0.367
 thieves+0.354
 expert+0.350
Neg logit contributions
-0.280
 They-0.196
 She-0.179
 -0.178
 The-0.176
Token They
Token ID1119
Feature activation-3.26e+00

Pos logit contributions
V+0.051
E+0.043
Look+0.037
+0.034
 Many+0.033
Neg logit contributions
ier-0.129
 away-0.129
 size-0.128
 enough-0.128
 perfectly-0.126
Token gave
Token ID2921
Feature activation-7.58e-01

Pos logit contributions
rier+0.169
ier+0.154
 decisions+0.147
 pounds+0.146
ishing+0.146
Neg logit contributions
-0.091
 -0.075
 From-0.063
 The-0.062
Mom-0.058
Token her
Token ID607
Feature activation-2.00e+00

Pos logit contributions
castle+0.039
ier+0.038
 ship+0.038
 size+0.036
fish+0.036
Neg logit contributions
E-0.019
-0.018
Look-0.015
V-0.014
 The-0.014
Token some
Token ID617
Feature activation-4.60e+00

Pos logit contributions
ier+0.113
eking+0.101
castle+0.100
ist+0.097
ering+0.097
Neg logit contributions
-0.051
 The-0.045
E-0.033
<|endoftext|>-0.033
 Some-0.033
Token had
Token ID550
Feature activation-3.87e+00

Pos logit contributions
ier+0.284
rier+0.270
stones+0.269
 pounds+0.257
vell+0.256
Neg logit contributions
-0.236
 -0.177
 They-0.175
 The-0.157
 Their-0.148
Token their
Token ID511
Feature activation-4.63e+00

Pos logit contributions
ener+0.193
comb+0.172
iness+0.171
ist+0.170
rier+0.167
Neg logit contributions
-0.125
 -0.088
 The-0.085
 Their-0.081
Tom-0.077
Token special
Token ID2041
Feature activation-6.18e+00

Pos logit contributions
ier+0.238
ist+0.187
 able+0.182
rying+0.178
lling+0.175
Neg logit contributions
-0.133
 Mom-0.112
 Their-0.104
Mom-0.101
ossible-0.101
Token pe
Token ID613
Feature activation-2.87e+00

Pos logit contributions
ier+0.286
lling+0.253
eking+0.243
 able+0.240
pping+0.228
Neg logit contributions
-0.192
<|endoftext|>-0.138
ossible-0.135
 Today-0.133
 -0.130
Tokenb
Token ID65
Feature activation-3.00e+00

Pos logit contributions
 compared+0.110
 party+0.109
storm+0.108
 hurting+0.108
 bothering+0.108
Neg logit contributions
-0.058
E-0.056
Finally-0.053
Jen-0.050
Near-0.050
Tokenble
Token ID903
Feature activation-9.07e+00

Pos logit contributions
 bit+0.139
 compared+0.135
 bothering+0.131
 hurting+0.128
 argument+0.127
Neg logit contributions
-0.072
Jen-0.050
M-0.047
E-0.046
-0.045
Token which
Token ID543
Feature activation-5.20e+00

Pos logit contributions
 aroma+0.351
lling+0.339
 breaths+0.338
ening+0.335
 matter+0.328
Neg logit contributions
-0.279
 The-0.164
 And-0.156
-0.151
 They-0.150
Token united
Token ID16503
Feature activation-5.35e+00

Pos logit contributions
eking+0.240
rying+0.239
ishing+0.237
iously+0.236
king+0.233
Neg logit contributions
-0.163
 -0.121
-0.103
Look-0.102
 The-0.098
Token them
Token ID606
Feature activation-6.70e+00

Pos logit contributions
eking+0.229
ache+0.224
ider+0.214
 breaths+0.214
itive+0.209
Neg logit contributions
-0.195
 They-0.138
 -0.118
 It-0.113
Look-0.111
Token in
Token ID287
Feature activation-2.08e+00

Pos logit contributions
ener+0.258
 thieves+0.252
 aroma+0.247
 breaths+0.243
isher+0.238
Neg logit contributions
-0.233
 -0.142
 They-0.123
 The-0.121
<|endoftext|>-0.113
Token friendship
Token ID14738
Feature activation-5.88e+00

Pos logit contributions
ier+0.106
ache+0.103
ly+0.101
drops+0.099
 wouldn+0.099
Neg logit contributions
-0.071
Look-0.050
 -0.048
 The-0.046
E-0.045
Token day
Token ID1110
Feature activation-5.62e+00

Pos logit contributions
ier+0.198
ppers+0.178
ener+0.174
asy+0.170
ache+0.170
Neg logit contributions
-0.151
 -0.120
 The-0.107
 She-0.103
 It-0.102
Token and
Token ID290
Feature activation-4.43e+00

Pos logit contributions
isher+0.281
ener+0.277
ode+0.259
ishes+0.254
ish+0.251
Neg logit contributions
-0.185
 The-0.128
 She-0.117
 -0.109
 It-0.106
Token the
Token ID262
Feature activation-6.38e-01

Pos logit contributions
ier+0.199
ler+0.180
ache+0.168
 dollars+0.167
ist+0.166
Neg logit contributions
-0.147
 The-0.117
 -0.116
 She-0.113
 From-0.112
Token way
Token ID835
Feature activation-3.56e+00

Pos logit contributions
ier+0.028
work+0.026
ist+0.026
fish+0.024
iness+0.024
Neg logit contributions
-0.020
Wow-0.018
V-0.018
Me-0.017
 Many-0.017
Token her
Token ID607
Feature activation-2.61e+00

Pos logit contributions
ache+0.168
urger+0.168
 scales+0.168
ils+0.167
 topp+0.165
Neg logit contributions
-0.106
 She-0.088
 The-0.078
 -0.075
 It-0.069
Token body
Token ID1767
Feature activation-9.07e+00

Pos logit contributions
ier+0.130
cher+0.108
ist+0.108
ler+0.105
pan+0.102
Neg logit contributions
-0.088
 The-0.064
 Next-0.062
-0.062
 In-0.061
Token felt
Token ID2936
Feature activation-3.48e+00

Pos logit contributions
 expert+0.322
ener+0.319
 thieves+0.319
urger+0.315
 pearl+0.306
Neg logit contributions
-0.326
 -0.227
 It-0.207
 She-0.206
-0.205
Token with
Token ID351
Feature activation-2.78e+00

Pos logit contributions
ener+0.152
sters+0.152
cles+0.151
castle+0.150
eman+0.150
Neg logit contributions
-0.135
 The-0.099
 -0.093
The-0.086
 When-0.081
Token each
Token ID1123
Feature activation-6.64e-01

Pos logit contributions
castle+0.148
ier+0.143
ache+0.141
cher+0.137
ener+0.136
Neg logit contributions
-0.071
 -0.055
 The-0.050
 It-0.048
E-0.046
Token step
Token ID2239
Feature activation-6.48e+00

Pos logit contributions
castle+0.033
work+0.032
flies+0.031
ier+0.030
nuts+0.029
Neg logit contributions
-0.017
 -0.013
 Many-0.012
M-0.012
 One-0.011
Token.
Token ID13
Feature activation+4.20e+00

Pos logit contributions
 island+0.253
 size+0.250
castle+0.247
 lair+0.240
 enemy+0.236
Neg logit contributions
-0.208
 She-0.176
 The-0.155
 -0.150
 Her-0.147
Token the
Token ID262
Feature activation-4.43e-01

Pos logit contributions
rying+0.213
eking+0.202
ener+0.196
ishing+0.192
ies+0.191
Neg logit contributions
-0.143
 -0.110
 Everyone-0.100
 The-0.097
 It-0.093
Token man
Token ID582
Feature activation-6.69e+00

Pos logit contributions
ier+0.018
flies+0.017
 able+0.017
ist+0.017
 forever+0.016
Neg logit contributions
V-0.016
-0.014
Look-0.013
Mom-0.013
Tom-0.013
Token was
Token ID373
Feature activation-3.93e+00

Pos logit contributions
rier+0.310
 luck+0.297
ier+0.291
 choices+0.288
 attention+0.287
Neg logit contributions
-0.201
 -0.136
 The-0.130
 It-0.120
It-0.107
Token wearing
Token ID5762
Feature activation-4.49e+00

Pos logit contributions
als+0.156
ener+0.155
lers+0.154
ishes+0.154
ly+0.154
Neg logit contributions
-0.143
 -0.108
 The-0.096
It-0.092
Once-0.087
Token a
Token ID257
Feature activation-1.97e+00

Pos logit contributions
rying+0.253
ies+0.232
flies+0.231
ier+0.229
comb+0.225
Neg logit contributions
-0.116
 It-0.097
 The-0.094
 -0.084
 His-0.074
Token sock
Token ID32263
Feature activation-9.00e+00

Pos logit contributions
ier+0.083
ils+0.078
flies+0.076
 happening+0.074
rying+0.073
Neg logit contributions
-0.068
V-0.063
 -0.061
Look-0.060
E-0.055
Token on
Token ID319
Feature activation-2.28e+00

Pos logit contributions
eling+0.368
ies+0.349
lled+0.347
rated+0.344
ishes+0.340
Neg logit contributions
-0.268
 -0.173
 She-0.166
 He-0.165
Once-0.164
Token his
Token ID465
Feature activation-2.54e+00

Pos logit contributions
rying+0.131
ier+0.131
ies+0.126
owing+0.123
ener+0.121
Neg logit contributions
-0.054
 The-0.042
 It-0.040
 His-0.038
 Everyone-0.035
Token head
Token ID1182
Feature activation-6.91e+00

Pos logit contributions
ier+0.125
pan+0.094
rying+0.094
 enough+0.093
flies+0.093
Neg logit contributions
-0.081
 His-0.072
 Whe-0.066
 -0.066
 The-0.064
Token,
Token ID11
Feature activation-3.97e+00

Pos logit contributions
ils+0.288
 breaths+0.278
ts+0.276
 thieves+0.269
 obstacle+0.268
Neg logit contributions
-0.251
 -0.186
 The-0.166
 It-0.166
 His-0.158
Token but
Token ID475
Feature activation-3.02e+00

Pos logit contributions
ier+0.200
ils+0.186
iness+0.170
iest+0.169
iful+0.157
Neg logit contributions
-0.156
 -0.134
 The-0.124
 It-0.118
 But-0.112
Token tried
Token ID3088
Feature activation-3.38e+00

Pos logit contributions
rier+0.188
pan+0.162
ier+0.161
iness+0.150
 diamonds+0.150
Neg logit contributions
-0.108
-0.086
I-0.075
 Now-0.074
 From-0.073
Token one
Token ID530
Feature activation-2.59e+00

Pos logit contributions
ener+0.157
mer+0.151
cher+0.150
isher+0.149
led+0.144
Neg logit contributions
-0.128
 -0.089
 It-0.076
 She-0.075
 The-0.072
Token of
Token ID286
Feature activation-1.79e+00

Pos logit contributions
ps+0.102
eking+0.096
ier+0.093
bags+0.091
ping+0.091
Neg logit contributions
-0.105
 But-0.072
Sure-0.071
 On-0.068
 -0.067
Token the
Token ID262
Feature activation-8.19e-01

Pos logit contributions
ier+0.089
castle+0.087
ener+0.085
ache+0.083
ly+0.083
Neg logit contributions
-0.050
V-0.037
E-0.035
 -0.034
 The-0.034
Token unknown
Token ID6439
Feature activation-3.22e+00

Pos logit contributions
flies+0.032
castle+0.032
fly+0.031
ier+0.030
 able+0.030
Neg logit contributions
V-0.028
-0.027
E-0.023
 Whe-0.023
Look-0.022
Token machines
Token ID8217
Feature activation-8.99e+00

Pos logit contributions
eking+0.132
castle+0.120
nuts+0.119
 hatched+0.119
 wil+0.118
Neg logit contributions
-0.111
 But-0.081
-0.078
 Everyone-0.075
 It-0.075
Token,
Token ID11
Feature activation-4.29e+00

Pos logit contributions
urger+0.360
ils+0.343
ies+0.335
ishes+0.335
eking+0.331
Neg logit contributions
-0.299
 She-0.212
 -0.212
 It-0.185
 The-0.184
Token but
Token ID475
Feature activation-2.91e+00

Pos logit contributions
ier+0.194
ils+0.185
fish+0.174
led+0.169
le+0.168
Neg logit contributions
-0.138
 It-0.115
 The-0.113
 But-0.105
 -0.104
Token she
Token ID673
Feature activation-3.41e+00

Pos logit contributions
ier+0.139
urger+0.138
immers+0.133
et+0.132
ets+0.131
Neg logit contributions
-0.080
 -0.063
It-0.054
 She-0.053
 The-0.053
Token knew
Token ID2993
Feature activation-2.51e+00

Pos logit contributions
 attention+0.170
rier+0.164
 diamonds+0.155
ier+0.153
 airplane+0.152
Neg logit contributions
-0.092
M-0.070
 From-0.066
Even-0.059
 -0.057
Token it
Token ID340
Feature activation-3.60e+00

Pos logit contributions
ener+0.110
 ship+0.110
mer+0.110
castle+0.109
ler+0.106
Neg logit contributions
-0.082
It-0.053
 It-0.052
 -0.051
 She-0.051
Token were
Token ID547
Feature activation-4.78e+00

Pos logit contributions
urger+0.270
 luck+0.267
 permission+0.264
 Majesty+0.261
rier+0.260
Neg logit contributions
-0.141
It-0.096
 It-0.095
 The-0.092
 She-0.090
Token taking
Token ID2263
Feature activation-3.50e+00

Pos logit contributions
isher+0.203
ener+0.190
 elbow+0.187
ci+0.177
 medal+0.171
Neg logit contributions
-0.169
 -0.107
 The-0.100
 One-0.095
 In-0.094
Token care
Token ID1337
Feature activation-4.02e+00

Pos logit contributions
castle+0.181
ier+0.154
pit+0.152
 salon+0.151
fish+0.144
Neg logit contributions
-0.119
 The-0.098
 It-0.087
E-0.087
 -0.086
Token of
Token ID286
Feature activation-3.18e+00

Pos logit contributions
 forth+0.238
ener+0.228
 thieves+0.227
castle+0.225
 argued+0.223
Neg logit contributions
-0.091
 The-0.050
 They-0.039
 It-0.038
 Everyone-0.032
Token the
Token ID262
Feature activation-1.69e+00

Pos logit contributions
ener+0.166
ier+0.162
drops+0.158
castle+0.156
aser+0.153
Neg logit contributions
-0.097
 The-0.062
 -0.058
 They-0.050
 It-0.050
Token air
Token ID1633
Feature activation-8.99e+00

Pos logit contributions
ier+0.069
castle+0.069
et+0.067
 able+0.065
 perfectly+0.064
Neg logit contributions
-0.057
 Next-0.049
-0.047
Look-0.045
Wow-0.042
Token.
Token ID13
Feature activation+3.39e+00

Pos logit contributions
isher+0.366
 prisoner+0.365
eking+0.354
 prisoners+0.352
ishing+0.350
Neg logit contributions
-0.269
 They-0.200
 She-0.184
 The-0.178
 It-0.176
Token From
Token ID3574
Feature activation-9.41e-01

Pos logit contributions
V+0.077
E+0.068
O+0.056
Look+0.055
Wow+0.051
Neg logit contributions
 enough-0.152
 for-0.149
 away-0.146
 wouldn-0.146
 size-0.145
Token that
Token ID326
Feature activation-7.30e-05

Pos logit contributions
ier+0.041
iness+0.040
epad+0.039
 affect+0.038
 size+0.038
Neg logit contributions
-0.030
Mom-0.023
 Now-0.023
 The-0.023
A-0.023
Token moment
Token ID2589
Feature activation-2.05e-01

Pos logit contributions
ier+0.000
ist+0.000
ps+0.000
drops+0.000
 coins+0.000
Neg logit contributions
-0.000
 -0.000
V-0.000
 Now-0.000
Look-0.000
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
 size+0.006
castle+0.006
 island+0.006
 crown+0.006
 mess+0.006
Neg logit contributions
-0.009
V-0.008
 Now-0.008
 From-0.008
 The-0.008
Token their
Token ID511
Feature activation-4.74e+00

Pos logit contributions
ener+0.355
ishing+0.338
isher+0.336
ish+0.333
 promise+0.330
Neg logit contributions
-0.176
 He-0.094
 It-0.090
 The-0.089
 Tim-0.086
Token feet
Token ID3625
Feature activation-8.13e+00

Pos logit contributions
ier+0.238
 able+0.194
pan+0.193
pping+0.182
ist+0.181
Neg logit contributions
-0.143
 Mom-0.122
Me-0.110
Look-0.108
 The-0.104
Token were
Token ID547
Feature activation-5.80e+00

Pos logit contributions
urger+0.381
 Majesty+0.381
 thieves+0.379
ener+0.377
 prisoner+0.372
Neg logit contributions
-0.205
 It-0.132
 -0.131
It-0.129
 She-0.128
Token the
Token ID262
Feature activation-5.91e+00

Pos logit contributions
ener+0.234
ience+0.226
isher+0.223
 dollars+0.213
ils+0.211
Neg logit contributions
-0.182
Once-0.123
 He-0.120
Sam-0.118
 -0.117
Token heavy
Token ID4334
Feature activation-5.78e+00

Pos logit contributions
owed+0.219
ils+0.219
ier+0.210
 able+0.203
es+0.203
Neg logit contributions
-0.149
Mom-0.143
Look-0.143
Wow-0.129
Oh-0.127
Token space
Token ID2272
Feature activation-8.98e+00

Pos logit contributions
cher+0.262
ils+0.255
vell+0.247
chers+0.237
 able+0.237
Neg logit contributions
 Today-0.131
 One-0.127
-0.124
 He-0.115
 While-0.107
Token boots
Token ID14412
Feature activation-6.80e+00

Pos logit contributions
eling+0.364
eking+0.359
unct+0.331
ier+0.319
 believing+0.316
Neg logit contributions
-0.224
 They-0.154
 He-0.126
 The-0.117
 It-0.116
Token.
Token ID13
Feature activation+2.59e+00

Pos logit contributions
 pearl+0.293
 luck+0.283
 mistake+0.276
 potions+0.275
 treasures+0.274
Neg logit contributions
-0.231
 -0.171
 She-0.159
 The-0.151
 It-0.142
Token They
Token ID1119
Feature activation-3.12e+00

Pos logit contributions
V+0.042
E+0.037
Look+0.029
+0.028
 Many+0.026
Neg logit contributions
ier-0.135
 size-0.134
 away-0.133
 enough-0.133
 wouldn-0.130
Token pretended
Token ID36766
Feature activation-4.27e+00

Pos logit contributions
rier+0.165
ier+0.156
 decisions+0.155
 choices+0.148
 attention+0.140
Neg logit contributions
V-0.080
 -0.079
-0.078
 While-0.069
 One-0.063
Token the
Token ID262
Feature activation-4.00e+00

Pos logit contributions
ener+0.260
isher+0.240
ishing+0.239
ising+0.234
iful+0.234
Neg logit contributions
-0.105
 The-0.067
 It-0.063
 Tim-0.058
 Tom-0.057
Token subtle
Token ID11800
Feature activation-4.38e+00

Pos logit contributions
ier+0.076
ps+0.074
iness+0.072
ils+0.069
et+0.069
Neg logit contributions
-0.052
V-0.047
E-0.046
Mom-0.045
L-0.043
Token tranqu
Token ID31882
Feature activation+1.33e+00

Pos logit contributions
ishes+0.186
rying+0.183
ge+0.178
ier+0.176
 anywhere+0.174
Neg logit contributions
-0.129
<|endoftext|>-0.102
Sam-0.102
Tom-0.101
L-0.100
Tokenility
Token ID879
Feature activation-7.08e+00

Pos logit contributions
V+0.056
E+0.053
Look+0.051
+0.051
 Many+0.050
Neg logit contributions
 size-0.033
 coins-0.032
 enough-0.032
 airplane-0.032
 away-0.031
Token filled
Token ID5901
Feature activation-1.50e+00

Pos logit contributions
ache+0.313
 unatt+0.308
 injuries+0.301
 mustache+0.301
eking+0.296
Neg logit contributions
-0.199
 They-0.140
 It-0.138
 The-0.126
 Now-0.110
Token the
Token ID262
Feature activation-2.77e+00

Pos logit contributions
ier+0.077
ache+0.076
 wouldn+0.069
iness+0.068
ener+0.067
Neg logit contributions
-0.048
Look-0.041
E-0.036
 The-0.034
M-0.034
Token air
Token ID1633
Feature activation-8.98e+00

Pos logit contributions
ier+0.135
 wouldn+0.130
 able+0.126
iness+0.124
ge+0.122
Neg logit contributions
Look-0.066
-0.059
Mom-0.057
Wow-0.051
-0.051
Token and
Token ID290
Feature activation-3.67e+00

Pos logit contributions
 thieves+0.367
ache+0.361
 bills+0.350
ener+0.342
 elbow+0.341
Neg logit contributions
-0.308
 -0.218
 She-0.193
 The-0.192
 They-0.187
Token the
Token ID262
Feature activation-3.62e+00

Pos logit contributions
ier+0.161
ache+0.146
 dollars+0.143
ener+0.141
iness+0.139
Neg logit contributions
-0.124
 The-0.103
 -0.099
Look-0.098
 Next-0.094
Token rabbit
Token ID22746
Feature activation-6.50e+00

Pos logit contributions
ier+0.164
 able+0.153
iness+0.149
ils+0.137
 wouldn+0.137
Neg logit contributions
Look-0.100
-0.094
Mom-0.093
Wow-0.085
V-0.082
Token felt
Token ID2936
Feature activation-2.84e+00

Pos logit contributions
ier+0.306
rier+0.274
 attention+0.268
 thieves+0.267
iness+0.265
Neg logit contributions
-0.233
 -0.189
 The-0.169
 From-0.169
 It-0.151
Token his
Token ID465
Feature activation-2.70e+00

Pos logit contributions
ppers+0.126
ener+0.124
 forth+0.124
castle+0.123
ies+0.122
Neg logit contributions
-0.104
 The-0.086
 -0.078
 It-0.076
 From-0.071
Token friends
Token ID2460
Feature activation-5.07e+00

Pos logit contributions
ier+0.034
rying+0.031
ist+0.030
 growing+0.029
 enough+0.029
Neg logit contributions
-0.013
V-0.012
 Many-0.010
Mom-0.009
 Mom-0.009
Token won
Token ID1839
Feature activation-7.25e-01

Pos logit contributions
rier+0.249
 potion+0.240
 ger+0.239
 pests+0.237
 choices+0.229
Neg logit contributions
-0.148
 -0.140
 The-0.091
 Everyone-0.089
 From-0.087
Token the
Token ID262
Feature activation-7.12e-01

Pos logit contributions
ener+0.030
 crown+0.029
ies+0.028
iness+0.028
 wand+0.028
Neg logit contributions
-0.029
V-0.022
 The-0.022
E-0.021
 It-0.020
Token day
Token ID1110
Feature activation-3.03e+00

Pos logit contributions
ier+0.024
ies+0.024
ener+0.023
iness+0.022
 coins+0.020
Neg logit contributions
V-0.031
-0.026
E-0.026
 Many-0.025
Look-0.025
Token.
Token ID13
Feature activation+5.23e+00

Pos logit contributions
 wand+0.127
ener+0.125
pan+0.112
 size+0.110
 wouldn+0.109
Neg logit contributions
-0.120
 Many-0.085
 Everyone-0.084
 The-0.082
 This-0.080
Token
Token ID198
Feature activation+5.77e+00

Pos logit contributions
E+0.169
V+0.146
O+0.126
Pe+0.120
Gu+0.112
Neg logit contributions
 for-0.204
!-0.192
 enough-0.187
 and-0.186
 as-0.184
Token
Token ID198
Feature activation+4.02e+00

Pos logit contributions
E+0.188
Wow+0.170
Gu+0.170
V+0.168
Really+0.152
Neg logit contributions
 for-0.197
 on-0.189
!-0.185
 and-0.183
 that-0.182
TokenSam
Token ID16305
Feature activation-3.07e+00

Pos logit contributions
E+0.095
V+0.088
 While+0.078
 Ow+0.075
 Next+0.071
Neg logit contributions
 away-0.163
 for-0.151
 wouldn-0.151
 on-0.149
 made-0.149
Token was
Token ID373
Feature activation-1.32e+00

Pos logit contributions
rier+0.154
ier+0.142
 anywhere+0.134
 possible+0.132
 thieves+0.131
Neg logit contributions
-0.101
 From-0.093
V-0.083
 The-0.081
 Now-0.081
Token so
Token ID523
Feature activation+3.65e-01

Pos logit contributions
ies+0.056
iness+0.055
ly+0.055
fish+0.055
ils+0.055
Neg logit contributions
-0.046
 -0.039
 The-0.037
V-0.033
 From-0.033
Token happy
Token ID3772
Feature activation-1.96e+00

Pos logit contributions
+0.011
V+0.009
 Many+0.008
E+0.008
 The+0.008
Neg logit contributions
 size-0.016
 powers-0.016
 wand-0.016
ist-0.016
ier-0.016
Token?"
Token ID1701
Feature activation+3.05e+00

Pos logit contributions
ening+0.098
lling+0.096
 size+0.093
 possessions+0.093
ened+0.093
Neg logit contributions
-0.061
-0.040
Mom-0.038
 The-0.031
Dad-0.031
Token Lily
Token ID20037
Feature activation+2.48e+00

Pos logit contributions
V+0.092
E+0.084
Look+0.075
 While+0.071
Wow+0.071
Neg logit contributions
 enough-0.115
 away-0.114
 for-0.112
 size-0.111
 coins-0.109
Token asked
Token ID1965
Feature activation+2.12e+00

Pos logit contributions
V+0.102
E+0.097
Look+0.090
 Many+0.089
+0.088
Neg logit contributions
ier-0.068
 size-0.068
 enough-0.067
 away-0.067
 wouldn-0.063
Token Mom
Token ID11254
Feature activation+1.43e+00

Pos logit contributions
V+0.110
E+0.105
+0.101
Look+0.099
 Many+0.097
Neg logit contributions
ier-0.038
 size-0.036
 away-0.035
 enough-0.034
 wouldn-0.032
Token.
Token ID13
Feature activation+5.43e+00

Pos logit contributions
V+0.074
+0.071
E+0.070
Look+0.065
Mom+0.065
Neg logit contributions
ier-0.025
 size-0.024
 perfectly-0.022
 luck-0.022
 enough-0.021
Token
Token ID198
Feature activation+6.37e+00

Pos logit contributions
Ta+0.175
V+0.163
Mic+0.161
innie+0.157
E+0.156
Neg logit contributions
.-0.213
 and-0.213
,-0.205
 to-0.203
 in-0.194
Token
Token ID198
Feature activation+3.61e+00

Pos logit contributions
Ta+0.240
Wow+0.226
Whe+0.222
Nice+0.221
 Deal+0.220
Neg logit contributions
.-0.235
,-0.227
 and-0.210
 to-0.205
!-0.185
Token"
Token ID1
Feature activation+3.57e+00

Pos logit contributions
V+0.124
 Many+0.122
E+0.118
 While+0.113
 Ow+0.113
Neg logit contributions
 away-0.115
 size-0.100
 bit-0.099
 in-0.099
 left-0.098
TokenYes
Token ID5297
Feature activation+1.34e+00

Pos logit contributions
V+0.093
 Later+0.084
 Many+0.081
E+0.080
 While+0.078
Neg logit contributions
 bit-0.145
 away-0.141
 coins-0.134
 ring-0.132
 size-0.131
Token,
Token ID11
Feature activation+1.04e+00

Pos logit contributions
+0.063
E+0.063
V+0.062
Mom+0.058
O+0.057
Neg logit contributions
ier-0.027
 size-0.027
work-0.024
 powers-0.024
 wand-0.024
Token we
Token ID356
Feature activation+1.52e+00

Pos logit contributions
+0.037
E+0.036
V+0.035
Mom+0.034
 Many+0.033
Neg logit contributions
ier-0.036
 size-0.035
 away-0.033
 perfectly-0.032
 luck-0.032
Token ever
Token ID1683
Feature activation-6.26e-01

Pos logit contributions
rier+0.058
nuts+0.053
ps+0.052
 airplane+0.052
ier+0.051
Neg logit contributions
-0.046
-0.029
L-0.028
 -0.027
 The-0.026
Token said
Token ID531
Feature activation-2.34e+00

Pos logit contributions
rier+0.026
 airplane+0.022
 perfectly+0.022
cards+0.021
ier+0.021
Neg logit contributions
-0.020
V-0.017
-0.017
 While-0.016
 Many-0.016
Token "
Token ID366
Feature activation+5.26e-01

Pos logit contributions
ishing+0.100
lling+0.099
ler+0.098
ising+0.097
 size+0.096
Neg logit contributions
-0.079
 Her-0.060
 -0.060
 It-0.060
 The-0.059
Tokenyes
Token ID8505
Feature activation+2.39e-01

Pos logit contributions
V+0.016
E+0.016
Look+0.014
+0.014
Wow+0.014
Neg logit contributions
 wand-0.022
 enough-0.022
ier-0.022
 luck-0.022
 airplane-0.021
Token".
Token ID1911
Feature activation+3.95e+00

Pos logit contributions
V+0.009
+0.009
E+0.009
O+0.008
 Many+0.008
Neg logit contributions
 size-0.008
 coins-0.007
 luck-0.007
 treasures-0.007
 airplane-0.007
Token
Token ID198
Feature activation+7.30e+00

Pos logit contributions
V+0.100
E+0.088
O+0.076
Kids+0.064
Br+0.063
Neg logit contributions
 away-0.172
 for-0.169
.-0.168
 in-0.165
 size-0.165
Token
Token ID198
Feature activation+4.15e+00

Pos logit contributions
 Deal+0.304
Ta+0.287
Grab+0.282
Uh+0.262
 Glass+0.250
Neg logit contributions
.-0.255
 and-0.222
!-0.212
 in-0.210
 to-0.201
TokenOne
Token ID3198
Feature activation+2.10e+00

Pos logit contributions
 Deal+0.118
 Ow+0.114
 Glass+0.106
ucks+0.105
V+0.102
Neg logit contributions
 in-0.134
 even-0.132
 size-0.128
 away-0.128
 nearby-0.127
Token day
Token ID1110
Feature activation+2.26e+00

Pos logit contributions
V+0.094
E+0.090
+0.085
Look+0.084
 Many+0.083
Neg logit contributions
ier-0.053
 size-0.050
 away-0.049
 enough-0.049
 perfectly-0.047
Token,
Token ID11
Feature activation+1.42e+00

Pos logit contributions
V+0.133
E+0.127
Look+0.121
+0.120
 Many+0.119
Neg logit contributions
ier-0.026
 away-0.024
 size-0.024
 enough-0.023
 perfectly-0.020
Token the
Token ID262
Feature activation+5.26e-01

Pos logit contributions
E+0.052
+0.051
V+0.051
 While+0.049
Wow+0.047
Neg logit contributions
ier-0.051
 size-0.046
et-0.044
 luck-0.044
ist-0.043
Token lion
Token ID18744
Feature activation-3.73e+00

Pos logit contributions
V+0.008
+0.008
Look+0.007
E+0.007
 Many+0.006
Neg logit contributions
ier-0.008
iness-0.008
 permission-0.007
ener-0.007
 wouldn-0.007
Token!"
Token ID2474
Feature activation+3.32e+00

Pos logit contributions
 permission+0.181
 attention+0.169
iness+0.166
ier+0.165
 size+0.160
Neg logit contributions
-0.108
-0.086
 Now-0.080
 It-0.063
 This-0.061
Token she
Token ID673
Feature activation+1.49e+00

Pos logit contributions
V+0.178
E+0.166
Look+0.155
Wow+0.150
 Many+0.149
Neg logit contributions
 away-0.057
 for-0.052
 enough-0.048
 left-0.047
 bit-0.047
Token exclaimed
Token ID33552
Feature activation-3.69e-02

Pos logit contributions
V+0.059
E+0.055
+0.055
Look+0.051
 Many+0.050
Neg logit contributions
ier-0.049
 size-0.045
 luck-0.044
 away-0.043
 enough-0.043
Token.
Token ID13
Feature activation+5.69e+00

Pos logit contributions
et+0.001
 size+0.001
ier+0.001
 powers+0.001
 island+0.001
Neg logit contributions
-0.002
 The-0.001
E-0.001
 Her-0.001
V-0.001
Token
Token ID198
Feature activation+7.21e+00

Pos logit contributions
V+0.171
E+0.163
Ta+0.158
Jen+0.152
Le+0.141
Neg logit contributions
 to-0.216
,-0.202
 for-0.202
.-0.201
 and-0.200
Token
Token ID198
Feature activation+3.95e+00

Pos logit contributions
 Deal+0.283
Whe+0.281
Uh+0.276
Grab+0.274
Hmm+0.272
Neg logit contributions
 and-0.226
.-0.221
 in-0.210
 to-0.208
,-0.201
TokenThey
Token ID2990
Feature activation-2.48e-01

Pos logit contributions
V+0.092
 Ow+0.085
+0.083
E+0.080
 Some+0.078
Neg logit contributions
 away-0.152
 in-0.149
 bit-0.144
 size-0.143
 perfectly-0.143
Token watched
Token ID7342
Feature activation-4.34e-01

Pos logit contributions
rier+0.010
ier+0.009
 powers+0.009
 size+0.008
 than+0.008
Neg logit contributions
V-0.009
-0.008
Mom-0.008
M-0.008
E-0.008
Token the
Token ID262
Feature activation+1.33e+00

Pos logit contributions
ier+0.018
 size+0.016
iness+0.016
 wouldn+0.016
ener+0.015
Neg logit contributions
-0.016
 Many-0.014
 The-0.014
V-0.013
 When-0.013
Token film
Token ID2646
Feature activation-3.92e+00

Pos logit contributions
V+0.061
E+0.056
+0.054
Look+0.054
 Many+0.053
Neg logit contributions
ier-0.036
 enough-0.033
 wouldn-0.033
 away-0.032
 perfectly-0.031
Token difficult
Token ID2408
Feature activation-2.46e+00

Pos logit contributions
ener+0.101
ils+0.097
work+0.095
iness+0.093
ci+0.092
Neg logit contributions
-0.070
V-0.062
One-0.055
O-0.053
 One-0.053
Token,"
Token ID553
Feature activation+1.43e+00

Pos logit contributions
 crown+0.115
 wouldn+0.109
 reapp+0.109
iness+0.108
 ling+0.107
Neg logit contributions
-0.067
-0.045
 Now-0.041
 On-0.039
E-0.039
Token Jill
Token ID22772
Feature activation-8.91e-02

Pos logit contributions
V+0.033
+0.032
E+0.031
Look+0.029
 Many+0.027
Neg logit contributions
ier-0.071
 size-0.066
 away-0.065
 wouldn-0.064
 falling-0.064
Token says
Token ID1139
Feature activation+3.29e-01

Pos logit contributions
ier+0.005
 size+0.005
 perfectly+0.005
 anywhere+0.004
 falling+0.004
Neg logit contributions
-0.002
V-0.002
 On-0.001
-0.001
 The-0.001
Token.
Token ID13
Feature activation+4.48e+00

Pos logit contributions
+0.015
V+0.013
 The+0.012
E+0.012
 Their+0.012
Neg logit contributions
 size-0.009
 airplane-0.008
et-0.008
 island-0.008
ier-0.008
Token
Token ID198
Feature activation+6.91e+00

Pos logit contributions
V+0.129
E+0.120
Ta+0.112
Br+0.106
O+0.102
Neg logit contributions
 in-0.172
 and-0.172
 for-0.170
.-0.166
 to-0.164
Token
Token ID198
Feature activation+4.44e+00

Pos logit contributions
Ta+0.219
Uh+0.211
 While+0.209
Wow+0.206
Grab+0.205
Neg logit contributions
.-0.268
 and-0.244
,-0.231
!-0.231
 in-0.214
Token"
Token ID1
Feature activation+3.05e+00

Pos logit contributions
 Ow+0.162
V+0.160
 While+0.158
 Many+0.158
 Whe+0.157
Neg logit contributions
 away-0.112
!-0.107
 in-0.102
.-0.101
 than-0.101
TokenYes
Token ID5297
Feature activation+1.04e+00

Pos logit contributions
V+0.070
E+0.064
 Later+0.061
 While+0.061
 Many+0.061
Neg logit contributions
 size-0.124
 away-0.124
ier-0.122
 bit-0.121
 wouldn-0.118
Token,
Token ID11
Feature activation+2.93e-01

Pos logit contributions
+0.047
E+0.047
V+0.046
Mom+0.043
O+0.042
Neg logit contributions
 size-0.023
ier-0.022
 powers-0.021
 growing-0.021
 wand-0.021
Token it
Token ID340
Feature activation-3.14e-01

Pos logit contributions
+0.009
V+0.008
E+0.008
 Many+0.008
 It+0.008
Neg logit contributions
 size-0.012
ier-0.012
lers-0.011
 away-0.011
 powers-0.011
Token woman
Token ID2415
Feature activation-1.35e+00

Pos logit contributions
V+0.111
E+0.106
Look+0.100
+0.100
 Many+0.098
Neg logit contributions
ier-0.045
 size-0.044
 away-0.043
 enough-0.042
 luck-0.040
Token,
Token ID11
Feature activation+5.35e-01

Pos logit contributions
ier+0.047
 powers+0.046
iest+0.045
 size+0.044
 wouldn+0.043
Neg logit contributions
-0.054
 The-0.046
 From-0.044
 -0.042
 When-0.040
Token smiled
Token ID13541
Feature activation+4.72e-01

Pos logit contributions
+0.025
V+0.022
 From+0.022
 While+0.022
 The+0.022
Neg logit contributions
ier-0.015
 powers-0.014
 luck-0.013
iest-0.013
 size-0.013
Token and
Token ID290
Feature activation-2.09e-01

Pos logit contributions
+0.021
M+0.018
Mom+0.018
E+0.018
V+0.017
Neg logit contributions
ils-0.013
 happen-0.013
 powers-0.012
 falling-0.012
 coins-0.012
Token ran
Token ID4966
Feature activation+1.23e+00

Pos logit contributions
ier+0.008
 size+0.008
 anywhere+0.008
iness+0.008
ist+0.008
Neg logit contributions
-0.007
 From-0.006
 The-0.006
V-0.006
M-0.006
Token to
Token ID284
Feature activation+1.20e+00

Pos logit contributions
V+0.065
+0.064
E+0.061
 While+0.060
Look+0.059
Neg logit contributions
ier-0.022
 size-0.022
 wouldn-0.021
iness-0.019
ist-0.018
Token pick
Token ID2298
Feature activation+1.57e+00

Pos logit contributions
+0.045
V+0.044
E+0.039
 Many+0.039
Look+0.038
Neg logit contributions
ier-0.045
 wouldn-0.039
iness-0.039
 size-0.038
ist-0.038
Token up
Token ID510
Feature activation-8.96e-01

Pos logit contributions
V+0.067
E+0.064
+0.063
Look+0.059
 Many+0.058
Neg logit contributions
 size-0.043
ier-0.043
 wouldn-0.040
 perfectly-0.038
 falling-0.038
Token the
Token ID262
Feature activation+1.58e+00

Pos logit contributions
ier+0.044
 size+0.042
ist+0.041
 wouldn+0.041
castle+0.040
Neg logit contributions
-0.026
 The-0.023
E-0.019
 Many-0.018
L-0.017
Token l
Token ID300
Feature activation+1.98e+00

Pos logit contributions
V+0.078
E+0.074
+0.071
Look+0.070
 Many+0.068
Neg logit contributions
ier-0.035
 size-0.030
 wouldn-0.030
 away-0.029
 enough-0.029
Tokenily
Token ID813
Feature activation-2.51e+00

Pos logit contributions
V+0.053
E+0.049
+0.044
Look+0.043
 Many+0.042
Neg logit contributions
ier-0.083
 size-0.083
 away-0.082
 enough-0.081
 wouldn-0.079
TokenOnce
Token ID7454
Feature activation+5.11e+00

Pos logit contributions
V+0.105
E+0.096
 Many+0.091
 While+0.082
Br+0.082
Neg logit contributions
 in-0.245
 on-0.241
.-0.239
!-0.234
 again-0.231
Token upon
Token ID2402
Feature activation+4.07e+00

Pos logit contributions
V+0.274
E+0.258
Look+0.249
Wow+0.236
O+0.235
Neg logit contributions
 for-0.128
ies-0.109
 in-0.104
,-0.101
 away-0.099
Token a
Token ID257
Feature activation+4.88e+00

Pos logit contributions
V+0.225
E+0.210
 Later+0.203
Kids+0.202
Look+0.200
Neg logit contributions
 in-0.049
 for-0.047
 enough-0.047
 away-0.045
 as-0.042
Token time
Token ID640
Feature activation+3.88e+00

Pos logit contributions
V+0.282
E+0.267
Wow+0.263
Look+0.262
 Glass+0.254
Neg logit contributions
 for-0.057
 bit-0.052
 size-0.051
!-0.047
 forever-0.047
Token,
Token ID11
Feature activation+3.22e+00

Pos logit contributions
V+0.281
E+0.268
Look+0.261
Wow+0.251
Here+0.248
Neg logit contributions
 for-0.006
,-0.000
 in+0.001
 enough+0.003
 perfectly+0.008
Token there
Token ID612
Feature activation+3.48e+00

Pos logit contributions
V+0.184
E+0.175
 Many+0.164
Look+0.163
Kids+0.162
Neg logit contributions
 in-0.044
 away-0.043
 for-0.042
 enough-0.037
 perfectly-0.037
Token was
Token ID373
Feature activation+4.23e+00

Pos logit contributions
V+0.197
E+0.187
 Many+0.178
Look+0.172
 While+0.172
Neg logit contributions
 size-0.043
 wouldn-0.040
 away-0.040
 perfectly-0.038
 enough-0.037
Token a
Token ID257
Feature activation+3.13e+00

Pos logit contributions
V+0.253
Kids+0.239
E+0.236
Cat+0.229
Jen+0.228
Neg logit contributions
 in-0.040
 enough-0.039
 for-0.036
 as-0.029
 more-0.027
Token little
Token ID1310
Feature activation+1.02e+00

Pos logit contributions
V+0.143
E+0.135
Look+0.129
 Many+0.128
 While+0.126
Neg logit contributions
 size-0.069
 away-0.065
 airplane-0.065
 for-0.065
 crown-0.064
Token boy
Token ID2933
Feature activation+3.30e+00

Pos logit contributions
V+0.034
E+0.033
+0.031
Look+0.029
Wow+0.029
Neg logit contributions
ier-0.037
 wouldn-0.036
 perfectly-0.035
 size-0.034
 away-0.034
Token named
Token ID3706
Feature activation+3.84e+00

Pos logit contributions
V+0.156
E+0.148
Look+0.139
Wow+0.138
 Many+0.134
Neg logit contributions
 for-0.077
 size-0.075
 in-0.075
 away-0.074
 bit-0.071
Token called
Token ID1444
Feature activation+2.59e+00

Pos logit contributions
V+0.117
E+0.111
Look+0.104
Wow+0.101
 Many+0.101
Neg logit contributions
 size-0.075
 away-0.075
 for-0.072
 enough-0.070
ier-0.070
Token Sam
Token ID3409
Feature activation+1.57e+00

Pos logit contributions
V+0.072
E+0.066
Look+0.060
+0.056
Wow+0.055
Neg logit contributions
ier-0.106
 size-0.106
 enough-0.105
 away-0.104
 for-0.102
Token.
Token ID13
Feature activation+4.98e+00

Pos logit contributions
+0.079
V+0.079
E+0.076
Look+0.071
L+0.070
Neg logit contributions
ier-0.031
 size-0.027
 luck-0.026
 enough-0.026
 coins-0.026
Token He
Token ID679
Feature activation+2.35e+00

Pos logit contributions
E+0.256
Pe+0.206
Gr+0.203
V+0.201
Baby+0.198
Neg logit contributions
 for-0.135
 size-0.127
et-0.125
 made-0.125
!-0.118
Token was
Token ID373
Feature activation+2.20e+00

Pos logit contributions
V+0.106
E+0.101
Look+0.094
+0.093
 Many+0.092
Neg logit contributions
ier-0.056
 size-0.056
 away-0.054
 enough-0.053
 wouldn-0.052
Token three
Token ID1115
Feature activation+6.49e-01

Pos logit contributions
V+0.110
E+0.105
+0.099
Look+0.099
 Many+0.097
Neg logit contributions
ier-0.043
 size-0.041
 away-0.040
 enough-0.040
 perfectly-0.037
Token years
Token ID812
Feature activation+8.96e-01

Pos logit contributions
E+0.026
+0.026
V+0.025
 Many+0.025
L+0.023
Neg logit contributions
ier-0.018
 wand-0.017
 wouldn-0.017
 away-0.017
work-0.017
Token old
Token ID1468
Feature activation+1.54e+00

Pos logit contributions
+0.032
E+0.031
V+0.031
 Many+0.031
Mom+0.028
Neg logit contributions
 wouldn-0.029
ier-0.028
 away-0.028
 perfectly-0.027
 size-0.027
Token and
Token ID290
Feature activation+2.18e+00

Pos logit contributions
V+0.079
E+0.078
+0.078
 While+0.072
Look+0.072
Neg logit contributions
ier-0.027
 away-0.024
 size-0.024
 perfectly-0.024
 wouldn-0.023
Token loved
Token ID6151
Feature activation+4.80e-01

Pos logit contributions
V+0.094
E+0.089
+0.084
Look+0.083
 Many+0.081
Neg logit contributions
ier-0.058
 size-0.056
 away-0.055
 enough-0.055
 perfectly-0.052
Token to
Token ID284
Feature activation+3.12e-01

Pos logit contributions
+0.020
V+0.019
E+0.018
 While+0.018
 Many+0.018
Neg logit contributions
ier-0.016
ist-0.015
ener-0.014
iness-0.013
 perfectly-0.013
Token anything
Token ID1997
Feature activation-4.29e+00

Pos logit contributions
castle+0.107
ier+0.100
drops+0.097
work+0.094
nuts+0.093
Neg logit contributions
-0.063
 The-0.049
E-0.048
V-0.045
 It-0.043
Token.
Token ID13
Feature activation+5.19e+00

Pos logit contributions
castle+0.182
 crown+0.174
urger+0.173
et+0.170
isher+0.170
Neg logit contributions
-0.160
-0.106
 The-0.105
 Sometimes-0.098
 It-0.095
Token<|endoftext|>
Token ID50256
Feature activation+4.44e+00

Pos logit contributions
V+0.129
E+0.112
O+0.093
Wow+0.089
Look+0.088
Neg logit contributions
 enough-0.218
 for-0.211
 forever-0.211
work-0.207
 from-0.206
TokenOnce
Token ID7454
Feature activation+3.99e+00

Pos logit contributions
V+0.083
E+0.076
 Many+0.066
 While+0.060
Br+0.059
Neg logit contributions
 in-0.213
 on-0.210
 away-0.208
 for-0.206
!-0.204
Token upon
Token ID2402
Feature activation+3.79e+00

Pos logit contributions
V+0.192
E+0.180
Look+0.173
Wow+0.162
O+0.161
Neg logit contributions
 for-0.104
 enough-0.095
 away-0.093
ier-0.093
ies-0.091
Token a
Token ID257
Feature activation+4.34e+00

Pos logit contributions
V+0.201
E+0.188
Look+0.182
Kids+0.180
 Later+0.179
Neg logit contributions
 enough-0.052
 for-0.051
 in-0.049
 away-0.049
 size-0.046
Token time
Token ID640
Feature activation+2.83e+00

Pos logit contributions
V+0.239
Look+0.223
E+0.222
Wow+0.218
 Many+0.211
Neg logit contributions
 size-0.061
 for-0.061
 enough-0.058
 forever-0.057
 bit-0.056
Token,
Token ID11
Feature activation+2.97e+00

Pos logit contributions
V+0.176
E+0.168
Look+0.161
 Many+0.156
Wow+0.156
Neg logit contributions
ier-0.021
 enough-0.020
 for-0.019
 size-0.019
 away-0.019
Token there
Token ID612
Feature activation+2.95e+00

Pos logit contributions
V+0.158
E+0.150
Look+0.141
 Many+0.141
 While+0.137
Neg logit contributions
 away-0.046
 for-0.044
 enough-0.044
 in-0.043
 perfectly-0.043
Token was
Token ID373
Feature activation+4.01e+00

Pos logit contributions
V+0.153
E+0.145
 Many+0.136
Look+0.135
 While+0.133
Neg logit contributions
 size-0.050
 away-0.048
 enough-0.047
 wouldn-0.047
ier-0.046
Token a
Token ID257
Feature activation+3.24e+00

Pos logit contributions
V+0.226
E+0.212
Kids+0.208
Jen+0.202
 Later+0.201
Neg logit contributions
 enough-0.046
 in-0.045
 for-0.042
 as-0.036
 size-0.033
Token You
Token ID921
Feature activation-5.99e-01

Pos logit contributions
V+0.090
E+0.088
Look+0.075
O+0.074
+0.068
Neg logit contributions
 enough-0.131
 for-0.131
 away-0.131
 wouldn-0.130
 size-0.128
Token can
Token ID460
Feature activation+5.36e-01

Pos logit contributions
ier+0.024
 size+0.023
ening+0.022
ering+0.022
rier+0.022
Neg logit contributions
-0.019
V-0.018
 Many-0.017
 While-0.016
 It-0.015
Token dance
Token ID9280
Feature activation-1.20e+00

Pos logit contributions
V+0.021
+0.020
E+0.018
Mom+0.017
 Many+0.017
Neg logit contributions
ier-0.019
 coins-0.018
ener-0.017
 compared-0.017
iness-0.017
Token there
Token ID612
Feature activation-4.13e+00

Pos logit contributions
ener+0.040
iness+0.040
 size+0.039
 supervision+0.039
ier+0.039
Neg logit contributions
-0.053
V-0.050
E-0.043
L-0.042
 Whe-0.042
Token too
Token ID1165
Feature activation+4.54e-01

Pos logit contributions
ener+0.172
ened+0.167
lers+0.165
 breaths+0.164
eking+0.162
Neg logit contributions
-0.165
 The-0.102
 -0.102
 One-0.100
One-0.099
Token!"
Token ID2474
Feature activation+3.44e+00

Pos logit contributions
+0.019
V+0.018
E+0.017
 While+0.016
 Many+0.016
Neg logit contributions
ener-0.013
 wouldn-0.013
ier-0.013
 coins-0.013
 size-0.013
Token Sammy
Token ID44067
Feature activation-2.56e-01

Pos logit contributions
V+0.129
E+0.123
Look+0.101
 Many+0.099
Mom+0.098
Neg logit contributions
 away-0.118
 for-0.116
 wouldn-0.111
 enough-0.109
!-0.106
Token was
Token ID373
Feature activation+1.10e+00

Pos logit contributions
ier+0.010
 anywhere+0.009
 falling+0.009
 possible+0.009
 luck+0.009
Neg logit contributions
-0.009
V-0.009
E-0.008
 While-0.008
M-0.008
Token so
Token ID523
Feature activation+1.91e+00

Pos logit contributions
V+0.050
+0.047
E+0.044
 While+0.042
Look+0.042
Neg logit contributions
ier-0.031
iness-0.028
 airplane-0.028
ist-0.028
 size-0.028
Token excited
Token ID6568
Feature activation-1.01e+00

Pos logit contributions
V+0.077
E+0.073
+0.071
Look+0.067
 Many+0.066
Neg logit contributions
ier-0.058
 size-0.056
 away-0.054
 enough-0.054
 airplane-0.053
Token,
Token ID11
Feature activation-6.90e-01

Pos logit contributions
 wand+0.040
 prope+0.037
 heaviest+0.036
ps+0.036
 wil+0.036
Neg logit contributions
-0.041
 She-0.028
 But-0.028
 When-0.027
 The-0.027
Token so
Token ID523
Feature activation+1.70e+00

Pos logit contributions
V+0.037
+0.036
Look+0.034
 While+0.034
 Many+0.033
Neg logit contributions
ier-0.020
 coins-0.019
 size-0.019
ener-0.018
iness-0.018
Token excited
Token ID6568
Feature activation-1.12e+00

Pos logit contributions
V+0.065
E+0.062
+0.061
 Many+0.058
Look+0.058
Neg logit contributions
ier-0.053
 size-0.051
 away-0.050
 wand-0.048
 coins-0.048
Token for
Token ID329
Feature activation+2.23e-01

Pos logit contributions
 wand+0.040
ier+0.039
 heaviest+0.038
ener+0.038
ps+0.038
Neg logit contributions
-0.046
 She-0.036
 The-0.036
 On-0.035
 When-0.035
Token her
Token ID607
Feature activation-1.13e+00

Pos logit contributions
+0.007
 Many+0.006
V+0.006
 The+0.006
 In+0.006
Neg logit contributions
ier-0.010
ener-0.010
rier-0.010
ache-0.009
iness-0.009
Token special
Token ID2041
Feature activation-2.74e+00

Pos logit contributions
ier+0.053
et+0.043
rier+0.042
ist+0.042
iness+0.041
Neg logit contributions
-0.040
 Many-0.035
V-0.034
 The-0.032
 While-0.031
Token day
Token ID1110
Feature activation-3.21e+00

Pos logit contributions
ier+0.145
rier+0.120
eking+0.110
 able+0.109
 wouldn+0.109
Neg logit contributions
-0.083
 Many-0.074
 Today-0.074
V-0.073
<-0.069
Token.
Token ID13
Feature activation+2.98e+00

Pos logit contributions
ps+0.155
ener+0.145
 ring+0.141
cut+0.140
et+0.140
Neg logit contributions
-0.116
 The-0.080
 On-0.075
 Her-0.075
 She-0.075
Token She
Token ID1375
Feature activation-1.83e+00

Pos logit contributions
V+0.062
E+0.056
Look+0.047
O+0.045
Wow+0.041
Neg logit contributions
 enough-0.141
 size-0.140
 for-0.137
 away-0.136
ier-0.134
Token was
Token ID373
Feature activation-6.57e-01

Pos logit contributions
ier+0.094
rier+0.091
 attention+0.079
iness+0.079
 unatt+0.076
Neg logit contributions
-0.059
V-0.051
 Now-0.049
 While-0.049
 From-0.049
Token wearing
Token ID5762
Feature activation-6.51e-01

Pos logit contributions
iness+0.026
ier+0.025
eness+0.025
fish+0.024
ener+0.024
Neg logit contributions
-0.024
V-0.022
 The-0.022
 While-0.021
 Many-0.020
Token the
Token ID262
Feature activation-1.77e+00

Pos logit contributions
ier+0.036
ener+0.033
iness+0.033
rying+0.033
owing+0.031
Neg logit contributions
V-0.015
-0.014
E-0.013
Look-0.013
 The-0.012
Token he
Token ID339
Feature activation-2.58e+00

Pos logit contributions
ist+0.180
ising+0.165
ener+0.159
ier+0.158
eness+0.157
Neg logit contributions
-0.066
 The-0.051
 It-0.041
 His-0.039
 Tim-0.039
Token found
Token ID1043
Feature activation-2.67e+00

Pos logit contributions
rier+0.123
pan+0.122
ier+0.107
 airplane+0.106
 attention+0.104
Neg logit contributions
-0.077
-0.060
I-0.055
 The-0.053
 Next-0.050
Token it
Token ID340
Feature activation-4.07e+00

Pos logit contributions
drops+0.133
ist+0.126
iness+0.121
ishly+0.120
 forth+0.120
Neg logit contributions
-0.084
-0.057
 The-0.055
 It-0.052
 -0.043
Token.
Token ID13
Feature activation+4.55e+00

Pos logit contributions
 breaths+0.180
rier+0.177
isher+0.173
 passports+0.173
drops+0.171
Neg logit contributions
-0.157
-0.099
 The-0.088
 One-0.086
 In-0.084
Token And
Token ID843
Feature activation-1.20e+00

Pos logit contributions
V+0.114
E+0.105
Look+0.093
Mom+0.082
O+0.081
Neg logit contributions
 for-0.195
 enough-0.187
!-0.177
 forever-0.177
 again-0.175
Token he
Token ID339
Feature activation-1.63e+00

Pos logit contributions
ier+0.054
ist+0.053
et+0.051
eking+0.050
pan+0.050
Neg logit contributions
-0.044
 From-0.034
 The-0.033
 When-0.030
 -0.030
Token would
Token ID561
Feature activation-2.34e+00

Pos logit contributions
rier+0.070
eking+0.066
 attention+0.066
ier+0.064
ly+0.062
Neg logit contributions
-0.056
 From-0.050
 Now-0.043
From-0.042
M-0.041
Token always
Token ID1464
Feature activation-3.11e+00

Pos logit contributions
ier+0.119
led+0.113
ener+0.101
ering+0.101
rier+0.100
Neg logit contributions
-0.082
 One-0.069
 -0.066
 The-0.062
 She-0.059
Token love
Token ID1842
Feature activation-1.91e+00

Pos logit contributions
ier+0.166
led+0.147
cases+0.147
rier+0.145
iness+0.144
Neg logit contributions
-0.081
 One-0.059
 But-0.056
 -0.053
 see-0.047
Token bananas
Token ID35484
Feature activation-3.74e+00

Pos logit contributions
ier+0.089
iness+0.085
ist+0.082
ener+0.078
iful+0.078
Neg logit contributions
-0.069
 The-0.047
Tim-0.041
 -0.040
 It-0.040
Token.
Token ID13
Feature activation+5.04e+00

Pos logit contributions
ened+0.169
 breaths+0.168
ening+0.164
 size+0.151
isher+0.149
Neg logit contributions
-0.131
 And-0.075
 The-0.073
 He-0.070
 She-0.069
Token went
Token ID1816
Feature activation-1.14e+00

Pos logit contributions
work+0.051
fly+0.050
ist+0.050
 powers+0.049
flies+0.049
Neg logit contributions
 Many-0.032
-0.028
L-0.027
Tim-0.027
Tom-0.026
Token home
Token ID1363
Feature activation-2.50e+00

Pos logit contributions
iness+0.048
 size+0.046
 sizes+0.044
itive+0.044
 pounds+0.044
Neg logit contributions
-0.042
V-0.033
 Now-0.033
 While-0.032
 Soon-0.032
Token,
Token ID11
Feature activation-1.55e+00

Pos logit contributions
et+0.110
iful+0.106
 heaviest+0.104
le+0.103
iest+0.102
Neg logit contributions
-0.100
 From-0.071
 The-0.070
 -0.069
 And-0.064
Token never
Token ID1239
Feature activation-2.73e+00

Pos logit contributions
ier+0.074
iest+0.063
 powers+0.063
iful+0.060
ils+0.058
Neg logit contributions
-0.063
 -0.056
 From-0.051
 The-0.050
 While-0.046
Token to
Token ID284
Feature activation-2.29e+00

Pos logit contributions
rier+0.126
iest+0.125
cards+0.122
ler+0.120
 kernels+0.119
Neg logit contributions
-0.076
 Lily-0.059
 The-0.059
 Tim-0.053
 What-0.052
Token play
Token ID711
Feature activation-2.06e+00

Pos logit contributions
led+0.119
ier+0.117
cards+0.107
rier+0.105
lers+0.103
Neg logit contributions
-0.060
Tim-0.053
Mom-0.051
 One-0.048
L-0.048
Token together
Token ID1978
Feature activation-5.12e+00

Pos logit contributions
i+0.086
ener+0.083
iness+0.082
iful+0.081
iously+0.080
Neg logit contributions
-0.072
Tim-0.059
L-0.058
 The-0.057
 It-0.056
Token again
Token ID757
Feature activation-3.10e+00

Pos logit contributions
lers+0.233
cards+0.224
 breaths+0.219
tails+0.219
 powers+0.218
Neg logit contributions
-0.168
Once-0.109
 -0.101
 The-0.096
 Lily-0.088
Token.
Token ID13
Feature activation+5.21e+00

Pos logit contributions
 wand+0.136
ener+0.135
 crown+0.131
ier+0.130
isher+0.130
Neg logit contributions
-0.120
 The-0.075
-0.073
M-0.066
 They-0.065
Token<|endoftext|>
Token ID50256
Feature activation+4.28e+00

Pos logit contributions
V+0.141
E+0.118
O+0.104
Look+0.103
Quick+0.097
Neg logit contributions
 for-0.219
 forever-0.209
 ever-0.204
 enough-0.200
 again-0.199
TokenSam
Token ID16305
Feature activation-1.68e+00

Pos logit contributions
V+0.079
E+0.071
 Many+0.060
 While+0.054
Br+0.052
Neg logit contributions
 in-0.208
 away-0.206
 on-0.204
 for-0.203
 enough-0.201
Token about
Token ID546
Feature activation-3.31e+00

Pos logit contributions
lers+0.079
ener+0.077
sters+0.077
iness+0.077
fish+0.075
Neg logit contributions
-0.051
V-0.042
 Lily-0.036
 The-0.036
 When-0.035
Token colors
Token ID7577
Feature activation-5.57e+00

Pos logit contributions
ener+0.171
ier+0.169
ler+0.169
le+0.166
ist+0.161
Neg logit contributions
-0.109
 Many-0.070
 Whe-0.069
 The-0.069
L-0.065
Token and
Token ID290
Feature activation-3.12e+00

Pos logit contributions
 permission+0.269
 prisoner+0.263
 breaths+0.262
 thieves+0.259
 wounds+0.254
Neg logit contributions
-0.145
 She-0.127
 It-0.113
 They-0.086
 The-0.083
Token shapes
Token ID15268
Feature activation-6.57e+00

Pos logit contributions
owing+0.140
ier+0.134
rying+0.134
 away+0.126
icking+0.126
Neg logit contributions
-0.104
 It-0.088
 Many-0.086
 Lily-0.085
Wow-0.084
Token,
Token ID11
Feature activation-2.91e+00

Pos logit contributions
urger+0.311
 property+0.310
 thieves+0.301
ener+0.298
 injuries+0.297
Neg logit contributions
-0.178
 She-0.150
 It-0.118
 They-0.109
 Her-0.102
Token and
Token ID290
Feature activation-3.55e+00

Pos logit contributions
ier+0.131
owing+0.128
cher+0.126
ils+0.126
rying+0.120
Neg logit contributions
-0.091
 It-0.078
 Some-0.078
 The-0.075
 Many-0.074
Token Lily
Token ID20037
Feature activation-2.41e+00

Pos logit contributions
owing+0.182
rying+0.167
ier+0.161
ist+0.159
eking+0.155
Neg logit contributions
 Lily-0.102
-0.100
 Some-0.094
 Many-0.094
Wow-0.092
Token loved
Token ID6151
Feature activation-1.77e+00

Pos logit contributions
rier+0.123
 attention+0.117
 thieves+0.114
ier+0.113
 pests+0.113
Neg logit contributions
-0.068
 From-0.051
V-0.049
 -0.048
M-0.045
Token colors
Token ID7577
Feature activation-3.56e+00

Pos logit contributions
ier+0.079
ist+0.073
iness+0.071
ener+0.071
icking+0.070
Neg logit contributions
-0.061
V-0.048
Look-0.045
 Whe-0.043
 The-0.042
Token and
Token ID290
Feature activation-2.57e+00

Pos logit contributions
 supervision+0.173
 thieves+0.166
 stolen+0.165
 punished+0.165
 footing+0.164
Neg logit contributions
-0.099
 She-0.074
 It-0.065
It-0.055
 They-0.054
Token shapes
Token ID15268
Feature activation-5.28e+00

Pos logit contributions
ier+0.113
 unatt+0.103
rying+0.100
ener+0.099
ist+0.098
Neg logit contributions
-0.089
 It-0.076
 They-0.075
 Many-0.074
 The-0.073
Token but
Token ID475
Feature activation-1.31e-03

Pos logit contributions
+0.002
E+0.002
Mom+0.002
V+0.002
L+0.002
Neg logit contributions
work-0.002
ier-0.002
lers-0.002
 powers-0.002
ils-0.002
Token you
Token ID345
Feature activation+1.37e+00

Pos logit contributions
ier+0.000
ache+0.000
 away+0.000
et+0.000
iness+0.000
Neg logit contributions
-0.000
V-0.000
Look-0.000
 Next-0.000
Mom-0.000
Token should
Token ID815
Feature activation+8.44e-01

Pos logit contributions
V+0.053
+0.050
E+0.050
 Many+0.046
 While+0.046
Neg logit contributions
ier-0.043
 size-0.041
 away-0.039
 luck-0.038
 enough-0.038
Token not
Token ID407
Feature activation+8.64e-02

Pos logit contributions
+0.032
V+0.031
E+0.029
Mom+0.029
Look+0.027
Neg logit contributions
ier-0.027
 airplane-0.024
 compared-0.024
ener-0.024
 enough-0.024
Token use
Token ID779
Feature activation-4.81e-01

Pos logit contributions
+0.003
V+0.003
Tim+0.003
Mom+0.003
 Many+0.003
Neg logit contributions
ier-0.003
 wand-0.003
led-0.003
 airplane-0.003
 perfectly-0.003
Token tape
Token ID9154
Feature activation-3.26e+00

Pos logit contributions
ier+0.022
iness+0.021
castle+0.019
ener+0.019
 expert+0.019
Neg logit contributions
-0.015
V-0.013
E-0.012
 The-0.011
 Sometimes-0.011
Token on
Token ID319
Feature activation-1.23e+00

Pos logit contributions
 gob+0.121
 journey+0.120
 feast+0.120
 explorer+0.119
 connection+0.118
Neg logit contributions
-0.110
Mom-0.082
Tom-0.079
 Tom-0.072
 If-0.070
Token the
Token ID262
Feature activation-2.11e+00

Pos logit contributions
ier+0.053
 expert+0.048
ener+0.048
iness+0.048
work+0.046
Neg logit contributions
-0.041
E-0.036
V-0.035
Mom-0.032
 Glass-0.032
Token blankets
Token ID34794
Feature activation-4.52e+00

Pos logit contributions
iness+0.099
ier+0.099
 able+0.091
i+0.090
owing+0.090
Neg logit contributions
V-0.056
-0.050
Mom-0.048
 Water-0.047
Max-0.046
Token.
Token ID13
Feature activation+5.60e+00

Pos logit contributions
 ringing+0.190
 powers+0.189
 lit+0.188
 soaring+0.188
eking+0.188
Neg logit contributions
-0.143
Mom-0.091
L-0.084
 If-0.081
 They-0.079
Token It
Token ID632
Feature activation-1.92e-01

Pos logit contributions
V+0.162
E+0.161
Look+0.144
Br+0.139
O+0.139
Neg logit contributions
 enough-0.184
 for-0.179
 or-0.173
 like-0.172
 than-0.169
Token shouted
Token ID17293
Feature activation-1.32e-01

Pos logit contributions
 luck+0.018
ier+0.018
 size+0.017
iness+0.017
 anywhere+0.017
Neg logit contributions
-0.013
V-0.011
E-0.010
 So-0.010
 Many-0.010
Token.
Token ID13
Feature activation+3.48e+00

Pos logit contributions
 size+0.004
ier+0.003
 island+0.003
 powers+0.003
ils+0.003
Neg logit contributions
-0.006
V-0.005
 The-0.005
E-0.005
 Many-0.005
Token "
Token ID366
Feature activation+3.15e+00

Pos logit contributions
E+0.083
V+0.082
Look+0.065
Wow+0.063
O+0.062
Neg logit contributions
 enough-0.146
 away-0.146
 for-0.145
 size-0.141
 wouldn-0.141
TokenThe
Token ID464
Feature activation-1.22e+00

Pos logit contributions
V+0.063
E+0.057
 While+0.051
 Later+0.051
+0.048
Neg logit contributions
 away-0.142
 size-0.138
ier-0.138
 perfectly-0.135
 enough-0.133
Token red
Token ID2266
Feature activation-2.22e+00

Pos logit contributions
 wouldn+0.056
ist+0.056
ier+0.056
eking+0.054
ering+0.054
Neg logit contributions
V-0.038
-0.034
 Many-0.033
Mom-0.031
 Some-0.029
Token team
Token ID1074
Feature activation-6.04e+00

Pos logit contributions
 secure+0.110
eking+0.107
 wouldn+0.101
 able+0.101
 ling+0.099
Neg logit contributions
-0.053
 Many-0.051
 Everyone-0.046
 It-0.046
 The-0.045
Token is
Token ID318
Feature activation-1.98e+00

Pos logit contributions
 injuries+0.278
ier+0.271
eking+0.268
 unatt+0.267
 crops+0.264
Neg logit contributions
-0.160
 It-0.118
 Many-0.112
 They-0.111
 Everyone-0.105
Token stronger
Token ID7387
Feature activation-2.65e+00

Pos logit contributions
iness+0.091
ier+0.084
comb+0.084
iest+0.080
ener+0.079
Neg logit contributions
V-0.066
-0.066
 The-0.053
Look-0.051
 Many-0.050
Token and
Token ID290
Feature activation-2.39e+00

Pos logit contributions
 lit+0.112
eking+0.106
 wil+0.103
 veil+0.102
eling+0.100
Neg logit contributions
-0.091
Tom-0.064
V-0.062
 Now-0.061
 They-0.061
Token smarter
Token ID23714
Feature activation-3.66e+00

Pos logit contributions
iness+0.093
 veil+0.089
comb+0.086
 outfit+0.086
ils+0.084
Neg logit contributions
V-0.091
-0.084
 Sometimes-0.074
Wow-0.069
It-0.067
Token.
Token ID13
Feature activation+3.71e+00

Pos logit contributions
 lit+0.179
 wil+0.172
 dripping+0.166
eking+0.165
 veil+0.165
Neg logit contributions
-0.105
 They-0.060
 Now-0.059
Tom-0.057
"-0.057
Token to
Token ID284
Feature activation-2.44e+00

Pos logit contributions
iness+0.121
ler+0.119
et+0.114
iful+0.113
le+0.113
Neg logit contributions
-0.062
 The-0.044
 It-0.042
 But-0.038
 Inside-0.037
Token put
Token ID1234
Feature activation-1.05e+00

Pos logit contributions
ier+0.118
 compared+0.101
 dollars+0.099
iness+0.099
 pounds+0.098
Neg logit contributions
-0.076
V-0.063
 The-0.062
 It-0.061
 Many-0.061
Token lights
Token ID7588
Feature activation-5.86e+00

Pos logit contributions
ache+0.045
ier+0.042
 size+0.041
iness+0.039
 affect+0.038
Neg logit contributions
-0.036
Look-0.030
 -0.030
-0.029
 The-0.029
Token around
Token ID1088
Feature activation-3.14e+00

Pos logit contributions
eling+0.271
 mistake+0.266
ils+0.265
 appetite+0.261
 decision+0.258
Neg logit contributions
-0.143
 It-0.097
 Inside-0.096
 The-0.089
 She-0.085
Token the
Token ID262
Feature activation+1.50e+00

Pos logit contributions
ache+0.179
ier+0.170
 luck+0.150
ener+0.149
 infection+0.147
Neg logit contributions
-0.079
 The-0.060
 It-0.047
 Tim-0.047
 -0.046
Token tank
Token ID6873
Feature activation-6.24e+00

Pos logit contributions
V+0.063
+0.060
E+0.060
Look+0.057
 Many+0.056
Neg logit contributions
ier-0.043
 enough-0.040
 wouldn-0.039
 size-0.038
 luck-0.038
Token.
Token ID13
Feature activation+2.06e+00

Pos logit contributions
 groom+0.292
eling+0.291
 believing+0.276
ises+0.271
 bothering+0.271
Neg logit contributions
-0.196
 -0.146
 The-0.126
 They-0.123
 He-0.113
Token When
Token ID1649
Feature activation-2.63e-01

Pos logit contributions
V+0.026
E+0.022
+0.019
Look+0.016
 Many+0.016
Neg logit contributions
ier-0.119
 size-0.117
 away-0.115
 enough-0.114
 luck-0.113
Token they
Token ID484
Feature activation+4.13e-02

Pos logit contributions
ist+0.014
et+0.014
ache+0.014
ener+0.013
ier+0.013
Neg logit contributions
-0.007
 The-0.006
 Soon-0.005
 -0.005
Look-0.005
Token turned
Token ID2900
Feature activation-1.20e+00

Pos logit contributions
+0.001
V+0.001
E+0.001
O+0.001
 While+0.001
Neg logit contributions
rier-0.002
ier-0.001
 size-0.001
 powers-0.001
 than-0.001
Token them
Token ID606
Feature activation-3.36e+00

Pos logit contributions
 size+0.047
 property+0.045
comb+0.044
ils+0.043
ist+0.042
Neg logit contributions
-0.046
 The-0.039
 It-0.037
 -0.036
 Boom-0.035
Token to
Token ID284
Feature activation-4.66e-01

Pos logit contributions
eling+0.191
itive+0.190
 thieves+0.187
 groom+0.180
 singers+0.180
Neg logit contributions
-0.148
 -0.120
 The-0.119
 When-0.114
 Suddenly-0.109
Token either
Token ID2035
Feature activation+2.78e-01

Pos logit contributions
ier+0.021
iness+0.020
ies+0.019
flies+0.019
lers+0.019
Neg logit contributions
-0.014
V-0.014
E-0.012
 The-0.012
Tim-0.012
Token side
Token ID1735
Feature activation-4.08e+00

Pos logit contributions
+0.012
V+0.010
Mom+0.010
 The+0.010
 While+0.010
Neg logit contributions
ils-0.009
ier-0.009
 bothering-0.008
ache-0.008
 crown-0.008
Token of
Token ID286
Feature activation-1.22e+00

Pos logit contributions
ils+0.171
 veil+0.166
 wedding+0.160
 groom+0.156
 married+0.155
Neg logit contributions
-0.158
 The-0.131
 -0.123
 It-0.117
 He-0.110
Token the
Token ID262
Feature activation+9.71e-02

Pos logit contributions
ier+0.061
flies+0.059
ache+0.059
ils+0.058
ies+0.058
Neg logit contributions
-0.033
 -0.024
 The-0.024
Tim-0.022
 Glass-0.022
Token bridge
Token ID7696
Feature activation-6.09e+00

Pos logit contributions
V+0.003
+0.003
Mom+0.003
Three+0.003
Tim+0.003
Neg logit contributions
ier-0.004
iness-0.004
ils-0.004
led-0.004
ist-0.004
Token,
Token ID11
Feature activation-2.53e+00

Pos logit contributions
 veil+0.270
 pearl+0.264
 luck+0.261
ils+0.258
 groom+0.258
Neg logit contributions
-0.225
 -0.187
 The-0.176
 It-0.146
 But-0.145
Token so
Token ID523
Feature activation-8.69e-01

Pos logit contributions
ils+0.130
ier+0.130
iness+0.122
iest+0.121
fish+0.120
Neg logit contributions
-0.080
 -0.066
 But-0.064
 The-0.064
 It-0.062
Token it
Token ID340
Feature activation-1.88e+00

Pos logit contributions
 wand+0.046
ist+0.043
ets+0.041
ils+0.041
 tale+0.041
Neg logit contributions
-0.025
Mom-0.019
 The-0.019
Dad-0.018
 When-0.018
Token formed
Token ID7042
Feature activation-2.81e+00

Pos logit contributions
ier+0.060
 hesitation+0.059
 attention+0.058
 luck+0.058
 appearance+0.057
Neg logit contributions
V-0.069
-0.064
O-0.062
E-0.060
L-0.059
Token a
Token ID257
Feature activation-2.39e+00

Pos logit contributions
 affect+0.147
 appearance+0.144
 wedding+0.144
 tale+0.142
 collection+0.140
Neg logit contributions
-0.062
 The-0.051
E-0.047
V-0.043
Look-0.043
Token and
Token ID290
Feature activation-5.48e+00

Pos logit contributions
et+0.314
 breaths+0.310
ils+0.309
urger+0.303
 ic+0.302
Neg logit contributions
-0.293
 She-0.204
 -0.204
 The-0.192
The-0.169
Token bringing
Token ID6079
Feature activation-4.21e+00

Pos logit contributions
ier+0.223
mer+0.205
et+0.204
ist+0.204
le+0.204
Neg logit contributions
-0.193
 -0.153
 Now-0.145
 Next-0.144
 From-0.139
Token justice
Token ID5316
Feature activation-8.07e+00

Pos logit contributions
castle+0.195
comb+0.194
flies+0.189
fish+0.188
rying+0.188
Neg logit contributions
-0.129
 Everyone-0.109
 Many-0.099
 -0.095
E-0.094
Token to
Token ID284
Feature activation-2.86e+00

Pos logit contributions
eking+0.344
 forth+0.344
eling+0.339
 argument+0.337
 flood+0.335
Neg logit contributions
-0.265
-0.135
 She-0.129
 They-0.118
 The-0.113
Token different
Token ID1180
Feature activation-3.32e+00

Pos logit contributions
ener+0.129
ier+0.129
led+0.126
lers+0.126
fish+0.123
Neg logit contributions
-0.094
 Many-0.073
 The-0.064
 Everyone-0.063
 -0.059
Token parts
Token ID3354
Feature activation-7.18e+00

Pos logit contributions
drops+0.141
castle+0.140
 buried+0.137
comb+0.137
fish+0.137
Neg logit contributions
-0.109
 Everyone-0.087
 Many-0.081
 People-0.076
 The-0.073
Token of
Token ID286
Feature activation-3.30e+00

Pos logit contributions
 breaths+0.323
sed+0.322
 charged+0.316
 argued+0.313
ked+0.310
Neg logit contributions
-0.228
 She-0.138
 Everyone-0.132
 The-0.127
-0.116
Token the
Token ID262
Feature activation-1.82e+00

Pos logit contributions
ier+0.162
drops+0.155
ener+0.155
led+0.147
 wouldn+0.146
Neg logit contributions
-0.098
 Many-0.065
 -0.063
 The-0.060
Look-0.059
Token world
Token ID995
Feature activation-6.52e+00

Pos logit contributions
fish+0.089
ier+0.089
ies+0.087
flies+0.086
ist+0.086
Neg logit contributions
-0.044
 Many-0.038
Look-0.036
-0.033
 Next-0.032
Token.
Token ID13
Feature activation+3.29e+00

Pos logit contributions
 footing+0.273
 elbow+0.271
 size+0.264
 ankle+0.258
ener+0.258
Neg logit contributions
-0.237
 She-0.161
 The-0.159
 Everyone-0.150
-0.150
Token Whenever
Token ID21326
Feature activation-1.58e+00

Pos logit contributions
V+0.059
E+0.045
Look+0.035
O+0.032
Wow+0.031
Neg logit contributions
 enough-0.172
ier-0.168
 for-0.166
 away-0.164
 perfectly-0.164
Token around
Token ID1088
Feature activation-3.30e+00

Pos logit contributions
iness+0.068
ache+0.065
smanship+0.065
 possessions+0.065
 supervision+0.065
Neg logit contributions
-0.082
 It-0.064
 The-0.064
Look-0.063
E-0.061
Token in
Token ID287
Feature activation-9.19e-01

Pos logit contributions
ier+0.162
ache+0.162
iness+0.150
 dollars+0.148
icking+0.144
Neg logit contributions
-0.133
 The-0.095
 -0.091
-0.080
<|endoftext|>-0.078
Token the
Token ID262
Feature activation-1.75e+00

Pos logit contributions
ier+0.050
ache+0.050
 wouldn+0.048
ener+0.043
ist+0.043
Neg logit contributions
-0.031
Look-0.024
V-0.021
 The-0.021
 -0.021
Token spot
Token ID4136
Feature activation-4.70e+00

Pos logit contributions
ier+0.093
 wouldn+0.084
iness+0.081
 able+0.080
ist+0.079
Neg logit contributions
Look-0.050
-0.045
V-0.044
Mom-0.040
 The-0.039
Token of
Token ID286
Feature activation-1.31e+00

Pos logit contributions
ache+0.208
ience+0.204
 pointer+0.194
 airplane+0.194
 bills+0.190
Neg logit contributions
-0.161
 -0.112
 The-0.104
 Mia-0.102
M-0.098
Token sunshine
Token ID34488
Feature activation-6.19e+00

Pos logit contributions
ier+0.071
 wouldn+0.068
ache+0.067
ist+0.065
ener+0.062
Neg logit contributions
-0.037
Look-0.035
V-0.031
Tim-0.028
Tom-0.027
Token.
Token ID13
Feature activation+3.01e+00

Pos logit contributions
 punished+0.305
 disrespectful+0.302
 evil+0.302
 dishonest+0.298
 permission+0.297
Neg logit contributions
-0.160
 -0.089
 They-0.080
 She-0.078
-0.067
Token Suddenly
Token ID24975
Feature activation-2.82e+00

Pos logit contributions
V+0.048
E+0.041
Look+0.032
 Many+0.028
O+0.028
Neg logit contributions
 enough-0.154
 size-0.153
ier-0.153
 away-0.152
 perfectly-0.150
Token,
Token ID11
Feature activation-2.64e+00

Pos logit contributions
rying+0.119
work+0.116
ier+0.115
 size+0.115
ist+0.114
Neg logit contributions
-0.090
 The-0.077
-0.069
L-0.068
 It-0.067
Token he
Token ID339
Feature activation-2.42e+00

Pos logit contributions
ier+0.130
ist+0.129
ets+0.129
et+0.126
 size+0.124
Neg logit contributions
-0.068
 The-0.054
E-0.051
Wow-0.050
 Lily-0.050
Token wasn
Token ID2492
Feature activation+3.06e+00

Pos logit contributions
 choices+0.116
 happen+0.114
rier+0.113
ier+0.112
 attention+0.111
Neg logit contributions
-0.061
M-0.057
Sam-0.054
Ben-0.052
Mom-0.051