TOP ACTIVATIONS
MAX = 58.969

learned that courage is important . They learned that soccer is
but they were more careful . <|endoftext|> Anna and Ben were
with mom 's necklace again . They still played in the
ball . They are clever . They know how to share
Can we go outside now ?" Anna and Ben asked .
. Let 's play again ." They take their boats and
it is ! Follow me !" The squirrel led
their mom and thanked her . They went back
hats ! You ruined them !" The dog heard
. They are happy friends . <|endoftext|> L ily and Tom
They cleaned Lily 's finger . They put a band -
swim with you , duck !" And they sw am together
the comet and its difference . They felt a new friendship
that courage is important . They learned that soccer is fun
they had learned a lesson . They never touched the cord
told them she loved them . She told them they had
bad . It hurt me ." Anna said .
and happy and good things ," Sam says .
. They smiled and laughed . They had a great adventure
they could play outside again . He hoped they could be

MIN ACTIVATIONS
MIN = -188.858

his head out of his bur row and saw something new
children put on their bath ro bes and ran around in
reward . It was ch ocol ates and sweets !
't know the rh in oc eros was wild . Can
Everyday he would take the av oc ados and scatter them
he would take the av oc ados and scatter them around
When Daisy put the magn ifying glass down , the cater
give to the rh in oc eros . Lily walked to
quiet compared to the noisy se ag ull s outside .
her mom my watched in amaz ement as the machine flew
as he opened the ch ocol ates and started eating them
� � I â € ™ m sorry Jane . I
giving him too many ch ocol ates ." The king learned
you carry a l ily pad too ?" The
it to the rh in oc eros . When he tasted
there was a rh in oc eros who was always naughty
harder to push the wheel bar row . But her backyard
you go , my d arl ings . Enjoy your lunch
� � That â € ™ s a special necklace ,
am taking the rh in oc eros to a new home

INTERVAL -2.988 - 58.969
CONTAINS 56.567%

magic tree ?" His mum said , " Yes
day to pick flowers from her garden . One day ,
to an old , rusty shed . The mouse went inside
and filled the kitchen . The smoke alarm be ep ed
following the steam in the sky . At the

INTERVAL -64.945 - -2.988
CONTAINS 40.444%

. They match the colors of his suit . Red ,
to try the slide ?" The giant said ,
. He can 't hurt us ." L ily
they succeeded ! The rice turned out perfectly and everyone enjoyed
The bird thought and thought , and then it remembered the

INTERVAL -126.902 - -64.945
CONTAINS 2.982%

the game some more and Sam got very sleepy .
a big bed with soft pill ows and a big garden
of animals , fruits and shapes . One day , Tim
he could play with them too . They put the ship
him , even if it meant wearing a collar .

INTERVAL -188.858 - -126.902
CONTAINS 0.007%

When Daisy put the magn ifying glass down , the cater
children put on their bath ro bes and ran around in
he would take the av oc ados and scatter them around
her mom my watched in amaz ement as the machine flew
't know the rh in oc eros was wild . Can
Token learned
Token ID4499
Feature activation+5.19e+01

Pos logit contributions
Don+0.192
Grand+0.185
Me+0.180
books+0.174
Doctor+0.173
Neg logit contributions
 victory-0.217
 how-0.216
pieces-0.216
ieve-0.214
ap-0.213
Token that
Token ID326
Feature activation+5.40e+01

Pos logit contributions
Don+0.223
 She+0.218
 meant+0.196
Grand+0.195
Maybe+0.192
Neg logit contributions
ishing-0.230
ocol-0.227
pieces-0.220
ipping-0.214
immers-0.200
Token courage
Token ID11917
Feature activation+5.21e+01

Pos logit contributions
Don+0.278
 She+0.262
Me+0.254
chy+0.251
Grand+0.250
Neg logit contributions
ishing-0.173
pieces-0.163
ap-0.154
ipping-0.152
ocol-0.149
Token is
Token ID318
Feature activation+5.16e+01

Pos logit contributions
Don+0.184
 She+0.168
Me+0.161
chy+0.158
Grand+0.155
Neg logit contributions
ishing-0.252
pieces-0.242
ipping-0.231
ap-0.230
ocol-0.230
Token important
Token ID1593
Feature activation+5.13e+01

Pos logit contributions
Don+0.210
 She+0.186
 meant+0.185
chy+0.184
 needed+0.180
Neg logit contributions
ishing-0.228
pieces-0.217
ocol-0.214
ipping-0.214
ooming-0.204
Token.
Token ID13
Feature activation+5.78e+01

Pos logit contributions
Don+0.175
 She+0.163
 meant+0.148
chy+0.147
 needed+0.147
Neg logit contributions
ishing-0.253
pieces-0.250
ocol-0.240
ipping-0.236
ric-0.227
Token They
Token ID1119
Feature activation+5.63e+01

Pos logit contributions
chy+0.156
Don+0.154
Me+0.136
books+0.132
jar+0.129
Neg logit contributions
ap-0.305
ric-0.285
pieces-0.282
 our-0.278
ishing-0.277
Token learned
Token ID4499
Feature activation+5.55e+01

Pos logit contributions
Don+0.204
Grand+0.195
Me+0.192
Doctor+0.184
books+0.184
Neg logit contributions
 victory-0.231
pieces-0.229
 how-0.229
ieve-0.227
ap-0.226
Token that
Token ID326
Feature activation+5.73e+01

Pos logit contributions
Don+0.244
 She+0.237
Grand+0.215
 meant+0.214
Maybe+0.210
Neg logit contributions
ishing-0.240
ocol-0.233
pieces-0.228
ipping-0.222
immers-0.208
Token soccer
Token ID11783
Feature activation+5.25e+01

Pos logit contributions
Don+0.291
 She+0.274
Me+0.265
Grand+0.263
chy+0.263
Neg logit contributions
ishing-0.185
pieces-0.175
ap-0.169
ipping-0.163
ieve-0.160
Token is
Token ID318
Feature activation+5.51e+01

Pos logit contributions
Don+0.185
 She+0.167
Me+0.161
Grand+0.160
chy+0.157
Neg logit contributions
ishing-0.255
pieces-0.243
ap-0.235
ipping-0.233
ocol-0.229
Token but
Token ID475
Feature activation+5.39e+01

Pos logit contributions
Don+0.188
 She+0.172
Me+0.166
chy+0.161
Grand+0.161
Neg logit contributions
ishing-0.247
pieces-0.237
ap-0.226
ipping-0.225
ocol-0.223
Token they
Token ID484
Feature activation+5.25e+01

Pos logit contributions
Don+0.196
 She+0.178
Me+0.172
chy+0.167
Grand+0.165
Neg logit contributions
ishing-0.261
pieces-0.249
ipping-0.240
ap-0.237
ocol-0.236
Token were
Token ID547
Feature activation+5.02e+01

Pos logit contributions
Don+0.177
Grand+0.170
 She+0.163
Doctor+0.157
Me+0.154
Neg logit contributions
ishing-0.232
 how-0.228
pieces-0.226
 able-0.225
ap-0.222
Token more
Token ID517
Feature activation+5.10e+01

Pos logit contributions
Don+0.126
chy+0.118
 She+0.117
Me+0.111
 meant+0.107
Neg logit contributions
ishing-0.296
ocol-0.296
pieces-0.292
ipping-0.290
ooming-0.277
Token careful
Token ID8161
Feature activation+5.06e+01

Pos logit contributions
Don+0.215
 She+0.198
Grand+0.190
chy+0.188
Me+0.185
Neg logit contributions
ishing-0.214
pieces-0.213
ocol-0.199
ipping-0.198
ap-0.195
Token.
Token ID13
Feature activation+5.71e+01

Pos logit contributions
Don+0.149
 She+0.145
chy+0.131
Grand+0.124
 meant+0.123
Neg logit contributions
ocol-0.278
pieces-0.272
ishing-0.271
ipping-0.255
oured-0.253
Token<|endoftext|>
Token ID50256
Feature activation+5.49e+01

Pos logit contributions
chy+0.188
Don+0.164
ho+0.157
 woke+0.149
Me+0.148
Neg logit contributions
ap-0.265
 our-0.248
pieces-0.246
 how-0.246
ric-0.243
TokenAnna
Token ID31160
Feature activation+5.12e+01

Pos logit contributions
Don+0.181
chy+0.166
Grand+0.159
Me+0.153
 She+0.145
Neg logit contributions
ishing-0.264
ipping-0.259
ap-0.258
pieces-0.246
ocol-0.244
Token and
Token ID290
Feature activation+5.44e+01

Pos logit contributions
Don+0.208
Me+0.185
Wow+0.185
Doctor+0.180
Grand+0.180
Neg logit contributions
ishing-0.213
ap-0.190
ish-0.190
 how-0.186
ipping-0.181
Token Ben
Token ID3932
Feature activation+5.11e+01

Pos logit contributions
Don+0.149
 She+0.149
Me+0.116
 needed+0.114
Grand+0.110
Neg logit contributions
ishing-0.333
pieces-0.313
ipping-0.305
ocol-0.303
ieve-0.302
Token were
Token ID547
Feature activation+5.42e+01

Pos logit contributions
Don+0.151
Grand+0.136
Me+0.135
Doctor+0.133
chy+0.130
Neg logit contributions
ishing-0.258
ap-0.245
ish-0.242
 how-0.241
pieces-0.236
Token with
Token ID351
Feature activation+5.06e+01

Pos logit contributions
 She+0.177
Don+0.170
Me+0.160
chy+0.154
books+0.146
Neg logit contributions
ric-0.225
ishing-0.217
ocol-0.215
pieces-0.213
ig-0.198
Token mom
Token ID1995
Feature activation+4.99e+01

Pos logit contributions
Don+0.220
 She+0.206
Me+0.202
chy+0.200
Grand+0.193
Neg logit contributions
ishing-0.200
pieces-0.190
ipping-0.187
ap-0.181
ieve-0.177
Token's
Token ID338
Feature activation+4.82e+01

Pos logit contributions
Don+0.254
 She+0.238
Me+0.232
chy+0.229
Grand+0.227
Neg logit contributions
ishing-0.165
pieces-0.154
ipping-0.144
ap-0.144
ocol-0.142
Token necklace
Token ID38987
Feature activation+4.66e+01

Pos logit contributions
Don+0.241
 She+0.221
Me+0.219
chy+0.215
Grand+0.211
Neg logit contributions
ishing-0.163
pieces-0.155
ipping-0.147
ap-0.146
ocol-0.145
Token again
Token ID757
Feature activation+4.93e+01

Pos logit contributions
Don+0.237
 She+0.224
chy+0.217
Me+0.217
Grand+0.212
Neg logit contributions
ishing-0.151
pieces-0.143
ocol-0.135
ipping-0.132
ap-0.132
Token.
Token ID13
Feature activation+5.70e+01

Pos logit contributions
Don+0.230
Me+0.212
 She+0.206
chy+0.202
Well+0.201
Neg logit contributions
ishing-0.187
ap-0.172
pieces-0.169
ipping-0.163
 able-0.160
Token They
Token ID1119
Feature activation+5.55e+01

Pos logit contributions
chy+0.169
Don+0.155
books+0.140
Me+0.136
 woke+0.134
Neg logit contributions
ap-0.274
 our-0.268
ric-0.263
 how-0.259
pieces-0.258
Token still
Token ID991
Feature activation+5.27e+01

Pos logit contributions
Don+0.226
Grand+0.218
books+0.215
jar+0.214
 Did+0.214
Neg logit contributions
 least-0.192
 how-0.191
 friends-0.187
 able-0.177
ap-0.173
Token played
Token ID2826
Feature activation+5.41e+01

Pos logit contributions
Don+0.162
Grand+0.156
Doctor+0.149
Me+0.146
 She+0.141
Neg logit contributions
ishing-0.257
 how-0.249
ap-0.248
ish-0.238
 able-0.235
Token in
Token ID287
Feature activation+5.27e+01

Pos logit contributions
Don+0.234
 She+0.227
Me+0.213
chy+0.206
books+0.199
Neg logit contributions
ishing-0.216
pieces-0.205
ric-0.200
ocol-0.200
ipping-0.191
Token the
Token ID262
Feature activation+5.10e+01

Pos logit contributions
Don+0.238
 She+0.223
Me+0.216
chy+0.212
Grand+0.211
Neg logit contributions
ishing-0.205
pieces-0.192
ipping-0.184
ap-0.182
ocol-0.181
Token ball
Token ID2613
Feature activation+4.54e+01

Pos logit contributions
Don+0.183
 She+0.173
Me+0.171
chy+0.167
Grand+0.159
Neg logit contributions
ishing-0.215
pieces-0.202
ocol-0.197
ipping-0.194
ieve-0.191
Token.
Token ID13
Feature activation+5.51e+01

Pos logit contributions
Don+0.192
Me+0.169
Grand+0.168
 She+0.167
chy+0.161
Neg logit contributions
ishing-0.190
pieces-0.181
ap-0.180
ipping-0.174
ish-0.169
Token They
Token ID1119
Feature activation+5.38e+01

Pos logit contributions
Don+0.145
chy+0.116
Grand+0.115
Doctor+0.107
 meant+0.104
Neg logit contributions
ap-0.318
pieces-0.304
ishing-0.301
ipping-0.289
 victory-0.286
Token are
Token ID389
Feature activation+5.01e+01

Pos logit contributions
Don+0.261
 She+0.241
Me+0.235
Grand+0.235
chy+0.230
Neg logit contributions
ishing-0.186
pieces-0.178
ap-0.169
ipping-0.166
ocol-0.164
Token clever
Token ID14169
Feature activation+5.04e+01

Pos logit contributions
Don+0.175
 She+0.167
Me+0.160
chy+0.157
Grand+0.145
Neg logit contributions
ishing-0.249
ocol-0.238
pieces-0.236
ipping-0.229
ieve-0.221
Token.
Token ID13
Feature activation+5.67e+01

Pos logit contributions
 She+0.167
Don+0.159
chy+0.146
 meant+0.142
 needed+0.141
Neg logit contributions
pieces-0.250
ocol-0.248
ishing-0.248
ric-0.238
ipping-0.231
Token They
Token ID1119
Feature activation+5.57e+01

Pos logit contributions
Don+0.160
chy+0.140
Grand+0.125
jar+0.120
Doctor+0.120
Neg logit contributions
ap-0.318
pieces-0.296
ishing-0.287
 how-0.278
 victory-0.277
Token know
Token ID760
Feature activation+5.32e+01

Pos logit contributions
Don+0.262
 She+0.242
Me+0.236
Grand+0.234
chy+0.231
Neg logit contributions
ishing-0.202
pieces-0.193
ap-0.183
ipping-0.180
ocol-0.179
Token how
Token ID703
Feature activation+5.32e+01

Pos logit contributions
Don+0.297
 She+0.287
Me+0.278
Grand+0.264
chy+0.263
Neg logit contributions
ishing-0.162
pieces-0.137
ocol-0.134
ipping-0.133
immers-0.128
Token to
Token ID284
Feature activation+5.45e+01

Pos logit contributions
Don+0.191
 She+0.180
Me+0.173
 needed+0.161
chy+0.158
Neg logit contributions
ishing-0.270
ocol-0.269
pieces-0.258
ipping-0.249
ieve-0.247
Token share
Token ID2648
Feature activation+5.35e+01

Pos logit contributions
chy+0.198
 She+0.193
Don+0.187
Me+0.187
Maybe+0.160
Neg logit contributions
ishing-0.264
pieces-0.258
ocol-0.254
isher-0.230
 Majesty-0.227
TokenCan
Token ID6090
Feature activation+5.26e+01

Pos logit contributions
chy+0.175
 She+0.159
 needed+0.146
 He+0.139
 Did+0.129
Neg logit contributions
pieces-0.233
 winners-0.228
ishing-0.226
ieve-0.220
 victory-0.217
Token we
Token ID356
Feature activation+5.19e+01

Pos logit contributions
Don+0.214
 She+0.202
Me+0.193
Grand+0.176
chy+0.176
Neg logit contributions
ishing-0.260
pieces-0.232
ipping-0.231
ocol-0.229
ieve-0.221
Token go
Token ID467
Feature activation+4.97e+01

Pos logit contributions
 She+0.184
 had+0.158
 needed+0.153
 said+0.150
Don+0.149
Neg logit contributions
ishing-0.287
ocol-0.266
immers-0.266
pieces-0.260
ipping-0.259
Token outside
Token ID2354
Feature activation+4.70e+01

Pos logit contributions
 She+0.185
 woke+0.165
 had+0.162
 said+0.160
 fell+0.152
Neg logit contributions
ocol-0.255
imming-0.226
ishing-0.223
pieces-0.223
 Majesty-0.223
Token now
Token ID783
Feature activation+5.04e+01

Pos logit contributions
 She+0.183
Don+0.175
Grand+0.165
chy+0.158
 woke+0.157
Neg logit contributions
ishing-0.202
pieces-0.199
ocol-0.194
ieve-0.185
ipping-0.183
Token?"
Token ID1701
Feature activation+5.66e+01

Pos logit contributions
 She+0.186
 woke+0.167
Don+0.160
Grand+0.160
 said+0.159
Neg logit contributions
pieces-0.246
ishing-0.225
ric-0.224
ocol-0.223
immers-0.208
Token Anna
Token ID11735
Feature activation+5.12e+01

Pos logit contributions
Don+0.201
chy+0.200
books+0.183
Me+0.169
Doctor+0.165
Neg logit contributions
ap-0.245
pieces-0.240
 our-0.231
ocol-0.229
 how-0.227
Token and
Token ID290
Feature activation+5.39e+01

Pos logit contributions
Me+0.209
Don+0.200
chy+0.193
Wow+0.175
books+0.170
Neg logit contributions
ishing-0.207
ap-0.198
 able-0.193
 how-0.189
ish-0.188
Token Ben
Token ID3932
Feature activation+5.10e+01

Pos logit contributions
Don+0.134
 She+0.117
Me+0.109
chy+0.106
Grand+0.105
Neg logit contributions
ishing-0.321
pieces-0.310
ipping-0.299
ap-0.298
ocol-0.298
Token asked
Token ID1965
Feature activation+5.50e+01

Pos logit contributions
Me+0.198
Don+0.189
chy+0.173
Well+0.160
Doctor+0.154
Neg logit contributions
ap-0.222
ishing-0.216
 able-0.212
pieces-0.210
ish-0.203
Token.
Token ID13
Feature activation+5.80e+01

Pos logit contributions
Don+0.225
 She+0.200
Me+0.199
chy+0.195
Grand+0.193
Neg logit contributions
ishing-0.239
pieces-0.227
ap-0.223
ipping-0.217
ocol-0.214
Token.
Token ID13
Feature activation+5.31e+01

Pos logit contributions
Don+0.157
 She+0.146
Grand+0.144
Me+0.135
Doctor+0.132
Neg logit contributions
ishing-0.231
pieces-0.227
ocol-0.221
ipping-0.218
ieve-0.209
Token Let
Token ID3914
Feature activation+5.28e+01

Pos logit contributions
Don+0.157
 She+0.144
Me+0.134
Grand+0.126
 Did+0.123
Neg logit contributions
ishing-0.301
pieces-0.282
ocol-0.277
ipping-0.277
 able-0.265
Token's
Token ID338
Feature activation+4.92e+01

Pos logit contributions
Don+0.189
 She+0.178
Me+0.167
chy+0.161
Grand+0.160
Neg logit contributions
ishing-0.261
ocol-0.254
pieces-0.246
ipping-0.238
ric-0.233
Token play
Token ID711
Feature activation+4.93e+01

Pos logit contributions
 had+0.097
 went+0.086
 She+0.081
 said+0.075
 woke+0.074
Neg logit contributions
ocol-0.356
ishing-0.329
pieces-0.309
ries-0.305
atted-0.305
Token again
Token ID757
Feature activation+4.97e+01

Pos logit contributions
Don+0.206
Maybe+0.192
Me+0.190
 She+0.185
 woke+0.183
Neg logit contributions
ocol-0.214
ishing-0.196
ric-0.195
pieces-0.178
ipping-0.176
Token."
Token ID526
Feature activation+5.66e+01

Pos logit contributions
Don+0.185
 She+0.171
Grand+0.170
Me+0.162
Doctor+0.158
Neg logit contributions
ishing-0.226
pieces-0.221
ocol-0.215
ipping-0.213
ieve-0.202
Token They
Token ID1119
Feature activation+5.47e+01

Pos logit contributions
Don+0.150
chy+0.136
Me+0.127
Grand+0.123
Doctor+0.115
Neg logit contributions
ap-0.316
ishing-0.308
pieces-0.300
ipping-0.294
 how-0.288
Token take
Token ID1011
Feature activation+5.38e+01

Pos logit contributions
chy+0.189
 She+0.186
Don+0.183
 had+0.175
 said+0.165
Neg logit contributions
ishing-0.282
ocol-0.276
ipping-0.252
pieces-0.251
immers-0.238
Token their
Token ID511
Feature activation+5.17e+01

Pos logit contributions
 She+0.196
Me+0.171
Don+0.169
 They+0.168
 woke+0.165
Neg logit contributions
ocol-0.262
ric-0.259
ishing-0.246
 Majesty-0.240
pieces-0.236
Token boats
Token ID16659
Feature activation+4.99e+01

Pos logit contributions
Don+0.194
Me+0.183
chy+0.172
 needed+0.167
 She+0.167
Neg logit contributions
ishing-0.237
ocol-0.228
pieces-0.217
ipping-0.216
 able-0.216
Token and
Token ID290
Feature activation+5.64e+01

Pos logit contributions
 She+0.098
Don+0.096
 said+0.090
chy+0.083
 fell+0.078
Neg logit contributions
ocol-0.320
 Majesty-0.306
pieces-0.297
ric-0.287
ishing-0.286
Token it
Token ID340
Feature activation+4.64e+01

Pos logit contributions
Don+0.210
 She+0.201
 needed+0.185
Me+0.185
chy+0.184
Neg logit contributions
ishing-0.203
pieces-0.185
ipping-0.184
ocol-0.177
ap-0.173
Token is
Token ID318
Feature activation+4.65e+01

Pos logit contributions
Don+0.152
Grand+0.134
Me+0.133
 She+0.132
Doctor+0.124
Neg logit contributions
ishing-0.236
pieces-0.225
ap-0.219
ieve-0.217
ipping-0.215
Token!
Token ID0
Feature activation+5.43e+01

Pos logit contributions
Don+0.199
 She+0.187
chy+0.181
Me+0.176
Grand+0.173
Neg logit contributions
ishing-0.188
pieces-0.181
ipping-0.171
ocol-0.169
ap-0.168
Token Follow
Token ID7281
Feature activation+5.04e+01

Pos logit contributions
Don+0.196
chy+0.177
Me+0.174
 She+0.170
Grand+0.169
Neg logit contributions
ishing-0.247
pieces-0.243
ap-0.237
ipping-0.228
ocol-0.227
Token me
Token ID502
Feature activation+4.98e+01

Pos logit contributions
 She+0.148
Don+0.141
Doctor+0.125
Grand+0.124
Maybe+0.118
Neg logit contributions
ocol-0.268
pieces-0.266
ipping-0.260
ooming-0.257
ishing-0.250
Token!"
Token ID2474
Feature activation+5.65e+01

Pos logit contributions
 She+0.191
Don+0.175
chy+0.170
 needed+0.160
Grand+0.159
Neg logit contributions
ocol-0.226
ishing-0.224
pieces-0.218
ipping-0.208
ieve-0.198
Token
Token ID198
Feature activation+5.39e+01

Pos logit contributions
chy+0.214
Don+0.211
Me+0.201
Grand+0.194
Doctor+0.184
Neg logit contributions
pieces-0.246
ishing-0.232
 how-0.226
ap-0.223
 able-0.220
Token
Token ID198
Feature activation+5.34e+01

Pos logit contributions
Don+0.156
chy+0.155
 meant+0.132
Me+0.131
 needed+0.127
Neg logit contributions
pieces-0.273
ap-0.259
ishing-0.257
 how-0.254
ipping-0.246
TokenThe
Token ID464
Feature activation+5.40e+01

Pos logit contributions
chy+0.163
Don+0.148
 She+0.146
 meant+0.137
 needed+0.136
Neg logit contributions
pieces-0.274
ishing-0.260
ap-0.248
 able-0.245
ipping-0.244
Token squirrel
Token ID33039
Feature activation+5.06e+01

Pos logit contributions
Don+0.206
 She+0.190
chy+0.183
Grand+0.181
Me+0.177
Neg logit contributions
ishing-0.241
pieces-0.230
ipping-0.222
 victory-0.219
ap-0.219
Token led
Token ID2957
Feature activation+5.29e+01

Pos logit contributions
Don+0.193
Grand+0.173
 She+0.173
Me+0.170
chy+0.169
Neg logit contributions
ishing-0.227
pieces-0.214
 able-0.210
ish-0.206
ap-0.205
Token their
Token ID511
Feature activation+4.79e+01

Pos logit contributions
Don+0.216
chy+0.193
Me+0.189
 She+0.187
Well+0.186
Neg logit contributions
ishing-0.203
ap-0.197
pieces-0.191
 how-0.181
ish-0.180
Token mom
Token ID1995
Feature activation+4.65e+01

Pos logit contributions
Don+0.192
 She+0.174
Me+0.173
chy+0.168
Grand+0.168
Neg logit contributions
ishing-0.210
pieces-0.202
ocol-0.191
ipping-0.191
ap-0.189
Token and
Token ID290
Feature activation+5.10e+01

Pos logit contributions
Don+0.188
 She+0.173
Me+0.167
chy+0.164
Grand+0.163
Neg logit contributions
ishing-0.202
pieces-0.193
ipping-0.183
ap-0.183
ocol-0.182
Token thanked
Token ID26280
Feature activation+5.14e+01

Pos logit contributions
Don+0.147
Me+0.134
 She+0.133
chy+0.132
books+0.122
Neg logit contributions
ishing-0.269
pieces-0.257
ap-0.252
 able-0.252
 how-0.247
Token her
Token ID607
Feature activation+4.97e+01

Pos logit contributions
Don+0.234
chy+0.215
Me+0.208
 She+0.207
Well+0.203
Neg logit contributions
ishing-0.190
pieces-0.187
ap-0.182
ocol-0.173
ieve-0.170
Token.
Token ID13
Feature activation+5.64e+01

Pos logit contributions
Don+0.214
 She+0.200
Me+0.192
chy+0.190
Grand+0.188
Neg logit contributions
ishing-0.204
pieces-0.193
ocol-0.184
ipping-0.184
ap-0.181
Token
Token ID198
Feature activation+5.36e+01

Pos logit contributions
chy+0.233
Don+0.177
books+0.175
Doctor+0.174
 cost+0.172
Neg logit contributions
 how-0.245
ap-0.233
 as-0.221
 our-0.220
 what-0.217
Token
Token ID198
Feature activation+5.30e+01

Pos logit contributions
chy+0.221
Don+0.194
 meant+0.185
 needed+0.183
 Did+0.175
Neg logit contributions
 how-0.211
ap-0.207
 our-0.202
 what-0.199
pieces-0.190
TokenThey
Token ID2990
Feature activation+5.55e+01

Pos logit contributions
chy+0.250
 meant+0.221
 needed+0.219
Don+0.211
 Did+0.209
Neg logit contributions
ap-0.185
ishing-0.178
pieces-0.171
ipping-0.171
 our-0.166
Token went
Token ID1816
Feature activation+5.29e+01

Pos logit contributions
Don+0.201
 Did+0.191
Me+0.186
 She+0.184
chy+0.180
Neg logit contributions
 able-0.232
ap-0.226
 how-0.225
ishing-0.224
 friends-0.218
Token back
Token ID736
Feature activation+5.31e+01

Pos logit contributions
 She+0.205
Don+0.185
 low+0.167
Maybe+0.161
 said+0.160
Neg logit contributions
ocol-0.277
pieces-0.255
ric-0.244
ishing-0.241
ieve-0.234
Token hats
Token ID23910
Feature activation+4.16e+01

Pos logit contributions
Don+0.182
chy+0.168
Me+0.168
 She+0.168
 needed+0.161
Neg logit contributions
pieces-0.172
ishing-0.169
ipping-0.163
ocol-0.162
ap-0.160
Token!
Token ID0
Feature activation+5.35e+01

Pos logit contributions
chy+0.168
Don+0.166
 She+0.166
 needed+0.152
 meant+0.152
Neg logit contributions
pieces-0.173
ishing-0.167
ocol-0.165
ieve-0.159
ipping-0.152
Token You
Token ID921
Feature activation+5.24e+01

Pos logit contributions
Don+0.174
Me+0.152
chy+0.151
 She+0.148
Grand+0.146
Neg logit contributions
ishing-0.262
pieces-0.257
ap-0.252
ipping-0.246
ocol-0.246
Token ruined
Token ID20484
Feature activation+4.84e+01

Pos logit contributions
Don+0.168
 She+0.153
chy+0.151
 needed+0.150
 meant+0.145
Neg logit contributions
ishing-0.280
pieces-0.270
ipping-0.259
ocol-0.258
ap-0.248
Token them
Token ID606
Feature activation+4.92e+01

Pos logit contributions
 She+0.206
Don+0.205
Grand+0.197
Me+0.196
 V+0.191
Neg logit contributions
pieces-0.173
ipping-0.172
ishing-0.172
ooming-0.161
ap-0.159
Token!"
Token ID2474
Feature activation+5.64e+01

Pos logit contributions
Don+0.248
 She+0.239
chy+0.233
Grand+0.228
Me+0.226
Neg logit contributions
ishing-0.158
pieces-0.152
ocol-0.143
ipping-0.137
ieve-0.133
Token
Token ID198
Feature activation+5.42e+01

Pos logit contributions
chy+0.278
Don+0.237
books+0.227
Me+0.224
Today+0.223
Neg logit contributions
 our-0.186
 as-0.185
 how-0.182
 victory-0.178
ap-0.177
Token
Token ID198
Feature activation+5.38e+01

Pos logit contributions
chy+0.141
Don+0.132
Me+0.113
 needed+0.110
 meant+0.110
Neg logit contributions
ishing-0.283
ap-0.282
pieces-0.278
ipping-0.273
ieve-0.270
TokenThe
Token ID464
Feature activation+5.35e+01

Pos logit contributions
Don+0.147
chy+0.145
 She+0.136
Me+0.128
 meant+0.122
Neg logit contributions
ishing-0.284
pieces-0.276
ap-0.269
ipping-0.267
ieve-0.261
Token dog
Token ID3290
Feature activation+5.12e+01

Pos logit contributions
Don+0.213
 She+0.195
chy+0.195
books+0.189
 Did+0.188
Neg logit contributions
ishing-0.210
 victory-0.204
ap-0.203
ipping-0.202
pieces-0.200
Token heard
Token ID2982
Feature activation+5.41e+01

Pos logit contributions
jar+0.183
Grand+0.181
Don+0.180
Today+0.179
books+0.177
Neg logit contributions
 how-0.209
ap-0.197
ishing-0.195
ish-0.188
 wre-0.187
Token.
Token ID13
Feature activation+5.43e+01

Pos logit contributions
Don+0.145
 She+0.141
chy+0.130
Me+0.129
Grand+0.125
Neg logit contributions
ishing-0.218
ocol-0.206
ipping-0.202
pieces-0.202
ish-0.195
Token They
Token ID1119
Feature activation+5.25e+01

Pos logit contributions
chy+0.196
Don+0.191
 needed+0.165
jar+0.161
ho+0.159
Neg logit contributions
ap-0.259
 victory-0.227
 how-0.223
pieces-0.218
 our-0.217
Token are
Token ID389
Feature activation+4.89e+01

Pos logit contributions
Don+0.244
 She+0.225
Me+0.219
Grand+0.216
chy+0.215
Neg logit contributions
ishing-0.195
pieces-0.187
ap-0.177
ipping-0.174
ocol-0.172
Token happy
Token ID3772
Feature activation+5.10e+01

Pos logit contributions
 She+0.147
Don+0.145
Me+0.142
chy+0.132
Grand+0.127
Neg logit contributions
ishing-0.268
ocol-0.255
ipping-0.253
ooming-0.242
pieces-0.240
Token friends
Token ID2460
Feature activation+4.95e+01

Pos logit contributions
 She+0.157
Don+0.156
Me+0.132
Grand+0.128
Well+0.126
Neg logit contributions
ishing-0.272
pieces-0.266
ocol-0.260
ipping-0.248
ish-0.247
Token.
Token ID13
Feature activation+5.63e+01

Pos logit contributions
Don+0.162
 She+0.156
Me+0.140
chy+0.136
Grand+0.134
Neg logit contributions
ishing-0.256
pieces-0.243
ocol-0.240
ipping-0.233
ric-0.226
Token<|endoftext|>
Token ID50256
Feature activation+5.42e+01

Pos logit contributions
chy+0.199
Don+0.187
jar+0.168
ho+0.159
Doctor+0.157
Neg logit contributions
ap-0.266
 our-0.235
 how-0.230
 victory-0.229
pieces-0.224
TokenL
Token ID43
Feature activation+4.92e+01

Pos logit contributions
Don+0.154
chy+0.132
Grand+0.129
Me+0.128
 She+0.127
Neg logit contributions
ishing-0.295
ap-0.281
ipping-0.280
pieces-0.280
ocol-0.274
Tokenily
Token ID813
Feature activation+4.97e+01

Pos logit contributions
 She+0.124
Don+0.119
Me+0.106
Grand+0.104
chy+0.100
Neg logit contributions
ishing-0.279
pieces-0.267
ipping-0.261
ocol-0.258
ieve-0.256
Token and
Token ID290
Feature activation+5.44e+01

Pos logit contributions
Don+0.203
Me+0.178
Grand+0.177
chy+0.171
Doctor+0.171
Neg logit contributions
ishing-0.220
ap-0.204
ipping-0.197
ish-0.195
pieces-0.192
Token Tom
Token ID4186
Feature activation+4.98e+01

Pos logit contributions
 She+0.179
Don+0.164
Me+0.135
 needed+0.134
 meant+0.128
Neg logit contributions
ishing-0.339
ipping-0.305
ocol-0.303
pieces-0.301
immers-0.300
Token They
Token ID1119
Feature activation+5.25e+01

Pos logit contributions
chy+0.244
ho+0.217
wl+0.203
Don+0.195
Me+0.195
Neg logit contributions
 our-0.195
 how-0.194
 what-0.182
 as-0.181
ap-0.180
Token cleaned
Token ID20750
Feature activation+5.07e+01

Pos logit contributions
Don+0.196
 She+0.189
Me+0.186
Wow+0.176
 Did+0.176
Neg logit contributions
 able-0.214
 how-0.213
ishing-0.209
 friends-0.199
 breath-0.195
Token Lily
Token ID20037
Feature activation+5.01e+01

Pos logit contributions
Don+0.267
 She+0.256
Me+0.249
chy+0.241
Grand+0.238
Neg logit contributions
ishing-0.156
pieces-0.152
ocol-0.140
ipping-0.138
ap-0.137
Token's
Token ID338
Feature activation+4.70e+01

Pos logit contributions
Don+0.217
 She+0.200
Me+0.194
chy+0.191
Grand+0.189
Neg logit contributions
ishing-0.204
pieces-0.194
ipping-0.184
ap-0.184
ocol-0.182
Token finger
Token ID7660
Feature activation+4.71e+01

Pos logit contributions
Don+0.157
Me+0.155
 She+0.151
chy+0.144
Grand+0.139
Neg logit contributions
pieces-0.240
ieve-0.222
ocol-0.222
ipping-0.219
ishing-0.218
Token.
Token ID13
Feature activation+5.63e+01

Pos logit contributions
Don+0.197
 She+0.177
Me+0.174
Grand+0.170
Well+0.166
Neg logit contributions
ishing-0.202
pieces-0.189
ap-0.183
ipping-0.181
ish-0.176
Token They
Token ID1119
Feature activation+5.50e+01

Pos logit contributions
chy+0.240
ho+0.200
Don+0.193
Me+0.186
wl+0.182
Neg logit contributions
 our-0.227
 how-0.221
ap-0.213
 such-0.210
 as-0.209
Token put
Token ID1234
Feature activation+5.30e+01

Pos logit contributions
Don+0.195
 She+0.184
Me+0.179
 Did+0.170
Wow+0.169
Neg logit contributions
ishing-0.241
 able-0.235
 how-0.232
pieces-0.226
ish-0.224
Token a
Token ID257
Feature activation+5.02e+01

Pos logit contributions
Don+0.243
 She+0.228
Me+0.219
chy+0.212
Grand+0.208
Neg logit contributions
ishing-0.204
pieces-0.195
ap-0.185
ipping-0.185
ocol-0.179
Token band
Token ID4097
Feature activation+4.82e+01

Pos logit contributions
Don+0.231
Me+0.217
Grand+0.210
 She+0.210
chy+0.203
Neg logit contributions
ishing-0.191
pieces-0.188
ipping-0.174
ieve-0.172
ocol-0.171
Token-
Token ID12
Feature activation+3.65e+01

Pos logit contributions
Don+0.173
Grand+0.164
 She+0.157
post+0.156
M+0.156
Neg logit contributions
ocol-0.227
pieces-0.221
ishing-0.217
 heights-0.217
 Majesty-0.216
Token swim
Token ID9422
Feature activation+4.63e+01

Pos logit contributions
 woke+0.148
chy+0.145
 She+0.143
 had+0.127
 fell+0.124
Neg logit contributions
ishing-0.288
pieces-0.260
ipping-0.253
isher-0.253
ooming-0.249
Token with
Token ID351
Feature activation+4.97e+01

Pos logit contributions
Don+0.201
 She+0.194
chy+0.186
Me+0.179
 needed+0.177
Neg logit contributions
pieces-0.178
ishing-0.177
ocol-0.176
ipping-0.171
ieve-0.162
Token you
Token ID345
Feature activation+4.94e+01

Pos logit contributions
Don+0.205
 She+0.200
Me+0.189
Doctor+0.186
 needed+0.182
Neg logit contributions
ishing-0.212
ipping-0.198
ooming-0.195
pieces-0.193
ric-0.184
Token,
Token ID11
Feature activation+5.30e+01

Pos logit contributions
Don+0.241
 She+0.225
Me+0.218
chy+0.216
Grand+0.214
Neg logit contributions
ishing-0.173
pieces-0.163
ipping-0.154
ap-0.152
ocol-0.152
Token duck
Token ID22045
Feature activation+4.86e+01

Pos logit contributions
Don+0.194
 She+0.177
Me+0.170
chy+0.167
Grand+0.165
Neg logit contributions
ishing-0.252
pieces-0.240
ipping-0.230
ap-0.229
ocol-0.228
Token!"
Token ID2474
Feature activation+5.63e+01

Pos logit contributions
Don+0.218
 She+0.214
chy+0.212
 needed+0.201
Me+0.200
Neg logit contributions
ishing-0.176
pieces-0.173
ocol-0.171
ieve-0.163
ipping-0.163
Token And
Token ID843
Feature activation+5.55e+01

Pos logit contributions
chy+0.207
Don+0.187
Me+0.172
Grand+0.170
jar+0.168
Neg logit contributions
 how-0.247
 as-0.235
ap-0.230
 able-0.229
ishing-0.229
Token they
Token ID484
Feature activation+5.40e+01

Pos logit contributions
Don+0.244
 She+0.223
Me+0.220
chy+0.220
Grand+0.216
Neg logit contributions
ishing-0.217
pieces-0.211
ap-0.199
 able-0.195
ipping-0.193
Token sw
Token ID1509
Feature activation+4.53e+01

Pos logit contributions
Don+0.250
 She+0.233
Grand+0.224
Me+0.221
chy+0.219
Neg logit contributions
ishing-0.200
pieces-0.188
ap-0.186
ish-0.178
 able-0.177
Tokenam
Token ID321
Feature activation+5.14e+01

Pos logit contributions
Me+0.198
Don+0.194
 She+0.185
M+0.182
 needed+0.177
Neg logit contributions
pieces-0.157
ieve-0.150
ocol-0.147
 winners-0.136
 heights-0.132
Token together
Token ID1978
Feature activation+5.38e+01

Pos logit contributions
 She+0.204
Don+0.201
Me+0.189
 He+0.185
 had+0.182
Neg logit contributions
ocol-0.240
ishing-0.220
ric-0.219
pieces-0.218
ieve-0.208
Token the
Token ID262
Feature activation+4.98e+01

Pos logit contributions
Don+0.235
 She+0.221
Me+0.211
chy+0.206
Grand+0.206
Neg logit contributions
ishing-0.186
pieces-0.175
ipping-0.168
ocol-0.165
ap-0.161
Token comet
Token ID31733
Feature activation+4.50e+01

Pos logit contributions
Don+0.226
 She+0.210
Me+0.204
chy+0.201
Grand+0.199
Neg logit contributions
ishing-0.192
pieces-0.182
ipping-0.172
ap-0.172
ocol-0.170
Token and
Token ID290
Feature activation+5.28e+01

Pos logit contributions
Don+0.176
 She+0.170
chy+0.166
Me+0.159
 needed+0.155
Neg logit contributions
ishing-0.197
ocol-0.189
pieces-0.183
ipping-0.182
ooming-0.168
Token its
Token ID663
Feature activation+5.01e+01

Pos logit contributions
Don+0.218
 She+0.199
Me+0.191
 needed+0.187
 knew+0.181
Neg logit contributions
ishing-0.250
ipping-0.240
ocol-0.238
pieces-0.225
ooming-0.224
Token difference
Token ID3580
Feature activation+4.90e+01

Pos logit contributions
Don+0.237
Me+0.223
 She+0.221
chy+0.220
 needed+0.210
Neg logit contributions
ishing-0.182
pieces-0.174
ipping-0.169
ocol-0.164
 able-0.153
Token.
Token ID13
Feature activation+5.63e+01

Pos logit contributions
Don+0.174
 She+0.160
Me+0.154
chy+0.151
Grand+0.146
Neg logit contributions
ishing-0.232
pieces-0.227
ocol-0.216
ipping-0.215
ap-0.211
Token They
Token ID1119
Feature activation+5.47e+01

Pos logit contributions
chy+0.151
Don+0.142
Doctor+0.125
books+0.123
Grand+0.123
Neg logit contributions
ap-0.298
 how-0.290
 victory-0.286
pieces-0.280
ishing-0.280
Token felt
Token ID2936
Feature activation+5.20e+01

Pos logit contributions
Don+0.182
Grand+0.172
 She+0.171
 Did+0.165
Doctor+0.165
Neg logit contributions
 how-0.240
 able-0.236
ap-0.235
ishing-0.234
 victory-0.233
Token a
Token ID257
Feature activation+4.97e+01

Pos logit contributions
Don+0.182
 She+0.164
Me+0.156
chy+0.154
Grand+0.146
Neg logit contributions
ishing-0.265
pieces-0.252
ipping-0.246
ocol-0.244
ap-0.236
Token new
Token ID649
Feature activation+5.06e+01

Pos logit contributions
Don+0.171
Me+0.157
chy+0.152
Grand+0.150
 She+0.142
Neg logit contributions
pieces-0.255
ishing-0.251
ocol-0.240
ipping-0.230
immers-0.223
Token friendship
Token ID14738
Feature activation+5.16e+01

Pos logit contributions
Don+0.188
Me+0.178
 She+0.177
chy+0.176
 needed+0.173
Neg logit contributions
pieces-0.232
ishing-0.226
ocol-0.223
ieve-0.209
ipping-0.207
Token that
Token ID326
Feature activation+5.40e+01

Pos logit contributions
Don+0.223
 She+0.218
 meant+0.196
Grand+0.195
Maybe+0.192
Neg logit contributions
ishing-0.230
ocol-0.227
pieces-0.220
ipping-0.214
immers-0.200
Token courage
Token ID11917
Feature activation+5.21e+01

Pos logit contributions
Don+0.278
 She+0.262
Me+0.254
chy+0.251
Grand+0.250
Neg logit contributions
ishing-0.173
pieces-0.163
ap-0.154
ipping-0.152
ocol-0.149
Token is
Token ID318
Feature activation+5.16e+01

Pos logit contributions
Don+0.184
 She+0.168
Me+0.161
chy+0.158
Grand+0.155
Neg logit contributions
ishing-0.252
pieces-0.242
ipping-0.231
ap-0.230
ocol-0.230
Token important
Token ID1593
Feature activation+5.13e+01

Pos logit contributions
Don+0.210
 She+0.186
 meant+0.185
chy+0.184
 needed+0.180
Neg logit contributions
ishing-0.228
pieces-0.217
ocol-0.214
ipping-0.214
ooming-0.204
Token.
Token ID13
Feature activation+5.78e+01

Pos logit contributions
Don+0.175
 She+0.163
 meant+0.148
chy+0.147
 needed+0.147
Neg logit contributions
ishing-0.253
pieces-0.250
ocol-0.240
ipping-0.236
ric-0.227
Token They
Token ID1119
Feature activation+5.63e+01

Pos logit contributions
chy+0.156
Don+0.154
Me+0.136
books+0.132
jar+0.129
Neg logit contributions
ap-0.305
ric-0.285
pieces-0.282
 our-0.278
ishing-0.277
Token learned
Token ID4499
Feature activation+5.55e+01

Pos logit contributions
Don+0.204
Grand+0.195
Me+0.192
Doctor+0.184
books+0.184
Neg logit contributions
 victory-0.231
pieces-0.229
 how-0.229
ieve-0.227
ap-0.226
Token that
Token ID326
Feature activation+5.73e+01

Pos logit contributions
Don+0.244
 She+0.237
Grand+0.215
 meant+0.214
Maybe+0.210
Neg logit contributions
ishing-0.240
ocol-0.233
pieces-0.228
ipping-0.222
immers-0.208
Token soccer
Token ID11783
Feature activation+5.25e+01

Pos logit contributions
Don+0.291
 She+0.274
Me+0.265
Grand+0.263
chy+0.263
Neg logit contributions
ishing-0.185
pieces-0.175
ap-0.169
ipping-0.163
ieve-0.160
Token is
Token ID318
Feature activation+5.51e+01

Pos logit contributions
Don+0.185
 She+0.167
Me+0.161
Grand+0.160
chy+0.157
Neg logit contributions
ishing-0.255
pieces-0.243
ap-0.235
ipping-0.233
ocol-0.229
Token fun
Token ID1257
Feature activation+5.45e+01

Pos logit contributions
Don+0.206
 meant+0.179
 She+0.176
chy+0.175
 needed+0.172
Neg logit contributions
ishing-0.273
ipping-0.262
pieces-0.258
ocol-0.257
ooming-0.253
Token they
Token ID484
Feature activation+5.08e+01

Pos logit contributions
Don+0.163
 She+0.148
Me+0.141
chy+0.136
Grand+0.128
Neg logit contributions
ishing-0.282
pieces-0.268
ipping-0.261
ocol-0.257
ap-0.257
Token had
Token ID550
Feature activation+5.03e+01

Pos logit contributions
Grand+0.144
Don+0.144
 She+0.133
Doctor+0.131
books+0.128
Neg logit contributions
 how-0.248
ishing-0.242
ish-0.235
 able-0.234
pieces-0.229
Token learned
Token ID4499
Feature activation+5.06e+01

Pos logit contributions
Don+0.239
 She+0.222
Me+0.218
Grand+0.214
chy+0.210
Neg logit contributions
ishing-0.183
pieces-0.170
ap-0.164
 able-0.158
 victory-0.158
Token a
Token ID257
Feature activation+4.89e+01

Pos logit contributions
Don+0.205
 She+0.205
 meant+0.188
Maybe+0.179
chy+0.176
Neg logit contributions
ocol-0.241
ishing-0.239
pieces-0.232
ipping-0.231
ooming-0.217
Token lesson
Token ID11483
Feature activation+5.10e+01

Pos logit contributions
Don+0.141
 She+0.128
Me+0.118
Grand+0.118
Doctor+0.112
Neg logit contributions
ishing-0.252
 victory-0.248
pieces-0.244
ap-0.242
ish-0.233
Token.
Token ID13
Feature activation+5.62e+01

Pos logit contributions
Don+0.218
Me+0.200
 She+0.196
chy+0.195
Grand+0.190
Neg logit contributions
ishing-0.206
pieces-0.196
ap-0.188
ipping-0.185
ocol-0.183
Token They
Token ID1119
Feature activation+5.47e+01

Pos logit contributions
chy+0.176
Don+0.147
books+0.139
Me+0.139
 woke+0.137
Neg logit contributions
ap-0.281
pieces-0.269
 how-0.268
ocol-0.267
ric-0.263
Token never
Token ID1239
Feature activation+5.30e+01

Pos logit contributions
Don+0.191
Me+0.184
books+0.183
 She+0.181
 Did+0.180
Neg logit contributions
 able-0.219
 how-0.218
ishing-0.210
ish-0.210
pieces-0.208
Token touched
Token ID12615
Feature activation+5.11e+01

Pos logit contributions
Don+0.181
Me+0.168
 She+0.166
Grand+0.157
chy+0.151
Neg logit contributions
ishing-0.262
ap-0.241
pieces-0.239
ish-0.234
 able-0.233
Token the
Token ID262
Feature activation+5.14e+01

Pos logit contributions
Don+0.220
 She+0.216
chy+0.203
Me+0.197
 meant+0.194
Neg logit contributions
ishing-0.198
pieces-0.197
ipping-0.185
ap-0.184
ocol-0.183
Token cord
Token ID15050
Feature activation+4.80e+01

Pos logit contributions
Don+0.272
Well+0.244
Grand+0.242
Doctor+0.239
 She+0.238
Neg logit contributions
ishing-0.148
 victory-0.137
ipping-0.134
pieces-0.128
ish-0.127
Token told
Token ID1297
Feature activation+5.04e+01

Pos logit contributions
Don+0.129
 She+0.113
Me+0.109
chy+0.105
books+0.103
Neg logit contributions
ishing-0.280
pieces-0.268
 how-0.266
 able-0.264
ish-0.258
Token them
Token ID606
Feature activation+4.80e+01

Pos logit contributions
Don+0.288
 She+0.275
Me+0.268
chy+0.261
Grand+0.260
Neg logit contributions
ishing-0.140
pieces-0.128
ipping-0.118
ap-0.117
ocol-0.112
Token she
Token ID673
Feature activation+5.18e+01

Pos logit contributions
 She+0.188
Don+0.184
chy+0.168
Me+0.163
 needed+0.162
Neg logit contributions
ishing-0.236
ocol-0.218
ipping-0.214
pieces-0.209
ooming-0.202
Token loved
Token ID6151
Feature activation+4.94e+01

Pos logit contributions
Don+0.140
 She+0.124
Grand+0.120
Me+0.120
Doctor+0.113
Neg logit contributions
ishing-0.287
pieces-0.275
ap-0.268
ish-0.264
ipping-0.263
Token them
Token ID606
Feature activation+4.93e+01

Pos logit contributions
Don+0.295
 She+0.274
chy+0.270
Me+0.270
Grand+0.268
Neg logit contributions
ishing-0.120
pieces-0.108
ocol-0.101
ap-0.098
ipping-0.098
Token.
Token ID13
Feature activation+5.62e+01

Pos logit contributions
Don+0.275
Me+0.252
Grand+0.249
Well+0.246
Wow+0.244
Neg logit contributions
ishing-0.135
ap-0.133
pieces-0.131
 victory-0.122
 how-0.121
Token She
Token ID1375
Feature activation+5.39e+01

Pos logit contributions
chy+0.192
Don+0.152
books+0.145
 cost+0.143
wl+0.143
Neg logit contributions
ap-0.262
 how-0.259
 friends-0.246
 ways-0.246
ocol-0.242
Token told
Token ID1297
Feature activation+5.41e+01

Pos logit contributions
Don+0.132
 She+0.113
Me+0.111
chy+0.104
books+0.103
Neg logit contributions
ishing-0.307
pieces-0.295
 how-0.292
 able-0.289
ish-0.287
Token them
Token ID606
Feature activation+5.19e+01

Pos logit contributions
Don+0.313
 She+0.298
Me+0.290
chy+0.284
Grand+0.283
Neg logit contributions
ishing-0.146
pieces-0.132
ipping-0.122
ap-0.121
ocol-0.117
Token they
Token ID484
Feature activation+5.55e+01

Pos logit contributions
 She+0.211
Don+0.208
chy+0.190
Me+0.186
 needed+0.182
Neg logit contributions
ishing-0.243
ocol-0.222
ipping-0.220
pieces-0.214
ooming-0.207
Token had
Token ID550
Feature activation+5.34e+01

Pos logit contributions
Don+0.140
 She+0.123
Grand+0.120
Me+0.120
Doctor+0.113
Neg logit contributions
ishing-0.316
pieces-0.307
ap-0.297
ieve-0.292
ish-0.291
Token bad
Token ID2089
Feature activation+4.93e+01

Pos logit contributions
Don+0.206
chy+0.191
 She+0.190
 meant+0.182
Grand+0.180
Neg logit contributions
ishing-0.198
pieces-0.196
ipping-0.192
ooming-0.183
ap-0.179
Token.
Token ID13
Feature activation+5.38e+01

Pos logit contributions
Don+0.200
 She+0.188
chy+0.176
Me+0.174
Grand+0.172
Neg logit contributions
ishing-0.213
pieces-0.206
ipping-0.194
ocol-0.193
ap-0.191
Token It
Token ID632
Feature activation+5.16e+01

Pos logit contributions
Don+0.185
chy+0.166
Me+0.163
 She+0.161
Grand+0.157
Neg logit contributions
ishing-0.253
pieces-0.246
ap-0.244
ocol-0.240
ipping-0.233
Token hurt
Token ID5938
Feature activation+5.03e+01

Pos logit contributions
Don+0.143
 She+0.130
Grand+0.127
Me+0.124
books+0.120
Neg logit contributions
ishing-0.273
pieces-0.260
ieve-0.256
ap-0.252
 able-0.251
Token me
Token ID502
Feature activation+5.09e+01

Pos logit contributions
Don+0.290
Me+0.269
chy+0.267
Well+0.265
books+0.263
Neg logit contributions
ishing-0.123
ap-0.114
 how-0.105
pieces-0.103
 victory-0.100
Token."
Token ID526
Feature activation+5.61e+01

Pos logit contributions
Don+0.227
Me+0.208
 She+0.204
Grand+0.197
Well+0.196
Neg logit contributions
ishing-0.205
pieces-0.189
ap-0.188
ipping-0.182
ish-0.180
Token Anna
Token ID11735
Feature activation+4.97e+01

Pos logit contributions
Don+0.217
chy+0.209
books+0.203
Grand+0.186
 meant+0.182
Neg logit contributions
 victory-0.216
ap-0.213
 such-0.206
 our-0.203
 how-0.199
Token said
Token ID531
Feature activation+5.33e+01

Pos logit contributions
books+0.162
chy+0.150
Me+0.146
jar+0.146
Don+0.144
Neg logit contributions
 how-0.235
ap-0.230
 friends-0.226
 able-0.224
 as-0.221
Token.
Token ID13
Feature activation+5.61e+01

Pos logit contributions
Don+0.176
 She+0.172
chy+0.153
Me+0.146
Grand+0.145
Neg logit contributions
ishing-0.272
pieces-0.264
ocol-0.256
ipping-0.245
ieve-0.245
Token
Token ID198
Feature activation+5.51e+01

Pos logit contributions
chy+0.220
 cost+0.194
grand+0.190
rated+0.188
books+0.188
Neg logit contributions
ap-0.221
 how-0.214
 as-0.214
 something-0.203
 friends-0.197
Token
Token ID198
Feature activation+5.46e+01

Pos logit contributions
chy+0.167
Don+0.164
 meant+0.150
 needed+0.150
books+0.147
Neg logit contributions
ap-0.260
ipping-0.249
ishing-0.247
 how-0.246
ooming-0.244
Token and
Token ID290
Feature activation+5.19e+01

Pos logit contributions
 She+0.145
Don+0.124
 meant+0.121
chy+0.118
 needed+0.118
Neg logit contributions
ishing-0.268
pieces-0.263
ric-0.255
ocol-0.254
ipping-0.251
Token happy
Token ID3772
Feature activation+5.04e+01

Pos logit contributions
Don+0.172
 She+0.157
Me+0.148
 needed+0.144
chy+0.144
Neg logit contributions
ishing-0.283
ipping-0.274
ooming-0.263
ig-0.258
pieces-0.255
Token and
Token ID290
Feature activation+5.20e+01

Pos logit contributions
 She+0.180
Don+0.174
 needed+0.151
Grand+0.149
 meant+0.147
Neg logit contributions
ishing-0.247
pieces-0.244
ipping-0.230
ocol-0.228
ieve-0.223
Token good
Token ID922
Feature activation+4.93e+01

Pos logit contributions
Don+0.197
 She+0.183
Me+0.174
chy+0.172
Grand+0.166
Neg logit contributions
ishing-0.247
pieces-0.231
ipping-0.230
ocol-0.226
ooming-0.219
Token things
Token ID1243
Feature activation+4.93e+01

Pos logit contributions
Don+0.195
 She+0.186
chy+0.171
Grand+0.171
Me+0.170
Neg logit contributions
ishing-0.217
pieces-0.210
ipping-0.198
ocol-0.195
ieve-0.193
Token,"
Token ID553
Feature activation+5.61e+01

Pos logit contributions
Don+0.214
 She+0.197
Me+0.193
chy+0.187
Grand+0.187
Neg logit contributions
ishing-0.199
pieces-0.188
ap-0.179
ipping-0.178
ocol-0.177
Token Sam
Token ID3409
Feature activation+5.04e+01

Pos logit contributions
Don+0.233
chy+0.227
Me+0.212
Grand+0.206
books+0.206
Neg logit contributions
pieces-0.227
ishing-0.217
ap-0.212
 victory-0.209
ocol-0.202
Token says
Token ID1139
Feature activation+5.34e+01

Pos logit contributions
Don+0.181
Me+0.167
chy+0.156
Grand+0.150
Doctor+0.149
Neg logit contributions
ishing-0.230
pieces-0.225
ap-0.220
 how-0.214
ieve-0.213
Token.
Token ID13
Feature activation+5.67e+01

Pos logit contributions
Don+0.167
 She+0.164
chy+0.145
Grand+0.142
Me+0.141
Neg logit contributions
ishing-0.280
pieces-0.265
ocol-0.259
ipping-0.254
ric-0.251
Token
Token ID198
Feature activation+5.58e+01

Pos logit contributions
chy+0.256
Don+0.210
Me+0.204
jar+0.202
 needed+0.197
Neg logit contributions
 how-0.215
 as-0.196
 friends-0.195
ap-0.193
 fishing-0.189
Token
Token ID198
Feature activation+5.54e+01

Pos logit contributions
Don+0.153
chy+0.150
Me+0.128
 needed+0.124
 meant+0.123
Neg logit contributions
ishing-0.291
ap-0.287
pieces-0.286
ipping-0.275
 how-0.274
Token.
Token ID13
Feature activation+5.38e+01

Pos logit contributions
Don+0.187
 She+0.169
Me+0.166
chy+0.162
Grand+0.159
Neg logit contributions
ishing-0.207
pieces-0.197
ap-0.190
ipping-0.187
ocol-0.186
Token They
Token ID1119
Feature activation+5.22e+01

Pos logit contributions
chy+0.185
Don+0.164
Doctor+0.154
 needed+0.146
ho+0.145
Neg logit contributions
ap-0.258
 how-0.248
 our-0.246
 victory-0.226
 as-0.225
Token smiled
Token ID13541
Feature activation+4.93e+01

Pos logit contributions
Don+0.182
 Did+0.178
books+0.173
 She+0.173
Me+0.170
Neg logit contributions
 able-0.218
 how-0.213
 friends-0.210
 exactly-0.200
 breath-0.200
Token and
Token ID290
Feature activation+5.26e+01

Pos logit contributions
Don+0.197
 She+0.183
Me+0.173
chy+0.171
Grand+0.170
Neg logit contributions
ishing-0.219
pieces-0.210
ocol-0.199
ipping-0.199
ap-0.196
Token laughed
Token ID13818
Feature activation+4.99e+01

Pos logit contributions
Don+0.174
chy+0.159
 She+0.159
Me+0.157
books+0.155
Neg logit contributions
ishing-0.246
 able-0.240
ap-0.236
 how-0.230
pieces-0.227
Token.
Token ID13
Feature activation+5.61e+01

Pos logit contributions
Don+0.201
 She+0.181
Me+0.180
chy+0.175
Grand+0.174
Neg logit contributions
ishing-0.217
pieces-0.206
ap-0.200
ipping-0.197
ocol-0.192
Token They
Token ID1119
Feature activation+5.47e+01

Pos logit contributions
chy+0.178
Don+0.161
Doctor+0.146
 needed+0.140
ho+0.139
Neg logit contributions
ap-0.277
 how-0.270
 our-0.257
 victory-0.244
 such-0.240
Token had
Token ID550
Feature activation+5.31e+01

Pos logit contributions
Don+0.182
 Did+0.181
Doctor+0.175
 She+0.175
books+0.171
Neg logit contributions
 able-0.227
 how-0.224
 friends-0.222
 exactly-0.207
ap-0.207
Token a
Token ID257
Feature activation+5.12e+01

Pos logit contributions
Don+0.255
 She+0.240
Me+0.234
chy+0.228
Grand+0.228
Neg logit contributions
ishing-0.185
pieces-0.172
 able-0.170
ap-0.169
 victory-0.165
Token great
Token ID1049
Feature activation+5.09e+01

Pos logit contributions
Don+0.215
 She+0.199
Me+0.191
chy+0.188
Grand+0.187
Neg logit contributions
ishing-0.213
pieces-0.203
ap-0.194
ipping-0.192
ocol-0.189
Token adventure
Token ID8855
Feature activation+5.21e+01

Pos logit contributions
Don+0.128
 She+0.116
Me+0.111
chy+0.108
Grand+0.104
Neg logit contributions
ishing-0.298
pieces-0.289
ocol-0.283
ipping-0.277
ieve-0.274
Token they
Token ID484
Feature activation+5.11e+01

Pos logit contributions
Don+0.176
 She+0.161
Me+0.154
chy+0.151
Grand+0.149
Neg logit contributions
ishing-0.244
pieces-0.233
ipping-0.224
ap-0.223
ocol-0.222
Token could
Token ID714
Feature activation+4.99e+01

Pos logit contributions
Don+0.111
Grand+0.095
 She+0.094
Me+0.092
Doctor+0.088
Neg logit contributions
ishing-0.304
pieces-0.299
ap-0.291
ieve-0.283
 able-0.283
Token play
Token ID711
Feature activation+4.93e+01

Pos logit contributions
 She+0.169
Maybe+0.140
+0.140
Don+0.138
chy+0.137
Neg logit contributions
ocol-0.287
ishing-0.286
isher-0.265
pieces-0.264
ipping-0.252
Token outside
Token ID2354
Feature activation+4.73e+01

Pos logit contributions
Don+0.222
 She+0.207
Maybe+0.198
Me+0.196
Grand+0.187
Neg logit contributions
ishing-0.192
ric-0.189
ocol-0.189
pieces-0.177
ipping-0.176
Token again
Token ID757
Feature activation+4.88e+01

Pos logit contributions
Don+0.234
Me+0.215
 She+0.212
chy+0.207
Grand+0.207
Neg logit contributions
ishing-0.160
pieces-0.150
ap-0.146
ipping-0.141
ieve-0.138
Token.
Token ID13
Feature activation+5.61e+01

Pos logit contributions
Don+0.222
Me+0.204
 She+0.202
chy+0.199
Well+0.194
Neg logit contributions
ishing-0.187
pieces-0.172
ap-0.169
ieve-0.161
ipping-0.160
Token He
Token ID679
Feature activation+5.39e+01

Pos logit contributions
chy+0.161
Don+0.147
books+0.141
ho+0.141
 rhy+0.136
Neg logit contributions
 how-0.250
 victory-0.248
ap-0.246
 such-0.240
 friends-0.238
Token hoped
Token ID10719
Feature activation+5.40e+01

Pos logit contributions
Don+0.183
books+0.179
Grand+0.176
 She+0.171
Wow+0.171
Neg logit contributions
ishing-0.237
 how-0.230
ish-0.223
ap-0.220
ig-0.220
Token they
Token ID484
Feature activation+5.49e+01

Pos logit contributions
Don+0.185
 She+0.171
Me+0.160
chy+0.158
Grand+0.155
Neg logit contributions
ishing-0.273
pieces-0.259
ipping-0.250
ap-0.248
ocol-0.247
Token could
Token ID714
Feature activation+5.36e+01

Pos logit contributions
Don+0.123
Grand+0.109
 She+0.105
Me+0.103
Doctor+0.103
Neg logit contributions
ishing-0.318
pieces-0.313
ap-0.306
 able-0.299
 how-0.299
Token be
Token ID307
Feature activation+5.22e+01

Pos logit contributions
 She+0.179
chy+0.152
Maybe+0.149
+0.147
Don+0.147
Neg logit contributions
ishing-0.310
ocol-0.309
isher-0.288
pieces-0.285
ipping-0.277
Token his
Token ID465
Feature activation-1.07e+02

Pos logit contributions
Don+0.207
 She+0.201
chy+0.186
 meant+0.182
 needed+0.181
Neg logit contributions
pieces-0.141
ocol-0.136
ishing-0.135
ipping-0.134
ooming-0.130
Token head
Token ID1182
Feature activation-1.03e+02

Pos logit contributions
ishing+0.436
pieces+0.410
ipping+0.399
ieve+0.387
ocol+0.387
Neg logit contributions
Don-0.456
 She-0.434
chy-0.422
Me-0.419
Grand-0.402
Token out
Token ID503
Feature activation-9.52e+01

Pos logit contributions
ishing+0.396
pieces+0.369
ap+0.366
ipping+0.360
 able+0.347
Neg logit contributions
Don-0.463
Me-0.420
 She-0.412
Grand-0.406
Doctor-0.402
Token of
Token ID286
Feature activation-8.69e+01

Pos logit contributions
ishing+0.373
pieces+0.354
ipping+0.335
ap+0.333
ocol+0.331
Neg logit contributions
Don-0.424
 She-0.395
Me-0.381
chy-0.376
Grand-0.372
Token his
Token ID465
Feature activation-9.11e+01

Pos logit contributions
ishing+0.385
pieces+0.383
ap+0.361
ipping+0.361
ish+0.357
Neg logit contributions
Don-0.337
 She-0.323
Me-0.296
Grand-0.293
chy-0.287
Token bur
Token ID4356
Feature activation-1.35e+02

Pos logit contributions
ishing+0.371
pieces+0.355
ipping+0.341
ieve+0.337
ocol+0.334
Neg logit contributions
Don-0.370
 She-0.364
chy-0.354
Me-0.346
 needed-0.337
Tokenrow
Token ID808
Feature activation-1.02e+02

Pos logit contributions
pieces+0.777
 victory+0.741
 winners+0.723
ieve+0.704
ap+0.688
Neg logit contributions
chy-0.370
 needed-0.325
 She-0.321
Don-0.307
 meant-0.291
Token and
Token ID290
Feature activation-6.43e+01

Pos logit contributions
ishing+0.444
pieces+0.426
ocol+0.403
ipping+0.403
ap+0.402
Neg logit contributions
Don-0.410
 She-0.381
Me-0.362
chy-0.362
Grand-0.355
Token saw
Token ID2497
Feature activation-6.96e+01

Pos logit contributions
ishing+0.322
pieces+0.308
ipping+0.296
ap+0.295
ocol+0.294
Neg logit contributions
Don-0.219
 She-0.198
Me-0.189
chy-0.185
Grand-0.183
Token something
Token ID1223
Feature activation-6.87e+01

Pos logit contributions
ishing+0.237
pieces+0.222
ipping+0.209
ap+0.208
ocol+0.205
Neg logit contributions
Don-0.349
 She-0.327
Me-0.318
chy-0.313
Grand-0.310
Token new
Token ID649
Feature activation-6.94e+01

Pos logit contributions
ishing+0.265
pieces+0.258
ocol+0.242
 victory+0.241
ieve+0.240
Neg logit contributions
Don-0.297
 She-0.275
Me-0.266
Grand-0.264
Doctor-0.263
Token children
Token ID1751
Feature activation-9.13e+01

Pos logit contributions
Don+0.164
 She+0.147
Me+0.146
books+0.139
Doctor+0.139
Neg logit contributions
ishing-0.275
 able-0.266
pieces-0.258
 how-0.257
ap-0.253
Token put
Token ID1234
Feature activation-8.04e+01

Pos logit contributions
ishing+0.437
pieces+0.419
ap+0.401
ipping+0.400
ocol+0.396
Neg logit contributions
Don-0.325
 She-0.295
Me-0.285
chy-0.278
Grand-0.276
Token on
Token ID319
Feature activation-7.97e+01

Pos logit contributions
ishing+0.292
pieces+0.274
ipping+0.260
ap+0.258
ocol+0.256
Neg logit contributions
Don-0.379
 She-0.356
Me-0.345
chy-0.339
Grand-0.333
Token their
Token ID511
Feature activation-8.80e+01

Pos logit contributions
ishing+0.332
pieces+0.301
ipping+0.300
ap+0.291
ieve+0.289
Neg logit contributions
Don-0.337
 She-0.327
Me-0.313
Grand-0.299
chy-0.297
Token bath
Token ID7837
Feature activation-9.44e+01

Pos logit contributions
ishing+0.316
pieces+0.294
ipping+0.282
ap+0.277
ocol+0.274
Neg logit contributions
Don-0.417
 She-0.392
Me-0.386
chy-0.381
Grand-0.371
Tokenro
Token ID305
Feature activation-1.34e+02

Pos logit contributions
ishing+0.475
pieces+0.450
ap+0.436
ipping+0.435
ocol+0.427
Neg logit contributions
Don-0.322
 She-0.287
Me-0.278
Grand-0.269
chy-0.268
Tokenbes
Token ID12636
Feature activation-1.05e+02

Pos logit contributions
pieces+0.671
ishing+0.648
ocol+0.634
ieve+0.623
ish+0.596
Neg logit contributions
Don-0.428
chy-0.414
 She-0.409
books-0.390
Me-0.381
Token and
Token ID290
Feature activation-6.34e+01

Pos logit contributions
ishing+0.335
ap+0.313
ipping+0.297
pieces+0.296
ish+0.290
Neg logit contributions
Don-0.548
Me-0.522
Well-0.480
Grand-0.480
Doctor-0.480
Token ran
Token ID4966
Feature activation-6.99e+01

Pos logit contributions
ishing+0.306
pieces+0.293
ap+0.281
ipping+0.279
ocol+0.275
Neg logit contributions
Don-0.223
 She-0.201
Me-0.197
chy-0.190
Grand-0.188
Token around
Token ID1088
Feature activation-6.81e+01

Pos logit contributions
ishing+0.245
pieces+0.234
ocol+0.228
ipping+0.217
ieve+0.215
Neg logit contributions
Don-0.341
 She-0.328
chy-0.309
Me-0.306
Grand-0.303
Token in
Token ID287
Feature activation-6.45e+01

Pos logit contributions
ishing+0.218
pieces+0.206
ap+0.201
ipping+0.193
 victory+0.190
Neg logit contributions
Don-0.350
 She-0.317
Grand-0.315
Me-0.314
chy-0.314
Token reward
Token ID6721
Feature activation-1.03e+02

Pos logit contributions
ishing+0.515
pieces+0.499
ieve+0.490
ipping+0.475
 able+0.467
Neg logit contributions
 She-0.420
Don-0.418
chy-0.386
 needed-0.381
Me-0.377
Token.
Token ID13
Feature activation-5.61e+01

Pos logit contributions
ishing+0.413
pieces+0.390
ipping+0.372
ap+0.369
ocol+0.367
Neg logit contributions
Don-0.451
 She-0.422
Me-0.404
chy-0.399
Grand-0.395
Token It
Token ID632
Feature activation-6.59e+01

Pos logit contributions
pieces+0.294
ap+0.282
 how+0.268
ishing+0.261
ocol+0.261
Neg logit contributions
chy-0.163
Don-0.155
Me-0.129
books-0.129
Grand-0.127
Token was
Token ID373
Feature activation-6.34e+01

Pos logit contributions
ishing+0.327
pieces+0.324
ap+0.310
ish+0.310
 how+0.310
Neg logit contributions
Don-0.204
Grand-0.186
 She-0.172
Doctor-0.171
books-0.170
Token ch
Token ID442
Feature activation-7.43e+01

Pos logit contributions
ishing+0.208
pieces+0.198
 able+0.193
ocol+0.189
 victory+0.187
Neg logit contributions
Don-0.317
 She-0.288
Me-0.288
Well-0.280
Grand-0.279
Tokenocol
Token ID4668
Feature activation-1.32e+02

Pos logit contributions
pieces+0.220
ishing+0.211
 Majesty+0.187
ric+0.186
 able+0.184
Neg logit contributions
 She-0.378
Don-0.378
chy-0.359
Me-0.337
Well-0.331
Tokenates
Token ID689
Feature activation-8.14e+01

Pos logit contributions
pieces+0.403
 victory+0.357
ap+0.337
ishing+0.331
latable+0.330
Neg logit contributions
Don-0.629
Grand-0.622
M-0.610
Doctor-0.601
books-0.592
Token and
Token ID290
Feature activation-5.59e+01

Pos logit contributions
ishing+0.289
pieces+0.272
ipping+0.256
ap+0.256
ocol+0.253
Neg logit contributions
Don-0.394
 She-0.365
Me-0.357
chy-0.349
Grand-0.349
Token sweets
Token ID42402
Feature activation-7.62e+01

Pos logit contributions
ishing+0.239
pieces+0.230
ap+0.217
ocol+0.217
ipping+0.215
Neg logit contributions
Don-0.224
 She-0.203
Me-0.197
Grand-0.195
chy-0.192
Token!
Token ID0
Feature activation-4.22e+01

Pos logit contributions
ishing+0.262
pieces+0.242
ipping+0.231
 able+0.230
ap+0.228
Neg logit contributions
Don-0.377
Me-0.339
 She-0.336
Grand-0.336
Doctor-0.328
Token
Token ID198
Feature activation-4.42e+01

Pos logit contributions
pieces+0.213
ap+0.202
ocol+0.193
ishing+0.192
 how+0.189
Neg logit contributions
chy-0.132
Don-0.125
books-0.110
Me-0.110
 needed-0.104
Token't
Token ID470
Feature activation-9.02e+01

Pos logit contributions
Don+0.223
 She+0.205
Me+0.201
chy+0.198
Grand+0.196
Neg logit contributions
ishing-0.181
pieces-0.173
ap-0.163
ipping-0.163
ocol-0.162
Token know
Token ID760
Feature activation-7.94e+01

Pos logit contributions
ishing+0.462
ipping+0.432
ocol+0.423
pieces+0.413
ooming+0.400
Neg logit contributions
Don-0.321
 She-0.313
 needed-0.299
chy-0.296
 meant-0.294
Token the
Token ID262
Feature activation-7.50e+01

Pos logit contributions
ishing+0.257
ipping+0.227
pieces+0.224
ap+0.212
ocol+0.210
Neg logit contributions
Don-0.412
 She-0.407
chy-0.379
Me-0.378
 needed-0.371
Token rh
Token ID9529
Feature activation-1.15e+02

Pos logit contributions
ishing+0.333
pieces+0.316
ipping+0.303
ap+0.301
ocol+0.298
Neg logit contributions
Don-0.297
 She-0.272
Me-0.264
chy-0.261
Grand-0.255
Tokenin
Token ID259
Feature activation-9.92e+01

Pos logit contributions
pieces+0.434
ishing+0.428
latable+0.421
 Majesty+0.416
 able+0.401
Neg logit contributions
Don-0.462
 needed-0.433
Doctor-0.431
Me-0.425
Well-0.409
Tokenoc
Token ID420
Feature activation-1.31e+02

Pos logit contributions
ishing+0.387
pieces+0.373
ish+0.358
 victory+0.347
ap+0.347
Neg logit contributions
Don-0.433
 She-0.406
Grand-0.385
Me-0.378
chy-0.376
Tokeneros
Token ID27498
Feature activation-9.15e+01

Pos logit contributions
pieces+0.392
 able+0.358
ap+0.348
ieve+0.344
ishing+0.342
Neg logit contributions
Don-0.610
ho-0.610
Doctor-0.596
 She-0.585
Me-0.580
Token was
Token ID373
Feature activation-7.39e+01

Pos logit contributions
ishing+0.490
pieces+0.479
ap+0.458
 able+0.455
ish+0.452
Neg logit contributions
Don-0.265
 She-0.231
Grand-0.230
Me-0.230
chy-0.212
Token wild
Token ID4295
Feature activation-7.49e+01

Pos logit contributions
ishing+0.315
pieces+0.300
ipping+0.289
ap+0.284
ocol+0.281
Neg logit contributions
Don-0.309
 She-0.286
chy-0.280
Me-0.271
Grand-0.268
Token.
Token ID13
Feature activation-4.69e+01

Pos logit contributions
pieces+0.363
ishing+0.361
ocol+0.346
ipping+0.337
ieve+0.336
Neg logit contributions
 She-0.258
Don-0.253
chy-0.238
 meant-0.227
 needed-0.222
Token Can
Token ID1680
Feature activation-5.58e+01

Pos logit contributions
ap+0.218
pieces+0.218
 how+0.211
ric+0.206
ocol+0.200
Neg logit contributions
chy-0.156
Don-0.134
books-0.123
Me-0.117
 rhy-0.116
Token Everyday
Token ID48154
Feature activation-9.36e+01

Pos logit contributions
Don+0.188
 She+0.167
Me+0.167
Grand+0.166
Doctor+0.165
Neg logit contributions
ishing-0.217
pieces-0.212
ap-0.204
 able-0.203
 how-0.200
Token he
Token ID339
Feature activation-8.01e+01

Pos logit contributions
ishing+0.478
pieces+0.460
ap+0.442
ipping+0.440
ocol+0.438
Neg logit contributions
Don-0.303
 She-0.269
Me-0.261
chy-0.256
Grand-0.253
Token would
Token ID561
Feature activation-7.24e+01

Pos logit contributions
ishing+0.431
pieces+0.424
ap+0.416
ish+0.407
 able+0.402
Neg logit contributions
Don-0.221
Grand-0.186
Me-0.184
 She-0.182
Wow-0.176
Token take
Token ID1011
Feature activation-7.08e+01

Pos logit contributions
ishing+0.308
pieces+0.278
ipping+0.274
ocol+0.272
ooming+0.255
Neg logit contributions
 She-0.308
Don-0.306
chy-0.291
Me-0.279
 needed-0.275
Token the
Token ID262
Feature activation-7.23e+01

Pos logit contributions
ishing+0.299
pieces+0.274
ipping+0.272
ric+0.265
ap+0.264
Neg logit contributions
Don-0.288
 She-0.284
Me-0.262
chy-0.257
Grand-0.251
Token av
Token ID1196
Feature activation-1.31e+02

Pos logit contributions
ishing+0.289
pieces+0.275
ipping+0.260
ap+0.259
ocol+0.258
Neg logit contributions
Don-0.316
 She-0.293
Me-0.284
chy-0.278
Grand-0.277
Tokenoc
Token ID420
Feature activation-1.31e+02

Pos logit contributions
ieve+0.695
ishing+0.687
ipping+0.686
ap+0.684
pieces+0.677
Neg logit contributions
Don-0.311
Doctor-0.308
 She-0.302
Grand-0.285
M-0.270
Tokenados
Token ID22484
Feature activation-9.19e+01

Pos logit contributions
ieve+0.396
pieces+0.394
ishing+0.385
ap+0.383
latable+0.378
Neg logit contributions
 She-0.577
Doctor-0.576
Don-0.566
Grand-0.549
ho-0.539
Token and
Token ID290
Feature activation-6.05e+01

Pos logit contributions
ishing+0.324
pieces+0.305
ap+0.286
ipping+0.286
ocol+0.283
Neg logit contributions
Don-0.445
 She-0.414
Me-0.405
chy-0.397
Grand-0.395
Token scatter
Token ID41058
Feature activation-7.27e+01

Pos logit contributions
ishing+0.266
ipping+0.247
pieces+0.239
ap+0.238
ooming+0.237
Neg logit contributions
Don-0.248
 She-0.237
Me-0.222
 needed-0.222
chy-0.218
Token them
Token ID606
Feature activation-6.94e+01

Pos logit contributions
ishing+0.227
pieces+0.210
ipping+0.202
ap+0.199
ieve+0.192
Neg logit contributions
Don-0.380
 She-0.363
Me-0.348
chy-0.342
Grand-0.335
Token he
Token ID339
Feature activation-8.01e+01

Pos logit contributions
ishing+0.478
pieces+0.460
ap+0.442
ipping+0.440
ocol+0.438
Neg logit contributions
Don-0.303
 She-0.269
Me-0.261
chy-0.256
Grand-0.253
Token would
Token ID561
Feature activation-7.24e+01

Pos logit contributions
ishing+0.431
pieces+0.424
ap+0.416
ish+0.407
 able+0.402
Neg logit contributions
Don-0.221
Grand-0.186
Me-0.184
 She-0.182
Wow-0.176
Token take
Token ID1011
Feature activation-7.08e+01

Pos logit contributions
ishing+0.308
pieces+0.278
ipping+0.274
ocol+0.272
ooming+0.255
Neg logit contributions
 She-0.308
Don-0.306
chy-0.291
Me-0.279
 needed-0.275
Token the
Token ID262
Feature activation-7.23e+01

Pos logit contributions
ishing+0.299
pieces+0.274
ipping+0.272
ric+0.265
ap+0.264
Neg logit contributions
Don-0.288
 She-0.284
Me-0.262
chy-0.257
Grand-0.251
Token av
Token ID1196
Feature activation-1.31e+02

Pos logit contributions
ishing+0.289
pieces+0.275
ipping+0.260
ap+0.259
ocol+0.258
Neg logit contributions
Don-0.316
 She-0.293
Me-0.284
chy-0.278
Grand-0.277
Tokenoc
Token ID420
Feature activation-1.31e+02

Pos logit contributions
ieve+0.695
ishing+0.687
ipping+0.686
ap+0.684
pieces+0.677
Neg logit contributions
Don-0.311
Doctor-0.308
 She-0.302
Grand-0.285
M-0.270
Tokenados
Token ID22484
Feature activation-9.19e+01

Pos logit contributions
ieve+0.396
pieces+0.394
ishing+0.385
ap+0.383
latable+0.378
Neg logit contributions
 She-0.577
Doctor-0.576
Don-0.566
Grand-0.549
ho-0.539
Token and
Token ID290
Feature activation-6.05e+01

Pos logit contributions
ishing+0.324
pieces+0.305
ap+0.286
ipping+0.286
ocol+0.283
Neg logit contributions
Don-0.445
 She-0.414
Me-0.405
chy-0.397
Grand-0.395
Token scatter
Token ID41058
Feature activation-7.27e+01

Pos logit contributions
ishing+0.266
ipping+0.247
pieces+0.239
ap+0.238
ooming+0.237
Neg logit contributions
Don-0.248
 She-0.237
Me-0.222
 needed-0.222
chy-0.218
Token them
Token ID606
Feature activation-6.94e+01

Pos logit contributions
ishing+0.227
pieces+0.210
ipping+0.202
ap+0.199
ieve+0.192
Neg logit contributions
Don-0.380
 She-0.363
Me-0.348
chy-0.342
Grand-0.335
Token around
Token ID1088
Feature activation-6.73e+01

Pos logit contributions
ishing+0.232
pieces+0.218
ipping+0.204
ap+0.203
ocol+0.203
Neg logit contributions
Don-0.347
 She-0.326
Me-0.316
chy-0.315
Grand-0.309
Token When
Token ID1649
Feature activation-5.91e+01

Pos logit contributions
pieces+0.286
ap+0.282
 how+0.269
ocol+0.263
ishing+0.254
Neg logit contributions
chy-0.142
Don-0.130
books-0.112
Me-0.110
Doctor-0.107
Token Daisy
Token ID40355
Feature activation-7.48e+01

Pos logit contributions
ishing+0.301
pieces+0.292
ap+0.281
ipping+0.278
ocol+0.278
Neg logit contributions
Don-0.191
 She-0.169
Me-0.164
chy-0.161
Grand-0.159
Token put
Token ID1234
Feature activation-6.70e+01

Pos logit contributions
ishing+0.326
ish+0.313
ipping+0.307
ap+0.304
pieces+0.303
Neg logit contributions
Don-0.282
Wow-0.251
Grand-0.250
Me-0.250
books-0.237
Token the
Token ID262
Feature activation-6.46e+01

Pos logit contributions
ishing+0.285
pieces+0.273
ap+0.261
ipping+0.258
ocol+0.254
Neg logit contributions
Don-0.274
 She-0.261
Me-0.247
chy-0.241
Grand-0.234
Token magn
Token ID7842
Feature activation-8.50e+01

Pos logit contributions
ishing+0.272
pieces+0.255
ipping+0.245
ap+0.245
 able+0.239
Neg logit contributions
Don-0.265
 She-0.244
Me-0.244
chy-0.241
Grand-0.231
Tokenifying
Token ID4035
Feature activation-1.28e+02

Pos logit contributions
ishing+0.243
pieces+0.224
ap+0.206
ocol+0.206
ipping+0.206
Neg logit contributions
Don-0.465
 She-0.448
chy-0.429
Me-0.425
Grand-0.421
Token glass
Token ID5405
Feature activation-8.37e+01

Pos logit contributions
ishing+0.674
pieces+0.645
ieve+0.628
ipping+0.617
immers+0.593
Neg logit contributions
 She-0.423
Don-0.402
Me-0.378
chy-0.359
Grand-0.352
Token down
Token ID866
Feature activation-7.07e+01

Pos logit contributions
ishing+0.280
pieces+0.255
ap+0.250
ipping+0.245
ish+0.241
Neg logit contributions
Don-0.431
Me-0.390
Grand-0.384
 She-0.371
Well-0.369
Token,
Token ID11
Feature activation-5.28e+01

Pos logit contributions
ishing+0.215
ocol+0.201
ap+0.200
pieces+0.199
ipping+0.194
Neg logit contributions
Don-0.364
Grand-0.341
chy-0.326
Me-0.325
Doctor-0.323
Token the
Token ID262
Feature activation-5.91e+01

Pos logit contributions
ishing+0.376
pieces+0.349
ipping+0.349
ocol+0.347
ap+0.346
Neg logit contributions
Don-0.089
 She-0.080
Me-0.062
chy-0.055
 needed-0.049
Token cater
Token ID20825
Feature activation-8.40e+01

Pos logit contributions
ishing+0.244
pieces+0.235
ocol+0.222
ap+0.222
ipping+0.220
Neg logit contributions
Don-0.247
 She-0.226
Me-0.215
Grand-0.215
chy-0.212
Token give
Token ID1577
Feature activation-8.03e+01

Pos logit contributions
ishing+0.341
ipping+0.316
pieces+0.303
ooming+0.299
immers+0.287
Neg logit contributions
Don-0.318
 She-0.317
chy-0.296
Me-0.292
 needed-0.276
Token to
Token ID284
Feature activation-6.94e+01

Pos logit contributions
ishing+0.299
pieces+0.285
ipping+0.278
ap+0.275
ieve+0.267
Neg logit contributions
Don-0.374
 She-0.351
Me-0.335
chy-0.330
 needed-0.320
Token the
Token ID262
Feature activation-7.43e+01

Pos logit contributions
ishing+0.356
ipping+0.332
ooming+0.320
ieve+0.313
ap+0.309
Neg logit contributions
 She-0.245
Don-0.232
Me-0.217
chy-0.212
 needed-0.203
Token rh
Token ID9529
Feature activation-1.11e+02

Pos logit contributions
ishing+0.298
pieces+0.287
ap+0.270
ocol+0.269
ipping+0.268
Neg logit contributions
Don-0.320
 She-0.295
Me-0.286
Grand-0.281
chy-0.280
Tokenin
Token ID259
Feature activation-9.64e+01

Pos logit contributions
ishing+0.414
pieces+0.401
 able+0.388
latable+0.385
ipping+0.381
Neg logit contributions
Don-0.459
Doctor-0.444
Grand-0.433
Me-0.432
 needed-0.404
Tokenoc
Token ID420
Feature activation-1.28e+02

Pos logit contributions
ishing+0.427
pieces+0.408
ap+0.388
ipping+0.387
ocol+0.385
Neg logit contributions
Don-0.382
 She-0.349
Me-0.338
chy-0.332
Grand-0.329
Tokeneros
Token ID27498
Feature activation-9.09e+01

Pos logit contributions
pieces+0.376
 able+0.348
ishing+0.334
ieve+0.331
ap+0.324
Neg logit contributions
ho-0.603
Don-0.601
Doctor-0.599
Me-0.576
 She-0.575
Token.
Token ID13
Feature activation-5.04e+01

Pos logit contributions
ishing+0.371
pieces+0.345
ipping+0.335
ocol+0.332
ap+0.326
Neg logit contributions
Don-0.390
 She-0.370
Me-0.349
chy-0.349
Grand-0.337
Token Lily
Token ID20037
Feature activation-6.66e+01

Pos logit contributions
pieces+0.236
 how+0.228
ap+0.225
 Majesty+0.214
ric+0.213
Neg logit contributions
chy-0.170
Don-0.149
 meant-0.132
Me-0.132
 needed-0.130
Token walked
Token ID6807
Feature activation-6.88e+01

Pos logit contributions
ishing+0.319
ap+0.296
ish+0.294
pieces+0.290
 how+0.289
Neg logit contributions
Don-0.233
Me-0.204
Grand-0.204
Wow-0.199
jar-0.194
Token to
Token ID284
Feature activation-5.85e+01

Pos logit contributions
ishing+0.217
pieces+0.196
ap+0.189
ipping+0.185
 able+0.179
Neg logit contributions
Don-0.357
chy-0.332
Me-0.329
 She-0.323
Grand-0.322
Token quiet
Token ID5897
Feature activation-1.06e+02

Pos logit contributions
 She+0.157
Don+0.156
chy+0.143
Me+0.140
Grand+0.135
Neg logit contributions
ishing-0.217
pieces-0.212
ieve-0.198
immers-0.197
ipping-0.197
Token compared
Token ID3688
Feature activation-8.97e+01

Pos logit contributions
ishing+0.491
pieces+0.472
ocol+0.453
ipping+0.451
ieve+0.444
Neg logit contributions
Don-0.388
 She-0.366
chy-0.344
Me-0.338
Grand-0.328
Token to
Token ID284
Feature activation-6.94e+01

Pos logit contributions
ishing+0.347
pieces+0.332
ipping+0.317
ap+0.314
ocol+0.309
Neg logit contributions
Don-0.404
 She-0.383
Me-0.361
chy-0.356
Grand-0.350
Token the
Token ID262
Feature activation-7.57e+01

Pos logit contributions
ishing+0.257
pieces+0.244
ap+0.230
ipping+0.229
ocol+0.228
Neg logit contributions
Don-0.323
 She-0.300
Me-0.292
chy-0.287
Grand-0.285
Token noisy
Token ID31210
Feature activation-8.50e+01

Pos logit contributions
ishing+0.330
pieces+0.312
ipping+0.299
ap+0.297
ocol+0.295
Neg logit contributions
Don-0.306
 She-0.281
Me-0.273
chy-0.269
Grand-0.264
Token se
Token ID384
Feature activation-1.28e+02

Pos logit contributions
ishing+0.361
pieces+0.343
ipping+0.326
ap+0.324
ocol+0.324
Neg logit contributions
Don-0.350
 She-0.324
Me-0.313
chy-0.308
Grand-0.304
Tokenag
Token ID363
Feature activation-9.77e+01

Pos logit contributions
ishing+0.400
ipping+0.390
ocol+0.370
 heights+0.369
 breath+0.363
Neg logit contributions
Don-0.570
chy-0.550
books-0.530
Me-0.517
Doctor-0.515
Tokenull
Token ID724
Feature activation-8.20e+01

Pos logit contributions
ishing+0.403
pieces+0.385
ipping+0.367
ocol+0.365
ieve+0.360
Neg logit contributions
Don-0.409
 She-0.384
Me-0.373
chy-0.367
Grand-0.356
Tokens
Token ID82
Feature activation-7.33e+01

Pos logit contributions
ishing+0.379
pieces+0.361
ocol+0.347
ipping+0.346
ap+0.344
Neg logit contributions
Don-0.307
 She-0.283
chy-0.270
Me-0.269
Grand-0.260
Token outside
Token ID2354
Feature activation-7.45e+01

Pos logit contributions
ishing+0.290
pieces+0.271
ap+0.266
ipping+0.259
ocol+0.252
Neg logit contributions
Don-0.326
 She-0.298
Me-0.293
Grand-0.290
Doctor-0.283
Token.
Token ID13
Feature activation-4.47e+01

Pos logit contributions
ishing+0.273
ap+0.260
pieces+0.260
ipping+0.248
ocol+0.240
Neg logit contributions
Don-0.343
Grand-0.311
Me-0.310
Doctor-0.303
Well-0.301
Token her
Token ID607
Feature activation-1.05e+02

Pos logit contributions
Don+0.169
 She+0.154
Me+0.147
chy+0.145
Grand+0.143
Neg logit contributions
ishing-0.214
pieces-0.204
ipping-0.197
ap-0.196
ocol-0.195
Token mom
Token ID1995
Feature activation-8.93e+01

Pos logit contributions
ishing+0.430
pieces+0.403
ipping+0.390
ocol+0.382
ieve+0.379
Neg logit contributions
Don-0.445
 She-0.424
Me-0.407
chy-0.406
Grand-0.391
Tokenmy
Token ID1820
Feature activation-9.61e+01

Pos logit contributions
ishing+0.404
pieces+0.392
ap+0.378
ipping+0.370
ish+0.364
Neg logit contributions
Don-0.341
Me-0.307
 She-0.293
chy-0.288
Grand-0.288
Token watched
Token ID7342
Feature activation-7.84e+01

Pos logit contributions
ishing+0.432
pieces+0.407
ap+0.401
ipping+0.394
ish+0.392
Neg logit contributions
Don-0.372
Me-0.341
Grand-0.321
Wow-0.320
 She-0.313
Token in
Token ID287
Feature activation-7.16e+01

Pos logit contributions
ishing+0.304
pieces+0.285
ipping+0.272
ap+0.269
ocol+0.266
Neg logit contributions
Don-0.351
 She-0.333
Me-0.315
chy-0.313
Grand-0.312
Token amaz
Token ID40642
Feature activation-1.27e+02

Pos logit contributions
ishing+0.295
pieces+0.278
ipping+0.265
ap+0.263
ish+0.262
Neg logit contributions
Don-0.309
 She-0.294
Me-0.276
Grand-0.269
chy-0.265
Tokenement
Token ID972
Feature activation-8.91e+01

Pos logit contributions
ric+0.596
 dam+0.554
 prisoners+0.553
immers+0.550
pieces+0.546
Neg logit contributions
 She-0.441
 said-0.359
Well-0.354
Grand-0.353
books-0.352
Token as
Token ID355
Feature activation-7.10e+01

Pos logit contributions
ishing+0.404
pieces+0.381
ocol+0.374
ieve+0.367
ipping+0.365
Neg logit contributions
 She-0.330
Don-0.324
chy-0.298
Grand-0.289
Me-0.282
Token the
Token ID262
Feature activation-6.97e+01

Pos logit contributions
ishing+0.340
pieces+0.333
ap+0.315
ocol+0.314
ipping+0.312
Neg logit contributions
Don-0.248
 She-0.220
Me-0.215
chy-0.213
Grand-0.209
Token machine
Token ID4572
Feature activation-7.92e+01

Pos logit contributions
ishing+0.291
pieces+0.274
ipping+0.262
ap+0.262
ocol+0.259
Neg logit contributions
Don-0.294
 She-0.272
Me-0.265
chy-0.259
Grand-0.257
Token flew
Token ID13112
Feature activation-7.41e+01

Pos logit contributions
pieces+0.349
ishing+0.347
ap+0.336
 how+0.331
ipping+0.326
Neg logit contributions
Don-0.304
Me-0.273
Grand-0.267
Doctor-0.265
Wow-0.260
Token as
Token ID355
Feature activation-5.88e+01

Pos logit contributions
ishing+0.312
pieces+0.299
ap+0.286
ipping+0.284
ocol+0.277
Neg logit contributions
Don-0.306
 She-0.276
Me-0.275
Grand-0.265
chy-0.260
Token he
Token ID339
Feature activation-5.78e+01

Pos logit contributions
ishing+0.307
pieces+0.293
ipping+0.282
ap+0.281
ocol+0.280
Neg logit contributions
Don-0.188
 She-0.169
Me-0.161
chy-0.158
Grand-0.155
Token opened
Token ID4721
Feature activation-6.42e+01

Pos logit contributions
ishing+0.292
pieces+0.275
 able+0.269
ieve+0.269
ap+0.265
Neg logit contributions
Don-0.181
 She-0.170
Me-0.164
Wow-0.159
Grand-0.159
Token the
Token ID262
Feature activation-5.76e+01

Pos logit contributions
ishing+0.268
pieces+0.263
ipping+0.251
ap+0.251
ieve+0.247
Neg logit contributions
 She-0.252
Don-0.247
chy-0.223
Grand-0.221
Me-0.218
Token ch
Token ID442
Feature activation-7.54e+01

Pos logit contributions
ishing+0.283
pieces+0.263
ipping+0.259
ap+0.255
ocol+0.251
Neg logit contributions
Don-0.199
 She-0.190
chy-0.181
Me-0.178
Grand-0.172
Tokenocol
Token ID4668
Feature activation-1.25e+02

Pos logit contributions
 able+0.193
pieces+0.190
 how+0.174
ellery+0.173
 such+0.171
Neg logit contributions
 She-0.324
chy-0.317
 He-0.317
 needed-0.311
 faded-0.307
Tokenates
Token ID689
Feature activation-7.41e+01

Pos logit contributions
pieces+0.411
 victory+0.374
ieve+0.353
latable+0.348
 able+0.336
Neg logit contributions
Grand-0.550
M-0.540
books-0.535
Doctor-0.528
Don-0.527
Token and
Token ID290
Feature activation-4.60e+01

Pos logit contributions
ishing+0.318
pieces+0.301
ocol+0.288
ap+0.286
ipping+0.285
Neg logit contributions
Don-0.301
 She-0.281
Me-0.266
chy-0.266
Grand-0.261
Token started
Token ID2067
Feature activation-5.76e+01

Pos logit contributions
ishing+0.224
ipping+0.208
pieces+0.208
ocol+0.204
ap+0.201
Neg logit contributions
Don-0.168
 She-0.155
Me-0.147
chy-0.145
Grand-0.141
Token eating
Token ID6600
Feature activation-6.04e+01

Pos logit contributions
ishing+0.222
pieces+0.210
ocol+0.199
ieve+0.197
ap+0.196
Neg logit contributions
Don-0.260
 She-0.249
Me-0.233
chy-0.232
Grand-0.229
Token them
Token ID606
Feature activation-5.58e+01

Pos logit contributions
ishing+0.218
pieces+0.205
ipping+0.194
ap+0.193
ocol+0.190
Neg logit contributions
Don-0.288
 She-0.270
Me-0.262
chy-0.257
Grand-0.254
Token�
Token ID129
Feature activation-1.38e+02

Pos logit contributions
Don+0.146
Grand+0.141
Doctor+0.133
Me+0.127
chy+0.126
Neg logit contributions
ap-0.226
ishing-0.224
ish-0.215
pieces-0.211
 how-0.210
Token�
Token ID241
Feature activation-1.10e+02

Pos logit contributions
ishing+0.555
pieces+0.523
ocol+0.494
ipping+0.494
ap+0.491
Neg logit contributions
Don-0.609
 She-0.569
Me-0.546
Grand-0.538
chy-0.537
TokenI
Token ID40
Feature activation-8.23e+01

Pos logit contributions
pieces+0.614
ishing+0.611
ap+0.607
ric+0.589
ipping+0.588
Neg logit contributions
Don-0.240
chy-0.229
 She-0.203
Doctor-0.195
Grand-0.194
Tokenâ
Token ID22940
Feature activation-1.01e+02

Pos logit contributions
ishing+0.372
pieces+0.355
ipping+0.339
ap+0.338
ocol+0.336
Neg logit contributions
Don-0.318
 She-0.292
Me-0.281
chy-0.276
Grand-0.273
Token€
Token ID26391
Feature activation-1.03e+02

Pos logit contributions
ishing+0.443
pieces+0.422
ap+0.404
ipping+0.403
ocol+0.398
Neg logit contributions
Don-0.411
 She-0.378
Me-0.366
chy-0.359
Grand-0.356
Tokenâ„¢
Token ID8151
Feature activation-1.24e+02

Pos logit contributions
ishing+0.483
pieces+0.453
ocol+0.435
ieve+0.433
ipping+0.429
Neg logit contributions
Don-0.392
 She-0.366
Me-0.347
Grand-0.341
chy-0.333
Tokenm
Token ID76
Feature activation-9.13e+01

Pos logit contributions
ap+0.516
pieces+0.513
ric+0.504
ipping+0.499
ishing+0.493
Neg logit contributions
Don-0.474
Grand-0.422
Doctor-0.405
chy-0.398
Me-0.398
Token sorry
Token ID7926
Feature activation-8.21e+01

Pos logit contributions
ishing+0.373
pieces+0.356
ocol+0.332
ipping+0.332
ap+0.328
Neg logit contributions
Don-0.391
 She-0.369
Me-0.351
chy-0.349
Grand-0.343
Token Jane
Token ID12091
Feature activation-8.47e+01

Pos logit contributions
ishing+0.328
pieces+0.310
ap+0.295
ipping+0.294
ocol+0.291
Neg logit contributions
Don-0.362
 She-0.334
Me-0.325
chy-0.319
Grand-0.317
Token.
Token ID13
Feature activation-5.57e+01

Pos logit contributions
ishing+0.375
pieces+0.355
ap+0.351
ish+0.349
ipping+0.342
Neg logit contributions
Don-0.331
Me-0.296
Grand-0.295
 She-0.285
Doctor-0.283
Token I
Token ID314
Feature activation-6.08e+01

Pos logit contributions
pieces+0.259
ap+0.258
ishing+0.256
ipping+0.249
ocol+0.245
Neg logit contributions
chy-0.178
Don-0.176
books-0.153
Grand-0.152
Me-0.152
Token giving
Token ID3501
Feature activation-6.60e+01

Pos logit contributions
ishing+0.205
pieces+0.192
ocol+0.181
ap+0.181
ipping+0.180
Neg logit contributions
Don-0.299
 She-0.278
Me-0.271
chy-0.267
Grand-0.264
Token him
Token ID683
Feature activation-6.36e+01

Pos logit contributions
ishing+0.244
pieces+0.234
ipping+0.224
ap+0.221
ocol+0.215
Neg logit contributions
Don-0.308
 She-0.292
Me-0.278
chy-0.273
Grand-0.270
Token too
Token ID1165
Feature activation-6.14e+01

Pos logit contributions
ishing+0.246
pieces+0.244
ipping+0.238
ooming+0.224
ocol+0.224
Neg logit contributions
Don-0.266
chy-0.260
 She-0.258
Me-0.246
 needed-0.242
Token many
Token ID867
Feature activation-5.74e+01

Pos logit contributions
ishing+0.261
ap+0.246
 victory+0.243
 able+0.243
pieces+0.242
Neg logit contributions
Don-0.244
Me-0.214
 She-0.206
Well-0.205
Grand-0.203
Token ch
Token ID442
Feature activation-7.37e+01

Pos logit contributions
ishing+0.239
pieces+0.225
ipping+0.222
ap+0.218
ooming+0.209
Neg logit contributions
Don-0.238
 She-0.225
chy-0.215
Me-0.215
Grand-0.210
Tokenocol
Token ID4668
Feature activation-1.24e+02

Pos logit contributions
pieces+0.199
ishing+0.161
ieve+0.161
 able+0.159
 victory+0.151
Neg logit contributions
 She-0.381
Don-0.369
chy-0.359
 needed-0.357
Doctor-0.347
Tokenates
Token ID689
Feature activation-7.46e+01

Pos logit contributions
pieces+0.414
 victory+0.362
latable+0.360
ieve+0.347
 able+0.322
Neg logit contributions
Grand-0.537
Don-0.533
Doctor-0.526
M-0.524
books-0.518
Token."
Token ID526
Feature activation-3.74e+01

Pos logit contributions
ishing+0.315
pieces+0.300
ipping+0.286
ap+0.284
ocol+0.283
Neg logit contributions
Don-0.306
 She-0.287
chy-0.275
Me-0.272
Grand-0.267
Token The
Token ID383
Feature activation-4.33e+01

Pos logit contributions
ap+0.177
pieces+0.174
 how+0.169
ipping+0.160
 friends+0.160
Neg logit contributions
chy-0.128
Don-0.111
books-0.109
 meant-0.102
jar-0.099
Token king
Token ID5822
Feature activation-5.62e+01

Pos logit contributions
ishing+0.191
pieces+0.186
ap+0.176
ocol+0.175
ipping+0.174
Neg logit contributions
Don-0.170
 She-0.155
Me-0.148
chy-0.147
Grand-0.144
Token learned
Token ID4499
Feature activation-4.77e+01

Pos logit contributions
ishing+0.302
pieces+0.287
ap+0.282
ish+0.276
ipping+0.275
Neg logit contributions
Don-0.171
 She-0.146
Me-0.144
Grand-0.137
chy-0.136
Token you
Token ID345
Feature activation-9.31e+01

Pos logit contributions
ishing+0.452
pieces+0.428
ipping+0.408
ap+0.405
ocol+0.405
Neg logit contributions
Don-0.457
 She-0.425
chy-0.406
Me-0.405
Grand-0.396
Token carry
Token ID3283
Feature activation-9.25e+01

Pos logit contributions
ishing+0.371
pieces+0.361
ap+0.359
 able+0.340
ieve+0.339
Neg logit contributions
Don-0.389
Me-0.358
Grand-0.356
 She-0.347
Doctor-0.342
Token a
Token ID257
Feature activation-8.49e+01

Pos logit contributions
ishing+0.312
pieces+0.302
ipping+0.286
ap+0.285
ieve+0.271
Neg logit contributions
Don-0.452
 She-0.432
Me-0.407
chy-0.407
 needed-0.398
Token l
Token ID300
Feature activation-9.86e+01

Pos logit contributions
ishing+0.391
pieces+0.372
ipping+0.358
ieve+0.353
ocol+0.352
Neg logit contributions
Don-0.315
chy-0.297
Me-0.293
 She-0.284
Grand-0.280
Tokenily
Token ID813
Feature activation-8.31e+01

Pos logit contributions
ishing+0.474
pieces+0.449
ipping+0.424
ocol+0.419
ap+0.419
Neg logit contributions
Don-0.345
 She-0.331
chy-0.312
Me-0.304
Grand-0.292
Token pad
Token ID14841
Feature activation-1.23e+02

Pos logit contributions
ishing+0.351
pieces+0.333
ipping+0.316
ap+0.316
ocol+0.313
Neg logit contributions
Don-0.345
 She-0.318
Me-0.308
chy-0.302
Grand-0.300
Token too
Token ID1165
Feature activation-7.62e+01

Pos logit contributions
ishing+0.512
pieces+0.491
ocol+0.468
ieve+0.464
ipping+0.458
Neg logit contributions
Don-0.513
 She-0.497
chy-0.468
Me-0.460
Grand-0.446
Token?"
Token ID1701
Feature activation-5.31e+01

Pos logit contributions
ishing+0.259
 able+0.248
 victory+0.244
ap+0.242
ipping+0.228
Neg logit contributions
Don-0.352
Me-0.332
Wow-0.312
Grand-0.312
 Did-0.312
Token
Token ID220
Feature activation-6.98e+01

Pos logit contributions
pieces+0.255
ap+0.252
 how+0.245
ipping+0.234
ishing+0.234
Neg logit contributions
chy-0.168
Don-0.153
books-0.148
jar-0.143
Me-0.135
Token
Token ID198
Feature activation-5.02e+01

Pos logit contributions
ap+0.345
pieces+0.337
ishing+0.332
ipping+0.327
ric+0.323
Neg logit contributions
Don-0.207
chy-0.193
Me-0.172
Grand-0.168
Wow-0.167
TokenThe
Token ID464
Feature activation-5.57e+01

Pos logit contributions
pieces+0.228
ap+0.217
 how+0.210
ishing+0.204
ipping+0.203
Neg logit contributions
chy-0.177
 meant-0.153
 needed-0.152
Don-0.144
 smelled-0.134
Token it
Token ID340
Feature activation-7.39e+01

Pos logit contributions
ishing+0.321
pieces+0.305
ipping+0.298
ap+0.295
ieve+0.287
Neg logit contributions
Don-0.316
 She-0.299
Me-0.286
chy-0.275
Grand-0.269
Token to
Token ID284
Feature activation-6.24e+01

Pos logit contributions
ishing+0.292
ipping+0.275
pieces+0.274
ocol+0.274
ieve+0.258
Neg logit contributions
Don-0.320
 She-0.312
chy-0.308
Me-0.290
 needed-0.289
Token the
Token ID262
Feature activation-6.97e+01

Pos logit contributions
ishing+0.268
pieces+0.256
ap+0.243
ipping+0.242
ocol+0.241
Neg logit contributions
Don-0.256
 She-0.234
Me-0.227
chy-0.223
Grand-0.221
Token rh
Token ID9529
Feature activation-1.05e+02

Pos logit contributions
ishing+0.285
pieces+0.273
ap+0.260
ipping+0.257
ocol+0.256
Neg logit contributions
Don-0.296
 She-0.273
Me-0.263
chy-0.258
Grand-0.257
Tokenin
Token ID259
Feature activation-9.22e+01

Pos logit contributions
pieces+0.392
ishing+0.389
latable+0.363
 able+0.362
ocol+0.360
Neg logit contributions
Don-0.434
Doctor-0.414
Grand-0.404
Me-0.403
 needed-0.388
Tokenoc
Token ID420
Feature activation-1.23e+02

Pos logit contributions
ishing+0.392
pieces+0.378
ap+0.362
ipping+0.353
ocol+0.350
Neg logit contributions
Don-0.383
 She-0.349
Me-0.337
chy-0.331
Grand-0.327
Tokeneros
Token ID27498
Feature activation-8.80e+01

Pos logit contributions
pieces+0.383
 able+0.334
ieve+0.326
ishing+0.320
ap+0.314
Neg logit contributions
ho-0.582
Don-0.580
Doctor-0.572
 She-0.557
Me-0.549
Token.
Token ID13
Feature activation-4.81e+01

Pos logit contributions
ishing+0.380
pieces+0.362
ap+0.344
ipping+0.344
ocol+0.341
Neg logit contributions
Don-0.356
 She-0.328
Me-0.317
chy-0.311
Grand-0.309
Token When
Token ID1649
Feature activation-5.45e+01

Pos logit contributions
pieces+0.214
 how+0.206
ap+0.205
ocol+0.195
ric+0.190
Neg logit contributions
chy-0.178
Don-0.162
 meant-0.140
Me-0.137
 needed-0.137
Token he
Token ID339
Feature activation-6.23e+01

Pos logit contributions
ishing+0.214
pieces+0.212
ap+0.202
 victory+0.202
ocol+0.200
Neg logit contributions
Don-0.231
chy-0.208
Me-0.206
 She-0.204
Grand-0.204
Token tasted
Token ID29187
Feature activation-6.67e+01

Pos logit contributions
ishing+0.289
 able+0.273
pieces+0.267
 how+0.266
ap+0.265
Neg logit contributions
Don-0.228
 She-0.212
Me-0.200
chy-0.196
Grand-0.192
Token there
Token ID612
Feature activation-7.29e+01

Pos logit contributions
ishing+0.316
pieces+0.297
ocol+0.279
ipping+0.277
ap+0.276
Neg logit contributions
Don-0.362
 She-0.343
Me-0.319
chy-0.313
Grand-0.313
Token was
Token ID373
Feature activation-6.96e+01

Pos logit contributions
ap+0.324
ishing+0.315
 how+0.305
ipping+0.297
ish+0.291
Neg logit contributions
Don-0.267
Grand-0.258
chy-0.253
Doctor-0.240
Me-0.236
Token a
Token ID257
Feature activation-6.61e+01

Pos logit contributions
ishing+0.236
pieces+0.220
ipping+0.207
ap+0.206
ocol+0.205
Neg logit contributions
Don-0.352
 She-0.329
Me-0.321
chy-0.315
Grand-0.314
Token rh
Token ID9529
Feature activation-1.04e+02

Pos logit contributions
ishing+0.268
pieces+0.261
ap+0.253
 victory+0.245
ipping+0.244
Neg logit contributions
Don-0.276
 She-0.254
chy-0.241
Me-0.237
Well-0.233
Tokenin
Token ID259
Feature activation-9.14e+01

Pos logit contributions
pieces+0.420
 Majesty+0.405
ishing+0.403
latable+0.384
ric+0.376
Neg logit contributions
Don-0.410
Doctor-0.398
Grand-0.382
Me-0.375
M-0.361
Tokenoc
Token ID420
Feature activation-1.23e+02

Pos logit contributions
ishing+0.389
pieces+0.371
ap+0.355
ipping+0.351
ocol+0.350
Neg logit contributions
Don-0.378
 She-0.348
Me-0.333
chy-0.330
Grand-0.322
Tokeneros
Token ID27498
Feature activation-8.67e+01

Pos logit contributions
pieces+0.423
ieve+0.355
ishing+0.353
ap+0.347
 able+0.343
Neg logit contributions
ho-0.579
Doctor-0.523
 She-0.519
Don-0.515
 He-0.505
Token who
Token ID508
Feature activation-6.78e+01

Pos logit contributions
ishing+0.352
pieces+0.334
ap+0.317
ipping+0.317
ocol+0.314
Neg logit contributions
Don-0.375
 She-0.346
Me-0.336
chy-0.330
Grand-0.327
Token was
Token ID373
Feature activation-6.87e+01

Pos logit contributions
 how+0.310
ishing+0.302
 able+0.295
ish+0.292
ric+0.290
Neg logit contributions
Don-0.213
Grand-0.196
jar-0.186
Doctor-0.184
 She-0.180
Token always
Token ID1464
Feature activation-6.79e+01

Pos logit contributions
ishing+0.311
pieces+0.297
ap+0.284
ipping+0.283
ocol+0.282
Neg logit contributions
Don-0.265
 She-0.242
Me-0.234
chy-0.229
Grand-0.227
Token naughty
Token ID45128
Feature activation-6.71e+01

Pos logit contributions
ishing+0.271
pieces+0.261
ipping+0.247
ocol+0.246
ap+0.245
Neg logit contributions
Don-0.299
 She-0.274
Me-0.267
chy-0.264
Grand-0.258
Token harder
Token ID7069
Feature activation-7.98e+01

Pos logit contributions
ishing+0.398
pieces+0.378
ipping+0.367
ap+0.362
ocol+0.362
Neg logit contributions
Don-0.276
 She-0.252
Me-0.240
chy-0.236
Grand-0.230
Token to
Token ID284
Feature activation-6.80e+01

Pos logit contributions
ishing+0.333
pieces+0.314
ipping+0.300
ap+0.300
ocol+0.297
Neg logit contributions
Don-0.337
 She-0.308
Me-0.303
chy-0.295
Grand-0.294
Token push
Token ID4574
Feature activation-7.61e+01

Pos logit contributions
ishing+0.285
pieces+0.268
ipping+0.254
ocol+0.251
ap+0.242
Neg logit contributions
Don-0.285
 She-0.272
chy-0.270
Me-0.255
Doctor-0.250
Token the
Token ID262
Feature activation-7.11e+01

Pos logit contributions
ishing+0.271
pieces+0.260
ap+0.242
ipping+0.241
ocol+0.239
Neg logit contributions
Don-0.363
 She-0.328
Me-0.328
Grand-0.325
chy-0.325
Token wheel
Token ID7825
Feature activation-8.13e+01

Pos logit contributions
ishing+0.313
pieces+0.294
ipping+0.283
ap+0.281
ocol+0.277
Neg logit contributions
Don-0.283
 She-0.262
chy-0.257
Me-0.255
Grand-0.246
Tokenbar
Token ID5657
Feature activation-1.23e+02

Pos logit contributions
ishing+0.281
pieces+0.265
ap+0.257
ipping+0.254
 able+0.243
Neg logit contributions
Don-0.402
Me-0.370
 She-0.361
Grand-0.356
Doctor-0.349
Tokenrow
Token ID808
Feature activation-8.75e+01

Pos logit contributions
ishing+0.400
pieces+0.398
ocol+0.372
ieve+0.362
ipping+0.356
Neg logit contributions
Don-0.595
 She-0.567
chy-0.554
Me-0.540
Grand-0.536
Token.
Token ID13
Feature activation-4.45e+01

Pos logit contributions
ishing+0.299
ipping+0.273
ap+0.273
pieces+0.270
 able+0.266
Neg logit contributions
Don-0.429
Me-0.398
Grand-0.390
Doctor-0.378
 She-0.377
Token But
Token ID887
Feature activation-4.89e+01

Pos logit contributions
pieces+0.202
ap+0.191
 how+0.184
ocol+0.179
ipping+0.174
Neg logit contributions
chy-0.161
Don-0.143
 meant-0.131
 rhy-0.129
books-0.127
Token her
Token ID607
Feature activation-5.81e+01

Pos logit contributions
ishing+0.184
pieces+0.174
ap+0.166
ipping+0.165
ocol+0.163
Neg logit contributions
Don-0.224
 She-0.207
Me-0.202
chy-0.200
Grand-0.199
Token backyard
Token ID24296
Feature activation-6.38e+01

Pos logit contributions
ishing+0.292
pieces+0.287
ocol+0.285
ieve+0.274
ipping+0.273
Neg logit contributions
Don-0.183
 needed-0.179
chy-0.173
 knew-0.168
Me-0.168
Token you
Token ID345
Feature activation-9.65e+01

Pos logit contributions
Don+0.239
 She+0.223
chy+0.220
Grand+0.219
Me+0.215
Neg logit contributions
ishing-0.178
pieces-0.171
ap-0.165
 able-0.161
 victory-0.160
Token go
Token ID467
Feature activation-8.50e+01

Pos logit contributions
ishing+0.418
pieces+0.387
ipping+0.377
ocol+0.372
ap+0.357
Neg logit contributions
Don-0.398
 She-0.375
chy-0.358
Me-0.352
 needed-0.344
Token,
Token ID11
Feature activation-6.19e+01

Pos logit contributions
ocol+0.410
pieces+0.397
ishing+0.394
ieve+0.360
isher+0.358
Neg logit contributions
 She-0.306
 needed-0.282
Don-0.277
 He-0.275
chy-0.272
Token my
Token ID616
Feature activation-7.74e+01

Pos logit contributions
ishing+0.282
pieces+0.254
ipping+0.251
ocol+0.247
ieve+0.239
Neg logit contributions
Don-0.252
 She-0.241
Me-0.224
chy-0.218
Grand-0.218
Token d
Token ID288
Feature activation-9.14e+01

Pos logit contributions
ishing+0.292
pieces+0.276
ipping+0.261
ocol+0.256
ap+0.253
Neg logit contributions
Don-0.353
 She-0.331
Me-0.326
chy-0.319
Grand-0.314
Tokenarl
Token ID7063
Feature activation-1.23e+02

Pos logit contributions
ishing+0.424
pieces+0.420
ieve+0.402
 able+0.381
 victory+0.375
Neg logit contributions
 She-0.314
Don-0.300
chy-0.285
Grand-0.272
Me-0.259
Tokenings
Token ID654
Feature activation-8.35e+01

Pos logit contributions
pieces+0.534
ishing+0.525
ieve+0.497
ocol+0.492
ipping+0.469
Neg logit contributions
Don-0.474
 She-0.468
chy-0.430
Me-0.424
Grand-0.412
Token.
Token ID13
Feature activation-5.06e+01

Pos logit contributions
ishing+0.223
ap+0.217
 how+0.216
ipping+0.204
pieces+0.203
Neg logit contributions
Don-0.438
Me-0.423
chy-0.394
Grand-0.392
Wow-0.390
Token Enjoy
Token ID18179
Feature activation-7.13e+01

Pos logit contributions
ishing+0.247
pieces+0.241
ap+0.234
ocol+0.231
ipping+0.229
Neg logit contributions
Don-0.170
chy-0.149
 She-0.148
Me-0.148
Grand-0.143
Token your
Token ID534
Feature activation-6.84e+01

Pos logit contributions
pieces+0.252
ishing+0.234
ieve+0.217
ipping+0.216
ooming+0.212
Neg logit contributions
 She-0.329
Don-0.328
Me-0.314
Grand-0.310
 woke-0.302
Token lunch
Token ID9965
Feature activation-7.61e+01

Pos logit contributions
ishing+0.252
pieces+0.244
ipping+0.227
 able+0.222
ieve+0.219
Neg logit contributions
Don-0.306
Me-0.290
 She-0.287
chy-0.285
 needed-0.276
Token�
Token ID129
Feature activation-1.38e+02

Pos logit contributions
 She+0.175
Don+0.166
chy+0.160
Grand+0.146
 meant+0.146
Neg logit contributions
pieces-0.218
ishing-0.201
ocol-0.198
ap-0.192
ieve-0.190
Token�
Token ID241
Feature activation-1.10e+02

Pos logit contributions
ishing+0.555
pieces+0.523
ocol+0.494
ipping+0.494
ap+0.491
Neg logit contributions
Don-0.609
 She-0.569
Me-0.546
Grand-0.538
chy-0.537
TokenThat
Token ID2504
Feature activation-7.95e+01

Pos logit contributions
pieces+0.614
ishing+0.611
ap+0.607
ric+0.589
ipping+0.588
Neg logit contributions
Don-0.240
chy-0.229
 She-0.203
Doctor-0.195
Grand-0.194
Tokenâ
Token ID22940
Feature activation-9.85e+01

Pos logit contributions
ishing+0.389
pieces+0.369
ipping+0.355
ocol+0.355
ap+0.350
Neg logit contributions
Don-0.281
 She-0.261
Me-0.247
chy-0.244
Grand-0.237
Token€
Token ID26391
Feature activation-1.00e+02

Pos logit contributions
ishing+0.439
pieces+0.413
ap+0.410
ipping+0.406
ocol+0.391
Neg logit contributions
Don-0.387
 She-0.356
Me-0.348
Grand-0.342
chy-0.336
Tokenâ„¢
Token ID8151
Feature activation-1.22e+02

Pos logit contributions
ishing+0.459
pieces+0.444
ap+0.429
ipping+0.425
ocol+0.419
Neg logit contributions
Don-0.374
 She-0.336
Me-0.327
chy-0.326
Grand-0.317
Tokens
Token ID82
Feature activation-8.64e+01

Pos logit contributions
ap+0.495
pieces+0.491
ric+0.484
ipping+0.478
ishing+0.478
Neg logit contributions
Don-0.486
Grand-0.457
Doctor-0.422
chy-0.408
jar-0.407
Token a
Token ID257
Feature activation-8.30e+01

Pos logit contributions
ishing+0.392
pieces+0.370
ipping+0.353
ocol+0.349
ieve+0.344
Neg logit contributions
Don-0.338
 She-0.321
Me-0.302
chy-0.300
Grand-0.288
Token special
Token ID2041
Feature activation-8.78e+01

Pos logit contributions
ishing+0.370
pieces+0.352
ipping+0.336
ap+0.335
ocol+0.334
Neg logit contributions
Don-0.326
 She-0.298
Me-0.290
chy-0.284
Grand-0.281
Token necklace
Token ID38987
Feature activation-9.28e+01

Pos logit contributions
ishing+0.375
pieces+0.358
ipping+0.339
ocol+0.337
ap+0.335
Neg logit contributions
Don-0.358
 She-0.334
Me-0.323
chy-0.318
Grand-0.311
Token,
Token ID11
Feature activation-6.01e+01

Pos logit contributions
ishing+0.328
pieces+0.308
ap+0.290
ipping+0.286
ish+0.284
Neg logit contributions
Don-0.454
Me-0.416
Grand-0.398
 She-0.397
Doctor-0.393
Token am
Token ID716
Feature activation-6.78e+01

Pos logit contributions
ishing+0.294
pieces+0.282
ap+0.278
ocol+0.266
ipping+0.266
Neg logit contributions
Don-0.265
 She-0.241
Grand-0.239
Me-0.236
Doctor-0.227
Token taking
Token ID2263
Feature activation-6.81e+01

Pos logit contributions
ishing+0.321
ipping+0.291
pieces+0.287
ooming+0.278
ocol+0.275
Neg logit contributions
Don-0.254
 She-0.254
chy-0.239
 needed-0.230
Me-0.229
Token the
Token ID262
Feature activation-6.81e+01

Pos logit contributions
ishing+0.246
pieces+0.227
ipping+0.221
ap+0.212
ocol+0.210
Neg logit contributions
Don-0.327
 She-0.309
Me-0.296
chy-0.289
Grand-0.286
Token rh
Token ID9529
Feature activation-1.02e+02

Pos logit contributions
ishing+0.270
pieces+0.255
ipping+0.242
ap+0.241
ocol+0.240
Neg logit contributions
Don-0.300
 She-0.277
Me-0.270
chy-0.268
Grand-0.263
Tokenin
Token ID259
Feature activation-9.00e+01

Pos logit contributions
pieces+0.399
ishing+0.393
latable+0.392
 Majesty+0.386
 able+0.373
Neg logit contributions
Don-0.370
Doctor-0.370
Grand-0.358
Me-0.353
 needed-0.341
Tokenoc
Token ID420
Feature activation-1.22e+02

Pos logit contributions
ishing+0.442
pieces+0.420
ipping+0.407
ocol+0.404
ap+0.397
Neg logit contributions
Don-0.307
 She-0.280
Me-0.273
chy-0.270
Grand-0.267
Tokeneros
Token ID27498
Feature activation-8.62e+01

Pos logit contributions
pieces+0.387
 able+0.345
ieve+0.337
ishing+0.325
ap+0.309
Neg logit contributions
ho-0.565
Doctor-0.534
Don-0.530
Me-0.512
 She-0.512
Token to
Token ID284
Feature activation-6.44e+01

Pos logit contributions
ishing+0.349
ocol+0.332
pieces+0.319
ipping+0.316
ieve+0.301
Neg logit contributions
Don-0.362
 She-0.344
chy-0.341
 needed-0.329
 woke-0.312
Token a
Token ID257
Feature activation-6.67e+01

Pos logit contributions
ishing+0.234
pieces+0.217
ipping+0.207
ap+0.203
ocol+0.202
Neg logit contributions
Don-0.309
 She-0.290
Me-0.281
chy-0.277
Grand-0.276
Token new
Token ID649
Feature activation-7.25e+01

Pos logit contributions
ishing+0.308
pieces+0.286
ipping+0.283
ocol+0.277
ap+0.274
Neg logit contributions
Don-0.253
 She-0.231
Me-0.229
Grand-0.227
chy-0.226
Token home
Token ID1363
Feature activation-7.35e+01

Pos logit contributions
ishing+0.294
pieces+0.293
ocol+0.285
ipping+0.282
ieve+0.274
Neg logit contributions
Don-0.285
Grand-0.282
Me-0.274
 She-0.273
 needed-0.265
Token magic
Token ID5536
Feature activation-5.82e+00

Pos logit contributions
Don+0.006
Grand+0.006
books+0.006
 Did+0.005
Well+0.005
Neg logit contributions
 victory-0.006
pieces-0.005
ishing-0.005
ish-0.005
 heights-0.005
Token tree
Token ID5509
Feature activation-4.36e+00

Pos logit contributions
ishing+0.032
pieces+0.031
ipping+0.030
ocol+0.029
ap+0.029
Neg logit contributions
Don-0.017
 She-0.016
Me-0.014
chy-0.014
Grand-0.014
Token?"
Token ID1701
Feature activation+1.28e+01

Pos logit contributions
ishing+0.017
ocol+0.016
pieces+0.015
ipping+0.015
ieve+0.015
Neg logit contributions
Don-0.019
 She-0.019
chy-0.018
Grand-0.017
 needed-0.017
Token
Token ID198
Feature activation+1.19e+01

Pos logit contributions
chy+0.048
jar+0.039
Don+0.039
books+0.036
Me+0.034
Neg logit contributions
 how-0.056
ap-0.055
pieces-0.055
 as-0.054
 able-0.052
Token
Token ID198
Feature activation+1.14e+01

Pos logit contributions
chy+0.039
Don+0.035
 needed+0.031
 meant+0.030
Wow+0.029
Neg logit contributions
ap-0.059
pieces-0.056
ipping-0.054
ishing-0.054
 victory-0.053
TokenHis
Token ID6653
Feature activation+6.56e+00

Pos logit contributions
Don+0.035
chy+0.033
 She+0.031
 needed+0.029
Me+0.028
Neg logit contributions
ishing-0.058
pieces-0.057
ap-0.057
ipping-0.055
 victory-0.054
Token mum
Token ID25682
Feature activation+4.34e+00

Pos logit contributions
Don+0.030
 She+0.028
Me+0.028
Grand+0.027
chy+0.027
Neg logit contributions
ishing-0.025
pieces-0.024
ocol-0.023
ipping-0.022
ieve-0.022
Token said
Token ID531
Feature activation+1.01e+01

Pos logit contributions
Don+0.015
 She+0.013
chy+0.013
Me+0.013
Doctor+0.012
Neg logit contributions
ishing-0.021
pieces-0.020
ipping-0.020
ap-0.020
 able-0.019
Token,
Token ID11
Feature activation+1.26e+01

Pos logit contributions
Don+0.048
 She+0.045
Me+0.043
chy+0.043
Grand+0.042
Neg logit contributions
ishing-0.037
pieces-0.035
ipping-0.033
ap-0.032
ocol-0.032
Token "
Token ID366
Feature activation+1.22e+01

Pos logit contributions
Don+0.039
chy+0.036
Wow+0.035
Me+0.035
Well+0.032
Neg logit contributions
 able-0.061
 how-0.061
ishing-0.059
 as-0.058
ap-0.058
TokenYes
Token ID5297
Feature activation+9.04e+00

Pos logit contributions
chy+0.047
 woke+0.040
 needed+0.039
books+0.039
 She+0.039
Neg logit contributions
pieces-0.051
ishing-0.050
ipping-0.047
 how-0.046
ap-0.046
Token day
Token ID1110
Feature activation+3.71e+01

Pos logit contributions
Don+0.090
 She+0.077
Me+0.072
chy+0.070
Grand+0.069
Neg logit contributions
ishing-0.228
pieces-0.220
ap-0.213
ipping-0.212
ocol-0.210
Token to
Token ID284
Feature activation+4.09e+01

Pos logit contributions
Don+0.164
 She+0.159
chy+0.148
Me+0.147
Grand+0.147
Neg logit contributions
ishing-0.146
pieces-0.139
ocol-0.132
ipping-0.130
ieve-0.128
Token pick
Token ID2298
Feature activation+3.56e+01

Pos logit contributions
 She+0.132
Me+0.115
 woke+0.112
 smelled+0.109
 low+0.108
Neg logit contributions
ishing-0.238
immers-0.224
isher-0.220
ipping-0.218
ooming-0.213
Token flowers
Token ID12734
Feature activation+3.42e+01

Pos logit contributions
 She+0.140
Don+0.128
Me+0.120
chy+0.114
 meant+0.113
Neg logit contributions
ishing-0.167
ipping-0.155
ooming-0.153
immers-0.153
pieces-0.152
Token from
Token ID422
Feature activation+3.77e+01

Pos logit contributions
 She+0.130
Don+0.123
chy+0.111
Me+0.108
 meant+0.103
Neg logit contributions
ishing-0.161
pieces-0.150
ocol-0.148
ipping-0.146
immers-0.145
Token her
Token ID607
Feature activation+3.84e+01

Pos logit contributions
Don+0.177
 She+0.162
Me+0.159
Grand+0.158
chy+0.156
Neg logit contributions
ishing-0.142
pieces-0.132
ap-0.124
ocol-0.124
ipping-0.122
Token garden
Token ID11376
Feature activation+3.33e+01

Pos logit contributions
Don+0.153
 She+0.150
Me+0.143
Grand+0.137
chy+0.136
Neg logit contributions
ishing-0.162
pieces-0.158
ieve-0.152
ocol-0.151
ipping-0.149
Token.
Token ID13
Feature activation+4.70e+01

Pos logit contributions
Don+0.146
Me+0.130
 She+0.130
chy+0.128
Grand+0.126
Neg logit contributions
ishing-0.137
pieces-0.128
ap-0.124
ipping-0.123
ocol-0.119
Token One
Token ID1881
Feature activation+4.44e+01

Pos logit contributions
Don+0.146
chy+0.126
Grand+0.117
Doctor+0.114
 meant+0.113
Neg logit contributions
pieces-0.232
ap-0.229
 how-0.218
ishing-0.218
ric-0.212
Token day
Token ID1110
Feature activation+4.25e+01

Pos logit contributions
Don+0.142
 She+0.119
 Did+0.117
Grand+0.116
Well+0.115
Neg logit contributions
ishing-0.224
ap-0.219
pieces-0.210
ish-0.209
 victory-0.204
Token,
Token ID11
Feature activation+4.47e+01

Pos logit contributions
Don+0.185
 She+0.180
Me+0.169
chy+0.163
Grand+0.162
Neg logit contributions
ishing-0.174
pieces-0.168
ieve-0.155
ocol-0.152
ipping-0.152
Token to
Token ID284
Feature activation+1.71e+01

Pos logit contributions
 She+0.049
Don+0.046
 He+0.039
+0.039
Maybe+0.037
Neg logit contributions
ocol-0.073
pieces-0.069
isher-0.066
ieve-0.066
ishing-0.065
Token an
Token ID281
Feature activation+1.41e+01

Pos logit contributions
Don+0.076
 She+0.070
Me+0.068
Grand+0.064
Maybe+0.064
Neg logit contributions
ishing-0.076
ipping-0.070
pieces-0.070
ocol-0.066
ish-0.066
Token old
Token ID1468
Feature activation+1.02e+01

Pos logit contributions
Don+0.049
 She+0.047
chy+0.043
Grand+0.043
Me+0.043
Neg logit contributions
ishing-0.069
pieces-0.065
ipping-0.064
ap-0.063
ocol-0.062
Token,
Token ID11
Feature activation+1.98e+01

Pos logit contributions
Don+0.037
 She+0.033
Me+0.032
chy+0.031
Grand+0.031
Neg logit contributions
ishing-0.049
pieces-0.047
ipping-0.045
ap-0.045
ocol-0.044
Token rusty
Token ID47855
Feature activation+1.12e+01

Pos logit contributions
Don+0.067
 She+0.064
Me+0.062
Grand+0.060
Well+0.059
Neg logit contributions
ishing-0.092
pieces-0.088
ocol-0.085
 victory-0.084
ap-0.083
Token shed
Token ID14999
Feature activation+1.03e+01

Pos logit contributions
Don+0.039
 She+0.038
chy+0.035
Me+0.034
Grand+0.034
Neg logit contributions
ishing-0.054
pieces-0.053
ieve-0.050
ipping-0.049
ap-0.049
Token.
Token ID13
Feature activation+2.53e+01

Pos logit contributions
Don+0.037
 She+0.034
chy+0.032
Me+0.032
Grand+0.031
Neg logit contributions
ishing-0.049
pieces-0.048
ocol-0.046
ipping-0.045
ap-0.045
Token The
Token ID383
Feature activation+2.22e+01

Pos logit contributions
Don+0.084
Grand+0.080
chy+0.079
Me+0.075
books+0.071
Neg logit contributions
ishing-0.116
pieces-0.116
ap-0.114
 able-0.109
ipping-0.108
Token mouse
Token ID10211
Feature activation+1.44e+01

Pos logit contributions
Don+0.097
Well+0.087
Grand+0.087
Me+0.085
books+0.085
Neg logit contributions
ishing-0.080
 victory-0.079
pieces-0.077
 friends-0.075
ish-0.075
Token went
Token ID1816
Feature activation+1.95e+01

Pos logit contributions
Don+0.049
Grand+0.044
 She+0.043
Wow+0.043
Me+0.043
Neg logit contributions
ishing-0.068
 able-0.065
pieces-0.064
ish-0.063
 how-0.063
Token inside
Token ID2641
Feature activation+1.72e+01

Pos logit contributions
 She+0.079
Don+0.078
 had+0.066
chy+0.066
 needed+0.066
Neg logit contributions
pieces-0.096
ocol-0.096
ishing-0.092
ap-0.090
 Majesty-0.089
Token and
Token ID290
Feature activation+4.87e+01

Pos logit contributions
Don+0.224
Me+0.206
Well+0.201
Wow+0.197
Grand+0.186
Neg logit contributions
ishing-0.109
ap-0.108
ipping-0.098
 able-0.096
 as-0.095
Token filled
Token ID5901
Feature activation+4.22e+01

Pos logit contributions
Don+0.210
 She+0.198
Me+0.193
books+0.191
Well+0.189
Neg logit contributions
ishing-0.177
 how-0.177
 able-0.165
pieces-0.163
 breath-0.159
Token the
Token ID262
Feature activation+4.30e+01

Pos logit contributions
Don+0.183
Me+0.172
 She+0.171
Grand+0.160
chy+0.156
Neg logit contributions
ishing-0.173
pieces-0.165
ipping-0.160
ocol-0.158
ieve-0.152
Token kitchen
Token ID9592
Feature activation+4.05e+01

Pos logit contributions
Don+0.177
 She+0.170
Me+0.166
Grand+0.160
chy+0.160
Neg logit contributions
ishing-0.177
pieces-0.172
ipping-0.163
ieve-0.162
ocol-0.162
Token.
Token ID13
Feature activation+5.26e+01

Pos logit contributions
Don+0.165
 She+0.156
chy+0.149
Me+0.149
Grand+0.146
Neg logit contributions
ishing-0.171
pieces-0.165
ocol-0.155
ipping-0.155
ap-0.153
Token The
Token ID383
Feature activation+5.02e+01

Pos logit contributions
chy+0.208
Don+0.203
ho+0.185
Well+0.166
 needed+0.165
Neg logit contributions
 our-0.199
 how-0.199
ap-0.197
 such-0.191
 victory-0.184
Token smoke
Token ID7523
Feature activation+4.58e+01

Pos logit contributions
Don+0.256
 She+0.240
Me+0.233
chy+0.230
Grand+0.228
Neg logit contributions
ishing-0.167
pieces-0.157
ipping-0.147
ap-0.146
ocol-0.145
Token alarm
Token ID10436
Feature activation+4.74e+01

Pos logit contributions
Don+0.183
 She+0.163
Grand+0.158
Me+0.157
Well+0.153
Neg logit contributions
ishing-0.197
ap-0.185
 how-0.184
ish-0.180
ipping-0.177
Token be
Token ID307
Feature activation+4.81e+01

Pos logit contributions
Don+0.217
Me+0.189
Well+0.183
Pr+0.180
Grand+0.179
Neg logit contributions
ishing-0.178
 how-0.176
 as-0.169
ap-0.163
ish-0.163
Tokenep
Token ID538
Feature activation+4.42e+01

Pos logit contributions
Don+0.240
 She+0.225
Me+0.219
chy+0.214
Grand+0.211
Neg logit contributions
ishing-0.166
pieces-0.154
ap-0.145
ipping-0.145
ocol-0.144
Tokened
Token ID276
Feature activation+4.39e+01

Pos logit contributions
Don+0.216
Me+0.196
 She+0.194
Well+0.188
Grand+0.187
Neg logit contributions
ishing-0.162
ap-0.147
pieces-0.146
ipping-0.144
 able-0.140
Token following
Token ID1708
Feature activation+3.49e+01

Pos logit contributions
Don+0.173
 She+0.160
Me+0.155
chy+0.153
Grand+0.152
Neg logit contributions
ishing-0.149
pieces-0.141
ipping-0.133
ap-0.133
ocol-0.132
Token the
Token ID262
Feature activation+3.55e+01

Pos logit contributions
Don+0.191
 She+0.176
chy+0.175
Me+0.171
Well+0.170
Neg logit contributions
ishing-0.105
pieces-0.101
ap-0.095
ieve-0.087
ocol-0.086
Token steam
Token ID13324
Feature activation+3.03e+01

Pos logit contributions
Don+0.174
 She+0.170
Me+0.158
chy+0.156
Well+0.154
Neg logit contributions
pieces-0.117
ishing-0.117
ap-0.107
 victory-0.105
ieve-0.104
Token in
Token ID287
Feature activation+3.50e+01

Pos logit contributions
Don+0.157
Me+0.143
 She+0.139
Grand+0.136
Well+0.136
Neg logit contributions
ishing-0.101
ap-0.097
pieces-0.094
ipping-0.092
ish-0.089
Token the
Token ID262
Feature activation+3.41e+01

Pos logit contributions
Don+0.193
chy+0.184
 She+0.175
Well+0.175
 Did+0.174
Neg logit contributions
pieces-0.092
ishing-0.091
ap-0.089
 victory-0.088
 how-0.085
Token sky
Token ID6766
Feature activation+3.21e+01

Pos logit contributions
Don+0.155
 She+0.145
Me+0.138
chy+0.138
Well+0.134
Neg logit contributions
ishing-0.128
pieces-0.125
ap-0.121
ipping-0.114
 victory-0.113
Token.
Token ID13
Feature activation+4.39e+01

Pos logit contributions
Don+0.129
 She+0.117
Me+0.114
chy+0.111
Grand+0.110
Neg logit contributions
ishing-0.142
pieces-0.135
ap-0.130
ipping-0.129
ocol-0.127
Token
Token ID198
Feature activation+4.11e+01

Pos logit contributions
chy+0.128
Don+0.123
 meant+0.107
Grand+0.105
 needed+0.101
Neg logit contributions
ap-0.225
pieces-0.218
ishing-0.206
ric-0.203
ipping-0.203
Token
Token ID198
Feature activation+4.05e+01

Pos logit contributions
chy+0.157
Don+0.150
 meant+0.130
 needed+0.128
 Did+0.126
Neg logit contributions
ap-0.167
 how-0.161
pieces-0.155
 what-0.153
 our-0.153
TokenAt
Token ID2953
Feature activation+4.13e+01

Pos logit contributions
chy+0.150
 She+0.139
Don+0.136
 meant+0.135
 needed+0.132
Neg logit contributions
ap-0.171
pieces-0.167
ishing-0.165
ipping-0.159
imming-0.152
Token the
Token ID262
Feature activation+3.81e+01

Pos logit contributions
Don+0.152
 She+0.146
Grand+0.137
Me+0.134
chy+0.130
Neg logit contributions
ishing-0.200
pieces-0.193
ocol-0.186
ish-0.180
ipping-0.179
Token.
Token ID13
Feature activation-4.57e+01

Pos logit contributions
ishing+0.257
pieces+0.238
ap+0.235
ish+0.227
 able+0.226
Neg logit contributions
Don-0.305
Me-0.282
Grand-0.273
Wow-0.268
Well-0.267
Token They
Token ID1119
Feature activation-5.16e+01

Pos logit contributions
pieces+0.220
ap+0.218
ocol+0.204
ishing+0.202
 how+0.198
Neg logit contributions
Don-0.146
chy-0.139
 needed-0.124
Me-0.122
books-0.121
Token match
Token ID2872
Feature activation-6.49e+01

Pos logit contributions
ishing+0.203
pieces+0.197
ap+0.187
ieve+0.183
ocol+0.181
Neg logit contributions
Don-0.225
 She-0.204
Me-0.202
Grand-0.198
chy-0.194
Token the
Token ID262
Feature activation-5.52e+01

Pos logit contributions
ishing+0.227
pieces+0.214
ipping+0.202
ap+0.199
ocol+0.198
Neg logit contributions
Don-0.315
 She-0.298
Me-0.288
chy-0.284
Grand-0.281
Token colors
Token ID7577
Feature activation-6.79e+01

Pos logit contributions
ishing+0.227
pieces+0.213
ipping+0.206
ap+0.203
ocol+0.201
Neg logit contributions
Don-0.235
 She-0.218
Me-0.212
chy-0.211
Grand-0.206
Token of
Token ID286
Feature activation-5.91e+01

Pos logit contributions
ishing+0.256
pieces+0.241
ap+0.229
ipping+0.228
ocol+0.225
Neg logit contributions
Don-0.316
 She-0.288
Me-0.284
Grand-0.277
chy-0.276
Token his
Token ID465
Feature activation-6.21e+01

Pos logit contributions
ishing+0.226
pieces+0.211
ipping+0.203
ap+0.200
ocol+0.195
Neg logit contributions
Don-0.268
 She-0.253
Me-0.245
chy-0.240
Grand-0.238
Token suit
Token ID6050
Feature activation-7.24e+01

Pos logit contributions
ishing+0.232
pieces+0.219
ap+0.206
ipping+0.206
ocol+0.205
Neg logit contributions
Don-0.290
 She-0.269
Me-0.261
chy-0.257
Grand-0.255
Token.
Token ID13
Feature activation-3.60e+01

Pos logit contributions
ishing+0.235
ap+0.227
pieces+0.224
 how+0.212
ish+0.209
Neg logit contributions
Don-0.400
Me-0.353
Wow-0.337
Well-0.335
Doctor-0.328
Token Red
Token ID2297
Feature activation-5.58e+01

Pos logit contributions
pieces+0.173
ap+0.170
ocol+0.160
 our+0.157
 how+0.157
Neg logit contributions
chy-0.120
Don-0.115
 needed-0.104
 knew-0.101
 woke-0.098
Token,
Token ID11
Feature activation-4.19e+01

Pos logit contributions
ishing+0.244
ap+0.222
ish+0.220
ipping+0.217
ieve+0.211
Neg logit contributions
Don-0.237
Grand-0.212
Me-0.204
jar-0.200
Doctor-0.199
Token to
Token ID284
Feature activation-1.64e+01

Pos logit contributions
ishing+0.076
pieces+0.070
ipping+0.068
ocol+0.067
ap+0.065
Neg logit contributions
Don-0.089
 She-0.085
Me-0.080
Grand-0.080
chy-0.078
Token try
Token ID1949
Feature activation-2.20e+01

Pos logit contributions
ishing+0.091
ocol+0.082
isher+0.081
immers+0.080
ipping+0.077
Neg logit contributions
 She-0.061
 rhy-0.056
 said-0.051
Me-0.049
-0.048
Token the
Token ID262
Feature activation-1.85e+01

Pos logit contributions
ishing+0.109
ocol+0.108
immers+0.106
isher+0.105
pieces+0.102
Neg logit contributions
 She-0.073
Grand-0.064
-0.064
Don-0.062
 V-0.060
Token slide
Token ID10649
Feature activation-2.71e+01

Pos logit contributions
ishing+0.071
pieces+0.067
ipping+0.064
ap+0.063
ocol+0.063
Neg logit contributions
Don-0.084
 She-0.078
Me-0.076
chy-0.075
Grand-0.074
Token?"
Token ID1701
Feature activation-7.61e+00

Pos logit contributions
ishing+0.081
pieces+0.075
ap+0.071
ipping+0.070
ish+0.067
Neg logit contributions
Don-0.148
 She-0.137
Me-0.136
chy-0.133
Grand-0.132
Token
Token ID198
Feature activation-8.06e+00

Pos logit contributions
pieces+0.032
 how+0.031
ap+0.030
ipping+0.030
ocol+0.029
Neg logit contributions
chy-0.031
Don-0.024
 needed-0.024
 meant-0.023
jar-0.023
Token
Token ID198
Feature activation-8.44e+00

Pos logit contributions
pieces+0.036
 how+0.036
ap+0.035
ishing+0.035
ipping+0.034
Neg logit contributions
chy-0.028
Don-0.024
 needed-0.023
 meant-0.023
 Did-0.021
TokenThe
Token ID464
Feature activation-1.13e+01

Pos logit contributions
pieces+0.038
ishing+0.036
ipping+0.036
ap+0.036
 how+0.036
Neg logit contributions
chy-0.031
 needed-0.027
 meant-0.026
 Did-0.023
 rhy-0.022
Token giant
Token ID6175
Feature activation-2.28e+01

Pos logit contributions
ishing+0.051
pieces+0.050
ap+0.048
ipping+0.047
ocol+0.047
Neg logit contributions
Don-0.043
 She-0.040
chy-0.038
Me-0.037
Grand-0.037
Token said
Token ID531
Feature activation-1.21e+01

Pos logit contributions
ishing+0.109
pieces+0.102
ap+0.101
ish+0.100
 how+0.098
Neg logit contributions
Don-0.083
 She-0.073
chy-0.071
Me-0.069
Grand-0.069
Token,
Token ID11
Feature activation-7.67e+00

Pos logit contributions
ishing+0.052
pieces+0.049
ocol+0.046
ric+0.045
ipping+0.045
Neg logit contributions
Don-0.050
 She-0.049
Me-0.045
chy-0.044
Grand-0.043
Token.
Token ID13
Feature activation-2.47e+01

Pos logit contributions
ishing+0.201
ap+0.188
pieces+0.183
 able+0.173
ish+0.172
Neg logit contributions
Don-0.254
Me-0.230
 She-0.223
Grand-0.222
Well-0.222
Token He
Token ID679
Feature activation-3.34e+01

Pos logit contributions
ap+0.128
pieces+0.127
ocol+0.125
ishing+0.123
ieve+0.121
Neg logit contributions
Don-0.066
chy-0.059
Me-0.057
books-0.056
Grand-0.052
Token can
Token ID460
Feature activation-3.35e+01

Pos logit contributions
ishing+0.162
pieces+0.153
ap+0.149
ieve+0.148
ish+0.147
Neg logit contributions
Don-0.117
 She-0.105
Me-0.103
Grand-0.102
Doctor-0.096
Token't
Token ID470
Feature activation-3.57e+01

Pos logit contributions
ishing+0.147
pieces+0.136
ipping+0.131
ocol+0.126
ap+0.123
Neg logit contributions
Don-0.141
 She-0.137
chy-0.131
Me-0.128
Grand-0.125
Token hurt
Token ID5938
Feature activation-3.90e+01

Pos logit contributions
ishing+0.139
pieces+0.131
ipping+0.125
ap+0.122
ocol+0.122
Neg logit contributions
Don-0.162
 She-0.152
chy-0.146
Me-0.146
Grand-0.143
Token us
Token ID514
Feature activation-3.91e+01

Pos logit contributions
ishing+0.121
pieces+0.113
ipping+0.105
ap+0.104
ocol+0.103
Neg logit contributions
Don-0.208
 She-0.195
Me-0.190
chy-0.187
Grand-0.186
Token."
Token ID526
Feature activation-2.02e+01

Pos logit contributions
ishing+0.157
pieces+0.147
ap+0.142
ipping+0.140
ocol+0.137
Neg logit contributions
Don-0.173
 She-0.158
Me-0.156
Grand-0.151
chy-0.150
Token
Token ID198
Feature activation-2.21e+01

Pos logit contributions
pieces+0.095
ishing+0.095
ap+0.094
 how+0.089
ipping+0.088
Neg logit contributions
Don-0.062
chy-0.061
Me-0.055
Grand-0.053
books-0.053
Token
Token ID198
Feature activation-2.25e+01

Pos logit contributions
pieces+0.098
ishing+0.097
 how+0.096
ap+0.095
igs+0.093
Neg logit contributions
chy-0.069
Don-0.065
 meant-0.061
 rhy-0.058
 needed-0.058
TokenL
Token ID43
Feature activation-3.69e+01

Pos logit contributions
ishing+0.123
pieces+0.122
ap+0.118
ipping+0.113
immers+0.112
Neg logit contributions
chy-0.052
 meant-0.048
Don-0.048
 rhy-0.046
 needed-0.045
Tokenily
Token ID813
Feature activation-3.15e+01

Pos logit contributions
ishing+0.247
pieces+0.241
ap+0.233
ipping+0.232
ocol+0.231
Neg logit contributions
Don-0.063
 She-0.057
Me-0.045
chy-0.045
Grand-0.043
Token they
Token ID484
Feature activation-2.67e+00

Pos logit contributions
Don+0.000
 She+0.000
Me+0.000
chy+0.000
Grand+0.000
Neg logit contributions
ishing-0.001
ocol-0.001
ipping-0.001
pieces-0.001
ieve-0.001
Token succeeded
Token ID14131
Feature activation-9.52e+00

Pos logit contributions
ishing+0.013
pieces+0.013
ap+0.012
 able+0.012
ish+0.012
Neg logit contributions
Don-0.009
 She-0.008
Me-0.008
chy-0.008
Grand-0.007
Token!
Token ID0
Feature activation+5.36e+00

Pos logit contributions
pieces+0.042
 Majesty+0.041
ishing+0.041
ocol+0.041
ric+0.039
Neg logit contributions
Don-0.036
 She-0.031
 needed-0.031
Grand-0.030
chy-0.030
Token The
Token ID383
Feature activation+2.08e+00

Pos logit contributions
Don+0.017
chy+0.015
Grand+0.015
Doctor+0.014
Me+0.014
Neg logit contributions
pieces-0.028
ap-0.026
ishing-0.025
 how-0.024
 victory-0.024
Token rice
Token ID11464
Feature activation-1.05e+01

Pos logit contributions
Don+0.009
Me+0.008
chy+0.008
 She+0.008
 needed+0.008
Neg logit contributions
ishing-0.009
pieces-0.008
ocol-0.008
ipping-0.008
ric-0.008
Token turned
Token ID2900
Feature activation-6.79e+00

Pos logit contributions
ishing+0.055
pieces+0.052
ap+0.051
ipping+0.051
 able+0.051
Neg logit contributions
Don-0.032
 She-0.029
Grand-0.028
Me-0.027
Doctor-0.026
Token out
Token ID503
Feature activation-6.98e+00

Pos logit contributions
ishing+0.018
pieces+0.016
 able+0.015
ish+0.015
ap+0.015
Neg logit contributions
Don-0.039
Me-0.037
Grand-0.036
 She-0.035
Well-0.035
Token perfectly
Token ID7138
Feature activation-1.08e+01

Pos logit contributions
ishing+0.024
pieces+0.022
ap+0.022
 able+0.021
 victory+0.021
Neg logit contributions
Don-0.035
Me-0.032
chy-0.031
Doctor-0.031
Grand-0.031
Token and
Token ID290
Feature activation+4.45e+00

Pos logit contributions
pieces+0.045
ishing+0.045
ocol+0.043
ap+0.041
ipping+0.041
Neg logit contributions
Don-0.044
 She-0.042
chy-0.040
Grand-0.039
Me-0.038
Token everyone
Token ID2506
Feature activation-1.03e+00

Pos logit contributions
Don+0.014
 She+0.013
Me+0.012
chy+0.012
Grand+0.012
Neg logit contributions
ishing-0.023
pieces-0.022
ap-0.021
ipping-0.021
 able-0.021
Token enjoyed
Token ID8359
Feature activation+1.25e+00

Pos logit contributions
ishing+0.006
pieces+0.005
ap+0.005
 able+0.005
ipping+0.005
Neg logit contributions
Don-0.003
 She-0.003
Grand-0.003
chy-0.003
Me-0.003
Token The
Token ID383
Feature activation-4.87e+01

Pos logit contributions
ap+0.197
 how+0.192
pieces+0.191
 friends+0.178
ocol+0.178
Neg logit contributions
chy-0.161
books-0.138
Don-0.135
jar-0.129
 meant-0.121
Token bird
Token ID6512
Feature activation-6.14e+01

Pos logit contributions
ishing+0.188
pieces+0.180
ap+0.179
ish+0.177
ocol+0.177
Neg logit contributions
Don-0.203
 She-0.182
chy-0.182
books-0.177
Grand-0.176
Token thought
Token ID1807
Feature activation-5.24e+01

Pos logit contributions
ishing+0.350
ish+0.326
 how+0.323
ap+0.322
pieces+0.322
Neg logit contributions
Don-0.160
Grand-0.141
Me-0.136
Doctor-0.130
books-0.129
Token and
Token ID290
Feature activation-4.07e+01

Pos logit contributions
ishing+0.149
pieces+0.143
ocol+0.135
ap+0.130
ieve+0.129
Neg logit contributions
Don-0.292
Me-0.269
Grand-0.268
 She-0.264
Wow-0.258
Token thought
Token ID1807
Feature activation-5.16e+01

Pos logit contributions
 how+0.213
ishing+0.210
 able+0.194
 Majesty+0.192
pieces+0.190
Neg logit contributions
chy-0.112
Don-0.106
Me-0.100
books-0.098
Grand-0.095
Token,
Token ID11
Feature activation-3.80e+01

Pos logit contributions
ishing+0.183
pieces+0.175
ocol+0.165
ap+0.164
ipping+0.164
Neg logit contributions
Don-0.252
Me-0.229
 She-0.228
Grand-0.224
chy-0.218
Token and
Token ID290
Feature activation-3.81e+01

Pos logit contributions
 how+0.179
 as+0.167
pieces+0.167
 able+0.166
ishing+0.164
Neg logit contributions
chy-0.124
Don-0.118
Me-0.117
jar-0.115
books-0.113
Token then
Token ID788
Feature activation-4.07e+01

Pos logit contributions
 how+0.148
ishing+0.144
 able+0.142
 as+0.132
pieces+0.131
Neg logit contributions
chy-0.161
Don-0.151
books-0.151
Me-0.149
 She-0.145
Token it
Token ID340
Feature activation-4.77e+01

Pos logit contributions
ishing+0.211
 able+0.209
 how+0.205
pieces+0.200
ap+0.195
Neg logit contributions
Don-0.117
Me-0.111
chy-0.105
books-0.104
Grand-0.101
Token remembered
Token ID12086
Feature activation-4.29e+01

Pos logit contributions
ishing+0.220
 able+0.201
 how+0.199
pieces+0.191
ieve+0.189
Neg logit contributions
 She-0.174
Don-0.166
Me-0.158
 Did-0.155
Grand-0.147
Token the
Token ID262
Feature activation-4.00e+01

Pos logit contributions
ishing+0.155
pieces+0.148
ipping+0.140
ocol+0.137
ap+0.136
Neg logit contributions
Don-0.210
 She-0.202
chy-0.186
Me-0.184
 needed-0.183
Token the
Token ID262
Feature activation-8.44e+01

Pos logit contributions
ishing+0.318
pieces+0.299
ocol+0.294
ipping+0.277
ieve+0.274
Neg logit contributions
Don-0.373
 She-0.360
Me-0.345
chy-0.339
Grand-0.324
Token game
Token ID983
Feature activation-8.94e+01

Pos logit contributions
ishing+0.362
pieces+0.336
ocol+0.328
ipping+0.325
ap+0.323
Neg logit contributions
Don-0.350
chy-0.320
Me-0.319
 She-0.318
Grand-0.304
Token some
Token ID617
Feature activation-7.75e+01

Pos logit contributions
ishing+0.317
pieces+0.305
ap+0.293
ipping+0.283
ieve+0.273
Neg logit contributions
Don-0.436
 She-0.393
Me-0.391
Grand-0.385
chy-0.376
Token more
Token ID517
Feature activation-8.09e+01

Pos logit contributions
ishing+0.304
ocol+0.278
ipping+0.276
pieces+0.274
ric+0.270
Neg logit contributions
Don-0.335
Me-0.316
 She-0.308
chy-0.307
Grand-0.291
Token and
Token ID290
Feature activation-6.57e+01

Pos logit contributions
ishing+0.351
pieces+0.328
ipping+0.321
ocol+0.318
ap+0.313
Neg logit contributions
Don-0.321
 She-0.303
chy-0.292
Me-0.292
Grand-0.279
Token Sam
Token ID3409
Feature activation-7.48e+01

Pos logit contributions
ishing+0.291
ocol+0.276
ipping+0.271
pieces+0.264
ap+0.262
Neg logit contributions
Don-0.275
 She-0.251
Me-0.247
chy-0.237
 needed-0.230
Token got
Token ID1392
Feature activation-7.61e+01

Pos logit contributions
ishing+0.326
pieces+0.324
 able+0.321
 how+0.318
ish+0.313
Neg logit contributions
Don-0.274
Grand-0.265
jar-0.263
Wow-0.256
Doctor-0.251
Token very
Token ID845
Feature activation-7.33e+01

Pos logit contributions
ishing+0.321
pieces+0.297
ocol+0.291
ipping+0.290
ap+0.287
Neg logit contributions
Don-0.324
 She-0.301
Me-0.285
chy-0.282
 needed-0.273
Token sleepy
Token ID42023
Feature activation-7.41e+01

Pos logit contributions
ishing+0.346
pieces+0.329
ap+0.325
ipping+0.315
ocol+0.313
Neg logit contributions
Don-0.269
 She-0.242
Grand-0.231
Me-0.230
Doctor-0.225
Token.
Token ID13
Feature activation-4.31e+01

Pos logit contributions
ishing+0.312
pieces+0.292
ap+0.283
ipping+0.283
 able+0.273
Neg logit contributions
Don-0.310
Me-0.283
 She-0.277
Grand-0.267
chy-0.266
Token
Token ID220
Feature activation-6.39e+01

Pos logit contributions
ap+0.209
pieces+0.204
 how+0.200
ric+0.192
 Majesty+0.187
Neg logit contributions
chy-0.132
Don-0.126
Grand-0.117
jar-0.112
books-0.111
Token a
Token ID257
Feature activation-4.88e+01

Pos logit contributions
ishing+0.175
pieces+0.165
ipping+0.159
ap+0.158
ocol+0.155
Neg logit contributions
Don-0.176
 She-0.163
Me-0.158
chy-0.155
Grand-0.153
Token big
Token ID1263
Feature activation-5.61e+01

Pos logit contributions
ishing+0.207
pieces+0.203
ocol+0.194
ap+0.192
ipping+0.189
Neg logit contributions
Don-0.196
 She-0.176
Me-0.167
chy-0.165
Well-0.165
Token bed
Token ID3996
Feature activation-5.90e+01

Pos logit contributions
ishing+0.241
pieces+0.230
ipping+0.217
ap+0.215
ieve+0.215
Neg logit contributions
Don-0.227
 She-0.215
Me-0.205
chy-0.203
Grand-0.199
Token with
Token ID351
Feature activation-4.84e+01

Pos logit contributions
ishing+0.196
ap+0.182
ipping+0.176
pieces+0.176
 able+0.168
Neg logit contributions
Don-0.305
Me-0.274
Well-0.262
 She-0.261
Grand-0.259
Token soft
Token ID2705
Feature activation-5.98e+01

Pos logit contributions
pieces+0.164
ishing+0.164
ocol+0.159
 how+0.146
 able+0.145
Neg logit contributions
Don-0.233
 She-0.206
chy-0.203
Me-0.202
Well-0.202
Token pill
Token ID9582
Feature activation-6.59e+01

Pos logit contributions
pieces+0.274
ishing+0.267
ieve+0.265
 able+0.254
ap+0.252
Neg logit contributions
 She-0.232
Don-0.210
 woke-0.207
chy-0.200
Me-0.198
Tokenows
Token ID1666
Feature activation-6.46e+01

Pos logit contributions
ishing+0.268
pieces+0.246
ap+0.242
ipping+0.238
ocol+0.235
Neg logit contributions
Don-0.300
Me-0.269
 She-0.256
Grand-0.252
Well-0.250
Token and
Token ID290
Feature activation-4.23e+01

Pos logit contributions
ishing+0.237
pieces+0.213
ipping+0.212
ap+0.210
 able+0.203
Neg logit contributions
Don-0.310
Me-0.281
Grand-0.275
Doctor-0.272
 She-0.272
Token a
Token ID257
Feature activation-4.70e+01

Pos logit contributions
ishing+0.195
ipping+0.182
pieces+0.179
ap+0.178
ieve+0.171
Neg logit contributions
Don-0.169
 She-0.160
Me-0.153
Grand-0.151
chy-0.150
Token big
Token ID1263
Feature activation-5.31e+01

Pos logit contributions
ishing+0.201
pieces+0.196
ap+0.185
ocol+0.185
ipping+0.183
Neg logit contributions
Don-0.189
 She-0.172
Me-0.165
chy-0.162
Grand-0.160
Token garden
Token ID11376
Feature activation-5.86e+01

Pos logit contributions
ishing+0.233
pieces+0.222
ipping+0.210
ieve+0.209
ap+0.207
Neg logit contributions
Don-0.210
 She-0.200
chy-0.190
Me-0.189
Grand-0.184
Token of
Token ID286
Feature activation-6.35e+01

Pos logit contributions
ishing+0.261
pieces+0.250
ap+0.234
ipping+0.231
ocol+0.229
Neg logit contributions
Don-0.364
Me-0.330
 She-0.329
chy-0.320
Grand-0.319
Token animals
Token ID4695
Feature activation-6.78e+01

Pos logit contributions
ishing+0.250
ipping+0.230
ap+0.222
ooming+0.220
pieces+0.219
Neg logit contributions
 She-0.283
Don-0.283
Me-0.262
Grand-0.253
chy-0.252
Token,
Token ID11
Feature activation-4.73e+01

Pos logit contributions
ishing+0.281
pieces+0.269
ap+0.255
ipping+0.254
ocol+0.251
Neg logit contributions
Don-0.287
 She-0.261
Me-0.257
Grand-0.250
chy-0.250
Token fruits
Token ID15921
Feature activation-6.85e+01

Pos logit contributions
ishing+0.209
pieces+0.195
ipping+0.191
ap+0.187
ocol+0.187
Neg logit contributions
Don-0.191
 She-0.181
Me-0.171
chy-0.168
Grand-0.166
Token and
Token ID290
Feature activation-4.74e+01

Pos logit contributions
ishing+0.231
pieces+0.213
ap+0.210
ipping+0.204
ish+0.204
Neg logit contributions
Don-0.344
Me-0.314
Grand-0.302
Wow-0.293
chy-0.293
Token shapes
Token ID15268
Feature activation-6.85e+01

Pos logit contributions
ishing+0.224
ipping+0.218
ooming+0.208
ap+0.206
immers+0.198
Neg logit contributions
Don-0.185
 She-0.182
 needed-0.169
Me-0.164
chy-0.164
Token.
Token ID13
Feature activation-3.27e+01

Pos logit contributions
ishing+0.324
pieces+0.307
ipping+0.296
ocol+0.295
ap+0.292
Neg logit contributions
Don-0.243
 She-0.235
chy-0.219
Me-0.216
Grand-0.210
Token One
Token ID1881
Feature activation-4.16e+01

Pos logit contributions
pieces+0.172
ap+0.166
 how+0.162
ocol+0.161
 Majesty+0.156
Neg logit contributions
chy-0.089
Don-0.084
 woke-0.069
Me-0.069
 meant-0.068
Token day
Token ID1110
Feature activation-4.61e+01

Pos logit contributions
ap+0.199
ishing+0.195
ish+0.192
pieces+0.188
 victory+0.185
Neg logit contributions
Don-0.152
Grand-0.128
Well-0.120
 She-0.117
Doctor-0.116
Token,
Token ID11
Feature activation-3.75e+01

Pos logit contributions
ishing+0.168
pieces+0.159
ipping+0.149
ap+0.148
ocol+0.147
Neg logit contributions
Don-0.220
 She-0.206
Me-0.199
chy-0.196
Grand-0.195
Token Tim
Token ID5045
Feature activation-4.88e+01

Pos logit contributions
ishing+0.212
pieces+0.202
ipping+0.196
ap+0.195
ocol+0.194
Neg logit contributions
Don-0.108
 She-0.097
Me-0.091
Grand-0.086
chy-0.085
Token he
Token ID339
Feature activation-8.69e+01

Pos logit contributions
Grand+0.209
Don+0.195
Doctor+0.194
jar+0.185
Mrs+0.182
Neg logit contributions
 how-0.196
ishing-0.181
ish-0.172
ieve-0.172
 up-0.162
Token could
Token ID714
Feature activation-7.62e+01

Pos logit contributions
ishing+0.477
pieces+0.463
ap+0.452
 able+0.437
ipping+0.437
Neg logit contributions
Don-0.241
 She-0.211
Me-0.205
Grand-0.201
chy-0.189
Token play
Token ID711
Feature activation-7.80e+01

Pos logit contributions
ishing+0.360
pieces+0.330
ocol+0.327
ipping+0.316
ish+0.295
Neg logit contributions
 She-0.306
chy-0.288
Don-0.287
 needed-0.273
 He-0.260
Token with
Token ID351
Feature activation-6.78e+01

Pos logit contributions
ishing+0.345
ocol+0.318
pieces+0.317
ipping+0.306
ric+0.303
Neg logit contributions
Don-0.312
 She-0.300
chy-0.287
Me-0.283
books-0.269
Token them
Token ID606
Feature activation-7.14e+01

Pos logit contributions
ishing+0.325
ap+0.306
ipping+0.303
pieces+0.297
ish+0.291
Neg logit contributions
Don-0.233
 She-0.230
Me-0.220
chy-0.209
 needed-0.203
Token too
Token ID1165
Feature activation-6.94e+01

Pos logit contributions
ishing+0.284
pieces+0.271
ap+0.258
ipping+0.254
ieve+0.248
Neg logit contributions
Don-0.317
 She-0.287
Me-0.286
Grand-0.280
Doctor-0.272
Token.
Token ID13
Feature activation-4.42e+01

Pos logit contributions
ishing+0.220
 able+0.210
ap+0.208
 victory+0.208
 how+0.197
Neg logit contributions
Don-0.358
Me-0.331
Doctor-0.313
Wow-0.309
Grand-0.308
Token They
Token ID1119
Feature activation-5.01e+01

Pos logit contributions
pieces+0.235
ap+0.227
 how+0.217
ishing+0.214
ocol+0.213
Neg logit contributions
Don-0.118
chy-0.118
Me-0.097
Grand-0.096
 meant-0.093
Token put
Token ID1234
Feature activation-5.83e+01

Pos logit contributions
pieces+0.219
ishing+0.209
ap+0.207
 how+0.207
ish+0.203
Neg logit contributions
Don-0.188
Grand-0.167
 She-0.163
Me-0.163
Doctor-0.160
Token the
Token ID262
Feature activation-5.52e+01

Pos logit contributions
ishing+0.228
pieces+0.216
ipping+0.208
ap+0.206
ieve+0.202
Neg logit contributions
Don-0.256
 She-0.241
Me-0.236
chy-0.224
Wow-0.222
Token ship
Token ID4074
Feature activation-7.06e+01

Pos logit contributions
ishing+0.233
pieces+0.218
ipping+0.211
ap+0.209
 able+0.206
Neg logit contributions
Don-0.227
Me-0.210
chy-0.209
 She-0.205
Grand-0.194
Token him
Token ID683
Feature activation-8.41e+01

Pos logit contributions
ishing+0.341
pieces+0.317
ish+0.308
ipping+0.302
ap+0.300
Neg logit contributions
Don-0.344
 She-0.332
Me-0.309
Grand-0.307
Doctor-0.297
Token,
Token ID11
Feature activation-6.27e+01

Pos logit contributions
ishing+0.374
pieces+0.356
ocol+0.341
ipping+0.340
ap+0.336
Neg logit contributions
Don-0.328
 She-0.306
chy-0.292
Me-0.287
Grand-0.282
Token even
Token ID772
Feature activation-7.75e+01

Pos logit contributions
ishing+0.282
pieces+0.268
ipping+0.256
ap+0.254
ocol+0.254
Neg logit contributions
Don-0.246
 She-0.227
Me-0.217
chy-0.213
Grand-0.211
Token if
Token ID611
Feature activation-6.22e+01

Pos logit contributions
ishing+0.374
ipping+0.369
ap+0.362
pieces+0.361
ocol+0.357
Neg logit contributions
Don-0.286
 She-0.250
 meant-0.239
 needed-0.238
chy-0.236
Token it
Token ID340
Feature activation-6.96e+01

Pos logit contributions
ishing+0.374
pieces+0.356
ipping+0.346
ocol+0.340
ap+0.339
Neg logit contributions
Don-0.165
 She-0.145
Me-0.131
 needed-0.127
chy-0.123
Token meant
Token ID4001
Feature activation-6.68e+01

Pos logit contributions
ishing+0.347
pieces+0.336
ap+0.330
ish+0.324
 victory+0.321
Neg logit contributions
Don-0.218
Grand-0.199
 She-0.184
Me-0.182
Doctor-0.180
Token wearing
Token ID5762
Feature activation-6.98e+01

Pos logit contributions
ishing+0.273
pieces+0.255
ipping+0.245
ocol+0.240
ap+0.238
Neg logit contributions
Don-0.291
 She-0.275
chy-0.262
Me-0.259
Grand-0.251
Token a
Token ID257
Feature activation-6.62e+01

Pos logit contributions
ishing+0.280
pieces+0.268
ipping+0.266
ap+0.259
ooming+0.252
Neg logit contributions
Don-0.295
 She-0.280
chy-0.270
Me-0.269
Grand-0.255
Token collar
Token ID19908
Feature activation-7.76e+01

Pos logit contributions
ishing+0.308
pieces+0.284
ipping+0.279
ocol+0.273
ieve+0.273
Neg logit contributions
Don-0.244
chy-0.231
Me-0.228
 She-0.227
Grand-0.224
Token.
Token ID13
Feature activation-4.07e+01

Pos logit contributions
ishing+0.248
pieces+0.237
 able+0.229
ap+0.226
ipping+0.222
Neg logit contributions
Don-0.390
Me-0.368
Grand-0.349
Doctor-0.348
Wow-0.342
Token
Token ID220
Feature activation-6.14e+01

Pos logit contributions
pieces+0.211
ap+0.204
ishing+0.195
ocol+0.194
 how+0.192
Neg logit contributions
chy-0.115
Don-0.113
Me-0.097
books-0.092
Grand-0.091
Token When
Token ID1649
Feature activation-5.91e+01

Pos logit contributions
pieces+0.286
ap+0.282
 how+0.269
ocol+0.263
ishing+0.254
Neg logit contributions
chy-0.142
Don-0.130
books-0.112
Me-0.110
Doctor-0.107
Token Daisy
Token ID40355
Feature activation-7.48e+01

Pos logit contributions
ishing+0.301
pieces+0.292
ap+0.281
ipping+0.278
ocol+0.278
Neg logit contributions
Don-0.191
 She-0.169
Me-0.164
chy-0.161
Grand-0.159
Token put
Token ID1234
Feature activation-6.70e+01

Pos logit contributions
ishing+0.326
ish+0.313
ipping+0.307
ap+0.304
pieces+0.303
Neg logit contributions
Don-0.282
Wow-0.251
Grand-0.250
Me-0.250
books-0.237
Token the
Token ID262
Feature activation-6.46e+01

Pos logit contributions
ishing+0.285
pieces+0.273
ap+0.261
ipping+0.258
ocol+0.254
Neg logit contributions
Don-0.274
 She-0.261
Me-0.247
chy-0.241
Grand-0.234
Token magn
Token ID7842
Feature activation-8.50e+01

Pos logit contributions
ishing+0.272
pieces+0.255
ipping+0.245
ap+0.245
 able+0.239
Neg logit contributions
Don-0.265
 She-0.244
Me-0.244
chy-0.241
Grand-0.231
Tokenifying
Token ID4035
Feature activation-1.28e+02

Pos logit contributions
ishing+0.243
pieces+0.224
ap+0.206
ocol+0.206
ipping+0.206
Neg logit contributions
Don-0.465
 She-0.448
chy-0.429
Me-0.425
Grand-0.421
Token glass
Token ID5405
Feature activation-8.37e+01

Pos logit contributions
ishing+0.674
pieces+0.645
ieve+0.628
ipping+0.617
immers+0.593
Neg logit contributions
 She-0.423
Don-0.402
Me-0.378
chy-0.359
Grand-0.352
Token down
Token ID866
Feature activation-7.07e+01

Pos logit contributions
ishing+0.280
pieces+0.255
ap+0.250
ipping+0.245
ish+0.241
Neg logit contributions
Don-0.431
Me-0.390
Grand-0.384
 She-0.371
Well-0.369
Token,
Token ID11
Feature activation-5.28e+01

Pos logit contributions
ishing+0.215
ocol+0.201
ap+0.200
pieces+0.199
ipping+0.194
Neg logit contributions
Don-0.364
Grand-0.341
chy-0.326
Me-0.325
Doctor-0.323
Token the
Token ID262
Feature activation-5.91e+01

Pos logit contributions
ishing+0.376
pieces+0.349
ipping+0.349
ocol+0.347
ap+0.346
Neg logit contributions
Don-0.089
 She-0.080
Me-0.062
chy-0.055
 needed-0.049
Token cater
Token ID20825
Feature activation-8.40e+01

Pos logit contributions
ishing+0.244
pieces+0.235
ocol+0.222
ap+0.222
ipping+0.220
Neg logit contributions
Don-0.247
 She-0.226
Me-0.215
Grand-0.215
chy-0.212
Token children
Token ID1751
Feature activation-9.13e+01

Pos logit contributions
Don+0.164
 She+0.147
Me+0.146
books+0.139
Doctor+0.139
Neg logit contributions
ishing-0.275
 able-0.266
pieces-0.258
 how-0.257
ap-0.253
Token put
Token ID1234
Feature activation-8.04e+01

Pos logit contributions
ishing+0.437
pieces+0.419
ap+0.401
ipping+0.400
ocol+0.396
Neg logit contributions
Don-0.325
 She-0.295
Me-0.285
chy-0.278
Grand-0.276
Token on
Token ID319
Feature activation-7.97e+01

Pos logit contributions
ishing+0.292
pieces+0.274
ipping+0.260
ap+0.258
ocol+0.256
Neg logit contributions
Don-0.379
 She-0.356
Me-0.345
chy-0.339
Grand-0.333
Token their
Token ID511
Feature activation-8.80e+01

Pos logit contributions
ishing+0.332
pieces+0.301
ipping+0.300
ap+0.291
ieve+0.289
Neg logit contributions
Don-0.337
 She-0.327
Me-0.313
Grand-0.299
chy-0.297
Token bath
Token ID7837
Feature activation-9.44e+01

Pos logit contributions
ishing+0.316
pieces+0.294
ipping+0.282
ap+0.277
ocol+0.274
Neg logit contributions
Don-0.417
 She-0.392
Me-0.386
chy-0.381
Grand-0.371
Tokenro
Token ID305
Feature activation-1.34e+02

Pos logit contributions
ishing+0.475
pieces+0.450
ap+0.436
ipping+0.435
ocol+0.427
Neg logit contributions
Don-0.322
 She-0.287
Me-0.278
Grand-0.269
chy-0.268
Tokenbes
Token ID12636
Feature activation-1.05e+02

Pos logit contributions
pieces+0.671
ishing+0.648
ocol+0.634
ieve+0.623
ish+0.596
Neg logit contributions
Don-0.428
chy-0.414
 She-0.409
books-0.390
Me-0.381
Token and
Token ID290
Feature activation-6.34e+01

Pos logit contributions
ishing+0.335
ap+0.313
ipping+0.297
pieces+0.296
ish+0.290
Neg logit contributions
Don-0.548
Me-0.522
Well-0.480
Grand-0.480
Doctor-0.480
Token ran
Token ID4966
Feature activation-6.99e+01

Pos logit contributions
ishing+0.306
pieces+0.293
ap+0.281
ipping+0.279
ocol+0.275
Neg logit contributions
Don-0.223
 She-0.201
Me-0.197
chy-0.190
Grand-0.188
Token around
Token ID1088
Feature activation-6.81e+01

Pos logit contributions
ishing+0.245
pieces+0.234
ocol+0.228
ipping+0.217
ieve+0.215
Neg logit contributions
Don-0.341
 She-0.328
chy-0.309
Me-0.306
Grand-0.303
Token in
Token ID287
Feature activation-6.45e+01

Pos logit contributions
ishing+0.218
pieces+0.206
ap+0.201
ipping+0.193
 victory+0.190
Neg logit contributions
Don-0.350
 She-0.317
Grand-0.315
Me-0.314
chy-0.314
Token he
Token ID339
Feature activation-8.01e+01

Pos logit contributions
ishing+0.478
pieces+0.460
ap+0.442
ipping+0.440
ocol+0.438
Neg logit contributions
Don-0.303
 She-0.269
Me-0.261
chy-0.256
Grand-0.253
Token would
Token ID561
Feature activation-7.24e+01

Pos logit contributions
ishing+0.431
pieces+0.424
ap+0.416
ish+0.407
 able+0.402
Neg logit contributions
Don-0.221
Grand-0.186
Me-0.184
 She-0.182
Wow-0.176
Token take
Token ID1011
Feature activation-7.08e+01

Pos logit contributions
ishing+0.308
pieces+0.278
ipping+0.274
ocol+0.272
ooming+0.255
Neg logit contributions
 She-0.308
Don-0.306
chy-0.291
Me-0.279
 needed-0.275
Token the
Token ID262
Feature activation-7.23e+01

Pos logit contributions
ishing+0.299
pieces+0.274
ipping+0.272
ric+0.265
ap+0.264
Neg logit contributions
Don-0.288
 She-0.284
Me-0.262
chy-0.257
Grand-0.251
Token av
Token ID1196
Feature activation-1.31e+02

Pos logit contributions
ishing+0.289
pieces+0.275
ipping+0.260
ap+0.259
ocol+0.258
Neg logit contributions
Don-0.316
 She-0.293
Me-0.284
chy-0.278
Grand-0.277
Tokenoc
Token ID420
Feature activation-1.31e+02

Pos logit contributions
ieve+0.695
ishing+0.687
ipping+0.686
ap+0.684
pieces+0.677
Neg logit contributions
Don-0.311
Doctor-0.308
 She-0.302
Grand-0.285
M-0.270
Tokenados
Token ID22484
Feature activation-9.19e+01

Pos logit contributions
ieve+0.396
pieces+0.394
ishing+0.385
ap+0.383
latable+0.378
Neg logit contributions
 She-0.577
Doctor-0.576
Don-0.566
Grand-0.549
ho-0.539
Token and
Token ID290
Feature activation-6.05e+01

Pos logit contributions
ishing+0.324
pieces+0.305
ap+0.286
ipping+0.286
ocol+0.283
Neg logit contributions
Don-0.445
 She-0.414
Me-0.405
chy-0.397
Grand-0.395
Token scatter
Token ID41058
Feature activation-7.27e+01

Pos logit contributions
ishing+0.266
ipping+0.247
pieces+0.239
ap+0.238
ooming+0.237
Neg logit contributions
Don-0.248
 She-0.237
Me-0.222
 needed-0.222
chy-0.218
Token them
Token ID606
Feature activation-6.94e+01

Pos logit contributions
ishing+0.227
pieces+0.210
ipping+0.202
ap+0.199
ieve+0.192
Neg logit contributions
Don-0.380
 She-0.363
Me-0.348
chy-0.342
Grand-0.335
Token around
Token ID1088
Feature activation-6.73e+01

Pos logit contributions
ishing+0.232
pieces+0.218
ipping+0.204
ap+0.203
ocol+0.203
Neg logit contributions
Don-0.347
 She-0.326
Me-0.316
chy-0.315
Grand-0.309
Token her
Token ID607
Feature activation-1.05e+02

Pos logit contributions
Don+0.169
 She+0.154
Me+0.147
chy+0.145
Grand+0.143
Neg logit contributions
ishing-0.214
pieces-0.204
ipping-0.197
ap-0.196
ocol-0.195
Token mom
Token ID1995
Feature activation-8.93e+01

Pos logit contributions
ishing+0.430
pieces+0.403
ipping+0.390
ocol+0.382
ieve+0.379
Neg logit contributions
Don-0.445
 She-0.424
Me-0.407
chy-0.406
Grand-0.391
Tokenmy
Token ID1820
Feature activation-9.61e+01

Pos logit contributions
ishing+0.404
pieces+0.392
ap+0.378
ipping+0.370
ish+0.364
Neg logit contributions
Don-0.341
Me-0.307
 She-0.293
chy-0.288
Grand-0.288
Token watched
Token ID7342
Feature activation-7.84e+01

Pos logit contributions
ishing+0.432
pieces+0.407
ap+0.401
ipping+0.394
ish+0.392
Neg logit contributions
Don-0.372
Me-0.341
Grand-0.321
Wow-0.320
 She-0.313
Token in
Token ID287
Feature activation-7.16e+01

Pos logit contributions
ishing+0.304
pieces+0.285
ipping+0.272
ap+0.269
ocol+0.266
Neg logit contributions
Don-0.351
 She-0.333
Me-0.315
chy-0.313
Grand-0.312
Token amaz
Token ID40642
Feature activation-1.27e+02

Pos logit contributions
ishing+0.295
pieces+0.278
ipping+0.265
ap+0.263
ish+0.262
Neg logit contributions
Don-0.309
 She-0.294
Me-0.276
Grand-0.269
chy-0.265
Tokenement
Token ID972
Feature activation-8.91e+01

Pos logit contributions
ric+0.596
 dam+0.554
 prisoners+0.553
immers+0.550
pieces+0.546
Neg logit contributions
 She-0.441
 said-0.359
Well-0.354
Grand-0.353
books-0.352
Token as
Token ID355
Feature activation-7.10e+01

Pos logit contributions
ishing+0.404
pieces+0.381
ocol+0.374
ieve+0.367
ipping+0.365
Neg logit contributions
 She-0.330
Don-0.324
chy-0.298
Grand-0.289
Me-0.282
Token the
Token ID262
Feature activation-6.97e+01

Pos logit contributions
ishing+0.340
pieces+0.333
ap+0.315
ocol+0.314
ipping+0.312
Neg logit contributions
Don-0.248
 She-0.220
Me-0.215
chy-0.213
Grand-0.209
Token machine
Token ID4572
Feature activation-7.92e+01

Pos logit contributions
ishing+0.291
pieces+0.274
ipping+0.262
ap+0.262
ocol+0.259
Neg logit contributions
Don-0.294
 She-0.272
Me-0.265
chy-0.259
Grand-0.257
Token flew
Token ID13112
Feature activation-7.41e+01

Pos logit contributions
pieces+0.349
ishing+0.347
ap+0.336
 how+0.331
ipping+0.326
Neg logit contributions
Don-0.304
Me-0.273
Grand-0.267
Doctor-0.265
Wow-0.260
Token't
Token ID470
Feature activation-9.02e+01

Pos logit contributions
Don+0.223
 She+0.205
Me+0.201
chy+0.198
Grand+0.196
Neg logit contributions
ishing-0.181
pieces-0.173
ap-0.163
ipping-0.163
ocol-0.162
Token know
Token ID760
Feature activation-7.94e+01

Pos logit contributions
ishing+0.462
ipping+0.432
ocol+0.423
pieces+0.413
ooming+0.400
Neg logit contributions
Don-0.321
 She-0.313
 needed-0.299
chy-0.296
 meant-0.294
Token the
Token ID262
Feature activation-7.50e+01

Pos logit contributions
ishing+0.257
ipping+0.227
pieces+0.224
ap+0.212
ocol+0.210
Neg logit contributions
Don-0.412
 She-0.407
chy-0.379
Me-0.378
 needed-0.371
Token rh
Token ID9529
Feature activation-1.15e+02

Pos logit contributions
ishing+0.333
pieces+0.316
ipping+0.303
ap+0.301
ocol+0.298
Neg logit contributions
Don-0.297
 She-0.272
Me-0.264
chy-0.261
Grand-0.255
Tokenin
Token ID259
Feature activation-9.92e+01

Pos logit contributions
pieces+0.434
ishing+0.428
latable+0.421
 Majesty+0.416
 able+0.401
Neg logit contributions
Don-0.462
 needed-0.433
Doctor-0.431
Me-0.425
Well-0.409
Tokenoc
Token ID420
Feature activation-1.31e+02

Pos logit contributions
ishing+0.387
pieces+0.373
ish+0.358
 victory+0.347
ap+0.347
Neg logit contributions
Don-0.433
 She-0.406
Grand-0.385
Me-0.378
chy-0.376
Tokeneros
Token ID27498
Feature activation-9.15e+01

Pos logit contributions
pieces+0.392
 able+0.358
ap+0.348
ieve+0.344
ishing+0.342
Neg logit contributions
Don-0.610
ho-0.610
Doctor-0.596
 She-0.585
Me-0.580
Token was
Token ID373
Feature activation-7.39e+01

Pos logit contributions
ishing+0.490
pieces+0.479
ap+0.458
 able+0.455
ish+0.452
Neg logit contributions
Don-0.265
 She-0.231
Grand-0.230
Me-0.230
chy-0.212
Token wild
Token ID4295
Feature activation-7.49e+01

Pos logit contributions
ishing+0.315
pieces+0.300
ipping+0.289
ap+0.284
ocol+0.281
Neg logit contributions
Don-0.309
 She-0.286
chy-0.280
Me-0.271
Grand-0.268
Token.
Token ID13
Feature activation-4.69e+01

Pos logit contributions
pieces+0.363
ishing+0.361
ocol+0.346
ipping+0.337
ieve+0.336
Neg logit contributions
 She-0.258
Don-0.253
chy-0.238
 meant-0.227
 needed-0.222
Token Can
Token ID1680
Feature activation-5.58e+01

Pos logit contributions
ap+0.218
pieces+0.218
 how+0.211
ric+0.206
ocol+0.200
Neg logit contributions
chy-0.156
Don-0.134
books-0.123
Me-0.117
 rhy-0.116