TOP ACTIVATIONS
MAX = 56.833

't know the rh in oc eros was wild . Can
am taking the rh in oc eros to a new home
children put on their bath ro bes and ran around in
his head out of his bur row and saw something new
he would take the av oc ados and scatter them around
harder to push the wheel bar row . But her backyard
reward . It was ch ocol ates and sweets !
give to the rh in oc eros . Lily walked to
quiet compared to the noisy se ag ull s outside .
as he opened the ch ocol ates and started eating them
giving him too many ch ocol ates ." The king learned
it to the rh in oc eros . When he tasted
that the bad rh in oc eros was now nice .
you go , my d arl ings . Enjoy your lunch
who lived in his cozy bur row . He loved to
there was a rh in oc eros who was always naughty
Everyday he would take the av oc ados and scatter them
She picks up the re ind eer . She looks at
able to escape from his bur row and play in the
mist and into their cozy bur rows . The end .

MIN ACTIVATIONS
MIN = -22.596

J ill likes to dance . She dances in her room
to play with their toys . They have cars , dolls
to play with his drum . He has a big drum
to play with his lizard . His lizard is red and
Can you spin for me ?" Spot spun around and around
like to play with gears . They have many gears of
play golf in the park . They had a small stick
liked to play with snow . They made a big snow
playing with paper and scissors . They liked to cut out
Tom and Sam are brothers . They like to play with
jump in p udd les . She wore her yellow boots
<|endoftext|> Todd was so excited ! His mom had just told
liked to learn new words . She had a book with
what that land is like ?". John began to
to swim in the pool . She has a pink cup
like to play with cars . They have many cars of
are playing with paper planes . They make them from old
ily liked to design things . She had many papers and
Ben and Lily are twins . They share a room and
were playing in the snow . They made a big snow

INTERVAL 36.976 - 56.833
CONTAINS 0.013%

flavors they liked and what topp ings would go best .
placed it onto the wheel bar row . John
he would take the av oc ados and scatter them around
there was a rh in oc eros who was always naughty
couldn 't find any av oc ados . Finally , she

INTERVAL 17.119 - 36.976
CONTAINS 4.028%

in a row along the windows ill . One day ,
He pointed to a small jar with a label that said
cave . They take the wolf with them . They go
She started to demand the umbrella . " Please , mom
't do it . My sc ooter is not good for

INTERVAL -2.739 - 17.119
CONTAINS 54.779%

Tim went back to the oct opus and gave it
there was a very special kid . His name was Joe
sausage and butterfly flew around the sky , enjoying the light
and angry . They want their coins back . They want
again . The animals were happy and thanked Lily for her

INTERVAL -22.596 - -2.739
CONTAINS 41.180%

shooter . <|endoftext|> Once upon a time , there was a
a little girl . She was very curious , and liked
much fun it was to be the mail man !
. What a wonderful day for the lion ! <|endoftext|> Once
Who ’ s there ? ”

Token't
Token ID470
Feature activation+1.58e+01

Pos logit contributions
Sorry+0.138
Listen+0.137
Kids+0.136
Don+0.132
Sure+0.131
Neg logit contributions
et-0.107
an-0.105
lers-0.101
ish-0.101
ig-0.097
Token know
Token ID760
Feature activation+1.77e+01

Pos logit contributions
et+0.192
lers+0.188
an+0.188
ig+0.186
ish+0.186
Neg logit contributions
Sorry-0.118
Listen-0.117
Kids-0.115
Don-0.112
Dad-0.110
Token the
Token ID262
Feature activation+1.84e+01

Pos logit contributions
an+0.211
et+0.202
ish+0.201
ig+0.195
lers+0.190
Neg logit contributions
Listen-0.133
Kids-0.132
Don-0.129
Sorry-0.129
 Something-0.126
Token rh
Token ID9529
Feature activation+4.14e+01

Pos logit contributions
cks+0.173
ig+0.157
et+0.156
lers+0.150
igs+0.149
Neg logit contributions
Hi-0.193
Listen-0.192
Don-0.190
Sorry-0.185
Sure-0.183
Tokenin
Token ID259
Feature activation+3.04e+01

Pos logit contributions
et+0.387
 cob+0.351
cks+0.338
ps+0.333
 egg+0.332
Neg logit contributions
Sorry-0.433
Dad-0.400
Listen-0.397
By-0.397
He-0.395
Tokenoc
Token ID420
Feature activation+4.76e+01

Pos logit contributions
cks+0.257
igs+0.254
et+0.243
ig+0.241
ps+0.235
Neg logit contributions
Sure-0.329
Sorry-0.328
Don-0.325
Hi-0.322
Um-0.320
Tokeneros
Token ID27498
Feature activation+2.89e+01

Pos logit contributions
cks+0.303
 cob+0.301
igs+0.289
ery+0.289
 web+0.289
Neg logit contributions
Sorry-0.585
Sure-0.535
Um-0.526
Dad-0.526
Hi-0.521
Token was
Token ID373
Feature activation+1.56e+01

Pos logit contributions
et+0.342
ig+0.334
an+0.330
ish+0.330
lers+0.328
Neg logit contributions
Sorry-0.216
Listen-0.213
Kids-0.212
Don-0.209
Hi-0.201
Token wild
Token ID4295
Feature activation+2.30e+01

Pos logit contributions
an+0.175
ish+0.173
ig+0.172
lers+0.160
et+0.158
Neg logit contributions
Don-0.132
Listen-0.129
Sorry-0.124
Thanks-0.121
Um-0.121
Token.
Token ID13
Feature activation+3.38e+00

Pos logit contributions
cks+0.238
ig+0.231
igs+0.229
et+0.221
ery+0.220
Neg logit contributions
Hi-0.198
Don-0.191
Kids-0.185
Listen-0.184
Come-0.179
Token Can
Token ID1680
Feature activation+7.31e+00

Pos logit contributions
et+0.040
 egg+0.039
an+0.039
lers+0.038
ish+0.038
Neg logit contributions
Sorry-0.024
Listen-0.024
Kids-0.023
By-0.021
Don-0.021
Token am
Token ID716
Feature activation+1.32e+01

Pos logit contributions
et+0.136
an+0.131
lers+0.131
ish+0.130
ig+0.127
Neg logit contributions
Sorry-0.112
Kids-0.111
Listen-0.111
Don-0.107
Sure-0.104
Token taking
Token ID2263
Feature activation+1.99e+01

Pos logit contributions
an+0.156
et+0.152
lers+0.148
ish+0.148
ig+0.146
Neg logit contributions
Sorry-0.101
Kids-0.100
Listen-0.100
Um-0.099
Don-0.097
Token the
Token ID262
Feature activation+2.05e+01

Pos logit contributions
an+0.218
et+0.213
ish+0.212
ig+0.209
 egg+0.199
Neg logit contributions
Listen-0.171
Sorry-0.165
Kids-0.164
Hi-0.162
Don-0.162
Token rh
Token ID9529
Feature activation+3.91e+01

Pos logit contributions
cks+0.200
et+0.172
ig+0.160
an+0.160
igs+0.160
Neg logit contributions
Sure-0.205
Hi-0.203
Listen-0.200
Kids-0.197
No-0.195
Tokenin
Token ID259
Feature activation+3.23e+01

Pos logit contributions
et+0.387
 cob+0.354
ps+0.342
 egg+0.327
cks+0.327
Neg logit contributions
Sorry-0.388
Dad-0.367
By-0.361
He-0.356
Jack-0.354
Tokenoc
Token ID420
Feature activation+4.68e+01

Pos logit contributions
cks+0.310
igs+0.304
ps+0.300
et+0.291
els+0.290
Neg logit contributions
Sure-0.311
Sorry-0.301
Dad-0.293
By-0.293
No-0.288
Tokeneros
Token ID27498
Feature activation+3.04e+01

Pos logit contributions
 cob+0.328
 web+0.309
cks+0.308
ery+0.303
eh+0.299
Neg logit contributions
Sorry-0.544
Dad-0.500
Sure-0.483
Um-0.473
His-0.469
Token to
Token ID284
Feature activation+1.08e+01

Pos logit contributions
et+0.270
cks+0.264
ig+0.259
igs+0.255
an+0.253
Neg logit contributions
Sorry-0.298
Listen-0.285
Sure-0.284
Don-0.284
By-0.278
Token a
Token ID257
Feature activation+1.76e+01

Pos logit contributions
lers+0.117
ish+0.116
et+0.115
ig+0.115
an+0.113
Neg logit contributions
Listen-0.094
Dad-0.093
Sorry-0.091
Kids-0.090
By-0.090
Token new
Token ID649
Feature activation+2.28e+01

Pos logit contributions
cks+0.162
et+0.161
lers+0.157
ps+0.156
ish+0.153
Neg logit contributions
Sure-0.182
Listen-0.176
Sorry-0.176
Dad-0.175
Exc-0.175
Token home
Token ID1363
Feature activation+2.15e+01

Pos logit contributions
cks+0.227
ps+0.214
aces+0.210
lers+0.206
et+0.205
Neg logit contributions
Dad-0.221
Sure-0.217
So-0.213
Hi-0.209
On-0.207
Token children
Token ID1751
Feature activation+2.28e+01

Pos logit contributions
Sorry+0.103
Listen+0.102
Kids+0.101
Don+0.096
Sure+0.095
Neg logit contributions
et-0.140
lers-0.135
an-0.134
ish-0.134
ig-0.132
Token put
Token ID1234
Feature activation+2.19e+01

Pos logit contributions
et+0.223
an+0.223
 egg+0.217
ig+0.216
ish+0.215
Neg logit contributions
Sorry-0.216
Kids-0.215
Listen-0.209
Don-0.205
Hi-0.202
Token on
Token ID319
Feature activation+1.50e+01

Pos logit contributions
an+0.213
ig+0.201
cks+0.196
ish+0.191
iflower+0.190
Neg logit contributions
Kids-0.199
Listen-0.192
Dad-0.191
Sure-0.190
Hi-0.190
Token their
Token ID511
Feature activation+2.20e+01

Pos logit contributions
an+0.178
ish+0.176
ig+0.175
et+0.174
lers+0.166
Neg logit contributions
Kids-0.115
Listen-0.111
Sure-0.109
Dad-0.109
Don-0.106
Token bath
Token ID7837
Feature activation+2.99e+01

Pos logit contributions
et+0.173
cks+0.171
ish+0.167
an+0.163
ig+0.163
Neg logit contributions
Kids-0.235
Sorry-0.233
Dad-0.231
Hi-0.230
Listen-0.227
Tokenro
Token ID305
Feature activation+4.64e+01

Pos logit contributions
cks+0.264
et+0.253
 egg+0.248
ish+0.240
an+0.238
Neg logit contributions
Kids-0.329
Sorry-0.307
Sure-0.299
Dad-0.298
Listen-0.297
Tokenbes
Token ID12636
Feature activation+2.98e+01

Pos logit contributions
ish+0.422
 egg+0.385
ishes+0.374
 axe+0.373
 matches+0.371
Neg logit contributions
His-0.470
Kids-0.446
When-0.436
Dad-0.434
He-0.423
Token and
Token ID290
Feature activation+1.08e+01

Pos logit contributions
cks+0.278
an+0.277
ig+0.264
et+0.262
igs+0.260
Neg logit contributions
Sorry-0.293
Listen-0.289
Sure-0.288
Don-0.286
Kids-0.285
Token ran
Token ID4966
Feature activation+1.69e+01

Pos logit contributions
an+0.123
et+0.123
ish+0.119
ig+0.117
lers+0.116
Neg logit contributions
Kids-0.091
Sorry-0.090
Listen-0.089
Don-0.086
Dad-0.084
Token around
Token ID1088
Feature activation+1.52e+01

Pos logit contributions
et+0.141
an+0.139
ish+0.138
 egg+0.135
lers+0.134
Neg logit contributions
Kids-0.192
Listen-0.190
Sorry-0.189
Sure-0.183
Don-0.182
Token in
Token ID287
Feature activation+1.33e+01

Pos logit contributions
an+0.149
ish+0.146
et+0.145
lers+0.139
ig+0.138
Neg logit contributions
Kids-0.153
Listen-0.150
Don-0.148
Sure-0.146
Sorry-0.146
Token his
Token ID465
Feature activation+2.60e+01

Pos logit contributions
Don+0.046
Listen+0.046
What+0.045
 Something+0.044
When+0.043
Neg logit contributions
et-0.031
ool-0.029
opl-0.028
rob-0.028
cut-0.027
Token head
Token ID1182
Feature activation+2.89e+01

Pos logit contributions
et+0.227
cks+0.216
ig+0.212
ish+0.211
an+0.210
Neg logit contributions
Dad-0.256
Sorry-0.248
Hi-0.247
Sure-0.246
Um-0.244
Token out
Token ID503
Feature activation+2.47e+01

Pos logit contributions
an+0.241
cks+0.236
et+0.235
igs+0.232
ish+0.232
Neg logit contributions
Sorry-0.314
Listen-0.304
Don-0.303
Dad-0.298
Hi-0.297
Token of
Token ID286
Feature activation+1.99e+01

Pos logit contributions
an+0.244
ish+0.241
et+0.238
ig+0.229
pp+0.220
Neg logit contributions
Listen-0.229
Sorry-0.228
Dad-0.226
Don-0.225
He-0.224
Token his
Token ID465
Feature activation+2.35e+01

Pos logit contributions
an+0.243
ish+0.239
ig+0.226
et+0.225
pp+0.222
Neg logit contributions
Listen-0.153
Dad-0.153
Kids-0.150
Don-0.147
Hi-0.143
Token bur
Token ID4356
Feature activation+4.64e+01

Pos logit contributions
et+0.192
cks+0.181
pp+0.175
an+0.173
ish+0.166
Neg logit contributions
Dad-0.247
Sorry-0.241
Hi-0.237
Um-0.234
 Something-0.234
Tokenrow
Token ID808
Feature activation+3.31e+01

Pos logit contributions
 globe+0.366
 matches+0.350
 image+0.347
cks+0.347
 gir+0.346
Neg logit contributions
He-0.451
His-0.426
When-0.414
Dad-0.405
Don-0.400
Token and
Token ID290
Feature activation+1.11e+01

Pos logit contributions
cks+0.312
ish+0.279
et+0.278
an+0.276
ig+0.273
Neg logit contributions
He-0.330
His-0.313
Dad-0.306
Sorry-0.303
Don-0.302
Token saw
Token ID2497
Feature activation+1.49e+01

Pos logit contributions
et+0.131
an+0.130
ish+0.129
ig+0.127
lers+0.124
Neg logit contributions
Sorry-0.087
Don-0.086
Listen-0.085
Kids-0.084
Dad-0.081
Token something
Token ID1223
Feature activation+1.81e+01

Pos logit contributions
ig+0.171
ish+0.170
an+0.170
et+0.168
pp+0.158
Neg logit contributions
Don-0.118
Listen-0.115
Kids-0.112
Sorry-0.110
Sure-0.110
Token new
Token ID649
Feature activation+2.04e+01

Pos logit contributions
et+0.172
an+0.169
ish+0.169
ig+0.166
cks+0.159
Neg logit contributions
Listen-0.181
Don-0.179
Sorry-0.179
Kids-0.176
Sure-0.172
Token he
Token ID339
Feature activation+1.62e+01

Pos logit contributions
et+0.242
an+0.236
ish+0.234
lers+0.230
ig+0.228
Neg logit contributions
Sorry-0.149
Kids-0.149
Listen-0.148
Don-0.141
Dad-0.138
Token would
Token ID561
Feature activation+1.08e+01

Pos logit contributions
et+0.203
an+0.202
ish+0.199
lers+0.198
cks+0.197
Neg logit contributions
Sorry-0.106
Listen-0.105
Kids-0.105
Don-0.104
Hi-0.100
Token take
Token ID1011
Feature activation+1.74e+01

Pos logit contributions
an+0.129
et+0.127
lers+0.127
ish+0.126
ig+0.124
Neg logit contributions
Listen-0.075
Sorry-0.074
Sure-0.073
Dad-0.072
Don-0.072
Token the
Token ID262
Feature activation+1.86e+01

Pos logit contributions
an+0.216
ish+0.206
et+0.201
 egg+0.198
ig+0.195
Neg logit contributions
Listen-0.127
Kids-0.122
Sure-0.120
Hi-0.118
Jack-0.118
Token av
Token ID1196
Feature activation+4.21e+01

Pos logit contributions
cks+0.171
et+0.149
an+0.142
igs+0.136
ig+0.136
Neg logit contributions
Sure-0.200
Hi-0.199
Listen-0.193
Don-0.190
Exc-0.188
Tokenoc
Token ID420
Feature activation+4.61e+01

Pos logit contributions
ps+0.348
 cob+0.326
cks+0.321
 matches+0.319
an+0.318
Neg logit contributions
Exc-0.456
Sure-0.449
When-0.448
He-0.439
Dad-0.438
Tokenados
Token ID22484
Feature activation+2.88e+01

Pos logit contributions
 cob+0.336
cks+0.324
 web+0.320
et+0.306
ery+0.306
Neg logit contributions
Sorry-0.510
Dad-0.495
Sure-0.471
Um-0.467
Exc-0.465
Token and
Token ID290
Feature activation+1.05e+01

Pos logit contributions
an+0.306
cks+0.298
et+0.298
ig+0.288
lers+0.286
Neg logit contributions
Sure-0.242
Don-0.240
He-0.236
When-0.235
Dad-0.235
Token scatter
Token ID41058
Feature activation+2.11e+01

Pos logit contributions
an+0.107
ig+0.106
et+0.103
cks+0.099
ish+0.098
Neg logit contributions
Sorry-0.101
Don-0.098
Listen-0.097
Exc-0.097
Dad-0.096
Token them
Token ID606
Feature activation+2.03e+01

Pos logit contributions
an+0.262
ig+0.258
cks+0.239
 egg+0.238
ich+0.237
Neg logit contributions
Exc-0.149
Hi-0.142
Sure-0.141
Jack-0.140
Don-0.140
Token around
Token ID1088
Feature activation+1.66e+01

Pos logit contributions
an+0.182
 egg+0.164
ig+0.163
opl+0.161
et+0.161
Neg logit contributions
On-0.188
He-0.185
Don-0.183
His-0.182
Sure-0.180
Token harder
Token ID7069
Feature activation+2.00e+01

Pos logit contributions
an+0.176
ig+0.172
et+0.172
ish+0.170
lers+0.164
Neg logit contributions
Sorry-0.126
Don-0.126
Kids-0.123
Listen-0.123
Hi-0.120
Token to
Token ID284
Feature activation+1.24e+01

Pos logit contributions
an+0.190
ish+0.180
et+0.174
 egg+0.171
ery+0.168
Neg logit contributions
He-0.198
Sorry-0.197
Don-0.196
Listen-0.193
Dad-0.187
Token push
Token ID4574
Feature activation+2.02e+01

Pos logit contributions
an+0.146
lers+0.142
ig+0.140
ish+0.137
et+0.134
Neg logit contributions
Sorry-0.092
Dad-0.092
Listen-0.092
Don-0.088
Sure-0.086
Token the
Token ID262
Feature activation+1.83e+01

Pos logit contributions
an+0.203
ish+0.191
 egg+0.191
et+0.188
ig+0.187
Neg logit contributions
Don-0.186
Sorry-0.185
Kids-0.185
Dad-0.183
Listen-0.182
Token wheel
Token ID7825
Feature activation+2.83e+01

Pos logit contributions
cks+0.147
et+0.137
pp+0.132
ig+0.129
an+0.128
Neg logit contributions
Sorry-0.202
Dad-0.199
Don-0.196
He-0.196
No-0.196
Tokenbar
Token ID5657
Feature activation+4.56e+01

Pos logit contributions
 egg+0.243
an+0.240
lers+0.236
cks+0.234
iflower+0.231
Neg logit contributions
Sorry-0.297
Sure-0.293
He-0.291
His-0.284
Don-0.283
Tokenrow
Token ID808
Feature activation+2.94e+01

Pos logit contributions
 egg+0.335
 cob+0.314
 matches+0.314
lers+0.310
cks+0.306
Neg logit contributions
His-0.528
He-0.519
Dad-0.511
Sorry-0.509
So-0.492
Token.
Token ID13
Feature activation+1.87e+00

Pos logit contributions
 egg+0.249
cks+0.249
an+0.246
ish+0.240
lers+0.239
Neg logit contributions
Sorry-0.314
He-0.309
Don-0.305
Sure-0.296
His-0.295
Token But
Token ID887
Feature activation+5.60e+00

Pos logit contributions
et+0.022
lers+0.021
ish+0.021
ig+0.021
 egg+0.021
Neg logit contributions
Sorry-0.014
Listen-0.014
Kids-0.013
Don-0.012
Dad-0.012
Token her
Token ID607
Feature activation+1.32e+01

Pos logit contributions
an+0.065
et+0.064
lers+0.063
ish+0.062
cks+0.062
Neg logit contributions
Sorry-0.045
Don-0.044
Listen-0.044
Kids-0.043
Sure-0.042
Token backyard
Token ID24296
Feature activation+2.09e+01

Pos logit contributions
cks+0.150
et+0.149
an+0.143
pp+0.143
lers+0.142
Neg logit contributions
Sorry-0.106
Dad-0.104
Kids-0.100
Listen-0.100
Um-0.099
Token reward
Token ID6721
Feature activation+2.66e+01

Pos logit contributions
cks+0.297
et+0.279
an+0.261
igs+0.258
ig+0.256
Neg logit contributions
Hi-0.384
Dad-0.364
Sorry-0.362
No-0.361
Don-0.356
Token.
Token ID13
Feature activation+5.45e+00

Pos logit contributions
an+0.227
et+0.222
ig+0.215
ish+0.215
cks+0.211
Neg logit contributions
Kids-0.280
Sorry-0.280
Listen-0.276
Don-0.275
Dad-0.273
Token It
Token ID632
Feature activation+1.51e+01

Pos logit contributions
et+0.068
ish+0.067
lers+0.066
an+0.066
 egg+0.065
Neg logit contributions
Sorry-0.036
Listen-0.036
Kids-0.035
Don-0.032
Dad-0.031
Token was
Token ID373
Feature activation+1.22e+01

Pos logit contributions
et+0.175
an+0.174
ish+0.172
ig+0.172
lers+0.171
Neg logit contributions
Sorry-0.112
Don-0.111
Listen-0.110
Kids-0.108
Sure-0.107
Token ch
Token ID442
Feature activation+2.53e+01

Pos logit contributions
an+0.139
lers+0.136
ish+0.136
ig+0.136
et+0.126
Neg logit contributions
Don-0.101
Listen-0.099
Thanks-0.099
Dad-0.096
When-0.095
Tokenocol
Token ID4668
Feature activation+4.52e+01

Pos logit contributions
 egg+0.165
et+0.161
an+0.159
 matches+0.147
lers+0.147
Neg logit contributions
Sorry-0.315
Listen-0.308
Don-0.304
Exc-0.303
Sure-0.303
Tokenates
Token ID689
Feature activation+2.79e+01

Pos logit contributions
 cob+0.260
 v+0.250
an+0.239
 matches+0.232
 wall+0.229
Neg logit contributions
Dad-0.581
Sorry-0.576
His-0.558
By-0.556
Listen-0.556
Token and
Token ID290
Feature activation+1.12e+01

Pos logit contributions
an+0.257
cks+0.247
igs+0.242
et+0.237
ig+0.237
Neg logit contributions
He-0.254
Sorry-0.248
When-0.246
Jack-0.245
Don-0.243
Token sweets
Token ID42402
Feature activation+2.64e+01

Pos logit contributions
an+0.124
ish+0.117
ig+0.116
et+0.113
ans+0.108
Neg logit contributions
Don-0.101
Sorry-0.100
Listen-0.097
Hi-0.096
Um-0.093
Token!
Token ID0
Feature activation+1.89e+00

Pos logit contributions
an+0.253
cks+0.243
ig+0.241
ish+0.237
igs+0.236
Neg logit contributions
Sorry-0.248
Don-0.246
He-0.242
When-0.236
By-0.235
Token
Token ID198
Feature activation+2.08e+00

Pos logit contributions
et+0.024
lers+0.023
ish+0.023
an+0.023
 egg+0.023
Neg logit contributions
Sorry-0.013
Listen-0.013
Kids-0.013
Don-0.012
Dad-0.011
Token give
Token ID1577
Feature activation+1.91e+01

Pos logit contributions
an+0.147
ig+0.147
lers+0.146
ish+0.146
et+0.144
Neg logit contributions
Listen-0.107
Sorry-0.106
Don-0.106
Sure-0.105
Dad-0.105
Token to
Token ID284
Feature activation+1.07e+01

Pos logit contributions
ig+0.228
an+0.224
et+0.220
cks+0.217
opl+0.213
Neg logit contributions
Don-0.140
Listen-0.137
Kids-0.136
Sorry-0.136
Sure-0.133
Token the
Token ID262
Feature activation+1.76e+01

Pos logit contributions
an+0.121
et+0.118
ish+0.118
ig+0.117
lers+0.115
Neg logit contributions
Kids-0.086
Listen-0.083
Sure-0.082
Don-0.081
Sorry-0.081
Token rh
Token ID9529
Feature activation+3.71e+01

Pos logit contributions
cks+0.149
et+0.142
lers+0.133
ig+0.129
an+0.127
Neg logit contributions
Sorry-0.199
Don-0.198
Hi-0.196
Sure-0.193
Listen-0.191
Tokenin
Token ID259
Feature activation+2.90e+01

Pos logit contributions
et+0.348
ps+0.304
 cob+0.302
cks+0.297
 egg+0.296
Neg logit contributions
Sorry-0.397
Dad-0.378
Sure-0.372
Jack-0.368
He-0.368
Tokenoc
Token ID420
Feature activation+4.50e+01

Pos logit contributions
cks+0.257
et+0.247
ps+0.241
eking+0.233
ishing+0.228
Neg logit contributions
Sure-0.320
His-0.300
He-0.298
Dad-0.297
Sorry-0.292
Tokeneros
Token ID27498
Feature activation+2.76e+01

Pos logit contributions
 cob+0.268
cks+0.268
ery+0.268
 web+0.250
et+0.249
Neg logit contributions
Sorry-0.565
Sure-0.525
Dad-0.525
Um-0.512
Listen-0.506
Token.
Token ID13
Feature activation+2.84e+00

Pos logit contributions
et+0.252
cks+0.246
ig+0.241
an+0.241
lers+0.232
Neg logit contributions
Sure-0.268
Sorry-0.263
Listen-0.263
Don-0.261
Hi-0.259
Token Lily
Token ID20037
Feature activation+1.29e+01

Pos logit contributions
et+0.036
ish+0.035
lers+0.034
an+0.034
 egg+0.034
Neg logit contributions
Sorry-0.019
Listen-0.019
Kids-0.018
Don-0.017
Dad-0.016
Token walked
Token ID6807
Feature activation+1.53e+01

Pos logit contributions
et+0.141
lers+0.139
 egg+0.139
an+0.138
ish+0.137
Neg logit contributions
Kids-0.108
Listen-0.104
Sorry-0.104
Don-0.104
Sure-0.100
Token to
Token ID284
Feature activation+7.36e+00

Pos logit contributions
et+0.113
lers+0.107
an+0.107
ish+0.107
ig+0.103
Neg logit contributions
Sorry-0.191
Listen-0.190
Kids-0.190
Don-0.184
Sure-0.182
Token quiet
Token ID5897
Feature activation+3.30e+01

Pos logit contributions
Hi+0.016
Exc+0.015
Dad+0.015
Sorry+0.014
Jack+0.014
Neg logit contributions
cks-0.015
ates-0.015
aces-0.014
ps-0.014
ery-0.014
Token compared
Token ID3688
Feature activation+2.57e+01

Pos logit contributions
ig+0.278
ish+0.277
cks+0.269
igs+0.269
et+0.266
Neg logit contributions
Kids-0.347
Sorry-0.338
Listen-0.335
Don-0.330
Dad-0.329
Token to
Token ID284
Feature activation+1.13e+01

Pos logit contributions
an+0.278
ish+0.275
ig+0.273
et+0.270
lers+0.263
Neg logit contributions
Listen-0.219
Kids-0.218
Don-0.216
Dad-0.211
Sorry-0.211
Token the
Token ID262
Feature activation+1.88e+01

Pos logit contributions
an+0.121
ish+0.120
ig+0.118
lers+0.117
et+0.117
Neg logit contributions
Listen-0.099
Kids-0.099
Don-0.096
Sorry-0.096
Dad-0.093
Token noisy
Token ID31210
Feature activation+2.74e+01

Pos logit contributions
cks+0.166
et+0.155
ig+0.154
ish+0.149
lers+0.148
Neg logit contributions
Listen-0.201
Don-0.201
Sorry-0.199
Hi-0.198
Kids-0.197
Token se
Token ID384
Feature activation+4.49e+01

Pos logit contributions
cks+0.250
ig+0.225
et+0.222
pp+0.221
ish+0.220
Neg logit contributions
Kids-0.299
Sorry-0.289
Listen-0.285
Hi-0.285
Dad-0.282
Tokenag
Token ID363
Feature activation+3.23e+01

Pos logit contributions
 egg+0.334
 cob+0.318
 chim+0.313
 matches+0.307
 eggs+0.301
Neg logit contributions
Sure-0.519
Sorry-0.518
Listen-0.514
Kids-0.507
His-0.505
Tokenull
Token ID724
Feature activation+2.77e+01

Pos logit contributions
 egg+0.284
ish+0.280
lers+0.272
et+0.263
ig+0.260
Neg logit contributions
His-0.329
Kids-0.328
Dad-0.325
Listen-0.322
Jack-0.317
Tokens
Token ID82
Feature activation+1.64e+01

Pos logit contributions
 egg+0.264
igs+0.259
ates+0.258
et+0.252
ool+0.247
Neg logit contributions
Don-0.238
Sorry-0.237
He-0.236
His-0.227
By-0.224
Token outside
Token ID2354
Feature activation+2.03e+01

Pos logit contributions
an+0.139
et+0.137
 egg+0.135
ish+0.134
ig+0.133
Neg logit contributions
Sorry-0.168
Don-0.165
By-0.160
Listen-0.158
Thanks-0.158
Token.
Token ID13
Feature activation+2.49e+00

Pos logit contributions
an+0.164
ish+0.161
lers+0.159
ig+0.159
et+0.159
Neg logit contributions
Sorry-0.229
Listen-0.225
Don-0.225
Kids-0.223
Dad-0.219
Token as
Token ID355
Feature activation+9.43e+00

Pos logit contributions
ish+0.202
an+0.202
ig+0.198
et+0.198
cks+0.197
Neg logit contributions
Listen-0.241
Kids-0.240
Don-0.238
Sorry-0.236
So-0.236
Token he
Token ID339
Feature activation+1.32e+01

Pos logit contributions
et+0.131
an+0.130
ish+0.130
lers+0.127
ig+0.125
Neg logit contributions
Sorry-0.055
Listen-0.055
Kids-0.053
Don-0.053
Dad-0.052
Token opened
Token ID4721
Feature activation+1.87e+01

Pos logit contributions
et+0.159
an+0.156
ish+0.155
lers+0.154
 egg+0.153
Neg logit contributions
Sorry-0.098
Kids-0.098
Listen-0.097
Don-0.093
Sure-0.090
Token the
Token ID262
Feature activation+1.63e+01

Pos logit contributions
an+0.222
ish+0.212
ig+0.208
ery+0.205
et+0.202
Neg logit contributions
His-0.130
When-0.129
Dad-0.128
Listen-0.127
He-0.125
Token ch
Token ID442
Feature activation+2.82e+01

Pos logit contributions
cks+0.132
et+0.131
ig+0.123
an+0.122
ish+0.121
Neg logit contributions
Listen-0.178
Sorry-0.178
Hi-0.178
Sure-0.176
Kids-0.175
Tokenocol
Token ID4668
Feature activation+4.47e+01

Pos logit contributions
 egg+0.170
et+0.167
fl+0.164
 globe+0.162
pp+0.161
Neg logit contributions
Sorry-0.332
Kids-0.328
Dad-0.319
His-0.316
Listen-0.315
Tokenates
Token ID689
Feature activation+2.60e+01

Pos logit contributions
 cob+0.264
 v+0.255
 matches+0.235
 bulb+0.221
 web+0.216
Neg logit contributions
Dad-0.579
His-0.557
Sorry-0.550
By-0.545
Listen-0.538
Token and
Token ID290
Feature activation+7.65e+00

Pos logit contributions
an+0.228
et+0.220
cks+0.218
igs+0.214
lers+0.209
Neg logit contributions
Dad-0.255
Sorry-0.252
When-0.250
He-0.249
Sure-0.249
Token started
Token ID2067
Feature activation+1.33e+01

Pos logit contributions
an+0.085
ig+0.084
et+0.083
ish+0.082
lers+0.080
Neg logit contributions
Sorry-0.065
Don-0.063
Listen-0.062
Kids-0.062
Sure-0.061
Token eating
Token ID6600
Feature activation+1.72e+01

Pos logit contributions
an+0.144
ish+0.138
et+0.136
ig+0.133
ery+0.130
Neg logit contributions
Listen-0.121
Don-0.118
Kids-0.117
Sorry-0.117
His-0.111
Token them
Token ID606
Feature activation+1.75e+01

Pos logit contributions
an+0.200
ig+0.189
ish+0.183
et+0.177
lers+0.175
Neg logit contributions
Dad-0.133
Exc-0.133
Sure-0.132
Don-0.131
He-0.130
Token giving
Token ID3501
Feature activation+1.90e+01

Pos logit contributions
ish+0.144
lers+0.141
ig+0.139
an+0.131
ishes+0.125
Neg logit contributions
Dad-0.135
Listen-0.133
Don-0.129
Kids-0.128
What-0.128
Token him
Token ID683
Feature activation+1.76e+01

Pos logit contributions
ig+0.216
cks+0.209
et+0.206
an+0.203
igs+0.200
Neg logit contributions
Hi-0.149
Listen-0.146
Don-0.145
Dad-0.143
Sorry-0.141
Token too
Token ID1165
Feature activation+1.56e+01

Pos logit contributions
cks+0.182
ig+0.180
et+0.178
hole+0.176
lers+0.173
Neg logit contributions
Sorry-0.137
Listen-0.136
Sure-0.134
Dad-0.132
Hi-0.131
Token many
Token ID867
Feature activation+1.83e+01

Pos logit contributions
et+0.210
an+0.205
ish+0.205
ig+0.203
lers+0.195
Neg logit contributions
Sorry-0.102
Listen-0.101
Dad-0.097
Don-0.095
By-0.091
Token ch
Token ID442
Feature activation+2.75e+01

Pos logit contributions
an+0.161
ish+0.160
ig+0.157
et+0.157
pp+0.155
Neg logit contributions
Exc-0.178
He-0.174
Sorry-0.174
Dad-0.172
Hi-0.170
Tokenocol
Token ID4668
Feature activation+4.45e+01

Pos logit contributions
 globe+0.202
et+0.196
ning+0.196
 matches+0.191
 cob+0.190
Neg logit contributions
Dad-0.296
Sorry-0.280
Kids-0.280
Listen-0.272
He-0.272
Tokenates
Token ID689
Feature activation+2.68e+01

Pos logit contributions
 cob+0.278
 v+0.255
 matches+0.238
 chim+0.235
 wall+0.230
Neg logit contributions
Dad-0.578
Sorry-0.561
By-0.540
Listen-0.538
His-0.536
Token."
Token ID526
Feature activation-8.39e-01

Pos logit contributions
an+0.219
cks+0.215
et+0.213
ig+0.210
ish+0.197
Neg logit contributions
Dad-0.270
Sorry-0.269
He-0.269
Don-0.256
His-0.253
Token The
Token ID383
Feature activation+8.21e+00

Pos logit contributions
Listen+0.006
Sorry+0.006
Kids+0.006
By+0.006
Don+0.006
Neg logit contributions
et-0.010
lers-0.010
ish-0.009
 egg-0.009
an-0.009
Token king
Token ID5822
Feature activation+1.87e+01

Pos logit contributions
et+0.083
lers+0.082
cks+0.080
ig+0.078
ish+0.078
Neg logit contributions
Dad-0.081
Sorry-0.080
Kids-0.079
Sure-0.077
Hi-0.077
Token learned
Token ID4499
Feature activation+7.65e+00

Pos logit contributions
et+0.221
cks+0.215
lers+0.214
ish+0.214
igs+0.213
Neg logit contributions
Sorry-0.130
Kids-0.128
Listen-0.127
Sure-0.126
Don-0.123
Token it
Token ID340
Feature activation+2.17e+01

Pos logit contributions
an+0.231
ig+0.228
et+0.226
cks+0.220
ish+0.219
Neg logit contributions
Kids-0.125
Sure-0.121
Listen-0.118
Don-0.117
Dad-0.116
Token to
Token ID284
Feature activation+9.18e+00

Pos logit contributions
et+0.238
cks+0.235
ig+0.234
an+0.229
ish+0.223
Neg logit contributions
Listen-0.172
Sorry-0.170
Sure-0.169
Don-0.164
Hi-0.161
Token the
Token ID262
Feature activation+1.69e+01

Pos logit contributions
et+0.108
lers+0.108
ish+0.108
ig+0.107
an+0.107
Neg logit contributions
Kids-0.071
Listen-0.069
Sorry-0.069
Dad-0.068
Don-0.066
Token rh
Token ID9529
Feature activation+3.59e+01

Pos logit contributions
et+0.148
cks+0.146
ig+0.145
lers+0.145
ish+0.137
Neg logit contributions
Sorry-0.185
Dad-0.179
Hi-0.179
Listen-0.178
Kids-0.177
Tokenin
Token ID259
Feature activation+2.76e+01

Pos logit contributions
et+0.331
ps+0.285
 cob+0.285
cks+0.282
 egg+0.281
Neg logit contributions
Sorry-0.387
Dad-0.369
He-0.361
Jack-0.359
Listen-0.358
Tokenoc
Token ID420
Feature activation+4.39e+01

Pos logit contributions
et+0.232
cks+0.229
ps+0.215
igs+0.213
lers+0.210
Neg logit contributions
Sure-0.304
Sorry-0.292
Dad-0.290
His-0.283
He-0.282
Tokeneros
Token ID27498
Feature activation+2.67e+01

Pos logit contributions
 cob+0.268
cks+0.267
ery+0.259
eh+0.254
et+0.253
Neg logit contributions
Sorry-0.544
Dad-0.509
Sure-0.499
Listen-0.490
Um-0.488
Token.
Token ID13
Feature activation+2.42e+00

Pos logit contributions
et+0.220
cks+0.204
ig+0.202
an+0.201
lers+0.197
Neg logit contributions
Sorry-0.289
Sure-0.289
Listen-0.282
Don-0.280
Kids-0.279
Token When
Token ID1649
Feature activation+8.15e+00

Pos logit contributions
et+0.029
ish+0.028
lers+0.028
an+0.028
 egg+0.028
Neg logit contributions
Sorry-0.017
Listen-0.017
Kids-0.016
Don-0.016
Dad-0.015
Token he
Token ID339
Feature activation+1.20e+01

Pos logit contributions
et+0.095
an+0.093
ish+0.092
lers+0.091
ig+0.090
Neg logit contributions
Sorry-0.067
Listen-0.066
Kids-0.066
Don-0.063
Dad-0.062
Token tasted
Token ID29187
Feature activation+1.54e+01

Pos logit contributions
et+0.135
lers+0.130
ish+0.130
an+0.130
ig+0.127
Neg logit contributions
Sorry-0.102
Listen-0.101
Kids-0.101
Don-0.096
Sure-0.094
Token that
Token ID326
Feature activation+1.12e+01

Pos logit contributions
et+0.169
an+0.163
ish+0.162
lers+0.160
ig+0.159
Neg logit contributions
Sorry-0.184
Kids-0.182
Listen-0.181
Don-0.178
Sure-0.172
Token the
Token ID262
Feature activation+1.48e+01

Pos logit contributions
et+0.143
an+0.142
ish+0.140
lers+0.140
ig+0.137
Neg logit contributions
Sorry-0.077
Listen-0.077
Kids-0.077
Don-0.073
Dad-0.071
Token bad
Token ID2089
Feature activation+2.11e+01

Pos logit contributions
cks+0.144
et+0.140
lers+0.135
ig+0.131
ish+0.130
Neg logit contributions
Sorry-0.153
Kids-0.152
Listen-0.149
Hi-0.148
Sure-0.146
Token rh
Token ID9529
Feature activation+3.63e+01

Pos logit contributions
cks+0.206
et+0.198
lers+0.195
an+0.191
ish+0.191
Neg logit contributions
Kids-0.215
Sorry-0.210
Listen-0.210
Don-0.202
Hi-0.201
Tokenin
Token ID259
Feature activation+2.73e+01

Pos logit contributions
et+0.321
 egg+0.289
ig+0.287
 cob+0.284
cks+0.283
Neg logit contributions
Sorry-0.401
Listen-0.373
Dad-0.369
He-0.369
By-0.368
Tokenoc
Token ID420
Feature activation+4.32e+01

Pos logit contributions
cks+0.215
et+0.209
igs+0.209
ish+0.203
ig+0.203
Neg logit contributions
Sorry-0.311
Don-0.304
Kids-0.301
Listen-0.301
Sure-0.298
Tokeneros
Token ID27498
Feature activation+2.69e+01

Pos logit contributions
cks+0.260
 cob+0.259
 web+0.246
igs+0.244
ig+0.233
Neg logit contributions
Sorry-0.568
Sure-0.518
Dad-0.510
Listen-0.503
Um-0.502
Token was
Token ID373
Feature activation+1.35e+01

Pos logit contributions
et+0.326
cks+0.321
an+0.319
ig+0.318
ish+0.318
Neg logit contributions
Sorry-0.187
Kids-0.183
Don-0.183
Listen-0.183
Hi-0.170
Token now
Token ID783
Feature activation+1.46e+01

Pos logit contributions
lers+0.150
ish+0.149
an+0.149
ig+0.146
et+0.140
Neg logit contributions
Listen-0.114
Thanks-0.114
Sorry-0.113
Don-0.113
By-0.110
Token nice
Token ID3621
Feature activation+1.81e+01

Pos logit contributions
an+0.140
et+0.136
lers+0.134
ish+0.133
ig+0.132
Neg logit contributions
Sorry-0.145
Listen-0.143
Don-0.142
Thanks-0.140
By-0.136
Token.
Token ID13
Feature activation+1.12e+00

Pos logit contributions
cks+0.168
an+0.162
et+0.162
lers+0.158
ig+0.155
Neg logit contributions
Kids-0.185
Listen-0.181
Don-0.180
Sorry-0.178
Hi-0.174
Token you
Token ID345
Feature activation+2.00e+01

Pos logit contributions
Sorry+0.164
Kids+0.163
Listen+0.161
Exc+0.156
Sure+0.155
Neg logit contributions
et-0.111
 egg-0.108
 matches-0.104
lers-0.103
ish-0.100
Token go
Token ID467
Feature activation+1.93e+01

Pos logit contributions
et+0.210
an+0.205
lers+0.201
ish+0.198
ig+0.198
Neg logit contributions
Sorry-0.183
Listen-0.182
Kids-0.180
Don-0.175
Hi-0.172
Token,
Token ID11
Feature activation+8.29e+00

Pos logit contributions
et+0.181
an+0.174
ish+0.171
 egg+0.171
ery+0.171
Neg logit contributions
Listen-0.192
Sorry-0.190
Kids-0.189
Hi-0.185
Sure-0.180
Token my
Token ID616
Feature activation+1.76e+01

Pos logit contributions
lers+0.095
et+0.095
an+0.092
ish+0.091
cks+0.090
Neg logit contributions
Kids-0.068
Sorry-0.066
Listen-0.066
Don-0.066
Dad-0.065
Token d
Token ID288
Feature activation+2.96e+01

Pos logit contributions
cks+0.170
et+0.167
lers+0.161
an+0.149
els+0.148
Neg logit contributions
Sorry-0.172
Sure-0.171
Dad-0.169
Kids-0.165
Hi-0.163
Tokenarl
Token ID7063
Feature activation+4.29e+01

Pos logit contributions
lers+0.289
et+0.286
 egg+0.284
cks+0.278
an+0.276
Neg logit contributions
Sorry-0.283
Kids-0.281
Listen-0.273
Sure-0.269
Dad-0.268
Tokenings
Token ID654
Feature activation+2.34e+01

Pos logit contributions
et+0.336
 egg+0.310
 cob+0.305
 matches+0.304
lers+0.301
Neg logit contributions
Sorry-0.503
Listen-0.484
Kids-0.484
Dad-0.483
By-0.470
Token.
Token ID13
Feature activation+5.26e+00

Pos logit contributions
et+0.175
an+0.172
lers+0.171
ish+0.171
ig+0.170
Neg logit contributions
Sorry-0.283
Kids-0.281
Listen-0.281
Don-0.274
Dad-0.270
Token Enjoy
Token ID18179
Feature activation+1.71e+01

Pos logit contributions
et+0.054
ish+0.052
an+0.052
lers+0.052
 egg+0.052
Neg logit contributions
Sorry-0.048
Listen-0.047
Kids-0.046
Don-0.044
Dad-0.043
Token your
Token ID534
Feature activation+1.78e+01

Pos logit contributions
lers+0.158
ish+0.157
an+0.156
et+0.153
hole+0.149
Neg logit contributions
Dad-0.171
Kids-0.161
Listen-0.157
Sorry-0.155
Sure-0.154
Token lunch
Token ID9965
Feature activation+2.92e+01

Pos logit contributions
et+0.171
cks+0.159
an+0.153
ish+0.150
lers+0.150
Neg logit contributions
Sorry-0.179
Dad-0.170
Listen-0.170
Hi-0.168
Kids-0.167
Token who
Token ID508
Feature activation+1.08e+01

Pos logit contributions
et+0.269
cks+0.261
ps+0.261
igs+0.250
an+0.248
Neg logit contributions
Don-0.171
Hi-0.170
He-0.167
Kids-0.166
Sorry-0.164
Token lived
Token ID5615
Feature activation+1.54e+01

Pos logit contributions
et+0.123
ish+0.118
an+0.118
lers+0.117
ig+0.116
Neg logit contributions
Sorry-0.089
Listen-0.088
Kids-0.088
Don-0.086
Sure-0.083
Token in
Token ID287
Feature activation+1.45e+01

Pos logit contributions
et+0.144
an+0.135
 egg+0.130
lers+0.129
ish+0.128
Neg logit contributions
Don-0.163
By-0.160
He-0.159
Sorry-0.159
His-0.156
Token his
Token ID465
Feature activation+1.90e+01

Pos logit contributions
et+0.158
ish+0.156
an+0.155
ig+0.151
lers+0.149
Neg logit contributions
Listen-0.130
Don-0.128
Kids-0.125
Sorry-0.124
Dad-0.122
Token cozy
Token ID37438
Feature activation+3.02e+01

Pos logit contributions
et+0.163
cks+0.146
ps+0.142
an+0.138
ish+0.138
Neg logit contributions
Hi-0.209
Dad-0.207
Listen-0.206
By-0.201
Sorry-0.201
Token bur
Token ID4356
Feature activation+4.29e+01

Pos logit contributions
et+0.303
cks+0.292
ps+0.265
ery+0.251
ab+0.244
Neg logit contributions
Hi-0.292
Dad-0.280
His-0.271
Kids-0.269
Exc-0.269
Tokenrow
Token ID808
Feature activation+3.02e+01

Pos logit contributions
et+0.332
 matches+0.326
 oven+0.318
cks+0.318
 globe+0.309
Neg logit contributions
He-0.425
His-0.404
When-0.378
She-0.378
Don-0.377
Token.
Token ID13
Feature activation+4.40e-01

Pos logit contributions
cks+0.300
et+0.278
 matches+0.273
els+0.271
ig+0.269
Neg logit contributions
He-0.282
His-0.266
On-0.255
So-0.253
She-0.251
Token He
Token ID679
Feature activation+9.82e+00

Pos logit contributions
 egg+0.005
et+0.005
lers+0.005
an+0.005
ish+0.005
Neg logit contributions
Sorry-0.003
Listen-0.003
Kids-0.003
Don-0.003
By-0.003
Token loved
Token ID6151
Feature activation+1.13e+01

Pos logit contributions
et+0.107
lers+0.103
an+0.102
ish+0.102
 egg+0.100
Neg logit contributions
Sorry-0.088
Kids-0.087
Listen-0.086
Don-0.082
Dad-0.081
Token to
Token ID284
Feature activation+6.76e+00

Pos logit contributions
an+0.138
et+0.138
ish+0.132
ig+0.132
cks+0.127
Neg logit contributions
Listen-0.089
Don-0.087
Kids-0.084
Hi-0.082
Come-0.079
Token there
Token ID612
Feature activation+1.27e+01

Pos logit contributions
an+0.112
et+0.111
ish+0.110
lers+0.108
 egg+0.107
Neg logit contributions
Kids-0.151
Listen-0.149
Sorry-0.148
Don-0.147
Sure-0.142
Token was
Token ID373
Feature activation+1.01e+01

Pos logit contributions
et+0.134
ish+0.128
an+0.127
lers+0.127
ig+0.125
Neg logit contributions
Sorry-0.118
Kids-0.117
Listen-0.117
Don-0.111
Sure-0.109
Token a
Token ID257
Feature activation+1.48e+01

Pos logit contributions
et+0.085
lers+0.078
 egg+0.078
an+0.077
ish+0.075
Neg logit contributions
Kids-0.115
Sorry-0.115
Listen-0.112
Sure-0.108
Don-0.107
Token rh
Token ID9529
Feature activation+3.51e+01

Pos logit contributions
et+0.145
lers+0.138
cks+0.136
an+0.133
ish+0.133
Neg logit contributions
Sorry-0.153
Kids-0.150
Listen-0.147
Dad-0.147
Sure-0.146
Tokenin
Token ID259
Feature activation+2.70e+01

Pos logit contributions
et+0.329
cks+0.291
ps+0.289
 cob+0.283
ish+0.282
Neg logit contributions
Sorry-0.368
Dad-0.351
Jack-0.347
He-0.345
By-0.338
Tokenoc
Token ID420
Feature activation+4.27e+01

Pos logit contributions
ps+0.274
cks+0.272
et+0.266
igs+0.256
aces+0.255
Neg logit contributions
Dad-0.255
Sure-0.255
Sorry-0.249
He-0.247
His-0.244
Tokeneros
Token ID27498
Feature activation+2.75e+01

Pos logit contributions
cks+0.287
 cob+0.274
ery+0.274
igs+0.267
et+0.261
Neg logit contributions
Sorry-0.498
Dad-0.460
Sure-0.458
Um-0.451
Hi-0.448
Token who
Token ID508
Feature activation+1.28e+01

Pos logit contributions
et+0.270
igs+0.257
ig+0.256
ish+0.253
ps+0.252
Neg logit contributions
Sorry-0.246
Hi-0.242
He-0.242
Don-0.241
His-0.239
Token was
Token ID373
Feature activation+1.36e+01

Pos logit contributions
et+0.156
ig+0.154
ish+0.152
an+0.152
 egg+0.150
Neg logit contributions
Sorry-0.091
Don-0.091
Listen-0.090
Kids-0.088
Hi-0.087
Token always
Token ID1464
Feature activation+1.72e+01

Pos logit contributions
ig+0.164
ish+0.158
an+0.156
et+0.154
ishing+0.150
Neg logit contributions
He-0.101
His-0.097
Don-0.094
Dad-0.094
Hi-0.094
Token naughty
Token ID45128
Feature activation+1.74e+01

Pos logit contributions
ig+0.179
igs+0.173
an+0.172
ish+0.169
ool+0.166
Neg logit contributions
Don-0.168
Hi-0.159
Thanks-0.152
What-0.152
Listen-0.150
Token Everyday
Token ID48154
Feature activation+1.99e+01

Pos logit contributions
Kids+0.100
Sorry+0.100
Listen+0.099
Don+0.096
Sure+0.094
Neg logit contributions
et-0.126
an-0.124
lers-0.123
ish-0.122
 egg-0.121
Token he
Token ID339
Feature activation+1.62e+01

Pos logit contributions
et+0.242
an+0.236
ish+0.234
lers+0.230
ig+0.228
Neg logit contributions
Sorry-0.149
Kids-0.149
Listen-0.148
Don-0.141
Dad-0.138
Token would
Token ID561
Feature activation+1.08e+01

Pos logit contributions
et+0.203
an+0.202
ish+0.199
lers+0.198
cks+0.197
Neg logit contributions
Sorry-0.106
Listen-0.105
Kids-0.105
Don-0.104
Hi-0.100
Token take
Token ID1011
Feature activation+1.74e+01

Pos logit contributions
an+0.129
et+0.127
lers+0.127
ish+0.126
ig+0.124
Neg logit contributions
Listen-0.075
Sorry-0.074
Sure-0.073
Dad-0.072
Don-0.072
Token the
Token ID262
Feature activation+1.86e+01

Pos logit contributions
an+0.216
ish+0.206
et+0.201
 egg+0.198
ig+0.195
Neg logit contributions
Listen-0.127
Kids-0.122
Sure-0.120
Hi-0.118
Jack-0.118
Token av
Token ID1196
Feature activation+4.21e+01

Pos logit contributions
cks+0.171
et+0.149
an+0.142
igs+0.136
ig+0.136
Neg logit contributions
Sure-0.200
Hi-0.199
Listen-0.193
Don-0.190
Exc-0.188
Tokenoc
Token ID420
Feature activation+4.61e+01

Pos logit contributions
ps+0.348
 cob+0.326
cks+0.321
 matches+0.319
an+0.318
Neg logit contributions
Exc-0.456
Sure-0.449
When-0.448
He-0.439
Dad-0.438
Tokenados
Token ID22484
Feature activation+2.88e+01

Pos logit contributions
 cob+0.336
cks+0.324
 web+0.320
et+0.306
ery+0.306
Neg logit contributions
Sorry-0.510
Dad-0.495
Sure-0.471
Um-0.467
Exc-0.465
Token and
Token ID290
Feature activation+1.05e+01

Pos logit contributions
an+0.306
cks+0.298
et+0.298
ig+0.288
lers+0.286
Neg logit contributions
Sure-0.242
Don-0.240
He-0.236
When-0.235
Dad-0.235
Token scatter
Token ID41058
Feature activation+2.11e+01

Pos logit contributions
an+0.107
ig+0.106
et+0.103
cks+0.099
ish+0.098
Neg logit contributions
Sorry-0.101
Don-0.098
Listen-0.097
Exc-0.097
Dad-0.096
Token them
Token ID606
Feature activation+2.03e+01

Pos logit contributions
an+0.262
ig+0.258
cks+0.239
 egg+0.238
ich+0.237
Neg logit contributions
Exc-0.149
Hi-0.142
Sure-0.141
Jack-0.140
Don-0.140
Token She
Token ID1375
Feature activation+1.12e+01

Pos logit contributions
et+0.091
lers+0.089
ish+0.088
an+0.088
 egg+0.088
Neg logit contributions
Sorry-0.043
Listen-0.043
Kids-0.042
Don-0.038
Sure-0.038
Token picks
Token ID11103
Feature activation+1.77e+01

Pos logit contributions
et+0.120
an+0.117
ish+0.117
lers+0.115
ig+0.114
Neg logit contributions
Sorry-0.098
Kids-0.097
Listen-0.096
Don-0.094
Sure-0.091
Token up
Token ID510
Feature activation+1.59e+01

Pos logit contributions
asis+0.130
ig+0.129
an+0.125
igs+0.122
cks+0.121
Neg logit contributions
Sure-0.205
Listen-0.201
Don-0.198
Hi-0.196
What-0.195
Token the
Token ID262
Feature activation+1.72e+01

Pos logit contributions
an+0.183
ish+0.180
et+0.178
ig+0.176
cks+0.170
Neg logit contributions
Kids-0.125
Listen-0.124
Don-0.123
Sorry-0.121
Sure-0.120
Token re
Token ID302
Feature activation+3.39e+01

Pos logit contributions
cks+0.131
ors+0.124
et+0.123
ig+0.119
igs+0.119
Neg logit contributions
Hi-0.192
Don-0.188
What-0.184
Sure-0.183
Listen-0.182
Tokenind
Token ID521
Feature activation+4.18e+01

Pos logit contributions
 hanging+0.243
et+0.242
 matches+0.235
 cob+0.226
 soaring+0.226
Neg logit contributions
Dad-0.388
Sorry-0.379
His-0.373
He-0.352
Daddy-0.352
Tokeneer
Token ID28153
Feature activation+2.86e+01

Pos logit contributions
 soaring+0.271
ig+0.268
 hanging+0.258
igs+0.258
 web+0.251
Neg logit contributions
Dad-0.528
His-0.485
Kids-0.478
He-0.474
Listen-0.471
Token.
Token ID13
Feature activation+1.76e+00

Pos logit contributions
igs+0.298
et+0.283
cks+0.278
ig+0.274
an+0.274
Neg logit contributions
He-0.230
His-0.222
Don-0.221
Sure-0.220
When-0.216
Token She
Token ID1375
Feature activation+8.63e+00

Pos logit contributions
et+0.023
 egg+0.023
lers+0.023
ish+0.023
ig+0.023
Neg logit contributions
Sorry-0.010
Listen-0.010
Kids-0.009
Sure-0.008
Don-0.008
Token looks
Token ID3073
Feature activation+1.30e+01

Pos logit contributions
et+0.080
an+0.079
ish+0.077
ig+0.075
 egg+0.074
Neg logit contributions
Kids-0.085
Sorry-0.084
Listen-0.084
Exc-0.084
Um-0.083
Token at
Token ID379
Feature activation+8.72e+00

Pos logit contributions
an+0.111
ig+0.110
et+0.107
 egg+0.106
ish+0.104
Neg logit contributions
Don-0.139
Sorry-0.138
Listen-0.137
Hi-0.136
Sure-0.135
Token able
Token ID1498
Feature activation+1.91e+01

Pos logit contributions
lers+0.149
ish+0.148
an+0.147
ig+0.143
 egg+0.142
Neg logit contributions
Sorry-0.161
Kids-0.159
Dad-0.158
Thanks-0.157
Don-0.155
Token to
Token ID284
Feature activation+1.20e+01

Pos logit contributions
et+0.238
an+0.232
ish+0.226
ig+0.224
cks+0.220
Neg logit contributions
Sorry-0.142
Listen-0.141
Don-0.141
Kids-0.135
By-0.131
Token escape
Token ID6654
Feature activation+2.20e+01

Pos logit contributions
et+0.139
lers+0.136
an+0.136
ish+0.134
ig+0.132
Neg logit contributions
Sorry-0.093
Listen-0.093
Kids-0.092
Dad-0.091
Sure-0.089
Token from
Token ID422
Feature activation+1.60e+01

Pos logit contributions
an+0.208
et+0.205
ig+0.202
ish+0.201
cks+0.196
Neg logit contributions
Don-0.205
He-0.199
Sorry-0.197
Listen-0.196
Exc-0.193
Token his
Token ID465
Feature activation+1.74e+01

Pos logit contributions
an+0.182
ish+0.178
ig+0.171
pp+0.170
ery+0.169
Neg logit contributions
Listen-0.128
Don-0.126
Hi-0.124
Dad-0.123
Kids-0.123
Token bur
Token ID4356
Feature activation+4.16e+01

Pos logit contributions
et+0.159
cks+0.154
an+0.141
lers+0.141
pp+0.141
Neg logit contributions
Sorry-0.175
Dad-0.174
Sure-0.169
Kids-0.168
Hi-0.166
Tokenrow
Token ID808
Feature activation+2.86e+01

Pos logit contributions
 globe+0.321
cks+0.315
 oven+0.310
 matches+0.304
 image+0.304
Neg logit contributions
He-0.425
His-0.403
When-0.394
So-0.388
Exc-0.388
Token and
Token ID290
Feature activation+8.79e+00

Pos logit contributions
cks+0.260
lers+0.236
et+0.235
ish+0.232
 egg+0.232
Neg logit contributions
He-0.288
Sorry-0.280
Don-0.279
His-0.278
Dad-0.273
Token play
Token ID711
Feature activation+1.53e+01

Pos logit contributions
et+0.100
an+0.096
ish+0.095
lers+0.095
ig+0.094
Neg logit contributions
Sorry-0.074
Listen-0.072
Kids-0.072
Don-0.072
Sure-0.070
Token in
Token ID287
Feature activation+1.25e+01

Pos logit contributions
ish+0.129
an+0.127
 egg+0.124
cks+0.124
lers+0.122
Neg logit contributions
Sure-0.165
Jack-0.161
Don-0.161
Sorry-0.160
Listen-0.159
Token the
Token ID262
Feature activation+1.65e+01

Pos logit contributions
ish+0.133
an+0.133
et+0.130
lers+0.127
ig+0.123
Neg logit contributions
Kids-0.112
Listen-0.112
Don-0.111
Dad-0.109
Sorry-0.108
Token mist
Token ID4020
Feature activation+3.06e+01

Pos logit contributions
cks+0.164
et+0.158
ish+0.157
ig+0.151
an+0.150
Neg logit contributions
Hi-0.206
Listen-0.202
Sorry-0.201
Kids-0.200
Exc-0.198
Token and
Token ID290
Feature activation+1.08e+01

Pos logit contributions
cks+0.276
an+0.276
ish+0.272
 egg+0.269
ates+0.261
Neg logit contributions
Sorry-0.292
Hi-0.289
Don-0.284
Listen-0.283
He-0.276
Token into
Token ID656
Feature activation+1.70e+01

Pos logit contributions
an+0.119
ig+0.117
ish+0.117
et+0.116
lers+0.113
Neg logit contributions
Don-0.097
Sorry-0.094
Listen-0.094
Hi-0.091
Kids-0.088
Token their
Token ID511
Feature activation+1.79e+01

Pos logit contributions
ish+0.188
an+0.185
et+0.182
ig+0.176
lers+0.174
Neg logit contributions
Don-0.148
Listen-0.147
Dad-0.145
Sorry-0.144
Kids-0.144
Token cozy
Token ID37438
Feature activation+2.84e+01

Pos logit contributions
et+0.130
ish+0.120
an+0.120
ery+0.120
cks+0.117
Neg logit contributions
Sorry-0.213
Kids-0.211
Listen-0.209
Don-0.208
Hi-0.206
Token bur
Token ID4356
Feature activation+4.13e+01

Pos logit contributions
et+0.249
cks+0.247
ery+0.229
ish+0.208
ates+0.208
Neg logit contributions
Hi-0.295
Dad-0.286
Kids-0.285
Sorry-0.278
Exc-0.274
Tokenrows
Token ID8516
Feature activation+2.66e+01

Pos logit contributions
 matches+0.316
 oven+0.316
 globe+0.306
et+0.304
 ornament+0.303
Neg logit contributions
He-0.415
Don-0.395
Sorry-0.390
When-0.389
His-0.388
Token.
Token ID13
Feature activation+1.88e+00

Pos logit contributions
ish+0.245
an+0.245
ig+0.234
igs+0.233
 egg+0.230
Neg logit contributions
Don-0.257
By-0.246
He-0.244
When-0.240
Listen-0.240
Token The
Token ID383
Feature activation+9.85e+00

Pos logit contributions
et+0.022
lers+0.022
 egg+0.022
an+0.022
ish+0.022
Neg logit contributions
Sorry-0.014
Listen-0.013
Kids-0.012
Don-0.012
Sure-0.011
Token end
Token ID886
Feature activation+1.12e+01

Pos logit contributions
cks+0.084
et+0.083
lers+0.078
ish+0.078
ig+0.075
Neg logit contributions
Sorry-0.112
Kids-0.112
Listen-0.110
Hi-0.109
Don-0.109
Token.
Token ID13
Feature activation+4.84e-01

Pos logit contributions
et+0.086
ish+0.085
an+0.083
cks+0.080
lers+0.079
Neg logit contributions
Kids-0.135
Listen-0.132
Sorry-0.129
Don-0.129
Sure-0.128
TokenJ
Token ID41
Feature activation-8.82e+00

Pos logit contributions
Kids+0.088
Listen+0.087
Sorry+0.087
Don+0.083
Dad+0.081
Neg logit contributions
et-0.121
an-0.117
ish-0.116
lers-0.115
 egg-0.113
Tokenill
Token ID359
Feature activation-6.26e+00

Pos logit contributions
Kids+0.066
Listen+0.064
Sorry+0.064
Dad+0.062
Don+0.060
Neg logit contributions
et-0.107
lers-0.105
an-0.104
ish-0.103
 egg-0.102
Token likes
Token ID7832
Feature activation-1.02e+01

Pos logit contributions
Kids+0.059
Hi+0.057
Listen+0.056
Don+0.054
Exc+0.054
Neg logit contributions
rier-0.057
cks-0.057
 egg-0.057
an-0.056
lers-0.056
Token to
Token ID284
Feature activation-1.39e+01

Pos logit contributions
Kids+0.092
Listen+0.091
Sorry+0.089
Dad+0.087
Don+0.087
Neg logit contributions
et-0.114
an-0.112
lers-0.108
ig-0.107
ish-0.107
Token dance
Token ID9280
Feature activation-5.89e+00

Pos logit contributions
Kids+0.121
Sorry+0.121
Listen+0.121
Dad+0.115
Sure+0.115
Neg logit contributions
et-0.150
lers-0.146
an-0.146
ish-0.144
ig-0.141
Token.
Token ID13
Feature activation-2.17e+01

Pos logit contributions
Sorry+0.071
Kids+0.071
Listen+0.071
Sure+0.071
Hi+0.069
Neg logit contributions
et-0.043
an-0.043
 egg-0.041
cks-0.041
ish-0.041
Token She
Token ID1375
Feature activation-1.38e+01

Pos logit contributions
Sorry+0.137
Kids+0.129
Listen+0.126
By+0.116
Um+0.116
Neg logit contributions
et-0.296
ish-0.275
c-0.266
ig-0.266
 egg-0.265
Token dances
Token ID38207
Feature activation-8.89e+00

Pos logit contributions
Kids+0.151
Listen+0.149
Sorry+0.147
Don+0.144
Sure+0.141
Neg logit contributions
et-0.120
ish-0.118
an-0.117
 egg-0.117
lers-0.115
Token in
Token ID287
Feature activation-1.15e+01

Pos logit contributions
Sorry+0.109
Listen+0.109
Kids+0.108
Don+0.106
Sure+0.105
Neg logit contributions
et-0.066
lers-0.063
ish-0.063
an-0.063
 egg-0.061
Token her
Token ID607
Feature activation-9.79e+00

Pos logit contributions
Listen+0.102
Kids+0.102
Dad+0.099
Don+0.098
Sorry+0.098
Neg logit contributions
an-0.125
ish-0.125
lers-0.124
et-0.123
ig-0.119
Token room
Token ID2119
Feature activation-4.15e+00

Pos logit contributions
Kids+0.116
Sorry+0.111
Listen+0.111
Dad+0.110
Hi+0.109
Neg logit contributions
et-0.078
an-0.072
ish-0.072
lers-0.072
cks-0.071
Token to
Token ID284
Feature activation-1.20e+01

Pos logit contributions
Kids+0.074
Listen+0.072
Dad+0.070
Sorry+0.069
Don+0.067
Neg logit contributions
an-0.103
et-0.103
ig-0.100
lers-0.097
cks-0.097
Token play
Token ID711
Feature activation-8.18e+00

Pos logit contributions
Sorry+0.098
Kids+0.097
Listen+0.097
Don+0.092
Dad+0.091
Neg logit contributions
et-0.138
an-0.133
lers-0.133
ish-0.132
ig-0.130
Token with
Token ID351
Feature activation-8.69e+00

Pos logit contributions
Sorry+0.099
Sure+0.097
Kids+0.096
Listen+0.096
Dad+0.094
Neg logit contributions
et-0.061
an-0.059
lers-0.057
 egg-0.057
ish-0.056
Token their
Token ID511
Feature activation-7.60e+00

Pos logit contributions
Dad+0.083
Kids+0.081
Sorry+0.080
Jack+0.079
Exc+0.077
Neg logit contributions
an-0.088
et-0.087
ish-0.082
ig-0.081
lers-0.079
Token toys
Token ID14958
Feature activation-3.06e+00

Pos logit contributions
Kids+0.102
Sorry+0.101
Listen+0.100
Dad+0.099
Sure+0.099
Neg logit contributions
et-0.049
an-0.043
ish-0.042
ig-0.042
cks-0.042
Token.
Token ID13
Feature activation-2.15e+01

Pos logit contributions
Sure+0.028
Sorry+0.028
On+0.028
He+0.027
So+0.027
Neg logit contributions
an-0.029
cks-0.028
et-0.027
 egg-0.027
ig-0.026
Token They
Token ID1119
Feature activation-1.42e+01

Pos logit contributions
Sorry+0.164
Listen+0.148
Kids+0.145
Don+0.144
Um+0.138
Neg logit contributions
et-0.285
 matches-0.248
cks-0.246
ish-0.246
 oven-0.245
Token have
Token ID423
Feature activation-1.39e+01

Pos logit contributions
Sorry+0.124
Kids+0.122
Listen+0.122
Don+0.117
Dad+0.114
Neg logit contributions
et-0.158
lers-0.151
ish-0.151
an-0.151
ig-0.148
Token cars
Token ID5006
Feature activation-3.01e+00

Pos logit contributions
Kids+0.140
Sorry+0.140
Listen+0.139
Don+0.134
Dad+0.132
Neg logit contributions
et-0.134
an-0.130
ish-0.129
lers-0.128
ig-0.125
Token,
Token ID11
Feature activation-1.27e+01

Pos logit contributions
Sorry+0.033
Kids+0.033
Listen+0.033
Sure+0.033
Dad+0.032
Neg logit contributions
an-0.024
 egg-0.024
lers-0.024
et-0.024
ish-0.024
Token dolls
Token ID36062
Feature activation-4.96e+00

Pos logit contributions
Kids+0.183
Listen+0.183
Sorry+0.183
Don+0.178
Dad+0.176
Neg logit contributions
et-0.066
an-0.063
ish-0.062
lers-0.061
ig-0.059
Token to
Token ID284
Feature activation-1.37e+01

Pos logit contributions
Kids+0.089
Listen+0.088
Sorry+0.086
Don+0.085
Dad+0.085
Neg logit contributions
et-0.112
an-0.109
ig-0.106
ish-0.105
lers-0.105
Token play
Token ID711
Feature activation-8.76e+00

Pos logit contributions
Sorry+0.118
Kids+0.118
Listen+0.117
Don+0.112
Dad+0.112
Neg logit contributions
et-0.149
lers-0.145
an-0.145
ish-0.144
ig-0.141
Token with
Token ID351
Feature activation-8.73e+00

Pos logit contributions
Sorry+0.104
Kids+0.103
Listen+0.103
Sure+0.103
Don+0.101
Neg logit contributions
et-0.067
an-0.065
lers-0.062
ish-0.062
 egg-0.062
Token his
Token ID465
Feature activation-8.00e+00

Pos logit contributions
Dad+0.066
His+0.065
Jack+0.063
Exc+0.060
Kids+0.060
Neg logit contributions
et-0.106
an-0.105
ig-0.102
ish-0.100
lers-0.096
Token drum
Token ID13026
Feature activation+1.63e+00

Pos logit contributions
Sorry+0.094
Dad+0.093
Kids+0.093
Sure+0.091
Listen+0.091
Neg logit contributions
et-0.064
an-0.057
cks-0.057
lers-0.054
ig-0.054
Token.
Token ID13
Feature activation-2.15e+01

Pos logit contributions
an+0.013
cks+0.012
 egg+0.012
et+0.012
ish+0.012
Neg logit contributions
Sure-0.018
Sorry-0.018
Listen-0.018
Kids-0.018
Don-0.018
Token He
Token ID679
Feature activation-1.23e+01

Pos logit contributions
Sorry+0.131
Kids+0.122
Listen+0.119
Um+0.109
Don+0.108
Neg logit contributions
et-0.311
ish-0.297
an-0.291
ig-0.285
 egg-0.282
Token has
Token ID468
Feature activation-1.35e+01

Pos logit contributions
Kids+0.123
Listen+0.122
Sorry+0.120
Don+0.119
Sure+0.115
Neg logit contributions
an-0.116
et-0.116
ish-0.114
 egg-0.114
lers-0.112
Token a
Token ID257
Feature activation-6.39e+00

Pos logit contributions
Listen+0.128
Kids+0.126
Sorry+0.125
Don+0.123
Dad+0.121
Neg logit contributions
ish-0.139
an-0.139
et-0.138
lers-0.133
ig-0.133
Token big
Token ID1263
Feature activation-1.67e+00

Pos logit contributions
Sorry+0.082
Listen+0.081
Kids+0.081
Sure+0.079
Don+0.079
Neg logit contributions
et-0.045
an-0.042
lers-0.041
ish-0.041
cks-0.040
Token drum
Token ID13026
Feature activation+2.96e-01

Pos logit contributions
Sorry+0.020
Sure+0.020
Hi+0.020
Listen+0.020
Kids+0.020
Neg logit contributions
cks-0.013
an-0.012
et-0.012
lers-0.011
ates-0.011
Token to
Token ID284
Feature activation-1.34e+01

Pos logit contributions
Kids+0.086
Listen+0.086
Sorry+0.084
Dad+0.083
Don+0.082
Neg logit contributions
et-0.107
an-0.103
ig-0.102
ish-0.101
lers-0.100
Token play
Token ID711
Feature activation-8.69e+00

Pos logit contributions
Sorry+0.123
Kids+0.122
Listen+0.122
Don+0.117
Sure+0.114
Neg logit contributions
et-0.141
an-0.135
lers-0.135
ish-0.135
ig-0.132
Token with
Token ID351
Feature activation-9.03e+00

Pos logit contributions
Sorry+0.105
Listen+0.104
Kids+0.103
Sure+0.103
Dad+0.101
Neg logit contributions
et-0.065
an-0.062
lers-0.061
ish-0.061
 egg-0.060
Token his
Token ID465
Feature activation-8.07e+00

Pos logit contributions
Dad+0.054
His+0.052
Jack+0.049
Listen+0.047
Kids+0.047
Neg logit contributions
et-0.124
an-0.122
ig-0.121
ish-0.119
lers-0.116
Token lizard
Token ID42406
Feature activation+5.74e-01

Pos logit contributions
Kids+0.098
Dad+0.098
Sorry+0.098
Listen+0.096
Sure+0.096
Neg logit contributions
et-0.060
cks-0.053
an-0.053
ig-0.051
ish-0.051
Token.
Token ID13
Feature activation-2.15e+01

Pos logit contributions
et+0.005
cks+0.005
an+0.004
ps+0.004
opl+0.004
Neg logit contributions
Sure-0.006
His-0.006
Sorry-0.006
He-0.006
Exc-0.006
Token His
Token ID2399
Feature activation-1.09e+01

Pos logit contributions
Sorry+0.120
Kids+0.118
Listen+0.115
Don+0.106
Dad+0.101
Neg logit contributions
et-0.311
ish-0.299
an-0.297
lers-0.294
 egg-0.293
Token lizard
Token ID42406
Feature activation-2.64e+00

Pos logit contributions
Sorry+0.126
Kids+0.125
Listen+0.125
Don+0.121
Sure+0.120
Neg logit contributions
et-0.089
lers-0.084
an-0.083
ish-0.083
ig-0.081
Token is
Token ID318
Feature activation-1.04e+01

Pos logit contributions
Listen+0.027
Kids+0.027
Don+0.027
Hi+0.026
Sorry+0.026
Neg logit contributions
et-0.023
an-0.022
ish-0.021
 egg-0.021
ates-0.020
Token red
Token ID2266
Feature activation-5.70e-01

Pos logit contributions
Listen+0.102
Don+0.101
He+0.100
His+0.100
Kids+0.098
Neg logit contributions
ish-0.103
an-0.100
ig-0.099
et-0.099
lers-0.098
Token and
Token ID290
Feature activation-1.24e+01

Pos logit contributions
He+0.004
Hi+0.004
Sure+0.004
Dad+0.004
His+0.004
Neg logit contributions
cks-0.007
et-0.006
an-0.006
igs-0.006
ig-0.006
Token Can
Token ID1680
Feature activation-1.53e+01

Pos logit contributions
Sorry+0.138
Listen+0.132
Kids+0.132
Dad+0.129
Don+0.123
Neg logit contributions
et-0.190
ish-0.187
lers-0.183
an-0.182
ig-0.182
Token you
Token ID345
Feature activation-1.18e+01

Pos logit contributions
Kids+0.180
Listen+0.177
Sorry+0.174
Dad+0.171
Don+0.167
Neg logit contributions
lers-0.101
ig-0.100
 matches-0.098
et-0.098
ish-0.097
Token spin
Token ID7906
Feature activation-4.73e+00

Pos logit contributions
Kids+0.122
Sorry+0.120
Listen+0.120
Don+0.116
Dad+0.114
Neg logit contributions
et-0.113
ig-0.108
ish-0.107
an-0.104
lers-0.104
Token for
Token ID329
Feature activation-1.15e+01

Pos logit contributions
Sure+0.051
Listen+0.051
Hi+0.050
Sorry+0.050
Kids+0.049
Neg logit contributions
an-0.041
 egg-0.040
et-0.039
ish-0.038
ery-0.037
Token me
Token ID502
Feature activation-6.38e+00

Pos logit contributions
Listen+0.123
Sorry+0.123
Kids+0.123
Dad+0.117
Sure+0.117
Neg logit contributions
et-0.105
an-0.102
ish-0.102
lers-0.100
ig-0.097
Token?"
Token ID1701
Feature activation-2.15e+01

Pos logit contributions
Sure+0.066
Listen+0.065
Sorry+0.065
Kids+0.063
Hi+0.063
Neg logit contributions
an-0.057
et-0.056
lers-0.056
 egg-0.055
cks-0.055
Token Spot
Token ID15899
Feature activation-8.81e+00

Pos logit contributions
Sorry+0.192
Listen+0.189
Kids+0.186
Dad+0.176
Um+0.175
Neg logit contributions
et-0.225
lers-0.219
ish-0.217
 egg-0.214
an-0.210
Token spun
Token ID26843
Feature activation-9.49e+00

Pos logit contributions
Sorry+0.078
Listen+0.077
Kids+0.076
Don+0.073
Dad+0.072
Neg logit contributions
et-0.097
an-0.093
ish-0.093
lers-0.092
ig-0.090
Token around
Token ID1088
Feature activation-1.03e+01

Pos logit contributions
Sorry+0.143
Kids+0.142
Listen+0.141
Dad+0.136
Um+0.135
Neg logit contributions
et-0.049
lers-0.043
an-0.041
ish-0.041
ig-0.038
Token and
Token ID290
Feature activation-1.42e+01

Pos logit contributions
Sorry+0.127
Kids+0.127
Listen+0.126
Don+0.123
Dad+0.121
Neg logit contributions
et-0.077
an-0.072
lers-0.072
ish-0.072
ig-0.070
Token around
Token ID1088
Feature activation-1.07e+01

Pos logit contributions
Don+0.116
Sorry+0.116
Kids+0.115
Sure+0.115
Listen+0.115
Neg logit contributions
an-0.163
 egg-0.160
et-0.159
ish-0.158
lers-0.156
Token like
Token ID588
Feature activation-1.17e+01

Pos logit contributions
Sorry+0.075
Listen+0.074
Kids+0.074
Don+0.070
Sure+0.069
Neg logit contributions
et-0.117
ish-0.113
lers-0.112
an-0.112
ig-0.109
Token to
Token ID284
Feature activation-1.21e+01

Pos logit contributions
Sorry+0.124
Kids+0.122
Listen+0.122
Don+0.118
Sure+0.116
Neg logit contributions
et-0.107
lers-0.102
ish-0.102
an-0.102
 egg-0.100
Token play
Token ID711
Feature activation-8.67e+00

Pos logit contributions
Sorry+0.098
Kids+0.097
Listen+0.097
Don+0.092
Sure+0.090
Neg logit contributions
et-0.141
lers-0.136
ish-0.136
an-0.136
ig-0.133
Token with
Token ID351
Feature activation-8.51e+00

Pos logit contributions
Sorry+0.104
Sure+0.104
Kids+0.103
Listen+0.103
Don+0.101
Neg logit contributions
et-0.063
an-0.061
 egg-0.058
lers-0.058
cks-0.057
Token gears
Token ID28713
Feature activation+4.60e-01

Pos logit contributions
Dad+0.069
Kids+0.068
Jack+0.066
His+0.065
Listen+0.063
Neg logit contributions
an-0.097
et-0.096
ig-0.091
ish-0.091
ab-0.088
Token.
Token ID13
Feature activation-2.14e+01

Pos logit contributions
an+0.005
cks+0.005
 majesty+0.004
ig+0.004
et+0.004
Neg logit contributions
He-0.004
His-0.004
Sure-0.004
She-0.004
Dad-0.004
Token They
Token ID1119
Feature activation-1.34e+01

Pos logit contributions
Sorry+0.181
Listen+0.156
Um+0.154
Don+0.152
By+0.149
Neg logit contributions
et-0.262
ish-0.231
 matches-0.229
 v-0.228
ab-0.227
Token have
Token ID423
Feature activation-1.33e+01

Pos logit contributions
Sorry+0.126
Kids+0.123
Listen+0.123
Don+0.117
Sure+0.114
Neg logit contributions
et-0.144
ish-0.136
lers-0.135
an-0.135
ig-0.132
Token many
Token ID867
Feature activation-6.79e+00

Pos logit contributions
Sorry+0.147
Kids+0.145
Listen+0.145
Don+0.140
Sure+0.138
Neg logit contributions
et-0.118
lers-0.112
an-0.110
ish-0.110
ig-0.108
Token gears
Token ID28713
Feature activation+4.99e-01

Pos logit contributions
Kids+0.088
Listen+0.088
Sorry+0.088
Dad+0.086
Don+0.086
Neg logit contributions
et-0.043
an-0.041
 egg-0.041
ish-0.041
lers-0.038
Token of
Token ID286
Feature activation-7.79e+00

Pos logit contributions
an+0.005
 egg+0.005
cks+0.005
et+0.005
ish+0.005
Neg logit contributions
He-0.004
Kids-0.004
Listen-0.004
Sure-0.004
Sorry-0.004
Token play
Token ID711
Feature activation-8.63e+00

Pos logit contributions
Sorry+0.111
Kids+0.110
Listen+0.110
Don+0.105
Sure+0.102
Neg logit contributions
et-0.149
ish-0.143
lers-0.143
an-0.143
ig-0.140
Token golf
Token ID13126
Feature activation+4.66e-01

Pos logit contributions
Sorry+0.104
Sure+0.103
Kids+0.103
Listen+0.102
Dad+0.100
Neg logit contributions
et-0.064
an-0.061
lers-0.059
ish-0.059
 egg-0.058
Token in
Token ID287
Feature activation-9.07e+00

Pos logit contributions
 egg+0.004
an+0.004
et+0.004
ish+0.004
cks+0.003
Neg logit contributions
Sure-0.005
Sorry-0.005
Kids-0.005
Exc-0.005
Don-0.005
Token the
Token ID262
Feature activation-7.03e+00

Pos logit contributions
Kids+0.098
Listen+0.097
Sorry+0.097
Dad+0.094
Don+0.094
Neg logit contributions
et-0.081
lers-0.078
an-0.078
ish-0.078
ig-0.076
Token park
Token ID3952
Feature activation-5.09e+00

Pos logit contributions
Kids+0.081
Sorry+0.080
Listen+0.080
Dad+0.078
Don+0.077
Neg logit contributions
et-0.058
lers-0.055
an-0.054
ish-0.054
ig-0.053
Token.
Token ID13
Feature activation-2.14e+01

Pos logit contributions
Sure+0.049
Dad+0.049
On+0.049
Sorry+0.049
His+0.048
Neg logit contributions
ish-0.046
ig-0.046
lers-0.045
an-0.045
 egg-0.044
Token They
Token ID1119
Feature activation-1.39e+01

Pos logit contributions
Sorry+0.181
Listen+0.153
Um+0.150
Kids+0.149
Don+0.148
Neg logit contributions
et-0.279
ish-0.245
an-0.236
 matches-0.234
cks-0.233
Token had
Token ID550
Feature activation-1.32e+01

Pos logit contributions
Sorry+0.108
Kids+0.107
Listen+0.107
Don+0.102
Dad+0.100
Neg logit contributions
et-0.167
an-0.161
lers-0.161
ish-0.161
ig-0.157
Token a
Token ID257
Feature activation-7.33e+00

Pos logit contributions
Listen+0.128
Kids+0.127
Sorry+0.126
Don+0.124
Dad+0.121
Neg logit contributions
et-0.132
ish-0.129
an-0.129
lers-0.128
ig-0.125
Token small
Token ID1402
Feature activation-2.93e+00

Pos logit contributions
Sorry+0.092
Kids+0.092
Listen+0.091
Don+0.089
Dad+0.088
Neg logit contributions
et-0.053
lers-0.050
an-0.049
ish-0.049
ig-0.047
Token stick
Token ID4859
Feature activation+1.08e+00

Pos logit contributions
Sorry+0.035
Kids+0.034
Listen+0.034
Dad+0.034
Sure+0.034
Neg logit contributions
et-0.022
cks-0.021
an-0.021
lers-0.021
ps-0.020
Token liked
Token ID8288
Feature activation-1.10e+01

Pos logit contributions
Kids+0.074
Listen+0.073
Sorry+0.072
Don+0.071
Dad+0.071
Neg logit contributions
lers-0.088
et-0.087
an-0.087
 egg-0.086
ish-0.086
Token to
Token ID284
Feature activation-1.31e+01

Pos logit contributions
Listen+0.094
Kids+0.092
Sorry+0.090
Don+0.088
Dad+0.088
Neg logit contributions
et-0.128
an-0.125
ig-0.122
ish-0.121
lers-0.120
Token play
Token ID711
Feature activation-8.91e+00

Pos logit contributions
Sorry+0.105
Kids+0.104
Listen+0.104
Don+0.099
Dad+0.097
Neg logit contributions
et-0.154
an-0.148
lers-0.148
ish-0.148
ig-0.145
Token with
Token ID351
Feature activation-8.70e+00

Pos logit contributions
Sorry+0.108
Sure+0.108
Kids+0.106
Listen+0.106
Dad+0.104
Neg logit contributions
et-0.064
an-0.062
lers-0.059
ish-0.059
 egg-0.059
Token snow
Token ID6729
Feature activation+5.01e-01

Pos logit contributions
Dad+0.075
Kids+0.072
Jack+0.071
Sorry+0.068
Listen+0.067
Neg logit contributions
an-0.094
et-0.091
ish-0.089
ig-0.089
hole-0.088
Token.
Token ID13
Feature activation-2.13e+01

Pos logit contributions
hole+0.004
ool+0.004
ig+0.004
 cob+0.004
opl+0.004
Neg logit contributions
Sorry-0.005
They-0.005
He-0.004
Kids-0.004
Dad-0.004
Token They
Token ID1119
Feature activation-1.38e+01

Pos logit contributions
Sorry+0.157
Listen+0.144
Kids+0.140
Don+0.135
By+0.131
Neg logit contributions
et-0.290
ish-0.272
an-0.263
 egg-0.263
ig-0.261
Token made
Token ID925
Feature activation-1.19e+01

Pos logit contributions
Kids+0.095
Sorry+0.093
Listen+0.092
Dad+0.092
Don+0.091
Neg logit contributions
an-0.171
et-0.170
ish-0.168
lers-0.167
ig-0.165
Token a
Token ID257
Feature activation-7.30e+00

Pos logit contributions
Listen+0.115
Dad+0.114
When+0.112
Don+0.112
Sorry+0.112
Neg logit contributions
et-0.109
ish-0.109
ig-0.106
lers-0.106
pp-0.105
Token big
Token ID1263
Feature activation-3.56e+00

Pos logit contributions
Sorry+0.081
Dad+0.079
Sure+0.078
Listen+0.078
Kids+0.078
Neg logit contributions
et-0.064
lers-0.059
ish-0.058
an-0.057
cks-0.057
Token snow
Token ID6729
Feature activation+2.27e+00

Pos logit contributions
Dad+0.034
Sure+0.033
He+0.032
Hi+0.032
No+0.031
Neg logit contributions
aces-0.033
cks-0.033
ps-0.033
ates-0.032
pp-0.031
Token playing
Token ID2712
Feature activation-9.63e+00

Pos logit contributions
Kids+0.143
Sorry+0.143
Listen+0.143
Dad+0.140
Don+0.139
Neg logit contributions
et-0.092
an-0.089
lers-0.083
ish-0.083
 egg-0.080
Token with
Token ID351
Feature activation-9.24e+00

Pos logit contributions
Sure+0.116
Sorry+0.115
Listen+0.113
Kids+0.112
Don+0.111
Neg logit contributions
et-0.071
an-0.070
lers-0.068
 egg-0.067
ish-0.067
Token paper
Token ID3348
Feature activation+1.27e+00

Pos logit contributions
Kids+0.067
Sorry+0.065
Listen+0.065
Dad+0.065
Sure+0.062
Neg logit contributions
et-0.115
an-0.115
ish-0.111
ig-0.110
lers-0.110
Token and
Token ID290
Feature activation-1.19e+01

Pos logit contributions
opl+0.013
cks+0.012
et+0.012
an+0.012
ab+0.012
Neg logit contributions
So-0.010
On-0.010
They-0.010
When-0.010
He-0.009
Token scissors
Token ID41786
Feature activation-1.15e+00

Pos logit contributions
Sorry+0.139
Kids+0.138
Dad+0.138
Listen+0.138
Don+0.134
Neg logit contributions
et-0.097
an-0.097
ish-0.092
lers-0.090
ig-0.089
Token.
Token ID13
Feature activation-2.13e+01

Pos logit contributions
He+0.010
His+0.010
Don+0.010
When+0.010
Sorry+0.010
Neg logit contributions
an-0.010
 egg-0.010
ish-0.010
ig-0.010
igs-0.010
Token They
Token ID1119
Feature activation-1.36e+01

Pos logit contributions
Sorry+0.209
Listen+0.197
Kids+0.193
Don+0.190
By+0.184
Neg logit contributions
et-0.207
 matches-0.197
 up-0.192
 oven-0.191
cks-0.188
Token liked
Token ID8288
Feature activation-1.19e+01

Pos logit contributions
Sorry+0.106
Kids+0.104
Listen+0.103
Don+0.101
Dad+0.100
Neg logit contributions
et-0.160
an-0.158
lers-0.156
ish-0.155
ig-0.153
Token to
Token ID284
Feature activation-1.35e+01

Pos logit contributions
Listen+0.086
Dad+0.081
Kids+0.081
Don+0.080
Sure+0.078
Neg logit contributions
an-0.152
et-0.147
ish-0.144
ig-0.142
lers-0.140
Token cut
Token ID2005
Feature activation-4.75e+00

Pos logit contributions
Sorry+0.130
Listen+0.128
Kids+0.128
Dad+0.127
Sure+0.126
Neg logit contributions
lers-0.129
et-0.128
an-0.127
ish-0.126
ig-0.123
Token out
Token ID503
Feature activation-5.03e+00

Pos logit contributions
When+0.042
Dad+0.042
 Something+0.041
What+0.041
He+0.040
Neg logit contributions
an-0.044
asis-0.043
opl-0.041
eh-0.041
ig-0.040
TokenTom
Token ID13787
Feature activation-8.95e+00

Pos logit contributions
Kids+0.092
Sorry+0.091
Listen+0.091
Don+0.087
Dad+0.085
Neg logit contributions
et-0.127
an-0.122
ish-0.121
lers-0.121
 egg-0.119
Token and
Token ID290
Feature activation-1.41e+01

Pos logit contributions
Kids+0.099
Listen+0.097
Dad+0.095
Hi+0.095
Sorry+0.094
Neg logit contributions
lers-0.075
an-0.073
 egg-0.072
et-0.071
cks-0.069
Token Sam
Token ID3409
Feature activation-8.32e+00

Pos logit contributions
Sorry+0.156
Kids+0.148
Listen+0.145
Um+0.141
Don+0.141
Neg logit contributions
et-0.120
ish-0.119
 egg-0.117
lers-0.117
an-0.112
Token are
Token ID389
Feature activation-1.16e+01

Pos logit contributions
Sorry+0.077
Kids+0.076
Listen+0.076
Don+0.073
Sure+0.072
Neg logit contributions
et-0.087
ish-0.084
an-0.083
lers-0.083
ig-0.081
Token brothers
Token ID9397
Feature activation-8.19e+00

Pos logit contributions
Sorry+0.149
Kids+0.148
Listen+0.148
Don+0.143
Dad+0.142
Neg logit contributions
et-0.081
an-0.076
lers-0.076
ish-0.076
ig-0.073
Token.
Token ID13
Feature activation-2.13e+01

Pos logit contributions
Sorry+0.091
Kids+0.091
Dad+0.091
Sure+0.089
Listen+0.089
Neg logit contributions
et-0.064
an-0.064
ish-0.062
ig-0.061
 egg-0.061
Token They
Token ID1119
Feature activation-1.36e+01

Pos logit contributions
Sorry+0.199
Don+0.186
Listen+0.184
Enough+0.175
By+0.171
Neg logit contributions
et-0.228
c-0.197
 red-0.194
ish-0.191
 block-0.190
Token like
Token ID588
Feature activation-1.20e+01

Pos logit contributions
Sorry+0.135
Kids+0.134
Listen+0.134
Don+0.127
Um+0.125
Neg logit contributions
et-0.136
ish-0.129
an-0.127
lers-0.126
ig-0.123
Token to
Token ID284
Feature activation-1.20e+01

Pos logit contributions
Sorry+0.128
Kids+0.126
Listen+0.125
Don+0.121
Sure+0.119
Neg logit contributions
et-0.108
lers-0.103
ish-0.103
an-0.102
 egg-0.101
Token play
Token ID711
Feature activation-8.55e+00

Pos logit contributions
Sorry+0.105
Kids+0.104
Listen+0.103
Don+0.102
Um+0.095
Neg logit contributions
et-0.134
ish-0.128
lers-0.127
an-0.127
ig-0.125
Token with
Token ID351
Feature activation-8.66e+00

Pos logit contributions
Sorry+0.104
Sure+0.102
Kids+0.102
Listen+0.102
Dad+0.100
Neg logit contributions
et-0.062
an-0.059
 egg-0.058
ish-0.057
lers-0.057
Token jump
Token ID4391
Feature activation-5.36e+00

Pos logit contributions
Sorry+0.125
Kids+0.123
Listen+0.123
Don+0.118
Dad+0.114
Neg logit contributions
et-0.150
lers-0.144
ish-0.144
an-0.143
 egg-0.141
Token in
Token ID287
Feature activation-8.09e+00

Pos logit contributions
Exc+0.058
Sure+0.058
His+0.057
Hi+0.056
Listen+0.056
Neg logit contributions
et-0.043
 egg-0.043
an-0.042
ish-0.042
ig-0.042
Token p
Token ID279
Feature activation-1.61e+00

Pos logit contributions
Dad+0.068
Kids+0.066
Listen+0.064
What+0.064
Exc+0.064
Neg logit contributions
an-0.093
ish-0.090
et-0.087
lers-0.087
pp-0.086
Tokenudd
Token ID4185
Feature activation-2.23e+00

Pos logit contributions
Sorry+0.020
Kids+0.020
Listen+0.019
Sure+0.019
Dad+0.019
Neg logit contributions
et-0.012
 egg-0.011
an-0.011
lers-0.011
ish-0.011
Tokenles
Token ID829
Feature activation-2.95e+00

Pos logit contributions
Kids+0.030
Dad+0.029
Exc+0.029
Listen+0.029
Sorry+0.028
Neg logit contributions
et-0.012
 egg-0.011
ates-0.010
ery-0.010
ig-0.010
Token.
Token ID13
Feature activation-2.13e+01

Pos logit contributions
His+0.029
Don+0.029
He+0.029
Her+0.029
Sorry+0.029
Neg logit contributions
ish-0.026
an-0.026
ig-0.025
et-0.025
 egg-0.024
Token She
Token ID1375
Feature activation-1.34e+01

Pos logit contributions
Sorry+0.153
Listen+0.137
Kids+0.137
By+0.130
Don+0.127
Neg logit contributions
et-0.282
ish-0.261
 matches-0.257
 egg-0.253
 v-0.251
Token wore
Token ID12408
Feature activation-1.01e+01

Pos logit contributions
Kids+0.102
Listen+0.098
Sorry+0.097
Don+0.095
Dad+0.092
Neg logit contributions
et-0.160
ish-0.160
an-0.159
lers-0.157
 egg-0.157
Token her
Token ID607
Feature activation-8.83e+00

Pos logit contributions
Listen+0.086
Kids+0.083
Sorry+0.083
Don+0.082
Hi+0.081
Neg logit contributions
an-0.114
ish-0.113
et-0.112
ig-0.109
lers-0.109
Token yellow
Token ID7872
Feature activation+3.37e+00

Pos logit contributions
Hi+0.103
Kids+0.102
Sorry+0.101
Listen+0.101
His+0.100
Neg logit contributions
et-0.062
an-0.060
cks-0.059
ish-0.059
ors-0.055
Token boots
Token ID14412
Feature activation+1.64e+00

Pos logit contributions
et+0.026
cks+0.025
an+0.024
ish+0.024
lers+0.023
Neg logit contributions
Sorry-0.039
Hi-0.039
Kids-0.039
Sure-0.038
Listen-0.038
Token<|endoftext|>
Token ID50256
Feature activation-1.27e+01

Pos logit contributions
Sorry+0.170
Kids+0.154
Listen+0.149
Um+0.148
Sure+0.147
Neg logit contributions
 egg-0.177
et-0.175
ates-0.174
!"-0.171
an-0.170
TokenTodd
Token ID42817
Feature activation-7.90e+00

Pos logit contributions
Sorry+0.081
Listen+0.076
Kids+0.075
Don+0.072
Sure+0.071
Neg logit contributions
et-0.169
lers-0.164
an-0.164
ish-0.162
ig-0.161
Token was
Token ID373
Feature activation-1.30e+01

Pos logit contributions
Kids+0.069
Dad+0.068
Listen+0.067
Sorry+0.066
Hi+0.064
Neg logit contributions
lers-0.078
et-0.078
cks-0.077
 egg-0.076
ig-0.076
Token so
Token ID523
Feature activation-1.32e+01

Pos logit contributions
Listen+0.120
Dad+0.120
Thanks+0.117
Don+0.117
Hi+0.115
Neg logit contributions
an-0.131
lers-0.128
et-0.126
ig-0.123
ish-0.123
Token excited
Token ID6568
Feature activation-8.56e+00

Pos logit contributions
Sorry+0.121
Listen+0.120
Kids+0.119
Don+0.115
Dad+0.114
Neg logit contributions
et-0.141
an-0.135
lers-0.135
ish-0.134
ig-0.131
Token!
Token ID0
Feature activation-2.12e+01

Pos logit contributions
Listen+0.080
Kids+0.079
Sorry+0.078
Jack+0.077
Dad+0.076
Neg logit contributions
et-0.085
ig-0.082
an-0.082
cks-0.080
ish-0.080
Token His
Token ID2399
Feature activation-1.13e+01

Pos logit contributions
Sorry+0.101
Kids+0.095
Listen+0.094
Don+0.087
Sure+0.083
Neg logit contributions
et-0.322
an-0.314
ish-0.312
 egg-0.309
lers-0.308
Token mom
Token ID1995
Feature activation-8.38e+00

Pos logit contributions
Sorry+0.120
Kids+0.119
Listen+0.118
Don+0.115
Sure+0.112
Neg logit contributions
et-0.105
an-0.101
ish-0.101
lers-0.100
 egg-0.098
Token had
Token ID550
Feature activation-1.43e+01

Pos logit contributions
Kids+0.071
Listen+0.069
Sorry+0.068
Dad+0.067
Don+0.066
Neg logit contributions
an-0.091
et-0.091
lers-0.091
ig-0.088
cks-0.088
Token just
Token ID655
Feature activation-9.13e+00

Pos logit contributions
Listen+0.122
Sorry+0.121
Kids+0.118
Dad+0.117
Don+0.115
Neg logit contributions
an-0.162
ish-0.160
et-0.159
lers-0.158
ig-0.156
Token told
Token ID1297
Feature activation-1.24e+01

Pos logit contributions
Listen+0.075
Sorry+0.074
Don+0.071
Kids+0.071
Dad+0.070
Neg logit contributions
an-0.106
ig-0.101
lers-0.100
ish-0.099
et-0.098
Token liked
Token ID8288
Feature activation-1.10e+01

Pos logit contributions
Kids+0.097
Listen+0.094
Don+0.093
Sorry+0.093
Hi+0.092
Neg logit contributions
et-0.092
lers-0.091
an-0.091
ish-0.090
cks-0.089
Token to
Token ID284
Feature activation-1.38e+01

Pos logit contributions
Listen+0.106
Kids+0.106
Sorry+0.105
Don+0.101
Dad+0.101
Neg logit contributions
et-0.113
an-0.110
ish-0.108
lers-0.107
ig-0.107
Token learn
Token ID2193
Feature activation-9.86e+00

Pos logit contributions
Sorry+0.125
Kids+0.124
Listen+0.123
Don+0.119
Sure+0.115
Neg logit contributions
et-0.147
lers-0.140
ish-0.140
an-0.140
 egg-0.138
Token new
Token ID649
Feature activation-4.49e+00

Pos logit contributions
Sure+0.101
Listen+0.101
Kids+0.099
Exc+0.099
Sorry+0.097
Neg logit contributions
et-0.093
an-0.092
ish-0.089
ig-0.086
lers-0.085
Token words
Token ID2456
Feature activation-2.39e+00

Pos logit contributions
Kids+0.060
Listen+0.059
Sorry+0.058
Sure+0.058
Don+0.057
Neg logit contributions
et-0.029
an-0.027
ish-0.026
ig-0.026
 egg-0.025
Token.
Token ID13
Feature activation-2.12e+01

Pos logit contributions
Listen+0.026
Sure+0.026
Kids+0.026
Dad+0.026
Sorry+0.026
Neg logit contributions
an-0.020
et-0.019
ish-0.019
lers-0.019
ig-0.019
Token She
Token ID1375
Feature activation-1.34e+01

Pos logit contributions
Sorry+0.163
Listen+0.145
Kids+0.144
By+0.139
Don+0.134
Neg logit contributions
et-0.272
ish-0.255
 matches-0.251
 ornament-0.245
c-0.244
Token had
Token ID550
Feature activation-1.34e+01

Pos logit contributions
Sorry+0.121
Listen+0.119
Kids+0.118
Don+0.113
Sure+0.111
Neg logit contributions
et-0.146
an-0.135
lers-0.135
ish-0.135
 egg-0.133
Token a
Token ID257
Feature activation-6.91e+00

Pos logit contributions
Listen+0.128
Kids+0.127
Sorry+0.125
Don+0.123
Dad+0.121
Neg logit contributions
an-0.138
et-0.137
ish-0.136
lers-0.132
ig-0.132
Token book
Token ID1492
Feature activation-1.64e+00

Pos logit contributions
Sorry+0.079
Sure+0.078
Listen+0.078
Kids+0.077
Dad+0.076
Neg logit contributions
et-0.059
cks-0.056
an-0.055
lers-0.054
ish-0.052
Token with
Token ID351
Feature activation-8.09e+00

Pos logit contributions
Sorry+0.017
Listen+0.017
Kids+0.017
Sure+0.017
Don+0.017
Neg logit contributions
et-0.015
an-0.014
ish-0.014
lers-0.014
ig-0.014
Token what
Token ID644
Feature activation-9.18e+00

Pos logit contributions
When+0.043
Sure+0.041
Dad+0.040
Listen+0.039
He+0.039
Neg logit contributions
an-0.105
et-0.104
lers-0.102
ig-0.099
cks-0.098
Token that
Token ID326
Feature activation-7.63e+00

Pos logit contributions
Listen+0.086
Don+0.081
Dad+0.080
Kids+0.080
Sorry+0.078
Neg logit contributions
et-0.101
ish-0.096
an-0.095
ig-0.093
ps-0.092
Token land
Token ID1956
Feature activation-5.77e-01

Pos logit contributions
Listen+0.071
Don+0.069
Hi+0.068
Come+0.067
 Something+0.066
Neg logit contributions
et-0.074
lers-0.072
igs-0.072
ates-0.071
ishing-0.071
Token is
Token ID318
Feature activation-7.15e+00

Pos logit contributions
Listen+0.005
Hi+0.005
Don+0.005
He+0.005
Sorry+0.005
Neg logit contributions
et-0.005
lers-0.005
ig-0.005
an-0.005
ish-0.005
Token like
Token ID588
Feature activation-7.46e+00

Pos logit contributions
When+0.074
Listen+0.071
On+0.071
By+0.069
Don+0.069
Neg logit contributions
lers-0.066
ig-0.064
ish-0.063
et-0.062
igs-0.058
Token?".
Token ID43634
Feature activation-2.12e+01

Pos logit contributions
Dad+0.073
Listen+0.072
By+0.072
Hi+0.071
When+0.071
Neg logit contributions
ish-0.073
et-0.071
lers-0.071
cks-0.070
an-0.069
Token
Token ID198
Feature activation-2.01e+01

Pos logit contributions
Kids+0.170
Sorry+0.170
Listen+0.155
Um+0.145
By+0.142
Neg logit contributions
et-0.247
 egg-0.234
lers-0.231
 up-0.231
an-0.230
Token
Token ID198
Feature activation-2.04e+01

Pos logit contributions
Sorry+0.166
Kids+0.153
Listen+0.151
Um+0.148
Don+0.143
Neg logit contributions
et-0.215
an-0.215
ig-0.210
!"-0.208
 up-0.206
TokenJohn
Token ID7554
Feature activation-9.34e+00

Pos logit contributions
Kids+0.117
Sorry+0.110
Listen+0.105
Um+0.101
Sure+0.093
Neg logit contributions
et-0.276
an-0.267
!"-0.254
 egg-0.254
ep-0.253
Token began
Token ID2540
Feature activation-1.04e+01

Pos logit contributions
Sorry+0.072
Kids+0.071
Listen+0.071
Don+0.068
Dad+0.067
Neg logit contributions
et-0.112
lers-0.108
an-0.108
ish-0.107
ig-0.105
Token to
Token ID284
Feature activation-1.37e+01

Pos logit contributions
Listen+0.104
Sorry+0.104
Kids+0.104
Don+0.100
Dad+0.098
Neg logit contributions
et-0.102
ish-0.098
an-0.098
lers-0.098
ig-0.095
Token to
Token ID284
Feature activation-1.34e+01

Pos logit contributions
Kids+0.088
Listen+0.088
Sorry+0.086
Dad+0.084
Don+0.084
Neg logit contributions
et-0.100
an-0.096
lers-0.094
ig-0.094
ish-0.094
Token swim
Token ID9422
Feature activation-4.72e+00

Pos logit contributions
Sorry+0.115
Kids+0.114
Listen+0.114
Don+0.108
Dad+0.107
Neg logit contributions
et-0.149
an-0.144
lers-0.144
ish-0.143
ig-0.140
Token in
Token ID287
Feature activation-9.29e+00

Pos logit contributions
Sorry+0.061
Kids+0.060
Listen+0.060
Sure+0.058
Don+0.058
Neg logit contributions
et-0.032
lers-0.030
an-0.030
 egg-0.030
ish-0.029
Token the
Token ID262
Feature activation-6.95e+00

Pos logit contributions
Kids+0.091
Listen+0.091
Dad+0.088
Don+0.088
Sorry+0.087
Neg logit contributions
et-0.092
an-0.092
ish-0.091
lers-0.090
ig-0.087
Token pool
Token ID5933
Feature activation-7.73e-01

Pos logit contributions
Kids+0.076
Listen+0.076
Sorry+0.075
Hi+0.075
Dad+0.074
Neg logit contributions
et-0.061
lers-0.057
an-0.057
ish-0.056
ig-0.056
Token.
Token ID13
Feature activation-2.12e+01

Pos logit contributions
Dad+0.008
Sorry+0.008
Sure+0.008
Listen+0.008
Kids+0.008
Neg logit contributions
lers-0.006
ig-0.006
ish-0.006
et-0.006
an-0.006
Token She
Token ID1375
Feature activation-1.33e+01

Pos logit contributions
Sorry+0.143
Listen+0.128
Kids+0.128
By+0.118
Um+0.117
Neg logit contributions
et-0.284
ish-0.276
 egg-0.267
 matches-0.267
ig-0.265
Token has
Token ID468
Feature activation-1.32e+01

Pos logit contributions
Kids+0.142
Listen+0.138
Don+0.134
Sorry+0.133
Dad+0.130
Neg logit contributions
ish-0.120
et-0.119
 egg-0.118
an-0.117
lers-0.115
Token a
Token ID257
Feature activation-6.56e+00

Pos logit contributions
Sorry+0.133
Kids+0.133
Listen+0.133
Don+0.128
Dad+0.126
Neg logit contributions
et-0.128
an-0.123
ish-0.123
lers-0.122
ig-0.119
Token pink
Token ID11398
Feature activation+7.77e-01

Pos logit contributions
Sorry+0.079
Kids+0.078
Listen+0.077
Sure+0.076
Dad+0.076
Neg logit contributions
et-0.053
cks-0.048
lers-0.048
an-0.048
ish-0.046
Token cup
Token ID6508
Feature activation+1.26e+00

Pos logit contributions
et+0.006
cks+0.006
lers+0.006
an+0.005
ish+0.005
Neg logit contributions
Sorry-0.009
Kids-0.009
Sure-0.009
Hi-0.009
Dad-0.009
Token like
Token ID588
Feature activation-1.17e+01

Pos logit contributions
Kids+0.076
Listen+0.075
Sorry+0.075
Don+0.073
Dad+0.072
Neg logit contributions
et-0.084
an-0.084
lers-0.084
 egg-0.083
ig-0.082
Token to
Token ID284
Feature activation-1.20e+01

Pos logit contributions
Sorry+0.130
Kids+0.127
Listen+0.127
Don+0.123
Sure+0.121
Neg logit contributions
et-0.099
lers-0.095
ish-0.094
 egg-0.094
an-0.094
Token play
Token ID711
Feature activation-8.90e+00

Pos logit contributions
Sorry+0.098
Kids+0.097
Listen+0.097
Don+0.093
Sure+0.090
Neg logit contributions
et-0.140
ish-0.135
lers-0.134
an-0.134
 egg-0.132
Token with
Token ID351
Feature activation-8.64e+00

Pos logit contributions
Sorry+0.112
Kids+0.110
Listen+0.110
Sure+0.109
Dad+0.108
Neg logit contributions
et-0.062
an-0.059
lers-0.058
ish-0.057
 egg-0.057
Token cars
Token ID5006
Feature activation-1.73e+00

Pos logit contributions
Dad+0.061
Kids+0.060
Jack+0.057
His+0.056
Sorry+0.055
Neg logit contributions
an-0.107
et-0.106
ish-0.104
ig-0.103
ab-0.099
Token.
Token ID13
Feature activation-2.12e+01

Pos logit contributions
He+0.016
Dad+0.015
Sorry+0.015
Kids+0.015
His+0.015
Neg logit contributions
 egg-0.017
ig-0.016
an-0.016
ish-0.016
iflower-0.016
Token They
Token ID1119
Feature activation-1.35e+01

Pos logit contributions
Sorry+0.165
Listen+0.148
Kids+0.148
Don+0.145
By+0.139
Neg logit contributions
et-0.270
 matches-0.242
cks-0.242
 egg-0.238
 oven-0.237
Token have
Token ID423
Feature activation-1.33e+01

Pos logit contributions
Kids+0.095
Listen+0.095
Sorry+0.094
Don+0.091
Dad+0.090
Neg logit contributions
an-0.165
et-0.165
lers-0.165
ish-0.163
 egg-0.162
Token many
Token ID867
Feature activation-6.97e+00

Pos logit contributions
Listen+0.130
Kids+0.130
Sorry+0.127
Don+0.124
Dad+0.123
Neg logit contributions
ish-0.132
an-0.132
et-0.132
lers-0.128
ig-0.127
Token cars
Token ID5006
Feature activation-1.95e+00

Pos logit contributions
Kids+0.079
Exc+0.077
Sorry+0.077
Listen+0.076
His+0.076
Neg logit contributions
ish-0.052
 egg-0.052
an-0.051
pp-0.051
et-0.050
Token of
Token ID286
Feature activation-8.07e+00

Pos logit contributions
Kids+0.017
He+0.017
Sorry+0.017
Dad+0.017
Sure+0.017
Neg logit contributions
 egg-0.020
an-0.019
ish-0.019
iflower-0.019
lers-0.018
Token are
Token ID389
Feature activation-1.05e+01

Pos logit contributions
Sorry+0.086
Listen+0.084
Kids+0.084
Um+0.081
Sure+0.080
Neg logit contributions
et-0.076
ish-0.072
an-0.070
lers-0.068
 egg-0.067
Token playing
Token ID2712
Feature activation-8.53e+00

Pos logit contributions
Sorry+0.133
Kids+0.133
Listen+0.132
Dad+0.128
Don+0.128
Neg logit contributions
et-0.076
an-0.072
lers-0.070
ish-0.069
 egg-0.067
Token with
Token ID351
Feature activation-8.59e+00

Pos logit contributions
Sorry+0.105
Sure+0.103
Listen+0.100
Kids+0.100
Jack+0.100
Neg logit contributions
et-0.061
an-0.059
lers-0.057
 egg-0.056
cks-0.056
Token paper
Token ID3348
Feature activation+1.75e+00

Pos logit contributions
Kids+0.073
Listen+0.071
Dad+0.071
Sorry+0.071
Don+0.068
Neg logit contributions
et-0.097
an-0.096
ish-0.092
ig-0.092
lers-0.091
Token planes
Token ID13016
Feature activation+1.31e+00

Pos logit contributions
opl+0.016
et+0.016
cks+0.016
an+0.015
ab+0.015
Neg logit contributions
Dad-0.014
So-0.014
He-0.014
Exc-0.014
His-0.014
Token.
Token ID13
Feature activation-2.12e+01

Pos logit contributions
an+0.013
 majesty+0.013
 egg+0.013
igs+0.013
ig+0.013
Neg logit contributions
He-0.010
On-0.010
By-0.010
They-0.010
His-0.009
Token They
Token ID1119
Feature activation-1.34e+01

Pos logit contributions
Sorry+0.190
Listen+0.178
Don+0.167
By+0.165
Um+0.163
Neg logit contributions
et-0.237
ish-0.232
 matches-0.227
 egg-0.220
an-0.217
Token make
Token ID787
Feature activation-9.68e+00

Pos logit contributions
Sorry+0.096
Kids+0.095
Dad+0.095
Listen+0.095
Don+0.094
Neg logit contributions
an-0.159
lers-0.158
et-0.157
 egg-0.156
ig-0.155
Token them
Token ID606
Feature activation-5.27e+00

Pos logit contributions
When+0.078
Dad+0.075
Listen+0.074
They+0.074
Everyone+0.073
Neg logit contributions
ish-0.105
ig-0.103
lers-0.097
et-0.097
an-0.094
Token from
Token ID422
Feature activation-8.24e+00

Pos logit contributions
Sure+0.045
Sorry+0.045
He+0.045
Listen+0.044
When+0.044
Neg logit contributions
et-0.052
lers-0.051
an-0.050
ish-0.050
ig-0.049
Token old
Token ID1468
Feature activation-4.57e+00

Pos logit contributions
Dad+0.064
What+0.063
Listen+0.063
Kids+0.060
Don+0.059
Neg logit contributions
an-0.095
ish-0.093
ig-0.090
lers-0.090
ery-0.087
Tokenily
Token ID813
Feature activation-9.64e+00

Pos logit contributions
Sorry+0.097
Listen+0.095
Don+0.095
Kids+0.090
By+0.090
Neg logit contributions
et-0.069
ig-0.065
 egg-0.064
ish-0.063
an-0.063
Token liked
Token ID8288
Feature activation-1.10e+01

Pos logit contributions
Kids+0.094
Listen+0.091
Don+0.090
Sorry+0.090
Dad+0.089
Neg logit contributions
lers-0.093
an-0.093
cks-0.092
et-0.092
ish-0.091
Token to
Token ID284
Feature activation-1.34e+01

Pos logit contributions
Listen+0.106
Kids+0.105
Sorry+0.105
Don+0.100
Dad+0.100
Neg logit contributions
et-0.114
an-0.110
ish-0.108
lers-0.107
ig-0.107
Token design
Token ID1486
Feature activation-5.28e+00

Pos logit contributions
Sorry+0.124
Kids+0.123
Listen+0.123
Don+0.118
Sure+0.115
Neg logit contributions
et-0.140
lers-0.134
an-0.134
ish-0.134
 egg-0.131
Token things
Token ID1243
Feature activation-5.32e+00

Pos logit contributions
When+0.053
Exc+0.052
He+0.052
Dad+0.051
What+0.051
Neg logit contributions
an-0.046
et-0.044
 egg-0.044
ish-0.043
ig-0.042
Token.
Token ID13
Feature activation-2.12e+01

Pos logit contributions
Sure+0.055
Sorry+0.054
He+0.054
Listen+0.054
Don+0.053
Neg logit contributions
an-0.046
et-0.046
 egg-0.043
lers-0.043
ish-0.043
Token She
Token ID1375
Feature activation-1.26e+01

Pos logit contributions
Sorry+0.165
Kids+0.148
Listen+0.143
By+0.137
Don+0.135
Neg logit contributions
et-0.265
 ornament-0.244
 matches-0.244
ish-0.243
 egg-0.242
Token had
Token ID550
Feature activation-1.26e+01

Pos logit contributions
Sorry+0.106
Kids+0.106
Listen+0.105
Don+0.101
Dad+0.099
Neg logit contributions
et-0.142
lers-0.137
an-0.137
ish-0.137
ig-0.134
Token many
Token ID867
Feature activation-6.38e+00

Pos logit contributions
Sorry+0.129
Listen+0.129
Kids+0.129
Don+0.125
Dad+0.123
Neg logit contributions
et-0.119
an-0.115
ish-0.115
lers-0.114
ig-0.111
Token papers
Token ID9473
Feature activation-1.20e+00

Pos logit contributions
Exc+0.072
Dad+0.071
Kids+0.071
Jack+0.068
Sorry+0.068
Neg logit contributions
et-0.048
an-0.047
cks-0.044
ish-0.043
pp-0.042
Token and
Token ID290
Feature activation-1.15e+01

Pos logit contributions
Sorry+0.012
Dad+0.012
When+0.011
Listen+0.011
Kids+0.011
Neg logit contributions
et-0.011
an-0.011
ish-0.011
lers-0.010
cks-0.010
TokenBen
Token ID11696
Feature activation-9.20e+00

Pos logit contributions
Kids+0.092
Sorry+0.092
Listen+0.091
Don+0.087
Dad+0.085
Neg logit contributions
et-0.122
an-0.117
ish-0.116
lers-0.116
 egg-0.114
Token and
Token ID290
Feature activation-1.38e+01

Pos logit contributions
Kids+0.092
Sorry+0.092
Listen+0.091
Don+0.089
Dad+0.087
Neg logit contributions
et-0.088
lers-0.085
an-0.085
ish-0.084
ig-0.083
Token Lily
Token ID20037
Feature activation-1.05e+01

Pos logit contributions
Sorry+0.150
Kids+0.143
Listen+0.141
Um+0.135
Don+0.135
Neg logit contributions
et-0.120
 egg-0.117
ish-0.116
lers-0.112
an-0.111
Token are
Token ID389
Feature activation-1.13e+01

Pos logit contributions
Sorry+0.078
Kids+0.077
Listen+0.077
Don+0.073
Dad+0.072
Neg logit contributions
et-0.129
an-0.125
lers-0.125
ish-0.124
ig-0.122
Token twins
Token ID20345
Feature activation-9.62e+00

Pos logit contributions
Sorry+0.140
Kids+0.140
Listen+0.139
Dad+0.135
Don+0.135
Neg logit contributions
et-0.085
an-0.080
lers-0.078
ish-0.077
ig-0.075
Token.
Token ID13
Feature activation-2.12e+01

Pos logit contributions
Sorry+0.109
Kids+0.107
Dad+0.107
Listen+0.105
Don+0.104
Neg logit contributions
et-0.078
an-0.078
lers-0.075
ish-0.075
ig-0.075
Token They
Token ID1119
Feature activation-1.38e+01

Pos logit contributions
Sorry+0.223
Don+0.196
Listen+0.195
By+0.189
Enough+0.186
Neg logit contributions
et-0.177
 red-0.176
 green-0.174
 blue-0.173
 yellow-0.170
Token share
Token ID2648
Feature activation-9.10e+00

Pos logit contributions
Sorry+0.151
Kids+0.150
Listen+0.149
Don+0.143
Um+0.141
Neg logit contributions
et-0.123
ish-0.115
an-0.113
lers-0.111
 egg-0.107
Token a
Token ID257
Feature activation-6.21e+00

Pos logit contributions
Listen+0.087
Sorry+0.087
Kids+0.086
Don+0.083
Dad+0.083
Neg logit contributions
et-0.093
an-0.090
ish-0.089
lers-0.089
ig-0.089
Token room
Token ID2119
Feature activation-2.92e+00

Pos logit contributions
Sorry+0.079
Listen+0.079
Kids+0.079
Sure+0.077
Dad+0.076
Neg logit contributions
et-0.044
ish-0.040
an-0.040
lers-0.040
ig-0.039
Token and
Token ID290
Feature activation-1.38e+01

Pos logit contributions
Sure+0.027
Sorry+0.027
Dad+0.027
On+0.026
His+0.026
Neg logit contributions
ish-0.028
an-0.028
et-0.027
ig-0.027
lers-0.027
Token were
Token ID547
Feature activation-1.14e+01

Pos logit contributions
Sorry+0.078
Listen+0.077
Kids+0.076
Don+0.072
Sure+0.071
Neg logit contributions
et-0.112
ish-0.107
an-0.106
lers-0.106
ig-0.103
Token playing
Token ID2712
Feature activation-9.13e+00

Pos logit contributions
Kids+0.133
Listen+0.132
Sorry+0.132
Dad+0.129
Don+0.127
Neg logit contributions
et-0.092
an-0.089
lers-0.083
ish-0.082
 egg-0.081
Token in
Token ID287
Feature activation-9.43e+00

Pos logit contributions
Sorry+0.113
Sure+0.112
Listen+0.110
Kids+0.110
Don+0.109
Neg logit contributions
et-0.064
an-0.062
lers-0.061
 egg-0.060
ish-0.059
Token the
Token ID262
Feature activation-6.88e+00

Pos logit contributions
Sorry+0.110
Listen+0.108
Kids+0.108
Don+0.104
Sure+0.103
Neg logit contributions
et-0.077
lers-0.073
an-0.072
ish-0.072
 egg-0.071
Token snow
Token ID6729
Feature activation+1.42e+00

Pos logit contributions
Kids+0.083
Sorry+0.082
Listen+0.082
Don+0.080
Dad+0.079
Neg logit contributions
et-0.053
lers-0.050
an-0.050
ish-0.050
ig-0.048
Token.
Token ID13
Feature activation-2.12e+01

Pos logit contributions
cks+0.013
igs+0.012
ig+0.012
ish+0.012
ool+0.012
Neg logit contributions
Dad-0.012
His-0.012
He-0.012
Sorry-0.012
They-0.012
Token They
Token ID1119
Feature activation-1.37e+01

Pos logit contributions
Sorry+0.205
Listen+0.179
By+0.171
Um+0.171
Don+0.164
Neg logit contributions
ish-0.220
et-0.214
 up-0.211
 red-0.206
 green-0.203
Token made
Token ID925
Feature activation-1.18e+01

Pos logit contributions
Sorry+0.095
Dad+0.094
Kids+0.094
Don+0.094
Listen+0.093
Neg logit contributions
an-0.169
et-0.169
lers-0.166
ish-0.166
ig-0.165
Token a
Token ID257
Feature activation-7.35e+00

Pos logit contributions
Listen+0.107
Sorry+0.105
Kids+0.105
Don+0.104
Dad+0.103
Neg logit contributions
et-0.122
ish-0.119
ig-0.118
lers-0.117
an-0.116
Token big
Token ID1263
Feature activation-4.15e+00

Pos logit contributions
Sorry+0.085
Listen+0.083
Kids+0.083
Dad+0.083
Sure+0.083
Neg logit contributions
et-0.060
lers-0.055
ish-0.055
an-0.054
ig-0.053
Token snow
Token ID6729
Feature activation+2.20e+00

Pos logit contributions
Sure+0.043
Dad+0.042
Sorry+0.042
He+0.041
Hi+0.040
Neg logit contributions
ates-0.034
aces-0.033
cks-0.033
pp-0.032
ps-0.032
Token flavors
Token ID17361
Feature activation+3.03e+01

Pos logit contributions
cks+0.302
an+0.297
ig+0.282
igs+0.282
lers+0.275
Neg logit contributions
Sorry-0.373
Dad-0.351
Sure-0.332
He-0.330
Hi-0.330
Token they
Token ID484
Feature activation+1.55e+01

Pos logit contributions
an+0.283
cks+0.273
et+0.270
ig+0.269
lers+0.268
Neg logit contributions
Sorry-0.311
Dad-0.304
Sure-0.291
Listen-0.290
He-0.287
Token liked
Token ID8288
Feature activation+1.68e+01

Pos logit contributions
et+0.187
an+0.186
ig+0.184
lers+0.182
ish+0.181
Neg logit contributions
Sorry-0.115
Kids-0.111
Listen-0.110
Don-0.108
Dad-0.106
Token and
Token ID290
Feature activation+8.77e+00

Pos logit contributions
an+0.155
ig+0.150
ish+0.146
et+0.143
lers+0.141
Neg logit contributions
Dad-0.170
Sorry-0.169
Kids-0.167
Listen-0.164
Don-0.162
Token what
Token ID644
Feature activation+1.09e+01

Pos logit contributions
an+0.101
et+0.101
ig+0.100
ish+0.099
lers+0.097
Neg logit contributions
Sorry-0.070
Kids-0.069
Listen-0.068
Don-0.067
Dad-0.065
Token topp
Token ID23126
Feature activation+3.89e+01

Pos logit contributions
et+0.133
lers+0.129
an+0.128
ish+0.128
 egg+0.126
Neg logit contributions
Sorry-0.081
Kids-0.080
Listen-0.080
Don-0.076
Sure-0.074
Tokenings
Token ID654
Feature activation+2.29e+01

Pos logit contributions
 cob+0.271
ig+0.263
 web+0.258
 egg+0.257
et+0.256
Neg logit contributions
Sorry-0.460
Dad-0.453
His-0.434
Listen-0.432
Kids-0.424
Token would
Token ID561
Feature activation+1.21e+01

Pos logit contributions
an+0.257
et+0.254
ig+0.252
ish+0.250
lers+0.248
Neg logit contributions
Sorry-0.194
Listen-0.192
Kids-0.192
Don-0.185
Dad-0.181
Token go
Token ID467
Feature activation+1.62e+01

Pos logit contributions
et+0.135
lers+0.132
an+0.131
ish+0.131
ig+0.130
Neg logit contributions
Sorry-0.104
Listen-0.102
Kids-0.101
Don-0.098
Dad-0.097
Token best
Token ID1266
Feature activation+1.64e+01

Pos logit contributions
an+0.124
ig+0.121
lers+0.120
et+0.120
 egg+0.119
Neg logit contributions
Sorry-0.191
Listen-0.188
Kids-0.185
Don-0.182
Dad-0.181
Token.
Token ID13
Feature activation+7.01e-01

Pos logit contributions
et+0.138
cks+0.135
an+0.135
lers+0.131
ig+0.129
Neg logit contributions
Sorry-0.177
Dad-0.174
Kids-0.173
Listen-0.167
Sure-0.163
Token placed
Token ID4624
Feature activation+1.41e+01

Pos logit contributions
an+0.062
et+0.062
ish+0.060
ig+0.060
lers+0.059
Neg logit contributions
Sorry-0.049
Kids-0.048
Don-0.047
Listen-0.047
Dad-0.046
Token it
Token ID340
Feature activation+1.58e+01

Pos logit contributions
an+0.180
ig+0.172
ish+0.168
opl+0.168
lers+0.165
Neg logit contributions
Don-0.097
Dad-0.095
Listen-0.094
What-0.093
Sure-0.091
Token onto
Token ID4291
Feature activation+1.15e+01

Pos logit contributions
opl+0.134
an+0.133
ish+0.120
 egg+0.120
cks+0.120
Neg logit contributions
His-0.163
Sorry-0.162
Don-0.162
He-0.161
Listen-0.160
Token the
Token ID262
Feature activation+1.24e+01

Pos logit contributions
an+0.124
ish+0.124
lers+0.123
ig+0.122
et+0.118
Neg logit contributions
Dad-0.100
Listen-0.099
Sorry-0.097
Don-0.097
Kids-0.096
Token wheel
Token ID7825
Feature activation+2.43e+01

Pos logit contributions
cks+0.105
et+0.096
ig+0.090
lers+0.090
ish+0.087
Neg logit contributions
Dad-0.148
Sorry-0.144
Don-0.144
Exc-0.139
Daddy-0.138
Tokenbar
Token ID5657
Feature activation+3.84e+01

Pos logit contributions
cks+0.234
igs+0.216
 egg+0.212
an+0.211
lers+0.210
Neg logit contributions
He-0.228
His-0.222
Sure-0.210
Dad-0.209
When-0.208
Tokenrow
Token ID808
Feature activation+2.41e+01

Pos logit contributions
 matches+0.280
cks+0.274
 cob+0.272
 egg+0.267
lers+0.263
Neg logit contributions
His-0.440
He-0.425
Dad-0.424
So-0.396
When-0.385
Token.
Token ID13
Feature activation-2.42e+00

Pos logit contributions
cks+0.233
ish+0.224
lers+0.220
 egg+0.220
ig+0.219
Neg logit contributions
He-0.230
His-0.222
Don-0.207
Dad-0.206
Sorry-0.206
Token\n
Token ID198
Feature activation-1.51e+00

Pos logit contributions
Sorry+0.019
Listen+0.018
Kids+0.017
Don+0.016
Um+0.016
Neg logit contributions
et-0.027
ish-0.026
 oven-0.026
 egg-0.026
 up-0.025
Token\n
Token ID198
Feature activation-2.02e+00

Pos logit contributions
Sorry+0.013
Listen+0.012
Hi+0.011
Um+0.011
Don+0.011
Neg logit contributions
et-0.015
an-0.015
ery-0.015
!"-0.014
ig-0.014
TokenJohn
Token ID7554
Feature activation+1.04e+01

Pos logit contributions
Listen+0.011
Hi+0.011
Um+0.011
Kids+0.011
Sorry+0.010
Neg logit contributions
ab-0.022
ep-0.022
et-0.022
ery-0.022
eh-0.022
Token he
Token ID339
Feature activation+1.62e+01

Pos logit contributions
et+0.242
an+0.236
ish+0.234
lers+0.230
ig+0.228
Neg logit contributions
Sorry-0.149
Kids-0.149
Listen-0.148
Don-0.141
Dad-0.138
Token would
Token ID561
Feature activation+1.08e+01

Pos logit contributions
et+0.203
an+0.202
ish+0.199
lers+0.198
cks+0.197
Neg logit contributions
Sorry-0.106
Listen-0.105
Kids-0.105
Don-0.104
Hi-0.100
Token take
Token ID1011
Feature activation+1.74e+01

Pos logit contributions
an+0.129
et+0.127
lers+0.127
ish+0.126
ig+0.124
Neg logit contributions
Listen-0.075
Sorry-0.074
Sure-0.073
Dad-0.072
Don-0.072
Token the
Token ID262
Feature activation+1.86e+01

Pos logit contributions
an+0.216
ish+0.206
et+0.201
 egg+0.198
ig+0.195
Neg logit contributions
Listen-0.127
Kids-0.122
Sure-0.120
Hi-0.118
Jack-0.118
Token av
Token ID1196
Feature activation+4.21e+01

Pos logit contributions
cks+0.171
et+0.149
an+0.142
igs+0.136
ig+0.136
Neg logit contributions
Sure-0.200
Hi-0.199
Listen-0.193
Don-0.190
Exc-0.188
Tokenoc
Token ID420
Feature activation+4.61e+01

Pos logit contributions
ps+0.348
 cob+0.326
cks+0.321
 matches+0.319
an+0.318
Neg logit contributions
Exc-0.456
Sure-0.449
When-0.448
He-0.439
Dad-0.438
Tokenados
Token ID22484
Feature activation+2.88e+01

Pos logit contributions
 cob+0.336
cks+0.324
 web+0.320
et+0.306
ery+0.306
Neg logit contributions
Sorry-0.510
Dad-0.495
Sure-0.471
Um-0.467
Exc-0.465
Token and
Token ID290
Feature activation+1.05e+01

Pos logit contributions
an+0.306
cks+0.298
et+0.298
ig+0.288
lers+0.286
Neg logit contributions
Sure-0.242
Don-0.240
He-0.236
When-0.235
Dad-0.235
Token scatter
Token ID41058
Feature activation+2.11e+01

Pos logit contributions
an+0.107
ig+0.106
et+0.103
cks+0.099
ish+0.098
Neg logit contributions
Sorry-0.101
Don-0.098
Listen-0.097
Exc-0.097
Dad-0.096
Token them
Token ID606
Feature activation+2.03e+01

Pos logit contributions
an+0.262
ig+0.258
cks+0.239
 egg+0.238
ich+0.237
Neg logit contributions
Exc-0.149
Hi-0.142
Sure-0.141
Jack-0.140
Don-0.140
Token around
Token ID1088
Feature activation+1.66e+01

Pos logit contributions
an+0.182
 egg+0.164
ig+0.163
opl+0.161
et+0.161
Neg logit contributions
On-0.188
He-0.185
Don-0.183
His-0.182
Sure-0.180
Token there
Token ID612
Feature activation+1.27e+01

Pos logit contributions
an+0.112
et+0.111
ish+0.110
lers+0.108
 egg+0.107
Neg logit contributions
Kids-0.151
Listen-0.149
Sorry-0.148
Don-0.147
Sure-0.142
Token was
Token ID373
Feature activation+1.01e+01

Pos logit contributions
et+0.134
ish+0.128
an+0.127
lers+0.127
ig+0.125
Neg logit contributions
Sorry-0.118
Kids-0.117
Listen-0.117
Don-0.111
Sure-0.109
Token a
Token ID257
Feature activation+1.48e+01

Pos logit contributions
et+0.085
lers+0.078
 egg+0.078
an+0.077
ish+0.075
Neg logit contributions
Kids-0.115
Sorry-0.115
Listen-0.112
Sure-0.108
Don-0.107
Token rh
Token ID9529
Feature activation+3.51e+01

Pos logit contributions
et+0.145
lers+0.138
cks+0.136
an+0.133
ish+0.133
Neg logit contributions
Sorry-0.153
Kids-0.150
Listen-0.147
Dad-0.147
Sure-0.146
Tokenin
Token ID259
Feature activation+2.70e+01

Pos logit contributions
et+0.329
cks+0.291
ps+0.289
 cob+0.283
ish+0.282
Neg logit contributions
Sorry-0.368
Dad-0.351
Jack-0.347
He-0.345
By-0.338
Tokenoc
Token ID420
Feature activation+4.27e+01

Pos logit contributions
ps+0.274
cks+0.272
et+0.266
igs+0.256
aces+0.255
Neg logit contributions
Dad-0.255
Sure-0.255
Sorry-0.249
He-0.247
His-0.244
Tokeneros
Token ID27498
Feature activation+2.75e+01

Pos logit contributions
cks+0.287
 cob+0.274
ery+0.274
igs+0.267
et+0.261
Neg logit contributions
Sorry-0.498
Dad-0.460
Sure-0.458
Um-0.451
Hi-0.448
Token who
Token ID508
Feature activation+1.28e+01

Pos logit contributions
et+0.270
igs+0.257
ig+0.256
ish+0.253
ps+0.252
Neg logit contributions
Sorry-0.246
Hi-0.242
He-0.242
Don-0.241
His-0.239
Token was
Token ID373
Feature activation+1.36e+01

Pos logit contributions
et+0.156
ig+0.154
ish+0.152
an+0.152
 egg+0.150
Neg logit contributions
Sorry-0.091
Don-0.091
Listen-0.090
Kids-0.088
Hi-0.087
Token always
Token ID1464
Feature activation+1.72e+01

Pos logit contributions
ig+0.164
ish+0.158
an+0.156
et+0.154
ishing+0.150
Neg logit contributions
He-0.101
His-0.097
Don-0.094
Dad-0.094
Hi-0.094
Token naughty
Token ID45128
Feature activation+1.74e+01

Pos logit contributions
ig+0.179
igs+0.173
an+0.172
ish+0.169
ool+0.166
Neg logit contributions
Don-0.168
Hi-0.159
Thanks-0.152
What-0.152
Listen-0.150
Token couldn
Token ID3521
Feature activation+1.19e+01

Pos logit contributions
an+0.139
lers+0.133
et+0.133
ish+0.132
cks+0.129
Neg logit contributions
Don-0.081
Sorry-0.076
Listen-0.076
Dad-0.070
Hi-0.070
Token't
Token ID470
Feature activation+9.13e+00

Pos logit contributions
et+0.141
lers+0.138
ig+0.135
 egg+0.135
ish+0.135
Neg logit contributions
Sorry-0.090
Kids-0.090
Listen-0.088
Sure-0.084
Dad-0.084
Token find
Token ID1064
Feature activation+1.39e+01

Pos logit contributions
et+0.084
an+0.080
ish+0.079
 egg+0.078
lers+0.077
Neg logit contributions
Kids-0.099
Sorry-0.099
Listen-0.098
Don-0.093
Sure-0.091
Token any
Token ID597
Feature activation+1.90e+01

Pos logit contributions
an+0.173
ish+0.163
pp+0.160
et+0.159
lers+0.157
Neg logit contributions
Listen-0.094
Don-0.092
Sure-0.092
Hi-0.090
Kids-0.090
Token av
Token ID1196
Feature activation+3.84e+01

Pos logit contributions
pp+0.197
an+0.180
et+0.178
cks+0.176
ans+0.173
Neg logit contributions
He-0.165
Where-0.164
Listen-0.161
Don-0.160
Exc-0.159
Tokenoc
Token ID420
Feature activation+3.99e+01

Pos logit contributions
ps+0.323
an+0.312
 matches+0.305
 bulb+0.305
 egg+0.304
Neg logit contributions
He-0.390
Exc-0.389
When-0.382
Don-0.379
By-0.375
Tokenados
Token ID22484
Feature activation+2.78e+01

Pos logit contributions
 cob+0.320
 web+0.307
ig+0.300
 curtain+0.299
an+0.298
Neg logit contributions
Sorry-0.425
Exc-0.392
Dad-0.390
Um-0.388
Listen-0.385
Token.
Token ID13
Feature activation-5.67e-01

Pos logit contributions
an+0.256
ig+0.239
ish+0.235
et+0.235
igs+0.233
Neg logit contributions
Don-0.285
Sorry-0.267
Listen-0.266
He-0.261
Kids-0.260
Token Finally
Token ID9461
Feature activation+8.61e+00

Pos logit contributions
Sorry+0.004
Listen+0.004
Kids+0.004
Dad+0.004
By+0.004
Neg logit contributions
et-0.006
c-0.006
lers-0.006
ish-0.006
ig-0.006
Token,
Token ID11
Feature activation+2.92e+00

Pos logit contributions
et+0.052
lers+0.049
ig+0.048
 egg+0.047
ish+0.047
Neg logit contributions
Kids-0.119
Sorry-0.118
Listen-0.118
Don-0.114
Sure-0.113
Token she
Token ID673
Feature activation+8.55e+00

Pos logit contributions
et+0.035
lers+0.034
ish+0.034
 egg+0.033
an+0.033
Neg logit contributions
Kids-0.022
Sorry-0.022
Listen-0.022
Don-0.021
Dad-0.020
Token in
Token ID287
Feature activation+8.34e+00

Pos logit contributions
an+0.111
opl+0.106
 egg+0.104
cks+0.103
et+0.101
Neg logit contributions
He-0.136
Sorry-0.134
His-0.134
Don-0.133
Sure-0.133
Token a
Token ID257
Feature activation+1.18e+01

Pos logit contributions
an+0.093
ish+0.091
lers+0.089
et+0.089
ig+0.087
Neg logit contributions
Listen-0.071
Dad-0.070
Don-0.070
Sorry-0.069
Kids-0.069
Token row
Token ID5752
Feature activation+2.08e+01

Pos logit contributions
cks+0.089
et+0.086
an+0.085
ps+0.084
ates+0.081
Neg logit contributions
Sure-0.143
Exc-0.137
Sorry-0.136
Dad-0.133
Don-0.133
Token along
Token ID1863
Feature activation+1.15e+01

Pos logit contributions
cks+0.219
lers+0.213
 egg+0.206
an+0.204
et+0.204
Neg logit contributions
Dad-0.172
He-0.171
Sure-0.168
His-0.167
Sorry-0.163
Token the
Token ID262
Feature activation+9.77e+00

Pos logit contributions
ish+0.126
an+0.125
ig+0.124
lers+0.122
et+0.121
Neg logit contributions
Dad-0.096
Exc-0.095
Listen-0.093
Kids-0.092
Don-0.092
Token windows
Token ID9168
Feature activation+2.04e+01

Pos logit contributions
cks+0.090
ish+0.087
ig+0.085
pp+0.085
els+0.084
Neg logit contributions
Dad-0.097
Hi-0.095
Listen-0.095
Exc-0.094
Kids-0.094
Tokenill
Token ID359
Feature activation+1.88e+01

Pos logit contributions
lers+0.211
an+0.207
igs+0.201
 egg+0.200
cks+0.196
Neg logit contributions
Dad-0.170
He-0.168
His-0.165
Sorry-0.162
When-0.160
Token.
Token ID13
Feature activation-5.56e+00

Pos logit contributions
cks+0.175
igs+0.165
an+0.161
ps+0.159
ates+0.154
Neg logit contributions
Exc-0.162
He-0.162
When-0.159
On-0.153
Dad-0.152
Token One
Token ID1881
Feature activation+3.95e+00

Pos logit contributions
Sorry+0.034
Listen+0.033
Kids+0.032
Don+0.030
Dad+0.029
Neg logit contributions
et-0.076
ish-0.074
lers-0.073
an-0.072
ig-0.072
Token day
Token ID1110
Feature activation+3.50e+00

Pos logit contributions
cks+0.045
ps+0.042
et+0.040
lers+0.040
an+0.039
Neg logit contributions
Sure-0.034
Listen-0.034
No-0.034
Dad-0.033
Don-0.033
Token,
Token ID11
Feature activation-1.81e+00

Pos logit contributions
et+0.022
lers+0.021
ish+0.020
an+0.020
ig+0.020
Neg logit contributions
Sorry-0.047
Kids-0.047
Listen-0.047
Don-0.045
Dad-0.045
Token He
Token ID679
Feature activation+5.93e+00

Pos logit contributions
Sorry+0.022
Listen+0.022
Kids+0.021
Enough+0.020
Don+0.019
Neg logit contributions
 matches-0.023
et-0.022
 v-0.022
ab-0.022
 block-0.022
Token pointed
Token ID6235
Feature activation+1.16e+01

Pos logit contributions
et+0.064
an+0.060
ish+0.060
lers+0.059
 egg+0.059
Neg logit contributions
Kids-0.054
Listen-0.054
Sorry-0.053
Don-0.050
Dad-0.049
Token to
Token ID284
Feature activation+3.52e+00

Pos logit contributions
an+0.117
ig+0.115
lers+0.114
et+0.114
ery+0.113
Neg logit contributions
Dad-0.104
Listen-0.103
Sure-0.103
Don-0.102
His-0.101
Token a
Token ID257
Feature activation+1.17e+01

Pos logit contributions
an+0.037
et+0.036
ig+0.036
ish+0.036
lers+0.036
Neg logit contributions
Listen-0.032
Dad-0.031
Don-0.031
Kids-0.031
Sorry-0.031
Token small
Token ID1402
Feature activation+1.84e+01

Pos logit contributions
et+0.099
ors+0.096
ig+0.093
cks+0.093
an+0.093
Neg logit contributions
Dad-0.125
He-0.124
Hi-0.122
 Something-0.121
Don-0.121
Token jar
Token ID17379
Feature activation+2.24e+01

Pos logit contributions
cks+0.162
ps+0.150
eking+0.148
et+0.148
ates+0.147
Neg logit contributions
Dad-0.185
Hi-0.182
No-0.181
 Something-0.180
He-0.170
Token with
Token ID351
Feature activation+1.00e+01

Pos logit contributions
ish+0.218
an+0.218
lers+0.215
ig+0.214
igs+0.211
Neg logit contributions
He-0.202
Dad-0.196
Sorry-0.196
His-0.196
Don-0.195
Token a
Token ID257
Feature activation+1.24e+01

Pos logit contributions
an+0.115
ig+0.115
ish+0.107
et+0.105
asis+0.104
Neg logit contributions
Don-0.081
Listen-0.081
Dad-0.081
 Something-0.080
Um-0.080
Token label
Token ID6167
Feature activation+2.35e+01

Pos logit contributions
et+0.107
an+0.104
ans+0.099
cks+0.097
ishing+0.096
Neg logit contributions
 Something-0.135
Sorry-0.135
He-0.134
Dad-0.132
Don-0.131
Token that
Token ID326
Feature activation+6.67e+00

Pos logit contributions
an+0.225
ig+0.215
igs+0.215
ish+0.213
et+0.207
Neg logit contributions
He-0.216
Sorry-0.216
Dad-0.216
Listen-0.214
Don-0.210
Token said
Token ID531
Feature activation+6.69e+00

Pos logit contributions
et+0.084
an+0.084
ish+0.084
ig+0.084
lers+0.082
Neg logit contributions
Sorry-0.046
Listen-0.045
Kids-0.045
Don-0.044
Dad-0.043
Token cave
Token ID11527
Feature activation+2.73e+01

Pos logit contributions
cks+0.121
et+0.121
ig+0.112
ish+0.110
lers+0.110
Neg logit contributions
Sorry-0.149
Dad-0.148
Hi-0.148
Kids-0.147
Don-0.146
Token.
Token ID13
Feature activation+1.97e-01

Pos logit contributions
ish+0.266
ig+0.256
an+0.252
et+0.250
 egg+0.245
Neg logit contributions
On-0.248
Sorry-0.237
He-0.235
When-0.232
Don-0.229
Token They
Token ID1119
Feature activation+5.88e+00

Pos logit contributions
et+0.002
 matches+0.002
 egg+0.002
cks+0.002
 Tower+0.002
Neg logit contributions
Sorry-0.002
Listen-0.001
Kids-0.001
Sure-0.001
Enough-0.001
Token take
Token ID1011
Feature activation+1.15e+01

Pos logit contributions
et+0.065
lers+0.062
an+0.062
ish+0.062
ig+0.061
Neg logit contributions
Sorry-0.051
Kids-0.051
Listen-0.050
Don-0.048
Dad-0.048
Token the
Token ID262
Feature activation+1.18e+01

Pos logit contributions
et+0.121
ish+0.119
an+0.119
ig+0.118
lers+0.115
Neg logit contributions
Kids-0.103
Listen-0.103
Sorry-0.099
Dad-0.097
Don-0.097
Token wolf
Token ID17481
Feature activation+2.10e+01

Pos logit contributions
et+0.103
cks+0.100
pp+0.090
ig+0.088
ish+0.088
Neg logit contributions
Hi-0.126
Sure-0.125
Listen-0.124
Don-0.122
Sorry-0.122
Token with
Token ID351
Feature activation+1.02e+01

Pos logit contributions
et+0.240
ig+0.239
iflower+0.233
igs+0.232
ates+0.229
Neg logit contributions
On-0.144
When-0.142
Don-0.139
Sure-0.135
He-0.135
Token them
Token ID606
Feature activation+1.53e+01

Pos logit contributions
an+0.124
et+0.123
ish+0.122
ig+0.122
lers+0.119
Neg logit contributions
Kids-0.072
Listen-0.071
Dad-0.070
Sorry-0.070
Don-0.070
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
et+0.150
an+0.147
ig+0.143
iflower+0.143
igs+0.142
Neg logit contributions
Don-0.129
So-0.128
On-0.127
By-0.126
He-0.125
Token They
Token ID1119
Feature activation+4.81e+00

Pos logit contributions
Sorry+0.010
Listen+0.010
Kids+0.009
Um+0.009
Enough+0.009
Neg logit contributions
 matches-0.013
 egg-0.013
cks-0.013
et-0.013
 chim-0.013
Token go
Token ID467
Feature activation+9.54e+00

Pos logit contributions
et+0.053
an+0.052
lers+0.052
ish+0.051
ig+0.051
Neg logit contributions
Sorry-0.040
Kids-0.040
Listen-0.039
Don-0.039
Sure-0.038
Token She
Token ID1375
Feature activation+5.69e+00

Pos logit contributions
Sorry+0.013
Listen+0.013
Kids+0.012
By+0.011
Um+0.011
Neg logit contributions
ish-0.026
et-0.026
ig-0.025
an-0.025
 egg-0.025
Token started
Token ID2067
Feature activation+1.16e+01

Pos logit contributions
et+0.061
lers+0.058
ish+0.058
an+0.058
 egg+0.056
Neg logit contributions
Listen-0.052
Sorry-0.051
Kids-0.051
Dad-0.048
Don-0.047
Token to
Token ID284
Feature activation+5.78e+00

Pos logit contributions
et+0.127
an+0.126
ish+0.123
lers+0.121
ig+0.121
Neg logit contributions
Listen-0.106
Kids-0.104
Sorry-0.103
Don-0.103
Sure-0.097
Token demand
Token ID3512
Feature activation+1.20e+01

Pos logit contributions
et+0.068
lers+0.066
an+0.066
ish+0.065
ig+0.065
Neg logit contributions
Sorry-0.046
Kids-0.046
Listen-0.046
Don-0.044
Dad-0.043
Token the
Token ID262
Feature activation+1.10e+01

Pos logit contributions
an+0.128
et+0.125
ig+0.122
ish+0.122
cks+0.119
Neg logit contributions
Listen-0.113
Sorry-0.112
Kids-0.111
Don-0.109
Sure-0.107
Token umbrella
Token ID25510
Feature activation+2.30e+01

Pos logit contributions
cks+0.093
et+0.089
lers+0.088
ates+0.082
ig+0.082
Neg logit contributions
Kids-0.131
Listen-0.129
Sorry-0.129
Don-0.126
Sure-0.126
Token.
Token ID13
Feature activation-3.07e+00

Pos logit contributions
lers+0.209
an+0.208
ig+0.205
et+0.205
 egg+0.204
Neg logit contributions
Sorry-0.231
Listen-0.226
Don-0.226
Sure-0.223
Kids-0.220
Token "
Token ID366
Feature activation-1.81e-01

Pos logit contributions
Sorry+0.023
Listen+0.022
Kids+0.021
By+0.020
Don+0.019
Neg logit contributions
et-0.034
ish-0.034
ig-0.034
 egg-0.033
 oven-0.033
TokenPlease
Token ID5492
Feature activation+4.14e+00

Pos logit contributions
Kids+0.001
Together+0.001
 Something+0.001
Listen+0.001
 His+0.001
Neg logit contributions
et-0.002
c-0.002
 egg-0.002
ab-0.002
 ornament-0.002
Token,
Token ID11
Feature activation+2.76e+00

Pos logit contributions
lers+0.040
et+0.039
an+0.038
 egg+0.037
cks+0.037
Neg logit contributions
Sorry-0.043
Kids-0.041
Sure-0.041
Listen-0.041
Dad-0.040
Token mom
Token ID1995
Feature activation+1.03e+01

Pos logit contributions
lers+0.035
et+0.035
an+0.034
cks+0.034
ish+0.033
Neg logit contributions
Kids-0.020
Sorry-0.020
Sure-0.019
Listen-0.019
Don-0.018
Token't
Token ID470
Feature activation+1.58e+01

Pos logit contributions
By+0.074
Sorry+0.073
Dad+0.072
Kids+0.069
Listen+0.069
Neg logit contributions
lers-0.049
an-0.048
ish-0.046
et-0.046
ig-0.044
Token do
Token ID466
Feature activation+1.63e+01

Pos logit contributions
et+0.192
lers+0.188
an+0.188
ig+0.186
ish+0.186
Neg logit contributions
Sorry-0.118
Listen-0.117
Kids-0.115
Don-0.112
Dad-0.110
Token it
Token ID340
Feature activation+2.18e+01

Pos logit contributions
et+0.161
an+0.160
ish+0.157
 egg+0.152
ig+0.152
Neg logit contributions
Sorry-0.158
Listen-0.157
Kids-0.155
Don-0.150
Sure-0.146
Token.
Token ID13
Feature activation+3.46e+00

Pos logit contributions
 egg+0.172
an+0.172
et+0.167
lers+0.164
ish+0.161
Neg logit contributions
Sorry-0.248
Listen-0.245
Kids-0.240
Don-0.235
Sure-0.233
Token My
Token ID2011
Feature activation+1.63e+01

Pos logit contributions
et+0.041
lers+0.040
ish+0.040
an+0.039
 matches+0.039
Neg logit contributions
Listen-0.025
Sorry-0.024
Kids-0.023
By-0.022
Don-0.021
Token sc
Token ID629
Feature activation+2.74e+01

Pos logit contributions
et+0.177
cks+0.169
lers+0.168
pp+0.164
an+0.162
Neg logit contributions
Dad-0.134
Sure-0.133
Kids-0.132
Sorry-0.132
Um-0.131
Tokenooter
Token ID25141
Feature activation+2.92e+01

Pos logit contributions
et+0.206
 egg+0.205
ery+0.193
an+0.190
lers+0.186
Neg logit contributions
Sorry-0.324
Sure-0.321
Kids-0.314
Um-0.310
Listen-0.308
Token is
Token ID318
Feature activation+1.35e+01

Pos logit contributions
 egg+0.297
an+0.297
et+0.289
cks+0.284
ish+0.283
Neg logit contributions
Sorry-0.272
Don-0.253
Listen-0.251
Hi-0.248
Kids-0.245
Token not
Token ID407
Feature activation+1.37e+01

Pos logit contributions
an+0.146
ish+0.142
ig+0.137
lers+0.134
et+0.130
Neg logit contributions
Listen-0.118
Don-0.118
Sorry-0.117
Um-0.114
Thanks-0.110
Token good
Token ID922
Feature activation+1.68e+01

Pos logit contributions
an+0.158
lers+0.153
ig+0.150
ish+0.147
asis+0.139
Neg logit contributions
He-0.106
Don-0.101
His-0.099
Listen-0.096
Thanks-0.095
Token for
Token ID329
Feature activation+1.17e+01

Pos logit contributions
an+0.138
et+0.137
ig+0.132
lers+0.132
 egg+0.130
Neg logit contributions
Sorry-0.188
Kids-0.187
Listen-0.186
Don-0.177
Hi-0.176
Token\n
Token ID198
Feature activation-1.20e+01

Pos logit contributions
Sorry+0.095
Listen+0.086
Um+0.085
Kids+0.082
Don+0.082
Neg logit contributions
an-0.127
et-0.127
cks-0.125
lers-0.124
ig-0.122
TokenTim
Token ID14967
Feature activation-5.04e-01

Pos logit contributions
Um+0.083
Kids+0.082
 Something+0.078
Listen+0.078
Hi+0.069
Neg logit contributions
cut-0.136
et-0.135
eh-0.133
ep-0.132
ery-0.130
Token went
Token ID1816
Feature activation-3.38e+00

Pos logit contributions
Sorry+0.005
Kids+0.005
Listen+0.005
Don+0.005
Dad+0.005
Neg logit contributions
et-0.005
an-0.005
lers-0.005
ish-0.005
ig-0.005
Token back
Token ID736
Feature activation-1.40e+00

Pos logit contributions
Kids+0.036
Listen+0.036
Sorry+0.036
Sure+0.036
Don+0.035
Neg logit contributions
et-0.030
lers-0.029
ish-0.029
an-0.029
 egg-0.028
Token to
Token ID284
Feature activation-6.14e+00

Pos logit contributions
Kids+0.011
Listen+0.010
Sure+0.010
Hi+0.010
Don+0.010
Neg logit contributions
et-0.017
an-0.016
ery-0.016
cks-0.016
lers-0.016
Token the
Token ID262
Feature activation-4.94e-01

Pos logit contributions
Kids+0.055
Listen+0.055
Sorry+0.053
Don+0.052
Sure+0.052
Neg logit contributions
et-0.066
lers-0.065
an-0.065
ish-0.064
ig-0.063
Token oct
Token ID19318
Feature activation+1.14e+01

Pos logit contributions
Kids+0.006
Sorry+0.006
Listen+0.006
Dad+0.006
Don+0.006
Neg logit contributions
et-0.004
lers-0.004
ish-0.004
an-0.004
ig-0.004
Tokenopus
Token ID25790
Feature activation+6.45e+00

Pos logit contributions
et+0.094
lers+0.088
ish+0.088
an+0.088
 egg+0.086
Neg logit contributions
Sorry-0.132
Kids-0.131
Listen-0.130
Don-0.125
Dad-0.124
Token and
Token ID290
Feature activation-6.55e+00

Pos logit contributions
cks+0.067
et+0.065
ps+0.065
ery+0.062
ates+0.061
Neg logit contributions
Sure-0.055
Jack-0.051
He-0.050
Don-0.050
So-0.050
Token gave
Token ID2921
Feature activation-2.89e+00

Pos logit contributions
Sorry+0.056
Kids+0.055
Listen+0.055
Don+0.053
Sure+0.052
Neg logit contributions
et-0.073
an-0.071
lers-0.071
ish-0.071
ig-0.069
Token it
Token ID340
Feature activation+3.29e+00

Pos logit contributions
Sure+0.020
Dad+0.020
Listen+0.020
Don+0.019
Hi+0.019
Neg logit contributions
ig-0.034
an-0.034
lers-0.033
et-0.033
cks-0.033
Token there
Token ID612
Feature activation-5.76e+00

Pos logit contributions
Sorry+0.136
Listen+0.136
Kids+0.135
Don+0.132
Dad+0.129
Neg logit contributions
et-0.098
an-0.093
lers-0.093
ish-0.092
ig-0.089
Token was
Token ID373
Feature activation-8.70e+00

Pos logit contributions
Sorry+0.062
Kids+0.059
Listen+0.059
Um+0.057
Don+0.056
Neg logit contributions
et-0.054
ish-0.051
an-0.050
ig-0.050
lers-0.050
Token a
Token ID257
Feature activation-3.08e+00

Pos logit contributions
Sorry+0.106
Kids+0.104
Listen+0.102
Sure+0.098
Um+0.098
Neg logit contributions
et-0.066
 egg-0.064
lers-0.061
ish-0.060
ery-0.060
Token very
Token ID845
Feature activation-1.80e+00

Pos logit contributions
Sorry+0.033
Kids+0.032
Listen+0.032
Dad+0.031
Sure+0.031
Neg logit contributions
et-0.029
lers-0.028
ish-0.027
an-0.026
cks-0.026
Token special
Token ID2041
Feature activation+1.61e+00

Pos logit contributions
Sorry+0.018
Kids+0.018
Listen+0.017
Dad+0.017
Sure+0.017
Neg logit contributions
et-0.018
lers-0.017
cks-0.017
ish-0.016
ig-0.016
Token kid
Token ID5141
Feature activation+2.01e+00

Pos logit contributions
cks+0.019
et+0.017
ps+0.016
lers+0.016
els+0.016
Neg logit contributions
Kids-0.015
Sure-0.015
Hi-0.015
Dad-0.014
Sorry-0.014
Token.
Token ID13
Feature activation-1.61e+01

Pos logit contributions
et+0.021
an+0.021
ps+0.021
ish+0.021
cks+0.021
Neg logit contributions
Kids-0.017
Sure-0.016
Sorry-0.016
Dad-0.016
His-0.016
Token His
Token ID2399
Feature activation-5.89e+00

Pos logit contributions
Sorry+0.131
Kids+0.118
Listen+0.117
Don+0.113
By+0.109
Neg logit contributions
et-0.196
ish-0.181
 matches-0.180
 egg-0.180
 Tower-0.178
Token name
Token ID1438
Feature activation-8.09e-01

Pos logit contributions
Sorry+0.065
Kids+0.065
Listen+0.065
Don+0.062
Sure+0.062
Neg logit contributions
et-0.051
lers-0.049
an-0.048
ish-0.048
ig-0.047
Token was
Token ID373
Feature activation-8.83e+00

Pos logit contributions
Sorry+0.008
Kids+0.008
Listen+0.008
Don+0.007
Sure+0.007
Neg logit contributions
et-0.008
lers-0.008
an-0.008
ish-0.008
 egg-0.008
Token Joe
Token ID5689
Feature activation-1.32e+00

Pos logit contributions
Sorry+0.071
Listen+0.069
Kids+0.069
Don+0.066
Dad+0.064
Neg logit contributions
et-0.105
lers-0.100
ish-0.099
an-0.099
 egg-0.098
Token sausage
Token ID31889
Feature activation+1.04e+01

Pos logit contributions
et+0.003
an+0.003
lers+0.003
ish+0.003
 egg+0.002
Neg logit contributions
Sorry-0.003
Kids-0.003
Listen-0.003
Don-0.003
Sure-0.003
Token and
Token ID290
Feature activation-4.40e+00

Pos logit contributions
et+0.122
lers+0.117
an+0.117
ish+0.117
cks+0.116
Neg logit contributions
Sorry-0.082
Kids-0.081
Listen-0.081
Don-0.077
Dad-0.076
Token butterfly
Token ID35113
Feature activation+7.28e+00

Pos logit contributions
Listen+0.042
Don+0.042
Kids+0.042
Sorry+0.042
Dad+0.041
Neg logit contributions
et-0.044
an-0.042
lers-0.042
ig-0.042
cks-0.041
Token flew
Token ID13112
Feature activation+3.90e+00

Pos logit contributions
et+0.093
an+0.091
 egg+0.090
cks+0.090
ish+0.090
Neg logit contributions
Kids-0.046
Sorry-0.046
Listen-0.046
Hi-0.044
By-0.044
Token around
Token ID1088
Feature activation+2.33e+00

Pos logit contributions
et+0.032
an+0.032
 egg+0.032
cks+0.032
ish+0.031
Neg logit contributions
Sure-0.042
By-0.041
Listen-0.041
Hi-0.041
Don-0.041
Token the
Token ID262
Feature activation+3.16e+00

Pos logit contributions
an+0.025
ish+0.024
et+0.024
lers+0.024
ig+0.024
Neg logit contributions
Listen-0.021
Hi-0.020
Don-0.020
Kids-0.020
Sure-0.020
Token sky
Token ID6766
Feature activation+9.97e+00

Pos logit contributions
et+0.026
cks+0.024
lers+0.024
ig+0.023
an+0.023
Neg logit contributions
Listen-0.037
Sorry-0.037
Kids-0.037
Dad-0.036
Don-0.036
Token,
Token ID11
Feature activation-4.27e+00

Pos logit contributions
an+0.101
cks+0.100
 egg+0.100
igs+0.099
lers+0.095
Neg logit contributions
By-0.086
Hi-0.082
Sure-0.078
Don-0.078
Sorry-0.077
Token enjoying
Token ID13226
Feature activation+2.26e+00

Pos logit contributions
Don+0.042
Listen+0.040
Thanks+0.040
By+0.039
Hi+0.039
Neg logit contributions
an-0.043
lers-0.042
ool-0.042
ig-0.042
ish-0.042
Token the
Token ID262
Feature activation+1.32e+00

Pos logit contributions
ish+0.027
an+0.027
ig+0.027
et+0.026
lers+0.026
Neg logit contributions
Listen-0.017
Don-0.017
Thanks-0.017
Dad-0.017
By-0.017
Token light
Token ID1657
Feature activation+1.29e+01

Pos logit contributions
et+0.013
cks+0.013
ig+0.012
lers+0.012
ish+0.012
Neg logit contributions
Listen-0.013
Hi-0.013
Sorry-0.013
Kids-0.013
Thanks-0.013
Token and
Token ID290
Feature activation-5.24e+00

Pos logit contributions
Sorry+0.000
Kids+0.000
Listen+0.000
Don+0.000
Dad+0.000
Neg logit contributions
et-0.000
an-0.000
ish-0.000
lers-0.000
ig-0.000
Token angry
Token ID7954
Feature activation+2.45e+00

Pos logit contributions
Kids+0.054
Sorry+0.054
Listen+0.054
Sure+0.052
Dad+0.051
Neg logit contributions
et-0.049
lers-0.048
an-0.047
ish-0.047
 egg-0.046
Token.
Token ID13
Feature activation-1.09e+01

Pos logit contributions
et+0.019
ig+0.018
an+0.018
ish+0.018
 egg+0.017
Neg logit contributions
Sorry-0.030
Listen-0.029
Kids-0.029
Don-0.029
Dad-0.028
Token They
Token ID1119
Feature activation-6.55e+00

Pos logit contributions
Sorry+0.091
Listen+0.087
Sure+0.083
By+0.083
Kids+0.081
Neg logit contributions
et-0.122
an-0.122
lers-0.122
ish-0.120
cks-0.119
Token want
Token ID765
Feature activation-5.16e+00

Pos logit contributions
Kids+0.064
Sorry+0.064
Listen+0.064
Dad+0.060
Sure+0.059
Neg logit contributions
et-0.066
lers-0.063
an-0.062
ish-0.062
 egg-0.060
Token their
Token ID511
Feature activation+8.48e-02

Pos logit contributions
Sorry+0.057
Kids+0.055
Listen+0.054
Don+0.054
Sure+0.052
Neg logit contributions
et-0.045
 egg-0.045
 matches-0.044
lers-0.043
an-0.042
Token coins
Token ID10796
Feature activation+1.19e+01

Pos logit contributions
et+0.001
ig+0.001
lers+0.001
pp+0.001
cks+0.001
Neg logit contributions
Dad-0.001
Sorry-0.001
Kids-0.001
Listen-0.001
Sure-0.001
Token back
Token ID736
Feature activation+2.13e+00

Pos logit contributions
et+0.100
lers+0.099
cks+0.097
ig+0.097
an+0.096
Neg logit contributions
Sorry-0.118
He-0.116
Dad-0.116
Don-0.116
Sure-0.114
Token.
Token ID13
Feature activation-1.16e+01

Pos logit contributions
an+0.019
igs+0.019
et+0.019
cks+0.019
ig+0.019
Neg logit contributions
Don-0.019
Kids-0.019
Dad-0.019
Sorry-0.019
Sure-0.019
Token They
Token ID1119
Feature activation-6.25e+00

Pos logit contributions
Listen+0.108
Sorry+0.106
Kids+0.101
By+0.100
Thanks+0.099
Neg logit contributions
 matches-0.104
an-0.103
et-0.099
."-0.099
lers-0.098
Token want
Token ID765
Feature activation-5.33e+00

Pos logit contributions
Sorry+0.061
Kids+0.060
Listen+0.060
Dad+0.056
Don+0.056
Neg logit contributions
et-0.065
ish-0.061
lers-0.061
an-0.061
 egg-0.060
Token again
Token ID757
Feature activation+7.52e+00

Pos logit contributions
cks+0.119
an+0.117
ig+0.114
opl+0.112
igs+0.111
Neg logit contributions
Sorry-0.116
Hi-0.114
Don-0.113
Listen-0.110
Exc-0.109
Token.
Token ID13
Feature activation-6.01e+00

Pos logit contributions
an+0.063
lers+0.060
et+0.059
ish+0.059
cks+0.057
Neg logit contributions
Sorry-0.086
Kids-0.085
Don-0.084
Listen-0.083
Dad-0.082
Token The
Token ID383
Feature activation+1.78e+00

Pos logit contributions
Sorry+0.067
Listen+0.066
Dad+0.063
Kids+0.061
Don+0.059
Neg logit contributions
et-0.048
 v-0.048
 matches-0.047
 oven-0.047
 cob-0.046
Token animals
Token ID4695
Feature activation+7.65e+00

Pos logit contributions
et+0.014
ig+0.014
lers+0.014
ish+0.014
cks+0.014
Neg logit contributions
Sorry-0.021
Kids-0.021
Listen-0.021
Dad-0.020
Hi-0.020
Token were
Token ID547
Feature activation+2.53e+00

Pos logit contributions
et+0.105
an+0.103
lers+0.102
ig+0.101
ish+0.101
Neg logit contributions
Sorry-0.044
Kids-0.044
Listen-0.043
Don-0.042
Sure-0.040
Token happy
Token ID3772
Feature activation+4.24e+00

Pos logit contributions
ig+0.031
lers+0.030
et+0.030
ish+0.030
an+0.030
Neg logit contributions
Sorry-0.017
Don-0.017
Dad-0.017
Listen-0.017
Kids-0.017
Token and
Token ID290
Feature activation-1.21e+00

Pos logit contributions
et+0.038
an+0.037
lers+0.036
ish+0.036
ig+0.035
Neg logit contributions
Sorry-0.046
Kids-0.045
Listen-0.045
Don-0.044
Dad-0.043
Token thanked
Token ID26280
Feature activation+2.84e+00

Pos logit contributions
Sorry+0.011
Listen+0.011
Kids+0.011
Don+0.010
Dad+0.010
Neg logit contributions
et-0.013
an-0.013
lers-0.013
ish-0.013
 egg-0.012
Token Lily
Token ID20037
Feature activation+6.12e+00

Pos logit contributions
et+0.031
lers+0.029
ish+0.029
 egg+0.029
an+0.029
Neg logit contributions
Sorry-0.025
Listen-0.025
Kids-0.024
Don-0.023
Dad-0.023
Token for
Token ID329
Feature activation-1.76e+00

Pos logit contributions
cks+0.048
et+0.048
an+0.046
lers+0.045
 egg+0.045
Neg logit contributions
Sure-0.069
Kids-0.065
Listen-0.065
Don-0.065
When-0.065
Token her
Token ID607
Feature activation+4.81e+00

Pos logit contributions
Sorry+0.019
Kids+0.019
Listen+0.019
Don+0.018
Dad+0.018
Neg logit contributions
et-0.016
lers-0.015
an-0.015
ish-0.015
 egg-0.015
Token shooter
Token ID14273
Feature activation+1.06e+01

Pos logit contributions
an+0.114
et+0.112
cks+0.111
lers+0.111
 egg+0.111
Neg logit contributions
Sorry-0.116
Listen-0.112
Don-0.110
Sure-0.110
Kids-0.109
Token.
Token ID13
Feature activation-1.36e+01

Pos logit contributions
an+0.094
cks+0.092
 egg+0.092
et+0.091
lers+0.089
Neg logit contributions
Sorry-0.107
Listen-0.106
Don-0.105
Kids-0.103
By-0.103
Token<|endoftext|>
Token ID50256
Feature activation-5.49e+00

Pos logit contributions
Sorry+0.117
Kids+0.110
Listen+0.105
Um+0.104
Exc+0.099
Neg logit contributions
et-0.146
ish-0.144
opl-0.143
an-0.141
c-0.139
TokenOnce
Token ID7454
Feature activation-7.73e+00

Pos logit contributions
Sorry+0.028
Kids+0.028
Listen+0.028
Don+0.026
Dad+0.025
Neg logit contributions
et-0.080
an-0.078
lers-0.078
ish-0.077
ig-0.076
Token upon
Token ID2402
Feature activation-5.44e+00

Pos logit contributions
Sorry+0.068
Kids+0.067
Listen+0.067
Don+0.064
Dad+0.063
Neg logit contributions
et-0.084
an-0.081
lers-0.081
ish-0.080
 egg-0.079
Token a
Token ID257
Feature activation-4.68e+00

Pos logit contributions
Kids+0.071
Sorry+0.071
Listen+0.067
Um+0.066
Sure+0.065
Neg logit contributions
et-0.034
 egg-0.034
ery-0.033
an-0.032
lers-0.032
Token time
Token ID640
Feature activation-7.41e+00

Pos logit contributions
Sorry+0.054
Kids+0.053
Listen+0.053
Don+0.051
Sure+0.050
Neg logit contributions
et-0.039
an-0.037
ish-0.037
lers-0.037
ig-0.036
Token,
Token ID11
Feature activation-1.14e+01

Pos logit contributions
Listen+0.089
Sorry+0.089
Kids+0.089
Don+0.087
Sure+0.086
Neg logit contributions
et-0.057
an-0.054
lers-0.053
ish-0.052
 egg-0.051
Token there
Token ID612
Feature activation-5.03e+00

Pos logit contributions
Sorry+0.126
Listen+0.126
Kids+0.125
Don+0.123
Dad+0.120
Neg logit contributions
et-0.099
an-0.094
lers-0.094
ish-0.093
 egg-0.090
Token was
Token ID373
Feature activation-8.12e+00

Pos logit contributions
Sorry+0.049
Kids+0.047
Listen+0.047
Don+0.045
Sure+0.044
Neg logit contributions
et-0.051
ish-0.049
an-0.049
lers-0.049
ig-0.048
Token a
Token ID257
Feature activation-2.37e+00

Pos logit contributions
Sorry+0.093
Kids+0.092
Listen+0.091
Don+0.087
Sure+0.087
Neg logit contributions
et-0.068
 egg-0.065
lers-0.064
ish-0.063
an-0.063
Token a
Token ID257
Feature activation-8.79e-01

Pos logit contributions
Sorry+0.072
Kids+0.071
Listen+0.070
Don+0.067
Sure+0.067
Neg logit contributions
et-0.053
 egg-0.050
lers-0.050
ish-0.049
an-0.049
Token little
Token ID1310
Feature activation+1.53e+00

Pos logit contributions
Sorry+0.009
Kids+0.009
Listen+0.009
Dad+0.009
Sure+0.009
Neg logit contributions
et-0.008
lers-0.008
an-0.008
ish-0.008
cks-0.007
Token girl
Token ID2576
Feature activation+1.05e+00

Pos logit contributions
cks+0.020
et+0.019
ps+0.018
ates+0.018
aces+0.017
Neg logit contributions
Kids-0.012
Hi-0.012
Sorry-0.012
Dad-0.011
Sure-0.011
Token.
Token ID13
Feature activation-1.36e+01

Pos logit contributions
et+0.013
ps+0.013
an+0.013
ery+0.013
ish+0.013
Neg logit contributions
Kids-0.007
Dad-0.007
Sorry-0.007
Don-0.007
Sure-0.007
Token She
Token ID1375
Feature activation-5.43e+00

Pos logit contributions
Sorry+0.097
Listen+0.087
Kids+0.086
Um+0.079
Don+0.077
Neg logit contributions
et-0.171
 egg-0.164
 cob-0.163
c-0.163
ish-0.160
Token was
Token ID373
Feature activation-4.43e+00

Pos logit contributions
Sorry+0.053
Listen+0.052
Kids+0.052
Sure+0.049
Don+0.049
Neg logit contributions
et-0.055
an-0.053
lers-0.053
ish-0.052
 egg-0.051
Token very
Token ID845
Feature activation-1.78e+00

Pos logit contributions
Listen+0.041
Don+0.041
Sorry+0.041
Kids+0.041
Sure+0.040
Neg logit contributions
et-0.046
an-0.045
lers-0.044
ish-0.043
ig-0.043
Token curious
Token ID11040
Feature activation+1.08e+00

Pos logit contributions
Sorry+0.018
Kids+0.017
Listen+0.017
Dad+0.017
Don+0.017
Neg logit contributions
et-0.018
lers-0.017
an-0.017
ish-0.017
ig-0.017
Token,
Token ID11
Feature activation-7.57e+00

Pos logit contributions
cks+0.010
an+0.010
et+0.010
ish+0.010
ig+0.009
Neg logit contributions
Listen-0.011
Kids-0.011
Don-0.010
Dad-0.010
Hi-0.010
Token and
Token ID290
Feature activation-7.20e+00

Pos logit contributions
Listen+0.070
Kids+0.070
Don+0.069
Sorry+0.068
Sure+0.066
Neg logit contributions
et-0.079
ish-0.078
an-0.078
lers-0.077
ig-0.075
Token liked
Token ID8288
Feature activation-3.87e+00

Pos logit contributions
Sorry+0.060
Kids+0.059
Listen+0.059
Don+0.057
Sure+0.055
Neg logit contributions
et-0.083
an-0.080
lers-0.080
ish-0.080
ig-0.078
Token much
Token ID881
Feature activation-1.10e+00

Pos logit contributions
Sorry+0.068
Listen+0.068
Kids+0.067
Don+0.066
Dad+0.064
Neg logit contributions
et-0.082
an-0.079
ish-0.078
lers-0.077
ig-0.076
Token fun
Token ID1257
Feature activation-3.38e-01

Pos logit contributions
Listen+0.009
Sorry+0.009
Kids+0.008
Don+0.008
Dad+0.008
Neg logit contributions
et-0.013
an-0.013
ish-0.013
lers-0.013
ig-0.013
Token it
Token ID340
Feature activation+1.49e+00

Pos logit contributions
Kids+0.002
Listen+0.002
Sorry+0.002
Don+0.002
Dad+0.002
Neg logit contributions
an-0.004
et-0.004
ish-0.004
ig-0.004
lers-0.004
Token was
Token ID373
Feature activation-4.07e+00

Pos logit contributions
et+0.017
an+0.017
ish+0.017
lers+0.017
ig+0.017
Neg logit contributions
Listen-0.012
Sorry-0.012
Kids-0.012
Don-0.011
Sure-0.011
Token to
Token ID284
Feature activation-7.49e+00

Pos logit contributions
Listen+0.039
Don+0.038
Sorry+0.037
Dad+0.036
On+0.035
Neg logit contributions
ig-0.040
lers-0.039
an-0.039
cks-0.039
et-0.038
Token be
Token ID307
Feature activation-5.09e+00

Pos logit contributions
Sorry+0.063
Listen+0.062
Kids+0.061
Dad+0.060
Don+0.059
Neg logit contributions
et-0.085
lers-0.083
ish-0.082
an-0.082
ig-0.081
Token the
Token ID262
Feature activation-2.21e+00

Pos logit contributions
Sorry+0.044
Dad+0.044
Don+0.043
Thanks+0.042
Listen+0.042
Neg logit contributions
ig-0.053
lers-0.053
an-0.053
ish-0.051
et-0.051
Token mail
Token ID6920
Feature activation+7.07e+00

Pos logit contributions
Sorry+0.025
Listen+0.024
Don+0.024
Sure+0.024
Dad+0.024
Neg logit contributions
lers-0.019
et-0.019
cks-0.019
ig-0.018
an-0.018
Tokenman
Token ID805
Feature activation+5.19e+00

Pos logit contributions
cks+0.054
lers+0.054
et+0.052
ig+0.052
igs+0.051
Neg logit contributions
Sorry-0.083
Dad-0.081
Listen-0.081
Sure-0.080
Don-0.080
Token!
Token ID0
Feature activation-1.60e+01

Pos logit contributions
cks+0.042
an+0.042
et+0.042
lers+0.041
igs+0.041
Neg logit contributions
Sorry-0.055
Don-0.055
Listen-0.055
Sure-0.054
Dad-0.053
Token\n
Token ID198
Feature activation-1.60e+01

Pos logit contributions
Sorry+0.133
Listen+0.125
Kids+0.120
Um+0.113
Don+0.113
Neg logit contributions
et-0.173
ish-0.171
ig-0.171
lers-0.170
an-0.169
Token.
Token ID13
Feature activation-1.65e+01

Pos logit contributions
an+0.032
ish+0.032
 egg+0.031
cks+0.031
ig+0.031
Neg logit contributions
Sorry-0.041
Kids-0.040
Listen-0.040
Dad-0.040
By-0.040
Token What
Token ID1867
Feature activation-9.87e+00

Pos logit contributions
Sorry+0.125
Listen+0.114
Kids+0.114
Dad+0.103
Sure+0.102
Neg logit contributions
et-0.201
ish-0.192
ig-0.191
an-0.191
lers-0.191
Token a
Token ID257
Feature activation-5.76e+00

Pos logit contributions
Sorry+0.093
Listen+0.092
Kids+0.092
Don+0.089
Dad+0.087
Neg logit contributions
et-0.102
an-0.099
lers-0.098
ish-0.098
ig-0.096
Token wonderful
Token ID7932
Feature activation-2.32e+00

Pos logit contributions
Sorry+0.061
Listen+0.061
Kids+0.061
Don+0.059
Dad+0.057
Neg logit contributions
et-0.053
an-0.050
ish-0.050
lers-0.049
 egg-0.049
Token day
Token ID1110
Feature activation-4.46e+00

Pos logit contributions
Kids+0.024
Listen+0.024
Sorry+0.024
Sure+0.024
Dad+0.023
Neg logit contributions
lers-0.021
cks-0.021
et-0.021
ish-0.020
an-0.020
Token for
Token ID329
Feature activation-9.48e+00

Pos logit contributions
Sorry+0.051
Kids+0.051
Listen+0.050
Don+0.050
Dad+0.049
Neg logit contributions
an-0.036
ish-0.035
et-0.035
lers-0.034
 egg-0.033
Token the
Token ID262
Feature activation-5.28e+00

Pos logit contributions
Kids+0.092
Listen+0.091
Sorry+0.091
Dad+0.088
Don+0.086
Neg logit contributions
et-0.093
ish-0.092
an-0.092
lers-0.091
ig-0.088
Token lion
Token ID18744
Feature activation+2.74e+00

Pos logit contributions
Listen+0.057
Sorry+0.057
Kids+0.057
Don+0.055
Sure+0.055
Neg logit contributions
et-0.047
ish-0.047
lers-0.046
ig-0.045
an-0.045
Token!
Token ID0
Feature activation-1.64e+01

Pos logit contributions
igs+0.024
et+0.024
an+0.024
cks+0.024
ish+0.024
Neg logit contributions
By-0.026
Don-0.026
Kids-0.026
Sorry-0.026
Listen-0.026
Token<|endoftext|>
Token ID50256
Feature activation-9.42e+00

Pos logit contributions
Sorry+0.152
Listen+0.140
Kids+0.135
Um+0.129
Exc+0.127
Neg logit contributions
ig-0.158
et-0.157
c-0.156
lers-0.155
ish-0.155
TokenOnce
Token ID7454
Feature activation-1.12e+01

Pos logit contributions
Sorry+0.076
Listen+0.074
Kids+0.074
Don+0.071
Sure+0.069
Neg logit contributions
et-0.110
lers-0.106
an-0.106
ish-0.105
ig-0.104
Token
Token ID129
Feature activation-7.07e+00

Pos logit contributions
Sorry+0.087
Listen+0.086
Kids+0.086
Don+0.083
Dad+0.081
Neg logit contributions
et-0.097
an-0.092
ish-0.092
lers-0.092
 egg-0.090
Token
Token ID241
Feature activation-1.43e+01

Pos logit contributions
Listen+0.068
Kids+0.067
Don+0.066
Sorry+0.066
Dad+0.064
Neg logit contributions
ish-0.068
lers-0.068
ig-0.067
et-0.066
 egg-0.066
TokenWho
Token ID8241
Feature activation-1.20e+01

Pos logit contributions
Sorry+0.081
Kids+0.076
Listen+0.071
Sure+0.070
Together+0.067
Neg logit contributions
et-0.188
an-0.180
cks-0.175
lers-0.174
 egg-0.174
Tokenâ
Token ID22940
Feature activation-7.17e+00

Pos logit contributions
Sorry+0.094
Listen+0.093
Kids+0.093
Don+0.089
Dad+0.089
Neg logit contributions
et-0.137
an-0.133
ish-0.132
lers-0.131
ig-0.131
Token
Token ID26391
Feature activation-9.69e+00

Pos logit contributions
Kids+0.068
Listen+0.068
Sorry+0.068
Don+0.065
Sure+0.064
Neg logit contributions
et-0.072
lers-0.071
ish-0.070
an-0.070
 egg-0.070
Token
Token ID8151
Feature activation-7.62e+00

Pos logit contributions
Sorry+0.082
Kids+0.081
Listen+0.081
Don+0.077
Sure+0.077
Neg logit contributions
et-0.108
lers-0.105
an-0.104
ish-0.104
ig-0.103
Tokens
Token ID82
Feature activation-1.10e+01

Pos logit contributions
Sorry+0.075
Kids+0.074
Listen+0.074
Don+0.071
Dad+0.070
Neg logit contributions
et-0.075
an-0.072
lers-0.071
ish-0.071
 egg-0.070
Token there
Token ID612
Feature activation-5.54e+00

Pos logit contributions
Listen+0.100
Kids+0.098
Sorry+0.097
Don+0.097
Hi+0.096
Neg logit contributions
et-0.113
lers-0.107
ig-0.107
an-0.107
ish-0.105
Token?
Token ID30
Feature activation-1.85e+01

Pos logit contributions
Listen+0.059
Kids+0.058
Sorry+0.058
Dad+0.058
Don+0.058
Neg logit contributions
et-0.049
lers-0.048
an-0.047
ish-0.046
ig-0.045
Tokenâ
Token ID22940
Feature activation-8.77e+00

Pos logit contributions
Sorry+0.141
Listen+0.135
Kids+0.134
Don+0.120
Dad+0.120
Neg logit contributions
ish-0.210
lers-0.207
et-0.204
 egg-0.201
c-0.201
Token
Token ID26391
Feature activation-1.13e+01

Pos logit contributions
Kids+0.077
Listen+0.076
Sorry+0.076
Don+0.073
Sure+0.071
Neg logit contributions
lers-0.094
et-0.093
ish-0.093
 egg-0.092
an-0.091