TOP ACTIVATIONS
MAX = 25.128

. Can I see it ?" L ily nods
I have one , Tom ?" Mia asks . She stands
! What did you do ?" Lily said . She was
read this book to me ?" Mom smiles and
Can I pet your dog ?" The boy said
Where did you find this ?" Jill 's mom asked .
Can you spin for me ?" Spot spun around and around
can we have the key ?" L ila asked .
" What did you do ?" Mom asks . " Why
Sam , a big pig !" Tim says .
Can we go outside now ?" Anna and Ben asked .
" Go away , dog !" Ben shouted . " We
. He works at night ." L ily says
. He might bite you ." Ben is brave
" Did you see that ?" Ben asked Lily . "
" You did it !" Lily said .
Lily . Are you lost ?" she asked .
" Who is there ?" a voice said .
can I get for you ?" he asks .
how can we get them ?" Max says . He looks

MIN ACTIVATIONS
MIN = -69.689

children put on their bath ro bes and ran around in
't know the rh in oc eros was wild . Can
harder to push the wheel bar row . But her backyard
reward . It was ch ocol ates and sweets !
When Daisy put the magn ifying glass down , the cater
he would take the av oc ados and scatter them around
his head out of his bur row and saw something new
am taking the rh in oc eros to a new home
giving him too many ch ocol ates ." The king learned
you go , my d arl ings . Enjoy your lunch
give to the rh in oc eros . Lily walked to
quiet compared to the noisy se ag ull s outside .
you carry a l ily pad too ?" The
as he opened the ch ocol ates and started eating them
Everyday he would take the av oc ados and scatter them
it to the rh in oc eros . When he tasted
flavors they liked and what topp ings would go best .
He touched the brightly - colored pet als , and as
there was a rh in oc eros who was always naughty
's wonderful , my d arl ings !" Mom my says

INTERVAL 1.424 - 25.128
CONTAINS 49.695%

was chicken and broccoli , which Lucy loved . She was
so happy and his friends were excited too . They all
and lay down in the grass . She spotted a small
a youth was walking down the street when he noticed something
fell down with a loud scream . Tom heard

INTERVAL -22.281 - 1.424
CONTAINS 46.740%

sad and Lily told her about the cane . Her mom
use it again . They buried it with Lily and hoped
mom came over and thanked Tim my for being so kind
. " Sure , you can . The more
and smiled - he felt like a real doctor !

INTERVAL -45.985 - -22.281
CONTAINS 3.548%

He will always have good memories of the unknown island ,
was happy and typed a reply . But while
it is best not to be weak . You need to
the sky when she was only three . She remembered how
, Fred saw an attractive red blanket on the grass .

INTERVAL -69.689 - -45.985
CONTAINS 0.017%

mist and into their cozy bur rows . The end .
a stick in the bird c age and the bird balanced
it to the rh in oc eros . When he tasted
and watched as the wood pe cker worked away . He
itty wanted to introduce Dog gy to her ball . She
Token.
Token ID13
Feature activation+2.02e+01

Pos logit contributions
Kids+0.110
Well+0.102
Okay+0.099
Sorry+0.097
Dad+0.096
Neg logit contributions
 up-0.064
an-0.057
 soaring-0.057
ish-0.053
lers-0.052
Token Can
Token ID1680
Feature activation+1.90e+01

Pos logit contributions
Kids+0.174
Well+0.165
Enough+0.162
Sorry+0.157
Okay+0.156
Neg logit contributions
 up-0.207
 out-0.174
an-0.164
--0.164
 crown-0.160
Token I
Token ID314
Feature activation+1.70e+01

Pos logit contributions
Kids+0.188
Well+0.175
Okay+0.163
Don+0.161
Sorry+0.160
Neg logit contributions
 up-0.187
--0.152
an-0.151
ish-0.146
 crown-0.146
Token see
Token ID766
Feature activation+1.53e+01

Pos logit contributions
Kids+0.167
Well+0.158
Okay+0.147
Sorry+0.146
Don+0.146
Neg logit contributions
 up-0.167
an-0.142
--0.136
ish-0.136
 crown-0.132
Token it
Token ID340
Feature activation+1.18e+01

Pos logit contributions
Kids+0.134
Well+0.124
Dad+0.119
Daddy+0.119
Listen+0.119
Neg logit contributions
an-0.144
asis-0.140
 up-0.123
ish-0.123
 nest-0.121
Token?"
Token ID1701
Feature activation+2.45e+01

Pos logit contributions
Kids+0.128
Well+0.123
Sorry+0.121
Don+0.117
Listen+0.116
Neg logit contributions
 up-0.084
an-0.082
lers-0.076
ish-0.075
--0.068
Token
Token ID198
Feature activation+2.25e+01

Pos logit contributions
Kids+0.228
Okay+0.224
Well+0.222
Sorry+0.222
Enough+0.219
Neg logit contributions
 up-0.240
--0.198
 home-0.198
 out-0.191
 crown-0.176
Token
Token ID198
Feature activation+2.28e+01

Pos logit contributions
Kids+0.208
Well+0.205
Uh+0.197
Okay+0.196
Sorry+0.193
Neg logit contributions
 up-0.230
--0.194
 home-0.182
 out-0.182
an-0.175
TokenL
Token ID43
Feature activation+1.85e+01

Pos logit contributions
Well+0.144
Kids+0.144
Listen+0.126
Enough+0.123
Don+0.118
Neg logit contributions
 up-0.265
--0.255
an-0.235
ling-0.231
 in-0.225
Tokenily
Token ID813
Feature activation+1.89e+01

Pos logit contributions
Kids+0.126
Well+0.111
Okay+0.098
Sorry+0.096
Don+0.096
Neg logit contributions
 up-0.242
an-0.213
ish-0.209
lers-0.208
 soaring-0.208
Token nods
Token ID34833
Feature activation+1.59e+01

Pos logit contributions
Kids+0.187
Well+0.176
Okay+0.164
Don+0.162
Sorry+0.159
Neg logit contributions
 up-0.174
an-0.154
--0.147
ish-0.145
 soaring-0.144
Token I
Token ID314
Feature activation+1.63e+01

Pos logit contributions
Kids+0.170
Well+0.158
Okay+0.147
Don+0.145
Sorry+0.144
Neg logit contributions
 up-0.189
an-0.159
--0.155
ish-0.154
 crown-0.151
Token have
Token ID423
Feature activation+1.59e+01

Pos logit contributions
Kids+0.136
Well+0.131
Sorry+0.123
Okay+0.120
Don+0.119
Neg logit contributions
 up-0.169
an-0.154
lers-0.145
 soaring-0.143
--0.139
Token one
Token ID530
Feature activation+1.05e+01

Pos logit contributions
Well+0.153
Kids+0.149
Sorry+0.146
Don+0.145
Listen+0.145
Neg logit contributions
an-0.138
 up-0.129
ish-0.117
ling-0.116
ig-0.115
Token,
Token ID11
Feature activation+1.96e+01

Pos logit contributions
Well+0.095
Um+0.091
Sorry+0.091
No+0.090
 She+0.089
Neg logit contributions
 up-0.076
 soaring-0.075
an-0.073
ans-0.071
eking-0.069
Token Tom
Token ID4186
Feature activation+1.40e+01

Pos logit contributions
Kids+0.149
Well+0.138
Okay+0.125
Don+0.124
Sorry+0.124
Neg logit contributions
 up-0.237
an-0.206
--0.201
ish-0.200
 crown-0.198
Token?"
Token ID1701
Feature activation+2.44e+01

Pos logit contributions
Kids+0.129
Ben+0.114
Dad+0.114
Okay+0.114
Well+0.113
Neg logit contributions
 up-0.117
 soaring-0.112
lers-0.107
an-0.107
 feathers-0.104
Token Mia
Token ID32189
Feature activation+1.68e+01

Pos logit contributions
Kids+0.256
Enough+0.233
Well+0.232
Okay+0.231
Em+0.230
Neg logit contributions
 up-0.227
--0.181
 out-0.173
 pe-0.168
 home-0.167
Token asks
Token ID7893
Feature activation+1.98e+01

Pos logit contributions
Kids+0.143
Well+0.134
Okay+0.123
Don+0.122
Sorry+0.122
Neg logit contributions
 up-0.189
an-0.163
--0.158
ish-0.158
 crown-0.154
Token.
Token ID13
Feature activation+2.26e+01

Pos logit contributions
Kids+0.267
Well+0.254
Okay+0.243
Don+0.243
Sorry+0.242
Neg logit contributions
 up-0.117
an-0.095
ish-0.086
--0.083
 crown-0.082
Token She
Token ID1375
Feature activation+1.86e+01

Pos logit contributions
Kids+0.190
oral+0.170
Em+0.167
Uh+0.158
Listen+0.157
Neg logit contributions
 up-0.283
 out-0.237
 in-0.214
 pe-0.211
--0.206
Token stands
Token ID6296
Feature activation+1.47e+01

Pos logit contributions
Kids+0.173
Well+0.167
Okay+0.151
Don+0.149
Enough+0.148
Neg logit contributions
 up-0.170
an-0.162
 home-0.156
 soaring-0.149
asis-0.148
Token!
Token ID0
Feature activation+1.99e+01

Pos logit contributions
Kids+0.163
Sorry+0.157
Well+0.155
Okay+0.150
Enough+0.147
Neg logit contributions
 up-0.096
an-0.088
ish-0.075
 soaring-0.072
lers-0.070
Token What
Token ID1867
Feature activation+1.66e+01

Pos logit contributions
Kids+0.156
Well+0.147
Okay+0.141
Don+0.136
Listen+0.132
Neg logit contributions
 up-0.226
--0.196
 out-0.194
ish-0.193
an-0.192
Token did
Token ID750
Feature activation+1.37e+01

Pos logit contributions
Kids+0.154
Well+0.144
Okay+0.134
Don+0.133
Sorry+0.133
Neg logit contributions
 up-0.174
an-0.149
--0.144
ish-0.143
 crown-0.140
Token you
Token ID345
Feature activation+1.64e+01

Pos logit contributions
Kids+0.128
Well+0.119
Sorry+0.112
Don+0.111
Okay+0.110
Neg logit contributions
 up-0.133
an-0.123
ish-0.119
ling-0.110
lers-0.108
Token do
Token ID466
Feature activation+1.31e+01

Pos logit contributions
Kids+0.145
Well+0.137
Sorry+0.129
Okay+0.128
Don+0.127
Neg logit contributions
 up-0.170
an-0.150
--0.143
lers-0.142
 crown-0.140
Token?"
Token ID1701
Feature activation+2.43e+01

Pos logit contributions
Kids+0.152
Well+0.143
Okay+0.136
Sorry+0.136
Don+0.134
Neg logit contributions
 up-0.095
an-0.086
ish-0.082
lers-0.080
 soaring-0.078
Token Lily
Token ID20037
Feature activation+1.68e+01

Pos logit contributions
Kids+0.246
Okay+0.237
Well+0.220
oral+0.214
Listen+0.214
Neg logit contributions
 up-0.234
 out-0.196
--0.185
 home-0.176
 in-0.175
Token said
Token ID531
Feature activation+1.99e+01

Pos logit contributions
Kids+0.155
Well+0.145
Okay+0.138
Sorry+0.134
Listen+0.133
Neg logit contributions
 up-0.182
--0.153
 out-0.148
an-0.147
lers-0.144
Token.
Token ID13
Feature activation+2.22e+01

Pos logit contributions
Kids+0.252
Well+0.239
Sorry+0.229
Don+0.229
Okay+0.228
Neg logit contributions
 up-0.121
an-0.113
ish-0.099
 crown-0.097
asis-0.097
Token She
Token ID1375
Feature activation+1.84e+01

Pos logit contributions
Kids+0.239
Okay+0.227
Sorry+0.216
Uh+0.213
Hmm+0.212
Neg logit contributions
 up-0.195
 out-0.172
 home-0.141
 in-0.140
--0.134
Token was
Token ID373
Feature activation+1.74e+01

Pos logit contributions
Kids+0.180
Well+0.168
Okay+0.159
Sorry+0.156
Don+0.154
Neg logit contributions
 up-0.184
--0.155
an-0.152
 crown-0.147
 out-0.146
Token read
Token ID1100
Feature activation+1.27e+01

Pos logit contributions
Kids+0.112
Well+0.103
Okay+0.095
Sorry+0.094
Don+0.094
Neg logit contributions
 up-0.189
an-0.170
ish-0.163
--0.161
lers-0.159
Token this
Token ID428
Feature activation+9.17e+00

Pos logit contributions
Kids+0.133
Well+0.125
Max+0.120
Dad+0.120
Grand+0.119
Neg logit contributions
an-0.097
 up-0.093
ish-0.091
lers-0.081
asis-0.081
Token book
Token ID1492
Feature activation+7.67e+00

Pos logit contributions
Kids+0.090
Well+0.089
Okay+0.081
Listen+0.081
Don+0.081
Neg logit contributions
 up-0.076
an-0.070
ish-0.069
lers-0.065
 soaring-0.062
Token to
Token ID284
Feature activation+1.51e+01

Pos logit contributions
Kids+0.091
Well+0.088
Okay+0.084
Sorry+0.083
Don+0.082
Neg logit contributions
 up-0.052
an-0.046
ish-0.043
lers-0.041
 soaring-0.040
Token me
Token ID502
Feature activation+1.21e+01

Pos logit contributions
Kids+0.134
Well+0.125
Don+0.118
Dad+0.117
Okay+0.117
Neg logit contributions
 up-0.152
an-0.139
ish-0.136
lers-0.132
ling-0.125
Token?"
Token ID1701
Feature activation+2.43e+01

Pos logit contributions
Kids+0.119
Well+0.114
Grand+0.109
Okay+0.108
Um+0.108
Neg logit contributions
 up-0.093
an-0.089
lers-0.085
ish-0.081
 soaring-0.079
Token
Token ID198
Feature activation+2.21e+01

Pos logit contributions
Kids+0.242
Enough+0.230
Well+0.227
Uh+0.224
Okay+0.217
Neg logit contributions
 up-0.232
--0.202
 home-0.196
 out-0.186
 crown-0.167
Token
Token ID198
Feature activation+2.22e+01

Pos logit contributions
Well+0.218
Kids+0.216
Uh+0.203
Enough+0.199
Okay+0.198
Neg logit contributions
 up-0.213
--0.189
 home-0.175
 out-0.171
an-0.152
TokenMom
Token ID29252
Feature activation+1.82e+01

Pos logit contributions
Kids+0.166
Well+0.164
Enough+0.144
Don+0.139
Uh+0.135
Neg logit contributions
 up-0.252
--0.252
 home-0.215
an-0.212
 crown-0.208
Token smiles
Token ID21845
Feature activation+1.45e+01

Pos logit contributions
Kids+0.185
Well+0.175
Don+0.168
Sorry+0.167
Okay+0.166
Neg logit contributions
 up-0.180
an-0.144
--0.144
ish-0.141
 crown-0.139
Token and
Token ID290
Feature activation+1.92e+01

Pos logit contributions
Kids+0.197
Well+0.189
Okay+0.179
Don+0.178
Sorry+0.178
Neg logit contributions
 up-0.090
an-0.067
--0.063
ish-0.062
 crown-0.060
TokenCan
Token ID6090
Feature activation+1.70e+01

Pos logit contributions
Kids+0.102
Well+0.096
Okay+0.077
Enough+0.076
Sorry+0.075
Neg logit contributions
 up-0.231
an-0.202
--0.200
 out-0.197
 crown-0.192
Token I
Token ID314
Feature activation+1.56e+01

Pos logit contributions
Kids+0.162
Well+0.152
Okay+0.141
Sorry+0.139
Don+0.138
Neg logit contributions
 up-0.175
an-0.145
--0.143
ish-0.139
 out-0.138
Token pet
Token ID4273
Feature activation+1.05e+01

Pos logit contributions
Kids+0.150
Well+0.138
Okay+0.131
Sorry+0.131
Don+0.129
Neg logit contributions
 up-0.155
an-0.135
ish-0.127
--0.126
lers-0.125
Token your
Token ID534
Feature activation+9.74e+00

Pos logit contributions
Kids+0.103
Well+0.094
Don+0.090
Sorry+0.088
Okay+0.087
Neg logit contributions
 up-0.097
an-0.090
 soaring-0.085
 crown-0.084
ish-0.084
Token dog
Token ID3290
Feature activation+9.35e+00

Pos logit contributions
Kids+0.091
Okay+0.083
Well+0.083
Don+0.081
Sorry+0.079
Neg logit contributions
 up-0.087
asis-0.075
an-0.074
 soaring-0.073
ling-0.072
Token?"
Token ID1701
Feature activation+2.41e+01

Pos logit contributions
Kids+0.111
Well+0.104
Sorry+0.101
Don+0.100
Okay+0.100
Neg logit contributions
 up-0.062
an-0.055
 soaring-0.048
lers-0.048
ish-0.047
Token
Token ID198
Feature activation+2.16e+01

Pos logit contributions
Kids+0.237
Uh+0.232
Well+0.228
Sorry+0.222
Okay+0.219
Neg logit contributions
 up-0.231
--0.183
 out-0.178
 home-0.174
 flying-0.166
Token
Token ID198
Feature activation+2.19e+01

Pos logit contributions
Kids+0.196
Well+0.194
Uh+0.188
Okay+0.182
Sorry+0.177
Neg logit contributions
 up-0.215
--0.188
 out-0.171
 home-0.170
an-0.168
TokenThe
Token ID464
Feature activation+1.85e+01

Pos logit contributions
Well+0.165
Kids+0.165
Uh+0.141
Enough+0.135
Listen+0.135
Neg logit contributions
 up-0.236
--0.228
an-0.208
 feathers-0.191
 home-0.191
Token boy
Token ID2933
Feature activation+1.65e+01

Pos logit contributions
Kids+0.199
Well+0.193
Okay+0.178
Sorry+0.177
Don+0.175
Neg logit contributions
 up-0.167
an-0.136
--0.132
 home-0.126
 out-0.124
Token said
Token ID531
Feature activation+2.01e+01

Pos logit contributions
Kids+0.149
Well+0.137
Sorry+0.126
Okay+0.126
Don+0.124
Neg logit contributions
 up-0.181
--0.154
an-0.153
ish-0.148
 crown-0.146
TokenWhere
Token ID8496
Feature activation+1.59e+01

Pos logit contributions
uddy+0.124
 Sadly+0.124
oral+0.119
enny+0.119
Kids+0.118
Neg logit contributions
 up-0.182
 pe-0.170
 feathers-0.166
 sand-0.162
 crown-0.161
Token did
Token ID750
Feature activation+1.30e+01

Pos logit contributions
Kids+0.118
Well+0.110
Don+0.100
Okay+0.100
Sorry+0.098
Neg logit contributions
 up-0.190
an-0.174
ish-0.163
 home-0.161
lers-0.160
Token you
Token ID345
Feature activation+1.59e+01

Pos logit contributions
Kids+0.143
Well+0.134
Don+0.126
Sorry+0.126
Okay+0.125
Neg logit contributions
 up-0.110
an-0.097
ish-0.092
--0.086
lers-0.085
Token find
Token ID1064
Feature activation+1.32e+01

Pos logit contributions
Kids+0.136
Well+0.129
Don+0.123
Dad+0.122
Sorry+0.122
Neg logit contributions
 up-0.154
an-0.141
lers-0.136
--0.131
rier-0.128
Token this
Token ID428
Feature activation+8.85e+00

Pos logit contributions
Kids+0.111
Well+0.099
Listen+0.097
Daddy+0.097
Dad+0.096
Neg logit contributions
an-0.118
ling-0.112
rier-0.108
ish-0.105
 up-0.103
Token?"
Token ID1701
Feature activation+2.40e+01

Pos logit contributions
Kids+0.081
Well+0.081
That+0.076
Don+0.075
Listen+0.075
Neg logit contributions
an-0.070
lers-0.067
 up-0.064
ish-0.064
ans-0.063
Token Jill
Token ID22772
Feature activation+1.61e+01

Pos logit contributions
Kids+0.230
Okay+0.208
Well+0.203
Enough+0.199
Max+0.195
Neg logit contributions
 up-0.260
 out-0.215
--0.205
 home-0.202
 crown-0.195
Token's
Token ID338
Feature activation+1.62e+01

Pos logit contributions
Kids+0.138
Well+0.127
Okay+0.120
Sorry+0.118
Don+0.117
Neg logit contributions
 up-0.180
an-0.153
--0.151
ish-0.149
 out-0.147
Token mom
Token ID1995
Feature activation+1.70e+01

Pos logit contributions
Kids+0.131
Well+0.122
Okay+0.113
Sorry+0.112
Don+0.110
Neg logit contributions
 up-0.180
an-0.161
ish-0.154
lers-0.152
 soaring-0.151
Token asked
Token ID1965
Feature activation+1.87e+01

Pos logit contributions
Kids+0.154
Well+0.143
Okay+0.137
Sorry+0.133
Enough+0.132
Neg logit contributions
 up-0.183
--0.156
an-0.149
 out-0.148
ish-0.146
Token.
Token ID13
Feature activation+2.22e+01

Pos logit contributions
Kids+0.219
Well+0.205
Don+0.202
Okay+0.196
Dad+0.195
Neg logit contributions
 up-0.122
an-0.119
asis-0.109
ish-0.106
 crown-0.103
Token Can
Token ID1680
Feature activation+1.79e+01

Pos logit contributions
Kids+0.155
Well+0.145
Okay+0.139
Sorry+0.139
Together+0.133
Neg logit contributions
 up-0.225
 out-0.192
ish-0.183
 soaring-0.181
an-0.180
Token you
Token ID345
Feature activation+1.61e+01

Pos logit contributions
Kids+0.193
Well+0.181
Okay+0.170
Uh+0.168
Enough+0.167
Neg logit contributions
 up-0.155
--0.123
an-0.120
 out-0.117
 soaring-0.116
Token spin
Token ID7906
Feature activation+1.15e+01

Pos logit contributions
Kids+0.164
Well+0.155
Okay+0.145
Sorry+0.144
Don+0.144
Neg logit contributions
 up-0.151
an-0.129
ish-0.123
--0.122
lers-0.118
Token for
Token ID329
Feature activation+1.47e+01

Pos logit contributions
Kids+0.130
Well+0.126
Don+0.121
Max+0.120
Okay+0.117
Neg logit contributions
an-0.076
asis-0.067
 up-0.066
ish-0.065
 nest-0.061
Token me
Token ID502
Feature activation+1.17e+01

Pos logit contributions
Kids+0.166
Well+0.153
Sorry+0.146
Enough+0.145
Listen+0.144
Neg logit contributions
 up-0.118
an-0.108
ish-0.103
ling-0.093
lers-0.091
Token?"
Token ID1701
Feature activation+2.39e+01

Pos logit contributions
Kids+0.108
Well+0.102
Okay+0.098
Sorry+0.096
Listen+0.094
Neg logit contributions
an-0.100
 up-0.096
rier-0.092
lers-0.089
ish-0.087
Token Spot
Token ID15899
Feature activation+1.69e+01

Pos logit contributions
Kids+0.223
Well+0.202
Sorry+0.198
Uh+0.195
Okay+0.193
Neg logit contributions
 up-0.255
--0.205
 home-0.198
 out-0.196
ish-0.190
Token spun
Token ID26843
Feature activation+1.54e+01

Pos logit contributions
Kids+0.166
Well+0.156
Sorry+0.150
Enough+0.147
Listen+0.145
Neg logit contributions
 up-0.172
--0.145
an-0.140
ish-0.136
 crown-0.130
Token around
Token ID1088
Feature activation+1.54e+01

Pos logit contributions
Kids+0.239
Well+0.229
Okay+0.219
Sorry+0.219
Don+0.216
Neg logit contributions
 up-0.072
an-0.043
--0.042
 out-0.039
ish-0.038
Token and
Token ID290
Feature activation+1.89e+01

Pos logit contributions
Kids+0.201
Well+0.193
Don+0.183
Okay+0.183
Sorry+0.181
Neg logit contributions
 up-0.101
an-0.081
ish-0.076
--0.073
 crown-0.071
Token around
Token ID1088
Feature activation+1.64e+01

Pos logit contributions
Kids+0.177
Well+0.168
Don+0.156
Okay+0.155
Sorry+0.153
Neg logit contributions
 up-0.188
an-0.168
ish-0.160
--0.158
 crown-0.154
Token can
Token ID460
Feature activation+1.75e+01

Pos logit contributions
Kids+0.188
Well+0.176
Okay+0.162
Don+0.161
Sorry+0.160
Neg logit contributions
 up-0.213
an-0.186
--0.177
ish-0.175
lers-0.172
Token we
Token ID356
Feature activation+1.60e+01

Pos logit contributions
Kids+0.173
Well+0.164
Okay+0.154
Don+0.152
Sorry+0.149
Neg logit contributions
 up-0.173
an-0.140
--0.139
ish-0.138
 out-0.136
Token have
Token ID423
Feature activation+1.53e+01

Pos logit contributions
Kids+0.150
Well+0.142
Okay+0.134
Sorry+0.134
Don+0.131
Neg logit contributions
 up-0.159
an-0.140
--0.130
lers-0.130
ish-0.129
Token the
Token ID262
Feature activation+1.18e+01

Pos logit contributions
Kids+0.159
Well+0.155
Sorry+0.148
Listen+0.147
Don+0.145
Neg logit contributions
an-0.126
 up-0.121
ish-0.109
ling-0.104
ig-0.100
Token key
Token ID1994
Feature activation+6.23e+00

Pos logit contributions
Okay+0.114
Kids+0.112
Well+0.112
Sorry+0.109
Uh+0.106
Neg logit contributions
 up-0.095
an-0.087
ling-0.082
lers-0.081
--0.081
Token?"
Token ID1701
Feature activation+2.39e+01

Pos logit contributions
Kids+0.059
Well+0.058
Sorry+0.057
Okay+0.055
Dad+0.054
Neg logit contributions
an-0.048
 up-0.046
 soaring-0.044
lers-0.043
 feathers-0.042
Token L
Token ID406
Feature activation+1.56e+01

Pos logit contributions
Okay+0.250
Kids+0.242
Enough+0.242
Listen+0.235
Well+0.228
Neg logit contributions
 up-0.203
 out-0.171
 pe-0.162
 home-0.161
 in-0.150
Tokenila
Token ID10102
Feature activation+1.61e+01

Pos logit contributions
Kids+0.116
Well+0.106
Okay+0.096
Don+0.095
Sorry+0.095
Neg logit contributions
 up-0.192
an-0.169
--0.164
ish-0.163
 crown-0.160
Token asked
Token ID1965
Feature activation+1.94e+01

Pos logit contributions
Kids+0.145
Well+0.134
Okay+0.126
Sorry+0.125
Don+0.123
Neg logit contributions
 up-0.176
--0.147
an-0.146
ish-0.142
 out-0.142
Token.
Token ID13
Feature activation+2.23e+01

Pos logit contributions
Kids+0.255
Well+0.242
Okay+0.232
Don+0.232
Sorry+0.230
Neg logit contributions
 up-0.116
an-0.099
ish-0.089
 crown-0.086
--0.084
Token
Token ID198
Feature activation+2.21e+01

Pos logit contributions
Kids+0.236
Okay+0.217
Well+0.213
Enough+0.212
Uh+0.212
Neg logit contributions
 up-0.211
 out-0.180
 home-0.156
 in-0.144
--0.139
Token"
Token ID1
Feature activation+2.03e+01

Pos logit contributions
Kids+0.227
Well+0.221
Okay+0.215
Uh+0.211
Um+0.210
Neg logit contributions
 up-0.163
--0.133
 inside-0.117
 in-0.117
 out-0.116
TokenWhat
Token ID2061
Feature activation+1.62e+01

Pos logit contributions
Well+0.114
Kids+0.111
Okay+0.106
 Did+0.105
 On+0.105
Neg logit contributions
 up-0.232
--0.192
 pe-0.191
 inside-0.190
 in-0.189
Token did
Token ID750
Feature activation+1.33e+01

Pos logit contributions
Kids+0.170
Well+0.162
Okay+0.154
Sorry+0.150
Don+0.149
Neg logit contributions
 up-0.149
--0.122
 out-0.119
an-0.115
ish-0.115
Token you
Token ID345
Feature activation+1.58e+01

Pos logit contributions
Kids+0.150
Well+0.142
Okay+0.133
Don+0.133
Sorry+0.133
Neg logit contributions
 up-0.113
an-0.093
ish-0.089
--0.088
 crown-0.085
Token do
Token ID466
Feature activation+1.25e+01

Pos logit contributions
Kids+0.173
Well+0.163
Okay+0.152
Don+0.149
Sorry+0.148
Neg logit contributions
 up-0.148
an-0.117
ish-0.116
 out-0.115
--0.115
Token?"
Token ID1701
Feature activation+2.39e+01

Pos logit contributions
Kids+0.166
Well+0.159
Okay+0.151
Don+0.151
Sorry+0.150
Neg logit contributions
 up-0.082
an-0.061
--0.060
ish-0.057
 out-0.056
Token Mom
Token ID11254
Feature activation+1.68e+01

Pos logit contributions
Kids+0.243
Okay+0.242
Well+0.229
Sorry+0.218
Alright+0.215
Neg logit contributions
 up-0.224
 out-0.181
 in-0.169
--0.164
 home-0.161
Token asks
Token ID7893
Feature activation+1.93e+01

Pos logit contributions
Kids+0.154
Well+0.147
Okay+0.141
Sorry+0.138
Don+0.135
Neg logit contributions
 up-0.183
an-0.148
--0.147
 out-0.145
ish-0.144
Token.
Token ID13
Feature activation+2.16e+01

Pos logit contributions
Kids+0.267
Well+0.256
Okay+0.244
Don+0.243
Sorry+0.242
Neg logit contributions
 up-0.112
an-0.084
--0.077
ish-0.077
 crown-0.073
Token "
Token ID366
Feature activation+1.95e+01

Pos logit contributions
Okay+0.228
Kids+0.227
Sorry+0.211
Hmm+0.204
Well+0.204
Neg logit contributions
 up-0.198
 out-0.174
 in-0.152
--0.146
 inside-0.144
TokenWhy
Token ID5195
Feature activation+1.87e+01

Pos logit contributions
Kids+0.118
Okay+0.110
Alright+0.097
Well+0.095
Um+0.095
Neg logit contributions
 up-0.226
--0.189
an-0.189
 in-0.187
 out-0.183
Token Sam
Token ID3409
Feature activation+1.43e+01

Pos logit contributions
Kids+0.189
Well+0.176
Don+0.167
Sorry+0.165
Okay+0.163
Neg logit contributions
 up-0.190
an-0.167
lers-0.160
ish-0.157
ling-0.157
Token,
Token ID11
Feature activation+1.93e+01

Pos logit contributions
Kids+0.157
Well+0.144
Ben+0.142
Okay+0.141
Sam+0.141
Neg logit contributions
 up-0.097
 soaring-0.084
lers-0.080
 underwater-0.079
an-0.078
Token a
Token ID257
Feature activation+1.27e+01

Pos logit contributions
Kids+0.215
Well+0.205
Don+0.196
Sorry+0.191
Okay+0.190
Neg logit contributions
 up-0.157
an-0.139
ish-0.127
lers-0.126
ling-0.125
Token big
Token ID1263
Feature activation+8.07e+00

Pos logit contributions
Kids+0.137
Well+0.123
Uh+0.122
Ben+0.119
Dad+0.119
Neg logit contributions
 up-0.101
ishing-0.081
an-0.080
lers-0.076
ish-0.076
Token pig
Token ID12967
Feature activation+1.01e+01

Pos logit contributions
Kids+0.075
Uh+0.071
Dad+0.069
 She+0.069
Well+0.068
Neg logit contributions
 up-0.067
ishing-0.055
an-0.054
lers-0.053
eking-0.053
Token!"
Token ID2474
Feature activation+2.38e+01

Pos logit contributions
Kids+0.097
 She+0.092
Well+0.090
Ben+0.090
Sam+0.089
Neg logit contributions
 up-0.067
 seas-0.059
rier-0.058
 soaring-0.058
ishing-0.055
Token Tim
Token ID5045
Feature activation+1.59e+01

Pos logit contributions
Kids+0.264
Okay+0.244
Well+0.239
Listen+0.239
Enough+0.231
Neg logit contributions
 up-0.208
 out-0.163
--0.159
 pe-0.159
 in-0.152
Token says
Token ID1139
Feature activation+1.90e+01

Pos logit contributions
Kids+0.141
Well+0.133
Okay+0.123
Sorry+0.123
Don+0.121
Neg logit contributions
 up-0.174
an-0.149
--0.144
ish-0.143
 out-0.140
Token.
Token ID13
Feature activation+2.23e+01

Pos logit contributions
Kids+0.232
Well+0.221
Don+0.213
Okay+0.213
Sorry+0.213
Neg logit contributions
 up-0.120
an-0.109
ish-0.099
asis-0.096
 crown-0.094
Token
Token ID198
Feature activation+2.28e+01

Pos logit contributions
Kids+0.214
Okay+0.192
Enough+0.190
Sorry+0.186
Em+0.184
Neg logit contributions
 up-0.234
 out-0.207
--0.200
 in-0.184
 pe-0.180
Token
Token ID198
Feature activation+2.32e+01

Pos logit contributions
Kids+0.217
Well+0.209
Okay+0.202
Enough+0.197
Uh+0.197
Neg logit contributions
 up-0.221
--0.192
 out-0.179
an-0.173
 home-0.171
TokenCan
Token ID6090
Feature activation+1.84e+01

Pos logit contributions
 She+0.091
Kids+0.091
Well+0.090
enny+0.089
 Did+0.085
Neg logit contributions
 up-0.218
 feathers-0.195
 inside-0.195
 crown-0.195
 egg-0.193
Token we
Token ID356
Feature activation+1.70e+01

Pos logit contributions
Kids+0.187
Well+0.178
Okay+0.167
Don+0.163
Sorry+0.162
Neg logit contributions
 up-0.176
--0.139
an-0.138
 out-0.136
 home-0.135
Token go
Token ID467
Feature activation+1.49e+01

Pos logit contributions
Kids+0.153
Well+0.141
Sorry+0.137
Okay+0.137
Don+0.129
Neg logit contributions
 up-0.170
an-0.157
lers-0.150
 soaring-0.142
ish-0.141
Token outside
Token ID2354
Feature activation+1.15e+01

Pos logit contributions
Kids+0.169
Sorry+0.150
Well+0.146
Listen+0.144
Sure+0.142
Neg logit contributions
an-0.102
 scales-0.094
rier-0.094
lers-0.090
ish-0.084
Token now
Token ID783
Feature activation+1.45e+01

Pos logit contributions
Kids+0.121
Well+0.112
Sorry+0.109
Dad+0.107
Okay+0.106
Neg logit contributions
an-0.087
 up-0.083
lers-0.079
ish-0.074
 soaring-0.074
Token?"
Token ID1701
Feature activation+2.38e+01

Pos logit contributions
Kids+0.145
Sorry+0.128
Dad+0.127
Well+0.125
Daddy+0.123
Neg logit contributions
an-0.119
lers-0.113
 up-0.111
 soaring-0.103
ish-0.102
Token Anna
Token ID11735
Feature activation+1.69e+01

Pos logit contributions
Okay+0.231
Well+0.225
Kids+0.224
Sorry+0.212
Enough+0.209
Neg logit contributions
 up-0.230
 out-0.194
 home-0.183
 in-0.172
 crown-0.169
Token and
Token ID290
Feature activation+1.98e+01

Pos logit contributions
Kids+0.150
Well+0.140
Okay+0.131
Sorry+0.130
Don+0.128
Neg logit contributions
 up-0.183
an-0.155
--0.153
ish-0.150
 out-0.148
Token Ben
Token ID3932
Feature activation+1.66e+01

Pos logit contributions
Kids+0.155
Well+0.146
Okay+0.134
Sorry+0.128
Don+0.127
Neg logit contributions
 up-0.245
 out-0.208
 home-0.202
ish-0.201
--0.199
Token asked
Token ID1965
Feature activation+1.90e+01

Pos logit contributions
Kids+0.145
Well+0.137
Okay+0.127
Sorry+0.126
Don+0.124
Neg logit contributions
 up-0.182
an-0.152
 out-0.151
--0.149
ish-0.149
Token.
Token ID13
Feature activation+2.13e+01

Pos logit contributions
Kids+0.241
Well+0.226
Okay+0.220
Sorry+0.218
Don+0.217
Neg logit contributions
 up-0.117
an-0.106
ish-0.093
 soaring-0.090
asis-0.090
Token"
Token ID1
Feature activation+2.07e+01

Pos logit contributions
Kids+0.188
Well+0.187
Okay+0.163
Um+0.163
Don+0.161
Neg logit contributions
 up-0.211
an-0.179
--0.175
 out-0.166
ling-0.163
TokenGo
Token ID5247
Feature activation+1.68e+01

Pos logit contributions
Kids+0.108
enny+0.100
oral+0.098
Well+0.095
Together+0.090
Neg logit contributions
 up-0.255
an-0.226
 feathers-0.216
--0.214
ling-0.212
Token away
Token ID1497
Feature activation+1.26e+01

Pos logit contributions
Kids+0.248
Well+0.234
Okay+0.230
Don+0.229
Sorry+0.228
Neg logit contributions
 up-0.073
an-0.055
lers-0.050
 soaring-0.050
ish-0.049
Token,
Token ID11
Feature activation+1.95e+01

Pos logit contributions
Kids+0.143
Well+0.131
Sorry+0.129
Ben+0.128
Dad+0.127
Neg logit contributions
 up-0.080
 soaring-0.075
lers-0.075
 crown-0.073
 feathers-0.070
Token dog
Token ID3290
Feature activation+1.42e+01

Pos logit contributions
Kids+0.177
Well+0.159
Don+0.155
Okay+0.148
Sorry+0.148
Neg logit contributions
 up-0.203
an-0.178
ish-0.175
lers-0.174
 crown-0.170
Token!"
Token ID2474
Feature activation+2.38e+01

Pos logit contributions
Kids+0.159
Don+0.152
Okay+0.150
Max+0.149
Well+0.149
Neg logit contributions
 up-0.084
 soaring-0.075
 crown-0.073
ishing-0.070
an-0.069
Token Ben
Token ID3932
Feature activation+1.74e+01

Pos logit contributions
Kids+0.240
Okay+0.233
Well+0.230
Listen+0.220
Sorry+0.220
Neg logit contributions
 up-0.225
 out-0.191
 home-0.175
--0.172
 in-0.168
Token shouted
Token ID17293
Feature activation+1.82e+01

Pos logit contributions
Kids+0.145
Well+0.135
Okay+0.124
Sorry+0.123
Don+0.121
Neg logit contributions
 up-0.200
an-0.173
--0.171
ish-0.165
 out-0.164
Token.
Token ID13
Feature activation+2.18e+01

Pos logit contributions
Kids+0.209
Well+0.197
Don+0.193
Sorry+0.190
Okay+0.187
Neg logit contributions
an-0.114
 up-0.114
asis-0.111
ish-0.103
 crown-0.102
Token "
Token ID366
Feature activation+1.95e+01

Pos logit contributions
Kids+0.236
Okay+0.220
Enough+0.213
Well+0.212
Hmm+0.212
Neg logit contributions
 up-0.197
 out-0.168
--0.151
 home-0.145
 in-0.144
TokenWe
Token ID1135
Feature activation+1.90e+01

Pos logit contributions
Kids+0.114
Well+0.094
Okay+0.086
Um+0.080
Em+0.080
Neg logit contributions
 up-0.262
an-0.232
--0.225
 out-0.223
 soaring-0.221
Token.
Token ID13
Feature activation+2.08e+01

Pos logit contributions
Kids+0.126
Well+0.118
Okay+0.115
Don+0.115
Sorry+0.114
Neg logit contributions
 up-0.062
an-0.057
ish-0.055
 soaring-0.054
lers-0.050
Token He
Token ID679
Feature activation+1.61e+01

Pos logit contributions
Kids+0.140
Well+0.139
Okay+0.133
Sorry+0.127
Enough+0.118
Neg logit contributions
 up-0.249
 out-0.218
an-0.210
--0.207
 home-0.205
Token works
Token ID2499
Feature activation+1.23e+01

Pos logit contributions
Kids+0.140
Well+0.133
Don+0.125
Okay+0.120
Sorry+0.118
Neg logit contributions
 up-0.159
an-0.141
 soaring-0.137
 home-0.135
ish-0.133
Token at
Token ID379
Feature activation+1.43e+01

Pos logit contributions
Kids+0.141
Well+0.139
Okay+0.134
Don+0.134
Sorry+0.133
Neg logit contributions
an-0.074
ish-0.070
 up-0.070
 nest-0.064
 soaring-0.063
Token night
Token ID1755
Feature activation+1.15e+01

Pos logit contributions
Kids+0.152
Well+0.136
Lisa+0.133
Listen+0.129
Dad+0.127
Neg logit contributions
ish-0.111
an-0.111
 up-0.107
lers-0.106
ling-0.100
Token."
Token ID526
Feature activation+2.38e+01

Pos logit contributions
Kids+0.121
Well+0.112
Grand+0.110
Em+0.108
 She+0.106
Neg logit contributions
ish-0.087
an-0.086
 up-0.082
lers-0.077
 crown-0.075
Token
Token ID198
Feature activation+2.23e+01

Pos logit contributions
Okay+0.237
Kids+0.237
Sorry+0.235
Well+0.232
Don+0.224
Neg logit contributions
 up-0.217
 home-0.187
--0.171
 out-0.168
 in-0.147
Token
Token ID198
Feature activation+2.24e+01

Pos logit contributions
Kids+0.225
Okay+0.222
Well+0.221
Sorry+0.220
Uh+0.219
Neg logit contributions
 up-0.204
--0.161
 out-0.158
 home-0.158
 inside-0.140
TokenL
Token ID43
Feature activation+1.83e+01

Pos logit contributions
Kids+0.157
Well+0.152
 Did+0.134
 Deal+0.130
 Promise+0.129
Neg logit contributions
 up-0.239
--0.229
 away-0.197
 in-0.195
 out-0.190
Tokenily
Token ID813
Feature activation+1.83e+01

Pos logit contributions
Kids+0.135
Well+0.124
Okay+0.112
Don+0.111
Sorry+0.111
Neg logit contributions
 up-0.226
an-0.199
--0.193
ish-0.193
 crown-0.189
Token says
Token ID1139
Feature activation+1.95e+01

Pos logit contributions
Kids+0.192
Well+0.182
Okay+0.170
Don+0.169
Sorry+0.168
Neg logit contributions
 up-0.167
an-0.141
--0.135
ish-0.134
 crown-0.130
Token.
Token ID13
Feature activation+2.02e+01

Pos logit contributions
Kids+0.110
Well+0.103
Don+0.100
Okay+0.098
 She+0.095
Neg logit contributions
an-0.083
 up-0.082
 crown-0.078
asis-0.077
 soaring-0.075
Token He
Token ID679
Feature activation+1.50e+01

Pos logit contributions
Kids+0.174
Well+0.168
Okay+0.157
Sorry+0.155
Enough+0.154
Neg logit contributions
 up-0.207
an-0.177
 out-0.174
ish-0.168
--0.163
Token might
Token ID1244
Feature activation+1.27e+01

Pos logit contributions
Kids+0.115
Well+0.111
Don+0.106
Okay+0.102
Enough+0.098
Neg logit contributions
 up-0.156
an-0.139
 home-0.137
 soaring-0.137
 crown-0.133
Token bite
Token ID13197
Feature activation+1.03e+01

Pos logit contributions
Kids+0.116
Don+0.109
Well+0.108
Okay+0.103
Sorry+0.102
Neg logit contributions
 up-0.118
ish-0.110
an-0.108
lers-0.104
ishing-0.102
Token you
Token ID345
Feature activation+1.38e+01

Pos logit contributions
Kids+0.105
Well+0.096
Don+0.095
Sorry+0.092
Okay+0.091
Neg logit contributions
 up-0.083
an-0.079
 crown-0.076
 soaring-0.074
ish-0.073
Token."
Token ID526
Feature activation+2.38e+01

Pos logit contributions
Don+0.123
Okay+0.121
Kids+0.118
That+0.117
Sorry+0.116
Neg logit contributions
 soaring-0.091
 crown-0.090
 seas-0.089
an-0.088
 up-0.087
Token
Token ID198
Feature activation+2.19e+01

Pos logit contributions
Kids+0.242
Uh+0.223
Sorry+0.220
Okay+0.220
Well+0.213
Neg logit contributions
 up-0.233
 out-0.187
 home-0.181
--0.173
 in-0.161
Token
Token ID198
Feature activation+2.22e+01

Pos logit contributions
Kids+0.215
Okay+0.211
Uh+0.210
Well+0.206
Sorry+0.205
Neg logit contributions
 up-0.209
 out-0.164
--0.164
 home-0.160
an-0.155
TokenBen
Token ID11696
Feature activation+1.87e+01

Pos logit contributions
Well+0.133
Kids+0.133
Uh+0.111
Um+0.111
Listen+0.110
Neg logit contributions
 up-0.266
an-0.250
--0.242
ling-0.229
 pe-0.223
Token is
Token ID318
Feature activation+1.70e+01

Pos logit contributions
Kids+0.195
Well+0.185
Don+0.172
Okay+0.171
Sorry+0.169
Neg logit contributions
 up-0.165
an-0.143
--0.136
 home-0.135
ish-0.135
Token brave
Token ID14802
Feature activation+1.54e+01

Pos logit contributions
Kids+0.133
Don+0.130
Well+0.125
Sorry+0.118
Um+0.116
Neg logit contributions
ish-0.170
an-0.168
ig-0.160
asis-0.159
 seas-0.158
Token"
Token ID1
Feature activation+2.06e+01

Pos logit contributions
Kids+0.194
Well+0.191
Um+0.171
Don+0.170
Okay+0.169
Neg logit contributions
 up-0.202
an-0.178
--0.173
ling-0.159
 out-0.157
TokenDid
Token ID11633
Feature activation+1.70e+01

Pos logit contributions
Kids+0.116
Enough+0.085
Well+0.084
Okay+0.078
enny+0.075
Neg logit contributions
 up-0.247
 soaring-0.229
 crown-0.228
 feathers-0.227
an-0.221
Token you
Token ID345
Feature activation+1.67e+01

Pos logit contributions
Kids+0.131
Well+0.128
Don+0.118
Ben+0.117
Um+0.116
Neg logit contributions
 up-0.174
an-0.170
ishing-0.159
--0.158
 home-0.157
Token see
Token ID766
Feature activation+1.57e+01

Pos logit contributions
Kids+0.110
Well+0.109
Sorry+0.107
Don+0.101
Okay+0.100
Neg logit contributions
 up-0.192
an-0.180
--0.165
lers-0.163
ishing-0.163
Token that
Token ID326
Feature activation+1.49e+01

Pos logit contributions
Kids+0.126
Well+0.124
Don+0.117
Um+0.116
That+0.114
Neg logit contributions
an-0.137
asis-0.126
kel-0.123
ig-0.123
 up-0.122
Token?"
Token ID1701
Feature activation+2.38e+01

Pos logit contributions
Sorry+0.123
That+0.118
Uh+0.117
Kids+0.117
Well+0.113
Neg logit contributions
lers-0.117
ishing-0.114
adows-0.110
ig-0.109
kel-0.106
Token Ben
Token ID3932
Feature activation+1.69e+01

Pos logit contributions
Kids+0.249
Okay+0.236
Well+0.232
Sorry+0.222
Enough+0.220
Neg logit contributions
 up-0.222
 out-0.169
--0.167
 home-0.162
an-0.157
Token asked
Token ID1965
Feature activation+1.96e+01

Pos logit contributions
Kids+0.135
Well+0.125
Okay+0.114
Don+0.113
Sorry+0.113
Neg logit contributions
 up-0.199
an-0.173
--0.168
ish-0.167
 crown-0.164
Token Lily
Token ID20037
Feature activation+1.60e+01

Pos logit contributions
Kids+0.241
Well+0.225
Don+0.215
Okay+0.212
Sorry+0.212
Neg logit contributions
 up-0.133
an-0.117
ish-0.107
 soaring-0.106
 crown-0.104
Token.
Token ID13
Feature activation+2.20e+01

Pos logit contributions
 She+0.149
Don+0.141
Kids+0.139
Well+0.139
Brother+0.132
Neg logit contributions
 seas-0.127
an-0.119
asis-0.118
 island-0.110
ig-0.106
Token "
Token ID366
Feature activation+1.98e+01

Pos logit contributions
Kids+0.220
Okay+0.211
Sorry+0.207
Enough+0.206
Well+0.202
Neg logit contributions
 up-0.204
 out-0.172
--0.158
 home-0.156
 in-0.145
Token
Token ID198
Feature activation+2.18e+01

Pos logit contributions
Well+0.199
Kids+0.195
Okay+0.193
Uh+0.190
Hmm+0.185
Neg logit contributions
 up-0.205
--0.179
 out-0.164
 home-0.158
an-0.154
Token"
Token ID1
Feature activation+2.09e+01

Pos logit contributions
Well+0.171
Kids+0.165
Listen+0.146
Um+0.146
Don+0.145
Neg logit contributions
 up-0.230
--0.216
an-0.208
ling-0.193
 out-0.188
TokenYou
Token ID1639
Feature activation+1.92e+01

Pos logit contributions
oral+0.128
Kids+0.125
 On+0.113
enny+0.111
 Deal+0.110
Neg logit contributions
 up-0.199
 pe-0.177
 feathers-0.174
 in-0.174
 fl-0.173
Token did
Token ID750
Feature activation+1.50e+01

Pos logit contributions
Kids+0.178
Well+0.167
Okay+0.155
Don+0.154
Sorry+0.153
Neg logit contributions
 up-0.200
an-0.171
--0.166
ish-0.165
 crown-0.161
Token it
Token ID340
Feature activation+1.33e+01

Pos logit contributions
Kids+0.151
Well+0.145
Don+0.139
Okay+0.139
Sorry+0.138
Neg logit contributions
 up-0.128
an-0.121
ish-0.112
lers-0.107
--0.105
Token!"
Token ID2474
Feature activation+2.37e+01

Pos logit contributions
Kids+0.162
Well+0.159
Sorry+0.158
Okay+0.154
Enough+0.151
Neg logit contributions
an-0.068
 up-0.063
 soaring-0.059
lers-0.059
 feathers-0.059
Token Lily
Token ID20037
Feature activation+1.66e+01

Pos logit contributions
Kids+0.215
Well+0.205
Okay+0.204
Sorry+0.198
Don+0.195
Neg logit contributions
 up-0.239
 out-0.208
 home-0.200
--0.200
 crown-0.197
Token said
Token ID531
Feature activation+1.97e+01

Pos logit contributions
Kids+0.157
Well+0.145
Okay+0.136
Listen+0.135
Sorry+0.135
Neg logit contributions
 up-0.177
--0.147
 out-0.142
an-0.141
ish-0.138
Token.
Token ID13
Feature activation+2.18e+01

Pos logit contributions
Kids+0.234
Well+0.223
Okay+0.216
Don+0.215
Sorry+0.215
Neg logit contributions
 up-0.125
an-0.121
asis-0.111
ish-0.108
 soaring-0.105
Token
Token ID198
Feature activation+2.23e+01

Pos logit contributions
Okay+0.220
Kids+0.220
Sorry+0.211
Hmm+0.201
Uh+0.198
Neg logit contributions
 up-0.212
 out-0.186
 in-0.158
 home-0.158
--0.150
Token
Token ID198
Feature activation+2.28e+01

Pos logit contributions
Kids+0.199
Well+0.197
Okay+0.197
Uh+0.191
Sorry+0.187
Neg logit contributions
 up-0.233
--0.191
 out-0.187
 home-0.184
an-0.182
Token Lily
Token ID20037
Feature activation+1.38e+01

Pos logit contributions
Kids+0.168
Well+0.157
Okay+0.145
Don+0.144
Sorry+0.143
Neg logit contributions
 up-0.199
an-0.173
ish-0.166
--0.166
lers-0.162
Token.
Token ID13
Feature activation+1.82e+01

Pos logit contributions
Kids+0.154
Well+0.149
Don+0.144
Okay+0.141
Max+0.139
Neg logit contributions
 up-0.096
an-0.087
ish-0.082
 soaring-0.080
 crown-0.079
Token Are
Token ID4231
Feature activation+1.67e+01

Pos logit contributions
Kids+0.129
Well+0.118
Okay+0.110
Sorry+0.108
Don+0.104
Neg logit contributions
 up-0.228
 out-0.193
--0.193
an-0.193
 home-0.190
Token you
Token ID345
Feature activation+1.72e+01

Pos logit contributions
Kids+0.177
Well+0.169
Don+0.157
Sorry+0.156
Okay+0.155
Neg logit contributions
 up-0.149
an-0.131
ish-0.122
--0.121
 crown-0.116
Token lost
Token ID2626
Feature activation+1.11e+01

Pos logit contributions
Kids+0.141
Well+0.124
Don+0.122
Sorry+0.118
Listen+0.115
Neg logit contributions
an-0.173
 up-0.157
lers-0.156
ish-0.152
rier-0.151
Token?"
Token ID1701
Feature activation+2.37e+01

Pos logit contributions
Kids+0.108
Dad+0.100
Daddy+0.098
Max+0.098
Don+0.097
Neg logit contributions
an-0.085
 up-0.080
 soaring-0.077
lers-0.077
ish-0.077
Token she
Token ID673
Feature activation+1.64e+01

Pos logit contributions
Kids+0.303
Okay+0.279
Well+0.276
Together+0.267
Alright+0.259
Neg logit contributions
 up-0.191
 out-0.151
 home-0.142
 in-0.133
 crown-0.127
Token asked
Token ID1965
Feature activation+1.92e+01

Pos logit contributions
Kids+0.159
Well+0.153
Okay+0.148
Sorry+0.141
Enough+0.140
Neg logit contributions
 up-0.165
--0.136
 out-0.131
an-0.131
 crown-0.130
Token.
Token ID13
Feature activation+2.09e+01

Pos logit contributions
Kids+0.249
Well+0.237
Don+0.229
Okay+0.226
Sorry+0.226
Neg logit contributions
 up-0.115
an-0.100
ish-0.090
lers-0.084
 crown-0.084
Token
Token ID198
Feature activation+2.17e+01

Pos logit contributions
Kids+0.245
Okay+0.232
Sorry+0.222
Enough+0.214
Uh+0.211
Neg logit contributions
 up-0.179
 out-0.165
 in-0.144
 inside-0.136
 outside-0.134
Token
Token ID198
Feature activation+2.21e+01

Pos logit contributions
Kids+0.210
Okay+0.202
Well+0.202
Uh+0.193
Sorry+0.187
Neg logit contributions
 up-0.213
--0.172
 out-0.169
 home-0.168
 inside-0.154
Token
Token ID198
Feature activation+2.07e+01

Pos logit contributions
Kids+0.194
Okay+0.192
Well+0.186
Sorry+0.184
Uh+0.181
Neg logit contributions
 up-0.195
--0.162
 out-0.157
 home-0.153
an-0.151
Token"
Token ID1
Feature activation+2.06e+01

Pos logit contributions
Kids+0.212
Well+0.204
Okay+0.191
Listen+0.190
Enough+0.189
Neg logit contributions
 up-0.170
--0.160
an-0.143
 out-0.132
b-0.128
TokenWho
Token ID8241
Feature activation+1.68e+01

Pos logit contributions
enny+0.125
 Deal+0.114
Kids+0.108
istent+0.108
 She+0.106
Neg logit contributions
 pe-0.196
 up-0.194
 feathers-0.183
an-0.180
 fl-0.173
Token is
Token ID318
Feature activation+1.53e+01

Pos logit contributions
Kids+0.142
Well+0.128
Okay+0.121
Don+0.120
Sorry+0.118
Neg logit contributions
 up-0.173
an-0.154
rier-0.149
ish-0.146
 soaring-0.146
Token there
Token ID612
Feature activation+1.10e+01

Pos logit contributions
Kids+0.132
Don+0.127
Well+0.126
 Who+0.124
Max+0.118
Neg logit contributions
an-0.125
ish-0.117
est-0.108
ling-0.108
ishing-0.106
Token?"
Token ID1701
Feature activation+2.37e+01

Pos logit contributions
Kids+0.107
Well+0.101
Don+0.098
Dad+0.097
Sorry+0.095
Neg logit contributions
an-0.085
 up-0.083
lers-0.079
ish-0.079
ig-0.075
Token a
Token ID257
Feature activation+1.40e+01

Pos logit contributions
Kids+0.241
Okay+0.238
Well+0.224
Listen+0.222
Enough+0.221
Neg logit contributions
 up-0.215
--0.174
 out-0.174
 pe-0.164
 home-0.164
Token voice
Token ID3809
Feature activation+1.34e+01

Pos logit contributions
Kids+0.156
Well+0.150
Okay+0.140
Don+0.140
Sorry+0.138
Neg logit contributions
 up-0.116
an-0.097
--0.096
 crown-0.091
ish-0.090
Token said
Token ID531
Feature activation+1.90e+01

Pos logit contributions
Kids+0.117
Well+0.110
Okay+0.103
Don+0.101
Sorry+0.101
Neg logit contributions
 up-0.147
an-0.125
--0.124
 out-0.120
ish-0.120
Token.
Token ID13
Feature activation+2.11e+01

Pos logit contributions
Kids+0.226
Well+0.208
Don+0.201
Sorry+0.198
Okay+0.194
Neg logit contributions
an-0.122
 up-0.119
asis-0.113
ish-0.109
 soaring-0.106
Token
Token ID198
Feature activation+2.13e+01

Pos logit contributions
Kids+0.189
Okay+0.183
Sorry+0.171
Well+0.170
Enough+0.168
Neg logit contributions
 up-0.227
 out-0.199
--0.190
 inside-0.177
an-0.177
Token can
Token ID460
Feature activation+1.58e+01

Pos logit contributions
Kids+0.136
Well+0.124
Don+0.116
Listen+0.114
Sorry+0.112
Neg logit contributions
 up-0.150
an-0.138
ish-0.129
 home-0.124
lers-0.124
Token I
Token ID314
Feature activation+1.50e+01

Pos logit contributions
Kids+0.135
Well+0.125
Don+0.114
Sorry+0.113
Okay+0.111
Neg logit contributions
 up-0.174
an-0.154
ish-0.150
--0.144
 crown-0.144
Token get
Token ID651
Feature activation+1.34e+01

Pos logit contributions
Kids+0.134
Well+0.124
Sorry+0.116
Don+0.115
Okay+0.114
Neg logit contributions
 up-0.158
an-0.138
ish-0.131
lers-0.131
--0.129
Token for
Token ID329
Feature activation+1.45e+01

Pos logit contributions
Kids+0.150
Well+0.140
Sorry+0.139
Dad+0.137
Don+0.134
Neg logit contributions
an-0.093
 up-0.086
ig-0.083
ish-0.081
 soaring-0.077
Token you
Token ID345
Feature activation+1.36e+01

Pos logit contributions
Kids+0.142
Dad+0.130
Well+0.128
Sorry+0.125
Daddy+0.125
Neg logit contributions
 up-0.125
an-0.121
ish-0.116
ig-0.106
lers-0.105
Token?"
Token ID1701
Feature activation+2.37e+01

Pos logit contributions
Kids+0.109
Dad+0.106
Daddy+0.102
Grand+0.102
Well+0.100
Neg logit contributions
an-0.122
 up-0.114
lers-0.110
ish-0.107
ig-0.106
Token he
Token ID339
Feature activation+1.65e+01

Pos logit contributions
Kids+0.250
Okay+0.247
Well+0.237
Enough+0.227
Alright+0.225
Neg logit contributions
 up-0.209
 out-0.165
 home-0.163
--0.158
 in-0.149
Token asks
Token ID7893
Feature activation+1.93e+01

Pos logit contributions
Kids+0.132
Well+0.122
Don+0.112
Okay+0.110
Sorry+0.110
Neg logit contributions
 up-0.188
an-0.167
ish-0.162
--0.161
 home-0.157
Token.
Token ID13
Feature activation+2.16e+01

Pos logit contributions
Kids+0.263
Well+0.251
Okay+0.239
Don+0.238
Sorry+0.238
Neg logit contributions
 up-0.115
an-0.089
ish-0.082
--0.080
 crown-0.077
Token
Token ID198
Feature activation+2.16e+01

Pos logit contributions
Kids+0.225
Okay+0.220
Well+0.207
Enough+0.205
Sorry+0.201
Neg logit contributions
 up-0.197
 out-0.170
--0.153
 home-0.151
 inside-0.142
Token
Token ID198
Feature activation+2.20e+01

Pos logit contributions
Kids+0.194
Well+0.191
Okay+0.180
Don+0.172
Enough+0.171
Neg logit contributions
 up-0.226
--0.190
 out-0.185
 home-0.179
an-0.174
Token how
Token ID703
Feature activation+1.56e+01

Pos logit contributions
Kids+0.145
Well+0.145
Okay+0.135
Don+0.133
Sorry+0.131
Neg logit contributions
 up-0.198
an-0.190
lers-0.187
ling-0.178
--0.173
Token can
Token ID460
Feature activation+1.75e+01

Pos logit contributions
Kids+0.167
Well+0.158
Okay+0.149
Don+0.148
Sorry+0.147
Neg logit contributions
 up-0.139
an-0.118
ish-0.112
--0.111
 crown-0.109
Token we
Token ID356
Feature activation+1.55e+01

Pos logit contributions
Kids+0.154
Well+0.145
Sorry+0.133
Don+0.132
Okay+0.131
Neg logit contributions
 up-0.187
an-0.164
--0.158
ish-0.157
 crown-0.153
Token get
Token ID651
Feature activation+1.31e+01

Pos logit contributions
Kids+0.121
Well+0.121
Okay+0.115
Sorry+0.114
Don+0.113
Neg logit contributions
 up-0.156
an-0.144
lers-0.140
--0.134
ishing-0.132
Token them
Token ID606
Feature activation+1.14e+01

Pos logit contributions
Sorry+0.122
Kids+0.115
Don+0.113
Listen+0.112
Well+0.112
Neg logit contributions
an-0.114
ig-0.104
asis-0.097
 soaring-0.093
est-0.093
Token?"
Token ID1701
Feature activation+2.37e+01

Pos logit contributions
Kids+0.120
Well+0.116
Sorry+0.114
Don+0.111
Okay+0.110
Neg logit contributions
an-0.082
 up-0.080
lers-0.073
ish-0.069
 soaring-0.068
Token Max
Token ID5436
Feature activation+1.56e+01

Pos logit contributions
Kids+0.247
Well+0.236
Okay+0.232
Alright+0.219
Ok+0.213
Neg logit contributions
 up-0.233
 out-0.187
 flying-0.179
 home-0.172
--0.170
Token says
Token ID1139
Feature activation+1.85e+01

Pos logit contributions
Kids+0.150
Well+0.145
Okay+0.137
Sorry+0.136
Don+0.131
Neg logit contributions
 up-0.164
an-0.128
 out-0.126
--0.126
ish-0.124
Token.
Token ID13
Feature activation+2.15e+01

Pos logit contributions
Kids+0.206
Well+0.195
Don+0.190
Sorry+0.190
Okay+0.189
Neg logit contributions
an-0.122
 up-0.121
asis-0.116
ish-0.110
 soaring-0.107
Token He
Token ID679
Feature activation+1.76e+01

Pos logit contributions
Kids+0.221
Okay+0.201
Well+0.194
Sorry+0.191
Enough+0.189
Neg logit contributions
 up-0.207
 out-0.185
--0.169
 in-0.163
 inside-0.154
Token looks
Token ID3073
Feature activation+1.27e+01

Pos logit contributions
Kids+0.156
Well+0.155
Don+0.141
Okay+0.135
Enough+0.135
Neg logit contributions
 up-0.164
 home-0.157
an-0.157
ish-0.148
asis-0.145
Token children
Token ID1751
Feature activation-3.10e+01

Pos logit contributions
Kids+0.156
Well+0.146
Okay+0.135
Sorry+0.134
Don+0.133
Neg logit contributions
 up-0.203
an-0.175
--0.171
ish-0.168
 out-0.165
Token put
Token ID1234
Feature activation-2.92e+01

Pos logit contributions
 up+0.271
an+0.248
ish+0.227
-+0.223
 crown+0.220
Neg logit contributions
Kids-0.315
Well-0.299
Okay-0.275
Sorry-0.275
Don-0.274
Token on
Token ID319
Feature activation-2.55e+01

Pos logit contributions
an+0.202
 up+0.187
asis+0.181
 soaring+0.176
ish+0.168
Neg logit contributions
Kids-0.320
Well-0.298
Don-0.282
Okay-0.276
Sorry-0.271
Token their
Token ID511
Feature activation-3.08e+01

Pos logit contributions
 up+0.218
an+0.216
ish+0.205
asis+0.196
 soaring+0.188
Neg logit contributions
Kids-0.253
Well-0.231
Don-0.217
Okay-0.214
Enough-0.211
Token bath
Token ID7837
Feature activation-3.65e+01

Pos logit contributions
 up+0.235
an+0.204
ish+0.198
 soaring+0.189
-+0.182
Neg logit contributions
Kids-0.325
Okay-0.293
Well-0.291
Sorry-0.282
Don-0.280
Tokenro
Token ID305
Feature activation-5.57e+01

Pos logit contributions
 up+0.326
 soaring+0.302
an+0.291
ish+0.280
 crown+0.269
Neg logit contributions
Kids-0.366
Well-0.318
Okay-0.312
Sorry-0.294
Don-0.285
Tokenbes
Token ID12636
Feature activation-3.81e+01

Pos logit contributions
 up+0.547
ish+0.543
 island+0.483
 biggest+0.476
 soaring+0.475
Neg logit contributions
Kids-0.423
Ben-0.370
Well-0.363
Max-0.348
Tom-0.342
Token and
Token ID290
Feature activation-1.70e+01

Pos logit contributions
 up+0.248
an+0.244
 crown+0.213
ish+0.212
 soaring+0.211
Neg logit contributions
Kids-0.439
Well-0.425
Okay-0.406
Don-0.401
Sorry-0.395
Token ran
Token ID4966
Feature activation-2.45e+01

Pos logit contributions
 up+0.171
an+0.163
ish+0.150
-+0.147
 soaring+0.143
Neg logit contributions
Kids-0.154
Well-0.140
Don-0.132
Sorry-0.130
Okay-0.127
Token around
Token ID1088
Feature activation-2.25e+01

Pos logit contributions
an+0.151
ish+0.141
 up+0.137
asis+0.131
 soaring+0.127
Neg logit contributions
Kids-0.312
Well-0.280
Don-0.272
Okay-0.270
Listen-0.267
Token in
Token ID287
Feature activation-2.09e+01

Pos logit contributions
an+0.162
 up+0.157
ish+0.150
lers+0.134
asis+0.131
Neg logit contributions
Kids-0.261
Well-0.244
Don-0.234
Okay-0.226
Listen-0.222
Token't
Token ID470
Feature activation-3.00e+01

Pos logit contributions
Kids+0.227
Well+0.217
Okay+0.206
Don+0.205
Sorry+0.205
Neg logit contributions
 up-0.116
an-0.090
--0.085
ish-0.084
 crown-0.081
Token know
Token ID760
Feature activation-2.87e+01

Pos logit contributions
 up+0.323
an+0.316
ish+0.300
lers+0.299
ig+0.292
Neg logit contributions
Kids-0.225
Well-0.205
Don-0.200
Sorry-0.193
Okay-0.193
Token the
Token ID262
Feature activation-2.75e+01

Pos logit contributions
an+0.259
 up+0.245
ish+0.234
asis+0.216
ling+0.213
Neg logit contributions
Kids-0.269
Well-0.252
Don-0.239
Enough-0.226
Listen-0.226
Token rh
Token ID9529
Feature activation-4.76e+01

Pos logit contributions
 up+0.222
an+0.196
ish+0.189
ishing+0.188
-+0.186
Neg logit contributions
Kids-0.276
Well-0.270
Okay-0.260
Don-0.257
Uh-0.246
Tokenin
Token ID259
Feature activation-3.90e+01

Pos logit contributions
 up+0.452
 soaring+0.346
 out+0.343
 crown+0.326
 underwater+0.324
Neg logit contributions
Well-0.458
Sorry-0.436
Kids-0.430
Okay-0.422
oral-0.407
Tokenoc
Token ID420
Feature activation-5.51e+01

Pos logit contributions
 up+0.334
 soaring+0.273
an+0.262
 out+0.260
ishing+0.255
Neg logit contributions
Well-0.406
Kids-0.386
Okay-0.374
Don-0.368
Sorry-0.356
Tokeneros
Token ID27498
Feature activation-3.52e+01

Pos logit contributions
 up+0.415
 soaring+0.355
 out+0.354
 flying+0.326
 crown+0.326
Neg logit contributions
Well-0.584
Sorry-0.559
Okay-0.550
Kids-0.539
Enough-0.532
Token was
Token ID373
Feature activation-2.73e+01

Pos logit contributions
 up+0.354
an+0.317
ish+0.305
 crown+0.300
 soaring+0.295
Neg logit contributions
Kids-0.301
Well-0.290
Don-0.270
Okay-0.259
Sorry-0.259
Token wild
Token ID4295
Feature activation-3.04e+01

Pos logit contributions
an+0.250
ish+0.237
ig+0.221
 up+0.217
asis+0.207
Neg logit contributions
Don-0.238
Well-0.228
Kids-0.228
Um-0.215
Listen-0.213
Token.
Token ID13
Feature activation-1.05e+01

Pos logit contributions
an+0.229
aches+0.219
ig+0.217
 up+0.212
 crown+0.210
Neg logit contributions
Kids-0.275
 She-0.266
Well-0.263
Don-0.256
 They-0.246
Token Can
Token ID1680
Feature activation-1.50e+01

Pos logit contributions
 up+0.119
 home+0.100
 out+0.100
an+0.095
-+0.095
Neg logit contributions
Kids-0.079
Sorry-0.072
Okay-0.072
Enough-0.070
Well-0.070
Token harder
Token ID7069
Feature activation-2.64e+01

Pos logit contributions
 up+0.231
an+0.229
ish+0.212
-+0.206
ig+0.202
Neg logit contributions
Kids-0.209
Well-0.198
Don-0.194
Okay-0.185
Sorry-0.182
Token to
Token ID284
Feature activation-2.00e+01

Pos logit contributions
 up+0.185
an+0.180
ish+0.162
 crown+0.148
asis+0.146
Neg logit contributions
Well-0.290
Kids-0.284
Don-0.275
Okay-0.267
Sorry-0.267
Token push
Token ID4574
Feature activation-2.72e+01

Pos logit contributions
an+0.190
 up+0.182
-+0.172
lers+0.171
ish+0.171
Neg logit contributions
Kids-0.163
Well-0.154
Don-0.149
Okay-0.146
Sorry-0.146
Token the
Token ID262
Feature activation-2.47e+01

Pos logit contributions
 up+0.198
an+0.184
ish+0.165
 crown+0.164
asis+0.162
Neg logit contributions
Kids-0.309
Well-0.297
Don-0.282
Okay-0.278
Sorry-0.274
Token wheel
Token ID7825
Feature activation-3.35e+01

Pos logit contributions
 up+0.194
an+0.169
-+0.159
ish+0.156
ishing+0.155
Neg logit contributions
Kids-0.249
Well-0.241
Okay-0.235
Don-0.230
Uh-0.229
Tokenbar
Token ID5657
Feature activation-5.49e+01

Pos logit contributions
an+0.199
 up+0.196
 crown+0.177
ish+0.176
lers+0.174
Neg logit contributions
Kids-0.376
Well-0.374
Okay-0.359
Sorry-0.358
Don-0.358
Tokenrow
Token ID808
Feature activation-3.67e+01

Pos logit contributions
 up+0.409
 soaring+0.352
 crown+0.336
 flying+0.326
 underwater+0.315
Neg logit contributions
Well-0.565
Kids-0.552
Okay-0.531
Sorry-0.521
Dad-0.520
Token.
Token ID13
Feature activation-9.10e+00

Pos logit contributions
an+0.222
 up+0.212
ish+0.203
 crown+0.201
asis+0.190
Neg logit contributions
Well-0.410
Kids-0.405
Don-0.397
Okay-0.392
Sorry-0.391
Token But
Token ID887
Feature activation-1.17e+01

Pos logit contributions
 up+0.098
 out+0.081
 home+0.081
-+0.075
 soaring+0.074
Neg logit contributions
Kids-0.075
Sorry-0.068
Okay-0.068
Listen-0.066
Well-0.066
Token her
Token ID607
Feature activation-1.89e+01

Pos logit contributions
 up+0.106
an+0.100
ish+0.090
-+0.089
lers+0.088
Neg logit contributions
Kids-0.114
Well-0.111
Don-0.105
Sorry-0.103
Okay-0.103
Token backyard
Token ID24296
Feature activation-2.56e+01

Pos logit contributions
 up+0.200
an+0.179
ling+0.169
asis+0.167
 out+0.166
Neg logit contributions
Kids-0.145
Well-0.144
Okay-0.137
Sorry-0.129
Dad-0.126
Token reward
Token ID6721
Feature activation-3.66e+01

Pos logit contributions
 up+0.336
an+0.301
 soaring+0.274
asis+0.272
eking+0.266
Neg logit contributions
Kids-0.433
Well-0.417
Uh-0.403
Okay-0.403
Don-0.395
Token.
Token ID13
Feature activation-1.27e+01

Pos logit contributions
 up+0.255
an+0.244
ish+0.217
 soaring+0.203
 crown+0.202
Neg logit contributions
Kids-0.416
Well-0.404
Don-0.377
Okay-0.374
Sorry-0.367
Token It
Token ID632
Feature activation-2.21e+01

Pos logit contributions
 up+0.152
 home+0.127
 out+0.127
-+0.123
an+0.123
Neg logit contributions
Kids-0.094
Well-0.082
Sorry-0.081
Okay-0.081
Enough-0.081
Token was
Token ID373
Feature activation-2.25e+01

Pos logit contributions
 up+0.223
an+0.201
ish+0.192
-+0.191
 home+0.189
Neg logit contributions
Kids-0.191
Well-0.190
Don-0.174
Okay-0.172
Sorry-0.167
Token ch
Token ID442
Feature activation-2.95e+01

Pos logit contributions
an+0.172
 up+0.167
ish+0.160
lers+0.153
ig+0.146
Neg logit contributions
Well-0.229
Kids-0.229
Don-0.221
Okay-0.210
Um-0.207
Tokenocol
Token ID4668
Feature activation-5.46e+01

Pos logit contributions
 up+0.253
an+0.207
 crown+0.198
 home+0.193
-+0.189
Neg logit contributions
Kids-0.312
Well-0.307
Don-0.283
Okay-0.283
Sorry-0.282
Tokenates
Token ID689
Feature activation-3.46e+01

Pos logit contributions
 up+0.450
an+0.336
 nest+0.322
 crown+0.316
 out+0.315
Neg logit contributions
Well-0.561
Kids-0.541
Dad-0.530
Sorry-0.528
Um-0.509
Token and
Token ID290
Feature activation-1.89e+01

Pos logit contributions
an+0.244
 up+0.235
 crown+0.208
ish+0.203
ans+0.199
Neg logit contributions
Well-0.333
 She-0.320
Kids-0.317
That-0.317
Don-0.314
Token sweets
Token ID42402
Feature activation-3.28e+01

Pos logit contributions
 up+0.165
an+0.162
ish+0.149
-+0.137
ig+0.136
Neg logit contributions
Well-0.182
Kids-0.181
Don-0.172
Sorry-0.165
Okay-0.164
Token!
Token ID0
Feature activation-9.55e+00

Pos logit contributions
an+0.244
 up+0.235
ish+0.216
ans+0.207
ig+0.200
Neg logit contributions
Well-0.335
Don-0.313
Kids-0.312
Okay-0.312
That-0.310
Token
Token ID198
Feature activation-9.29e+00

Pos logit contributions
 up+0.113
 out+0.095
 home+0.094
-+0.092
an+0.091
Neg logit contributions
Kids-0.072
Well-0.063
Okay-0.063
Sorry-0.063
Enough-0.061
Token When
Token ID1649
Feature activation-1.74e+01

Pos logit contributions
 up+0.162
 out+0.138
 home+0.135
-+0.133
an+0.131
Neg logit contributions
Kids-0.088
Well-0.079
Okay-0.077
Sorry-0.076
Enough-0.076
Token Daisy
Token ID40355
Feature activation-2.25e+01

Pos logit contributions
 up+0.187
an+0.169
ish+0.161
-+0.158
lers+0.154
Neg logit contributions
Kids-0.151
Well-0.140
Sorry-0.129
Don-0.129
Okay-0.128
Token put
Token ID1234
Feature activation-2.19e+01

Pos logit contributions
 up+0.227
an+0.194
-+0.186
ish+0.186
 crown+0.181
Neg logit contributions
Kids-0.215
Well-0.202
Okay-0.187
Don-0.186
Sorry-0.186
Token the
Token ID262
Feature activation-2.09e+01

Pos logit contributions
an+0.189
asis+0.158
ans+0.157
ish+0.154
 island+0.149
Neg logit contributions
Well-0.207
Kids-0.203
Don-0.191
 She-0.188
Max-0.185
Token magn
Token ID7842
Feature activation-3.41e+01

Pos logit contributions
 up+0.160
an+0.140
 soaring+0.131
ishing+0.128
ish+0.127
Neg logit contributions
Kids-0.199
Okay-0.193
Well-0.193
Sorry-0.185
Don-0.184
Tokenifying
Token ID4035
Feature activation-5.45e+01

Pos logit contributions
 up+0.379
an+0.358
-+0.331
 crown+0.329
asis+0.328
Neg logit contributions
Well-0.242
Kids-0.240
Sorry-0.216
Don-0.213
Okay-0.207
Token glass
Token ID5405
Feature activation-3.60e+01

Pos logit contributions
 up+0.507
an+0.476
ish+0.442
ling+0.437
-+0.435
Neg logit contributions
Well-0.475
Kids-0.473
Uh-0.443
Sorry-0.441
Don-0.425
Token down
Token ID866
Feature activation-2.66e+01

Pos logit contributions
 up+0.242
an+0.223
 soaring+0.204
ish+0.192
-+0.177
Neg logit contributions
Kids-0.416
Well-0.400
Sorry-0.373
Okay-0.370
Don-0.370
Token,
Token ID11
Feature activation-1.22e+01

Pos logit contributions
 up+0.195
an+0.175
ish+0.162
-+0.152
lers+0.151
Neg logit contributions
Kids-0.309
Well-0.300
Don-0.283
Okay-0.280
Sorry-0.279
Token the
Token ID262
Feature activation-1.87e+01

Pos logit contributions
an+0.166
 up+0.159
ish+0.156
-+0.150
ishing+0.148
Neg logit contributions
Kids-0.064
Well-0.064
Don-0.060
Okay-0.049
Sorry-0.049
Token cater
Token ID20825
Feature activation-3.17e+01

Pos logit contributions
 up+0.154
an+0.128
ish+0.122
-+0.120
 soaring+0.117
Neg logit contributions
Kids-0.207
Well-0.197
Okay-0.188
Sorry-0.185
Don-0.184
Token he
Token ID339
Feature activation-2.68e+01

Pos logit contributions
 up+0.348
an+0.323
ish+0.306
 crown+0.283
-+0.283
Neg logit contributions
Kids-0.293
Well-0.278
Don-0.252
Okay-0.247
Sorry-0.246
Token would
Token ID561
Feature activation-2.22e+01

Pos logit contributions
 up+0.311
an+0.284
ish+0.270
-+0.266
 soaring+0.263
Neg logit contributions
Kids-0.195
Well-0.187
Okay-0.172
Don-0.171
Sorry-0.162
Token take
Token ID1011
Feature activation-2.62e+01

Pos logit contributions
 up+0.218
an+0.210
ish+0.196
lers+0.191
ishing+0.190
Neg logit contributions
Kids-0.179
Well-0.175
Okay-0.168
Don-0.162
Together-0.156
Token the
Token ID262
Feature activation-2.62e+01

Pos logit contributions
an+0.236
ish+0.211
 up+0.208
asis+0.202
 nest+0.191
Neg logit contributions
Kids-0.245
Well-0.234
Max-0.220
Don-0.215
Listen-0.212
Token av
Token ID1196
Feature activation-5.20e+01

Pos logit contributions
 up+0.202
an+0.182
-+0.165
ish+0.163
ishing+0.157
Neg logit contributions
Kids-0.265
Well-0.257
Okay-0.252
Don-0.245
Sorry-0.233
Tokenoc
Token ID420
Feature activation-5.45e+01

Pos logit contributions
 up+0.475
an+0.393
 soaring+0.389
 crown+0.387
 swimming+0.384
Neg logit contributions
Well-0.472
Okay-0.439
Together-0.437
Hmm-0.425
Exc-0.419
Tokenados
Token ID22484
Feature activation-3.60e+01

Pos logit contributions
 up+0.510
 soaring+0.430
 out+0.427
 underwater+0.424
 swimming+0.417
Neg logit contributions
Well-0.486
Sorry-0.448
Okay-0.444
oral-0.435
Kids-0.430
Token and
Token ID290
Feature activation-1.76e+01

Pos logit contributions
an+0.267
 up+0.257
 crown+0.228
 soaring+0.222
ish+0.220
Neg logit contributions
Well-0.368
Kids-0.352
Don-0.345
Okay-0.337
Together-0.325
Token scatter
Token ID41058
Feature activation-2.76e+01

Pos logit contributions
 up+0.154
an+0.153
ig+0.134
ish+0.132
-+0.128
Neg logit contributions
Well-0.165
Kids-0.161
Don-0.157
Okay-0.155
Sorry-0.154
Token them
Token ID606
Feature activation-2.64e+01

Pos logit contributions
an+0.220
 up+0.200
asis+0.194
ig+0.190
 soaring+0.186
Neg logit contributions
Well-0.267
Kids-0.260
Don-0.252
Wow-0.249
Okay-0.239
Token around
Token ID1088
Feature activation-2.37e+01

Pos logit contributions
an+0.165
asis+0.130
ish+0.124
ig+0.118
-+0.118
Neg logit contributions
Well-0.267
Don-0.261
Okay-0.250
 She-0.249
Together-0.248
Token his
Token ID465
Feature activation-3.69e+01

Pos logit contributions
Don+0.137
Listen+0.128
Well+0.126
 She+0.126
Kids+0.123
Neg logit contributions
asis-0.055
castle-0.045
an-0.044
 ig-0.038
eh-0.036
Token head
Token ID1182
Feature activation-3.87e+01

Pos logit contributions
 up+0.288
an+0.258
ish+0.249
-+0.240
ishing+0.239
Neg logit contributions
Kids-0.338
Well-0.330
Okay-0.327
Dad-0.313
Don-0.311
Token out
Token ID503
Feature activation-3.30e+01

Pos logit contributions
 up+0.270
an+0.256
ish+0.230
lers+0.216
 crown+0.214
Neg logit contributions
Kids-0.420
Well-0.413
Don-0.399
Sorry-0.395
Okay-0.395
Token of
Token ID286
Feature activation-2.87e+01

Pos logit contributions
an+0.255
 up+0.248
ish+0.241
asis+0.212
ig+0.205
Neg logit contributions
Kids-0.327
Well-0.325
Don-0.307
Sorry-0.294
Dad-0.294
Token his
Token ID465
Feature activation-3.16e+01

Pos logit contributions
an+0.265
ish+0.250
 up+0.238
asis+0.216
ig+0.215
Neg logit contributions
Kids-0.274
Well-0.251
Don-0.243
Dad-0.240
Listen-0.233
Token bur
Token ID4356
Feature activation-5.44e+01

Pos logit contributions
 up+0.239
an+0.223
ish+0.203
 soaring+0.197
ling+0.195
Neg logit contributions
Kids-0.285
Well-0.268
Dad-0.266
Okay-0.266
Daddy-0.264
Tokenrow
Token ID808
Feature activation-4.08e+01

Pos logit contributions
 up+0.478
 crown+0.446
 soaring+0.444
 statue+0.439
 palace+0.419
Neg logit contributions
Kids-0.310
Don-0.298
Daddy-0.288
Well-0.287
Brother-0.285
Token and
Token ID290
Feature activation-1.72e+01

Pos logit contributions
 up+0.274
an+0.271
ish+0.261
 crown+0.253
 soaring+0.247
Neg logit contributions
Kids-0.380
Well-0.380
 She-0.368
Don-0.366
Okay-0.359
Token saw
Token ID2497
Feature activation-2.29e+01

Pos logit contributions
an+0.167
 up+0.166
ish+0.159
ig+0.148
-+0.145
Neg logit contributions
Kids-0.146
Don-0.138
Well-0.138
Sorry-0.129
Okay-0.124
Token something
Token ID1223
Feature activation-2.44e+01

Pos logit contributions
an+0.185
 up+0.179
ish+0.178
ig+0.168
asis+0.167
Neg logit contributions
Well-0.223
Kids-0.222
Don-0.215
That-0.193
Um-0.192
Token new
Token ID649
Feature activation-2.60e+01

Pos logit contributions
 up+0.188
an+0.186
ish+0.177
ig+0.161
ishing+0.158
Neg logit contributions
Kids-0.253
Well-0.248
Don-0.239
That-0.224
Sorry-0.224
Token am
Token ID716
Feature activation-2.40e+01

Pos logit contributions
 up+0.249
an+0.219
-+0.207
ish+0.207
 soaring+0.203
Neg logit contributions
Kids-0.203
Well-0.194
Okay-0.180
Don-0.179
Sorry-0.177
Token taking
Token ID2263
Feature activation-2.69e+01

Pos logit contributions
an+0.221
 up+0.204
ish+0.200
lers+0.193
asis+0.187
Neg logit contributions
Kids-0.219
Well-0.209
Don-0.193
Um-0.192
Sorry-0.189
Token the
Token ID262
Feature activation-2.68e+01

Pos logit contributions
an+0.217
 up+0.207
ish+0.200
asis+0.196
ig+0.181
Neg logit contributions
Kids-0.272
Well-0.255
Don-0.245
Enough-0.243
Okay-0.239
Token rh
Token ID9529
Feature activation-4.46e+01

Pos logit contributions
 up+0.209
an+0.187
ish+0.173
-+0.172
ishing+0.171
Neg logit contributions
Kids-0.268
Well-0.249
Okay-0.247
Don-0.232
Listen-0.229
Tokenin
Token ID259
Feature activation-3.82e+01

Pos logit contributions
 up+0.436
 soaring+0.342
 out+0.339
 crown+0.339
 underwater+0.320
Neg logit contributions
Well-0.389
Sorry-0.388
Kids-0.371
Okay-0.367
oral-0.366
Tokenoc
Token ID420
Feature activation-5.36e+01

Pos logit contributions
 up+0.361
 soaring+0.318
eking+0.317
ishing+0.309
 crown+0.305
Neg logit contributions
Well-0.301
Okay-0.279
Sorry-0.278
Dad-0.269
Kids-0.269
Tokeneros
Token ID27498
Feature activation-3.51e+01

Pos logit contributions
 up+0.424
 out+0.366
 soaring+0.345
 crown+0.342
 anywhere+0.336
Neg logit contributions
Well-0.533
Sorry-0.521
Okay-0.506
oral-0.498
Kids-0.485
Token to
Token ID284
Feature activation-2.06e+01

Pos logit contributions
 up+0.231
an+0.222
 crown+0.211
ish+0.201
ig+0.200
Neg logit contributions
Kids-0.340
Well-0.329
Don-0.322
Sorry-0.322
Okay-0.308
Token a
Token ID257
Feature activation-2.48e+01

Pos logit contributions
 up+0.177
ish+0.166
an+0.166
lers+0.161
ig+0.154
Neg logit contributions
Kids-0.200
Well-0.189
Don-0.181
Dad-0.180
Brother-0.176
Token new
Token ID649
Feature activation-2.90e+01

Pos logit contributions
 up+0.202
an+0.178
ish+0.171
lers+0.169
ishing+0.163
Neg logit contributions
Kids-0.249
Okay-0.232
Well-0.229
Uh-0.228
Max-0.225
Token home
Token ID1363
Feature activation-2.80e+01

Pos logit contributions
 up+0.205
eking+0.187
lers+0.183
ling+0.180
ish+0.174
Neg logit contributions
Okay-0.281
Dad-0.273
Daddy-0.271
Kids-0.267
Grand-0.267
Token giving
Token ID3501
Feature activation-2.50e+01

Pos logit contributions
 up+0.186
ish+0.169
an+0.164
lers+0.159
ig+0.150
Neg logit contributions
Kids-0.226
Well-0.215
Don-0.204
That-0.201
Dad-0.198
Token him
Token ID683
Feature activation-2.48e+01

Pos logit contributions
 soaring+0.202
asis+0.196
an+0.194
ig+0.188
 island+0.183
Neg logit contributions
Kids-0.231
Don-0.217
Well-0.215
Okay-0.207
Dad-0.204
Token too
Token ID1165
Feature activation-2.09e+01

Pos logit contributions
 soaring+0.171
 up+0.165
an+0.165
ig+0.156
asis+0.152
Neg logit contributions
Kids-0.220
Well-0.208
Don-0.208
Sorry-0.207
Okay-0.206
Token many
Token ID867
Feature activation-2.38e+01

Pos logit contributions
 up+0.213
an+0.195
ish+0.188
 crown+0.179
 soaring+0.176
Neg logit contributions
Kids-0.183
Well-0.176
Okay-0.165
Don-0.164
Sorry-0.163
Token ch
Token ID442
Feature activation-3.14e+01

Pos logit contributions
 up+0.185
an+0.171
ish+0.161
ig+0.144
asis+0.140
Neg logit contributions
Kids-0.235
Well-0.229
Okay-0.217
Don-0.215
Sorry-0.211
Tokenocol
Token ID4668
Feature activation-5.32e+01

Pos logit contributions
 up+0.312
 out+0.240
 soaring+0.231
an+0.229
 anywhere+0.226
Neg logit contributions
Kids-0.282
Well-0.265
Dad-0.251
Ben-0.243
Sorry-0.240
Tokenates
Token ID689
Feature activation-3.27e+01

Pos logit contributions
 up+0.460
 out+0.338
 crown+0.323
 way+0.317
 nest+0.316
Neg logit contributions
Kids-0.512
Dad-0.511
Well-0.510
Sorry-0.495
Okay-0.472
Token."
Token ID526
Feature activation-6.89e+00

Pos logit contributions
 up+0.218
an+0.216
 soaring+0.193
ish+0.184
 crown+0.183
Neg logit contributions
Kids-0.328
Well-0.327
Don-0.306
Sorry-0.306
Okay-0.304
Token The
Token ID383
Feature activation-1.34e+01

Pos logit contributions
 up+0.073
 home+0.061
-+0.059
 out+0.058
an+0.057
Neg logit contributions
Kids-0.059
Well-0.055
Okay-0.055
Uh-0.055
Enough-0.054
Token king
Token ID5822
Feature activation-1.99e+01

Pos logit contributions
 up+0.121
ish+0.098
lers+0.098
an+0.098
-+0.094
Neg logit contributions
Kids-0.140
Well-0.131
Okay-0.125
Sorry-0.124
Don-0.122
Token learned
Token ID4499
Feature activation-1.42e+01

Pos logit contributions
 up+0.222
an+0.191
 soaring+0.188
 out+0.187
 home+0.187
Neg logit contributions
Kids-0.157
Well-0.154
Okay-0.136
Don-0.133
Sorry-0.132
Token you
Token ID345
Feature activation-3.32e+01

Pos logit contributions
Kids+0.233
Well+0.222
Okay+0.211
Sorry+0.208
Don+0.205
Neg logit contributions
 up-0.148
an-0.110
 out-0.110
--0.109
 home-0.108
Token go
Token ID467
Feature activation-3.00e+01

Pos logit contributions
 up+0.313
an+0.296
ish+0.271
lers+0.266
-+0.263
Neg logit contributions
Kids-0.307
Well-0.294
Okay-0.277
Don-0.274
Sorry-0.269
Token,
Token ID11
Feature activation-1.52e+01

Pos logit contributions
an+0.232
ish+0.216
rier+0.200
ery+0.196
 crown+0.195
Neg logit contributions
Kids-0.291
Well-0.253
Listen-0.252
Sorry-0.250
 She-0.243
Token my
Token ID616
Feature activation-2.65e+01

Pos logit contributions
 up+0.153
an+0.142
lers+0.141
ish+0.136
-+0.133
Neg logit contributions
Kids-0.134
Well-0.123
Don-0.118
Okay-0.117
Dad-0.111
Token d
Token ID288
Feature activation-3.65e+01

Pos logit contributions
 up+0.221
an+0.180
lers+0.179
 soaring+0.174
ling+0.172
Neg logit contributions
Kids-0.269
Well-0.258
Okay-0.258
Sorry-0.242
Um-0.236
Tokenarl
Token ID7063
Feature activation-5.29e+01

Pos logit contributions
 up+0.462
an+0.398
 soaring+0.392
lers+0.385
 crown+0.385
Neg logit contributions
Kids-0.254
Well-0.229
Okay-0.211
Sorry-0.208
Enough-0.194
Tokenings
Token ID654
Feature activation-3.17e+01

Pos logit contributions
 up+0.430
 soaring+0.371
an+0.343
 underwater+0.342
 crown+0.339
Neg logit contributions
Kids-0.543
Well-0.539
Okay-0.514
Sorry-0.497
Dad-0.474
Token.
Token ID13
Feature activation-1.24e+01

Pos logit contributions
 up+0.210
an+0.164
ish+0.153
-+0.153
 crown+0.146
Neg logit contributions
Kids-0.410
Well-0.392
Okay-0.372
Don-0.371
Sorry-0.370
Token Enjoy
Token ID18179
Feature activation-2.42e+01

Pos logit contributions
 up+0.134
 out+0.113
an+0.112
ish+0.110
-+0.109
Neg logit contributions
Kids-0.102
Well-0.095
Sorry-0.090
Okay-0.090
Enough-0.089
Token your
Token ID534
Feature activation-2.53e+01

Pos logit contributions
 soaring+0.186
an+0.178
 up+0.172
ish+0.172
lers+0.165
Neg logit contributions
Kids-0.244
Dad-0.226
Ben-0.224
Okay-0.212
Well-0.208
Token lunch
Token ID9965
Feature activation-3.30e+01

Pos logit contributions
 up+0.207
an+0.178
ish+0.167
ling+0.166
 out+0.163
Neg logit contributions
Kids-0.249
Okay-0.234
Sorry-0.232
Well-0.225
Dad-0.220
Token give
Token ID1577
Feature activation-2.70e+01

Pos logit contributions
 up+0.213
an+0.206
ish+0.196
-+0.196
lers+0.189
Neg logit contributions
Kids-0.202
Well-0.190
Don-0.187
Okay-0.180
Ben-0.178
Token to
Token ID284
Feature activation-2.04e+01

Pos logit contributions
an+0.217
asis+0.212
 soaring+0.203
ig+0.198
 up+0.197
Neg logit contributions
Kids-0.263
Don-0.245
Well-0.240
Sorry-0.226
Okay-0.225
Token the
Token ID262
Feature activation-2.44e+01

Pos logit contributions
 up+0.187
an+0.177
ish+0.165
-+0.159
ling+0.156
Neg logit contributions
Kids-0.193
Well-0.175
Don-0.167
Okay-0.162
Brother-0.159
Token rh
Token ID9529
Feature activation-4.41e+01

Pos logit contributions
 up+0.187
an+0.155
-+0.147
ish+0.146
lers+0.143
Neg logit contributions
Kids-0.270
Well-0.264
Okay-0.254
Don-0.253
Sorry-0.247
Tokenin
Token ID259
Feature activation-3.67e+01

Pos logit contributions
 up+0.408
 out+0.320
 soaring+0.320
 underwater+0.291
 crown+0.290
Neg logit contributions
Well-0.427
Kids-0.424
Sorry-0.420
Okay-0.407
oral-0.403
Tokenoc
Token ID420
Feature activation-5.28e+01

Pos logit contributions
 up+0.308
 soaring+0.256
an+0.246
ishing+0.245
eking+0.244
Neg logit contributions
Well-0.363
Kids-0.350
Okay-0.335
Don-0.331
Sorry-0.324
Tokeneros
Token ID27498
Feature activation-3.36e+01

Pos logit contributions
 up+0.393
 out+0.332
 soaring+0.316
 flying+0.292
 crown+0.288
Neg logit contributions
Well-0.569
Sorry-0.556
Kids-0.550
Okay-0.549
Enough-0.530
Token.
Token ID13
Feature activation-1.03e+01

Pos logit contributions
 up+0.232
an+0.215
 soaring+0.193
ish+0.192
ishing+0.186
Neg logit contributions
Kids-0.357
Well-0.347
Don-0.332
Okay-0.324
Sorry-0.318
Token Lily
Token ID20037
Feature activation-1.83e+01

Pos logit contributions
 up+0.118
 out+0.098
 home+0.097
 soaring+0.093
an+0.093
Neg logit contributions
Kids-0.080
Sorry-0.072
Okay-0.071
Enough-0.071
Well-0.070
Token walked
Token ID6807
Feature activation-2.24e+01

Pos logit contributions
 up+0.169
an+0.144
-+0.137
ish+0.137
 crown+0.134
Neg logit contributions
Kids-0.187
Well-0.177
Okay-0.165
Don-0.165
Sorry-0.163
Token to
Token ID284
Feature activation-1.53e+01

Pos logit contributions
 up+0.124
an+0.116
ish+0.113
lers+0.106
 soaring+0.101
Neg logit contributions
Kids-0.290
Well-0.278
Okay-0.269
Don-0.269
Sorry-0.261
Token quiet
Token ID5897
Feature activation-4.27e+01

Pos logit contributions
Kids+0.081
Uh+0.078
Well+0.077
Don+0.073
Okay+0.073
Neg logit contributions
 up-0.061
an-0.054
lers-0.050
kel-0.049
ans-0.049
Token compared
Token ID3688
Feature activation-3.42e+01

Pos logit contributions
 up+0.311
ish+0.284
an+0.281
ig+0.261
 soaring+0.259
Neg logit contributions
Kids-0.465
Well-0.420
Okay-0.410
Don-0.404
Sorry-0.394
Token to
Token ID284
Feature activation-2.22e+01

Pos logit contributions
 up+0.265
an+0.264
ish+0.248
asis+0.244
ig+0.227
Neg logit contributions
Kids-0.362
Well-0.342
Don-0.330
Okay-0.314
Listen-0.309
Token the
Token ID262
Feature activation-2.64e+01

Pos logit contributions
 up+0.193
an+0.181
ish+0.172
lers+0.159
-+0.158
Neg logit contributions
Kids-0.230
Well-0.215
Don-0.205
Okay-0.199
Listen-0.195
Token noisy
Token ID31210
Feature activation-3.34e+01

Pos logit contributions
 up+0.211
an+0.186
ish+0.180
-+0.173
lers+0.170
Neg logit contributions
Kids-0.276
Well-0.263
Okay-0.259
Don-0.254
Enough-0.244
Token se
Token ID384
Feature activation-5.28e+01

Pos logit contributions
 up+0.277
an+0.232
ish+0.225
 soaring+0.218
 crown+0.209
Neg logit contributions
Kids-0.363
Well-0.326
Okay-0.321
Sorry-0.310
Don-0.309
Tokenag
Token ID363
Feature activation-3.95e+01

Pos logit contributions
 up+0.435
 soaring+0.342
 nest+0.335
 flying+0.329
 inside+0.323
Neg logit contributions
Kids-0.568
Okay-0.550
Well-0.533
Sorry-0.512
oral-0.504
Tokenull
Token ID724
Feature activation-3.20e+01

Pos logit contributions
 up+0.321
 crown+0.275
ish+0.274
 nest+0.256
lers+0.252
Neg logit contributions
Kids-0.417
Well-0.397
Don-0.362
Dad-0.360
That-0.358
Tokens
Token ID82
Feature activation-2.62e+01

Pos logit contributions
 up+0.253
 seas+0.227
an+0.218
 crown+0.215
 island+0.211
Neg logit contributions
Well-0.293
Kids-0.289
Don-0.281
That-0.268
Sorry-0.266
Token outside
Token ID2354
Feature activation-2.80e+01

Pos logit contributions
 up+0.179
an+0.164
ish+0.150
asis+0.140
 crown+0.137
Neg logit contributions
Well-0.282
Kids-0.276
Okay-0.275
Don-0.273
Sorry-0.266
Token.
Token ID13
Feature activation-9.46e+00

Pos logit contributions
 up+0.185
an+0.169
ish+0.156
 crown+0.145
lers+0.145
Neg logit contributions
Kids-0.336
Well-0.323
Don-0.311
Sorry-0.305
Okay-0.305
Token you
Token ID345
Feature activation-2.93e+01

Pos logit contributions
 up+0.280
an+0.270
ish+0.256
lers+0.240
ishing+0.226
Neg logit contributions
Kids-0.344
Well-0.335
Don-0.322
Listen-0.304
 She-0.291
Token carry
Token ID3283
Feature activation-3.44e+01

Pos logit contributions
 up+0.284
an+0.256
ish+0.239
-+0.236
lers+0.231
Neg logit contributions
Kids-0.275
Well-0.262
Don-0.244
Okay-0.244
Sorry-0.238
Token a
Token ID257
Feature activation-3.14e+01

Pos logit contributions
an+0.242
 up+0.223
 soaring+0.214
asis+0.213
ish+0.212
Neg logit contributions
Kids-0.356
Well-0.346
Don-0.329
Okay-0.317
Dad-0.311
Token l
Token ID300
Feature activation-3.91e+01

Pos logit contributions
 up+0.273
an+0.240
ping+0.224
 soaring+0.224
lers+0.220
Neg logit contributions
Kids-0.291
Okay-0.284
Well-0.272
Uh-0.271
Sure-0.265
Tokenily
Token ID813
Feature activation-3.07e+01

Pos logit contributions
 up+0.444
an+0.374
 soaring+0.353
ish+0.351
lers+0.349
Neg logit contributions
Well-0.313
Kids-0.308
Okay-0.287
Sorry-0.272
Don-0.265
Token pad
Token ID14841
Feature activation-5.25e+01

Pos logit contributions
 up+0.223
an+0.208
 soaring+0.188
asis+0.183
 crown+0.183
Neg logit contributions
Well-0.304
Kids-0.303
Don-0.292
Okay-0.290
Em-0.280
Token too
Token ID1165
Feature activation-2.36e+01

Pos logit contributions
 up+0.425
an+0.393
ling+0.367
lers+0.365
cking+0.363
Neg logit contributions
Well-0.499
Okay-0.475
Sorry-0.450
Kids-0.449
Don-0.449
Token?"
Token ID1701
Feature activation-1.02e+01

Pos logit contributions
 up+0.181
an+0.156
ish+0.146
 crown+0.140
lers+0.139
Neg logit contributions
Kids-0.271
Well-0.263
Okay-0.249
Don-0.248
Sorry-0.245
Token
Token ID220
Feature activation-2.12e+01

Pos logit contributions
 up+0.116
-+0.096
 home+0.095
 out+0.094
an+0.092
Neg logit contributions
Kids-0.084
Well-0.076
Okay-0.074
Sorry-0.072
Uh-0.072
Token
Token ID198
Feature activation-9.98e+00

Pos logit contributions
 up+0.260
an+0.224
-+0.219
ish+0.216
 out+0.215
Neg logit contributions
Kids-0.158
Well-0.145
Okay-0.135
Sorry-0.134
Don-0.131
TokenThe
Token ID464
Feature activation-1.55e+01

Pos logit contributions
 up+0.099
-+0.083
 home+0.080
an+0.079
 out+0.075
Neg logit contributions
Kids-0.087
Well-0.082
Uh-0.082
Sorry-0.082
Okay-0.081
Token as
Token ID355
Feature activation-1.85e+01

Pos logit contributions
 up+0.195
an+0.185
ish+0.176
ig+0.155
 crown+0.155
Neg logit contributions
Kids-0.309
Well-0.298
Okay-0.284
Don-0.280
Together-0.272
Token he
Token ID339
Feature activation-1.96e+01

Pos logit contributions
 up+0.204
an+0.199
ish+0.193
lers+0.180
asis+0.174
Neg logit contributions
Kids-0.142
Well-0.135
Don-0.126
Okay-0.122
Sorry-0.120
Token opened
Token ID4721
Feature activation-2.54e+01

Pos logit contributions
 up+0.221
an+0.192
ish+0.185
-+0.183
 crown+0.181
Neg logit contributions
Kids-0.164
Well-0.154
Okay-0.141
Don-0.140
Sorry-0.139
Token the
Token ID262
Feature activation-2.04e+01

Pos logit contributions
an+0.219
asis+0.205
ish+0.196
ig+0.175
 up+0.174
Neg logit contributions
Kids-0.220
Well-0.213
Don-0.202
 She-0.200
Dad-0.197
Token ch
Token ID442
Feature activation-3.11e+01

Pos logit contributions
 up+0.169
an+0.153
ish+0.146
-+0.145
 soaring+0.143
Neg logit contributions
Kids-0.200
Well-0.190
Okay-0.180
Sorry-0.176
Uh-0.176
Tokenocol
Token ID4668
Feature activation-5.24e+01

Pos logit contributions
 up+0.293
 anywhere+0.227
 out+0.217
 soaring+0.206
 inside+0.206
Neg logit contributions
Kids-0.294
Well-0.272
Sorry-0.255
Uh-0.255
Okay-0.250
Tokenates
Token ID689
Feature activation-3.07e+01

Pos logit contributions
 up+0.466
 out+0.329
 anywhere+0.324
 nest+0.312
 crown+0.312
Neg logit contributions
Well-0.504
Dad-0.497
Kids-0.497
Sorry-0.467
oral-0.462
Token and
Token ID290
Feature activation-1.32e+01

Pos logit contributions
an+0.215
 up+0.202
 crown+0.186
ish+0.178
 soaring+0.174
Neg logit contributions
Well-0.306
Kids-0.303
Don-0.284
Dad-0.283
That-0.281
Token started
Token ID2067
Feature activation-2.07e+01

Pos logit contributions
an+0.126
 up+0.123
ish+0.115
-+0.112
ig+0.111
Neg logit contributions
Kids-0.115
Well-0.112
Don-0.107
Sorry-0.105
Okay-0.102
Token eating
Token ID6600
Feature activation-2.29e+01

Pos logit contributions
an+0.173
 up+0.157
ish+0.156
asis+0.143
ig+0.137
Neg logit contributions
Kids-0.211
Well-0.200
Don-0.195
Listen-0.186
Sorry-0.183
Token them
Token ID606
Feature activation-2.11e+01

Pos logit contributions
an+0.193
ish+0.164
 up+0.163
asis+0.158
ig+0.158
Neg logit contributions
Well-0.210
Kids-0.210
Don-0.203
Dad-0.196
Okay-0.195
Token Everyday
Token ID48154
Feature activation-3.34e+01

Pos logit contributions
Kids+0.149
Well+0.141
Okay+0.132
Don+0.131
Sorry+0.131
Neg logit contributions
 up-0.140
an-0.118
--0.113
ish-0.113
 crown-0.110
Token he
Token ID339
Feature activation-2.68e+01

Pos logit contributions
 up+0.348
an+0.323
ish+0.306
 crown+0.283
-+0.283
Neg logit contributions
Kids-0.293
Well-0.278
Don-0.252
Okay-0.247
Sorry-0.246
Token would
Token ID561
Feature activation-2.22e+01

Pos logit contributions
 up+0.311
an+0.284
ish+0.270
-+0.266
 soaring+0.263
Neg logit contributions
Kids-0.195
Well-0.187
Okay-0.172
Don-0.171
Sorry-0.162
Token take
Token ID1011
Feature activation-2.62e+01

Pos logit contributions
 up+0.218
an+0.210
ish+0.196
lers+0.191
ishing+0.190
Neg logit contributions
Kids-0.179
Well-0.175
Okay-0.168
Don-0.162
Together-0.156
Token the
Token ID262
Feature activation-2.62e+01

Pos logit contributions
an+0.236
ish+0.211
 up+0.208
asis+0.202
 nest+0.191
Neg logit contributions
Kids-0.245
Well-0.234
Max-0.220
Don-0.215
Listen-0.212
Token av
Token ID1196
Feature activation-5.20e+01

Pos logit contributions
 up+0.202
an+0.182
-+0.165
ish+0.163
ishing+0.157
Neg logit contributions
Kids-0.265
Well-0.257
Okay-0.252
Don-0.245
Sorry-0.233
Tokenoc
Token ID420
Feature activation-5.45e+01

Pos logit contributions
 up+0.475
an+0.393
 soaring+0.389
 crown+0.387
 swimming+0.384
Neg logit contributions
Well-0.472
Okay-0.439
Together-0.437
Hmm-0.425
Exc-0.419
Tokenados
Token ID22484
Feature activation-3.60e+01

Pos logit contributions
 up+0.510
 soaring+0.430
 out+0.427
 underwater+0.424
 swimming+0.417
Neg logit contributions
Well-0.486
Sorry-0.448
Okay-0.444
oral-0.435
Kids-0.430
Token and
Token ID290
Feature activation-1.76e+01

Pos logit contributions
an+0.267
 up+0.257
 crown+0.228
 soaring+0.222
ish+0.220
Neg logit contributions
Well-0.368
Kids-0.352
Don-0.345
Okay-0.337
Together-0.325
Token scatter
Token ID41058
Feature activation-2.76e+01

Pos logit contributions
 up+0.154
an+0.153
ig+0.134
ish+0.132
-+0.128
Neg logit contributions
Well-0.165
Kids-0.161
Don-0.157
Okay-0.155
Sorry-0.154
Token them
Token ID606
Feature activation-2.64e+01

Pos logit contributions
an+0.220
 up+0.200
asis+0.194
ig+0.190
 soaring+0.186
Neg logit contributions
Well-0.267
Kids-0.260
Don-0.252
Wow-0.249
Okay-0.239
Token it
Token ID340
Feature activation-2.76e+01

Pos logit contributions
an+0.255
asis+0.251
 up+0.243
 soaring+0.239
 island+0.228
Neg logit contributions
Kids-0.258
Well-0.237
Don-0.220
Okay-0.213
Dad-0.206
Token to
Token ID284
Feature activation-1.82e+01

Pos logit contributions
an+0.207
asis+0.195
ig+0.191
 up+0.189
ish+0.188
Neg logit contributions
Kids-0.247
Don-0.237
Okay-0.233
Well-0.233
Sorry-0.230
Token the
Token ID262
Feature activation-2.33e+01

Pos logit contributions
 up+0.184
an+0.167
ish+0.162
-+0.156
lers+0.156
Neg logit contributions
Kids-0.167
Well-0.153
Don-0.141
Okay-0.140
Sorry-0.138
Token rh
Token ID9529
Feature activation-4.21e+01

Pos logit contributions
 up+0.186
an+0.157
ish+0.154
-+0.154
lers+0.154
Neg logit contributions
Kids-0.251
Well-0.238
Okay-0.233
Sorry-0.228
Don-0.227
Tokenin
Token ID259
Feature activation-3.49e+01

Pos logit contributions
 up+0.389
 out+0.306
 soaring+0.304
 crown+0.283
 underwater+0.276
Neg logit contributions
Well-0.407
Kids-0.403
Sorry-0.399
Okay-0.385
oral-0.382
Tokenoc
Token ID420
Feature activation-5.13e+01

Pos logit contributions
 up+0.300
 soaring+0.249
an+0.233
 crown+0.231
 out+0.229
Neg logit contributions
Well-0.337
Kids-0.334
Okay-0.321
Sorry-0.308
Brother-0.305
Tokeneros
Token ID27498
Feature activation-3.24e+01

Pos logit contributions
 up+0.384
 out+0.332
 soaring+0.316
 crown+0.293
 flying+0.293
Neg logit contributions
Well-0.547
Sorry-0.528
Kids-0.521
Okay-0.520
oral-0.503
Token.
Token ID13
Feature activation-9.36e+00

Pos logit contributions
 up+0.220
an+0.196
 soaring+0.180
ish+0.178
 crown+0.173
Neg logit contributions
Kids-0.349
Well-0.342
Don-0.323
Okay-0.322
Sorry-0.318
Token When
Token ID1649
Feature activation-1.45e+01

Pos logit contributions
 up+0.101
 home+0.084
 out+0.083
an+0.080
-+0.080
Neg logit contributions
Kids-0.077
Sorry-0.070
Okay-0.070
Enough-0.069
Well-0.068
Token he
Token ID339
Feature activation-1.87e+01

Pos logit contributions
 up+0.135
an+0.116
ish+0.111
-+0.110
 crown+0.106
Neg logit contributions
Kids-0.149
Well-0.141
Sorry-0.131
Don-0.131
Okay-0.131
Token tasted
Token ID29187
Feature activation-2.26e+01

Pos logit contributions
 up+0.195
an+0.170
ish+0.162
-+0.158
lers+0.157
Neg logit contributions
Kids-0.171
Well-0.161
Don-0.148
Sorry-0.147
Okay-0.146
Token flavors
Token ID17361
Feature activation-3.92e+01

Pos logit contributions
an+0.287
 up+0.283
 soaring+0.265
 crown+0.241
lers+0.234
Neg logit contributions
Well-0.460
Kids-0.454
Sorry-0.446
Okay-0.439
Dad-0.420
Token they
Token ID484
Feature activation-2.53e+01

Pos logit contributions
 up+0.281
an+0.277
 soaring+0.251
 crown+0.246
lers+0.233
Neg logit contributions
Well-0.427
Kids-0.418
Okay-0.399
Sorry-0.396
Dad-0.386
Token liked
Token ID8288
Feature activation-2.61e+01

Pos logit contributions
 up+0.287
an+0.259
-+0.245
ish+0.242
 soaring+0.239
Neg logit contributions
Kids-0.199
Well-0.191
Okay-0.175
Don-0.173
Sorry-0.171
Token and
Token ID290
Feature activation-1.62e+01

Pos logit contributions
an+0.187
 up+0.186
ish+0.166
ig+0.155
lers+0.150
Neg logit contributions
Kids-0.287
Well-0.277
Okay-0.260
Don-0.257
Dad-0.255
Token what
Token ID644
Feature activation-1.98e+01

Pos logit contributions
 up+0.166
an+0.156
ish+0.147
-+0.142
ig+0.141
Neg logit contributions
Kids-0.141
Well-0.135
Don-0.126
Okay-0.124
Sorry-0.121
Token topp
Token ID23126
Feature activation-5.02e+01

Pos logit contributions
 up+0.213
an+0.182
-+0.177
ish+0.175
 crown+0.172
Neg logit contributions
Kids-0.176
Well-0.165
Okay-0.152
Sorry-0.151
Don-0.151
Tokenings
Token ID654
Feature activation-3.08e+01

Pos logit contributions
 soaring+0.398
 up+0.384
 flying+0.367
 nest+0.364
 home+0.347
Neg logit contributions
Kids-0.457
Well-0.454
Sorry-0.446
Dad-0.444
Okay-0.442
Token would
Token ID561
Feature activation-2.13e+01

Pos logit contributions
 up+0.283
an+0.240
ish+0.228
-+0.228
 crown+0.221
Neg logit contributions
Kids-0.320
Well-0.303
Okay-0.282
Don-0.281
Sorry-0.281
Token go
Token ID467
Feature activation-2.35e+01

Pos logit contributions
 up+0.213
an+0.190
ish+0.181
-+0.178
lers+0.178
Neg logit contributions
Kids-0.198
Well-0.191
Okay-0.179
Sorry-0.177
Don-0.176
Token best
Token ID1266
Feature activation-2.43e+01

Pos logit contributions
 up+0.138
an+0.132
ish+0.117
 crown+0.116
 soaring+0.113
Neg logit contributions
Kids-0.290
Well-0.278
Sorry-0.269
Don-0.267
Okay-0.265
Token.
Token ID13
Feature activation-8.60e+00

Pos logit contributions
 up+0.184
an+0.163
 soaring+0.150
ish+0.142
lers+0.140
Neg logit contributions
Kids-0.269
Well-0.254
Okay-0.249
Sorry-0.242
Dad-0.240
Token He
Token ID679
Feature activation-2.61e+01

Pos logit contributions
Kids+0.172
Sorry+0.147
Well+0.147
Uh+0.142
Okay+0.139
Neg logit contributions
 up-0.256
--0.211
an-0.209
 home-0.207
 out-0.206
Token touched
Token ID12615
Feature activation-2.89e+01

Pos logit contributions
 up+0.282
an+0.256
ish+0.247
-+0.243
 soaring+0.239
Neg logit contributions
Kids-0.203
Well-0.198
Don-0.182
Okay-0.181
Sorry-0.174
Token the
Token ID262
Feature activation-2.63e+01

Pos logit contributions
an+0.252
asis+0.248
 up+0.229
 nest+0.225
 island+0.223
Neg logit contributions
Well-0.262
Kids-0.254
Don-0.249
That-0.234
Wow-0.233
Token brightly
Token ID35254
Feature activation-3.87e+01

Pos logit contributions
 up+0.187
ishing+0.170
an+0.166
-+0.162
ig+0.160
Neg logit contributions
Well-0.246
Don-0.246
Kids-0.244
Okay-0.239
Uh-0.235
Token-
Token ID12
Feature activation-4.37e+01

Pos logit contributions
 soaring+0.233
 home+0.232
 Majesty+0.231
 kingdom+0.231
 up+0.227
Neg logit contributions
Well-0.397
That-0.387
Kids-0.385
Wow-0.380
Don-0.365
Tokencolored
Token ID25717
Feature activation-5.00e+01

Pos logit contributions
 up+0.391
 soaring+0.339
an+0.327
ish+0.322
 home+0.322
Neg logit contributions
Well-0.439
Kids-0.438
Don-0.412
Okay-0.411
Sorry-0.403
Token pet
Token ID4273
Feature activation-3.68e+01

Pos logit contributions
 up+0.374
 soaring+0.330
eking+0.319
an+0.317
cking+0.310
Neg logit contributions
Kids-0.478
Well-0.453
Okay-0.448
Wow-0.443
Sorry-0.432
Tokenals
Token ID874
Feature activation-4.06e+01

Pos logit contributions
 up+0.334
an+0.292
 soaring+0.271
 home+0.265
ish+0.264
Neg logit contributions
Kids-0.367
Well-0.351
Don-0.326
Okay-0.324
Sorry-0.323
Token,
Token ID11
Feature activation-1.61e+01

Pos logit contributions
an+0.278
 up+0.273
ish+0.239
 home+0.229
asis+0.228
Neg logit contributions
Well-0.432
Kids-0.423
Don-0.416
Sorry-0.399
That-0.398
Token and
Token ID290
Feature activation-1.53e+01

Pos logit contributions
an+0.129
 up+0.126
ish+0.120
asis+0.117
-+0.114
Neg logit contributions
Don-0.165
Well-0.162
Kids-0.159
That-0.147
Sorry-0.145
Token as
Token ID355
Feature activation-1.88e+01

Pos logit contributions
an+0.145
 up+0.142
ish+0.133
asis+0.129
-+0.125
Neg logit contributions
Well-0.133
Kids-0.132
Don-0.129
Sorry-0.118
Okay-0.116
Token there
Token ID612
Feature activation-2.32e+01

Pos logit contributions
 up+0.169
an+0.160
ish+0.147
lers+0.134
 crown+0.133
Neg logit contributions
Kids-0.297
Well-0.287
Don-0.271
Sorry-0.259
Okay-0.257
Token was
Token ID373
Feature activation-2.20e+01

Pos logit contributions
 up+0.213
an+0.182
ish+0.173
-+0.173
 crown+0.169
Neg logit contributions
Kids-0.240
Well-0.227
Don-0.212
Okay-0.212
Sorry-0.210
Token a
Token ID257
Feature activation-2.20e+01

Pos logit contributions
 up+0.158
an+0.135
ish+0.128
-+0.121
lers+0.118
Neg logit contributions
Kids-0.272
Well-0.261
Don-0.248
Okay-0.246
Sorry-0.244
Token rh
Token ID9529
Feature activation-4.05e+01

Pos logit contributions
 up+0.176
an+0.144
ish+0.136
-+0.135
 out+0.130
Neg logit contributions
Kids-0.255
Well-0.243
Okay-0.230
Sorry-0.228
Don-0.228
Tokenin
Token ID259
Feature activation-3.49e+01

Pos logit contributions
 up+0.374
 out+0.281
 soaring+0.280
 feathers+0.270
 home+0.269
Neg logit contributions
Well-0.379
Kids-0.379
Sorry-0.377
Dad-0.360
Ben-0.358
Tokenoc
Token ID420
Feature activation-4.99e+01

Pos logit contributions
 up+0.347
 soaring+0.283
 out+0.274
ishing+0.266
 underwater+0.258
Neg logit contributions
Well-0.313
Kids-0.299
Okay-0.286
Dad-0.281
Sorry-0.279
Tokeneros
Token ID27498
Feature activation-3.29e+01

Pos logit contributions
 up+0.400
 out+0.330
 soaring+0.319
 anywhere+0.311
 underwater+0.295
Neg logit contributions
Well-0.475
Sorry-0.466
Okay-0.454
Kids-0.433
Um-0.430
Token who
Token ID508
Feature activation-2.26e+01

Pos logit contributions
 up+0.286
an+0.245
ish+0.239
rier+0.230
ishing+0.229
Neg logit contributions
Well-0.302
Kids-0.301
Don-0.278
Sorry-0.269
Okay-0.264
Token was
Token ID373
Feature activation-2.47e+01

Pos logit contributions
 up+0.245
an+0.213
ish+0.206
-+0.203
 crown+0.201
Neg logit contributions
Kids-0.194
Well-0.186
Don-0.170
Okay-0.169
Sorry-0.167
Token always
Token ID1464
Feature activation-2.73e+01

Pos logit contributions
an+0.215
ish+0.211
ig+0.206
 up+0.204
ishing+0.202
Neg logit contributions
Well-0.203
Don-0.195
Grand-0.195
Kids-0.192
Um-0.190
Token naughty
Token ID45128
Feature activation-2.48e+01

Pos logit contributions
an+0.226
ish+0.213
ig+0.212
ans+0.199
lers+0.187
Neg logit contributions
Don-0.262
Well-0.237
That-0.228
Kids-0.223
Thank-0.222
Token's
Token ID338
Feature activation-2.27e+01

Pos logit contributions
 up+0.180
an+0.172
 soaring+0.170
ishing+0.157
-+0.156
Neg logit contributions
Kids-0.239
Well-0.229
Okay-0.223
Sorry-0.218
Don-0.214
Token wonderful
Token ID7932
Feature activation-2.88e+01

Pos logit contributions
 up+0.182
an+0.150
-+0.142
ish+0.141
 crown+0.137
Neg logit contributions
Kids-0.265
Well-0.253
Okay-0.238
Don-0.237
Sorry-0.237
Token,
Token ID11
Feature activation-1.32e+01

Pos logit contributions
 up+0.201
an+0.201
 soaring+0.195
asis+0.187
 flying+0.170
Neg logit contributions
Kids-0.297
Well-0.289
Okay-0.284
 She-0.264
Dad-0.264
Token my
Token ID616
Feature activation-2.52e+01

Pos logit contributions
 up+0.154
an+0.137
ish+0.132
-+0.131
lers+0.130
Neg logit contributions
Kids-0.105
Well-0.097
Okay-0.089
Don-0.088
Sorry-0.087
Token d
Token ID288
Feature activation-3.45e+01

Pos logit contributions
 up+0.221
an+0.190
lers+0.180
 soaring+0.179
 out+0.174
Neg logit contributions
Kids-0.241
Well-0.238
Okay-0.234
Sorry-0.223
Um-0.213
Tokenarl
Token ID7063
Feature activation-4.97e+01

Pos logit contributions
 up+0.406
an+0.352
-+0.339
ish+0.336
 crown+0.335
Neg logit contributions
Kids-0.276
Well-0.255
Okay-0.233
Sorry-0.231
Don-0.227
Tokenings
Token ID654
Feature activation-3.04e+01

Pos logit contributions
 up+0.355
 soaring+0.289
an+0.280
 crown+0.275
 underwater+0.268
Neg logit contributions
Kids-0.589
Well-0.575
Okay-0.548
Sorry-0.533
Don-0.516
Token!"
Token ID2474
Feature activation-8.62e+00

Pos logit contributions
 up+0.194
-+0.137
an+0.135
 out+0.133
ish+0.127
Neg logit contributions
Kids-0.410
Well-0.392
Okay-0.374
Sorry-0.371
Don-0.370
Token Mom
Token ID11254
Feature activation-1.79e+01

Pos logit contributions
 up+0.098
 out+0.082
-+0.081
 home+0.080
an+0.079
Neg logit contributions
Kids-0.072
Well-0.066
Okay-0.064
Sorry-0.062
Enough-0.061
Tokenmy
Token ID1820
Feature activation-2.11e+01

Pos logit contributions
 up+0.200
an+0.178
-+0.171
ish+0.170
 crown+0.168
Neg logit contributions
Kids-0.148
Well-0.137
Don-0.124
Okay-0.124
Sorry-0.123
Token says
Token ID1139
Feature activation-1.57e+01

Pos logit contributions
 up+0.256
an+0.231
ish+0.223
-+0.221
 home+0.218
Neg logit contributions
Kids-0.156
Well-0.142
Don-0.127
Sorry-0.125
Okay-0.125
Token was
Token ID373
Feature activation+5.58e+00

Pos logit contributions
Kids+0.056
Well+0.054
Don+0.050
Okay+0.048
Sorry+0.048
Neg logit contributions
 up-0.064
an-0.058
ish-0.056
 home-0.055
 soaring-0.054
Token chicken
Token ID9015
Feature activation+1.51e+00

Pos logit contributions
Kids+0.060
Well+0.059
Don+0.057
Okay+0.055
Sorry+0.054
Neg logit contributions
 up-0.042
an-0.041
ish-0.038
lers-0.035
ig-0.035
Token and
Token ID290
Feature activation+1.03e+01

Pos logit contributions
Kids+0.015
Well+0.014
Okay+0.014
Max+0.014
Ben+0.014
Neg logit contributions
 up-0.011
an-0.009
 soaring-0.008
ishing-0.008
ish-0.008
Token broccoli
Token ID44653
Feature activation-1.75e+00

Pos logit contributions
Kids+0.106
Well+0.103
Don+0.097
Sorry+0.094
Okay+0.093
Neg logit contributions
 up-0.093
an-0.085
ish-0.079
--0.073
ig-0.073
Token,
Token ID11
Feature activation+1.05e+01

Pos logit contributions
 up+0.013
an+0.012
ish+0.010
 home+0.010
 soaring+0.010
Neg logit contributions
Kids-0.019
Well-0.019
Don-0.017
Okay-0.017
Sorry-0.017
Token which
Token ID543
Feature activation+6.08e+00

Pos logit contributions
Kids+0.112
Well+0.112
Don+0.107
Okay+0.102
Sorry+0.100
Neg logit contributions
 up-0.083
an-0.079
ish-0.073
--0.067
ig-0.067
Token Lucy
Token ID22162
Feature activation+4.59e+00

Pos logit contributions
Kids+0.053
Well+0.052
Don+0.048
Okay+0.046
Sorry+0.046
Neg logit contributions
 up-0.063
an-0.058
ish-0.054
--0.053
 home-0.052
Token loved
Token ID6151
Feature activation+6.10e+00

Pos logit contributions
Kids+0.038
Well+0.036
Don+0.033
Okay+0.033
Sorry+0.032
Neg logit contributions
 up-0.050
an-0.043
 soaring-0.042
 home-0.041
 crown-0.041
Token.
Token ID13
Feature activation+1.53e+01

Pos logit contributions
Kids+0.064
Well+0.062
Don+0.060
Okay+0.057
That+0.055
Neg logit contributions
 up-0.049
an-0.048
ish-0.043
asis-0.040
 crown-0.038
Token She
Token ID1375
Feature activation+9.25e+00

Pos logit contributions
Kids+0.095
Sorry+0.081
Well+0.077
Okay+0.075
Listen+0.074
Neg logit contributions
 up-0.216
 home-0.181
 out-0.179
--0.178
an-0.170
Token was
Token ID373
Feature activation+8.23e+00

Pos logit contributions
Kids+0.080
Well+0.077
Okay+0.070
Don+0.070
Sorry+0.068
Neg logit contributions
 up-0.099
an-0.086
ish-0.082
 out-0.081
 home-0.081
Token so
Token ID523
Feature activation+7.28e+00

Pos logit contributions
Well+0.041
Kids+0.040
Don+0.039
That+0.038
Sorry+0.035
Neg logit contributions
an-0.040
ish-0.039
lers-0.037
 up-0.037
ishing-0.036
Token happy
Token ID3772
Feature activation+4.62e+00

Pos logit contributions
Kids+0.086
Well+0.082
Okay+0.077
Don+0.076
Sorry+0.076
Neg logit contributions
 up-0.058
an-0.047
--0.045
ish-0.045
 crown-0.043
Token and
Token ID290
Feature activation+7.39e+00

Pos logit contributions
Well+0.047
Kids+0.046
Don+0.043
Sorry+0.041
That+0.041
Neg logit contributions
an-0.037
 up-0.037
ish-0.035
 crown-0.032
lers-0.032
Token his
Token ID465
Feature activation+2.38e+00

Pos logit contributions
Kids+0.061
Well+0.059
Don+0.055
Sorry+0.053
Okay+0.053
Neg logit contributions
 up-0.078
an-0.074
ish-0.072
lers-0.068
--0.067
Token friends
Token ID2460
Feature activation+3.35e+00

Pos logit contributions
Kids+0.031
Well+0.030
Okay+0.029
Sorry+0.028
Don+0.028
Neg logit contributions
 up-0.014
an-0.011
ish-0.010
 out-0.010
--0.010
Token were
Token ID547
Feature activation+3.37e+00

Pos logit contributions
Kids+0.026
Well+0.025
Don+0.022
Okay+0.022
Sorry+0.022
Neg logit contributions
 up-0.039
an-0.034
ish-0.033
--0.033
 crown-0.032
Token excited
Token ID6568
Feature activation+2.90e+00

Pos logit contributions
Well+0.028
Kids+0.028
Don+0.027
That+0.027
Sarah+0.026
Neg logit contributions
an-0.032
ish-0.031
lers-0.030
 up-0.030
 territory-0.027
Token too
Token ID1165
Feature activation+3.97e+00

Pos logit contributions
Kids+0.029
Well+0.029
Don+0.027
Okay+0.027
Listen+0.026
Neg logit contributions
an-0.023
 up-0.022
ish-0.021
ig-0.020
asis-0.020
Token.
Token ID13
Feature activation+1.24e+01

Pos logit contributions
Kids+0.044
Well+0.043
Don+0.040
Sorry+0.040
Dad+0.040
Neg logit contributions
 up-0.029
an-0.026
ish-0.024
 crown-0.024
lers-0.022
Token They
Token ID1119
Feature activation+7.41e+00

Pos logit contributions
Kids+0.105
Sorry+0.094
Uh+0.092
Listen+0.088
Enough+0.086
Neg logit contributions
 up-0.150
 out-0.116
 home-0.114
ish-0.113
--0.110
Token all
Token ID477
Feature activation+2.01e+00

Pos logit contributions
Kids+0.069
Well+0.064
Okay+0.060
Sorry+0.059
Don+0.059
Neg logit contributions
 up-0.077
an-0.066
--0.064
ish-0.064
 crown-0.062
Token and
Token ID290
Feature activation+1.38e+01

Pos logit contributions
Kids+0.089
Well+0.082
Don+0.081
Okay+0.081
Sorry+0.077
Neg logit contributions
 up-0.058
an-0.056
ish-0.052
lers-0.049
 crown-0.048
Token lay
Token ID3830
Feature activation+8.13e+00

Pos logit contributions
Kids+0.129
Don+0.127
Well+0.120
Okay+0.117
Sorry+0.116
Neg logit contributions
an-0.131
ish-0.118
ig-0.117
lers-0.114
--0.111
Token down
Token ID866
Feature activation+8.49e+00

Pos logit contributions
Don+0.096
Kids+0.094
Well+0.092
Okay+0.089
Sorry+0.089
Neg logit contributions
an-0.046
ish-0.039
 crown-0.038
asis-0.038
lers-0.038
Token in
Token ID287
Feature activation+9.80e+00

Pos logit contributions
Well+0.102
Kids+0.101
Don+0.099
Sorry+0.095
Okay+0.095
Neg logit contributions
an-0.052
ish-0.047
 up-0.044
lers-0.041
 crown-0.039
Token the
Token ID262
Feature activation+8.11e+00

Pos logit contributions
Kids+0.109
Well+0.103
Don+0.099
Okay+0.095
Dad+0.095
Neg logit contributions
 up-0.078
an-0.073
ish-0.070
lers-0.065
--0.064
Token grass
Token ID8701
Feature activation+2.65e+00

Pos logit contributions
Kids+0.098
Well+0.093
Okay+0.088
Don+0.088
Sorry+0.088
Neg logit contributions
 up-0.060
an-0.049
ish-0.048
--0.047
lers-0.045
Token.
Token ID13
Feature activation+1.94e+01

Pos logit contributions
Kids+0.026
Well+0.026
Don+0.025
Okay+0.025
 She+0.025
Neg logit contributions
 up-0.019
an-0.019
ish-0.017
lers-0.017
 crown-0.017
Token She
Token ID1375
Feature activation+1.38e+01

Pos logit contributions
Kids+0.111
Well+0.094
Sorry+0.088
Okay+0.084
Don+0.082
Neg logit contributions
 up-0.279
an-0.243
ish-0.240
 out-0.240
--0.240
Token spotted
Token ID13489
Feature activation+1.31e+01

Pos logit contributions
Kids+0.114
Well+0.107
Don+0.099
Okay+0.097
Sorry+0.092
Neg logit contributions
 up-0.150
an-0.136
ish-0.130
 soaring-0.127
 out-0.125
Token a
Token ID257
Feature activation+9.17e+00

Pos logit contributions
Don+0.117
Well+0.114
Kids+0.111
Listen+0.106
That+0.103
Neg logit contributions
an-0.127
asis-0.118
ish-0.118
ig-0.112
lers-0.111
Token small
Token ID1402
Feature activation+4.41e+00

Pos logit contributions
Kids+0.097
Well+0.091
Okay+0.090
Don+0.089
Uh+0.089
Neg logit contributions
 up-0.074
an-0.065
lers-0.061
ish-0.061
ishing-0.058
Token a
Token ID257
Feature activation+4.24e+00

Pos logit contributions
Kids+0.141
Well+0.134
Okay+0.127
Sorry+0.126
Don+0.125
Neg logit contributions
 up-0.076
an-0.057
--0.055
 out-0.054
ish-0.053
Token youth
Token ID6205
Feature activation+1.11e+00

Pos logit contributions
Kids+0.057
Well+0.055
Don+0.052
Sorry+0.051
Enough+0.051
Neg logit contributions
 up-0.025
--0.021
an-0.019
 crown-0.018
ish-0.018
Token was
Token ID373
Feature activation+4.65e+00

Pos logit contributions
Kids+0.010
Well+0.009
Sorry+0.009
Don+0.009
Okay+0.009
Neg logit contributions
 up-0.012
an-0.010
ish-0.009
 crown-0.009
 soaring-0.009
Token walking
Token ID6155
Feature activation+3.23e+00

Pos logit contributions
Kids+0.044
Well+0.042
Don+0.041
Em+0.039
Sorry+0.039
Neg logit contributions
an-0.039
ish-0.036
asis-0.034
 up-0.034
le-0.033
Token down
Token ID866
Feature activation+2.37e+00

Pos logit contributions
Kids+0.039
Well+0.039
Don+0.038
Okay+0.038
Sorry+0.037
Neg logit contributions
an-0.017
 up-0.015
asis-0.015
 crown-0.015
lers-0.015
Token the
Token ID262
Feature activation+2.58e+00

Pos logit contributions
Kids+0.031
Well+0.029
Okay+0.028
Sorry+0.028
Don+0.028
Neg logit contributions
 up-0.016
an-0.012
--0.012
ish-0.011
 out-0.011
Token street
Token ID4675
Feature activation-7.14e-01

Pos logit contributions
Kids+0.022
Well+0.020
Okay+0.018
Don+0.018
Sorry+0.018
Neg logit contributions
 up-0.028
ish-0.025
an-0.025
lers-0.024
 crown-0.024
Token when
Token ID618
Feature activation+5.64e+00

Pos logit contributions
an+0.005
 up+0.005
ish+0.005
 crown+0.005
lers+0.005
Neg logit contributions
Well-0.008
Kids-0.007
Don-0.007
Okay-0.007
Sorry-0.007
Token he
Token ID339
Feature activation+4.83e+00

Pos logit contributions
Kids+0.062
Well+0.058
Okay+0.055
Sorry+0.053
Don+0.053
Neg logit contributions
 up-0.051
 out-0.039
--0.039
 home-0.039
 soaring-0.039
Token noticed
Token ID6810
Feature activation+6.39e+00

Pos logit contributions
Kids+0.049
Well+0.046
Okay+0.043
Don+0.042
Sorry+0.042
Neg logit contributions
 up-0.047
an-0.039
--0.038
ish-0.038
 out-0.037
Token something
Token ID1223
Feature activation+2.08e+00

Pos logit contributions
Kids+0.065
Well+0.063
Don+0.062
Listen+0.060
Sorry+0.059
Neg logit contributions
an-0.053
 up-0.051
ish-0.050
asis-0.045
ig-0.045
Token fell
Token ID3214
Feature activation+9.34e+00

Pos logit contributions
Kids+0.121
Well+0.113
Okay+0.105
Don+0.104
Sorry+0.104
Neg logit contributions
 up-0.144
an-0.124
--0.120
ish-0.120
 crown-0.117
Token down
Token ID866
Feature activation+5.70e+00

Pos logit contributions
Kids+0.122
Well+0.119
Don+0.115
Sorry+0.112
Okay+0.110
Neg logit contributions
 up-0.049
an-0.048
ish-0.043
asis-0.037
 crown-0.037
Token with
Token ID351
Feature activation+8.66e+00

Pos logit contributions
Kids+0.078
Well+0.075
Don+0.071
Okay+0.071
Sorry+0.071
Neg logit contributions
 up-0.033
an-0.026
ish-0.024
--0.023
 crown-0.022
Token a
Token ID257
Feature activation+6.34e+00

Pos logit contributions
Kids+0.095
Well+0.091
Don+0.088
Sorry+0.086
Okay+0.085
Neg logit contributions
 up-0.071
an-0.064
ish-0.060
--0.058
lers-0.055
Token loud
Token ID7812
Feature activation+1.27e+00

Pos logit contributions
Kids+0.068
Well+0.065
Don+0.063
Sorry+0.062
Okay+0.062
Neg logit contributions
 up-0.055
an-0.046
ish-0.043
--0.042
 home-0.041
Token scream
Token ID8196
Feature activation+4.92e+00

Pos logit contributions
Kids+0.012
Well+0.011
Don+0.011
Okay+0.010
Sorry+0.010
Neg logit contributions
 up-0.012
an-0.010
ish-0.010
lers-0.010
 home-0.010
Token.
Token ID13
Feature activation+1.68e+01

Pos logit contributions
Kids+0.059
Well+0.054
Don+0.053
Sorry+0.052
Okay+0.052
Neg logit contributions
 up-0.033
an-0.029
ish-0.028
lers-0.026
 soaring-0.025
Token
Token ID198
Feature activation+1.67e+01

Pos logit contributions
Kids+0.177
Okay+0.166
Uh+0.160
oral+0.160
Sorry+0.156
Neg logit contributions
 up-0.141
 out-0.120
 home-0.103
 inside-0.099
 in-0.097
Token
Token ID198
Feature activation+1.71e+01

Pos logit contributions
Kids+0.166
Okay+0.162
Well+0.156
Sorry+0.154
Uh+0.154
Neg logit contributions
 up-0.164
 out-0.131
 home-0.131
--0.131
an-0.118
TokenTom
Token ID13787
Feature activation+1.34e+01

Pos logit contributions
Kids+0.116
Well+0.107
Um+0.093
Uh+0.091
Okay+0.090
Neg logit contributions
 up-0.200
--0.185
 out-0.167
 inside-0.164
 home-0.163
Token heard
Token ID2982
Feature activation+1.21e+01

Pos logit contributions
Kids+0.124
Well+0.116
Okay+0.108
Don+0.107
Sorry+0.107
Neg logit contributions
 up-0.140
an-0.120
--0.116
ish-0.115
 crown-0.112
Token sad
Token ID6507
Feature activation-1.58e+01

Pos logit contributions
an+0.133
lers+0.124
ish+0.124
asis+0.116
ig+0.115
Neg logit contributions
Kids-0.112
Well-0.105
Don-0.103
Em-0.096
 She-0.094
Token and
Token ID290
Feature activation-7.93e+00

Pos logit contributions
 up+0.106
an+0.089
ish+0.081
 crown+0.077
 soaring+0.077
Neg logit contributions
Kids-0.201
Well-0.192
Don-0.181
Okay-0.181
Sorry-0.180
Token Lily
Token ID20037
Feature activation-1.25e+01

Pos logit contributions
 up+0.091
an+0.076
-+0.075
ish+0.074
 out+0.073
Neg logit contributions
Kids-0.067
Well-0.062
Okay-0.058
Sorry-0.056
Don-0.056
Token told
Token ID1297
Feature activation-1.00e+01

Pos logit contributions
 up+0.143
an+0.124
-+0.121
ish+0.120
 crown+0.118
Neg logit contributions
Kids-0.103
Well-0.095
Okay-0.087
Sorry-0.087
Don-0.087
Token her
Token ID607
Feature activation-1.39e+01

Pos logit contributions
 up+0.101
an+0.095
ish+0.091
lers+0.086
 soaring+0.085
Neg logit contributions
Kids-0.092
Well-0.083
Don-0.079
Okay-0.076
Listen-0.075
Token about
Token ID546
Feature activation-1.08e+01

Pos logit contributions
 up+0.122
an+0.111
ish+0.104
-+0.098
lers+0.098
Neg logit contributions
Kids-0.145
Well-0.138
Don-0.128
Okay-0.126
Sorry-0.124
Token the
Token ID262
Feature activation-1.30e+01

Pos logit contributions
 up+0.095
an+0.093
ish+0.087
lers+0.081
-+0.080
Neg logit contributions
Kids-0.109
Well-0.103
Don-0.099
Listen-0.094
Okay-0.092
Token cane
Token ID33009
Feature activation-2.17e+01

Pos logit contributions
 up+0.106
an+0.094
ish+0.092
lers+0.090
-+0.088
Neg logit contributions
Kids-0.137
Well-0.131
Okay-0.128
Don-0.126
Listen-0.120
Token.
Token ID13
Feature activation+2.53e-01

Pos logit contributions
 up+0.144
an+0.129
ish+0.114
-+0.107
 crown+0.105
Neg logit contributions
Kids-0.255
Well-0.251
Don-0.238
Okay-0.236
Sorry-0.230
Token Her
Token ID2332
Feature activation-7.24e+00

Pos logit contributions
Kids+0.002
Uh+0.002
Sorry+0.002
Okay+0.002
Enough+0.002
Neg logit contributions
 up-0.003
 home-0.002
 out-0.002
--0.002
c-0.002
Token mom
Token ID1995
Feature activation-8.78e+00

Pos logit contributions
 up+0.079
an+0.069
ish+0.066
-+0.066
 out+0.064
Neg logit contributions
Kids-0.064
Well-0.060
Okay-0.055
Don-0.054
Sorry-0.054
Token use
Token ID779
Feature activation-1.97e+01

Pos logit contributions
 up+0.167
an+0.150
lers+0.141
ish+0.140
-+0.138
Neg logit contributions
Kids-0.149
Well-0.145
Um-0.143
Sorry-0.138
Okay-0.137
Token it
Token ID340
Feature activation-2.12e+01

Pos logit contributions
an+0.180
 up+0.160
asis+0.159
ish+0.157
ling+0.151
Neg logit contributions
Kids-0.173
Well-0.166
Don-0.153
Listen-0.147
Sarah-0.145
Token again
Token ID757
Feature activation-1.88e+01

Pos logit contributions
an+0.154
 up+0.141
ish+0.132
lers+0.127
asis+0.123
Neg logit contributions
Kids-0.220
Well-0.219
Don-0.210
Okay-0.204
Sorry-0.200
Token.
Token ID13
Feature activation-3.62e+00

Pos logit contributions
an+0.145
 up+0.144
ish+0.127
 crown+0.121
lers+0.119
Neg logit contributions
Kids-0.189
Well-0.186
Don-0.177
Dad-0.173
Sorry-0.172
Token They
Token ID1119
Feature activation-9.36e+00

Pos logit contributions
 up+0.042
 home+0.037
 out+0.035
-+0.033
an+0.032
Neg logit contributions
Sorry-0.026
Kids-0.026
Okay-0.024
Listen-0.024
Enough-0.024
Token buried
Token ID11694
Feature activation-1.89e+01

Pos logit contributions
 up+0.097
an+0.081
-+0.080
 out+0.079
ish+0.079
Neg logit contributions
Kids-0.088
Well-0.081
Okay-0.076
Sorry-0.076
Enough-0.075
Token it
Token ID340
Feature activation-1.70e+01

Pos logit contributions
an+0.151
 up+0.142
asis+0.139
ish+0.136
 soaring+0.131
Neg logit contributions
Kids-0.185
Well-0.178
Don-0.168
Wow-0.162
That-0.158
Token with
Token ID351
Feature activation-1.29e+01

Pos logit contributions
an+0.095
ish+0.079
asis+0.077
 up+0.077
ans+0.077
Neg logit contributions
Well-0.191
Don-0.186
Kids-0.182
That-0.176
Okay-0.176
Token Lily
Token ID20037
Feature activation-1.40e+01

Pos logit contributions
 up+0.102
an+0.098
ish+0.089
asis+0.084
-+0.083
Neg logit contributions
Kids-0.137
Well-0.132
Don-0.127
Okay-0.121
Sorry-0.120
Token and
Token ID290
Feature activation-7.69e+00

Pos logit contributions
 up+0.097
an+0.092
asis+0.083
 crown+0.079
ish+0.075
Neg logit contributions
Kids-0.145
Well-0.143
Don-0.142
Max-0.138
Em-0.135
Token hoped
Token ID10719
Feature activation-1.04e+01

Pos logit contributions
 up+0.082
an+0.073
ish+0.069
-+0.068
lers+0.066
Neg logit contributions
Kids-0.069
Well-0.064
Don-0.060
Okay-0.059
Sorry-0.059
Token mom
Token ID1995
Feature activation-8.70e+00

Pos logit contributions
 up+0.072
an+0.065
ish+0.062
lers+0.061
-+0.060
Neg logit contributions
Kids-0.066
Well-0.061
Okay-0.059
Don-0.056
Sorry-0.056
Token came
Token ID1625
Feature activation-9.24e+00

Pos logit contributions
 up+0.099
an+0.088
ish+0.084
 soaring+0.083
 crown+0.083
Neg logit contributions
Kids-0.071
Well-0.065
Don-0.060
Okay-0.059
Sorry-0.058
Token over
Token ID625
Feature activation-8.43e+00

Pos logit contributions
asis+0.068
an+0.067
ish+0.062
 crown+0.061
lers+0.061
Neg logit contributions
Kids-0.101
Don-0.090
Dad-0.087
Well-0.086
Daddy-0.086
Token and
Token ID290
Feature activation-3.51e+00

Pos logit contributions
 up+0.054
an+0.040
-+0.039
 home+0.037
ish+0.037
Neg logit contributions
Kids-0.113
Well-0.108
Okay-0.103
Sorry-0.102
Don-0.101
Token thanked
Token ID26280
Feature activation-6.30e+00

Pos logit contributions
 up+0.041
an+0.036
ish+0.035
-+0.034
 crown+0.034
Neg logit contributions
Kids-0.028
Well-0.026
Don-0.024
Okay-0.024
Sorry-0.024
Token Tim
Token ID5045
Feature activation-1.03e+01

Pos logit contributions
 up+0.051
an+0.040
-+0.040
ish+0.038
 out+0.038
Neg logit contributions
Kids-0.073
Well-0.070
Okay-0.066
Sorry-0.066
Don-0.065
Tokenmy
Token ID1820
Feature activation-1.12e+01

Pos logit contributions
 up+0.107
 soaring+0.101
an+0.099
 crown+0.096
lers+0.095
Neg logit contributions
Kids-0.083
Well-0.076
Okay-0.072
Don-0.069
Brother-0.066
Token for
Token ID329
Feature activation-4.80e+00

Pos logit contributions
an+0.066
 up+0.066
 soaring+0.062
asis+0.061
 crown+0.058
Neg logit contributions
Kids-0.128
Well-0.124
Okay-0.119
Don-0.116
Together-0.115
Token being
Token ID852
Feature activation-9.46e+00

Pos logit contributions
 up+0.042
an+0.039
ish+0.038
lers+0.035
-+0.034
Neg logit contributions
Kids-0.051
Well-0.048
Don-0.046
Sorry-0.044
Okay-0.044
Token so
Token ID523
Feature activation-4.55e+00

Pos logit contributions
 up+0.070
an+0.065
ish+0.060
lers+0.057
asis+0.056
Neg logit contributions
Kids-0.110
Well-0.104
Don-0.100
Sorry-0.098
Okay-0.097
Token kind
Token ID1611
Feature activation-1.02e+01

Pos logit contributions
 up+0.039
an+0.032
-+0.031
ish+0.030
 crown+0.030
Neg logit contributions
Kids-0.051
Well-0.049
Okay-0.046
Don-0.045
Sorry-0.045
Token.
Token ID13
Feature activation-1.76e+01

Pos logit contributions
Kids+0.101
Well+0.093
Sorry+0.087
Don+0.086
Okay+0.084
Neg logit contributions
 up-0.092
an-0.088
ish-0.085
ig-0.079
lers-0.076
Token
Token ID198
Feature activation-1.37e+01

Pos logit contributions
 up+0.218
 out+0.183
 home+0.181
-+0.179
an+0.178
Neg logit contributions
Kids-0.122
Well-0.108
Okay-0.105
Sorry-0.103
Enough-0.103
Token
Token ID198
Feature activation-1.14e+01

Pos logit contributions
 up+0.129
 home+0.108
-+0.107
an+0.106
 out+0.100
Neg logit contributions
Kids-0.126
Okay-0.121
Sorry-0.121
Well-0.120
Enough-0.114
Token"
Token ID1
Feature activation-1.24e+01

Pos logit contributions
 up+0.115
an+0.105
-+0.104
 home+0.099
 soaring+0.096
Neg logit contributions
Kids-0.079
Well-0.075
Enough-0.071
Sorry-0.067
Okay-0.067
TokenSure
Token ID19457
Feature activation-1.79e+01

Pos logit contributions
 up+0.140
an+0.124
 pe+0.122
 feathers+0.120
 crown+0.119
Neg logit contributions
Kids-0.057
oral-0.046
Enough-0.046
enny-0.044
 Did-0.043
Token,
Token ID11
Feature activation-9.07e+00

Pos logit contributions
 up+0.137
an+0.122
ish+0.116
lers+0.111
ishing+0.109
Neg logit contributions
Kids-0.195
Well-0.187
Okay-0.179
Sorry-0.178
Um-0.173
Token you
Token ID345
Feature activation-1.53e+01

Pos logit contributions
 up+0.102
an+0.092
lers+0.090
ish+0.090
-+0.087
Neg logit contributions
Kids-0.074
Well-0.068
Don-0.062
Okay-0.062
Sorry-0.060
Token can
Token ID460
Feature activation-1.33e+01

Pos logit contributions
 up+0.171
an+0.149
-+0.143
ish+0.141
lers+0.140
Neg logit contributions
Kids-0.128
Well-0.121
Okay-0.112
Sorry-0.110
Don-0.109
Token.
Token ID13
Feature activation-7.79e+00

Pos logit contributions
 up+0.126
an+0.118
lers+0.112
ish+0.107
 soaring+0.107
Neg logit contributions
Kids-0.113
Well-0.109
Okay-0.108
Sorry-0.105
Dad-0.100
Token The
Token ID383
Feature activation-1.50e+01

Pos logit contributions
 up+0.088
ish+0.075
an+0.075
 out+0.075
 home+0.073
Neg logit contributions
Kids-0.062
Well-0.056
Sorry-0.053
Enough-0.052
Don-0.052
Token more
Token ID517
Feature activation-1.80e+01

Pos logit contributions
 up+0.120
an+0.100
-+0.095
ish+0.094
lers+0.093
Neg logit contributions
Kids-0.168
Well-0.160
Okay-0.155
Don-0.151
Sorry-0.151
Token and
Token ID290
Feature activation-5.12e+00

Pos logit contributions
an+0.141
ish+0.126
asis+0.122
 up+0.122
ans+0.121
Neg logit contributions
Well-0.178
Don-0.166
Kids-0.165
That-0.164
Okay-0.164
Token smiled
Token ID13541
Feature activation-1.37e+01

Pos logit contributions
 up+0.060
an+0.057
ish+0.054
lers+0.052
asis+0.051
Neg logit contributions
Kids-0.038
Well-0.037
Don-0.034
Sorry-0.032
Okay-0.032
Token -
Token ID532
Feature activation-5.55e+00

Pos logit contributions
an+0.089
 up+0.087
lers+0.083
asis+0.081
ish+0.078
Neg logit contributions
Well-0.148
Kids-0.145
Don-0.144
Sorry-0.141
Okay-0.141
Token he
Token ID339
Feature activation-8.85e+00

Pos logit contributions
 up+0.054
an+0.053
ish+0.049
lers+0.048
-+0.047
Neg logit contributions
Well-0.049
Kids-0.049
Don-0.046
Sorry-0.044
Okay-0.044
Token felt
Token ID2936
Feature activation-1.14e+01

Pos logit contributions
 up+0.101
an+0.088
ish+0.085
-+0.084
 crown+0.083
Neg logit contributions
Kids-0.073
Well-0.069
Don-0.063
Okay-0.063
Sorry-0.062
Token like
Token ID588
Feature activation-9.59e+00

Pos logit contributions
an+0.098
 up+0.093
ish+0.092
lers+0.090
ig+0.089
Neg logit contributions
Don-0.109
Well-0.106
Kids-0.105
Okay-0.098
Sorry-0.098
Token a
Token ID257
Feature activation-1.14e+01

Pos logit contributions
 up+0.089
an+0.084
ish+0.080
lers+0.074
-+0.071
Neg logit contributions
Kids-0.096
Well-0.094
Don-0.089
Okay-0.084
Sorry-0.084
Token real
Token ID1103
Feature activation-1.47e+01

Pos logit contributions
 up+0.094
an+0.079
ish+0.073
lers+0.073
ans+0.068
Neg logit contributions
Kids-0.127
Well-0.121
Okay-0.118
Sorry-0.115
Don-0.115
Token doctor
Token ID6253
Feature activation-1.49e+01

Pos logit contributions
 up+0.131
an+0.110
ish+0.102
lers+0.101
 soaring+0.099
Neg logit contributions
Kids-0.156
Well-0.149
Okay-0.143
Sorry-0.140
Don-0.137
Token!
Token ID0
Feature activation+1.46e+00

Pos logit contributions
 up+0.112
an+0.102
ish+0.090
igs+0.089
ans+0.088
Neg logit contributions
Kids-0.158
Well-0.153
Don-0.147
Okay-0.145
Sorry-0.138
Token
Token ID198
Feature activation+8.94e-02

Pos logit contributions
Kids+0.014
Sorry+0.013
Okay+0.013
Enough+0.012
Uh+0.012
Neg logit contributions
 up-0.014
 home-0.012
 out-0.012
--0.011
an-0.011
Token He
Token ID679
Feature activation-1.48e+01

Pos logit contributions
 up+0.088
 out+0.074
 home+0.073
!"+0.069
 soaring+0.069
Neg logit contributions
Kids-0.056
Okay-0.052
Sorry-0.052
Enough-0.052
Uh-0.051
Token will
Token ID481
Feature activation-1.39e+01

Pos logit contributions
 up+0.156
an+0.136
ish+0.131
-+0.130
 home+0.128
Neg logit contributions
Kids-0.132
Well-0.124
Don-0.113
Okay-0.113
Sorry-0.112
Token always
Token ID1464
Feature activation-1.85e+01

Pos logit contributions
 up+0.136
an+0.119
ish+0.117
-+0.114
lers+0.113
Neg logit contributions
Kids-0.134
Well-0.127
Okay-0.119
Don-0.118
Sorry-0.117
Token have
Token ID423
Feature activation-1.71e+01

Pos logit contributions
 up+0.197
an+0.189
ish+0.177
lers+0.176
-+0.169
Neg logit contributions
Kids-0.140
Okay-0.133
Well-0.131
Don-0.127
Sorry-0.126
Token good
Token ID922
Feature activation-1.91e+01

Pos logit contributions
 up+0.140
an+0.135
ish+0.122
asis+0.112
-+0.112
Neg logit contributions
Kids-0.183
Well-0.174
Don-0.166
Enough-0.162
Okay-0.162
Token memories
Token ID9846
Feature activation-2.42e+01

Pos logit contributions
 up+0.156
an+0.150
ish+0.127
 soaring+0.127
asis+0.126
Neg logit contributions
Kids-0.202
Well-0.188
Okay-0.181
Enough-0.179
Sorry-0.179
Token of
Token ID286
Feature activation-1.96e+01

Pos logit contributions
an+0.179
 up+0.174
ish+0.151
lers+0.145
ans+0.144
Neg logit contributions
Kids-0.255
Well-0.253
Okay-0.239
Don-0.230
Sorry-0.229
Token the
Token ID262
Feature activation-2.08e+01

Pos logit contributions
an+0.170
 up+0.167
ish+0.161
lers+0.150
ig+0.145
Neg logit contributions
Kids-0.191
Well-0.174
Dad-0.162
Don-0.161
Okay-0.160
Token unknown
Token ID6439
Feature activation-2.87e+01

Pos logit contributions
 up+0.166
an+0.145
ish+0.135
lers+0.134
-+0.132
Neg logit contributions
Kids-0.224
Well-0.213
Okay-0.206
Sorry-0.198
Enough-0.197
Token island
Token ID7022
Feature activation-3.20e+01

Pos logit contributions
 up+0.208
an+0.186
ish+0.174
 soaring+0.171
lers+0.170
Neg logit contributions
Kids-0.309
Well-0.295
Okay-0.281
Sorry-0.270
Enough-0.263
Token,
Token ID11
Feature activation-9.53e+00

Pos logit contributions
 up+0.226
an+0.222
ish+0.206
ig+0.199
 soaring+0.198
Neg logit contributions
Well-0.294
Kids-0.293
Okay-0.280
Don-0.275
 She-0.264
Token was
Token ID373
Feature activation-3.31e+01

Pos logit contributions
Kids+0.152
Well+0.147
Sorry+0.137
Okay+0.136
Don+0.131
Neg logit contributions
 up-0.209
--0.173
 out-0.173
 crown-0.171
 home-0.167
Token happy
Token ID3772
Feature activation-2.94e+01

Pos logit contributions
an+0.292
ish+0.271
asis+0.252
ig+0.251
lers+0.247
Neg logit contributions
Don-0.297
Well-0.296
Kids-0.292
Sorry-0.271
Okay-0.267
Token and
Token ID290
Feature activation-1.97e+01

Pos logit contributions
 up+0.219
an+0.213
ish+0.196
lers+0.181
asis+0.175
Neg logit contributions
Kids-0.320
Well-0.314
Okay-0.290
Don-0.290
Sorry-0.282
Token typed
Token ID25683
Feature activation-2.60e+01

Pos logit contributions
 up+0.211
an+0.204
ish+0.196
lers+0.188
ig+0.186
Neg logit contributions
Kids-0.157
Well-0.150
Don-0.142
Okay-0.140
Sorry-0.138
Token a
Token ID257
Feature activation-2.52e+01

Pos logit contributions
an+0.162
 up+0.159
asis+0.146
ish+0.143
 soaring+0.135
Neg logit contributions
Kids-0.307
Well-0.295
Don-0.282
Okay-0.281
Sorry-0.275
Token reply
Token ID10971
Feature activation-2.94e+01

Pos logit contributions
 up+0.213
an+0.187
ish+0.180
lers+0.175
ishing+0.164
Neg logit contributions
Kids-0.260
Well-0.247
Okay-0.241
Don-0.236
Sorry-0.236
Token.
Token ID13
Feature activation-8.33e+00

Pos logit contributions
 up+0.204
an+0.177
ish+0.170
lers+0.166
ans+0.150
Neg logit contributions
Kids-0.340
Well-0.327
Don-0.309
Okay-0.308
Sorry-0.306
Token
Token ID198
Feature activation-8.49e+00

Pos logit contributions
 up+0.089
 out+0.074
 home+0.071
 soaring+0.069
-+0.068
Neg logit contributions
Kids-0.068
Okay-0.066
Sorry-0.065
Enough-0.062
Well-0.062
Token
Token ID198
Feature activation-7.83e+00

Pos logit contributions
 up+0.083
 home+0.068
-+0.068
an+0.067
 out+0.064
Neg logit contributions
Kids-0.076
Okay-0.074
Sorry-0.074
Well-0.073
Uh-0.071
TokenBut
Token ID1537
Feature activation-9.01e+00

Pos logit contributions
 up+0.069
-+0.063
an+0.060
 home+0.058
 soaring+0.055
Neg logit contributions
Kids-0.064
Well-0.062
Enough-0.058
Um-0.058
Uh-0.057
Token while
Token ID981
Feature activation-1.15e+01

Pos logit contributions
 up+0.089
an+0.082
ish+0.076
lers+0.075
-+0.075
Neg logit contributions
Kids-0.084
Well-0.080
Don-0.075
Sorry-0.074
Okay-0.073
Token it
Token ID340
Feature activation-2.81e+01

Pos logit contributions
an+0.219
 up+0.217
ish+0.203
 crown+0.195
lers+0.181
Neg logit contributions
Kids-0.364
Well-0.351
Okay-0.341
Em-0.328
Don-0.324
Token is
Token ID318
Feature activation-2.55e+01

Pos logit contributions
 up+0.255
an+0.234
ish+0.227
 crown+0.221
lers+0.213
Neg logit contributions
Kids-0.257
Well-0.253
Don-0.238
Okay-0.233
Sorry-0.225
Token best
Token ID1266
Feature activation-2.61e+01

Pos logit contributions
an+0.215
ish+0.204
lers+0.194
 up+0.192
ig+0.187
Neg logit contributions
Don-0.227
Kids-0.223
Well-0.220
Okay-0.210
Listen-0.209
Token not
Token ID407
Feature activation-2.63e+01

Pos logit contributions
 up+0.209
an+0.196
ish+0.189
ling+0.186
lers+0.177
Neg logit contributions
Kids-0.245
Well-0.233
Okay-0.232
Dad-0.211
Sorry-0.210
Token to
Token ID284
Feature activation-1.97e+01

Pos logit contributions
an+0.205
 up+0.187
ish+0.179
asis+0.175
ig+0.172
Neg logit contributions
Well-0.238
Kids-0.236
Don-0.233
That-0.215
Listen-0.215
Token be
Token ID307
Feature activation-2.50e+01

Pos logit contributions
 up+0.202
an+0.183
lers+0.179
ish+0.176
-+0.167
Neg logit contributions
Kids-0.169
Well-0.151
Okay-0.150
Dad-0.146
Don-0.143
Token weak
Token ID4939
Feature activation-2.94e+01

Pos logit contributions
an+0.196
 up+0.190
ish+0.183
ig+0.174
asis+0.169
Neg logit contributions
Don-0.223
Kids-0.220
Well-0.205
Sarah-0.203
Sorry-0.201
Token.
Token ID13
Feature activation-8.55e+00

Pos logit contributions
an+0.227
 crown+0.217
 up+0.212
asis+0.205
 soaring+0.197
Neg logit contributions
Kids-0.284
Well-0.272
Don-0.269
Okay-0.262
Sorry-0.259
Token You
Token ID921
Feature activation-1.46e+01

Pos logit contributions
 up+0.095
an+0.080
-+0.079
 out+0.079
 home+0.078
Neg logit contributions
Kids-0.065
Sorry-0.059
Enough-0.058
Well-0.058
Okay-0.056
Token need
Token ID761
Feature activation-1.73e+01

Pos logit contributions
 up+0.168
an+0.149
-+0.143
 crown+0.140
 soaring+0.139
Neg logit contributions
Well-0.109
Kids-0.108
Don-0.098
Okay-0.097
Sorry-0.093
Token to
Token ID284
Feature activation-1.52e+01

Pos logit contributions
 up+0.139
an+0.132
ish+0.122
ling+0.112
lers+0.112
Neg logit contributions
Kids-0.189
Well-0.179
Listen-0.172
Don-0.172
Okay-0.169
Token the
Token ID262
Feature activation-2.65e+01

Pos logit contributions
 up+0.194
an+0.180
ish+0.174
lers+0.159
-+0.154
Neg logit contributions
Kids-0.291
Well-0.279
Don-0.266
Okay-0.259
Sorry-0.254
Token sky
Token ID6766
Feature activation-3.21e+01

Pos logit contributions
 up+0.243
an+0.216
ish+0.215
-+0.202
lers+0.198
Neg logit contributions
Kids-0.258
Well-0.246
Okay-0.231
Don-0.227
Sorry-0.227
Token when
Token ID618
Feature activation-1.92e+01

Pos logit contributions
an+0.250
ish+0.236
 up+0.233
lers+0.221
ans+0.208
Neg logit contributions
Well-0.302
 She-0.292
Kids-0.289
Don-0.286
Okay-0.276
Token she
Token ID673
Feature activation-2.00e+01

Pos logit contributions
 up+0.218
an+0.188
-+0.183
ish+0.181
 out+0.179
Neg logit contributions
Kids-0.159
Well-0.147
Okay-0.136
Sorry-0.134
Don-0.134
Token was
Token ID373
Feature activation-2.13e+01

Pos logit contributions
 up+0.219
an+0.191
ish+0.184
-+0.178
lers+0.177
Neg logit contributions
Kids-0.172
Well-0.164
Don-0.151
Sorry-0.150
Okay-0.148
Token only
Token ID691
Feature activation-2.44e+01

Pos logit contributions
an+0.190
ish+0.188
lers+0.172
ig+0.159
 up+0.154
Neg logit contributions
Well-0.178
Kids-0.178
Don-0.175
Grand-0.172
Em-0.163
Token three
Token ID1115
Feature activation-2.42e+01

Pos logit contributions
an+0.225
 up+0.202
ish+0.198
asis+0.191
lers+0.184
Neg logit contributions
Well-0.225
Don-0.221
Kids-0.217
Um-0.204
Enough-0.202
Token.
Token ID13
Feature activation-6.53e+00

Pos logit contributions
 up+0.260
an+0.227
ish+0.225
 out+0.212
 home+0.210
Neg logit contributions
Kids-0.200
Well-0.193
Don-0.170
Okay-0.167
Sorry-0.164
Token She
Token ID1375
Feature activation-1.40e+01

Pos logit contributions
 up+0.077
 home+0.066
 out+0.065
-+0.061
 soaring+0.061
Neg logit contributions
Kids-0.048
Sorry-0.045
Okay-0.043
Well-0.041
Enough-0.041
Token remembered
Token ID12086
Feature activation-1.58e+01

Pos logit contributions
 up+0.154
an+0.135
ish+0.131
-+0.127
 home+0.125
Neg logit contributions
Kids-0.120
Well-0.114
Don-0.104
Okay-0.104
Sorry-0.102
Token how
Token ID703
Feature activation-1.37e+01

Pos logit contributions
an+0.137
 up+0.134
ish+0.127
lers+0.117
asis+0.116
Neg logit contributions
Kids-0.157
Well-0.152
Don-0.147
Listen-0.138
Enough-0.136
Token,
Token ID11
Feature activation-1.40e+01

Pos logit contributions
 up+0.205
an+0.202
ish+0.182
ishing+0.160
 crown+0.159
Neg logit contributions
Kids-0.343
Well-0.319
Sorry-0.299
Don-0.294
Em-0.293
Token Fred
Token ID8559
Feature activation-2.67e+01

Pos logit contributions
 up+0.152
an+0.142
ish+0.135
lers+0.131
-+0.131
Neg logit contributions
Kids-0.115
Well-0.108
Don-0.102
Okay-0.097
Sorry-0.096
Token saw
Token ID2497
Feature activation-2.03e+01

Pos logit contributions
 up+0.262
an+0.233
ish+0.222
 soaring+0.219
 crown+0.216
Neg logit contributions
Kids-0.240
Well-0.224
Don-0.214
Okay-0.206
Ben-0.197
Token an
Token ID281
Feature activation-2.47e+01

Pos logit contributions
an+0.184
ish+0.180
ig+0.175
 up+0.170
asis+0.165
Neg logit contributions
Don-0.180
Kids-0.178
Well-0.178
Enough-0.165
Listen-0.164
Token attractive
Token ID10966
Feature activation-3.01e+01

Pos logit contributions
 up+0.192
lers+0.172
 soaring+0.160
an+0.159
ig+0.157
Neg logit contributions
Kids-0.231
Well-0.224
Dad-0.220
Okay-0.217
Don-0.210
Token red
Token ID2266
Feature activation-3.50e+01

Pos logit contributions
 up+0.230
an+0.206
lers+0.203
ans+0.192
ishing+0.189
Neg logit contributions
Kids-0.294
Well-0.275
Okay-0.274
Don-0.260
Uh-0.259
Token blanket
Token ID18447
Feature activation-3.68e+01

Pos logit contributions
 up+0.270
an+0.228
lers+0.214
ishing+0.209
 soaring+0.204
Neg logit contributions
Kids-0.358
Well-0.345
Okay-0.341
Sorry-0.332
Uh-0.326
Token on
Token ID319
Feature activation-2.31e+01

Pos logit contributions
 up+0.225
an+0.216
ish+0.202
ig+0.186
lers+0.184
Neg logit contributions
Well-0.401
Kids-0.398
Don-0.386
Sorry-0.372
 She-0.372
Token the
Token ID262
Feature activation-2.46e+01

Pos logit contributions
 up+0.176
an+0.166
ish+0.159
-+0.151
lers+0.149
Neg logit contributions
Kids-0.251
Well-0.242
Don-0.232
Dad-0.227
That-0.223
Token grass
Token ID8701
Feature activation-3.47e+01

Pos logit contributions
 up+0.201
ish+0.196
ishing+0.196
-+0.191
ig+0.190
Neg logit contributions
Kids-0.227
Max-0.205
Okay-0.205
Dad-0.203
Don-0.202
Token.
Token ID13
Feature activation-6.71e+00

Pos logit contributions
 up+0.240
an+0.220
ish+0.209
 crown+0.205
ig+0.197
Neg logit contributions
Well-0.363
Kids-0.360
Don-0.341
 She-0.334
Okay-0.334
Token mist
Token ID4020
Feature activation-3.57e+01

Pos logit contributions
 up+0.201
ish+0.182
an+0.180
-+0.174
lers+0.162
Neg logit contributions
Kids-0.255
Well-0.240
Okay-0.231
Don-0.230
Enough-0.228
Token and
Token ID290
Feature activation-1.65e+01

Pos logit contributions
an+0.255
 up+0.250
ish+0.239
 crown+0.226
ans+0.219
Neg logit contributions
Well-0.353
Kids-0.348
 She-0.335
Don-0.330
Sorry-0.322
Token into
Token ID656
Feature activation-2.42e+01

Pos logit contributions
an+0.150
 up+0.141
ish+0.141
-+0.137
ig+0.133
Neg logit contributions
Don-0.153
Kids-0.150
Well-0.149
Sorry-0.138
Okay-0.135
Token their
Token ID511
Feature activation-2.51e+01

Pos logit contributions
an+0.207
ish+0.204
 up+0.204
lers+0.171
asis+0.171
Neg logit contributions
Kids-0.234
Well-0.230
Don-0.224
Dad-0.208
Lisa-0.207
Token cozy
Token ID37438
Feature activation-3.32e+01

Pos logit contributions
 up+0.181
an+0.156
ish+0.149
rier+0.142
-+0.141
Neg logit contributions
Kids-0.274
Well-0.256
Okay-0.251
Don-0.250
Sorry-0.243
Token bur
Token ID4356
Feature activation-4.71e+01

Pos logit contributions
 up+0.253
aches+0.220
ish+0.213
an+0.211
b+0.207
Neg logit contributions
Kids-0.307
 She-0.303
Daddy-0.272
Dad-0.268
 They-0.266
Tokenrows
Token ID8516
Feature activation-3.09e+01

Pos logit contributions
 up+0.418
 crown+0.379
 statue+0.356
ish+0.348
 seas+0.348
Neg logit contributions
Don-0.305
Kids-0.300
Well-0.294
 She-0.281
She-0.275
Token.
Token ID13
Feature activation-8.40e+00

Pos logit contributions
an+0.246
ish+0.236
 up+0.215
ig+0.206
 crown+0.205
Neg logit contributions
Don-0.279
Well-0.270
 She-0.265
Kids-0.259
That-0.248
Token The
Token ID383
Feature activation-1.41e+01

Pos logit contributions
 up+0.097
 out+0.082
 home+0.082
an+0.078
 soaring+0.077
Neg logit contributions
Kids-0.061
Sorry-0.058
Okay-0.058
Well-0.056
Enough-0.054
Token end
Token ID886
Feature activation-1.63e+01

Pos logit contributions
 up+0.112
an+0.088
ish+0.088
-+0.085
 out+0.085
Neg logit contributions
Kids-0.163
Well-0.156
Okay-0.147
Don-0.147
Sorry-0.145
Token.
Token ID13
Feature activation-6.30e+00

Pos logit contributions
 up+0.103
an+0.086
ish+0.082
 crown+0.076
 soaring+0.074
Neg logit contributions
Kids-0.212
Well-0.202
Don-0.190
Okay-0.189
Sorry-0.186
Token a
Token ID257
Feature activation-3.24e+01

Pos logit contributions
Don+0.107
Well+0.104
Kids+0.098
Listen+0.093
That+0.093
Neg logit contributions
ish-0.143
an-0.142
 up-0.132
lers-0.131
est-0.127
Token stick
Token ID4859
Feature activation-3.89e+01

Pos logit contributions
 up+0.281
an+0.249
ish+0.235
lers+0.230
ishing+0.224
Neg logit contributions
Kids-0.328
Well-0.310
Okay-0.309
Uh-0.304
Don-0.302
Token in
Token ID287
Feature activation-2.81e+01

Pos logit contributions
 up+0.263
an+0.247
ish+0.225
 crown+0.212
lers+0.212
Neg logit contributions
Kids-0.404
Well-0.402
Don-0.388
Sorry-0.380
Okay-0.371
Token the
Token ID262
Feature activation-2.87e+01

Pos logit contributions
 up+0.216
an+0.194
ish+0.187
-+0.173
lers+0.173
Neg logit contributions
Kids-0.313
Well-0.307
Don-0.289
Okay-0.279
Dad-0.277
Token bird
Token ID6512
Feature activation-3.31e+01

Pos logit contributions
 up+0.230
-+0.194
an+0.192
 soaring+0.185
ish+0.181
Neg logit contributions
Kids-0.300
Well-0.279
Okay-0.272
Don-0.271
Uh-0.268
Tokenc
Token ID66
Feature activation-4.91e+01

Pos logit contributions
 up+0.230
rier+0.225
 seas+0.183
an+0.180
 aboard+0.178
Neg logit contributions
Kids-0.300
 She-0.296
Don-0.295
Well-0.289
 suggested-0.289
Tokenage
Token ID496
Feature activation-3.80e+01

Pos logit contributions
 up+0.479
 out+0.364
 underwater+0.356
 soaring+0.355
ish+0.354
Neg logit contributions
Kids-0.453
Well-0.443
Okay-0.417
Sorry-0.412
Dad-0.394
Token and
Token ID290
Feature activation-1.77e+01

Pos logit contributions
 up+0.263
an+0.252
ish+0.243
lers+0.223
 crown+0.217
Neg logit contributions
Well-0.389
Kids-0.380
Don-0.372
Okay-0.362
Sorry-0.358
Token the
Token ID262
Feature activation-2.32e+01

Pos logit contributions
an+0.180
 up+0.173
ish+0.162
asis+0.160
ig+0.160
Neg logit contributions
Kids-0.138
Well-0.135
Don-0.133
Sorry-0.123
Okay-0.121
Token bird
Token ID6512
Feature activation-2.85e+01

Pos logit contributions
 up+0.192
ish+0.151
lers+0.149
an+0.149
ishing+0.148
Neg logit contributions
Kids-0.245
Well-0.232
Okay-0.227
Don-0.226
Sorry-0.221
Token balanced
Token ID12974
Feature activation-2.90e+01

Pos logit contributions
 up+0.256
 seas+0.216
an+0.210
 home+0.209
 underwater+0.205
Neg logit contributions
Well-0.247
Kids-0.238
Don-0.237
Sorry-0.223
Okay-0.217
Token it
Token ID340
Feature activation-2.76e+01

Pos logit contributions
an+0.255
asis+0.251
 up+0.243
 soaring+0.239
 island+0.228
Neg logit contributions
Kids-0.258
Well-0.237
Don-0.220
Okay-0.213
Dad-0.206
Token to
Token ID284
Feature activation-1.82e+01

Pos logit contributions
an+0.207
asis+0.195
ig+0.191
 up+0.189
ish+0.188
Neg logit contributions
Kids-0.247
Don-0.237
Okay-0.233
Well-0.233
Sorry-0.230
Token the
Token ID262
Feature activation-2.33e+01

Pos logit contributions
 up+0.184
an+0.167
ish+0.162
-+0.156
lers+0.156
Neg logit contributions
Kids-0.167
Well-0.153
Don-0.141
Okay-0.140
Sorry-0.138
Token rh
Token ID9529
Feature activation-4.21e+01

Pos logit contributions
 up+0.186
an+0.157
ish+0.154
-+0.154
lers+0.154
Neg logit contributions
Kids-0.251
Well-0.238
Okay-0.233
Sorry-0.228
Don-0.227
Tokenin
Token ID259
Feature activation-3.49e+01

Pos logit contributions
 up+0.389
 out+0.306
 soaring+0.304
 crown+0.283
 underwater+0.276
Neg logit contributions
Well-0.407
Kids-0.403
Sorry-0.399
Okay-0.385
oral-0.382
Tokenoc
Token ID420
Feature activation-5.13e+01

Pos logit contributions
 up+0.300
 soaring+0.249
an+0.233
 crown+0.231
 out+0.229
Neg logit contributions
Well-0.337
Kids-0.334
Okay-0.321
Sorry-0.308
Brother-0.305
Tokeneros
Token ID27498
Feature activation-3.24e+01

Pos logit contributions
 up+0.384
 out+0.332
 soaring+0.316
 crown+0.293
 flying+0.293
Neg logit contributions
Well-0.547
Sorry-0.528
Kids-0.521
Okay-0.520
oral-0.503
Token.
Token ID13
Feature activation-9.36e+00

Pos logit contributions
 up+0.220
an+0.196
 soaring+0.180
ish+0.178
 crown+0.173
Neg logit contributions
Kids-0.349
Well-0.342
Don-0.323
Okay-0.322
Sorry-0.318
Token When
Token ID1649
Feature activation-1.45e+01

Pos logit contributions
 up+0.101
 home+0.084
 out+0.083
an+0.080
-+0.080
Neg logit contributions
Kids-0.077
Sorry-0.070
Okay-0.070
Enough-0.069
Well-0.068
Token he
Token ID339
Feature activation-1.87e+01

Pos logit contributions
 up+0.135
an+0.116
ish+0.111
-+0.110
 crown+0.106
Neg logit contributions
Kids-0.149
Well-0.141
Sorry-0.131
Don-0.131
Okay-0.131
Token tasted
Token ID29187
Feature activation-2.26e+01

Pos logit contributions
 up+0.195
an+0.170
ish+0.162
-+0.158
lers+0.157
Neg logit contributions
Kids-0.171
Well-0.161
Don-0.148
Sorry-0.147
Okay-0.146
Token and
Token ID290
Feature activation-2.40e+01

Pos logit contributions
Well+0.145
Kids+0.144
Don+0.139
Okay+0.134
Sorry+0.133
Neg logit contributions
an-0.084
 up-0.081
ish-0.075
lers-0.072
 crown-0.071
Token watched
Token ID7342
Feature activation-2.73e+01

Pos logit contributions
 up+0.231
an+0.229
ish+0.212
-+0.206
ig+0.202
Neg logit contributions
Kids-0.209
Well-0.198
Don-0.194
Okay-0.185
Sorry-0.182
Token as
Token ID355
Feature activation-2.27e+01

Pos logit contributions
an+0.240
asis+0.222
 up+0.219
ish+0.219
lers+0.207
Neg logit contributions
Kids-0.252
Well-0.238
Don-0.221
Dad-0.220
Daddy-0.217
Token the
Token ID262
Feature activation-2.39e+01

Pos logit contributions
 up+0.199
an+0.186
ish+0.177
-+0.166
lers+0.165
Neg logit contributions
Kids-0.232
Well-0.223
Don-0.209
Okay-0.206
Sorry-0.205
Token wood
Token ID4898
Feature activation-3.38e+01

Pos logit contributions
 up+0.193
an+0.159
ish+0.154
-+0.148
 out+0.147
Neg logit contributions
Kids-0.268
Well-0.253
Okay-0.244
Sorry-0.240
Don-0.236
Tokenpe
Token ID431
Feature activation-4.86e+01

Pos logit contributions
 up+0.313
an+0.279
ish+0.264
 crown+0.251
 home+0.250
Neg logit contributions
Kids-0.312
Well-0.310
Don-0.287
Okay-0.283
Sorry-0.283
Tokencker
Token ID15280
Feature activation-3.30e+01

Pos logit contributions
 up+0.401
 crown+0.329
an+0.326
 home+0.318
 seas+0.309
Neg logit contributions
Well-0.448
Sorry-0.437
Kids-0.425
Okay-0.424
Don-0.424
Token worked
Token ID3111
Feature activation-2.86e+01

Pos logit contributions
 up+0.334
an+0.301
ish+0.282
-+0.278
 home+0.276
Neg logit contributions
Well-0.260
Don-0.248
Kids-0.245
Sorry-0.240
Okay-0.228
Token away
Token ID1497
Feature activation-2.69e+01

Pos logit contributions
an+0.212
 up+0.188
ish+0.186
asis+0.180
 crown+0.170
Neg logit contributions
Well-0.293
Kids-0.285
Don-0.275
Okay-0.271
Together-0.266
Token.
Token ID13
Feature activation-7.42e+00

Pos logit contributions
an+0.197
ish+0.177
 up+0.177
lers+0.165
asis+0.162
Neg logit contributions
Well-0.260
Kids-0.253
Don-0.250
 She-0.245
Okay-0.242
Token He
Token ID679
Feature activation-1.57e+01

Pos logit contributions
 up+0.086
 home+0.072
 out+0.070
an+0.070
-+0.069
Neg logit contributions
Kids-0.058
Sorry-0.051
Okay-0.050
Well-0.049
Enough-0.049
Tokenitty
Token ID9760
Feature activation-3.30e+01

Pos logit contributions
 up+0.385
an+0.330
ish+0.317
 crown+0.316
-+0.315
Neg logit contributions
Kids-0.319
Well-0.304
Okay-0.280
Don-0.273
Sorry-0.271
Token wanted
Token ID2227
Feature activation-2.30e+01

Pos logit contributions
 up+0.317
an+0.276
 crown+0.262
ish+0.262
 home+0.262
Neg logit contributions
Kids-0.305
Well-0.297
Don-0.278
Okay-0.268
Sorry-0.267
Token to
Token ID284
Feature activation-1.89e+01

Pos logit contributions
 up+0.189
an+0.188
ish+0.179
ig+0.163
ling+0.163
Neg logit contributions
Kids-0.239
Well-0.224
Don-0.219
Listen-0.209
Um-0.206
Token introduce
Token ID10400
Feature activation-2.47e+01

Pos logit contributions
 up+0.191
an+0.176
lers+0.173
ish+0.172
ishing+0.168
Neg logit contributions
Kids-0.161
Well-0.143
Okay-0.139
Sorry-0.136
Dad-0.135
Token Dog
Token ID8532
Feature activation-2.82e+01

Pos logit contributions
 up+0.244
an+0.223
ish+0.215
 crown+0.207
 soaring+0.201
Neg logit contributions
Kids-0.228
Well-0.216
Don-0.202
Okay-0.199
Sorry-0.197
Tokengy
Token ID1360
Feature activation-4.83e+01

Pos logit contributions
 up+0.186
an+0.179
ish+0.174
asis+0.173
ig+0.173
Neg logit contributions
Kids-0.304
Max-0.281
Don-0.279
Well-0.270
Okay-0.265
Token to
Token ID284
Feature activation-1.74e+01

Pos logit contributions
 up+0.366
an+0.355
ish+0.328
ig+0.323
asis+0.321
Neg logit contributions
Kids-0.449
Well-0.424
Okay-0.415
Don-0.405
Enough-0.393
Token her
Token ID607
Feature activation-2.25e+01

Pos logit contributions
 up+0.153
an+0.138
ish+0.135
lers+0.128
-+0.122
Neg logit contributions
Kids-0.184
Well-0.169
Don-0.166
Okay-0.160
Listen-0.156
Token ball
Token ID2613
Feature activation-3.06e+01

Pos logit contributions
 up+0.147
an+0.140
ish+0.128
lers+0.127
ishing+0.120
Neg logit contributions
Kids-0.233
Well-0.231
Okay-0.224
Don-0.219
Max-0.212
Token.
Token ID13
Feature activation-5.96e+00

Pos logit contributions
 up+0.202
an+0.165
ish+0.153
 crown+0.147
 soaring+0.146
Neg logit contributions
Kids-0.382
Well-0.363
Okay-0.349
Don-0.348
Sorry-0.342
Token She
Token ID1375
Feature activation-1.32e+01

Pos logit contributions
 up+0.068
-+0.057
an+0.057
 home+0.057
 out+0.056
Neg logit contributions
Kids-0.049
Well-0.043
Sorry-0.043
Enough-0.043
Okay-0.042