TOP ACTIVATIONS
MAX = 0.175

never imagined there would be such a beautiful thing in the
smile . She had become such a clever girl and she
his hard work had created such a pleasant image . He
be here . It 's such a beautiful day ." They
thanking Him for giving him such a caring wife .
the mouse for finding him such a difficult hiding spot .
home because it had been such a lovely night . <|endoftext|>
was very fortunate to get such a special poem . <|endoftext|>
said that it had been such a nice experience that they
friends thanked her for being such a good leader . Lily
to be able to create such a special place . <|endoftext|>
being persistent enough to find such a special place .
I 'm glad you are such a competitive kid , Jimmy
the sky - it was such a surprise ! They opened
felt lucky to have found such a wonderful place to relax
says , " You are such good twins . You are
. He thought it was such a great lesson , and
excited . It had been such a wonderful day ! <|endoftext|>
was happy to have had such a fun time . It
too . It had been such a lovely adventure ! <|endoftext|>

MIN ACTIVATIONS
MIN = -0.196

. The great Zen o hugs Ben and says
One day , little Jo anne was r aking the leaves
a small bird named B ibi , fly away from his
gate was broken ," the woman says . "
way home ... <|endoftext|> Jo anne was walking in the woods
The se ag ull tried to take the oy
to play , so G aby said , " I don
coming right now ," the woman says . "
here , cat ," the woman said with a big smile
said . The maid smiled . " Yes ,
sorry , kids ," the woman says . " Rex is
smart and safe ," the policeman says . "
the park ," the other kid said . They
a nice surprise ," the woman said . " I brought
deep !" The kid remembered what the voice said
Hello , kids !" the woman says . " Do you
Cic i said , B ibi came back to play with
to you ," the other kid said . "
said . The maid quickly ran away to find
arms . The se ag ull flew away . Tommy laughed

INTERVAL 0.082 - 0.175
CONTAINS 1.601%

. He never used the knife again , and
that their hard work had made a difference . <|endoftext|> Once
picking up her toys and putting them away .
feather like it . She felt so happy to have saved
They keep playing soccer and having fun . <|endoftext|> Anna liked

INTERVAL -0.011 - 0.082
CONTAINS 60.129%

and opened the box . Inside was a surprise ! A
special person she knew : her dad . She was a
and continued his patrol . The guard was never scared again
her . So , Lily decided to try to fix the
the perfect sky . <|endoftext|> Once there was a bald man

INTERVAL -0.103 - -0.011
CONTAINS 32.037%

He liked to play with it in the water . One
it is ours now !" one of the big kids says
ie was relieved and grateful . She knew she had found
s find something better for you to do . How about
of Lily 's hands . It flew far away . Lily

INTERVAL -0.196 - -0.103
CONTAINS 6.233%

bread to it . The goose caught the bread and ate
something I can own ?" Mom thought for a minute and
Tom says . Mom shakes her head and picks
was a little boy named Tim my . Tim my loved
boy named Tim my . Tim my was shy and didn
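
The interval boundaries listed above are consistent with splitting the range from MIN = -0.196 to MAX = 0.175 into four equal-width bins, and the four CONTAINS percentages add up to 100%. The following is a minimal sketch of that check. Equal-width binning is an assumption inferred from the numbers (the dashboard does not say how the bins were chosen), and `bin_edges` is a hypothetical helper name.

```python
def bin_edges(lo: float, hi: float, n_bins: int) -> list[float]:
    """Return n_bins + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as reported in the dashboard header.
edges = bin_edges(-0.196, 0.175, 4)

# Interval boundaries as printed in the INTERVAL lines (three decimals).
listed = [-0.196, -0.103, -0.011, 0.082, 0.175]

# Largest disagreement between computed and printed boundaries.
max_gap = max(abs(e - l) for e, l in zip(edges, listed))

# Percentages from the four CONTAINS lines, highest interval first.
shares = [1.601, 60.129, 32.037, 6.233]
total = sum(shares)

print(f"max boundary gap: {max_gap:.4f}")
print(f"total share: {total:.3f}%")
```

A gap below 0.001 is what three-decimal rounding of the printed boundaries would produce, so the printed intervals agree with the equal-width assumption.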
Token never
Token ID1239
Feature activation+4.38e-02

Pos logit contributions
o+0.065
jo+0.062
os+0.060
ppy+0.059
azz+0.058
Neg logit contributions
 top-0.073
 great-0.070
 amazing-0.069
 new-0.069
 accomplishment-0.068
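
Each per-token record in this dump follows the same layout: a token line, a token ID line, a feature activation line, then positive and negative logit contributions, with every label fused directly onto its value (for example `Token ID1239` or `o+0.065`). The sketch below shows one way to split such a record back into fields. It assumes the layout seen here holds for every record, and `parse_record` is a hypothetical helper, not part of any tool that produced this dump. Everything after the `Token` label is kept verbatim, so a leading space in the token string is preserved.

```python
import re

# A signed decimal at the very end of a line, e.g. "+0.065" or "-0.073".
_NUM = re.compile(r"([+-]\d+\.\d+(?:e[+-]\d+)?)$")


def parse_record(lines: list[str]) -> dict:
    """Parse one per-token record (layout assumed from the dump)."""
    token = lines[0][len("Token"):]                # keeps any leading space
    token_id = int(lines[1][len("Token ID"):])
    activation = float(lines[2][len("Feature activation"):])

    pos: list[tuple[str, float]] = []
    neg: list[tuple[str, float]] = []
    bucket = None
    for line in lines[3:]:
        if line == "Pos logit contributions":
            bucket = pos
        elif line == "Neg logit contributions":
            bucket = neg
        elif bucket is not None and (m := _NUM.search(line)):
            # Text before the trailing number is the output token.
            bucket.append((line[:m.start()], float(m.group(1))))

    return {"token": token, "token_id": token_id,
            "activation": activation, "pos": pos, "neg": neg}


# The first record of the dump, copied line by line.
rec = parse_record([
    "Token never",
    "Token ID1239",
    "Feature activation+4.38e-02",
    "",
    "Pos logit contributions",
    "o+0.065", "jo+0.062", "os+0.060", "ppy+0.059", "azz+0.058",
    "Neg logit contributions",
    " top-0.073", " great-0.070", " amazing-0.069",
    " new-0.069", " accomplishment-0.068",
])
print(rec["token_id"], rec["activation"], rec["pos"][0], rec["neg"][0])
```

Reading the first three fields by position rather than by prefix avoids confusing the `Token ID` label with a token whose own text starts with `ID`.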
Token imagined
Token ID15758
Feature activation+7.85e-02

Pos logit contributions
 anything+0.085
 new+0.078
 something+0.078
 trouble+0.077
 great+0.076
Neg logit contributions
jo-0.095
o-0.094
amb-0.089
itty-0.089
os-0.088
Token there
Token ID612
Feature activation+7.40e-03

Pos logit contributions
 chores+0.123
 job+0.121
 biggest+0.116
 accomplishment+0.116
 middle+0.113
Neg logit contributions
o-0.193
os-0.183
y-0.175
jo-0.174
ppy-0.167
Token would
Token ID561
Feature activation+6.46e-02

Pos logit contributions
 accomplishment+0.013
 chores+0.013
 top+0.013
 biggest+0.013
 progress+0.012
Neg logit contributions
o-0.019
y-0.017
jo-0.017
os-0.017
ppy-0.017
Token be
Token ID307
Feature activation+9.45e-02

Pos logit contributions
 chores+0.112
 accomplishment+0.110
 biggest+0.109
 trouble+0.105
 progress+0.104
Neg logit contributions
o-0.149
y-0.145
ppy-0.140
jo-0.138
os-0.138
Token such
Token ID884
Feature activation+1.66e-01

Pos logit contributions
 chores+0.120
 accomplishment+0.120
 top+0.119
 bottom+0.108
 biggest+0.106
Neg logit contributions
o-0.288
y-0.273
os-0.267
ppy-0.265
jo-0.264
Token a
Token ID257
Feature activation+9.39e-02

Pos logit contributions
 bottom+0.212
 top+0.199
 chores+0.199
 middle+0.192
 accomplishment+0.191
Neg logit contributions
y-0.433
ppy-0.425
o-0.423
jo-0.388
ety-0.379
Token beautiful
Token ID4950
Feature activation+7.09e-02

Pos logit contributions
 chores+0.109
 anything+0.100
 top+0.096
 accomplishment+0.095
 progress+0.090
Neg logit contributions
o-0.288
y-0.275
jo-0.268
ppy-0.267
os-0.254
Token thing
Token ID1517
Feature activation-4.01e-02

Pos logit contributions
 chores+0.121
 anything+0.112
 biggest+0.109
 top+0.108
 accomplishment+0.106
Neg logit contributions
o-0.179
y-0.177
jo-0.168
ppy-0.161
os-0.156
Token in
Token ID287
Feature activation+8.59e-02

Pos logit contributions
o+0.104
jo+0.097
ppy+0.095
os+0.095
y+0.093
Neg logit contributions
 top-0.070
 anything-0.070
 biggest-0.067
 chores-0.066
 new-0.066
Token the
Token ID262
Feature activation+8.35e-02

Pos logit contributions
 biggest+0.132
 chores+0.123
 accomplishment+0.119
 progress+0.113
 best+0.111
Neg logit contributions
y-0.228
o-0.215
bug-0.206
os-0.204
ppy-0.196
Token smile
Token ID8212
Feature activation-3.71e-02

Pos logit contributions
 top+0.051
 accomplishment+0.049
 anything+0.049
 chores+0.048
 new+0.048
Neg logit contributions
o-0.082
os-0.075
jo-0.075
ppy-0.075
y-0.073
Token.
Token ID13
Feature activation-4.83e-02

Pos logit contributions
o+0.109
jo+0.102
os+0.102
ppy+0.100
azz+0.096
Neg logit contributions
 top-0.053
 anything-0.050
 something-0.047
 bottom-0.047
 accomplishment-0.046
Token She
Token ID1375
Feature activation-2.71e-02

Pos logit contributions
o+0.116
ppy+0.107
jo+0.106
y+0.105
os+0.104
Neg logit contributions
 top-0.096
 anything-0.091
 biggest-0.091
 new-0.090
 accomplishment-0.090
Token had
Token ID550
Feature activation+5.71e-02

Pos logit contributions
o+0.060
jo+0.056
ppy+0.056
os+0.055
azz+0.054
Neg logit contributions
 top-0.057
 accomplishment-0.054
 new-0.054
 great-0.054
 most-0.053
Token become
Token ID1716
Feature activation+1.20e-01

Pos logit contributions
 top+0.107
 chores+0.104
 anything+0.102
 accomplishment+0.102
 biggest+0.102
Neg logit contributions
o-0.140
jo-0.127
ppy-0.126
y-0.126
os-0.126
Token such
Token ID884
Feature activation+1.64e-01

Pos logit contributions
 bottom+0.164
 top+0.146
wered+0.142
 chores+0.141
 site+0.139
Neg logit contributions
o-0.325
bug-0.303
y-0.302
ppy-0.291
os-0.278
Token a
Token ID257
Feature activation+8.74e-02

Pos logit contributions
 bottom+0.263
 avail+0.253
 site+0.248
 top+0.228
 towers+0.228
Neg logit contributions
o-0.368
ppy-0.359
y-0.356
bug-0.354
ne-0.313
Token clever
Token ID14169
Feature activation+5.23e-02

Pos logit contributions
 anything+0.080
 chores+0.078
 yourselves+0.070
 top+0.070
wered+0.068
Neg logit contributions
o-0.287
y-0.269
jo-0.265
ppy-0.255
os-0.253
Token girl
Token ID2576
Feature activation-8.48e-02

Pos logit contributions
 anything+0.079
 top+0.078
 chores+0.077
 biggest+0.076
 bottom+0.072
Neg logit contributions
o-0.143
y-0.135
jo-0.135
ppy-0.131
os-0.129
Token and
Token ID290
Feature activation+4.51e-04

Pos logit contributions
o+0.207
jo+0.195
os+0.193
ppy+0.191
y+0.186
Neg logit contributions
 anything-0.154
 top-0.151
 new-0.147
 something-0.146
 most-0.146
Token she
Token ID673
Feature activation-2.72e-02

Pos logit contributions
 top+0.001
 chores+0.001
 bottom+0.001
 anything+0.001
 biggest+0.001
Neg logit contributions
o-0.001
jo-0.001
os-0.001
y-0.001
ppy-0.001
Token his
Token ID465
Feature activation+3.61e-02

Pos logit contributions
 top+0.052
 biggest+0.051
 yourselves+0.051
 chores+0.050
 bottom+0.049
Neg logit contributions
o-0.073
jo-0.065
os-0.065
y-0.065
ppy-0.060
Token hard
Token ID1327
Feature activation+1.86e-02

Pos logit contributions
 anything+0.057
 biggest+0.049
 top+0.049
 yourselves+0.049
 chores+0.048
Neg logit contributions
o-0.098
y-0.090
jo-0.090
os-0.089
ppy-0.085
Token work
Token ID670
Feature activation-1.92e-03

Pos logit contributions
 anything+0.020
 top+0.020
 biggest+0.019
 chores+0.018
 accomplishment+0.017
Neg logit contributions
o-0.059
jo-0.056
y-0.056
os-0.055
ppy-0.055
Token had
Token ID550
Feature activation+7.49e-02

Pos logit contributions
o+0.004
jo+0.003
ppy+0.003
os+0.003
y+0.003
Neg logit contributions
 top-0.004
 anything-0.004
 chores-0.004
 biggest-0.004
 accomplishment-0.004
Token created
Token ID2727
Feature activation+7.88e-02

Pos logit contributions
 top+0.171
 biggest+0.165
 chores+0.165
 bottom+0.161
 accomplishment+0.161
Neg logit contributions
o-0.153
y-0.137
os-0.136
ppy-0.133
jo-0.132
Token such
Token ID884
Feature activation+1.64e-01

Pos logit contributions
 top+0.113
wered+0.111
 biggest+0.111
 chores+0.109
 bottom+0.109
Neg logit contributions
o-0.197
y-0.194
jo-0.185
os-0.185
ppy-0.169
Token a
Token ID257
Feature activation+9.20e-02

Pos logit contributions
 avail+0.222
 bottom+0.217
 middle+0.207
 top+0.202
 yourselves+0.196
Neg logit contributions
y-0.400
o-0.397
ppy-0.368
bug-0.338
jo-0.335
Token pleasant
Token ID15497
Feature activation+5.39e-02

Pos logit contributions
 anything+0.087
 chores+0.085
wered+0.080
 top+0.076
 yourselves+0.075
Neg logit contributions
o-0.290
y-0.278
jo-0.268
ppy-0.259
os-0.253
Token image
Token ID2939
Feature activation-6.83e-02

Pos logit contributions
 anything+0.064
 biggest+0.062
 chores+0.062
 top+0.061
 yourselves+0.054
Neg logit contributions
o-0.162
y-0.160
jo-0.155
os-0.147
ppy-0.145
Token.
Token ID13
Feature activation-5.00e-02

Pos logit contributions
o+0.173
jo+0.160
ppy+0.158
os+0.158
y+0.157
Neg logit contributions
 top-0.122
 anything-0.120
 chores-0.117
 biggest-0.117
 accomplishment-0.116
Token He
Token ID679
Feature activation-2.77e-02

Pos logit contributions
o+0.127
ppy+0.125
y+0.121
jo+0.119
os+0.117
Neg logit contributions
 top-0.088
 biggest-0.082
 accomplishment-0.082
 anything-0.082
 new-0.082
Token be
Token ID307
Feature activation+9.03e-02

Pos logit contributions
 top+0.095
 biggest+0.093
 accomplishment+0.091
 bottom+0.090
 chores+0.090
Neg logit contributions
o-0.188
ppy-0.172
os-0.168
jo-0.168
y-0.164
Token here
Token ID994
Feature activation+4.69e-02

Pos logit contributions
 top+0.111
 bottom+0.108
 accomplishment+0.102
 chores+0.101
 anything+0.099
Neg logit contributions
o-0.297
ppy-0.262
y-0.260
os-0.259
jo-0.254
Token.
Token ID13
Feature activation-4.84e-02

Pos logit contributions
 top+0.089
 chores+0.087
 anything+0.087
 accomplishment+0.086
 biggest+0.085
Neg logit contributions
o-0.115
y-0.103
ppy-0.103
os-0.103
jo-0.103
Token It
Token ID632
Feature activation-3.39e-02

Pos logit contributions
o+0.113
jo+0.108
y+0.108
ppy+0.107
os+0.105
Neg logit contributions
 top-0.092
 anything-0.091
 biggest-0.090
 accomplishment-0.088
 most-0.088
Token's
Token ID338
Feature activation+6.52e-02

Pos logit contributions
o+0.060
jo+0.058
ppy+0.055
azz+0.053
y+0.052
Neg logit contributions
 biggest-0.086
 most-0.085
 new-0.085
 top-0.085
 anything-0.084
Token such
Token ID884
Feature activation+1.64e-01

Pos logit contributions
 bottom+0.097
 top+0.093
 progress+0.089
 accomplishment+0.088
 middle+0.085
Neg logit contributions
o-0.189
y-0.161
ppy-0.159
my-0.150
jo-0.148
Token a
Token ID257
Feature activation+9.09e-02

Pos logit contributions
 bottom+0.238
 towers+0.221
 top+0.216
 avail+0.214
 accomplishment+0.208
Neg logit contributions
o-0.435
y-0.396
ppy-0.391
ety-0.362
ah-0.348
Token beautiful
Token ID4950
Feature activation+7.10e-02

Pos logit contributions
 anything+0.089
 chores+0.083
 top+0.081
 bottom+0.077
 yourselves+0.077
Neg logit contributions
o-0.298
y-0.265
jo-0.261
ppy-0.253
itty-0.250
Token day
Token ID1110
Feature activation+2.53e-02

Pos logit contributions
 anything+0.074
 top+0.073
 chores+0.071
 biggest+0.069
 accomplishment+0.067
Neg logit contributions
o-0.233
jo-0.220
y-0.218
os-0.213
ppy-0.210
Token."
Token ID526
Feature activation-5.21e-02

Pos logit contributions
 top+0.043
 anything+0.043
 biggest+0.042
 chores+0.041
 new+0.041
Neg logit contributions
o-0.066
jo-0.062
ppy-0.060
os-0.060
y-0.060
Token They
Token ID1119
Feature activation-2.98e-02

Pos logit contributions
o+0.145
jo+0.142
os+0.141
ppy+0.140
y+0.137
Neg logit contributions
 new-0.079
 many-0.077
 top-0.077
 good-0.071
 something-0.070
Token thanking
Token ID41892
Feature activation+4.91e-02

Pos logit contributions
 top+0.015
 anything+0.014
 chores+0.014
 biggest+0.014
 accomplishment+0.014
Neg logit contributions
o-0.019
jo-0.017
os-0.017
ppy-0.017
y-0.017
Token Him
Token ID10978
Feature activation-5.71e-02

Pos logit contributions
 biggest+0.083
 top+0.079
 bottom+0.078
 chores+0.077
 trouble+0.076
Neg logit contributions
o-0.120
jo-0.119
ppy-0.114
y-0.112
os-0.110
Token for
Token ID329
Feature activation+6.07e-02

Pos logit contributions
o+0.161
jo+0.150
os+0.148
ppy+0.147
y+0.147
Neg logit contributions
 top-0.086
 anything-0.084
 chores-0.083
 accomplishment-0.081
 biggest-0.081
Token giving
Token ID3501
Feature activation+8.18e-02

Pos logit contributions
 biggest+0.089
 bottom+0.086
 top+0.086
 middle+0.074
 holes+0.074
Neg logit contributions
o-0.162
jo-0.150
ppy-0.148
os-0.146
y-0.142
Token him
Token ID683
Feature activation-2.04e-02

Pos logit contributions
 top+0.127
 biggest+0.123
 accomplishment+0.122
 chores+0.122
 anything+0.120
Neg logit contributions
o-0.224
jo-0.211
y-0.208
os-0.205
ppy-0.204
Token such
Token ID884
Feature activation+1.63e-01

Pos logit contributions
o+0.070
jo+0.065
ppy+0.065
os+0.065
y+0.065
Neg logit contributions
 top-0.018
 accomplishment-0.017
 bottom-0.016
 biggest-0.016
 anything-0.016
Token a
Token ID257
Feature activation+9.06e-02

Pos logit contributions
 bottom+0.229
 avail+0.208
 top+0.203
 towers+0.193
 middle+0.193
Neg logit contributions
o-0.420
y-0.380
ppy-0.369
jo-0.362
ah-0.343
Token caring
Token ID18088
Feature activation+1.56e-02

Pos logit contributions
 anything+0.085
 yourselves+0.080
 top+0.079
 chores+0.076
 bottom+0.074
Neg logit contributions
o-0.298
jo-0.276
y-0.275
ppy-0.264
os-0.261
Token wife
Token ID3656
Feature activation-7.25e-02

Pos logit contributions
 top+0.022
 anything+0.022
 chores+0.021
 accomplishment+0.021
 biggest+0.021
Neg logit contributions
o-0.046
jo-0.042
os-0.042
ppy-0.042
y-0.042
Token.
Token ID13
Feature activation-4.52e-02

Pos logit contributions
o+0.177
jo+0.164
os+0.163
ppy+0.161
y+0.160
Neg logit contributions
 anything-0.135
 top-0.132
 chores-0.130
 new-0.129
 biggest-0.128
Token
Token ID198
Feature activation+9.97e-04

Pos logit contributions
o+0.117
ppy+0.112
y+0.110
jo+0.108
os+0.108
Neg logit contributions
 top-0.076
 anything-0.073
 accomplishment-0.071
 new-0.071
 biggest-0.071
Token the
Token ID262
Feature activation+3.29e-02

Pos logit contributions
 biggest+0.044
 chores+0.042
 top+0.039
 bottom+0.037
 accomplishment+0.037
Neg logit contributions
o-0.160
jo-0.150
os-0.149
ppy-0.148
y-0.147
Token mouse
Token ID10211
Feature activation-1.39e-01

Pos logit contributions
 anything+0.076
 chores+0.075
 top+0.074
 accomplishment+0.073
 biggest+0.072
Neg logit contributions
o-0.067
jo-0.059
y-0.059
ppy-0.058
os-0.057
Token for
Token ID329
Feature activation+3.88e-02

Pos logit contributions
o+0.395
jo+0.378
os+0.372
ppy+0.370
y+0.363
Neg logit contributions
 top-0.203
 new-0.198
 anything-0.192
 accomplishment-0.192
 chores-0.188
Token finding
Token ID4917
Feature activation+9.31e-02

Pos logit contributions
 top+0.064
 biggest+0.064
 bottom+0.062
 chores+0.059
 progress+0.059
Neg logit contributions
o-0.105
ppy-0.089
os-0.087
y-0.084
jo-0.082
Token him
Token ID683
Feature activation-2.10e-02

Pos logit contributions
 chores+0.126
 bottom+0.126
 top+0.121
 accomplishment+0.116
 End+0.111
Neg logit contributions
o-0.274
os-0.248
ppy-0.240
y-0.240
jo-0.239
Token such
Token ID884
Feature activation+1.63e-01

Pos logit contributions
o+0.048
ppy+0.043
y+0.043
os+0.042
jo+0.042
Neg logit contributions
 top-0.043
 chores-0.041
 accomplishment-0.041
 biggest-0.041
 anything-0.041
Token a
Token ID257
Feature activation+8.89e-02

Pos logit contributions
 bottom+0.260
 sidelines+0.247
 top+0.230
 chores+0.229
 avail+0.229
Neg logit contributions
o-0.365
ppy-0.335
y-0.323
lets-0.294
ety-0.289
Token difficult
Token ID2408
Feature activation+8.26e-02

Pos logit contributions
 anything+0.087
 chores+0.081
 yourselves+0.078
wered+0.076
 avail+0.073
Neg logit contributions
o-0.291
y-0.263
ppy-0.260
jo-0.258
ah-0.247
Token hiding
Token ID11816
Feature activation+5.90e-02

Pos logit contributions
 anything+0.096
 yourselves+0.095
 biggest+0.095
 chores+0.095
 top+0.092
Neg logit contributions
o-0.238
y-0.234
jo-0.226
ppy-0.222
ah-0.212
Token spot
Token ID4136
Feature activation-1.35e-02

Pos logit contributions
 chores+0.073
 biggest+0.067
 progress+0.065
 anything+0.063
 accomplishment+0.063
Neg logit contributions
o-0.176
y-0.172
jo-0.167
ppy-0.162
os-0.159
Token.
Token ID13
Feature activation-4.97e-02

Pos logit contributions
o+0.033
jo+0.030
ppy+0.030
os+0.030
y+0.030
Neg logit contributions
 top-0.025
 anything-0.025
 chores-0.024
 biggest-0.024
 accomplishment-0.024
Token home
Token ID1363
Feature activation-3.88e-02

Pos logit contributions
 top+0.015
 anything+0.015
 chores+0.015
 biggest+0.015
 accomplishment+0.015
Neg logit contributions
o-0.025
jo-0.023
os-0.023
ppy-0.023
y-0.023
Token because
Token ID780
Feature activation-7.30e-03

Pos logit contributions
o+0.095
jo+0.087
y+0.087
os+0.087
ppy+0.085
Neg logit contributions
 top-0.072
 chores-0.071
 anything-0.070
 biggest-0.070
 accomplishment-0.069
Token it
Token ID340
Feature activation-5.09e-02

Pos logit contributions
o+0.017
os+0.015
jo+0.015
y+0.015
azz+0.013
Neg logit contributions
 biggest-0.013
 chores-0.013
 top-0.013
 bottom-0.013
 yourselves-0.012
Token had
Token ID550
Feature activation+7.15e-02

Pos logit contributions
o+0.111
jo+0.102
ppy+0.101
os+0.099
y+0.097
Neg logit contributions
 top-0.108
 anything-0.107
 biggest-0.105
 new-0.104
 most-0.104
Token been
Token ID587
Feature activation+9.00e-02

Pos logit contributions
 top+0.113
 chores+0.112
 accomplishment+0.111
 biggest+0.106
 bottom+0.106
Neg logit contributions
o-0.191
y-0.177
jo-0.175
os-0.174
ppy-0.171
Token such
Token ID884
Feature activation+1.62e-01

Pos logit contributions
 top+0.124
 chores+0.119
 accomplishment+0.117
 biggest+0.114
 bottom+0.113
Neg logit contributions
o-0.272
y-0.251
os-0.248
jo-0.246
ppy-0.246
Token a
Token ID257
Feature activation+8.88e-02

Pos logit contributions
 bottom+0.238
 avail+0.236
 top+0.224
 chores+0.217
 site+0.215
Neg logit contributions
o-0.386
y-0.371
ppy-0.353
jo-0.322
ah-0.321
Token lovely
Token ID14081
Feature activation+7.06e-02

Pos logit contributions
 anything+0.080
 chores+0.079
 yourselves+0.072
 top+0.071
wered+0.071
Neg logit contributions
o-0.289
y-0.273
jo-0.269
ppy-0.257
ah-0.252
Token night
Token ID1755
Feature activation-6.18e-03

Pos logit contributions
 anything+0.090
 top+0.089
 chores+0.087
 biggest+0.086
 accomplishment+0.081
Neg logit contributions
o-0.211
jo-0.201
y-0.200
ppy-0.192
os-0.192
Token.
Token ID13
Feature activation-4.79e-02

Pos logit contributions
o+0.015
jo+0.014
y+0.014
os+0.014
ppy+0.014
Neg logit contributions
 top-0.011
 chores-0.011
 biggest-0.011
 anything-0.011
 accomplishment-0.011
Token<|endoftext|>
Token ID50256
Feature activation-5.49e-02

Pos logit contributions
o+0.119
jo+0.110
ppy+0.109
os+0.108
y+0.108
Neg logit contributions
 top-0.089
 anything-0.087
 chores-0.085
 biggest-0.085
 accomplishment-0.085
Token was
Token ID373
Feature activation+7.16e-02

Pos logit contributions
o+0.038
jo+0.035
ppy+0.035
os+0.034
azz+0.034
Neg logit contributions
 anything-0.045
 most-0.044
 top-0.044
 great-0.044
 new-0.043
Token very
Token ID845
Feature activation+6.05e-02

Pos logit contributions
 top+0.101
 bottom+0.096
 accomplishment+0.096
 chores+0.095
 avail+0.092
Neg logit contributions
o-0.203
y-0.185
ppy-0.179
os-0.177
jo-0.176
Token fortunate
Token ID20200
Feature activation+8.17e-02

Pos logit contributions
 top+0.079
 anything+0.074
 bottom+0.074
 chores+0.071
 accomplishment+0.067
Neg logit contributions
o-0.186
y-0.171
ppy-0.169
jo-0.167
os-0.164
Token to
Token ID284
Feature activation+7.79e-02

Pos logit contributions
 top+0.179
 chores+0.173
 bottom+0.171
 anything+0.171
 biggest+0.171
Neg logit contributions
o-0.176
y-0.164
jo-0.157
os-0.154
ppy-0.151
Token get
Token ID651
Feature activation+9.91e-02

Pos logit contributions
 biggest+0.139
 top+0.135
 accomplishment+0.131
 chores+0.131
 anything+0.129
Neg logit contributions
o-0.189
jo-0.178
ppy-0.175
os-0.171
y-0.169
Token such
Token ID884
Feature activation+1.62e-01

Pos logit contributions
 top+0.133
 bottom+0.127
 accomplishment+0.125
 chores+0.119
 biggest+0.119
Neg logit contributions
o-0.299
y-0.274
os-0.273
jo-0.267
ppy-0.267
Token a
Token ID257
Feature activation+8.75e-02

Pos logit contributions
 bottom+0.237
 top+0.207
 middle+0.204
 avail+0.191
wered+0.191
Neg logit contributions
o-0.405
ppy-0.377
y-0.375
jo-0.352
bug-0.333
Token special
Token ID2041
Feature activation+7.20e-02

Pos logit contributions
 top+0.099
 anything+0.097
 chores+0.094
 biggest+0.090
 bottom+0.090
Neg logit contributions
o-0.275
jo-0.258
y-0.257
ppy-0.253
os-0.249
Token poem
Token ID21247
Feature activation-4.69e-02

Pos logit contributions
 top+0.144
 anything+0.142
 chores+0.139
 biggest+0.138
 accomplishment+0.136
Neg logit contributions
o-0.165
jo-0.153
y-0.151
ppy-0.148
os-0.146
Token.
Token ID13
Feature activation-4.85e-02

Pos logit contributions
o+0.117
jo+0.107
y+0.106
os+0.105
ppy+0.105
Neg logit contributions
 top-0.086
 anything-0.083
 biggest-0.083
 chores-0.083
 accomplishment-0.082
Token<|endoftext|>
Token ID50256
Feature activation-5.71e-02

Pos logit contributions
o+0.124
ppy+0.122
y+0.119
jo+0.117
os+0.114
Neg logit contributions
 top-0.083
 anything-0.078
 biggest-0.078
 accomplishment-0.077
 new-0.077
Token said
Token ID531
Feature activation-2.90e-03

Pos logit contributions
o+0.251
jo+0.247
os+0.241
ppy+0.237
azz+0.230
Neg logit contributions
 top-0.258
 new-0.252
 something-0.247
 accomplishment-0.245
 most-0.242
Token that
Token ID326
Feature activation+9.13e-03

Pos logit contributions
o+0.008
y+0.007
jo+0.007
ppy+0.007
os+0.007
Neg logit contributions
 top-0.005
 biggest-0.005
 chores-0.005
 anything-0.005
 accomplishment-0.004
Token it
Token ID340
Feature activation-4.33e-02

Pos logit contributions
 top+0.013
 biggest+0.013
 accomplishment+0.013
 chores+0.013
 bottom+0.012
Neg logit contributions
o-0.026
os-0.024
y-0.024
jo-0.024
ppy-0.024
Token had
Token ID550
Feature activation+6.73e-02

Pos logit contributions
o+0.094
jo+0.086
ppy+0.085
os+0.084
y+0.082
Neg logit contributions
 top-0.094
 anything-0.092
 chores-0.091
 new-0.090
 biggest-0.090
Token been
Token ID587
Feature activation+9.12e-02

Pos logit contributions
 top+0.134
 biggest+0.129
 accomplishment+0.129
 chores+0.129
 anything+0.127
Neg logit contributions
o-0.156
y-0.141
os-0.140
ppy-0.140
jo-0.140
Token such
Token ID884
Feature activation+1.62e-01

Pos logit contributions
 top+0.144
 accomplishment+0.139
 bottom+0.134
 chores+0.134
 biggest+0.133
Neg logit contributions
o-0.255
y-0.234
ppy-0.229
os-0.225
jo-0.222
Token a
Token ID257
Feature activation+8.58e-02

Pos logit contributions
 bottom+0.214
 avail+0.201
 top+0.196
 middle+0.196
 accomplishment+0.190
Neg logit contributions
y-0.397
o-0.394
ppy-0.373
ety-0.343
jo-0.340
Token nice
Token ID3621
Feature activation+5.62e-02

Pos logit contributions
 anything+0.086
 chores+0.079
 yourselves+0.078
wered+0.076
 top+0.073
Neg logit contributions
o-0.267
y-0.253
jo-0.247
ppy-0.238
ah-0.233
Token experience
Token ID1998
Feature activation+2.87e-02

Pos logit contributions
 anything+0.065
 top+0.061
 chores+0.061
 biggest+0.060
 yourselves+0.056
Neg logit contributions
o-0.174
y-0.170
jo-0.166
os-0.157
ppy-0.155
Token that
Token ID326
Feature activation+3.64e-02

Pos logit contributions
 top+0.056
 anything+0.054
 chores+0.054
 biggest+0.054
 accomplishment+0.053
Neg logit contributions
o-0.068
jo-0.063
y-0.062
os-0.062
ppy-0.061
Token they
Token ID484
Feature activation-2.17e-02

Pos logit contributions
 top+0.065
 anything+0.062
 chores+0.062
 biggest+0.062
 accomplishment+0.061
Neg logit contributions
o-0.093
jo-0.085
os-0.085
y-0.085
ppy-0.084
Token friends
Token ID2460
Feature activation-6.16e-02

Pos logit contributions
jo+0.380
azz+0.368
os+0.358
o+0.357
ppy+0.336
Neg logit contributions
 top-0.307
 many-0.287
 good-0.283
 something-0.278
 new-0.277
Token thanked
Token ID26280
Feature activation+7.78e-02

Pos logit contributions
o+0.155
jo+0.151
ppy+0.144
os+0.144
azz+0.141
Neg logit contributions
 anything-0.105
 top-0.101
 new-0.100
 something-0.100
 most-0.099
Token her
Token ID607
Feature activation+1.39e-02

Pos logit contributions
 top+0.090
 anything+0.087
 chores+0.085
 biggest+0.084
 accomplishment+0.083
Neg logit contributions
o-0.247
jo-0.232
os-0.229
ppy-0.229
y-0.229
Token for
Token ID329
Feature activation+6.70e-02

Pos logit contributions
 top+0.022
 anything+0.022
 chores+0.022
 biggest+0.022
 accomplishment+0.021
Neg logit contributions
o-0.037
jo-0.035
y-0.034
ppy-0.034
os-0.034
Token being
Token ID852
Feature activation+8.98e-02

Pos logit contributions
 top+0.089
 bottom+0.085
 biggest+0.085
 chores+0.080
 progress+0.076
Neg logit contributions
o-0.198
jo-0.179
ppy-0.179
os-0.177
y-0.176
Token such
Token ID884
Feature activation+1.62e-01

Pos logit contributions
 top+0.108
 chores+0.103
 bottom+0.102
 accomplishment+0.099
 anything+0.093
Neg logit contributions
o-0.291
y-0.267
ppy-0.262
jo-0.262
os-0.261
Token a
Token ID257
Feature activation+7.76e-02

Pos logit contributions
 bottom+0.192
 top+0.183
 avail+0.178
 chores+0.166
 accomplishment+0.165
Neg logit contributions
o-0.491
y-0.450
ppy-0.443
jo-0.424
os-0.411
Token good
Token ID922
Feature activation+6.80e-02

Pos logit contributions
 anything+0.084
 top+0.082
 chores+0.081
 bottom+0.075
 biggest+0.075
Neg logit contributions
o-0.246
y-0.231
jo-0.229
ppy-0.222
os-0.216
Token leader
Token ID3554
Feature activation-3.16e-02

Pos logit contributions
 top+0.130
 anything+0.127
 chores+0.126
 biggest+0.124
 accomplishment+0.123
Neg logit contributions
o-0.164
y-0.151
jo-0.150
ppy-0.148
os-0.146
Token.
Token ID13
Feature activation-4.60e-02

Pos logit contributions
o+0.079
jo+0.073
ppy+0.072
os+0.072
y+0.070
Neg logit contributions
 anything-0.057
 top-0.057
 chores-0.055
 biggest-0.055
 new-0.055
Token Lily
Token ID20037
Feature activation-1.17e-01

Pos logit contributions
o+0.120
ppy+0.120
y+0.117
jo+0.115
os+0.114
Neg logit contributions
 top-0.072
 new-0.066
 anything-0.065
 many-0.064
 biggest-0.063
Token to
Token ID284
Feature activation+7.66e-02

Pos logit contributions
 top+0.144
 anything+0.141
 chores+0.140
 biggest+0.139
 accomplishment+0.138
Neg logit contributions
o-0.176
jo-0.161
os-0.159
ppy-0.159
y-0.158
Token be
Token ID307
Feature activation+8.83e-02

Pos logit contributions
 biggest+0.125
 top+0.120
 chores+0.119
 anything+0.118
 trouble+0.118
Neg logit contributions
o-0.194
ppy-0.185
jo-0.182
y-0.181
os-0.178
Token able
Token ID1498
Feature activation+9.14e-02

Pos logit contributions
 top+0.159
 chores+0.155
 accomplishment+0.151
 bottom+0.150
 biggest+0.148
Neg logit contributions
o-0.231
y-0.209
os-0.208
ppy-0.206
jo-0.205
Token to
Token ID284
Feature activation+7.65e-02

Pos logit contributions
 top+0.207
 chores+0.201
 bottom+0.198
 anything+0.197
 biggest+0.197
Neg logit contributions
o-0.195
y-0.173
os-0.168
jo-0.166
ppy-0.162
Token create
Token ID2251
Feature activation+9.21e-02

Pos logit contributions
 biggest+0.129
 anything+0.123
 top+0.122
 chores+0.122
 trouble+0.121
Neg logit contributions
o-0.189
ppy-0.179
y-0.176
os-0.174
jo-0.174
Token such
Token ID884
Feature activation+1.62e-01

Pos logit contributions
 bottom+0.108
wered+0.106
 End+0.102
 top+0.102
 accomplishment+0.100
Neg logit contributions
o-0.254
y-0.248
jo-0.248
os-0.243
ppy-0.233
Token a
Token ID257
Feature activation+9.30e-02

Pos logit contributions
 bottom+0.198
 middle+0.179
 top+0.177
 avail+0.175
 chores+0.172
Neg logit contributions
o-0.421
y-0.410
ppy-0.399
jo-0.385
lets-0.375
Token special
Token ID2041
Feature activation+7.60e-02

Pos logit contributions
 chores+0.073
 anything+0.072
 top+0.070
 accomplishment+0.063
 biggest+0.063
Neg logit contributions
o-0.320
y-0.302
jo-0.301
ppy-0.294
os-0.288
Token place
Token ID1295
Feature activation+8.46e-03

Pos logit contributions
 top+0.095
 anything+0.094
 chores+0.091
 biggest+0.090
 accomplishment+0.087
Neg logit contributions
o-0.232
jo-0.219
y-0.217
ppy-0.212
os-0.211
Token.
Token ID13
Feature activation-4.57e-02

Pos logit contributions
 top+0.017
 chores+0.016
 biggest+0.016
 accomplishment+0.016
 anything+0.016
Neg logit contributions
o-0.020
y-0.018
jo-0.018
os-0.018
ppy-0.017
Token<|endoftext|>
Token ID50256
Feature activation-5.55e-02

Pos logit contributions
ppy+0.125
y+0.124
o+0.121
jo+0.120
os+0.116
Neg logit contributions
 top-0.068
 new-0.062
 chores-0.062
 anything-0.062
 many-0.061
Token being
Token ID852
Feature activation+9.29e-02

Pos logit contributions
 biggest+0.090
 top+0.085
 chores+0.082
 bottom+0.082
 accomplishment+0.079
Neg logit contributions
o-0.203
ppy-0.190
y-0.184
jo-0.183
os-0.182
Token persistent
Token ID16218
Feature activation+5.74e-02

Pos logit contributions
 top+0.138
 chores+0.132
 bottom+0.129
 accomplishment+0.127
 biggest+0.123
Neg logit contributions
o-0.274
y-0.253
ppy-0.249
os-0.247
jo-0.246
Token enough
Token ID1576
Feature activation+8.83e-02

Pos logit contributions
 top+0.107
 anything+0.106
 chores+0.103
 biggest+0.102
 accomplishment+0.102
Neg logit contributions
o-0.141
jo-0.130
ppy-0.128
os-0.128
y-0.125
Token to
Token ID284
Feature activation+7.79e-02

Pos logit contributions
 top+0.183
 anything+0.177
 biggest+0.176
 bottom+0.174
 chores+0.172
Neg logit contributions
o-0.199
y-0.180
jo-0.177
ppy-0.176
os-0.175
Token find
Token ID1064
Feature activation+8.28e-02

Pos logit contributions
 biggest+0.134
 top+0.128
 anything+0.124
 accomplishment+0.124
 chores+0.123
Neg logit contributions
o-0.196
ppy-0.179
y-0.176
os-0.173
jo-0.173
Token such
Token ID884
Feature activation+1.61e-01

Pos logit contributions
 top+0.088
 bottom+0.085
 accomplishment+0.085
 chores+0.085
 biggest+0.080
Neg logit contributions
o-0.255
y-0.239
os-0.238
jo-0.238
ppy-0.237
Token a
Token ID257
Feature activation+8.91e-02

Pos logit contributions
 bottom+0.186
 avail+0.180
 top+0.178
 chores+0.165
 middle+0.162
Neg logit contributions
o-0.460
y-0.438
ppy-0.427
jo-0.401
bug-0.395
Token special
Token ID2041
Feature activation+7.35e-02

Pos logit contributions
 anything+0.084
 chores+0.082
 top+0.081
wered+0.079
 biggest+0.075
Neg logit contributions
o-0.286
y-0.270
jo-0.263
ppy-0.258
os-0.248
Tokenplace
Token ID5372
Feature activation+3.34e-02

Pos logit contributions
 top+0.081
 anything+0.080
 chores+0.077
 biggest+0.077
 bottom+0.073
Neg logit contributions
o-0.229
y-0.217
jo-0.214
ppy-0.206
os-0.206
Token.
Token ID13
Feature activation-4.55e-02

Pos logit contributions
 top+0.045
 bottom+0.042
 anything+0.041
 biggest+0.041
 accomplishment+0.041
Neg logit contributions
o-0.100
y-0.093
jo-0.093
os-0.092
ppy-0.092
Token
Token ID220
Feature activation-2.09e-02

Pos logit contributions
o+0.120
y+0.118
ppy+0.118
os+0.113
jo+0.113
Neg logit contributions
 top-0.073
 anything-0.068
 new-0.067
 biggest-0.066
 something-0.065
TokenI
Token ID40
Feature activation+5.26e-03

Pos logit contributions
o+0.034
jo+0.032
ppy+0.031
os+0.031
y+0.030
Neg logit contributions
 top-0.024
 anything-0.024
 biggest-0.023
 most-0.023
 accomplishment-0.023
Token'm
Token ID1101
Feature activation+6.45e-02

Pos logit contributions
 great+0.010
 top+0.010
 anything+0.010
 chores+0.010
 accomplishment+0.010
Neg logit contributions
o-0.012
jo-0.012
ppy-0.011
os-0.011
y-0.011
Token glad
Token ID9675
Feature activation+4.03e-02

Pos logit contributions
 top+0.105
 tasks+0.103
 chores+0.102
 bottom+0.101
 avail+0.096
Neg logit contributions
o-0.177
os-0.151
ppy-0.150
jo-0.150
y-0.148
Token you
Token ID345
Feature activation-1.33e-02

Pos logit contributions
 top+0.067
 biggest+0.065
 chores+0.064
 accomplishment+0.064
 bottom+0.064
Neg logit contributions
o-0.106
y-0.097
jo-0.097
ppy-0.095
os-0.095
Token are
Token ID389
Feature activation+7.87e-02

Pos logit contributions
o+0.029
jo+0.028
os+0.026
y+0.026
ppy+0.026
Neg logit contributions
 anything-0.028
 top-0.028
 biggest-0.027
 great-0.027
 chores-0.026
Token such
Token ID884
Feature activation+1.61e-01

Pos logit contributions
 chores+0.113
 top+0.113
 bottom+0.105
 tasks+0.105
 accomplishment+0.104
Neg logit contributions
o-0.226
y-0.208
ppy-0.203
jo-0.200
os-0.198
Token a
Token ID257
Feature activation+8.80e-02

Pos logit contributions
 bottom+0.255
 top+0.228
 middle+0.224
 avail+0.214
 chores+0.208
Neg logit contributions
o-0.395
ppy-0.370
y-0.352
bug-0.331
jo-0.309
Token competitive
Token ID7606
Feature activation+4.63e-02

Pos logit contributions
 anything+0.074
 chores+0.074
 top+0.072
 bottom+0.064
 accomplishment+0.064
Neg logit contributions
o-0.302
y-0.283
jo-0.280
ppy-0.276
os-0.270
Token kid
Token ID5141
Feature activation-1.06e-01

Pos logit contributions
 top+0.075
 anything+0.074
 chores+0.073
 biggest+0.072
 accomplishment+0.071
Neg logit contributions
o-0.123
y-0.116
jo-0.115
ppy-0.112
os-0.111
Token,
Token ID11
Feature activation+8.24e-03

Pos logit contributions
o+0.283
jo+0.272
ppy+0.267
os+0.265
azz+0.258
Neg logit contributions
 anything-0.171
 top-0.160
 new-0.160
 biggest-0.159
 most-0.157
Token Jimmy
Token ID12963
Feature activation-1.11e-01

Pos logit contributions
 top+0.015
 anything+0.014
 chores+0.014
 biggest+0.014
 accomplishment+0.014
Neg logit contributions
o-0.021
jo-0.019
y-0.019
ppy-0.019
os-0.019
Token the
Token ID262
Feature activation+6.65e-02

Pos logit contributions
 biggest+0.081
 chores+0.075
 accomplishment+0.074
 top+0.069
 progress+0.069
Neg logit contributions
o-0.212
y-0.206
jo-0.195
os-0.194
ppy-0.184
Token sky
Token ID6766
Feature activation-5.21e-02

Pos logit contributions
 anything+0.082
 accomplishment+0.080
 chores+0.079
 top+0.079
 biggest+0.076
Neg logit contributions
o-0.200
jo-0.188
y-0.187
os-0.185
ppy-0.184
Token -
Token ID532
Feature activation+1.03e-02

Pos logit contributions
o+0.129
jo+0.120
ppy+0.119
os+0.117
y+0.114
Neg logit contributions
 top-0.097
 anything-0.094
 biggest-0.091
 new-0.091
 accomplishment-0.091
Token it
Token ID340
Feature activation-4.45e-02

Pos logit contributions
 top+0.016
 biggest+0.015
 chores+0.015
 anything+0.015
 accomplishment+0.015
Neg logit contributions
o-0.028
os-0.025
y-0.025
jo-0.025
ppy-0.024
Token was
Token ID373
Feature activation+7.49e-02

Pos logit contributions
o+0.096
jo+0.088
ppy+0.087
os+0.085
y+0.083
Neg logit contributions
 top-0.097
 anything-0.094
 biggest-0.093
 new-0.093
 most-0.092
Token such
Token ID884
Feature activation+1.61e-01

Pos logit contributions
 chores+0.106
 tasks+0.103
 progress+0.098
 top+0.097
 accomplishment+0.096
Neg logit contributions
o-0.217
y-0.200
jo-0.182
os-0.182
ppy-0.176
Token a
Token ID257
Feature activation+8.57e-02

Pos logit contributions
 avail+0.260
 bottom+0.258
 top+0.241
 chores+0.237
 accomplishment+0.226
Neg logit contributions
o-0.374
y-0.350
ppy-0.317
bug-0.291
jo-0.287
Token surprise
Token ID5975
Feature activation+5.24e-03

Pos logit contributions
 anything+0.075
 chores+0.073
 top+0.065
 yourselves+0.060
 biggest+0.060
Neg logit contributions
o-0.289
y-0.269
jo-0.268
ppy-0.254
os-0.252
Token!
Token ID0
Feature activation-5.07e-02

Pos logit contributions
 top+0.010
 anything+0.010
 accomplishment+0.009
 chores+0.009
 biggest+0.009
Neg logit contributions
o-0.013
jo-0.012
ppy-0.012
os-0.012
y-0.012
Token They
Token ID1119
Feature activation-2.62e-02

Pos logit contributions
ppy+0.139
o+0.137
y+0.137
jo+0.132
os+0.128
Neg logit contributions
 top-0.077
 anything-0.070
 new-0.069
 trouble-0.068
 biggest-0.068
Token opened
Token ID4721
Feature activation+6.69e-02

Pos logit contributions
o+0.064
jo+0.063
ppy+0.058
azz+0.058
y+0.057
Neg logit contributions
 top-0.047
 many-0.045
 great-0.044
 accomplishment-0.044
 new-0.043
Token felt
Token ID2936
Feature activation+8.05e-02

Pos logit contributions
o+0.269
jo+0.252
os+0.248
ppy+0.244
azz+0.240
Neg logit contributions
 top-0.284
 new-0.268
 most-0.264
 biggest-0.262
 great-0.262
Token lucky
Token ID9670
Feature activation+3.38e-02

Pos logit contributions
 avail+0.170
 tasks+0.170
 chores+0.169
 purpose+0.153
 top+0.153
Neg logit contributions
o-0.164
y-0.148
ppy-0.142
bo-0.134
bug-0.130
Token to
Token ID284
Feature activation+7.99e-02

Pos logit contributions
 top+0.068
 chores+0.066
 anything+0.066
 biggest+0.065
 accomplishment+0.065
Neg logit contributions
o-0.079
jo-0.072
y-0.071
os-0.071
ppy-0.070
Token have
Token ID423
Feature activation+7.93e-02

Pos logit contributions
 biggest+0.157
 chores+0.155
 top+0.152
 progress+0.152
 accomplishment+0.151
Neg logit contributions
o-0.178
ppy-0.161
jo-0.160
os-0.157
y-0.154
Token found
Token ID1043
Feature activation+6.85e-02

Pos logit contributions
 top+0.144
 chores+0.139
 accomplishment+0.139
 biggest+0.134
 bottom+0.134
Neg logit contributions
o-0.207
jo-0.185
y-0.185
os-0.185
ppy-0.184
Token such
Token ID884
Feature activation+1.61e-01

Pos logit contributions
 accomplishment+0.073
 bottom+0.069
 top+0.069
 chores+0.068
 End+0.062
Neg logit contributions
o-0.219
os-0.204
y-0.202
jo-0.194
ppy-0.191
Token a
Token ID257
Feature activation+8.60e-02

Pos logit contributions
 bottom+0.237
 accomplishment+0.216
 avail+0.213
 sidelines+0.211
 top+0.211
Neg logit contributions
o-0.403
y-0.378
ppy-0.364
bug-0.352
lets-0.339
Token wonderful
Token ID7932
Feature activation+7.74e-02

Pos logit contributions
 anything+0.077
 chores+0.075
 top+0.071
 accomplishment+0.067
 bottom+0.063
Neg logit contributions
o-0.297
y-0.272
jo-0.272
os-0.264
ppy-0.263
Token place
Token ID1295
Feature activation-5.72e-03

Pos logit contributions
 anything+0.084
 chores+0.082
 top+0.080
 biggest+0.077
 bottom+0.073
Neg logit contributions
o-0.249
y-0.233
jo-0.231
os-0.223
ppy-0.216
Token to
Token ID284
Feature activation+7.49e-02

Pos logit contributions
o+0.014
jo+0.013
y+0.012
os+0.012
ppy+0.012
Neg logit contributions
 top-0.011
 chores-0.011
 anything-0.011
 biggest-0.011
 accomplishment-0.011
Token relax
Token ID8960
Feature activation+2.91e-02

Pos logit contributions
 biggest+0.129
 top+0.125
 chores+0.124
 accomplishment+0.123
 anything+0.122
Neg logit contributions
o-0.186
os-0.168
jo-0.167
ppy-0.167
y-0.162
Token says
Token ID1139
Feature activation-1.47e-02

Pos logit contributions
o+0.069
jo+0.063
ppy+0.062
y+0.061
os+0.061
Neg logit contributions
 top-0.089
 anything-0.088
 chores-0.087
 biggest-0.087
 new-0.087
Token,
Token ID11
Feature activation-3.01e-02

Pos logit contributions
o+0.041
jo+0.038
ppy+0.038
y+0.037
os+0.037
Neg logit contributions
 top-0.023
 anything-0.023
 chores-0.022
 new-0.022
 biggest-0.022
Token "
Token ID366
Feature activation-1.21e-02

Pos logit contributions
o+0.091
jo+0.090
y+0.088
ppy+0.088
os+0.087
Neg logit contributions
 top-0.038
 new-0.037
 anything-0.036
 something-0.035
 good-0.034
TokenYou
Token ID1639
Feature activation-1.06e-02

Pos logit contributions
jo+0.034
o+0.034
ppy+0.033
os+0.033
anna+0.032
Neg logit contributions
 anything-0.016
 most-0.016
 top-0.015
 peace-0.015
 middle-0.015
Token are
Token ID389
Feature activation+7.15e-02

Pos logit contributions
o+0.026
jo+0.024
os+0.024
ppy+0.024
y+0.024
Neg logit contributions
 top-0.020
 anything-0.019
 chores-0.019
 biggest-0.019
 accomplishment-0.019
Token such
Token ID884
Feature activation+1.61e-01

Pos logit contributions
 accomplishment+0.099
 top+0.098
 bottom+0.094
 avail+0.091
 progress+0.091
Neg logit contributions
o-0.220
ppy-0.193
bug-0.178
y-0.178
os-0.176
Token good
Token ID922
Feature activation+7.28e-02

Pos logit contributions
 bottom+0.202
 top+0.189
 accomplishment+0.183
 avail+0.175
 middle+0.168
Neg logit contributions
o-0.477
ppy-0.445
y-0.414
bug-0.384
bo-0.381
Token twins
Token ID20345
Feature activation-6.43e-02

Pos logit contributions
 top+0.079
 anything+0.077
 chores+0.076
 biggest+0.075
 accomplishment+0.073
Neg logit contributions
o-0.236
jo-0.225
ppy-0.222
os-0.222
y-0.218
Token.
Token ID13
Feature activation-4.03e-02

Pos logit contributions
o+0.168
jo+0.160
os+0.155
ppy+0.153
y+0.151
Neg logit contributions
 anything-0.109
 top-0.106
 new-0.104
 chores-0.103
 biggest-0.102
Token You
Token ID921
Feature activation-2.07e-02

Pos logit contributions
o+0.100
jo+0.095
y+0.093
ppy+0.093
os+0.092
Neg logit contributions
 anything-0.072
 top-0.071
 biggest-0.069
 chores-0.068
 trouble-0.067
Token are
Token ID389
Feature activation+6.69e-02

Pos logit contributions
o+0.048
jo+0.044
os+0.043
y+0.043
ppy+0.043
Neg logit contributions
 top-0.042
 anything-0.041
 chores-0.041
 biggest-0.040
 accomplishment-0.040
Token.
Token ID13
Feature activation-4.79e-02

Pos logit contributions
 top+0.116
 chores+0.111
 biggest+0.111
 bottom+0.111
 anything+0.110
Neg logit contributions
o-0.107
y-0.103
jo-0.098
os-0.096
ppy-0.095
Token He
Token ID679
Feature activation-2.99e-02

Pos logit contributions
o+0.123
ppy+0.115
jo+0.114
y+0.113
os+0.112
Neg logit contributions
 top-0.085
 anything-0.082
 biggest-0.081
 accomplishment-0.080
 chores-0.080
Token thought
Token ID1807
Feature activation+3.12e-03

Pos logit contributions
o+0.063
ppy+0.059
jo+0.058
os+0.056
y+0.056
Neg logit contributions
 top-0.066
 accomplishment-0.063
 new-0.063
 most-0.062
 great-0.062
Token it
Token ID340
Feature activation-3.75e-02

Pos logit contributions
 top+0.005
 biggest+0.005
 bottom+0.005
 middle+0.005
 chores+0.005
Neg logit contributions
o-0.008
jo-0.008
os-0.008
y-0.008
ppy-0.008
Token was
Token ID373
Feature activation+8.38e-02

Pos logit contributions
o+0.081
ppy+0.075
jo+0.073
os+0.071
azz+0.069
Neg logit contributions
 top-0.083
 anything-0.081
 biggest-0.080
 most-0.080
 new-0.080
Token such
Token ID884
Feature activation+1.61e-01

Pos logit contributions
 top+0.103
 bottom+0.102
 middle+0.100
 accomplishment+0.100
 chores+0.099
Neg logit contributions
o-0.242
y-0.225
ppy-0.211
jo-0.211
os-0.211
Token a
Token ID257
Feature activation+8.82e-02

Pos logit contributions
 bottom+0.233
 avail+0.228
 top+0.208
 middle+0.206
 yourselves+0.202
Neg logit contributions
o-0.385
y-0.369
ppy-0.360
ah-0.327
ny-0.325
Token great
Token ID1049
Feature activation+8.07e-02

Pos logit contributions
 chores+0.073
 anything+0.071
 yourselves+0.070
wered+0.066
 avail+0.064
Neg logit contributions
o-0.284
y-0.267
jo-0.263
ah-0.254
ppy-0.253
Token lesson
Token ID11483
Feature activation+1.25e-03

Pos logit contributions
 yourselves+0.087
 anything+0.084
 chores+0.081
 middle+0.077
 biggest+0.077
Neg logit contributions
o-0.239
y-0.234
jo-0.228
ah-0.219
os-0.214
Token,
Token ID11
Feature activation+1.43e-02

Pos logit contributions
 top+0.002
 biggest+0.002
 chores+0.002
 anything+0.002
 bottom+0.002
Neg logit contributions
o-0.003
jo-0.003
y-0.003
os-0.003
ppy-0.003
Token and
Token ID290
Feature activation+7.94e-03

Pos logit contributions
 top+0.025
 chores+0.024
 bottom+0.024
 biggest+0.024
 anything+0.023
Neg logit contributions
o-0.036
jo-0.034
y-0.033
os-0.033
ppy-0.032
Token excited
Token ID6568
Feature activation+3.20e-02

Pos logit contributions
 chores+0.017
 top+0.016
 anything+0.016
 bottom+0.015
 tasks+0.015
Neg logit contributions
o-0.036
jo-0.033
y-0.033
os-0.033
ppy-0.032
Token.
Token ID13
Feature activation-4.91e-02

Pos logit contributions
 top+0.060
 anything+0.059
 chores+0.058
 biggest+0.058
 accomplishment+0.058
Neg logit contributions
o-0.079
jo-0.072
os-0.071
ppy-0.071
y-0.071
Token It
Token ID632
Feature activation-4.38e-02

Pos logit contributions
o+0.120
ppy+0.111
jo+0.110
y+0.109
os+0.109
Neg logit contributions
 top-0.094
 anything-0.091
 chores-0.090
 biggest-0.089
 accomplishment-0.089
Token had
Token ID550
Feature activation+6.79e-02

Pos logit contributions
o+0.096
ppy+0.089
os+0.088
jo+0.088
y+0.085
Neg logit contributions
 top-0.094
 new-0.091
 most-0.090
 great-0.089
 biggest-0.089
Token been
Token ID587
Feature activation+8.65e-02

Pos logit contributions
 chores+0.135
 top+0.133
 accomplishment+0.132
 anything+0.128
 biggest+0.128
Neg logit contributions
o-0.158
jo-0.143
y-0.143
os-0.141
ppy-0.139
Token such
Token ID884
Feature activation+1.61e-01

Pos logit contributions
 chores+0.126
 top+0.125
 accomplishment+0.119
 bottom+0.115
 progress+0.113
Neg logit contributions
o-0.259
y-0.235
os-0.230
jo-0.226
ppy-0.226
Token a
Token ID257
Feature activation+8.70e-02

Pos logit contributions
 bottom+0.265
 site+0.246
 avail+0.242
 top+0.239
 chores+0.232
Neg logit contributions
o-0.354
y-0.344
ppy-0.334
bug-0.322
ety-0.298
Token wonderful
Token ID7932
Feature activation+7.68e-02

Pos logit contributions
 anything+0.079
 chores+0.078
 top+0.070
 yourselves+0.069
 tasks+0.065
Neg logit contributions
o-0.289
y-0.269
jo-0.266
ppy-0.255
ah-0.251
Token day
Token ID1110
Feature activation+2.17e-02

Pos logit contributions
 anything+0.096
 chores+0.092
 top+0.091
 biggest+0.090
 bottom+0.082
Neg logit contributions
o-0.234
jo-0.221
y-0.221
os-0.211
ppy-0.208
Token!
Token ID0
Feature activation-4.74e-02

Pos logit contributions
 top+0.040
 anything+0.039
 chores+0.038
 biggest+0.038
 accomplishment+0.038
Neg logit contributions
o-0.054
jo-0.050
ppy-0.049
os-0.049
y-0.049
Token<|endoftext|>
Token ID50256
Feature activation-5.78e-02

Pos logit contributions
o+0.123
ppy+0.116
y+0.114
jo+0.113
os+0.113
Neg logit contributions
 top-0.083
 anything-0.080
 chores-0.078
 biggest-0.078
 new-0.078
Token was
Token ID373
Feature activation+6.56e-02

Pos logit contributions
o+0.125
jo+0.116
os+0.114
ppy+0.113
y+0.110
Neg logit contributions
 top-0.116
 new-0.112
 accomplishment-0.111
 anything-0.110
 most-0.110
Token happy
Token ID3772
Feature activation+3.77e-02

Pos logit contributions
 top+0.100
 tasks+0.099
 chores+0.098
 progress+0.097
 biggest+0.095
Neg logit contributions
o-0.183
y-0.168
ppy-0.161
jo-0.154
ety-0.150
Token to
Token ID284
Feature activation+7.85e-02

Pos logit contributions
 top+0.072
 anything+0.070
 chores+0.069
 biggest+0.069
 accomplishment+0.068
Neg logit contributions
o-0.092
jo-0.085
ppy-0.083
os-0.083
y-0.083
Token have
Token ID423
Feature activation+7.61e-02

Pos logit contributions
 biggest+0.144
 top+0.141
 chores+0.141
 anything+0.140
 accomplishment+0.139
Neg logit contributions
o-0.189
ppy-0.175
jo-0.173
y-0.170
os-0.168
Token had
Token ID550
Feature activation+7.96e-02

Pos logit contributions
 top+0.098
 accomplishment+0.093
 chores+0.093
 bottom+0.091
 biggest+0.090
Neg logit contributions
o-0.232
y-0.214
jo-0.213
ppy-0.213
os-0.212
Token such
Token ID884
Feature activation+1.61e-01

Pos logit contributions
 top+0.087
 bottom+0.082
 accomplishment+0.081
 chores+0.079
 biggest+0.076
Neg logit contributions
o-0.260
y-0.238
ppy-0.238
jo-0.236
os-0.236
Token a
Token ID257
Feature activation+9.04e-02

Pos logit contributions
 bottom+0.253
 sidelines+0.242
 site+0.240
 accomplishment+0.225
 top+0.219
Neg logit contributions
o-0.355
ppy-0.338
y-0.334
ety-0.306
bug-0.304
Token fun
Token ID1257
Feature activation+2.95e-02

Pos logit contributions
 anything+0.076
 top+0.075
 chores+0.072
 biggest+0.068
 accomplishment+0.067
Neg logit contributions
o-0.313
y-0.293
jo-0.292
ppy-0.288
os-0.285
Token time
Token ID640
Feature activation+2.08e-02

Pos logit contributions
 top+0.034
 new+0.033
 anything+0.032
 chores+0.032
 accomplishment+0.032
Neg logit contributions
o-0.095
ppy-0.091
os-0.090
jo-0.089
y-0.085
Token.
Token ID13
Feature activation-4.93e-02

Pos logit contributions
 top+0.038
 anything+0.038
 chores+0.036
 biggest+0.036
 accomplishment+0.036
Neg logit contributions
o-0.052
jo-0.048
ppy-0.048
os-0.047
y-0.047
Token It
Token ID632
Feature activation-4.14e-02

Pos logit contributions
o+0.136
ppy+0.130
jo+0.129
y+0.127
os+0.126
Neg logit contributions
 top-0.077
 new-0.075
 many-0.073
 biggest-0.073
 anything-0.072
Token too
Token ID1165
Feature activation+2.42e-02

Pos logit contributions
o+0.144
jo+0.132
os+0.132
y+0.131
ppy+0.128
Neg logit contributions
 top-0.100
 accomplishment-0.096
 anything-0.095
 biggest-0.095
 chores-0.095
Token.
Token ID13
Feature activation-4.51e-02

Pos logit contributions
 top+0.051
 chores+0.051
 anything+0.051
 biggest+0.050
 accomplishment+0.050
Neg logit contributions
o-0.052
y-0.048
jo-0.047
os-0.046
ppy-0.045
Token It
Token ID632
Feature activation-3.95e-02

Pos logit contributions
ppy+0.122
o+0.118
y+0.116
jo+0.113
os+0.107
Neg logit contributions
 top-0.075
 biggest-0.066
 anything-0.066
 new-0.065
 something-0.064
Token had
Token ID550
Feature activation+6.27e-02

Pos logit contributions
o+0.088
ppy+0.082
jo+0.081
os+0.079
azz+0.077
Neg logit contributions
 top-0.082
 biggest-0.081
 new-0.080
 most-0.080
 anything-0.079
Token been
Token ID587
Feature activation+8.45e-02

Pos logit contributions
 accomplishment+0.089
 chores+0.089
 top+0.086
 bottom+0.081
 anything+0.081
Neg logit contributions
o-0.177
y-0.164
jo-0.164
os-0.161
ppy-0.158
Token such
Token ID884
Feature activation+1.61e-01

Pos logit contributions
 top+0.110
 chores+0.106
 accomplishment+0.104
 bottom+0.104
 progress+0.101
Neg logit contributions
o-0.265
y-0.239
os-0.235
ppy-0.232
jo-0.232
Token a
Token ID257
Feature activation+8.59e-02

Pos logit contributions
 bottom+0.255
 avail+0.235
 site+0.229
 top+0.226
 chores+0.215
Neg logit contributions
o-0.360
y-0.353
ppy-0.336
bug-0.306
ah-0.299
Token lovely
Token ID14081
Feature activation+6.91e-02

Pos logit contributions
 anything+0.082
 chores+0.077
 yourselves+0.073
wered+0.068
 top+0.067
Neg logit contributions
o-0.277
y-0.258
jo-0.253
ah-0.243
ppy-0.240
Token adventure
Token ID8855
Feature activation-1.03e-02

Pos logit contributions
 anything+0.094
 chores+0.089
 top+0.084
 biggest+0.083
 yourselves+0.081
Neg logit contributions
o-0.202
y-0.194
jo-0.191
os-0.180
ah-0.175
Token!
Token ID0
Feature activation-4.93e-02

Pos logit contributions
o+0.025
jo+0.023
ppy+0.023
os+0.023
y+0.023
Neg logit contributions
 top-0.020
 anything-0.019
 chores-0.019
 biggest-0.019
 accomplishment-0.019
Token<|endoftext|>
Token ID50256
Feature activation-5.88e-02

Pos logit contributions
ppy+0.134
o+0.131
y+0.128
jo+0.125
os+0.120
Neg logit contributions
 top-0.079
 biggest-0.072
 anything-0.071
 new-0.070
 middle-0.070
Token.
Token ID13
Feature activation-5.60e-02

Pos logit contributions
o+0.066
jo+0.063
ppy+0.061
os+0.061
y+0.060
Neg logit contributions
 top-0.040
 anything-0.039
 new-0.039
 accomplishment-0.039
 chores-0.037
Token
Token ID198
Feature activation-1.86e-02

Pos logit contributions
o+0.151
jo+0.144
ppy+0.143
os+0.141
y+0.141
Neg logit contributions
 top-0.092
 anything-0.087
 new-0.087
 biggest-0.084
 something-0.083
Token
Token ID198
Feature activation-1.77e-02

Pos logit contributions
o+0.048
jo+0.045
ppy+0.044
os+0.044
y+0.044
Neg logit contributions
 top-0.033
 anything-0.031
 new-0.031
 chores-0.031
 biggest-0.031
TokenThe
Token ID464
Feature activation+3.33e-02

Pos logit contributions
o+0.043
jo+0.040
os+0.039
ppy+0.039
y+0.039
Neg logit contributions
 top-0.034
 anything-0.033
 chores-0.033
 biggest-0.032
 accomplishment-0.032
Token great
Token ID1049
Feature activation+2.44e-02

Pos logit contributions
 anything+0.073
 chores+0.071
 accomplishment+0.070
 top+0.070
 biggest+0.069
Neg logit contributions
o-0.069
y-0.064
jo-0.061
os-0.061
ppy-0.059
Token Zen
Token ID14760
Feature activation-1.96e-01

Pos logit contributions
 anything+0.044
 top+0.044
 biggest+0.043
 chores+0.043
 accomplishment+0.042
Neg logit contributions
o-0.060
y-0.057
jo-0.055
os-0.055
ppy-0.051
Tokeno
Token ID78
Feature activation-1.20e-01

Pos logit contributions
ppy+0.191
jo+0.181
aa+0.158
anna+0.156
ety+0.152
Neg logit contributions
 new-0.561
 something-0.552
 anything-0.552
 most-0.543
 best-0.535
Token hugs
Token ID40657
Feature activation-3.35e-04

Pos logit contributions
jo+0.304
o+0.296
ppy+0.280
azz+0.269
os+0.268
Neg logit contributions
 top-0.214
 new-0.200
 accomplishment-0.191
 good-0.191
 something-0.188
Token Ben
Token ID3932
Feature activation-1.14e-01

Pos logit contributions
o+0.001
jo+0.001
os+0.001
ppy+0.001
y+0.001
Neg logit contributions
 top-0.000
 anything-0.000
 new-0.000
 chores-0.000
 biggest-0.000
Token and
Token ID290
Feature activation-7.94e-03

Pos logit contributions
o+0.302
jo+0.298
os+0.291
azz+0.281
ppy+0.279
Neg logit contributions
 new-0.179
 top-0.176
 good-0.168
 something-0.168
 anything-0.167
Token says
Token ID1139
Feature activation-2.30e-03

Pos logit contributions
o+0.018
jo+0.016
ppy+0.016
os+0.016
y+0.016
Neg logit contributions
 top-0.017
 anything-0.017
 chores-0.016
 biggest-0.016
 accomplishment-0.016
TokenOne
Token ID3198
Feature activation+1.97e-02

Pos logit contributions
o+0.141
ppy+0.140
jo+0.138
os+0.132
y+0.132
Neg logit contributions
 top-0.084
 anything-0.078
 new-0.077
 bottom-0.075
 biggest-0.074
Token day
Token ID1110
Feature activation-1.55e-04

Pos logit contributions
 top+0.024
 anything+0.023
 chores+0.023
 biggest+0.023
 accomplishment+0.023
Neg logit contributions
o-0.061
jo-0.058
ppy-0.057
os-0.057
y-0.057
Token,
Token ID11
Feature activation+5.23e-03

Pos logit contributions
o+0.000
jo+0.000
os+0.000
ppy+0.000
y+0.000
Neg logit contributions
 top-0.000
 anything-0.000
 chores-0.000
 biggest-0.000
 accomplishment-0.000
Token little
Token ID1310
Feature activation-2.02e-02

Pos logit contributions
 chores+0.004
 accomplishment+0.004
 anything+0.004
 biggest+0.004
 top+0.004
Neg logit contributions
o-0.019
y-0.018
ppy-0.017
jo-0.017
os-0.017
Token Jo
Token ID5302
Feature activation-1.37e-01

Pos logit contributions
o+0.054
os+0.052
jo+0.052
ppy+0.050
y+0.047
Neg logit contributions
 top-0.033
 new-0.032
 anything-0.031
 trouble-0.031
 biggest-0.030
Tokenanne
Token ID21952
Feature activation-1.85e-01

Pos logit contributions
o+0.239
ppy+0.237
azz+0.232
lets+0.229
os+0.221
Neg logit contributions
 top-0.277
 the-0.275
 new-0.273
 amazing-0.270
 great-0.264
Token was
Token ID373
Feature activation+6.00e-02

Pos logit contributions
ppy+0.454
azz+0.449
o+0.437
jo+0.409
os+0.408
Neg logit contributions
 top-0.300
 bottom-0.274
 trouble-0.273
 life-0.272
 the-0.272
Token r
Token ID374
Feature activation+6.21e-03

Pos logit contributions
 avail+0.101
 accomplishment+0.100
 anything+0.097
 progress+0.096
 top+0.096
Neg logit contributions
o-0.163
y-0.142
ppy-0.138
os-0.132
jo-0.132
Tokenaking
Token ID868
Feature activation+1.50e-02

Pos logit contributions
 top+0.014
 biggest+0.014
 anything+0.014
 chores+0.014
 accomplishment+0.013
Neg logit contributions
o-0.012
jo-0.012
ppy-0.011
os-0.011
y-0.011
Token the
Token ID262
Feature activation+6.07e-02

Pos logit contributions
 accomplishment+0.028
 biggest+0.026
 trouble+0.026
 chores+0.026
 awesome+0.025
Neg logit contributions
y-0.037
o-0.035
jo-0.032
ppy-0.032
os-0.031
Token leaves
Token ID5667
Feature activation-6.65e-02

Pos logit contributions
 anything+0.080
 top+0.077
 accomplishment+0.076
 chores+0.075
 biggest+0.073
Neg logit contributions
o-0.182
jo-0.170
y-0.169
os-0.168
ppy-0.167
Token a
Token ID257
Feature activation+3.14e-02

Pos logit contributions
 anything+0.005
 top+0.005
 new+0.005
 something+0.005
 chores+0.005
Neg logit contributions
o-0.013
jo-0.013
os-0.013
ppy-0.013
y-0.013
Token small
Token ID1402
Feature activation+2.27e-02

Pos logit contributions
 anything+0.035
 top+0.035
 chores+0.033
 accomplishment+0.033
 biggest+0.033
Neg logit contributions
o-0.100
jo-0.093
y-0.093
ppy-0.092
os-0.091
Token bird
Token ID6512
Feature activation-1.27e-01

Pos logit contributions
 top+0.045
 new+0.043
 chores+0.042
 anything+0.042
 biggest+0.042
Neg logit contributions
o-0.054
jo-0.051
os-0.050
ppy-0.050
y-0.047
Token named
Token ID3706
Feature activation-3.02e-02

Pos logit contributions
jo+0.283
ppy+0.282
os+0.277
azz+0.272
o+0.261
Neg logit contributions
 top-0.226
 new-0.225
 something-0.221
 anything-0.216
 things-0.209
Token B
Token ID347
Feature activation-1.04e-01

Pos logit contributions
o+0.075
jo+0.074
y+0.071
os+0.071
ppy+0.071
Neg logit contributions
 anything-0.053
 top-0.052
 new-0.050
 biggest-0.049
 something-0.049
Tokenibi
Token ID27567
Feature activation-1.84e-01

Pos logit contributions
jo+0.109
anne+0.108
bug+0.107
os+0.105
ppy+0.098
Neg logit contributions
 top-0.253
 biggest-0.250
 new-0.249
 job-0.240
 original-0.235
Token,
Token ID11
Feature activation-1.85e-03

Pos logit contributions
jo+0.564
azz+0.530
o+0.520
os+0.513
anne+0.513
Neg logit contributions
 top-0.222
 new-0.222
 anything-0.219
 best-0.203
 something-0.203
Token fly
Token ID6129
Feature activation-1.65e-02

Pos logit contributions
jo+0.005
o+0.005
ppy+0.005
os+0.005
azz+0.004
Neg logit contributions
 something-0.003
 anything-0.003
 new-0.003
 trouble-0.003
 top-0.003
Token away
Token ID1497
Feature activation+3.63e-02

Pos logit contributions
o+0.044
jo+0.043
ppy+0.043
os+0.041
azz+0.041
Neg logit contributions
 top-0.025
 new-0.025
 anything-0.024
 something-0.024
 chores-0.022
Token from
Token ID422
Feature activation+6.56e-02

Pos logit contributions
 top+0.075
 chores+0.074
 biggest+0.074
 anything+0.073
 accomplishment+0.072
Neg logit contributions
o-0.082
y-0.076
os-0.074
jo-0.073
ppy-0.072
Token his
Token ID465
Feature activation+4.55e-02

Pos logit contributions
 biggest+0.074
 accomplishment+0.072
 progress+0.070
 chores+0.069
 top+0.064
Neg logit contributions
o-0.209
y-0.190
jo-0.189
os-0.184
ppy-0.181
Token gate
Token ID8946
Feature activation-9.13e-02

Pos logit contributions
 anything+0.081
 accomplishment+0.080
 chores+0.079
 top+0.079
 biggest+0.076
Neg logit contributions
o-0.184
jo-0.172
y-0.170
ppy-0.170
os-0.167
Token was
Token ID373
Feature activation+6.96e-02

Pos logit contributions
o+0.215
jo+0.198
ppy+0.196
os+0.193
y+0.193
Neg logit contributions
 top-0.182
 anything-0.177
 biggest-0.173
 chores-0.172
 accomplishment-0.171
Token broken
Token ID5445
Feature activation+4.78e-02

Pos logit contributions
 biggest+0.122
 progress+0.121
 chores+0.119
 accomplishment+0.118
 top+0.118
Neg logit contributions
o-0.176
y-0.155
ppy-0.152
jo-0.149
os-0.146
Token,"
Token ID553
Feature activation-1.93e-02

Pos logit contributions
 top+0.101
 chores+0.101
 accomplishment+0.100
 biggest+0.100
 anything+0.099
Neg logit contributions
o-0.103
y-0.097
jo-0.093
ppy-0.092
os-0.091
Token the
Token ID262
Feature activation+1.46e-02

Pos logit contributions
o+0.048
jo+0.044
ppy+0.044
os+0.044
y+0.044
Neg logit contributions
 top-0.036
 anything-0.035
 chores-0.035
 accomplishment-0.034
 biggest-0.034
Token woman
Token ID2415
Feature activation-1.84e-01

Pos logit contributions
 anything+0.025
 top+0.025
 chores+0.025
 biggest+0.024
 accomplishment+0.024
Neg logit contributions
o-0.038
jo-0.035
ppy-0.034
y-0.034
os-0.034
Token says
Token ID1139
Feature activation-2.93e-02

Pos logit contributions
jo+0.407
o+0.401
ppy+0.400
azz+0.388
aa+0.376
Neg logit contributions
 something-0.342
 top-0.336
 the-0.332
 new-0.329
 a-0.314
Token.
Token ID13
Feature activation-6.05e-02

Pos logit contributions
o+0.082
jo+0.080
ppy+0.077
os+0.076
azz+0.076
Neg logit contributions
 top-0.043
 anything-0.043
 new-0.042
 something-0.041
 accomplishment-0.041
Token
Token ID198
Feature activation-1.92e-02

Pos logit contributions
o+0.163
ppy+0.152
y+0.152
jo+0.151
os+0.151
Neg logit contributions
 top-0.103
 anything-0.097
 new-0.097
 trouble-0.094
 chores-0.092
Token
Token ID198
Feature activation-1.85e-02

Pos logit contributions
o+0.051
ppy+0.048
y+0.047
jo+0.047
os+0.047
Neg logit contributions
 top-0.032
 new-0.031
 anything-0.030
 chores-0.030
 trouble-0.030
Token"
Token ID1
Feature activation-2.14e-02

Pos logit contributions
o+0.038
os+0.034
jo+0.034
y+0.034
ppy+0.033
Neg logit contributions
 top-0.043
 chores-0.041
 anything-0.041
 new-0.041
 accomplishment-0.040
Token way
Token ID835
Feature activation+2.24e-02

Pos logit contributions
 anything+0.061
 top+0.057
 chores+0.056
 accomplishment+0.055
 yourselves+0.054
Neg logit contributions
o-0.158
jo-0.146
y-0.145
os-0.142
ppy-0.140
Token home
Token ID1363
Feature activation-2.46e-02

Pos logit contributions
 top+0.036
 anything+0.036
 chores+0.036
 accomplishment+0.035
 biggest+0.035
Neg logit contributions
o-0.061
jo-0.056
y-0.056
os-0.055
ppy-0.055
Token...
Token ID986
Feature activation-1.25e-03

Pos logit contributions
o+0.056
y+0.051
os+0.050
jo+0.050
ppy+0.050
Neg logit contributions
 top-0.050
 chores-0.049
 biggest-0.049
 accomplishment-0.049
 anything-0.049
Token<|endoftext|>
Token ID50256
Feature activation-4.74e-02

Pos logit contributions
o+0.003
jo+0.003
ppy+0.003
y+0.003
os+0.003
Neg logit contributions
 top-0.002
 anything-0.002
 biggest-0.002
 new-0.002
 something-0.002
TokenJo
Token ID9908
Feature activation-1.21e-01

Pos logit contributions
o+0.128
ppy+0.124
jo+0.124
os+0.119
y+0.119
Neg logit contributions
 top-0.076
 anything-0.071
 bottom-0.069
 new-0.068
 biggest-0.068
Tokenanne
Token ID21952
Feature activation-1.83e-01

Pos logit contributions
o+0.226
azz+0.211
ppy+0.206
os+0.203
lets+0.194
Neg logit contributions
 top-0.271
 bottom-0.254
 new-0.252
 accomplishment-0.251
 anything-0.246
Token was
Token ID373
Feature activation+6.06e-02

Pos logit contributions
ppy+0.409
o+0.406
azz+0.405
os+0.386
jo+0.384
Neg logit contributions
 top-0.338
 many-0.313
 new-0.311
 bottom-0.308
 middle-0.305
Token walking
Token ID6155
Feature activation+1.82e-02

Pos logit contributions
 anything+0.101
 avail+0.101
 tasks+0.099
 accomplishment+0.097
 chores+0.097
Neg logit contributions
o-0.162
y-0.140
ppy-0.136
jo-0.134
os-0.133
Token in
Token ID287
Feature activation+4.72e-02

Pos logit contributions
 anything+0.032
 chores+0.032
 top+0.031
 accomplishment+0.031
 biggest+0.031
Neg logit contributions
o-0.047
y-0.043
jo-0.043
os-0.042
ppy-0.042
Token the
Token ID262
Feature activation+5.36e-02

Pos logit contributions
 biggest+0.065
rier+0.063
 chores+0.062
 accomplishment+0.059
 cheapest+0.057
Neg logit contributions
o-0.132
y-0.124
jo-0.123
os-0.120
of-0.112
Token woods
Token ID16479
Feature activation-5.01e-02

Pos logit contributions
 accomplishment+0.080
 anything+0.072
 chores+0.072
 progress+0.067
 tasks+0.064
Neg logit contributions
o-0.159
y-0.144
jo-0.140
os-0.136
itty-0.130
Token
Token ID198
Feature activation-1.55e-02

Pos logit contributions
ppy+0.168
o+0.167
jo+0.167
os+0.161
y+0.156
Neg logit contributions
 new-0.088
 top-0.086
 many-0.078
 trouble-0.077
 biggest-0.077
Token
Token ID198
Feature activation-1.62e-02

Pos logit contributions
o+0.042
ppy+0.041
jo+0.040
os+0.040
y+0.039
Neg logit contributions
 top-0.024
 new-0.023
 anything-0.022
 chores-0.022
 many-0.022
TokenThe
Token ID464
Feature activation+2.55e-02

Pos logit contributions
o+0.033
jo+0.030
os+0.029
ppy+0.029
y+0.028
Neg logit contributions
 top-0.038
 anything-0.037
 chores-0.037
 biggest-0.037
 new-0.036
Token se
Token ID384
Feature activation-6.80e-02

Pos logit contributions
 anything+0.053
 chores+0.053
 top+0.052
 accomplishment+0.052
 progress+0.050
Neg logit contributions
o-0.056
y-0.051
ppy-0.049
jo-0.048
os-0.048
Tokenag
Token ID363
Feature activation-4.86e-02

Pos logit contributions
o+0.180
os+0.171
ppy+0.168
jo+0.166
y+0.160
Neg logit contributions
 accomplishment-0.103
 new-0.097
 chores-0.096
 good-0.095
 original-0.095
Tokenull
Token ID724
Feature activation-1.83e-01

Pos logit contributions
o+0.113
ppy+0.109
jo+0.108
os+0.105
azz+0.101
Neg logit contributions
 top-0.094
 bottom-0.087
 anything-0.087
 new-0.087
 biggest-0.086
Token tried
Token ID3088
Feature activation+3.60e-02

Pos logit contributions
jo+0.434
azz+0.401
os+0.386
ppy+0.386
anna+0.369
Neg logit contributions
 top-0.356
 new-0.355
 many-0.334
 good-0.334
 other-0.328
Token to
Token ID284
Feature activation+3.93e-02

Pos logit contributions
 top+0.070
 anything+0.068
 chores+0.067
 biggest+0.066
 new+0.066
Neg logit contributions
o-0.087
jo-0.082
ppy-0.080
os-0.080
y-0.079
Token take
Token ID1011
Feature activation+6.50e-02

Pos logit contributions
 top+0.069
 anything+0.067
 chores+0.066
 biggest+0.066
 accomplishment+0.065
Neg logit contributions
o-0.102
jo-0.095
os-0.093
ppy-0.093
y-0.093
Token the
Token ID262
Feature activation+5.92e-02

Pos logit contributions
 chores+0.074
 accomplishment+0.073
 biggest+0.072
 top+0.070
 bottom+0.070
Neg logit contributions
o-0.203
y-0.188
ppy-0.188
os-0.185
jo-0.182
Token oy
Token ID35104
Feature activation-7.94e-02

Pos logit contributions
 chores+0.111
 accomplishment+0.108
 anything+0.105
 progress+0.104
 tasks+0.103
Neg logit contributions
o-0.141
y-0.134
ppy-0.125
jo-0.120
os-0.119
Token to
Token ID284
Feature activation+7.19e-02

Pos logit contributions
 top+0.054
 anything+0.053
 chores+0.052
 new+0.051
 biggest+0.051
Neg logit contributions
o-0.076
jo-0.071
os-0.070
ppy-0.069
y-0.067
Token play
Token ID711
Feature activation+5.50e-02

Pos logit contributions
 top+0.119
 anything+0.117
 chores+0.115
 biggest+0.113
 accomplishment+0.112
Neg logit contributions
o-0.193
jo-0.180
os-0.177
ppy-0.176
y-0.176
Token,
Token ID11
Feature activation+7.71e-03

Pos logit contributions
 biggest+0.110
 accomplishment+0.107
 chores+0.102
 top+0.101
 progress+0.100
Neg logit contributions
o-0.120
y-0.113
ppy-0.109
os-0.106
jo-0.105
Token so
Token ID523
Feature activation+2.07e-02

Pos logit contributions
 top+0.012
 anything+0.011
 chores+0.011
 biggest+0.011
 accomplishment+0.011
Neg logit contributions
o-0.021
jo-0.020
y-0.020
ppy-0.020
os-0.020
Token G
Token ID402
Feature activation-7.14e-02

Pos logit contributions
 top+0.032
 anything+0.031
 chores+0.030
 biggest+0.030
 accomplishment+0.030
Neg logit contributions
o-0.058
jo-0.054
os-0.053
ppy-0.053
y-0.053
Tokenaby
Token ID3930
Feature activation-1.82e-01

Pos logit contributions
o+0.163
os+0.145
jo+0.143
y+0.138
ppy+0.136
Neg logit contributions
 top-0.150
 anything-0.145
 accomplishment-0.142
 chores-0.141
 bottom-0.140
Token said
Token ID531
Feature activation-4.46e-03

Pos logit contributions
o+0.369
azz+0.344
jo+0.341
os+0.337
ppy+0.325
Neg logit contributions
 top-0.358
 something-0.338
 good-0.337
 great-0.332
 new-0.331
Token,
Token ID11
Feature activation-2.62e-02

Pos logit contributions
o+0.013
jo+0.012
os+0.012
ppy+0.012
y+0.012
Neg logit contributions
 top-0.006
 anything-0.006
 chores-0.006
 new-0.006
 accomplishment-0.006
Token "
Token ID366
Feature activation-1.42e-02

Pos logit contributions
o+0.079
jo+0.077
os+0.076
ppy+0.075
y+0.075
Neg logit contributions
 top-0.035
 something-0.032
 anything-0.032
 new-0.032
 good-0.029
TokenI
Token ID40
Feature activation+5.16e-03

Pos logit contributions
o+0.040
jo+0.038
ppy+0.038
os+0.038
y+0.036
Neg logit contributions
 top-0.021
 most-0.020
 anything-0.020
 biggest-0.020
 trouble-0.019
Token don
Token ID836
Feature activation+2.36e-02

Pos logit contributions
 great+0.009
 trouble+0.009
 chores+0.009
 top+0.009
 good+0.009
Neg logit contributions
o-0.012
jo-0.012
os-0.011
ppy-0.011
y-0.011
Token coming
Token ID2406
Feature activation+3.50e-02

Pos logit contributions
 top+0.073
 chores+0.068
 accomplishment+0.067
 bottom+0.067
 biggest+0.065
Neg logit contributions
o-0.207
ppy-0.187
jo-0.186
os-0.185
y-0.182
Token right
Token ID826
Feature activation+5.32e-02

Pos logit contributions
 top+0.057
 anything+0.056
 biggest+0.054
 new+0.053
 chores+0.053
Neg logit contributions
o-0.094
jo-0.090
os-0.088
ppy-0.088
y-0.087
Token now
Token ID783
Feature activation+4.61e-02

Pos logit contributions
 top+0.091
 anything+0.089
 chores+0.089
 biggest+0.088
 accomplishment+0.087
Neg logit contributions
o-0.140
jo-0.129
y-0.128
ppy-0.127
os-0.127
Token,"
Token ID553
Feature activation-2.17e-02

Pos logit contributions
 top+0.088
 anything+0.086
 chores+0.085
 biggest+0.084
 accomplishment+0.083
Neg logit contributions
o-0.112
jo-0.103
os-0.102
ppy-0.102
y-0.101
Token the
Token ID262
Feature activation+2.39e-02

Pos logit contributions
o+0.063
jo+0.059
os+0.058
ppy+0.058
y+0.058
Neg logit contributions
 top-0.031
 anything-0.030
 biggest-0.030
 chores-0.030
 accomplishment-0.029
Token woman
Token ID2415
Feature activation-1.82e-01

Pos logit contributions
 top+0.052
 anything+0.051
 chores+0.051
 biggest+0.050
 accomplishment+0.050
Neg logit contributions
o-0.052
jo-0.047
ppy-0.046
y-0.046
os-0.046
Token says
Token ID1139
Feature activation-3.00e-02

Pos logit contributions
jo+0.446
o+0.441
ppy+0.437
y+0.417
ety+0.408
Neg logit contributions
 new-0.330
 bigger-0.303
 top-0.298
 the-0.292
 trouble-0.289
Token.
Token ID13
Feature activation-6.19e-02

Pos logit contributions
o+0.085
jo+0.082
os+0.078
ppy+0.078
y+0.077
Neg logit contributions
 new-0.043
 top-0.041
 anything-0.041
 accomplishment-0.041
 chores-0.039
Token
Token ID198
Feature activation-2.24e-02

Pos logit contributions
o+0.173
jo+0.165
ppy+0.164
os+0.162
y+0.161
Neg logit contributions
 top-0.096
 new-0.093
 trouble-0.091
 anything-0.090
 something-0.088
Token
Token ID198
Feature activation-2.22e-02

Pos logit contributions
o+0.058
jo+0.054
ppy+0.053
y+0.053
os+0.053
Neg logit contributions
 top-0.039
 anything-0.038
 new-0.037
 chores-0.037
 accomplishment-0.037
Token"
Token ID1
Feature activation-2.38e-02

Pos logit contributions
o+0.050
jo+0.045
os+0.045
ppy+0.044
y+0.044
Neg logit contributions
 top-0.047
 anything-0.046
 chores-0.046
 biggest-0.045
 accomplishment-0.045
Token here
Token ID994
Feature activation+2.37e-02

Pos logit contributions
 top+0.029
 anything+0.029
 chores+0.028
 biggest+0.028
 accomplishment+0.027
Neg logit contributions
o-0.050
jo-0.048
os-0.047
ppy-0.047
y-0.046
Token,
Token ID11
Feature activation+7.69e-04

Pos logit contributions
 top+0.042
 chores+0.041
 anything+0.041
 accomplishment+0.041
 biggest+0.041
Neg logit contributions
o-0.060
y-0.054
jo-0.054
ppy-0.054
os-0.053
Token cat
Token ID3797
Feature activation-1.23e-01

Pos logit contributions
 top+0.001
 anything+0.001
 new+0.001
 biggest+0.001
 chores+0.001
Neg logit contributions
o-0.002
jo-0.002
os-0.002
y-0.002
ppy-0.002
Token,"
Token ID553
Feature activation-2.56e-02

Pos logit contributions
jo+0.328
os+0.327
o+0.322
azz+0.317
ppy+0.314
Neg logit contributions
 anything-0.171
 top-0.167
 new-0.158
 something-0.156
 good-0.147
Token the
Token ID262
Feature activation+1.21e-02

Pos logit contributions
o+0.076
jo+0.072
ppy+0.071
os+0.071
y+0.070
Neg logit contributions
 top-0.036
 anything-0.035
 chores-0.034
 new-0.034
 biggest-0.033
Token woman
Token ID2415
Feature activation-1.81e-01

Pos logit contributions
 top+0.024
 biggest+0.023
 chores+0.023
 new+0.023
 anything+0.023
Neg logit contributions
o-0.029
jo-0.027
os-0.027
ppy-0.026
y-0.026
Token said
Token ID531
Feature activation-1.82e-02

Pos logit contributions
jo+0.450
uddy+0.418
enny+0.414
azz+0.413
ppy+0.408
Neg logit contributions
 the-0.264
 new-0.253
 more-0.251
 a-0.243
 bigger-0.234
Token with
Token ID351
Feature activation+4.75e-02

Pos logit contributions
o+0.049
jo+0.047
os+0.046
ppy+0.046
y+0.045
Neg logit contributions
 top-0.029
 anything-0.029
 new-0.029
 accomplishment-0.028
 chores-0.028
Token a
Token ID257
Feature activation+4.22e-02

Pos logit contributions
 site+0.061
 accomplishment+0.059
 biggest+0.058
 bottom+0.057
 top+0.057
Neg logit contributions
o-0.127
ppy-0.123
y-0.116
of-0.114
azz-0.113
Token big
Token ID1263
Feature activation+4.05e-02

Pos logit contributions
 top+0.060
 anything+0.060
 chores+0.058
 biggest+0.057
 accomplishment+0.057
Neg logit contributions
o-0.122
jo-0.114
y-0.112
os-0.112
ppy-0.112
Token smile
Token ID8212
Feature activation-4.14e-02

Pos logit contributions
 top+0.074
 accomplishment+0.071
 chores+0.071
 new+0.070
 biggest+0.070
Neg logit contributions
o-0.104
jo-0.096
ppy-0.095
os-0.095
y-0.093
Token said
Token ID531
Feature activation-1.11e-02

Pos logit contributions
o+0.119
os+0.117
jo+0.115
itty+0.108
y+0.108
Neg logit contributions
 new-0.118
 something-0.115
 good-0.114
 most-0.113
 amazing-0.112
Token.
Token ID13
Feature activation-5.01e-02

Pos logit contributions
o+0.028
jo+0.026
os+0.025
ppy+0.025
y+0.025
Neg logit contributions
 top-0.021
 anything-0.020
 chores-0.020
 biggest-0.020
 accomplishment-0.020
Token
Token ID198
Feature activation-9.79e-03

Pos logit contributions
o+0.141
jo+0.138
ppy+0.135
y+0.133
os+0.132
Neg logit contributions
 top-0.079
 new-0.072
 anything-0.071
 biggest-0.070
 something-0.069
Token
Token ID198
Feature activation-9.40e-03

Pos logit contributions
o+0.027
ppy+0.026
jo+0.025
y+0.025
os+0.025
Neg logit contributions
 top-0.016
 new-0.015
 anything-0.014
 many-0.014
 biggest-0.014
TokenThe
Token ID464
Feature activation+3.16e-02

Pos logit contributions
o+0.029
jo+0.027
os+0.027
ppy+0.026
y+0.026
Neg logit contributions
 top-0.013
 anything-0.012
 chores-0.012
 biggest-0.012
 new-0.011
Token maid
Token ID25920
Feature activation-1.81e-01

Pos logit contributions
 anything+0.052
 chores+0.051
 accomplishment+0.051
 top+0.049
 trouble+0.049
Neg logit contributions
o-0.079
y-0.074
ppy-0.073
jo-0.071
os-0.070
Token smiled
Token ID13541
Feature activation+5.35e-03

Pos logit contributions
jo+0.362
round+0.343
o+0.335
azz+0.335
ppy+0.332
Neg logit contributions
 top-0.324
 best-0.324
 many-0.320
 new-0.318
 something-0.312
Token.
Token ID13
Feature activation-4.90e-02

Pos logit contributions
 top+0.011
 anything+0.011
 new+0.011
 biggest+0.011
 something+0.010
Neg logit contributions
o-0.012
jo-0.011
os-0.011
ppy-0.011
y-0.011
Token "
Token ID366
Feature activation-9.07e-03

Pos logit contributions
o+0.145
jo+0.142
ppy+0.141
os+0.136
y+0.135
Neg logit contributions
 top-0.067
 new-0.063
 biggest-0.062
 anything-0.059
 most-0.058
TokenYes
Token ID5297
Feature activation-1.38e-02

Pos logit contributions
o+0.023
jo+0.023
os+0.022
ppy+0.022
y+0.021
Neg logit contributions
 top-0.015
 anything-0.015
 most-0.015
 biggest-0.015
 something-0.014
Token,
Token ID11
Feature activation-4.51e-03

Pos logit contributions
o+0.041
jo+0.038
os+0.037
ppy+0.036
y+0.036
Neg logit contributions
 top-0.022
 anything-0.021
 new-0.020
 biggest-0.019
 accomplishment-0.019
Token sorry
Token ID7926
Feature activation+2.67e-02

Pos logit contributions
 top+0.065
 accomplishment+0.063
 chores+0.062
 bottom+0.060
 tasks+0.059
Neg logit contributions
o-0.190
ppy-0.171
jo-0.170
os-0.169
y-0.165
Token,
Token ID11
Feature activation+3.29e-03

Pos logit contributions
 anything+0.039
 top+0.038
 new+0.038
 chores+0.036
 something+0.036
Neg logit contributions
o-0.078
jo-0.074
os-0.073
ppy-0.072
y-0.071
Token kids
Token ID3988
Feature activation-1.10e-01

Pos logit contributions
 top+0.006
 anything+0.006
 chores+0.006
 biggest+0.006
 accomplishment+0.006
Neg logit contributions
o-0.008
jo-0.008
os-0.008
ppy-0.008
y-0.008
Token,"
Token ID553
Feature activation-2.40e-02

Pos logit contributions
o+0.320
jo+0.319
ppy+0.313
os+0.310
azz+0.309
Neg logit contributions
 anything-0.133
 trouble-0.131
 top-0.129
 new-0.129
 good-0.123
Token the
Token ID262
Feature activation+1.15e-02

Pos logit contributions
o+0.052
jo+0.046
os+0.046
y+0.046
azz+0.045
Neg logit contributions
 biggest-0.049
 top-0.048
 bottom-0.047
 accomplishment-0.047
 anything-0.046
Token woman
Token ID2415
Feature activation-1.81e-01

Pos logit contributions
 top+0.024
 anything+0.024
 chores+0.023
 biggest+0.023
 accomplishment+0.023
Neg logit contributions
o-0.026
jo-0.024
ppy-0.023
y-0.023
os-0.023
Token says
Token ID1139
Feature activation-3.67e-02

Pos logit contributions
jo+0.436
o+0.432
ppy+0.426
y+0.404
ety+0.388
Neg logit contributions
 new-0.351
 top-0.328
 trouble-0.325
 bigger-0.316
 something-0.309
Token.
Token ID13
Feature activation-6.13e-02

Pos logit contributions
o+0.101
jo+0.097
ppy+0.092
os+0.092
y+0.091
Neg logit contributions
 top-0.059
 anything-0.058
 new-0.058
 accomplishment-0.056
 chores-0.056
Token "
Token ID366
Feature activation-2.13e-02

Pos logit contributions
o+0.169
jo+0.159
os+0.157
ppy+0.157
y+0.153
Neg logit contributions
 top-0.102
 new-0.097
 anything-0.096
 trouble-0.094
 biggest-0.092
TokenRex
Token ID47389
Feature activation-1.44e-01

Pos logit contributions
o+0.056
jo+0.052
ppy+0.051
os+0.051
y+0.051
Neg logit contributions
 top-0.037
 anything-0.036
 biggest-0.035
 chores-0.035
 new-0.034
Token is
Token ID318
Feature activation+5.84e-02

Pos logit contributions
jo+0.378
o+0.363
ppy+0.357
os+0.353
azz+0.351
Neg logit contributions
 new-0.229
 top-0.227
 good-0.226
 anything-0.224
 great-0.219
Token smart
Token ID4451
Feature activation+4.12e-02

Pos logit contributions
 top+0.127
 bottom+0.126
 accomplishment+0.119
 avail+0.116
 chores+0.114
Neg logit contributions
o-0.221
ppy-0.191
y-0.188
os-0.181
jo-0.178
Token and
Token ID290
Feature activation+1.19e-02

Pos logit contributions
 top+0.088
 anything+0.086
 chores+0.085
 biggest+0.085
 accomplishment+0.084
Neg logit contributions
o-0.091
jo-0.083
y-0.081
ppy-0.081
os-0.081
Token safe
Token ID3338
Feature activation+3.91e-02

Pos logit contributions
 top+0.017
 bottom+0.017
 anything+0.016
 accomplishment+0.016
 biggest+0.016
Neg logit contributions
o-0.034
ppy-0.031
os-0.030
y-0.030
jo-0.030
Token,"
Token ID553
Feature activation-1.61e-02

Pos logit contributions
 accomplishment+0.077
 top+0.077
 biggest+0.076
 chores+0.075
 bottom+0.074
Neg logit contributions
o-0.091
y-0.084
os-0.080
ppy-0.080
jo-0.079
Token the
Token ID262
Feature activation+1.35e-02

Pos logit contributions
o+0.046
jo+0.043
ppy+0.042
os+0.042
y+0.042
Neg logit contributions
 top-0.024
 anything-0.024
 chores-0.023
 biggest-0.023
 accomplishment-0.023
Token policeman
Token ID32166
Feature activation-1.81e-01

Pos logit contributions
 top+0.022
 anything+0.022
 chores+0.021
 biggest+0.021
 accomplishment+0.021
Neg logit contributions
o-0.036
jo-0.034
os-0.033
ppy-0.033
y-0.033
Token says
Token ID1139
Feature activation-2.46e-02

Pos logit contributions
ppy+0.369
o+0.357
jo+0.356
y+0.332
ety+0.326
Neg logit contributions
 new-0.384
 top-0.375
 chores-0.366
 something-0.360
 money-0.356
Token.
Token ID13
Feature activation-5.84e-02

Pos logit contributions
o+0.066
jo+0.063
ppy+0.061
os+0.061
y+0.060
Neg logit contributions
 top-0.040
 anything-0.040
 new-0.039
 accomplishment-0.038
 something-0.038
Token
Token ID198
Feature activation-1.89e-02

Pos logit contributions
o+0.158
ppy+0.151
y+0.149
jo+0.149
os+0.148
Neg logit contributions
 top-0.094
 new-0.089
 anything-0.087
 trouble-0.085
 something-0.084
Token
Token ID198
Feature activation-1.81e-02

Pos logit contributions
o+0.048
jo+0.044
ppy+0.044
os+0.044
y+0.044
Neg logit contributions
 top-0.034
 anything-0.033
 chores-0.032
 new-0.032
 biggest-0.032
Token"
Token ID1
Feature activation-1.82e-02

Pos logit contributions
o+0.039
jo+0.035
os+0.035
ppy+0.035
y+0.035
Neg logit contributions
 top-0.040
 anything-0.039
 chores-0.038
 accomplishment-0.038
 biggest-0.038
Token the
Token ID262
Feature activation+8.58e-02

Pos logit contributions
 biggest+0.113
 accomplishment+0.107
 bottom+0.100
 chores+0.097
 top+0.096
Neg logit contributions
o-0.240
y-0.227
os-0.220
ppy-0.211
jo-0.206
Token park
Token ID3952
Feature activation-4.69e-02

Pos logit contributions
 accomplishment+0.121
 anything+0.107
 avail+0.107
 chores+0.105
 progress+0.103
Neg logit contributions
o-0.244
y-0.226
os-0.217
jo-0.211
ah-0.208
Token,"
Token ID553
Feature activation-2.26e-02

Pos logit contributions
o+0.116
jo+0.108
ppy+0.106
os+0.106
y+0.104
Neg logit contributions
 top-0.086
 anything-0.085
 chores-0.083
 biggest-0.082
 accomplishment-0.081
Token the
Token ID262
Feature activation+1.76e-02

Pos logit contributions
o+0.063
jo+0.059
ppy+0.058
os+0.058
y+0.058
Neg logit contributions
 top-0.036
 anything-0.035
 chores-0.034
 new-0.034
 accomplishment-0.033
Token other
Token ID584
Feature activation-4.61e-02

Pos logit contributions
 anything+0.029
 chores+0.028
 accomplishment+0.027
 top+0.027
 biggest+0.026
Neg logit contributions
o-0.046
y-0.043
jo-0.042
os-0.042
ppy-0.041
Token kid
Token ID5141
Feature activation-1.80e-01

Pos logit contributions
o+0.141
jo+0.134
ppy+0.134
os+0.131
y+0.128
Neg logit contributions
 top-0.061
 anything-0.057
 new-0.057
 biggest-0.056
 chores-0.055
Token said
Token ID531
Feature activation-1.63e-02

Pos logit contributions
jo+0.391
o+0.376
ppy+0.371
azz+0.357
os+0.350
Neg logit contributions
 something-0.345
 new-0.343
 top-0.336
 many-0.333
 more-0.332
Token.
Token ID13
Feature activation-6.06e-02

Pos logit contributions
o+0.043
jo+0.040
os+0.039
ppy+0.039
y+0.038
Neg logit contributions
 top-0.027
 anything-0.027
 new-0.027
 accomplishment-0.026
 chores-0.026
Token
Token ID198
Feature activation-1.88e-02

Pos logit contributions
o+0.163
ppy+0.157
jo+0.157
y+0.154
os+0.153
Neg logit contributions
 top-0.098
 new-0.092
 anything-0.092
 something-0.088
 biggest-0.088
Token
Token ID198
Feature activation-1.87e-02

Pos logit contributions
o+0.049
ppy+0.046
jo+0.046
y+0.045
os+0.045
Neg logit contributions
 top-0.032
 new-0.031
 anything-0.030
 chores-0.030
 biggest-0.030
TokenThey
Token ID2990
Feature activation-4.34e-02

Pos logit contributions
o+0.046
jo+0.043
os+0.043
ppy+0.042
y+0.042
Neg logit contributions
 top-0.035
 anything-0.034
 chores-0.033
 biggest-0.033
 accomplishment-0.032
Token a
Token ID257
Feature activation+4.27e-02

Pos logit contributions
 anything+0.064
 top+0.062
 chores+0.061
 biggest+0.061
 new+0.061
Neg logit contributions
o-0.075
jo-0.071
ppy-0.069
os-0.067
y-0.067
Token nice
Token ID3621
Feature activation+2.45e-02

Pos logit contributions
 anything+0.038
 top+0.037
 accomplishment+0.036
 chores+0.035
 biggest+0.033
Neg logit contributions
o-0.142
jo-0.133
y-0.132
ppy-0.132
os-0.128
Token surprise
Token ID5975
Feature activation-9.34e-03

Pos logit contributions
 top+0.036
 new+0.034
 biggest+0.034
 accomplishment+0.033
 anything+0.033
Neg logit contributions
o-0.071
ppy-0.067
os-0.067
jo-0.067
y-0.064
Token,"
Token ID553
Feature activation-1.72e-02

Pos logit contributions
o+0.024
jo+0.022
ppy+0.022
os+0.022
y+0.021
Neg logit contributions
 top-0.017
 anything-0.017
 chores-0.016
 biggest-0.016
 new-0.016
Token the
Token ID262
Feature activation+1.73e-02

Pos logit contributions
o+0.051
os+0.049
jo+0.049
ppy+0.049
y+0.048
Neg logit contributions
 top-0.024
 anything-0.023
 new-0.022
 something-0.022
 chores-0.022
Token woman
Token ID2415
Feature activation-1.80e-01

Pos logit contributions
 top+0.029
 biggest+0.027
 anything+0.027
 chores+0.027
 new+0.027
Neg logit contributions
o-0.047
jo-0.044
ppy-0.044
os-0.044
y-0.043
Token said
Token ID531
Feature activation-2.26e-02

Pos logit contributions
o+0.409
ppy+0.402
jo+0.388
ety+0.380
uddy+0.371
Neg logit contributions
 top-0.353
 new-0.353
 something-0.330
 the-0.319
 many-0.319
Token.
Token ID13
Feature activation-5.92e-02

Pos logit contributions
o+0.062
jo+0.060
ppy+0.057
os+0.057
azz+0.057
Neg logit contributions
 top-0.035
 anything-0.035
 new-0.035
 accomplishment-0.034
 something-0.033
Token "
Token ID366
Feature activation-1.81e-02

Pos logit contributions
o+0.172
ppy+0.170
y+0.168
jo+0.167
os+0.166
Neg logit contributions
 top-0.082
 new-0.078
 something-0.071
 anything-0.070
 many-0.070
TokenI
Token ID40
Feature activation+2.82e-04

Pos logit contributions
o+0.049
jo+0.047
os+0.046
ppy+0.046
y+0.046
Neg logit contributions
 top-0.029
 anything-0.028
 biggest-0.027
 something-0.026
 most-0.026
Token brought
Token ID3181
Feature activation+7.86e-02

Pos logit contributions
 top+0.001
 anything+0.001
 great+0.001
 chores+0.001
 trouble+0.001
Neg logit contributions
o-0.001
jo-0.001
os-0.001
ppy-0.001
y-0.001
Token deep
Token ID2769
Feature activation+2.50e-02

Pos logit contributions
 top+0.018
 anything+0.018
 chores+0.017
 biggest+0.017
 accomplishment+0.017
Neg logit contributions
o-0.084
jo-0.080
os-0.079
y-0.079
ppy-0.079
Token!"
Token ID2474
Feature activation-4.60e-02

Pos logit contributions
 top+0.047
 anything+0.046
 biggest+0.045
 chores+0.045
 accomplishment+0.045
Neg logit contributions
o-0.061
jo-0.056
os-0.056
ppy-0.055
y-0.055
Token
Token ID198
Feature activation-1.47e-02

Pos logit contributions
o+0.131
ppy+0.129
y+0.126
jo+0.126
os+0.123
Neg logit contributions
 top-0.069
 something-0.069
 anything-0.066
 new-0.063
 trouble-0.060
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
o+0.040
ppy+0.040
y+0.039
os+0.038
jo+0.038
Neg logit contributions
 top-0.023
 new-0.021
 something-0.020
 anything-0.020
 great-0.020
TokenThe
Token ID464
Feature activation+2.70e-02

Pos logit contributions
o+0.042
os+0.040
jo+0.040
ppy+0.039
y+0.039
Neg logit contributions
 top-0.019
 anything-0.017
 new-0.016
 something-0.016
 chores-0.016
Token kid
Token ID5141
Feature activation-1.80e-01

Pos logit contributions
 anything+0.051
 chores+0.050
 accomplishment+0.049
 top+0.049
 biggest+0.047
Neg logit contributions
o-0.065
y-0.059
jo-0.058
os-0.058
ppy-0.057
Token remembered
Token ID12086
Feature activation+5.06e-02

Pos logit contributions
ppy+0.405
o+0.401
jo+0.394
azz+0.393
ety+0.386
Neg logit contributions
 top-0.363
 something-0.333
 new-0.314
 most-0.309
 bottom-0.308
Token what
Token ID644
Feature activation+2.53e-02

Pos logit contributions
 biggest+0.062
 top+0.061
 accomplishment+0.059
 chores+0.058
 bottom+0.056
Neg logit contributions
o-0.158
jo-0.146
os-0.145
y-0.145
ppy-0.144
Token the
Token ID262
Feature activation+6.12e-02

Pos logit contributions
 top+0.026
 biggest+0.024
 accomplishment+0.024
 anything+0.024
 chores+0.024
Neg logit contributions
o-0.084
jo-0.079
os-0.078
y-0.078
ppy-0.077
Token voice
Token ID3809
Feature activation-9.22e-02

Pos logit contributions
 anything+0.096
 chores+0.093
 accomplishment+0.092
 top+0.091
 biggest+0.087
Neg logit contributions
o-0.163
jo-0.152
y-0.152
os-0.146
ppy-0.142
Token said
Token ID531
Feature activation-2.74e-04

Pos logit contributions
o+0.200
ppy+0.191
jo+0.182
os+0.172
y+0.171
Neg logit contributions
 new-0.199
 anything-0.196
 top-0.193
 something-0.191
 amazing-0.190
TokenHello
Token ID15496
Feature activation-1.85e-03

Pos logit contributions
o+0.046
jo+0.046
ppy+0.044
os+0.044
y+0.042
Neg logit contributions
 top-0.027
 middle-0.025
 most-0.025
 biggest-0.025
 anything-0.025
Token,
Token ID11
Feature activation+1.33e-02

Pos logit contributions
o+0.005
jo+0.005
ppy+0.005
os+0.005
y+0.005
Neg logit contributions
 anything-0.003
 top-0.003
 new-0.003
 biggest-0.003
 something-0.003
Token kids
Token ID3988
Feature activation-1.21e-01

Pos logit contributions
 top+0.019
 anything+0.018
 new+0.018
 biggest+0.018
 chores+0.018
Neg logit contributions
o-0.039
jo-0.037
os-0.036
ppy-0.036
y-0.036
Token!"
Token ID2474
Feature activation-4.76e-02

Pos logit contributions
jo+0.328
o+0.322
ppy+0.319
azz+0.316
os+0.308
Neg logit contributions
 anything-0.162
 top-0.159
 new-0.158
 something-0.154
 things-0.151
Token the
Token ID262
Feature activation+1.32e-02

Pos logit contributions
o+0.132
jo+0.124
y+0.123
os+0.122
ppy+0.122
Neg logit contributions
 top-0.071
 anything-0.070
 biggest-0.070
 accomplishment-0.069
 chores-0.069
Token woman
Token ID2415
Feature activation-1.80e-01

Pos logit contributions
 anything+0.029
 top+0.029
 accomplishment+0.029
 chores+0.028
 biggest+0.028
Neg logit contributions
o-0.027
jo-0.025
y-0.025
ppy-0.024
os-0.024
Token says
Token ID1139
Feature activation-2.94e-02

Pos logit contributions
ppy+0.445
o+0.425
jo+0.412
ety+0.391
y+0.390
Neg logit contributions
 new-0.342
 top-0.310
 the-0.298
 bigger-0.293
 something-0.292
Token.
Token ID13
Feature activation-5.23e-02

Pos logit contributions
o+0.082
jo+0.078
ppy+0.077
os+0.075
azz+0.074
Neg logit contributions
 top-0.044
 new-0.044
 anything-0.042
 something-0.042
 accomplishment-0.042
Token "
Token ID366
Feature activation-1.62e-02

Pos logit contributions
o+0.146
jo+0.138
ppy+0.137
os+0.135
y+0.134
Neg logit contributions
 top-0.083
 new-0.079
 anything-0.078
 biggest-0.076
 something-0.075
TokenDo
Token ID5211
Feature activation+3.77e-02

Pos logit contributions
jo+0.046
o+0.046
ppy+0.044
os+0.043
y+0.043
Neg logit contributions
 top-0.022
 anything-0.022
 middle-0.020
 something-0.020
 most-0.020
Token you
Token ID345
Feature activation-6.76e-03

Pos logit contributions
 top+0.053
 anything+0.051
 chores+0.050
 biggest+0.050
 accomplishment+0.050
Neg logit contributions
o-0.111
jo-0.103
ppy-0.102
os-0.102
y-0.102
Token Cic
Token ID37212
Feature activation-5.01e-02

Pos logit contributions
 chores+0.040
 top+0.039
 biggest+0.039
 anything+0.039
 accomplishment+0.039
Neg logit contributions
o-0.053
os-0.047
ppy-0.046
y-0.046
jo-0.046
Tokeni
Token ID72
Feature activation-1.13e-01

Pos logit contributions
o+0.071
jo+0.068
azz+0.066
os+0.062
ppy+0.057
Neg logit contributions
 top-0.140
 anything-0.136
 chores-0.135
 accomplishment-0.134
 new-0.132
Token said
Token ID531
Feature activation+3.95e-03

Pos logit contributions
jo+0.244
o+0.230
azz+0.226
ppy+0.219
y+0.215
Neg logit contributions
 top-0.224
 chores-0.208
 new-0.205
 trouble-0.204
 something-0.203
Token,
Token ID11
Feature activation-1.81e-02

Pos logit contributions
 top+0.006
 biggest+0.006
 accomplishment+0.006
 chores+0.006
 bottom+0.006
Neg logit contributions
o-0.011
jo-0.010
y-0.010
os-0.010
ppy-0.010
Token B
Token ID347
Feature activation-8.79e-02

Pos logit contributions
o+0.051
jo+0.048
ppy+0.047
os+0.047
y+0.047
Neg logit contributions
 top-0.028
 anything-0.027
 chores-0.026
 biggest-0.026
 accomplishment-0.026
Tokenibi
Token ID27567
Feature activation-1.80e-01

Pos logit contributions
os+0.133
jo+0.126
o+0.124
y+0.115
bug+0.112
Neg logit contributions
 biggest-0.207
 new-0.204
 top-0.200
 accomplishment-0.200
 original-0.197
Token came
Token ID1625
Feature activation+1.92e-02

Pos logit contributions
jo+0.388
azz+0.381
o+0.379
os+0.369
ppy+0.359
Neg logit contributions
 top-0.342
 new-0.321
 good-0.312
 great-0.311
 most-0.308
Token back
Token ID736
Feature activation-1.18e-02

Pos logit contributions
 top+0.031
 anything+0.030
 biggest+0.030
 new+0.029
 chores+0.029
Neg logit contributions
o-0.052
jo-0.049
os-0.048
ppy-0.048
y-0.047
Token to
Token ID284
Feature activation+4.76e-02

Pos logit contributions
o+0.029
jo+0.028
ppy+0.027
os+0.027
y+0.027
Neg logit contributions
 top-0.022
 anything-0.021
 chores-0.021
 new-0.021
 accomplishment-0.021
Token play
Token ID711
Feature activation+3.94e-02

Pos logit contributions
 top+0.062
 biggest+0.062
 chores+0.061
 anything+0.060
 accomplishment+0.059
Neg logit contributions
o-0.144
jo-0.133
ppy-0.132
os-0.131
y-0.131
Token with
Token ID351
Feature activation+6.35e-02

Pos logit contributions
 biggest+0.076
 accomplishment+0.074
 top+0.072
 chores+0.072
 anything+0.070
Neg logit contributions
o-0.091
y-0.085
ppy-0.083
jo-0.081
os-0.081
Token to
Token ID284
Feature activation+7.58e-02

Pos logit contributions
 anything+0.064
 top+0.062
 new+0.062
 trouble+0.062
 biggest+0.062
Neg logit contributions
o-0.098
jo-0.092
ppy-0.092
os-0.090
y-0.086
Token you
Token ID345
Feature activation-1.76e-02

Pos logit contributions
 top+0.101
 anything+0.099
 chores+0.095
 biggest+0.094
 new+0.092
Neg logit contributions
o-0.229
jo-0.215
ppy-0.212
os-0.212
y-0.211
Token,"
Token ID553
Feature activation-1.87e-02

Pos logit contributions
o+0.043
jo+0.040
os+0.039
ppy+0.038
y+0.038
Neg logit contributions
 top-0.034
 anything-0.033
 chores-0.033
 biggest-0.032
 new-0.032
Token the
Token ID262
Feature activation+1.84e-02

Pos logit contributions
o+0.059
jo+0.056
ppy+0.056
y+0.055
os+0.055
Neg logit contributions
 top-0.022
 anything-0.021
 chores-0.020
 biggest-0.020
 new-0.020
Token other
Token ID584
Feature activation-5.23e-02

Pos logit contributions
 accomplishment+0.028
 chores+0.028
 anything+0.028
 top+0.027
 progress+0.026
Neg logit contributions
o-0.050
y-0.047
ppy-0.046
jo-0.046
os-0.045
Token kid
Token ID5141
Feature activation-1.79e-01

Pos logit contributions
o+0.156
jo+0.149
os+0.145
ppy+0.144
y+0.138
Neg logit contributions
 top-0.071
 anything-0.070
 biggest-0.068
 new-0.068
 chores-0.067
Token said
Token ID531
Feature activation-1.82e-02

Pos logit contributions
jo+0.417
ppy+0.386
os+0.372
o+0.372
azz+0.369
Neg logit contributions
 new-0.335
 something-0.330
 more-0.330
 a-0.315
 the-0.314
Token.
Token ID13
Feature activation-5.85e-02

Pos logit contributions
o+0.049
jo+0.046
os+0.045
ppy+0.045
y+0.043
Neg logit contributions
 top-0.030
 new-0.029
 anything-0.029
 accomplishment-0.029
 chores-0.028
Token
Token ID198
Feature activation-1.94e-02

Pos logit contributions
o+0.155
jo+0.148
ppy+0.147
os+0.144
y+0.144
Neg logit contributions
 top-0.098
 anything-0.094
 new-0.093
 biggest-0.091
 something-0.089
Token
Token ID198
Feature activation-1.95e-02

Pos logit contributions
o+0.049
jo+0.046
ppy+0.046
y+0.046
os+0.045
Neg logit contributions
 top-0.035
 anything-0.033
 new-0.033
 chores-0.033
 biggest-0.032
Token"
Token ID1
Feature activation-2.00e-02

Pos logit contributions
o+0.051
jo+0.048
os+0.047
ppy+0.047
y+0.047
Neg logit contributions
 top-0.033
 anything-0.032
 chores-0.032
 biggest-0.032
 accomplishment-0.031
Token said
Token ID531
Feature activation-1.06e-02

Pos logit contributions
o+0.112
jo+0.111
os+0.104
ppy+0.103
y+0.102
Neg logit contributions
 new-0.122
 top-0.119
 something-0.118
 most-0.117
 accomplishment-0.117
Token.
Token ID13
Feature activation-5.07e-02

Pos logit contributions
o+0.027
jo+0.025
os+0.024
ppy+0.024
y+0.024
Neg logit contributions
 top-0.020
 anything-0.019
 chores-0.019
 accomplishment-0.019
 biggest-0.019
Token
Token ID198
Feature activation-1.25e-02

Pos logit contributions
o+0.144
jo+0.140
ppy+0.138
y+0.135
os+0.134
Neg logit contributions
 top-0.077
 new-0.071
 many-0.069
 biggest-0.068
 anything-0.067
Token
Token ID198
Feature activation-1.18e-02

Pos logit contributions
o+0.034
ppy+0.032
jo+0.032
y+0.031
os+0.031
Neg logit contributions
 top-0.021
 new-0.019
 anything-0.019
 biggest-0.018
 chores-0.018
TokenThe
Token ID464
Feature activation+3.18e-02

Pos logit contributions
o+0.034
jo+0.032
ppy+0.031
os+0.031
y+0.031
Neg logit contributions
 top-0.017
 anything-0.017
 chores-0.017
 biggest-0.016
 accomplishment-0.016
Token maid
Token ID25920
Feature activation-1.79e-01

Pos logit contributions
 anything+0.053
 chores+0.053
 accomplishment+0.052
 top+0.051
 trouble+0.050
Neg logit contributions
o-0.078
y-0.074
ppy-0.073
jo-0.071
os-0.069
Token quickly
Token ID2952
Feature activation-2.96e-02

Pos logit contributions
jo+0.355
o+0.335
ppy+0.332
azz+0.331
ety+0.330
Neg logit contributions
 top-0.335
 new-0.328
 many-0.325
 best-0.325
 most-0.317
Token ran
Token ID4966
Feature activation+9.24e-03

Pos logit contributions
o+0.065
jo+0.061
os+0.057
ppy+0.056
y+0.055
Neg logit contributions
 top-0.065
 chores-0.063
 accomplishment-0.063
 new-0.062
 anything-0.060
Token away
Token ID1497
Feature activation+3.16e-02

Pos logit contributions
 top+0.017
 anything+0.017
 biggest+0.017
 chores+0.017
 accomplishment+0.016
Neg logit contributions
o-0.023
jo-0.021
y-0.021
os-0.021
ppy-0.021
Token to
Token ID284
Feature activation+5.18e-02

Pos logit contributions
 biggest+0.067
 chores+0.065
 anything+0.065
 top+0.064
 bottom+0.061
Neg logit contributions
o-0.069
y-0.065
jo-0.062
os-0.062
ppy-0.061
Token find
Token ID1064
Feature activation+6.21e-02

Pos logit contributions
 biggest+0.093
 most+0.084
 progress+0.082
 top+0.082
 chores+0.081
Neg logit contributions
o-0.113
ppy-0.107
jo-0.104
os-0.103
y-0.101
Token arms
Token ID5101
Feature activation-4.72e-02

Pos logit contributions
 top+0.049
 biggest+0.047
 new+0.046
 chores+0.046
 accomplishment+0.046
Neg logit contributions
o-0.080
jo-0.076
os-0.074
ppy-0.074
y-0.073
Token.
Token ID13
Feature activation-4.90e-02

Pos logit contributions
o+0.125
jo+0.119
ppy+0.116
os+0.115
itty+0.110
Neg logit contributions
 top-0.080
 anything-0.075
 new-0.075
 biggest-0.074
 chores-0.073
Token The
Token ID383
Feature activation+3.17e-02

Pos logit contributions
o+0.124
jo+0.114
os+0.113
y+0.112
ppy+0.112
Neg logit contributions
 top-0.088
 anything-0.087
 chores-0.086
 biggest-0.085
 accomplishment-0.084
Token se
Token ID384
Feature activation-6.58e-02

Pos logit contributions
 top+0.056
 anything+0.056
 accomplishment+0.055
 chores+0.055
 biggest+0.053
Neg logit contributions
o-0.080
y-0.072
jo-0.072
ppy-0.072
os-0.071
Tokenag
Token ID363
Feature activation-4.31e-02

Pos logit contributions
o+0.168
ppy+0.164
os+0.159
azz+0.155
y+0.154
Neg logit contributions
 accomplishment-0.096
 choice-0.088
 chores-0.087
 freedom-0.085
 original-0.085
Tokenull
Token ID724
Feature activation-1.79e-01

Pos logit contributions
o+0.101
ppy+0.101
jo+0.097
azz+0.096
os+0.093
Neg logit contributions
 top-0.079
 bottom-0.074
 anything-0.072
 something-0.072
 new-0.070
Token flew
Token ID13112
Feature activation+2.77e-02

Pos logit contributions
jo+0.416
o+0.410
ppy+0.407
azz+0.405
enny+0.393
Neg logit contributions
 top-0.343
 new-0.325
 most-0.305
 something-0.301
 good-0.300
Token away
Token ID1497
Feature activation+2.86e-02

Pos logit contributions
 top+0.046
 anything+0.044
 biggest+0.043
 new+0.043
 accomplishment+0.042
Neg logit contributions
o-0.074
jo-0.070
ppy-0.069
os-0.069
y-0.067
Token.
Token ID13
Feature activation-4.90e-02

Pos logit contributions
 chores+0.060
 biggest+0.058
 anything+0.058
 top+0.057
 accomplishment+0.056
Neg logit contributions
o-0.064
y-0.060
os-0.056
jo-0.056
ppy-0.056
Token Tommy
Token ID19919
Feature activation-1.39e-01

Pos logit contributions
o+0.115
jo+0.104
os+0.103
y+0.101
ppy+0.099
Neg logit contributions
 anything-0.095
 chores-0.095
 top-0.094
 biggest-0.094
 accomplishment-0.093
Token laughed
Token ID13818
Feature activation+7.74e-03

Pos logit contributions
jo+0.327
azz+0.320
os+0.316
o+0.314
itty+0.306
Neg logit contributions
 top-0.256
 the-0.228
 something-0.226
 new-0.225
 bottom-0.221
Token.
Token ID13
Feature activation-4.47e-02

Pos logit contributions
 top+0.011
 chores+0.011
 biggest+0.011
 anything+0.011
 accomplishment+0.011
Neg logit contributions
o-0.012
jo-0.011
y-0.011
ppy-0.011
os-0.011
Token
Token ID198
Feature activation-6.10e-04

Pos logit contributions
o+0.117
y+0.113
ppy+0.113
jo+0.110
os+0.110
Neg logit contributions
 top-0.075
 anything-0.070
 biggest-0.069
 new-0.068
 accomplishment-0.067
Token
Token ID198
Feature activation-1.12e-03

Pos logit contributions
o+0.002
ppy+0.001
jo+0.001
y+0.001
os+0.001
Neg logit contributions
 top-0.001
 anything-0.001
 chores-0.001
 new-0.001
 biggest-0.001
TokenHe
Token ID1544
Feature activation-3.02e-02

Pos logit contributions
o+0.003
jo+0.003
os+0.003
ppy+0.003
y+0.003
Neg logit contributions
 top-0.002
 anything-0.002
 chores-0.002
 biggest-0.002
 accomplishment-0.002
Token never
Token ID1239
Feature activation+4.51e-02

Pos logit contributions
o+0.063
jo+0.058
os+0.058
ppy+0.057
y+0.056
Neg logit contributions
 top-0.068
 new-0.064
 anything-0.064
 something-0.063
 accomplishment-0.063
Token used
Token ID973
Feature activation+8.51e-02

Pos logit contributions
 anything+0.095
 trouble+0.089
 something+0.088
 new+0.087
 bottom+0.086
Neg logit contributions
o-0.094
jo-0.092
os-0.087
itty-0.086
ppy-0.084
Token the
Token ID262
Feature activation+7.43e-02

Pos logit contributions
 biggest+0.099
 top+0.099
 bottom+0.099
 middle+0.098
 chores+0.092
Neg logit contributions
o-0.258
y-0.237
os-0.234
jo-0.233
ppy-0.227
Token knife
Token ID9845
Feature activation-9.66e-02

Pos logit contributions
 top+0.107
 anything+0.106
 chores+0.102
 accomplishment+0.102
 biggest+0.101
Neg logit contributions
o-0.206
jo-0.192
y-0.191
ppy-0.189
os-0.188
Token again
Token ID757
Feature activation+1.44e-02

Pos logit contributions
o+0.237
jo+0.218
ppy+0.215
os+0.215
y+0.214
Neg logit contributions
 top-0.181
 anything-0.178
 chores-0.175
 biggest-0.174
 accomplishment-0.173
Token,
Token ID11
Feature activation+7.04e-03

Pos logit contributions
 top+0.030
 chores+0.029
 biggest+0.029
 anything+0.029
 accomplishment+0.029
Neg logit contributions
o-0.033
jo-0.030
y-0.030
os-0.029
ppy-0.029
Token and
Token ID290
Feature activation+1.30e-02

Pos logit contributions
 top+0.013
 chores+0.013
 biggest+0.013
 anything+0.013
 bottom+0.013
Neg logit contributions
o-0.017
jo-0.015
y-0.015
os-0.015
ppy-0.015
Token that
Token ID326
Feature activation+2.08e-02

Pos logit contributions
 top+0.065
 biggest+0.064
 anything+0.062
 chores+0.061
 bottom+0.061
Neg logit contributions
o-0.114
jo-0.106
y-0.105
os-0.104
ppy-0.103
Token their
Token ID511
Feature activation+4.62e-02

Pos logit contributions
 top+0.028
 biggest+0.027
 accomplishment+0.026
 bottom+0.026
 anything+0.026
Neg logit contributions
o-0.061
jo-0.057
os-0.057
y-0.057
ppy-0.055
Token hard
Token ID1327
Feature activation+2.05e-02

Pos logit contributions
 anything+0.053
 top+0.052
 biggest+0.049
 chores+0.049
 bottom+0.047
Neg logit contributions
o-0.138
jo-0.133
y-0.131
os-0.130
ppy-0.127
Token work
Token ID670
Feature activation+1.74e-03

Pos logit contributions
 top+0.027
 anything+0.027
 biggest+0.026
 chores+0.026
 accomplishment+0.025
Neg logit contributions
o-0.060
jo-0.057
y-0.057
os-0.056
ppy-0.055
Token had
Token ID550
Feature activation+7.16e-02

Pos logit contributions
 top+0.004
 anything+0.004
 chores+0.004
 biggest+0.004
 accomplishment+0.004
Neg logit contributions
o-0.004
jo-0.003
ppy-0.003
os-0.003
y-0.003
Token made
Token ID925
Feature activation+8.36e-02

Pos logit contributions
 top+0.147
 biggest+0.140
 bottom+0.140
 chores+0.139
 accomplishment+0.138
Neg logit contributions
o-0.162
y-0.148
jo-0.145
os-0.145
ppy-0.143
Token a
Token ID257
Feature activation+6.43e-02

Pos logit contributions
 top+0.085
 bottom+0.081
 biggest+0.080
 middle+0.077
 accomplishment+0.074
Neg logit contributions
o-0.265
jo-0.252
y-0.252
os-0.249
ppy-0.236
Token difference
Token ID3580
Feature activation+4.17e-03

Pos logit contributions
 anything+0.060
 chores+0.059
 top+0.058
 yourselves+0.056
 biggest+0.055
Neg logit contributions
o-0.203
y-0.194
jo-0.192
ppy-0.182
os-0.182
Token.
Token ID13
Feature activation-5.06e-02

Pos logit contributions
 top+0.008
 chores+0.008
 anything+0.008
 biggest+0.008
 accomplishment+0.008
Neg logit contributions
o-0.010
y-0.009
jo-0.009
os-0.009
ppy-0.009
Token<|endoftext|>
Token ID50256
Feature activation-5.33e-02

Pos logit contributions
o+0.131
ppy+0.126
y+0.124
jo+0.122
os+0.121
Neg logit contributions
 top-0.085
 anything-0.082
 new-0.080
 biggest-0.079
 accomplishment-0.079
TokenOnce
Token ID7454
Feature activation+6.98e-03

Pos logit contributions
ppy+0.147
o+0.147
jo+0.144
y+0.137
os+0.136
Neg logit contributions
 top-0.079
 anything-0.073
 new-0.071
 bottom-0.071
 biggest-0.069
Token picking
Token ID10868
Feature activation+5.82e-02

Pos logit contributions
 top+0.092
 anything+0.091
 biggest+0.091
 accomplishment+0.089
 chores+0.088
Neg logit contributions
o-0.110
y-0.099
jo-0.098
os-0.098
ppy-0.096
Token up
Token ID510
Feature activation+6.23e-02

Pos logit contributions
 accomplishment+0.092
 End+0.081
 chores+0.080
 bottom+0.079
 freedom+0.078
Neg logit contributions
o-0.157
y-0.153
os-0.140
ppy-0.137
jo-0.133
Token her
Token ID607
Feature activation+2.17e-02

Pos logit contributions
 accomplishment+0.077
 top+0.074
 biggest+0.073
 bottom+0.071
 anything+0.068
Neg logit contributions
o-0.187
y-0.178
os-0.175
ppy-0.173
jo-0.171
Token toys
Token ID14958
Feature activation-4.79e-02

Pos logit contributions
 accomplishment+0.033
 anything+0.032
 top+0.031
 chores+0.031
 progress+0.030
Neg logit contributions
o-0.061
y-0.057
jo-0.055
os-0.054
ppy-0.054
Token and
Token ID290
Feature activation+2.24e-03

Pos logit contributions
o+0.098
y+0.094
ppy+0.087
os+0.087
jo+0.081
Neg logit contributions
 biggest-0.105
 accomplishment-0.105
 anything-0.103
 chores-0.103
 top-0.102
Token putting
Token ID5137
Feature activation+8.89e-02

Pos logit contributions
 top+0.004
 biggest+0.004
 anything+0.003
 chores+0.003
 accomplishment+0.003
Neg logit contributions
o-0.006
y-0.005
os-0.005
ppy-0.005
jo-0.005
Token them
Token ID606
Feature activation-3.92e-02

Pos logit contributions
 accomplishment+0.127
 biggest+0.121
 progress+0.117
 top+0.117
 bottom+0.117
Neg logit contributions
o-0.252
y-0.236
ppy-0.229
os-0.228
jo-0.221
Token away
Token ID1497
Feature activation+4.68e-02

Pos logit contributions
o+0.104
jo+0.097
os+0.095
ppy+0.095
y+0.095
Neg logit contributions
 top-0.066
 anything-0.065
 chores-0.064
 biggest-0.063
 accomplishment-0.063
Token.
Token ID13
Feature activation-4.75e-02

Pos logit contributions
 biggest+0.099
 accomplishment+0.098
 top+0.097
 chores+0.095
 anything+0.094
Neg logit contributions
o-0.098
y-0.095
os-0.089
ppy-0.087
jo-0.086
Token
Token ID220
Feature activation-2.93e-02

Pos logit contributions
o+0.132
ppy+0.130
jo+0.128
os+0.127
y+0.124
Neg logit contributions
 top-0.077
 new-0.073
 biggest-0.069
 something-0.067
 many-0.067
Token
Token ID198
Feature activation-8.77e-03

Pos logit contributions
o+0.077
ppy+0.074
jo+0.074
y+0.072
os+0.071
Neg logit contributions
 top-0.047
 new-0.045
 anything-0.045
 biggest-0.044
 chores-0.044
Token feather
Token ID17972
Feature activation-7.38e-02

Pos logit contributions
 chores+0.086
 anything+0.083
 top+0.082
 accomplishment+0.080
 biggest+0.078
Neg logit contributions
o-0.190
y-0.177
jo-0.176
ppy-0.173
os-0.168
Token like
Token ID588
Feature activation+7.94e-02

Pos logit contributions
o+0.200
y+0.184
ppy+0.184
jo+0.184
os+0.181
Neg logit contributions
 top-0.117
 chores-0.116
 anything-0.114
 biggest-0.113
 accomplishment-0.113
Token it
Token ID340
Feature activation-4.13e-02

Pos logit contributions
 chores+0.113
 accomplishment+0.102
 biggest+0.100
 progress+0.099
 tasks+0.098
Neg logit contributions
o-0.221
y-0.211
ppy-0.206
jo-0.204
os-0.201
Token.
Token ID13
Feature activation-5.09e-02

Pos logit contributions
o+0.117
y+0.109
ppy+0.109
jo+0.108
os+0.107
Neg logit contributions
 chores-0.060
 top-0.060
 accomplishment-0.059
 biggest-0.058
 anything-0.057
Token She
Token ID1375
Feature activation-3.13e-02

Pos logit contributions
o+0.134
y+0.130
ppy+0.129
jo+0.128
os+0.125
Neg logit contributions
 top-0.082
 biggest-0.078
 anything-0.078
 new-0.077
 accomplishment-0.076
Token felt
Token ID2936
Feature activation+8.23e-02

Pos logit contributions
o+0.066
jo+0.063
ppy+0.061
os+0.061
azz+0.060
Neg logit contributions
 top-0.067
 new-0.065
 accomplishment-0.065
 great-0.064
 most-0.063
Token so
Token ID523
Feature activation+4.49e-02

Pos logit contributions
 avail+0.154
 chores+0.154
 tasks+0.149
 top+0.148
 bottom+0.141
Neg logit contributions
o-0.180
y-0.170
ppy-0.167
bug-0.162
bo-0.155
Token happy
Token ID3772
Feature activation+3.04e-02

Pos logit contributions
 chores+0.069
 avail+0.065
 anything+0.065
 top+0.063
 tasks+0.063
Neg logit contributions
o-0.117
y-0.108
ppy-0.105
jo-0.104
os-0.104
Token to
Token ID284
Feature activation+7.30e-02

Pos logit contributions
 top+0.054
 chores+0.053
 anything+0.052
 biggest+0.052
 accomplishment+0.051
Neg logit contributions
o-0.078
jo-0.072
y-0.071
os-0.071
ppy-0.070
Token have
Token ID423
Feature activation+6.89e-02

Pos logit contributions
 chores+0.126
 top+0.125
 biggest+0.124
 anything+0.124
 accomplishment+0.120
Neg logit contributions
o-0.185
jo-0.172
ppy-0.170
y-0.168
os-0.167
Token saved
Token ID7448
Feature activation+9.61e-02

Pos logit contributions
 top+0.156
 chores+0.155
 accomplishment+0.150
 biggest+0.150
 bottom+0.149
Neg logit contributions
o-0.144
jo-0.127
y-0.127
ppy-0.126
os-0.125
Token They
Token ID1119
Feature activation-2.79e-02

Pos logit contributions
o+0.115
jo+0.107
ppy+0.107
y+0.106
os+0.105
Neg logit contributions
 top-0.089
 anything-0.087
 chores-0.085
 biggest-0.085
 new-0.083
Token keep
Token ID1394
Feature activation+8.73e-02

Pos logit contributions
o+0.074
jo+0.069
os+0.068
y+0.068
ppy+0.067
Neg logit contributions
 top-0.048
 anything-0.046
 chores-0.046
 biggest-0.045
 accomplishment-0.045
Token playing
Token ID2712
Feature activation+4.75e-02

Pos logit contributions
 top+0.139
 biggest+0.137
 bottom+0.135
 chores+0.132
 anything+0.132
Neg logit contributions
o-0.236
y-0.205
os-0.204
jo-0.198
ppy-0.191
Token soccer
Token ID11783
Feature activation-1.83e-02

Pos logit contributions
 biggest+0.096
 avail+0.092
 chores+0.089
 beauty+0.089
 cheapest+0.088
Neg logit contributions
o-0.094
y-0.094
ppy-0.085
os-0.082
jo-0.079
Token and
Token ID290
Feature activation+1.35e-02

Pos logit contributions
o+0.041
y+0.038
os+0.036
ppy+0.036
jo+0.035
Neg logit contributions
 chores-0.038
 biggest-0.038
 anything-0.037
 top-0.037
 accomplishment-0.037
Token having
Token ID1719
Feature activation+9.30e-02

Pos logit contributions
 biggest+0.022
 top+0.022
 chores+0.022
 anything+0.021
 accomplishment+0.021
Neg logit contributions
o-0.035
os-0.032
y-0.032
ppy-0.031
jo-0.031
Token fun
Token ID1257
Feature activation+2.74e-02

Pos logit contributions
 accomplishment+0.083
 chores+0.080
 top+0.079
 bottom+0.079
 biggest+0.079
Neg logit contributions
o-0.301
jo-0.271
y-0.270
os-0.270
ppy-0.265
Token.
Token ID13
Feature activation-4.71e-02

Pos logit contributions
 biggest+0.060
 chores+0.059
 top+0.059
 accomplishment+0.058
 anything+0.057
Neg logit contributions
o-0.057
y-0.053
jo-0.051
os-0.050
ppy-0.050
Token<|endoftext|>
Token ID50256
Feature activation-4.90e-02

Pos logit contributions
o+0.114
jo+0.106
ppy+0.104
os+0.104
y+0.104
Neg logit contributions
 top-0.091
 anything-0.089
 chores-0.088
 biggest-0.087
 accomplishment-0.087
TokenAnna
Token ID31160
Feature activation-1.42e-01

Pos logit contributions
o+0.127
jo+0.119
ppy+0.117
os+0.116
y+0.116
Neg logit contributions
 top-0.085
 anything-0.083
 chores-0.081
 biggest-0.081
 accomplishment-0.079
Token liked
Token ID8288
Feature activation+6.76e-02

Pos logit contributions
o+0.335
jo+0.322
ppy+0.312
os+0.310
y+0.298
Neg logit contributions
 top-0.261
 anything-0.259
 chores-0.253
 new-0.252
 trouble-0.252
Token and
Token ID290
Feature activation+4.47e-03

Pos logit contributions
 biggest+0.034
 chores+0.034
 anything+0.033
 accomplishment+0.032
 top+0.032
Neg logit contributions
o-0.069
y-0.064
os-0.064
jo-0.064
ppy-0.062
Token opened
Token ID4721
Feature activation+6.34e-02

Pos logit contributions
 top+0.010
 anything+0.010
 chores+0.010
 biggest+0.010
 accomplishment+0.010
Neg logit contributions
o-0.009
jo-0.009
ppy-0.008
os-0.008
y-0.008
Token the
Token ID262
Feature activation+6.34e-02

Pos logit contributions
 chores+0.095
 biggest+0.091
 tasks+0.084
 most+0.083
 accomplishment+0.083
Neg logit contributions
o-0.175
y-0.164
os-0.159
jo-0.156
ppy-0.149
Token box
Token ID3091
Feature activation-6.89e-02

Pos logit contributions
 anything+0.108
 accomplishment+0.108
 chores+0.107
 top+0.103
 biggest+0.100
Neg logit contributions
o-0.156
y-0.146
jo-0.144
ppy-0.141
os-0.140
Token.
Token ID13
Feature activation-5.00e-02

Pos logit contributions
o+0.160
jo+0.146
y+0.146
os+0.145
ppy+0.143
Neg logit contributions
 anything-0.136
 top-0.136
 chores-0.135
 biggest-0.133
 accomplishment-0.131
Token Inside
Token ID14384
Feature activation+1.79e-02

Pos logit contributions
o+0.131
y+0.128
ppy+0.127
jo+0.126
os+0.123
Neg logit contributions
 top-0.083
 anything-0.077
 new-0.077
 biggest-0.076
 something-0.075
Token was
Token ID373
Feature activation+6.86e-02

Pos logit contributions
 chores+0.034
 biggest+0.033
 accomplishment+0.033
 anything+0.033
 progress+0.033
Neg logit contributions
o-0.044
y-0.041
os-0.040
jo-0.039
ppy-0.037
Token a
Token ID257
Feature activation+5.34e-02

Pos logit contributions
 accomplishment+0.077
 chores+0.073
 avail+0.070
 bottom+0.067
 progress+0.067
Neg logit contributions
o-0.215
os-0.192
y-0.190
jo-0.190
ppy-0.187
Token surprise
Token ID5975
Feature activation-1.15e-02

Pos logit contributions
 anything+0.060
 chores+0.057
 top+0.054
 accomplishment+0.054
 biggest+0.052
Neg logit contributions
o-0.166
y-0.155
jo-0.154
ppy-0.149
os-0.147
Token!
Token ID0
Feature activation-5.18e-02

Pos logit contributions
o+0.030
jo+0.028
ppy+0.027
os+0.027
y+0.027
Neg logit contributions
 top-0.020
 anything-0.020
 chores-0.019
 accomplishment-0.019
 biggest-0.019
Token A
Token ID317
Feature activation+2.89e-02

Pos logit contributions
o+0.137
y+0.134
ppy+0.134
jo+0.131
os+0.128
Neg logit contributions
 top-0.084
 anything-0.078
 something-0.077
 new-0.077
 trouble-0.076
Token special
Token ID2041
Feature activation+4.10e-02

Pos logit contributions
 top+0.078
 bottom+0.074
 chores+0.073
 anything+0.072
 accomplishment+0.068
Neg logit contributions
o-0.243
y-0.233
jo-0.226
os-0.223
ppy-0.219
Token person
Token ID1048
Feature activation-1.03e-01

Pos logit contributions
 top+0.050
 anything+0.049
 chores+0.049
 biggest+0.047
 accomplishment+0.046
Neg logit contributions
o-0.124
jo-0.117
y-0.116
os-0.114
ppy-0.112
Token she
Token ID673
Feature activation-2.96e-02

Pos logit contributions
o+0.249
jo+0.234
ppy+0.229
os+0.227
y+0.219
Neg logit contributions
 anything-0.194
 new-0.186
 top-0.183
 something-0.182
 chores-0.181
Token knew
Token ID2993
Feature activation+2.97e-02

Pos logit contributions
o+0.056
jo+0.052
os+0.050
ppy+0.049
y+0.048
Neg logit contributions
 anything-0.069
 top-0.069
 great-0.069
 new-0.068
 amazing-0.068
Token:
Token ID25
Feature activation+1.01e-02

Pos logit contributions
 top+0.050
 biggest+0.049
 accomplishment+0.048
 chores+0.048
 anything+0.048
Neg logit contributions
o-0.076
jo-0.071
os-0.070
y-0.070
ppy-0.070
Token her
Token ID607
Feature activation+1.60e-02

Pos logit contributions
 top+0.017
 anything+0.017
 chores+0.016
 biggest+0.016
 accomplishment+0.016
Neg logit contributions
o-0.027
jo-0.025
ppy-0.025
y-0.025
os-0.025
Token dad
Token ID9955
Feature activation-1.06e-01

Pos logit contributions
 anything+0.021
 top+0.020
 chores+0.020
 biggest+0.019
 accomplishment+0.019
Neg logit contributions
o-0.047
jo-0.044
y-0.043
os-0.043
ppy-0.043
Token.
Token ID13
Feature activation-4.98e-02

Pos logit contributions
o+0.225
jo+0.211
ppy+0.208
os+0.205
azz+0.199
Neg logit contributions
 anything-0.219
 top-0.218
 new-0.215
 chores-0.213
 something-0.212
Token She
Token ID1375
Feature activation-3.42e-02

Pos logit contributions
o+0.126
ppy+0.122
y+0.118
jo+0.118
os+0.115
Neg logit contributions
 top-0.087
 anything-0.081
 new-0.080
 biggest-0.079
 something-0.078
Token was
Token ID373
Feature activation+6.42e-02

Pos logit contributions
o+0.072
jo+0.068
ppy+0.067
os+0.066
azz+0.064
Neg logit contributions
 top-0.073
 accomplishment-0.070
 new-0.070
 great-0.069
 something-0.068
Token a
Token ID257
Feature activation+5.25e-02

Pos logit contributions
 top+0.098
 chores+0.097
 anything+0.095
 bottom+0.094
 tasks+0.093
Neg logit contributions
o-0.182
y-0.161
os-0.159
jo-0.159
ppy-0.157
Token and
Token ID290
Feature activation+6.63e-03

Pos logit contributions
o+0.198
ppy+0.187
os+0.182
jo+0.181
azz+0.175
Neg logit contributions
 top-0.137
 anything-0.130
 new-0.128
 accomplishment-0.126
 bottom-0.125
Token continued
Token ID3767
Feature activation+4.47e-02

Pos logit contributions
 top+0.012
 anything+0.012
 biggest+0.012
 chores+0.012
 accomplishment+0.012
Neg logit contributions
o-0.016
jo-0.015
ppy-0.015
y-0.015
os-0.015
Token his
Token ID465
Feature activation+4.15e-02

Pos logit contributions
 biggest+0.078
 top+0.078
 anything+0.076
 accomplishment+0.076
 chores+0.075
Neg logit contributions
o-0.115
jo-0.105
y-0.104
os-0.103
ppy-0.100
Token patrol
Token ID13969
Feature activation-2.61e-04

Pos logit contributions
 top+0.057
 anything+0.056
 biggest+0.054
 chores+0.053
 accomplishment+0.053
Neg logit contributions
o-0.122
jo-0.115
os-0.112
y-0.112
ppy-0.112
Token.
Token ID13
Feature activation-4.83e-02

Pos logit contributions
o+0.001
jo+0.001
y+0.001
os+0.001
ppy+0.001
Neg logit contributions
 biggest-0.000
 chores-0.000
 top-0.000
 accomplishment-0.000
 anything-0.000
Token The
Token ID383
Feature activation+3.55e-02

Pos logit contributions
o+0.122
jo+0.113
ppy+0.111
os+0.111
y+0.111
Neg logit contributions
 top-0.088
 anything-0.086
 chores-0.084
 biggest-0.084
 accomplishment-0.083
Token guard
Token ID4860
Feature activation-1.35e-01

Pos logit contributions
 anything+0.048
 chores+0.047
 accomplishment+0.045
 top+0.045
 biggest+0.045
Neg logit contributions
o-0.103
jo-0.096
y-0.094
os-0.094
ppy-0.093
Token was
Token ID373
Feature activation+6.02e-02

Pos logit contributions
o+0.290
ppy+0.279
jo+0.264
azz+0.261
os+0.255
Neg logit contributions
 top-0.280
 new-0.271
 most-0.266
 anything-0.259
 many-0.254
Token never
Token ID1239
Feature activation+5.59e-02

Pos logit contributions
 chores+0.098
 tasks+0.096
 top+0.093
 avail+0.091
 accomplishment+0.090
Neg logit contributions
o-0.167
y-0.146
jo-0.146
os-0.144
ppy-0.139
Token scared
Token ID12008
Feature activation+6.85e-02

Pos logit contributions
 top+0.102
 chores+0.098
 biggest+0.096
 accomplishment+0.096
 bottom+0.094
Neg logit contributions
o-0.144
ppy-0.131
jo-0.131
y-0.130
os-0.129
Token again
Token ID757
Feature activation+1.09e-02

Pos logit contributions
 chores+0.133
 top+0.131
 accomplishment+0.129
 biggest+0.128
 anything+0.122
Neg logit contributions
o-0.160
y-0.153
jo-0.147
ppy-0.147
os-0.144
Token her
Token ID607
Feature activation+4.88e-03

Pos logit contributions
 top+0.071
 biggest+0.071
 accomplishment+0.071
 chores+0.068
 anything+0.067
Neg logit contributions
o-0.088
ppy-0.082
y-0.081
jo-0.081
os-0.079
Token.
Token ID13
Feature activation-5.78e-02

Pos logit contributions
 top+0.009
 anything+0.009
 biggest+0.009
 accomplishment+0.009
 chores+0.009
Neg logit contributions
o-0.012
jo-0.011
y-0.011
ppy-0.011
os-0.011
Token So
Token ID1406
Feature activation-4.04e-03

Pos logit contributions
o+0.151
jo+0.142
ppy+0.141
os+0.141
y+0.138
Neg logit contributions
 top-0.102
 new-0.097
 anything-0.095
 biggest-0.094
 something-0.093
Token,
Token ID11
Feature activation-1.71e-02

Pos logit contributions
o+0.009
y+0.009
ppy+0.008
jo+0.008
os+0.008
Neg logit contributions
 biggest-0.008
 chores-0.008
 top-0.008
 anything-0.007
 accomplishment-0.007
Token Lily
Token ID20037
Feature activation-1.21e-01

Pos logit contributions
o+0.041
y+0.038
ppy+0.037
jo+0.037
os+0.037
Neg logit contributions
 biggest-0.032
 top-0.031
 accomplishment-0.031
 chores-0.031
 anything-0.031
Token decided
Token ID3066
Feature activation+3.58e-03

Pos logit contributions
jo+0.260
o+0.259
os+0.255
azz+0.251
itty+0.238
Neg logit contributions
 top-0.233
 something-0.215
 chores-0.211
 trouble-0.209
 good-0.209
Token to
Token ID284
Feature activation+4.04e-02

Pos logit contributions
 top+0.007
 anything+0.007
 biggest+0.007
 chores+0.007
 accomplishment+0.007
Neg logit contributions
o-0.008
jo-0.007
y-0.007
ppy-0.007
os-0.007
Token try
Token ID1949
Feature activation+3.90e-02

Pos logit contributions
 top+0.064
 anything+0.064
 biggest+0.063
 accomplishment+0.062
 chores+0.062
Neg logit contributions
o-0.109
jo-0.101
ppy-0.100
os-0.099
y-0.099
Token to
Token ID284
Feature activation+5.11e-02

Pos logit contributions
 biggest+0.066
 top+0.066
 accomplishment+0.065
 anything+0.064
 chores+0.063
Neg logit contributions
o-0.102
y-0.093
jo-0.092
os-0.092
ppy-0.091
Token fix
Token ID4259
Feature activation+4.46e-02

Pos logit contributions
 anything+0.084
 biggest+0.084
 accomplishment+0.083
 top+0.082
 chores+0.079
Neg logit contributions
o-0.132
ppy-0.122
jo-0.120
os-0.119
y-0.119
Token the
Token ID262
Feature activation+6.53e-02

Pos logit contributions
 accomplishment+0.053
 biggest+0.051
 bottom+0.048
 chores+0.047
 top+0.046
Neg logit contributions
o-0.131
y-0.124
ppy-0.121
os-0.121
jo-0.120
Token the
Token ID262
Feature activation+8.60e-02

Pos logit contributions
 biggest+0.152
 chores+0.132
 bottom+0.130
 cheapest+0.129
 top+0.127
Neg logit contributions
o-0.207
jo-0.202
y-0.201
os-0.185
ppy-0.181
Token perfect
Token ID2818
Feature activation+6.08e-02

Pos logit contributions
 chores+0.104
 anything+0.102
 accomplishment+0.099
 top+0.094
 biggest+0.092
Neg logit contributions
o-0.255
jo-0.243
y-0.241
ppy-0.234
os-0.230
Token sky
Token ID6766
Feature activation-3.65e-02

Pos logit contributions
 chores+0.074
 anything+0.071
 biggest+0.070
 top+0.067
 trouble+0.066
Neg logit contributions
o-0.178
y-0.177
jo-0.174
ppy-0.167
os-0.165
Token.
Token ID13
Feature activation-4.92e-02

Pos logit contributions
o+0.091
jo+0.084
os+0.083
ppy+0.083
y+0.082
Neg logit contributions
 top-0.066
 anything-0.065
 chores-0.064
 biggest-0.064
 accomplishment-0.063
Token<|endoftext|>
Token ID50256
Feature activation-5.38e-02

Pos logit contributions
o+0.130
ppy+0.126
y+0.124
jo+0.122
os+0.121
Neg logit contributions
 top-0.079
 anything-0.077
 new-0.075
 biggest-0.074
 accomplishment-0.073
TokenOnce
Token ID7454
Feature activation+8.52e-03

Pos logit contributions
o+0.147
ppy+0.145
jo+0.143
os+0.137
y+0.137
Neg logit contributions
 top-0.083
 anything-0.077
 bottom-0.074
 new-0.074
 trouble-0.072
Token there
Token ID612
Feature activation-1.97e-02

Pos logit contributions
 top+0.015
 anything+0.015
 new+0.015
 chores+0.015
 biggest+0.015
Neg logit contributions
o-0.021
jo-0.020
ppy-0.020
os-0.019
y-0.019
Token was
Token ID373
Feature activation+6.74e-02

Pos logit contributions
o+0.040
jo+0.036
y+0.035
os+0.035
ppy+0.035
Neg logit contributions
 top-0.045
 anything-0.045
 chores-0.045
 biggest-0.044
 accomplishment-0.044
Token a
Token ID257
Feature activation+2.12e-02

Pos logit contributions
 top+0.060
 accomplishment+0.058
 anything+0.058
 bottom+0.056
 chores+0.055
Neg logit contributions
o-0.232
y-0.214
jo-0.211
ppy-0.210
os-0.209
Token bald
Token ID28273
Feature activation-4.83e-02

Pos logit contributions
 new+0.022
 top+0.021
 great+0.020
 good+0.020
 biggest+0.018
Neg logit contributions
os-0.072
o-0.071
jo-0.070
ppy-0.068
y-0.065
Token man
Token ID582
Feature activation-1.31e-01

Pos logit contributions
o+0.141
os+0.140
jo+0.129
ppy+0.126
azz+0.122
Neg logit contributions
 top-0.072
 new-0.068
 bottom-0.064
 ancient-0.063
 accomplishment-0.061
Token He
Token ID679
Feature activation-3.54e-02

Pos logit contributions
o+0.109
os+0.098
jo+0.098
y+0.096
ppy+0.094
Neg logit contributions
 anything-0.094
 top-0.094
 chores-0.093
 accomplishment-0.093
 biggest-0.092
Token liked
Token ID8288
Feature activation+7.69e-02

Pos logit contributions
o+0.077
os+0.076
jo+0.076
ppy+0.075
azz+0.073
Neg logit contributions
 top-0.070
 new-0.070
 great-0.068
 biggest-0.068
 most-0.068
Token to
Token ID284
Feature activation+5.89e-02

Pos logit contributions
 top+0.168
 accomplishment+0.166
 anything+0.163
 chores+0.163
 biggest+0.162
Neg logit contributions
o-0.170
y-0.146
os-0.146
jo-0.145
ppy-0.139
Token play
Token ID711
Feature activation+5.13e-02

Pos logit contributions
 top+0.104
 anything+0.103
 biggest+0.101
 accomplishment+0.100
 chores+0.100
Neg logit contributions
o-0.150
jo-0.137
ppy-0.136
os-0.136
y-0.135
Token with
Token ID351
Feature activation+5.75e-02

Pos logit contributions
 accomplishment+0.102
 progress+0.098
 chores+0.098
 biggest+0.098
 trouble+0.097
Neg logit contributions
o-0.121
y-0.109
ppy-0.103
os-0.102
jo-0.098
Token it
Token ID340
Feature activation-4.78e-02

Pos logit contributions
 accomplishment+0.084
 biggest+0.077
 progress+0.077
 chores+0.075
 trouble+0.074
Neg logit contributions
o-0.170
y-0.155
os-0.154
jo-0.151
ppy-0.148
Token in
Token ID287
Feature activation+6.04e-02

Pos logit contributions
o+0.123
y+0.116
ppy+0.111
os+0.108
jo+0.106
Neg logit contributions
 accomplishment-0.084
 progress-0.079
 trouble-0.078
 yourselves-0.078
 chores-0.078
Token the
Token ID262
Feature activation+6.58e-02

Pos logit contributions
 accomplishment+0.111
rier+0.108
 progress+0.103
 biggest+0.097
 chores+0.096
Neg logit contributions
o-0.153
y-0.132
os-0.128
my-0.123
jo-0.121
Token water
Token ID1660
Feature activation-5.35e-02

Pos logit contributions
 accomplishment+0.096
 anything+0.085
 progress+0.081
 top+0.078
 chores+0.077
Neg logit contributions
o-0.201
y-0.184
jo-0.173
ppy-0.172
os-0.172
Token.
Token ID13
Feature activation-4.22e-02

Pos logit contributions
o+0.124
y+0.113
os+0.110
ppy+0.108
jo+0.107
Neg logit contributions
 accomplishment-0.110
 top-0.108
 chores-0.108
 anything-0.107
 biggest-0.107
Token One
Token ID1881
Feature activation+1.51e-02

Pos logit contributions
o+0.107
ppy+0.102
jo+0.102
y+0.100
os+0.099
Neg logit contributions
 top-0.076
 anything-0.072
 biggest-0.072
 new-0.071
 chores-0.070
Token it
Token ID340
Feature activation-4.26e-02

Pos logit contributions
o+0.014
ppy+0.013
y+0.013
os+0.012
jo+0.012
Neg logit contributions
 chores-0.008
 top-0.008
 biggest-0.007
 accomplishment-0.007
 progress-0.007
Token is
Token ID318
Feature activation+7.30e-02

Pos logit contributions
jo+0.111
o+0.107
ppy+0.105
os+0.102
azz+0.102
Neg logit contributions
 top-0.077
 biggest-0.073
 anything-0.072
 new-0.071
 great-0.070
Token ours
Token ID16903
Feature activation+2.03e-03

Pos logit contributions
 accomplishment+0.121
 top+0.118
 progress+0.118
 chores+0.117
 middle+0.117
Neg logit contributions
o-0.193
ppy-0.170
y-0.168
ety-0.157
os-0.155
Token now
Token ID783
Feature activation+4.48e-02

Pos logit contributions
 top+0.004
 chores+0.004
 accomplishment+0.004
 anything+0.004
 biggest+0.004
Neg logit contributions
o-0.004
y-0.004
ppy-0.004
os-0.004
jo-0.004
Token!"
Token ID2474
Feature activation-4.61e-02

Pos logit contributions
 top+0.092
 anything+0.090
 chores+0.089
 biggest+0.089
 accomplishment+0.088
Neg logit contributions
o-0.103
jo-0.094
y-0.092
ppy-0.092
os-0.092
Token one
Token ID530
Feature activation-3.67e-02

Pos logit contributions
o+0.096
jo+0.086
azz+0.083
y+0.083
os+0.082
Neg logit contributions
 biggest-0.099
 chores-0.098
 top-0.096
 bottom-0.095
 accomplishment-0.095
Token of
Token ID286
Feature activation+1.87e-02

Pos logit contributions
o+0.105
jo+0.102
ppy+0.101
os+0.098
anna+0.095
Neg logit contributions
 new-0.052
 top-0.051
 good-0.050
 chores-0.048
 many-0.046
Token the
Token ID262
Feature activation+4.00e-02

Pos logit contributions
 accomplishment+0.027
 biggest+0.026
 progress+0.025
 chores+0.025
 tasks+0.024
Neg logit contributions
o-0.053
y-0.049
ppy-0.048
os-0.047
jo-0.045
Token big
Token ID1263
Feature activation+6.85e-03

Pos logit contributions
 anything+0.060
 avail+0.059
 accomplishment+0.057
 such+0.053
 top+0.052
Neg logit contributions
o-0.106
y-0.100
os-0.097
jo-0.093
ppy-0.092
Token kids
Token ID3988
Feature activation-1.52e-01

Pos logit contributions
 top+0.011
 anything+0.011
 chores+0.011
 accomplishment+0.011
 biggest+0.011
Neg logit contributions
o-0.019
y-0.017
jo-0.017
os-0.017
ppy-0.017
Token says
Token ID1139
Feature activation-2.82e-02

Pos logit contributions
o+0.387
jo+0.382
ppy+0.359
y+0.350
os+0.345
Neg logit contributions
 top-0.277
 chores-0.273
 anything-0.266
 bottom-0.257
 new-0.254
Tokenie
Token ID494
Feature activation-1.14e-01

Pos logit contributions
azz+0.320
o+0.311
ppy+0.304
os+0.303
jo+0.303
Neg logit contributions
 new-0.261
 top-0.256
 amazing-0.248
 most-0.245
 ancient-0.239
Token was
Token ID373
Feature activation+5.63e-02

Pos logit contributions
o+0.258
azz+0.257
jo+0.252
ppy+0.247
os+0.244
Neg logit contributions
 top-0.205
 new-0.192
 most-0.183
 amazing-0.180
 many-0.180
Token relieved
Token ID22598
Feature activation+5.56e-02

Pos logit contributions
 chores+0.092
 tasks+0.090
 top+0.089
 avail+0.085
 anything+0.085
Neg logit contributions
o-0.159
ppy-0.137
y-0.137
jo-0.136
os-0.136
Token and
Token ID290
Feature activation+2.80e-03

Pos logit contributions
 top+0.109
 anything+0.108
 accomplishment+0.104
 new+0.104
 something+0.103
Neg logit contributions
o-0.133
jo-0.124
os-0.122
ppy-0.121
y-0.119
Token grateful
Token ID14066
Feature activation+6.13e-02

Pos logit contributions
 top+0.006
 chores+0.006
 anything+0.006
 biggest+0.006
 bottom+0.006
Neg logit contributions
o-0.006
jo-0.005
os-0.005
y-0.005
ppy-0.005
Token.
Token ID13
Feature activation-5.68e-02

Pos logit contributions
 top+0.088
 anything+0.088
 new+0.086
 accomplishment+0.085
 something+0.083
Neg logit contributions
o-0.178
jo-0.166
os-0.166
ppy-0.165
y-0.161
Token She
Token ID1375
Feature activation-3.79e-02

Pos logit contributions
o+0.151
ppy+0.149
y+0.147
jo+0.145
os+0.144
Neg logit contributions
 top-0.091
 new-0.086
 biggest-0.083
 many-0.082
 anything-0.081
Token knew
Token ID2993
Feature activation+2.41e-02

Pos logit contributions
o+0.081
os+0.078
jo+0.078
y+0.076
ppy+0.073
Neg logit contributions
 top-0.081
 new-0.080
 great-0.079
 amazing-0.079
 accomplishment-0.078
Token she
Token ID673
Feature activation-3.27e-02

Pos logit contributions
 top+0.039
 chores+0.038
 biggest+0.038
 bottom+0.037
 accomplishment+0.036
Neg logit contributions
o-0.065
jo-0.059
ppy-0.059
os-0.059
y-0.058
Token had
Token ID550
Feature activation+5.59e-02

Pos logit contributions
o+0.067
os+0.064
azz+0.063
jo+0.063
aa+0.062
Neg logit contributions
 amazing-0.067
 great-0.066
 most-0.066
 new-0.065
 anything-0.065
Token found
Token ID1043
Feature activation+6.47e-02

Pos logit contributions
 top+0.113
 biggest+0.108
 accomplishment+0.108
 chores+0.107
 bottom+0.107
Neg logit contributions
o-0.132
ppy-0.118
jo-0.116
y-0.116
os-0.115
Tokens
Token ID82
Feature activation+2.32e-02

Pos logit contributions
ppy+0.168
jo+0.161
o+0.157
azz+0.145
itty+0.142
Neg logit contributions
 top-0.182
 anything-0.176
 bottom-0.174
 most-0.167
 many-0.167
Token find
Token ID1064
Feature activation+6.56e-02

Pos logit contributions
 top+0.034
 anything+0.033
 chores+0.032
 biggest+0.032
 accomplishment+0.032
Neg logit contributions
o-0.067
jo-0.062
ppy-0.062
os-0.062
y-0.061
Token something
Token ID1223
Feature activation+3.43e-02

Pos logit contributions
 accomplishment+0.050
 top+0.050
 chores+0.047
 bottom+0.047
 biggest+0.044
Neg logit contributions
o-0.222
jo-0.208
y-0.206
ppy-0.206
os-0.205
Token better
Token ID1365
Feature activation+4.19e-02

Pos logit contributions
 top+0.035
 accomplishment+0.034
 chores+0.032
 anything+0.032
 biggest+0.031
Neg logit contributions
o-0.112
y-0.105
jo-0.104
os-0.103
ppy-0.102
Token for
Token ID329
Feature activation+5.43e-02

Pos logit contributions
 top+0.084
 biggest+0.081
 chores+0.081
 accomplishment+0.081
 anything+0.080
Neg logit contributions
o-0.097
y-0.089
jo-0.088
ppy-0.087
os-0.085
Token you
Token ID345
Feature activation-2.19e-02

Pos logit contributions
 biggest+0.080
 top+0.070
 middle+0.068
 most+0.067
 bottom+0.067
Neg logit contributions
o-0.147
ppy-0.137
jo-0.130
y-0.130
os-0.130
Token to
Token ID284
Feature activation+5.19e-02

Pos logit contributions
o+0.054
jo+0.050
os+0.049
ppy+0.049
y+0.049
Neg logit contributions
 top-0.040
 anything-0.040
 chores-0.039
 biggest-0.039
 accomplishment-0.038
Token do
Token ID466
Feature activation+7.48e-02

Pos logit contributions
 biggest+0.094
 accomplishment+0.090
 top+0.087
 anything+0.086
 most+0.085
Neg logit contributions
o-0.123
ppy-0.113
y-0.111
jo-0.110
os-0.109
Token.
Token ID13
Feature activation-4.80e-02

Pos logit contributions
 biggest+0.134
 accomplishment+0.129
 top+0.128
 bottom+0.124
 avail+0.122
Neg logit contributions
o-0.178
y-0.164
jo-0.161
os-0.160
ppy-0.159
Token How
Token ID1374
Feature activation+4.14e-02

Pos logit contributions
o+0.118
y+0.117
ppy+0.115
jo+0.114
os+0.112
Neg logit contributions
 top-0.083
 anything-0.080
 something-0.077
 biggest-0.076
 most-0.075
Token about
Token ID546
Feature activation+7.52e-02

Pos logit contributions
 top+0.080
 anything+0.079
 chores+0.077
 biggest+0.077
 accomplishment+0.076
Neg logit contributions
o-0.100
jo-0.092
ppy-0.090
os-0.090
y-0.090
Token of
Token ID286
Feature activation+6.09e-02

Pos logit contributions
 top+0.050
 anything+0.049
 biggest+0.047
 chores+0.047
 new+0.047
Neg logit contributions
o-0.079
jo-0.074
ppy-0.073
os-0.072
y-0.071
Token Lily
Token ID20037
Feature activation-1.07e-01

Pos logit contributions
 biggest+0.087
 chores+0.083
 tasks+0.075
 accomplishment+0.074
 progress+0.074
Neg logit contributions
o-0.163
y-0.155
jo-0.150
ppy-0.148
os-0.147
Token's
Token ID338
Feature activation+5.89e-02

Pos logit contributions
o+0.202
jo+0.191
os+0.183
azz+0.175
ppy+0.171
Neg logit contributions
 top-0.248
 new-0.239
 anything-0.238
 chores-0.236
 biggest-0.232
Token hands
Token ID2832
Feature activation-6.04e-02

Pos logit contributions
 anything+0.099
 top+0.098
 biggest+0.095
 chores+0.094
 accomplishment+0.094
Neg logit contributions
o-0.153
jo-0.143
y-0.140
ppy-0.140
os-0.139
Token.
Token ID13
Feature activation-4.97e-02

Pos logit contributions
o+0.156
jo+0.149
ppy+0.143
os+0.141
y+0.137
Neg logit contributions
 top-0.108
 anything-0.104
 chores-0.100
 biggest-0.100
 new-0.099
Token It
Token ID632
Feature activation-4.05e-02

Pos logit contributions
o+0.120
jo+0.110
os+0.109
ppy+0.108
y+0.108
Neg logit contributions
 top-0.095
 anything-0.094
 chores-0.092
 biggest-0.092
 accomplishment-0.091
Token flew
Token ID13112
Feature activation+2.98e-02

Pos logit contributions
o+0.098
jo+0.092
ppy+0.090
os+0.087
azz+0.084
Neg logit contributions
 top-0.080
 anything-0.077
 new-0.075
 most-0.074
 biggest-0.074
Token far
Token ID1290
Feature activation+5.12e-02

Pos logit contributions
 chores+0.050
 accomplishment+0.049
 anything+0.048
 top+0.048
 biggest+0.048
Neg logit contributions
o-0.079
y-0.074
jo-0.072
ppy-0.072
os-0.071
Token away
Token ID1497
Feature activation+3.87e-02

Pos logit contributions
 top+0.097
 anything+0.095
 chores+0.094
 biggest+0.093
 accomplishment+0.092
Neg logit contributions
o-0.125
jo-0.115
ppy-0.114
os-0.113
y-0.113
Token.
Token ID13
Feature activation-5.13e-02

Pos logit contributions
 chores+0.088
 biggest+0.085
 anything+0.084
 accomplishment+0.084
 top+0.083
Neg logit contributions
o-0.078
y-0.076
os-0.069
jo-0.068
ppy-0.068
Token Lily
Token ID20037
Feature activation-1.22e-01

Pos logit contributions
o+0.127
jo+0.117
os+0.115
ppy+0.115
y+0.115
Neg logit contributions
 top-0.094
 anything-0.093
 chores-0.093
 biggest-0.091
 accomplishment-0.091
Token bread
Token ID8509
Feature activation-6.35e-02

Pos logit contributions
 chores+0.088
 anything+0.087
 accomplishment+0.087
 top+0.087
 biggest+0.083
Neg logit contributions
o-0.207
y-0.201
ppy-0.192
os-0.191
jo-0.186
Token to
Token ID284
Feature activation+4.04e-02

Pos logit contributions
o+0.158
jo+0.157
ppy+0.146
os+0.145
y+0.141
Neg logit contributions
 top-0.114
 anything-0.109
 new-0.106
 biggest-0.106
 chores-0.104
Token it
Token ID340
Feature activation-5.07e-02

Pos logit contributions
 chores+0.041
 accomplishment+0.040
 biggest+0.040
 progress+0.037
 trouble+0.037
Neg logit contributions
o-0.131
ppy-0.123
y-0.121
os-0.119
jo-0.117
Token.
Token ID13
Feature activation-5.34e-02

Pos logit contributions
o+0.127
jo+0.121
ppy+0.117
os+0.116
y+0.113
Neg logit contributions
 top-0.094
 anything-0.090
 new-0.088
 biggest-0.088
 chores-0.088
Token The
Token ID383
Feature activation+3.25e-02

Pos logit contributions
o+0.120
os+0.107
jo+0.107
ppy+0.106
y+0.105
Neg logit contributions
 chores-0.109
 top-0.108
 anything-0.108
 accomplishment-0.108
 biggest-0.108
Token goose
Token ID37246
Feature activation-1.56e-01

Pos logit contributions
 anything+0.074
 accomplishment+0.073
 chores+0.072
 top+0.070
 tasks+0.069
Neg logit contributions
o-0.064
y-0.060
jo-0.054
ppy-0.054
os-0.053
Token caught
Token ID4978
Feature activation+4.17e-02

Pos logit contributions
jo+0.337
o+0.321
azz+0.309
ppy+0.307
os+0.289
Neg logit contributions
 top-0.351
 bottom-0.320
 new-0.318
 anything-0.317
 something-0.312
Token the
Token ID262
Feature activation+5.62e-02

Pos logit contributions
 chores+0.049
 accomplishment+0.048
 biggest+0.047
 top+0.047
 anything+0.046
Neg logit contributions
o-0.131
y-0.123
ppy-0.120
os-0.120
jo-0.118
Token bread
Token ID8509
Feature activation-7.69e-02

Pos logit contributions
 anything+0.088
 chores+0.087
 accomplishment+0.086
 top+0.083
 progress+0.079
Neg logit contributions
o-0.151
y-0.144
ppy-0.136
jo-0.134
os-0.131
Token and
Token ID290
Feature activation-3.83e-03

Pos logit contributions
o+0.170
jo+0.154
y+0.153
ppy+0.153
os+0.152
Neg logit contributions
 top-0.162
 anything-0.160
 chores-0.158
 biggest-0.158
 accomplishment-0.157
Token ate
Token ID15063
Feature activation+5.45e-02

Pos logit contributions
o+0.008
jo+0.007
os+0.007
ppy+0.007
y+0.007
Neg logit contributions
 top-0.009
 anything-0.009
 chores-0.009
 biggest-0.009
 accomplishment-0.009
Token something
Token ID1223
Feature activation+2.28e-02

Pos logit contributions
 accomplishment+0.075
 bottom+0.069
 top+0.067
 biggest+0.064
 progress+0.063
Neg logit contributions
o-0.220
ppy-0.201
jo-0.201
y-0.200
os-0.200
Token I
Token ID314
Feature activation+6.45e-03

Pos logit contributions
 top+0.030
 accomplishment+0.030
 bottom+0.028
 anything+0.028
 chores+0.028
Neg logit contributions
o-0.068
y-0.062
ppy-0.062
jo-0.062
os-0.061
Token can
Token ID460
Feature activation+3.85e-02

Pos logit contributions
 top+0.012
 anything+0.012
 chores+0.012
 biggest+0.011
 new+0.011
Neg logit contributions
o-0.016
jo-0.015
os-0.014
y-0.014
ppy-0.014
Token own
Token ID898
Feature activation+6.44e-02

Pos logit contributions
 accomplishment+0.060
 top+0.059
 anything+0.059
 biggest+0.059
 chores+0.057
Neg logit contributions
o-0.105
ppy-0.096
jo-0.096
os-0.096
y-0.096
Token?"
Token ID1701
Feature activation-3.69e-02

Pos logit contributions
 accomplishment+0.130
 biggest+0.124
 top+0.123
 chores+0.120
 progress+0.119
Neg logit contributions
o-0.152
os-0.135
y-0.134
jo-0.133
ppy-0.130
Token Mom
Token ID11254
Feature activation-1.26e-01

Pos logit contributions
o+0.089
ppy+0.085
jo+0.081
os+0.080
y+0.079
Neg logit contributions
 top-0.075
 anything-0.071
 new-0.070
 something-0.069
 chores-0.069
Token thought
Token ID1807
Feature activation-1.12e-02

Pos logit contributions
azz+0.266
jo+0.260
ppy+0.259
aa+0.255
uddy+0.247
Neg logit contributions
 the-0.225
 new-0.222
 top-0.210
 a-0.208
 bigger-0.206
Token for
Token ID329
Feature activation+3.15e-02

Pos logit contributions
o+0.030
jo+0.029
y+0.028
os+0.028
ppy+0.028
Neg logit contributions
 top-0.018
 biggest-0.017
 anything-0.017
 chores-0.016
 bottom-0.016
Token a
Token ID257
Feature activation+4.49e-02

Pos logit contributions
 bottom+0.038
 biggest+0.038
 accomplishment+0.037
 chores+0.036
 top+0.035
Neg logit contributions
o-0.084
ppy-0.080
y-0.079
jo-0.077
of-0.076
Token minute
Token ID5664
Feature activation-3.50e-02

Pos logit contributions
 top+0.073
 new+0.070
 anything+0.070
 chores+0.069
 biggest+0.069
Neg logit contributions
o-0.125
os-0.117
jo-0.115
ppy-0.115
y-0.112
Token and
Token ID290
Feature activation-6.36e-03

Pos logit contributions
o+0.091
jo+0.084
ppy+0.084
os+0.083
azz+0.079
Neg logit contributions
 top-0.059
 anything-0.059
 new-0.057
 accomplishment-0.057
 chores-0.057
Token Tom
Token ID4186
Feature activation-1.43e-01

Pos logit contributions
o+0.056
jo+0.052
ppy+0.051
os+0.051
y+0.051
Neg logit contributions
 top-0.040
 anything-0.040
 chores-0.039
 biggest-0.038
 accomplishment-0.038
Token says
Token ID1139
Feature activation-3.08e-02

Pos logit contributions
o+0.324
jo+0.309
os+0.304
ppy+0.294
itty+0.292
Neg logit contributions
 new-0.293
 chores-0.292
 trouble-0.289
 top-0.286
 something-0.278
Token.
Token ID13
Feature activation-5.94e-02

Pos logit contributions
o+0.085
jo+0.081
os+0.078
ppy+0.078
y+0.076
Neg logit contributions
 top-0.049
 anything-0.048
 new-0.048
 chores-0.047
 accomplishment-0.046
Token\n
Token ID198
Feature activation-2.26e-02

Pos logit contributions
o+0.152
jo+0.142
ppy+0.142
os+0.140
y+0.140
Neg logit contributions
 top-0.107
 anything-0.102
 new-0.100
 chores-0.100
 biggest-0.100
Token\n
Token ID198
Feature activation-2.20e-02

Pos logit contributions
o+0.058
ppy+0.056
jo+0.055
y+0.054
os+0.054
Neg logit contributions
 top-0.039
 new-0.037
 anything-0.037
 chores-0.036
 biggest-0.036
TokenMom
Token ID29252
Feature activation-1.17e-01

Pos logit contributions
o+0.054
jo+0.051
ppy+0.051
os+0.050
y+0.049
Neg logit contributions
 top-0.041
 anything-0.038
 new-0.038
 chores-0.038
 biggest-0.037
Token shakes
Token ID29959
Feature activation+4.63e-02

Pos logit contributions
o+0.288
jo+0.287
ppy+0.278
y+0.260
anna+0.260
Neg logit contributions
 top-0.212
 new-0.202
 chores-0.199
 trouble-0.197
 bottom-0.191
Token her
Token ID607
Feature activation+7.75e-03

Pos logit contributions
 biggest+0.066
 top+0.064
 chores+0.062
 accomplishment+0.062
 anything+0.062
Neg logit contributions
o-0.136
y-0.127
jo-0.126
os-0.124
ppy-0.124
Token head
Token ID1182
Feature activation-4.55e-02

Pos logit contributions
 top+0.013
 chores+0.012
 new+0.012
 anything+0.012
 biggest+0.012
Neg logit contributions
o-0.021
os-0.020
jo-0.020
ppy-0.019
y-0.019
Token and
Token ID290
Feature activation-1.56e-02

Pos logit contributions
o+0.124
jo+0.120
os+0.117
ppy+0.116
azz+0.110
Neg logit contributions
 top-0.075
 new-0.070
 anything-0.070
 chores-0.068
 something-0.067
Token picks
Token ID11103
Feature activation+6.15e-02

Pos logit contributions
o+0.032
jo+0.029
os+0.029
ppy+0.029
y+0.029
Neg logit contributions
 top-0.035
 anything-0.035
 chores-0.034
 biggest-0.034
 accomplishment-0.034
Token was
Token ID373
Feature activation+6.04e-02

Pos logit contributions
o+0.026
y+0.023
jo+0.023
os+0.023
ppy+0.022
Neg logit contributions
 top-0.032
 chores-0.032
 anything-0.032
 accomplishment-0.032
 biggest-0.032
Token a
Token ID257
Feature activation+2.70e-02

Pos logit contributions
 accomplishment+0.072
 progress+0.065
 anything+0.065
 bottom+0.063
 chores+0.063
Neg logit contributions
o-0.193
y-0.174
ppy-0.168
jo-0.165
os-0.164
Token little
Token ID1310
Feature activation+5.44e-03

Pos logit contributions
 top+0.031
 new+0.029
 biggest+0.029
 anything+0.028
 chores+0.028
Neg logit contributions
o-0.087
jo-0.083
os-0.082
ppy-0.081
y-0.080
Token boy
Token ID2933
Feature activation-1.26e-01

Pos logit contributions
 new+0.010
 top+0.010
 good+0.009
 biggest+0.009
 ancient+0.009
Neg logit contributions
o-0.014
jo-0.014
os-0.013
ppy-0.013
azz-0.012
Token named
Token ID3706
Feature activation-2.26e-02

Pos logit contributions
jo+0.272
azz+0.264
ppy+0.259
os+0.255
anna+0.249
Neg logit contributions
 many-0.229
 something-0.227
 new-0.227
 anything-0.227
 top-0.223
Token Tim
Token ID5045
Feature activation-1.25e-01

Pos logit contributions
o+0.054
jo+0.053
y+0.051
ppy+0.051
os+0.051
Neg logit contributions
 top-0.043
 anything-0.041
 new-0.041
 biggest-0.041
 chores-0.041
Tokenmy
Token ID1820
Feature activation-9.90e-02

Pos logit contributions
jo+0.166
os+0.166
anna+0.159
azz+0.158
oped+0.156
Neg logit contributions
 new-0.329
 anything-0.325
 many-0.322
 top-0.318
 something-0.313
Token.
Token ID13
Feature activation-3.93e-02

Pos logit contributions
os+0.278
jo+0.275
ppy+0.273
azz+0.271
o+0.265
Neg logit contributions
 new-0.126
 top-0.125
 many-0.122
 good-0.115
 something-0.114
Token Tim
Token ID5045
Feature activation-1.20e-01

Pos logit contributions
o+0.093
ppy+0.090
jo+0.089
y+0.087
os+0.086
Neg logit contributions
 top-0.075
 anything-0.072
 biggest-0.072
 chores-0.071
 new-0.071
Tokenmy
Token ID1820
Feature activation-9.68e-02

Pos logit contributions
azz+0.150
ppy+0.143
os+0.143
jo+0.137
aa+0.134
Neg logit contributions
 top-0.333
 great-0.332
 new-0.330
 most-0.328
 good-0.327
Token loved
Token ID6151
Feature activation+7.65e-02

Pos logit contributions
os+0.245
azz+0.244
ppy+0.242
of+0.228
jo+0.225
Neg logit contributions
 good-0.154
 great-0.153
 new-0.151
 many-0.151
 top-0.151
Token boy
Token ID2933
Feature activation-1.24e-01

Pos logit contributions
 new+0.010
 top+0.010
 biggest+0.010
 good+0.010
 ancient+0.009
Neg logit contributions
o-0.014
jo-0.014
os-0.014
ppy-0.013
azz-0.013
Token named
Token ID3706
Feature activation-2.29e-02

Pos logit contributions
jo+0.261
ppy+0.256
azz+0.255
os+0.250
o+0.240
Neg logit contributions
 something-0.236
 new-0.236
 top-0.235
 anything-0.234
 many-0.233
Token Tim
Token ID5045
Feature activation-1.25e-01

Pos logit contributions
o+0.056
jo+0.055
y+0.054
ppy+0.053
os+0.053
Neg logit contributions
 top-0.043
 new-0.041
 anything-0.041
 biggest-0.040
 chores-0.039
Tokenmy
Token ID1820
Feature activation-9.86e-02

Pos logit contributions
os+0.193
jo+0.190
azz+0.187
ppy+0.182
oped+0.182
Neg logit contributions
 new-0.306
 anything-0.301
 many-0.297
 top-0.296
 something-0.291
Token.
Token ID13
Feature activation-4.13e-02

Pos logit contributions
os+0.276
ppy+0.272
azz+0.271
jo+0.270
o+0.262
Neg logit contributions
 top-0.128
 new-0.128
 many-0.124
 something-0.118
 good-0.116
Token Tim
Token ID5045
Feature activation-1.21e-01

Pos logit contributions
o+0.098
ppy+0.096
jo+0.095
y+0.092
os+0.092
Neg logit contributions
 top-0.079
 anything-0.075
 biggest-0.074
 chores-0.074
 new-0.073
Tokenmy
Token ID1820
Feature activation-9.73e-02

Pos logit contributions
azz+0.144
os+0.140
ppy+0.139
jo+0.135
aa+0.128
Neg logit contributions
 top-0.337
 great-0.336
 new-0.335
 most-0.333
 biggest-0.333
Token was
Token ID373
Feature activation+6.46e-02

Pos logit contributions
ppy+0.241
os+0.241
azz+0.240
jo+0.228
of+0.224
Neg logit contributions
 great-0.159
 good-0.159
 top-0.158
 new-0.156
 many-0.154
Token shy
Token ID15800
Feature activation+1.37e-02

Pos logit contributions
 accomplishment+0.105
 avail+0.100
 top+0.098
 progress+0.097
 beauty+0.093
Neg logit contributions
o-0.191
y-0.159
ppy-0.155
os-0.152
itty-0.148
Token and
Token ID290
Feature activation-8.66e-03

Pos logit contributions
 top+0.027
 anything+0.027
 new+0.026
 biggest+0.026
 trouble+0.026
Neg logit contributions
o-0.033
jo-0.030
ppy-0.030
os-0.030
y-0.028
Token didn
Token ID1422
Feature activation+3.16e-02

Pos logit contributions
o+0.019
os+0.018
jo+0.018
ppy+0.018
y+0.018
Neg logit contributions
 top-0.017
 anything-0.017
 chores-0.017
 new-0.017
 biggest-0.017