TOP ACTIVATIONS
MAX = 15.932

and make sure you never get lost on your journey .
so glad I could deliver such joy to you , my
, so he could avoid getting another band age . <|endoftext|>
off and vowed never to get too close to a trap
, but his mustache kept getting in the way and preventing
so proud of herself for getting fit . It was all
sharing toys was better than getting angry or hating anyone .
He was determined to never get into such a disgusting situation
being persistent enough to find such a special place .
friends thanked her for being such a good leader . Lily
Thank you Jack for finding such a nice shelter ."
always be careful and never get caught again . <|endoftext|> Once
and her family will never get to see her again .
closely so he would never get lost again ! <|endoftext|> Baby
roller so we won 't get tired ." Brad
of her for picking out such a pretty dress . Lily
make sure he didn 't get lost . As he looked
and be careful not to get tired ." Sam smiled and
determined to never get into such a disgusting situation again !
worked so hard to bring such a wonderful surprise to them

MIN ACTIVATIONS
MIN = -19.406

. The great Zen o hugs Ben and says
saw a big box of Leg os that he really wanted
! â <|endoftext|> Pa olo was very excited . He
One day , the hipp o saw a little bird
age . The coach spoke stern ly .
One day , the hipp o saw a new friend
. The hipp o found some vegetables and
bird . So , the hipp o used its big mouth
me ." The coach smiled and said , "
But then , the hipp o went to play with
come with me ," the coach says . " I will
food . The hipp o went to the kitchen
very hungry . So the hipp o decided to cook some
." The prince went to the battlefield with
One day , the little hipp o wanted to drink some
white hipp o . The hipp o was so cute and
One day , the hipp o met a wise old
hungry !" The hipp o smiled and said ,
to cry . But the hipp o didn 't give up
One day , the hipp o saw something that caught

INTERVAL 7.098 - 15.932
CONTAINS 6.662%

love you too . You are right . Bat hing is
around the lake were lots of other re ind eer .
But they did not like to share . Lily wanted all
." But now Twe ety was gone and Jerry did not
. Stay calm and hold on to your doll ."

INTERVAL -1.737 - 7.098
CONTAINS 55.004%

is very nice . You are lucky . I can wait
Soon enough , they arrived at the store , and Jenny
the dog away . Lily was very happy that her mom
. " Now what do I do ?" he
a little boy who was very hungry . He had been

INTERVAL -10.572 - -1.737
CONTAINS 33.085%

was safe and that her friend liked it so much .
They poked them with their fingers . They said they were
He regretted it a lot . Mark had spent
it was safe . Then it told its mom my about
y ummy . " M mm , this is

INTERVAL -19.406 - -10.572
CONTAINS 5.249%

the mel on thanked the bird for its wonderful adventure .
and said , " B unny , will you come to
. The hunter thought of a plan .
put the iron , the curtain , the bracelet and the
ade !" The mole smiled and said ,
Token and
Token ID290
Feature activation+7.04e+00

Pos logit contributions
 anything+0.081
 chores+0.077
 something+0.074
 accomplishment+0.073
 trouble+0.073
Neg logit contributions
y-0.095
jo-0.090
o-0.089
os-0.086
ppy-0.085
Token make
Token ID787
Feature activation+8.91e+00

Pos logit contributions
 anything+0.115
 chores+0.107
 something+0.099
 trouble+0.099
 accomplishment+0.098
Neg logit contributions
y-0.173
o-0.165
jo-0.163
os-0.159
ppy-0.154
Token sure
Token ID1654
Feature activation+3.19e+00

Pos logit contributions
 anything+0.092
 chores+0.083
 accomplishment+0.078
 trouble+0.074
 something+0.074
Neg logit contributions
y-0.266
jo-0.258
o-0.256
os-0.250
ppy-0.243
Token you
Token ID345
Feature activation-7.75e-02

Pos logit contributions
 anything+0.042
 chores+0.039
 accomplishment+0.038
 trouble+0.037
 something+0.037
Neg logit contributions
y-0.088
o-0.084
jo-0.084
os-0.083
ne-0.080
Token never
Token ID1239
Feature activation+8.52e+00

Pos logit contributions
y+0.002
jo+0.002
o+0.002
azz+0.002
os+0.002
Neg logit contributions
 anything-0.001
 something-0.001
 chores-0.001
 trouble-0.001
 accomplishment-0.001
Token get
Token ID651
Feature activation+1.47e+01

Pos logit contributions
 anything+0.154
 something+0.140
 chores+0.136
 trouble+0.135
 accomplishment+0.126
Neg logit contributions
jo-0.208
y-0.206
o-0.201
os-0.198
azz-0.195
Token lost
Token ID2626
Feature activation+7.48e+00

Pos logit contributions
 anything+0.193
 chores+0.191
 accomplishment+0.185
wered+0.171
 something+0.168
Neg logit contributions
y-0.404
o-0.388
os-0.369
ppy-0.368
jo-0.367
Token on
Token ID319
Feature activation+1.10e+01

Pos logit contributions
 chores+0.138
 anything+0.136
 accomplishment+0.131
 something+0.126
 trouble+0.126
Neg logit contributions
y-0.173
jo-0.158
o-0.158
ppy-0.153
os-0.153
Token your
Token ID534
Feature activation+7.07e+00

Pos logit contributions
 accomplishment+0.094
 chores+0.088
wered+0.083
 anything+0.081
 avail+0.075
Neg logit contributions
y-0.332
jo-0.311
o-0.308
os-0.307
ne-0.302
Token journey
Token ID7002
Feature activation-1.23e+00

Pos logit contributions
 anything+0.101
 something+0.090
 chores+0.089
 trouble+0.085
 accomplishment+0.084
Neg logit contributions
y-0.186
jo-0.182
o-0.176
ppy-0.173
ne-0.171
Token.
Token ID13
Feature activation-2.56e+00

Pos logit contributions
y+0.027
jo+0.026
o+0.026
os+0.025
ppy+0.025
Neg logit contributions
 anything-0.024
 chores-0.023
 something-0.023
 trouble-0.022
 accomplishment-0.022
Token so
Token ID523
Feature activation+8.18e+00

Pos logit contributions
y+0.073
jo+0.069
ppy+0.067
o+0.067
os+0.067
Neg logit contributions
 anything-0.042
 something-0.040
 chores-0.039
 many-0.039
 himself-0.037
Token glad
Token ID9675
Feature activation+4.85e+00

Pos logit contributions
 anything+0.113
 chores+0.103
 trouble+0.094
 yourselves+0.092
 something+0.091
Neg logit contributions
y-0.227
o-0.218
jo-0.214
os-0.206
anne-0.204
Token I
Token ID314
Feature activation+1.97e+00

Pos logit contributions
 anything+0.078
 something+0.072
 chores+0.071
 trouble+0.068
 accomplishment+0.066
Neg logit contributions
y-0.126
jo-0.123
o-0.122
os-0.119
ppy-0.119
Token could
Token ID714
Feature activation+7.48e+00

Pos logit contributions
 anything+0.045
 something+0.042
 great+0.040
 good+0.040
 chores+0.040
Neg logit contributions
jo-0.037
y-0.037
azz-0.035
o-0.035
os-0.034
Token deliver
Token ID5203
Feature activation+8.47e+00

Pos logit contributions
 anything+0.117
 chores+0.105
 something+0.105
 trouble+0.103
 accomplishment+0.099
Neg logit contributions
y-0.193
o-0.188
jo-0.187
os-0.180
ppy-0.180
Token such
Token ID884
Feature activation+1.47e+01

Pos logit contributions
 anything+0.090
 chores+0.085
 accomplishment+0.079
 trouble+0.076
 something+0.074
Neg logit contributions
y-0.255
o-0.248
jo-0.247
os-0.243
ne-0.238
Token joy
Token ID8716
Feature activation+1.42e+00

Pos logit contributions
 anything+0.124
 chores+0.111
 accomplishment+0.100
wered+0.095
 yourselves+0.093
Neg logit contributions
y-0.451
jo-0.434
o-0.431
ne-0.418
ppy-0.415
Token to
Token ID284
Feature activation+1.01e+01

Pos logit contributions
 anything+0.027
 something+0.025
 chores+0.024
 trouble+0.023
 good+0.022
Neg logit contributions
jo-0.033
y-0.032
o-0.032
azz-0.031
os-0.031
Token you
Token ID345
Feature activation+1.03e+00

Pos logit contributions
 anything+0.120
 chores+0.114
 accomplishment+0.107
 something+0.106
 trouble+0.105
Neg logit contributions
y-0.287
jo-0.281
o-0.279
os-0.272
ppy-0.271
Token,
Token ID11
Feature activation+2.56e+00

Pos logit contributions
 anything+0.021
 chores+0.020
 something+0.019
 trouble+0.019
 accomplishment+0.018
Neg logit contributions
y-0.023
jo-0.022
o-0.021
os-0.021
ppy-0.021
Token my
Token ID616
Feature activation+2.96e+00

Pos logit contributions
 anything+0.042
 chores+0.039
 something+0.038
 trouble+0.037
 accomplishment+0.036
Neg logit contributions
y-0.066
jo-0.064
o-0.063
os-0.062
ppy-0.061
Token,
Token ID11
Feature activation+3.06e+00

Pos logit contributions
 anything+0.086
 chores+0.082
 something+0.079
 trouble+0.077
 accomplishment+0.077
Neg logit contributions
y-0.090
jo-0.086
o-0.085
os-0.082
ppy-0.081
Token so
Token ID523
Feature activation+6.21e+00

Pos logit contributions
 anything+0.059
 chores+0.058
 something+0.054
 accomplishment+0.054
 trouble+0.053
Neg logit contributions
y-0.066
o-0.064
jo-0.063
ppy-0.060
azz-0.060
Token he
Token ID339
Feature activation-3.17e+00

Pos logit contributions
 anything+0.087
 chores+0.081
 something+0.077
 trouble+0.075
 accomplishment+0.073
Neg logit contributions
y-0.173
jo-0.167
o-0.167
os-0.162
ppy-0.160
Token could
Token ID714
Feature activation+6.25e+00

Pos logit contributions
y+0.067
jo+0.065
azz+0.064
os+0.063
o+0.063
Neg logit contributions
 anything-0.065
 something-0.062
 chores-0.062
 good-0.059
 trouble-0.058
Token avoid
Token ID3368
Feature activation+9.33e+00

Pos logit contributions
 anything+0.087
 chores+0.078
 something+0.078
 trouble+0.076
 accomplishment+0.074
Neg logit contributions
y-0.173
o-0.169
jo-0.168
os-0.163
ppy-0.161
Token getting
Token ID1972
Feature activation+1.45e+01

Pos logit contributions
 chores+0.085
 anything+0.082
 accomplishment+0.077
 something+0.072
wered+0.068
Neg logit contributions
y-0.287
jo-0.275
o-0.272
os-0.265
ne-0.259
Token another
Token ID1194
Feature activation+5.36e+00

Pos logit contributions
 anything+0.193
 chores+0.189
 accomplishment+0.187
wered+0.169
 avail+0.168
Neg logit contributions
y-0.390
o-0.383
os-0.360
jo-0.348
ppy-0.346
Token band
Token ID4097
Feature activation-4.01e+00

Pos logit contributions
 anything+0.070
 chores+0.063
 something+0.062
 trouble+0.059
 accomplishment+0.058
Neg logit contributions
y-0.156
jo-0.152
o-0.152
os-0.149
ppy-0.148
Tokenage
Token ID496
Feature activation-1.53e+00

Pos logit contributions
y+0.079
jo+0.076
o+0.075
os+0.073
ppy+0.073
Neg logit contributions
 anything-0.089
 chores-0.084
 something-0.083
 trouble-0.079
 accomplishment-0.078
Token.
Token ID13
Feature activation-2.73e+00

Pos logit contributions
y+0.037
jo+0.036
o+0.036
os+0.035
ppy+0.034
Neg logit contributions
 anything-0.027
 chores-0.025
 something-0.025
 trouble-0.024
 accomplishment-0.023
Token<|endoftext|>
Token ID50256
Feature activation-6.09e+00

Pos logit contributions
y+0.074
jo+0.068
ppy+0.068
os+0.067
o+0.066
Neg logit contributions
 anything-0.042
 something-0.040
 chores-0.038
 trouble-0.038
 himself-0.038
Token off
Token ID572
Feature activation+4.20e+00

Pos logit contributions
 anything+0.058
 something+0.053
 chores+0.053
 trouble+0.050
 accomplishment+0.050
Neg logit contributions
y-0.092
jo-0.090
o-0.088
os-0.087
ppy-0.086
Token and
Token ID290
Feature activation+7.13e+00

Pos logit contributions
 anything+0.077
 chores+0.072
 something+0.069
 trouble+0.067
 accomplishment+0.065
Neg logit contributions
y-0.101
o-0.096
jo-0.096
os-0.093
ppy-0.090
Token vowed
Token ID19982
Feature activation+3.88e+00

Pos logit contributions
 anything+0.156
 something+0.152
 chores+0.142
 trouble+0.141
 accomplishment+0.140
Neg logit contributions
y-0.142
jo-0.139
azz-0.131
ppy-0.130
os-0.130
Token never
Token ID1239
Feature activation+8.29e+00

Pos logit contributions
 anything+0.070
 chores+0.066
 trouble+0.063
 something+0.063
 accomplishment+0.061
Neg logit contributions
y-0.091
o-0.089
jo-0.088
os-0.084
azz-0.082
Token to
Token ID284
Feature activation+1.05e+01

Pos logit contributions
 anything+0.149
 something+0.136
 chores+0.136
 trouble+0.131
 accomplishment+0.126
Neg logit contributions
y-0.202
jo-0.197
o-0.193
os-0.190
ppy-0.187
Token get
Token ID651
Feature activation+1.45e+01

Pos logit contributions
 anything+0.174
 chores+0.161
 something+0.158
 trouble+0.153
 accomplishment+0.151
Neg logit contributions
y-0.269
jo-0.259
o-0.258
os-0.251
ppy-0.248
Token too
Token ID1165
Feature activation+6.45e+00

Pos logit contributions
 chores+0.203
 anything+0.202
 accomplishment+0.200
 trouble+0.180
 something+0.179
Neg logit contributions
y-0.383
o-0.375
os-0.354
jo-0.352
ppy-0.349
Token close
Token ID1969
Feature activation+7.55e+00

Pos logit contributions
 anything+0.083
 chores+0.077
 accomplishment+0.076
 yourselves+0.069
 avail+0.067
Neg logit contributions
y-0.184
o-0.177
os-0.168
jo-0.167
ne-0.166
Token to
Token ID284
Feature activation+8.90e+00

Pos logit contributions
 anything+0.151
 chores+0.147
 something+0.138
 accomplishment+0.137
 trouble+0.135
Neg logit contributions
y-0.166
jo-0.154
o-0.154
os-0.148
ppy-0.144
Token a
Token ID257
Feature activation+5.71e+00

Pos logit contributions
 anything+0.074
 chores+0.073
 accomplishment+0.068
 something+0.063
 trouble+0.061
Neg logit contributions
y-0.272
jo-0.266
o-0.264
os-0.260
ne-0.254
Token trap
Token ID12840
Feature activation-2.63e+00

Pos logit contributions
 anything+0.079
 chores+0.073
 something+0.070
 trouble+0.067
 accomplishment+0.066
Neg logit contributions
y-0.160
jo-0.154
o-0.154
os-0.149
ppy-0.148
Token,
Token ID11
Feature activation+2.61e+00

Pos logit contributions
 anything+0.062
 chores+0.058
 something+0.057
 accomplishment+0.056
 trouble+0.056
Neg logit contributions
y-0.066
jo-0.061
o-0.060
os-0.059
ppy-0.057
Token but
Token ID475
Feature activation+4.11e+00

Pos logit contributions
 anything+0.041
 chores+0.038
 something+0.036
 trouble+0.036
 accomplishment+0.035
Neg logit contributions
y-0.068
jo-0.065
o-0.065
os-0.063
ppy-0.062
Token his
Token ID465
Feature activation+1.85e+00

Pos logit contributions
 anything+0.055
 chores+0.052
 trouble+0.049
 something+0.048
 accomplishment+0.047
Neg logit contributions
y-0.115
o-0.112
jo-0.110
os-0.108
ppy-0.107
Token mustache
Token ID49303
Feature activation-1.02e+01

Pos logit contributions
 anything+0.028
 chores+0.025
 something+0.025
 trouble+0.023
 accomplishment+0.023
Neg logit contributions
y-0.049
jo-0.048
o-0.048
os-0.046
ppy-0.046
Token kept
Token ID4030
Feature activation+1.17e+01

Pos logit contributions
y+0.201
jo+0.196
o+0.193
ppy+0.188
os+0.187
Neg logit contributions
 anything-0.227
 something-0.212
 chores-0.212
 trouble-0.203
 accomplishment-0.198
Token getting
Token ID1972
Feature activation+1.44e+01

Pos logit contributions
 anything+0.142
 chores+0.133
 yourselves+0.129
wered+0.126
 accomplishment+0.121
Neg logit contributions
y-0.321
o-0.306
jo-0.291
os-0.284
ah-0.280
Token in
Token ID287
Feature activation+1.02e+01

Pos logit contributions
 yourselves+0.177
 accomplishment+0.171
wered+0.170
 chores+0.163
 anything+0.161
Neg logit contributions
y-0.392
o-0.371
ppy-0.345
os-0.340
bo-0.339
Token the
Token ID262
Feature activation+6.97e+00

Pos logit contributions
 anything+0.124
 biggest+0.117
 yourselves+0.116
wered+0.114
 chores+0.111
Neg logit contributions
y-0.272
o-0.239
jo-0.237
ah-0.230
ne-0.226
Token way
Token ID835
Feature activation+5.30e+00

Pos logit contributions
 anything+0.095
 accomplishment+0.084
 chores+0.083
 something+0.083
 yourselves+0.080
Neg logit contributions
y-0.186
o-0.176
jo-0.175
ah-0.169
ne-0.168
Token and
Token ID290
Feature activation+7.38e+00

Pos logit contributions
 anything+0.100
 chores+0.094
 something+0.092
 trouble+0.090
 accomplishment+0.089
Neg logit contributions
y-0.123
jo-0.116
o-0.116
os-0.113
ppy-0.110
Token preventing
Token ID12174
Feature activation+1.14e+01

Pos logit contributions
 anything+0.114
 chores+0.110
 trouble+0.101
 something+0.100
 accomplishment+0.098
Neg logit contributions
y-0.189
o-0.183
jo-0.179
os-0.176
ppy-0.168
Token so
Token ID523
Feature activation+8.18e+00

Pos logit contributions
y+0.047
jo+0.045
o+0.045
os+0.044
ppy+0.043
Neg logit contributions
 anything-0.036
 chores-0.033
 something-0.033
 trouble-0.032
 accomplishment-0.031
Token proud
Token ID6613
Feature activation+5.30e+00

Pos logit contributions
 anything+0.113
 chores+0.103
 trouble+0.094
 yourselves+0.092
 something+0.091
Neg logit contributions
y-0.227
o-0.218
jo-0.214
os-0.206
anne-0.204
Token of
Token ID286
Feature activation+8.49e+00

Pos logit contributions
 anything+0.094
 chores+0.087
 something+0.087
 trouble+0.084
 accomplishment+0.082
Neg logit contributions
y-0.129
jo-0.125
o-0.124
os-0.120
ppy-0.120
Token herself
Token ID5223
Feature activation-1.07e+00

Pos logit contributions
 chores+0.094
 anything+0.093
 trouble+0.080
 job+0.078
 something+0.076
Neg logit contributions
y-0.239
o-0.235
jo-0.233
os-0.226
ne-0.224
Token for
Token ID329
Feature activation+8.84e+00

Pos logit contributions
y+0.027
jo+0.026
o+0.026
os+0.026
azz+0.025
Neg logit contributions
 anything-0.018
 something-0.017
 chores-0.016
 accomplishment-0.015
 trouble-0.015
Token getting
Token ID1972
Feature activation+1.43e+01

Pos logit contributions
 chores+0.101
 anything+0.099
 trouble+0.088
 biggest+0.087
 accomplishment+0.085
Neg logit contributions
y-0.233
o-0.230
ppy-0.223
jo-0.222
ne-0.221
Token fit
Token ID4197
Feature activation+6.12e+00

Pos logit contributions
 anything+0.176
 chores+0.168
 accomplishment+0.162
 trouble+0.151
wered+0.147
Neg logit contributions
y-0.408
o-0.396
os-0.377
jo-0.368
ppy-0.367
Token.
Token ID13
Feature activation-2.41e+00

Pos logit contributions
 anything+0.122
 chores+0.117
 trouble+0.112
 something+0.111
 accomplishment+0.110
Neg logit contributions
y-0.137
jo-0.124
o-0.124
ppy-0.120
os-0.120
Token It
Token ID632
Feature activation-2.71e+00

Pos logit contributions
y+0.065
ppy+0.061
jo+0.061
os+0.059
o+0.059
Neg logit contributions
 anything-0.035
 something-0.034
 himself-0.032
 many-0.032
 an-0.031
Token was
Token ID373
Feature activation+9.50e+00

Pos logit contributions
y+0.056
jo+0.055
o+0.054
azz+0.052
ppy+0.052
Neg logit contributions
 anything-0.057
 something-0.054
 chores-0.053
 trouble-0.051
 great-0.051
Token all
Token ID477
Feature activation+6.32e+00

Pos logit contributions
 anything+0.107
 chores+0.105
 yourselves+0.095
 trouble+0.090
 avail+0.086
Neg logit contributions
y-0.280
o-0.274
jo-0.258
os-0.252
ah-0.251
Token sharing
Token ID7373
Feature activation+2.68e+00

Pos logit contributions
 anything+0.031
 chores+0.029
 avail+0.028
 accomplishment+0.027
 yourselves+0.027
Neg logit contributions
y-0.051
o-0.049
os-0.047
jo-0.046
azz-0.046
Token toys
Token ID14958
Feature activation-2.28e+00

Pos logit contributions
 anything+0.057
 chores+0.056
 accomplishment+0.054
 miles+0.053
 trouble+0.052
Neg logit contributions
y-0.055
o-0.050
os-0.048
jo-0.048
azz-0.047
Token was
Token ID373
Feature activation+1.02e+01

Pos logit contributions
y+0.046
o+0.042
os+0.040
jo+0.040
ppy+0.040
Neg logit contributions
 anything-0.050
 chores-0.048
 accomplishment-0.047
 trouble-0.046
 something-0.046
Token better
Token ID1365
Feature activation+5.72e+00

Pos logit contributions
wered+0.133
 avail+0.132
 miles+0.126
 bottom+0.122
 middle+0.122
Neg logit contributions
y-0.255
o-0.250
ety-0.228
ppy-0.227
lets-0.222
Token than
Token ID621
Feature activation+9.01e+00

Pos logit contributions
 anything+0.114
 chores+0.110
wered+0.108
 accomplishment+0.102
 miles+0.102
Neg logit contributions
y-0.131
o-0.119
ppy-0.115
jo-0.112
ne-0.111
Token getting
Token ID1972
Feature activation+1.42e+01

Pos logit contributions
 chores+0.121
 middle+0.115
 accomplishment+0.114
 anything+0.114
 top+0.110
Neg logit contributions
y-0.239
o-0.224
ppy-0.217
ne-0.213
ety-0.211
Token angry
Token ID7954
Feature activation+3.95e+00

Pos logit contributions
 anything+0.161
 chores+0.161
 accomplishment+0.159
wered+0.154
 avail+0.150
Neg logit contributions
y-0.414
o-0.386
ppy-0.379
os-0.368
ne-0.367
Token or
Token ID393
Feature activation+8.18e+00

Pos logit contributions
 anything+0.077
 chores+0.071
 something+0.071
 trouble+0.069
 accomplishment+0.067
Neg logit contributions
y-0.089
jo-0.087
o-0.086
os-0.084
ppy-0.083
Token hating
Token ID45701
Feature activation+5.58e+00

Pos logit contributions
 anything+0.111
 chores+0.103
 something+0.099
 trouble+0.095
 accomplishment+0.094
Neg logit contributions
y-0.232
o-0.223
jo-0.220
os-0.216
ppy-0.215
Token anyone
Token ID2687
Feature activation-4.09e+00

Pos logit contributions
 anything+0.067
 chores+0.062
 accomplishment+0.061
 something+0.057
 trouble+0.057
Neg logit contributions
y-0.164
o-0.156
jo-0.154
os-0.153
ppy-0.150
Token.
Token ID13
Feature activation-2.79e+00

Pos logit contributions
jo+0.095
y+0.093
o+0.091
ppy+0.089
azz+0.089
Neg logit contributions
 anything-0.080
 something-0.073
 chores-0.071
 trouble-0.069
 good-0.067
Token He
Token ID679
Feature activation-1.88e+00

Pos logit contributions
y+0.056
ppy+0.052
jo+0.051
ne+0.050
os+0.050
Neg logit contributions
 anything-0.028
 something-0.027
 himself-0.026
 many-0.025
 chores-0.025
Token was
Token ID373
Feature activation+7.95e+00

Pos logit contributions
y+0.041
jo+0.041
azz+0.040
o+0.039
ppy+0.039
Neg logit contributions
 anything-0.035
 something-0.034
 chores-0.033
 accomplishment-0.032
 trouble-0.032
Token determined
Token ID5295
Feature activation+4.14e+00

Pos logit contributions
 anything+0.104
 chores+0.101
 trouble+0.090
 yourselves+0.088
 avail+0.088
Neg logit contributions
o-0.224
y-0.221
jo-0.212
os-0.206
ppy-0.205
Token to
Token ID284
Feature activation+8.61e+00

Pos logit contributions
 anything+0.084
 chores+0.082
 accomplishment+0.079
 trouble+0.077
 something+0.076
Neg logit contributions
y-0.086
o-0.080
jo-0.080
os-0.076
ne-0.075
Token never
Token ID1239
Feature activation+8.20e+00

Pos logit contributions
 anything+0.135
 chores+0.125
 trouble+0.122
 something+0.122
 accomplishment+0.120
Neg logit contributions
y-0.209
jo-0.202
o-0.199
ne-0.199
ppy-0.196
Token get
Token ID651
Feature activation+1.42e+01

Pos logit contributions
 anything+0.131
 something+0.118
 chores+0.117
 trouble+0.113
 accomplishment+0.105
Neg logit contributions
y-0.217
jo-0.216
o-0.211
os-0.207
ppy-0.204
Token into
Token ID656
Feature activation+1.04e+01

Pos logit contributions
 accomplishment+0.190
 chores+0.189
 anything+0.184
wered+0.175
 yourselves+0.168
Neg logit contributions
y-0.380
o-0.367
os-0.347
ppy-0.345
ne-0.341
Token such
Token ID884
Feature activation+1.39e+01

Pos logit contributions
wered+0.080
 accomplishment+0.064
 chores+0.061
 middle+0.059
 End+0.058
Neg logit contributions
y-0.307
jo-0.286
ne-0.282
o-0.273
os-0.272
Token a
Token ID257
Feature activation+8.48e+00

Pos logit contributions
 chores+0.112
 yourselves+0.110
wered+0.108
 anything+0.103
 accomplishment+0.099
Neg logit contributions
y-0.407
jo-0.380
ppy-0.373
o-0.368
ne-0.368
Token disgusting
Token ID23374
Feature activation+3.72e+00

Pos logit contributions
 anything+0.098
 chores+0.088
 yourselves+0.082
 something+0.082
 himself+0.081
Neg logit contributions
y-0.252
o-0.237
jo-0.237
ne-0.232
ppy-0.227
Token situation
Token ID3074
Feature activation+1.63e+00

Pos logit contributions
 anything+0.061
 chores+0.057
 something+0.055
 trouble+0.053
 accomplishment+0.053
Neg logit contributions
y-0.095
jo-0.090
o-0.089
ppy-0.086
os-0.086
Token being
Token ID852
Feature activation+1.02e+01

Pos logit contributions
 chores+0.081
 anything+0.077
 accomplishment+0.070
 trouble+0.069
 biggest+0.065
Neg logit contributions
y-0.216
o-0.211
ppy-0.208
jo-0.205
ne-0.201
Token persistent
Token ID16218
Feature activation+3.78e+00

Pos logit contributions
 anything+0.136
 chores+0.136
 accomplishment+0.121
 something+0.116
 trouble+0.116
Neg logit contributions
y-0.292
o-0.281
jo-0.272
os-0.268
ppy-0.268
Token enough
Token ID1576
Feature activation+6.63e+00

Pos logit contributions
 anything+0.073
 something+0.068
 chores+0.067
 trouble+0.065
 accomplishment+0.063
Neg logit contributions
jo-0.084
y-0.084
o-0.083
azz-0.081
os-0.080
Token to
Token ID284
Feature activation+8.73e+00

Pos logit contributions
 anything+0.134
 chores+0.124
 something+0.122
 accomplishment+0.118
 trouble+0.117
Neg logit contributions
y-0.145
o-0.137
jo-0.135
os-0.130
ppy-0.130
Token find
Token ID1064
Feature activation+8.88e+00

Pos logit contributions
 anything+0.138
 chores+0.129
 something+0.125
 trouble+0.125
 accomplishment+0.121
Neg logit contributions
y-0.212
o-0.205
jo-0.199
ppy-0.199
ne-0.197
Token such
Token ID884
Feature activation+1.42e+01

Pos logit contributions
 chores+0.083
 anything+0.076
 accomplishment+0.074
wered+0.067
 trouble+0.065
Neg logit contributions
y-0.271
o-0.259
jo-0.259
os-0.255
ne-0.253
Token a
Token ID257
Feature activation+7.96e+00

Pos logit contributions
 chores+0.132
 avail+0.131
 anything+0.127
wered+0.120
 accomplishment+0.114
Neg logit contributions
y-0.411
o-0.383
ppy-0.374
ne-0.368
jo-0.363
Token special
Token ID2041
Feature activation+4.86e+00

Pos logit contributions
 anything+0.084
 chores+0.074
wered+0.066
 yourselves+0.066
 something+0.065
Neg logit contributions
y-0.246
o-0.234
jo-0.232
ne-0.226
ah-0.222
Tokenplace
Token ID5372
Feature activation+4.14e+00

Pos logit contributions
 anything+0.061
 chores+0.054
 something+0.050
 trouble+0.049
 yourselves+0.047
Neg logit contributions
y-0.143
jo-0.136
o-0.136
ne-0.132
os-0.130
Token.
Token ID13
Feature activation-2.62e+00

Pos logit contributions
 anything+0.055
 chores+0.050
 something+0.048
 trouble+0.047
 accomplishment+0.046
Neg logit contributions
y-0.118
o-0.114
jo-0.114
os-0.111
ppy-0.110
Token
Token ID220
Feature activation-2.72e+00

Pos logit contributions
y+0.071
ppy+0.066
jo+0.065
os+0.064
o+0.064
Neg logit contributions
 anything-0.040
 something-0.038
 chores-0.036
 trouble-0.035
 himself-0.035
Token friends
Token ID2460
Feature activation-4.63e+00

Pos logit contributions
jo+0.339
azz+0.336
os+0.315
y+0.311
anne+0.311
Neg logit contributions
 something-0.251
 anything-0.246
 many-0.243
 not-0.239
 good-0.237
Token thanked
Token ID26280
Feature activation+8.07e+00

Pos logit contributions
jo+0.111
azz+0.107
y+0.107
itty+0.104
o+0.104
Neg logit contributions
 anything-0.083
 something-0.079
 chores-0.073
 many-0.072
 an-0.071
Token her
Token ID607
Feature activation+1.28e+00

Pos logit contributions
 anything+0.086
 chores+0.076
 something+0.074
 trouble+0.070
 accomplishment+0.067
Neg logit contributions
y-0.253
jo-0.246
o-0.245
os-0.239
ppy-0.238
Token for
Token ID329
Feature activation+7.38e+00

Pos logit contributions
 anything+0.021
 chores+0.020
 something+0.019
 trouble+0.019
 accomplishment+0.018
Neg logit contributions
y-0.033
jo-0.032
o-0.031
os-0.031
ppy-0.030
Token being
Token ID852
Feature activation+9.60e+00

Pos logit contributions
 chores+0.079
 anything+0.079
 top+0.069
 accomplishment+0.067
 trouble+0.066
Neg logit contributions
y-0.205
o-0.204
jo-0.199
ppy-0.194
os-0.193
Token such
Token ID884
Feature activation+1.42e+01

Pos logit contributions
 chores+0.103
 anything+0.102
 accomplishment+0.090
 trouble+0.082
 top+0.081
Neg logit contributions
y-0.298
o-0.290
jo-0.280
os-0.274
ppy-0.273
Token a
Token ID257
Feature activation+6.68e+00

Pos logit contributions
 anything+0.136
 chores+0.132
 avail+0.130
 accomplishment+0.118
 miles+0.113
Neg logit contributions
y-0.418
o-0.405
ppy-0.384
ne-0.380
jo-0.379
Token good
Token ID922
Feature activation+5.03e+00

Pos logit contributions
 anything+0.080
 chores+0.071
 something+0.065
 trouble+0.062
 yourselves+0.062
Neg logit contributions
y-0.201
o-0.192
jo-0.191
ne-0.184
ppy-0.182
Token leader
Token ID3554
Feature activation-3.36e+00

Pos logit contributions
 anything+0.103
 chores+0.096
 something+0.095
 trouble+0.092
 accomplishment+0.090
Neg logit contributions
y-0.110
jo-0.105
o-0.104
os-0.100
ppy-0.099
Token.
Token ID13
Feature activation-2.57e+00

Pos logit contributions
y+0.076
jo+0.075
o+0.074
os+0.073
azz+0.072
Neg logit contributions
 anything-0.064
 something-0.060
 chores-0.058
 trouble-0.056
 accomplishment-0.054
Token Lily
Token ID20037
Feature activation-1.09e+01

Pos logit contributions
y+0.069
ppy+0.066
jo+0.065
os+0.063
ne+0.063
Neg logit contributions
 anything-0.036
 something-0.035
 himself-0.034
 many-0.033
 an-0.033
TokenThank
Token ID10449
Feature activation+4.50e+00

Pos logit contributions
 anything+0.004
 something+0.004
 himself+0.004
 yourselves+0.004
 moderation+0.004
Neg logit contributions
y-0.008
jo-0.008
os-0.008
ppy-0.008
azz-0.008
Token you
Token ID345
Feature activation-1.19e+00

Pos logit contributions
 anything+0.053
 chores+0.047
 something+0.047
 trouble+0.044
 accomplishment+0.042
Neg logit contributions
y-0.138
jo-0.134
o-0.133
os-0.130
ppy-0.129
Token Jack
Token ID3619
Feature activation-9.95e+00

Pos logit contributions
y+0.029
jo+0.028
o+0.028
os+0.027
ppy+0.027
Neg logit contributions
 anything-0.021
 chores-0.019
 something-0.019
 trouble-0.019
 accomplishment-0.018
Token for
Token ID329
Feature activation+7.33e+00

Pos logit contributions
jo+0.242
azz+0.237
y+0.235
os+0.235
o+0.230
Neg logit contributions
 anything-0.166
 something-0.157
 chores-0.151
 trouble-0.146
 good-0.145
Token finding
Token ID4917
Feature activation+9.87e+00

Pos logit contributions
 anything+0.101
 chores+0.093
 biggest+0.085
 avail+0.085
 accomplishment+0.084
Neg logit contributions
o-0.191
y-0.184
ppy-0.175
os-0.171
itty-0.168
Token such
Token ID884
Feature activation+1.42e+01

Pos logit contributions
 chores+0.119
 accomplishment+0.113
 anything+0.108
wered+0.103
 trouble+0.099
Neg logit contributions
y-0.274
o-0.268
os-0.255
jo-0.254
ne-0.252
Token a
Token ID257
Feature activation+8.08e+00

Pos logit contributions
 avail+0.153
 chores+0.152
 anything+0.152
 bottom+0.145
wered+0.136
Neg logit contributions
y-0.355
o-0.343
ne-0.323
ppy-0.322
bug-0.319
Token nice
Token ID3621
Feature activation+3.98e+00

Pos logit contributions
 anything+0.089
 chores+0.074
 yourselves+0.070
wered+0.069
 himself+0.066
Neg logit contributions
y-0.248
o-0.240
jo-0.231
ne-0.225
ah-0.224
Token shelter
Token ID11772
Feature activation-3.09e+00

Pos logit contributions
 anything+0.051
 chores+0.046
 something+0.044
 trouble+0.042
 accomplishment+0.041
Neg logit contributions
y-0.117
jo-0.112
o-0.112
os-0.108
ppy-0.107
Token."
Token ID526
Feature activation-3.18e+00

Pos logit contributions
y+0.073
jo+0.071
o+0.070
os+0.068
ppy+0.068
Neg logit contributions
 anything-0.056
 chores-0.052
 something-0.052
 trouble-0.050
 accomplishment-0.049
Token
Token ID220
Feature activation-2.64e+00

Pos logit contributions
y+0.089
jo+0.086
ppy+0.084
ah+0.083
os+0.082
Neg logit contributions
 something-0.041
 anything-0.039
 himself-0.039
 an-0.037
 many-0.037
Token always
Token ID1464
Feature activation+8.09e+00

Pos logit contributions
 anything+0.114
 chores+0.105
 trouble+0.103
 something+0.103
 accomplishment+0.099
Neg logit contributions
y-0.171
o-0.166
jo-0.164
ppy-0.160
os-0.157
Token be
Token ID307
Feature activation+1.05e+01

Pos logit contributions
 anything+0.132
 chores+0.121
 something+0.119
 trouble+0.116
 accomplishment+0.113
Neg logit contributions
y-0.207
o-0.199
jo-0.199
os-0.193
ppy-0.192
Token careful
Token ID8161
Feature activation+4.20e+00

Pos logit contributions
 anything+0.138
 chores+0.135
 accomplishment+0.127
 top+0.120
 trouble+0.119
Neg logit contributions
y-0.298
o-0.296
ppy-0.274
os-0.271
jo-0.271
Token and
Token ID290
Feature activation+7.97e+00

Pos logit contributions
 anything+0.073
 chores+0.069
 trouble+0.067
 something+0.066
 accomplishment+0.066
Neg logit contributions
y-0.104
jo-0.098
o-0.098
os-0.094
ppy-0.093
Token never
Token ID1239
Feature activation+7.72e+00

Pos logit contributions
 anything+0.116
 chores+0.105
 trouble+0.101
 something+0.101
 accomplishment+0.097
Neg logit contributions
y-0.216
o-0.211
jo-0.204
os-0.201
ppy-0.199
Token get
Token ID651
Feature activation+1.41e+01

Pos logit contributions
 anything+0.138
 something+0.125
 chores+0.118
 trouble+0.117
 good+0.114
Neg logit contributions
jo-0.199
y-0.188
azz-0.184
itty-0.183
os-0.182
Token caught
Token ID4978
Feature activation+7.51e+00

Pos logit contributions
 anything+0.181
 chores+0.171
 accomplishment+0.167
 something+0.160
 trouble+0.157
Neg logit contributions
y-0.405
o-0.392
jo-0.376
os-0.376
ppy-0.372
Token again
Token ID757
Feature activation+2.91e+00

Pos logit contributions
 anything+0.085
 chores+0.083
 accomplishment+0.079
 trouble+0.074
 something+0.074
Neg logit contributions
y-0.225
o-0.213
jo-0.210
os-0.208
ppy-0.207
Token.
Token ID13
Feature activation-2.92e+00

Pos logit contributions
 anything+0.061
 chores+0.058
 something+0.056
 accomplishment+0.056
 trouble+0.055
Neg logit contributions
y-0.061
jo-0.057
o-0.057
os-0.055
ppy-0.054
Token<|endoftext|>
Token ID50256
Feature activation-7.38e+00

Pos logit contributions
y+0.072
jo+0.069
o+0.067
os+0.066
ppy+0.066
Neg logit contributions
 anything-0.052
 chores-0.048
 something-0.048
 trouble-0.046
 accomplishment-0.045
TokenOnce
Token ID7454
Feature activation-1.20e+00

Pos logit contributions
y+0.193
jo+0.189
ppy+0.187
azz+0.181
o+0.180
Neg logit contributions
 anything-0.114
 something-0.107
 trouble-0.101
 chores-0.099
 many-0.098
Token and
Token ID290
Feature activation+6.30e+00

Pos logit contributions
 anything+0.047
 chores+0.043
 something+0.042
 trouble+0.041
 accomplishment+0.040
Neg logit contributions
y-0.064
jo-0.062
o-0.061
os-0.059
ppy-0.059
Token her
Token ID607
Feature activation-6.67e-02

Pos logit contributions
 anything+0.097
 chores+0.089
 something+0.088
 trouble+0.084
 accomplishment+0.082
Neg logit contributions
y-0.168
jo-0.163
o-0.162
os-0.158
ppy-0.156
Token family
Token ID1641
Feature activation-8.58e+00

Pos logit contributions
y+0.002
jo+0.002
o+0.002
os+0.002
ppy+0.002
Neg logit contributions
 anything-0.001
 chores-0.001
 something-0.001
 trouble-0.001
 accomplishment-0.001
Token will
Token ID481
Feature activation+5.36e+00

Pos logit contributions
jo+0.174
y+0.174
azz+0.170
o+0.168
ppy+0.166
Neg logit contributions
 anything-0.177
 something-0.171
 chores-0.167
 accomplishment-0.160
 trouble-0.160
Token never
Token ID1239
Feature activation+8.22e+00

Pos logit contributions
 anything+0.089
 chores+0.080
 trouble+0.079
 something+0.079
 accomplishment+0.075
Neg logit contributions
y-0.133
o-0.126
jo-0.126
ppy-0.123
ne-0.122
Token get
Token ID651
Feature activation+1.41e+01

Pos logit contributions
 anything+0.135
 chores+0.122
 something+0.121
 trouble+0.117
 accomplishment+0.113
Neg logit contributions
y-0.213
jo-0.209
o-0.206
os-0.203
ppy-0.198
Token to
Token ID284
Feature activation+9.78e+00

Pos logit contributions
 anything+0.205
 chores+0.205
 accomplishment+0.201
wered+0.187
 something+0.182
Neg logit contributions
y-0.367
o-0.351
ppy-0.332
os-0.329
jo-0.328
Token see
Token ID766
Feature activation+7.47e+00

Pos logit contributions
 chores+0.141
 anything+0.140
 accomplishment+0.135
 something+0.127
 yourselves+0.126
Neg logit contributions
y-0.239
o-0.223
jo-0.223
ne-0.219
ppy-0.216
Token her
Token ID607
Feature activation+1.22e+00

Pos logit contributions
 anything+0.077
 chores+0.071
 something+0.066
 trouble+0.064
 accomplishment+0.064
Neg logit contributions
y-0.234
jo-0.226
o-0.225
os-0.221
ppy-0.218
Token again
Token ID757
Feature activation+2.15e+00

Pos logit contributions
 anything+0.022
 chores+0.020
 something+0.020
 trouble+0.019
 accomplishment+0.019
Neg logit contributions
y-0.029
jo-0.028
o-0.028
os-0.027
ppy-0.026
Token.
Token ID13
Feature activation-2.62e+00

Pos logit contributions
 anything+0.044
 chores+0.042
 something+0.041
 trouble+0.040
 accomplishment+0.040
Neg logit contributions
y-0.046
jo-0.044
o-0.043
os-0.042
ppy-0.041
Token closely
Token ID7173
Feature activation+2.76e+00

Pos logit contributions
y+0.153
jo+0.147
o+0.146
os+0.142
ppy+0.140
Neg logit contributions
 anything-0.118
 chores-0.110
 something-0.108
 trouble-0.105
 accomplishment-0.103
Token so
Token ID523
Feature activation+6.34e+00

Pos logit contributions
 anything+0.055
 chores+0.051
 something+0.050
 trouble+0.049
 accomplishment+0.048
Neg logit contributions
y-0.061
jo-0.059
o-0.058
os-0.056
ppy-0.056
Token he
Token ID339
Feature activation-2.65e+00

Pos logit contributions
 anything+0.089
 chores+0.085
 trouble+0.080
 something+0.078
 accomplishment+0.077
Neg logit contributions
y-0.172
jo-0.167
o-0.164
os-0.161
ppy-0.157
Token would
Token ID561
Feature activation+6.46e+00

Pos logit contributions
y+0.048
jo+0.047
o+0.045
azz+0.044
os+0.044
Neg logit contributions
 anything-0.063
 something-0.060
 chores-0.059
 trouble-0.057
 an-0.056
Token never
Token ID1239
Feature activation+7.95e+00

Pos logit contributions
 anything+0.101
 chores+0.092
 something+0.091
 trouble+0.089
 accomplishment+0.087
Neg logit contributions
y-0.168
jo-0.163
o-0.161
ppy-0.157
os-0.156
Token get
Token ID651
Feature activation+1.40e+01

Pos logit contributions
 anything+0.121
 something+0.106
 chores+0.105
 trouble+0.102
 accomplishment+0.094
Neg logit contributions
y-0.218
jo-0.216
o-0.212
os-0.209
azz-0.204
Token lost
Token ID2626
Feature activation+6.75e+00

Pos logit contributions
 chores+0.218
 accomplishment+0.212
 anything+0.206
wered+0.196
 yourselves+0.190
Neg logit contributions
y-0.353
o-0.342
os-0.323
ppy-0.318
jo-0.318
Token again
Token ID757
Feature activation+2.86e+00

Pos logit contributions
 chores+0.137
 accomplishment+0.128
 anything+0.127
wered+0.120
 trouble+0.120
Neg logit contributions
y-0.146
jo-0.131
o-0.129
os-0.126
ne-0.125
Token!
Token ID0
Feature activation-2.10e+00

Pos logit contributions
 anything+0.061
 chores+0.060
 accomplishment+0.057
 something+0.057
 trouble+0.056
Neg logit contributions
y-0.058
jo-0.054
o-0.053
os-0.052
ppy-0.051
Token<|endoftext|>
Token ID50256
Feature activation-6.83e+00

Pos logit contributions
y+0.056
jo+0.052
ppy+0.052
o+0.051
os+0.051
Neg logit contributions
 anything-0.033
 something-0.031
 trouble-0.030
 chores-0.030
 himself-0.029
TokenBaby
Token ID36534
Feature activation-1.42e+01

Pos logit contributions
jo+0.183
ppy+0.182
y+0.181
azz+0.173
anne+0.170
Neg logit contributions
 anything-0.097
 something-0.093
 trouble-0.086
 an-0.083
 many-0.083
Tokenroller
Token ID10646
Feature activation-6.66e+00

Pos logit contributions
y+0.065
jo+0.062
o+0.062
os+0.059
ppy+0.059
Neg logit contributions
 anything-0.079
 chores-0.076
 something-0.075
 trouble-0.073
 accomplishment-0.072
Token so
Token ID523
Feature activation+6.53e+00

Pos logit contributions
jo+0.154
azz+0.149
y+0.148
ppy+0.148
o+0.147
Neg logit contributions
 anything-0.120
 something-0.115
 chores-0.108
 many-0.102
 an-0.101
Token we
Token ID356
Feature activation+1.19e+00

Pos logit contributions
 anything+0.087
 chores+0.079
 trouble+0.073
 something+0.073
 accomplishment+0.070
Neg logit contributions
y-0.187
o-0.179
jo-0.177
os-0.174
ne-0.169
Token won
Token ID1839
Feature activation+7.57e+00

Pos logit contributions
 anything+0.024
 chores+0.023
 something+0.022
 trouble+0.021
 great+0.021
Neg logit contributions
jo-0.026
y-0.026
o-0.025
azz-0.025
os-0.024
Token't
Token ID470
Feature activation+9.08e+00

Pos logit contributions
 anything+0.126
 chores+0.117
 something+0.114
 accomplishment+0.111
 trouble+0.111
Neg logit contributions
y-0.190
o-0.183
jo-0.182
os-0.178
ppy-0.174
Token get
Token ID651
Feature activation+1.40e+01

Pos logit contributions
 anything+0.136
 chores+0.123
 something+0.120
 trouble+0.116
 accomplishment+0.109
Neg logit contributions
y-0.250
jo-0.249
o-0.243
os-0.240
azz-0.236
Token tired
Token ID10032
Feature activation+4.26e+00

Pos logit contributions
 anything+0.191
 chores+0.187
 accomplishment+0.183
 something+0.169
 trouble+0.168
Neg logit contributions
y-0.378
o-0.366
os-0.348
ppy-0.343
ne-0.342
Token."
Token ID526
Feature activation-3.47e+00

Pos logit contributions
 anything+0.080
 chores+0.075
 something+0.074
 trouble+0.071
 accomplishment+0.071
Neg logit contributions
y-0.100
jo-0.095
o-0.094
os-0.091
ppy-0.091
Token
Token ID198
Feature activation-1.29e+00

Pos logit contributions
y+0.093
jo+0.089
ppy+0.089
os+0.086
o+0.085
Neg logit contributions
 anything-0.051
 something-0.051
 an-0.046
 himself-0.046
 trouble-0.046
Token
Token ID198
Feature activation-1.15e+00

Pos logit contributions
y+0.036
ppy+0.033
jo+0.033
os+0.033
o+0.032
Neg logit contributions
 something-0.018
 anything-0.018
 chores-0.017
 trouble-0.016
 himself-0.016
TokenBrad
Token ID30805
Feature activation-1.16e+01

Pos logit contributions
y+0.030
jo+0.029
os+0.028
o+0.028
ppy+0.028
Neg logit contributions
 anything-0.017
 something-0.017
 chores-0.016
 trouble-0.015
 himself-0.015
Token of
Token ID286
Feature activation+6.56e+00

Pos logit contributions
 anything+0.078
 something+0.072
 chores+0.072
 trouble+0.069
 accomplishment+0.067
Neg logit contributions
y-0.111
jo-0.108
o-0.107
os-0.104
ppy-0.104
Token her
Token ID607
Feature activation+6.67e-01

Pos logit contributions
 anything+0.081
 chores+0.077
 trouble+0.071
 something+0.070
 accomplishment+0.069
Neg logit contributions
y-0.187
o-0.183
jo-0.182
os-0.178
ppy-0.176
Token for
Token ID329
Feature activation+7.35e+00

Pos logit contributions
 anything+0.011
 chores+0.010
 something+0.010
 trouble+0.010
 accomplishment+0.010
Neg logit contributions
y-0.017
jo-0.017
o-0.017
os-0.016
ppy-0.016
Token picking
Token ID10868
Feature activation+6.10e+00

Pos logit contributions
 anything+0.091
 chores+0.089
 trouble+0.082
 accomplishment+0.080
 biggest+0.079
Neg logit contributions
y-0.195
o-0.192
ppy-0.186
jo-0.184
os-0.183
Token out
Token ID503
Feature activation+5.95e+00

Pos logit contributions
 anything+0.071
 chores+0.068
 accomplishment+0.066
 trouble+0.062
 something+0.058
Neg logit contributions
y-0.182
o-0.172
os-0.168
jo-0.167
ne-0.164
Token such
Token ID884
Feature activation+1.40e+01

Pos logit contributions
 anything+0.047
 accomplishment+0.043
 chores+0.042
 trouble+0.040
 yourselves+0.035
Neg logit contributions
y-0.192
jo-0.183
o-0.183
os-0.181
ne-0.178
Token a
Token ID257
Feature activation+7.53e+00

Pos logit contributions
 anything+0.135
 chores+0.124
 accomplishment+0.122
 avail+0.116
 miles+0.112
Neg logit contributions
y-0.402
o-0.368
jo-0.356
ne-0.354
bug-0.349
Token pretty
Token ID2495
Feature activation+4.41e+00

Pos logit contributions
 anything+0.073
 chores+0.060
 something+0.056
 trouble+0.053
 himself+0.051
Neg logit contributions
y-0.245
o-0.235
jo-0.232
ne-0.225
os-0.223
Token dress
Token ID6576
Feature activation-3.43e+00

Pos logit contributions
 anything+0.071
 chores+0.066
 something+0.065
 trouble+0.062
 accomplishment+0.061
Neg logit contributions
y-0.114
jo-0.110
o-0.109
os-0.106
ppy-0.105
Token.
Token ID13
Feature activation-2.99e+00

Pos logit contributions
jo+0.080
y+0.079
o+0.078
os+0.077
ppy+0.076
Neg logit contributions
 anything-0.063
 something-0.059
 chores-0.057
 an-0.052
 accomplishment-0.052
Token Lily
Token ID20037
Feature activation-1.07e+01

Pos logit contributions
y+0.084
ppy+0.081
jo+0.077
os+0.077
ne+0.076
Neg logit contributions
 something-0.039
 anything-0.038
 many-0.036
 an-0.036
 himself-0.036
Token make
Token ID787
Feature activation+9.48e+00

Pos logit contributions
 anything+0.099
 chores+0.089
 something+0.084
 trouble+0.083
 accomplishment+0.079
Neg logit contributions
y-0.184
o-0.180
jo-0.178
os-0.173
ppy-0.171
Token sure
Token ID1654
Feature activation+3.33e+00

Pos logit contributions
 anything+0.093
 chores+0.084
 accomplishment+0.080
 something+0.076
 trouble+0.075
Neg logit contributions
y-0.284
jo-0.276
o-0.276
os-0.270
ppy-0.263
Token he
Token ID339
Feature activation-1.78e+00

Pos logit contributions
 anything+0.038
 chores+0.037
 accomplishment+0.036
 trouble+0.034
wered+0.033
Neg logit contributions
y-0.093
o-0.089
jo-0.089
os-0.088
ne-0.085
Token didn
Token ID1422
Feature activation+6.50e+00

Pos logit contributions
y+0.035
jo+0.035
azz+0.034
o+0.034
itty+0.033
Neg logit contributions
 anything-0.039
 something-0.036
 chores-0.036
 trouble-0.034
 great-0.034
Token't
Token ID470
Feature activation+7.16e+00

Pos logit contributions
 anything+0.101
 chores+0.093
 something+0.092
 accomplishment+0.091
 trouble+0.090
Neg logit contributions
y-0.167
jo-0.161
o-0.161
os-0.158
ne-0.155
Token get
Token ID651
Feature activation+1.40e+01

Pos logit contributions
 anything+0.117
 chores+0.104
 something+0.103
 trouble+0.100
 good+0.095
Neg logit contributions
y-0.187
jo-0.187
o-0.183
os-0.182
azz-0.177
Token lost
Token ID2626
Feature activation+7.15e+00

Pos logit contributions
 anything+0.189
 chores+0.185
 accomplishment+0.180
 something+0.167
 trouble+0.164
Neg logit contributions
y-0.377
o-0.366
os-0.346
jo-0.344
ppy-0.343
Token.
Token ID13
Feature activation-2.69e+00

Pos logit contributions
 anything+0.135
 chores+0.133
 accomplishment+0.127
 something+0.126
 trouble+0.123
Neg logit contributions
y-0.163
jo-0.152
o-0.151
os-0.146
ppy-0.144
Token As
Token ID1081
Feature activation+4.39e+00

Pos logit contributions
y+0.075
ppy+0.070
jo+0.068
os+0.067
ne+0.067
Neg logit contributions
 anything-0.038
 something-0.037
 himself-0.035
 trouble-0.034
 an-0.034
Token he
Token ID339
Feature activation-3.92e+00

Pos logit contributions
 anything+0.066
 chores+0.064
 yourselves+0.061
 something+0.056
 trouble+0.056
Neg logit contributions
y-0.112
o-0.110
jo-0.108
ne-0.102
os-0.101
Token looked
Token ID3114
Feature activation+3.50e+00

Pos logit contributions
y+0.081
jo+0.080
azz+0.079
ppy+0.076
o+0.076
Neg logit contributions
 something-0.079
 anything-0.079
 chores-0.076
 trouble-0.074
 miles-0.074
Token and
Token ID290
Feature activation+6.31e+00

Pos logit contributions
 anything+0.070
 chores+0.066
 something+0.064
 trouble+0.063
 accomplishment+0.063
Neg logit contributions
y-0.066
o-0.061
jo-0.061
os-0.059
ppy-0.059
Token be
Token ID307
Feature activation+9.35e+00

Pos logit contributions
 anything+0.102
 chores+0.091
 trouble+0.089
 something+0.088
 accomplishment+0.086
Neg logit contributions
y-0.155
o-0.150
ppy-0.146
ne-0.143
jo-0.142
Token careful
Token ID8161
Feature activation+2.82e+00

Pos logit contributions
 anything+0.116
 chores+0.108
 accomplishment+0.099
 trouble+0.098
 something+0.096
Neg logit contributions
y-0.275
o-0.269
jo-0.256
ppy-0.255
os-0.252
Token not
Token ID407
Feature activation+8.11e+00

Pos logit contributions
 anything+0.052
 chores+0.048
 something+0.047
 trouble+0.046
 accomplishment+0.045
Neg logit contributions
y-0.067
jo-0.065
o-0.064
os-0.062
ppy-0.062
Token to
Token ID284
Feature activation+1.02e+01

Pos logit contributions
 anything+0.137
 something+0.122
 chores+0.120
 good+0.113
 trouble+0.112
Neg logit contributions
jo-0.215
y-0.204
os-0.200
o-0.199
ppy-0.198
Token get
Token ID651
Feature activation+1.40e+01

Pos logit contributions
 anything+0.171
 chores+0.157
 something+0.154
 trouble+0.148
 accomplishment+0.143
Neg logit contributions
y-0.259
jo-0.253
o-0.250
os-0.244
ppy-0.241
Token tired
Token ID10032
Feature activation+4.68e+00

Pos logit contributions
 anything+0.188
 chores+0.176
 accomplishment+0.171
 something+0.166
 trouble+0.161
Neg logit contributions
y-0.388
o-0.374
jo-0.362
os-0.360
ppy-0.358
Token."
Token ID526
Feature activation-4.13e+00

Pos logit contributions
 anything+0.090
 chores+0.084
 something+0.083
 accomplishment+0.080
 trouble+0.080
Neg logit contributions
y-0.107
jo-0.102
o-0.101
os-0.098
ppy-0.098
Token Sam
Token ID3409
Feature activation-1.44e+01

Pos logit contributions
y+0.117
ppy+0.110
jo+0.108
os+0.106
ah+0.105
Neg logit contributions
 anything-0.056
 something-0.055
 himself-0.053
 trouble-0.051
 an-0.050
Token smiled
Token ID13541
Feature activation-9.78e-02

Pos logit contributions
azz+0.305
jo+0.301
os+0.285
oke+0.278
itty+0.278
Neg logit contributions
 something-0.278
 an-0.269
 a-0.264
 himself-0.263
 not-0.263
Token and
Token ID290
Feature activation+3.28e+00

Pos logit contributions
y+0.002
jo+0.002
o+0.002
os+0.002
ppy+0.002
Neg logit contributions
 anything-0.002
 something-0.002
 chores-0.002
 trouble-0.002
 accomplishment-0.002
Token determined
Token ID5295
Feature activation+4.14e+00

Pos logit contributions
 anything+0.104
 chores+0.101
 trouble+0.090
 yourselves+0.088
 avail+0.088
Neg logit contributions
o-0.224
y-0.221
jo-0.212
os-0.206
ppy-0.205
Token to
Token ID284
Feature activation+8.61e+00

Pos logit contributions
 anything+0.084
 chores+0.082
 accomplishment+0.079
 trouble+0.077
 something+0.076
Neg logit contributions
y-0.086
o-0.080
jo-0.080
os-0.076
ne-0.075
Token never
Token ID1239
Feature activation+8.20e+00

Pos logit contributions
 anything+0.135
 chores+0.125
 trouble+0.122
 something+0.122
 accomplishment+0.120
Neg logit contributions
y-0.209
jo-0.202
o-0.199
ne-0.199
ppy-0.196
Token get
Token ID651
Feature activation+1.42e+01

Pos logit contributions
 anything+0.131
 something+0.118
 chores+0.117
 trouble+0.113
 accomplishment+0.105
Neg logit contributions
y-0.217
jo-0.216
o-0.211
os-0.207
ppy-0.204
Token into
Token ID656
Feature activation+1.04e+01

Pos logit contributions
 accomplishment+0.190
 chores+0.189
 anything+0.184
wered+0.175
 yourselves+0.168
Neg logit contributions
y-0.380
o-0.367
os-0.347
ppy-0.345
ne-0.341
Token such
Token ID884
Feature activation+1.39e+01

Pos logit contributions
wered+0.080
 accomplishment+0.064
 chores+0.061
 middle+0.059
 End+0.058
Neg logit contributions
y-0.307
jo-0.286
ne-0.282
o-0.273
os-0.272
Token a
Token ID257
Feature activation+8.48e+00

Pos logit contributions
 chores+0.112
 yourselves+0.110
wered+0.108
 anything+0.103
 accomplishment+0.099
Neg logit contributions
y-0.407
jo-0.380
ppy-0.373
o-0.368
ne-0.368
Token disgusting
Token ID23374
Feature activation+3.72e+00

Pos logit contributions
 anything+0.098
 chores+0.088
 yourselves+0.082
 something+0.082
 himself+0.081
Neg logit contributions
y-0.252
o-0.237
jo-0.237
ne-0.232
ppy-0.227
Token situation
Token ID3074
Feature activation+1.63e+00

Pos logit contributions
 anything+0.061
 chores+0.057
 something+0.055
 trouble+0.053
 accomplishment+0.053
Neg logit contributions
y-0.095
jo-0.090
o-0.089
ppy-0.086
os-0.086
Token again
Token ID757
Feature activation+1.93e+00

Pos logit contributions
 anything+0.034
 chores+0.032
 something+0.032
 trouble+0.031
 accomplishment+0.031
Neg logit contributions
y-0.034
jo-0.033
o-0.032
os-0.031
ppy-0.031
Token!
Token ID0
Feature activation-2.43e+00

Pos logit contributions
 anything+0.042
 chores+0.040
 something+0.038
 accomplishment+0.038
 trouble+0.038
Neg logit contributions
y-0.039
jo-0.037
o-0.037
os-0.035
ppy-0.035
Token worked
Token ID3111
Feature activation+5.32e+00

Pos logit contributions
 anything+0.129
 chores+0.125
 accomplishment+0.116
 something+0.114
 trouble+0.113
Neg logit contributions
y-0.167
o-0.161
jo-0.158
os-0.156
ppy-0.151
Token so
Token ID523
Feature activation+7.75e+00

Pos logit contributions
 anything+0.087
 chores+0.082
 yourselves+0.080
wered+0.080
 trouble+0.078
Neg logit contributions
y-0.135
jo-0.123
o-0.123
os-0.120
ne-0.117
Token hard
Token ID1327
Feature activation+3.76e+00

Pos logit contributions
 anything+0.080
 chores+0.070
 yourselves+0.068
 trouble+0.065
 something+0.065
Neg logit contributions
y-0.233
jo-0.222
os-0.221
o-0.221
ne-0.216
Token to
Token ID284
Feature activation+8.99e+00

Pos logit contributions
 anything+0.068
 chores+0.063
 something+0.062
 trouble+0.060
 accomplishment+0.059
Neg logit contributions
y-0.090
jo-0.086
o-0.084
os-0.082
ne-0.081
Token bring
Token ID2222
Feature activation+9.60e+00

Pos logit contributions
 anything+0.137
 chores+0.128
 trouble+0.128
 something+0.122
 accomplishment+0.120
Neg logit contributions
y-0.220
jo-0.213
ne-0.208
o-0.208
ppy-0.207
Token such
Token ID884
Feature activation+1.39e+01

Pos logit contributions
 anything+0.086
 chores+0.079
 accomplishment+0.072
 trouble+0.069
 something+0.066
Neg logit contributions
y-0.305
jo-0.294
o-0.294
os-0.291
ne-0.283
Token a
Token ID257
Feature activation+7.44e+00

Pos logit contributions
 anything+0.124
 yourselves+0.111
 avail+0.106
 chores+0.104
wered+0.100
Neg logit contributions
y-0.407
o-0.383
jo-0.382
ne-0.372
os-0.370
Token wonderful
Token ID7932
Feature activation+5.79e+00

Pos logit contributions
 anything+0.080
 chores+0.068
 something+0.064
 trouble+0.061
 yourselves+0.060
Neg logit contributions
y-0.231
o-0.224
jo-0.223
ne-0.215
os-0.215
Token surprise
Token ID5975
Feature activation-5.73e-01

Pos logit contributions
 anything+0.072
 chores+0.064
 something+0.060
 trouble+0.058
 yourselves+0.057
Neg logit contributions
y-0.171
jo-0.164
o-0.161
ne-0.156
os-0.155
Token to
Token ID284
Feature activation+8.40e+00

Pos logit contributions
y+0.013
jo+0.013
o+0.013
os+0.013
ppy+0.013
Neg logit contributions
 anything-0.011
 something-0.010
 chores-0.010
 trouble-0.009
 accomplishment-0.009
Token them
Token ID606
Feature activation-1.82e+00

Pos logit contributions
 anything+0.087
 chores+0.083
 trouble+0.078
 accomplishment+0.075
 something+0.074
Neg logit contributions
y-0.248
jo-0.244
o-0.242
os-0.240
ne-0.232
Token.
Token ID13
Feature activation-4.07e+00

Pos logit contributions
y+0.064
jo+0.063
o+0.062
os+0.060
azz+0.060
Neg logit contributions
 anything-0.048
 something-0.045
 chores-0.044
 accomplishment-0.042
 trouble-0.042
Token
Token ID198
Feature activation-3.68e+00

Pos logit contributions
y+0.108
jo+0.105
ppy+0.102
o+0.101
os+0.101
Neg logit contributions
 anything-0.064
 something-0.061
 trouble-0.057
 chores-0.056
 many-0.055
Token
Token ID198
Feature activation-3.98e+00

Pos logit contributions
y+0.091
jo+0.088
o+0.086
os+0.085
ppy+0.085
Neg logit contributions
 anything-0.063
 something-0.059
 chores-0.059
 trouble-0.057
 accomplishment-0.055
TokenThe
Token ID464
Feature activation-7.45e-01

Pos logit contributions
y+0.093
jo+0.090
o+0.089
os+0.087
ppy+0.086
Neg logit contributions
 anything-0.075
 chores-0.070
 something-0.070
 trouble-0.067
 accomplishment-0.066
Token great
Token ID1049
Feature activation-1.79e+00

Pos logit contributions
y+0.015
jo+0.014
o+0.014
os+0.014
ne+0.013
Neg logit contributions
 anything-0.017
 chores-0.016
 something-0.015
 trouble-0.015
 accomplishment-0.015
Token Zen
Token ID14760
Feature activation-1.94e+01

Pos logit contributions
y+0.042
jo+0.039
o+0.039
os+0.038
ne+0.037
Neg logit contributions
 anything-0.035
 chores-0.032
 something-0.031
 trouble-0.030
 accomplishment-0.030
Tokeno
Token ID78
Feature activation-1.26e+01

Pos logit contributions
jo+0.194
ppy+0.191
pillar+0.185
y+0.181
azz+0.174
Neg logit contributions
 anything-0.534
 something-0.534
 not-0.520
 himself-0.501
 an-0.501
Token hugs
Token ID40657
Feature activation+4.51e-02

Pos logit contributions
jo+0.320
y+0.291
azz+0.289
itty+0.286
ppy+0.282
Neg logit contributions
 something-0.205
 anything-0.199
 good-0.194
 an-0.191
 a-0.186
Token Ben
Token ID3932
Feature activation-1.23e+01

Pos logit contributions
 anything+0.001
 something+0.000
 chores+0.000
 trouble+0.000
 himself+0.000
Neg logit contributions
y-0.001
jo-0.001
o-0.001
os-0.001
ppy-0.001
Token and
Token ID290
Feature activation+3.34e+00

Pos logit contributions
jo+0.315
azz+0.306
os+0.302
itty+0.297
y+0.292
Neg logit contributions
 something-0.196
 anything-0.194
 himself-0.189
 good-0.181
 many-0.178
Token says
Token ID1139
Feature activation-7.45e-02

Pos logit contributions
 anything+0.075
 chores+0.070
 something+0.070
 trouble+0.068
 accomplishment+0.067
Neg logit contributions
y-0.067
jo-0.065
o-0.064
os-0.062
ppy-0.061
Token saw
Token ID2497
Feature activation+4.37e+00

Pos logit contributions
y+0.175
jo+0.171
o+0.165
ppy+0.162
os+0.162
Neg logit contributions
 anything-0.214
 something-0.206
 chores-0.204
 trouble-0.199
 an-0.192
Token a
Token ID257
Feature activation+1.82e+00

Pos logit contributions
 accomplishment+0.057
 avail+0.054
 chores+0.046
 temptation+0.046
 achieve+0.046
Neg logit contributions
o-0.121
y-0.119
os-0.108
ray-0.107
jo-0.105
Token big
Token ID1263
Feature activation+1.40e+00

Pos logit contributions
 anything+0.023
 chores+0.020
 something+0.019
 accomplishment+0.018
 trouble+0.018
Neg logit contributions
y-0.054
o-0.052
jo-0.051
azz-0.049
ne-0.049
Token box
Token ID3091
Feature activation-9.04e+00

Pos logit contributions
 anything+0.027
 chores+0.025
 something+0.024
 himself+0.023
 accomplishment+0.023
Neg logit contributions
y-0.032
o-0.030
jo-0.030
os-0.028
ne-0.028
Token of
Token ID286
Feature activation+5.97e+00

Pos logit contributions
y+0.236
jo+0.233
o+0.229
os+0.225
ppy+0.224
Neg logit contributions
 anything-0.140
 chores-0.129
 something-0.129
 trouble-0.122
 accomplishment-0.119
Token Leg
Token ID3564
Feature activation-1.89e+01

Pos logit contributions
 anything+0.091
 accomplishment+0.084
 chores+0.075
 possible+0.074
 avail+0.074
Neg logit contributions
y-0.151
o-0.141
jo-0.138
itty-0.136
os-0.136
Tokenos
Token ID418
Feature activation-7.62e+00

Pos logit contributions
ppy+0.269
jo+0.266
pillar+0.261
ah+0.258
aa+0.256
Neg logit contributions
 anything-0.450
 something-0.437
 himself-0.424
 an-0.418
 such-0.418
Token that
Token ID326
Feature activation+1.20e+00

Pos logit contributions
jo+0.189
ppy+0.185
azz+0.183
anne+0.182
o+0.179
Neg logit contributions
 chores-0.121
 anything-0.118
 something-0.113
 things-0.106
 himself-0.103
Token he
Token ID339
Feature activation-3.84e+00

Pos logit contributions
 anything+0.025
 chores+0.023
 something+0.023
 accomplishment+0.022
 trouble+0.022
Neg logit contributions
y-0.026
o-0.025
jo-0.025
os-0.024
ppy-0.023
Token really
Token ID1107
Feature activation+1.92e+00

Pos logit contributions
y+0.078
jo+0.076
os+0.075
azz+0.075
itty+0.073
Neg logit contributions
 great-0.076
 something-0.075
 anything-0.074
 an-0.073
 many-0.073
Token wanted
Token ID2227
Feature activation+5.41e+00

Pos logit contributions
 good+0.040
 something+0.040
 great+0.040
 many+0.040
 chores+0.039
Neg logit contributions
azz-0.038
itty-0.038
jo-0.037
os-0.037
y-0.037
Token!
Token ID0
Feature activation-3.71e+00

Pos logit contributions
 anything+0.035
 something+0.033
 chores+0.032
 trouble+0.031
 an+0.030
Neg logit contributions
jo-0.044
y-0.044
o-0.043
os-0.042
azz-0.042
Tokenâ
Token ID22940
Feature activation-5.55e+00

Pos logit contributions
y+0.098
jo+0.093
ppy+0.089
os+0.089
o+0.088
Neg logit contributions
 anything-0.060
 something-0.057
 trouble-0.054
 chores-0.053
 an-0.052
Token
Token ID26391
Feature activation-9.35e+00

Pos logit contributions
y+0.129
jo+0.124
os+0.123
o+0.121
azz+0.120
Neg logit contributions
 anything-0.103
 something-0.096
 chores-0.095
 accomplishment-0.094
 job-0.092
Token<|endoftext|>
Token ID50256
Feature activation-7.71e+00

Pos logit contributions
y+0.213
jo+0.202
o+0.197
ppy+0.195
os+0.193
Neg logit contributions
 anything-0.183
 something-0.171
 chores-0.170
 trouble-0.165
 accomplishment-0.158
TokenPa
Token ID28875
Feature activation-9.01e+00

Pos logit contributions
jo+0.206
y+0.204
ppy+0.202
azz+0.193
o+0.192
Neg logit contributions
 anything-0.114
 something-0.109
 trouble-0.100
 an-0.098
 many-0.096
Tokenolo
Token ID14057
Feature activation-1.88e+01

Pos logit contributions
jo+0.160
y+0.157
o+0.153
os+0.152
ppy+0.141
Neg logit contributions
 anything-0.216
 something-0.207
 chores-0.202
 trouble-0.198
 an-0.195
Token was
Token ID373
Feature activation+5.99e+00

Pos logit contributions
jo+0.423
azz+0.423
itty+0.407
ppy+0.404
oke+0.395
Neg logit contributions
 something-0.327
 an-0.317
 anything-0.316
 chores-0.297
 great-0.292
Token very
Token ID845
Feature activation+6.15e+00

Pos logit contributions
 anything+0.091
 chores+0.081
 accomplishment+0.075
 trouble+0.074
 something+0.073
Neg logit contributions
o-0.165
y-0.164
jo-0.153
os-0.151
ppy-0.146
Token excited
Token ID6568
Feature activation-2.68e-02

Pos logit contributions
 anything+0.110
 chores+0.094
 avail+0.090
 something+0.089
 yourselves+0.088
Neg logit contributions
o-0.155
y-0.154
os-0.143
jo-0.137
ppy-0.135
Token.
Token ID13
Feature activation-3.22e+00

Pos logit contributions
y+0.001
jo+0.001
o+0.001
ppy+0.001
azz+0.001
Neg logit contributions
 anything-0.001
 something-0.000
 chores-0.000
 trouble-0.000
 an-0.000
Token He
Token ID679
Feature activation-4.38e+00

Pos logit contributions
y+0.074
jo+0.071
o+0.070
os+0.068
ppy+0.067
Neg logit contributions
 anything-0.062
 chores-0.058
 something-0.057
 trouble-0.055
 accomplishment-0.055
Token
Token ID198
Feature activation-2.82e+00

Pos logit contributions
y+0.060
jo+0.058
o+0.057
ppy+0.056
os+0.056
Neg logit contributions
 anything-0.041
 something-0.038
 chores-0.038
 trouble-0.036
 many-0.035
TokenOne
Token ID3198
Feature activation-2.83e+00

Pos logit contributions
y+0.073
jo+0.071
o+0.070
os+0.069
ppy+0.068
Neg logit contributions
 anything-0.046
 chores-0.043
 something-0.042
 trouble-0.040
 accomplishment-0.039
Token day
Token ID1110
Feature activation-2.87e+00

Pos logit contributions
jo+0.087
y+0.086
azz+0.084
os+0.084
o+0.083
Neg logit contributions
 anything-0.030
 something-0.028
 chores-0.027
 good-0.025
 trouble-0.025
Token,
Token ID11
Feature activation-4.42e-01

Pos logit contributions
y+0.069
o+0.066
jo+0.066
os+0.064
ppy+0.063
Neg logit contributions
 anything-0.052
 chores-0.049
 something-0.047
 trouble-0.046
 accomplishment-0.046
Token the
Token ID262
Feature activation-1.27e+00

Pos logit contributions
y+0.014
o+0.014
os+0.013
ne+0.013
ah+0.012
Neg logit contributions
 chores-0.004
 anything-0.004
 accomplishment-0.004
 trouble-0.004
 yourselves-0.004
Token hipp
Token ID18568
Feature activation-1.85e+01

Pos logit contributions
y+0.024
o+0.024
jo+0.022
os+0.021
ne+0.020
Neg logit contributions
 anything-0.030
 chores-0.028
 something-0.027
 accomplishment-0.027
 trouble-0.027
Tokeno
Token ID78
Feature activation-1.30e+01

Pos logit contributions
jo+0.168
ppy+0.156
azz+0.154
anne+0.150
anna+0.147
Neg logit contributions
 many-0.554
 something-0.525
 anything-0.522
 others-0.518
 most-0.513
Token saw
Token ID2497
Feature activation+3.48e+00

Pos logit contributions
jo+0.291
azz+0.282
anne+0.259
itty+0.259
ne+0.256
Neg logit contributions
 something-0.242
 anything-0.234
 many-0.228
 an-0.225
 trouble-0.220
Token a
Token ID257
Feature activation+1.96e+00

Pos logit contributions
 accomplishment+0.036
 chores+0.035
 anything+0.032
 avail+0.030
 tasks+0.028
Neg logit contributions
o-0.110
y-0.108
os-0.101
ppy-0.096
ne-0.095
Token little
Token ID1310
Feature activation+1.84e-01

Pos logit contributions
 anything+0.023
 chores+0.021
 something+0.019
 accomplishment+0.019
 trouble+0.019
Neg logit contributions
y-0.059
o-0.057
jo-0.056
ppy-0.054
ne-0.054
Token bird
Token ID6512
Feature activation-1.19e+01

Pos logit contributions
 anything+0.004
 something+0.004
 trouble+0.003
 chores+0.003
 good+0.003
Neg logit contributions
jo-0.004
os-0.004
azz-0.004
ppy-0.004
o-0.004
Token age
Token ID2479
Feature activation-4.07e+00

Pos logit contributions
 anything+0.030
 chores+0.028
 something+0.028
 trouble+0.027
 accomplishment+0.026
Neg logit contributions
y-0.049
jo-0.047
o-0.047
os-0.045
ppy-0.045
Token.
Token ID13
Feature activation-4.39e+00

Pos logit contributions
y+0.091
jo+0.091
o+0.090
ppy+0.087
azz+0.087
Neg logit contributions
 anything-0.078
 something-0.073
 chores-0.072
 trouble-0.069
 accomplishment-0.066
Token
Token ID198
Feature activation-3.15e+00

Pos logit contributions
y+0.123
ppy+0.120
jo+0.114
ah+0.113
azz+0.112
Neg logit contributions
 anything-0.060
 something-0.060
 many-0.057
 trouble-0.055
 an-0.054
Token
Token ID198
Feature activation-3.71e+00

Pos logit contributions
y+0.085
ppy+0.083
jo+0.081
os+0.080
o+0.079
Neg logit contributions
 anything-0.044
 something-0.043
 many-0.042
 chores-0.041
 trouble-0.039
TokenThe
Token ID464
Feature activation-2.01e+00

Pos logit contributions
y+0.090
os+0.089
jo+0.088
azz+0.088
ppy+0.088
Neg logit contributions
 anything-0.062
 something-0.061
 chores-0.059
 many-0.057
 great-0.057
Token coach
Token ID3985
Feature activation-1.84e+01

Pos logit contributions
y+0.052
o+0.049
jo+0.048
azz+0.046
os+0.046
Neg logit contributions
 anything-0.034
 something-0.029
 chores-0.029
 trouble-0.028
 miles-0.028
Token spoke
Token ID5158
Feature activation-2.16e-01

Pos logit contributions
jo+0.411
ppy+0.398
azz+0.385
ksh+0.381
itty+0.368
Neg logit contributions
 something-0.321
 the-0.308
 an-0.304
 many-0.302
 a-0.294
Token stern
Token ID26370
Feature activation-1.65e+00

Pos logit contributions
jo+0.006
y+0.006
o+0.006
ppy+0.005
os+0.005
Neg logit contributions
 anything-0.003
 something-0.003
 chores-0.003
 an-0.003
 many-0.003
Tokenly
Token ID306
Feature activation-2.98e+00

Pos logit contributions
itty+0.028
o+0.026
azz+0.025
os+0.025
ppy+0.025
Neg logit contributions
 something-0.042
 anything-0.041
 things-0.040
 good-0.040
 many-0.039
Token.
Token ID13
Feature activation-4.71e+00

Pos logit contributions
itty+0.073
o+0.073
ppy+0.073
jo+0.072
azz+0.072
Neg logit contributions
 something-0.046
 anything-0.046
 chores-0.043
 things-0.042
 many-0.042
Token
Token ID6184
Feature activation-4.99e+00

Pos logit contributions
y+0.131
ppy+0.127
itty+0.124
jo+0.123
ne+0.123
Neg logit contributions
 anything-0.065
 something-0.064
 many-0.060
 an-0.059
 himself-0.058
Token
Token ID198
Feature activation-2.98e+00

Pos logit contributions
y+0.060
jo+0.058
o+0.058
os+0.056
ppy+0.056
Neg logit contributions
 anything-0.046
 chores-0.043
 something-0.042
 trouble-0.041
 accomplishment-0.040
TokenOne
Token ID3198
Feature activation-2.87e+00

Pos logit contributions
y+0.076
jo+0.073
o+0.072
os+0.071
ppy+0.070
Neg logit contributions
 anything-0.051
 chores-0.047
 something-0.046
 trouble-0.045
 accomplishment-0.044
Token day
Token ID1110
Feature activation-3.15e+00

Pos logit contributions
jo+0.087
y+0.087
azz+0.084
os+0.084
o+0.084
Neg logit contributions
 anything-0.031
 something-0.029
 chores-0.028
 good-0.026
 trouble-0.025
Token,
Token ID11
Feature activation-6.32e-01

Pos logit contributions
y+0.076
o+0.073
jo+0.072
os+0.070
ppy+0.069
Neg logit contributions
 anything-0.057
 chores-0.054
 something-0.052
 trouble-0.051
 accomplishment-0.051
Token the
Token ID262
Feature activation-1.44e+00

Pos logit contributions
o+0.019
y+0.019
os+0.018
ah+0.017
ne+0.017
Neg logit contributions
 chores-0.007
 accomplishment-0.007
 anything-0.006
 trouble-0.006
 yourselves-0.006
Token hipp
Token ID18568
Feature activation-1.84e+01

Pos logit contributions
y+0.028
o+0.027
jo+0.025
os+0.024
ne+0.024
Neg logit contributions
 anything-0.033
 chores-0.031
 something-0.030
 accomplishment-0.030
 trouble-0.030
Tokeno
Token ID78
Feature activation-1.30e+01

Pos logit contributions
jo+0.161
ppy+0.156
azz+0.148
anne+0.145
anna+0.142
Neg logit contributions
 many-0.552
 something-0.525
 anything-0.519
 others-0.515
 himself-0.512
Token saw
Token ID2497
Feature activation+3.48e+00

Pos logit contributions
jo+0.288
azz+0.273
itty+0.261
anne+0.260
ne+0.255
Neg logit contributions
 something-0.242
 anything-0.234
 many-0.228
 an-0.224
 trouble-0.224
Token a
Token ID257
Feature activation+2.02e+00

Pos logit contributions
 accomplishment+0.038
 chores+0.036
 anything+0.033
 avail+0.031
 tasks+0.029
Neg logit contributions
o-0.111
y-0.108
os-0.101
ppy-0.096
jo-0.096
Token new
Token ID649
Feature activation+2.77e-01

Pos logit contributions
 anything+0.022
 chores+0.020
 something+0.019
 accomplishment+0.018
 trouble+0.018
Neg logit contributions
y-0.063
o-0.061
jo-0.060
os-0.058
ppy-0.058
Token friend
Token ID1545
Feature activation-7.86e+00

Pos logit contributions
 anything+0.005
 something+0.005
 chores+0.005
 trouble+0.004
 accomplishment+0.004
Neg logit contributions
y-0.007
jo-0.007
ppy-0.006
o-0.006
os-0.006
Token.
Token ID13
Feature activation-3.90e+00

Pos logit contributions
 anything+0.014
 chores+0.014
 something+0.013
 trouble+0.013
 accomplishment+0.013
Neg logit contributions
y-0.015
jo-0.014
o-0.014
os-0.013
ppy-0.013
Token
Token ID220
Feature activation-4.77e+00

Pos logit contributions
y+0.116
ppy+0.111
jo+0.108
azz+0.104
os+0.103
Neg logit contributions
 anything-0.047
 something-0.047
 himself-0.047
 many-0.047
 an-0.045
Token
Token ID198
Feature activation-2.81e+00

Pos logit contributions
y+0.118
jo+0.113
o+0.111
ppy+0.109
os+0.108
Neg logit contributions
 anything-0.083
 chores-0.077
 something-0.077
 trouble-0.073
 accomplishment-0.070
Token
Token ID198
Feature activation-3.17e+00

Pos logit contributions
y+0.072
jo+0.069
ppy+0.068
o+0.068
os+0.067
Neg logit contributions
 anything-0.045
 something-0.042
 chores-0.041
 many-0.039
 trouble-0.039
TokenThe
Token ID464
Feature activation-1.48e+00

Pos logit contributions
y+0.090
jo+0.087
o+0.087
os+0.084
ppy+0.084
Neg logit contributions
 anything-0.045
 chores-0.041
 something-0.040
 trouble-0.039
 accomplishment-0.038
Token hipp
Token ID18568
Feature activation-1.83e+01

Pos logit contributions
y+0.027
o+0.026
jo+0.024
os+0.023
ne+0.023
Neg logit contributions
 anything-0.036
 chores-0.034
 something-0.033
 trouble-0.033
 accomplishment-0.032
Tokeno
Token ID78
Feature activation-1.28e+01

Pos logit contributions
azz+0.175
ppy+0.173
jo+0.167
ches+0.157
anna+0.152
Neg logit contributions
 many-0.517
 himself-0.491
 something-0.491
 others-0.489
 most-0.488
Token found
Token ID1043
Feature activation+3.61e+00

Pos logit contributions
azz+0.302
jo+0.279
itty+0.270
oke+0.256
ne+0.254
Neg logit contributions
 many-0.215
 something-0.214
 an-0.204
 good-0.201
 top-0.199
Token some
Token ID617
Feature activation+4.47e+00

Pos logit contributions
 avail+0.051
 chores+0.049
 accomplishment+0.046
 achieve+0.042
 university+0.040
Neg logit contributions
o-0.103
y-0.097
os-0.093
ray-0.089
bug-0.086
Token vegetables
Token ID13701
Feature activation-8.87e+00

Pos logit contributions
 anything+0.065
 accomplishment+0.058
 yourselves+0.056
 avail+0.055
 chores+0.053
Neg logit contributions
y-0.121
o-0.109
ne-0.106
os-0.105
ppy-0.105
Token and
Token ID290
Feature activation+4.36e+00

Pos logit contributions
y+0.188
o+0.175
os+0.166
ppy+0.164
jo+0.163
Neg logit contributions
 anything-0.191
 chores-0.182
 accomplishment-0.175
 trouble-0.175
 something-0.172
Token bird
Token ID6512
Feature activation-1.31e+01

Pos logit contributions
 anything+0.047
 chores+0.043
 something+0.043
 accomplishment+0.041
 trouble+0.041
Neg logit contributions
y-0.050
o-0.048
jo-0.047
os-0.045
ne-0.045
Token.
Token ID13
Feature activation-4.16e+00

Pos logit contributions
jo+0.329
azz+0.317
ne+0.309
os+0.305
oke+0.304
Neg logit contributions
 anything-0.226
 something-0.218
 chores-0.195
 an-0.190
 many-0.189
Token So
Token ID1406
Feature activation-8.70e-01

Pos logit contributions
y+0.118
jo+0.115
ppy+0.113
azz+0.111
os+0.109
Neg logit contributions
 anything-0.058
 something-0.056
 many-0.052
 an-0.052
 himself-0.051
Token,
Token ID11
Feature activation-1.22e+00

Pos logit contributions
y+0.018
o+0.017
jo+0.016
ah+0.016
os+0.016
Neg logit contributions
 chores-0.018
 anything-0.018
 trouble-0.017
 yourselves-0.017
 accomplishment-0.017
Token the
Token ID262
Feature activation-2.07e+00

Pos logit contributions
y+0.036
o+0.036
os+0.033
ne+0.033
ah+0.032
Neg logit contributions
 chores-0.014
 anything-0.013
 yourselves-0.012
 accomplishment-0.012
 trouble-0.012
Token hipp
Token ID18568
Feature activation-1.83e+01

Pos logit contributions
y+0.042
o+0.040
jo+0.039
os+0.038
ppy+0.037
Neg logit contributions
 anything-0.046
 chores-0.043
 something-0.042
 trouble-0.041
 accomplishment-0.041
Tokeno
Token ID78
Feature activation-1.29e+01

Pos logit contributions
jo+0.189
anna+0.181
ppy+0.172
azz+0.171
anne+0.170
Neg logit contributions
 many-0.509
 something-0.482
 others-0.481
 himself-0.479
 most-0.477
Token used
Token ID973
Feature activation+4.70e+00

Pos logit contributions
jo+0.310
azz+0.296
os+0.283
itty+0.281
oke+0.279
Neg logit contributions
 something-0.228
 an-0.214
 good-0.213
 not-0.212
 many-0.210
Token its
Token ID663
Feature activation+4.71e+00

Pos logit contributions
 anything+0.067
 yourselves+0.067
 avail+0.064
 chores+0.062
 university+0.062
Neg logit contributions
y-0.123
o-0.121
os-0.112
ppy-0.109
bug-0.103
Token big
Token ID1263
Feature activation+2.37e+00

Pos logit contributions
 anything+0.074
 chores+0.065
 accomplishment+0.063
 trouble+0.062
 something+0.062
Neg logit contributions
y-0.125
o-0.120
ppy-0.115
jo-0.115
os-0.113
Token mouth
Token ID5422
Feature activation-3.63e+00

Pos logit contributions
 anything+0.045
 chores+0.041
 something+0.041
 trouble+0.039
 accomplishment+0.039
Neg logit contributions
y-0.055
o-0.053
jo-0.053
os-0.051
ppy-0.051
Token me
Token ID502
Feature activation-3.04e+00

Pos logit contributions
 anything+0.088
 chores+0.088
 accomplishment+0.081
 trouble+0.080
 something+0.079
Neg logit contributions
y-0.201
o-0.193
jo-0.188
ppy-0.188
os-0.187
Token."
Token ID526
Feature activation-5.32e+00

Pos logit contributions
jo+0.074
y+0.072
azz+0.069
itty+0.068
os+0.067
Neg logit contributions
 anything-0.054
 something-0.051
 chores-0.048
 trouble-0.046
 an-0.045
Token
Token ID198
Feature activation-3.81e+00

Pos logit contributions
y+0.135
jo+0.131
o+0.127
ppy+0.126
os+0.124
Neg logit contributions
 anything-0.092
 something-0.085
 chores-0.083
 trouble-0.082
 an-0.078
Token
Token ID198
Feature activation-4.00e+00

Pos logit contributions
y+0.095
jo+0.091
o+0.090
os+0.088
ppy+0.088
Neg logit contributions
 anything-0.066
 something-0.062
 chores-0.061
 trouble-0.059
 accomplishment-0.057
TokenThe
Token ID464
Feature activation-1.65e+00

Pos logit contributions
y+0.089
jo+0.086
o+0.084
os+0.083
ppy+0.081
Neg logit contributions
 anything-0.080
 something-0.076
 chores-0.075
 trouble-0.072
 accomplishment-0.070
Token coach
Token ID3985
Feature activation-1.83e+01

Pos logit contributions
y+0.037
o+0.033
ne+0.032
azz+0.032
ppy+0.031
Neg logit contributions
 anything-0.033
 miles-0.029
 something-0.028
 chores-0.028
 yourselves-0.027
Token smiled
Token ID13541
Feature activation-1.64e+00

Pos logit contributions
jo+0.410
ppy+0.374
y+0.366
azz+0.363
ety+0.361
Neg logit contributions
 something-0.350
 the-0.320
 a-0.320
 an-0.316
 not-0.312
Token and
Token ID290
Feature activation+2.02e+00

Pos logit contributions
jo+0.034
y+0.034
o+0.033
os+0.033
ppy+0.032
Neg logit contributions
 anything-0.034
 something-0.033
 chores-0.031
 trouble-0.031
 good-0.030
Token said
Token ID531
Feature activation-1.30e+00

Pos logit contributions
 something+0.047
 anything+0.045
 good+0.044
 an+0.044
 many+0.044
Neg logit contributions
jo-0.040
y-0.039
os-0.035
ppy-0.035
anne-0.033
Token,
Token ID11
Feature activation-2.77e+00

Pos logit contributions
y+0.033
jo+0.032
o+0.032
os+0.031
ppy+0.031
Neg logit contributions
 anything-0.022
 something-0.020
 chores-0.020
 trouble-0.019
 accomplishment-0.018
Token "
Token ID366
Feature activation-2.21e+00

Pos logit contributions
y+0.086
jo+0.085
os+0.081
ppy+0.080
ne+0.079
Neg logit contributions
 anything-0.032
 something-0.031
 good-0.028
 not-0.027
 himself-0.027
Token
Token ID198
Feature activation-2.84e+00

Pos logit contributions
y+0.058
jo+0.056
o+0.055
os+0.054
ppy+0.054
Neg logit contributions
 anything-0.042
 chores-0.039
 something-0.039
 trouble-0.037
 accomplishment-0.036
TokenBut
Token ID1537
Feature activation-6.96e-01

Pos logit contributions
y+0.076
o+0.073
jo+0.072
ne+0.070
ppy+0.070
Neg logit contributions
 anything-0.046
 chores-0.041
 trouble-0.040
 something-0.040
 accomplishment-0.040
Token then
Token ID788
Feature activation-3.27e-01

Pos logit contributions
y+0.018
o+0.018
ah+0.016
ppy+0.016
ne+0.016
Neg logit contributions
 chores-0.011
 trouble-0.010
 anything-0.010
 accomplishment-0.010
 mischief-0.010
Token,
Token ID11
Feature activation-7.84e-01

Pos logit contributions
y+0.007
o+0.007
jo+0.006
os+0.006
ppy+0.006
Neg logit contributions
 chores-0.006
 anything-0.006
 accomplishment-0.006
 trouble-0.006
 yourselves-0.006
Token the
Token ID262
Feature activation-1.43e+00

Pos logit contributions
o+0.021
y+0.021
os+0.019
ah+0.019
ne+0.019
Neg logit contributions
 chores-0.010
 accomplishment-0.009
 yourselves-0.009
 moderation-0.008
 mischief-0.008
Token hipp
Token ID18568
Feature activation-1.82e+01

Pos logit contributions
y+0.028
o+0.027
jo+0.025
os+0.025
ne+0.024
Neg logit contributions
 anything-0.032
 chores-0.030
 something-0.030
 accomplishment-0.029
 trouble-0.029
Tokeno
Token ID78
Feature activation-1.27e+01

Pos logit contributions
anna+0.190
jo+0.189
ppy+0.175
anne+0.172
azz+0.164
Neg logit contributions
 many-0.494
 something-0.474
 others-0.470
 anything-0.466
 himself-0.463
Token went
Token ID1816
Feature activation+2.53e+00

Pos logit contributions
jo+0.291
itty+0.271
anne+0.265
os+0.264
azz+0.263
Neg logit contributions
 something-0.231
 anything-0.219
 not-0.211
 an-0.209
 trouble-0.209
Token to
Token ID284
Feature activation+4.83e+00

Pos logit contributions
 accomplishment+0.051
 anything+0.051
 chores+0.049
 avail+0.048
 moderation+0.047
Neg logit contributions
y-0.055
o-0.054
ppy-0.047
os-0.046
azz-0.045
Token play
Token ID711
Feature activation+1.75e+00

Pos logit contributions
 anything+0.060
 accomplishment+0.060
 chores+0.058
 End+0.057
 awesome+0.056
Neg logit contributions
o-0.118
y-0.108
os-0.106
bug-0.101
jo-0.100
Token with
Token ID351
Feature activation+6.11e+00

Pos logit contributions
 chores+0.034
 accomplishment+0.033
 anything+0.033
 avail+0.032
 trouble+0.031
Neg logit contributions
y-0.038
o-0.036
ppy-0.033
os-0.032
ne-0.032
Token come
Token ID1282
Feature activation+1.18e+00

Pos logit contributions
y+0.014
jo+0.013
o+0.013
os+0.013
ppy+0.012
Neg logit contributions
 anything-0.013
 chores-0.012
 something-0.012
 trouble-0.011
 accomplishment-0.011
Token with
Token ID351
Feature activation+7.37e+00

Pos logit contributions
 anything+0.018
 something+0.017
 chores+0.016
 trouble+0.015
 an+0.015
Neg logit contributions
y-0.031
jo-0.031
ppy-0.029
os-0.029
azz-0.029
Token me
Token ID502
Feature activation-2.44e+00

Pos logit contributions
 anything+0.139
 chores+0.130
 accomplishment+0.128
 something+0.119
 trouble+0.119
Neg logit contributions
y-0.165
o-0.163
jo-0.154
os-0.153
ppy-0.151
Token,"
Token ID553
Feature activation-3.25e+00

Pos logit contributions
y+0.057
jo+0.056
o+0.054
azz+0.053
os+0.053
Neg logit contributions
 anything-0.046
 something-0.043
 chores-0.043
 trouble-0.040
 many-0.039
Token the
Token ID262
Feature activation-2.78e+00

Pos logit contributions
y+0.106
o+0.103
jo+0.103
os+0.101
ppy+0.100
Neg logit contributions
 anything-0.031
 chores-0.027
 something-0.025
 trouble-0.024
 accomplishment-0.023
Token coach
Token ID3985
Feature activation-1.82e+01

Pos logit contributions
y+0.069
o+0.064
os+0.062
jo+0.062
ne+0.061
Neg logit contributions
 anything-0.049
 chores-0.045
 something-0.042
 accomplishment-0.042
 trouble-0.042
Token says
Token ID1139
Feature activation-3.34e+00

Pos logit contributions
jo+0.394
itty+0.369
ppy+0.363
anne+0.360
y+0.357
Neg logit contributions
 the-0.321
 many-0.316
 chores-0.316
 new-0.314
 good-0.314
Token.
Token ID13
Feature activation-4.43e+00

Pos logit contributions
y+0.080
jo+0.080
o+0.077
os+0.075
ppy+0.074
Neg logit contributions
 anything-0.060
 something-0.056
 chores-0.055
 trouble-0.053
 accomplishment-0.052
Token "
Token ID366
Feature activation-1.79e+00

Pos logit contributions
y+0.122
jo+0.121
ppy+0.117
os+0.114
ne+0.114
Neg logit contributions
 anything-0.062
 something-0.061
 many-0.057
 trouble-0.057
 an-0.055
TokenI
Token ID40
Feature activation-2.96e-01

Pos logit contributions
y+0.049
jo+0.047
o+0.047
os+0.046
ppy+0.045
Neg logit contributions
 anything-0.027
 chores-0.024
 something-0.024
 trouble-0.023
 accomplishment-0.022
Token will
Token ID481
Feature activation+4.71e+00

Pos logit contributions
y+0.008
jo+0.008
o+0.007
os+0.007
azz+0.007
Neg logit contributions
 anything-0.005
 chores-0.005
 something-0.004
 great-0.004
 good-0.004
Token food
Token ID2057
Feature activation-6.32e+00

Pos logit contributions
 anything+0.069
 accomplishment+0.060
 yourselves+0.057
 chores+0.056
 miles+0.055
Neg logit contributions
y-0.152
o-0.145
os-0.141
jo-0.139
ne-0.138
Token.
Token ID13
Feature activation-3.70e+00

Pos logit contributions
y+0.154
o+0.148
os+0.142
jo+0.141
ppy+0.140
Neg logit contributions
 anything-0.113
 chores-0.105
 trouble-0.102
 accomplishment-0.101
 something-0.099
Token
Token ID198
Feature activation-2.26e+00

Pos logit contributions
y+0.094
jo+0.091
ppy+0.088
o+0.088
os+0.087
Neg logit contributions
 anything-0.063
 something-0.059
 chores-0.057
 trouble-0.055
 himself-0.054
Token
Token ID198
Feature activation-2.70e+00

Pos logit contributions
y+0.059
ppy+0.056
jo+0.056
o+0.055
os+0.055
Neg logit contributions
 anything-0.034
 something-0.033
 chores-0.032
 many-0.031
 trouble-0.031
TokenThe
Token ID464
Feature activation-1.41e+00

Pos logit contributions
y+0.070
jo+0.068
o+0.066
os+0.066
ppy+0.065
Neg logit contributions
 anything-0.043
 chores-0.041
 something-0.040
 trouble-0.038
 many-0.037
Token hipp
Token ID18568
Feature activation-1.82e+01

Pos logit contributions
y+0.030
o+0.029
jo+0.027
os+0.026
ne+0.026
Neg logit contributions
 anything-0.030
 chores-0.028
 something-0.027
 trouble-0.027
 accomplishment-0.026
Tokeno
Token ID78
Feature activation-1.27e+01

Pos logit contributions
ppy+0.173
azz+0.170
jo+0.163
anna+0.150
anne+0.148
Neg logit contributions
 many-0.526
 himself-0.499
 something-0.498
 most-0.491
 others-0.490
Token went
Token ID1816
Feature activation+2.57e+00

Pos logit contributions
azz+0.288
jo+0.268
of+0.247
ne+0.246
itty+0.244
Neg logit contributions
 something-0.224
 many-0.223
 an-0.214
 good-0.214
 chores-0.210
Token to
Token ID284
Feature activation+4.96e+00

Pos logit contributions
 anything+0.054
 chores+0.049
 trouble+0.049
 accomplishment+0.049
 something+0.048
Neg logit contributions
y-0.056
o-0.055
jo-0.051
os-0.051
ppy-0.049
Token the
Token ID262
Feature activation+2.50e+00

Pos logit contributions
 accomplishment+0.065
 anything+0.064
 chores+0.062
 End+0.060
 awesome+0.059
Neg logit contributions
o-0.126
y-0.114
os-0.111
ppy-0.106
bug-0.105
Token kitchen
Token ID9592
Feature activation-5.56e+00

Pos logit contributions
 anything+0.037
 accomplishment+0.035
 chores+0.033
 trouble+0.031
 something+0.030
Neg logit contributions
y-0.068
o-0.067
jo-0.061
os-0.061
ppy-0.060
Token very
Token ID845
Feature activation+6.59e+00

Pos logit contributions
 anything+0.085
 accomplishment+0.079
 chores+0.078
 trouble+0.073
 something+0.071
Neg logit contributions
o-0.198
y-0.198
jo-0.184
os-0.181
ppy-0.179
Token hungry
Token ID14720
Feature activation-1.23e-01

Pos logit contributions
 anything+0.086
 chores+0.076
 something+0.074
 trouble+0.070
 accomplishment+0.070
Neg logit contributions
y-0.195
o-0.192
jo-0.184
os-0.182
ppy-0.179
Token.
Token ID13
Feature activation-3.32e+00

Pos logit contributions
y+0.003
o+0.002
jo+0.002
os+0.002
ppy+0.002
Neg logit contributions
 anything-0.003
 chores-0.003
 accomplishment-0.002
 something-0.002
 trouble-0.002
Token So
Token ID1406
Feature activation-7.26e-01

Pos logit contributions
y+0.084
jo+0.082
ppy+0.079
o+0.079
os+0.079
Neg logit contributions
 anything-0.056
 something-0.052
 chores-0.051
 trouble-0.049
 an-0.048
Token the
Token ID262
Feature activation-1.78e+00

Pos logit contributions
y+0.015
o+0.014
ah+0.013
jo+0.012
ne+0.012
Neg logit contributions
 chores-0.016
 anything-0.015
 yourselves-0.015
 trouble-0.015
 accomplishment-0.015
Token hipp
Token ID18568
Feature activation-1.82e+01

Pos logit contributions
y+0.033
o+0.032
jo+0.030
os+0.029
ne+0.028
Neg logit contributions
 anything-0.043
 chores-0.040
 something-0.039
 accomplishment-0.038
 trouble-0.038
Tokeno
Token ID78
Feature activation-1.28e+01

Pos logit contributions
azz+0.180
ppy+0.180
jo+0.167
ems+0.164
anna+0.160
Neg logit contributions
 many-0.518
 himself-0.494
 most-0.481
 others-0.480
 something-0.479
Token decided
Token ID3066
Feature activation+8.50e-01

Pos logit contributions
azz+0.269
jo+0.259
itty+0.248
os+0.243
y+0.243
Neg logit contributions
 something-0.255
 anything-0.248
 many-0.245
 an-0.244
 chores-0.241
Token to
Token ID284
Feature activation+4.33e+00

Pos logit contributions
 chores+0.018
 anything+0.018
 accomplishment+0.018
 trouble+0.017
 moderation+0.016
Neg logit contributions
y-0.017
o-0.016
jo-0.015
ne-0.015
ppy-0.014
Token cook
Token ID4255
Feature activation-7.97e-01

Pos logit contributions
 anything+0.079
 accomplishment+0.075
 something+0.070
 trouble+0.070
 chores+0.069
Neg logit contributions
y-0.098
o-0.095
ppy-0.090
jo-0.090
os-0.090
Token some
Token ID617
Feature activation+5.39e+00

Pos logit contributions
o+0.022
y+0.021
os+0.020
ppy+0.020
ne+0.018
Neg logit contributions
 accomplishment-0.009
 avail-0.008
 trouble-0.008
 anything-0.008
 miles-0.008
Token."
Token ID526
Feature activation-5.98e+00

Pos logit contributions
 anything+0.018
 something+0.016
 chores+0.015
 good+0.015
 trouble+0.015
Neg logit contributions
jo-0.027
azz-0.026
y-0.026
ppy-0.026
os-0.026
Token
Token ID220
Feature activation-6.03e+00

Pos logit contributions
y+0.169
ppy+0.165
jo+0.161
os+0.156
azz+0.155
Neg logit contributions
 anything-0.081
 something-0.080
 himself-0.077
 chores-0.074
 an-0.073
Token
Token ID198
Feature activation-3.73e+00

Pos logit contributions
y+0.152
jo+0.146
o+0.143
ppy+0.142
os+0.139
Neg logit contributions
 anything-0.101
 something-0.094
 chores-0.093
 trouble-0.089
 many-0.086
Token
Token ID198
Feature activation-3.86e+00

Pos logit contributions
y+0.095
jo+0.090
o+0.089
ppy+0.089
os+0.088
Neg logit contributions
 anything-0.062
 chores-0.058
 something-0.057
 trouble-0.055
 many-0.054
TokenThe
Token ID464
Feature activation-1.88e+00

Pos logit contributions
y+0.098
jo+0.095
o+0.094
os+0.091
ppy+0.091
Neg logit contributions
 anything-0.065
 chores-0.060
 something-0.059
 trouble-0.057
 accomplishment-0.057
Token prince
Token ID19716
Feature activation-1.81e+01

Pos logit contributions
y+0.043
o+0.040
jo+0.039
ppy+0.039
ne+0.039
Neg logit contributions
 anything-0.034
 something-0.030
 yourselves-0.030
 chores-0.030
 accomplishment-0.029
Token went
Token ID1816
Feature activation+2.46e+00

Pos logit contributions
azz+0.411
jo+0.402
itty+0.399
y+0.386
os+0.385
Neg logit contributions
 something-0.315
 an-0.302
 many-0.298
 anything-0.297
 not-0.296
Token to
Token ID284
Feature activation+5.09e+00

Pos logit contributions
 anything+0.049
 chores+0.045
 trouble+0.044
 something+0.044
 accomplishment+0.043
Neg logit contributions
y-0.055
o-0.054
jo-0.052
os-0.051
azz-0.049
Token the
Token ID262
Feature activation+2.17e+00

Pos logit contributions
 anything+0.057
 chores+0.057
 End+0.055
 yourselves+0.054
 biggest+0.054
Neg logit contributions
o-0.123
y-0.122
jo-0.120
os-0.118
ne-0.116
Token battlefield
Token ID13480
Feature activation-9.46e+00

Pos logit contributions
 accomplishment+0.032
 anything+0.032
 chores+0.029
 moderation+0.027
 trouble+0.027
Neg logit contributions
y-0.054
o-0.050
jo-0.048
ne-0.048
ray-0.047
Token with
Token ID351
Feature activation+5.36e+00

Pos logit contributions
y+0.210
jo+0.203
o+0.201
os+0.196
ppy+0.194
Neg logit contributions
 anything-0.187
 something-0.175
 chores-0.174
 trouble-0.166
 accomplishment-0.164
Token One
Token ID1881
Feature activation-2.46e+00

Pos logit contributions
y+0.093
jo+0.090
ppy+0.088
o+0.087
os+0.086
Neg logit contributions
 anything-0.061
 something-0.057
 chores-0.056
 many-0.053
 trouble-0.053
Token day
Token ID1110
Feature activation-3.01e+00

Pos logit contributions
jo+0.076
azz+0.075
y+0.075
ppy+0.074
os+0.073
Neg logit contributions
 anything-0.025
 something-0.024
 chores-0.022
 good-0.022
 many-0.021
Token,
Token ID11
Feature activation-6.95e-01

Pos logit contributions
y+0.072
o+0.069
jo+0.068
os+0.066
ppy+0.065
Neg logit contributions
 anything-0.055
 chores-0.052
 something-0.050
 trouble-0.049
 accomplishment-0.049
Token the
Token ID262
Feature activation-1.39e+00

Pos logit contributions
y+0.021
o+0.021
os+0.019
itty+0.018
ne+0.018
Neg logit contributions
 chores-0.008
 accomplishment-0.008
 anything-0.008
 trouble-0.007
 yourselves-0.007
Token little
Token ID1310
Feature activation-4.36e+00

Pos logit contributions
y+0.035
o+0.034
jo+0.033
os+0.032
ppy+0.031
Neg logit contributions
 anything-0.024
 chores-0.022
 something-0.022
 trouble-0.021
 accomplishment-0.021
Token hipp
Token ID18568
Feature activation-1.81e+01

Pos logit contributions
jo+0.099
azz+0.097
y+0.095
os+0.094
ppy+0.094
Neg logit contributions
 anything-0.084
 something-0.079
 chores-0.076
 trouble-0.075
 good-0.074
Tokeno
Token ID78
Feature activation-1.26e+01

Pos logit contributions
ppy+0.157
jo+0.154
azz+0.154
anna+0.142
anne+0.139
Neg logit contributions
 many-0.535
 something-0.508
 himself-0.502
 anything-0.499
 others-0.494
Token wanted
Token ID2227
Feature activation+4.46e+00

Pos logit contributions
jo+0.282
azz+0.274
itty+0.260
anne+0.258
ppy+0.254
Neg logit contributions
 something-0.233
 anything-0.222
 many-0.220
 an-0.215
 trouble-0.207
Token to
Token ID284
Feature activation+5.80e+00

Pos logit contributions
 accomplishment+0.107
 chores+0.101
 anything+0.098
 avail+0.097
 yourselves+0.096
Neg logit contributions
o-0.081
y-0.078
os-0.070
ne-0.066
jo-0.064
Token drink
Token ID4144
Feature activation+2.81e+00

Pos logit contributions
 anything+0.105
 accomplishment+0.096
 trouble+0.095
 something+0.093
 chores+0.093
Neg logit contributions
y-0.130
o-0.129
os-0.125
jo-0.123
ne-0.121
Token some
Token ID617
Feature activation+5.70e+00

Pos logit contributions
 accomplishment+0.035
 anything+0.030
 chores+0.030
 trouble+0.030
 miles+0.030
Neg logit contributions
y-0.083
o-0.079
os-0.074
ppy-0.072
of-0.070
Token white
Token ID2330
Feature activation-5.17e+00

Pos logit contributions
jo+0.018
os+0.018
y+0.018
o+0.017
azz+0.017
Neg logit contributions
 anything-0.007
 good-0.007
 something-0.007
 great-0.007
 chores-0.006
Token hipp
Token ID18568
Feature activation-1.57e+01

Pos logit contributions
os+0.132
azz+0.125
jo+0.124
ppy+0.122
oke+0.120
Neg logit contributions
 top-0.086
 something-0.084
 new-0.080
 life-0.079
 sand-0.077
Tokeno
Token ID78
Feature activation-1.17e+01

Pos logit contributions
ppy+0.133
azz+0.122
anna+0.120
jo+0.120
anne+0.119
Neg logit contributions
 many-0.498
 something-0.473
 anything-0.464
 others-0.461
 an-0.456
Token.
Token ID13
Feature activation-4.01e+00

Pos logit contributions
azz+0.278
jo+0.263
oke+0.262
os+0.256
ppy+0.254
Neg logit contributions
 something-0.207
 anything-0.202
 an-0.191
 many-0.190
 top-0.181
Token The
Token ID383
Feature activation-7.19e-01

Pos logit contributions
y+0.109
jo+0.105
ppy+0.104
os+0.101
ne+0.101
Neg logit contributions
 anything-0.060
 something-0.055
 chores-0.053
 an-0.052
 trouble-0.052
Token hipp
Token ID18568
Feature activation-1.81e+01

Pos logit contributions
y+0.014
o+0.013
jo+0.013
os+0.012
ne+0.012
Neg logit contributions
 anything-0.017
 chores-0.016
 something-0.015
 trouble-0.015
 accomplishment-0.015
Tokeno
Token ID78
Feature activation-1.33e+01

Pos logit contributions
ppy+0.177
azz+0.173
jo+0.149
ems+0.146
ches+0.141
Neg logit contributions
 many-0.541
 himself-0.512
 most-0.502
 something-0.499
 others-0.495
Token was
Token ID373
Feature activation+6.61e+00

Pos logit contributions
azz+0.330
ppy+0.312
jo+0.311
oke+0.300
anne+0.300
Neg logit contributions
 many-0.207
 something-0.200
 an-0.194
 great-0.188
 good-0.186
Token so
Token ID523
Feature activation+5.33e+00

Pos logit contributions
 anything+0.101
 accomplishment+0.100
 chores+0.099
 avail+0.097
 tasks+0.095
Neg logit contributions
o-0.186
y-0.175
ppy-0.154
ah-0.152
jo-0.151
Token cute
Token ID13779
Feature activation-2.90e+00

Pos logit contributions
 anything+0.059
 chores+0.052
 something+0.050
 trouble+0.048
 accomplishment+0.047
Neg logit contributions
y-0.168
o-0.165
jo-0.161
os-0.159
ppy-0.156
Token and
Token ID290
Feature activation+2.91e+00

Pos logit contributions
jo+0.074
os+0.072
y+0.072
azz+0.072
ppy+0.071
Neg logit contributions
 anything-0.047
 something-0.046
 an-0.041
 many-0.040
 trouble-0.040
Token
Token ID198
Feature activation-2.47e+00

Pos logit contributions
y+0.055
ppy+0.054
jo+0.053
os+0.052
o+0.052
Neg logit contributions
 anything-0.031
 something-0.029
 chores-0.029
 many-0.028
 trouble-0.027
TokenOne
Token ID3198
Feature activation-2.80e+00

Pos logit contributions
y+0.069
jo+0.066
os+0.066
ppy+0.065
o+0.065
Neg logit contributions
 anything-0.034
 chores-0.033
 something-0.032
 many-0.030
 great-0.029
Token day
Token ID1110
Feature activation-3.13e+00

Pos logit contributions
jo+0.087
azz+0.087
y+0.086
os+0.086
ppy+0.085
Neg logit contributions
 anything-0.027
 something-0.026
 chores-0.025
 good-0.024
 great-0.023
Token,
Token ID11
Feature activation-5.68e-01

Pos logit contributions
y+0.073
o+0.070
jo+0.068
os+0.066
ppy+0.066
Neg logit contributions
 anything-0.058
 chores-0.056
 accomplishment-0.054
 trouble-0.053
 something-0.052
Token the
Token ID262
Feature activation-1.36e+00

Pos logit contributions
o+0.016
y+0.016
os+0.014
ppy+0.014
ah+0.014
Neg logit contributions
 chores-0.007
 accomplishment-0.007
 anything-0.007
 trouble-0.007
 yourselves-0.007
Token hipp
Token ID18568
Feature activation-1.81e+01

Pos logit contributions
y+0.031
o+0.029
jo+0.028
os+0.027
ne+0.027
Neg logit contributions
 anything-0.027
 chores-0.025
 something-0.025
 trouble-0.024
 accomplishment-0.024
Tokeno
Token ID78
Feature activation-1.27e+01

Pos logit contributions
azz+0.155
ppy+0.155
jo+0.148
anna+0.134
ems+0.132
Neg logit contributions
 many-0.522
 something-0.509
 most-0.499
 himself-0.496
 others-0.496
Token met
Token ID1138
Feature activation+8.19e-01

Pos logit contributions
jo+0.265
azz+0.261
itty+0.250
ppy+0.247
oke+0.242
Neg logit contributions
 something-0.231
 anything-0.222
 trouble-0.218
 an-0.217
 many-0.215
Token a
Token ID257
Feature activation+1.31e+00

Pos logit contributions
 anything+0.005
 chores+0.004
 something+0.004
 trouble+0.003
 accomplishment+0.003
Neg logit contributions
y-0.029
o-0.029
jo-0.029
os-0.028
ppy-0.028
Token wise
Token ID10787
Feature activation-2.56e+00

Pos logit contributions
 anything+0.018
 chores+0.017
 something+0.016
 trouble+0.015
 accomplishment+0.015
Neg logit contributions
y-0.037
jo-0.036
o-0.036
os-0.035
ppy-0.035
Token old
Token ID1468
Feature activation-4.50e+00

Pos logit contributions
os+0.078
jo+0.078
y+0.078
ppy+0.077
o+0.077
Neg logit contributions
 anything-0.027
 something-0.025
 trouble-0.024
 chores-0.023
 good-0.023
Token hungry
Token ID14720
Feature activation+9.58e-01

Pos logit contributions
 anything+0.082
 chores+0.072
 something+0.069
 trouble+0.064
 accomplishment+0.064
Neg logit contributions
y-0.180
o-0.176
jo-0.171
os-0.170
ppy-0.165
Token!"
Token ID2474
Feature activation-4.45e+00

Pos logit contributions
 anything+0.020
 chores+0.019
 something+0.018
 trouble+0.018
 accomplishment+0.018
Neg logit contributions
y-0.020
jo-0.020
o-0.019
os-0.019
ppy-0.019
Token
Token ID198
Feature activation-3.35e+00

Pos logit contributions
y+0.134
ppy+0.128
jo+0.127
azz+0.122
ah+0.121
Neg logit contributions
 something-0.059
 anything-0.055
 an-0.054
 many-0.052
 himself-0.051
Token
Token ID198
Feature activation-3.56e+00

Pos logit contributions
y+0.097
ppy+0.092
jo+0.092
os+0.089
ah+0.088
Neg logit contributions
 something-0.041
 anything-0.040
 many-0.040
 an-0.038
 himself-0.037
TokenThe
Token ID464
Feature activation-1.53e+00

Pos logit contributions
y+0.113
jo+0.112
os+0.109
o+0.107
azz+0.106
Neg logit contributions
 anything-0.034
 something-0.033
 chores-0.029
 many-0.029
 himself-0.028
Token hipp
Token ID18568
Feature activation-1.81e+01

Pos logit contributions
y+0.036
jo+0.035
o+0.034
os+0.034
ppy+0.034
Neg logit contributions
 anything-0.029
 chores-0.027
 something-0.027
 trouble-0.026
 accomplishment-0.026
Tokeno
Token ID78
Feature activation-1.29e+01

Pos logit contributions
ppy+0.178
azz+0.177
anna+0.162
jo+0.159
ems+0.152
Neg logit contributions
 himself-0.494
 many-0.493
 something-0.477
 most-0.474
 top-0.469
Token smiled
Token ID13541
Feature activation-7.09e-01

Pos logit contributions
azz+0.295
itty+0.280
jo+0.278
oke+0.262
os+0.258
Neg logit contributions
 something-0.224
 not-0.216
 an-0.215
 top-0.212
 himself-0.211
Token and
Token ID290
Feature activation+3.22e+00

Pos logit contributions
y+0.014
jo+0.014
o+0.013
os+0.013
ppy+0.013
Neg logit contributions
 anything-0.016
 chores-0.015
 something-0.015
 trouble-0.014
 accomplishment-0.014
Token said
Token ID531
Feature activation-2.07e-01

Pos logit contributions
 something+0.079
 anything+0.079
 an+0.074
 good+0.073
 many+0.073
Neg logit contributions
jo-0.056
y-0.056
os-0.052
azz-0.052
o-0.051
Token,
Token ID11
Feature activation-1.66e+00

Pos logit contributions
y+0.005
jo+0.005
o+0.005
os+0.005
ppy+0.005
Neg logit contributions
 anything-0.003
 something-0.003
 chores-0.003
 trouble-0.003
 accomplishment-0.003
Token to
Token ID284
Feature activation+6.83e+00

Pos logit contributions
 anything+0.077
 something+0.071
 chores+0.070
 trouble+0.068
 accomplishment+0.065
Neg logit contributions
y-0.109
jo-0.106
o-0.104
ppy-0.103
os-0.102
Token cry
Token ID3960
Feature activation-6.32e-01

Pos logit contributions
 anything+0.121
 something+0.113
 chores+0.113
 trouble+0.111
 an+0.106
Neg logit contributions
y-0.166
jo-0.164
o-0.162
os-0.160
itty-0.159
Token.
Token ID13
Feature activation-4.20e+00

Pos logit contributions
jo+0.015
o+0.015
y+0.015
os+0.015
ne+0.014
Neg logit contributions
 anything-0.012
 something-0.011
 chores-0.010
 trouble-0.010
 good-0.010
Token But
Token ID887
Feature activation-8.52e-01

Pos logit contributions
y+0.109
ppy+0.106
jo+0.104
o+0.103
azz+0.102
Neg logit contributions
 anything-0.068
 something-0.066
 an-0.061
 many-0.061
 chores-0.060
Token the
Token ID262
Feature activation-1.43e+00

Pos logit contributions
y+0.023
o+0.022
jo+0.021
os+0.021
ppy+0.021
Neg logit contributions
 anything-0.012
 chores-0.012
 trouble-0.011
 accomplishment-0.011
 yourselves-0.011
Token hipp
Token ID18568
Feature activation-1.81e+01

Pos logit contributions
y+0.031
o+0.030
jo+0.030
os+0.029
ppy+0.028
Neg logit contributions
 anything-0.029
 chores-0.027
 something-0.027
 trouble-0.026
 accomplishment-0.026
Tokeno
Token ID78
Feature activation-1.29e+01

Pos logit contributions
ppy+0.184
anna+0.158
jo+0.147
ems+0.146
azz+0.145
Neg logit contributions
 many-0.494
 not-0.480
 himself-0.480
 most-0.479
 something-0.478
Token didn
Token ID1422
Feature activation+3.05e+00

Pos logit contributions
jo+0.289
azz+0.283
os+0.280
y+0.275
itty+0.271
Neg logit contributions
 not-0.215
 something-0.213
 great-0.206
 an-0.202
 good-0.199
Token't
Token ID470
Feature activation+4.22e+00

Pos logit contributions
 anything+0.053
 something+0.048
 accomplishment+0.047
 chores+0.047
 trouble+0.046
Neg logit contributions
y-0.072
o-0.070
jo-0.069
os-0.067
azz-0.067
Token give
Token ID1577
Feature activation+7.14e+00

Pos logit contributions
 anything+0.067
 something+0.058
 good+0.057
 chores+0.056
 trouble+0.055
Neg logit contributions
jo-0.114
os-0.114
o-0.113
y-0.113
itty-0.110
Token up
Token ID510
Feature activation+4.94e+00

Pos logit contributions
 anything+0.085
 chores+0.076
 something+0.074
 trouble+0.070
 accomplishment+0.067
Neg logit contributions
y-0.216
jo-0.210
o-0.209
os-0.204
ppy-0.203
Token
Token ID198
Feature activation-2.54e+00

Pos logit contributions
y+0.052
jo+0.049
o+0.049
ppy+0.048
os+0.048
Neg logit contributions
 anything-0.036
 something-0.033
 chores-0.033
 trouble-0.032
 many-0.031
TokenOne
Token ID3198
Feature activation-2.73e+00

Pos logit contributions
y+0.063
jo+0.061
o+0.061
os+0.059
ppy+0.058
Neg logit contributions
 anything-0.044
 chores-0.041
 something-0.040
 trouble-0.039
 accomplishment-0.038
Token day
Token ID1110
Feature activation-3.05e+00

Pos logit contributions
jo+0.085
y+0.085
azz+0.083
os+0.083
ppy+0.082
Neg logit contributions
 anything-0.027
 something-0.025
 chores-0.024
 good-0.023
 great-0.022
Token,
Token ID11
Feature activation-5.60e-01

Pos logit contributions
y+0.071
o+0.068
jo+0.066
os+0.064
ppy+0.064
Neg logit contributions
 anything-0.058
 chores-0.056
 accomplishment-0.053
 trouble-0.053
 something-0.052
Token the
Token ID262
Feature activation-1.17e+00

Pos logit contributions
o+0.015
y+0.015
os+0.014
ah+0.014
ppy+0.014
Neg logit contributions
 chores-0.008
 accomplishment-0.007
 anything-0.007
 trouble-0.007
 yourselves-0.007
Token hipp
Token ID18568
Feature activation-1.80e+01

Pos logit contributions
y+0.024
o+0.023
jo+0.021
os+0.020
ne+0.020
Neg logit contributions
 anything-0.026
 chores-0.024
 something-0.024
 accomplishment-0.023
 trouble-0.023
Tokeno
Token ID78
Feature activation-1.26e+01

Pos logit contributions
jo+0.162
ppy+0.161
azz+0.159
anna+0.151
anne+0.144
Neg logit contributions
 many-0.531
 something-0.509
 himself-0.499
 anything-0.497
 others-0.494
Token saw
Token ID2497
Feature activation+3.64e+00

Pos logit contributions
jo+0.269
azz+0.261
itty+0.252
ppy+0.250
anne+0.250
Neg logit contributions
 something-0.237
 anything-0.230
 trouble-0.224
 many-0.220
 an-0.220
Token something
Token ID1223
Feature activation-1.08e+00

Pos logit contributions
 accomplishment+0.041
 chores+0.039
 anything+0.034
 avail+0.033
 tasks+0.032
Neg logit contributions
o-0.114
y-0.110
os-0.103
jo-0.100
ah-0.099
Token that
Token ID326
Feature activation+1.42e+00

Pos logit contributions
y+0.034
o+0.033
jo+0.032
os+0.031
ne+0.031
Neg logit contributions
 anything-0.011
 chores-0.010
 accomplishment-0.009
 trouble-0.009
 yourselves-0.009
Token caught
Token ID4978
Feature activation+2.45e+00

Pos logit contributions
 anything+0.032
 chores+0.032
 accomplishment+0.031
 trouble+0.030
 yourselves+0.030
Neg logit contributions
y-0.026
o-0.026
os-0.024
jo-0.024
ppy-0.022
Token love
Token ID1842
Feature activation+6.25e+00

Pos logit contributions
 anything+0.002
 chores+0.002
 something+0.002
 good+0.002
 great+0.002
Neg logit contributions
jo-0.003
y-0.003
azz-0.003
o-0.003
os-0.003
Token you
Token ID345
Feature activation-1.65e+00

Pos logit contributions
 anything+0.086
 chores+0.078
 accomplishment+0.078
 something+0.074
 trouble+0.071
Neg logit contributions
o-0.169
y-0.169
os-0.161
jo-0.159
ne-0.154
Token too
Token ID1165
Feature activation+1.84e+00

Pos logit contributions
y+0.042
jo+0.041
o+0.040
os+0.039
azz+0.039
Neg logit contributions
 anything-0.028
 chores-0.026
 something-0.026
 trouble-0.024
 accomplishment-0.024
Token.
Token ID13
Feature activation-1.75e+00

Pos logit contributions
 anything+0.037
 chores+0.034
 something+0.034
 trouble+0.033
 accomplishment+0.032
Neg logit contributions
y-0.041
jo-0.039
o-0.039
os-0.038
ppy-0.038
Token You
Token ID921
Feature activation-1.13e+00

Pos logit contributions
y+0.038
jo+0.037
o+0.036
os+0.035
ppy+0.035
Neg logit contributions
 anything-0.036
 chores-0.033
 something-0.033
 trouble-0.032
 accomplishment-0.031
Token are
Token ID389
Feature activation+7.81e+00

Pos logit contributions
y+0.027
jo+0.026
o+0.026
os+0.025
azz+0.025
Neg logit contributions
 anything-0.021
 chores-0.020
 something-0.019
 trouble-0.018
 great-0.018
Token right
Token ID826
Feature activation+3.94e+00

Pos logit contributions
 accomplishment+0.093
 chores+0.091
 anything+0.087
 miles+0.085
 bottom+0.084
Neg logit contributions
o-0.217
y-0.203
ppy-0.202
bug-0.193
os-0.185
Token.
Token ID13
Feature activation-1.85e+00

Pos logit contributions
 anything+0.074
 something+0.069
 chores+0.069
 trouble+0.066
 accomplishment+0.064
Neg logit contributions
y-0.091
jo-0.090
o-0.088
os-0.086
ppy-0.085
Token Bat
Token ID6577
Feature activation-7.61e+00

Pos logit contributions
y+0.042
jo+0.040
o+0.039
os+0.038
ppy+0.038
Neg logit contributions
 anything-0.036
 chores-0.034
 something-0.034
 trouble-0.033
 accomplishment-0.032
Tokenhing
Token ID722
Feature activation-2.62e+00

Pos logit contributions
jo+0.147
azz+0.147
y+0.141
o+0.140
ppy+0.133
Neg logit contributions
 anything-0.163
 chores-0.156
 something-0.152
 good-0.147
 yourselves-0.144
Token is
Token ID318
Feature activation+8.97e+00

Pos logit contributions
y+0.067
jo+0.066
o+0.066
os+0.064
ppy+0.063
Neg logit contributions
 anything-0.043
 chores-0.039
 something-0.039
 trouble-0.037
 accomplishment-0.035
Token around
Token ID1088
Feature activation+2.01e+00

Pos logit contributions
 anything+0.002
 something+0.002
 chores+0.001
 trouble+0.001
 accomplishment+0.001
Neg logit contributions
y-0.005
jo-0.005
o-0.005
ppy-0.005
azz-0.005
Token the
Token ID262
Feature activation+2.11e+00

Pos logit contributions
 anything+0.027
 chores+0.026
 something+0.024
 trouble+0.023
 accomplishment+0.023
Neg logit contributions
y-0.057
jo-0.055
o-0.055
os-0.054
ppy-0.053
Token lake
Token ID13546
Feature activation-9.09e+00

Pos logit contributions
 anything+0.028
 chores+0.025
 something+0.024
 trouble+0.023
 accomplishment+0.023
Neg logit contributions
y-0.061
jo-0.059
o-0.059
os-0.057
ppy-0.057
Token were
Token ID547
Feature activation+9.44e+00

Pos logit contributions
y+0.185
jo+0.180
o+0.177
ppy+0.172
os+0.171
Neg logit contributions
 anything-0.195
 something-0.184
 chores-0.182
 trouble-0.176
 accomplishment-0.173
Token lots
Token ID6041
Feature activation+4.16e+00

Pos logit contributions
 chores+0.141
 anything+0.139
 accomplishment+0.126
 avail+0.125
 moderation+0.124
Neg logit contributions
y-0.257
o-0.250
os-0.224
jo-0.222
ne-0.210
Token of
Token ID286
Feature activation+9.13e+00

Pos logit contributions
 anything+0.085
 chores+0.081
 something+0.077
 trouble+0.074
 accomplishment+0.073
Neg logit contributions
y-0.092
jo-0.086
o-0.085
os-0.083
ne-0.083
Token other
Token ID584
Feature activation+1.47e+00

Pos logit contributions
 anything+0.113
 chores+0.100
 accomplishment+0.094
 something+0.086
 avail+0.086
Neg logit contributions
y-0.267
o-0.253
jo-0.250
ne-0.245
os-0.243
Token re
Token ID302
Feature activation-8.06e-01

Pos logit contributions
 anything+0.019
 chores+0.017
 something+0.016
 accomplishment+0.016
 trouble+0.015
Neg logit contributions
y-0.043
jo-0.041
o-0.041
os-0.041
ne-0.040
Tokenind
Token ID521
Feature activation-4.55e+00

Pos logit contributions
y+0.016
jo+0.015
azz+0.015
ppy+0.015
o+0.015
Neg logit contributions
 anything-0.017
 chores-0.016
 something-0.016
 trouble-0.016
 miles-0.016
Tokeneer
Token ID28153
Feature activation-8.85e+00

Pos logit contributions
azz+0.071
jo+0.065
y+0.063
itty+0.063
ppy+0.061
Neg logit contributions
 anything-0.116
 many-0.109
 himself-0.108
 something-0.107
 things-0.106
Token.
Token ID13
Feature activation-2.98e+00

Pos logit contributions
y+0.212
jo+0.212
azz+0.205
ppy+0.204
o+0.202
Neg logit contributions
 anything-0.154
 something-0.146
 chores-0.139
 trouble-0.136
 many-0.132
Token But
Token ID887
Feature activation-9.84e-02

Pos logit contributions
y+0.062
jo+0.060
o+0.059
os+0.058
ppy+0.057
Neg logit contributions
 anything-0.048
 chores-0.045
 something-0.045
 trouble-0.043
 accomplishment-0.042
Token they
Token ID484
Feature activation-4.18e+00

Pos logit contributions
y+0.002
o+0.002
ppy+0.002
jo+0.002
os+0.002
Neg logit contributions
 anything-0.002
 trouble-0.002
 chores-0.002
 accomplishment-0.002
 progress-0.001
Token did
Token ID750
Feature activation+4.90e+00

Pos logit contributions
jo+0.096
y+0.095
azz+0.090
o+0.090
os+0.089
Neg logit contributions
 anything-0.079
 chores-0.075
 something-0.075
 trouble-0.072
 many-0.069
Token not
Token ID407
Feature activation+6.07e+00

Pos logit contributions
 accomplishment+0.056
 mischief+0.054
 trouble+0.052
 freedom+0.051
 avail+0.050
Neg logit contributions
y-0.135
o-0.129
ah-0.119
os-0.118
ne-0.114
Token like
Token ID588
Feature activation+7.25e+00

Pos logit contributions
 anything+0.097
 chores+0.090
 something+0.088
 trouble+0.084
 accomplishment+0.082
Neg logit contributions
y-0.160
jo-0.155
o-0.153
os-0.150
ppy-0.148
Token to
Token ID284
Feature activation+7.39e+00

Pos logit contributions
 chores+0.119
 anything+0.118
 accomplishment+0.118
 trouble+0.113
 avail+0.111
Neg logit contributions
y-0.176
o-0.167
os-0.166
jo-0.159
azz-0.153
Token share
Token ID2648
Feature activation+4.79e+00

Pos logit contributions
 anything+0.119
 trouble+0.113
 something+0.113
 accomplishment+0.110
 chores+0.110
Neg logit contributions
y-0.184
o-0.175
ppy-0.168
jo-0.167
os-0.167
Token.
Token ID13
Feature activation-2.65e+00

Pos logit contributions
 accomplishment+0.095
 chores+0.090
 anything+0.089
 trouble+0.086
 miles+0.085
Neg logit contributions
y-0.105
o-0.093
os-0.092
jo-0.087
ne-0.086
Token Lily
Token ID20037
Feature activation-1.21e+01

Pos logit contributions
y+0.064
jo+0.063
o+0.061
os+0.060
ppy+0.060
Neg logit contributions
 anything-0.048
 something-0.044
 chores-0.044
 trouble-0.042
 many-0.040
Token wanted
Token ID2227
Feature activation+5.12e+00

Pos logit contributions
jo+0.256
azz+0.253
os+0.244
itty+0.241
o+0.234
Neg logit contributions
 not-0.234
 something-0.233
 chores-0.230
 trouble-0.227
 anything-0.227
Token all
Token ID477
Feature activation+2.77e+00

Pos logit contributions
 accomplishment+0.107
 anything+0.101
 chores+0.100
 avail+0.097
 trouble+0.096
Neg logit contributions
y-0.106
o-0.101
os-0.095
jo-0.091
ne-0.088
Token."
Token ID526
Feature activation-4.53e+00

Pos logit contributions
jo+0.168
y+0.160
azz+0.160
os+0.157
ne+0.155
Neg logit contributions
 anything-0.122
 something-0.113
 chores-0.104
 trouble-0.103
 good-0.101
Token But
Token ID887
Feature activation+1.94e-01

Pos logit contributions
y+0.117
jo+0.113
ppy+0.110
os+0.110
o+0.109
Neg logit contributions
 anything-0.076
 something-0.071
 chores-0.070
 trouble-0.068
 himself-0.065
Token now
Token ID783
Feature activation+7.41e-01

Pos logit contributions
 chores+0.003
 tasks+0.003
 mischief+0.003
 anything+0.003
 trouble+0.003
Neg logit contributions
o-0.004
y-0.004
ah-0.004
ety-0.004
itty-0.004
Token Twe
Token ID24205
Feature activation-9.30e+00

Pos logit contributions
 anything+0.011
 chores+0.011
 accomplishment+0.010
 trouble+0.010
 something+0.010
Neg logit contributions
y-0.020
o-0.019
jo-0.019
ppy-0.018
os-0.018
Tokenety
Token ID2963
Feature activation-1.16e+01

Pos logit contributions
jo+0.119
anne+0.108
os+0.106
ne+0.098
y+0.090
Neg logit contributions
 moderation-0.233
 anything-0.231
 something-0.224
 himself-0.223
 yourselves-0.221
Token was
Token ID373
Feature activation+7.18e+00

Pos logit contributions
jo+0.272
azz+0.258
os+0.255
round+0.251
y+0.246
Neg logit contributions
 anything-0.206
 not-0.202
 something-0.201
 trouble-0.190
 good-0.189
Token gone
Token ID3750
Feature activation+2.40e+00

Pos logit contributions
 chores+0.096
 anything+0.094
 accomplishment+0.090
 yourselves+0.085
 tasks+0.084
Neg logit contributions
y-0.199
o-0.195
ppy-0.185
jo-0.180
ne-0.178
Token and
Token ID290
Feature activation+5.72e+00

Pos logit contributions
 anything+0.051
 chores+0.049
 something+0.047
 accomplishment+0.046
 trouble+0.046
Neg logit contributions
y-0.050
o-0.047
jo-0.046
ppy-0.044
os-0.044
Token Jerry
Token ID13075
Feature activation-1.24e+01

Pos logit contributions
 anything+0.117
 chores+0.111
 something+0.109
 trouble+0.106
 accomplishment+0.104
Neg logit contributions
y-0.124
jo-0.119
o-0.119
os-0.115
ppy-0.113
Token did
Token ID750
Feature activation+5.49e+00

Pos logit contributions
jo+0.268
y+0.262
azz+0.257
os+0.255
o+0.251
Neg logit contributions
 anything-0.255
 something-0.243
 chores-0.229
 trouble-0.227
 good-0.227
Token not
Token ID407
Feature activation+6.53e+00

Pos logit contributions
 biggest+0.053
 yourselves+0.053
 port+0.052
 avail+0.050
 trouble+0.049
Neg logit contributions
y-0.147
o-0.145
ah-0.137
jo-0.136
ppy-0.135
Token.
Token ID13
Feature activation-1.86e+00

Pos logit contributions
 anything+0.087
 chores+0.083
 something+0.079
 accomplishment+0.078
 trouble+0.078
Neg logit contributions
y-0.101
jo-0.095
o-0.095
os-0.093
ppy-0.091
Token Stay
Token ID16160
Feature activation+6.40e+00

Pos logit contributions
y+0.045
jo+0.043
o+0.041
os+0.041
ppy+0.040
Neg logit contributions
 anything-0.035
 something-0.033
 chores-0.032
 trouble-0.031
 an-0.030
Token calm
Token ID9480
Feature activation+3.91e+00

Pos logit contributions
 anything+0.100
 chores+0.093
 accomplishment+0.086
 trouble+0.086
 something+0.086
Neg logit contributions
y-0.172
o-0.166
jo-0.162
ppy-0.159
os-0.156
Token and
Token ID290
Feature activation+6.66e+00

Pos logit contributions
 anything+0.085
 something+0.079
 chores+0.079
 trouble+0.077
 accomplishment+0.074
Neg logit contributions
y-0.080
jo-0.078
o-0.077
os-0.075
azz-0.073
Token hold
Token ID1745
Feature activation+6.99e+00

Pos logit contributions
 anything+0.086
 chores+0.077
 trouble+0.074
 accomplishment+0.070
 something+0.070
Neg logit contributions
o-0.184
y-0.182
ppy-0.175
jo-0.173
os-0.170
Token on
Token ID319
Feature activation+8.06e+00

Pos logit contributions
 anything+0.081
 chores+0.072
 something+0.070
 trouble+0.066
 accomplishment+0.063
Neg logit contributions
y-0.215
jo-0.209
o-0.208
os-0.203
ppy-0.201
Token to
Token ID284
Feature activation+7.56e+00

Pos logit contributions
 anything+0.133
 chores+0.120
 something+0.119
 trouble+0.116
 accomplishment+0.114
Neg logit contributions
y-0.205
jo-0.197
o-0.197
ppy-0.189
os-0.189
Token your
Token ID534
Feature activation+5.39e+00

Pos logit contributions
 anything+0.083
 accomplishment+0.082
 chores+0.079
 trouble+0.075
 something+0.071
Neg logit contributions
y-0.204
ppy-0.202
o-0.201
jo-0.198
ne-0.190
Token doll
Token ID3654
Feature activation-7.92e+00

Pos logit contributions
 anything+0.091
 something+0.080
 chores+0.079
 accomplishment+0.078
 trouble+0.076
Neg logit contributions
y-0.129
jo-0.126
o-0.125
ppy-0.119
os-0.118
Token."
Token ID526
Feature activation-4.25e+00

Pos logit contributions
jo+0.191
y+0.189
azz+0.185
oke+0.184
o+0.182
Neg logit contributions
 anything-0.139
 something-0.131
 chores-0.131
 many-0.116
 good-0.114
Token
Token ID198
Feature activation-2.34e+00

Pos logit contributions
y+0.113
jo+0.110
os+0.106
ppy+0.106
o+0.105
Neg logit contributions
 anything-0.066
 something-0.063
 chores-0.060
 trouble-0.060
 many-0.058
Token is
Token ID318
Feature activation+7.62e+00

Pos logit contributions
jo+0.218
y+0.215
ppy+0.214
azz+0.206
anne+0.206
Neg logit contributions
 anything-0.151
 something-0.138
 chores-0.133
 good-0.132
 trouble-0.131
Token very
Token ID845
Feature activation+8.24e+00

Pos logit contributions
 chores+0.094
 accomplishment+0.088
 moderation+0.085
 anything+0.083
 trouble+0.082
Neg logit contributions
o-0.212
y-0.209
ppy-0.193
jo-0.190
azz-0.189
Token nice
Token ID3621
Feature activation+9.43e-01

Pos logit contributions
 anything+0.086
 chores+0.074
 something+0.071
 trouble+0.068
 accomplishment+0.064
Neg logit contributions
y-0.262
o-0.257
jo-0.253
os-0.246
azz-0.245
Token.
Token ID13
Feature activation-2.43e+00

Pos logit contributions
 anything+0.017
 something+0.016
 trouble+0.015
 chores+0.015
 good+0.014
Neg logit contributions
jo-0.023
ppy-0.022
os-0.022
o-0.022
y-0.022
Token You
Token ID921
Feature activation-1.36e+00

Pos logit contributions
y+0.062
jo+0.058
o+0.057
ppy+0.056
os+0.056
Neg logit contributions
 anything-0.043
 something-0.039
 chores-0.037
 trouble-0.037
 an-0.036
Token are
Token ID389
Feature activation+6.99e+00

Pos logit contributions
y+0.029
jo+0.029
o+0.028
os+0.027
ppy+0.026
Neg logit contributions
 anything-0.029
 chores-0.026
 something-0.026
 trouble-0.025
 great-0.025
Token lucky
Token ID9670
Feature activation+2.11e+00

Pos logit contributions
 chores+0.092
 accomplishment+0.090
 avail+0.087
 mischief+0.084
 moderation+0.082
Neg logit contributions
o-0.179
ppy-0.166
y-0.164
bug-0.164
ne-0.152
Token.
Token ID13
Feature activation-2.93e+00

Pos logit contributions
 anything+0.038
 something+0.035
 chores+0.033
 trouble+0.033
 good+0.031
Neg logit contributions
jo-0.052
y-0.052
o-0.050
ppy-0.050
os-0.049
Token I
Token ID314
Feature activation-8.39e-01

Pos logit contributions
y+0.071
jo+0.068
o+0.065
os+0.065
ppy+0.065
Neg logit contributions
 anything-0.054
 something-0.050
 chores-0.047
 trouble-0.047
 an-0.046
Token can
Token ID460
Feature activation+4.09e+00

Pos logit contributions
jo+0.022
y+0.022
o+0.021
os+0.021
azz+0.020
Neg logit contributions
 anything-0.013
 chores-0.012
 something-0.012
 trouble-0.012
 great-0.012
Token wait
Token ID4043
Feature activation+4.66e+00

Pos logit contributions
 anything+0.066
 chores+0.061
 something+0.060
 trouble+0.058
 accomplishment+0.056
Neg logit contributions
y-0.106
jo-0.103
o-0.102
os-0.100
ppy-0.099
TokenSoon
Token ID28093
Feature activation-1.08e+00

Pos logit contributions
y+0.084
jo+0.081
o+0.081
os+0.078
ppy+0.077
Neg logit contributions
 anything-0.063
 chores-0.059
 something-0.059
 trouble-0.056
 accomplishment-0.055
Token enough
Token ID1576
Feature activation-2.87e-01

Pos logit contributions
y+0.025
jo+0.023
o+0.023
azz+0.022
ne+0.022
Neg logit contributions
 chores-0.019
 yourselves-0.018
 anything-0.018
wered-0.017
 moderation-0.017
Token,
Token ID11
Feature activation-4.08e-01

Pos logit contributions
y+0.006
jo+0.005
ne+0.005
os+0.005
o+0.005
Neg logit contributions
 anything-0.005
wered-0.005
 yourselves-0.005
 chores-0.005
 accomplishment-0.005
Token they
Token ID484
Feature activation-3.85e+00

Pos logit contributions
y+0.010
azz+0.010
os+0.010
ne+0.010
o+0.010
Neg logit contributions
 chores-0.005
 yourselves-0.005
 anything-0.005
 accomplishment-0.005
 freedom-0.004
Token arrived
Token ID5284
Feature activation+2.54e+00

Pos logit contributions
jo+0.087
o+0.084
itty+0.082
y+0.082
azz+0.081
Neg logit contributions
 anything-0.073
 chores-0.069
 something-0.068
 many-0.067
 great-0.065
Token at
Token ID379
Feature activation+6.46e+00

Pos logit contributions
 anything+0.058
 chores+0.052
 miles+0.051
 avail+0.049
wered+0.049
Neg logit contributions
y-0.051
o-0.046
os-0.044
jo-0.044
ne-0.041
Token the
Token ID262
Feature activation+2.32e+00

Pos logit contributions
 anything+0.070
 chores+0.066
 yourselves+0.058
 miles+0.057
 accomplishment+0.056
Neg logit contributions
y-0.188
o-0.179
jo-0.175
os-0.174
ppy-0.166
Token store
Token ID3650
Feature activation-4.47e+00

Pos logit contributions
 anything+0.040
 accomplishment+0.036
 himself+0.034
 miles+0.034
wered+0.034
Neg logit contributions
y-0.055
ne-0.047
ray-0.047
pillar-0.047
itty-0.046
Token,
Token ID11
Feature activation+1.25e+00

Pos logit contributions
y+0.094
o+0.088
jo+0.088
os+0.085
ne+0.082
Neg logit contributions
 anything-0.096
 chores-0.089
 something-0.086
 trouble-0.085
 accomplishment-0.084
Token and
Token ID290
Feature activation+4.08e+00

Pos logit contributions
 anything+0.022
 chores+0.021
 trouble+0.019
 something+0.018
 accomplishment+0.018
Neg logit contributions
y-0.030
o-0.029
jo-0.028
os-0.028
ppy-0.027
Token Jenny
Token ID28437
Feature activation-1.32e+01

Pos logit contributions
 anything+0.065
 chores+0.060
 something+0.058
 trouble+0.057
 accomplishment+0.055
Neg logit contributions
y-0.107
jo-0.103
o-0.103
os-0.100
ppy-0.099
Token the
Token ID262
Feature activation+1.98e+00

Pos logit contributions
 anything+0.053
 something+0.046
 chores+0.044
 trouble+0.042
 accomplishment+0.039
Neg logit contributions
y-0.140
jo-0.138
o-0.137
os-0.134
ppy-0.133
Token dog
Token ID3290
Feature activation-1.11e+01

Pos logit contributions
 anything+0.039
 chores+0.037
 something+0.037
 trouble+0.036
 accomplishment+0.035
Neg logit contributions
y-0.044
jo-0.043
o-0.043
os-0.042
ppy-0.041
Token away
Token ID1497
Feature activation+2.02e+00

Pos logit contributions
jo+0.274
azz+0.265
y+0.265
o+0.262
ne+0.262
Neg logit contributions
 anything-0.194
 something-0.180
 chores-0.173
 good-0.162
 trouble-0.160
Token.
Token ID13
Feature activation-2.71e+00

Pos logit contributions
 anything+0.040
 chores+0.038
 something+0.037
 trouble+0.036
 accomplishment+0.035
Neg logit contributions
y-0.045
jo-0.043
o-0.043
os-0.042
ppy-0.041
Token Lily
Token ID20037
Feature activation-1.13e+01

Pos logit contributions
y+0.072
ppy+0.067
jo+0.067
os+0.065
o+0.065
Neg logit contributions
 anything-0.043
 something-0.041
 trouble-0.039
 chores-0.039
 an-0.038
Token was
Token ID373
Feature activation+7.04e+00

Pos logit contributions
azz+0.249
jo+0.246
os+0.245
y+0.243
oke+0.235
Neg logit contributions
 something-0.203
 not-0.196
 anything-0.193
 an-0.188
 trouble-0.187
Token very
Token ID845
Feature activation+8.26e+00

Pos logit contributions
 anything+0.105
 chores+0.101
 accomplishment+0.092
 tasks+0.089
 avail+0.088
Neg logit contributions
y-0.188
o-0.187
jo-0.179
ppy-0.174
os-0.171
Token happy
Token ID3772
Feature activation+2.82e+00

Pos logit contributions
 anything+0.134
 chores+0.122
 something+0.114
 accomplishment+0.110
 trouble+0.108
Neg logit contributions
y-0.218
o-0.212
jo-0.208
ppy-0.200
os-0.200
Token that
Token ID326
Feature activation+2.11e+00

Pos logit contributions
 anything+0.054
 something+0.051
 chores+0.050
 trouble+0.048
 accomplishment+0.047
Neg logit contributions
y-0.065
jo-0.063
o-0.062
os-0.061
ppy-0.060
Token her
Token ID607
Feature activation-7.37e-01

Pos logit contributions
 anything+0.035
 chores+0.034
 accomplishment+0.034
 yourselves+0.033
 trouble+0.032
Neg logit contributions
y-0.050
o-0.047
jo-0.046
os-0.046
azz-0.044
Token mom
Token ID1995
Feature activation-9.58e+00

Pos logit contributions
y+0.021
jo+0.020
o+0.020
os+0.020
ppy+0.020
Neg logit contributions
 anything-0.010
 chores-0.009
 something-0.009
 trouble-0.009
 accomplishment-0.008
Token.
Token ID13
Feature activation-2.48e+00

Pos logit contributions
y+0.117
jo+0.115
o+0.113
os+0.109
ppy+0.109
Neg logit contributions
 anything-0.103
 something-0.096
 chores-0.096
 trouble-0.092
 accomplishment-0.090
Token
Token ID198
Feature activation-7.71e-01

Pos logit contributions
y+0.060
jo+0.058
o+0.057
os+0.056
ppy+0.056
Neg logit contributions
 anything-0.045
 chores-0.041
 something-0.041
 trouble-0.040
 accomplishment-0.039
Token
Token ID198
Feature activation-1.35e+00

Pos logit contributions
y+0.018
jo+0.017
o+0.017
os+0.017
ppy+0.016
Neg logit contributions
 anything-0.015
 chores-0.014
 something-0.013
 trouble-0.013
 accomplishment-0.013
Token"
Token ID1
Feature activation-8.55e-01

Pos logit contributions
y+0.031
jo+0.031
o+0.030
os+0.029
ne+0.029
Neg logit contributions
 anything-0.026
 chores-0.024
 something-0.023
 trouble-0.023
 accomplishment-0.023
TokenNow
Token ID3844
Feature activation-2.25e-01

Pos logit contributions
y+0.022
jo+0.021
o+0.021
os+0.021
ppy+0.020
Neg logit contributions
 anything-0.014
 something-0.013
 chores-0.013
 trouble-0.012
 accomplishment-0.012
Token what
Token ID644
Feature activation+8.94e-01

Pos logit contributions
y+0.006
jo+0.006
o+0.006
os+0.006
ppy+0.006
Neg logit contributions
 anything-0.003
 chores-0.003
 something-0.003
 trouble-0.003
 accomplishment-0.003
Token do
Token ID466
Feature activation+2.73e+00

Pos logit contributions
 anything+0.018
 chores+0.017
 something+0.017
 trouble+0.016
 accomplishment+0.016
Neg logit contributions
y-0.020
jo-0.019
o-0.019
os-0.018
ppy-0.018
Token I
Token ID314
Feature activation+3.60e-01

Pos logit contributions
 anything+0.038
 chores+0.036
 accomplishment+0.034
 something+0.034
 trouble+0.033
Neg logit contributions
y-0.074
jo-0.072
o-0.072
os-0.070
ne-0.070
Token do
Token ID466
Feature activation+4.52e+00

Pos logit contributions
 anything+0.006
 chores+0.006
 something+0.006
 trouble+0.005
 accomplishment+0.005
Neg logit contributions
y-0.009
jo-0.009
o-0.009
os-0.008
ppy-0.008
Token?"
Token ID1701
Feature activation-3.39e+00

Pos logit contributions
 anything+0.088
 chores+0.085
 accomplishment+0.081
 trouble+0.080
wered+0.078
Neg logit contributions
y-0.098
o-0.091
jo-0.090
ne-0.089
os-0.087
Token he
Token ID339
Feature activation-4.92e+00

Pos logit contributions
y+0.082
jo+0.079
o+0.078
os+0.076
ppy+0.075
Neg logit contributions
 anything-0.061
 chores-0.057
 something-0.056
 trouble-0.054
 accomplishment-0.053
Token a
Token ID257
Feature activation+1.74e+00

Pos logit contributions
 anything+0.079
 accomplishment+0.072
 chores+0.071
 avail+0.064
 yourselves+0.060
Neg logit contributions
y-0.218
o-0.217
jo-0.200
ppy-0.199
itty-0.198
Token little
Token ID1310
Feature activation-7.44e-01

Pos logit contributions
 anything+0.021
 something+0.020
 chores+0.020
 good+0.020
 great+0.019
Neg logit contributions
jo-0.051
os-0.050
y-0.050
o-0.049
azz-0.048
Token boy
Token ID2933
Feature activation-1.16e+01

Pos logit contributions
jo+0.018
os+0.018
azz+0.017
anne+0.017
o+0.016
Neg logit contributions
 good-0.013
 something-0.012
 anything-0.012
 new-0.012
 great-0.012
Token who
Token ID508
Feature activation-1.07e+00

Pos logit contributions
azz+0.280
jo+0.268
oke+0.262
anne+0.259
ppy+0.257
Neg logit contributions
 something-0.196
 anything-0.193
 an-0.184
 many-0.181
 not-0.174
Token was
Token ID373
Feature activation+7.35e+00

Pos logit contributions
y+0.022
jo+0.022
azz+0.022
os+0.021
ne+0.021
Neg logit contributions
 anything-0.022
 chores-0.021
 something-0.021
 trouble-0.020
 good-0.020
Token very
Token ID845
Feature activation+7.01e+00

Pos logit contributions
 anything+0.100
 accomplishment+0.092
 chores+0.087
 avail+0.085
 something+0.083
Neg logit contributions
o-0.211
y-0.210
jo-0.194
ppy-0.191
os-0.189
Token hungry
Token ID14720
Feature activation-8.18e-02

Pos logit contributions
 anything+0.095
 something+0.082
 chores+0.080
 trouble+0.076
 accomplishment+0.075
Neg logit contributions
y-0.205
o-0.201
jo-0.194
os-0.191
ppy-0.190
Token.
Token ID13
Feature activation-2.83e+00

Pos logit contributions
y+0.002
jo+0.002
o+0.002
os+0.002
ppy+0.002
Neg logit contributions
 anything-0.002
 chores-0.002
 something-0.002
 trouble-0.001
 accomplishment-0.001
Token He
Token ID679
Feature activation-3.58e+00

Pos logit contributions
y+0.070
jo+0.068
o+0.066
ppy+0.066
os+0.065
Neg logit contributions
 anything-0.049
 something-0.046
 chores-0.046
 trouble-0.044
 himself-0.042
Token had
Token ID550
Feature activation+7.09e+00

Pos logit contributions
azz+0.066
os+0.064
y+0.064
jo+0.063
ne+0.060
Neg logit contributions
 anything-0.081
 something-0.081
 chores-0.079
 an-0.077
 many-0.077
Token been
Token ID587
Feature activation+8.80e+00

Pos logit contributions
 accomplishment+0.084
 avail+0.066
 anything+0.064
 chores+0.064
 progress+0.061
Neg logit contributions
o-0.213
y-0.212
jo-0.198
ppy-0.193
os-0.188
Token was
Token ID373
Feature activation+1.09e+01

Pos logit contributions
jo+0.083
y+0.082
o+0.079
os+0.078
azz+0.077
Neg logit contributions
 anything-0.079
 chores-0.074
 something-0.074
 trouble-0.070
 accomplishment-0.067
Token safe
Token ID3338
Feature activation+4.07e+00

Pos logit contributions
 anything+0.141
 chores+0.126
 yourselves+0.117
 avail+0.116
 accomplishment+0.114
Neg logit contributions
y-0.312
o-0.303
jo-0.289
ppy-0.281
os-0.279
Token and
Token ID290
Feature activation+7.13e+00

Pos logit contributions
 anything+0.088
 chores+0.083
 something+0.081
 trouble+0.079
 accomplishment+0.078
Neg logit contributions
y-0.085
o-0.080
jo-0.079
os-0.076
ppy-0.075
Token that
Token ID326
Feature activation+1.81e+00

Pos logit contributions
 anything+0.126
 chores+0.116
 trouble+0.111
 something+0.108
wered+0.107
Neg logit contributions
y-0.172
o-0.167
jo-0.158
ppy-0.153
os-0.153
Token her
Token ID607
Feature activation-1.73e-01

Pos logit contributions
 anything+0.022
 chores+0.021
 accomplishment+0.020
 trouble+0.019
 something+0.019
Neg logit contributions
y-0.051
o-0.050
jo-0.048
os-0.048
ppy-0.046
Token friend
Token ID1545
Feature activation-7.00e+00

Pos logit contributions
y+0.005
jo+0.005
o+0.005
os+0.005
ppy+0.005
Neg logit contributions
 anything-0.002
 chores-0.002
 something-0.002
 trouble-0.002
 accomplishment-0.002
Token liked
Token ID8288
Feature activation+7.73e+00

Pos logit contributions
jo+0.133
y+0.132
azz+0.130
os+0.127
ne+0.126
Neg logit contributions
 anything-0.154
 something-0.150
 chores-0.143
 trouble-0.140
 an-0.138
Token it
Token ID340
Feature activation-4.06e+00

Pos logit contributions
 anything+0.119
 chores+0.110
 something+0.107
 trouble+0.104
 accomplishment+0.104
Neg logit contributions
y-0.205
o-0.196
jo-0.196
os-0.190
ppy-0.187
Token so
Token ID523
Feature activation+5.99e+00

Pos logit contributions
y+0.100
jo+0.095
o+0.095
os+0.092
ppy+0.091
Neg logit contributions
 anything-0.071
 chores-0.067
 something-0.065
 trouble-0.063
 accomplishment-0.062
Token much
Token ID881
Feature activation+4.26e+00

Pos logit contributions
 anything+0.050
 chores+0.043
 something+0.039
 trouble+0.037
 accomplishment+0.035
Neg logit contributions
y-0.204
jo-0.197
o-0.196
os-0.190
ne-0.186
Token.
Token ID13
Feature activation-3.09e+00

Pos logit contributions
 anything+0.077
 something+0.071
 chores+0.071
 trouble+0.069
 accomplishment+0.067
Neg logit contributions
y-0.103
jo-0.100
o-0.100
os-0.097
ppy-0.097
Token They
Token ID1119
Feature activation-2.43e+00

Pos logit contributions
y+0.052
ppy+0.049
jo+0.049
os+0.048
o+0.048
Neg logit contributions
 anything-0.032
 something-0.030
 an-0.028
 chores-0.028
 himself-0.028
Token poked
Token ID47320
Feature activation+4.54e+00

Pos logit contributions
y+0.053
jo+0.052
o+0.051
os+0.049
azz+0.049
Neg logit contributions
 anything-0.049
 chores-0.046
 something-0.045
 trouble-0.043
 many-0.042
Token them
Token ID606
Feature activation-2.55e+00

Pos logit contributions
 anything+0.049
 something+0.043
 chores+0.041
 trouble+0.038
 accomplishment+0.036
Neg logit contributions
y-0.142
jo-0.140
o-0.139
os-0.136
ppy-0.135
Token with
Token ID351
Feature activation+8.19e+00

Pos logit contributions
y+0.060
jo+0.058
o+0.058
os+0.056
ppy+0.056
Neg logit contributions
 anything-0.047
 chores-0.044
 something-0.043
 trouble-0.042
 accomplishment-0.041
Token their
Token ID511
Feature activation+4.21e+00

Pos logit contributions
 anything+0.094
 chores+0.086
 yourselves+0.084
 job+0.078
 trouble+0.077
Neg logit contributions
y-0.236
o-0.223
os-0.221
jo-0.219
ppy-0.216
Token fingers
Token ID9353
Feature activation-3.48e+00

Pos logit contributions
 anything+0.073
 chores+0.068
 something+0.067
 trouble+0.065
 accomplishment+0.063
Neg logit contributions
y-0.104
jo-0.100
o-0.100
os-0.097
ppy-0.096
Token.
Token ID13
Feature activation-2.22e+00

Pos logit contributions
jo+0.079
y+0.077
o+0.077
azz+0.074
os+0.074
Neg logit contributions
 anything-0.066
 something-0.062
 chores-0.060
 an-0.056
 many-0.056
Token They
Token ID1119
Feature activation-2.57e+00

Pos logit contributions
y+0.053
jo+0.051
o+0.050
ppy+0.050
os+0.049
Neg logit contributions
 anything-0.041
 something-0.038
 chores-0.037
 trouble-0.036
 an-0.035
Token said
Token ID531
Feature activation+1.47e+00

Pos logit contributions
jo+0.051
y+0.051
o+0.049
itty+0.047
azz+0.046
Neg logit contributions
 anything-0.057
 something-0.054
 chores-0.053
 many-0.051
 trouble-0.051
Token they
Token ID484
Feature activation-3.41e+00

Pos logit contributions
 anything+0.025
 chores+0.023
 something+0.022
 trouble+0.022
 accomplishment+0.022
Neg logit contributions
y-0.037
o-0.035
jo-0.035
os-0.034
ppy-0.034
Token were
Token ID547
Feature activation+8.63e+00

Pos logit contributions
jo+0.078
y+0.076
azz+0.074
o+0.073
itty+0.073
Neg logit contributions
 anything-0.067
 something-0.063
 chores-0.061
 great-0.059
 an-0.058
Token He
Token ID679
Feature activation-3.48e+00

Pos logit contributions
y+0.096
ppy+0.094
ne+0.086
jo+0.085
o+0.085
Neg logit contributions
 anything-0.060
 something-0.058
 many-0.057
 chores-0.056
 trouble-0.056
Token regretted
Token ID44652
Feature activation+6.58e+00

Pos logit contributions
azz+0.076
jo+0.076
y+0.074
os+0.073
itty+0.071
Neg logit contributions
 something-0.066
 an-0.064
 not-0.063
 chores-0.063
 great-0.062
Token it
Token ID340
Feature activation-3.81e+00

Pos logit contributions
 middle+0.076
 trouble+0.071
 moderation+0.071
 chores+0.071
 anything+0.070
Neg logit contributions
o-0.171
y-0.168
jo-0.160
os-0.159
ppy-0.156
Token a
Token ID257
Feature activation+3.75e+00

Pos logit contributions
y+0.090
o+0.085
jo+0.084
ppy+0.082
os+0.081
Neg logit contributions
 anything-0.070
 chores-0.064
 something-0.063
 trouble-0.063
 accomplishment-0.062
Token lot
Token ID1256
Feature activation+2.19e+00

Pos logit contributions
 anything+0.032
 chores+0.028
 something+0.027
 trouble+0.025
 accomplishment+0.023
Neg logit contributions
y-0.126
jo-0.122
o-0.122
os-0.119
ppy-0.118
Token.
Token ID13
Feature activation-3.96e+00

Pos logit contributions
 anything+0.043
 chores+0.040
 something+0.040
 trouble+0.038
 accomplishment+0.038
Neg logit contributions
y-0.049
jo-0.048
o-0.047
os-0.046
ppy-0.045
Token
Token ID198
Feature activation-3.11e+00

Pos logit contributions
y+0.103
ppy+0.099
jo+0.093
o+0.093
azz+0.092
Neg logit contributions
 anything-0.066
 something-0.063
 many-0.060
 himself-0.060
 chores-0.060
Token
Token ID198
Feature activation-3.53e+00

Pos logit contributions
y+0.081
ppy+0.077
jo+0.076
o+0.075
os+0.074
Neg logit contributions
 anything-0.049
 something-0.047
 chores-0.045
 trouble-0.044
 many-0.044
TokenMark
Token ID9704
Feature activation-1.48e+01

Pos logit contributions
y+0.079
ppy+0.076
o+0.071
jo+0.071
os+0.070
Neg logit contributions
 anything-0.068
 something-0.067
 chores-0.065
 many-0.064
 top-0.064
Token had
Token ID550
Feature activation+4.93e+00

Pos logit contributions
azz+0.333
os+0.320
itty+0.316
jo+0.313
oke+0.308
Neg logit contributions
 something-0.268
 an-0.256
 a-0.243
 trouble-0.243
 not-0.241
Token spent
Token ID3377
Feature activation+6.15e+00

Pos logit contributions
 anything+0.070
 accomplishment+0.067
 chores+0.064
 avail+0.062
 yourselves+0.060
Neg logit contributions
o-0.128
y-0.126
jo-0.122
os-0.121
ne-0.117
Token it
Token ID340
Feature activation-4.77e+00

Pos logit contributions
 anything+0.053
 something+0.049
 chores+0.048
 trouble+0.047
 accomplishment+0.045
Neg logit contributions
y-0.075
jo-0.073
o-0.073
os-0.071
ppy-0.071
Token was
Token ID373
Feature activation+1.00e+01

Pos logit contributions
y+0.091
jo+0.091
o+0.090
azz+0.087
os+0.086
Neg logit contributions
 anything-0.107
 something-0.101
 chores-0.098
 trouble-0.096
 great-0.094
Token safe
Token ID3338
Feature activation+4.49e+00

Pos logit contributions
 chores+0.139
 anything+0.134
 trouble+0.124
 tasks+0.123
 yourselves+0.120
Neg logit contributions
y-0.274
o-0.268
jo-0.245
os-0.241
ppy-0.239
Token.
Token ID13
Feature activation-2.54e+00

Pos logit contributions
 anything+0.094
 chores+0.090
 something+0.086
 trouble+0.085
 accomplishment+0.084
Neg logit contributions
y-0.097
o-0.091
jo-0.090
os-0.087
ppy-0.085
Token Then
Token ID3244
Feature activation+5.58e-01

Pos logit contributions
y+0.066
jo+0.062
ppy+0.062
os+0.061
o+0.061
Neg logit contributions
 anything-0.040
 something-0.038
 an-0.036
 trouble-0.035
 chores-0.035
Token it
Token ID340
Feature activation-4.90e+00

Pos logit contributions
 anything+0.009
 chores+0.008
 something+0.008
 trouble+0.008
 accomplishment+0.008
Neg logit contributions
y-0.014
o-0.014
jo-0.014
os-0.013
ppy-0.013
Token told
Token ID1297
Feature activation+3.64e+00

Pos logit contributions
jo+0.098
y+0.097
o+0.095
azz+0.094
os+0.093
Neg logit contributions
 anything-0.100
 something-0.099
 chores-0.096
 trouble-0.093
 good-0.093
Token its
Token ID663
Feature activation+3.25e+00

Pos logit contributions
 anything+0.044
 chores+0.041
 something+0.037
 trouble+0.036
 accomplishment+0.035
Neg logit contributions
y-0.106
jo-0.104
o-0.101
ppy-0.100
os-0.099
Token mom
Token ID1995
Feature activation-8.03e+00

Pos logit contributions
 anything+0.041
 chores+0.036
 something+0.036
 trouble+0.034
 accomplishment+0.033
Neg logit contributions
y-0.095
jo-0.092
o-0.092
os-0.089
ppy-0.088
Tokenmy
Token ID1820
Feature activation-6.10e+00

Pos logit contributions
jo+0.197
y+0.197
o+0.190
azz+0.189
ne+0.189
Neg logit contributions
 anything-0.139
 something-0.131
 chores-0.126
 many-0.117
 trouble-0.116
Token about
Token ID546
Feature activation+8.89e+00

Pos logit contributions
y+0.161
jo+0.157
o+0.155
os+0.151
ppy+0.150
Neg logit contributions
 anything-0.096
 chores-0.088
 something-0.088
 trouble-0.082
 accomplishment-0.080
Token y
Token ID331
Feature activation+1.44e+00

Pos logit contributions
 anything+0.091
 chores+0.086
 something+0.080
 trouble+0.080
 accomplishment+0.077
Neg logit contributions
y-0.166
o-0.161
jo-0.156
os-0.154
ppy-0.153
Tokenummy
Token ID13513
Feature activation-3.90e-01

Pos logit contributions
 anything+0.026
 chores+0.024
 something+0.023
 trouble+0.023
 accomplishment+0.022
Neg logit contributions
y-0.035
jo-0.034
o-0.034
os-0.033
ppy-0.033
Token.
Token ID13
Feature activation-3.45e+00

Pos logit contributions
jo+0.009
y+0.009
o+0.009
azz+0.009
os+0.009
Neg logit contributions
 anything-0.007
 something-0.007
 chores-0.007
 trouble-0.007
 good-0.006
Token
Token ID198
Feature activation-2.57e+00

Pos logit contributions
y+0.068
o+0.066
jo+0.064
azz+0.064
os+0.062
Neg logit contributions
 anything-0.074
 chores-0.072
 accomplishment-0.070
 miles-0.068
 trouble-0.066
Token
Token ID198
Feature activation-3.25e+00

Pos logit contributions
y+0.061
jo+0.059
o+0.059
os+0.057
ppy+0.057
Neg logit contributions
 anything-0.047
 chores-0.044
 something-0.043
 trouble-0.042
 accomplishment-0.041
Token"
Token ID1
Feature activation-2.04e+00

Pos logit contributions
y+0.067
o+0.064
jo+0.063
ppy+0.061
os+0.061
Neg logit contributions
 anything-0.071
 chores-0.067
 something-0.065
 trouble-0.064
 accomplishment-0.063
TokenM
Token ID44
Feature activation-4.35e+00

Pos logit contributions
jo+0.053
y+0.053
o+0.051
os+0.051
azz+0.049
Neg logit contributions
 anything-0.033
 something-0.031
 chores-0.030
 trouble-0.028
 good-0.028
Tokenmm
Token ID3020
Feature activation-2.34e+00

Pos logit contributions
y+0.127
jo+0.126
o+0.122
os+0.121
ppy+0.120
Neg logit contributions
 anything-0.054
 chores-0.052
 something-0.050
 trouble-0.048
 accomplishment-0.045
Token,
Token ID11
Feature activation+8.32e-01

Pos logit contributions
y+0.059
jo+0.059
o+0.057
os+0.057
azz+0.056
Neg logit contributions
 anything-0.039
 something-0.036
 chores-0.036
 trouble-0.034
 accomplishment-0.034
Token this
Token ID428
Feature activation-1.43e+00

Pos logit contributions
 anything+0.010
 chores+0.010
 trouble+0.009
 accomplishment+0.009
 something+0.009
Neg logit contributions
y-0.024
o-0.023
os-0.022
ppy-0.022
ne-0.022
Token is
Token ID318
Feature activation+8.09e+00

Pos logit contributions
y+0.034
jo+0.033
o+0.033
os+0.032
ppy+0.032
Neg logit contributions
 anything-0.026
 chores-0.024
 something-0.024
 trouble-0.023
 accomplishment-0.022
Token the
Token ID262
Feature activation+2.20e+00

Pos logit contributions
 anything+0.113
 something+0.108
 chores+0.105
 accomplishment+0.103
 trouble+0.102
Neg logit contributions
y-0.106
jo-0.103
o-0.099
ppy-0.097
azz-0.097
Token mel
Token ID7758
Feature activation-9.17e+00

Pos logit contributions
 anything+0.040
 chores+0.037
 something+0.037
 trouble+0.036
 accomplishment+0.035
Neg logit contributions
y-0.053
jo-0.052
o-0.051
os-0.050
ppy-0.050
Tokenon
Token ID261
Feature activation-9.75e+00

Pos logit contributions
jo+0.210
y+0.206
ppy+0.196
azz+0.195
anna+0.187
Neg logit contributions
 anything-0.157
 something-0.152
 himself-0.146
 accomplishment-0.137
 top-0.137
Token thanked
Token ID26280
Feature activation+4.84e+00

Pos logit contributions
y+0.207
jo+0.204
azz+0.202
o+0.199
ppy+0.196
Neg logit contributions
 anything-0.185
 something-0.182
 chores-0.177
 accomplishment-0.176
 many-0.173
Token the
Token ID262
Feature activation+1.20e+00

Pos logit contributions
 anything+0.042
 chores+0.038
 something+0.034
 trouble+0.034
 accomplishment+0.031
Neg logit contributions
y-0.159
jo-0.155
o-0.154
os-0.150
ppy-0.150
Token bird
Token ID6512
Feature activation-1.15e+01

Pos logit contributions
 anything+0.021
 something+0.020
 chores+0.020
 trouble+0.020
 accomplishment+0.019
Neg logit contributions
y-0.030
jo-0.029
os-0.028
o-0.028
ppy-0.028
Token for
Token ID329
Feature activation+4.29e+00

Pos logit contributions
y+0.315
jo+0.313
azz+0.309
os+0.306
ne+0.304
Neg logit contributions
 anything-0.156
 something-0.148
 chores-0.139
 many-0.134
 accomplishment-0.130
Token its
Token ID663
Feature activation+4.30e+00

Pos logit contributions
 anything+0.054
 chores+0.051
 biggest+0.047
 trouble+0.047
 top+0.045
Neg logit contributions
o-0.109
y-0.106
os-0.102
jo-0.101
ppy-0.100
Token wonderful
Token ID7932
Feature activation+2.91e+00

Pos logit contributions
 anything+0.055
 yourselves+0.046
 something+0.046
 chores+0.045
 trouble+0.042
Neg logit contributions
y-0.123
jo-0.120
o-0.120
os-0.115
ppy-0.114
Token adventure
Token ID8855
Feature activation-2.49e+00

Pos logit contributions
 anything+0.034
 chores+0.031
 something+0.030
 trouble+0.029
 accomplishment+0.028
Neg logit contributions
y-0.088
jo-0.086
o-0.086
os-0.084
ppy-0.083
Token.
Token ID13
Feature activation-4.05e+00

Pos logit contributions
y+0.057
o+0.055
jo+0.055
ppy+0.054
azz+0.054
Neg logit contributions
 anything-0.047
 something-0.044
 chores-0.043
 accomplishment-0.041
 trouble-0.041
Token and
Token ID290
Feature activation+4.73e+00

Pos logit contributions
 something+0.027
 anything+0.026
 an+0.026
 many+0.025
 a+0.024
Neg logit contributions
jo-0.029
y-0.028
azz-0.027
os-0.027
ne-0.027
Token said
Token ID531
Feature activation+1.41e+00

Pos logit contributions
 anything+0.076
 something+0.075
 an+0.068
 many+0.067
 chores+0.067
Neg logit contributions
jo-0.121
y-0.120
os-0.116
azz-0.115
ne-0.115
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
 anything+0.025
 chores+0.023
 something+0.023
 trouble+0.022
 accomplishment+0.021
Neg logit contributions
y-0.035
jo-0.034
o-0.033
os-0.033
ppy-0.032
Token "
Token ID366
Feature activation-5.12e-01

Pos logit contributions
jo+0.031
y+0.031
ppy+0.030
os+0.030
itty+0.030
Neg logit contributions
 something-0.012
 anything-0.012
 an-0.011
 many-0.010
 not-0.010
TokenB
Token ID33
Feature activation-6.81e+00

Pos logit contributions
oke+0.014
jo+0.013
os+0.013
ppy+0.013
azz+0.013
Neg logit contributions
 something-0.007
 anything-0.006
 himself-0.006
 most-0.006
 middle-0.006
Tokenunny
Token ID16948
Feature activation-1.07e+01

Pos logit contributions
jo+0.092
anne+0.086
azz+0.082
os+0.079
y+0.078
Neg logit contributions
 job-0.160
 chores-0.158
 anything-0.158
 something-0.156
 good-0.153
Token,
Token ID11
Feature activation+7.04e-01

Pos logit contributions
jo+0.283
azz+0.282
y+0.274
os+0.271
ne+0.270
Neg logit contributions
 anything-0.162
 something-0.152
 chores-0.143
 good-0.138
 trouble-0.136
Token will
Token ID481
Feature activation+4.66e+00

Pos logit contributions
 anything+0.012
 chores+0.011
 something+0.011
 trouble+0.010
 accomplishment+0.010
Neg logit contributions
y-0.018
jo-0.017
o-0.017
os-0.017
ppy-0.016
Token you
Token ID345
Feature activation+3.74e-01

Pos logit contributions
 anything+0.063
 chores+0.058
 something+0.056
 trouble+0.055
 accomplishment+0.054
Neg logit contributions
y-0.131
o-0.128
jo-0.126
os-0.124
ppy-0.123
Token come
Token ID1282
Feature activation+5.25e+00

Pos logit contributions
 anything+0.006
 chores+0.005
 something+0.005
 trouble+0.005
 accomplishment+0.005
Neg logit contributions
y-0.010
jo-0.010
o-0.010
os-0.010
ppy-0.009
Token to
Token ID284
Feature activation+7.31e+00

Pos logit contributions
 anything+0.087
 chores+0.081
 something+0.079
 trouble+0.077
 accomplishment+0.076
Neg logit contributions
y-0.134
o-0.129
jo-0.128
os-0.125
ppy-0.124
Token.
Token ID13
Feature activation-3.65e+00

Pos logit contributions
 anything+0.070
 chores+0.069
 yourselves+0.064
 accomplishment+0.063
 something+0.062
Neg logit contributions
y-0.078
o-0.074
jo-0.072
ne-0.072
ppy-0.069
Token
Token ID220
Feature activation-3.97e+00

Pos logit contributions
y+0.097
jo+0.092
ppy+0.090
azz+0.088
os+0.087
Neg logit contributions
 anything-0.060
 something-0.057
 himself-0.053
 trouble-0.053
 chores-0.053
Token
Token ID198
Feature activation-2.15e+00

Pos logit contributions
y+0.101
jo+0.097
ppy+0.094
o+0.094
os+0.092
Neg logit contributions
 anything-0.066
 something-0.061
 chores-0.061
 trouble-0.058
 himself-0.056
Token
Token ID198
Feature activation-2.62e+00

Pos logit contributions
y+0.052
jo+0.050
o+0.050
os+0.049
ppy+0.048
Neg logit contributions
 anything-0.038
 chores-0.035
 something-0.035
 trouble-0.034
 accomplishment-0.033
TokenThe
Token ID464
Feature activation-9.15e-01

Pos logit contributions
y+0.072
o+0.070
jo+0.070
os+0.068
ne+0.068
Neg logit contributions
 anything-0.039
 chores-0.035
 something-0.034
 trouble-0.033
 accomplishment-0.032
Token hunter
Token ID19177
Feature activation-1.38e+01

Pos logit contributions
y+0.017
o+0.016
jo+0.016
ppy+0.016
ne+0.015
Neg logit contributions
 anything-0.021
 chores-0.019
 something-0.019
 accomplishment-0.019
 yourselves-0.019
Token thought
Token ID1807
Feature activation-1.69e+00

Pos logit contributions
jo+0.289
azz+0.281
y+0.278
os+0.273
o+0.267
Neg logit contributions
 something-0.277
 anything-0.273
 an-0.258
 trouble-0.252
 himself-0.252
Token of
Token ID286
Feature activation+4.68e+00

Pos logit contributions
y+0.044
jo+0.042
o+0.042
os+0.041
ppy+0.041
Neg logit contributions
 anything-0.027
 chores-0.025
 something-0.024
 trouble-0.023
 yourselves-0.023
Token a
Token ID257
Feature activation+3.45e+00

Pos logit contributions
 chores+0.046
 accomplishment+0.042
 trouble+0.040
 anything+0.040
 job+0.040
Neg logit contributions
y-0.130
o-0.130
jo-0.123
os-0.121
ppy-0.121
Token plan
Token ID1410
Feature activation-3.24e+00

Pos logit contributions
 anything+0.042
 chores+0.036
 something+0.035
 yourselves+0.034
 trouble+0.033
Neg logit contributions
y-0.103
jo-0.099
o-0.099
ppy-0.094
azz-0.094
Token.
Token ID13
Feature activation-3.98e+00

Pos logit contributions
y+0.074
jo+0.071
o+0.071
ppy+0.069
azz+0.068
Neg logit contributions
 anything-0.063
 something-0.059
 chores-0.058
 trouble-0.056
 accomplishment-0.054
Token put
Token ID1234
Feature activation+6.73e+00

Pos logit contributions
y+0.127
jo+0.125
o+0.123
os+0.118
azz+0.116
Neg logit contributions
 anything-0.074
 chores-0.070
 something-0.068
 trouble-0.065
 accomplishment-0.062
Token the
Token ID262
Feature activation+3.25e+00

Pos logit contributions
 anything+0.075
 accomplishment+0.070
 miles+0.067
 trouble+0.066
 chores+0.066
Neg logit contributions
y-0.200
o-0.191
os-0.184
ppy-0.175
itty-0.174
Token iron
Token ID6953
Feature activation-8.99e+00

Pos logit contributions
 anything+0.068
 chores+0.062
 accomplishment+0.062
 himself+0.062
 something+0.060
Neg logit contributions
y-0.066
ne-0.059
itty-0.058
o-0.057
ppy-0.056
Token,
Token ID11
Feature activation+2.18e+00

Pos logit contributions
jo+0.244
y+0.235
azz+0.231
o+0.231
ne+0.226
Neg logit contributions
 anything-0.134
 chores-0.124
 something-0.124
 trouble-0.111
 himself-0.109
Token the
Token ID262
Feature activation+1.86e+00

Pos logit contributions
 anything+0.032
 chores+0.030
 trouble+0.028
 something+0.028
 accomplishment+0.028
Neg logit contributions
y-0.059
o-0.058
jo-0.056
os-0.055
ppy-0.055
Token curtain
Token ID29461
Feature activation-1.26e+01

Pos logit contributions
 anything+0.035
 chores+0.033
 something+0.032
 trouble+0.031
 accomplishment+0.031
Neg logit contributions
y-0.043
jo-0.040
o-0.040
os-0.039
ppy-0.039
Token,
Token ID11
Feature activation+1.48e+00

Pos logit contributions
y+0.288
jo+0.288
o+0.280
azz+0.272
ppy+0.272
Neg logit contributions
 anything-0.238
 chores-0.223
 something-0.222
 trouble-0.211
 accomplishment-0.207
Token the
Token ID262
Feature activation+1.40e+00

Pos logit contributions
 anything+0.032
 chores+0.031
 trouble+0.030
 something+0.029
 accomplishment+0.029
Neg logit contributions
y-0.030
o-0.029
os-0.028
jo-0.028
ppy-0.027
Token bracelet
Token ID42893
Feature activation-1.13e+01

Pos logit contributions
 anything+0.024
 chores+0.022
 something+0.022
 trouble+0.021
 accomplishment+0.021
Neg logit contributions
y-0.034
jo-0.033
o-0.033
os-0.032
ppy-0.031
Token and
Token ID290
Feature activation+4.30e+00

Pos logit contributions
jo+0.275
y+0.271
o+0.269
azz+0.261
os+0.258
Neg logit contributions
 anything-0.196
 chores-0.185
 something-0.182
 trouble-0.171
 good-0.169
Token the
Token ID262
Feature activation+3.35e+00

Pos logit contributions
 anything+0.039
 chores+0.037
 accomplishment+0.033
 trouble+0.032
 something+0.030
Neg logit contributions
y-0.134
o-0.130
os-0.128
jo-0.125
ppy-0.124
Tokenade
Token ID671
Feature activation-9.05e+00

Pos logit contributions
y+0.086
jo+0.084
o+0.083
azz+0.079
os+0.079
Neg logit contributions
 anything-0.080
 chores-0.077
 something-0.075
 trouble-0.073
 accomplishment-0.071
Token!"
Token ID2474
Feature activation-4.21e+00

Pos logit contributions
y+0.203
jo+0.201
o+0.197
azz+0.191
os+0.191
Neg logit contributions
 anything-0.175
 chores-0.163
 something-0.163
 trouble-0.156
 accomplishment-0.152
Token
Token ID198
Feature activation-2.50e+00

Pos logit contributions
y+0.113
ppy+0.108
o+0.107
jo+0.104
os+0.103
Neg logit contributions
 anything-0.065
 something-0.064
 trouble-0.059
 an-0.058
 many-0.057
Token
Token ID198
Feature activation-2.72e+00

Pos logit contributions
y+0.069
ppy+0.065
o+0.064
jo+0.062
os+0.062
Neg logit contributions
 anything-0.036
 something-0.035
 chores-0.033
 many-0.033
 trouble-0.033
TokenThe
Token ID464
Feature activation-6.16e-01

Pos logit contributions
y+0.066
o+0.062
ppy+0.061
jo+0.060
os+0.060
Neg logit contributions
 anything-0.048
 something-0.047
 chores-0.045
 many-0.044
 trouble-0.043
Token mole
Token ID9411
Feature activation-1.57e+01

Pos logit contributions
y+0.014
o+0.013
jo+0.013
ne+0.013
ppy+0.012
Neg logit contributions
 anything-0.012
 chores-0.011
 something-0.010
 accomplishment-0.010
 trouble-0.010
Token smiled
Token ID13541
Feature activation-3.72e-01

Pos logit contributions
azz+0.335
jo+0.334
os+0.322
y+0.319
o+0.312
Neg logit contributions
 something-0.306
 himself-0.283
 not-0.281
 top-0.279
 an-0.278
Token and
Token ID290
Feature activation+3.32e+00

Pos logit contributions
y+0.008
jo+0.007
o+0.007
os+0.007
ppy+0.007
Neg logit contributions
 anything-0.008
 chores-0.007
 something-0.007
 trouble-0.007
 accomplishment-0.007
Token said
Token ID531
Feature activation+2.41e-02

Pos logit contributions
 anything+0.080
 something+0.079
 accomplishment+0.075
 trouble+0.074
 an+0.074
Neg logit contributions
y-0.059
jo-0.058
o-0.055
os-0.055
ppy-0.054
Token,
Token ID11
Feature activation-1.45e+00

Pos logit contributions
 anything+0.000
 something+0.000
 chores+0.000
 trouble+0.000
 accomplishment+0.000
Neg logit contributions
y-0.001
jo-0.001
o-0.001
os-0.001
ppy-0.001
Token
Token ID198
Feature activation-2.26e+00

Pos logit contributions
y+0.040
jo+0.039
o+0.038
os+0.038
ppy+0.037
Neg logit contributions
 anything-0.021
 something-0.020
 chores-0.019
 an-0.019
 accomplishment-0.018