TOP ACTIVATIONS
MAX = 0.177

friends thanked her for being such a good leader . Lily
She thanked him for being such a good friend . From
being persistent enough to find such a special place .
Thank you Jack for finding such a nice shelter ."
of her for picking out such a pretty dress . Lily
so glad I could deliver such joy to you , my
thanked his Uncle for recommending such a great toy . They
, because you have been such a good girl . Now
proud of him for being such a good dog and taking
a big hug for being such a helpful , organized little
says , " You are such good twins . You are
home because it had been such a lovely night . <|endoftext|>
her parents for showing her such an amazing sight . From
a big bone for being such a good protector .
proud of him for being such a good listener .
thank you for giving me such good advice ! â
good twins . You are such good siblings . You support
to be able to see such a pretty view ."
was very happy to have such a special friend . <|endoftext|>
happy you and Billy are such good friends ." <|endoftext|> Once

MIN ACTIVATIONS
MIN = -0.204

. The great Zen o hugs Ben and says
bird . So , the hipp o used its big mouth
a small bird named B ibi , fly away from his
One day , the hipp o saw a little bird
One day , the hipp o saw a new friend
One day , the little hipp o wanted to drink some
very hungry . So the hipp o decided to cook some
One day , the hipp o met a wise old
food . The hipp o went to the kitchen
One day , Cle ary was walking through the
The se ag ull tried to take the oy
Jack asked the mer maid why the waves were rocking
sad . The kid stayed sad all day ,
. The hipp o found some vegetables and
One day , the bull saw a butterfly . He
in a river . The hipp o was very happy and
. The wolf went away , but Snow
One day , the hipp o saw something that caught
from the tank . Hel io was thankful and thanked T
One day , the pig gy saw a big dog

INTERVAL 0.082 - 0.177
CONTAINS 1.797%

He knew he was safe in his crib .
but he didn 't have enough money to buy it .
was three years old and liked to dance . She went
often asked his parents questions about the world around him .
He knew he should have stayed with the egg like his

INTERVAL -0.013 - 0.082
CONTAINS 61.843%

home . The next day , Lily went to
was very strong . They worked together to make a plan
afternoon . They laughed and smiled as they shared their yellow
They didn 't know what to do . But then ,
them . He was wearing a suit and sunglasses . He

INTERVAL -0.109 - -0.013
CONTAINS 30.898%

to surrender and stay still . The bear sniff ed her
that fighting does not help anyone . <|endoftext|> One day ,
and Ben are twins . They like to play in the
in finding the perfect fork ! She used the long fork
. He ran away fast . Tom and Anna

INTERVAL -0.204 - -0.109
CONTAINS 5.462%

Ben said . Mom and dad looked at the
even more spicy and the family felt their mouths begin to
big dog named Max . Max was very hungry and wanted
. They still like the jet . They hope to see
shapes . She made a doll , a car and even
Token friends
Token ID2460
Feature activation-3.96e-02

Pos logit contributions
jo+0.428
azz+0.426
os+0.394
anne+0.383
o+0.380
Neg logit contributions
 top-0.287
 good-0.276
 something-0.274
 many-0.271
 things-0.270
Token thanked
Token ID26280
Feature activation+9.28e-02

Pos logit contributions
jo+0.104
azz+0.101
o+0.100
os+0.097
ppy+0.096
Neg logit contributions
 anything-0.064
 something-0.063
 new-0.061
 top-0.061
 things-0.060
Token her
Token ID607
Feature activation+2.69e-02

Pos logit contributions
 top+0.089
 anything+0.089
 new+0.081
 something+0.081
 bottom+0.081
Neg logit contributions
jo-0.303
o-0.300
azz-0.293
os-0.291
ppy-0.289
Token for
Token ID329
Feature activation+8.34e-02

Pos logit contributions
 anything+0.041
 top+0.040
 bottom+0.038
 something+0.038
 chores+0.038
Neg logit contributions
jo-0.072
o-0.071
azz-0.069
os-0.068
ppy-0.068
Token being
Token ID852
Feature activation+1.00e-01

Pos logit contributions
 top+0.101
 bottom+0.100
 biggest+0.090
 chores+0.085
 anything+0.083
Neg logit contributions
o-0.245
jo-0.239
ppy-0.230
os-0.228
azz-0.226
Token such
Token ID884
Feature activation+1.69e-01

Pos logit contributions
 top+0.117
 bottom+0.115
 chores+0.107
 anything+0.101
 sand+0.093
Neg logit contributions
o-0.318
jo-0.304
os-0.295
ppy-0.294
azz-0.293
Token a
Token ID257
Feature activation+8.49e-02

Pos logit contributions
 bottom+0.199
 top+0.179
 chores+0.153
 port+0.153
 anything+0.152
Neg logit contributions
o-0.500
ppy-0.468
jo-0.465
bug-0.440
itty-0.440
Token good
Token ID922
Feature activation+6.82e-02

Pos logit contributions
 anything+0.096
 top+0.088
 bottom+0.086
 chores+0.085
 something+0.079
Neg logit contributions
o-0.257
jo-0.256
azz-0.242
ppy-0.239
itty-0.237
Token leader
Token ID3554
Feature activation-3.53e-02

Pos logit contributions
 top+0.128
 anything+0.127
 bottom+0.121
 chores+0.120
 something+0.119
Neg logit contributions
jo-0.158
o-0.157
azz-0.151
ppy-0.149
os-0.146
Token.
Token ID13
Feature activation-3.84e-02

Pos logit contributions
jo+0.089
o+0.088
azz+0.086
os+0.085
ppy+0.084
Neg logit contributions
 anything-0.060
 top-0.059
 something-0.058
 new-0.058
 things-0.056
Token Lily
Token ID20037
Feature activation-1.00e-01

Pos logit contributions
ppy+0.102
jo+0.102
os+0.098
azz+0.097
o+0.097
Neg logit contributions
 top-0.057
 new-0.054
 something-0.054
 many-0.053
 anything-0.053
Token She
Token ID1375
Feature activation-1.39e-02

Pos logit contributions
ppy+0.083
jo+0.083
os+0.080
o+0.079
azz+0.079
Neg logit contributions
 top-0.051
 something-0.050
 anything-0.048
 new-0.048
 many-0.047
Token thanked
Token ID26280
Feature activation+5.88e-02

Pos logit contributions
jo+0.029
o+0.028
azz+0.028
os+0.027
ppy+0.027
Neg logit contributions
 top-0.030
 anything-0.029
 something-0.029
 new-0.029
 great-0.029
Token him
Token ID683
Feature activation-3.29e-02

Pos logit contributions
 anything+0.057
 top+0.057
 new+0.052
 bottom+0.052
 something+0.052
Neg logit contributions
jo-0.191
o-0.189
azz-0.185
os-0.184
ppy-0.182
Token for
Token ID329
Feature activation+6.38e-02

Pos logit contributions
jo+0.094
o+0.092
os+0.091
azz+0.089
anne+0.087
Neg logit contributions
 new-0.048
 top-0.046
 anything-0.046
 something-0.045
 things-0.044
Token being
Token ID852
Feature activation+8.99e-02

Pos logit contributions
 top+0.094
 bottom+0.093
 biggest+0.092
 most+0.082
 anything+0.080
Neg logit contributions
o-0.174
jo-0.158
ppy-0.158
os-0.156
azz-0.150
Token such
Token ID884
Feature activation+1.61e-01

Pos logit contributions
 top+0.128
 bottom+0.128
 chores+0.117
 anything+0.109
 shore+0.104
Neg logit contributions
o-0.267
jo-0.249
ppy-0.245
os-0.243
azz-0.241
Token a
Token ID257
Feature activation+8.32e-02

Pos logit contributions
 bottom+0.223
 top+0.196
 shore+0.184
 port+0.180
 avail+0.178
Neg logit contributions
o-0.429
ppy-0.405
jo-0.379
bug-0.377
y-0.367
Token good
Token ID922
Feature activation+6.42e-02

Pos logit contributions
 anything+0.089
 top+0.082
 bottom+0.081
 chores+0.076
wered+0.073
Neg logit contributions
o-0.259
jo-0.252
azz-0.240
itty-0.239
ppy-0.238
Token friend
Token ID1545
Feature activation-5.18e-02

Pos logit contributions
 anything+0.118
 top+0.118
 bottom+0.112
 something+0.110
 chores+0.110
Neg logit contributions
jo-0.151
o-0.151
azz-0.144
ppy-0.143
os-0.140
Token.
Token ID13
Feature activation-4.45e-02

Pos logit contributions
jo+0.142
azz+0.137
o+0.137
os+0.136
ppy+0.133
Neg logit contributions
 anything-0.080
 something-0.077
 new-0.074
 things-0.073
 top-0.072
Token From
Token ID3574
Feature activation+3.28e-02

Pos logit contributions
ppy+0.123
jo+0.121
os+0.115
azz+0.115
ring+0.113
Neg logit contributions
 top-0.061
 many-0.059
 new-0.058
 the-0.056
 something-0.056
Token being
Token ID852
Feature activation+9.74e-02

Pos logit contributions
 top+0.084
 biggest+0.084
 bottom+0.083
 chores+0.077
 anything+0.073
Neg logit contributions
o-0.226
jo-0.221
ppy-0.220
os-0.213
azz-0.209
Token persistent
Token ID16218
Feature activation+5.77e-02

Pos logit contributions
 top+0.143
 bottom+0.138
 chores+0.132
 anything+0.126
 sand+0.121
Neg logit contributions
o-0.279
jo-0.268
ppy-0.262
os-0.261
azz-0.255
Token enough
Token ID1576
Feature activation+9.33e-02

Pos logit contributions
 anything+0.100
 top+0.100
 new+0.096
 something+0.096
 bottom+0.094
Neg logit contributions
jo-0.144
o-0.142
azz-0.138
os-0.136
ppy-0.135
Token to
Token ID284
Feature activation+8.62e-02

Pos logit contributions
 top+0.181
 anything+0.178
 bottom+0.176
 biggest+0.168
 something+0.166
Neg logit contributions
o-0.210
jo-0.206
os-0.196
ppy-0.196
azz-0.193
Token find
Token ID1064
Feature activation+9.08e-02

Pos logit contributions
 biggest+0.138
 top+0.134
 anything+0.133
 most+0.129
 new+0.129
Neg logit contributions
o-0.209
jo-0.201
ppy-0.201
os-0.194
azz-0.194
Token such
Token ID884
Feature activation+1.61e-01

Pos logit contributions
 bottom+0.088
 top+0.086
 chores+0.079
 biggest+0.070
 port+0.068
Neg logit contributions
jo-0.277
o-0.276
os-0.270
ppy-0.266
azz-0.263
Token a
Token ID257
Feature activation+8.71e-02

Pos logit contributions
 bottom+0.185
 top+0.167
 port+0.151
 shore+0.148
 avail+0.148
Neg logit contributions
o-0.441
ppy-0.427
jo-0.415
y-0.401
bug-0.401
Token special
Token ID2041
Feature activation+6.59e-02

Pos logit contributions
 anything+0.084
 top+0.076
 chores+0.074
wered+0.072
 bottom+0.072
Neg logit contributions
o-0.269
jo-0.265
azz-0.253
ppy-0.249
itty-0.246
Tokenplace
Token ID5372
Feature activation+3.10e-02

Pos logit contributions
 anything+0.073
 top+0.070
 bottom+0.066
 chores+0.064
 biggest+0.062
Neg logit contributions
jo-0.198
o-0.197
azz-0.184
os-0.183
ppy-0.182
Token.
Token ID13
Feature activation-4.07e-02

Pos logit contributions
 top+0.040
 bottom+0.039
 anything+0.037
 chores+0.034
 biggest+0.033
Neg logit contributions
o-0.091
jo-0.090
os-0.087
azz-0.087
ppy-0.086
Token
Token ID220
Feature activation-1.08e-02

Pos logit contributions
ppy+0.110
jo+0.110
os+0.106
o+0.106
azz+0.105
Neg logit contributions
 top-0.060
 something-0.057
 anything-0.057
 new-0.056
 many-0.054
TokenThank
Token ID10449
Feature activation+7.36e-02

Pos logit contributions
jo+0.006
ppy+0.006
os+0.006
azz+0.006
o+0.005
Neg logit contributions
 top-0.003
 something-0.003
 most-0.003
 anything-0.003
 middle-0.003
Token you
Token ID345
Feature activation-9.24e-03

Pos logit contributions
 anything+0.085
 top+0.084
 something+0.081
 new+0.079
 things+0.078
Neg logit contributions
jo-0.230
o-0.225
azz-0.223
os-0.219
ppy-0.217
Token Jack
Token ID3619
Feature activation-9.84e-02

Pos logit contributions
jo+0.025
o+0.024
azz+0.024
os+0.023
ppy+0.023
Neg logit contributions
 anything-0.014
 top-0.014
 new-0.014
 something-0.014
 bottom-0.013
Token for
Token ID329
Feature activation+7.47e-02

Pos logit contributions
jo+0.265
azz+0.261
o+0.255
os+0.254
ppy+0.246
Neg logit contributions
 anything-0.149
 top-0.146
 something-0.145
 new-0.145
 good-0.142
Token finding
Token ID4917
Feature activation+1.05e-01

Pos logit contributions
 biggest+0.116
 top+0.114
 bottom+0.114
 anything+0.109
 most+0.105
Neg logit contributions
o-0.197
ppy-0.177
os-0.173
jo-0.171
itty-0.166
Token such
Token ID884
Feature activation+1.61e-01

Pos logit contributions
 bottom+0.136
 chores+0.125
 top+0.118
 accomplishment+0.114
wered+0.111
Neg logit contributions
o-0.298
jo-0.280
os-0.279
ppy-0.274
azz-0.270
Token a
Token ID257
Feature activation+8.83e-02

Pos logit contributions
 bottom+0.228
 top+0.185
 middle+0.175
 shore+0.174
 avail+0.171
Neg logit contributions
o-0.395
ppy-0.364
bug-0.355
lets-0.353
jo-0.351
Token nice
Token ID3621
Feature activation+5.22e-02

Pos logit contributions
 anything+0.085
 top+0.071
 bottom+0.069
 yourselves+0.069
 chores+0.068
Neg logit contributions
o-0.283
jo-0.272
azz-0.262
ppy-0.257
itty-0.256
Token shelter
Token ID11772
Feature activation-3.99e-02

Pos logit contributions
 anything+0.059
 top+0.056
 bottom+0.051
 chores+0.051
 biggest+0.050
Neg logit contributions
o-0.161
jo-0.160
azz-0.153
os-0.150
ppy-0.150
Token."
Token ID526
Feature activation-4.03e-02

Pos logit contributions
jo+0.100
o+0.100
os+0.096
azz+0.095
ppy+0.094
Neg logit contributions
 top-0.068
 anything-0.068
 bottom-0.065
 new-0.064
 something-0.064
Token
Token ID220
Feature activation-1.48e-02

Pos logit contributions
jo+0.116
ppy+0.113
azz+0.110
os+0.110
o+0.107
Neg logit contributions
 top-0.052
 something-0.052
 new-0.052
 the-0.050
 many-0.048
Token of
Token ID286
Feature activation+8.10e-02

Pos logit contributions
 anything+0.105
 top+0.105
 new+0.101
 something+0.100
 bottom+0.098
Neg logit contributions
jo-0.166
o-0.164
azz-0.159
os-0.158
ppy-0.157
Token her
Token ID607
Feature activation+2.39e-02

Pos logit contributions
 top+0.102
 anything+0.100
 bottom+0.098
 biggest+0.097
 new+0.095
Neg logit contributions
jo-0.237
o-0.236
os-0.227
azz-0.227
ppy-0.226
Token for
Token ID329
Feature activation+7.97e-02

Pos logit contributions
 anything+0.036
 top+0.036
 new+0.034
 something+0.034
 bottom+0.033
Neg logit contributions
jo-0.066
o-0.065
azz-0.063
os-0.063
ppy-0.062
Token picking
Token ID10868
Feature activation+6.55e-02

Pos logit contributions
 top+0.110
 biggest+0.109
 bottom+0.107
 anything+0.100
 new+0.098
Neg logit contributions
o-0.217
jo-0.211
ppy-0.208
os-0.205
azz-0.202
Token out
Token ID503
Feature activation+5.32e-02

Pos logit contributions
 bottom+0.069
 top+0.067
 chores+0.066
 anything+0.065
 accomplishment+0.063
Neg logit contributions
o-0.197
os-0.189
jo-0.187
ppy-0.183
azz-0.181
Token such
Token ID884
Feature activation+1.60e-01

Pos logit contributions
 bottom+0.047
 top+0.045
 biggest+0.037
 anything+0.036
 shore+0.035
Neg logit contributions
jo-0.170
o-0.169
os-0.167
azz-0.163
ppy-0.163
Token a
Token ID257
Feature activation+8.71e-02

Pos logit contributions
 bottom+0.181
 top+0.148
 anything+0.147
 chores+0.138
 miles+0.136
Neg logit contributions
o-0.426
jo-0.411
ppy-0.397
y-0.395
bug-0.394
Token pretty
Token ID2495
Feature activation+6.26e-02

Pos logit contributions
 anything+0.074
 top+0.060
 chores+0.056
 bottom+0.056
 something+0.054
Neg logit contributions
o-0.291
jo-0.286
azz-0.274
itty-0.269
os-0.268
Token dress
Token ID6576
Feature activation-3.85e-02

Pos logit contributions
 anything+0.092
 top+0.087
 chores+0.085
 bottom+0.083
 something+0.081
Neg logit contributions
jo-0.169
o-0.167
azz-0.159
os-0.158
ppy-0.156
Token.
Token ID13
Feature activation-4.40e-02

Pos logit contributions
jo+0.100
o+0.099
azz+0.096
os+0.095
ppy+0.094
Neg logit contributions
 anything-0.063
 top-0.063
 new-0.061
 something-0.060
 things-0.058
Token Lily
Token ID20037
Feature activation-9.83e-02

Pos logit contributions
ppy+0.127
jo+0.121
os+0.120
azz+0.117
o+0.117
Neg logit contributions
 top-0.061
 new-0.059
 something-0.057
 many-0.056
 the-0.052
Token so
Token ID523
Feature activation+7.14e-02

Pos logit contributions
jo+0.029
ppy+0.028
azz+0.028
o+0.027
os+0.027
Neg logit contributions
 top-0.016
 new-0.015
 many-0.015
 something-0.014
 great-0.014
Token glad
Token ID9675
Feature activation+5.50e-02

Pos logit contributions
 anything+0.101
 top+0.094
 chores+0.093
 bottom+0.092
 places+0.090
Neg logit contributions
o-0.194
jo-0.190
os-0.180
anne-0.178
ppy-0.174
Token I
Token ID314
Feature activation+2.96e-02

Pos logit contributions
 anything+0.085
 top+0.085
 new+0.080
 bottom+0.080
 something+0.080
Neg logit contributions
jo-0.147
o-0.146
azz-0.141
os-0.140
ppy-0.139
Token could
Token ID714
Feature activation+7.76e-02

Pos logit contributions
 anything+0.065
 top+0.064
 something+0.064
 great+0.063
 good+0.062
Neg logit contributions
jo-0.060
o-0.057
azz-0.057
os-0.055
ppy-0.054
Token deliver
Token ID5203
Feature activation+8.03e-02

Pos logit contributions
 anything+0.119
 top+0.115
 new+0.111
 something+0.110
 bottom+0.110
Neg logit contributions
jo-0.203
o-0.203
ppy-0.192
os-0.192
azz-0.191
Token such
Token ID884
Feature activation+1.60e-01

Pos logit contributions
 bottom+0.083
 top+0.083
 anything+0.078
 chores+0.076
 biggest+0.073
Neg logit contributions
o-0.248
jo-0.247
os-0.241
azz-0.235
ppy-0.233
Token joy
Token ID8716
Feature activation+2.63e-03

Pos logit contributions
 bottom+0.162
 top+0.138
 anything+0.124
 chores+0.113
 middle+0.113
Neg logit contributions
jo-0.487
o-0.481
ppy-0.459
os-0.452
azz-0.452
Token to
Token ID284
Feature activation+9.05e-02

Pos logit contributions
 anything+0.005
 top+0.004
 new+0.004
 something+0.004
 things+0.004
Neg logit contributions
jo-0.007
o-0.007
azz-0.006
os-0.006
ppy-0.006
Token you
Token ID345
Feature activation+4.76e-03

Pos logit contributions
 top+0.110
 bottom+0.106
 anything+0.103
 biggest+0.102
 chores+0.100
Neg logit contributions
jo-0.264
o-0.261
os-0.252
ppy-0.251
azz-0.251
Token,
Token ID11
Feature activation+9.59e-03

Pos logit contributions
 top+0.009
 anything+0.009
 bottom+0.008
 new+0.008
 chores+0.008
Neg logit contributions
jo-0.011
o-0.011
ppy-0.011
azz-0.011
os-0.011
Token my
Token ID616
Feature activation+4.90e-02

Pos logit contributions
 anything+0.015
 top+0.015
 something+0.014
 new+0.014
 bottom+0.014
Neg logit contributions
jo-0.025
o-0.025
azz-0.024
os-0.024
ppy-0.024
Token thanked
Token ID26280
Feature activation+7.84e-02

Pos logit contributions
 anything+0.023
 top+0.023
 something+0.022
 new+0.022
 bottom+0.022
Neg logit contributions
jo-0.020
o-0.020
azz-0.019
os-0.019
ppy-0.019
Token his
Token ID465
Feature activation+3.78e-02

Pos logit contributions
 top+0.071
 bottom+0.071
 biggest+0.069
 anything+0.067
 chores+0.066
Neg logit contributions
jo-0.253
o-0.248
os-0.239
azz-0.238
ppy-0.237
Token Uncle
Token ID23169
Feature activation-1.05e-01

Pos logit contributions
 anything+0.055
 top+0.050
 bottom+0.049
 chores+0.048
 something+0.047
Neg logit contributions
jo-0.106
o-0.104
azz-0.100
ppy-0.097
os-0.097
Token for
Token ID329
Feature activation+7.35e-02

Pos logit contributions
jo+0.269
azz+0.266
os+0.262
o+0.258
ppy+0.252
Neg logit contributions
 anything-0.169
 something-0.168
 new-0.166
 top-0.163
 things-0.155
Token recommending
Token ID34639
Feature activation+6.85e-02

Pos logit contributions
 top+0.082
 biggest+0.082
 bottom+0.082
 anything+0.075
 most+0.071
Neg logit contributions
o-0.221
jo-0.213
ppy-0.205
os-0.205
azz-0.197
Token such
Token ID884
Feature activation+1.60e-01

Pos logit contributions
 top+0.064
 bottom+0.063
 anything+0.061
 biggest+0.057
 chores+0.057
Neg logit contributions
o-0.218
jo-0.218
os-0.210
azz-0.208
ppy-0.205
Token a
Token ID257
Feature activation+8.03e-02

Pos logit contributions
 bottom+0.185
 top+0.159
 anything+0.150
 middle+0.133
 shore+0.132
Neg logit contributions
o-0.460
jo-0.447
ppy-0.418
os-0.404
azz-0.401
Token great
Token ID1049
Feature activation+7.88e-02

Pos logit contributions
 anything+0.078
 top+0.068
 bottom+0.063
 yourselves+0.062
 chores+0.062
Neg logit contributions
o-0.260
jo-0.259
azz-0.246
ppy-0.241
os-0.240
Token toy
Token ID13373
Feature activation-2.89e-02

Pos logit contributions
 anything+0.098
 top+0.093
 bottom+0.087
 chores+0.085
 something+0.084
Neg logit contributions
jo-0.235
o-0.232
azz-0.220
os-0.216
ppy-0.215
Token.
Token ID13
Feature activation-3.80e-02

Pos logit contributions
jo+0.074
o+0.073
azz+0.072
os+0.071
ppy+0.070
Neg logit contributions
 anything-0.048
 top-0.048
 new-0.046
 something-0.046
 things-0.045
Token They
Token ID1119
Feature activation-2.96e-03

Pos logit contributions
ppy+0.103
jo+0.100
os+0.099
azz+0.098
o+0.097
Neg logit contributions
 top-0.056
 something-0.053
 new-0.051
 anything-0.051
 many-0.050
Token,
Token ID11
Feature activation+4.07e-03

Pos logit contributions
jo+0.023
o+0.022
azz+0.022
os+0.021
ppy+0.021
Neg logit contributions
 anything-0.015
 top-0.014
 something-0.014
 new-0.014
 chores-0.014
Token because
Token ID780
Feature activation+5.82e-03

Pos logit contributions
 anything+0.007
 top+0.007
 new+0.006
 bottom+0.006
 something+0.006
Neg logit contributions
jo-0.010
o-0.010
azz-0.010
os-0.010
ppy-0.010
Token you
Token ID345
Feature activation+9.73e-04

Pos logit contributions
 bottom+0.009
 top+0.008
 chores+0.008
 anything+0.008
 biggest+0.008
Neg logit contributions
o-0.015
os-0.014
jo-0.014
ppy-0.014
azz-0.014
Token have
Token ID423
Feature activation+8.14e-02

Pos logit contributions
 anything+0.002
 top+0.002
 something+0.002
 new+0.002
 bottom+0.002
Neg logit contributions
jo-0.002
o-0.002
azz-0.002
os-0.002
ppy-0.002
Token been
Token ID587
Feature activation+8.85e-02

Pos logit contributions
 bottom+0.117
 top+0.117
 chores+0.109
 anything+0.107
 biggest+0.104
Neg logit contributions
o-0.223
jo-0.212
ppy-0.208
os-0.207
azz-0.202
Token such
Token ID884
Feature activation+1.59e-01

Pos logit contributions
 top+0.123
 bottom+0.122
 chores+0.110
 anything+0.107
 sand+0.103
Neg logit contributions
o-0.258
ppy-0.241
jo-0.236
os-0.233
azz-0.226
Token a
Token ID257
Feature activation+8.40e-02

Pos logit contributions
 bottom+0.237
 top+0.203
 shore+0.190
 port+0.184
 beach+0.178
Neg logit contributions
ppy-0.397
o-0.388
bug-0.359
ne-0.345
lets-0.344
Token good
Token ID922
Feature activation+5.82e-02

Pos logit contributions
 anything+0.068
 top+0.062
 bottom+0.060
 chores+0.059
 something+0.054
Neg logit contributions
o-0.280
jo-0.273
azz-0.262
ppy-0.261
itty-0.255
Token girl
Token ID2576
Feature activation-8.23e-02

Pos logit contributions
 anything+0.086
 top+0.084
 bottom+0.079
 something+0.079
 new+0.079
Neg logit contributions
o-0.158
jo-0.157
azz-0.150
ppy-0.149
os-0.148
Token.
Token ID13
Feature activation-6.29e-02

Pos logit contributions
jo+0.229
azz+0.224
os+0.218
o+0.218
ppy+0.210
Neg logit contributions
 anything-0.123
 something-0.120
 things-0.116
 good-0.114
 new-0.114
Token Now
Token ID2735
Feature activation-8.80e-03

Pos logit contributions
jo+0.159
os+0.150
o+0.148
ppy+0.147
azz+0.144
Neg logit contributions
 something-0.105
 anything-0.105
 top-0.103
 most-0.100
 good-0.099
Token proud
Token ID6613
Feature activation+6.02e-02

Pos logit contributions
 things+0.106
 top+0.104
 places+0.103
 anything+0.103
 bottom+0.099
Neg logit contributions
o-0.207
bug-0.181
os-0.180
jo-0.180
ppy-0.176
Token of
Token ID286
Feature activation+7.56e-02

Pos logit contributions
 anything+0.101
 top+0.100
 new+0.096
 something+0.096
 bottom+0.094
Neg logit contributions
jo-0.155
o-0.153
azz-0.148
os-0.147
ppy-0.146
Token him
Token ID683
Feature activation-2.93e-02

Pos logit contributions
 top+0.115
 biggest+0.113
 bottom+0.111
 anything+0.109
 most+0.107
Neg logit contributions
jo-0.203
o-0.198
azz-0.193
ppy-0.190
os-0.190
Token for
Token ID329
Feature activation+7.30e-02

Pos logit contributions
jo+0.081
o+0.080
azz+0.079
os+0.078
ppy+0.076
Neg logit contributions
 anything-0.044
 new-0.043
 something-0.043
 top-0.042
 things-0.041
Token being
Token ID852
Feature activation+8.82e-02

Pos logit contributions
 biggest+0.121
 top+0.118
 bottom+0.117
 middle+0.111
 beauty+0.106
Neg logit contributions
o-0.177
ppy-0.165
jo-0.165
os-0.158
azz-0.152
Token such
Token ID884
Feature activation+1.58e-01

Pos logit contributions
 top+0.125
 bottom+0.121
 chores+0.110
 anything+0.110
 sand+0.103
Neg logit contributions
o-0.261
jo-0.243
ppy-0.242
os-0.241
azz-0.234
Token a
Token ID257
Feature activation+8.30e-02

Pos logit contributions
 bottom+0.210
 top+0.183
 shore+0.175
 middle+0.171
 beach+0.168
Neg logit contributions
o-0.414
ppy-0.401
bug-0.367
lets-0.356
y-0.353
Token good
Token ID922
Feature activation+6.18e-02

Pos logit contributions
 anything+0.074
 top+0.065
 bottom+0.063
 yourselves+0.061
 chores+0.059
Neg logit contributions
o-0.271
jo-0.261
azz-0.252
ppy-0.250
itty-0.250
Token dog
Token ID3290
Feature activation-1.03e-01

Pos logit contributions
 anything+0.099
 top+0.098
 something+0.092
 new+0.092
 bottom+0.092
Neg logit contributions
jo-0.163
o-0.162
azz-0.156
os-0.154
ppy-0.153
Token and
Token ID290
Feature activation+2.26e-02

Pos logit contributions
jo+0.280
azz+0.269
o+0.266
os+0.264
ppy+0.260
Neg logit contributions
 anything-0.161
 something-0.155
 new-0.152
 top-0.149
 things-0.148
Token taking
Token ID2263
Feature activation+8.40e-02

Pos logit contributions
 top+0.036
 bottom+0.036
 anything+0.033
 middle+0.030
 chores+0.030
Neg logit contributions
o-0.062
jo-0.058
os-0.056
azz-0.055
ppy-0.054
Token a
Token ID257
Feature activation+5.36e-02

Pos logit contributions
 anything+0.008
 top+0.008
 bottom+0.008
 biggest+0.007
 sand+0.006
Neg logit contributions
o-0.034
jo-0.033
os-0.033
azz-0.032
ppy-0.032
Token big
Token ID1263
Feature activation+4.13e-02

Pos logit contributions
 anything+0.050
 top+0.049
 something+0.045
 bottom+0.044
 new+0.044
Neg logit contributions
jo-0.176
o-0.175
azz-0.171
os-0.169
ppy-0.168
Token hug
Token ID16225
Feature activation-1.05e-02

Pos logit contributions
 top+0.082
 anything+0.081
 new+0.079
 something+0.079
 bottom+0.077
Neg logit contributions
jo-0.093
o-0.092
azz-0.089
os-0.088
ppy-0.087
Token for
Token ID329
Feature activation+5.38e-02

Pos logit contributions
jo+0.029
o+0.028
azz+0.028
os+0.027
ppy+0.027
Neg logit contributions
 anything-0.016
 top-0.016
 something-0.016
 new-0.015
 things-0.015
Token being
Token ID852
Feature activation+9.13e-02

Pos logit contributions
 biggest+0.066
 top+0.065
 bottom+0.064
 anything+0.062
 most+0.059
Neg logit contributions
jo-0.151
o-0.149
ppy-0.148
azz-0.144
os-0.144
Token such
Token ID884
Feature activation+1.58e-01

Pos logit contributions
 top+0.117
 bottom+0.114
 chores+0.104
 anything+0.103
 sand+0.095
Neg logit contributions
o-0.276
jo-0.268
os-0.261
ppy-0.260
azz-0.257
Token a
Token ID257
Feature activation+8.35e-02

Pos logit contributions
 bottom+0.216
 shore+0.190
 top+0.183
 port+0.183
 edge+0.170
Neg logit contributions
ppy-0.381
o-0.377
bug-0.348
jo-0.344
lets-0.341
Token helpful
Token ID7613
Feature activation+3.15e-02

Pos logit contributions
 anything+0.065
 top+0.054
 chores+0.053
 bottom+0.052
 something+0.050
Neg logit contributions
o-0.280
jo-0.279
azz-0.270
ppy-0.265
itty-0.260
Token,
Token ID11
Feature activation+1.51e-02

Pos logit contributions
 top+0.050
 anything+0.050
 new+0.048
 something+0.048
 bottom+0.047
Neg logit contributions
jo-0.083
o-0.082
azz-0.080
os-0.079
ppy-0.078
Token organized
Token ID8389
Feature activation+3.07e-02

Pos logit contributions
 top+0.017
 anything+0.017
 bottom+0.016
 chores+0.015
 things+0.014
Neg logit contributions
jo-0.046
o-0.046
azz-0.045
os-0.044
ppy-0.044
Token little
Token ID1310
Feature activation+1.40e-02

Pos logit contributions
 anything+0.044
 top+0.042
 bottom+0.041
 biggest+0.040
 something+0.038
Neg logit contributions
jo-0.084
o-0.081
azz-0.078
os-0.078
ppy-0.076
Token says
Token ID1139
Feature activation-1.45e-02

Pos logit contributions
jo+0.028
o+0.028
azz+0.027
os+0.027
ppy+0.026
Neg logit contributions
 anything-0.040
 top-0.040
 something-0.039
 bottom-0.039
 new-0.038
Token,
Token ID11
Feature activation-3.80e-02

Pos logit contributions
jo+0.042
o+0.041
azz+0.040
ppy+0.039
os+0.039
Neg logit contributions
 anything-0.021
 top-0.020
 new-0.020
 something-0.020
 good-0.019
Token "
Token ID366
Feature activation-2.02e-02

Pos logit contributions
jo+0.118
ppy+0.111
azz+0.110
o+0.110
os+0.110
Neg logit contributions
 new-0.049
 something-0.047
 top-0.047
 anything-0.046
 good-0.046
TokenYou
Token ID1639
Feature activation-3.31e-03

Pos logit contributions
jo+0.060
os+0.056
ppy+0.056
azz+0.055
o+0.055
Neg logit contributions
 something-0.026
 things-0.026
 anything-0.025
 most-0.025
 good-0.024
Token are
Token ID389
Feature activation+6.58e-02

Pos logit contributions
o+0.008
jo+0.008
azz+0.008
os+0.008
ppy+0.008
Neg logit contributions
 top-0.006
 anything-0.006
 new-0.005
 bottom-0.005
 something-0.005
Token such
Token ID884
Feature activation+1.58e-01

Pos logit contributions
 top+0.084
 bottom+0.082
 places+0.075
 miles+0.075
 accomplishment+0.072
Neg logit contributions
o-0.199
ppy-0.182
bug-0.169
os-0.169
azz-0.165
Token good
Token ID922
Feature activation+5.77e-02

Pos logit contributions
 bottom+0.185
 top+0.166
 miles+0.153
 anything+0.146
 places+0.146
Neg logit contributions
o-0.465
ppy-0.451
azz-0.418
jo-0.406
itty-0.398
Token twins
Token ID20345
Feature activation-6.10e-02

Pos logit contributions
 anything+0.061
 top+0.060
 new+0.057
 something+0.056
 things+0.055
Neg logit contributions
jo-0.186
o-0.183
azz-0.180
os-0.179
ppy-0.177
Token.
Token ID13
Feature activation-5.06e-02

Pos logit contributions
jo+0.172
azz+0.165
o+0.164
os+0.161
ppy+0.157
Neg logit contributions
 anything-0.093
 something-0.089
 good-0.088
 new-0.088
 top-0.087
Token You
Token ID921
Feature activation-8.72e-03

Pos logit contributions
jo+0.124
o+0.118
azz+0.118
ppy+0.116
os+0.115
Neg logit contributions
 anything-0.093
 something-0.089
 top-0.087
 things-0.086
 good-0.086
Token are
Token ID389
Feature activation+6.36e-02

Pos logit contributions
o+0.020
jo+0.020
azz+0.020
os+0.019
ppy+0.019
Neg logit contributions
 top-0.016
 anything-0.016
 new-0.016
 bottom-0.016
 something-0.016
Token home
Token ID1363
Feature activation-3.56e-02

Pos logit contributions
 anything+0.018
 top+0.018
 something+0.017
 new+0.017
 chores+0.017
Neg logit contributions
jo-0.032
o-0.032
azz-0.031
os-0.031
ppy-0.030
Token because
Token ID780
Feature activation+4.12e-03

Pos logit contributions
jo+0.089
o+0.088
os+0.085
azz+0.083
ppy+0.082
Neg logit contributions
 top-0.061
 anything-0.061
 chores-0.058
 bottom-0.058
 something-0.057
Token it
Token ID340
Feature activation-3.72e-02

Pos logit contributions
 bottom+0.007
 top+0.007
 chores+0.006
 biggest+0.006
 anything+0.006
Neg logit contributions
jo-0.010
o-0.010
os-0.009
azz-0.009
y-0.008
Token had
Token ID550
Feature activation+7.81e-02

Pos logit contributions
jo+0.085
o+0.084
azz+0.081
os+0.080
ppy+0.079
Neg logit contributions
 anything-0.073
 top-0.072
 new-0.070
 something-0.070
 bottom-0.069
Token been
Token ID587
Feature activation+8.64e-02

Pos logit contributions
 top+0.115
 chores+0.111
 bottom+0.110
 anything+0.107
 things+0.102
Neg logit contributions
o-0.206
jo-0.204
azz-0.200
os-0.197
ppy-0.191
Token such
Token ID884
Feature activation+1.58e-01

Pos logit contributions
 top+0.110
 bottom+0.103
 chores+0.102
 anything+0.098
 things+0.095
Neg logit contributions
o-0.259
jo-0.253
os-0.248
azz-0.245
ppy-0.244
Token a
Token ID257
Feature activation+8.12e-02

Pos logit contributions
 bottom+0.225
 top+0.204
 avail+0.192
 shore+0.192
 chores+0.190
Neg logit contributions
o-0.363
ppy-0.348
jo-0.334
y-0.329
red-0.316
Token lovely
Token ID14081
Feature activation+5.48e-02

Pos logit contributions
 anything+0.074
 chores+0.067
 yourselves+0.066
 top+0.062
wered+0.058
Neg logit contributions
o-0.254
jo-0.254
azz-0.242
anne-0.233
ppy-0.233
Token night
Token ID1755
Feature activation-1.45e-02

Pos logit contributions
 anything+0.070
 top+0.066
 chores+0.063
 something+0.062
 bottom+0.061
Neg logit contributions
jo-0.163
o-0.158
azz-0.154
os-0.150
ppy-0.148
Token.
Token ID13
Feature activation-5.38e-02

Pos logit contributions
jo+0.036
o+0.035
azz+0.034
os+0.034
ppy+0.033
Neg logit contributions
 top-0.025
 anything-0.024
 chores-0.024
 new-0.023
 bottom-0.023
Token<|endoftext|>
Token ID50256
Feature activation-4.92e-02

Pos logit contributions
jo+0.136
o+0.134
azz+0.131
ppy+0.131
os+0.130
Neg logit contributions
 top-0.092
 anything-0.091
 new-0.088
 something-0.088
 bottom-0.085
Token her
Token ID607
Feature activation+5.58e-02

Pos logit contributions
o+0.180
jo+0.180
azz+0.173
os+0.172
ppy+0.170
Neg logit contributions
 anything-0.120
 top-0.117
 something-0.114
 new-0.113
 chores-0.113
Token parents
Token ID3397
Feature activation-6.93e-02

Pos logit contributions
 anything+0.079
 top+0.076
 something+0.072
 bottom+0.072
 chores+0.070
Neg logit contributions
jo-0.154
o-0.152
ppy-0.145
azz-0.145
os-0.144
Token for
Token ID329
Feature activation+8.39e-02

Pos logit contributions
jo+0.175
azz+0.171
o+0.170
os+0.164
ppy+0.163
Neg logit contributions
 anything-0.119
 top-0.118
 something-0.115
 new-0.114
 bottom-0.112
Token showing
Token ID4478
Feature activation+5.86e-02

Pos logit contributions
 biggest+0.101
 top+0.100
 bottom+0.100
 anything+0.090
 most+0.088
Neg logit contributions
o-0.244
ppy-0.233
jo-0.231
os-0.225
azz-0.217
Token her
Token ID607
Feature activation+2.74e-02

Pos logit contributions
 top+0.062
 anything+0.061
 bottom+0.057
 new+0.056
 something+0.055
Neg logit contributions
jo-0.185
o-0.182
azz-0.178
os-0.177
ppy-0.175
Token such
Token ID884
Feature activation+1.57e-01

Pos logit contributions
 top+0.021
 bottom+0.020
 anything+0.019
 biggest+0.018
 miles+0.017
Neg logit contributions
jo-0.091
o-0.091
os-0.089
azz-0.087
ppy-0.085
Token an
Token ID281
Feature activation+8.84e-02

Pos logit contributions
 bottom+0.157
 top+0.139
 anything+0.135
 middle+0.122
 biggest+0.119
Neg logit contributions
jo-0.464
o-0.461
ppy-0.442
azz-0.439
anne-0.431
Token amazing
Token ID4998
Feature activation+6.32e-02

Pos logit contributions
 anything+0.089
 top+0.078
 something+0.072
 bottom+0.072
 chores+0.065
Neg logit contributions
jo-0.292
o-0.279
ppy-0.268
azz-0.268
os-0.268
Token sight
Token ID6504
Feature activation-1.43e-02

Pos logit contributions
 anything+0.075
 top+0.071
 bottom+0.067
 chores+0.065
 biggest+0.064
Neg logit contributions
jo-0.191
o-0.184
azz-0.177
os-0.176
anne-0.173
Token.
Token ID13
Feature activation-3.38e-02

Pos logit contributions
jo+0.037
o+0.036
azz+0.035
ppy+0.035
os+0.035
Neg logit contributions
 anything-0.024
 top-0.023
 something-0.023
 new-0.023
 things-0.022
Token From
Token ID3574
Feature activation+3.59e-02

Pos logit contributions
ppy+0.089
jo+0.089
o+0.087
os+0.087
azz+0.085
Neg logit contributions
 top-0.052
 something-0.049
 new-0.049
 anything-0.048
 many-0.048
Token a
Token ID257
Feature activation+5.16e-02

Pos logit contributions
jo+0.480
o+0.479
azz+0.466
os+0.464
ppy+0.460
Neg logit contributions
 anything-0.093
 top-0.092
 bottom-0.081
 new-0.080
 something-0.079
Token big
Token ID1263
Feature activation+3.94e-02

Pos logit contributions
 anything+0.049
 top+0.047
 bottom+0.043
 something+0.043
 chores+0.042
Neg logit contributions
jo-0.169
o-0.169
azz-0.164
os-0.162
ppy-0.161
Token bone
Token ID9970
Feature activation-7.96e-02

Pos logit contributions
 anything+0.077
 top+0.077
 new+0.073
 something+0.073
 bottom+0.073
Neg logit contributions
jo-0.090
o-0.089
azz-0.086
os-0.085
ppy-0.084
Token for
Token ID329
Feature activation+4.82e-02

Pos logit contributions
jo+0.205
o+0.199
azz+0.197
os+0.193
ppy+0.192
Neg logit contributions
 top-0.133
 anything-0.131
 new-0.129
 something-0.128
 bottom-0.124
Token being
Token ID852
Feature activation+9.02e-02

Pos logit contributions
 biggest+0.049
 top+0.048
 anything+0.048
 bottom+0.047
 most+0.044
Neg logit contributions
jo-0.148
o-0.147
os-0.142
ppy-0.142
azz-0.140
Token such
Token ID884
Feature activation+1.57e-01

Pos logit contributions
 top+0.121
 bottom+0.118
 chores+0.110
 anything+0.109
 biggest+0.099
Neg logit contributions
o-0.271
jo-0.259
ppy-0.254
os-0.254
azz-0.247
Token a
Token ID257
Feature activation+8.13e-02

Pos logit contributions
 bottom+0.220
 top+0.191
 avail+0.184
 shore+0.178
 beach+0.172
Neg logit contributions
o-0.412
ppy-0.394
jo-0.355
lets-0.354
y-0.350
Token good
Token ID922
Feature activation+6.27e-02

Pos logit contributions
 anything+0.076
 top+0.068
 bottom+0.066
 chores+0.064
 yourselves+0.061
Neg logit contributions
o-0.260
jo-0.255
azz-0.246
ppy-0.244
itty-0.237
Token protector
Token ID36784
Feature activation-1.73e-02

Pos logit contributions
 anything+0.107
 top+0.104
 bottom+0.099
 something+0.098
 chores+0.097
Neg logit contributions
o-0.158
jo-0.158
azz-0.152
ppy-0.149
os-0.146
Token.
Token ID13
Feature activation-5.06e-02

Pos logit contributions
jo+0.044
o+0.043
azz+0.042
os+0.042
ppy+0.041
Neg logit contributions
 anything-0.030
 top-0.029
 something-0.028
 new-0.028
 bottom-0.027
Token
Token ID198
Feature activation+2.85e-03

Pos logit contributions
ppy+0.135
jo+0.134
o+0.130
azz+0.129
os+0.127
Neg logit contributions
 top-0.079
 anything-0.073
 something-0.073
 new-0.072
 many-0.072
Token proud
Token ID6613
Feature activation+6.22e-02

Pos logit contributions
 top+0.106
 anything+0.104
 bottom+0.102
 things+0.102
 chores+0.100
Neg logit contributions
o-0.207
jo-0.183
os-0.178
ppy-0.176
azz-0.173
Token of
Token ID286
Feature activation+7.06e-02

Pos logit contributions
 anything+0.101
 top+0.100
 new+0.097
 something+0.096
 bottom+0.094
Neg logit contributions
jo-0.163
o-0.161
azz-0.157
os-0.155
ppy-0.154
Token him
Token ID683
Feature activation-3.26e-02

Pos logit contributions
 top+0.106
 biggest+0.104
 anything+0.102
 bottom+0.102
 new+0.099
Neg logit contributions
o-0.191
jo-0.189
os-0.181
azz-0.180
ppy-0.178
Token for
Token ID329
Feature activation+6.99e-02

Pos logit contributions
jo+0.091
azz+0.090
o+0.089
os+0.088
ppy+0.086
Neg logit contributions
 new-0.047
 anything-0.047
 something-0.046
 good-0.045
 things-0.045
Token being
Token ID852
Feature activation+8.82e-02

Pos logit contributions
 top+0.113
 biggest+0.112
 bottom+0.112
 middle+0.104
 beauty+0.101
Neg logit contributions
o-0.176
ppy-0.162
jo-0.161
os-0.156
azz-0.153
Token such
Token ID884
Feature activation+1.57e-01

Pos logit contributions
 top+0.118
 bottom+0.113
 anything+0.105
 chores+0.104
 sand+0.096
Neg logit contributions
o-0.269
jo-0.254
os-0.249
ppy-0.248
azz-0.247
Token a
Token ID257
Feature activation+8.11e-02

Pos logit contributions
 bottom+0.214
 top+0.188
 middle+0.168
 beach+0.168
 shore+0.168
Neg logit contributions
o-0.428
ppy-0.408
azz-0.373
jo-0.368
lets-0.366
Token good
Token ID922
Feature activation+6.18e-02

Pos logit contributions
 anything+0.078
 top+0.067
 bottom+0.065
wered+0.061
 yourselves+0.061
Neg logit contributions
o-0.261
jo-0.253
azz-0.244
ppy-0.243
itty-0.239
Token listener
Token ID24783
Feature activation-6.65e-02

Pos logit contributions
 anything+0.107
 top+0.106
 bottom+0.100
 something+0.099
 new+0.098
Neg logit contributions
jo-0.152
o-0.151
azz-0.145
ppy-0.144
os-0.140
Token.
Token ID13
Feature activation-4.45e-02

Pos logit contributions
jo+0.175
o+0.173
azz+0.172
os+0.168
ppy+0.166
Neg logit contributions
 anything-0.104
 top-0.101
 something-0.099
 things-0.097
 new-0.096
Token
Token ID220
Feature activation-9.47e-03

Pos logit contributions
ppy+0.131
jo+0.125
azz+0.121
os+0.120
ring+0.118
Neg logit contributions
 top-0.060
 new-0.060
 many-0.057
 something-0.055
 the-0.052
Token thank
Token ID5875
Feature activation+6.19e-02

Pos logit contributions
 anything+0.020
 top+0.020
 new+0.020
 something+0.020
 chores+0.019
Neg logit contributions
jo-0.030
o-0.029
azz-0.029
os-0.028
ppy-0.028
Token you
Token ID345
Feature activation+2.31e-04

Pos logit contributions
 anything+0.074
 top+0.073
 something+0.069
 new+0.068
 chores+0.066
Neg logit contributions
jo-0.191
o-0.187
azz-0.184
os-0.182
ppy-0.180
Token for
Token ID329
Feature activation+7.15e-02

Pos logit contributions
 anything+0.000
 top+0.000
 bottom+0.000
 new+0.000
 chores+0.000
Neg logit contributions
jo-0.001
o-0.001
azz-0.001
ppy-0.001
os-0.001
Token giving
Token ID3501
Feature activation+8.58e-02

Pos logit contributions
 biggest+0.113
 bottom+0.111
 top+0.108
 most+0.101
 anything+0.098
Neg logit contributions
o-0.186
ppy-0.166
jo-0.160
os-0.159
itty-0.149
Token me
Token ID502
Feature activation-5.42e-03

Pos logit contributions
 top+0.108
 bottom+0.106
 anything+0.105
 chores+0.101
 new+0.099
Neg logit contributions
o-0.249
jo-0.246
os-0.238
ppy-0.237
azz-0.235
Token such
Token ID884
Feature activation+1.57e-01

Pos logit contributions
o+0.019
jo+0.018
os+0.018
ppy+0.018
azz+0.018
Neg logit contributions
 top-0.004
 bottom-0.004
 anything-0.004
 trouble-0.003
 new-0.003
Token good
Token ID922
Feature activation+6.84e-02

Pos logit contributions
 bottom+0.201
 top+0.165
 shore+0.164
 places+0.160
 miles+0.158
Neg logit contributions
o-0.409
ppy-0.396
jo-0.385
bug-0.360
anne-0.356
Token advice
Token ID5608
Feature activation+2.56e-02

Pos logit contributions
 top+0.093
 anything+0.090
 bottom+0.087
 chores+0.084
 new+0.083
Neg logit contributions
o-0.190
jo-0.188
ppy-0.181
os-0.178
azz-0.177
Token!
Token ID0
Feature activation-4.18e-02

Pos logit contributions
 anything+0.046
 top+0.046
 new+0.044
 bottom+0.044
 something+0.044
Neg logit contributions
jo-0.062
o-0.061
os-0.059
azz-0.059
ppy-0.058
Tokenâ
Token ID22940
Feature activation-4.19e-02

Pos logit contributions
jo+0.119
ppy+0.117
os+0.113
azz+0.112
o+0.110
Neg logit contributions
 top-0.054
 something-0.053
 anything-0.051
 new-0.051
 many-0.050
Token
Token ID26391
Feature activation-8.26e-02

Pos logit contributions
azz+0.097
os+0.096
jo+0.093
o+0.093
ppy+0.089
Neg logit contributions
 top-0.077
 most-0.074
 great-0.072
 bottom-0.071
 good-0.071
Token good
Token ID922
Feature activation+5.77e-02

Pos logit contributions
 bottom+0.185
 top+0.166
 miles+0.153
 anything+0.146
 places+0.146
Neg logit contributions
o-0.465
ppy-0.451
azz-0.418
jo-0.406
itty-0.398
Token twins
Token ID20345
Feature activation-6.10e-02

Pos logit contributions
 anything+0.061
 top+0.060
 new+0.057
 something+0.056
 things+0.055
Neg logit contributions
jo-0.186
o-0.183
azz-0.180
os-0.179
ppy-0.177
Token.
Token ID13
Feature activation-5.06e-02

Pos logit contributions
jo+0.172
azz+0.165
o+0.164
os+0.161
ppy+0.157
Neg logit contributions
 anything-0.093
 something-0.089
 good-0.088
 new-0.088
 top-0.087
Token You
Token ID921
Feature activation-8.72e-03

Pos logit contributions
jo+0.124
o+0.118
azz+0.118
ppy+0.116
os+0.115
Neg logit contributions
 anything-0.093
 something-0.089
 top-0.087
 things-0.086
 good-0.086
Token are
Token ID389
Feature activation+6.36e-02

Pos logit contributions
o+0.020
jo+0.020
azz+0.020
os+0.019
ppy+0.019
Neg logit contributions
 top-0.016
 anything-0.016
 new-0.016
 bottom-0.016
 something-0.016
Token such
Token ID884
Feature activation+1.57e-01

Pos logit contributions
 bottom+0.086
 top+0.085
 places+0.073
 miles+0.072
 chores+0.071
Neg logit contributions
o-0.190
ppy-0.173
bug-0.165
os-0.157
itty-0.154
Token good
Token ID922
Feature activation+5.66e-02

Pos logit contributions
 bottom+0.195
 top+0.176
 miles+0.159
 places+0.154
 middle+0.151
Neg logit contributions
o-0.453
ppy-0.442
azz-0.402
bug-0.389
jo-0.388
Token siblings
Token ID20569
Feature activation-5.18e-02

Pos logit contributions
 anything+0.061
 top+0.061
 new+0.056
 something+0.056
 bottom+0.056
Neg logit contributions
jo-0.179
o-0.178
azz-0.173
os-0.172
ppy-0.171
Token.
Token ID13
Feature activation-5.23e-02

Pos logit contributions
jo+0.122
o+0.122
os+0.116
azz+0.116
ppy+0.116
Neg logit contributions
 top-0.096
 anything-0.095
 bottom-0.092
 new-0.091
 chores-0.090
Token You
Token ID921
Feature activation-1.32e-02

Pos logit contributions
jo+0.125
o+0.119
azz+0.118
os+0.116
ppy+0.115
Neg logit contributions
 anything-0.100
 something-0.095
 top-0.094
 good-0.092
 things-0.092
Token support
Token ID1104
Feature activation+2.46e-02

Pos logit contributions
jo+0.031
o+0.031
azz+0.030
os+0.030
ppy+0.030
Neg logit contributions
 top-0.024
 anything-0.024
 new-0.023
 something-0.023
 bottom-0.023
Token to
Token ID284
Feature activation+9.02e-02

Pos logit contributions
 anything+0.072
 top+0.071
 new+0.069
 something+0.069
 things+0.067
Neg logit contributions
jo-0.109
o-0.107
azz-0.105
os-0.103
ppy-0.103
Token be
Token ID307
Feature activation+8.49e-02

Pos logit contributions
 top+0.143
 anything+0.142
 new+0.137
 something+0.135
 bottom+0.134
Neg logit contributions
o-0.234
jo-0.234
azz-0.226
os-0.224
ppy-0.223
Token able
Token ID1498
Feature activation+8.74e-02

Pos logit contributions
 top+0.117
 bottom+0.112
 anything+0.112
 chores+0.108
 new+0.103
Neg logit contributions
o-0.247
jo-0.242
os-0.234
azz-0.232
ppy-0.232
Token to
Token ID284
Feature activation+8.74e-02

Pos logit contributions
 top+0.175
 anything+0.172
 bottom+0.169
 chores+0.164
 something+0.162
Neg logit contributions
o-0.195
jo-0.191
os-0.184
azz-0.179
ppy-0.177
Token see
Token ID766
Feature activation+7.61e-02

Pos logit contributions
 anything+0.134
 biggest+0.132
 new+0.130
 things+0.129
 most+0.128
Neg logit contributions
o-0.216
jo-0.212
ppy-0.209
os-0.204
azz-0.201
Token such
Token ID884
Feature activation+1.57e-01

Pos logit contributions
 top+0.073
 bottom+0.069
 anything+0.067
 chores+0.064
 biggest+0.061
Neg logit contributions
jo-0.244
o-0.244
os-0.238
azz-0.237
ppy-0.234
Token a
Token ID257
Feature activation+8.25e-02

Pos logit contributions
 bottom+0.161
 top+0.147
 chores+0.128
 anything+0.125
 middle+0.122
Neg logit contributions
o-0.470
jo-0.461
ppy-0.454
azz-0.443
os-0.433
Token pretty
Token ID2495
Feature activation+5.61e-02

Pos logit contributions
 anything+0.082
 top+0.072
 chores+0.069
 something+0.068
 bottom+0.066
Neg logit contributions
o-0.258
jo-0.258
azz-0.248
ppy-0.241
itty-0.238
Token view
Token ID1570
Feature activation-2.17e-02

Pos logit contributions
 anything+0.085
 top+0.082
 chores+0.080
 bottom+0.078
 something+0.077
Neg logit contributions
jo-0.150
o-0.147
azz-0.142
ppy-0.139
os-0.139
Token."
Token ID526
Feature activation-4.50e-02

Pos logit contributions
jo+0.054
o+0.053
azz+0.052
os+0.051
ppy+0.051
Neg logit contributions
 anything-0.038
 top-0.038
 new-0.036
 something-0.036
 bottom-0.035
Token
Token ID198
Feature activation+3.22e-03

Pos logit contributions
ppy+0.130
jo+0.129
azz+0.126
os+0.124
o+0.123
Neg logit contributions
 top-0.057
 new-0.057
 the-0.056
 something-0.056
 many-0.053
Token was
Token ID373
Feature activation+7.14e-02

Pos logit contributions
jo+0.323
azz+0.320
o+0.315
ppy+0.306
os+0.301
Neg logit contributions
 anything-0.274
 top-0.272
 something-0.264
 new-0.263
 good-0.258
Token very
Token ID845
Feature activation+5.93e-02

Pos logit contributions
 top+0.097
 bottom+0.095
 anything+0.090
 chores+0.089
 things+0.088
Neg logit contributions
o-0.209
jo-0.198
os-0.189
ppy-0.186
anne-0.181
Token happy
Token ID3772
Feature activation+3.94e-02

Pos logit contributions
 anything+0.076
 top+0.076
 bottom+0.073
 chores+0.067
 something+0.066
Neg logit contributions
o-0.178
jo-0.172
os-0.165
azz-0.163
ppy-0.163
Token to
Token ID284
Feature activation+8.34e-02

Pos logit contributions
 top+0.071
 anything+0.070
 bottom+0.068
 chores+0.068
 new+0.066
Neg logit contributions
o-0.094
jo-0.094
os-0.090
azz-0.089
ppy-0.088
Token have
Token ID423
Feature activation+9.02e-02

Pos logit contributions
 anything+0.126
 top+0.126
 new+0.121
 something+0.120
 chores+0.119
Neg logit contributions
jo-0.224
o-0.221
azz-0.214
os-0.212
ppy-0.211
Token such
Token ID884
Feature activation+1.57e-01

Pos logit contributions
 top+0.091
 bottom+0.087
 chores+0.078
 biggest+0.076
 anything+0.075
Neg logit contributions
o-0.291
jo-0.288
azz-0.277
os-0.277
ppy-0.272
Token a
Token ID257
Feature activation+8.40e-02

Pos logit contributions
 bottom+0.205
 top+0.171
 middle+0.163
 shore+0.161
 yourselves+0.154
Neg logit contributions
o-0.395
ppy-0.387
jo-0.375
azz-0.364
lets-0.363
Token special
Token ID2041
Feature activation+6.40e-02

Pos logit contributions
 anything+0.082
 top+0.079
 bottom+0.073
 something+0.072
 chores+0.070
Neg logit contributions
o-0.271
jo-0.269
azz-0.260
ppy-0.255
os-0.254
Token friend
Token ID1545
Feature activation-5.51e-02

Pos logit contributions
 anything+0.121
 top+0.120
 bottom+0.115
 something+0.114
 new+0.113
Neg logit contributions
jo-0.149
o-0.148
azz-0.142
os-0.140
ppy-0.138
Token.
Token ID13
Feature activation-5.19e-02

Pos logit contributions
jo+0.150
azz+0.145
o+0.143
os+0.142
ppy+0.140
Neg logit contributions
 anything-0.088
 something-0.084
 new-0.080
 top-0.079
 things-0.079
Token<|endoftext|>
Token ID50256
Feature activation-4.27e-02

Pos logit contributions
ppy+0.142
jo+0.140
azz+0.136
o+0.135
os+0.133
Neg logit contributions
 top-0.076
 anything-0.072
 new-0.072
 many-0.071
 something-0.070
Token happy
Token ID3772
Feature activation+3.76e-02

Pos logit contributions
 top+0.102
 bottom+0.101
 places+0.098
 things+0.093
 chores+0.092
Neg logit contributions
o-0.174
os-0.151
jo-0.150
azz-0.149
ppy-0.149
Token you
Token ID345
Feature activation-1.02e-02

Pos logit contributions
 anything+0.060
 top+0.059
 new+0.058
 something+0.058
 things+0.056
Neg logit contributions
jo-0.101
o-0.098
azz-0.098
os-0.096
ppy-0.096
Token and
Token ID290
Feature activation+1.53e-02

Pos logit contributions
jo+0.025
azz+0.023
o+0.022
os+0.022
anne+0.022
Neg logit contributions
 anything-0.020
 great-0.019
 something-0.019
 good-0.019
 top-0.019
Token Billy
Token ID15890
Feature activation-1.06e-01

Pos logit contributions
 top+0.027
 anything+0.027
 bottom+0.026
 chores+0.025
 something+0.025
Neg logit contributions
o-0.038
jo-0.037
os-0.035
azz-0.035
ppy-0.034
Token are
Token ID389
Feature activation+7.24e-02

Pos logit contributions
jo+0.262
azz+0.258
o+0.245
os+0.242
ppy+0.240
Neg logit contributions
 anything-0.189
 good-0.185
 new-0.184
 great-0.182
 top-0.181
Token such
Token ID884
Feature activation+1.57e-01

Pos logit contributions
 bottom+0.116
 top+0.111
 chores+0.105
 tasks+0.099
 middle+0.098
Neg logit contributions
o-0.195
ppy-0.164
bug-0.158
jo-0.156
os-0.154
Token good
Token ID922
Feature activation+6.23e-02

Pos logit contributions
 bottom+0.224
 top+0.198
 places+0.177
 middle+0.174
 edge+0.168
Neg logit contributions
o-0.413
ppy-0.397
bug-0.367
azz-0.353
lets-0.344
Token friends
Token ID2460
Feature activation-3.66e-02

Pos logit contributions
 top+0.088
 anything+0.088
 bottom+0.083
 new+0.082
 something+0.082
Neg logit contributions
o-0.176
jo-0.176
azz-0.170
os-0.168
ppy-0.167
Token."
Token ID526
Feature activation-4.60e-02

Pos logit contributions
jo+0.098
o+0.096
azz+0.095
os+0.093
ppy+0.092
Neg logit contributions
 anything-0.058
 top-0.056
 something-0.055
 new-0.055
 good-0.054
Token<|endoftext|>
Token ID50256
Feature activation-3.68e-02

Pos logit contributions
ppy+0.140
jo+0.134
azz+0.134
os+0.128
ne+0.125
Neg logit contributions
 new-0.059
 top-0.059
 the-0.057
 something-0.057
 many-0.056
TokenOnce
Token ID7454
Feature activation+2.44e-02

Pos logit contributions
jo+0.097
o+0.094
ppy+0.093
azz+0.093
os+0.091
Neg logit contributions
 top-0.059
 anything-0.058
 new-0.057
 something-0.057
 bottom-0.055
Token.
Token ID13
Feature activation-7.46e-02

Pos logit contributions
jo+0.087
o+0.084
azz+0.084
os+0.082
ppy+0.081
Neg logit contributions
 new-0.042
 top-0.042
 anything-0.042
 something-0.041
 good-0.040
Token
Token ID198
Feature activation-2.55e-02

Pos logit contributions
jo+0.209
o+0.200
ppy+0.200
os+0.198
azz+0.196
Neg logit contributions
 top-0.112
 new-0.110
 something-0.110
 anything-0.109
 good-0.103
Token
Token ID198
Feature activation-2.72e-02

Pos logit contributions
jo+0.065
o+0.064
azz+0.063
os+0.062
ppy+0.062
Neg logit contributions
 top-0.043
 anything-0.043
 new-0.042
 something-0.041
 bottom-0.040
TokenThe
Token ID464
Feature activation+1.88e-02

Pos logit contributions
jo+0.065
o+0.064
azz+0.062
os+0.062
ppy+0.061
Neg logit contributions
 anything-0.051
 top-0.051
 new-0.049
 something-0.049
 bottom-0.048
Token great
Token ID1049
Feature activation+1.48e-02

Pos logit contributions
 anything+0.040
 top+0.037
 chores+0.037
 something+0.036
 great+0.035
Neg logit contributions
o-0.039
jo-0.039
azz-0.037
os-0.037
ppy-0.035
Token Zen
Token ID14760
Feature activation-2.04e-01

Pos logit contributions
 anything+0.027
 top+0.026
 chores+0.025
 bottom+0.025
 biggest+0.024
Neg logit contributions
jo-0.035
o-0.035
os-0.034
azz-0.034
y-0.032
Tokeno
Token ID78
Feature activation-1.21e-01

Pos logit contributions
jo+0.227
ppy+0.213
azz+0.210
ksh+0.189
anna+0.182
Neg logit contributions
 something-0.576
 new-0.574
 the-0.566
 anything-0.566
 good-0.556
Token hugs
Token ID40657
Feature activation-2.98e-03

Pos logit contributions
jo+0.327
azz+0.304
o+0.296
ppy+0.291
uddy+0.289
Neg logit contributions
 top-0.207
 new-0.197
 good-0.194
 something-0.193
 things-0.187
Token Ben
Token ID3932
Feature activation-1.22e-01

Pos logit contributions
jo+0.009
o+0.009
azz+0.009
os+0.009
ppy+0.009
Neg logit contributions
 anything-0.003
 top-0.003
 something-0.003
 new-0.003
 things-0.003
Token and
Token ID290
Feature activation-1.61e-03

Pos logit contributions
jo+0.353
azz+0.347
os+0.335
o+0.331
itty+0.323
Neg logit contributions
 new-0.174
 something-0.170
 good-0.169
 top-0.166
 things-0.166
Token says
Token ID1139
Feature activation-2.33e-03

Pos logit contributions
jo+0.003
o+0.003
azz+0.003
os+0.003
ppy+0.003
Neg logit contributions
 anything-0.003
 top-0.003
 bottom-0.003
 something-0.003
 chores-0.003
Token bird
Token ID6512
Feature activation-1.58e-01

Pos logit contributions
 anything+0.051
 something+0.047
 top+0.046
 chores+0.046
 bottom+0.043
Neg logit contributions
o-0.065
jo-0.061
os-0.058
itty-0.057
azz-0.057
Token.
Token ID13
Feature activation-6.23e-02

Pos logit contributions
jo+0.451
azz+0.442
os+0.416
anne+0.404
o+0.402
Neg logit contributions
 something-0.237
 anything-0.237
 new-0.234
 things-0.222
 good-0.218
Token So
Token ID1406
Feature activation-8.36e-03

Pos logit contributions
jo+0.182
azz+0.179
ppy+0.177
o+0.169
os+0.169
Neg logit contributions
 top-0.094
 new-0.087
 something-0.087
 many-0.086
 anything-0.084
Token,
Token ID11
Feature activation-2.19e-02

Pos logit contributions
o+0.020
jo+0.019
os+0.018
ppy+0.018
azz+0.018
Neg logit contributions
 chores-0.015
 anything-0.015
 bottom-0.014
 top-0.014
 biggest-0.014
Token the
Token ID262
Feature activation+2.78e-03

Pos logit contributions
o+0.072
os+0.068
jo+0.066
azz+0.064
ppy+0.064
Neg logit contributions
 chores-0.020
 bottom-0.018
 yourselves-0.017
 anything-0.017
 biggest-0.017
Token hipp
Token ID18568
Feature activation-1.85e-01

Pos logit contributions
 anything+0.006
 chores+0.005
 something+0.005
 top+0.005
 trouble+0.005
Neg logit contributions
o-0.006
jo-0.006
os-0.005
itty-0.005
azz-0.005
Tokeno
Token ID78
Feature activation-1.36e-01

Pos logit contributions
jo+0.182
azz+0.168
anna+0.157
ppy+0.156
anne+0.148
Neg logit contributions
 many-0.565
 most-0.564
 best-0.557
 top-0.556
 things-0.551
Token used
Token ID973
Feature activation+4.49e-02

Pos logit contributions
jo+0.347
azz+0.336
os+0.316
itty+0.303
o+0.298
Neg logit contributions
 top-0.241
 something-0.238
 good-0.236
 new-0.233
 things-0.227
Token its
Token ID663
Feature activation+4.16e-02

Pos logit contributions
 bottom+0.065
 yourselves+0.064
 top+0.062
 anything+0.062
 biggest+0.061
Neg logit contributions
o-0.122
os-0.112
ppy-0.109
y-0.103
jo-0.102
Token big
Token ID1263
Feature activation+2.05e-02

Pos logit contributions
 anything+0.060
 top+0.055
 bottom+0.053
 chores+0.052
 yourselves+0.051
Neg logit contributions
o-0.115
ppy-0.109
jo-0.108
os-0.106
azz-0.102
Token mouth
Token ID5422
Feature activation-4.61e-02

Pos logit contributions
 anything+0.037
 top+0.036
 bottom+0.034
 something+0.034
 new+0.034
Neg logit contributions
o-0.050
jo-0.049
os-0.047
ppy-0.047
azz-0.047
Token a
Token ID257
Feature activation+3.22e-02

Pos logit contributions
 something+0.001
 anything+0.001
 new+0.001
 top+0.001
 good+0.001
Neg logit contributions
jo-0.001
azz-0.001
os-0.001
o-0.001
ppy-0.001
Token small
Token ID1402
Feature activation+1.26e-02

Pos logit contributions
 anything+0.034
 top+0.032
 something+0.029
 chores+0.029
 bottom+0.029
Neg logit contributions
o-0.103
jo-0.102
azz-0.099
os-0.098
ppy-0.098
Token bird
Token ID6512
Feature activation-1.37e-01

Pos logit contributions
 top+0.023
 new+0.023
 something+0.022
 anything+0.022
 things+0.022
Neg logit contributions
jo-0.031
o-0.030
azz-0.030
os-0.030
ppy-0.029
Token named
Token ID3706
Feature activation-4.47e-02

Pos logit contributions
jo+0.326
azz+0.325
ppy+0.309
os+0.308
ksh+0.299
Neg logit contributions
 something-0.243
 new-0.240
 top-0.237
 things-0.236
 anything-0.233
Token B
Token ID347
Feature activation-1.20e-01

Pos logit contributions
jo+0.114
azz+0.106
os+0.105
o+0.104
ppy+0.104
Neg logit contributions
 anything-0.079
 something-0.077
 top-0.076
 new-0.074
 an-0.072
Tokenibi
Token ID27567
Feature activation-1.85e-01

Pos logit contributions
anne+0.146
jo+0.144
bug+0.133
azz+0.132
os+0.129
Neg logit contributions
 new-0.281
 top-0.280
 biggest-0.268
 bottom-0.263
 job-0.260
Token,
Token ID11
Feature activation-6.84e-03

Pos logit contributions
jo+0.603
azz+0.586
anne+0.554
os+0.540
enny+0.532
Neg logit contributions
 new-0.206
 anything-0.205
 the-0.204
 top-0.201
 something-0.199
Token fly
Token ID6129
Feature activation-2.01e-02

Pos logit contributions
jo+0.019
azz+0.018
os+0.017
ppy+0.017
o+0.017
Neg logit contributions
 something-0.011
 anything-0.010
 new-0.010
 top-0.010
 good-0.010
Token away
Token ID1497
Feature activation+3.47e-02

Pos logit contributions
jo+0.057
azz+0.055
o+0.054
ppy+0.054
os+0.053
Neg logit contributions
 something-0.028
 new-0.028
 top-0.028
 anything-0.028
 things-0.027
Token from
Token ID422
Feature activation+6.43e-02

Pos logit contributions
 anything+0.068
 top+0.067
 chores+0.066
 biggest+0.065
 bottom+0.065
Neg logit contributions
o-0.077
jo-0.075
os-0.074
azz-0.071
ppy-0.070
Token his
Token ID465
Feature activation+3.72e-02

Pos logit contributions
 biggest+0.058
 chores+0.054
 most+0.052
 top+0.052
 great+0.051
Neg logit contributions
o-0.206
jo-0.201
os-0.191
ppy-0.186
itty-0.184
Token
Token ID198
Feature activation-8.94e-03

Pos logit contributions
jo+0.010
o+0.009
ppy+0.009
azz+0.009
os+0.009
Neg logit contributions
 top-0.006
 anything-0.006
 new-0.006
 something-0.006
 bottom-0.005
TokenOne
Token ID3198
Feature activation+8.76e-03

Pos logit contributions
jo+0.024
o+0.024
azz+0.024
os+0.023
ppy+0.023
Neg logit contributions
 top-0.014
 anything-0.014
 new-0.013
 something-0.013
 bottom-0.013
Token day
Token ID1110
Feature activation-1.57e-02

Pos logit contributions
 top+0.009
 new+0.008
 anything+0.008
 something+0.008
 good+0.008
Neg logit contributions
jo-0.029
azz-0.028
o-0.028
os-0.028
ppy-0.027
Token,
Token ID11
Feature activation-8.70e-03

Pos logit contributions
o+0.041
jo+0.041
azz+0.039
os+0.039
ppy+0.039
Neg logit contributions
 anything-0.025
 top-0.025
 chores-0.023
 new-0.023
 bottom-0.023
Token the
Token ID262
Feature activation+1.12e-02

Pos logit contributions
o+0.029
os+0.027
jo+0.027
ppy+0.027
azz+0.026
Neg logit contributions
 chores-0.007
 anything-0.006
 bottom-0.006
 beauty-0.006
 trouble-0.006
Token hipp
Token ID18568
Feature activation-1.84e-01

Pos logit contributions
 anything+0.025
 chores+0.024
 something+0.023
 top+0.022
 trouble+0.022
Neg logit contributions
o-0.024
jo-0.020
os-0.020
itty-0.019
ppy-0.018
Tokeno
Token ID78
Feature activation-1.34e-01

Pos logit contributions
jo+0.154
azz+0.143
ppy+0.133
anne+0.120
anna+0.116
Neg logit contributions
 many-0.606
 most-0.597
 top-0.592
 things-0.591
 best-0.588
Token saw
Token ID2497
Feature activation+4.21e-02

Pos logit contributions
jo+0.323
azz+0.315
ppy+0.285
os+0.284
anne+0.282
Neg logit contributions
 top-0.245
 something-0.242
 things-0.237
 new-0.234
 anything-0.233
Token a
Token ID257
Feature activation+4.00e-02

Pos logit contributions
 chores+0.041
 accomplishment+0.041
 top+0.036
 bottom+0.036
 anything+0.035
Neg logit contributions
o-0.141
os-0.128
ppy-0.121
jo-0.121
azz-0.120
Token little
Token ID1310
Feature activation+6.81e-03

Pos logit contributions
 anything+0.038
 top+0.034
 chores+0.033
 something+0.031
 bottom+0.030
Neg logit contributions
o-0.130
jo-0.127
azz-0.125
ppy-0.122
os-0.120
Token bird
Token ID6512
Feature activation-1.38e-01

Pos logit contributions
 new+0.012
 top+0.012
 anything+0.012
 something+0.012
 things+0.011
Neg logit contributions
jo-0.018
azz-0.017
o-0.017
os-0.017
ppy-0.016
Token
Token ID198
Feature activation-8.52e-03

Pos logit contributions
jo+0.008
o+0.008
azz+0.008
os+0.008
ppy+0.008
Neg logit contributions
 top-0.005
 anything-0.005
 new-0.005
 something-0.005
 bottom-0.005
TokenOne
Token ID3198
Feature activation+8.71e-03

Pos logit contributions
jo+0.022
o+0.022
azz+0.021
os+0.021
ppy+0.021
Neg logit contributions
 anything-0.014
 top-0.014
 new-0.013
 something-0.013
 bottom-0.013
Token day
Token ID1110
Feature activation-1.71e-02

Pos logit contributions
 top+0.009
 new+0.009
 anything+0.008
 something+0.008
 good+0.008
Neg logit contributions
jo-0.028
azz-0.028
o-0.027
os-0.027
ppy-0.027
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
o+0.045
jo+0.045
azz+0.043
os+0.043
ppy+0.043
Neg logit contributions
 anything-0.027
 top-0.027
 new-0.025
 chores-0.025
 bottom-0.025
Token the
Token ID262
Feature activation+1.09e-02

Pos logit contributions
o+0.034
os+0.031
jo+0.031
ppy+0.030
azz+0.030
Neg logit contributions
 chores-0.009
 anything-0.009
 bottom-0.008
 trouble-0.008
 beauty-0.008
Token hipp
Token ID18568
Feature activation-1.84e-01

Pos logit contributions
 anything+0.024
 chores+0.023
 something+0.023
 trouble+0.021
 top+0.021
Neg logit contributions
o-0.023
jo-0.020
os-0.020
itty-0.019
y-0.018
Tokeno
Token ID78
Feature activation-1.34e-01

Pos logit contributions
jo+0.147
azz+0.138
ppy+0.133
anne+0.116
anna+0.112
Neg logit contributions
 many-0.606
 most-0.594
 things-0.591
 top-0.589
 best-0.584
Token saw
Token ID2497
Feature activation+4.20e-02

Pos logit contributions
jo+0.324
azz+0.312
ppy+0.290
os+0.288
o+0.287
Neg logit contributions
 top-0.244
 something-0.242
 things-0.235
 new-0.234
 anything-0.233
Token a
Token ID257
Feature activation+3.91e-02

Pos logit contributions
 accomplishment+0.041
 chores+0.041
 top+0.037
 bottom+0.037
 anything+0.035
Neg logit contributions
o-0.142
os-0.128
jo-0.123
ppy-0.121
azz-0.121
Token new
Token ID649
Feature activation+2.75e-02

Pos logit contributions
 anything+0.033
 top+0.030
 chores+0.029
 something+0.027
 bottom+0.026
Neg logit contributions
o-0.131
jo-0.128
azz-0.126
ppy-0.123
os-0.122
Token friend
Token ID1545
Feature activation-9.41e-02

Pos logit contributions
 anything+0.045
 top+0.044
 chores+0.042
 something+0.042
 bottom+0.042
Neg logit contributions
o-0.072
jo-0.071
azz-0.068
os-0.068
ppy-0.067
Token One
Token ID1881
Feature activation+1.15e-02

Pos logit contributions
jo+0.122
o+0.117
ppy+0.117
azz+0.116
os+0.115
Neg logit contributions
 top-0.082
 anything-0.079
 new-0.079
 something-0.077
 bottom-0.076
Token day
Token ID1110
Feature activation-1.91e-02

Pos logit contributions
 top+0.011
 new+0.011
 something+0.011
 anything+0.011
 good+0.011
Neg logit contributions
jo-0.038
azz-0.037
ppy-0.036
os-0.036
o-0.036
Token,
Token ID11
Feature activation-1.17e-02

Pos logit contributions
o+0.050
jo+0.049
os+0.047
azz+0.047
ppy+0.047
Neg logit contributions
 anything-0.031
 top-0.030
 chores-0.029
 new-0.029
 bottom-0.029
Token the
Token ID262
Feature activation+1.20e-02

Pos logit contributions
o+0.038
os+0.034
ppy+0.033
jo+0.033
itty+0.033
Neg logit contributions
 chores-0.011
 beauty-0.011
 accomplishment-0.011
 anything-0.010
 trouble-0.010
Token little
Token ID1310
Feature activation-3.18e-02

Pos logit contributions
 anything+0.019
 chores+0.018
 something+0.017
 trouble+0.017
 top+0.016
Neg logit contributions
o-0.032
jo-0.030
os-0.029
ppy-0.028
azz-0.028
Token hipp
Token ID18568
Feature activation-1.84e-01

Pos logit contributions
o+0.076
jo+0.075
azz+0.072
os+0.072
ppy+0.071
Neg logit contributions
 anything-0.059
 top-0.058
 something-0.056
 new-0.056
 chores-0.056
Tokeno
Token ID78
Feature activation-1.32e-01

Pos logit contributions
azz+0.143
jo+0.142
ppy+0.134
anna+0.111
anne+0.110
Neg logit contributions
 many-0.597
 most-0.584
 top-0.583
 things-0.575
 best-0.569
Token wanted
Token ID2227
Feature activation+5.90e-02

Pos logit contributions
jo+0.320
azz+0.313
ppy+0.290
o+0.285
anne+0.285
Neg logit contributions
 top-0.239
 something-0.235
 things-0.229
 new-0.227
 anything-0.225
Token to
Token ID284
Feature activation+5.36e-02

Pos logit contributions
 top+0.134
 accomplishment+0.134
 bottom+0.134
 chores+0.130
 great+0.128
Neg logit contributions
o-0.117
os-0.101
jo-0.097
ppy-0.094
azz-0.092
Token drink
Token ID4144
Feature activation+2.45e-02

Pos logit contributions
 anything+0.094
 most+0.089
 biggest+0.088
 new+0.088
 things+0.087
Neg logit contributions
o-0.126
os-0.120
jo-0.120
ppy-0.114
azz-0.114
Token some
Token ID617
Feature activation+7.83e-02

Pos logit contributions
 accomplishment+0.029
 bottom+0.027
 top+0.026
 miles+0.026
 trouble+0.025
Neg logit contributions
o-0.073
os-0.067
ppy-0.066
y-0.065
azz-0.062
Token very
Token ID845
Feature activation+4.66e-02

Pos logit contributions
 top+0.084
 anything+0.083
 bottom+0.079
 chores+0.077
 accomplishment+0.076
Neg logit contributions
o-0.208
jo-0.193
os-0.187
ppy-0.185
azz-0.181
Token hungry
Token ID14720
Feature activation+1.65e-02

Pos logit contributions
 anything+0.059
 top+0.057
 bottom+0.054
 something+0.053
 chores+0.052
Neg logit contributions
o-0.143
jo-0.136
os-0.133
ppy-0.131
azz-0.130
Token.
Token ID13
Feature activation-4.28e-02

Pos logit contributions
 top+0.032
 anything+0.032
 chores+0.030
 new+0.030
 bottom+0.030
Neg logit contributions
o-0.036
jo-0.036
os-0.034
ppy-0.034
azz-0.034
Token So
Token ID1406
Feature activation-2.06e-04

Pos logit contributions
jo+0.110
ppy+0.105
azz+0.105
o+0.104
os+0.103
Neg logit contributions
 top-0.075
 anything-0.072
 new-0.072
 something-0.071
 bottom-0.069
Token the
Token ID262
Feature activation+1.04e-02

Pos logit contributions
o+0.000
jo+0.000
os+0.000
ppy+0.000
azz+0.000
Neg logit contributions
 chores-0.000
 anything-0.000
 bottom-0.000
 top-0.000
 biggest-0.000
Token hipp
Token ID18568
Feature activation-1.83e-01

Pos logit contributions
 anything+0.024
 chores+0.023
 something+0.023
 trouble+0.022
 top+0.022
Neg logit contributions
o-0.021
jo-0.018
os-0.018
itty-0.017
y-0.016
Tokeno
Token ID78
Feature activation-1.33e-01

Pos logit contributions
azz+0.172
ppy+0.159
jo+0.157
anna+0.133
ems+0.130
Neg logit contributions
 many-0.574
 most-0.568
 things-0.553
 best-0.551
 top-0.549
Token decided
Token ID3066
Feature activation+5.00e-03

Pos logit contributions
azz+0.304
jo+0.296
os+0.275
o+0.269
itty+0.268
Neg logit contributions
 top-0.261
 something-0.258
 many-0.253
 good-0.251
 things-0.249
Token to
Token ID284
Feature activation+4.36e-02

Pos logit contributions
 top+0.011
 bottom+0.011
 chores+0.010
 anything+0.010
 biggest+0.010
Neg logit contributions
o-0.010
jo-0.010
azz-0.009
ppy-0.009
os-0.009
Token cook
Token ID4255
Feature activation-9.70e-03

Pos logit contributions
 anything+0.076
 new+0.075
 biggest+0.074
 most+0.072
 top+0.072
Neg logit contributions
o-0.103
jo-0.097
ppy-0.096
os-0.096
azz-0.089
Token some
Token ID617
Feature activation+7.59e-02

Pos logit contributions
o+0.029
os+0.026
ppy+0.026
jo+0.024
y+0.024
Neg logit contributions
 bottom-0.011
 biggest-0.010
 accomplishment-0.010
 top-0.009
 richest-0.009
Token
Token ID198
Feature activation-7.53e-03

Pos logit contributions
ppy+0.008
jo+0.008
azz+0.008
o+0.008
os+0.008
Neg logit contributions
 top-0.004
 new-0.004
 something-0.004
 anything-0.004
 many-0.004
TokenOne
Token ID3198
Feature activation+7.36e-03

Pos logit contributions
jo+0.022
os+0.021
azz+0.021
o+0.021
ppy+0.021
Neg logit contributions
 top-0.011
 new-0.010
 anything-0.010
 great-0.010
 chores-0.010
Token day
Token ID1110
Feature activation-1.90e-02

Pos logit contributions
 new+0.007
 top+0.007
 good+0.006
 great+0.006
 something+0.006
Neg logit contributions
jo-0.025
azz-0.025
os-0.024
ppy-0.024
o-0.024
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
o+0.049
jo+0.047
azz+0.046
os+0.046
ppy+0.045
Neg logit contributions
 anything-0.031
 top-0.031
 chores-0.030
 biggest-0.030
 bottom-0.030
Token the
Token ID262
Feature activation+1.17e-02

Pos logit contributions
o+0.031
jo+0.028
os+0.028
azz+0.028
ppy+0.028
Neg logit contributions
 beauty-0.011
 chores-0.011
 bottom-0.011
 biggest-0.011
 anything-0.011
Token hipp
Token ID18568
Feature activation-1.82e-01

Pos logit contributions
 anything+0.022
 chores+0.021
 something+0.020
 trouble+0.020
 avail+0.019
Neg logit contributions
o-0.028
jo-0.026
os-0.025
y-0.024
itty-0.023
Tokeno
Token ID78
Feature activation-1.32e-01

Pos logit contributions
azz+0.140
jo+0.133
ppy+0.128
anna+0.100
anne+0.095
Neg logit contributions
 most-0.591
 many-0.585
 top-0.584
 best-0.574
 things-0.573
Token met
Token ID1138
Feature activation+1.18e-02

Pos logit contributions
jo+0.307
azz+0.303
ppy+0.284
o+0.280
os+0.278
Neg logit contributions
 top-0.247
 something-0.235
 bottom-0.229
 things-0.228
 most-0.226
Token a
Token ID257
Feature activation+3.46e-02

Pos logit contributions
 anything+0.006
 top+0.006
 new+0.005
 bottom+0.005
 chores+0.005
Neg logit contributions
o-0.044
jo-0.044
azz-0.043
os-0.042
ppy-0.042
Token wise
Token ID10787
Feature activation-6.79e-03

Pos logit contributions
 anything+0.040
 top+0.039
 chores+0.036
 bottom+0.036
 something+0.036
Neg logit contributions
o-0.106
jo-0.105
azz-0.102
os-0.100
ppy-0.100
Token old
Token ID1468
Feature activation-3.26e-02

Pos logit contributions
jo+0.023
o+0.022
os+0.022
azz+0.022
ppy+0.022
Neg logit contributions
 new-0.006
 top-0.006
 anything-0.006
 something-0.006
 good-0.006
Token food
Token ID2057
Feature activation-6.00e-02

Pos logit contributions
 anything+0.090
 bottom+0.089
 top+0.088
 yourselves+0.077
 miles+0.076
Neg logit contributions
o-0.217
os-0.209
jo-0.207
ppy-0.203
azz-0.197
Token.
Token ID13
Feature activation-4.88e-02

Pos logit contributions
o+0.150
os+0.141
jo+0.141
ppy+0.140
azz+0.135
Neg logit contributions
 top-0.105
 anything-0.104
 bottom-0.101
 great-0.098
 biggest-0.097
Token
Token ID198
Feature activation+7.79e-04

Pos logit contributions
jo+0.128
ppy+0.123
azz+0.121
o+0.121
os+0.121
Neg logit contributions
 top-0.084
 new-0.081
 anything-0.080
 something-0.080
 bottom-0.076
Token
Token ID198
Feature activation-4.46e-03

Pos logit contributions
 top+0.001
 new+0.001
 something+0.001
 anything+0.001
 many+0.001
Neg logit contributions
jo-0.002
ppy-0.002
azz-0.002
o-0.002
os-0.002
TokenThe
Token ID464
Feature activation+1.21e-02

Pos logit contributions
jo+0.012
o+0.012
os+0.012
azz+0.011
ppy+0.011
Neg logit contributions
 top-0.007
 anything-0.007
 something-0.007
 great-0.007
 new-0.007
Token hipp
Token ID18568
Feature activation-1.82e-01

Pos logit contributions
 anything+0.025
 something+0.023
 chores+0.022
 trouble+0.022
 top+0.022
Neg logit contributions
o-0.028
jo-0.025
os-0.024
ppy-0.023
itty-0.023
Tokeno
Token ID78
Feature activation-1.31e-01

Pos logit contributions
azz+0.162
jo+0.153
ppy+0.153
anna+0.124
anne+0.123
Neg logit contributions
 many-0.577
 most-0.573
 top-0.569
 things-0.567
 best-0.555
Token went
Token ID1816
Feature activation+1.92e-02

Pos logit contributions
azz+0.327
jo+0.309
ppy+0.273
enny+0.272
itty+0.270
Neg logit contributions
 top-0.237
 many-0.227
 good-0.227
 things-0.226
 something-0.224
Token to
Token ID284
Feature activation+4.07e-02

Pos logit contributions
 anything+0.038
 top+0.038
 bottom+0.037
 great+0.036
 something+0.036
Neg logit contributions
o-0.044
jo-0.042
os-0.041
azz-0.040
ppy-0.040
Token the
Token ID262
Feature activation+3.42e-02

Pos logit contributions
 best+0.051
 biggest+0.051
 most+0.048
 richest+0.046
 anything+0.045
Neg logit contributions
o-0.115
os-0.101
ppy-0.098
jo-0.098
bug-0.093
Token kitchen
Token ID9592
Feature activation-4.82e-02

Pos logit contributions
 anything+0.045
 accomplishment+0.043
 chores+0.039
 trouble+0.038
 miles+0.038
Neg logit contributions
o-0.101
jo-0.092
os-0.090
ppy-0.089
azz-0.087
Token
Token ID198
Feature activation+1.40e-03

Pos logit contributions
jo+0.083
ppy+0.080
o+0.080
azz+0.079
os+0.077
Neg logit contributions
 top-0.048
 anything-0.048
 new-0.047
 something-0.047
 bottom-0.045
Token
Token ID198
Feature activation-4.90e-03

Pos logit contributions
 top+0.002
 anything+0.002
 new+0.002
 something+0.002
 bottom+0.002
Neg logit contributions
jo-0.004
o-0.004
ppy-0.004
azz-0.004
os-0.004
TokenOne
Token ID3198
Feature activation+4.74e-03

Pos logit contributions
jo+0.013
o+0.013
os+0.012
azz+0.012
ppy+0.012
Neg logit contributions
 top-0.008
 anything-0.008
 new-0.008
 something-0.008
 bottom-0.007
Token day
Token ID1110
Feature activation-1.94e-02

Pos logit contributions
 new+0.003
 good+0.003
 great+0.003
 top+0.003
 big+0.003
Neg logit contributions
azz-0.017
jo-0.017
os-0.017
ppy-0.016
lets-0.016
Token,
Token ID11
Feature activation-1.05e-02

Pos logit contributions
o+0.049
jo+0.048
azz+0.047
ppy+0.046
os+0.045
Neg logit contributions
 anything-0.032
 top-0.032
 bottom-0.030
 chores-0.030
 biggest-0.030
Token Cle
Token ID3779
Feature activation-1.82e-01

Pos logit contributions
o+0.022
azz+0.022
jo+0.021
ppy+0.020
os+0.019
Neg logit contributions
 bottom-0.021
 anything-0.020
 top-0.020
 chores-0.020
 biggest-0.020
Tokenary
Token ID560
Feature activation-7.37e-02

Pos logit contributions
ksh+0.126
jo+0.101
ring+0.081
int+0.080
round+0.080
Neg logit contributions
 the-0.556
 something-0.549
 anything-0.540
 top-0.537
 many-0.532
Token was
Token ID373
Feature activation+5.67e-02

Pos logit contributions
azz+0.182
jo+0.180
os+0.177
o+0.174
uddy+0.169
Neg logit contributions
 something-0.128
 top-0.127
 a-0.119
 many-0.118
 the-0.118
Token walking
Token ID6155
Feature activation+7.14e-03

Pos logit contributions
 anything+0.085
 top+0.085
 bottom+0.081
 chores+0.079
 things+0.077
Neg logit contributions
o-0.156
jo-0.148
ppy-0.145
os-0.142
azz-0.141
Token through
Token ID832
Feature activation+4.73e-02

Pos logit contributions
 anything+0.012
 top+0.012
 chores+0.011
 bottom+0.011
 new+0.011
Neg logit contributions
o-0.018
jo-0.018
os-0.017
azz-0.017
ppy-0.017
Token the
Token ID262
Feature activation+5.89e-02

Pos logit contributions
 chores+0.051
 biggest+0.051
 accomplishment+0.048
 choice+0.048
 best+0.047
Neg logit contributions
o-0.142
jo-0.139
os-0.134
azz-0.127
ppy-0.126
Token
Token ID198
Feature activation-1.17e-02

Pos logit contributions
jo+0.188
ppy+0.182
os+0.176
o+0.174
azz+0.173
Neg logit contributions
 new-0.096
 top-0.090
 good-0.087
 something-0.087
 many-0.086
Token
Token ID198
Feature activation-1.63e-02

Pos logit contributions
jo+0.032
ppy+0.032
o+0.031
azz+0.031
os+0.031
Neg logit contributions
 new-0.017
 top-0.017
 something-0.016
 anything-0.016
 many-0.016
TokenThe
Token ID464
Feature activation+1.07e-02

Pos logit contributions
jo+0.033
o+0.032
os+0.031
azz+0.031
ppy+0.030
Neg logit contributions
 top-0.037
 anything-0.037
 new-0.036
 something-0.036
 great-0.036
Token se
Token ID384
Feature activation-8.85e-02

Pos logit contributions
 anything+0.021
 chores+0.020
 top+0.020
 something+0.020
 bottom+0.019
Neg logit contributions
o-0.023
azz-0.022
jo-0.022
ppy-0.021
os-0.021
Tokenag
Token ID363
Feature activation-6.30e-02

Pos logit contributions
jo+0.215
o+0.214
os+0.213
azz+0.212
ppy+0.208
Neg logit contributions
 good-0.140
 new-0.140
 things-0.137
 anything-0.135
 great-0.135
Tokenull
Token ID724
Feature activation-1.82e-01

Pos logit contributions
jo+0.140
o+0.136
azz+0.134
ppy+0.131
os+0.131
Neg logit contributions
 top-0.127
 anything-0.125
 something-0.122
 new-0.122
 bottom-0.121
Token tried
Token ID3088
Feature activation+3.48e-02

Pos logit contributions
jo+0.468
azz+0.451
enny+0.414
os+0.412
ppy+0.407
Neg logit contributions
 new-0.332
 top-0.329
 good-0.320
 big-0.316
 things-0.315
Token to
Token ID284
Feature activation+3.65e-02

Pos logit contributions
 anything+0.060
 top+0.060
 new+0.059
 something+0.058
 bottom+0.056
Neg logit contributions
jo-0.090
o-0.086
azz-0.085
os-0.084
ppy-0.084
Token take
Token ID1011
Feature activation+5.54e-02

Pos logit contributions
 anything+0.063
 top+0.062
 something+0.060
 new+0.060
 chores+0.059
Neg logit contributions
o-0.091
jo-0.090
azz-0.087
ppy-0.087
os-0.087
Token the
Token ID262
Feature activation+4.39e-02

Pos logit contributions
 bottom+0.051
 chores+0.050
 top+0.050
 anything+0.049
 biggest+0.047
Neg logit contributions
o-0.176
jo-0.171
azz-0.170
ppy-0.169
os-0.168
Token oy
Token ID35104
Feature activation-1.05e-01

Pos logit contributions
 chores+0.080
 anything+0.078
 top+0.072
 accomplishment+0.071
 miles+0.071
Neg logit contributions
o-0.098
ppy-0.090
y-0.090
jo-0.089
azz-0.087
Token
Token ID198
Feature activation-1.41e-02

Pos logit contributions
jo+0.024
o+0.023
ppy+0.023
azz+0.023
os+0.023
Neg logit contributions
 top-0.014
 new-0.014
 anything-0.014
 something-0.013
 bottom-0.013
TokenJack
Token ID14295
Feature activation-1.18e-01

Pos logit contributions
jo+0.031
o+0.030
azz+0.029
os+0.029
ppy+0.029
Neg logit contributions
 anything-0.029
 top-0.029
 new-0.028
 something-0.028
 bottom-0.028
Token asked
Token ID1965
Feature activation+1.01e-02

Pos logit contributions
azz+0.283
jo+0.274
itty+0.271
uddy+0.265
os+0.262
Neg logit contributions
 top-0.211
 something-0.204
 the-0.202
 new-0.197
 many-0.193
Token the
Token ID262
Feature activation+4.37e-03

Pos logit contributions
 top+0.008
 anything+0.007
 bottom+0.007
 biggest+0.007
 chores+0.007
Neg logit contributions
o-0.034
jo-0.034
os-0.033
azz-0.033
ppy-0.033
Token mer
Token ID4017
Feature activation-8.13e-02

Pos logit contributions
 anything+0.007
 chores+0.007
 top+0.006
 trouble+0.006
 something+0.006
Neg logit contributions
jo-0.011
o-0.011
azz-0.011
ppy-0.011
os-0.011
Tokenmaid
Token ID23151
Feature activation-1.82e-01

Pos logit contributions
o+0.192
jo+0.186
azz+0.180
os+0.179
itty+0.179
Neg logit contributions
 top-0.148
 new-0.148
 anything-0.147
 things-0.143
 biggest-0.141
Token why
Token ID1521
Feature activation+3.32e-02

Pos logit contributions
jo+0.536
azz+0.512
o+0.503
ppy+0.501
os+0.491
Neg logit contributions
 anything-0.249
 new-0.246
 something-0.244
 top-0.230
 things-0.225
Token the
Token ID262
Feature activation+4.74e-02

Pos logit contributions
 top+0.051
 anything+0.049
 biggest+0.048
 bottom+0.048
 chores+0.047
Neg logit contributions
jo-0.088
o-0.087
os-0.085
azz-0.085
ppy-0.083
Token waves
Token ID9813
Feature activation-8.21e-02

Pos logit contributions
 chores+0.074
 anything+0.073
 top+0.071
 something+0.067
 bottom+0.067
Neg logit contributions
jo-0.120
o-0.118
azz-0.117
ppy-0.116
os-0.112
Token were
Token ID547
Feature activation+7.21e-02

Pos logit contributions
jo+0.185
o+0.182
azz+0.176
ppy+0.173
os+0.172
Neg logit contributions
 anything-0.164
 top-0.161
 something-0.157
 new-0.157
 bottom-0.155
Token rocking
Token ID38059
Feature activation-9.96e-03

Pos logit contributions
 top+0.098
 chores+0.093
 bottom+0.090
 anything+0.088
 accomplishment+0.085
Neg logit contributions
o-0.204
jo-0.200
ppy-0.194
azz-0.194
os-0.191
Token sad
Token ID6507
Feature activation+1.76e-02

Pos logit contributions
 anything+0.039
 top+0.038
 bottom+0.035
 something+0.035
 new+0.034
Neg logit contributions
o-0.102
jo-0.100
azz-0.097
os-0.097
ppy-0.097
Token.
Token ID13
Feature activation-5.80e-02

Pos logit contributions
 anything+0.031
 top+0.030
 something+0.029
 new+0.029
 bottom+0.028
Neg logit contributions
jo-0.044
o-0.044
azz-0.042
os-0.042
ppy-0.042
Token
Token ID198
Feature activation-1.18e-02

Pos logit contributions
ppy+0.169
jo+0.164
azz+0.159
os+0.156
o+0.156
Neg logit contributions
 top-0.089
 new-0.083
 something-0.082
 anything-0.078
 bottom-0.077
Token
Token ID198
Feature activation-1.59e-02

Pos logit contributions
ppy+0.033
jo+0.033
azz+0.032
o+0.032
os+0.032
Neg logit contributions
 top-0.016
 new-0.016
 something-0.015
 many-0.015
 anything-0.015
TokenThe
Token ID464
Feature activation+6.95e-03

Pos logit contributions
ppy+0.046
os+0.046
azz+0.046
jo+0.046
o+0.045
Neg logit contributions
 top-0.022
 new-0.020
 something-0.020
 great-0.019
 many-0.019
Token kid
Token ID5141
Feature activation-1.82e-01

Pos logit contributions
 anything+0.012
 top+0.011
 something+0.011
 chores+0.010
 bottom+0.010
Neg logit contributions
o-0.018
jo-0.017
azz-0.017
os-0.017
ppy-0.016
Token stayed
Token ID9658
Feature activation+6.26e-02

Pos logit contributions
jo+0.463
azz+0.457
os+0.435
o+0.417
ppy+0.416
Neg logit contributions
 top-0.331
 something-0.327
 new-0.315
 best-0.313
 most-0.313
Token sad
Token ID6507
Feature activation+2.90e-02

Pos logit contributions
 anything+0.117
 bottom+0.117
 top+0.115
 biggest+0.110
 beauty+0.105
Neg logit contributions
o-0.147
jo-0.133
ppy-0.132
y-0.129
azz-0.126
Token all
Token ID477
Feature activation+1.40e-02

Pos logit contributions
 anything+0.056
 top+0.056
 new+0.053
 something+0.053
 bottom+0.053
Neg logit contributions
jo-0.067
o-0.066
azz-0.064
os-0.063
ppy-0.063
Token day
Token ID1110
Feature activation-9.88e-04

Pos logit contributions
 anything+0.017
 top+0.017
 something+0.016
 new+0.016
 bottom+0.016
Neg logit contributions
jo-0.043
o-0.042
azz-0.041
os-0.041
ppy-0.040
Token,
Token ID11
Feature activation-2.96e-03

Pos logit contributions
jo+0.003
o+0.002
azz+0.002
os+0.002
ppy+0.002
Neg logit contributions
 anything-0.002
 top-0.002
 new-0.002
 something-0.002
 bottom-0.002
Token.
Token ID13
Feature activation-5.95e-02

Pos logit contributions
 top+0.033
 anything+0.032
 bottom+0.031
 chores+0.031
 new+0.031
Neg logit contributions
o-0.037
jo-0.036
ppy-0.034
azz-0.034
os-0.034
Token
Token ID220
Feature activation-3.22e-02

Pos logit contributions
ppy+0.181
jo+0.176
azz+0.172
os+0.166
anne+0.165
Neg logit contributions
 top-0.082
 many-0.075
 new-0.073
 something-0.071
 the-0.069
Token
Token ID198
Feature activation-8.81e-03

Pos logit contributions
jo+0.084
o+0.082
azz+0.081
ppy+0.080
os+0.079
Neg logit contributions
 top-0.052
 anything-0.052
 new-0.050
 something-0.050
 bottom-0.048
Token
Token ID198
Feature activation-1.29e-02

Pos logit contributions
jo+0.024
ppy+0.023
azz+0.023
o+0.023
os+0.023
Neg logit contributions
 top-0.014
 new-0.013
 anything-0.013
 something-0.012
 many-0.012
TokenThe
Token ID464
Feature activation+9.94e-03

Pos logit contributions
jo+0.038
o+0.038
azz+0.037
os+0.036
ppy+0.036
Neg logit contributions
 top-0.017
 anything-0.017
 something-0.016
 new-0.016
 bottom-0.016
Token hipp
Token ID18568
Feature activation-1.82e-01

Pos logit contributions
 anything+0.023
 something+0.022
 chores+0.021
 trouble+0.021
 top+0.021
Neg logit contributions
o-0.020
jo-0.017
os-0.017
itty-0.016
ppy-0.016
Tokeno
Token ID78
Feature activation-1.31e-01

Pos logit contributions
azz+0.168
jo+0.157
ppy+0.154
anna+0.127
anne+0.124
Neg logit contributions
 most-0.563
 many-0.561
 top-0.557
 things-0.552
 others-0.543
Token found
Token ID1043
Feature activation+4.23e-02

Pos logit contributions
azz+0.337
jo+0.317
itty+0.292
uddy+0.285
ppy+0.282
Neg logit contributions
 top-0.233
 many-0.217
 something-0.213
 good-0.212
 great-0.212
Token some
Token ID617
Feature activation+7.46e-02

Pos logit contributions
 avail+0.056
 chores+0.054
 accomplishment+0.049
 bottom+0.048
 port+0.045
Neg logit contributions
o-0.128
os-0.116
ray-0.105
ppy-0.104
bug-0.104
Token vegetables
Token ID13701
Feature activation-7.70e-02

Pos logit contributions
 anything+0.099
 top+0.095
 bottom+0.095
 yourselves+0.091
 accomplishment+0.086
Neg logit contributions
o-0.193
ppy-0.184
os-0.184
y-0.182
jo-0.182
Token and
Token ID290
Feature activation+1.67e-03

Pos logit contributions
o+0.169
os+0.158
ppy+0.155
jo+0.153
y+0.147
Neg logit contributions
 top-0.159
 anything-0.156
 bottom-0.151
 great-0.150
 chores-0.150
Token
Token ID198
Feature activation-4.55e-03

Pos logit contributions
 top+0.004
 new+0.004
 anything+0.004
 something+0.003
 many+0.003
Neg logit contributions
jo-0.006
ppy-0.006
azz-0.006
o-0.006
os-0.006
TokenOne
Token ID3198
Feature activation+7.44e-03

Pos logit contributions
jo+0.013
ppy+0.013
azz+0.013
os+0.013
o+0.013
Neg logit contributions
 top-0.006
 new-0.006
 great-0.006
 bottom-0.005
 anything-0.005
Token day
Token ID1110
Feature activation-1.83e-02

Pos logit contributions
 new+0.006
 top+0.006
 good+0.006
 great+0.005
 something+0.005
Neg logit contributions
azz-0.026
jo-0.026
ppy-0.025
os-0.025
o-0.024
Token,
Token ID11
Feature activation-1.00e-02

Pos logit contributions
o+0.047
jo+0.046
os+0.044
azz+0.044
ppy+0.044
Neg logit contributions
 anything-0.030
 top-0.029
 chores-0.029
 new-0.028
 bottom-0.028
Token the
Token ID262
Feature activation+1.42e-02

Pos logit contributions
o+0.029
jo+0.027
os+0.026
azz+0.026
ppy+0.026
Neg logit contributions
 chores-0.013
 anything-0.013
 bottom-0.013
 beauty-0.013
 biggest-0.012
Token bull
Token ID6473
Feature activation-1.81e-01

Pos logit contributions
 anything+0.030
 top+0.029
 something+0.028
 chores+0.028
 new+0.027
Neg logit contributions
jo-0.030
o-0.030
azz-0.028
os-0.028
ppy-0.028
Token saw
Token ID2497
Feature activation+4.39e-02

Pos logit contributions
jo+0.465
azz+0.455
ppy+0.419
o+0.409
anne+0.409
Neg logit contributions
 top-0.330
 something-0.318
 bottom-0.308
 anything-0.302
 things-0.300
Token a
Token ID257
Feature activation+4.00e-02

Pos logit contributions
 chores+0.036
 accomplishment+0.035
 bottom+0.034
 top+0.034
 anything+0.032
Neg logit contributions
o-0.151
jo-0.141
os-0.141
azz-0.135
ppy-0.133
Token butterfly
Token ID35113
Feature activation-1.25e-01

Pos logit contributions
 anything+0.036
 top+0.035
 something+0.032
 chores+0.031
 bottom+0.031
Neg logit contributions
jo-0.132
o-0.132
azz-0.129
os-0.127
ppy-0.126
Token.
Token ID13
Feature activation-6.35e-02

Pos logit contributions
jo+0.363
azz+0.348
o+0.343
os+0.339
ppy+0.337
Neg logit contributions
 top-0.185
 anything-0.178
 something-0.176
 new-0.174
 bottom-0.169
Token He
Token ID679
Feature activation-2.86e-02

Pos logit contributions
jo+0.188
ppy+0.187
azz+0.182
o+0.176
os+0.175
Neg logit contributions
 top-0.089
 new-0.085
 the-0.082
 something-0.082
 anything-0.079
Token in
Token ID287
Feature activation+5.32e-02

Pos logit contributions
 anything+0.101
 chores+0.093
 bottom+0.092
 top+0.092
 trouble+0.091
Neg logit contributions
o-0.132
os-0.115
ppy-0.112
jo-0.110
y-0.108
Token a
Token ID257
Feature activation+4.75e-02

Pos logit contributions
 accomplishment+0.074
 choice+0.073
 bottom+0.073
 biggest+0.072
rier+0.069
Neg logit contributions
o-0.142
os-0.120
jo-0.119
ppy-0.115
itty-0.114
Token river
Token ID7850
Feature activation-8.23e-02

Pos logit contributions
 anything+0.039
 chores+0.031
 something+0.031
 top+0.031
 accomplishment+0.029
Neg logit contributions
o-0.167
jo-0.162
os-0.156
azz-0.155
ppy-0.153
Token.
Token ID13
Feature activation-4.47e-02

Pos logit contributions
o+0.203
jo+0.203
os+0.194
azz+0.193
ppy+0.190
Neg logit contributions
 anything-0.145
 top-0.142
 new-0.136
 something-0.136
 chores-0.135
Token The
Token ID383
Feature activation+1.68e-02

Pos logit contributions
jo+0.119
ppy+0.115
azz+0.114
o+0.113
os+0.112
Neg logit contributions
 top-0.074
 anything-0.070
 new-0.070
 something-0.069
 many-0.068
Token hipp
Token ID18568
Feature activation-1.81e-01

Pos logit contributions
 anything+0.040
 something+0.038
 chores+0.037
 trouble+0.036
 avail+0.036
Neg logit contributions
o-0.033
jo-0.027
os-0.027
y-0.025
itty-0.025
Tokeno
Token ID78
Feature activation-1.29e-01

Pos logit contributions
azz+0.149
ppy+0.145
jo+0.134
anne+0.107
anna+0.099
Neg logit contributions
 many-0.600
 most-0.591
 top-0.584
 things-0.581
 new-0.569
Token was
Token ID373
Feature activation+6.15e-02

Pos logit contributions
azz+0.322
jo+0.314
ppy+0.303
anne+0.287
itty+0.281
Neg logit contributions
 top-0.224
 many-0.220
 most-0.214
 new-0.213
 things-0.213
Token very
Token ID845
Feature activation+4.46e-02

Pos logit contributions
 avail+0.103
 bottom+0.102
 top+0.102
 chores+0.100
 choice+0.098
Neg logit contributions
o-0.172
os-0.133
ppy-0.132
jo-0.130
bug-0.128
Token happy
Token ID3772
Feature activation+1.96e-02

Pos logit contributions
 anything+0.058
 top+0.052
 something+0.051
 bottom+0.050
 chores+0.049
Neg logit contributions
o-0.141
os-0.128
jo-0.127
ppy-0.124
azz-0.122
Token and
Token ID290
Feature activation+4.65e-03

Pos logit contributions
 top+0.040
 anything+0.040
 chores+0.038
 bottom+0.038
 new+0.038
Neg logit contributions
o-0.044
jo-0.043
os-0.041
azz-0.041
ppy-0.040
Token.
Token ID13
Feature activation-5.64e-02

Pos logit contributions
 top+0.043
 bottom+0.042
 biggest+0.042
 chores+0.041
 anything+0.041
Neg logit contributions
jo-0.058
o-0.058
ppy-0.058
os-0.056
azz-0.055
Token
Token ID220
Feature activation-2.89e-02

Pos logit contributions
jo+0.144
o+0.140
os+0.139
azz+0.139
ppy+0.139
Neg logit contributions
 top-0.098
 new-0.094
 anything-0.094
 something-0.093
 things-0.090
Token
Token ID198
Feature activation-7.97e-03

Pos logit contributions
jo+0.077
o+0.074
ppy+0.074
azz+0.074
os+0.072
Neg logit contributions
 top-0.045
 anything-0.044
 new-0.044
 something-0.043
 bottom-0.041
Token
Token ID198
Feature activation-1.28e-02

Pos logit contributions
ppy+0.022
jo+0.022
azz+0.022
os+0.022
o+0.022
Neg logit contributions
 top-0.011
 new-0.011
 something-0.010
 many-0.010
 anything-0.010
TokenThe
Token ID464
Feature activation+6.76e-03

Pos logit contributions
jo+0.029
os+0.029
o+0.029
azz+0.028
ppy+0.028
Neg logit contributions
 top-0.025
 new-0.024
 anything-0.024
 something-0.024
 great-0.024
Token wolf
Token ID17481
Feature activation-1.81e-01

Pos logit contributions
 anything+0.011
 something+0.010
 top+0.010
 bottom+0.010
 sand+0.010
Neg logit contributions
jo-0.017
o-0.017
ppy-0.016
azz-0.016
os-0.016
Token went
Token ID1816
Feature activation+1.56e-02

Pos logit contributions
azz+0.454
jo+0.448
os+0.429
enny+0.416
ppy+0.415
Neg logit contributions
 something-0.316
 new-0.314
 top-0.312
 the-0.297
 things-0.296
Token away
Token ID1497
Feature activation+2.27e-02

Pos logit contributions
 anything+0.027
 top+0.026
 bottom+0.026
 biggest+0.025
 chores+0.025
Neg logit contributions
o-0.040
jo-0.038
azz-0.037
ppy-0.036
os-0.036
Token,
Token ID11
Feature activation-4.51e-03

Pos logit contributions
 anything+0.046
 chores+0.045
 top+0.044
 biggest+0.044
 bottom+0.043
Neg logit contributions
o-0.050
jo-0.049
os-0.046
azz-0.046
ppy-0.046
Token but
Token ID475
Feature activation-7.20e-03

Pos logit contributions
jo+0.011
o+0.011
azz+0.011
os+0.011
ppy+0.011
Neg logit contributions
 anything-0.008
 top-0.008
 new-0.007
 bottom-0.007
 something-0.007
Token Snow
Token ID7967
Feature activation-1.46e-01

Pos logit contributions
o+0.015
jo+0.014
ppy+0.014
azz+0.013
os+0.012
Neg logit contributions
 top-0.015
 chores-0.015
 biggest-0.015
 anything-0.015
 bottom-0.014
Token
Token ID198
Feature activation-5.88e-03

Pos logit contributions
jo+0.001
o+0.001
ppy+0.001
azz+0.001
os+0.001
Neg logit contributions
 top-0.000
 anything-0.000
 new-0.000
 something-0.000
 bottom-0.000
TokenOne
Token ID3198
Feature activation+9.05e-03

Pos logit contributions
jo+0.016
o+0.015
azz+0.015
os+0.015
ppy+0.015
Neg logit contributions
 top-0.010
 anything-0.009
 new-0.009
 something-0.009
 bottom-0.009
Token day
Token ID1110
Feature activation-1.73e-02

Pos logit contributions
 new+0.008
 top+0.008
 good+0.008
 something+0.008
 great+0.008
Neg logit contributions
jo-0.031
azz-0.030
os-0.029
ppy-0.029
o-0.029
Token,
Token ID11
Feature activation-9.65e-03

Pos logit contributions
o+0.045
jo+0.044
azz+0.042
os+0.042
ppy+0.042
Neg logit contributions
 anything-0.028
 top-0.028
 chores-0.027
 new-0.027
 bottom-0.026
Token the
Token ID262
Feature activation+1.30e-02

Pos logit contributions
o+0.029
os+0.026
ppy+0.026
jo+0.026
azz+0.026
Neg logit contributions
 chores-0.012
 beauty-0.011
 trouble-0.011
 anything-0.011
 bottom-0.011
Token hipp
Token ID18568
Feature activation-1.81e-01

Pos logit contributions
 anything+0.028
 chores+0.026
 something+0.025
 trouble+0.025
 accomplishment+0.024
Neg logit contributions
o-0.029
jo-0.025
os-0.024
itty-0.023
y-0.023
Tokeno
Token ID78
Feature activation-1.31e-01

Pos logit contributions
jo+0.149
azz+0.149
ppy+0.139
anna+0.120
anne+0.116
Neg logit contributions
 many-0.585
 most-0.579
 top-0.573
 things-0.568
 best-0.564
Token saw
Token ID2497
Feature activation+4.45e-02

Pos logit contributions
jo+0.307
azz+0.299
o+0.285
os+0.284
ppy+0.284
Neg logit contributions
 top-0.244
 something-0.239
 new-0.234
 anything-0.233
 things-0.232
Token something
Token ID1223
Feature activation+1.53e-02

Pos logit contributions
 chores+0.044
 accomplishment+0.044
 bottom+0.040
 top+0.039
 beauty+0.038
Neg logit contributions
o-0.148
os-0.133
jo-0.131
azz-0.128
ppy-0.127
Token that
Token ID326
Feature activation+1.50e-02

Pos logit contributions
 top+0.013
 anything+0.012
 bottom+0.011
 chores+0.011
 yourselves+0.010
Neg logit contributions
o-0.051
jo-0.050
azz-0.049
os-0.048
ppy-0.048
Token caught
Token ID4978
Feature activation+2.93e-02

Pos logit contributions
 top+0.031
 chores+0.031
 anything+0.031
 bottom+0.030
 trouble+0.029
Neg logit contributions
o-0.032
jo-0.030
os-0.030
azz-0.028
ppy-0.028
Token from
Token ID422
Feature activation+4.84e-02

Pos logit contributions
jo+0.164
o+0.163
os+0.157
azz+0.156
ppy+0.154
Neg logit contributions
 anything-0.113
 top-0.112
 bottom-0.106
 new-0.106
 something-0.106
Token the
Token ID262
Feature activation+4.65e-02

Pos logit contributions
 biggest+0.046
 top+0.044
 bottom+0.044
 anything+0.043
 chores+0.042
Neg logit contributions
jo-0.154
o-0.150
azz-0.146
ppy-0.144
os-0.143
Token tank
Token ID6873
Feature activation-1.10e-01

Pos logit contributions
 anything+0.058
 something+0.054
 chores+0.054
 yourselves+0.053
 top+0.050
Neg logit contributions
jo-0.123
ppy-0.114
azz-0.114
o-0.113
os-0.109
Token.
Token ID13
Feature activation-6.36e-02

Pos logit contributions
jo+0.274
o+0.271
azz+0.263
os+0.260
ppy+0.258
Neg logit contributions
 anything-0.189
 top-0.189
 new-0.181
 something-0.180
 bottom-0.178
Token Hel
Token ID5053
Feature activation-7.71e-02

Pos logit contributions
jo+0.147
o+0.146
ppy+0.144
os+0.140
anne+0.135
Neg logit contributions
 top-0.122
 new-0.118
 anything-0.118
 something-0.116
 many-0.114
Tokenio
Token ID952
Feature activation-1.81e-01

Pos logit contributions
jo+0.153
o+0.148
azz+0.145
os+0.142
ppy+0.138
Neg logit contributions
 top-0.173
 anything-0.172
 bottom-0.163
 things-0.163
 something-0.162
Token was
Token ID373
Feature activation+5.42e-02

Pos logit contributions
azz+0.428
jo+0.386
itty+0.383
ppy+0.381
o+0.361
Neg logit contributions
 top-0.333
 many-0.312
 most-0.307
 great-0.306
 new-0.305
Token thankful
Token ID25535
Feature activation+5.71e-02

Pos logit contributions
 top+0.075
 anything+0.074
 chores+0.073
 bottom+0.072
 things+0.071
Neg logit contributions
o-0.157
jo-0.152
os-0.146
ppy-0.142
azz-0.141
Token and
Token ID290
Feature activation+6.32e-03

Pos logit contributions
 top+0.094
 anything+0.094
 bottom+0.089
 chores+0.088
 new+0.087
Neg logit contributions
jo-0.148
o-0.146
azz-0.142
os-0.140
ppy-0.139
Token thanked
Token ID26280
Feature activation+4.75e-02

Pos logit contributions
 anything+0.013
 top+0.013
 bottom+0.013
 chores+0.012
 something+0.012
Neg logit contributions
jo-0.014
o-0.014
azz-0.013
os-0.013
ppy-0.013
Token T
Token ID309
Feature activation-7.61e-02

Pos logit contributions
 top+0.075
 anything+0.075
 bottom+0.072
 new+0.071
 something+0.071
Neg logit contributions
jo-0.124
o-0.122
azz-0.120
os-0.117
ppy-0.116
Token
Token ID198
Feature activation-1.04e-02

Pos logit contributions
ppy+0.014
jo+0.014
azz+0.013
o+0.013
os+0.013
Neg logit contributions
 top-0.007
 new-0.007
 something-0.007
 many-0.007
 anything-0.007
TokenOne
Token ID3198
Feature activation+7.64e-03

Pos logit contributions
jo+0.029
o+0.029
os+0.029
ppy+0.029
azz+0.029
Neg logit contributions
 top-0.015
 anything-0.014
 new-0.014
 something-0.014
 great-0.014
Token day
Token ID1110
Feature activation-1.78e-02

Pos logit contributions
 new+0.007
 good+0.007
 top+0.006
 great+0.006
 something+0.006
Neg logit contributions
jo-0.026
azz-0.026
os-0.025
ppy-0.025
o-0.025
Token,
Token ID11
Feature activation-1.08e-02

Pos logit contributions
o+0.044
jo+0.043
azz+0.042
os+0.042
ppy+0.041
Neg logit contributions
 anything-0.030
 top-0.030
 chores-0.029
 biggest-0.029
 bottom-0.029
Token the
Token ID262
Feature activation+1.60e-02

Pos logit contributions
o+0.033
jo+0.030
os+0.030
ppy+0.029
azz+0.029
Neg logit contributions
 chores-0.011
 yourselves-0.011
 bottom-0.011
 beauty-0.011
 biggest-0.010
Token pig
Token ID12967
Feature activation-1.80e-01

Pos logit contributions
 anything+0.040
 accomplishment+0.037
 top+0.036
 yourselves+0.036
 something+0.035
Neg logit contributions
o-0.024
ppy-0.022
itty-0.021
jo-0.021
y-0.020
Tokengy
Token ID1360
Feature activation-1.06e-01

Pos logit contributions
jo+0.407
azz+0.400
os+0.367
ppy+0.366
itty+0.360
Neg logit contributions
 top-0.340
 something-0.327
 most-0.319
 bottom-0.316
 things-0.309
Token saw
Token ID2497
Feature activation+4.51e-02

Pos logit contributions
jo+0.242
azz+0.241
o+0.235
os+0.231
ppy+0.226
Neg logit contributions
 top-0.207
 new-0.199
 something-0.197
 anything-0.196
 things-0.195
Token a
Token ID257
Feature activation+4.03e-02

Pos logit contributions
 chores+0.048
 accomplishment+0.047
 bottom+0.045
 choice+0.044
 top+0.042
Neg logit contributions
o-0.143
os-0.129
jo-0.127
azz-0.124
ppy-0.121
Token big
Token ID1263
Feature activation+2.43e-02

Pos logit contributions
 anything+0.042
 top+0.039
 chores+0.037
 something+0.035
 bottom+0.035
Neg logit contributions
o-0.127
jo-0.126
azz-0.123
ppy-0.120
os-0.119
Token dog
Token ID3290
Feature activation-1.35e-01

Pos logit contributions
 anything+0.040
 top+0.038
 chores+0.037
 something+0.036
 bottom+0.036
Neg logit contributions
o-0.062
jo-0.062
azz-0.059
os-0.058
ppy-0.058
Token He
Token ID679
Feature activation-1.33e-02

Pos logit contributions
jo+0.133
ppy+0.130
os+0.128
o+0.128
azz+0.126
Neg logit contributions
 top-0.078
 new-0.075
 something-0.075
 anything-0.073
 many-0.072
Token knew
Token ID2993
Feature activation+2.77e-02

Pos logit contributions
jo+0.029
azz+0.029
o+0.029
os+0.029
ppy+0.028
Neg logit contributions
 top-0.026
 new-0.026
 something-0.026
 great-0.025
 good-0.025
Token he
Token ID339
Feature activation-1.27e-02

Pos logit contributions
 bottom+0.039
 top+0.039
 biggest+0.037
 anything+0.037
 chores+0.037
Neg logit contributions
o-0.076
jo-0.075
os-0.071
azz-0.071
ppy-0.070
Token was
Token ID373
Feature activation+6.62e-02

Pos logit contributions
azz+0.027
jo+0.026
o+0.026
os+0.025
ppy+0.025
Neg logit contributions
 anything-0.026
 something-0.026
 great-0.026
 new-0.026
 good-0.026
Token safe
Token ID3338
Feature activation+4.44e-02

Pos logit contributions
 top+0.087
 bottom+0.084
 chores+0.080
 sand+0.079
 places+0.078
Neg logit contributions
o-0.187
ppy-0.176
jo-0.176
os-0.166
anna-0.165
Token in
Token ID287
Feature activation+9.21e-02

Pos logit contributions
 top+0.093
 bottom+0.090
 chores+0.089
 anything+0.089
 biggest+0.087
Neg logit contributions
o-0.095
jo-0.093
os-0.087
ppy-0.086
azz-0.084
Token his
Token ID465
Feature activation+5.44e-02

Pos logit contributions
 biggest+0.117
 top+0.104
 bottom+0.099
 most+0.095
 cheapest+0.090
Neg logit contributions
jo-0.261
o-0.249
os-0.239
ppy-0.225
azz-0.221
Token crib
Token ID48083
Feature activation-6.66e-02

Pos logit contributions
 anything+0.071
 top+0.068
 bottom+0.063
 something+0.062
 biggest+0.060
Neg logit contributions
jo-0.158
o-0.154
os-0.146
azz-0.146
ppy-0.145
Token.
Token ID13
Feature activation-5.22e-02

Pos logit contributions
jo+0.160
o+0.158
os+0.151
ppy+0.149
azz+0.147
Neg logit contributions
 anything-0.119
 top-0.118
 bottom-0.113
 chores-0.113
 great-0.112
Token
Token ID198
Feature activation-5.76e-04

Pos logit contributions
jo+0.141
ppy+0.139
os+0.137
o+0.136
azz+0.135
Neg logit contributions
 top-0.077
 new-0.076
 something-0.076
 anything-0.073
 many-0.072
Token
Token ID198
Feature activation-5.79e-03

Pos logit contributions
jo+0.002
o+0.002
azz+0.002
ppy+0.002
os+0.002
Neg logit contributions
 top-0.001
 anything-0.001
 new-0.001
 something-0.001
 bottom-0.001
Token but
Token ID475
Feature activation-1.50e-04

Pos logit contributions
 top+0.002
 anything+0.002
 bottom+0.002
 chores+0.002
 great+0.002
Neg logit contributions
o-0.003
jo-0.003
azz-0.002
ppy-0.002
os-0.002
Token he
Token ID339
Feature activation-1.73e-02

Pos logit contributions
o+0.000
ppy+0.000
jo+0.000
azz+0.000
y+0.000
Neg logit contributions
 chores-0.000
 biggest-0.000
 anything-0.000
 trouble-0.000
 yourselves-0.000
Token didn
Token ID1422
Feature activation+4.41e-02

Pos logit contributions
jo+0.041
azz+0.040
os+0.040
o+0.037
ppy+0.036
Neg logit contributions
 top-0.034
 something-0.034
 great-0.032
 new-0.032
 good-0.032
Token't
Token ID470
Feature activation+4.85e-02

Pos logit contributions
 anything+0.078
 sand+0.077
 things+0.077
 places+0.076
 new+0.076
Neg logit contributions
o-0.104
jo-0.101
azz-0.101
os-0.099
ppy-0.096
Token have
Token ID423
Feature activation+8.44e-02

Pos logit contributions
 anything+0.088
 good+0.084
 top+0.083
 something+0.083
 new+0.082
Neg logit contributions
jo-0.124
o-0.120
os-0.118
azz-0.116
ppy-0.111
Token enough
Token ID1576
Feature activation+8.66e-02

Pos logit contributions
 top+0.099
 bottom+0.097
 biggest+0.087
 most+0.086
 chores+0.085
Neg logit contributions
o-0.249
jo-0.244
azz-0.238
ppy-0.237
os-0.233
Token money
Token ID1637
Feature activation+4.70e-03

Pos logit contributions
 top+0.124
 bottom+0.119
 anything+0.111
 biggest+0.104
 most+0.103
Neg logit contributions
o-0.239
jo-0.233
azz-0.229
os-0.222
ppy-0.220
Token to
Token ID284
Feature activation+6.31e-02

Pos logit contributions
 top+0.009
 anything+0.008
 bottom+0.008
 new+0.008
 chores+0.008
Neg logit contributions
o-0.011
jo-0.011
azz-0.011
os-0.011
ppy-0.010
Token buy
Token ID2822
Feature activation+1.00e-01

Pos logit contributions
 biggest+0.115
 most+0.109
 top+0.107
 anything+0.107
 things+0.102
Neg logit contributions
o-0.139
ppy-0.135
jo-0.134
azz-0.133
os-0.128
Token it
Token ID340
Feature activation-2.58e-02

Pos logit contributions
 bottom+0.132
 top+0.125
wered+0.121
 accomplishment+0.120
 port+0.119
Neg logit contributions
o-0.269
jo-0.251
os-0.248
azz-0.242
ppy-0.236
Token.
Token ID13
Feature activation-4.86e-02

Pos logit contributions
o+0.054
jo+0.051
ppy+0.049
os+0.049
azz+0.049
Neg logit contributions
 top-0.052
 bottom-0.051
 anything-0.050
 miles-0.049
 chores-0.049
Token was
Token ID373
Feature activation+6.55e-02

Pos logit contributions
azz+0.066
jo+0.065
os+0.064
ppy+0.062
o+0.062
Neg logit contributions
 great-0.041
 good-0.041
 top-0.040
 new-0.040
 something-0.039
Token three
Token ID1115
Feature activation+2.30e-03

Pos logit contributions
 top+0.088
 bottom+0.087
 anything+0.086
 accomplishment+0.080
 beauty+0.079
Neg logit contributions
o-0.193
ppy-0.169
jo-0.169
itty-0.167
os-0.167
Token years
Token ID812
Feature activation-1.00e-02

Pos logit contributions
 new+0.003
 things+0.003
 top+0.003
 good+0.003
 great+0.003
Neg logit contributions
azz-0.007
jo-0.007
o-0.007
ppy-0.007
os-0.006
Token old
Token ID1468
Feature activation-1.69e-02

Pos logit contributions
jo+0.035
o+0.035
azz+0.034
os+0.034
ppy+0.033
Neg logit contributions
 anything-0.008
 top-0.007
 something-0.007
 bottom-0.006
 chores-0.006
Token and
Token ID290
Feature activation-7.60e-03

Pos logit contributions
jo+0.047
os+0.046
azz+0.046
o+0.045
ppy+0.044
Neg logit contributions
 new-0.028
 things-0.027
 something-0.027
 top-0.027
 anything-0.026
Token liked
Token ID8288
Feature activation+8.35e-02

Pos logit contributions
o+0.017
jo+0.017
os+0.016
azz+0.016
ppy+0.016
Neg logit contributions
 anything-0.015
 top-0.015
 bottom-0.015
 chores-0.014
 something-0.014
Token to
Token ID284
Feature activation+6.74e-02

Pos logit contributions
 top+0.155
 anything+0.155
 bottom+0.148
 something+0.147
 new+0.146
Neg logit contributions
jo-0.198
o-0.197
azz-0.189
os-0.188
ppy-0.184
Token dance
Token ID9280
Feature activation+8.08e-03

Pos logit contributions
 anything+0.109
 top+0.108
 new+0.104
 something+0.103
 bottom+0.102
Neg logit contributions
jo-0.175
o-0.173
azz-0.167
os-0.167
ppy-0.165
Token.
Token ID13
Feature activation-4.37e-02

Pos logit contributions
 bottom+0.015
 top+0.015
 anything+0.015
 trouble+0.015
 chores+0.014
Neg logit contributions
jo-0.018
o-0.018
azz-0.018
ppy-0.017
os-0.017
Token She
Token ID1375
Feature activation-2.22e-02

Pos logit contributions
o+0.109
jo+0.109
ppy+0.105
os+0.105
azz+0.102
Neg logit contributions
 top-0.077
 new-0.076
 anything-0.075
 something-0.073
 great-0.071
Token went
Token ID1816
Feature activation+2.73e-02

Pos logit contributions
jo+0.050
o+0.049
azz+0.048
os+0.047
ppy+0.046
Neg logit contributions
 new-0.044
 great-0.044
 top-0.043
 good-0.043
 anything-0.043
Token often
Token ID1690
Feature activation+8.01e-02

Pos logit contributions
azz+0.231
jo+0.222
os+0.217
ppy+0.208
o+0.206
Neg logit contributions
 top-0.187
 new-0.184
 good-0.181
 great-0.178
 something-0.178
Token asked
Token ID1965
Feature activation+6.13e-02

Pos logit contributions
 top+0.134
 anything+0.129
 bottom+0.126
 biggest+0.125
 new+0.124
Neg logit contributions
jo-0.200
o-0.199
os-0.191
ppy-0.188
azz-0.187
Token his
Token ID465
Feature activation+3.30e-02

Pos logit contributions
 anything+0.081
 top+0.080
 something+0.076
 new+0.076
 bottom+0.074
Neg logit contributions
jo-0.179
o-0.177
azz-0.173
os-0.171
ppy-0.170
Token parents
Token ID3397
Feature activation-8.21e-02

Pos logit contributions
 anything+0.044
 top+0.043
 bottom+0.041
 something+0.040
 chores+0.039
Neg logit contributions
jo-0.095
o-0.094
azz-0.091
os-0.090
ppy-0.089
Token questions
Token ID2683
Feature activation-8.29e-03

Pos logit contributions
jo+0.214
o+0.211
azz+0.206
os+0.203
ppy+0.202
Neg logit contributions
 anything-0.133
 top-0.133
 new-0.127
 something-0.127
 bottom-0.124
Token about
Token ID546
Feature activation+1.01e-01

Pos logit contributions
jo+0.021
o+0.021
azz+0.020
ppy+0.020
os+0.020
Neg logit contributions
 anything-0.014
 top-0.014
 something-0.013
 new-0.013
 chores-0.013
Token the
Token ID262
Feature activation+7.93e-02

Pos logit contributions
 top+0.093
 bottom+0.091
 biggest+0.085
 middle+0.080
 anything+0.080
Neg logit contributions
jo-0.323
o-0.319
os-0.313
azz-0.303
ppy-0.297
Token world
Token ID995
Feature activation-3.49e-02

Pos logit contributions
 anything+0.083
 top+0.082
 bottom+0.076
 something+0.075
 chores+0.073
Neg logit contributions
jo-0.245
o-0.241
azz-0.234
os-0.232
ppy-0.228
Token around
Token ID1088
Feature activation+4.05e-02

Pos logit contributions
jo+0.091
o+0.090
azz+0.088
ppy+0.088
os+0.085
Neg logit contributions
 anything-0.056
 top-0.055
 new-0.055
 something-0.054
 things-0.053
Token him
Token ID683
Feature activation-2.86e-02

Pos logit contributions
 top+0.048
 anything+0.046
 biggest+0.044
 bottom+0.044
 chores+0.044
Neg logit contributions
jo-0.122
o-0.121
os-0.118
azz-0.116
ppy-0.114
Token.
Token ID13
Feature activation-3.79e-02

Pos logit contributions
jo+0.074
o+0.073
azz+0.073
ppy+0.071
os+0.071
Neg logit contributions
 anything-0.047
 top-0.046
 something-0.045
 new-0.045
 things-0.044
Token He
Token ID679
Feature activation-2.84e-02

Pos logit contributions
ppy+0.190
jo+0.187
azz+0.182
os+0.177
o+0.171
Neg logit contributions
 top-0.101
 new-0.098
 something-0.094
 many-0.092
 things-0.091
Token knew
Token ID2993
Feature activation+1.81e-02

Pos logit contributions
os+0.072
jo+0.070
azz+0.068
o+0.067
itty+0.066
Neg logit contributions
 new-0.051
 great-0.051
 good-0.051
 top-0.050
 something-0.050
Token he
Token ID339
Feature activation-2.25e-02

Pos logit contributions
 bottom+0.027
 top+0.027
 chores+0.025
 biggest+0.025
 anything+0.024
Neg logit contributions
o-0.050
jo-0.047
azz-0.046
os-0.046
ppy-0.045
Token should
Token ID815
Feature activation+4.77e-02

Pos logit contributions
azz+0.053
jo+0.052
os+0.051
round+0.048
ppy+0.047
Neg logit contributions
 great-0.040
 something-0.039
 good-0.039
 amazing-0.039
 anything-0.039
Token have
Token ID423
Feature activation+6.46e-02

Pos logit contributions
 anything+0.069
 biggest+0.066
 top+0.066
 new+0.065
 something+0.064
Neg logit contributions
o-0.128
jo-0.128
ppy-0.124
azz-0.122
os-0.119
Token stayed
Token ID9658
Feature activation+8.71e-02

Pos logit contributions
 top+0.162
 anything+0.159
 bottom+0.156
 chores+0.153
 new+0.152
Neg logit contributions
o-0.115
jo-0.109
os-0.104
ppy-0.104
azz-0.103
Token with
Token ID351
Feature activation+7.76e-02

Pos logit contributions
 bottom+0.167
 top+0.158
wered+0.137
 middle+0.131
 biggest+0.129
Neg logit contributions
o-0.207
ppy-0.188
jo-0.178
azz-0.170
y-0.170
Token the
Token ID262
Feature activation+4.87e-02

Pos logit contributions
 port+0.093
 chores+0.092
 bottom+0.091
 biggest+0.083
 valley+0.082
Neg logit contributions
o-0.206
os-0.187
jo-0.185
ppy-0.183
itty-0.170
Token egg
Token ID5935
Feature activation-1.05e-01

Pos logit contributions
 anything+0.096
 top+0.089
 chores+0.088
 something+0.088
 bottom+0.086
Neg logit contributions
o-0.103
jo-0.102
ppy-0.098
itty-0.097
azz-0.095
Token like
Token ID588
Feature activation+6.50e-02

Pos logit contributions
jo+0.274
o+0.271
azz+0.263
os+0.261
ppy+0.258
Neg logit contributions
 anything-0.169
 top-0.169
 new-0.161
 something-0.160
 bottom-0.158
Token his
Token ID465
Feature activation+2.13e-02

Pos logit contributions
 bottom+0.074
 top+0.073
 biggest+0.072
 chores+0.068
 most+0.067
Neg logit contributions
o-0.187
os-0.179
jo-0.178
azz-0.173
ppy-0.172
Token home
Token ID1363
Feature activation-1.19e-02

Pos logit contributions
 anything+0.073
 top+0.072
 bottom+0.069
 something+0.068
 biggest+0.068
Neg logit contributions
o-0.101
jo-0.099
azz-0.096
os-0.094
ppy-0.093
Token.
Token ID13
Feature activation-4.79e-02

Pos logit contributions
o+0.027
jo+0.026
os+0.025
azz+0.024
ppy+0.024
Neg logit contributions
 anything-0.024
 top-0.023
 bottom-0.023
 biggest-0.023
 great-0.022
Token
Token ID198
Feature activation-2.39e-04

Pos logit contributions
jo+0.135
ppy+0.134
os+0.131
o+0.131
azz+0.129
Neg logit contributions
 top-0.071
 new-0.071
 something-0.068
 many-0.065
 anything-0.065
Token
Token ID198
Feature activation-4.53e-03

Pos logit contributions
jo+0.001
o+0.001
ppy+0.001
azz+0.001
os+0.001
Neg logit contributions
 top-0.000
 new-0.000
 anything-0.000
 something-0.000
 great-0.000
TokenThe
Token ID464
Feature activation+1.62e-02

Pos logit contributions
jo+0.012
o+0.012
os+0.012
ppy+0.011
azz+0.011
Neg logit contributions
 top-0.008
 new-0.007
 something-0.007
 great-0.007
 anything-0.007
Token next
Token ID1306
Feature activation+9.54e-03

Pos logit contributions
 anything+0.021
 something+0.018
 top+0.017
 bottom+0.016
 chores+0.016
Neg logit contributions
o-0.048
jo-0.047
azz-0.046
ppy-0.045
os-0.044
Token day
Token ID1110
Feature activation-2.87e-02

Pos logit contributions
 good+0.008
 new+0.008
 top+0.008
 things+0.007
 great+0.007
Neg logit contributions
jo-0.034
ppy-0.033
azz-0.033
os-0.033
o-0.032
Token,
Token ID11
Feature activation-9.22e-03

Pos logit contributions
jo+0.079
o+0.078
azz+0.076
os+0.075
ppy+0.074
Neg logit contributions
 anything-0.043
 top-0.043
 new-0.041
 something-0.041
 bottom-0.040
Token Lily
Token ID20037
Feature activation-1.17e-01

Pos logit contributions
o+0.014
azz+0.013
ppy+0.013
y+0.012
jo+0.012
Neg logit contributions
 bottom-0.022
 biggest-0.022
 beauty-0.022
 shore-0.021
 top-0.021
Token went
Token ID1816
Feature activation+2.84e-02

Pos logit contributions
jo+0.249
azz+0.237
o+0.237
os+0.236
ppy+0.228
Neg logit contributions
 something-0.241
 new-0.241
 top-0.240
 anything-0.237
 things-0.233
Token to
Token ID284
Feature activation+4.60e-02

Pos logit contributions
 anything+0.053
 top+0.053
 bottom+0.051
 great+0.049
 something+0.049
Neg logit contributions
o-0.068
jo-0.066
azz-0.064
os-0.063
ppy-0.062
Token was
Token ID373
Feature activation+6.01e-02

Pos logit contributions
jo+0.049
azz+0.048
os+0.044
ppy+0.042
o+0.042
Neg logit contributions
 top-0.037
 good-0.037
 great-0.037
 new-0.036
 something-0.036
Token very
Token ID845
Feature activation+5.02e-02

Pos logit contributions
 top+0.087
 bottom+0.085
 anything+0.085
 chores+0.082
 places+0.082
Neg logit contributions
o-0.171
ppy-0.157
jo-0.152
os-0.152
azz-0.147
Token strong
Token ID1913
Feature activation+1.47e-02

Pos logit contributions
 anything+0.069
 top+0.065
 bottom+0.061
 chores+0.059
 something+0.058
Neg logit contributions
o-0.151
jo-0.140
ppy-0.139
os-0.139
azz-0.136
Token.
Token ID13
Feature activation-6.02e-02

Pos logit contributions
 anything+0.025
 top+0.025
 new+0.024
 something+0.024
 things+0.023
Neg logit contributions
jo-0.038
o-0.037
azz-0.036
os-0.036
ppy-0.035
Token They
Token ID1119
Feature activation-1.53e-02

Pos logit contributions
jo+0.155
ppy+0.148
os+0.148
o+0.148
azz+0.145
Neg logit contributions
 top-0.106
 new-0.102
 anything-0.101
 something-0.100
 good-0.097
Token worked
Token ID3111
Feature activation+4.85e-02

Pos logit contributions
jo+0.041
azz+0.039
o+0.036
anna+0.035
anne+0.035
Neg logit contributions
 top-0.027
 many-0.025
 the-0.025
 good-0.025
 great-0.024
Token together
Token ID1978
Feature activation+1.22e-02

Pos logit contributions
 anything+0.086
 bottom+0.084
 top+0.083
 biggest+0.081
 something+0.079
Neg logit contributions
o-0.119
jo-0.115
os-0.111
ppy-0.109
azz-0.109
Token to
Token ID284
Feature activation+5.50e-02

Pos logit contributions
 anything+0.022
 top+0.021
 new+0.021
 something+0.020
 bottom+0.020
Neg logit contributions
jo-0.030
o-0.030
azz-0.029
os-0.029
ppy-0.028
Token make
Token ID787
Feature activation+7.34e-02

Pos logit contributions
 anything+0.088
 biggest+0.086
 things+0.086
 new+0.085
 most+0.084
Neg logit contributions
o-0.136
ppy-0.131
jo-0.130
os-0.125
azz-0.121
Token a
Token ID257
Feature activation+5.33e-02

Pos logit contributions
 bottom+0.066
 top+0.060
 anything+0.059
 middle+0.058
 End+0.056
Neg logit contributions
o-0.232
jo-0.222
os-0.221
azz-0.209
ppy-0.206
Token plan
Token ID1410
Feature activation-6.59e-03

Pos logit contributions
 anything+0.054
 something+0.046
 top+0.045
 chores+0.043
 things+0.042
Neg logit contributions
o-0.169
jo-0.167
azz-0.162
os-0.159
ppy-0.158
Token afternoon
Token ID6672
Feature activation-1.99e-02

Pos logit contributions
 top+0.061
 new+0.058
 anything+0.057
 bottom+0.055
 great+0.055
Neg logit contributions
jo-0.175
o-0.173
azz-0.171
ppy-0.170
os-0.170
Token.
Token ID13
Feature activation-5.65e-02

Pos logit contributions
jo+0.051
o+0.051
azz+0.049
os+0.049
ppy+0.049
Neg logit contributions
 anything-0.033
 top-0.033
 new-0.031
 something-0.031
 bottom-0.031
Token They
Token ID1119
Feature activation-7.41e-03

Pos logit contributions
ppy+0.155
jo+0.155
o+0.149
azz+0.148
os+0.146
Neg logit contributions
 top-0.083
 anything-0.079
 new-0.078
 many-0.077
 something-0.075
Token laughed
Token ID13818
Feature activation+7.77e-03

Pos logit contributions
jo+0.018
azz+0.017
o+0.017
os+0.016
ppy+0.016
Neg logit contributions
 top-0.014
 anything-0.014
 great-0.013
 good-0.013
 new-0.013
Token and
Token ID290
Feature activation+6.39e-03

Pos logit contributions
 anything+0.015
 top+0.015
 chores+0.015
 bottom+0.015
 something+0.014
Neg logit contributions
jo-0.017
o-0.017
azz-0.017
os-0.016
ppy-0.016
Token smiled
Token ID13541
Feature activation+1.36e-02

Pos logit contributions
 good+0.014
 great+0.013
 many+0.013
 top+0.013
 the+0.013
Neg logit contributions
jo-0.014
azz-0.013
os-0.013
o-0.013
ppy-0.012
Token as
Token ID355
Feature activation+4.33e-02

Pos logit contributions
 anything+0.023
 top+0.022
 chores+0.022
 bottom+0.022
 trouble+0.021
Neg logit contributions
o-0.034
jo-0.034
azz-0.033
os-0.032
ppy-0.032
Token they
Token ID484
Feature activation-1.71e-02

Pos logit contributions
 top+0.061
 bottom+0.059
 anything+0.059
 chores+0.057
 something+0.056
Neg logit contributions
jo-0.120
o-0.119
azz-0.116
os-0.116
ppy-0.113
Token shared
Token ID4888
Feature activation+6.15e-02

Pos logit contributions
jo+0.041
azz+0.039
o+0.037
os+0.035
anne+0.035
Neg logit contributions
 top-0.032
 great-0.032
 many-0.031
 good-0.031
 sand-0.030
Token their
Token ID511
Feature activation+5.29e-02

Pos logit contributions
 top+0.058
 bottom+0.057
 anything+0.055
 biggest+0.054
 chores+0.052
Neg logit contributions
o-0.194
jo-0.193
os-0.188
azz-0.187
ppy-0.182
Token yellow
Token ID7872
Feature activation+7.25e-04

Pos logit contributions
 anything+0.057
 top+0.055
 something+0.051
 bottom+0.051
 chores+0.048
Neg logit contributions
jo-0.167
o-0.165
azz-0.160
os-0.160
ppy-0.157
Token They
Token ID1119
Feature activation-1.82e-02

Pos logit contributions
jo+0.148
o+0.144
azz+0.141
ppy+0.141
os+0.140
Neg logit contributions
 top-0.108
 anything-0.104
 new-0.103
 something-0.102
 things-0.102
Token didn
Token ID1422
Feature activation+2.73e-02

Pos logit contributions
jo+0.045
azz+0.044
os+0.041
o+0.041
itty+0.040
Neg logit contributions
 many-0.034
 top-0.034
 good-0.033
 something-0.033
 great-0.033
Token't
Token ID470
Feature activation+4.20e-02

Pos logit contributions
 anything+0.047
 new+0.045
 things+0.045
 sand+0.045
 biggest+0.044
Neg logit contributions
o-0.068
jo-0.066
azz-0.065
os-0.064
ppy-0.062
Token know
Token ID760
Feature activation+4.24e-02

Pos logit contributions
 good+0.071
 anything+0.069
 great+0.067
 something+0.066
 new+0.066
Neg logit contributions
jo-0.112
os-0.107
azz-0.105
o-0.103
itty-0.100
Token what
Token ID644
Feature activation+4.16e-02

Pos logit contributions
 top+0.063
 bottom+0.060
 anything+0.058
 chores+0.058
 great+0.057
Neg logit contributions
o-0.116
jo-0.114
os-0.112
azz-0.110
ppy-0.109
Token to
Token ID284
Feature activation+6.51e-02

Pos logit contributions
 anything+0.068
 new+0.068
 something+0.067
 good+0.064
 top+0.064
Neg logit contributions
jo-0.111
azz-0.108
ppy-0.106
o-0.105
os-0.104
Token do
Token ID466
Feature activation+7.01e-02

Pos logit contributions
 anything+0.094
 top+0.093
 new+0.087
 something+0.087
 biggest+0.086
Neg logit contributions
o-0.181
jo-0.178
os-0.172
azz-0.172
ppy-0.172
Token.
Token ID13
Feature activation-5.80e-02

Pos logit contributions
 top+0.150
 biggest+0.145
 bottom+0.144
 anything+0.142
 sand+0.139
Neg logit contributions
o-0.142
jo-0.139
os-0.136
ppy-0.133
azz-0.130
Token But
Token ID887
Feature activation-2.07e-02

Pos logit contributions
jo+0.143
o+0.138
azz+0.135
anne+0.135
ppy+0.135
Neg logit contributions
 top-0.108
 anything-0.103
 something-0.102
 many-0.101
 things-0.101
Token then
Token ID788
Feature activation+8.17e-03

Pos logit contributions
jo+0.060
o+0.059
azz+0.057
os+0.057
ppy+0.057
Neg logit contributions
 anything-0.028
 top-0.028
 new-0.026
 something-0.026
 bottom-0.026
Token,
Token ID11
Feature activation-1.27e-02

Pos logit contributions
 top+0.012
 anything+0.012
 bottom+0.012
 new+0.012
 chores+0.012
Neg logit contributions
o-0.022
jo-0.022
azz-0.021
os-0.021
ppy-0.021
Token them
Token ID606
Feature activation-5.27e-02

Pos logit contributions
 anything+0.056
 top+0.056
 bottom+0.051
 new+0.051
 something+0.050
Neg logit contributions
jo-0.191
o-0.191
azz-0.186
os-0.185
ppy-0.183
Token.
Token ID13
Feature activation-6.91e-02

Pos logit contributions
jo+0.151
azz+0.146
o+0.144
ppy+0.143
os+0.138
Neg logit contributions
 top-0.077
 anything-0.075
 something-0.073
 new-0.072
 things-0.071
Token He
Token ID679
Feature activation-2.16e-02

Pos logit contributions
jo+0.191
ppy+0.187
o+0.182
os+0.180
azz+0.176
Neg logit contributions
 new-0.109
 something-0.103
 anything-0.101
 trouble-0.101
 top-0.100
Token was
Token ID373
Feature activation+5.26e-02

Pos logit contributions
jo+0.052
ppy+0.050
azz+0.050
os+0.048
anna+0.048
Neg logit contributions
 new-0.039
 top-0.038
 something-0.038
 great-0.037
 big-0.036
Token wearing
Token ID5762
Feature activation+6.93e-02

Pos logit contributions
 top+0.074
 anything+0.072
 bottom+0.071
 chores+0.071
 biggest+0.068
Neg logit contributions
o-0.151
jo-0.145
azz-0.141
os-0.140
ppy-0.137
Token a
Token ID257
Feature activation+6.69e-02

Pos logit contributions
 bottom+0.059
 top+0.053
 chores+0.051
 anything+0.050
 accomplishment+0.050
Neg logit contributions
jo-0.220
o-0.218
azz-0.216
os-0.212
ppy-0.204
Token suit
Token ID6050
Feature activation-8.40e-02

Pos logit contributions
 anything+0.101
 top+0.097
 chores+0.092
 bottom+0.092
 something+0.091
Neg logit contributions
jo-0.182
o-0.180
azz-0.176
ppy-0.170
os-0.170
Token and
Token ID290
Feature activation+7.42e-04

Pos logit contributions
jo+0.213
o+0.210
azz+0.204
ppy+0.203
os+0.203
Neg logit contributions
 top-0.145
 anything-0.144
 something-0.139
 new-0.137
 bottom-0.135
Token sunglasses
Token ID36251
Feature activation-7.46e-02

Pos logit contributions
 top+0.001
 anything+0.001
 bottom+0.001
 chores+0.001
 new+0.001
Neg logit contributions
jo-0.003
o-0.003
azz-0.002
os-0.002
ppy-0.002
Token.
Token ID13
Feature activation-7.12e-02

Pos logit contributions
jo+0.203
o+0.197
azz+0.193
ppy+0.191
os+0.189
Neg logit contributions
 top-0.117
 anything-0.116
 something-0.112
 new-0.111
 bottom-0.109
Token He
Token ID679
Feature activation-2.22e-02

Pos logit contributions
jo+0.172
o+0.168
os+0.162
ppy+0.162
azz+0.161
Neg logit contributions
 anything-0.133
 top-0.132
 new-0.130
 something-0.129
 things-0.125
Token to
Token ID284
Feature activation+9.09e-02

Pos logit contributions
 top+0.041
 chores+0.041
 anything+0.041
 something+0.040
 great+0.039
Neg logit contributions
o-0.060
jo-0.059
os-0.057
azz-0.056
ppy-0.055
Token surrender
Token ID16908
Feature activation+6.23e-02

Pos logit contributions
 biggest+0.129
 anything+0.128
 top+0.128
 most+0.123
 bottom+0.123
Neg logit contributions
o-0.235
jo-0.233
os-0.223
ppy-0.221
azz-0.218
Token and
Token ID290
Feature activation+2.82e-02

Pos logit contributions
 anything+0.100
 top+0.100
 new+0.095
 something+0.095
 bottom+0.093
Neg logit contributions
jo-0.163
o-0.162
azz-0.157
os-0.156
ppy-0.154
Token stay
Token ID2652
Feature activation+9.64e-02

Pos logit contributions
 top+0.045
 anything+0.045
 bottom+0.043
 new+0.043
 biggest+0.042
Neg logit contributions
o-0.072
jo-0.072
os-0.069
azz-0.068
ppy-0.068
Token still
Token ID991
Feature activation+7.47e-02

Pos logit contributions
 top+0.159
 bottom+0.158
 anything+0.155
 chores+0.151
 biggest+0.147
Neg logit contributions
o-0.239
jo-0.225
ppy-0.221
y-0.214
os-0.208
Token.
Token ID13
Feature activation-4.06e-02

Pos logit contributions
 top+0.136
 anything+0.135
 biggest+0.133
 bottom+0.130
 chores+0.129
Neg logit contributions
o-0.168
jo-0.166
ppy-0.160
os-0.155
y-0.151
Token The
Token ID383
Feature activation+3.71e-02

Pos logit contributions
ppy+0.108
jo+0.108
os+0.103
azz+0.103
o+0.103
Neg logit contributions
 top-0.061
 something-0.058
 anything-0.057
 new-0.056
 many-0.055
Token bear
Token ID6842
Feature activation-1.43e-01

Pos logit contributions
 anything+0.052
 top+0.049
 something+0.047
 new+0.047
 chores+0.046
Neg logit contributions
jo-0.104
o-0.103
azz-0.100
ppy-0.099
os-0.099
Token sniff
Token ID26300
Feature activation+1.29e-02

Pos logit contributions
jo+0.326
azz+0.318
o+0.317
ppy+0.308
os+0.305
Neg logit contributions
 top-0.282
 anything-0.275
 new-0.270
 something-0.269
 bottom-0.264
Tokened
Token ID276
Feature activation+4.10e-02

Pos logit contributions
 anything+0.034
 top+0.034
 something+0.033
 new+0.033
 things+0.033
Neg logit contributions
jo-0.020
o-0.020
azz-0.020
ppy-0.018
os-0.018
Token her
Token ID607
Feature activation+2.31e-02

Pos logit contributions
 anything+0.061
 top+0.061
 chores+0.058
 bottom+0.058
 new+0.057
Neg logit contributions
jo-0.110
o-0.109
os-0.105
azz-0.105
ppy-0.104
Token that
Token ID326
Feature activation+2.12e-02

Pos logit contributions
 bottom+0.068
 top+0.066
 anything+0.063
 biggest+0.063
 miles+0.060
Neg logit contributions
o-0.127
jo-0.123
azz-0.120
os-0.117
ppy-0.113
Token fighting
Token ID4330
Feature activation+4.63e-03

Pos logit contributions
 top+0.031
 bottom+0.030
 biggest+0.029
 anything+0.028
 middle+0.028
Neg logit contributions
o-0.057
os-0.054
jo-0.053
azz-0.051
ppy-0.049
Token does
Token ID857
Feature activation+7.44e-02

Pos logit contributions
 top+0.008
 anything+0.008
 bottom+0.008
 chores+0.008
 something+0.008
Neg logit contributions
o-0.011
jo-0.011
os-0.011
azz-0.011
ppy-0.010
Token not
Token ID407
Feature activation+6.17e-02

Pos logit contributions
 middle+0.096
 sand+0.090
 biggest+0.090
 shore+0.089
 avail+0.086
Neg logit contributions
o-0.179
ppy-0.172
azz-0.171
aring-0.168
jo-0.167
Token help
Token ID1037
Feature activation+2.63e-02

Pos logit contributions
 top+0.100
 bottom+0.093
 biggest+0.092
 middle+0.090
 anything+0.090
Neg logit contributions
o-0.159
ppy-0.149
azz-0.149
jo-0.149
os-0.144
Token anyone
Token ID2687
Feature activation-6.20e-02

Pos logit contributions
 top+0.033
 anything+0.031
 bottom+0.030
 chores+0.029
 something+0.029
Neg logit contributions
jo-0.078
o-0.078
azz-0.076
os-0.076
ppy-0.075
Token.
Token ID13
Feature activation-6.19e-02

Pos logit contributions
jo+0.153
o+0.149
azz+0.147
os+0.143
ppy+0.142
Neg logit contributions
 anything-0.114
 top-0.109
 new-0.108
 something-0.107
 things-0.105
Token<|endoftext|>
Token ID50256
Feature activation-4.82e-02

Pos logit contributions
ppy+0.180
jo+0.175
azz+0.168
os+0.163
o+0.161
Neg logit contributions
 many-0.085
 new-0.084
 top-0.084
 something-0.084
 anything-0.078
TokenOne
Token ID3198
Feature activation+1.60e-02

Pos logit contributions
jo+0.116
o+0.116
azz+0.111
os+0.110
ppy+0.108
Neg logit contributions
 anything-0.087
 top-0.086
 new-0.083
 something-0.083
 chores-0.083
Token day
Token ID1110
Feature activation-9.79e-04

Pos logit contributions
 anything+0.021
 top+0.020
 bottom+0.019
 something+0.019
 chores+0.018
Neg logit contributions
o-0.047
jo-0.047
os-0.045
azz-0.045
ppy-0.044
Token,
Token ID11
Feature activation-3.96e-03

Pos logit contributions
jo+0.003
o+0.003
azz+0.003
os+0.003
ppy+0.003
Neg logit contributions
 anything-0.001
 top-0.001
 new-0.001
 something-0.001
 bottom-0.001
Token and
Token ID290
Feature activation+8.51e-03

Pos logit contributions
jo+0.309
azz+0.299
o+0.294
os+0.288
ppy+0.285
Neg logit contributions
 anything-0.212
 top-0.211
 good-0.211
 new-0.209
 something-0.208
Token Ben
Token ID3932
Feature activation-1.09e-01

Pos logit contributions
 anything+0.011
 top+0.011
 something+0.011
 new+0.010
 chores+0.010
Neg logit contributions
jo-0.025
o-0.025
azz-0.024
os-0.024
ppy-0.024
Token are
Token ID389
Feature activation+7.86e-02

Pos logit contributions
jo+0.271
o+0.270
azz+0.261
os+0.259
ppy+0.257
Neg logit contributions
 anything-0.187
 top-0.187
 new-0.179
 something-0.178
 bottom-0.176
Token twins
Token ID20345
Feature activation-5.36e-02

Pos logit contributions
 anything+0.122
 top+0.119
 bottom+0.117
 beauty+0.106
 something+0.104
Neg logit contributions
o-0.227
jo-0.207
os-0.207
ppy-0.204
azz-0.202
Token.
Token ID13
Feature activation-3.15e-02

Pos logit contributions
jo+0.128
o+0.128
azz+0.122
os+0.122
ppy+0.121
Neg logit contributions
 top-0.097
 anything-0.097
 something-0.093
 new-0.092
 bottom-0.092
Token They
Token ID1119
Feature activation-1.90e-02

Pos logit contributions
jo+0.079
azz+0.077
o+0.076
os+0.076
ppy+0.076
Neg logit contributions
 something-0.052
 top-0.052
 anything-0.051
 good-0.051
 new-0.051
Token like
Token ID588
Feature activation+6.63e-02

Pos logit contributions
o+0.047
jo+0.047
ppy+0.046
azz+0.045
os+0.045
Neg logit contributions
 top-0.032
 anything-0.032
 new-0.030
 something-0.030
 bottom-0.030
Token to
Token ID284
Feature activation+6.72e-02

Pos logit contributions
 anything+0.115
 top+0.113
 new+0.112
 something+0.110
 chores+0.108
Neg logit contributions
jo-0.168
o-0.163
azz-0.161
ppy-0.159
os-0.158
Token play
Token ID711
Feature activation+6.03e-02

Pos logit contributions
 top+0.105
 anything+0.104
 something+0.099
 new+0.099
 chores+0.099
Neg logit contributions
jo-0.182
o-0.179
azz-0.176
os-0.173
ppy-0.172
Token in
Token ID287
Feature activation+6.59e-02

Pos logit contributions
 anything+0.098
 bottom+0.096
 top+0.096
 biggest+0.093
 accomplishment+0.093
Neg logit contributions
o-0.149
jo-0.141
ppy-0.141
os-0.140
azz-0.139
Token the
Token ID262
Feature activation+7.00e-02

Pos logit contributions
rier+0.096
 biggest+0.086
 accomplishment+0.076
 best+0.076
 most+0.076
Neg logit contributions
o-0.179
os-0.167
jo-0.167
y-0.155
azz-0.154
Token in
Token ID287
Feature activation+7.13e-02

Pos logit contributions
 top+0.031
 anything+0.030
 chores+0.030
 bottom+0.030
 new+0.029
Neg logit contributions
o-0.044
jo-0.043
os-0.042
ppy-0.042
azz-0.042
Token finding
Token ID4917
Feature activation+8.28e-02

Pos logit contributions
 biggest+0.117
 top+0.108
 bottom+0.104
 most+0.102
 anything+0.097
Neg logit contributions
o-0.170
ppy-0.164
os-0.163
jo-0.163
bug-0.156
Token the
Token ID262
Feature activation+7.25e-02

Pos logit contributions
 bottom+0.094
 chores+0.085
wered+0.083
 port+0.081
 top+0.081
Neg logit contributions
o-0.228
jo-0.227
os-0.225
ppy-0.223
azz-0.219
Token perfect
Token ID2818
Feature activation+4.90e-02

Pos logit contributions
 anything+0.142
 yourselves+0.138
 miles+0.137
wered+0.128
 top+0.125
Neg logit contributions
jo-0.145
o-0.142
ppy-0.136
y-0.131
azz-0.130
Token fork
Token ID15563
Feature activation-1.12e-01

Pos logit contributions
 anything+0.100
 chores+0.095
 top+0.094
 yourselves+0.092
 miles+0.092
Neg logit contributions
jo-0.102
o-0.099
ppy-0.094
y-0.092
os-0.089
Token!
Token ID0
Feature activation-5.15e-02

Pos logit contributions
jo+0.279
o+0.276
azz+0.268
os+0.265
ppy+0.262
Neg logit contributions
 anything-0.193
 top-0.192
 new-0.184
 something-0.184
 bottom-0.181
Token She
Token ID1375
Feature activation-2.04e-02

Pos logit contributions
jo+0.149
ppy+0.147
os+0.141
o+0.141
azz+0.139
Neg logit contributions
 top-0.071
 new-0.067
 many-0.067
 something-0.066
 anything-0.064
Token used
Token ID973
Feature activation+6.28e-02

Pos logit contributions
jo+0.044
o+0.043
azz+0.042
os+0.041
ppy+0.041
Neg logit contributions
 top-0.043
 anything-0.042
 new-0.041
 something-0.041
 great-0.040
Token the
Token ID262
Feature activation+5.23e-02

Pos logit contributions
 yourselves+0.085
 bottom+0.081
 biggest+0.077
 port+0.074
 chores+0.074
Neg logit contributions
o-0.162
os-0.158
jo-0.158
ppy-0.156
azz-0.155
Token long
Token ID890
Feature activation+1.33e-02

Pos logit contributions
 anything+0.114
 yourselves+0.113
 miles+0.110
 places+0.106
wered+0.105
Neg logit contributions
jo-0.092
o-0.090
ppy-0.090
azz-0.083
y-0.082
Token fork
Token ID15563
Feature activation-1.17e-01

Pos logit contributions
 anything+0.024
 top+0.024
 new+0.023
 something+0.023
 chores+0.023
Neg logit contributions
jo-0.032
o-0.031
ppy-0.030
azz-0.030
os-0.030
Token.
Token ID13
Feature activation-6.14e-02

Pos logit contributions
jo+0.314
azz+0.299
o+0.295
ppy+0.293
os+0.292
Neg logit contributions
 top-0.158
 anything-0.153
 something-0.147
 things-0.143
 new-0.142
Token He
Token ID679
Feature activation-2.16e-02

Pos logit contributions
jo+0.146
o+0.144
azz+0.139
os+0.138
ppy+0.137
Neg logit contributions
 anything-0.115
 top-0.115
 new-0.110
 something-0.110
 bottom-0.109
Token ran
Token ID4966
Feature activation+4.23e-03

Pos logit contributions
jo+0.049
azz+0.047
ppy+0.045
o+0.044
os+0.044
Neg logit contributions
 top-0.043
 something-0.042
 new-0.042
 many-0.041
 good-0.041
Token away
Token ID1497
Feature activation+3.29e-02

Pos logit contributions
 anything+0.008
 chores+0.008
 biggest+0.008
 top+0.008
 bottom+0.008
Neg logit contributions
o-0.010
jo-0.010
azz-0.009
os-0.009
ppy-0.009
Token fast
Token ID3049
Feature activation+1.35e-02

Pos logit contributions
 chores+0.070
 biggest+0.069
 anything+0.069
 great+0.065
 bottom+0.065
Neg logit contributions
o-0.067
jo-0.063
os-0.061
azz-0.060
y-0.059
Token.
Token ID13
Feature activation-5.77e-02

Pos logit contributions
 top+0.027
 anything+0.027
 chores+0.026
 biggest+0.026
 new+0.026
Neg logit contributions
jo-0.030
o-0.029
azz-0.028
os-0.028
ppy-0.027
Token
Token ID198
Feature activation-6.94e-03

Pos logit contributions
jo+0.142
o+0.141
azz+0.137
os+0.135
ppy+0.134
Neg logit contributions
 anything-0.102
 top-0.102
 new-0.098
 something-0.098
 bottom-0.096
Token
Token ID198
Feature activation-1.32e-02

Pos logit contributions
jo+0.018
o+0.017
azz+0.017
ppy+0.017
os+0.017
Neg logit contributions
 top-0.012
 anything-0.012
 new-0.011
 something-0.011
 bottom-0.011
TokenTom
Token ID13787
Feature activation-1.29e-01

Pos logit contributions
jo+0.029
o+0.029
azz+0.028
os+0.028
ppy+0.028
Neg logit contributions
 top-0.027
 anything-0.027
 new-0.026
 something-0.026
 bottom-0.025
Token and
Token ID290
Feature activation-7.28e-03

Pos logit contributions
jo+0.363
azz+0.359
anne+0.340
ppy+0.339
ksh+0.339
Neg logit contributions
 the-0.162
 new-0.148
 top-0.146
 many-0.145
 big-0.144
Token Anna
Token ID11735
Feature activation-1.22e-01

Pos logit contributions
jo+0.018
o+0.018
azz+0.017
os+0.017
ppy+0.017
Neg logit contributions
 anything-0.013
 top-0.013
 new-0.013
 something-0.013
 good-0.012
Token Ben
Token ID3932
Feature activation-1.27e-01

Pos logit contributions
jo+0.131
o+0.130
azz+0.127
os+0.125
ppy+0.123
Neg logit contributions
 anything-0.098
 top-0.097
 bottom-0.093
 something-0.092
 new-0.092
Token said
Token ID531
Feature activation-2.42e-02

Pos logit contributions
jo+0.297
itty+0.293
os+0.286
o+0.275
azz+0.267
Neg logit contributions
 something-0.247
 a-0.235
 an-0.232
 the-0.229
 new-0.227
Token.
Token ID13
Feature activation-7.67e-02

Pos logit contributions
jo+0.067
o+0.066
azz+0.065
os+0.064
ppy+0.063
Neg logit contributions
 anything-0.036
 top-0.036
 new-0.036
 something-0.035
 good-0.034
Token
Token ID198
Feature activation-2.76e-02

Pos logit contributions
jo+0.195
o+0.192
azz+0.186
os+0.185
ppy+0.184
Neg logit contributions
 top-0.132
 anything-0.131
 new-0.127
 something-0.126
 bottom-0.123
Token
Token ID198
Feature activation-2.98e-02

Pos logit contributions
o+0.065
jo+0.065
azz+0.062
os+0.061
ppy+0.060
Neg logit contributions
 anything-0.053
 top-0.052
 bottom-0.050
 chores-0.050
 something-0.049
TokenMom
Token ID29252
Feature activation-1.09e-01

Pos logit contributions
o+0.064
jo+0.062
azz+0.062
ppy+0.061
os+0.058
Neg logit contributions
 anything-0.065
 top-0.060
 chores-0.060
 bottom-0.060
 biggest-0.059
Token and
Token ID290
Feature activation-2.00e-02

Pos logit contributions
jo+0.345
anna+0.319
azz+0.308
anne+0.299
ppy+0.299
Neg logit contributions
 the-0.111
 top-0.111
 a-0.111
 big-0.106
 something-0.103
Token dad
Token ID9955
Feature activation-1.11e-01

Pos logit contributions
jo+0.055
o+0.055
azz+0.053
os+0.053
ppy+0.052
Neg logit contributions
 anything-0.030
 top-0.030
 new-0.029
 something-0.029
 bottom-0.028
Token looked
Token ID3114
Feature activation+5.21e-02

Pos logit contributions
jo+0.260
azz+0.235
o+0.234
os+0.220
itty+0.218
Neg logit contributions
 top-0.234
 something-0.231
 bottom-0.224
 many-0.222
 sand-0.221
Token at
Token ID379
Feature activation+6.60e-02

Pos logit contributions
 anything+0.089
 top+0.089
 new+0.087
 something+0.086
 things+0.083
Neg logit contributions
jo-0.133
o-0.131
azz-0.127
os-0.126
ppy-0.125
Token the
Token ID262
Feature activation+4.28e-02

Pos logit contributions
 biggest+0.067
 most+0.061
 bottom+0.059
 anything+0.059
 top+0.058
Neg logit contributions
o-0.198
jo-0.198
azz-0.193
ppy-0.192
os-0.192
Token even
Token ID772
Feature activation+3.06e-02

Pos logit contributions
 bottom+0.098
 accomplishment+0.097
 top+0.096
 yourselves+0.096
 chores+0.094
Neg logit contributions
o-0.254
ppy-0.237
y-0.229
bug-0.223
os-0.222
Token more
Token ID517
Feature activation+7.05e-02

Pos logit contributions
 top+0.035
 bottom+0.033
 chores+0.032
 anything+0.030
 yourselves+0.029
Neg logit contributions
o-0.092
ppy-0.087
jo-0.086
os-0.085
y-0.085
Token spicy
Token ID26880
Feature activation-9.51e-03

Pos logit contributions
 anything+0.074
 top+0.073
 bottom+0.068
 chores+0.064
 something+0.062
Neg logit contributions
jo-0.224
o-0.222
os-0.214
azz-0.212
anne-0.207
Token and
Token ID290
Feature activation+2.24e-02

Pos logit contributions
jo+0.027
o+0.027
os+0.026
azz+0.026
ppy+0.025
Neg logit contributions
 anything-0.013
 top-0.013
 new-0.012
 something-0.012
 bottom-0.012
Token the
Token ID262
Feature activation+4.94e-02

Pos logit contributions
 top+0.027
 bottom+0.026
 chores+0.026
 anything+0.025
 port+0.023
Neg logit contributions
o-0.065
jo-0.061
os-0.061
ppy-0.058
y-0.058
Token family
Token ID1641
Feature activation-1.11e-01

Pos logit contributions
 anything+0.055
 top+0.051
 something+0.049
 chores+0.047
 bottom+0.047
Neg logit contributions
jo-0.150
o-0.148
os-0.142
azz-0.142
ppy-0.139
Token felt
Token ID2936
Feature activation+8.22e-02

Pos logit contributions
jo+0.267
azz+0.266
ppy+0.259
o+0.258
os+0.244
Neg logit contributions
 top-0.202
 new-0.200
 anything-0.199
 something-0.198
 things-0.194
Token their
Token ID511
Feature activation+3.98e-02

Pos logit contributions
 top+0.121
 avail+0.121
 places+0.112
 yourselves+0.112
 chores+0.107
Neg logit contributions
o-0.185
ppy-0.179
bug-0.178
os-0.175
y-0.172
Token mouths
Token ID28552
Feature activation-2.62e-02

Pos logit contributions
 anything+0.054
 top+0.051
 something+0.049
 chores+0.047
 bottom+0.047
Neg logit contributions
jo-0.111
o-0.109
os-0.105
ppy-0.104
anne-0.101
Token begin
Token ID2221
Feature activation+5.95e-02

Pos logit contributions
o+0.066
jo+0.064
os+0.064
ppy+0.063
azz+0.061
Neg logit contributions
 anything-0.041
 top-0.040
 new-0.039
 things-0.039
 chores-0.039
Token to
Token ID284
Feature activation+7.03e-02

Pos logit contributions
 anything+0.115
 top+0.114
 bottom+0.110
 chores+0.110
 biggest+0.110
Neg logit contributions
jo-0.134
o-0.134
os-0.128
azz-0.126
ppy-0.123
Token big
Token ID1263
Feature activation+1.09e-02

Pos logit contributions
 new+0.026
 top+0.026
 good+0.024
 great+0.024
 anything+0.023
Neg logit contributions
jo-0.087
os-0.085
o-0.085
azz-0.084
ppy-0.082
Token dog
Token ID3290
Feature activation-1.37e-01

Pos logit contributions
 anything+0.019
 top+0.019
 something+0.018
 new+0.018
 bottom+0.018
Neg logit contributions
jo-0.027
o-0.027
azz-0.026
os-0.026
ppy-0.026
Token named
Token ID3706
Feature activation-3.37e-02

Pos logit contributions
azz+0.349
jo+0.346
ppy+0.332
anne+0.329
os+0.329
Neg logit contributions
 the-0.231
 something-0.227
 many-0.226
 new-0.225
 a-0.218
Token Max
Token ID5436
Feature activation-1.36e-01

Pos logit contributions
jo+0.088
os+0.085
azz+0.081
o+0.081
ppy+0.080
Neg logit contributions
 new-0.059
 top-0.058
 something-0.058
 anything-0.057
 great-0.054
Token.
Token ID13
Feature activation-4.41e-02

Pos logit contributions
azz+0.409
jo+0.407
os+0.401
anne+0.393
oke+0.387
Neg logit contributions
 something-0.160
 anything-0.156
 new-0.152
 many-0.148
 the-0.145
Token Max
Token ID5436
Feature activation-1.30e-01

Pos logit contributions
jo+0.101
o+0.100
ppy+0.099
os+0.098
azz+0.095
Neg logit contributions
 top-0.084
 new-0.082
 anything-0.082
 chores-0.080
 great-0.079
Token was
Token ID373
Feature activation+6.39e-02

Pos logit contributions
azz+0.333
jo+0.321
os+0.319
ppy+0.312
o+0.302
Neg logit contributions
 new-0.220
 great-0.218
 good-0.216
 something-0.214
 top-0.211
Token very
Token ID845
Feature activation+4.36e-02

Pos logit contributions
 top+0.074
 anything+0.073
 bottom+0.071
 chores+0.064
 accomplishment+0.064
Neg logit contributions
o-0.206
jo-0.191
azz-0.184
ppy-0.184
os-0.181
Token hungry
Token ID14720
Feature activation+1.11e-02

Pos logit contributions
 anything+0.048
 top+0.046
 something+0.042
 bottom+0.042
 chores+0.040
Neg logit contributions
o-0.140
jo-0.138
azz-0.134
os-0.133
ppy-0.132
Token and
Token ID290
Feature activation+1.42e-03

Pos logit contributions
 anything+0.024
 top+0.024
 bottom+0.023
 chores+0.023
 biggest+0.023
Neg logit contributions
o-0.023
jo-0.022
ppy-0.021
os-0.021
azz-0.020
Token wanted
Token ID2227
Feature activation+6.83e-02

Pos logit contributions
 anything+0.003
 top+0.003
 new+0.003
 something+0.003
 bottom+0.003
Neg logit contributions
jo-0.003
o-0.003
azz-0.002
os-0.002
ppy-0.002
Token.
Token ID13
Feature activation-6.07e-02

Pos logit contributions
jo+0.094
o+0.093
azz+0.090
os+0.089
ppy+0.088
Neg logit contributions
 anything-0.069
 top-0.069
 new-0.066
 something-0.066
 bottom-0.066
Token They
Token ID1119
Feature activation-1.47e-02

Pos logit contributions
jo+0.136
o+0.134
azz+0.130
os+0.128
ppy+0.128
Neg logit contributions
 anything-0.122
 top-0.122
 new-0.117
 something-0.117
 bottom-0.116
Token still
Token ID991
Feature activation+3.68e-02

Pos logit contributions
o+0.037
jo+0.036
ppy+0.036
azz+0.035
os+0.035
Neg logit contributions
 anything-0.024
 top-0.024
 something-0.024
 new-0.023
 things-0.023
Token like
Token ID588
Feature activation+6.35e-02

Pos logit contributions
 top+0.063
 anything+0.062
 new+0.059
 something+0.059
 bottom+0.059
Neg logit contributions
jo-0.093
o-0.093
os-0.089
azz-0.089
ppy-0.088
Token the
Token ID262
Feature activation+4.33e-02

Pos logit contributions
 top+0.084
 bottom+0.080
 anything+0.075
 middle+0.075
 biggest+0.074
Neg logit contributions
o-0.182
os-0.177
jo-0.176
azz-0.173
ppy-0.164
Token jet
Token ID12644
Feature activation-1.23e-01

Pos logit contributions
 anything+0.085
 something+0.079
 yourselves+0.078
 top+0.078
 himself+0.077
Neg logit contributions
jo-0.091
azz-0.089
o-0.084
os-0.081
itty-0.079
Token.
Token ID13
Feature activation-5.98e-02

Pos logit contributions
jo+0.325
o+0.320
azz+0.314
os+0.308
ppy+0.306
Neg logit contributions
 anything-0.201
 top-0.197
 new-0.196
 something-0.188
 things-0.188
Token They
Token ID1119
Feature activation-1.34e-02

Pos logit contributions
jo+0.129
o+0.128
azz+0.123
os+0.122
ppy+0.119
Neg logit contributions
 anything-0.125
 top-0.124
 new-0.120
 something-0.119
 bottom-0.119
Token hope
Token ID2911
Feature activation+2.89e-02

Pos logit contributions
o+0.033
jo+0.033
ppy+0.032
azz+0.032
os+0.032
Neg logit contributions
 anything-0.022
 top-0.022
 something-0.022
 new-0.021
 biggest-0.021
Token to
Token ID284
Feature activation+5.98e-02

Pos logit contributions
 top+0.041
 bottom+0.039
 chores+0.039
 biggest+0.038
 great+0.037
Neg logit contributions
jo-0.077
o-0.077
os-0.075
azz-0.074
ppy-0.069
Token see
Token ID766
Feature activation+5.33e-02

Pos logit contributions
 things+0.102
 accomplishment+0.101
 biggest+0.100
 trouble+0.099
 top+0.098
Neg logit contributions
o-0.125
ppy-0.119
azz-0.113
jo-0.113
of-0.111
Token shapes
Token ID15268
Feature activation-2.70e-02

Pos logit contributions
 chores+0.075
 anything+0.072
 bottom+0.071
 biggest+0.068
 top+0.066
Neg logit contributions
jo-0.218
o-0.214
ppy-0.212
os-0.212
azz-0.210
Token.
Token ID13
Feature activation-5.58e-02

Pos logit contributions
o+0.062
jo+0.062
os+0.060
azz+0.060
ppy+0.059
Neg logit contributions
 anything-0.049
 top-0.049
 chores-0.048
 new-0.047
 bottom-0.047
Token She
Token ID1375
Feature activation-1.73e-02

Pos logit contributions
jo+0.141
o+0.137
ppy+0.134
azz+0.134
os+0.132
Neg logit contributions
 top-0.101
 anything-0.098
 something-0.096
 new-0.096
 bottom-0.093
Token made
Token ID925
Feature activation+6.48e-02

Pos logit contributions
jo+0.040
o+0.039
azz+0.039
ppy+0.037
os+0.037
Neg logit contributions
 top-0.033
 anything-0.033
 new-0.032
 something-0.032
 great-0.032
Token a
Token ID257
Feature activation+4.51e-02

Pos logit contributions
 bottom+0.062
 chores+0.059
 top+0.057
 anything+0.056
 danger+0.054
Neg logit contributions
jo-0.199
o-0.196
os-0.192
azz-0.191
ppy-0.185
Token doll
Token ID3654
Feature activation-1.14e-01

Pos logit contributions
 anything+0.049
 top+0.045
 chores+0.044
 something+0.044
 bottom+0.042
Neg logit contributions
jo-0.141
o-0.139
azz-0.136
os-0.134
ppy-0.133
Token,
Token ID11
Feature activation+6.35e-03

Pos logit contributions
jo+0.309
o+0.304
azz+0.297
os+0.293
ppy+0.292
Neg logit contributions
 top-0.176
 anything-0.176
 new-0.168
 something-0.167
 bottom-0.164
Token a
Token ID257
Feature activation+4.87e-02

Pos logit contributions
 anything+0.008
 top+0.008
 bottom+0.007
 chores+0.007
 new+0.007
Neg logit contributions
jo-0.019
o-0.019
azz-0.018
os-0.018
ppy-0.018
Token car
Token ID1097
Feature activation-7.52e-02

Pos logit contributions
 top+0.068
 anything+0.066
 new+0.065
 something+0.063
 bottom+0.063
Neg logit contributions
jo-0.140
o-0.138
os-0.135
azz-0.135
ppy-0.134
Token and
Token ID290
Feature activation+4.08e-03

Pos logit contributions
jo+0.212
o+0.207
azz+0.204
ppy+0.203
os+0.201
Neg logit contributions
 top-0.109
 anything-0.107
 new-0.104
 something-0.102
 things-0.100
Token even
Token ID772
Feature activation+2.08e-02

Pos logit contributions
 anything+0.004
 top+0.004
 new+0.003
 something+0.003
 bottom+0.003
Neg logit contributions
jo-0.013
o-0.013
azz-0.013
os-0.013
ppy-0.013