TOP ACTIVATIONS
MAX = 15.751

and make sure you never get lost on your journey .
, we don 't always get what we want . It
's okay to not always get what you want ! <|endoftext|>
, they wouldn 't have gotten broken . Tim my realized
, so he could avoid getting another band age . <|endoftext|>
off and vowed never to get too close to a trap
money and didn 't delay getting them ". Bob
sharing toys was better than getting angry or hating anyone .
so they wouldn 't have gotten in trouble . From then
, but his mustache kept getting in the way and preventing
and be careful not to get tired ." Sam smiled and
friends that he would never get stuck again . From that
the wet p uddle without getting wet . Max
farmer and vowed to never get lost again . From that
make sure he didn 't get lost . As he looked
always be careful and never get caught again . <|endoftext|> Once
and her family will never get to see her again .
I didn 't want you getting too close to the posts
, we might not always get the result we wanted .
destroying things it would never get the butter it wanted .

MIN ACTIVATIONS
MIN = -19.898

. The great Zen o hugs Ben and says
One day , the hipp o saw a little bird
food . The hipp o went to the kitchen
One day , little Jo anne was r aking the leaves
. The flaming o nodded its head and
One day , the hipp o saw a new friend
was a little fish named Nem o . Nem o was
very hungry . So the hipp o decided to cook some
. The hipp o found some vegetables and
One day , Tom the Toad went out for a jog
But then , the hipp o went to play with
in a river . The hipp o was very happy and
One day , the hipp o met a wise old
One day , the hipp o saw something that caught
right . The hipp o looked around and noticed
! â € <|endoftext|> Pa olo was very excited . He
was a little fish named Nem o . Nem o lived
a little fish y named Nem o . Nem o loved
bird . So , the hipp o used its big mouth
was a little fish named Nem o . Nem o loved

INTERVAL 6.839 - 15.751
CONTAINS 9.345%

block . It helps you learn numbers . Do you know
the kitchen . He was having so much fun that he
good memories that she never wanted to let it go .
. When she got there , Molly noticed that
. When she got there , Sarah saw a

INTERVAL -2.073 - 6.839
CONTAINS 53.942%

not know it was her design . He says , "
. It looked sharp and dangerous . She was scared .
other . " That was a bad idea ,
got back , Jenny had made another decision . She wanted
a crown and a dress at home . Can I go

INTERVAL -10.985 - -2.073
CONTAINS 31.340%

bad and wanted to help . Together , they cleaned the
You know ." Mom read the card . "
. Bob was feeling tired . He decided to twist and
to do . She asked her mom , " Where did
help me carry this rock ? It 's too heavy for

INTERVAL -19.898 - -10.985
CONTAINS 5.373%

. The fire work spark led and tw ink
nice it looked . His aunt looked very happy . She
went to her brother , Jack , and asked him if
touch your c actus , Tom ?" she asks .
flying around the flowers . Lily liked the bee . She
Token and
Token ID290
Feature activation+5.97e+00

Pos logit contributions
 anything+0.064
 chores+0.058
 trouble+0.058
 miles+0.057
 something+0.057
Neg logit contributions
jo-0.079
y-0.075
ppy-0.074
o-0.074
azz-0.073
Token make
Token ID787
Feature activation+8.72e+00

Pos logit contributions
 anything+0.091
 chores+0.081
 top+0.079
 trouble+0.079
 miles+0.078
Neg logit contributions
jo-0.142
o-0.135
y-0.133
ppy-0.133
os-0.130
Token sure
Token ID1654
Feature activation+3.09e+00

Pos logit contributions
 anything+0.084
 chores+0.070
 trouble+0.069
 miles+0.068
 something+0.066
Neg logit contributions
jo-0.257
o-0.243
ppy-0.242
y-0.240
azz-0.237
Token you
Token ID345
Feature activation-3.54e-01

Pos logit contributions
 anything+0.036
 trouble+0.033
 chores+0.032
 miles+0.031
 something+0.031
Neg logit contributions
jo-0.084
o-0.080
y-0.079
ppy-0.079
os-0.079
Token never
Token ID1239
Feature activation+8.53e+00

Pos logit contributions
jo+0.009
azz+0.008
ppy+0.008
o+0.008
y+0.008
Neg logit contributions
 anything-0.006
 something-0.006
 chores-0.005
 trouble-0.005
 good-0.005
Token get
Token ID651
Feature activation+1.51e+01

Pos logit contributions
 anything+0.143
 something+0.128
 trouble+0.127
 chores+0.123
 good+0.122
Neg logit contributions
jo-0.211
azz-0.198
ppy-0.198
o-0.195
os-0.190
Token lost
Token ID2626
Feature activation+7.74e+00

Pos logit contributions
 anything+0.179
 chores+0.173
 top+0.163
 accomplishment+0.162
wered+0.161
Neg logit contributions
o-0.382
y-0.377
ppy-0.377
jo-0.376
ne-0.364
Token on
Token ID319
Feature activation+1.14e+01

Pos logit contributions
 anything+0.129
 chores+0.128
 trouble+0.121
 good+0.117
 something+0.117
Neg logit contributions
jo-0.171
ppy-0.165
y-0.165
o-0.160
azz-0.159
Token your
Token ID534
Feature activation+6.38e+00

Pos logit contributions
 accomplishment+0.078
 chores+0.077
 anything+0.077
wered+0.074
 middle+0.072
Neg logit contributions
jo-0.324
y-0.314
o-0.306
ppy-0.305
os-0.304
Token journey
Token ID7002
Feature activation-7.73e-01

Pos logit contributions
 anything+0.088
 something+0.077
 trouble+0.075
 yourselves+0.073
 chores+0.073
Neg logit contributions
jo-0.166
ppy-0.156
o-0.152
y-0.151
azz-0.149
Token.
Token ID13
Feature activation-3.28e+00

Pos logit contributions
jo+0.017
ppy+0.016
o+0.016
azz+0.016
y+0.016
Neg logit contributions
 anything-0.014
 trouble-0.013
 chores-0.013
 something-0.013
 good-0.012
Token,
Token ID11
Feature activation+2.06e+00

Pos logit contributions
 miles+0.076
wered+0.076
 accomplishment+0.074
 anything+0.072
 middle+0.072
Neg logit contributions
jo-0.196
y-0.196
azz-0.192
o-0.191
ppy-0.190
Token we
Token ID356
Feature activation+5.45e-01

Pos logit contributions
 anything+0.028
 chores+0.027
 trouble+0.026
 top+0.026
 bottom+0.025
Neg logit contributions
jo-0.048
ppy-0.048
y-0.046
azz-0.046
ne-0.046
Token don
Token ID836
Feature activation+4.77e+00

Pos logit contributions
 anything+0.011
 something+0.010
 trouble+0.010
 chores+0.010
 good+0.010
Neg logit contributions
jo-0.011
ppy-0.010
azz-0.010
o-0.010
y-0.010
Token't
Token ID470
Feature activation+6.55e+00

Pos logit contributions
 anything+0.069
 something+0.061
 trouble+0.060
 good+0.058
 miles+0.057
Neg logit contributions
jo-0.121
azz-0.117
o-0.116
ppy-0.115
y-0.114
Token always
Token ID1464
Feature activation+8.76e+00

Pos logit contributions
 anything+0.093
 trouble+0.082
 something+0.081
 chores+0.081
 good+0.081
Neg logit contributions
jo-0.180
azz-0.170
ppy-0.168
o-0.167
os-0.166
Token get
Token ID651
Feature activation+1.48e+01

Pos logit contributions
 anything+0.149
 miles+0.134
 top+0.133
 chores+0.133
 something+0.133
Neg logit contributions
jo-0.190
ppy-0.188
o-0.183
y-0.182
azz-0.180
Token what
Token ID644
Feature activation+4.57e+00

Pos logit contributions
wered+0.174
 bottom+0.172
 top+0.171
 anything+0.167
 accomplishment+0.167
Neg logit contributions
o-0.374
ppy-0.372
y-0.371
jo-0.352
ne-0.351
Token we
Token ID356
Feature activation+1.22e+00

Pos logit contributions
 anything+0.072
 chores+0.065
 miles+0.064
 trouble+0.064
 something+0.063
Neg logit contributions
jo-0.107
o-0.102
ppy-0.102
y-0.101
azz-0.101
Token want
Token ID765
Feature activation+6.92e+00

Pos logit contributions
 anything+0.022
 something+0.020
 trouble+0.020
 chores+0.020
 good+0.019
Neg logit contributions
jo-0.028
ppy-0.026
azz-0.026
o-0.026
y-0.025
Token.
Token ID13
Feature activation-3.51e+00

Pos logit contributions
 anything+0.127
 chores+0.118
 trouble+0.117
 miles+0.116
 something+0.114
Neg logit contributions
jo-0.146
ppy-0.141
y-0.139
o-0.139
azz-0.138
Token It
Token ID632
Feature activation-2.92e+00

Pos logit contributions
jo+0.088
ppy+0.084
y+0.083
anne+0.081
azz+0.079
Neg logit contributions
 anything-0.056
 something-0.052
 trouble-0.050
 many-0.048
 an-0.047
Token's
Token ID338
Feature activation+6.01e+00

Pos logit contributions
jo+0.061
azz+0.056
ppy+0.056
o+0.055
y+0.053
Neg logit contributions
 anything-0.097
 something-0.091
 chores-0.089
 trouble-0.089
 good-0.088
Token okay
Token ID8788
Feature activation+1.53e+00

Pos logit contributions
 miles+0.076
 anything+0.070
 bottom+0.069
 trouble+0.066
 shore+0.066
Neg logit contributions
o-0.149
ppy-0.148
y-0.146
jo-0.139
ah-0.134
Token to
Token ID284
Feature activation+7.59e+00

Pos logit contributions
 anything+0.025
 something+0.022
 chores+0.021
 trouble+0.021
 good+0.021
Neg logit contributions
jo-0.039
azz-0.037
ppy-0.037
o-0.036
ne-0.035
Token not
Token ID407
Feature activation+8.36e+00

Pos logit contributions
 anything+0.117
 something+0.104
 trouble+0.103
 chores+0.103
 good+0.098
Neg logit contributions
jo-0.192
ppy-0.184
azz-0.181
o-0.181
y-0.178
Token always
Token ID1464
Feature activation+9.11e+00

Pos logit contributions
 anything+0.127
 something+0.113
 trouble+0.112
 chores+0.111
 good+0.107
Neg logit contributions
jo-0.215
ppy-0.205
azz-0.203
o-0.202
y-0.199
Token get
Token ID651
Feature activation+1.47e+01

Pos logit contributions
 anything+0.147
 chores+0.137
 top+0.135
 miles+0.134
 trouble+0.132
Neg logit contributions
jo-0.199
ppy-0.196
y-0.192
o-0.192
azz-0.188
Token what
Token ID644
Feature activation+4.87e+00

Pos logit contributions
 anything+0.168
 top+0.159
 chores+0.157
 bottom+0.155
 accomplishment+0.155
Neg logit contributions
o-0.385
ppy-0.383
y-0.381
jo-0.376
azz-0.366
Token you
Token ID345
Feature activation-1.02e+00

Pos logit contributions
 anything+0.073
 chores+0.065
 trouble+0.064
 something+0.064
 miles+0.063
Neg logit contributions
jo-0.120
o-0.114
ppy-0.114
azz-0.114
y-0.113
Token want
Token ID765
Feature activation+6.71e+00

Pos logit contributions
jo+0.025
azz+0.023
ppy+0.023
o+0.023
y+0.022
Neg logit contributions
 anything-0.017
 something-0.016
 chores-0.016
 trouble-0.016
 good-0.015
Token!
Token ID0
Feature activation-3.76e+00

Pos logit contributions
 anything+0.120
 chores+0.111
 trouble+0.109
 miles+0.107
 something+0.107
Neg logit contributions
jo-0.147
ppy-0.141
y-0.138
o-0.138
azz-0.138
Token<|endoftext|>
Token ID50256
Feature activation-8.66e+00

Pos logit contributions
jo+0.102
ppy+0.100
y+0.097
anne+0.095
azz+0.095
Neg logit contributions
 anything-0.050
 something-0.046
 trouble-0.045
 many-0.043
 good-0.042
Token,
Token ID11
Feature activation+3.09e+00

Pos logit contributions
y+0.066
jo+0.066
ppy+0.066
o+0.065
azz+0.063
Neg logit contributions
 anything-0.054
 trouble-0.051
 miles-0.051
 chores-0.051
 many-0.048
Token they
Token ID484
Feature activation-1.54e+00

Pos logit contributions
 anything+0.039
 top+0.036
 bottom+0.036
 miles+0.036
 chores+0.036
Neg logit contributions
jo-0.077
o-0.075
azz-0.074
ppy-0.074
y-0.072
Token wouldn
Token ID3636
Feature activation+6.52e+00

Pos logit contributions
jo+0.029
ppy+0.027
azz+0.027
o+0.027
y+0.026
Neg logit contributions
 anything-0.034
 something-0.031
 chores-0.031
 trouble-0.031
 good-0.030
Token't
Token ID470
Feature activation+7.80e+00

Pos logit contributions
 anything+0.077
 something+0.066
 trouble+0.065
 chores+0.065
 good+0.061
Neg logit contributions
jo-0.190
ppy-0.181
azz-0.180
o-0.179
y-0.177
Token have
Token ID423
Feature activation+1.07e+01

Pos logit contributions
 anything+0.114
 something+0.100
 trouble+0.099
 chores+0.098
 good+0.097
Neg logit contributions
jo-0.211
azz-0.198
ppy-0.197
o-0.197
os-0.194
Token gotten
Token ID7891
Feature activation+1.47e+01

Pos logit contributions
 anything+0.137
 chores+0.129
 top+0.122
 accomplishment+0.121
 yourselves+0.121
Neg logit contributions
jo-0.271
o-0.267
ppy-0.265
y-0.265
azz-0.259
Token broken
Token ID5445
Feature activation+6.28e+00

Pos logit contributions
 chores+0.184
wered+0.183
 accomplishment+0.182
 anything+0.182
 top+0.181
Neg logit contributions
y-0.361
o-0.359
ppy-0.352
jo-0.340
ne-0.333
Token.
Token ID13
Feature activation-3.47e+00

Pos logit contributions
 anything+0.119
 chores+0.115
 something+0.110
 trouble+0.109
 miles+0.108
Neg logit contributions
jo-0.127
y-0.125
ppy-0.121
o-0.119
azz-0.117
Token Tim
Token ID5045
Feature activation-1.25e+01

Pos logit contributions
ppy+0.094
jo+0.092
y+0.091
ne+0.086
ah+0.086
Neg logit contributions
 anything-0.044
 something-0.042
 many-0.040
 trouble-0.040
 an-0.039
Tokenmy
Token ID1820
Feature activation-8.42e+00

Pos logit contributions
azz+0.175
jo+0.175
anne+0.159
ne+0.159
os+0.159
Neg logit contributions
 good-0.293
 himself-0.293
 something-0.289
 trouble-0.285
 many-0.285
Token realized
Token ID6939
Feature activation+3.93e+00

Pos logit contributions
jo+0.201
azz+0.195
ppy+0.188
os+0.183
itty+0.182
Neg logit contributions
 something-0.136
 anything-0.135
 good-0.131
 many-0.129
 trouble-0.128
Token,
Token ID11
Feature activation+3.16e+00

Pos logit contributions
 anything+0.078
 chores+0.072
 something+0.071
 trouble+0.071
 good+0.069
Neg logit contributions
jo-0.087
ppy-0.082
azz-0.081
o-0.081
y-0.081
Token so
Token ID523
Feature activation+6.23e+00

Pos logit contributions
 anything+0.059
 chores+0.057
 trouble+0.054
 yourselves+0.053
 something+0.053
Neg logit contributions
jo-0.065
ppy-0.062
azz-0.062
o-0.062
y-0.060
Token he
Token ID339
Feature activation-3.57e+00

Pos logit contributions
 anything+0.079
 chores+0.071
 trouble+0.070
 something+0.068
 good+0.064
Neg logit contributions
jo-0.172
o-0.163
ppy-0.163
azz-0.162
y-0.160
Token could
Token ID714
Feature activation+6.25e+00

Pos logit contributions
jo+0.078
azz+0.075
ppy+0.072
o+0.071
os+0.070
Neg logit contributions
 anything-0.068
 something-0.063
 good-0.063
 chores-0.062
 trouble-0.061
Token avoid
Token ID3368
Feature activation+8.95e+00

Pos logit contributions
 anything+0.083
 trouble+0.073
 something+0.072
 chores+0.069
 yourselves+0.067
Neg logit contributions
jo-0.169
o-0.162
ppy-0.161
azz-0.159
y-0.157
Token getting
Token ID1972
Feature activation+1.47e+01

Pos logit contributions
 anything+0.070
 chores+0.070
 yourselves+0.064
 good+0.061
 something+0.060
Neg logit contributions
jo-0.265
y-0.252
o-0.250
ppy-0.244
azz-0.242
Token another
Token ID1194
Feature activation+4.15e+00

Pos logit contributions
 anything+0.181
 chores+0.171
 accomplishment+0.165
 yourselves+0.164
 top+0.161
Neg logit contributions
o-0.374
y-0.359
jo-0.353
ppy-0.350
os-0.347
Token band
Token ID4097
Feature activation-4.36e+00

Pos logit contributions
 anything+0.050
 something+0.044
 trouble+0.043
 chores+0.043
 good+0.041
Neg logit contributions
jo-0.120
ppy-0.115
azz-0.114
o-0.114
y-0.111
Tokenage
Token ID496
Feature activation-1.55e+00

Pos logit contributions
jo+0.087
ppy+0.082
azz+0.081
o+0.079
y+0.077
Neg logit contributions
 anything-0.092
 something-0.085
 chores-0.083
 trouble-0.083
 good-0.082
Token.
Token ID13
Feature activation-3.29e+00

Pos logit contributions
jo+0.038
ppy+0.036
o+0.035
azz+0.035
y+0.035
Neg logit contributions
 anything-0.025
 trouble-0.023
 chores-0.022
 something-0.022
 good-0.021
Token<|endoftext|>
Token ID50256
Feature activation-7.21e+00

Pos logit contributions
jo+0.086
ppy+0.085
y+0.083
azz+0.081
ne+0.079
Neg logit contributions
 anything-0.046
 something-0.043
 trouble-0.041
 many-0.041
 himself-0.040
Token off
Token ID572
Feature activation+4.22e+00

Pos logit contributions
 anything+0.052
 something+0.047
 trouble+0.046
 chores+0.045
 good+0.045
Neg logit contributions
jo-0.092
ppy-0.088
azz-0.087
o-0.086
y-0.084
Token and
Token ID290
Feature activation+6.68e+00

Pos logit contributions
 anything+0.073
 chores+0.065
 trouble+0.064
 something+0.064
 yourselves+0.063
Neg logit contributions
jo-0.099
o-0.094
ppy-0.092
azz-0.092
y-0.092
Token vowed
Token ID19982
Feature activation+3.97e+00

Pos logit contributions
 anything+0.134
 something+0.128
 good+0.122
 trouble+0.121
 many+0.121
Neg logit contributions
jo-0.140
azz-0.131
ppy-0.130
y-0.124
o-0.123
Token never
Token ID1239
Feature activation+8.30e+00

Pos logit contributions
 anything+0.068
 trouble+0.062
 chores+0.061
 something+0.059
 yourselves+0.058
Neg logit contributions
jo-0.091
o-0.087
azz-0.084
y-0.083
ppy-0.083
Token to
Token ID284
Feature activation+1.01e+01

Pos logit contributions
 anything+0.140
 chores+0.126
 something+0.126
 trouble+0.125
 good+0.120
Neg logit contributions
jo-0.198
ppy-0.188
azz-0.186
o-0.186
y-0.183
Token get
Token ID651
Feature activation+1.46e+01

Pos logit contributions
 anything+0.156
 something+0.140
 chores+0.139
 trouble+0.139
 many+0.133
Neg logit contributions
jo-0.250
ppy-0.238
o-0.237
azz-0.236
y-0.233
Token too
Token ID1165
Feature activation+6.35e+00

Pos logit contributions
 anything+0.185
 chores+0.181
 accomplishment+0.176
 top+0.174
 yourselves+0.170
Neg logit contributions
o-0.368
jo-0.358
ppy-0.355
y-0.354
azz-0.343
Token close
Token ID1969
Feature activation+7.72e+00

Pos logit contributions
 anything+0.077
 chores+0.068
 yourselves+0.067
 accomplishment+0.064
 top+0.060
Neg logit contributions
o-0.167
y-0.166
jo-0.164
ppy-0.159
ne-0.159
Token to
Token ID284
Feature activation+8.75e+00

Pos logit contributions
 anything+0.144
 chores+0.137
 something+0.130
 trouble+0.130
 good+0.129
Neg logit contributions
jo-0.163
y-0.154
o-0.152
ppy-0.149
azz-0.148
Token a
Token ID257
Feature activation+4.66e+00

Pos logit contributions
 anything+0.066
 chores+0.061
 trouble+0.056
 yourselves+0.055
 good+0.054
Neg logit contributions
jo-0.265
o-0.251
ppy-0.251
azz-0.248
y-0.246
Token trap
Token ID12840
Feature activation-2.47e+00

Pos logit contributions
 anything+0.061
 chores+0.053
 something+0.053
 trouble+0.052
 good+0.049
Neg logit contributions
jo-0.128
ppy-0.122
azz-0.121
o-0.121
y-0.119
Token money
Token ID1637
Feature activation-3.68e+00

Pos logit contributions
 anything+0.035
 something+0.030
 yourselves+0.030
 trouble+0.029
 chores+0.029
Neg logit contributions
jo-0.081
ppy-0.077
azz-0.077
o-0.076
y-0.075
Token and
Token ID290
Feature activation+5.96e+00

Pos logit contributions
jo+0.085
ppy+0.081
azz+0.080
o+0.079
y+0.078
Neg logit contributions
 anything-0.065
 something-0.059
 chores-0.058
 trouble-0.058
 good-0.057
Token didn
Token ID1422
Feature activation+7.07e+00

Pos logit contributions
 anything+0.086
 trouble+0.079
 chores+0.078
 bottom+0.075
 miles+0.075
Neg logit contributions
jo-0.145
o-0.141
ppy-0.140
azz-0.137
y-0.134
Token't
Token ID470
Feature activation+6.77e+00

Pos logit contributions
 anything+0.100
 something+0.090
 trouble+0.089
 chores+0.085
 miles+0.085
Neg logit contributions
jo-0.180
azz-0.174
o-0.173
ppy-0.171
ne-0.168
Token delay
Token ID5711
Feature activation+8.30e+00

Pos logit contributions
 anything+0.105
 something+0.094
 trouble+0.093
 chores+0.093
 good+0.090
Neg logit contributions
jo-0.173
ppy-0.164
azz-0.163
o-0.162
y-0.159
Token getting
Token ID1972
Feature activation+1.46e+01

Pos logit contributions
 anything+0.097
 miles+0.093
 middle+0.093
wered+0.092
 chores+0.092
Neg logit contributions
jo-0.205
y-0.203
o-0.201
ppy-0.201
azz-0.192
Token them
Token ID606
Feature activation-2.17e+00

Pos logit contributions
wered+0.158
 accomplishment+0.156
 bottom+0.153
 anything+0.151
 top+0.149
Neg logit contributions
o-0.370
ppy-0.361
y-0.357
ne-0.342
azz-0.340
Token".
Token ID1911
Feature activation-4.95e+00

Pos logit contributions
jo+0.052
o+0.052
ppy+0.051
y+0.051
azz+0.050
Neg logit contributions
 anything-0.032
 trouble-0.029
 something-0.029
 chores-0.029
 miles-0.028
Token
Token ID198
Feature activation-2.61e+00

Pos logit contributions
jo+0.123
ppy+0.123
azz+0.120
y+0.115
o+0.111
Neg logit contributions
 anything-0.079
 something-0.074
 trouble-0.074
 many-0.071
 top-0.070
Token
Token ID198
Feature activation-3.04e+00

Pos logit contributions
jo+0.071
ppy+0.071
azz+0.069
y+0.067
os+0.065
Neg logit contributions
 anything-0.032
 many-0.031
 something-0.031
 good-0.030
 trouble-0.030
TokenBob
Token ID18861
Feature activation-1.50e+01

Pos logit contributions
jo+0.071
ppy+0.068
azz+0.068
y+0.064
os+0.063
Neg logit contributions
 anything-0.053
 something-0.051
 good-0.050
 many-0.049
 trouble-0.049
Token sharing
Token ID7373
Feature activation+2.82e+00

Pos logit contributions
 anything+0.023
 chores+0.020
 yourselves+0.020
 miles+0.020
 great+0.020
Neg logit contributions
o-0.038
jo-0.037
y-0.037
azz-0.037
os-0.035
Token toys
Token ID14958
Feature activation-2.74e+00

Pos logit contributions
 anything+0.055
 miles+0.053
 chores+0.053
 trouble+0.051
 top+0.050
Neg logit contributions
y-0.054
jo-0.053
o-0.052
azz-0.052
ppy-0.052
Token was
Token ID373
Feature activation+9.50e+00

Pos logit contributions
y+0.050
ppy+0.049
jo+0.049
o+0.048
azz+0.047
Neg logit contributions
 anything-0.056
 trouble-0.053
 chores-0.053
 miles-0.052
wered-0.051
Token better
Token ID1365
Feature activation+5.59e+00

Pos logit contributions
 bottom+0.119
 miles+0.119
wered+0.116
 top+0.112
 middle+0.112
Neg logit contributions
o-0.219
y-0.210
ppy-0.209
bug-0.200
my-0.199
Token than
Token ID621
Feature activation+8.62e+00

Pos logit contributions
 anything+0.104
wered+0.099
 chores+0.098
 miles+0.096
 top+0.095
Neg logit contributions
y-0.117
ppy-0.116
o-0.114
jo-0.113
ne-0.107
Token getting
Token ID1972
Feature activation+1.45e+01

Pos logit contributions
 middle+0.109
 chores+0.107
 top+0.106
 anything+0.103
 bottom+0.103
Neg logit contributions
ppy-0.206
y-0.205
o-0.203
azz-0.196
ne-0.196
Token angry
Token ID7954
Feature activation+3.64e+00

Pos logit contributions
 anything+0.154
 top+0.148
wered+0.147
 bottom+0.147
 chores+0.147
Neg logit contributions
ppy-0.385
y-0.384
o-0.375
azz-0.369
ne-0.361
Token or
Token ID393
Feature activation+7.74e+00

Pos logit contributions
 anything+0.065
 something+0.059
 trouble+0.059
 chores+0.059
 good+0.056
Neg logit contributions
jo-0.083
ppy-0.079
azz-0.078
o-0.078
y-0.076
Token hating
Token ID45701
Feature activation+5.54e+00

Pos logit contributions
 anything+0.098
 chores+0.087
 trouble+0.086
 something+0.084
 miles+0.081
Neg logit contributions
jo-0.206
ppy-0.204
o-0.203
y-0.200
azz-0.199
Token anyone
Token ID2687
Feature activation-4.66e+00

Pos logit contributions
 anything+0.060
 trouble+0.053
 chores+0.053
 miles+0.052
 accomplishment+0.051
Neg logit contributions
jo-0.155
o-0.150
y-0.150
ppy-0.150
azz-0.149
Token.
Token ID13
Feature activation-3.29e+00

Pos logit contributions
jo+0.108
ppy+0.101
azz+0.101
o+0.098
anne+0.096
Neg logit contributions
 anything-0.087
 something-0.079
 trouble-0.077
 good-0.076
 chores-0.076
Token so
Token ID523
Feature activation+6.18e+00

Pos logit contributions
 anything+0.084
 trouble+0.076
 chores+0.076
 something+0.075
 yourselves+0.073
Neg logit contributions
jo-0.110
ppy-0.103
o-0.102
azz-0.102
y-0.102
Token they
Token ID484
Feature activation-2.42e+00

Pos logit contributions
 anything+0.072
 trouble+0.061
 chores+0.061
 something+0.060
 yourselves+0.058
Neg logit contributions
jo-0.176
ppy-0.165
azz-0.165
o-0.163
y-0.162
Token wouldn
Token ID3636
Feature activation+5.85e+00

Pos logit contributions
jo+0.050
azz+0.047
o+0.046
itty+0.046
ppy+0.045
Neg logit contributions
 anything-0.049
 something-0.045
 chores-0.045
 many-0.045
 trouble-0.044
Token't
Token ID470
Feature activation+7.12e+00

Pos logit contributions
 anything+0.070
 something+0.060
 trouble+0.059
 chores+0.059
 good+0.056
Neg logit contributions
jo-0.169
ppy-0.162
azz-0.161
o-0.160
y-0.158
Token have
Token ID423
Feature activation+1.08e+01

Pos logit contributions
 anything+0.096
 something+0.082
 trouble+0.081
 good+0.081
 chores+0.080
Neg logit contributions
jo-0.202
azz-0.192
ppy-0.189
o-0.186
itty-0.186
Token gotten
Token ID7891
Feature activation+1.45e+01

Pos logit contributions
 anything+0.154
 chores+0.146
 top+0.141
 yourselves+0.140
 accomplishment+0.139
Neg logit contributions
jo-0.249
o-0.249
ppy-0.240
y-0.238
ne-0.236
Token in
Token ID287
Feature activation+9.36e+00

Pos logit contributions
wered+0.185
 accomplishment+0.185
 chores+0.184
 anything+0.184
 top+0.179
Neg logit contributions
o-0.354
y-0.344
jo-0.334
ppy-0.333
ne-0.322
Token trouble
Token ID5876
Feature activation+4.57e+00

Pos logit contributions
wered+0.071
 biggest+0.070
 anything+0.067
 middle+0.066
 chores+0.065
Neg logit contributions
jo-0.252
y-0.250
o-0.237
ne-0.234
os-0.234
Token.
Token ID13
Feature activation-3.33e+00

Pos logit contributions
 anything+0.087
 chores+0.081
 something+0.080
 trouble+0.079
 good+0.077
Neg logit contributions
jo-0.096
ppy-0.090
y-0.089
o-0.089
azz-0.087
Token From
Token ID3574
Feature activation+3.81e+00

Pos logit contributions
jo+0.086
ppy+0.085
y+0.082
azz+0.081
o+0.080
Neg logit contributions
 anything-0.048
 something-0.045
 trouble-0.044
 many-0.043
 good-0.041
Token then
Token ID788
Feature activation+5.73e-01

Pos logit contributions
 anything+0.061
 chores+0.056
 trouble+0.054
 good+0.052
 miles+0.051
Neg logit contributions
jo-0.091
o-0.088
ppy-0.086
y-0.086
azz-0.085
Token,
Token ID11
Feature activation+1.58e+00

Pos logit contributions
 anything+0.044
 trouble+0.041
 chores+0.040
 something+0.040
 miles+0.040
Neg logit contributions
jo-0.050
y-0.048
ppy-0.047
o-0.046
azz-0.046
Token but
Token ID475
Feature activation+2.89e+00

Pos logit contributions
 anything+0.023
 chores+0.021
 trouble+0.021
 something+0.020
 good+0.019
Neg logit contributions
jo-0.041
ppy-0.038
o-0.038
azz-0.038
y-0.038
Token his
Token ID465
Feature activation+2.85e-01

Pos logit contributions
 anything+0.036
 trouble+0.032
 chores+0.032
 something+0.030
 yourselves+0.029
Neg logit contributions
jo-0.079
ppy-0.077
o-0.077
y-0.075
azz-0.074
Token mustache
Token ID49303
Feature activation-1.07e+01

Pos logit contributions
 anything+0.004
 something+0.004
 trouble+0.004
 chores+0.004
 yourselves+0.003
Neg logit contributions
jo-0.007
ppy-0.007
o-0.007
azz-0.007
y-0.007
Token kept
Token ID4030
Feature activation+1.14e+01

Pos logit contributions
jo+0.222
ppy+0.210
azz+0.208
o+0.205
y+0.199
Neg logit contributions
 anything-0.216
 something-0.199
 trouble-0.196
 chores-0.196
 good-0.191
Token getting
Token ID1972
Feature activation+1.45e+01

Pos logit contributions
 anything+0.128
 yourselves+0.120
 chores+0.112
 miles+0.110
wered+0.109
Neg logit contributions
jo-0.289
o-0.288
y-0.287
ppy-0.273
ah-0.266
Token in
Token ID287
Feature activation+9.84e+00

Pos logit contributions
 yourselves+0.181
wered+0.163
 anything+0.156
 accomplishment+0.153
 ourselves+0.153
Neg logit contributions
o-0.352
y-0.351
ppy-0.342
azz-0.333
jo-0.324
Token the
Token ID262
Feature activation+5.77e+00

Pos logit contributions
 anything+0.111
 yourselves+0.108
 biggest+0.103
wered+0.098
 richest+0.096
Neg logit contributions
y-0.237
jo-0.231
o-0.220
ah-0.214
ne-0.211
Token way
Token ID835
Feature activation+5.54e+00

Pos logit contributions
 anything+0.074
 yourselves+0.064
 something+0.064
 miles+0.062
 chores+0.061
Neg logit contributions
jo-0.148
o-0.141
y-0.141
ppy-0.138
ne-0.136
Token and
Token ID290
Feature activation+7.12e+00

Pos logit contributions
 anything+0.096
 trouble+0.088
 chores+0.088
 something+0.087
 good+0.083
Neg logit contributions
jo-0.127
y-0.120
o-0.120
ppy-0.118
azz-0.118
Token preventing
Token ID12174
Feature activation+1.14e+01

Pos logit contributions
 anything+0.100
 chores+0.093
 trouble+0.091
 yourselves+0.089
 something+0.086
Neg logit contributions
jo-0.178
o-0.172
y-0.168
ppy-0.165
os-0.164
Token and
Token ID290
Feature activation+6.07e+00

Pos logit contributions
 anything+0.072
 trouble+0.065
 chores+0.065
 something+0.065
 miles+0.063
Neg logit contributions
jo-0.073
ppy-0.069
o-0.068
y-0.068
azz-0.067
Token be
Token ID307
Feature activation+9.18e+00

Pos logit contributions
 anything+0.095
 trouble+0.084
 miles+0.082
 chores+0.081
 something+0.080
Neg logit contributions
ppy-0.140
jo-0.138
o-0.137
azz-0.134
y-0.133
Token careful
Token ID8161
Feature activation+2.65e+00

Pos logit contributions
 anything+0.112
 chores+0.099
 trouble+0.097
 miles+0.094
 top+0.093
Neg logit contributions
jo-0.250
o-0.250
ppy-0.247
y-0.242
azz-0.237
Token not
Token ID407
Feature activation+8.03e+00

Pos logit contributions
 anything+0.044
 something+0.040
 trouble+0.039
 chores+0.039
 good+0.038
Neg logit contributions
jo-0.064
ppy-0.061
o-0.060
azz-0.060
y-0.059
Token to
Token ID284
Feature activation+9.87e+00

Pos logit contributions
 anything+0.127
 something+0.112
 good+0.109
 chores+0.107
 trouble+0.107
Neg logit contributions
jo-0.213
ppy-0.197
azz-0.196
anne-0.194
itty-0.191
Token get
Token ID651
Feature activation+1.45e+01

Pos logit contributions
 anything+0.154
 something+0.138
 trouble+0.136
 chores+0.136
 good+0.131
Neg logit contributions
jo-0.250
ppy-0.238
azz-0.236
o-0.235
y-0.230
Token tired
Token ID10032
Feature activation+5.41e+00

Pos logit contributions
 anything+0.175
 chores+0.157
 trouble+0.152
 miles+0.150
 something+0.150
Neg logit contributions
jo-0.379
o-0.378
ppy-0.377
y-0.371
azz-0.369
Token."
Token ID526
Feature activation-4.48e+00

Pos logit contributions
 anything+0.095
 something+0.087
 chores+0.086
 trouble+0.086
 good+0.082
Neg logit contributions
jo-0.123
ppy-0.118
azz-0.116
y-0.115
o-0.115
Token Sam
Token ID3409
Feature activation-1.51e+01

Pos logit contributions
ppy+0.125
jo+0.123
y+0.120
azz+0.116
ah+0.115
Neg logit contributions
 anything-0.053
 something-0.051
 trouble-0.049
 himself-0.049
 many-0.049
Token smiled
Token ID13541
Feature activation+4.01e-01

Pos logit contributions
azz+0.336
jo+0.335
ppy+0.304
itty+0.302
os+0.296
Neg logit contributions
 something-0.265
 the-0.259
 not-0.255
 an-0.254
 a-0.251
Token and
Token ID290
Feature activation+3.19e+00

Pos logit contributions
 anything+0.008
 something+0.007
 trouble+0.007
 chores+0.007
 good+0.007
Neg logit contributions
jo-0.009
ppy-0.008
azz-0.008
o-0.008
os-0.008
Token friends
Token ID2460
Feature activation-7.85e+00

Pos logit contributions
 anything+0.002
 something+0.002
 yourselves+0.002
 trouble+0.002
 chores+0.002
Neg logit contributions
jo-0.005
ppy-0.005
o-0.005
y-0.005
azz-0.005
Token that
Token ID326
Feature activation+1.62e+00

Pos logit contributions
jo+0.191
ppy+0.186
y+0.185
o+0.184
ne+0.178
Neg logit contributions
 trouble-0.108
 anything-0.108
 yourselves-0.104
 chores-0.099
 miles-0.098
Token he
Token ID339
Feature activation-2.48e+00

Pos logit contributions
 anything+0.021
 yourselves+0.020
 trouble+0.020
 chores+0.020
 top+0.019
Neg logit contributions
o-0.041
jo-0.041
ppy-0.040
y-0.040
os-0.039
Token would
Token ID561
Feature activation+6.51e+00

Pos logit contributions
jo+0.051
azz+0.047
anne+0.047
ppy+0.045
itty+0.045
Neg logit contributions
 anything-0.050
 good-0.047
 something-0.046
 many-0.045
 great-0.045
Token never
Token ID1239
Feature activation+8.01e+00

Pos logit contributions
 anything+0.102
 trouble+0.093
 something+0.091
 chores+0.086
 yourselves+0.086
Neg logit contributions
ppy-0.154
jo-0.153
o-0.149
azz-0.147
y-0.145
Token get
Token ID651
Feature activation+1.44e+01

Pos logit contributions
 anything+0.120
 something+0.105
 trouble+0.105
 chores+0.102
 good+0.101
Neg logit contributions
jo-0.215
azz-0.200
ppy-0.199
o-0.198
os-0.195
Token stuck
Token ID7819
Feature activation+5.68e+00

Pos logit contributions
 anything+0.189
 chores+0.184
 top+0.182
 accomplishment+0.179
 bottom+0.179
Neg logit contributions
o-0.362
ppy-0.353
y-0.349
jo-0.343
azz-0.338
Token again
Token ID757
Feature activation+3.22e+00

Pos logit contributions
 chores+0.111
 anything+0.107
 trouble+0.103
 yourselves+0.101
 something+0.100
Neg logit contributions
y-0.112
ppy-0.110
o-0.107
jo-0.106
azz-0.102
Token.
Token ID13
Feature activation-3.55e+00

Pos logit contributions
 anything+0.062
 chores+0.058
 trouble+0.056
 something+0.056
 good+0.055
Neg logit contributions
jo-0.068
ppy-0.064
o-0.063
y-0.063
azz-0.063
Token From
Token ID3574
Feature activation+3.49e+00

Pos logit contributions
jo+0.087
ppy+0.085
azz+0.083
y+0.081
anne+0.080
Neg logit contributions
 anything-0.057
 many-0.053
 something-0.053
 trouble-0.051
 good-0.051
Token that
Token ID326
Feature activation-1.54e+00

Pos logit contributions
 anything+0.056
 chores+0.051
 trouble+0.050
 miles+0.046
 something+0.046
Neg logit contributions
jo-0.085
o-0.083
ppy-0.082
y-0.081
ne-0.079
Token the
Token ID262
Feature activation+4.29e+00

Pos logit contributions
 chores+0.096
 anything+0.094
 good+0.085
 yourselves+0.084
 great+0.080
Neg logit contributions
jo-0.231
y-0.227
o-0.222
os-0.217
ne-0.215
Token wet
Token ID9583
Feature activation-7.83e-01

Pos logit contributions
 anything+0.059
 chores+0.051
 something+0.051
 trouble+0.051
 good+0.048
Neg logit contributions
jo-0.112
o-0.107
y-0.107
ppy-0.107
ne-0.105
Token p
Token ID279
Feature activation-4.96e+00

Pos logit contributions
jo+0.021
ppy+0.020
azz+0.020
o+0.020
y+0.019
Neg logit contributions
 anything-0.011
 something-0.010
 trouble-0.010
 chores-0.010
 good-0.009
Tokenuddle
Token ID24500
Feature activation-3.79e+00

Pos logit contributions
jo+0.098
ppy+0.091
azz+0.091
o+0.086
anne+0.084
Neg logit contributions
 anything-0.104
 something-0.098
 chores-0.096
 trouble-0.096
 yourselves-0.094
Token without
Token ID1231
Feature activation+1.05e+01

Pos logit contributions
jo+0.086
ppy+0.081
azz+0.081
o+0.080
y+0.078
Neg logit contributions
 anything-0.068
 something-0.062
 trouble-0.061
 chores-0.061
 good-0.059
Token getting
Token ID1972
Feature activation+1.44e+01

Pos logit contributions
 anything+0.105
 chores+0.101
 middle+0.092
 top+0.091
 great+0.091
Neg logit contributions
ppy-0.280
o-0.277
jo-0.273
y-0.268
os-0.260
Token wet
Token ID9583
Feature activation+3.56e+00

Pos logit contributions
 anything+0.207
 accomplishment+0.205
 chores+0.200
 avail+0.194
wered+0.189
Neg logit contributions
o-0.334
y-0.327
ppy-0.313
os-0.302
ne-0.289
Token.
Token ID13
Feature activation-3.38e+00

Pos logit contributions
 anything+0.068
 chores+0.064
 something+0.062
 trouble+0.062
 good+0.060
Neg logit contributions
jo-0.073
y-0.071
ppy-0.070
o-0.069
azz-0.067
Token
Token ID198
Feature activation-9.27e-01

Pos logit contributions
ppy+0.086
jo+0.086
y+0.083
azz+0.081
ne+0.080
Neg logit contributions
 anything-0.048
 something-0.046
 many-0.044
 trouble-0.044
 an-0.042
Token
Token ID198
Feature activation-1.29e+00

Pos logit contributions
jo+0.025
ppy+0.024
y+0.024
azz+0.023
o+0.023
Neg logit contributions
 anything-0.012
 something-0.012
 many-0.011
 trouble-0.011
 chores-0.011
TokenMax
Token ID11518
Feature activation-1.34e+01

Pos logit contributions
jo+0.033
ppy+0.032
azz+0.031
y+0.031
o+0.031
Neg logit contributions
 anything-0.019
 something-0.017
 trouble-0.017
 chores-0.017
 many-0.016
Token farmer
Token ID18739
Feature activation-1.16e+01

Pos logit contributions
 anything+0.032
 yourselves+0.027
 miles+0.027
 chores+0.027
 something+0.027
Neg logit contributions
jo-0.036
y-0.034
ppy-0.034
o-0.034
anne-0.033
Token and
Token ID290
Feature activation+4.39e+00

Pos logit contributions
jo+0.283
azz+0.278
ppy+0.269
o+0.260
os+0.259
Neg logit contributions
 anything-0.189
 something-0.176
 good-0.165
 many-0.162
 himself-0.162
Token vowed
Token ID19982
Feature activation+4.50e+00

Pos logit contributions
 anything+0.061
 chores+0.055
 trouble+0.053
 something+0.052
 yourselves+0.050
Neg logit contributions
jo-0.115
o-0.111
ppy-0.110
y-0.108
azz-0.108
Token to
Token ID284
Feature activation+7.21e+00

Pos logit contributions
 anything+0.078
 trouble+0.071
 chores+0.071
 something+0.068
 miles+0.067
Neg logit contributions
jo-0.101
o-0.098
ppy-0.094
azz-0.093
y-0.093
Token never
Token ID1239
Feature activation+8.37e+00

Pos logit contributions
 anything+0.116
 trouble+0.104
 something+0.103
 chores+0.103
 many+0.097
Neg logit contributions
jo-0.168
ppy-0.165
o-0.163
azz-0.156
y-0.156
Token get
Token ID651
Feature activation+1.44e+01

Pos logit contributions
 anything+0.130
 something+0.116
 trouble+0.114
 chores+0.112
 good+0.110
Neg logit contributions
jo-0.218
ppy-0.205
azz-0.203
o-0.202
y-0.198
Token lost
Token ID2626
Feature activation+7.25e+00

Pos logit contributions
 anything+0.202
 chores+0.201
 top+0.197
 accomplishment+0.197
 bottom+0.194
Neg logit contributions
o-0.341
ppy-0.325
jo-0.324
y-0.324
os-0.312
Token again
Token ID757
Feature activation+3.20e+00

Pos logit contributions
 chores+0.135
 anything+0.130
 accomplishment+0.123
 great+0.123
 many+0.123
Neg logit contributions
jo-0.143
y-0.141
ppy-0.136
o-0.133
ne-0.129
Token.
Token ID13
Feature activation-3.42e+00

Pos logit contributions
 anything+0.062
 chores+0.059
 trouble+0.057
 something+0.056
 good+0.056
Neg logit contributions
jo-0.066
ppy-0.061
o-0.061
y-0.061
azz-0.059
Token From
Token ID3574
Feature activation+3.84e+00

Pos logit contributions
jo+0.089
ppy+0.088
y+0.086
azz+0.084
ne+0.082
Neg logit contributions
 anything-0.048
 something-0.044
 trouble-0.044
 many-0.044
 good-0.041
Token that
Token ID326
Feature activation-1.24e+00

Pos logit contributions
 anything+0.061
 chores+0.056
 trouble+0.054
 something+0.051
 good+0.050
Neg logit contributions
jo-0.095
o-0.091
ppy-0.090
y-0.089
ne-0.087
Token make
Token ID787
Feature activation+9.36e+00

Pos logit contributions
 anything+0.071
 chores+0.060
 trouble+0.060
 something+0.059
 miles+0.057
Neg logit contributions
jo-0.137
o-0.132
ppy-0.131
y-0.127
azz-0.126
Token sure
Token ID1654
Feature activation+3.43e+00

Pos logit contributions
 anything+0.085
 chores+0.071
 trouble+0.070
 miles+0.068
 something+0.067
Neg logit contributions
jo-0.279
o-0.266
ppy-0.264
azz-0.261
y-0.260
Token he
Token ID339
Feature activation-1.86e+00

Pos logit contributions
 anything+0.035
 trouble+0.032
 chores+0.032
 miles+0.031
 top+0.030
Neg logit contributions
jo-0.094
o-0.090
os-0.089
y-0.089
azz-0.088
Token didn
Token ID1422
Feature activation+6.91e+00

Pos logit contributions
jo+0.039
azz+0.038
ppy+0.037
o+0.036
itty+0.036
Neg logit contributions
 anything-0.037
 something-0.034
 chores-0.033
 trouble-0.033
 good-0.033
Token't
Token ID470
Feature activation+7.01e+00

Pos logit contributions
 anything+0.098
 trouble+0.087
 something+0.087
 chores+0.085
 good+0.083
Neg logit contributions
jo-0.178
azz-0.170
o-0.168
ppy-0.168
ne-0.165
Token get
Token ID651
Feature activation+1.43e+01

Pos logit contributions
 anything+0.108
 something+0.094
 trouble+0.093
 good+0.092
 chores+0.092
Neg logit contributions
jo-0.184
azz-0.174
ppy-0.172
o-0.172
os-0.169
Token lost
Token ID2626
Feature activation+7.44e+00

Pos logit contributions
 anything+0.175
 chores+0.164
 top+0.158
 accomplishment+0.156
 miles+0.155
Neg logit contributions
o-0.366
jo-0.358
ppy-0.358
y-0.355
azz-0.344
Token.
Token ID13
Feature activation-3.25e+00

Pos logit contributions
 anything+0.128
 chores+0.123
 trouble+0.118
 something+0.117
 good+0.116
Neg logit contributions
jo-0.165
y-0.157
ppy-0.155
o-0.154
azz-0.152
Token As
Token ID1081
Feature activation+4.21e+00

Pos logit contributions
ppy+0.087
jo+0.085
y+0.085
ne+0.081
azz+0.080
Neg logit contributions
 anything-0.041
 something-0.040
 trouble-0.038
 many-0.038
 himself-0.037
Token he
Token ID339
Feature activation-3.95e+00

Pos logit contributions
 anything+0.058
 yourselves+0.054
 chores+0.053
 trouble+0.049
 something+0.048
Neg logit contributions
jo-0.109
o-0.104
y-0.100
azz-0.099
ppy-0.099
Token looked
Token ID3114
Feature activation+4.30e+00

Pos logit contributions
jo+0.087
azz+0.085
ppy+0.082
o+0.078
anne+0.077
Neg logit contributions
 anything-0.073
 something-0.071
 trouble-0.068
 miles-0.068
 chores-0.067
Token always
Token ID1464
Feature activation+8.71e+00

Pos logit contributions
 anything+0.107
 trouble+0.099
 something+0.096
 chores+0.094
 many+0.091
Neg logit contributions
jo-0.169
ppy-0.164
o-0.162
y-0.157
azz-0.157
Token be
Token ID307
Feature activation+1.04e+01

Pos logit contributions
 anything+0.135
 trouble+0.120
 something+0.119
 chores+0.118
 many+0.114
Neg logit contributions
jo-0.217
ppy-0.209
o-0.206
azz-0.202
y-0.201
Token careful
Token ID8161
Feature activation+3.97e+00

Pos logit contributions
 anything+0.135
 chores+0.125
 top+0.122
 bottom+0.120
 trouble+0.119
Neg logit contributions
o-0.275
ppy-0.268
jo-0.267
y-0.263
os-0.251
Token and
Token ID290
Feature activation+7.56e+00

Pos logit contributions
 anything+0.064
 trouble+0.059
 chores+0.057
 something+0.056
 miles+0.055
Neg logit contributions
jo-0.096
y-0.091
ppy-0.091
o-0.091
azz-0.088
Token never
Token ID1239
Feature activation+8.09e+00

Pos logit contributions
 anything+0.105
 trouble+0.093
 chores+0.090
 something+0.089
 top+0.087
Neg logit contributions
jo-0.195
o-0.191
ppy-0.189
y-0.185
os-0.180
Token get
Token ID651
Feature activation+1.43e+01

Pos logit contributions
 anything+0.136
 something+0.121
 trouble+0.117
 good+0.116
 chores+0.112
Neg logit contributions
jo-0.209
azz-0.194
ppy-0.189
itty-0.188
anne-0.186
Token caught
Token ID4978
Feature activation+7.44e+00

Pos logit contributions
 anything+0.167
 chores+0.151
 trouble+0.147
 accomplishment+0.144
 something+0.143
Neg logit contributions
o-0.386
jo-0.385
ppy-0.382
y-0.378
os-0.367
Token again
Token ID757
Feature activation+3.10e+00

Pos logit contributions
 anything+0.075
 chores+0.070
 trouble+0.067
 something+0.064
 accomplishment+0.063
Neg logit contributions
jo-0.213
ppy-0.209
y-0.208
o-0.206
azz-0.202
Token.
Token ID13
Feature activation-3.41e+00

Pos logit contributions
 anything+0.059
 chores+0.055
 something+0.054
 trouble+0.054
 good+0.052
Neg logit contributions
jo-0.066
ppy-0.062
o-0.061
y-0.061
azz-0.060
Token<|endoftext|>
Token ID50256
Feature activation-8.27e+00

Pos logit contributions
jo+0.086
ppy+0.083
azz+0.081
y+0.081
o+0.079
Neg logit contributions
 anything-0.053
 something-0.048
 trouble-0.048
 chores-0.046
 many-0.046
TokenOnce
Token ID7454
Feature activation-1.86e+00

Pos logit contributions
jo+0.217
ppy+0.212
azz+0.206
y+0.198
anne+0.198
Neg logit contributions
 anything-0.120
 something-0.109
 trouble-0.107
 many-0.104
 good-0.102
Token and
Token ID290
Feature activation+5.85e+00

Pos logit contributions
 anything+0.041
 chores+0.037
 something+0.036
 trouble+0.036
 good+0.035
Neg logit contributions
jo-0.058
ppy-0.055
o-0.055
azz-0.054
y-0.053
Token her
Token ID607
Feature activation-9.40e-01

Pos logit contributions
 anything+0.082
 something+0.072
 chores+0.072
 trouble+0.072
 good+0.068
Neg logit contributions
jo-0.156
ppy-0.149
o-0.147
azz-0.147
y-0.145
Token family
Token ID1641
Feature activation-9.07e+00

Pos logit contributions
jo+0.026
ppy+0.025
azz+0.025
o+0.024
y+0.024
Neg logit contributions
 anything-0.013
 something-0.011
 trouble-0.011
 chores-0.011
 good-0.010
Token will
Token ID481
Feature activation+6.22e+00

Pos logit contributions
jo+0.196
azz+0.187
ppy+0.185
o+0.178
y+0.173
Neg logit contributions
 anything-0.173
 something-0.162
 chores-0.157
 trouble-0.157
 good-0.156
Token never
Token ID1239
Feature activation+8.21e+00

Pos logit contributions
 anything+0.099
 trouble+0.090
 something+0.087
 chores+0.085
 yourselves+0.083
Neg logit contributions
jo-0.145
ppy-0.142
o-0.138
y-0.137
ne-0.136
Token get
Token ID651
Feature activation+1.43e+01

Pos logit contributions
 anything+0.128
 something+0.114
 trouble+0.113
 chores+0.112
 good+0.108
Neg logit contributions
jo-0.208
ppy-0.198
azz-0.196
o-0.196
y-0.192
Token to
Token ID284
Feature activation+9.79e+00

Pos logit contributions
 anything+0.187
 chores+0.184
 top+0.178
 accomplishment+0.177
wered+0.176
Neg logit contributions
o-0.344
y-0.341
ppy-0.340
jo-0.333
ne-0.324
Token see
Token ID766
Feature activation+7.76e+00

Pos logit contributions
 anything+0.130
 chores+0.126
 yourselves+0.122
 many+0.118
 accomplishment+0.117
Neg logit contributions
jo-0.224
ppy-0.216
y-0.213
ne-0.212
o-0.211
Token her
Token ID607
Feature activation+1.04e+00

Pos logit contributions
 anything+0.072
 chores+0.063
 trouble+0.062
 something+0.060
 good+0.057
Neg logit contributions
jo-0.238
ppy-0.228
o-0.226
azz-0.226
y-0.225
Token again
Token ID757
Feature activation+2.39e+00

Pos logit contributions
 anything+0.017
 chores+0.015
 trouble+0.015
 something+0.015
 yourselves+0.015
Neg logit contributions
jo-0.024
ppy-0.023
y-0.023
o-0.023
azz-0.023
Token.
Token ID13
Feature activation-3.05e+00

Pos logit contributions
 anything+0.045
 chores+0.042
 trouble+0.041
 something+0.041
 good+0.040
Neg logit contributions
jo-0.051
ppy-0.048
o-0.048
y-0.047
azz-0.047
Token I
Token ID314
Feature activation+6.15e-01

Pos logit contributions
jo+0.069
y+0.068
ppy+0.067
anne+0.064
azz+0.064
Neg logit contributions
 anything-0.042
 something-0.039
 yourselves-0.037
 an-0.036
 good-0.036
Token didn
Token ID1422
Feature activation+6.09e+00

Pos logit contributions
 anything+0.010
 something+0.010
 good+0.009
 trouble+0.009
 great+0.009
Neg logit contributions
jo-0.015
azz-0.014
ppy-0.014
itty-0.013
o-0.013
Token't
Token ID470
Feature activation+6.40e+00

Pos logit contributions
 anything+0.076
 something+0.066
 trouble+0.065
 chores+0.064
 good+0.061
Neg logit contributions
jo-0.172
ppy-0.164
azz-0.164
o-0.163
y-0.160
Token want
Token ID765
Feature activation+9.19e+00

Pos logit contributions
 anything+0.095
 trouble+0.084
 good+0.084
 something+0.083
 chores+0.079
Neg logit contributions
jo-0.173
azz-0.164
os-0.160
o-0.160
ppy-0.159
Token you
Token ID345
Feature activation-4.11e-01

Pos logit contributions
 anything+0.148
 something+0.133
 trouble+0.131
 chores+0.131
 good+0.126
Neg logit contributions
jo-0.228
ppy-0.217
azz-0.215
o-0.214
y-0.210
Token getting
Token ID1972
Feature activation+1.42e+01

Pos logit contributions
jo+0.011
azz+0.010
ppy+0.010
o+0.010
itty+0.010
Neg logit contributions
 anything-0.007
 something-0.006
 chores-0.006
 good-0.006
 trouble-0.005
Token too
Token ID1165
Feature activation+5.16e+00

Pos logit contributions
 anything+0.185
 accomplishment+0.172
 chores+0.171
 top+0.166
 miles+0.164
Neg logit contributions
o-0.353
ppy-0.345
y-0.345
jo-0.341
ne-0.329
Token close
Token ID1969
Feature activation+7.14e+00

Pos logit contributions
 anything+0.055
 chores+0.044
 something+0.044
 trouble+0.043
 miles+0.041
Neg logit contributions
jo-0.153
o-0.148
y-0.148
ppy-0.147
azz-0.144
Token to
Token ID284
Feature activation+8.38e+00

Pos logit contributions
 anything+0.127
 chores+0.116
 something+0.115
 trouble+0.114
 good+0.111
Neg logit contributions
jo-0.163
ppy-0.153
o-0.152
y-0.151
azz-0.151
Token the
Token ID262
Feature activation+3.71e+00

Pos logit contributions
 anything+0.094
 chores+0.085
 trouble+0.081
 something+0.081
 good+0.078
Neg logit contributions
jo-0.235
ppy-0.226
o-0.222
y-0.221
azz-0.220
Token posts
Token ID6851
Feature activation-6.54e+00

Pos logit contributions
 anything+0.052
 something+0.045
 chores+0.044
 trouble+0.044
 good+0.042
Neg logit contributions
jo-0.099
ppy-0.094
azz-0.093
o-0.093
y-0.092
Token,
Token ID11
Feature activation+2.17e+00

Pos logit contributions
 anything+0.101
 miles+0.098
 trouble+0.098
 chores+0.097
 accomplishment+0.096
Neg logit contributions
jo-0.134
y-0.133
azz-0.130
ppy-0.128
o-0.126
Token we
Token ID356
Feature activation-2.05e-01

Pos logit contributions
 anything+0.030
 chores+0.027
 trouble+0.027
 something+0.026
 good+0.026
Neg logit contributions
jo-0.054
ppy-0.052
azz-0.052
o-0.052
y-0.051
Token might
Token ID1244
Feature activation+5.99e+00

Pos logit contributions
jo+0.004
ppy+0.004
azz+0.004
o+0.004
y+0.004
Neg logit contributions
 anything-0.004
 something-0.004
 chores-0.004
 trouble-0.004
 good-0.004
Token not
Token ID407
Feature activation+7.83e+00

Pos logit contributions
 anything+0.083
 trouble+0.079
 something+0.076
 chores+0.075
 miles+0.073
Neg logit contributions
ppy-0.144
jo-0.143
y-0.139
ne-0.138
o-0.138
Token always
Token ID1464
Feature activation+9.01e+00

Pos logit contributions
 anything+0.108
 something+0.093
 chores+0.092
 trouble+0.091
 good+0.089
Neg logit contributions
jo-0.219
ppy-0.207
azz-0.206
o-0.205
os-0.200
Token get
Token ID651
Feature activation+1.42e+01

Pos logit contributions
 anything+0.137
 trouble+0.125
 something+0.124
 chores+0.124
 miles+0.119
Neg logit contributions
jo-0.213
ppy-0.207
y-0.201
azz-0.201
o-0.200
Token the
Token ID262
Feature activation+4.21e+00

Pos logit contributions
 anything+0.158
wered+0.154
 middle+0.152
 chores+0.152
 top+0.152
Neg logit contributions
y-0.365
o-0.360
ppy-0.358
jo-0.351
ne-0.350
Token result
Token ID1255
Feature activation+1.00e+00

Pos logit contributions
 anything+0.056
 chores+0.053
 miles+0.050
 trouble+0.049
 yourselves+0.049
Neg logit contributions
jo-0.099
ne-0.099
y-0.098
ppy-0.096
azz-0.095
Token we
Token ID356
Feature activation+3.60e-01

Pos logit contributions
 anything+0.018
 something+0.016
 trouble+0.016
 chores+0.016
 good+0.015
Neg logit contributions
jo-0.023
ppy-0.022
azz-0.021
o-0.021
y-0.021
Token wanted
Token ID2227
Feature activation+7.99e+00

Pos logit contributions
 anything+0.006
 something+0.005
 chores+0.005
 trouble+0.005
 good+0.005
Neg logit contributions
jo-0.009
ppy-0.009
azz-0.009
o-0.009
y-0.008
Token.
Token ID13
Feature activation-3.89e+00

Pos logit contributions
 anything+0.143
 chores+0.130
 trouble+0.130
 something+0.129
 good+0.124
Neg logit contributions
jo-0.180
ppy-0.171
azz-0.169
o-0.168
y-0.166
Token destroying
Token ID13897
Feature activation+5.66e+00

Pos logit contributions
 anything+0.114
 bottom+0.108
wered+0.106
 miles+0.105
 middle+0.104
Neg logit contributions
o-0.270
y-0.255
ppy-0.253
jo-0.249
os-0.241
Token things
Token ID1243
Feature activation-1.91e+00

Pos logit contributions
 anything+0.040
 chores+0.040
 accomplishment+0.039
 trouble+0.039
 miles+0.038
Neg logit contributions
y-0.165
jo-0.163
ppy-0.162
o-0.159
os-0.157
Token it
Token ID340
Feature activation-3.33e+00

Pos logit contributions
ppy+0.042
jo+0.042
y+0.042
o+0.041
os+0.040
Neg logit contributions
 anything-0.030
 trouble-0.029
 chores-0.028
 something-0.027
 miles-0.027
Token would
Token ID561
Feature activation+7.54e+00

Pos logit contributions
jo+0.064
ppy+0.061
o+0.060
azz+0.060
y+0.059
Neg logit contributions
 anything-0.069
 trouble-0.064
 chores-0.064
 something-0.063
 good-0.061
Token never
Token ID1239
Feature activation+8.37e+00

Pos logit contributions
 anything+0.113
 yourselves+0.106
 trouble+0.105
 something+0.105
 many+0.103
Neg logit contributions
ppy-0.165
y-0.160
o-0.159
jo-0.158
ne-0.153
Token get
Token ID651
Feature activation+1.42e+01

Pos logit contributions
 anything+0.123
 something+0.109
 trouble+0.107
 chores+0.106
 good+0.103
Neg logit contributions
jo-0.222
ppy-0.210
azz-0.209
o-0.208
y-0.203
Token the
Token ID262
Feature activation+4.13e+00

Pos logit contributions
 anything+0.184
 chores+0.180
 top+0.174
 middle+0.173
wered+0.173
Neg logit contributions
y-0.344
ppy-0.343
o-0.340
jo-0.325
os-0.319
Token butter
Token ID9215
Feature activation-7.23e+00

Pos logit contributions
 anything+0.057
 something+0.050
 chores+0.049
 yourselves+0.049
 miles+0.048
Neg logit contributions
jo-0.102
ppy-0.102
y-0.101
o-0.099
ne-0.097
Token it
Token ID340
Feature activation-3.43e+00

Pos logit contributions
jo+0.173
ppy+0.171
y+0.168
o+0.166
azz+0.164
Neg logit contributions
 anything-0.112
 trouble-0.105
 chores-0.103
 something-0.102
 many-0.098
Token wanted
Token ID2227
Feature activation+7.07e+00

Pos logit contributions
jo+0.077
ppy+0.074
o+0.073
azz+0.072
y+0.072
Neg logit contributions
 anything-0.061
 trouble-0.056
 chores-0.056
 something-0.055
 good-0.053
Token.
Token ID13
Feature activation-3.64e+00

Pos logit contributions
 anything+0.124
 chores+0.115
 trouble+0.115
 something+0.112
 yourselves+0.109
Neg logit contributions
jo-0.157
ppy-0.151
o-0.148
y-0.148
azz-0.146
Token.
Token ID13
Feature activation-3.54e+00

Pos logit contributions
jo+0.045
azz+0.042
ppy+0.042
o+0.041
itty+0.041
Neg logit contributions
 anything-0.028
 something-0.026
 good-0.025
 many-0.024
 trouble-0.024
Token
Token ID198
Feature activation-3.16e+00

Pos logit contributions
jo+0.094
ppy+0.090
azz+0.087
y+0.086
o+0.086
Neg logit contributions
 anything-0.052
 something-0.048
 trouble-0.046
 many-0.046
 good-0.045
Token
Token ID198
Feature activation-3.64e+00

Pos logit contributions
jo+0.079
ppy+0.075
azz+0.074
o+0.073
y+0.073
Neg logit contributions
 anything-0.050
 something-0.046
 trouble-0.045
 chores-0.044
 good-0.044
TokenThe
Token ID464
Feature activation-1.18e+00

Pos logit contributions
jo+0.085
ppy+0.080
azz+0.079
o+0.079
y+0.077
Neg logit contributions
 anything-0.065
 something-0.059
 trouble-0.058
 chores-0.058
 good-0.057
Token great
Token ID1049
Feature activation-1.44e+00

Pos logit contributions
jo+0.023
ppy+0.022
azz+0.022
o+0.022
y+0.021
Neg logit contributions
 anything-0.025
 something-0.023
 chores-0.023
 trouble-0.022
 miles-0.022
Token Zen
Token ID14760
Feature activation-1.99e+01

Pos logit contributions
jo+0.032
y+0.030
azz+0.030
o+0.030
ppy+0.029
Neg logit contributions
 anything-0.027
 chores-0.024
 something-0.024
 trouble-0.024
 miles-0.023
Tokeno
Token ID78
Feature activation-1.26e+01

Pos logit contributions
jo+0.219
ppy+0.211
ksh+0.195
azz+0.195
pillar+0.186
Neg logit contributions
 anything-0.524
 something-0.518
 not-0.512
 good-0.497
 the-0.495
Token hugs
Token ID40657
Feature activation+1.29e+00

Pos logit contributions
jo+0.325
azz+0.294
ppy+0.288
itty+0.284
anne+0.281
Neg logit contributions
 good-0.192
 something-0.191
 anything-0.190
 top-0.179
 many-0.178
Token Ben
Token ID3932
Feature activation-1.32e+01

Pos logit contributions
 anything+0.015
 something+0.013
 trouble+0.012
 good+0.012
 chores+0.011
Neg logit contributions
jo-0.039
ppy-0.037
o-0.037
azz-0.037
anne-0.036
Token and
Token ID290
Feature activation+3.91e+00

Pos logit contributions
jo+0.357
azz+0.344
itty+0.329
anne+0.326
os+0.323
Neg logit contributions
 anything-0.183
 something-0.182
 good-0.179
 himself-0.175
 many-0.172
Token says
Token ID1139
Feature activation+1.02e+00

Pos logit contributions
 anything+0.082
 something+0.076
 trouble+0.075
 chores+0.074
 good+0.073
Neg logit contributions
jo-0.079
ppy-0.075
azz-0.073
o-0.073
y-0.071
Token
Token ID198
Feature activation-2.99e+00

Pos logit contributions
jo+0.062
ppy+0.060
azz+0.059
o+0.058
y+0.058
Neg logit contributions
 anything-0.038
 something-0.035
 many-0.034
 trouble-0.034
 chores-0.033
TokenOne
Token ID3198
Feature activation-3.32e+00

Pos logit contributions
jo+0.076
ppy+0.073
azz+0.072
o+0.071
y+0.070
Neg logit contributions
 anything-0.047
 something-0.042
 chores-0.042
 trouble-0.041
 many-0.040
Token day
Token ID1110
Feature activation-2.80e+00

Pos logit contributions
jo+0.105
azz+0.102
ppy+0.100
os+0.097
ne+0.097
Neg logit contributions
 anything-0.031
 good-0.028
 something-0.028
 many-0.026
 chores-0.025
Token,
Token ID11
Feature activation-1.28e+00

Pos logit contributions
jo+0.066
o+0.063
ppy+0.063
azz+0.062
y+0.062
Neg logit contributions
 anything-0.047
 chores-0.042
 trouble-0.042
 something-0.041
 yourselves-0.040
Token the
Token ID262
Feature activation-2.62e+00

Pos logit contributions
o+0.038
y+0.037
jo+0.036
ppy+0.036
ne+0.036
Neg logit contributions
 anything-0.011
 chores-0.011
 trouble-0.010
 yourselves-0.010
 accomplishment-0.009
Token hipp
Token ID18568
Feature activation-1.98e+01

Pos logit contributions
o+0.047
jo+0.045
y+0.043
itty+0.041
ppy+0.041
Neg logit contributions
 anything-0.060
 chores-0.054
 something-0.054
 trouble-0.053
 yourselves-0.051
Tokeno
Token ID78
Feature activation-1.44e+01

Pos logit contributions
jo+0.188
ppy+0.171
azz+0.169
anne+0.162
anna+0.147
Neg logit contributions
 many-0.587
 anything-0.552
 something-0.548
 things-0.546
 others-0.546
Token saw
Token ID2497
Feature activation+3.53e+00

Pos logit contributions
jo+0.337
azz+0.323
anne+0.300
ppy+0.297
itty+0.295
Neg logit contributions
 something-0.239
 anything-0.238
 many-0.233
 good-0.227
 trouble-0.224
Token a
Token ID257
Feature activation+1.29e+00

Pos logit contributions
 chores+0.032
 accomplishment+0.032
 anything+0.031
 trouble+0.028
 miles+0.028
Neg logit contributions
o-0.107
y-0.100
ppy-0.098
os-0.098
jo-0.097
Token little
Token ID1310
Feature activation-6.77e-01

Pos logit contributions
 anything+0.014
 chores+0.012
 something+0.012
 trouble+0.011
 yourselves+0.011
Neg logit contributions
jo-0.037
o-0.036
ppy-0.036
azz-0.036
y-0.036
Token bird
Token ID6512
Feature activation-1.27e+01

Pos logit contributions
jo+0.016
azz+0.015
ppy+0.015
anne+0.014
os+0.014
Neg logit contributions
 anything-0.012
 something-0.012
 trouble-0.012
 good-0.012
 things-0.011
Token food
Token ID2057
Feature activation-6.32e+00

Pos logit contributions
 anything+0.060
 yourselves+0.050
 miles+0.050
 nothing+0.048
 trouble+0.048
Neg logit contributions
jo-0.123
o-0.122
y-0.121
ppy-0.121
ne-0.118
Token.
Token ID13
Feature activation-4.36e+00

Pos logit contributions
o+0.143
jo+0.143
ppy+0.142
y+0.140
os+0.136
Neg logit contributions
 anything-0.107
 trouble-0.098
 chores-0.095
 miles-0.093
 yourselves-0.092
Token
Token ID198
Feature activation-2.54e+00

Pos logit contributions
jo+0.113
ppy+0.109
azz+0.105
y+0.104
anne+0.103
Neg logit contributions
 anything-0.067
 something-0.063
 many-0.060
 trouble-0.058
 good-0.058
Token
Token ID198
Feature activation-2.94e+00

Pos logit contributions
jo+0.066
ppy+0.066
azz+0.063
y+0.062
o+0.061
Neg logit contributions
 anything-0.034
 something-0.032
 many-0.032
 trouble-0.031
 good-0.030
TokenThe
Token ID464
Feature activation-2.53e+00

Pos logit contributions
jo+0.075
ppy+0.072
azz+0.071
os+0.070
o+0.070
Neg logit contributions
 anything-0.044
 something-0.040
 chores-0.039
 good-0.039
 many-0.039
Token hipp
Token ID18568
Feature activation-1.97e+01

Pos logit contributions
o+0.049
jo+0.049
y+0.047
ppy+0.045
itty+0.045
Neg logit contributions
 anything-0.053
 something-0.047
 trouble-0.047
 chores-0.047
 yourselves-0.044
Tokeno
Token ID78
Feature activation-1.41e+01

Pos logit contributions
ppy+0.192
azz+0.189
jo+0.188
anne+0.165
ches+0.157
Neg logit contributions
 many-0.563
 things-0.528
 something-0.525
 others-0.522
 himself-0.522
Token went
Token ID1816
Feature activation+3.01e+00

Pos logit contributions
azz+0.337
jo+0.321
itty+0.285
ppy+0.283
ne+0.282
Neg logit contributions
 many-0.229
 good-0.223
 something-0.221
 things-0.209
 an-0.208
Token to
Token ID284
Feature activation+4.73e+00

Pos logit contributions
 anything+0.058
 trouble+0.053
 yourselves+0.052
 chores+0.051
 something+0.051
Neg logit contributions
o-0.064
jo-0.063
ppy-0.061
y-0.060
azz-0.059
Token the
Token ID262
Feature activation+1.30e+00

Pos logit contributions
 anything+0.055
 accomplishment+0.050
 chores+0.050
 best+0.049
 End+0.049
Neg logit contributions
o-0.120
ppy-0.108
jo-0.107
os-0.106
bug-0.104
Token kitchen
Token ID9592
Feature activation-5.08e+00

Pos logit contributions
 anything+0.018
 accomplishment+0.016
 trouble+0.016
 chores+0.015
 miles+0.015
Neg logit contributions
o-0.033
jo-0.033
y-0.032
ppy-0.032
itty-0.031
TokenOne
Token ID3198
Feature activation-2.37e+00

Pos logit contributions
jo+0.220
ppy+0.216
azz+0.206
anne+0.203
y+0.198
Neg logit contributions
 anything-0.111
 something-0.102
 trouble-0.100
 many-0.098
 good-0.094
Token day
Token ID1110
Feature activation-1.73e+00

Pos logit contributions
jo+0.072
ppy+0.070
azz+0.069
o+0.067
y+0.066
Neg logit contributions
 anything-0.026
 something-0.023
 trouble-0.022
 chores-0.022
 good-0.021
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
jo+0.041
ppy+0.040
o+0.039
azz+0.039
y+0.039
Neg logit contributions
 anything-0.029
 chores-0.026
 trouble-0.026
 something-0.025
 good-0.024
Token little
Token ID1310
Feature activation-4.45e+00

Pos logit contributions
y+0.032
o+0.032
jo+0.031
ppy+0.031
azz+0.030
Neg logit contributions
 anything-0.009
 chores-0.008
 trouble-0.008
 accomplishment-0.007
 yourselves-0.007
Token Jo
Token ID5302
Feature activation-1.61e+01

Pos logit contributions
jo+0.115
os+0.107
azz+0.106
ppy+0.106
anne+0.103
Neg logit contributions
 anything-0.068
 trouble-0.065
 good-0.065
 something-0.064
 things-0.063
Tokenanne
Token ID21952
Feature activation-1.97e+01

Pos logit contributions
azz+0.297
oke+0.286
ppy+0.284
ksh+0.283
lets+0.277
Neg logit contributions
 the-0.298
 himself-0.297
 his-0.293
 anything-0.289
 an-0.288
Token was
Token ID373
Feature activation+5.70e+00

Pos logit contributions
azz+0.508
ppy+0.490
jo+0.468
oke+0.456
ksh+0.455
Neg logit contributions
 something-0.253
 an-0.251
 many-0.250
 trouble-0.250
 the-0.250
Token r
Token ID374
Feature activation-2.31e+00

Pos logit contributions
 anything+0.088
 avail+0.080
 yourselves+0.078
 accomplishment+0.077
 ourselves+0.076
Neg logit contributions
o-0.137
itty-0.127
y-0.127
ppy-0.127
jo-0.124
Tokenaking
Token ID868
Feature activation-2.07e-01

Pos logit contributions
jo+0.042
azz+0.038
ppy+0.038
anne+0.036
o+0.036
Neg logit contributions
 anything-0.052
 trouble-0.048
 something-0.048
 many-0.048
 chores-0.048
Token the
Token ID262
Feature activation+2.39e+00

Pos logit contributions
y+0.005
jo+0.005
ppy+0.005
o+0.005
ne+0.004
Neg logit contributions
 anything-0.003
 trouble-0.003
 chores-0.003
 accomplishment-0.003
 yourselves-0.003
Token leaves
Token ID5667
Feature activation-7.95e+00

Pos logit contributions
 anything+0.034
 something+0.030
 chores+0.030
 trouble+0.030
 good+0.029
Neg logit contributions
jo-0.064
ppy-0.061
azz-0.061
o-0.060
y-0.059
Token.
Token ID13
Feature activation-3.97e+00

Pos logit contributions
 anything+0.020
 something+0.018
 trouble+0.017
 chores+0.017
 good+0.017
Neg logit contributions
jo-0.034
ppy-0.033
azz-0.032
o-0.032
y-0.031
Token
Token ID220
Feature activation-3.19e+00

Pos logit contributions
jo+0.102
ppy+0.098
y+0.094
anne+0.094
azz+0.091
Neg logit contributions
 anything-0.061
 something-0.058
 good-0.053
 himself-0.053
 many-0.053
Token
Token ID198
Feature activation-1.39e+00

Pos logit contributions
jo+0.083
ppy+0.080
y+0.078
anne+0.077
azz+0.076
Neg logit contributions
 anything-0.047
 something-0.043
 many-0.041
 chores-0.041
 trouble-0.040
Token
Token ID198
Feature activation-2.12e+00

Pos logit contributions
jo+0.037
ppy+0.036
y+0.035
azz+0.035
anne+0.034
Neg logit contributions
 anything-0.019
 something-0.018
 many-0.017
 good-0.017
 trouble-0.017
TokenThe
Token ID464
Feature activation-1.57e+00

Pos logit contributions
jo+0.054
ppy+0.051
azz+0.050
y+0.049
os+0.049
Neg logit contributions
 anything-0.032
 something-0.030
 good-0.029
 chores-0.029
 trouble-0.029
Token flaming
Token ID39518
Feature activation-1.96e+01

Pos logit contributions
jo+0.038
ppy+0.036
azz+0.036
o+0.035
y+0.035
Neg logit contributions
 anything-0.026
 something-0.024
 trouble-0.024
 chores-0.023
 good-0.023
Tokeno
Token ID78
Feature activation-1.26e+01

Pos logit contributions
ksh+0.174
ppy+0.161
azz+0.141
ches+0.132
jo+0.124
Neg logit contributions
 many-0.491
 best-0.490
 most-0.488
 something-0.478
 not-0.474
Token nodded
Token ID14464
Feature activation+1.02e+00

Pos logit contributions
azz+0.282
jo+0.279
ppy+0.269
itty+0.258
enny+0.246
Neg logit contributions
 something-0.199
 top-0.194
 many-0.192
 the-0.189
 good-0.189
Token its
Token ID663
Feature activation+2.28e+00

Pos logit contributions
 anything+0.017
 something+0.015
 trouble+0.015
 chores+0.015
 good+0.014
Neg logit contributions
jo-0.025
ppy-0.024
azz-0.024
o-0.023
y-0.023
Token head
Token ID1182
Feature activation-2.92e+00

Pos logit contributions
 anything+0.046
 something+0.042
 trouble+0.041
 chores+0.041
 many+0.039
Neg logit contributions
jo-0.048
ppy-0.045
o-0.044
azz-0.044
y-0.043
Token and
Token ID290
Feature activation+4.73e+00

Pos logit contributions
jo+0.071
azz+0.067
ppy+0.066
o+0.065
itty+0.065
Neg logit contributions
 anything-0.048
 something-0.046
 good-0.043
 many-0.043
 top-0.042
Token
Token ID198
Feature activation-3.00e+00

Pos logit contributions
jo+0.060
ppy+0.057
azz+0.056
o+0.056
y+0.055
Neg logit contributions
 anything-0.041
 something-0.037
 trouble-0.036
 chores-0.036
 good-0.035
TokenOne
Token ID3198
Feature activation-3.33e+00

Pos logit contributions
jo+0.074
ppy+0.071
azz+0.070
o+0.069
y+0.068
Neg logit contributions
 anything-0.049
 something-0.044
 chores-0.044
 trouble-0.044
 good-0.042
Token day
Token ID1110
Feature activation-2.95e+00

Pos logit contributions
jo+0.105
azz+0.102
ppy+0.101
os+0.097
ne+0.097
Neg logit contributions
 anything-0.031
 something-0.028
 good-0.028
 many-0.026
 chores-0.025
Token,
Token ID11
Feature activation-1.35e+00

Pos logit contributions
jo+0.070
o+0.067
ppy+0.067
y+0.066
azz+0.066
Neg logit contributions
 anything-0.049
 chores-0.045
 trouble-0.045
 something-0.044
 yourselves-0.042
Token the
Token ID262
Feature activation-2.58e+00

Pos logit contributions
o+0.040
y+0.038
jo+0.038
ppy+0.037
os+0.037
Neg logit contributions
 anything-0.013
 chores-0.013
 trouble-0.012
 yourselves-0.012
 accomplishment-0.011
Token hipp
Token ID18568
Feature activation-1.95e+01

Pos logit contributions
o+0.047
jo+0.044
y+0.043
itty+0.041
ppy+0.040
Neg logit contributions
 anything-0.058
 chores-0.052
 something-0.052
 trouble-0.052
 yourselves-0.050
Tokeno
Token ID78
Feature activation-1.42e+01

Pos logit contributions
jo+0.179
ppy+0.169
azz+0.162
anne+0.156
anna+0.141
Neg logit contributions
 many-0.579
 anything-0.544
 something-0.542
 things-0.538
 others-0.538
Token saw
Token ID2497
Feature activation+3.69e+00

Pos logit contributions
jo+0.328
azz+0.310
anne+0.295
ppy+0.294
itty+0.291
Neg logit contributions
 something-0.237
 anything-0.237
 many-0.232
 trouble-0.226
 good-0.222
Token a
Token ID257
Feature activation+1.35e+00

Pos logit contributions
 chores+0.034
 accomplishment+0.033
 anything+0.033
 trouble+0.029
 miles+0.028
Neg logit contributions
o-0.113
y-0.104
jo-0.103
os-0.103
ppy-0.103
Token new
Token ID649
Feature activation-1.78e-01

Pos logit contributions
 anything+0.014
 chores+0.011
 something+0.011
 trouble+0.011
 yourselves+0.011
Neg logit contributions
jo-0.040
o-0.039
ppy-0.039
azz-0.039
y-0.039
Token friend
Token ID1545
Feature activation-8.18e+00

Pos logit contributions
jo+0.004
ppy+0.004
azz+0.004
o+0.004
os+0.004
Neg logit contributions
 anything-0.003
 something-0.003
 trouble-0.003
 chores-0.003
 good-0.003
Token was
Token ID373
Feature activation+5.49e+00

Pos logit contributions
jo+0.095
o+0.090
y+0.088
ppy+0.086
azz+0.085
Neg logit contributions
 anything-0.110
 chores-0.103
 trouble-0.101
 something-0.100
 yourselves-0.098
Token a
Token ID257
Feature activation-1.72e-01

Pos logit contributions
 anything+0.062
 chores+0.053
 yourselves+0.053
 accomplishment+0.050
 danger+0.048
Neg logit contributions
o-0.152
jo-0.147
y-0.147
ppy-0.143
azz-0.141
Token little
Token ID1310
Feature activation-2.48e+00

Pos logit contributions
jo+0.005
ppy+0.005
os+0.005
azz+0.005
o+0.005
Neg logit contributions
 anything-0.002
 good-0.002
 something-0.002
 great-0.002
 trouble-0.002
Token fish
Token ID5916
Feature activation-1.48e+01

Pos logit contributions
jo+0.062
azz+0.057
os+0.056
anne+0.056
ppy+0.056
Neg logit contributions
 good-0.043
 anything-0.040
 something-0.040
 trouble-0.040
 things-0.040
Token named
Token ID3706
Feature activation-6.29e-01

Pos logit contributions
jo+0.320
azz+0.314
ppy+0.304
anne+0.302
oke+0.284
Neg logit contributions
 many-0.272
 anything-0.270
 something-0.267
 good-0.254
 things-0.249
Token Nem
Token ID22547
Feature activation-1.95e+01

Pos logit contributions
jo+0.016
azz+0.015
ppy+0.015
y+0.015
anne+0.015
Neg logit contributions
 anything-0.010
 something-0.009
 good-0.009
 many-0.008
 trouble-0.008
Tokeno
Token ID78
Feature activation-1.29e+01

Pos logit contributions
jo+0.156
anne+0.143
azz+0.140
oke+0.125
ol+0.125
Neg logit contributions
 something-0.559
 many-0.555
 anything-0.554
 an-0.544
 good-0.544
Token.
Token ID13
Feature activation-4.26e+00

Pos logit contributions
jo+0.327
azz+0.325
anne+0.313
ppy+0.311
oke+0.301
Neg logit contributions
 anything-0.188
 something-0.179
 many-0.178
 an-0.164
 good-0.164
Token Nem
Token ID22547
Feature activation-1.90e+01

Pos logit contributions
y+0.088
o+0.088
jo+0.085
azz+0.082
ppy+0.082
Neg logit contributions
 anything-0.081
 chores-0.077
 trouble-0.076
 yourselves-0.072
 danger-0.072
Tokeno
Token ID78
Feature activation-1.32e+01

Pos logit contributions
azz+0.159
jo+0.151
anne+0.143
anna+0.135
ppy+0.133
Neg logit contributions
 many-0.525
 something-0.523
 himself-0.521
 good-0.521
 an-0.511
Token was
Token ID373
Feature activation+6.51e+00

Pos logit contributions
azz+0.323
jo+0.318
ppy+0.313
itty+0.298
anne+0.298
Neg logit contributions
 anything-0.202
 many-0.200
 something-0.194
 good-0.190
 great-0.185
Token very
Token ID845
Feature activation+6.21e+00

Pos logit contributions
 anything+0.078
 trouble+0.067
 chores+0.067
 accomplishment+0.065
 yourselves+0.063
Neg logit contributions
o-0.200
jo-0.195
y-0.190
ppy-0.189
itty-0.182
Token hungry
Token ID14720
Feature activation-4.30e-02

Pos logit contributions
 anything+0.076
 something+0.063
 trouble+0.063
 chores+0.063
 miles+0.058
Neg logit contributions
o-0.175
jo-0.175
ppy-0.170
y-0.169
azz-0.165
Token.
Token ID13
Feature activation-3.84e+00

Pos logit contributions
jo+0.001
ppy+0.001
o+0.001
y+0.001
azz+0.001
Neg logit contributions
 anything-0.001
 chores-0.001
 trouble-0.001
 something-0.001
 yourselves-0.001
Token So
Token ID1406
Feature activation-1.24e+00

Pos logit contributions
jo+0.099
ppy+0.096
azz+0.093
ne+0.091
y+0.090
Neg logit contributions
 anything-0.059
 something-0.055
 many-0.052
 trouble-0.052
 good-0.051
Token the
Token ID262
Feature activation-2.64e+00

Pos logit contributions
o+0.024
y+0.023
jo+0.023
ah+0.022
ppy+0.021
Neg logit contributions
 anything-0.024
 chores-0.024
 yourselves-0.024
 trouble-0.024
 miles-0.022
Token hipp
Token ID18568
Feature activation-1.94e+01

Pos logit contributions
o+0.044
jo+0.043
y+0.042
itty+0.039
ppy+0.039
Neg logit contributions
 anything-0.062
 chores-0.057
 trouble-0.056
 something-0.056
 miles-0.054
Tokeno
Token ID78
Feature activation-1.40e+01

Pos logit contributions
azz+0.195
ppy+0.195
jo+0.188
anne+0.167
ems+0.167
Neg logit contributions
 many-0.549
 himself-0.511
 others-0.507
 things-0.503
 most-0.503
Token decided
Token ID3066
Feature activation+1.08e+00

Pos logit contributions
azz+0.311
jo+0.307
itty+0.284
ne+0.274
ppy+0.274
Neg logit contributions
 something-0.244
 anything-0.243
 many-0.242
 good-0.235
 an-0.230
Token to
Token ID284
Feature activation+4.30e+00

Pos logit contributions
 anything+0.022
 chores+0.021
 trouble+0.021
 accomplishment+0.020
 yourselves+0.020
Neg logit contributions
jo-0.020
o-0.019
y-0.019
ppy-0.018
azz-0.018
Token cook
Token ID4255
Feature activation-5.61e-01

Pos logit contributions
 anything+0.073
 trouble+0.065
 accomplishment+0.065
 miles+0.064
 something+0.064
Neg logit contributions
jo-0.093
ppy-0.092
o-0.092
y-0.089
os-0.086
Token some
Token ID617
Feature activation+4.77e+00

Pos logit contributions
o+0.015
ppy+0.014
y+0.014
os+0.013
jo+0.013
Neg logit contributions
 trouble-0.006
 accomplishment-0.005
 miles-0.005
 anything-0.005
 comparison-0.005
Token.
Token ID13
Feature activation-4.11e+00

Pos logit contributions
 anything+0.017
 something+0.015
 chores+0.015
 trouble+0.015
 good+0.015
Neg logit contributions
jo-0.019
ppy-0.018
azz-0.018
o-0.018
y-0.017
Token
Token ID220
Feature activation-4.63e+00

Pos logit contributions
ppy+0.121
jo+0.118
y+0.115
azz+0.113
anne+0.111
Neg logit contributions
 many-0.047
 anything-0.044
 himself-0.044
 something-0.044
 top-0.042
Token
Token ID198
Feature activation-2.57e+00

Pos logit contributions
jo+0.116
ppy+0.111
azz+0.109
y+0.107
o+0.107
Neg logit contributions
 anything-0.073
 something-0.066
 trouble-0.064
 chores-0.064
 many-0.063
Token
Token ID198
Feature activation-3.08e+00

Pos logit contributions
jo+0.067
ppy+0.066
azz+0.064
y+0.063
o+0.062
Neg logit contributions
 anything-0.036
 many-0.033
 something-0.033
 good-0.032
 trouble-0.032
TokenThe
Token ID464
Feature activation-2.41e+00

Pos logit contributions
jo+0.086
ppy+0.082
azz+0.081
o+0.081
y+0.079
Neg logit contributions
 anything-0.041
 something-0.036
 chores-0.036
 trouble-0.035
 good-0.034
Token hipp
Token ID18568
Feature activation-1.94e+01

Pos logit contributions
o+0.041
jo+0.039
y+0.038
itty+0.036
ppy+0.036
Neg logit contributions
 anything-0.057
 something-0.051
 trouble-0.051
 chores-0.051
 yourselves-0.049
Tokeno
Token ID78
Feature activation-1.38e+01

Pos logit contributions
azz+0.191
ppy+0.189
jo+0.188
ches+0.167
anne+0.163
Neg logit contributions
 many-0.541
 others-0.508
 things-0.507
 something-0.503
 most-0.502
Token found
Token ID1043
Feature activation+4.23e+00

Pos logit contributions
azz+0.342
jo+0.323
itty+0.304
ppy+0.287
uddy+0.285
Neg logit contributions
 many-0.212
 something-0.203
 good-0.201
 top-0.195
 the-0.193
Token some
Token ID617
Feature activation+4.58e+00

Pos logit contributions
 chores+0.051
 avail+0.050
 accomplishment+0.046
 yourselves+0.044
 achieve+0.044
Neg logit contributions
o-0.116
os-0.105
y-0.104
ray-0.103
ppy-0.101
Token vegetables
Token ID13701
Feature activation-8.31e+00

Pos logit contributions
 anything+0.064
 yourselves+0.057
 miles+0.054
 accomplishment+0.052
 nothing+0.051
Neg logit contributions
y-0.112
ppy-0.108
jo-0.108
o-0.107
ne-0.105
Token and
Token ID290
Feature activation+4.32e+00

Pos logit contributions
y+0.160
o+0.158
ppy+0.156
jo+0.153
os+0.148
Neg logit contributions
 anything-0.171
 trouble-0.159
 chores-0.158
 yourselves-0.156
 miles-0.151
TokenOne
Token ID3198
Feature activation-3.00e+00

Pos logit contributions
jo+0.241
ppy+0.235
azz+0.228
anne+0.221
o+0.215
Neg logit contributions
 anything-0.118
 something-0.110
 trouble-0.104
 many-0.103
 top-0.099
Token day
Token ID1110
Feature activation-1.99e+00

Pos logit contributions
jo+0.091
ppy+0.089
azz+0.089
o+0.085
ne+0.084
Neg logit contributions
 anything-0.031
 something-0.028
 chores-0.026
 trouble-0.026
 good-0.026
Token,
Token ID11
Feature activation-1.14e+00

Pos logit contributions
jo+0.047
ppy+0.045
o+0.045
y+0.045
azz+0.044
Neg logit contributions
 anything-0.033
 trouble-0.030
 chores-0.030
 something-0.029
 yourselves-0.028
Token Tom
Token ID4186
Feature activation-1.63e+01

Pos logit contributions
jo+0.038
o+0.037
ppy+0.036
y+0.036
azz+0.036
Neg logit contributions
 anything-0.009
 chores-0.007
 trouble-0.007
 something-0.006
 good-0.006
Token the
Token ID262
Feature activation-9.84e-01

Pos logit contributions
jo+0.365
azz+0.357
ppy+0.351
ne+0.334
itty+0.329
Neg logit contributions
 anything-0.282
 something-0.277
 trouble-0.274
 good-0.260
 many-0.259
Token Toad
Token ID42616
Feature activation-1.94e+01

Pos logit contributions
jo+0.030
ppy+0.029
azz+0.028
o+0.028
os+0.028
Neg logit contributions
 anything-0.010
 trouble-0.010
 chores-0.009
 good-0.009
 something-0.009
Token went
Token ID1816
Feature activation+3.48e+00

Pos logit contributions
azz+0.440
jo+0.439
ppy+0.430
anne+0.411
itty+0.406
Neg logit contributions
 anything-0.314
 something-0.313
 many-0.305
 things-0.294
 good-0.294
Token out
Token ID503
Feature activation+5.49e-01

Pos logit contributions
 anything+0.059
 trouble+0.053
 something+0.052
 chores+0.052
 good+0.050
Neg logit contributions
jo-0.083
ppy-0.079
o-0.079
azz-0.079
y-0.077
Token for
Token ID329
Feature activation+4.71e+00

Pos logit contributions
 anything+0.010
 chores+0.008
 something+0.008
 yourselves+0.008
 trouble+0.008
Neg logit contributions
jo-0.013
o-0.012
y-0.012
os-0.012
azz-0.012
Token a
Token ID257
Feature activation+1.73e+00

Pos logit contributions
 anything+0.042
 yourselves+0.037
 bottom+0.035
 accomplishment+0.035
 chores+0.034
Neg logit contributions
jo-0.134
o-0.130
ne-0.126
azz-0.126
ppy-0.126
Token jog
Token ID48342
Feature activation-3.01e+00

Pos logit contributions
 anything+0.026
 great+0.025
 good+0.025
 something+0.025
 chores+0.024
Neg logit contributions
jo-0.045
ppy-0.044
os-0.043
azz-0.043
o-0.041
Token
Token ID198
Feature activation-2.81e+00

Pos logit contributions
jo+0.054
ppy+0.052
azz+0.051
o+0.050
y+0.050
Neg logit contributions
 anything-0.036
 something-0.032
 trouble-0.032
 chores-0.032
 many-0.031
TokenBut
Token ID1537
Feature activation-1.09e+00

Pos logit contributions
jo+0.072
ppy+0.069
o+0.069
azz+0.068
y+0.067
Neg logit contributions
 anything-0.044
 trouble-0.039
 chores-0.038
 something-0.038
 accomplishment-0.035
Token then
Token ID788
Feature activation-3.77e-01

Pos logit contributions
o+0.027
y+0.025
ppy+0.025
ah+0.025
bug+0.024
Neg logit contributions
 chores-0.016
 trouble-0.015
 anything-0.015
 yourselves-0.014
 mischief-0.014
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
jo+0.008
o+0.008
ppy+0.008
y+0.008
azz+0.007
Neg logit contributions
 anything-0.007
 chores-0.006
 yourselves-0.006
 trouble-0.006
 accomplishment-0.006
Token the
Token ID262
Feature activation-2.90e+00

Pos logit contributions
o+0.031
y+0.029
ne+0.029
ah+0.029
ppy+0.028
Neg logit contributions
 chores-0.013
 yourselves-0.013
 accomplishment-0.012
 trouble-0.011
 comparison-0.011
Token hipp
Token ID18568
Feature activation-1.94e+01

Pos logit contributions
o+0.052
jo+0.050
y+0.049
itty+0.048
ne+0.046
Neg logit contributions
 anything-0.064
 chores-0.058
 something-0.057
 trouble-0.057
 yourselves-0.055
Tokeno
Token ID78
Feature activation-1.38e+01

Pos logit contributions
jo+0.208
anna+0.190
ppy+0.189
anne+0.184
azz+0.179
Neg logit contributions
 many-0.522
 others-0.492
 something-0.492
 anything-0.489
 not-0.484
Token went
Token ID1816
Feature activation+3.44e+00

Pos logit contributions
jo+0.330
azz+0.299
itty+0.299
anne+0.298
os+0.283
Neg logit contributions
 something-0.224
 anything-0.219
 trouble-0.208
 not-0.206
 good-0.204
Token to
Token ID284
Feature activation+4.98e+00

Pos logit contributions
 anything+0.064
 accomplishment+0.062
 chores+0.060
 yourselves+0.059
 avail+0.058
Neg logit contributions
o-0.072
y-0.068
ppy-0.065
azz-0.063
ches-0.062
Token play
Token ID711
Feature activation+2.76e+00

Pos logit contributions
 anything+0.056
 End+0.052
 chores+0.052
 miles+0.051
 accomplishment+0.050
Neg logit contributions
o-0.120
jo-0.109
os-0.108
ppy-0.107
bug-0.106
Token with
Token ID351
Feature activation+6.50e+00

Pos logit contributions
 anything+0.047
 chores+0.047
 accomplishment+0.045
 trouble+0.044
 possible+0.044
Neg logit contributions
o-0.056
y-0.055
ppy-0.055
itty-0.051
bug-0.051
Token in
Token ID287
Feature activation+3.74e+00

Pos logit contributions
 anything+0.074
 trouble+0.066
 chores+0.065
 accomplishment+0.062
 something+0.061
Neg logit contributions
o-0.087
y-0.082
ppy-0.078
jo-0.076
os-0.076
Token a
Token ID257
Feature activation+1.75e+00

Pos logit contributions
 accomplishment+0.050
 chores+0.047
 anything+0.046
 choice+0.046
 possible+0.044
Neg logit contributions
o-0.088
jo-0.079
y-0.077
ppy-0.076
bug-0.075
Token river
Token ID7850
Feature activation-9.38e+00

Pos logit contributions
 anything+0.017
 chores+0.013
 something+0.013
 trouble+0.013
 accomplishment+0.012
Neg logit contributions
jo-0.054
o-0.053
y-0.052
ppy-0.052
azz-0.052
Token.
Token ID13
Feature activation-3.96e+00

Pos logit contributions
jo+0.219
ppy+0.208
azz+0.207
o+0.204
y+0.199
Neg logit contributions
 anything-0.164
 something-0.149
 chores-0.146
 trouble-0.146
 good-0.142
Token The
Token ID383
Feature activation-1.33e+00

Pos logit contributions
jo+0.104
ppy+0.103
azz+0.098
y+0.096
ne+0.096
Neg logit contributions
 anything-0.058
 many-0.054
 something-0.053
 trouble-0.050
 an-0.050
Token hipp
Token ID18568
Feature activation-1.93e+01

Pos logit contributions
o+0.022
y+0.020
jo+0.020
os+0.018
ah+0.018
Neg logit contributions
 anything-0.032
 something-0.029
 chores-0.029
 trouble-0.029
 accomplishment-0.027
Tokeno
Token ID78
Feature activation-1.38e+01

Pos logit contributions
ppy+0.183
azz+0.174
jo+0.166
anne+0.147
ches+0.137
Neg logit contributions
 many-0.578
 things-0.535
 most-0.529
 others-0.526
 himself-0.525
Token was
Token ID373
Feature activation+7.06e+00

Pos logit contributions
azz+0.328
jo+0.323
ppy+0.315
anne+0.302
itty+0.293
Neg logit contributions
 many-0.224
 something-0.203
 good-0.201
 anything-0.200
 things-0.197
Token very
Token ID845
Feature activation+6.52e+00

Pos logit contributions
 avail+0.107
 anything+0.106
 chores+0.104
 accomplishment+0.099
 yourselves+0.098
Neg logit contributions
o-0.183
y-0.155
ppy-0.149
bug-0.146
jo-0.145
Token happy
Token ID3772
Feature activation+1.34e+00

Pos logit contributions
 anything+0.087
 something+0.071
 chores+0.069
 trouble+0.069
 miles+0.065
Neg logit contributions
o-0.185
jo-0.174
y-0.174
ppy-0.172
os-0.169
Token and
Token ID290
Feature activation+3.70e+00

Pos logit contributions
 anything+0.027
 chores+0.025
 trouble+0.025
 something+0.024
 good+0.024
Neg logit contributions
jo-0.027
o-0.026
y-0.026
ppy-0.026
azz-0.025
Token
Token ID198
Feature activation-2.87e+00

Pos logit contributions
ppy+0.065
jo+0.064
azz+0.062
y+0.060
anne+0.059
Neg logit contributions
 anything-0.032
 many-0.030
 something-0.030
 chores-0.028
 good-0.028
TokenOne
Token ID3198
Feature activation-3.40e+00

Pos logit contributions
jo+0.078
ppy+0.077
azz+0.075
os+0.074
anne+0.073
Neg logit contributions
 anything-0.037
 chores-0.034
 many-0.034
 something-0.034
 good-0.033
Token day
Token ID1110
Feature activation-3.11e+00

Pos logit contributions
jo+0.111
azz+0.110
ppy+0.108
os+0.104
ne+0.102
Neg logit contributions
 good-0.026
 anything-0.025
 something-0.025
 many-0.023
 great-0.023
Token,
Token ID11
Feature activation-1.54e+00

Pos logit contributions
jo+0.071
o+0.069
ppy+0.069
y+0.068
azz+0.068
Neg logit contributions
 anything-0.053
 chores-0.049
 trouble-0.049
 yourselves-0.047
 something-0.046
Token the
Token ID262
Feature activation-2.64e+00

Pos logit contributions
o+0.042
jo+0.040
y+0.040
ppy+0.040
itty+0.039
Neg logit contributions
 anything-0.017
 yourselves-0.017
 trouble-0.017
 chores-0.017
 accomplishment-0.016
Token hipp
Token ID18568
Feature activation-1.93e+01

Pos logit contributions
jo+0.054
o+0.054
y+0.052
ppy+0.050
ne+0.049
Neg logit contributions
 anything-0.052
 chores-0.046
 trouble-0.046
 something-0.046
 yourselves-0.044
Tokeno
Token ID78
Feature activation-1.40e+01

Pos logit contributions
azz+0.172
ppy+0.172
jo+0.171
anne+0.142
ems+0.138
Neg logit contributions
 many-0.551
 something-0.527
 others-0.519
 anything-0.517
 most-0.517
Token met
Token ID1138
Feature activation+1.30e+00

Pos logit contributions
jo+0.314
azz+0.306
ppy+0.292
itty+0.287
anne+0.282
Neg logit contributions
 something-0.228
 anything-0.224
 trouble-0.220
 many-0.220
 an-0.211
Token a
Token ID257
Feature activation+8.77e-01

Pos logit contributions
 anything+0.007
 something+0.005
 trouble+0.005
 chores+0.005
 good+0.004
Neg logit contributions
jo-0.046
ppy-0.045
azz-0.044
o-0.044
y-0.043
Token wise
Token ID10787
Feature activation-3.06e+00

Pos logit contributions
 anything+0.011
 something+0.010
 trouble+0.010
 chores+0.010
 good+0.009
Neg logit contributions
jo-0.024
ppy-0.023
azz-0.023
o-0.023
y-0.023
Token old
Token ID1468
Feature activation-5.48e+00

Pos logit contributions
jo+0.094
ppy+0.092
azz+0.090
os+0.090
o+0.088
Neg logit contributions
 anything-0.030
 good-0.028
 something-0.027
 trouble-0.027
 new-0.025
Token
Token ID198
Feature activation-2.88e+00

Pos logit contributions
jo+0.059
ppy+0.058
azz+0.057
o+0.055
y+0.055
Neg logit contributions
 anything-0.037
 something-0.034
 trouble-0.033
 many-0.033
 chores-0.033
TokenOne
Token ID3198
Feature activation-3.28e+00

Pos logit contributions
jo+0.071
ppy+0.068
azz+0.067
o+0.067
y+0.066
Neg logit contributions
 anything-0.047
 something-0.043
 chores-0.042
 trouble-0.042
 good-0.040
Token day
Token ID1110
Feature activation-3.01e+00

Pos logit contributions
jo+0.106
azz+0.104
ppy+0.103
os+0.099
ne+0.098
Neg logit contributions
 anything-0.026
 good-0.025
 something-0.025
 many-0.023
 great-0.022
Token,
Token ID11
Feature activation-1.44e+00

Pos logit contributions
jo+0.068
o+0.066
ppy+0.065
y+0.065
azz+0.064
Neg logit contributions
 anything-0.052
 chores-0.049
 trouble-0.048
 yourselves-0.046
 something-0.046
Token the
Token ID262
Feature activation-2.46e+00

Pos logit contributions
o+0.039
y+0.036
ppy+0.036
jo+0.035
itty+0.035
Neg logit contributions
 chores-0.017
 trouble-0.017
 anything-0.017
 yourselves-0.016
 accomplishment-0.016
Token hipp
Token ID18568
Feature activation-1.93e+01

Pos logit contributions
o+0.046
jo+0.044
y+0.043
itty+0.041
ne+0.040
Neg logit contributions
 anything-0.053
 chores-0.048
 trouble-0.048
 something-0.047
 yourselves-0.046
Tokeno
Token ID78
Feature activation-1.40e+01

Pos logit contributions
jo+0.183
ppy+0.176
azz+0.173
anne+0.157
anna+0.151
Neg logit contributions
 many-0.564
 something-0.531
 anything-0.526
 things-0.521
 others-0.521
Token saw
Token ID2497
Feature activation+3.81e+00

Pos logit contributions
jo+0.316
azz+0.303
ppy+0.294
anne+0.290
itty+0.287
Neg logit contributions
 anything-0.236
 something-0.235
 trouble-0.228
 many-0.226
 good-0.217
Token something
Token ID1223
Feature activation-1.52e+00

Pos logit contributions
 chores+0.036
 accomplishment+0.035
 anything+0.033
 yourselves+0.030
 trouble+0.029
Neg logit contributions
o-0.115
jo-0.107
y-0.106
ppy-0.104
os-0.104
Token that
Token ID326
Feature activation+9.84e-01

Pos logit contributions
jo+0.045
o+0.044
y+0.043
azz+0.043
ppy+0.043
Neg logit contributions
 anything-0.015
 yourselves-0.013
 chores-0.012
 trouble-0.012
 miles-0.011
Token caught
Token ID4978
Feature activation+2.16e+00

Pos logit contributions
 anything+0.020
 chores+0.020
 yourselves+0.020
 trouble+0.019
 accomplishment+0.019
Neg logit contributions
o-0.018
jo-0.018
os-0.017
y-0.017
ppy-0.016
Token right
Token ID826
Feature activation+2.33e+00

Pos logit contributions
 chores+0.073
 anything+0.068
 yourselves+0.067
 trouble+0.064
 top+0.064
Neg logit contributions
ppy-0.129
y-0.128
jo-0.127
o-0.126
bug-0.121
Token.
Token ID13
Feature activation-3.92e+00

Pos logit contributions
 anything+0.044
 chores+0.042
 trouble+0.041
 something+0.040
 miles+0.039
Neg logit contributions
jo-0.048
y-0.047
ppy-0.047
o-0.046
azz-0.044
Token
Token ID198
Feature activation-2.22e+00

Pos logit contributions
jo+0.109
ppy+0.105
y+0.102
azz+0.101
ah+0.100
Neg logit contributions
 many-0.052
 anything-0.052
 something-0.051
 an-0.048
 top-0.048
Token
Token ID198
Feature activation-2.87e+00

Pos logit contributions
jo+0.057
ppy+0.056
azz+0.055
y+0.054
o+0.053
Neg logit contributions
 anything-0.031
 something-0.029
 many-0.029
 trouble-0.028
 good-0.028
TokenThe
Token ID464
Feature activation-2.31e+00

Pos logit contributions
jo+0.078
ppy+0.074
azz+0.074
o+0.074
y+0.072
Neg logit contributions
 anything-0.040
 something-0.035
 chores-0.035
 trouble-0.035
 good-0.033
Token hipp
Token ID18568
Feature activation-1.93e+01

Pos logit contributions
o+0.042
jo+0.041
y+0.041
ppy+0.039
itty+0.039
Neg logit contributions
 anything-0.050
 something-0.045
 trouble-0.045
 chores-0.045
 yourselves-0.043
Tokeno
Token ID78
Feature activation-1.38e+01

Pos logit contributions
jo+0.182
azz+0.175
ppy+0.174
anne+0.149
anna+0.142
Neg logit contributions
 many-0.551
 himself-0.525
 something-0.523
 others-0.520
 most-0.520
Token looked
Token ID3114
Feature activation+3.37e+00

Pos logit contributions
jo+0.336
azz+0.334
itty+0.306
ppy+0.301
anne+0.301
Neg logit contributions
 something-0.199
 many-0.198
 good-0.190
 top-0.190
 the-0.189
Token around
Token ID1088
Feature activation+3.73e+00

Pos logit contributions
 anything+0.059
 chores+0.057
 yourselves+0.056
 trouble+0.054
 miles+0.053
Neg logit contributions
y-0.072
jo-0.071
o-0.070
ppy-0.069
azz-0.067
Token and
Token ID290
Feature activation+5.13e+00

Pos logit contributions
 chores+0.073
 anything+0.072
 yourselves+0.070
 trouble+0.065
 miles+0.064
Neg logit contributions
jo-0.072
y-0.072
ppy-0.067
o-0.067
azz-0.064
Token noticed
Token ID6810
Feature activation+3.82e+00

Pos logit contributions
 anything+0.112
 something+0.103
 chores+0.103
 trouble+0.103
 good+0.100
Neg logit contributions
jo-0.098
ppy-0.092
azz-0.091
o-0.091
y-0.088
Token!
Token ID0
Feature activation-3.56e+00

Pos logit contributions
 anything+0.044
 something+0.040
 trouble+0.039
 good+0.038
 chores+0.038
Neg logit contributions
jo-0.061
azz-0.057
ppy-0.057
o-0.056
itty-0.055
Tokenâ
Token ID22940
Feature activation-6.34e+00

Pos logit contributions
jo+0.093
ppy+0.089
y+0.088
anne+0.086
os+0.084
Neg logit contributions
 anything-0.052
 something-0.048
 trouble-0.047
 many-0.045
 good-0.044
Token€
Token ID26391
Feature activation-1.05e+01

Pos logit contributions
jo+0.140
azz+0.134
ppy+0.132
os+0.129
o+0.128
Neg logit contributions
 anything-0.117
 something-0.108
 good-0.107
 chores-0.105
 trouble-0.104
Token<|endoftext|>
Token ID50256
Feature activation-8.02e+00

Pos logit contributions
jo+0.236
ppy+0.225
azz+0.221
y+0.217
o+0.217
Neg logit contributions
 anything-0.193
 something-0.177
 trouble-0.175
 chores-0.172
 good-0.168
TokenPa
Token ID28875
Feature activation-1.08e+01

Pos logit contributions
jo+0.223
ppy+0.216
azz+0.209
anne+0.206
o+0.200
Neg logit contributions
 anything-0.106
 something-0.098
 trouble-0.093
 many-0.091
 top-0.087
Tokenolo
Token ID14057
Feature activation-1.93e+01

Pos logit contributions
jo+0.198
o+0.175
ppy+0.174
os+0.172
azz+0.172
Neg logit contributions
 anything-0.249
 something-0.236
 trouble-0.230
 good-0.227
 many-0.227
Token was
Token ID373
Feature activation+6.30e+00

Pos logit contributions
jo+0.451
azz+0.443
ppy+0.427
itty+0.419
oke+0.398
Neg logit contributions
 something-0.307
 anything-0.306
 an-0.292
 many-0.286
 good-0.283
Token very
Token ID845
Feature activation+6.02e+00

Pos logit contributions
 anything+0.088
 chores+0.074
 trouble+0.072
 yourselves+0.070
 miles+0.069
Neg logit contributions
o-0.170
jo-0.167
y-0.160
ppy-0.159
os-0.155
Token excited
Token ID6568
Feature activation+1.63e-01

Pos logit contributions
 anything+0.103
 miles+0.084
 yourselves+0.084
 chores+0.084
 trouble+0.082
Neg logit contributions
o-0.146
jo-0.137
y-0.136
os-0.134
ppy-0.133
Token.
Token ID13
Feature activation-3.86e+00

Pos logit contributions
 anything+0.003
 something+0.003
 trouble+0.002
 chores+0.002
 many+0.002
Neg logit contributions
jo-0.004
ppy-0.004
azz-0.004
o-0.004
itty-0.004
Token He
Token ID679
Feature activation-4.97e+00

Pos logit contributions
jo+0.087
ppy+0.082
azz+0.082
o+0.081
y+0.080
Neg logit contributions
 anything-0.071
 something-0.064
 trouble-0.064
 chores-0.063
 good-0.061
Token was
Token ID373
Feature activation+5.21e+00

Pos logit contributions
jo+0.095
o+0.091
y+0.089
ppy+0.087
azz+0.086
Neg logit contributions
 anything-0.112
 chores-0.105
 trouble-0.103
 something-0.101
 yourselves-0.100
Token a
Token ID257
Feature activation-3.58e-01

Pos logit contributions
 anything+0.058
 yourselves+0.051
 accomplishment+0.050
 chores+0.048
 miles+0.048
Neg logit contributions
o-0.146
jo-0.139
y-0.139
ppy-0.137
azz-0.133
Token little
Token ID1310
Feature activation-2.58e+00

Pos logit contributions
jo+0.010
ppy+0.010
azz+0.010
os+0.010
o+0.010
Neg logit contributions
 anything-0.004
 good-0.004
 something-0.004
 great-0.004
 trouble-0.004
Token fish
Token ID5916
Feature activation-1.47e+01

Pos logit contributions
jo+0.064
azz+0.059
anne+0.058
os+0.058
ppy+0.058
Neg logit contributions
 good-0.045
 things-0.041
 anything-0.041
 trouble-0.041
 something-0.041
Token named
Token ID3706
Feature activation-2.80e-01

Pos logit contributions
jo+0.314
azz+0.306
ppy+0.295
anne+0.295
oke+0.277
Neg logit contributions
 anything-0.275
 many-0.274
 something-0.269
 good-0.254
 things-0.248
Token Nem
Token ID22547
Feature activation-1.93e+01

Pos logit contributions
jo+0.007
azz+0.007
ppy+0.007
y+0.007
anne+0.007
Neg logit contributions
 anything-0.004
 something-0.004
 good-0.004
 many-0.004
 chores-0.004
Tokeno
Token ID78
Feature activation-1.26e+01

Pos logit contributions
jo+0.152
anne+0.135
azz+0.133
ol+0.115
anna+0.114
Neg logit contributions
 anything-0.565
 something-0.565
 many-0.563
 good-0.554
 an-0.550
Token.
Token ID13
Feature activation-4.18e+00

Pos logit contributions
jo+0.317
azz+0.310
anne+0.299
ppy+0.297
itty+0.287
Neg logit contributions
 anything-0.194
 something-0.181
 many-0.180
 chores-0.169
 good-0.169
Token Nem
Token ID22547
Feature activation-1.89e+01

Pos logit contributions
o+0.089
y+0.088
jo+0.086
ppy+0.083
azz+0.083
Neg logit contributions
 anything-0.078
 chores-0.073
 trouble-0.072
 something-0.069
 accomplishment-0.068
Tokeno
Token ID78
Feature activation-1.30e+01

Pos logit contributions
azz+0.154
jo+0.146
anne+0.136
anna+0.127
ppy+0.126
Neg logit contributions
 many-0.536
 good-0.532
 something-0.531
 himself-0.527
 anything-0.522
Token lived
Token ID5615
Feature activation+4.01e+00

Pos logit contributions
azz+0.320
jo+0.314
ppy+0.307
itty+0.296
anne+0.296
Neg logit contributions
 anything-0.198
 many-0.196
 something-0.190
 good-0.186
 great-0.179
Token a
Token ID257
Feature activation+7.07e-02

Pos logit contributions
 anything+0.063
 yourselves+0.057
 accomplishment+0.055
 chores+0.053
 miles+0.049
Neg logit contributions
o-0.162
jo-0.156
y-0.154
ppy-0.153
azz-0.148
Token little
Token ID1310
Feature activation-2.24e+00

Pos logit contributions
 good+0.001
 anything+0.001
 great+0.001
 something+0.001
 trouble+0.001
Neg logit contributions
jo-0.002
os-0.002
azz-0.002
ppy-0.002
o-0.002
Token fish
Token ID5916
Feature activation-1.45e+01

Pos logit contributions
jo+0.056
azz+0.052
anne+0.051
os+0.051
ppy+0.050
Neg logit contributions
 good-0.039
 anything-0.035
 things-0.035
 trouble-0.035
 something-0.035
Tokeny
Token ID88
Feature activation-6.07e+00

Pos logit contributions
jo+0.297
azz+0.288
ppy+0.277
anne+0.277
oke+0.258
Neg logit contributions
 anything-0.284
 many-0.275
 something-0.275
 good-0.262
 an-0.254
Token named
Token ID3706
Feature activation-5.23e-02

Pos logit contributions
jo+0.154
azz+0.150
anne+0.143
ppy+0.141
os+0.140
Neg logit contributions
 anything-0.092
 something-0.092
 many-0.092
 good-0.091
 things-0.087
Token Nem
Token ID22547
Feature activation-1.92e+01

Pos logit contributions
jo+0.001
azz+0.001
ppy+0.001
y+0.001
o+0.001
Neg logit contributions
 anything-0.001
 something-0.001
 chores-0.001
 good-0.001
 trouble-0.001
Tokeno
Token ID78
Feature activation-1.25e+01

Pos logit contributions
jo+0.169
anne+0.167
anna+0.144
ol+0.142
affe+0.134
Neg logit contributions
 anything-0.522
 something-0.518
 many-0.509
 good-0.507
 himself-0.505
Token.
Token ID13
Feature activation-3.89e+00

Pos logit contributions
jo+0.314
azz+0.306
anne+0.295
ppy+0.294
itty+0.283
Neg logit contributions
 anything-0.195
 something-0.182
 many-0.178
 good-0.170
 chores-0.169
Token Nem
Token ID22547
Feature activation-1.84e+01

Pos logit contributions
y+0.080
o+0.080
jo+0.076
ppy+0.074
azz+0.073
Neg logit contributions
 anything-0.075
 trouble-0.070
 chores-0.070
 danger-0.067
 something-0.067
Tokeno
Token ID78
Feature activation-1.27e+01

Pos logit contributions
jo+0.152
azz+0.149
anne+0.149
anna+0.137
ne+0.127
Neg logit contributions
 himself-0.495
 good-0.495
 many-0.493
 something-0.493
 anything-0.484
Token loved
Token ID6151
Feature activation+6.79e+00

Pos logit contributions
azz+0.312
jo+0.307
ppy+0.298
anne+0.288
itty+0.288
Neg logit contributions
 anything-0.194
 many-0.193
 something-0.186
 good-0.185
 great-0.178
Token bird
Token ID6512
Feature activation-1.41e+01

Pos logit contributions
 anything+0.023
 something+0.020
 chores+0.020
 trouble+0.020
 yourselves+0.020
Neg logit contributions
jo-0.023
o-0.023
y-0.022
ppy-0.022
itty-0.022
Token.
Token ID13
Feature activation-4.19e+00

Pos logit contributions
jo+0.374
azz+0.359
ne+0.338
anne+0.338
ppy+0.334
Neg logit contributions
 anything-0.220
 something-0.209
 good-0.191
 many-0.188
 things-0.182
Token So
Token ID1406
Feature activation-1.01e+00

Pos logit contributions
jo+0.118
ppy+0.115
azz+0.113
y+0.109
anne+0.107
Neg logit contributions
 anything-0.055
 many-0.053
 something-0.053
 an-0.049
 top-0.049
Token,
Token ID11
Feature activation-1.54e+00

Pos logit contributions
o+0.020
y+0.019
jo+0.019
ppy+0.018
ah+0.018
Neg logit contributions
 anything-0.020
 chores-0.020
 yourselves-0.019
 trouble-0.019
 miles-0.018
Token the
Token ID262
Feature activation-3.13e+00

Pos logit contributions
o+0.043
y+0.041
os+0.040
jo+0.040
ne+0.040
Neg logit contributions
 chores-0.016
 yourselves-0.016
 anything-0.015
 trouble-0.015
 accomplishment-0.013
Token hipp
Token ID18568
Feature activation-1.92e+01

Pos logit contributions
jo+0.058
o+0.057
y+0.055
ppy+0.054
itty+0.053
Neg logit contributions
 anything-0.068
 chores-0.062
 something-0.062
 trouble-0.061
 yourselves-0.059
Tokeno
Token ID78
Feature activation-1.38e+01

Pos logit contributions
jo+0.208
ppy+0.185
azz+0.185
anne+0.181
anna+0.180
Neg logit contributions
 many-0.529
 others-0.497
 good-0.492
 not-0.492
 something-0.492
Token used
Token ID973
Feature activation+5.16e+00

Pos logit contributions
jo+0.350
azz+0.333
itty+0.311
anne+0.307
os+0.303
Neg logit contributions
 something-0.213
 good-0.210
 many-0.203
 not-0.199
 anything-0.199
Token its
Token ID663
Feature activation+4.02e+00

Pos logit contributions
 yourselves+0.072
 anything+0.071
 trouble+0.063
 chores+0.062
 avail+0.062
Neg logit contributions
o-0.126
y-0.121
ppy-0.119
os-0.116
bug-0.111
Token big
Token ID1263
Feature activation+1.41e+00

Pos logit contributions
 anything+0.061
 trouble+0.052
 yourselves+0.052
 miles+0.052
 chores+0.051
Neg logit contributions
ppy-0.097
o-0.097
jo-0.097
y-0.095
os-0.090
Token mouth
Token ID5422
Feature activation-2.95e+00

Pos logit contributions
 anything+0.026
 something+0.023
 chores+0.023
 trouble+0.023
 many+0.022
Neg logit contributions
jo-0.032
ppy-0.030
o-0.030
azz-0.030
y-0.030
Token was
Token ID373
Feature activation+5.88e+00

Pos logit contributions
jo+0.077
o+0.075
y+0.074
os+0.069
ppy+0.068
Neg logit contributions
 anything-0.100
 chores-0.096
 trouble-0.093
 yourselves-0.091
 accomplishment-0.090
Token a
Token ID257
Feature activation+4.41e-01

Pos logit contributions
 anything+0.061
 chores+0.051
 accomplishment+0.048
 yourselves+0.047
 trouble+0.045
Neg logit contributions
o-0.173
jo-0.167
ppy-0.167
y-0.167
itty-0.162
Token little
Token ID1310
Feature activation-2.24e+00

Pos logit contributions
 anything+0.005
 good+0.005
 great+0.005
 something+0.005
 trouble+0.004
Neg logit contributions
jo-0.013
os-0.012
azz-0.012
ppy-0.012
o-0.012
Token fish
Token ID5916
Feature activation-1.48e+01

Pos logit contributions
jo+0.057
azz+0.053
os+0.053
anne+0.052
ppy+0.051
Neg logit contributions
 good-0.038
 things-0.034
 anything-0.034
 something-0.034
 new-0.033
Token named
Token ID3706
Feature activation-3.50e-01

Pos logit contributions
jo+0.307
azz+0.301
ppy+0.289
anne+0.286
oke+0.269
Neg logit contributions
 anything-0.286
 something-0.279
 many-0.278
 good-0.264
 an-0.257
Token Nem
Token ID22547
Feature activation-1.92e+01

Pos logit contributions
jo+0.009
azz+0.009
ppy+0.009
anne+0.008
itty+0.008
Neg logit contributions
 anything-0.005
 something-0.005
 many-0.005
 good-0.005
 chores-0.004
Tokeno
Token ID78
Feature activation-1.30e+01

Pos logit contributions
jo+0.159
anne+0.145
azz+0.135
ol+0.126
anna+0.121
Neg logit contributions
 anything-0.557
 something-0.556
 many-0.552
 good-0.540
 not-0.540
Token.
Token ID13
Feature activation-4.10e+00

Pos logit contributions
jo+0.329
azz+0.327
ppy+0.311
anne+0.311
itty+0.301
Neg logit contributions
 anything-0.193
 something-0.183
 many-0.180
 good-0.168
 an-0.167
Token Nem
Token ID22547
Feature activation-1.83e+01

Pos logit contributions
o+0.090
y+0.089
jo+0.089
ppy+0.086
azz+0.085
Neg logit contributions
 anything-0.074
 trouble-0.070
 chores-0.069
 something-0.066
 accomplishment-0.065
Tokeno
Token ID78
Feature activation-1.29e+01

Pos logit contributions
jo+0.134
azz+0.129
anne+0.121
ppy+0.109
anna+0.104
Neg logit contributions
 many-0.540
 something-0.539
 anything-0.535
 good-0.535
 himself-0.530
Token loved
Token ID6151
Feature activation+7.10e+00

Pos logit contributions
jo+0.303
azz+0.301
ppy+0.291
anne+0.278
itty+0.278
Neg logit contributions
 anything-0.212
 many-0.201
 something-0.200
 good-0.197
 great-0.190
Token block
Token ID2512
Feature activation-7.16e+00

Pos logit contributions
 anything+0.061
 something+0.056
 trouble+0.055
 chores+0.055
 good+0.054
Neg logit contributions
jo-0.076
ppy-0.072
azz-0.071
o-0.070
y-0.069
Token.
Token ID13
Feature activation-2.21e+00

Pos logit contributions
jo+0.181
ppy+0.171
azz+0.171
anne+0.165
o+0.164
Neg logit contributions
 anything-0.118
 something-0.105
 good-0.104
 chores-0.100
 many-0.100
Token It
Token ID632
Feature activation-2.69e+00

Pos logit contributions
jo+0.055
y+0.052
ppy+0.051
o+0.050
anne+0.050
Neg logit contributions
 anything-0.037
 something-0.034
 trouble-0.032
 good-0.032
 many-0.032
Token helps
Token ID5419
Feature activation+4.59e+00

Pos logit contributions
jo+0.072
anne+0.067
ppy+0.066
azz+0.065
itty+0.062
Neg logit contributions
 anything-0.042
 good-0.040
 many-0.037
 something-0.037
 great-0.036
Token you
Token ID345
Feature activation+2.58e-02

Pos logit contributions
 anything+0.058
 trouble+0.051
 chores+0.051
 something+0.050
 good+0.047
Neg logit contributions
jo-0.127
ppy-0.122
o-0.121
azz-0.121
y-0.119
Token learn
Token ID2193
Feature activation+7.09e+00

Pos logit contributions
 anything+0.000
 trouble+0.000
 something+0.000
 chores+0.000
 good+0.000
Neg logit contributions
jo-0.001
ppy-0.001
o-0.001
azz-0.001
y-0.001
Token numbers
Token ID3146
Feature activation-1.02e+00

Pos logit contributions
 anything+0.104
 trouble+0.100
 chores+0.098
 miles+0.097
 accomplishment+0.094
Neg logit contributions
jo-0.164
azz-0.161
o-0.161
y-0.160
ne-0.157
Token.
Token ID13
Feature activation-2.75e+00

Pos logit contributions
jo+0.023
ppy+0.022
azz+0.022
o+0.022
y+0.021
Neg logit contributions
 anything-0.018
 something-0.017
 chores-0.016
 trouble-0.016
 good-0.016
Token Do
Token ID2141
Feature activation+3.52e+00

Pos logit contributions
jo+0.068
y+0.063
ppy+0.063
anne+0.062
o+0.061
Neg logit contributions
 anything-0.047
 something-0.043
 trouble-0.041
 good-0.040
 many-0.040
Token you
Token ID345
Feature activation-3.88e-01

Pos logit contributions
 anything+0.040
 trouble+0.034
 something+0.034
 chores+0.033
 good+0.031
Neg logit contributions
jo-0.104
ppy-0.100
azz-0.099
o-0.099
y-0.097
Token know
Token ID760
Feature activation+4.04e+00

Pos logit contributions
jo+0.010
azz+0.009
o+0.009
itty+0.009
ppy+0.009
Neg logit contributions
 anything-0.006
 something-0.006
 chores-0.006
 good-0.006
 trouble-0.006
Token the
Token ID262
Feature activation+3.40e+00

Pos logit contributions
 anything+0.064
 chores+0.060
 trouble+0.059
 yourselves+0.056
 something+0.055
Neg logit contributions
jo-0.115
y-0.109
ppy-0.107
o-0.106
azz-0.105
Token kitchen
Token ID9592
Feature activation-4.64e+00

Pos logit contributions
 anything+0.046
 something+0.040
 chores+0.040
 trouble+0.040
 good+0.038
Neg logit contributions
jo-0.092
ppy-0.088
azz-0.087
o-0.087
y-0.086
Token.
Token ID13
Feature activation-3.34e+00

Pos logit contributions
jo+0.112
ppy+0.106
azz+0.105
o+0.105
y+0.102
Neg logit contributions
 anything-0.078
 something-0.071
 trouble-0.069
 chores-0.069
 good-0.067
Token He
Token ID679
Feature activation-3.43e+00

Pos logit contributions
jo+0.083
ppy+0.083
y+0.080
azz+0.078
o+0.077
Neg logit contributions
 anything-0.050
 something-0.048
 many-0.045
 trouble-0.045
 an-0.044
Token was
Token ID373
Feature activation+7.84e+00

Pos logit contributions
jo+0.072
azz+0.070
itty+0.069
ppy+0.068
o+0.067
Neg logit contributions
 anything-0.064
 something-0.063
 many-0.060
 good-0.059
 great-0.058
Token having
Token ID1719
Feature activation+1.07e+01

Pos logit contributions
 anything+0.099
 chores+0.093
 trouble+0.086
 yourselves+0.082
 something+0.081
Neg logit contributions
jo-0.210
o-0.206
ppy-0.201
y-0.201
os-0.194
Token so
Token ID523
Feature activation+6.44e+00

Pos logit contributions
 anything+0.108
 yourselves+0.107
 chores+0.102
 accomplishment+0.094
wered+0.094
Neg logit contributions
jo-0.290
o-0.281
ppy-0.277
azz-0.275
y-0.275
Token much
Token ID881
Feature activation+5.84e+00

Pos logit contributions
 anything+0.069
 yourselves+0.055
 chores+0.053
 something+0.052
 avail+0.049
Neg logit contributions
jo-0.193
o-0.183
y-0.180
os-0.179
azz-0.177
Token fun
Token ID1257
Feature activation+1.49e+00

Pos logit contributions
 anything+0.052
 yourselves+0.046
 chores+0.040
 top+0.039
 bottom+0.038
Neg logit contributions
jo-0.177
azz-0.168
o-0.168
ne-0.167
y-0.165
Token that
Token ID326
Feature activation+2.55e+00

Pos logit contributions
 anything+0.028
 chores+0.026
 trouble+0.026
 something+0.025
 yourselves+0.025
Neg logit contributions
jo-0.032
y-0.029
o-0.029
ppy-0.029
azz-0.029
Token he
Token ID339
Feature activation-3.20e+00

Pos logit contributions
 anything+0.034
 yourselves+0.033
 chores+0.033
 trouble+0.032
 miles+0.030
Neg logit contributions
jo-0.065
o-0.061
y-0.060
ppy-0.059
azz-0.059
Token good
Token ID922
Feature activation+2.58e+00

Pos logit contributions
 anything+0.041
 something+0.033
 chores+0.032
 trouble+0.032
 yourselves+0.030
Neg logit contributions
jo-0.138
azz-0.131
ppy-0.131
o-0.129
y-0.129
Token memories
Token ID9846
Feature activation-4.12e+00

Pos logit contributions
 anything+0.025
 something+0.020
 chores+0.020
 trouble+0.020
 yourselves+0.019
Neg logit contributions
jo-0.081
ppy-0.076
o-0.076
azz-0.076
y-0.076
Token that
Token ID326
Feature activation+2.18e+00

Pos logit contributions
jo+0.100
ppy+0.095
azz+0.095
o+0.094
y+0.090
Neg logit contributions
 anything-0.069
 something-0.062
 trouble-0.060
 chores-0.060
 good-0.060
Token she
Token ID673
Feature activation-3.17e+00

Pos logit contributions
 anything+0.030
 chores+0.030
 trouble+0.029
 yourselves+0.029
 accomplishment+0.028
Neg logit contributions
jo-0.054
y-0.052
ppy-0.052
o-0.052
os-0.051
Token never
Token ID1239
Feature activation+5.59e+00

Pos logit contributions
jo+0.063
azz+0.059
o+0.057
ppy+0.057
itty+0.056
Neg logit contributions
 anything-0.067
 something-0.061
 good-0.061
 trouble-0.060
 chores-0.060
Token wanted
Token ID2227
Feature activation+7.76e+00

Pos logit contributions
 anything+0.113
 something+0.102
 trouble+0.100
 good+0.098
 many+0.097
Neg logit contributions
jo-0.123
azz-0.110
itty-0.109
anne-0.109
ppy-0.108
Token to
Token ID284
Feature activation+8.42e+00

Pos logit contributions
 anything+0.132
 chores+0.122
 trouble+0.121
 something+0.120
 good+0.115
Neg logit contributions
jo-0.181
ppy-0.171
azz-0.170
o-0.170
y-0.167
Token let
Token ID1309
Feature activation+5.92e+00

Pos logit contributions
 anything+0.135
 something+0.122
 trouble+0.121
 chores+0.120
 good+0.116
Neg logit contributions
jo-0.208
ppy-0.197
azz-0.196
o-0.195
y-0.192
Token it
Token ID340
Feature activation-2.16e+00

Pos logit contributions
 anything+0.052
 something+0.042
 trouble+0.041
 chores+0.041
 good+0.038
Neg logit contributions
jo-0.191
ppy-0.183
azz-0.182
o-0.182
y-0.179
Token go
Token ID467
Feature activation+6.20e+00

Pos logit contributions
jo+0.056
ppy+0.054
o+0.053
azz+0.053
y+0.053
Neg logit contributions
 anything-0.030
 trouble-0.027
 chores-0.027
 something-0.027
 yourselves-0.025
Token.
Token ID13
Feature activation-3.90e+00

Pos logit contributions
 anything+0.114
 trouble+0.106
 chores+0.105
 something+0.104
 good+0.100
Neg logit contributions
jo-0.135
ppy-0.128
y-0.128
o-0.127
azz-0.126
Token.
Token ID13
Feature activation-3.67e+00

Pos logit contributions
 anything+0.042
 chores+0.040
 trouble+0.039
 miles+0.038
 yourselves+0.037
Neg logit contributions
jo-0.040
y-0.039
ppy-0.038
o-0.038
ne-0.037
Token
Token ID198
Feature activation-2.08e+00

Pos logit contributions
jo+0.080
ppy+0.076
azz+0.075
o+0.074
y+0.073
Neg logit contributions
 anything-0.070
 something-0.064
 chores-0.064
 trouble-0.064
 good-0.061
Token
Token ID198
Feature activation-2.71e+00

Pos logit contributions
jo+0.050
ppy+0.048
azz+0.047
o+0.047
y+0.046
Neg logit contributions
 anything-0.035
 something-0.031
 trouble-0.031
 chores-0.031
 good-0.030
TokenWhen
Token ID2215
Feature activation-8.17e-01

Pos logit contributions
jo+0.060
ppy+0.058
azz+0.057
y+0.057
o+0.057
Neg logit contributions
 anything-0.051
 trouble-0.045
 chores-0.045
 something-0.044
 miles-0.042
Token she
Token ID673
Feature activation-4.53e+00

Pos logit contributions
y+0.016
ppy+0.015
o+0.015
ne+0.014
jo+0.014
Neg logit contributions
 yourselves-0.015
 anything-0.015
 chores-0.015
 miles-0.014
 trouble-0.014
Token got
Token ID1392
Feature activation+8.46e+00

Pos logit contributions
jo+0.084
ppy+0.077
azz+0.077
o+0.075
os+0.072
Neg logit contributions
 anything-0.102
 something-0.096
 trouble-0.094
 chores-0.093
 good-0.093
Token there
Token ID612
Feature activation-1.80e+00

Pos logit contributions
 anything+0.138
 yourselves+0.135
 accomplishment+0.135
 chores+0.127
 danger+0.126
Neg logit contributions
o-0.204
y-0.184
os-0.179
itty-0.174
bug-0.174
Token,
Token ID11
Feature activation+1.01e-01

Pos logit contributions
y+0.036
jo+0.035
o+0.034
ppy+0.034
os+0.033
Neg logit contributions
 anything-0.034
 chores-0.033
 yourselves-0.033
 trouble-0.032
 danger-0.031
Token Molly
Token ID30236
Feature activation-1.46e+01

Pos logit contributions
 yourselves+0.002
 chores+0.002
 miles+0.002
 trouble+0.002
 freedom+0.002
Neg logit contributions
y-0.002
ppy-0.002
ella-0.002
o-0.002
bug-0.002
Token noticed
Token ID6810
Feature activation+3.65e+00

Pos logit contributions
jo+0.323
azz+0.308
ppy+0.297
o+0.286
os+0.283
Neg logit contributions
 anything-0.265
 something-0.264
 trouble-0.254
 many-0.248
 chores-0.245
Token that
Token ID326
Feature activation+1.82e+00

Pos logit contributions
 anything+0.036
 chores+0.034
 trouble+0.033
 accomplishment+0.032
 good+0.032
Neg logit contributions
jo-0.101
y-0.100
o-0.099
ppy-0.097
ne-0.096
Token.
Token ID13
Feature activation-3.79e+00

Pos logit contributions
 anything+0.032
 something+0.029
 trouble+0.028
 chores+0.028
 good+0.027
Neg logit contributions
jo-0.044
ppy-0.042
azz-0.042
o-0.041
y-0.040
Token
Token ID198
Feature activation-1.81e+00

Pos logit contributions
ppy+0.093
jo+0.092
azz+0.086
y+0.086
o+0.086
Neg logit contributions
 anything-0.061
 something-0.059
 many-0.056
 chores-0.054
 trouble-0.054
Token
Token ID198
Feature activation-2.41e+00

Pos logit contributions
ppy+0.046
jo+0.046
azz+0.044
y+0.043
o+0.043
Neg logit contributions
 anything-0.026
 something-0.024
 many-0.023
 trouble-0.023
 chores-0.022
TokenWhen
Token ID2215
Feature activation-1.13e+00

Pos logit contributions
jo+0.060
ppy+0.060
o+0.056
azz+0.056
os+0.056
Neg logit contributions
 anything-0.036
 something-0.034
 many-0.033
 chores-0.032
 trouble-0.032
Token she
Token ID673
Feature activation-4.97e+00

Pos logit contributions
jo+0.024
o+0.024
y+0.024
azz+0.024
os+0.023
Neg logit contributions
 anything-0.018
 yourselves-0.018
 himself-0.017
 chores-0.017
 trouble-0.017
Token got
Token ID1392
Feature activation+8.18e+00

Pos logit contributions
jo+0.094
ppy+0.088
azz+0.086
o+0.083
os+0.081
Neg logit contributions
 anything-0.110
 something-0.104
 trouble-0.101
 good-0.100
 chores-0.100
Token there
Token ID612
Feature activation-1.71e+00

Pos logit contributions
 anything+0.137
 accomplishment+0.128
 chores+0.126
 yourselves+0.124
 temptation+0.118
Neg logit contributions
o-0.196
y-0.180
os-0.175
jo-0.172
azz-0.170
Token,
Token ID11
Feature activation+8.88e-02

Pos logit contributions
jo+0.037
y+0.036
o+0.035
ppy+0.035
os+0.034
Neg logit contributions
 anything-0.031
 chores-0.029
 trouble-0.029
 yourselves-0.028
 something-0.027
Token Sarah
Token ID10490
Feature activation-1.40e+01

Pos logit contributions
 yourselves+0.001
 anything+0.001
 trouble+0.001
 chores+0.001
 accomplishment+0.001
Neg logit contributions
jo-0.002
y-0.002
o-0.002
os-0.002
azz-0.002
Token saw
Token ID2497
Feature activation+4.91e+00

Pos logit contributions
jo+0.331
azz+0.315
ppy+0.312
itty+0.301
uddy+0.299
Neg logit contributions
 something-0.223
 anything-0.220
 many-0.211
 an-0.209
 the-0.205
Token a
Token ID257
Feature activation+1.82e+00

Pos logit contributions
 anything+0.041
 accomplishment+0.040
 chores+0.039
 job+0.035
 trouble+0.034
Neg logit contributions
jo-0.143
o-0.141
y-0.139
azz-0.138
os-0.135
Token not
Token ID407
Feature activation+6.60e+00

Pos logit contributions
 miles+0.068
 biggest+0.063
 plenty+0.061
 middle+0.059
 shore+0.059
Neg logit contributions
o-0.161
azz-0.158
ppy-0.157
jo-0.154
ray-0.148
Token know
Token ID760
Feature activation+5.51e+00

Pos logit contributions
 anything+0.101
 good+0.094
 something+0.091
 trouble+0.091
 chores+0.089
Neg logit contributions
jo-0.176
os-0.161
azz-0.161
itty-0.161
anne-0.160
Token it
Token ID340
Feature activation-3.04e+00

Pos logit contributions
 anything+0.072
 chores+0.070
 trouble+0.069
 accomplishment+0.065
 job+0.065
Neg logit contributions
jo-0.139
o-0.134
azz-0.133
ppy-0.133
y-0.131
Token was
Token ID373
Feature activation+9.90e+00

Pos logit contributions
jo+0.075
ppy+0.072
azz+0.072
ne+0.070
itty+0.068
Neg logit contributions
 anything-0.052
 something-0.048
 good-0.047
 chores-0.046
 many-0.045
Token her
Token ID607
Feature activation+6.24e-01

Pos logit contributions
 anything+0.099
 miles+0.099
 chores+0.089
 middle+0.089
 top+0.087
Neg logit contributions
ppy-0.278
o-0.274
y-0.268
jo-0.268
azz-0.261
Token design
Token ID1486
Feature activation+4.40e-01

Pos logit contributions
 anything+0.012
 miles+0.011
 yourselves+0.011
 something+0.011
 trouble+0.011
Neg logit contributions
jo-0.013
ppy-0.012
azz-0.012
y-0.012
o-0.012
Token.
Token ID13
Feature activation-2.81e+00

Pos logit contributions
 anything+0.009
 yourselves+0.008
 miles+0.008
 trouble+0.008
 chores+0.008
Neg logit contributions
jo-0.008
y-0.008
ppy-0.008
azz-0.008
o-0.007
Token He
Token ID679
Feature activation-3.10e+00

Pos logit contributions
jo+0.065
ppy+0.062
azz+0.061
o+0.061
y+0.060
Neg logit contributions
 anything-0.051
 something-0.046
 trouble-0.046
 chores-0.045
 good-0.044
Token says
Token ID1139
Feature activation+4.69e-01

Pos logit contributions
jo+0.065
ppy+0.060
azz+0.059
o+0.058
itty+0.058
Neg logit contributions
 anything-0.061
 trouble-0.061
 good-0.059
 chores-0.059
 something-0.059
Token,
Token ID11
Feature activation-1.94e+00

Pos logit contributions
 anything+0.008
 chores+0.007
 trouble+0.007
 something+0.007
 miles+0.007
Neg logit contributions
jo-0.011
ppy-0.011
azz-0.011
o-0.011
y-0.010
Token "
Token ID366
Feature activation-6.17e-01

Pos logit contributions
jo+0.053
ppy+0.049
azz+0.049
y+0.048
o+0.048
Neg logit contributions
 anything-0.028
 something-0.026
 good-0.024
 trouble-0.024
 chores-0.024
Token.
Token ID13
Feature activation-3.34e+00

Pos logit contributions
jo+0.084
azz+0.078
ppy+0.078
anne+0.077
o+0.076
Neg logit contributions
 anything-0.058
 something-0.052
 chores-0.050
 good-0.050
 trouble-0.049
Token It
Token ID632
Feature activation-3.59e+00

Pos logit contributions
jo+0.068
ppy+0.064
azz+0.064
o+0.064
y+0.062
Neg logit contributions
 anything-0.068
 something-0.062
 chores-0.062
 trouble-0.062
 good-0.060
Token looked
Token ID3114
Feature activation+4.72e+00

Pos logit contributions
jo+0.090
azz+0.086
ppy+0.084
anna+0.082
anne+0.080
Neg logit contributions
 anything-0.057
 something-0.055
 good-0.053
 not-0.052
 trouble-0.051
Token sharp
Token ID7786
Feature activation+2.49e+00

Pos logit contributions
 chores+0.066
 anything+0.065
 yourselves+0.064
 miles+0.064
 accomplishment+0.060
Neg logit contributions
y-0.109
ppy-0.108
jo-0.106
o-0.106
azz-0.105
Token and
Token ID290
Feature activation+6.94e+00

Pos logit contributions
 anything+0.049
 something+0.045
 trouble+0.045
 chores+0.044
 good+0.044
Neg logit contributions
jo-0.053
ppy-0.050
azz-0.050
o-0.049
os-0.047
Token dangerous
Token ID4923
Feature activation+2.93e+00

Pos logit contributions
 anything+0.095
 chores+0.084
 something+0.080
 trouble+0.080
 yourselves+0.079
Neg logit contributions
jo-0.179
ppy-0.176
o-0.175
azz-0.172
y-0.171
Token.
Token ID13
Feature activation-3.31e+00

Pos logit contributions
 anything+0.053
 something+0.048
 trouble+0.047
 chores+0.046
 good+0.045
Neg logit contributions
jo-0.069
ppy-0.065
azz-0.065
o-0.064
y-0.062
Token She
Token ID1375
Feature activation-3.01e+00

Pos logit contributions
jo+0.068
ppy+0.064
azz+0.064
o+0.063
y+0.061
Neg logit contributions
 anything-0.068
 chores-0.062
 something-0.062
 trouble-0.062
 good-0.060
Token was
Token ID373
Feature activation+7.42e+00

Pos logit contributions
jo+0.068
azz+0.063
os+0.061
itty+0.061
anne+0.061
Neg logit contributions
 anything-0.055
 something-0.055
 many-0.053
 not-0.052
 trouble-0.052
Token scared
Token ID12008
Feature activation+5.11e+00

Pos logit contributions
 anything+0.107
 chores+0.101
 miles+0.093
 top+0.093
 trouble+0.091
Neg logit contributions
o-0.181
ppy-0.179
jo-0.176
y-0.172
azz-0.171
Token.
Token ID13
Feature activation-3.32e+00

Pos logit contributions
 anything+0.086
 something+0.080
 trouble+0.078
 many+0.075
 good+0.075
Neg logit contributions
jo-0.127
ppy-0.121
azz-0.118
o-0.118
anne-0.116
Token other
Token ID584
Feature activation-2.16e+00

Pos logit contributions
 anything+0.001
 many+0.001
 good+0.001
 something+0.001
 trouble+0.001
Neg logit contributions
jo-0.010
ppy-0.009
azz-0.009
o-0.009
os-0.009
Token.
Token ID13
Feature activation-2.68e+00

Pos logit contributions
jo+0.054
azz+0.053
ppy+0.051
itty+0.051
o+0.051
Neg logit contributions
 anything-0.034
 something-0.032
 good-0.032
 chores-0.031
 many-0.031
Token
Token ID198
Feature activation-1.73e+00

Pos logit contributions
jo+0.056
azz+0.053
o+0.053
ppy+0.052
y+0.050
Neg logit contributions
 anything-0.054
 chores-0.049
 trouble-0.048
 something-0.048
 miles-0.047
Token
Token ID198
Feature activation-2.72e+00

Pos logit contributions
jo+0.040
o+0.037
ppy+0.037
azz+0.037
y+0.036
Neg logit contributions
 anything-0.031
 chores-0.028
 trouble-0.028
 something-0.028
 good-0.027
Token"
Token ID1
Feature activation-2.17e+00

Pos logit contributions
jo+0.064
o+0.062
ppy+0.062
azz+0.061
y+0.060
Neg logit contributions
 anything-0.047
 chores-0.042
 trouble-0.042
 something-0.041
 good-0.039
TokenThat
Token ID2504
Feature activation-1.26e-01

Pos logit contributions
jo+0.050
ppy+0.047
azz+0.047
o+0.047
y+0.046
Neg logit contributions
 anything-0.039
 something-0.035
 chores-0.035
 trouble-0.035
 good-0.034
Token was
Token ID373
Feature activation+8.68e+00

Pos logit contributions
ppy+0.003
jo+0.003
azz+0.003
ne+0.002
os+0.002
Neg logit contributions
 anything-0.002
 something-0.002
 trouble-0.002
 good-0.002
 top-0.002
Token a
Token ID257
Feature activation+2.88e+00

Pos logit contributions
 chores+0.082
 miles+0.079
 bottom+0.078
 accomplishment+0.078
 anything+0.078
Neg logit contributions
o-0.244
ppy-0.230
jo-0.221
ah-0.216
y-0.215
Token bad
Token ID2089
Feature activation-6.12e-02

Pos logit contributions
 anything+0.029
 chores+0.025
 something+0.024
 trouble+0.023
 yourselves+0.022
Neg logit contributions
jo-0.087
o-0.084
ppy-0.083
azz-0.083
y-0.082
Token idea
Token ID2126
Feature activation-2.34e+00

Pos logit contributions
ppy+0.001
jo+0.001
azz+0.001
os+0.001
ne+0.001
Neg logit contributions
 anything-0.001
 good-0.001
 something-0.001
 trouble-0.001
 things-0.001
Token,
Token ID11
Feature activation-5.20e-01

Pos logit contributions
jo+0.057
azz+0.055
ppy+0.055
o+0.052
os+0.051
Neg logit contributions
 anything-0.039
 something-0.037
 trouble-0.036
 good-0.035
 chores-0.034
Token got
Token ID1392
Feature activation+8.58e+00

Pos logit contributions
jo+0.211
ppy+0.199
azz+0.197
ksh+0.195
anne+0.188
Neg logit contributions
 the-0.135
 something-0.133
 top-0.130
 anything-0.129
 many-0.129
Token back
Token ID736
Feature activation+1.58e+00

Pos logit contributions
 anything+0.142
 chores+0.127
 yourselves+0.127
 trouble+0.121
 miles+0.120
Neg logit contributions
o-0.204
jo-0.196
y-0.193
os-0.190
ppy-0.188
Token,
Token ID11
Feature activation+7.71e-01

Pos logit contributions
 anything+0.024
 chores+0.021
 trouble+0.021
 something+0.021
 good+0.020
Neg logit contributions
jo-0.040
o-0.038
ppy-0.038
y-0.038
azz-0.038
Token Jenny
Token ID28437
Feature activation-1.49e+01

Pos logit contributions
 anything+0.013
 miles+0.013
 chores+0.012
 yourselves+0.012
 trouble+0.012
Neg logit contributions
azz-0.016
jo-0.016
y-0.016
ppy-0.015
o-0.015
Token had
Token ID550
Feature activation+6.09e+00

Pos logit contributions
jo+0.330
azz+0.312
itty+0.303
ppy+0.302
uddy+0.291
Neg logit contributions
 something-0.256
 anything-0.253
 top-0.247
 many-0.245
 an-0.242
Token made
Token ID925
Feature activation+6.11e+00

Pos logit contributions
 anything+0.100
 chores+0.091
 trouble+0.088
 something+0.087
 yourselves+0.086
Neg logit contributions
jo-0.143
o-0.138
ppy-0.137
azz-0.136
y-0.134
Token another
Token ID1194
Feature activation+2.81e+00

Pos logit contributions
 anything+0.049
 chores+0.043
 yourselves+0.042
 miles+0.038
 danger+0.038
Neg logit contributions
jo-0.182
o-0.175
y-0.172
azz-0.170
os-0.169
Token decision
Token ID2551
Feature activation-2.15e-02

Pos logit contributions
 anything+0.033
 something+0.031
 trouble+0.031
 chores+0.030
 good+0.030
Neg logit contributions
jo-0.082
ppy-0.080
azz-0.079
o-0.077
os-0.077
Token.
Token ID13
Feature activation-3.56e+00

Pos logit contributions
jo+0.001
ppy+0.000
azz+0.000
o+0.000
y+0.000
Neg logit contributions
 anything-0.000
 something-0.000
 trouble-0.000
 chores-0.000
 good-0.000
Token She
Token ID1375
Feature activation-3.04e+00

Pos logit contributions
jo+0.076
ppy+0.072
o+0.071
azz+0.070
y+0.069
Neg logit contributions
 anything-0.070
 something-0.064
 trouble-0.063
 chores-0.063
 good-0.061
Token wanted
Token ID2227
Feature activation+6.47e+00

Pos logit contributions
jo+0.067
azz+0.065
ppy+0.064
itty+0.060
y+0.060
Neg logit contributions
 something-0.052
 many-0.052
 good-0.051
 great-0.051
 trouble-0.050
Token a
Token ID257
Feature activation+5.59e+00

Pos logit contributions
 anything+0.055
 accomplishment+0.054
 chores+0.054
 bottom+0.049
 trouble+0.046
Neg logit contributions
jo-0.236
ppy-0.233
o-0.231
azz-0.229
y-0.228
Token crown
Token ID12389
Feature activation-7.66e+00

Pos logit contributions
 anything+0.077
 something+0.067
 trouble+0.067
 chores+0.066
 yourselves+0.064
Neg logit contributions
jo-0.149
ppy-0.142
azz-0.142
o-0.141
y-0.138
Token and
Token ID290
Feature activation+6.06e+00

Pos logit contributions
jo+0.213
anne+0.198
ppy+0.190
azz+0.189
o+0.187
Neg logit contributions
 anything-0.109
 something-0.097
 good-0.091
 many-0.089
 chores-0.087
Token a
Token ID257
Feature activation+5.45e+00

Pos logit contributions
 anything+0.048
 chores+0.043
 trouble+0.041
 accomplishment+0.038
 good+0.036
Neg logit contributions
jo-0.186
ppy-0.183
azz-0.181
o-0.179
y-0.177
Token dress
Token ID6576
Feature activation-2.67e+00

Pos logit contributions
 anything+0.085
 something+0.076
 trouble+0.075
 chores+0.075
 good+0.072
Neg logit contributions
jo-0.137
ppy-0.130
azz-0.129
o-0.129
y-0.126
Token at
Token ID379
Feature activation+6.77e+00

Pos logit contributions
jo+0.068
ppy+0.064
azz+0.064
o+0.063
anne+0.063
Neg logit contributions
 anything-0.042
 something-0.038
 good-0.036
 many-0.035
 chores-0.035
Token home
Token ID1363
Feature activation+4.01e-01

Pos logit contributions
 anything+0.066
 trouble+0.063
 chores+0.062
 accomplishment+0.057
 miles+0.055
Neg logit contributions
jo-0.190
o-0.189
y-0.188
ppy-0.186
os-0.185
Token.
Token ID13
Feature activation-2.94e+00

Pos logit contributions
 anything+0.008
 trouble+0.007
 chores+0.007
 something+0.007
 good+0.007
Neg logit contributions
jo-0.009
ppy-0.008
o-0.008
y-0.008
azz-0.008
Token Can
Token ID1680
Feature activation+4.43e+00

Pos logit contributions
jo+0.075
y+0.072
ppy+0.071
anne+0.069
o+0.069
Neg logit contributions
 anything-0.046
 something-0.042
 many-0.040
 good-0.039
 yourselves-0.039
Token I
Token ID314
Feature activation+1.15e+00

Pos logit contributions
 anything+0.071
 trouble+0.065
 chores+0.064
 something+0.064
 good+0.061
Neg logit contributions
jo-0.108
ppy-0.103
azz-0.103
o-0.102
y-0.100
Token go
Token ID467
Feature activation+6.75e+00

Pos logit contributions
 anything+0.016
 something+0.014
 chores+0.014
 good+0.014
 trouble+0.014
Neg logit contributions
jo-0.031
ppy-0.029
azz-0.029
o-0.029
y-0.029
Token bad
Token ID2089
Feature activation+1.93e+00

Pos logit contributions
 avail+0.125
 yourselves+0.118
 accomplishment+0.113
 chores+0.112
 ourselves+0.111
Neg logit contributions
ppy-0.189
o-0.180
y-0.173
jo-0.170
ches-0.167
Token and
Token ID290
Feature activation+6.03e+00

Pos logit contributions
 anything+0.032
 something+0.029
 trouble+0.029
 good+0.028
 chores+0.028
Neg logit contributions
jo-0.048
ppy-0.046
azz-0.045
o-0.045
itty-0.044
Token wanted
Token ID2227
Feature activation+6.38e+00

Pos logit contributions
 anything+0.098
 chores+0.087
 something+0.086
 trouble+0.085
 miles+0.083
Neg logit contributions
jo-0.144
o-0.139
ppy-0.138
y-0.135
azz-0.134
Token to
Token ID284
Feature activation+7.25e+00

Pos logit contributions
 anything+0.114
 chores+0.105
 yourselves+0.103
 trouble+0.102
 something+0.101
Neg logit contributions
jo-0.139
o-0.135
ppy-0.131
azz-0.130
y-0.128
Token help
Token ID1037
Feature activation+3.49e+00

Pos logit contributions
 anything+0.116
 something+0.104
 trouble+0.104
 many+0.102
 chores+0.102
Neg logit contributions
jo-0.169
ppy-0.164
o-0.163
azz-0.160
y-0.156
Token.
Token ID13
Feature activation-3.52e+00

Pos logit contributions
 anything+0.053
 something+0.048
 trouble+0.047
 chores+0.047
 good+0.045
Neg logit contributions
jo-0.089
ppy-0.085
o-0.084
azz-0.084
y-0.082
Token Together
Token ID17083
Feature activation-5.96e-01

Pos logit contributions
jo+0.094
ppy+0.094
y+0.092
ne+0.089
anne+0.089
Neg logit contributions
 anything-0.045
 something-0.045
 trouble-0.044
 himself-0.043
 many-0.042
Token,
Token ID11
Feature activation+1.13e+00

Pos logit contributions
jo+0.015
ppy+0.015
azz+0.014
o+0.014
y+0.014
Neg logit contributions
 anything-0.009
 something-0.008
 trouble-0.008
 chores-0.008
 good-0.008
Token they
Token ID484
Feature activation-3.72e+00

Pos logit contributions
 anything+0.017
 trouble+0.015
 chores+0.015
 something+0.014
 yourselves+0.014
Neg logit contributions
jo-0.029
o-0.028
azz-0.028
ppy-0.027
y-0.027
Token cleaned
Token ID20750
Feature activation+3.97e+00

Pos logit contributions
jo+0.094
azz+0.086
anne+0.083
anna+0.082
itty+0.081
Neg logit contributions
 anything-0.059
 many-0.059
 something-0.057
 good-0.056
 an-0.055
Token the
Token ID262
Feature activation+3.76e+00

Pos logit contributions
 anything+0.044
 chores+0.037
 trouble+0.036
 something+0.036
 miles+0.034
Neg logit contributions
jo-0.117
o-0.113
ppy-0.112
y-0.111
azz-0.110
Token You
Token ID921
Feature activation-1.06e+00

Pos logit contributions
jo+0.085
y+0.080
anne+0.078
ppy+0.077
o+0.074
Neg logit contributions
 anything-0.052
 something-0.048
 yourselves-0.045
 good-0.044
 an-0.044
Token know
Token ID760
Feature activation+3.42e+00

Pos logit contributions
jo+0.027
azz+0.025
o+0.025
ppy+0.025
y+0.025
Neg logit contributions
 anything-0.017
 something-0.015
 chores-0.015
 trouble-0.015
 good-0.015
Token."
Token ID526
Feature activation-4.94e+00

Pos logit contributions
 anything+0.044
 something+0.038
 chores+0.037
 trouble+0.037
 good+0.035
Neg logit contributions
jo-0.097
ppy-0.093
azz-0.092
o-0.092
y-0.090
Token
Token ID198
Feature activation-3.28e+00

Pos logit contributions
jo+0.124
ppy+0.119
azz+0.116
o+0.116
y+0.115
Neg logit contributions
 anything-0.078
 something-0.072
 trouble-0.070
 chores-0.069
 good-0.068
Token
Token ID198
Feature activation-3.65e+00

Pos logit contributions
jo+0.086
ppy+0.084
azz+0.081
y+0.081
o+0.080
Neg logit contributions
 anything-0.045
 something-0.043
 many-0.042
 trouble-0.041
 good-0.041
TokenMom
Token ID29252
Feature activation-1.08e+01

Pos logit contributions
jo+0.087
ppy+0.081
o+0.080
azz+0.080
os+0.079
Neg logit contributions
 anything-0.062
 something-0.059
 good-0.058
 chores-0.058
 many-0.058
Token read
Token ID1100
Feature activation+1.30e+00

Pos logit contributions
jo+0.268
azz+0.236
anna+0.234
pillar+0.230
ppy+0.230
Neg logit contributions
 not-0.172
 a-0.165
 the-0.165
 something-0.162
 many-0.156
Token the
Token ID262
Feature activation+2.09e+00

Pos logit contributions
 anything+0.010
 trouble+0.008
 chores+0.008
 something+0.008
 miles+0.007
Neg logit contributions
jo-0.042
ppy-0.040
azz-0.040
o-0.040
y-0.040
Token card
Token ID2657
Feature activation-8.61e+00

Pos logit contributions
 anything+0.041
 something+0.037
 trouble+0.037
 chores+0.037
 good+0.036
Neg logit contributions
jo-0.045
ppy-0.042
azz-0.042
o-0.042
y-0.041
Token.
Token ID13
Feature activation-3.75e+00

Pos logit contributions
jo+0.217
ppy+0.206
azz+0.203
o+0.202
y+0.196
Neg logit contributions
 anything-0.139
 something-0.128
 chores-0.123
 good-0.121
 many-0.121
Token "
Token ID366
Feature activation-1.06e+00

Pos logit contributions
jo+0.083
ppy+0.078
azz+0.077
o+0.077
y+0.076
Neg logit contributions
 anything-0.071
 something-0.065
 trouble-0.064
 chores-0.064
 good-0.062
Token.
Token ID13
Feature activation-4.05e+00

Pos logit contributions
azz+0.276
jo+0.273
anne+0.268
pillar+0.262
ol+0.258
Neg logit contributions
 many-0.368
 anything-0.366
 something-0.365
 his-0.362
 an-0.357
Token Bob
Token ID5811
Feature activation-1.66e+01

Pos logit contributions
jo+0.094
ppy+0.091
azz+0.090
y+0.086
ne+0.086
Neg logit contributions
 anything-0.071
 something-0.065
 trouble-0.065
 chores-0.064
 many-0.063
Token was
Token ID373
Feature activation+6.79e+00

Pos logit contributions
azz+0.376
jo+0.364
ppy+0.349
anne+0.335
ne+0.333
Neg logit contributions
 many-0.281
 good-0.279
 something-0.277
 anything-0.275
 an-0.267
Token feeling
Token ID4203
Feature activation+7.08e+00

Pos logit contributions
 anything+0.081
 accomplishment+0.070
 chores+0.070
 yourselves+0.069
 trouble+0.067
Neg logit contributions
o-0.198
ppy-0.186
jo-0.182
y-0.180
itty-0.178
Token tired
Token ID10032
Feature activation+4.70e-01

Pos logit contributions
 avail+0.110
 accomplishment+0.103
 yourselves+0.099
 top+0.097
 miles+0.096
Neg logit contributions
ppy-0.168
o-0.162
y-0.152
jo-0.144
itty-0.141
Token.
Token ID13
Feature activation-4.05e+00

Pos logit contributions
 anything+0.010
 chores+0.009
 trouble+0.009
 yourselves+0.009
 something+0.009
Neg logit contributions
jo-0.009
y-0.009
ppy-0.009
o-0.009
azz-0.008
Token He
Token ID679
Feature activation-4.68e+00

Pos logit contributions
jo+0.098
ppy+0.095
azz+0.093
y+0.091
o+0.090
Neg logit contributions
 anything-0.067
 something-0.062
 trouble-0.061
 chores-0.059
 many-0.059
Token decided
Token ID3066
Feature activation+1.32e+00

Pos logit contributions
azz+0.098
jo+0.098
oke+0.092
itty+0.091
os+0.091
Neg logit contributions
 something-0.088
 anything-0.088
 good-0.086
 many-0.085
 trouble-0.083
Token to
Token ID284
Feature activation+4.67e+00

Pos logit contributions
 anything+0.026
 chores+0.025
 trouble+0.025
 accomplishment+0.024
 yourselves+0.024
Neg logit contributions
jo-0.025
o-0.025
ppy-0.023
y-0.023
azz-0.023
Token twist
Token ID14528
Feature activation+2.52e+00

Pos logit contributions
 anything+0.077
 trouble+0.069
 something+0.067
 accomplishment+0.066
 yourselves+0.066
Neg logit contributions
jo-0.108
ppy-0.106
o-0.103
y-0.101
azz-0.099
Token and
Token ID290
Feature activation+5.08e+00

Pos logit contributions
 anything+0.021
 chores+0.020
 trouble+0.019
 accomplishment+0.019
 miles+0.017
Neg logit contributions
jo-0.079
y-0.077
ppy-0.076
o-0.076
azz-0.074
Token to
Token ID284
Feature activation+4.71e+00

Pos logit contributions
 anything+0.028
 something+0.025
 trouble+0.024
 good+0.024
 chores+0.024
Neg logit contributions
jo-0.043
ppy-0.042
azz-0.041
o-0.040
y-0.040
Token do
Token ID466
Feature activation+4.52e+00

Pos logit contributions
 anything+0.063
 something+0.055
 trouble+0.054
 chores+0.053
 good+0.051
Neg logit contributions
jo-0.131
ppy-0.125
azz-0.124
o-0.124
y-0.121
Token.
Token ID13
Feature activation-4.06e+00

Pos logit contributions
 anything+0.096
 chores+0.094
 trouble+0.092
 miles+0.092
 accomplishment+0.092
Neg logit contributions
jo-0.075
y-0.075
ppy-0.073
o-0.070
ne-0.070
Token She
Token ID1375
Feature activation-4.38e+00

Pos logit contributions
jo+0.100
ppy+0.098
azz+0.095
o+0.095
os+0.093
Neg logit contributions
 anything-0.068
 something-0.065
 many-0.062
 top-0.061
 trouble-0.061
Token asked
Token ID1965
Feature activation+1.78e+00

Pos logit contributions
azz+0.105
oke+0.103
jo+0.103
os+0.103
itty+0.101
Neg logit contributions
 something-0.070
 anything-0.066
 many-0.066
 an-0.066
 not-0.066
Token her
Token ID607
Feature activation-3.83e+00

Pos logit contributions
 anything+0.018
 trouble+0.015
 chores+0.015
 something+0.015
 miles+0.014
Neg logit contributions
jo-0.053
ppy-0.052
o-0.051
azz-0.051
y-0.051
Token mom
Token ID1995
Feature activation-9.99e+00

Pos logit contributions
jo+0.115
ppy+0.110
azz+0.108
o+0.107
y+0.107
Neg logit contributions
 anything-0.044
 something-0.039
 chores-0.038
 trouble-0.038
 good-0.038
Token,
Token ID11
Feature activation-7.68e-01

Pos logit contributions
jo+0.250
ppy+0.234
azz+0.233
o+0.228
ne+0.226
Neg logit contributions
 anything-0.166
 something-0.151
 chores-0.147
 many-0.142
 good-0.142
Token "
Token ID366
Feature activation-9.82e-01

Pos logit contributions
jo+0.021
azz+0.019
ppy+0.019
os+0.019
anne+0.019
Neg logit contributions
 anything-0.011
 something-0.011
 good-0.010
 trouble-0.010
 many-0.010
TokenWhere
Token ID8496
Feature activation-1.02e+00

Pos logit contributions
jo+0.026
ppy+0.025
azz+0.025
o+0.025
y+0.024
Neg logit contributions
 anything-0.014
 something-0.012
 trouble-0.012
 chores-0.012
 good-0.011
Token did
Token ID750
Feature activation+5.31e+00

Pos logit contributions
jo+0.025
ppy+0.023
azz+0.023
anne+0.022
y+0.022
Neg logit contributions
 anything-0.018
 something-0.016
 trouble-0.016
 good-0.016
 chores-0.016
Token help
Token ID1037
Feature activation+3.14e+00

Pos logit contributions
ppy+0.008
jo+0.008
o+0.008
azz+0.008
ne+0.008
Neg logit contributions
 anything-0.005
 trouble-0.005
 chores-0.005
 yourselves-0.005
 something-0.004
Token me
Token ID502
Feature activation-2.01e+00

Pos logit contributions
 anything+0.057
 trouble+0.054
 accomplishment+0.053
 chores+0.052
 miles+0.051
Neg logit contributions
jo-0.065
ppy-0.065
o-0.064
y-0.063
ne-0.062
Token carry
Token ID3283
Feature activation+7.28e+00

Pos logit contributions
ppy+0.046
o+0.046
jo+0.044
y+0.044
ne+0.043
Neg logit contributions
 anything-0.032
 trouble-0.031
 accomplishment-0.029
 chores-0.029
 danger-0.028
Token this
Token ID428
Feature activation+1.78e+00

Pos logit contributions
 accomplishment+0.082
 anything+0.074
 trouble+0.072
 comparison+0.071
 bottom+0.071
Neg logit contributions
o-0.187
y-0.186
ppy-0.185
ne-0.183
ches-0.177
Token rock
Token ID3881
Feature activation-8.05e+00

Pos logit contributions
 anything+0.035
 trouble+0.033
 chores+0.033
 yourselves+0.033
 something+0.032
Neg logit contributions
y-0.032
jo-0.032
ppy-0.032
o-0.031
ne-0.031
Token?
Token ID30
Feature activation-2.46e+00

Pos logit contributions
jo+0.184
ppy+0.177
o+0.176
azz+0.173
y+0.172
Neg logit contributions
 anything-0.142
 trouble-0.129
 chores-0.127
 something-0.127
 good-0.123
Token It
Token ID632
Feature activation-2.41e+00

Pos logit contributions
jo+0.064
anne+0.060
y+0.059
ppy+0.059
azz+0.057
Neg logit contributions
 anything-0.038
 something-0.036
 an-0.033
 many-0.033
 good-0.032
Token's
Token ID338
Feature activation+6.23e+00

Pos logit contributions
jo+0.044
azz+0.041
ppy+0.039
anne+0.039
itty+0.037
Neg logit contributions
 anything-0.057
 something-0.053
 good-0.052
 chores-0.051
 trouble-0.051
Token too
Token ID1165
Feature activation+3.35e+00

Pos logit contributions
 anything+0.080
 accomplishment+0.076
 miles+0.075
 bottom+0.075
 yourselves+0.074
Neg logit contributions
o-0.151
ppy-0.147
y-0.137
jo-0.135
ches-0.134
Token heavy
Token ID4334
Feature activation+2.34e+00

Pos logit contributions
 anything+0.033
 chores+0.026
 something+0.025
 trouble+0.025
 miles+0.024
Neg logit contributions
jo-0.103
o-0.101
ppy-0.100
y-0.100
azz-0.098
Token for
Token ID329
Feature activation+5.90e+00

Pos logit contributions
 anything+0.038
 something+0.034
 trouble+0.034
 chores+0.034
 good+0.032
Neg logit contributions
jo-0.057
ppy-0.055
o-0.054
azz-0.054
y-0.053
Token.
Token ID13
Feature activation-3.79e+00

Pos logit contributions
jo+0.001
ppy+0.001
azz+0.001
o+0.001
itty+0.001
Neg logit contributions
 anything-0.001
 something-0.001
 good-0.001
 many-0.001
 trouble-0.001
Token
Token ID198
Feature activation-1.93e+00

Pos logit contributions
ppy+0.108
jo+0.104
y+0.101
azz+0.098
ah+0.098
Neg logit contributions
 anything-0.046
 many-0.046
 something-0.045
 an-0.042
 top-0.041
Token
Token ID198
Feature activation-2.59e+00

Pos logit contributions
jo+0.050
ppy+0.049
azz+0.047
y+0.047
o+0.046
Neg logit contributions
 anything-0.028
 something-0.026
 many-0.026
 trouble-0.025
 good-0.024
TokenThe
Token ID464
Feature activation-1.88e+00

Pos logit contributions
jo+0.072
ppy+0.069
azz+0.068
o+0.068
y+0.067
Neg logit contributions
 anything-0.034
 something-0.030
 chores-0.029
 trouble-0.029
 many-0.028
Token fire
Token ID2046
Feature activation-1.47e+01

Pos logit contributions
jo+0.045
ne+0.043
o+0.043
y+0.043
itty+0.042
Neg logit contributions
 anything-0.028
 something-0.023
 yourselves-0.023
 trouble-0.023
 chores-0.022
Tokenwork
Token ID1818
Feature activation-1.27e+01

Pos logit contributions
azz+0.297
ppy+0.290
jo+0.280
of+0.263
itty+0.257
Neg logit contributions
 something-0.280
 anything-0.273
 things-0.270
 most-0.266
 top-0.265
Token spark
Token ID9009
Feature activation-5.64e+00

Pos logit contributions
jo+0.272
azz+0.261
ppy+0.260
o+0.250
itty+0.243
Neg logit contributions
 anything-0.243
 something-0.230
 good-0.224
 many-0.220
 trouble-0.220
Tokenled
Token ID992
Feature activation+4.52e+00

Pos logit contributions
jo+0.116
azz+0.114
o+0.111
ppy+0.110
itty+0.108
Neg logit contributions
 anything-0.112
 something-0.108
 things-0.103
 many-0.099
 good-0.099
Token and
Token ID290
Feature activation+4.49e+00

Pos logit contributions
 chores+0.086
 anything+0.085
 trouble+0.082
 yourselves+0.079
 accomplishment+0.075
Neg logit contributions
y-0.093
jo-0.090
o-0.088
ppy-0.086
os-0.083
Token tw
Token ID665
Feature activation-5.69e-01

Pos logit contributions
 anything+0.065
 something+0.057
 trouble+0.057
 chores+0.056
 good+0.054
Neg logit contributions
jo-0.119
ppy-0.113
azz-0.112
o-0.112
y-0.110
Tokenink
Token ID676
Feature activation-5.29e+00

Pos logit contributions
jo+0.012
ppy+0.011
o+0.011
azz+0.011
y+0.011
Neg logit contributions
 anything-0.012
 trouble-0.011
 something-0.011
 chores-0.011
 good-0.010
Token nice
Token ID3621
Feature activation-5.82e-01

Pos logit contributions
 anything+0.050
 something+0.041
 trouble+0.040
 good+0.040
 chores+0.040
Neg logit contributions
jo-0.162
ppy-0.157
azz-0.154
o-0.154
y-0.151
Token it
Token ID340
Feature activation-5.95e+00

Pos logit contributions
jo+0.017
ppy+0.017
azz+0.016
o+0.016
y+0.016
Neg logit contributions
 anything-0.007
 something-0.006
 trouble-0.005
 chores-0.005
 good-0.005
Token looked
Token ID3114
Feature activation+3.82e+00

Pos logit contributions
jo+0.137
azz+0.131
ppy+0.130
itty+0.126
anne+0.124
Neg logit contributions
 anything-0.106
 something-0.095
 good-0.094
 great-0.092
 trouble-0.091
Token.
Token ID13
Feature activation-3.87e+00

Pos logit contributions
 anything+0.068
 chores+0.063
 trouble+0.063
 something+0.061
 yourselves+0.060
Neg logit contributions
jo-0.085
ppy-0.080
y-0.080
azz-0.079
o-0.079
Token His
Token ID2399
Feature activation-5.81e-01

Pos logit contributions
jo+0.092
ppy+0.087
azz+0.085
o+0.085
y+0.084
Neg logit contributions
 anything-0.067
 something-0.062
 trouble-0.060
 chores-0.060
 many-0.059
Token aunt
Token ID25949
Feature activation-1.22e+01

Pos logit contributions
jo+0.014
ppy+0.013
azz+0.013
o+0.013
y+0.013
Neg logit contributions
 anything-0.010
 chores-0.009
 something-0.009
 trouble-0.009
 good-0.009
Token looked
Token ID3114
Feature activation+3.18e+00

Pos logit contributions
jo+0.279
azz+0.271
ppy+0.263
ksh+0.250
anne+0.247
Neg logit contributions
 the-0.202
 good-0.196
 something-0.196
 anything-0.195
 many-0.190
Token very
Token ID845
Feature activation+7.14e+00

Pos logit contributions
 anything+0.048
 chores+0.043
 trouble+0.042
 something+0.042
 miles+0.040
Neg logit contributions
jo-0.081
ppy-0.077
azz-0.077
o-0.077
y-0.076
Token happy
Token ID3772
Feature activation+1.68e+00

Pos logit contributions
 anything+0.080
 chores+0.068
 miles+0.066
 trouble+0.065
 something+0.064
Neg logit contributions
jo-0.206
azz-0.199
y-0.198
ppy-0.198
o-0.197
Token.
Token ID13
Feature activation-3.97e+00

Pos logit contributions
 anything+0.033
 something+0.030
 chores+0.030
 trouble+0.030
 good+0.029
Neg logit contributions
jo-0.036
ppy-0.034
azz-0.033
o-0.033
y-0.033
Token She
Token ID1375
Feature activation-3.98e+00

Pos logit contributions
jo+0.096
ppy+0.090
o+0.088
azz+0.088
y+0.087
Neg logit contributions
 anything-0.067
 something-0.063
 trouble-0.061
 many-0.060
 chores-0.060
Token went
Token ID1816
Feature activation+3.49e+00

Pos logit contributions
azz+0.313
jo+0.307
os+0.294
oke+0.291
itty+0.286
Neg logit contributions
 something-0.220
 many-0.210
 an-0.207
 not-0.205
 trouble-0.204
Token to
Token ID284
Feature activation+5.22e+00

Pos logit contributions
 anything+0.067
 trouble+0.060
 chores+0.059
 something+0.059
 yourselves+0.059
Neg logit contributions
jo-0.075
o-0.073
azz-0.071
ppy-0.070
y-0.070
Token her
Token ID607
Feature activation-1.43e+00

Pos logit contributions
 anything+0.069
 miles+0.060
 yourselves+0.059
 chores+0.059
 none+0.058
Neg logit contributions
jo-0.129
o-0.125
azz-0.124
ppy-0.121
os-0.121
Token brother
Token ID3956
Feature activation-1.03e+01

Pos logit contributions
jo+0.036
ppy+0.035
o+0.035
azz+0.035
y+0.034
Neg logit contributions
 anything-0.022
 something-0.019
 chores-0.019
 trouble-0.019
 good-0.018
Token,
Token ID11
Feature activation+1.20e+00

Pos logit contributions
jo+0.244
anne+0.227
ah+0.221
ppy+0.218
os+0.217
Neg logit contributions
 something-0.176
 anything-0.175
 an-0.164
 many-0.160
 trouble-0.158
Token Jack
Token ID3619
Feature activation-1.36e+01

Pos logit contributions
 something+0.019
 anything+0.019
 an+0.018
 many+0.017
 trouble+0.017
Neg logit contributions
jo-0.030
azz-0.028
ppy-0.028
os-0.028
y-0.027
Token,
Token ID11
Feature activation+8.29e-01

Pos logit contributions
jo+0.392
azz+0.386
anne+0.368
affe+0.366
os+0.363
Neg logit contributions
 anything-0.149
 something-0.142
 the-0.141
 an-0.127
 many-0.122
Token and
Token ID290
Feature activation+2.96e+00

Pos logit contributions
 anything+0.016
 something+0.016
 many+0.015
 trouble+0.015
 an+0.015
Neg logit contributions
jo-0.018
azz-0.017
ppy-0.017
anne-0.016
ne-0.016
Token asked
Token ID1965
Feature activation+2.32e+00

Pos logit contributions
 anything+0.060
 something+0.059
 many+0.056
 good+0.055
 trouble+0.055
Neg logit contributions
jo-0.061
ppy-0.057
azz-0.056
o-0.054
os-0.054
Token him
Token ID683
Feature activation-4.33e+00

Pos logit contributions
 anything+0.035
 trouble+0.031
 chores+0.031
 miles+0.030
 something+0.030
Neg logit contributions
jo-0.059
ppy-0.057
o-0.056
azz-0.056
y-0.056
Token if
Token ID611
Feature activation+2.24e+00

Pos logit contributions
jo+0.116
ppy+0.110
azz+0.109
o+0.108
y+0.107
Neg logit contributions
 anything-0.062
 something-0.056
 chores-0.054
 trouble-0.053
 good-0.052
Token touch
Token ID3638
Feature activation+5.13e+00

Pos logit contributions
 anything+0.017
 something+0.015
 good+0.015
 great+0.015
 chores+0.014
Neg logit contributions
jo-0.037
anne-0.033
azz-0.033
ppy-0.033
oke-0.033
Token your
Token ID534
Feature activation+2.36e+00

Pos logit contributions
 anything+0.041
 chores+0.033
 trouble+0.033
 something+0.032
 good+0.029
Neg logit contributions
jo-0.167
ppy-0.161
o-0.160
azz-0.160
y-0.158
Token c
Token ID269
Feature activation-2.99e+00

Pos logit contributions
 anything+0.053
 chores+0.050
 trouble+0.049
 something+0.048
 miles+0.047
Neg logit contributions
jo-0.042
ppy-0.040
o-0.039
y-0.039
azz-0.038
Tokenactus
Token ID34144
Feature activation-9.53e+00

Pos logit contributions
jo+0.053
ppy+0.048
azz+0.047
anne+0.047
o+0.047
Neg logit contributions
 anything-0.068
 something-0.064
 chores-0.062
 trouble-0.062
 good-0.062
Token,
Token ID11
Feature activation+1.85e+00

Pos logit contributions
jo+0.242
azz+0.227
ppy+0.226
o+0.221
anne+0.218
Neg logit contributions
 anything-0.154
 something-0.141
 chores-0.134
 trouble-0.133
 good-0.131
Token Tom
Token ID4186
Feature activation-1.16e+01

Pos logit contributions
 anything+0.030
 something+0.028
 chores+0.027
 good+0.027
 trouble+0.027
Neg logit contributions
jo-0.046
o-0.043
ppy-0.043
os-0.043
azz-0.043
Token?"
Token ID1701
Feature activation-3.53e+00

Pos logit contributions
jo+0.336
azz+0.319
anne+0.316
os+0.314
ppy+0.306
Neg logit contributions
 anything-0.133
 something-0.126
 trouble-0.114
 good-0.113
 chores-0.105
Token she
Token ID673
Feature activation-5.56e+00

Pos logit contributions
jo+0.077
ppy+0.073
azz+0.072
o+0.072
y+0.070
Neg logit contributions
 anything-0.068
 something-0.061
 chores-0.061
 trouble-0.061
 good-0.059
Token asks
Token ID7893
Feature activation+1.12e-03

Pos logit contributions
jo+0.126
o+0.119
os+0.119
itty+0.117
anne+0.114
Neg logit contributions
 anything-0.103
 trouble-0.102
 something-0.097
 good-0.096
 chores-0.095
Token.
Token ID13
Feature activation-3.48e+00

Pos logit contributions
 anything+0.000
 something+0.000
 trouble+0.000
 chores+0.000
 good+0.000
Neg logit contributions
jo-0.000
azz-0.000
ppy-0.000
o-0.000
anne-0.000
Token
Token ID198
Feature activation-2.65e+00

Pos logit contributions
jo+0.096
ppy+0.093
y+0.089
o+0.088
azz+0.088
Neg logit contributions
 anything-0.046
 something-0.044
 trouble-0.042
 many-0.042
 good-0.041
Token flying
Token ID7348
Feature activation+1.55e+00

Pos logit contributions
jo+0.313
azz+0.302
ppy+0.297
enny+0.286
anne+0.281
Neg logit contributions
 anything-0.117
 something-0.116
 many-0.112
 top-0.108
 things-0.106
Token around
Token ID1088
Feature activation+2.54e+00

Pos logit contributions
 anything+0.022
 something+0.020
 trouble+0.019
 chores+0.019
 good+0.019
Neg logit contributions
jo-0.041
ppy-0.039
azz-0.039
o-0.038
ne-0.037
Token the
Token ID262
Feature activation+2.41e+00

Pos logit contributions
 anything+0.038
 chores+0.034
 trouble+0.034
 something+0.033
 good+0.032
Neg logit contributions
jo-0.066
ppy-0.063
o-0.062
azz-0.062
y-0.061
Token flowers
Token ID12734
Feature activation-1.00e+01

Pos logit contributions
 anything+0.029
 something+0.025
 trouble+0.024
 chores+0.024
 good+0.023
Neg logit contributions
jo-0.070
ppy-0.067
azz-0.067
o-0.066
y-0.065
Token.
Token ID13
Feature activation-3.44e+00

Pos logit contributions
jo+0.269
azz+0.250
ppy+0.246
o+0.241
anne+0.238
Neg logit contributions
 anything-0.153
 something-0.141
 many-0.136
 top-0.132
 good-0.128
Token Lily
Token ID20037
Feature activation-1.29e+01

Pos logit contributions
jo+0.086
ppy+0.081
azz+0.079
o+0.078
os+0.077
Neg logit contributions
 anything-0.057
 something-0.053
 trouble-0.051
 many-0.050
 good-0.050
Token liked
Token ID8288
Feature activation+6.73e+00

Pos logit contributions
jo+0.324
azz+0.318
ppy+0.307
itty+0.303
oke+0.300
Neg logit contributions
 not-0.196
 the-0.183
 many-0.179
 an-0.175
 something-0.174
Token the
Token ID262
Feature activation+2.16e+00

Pos logit contributions
 anything+0.086
 chores+0.076
 trouble+0.076
 something+0.075
 good+0.071
Neg logit contributions
jo-0.184
o-0.177
ppy-0.176
azz-0.174
y-0.173
Token bee
Token ID20697
Feature activation-1.17e+01

Pos logit contributions
 anything+0.044
 chores+0.041
 trouble+0.041
 something+0.040
 yourselves+0.039
Neg logit contributions
jo-0.042
ppy-0.040
y-0.040
o-0.039
azz-0.039
Token.
Token ID13
Feature activation-3.80e+00

Pos logit contributions
jo+0.300
azz+0.292
ppy+0.281
ne+0.267
oke+0.266
Neg logit contributions
 anything-0.188
 something-0.179
 good-0.176
 many-0.167
 things-0.167
Token She
Token ID1375
Feature activation-4.36e+00

Pos logit contributions
jo+0.095
ppy+0.089
azz+0.086
o+0.086
ne+0.085
Neg logit contributions
 anything-0.063
 something-0.058
 trouble-0.056
 many-0.056
 good-0.055