TOP ACTIVATIONS
MAX = 13.116

what the rocket discovered in outer space . It was a
able to create a gust of wind that lifted the small
I saw them sticking out of the clothes . Next time
the stars and planets in outer space . So one day
She takes out a magn ifying glass . She looks at
patient and for figuring out such a tricky secret . <|endoftext|>
. The fish slipped out of Tim 's hands and fell
so glad I could deliver such joy to you , my
breat hes out a puff of smoke . Then , he
izz ed off back into outer space . Earth was an
. Soon they were in outer space and having fun .
that they were going to outer space . They watched out
, but his mustache kept getting in the way and preventing
determined to never get into such a disgusting situation again !
of her for picking out such a pretty dress . Lily
't have ended up in such bad situations .
have knocked over a pile of cans . The cans are
Thank you Jack for finding such a nice shelter ."
drop , a big gust of wind blew him over .
Suddenly , with a gust of wind , they were both

MIN ACTIVATIONS
MIN = -17.053

age . The coach spoke stern ly .
like the dentist . The dentist had a big chair ,
wide . The dentist fixed her tooth and Lily
nervous . The dentist was a nice lady .
sit . The dentist had an idea . He
and very excited . The coach was ignorant of the girls
me ." The coach smiled and said , "
to the dentist . The dentist said that Tim had a
me ?" The dragon heard the boy and flew
welcome , kids ," the firefighter says . " But remember
. Suddenly , the scare crow moved and waved its arms
<|endoftext|> One day , Mrs Jones took her pet dog to
. The clown said , " Hi Mia
day . The firefighter grabbed the hose and turned
sunny day , a little crab went to the park .
join . The coach noticed how fit Jim was
day , a quiet all igator went for a walk .
day , Will y the firefighter was feeling very confused .
stop it !" the scare crow said . " You are
it . The dentist said Lily and the bear

INTERVAL 5.573 - 13.116
CONTAINS 6.816%

humble and stop poll uting the air . And they lived
dad when they would get to camp . Mom and dad
, and then scoop ed the sauce on top . Finally
you done ? You are very naughty . You ruined the
great job . Can I take a picture of you and

INTERVAL -1.969 - 5.573
CONTAINS 63.414%

, take some p ears and offer them to your friends
the 3 year old laid down on the log and relaxed
everyone so happy with her enthusiasm and her humble nature .
but very happy . She knew that she had made a
Lily followed him . Inside the garage , they

INTERVAL -9.511 - -1.969
CONTAINS 24.889%

le . Everyone laughed as he happily ate the surprise .
hose and watered the flower . She also found a stick
flowers , leaves , and sticks . The bunny and the
big bag . He was curious and wanted to
loved to laugh out loud . She was always smiling and

INTERVAL -17.053 - -9.511
CONTAINS 4.881%

was a little ball named Billy . Billy loved to roll
, he saw that the bird was carrying a little stick
hill with her sled . Ben is lazy . He walks
a time there was a boy . He was always very
best mel on . The boy 's father said , "
Token what
Token ID644
Feature activation+3.06e+00

Pos logit contributions
 purpose+0.111
 rightful+0.102
 common+0.097
 sight+0.089
 proper+0.085
Neg logit contributions
es-0.169
m-0.168
owed-0.153
bee-0.148
re-0.145
Token the
Token ID262
Feature activation+5.22e+00

Pos logit contributions
 purpose+0.063
 sight+0.061
 common+0.057
 accomplishment+0.056
 rightful+0.056
Neg logit contributions
m-0.050
owed-0.048
es-0.046
re-0.043
oke-0.043
Token rocket
Token ID10701
Feature activation-1.05e+01

Pos logit contributions
 purpose+0.077
 common+0.069
 sight+0.068
 rightful+0.066
 harmony+0.063
Neg logit contributions
m-0.113
bee-0.099
owed-0.099
es-0.098
oke-0.098
Token discovered
Token ID5071
Feature activation+2.29e+00

Pos logit contributions
m+0.135
owed+0.127
ppy+0.118
op+0.118
es+0.114
Neg logit contributions
 sight-0.247
 purpose-0.239
 harmony-0.228
 beauty-0.225
 freedom-0.225
Token in
Token ID287
Feature activation+5.77e+00

Pos logit contributions
 purpose+0.051
 sight+0.050
 common+0.046
 harmony+0.046
 accomplishment+0.046
Neg logit contributions
m-0.035
owed-0.033
es-0.031
re-0.029
oke-0.029
Token outer
Token ID12076
Feature activation+1.23e+01

Pos logit contributions
 proper+0.078
 awesome+0.074
 rightful+0.074
 accomplishment+0.074
 purpose+0.073
Neg logit contributions
m-0.103
es-0.095
y-0.090
'-0.090
re-0.087
Token space
Token ID2272
Feature activation-5.76e-01

Pos logit contributions
 ours+0.132
 purpose+0.131
 accomplishment+0.128
 comparison+0.121
 awesome+0.121
Neg logit contributions
es-0.224
ne-0.223
m-0.221
ird-0.214
'-0.209
Token.
Token ID13
Feature activation-1.39e+00

Pos logit contributions
m+0.008
es+0.007
owed+0.007
re+0.006
oke+0.006
Neg logit contributions
 purpose-0.013
 sight-0.013
 common-0.012
 accomplishment-0.012
 rightful-0.012
Token It
Token ID632
Feature activation-2.31e+00

Pos logit contributions
m+0.024
owed+0.023
es+0.022
op+0.021
ppy+0.021
Neg logit contributions
 purpose-0.028
 sight-0.027
 common-0.025
 harmony-0.025
 accomplishment-0.025
Token was
Token ID373
Feature activation+6.20e+00

Pos logit contributions
m+0.026
owed+0.024
es+0.022
op+0.021
ppy+0.020
Neg logit contributions
 purpose-0.060
 sight-0.059
 common-0.056
 harmony-0.055
 accomplishment-0.055
Token a
Token ID257
Feature activation+4.28e+00

Pos logit contributions
 purpose+0.105
 recess+0.089
 common+0.088
 rightful+0.087
 sidelines+0.086
Neg logit contributions
m-0.106
owed-0.096
es-0.091
o-0.090
y-0.089
Token able
Token ID1498
Feature activation+5.15e+00

Pos logit contributions
 purpose+0.098
 common+0.092
 sight+0.087
 rightful+0.085
 harmony+0.084
Neg logit contributions
m-0.103
owed-0.098
es-0.093
re-0.087
ppy-0.087
Token to
Token ID284
Feature activation+6.65e+00

Pos logit contributions
 purpose+0.109
 common+0.104
 harmony+0.101
 sight+0.100
 accomplishment+0.099
Neg logit contributions
m-0.080
re-0.075
ll-0.068
owed-0.066
es-0.065
Token create
Token ID2251
Feature activation+5.70e+00

Pos logit contributions
 sight+0.115
 purpose+0.115
 common+0.103
 rightful+0.102
 harmony+0.100
Neg logit contributions
owed-0.124
m-0.124
es-0.115
ppy-0.114
bee-0.112
Token a
Token ID257
Feature activation+6.91e+00

Pos logit contributions
 rightful+0.080
 common+0.078
 purpose+0.077
 sight+0.075
 comparison+0.072
Neg logit contributions
es-0.123
owed-0.105
m-0.103
re-0.092
'-0.092
Token gust
Token ID35253
Feature activation-3.91e+00

Pos logit contributions
 common+0.071
 purpose+0.069
 theirs+0.065
 comparison+0.063
ags+0.062
Neg logit contributions
m-0.157
op-0.145
'-0.137
pillar-0.137
oke-0.136
Token of
Token ID286
Feature activation+1.19e+01

Pos logit contributions
owed+0.071
m+0.070
es+0.062
bee+0.062
ll+0.062
Neg logit contributions
 purpose-0.066
 sight-0.065
 harmony-0.062
 victory-0.060
 our-0.060
Token wind
Token ID2344
Feature activation-5.43e+00

Pos logit contributions
 purpose+0.180
 rightful+0.163
 common+0.161
 destination+0.152
 pleasure+0.152
Neg logit contributions
m-0.208
bee-0.196
ne-0.185
oke-0.183
'-0.181
Token that
Token ID326
Feature activation+3.35e+00

Pos logit contributions
m+0.088
owed+0.086
es+0.080
oke+0.077
op+0.076
Neg logit contributions
 purpose-0.111
 sight-0.110
 common-0.100
 harmony-0.100
 accomplishment-0.099
Token lifted
Token ID13663
Feature activation+3.94e+00

Pos logit contributions
 purpose+0.077
 sight+0.075
 common+0.072
 accomplishment+0.071
 rightful+0.070
Neg logit contributions
m-0.047
owed-0.045
es-0.041
re-0.038
op-0.038
Token the
Token ID262
Feature activation+6.76e+00

Pos logit contributions
 purpose+0.064
 rightful+0.064
 sight+0.063
 common+0.063
 accomplishment+0.062
Neg logit contributions
m-0.071
es-0.071
owed-0.068
y-0.065
ppy-0.063
Token small
Token ID1402
Feature activation+6.11e+00

Pos logit contributions
 purpose+0.094
 common+0.088
 sight+0.085
 accomplishment+0.083
 rightful+0.081
Neg logit contributions
m-0.148
op-0.131
bee-0.131
owed-0.129
'-0.128
Token I
Token ID314
Feature activation+3.96e-01

Pos logit contributions
m+0.007
owed+0.007
es+0.007
re+0.006
bee+0.006
Neg logit contributions
 purpose-0.007
 sight-0.007
 common-0.006
 harmony-0.006
 our-0.006
Token saw
Token ID2497
Feature activation+5.10e+00

Pos logit contributions
 purpose+0.007
 sight+0.007
 common+0.007
 fun+0.007
 harmony+0.007
Neg logit contributions
owed-0.007
m-0.006
es-0.006
bee-0.006
oke-0.006
Token them
Token ID606
Feature activation-3.69e+00

Pos logit contributions
 purpose+0.076
 sight+0.073
 common+0.070
 rightful+0.070
 harmony+0.069
Neg logit contributions
m-0.115
es-0.109
owed-0.107
re-0.102
op-0.101
Token sticking
Token ID17274
Feature activation+5.98e+00

Pos logit contributions
m+0.070
owed+0.070
es+0.065
oke+0.065
op+0.064
Neg logit contributions
 purpose-0.064
 sight-0.063
 common-0.056
 beauty-0.056
 awesome-0.056
Token out
Token ID503
Feature activation+5.72e+00

Pos logit contributions
 rightful+0.104
 purpose+0.102
 accomplishment+0.101
 sight+0.097
 harmony+0.095
Neg logit contributions
m-0.115
es-0.108
owed-0.104
ppy-0.104
y-0.100
Token of
Token ID286
Feature activation+1.17e+01

Pos logit contributions
 purpose+0.131
 sight+0.128
 rightful+0.123
 common+0.122
 accomplishment+0.122
Neg logit contributions
m-0.083
es-0.074
owed-0.073
ppy-0.066
re-0.065
Token the
Token ID262
Feature activation+6.92e+00

Pos logit contributions
 rightful+0.185
 purpose+0.167
 former+0.165
urd+0.164
 fullest+0.164
Neg logit contributions
m-0.198
es-0.166
ie-0.163
mallow-0.162
my-0.160
Token clothes
Token ID8242
Feature activation-4.91e+00

Pos logit contributions
 purpose+0.108
 sight+0.107
 common+0.103
 accomplishment+0.101
 harmony+0.100
Neg logit contributions
m-0.145
owed-0.128
es-0.127
op-0.126
ppy-0.124
Token.
Token ID13
Feature activation-9.06e-01

Pos logit contributions
m+0.078
owed+0.078
es+0.072
oke+0.069
bee+0.068
Neg logit contributions
 purpose-0.101
 sight-0.100
 common-0.091
 harmony-0.089
 sand-0.089
Token Next
Token ID7406
Feature activation+8.44e-01

Pos logit contributions
m+0.018
owed+0.017
es+0.016
re+0.015
bee+0.015
Neg logit contributions
 purpose-0.016
 sight-0.016
 common-0.015
 harmony-0.014
 our-0.014
Token time
Token ID640
Feature activation+3.03e-01

Pos logit contributions
 purpose+0.011
 sight+0.011
 common+0.009
 harmony+0.009
 accomplishment+0.009
Neg logit contributions
m-0.021
owed-0.020
es-0.019
re-0.019
ppy-0.018
Token the
Token ID262
Feature activation+4.39e+00

Pos logit contributions
 purpose+0.085
 rightful+0.077
 common+0.075
 sight+0.075
 accomplishment+0.071
Neg logit contributions
m-0.129
es-0.124
owed-0.120
re-0.114
bee-0.114
Token stars
Token ID5788
Feature activation-8.77e+00

Pos logit contributions
 purpose+0.073
 common+0.066
 accomplishment+0.065
 rightful+0.065
 sight+0.064
Neg logit contributions
m-0.086
es-0.072
re-0.072
bee-0.071
ll-0.071
Token and
Token ID290
Feature activation+2.84e+00

Pos logit contributions
m+0.143
owed+0.142
oke+0.129
es+0.127
ppy+0.126
Neg logit contributions
 sight-0.175
 purpose-0.171
 harmony-0.158
 common-0.156
 freedom-0.155
Token planets
Token ID14705
Feature activation-6.99e+00

Pos logit contributions
 purpose+0.056
 sight+0.051
 common+0.049
 rightful+0.049
 accomplishment+0.049
Neg logit contributions
m-0.051
owed-0.048
es-0.047
re-0.045
bee-0.043
Token in
Token ID287
Feature activation+4.71e+00

Pos logit contributions
m+0.109
owed+0.106
es+0.096
op+0.094
oke+0.094
Neg logit contributions
 sight-0.148
 purpose-0.147
 common-0.136
 harmony-0.135
 accomplishment-0.133
Token outer
Token ID12076
Feature activation+1.17e+01

Pos logit contributions
 rightful+0.084
 accomplishment+0.084
 purpose+0.081
 ours+0.080
 awesome+0.080
Neg logit contributions
m-0.077
es-0.070
 were-0.063
'-0.061
mine-0.059
Token space
Token ID2272
Feature activation-9.81e-01

Pos logit contributions
 awesome+0.134
 accomplishment+0.134
 purpose+0.133
 ours+0.133
 unnecessary+0.129
Neg logit contributions
es-0.196
ne-0.187
ie-0.176
ppy-0.175
y-0.173
Token.
Token ID13
Feature activation-2.53e+00

Pos logit contributions
m+0.011
es+0.010
owed+0.010
re+0.008
ppy+0.008
Neg logit contributions
 purpose-0.026
 sight-0.025
 accomplishment-0.024
 pleasure-0.024
 awesome-0.024
Token So
Token ID1406
Feature activation+8.53e-02

Pos logit contributions
owed+0.047
m+0.047
es+0.046
ppy+0.046
bee+0.045
Neg logit contributions
 sight-0.047
 purpose-0.046
 fun-0.041
 common-0.040
 beauty-0.040
Token one
Token ID530
Feature activation-3.92e+00

Pos logit contributions
 purpose+0.002
 sight+0.002
 common+0.002
 harmony+0.002
 rightful+0.002
Neg logit contributions
m-0.001
owed-0.001
es-0.001
re-0.001
op-0.001
Token day
Token ID1110
Feature activation-1.41e+00

Pos logit contributions
m+0.091
owed+0.089
es+0.086
oke+0.083
re+0.083
Neg logit contributions
 purpose-0.054
 sight-0.054
 common-0.048
 harmony-0.047
 accomplishment-0.046
Token She
Token ID1375
Feature activation-2.90e+00

Pos logit contributions
owed+0.040
m+0.040
ppy+0.036
bee+0.036
ll+0.035
Neg logit contributions
 sight-0.037
 purpose-0.037
 common-0.033
 harmony-0.033
 new-0.032
Token takes
Token ID2753
Feature activation+6.07e+00

Pos logit contributions
m+0.058
owed+0.055
ppy+0.051
re+0.050
op+0.050
Neg logit contributions
 sight-0.049
 purpose-0.048
 accomplishment-0.045
 common-0.045
 beauty-0.044
Token out
Token ID503
Feature activation+3.05e+00

Pos logit contributions
 rightful+0.105
 purpose+0.101
 sight+0.099
 harmony+0.095
 common+0.094
Neg logit contributions
es-0.122
m-0.114
owed-0.112
op-0.110
ppy-0.106
Token a
Token ID257
Feature activation+6.02e+00

Pos logit contributions
 purpose+0.044
 rightful+0.041
 sight+0.040
 harmony+0.038
 common+0.037
Neg logit contributions
m-0.070
es-0.069
owed-0.067
op-0.063
ppy-0.062
Token magn
Token ID7842
Feature activation-3.30e-01

Pos logit contributions
 purpose+0.080
 sight+0.076
 rightful+0.072
 harmony+0.070
 common+0.069
Neg logit contributions
m-0.142
owed-0.134
op-0.130
es-0.129
oke-0.127
Tokenifying
Token ID4035
Feature activation+1.14e+01

Pos logit contributions
m+0.007
owed+0.007
es+0.007
op+0.006
ppy+0.006
Neg logit contributions
 purpose-0.005
 sight-0.005
 common-0.005
 harmony-0.005
 accomplishment-0.005
Token glass
Token ID5405
Feature activation-8.02e-01

Pos logit contributions
 accomplishment+0.175
 awesome+0.166
 freedom+0.160
 comparison+0.155
 harmony+0.155
Neg logit contributions
m-0.228
es-0.196
op-0.193
o-0.189
owed-0.189
Token.
Token ID13
Feature activation-2.17e-01

Pos logit contributions
m+0.013
owed+0.012
es+0.012
op+0.011
oke+0.011
Neg logit contributions
 purpose-0.017
 sight-0.017
 common-0.016
 rightful-0.016
 harmony-0.016
Token She
Token ID1375
Feature activation-2.01e+00

Pos logit contributions
m+0.004
owed+0.004
es+0.003
op+0.003
re+0.003
Neg logit contributions
 purpose-0.004
 sight-0.004
 common-0.004
 harmony-0.004
 accomplishment-0.004
Token looks
Token ID3073
Feature activation+4.36e+00

Pos logit contributions
m+0.039
owed+0.037
es+0.035
ppy+0.034
re+0.034
Neg logit contributions
 purpose-0.035
 sight-0.035
 common-0.032
 accomplishment-0.032
 harmony-0.031
Token at
Token ID379
Feature activation+5.30e+00

Pos logit contributions
 purpose+0.101
 sight+0.091
 harmony+0.090
 rightful+0.089
 common+0.087
Neg logit contributions
m-0.065
owed-0.062
es-0.058
ppy-0.054
oke-0.054
Token patient
Token ID5827
Feature activation+6.56e-02

Pos logit contributions
 sight+0.120
 purpose+0.120
 harmony+0.103
 common+0.101
 sand+0.095
Neg logit contributions
m-0.145
owed-0.137
es-0.131
op-0.127
re-0.126
Token and
Token ID290
Feature activation+5.60e+00

Pos logit contributions
 purpose+0.001
 sight+0.001
 common+0.001
 harmony+0.001
 accomplishment+0.001
Neg logit contributions
m-0.001
owed-0.001
es-0.001
re-0.001
op-0.001
Token for
Token ID329
Feature activation+5.35e+00

Pos logit contributions
 purpose+0.101
 sight+0.098
 common+0.091
 harmony+0.085
 rightful+0.081
Neg logit contributions
m-0.109
owed-0.105
es-0.100
re-0.094
ll-0.093
Token figuring
Token ID22714
Feature activation+6.62e+00

Pos logit contributions
 sight+0.085
 purpose+0.085
 common+0.080
 rightful+0.076
 harmony+0.074
Neg logit contributions
m-0.101
es-0.097
owed-0.095
ird-0.088
re-0.086
Token out
Token ID503
Feature activation+3.32e+00

Pos logit contributions
 sight+0.110
 purpose+0.110
 harmony+0.105
 rightful+0.103
 common+0.100
Neg logit contributions
es-0.130
owed-0.129
m-0.127
ppy-0.117
op-0.116
Token such
Token ID884
Feature activation+1.14e+01

Pos logit contributions
 purpose+0.053
 sight+0.053
 common+0.049
 harmony+0.047
 rightful+0.047
Neg logit contributions
m-0.067
owed-0.066
es-0.066
re-0.059
oke-0.059
Token a
Token ID257
Feature activation+9.07e+00

Pos logit contributions
 harmony+0.141
 purpose+0.140
 sight+0.140
 sidelines+0.137
 recess+0.136
Neg logit contributions
es-0.226
owed-0.226
m-0.220
itive-0.202
re-0.200
Token tricky
Token ID17198
Feature activation+7.19e+00

Pos logit contributions
 purpose+0.102
 sight+0.102
 harmony+0.099
 common+0.096
 freedom+0.092
Neg logit contributions
m-0.214
op-0.198
oke-0.195
owed-0.193
re-0.190
Token secret
Token ID3200
Feature activation+2.54e+00

Pos logit contributions
 sight+0.095
 purpose+0.095
 harmony+0.087
 common+0.087
 rightful+0.084
Neg logit contributions
m-0.164
owed-0.157
op-0.150
es-0.150
re-0.148
Token.
Token ID13
Feature activation-1.70e+00

Pos logit contributions
 purpose+0.056
 sight+0.056
 common+0.053
 harmony+0.052
 accomplishment+0.051
Neg logit contributions
m-0.036
owed-0.035
es-0.033
op-0.031
re-0.030
Token<|endoftext|>
Token ID50256
Feature activation-6.05e+00

Pos logit contributions
m+0.031
owed+0.030
es+0.028
bee+0.027
ppy+0.027
Neg logit contributions
 purpose-0.032
 sight-0.032
 common-0.029
 harmony-0.029
 rightful-0.028
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
owed+0.085
oke+0.082
es+0.078
m+0.075
re+0.073
Neg logit contributions
 purpose-0.149
 sight-0.143
 same-0.133
 our-0.133
 new-0.131
Token The
Token ID383
Feature activation+5.56e+00

Pos logit contributions
m+0.033
owed+0.032
es+0.031
ppy+0.030
op+0.029
Neg logit contributions
 purpose-0.029
 sight-0.028
 common-0.025
 harmony-0.025
 accomplishment-0.025
Token fish
Token ID5916
Feature activation-1.20e+01

Pos logit contributions
 purpose+0.090
 sight+0.087
 common+0.087
 harmony+0.084
 accomplishment+0.079
Neg logit contributions
m-0.115
bee-0.102
owed-0.101
op-0.099
ppy-0.097
Token slipped
Token ID18859
Feature activation+4.21e+00

Pos logit contributions
m+0.169
owed+0.166
ppy+0.148
es+0.147
oke+0.145
Neg logit contributions
 sight-0.266
 purpose-0.261
 harmony-0.243
 common-0.241
 accomplishment-0.240
Token out
Token ID503
Feature activation+5.61e+00

Pos logit contributions
 purpose+0.097
 sight+0.095
 common+0.090
 harmony+0.088
 rightful+0.088
Neg logit contributions
m-0.060
owed-0.056
es-0.053
op-0.050
ppy-0.050
Token of
Token ID286
Feature activation+1.14e+01

Pos logit contributions
 purpose+0.120
 sight+0.120
 common+0.110
 harmony+0.109
 accomplishment+0.108
Neg logit contributions
m-0.088
owed-0.085
es-0.078
op-0.075
oke-0.075
Token Tim
Token ID5045
Feature activation-6.34e+00

Pos logit contributions
 rightful+0.213
 destination+0.185
 purpose+0.182
 former+0.182
 awesome+0.180
Neg logit contributions
m-0.191
mallow-0.164
my-0.154
olph-0.150
ie-0.147
Token's
Token ID338
Feature activation+6.75e+00

Pos logit contributions
owed+0.085
oke+0.080
m+0.080
es+0.074
bee+0.069
Neg logit contributions
 purpose-0.143
 sight-0.140
 sand-0.130
 our-0.128
 fun-0.126
Token hands
Token ID2832
Feature activation-3.81e+00

Pos logit contributions
 common+0.113
 sight+0.111
 purpose+0.111
 harmony+0.108
 awesome+0.103
Neg logit contributions
m-0.127
owed-0.117
op-0.114
ppy-0.113
re-0.112
Token and
Token ID290
Feature activation+2.72e+00

Pos logit contributions
owed+0.057
m+0.055
oke+0.052
es+0.050
ppy+0.050
Neg logit contributions
 sight-0.081
 purpose-0.080
 sand-0.072
 accomplishment-0.072
 harmony-0.072
Token fell
Token ID3214
Feature activation+4.03e+00

Pos logit contributions
 purpose+0.060
 sight+0.058
 common+0.056
 rightful+0.054
 harmony+0.054
Neg logit contributions
m-0.041
owed-0.040
es-0.040
ppy-0.035
op-0.034
Token so
Token ID523
Feature activation+6.23e+00

Pos logit contributions
owed+0.018
m+0.018
ppy+0.016
bee+0.016
es+0.016
Neg logit contributions
 purpose-0.017
 sight-0.016
 common-0.016
 harmony-0.015
 fun-0.015
Token glad
Token ID9675
Feature activation+4.34e+00

Pos logit contributions
 purpose+0.109
 sight+0.104
 harmony+0.098
 common+0.096
 rightful+0.094
Neg logit contributions
m-0.122
owed-0.117
re-0.108
es-0.106
op-0.105
Token I
Token ID314
Feature activation+1.08e+00

Pos logit contributions
 purpose+0.081
 sight+0.079
 common+0.073
 harmony+0.072
 accomplishment+0.071
Neg logit contributions
m-0.080
owed-0.077
es-0.073
re-0.070
oke-0.069
Token could
Token ID714
Feature activation+6.05e+00

Pos logit contributions
 sight+0.024
 common+0.023
 purpose+0.023
 freedom+0.022
 fun+0.022
Neg logit contributions
owed-0.013
op-0.013
m-0.013
es-0.013
oke-0.013
Token deliver
Token ID5203
Feature activation+6.77e+00

Pos logit contributions
 sight+0.102
 purpose+0.102
 common+0.091
 harmony+0.090
 accomplishment+0.088
Neg logit contributions
m-0.121
owed-0.118
es-0.112
re-0.108
op-0.107
Token such
Token ID884
Feature activation+1.14e+01

Pos logit contributions
 purpose+0.097
 sight+0.096
 accomplishment+0.092
 rightful+0.091
 comparison+0.090
Neg logit contributions
m-0.139
es-0.139
owed-0.133
re-0.126
'-0.120
Token joy
Token ID8716
Feature activation+1.02e+00

Pos logit contributions
 purpose+0.143
 sight+0.142
 sidelines+0.138
 recess+0.137
 comparison+0.136
Neg logit contributions
owed-0.218
es-0.218
m-0.208
ird-0.198
re-0.191
Token to
Token ID284
Feature activation+7.19e+00

Pos logit contributions
 purpose+0.021
 sight+0.021
 common+0.019
 harmony+0.019
 accomplishment+0.018
Neg logit contributions
m-0.017
owed-0.016
es-0.015
op-0.015
oke-0.015
Token you
Token ID345
Feature activation+1.30e+00

Pos logit contributions
 sight+0.113
 purpose+0.110
 accomplishment+0.101
 common+0.100
 harmony+0.098
Neg logit contributions
m-0.145
owed-0.138
es-0.136
re-0.129
ppy-0.128
Token,
Token ID11
Feature activation+4.14e+00

Pos logit contributions
 purpose+0.027
 sight+0.027
 common+0.025
 harmony+0.025
 accomplishment+0.025
Neg logit contributions
m-0.021
owed-0.020
es-0.019
op-0.018
oke-0.018
Token my
Token ID616
Feature activation+4.93e+00

Pos logit contributions
 purpose+0.074
 sight+0.071
 common+0.066
 harmony+0.065
 accomplishment+0.063
Neg logit contributions
m-0.079
owed-0.073
es-0.071
ppy-0.069
oke-0.068
Token breat
Token ID4882
Feature activation+6.02e+00

Pos logit contributions
m+0.051
owed+0.049
es+0.046
op+0.045
re+0.044
Neg logit contributions
 purpose-0.052
 sight-0.051
 common-0.047
 harmony-0.046
 accomplishment-0.046
Tokenhes
Token ID956
Feature activation+4.58e-01

Pos logit contributions
 purpose+0.123
 sight+0.121
 harmony+0.111
 common+0.111
 rightful+0.111
Neg logit contributions
m-0.106
es-0.100
owed-0.099
oke-0.092
op-0.091
Token out
Token ID503
Feature activation+4.28e+00

Pos logit contributions
 purpose+0.008
 rightful+0.008
 sight+0.008
 common+0.007
 pleasure+0.007
Neg logit contributions
es-0.009
m-0.009
owed-0.009
op-0.008
ppy-0.008
Token a
Token ID257
Feature activation+6.07e+00

Pos logit contributions
 purpose+0.076
 rightful+0.070
 common+0.068
 pleasure+0.065
 sight+0.064
Neg logit contributions
es-0.087
m-0.086
owed-0.077
oke-0.072
y-0.071
Token puff
Token ID33679
Feature activation-1.62e+00

Pos logit contributions
 purpose+0.075
 freedom+0.070
 rightful+0.069
 accomplishment+0.067
 sight+0.067
Neg logit contributions
m-0.144
op-0.128
oke-0.123
es-0.123
'-0.120
Token of
Token ID286
Feature activation+1.14e+01

Pos logit contributions
m+0.026
owed+0.025
es+0.023
re+0.022
ppy+0.022
Neg logit contributions
 purpose-0.035
 sight-0.034
 common-0.032
 harmony-0.031
 accomplishment-0.031
Token smoke
Token ID7523
Feature activation-3.39e+00

Pos logit contributions
 rightful+0.196
 purpose+0.179
 destination+0.178
 common+0.171
 former+0.163
Neg logit contributions
m-0.178
es-0.170
mallow-0.166
bee-0.154
 said-0.149
Token.
Token ID13
Feature activation-8.82e-01

Pos logit contributions
m+0.048
owed+0.045
es+0.043
op+0.040
re+0.040
Neg logit contributions
 purpose-0.078
 sight-0.076
 common-0.072
 accomplishment-0.071
 rightful-0.071
Token Then
Token ID3244
Feature activation-7.27e-01

Pos logit contributions
m+0.012
es+0.012
owed+0.011
oke+0.011
op+0.010
Neg logit contributions
 purpose-0.020
 sight-0.020
 rightful-0.019
 accomplishment-0.019
 beauty-0.019
Token,
Token ID11
Feature activation+3.10e-01

Pos logit contributions
es+0.010
m+0.010
owed+0.009
re+0.008
ppy+0.008
Neg logit contributions
 purpose-0.018
 sight-0.017
 rightful-0.016
 common-0.016
 accomplishment-0.016
Token he
Token ID339
Feature activation-5.00e+00

Pos logit contributions
 purpose+0.007
 sight+0.007
 rightful+0.006
 accomplishment+0.006
 common+0.006
Neg logit contributions
m-0.005
es-0.005
owed-0.004
ppy-0.004
bee-0.004
Tokenizz
Token ID6457
Feature activation-3.95e+00

Pos logit contributions
 purpose+0.013
 sight+0.013
 common+0.012
 accomplishment+0.012
 harmony+0.012
Neg logit contributions
owed-0.014
m-0.014
es-0.013
ppy-0.013
bee-0.013
Tokened
Token ID276
Feature activation+2.39e+00

Pos logit contributions
m+0.049
owed+0.045
op+0.042
oke+0.041
re+0.040
Neg logit contributions
 purpose-0.091
 sight-0.091
 freedom-0.086
 accomplishment-0.086
 our-0.085
Token off
Token ID572
Feature activation+1.84e+00

Pos logit contributions
 purpose+0.041
 sight+0.039
 rightful+0.037
 common+0.037
 awesome+0.036
Neg logit contributions
es-0.045
owed-0.044
m-0.041
'-0.038
y-0.038
Token back
Token ID736
Feature activation+5.12e-01

Pos logit contributions
 purpose+0.036
 sight+0.033
 awesome+0.032
 beauty+0.032
 rightful+0.032
Neg logit contributions
m-0.032
owed-0.030
es-0.030
re-0.028
oke-0.026
Token into
Token ID656
Feature activation+6.75e+00

Pos logit contributions
 purpose+0.008
 sight+0.007
 common+0.007
 awesome+0.007
 rightful+0.007
Neg logit contributions
m-0.011
owed-0.011
es-0.011
re-0.010
op-0.010
Token outer
Token ID12076
Feature activation+1.13e+01

Pos logit contributions
 purpose+0.107
 awesome+0.102
 fault+0.101
 ours+0.097
 rightful+0.095
Neg logit contributions
es-0.105
m-0.104
'-0.092
 who-0.091
 were-0.086
Token space
Token ID2272
Feature activation-6.36e-01

Pos logit contributions
 awesome+0.125
 purpose+0.125
 ours+0.114
 fault+0.113
 impossible+0.113
Neg logit contributions
es-0.203
ne-0.180
y-0.178
ie-0.177
 who-0.175
Token.
Token ID13
Feature activation-1.61e+00

Pos logit contributions
m+0.007
es+0.007
owed+0.007
re+0.006
op+0.006
Neg logit contributions
 purpose-0.016
 sight-0.015
 awesome-0.014
 common-0.014
 rightful-0.014
Token Earth
Token ID3668
Feature activation-9.15e+00

Pos logit contributions
m+0.030
owed+0.029
es+0.028
ppy+0.027
op+0.026
Neg logit contributions
 purpose-0.030
 sight-0.029
 common-0.027
 harmony-0.027
 accomplishment-0.026
Token was
Token ID373
Feature activation+4.78e+00

Pos logit contributions
m+0.116
owed+0.108
es+0.098
op+0.096
ppy+0.095
Neg logit contributions
 sight-0.221
 purpose-0.220
 common-0.205
 harmony-0.205
 accomplishment-0.203
Token an
Token ID281
Feature activation+6.52e+00

Pos logit contributions
 purpose+0.092
 common+0.079
 recess+0.075
 sight+0.075
 rightful+0.075
Neg logit contributions
m-0.083
owed-0.074
o-0.071
es-0.071
y-0.064
Token.
Token ID13
Feature activation-4.32e-01

Pos logit contributions
 purpose+0.060
 rightful+0.056
 sight+0.054
 common+0.054
 awesome+0.052
Neg logit contributions
m-0.042
es-0.042
owed-0.040
re-0.038
op-0.036
Token Soon
Token ID15894
Feature activation+2.39e-01

Pos logit contributions
m+0.008
owed+0.008
es+0.007
op+0.007
re+0.007
Neg logit contributions
 purpose-0.008
 sight-0.008
 common-0.007
 harmony-0.007
 accomplishment-0.007
Token they
Token ID484
Feature activation-1.63e+00

Pos logit contributions
 purpose+0.005
 sight+0.005
 rightful+0.005
 common+0.005
 harmony+0.005
Neg logit contributions
m-0.003
es-0.003
owed-0.003
re-0.003
oke-0.003
Token were
Token ID547
Feature activation+5.93e+00

Pos logit contributions
m+0.020
owed+0.019
es+0.018
op+0.017
oke+0.017
Neg logit contributions
 sight-0.039
 purpose-0.039
 common-0.037
 harmony-0.037
 accomplishment-0.036
Token in
Token ID287
Feature activation+5.09e+00

Pos logit contributions
 purpose+0.121
 common+0.108
 rightful+0.105
 sight+0.101
 harmony+0.098
Neg logit contributions
m-0.108
es-0.098
owed-0.097
re-0.096
ll-0.086
Token outer
Token ID12076
Feature activation+1.12e+01

Pos logit contributions
 ours+0.074
 purpose+0.073
 rightful+0.072
 yours+0.071
 realistic+0.068
Neg logit contributions
m-0.081
 were-0.073
es-0.072
re-0.066
 had-0.065
Token space
Token ID2272
Feature activation-6.30e-01

Pos logit contributions
 purpose+0.136
 comparison+0.126
 ours+0.120
 impossible+0.117
 accomplishment+0.117
Neg logit contributions
es-0.192
ne-0.175
ird-0.175
m-0.170
 who-0.167
Token and
Token ID290
Feature activation+4.52e+00

Pos logit contributions
m+0.007
es+0.007
owed+0.007
re+0.006
ll+0.005
Neg logit contributions
 purpose-0.016
 sight-0.015
 rightful-0.015
 common-0.014
 awesome-0.014
Token having
Token ID1719
Feature activation+6.72e+00

Pos logit contributions
 purpose+0.090
 common+0.078
 sight+0.078
 rightful+0.076
 accomplishment+0.072
Neg logit contributions
m-0.076
es-0.076
owed-0.075
os-0.066
re-0.065
Token fun
Token ID1257
Feature activation+3.46e+00

Pos logit contributions
 purpose+0.092
 rightful+0.087
 common+0.084
 ours+0.077
 accomplishment+0.076
Neg logit contributions
es-0.144
m-0.142
re-0.138
 were-0.126
owed-0.125
Token.
Token ID13
Feature activation-7.87e-01

Pos logit contributions
 purpose+0.078
 sight+0.076
 common+0.072
 rightful+0.070
 harmony+0.070
Neg logit contributions
m-0.051
owed-0.048
es-0.046
re-0.043
op-0.042
Token that
Token ID326
Feature activation+8.89e-01

Pos logit contributions
 purpose+0.051
 sight+0.048
 rightful+0.046
 common+0.045
 accomplishment+0.044
Neg logit contributions
es-0.051
m-0.051
owed-0.050
re-0.046
bee-0.045
Token they
Token ID484
Feature activation-2.04e+00

Pos logit contributions
 purpose+0.020
 sight+0.019
 common+0.018
 rightful+0.018
 accomplishment+0.017
Neg logit contributions
m-0.013
es-0.013
owed-0.013
re-0.011
os-0.011
Token were
Token ID547
Feature activation+5.15e+00

Pos logit contributions
m+0.025
owed+0.024
es+0.022
ppy+0.021
op+0.020
Neg logit contributions
 purpose-0.051
 sight-0.050
 common-0.047
 harmony-0.046
 accomplishment-0.046
Token going
Token ID1016
Feature activation+3.05e+00

Pos logit contributions
 purpose+0.112
 common+0.099
 rightful+0.095
 sight+0.094
 accomplishment+0.087
Neg logit contributions
m-0.080
es-0.080
re-0.071
o-0.071
owed-0.069
Token to
Token ID284
Feature activation+5.57e+00

Pos logit contributions
 purpose+0.067
 rightful+0.060
 accomplishment+0.060
 sight+0.059
 common+0.058
Neg logit contributions
m-0.048
es-0.044
re-0.044
owed-0.042
o-0.041
Token outer
Token ID12076
Feature activation+1.12e+01

Pos logit contributions
 sight+0.098
 purpose+0.096
 rightful+0.091
 accomplishment+0.090
 holes+0.086
Neg logit contributions
m-0.076
ird-0.073
'-0.073
owed-0.072
es-0.070
Token space
Token ID2272
Feature activation-1.02e-01

Pos logit contributions
 accomplishment+0.134
 purpose+0.127
 comparison+0.125
 sidelines+0.122
 refreshing+0.121
Neg logit contributions
es-0.208
ne-0.196
 who-0.190
'-0.183
m-0.182
Token.
Token ID13
Feature activation-1.45e+00

Pos logit contributions
es+0.001
m+0.001
owed+0.001
re+0.001
os+0.001
Neg logit contributions
 purpose-0.003
 sight-0.002
 rightful-0.002
 accomplishment-0.002
 pleasure-0.002
Token They
Token ID1119
Feature activation-2.18e+00

Pos logit contributions
m+0.027
owed+0.026
es+0.025
op+0.024
oke+0.024
Neg logit contributions
 purpose-0.027
 sight-0.026
 common-0.024
 accomplishment-0.024
 freedom-0.024
Token watched
Token ID7342
Feature activation+4.13e+00

Pos logit contributions
m+0.033
owed+0.031
es+0.028
oke+0.028
op+0.027
Neg logit contributions
 sight-0.047
 purpose-0.047
 common-0.044
 harmony-0.044
 accomplishment-0.043
Token out
Token ID503
Feature activation+1.67e+00

Pos logit contributions
 purpose+0.070
 rightful+0.070
 common+0.064
 sight+0.061
 freedom+0.055
Neg logit contributions
es-0.082
owed-0.081
m-0.075
bee-0.071
op-0.070
Token,
Token ID11
Feature activation+2.27e+00

Pos logit contributions
 sight+0.075
 purpose+0.075
 common+0.069
 harmony+0.069
 rightful+0.069
Neg logit contributions
m-0.047
owed-0.043
es-0.043
re-0.039
op-0.038
Token but
Token ID475
Feature activation+2.86e+00

Pos logit contributions
 purpose+0.045
 sight+0.044
 common+0.042
 rightful+0.040
 harmony+0.040
Neg logit contributions
m-0.038
owed-0.036
es-0.035
re-0.032
ppy-0.032
Token his
Token ID465
Feature activation+3.94e+00

Pos logit contributions
 purpose+0.052
 sight+0.051
 common+0.047
 harmony+0.046
 rightful+0.045
Neg logit contributions
m-0.054
owed-0.052
es-0.050
ppy-0.047
re-0.047
Token mustache
Token ID49303
Feature activation-9.74e+00

Pos logit contributions
 common+0.058
 purpose+0.058
 sight+0.057
 harmony+0.056
 comparison+0.053
Neg logit contributions
m-0.080
owed-0.076
op-0.075
es-0.071
re-0.070
Token kept
Token ID4030
Feature activation+8.39e+00

Pos logit contributions
m+0.114
owed+0.107
es+0.097
oke+0.095
ppy+0.094
Neg logit contributions
 purpose-0.246
 sight-0.241
 common-0.224
 beauty-0.224
 rightful-0.223
Token getting
Token ID1972
Feature activation+1.12e+01

Pos logit contributions
 common+0.144
 purpose+0.142
 rightful+0.137
 harmony+0.137
 sight+0.134
Neg logit contributions
m-0.154
es-0.134
re-0.127
owed-0.125
'-0.119
Token in
Token ID287
Feature activation+8.67e+00

Pos logit contributions
 purpose+0.189
 common+0.175
 harmony+0.174
 sight+0.170
 freedom+0.158
Neg logit contributions
m-0.238
re-0.193
owed-0.188
que-0.187
es-0.183
Token the
Token ID262
Feature activation+7.07e+00

Pos logit contributions
 awesome+0.120
 freedom+0.118
 rightful+0.117
 ours+0.116
 beauty+0.113
Neg logit contributions
m-0.171
re-0.145
es-0.140
'-0.137
y-0.131
Token way
Token ID835
Feature activation+4.42e+00

Pos logit contributions
 common+0.103
 purpose+0.091
 harmony+0.087
 sight+0.086
 accomplishment+0.083
Neg logit contributions
m-0.158
'-0.133
que-0.132
re-0.130
op-0.128
Token and
Token ID290
Feature activation+5.07e+00

Pos logit contributions
 purpose+0.094
 sight+0.091
 common+0.090
 harmony+0.088
 rightful+0.086
Neg logit contributions
m-0.072
owed-0.063
re-0.061
es-0.060
op-0.060
Token preventing
Token ID12174
Feature activation+8.93e+00

Pos logit contributions
 purpose+0.105
 common+0.102
 sight+0.099
 harmony+0.094
 awesome+0.093
Neg logit contributions
m-0.083
owed-0.077
es-0.074
re-0.069
ll-0.065
Token determined
Token ID5295
Feature activation+3.24e+00

Pos logit contributions
 purpose+0.109
 common+0.102
 sight+0.101
 harmony+0.093
 rightful+0.090
Neg logit contributions
m-0.102
owed-0.093
es-0.090
o-0.086
ppy-0.083
Token to
Token ID284
Feature activation+5.72e+00

Pos logit contributions
 purpose+0.057
 sight+0.057
 common+0.054
 harmony+0.054
 accomplishment+0.052
Neg logit contributions
m-0.061
owed-0.056
re-0.056
es-0.055
ll-0.052
Token never
Token ID1239
Feature activation+4.78e+00

Pos logit contributions
 sight+0.100
 purpose+0.095
 harmony+0.090
 common+0.088
 accomplishment+0.087
Neg logit contributions
m-0.105
owed-0.102
es-0.097
op-0.095
re-0.094
Token get
Token ID651
Feature activation+9.87e+00

Pos logit contributions
 purpose+0.077
 sight+0.075
 common+0.068
 harmony+0.067
 rightful+0.066
Neg logit contributions
m-0.100
owed-0.098
es-0.094
op-0.091
ppy-0.089
Token into
Token ID656
Feature activation+8.43e+00

Pos logit contributions
 purpose+0.163
 harmony+0.154
 sight+0.152
 accomplishment+0.148
 common+0.147
Neg logit contributions
m-0.203
re-0.183
es-0.179
owed-0.179
ppy-0.169
Token such
Token ID884
Feature activation+1.11e+01

Pos logit contributions
 common+0.089
 rightful+0.087
 harmony+0.085
 purpose+0.084
 awesome+0.083
Neg logit contributions
es-0.187
m-0.186
re-0.168
ird-0.160
itive-0.160
Token a
Token ID257
Feature activation+8.75e+00

Pos logit contributions
 sidelines+0.141
 purpose+0.141
 sight+0.140
 harmony+0.139
 common+0.138
Neg logit contributions
m-0.200
es-0.199
owed-0.198
itive-0.188
re-0.185
Token disgusting
Token ID23374
Feature activation+7.32e+00

Pos logit contributions
 common+0.096
 purpose+0.094
 sight+0.089
 harmony+0.089
 awesome+0.084
Neg logit contributions
m-0.207
op-0.189
'-0.183
ird-0.178
oke-0.177
Token situation
Token ID3074
Feature activation+7.14e-01

Pos logit contributions
 purpose+0.112
 sight+0.110
 common+0.107
 harmony+0.104
 awesome+0.101
Neg logit contributions
m-0.151
owed-0.143
es-0.136
op-0.136
bee-0.134
Token again
Token ID757
Feature activation+1.37e+00

Pos logit contributions
 purpose+0.017
 sight+0.016
 common+0.016
 harmony+0.015
 accomplishment+0.015
Neg logit contributions
m-0.010
owed-0.009
es-0.009
re-0.008
op-0.008
Token!
Token ID0
Feature activation-1.49e+00

Pos logit contributions
 purpose+0.032
 sight+0.031
 common+0.029
 accomplishment+0.029
 harmony+0.029
Neg logit contributions
m-0.019
owed-0.018
es-0.017
re-0.016
op-0.015
Token of
Token ID286
Feature activation+5.11e+00

Pos logit contributions
 purpose+0.084
 sight+0.083
 common+0.078
 harmony+0.077
 accomplishment+0.076
Neg logit contributions
m-0.058
owed-0.054
es-0.051
re-0.048
oke-0.048
Token her
Token ID607
Feature activation+2.27e+00

Pos logit contributions
 purpose+0.083
 sight+0.082
 harmony+0.073
 common+0.073
 rightful+0.072
Neg logit contributions
m-0.107
owed-0.101
es-0.099
re-0.093
ppy-0.093
Token for
Token ID329
Feature activation+4.81e+00

Pos logit contributions
 purpose+0.040
 sight+0.039
 common+0.036
 harmony+0.035
 accomplishment+0.035
Neg logit contributions
m-0.044
owed-0.043
es-0.041
op-0.039
oke-0.039
Token picking
Token ID10868
Feature activation+5.87e+00

Pos logit contributions
 purpose+0.089
 sight+0.088
 common+0.082
 harmony+0.079
 rightful+0.078
Neg logit contributions
m-0.092
owed-0.084
es-0.083
re-0.078
ppy-0.076
Token out
Token ID503
Feature activation+4.04e+00

Pos logit contributions
 purpose+0.087
 sight+0.083
 harmony+0.082
 common+0.081
 accomplishment+0.081
Neg logit contributions
m-0.131
owed-0.119
es-0.119
re-0.111
op-0.110
Token such
Token ID884
Feature activation+1.11e+01

Pos logit contributions
 purpose+0.056
 sight+0.055
 harmony+0.049
 common+0.048
 rightful+0.048
Neg logit contributions
m-0.092
owed-0.087
es-0.087
re-0.080
ppy-0.079
Token a
Token ID257
Feature activation+7.84e+00

Pos logit contributions
 sidelines+0.152
 purpose+0.148
 recess+0.147
 harmony+0.147
 sight+0.145
Neg logit contributions
m-0.202
owed-0.187
es-0.187
 who-0.173
itive-0.171
Token pretty
Token ID2495
Feature activation+6.41e+00

Pos logit contributions
 purpose+0.068
 sight+0.066
 harmony+0.065
 common+0.062
 freedom+0.060
Neg logit contributions
m-0.207
op-0.188
ll-0.181
'-0.180
re-0.178
Token dress
Token ID6576
Feature activation-3.25e+00

Pos logit contributions
 purpose+0.097
 sight+0.092
 common+0.090
 harmony+0.089
 awesome+0.089
Neg logit contributions
m-0.132
owed-0.123
op-0.115
bee-0.115
es-0.115
Token.
Token ID13
Feature activation-1.98e+00

Pos logit contributions
m+0.051
owed+0.049
es+0.046
oke+0.045
ppy+0.044
Neg logit contributions
 purpose-0.068
 sight-0.067
 common-0.062
 harmony-0.061
 rightful-0.061
Token Lily
Token ID20037
Feature activation-8.07e+00

Pos logit contributions
m+0.037
owed+0.036
es+0.034
ppy+0.033
bee+0.033
Neg logit contributions
 purpose-0.036
 sight-0.036
 common-0.032
 rightful-0.032
 harmony-0.032
Token't
Token ID470
Feature activation+8.20e+00

Pos logit contributions
 purpose+0.135
 sight+0.132
 accomplishment+0.120
 common+0.120
 freedom+0.118
Neg logit contributions
m-0.125
owed-0.123
es-0.114
oke-0.110
ppy-0.109
Token have
Token ID423
Feature activation+7.68e+00

Pos logit contributions
 purpose+0.126
 sight+0.120
 common+0.108
 accomplishment+0.107
 freedom+0.106
Neg logit contributions
m-0.181
owed-0.177
es-0.172
op-0.164
oke-0.163
Token ended
Token ID4444
Feature activation+8.38e+00

Pos logit contributions
 harmony+0.117
 common+0.113
 sight+0.111
 purpose+0.111
 outer+0.109
Neg logit contributions
m-0.152
re-0.149
owed-0.134
es-0.131
ll-0.125
Token up
Token ID510
Feature activation+5.81e+00

Pos logit contributions
 harmony+0.136
 sight+0.134
 common+0.133
 purpose+0.132
 pleasure+0.128
Neg logit contributions
m-0.143
es-0.143
owed-0.138
re-0.125
op-0.124
Token in
Token ID287
Feature activation+7.30e+00

Pos logit contributions
 purpose+0.115
 sight+0.111
 common+0.108
 harmony+0.106
 rightful+0.105
Neg logit contributions
m-0.097
es-0.085
re-0.085
owed-0.083
op-0.076
Token such
Token ID884
Feature activation+1.11e+01

Pos logit contributions
 rightful+0.103
 awesome+0.100
 beauty+0.097
 realistic+0.095
 purpose+0.095
Neg logit contributions
m-0.149
es-0.136
re-0.134
'-0.121
op-0.120
Token bad
Token ID2089
Feature activation+6.73e+00

Pos logit contributions
 sight+0.145
 common+0.144
 purpose+0.143
 harmony+0.141
 destination+0.134
Neg logit contributions
m-0.210
owed-0.207
es-0.202
re-0.193
ird-0.192
Token situations
Token ID7445
Feature activation-2.25e-01

Pos logit contributions
 purpose+0.085
 sight+0.083
 common+0.078
 harmony+0.075
 rightful+0.073
Neg logit contributions
m-0.161
owed-0.153
es-0.148
re-0.146
op-0.143
Token.
Token ID13
Feature activation-1.76e+00

Pos logit contributions
m+0.003
owed+0.003
es+0.003
ppy+0.003
op+0.003
Neg logit contributions
 purpose-0.005
 sight-0.005
 common-0.004
 harmony-0.004
 accomplishment-0.004
Token
Token ID220
Feature activation+6.04e-01

Pos logit contributions
m+0.033
owed+0.032
es+0.030
bee+0.029
ppy+0.029
Neg logit contributions
 purpose-0.033
 sight-0.032
 common-0.029
 harmony-0.029
 rightful-0.028
Token
Token ID198
Feature activation+6.93e-01

Pos logit contributions
 purpose+0.011
 sight+0.011
 common+0.010
 harmony+0.010
 accomplishment+0.010
Neg logit contributions
m-0.011
owed-0.010
bee-0.010
ppy-0.010
es-0.010
Token have
Token ID423
Feature activation+5.69e+00

Pos logit contributions
m+0.020
owed+0.019
op+0.018
es+0.018
oke+0.017
Neg logit contributions
 sight-0.030
 purpose-0.029
 common-0.028
 beauty-0.028
 harmony-0.028
Token knocked
Token ID13642
Feature activation+5.44e+00

Pos logit contributions
 purpose+0.089
 sight+0.086
 accomplishment+0.086
 harmony+0.081
 common+0.080
Neg logit contributions
m-0.114
re-0.108
es-0.107
owed-0.107
ppy-0.097
Token over
Token ID625
Feature activation+5.67e+00

Pos logit contributions
 purpose+0.092
 sight+0.089
 common+0.082
 accomplishment+0.082
 harmony+0.081
Neg logit contributions
m-0.112
owed-0.106
es-0.101
re-0.098
ppy-0.097
Token a
Token ID257
Feature activation+6.06e+00

Pos logit contributions
 purpose+0.073
 sight+0.068
 rightful+0.065
 accomplishment+0.063
 harmony+0.063
Neg logit contributions
m-0.136
es-0.127
owed-0.124
re-0.121
op-0.116
Token pile
Token ID14540
Feature activation-9.16e-01

Pos logit contributions
 purpose+0.095
 sight+0.092
 common+0.088
 accomplishment+0.087
 freedom+0.085
Neg logit contributions
m-0.128
op-0.113
oke-0.112
owed-0.112
es-0.110
Token of
Token ID286
Feature activation+1.11e+01

Pos logit contributions
m+0.014
owed+0.013
es+0.012
op+0.012
re+0.012
Neg logit contributions
 purpose-0.020
 sight-0.020
 common-0.018
 harmony-0.018
 accomplishment-0.018
Token cans
Token ID23916
Feature activation-1.98e+00

Pos logit contributions
 purpose+0.162
 sight+0.157
 rightful+0.149
 common+0.146
 accomplishment+0.144
Neg logit contributions
m-0.242
es-0.214
owed-0.211
re-0.207
op-0.204
Token.
Token ID13
Feature activation-1.53e+00

Pos logit contributions
m+0.029
owed+0.028
es+0.026
re+0.024
op+0.024
Neg logit contributions
 purpose-0.045
 sight-0.044
 common-0.041
 harmony-0.041
 accomplishment-0.040
Token The
Token ID383
Feature activation+4.52e+00

Pos logit contributions
m+0.030
owed+0.029
es+0.028
ppy+0.027
op+0.027
Neg logit contributions
 purpose-0.027
 sight-0.026
 common-0.024
 harmony-0.023
 accomplishment-0.023
Token cans
Token ID23916
Feature activation-6.46e+00

Pos logit contributions
 purpose+0.074
 sight+0.072
 common+0.067
 harmony+0.066
 accomplishment+0.063
Neg logit contributions
m-0.095
owed-0.087
es-0.084
op-0.083
oke-0.083
Token are
Token ID389
Feature activation+5.63e+00

Pos logit contributions
m+0.070
owed+0.064
es+0.058
re+0.054
op+0.054
Neg logit contributions
 purpose-0.173
 sight-0.170
 common-0.161
 harmony-0.160
 rightful-0.159
TokenThank
Token ID10449
Feature activation+2.93e+00

Pos logit contributions
owed+0.001
m+0.001
es+0.000
oke+0.000
ppy+0.000
Neg logit contributions
 purpose-0.001
 sight-0.001
 common-0.000
 harmony-0.000
 freedom-0.000
Token you
Token ID345
Feature activation+3.98e-01

Pos logit contributions
 purpose+0.042
 sight+0.040
 our+0.038
 harmony+0.037
 common+0.037
Neg logit contributions
owed-0.066
m-0.065
oke-0.060
es-0.060
ppy-0.060
Token Jack
Token ID3619
Feature activation-7.62e+00

Pos logit contributions
 purpose+0.009
 sight+0.009
 common+0.008
 harmony+0.008
 accomplishment+0.008
Neg logit contributions
m-0.006
owed-0.006
es-0.005
ppy-0.005
oke-0.005
Token for
Token ID329
Feature activation+4.40e+00

Pos logit contributions
m+0.126
owed+0.124
op+0.113
es+0.113
oke+0.109
Neg logit contributions
 purpose-0.153
 sight-0.150
 common-0.138
 accomplishment-0.136
 harmony-0.136
Token finding
Token ID4917
Feature activation+7.39e+00

Pos logit contributions
 sight+0.083
 purpose+0.081
 common+0.078
 rightful+0.075
 destination+0.074
Neg logit contributions
m-0.075
es-0.071
owed-0.066
bee-0.063
'-0.063
Token such
Token ID884
Feature activation+1.11e+01

Pos logit contributions
 rightful+0.100
 accomplishment+0.100
 purpose+0.100
 comparison+0.098
 harmony+0.097
Neg logit contributions
es-0.156
m-0.141
ie-0.134
owed-0.129
ppy-0.128
Token a
Token ID257
Feature activation+8.09e+00

Pos logit contributions
 sidelines+0.175
 destination+0.167
 sight+0.163
 recess+0.160
 purpose+0.158
Neg logit contributions
es-0.165
m-0.164
owed-0.156
que-0.149
ird-0.148
Token nice
Token ID3621
Feature activation+7.27e+00

Pos logit contributions
 purpose+0.080
 sight+0.078
 comparison+0.076
 harmony+0.073
 heights+0.072
Neg logit contributions
m-0.188
ll-0.170
'-0.169
op-0.167
ird-0.161
Token shelter
Token ID11772
Feature activation-2.37e+00

Pos logit contributions
 purpose+0.085
 awesome+0.083
 sight+0.082
 harmony+0.080
 common+0.079
Neg logit contributions
m-0.168
owed-0.157
op-0.150
re-0.149
bee-0.147
Token."
Token ID526
Feature activation-1.97e+00

Pos logit contributions
m+0.041
owed+0.039
es+0.037
op+0.035
re+0.035
Neg logit contributions
 purpose-0.047
 sight-0.047
 common-0.043
 harmony-0.042
 accomplishment-0.042
Token
Token ID220
Feature activation+8.12e-01

Pos logit contributions
m+0.041
owed+0.041
bee+0.038
ppy+0.037
op+0.037
Neg logit contributions
 sight-0.030
 purpose-0.030
 our-0.027
 harmony-0.027
 accomplishment-0.027
Tokendrop
Token ID14781
Feature activation-2.50e+00

Pos logit contributions
m+0.068
owed+0.065
es+0.061
op+0.058
re+0.058
Neg logit contributions
 purpose-0.083
 sight-0.081
 common-0.075
 harmony-0.074
 accomplishment-0.074
Token,
Token ID11
Feature activation+2.30e+00

Pos logit contributions
m+0.038
owed+0.037
es+0.034
op+0.032
oke+0.032
Neg logit contributions
 purpose-0.054
 sight-0.053
 common-0.049
 harmony-0.049
 accomplishment-0.048
Token a
Token ID257
Feature activation+4.12e+00

Pos logit contributions
 purpose+0.053
 sight+0.051
 common+0.049
 rightful+0.049
 harmony+0.048
Neg logit contributions
m-0.032
owed-0.029
es-0.029
bee-0.027
ppy-0.026
Token big
Token ID1263
Feature activation+5.64e+00

Pos logit contributions
 purpose+0.063
 sight+0.061
 common+0.057
 accomplishment+0.057
 rightful+0.057
Neg logit contributions
m-0.087
op-0.077
bee-0.077
owed-0.076
oke-0.075
Token gust
Token ID35253
Feature activation-4.12e+00

Pos logit contributions
 purpose+0.109
 common+0.101
 sight+0.101
 rightful+0.100
 harmony+0.096
Neg logit contributions
m-0.100
bee-0.086
owed-0.086
oke-0.084
es-0.083
Token of
Token ID286
Feature activation+1.11e+01

Pos logit contributions
owed+0.079
m+0.073
es+0.071
re+0.069
ll+0.069
Neg logit contributions
 sight-0.064
 purpose-0.062
 harmony-0.060
 our-0.058
 common-0.057
Token wind
Token ID2344
Feature activation-6.10e+00

Pos logit contributions
 purpose+0.164
 rightful+0.155
 pleasure+0.152
 common+0.145
 destination+0.142
Neg logit contributions
m-0.234
bee-0.209
ie-0.207
itty-0.204
ppy-0.200
Token blew
Token ID17948
Feature activation+5.07e+00

Pos logit contributions
owed+0.097
m+0.096
es+0.092
ppy+0.089
oke+0.088
Neg logit contributions
 sight-0.122
 purpose-0.119
 common-0.107
 harmony-0.106
 freedom-0.104
Token him
Token ID683
Feature activation-2.78e+00

Pos logit contributions
 purpose+0.090
 sight+0.086
 rightful+0.084
 common+0.084
 accomplishment+0.082
Neg logit contributions
m-0.097
es-0.090
owed-0.089
ppy-0.083
re-0.083
Token over
Token ID625
Feature activation+6.59e+00

Pos logit contributions
m+0.050
owed+0.048
es+0.045
op+0.043
ppy+0.043
Neg logit contributions
 purpose-0.053
 sight-0.053
 common-0.048
 harmony-0.048
 accomplishment-0.047
Token.
Token ID13
Feature activation-1.55e+00

Pos logit contributions
 rightful+0.122
 purpose+0.122
 awesome+0.118
 common+0.116
 accomplishment+0.115
Neg logit contributions
m-0.117
es-0.099
re-0.098
op-0.089
y-0.088
Token Suddenly
Token ID24975
Feature activation+9.33e-01

Pos logit contributions
m+0.034
owed+0.033
es+0.031
bee+0.030
ppy+0.029
Neg logit contributions
 purpose-0.034
 sight-0.034
 common-0.031
 harmony-0.030
 rightful-0.030
Token,
Token ID11
Feature activation+1.71e+00

Pos logit contributions
 purpose+0.021
 sight+0.021
 common+0.019
 harmony+0.019
 accomplishment+0.019
Neg logit contributions
m-0.014
owed-0.013
es-0.012
op-0.011
ppy-0.011
Token with
Token ID351
Feature activation+5.77e+00

Pos logit contributions
 purpose+0.027
 sight+0.026
 common+0.025
 rightful+0.025
 harmony+0.024
Neg logit contributions
m-0.035
owed-0.033
es-0.033
re-0.031
op-0.031
Token a
Token ID257
Feature activation+4.44e+00

Pos logit contributions
 purpose+0.095
 rightful+0.089
 common+0.083
 sight+0.077
 harmony+0.077
Neg logit contributions
m-0.105
es-0.099
owed-0.095
ppy-0.092
o-0.091
Token gust
Token ID35253
Feature activation-5.64e+00

Pos logit contributions
 purpose+0.052
 sight+0.049
 common+0.048
 ours+0.047
 rightful+0.047
Neg logit contributions
m-0.108
op-0.097
owed-0.096
es-0.096
oke-0.095
Token of
Token ID286
Feature activation+1.11e+01

Pos logit contributions
owed+0.113
m+0.107
bee+0.104
ll+0.102
ops+0.102
Neg logit contributions
 sight-0.077
 harmony-0.074
 purpose-0.073
 our-0.072
 victory-0.071
Token wind
Token ID2344
Feature activation-7.72e+00

Pos logit contributions
 purpose+0.146
 common+0.128
 rightful+0.127
 sight+0.122
 pleasure+0.120
Neg logit contributions
m-0.260
ppy-0.229
owed-0.229
bee-0.227
ie-0.224
Token,
Token ID11
Feature activation+2.52e+00

Pos logit contributions
m+0.115
owed+0.114
es+0.106
oke+0.104
ppy+0.104
Neg logit contributions
 sight-0.158
 purpose-0.153
 sand-0.138
 harmony-0.138
 common-0.138
Token they
Token ID484
Feature activation-2.04e+00

Pos logit contributions
 purpose+0.036
 sight+0.033
 common+0.031
 rightful+0.031
 harmony+0.030
Neg logit contributions
m-0.059
es-0.055
owed-0.055
oke-0.052
op-0.052
Token were
Token ID547
Feature activation+5.06e+00

Pos logit contributions
m+0.030
owed+0.029
oke+0.028
es+0.026
op+0.026
Neg logit contributions
 sight-0.043
 purpose-0.042
 harmony-0.040
 common-0.039
 sand-0.039
Token both
Token ID1111
Feature activation+3.36e+00

Pos logit contributions
 purpose+0.095
 common+0.089
 sight+0.086
 rightful+0.083
 accomplishment+0.081
Neg logit contributions
m-0.099
es-0.092
owed-0.091
re-0.086
ppy-0.084
Token age
Token ID2479
Feature activation-4.78e+00

Pos logit contributions
 purpose+0.020
 sight+0.020
 common+0.018
 harmony+0.018
 accomplishment+0.018
Neg logit contributions
m-0.024
owed-0.023
es-0.022
re-0.021
oke-0.021
Token.
Token ID13
Feature activation-5.12e+00

Pos logit contributions
m+0.072
owed+0.070
es+0.064
ppy+0.061
op+0.061
Neg logit contributions
 purpose-0.104
 sight-0.103
 common-0.095
 harmony-0.094
 accomplishment-0.093
Token
Token ID198
Feature activation-2.60e+00

Pos logit contributions
owed+0.095
m+0.095
ppy+0.088
es+0.088
bee+0.087
Neg logit contributions
 sight-0.095
 purpose-0.094
 common-0.085
 harmony-0.084
 fun-0.083
Token
Token ID198
Feature activation-3.48e+00

Pos logit contributions
owed+0.051
m+0.049
ppy+0.047
bee+0.046
es+0.046
Neg logit contributions
 purpose-0.045
 sight-0.045
 common-0.043
 fun-0.041
 new-0.040
TokenThe
Token ID464
Feature activation+1.28e+00

Pos logit contributions
owed+0.064
m+0.063
es+0.058
ppy+0.057
oke+0.057
Neg logit contributions
 purpose-0.067
 sight-0.065
 common-0.061
 harmony-0.059
 fun-0.059
Token coach
Token ID3985
Feature activation-1.71e+01

Pos logit contributions
 purpose+0.022
 sight+0.022
 harmony+0.020
 freedom+0.020
 common+0.020
Neg logit contributions
m-0.026
re-0.022
oke-0.022
es-0.021
owed-0.021
Token spoke
Token ID5158
Feature activation-1.58e+00

Pos logit contributions
owed+0.270
m+0.250
ppy+0.241
ops+0.233
es+0.232
Neg logit contributions
 sight-0.315
 fun-0.301
 new-0.290
 our-0.286
 common-0.283
Token stern
Token ID26370
Feature activation-7.12e-01

Pos logit contributions
m+0.031
owed+0.031
es+0.029
ppy+0.027
op+0.027
Neg logit contributions
 purpose-0.027
 sight-0.026
 common-0.024
 harmony-0.024
 accomplishment-0.024
Tokenly
Token ID306
Feature activation-2.79e+00

Pos logit contributions
m+0.009
owed+0.008
es+0.008
op+0.007
re+0.007
Neg logit contributions
 purpose-0.017
 sight-0.017
 common-0.016
 beauty-0.016
 harmony-0.016
Token.
Token ID13
Feature activation-4.48e+00

Pos logit contributions
m+0.044
owed+0.042
es+0.038
ppy+0.038
re+0.037
Neg logit contributions
 purpose-0.059
 sight-0.058
 common-0.053
 accomplishment-0.052
 beauty-0.052
Token
Token ID6184
Feature activation-1.31e+00

Pos logit contributions
m+0.082
owed+0.079
es+0.075
op+0.071
ppy+0.071
Neg logit contributions
 purpose-0.084
 sight-0.083
 common-0.076
 harmony-0.075
 accomplishment-0.074
Token like
Token ID588
Feature activation+5.06e+00

Pos logit contributions
 purpose+0.069
 sight+0.065
 common+0.064
 fun+0.060
 harmony+0.060
Neg logit contributions
m-0.084
owed-0.082
es-0.080
oke-0.077
op-0.075
Token the
Token ID262
Feature activation+4.32e+00

Pos logit contributions
 sight+0.080
 purpose+0.080
 common+0.078
 rightful+0.077
 harmony+0.076
Neg logit contributions
m-0.104
es-0.097
owed-0.095
op-0.091
re-0.090
Token dentist
Token ID38408
Feature activation-1.32e+01

Pos logit contributions
 purpose+0.074
 sight+0.072
 common+0.070
 harmony+0.069
 accomplishment+0.066
Neg logit contributions
m-0.088
owed-0.080
es-0.075
re-0.074
op-0.073
Token.
Token ID13
Feature activation-3.03e+00

Pos logit contributions
m+0.209
owed+0.198
op+0.183
es+0.182
ppy+0.180
Neg logit contributions
 purpose-0.268
 sight-0.259
 fun-0.242
 common-0.241
 awesome-0.236
Token The
Token ID383
Feature activation+2.07e+00

Pos logit contributions
m+0.050
owed+0.048
es+0.047
op+0.044
oke+0.043
Neg logit contributions
 sight-0.061
 purpose-0.060
 common-0.057
 freedom-0.056
 accomplishment-0.056
Token dentist
Token ID38408
Feature activation-1.65e+01

Pos logit contributions
 purpose+0.034
 sight+0.034
 common+0.031
 harmony+0.030
 accomplishment+0.030
Neg logit contributions
m-0.043
owed-0.041
es-0.039
op-0.038
re-0.038
Token had
Token ID550
Feature activation+4.17e+00

Pos logit contributions
m+0.224
ppy+0.201
owed+0.199
op+0.192
bee+0.192
Neg logit contributions
 sight-0.349
 purpose-0.346
 fun-0.336
 accomplishment-0.325
 new-0.322
Token a
Token ID257
Feature activation+4.08e+00

Pos logit contributions
 sight+0.070
 purpose+0.069
 accomplishment+0.068
 harmony+0.068
 common+0.064
Neg logit contributions
m-0.080
owed-0.076
es-0.075
re-0.073
ird-0.068
Token big
Token ID1263
Feature activation+5.27e+00

Pos logit contributions
 harmony+0.053
 purpose+0.052
 sight+0.050
 freedom+0.049
 common+0.048
Neg logit contributions
m-0.093
op-0.085
owed-0.082
ird-0.081
oke-0.080
Token chair
Token ID5118
Feature activation-9.04e+00

Pos logit contributions
 purpose+0.093
 sight+0.090
 common+0.089
 harmony+0.088
 rightful+0.087
Neg logit contributions
m-0.098
owed-0.090
op-0.086
es-0.084
re-0.082
Token,
Token ID11
Feature activation+8.69e-01

Pos logit contributions
m+0.141
owed+0.134
es+0.126
op+0.118
oke+0.118
Neg logit contributions
 purpose-0.197
 sight-0.194
 common-0.181
 harmony-0.179
 accomplishment-0.177
Token wide
Token ID3094
Feature activation+6.28e-01

Pos logit contributions
m+0.069
owed+0.065
es+0.061
op+0.058
re+0.058
Neg logit contributions
 purpose-0.086
 sight-0.084
 common-0.079
 harmony-0.078
 accomplishment-0.077
Token.
Token ID13
Feature activation-2.97e+00

Pos logit contributions
 purpose+0.014
 sight+0.014
 common+0.013
 harmony+0.013
 rightful+0.013
Neg logit contributions
m-0.009
owed-0.009
es-0.008
re-0.008
op-0.007
Token
Token ID198
Feature activation-7.08e-01

Pos logit contributions
m+0.049
owed+0.048
es+0.044
ppy+0.043
bee+0.043
Neg logit contributions
 purpose-0.061
 sight-0.061
 common-0.055
 harmony-0.054
 accomplishment-0.054
Token
Token ID198
Feature activation-1.23e+00

Pos logit contributions
owed+0.013
m+0.013
ppy+0.012
bee+0.012
es+0.012
Neg logit contributions
 purpose-0.013
 sight-0.012
 common-0.011
 fun-0.011
 new-0.011
TokenThe
Token ID464
Feature activation+2.42e+00

Pos logit contributions
m+0.022
owed+0.022
es+0.020
re+0.019
oke+0.019
Neg logit contributions
 purpose-0.024
 sight-0.023
 common-0.021
 harmony-0.021
 accomplishment-0.021
Token dentist
Token ID38408
Feature activation-1.64e+01

Pos logit contributions
 purpose+0.038
 sight+0.038
 common+0.034
 accomplishment+0.033
 harmony+0.033
Neg logit contributions
m-0.052
owed-0.050
es-0.048
op-0.046
ppy-0.046
Token fixed
Token ID5969
Feature activation+1.45e+00

Pos logit contributions
m+0.246
ppy+0.223
owed+0.221
bee+0.218
re+0.215
Neg logit contributions
 sight-0.336
 purpose-0.323
 accomplishment-0.318
 fun-0.306
 harmony-0.305
Token her
Token ID607
Feature activation+2.30e+00

Pos logit contributions
 rightful+0.021
 purpose+0.021
 sight+0.021
 common+0.020
 harmony+0.020
Neg logit contributions
es-0.032
m-0.032
owed-0.030
ppy-0.028
op-0.028
Token tooth
Token ID16162
Feature activation-8.09e+00

Pos logit contributions
 purpose+0.044
 sight+0.043
 common+0.041
 harmony+0.040
 accomplishment+0.039
Neg logit contributions
m-0.041
owed-0.040
es-0.038
ppy-0.036
op-0.035
Token and
Token ID290
Feature activation+2.22e+00

Pos logit contributions
m+0.128
owed+0.120
oke+0.111
re+0.110
es+0.109
Neg logit contributions
 purpose-0.171
 sight-0.168
 accomplishment-0.152
 freedom-0.151
 beauty-0.151
Token Lily
Token ID20037
Feature activation-9.03e+00

Pos logit contributions
 purpose+0.043
 sight+0.042
 common+0.040
 harmony+0.038
 rightful+0.037
Neg logit contributions
m-0.040
owed-0.038
es-0.037
ppy-0.034
oke-0.033
Token nervous
Token ID10927
Feature activation+4.63e-01

Pos logit contributions
 purpose+0.110
 destination+0.100
 sight+0.099
 common+0.098
 accomplishment+0.096
Neg logit contributions
m-0.080
es-0.076
owed-0.073
ppy-0.064
 who-0.060
Token.
Token ID13
Feature activation-1.83e+00

Pos logit contributions
 purpose+0.011
 sight+0.011
 common+0.010
 harmony+0.010
 accomplishment+0.010
Neg logit contributions
m-0.006
owed-0.006
es-0.006
oke-0.005
op-0.005
Token
Token ID198
Feature activation-9.49e-01

Pos logit contributions
m+0.030
owed+0.029
es+0.027
ppy+0.026
bee+0.026
Neg logit contributions
 purpose-0.038
 sight-0.038
 common-0.035
 harmony-0.034
 beauty-0.033
Token
Token ID198
Feature activation-1.53e+00

Pos logit contributions
m+0.016
owed+0.015
es+0.014
ppy+0.014
re+0.014
Neg logit contributions
 purpose-0.019
 sight-0.019
 common-0.018
 harmony-0.017
 accomplishment-0.017
TokenThe
Token ID464
Feature activation+2.47e+00

Pos logit contributions
owed+0.024
m+0.024
es+0.021
re+0.021
oke+0.021
Neg logit contributions
 purpose-0.032
 sight-0.031
 common-0.030
 our-0.029
 harmony-0.029
Token dentist
Token ID38408
Feature activation-1.64e+01

Pos logit contributions
 purpose+0.040
 sight+0.040
 common+0.036
 harmony+0.035
 accomplishment+0.035
Neg logit contributions
m-0.052
owed-0.050
es-0.047
op-0.046
oke-0.046
Token was
Token ID373
Feature activation+3.29e+00

Pos logit contributions
m+0.237
owed+0.222
ppy+0.219
op+0.213
re+0.211
Neg logit contributions
 sight-0.347
 purpose-0.337
 accomplishment-0.316
 fun-0.316
 common-0.315
Token a
Token ID257
Feature activation+3.15e+00

Pos logit contributions
 purpose+0.064
 harmony+0.059
 sight+0.056
 common+0.054
 pleasure+0.053
Neg logit contributions
m-0.064
owed-0.056
es-0.052
oke-0.051
o-0.050
Token nice
Token ID3621
Feature activation+2.36e+00

Pos logit contributions
 purpose+0.046
 sight+0.043
 harmony+0.042
 common+0.042
 freedom+0.040
Neg logit contributions
m-0.071
oke-0.061
op-0.061
owed-0.061
es-0.059
Token lady
Token ID10846
Feature activation-1.13e+01

Pos logit contributions
 purpose+0.043
 sight+0.043
 common+0.038
 accomplishment+0.038
 harmony+0.038
Neg logit contributions
m-0.045
owed-0.044
es-0.042
ppy-0.040
re-0.040
Token.
Token ID13
Feature activation-3.06e+00

Pos logit contributions
owed+0.165
m+0.159
bee+0.155
re+0.151
int+0.148
Neg logit contributions
 sight-0.219
 purpose-0.209
 harmony-0.208
 our-0.204
 peace-0.201
Token sit
Token ID1650
Feature activation+3.04e+00

Pos logit contributions
 purpose+0.096
 sight+0.096
 common+0.086
 harmony+0.085
 accomplishment+0.085
Neg logit contributions
m-0.122
owed-0.118
es-0.111
ppy-0.108
re-0.108
Token.
Token ID13
Feature activation-1.72e+00

Pos logit contributions
 purpose+0.061
 sight+0.060
 common+0.056
 harmony+0.055
 accomplishment+0.054
Neg logit contributions
m-0.053
owed-0.050
es-0.048
op-0.045
ppy-0.045
Token
Token ID198
Feature activation-7.19e-01

Pos logit contributions
m+0.023
owed+0.021
es+0.020
op+0.019
re+0.019
Neg logit contributions
 purpose-0.041
 sight-0.040
 common-0.038
 accomplishment-0.038
 harmony-0.038
Token
Token ID198
Feature activation-1.23e+00

Pos logit contributions
m+0.012
owed+0.011
es+0.010
ppy+0.010
re+0.010
Neg logit contributions
 purpose-0.015
 sight-0.015
 common-0.014
 harmony-0.013
 accomplishment-0.013
TokenThe
Token ID464
Feature activation+2.82e+00

Pos logit contributions
m+0.020
owed+0.020
es+0.018
op+0.017
re+0.017
Neg logit contributions
 purpose-0.026
 sight-0.025
 common-0.024
 harmony-0.023
 accomplishment-0.023
Token dentist
Token ID38408
Feature activation-1.64e+01

Pos logit contributions
 purpose+0.046
 sight+0.046
 common+0.042
 harmony+0.041
 accomplishment+0.040
Neg logit contributions
m-0.059
owed-0.056
es-0.054
op-0.052
oke-0.052
Token had
Token ID550
Feature activation+4.76e+00

Pos logit contributions
m+0.226
ppy+0.205
re+0.204
op+0.201
owed+0.201
Neg logit contributions
 sight-0.345
 purpose-0.336
 fun-0.324
 accomplishment-0.317
 common-0.313
Token an
Token ID281
Feature activation+7.36e+00

Pos logit contributions
 purpose+0.075
 sight+0.075
 harmony+0.074
 accomplishment+0.072
 common+0.068
Neg logit contributions
m-0.098
owed-0.093
re-0.090
es-0.089
ird-0.083
Token idea
Token ID2126
Feature activation-3.31e+00

Pos logit contributions
 purpose+0.106
 sight+0.103
 harmony+0.097
 common+0.097
 rightful+0.090
Neg logit contributions
m-0.163
owed-0.162
es-0.149
op-0.148
bee-0.147
Token.
Token ID13
Feature activation-3.06e+00

Pos logit contributions
m+0.063
ppy+0.062
owed+0.061
op+0.060
int+0.059
Neg logit contributions
 sight-0.054
 purpose-0.052
 fun-0.051
 common-0.048
 freedom-0.048
Token He
Token ID679
Feature activation-3.38e+00

Pos logit contributions
m+0.048
owed+0.045
es+0.043
oke+0.042
op+0.040
Neg logit contributions
 purpose-0.067
 sight-0.065
 common-0.062
 harmony-0.061
 beauty-0.059
Token and
Token ID290
Feature activation+5.65e-01

Pos logit contributions
owed+0.041
m+0.037
ppy+0.036
op+0.035
int+0.034
Neg logit contributions
 sight-0.045
 purpose-0.042
 fun-0.040
 new-0.040
 harmony-0.039
Token very
Token ID845
Feature activation+2.73e+00

Pos logit contributions
 purpose+0.011
 sight+0.010
 accomplishment+0.009
 common+0.009
 rightful+0.009
Neg logit contributions
m-0.011
owed-0.010
es-0.010
ppy-0.009
op-0.009
Token excited
Token ID6568
Feature activation+3.68e-01

Pos logit contributions
 purpose+0.035
 sight+0.034
 harmony+0.029
 accomplishment+0.029
 common+0.029
Neg logit contributions
m-0.067
owed-0.065
es-0.062
oke-0.061
op-0.060
Token.
Token ID13
Feature activation-3.87e+00

Pos logit contributions
 sight+0.007
 purpose+0.007
 common+0.007
 fun+0.007
 harmony+0.007
Neg logit contributions
m-0.006
owed-0.006
es-0.006
op-0.006
ppy-0.006
Token The
Token ID383
Feature activation+2.56e+00

Pos logit contributions
owed+0.079
m+0.078
ppy+0.076
bee+0.074
es+0.073
Neg logit contributions
 sight-0.065
 purpose-0.063
 common-0.058
 our-0.058
 harmony-0.057
Token coach
Token ID3985
Feature activation-1.63e+01

Pos logit contributions
 purpose+0.042
 sight+0.041
 ours+0.038
 freedom+0.038
 outer+0.038
Neg logit contributions
m-0.055
re-0.045
'-0.044
op-0.043
ll-0.043
Token was
Token ID373
Feature activation+3.31e+00

Pos logit contributions
owed+0.243
m+0.237
ppy+0.226
es+0.216
op+0.213
Neg logit contributions
 sight-0.320
 fun-0.315
 new-0.299
 common-0.296
 our-0.294
Token ignorant
Token ID20109
Feature activation-7.37e-01

Pos logit contributions
 purpose+0.071
 sight+0.062
 holes+0.059
 accomplishment+0.059
 harmony+0.059
Neg logit contributions
m-0.060
o-0.049
re-0.045
oke-0.044
es-0.043
Token of
Token ID286
Feature activation+4.33e+00

Pos logit contributions
m+0.012
owed+0.012
es+0.011
ppy+0.011
op+0.011
Neg logit contributions
 purpose-0.015
 sight-0.015
 common-0.014
 harmony-0.013
 fun-0.013
Token the
Token ID262
Feature activation+4.84e+00

Pos logit contributions
 purpose+0.055
 sight+0.053
 accomplishment+0.048
 rightful+0.048
 harmony+0.046
Neg logit contributions
m-0.105
owed-0.101
es-0.099
ppy-0.094
oke-0.093
Token girls
Token ID4813
Feature activation-1.07e+01

Pos logit contributions
 purpose+0.079
 sight+0.076
 common+0.073
 outer+0.072
 harmony+0.071
Neg logit contributions
m-0.102
re-0.086
op-0.082
oke-0.081
es-0.081
Token me
Token ID502
Feature activation-1.96e+00

Pos logit contributions
 purpose+0.090
 sight+0.088
 harmony+0.084
 rightful+0.082
 accomplishment+0.082
Neg logit contributions
m-0.112
owed-0.103
re-0.102
ppy-0.097
ll-0.096
Token."
Token ID526
Feature activation-3.21e+00

Pos logit contributions
m+0.034
owed+0.032
es+0.030
oke+0.030
op+0.029
Neg logit contributions
 purpose-0.038
 sight-0.037
 common-0.035
 harmony-0.034
 freedom-0.034
Token
Token ID198
Feature activation-1.45e+00

Pos logit contributions
owed+0.059
m+0.058
es+0.053
ppy+0.053
bee+0.052
Neg logit contributions
 purpose-0.061
 sight-0.060
 common-0.055
 harmony-0.053
 fun-0.053
Token
Token ID198
Feature activation-1.37e+00

Pos logit contributions
m+0.022
owed+0.021
es+0.020
op+0.019
oke+0.019
Neg logit contributions
 purpose-0.032
 sight-0.032
 common-0.029
 harmony-0.029
 accomplishment-0.029
TokenThe
Token ID464
Feature activation+2.66e+00

Pos logit contributions
m+0.023
owed+0.022
es+0.020
re+0.020
oke+0.020
Neg logit contributions
 purpose-0.028
 sight-0.028
 common-0.026
 harmony-0.026
 accomplishment-0.025
Token coach
Token ID3985
Feature activation-1.63e+01

Pos logit contributions
 purpose+0.039
 sight+0.038
 harmony+0.035
 outer+0.034
 common+0.033
Neg logit contributions
m-0.061
oke-0.051
bee-0.051
es-0.050
re-0.050
Token smiled
Token ID13541
Feature activation-1.89e+00

Pos logit contributions
owed+0.245
m+0.231
itive+0.227
op+0.227
ppy+0.227
Neg logit contributions
 sight-0.310
 joy-0.298
 fun-0.289
 new-0.286
 success-0.282
Token and
Token ID290
Feature activation+4.80e-01

Pos logit contributions
m+0.026
owed+0.026
es+0.023
op+0.023
re+0.023
Neg logit contributions
 sight-0.043
 purpose-0.043
 common-0.039
 harmony-0.039
 accomplishment-0.039
Token said
Token ID531
Feature activation-1.17e+00

Pos logit contributions
 sight+0.012
 purpose+0.012
 fun+0.011
 common+0.011
 accomplishment+0.011
Neg logit contributions
m-0.005
owed-0.005
op-0.005
es-0.005
re-0.005
Token,
Token ID11
Feature activation-2.94e+00

Pos logit contributions
m+0.019
owed+0.018
ppy+0.017
bee+0.017
re+0.016
Neg logit contributions
 purpose-0.024
 sight-0.023
 common-0.021
 our-0.021
 accomplishment-0.021
Token "
Token ID366
Feature activation-2.72e+00

Pos logit contributions
owed+0.068
m+0.065
ppy+0.063
re+0.062
es+0.062
Neg logit contributions
 sight-0.037
 purpose-0.036
 new-0.033
 fun-0.032
 accomplishment-0.031
Token to
Token ID284
Feature activation+4.39e+00

Pos logit contributions
m+0.079
owed+0.075
es+0.072
op+0.070
re+0.069
Neg logit contributions
 purpose-0.064
 sight-0.063
 harmony-0.059
 common-0.058
 accomplishment-0.057
Token the
Token ID262
Feature activation+3.40e+00

Pos logit contributions
 accomplishment+0.067
 sight+0.066
 purpose+0.066
 harmony+0.066
 beauty+0.066
Neg logit contributions
m-0.075
'-0.068
ird-0.066
re-0.064
 who-0.060
Token dentist
Token ID38408
Feature activation-1.29e+01

Pos logit contributions
 accomplishment+0.075
 harmony+0.072
 common+0.072
 freedom+0.068
 purpose+0.067
Neg logit contributions
m-0.052
'-0.041
op-0.037
board-0.036
itty-0.035
Token.
Token ID13
Feature activation-3.76e+00

Pos logit contributions
bee+0.230
m+0.227
owed+0.217
oke+0.214
itive+0.203
Neg logit contributions
 purpose-0.218
 sight-0.214
 new-0.193
 fun-0.190
 harmony-0.188
Token The
Token ID383
Feature activation+1.86e+00

Pos logit contributions
m+0.085
owed+0.083
es+0.080
ppy+0.078
bee+0.076
Neg logit contributions
 purpose-0.055
 sight-0.054
 common-0.047
 accomplishment-0.046
 harmony-0.046
Token dentist
Token ID38408
Feature activation-1.63e+01

Pos logit contributions
 purpose+0.031
 sight+0.031
 common+0.029
 harmony+0.028
 accomplishment+0.028
Neg logit contributions
m-0.038
owed-0.036
es-0.034
op-0.033
oke-0.033
Token said
Token ID531
Feature activation-1.40e+00

Pos logit contributions
m+0.235
bee+0.227
ppy+0.224
itive+0.211
re+0.207
Neg logit contributions
 sight-0.327
 purpose-0.316
 fun-0.312
 new-0.308
 original-0.302
Token that
Token ID326
Feature activation+1.88e-01

Pos logit contributions
m+0.026
owed+0.025
es+0.024
ppy+0.023
re+0.023
Neg logit contributions
 purpose-0.026
 sight-0.026
 common-0.024
 harmony-0.023
 accomplishment-0.023
Token Tim
Token ID5045
Feature activation-1.02e+01

Pos logit contributions
 purpose+0.003
 sight+0.003
 common+0.003
 harmony+0.003
 accomplishment+0.003
Neg logit contributions
m-0.004
owed-0.004
es-0.004
ppy-0.003
op-0.003
Token had
Token ID550
Feature activation+4.20e+00

Pos logit contributions
m+0.137
owed+0.131
oke+0.127
bee+0.124
mun+0.117
Neg logit contributions
 purpose-0.211
 sight-0.202
 fun-0.194
 original-0.193
 rightful-0.192
Token a
Token ID257
Feature activation+4.82e+00

Pos logit contributions
 sight+0.065
 purpose+0.065
 harmony+0.064
 common+0.063
 accomplishment+0.063
Neg logit contributions
m-0.088
re-0.081
owed-0.078
es-0.078
ppy-0.074
Token me
Token ID502
Feature activation-4.11e+00

Pos logit contributions
 accomplishment+0.077
 purpose+0.076
 sight+0.075
 rightful+0.074
 common+0.072
Neg logit contributions
m-0.088
es-0.086
owed-0.079
ppy-0.079
ie-0.077
Token?"
Token ID1701
Feature activation-4.26e+00

Pos logit contributions
m+0.061
owed+0.059
es+0.055
oke+0.054
re+0.053
Neg logit contributions
 purpose-0.089
 sight-0.089
 common-0.083
 harmony-0.081
 awesome-0.080
Token
Token ID198
Feature activation-2.66e+00

Pos logit contributions
m+0.063
owed+0.058
es+0.056
op+0.052
oke+0.051
Neg logit contributions
 purpose-0.097
 sight-0.094
 accomplishment-0.088
 common-0.087
 harmony-0.087
Token
Token ID198
Feature activation-2.62e+00

Pos logit contributions
m+0.042
owed+0.040
es+0.037
ppy+0.035
op+0.035
Neg logit contributions
 purpose-0.057
 sight-0.056
 common-0.052
 harmony-0.052
 accomplishment-0.051
TokenThe
Token ID464
Feature activation+2.91e+00

Pos logit contributions
m+0.033
owed+0.029
es+0.029
ppy+0.029
op+0.027
Neg logit contributions
 purpose-0.064
 sight-0.064
 accomplishment-0.061
 pleasure-0.058
 freedom-0.058
Token dragon
Token ID10441
Feature activation-1.63e+01

Pos logit contributions
 purpose+0.049
 sight+0.047
 common+0.044
 accomplishment+0.043
 harmony+0.043
Neg logit contributions
m-0.059
owed-0.056
es-0.054
oke-0.051
ppy-0.051
Token heard
Token ID2982
Feature activation+1.68e+00

Pos logit contributions
m+0.251
op+0.239
re+0.237
ird+0.235
oke+0.221
Neg logit contributions
 sight-0.336
 fun-0.310
 common-0.306
 purpose-0.303
 new-0.300
Token the
Token ID262
Feature activation+3.63e+00

Pos logit contributions
 purpose+0.030
 rightful+0.030
 sight+0.027
 harmony+0.027
 common+0.027
Neg logit contributions
m-0.033
es-0.031
owed-0.030
ppy-0.030
oke-0.029
Token boy
Token ID2933
Feature activation-1.20e+01

Pos logit contributions
 purpose+0.046
 sight+0.045
 common+0.039
 harmony+0.038
 beauty+0.037
Neg logit contributions
m-0.089
owed-0.088
es-0.084
op-0.082
ppy-0.081
Token and
Token ID290
Feature activation+3.51e-01

Pos logit contributions
ird+0.199
m+0.199
ops+0.197
owed+0.197
op+0.197
Neg logit contributions
 sight-0.206
 new-0.191
 sand-0.187
 harmony-0.187
 joy-0.186
Token flew
Token ID13112
Feature activation+1.34e+00

Pos logit contributions
 sight+0.007
 our+0.007
 purpose+0.007
 new+0.007
 sand+0.007
Neg logit contributions
m-0.006
op-0.005
owed-0.005
oke-0.005
ops-0.005
Token welcome
Token ID7062
Feature activation-6.63e-01

Pos logit contributions
 purpose+0.039
 sight+0.037
 accomplishment+0.033
 common+0.032
 rightful+0.032
Neg logit contributions
m-0.066
owed-0.059
es-0.058
ppy-0.057
re-0.056
Token,
Token ID11
Feature activation+1.98e+00

Pos logit contributions
m+0.010
owed+0.010
es+0.009
op+0.008
ppy+0.008
Neg logit contributions
 purpose-0.015
 sight-0.015
 common-0.014
 harmony-0.013
 accomplishment-0.013
Token kids
Token ID3988
Feature activation-9.94e+00

Pos logit contributions
 purpose+0.037
 sight+0.036
 common+0.034
 harmony+0.033
 rightful+0.033
Neg logit contributions
m-0.037
owed-0.034
es-0.033
oke-0.032
re-0.031
Token,"
Token ID553
Feature activation-3.20e+00

Pos logit contributions
owed+0.188
m+0.178
bee+0.178
int+0.176
op+0.174
Neg logit contributions
 purpose-0.157
 fun-0.155
 sight-0.154
 our-0.152
 peace-0.143
Token the
Token ID262
Feature activation+2.15e+00

Pos logit contributions
m+0.066
oke+0.058
es+0.056
owed+0.055
re+0.053
Neg logit contributions
 purpose-0.056
 sight-0.052
 harmony-0.046
 outer-0.045
 rightful-0.045
Token firefighter
Token ID43105
Feature activation-1.62e+01

Pos logit contributions
 purpose+0.034
 sight+0.033
 common+0.030
 harmony+0.029
 accomplishment+0.029
Neg logit contributions
m-0.047
owed-0.045
es-0.043
op-0.041
oke-0.041
Token says
Token ID1139
Feature activation-3.07e+00

Pos logit contributions
m+0.284
owed+0.273
op+0.273
re+0.258
es+0.257
Neg logit contributions
 sight-0.281
 our-0.266
 common-0.266
 purpose-0.266
 fun-0.264
Token.
Token ID13
Feature activation-3.26e+00

Pos logit contributions
m+0.051
owed+0.049
op+0.045
es+0.045
re+0.045
Neg logit contributions
 sight-0.060
 purpose-0.060
 accomplishment-0.056
 common-0.055
 harmony-0.054
Token "
Token ID366
Feature activation-2.94e+00

Pos logit contributions
m+0.058
owed+0.055
es+0.053
oke+0.050
op+0.050
Neg logit contributions
 purpose-0.064
 sight-0.063
 common-0.058
 harmony-0.057
 accomplishment-0.057
TokenBut
Token ID1537
Feature activation-4.43e-01

Pos logit contributions
owed+0.049
m+0.049
es+0.044
oke+0.043
re+0.043
Neg logit contributions
 purpose-0.059
 sight-0.058
 harmony-0.054
 common-0.054
 freedom-0.053
Token remember
Token ID3505
Feature activation+3.64e-01

Pos logit contributions
m+0.008
owed+0.008
es+0.007
oke+0.007
ll+0.007
Neg logit contributions
 purpose-0.009
 sight-0.008
 common-0.008
 harmony-0.007
 rightful-0.007
Token.
Token ID13
Feature activation-3.38e+00

Pos logit contributions
owed+0.067
m+0.066
es+0.062
op+0.058
ppy+0.057
Neg logit contributions
 purpose-0.064
 sight-0.063
 common-0.061
 harmony-0.060
 rightful-0.057
Token Suddenly
Token ID24975
Feature activation-1.54e+00

Pos logit contributions
m+0.066
owed+0.063
es+0.059
op+0.058
re+0.058
Neg logit contributions
 purpose-0.060
 sight-0.058
 common-0.054
 harmony-0.053
 accomplishment-0.053
Token,
Token ID11
Feature activation-1.64e-02

Pos logit contributions
m+0.024
owed+0.024
es+0.021
ppy+0.021
oke+0.021
Neg logit contributions
 purpose-0.033
 sight-0.033
 common-0.030
 accomplishment-0.029
 harmony-0.029
Token the
Token ID262
Feature activation+2.02e+00

Pos logit contributions
m+0.000
owed+0.000
es+0.000
op+0.000
re+0.000
Neg logit contributions
 purpose-0.000
 sight-0.000
 common-0.000
 harmony-0.000
 rightful-0.000
Token scare
Token ID19437
Feature activation-3.63e+00

Pos logit contributions
 purpose+0.035
 harmony+0.033
 sight+0.033
 common+0.032
 accomplishment+0.032
Neg logit contributions
m-0.039
owed-0.036
bee-0.034
op-0.034
re-0.033
Tokencrow
Token ID47114
Feature activation-1.62e+01

Pos logit contributions
m+0.060
owed+0.058
es+0.054
ppy+0.053
ll+0.052
Neg logit contributions
 purpose-0.074
 sight-0.073
 common-0.067
 harmony-0.066
 freedom-0.065
Token moved
Token ID3888
Feature activation+2.16e-01

Pos logit contributions
m+0.252
owed+0.237
ppy+0.230
oke+0.227
bee+0.219
Neg logit contributions
 sight-0.318
 purpose-0.298
 beauty-0.280
 sand-0.278
 fun-0.275
Token and
Token ID290
Feature activation+2.23e+00

Pos logit contributions
 purpose+0.004
 sight+0.004
 rightful+0.004
 common+0.004
 accomplishment+0.004
Neg logit contributions
m-0.004
owed-0.004
es-0.004
ppy-0.004
op-0.003
Token waved
Token ID26834
Feature activation+7.64e-01

Pos logit contributions
 purpose+0.035
 sight+0.034
 common+0.031
 harmony+0.031
 accomplishment+0.030
Neg logit contributions
m-0.048
owed-0.047
es-0.045
op-0.043
re-0.043
Token its
Token ID663
Feature activation+1.87e+00

Pos logit contributions
 purpose+0.013
 sight+0.012
 common+0.012
 rightful+0.012
 harmony+0.012
Neg logit contributions
m-0.014
owed-0.014
es-0.014
ppy-0.013
bee-0.013
Token arms
Token ID5101
Feature activation-6.96e+00

Pos logit contributions
 purpose+0.026
 sight+0.025
 harmony+0.024
 common+0.024
 rightful+0.023
Neg logit contributions
owed-0.041
m-0.041
es-0.039
re-0.038
ppy-0.038
Token<|endoftext|>
Token ID50256
Feature activation-8.48e+00

Pos logit contributions
m+0.037
owed+0.037
bee+0.035
es+0.035
ppy+0.034
Neg logit contributions
 purpose-0.035
 sight-0.034
 harmony-0.031
 pleasure-0.030
 common-0.030
TokenOne
Token ID3198
Feature activation-3.07e+00

Pos logit contributions
owed+0.178
bee+0.162
ppy+0.161
m+0.157
oke+0.156
Neg logit contributions
 sight-0.145
 purpose-0.136
 peace-0.125
 common-0.125
 harmony-0.123
Token day
Token ID1110
Feature activation-1.56e+00

Pos logit contributions
m+0.069
owed+0.066
es+0.062
op+0.061
re+0.061
Neg logit contributions
 purpose-0.046
 sight-0.044
 common-0.041
 harmony-0.040
 accomplishment-0.040
Token,
Token ID11
Feature activation-1.34e+00

Pos logit contributions
owed+0.030
m+0.030
oke+0.029
ppy+0.028
es+0.028
Neg logit contributions
 purpose-0.027
 sight-0.027
 common-0.025
 our-0.024
 fun-0.024
Token Mrs
Token ID9074
Feature activation-6.82e+00

Pos logit contributions
m+0.037
owed+0.036
es+0.035
oke+0.034
ppy+0.034
Neg logit contributions
 purpose-0.013
 sight-0.013
 common-0.011
 harmony-0.010
 accomplishment-0.010
Token Jones
Token ID5437
Feature activation-1.62e+01

Pos logit contributions
m+0.127
owed+0.125
op+0.122
re+0.118
oke+0.117
Neg logit contributions
 sight-0.123
 purpose-0.121
 common-0.114
 fun-0.107
 new-0.107
Token took
Token ID1718
Feature activation+9.23e-01

Pos logit contributions
owed+0.250
m+0.245
op+0.235
bee+0.233
int+0.220
Neg logit contributions
 sight-0.301
 common-0.290
 purpose-0.286
 our-0.279
 their-0.276
Token her
Token ID607
Feature activation-1.83e-01

Pos logit contributions
 purpose+0.014
 sight+0.013
 rightful+0.013
 accomplishment+0.012
 harmony+0.012
Neg logit contributions
m-0.020
owed-0.019
es-0.019
re-0.018
ppy-0.018
Token pet
Token ID4273
Feature activation-6.09e+00

Pos logit contributions
m+0.004
owed+0.004
es+0.003
op+0.003
re+0.003
Neg logit contributions
 purpose-0.003
 sight-0.003
 harmony-0.003
 common-0.003
 accomplishment-0.003
Token dog
Token ID3290
Feature activation-1.01e+01

Pos logit contributions
m+0.122
owed+0.120
es+0.111
oke+0.109
ppy+0.108
Neg logit contributions
 purpose-0.104
 sight-0.103
 common-0.091
 harmony-0.090
 rightful-0.089
Token to
Token ID284
Feature activation+2.60e+00

Pos logit contributions
m+0.204
owed+0.201
oke+0.188
bee+0.185
re+0.181
Neg logit contributions
 purpose-0.165
 sight-0.161
 our-0.147
 common-0.146
 new-0.144
Token.
Token ID13
Feature activation-4.33e+00

Pos logit contributions
owed+0.225
oke+0.220
ird+0.218
m+0.217
ppy+0.217
Neg logit contributions
 sight-0.213
 new-0.202
 sand-0.199
 purpose-0.197
 fun-0.196
Token
Token ID220
Feature activation-2.20e+00

Pos logit contributions
owed+0.088
m+0.088
ppy+0.085
itive+0.084
ird+0.083
Neg logit contributions
 sight-0.070
 purpose-0.066
 fun-0.061
 common-0.061
 new-0.060
Token
Token ID198
Feature activation-2.37e+00

Pos logit contributions
owed+0.042
m+0.041
bee+0.041
ppy+0.040
itive+0.038
Neg logit contributions
 purpose-0.037
 sight-0.036
 common-0.034
 fun-0.033
 new-0.033
Token
Token ID198
Feature activation-3.18e+00

Pos logit contributions
owed+0.046
m+0.044
ppy+0.042
bee+0.042
es+0.042
Neg logit contributions
 sight-0.041
 purpose-0.041
 common-0.038
 fun-0.037
 new-0.036
TokenThe
Token ID464
Feature activation+1.55e+00

Pos logit contributions
m+0.056
owed+0.056
oke+0.052
es+0.052
re+0.050
Neg logit contributions
 purpose-0.061
 sight-0.060
 common-0.057
 harmony-0.055
 fun-0.054
Token clown
Token ID25573
Feature activation-1.62e+01

Pos logit contributions
 purpose+0.028
 sight+0.026
 harmony+0.025
 rightful+0.025
 accomplishment+0.025
Neg logit contributions
m-0.031
owed-0.027
es-0.026
bee-0.026
oke-0.025
Token said
Token ID531
Feature activation-1.69e+00

Pos logit contributions
owed+0.280
ppy+0.270
m+0.262
itive+0.257
owing+0.252
Neg logit contributions
 fun-0.274
 sight-0.267
 new-0.264
 sand-0.253
 big-0.249
Token,
Token ID11
Feature activation-2.26e+00

Pos logit contributions
m+0.028
owed+0.026
ppy+0.024
es+0.024
op+0.024
Neg logit contributions
 purpose-0.035
 sight-0.034
 common-0.032
 our-0.031
 victory-0.031
Token "
Token ID366
Feature activation-2.80e+00

Pos logit contributions
owed+0.053
m+0.053
re+0.050
ird+0.049
ll+0.049
Neg logit contributions
 sight-0.030
 purpose-0.028
 new-0.027
 fun-0.026
 our-0.026
TokenHi
Token ID17250
Feature activation-1.89e+00

Pos logit contributions
owed+0.066
oke+0.059
op+0.059
es+0.058
ird+0.057
Neg logit contributions
 sight-0.040
 purpose-0.038
 harmony-0.038
 freedom-0.035
 beauty-0.035
Token Mia
Token ID32189
Feature activation-1.11e+01

Pos logit contributions
m+0.032
owed+0.032
es+0.029
op+0.029
re+0.029
Neg logit contributions
 sight-0.037
 purpose-0.036
 common-0.033
 our-0.033
 awesome-0.033
Token day
Token ID1110
Feature activation-2.17e+00

Pos logit contributions
 purpose+0.063
 common+0.061
 sight+0.058
 rightful+0.056
 harmony+0.056
Neg logit contributions
m-0.114
oke-0.099
owed-0.098
op-0.098
re-0.097
Token.
Token ID13
Feature activation-4.21e+00

Pos logit contributions
m+0.032
owed+0.030
es+0.028
op+0.026
re+0.026
Neg logit contributions
 purpose-0.049
 sight-0.048
 common-0.045
 harmony-0.044
 accomplishment-0.044
Token
Token ID198
Feature activation-1.91e+00

Pos logit contributions
owed+0.093
m+0.093
bee+0.087
ppy+0.087
es+0.084
Neg logit contributions
 sight-0.064
 purpose-0.062
 common-0.055
 harmony-0.054
 fun-0.054
Token
Token ID198
Feature activation-2.70e+00

Pos logit contributions
owed+0.033
m+0.032
es+0.029
ppy+0.028
re+0.028
Neg logit contributions
 purpose-0.039
 sight-0.038
 common-0.037
 harmony-0.035
 awesome-0.034
TokenThe
Token ID464
Feature activation+1.91e+00

Pos logit contributions
owed+0.047
m+0.046
es+0.042
ppy+0.041
re+0.040
Neg logit contributions
 purpose-0.054
 sight-0.052
 common-0.050
 harmony-0.049
 beauty-0.047
Token firefighter
Token ID43105
Feature activation-1.62e+01

Pos logit contributions
 purpose+0.032
 sight+0.031
 common+0.030
 harmony+0.028
 rightful+0.028
Neg logit contributions
m-0.039
owed-0.035
es-0.034
oke-0.034
op-0.032
Token grabbed
Token ID13176
Feature activation+3.30e+00

Pos logit contributions
m+0.246
owed+0.238
bee+0.230
ops+0.230
op+0.228
Neg logit contributions
 sight-0.293
 our-0.264
 new-0.261
 fun-0.260
 old-0.259
Token the
Token ID262
Feature activation+5.16e+00

Pos logit contributions
 rightful+0.048
 purpose+0.048
 sight+0.045
 common+0.045
 harmony+0.044
Neg logit contributions
m-0.072
es-0.070
owed-0.069
ppy-0.064
op-0.064
Token hose
Token ID31489
Feature activation-8.73e+00

Pos logit contributions
 purpose+0.072
 common+0.067
 rightful+0.066
 sight+0.066
 harmony+0.063
Neg logit contributions
m-0.120
owed-0.107
op-0.107
es-0.103
oke-0.103
Token and
Token ID290
Feature activation+1.25e+00

Pos logit contributions
m+0.115
owed+0.109
es+0.101
op+0.095
re+0.094
Neg logit contributions
 purpose-0.209
 sight-0.206
 common-0.194
 harmony-0.191
 accomplishment-0.190
Token turned
Token ID2900
Feature activation+2.97e+00

Pos logit contributions
 sight+0.026
 purpose+0.026
 common+0.024
 accomplishment+0.024
 harmony+0.024
Neg logit contributions
m-0.020
owed-0.019
es-0.017
oke-0.017
re-0.017
Token sunny
Token ID27737
Feature activation-2.46e+00

Pos logit contributions
m+0.073
owed+0.070
es+0.065
op+0.064
re+0.064
Neg logit contributions
 purpose-0.050
 sight-0.049
 common-0.044
 harmony-0.044
 accomplishment-0.043
Token day
Token ID1110
Feature activation-1.87e+00

Pos logit contributions
owed+0.062
m+0.060
es+0.059
ppy+0.058
oke+0.057
Neg logit contributions
 sight-0.032
 purpose-0.031
 common-0.026
 harmony-0.025
 freedom-0.024
Token,
Token ID11
Feature activation-1.68e+00

Pos logit contributions
m+0.033
owed+0.033
oke+0.031
es+0.031
ppy+0.030
Neg logit contributions
 sight-0.035
 purpose-0.035
 common-0.033
 our-0.032
 fun-0.031
Token a
Token ID257
Feature activation-1.30e+00

Pos logit contributions
m+0.042
owed+0.041
es+0.039
op+0.038
re+0.038
Neg logit contributions
 purpose-0.020
 sight-0.020
 common-0.017
 harmony-0.017
 accomplishment-0.017
Token little
Token ID1310
Feature activation-1.78e+00

Pos logit contributions
m+0.033
owed+0.033
es+0.031
re+0.030
ppy+0.030
Neg logit contributions
 purpose-0.015
 sight-0.015
 common-0.013
 harmony-0.012
 accomplishment-0.012
Token crab
Token ID32202
Feature activation-1.62e+01

Pos logit contributions
owed+0.035
es+0.032
m+0.032
re+0.030
ppy+0.030
Neg logit contributions
 sight-0.033
 purpose-0.033
 common-0.029
 harmony-0.029
 sand-0.029
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
owed+0.273
oke+0.267
ppy+0.264
m+0.262
ird+0.250
Neg logit contributions
 sight-0.296
 our-0.274
 common-0.273
 sand-0.272
 big-0.271
Token to
Token ID284
Feature activation+2.88e+00

Pos logit contributions
m+0.020
owed+0.020
es+0.019
re+0.018
op+0.018
Neg logit contributions
 purpose-0.017
 sight-0.017
 common-0.015
 harmony-0.015
 accomplishment-0.015
Token the
Token ID262
Feature activation+1.66e+00

Pos logit contributions
 purpose+0.048
 accomplishment+0.047
 sight+0.043
 awesome+0.043
 rightful+0.043
Neg logit contributions
m-0.047
owed-0.043
o-0.043
ie-0.041
ppy-0.041
Token park
Token ID3952
Feature activation-7.33e+00

Pos logit contributions
 accomplishment+0.035
 common+0.030
 concentration+0.029
 harmony+0.029
 satisfaction+0.029
Neg logit contributions
m-0.028
ll-0.023
'-0.023
o-0.023
y-0.021
Token.
Token ID13
Feature activation-4.39e+00

Pos logit contributions
m+0.138
owed+0.134
es+0.128
ppy+0.127
oke+0.126
Neg logit contributions
 sight-0.133
 purpose-0.131
 common-0.120
 harmony-0.117
 freedom-0.116
Token join
Token ID4654
Feature activation+1.74e+00

Pos logit contributions
 sight+0.058
 purpose+0.058
 common+0.051
 harmony+0.051
 accomplishment+0.051
Neg logit contributions
m-0.076
owed-0.073
es-0.069
op-0.068
re-0.067
Token.
Token ID13
Feature activation-2.12e+00

Pos logit contributions
 purpose+0.029
 sight+0.028
 rightful+0.025
 common+0.025
 accomplishment+0.025
Neg logit contributions
m-0.036
owed-0.035
es-0.034
re-0.032
ppy-0.032
Token
Token ID198
Feature activation-6.18e-01

Pos logit contributions
m+0.043
owed+0.041
es+0.039
ppy+0.038
op+0.037
Neg logit contributions
 purpose-0.036
 sight-0.036
 common-0.032
 harmony-0.032
 accomplishment-0.031
Token
Token ID198
Feature activation-9.89e-01

Pos logit contributions
owed+0.011
m+0.011
es+0.010
ppy+0.010
bee+0.010
Neg logit contributions
 purpose-0.012
 sight-0.012
 common-0.011
 harmony-0.011
 accomplishment-0.010
TokenThe
Token ID464
Feature activation+3.14e+00

Pos logit contributions
m+0.018
owed+0.017
es+0.016
ppy+0.015
op+0.015
Neg logit contributions
 purpose-0.019
 sight-0.019
 common-0.018
 harmony-0.017
 accomplishment-0.017
Token coach
Token ID3985
Feature activation-1.62e+01

Pos logit contributions
 purpose+0.060
 sight+0.056
 outer+0.054
 common+0.053
 former+0.051
Neg logit contributions
m-0.054
'-0.051
oke-0.048
re-0.046
es-0.045
Token noticed
Token ID6810
Feature activation+1.05e+00

Pos logit contributions
m+0.215
owed+0.210
op+0.192
ppy+0.190
es+0.185
Neg logit contributions
 sight-0.330
 fun-0.317
 new-0.312
 joy-0.311
 purpose-0.308
Token how
Token ID703
Feature activation+2.02e+00

Pos logit contributions
 purpose+0.017
 sight+0.016
 rightful+0.015
 common+0.015
 harmony+0.015
Neg logit contributions
m-0.023
owed-0.022
es-0.021
oke-0.020
re-0.020
Token fit
Token ID4197
Feature activation+1.91e-01

Pos logit contributions
 purpose+0.032
 sight+0.030
 common+0.027
 harmony+0.027
 rightful+0.026
Neg logit contributions
m-0.045
owed-0.042
oke-0.040
es-0.040
re-0.040
Token Jim
Token ID5395
Feature activation-1.05e+01

Pos logit contributions
 purpose+0.003
 sight+0.003
 rightful+0.003
 common+0.002
 harmony+0.002
Neg logit contributions
m-0.004
owed-0.004
es-0.004
re-0.004
ppy-0.004
Token was
Token ID373
Feature activation+2.99e+00

Pos logit contributions
m+0.132
owed+0.123
es+0.110
oke+0.110
op+0.109
Neg logit contributions
 purpose-0.251
 sight-0.250
 common-0.233
 harmony-0.232
 accomplishment-0.231
Token day
Token ID1110
Feature activation-6.79e-01

Pos logit contributions
m+0.068
owed+0.064
op+0.059
re+0.059
es+0.059
Neg logit contributions
 purpose-0.051
 sight-0.050
 common-0.046
 harmony-0.045
 accomplishment-0.045
Token,
Token ID11
Feature activation-1.53e+00

Pos logit contributions
m+0.012
owed+0.012
es+0.011
oke+0.011
ppy+0.011
Neg logit contributions
 purpose-0.013
 sight-0.013
 common-0.012
 our-0.012
 fun-0.011
Token a
Token ID257
Feature activation-8.26e-01

Pos logit contributions
m+0.039
owed+0.038
es+0.037
op+0.036
ppy+0.035
Neg logit contributions
 purpose-0.018
 sight-0.018
 common-0.015
 harmony-0.015
 accomplishment-0.015
Token quiet
Token ID5897
Feature activation-5.76e-01

Pos logit contributions
owed+0.020
m+0.020
es+0.019
re+0.018
ppy+0.018
Neg logit contributions
 purpose-0.011
 sight-0.011
 common-0.010
 harmony-0.009
 fun-0.009
Token all
Token ID477
Feature activation-2.76e+00

Pos logit contributions
m+0.012
owed+0.011
es+0.011
re+0.010
ppy+0.010
Neg logit contributions
 purpose-0.010
 sight-0.009
 common-0.009
 harmony-0.008
 accomplishment-0.008
Tokenigator
Token ID23823
Feature activation-1.61e+01

Pos logit contributions
m+0.064
owed+0.063
es+0.061
ppy+0.058
re+0.058
Neg logit contributions
 sight-0.038
 purpose-0.038
 harmony-0.033
 common-0.033
 freedom-0.032
Token went
Token ID1816
Feature activation-3.48e-01

Pos logit contributions
m+0.264
owed+0.253
ppy+0.251
oke+0.248
ll+0.237
Neg logit contributions
 sight-0.330
 purpose-0.290
 freedom-0.290
 harmony-0.289
 sand-0.288
Token for
Token ID329
Feature activation+3.48e+00

Pos logit contributions
m+0.007
owed+0.007
es+0.006
op+0.006
ppy+0.006
Neg logit contributions
 purpose-0.006
 sight-0.006
 common-0.005
 harmony-0.005
 accomplishment-0.005
Token a
Token ID257
Feature activation+1.21e+00

Pos logit contributions
 rightful+0.058
 purpose+0.054
 destination+0.052
 sight+0.051
 accomplishment+0.051
Neg logit contributions
es-0.059
m-0.058
owed-0.056
bee-0.055
ppy-0.054
Token walk
Token ID2513
Feature activation-3.57e+00

Pos logit contributions
 purpose+0.022
 sight+0.022
 common+0.020
 harmony+0.019
 accomplishment+0.019
Neg logit contributions
m-0.023
owed-0.022
es-0.021
oke-0.020
re-0.020
Token.
Token ID13
Feature activation-3.93e+00

Pos logit contributions
m+0.061
owed+0.059
oke+0.055
es+0.055
ppy+0.053
Neg logit contributions
 sight-0.071
 purpose-0.071
 common-0.065
 harmony-0.065
 freedom-0.064
Token day
Token ID1110
Feature activation-5.29e-01

Pos logit contributions
m+0.057
owed+0.054
re+0.050
op+0.050
es+0.050
Neg logit contributions
 purpose-0.041
 sight-0.039
 common-0.036
 harmony-0.035
 accomplishment-0.035
Token,
Token ID11
Feature activation-9.33e-01

Pos logit contributions
m+0.009
owed+0.009
oke+0.008
es+0.008
ppy+0.008
Neg logit contributions
 sight-0.010
 purpose-0.010
 common-0.010
 fun-0.009
 sand-0.009
Token Will
Token ID2561
Feature activation-8.48e+00

Pos logit contributions
m+0.024
owed+0.024
es+0.023
op+0.022
re+0.022
Neg logit contributions
 purpose-0.010
 sight-0.010
 common-0.009
 harmony-0.009
 accomplishment-0.008
Tokeny
Token ID88
Feature activation-8.47e+00

Pos logit contributions
m+0.125
owed+0.119
oke+0.107
es+0.107
op+0.106
Neg logit contributions
 purpose-0.188
 sight-0.187
 common-0.176
 harmony-0.171
 our-0.168
Token the
Token ID262
Feature activation+1.82e+00

Pos logit contributions
m+0.128
owed+0.123
es+0.110
oke+0.110
re+0.106
Neg logit contributions
 purpose-0.188
 sight-0.187
 common-0.171
 harmony-0.170
 beauty-0.165
Token firefighter
Token ID43105
Feature activation-1.61e+01

Pos logit contributions
 purpose+0.019
 sight+0.018
 common+0.016
 harmony+0.015
 accomplishment+0.015
Neg logit contributions
m-0.049
owed-0.047
es-0.045
op-0.044
ppy-0.044
Token was
Token ID373
Feature activation+3.30e+00

Pos logit contributions
m+0.232
owed+0.221
ppy+0.204
es+0.204
oke+0.202
Neg logit contributions
 sight-0.345
 purpose-0.331
 common-0.312
 harmony-0.309
 beauty-0.305
Token feeling
Token ID4203
Feature activation+2.86e+00

Pos logit contributions
 purpose+0.067
 sight+0.064
 common+0.063
 rightful+0.060
 harmony+0.060
Neg logit contributions
m-0.059
owed-0.052
es-0.051
op-0.051
re-0.050
Token very
Token ID845
Feature activation+4.95e+00

Pos logit contributions
 purpose+0.058
 sight+0.054
 common+0.054
 rightful+0.049
 accomplishment+0.048
Neg logit contributions
m-0.053
es-0.044
owed-0.044
op-0.043
ppy-0.042
Token confused
Token ID10416
Feature activation+1.69e-01

Pos logit contributions
 purpose+0.088
 sight+0.082
 harmony+0.077
 common+0.076
 rightful+0.071
Neg logit contributions
m-0.101
owed-0.093
re-0.088
op-0.088
oke-0.086
Token.
Token ID13
Feature activation-4.10e+00

Pos logit contributions
 purpose+0.004
 sight+0.004
 common+0.003
 harmony+0.003
 accomplishment+0.003
Neg logit contributions
m-0.003
owed-0.002
es-0.002
oke-0.002
op-0.002
Token stop
Token ID2245
Feature activation+1.51e+00

Pos logit contributions
 purpose+0.041
 sight+0.040
 common+0.036
 harmony+0.036
 accomplishment+0.035
Neg logit contributions
m-0.053
owed-0.051
es-0.049
oke-0.047
op-0.047
Token it
Token ID340
Feature activation-4.80e+00

Pos logit contributions
 purpose+0.033
 sight+0.032
 common+0.031
 rightful+0.031
 harmony+0.030
Neg logit contributions
m-0.023
owed-0.021
es-0.021
op-0.020
oke-0.020
Token!"
Token ID2474
Feature activation-3.14e+00

Pos logit contributions
owed+0.090
m+0.090
es+0.083
ppy+0.082
bee+0.081
Neg logit contributions
 purpose-0.085
 sight-0.081
 our-0.076
 awesome-0.074
 fun-0.073
Token the
Token ID262
Feature activation+2.54e+00

Pos logit contributions
m+0.064
es+0.059
owed+0.059
oke+0.057
re+0.057
Neg logit contributions
 purpose-0.051
 sight-0.046
 rightful-0.046
 accomplishment-0.044
 beauty-0.044
Token scare
Token ID19437
Feature activation-2.18e+00

Pos logit contributions
 purpose+0.041
 sight+0.041
 common+0.038
 harmony+0.038
 accomplishment+0.038
Neg logit contributions
m-0.054
owed-0.049
es-0.048
op-0.047
re-0.047
Tokencrow
Token ID47114
Feature activation-1.61e+01

Pos logit contributions
owed+0.041
m+0.040
ll+0.037
ppy+0.036
es+0.036
Neg logit contributions
 sight-0.039
 purpose-0.039
 common-0.036
 sand-0.034
 harmony-0.033
Token said
Token ID531
Feature activation-2.01e+00

Pos logit contributions
m+0.226
owed+0.213
ppy+0.203
ll+0.197
itty+0.194
Neg logit contributions
 sight-0.329
 purpose-0.317
 fun-0.308
 common-0.303
 loud-0.296
Token.
Token ID13
Feature activation-3.59e+00

Pos logit contributions
m+0.033
owed+0.031
re+0.029
es+0.029
ll+0.029
Neg logit contributions
 purpose-0.040
 sight-0.040
 accomplishment-0.037
 common-0.036
 harmony-0.036
Token "
Token ID366
Feature activation-2.59e+00

Pos logit contributions
m+0.065
owed+0.063
es+0.059
ppy+0.057
op+0.056
Neg logit contributions
 purpose-0.069
 sight-0.068
 common-0.062
 harmony-0.061
 accomplishment-0.060
TokenYou
Token ID1639
Feature activation-2.19e+00

Pos logit contributions
m+0.045
owed+0.045
es+0.042
oke+0.040
ppy+0.040
Neg logit contributions
 purpose-0.050
 sight-0.049
 common-0.046
 harmony-0.045
 freedom-0.044
Token are
Token ID389
Feature activation+2.84e+00

Pos logit contributions
m+0.037
owed+0.036
es+0.033
bee+0.032
oke+0.032
Neg logit contributions
 purpose-0.043
 sight-0.042
 common-0.040
 our-0.039
 victory-0.039
Token it
Token ID340
Feature activation-3.45e+00

Pos logit contributions
 purpose+0.051
 sight+0.050
 common+0.048
 harmony+0.046
 rightful+0.046
Neg logit contributions
m-0.074
es-0.071
owed-0.070
ppy-0.066
op-0.065
Token.
Token ID13
Feature activation-1.89e+00

Pos logit contributions
m+0.051
owed+0.050
es+0.047
ppy+0.044
op+0.042
Neg logit contributions
 purpose-0.077
 sight-0.075
 common-0.072
 harmony-0.070
 rightful-0.069
Token
Token ID198
Feature activation-1.44e+00

Pos logit contributions
m+0.023
owed+0.021
es+0.021
op+0.019
oke+0.018
Neg logit contributions
 purpose-0.047
 sight-0.046
 accomplishment-0.044
 common-0.043
 harmony-0.043
Token
Token ID198
Feature activation-1.67e+00

Pos logit contributions
m+0.024
owed+0.024
es+0.022
ppy+0.021
re+0.021
Neg logit contributions
 purpose-0.029
 sight-0.028
 common-0.027
 harmony-0.026
 accomplishment-0.026
TokenThe
Token ID464
Feature activation+2.49e+00

Pos logit contributions
m+0.032
owed+0.030
es+0.029
op+0.028
ppy+0.028
Neg logit contributions
 purpose-0.031
 sight-0.031
 common-0.028
 harmony-0.027
 accomplishment-0.027
Token dentist
Token ID38408
Feature activation-1.61e+01

Pos logit contributions
 purpose+0.040
 sight+0.040
 common+0.035
 accomplishment+0.035
 harmony+0.035
Neg logit contributions
m-0.052
owed-0.052
es-0.049
re-0.047
op-0.047
Token said
Token ID531
Feature activation-1.42e+00

Pos logit contributions
m+0.217
owed+0.197
re+0.192
op+0.192
ppy+0.191
Neg logit contributions
 sight-0.354
 purpose-0.347
 accomplishment-0.330
 fun-0.329
 common-0.326
Token Lily
Token ID20037
Feature activation-1.22e+01

Pos logit contributions
m+0.028
owed+0.027
es+0.026
oke+0.025
op+0.024
Neg logit contributions
 purpose-0.025
 sight-0.025
 common-0.023
 harmony-0.023
 rightful-0.023
Token and
Token ID290
Feature activation+2.21e-01

Pos logit contributions
m+0.171
oke+0.155
owed+0.148
op+0.143
re+0.140
Neg logit contributions
 purpose-0.257
 sight-0.251
 common-0.242
 fun-0.241
 beauty-0.240
Token the
Token ID262
Feature activation+4.59e+00

Pos logit contributions
 purpose+0.004
 sight+0.004
 common+0.004
 harmony+0.004
 accomplishment+0.004
Neg logit contributions
m-0.004
owed-0.004
es-0.004
ppy-0.003
oke-0.003
Token bear
Token ID6842
Feature activation-1.28e+01

Pos logit contributions
 purpose+0.096
 sight+0.093
 common+0.090
 harmony+0.088
 accomplishment+0.086
Neg logit contributions
m-0.076
owed-0.067
es-0.064
bee-0.062
oke-0.061
Token humble
Token ID19695
Feature activation+3.01e+00

Pos logit contributions
 purpose+0.100
 sight+0.097
 rightful+0.081
 harmony+0.080
 accomplishment+0.080
Neg logit contributions
m-0.140
es-0.133
owed-0.129
oke-0.128
op-0.124
Token and
Token ID290
Feature activation+6.31e+00

Pos logit contributions
 purpose+0.067
 sight+0.066
 common+0.062
 harmony+0.061
 accomplishment+0.061
Neg logit contributions
m-0.045
owed-0.042
es-0.040
re-0.037
op-0.037
Token stop
Token ID2245
Feature activation+5.51e+00

Pos logit contributions
 purpose+0.098
 sight+0.095
 common+0.080
 comparison+0.077
 accomplishment+0.074
Neg logit contributions
m-0.131
owed-0.129
es-0.123
oke-0.115
ll-0.114
Token poll
Token ID3278
Feature activation+6.02e+00

Pos logit contributions
 purpose+0.112
 sight+0.110
 common+0.103
 accomplishment+0.102
 freedom+0.100
Neg logit contributions
m-0.092
owed-0.084
es-0.083
re-0.078
oke-0.076
Tokenuting
Token ID15129
Feature activation+3.95e+00

Pos logit contributions
 purpose+0.097
 sight+0.097
 accomplishment+0.088
 common+0.086
 harmony+0.086
Neg logit contributions
m-0.125
owed-0.123
es-0.118
op-0.114
ppy-0.112
Token the
Token ID262
Feature activation+7.26e+00

Pos logit contributions
 purpose+0.069
 sight+0.068
 accomplishment+0.065
 rightful+0.064
 awesome+0.063
Neg logit contributions
m-0.071
es-0.069
owed-0.065
y-0.061
re-0.061
Token air
Token ID1633
Feature activation-1.33e+00

Pos logit contributions
 purpose+0.120
 sight+0.114
 accomplishment+0.109
 common+0.108
 rightful+0.105
Neg logit contributions
m-0.147
oke-0.131
es-0.127
owed-0.127
re-0.126
Token.
Token ID13
Feature activation-1.86e+00

Pos logit contributions
m+0.018
owed+0.017
es+0.016
re+0.015
op+0.015
Neg logit contributions
 purpose-0.031
 sight-0.031
 common-0.029
 accomplishment-0.028
 harmony-0.028
Token And
Token ID843
Feature activation+2.22e+00

Pos logit contributions
m+0.034
owed+0.033
es+0.031
ppy+0.030
bee+0.030
Neg logit contributions
 purpose-0.035
 sight-0.035
 common-0.032
 harmony-0.032
 accomplishment-0.031
Token they
Token ID484
Feature activation-1.99e+00

Pos logit contributions
 purpose+0.043
 sight+0.042
 common+0.038
 rightful+0.037
 harmony+0.037
Neg logit contributions
m-0.040
owed-0.038
es-0.038
oke-0.035
re-0.035
Token lived
Token ID5615
Feature activation+3.54e+00

Pos logit contributions
m+0.038
owed+0.036
es+0.034
op+0.034
oke+0.034
Neg logit contributions
 sight-0.034
 purpose-0.034
 common-0.032
 harmony-0.032
 accomplishment-0.032
Token dad
Token ID9955
Feature activation-8.78e+00

Pos logit contributions
 purpose+0.025
 sight+0.024
 harmony+0.021
 common+0.021
 accomplishment+0.021
Neg logit contributions
m-0.038
owed-0.036
es-0.035
ppy-0.034
re-0.034
Token when
Token ID618
Feature activation-4.64e-01

Pos logit contributions
m+0.154
owed+0.151
es+0.140
op+0.139
bee+0.136
Neg logit contributions
 purpose-0.161
 sight-0.157
 common-0.149
 our-0.145
 harmony-0.144
Token they
Token ID484
Feature activation-3.50e+00

Pos logit contributions
m+0.006
es+0.006
owed+0.005
re+0.005
op+0.005
Neg logit contributions
 purpose-0.011
 sight-0.011
 rightful-0.010
 accomplishment-0.010
 freedom-0.010
Token would
Token ID561
Feature activation+2.99e+00

Pos logit contributions
owed+0.052
es+0.049
op+0.049
m+0.049
bee+0.049
Neg logit contributions
 sight-0.070
 common-0.065
 fun-0.064
 joy-0.063
 sand-0.062
Token get
Token ID651
Feature activation+7.25e+00

Pos logit contributions
 purpose+0.053
 sight+0.053
 harmony+0.047
 accomplishment+0.047
 common+0.046
Neg logit contributions
m-0.059
owed-0.055
re-0.051
ppy-0.050
es-0.050
Token to
Token ID284
Feature activation+6.44e+00

Pos logit contributions
 purpose+0.142
 sight+0.125
 accomplishment+0.124
 common+0.122
 harmony+0.122
Neg logit contributions
m-0.150
re-0.125
owed-0.124
es-0.116
o-0.115
Token camp
Token ID1413
Feature activation-4.58e+00

Pos logit contributions
 purpose+0.096
 sight+0.093
 accomplishment+0.091
 holes+0.085
 comparison+0.084
Neg logit contributions
m-0.113
ird-0.107
'-0.102
re-0.100
owed-0.099
Token.
Token ID13
Feature activation-2.61e+00

Pos logit contributions
m+0.068
owed+0.064
es+0.060
op+0.057
ppy+0.056
Neg logit contributions
 purpose-0.103
 sight-0.101
 common-0.094
 harmony-0.093
 accomplishment-0.093
Token Mom
Token ID11254
Feature activation-1.14e+01

Pos logit contributions
m+0.047
owed+0.044
es+0.042
op+0.040
re+0.040
Neg logit contributions
 purpose-0.050
 sight-0.050
 accomplishment-0.046
 freedom-0.046
 common-0.045
Token and
Token ID290
Feature activation+2.51e-01

Pos logit contributions
owed+0.182
m+0.177
itive+0.173
op+0.169
re+0.168
Neg logit contributions
 fun-0.190
 sight-0.186
 new-0.184
 old-0.178
 joy-0.174
Token dad
Token ID9955
Feature activation-1.06e+01

Pos logit contributions
 purpose+0.004
 sight+0.004
 harmony+0.004
 freedom+0.004
 accomplishment+0.004
Neg logit contributions
m-0.005
ppy-0.005
owed-0.005
op-0.004
re-0.004
Token,
Token ID11
Feature activation+2.26e+00

Pos logit contributions
m+0.084
owed+0.081
es+0.075
oke+0.072
op+0.072
Neg logit contributions
 purpose-0.104
 sight-0.102
 common-0.094
 harmony-0.093
 accomplishment-0.093
Token and
Token ID290
Feature activation+3.96e+00

Pos logit contributions
 purpose+0.054
 sight+0.053
 common+0.051
 rightful+0.050
 harmony+0.049
Neg logit contributions
m-0.030
owed-0.028
es-0.027
bee-0.025
ppy-0.024
Token then
Token ID788
Feature activation+1.64e+00

Pos logit contributions
 purpose+0.076
 sight+0.074
 common+0.069
 rightful+0.066
 harmony+0.066
Neg logit contributions
m-0.072
owed-0.068
es-0.068
ppy-0.062
re-0.062
Token scoop
Token ID23529
Feature activation+2.72e+00

Pos logit contributions
 purpose+0.036
 sight+0.035
 common+0.033
 harmony+0.033
 rightful+0.032
Neg logit contributions
m-0.025
owed-0.024
es-0.023
op-0.021
re-0.021
Tokened
Token ID276
Feature activation+4.10e+00

Pos logit contributions
 purpose+0.067
 sight+0.066
 common+0.062
 harmony+0.062
 accomplishment+0.061
Neg logit contributions
m-0.034
owed-0.032
es-0.030
op-0.027
re-0.027
Token the
Token ID262
Feature activation+5.62e+00

Pos logit contributions
 purpose+0.062
 common+0.060
 rightful+0.060
 sight+0.059
 harmony+0.057
Neg logit contributions
es-0.085
m-0.085
owed-0.080
re-0.075
y-0.074
Token sauce
Token ID10746
Feature activation-6.44e+00

Pos logit contributions
 purpose+0.089
 sight+0.086
 common+0.084
 harmony+0.083
 accomplishment+0.080
Neg logit contributions
m-0.120
owed-0.105
es-0.104
bee-0.102
re-0.102
Token on
Token ID319
Feature activation+5.94e+00

Pos logit contributions
m+0.114
owed+0.109
es+0.103
op+0.098
ppy+0.098
Neg logit contributions
 purpose-0.125
 sight-0.124
 common-0.114
 harmony-0.112
 accomplishment-0.111
Token top
Token ID1353
Feature activation-8.87e-01

Pos logit contributions
 sight+0.082
 rightful+0.081
 harmony+0.080
 common+0.080
 purpose+0.080
Neg logit contributions
m-0.124
es-0.121
owed-0.114
op-0.112
bee-0.110
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
m+0.013
owed+0.012
es+0.011
op+0.011
re+0.011
Neg logit contributions
 purpose-0.020
 sight-0.020
 common-0.019
 harmony-0.018
 accomplishment-0.018
Token Finally
Token ID9461
Feature activation+6.67e-01

Pos logit contributions
m+0.017
owed+0.017
es+0.016
ppy+0.015
oke+0.015
Neg logit contributions
 purpose-0.020
 sight-0.020
 common-0.018
 harmony-0.018
 accomplishment-0.018
Token you
Token ID345
Feature activation-6.78e-01

Pos logit contributions
 purpose+0.053
 sight+0.052
 accomplishment+0.049
 rightful+0.048
 harmony+0.048
Neg logit contributions
m-0.052
owed-0.049
es-0.048
re-0.047
oke-0.045
Token done
Token ID1760
Feature activation+1.49e+00

Pos logit contributions
owed+0.014
m+0.014
es+0.014
op+0.013
ppy+0.013
Neg logit contributions
 purpose-0.011
 sight-0.010
 common-0.010
 awesome-0.009
 fun-0.009
Token?
Token ID30
Feature activation-7.29e-01

Pos logit contributions
 purpose+0.035
 sight+0.034
 common+0.032
 harmony+0.032
 accomplishment+0.031
Neg logit contributions
m-0.021
owed-0.020
es-0.018
op-0.017
ppy-0.017
Token You
Token ID921
Feature activation-7.56e-01

Pos logit contributions
owed+0.014
m+0.014
bee+0.013
ppy+0.012
ll+0.012
Neg logit contributions
 purpose-0.013
 sight-0.012
 common-0.012
 our-0.012
 harmony-0.011
Token are
Token ID389
Feature activation+4.51e+00

Pos logit contributions
m+0.012
owed+0.012
es+0.011
bee+0.011
oke+0.010
Neg logit contributions
 purpose-0.015
 sight-0.015
 common-0.015
 our-0.014
 fun-0.014
Token very
Token ID845
Feature activation+6.13e+00

Pos logit contributions
 purpose+0.079
 sight+0.078
 harmony+0.072
 accomplishment+0.071
 common+0.070
Neg logit contributions
m-0.093
es-0.081
oke-0.081
owed-0.080
ppy-0.080
Token naughty
Token ID45128
Feature activation+2.03e+00

Pos logit contributions
 sight+0.091
 purpose+0.090
 harmony+0.081
 accomplishment+0.076
 freedom+0.076
Neg logit contributions
m-0.142
owed-0.132
oke-0.129
es-0.127
op-0.126
Token.
Token ID13
Feature activation-2.70e-01

Pos logit contributions
 purpose+0.041
 sight+0.041
 common+0.037
 harmony+0.037
 awesome+0.036
Neg logit contributions
owed-0.034
m-0.033
es-0.030
ppy-0.029
op-0.029
Token You
Token ID921
Feature activation-1.18e+00

Pos logit contributions
m+0.005
owed+0.005
es+0.004
re+0.004
bee+0.004
Neg logit contributions
 purpose-0.005
 sight-0.005
 common-0.005
 harmony-0.005
 our-0.005
Token ruined
Token ID20484
Feature activation+6.50e+00

Pos logit contributions
owed+0.021
m+0.020
es+0.018
bee+0.018
oke+0.018
Neg logit contributions
 purpose-0.021
 common-0.021
 sight-0.021
 our-0.021
 fun-0.020
Token the
Token ID262
Feature activation+8.05e+00

Pos logit contributions
 accomplishment+0.081
 rightful+0.081
 harmony+0.080
 pleasure+0.073
 sight+0.073
Neg logit contributions
es-0.145
m-0.144
o-0.130
owed-0.128
bee-0.128
Token great
Token ID1049
Feature activation+4.35e+00

Pos logit contributions
 sight+0.061
 purpose+0.061
 rightful+0.054
 harmony+0.053
 freedom+0.053
Neg logit contributions
m-0.160
bee-0.148
op-0.148
ppy-0.148
'-0.146
Token job
Token ID1693
Feature activation+8.35e-01

Pos logit contributions
 purpose+0.031
 sight+0.031
 common+0.025
 harmony+0.025
 rightful+0.023
Neg logit contributions
m-0.126
owed-0.124
es-0.120
op-0.118
oke-0.116
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
 purpose+0.017
 sight+0.017
 common+0.016
 harmony+0.016
 accomplishment+0.016
Neg logit contributions
m-0.014
owed-0.013
es-0.012
op-0.012
re-0.012
Token Can
Token ID1680
Feature activation+1.94e+00

Pos logit contributions
m+0.019
owed+0.018
es+0.017
ppy+0.016
oke+0.016
Neg logit contributions
 purpose-0.023
 sight-0.023
 common-0.021
 accomplishment-0.021
 harmony-0.020
Token I
Token ID314
Feature activation+1.08e+00

Pos logit contributions
 purpose+0.031
 sight+0.030
 common+0.028
 our+0.028
 harmony+0.027
Neg logit contributions
m-0.040
owed-0.040
es-0.038
oke-0.037
ppy-0.036
Token take
Token ID1011
Feature activation+5.83e+00

Pos logit contributions
 purpose+0.017
 sight+0.017
 common+0.015
 harmony+0.015
 accomplishment+0.015
Neg logit contributions
m-0.023
owed-0.022
es-0.021
op-0.021
re-0.021
Token a
Token ID257
Feature activation+5.51e+00

Pos logit contributions
 rightful+0.103
 purpose+0.086
 comparison+0.086
 former+0.083
 sight+0.083
Neg logit contributions
es-0.119
m-0.115
re-0.104
owed-0.102
'-0.099
Token picture
Token ID4286
Feature activation-2.41e+00

Pos logit contributions
 purpose+0.075
 sight+0.072
 rightful+0.067
 harmony+0.066
 accomplishment+0.065
Neg logit contributions
m-0.128
owed-0.121
op-0.117
es-0.114
oke-0.114
Token of
Token ID286
Feature activation+6.70e+00

Pos logit contributions
m+0.040
owed+0.039
es+0.036
op+0.035
ppy+0.035
Neg logit contributions
 purpose-0.049
 sight-0.048
 common-0.045
 harmony-0.044
 accomplishment-0.044
Token you
Token ID345
Feature activation-2.20e+00

Pos logit contributions
 purpose+0.112
 rightful+0.112
 sight+0.106
 common+0.106
 accomplishment+0.105
Neg logit contributions
m-0.121
es-0.115
owed-0.103
'-0.103
re-0.101
Token and
Token ID290
Feature activation+3.28e+00

Pos logit contributions
m+0.034
owed+0.034
es+0.031
oke+0.030
re+0.030
Neg logit contributions
 purpose-0.046
 sight-0.046
 common-0.043
 our-0.042
 harmony-0.042
Token,
Token ID11
Feature activation-6.77e-01

Pos logit contributions
m+0.032
owed+0.031
es+0.029
op+0.028
re+0.027
Neg logit contributions
 purpose-0.041
 sight-0.040
 accomplishment-0.037
 common-0.037
 awesome-0.036
Token take
Token ID1011
Feature activation+3.43e+00

Pos logit contributions
m+0.012
owed+0.011
es+0.011
oke+0.010
ppy+0.010
Neg logit contributions
 purpose-0.014
 sight-0.013
 common-0.012
 harmony-0.012
 rightful-0.012
Token some
Token ID617
Feature activation+4.53e+00

Pos logit contributions
 purpose+0.047
 sight+0.047
 rightful+0.045
 harmony+0.042
 comparison+0.041
Neg logit contributions
m-0.078
es-0.077
owed-0.074
ppy-0.071
re-0.071
Token p
Token ID279
Feature activation-2.14e+00

Pos logit contributions
 purpose+0.097
 sight+0.094
 common+0.090
 harmony+0.090
 rightful+0.087
Neg logit contributions
m-0.075
owed-0.067
re-0.063
es-0.061
op-0.059
Tokenears
Token ID4127
Feature activation-7.44e+00

Pos logit contributions
m+0.022
owed+0.021
es+0.020
bee+0.018
ppy+0.018
Neg logit contributions
 purpose-0.056
 sight-0.055
 common-0.052
 accomplishment-0.051
 rightful-0.051
Token and
Token ID290
Feature activation+2.61e+00

Pos logit contributions
m+0.135
owed+0.131
es+0.124
oke+0.119
op+0.118
Neg logit contributions
 purpose-0.139
 sight-0.137
 common-0.125
 harmony-0.123
 accomplishment-0.122
Token offer
Token ID2897
Feature activation+2.58e+00

Pos logit contributions
 purpose+0.049
 sight+0.049
 common+0.045
 harmony+0.044
 accomplishment+0.044
Neg logit contributions
m-0.048
owed-0.046
es-0.043
re-0.042
ppy-0.041
Token them
Token ID606
Feature activation-2.51e+00

Pos logit contributions
 sight+0.035
 purpose+0.035
 accomplishment+0.031
 harmony+0.031
 rightful+0.030
Neg logit contributions
m-0.060
owed-0.057
es-0.057
re-0.054
ppy-0.053
Token to
Token ID284
Feature activation+3.35e+00

Pos logit contributions
m+0.053
owed+0.052
es+0.050
re+0.048
ppy+0.047
Neg logit contributions
 purpose-0.040
 sight-0.040
 harmony-0.036
 accomplishment-0.035
 common-0.035
Token your
Token ID534
Feature activation+3.25e+00

Pos logit contributions
 sight+0.050
 purpose+0.048
 harmony+0.044
 accomplishment+0.043
 common+0.042
Neg logit contributions
m-0.073
owed-0.069
es-0.065
ppy-0.064
re-0.064
Token friends
Token ID2460
Feature activation-5.58e+00

Pos logit contributions
 purpose+0.050
 sight+0.049
 common+0.048
 harmony+0.046
 outer+0.044
Neg logit contributions
m-0.067
owed-0.063
es-0.060
re-0.060
op-0.058
Token the
Token ID262
Feature activation+5.09e+00

Pos logit contributions
 purpose+0.024
 sight+0.023
 common+0.021
 harmony+0.021
 accomplishment+0.021
Neg logit contributions
m-0.025
owed-0.023
es-0.022
ppy-0.021
re-0.021
Token 3
Token ID513
Feature activation-1.41e+00

Pos logit contributions
 purpose+0.086
 sight+0.082
 common+0.080
 harmony+0.077
 accomplishment+0.075
Neg logit contributions
m-0.101
owed-0.091
bee-0.088
op-0.087
es-0.086
Token year
Token ID614
Feature activation+7.11e-01

Pos logit contributions
m+0.025
owed+0.024
ppy+0.023
es+0.023
ll+0.023
Neg logit contributions
 sight-0.026
 purpose-0.025
 accomplishment-0.024
 beauty-0.023
 fun-0.023
Token old
Token ID1468
Feature activation-1.38e+00

Pos logit contributions
 purpose+0.005
 sight+0.005
 common+0.004
 harmony+0.004
 accomplishment+0.004
Neg logit contributions
m-0.021
owed-0.021
es-0.020
oke-0.020
re-0.019
Token laid
Token ID8104
Feature activation+4.06e+00

Pos logit contributions
m+0.023
owed+0.023
ppy+0.021
es+0.021
op+0.021
Neg logit contributions
 sight-0.028
 purpose-0.027
 harmony-0.024
 rightful-0.024
 beauty-0.024
Token down
Token ID866
Feature activation+3.38e+00

Pos logit contributions
 purpose+0.076
 sight+0.072
 common+0.070
 harmony+0.068
 accomplishment+0.067
Neg logit contributions
m-0.078
owed-0.074
es-0.071
op-0.066
re-0.066
Token on
Token ID319
Feature activation+7.11e+00

Pos logit contributions
 purpose+0.070
 sight+0.066
 common+0.063
 harmony+0.062
 rightful+0.061
Neg logit contributions
m-0.058
owed-0.055
es-0.051
re-0.048
op-0.047
Token the
Token ID262
Feature activation+7.82e+00

Pos logit contributions
 rightful+0.094
 purpose+0.092
 awesome+0.091
 common+0.089
 ours+0.089
Neg logit contributions
m-0.143
es-0.134
re-0.129
owed-0.125
op-0.122
Token log
Token ID2604
Feature activation-7.13e+00

Pos logit contributions
 purpose+0.112
 sight+0.110
 common+0.108
 harmony+0.102
 accomplishment+0.099
Neg logit contributions
m-0.171
owed-0.158
bee-0.153
op-0.151
es-0.149
Token and
Token ID290
Feature activation+3.96e+00

Pos logit contributions
m+0.114
owed+0.108
es+0.102
oke+0.099
ppy+0.098
Neg logit contributions
 purpose-0.149
 sight-0.148
 common-0.134
 harmony-0.134
 accomplishment-0.133
Token relaxed
Token ID18397
Feature activation+3.59e+00

Pos logit contributions
 purpose+0.092
 sight+0.089
 common+0.085
 harmony+0.082
 rightful+0.082
Neg logit contributions
m-0.056
owed-0.055
es-0.051
re-0.047
ppy-0.047
Token everyone
Token ID2506
Feature activation-4.23e+00

Pos logit contributions
 purpose+0.075
 common+0.074
 rightful+0.074
 sight+0.068
 harmony+0.068
Neg logit contributions
owed-0.082
es-0.081
m-0.078
re-0.071
op-0.069
Token so
Token ID523
Feature activation+3.63e+00

Pos logit contributions
m+0.088
owed+0.086
es+0.082
ppy+0.078
op+0.078
Neg logit contributions
 purpose-0.069
 sight-0.068
 common-0.061
 harmony-0.060
 accomplishment-0.059
Token happy
Token ID3772
Feature activation+1.79e+00

Pos logit contributions
 purpose+0.056
 sight+0.053
 common+0.047
 harmony+0.046
 rightful+0.046
Neg logit contributions
m-0.083
owed-0.079
es-0.075
oke-0.074
re-0.074
Token with
Token ID351
Feature activation+4.87e+00

Pos logit contributions
 purpose+0.038
 sight+0.038
 common+0.034
 harmony+0.034
 accomplishment+0.034
Neg logit contributions
m-0.029
owed-0.028
es-0.025
ppy-0.025
op-0.025
Token her
Token ID607
Feature activation+2.57e+00

Pos logit contributions
 purpose+0.087
 rightful+0.083
 common+0.079
 sight+0.075
 harmony+0.075
Neg logit contributions
m-0.084
es-0.083
owed-0.080
bee-0.072
re-0.069
Token enthusiasm
Token ID17131
Feature activation-8.57e-01

Pos logit contributions
 purpose+0.036
 sight+0.034
 common+0.033
 harmony+0.032
 rightful+0.031
Neg logit contributions
m-0.059
owed-0.057
es-0.053
op-0.053
re-0.053
Token and
Token ID290
Feature activation+4.68e+00

Pos logit contributions
m+0.012
owed+0.012
es+0.011
re+0.010
op+0.010
Neg logit contributions
 purpose-0.020
 sight-0.019
 common-0.018
 harmony-0.018
 accomplishment-0.018
Token her
Token ID607
Feature activation+2.98e+00

Pos logit contributions
 purpose+0.091
 sight+0.086
 common+0.081
 rightful+0.076
 outer+0.075
Neg logit contributions
m-0.081
owed-0.080
es-0.078
re-0.073
ll-0.068
Token humble
Token ID19695
Feature activation+3.72e+00

Pos logit contributions
 purpose+0.038
 sight+0.035
 common+0.034
 harmony+0.032
 rightful+0.031
Neg logit contributions
m-0.071
owed-0.069
op-0.065
es-0.065
re-0.064
Token nature
Token ID3450
Feature activation-5.86e+00

Pos logit contributions
 purpose+0.052
 sight+0.051
 common+0.049
 harmony+0.047
 awesome+0.047
Neg logit contributions
m-0.082
owed-0.080
op-0.075
re-0.074
es-0.074
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
m+0.091
owed+0.087
es+0.081
ppy+0.078
oke+0.078
Neg logit contributions
 purpose-0.127
 sight-0.126
 common-0.116
 harmony-0.115
 accomplishment-0.114
Token but
Token ID475
Feature activation+2.17e+00

Pos logit contributions
 purpose+0.016
 sight+0.016
 common+0.015
 harmony+0.015
 accomplishment+0.014
Neg logit contributions
m-0.010
owed-0.009
es-0.009
re-0.008
op-0.008
Token very
Token ID845
Feature activation+3.70e+00

Pos logit contributions
 purpose+0.038
 sight+0.035
 common+0.033
 harmony+0.031
 accomplishment+0.030
Neg logit contributions
m-0.046
owed-0.044
es-0.041
re-0.039
ppy-0.038
Token happy
Token ID3772
Feature activation+7.88e-01

Pos logit contributions
 purpose+0.054
 sight+0.052
 common+0.047
 harmony+0.047
 freedom+0.044
Neg logit contributions
m-0.085
owed-0.083
es-0.077
re-0.075
op-0.075
Token.
Token ID13
Feature activation-2.18e+00

Pos logit contributions
 sight+0.016
 purpose+0.016
 common+0.015
 accomplishment+0.015
 harmony+0.015
Neg logit contributions
m-0.013
owed-0.012
es-0.012
ppy-0.011
oke-0.011
Token She
Token ID1375
Feature activation-3.61e+00

Pos logit contributions
m+0.034
owed+0.033
es+0.030
bee+0.029
ppy+0.029
Neg logit contributions
 purpose-0.048
 sight-0.047
 common-0.043
 harmony-0.043
 rightful-0.043
Token knew
Token ID2993
Feature activation+1.59e+00

Pos logit contributions
m+0.052
owed+0.049
es+0.045
oke+0.045
op+0.045
Neg logit contributions
 sight-0.080
 purpose-0.078
 accomplishment-0.074
 rightful-0.074
 common-0.074
Token that
Token ID326
Feature activation+1.50e+00

Pos logit contributions
 purpose+0.034
 sight+0.033
 common+0.032
 harmony+0.031
 rightful+0.029
Neg logit contributions
m-0.025
owed-0.024
es-0.023
re-0.021
ll-0.021
Token she
Token ID673
Feature activation-3.95e+00

Pos logit contributions
 purpose+0.030
 sight+0.030
 common+0.028
 accomplishment+0.027
 harmony+0.027
Neg logit contributions
m-0.026
owed-0.023
es-0.023
re-0.021
op-0.021
Token had
Token ID550
Feature activation+3.44e+00

Pos logit contributions
m+0.053
owed+0.052
op+0.050
oke+0.049
bee+0.046
Neg logit contributions
 sight-0.084
 rightful-0.081
 fun-0.080
 purpose-0.080
 beauty-0.080
Token made
Token ID925
Feature activation+3.48e+00

Pos logit contributions
 purpose+0.069
 sight+0.067
 common+0.064
 accomplishment+0.063
 harmony+0.062
Neg logit contributions
m-0.059
owed-0.053
es-0.052
re-0.052
ppy-0.049
Token a
Token ID257
Feature activation+5.47e+00

Pos logit contributions
 common+0.050
 purpose+0.048
 sight+0.047
 rightful+0.046
 harmony+0.045
Neg logit contributions
es-0.074
m-0.073
owed-0.068
re-0.063
ie-0.062
Token Lily
Token ID20037
Feature activation-1.02e+01

Pos logit contributions
m+0.017
owed+0.016
es+0.015
op+0.015
oke+0.015
Neg logit contributions
 purpose-0.020
 sight-0.020
 harmony-0.019
 common-0.019
 rightful-0.018
Token followed
Token ID3940
Feature activation+2.86e+00

Pos logit contributions
m+0.164
owed+0.154
oke+0.153
itive+0.149
bee+0.148
Neg logit contributions
 sight-0.178
 holes-0.168
 fun-0.168
 sand-0.168
 metal-0.166
Token him
Token ID683
Feature activation-4.10e+00

Pos logit contributions
 purpose+0.053
 sight+0.051
 rightful+0.050
 harmony+0.049
 common+0.049
Neg logit contributions
m-0.050
owed-0.047
es-0.047
op-0.046
ppy-0.045
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
m+0.070
owed+0.067
es+0.063
oke+0.061
ppy+0.060
Neg logit contributions
 purpose-0.081
 sight-0.081
 common-0.074
 accomplishment-0.073
 harmony-0.073
Token
Token ID198
Feature activation+1.21e-01

Pos logit contributions
m+0.020
owed+0.020
es+0.018
ppy+0.018
oke+0.017
Neg logit contributions
 purpose-0.026
 sight-0.025
 common-0.023
 harmony-0.023
 accomplishment-0.023
Token
Token ID198
Feature activation-6.19e-01

Pos logit contributions
 purpose+0.002
 sight+0.002
 common+0.002
 harmony+0.002
 accomplishment+0.002
Neg logit contributions
m-0.002
owed-0.002
es-0.002
ppy-0.002
bee-0.002
TokenInside
Token ID24441
Feature activation+8.05e-01

Pos logit contributions
m+0.010
owed+0.009
es+0.009
oke+0.008
re+0.008
Neg logit contributions
 purpose-0.013
 sight-0.013
 common-0.012
 harmony-0.012
 accomplishment-0.012
Token the
Token ID262
Feature activation+5.27e+00

Pos logit contributions
 purpose+0.012
 sight+0.011
 rightful+0.011
 common+0.011
 harmony+0.010
Neg logit contributions
m-0.019
es-0.017
owed-0.017
re-0.016
op-0.016
Token garage
Token ID15591
Feature activation-7.08e+00

Pos logit contributions
 purpose+0.101
 common+0.097
 harmony+0.095
 sight+0.094
 accomplishment+0.092
Neg logit contributions
m-0.097
op-0.082
'-0.079
re-0.078
es-0.077
Token,
Token ID11
Feature activation+1.12e+00

Pos logit contributions
m+0.103
owed+0.100
ppy+0.093
es+0.092
bee+0.090
Neg logit contributions
 sight-0.155
 purpose-0.155
 common-0.141
 accomplishment-0.140
 harmony-0.140
Token they
Token ID484
Feature activation-3.41e+00

Pos logit contributions
 purpose+0.022
 sight+0.022
 common+0.020
 harmony+0.020
 accomplishment+0.020
Neg logit contributions
m-0.020
owed-0.019
es-0.018
re-0.017
op-0.017
Tokenle
Token ID293
Feature activation-2.57e+00

Pos logit contributions
m+0.020
owed+0.018
op+0.018
re+0.017
oke+0.017
Neg logit contributions
 purpose-0.038
 sight-0.037
 common-0.036
 beauty-0.035
 freedom-0.035
Token.
Token ID13
Feature activation-2.40e+00

Pos logit contributions
m+0.040
owed+0.038
es+0.036
re+0.034
op+0.033
Neg logit contributions
 purpose-0.056
 sight-0.055
 common-0.052
 harmony-0.051
 rightful-0.050
Token Everyone
Token ID11075
Feature activation-8.04e+00

Pos logit contributions
m+0.039
owed+0.037
es+0.035
op+0.034
re+0.034
Neg logit contributions
 purpose-0.050
 sight-0.049
 common-0.047
 harmony-0.045
 accomplishment-0.045
Token laughed
Token ID13818
Feature activation+3.09e-02

Pos logit contributions
m+0.113
owed+0.112
ppy+0.106
op+0.099
ops+0.097
Neg logit contributions
 sight-0.172
 purpose-0.162
 harmony-0.162
 joy-0.161
 beauty-0.159
Token as
Token ID355
Feature activation+1.26e+00

Pos logit contributions
 purpose+0.001
 sight+0.001
 common+0.001
 rightful+0.001
 harmony+0.001
Neg logit contributions
m-0.000
owed-0.000
es-0.000
ppy-0.000
re-0.000
Token he
Token ID339
Feature activation-5.72e+00

Pos logit contributions
 purpose+0.024
 sight+0.023
 rightful+0.022
 common+0.021
 harmony+0.020
Neg logit contributions
m-0.022
es-0.021
owed-0.021
oke-0.019
op-0.019
Token happily
Token ID18177
Feature activation-4.52e+00

Pos logit contributions
m+0.087
owed+0.084
oke+0.077
op+0.073
es+0.073
Neg logit contributions
 sight-0.117
 purpose-0.113
 common-0.110
 fun-0.109
 harmony-0.108
Token ate
Token ID15063
Feature activation+4.75e+00

Pos logit contributions
m+0.069
owed+0.062
op+0.061
oke+0.061
es+0.061
Neg logit contributions
 sight-0.090
 freedom-0.085
 purpose-0.085
 joy-0.085
 fun-0.085
Token the
Token ID262
Feature activation+5.20e+00

Pos logit contributions
 purpose+0.071
 rightful+0.070
 common+0.068
 sight+0.067
 harmony+0.066
Neg logit contributions
m-0.100
es-0.094
owed-0.088
re-0.083
op-0.083
Token surprise
Token ID5975
Feature activation-2.99e+00

Pos logit contributions
 purpose+0.085
 sight+0.080
 common+0.077
 harmony+0.077
 rightful+0.076
Neg logit contributions
m-0.113
owed-0.099
re-0.096
op-0.096
es-0.095
Token.
Token ID13
Feature activation-2.10e+00

Pos logit contributions
m+0.045
owed+0.043
es+0.040
re+0.038
op+0.038
Neg logit contributions
 purpose-0.066
 sight-0.065
 common-0.061
 harmony-0.060
 rightful-0.059
Token hose
Token ID31489
Feature activation-6.91e+00

Pos logit contributions
 purpose+0.058
 sight+0.056
 rightful+0.053
 awesome+0.053
 harmony+0.053
Neg logit contributions
m-0.077
owed-0.075
op-0.070
bee-0.069
es-0.069
Token and
Token ID290
Feature activation+1.51e+00

Pos logit contributions
m+0.109
owed+0.105
es+0.099
op+0.094
ppy+0.092
Neg logit contributions
 purpose-0.148
 sight-0.143
 common-0.139
 awesome-0.135
 harmony-0.134
Token watered
Token ID48024
Feature activation+3.23e+00

Pos logit contributions
 purpose+0.034
 sight+0.033
 common+0.031
 harmony+0.030
 rightful+0.030
Neg logit contributions
m-0.022
owed-0.022
es-0.021
ppy-0.019
op-0.019
Token the
Token ID262
Feature activation+4.62e+00

Pos logit contributions
 rightful+0.051
 purpose+0.048
 common+0.047
 awesome+0.046
 sight+0.044
Neg logit contributions
m-0.066
es-0.064
owed-0.061
op-0.057
ie-0.056
Token flower
Token ID15061
Feature activation-8.60e+00

Pos logit contributions
 purpose+0.073
 sight+0.068
 common+0.066
 accomplishment+0.064
 harmony+0.064
Neg logit contributions
m-0.099
owed-0.093
es-0.089
op-0.086
bee-0.086
Token.
Token ID13
Feature activation-2.46e+00

Pos logit contributions
m+0.145
owed+0.137
es+0.128
oke+0.128
re+0.125
Neg logit contributions
 purpose-0.174
 sight-0.173
 harmony-0.157
 common-0.157
 accomplishment-0.156
Token She
Token ID1375
Feature activation-3.67e+00

Pos logit contributions
m+0.042
owed+0.040
es+0.038
op+0.036
ppy+0.036
Neg logit contributions
 purpose-0.049
 sight-0.049
 common-0.045
 harmony-0.045
 accomplishment-0.044
Token also
Token ID635
Feature activation-3.37e-01

Pos logit contributions
m+0.060
oke+0.057
owed+0.055
re+0.053
op+0.051
Neg logit contributions
 sight-0.072
 purpose-0.070
 fun-0.068
 beauty-0.067
 harmony-0.066
Token found
Token ID1043
Feature activation+2.82e+00

Pos logit contributions
m+0.005
owed+0.005
es+0.004
op+0.004
oke+0.004
Neg logit contributions
 purpose-0.008
 sight-0.008
 common-0.007
 harmony-0.007
 accomplishment-0.007
Token a
Token ID257
Feature activation+3.74e+00

Pos logit contributions
 common+0.051
 rightful+0.050
 purpose+0.049
 destination+0.044
 harmony+0.044
Neg logit contributions
es-0.057
m-0.055
owed-0.051
op-0.046
os-0.044
Token stick
Token ID4859
Feature activation-8.01e+00

Pos logit contributions
 accomplishment+0.041
 purpose+0.041
 awesome+0.039
 comparison+0.039
 sight+0.039
Neg logit contributions
m-0.089
op-0.080
bee-0.075
re-0.075
ll-0.074
Token flowers
Token ID12734
Feature activation-6.68e+00

Pos logit contributions
 purpose+0.056
 rightful+0.052
 common+0.052
 comparison+0.047
 harmony+0.047
Neg logit contributions
m-0.059
es-0.058
owed-0.054
 were-0.049
re-0.048
Token,
Token ID11
Feature activation+2.23e+00

Pos logit contributions
m+0.092
owed+0.087
es+0.081
re+0.075
op+0.075
Neg logit contributions
 purpose-0.158
 sight-0.155
 common-0.145
 harmony-0.144
 accomplishment-0.143
Token leaves
Token ID5667
Feature activation-4.59e+00

Pos logit contributions
 sight+0.028
 purpose+0.027
 common+0.024
 harmony+0.024
 freedom+0.023
Neg logit contributions
m-0.055
owed-0.054
es-0.051
op-0.050
oke-0.050
Token,
Token ID11
Feature activation+1.68e+00

Pos logit contributions
m+0.062
owed+0.059
es+0.055
re+0.051
op+0.051
Neg logit contributions
 purpose-0.109
 sight-0.107
 common-0.101
 harmony-0.099
 accomplishment-0.099
Token and
Token ID290
Feature activation+3.43e+00

Pos logit contributions
 sight+0.034
 purpose+0.034
 common+0.032
 harmony+0.031
 freedom+0.031
Neg logit contributions
m-0.027
owed-0.027
es-0.025
op-0.024
oke-0.024
Token sticks
Token ID16461
Feature activation-3.53e+00

Pos logit contributions
 purpose+0.054
 sight+0.050
 common+0.046
 accomplishment+0.045
 rightful+0.045
Neg logit contributions
m-0.076
owed-0.072
es-0.071
re-0.067
ppy-0.066
Token.
Token ID13
Feature activation-2.71e+00

Pos logit contributions
m+0.046
owed+0.043
es+0.042
re+0.038
ppy+0.036
Neg logit contributions
 purpose-0.087
 sight-0.085
 rightful-0.081
 accomplishment-0.079
 harmony-0.079
Token The
Token ID383
Feature activation+3.43e+00

Pos logit contributions
m+0.055
owed+0.053
es+0.050
ppy+0.048
op+0.048
Neg logit contributions
 purpose-0.047
 sight-0.047
 common-0.042
 harmony-0.042
 accomplishment-0.041
Token bunny
Token ID44915
Feature activation-1.19e+01

Pos logit contributions
 purpose+0.073
 sight+0.067
 common+0.066
 accomplishment+0.065
 rightful+0.065
Neg logit contributions
m-0.058
bee-0.048
es-0.046
op-0.046
owed-0.046
Token and
Token ID290
Feature activation+1.87e+00

Pos logit contributions
m+0.188
owed+0.177
oke+0.170
mallow+0.163
ird+0.158
Neg logit contributions
 sight-0.234
 fun-0.228
 purpose-0.222
 new-0.221
 original-0.217
Token the
Token ID262
Feature activation+4.47e+00

Pos logit contributions
 purpose+0.020
 sight+0.019
 common+0.016
 accomplishment+0.016
 harmony+0.015
Neg logit contributions
m-0.050
owed-0.049
es-0.047
ppy-0.046
re-0.045
Token big
Token ID1263
Feature activation+4.19e+00

Pos logit contributions
 purpose+0.060
 sight+0.059
 common+0.057
 freedom+0.057
 harmony+0.056
Neg logit contributions
m-0.087
op-0.080
bee-0.076
oke-0.074
owed-0.074
Token bag
Token ID6131
Feature activation-6.96e+00

Pos logit contributions
 purpose+0.083
 common+0.078
 sight+0.078
 harmony+0.076
 rightful+0.075
Neg logit contributions
m-0.069
owed-0.059
oke-0.058
op-0.058
bee-0.057
Token.
Token ID13
Feature activation-2.08e+00

Pos logit contributions
m+0.114
owed+0.109
es+0.102
op+0.097
re+0.096
Neg logit contributions
 purpose-0.145
 sight-0.143
 common-0.132
 harmony-0.131
 accomplishment-0.129
Token
Token ID198
Feature activation-7.03e-02

Pos logit contributions
m+0.038
owed+0.037
es+0.035
ppy+0.033
op+0.033
Neg logit contributions
 purpose-0.039
 sight-0.038
 common-0.035
 harmony-0.035
 accomplishment-0.034
Token
Token ID198
Feature activation-6.02e-01

Pos logit contributions
owed+0.001
m+0.001
es+0.001
ppy+0.001
bee+0.001
Neg logit contributions
 purpose-0.001
 sight-0.001
 common-0.001
 harmony-0.001
 accomplishment-0.001
TokenHe
Token ID1544
Feature activation-3.31e+00

Pos logit contributions
m+0.012
owed+0.012
es+0.011
ppy+0.010
re+0.010
Neg logit contributions
 purpose-0.011
 sight-0.010
 common-0.010
 harmony-0.009
 accomplishment-0.009
Token was
Token ID373
Feature activation+4.74e+00

Pos logit contributions
m+0.045
owed+0.042
es+0.039
oke+0.037
ppy+0.036
Neg logit contributions
 purpose-0.078
 sight-0.077
 common-0.072
 harmony-0.071
 accomplishment-0.071
Token curious
Token ID11040
Feature activation+1.33e+00

Pos logit contributions
 purpose+0.102
 common+0.094
 sight+0.093
 harmony+0.088
 rightful+0.085
Neg logit contributions
m-0.079
owed-0.075
es-0.070
re-0.067
ird-0.066
Token and
Token ID290
Feature activation+1.81e+00

Pos logit contributions
 purpose+0.030
 sight+0.030
 common+0.029
 harmony+0.028
 rightful+0.028
Neg logit contributions
m-0.019
owed-0.018
es-0.017
re-0.016
oke-0.016
Token wanted
Token ID2227
Feature activation+3.98e+00

Pos logit contributions
 purpose+0.048
 sight+0.046
 common+0.044
 harmony+0.043
 accomplishment+0.043
Neg logit contributions
m-0.020
owed-0.019
es-0.017
re-0.016
oke-0.015
Token to
Token ID284
Feature activation+4.78e+00

Pos logit contributions
 accomplishment+0.080
 purpose+0.080
 harmony+0.080
 sight+0.079
 rightful+0.078
Neg logit contributions
m-0.063
es-0.057
re-0.056
op-0.053
owed-0.053
Token loved
Token ID6151
Feature activation+3.59e+00

Pos logit contributions
m+0.078
owed+0.078
es+0.075
oke+0.075
re+0.070
Neg logit contributions
 purpose-0.094
 sight-0.093
 fun-0.088
 common-0.087
 harmony-0.085
Token to
Token ID284
Feature activation+4.67e+00

Pos logit contributions
 purpose+0.061
 sight+0.059
 common+0.059
 harmony+0.056
 accomplishment+0.056
Neg logit contributions
m-0.071
owed-0.064
re-0.062
op-0.062
es-0.062
Token laugh
Token ID6487
Feature activation-1.07e+00

Pos logit contributions
 purpose+0.074
 sight+0.072
 common+0.065
 harmony+0.064
 accomplishment+0.063
Neg logit contributions
m-0.100
owed-0.097
es-0.093
oke-0.089
ppy-0.089
Token out
Token ID503
Feature activation+2.42e+00

Pos logit contributions
m+0.018
owed+0.018
oke+0.017
re+0.016
ll+0.016
Neg logit contributions
 purpose-0.021
 sight-0.021
 harmony-0.019
 fun-0.019
 common-0.019
Token loud
Token ID7812
Feature activation+3.78e-01

Pos logit contributions
 purpose+0.040
 sight+0.038
 common+0.037
 rightful+0.036
 harmony+0.035
Neg logit contributions
m-0.049
es-0.048
owed-0.047
bee-0.044
op-0.044
Token.
Token ID13
Feature activation-2.87e+00

Pos logit contributions
 purpose+0.008
 sight+0.008
 common+0.008
 harmony+0.008
 accomplishment+0.007
Neg logit contributions
m-0.006
owed-0.005
es-0.005
re-0.005
op-0.005
Token She
Token ID1375
Feature activation-4.06e+00

Pos logit contributions
m+0.050
owed+0.049
es+0.047
ppy+0.044
oke+0.044
Neg logit contributions
 purpose-0.057
 sight-0.057
 common-0.051
 harmony-0.051
 beauty-0.050
Token was
Token ID373
Feature activation+3.59e+00

Pos logit contributions
m+0.059
owed+0.058
oke+0.052
es+0.052
op+0.051
Neg logit contributions
 sight-0.086
 purpose-0.085
 fun-0.084
 common-0.083
 harmony-0.080
Token always
Token ID1464
Feature activation+5.07e+00

Pos logit contributions
 purpose+0.073
 sight+0.067
 rightful+0.065
 common+0.064
 accomplishment+0.064
Neg logit contributions
m-0.066
owed-0.055
o-0.055
es-0.052
re-0.051
Token smiling
Token ID16755
Feature activation-6.84e-02

Pos logit contributions
 purpose+0.096
 sight+0.086
 common+0.085
 accomplishment+0.085
 rightful+0.085
Neg logit contributions
m-0.102
es-0.083
owed-0.083
re-0.081
o-0.079
Token and
Token ID290
Feature activation+2.52e+00

Pos logit contributions
m+0.001
owed+0.001
es+0.001
op+0.001
ppy+0.001
Neg logit contributions
 purpose-0.002
 sight-0.001
 common-0.001
 harmony-0.001
 accomplishment-0.001
Token was
Token ID373
Feature activation+3.79e+00

Pos logit contributions
owed+0.069
m+0.068
ppy+0.064
es+0.062
oke+0.060
Neg logit contributions
 sight-0.129
 purpose-0.129
 common-0.119
 harmony-0.117
 fun-0.115
Token a
Token ID257
Feature activation+4.44e-01

Pos logit contributions
 purpose+0.047
 sight+0.044
 rightful+0.041
 accomplishment+0.040
 common+0.040
Neg logit contributions
m-0.094
owed-0.088
es-0.086
ppy-0.083
re-0.083
Token little
Token ID1310
Feature activation+2.34e-01

Pos logit contributions
 purpose+0.006
 sight+0.006
 common+0.005
 harmony+0.005
 accomplishment+0.005
Neg logit contributions
m-0.011
owed-0.011
es-0.010
re-0.010
ppy-0.010
Token ball
Token ID2613
Feature activation-1.12e+01

Pos logit contributions
 sight+0.005
 purpose+0.004
 common+0.004
 sand+0.004
 harmony+0.004
Neg logit contributions
owed-0.004
m-0.004
es-0.004
re-0.004
oke-0.004
Token named
Token ID3706
Feature activation-2.63e-01

Pos logit contributions
owed+0.210
m+0.191
es+0.189
oke+0.188
re+0.188
Neg logit contributions
 sight-0.203
 purpose-0.195
 new-0.183
 sand-0.179
 original-0.178
Token Billy
Token ID15890
Feature activation-1.37e+01

Pos logit contributions
m+0.005
owed+0.005
es+0.005
op+0.005
ppy+0.005
Neg logit contributions
 purpose-0.004
 sight-0.004
 common-0.004
 harmony-0.004
 accomplishment-0.004
Token.
Token ID13
Feature activation-3.87e+00

Pos logit contributions
owed+0.292
oke+0.281
m+0.277
op+0.271
ops+0.267
Neg logit contributions
 sight-0.199
 purpose-0.196
 our-0.191
 common-0.186
 fun-0.184
Token Billy
Token ID15890
Feature activation-1.25e+01

Pos logit contributions
m+0.074
owed+0.071
es+0.068
op+0.065
re+0.065
Neg logit contributions
 purpose-0.069
 sight-0.067
 common-0.062
 harmony-0.061
 accomplishment-0.061
Token loved
Token ID6151
Feature activation+3.66e+00

Pos logit contributions
owed+0.216
m+0.209
oke+0.198
es+0.190
int+0.189
Neg logit contributions
 purpose-0.234
 sight-0.233
 fun-0.232
 common-0.231
 our-0.228
Token to
Token ID284
Feature activation+4.63e+00

Pos logit contributions
 purpose+0.074
 common+0.072
 rightful+0.071
 accomplishment+0.070
 sight+0.069
Neg logit contributions
m-0.056
o-0.051
re-0.049
op-0.049
ll-0.047
Token roll
Token ID4836
Feature activation+1.30e+00

Pos logit contributions
 sight+0.079
 purpose+0.079
 rightful+0.070
 harmony+0.070
 accomplishment+0.070
Neg logit contributions
m-0.086
owed-0.086
bee-0.082
op-0.081
es-0.079
Token,
Token ID11
Feature activation+1.63e+00

Pos logit contributions
 purpose+0.118
 rightful+0.113
 sight+0.107
 harmony+0.107
 common+0.107
Neg logit contributions
m-0.055
owed-0.048
re-0.046
es-0.046
op-0.042
Token he
Token ID339
Feature activation-3.89e+00

Pos logit contributions
 purpose+0.033
 common+0.030
 sight+0.030
 rightful+0.029
 harmony+0.029
Neg logit contributions
m-0.028
es-0.026
owed-0.026
re-0.023
op-0.023
Token saw
Token ID2497
Feature activation+3.48e+00

Pos logit contributions
m+0.057
owed+0.054
es+0.050
op+0.049
oke+0.049
Neg logit contributions
 sight-0.085
 purpose-0.082
 common-0.078
 beauty-0.077
 harmony-0.077
Token that
Token ID326
Feature activation+2.87e+00

Pos logit contributions
 purpose+0.057
 common+0.054
 rightful+0.054
 accomplishment+0.048
 sight+0.048
Neg logit contributions
es-0.073
m-0.072
owed-0.070
os-0.061
oke-0.061
Token the
Token ID262
Feature activation+5.39e+00

Pos logit contributions
 purpose+0.042
 sight+0.037
 common+0.036
 accomplishment+0.036
 rightful+0.035
Neg logit contributions
m-0.065
owed-0.061
es-0.061
bee-0.055
re-0.055
Token bird
Token ID6512
Feature activation-1.14e+01

Pos logit contributions
 purpose+0.106
 common+0.100
 sight+0.099
 harmony+0.096
 accomplishment+0.096
Neg logit contributions
m-0.095
owed-0.085
bee-0.081
es-0.079
re-0.079
Token was
Token ID373
Feature activation+5.00e+00

Pos logit contributions
m+0.143
owed+0.139
op+0.132
ppy+0.129
oke+0.126
Neg logit contributions
 sight-0.264
 purpose-0.252
 beauty-0.246
 harmony-0.242
 freedom-0.239
Token carrying
Token ID6872
Feature activation+4.18e+00

Pos logit contributions
 purpose+0.096
 accomplishment+0.086
 common+0.085
 sight+0.085
 harmony+0.083
Neg logit contributions
m-0.092
owed-0.083
es-0.080
re-0.074
ppy-0.072
Token a
Token ID257
Feature activation+5.93e+00

Pos logit contributions
 purpose+0.060
 common+0.055
 rightful+0.054
 sight+0.054
 accomplishment+0.052
Neg logit contributions
es-0.090
m-0.090
owed-0.089
ppy-0.079
re-0.078
Token little
Token ID1310
Feature activation+4.16e+00

Pos logit contributions
 purpose+0.075
 sight+0.071
 rightful+0.069
 accomplishment+0.069
 harmony+0.068
Neg logit contributions
m-0.139
op-0.127
owed-0.126
re-0.121
es-0.121
Token stick
Token ID4859
Feature activation-7.42e+00

Pos logit contributions
 purpose+0.073
 sight+0.068
 common+0.067
 rightful+0.064
 awesome+0.064
Neg logit contributions
m-0.080
owed-0.075
es-0.070
op-0.070
re-0.069
Token hill
Token ID12788
Feature activation-7.62e+00

Pos logit contributions
 purpose+0.062
 sight+0.059
 common+0.058
 harmony+0.055
 accomplishment+0.055
Neg logit contributions
m-0.083
owed-0.072
op-0.071
re-0.070
es-0.069
Token with
Token ID351
Feature activation+4.42e+00

Pos logit contributions
m+0.111
owed+0.107
es+0.099
ppy+0.094
oke+0.094
Neg logit contributions
 purpose-0.170
 sight-0.169
 common-0.157
 harmony-0.155
 accomplishment-0.154
Token her
Token ID607
Feature activation+6.56e-01

Pos logit contributions
 rightful+0.098
 purpose+0.096
 former+0.095
 harmony+0.092
 sight+0.090
Neg logit contributions
m-0.055
es-0.050
op-0.045
re-0.045
owed-0.044
Token sled
Token ID46822
Feature activation-8.67e+00

Pos logit contributions
 purpose+0.010
 sight+0.010
 harmony+0.009
 common+0.009
 accomplishment+0.009
Neg logit contributions
m-0.014
owed-0.014
op-0.013
es-0.013
re-0.013
Token.
Token ID13
Feature activation-1.87e+00

Pos logit contributions
m+0.126
owed+0.122
es+0.112
ppy+0.110
oke+0.107
Neg logit contributions
 purpose-0.193
 sight-0.192
 common-0.176
 accomplishment-0.173
 harmony-0.171
Token Ben
Token ID3932
Feature activation-1.08e+01

Pos logit contributions
m+0.032
owed+0.031
es+0.029
ppy+0.028
oke+0.027
Neg logit contributions
 purpose-0.037
 sight-0.037
 common-0.034
 harmony-0.033
 accomplishment-0.033
Token is
Token ID318
Feature activation+2.79e+00

Pos logit contributions
m+0.198
owed+0.194
es+0.178
ppy+0.174
re+0.171
Neg logit contributions
 sight-0.198
 purpose-0.197
 common-0.183
 harmony-0.180
 accomplishment-0.180
Token lazy
Token ID16931
Feature activation-1.13e+00

Pos logit contributions
 purpose+0.054
 sight+0.046
 rightful+0.046
 common+0.046
 former+0.044
Neg logit contributions
m-0.059
owed-0.049
es-0.049
re-0.047
o-0.047
Token.
Token ID13
Feature activation-3.10e+00

Pos logit contributions
m+0.016
owed+0.015
es+0.014
re+0.013
op+0.013
Neg logit contributions
 purpose-0.027
 sight-0.026
 common-0.025
 harmony-0.024
 accomplishment-0.024
Token He
Token ID679
Feature activation-3.74e+00

Pos logit contributions
m+0.054
owed+0.053
es+0.049
ppy+0.047
op+0.047
Neg logit contributions
 purpose-0.061
 sight-0.060
 common-0.055
 harmony-0.054
 accomplishment-0.054
Token walks
Token ID11114
Feature activation-2.64e+00

Pos logit contributions
m+0.061
owed+0.058
es+0.055
op+0.053
re+0.053
Neg logit contributions
 purpose-0.080
 sight-0.077
 harmony-0.070
 common-0.070
 rightful-0.069
Token a
Token ID257
Feature activation-1.36e+00

Pos logit contributions
m+0.018
owed+0.018
es+0.017
oke+0.017
op+0.017
Neg logit contributions
 purpose-0.006
 sight-0.006
 common-0.005
 harmony-0.005
 accomplishment-0.005
Token time
Token ID640
Feature activation-3.05e+00

Pos logit contributions
m+0.039
owed+0.038
es+0.037
oke+0.036
re+0.036
Neg logit contributions
 purpose-0.012
 sight-0.011
 common-0.009
 harmony-0.009
 accomplishment-0.008
Token there
Token ID612
Feature activation-6.14e+00

Pos logit contributions
owed+0.054
m+0.053
ppy+0.051
bee+0.050
es+0.050
Neg logit contributions
 sight-0.056
 purpose-0.055
 common-0.051
 our-0.050
 fun-0.050
Token was
Token ID373
Feature activation+3.51e+00

Pos logit contributions
owed+0.085
m+0.082
ppy+0.081
es+0.077
oke+0.076
Neg logit contributions
 sight-0.139
 purpose-0.137
 common-0.126
 fun-0.124
 harmony-0.123
Token a
Token ID257
Feature activation-1.86e-01

Pos logit contributions
 purpose+0.042
 sight+0.040
 rightful+0.037
 accomplishment+0.036
 common+0.036
Neg logit contributions
m-0.088
owed-0.083
es-0.081
ppy-0.078
re-0.078
Token boy
Token ID2933
Feature activation-1.30e+01

Pos logit contributions
owed+0.004
m+0.004
es+0.004
re+0.004
ppy+0.004
Neg logit contributions
 purpose-0.002
 sight-0.002
 common-0.002
 harmony-0.002
 accomplishment-0.002
Token.
Token ID13
Feature activation-3.48e+00

Pos logit contributions
owed+0.215
m+0.210
oke+0.204
re+0.201
es+0.196
Neg logit contributions
 purpose-0.249
 sight-0.242
 common-0.231
 fun-0.227
 our-0.225
Token He
Token ID679
Feature activation-4.63e+00

Pos logit contributions
m+0.065
owed+0.064
es+0.060
ppy+0.057
oke+0.057
Neg logit contributions
 purpose-0.065
 sight-0.064
 common-0.059
 harmony-0.058
 rightful-0.056
Token was
Token ID373
Feature activation+3.63e+00

Pos logit contributions
m+0.082
owed+0.082
oke+0.081
es+0.078
re+0.075
Neg logit contributions
 purpose-0.085
 fun-0.085
 sight-0.084
 common-0.081
 original-0.079
Token always
Token ID1464
Feature activation+4.33e+00

Pos logit contributions
 purpose+0.068
 sight+0.063
 common+0.061
 harmony+0.061
 accomplishment+0.059
Neg logit contributions
m-0.073
owed-0.063
op-0.061
o-0.061
re-0.059
Token very
Token ID845
Feature activation+5.95e+00

Pos logit contributions
 purpose+0.079
 sight+0.073
 common+0.071
 accomplishment+0.071
 harmony+0.069
Neg logit contributions
m-0.089
owed-0.076
re-0.073
es-0.073
o-0.071
Token best
Token ID1266
Feature activation+3.48e+00

Pos logit contributions
 purpose+0.132
 sight+0.131
 common+0.118
 accomplishment+0.116
 harmony+0.116
Neg logit contributions
m-0.130
owed-0.128
es-0.122
ppy-0.115
op-0.115
Token mel
Token ID7758
Feature activation-6.71e+00

Pos logit contributions
 purpose+0.057
 sight+0.055
 common+0.053
 harmony+0.053
 rightful+0.052
Neg logit contributions
m-0.072
owed-0.066
re-0.064
op-0.063
es-0.062
Tokenon
Token ID261
Feature activation-6.57e+00

Pos logit contributions
owed+0.099
m+0.088
bee+0.086
itive+0.085
oke+0.084
Neg logit contributions
 purpose-0.135
 sight-0.134
 awesome-0.123
 sand-0.123
 freedom-0.122
Token.
Token ID13
Feature activation-3.07e+00

Pos logit contributions
m+0.111
owed+0.107
es+0.101
op+0.096
ppy+0.096
Neg logit contributions
 purpose-0.133
 sight-0.131
 common-0.122
 harmony-0.121
 accomplishment-0.118
Token The
Token ID383
Feature activation+3.78e+00

Pos logit contributions
m+0.069
owed+0.068
es+0.064
bee+0.063
ppy+0.063
Neg logit contributions
 purpose-0.045
 sight-0.045
 common-0.039
 harmony-0.039
 accomplishment-0.038
Token boy
Token ID2933
Feature activation-1.27e+01

Pos logit contributions
 sight+0.063
 purpose+0.062
 common+0.054
 accomplishment+0.054
 beauty+0.054
Neg logit contributions
owed-0.078
m-0.077
es-0.075
ppy-0.071
re-0.071
Token's
Token ID338
Feature activation+3.12e+00

Pos logit contributions
m+0.195
owed+0.179
ppy+0.176
op+0.176
oke+0.173
Neg logit contributions
 sight-0.245
 purpose-0.231
 joy-0.227
 new-0.226
 fun-0.225
Token father
Token ID2988
Feature activation-1.07e+01

Pos logit contributions
 purpose+0.052
 common+0.051
 harmony+0.050
 sight+0.048
 outer+0.047
Neg logit contributions
m-0.061
owed-0.057
op-0.055
re-0.054
bee-0.054
Token said
Token ID531
Feature activation-1.31e+00

Pos logit contributions
m+0.148
owed+0.146
op+0.140
ppy+0.138
re+0.135
Neg logit contributions
 sight-0.209
 accomplishment-0.193
 new-0.192
 joy-0.192
 purpose-0.189
Token,
Token ID11
Feature activation-2.00e+00

Pos logit contributions
m+0.023
owed+0.022
es+0.021
ppy+0.020
re+0.020
Neg logit contributions
 purpose-0.025
 sight-0.025
 common-0.023
 accomplishment-0.023
 harmony-0.022
Token "
Token ID366
Feature activation-2.75e+00

Pos logit contributions
owed+0.044
m+0.043
ppy+0.041
es+0.040
re+0.040
Neg logit contributions
 sight-0.028
 purpose-0.028
 our-0.025
 new-0.024
 accomplishment-0.024