TOP ACTIVATIONS
MAX = 0.113

You helped us when we needed you ." The
Can we go home and have a w affle ?"
" Why do we need a suit ?" Lily wondered
we make my room look like a cartoon ?" he said
girl and you don 't need any shiny things to make
barrel . We don 't need your toys . We have
. Why do we always have to do things your way
tower . " Now we need more towers . And a
anyone . We do not need school ." Their
never know when you might need it ." <|endoftext|> Ben and
will help you if you need . We can make something
, Sam . Maybe we need to play something else ,"
. But we don 't need to know . We have
ask for help when we need it , and we should
ask for help when we need it . <|endoftext|> Once upon
, " Why do we need to sell my toys ?"
mean . He does not need the car . I want
You can use them to have fun and to learn while
milk ? I think you deserve a treat ."
happy . We don 't need a big house or a

MIN ACTIVATIONS
MIN = -0.173

as Hel io . Hel io explained that their oxygen was
from the tank . Hel io was thankful and thanked T
It is not safe ," Pip said . L
stealing my apples !" the farmer shouted . He
hat , like this ." Sue said . She took off
some papers too ?" the prince asked . His dad smiled
helping each other ." Big gie and his friends listened to
and speak to me .' Claw ing his way out of
, what 's that ?" Timothy asked , pointing at the
he wanted . <|endoftext|> Ak io was a 3 year old
to be my friends ?" Pip said . "
consequences ." D ennis and Jenny left the restaurant
to me ?" The prince sn ar led and said
it ." G avin thanked the woman and drove
is like this pumpkin ," Pam says . "
in the mud ," the prince said . The
get to the water ." Sam whispered , trying to be
. J ill was embarrassed that she had
going to shoot us !" Jill screamed , hiding behind her
will hurt you !" The prince was scared but he was

INTERVAL 0.042 - 0.113
CONTAINS 12.238%

you can play with me . This is Lily , my
" Oh no , I 'm shrinking !" she cried .
Anna and Ben like to play in the pool . They
playful thing they had ever done , and it made their
learn more math and have fun too !" Tom was so

INTERVAL -0.030 - 0.042
CONTAINS 70.044%

" I lost my favorite toy ." Tim my said ,
her read the note . Her mom read it out loud
. She said , " Yes , Mom . That sounds
They wanted to show him to their mom and dad later
liked to make towers with the blocks . They tried to

INTERVAL -0.102 - -0.030
CONTAINS 14.047%

, it was a little bunny . She looked at the
something beautiful . He knew that he had made
, a big boy pushed Lily and she fell down .
� � € � � Jane , how can you tell
ran to help her and they got pieces of water buckets

INTERVAL -0.173 - -0.102
CONTAINS 3.670%

deep ." L ily was happy and started to
One day , Lily 's friend Tim my came
free ." L ily was sad but understood .
milk please ?" His mom my said , " Sure ,
nice . L ily felt sad . She thought
Token You
Token ID921
Feature activation+2.71e-02

Pos logit contributions
 purpose+0.090
 yours+0.089
 our+0.081
 ours+0.076
 awesome+0.074
Neg logit contributions
ppy-0.091
 hopped-0.087
i-0.085
 walked-0.083
bug-0.083
Token helped
Token ID4193
Feature activation+5.45e-02

Pos logit contributions
 yours+0.059
 our+0.054
 purpose+0.053
 your+0.049
 pleasure+0.048
Neg logit contributions
i-0.049
ppy-0.049
 felt-0.048
owed-0.047
oped-0.047
Token us
Token ID514
Feature activation+1.96e-02

Pos logit contributions
 yours+0.109
 pleasure+0.098
 purpose+0.095
 our+0.094
 electricity+0.087
Neg logit contributions
i-0.105
ie-0.101
ppy-0.101
 felt-0.100
s-0.097
Token when
Token ID618
Feature activation+6.26e-02

Pos logit contributions
 yours+0.046
 purpose+0.041
 our+0.040
 pleasure+0.039
 electricity+0.038
Neg logit contributions
ppy-0.031
 felt-0.031
i-0.031
s-0.031
 crawled-0.030
Token we
Token ID356
Feature activation+3.46e-02

Pos logit contributions
 yours+0.112
 purpose+0.106
 pleasure+0.100
 our+0.090
 ours+0.088
Neg logit contributions
 felt-0.129
 hopped-0.126
 ran-0.126
 walked-0.124
 crawled-0.123
Token needed
Token ID2622
Feature activation+1.08e-01

Pos logit contributions
 yours+0.073
 our+0.067
 purpose+0.065
 your+0.061
 pleasure+0.060
Neg logit contributions
i-0.064
ppy-0.064
 felt-0.062
oped-0.062
owed-0.061
Token you
Token ID345
Feature activation+4.20e-02

Pos logit contributions
 yours+0.210
 pleasure+0.204
 purpose+0.201
 diversity+0.200
 disl+0.199
Neg logit contributions
 got-0.171
 started-0.166
's-0.162
 went-0.158
s-0.157
Token."
Token ID526
Feature activation-1.92e-02

Pos logit contributions
 yours+0.090
 purpose+0.086
 electricity+0.078
 pleasure+0.078
 yourselves+0.078
Neg logit contributions
s-0.071
's-0.070
 felt-0.069
 got-0.068
ppy-0.067
Token
Token ID198
Feature activation-1.25e-03

Pos logit contributions
ppy+0.043
i+0.042
ling+0.041
ches+0.041
oped+0.041
Neg logit contributions
 yours-0.035
 our-0.033
 purpose-0.031
 your-0.030
 my-0.028
Token
Token ID198
Feature activation-6.79e-03

Pos logit contributions
i+0.002
ppy+0.002
 felt+0.002
owed+0.002
oped+0.002
Neg logit contributions
 yours-0.003
 our-0.003
 purpose-0.003
 your-0.002
 pleasure-0.002
TokenThe
Token ID464
Feature activation-6.49e-03

Pos logit contributions
i+0.013
ppy+0.013
owed+0.013
 felt+0.013
oped+0.012
Neg logit contributions
 yours-0.014
 our-0.013
 purpose-0.013
 your-0.012
 my-0.011
Token Can
Token ID1680
Feature activation+6.24e-02

Pos logit contributions
 purpose+0.111
 yours+0.110
 our+0.096
 electricity+0.092
 ours+0.090
Neg logit contributions
ppy-0.107
 hopped-0.104
's-0.101
 crawled-0.101
s-0.100
Token we
Token ID356
Feature activation+4.42e-02

Pos logit contributions
 yours+0.121
 purpose+0.114
 our+0.104
 pleasure+0.103
 electricity+0.100
Neg logit contributions
s-0.126
i-0.122
 felt-0.121
ppy-0.120
 crawled-0.118
Token go
Token ID467
Feature activation+5.20e-02

Pos logit contributions
 yours+0.094
 our+0.085
 purpose+0.084
 your+0.078
 pleasure+0.077
Neg logit contributions
i-0.082
ppy-0.082
 felt-0.080
 crawled-0.078
s-0.078
Token home
Token ID1363
Feature activation+4.94e-02

Pos logit contributions
 yours+0.108
 purpose+0.099
 our+0.097
 pleasure+0.089
 your+0.088
Neg logit contributions
i-0.098
ppy-0.098
 felt-0.097
s-0.096
 crawled-0.095
Token and
Token ID290
Feature activation+6.41e-02

Pos logit contributions
 yours+0.121
 purpose+0.112
 pleasure+0.106
 our+0.105
 electricity+0.105
Neg logit contributions
 felt-0.076
s-0.076
i-0.074
ie-0.072
ppy-0.072
Token have
Token ID423
Feature activation+1.08e-01

Pos logit contributions
 yours+0.145
 purpose+0.134
 our+0.127
 pleasure+0.123
 electricity+0.122
Neg logit contributions
 felt-0.109
 crawled-0.106
i-0.106
ppy-0.105
 climbed-0.105
Token a
Token ID257
Feature activation+7.78e-02

Pos logit contributions
 purpose+0.228
 yours+0.228
 pleasure+0.216
 yourselves+0.212
 electricity+0.210
Neg logit contributions
's-0.187
 got-0.176
 felt-0.176
 looked-0.173
 sat-0.173
Token w
Token ID266
Feature activation+1.29e-02

Pos logit contributions
 yours+0.154
 our+0.134
 purpose+0.133
 yourselves+0.133
 electricity+0.126
Neg logit contributions
 felt-0.153
s-0.152
 hopped-0.151
ppy-0.150
 got-0.150
Tokenaffle
Token ID30697
Feature activation+2.05e-02

Pos logit contributions
 yours+0.033
 our+0.031
 purpose+0.030
 your+0.029
 pleasure+0.028
Neg logit contributions
ppy-0.018
i-0.018
owed-0.017
 felt-0.017
ling-0.017
Token?"
Token ID1701
Feature activation-2.13e-02

Pos logit contributions
 yours+0.041
 purpose+0.037
 our+0.037
 pleasure+0.034
 electricity+0.034
Neg logit contributions
ppy-0.041
i-0.039
 felt-0.039
s-0.039
owed-0.039
Token
Token ID198
Feature activation-3.45e-03

Pos logit contributions
i+0.034
ppy+0.033
 felt+0.033
s+0.032
 crawled+0.032
Neg logit contributions
 yours-0.051
 our-0.047
 purpose-0.046
 your-0.043
 pleasure-0.043
Token
Token ID198
Feature activation-6.66e-03

Pos logit contributions
ppy+0.007
i+0.007
 felt+0.007
 crawled+0.007
oped+0.007
Neg logit contributions
 yours-0.010
 our-0.009
 purpose-0.009
 your-0.009
 pleasure-0.009
Token"
Token ID1
Feature activation+2.51e-02

Pos logit contributions
i+0.011
ppy+0.010
 felt+0.010
owed+0.010
 crawled+0.010
Neg logit contributions
 yours-0.016
 our-0.015
 purpose-0.015
 your-0.014
 pleasure-0.013
TokenWhy
Token ID5195
Feature activation+8.02e-02

Pos logit contributions
 yours+0.053
 our+0.048
 purpose+0.047
 your+0.044
 pleasure+0.043
Neg logit contributions
i-0.047
ppy-0.047
 felt-0.046
owed-0.046
 crawled-0.045
Token do
Token ID466
Feature activation+7.92e-02

Pos logit contributions
 yours+0.181
 purpose+0.164
 our+0.163
 pleasure+0.152
 your+0.149
Neg logit contributions
 felt-0.138
ppy-0.137
i-0.137
s-0.132
owed-0.131
Token we
Token ID356
Feature activation+3.28e-02

Pos logit contributions
 yours+0.133
 pleasure+0.115
 purpose+0.112
 our+0.101
 electricity+0.099
Neg logit contributions
 felt-0.177
's-0.176
s-0.175
i-0.173
 hopped-0.170
Token need
Token ID761
Feature activation+1.08e-01

Pos logit contributions
 yours+0.072
 our+0.066
 purpose+0.065
 your+0.061
 pleasure+0.059
Neg logit contributions
i-0.058
ppy-0.058
owed-0.056
 felt-0.056
oped-0.056
Token a
Token ID257
Feature activation+7.23e-02

Pos logit contributions
 pleasure+0.202
 yours+0.201
 purpose+0.187
 disl+0.185
 diversity+0.184
Neg logit contributions
 got-0.184
 started-0.180
's-0.176
 felt-0.170
 ran-0.168
Token suit
Token ID6050
Feature activation+1.50e-02

Pos logit contributions
 yours+0.150
 yourselves+0.138
 our+0.131
 purpose+0.126
 electricity+0.122
Neg logit contributions
s-0.128
 hopped-0.127
 felt-0.127
 got-0.127
 crawled-0.124
Token?"
Token ID1701
Feature activation-2.54e-02

Pos logit contributions
 yours+0.029
 purpose+0.027
 our+0.027
 pleasure+0.025
 electricity+0.024
Neg logit contributions
 felt-0.029
ppy-0.029
i-0.029
s-0.029
owed-0.029
Token Lily
Token ID20037
Feature activation-1.29e-01

Pos logit contributions
i+0.049
ppy+0.049
 felt+0.048
 crawled+0.048
owed+0.047
Neg logit contributions
 yours-0.052
 our-0.047
 purpose-0.046
 your-0.042
 pleasure-0.042
Token wondered
Token ID14028
Feature activation-1.57e-02

Pos logit contributions
oped+0.252
ops+0.233
owed+0.223
 mun+0.219
ppers+0.216
Neg logit contributions
 your-0.261
 my-0.250
 yours-0.246
 our-0.246
 purpose-0.222
Token we
Token ID356
Feature activation+4.99e-02

Pos logit contributions
 yours+0.137
 purpose+0.122
 our+0.113
 pleasure+0.109
 electricity+0.100
Neg logit contributions
 felt-0.192
i-0.192
ppy-0.190
s-0.190
 crawled-0.187
Token make
Token ID787
Feature activation+8.04e-02

Pos logit contributions
 yours+0.112
 purpose+0.103
 our+0.100
 pleasure+0.093
 your+0.091
Neg logit contributions
i-0.085
ppy-0.084
 felt-0.084
 crawled-0.082
ie-0.082
Token my
Token ID616
Feature activation+2.75e-02

Pos logit contributions
 yours+0.145
 pleasure+0.138
 common+0.133
 purpose+0.131
 diversity+0.124
Neg logit contributions
's-0.166
ie-0.154
s-0.151
i-0.151
es-0.145
Token room
Token ID2119
Feature activation+6.10e-02

Pos logit contributions
 yours+0.067
 purpose+0.059
 our+0.059
 yourselves+0.055
 pleasure+0.054
Neg logit contributions
 felt-0.042
ie-0.041
s-0.041
i-0.040
 crawled-0.040
Token look
Token ID804
Feature activation+7.49e-02

Pos logit contributions
 yours+0.132
 purpose+0.124
 electricity+0.117
 pleasure+0.116
 yourselves+0.113
Neg logit contributions
 felt-0.107
 got-0.104
ie-0.101
 looked-0.100
s-0.098
Token like
Token ID588
Feature activation+1.07e-01

Pos logit contributions
 purpose+0.143
 yours+0.137
 yourselves+0.127
 pleasure+0.124
 electricity+0.124
Neg logit contributions
s-0.143
ie-0.143
 felt-0.143
ling-0.141
 got-0.139
Token a
Token ID257
Feature activation+6.80e-02

Pos logit contributions
 purpose+0.217
 common+0.207
 disl+0.207
 fault+0.203
 pleasure+0.198
Neg logit contributions
 sat-0.166
 looked-0.164
 went-0.160
 said-0.160
 got-0.159
Token cartoon
Token ID16251
Feature activation+3.81e-02

Pos logit contributions
 yours+0.142
 yourselves+0.123
 purpose+0.123
 our+0.119
 electricity+0.118
Neg logit contributions
 felt-0.121
s-0.120
 got-0.118
's-0.118
 looked-0.117
Token?"
Token ID1701
Feature activation-3.28e-02

Pos logit contributions
 yours+0.075
 purpose+0.067
 our+0.066
 pleasure+0.063
 yourselves+0.060
Neg logit contributions
 felt-0.074
s-0.074
ppy-0.074
y-0.072
ie-0.072
Token he
Token ID339
Feature activation-9.54e-02

Pos logit contributions
ppy+0.073
i+0.068
ches+0.066
owed+0.066
ling+0.065
Neg logit contributions
 yours-0.062
 our-0.058
 purpose-0.056
 your-0.055
 my-0.051
Token said
Token ID531
Feature activation-2.08e-02

Pos logit contributions
oped+0.141
cases+0.129
oping+0.128
ops+0.127
hetically+0.120
Neg logit contributions
 yours-0.230
 our-0.228
 your-0.228
 my-0.224
 purpose-0.219
Token girl
Token ID2576
Feature activation-1.81e-03

Pos logit contributions
 yours+0.124
 our+0.117
 purpose+0.113
 ours+0.108
 electricity+0.108
Neg logit contributions
i-0.082
ie-0.078
y-0.076
 felt-0.075
ppy-0.071
Token and
Token ID290
Feature activation+4.38e-02

Pos logit contributions
i+0.003
ppy+0.003
 felt+0.003
 crawled+0.003
oped+0.003
Neg logit contributions
 yours-0.004
 our-0.004
 purpose-0.004
 your-0.004
 pleasure-0.004
Token you
Token ID345
Feature activation+1.95e-02

Pos logit contributions
 purpose+0.071
 yours+0.071
 pleasure+0.062
 our+0.060
 electricity+0.057
Neg logit contributions
 felt-0.101
 forgot-0.096
 ran-0.094
i-0.094
 got-0.094
Token don
Token ID836
Feature activation+7.14e-02

Pos logit contributions
 yours+0.044
 our+0.040
 purpose+0.039
 your+0.037
 pleasure+0.036
Neg logit contributions
i-0.034
ppy-0.034
 felt-0.033
oped-0.033
 crawled-0.033
Token't
Token ID470
Feature activation+5.40e-02

Pos logit contributions
 yours+0.141
 our+0.127
 purpose+0.125
 your+0.116
 pleasure+0.113
Neg logit contributions
i-0.143
ppy-0.143
 felt-0.140
 crawled-0.138
owed-0.138
Token need
Token ID761
Feature activation+1.07e-01

Pos logit contributions
 yours+0.118
 our+0.108
 purpose+0.106
 pleasure+0.100
 your+0.098
Neg logit contributions
ppy-0.095
i-0.095
 felt-0.094
 crawled-0.090
y-0.090
Token any
Token ID597
Feature activation+8.71e-02

Pos logit contributions
 pleasure+0.226
 yours+0.214
 outer+0.206
 awesome+0.205
 disl+0.204
Neg logit contributions
's-0.144
 got-0.143
s-0.142
 felt-0.141
 ran-0.141
Token shiny
Token ID22441
Feature activation+5.25e-02

Pos logit contributions
 yours+0.160
 our+0.149
 purpose+0.144
 yourselves+0.139
 pleasure+0.138
Neg logit contributions
 felt-0.157
s-0.155
's-0.153
 crawled-0.152
 got-0.152
Token things
Token ID1243
Feature activation+3.83e-02

Pos logit contributions
 yours+0.090
 pleasure+0.085
 purpose+0.084
 our+0.083
 ours+0.073
Neg logit contributions
ie-0.113
i-0.113
y-0.110
s-0.109
 felt-0.106
Token to
Token ID284
Feature activation+7.22e-02

Pos logit contributions
 purpose+0.085
 yours+0.081
 pleasure+0.080
 electricity+0.076
 our+0.076
Neg logit contributions
s-0.063
ppy-0.057
y-0.057
's-0.056
i-0.055
Token make
Token ID787
Feature activation+8.17e-02

Pos logit contributions
 yours+0.155
 pleasure+0.148
 our+0.139
 electricity+0.138
 purpose+0.135
Neg logit contributions
i-0.114
ie-0.113
 felt-0.113
 got-0.106
 crawled-0.105
Token barrel
Token ID9036
Feature activation+4.59e-03

Pos logit contributions
 yours+0.089
 purpose+0.078
 our+0.074
 your+0.072
 pleasure+0.072
Neg logit contributions
 felt-0.071
 crawled-0.071
s-0.070
 ran-0.070
ling-0.069
Token.
Token ID13
Feature activation+6.78e-02

Pos logit contributions
 yours+0.011
 our+0.010
 purpose+0.010
 your+0.009
 pleasure+0.009
Neg logit contributions
i-0.008
ppy-0.008
oped-0.008
 felt-0.008
 crawled-0.007
Token We
Token ID775
Feature activation+1.50e-02

Pos logit contributions
 purpose+0.159
 yours+0.148
 yourselves+0.128
 consequences+0.127
 potential+0.126
Neg logit contributions
's-0.102
�-0.093
s-0.090
 suggested-0.088
 could-0.087
Token don
Token ID836
Feature activation+7.13e-02

Pos logit contributions
 yours+0.030
 our+0.028
 purpose+0.027
 your+0.025
 pleasure+0.025
Neg logit contributions
i-0.029
ppy-0.029
owed-0.028
oped-0.028
 felt-0.028
Token't
Token ID470
Feature activation+4.24e-02

Pos logit contributions
 yours+0.142
 our+0.128
 purpose+0.126
 your+0.116
 pleasure+0.114
Neg logit contributions
i-0.142
ppy-0.141
 felt-0.139
 crawled-0.137
oped-0.136
Token need
Token ID761
Feature activation+1.07e-01

Pos logit contributions
 yours+0.093
 our+0.085
 purpose+0.083
 your+0.077
 pleasure+0.077
Neg logit contributions
ppy-0.076
i-0.076
 felt-0.074
 crawled-0.073
 hopped-0.072
Token your
Token ID534
Feature activation+4.06e-02

Pos logit contributions
 pleasure+0.214
 disl+0.198
arl+0.193
 purpose+0.193
 rightful+0.188
Neg logit contributions
 ran-0.164
's-0.163
 got-0.160
 started-0.158
 tried-0.156
Token toys
Token ID14958
Feature activation+2.90e-02

Pos logit contributions
 yours+0.087
 purpose+0.077
 yourselves+0.077
 our+0.076
 your+0.074
Neg logit contributions
 felt-0.068
 crawled-0.065
 ran-0.065
 hopped-0.065
s-0.064
Token.
Token ID13
Feature activation+6.55e-02

Pos logit contributions
 yours+0.063
 purpose+0.061
 pleasure+0.059
 our+0.057
 electricity+0.056
Neg logit contributions
s-0.049
ppy-0.048
y-0.046
i-0.045
 felt-0.045
Token We
Token ID775
Feature activation+1.64e-02

Pos logit contributions
 purpose+0.146
 yours+0.144
 sight+0.124
 talent+0.123
 outer+0.121
Neg logit contributions
's-0.108
s-0.100
es-0.094
ppy-0.092
�-0.087
Token have
Token ID423
Feature activation+9.13e-02

Pos logit contributions
 yours+0.035
 our+0.031
 purpose+0.030
 your+0.029
 pleasure+0.028
Neg logit contributions
i-0.031
ppy-0.031
oped-0.030
 felt-0.030
owed-0.030
Token.
Token ID13
Feature activation+4.40e-02

Pos logit contributions
 yours+0.181
 pleasure+0.177
 purpose+0.167
 electricity+0.164
 ours+0.162
Neg logit contributions
's-0.090
s-0.088
es-0.087
ches-0.086
 felt-0.084
Token Why
Token ID4162
Feature activation+7.63e-02

Pos logit contributions
 yours+0.080
 purpose+0.077
 our+0.069
 electricity+0.064
 ours+0.062
Neg logit contributions
ppy-0.092
 hopped-0.089
 crawled-0.088
 climbed-0.087
i-0.087
Token do
Token ID466
Feature activation+8.83e-02

Pos logit contributions
 yours+0.171
 purpose+0.155
 our+0.154
 pleasure+0.144
 your+0.140
Neg logit contributions
ppy-0.133
i-0.133
 felt-0.132
s-0.128
owed-0.127
Token we
Token ID356
Feature activation+3.57e-02

Pos logit contributions
 yours+0.154
 pleasure+0.146
 purpose+0.129
 fault+0.119
 electricity+0.118
Neg logit contributions
's-0.188
s-0.184
 felt-0.182
i-0.180
es-0.178
Token always
Token ID1464
Feature activation+6.25e-02

Pos logit contributions
 yours+0.075
 our+0.068
 purpose+0.067
 your+0.063
 pleasure+0.061
Neg logit contributions
i-0.067
ppy-0.067
 felt-0.065
 crawled-0.064
owed-0.064
Token have
Token ID423
Feature activation+1.06e-01

Pos logit contributions
 yours+0.143
 purpose+0.130
 our+0.128
 electricity+0.122
 pleasure+0.121
Neg logit contributions
 felt-0.102
ppy-0.099
 crawled-0.098
 forgot-0.098
 climbed-0.097
Token to
Token ID284
Feature activation+8.46e-02

Pos logit contributions
 yours+0.184
 purpose+0.180
arl+0.176
 pleasure+0.174
 outer+0.169
Neg logit contributions
's-0.198
 felt-0.197
 said-0.192
 started-0.189
 got-0.189
Token do
Token ID466
Feature activation+9.44e-02

Pos logit contributions
 yours+0.200
 pleasure+0.180
 electricity+0.178
 purpose+0.173
 ours+0.169
Neg logit contributions
 felt-0.118
 ran-0.117
 climbed-0.116
 crawled-0.114
 forgot-0.114
Token things
Token ID1243
Feature activation+4.63e-02

Pos logit contributions
 yours+0.147
 pleasure+0.143
 purpose+0.120
 gift+0.118
 ours+0.111
Neg logit contributions
's-0.210
 felt-0.209
s-0.202
 got-0.194
 forgot-0.193
Token your
Token ID534
Feature activation+3.15e-02

Pos logit contributions
 yours+0.098
 pleasure+0.093
 purpose+0.092
 our+0.086
 electricity+0.085
Neg logit contributions
s-0.081
 felt-0.080
ling-0.077
ie-0.077
ppy-0.076
Token way
Token ID835
Feature activation+7.08e-02

Pos logit contributions
 yours+0.062
 our+0.055
 purpose+0.053
 yourselves+0.050
 electricity+0.050
Neg logit contributions
 felt-0.057
i-0.056
 crawled-0.055
ppy-0.055
's-0.054
Token tower
Token ID10580
Feature activation-5.20e-02

Pos logit contributions
 yours+0.020
 purpose+0.017
 our+0.017
 pleasure+0.017
 your+0.016
Neg logit contributions
 felt-0.017
 crawled-0.016
 hopped-0.016
 ran-0.016
i-0.016
Token.
Token ID13
Feature activation+2.23e-02

Pos logit contributions
ppy+0.109
i+0.108
oped+0.106
 forgot+0.103
 felt+0.101
Neg logit contributions
 yours-0.100
 our-0.090
 purpose-0.087
 your-0.082
 my-0.076
Token "
Token ID366
Feature activation+3.08e-02

Pos logit contributions
 yours+0.062
 our+0.056
 purpose+0.056
 ours+0.053
 pleasure+0.051
Neg logit contributions
s-0.028
 felt-0.028
 hopped-0.027
 crawled-0.027
ppy-0.026
TokenNow
Token ID3844
Feature activation+4.68e-02

Pos logit contributions
 yours+0.070
 our+0.064
 purpose+0.064
 your+0.059
 pleasure+0.059
Neg logit contributions
ppy-0.053
 felt-0.051
 crawled-0.051
 climbed-0.051
oped-0.050
Token we
Token ID356
Feature activation+2.54e-02

Pos logit contributions
 yours+0.085
 purpose+0.082
 pleasure+0.074
 electricity+0.073
 our+0.070
Neg logit contributions
i-0.096
 felt-0.094
 ran-0.093
ppy-0.092
 climbed-0.092
Token need
Token ID761
Feature activation+1.06e-01

Pos logit contributions
 yours+0.056
 purpose+0.051
 our+0.051
 your+0.047
 pleasure+0.046
Neg logit contributions
ppy-0.046
i-0.045
 felt-0.045
 crawled-0.044
 climbed-0.043
Token more
Token ID517
Feature activation+6.91e-02

Pos logit contributions
 disl+0.226
arl+0.222
 purpose+0.219
 yours+0.216
 pleasure+0.212
Neg logit contributions
's-0.137
 started-0.137
 looked-0.134
 got-0.134
 said-0.130
Token towers
Token ID18028
Feature activation-1.94e-03

Pos logit contributions
 yours+0.121
 purpose+0.114
 our+0.101
 pleasure+0.101
 yourselves+0.093
Neg logit contributions
 crawled-0.147
 climbed-0.144
 felt-0.143
 ran-0.142
s-0.141
Token.
Token ID13
Feature activation+7.31e-02

Pos logit contributions
ppy+0.004
i+0.004
s+0.004
 felt+0.004
owed+0.004
Neg logit contributions
 yours-0.004
 purpose-0.004
 our-0.003
 pleasure-0.003
 your-0.003
Token And
Token ID843
Feature activation+5.22e-02

Pos logit contributions
 purpose+0.158
 yours+0.156
 our+0.131
 sight+0.127
 ours+0.126
Neg logit contributions
ppy-0.119
's-0.118
 climbed-0.115
es-0.110
bug-0.109
Token a
Token ID257
Feature activation+6.50e-02

Pos logit contributions
 purpose+0.109
 yours+0.108
 electricity+0.091
 pleasure+0.090
 our+0.089
Neg logit contributions
 climbed-0.090
 felt-0.090
 crawled-0.088
 ran-0.086
's-0.085
Token anyone
Token ID2687
Feature activation+1.47e-02

Pos logit contributions
 yours+0.164
 pleasure+0.158
 purpose+0.152
 ours+0.137
 our+0.136
Neg logit contributions
s-0.093
's-0.089
 felt-0.085
ches-0.082
es-0.081
Token.
Token ID13
Feature activation+6.79e-02

Pos logit contributions
 yours+0.032
 our+0.030
 purpose+0.029
 your+0.027
 pleasure+0.027
Neg logit contributions
ppy-0.026
i-0.026
 felt-0.025
 crawled-0.025
s-0.025
Token We
Token ID775
Feature activation+1.98e-02

Pos logit contributions
 purpose+0.143
 yours+0.141
 sight+0.116
 gift+0.115
 ours+0.113
Neg logit contributions
's-0.118
ppy-0.108
es-0.103
 cried-0.102
�-0.100
Token do
Token ID466
Feature activation+7.92e-02

Pos logit contributions
 yours+0.042
 our+0.038
 purpose+0.038
 your+0.035
 pleasure+0.035
Neg logit contributions
ppy-0.037
i-0.036
 felt-0.036
 crawled-0.035
s-0.035
Token not
Token ID407
Feature activation+5.02e-02

Pos logit contributions
 yours+0.188
 pleasure+0.183
 purpose+0.171
 ours+0.158
 electricity+0.156
Neg logit contributions
 felt-0.110
's-0.109
s-0.107
 forgot-0.105
 got-0.104
Token need
Token ID761
Feature activation+1.06e-01

Pos logit contributions
 yours+0.112
 our+0.102
 purpose+0.100
 pleasure+0.096
 your+0.093
Neg logit contributions
ppy-0.088
i-0.086
 felt-0.086
 crawled-0.084
 climbed-0.083
Token school
Token ID1524
Feature activation+4.10e-02

Pos logit contributions
 pleasure+0.198
 purpose+0.179
 yours+0.178
 rightful+0.176
 disl+0.172
Neg logit contributions
s-0.172
's-0.171
 ran-0.169
 got-0.168
 started-0.168
Token."
Token ID526
Feature activation-1.34e-02

Pos logit contributions
 yours+0.095
 purpose+0.089
 pleasure+0.087
 our+0.085
 electricity+0.080
Neg logit contributions
s-0.066
 felt-0.063
ppy-0.063
es-0.062
's-0.061
Token
Token ID198
Feature activation-2.02e-04

Pos logit contributions
 felt+0.019
s+0.019
i+0.019
ppy+0.019
 crawled+0.018
Neg logit contributions
 yours-0.035
 our-0.031
 purpose-0.031
 pleasure-0.029
 ours-0.029
Token
Token ID198
Feature activation-4.11e-03

Pos logit contributions
s+0.000
 felt+0.000
ppy+0.000
i+0.000
 hopped+0.000
Neg logit contributions
 yours-0.001
 purpose-0.000
 our-0.000
 pleasure-0.000
 ours-0.000
TokenTheir
Token ID14574
Feature activation-2.19e-02

Pos logit contributions
ppy+0.007
i+0.007
 felt+0.007
s+0.007
 crawled+0.007
Neg logit contributions
 yours-0.010
 our-0.009
 purpose-0.009
 your-0.008
 pleasure-0.008
Token never
Token ID1239
Feature activation+3.21e-02

Pos logit contributions
 yours+0.052
 our+0.048
 purpose+0.045
 your+0.044
 pleasure+0.042
Neg logit contributions
i-0.051
ppy-0.049
oped-0.049
ling-0.049
owed-0.048
Token know
Token ID760
Feature activation+7.12e-02

Pos logit contributions
 yours+0.077
 our+0.071
 purpose+0.069
 your+0.066
 pleasure+0.064
Neg logit contributions
i-0.050
ppy-0.050
oped-0.049
owed-0.048
s-0.048
Token when
Token ID618
Feature activation+7.00e-02

Pos logit contributions
 purpose+0.146
 pleasure+0.132
 yours+0.131
 common+0.126
 electricity+0.122
Neg logit contributions
i-0.127
s-0.122
ie-0.121
es-0.117
 felt-0.117
Token you
Token ID345
Feature activation+2.05e-02

Pos logit contributions
 yours+0.110
 purpose+0.109
 pleasure+0.107
 our+0.092
 ours+0.091
Neg logit contributions
i-0.149
 felt-0.147
s-0.144
 crawled-0.144
 ran-0.143
Token might
Token ID1244
Feature activation+6.79e-02

Pos logit contributions
 yours+0.045
 our+0.041
 purpose+0.039
 your+0.038
 pleasure+0.036
Neg logit contributions
i-0.035
ling-0.035
oped-0.034
ppy-0.034
 crawled-0.033
Token need
Token ID761
Feature activation+1.06e-01

Pos logit contributions
 yours+0.159
 our+0.146
 purpose+0.140
 pleasure+0.139
 ours+0.134
Neg logit contributions
 felt-0.112
i-0.108
ppy-0.108
 crawled-0.106
 got-0.104
Token it
Token ID340
Feature activation+4.22e-02

Pos logit contributions
 pleasure+0.208
 purpose+0.198
 ours+0.192
 yours+0.192
 disl+0.187
Neg logit contributions
 got-0.172
 started-0.167
s-0.167
 said-0.166
's-0.163
Token."
Token ID526
Feature activation-1.14e-02

Pos logit contributions
 yours+0.084
 purpose+0.081
 our+0.078
 pleasure+0.077
 electricity+0.073
Neg logit contributions
ppy-0.079
 felt-0.077
s-0.076
i-0.076
owed-0.075
Token<|endoftext|>
Token ID50256
Feature activation-1.72e-02

Pos logit contributions
i+0.022
ppy+0.022
owed+0.021
ling+0.021
oped+0.021
Neg logit contributions
 yours-0.024
 our-0.022
 purpose-0.021
 your-0.021
 pleasure-0.020
TokenBen
Token ID11696
Feature activation-1.23e-01

Pos logit contributions
i+0.032
ppy+0.032
owed+0.031
 felt+0.031
oped+0.030
Neg logit contributions
 yours-0.036
 our-0.033
 purpose-0.032
 your-0.031
 pleasure-0.029
Token and
Token ID290
Feature activation-2.89e-02

Pos logit contributions
cases+0.184
hetically+0.168
oped+0.167
i+0.163
ling+0.162
Neg logit contributions
 yours-0.304
 our-0.295
 your-0.289
 my-0.275
 purpose-0.258
Token will
Token ID481
Feature activation+6.58e-02

Pos logit contributions
 yours+0.038
 our+0.035
 purpose+0.034
 your+0.032
 pleasure+0.031
Neg logit contributions
i-0.037
ppy-0.036
 felt-0.035
owed-0.035
oped-0.035
Token help
Token ID1037
Feature activation+5.50e-02

Pos logit contributions
 yours+0.156
 our+0.143
 purpose+0.142
 pleasure+0.140
 ours+0.136
Neg logit contributions
 felt-0.097
i-0.096
ppy-0.094
 got-0.093
 crawled-0.092
Token you
Token ID345
Feature activation+4.02e-02

Pos logit contributions
 yours+0.079
 purpose+0.070
 pleasure+0.068
 our+0.067
 electricity+0.058
Neg logit contributions
i-0.137
ppy-0.137
ie-0.134
s-0.132
 felt-0.131
Token if
Token ID611
Feature activation+6.94e-02

Pos logit contributions
 yours+0.094
 purpose+0.086
 our+0.085
 pleasure+0.082
 electricity+0.078
Neg logit contributions
ppy-0.064
i-0.063
 felt-0.062
s-0.062
owed-0.060
Token you
Token ID345
Feature activation+2.72e-02

Pos logit contributions
 yours+0.112
 purpose+0.112
 pleasure+0.105
 our+0.095
 ours+0.090
Neg logit contributions
i-0.151
 felt-0.151
s-0.148
 ran-0.147
 skipped-0.146
Token need
Token ID761
Feature activation+1.05e-01

Pos logit contributions
 yours+0.059
 our+0.054
 purpose+0.052
 your+0.049
 pleasure+0.048
Neg logit contributions
i-0.049
ppy-0.048
oped-0.047
 felt-0.047
 crawled-0.047
Token.
Token ID13
Feature activation+6.38e-02

Pos logit contributions
 pleasure+0.199
 purpose+0.192
 yours+0.186
 ours+0.182
 rightful+0.179
Neg logit contributions
 got-0.181
 started-0.176
 looked-0.170
s-0.169
 reached-0.167
Token We
Token ID775
Feature activation+1.08e-02

Pos logit contributions
 purpose+0.120
 yours+0.120
 our+0.108
 ours+0.100
 common+0.096
Neg logit contributions
ppy-0.129
 climbed-0.117
i-0.117
 hopped-0.117
 crawled-0.117
Token can
Token ID460
Feature activation+7.25e-02

Pos logit contributions
 yours+0.022
 our+0.020
 purpose+0.019
 your+0.018
 pleasure+0.017
Neg logit contributions
i-0.021
ppy-0.021
 felt-0.021
owed-0.021
oped-0.020
Token make
Token ID787
Feature activation+8.01e-02

Pos logit contributions
 yours+0.159
 our+0.144
 purpose+0.142
 your+0.132
 pleasure+0.131
Neg logit contributions
i-0.130
ppy-0.129
 felt-0.127
 crawled-0.124
s-0.124
Token something
Token ID1223
Feature activation+4.98e-02

Pos logit contributions
 yours+0.129
 purpose+0.118
 pleasure+0.115
 common+0.101
 ours+0.098
Neg logit contributions
's-0.178
i-0.177
s-0.175
ie-0.172
 felt-0.170
Token,
Token ID11
Feature activation+4.34e-02

Pos logit contributions
 yours+0.069
 purpose+0.063
 our+0.062
 pleasure+0.059
 your+0.058
Neg logit contributions
 felt-0.054
s-0.053
ppy-0.053
y-0.052
owed-0.052
Token Sam
Token ID3409
Feature activation-5.57e-02

Pos logit contributions
 yours+0.095
 purpose+0.094
 our+0.083
 pleasure+0.081
 electricity+0.080
Neg logit contributions
 felt-0.070
ppy-0.070
i-0.068
 crawled-0.067
 climbed-0.067
Token.
Token ID13
Feature activation+5.59e-02

Pos logit contributions
oped+0.123
owed+0.117
cases+0.116
 skipped+0.115
 crawled+0.115
Neg logit contributions
 yours-0.104
 our-0.096
 your-0.089
 purpose-0.087
 my-0.083
Token Maybe
Token ID6674
Feature activation+7.08e-02

Pos logit contributions
 yours+0.140
 purpose+0.134
 our+0.119
 outer+0.116
 ours+0.115
Neg logit contributions
's-0.079
ppy-0.078
s-0.076
 felt-0.073
o-0.071
Token we
Token ID356
Feature activation+9.97e-03

Pos logit contributions
 purpose+0.154
 yours+0.150
 pleasure+0.144
ags+0.130
 electricity+0.129
Neg logit contributions
's-0.114
 ran-0.114
 felt-0.113
i-0.112
 climbed-0.112
Token need
Token ID761
Feature activation+1.05e-01

Pos logit contributions
 yours+0.021
 our+0.019
 purpose+0.019
 your+0.018
 pleasure+0.017
Neg logit contributions
ppy-0.018
i-0.018
 felt-0.018
 crawled-0.018
oped-0.018
Token to
Token ID284
Feature activation+7.87e-02

Pos logit contributions
 yours+0.202
 pleasure+0.196
 purpose+0.189
 diversity+0.185
 ours+0.183
Neg logit contributions
 said-0.161
 got-0.160
's-0.160
 started-0.156
 walked-0.156
Token play
Token ID711
Feature activation+7.42e-02

Pos logit contributions
 yours+0.185
 purpose+0.160
 pleasure+0.158
 our+0.156
 ours+0.153
Neg logit contributions
 felt-0.124
 crawled-0.120
 climbed-0.120
i-0.120
 ran-0.120
Token something
Token ID1223
Feature activation+4.63e-02

Pos logit contributions
 yours+0.175
 purpose+0.164
 pleasure+0.163
 electricity+0.151
 ours+0.148
Neg logit contributions
 felt-0.109
's-0.107
es-0.103
s-0.103
 hopped-0.101
Token else
Token ID2073
Feature activation+4.27e-02

Pos logit contributions
 yours+0.141
 purpose+0.138
 pleasure+0.134
 our+0.128
 electricity+0.127
Neg logit contributions
s-0.040
 felt-0.034
ling-0.034
ie-0.033
's-0.032
Token,"
Token ID553
Feature activation-2.48e-02

Pos logit contributions
 yours+0.096
 purpose+0.091
 our+0.087
 pleasure+0.085
 electricity+0.081
Neg logit contributions
s-0.071
ppy-0.069
i-0.069
 felt-0.069
y-0.067
Token.
Token ID13
Feature activation+6.45e-02

Pos logit contributions
 purpose+0.142
 yours+0.134
 pleasure+0.125
 common+0.125
 gift+0.115
Neg logit contributions
i-0.119
ie-0.116
's-0.113
s-0.112
 felt-0.110
Token But
Token ID887
Feature activation+4.51e-02

Pos logit contributions
 yours+0.155
 purpose+0.144
 ours+0.128
 our+0.125
 your+0.122
Neg logit contributions
's-0.103
s-0.094
 felt-0.090
es-0.090
ppy-0.090
Token we
Token ID356
Feature activation+5.39e-03

Pos logit contributions
 purpose+0.097
 yours+0.091
 common+0.078
 yourselves+0.078
 electricity+0.077
Neg logit contributions
 felt-0.080
i-0.078
 looked-0.076
 climbed-0.074
's-0.074
Token don
Token ID836
Feature activation+7.24e-02

Pos logit contributions
 yours+0.011
 our+0.010
 purpose+0.009
 your+0.009
 pleasure+0.009
Neg logit contributions
i-0.010
ppy-0.010
oped-0.010
owed-0.010
ling-0.010
Token't
Token ID470
Feature activation+4.46e-02

Pos logit contributions
 yours+0.148
 our+0.133
 purpose+0.131
 your+0.121
 pleasure+0.121
Neg logit contributions
i-0.140
ppy-0.139
 felt-0.137
s-0.136
oped-0.135
Token need
Token ID761
Feature activation+1.05e-01

Pos logit contributions
 yours+0.101
 our+0.092
 purpose+0.091
 your+0.085
 pleasure+0.084
Neg logit contributions
i-0.077
ppy-0.076
 felt-0.075
 crawled-0.073
 hopped-0.072
Token to
Token ID284
Feature activation+8.51e-02

Pos logit contributions
 yours+0.208
 pleasure+0.197
 rightful+0.192
arl+0.187
 ours+0.185
Neg logit contributions
's-0.168
 said-0.163
 looked-0.163
 screamed-0.163
 shouted-0.161
Token know
Token ID760
Feature activation+7.76e-02

Pos logit contributions
 yours+0.202
 our+0.177
 pleasure+0.174
 purpose+0.174
 ours+0.169
Neg logit contributions
i-0.137
 felt-0.137
ppy-0.130
 crawled-0.129
 forgot-0.127
Token.
Token ID13
Feature activation+6.46e-02

Pos logit contributions
 purpose+0.162
 yours+0.153
 pleasure+0.148
 common+0.143
 talent+0.130
Neg logit contributions
's-0.131
i-0.126
s-0.124
ie-0.124
es-0.123
Token We
Token ID775
Feature activation+5.05e-03

Pos logit contributions
 yours+0.152
 purpose+0.142
 ours+0.120
 our+0.118
 gift+0.118
Neg logit contributions
's-0.107
s-0.098
ppy-0.097
es-0.095
 felt-0.094
Token have
Token ID423
Feature activation+8.04e-02

Pos logit contributions
 yours+0.010
 our+0.009
 purpose+0.009
 your+0.008
 pleasure+0.008
Neg logit contributions
i-0.010
ppy-0.010
oped-0.010
owed-0.010
ling-0.010
Token ask
Token ID1265
Feature activation+4.45e-02

Pos logit contributions
 yours+0.132
 purpose+0.117
 our+0.117
 pleasure+0.114
 ours+0.108
Neg logit contributions
 felt-0.100
i-0.100
 crawled-0.097
 hopped-0.096
ppy-0.096
Token for
Token ID329
Feature activation+6.89e-02

Pos logit contributions
 yours+0.084
 purpose+0.078
 our+0.074
 pleasure+0.072
 ours+0.068
Neg logit contributions
i-0.091
ppy-0.089
ie-0.085
 felt-0.085
ches-0.083
Token help
Token ID1037
Feature activation+4.53e-02

Pos logit contributions
 yours+0.115
 purpose+0.109
 pleasure+0.104
 fault+0.099
 ours+0.098
Neg logit contributions
 ran-0.139
 crawled-0.138
 walked-0.137
 hopped-0.135
 felt-0.132
Token when
Token ID618
Feature activation+5.01e-02

Pos logit contributions
 yours+0.104
 our+0.095
 purpose+0.094
 pleasure+0.089
 your+0.087
Neg logit contributions
i-0.076
ppy-0.075
 felt-0.073
 crawled-0.072
ie-0.071
Token we
Token ID356
Feature activation+2.78e-02

Pos logit contributions
 yours+0.097
 purpose+0.087
 our+0.083
 pleasure+0.082
 ours+0.077
Neg logit contributions
 felt-0.100
i-0.097
 hopped-0.097
 crawled-0.096
 ran-0.095
Token need
Token ID761
Feature activation+1.05e-01

Pos logit contributions
 yours+0.055
 our+0.051
 purpose+0.048
 your+0.046
 pleasure+0.045
Neg logit contributions
i-0.054
ppy-0.054
oped-0.053
owed-0.052
s-0.052
Token it
Token ID340
Feature activation+4.98e-02

Pos logit contributions
 yours+0.196
 pleasure+0.186
 purpose+0.180
 diversity+0.179
 awesome+0.173
Neg logit contributions
 got-0.186
 started-0.184
's-0.182
 ran-0.176
s-0.176
Token,
Token ID11
Feature activation+3.67e-02

Pos logit contributions
 yours+0.102
 purpose+0.095
 our+0.092
 pleasure+0.089
 yourselves+0.084
Neg logit contributions
ppy-0.091
 felt-0.090
s-0.089
i-0.089
owed-0.088
Token and
Token ID290
Feature activation+3.22e-02

Pos logit contributions
 yours+0.086
 purpose+0.081
 our+0.077
 pleasure+0.072
 your+0.071
Neg logit contributions
 felt-0.056
ppy-0.055
i-0.055
 crawled-0.054
 hopped-0.053
Token we
Token ID356
Feature activation+8.05e-03

Pos logit contributions
 yours+0.068
 purpose+0.064
 our+0.060
 pleasure+0.057
 yourselves+0.055
Neg logit contributions
 felt-0.058
i-0.056
 crawled-0.056
 hopped-0.055
 climbed-0.055
Token should
Token ID815
Feature activation+6.38e-02

Pos logit contributions
 yours+0.016
 our+0.015
 purpose+0.014
 your+0.013
 pleasure+0.013
Neg logit contributions
i-0.016
ppy-0.016
oped-0.015
owed-0.015
 crawled-0.015
Token ask
Token ID1265
Feature activation+3.62e-02

Pos logit contributions
 yours+0.007
 our+0.007
 purpose+0.007
 your+0.006
 pleasure+0.006
Neg logit contributions
i-0.004
ppy-0.004
 felt-0.004
 crawled-0.004
owed-0.004
Token for
Token ID329
Feature activation+7.01e-02

Pos logit contributions
 yours+0.073
 purpose+0.067
 our+0.065
 pleasure+0.063
 electricity+0.060
Neg logit contributions
i-0.071
ppy-0.068
ie-0.066
 felt-0.065
owed-0.064
Token help
Token ID1037
Feature activation+4.27e-02

Pos logit contributions
 yours+0.119
 purpose+0.114
 pleasure+0.111
 fault+0.108
 common+0.102
Neg logit contributions
 ran-0.129
 walked-0.127
 said-0.127
 crawled-0.124
i-0.123
Token when
Token ID618
Feature activation+5.22e-02

Pos logit contributions
 yours+0.103
 our+0.095
 purpose+0.094
 pleasure+0.088
 your+0.088
Neg logit contributions
i-0.066
ppy-0.066
 felt-0.064
s-0.063
owed-0.063
Token we
Token ID356
Feature activation+2.94e-02

Pos logit contributions
 yours+0.104
 purpose+0.095
 our+0.090
 pleasure+0.088
 ours+0.083
Neg logit contributions
 felt-0.100
i-0.099
 hopped-0.096
 crawled-0.096
 ran-0.096
Token need
Token ID761
Feature activation+1.05e-01

Pos logit contributions
 yours+0.056
 our+0.051
 purpose+0.049
 your+0.046
 pleasure+0.045
Neg logit contributions
i-0.060
ppy-0.059
oped-0.058
owed-0.058
s-0.057
Token it
Token ID340
Feature activation+4.74e-02

Pos logit contributions
 yours+0.203
 pleasure+0.194
 diversity+0.189
 purpose+0.185
 awesome+0.184
Neg logit contributions
 got-0.180
 started-0.177
's-0.174
s-0.169
 felt-0.165
Token.
Token ID13
Feature activation+2.34e-02

Pos logit contributions
 yours+0.099
 purpose+0.093
 our+0.090
 pleasure+0.087
 yourselves+0.083
Neg logit contributions
ppy-0.083
 felt-0.082
i-0.082
s-0.082
owed-0.080
Token<|endoftext|>
Token ID50256
Feature activation-4.01e-03

Pos logit contributions
 yours+0.051
 purpose+0.047
 our+0.045
 electricity+0.041
 your+0.041
Neg logit contributions
ppy-0.041
s-0.040
i-0.040
 felt-0.039
oped-0.039
TokenOnce
Token ID7454
Feature activation+5.14e-03

Pos logit contributions
ppy+0.006
i+0.006
 felt+0.006
s+0.006
oped+0.006
Neg logit contributions
 yours-0.010
 our-0.009
 purpose-0.009
 your-0.008
 pleasure-0.008
Token upon
Token ID2402
Feature activation+1.96e-02

Pos logit contributions
 yours+0.014
 our+0.013
 purpose+0.012
 your+0.012
 pleasure+0.011
Neg logit contributions
ppy-0.007
i-0.007
 crawled-0.007
oped-0.007
owed-0.007
Token,
Token ID11
Feature activation-1.45e-02

Pos logit contributions
i+0.015
 felt+0.015
ppy+0.015
ie+0.015
owed+0.015
Neg logit contributions
 yours-0.036
 purpose-0.033
 our-0.032
 pleasure-0.032
 ours-0.031
Token "
Token ID366
Feature activation+2.36e-02

Pos logit contributions
 felt+0.024
ppy+0.024
s+0.023
i+0.023
 crawled+0.023
Neg logit contributions
 yours-0.034
 purpose-0.031
 our-0.030
 pleasure-0.028
 electricity-0.028
TokenWhy
Token ID5195
Feature activation+7.72e-02

Pos logit contributions
 yours+0.048
 our+0.043
 purpose+0.042
 pleasure+0.039
 your+0.039
Neg logit contributions
ppy-0.047
i-0.047
 felt-0.046
 crawled-0.045
oped-0.045
Token do
Token ID466
Feature activation+9.13e-02

Pos logit contributions
 yours+0.179
 purpose+0.166
 our+0.162
 pleasure+0.155
 electricity+0.148
Neg logit contributions
 felt-0.126
s-0.123
ppy-0.122
i-0.120
owed-0.120
Token we
Token ID356
Feature activation+3.91e-02

Pos logit contributions
 yours+0.162
 pleasure+0.147
 purpose+0.140
 our+0.124
 fault+0.123
Neg logit contributions
s-0.195
's-0.189
 felt-0.189
es-0.186
 got-0.185
Token need
Token ID761
Feature activation+1.05e-01

Pos logit contributions
 yours+0.083
 our+0.076
 purpose+0.075
 your+0.069
 pleasure+0.069
Neg logit contributions
i-0.072
ppy-0.071
 felt-0.070
 crawled-0.069
oped-0.068
Token to
Token ID284
Feature activation+7.91e-02

Pos logit contributions
 pleasure+0.198
 yours+0.182
 purpose+0.181
 disl+0.180
 diversity+0.178
Neg logit contributions
 got-0.191
 started-0.181
's-0.178
 sat-0.172
 walked-0.171
Token sell
Token ID3677
Feature activation+8.70e-02

Pos logit contributions
 yours+0.189
 purpose+0.167
 pleasure+0.165
 electricity+0.162
 our+0.161
Neg logit contributions
 got-0.115
 felt-0.114
 climbed-0.114
 ran-0.111
i-0.110
Token my
Token ID616
Feature activation+3.68e-02

Pos logit contributions
 pleasure+0.163
 yours+0.161
 purpose+0.158
 disl+0.149
 fault+0.143
Neg logit contributions
s-0.164
's-0.158
 said-0.147
 felt-0.146
 got-0.146
Token toys
Token ID14958
Feature activation+4.12e-02

Pos logit contributions
 yours+0.086
 purpose+0.074
 our+0.074
 yourselves+0.072
 pleasure+0.070
Neg logit contributions
s-0.055
 felt-0.055
's-0.055
cases-0.054
 crawled-0.053
Token?"
Token ID1701
Feature activation-2.12e-02

Pos logit contributions
 yours+0.089
 purpose+0.084
 pleasure+0.080
 electricity+0.079
 our+0.078
Neg logit contributions
s-0.071
 felt-0.070
y-0.068
 got-0.067
ppy-0.067
Token mean
Token ID1612
Feature activation+3.91e-02

Pos logit contributions
 yours+0.106
 purpose+0.101
 pleasure+0.093
 our+0.092
 yourselves+0.085
Neg logit contributions
ppy-0.106
 felt-0.105
 climbed-0.105
 crawled-0.103
's-0.103
Token.
Token ID13
Feature activation+5.37e-02

Pos logit contributions
 yours+0.094
 purpose+0.086
 our+0.085
 pleasure+0.085
 ours+0.078
Neg logit contributions
s-0.062
y-0.061
i-0.060
ppy-0.060
 felt-0.058
Token He
Token ID679
Feature activation-7.48e-03

Pos logit contributions
 yours+0.100
 purpose+0.091
 our+0.085
 pleasure+0.083
 ours+0.079
Neg logit contributions
ppy-0.114
s-0.109
i-0.109
 felt-0.108
's-0.107
Token does
Token ID857
Feature activation+7.70e-02

Pos logit contributions
ppy+0.014
owed+0.014
oped+0.014
i+0.014
cases+0.014
Neg logit contributions
 yours-0.015
 our-0.014
 your-0.013
 my-0.012
 purpose-0.012
Token not
Token ID407
Feature activation+4.28e-02

Pos logit contributions
 pleasure+0.213
 yours+0.199
 purpose+0.180
 electricity+0.176
 outer+0.174
Neg logit contributions
 climbed-0.084
 felt-0.082
 got-0.081
's-0.079
 ran-0.079
Token need
Token ID761
Feature activation+1.05e-01

Pos logit contributions
 yours+0.092
 purpose+0.087
 our+0.085
 pleasure+0.085
 yourselves+0.078
Neg logit contributions
ppy-0.075
 felt-0.072
 climbed-0.071
 crawled-0.071
i-0.070
Token the
Token ID262
Feature activation+4.80e-02

Pos logit contributions
 pleasure+0.212
 yours+0.192
 disl+0.189
 diversity+0.185
 outer+0.184
Neg logit contributions
's-0.166
 got-0.162
 said-0.160
 shouted-0.159
 ran-0.158
Token car
Token ID1097
Feature activation+5.30e-03

Pos logit contributions
 yours+0.099
 our+0.091
 purpose+0.085
 yourselves+0.082
 your+0.081
Neg logit contributions
 felt-0.085
 crawled-0.081
 hopped-0.081
 climbed-0.079
i-0.077
Token.
Token ID13
Feature activation+4.81e-02

Pos logit contributions
 yours+0.012
 purpose+0.012
 our+0.011
 pleasure+0.011
 your+0.011
Neg logit contributions
s-0.008
ppy-0.008
 felt-0.008
y-0.008
i-0.008
Token I
Token ID314
Feature activation+1.38e-02

Pos logit contributions
 yours+0.123
 purpose+0.120
 our+0.110
 pleasure+0.110
 ours+0.105
Neg logit contributions
ppy-0.063
's-0.062
s-0.060
es-0.059
 felt-0.058
Token want
Token ID765
Feature activation+8.73e-02

Pos logit contributions
 yours+0.029
 our+0.026
 purpose+0.026
 your+0.024
 pleasure+0.023
Neg logit contributions
i-0.026
ppy-0.026
owed-0.025
oped-0.025
 felt-0.025
Token You
Token ID921
Feature activation+3.01e-02

Pos logit contributions
 yours+0.112
 purpose+0.108
 our+0.098
 ours+0.092
 electricity+0.089
Neg logit contributions
ppy-0.124
's-0.116
i-0.114
 climbed-0.114
 hopped-0.113
Token can
Token ID460
Feature activation+7.55e-02

Pos logit contributions
 yours+0.064
 our+0.059
 purpose+0.057
 your+0.054
 pleasure+0.053
Neg logit contributions
i-0.056
ppy-0.055
oped-0.054
owed-0.054
 felt-0.053
Token use
Token ID779
Feature activation+8.60e-02

Pos logit contributions
 yours+0.171
 our+0.155
 purpose+0.152
 pleasure+0.143
 your+0.141
Neg logit contributions
i-0.131
ppy-0.130
 felt-0.128
 crawled-0.125
s-0.124
Token them
Token ID606
Feature activation+2.01e-02

Pos logit contributions
 inconvenience+0.096
 pleasure+0.094
 yours+0.091
 purpose+0.083
 Goodbye+0.078
Neg logit contributions
's-0.200
s-0.197
i-0.194
ches-0.190
 and-0.190
Token to
Token ID284
Feature activation+7.80e-02

Pos logit contributions
 yours+0.047
 purpose+0.045
 pleasure+0.043
 our+0.042
 electricity+0.040
Neg logit contributions
ppy-0.031
s-0.031
i-0.030
 felt-0.029
y-0.029
Token have
Token ID423
Feature activation+1.05e-01

Pos logit contributions
 yours+0.182
 our+0.159
 pleasure+0.157
 purpose+0.155
 ours+0.149
Neg logit contributions
 felt-0.123
ppy-0.120
i-0.120
 crawled-0.117
 climbed-0.112
Token fun
Token ID1257
Feature activation+5.32e-02

Pos logit contributions
 yours+0.172
 pleasure+0.153
 purpose+0.148
 yourselves+0.145
 ours+0.142
Neg logit contributions
 felt-0.213
 said-0.208
 looked-0.204
's-0.202
 got-0.200
Token and
Token ID290
Feature activation+6.22e-02

Pos logit contributions
 yours+0.166
 purpose+0.155
 pleasure+0.154
 our+0.151
 ours+0.147
Neg logit contributions
y-0.042
s-0.042
 felt-0.040
ppy-0.039
ie-0.039
Token to
Token ID284
Feature activation+7.89e-02

Pos logit contributions
 yours+0.124
 purpose+0.115
 pleasure+0.106
 our+0.105
 electricity+0.101
Neg logit contributions
 felt-0.116
 crawled-0.112
 forgot-0.112
 ran-0.111
 climbed-0.110
Token learn
Token ID2193
Feature activation+9.10e-02

Pos logit contributions
 yours+0.184
 ours+0.163
 pleasure+0.162
 our+0.157
 purpose+0.157
Neg logit contributions
 felt-0.119
i-0.116
 crawled-0.112
ppy-0.112
ie-0.110
Token while
Token ID981
Feature activation+6.46e-02

Pos logit contributions
 pleasure+0.206
 yours+0.203
 purpose+0.192
 inconvenience+0.183
 electricity+0.178
Neg logit contributions
s-0.150
's-0.131
 and-0.122
 felt-0.118
es-0.116
Token milk
Token ID7545
Feature activation+3.01e-02

Pos logit contributions
 yours+0.120
 purpose+0.110
 pleasure+0.105
 our+0.099
 yourselves+0.093
Neg logit contributions
 felt-0.118
 crawled-0.116
 climbed-0.116
 got-0.114
 ran-0.113
Token?
Token ID30
Feature activation+4.79e-02

Pos logit contributions
 yours+0.077
 purpose+0.072
 our+0.070
 pleasure+0.069
 electricity+0.066
Neg logit contributions
s-0.041
ppy-0.041
i-0.041
 felt-0.040
y-0.039
Token I
Token ID314
Feature activation+1.84e-02

Pos logit contributions
 yours+0.096
 purpose+0.092
 our+0.083
 ours+0.077
 your+0.076
Neg logit contributions
ppy-0.093
s-0.089
's-0.088
oped-0.087
es-0.086
Token think
Token ID892
Feature activation+7.85e-02

Pos logit contributions
 yours+0.037
 our+0.034
 purpose+0.033
 your+0.031
 pleasure+0.030
Neg logit contributions
i-0.036
ppy-0.036
 felt-0.035
owed-0.035
 crawled-0.035
Token you
Token ID345
Feature activation+6.79e-03

Pos logit contributions
 yours+0.165
 purpose+0.162
 pleasure+0.144
 our+0.143
 common+0.137
Neg logit contributions
s-0.134
 felt-0.132
i-0.132
ppy-0.129
's-0.129
Token deserve
Token ID10925
Feature activation+1.05e-01

Pos logit contributions
 yours+0.014
 our+0.013
 purpose+0.012
 your+0.012
 pleasure+0.012
Neg logit contributions
i-0.013
ppy-0.012
oped-0.012
ling-0.012
 crawled-0.012
Token a
Token ID257
Feature activation+7.97e-02

Pos logit contributions
 purpose+0.188
 yours+0.187
 pleasure+0.182
 yourselves+0.179
 disl+0.173
Neg logit contributions
's-0.176
ches-0.171
 got-0.166
 felt-0.166
s-0.163
Token treat
Token ID2190
Feature activation+3.68e-02

Pos logit contributions
 yours+0.150
 yourselves+0.149
 our+0.131
 ourselves+0.126
 purpose+0.123
Neg logit contributions
s-0.163
 got-0.161
 looked-0.159
 felt-0.157
 hopped-0.156
Token."
Token ID526
Feature activation-1.07e-02

Pos logit contributions
 yours+0.087
 purpose+0.085
 our+0.080
 pleasure+0.080
 electricity+0.077
Neg logit contributions
s-0.055
ppy-0.054
i-0.054
y-0.053
ie-0.050
Token
Token ID198
Feature activation-4.43e-03

Pos logit contributions
ppy+0.017
i+0.017
 felt+0.017
s+0.016
 crawled+0.016
Neg logit contributions
 yours-0.026
 our-0.024
 purpose-0.023
 your-0.022
 pleasure-0.022
Token
Token ID198
Feature activation-9.67e-03

Pos logit contributions
ppy+0.007
 felt+0.007
s+0.007
i+0.007
 crawled+0.006
Neg logit contributions
 yours-0.011
 our-0.010
 purpose-0.010
 pleasure-0.009
 your-0.009
Token happy
Token ID3772
Feature activation+5.23e-02

Pos logit contributions
 purpose+0.118
 yours+0.115
 electricity+0.102
 pleasure+0.099
 our+0.097
Neg logit contributions
i-0.105
 felt-0.104
 climbed-0.104
ppy-0.102
 forgot-0.101
Token.
Token ID13
Feature activation+6.54e-02

Pos logit contributions
 yours+0.116
 purpose+0.113
 electricity+0.101
 pleasure+0.100
 ours+0.099
Neg logit contributions
 felt-0.085
s-0.084
 ran-0.083
ie-0.083
i-0.082
Token We
Token ID775
Feature activation+1.46e-02

Pos logit contributions
 purpose+0.148
 yours+0.140
 electricity+0.118
 ours+0.116
 yourselves+0.112
Neg logit contributions
's-0.110
s-0.097
ppy-0.096
 walked-0.094
o-0.093
Token don
Token ID836
Feature activation+7.12e-02

Pos logit contributions
 yours+0.028
 our+0.025
 purpose+0.024
 your+0.023
 pleasure+0.022
Neg logit contributions
i-0.030
ppy-0.030
 felt-0.029
owed-0.029
oped-0.029
Token't
Token ID470
Feature activation+4.17e-02

Pos logit contributions
 yours+0.151
 our+0.135
 purpose+0.134
 pleasure+0.127
 your+0.124
Neg logit contributions
i-0.130
s-0.130
ppy-0.130
 felt-0.129
oped-0.127
Token need
Token ID761
Feature activation+1.05e-01

Pos logit contributions
 yours+0.089
 our+0.082
 purpose+0.080
 your+0.074
 pleasure+0.074
Neg logit contributions
ppy-0.076
i-0.076
 felt-0.074
 crawled-0.073
 hopped-0.072
Token a
Token ID257
Feature activation+8.49e-02

Pos logit contributions
 yours+0.197
 pleasure+0.191
 purpose+0.190
 disl+0.189
 diversity+0.180
Neg logit contributions
 ran-0.167
 walked-0.163
 sat-0.163
 crawled-0.161
's-0.160
Token big
Token ID1263
Feature activation+5.63e-02

Pos logit contributions
 yours+0.191
 our+0.170
 yourselves+0.170
 purpose+0.168
 electricity+0.165
Neg logit contributions
s-0.139
 skipped-0.137
 felt-0.136
 hopped-0.135
 crawled-0.134
Token house
Token ID2156
Feature activation+3.05e-02

Pos logit contributions
 yours+0.140
 our+0.128
 purpose+0.125
 yourselves+0.118
 electricity+0.118
Neg logit contributions
i-0.082
 felt-0.081
 hopped-0.080
 crawled-0.080
 skipped-0.078
Token or
Token ID393
Feature activation+6.15e-02

Pos logit contributions
 yours+0.066
 purpose+0.062
 pleasure+0.059
 our+0.059
 electricity+0.055
Neg logit contributions
y-0.052
s-0.051
ppy-0.051
i-0.049
 felt-0.049
Token a
Token ID257
Feature activation+7.90e-02

Pos logit contributions
 yours+0.110
 purpose+0.108
 pleasure+0.099
 our+0.088
 electricity+0.085
Neg logit contributions
 felt-0.132
 climbed-0.129
 crawled-0.128
 got-0.126
 ran-0.125
Token as
Token ID355
Feature activation+4.60e-02

Pos logit contributions
i+0.023
ppy+0.022
s+0.022
 felt+0.022
owed+0.022
Neg logit contributions
 yours-0.031
 purpose-0.028
 our-0.028
 pleasure-0.026
 your-0.026
Token Hel
Token ID5053
Feature activation-4.08e-02

Pos logit contributions
 yours+0.088
 purpose+0.083
 our+0.078
 yourselves+0.072
 ours+0.072
Neg logit contributions
 hopped-0.091
 felt-0.090
i-0.089
 ran-0.088
 walked-0.088
Tokenio
Token ID952
Feature activation-1.27e-01

Pos logit contributions
 felt+0.039
 crawled+0.038
s+0.036
ppy+0.036
oped+0.036
Neg logit contributions
 yours-0.124
 our-0.114
 purpose-0.113
 your-0.109
 pleasure-0.105
Token.
Token ID13
Feature activation-8.22e-03

Pos logit contributions
ited+0.113
ling+0.111
oped+0.111
i+0.110
ppy+0.108
Neg logit contributions
 yours-0.369
 our-0.352
 purpose-0.330
 your-0.328
 pleasure-0.319
Token Hel
Token ID5053
Feature activation-6.46e-02

Pos logit contributions
i+0.012
ppy+0.012
 felt+0.012
s+0.012
oped+0.012
Neg logit contributions
 yours-0.021
 our-0.019
 purpose-0.019
 your-0.017
 pleasure-0.017
Tokenio
Token ID952
Feature activation-1.73e-01

Pos logit contributions
 crawled+0.082
 felt+0.082
ppy+0.079
 hopped+0.079
oped+0.078
Neg logit contributions
 yours-0.175
 our-0.160
 purpose-0.156
 your-0.150
 my-0.144
Token explained
Token ID4893
Feature activation+8.14e-03

Pos logit contributions
mun+0.169
ited+0.167
oped+0.163
ppy+0.159
ling+0.152
Neg logit contributions
 our-0.438
 yours-0.425
 your-0.403
 purpose-0.398
 my-0.398
Token that
Token ID326
Feature activation+3.07e-02

Pos logit contributions
 yours+0.017
 purpose+0.016
 our+0.015
 pleasure+0.015
 ours+0.014
Neg logit contributions
 felt-0.015
i-0.015
s-0.015
 hopped-0.015
 ran-0.015
Token their
Token ID511
Feature activation+8.14e-03

Pos logit contributions
 yours+0.075
 purpose+0.069
 our+0.068
 pleasure+0.064
 your+0.062
Neg logit contributions
 felt-0.047
i-0.047
 hopped-0.045
 crawled-0.045
ppy-0.045
Token oxygen
Token ID11863
Feature activation-3.22e-02

Pos logit contributions
 yours+0.016
 yourselves+0.013
 your+0.012
 our+0.012
 purpose+0.012
Neg logit contributions
 felt-0.017
 hopped-0.016
 ran-0.016
 walked-0.016
 started-0.016
Token was
Token ID373
Feature activation+4.11e-02

Pos logit contributions
ppy+0.032
i+0.031
 felt+0.030
oped+0.029
owed+0.029
Neg logit contributions
 yours-0.096
 our-0.090
 purpose-0.089
 your-0.085
 pleasure-0.084
Token from
Token ID422
Feature activation+2.36e-02

Pos logit contributions
i+0.037
ppy+0.037
 felt+0.036
 crawled+0.035
owed+0.035
Neg logit contributions
 yours-0.059
 our-0.054
 purpose-0.053
 your-0.050
 pleasure-0.050
Token the
Token ID262
Feature activation+2.31e-02

Pos logit contributions
 yours+0.046
 purpose+0.042
 our+0.040
 pleasure+0.040
 ours+0.037
Neg logit contributions
i-0.048
 felt-0.044
 walked-0.044
 hopped-0.043
 ran-0.043
Token tank
Token ID6873
Feature activation-3.57e-02

Pos logit contributions
 yours+0.046
 purpose+0.039
 our+0.038
 yourselves+0.037
 ours+0.036
Neg logit contributions
 felt-0.042
 started-0.042
's-0.042
ling-0.042
 hopped-0.041
Token.
Token ID13
Feature activation-6.46e-03

Pos logit contributions
ppy+0.060
i+0.058
oped+0.058
 crawled+0.056
owed+0.056
Neg logit contributions
 yours-0.082
 our-0.076
 purpose-0.074
 your-0.070
 pleasure-0.068
Token Hel
Token ID5053
Feature activation-6.37e-02

Pos logit contributions
ppy+0.010
i+0.010
 felt+0.010
owed+0.009
 crawled+0.009
Neg logit contributions
 yours-0.016
 our-0.015
 purpose-0.014
 your-0.014
 pleasure-0.013
Tokenio
Token ID952
Feature activation-1.73e-01

Pos logit contributions
 felt+0.080
 crawled+0.080
ppy+0.078
 hopped+0.078
oped+0.078
Neg logit contributions
 yours-0.174
 our-0.158
 purpose-0.152
 your-0.149
 yourselves-0.143
Token was
Token ID373
Feature activation+6.03e-03

Pos logit contributions
oped+0.174
ited+0.171
ppy+0.166
mun+0.163
ling+0.153
Neg logit contributions
 our-0.443
 yours-0.427
 purpose-0.409
 your-0.407
 my-0.402
Token thankful
Token ID25535
Feature activation+1.15e-02

Pos logit contributions
 yours+0.015
 our+0.013
 purpose+0.013
 your+0.012
 pleasure+0.012
Neg logit contributions
i-0.009
ppy-0.009
 felt-0.009
 crawled-0.009
owed-0.009
Token and
Token ID290
Feature activation-1.32e-02

Pos logit contributions
 yours+0.026
 our+0.024
 purpose+0.024
 your+0.022
 pleasure+0.022
Neg logit contributions
i-0.020
ppy-0.020
 felt-0.019
 crawled-0.019
owed-0.019
Token thanked
Token ID26280
Feature activation+6.63e-03

Pos logit contributions
ppy+0.017
i+0.015
oped+0.015
ches+0.015
y+0.014
Neg logit contributions
 yours-0.037
 our-0.035
 your-0.034
 purpose-0.033
 pleasure-0.033
Token T
Token ID309
Feature activation-6.05e-02

Pos logit contributions
 yours+0.014
 our+0.012
 purpose+0.012
 ours+0.011
 pleasure+0.011
Neg logit contributions
i-0.013
 felt-0.012
ppy-0.012
ie-0.012
 hopped-0.012
Token It
Token ID632
Feature activation+1.89e-02

Pos logit contributions
 yours+0.135
 purpose+0.125
 our+0.116
 your+0.107
 ours+0.104
Neg logit contributions
ppy-0.118
 crawled-0.110
 climbed-0.109
owed-0.109
 hopped-0.107
Token is
Token ID318
Feature activation+6.81e-02

Pos logit contributions
 yours+0.036
 our+0.035
 your+0.031
 my+0.030
 purpose+0.029
Neg logit contributions
oped-0.039
i-0.039
ppy-0.038
ited-0.038
owed-0.038
Token not
Token ID407
Feature activation+6.93e-02

Pos logit contributions
 yours+0.141
 purpose+0.136
 pleasure+0.134
 our+0.118
 yourselves+0.117
Neg logit contributions
 crawled-0.116
 felt-0.116
 climbed-0.115
 had-0.110
ppy-0.110
Token safe
Token ID3338
Feature activation+5.72e-02

Pos logit contributions
 purpose+0.115
 yours+0.110
 pleasure+0.106
 our+0.097
 yourselves+0.094
Neg logit contributions
 felt-0.145
 climbed-0.143
 crawled-0.142
ppy-0.141
i-0.138
Token,"
Token ID553
Feature activation-1.33e-02

Pos logit contributions
 yours+0.140
 pleasure+0.137
 purpose+0.136
 our+0.128
 electricity+0.121
Neg logit contributions
y-0.080
s-0.077
i-0.075
ie-0.074
ppy-0.072
Token Pip
Token ID25149
Feature activation-1.68e-01

Pos logit contributions
i+0.026
 felt+0.025
ppy+0.025
 crawled+0.025
s+0.025
Neg logit contributions
 yours-0.028
 purpose-0.025
 our-0.025
 pleasure-0.023
 your-0.022
Token said
Token ID531
Feature activation-3.60e-02

Pos logit contributions
oped+0.266
oping+0.254
ited+0.230
ling+0.213
cases+0.208
Neg logit contributions
 our-0.336
 purpose-0.331
 yours-0.326
 my-0.324
 your-0.315
Token.
Token ID13
Feature activation-2.00e-03

Pos logit contributions
i+0.065
oped+0.064
ppy+0.064
 felt+0.062
ited+0.062
Neg logit contributions
 yours-0.078
 our-0.071
 purpose-0.068
 your-0.066
 pleasure-0.063
Token
Token ID198
Feature activation-5.28e-03

Pos logit contributions
i+0.004
ppy+0.004
owed+0.004
ie+0.004
ling+0.003
Neg logit contributions
 yours-0.004
 our-0.004
 purpose-0.004
 your-0.004
 pleasure-0.004
Token
Token ID198
Feature activation-1.37e-02

Pos logit contributions
i+0.009
ppy+0.009
 felt+0.009
owed+0.009
oped+0.009
Neg logit contributions
 yours-0.012
 our-0.011
 purpose-0.011
 your-0.010
 pleasure-0.010
TokenL
Token ID43
Feature activation-6.58e-02

Pos logit contributions
owed+0.023
i+0.022
oped+0.022
ppy+0.021
 felt+0.021
Neg logit contributions
 yours-0.032
 our-0.031
 purpose-0.029
 your-0.029
 my-0.027
Token stealing
Token ID16568
Feature activation+5.62e-02

Pos logit contributions
 yours+0.116
 purpose+0.114
 pleasure+0.105
 our+0.102
 electricity+0.095
Neg logit contributions
ppy-0.107
i-0.107
 climbed-0.106
 crawled-0.105
 felt-0.105
Token my
Token ID616
Feature activation+2.39e-02

Pos logit contributions
 yours+0.095
 pleasure+0.095
 purpose+0.093
 outer+0.086
 common+0.081
Neg logit contributions
s-0.110
's-0.107
 felt-0.106
i-0.106
y-0.105
Token apples
Token ID22514
Feature activation+1.46e-03

Pos logit contributions
 yours+0.051
 our+0.047
 purpose+0.045
 your+0.043
 yourselves+0.042
Neg logit contributions
i-0.042
 felt-0.042
 crawled-0.041
 hopped-0.041
 climbed-0.041
Token!"
Token ID2474
Feature activation-3.08e-02

Pos logit contributions
 yours+0.003
 our+0.002
 purpose+0.002
 your+0.002
 my+0.002
Neg logit contributions
ppy-0.003
i-0.003
oped-0.003
 felt-0.003
owed-0.003
Token the
Token ID262
Feature activation-9.46e-03

Pos logit contributions
 felt+0.051
i+0.050
s+0.050
ppy+0.049
 crawled+0.049
Neg logit contributions
 yours-0.073
 purpose-0.065
 our-0.065
 pleasure-0.061
 your-0.060
Token farmer
Token ID18739
Feature activation-1.68e-01

Pos logit contributions
ppy+0.017
i+0.017
oped+0.016
owed+0.016
 felt+0.016
Neg logit contributions
 yours-0.021
 our-0.019
 purpose-0.019
 your-0.018
 pleasure-0.017
Token shouted
Token ID17293
Feature activation-3.98e-02

Pos logit contributions
 pads+0.301
ops+0.264
ited+0.259
ede+0.251
oped+0.250
Neg logit contributions
 our-0.302
 my-0.300
 your-0.288
 the-0.254
 a-0.250
Token.
Token ID13
Feature activation-8.71e-03

Pos logit contributions
i+0.074
ppy+0.074
oped+0.072
 crawled+0.071
ited+0.071
Neg logit contributions
 yours-0.081
 our-0.077
 purpose-0.073
 your-0.071
 my-0.066
Token
Token ID198
Feature activation-9.96e-05

Pos logit contributions
ppy+0.013
 felt+0.013
i+0.013
s+0.013
 crawled+0.013
Neg logit contributions
 yours-0.021
 our-0.020
 purpose-0.019
 your-0.018
 pleasure-0.018
Token
Token ID198
Feature activation-9.28e-03

Pos logit contributions
s+0.000
 felt+0.000
ppy+0.000
i+0.000
 crawled+0.000
Neg logit contributions
 yours-0.000
 purpose-0.000
 our-0.000
 pleasure-0.000
 your-0.000
TokenHe
Token ID1544
Feature activation-7.09e-02

Pos logit contributions
i+0.014
ppy+0.014
 felt+0.013
owed+0.013
oped+0.013
Neg logit contributions
 yours-0.023
 our-0.022
 purpose-0.021
 your-0.020
 pleasure-0.019
Token hat
Token ID6877
Feature activation+2.86e-03

Pos logit contributions
i+0.039
ppy+0.038
 felt+0.038
oped+0.038
owed+0.037
Neg logit contributions
 yours-0.068
 our-0.063
 purpose-0.061
 your-0.059
 pleasure-0.056
Token,
Token ID11
Feature activation+1.47e-02

Pos logit contributions
 yours+0.008
 our+0.008
 purpose+0.008
 your+0.007
 pleasure+0.007
Neg logit contributions
ppy-0.003
i-0.003
 felt-0.003
owed-0.003
s-0.003
Token like
Token ID588
Feature activation+8.22e-02

Pos logit contributions
 yours+0.033
 purpose+0.031
 our+0.029
 pleasure+0.028
 electricity+0.027
Neg logit contributions
ppy-0.025
 felt-0.024
i-0.024
 crawled-0.024
 hopped-0.023
Token this
Token ID428
Feature activation+2.90e-02

Pos logit contributions
 purpose+0.123
 pleasure+0.118
 yours+0.117
 common+0.107
 our+0.104
Neg logit contributions
i-0.179
's-0.174
 hopped-0.174
ie-0.173
ppy-0.172
Token."
Token ID526
Feature activation-2.39e-02

Pos logit contributions
 yours+0.056
 purpose+0.052
 our+0.052
 pleasure+0.048
 your+0.046
Neg logit contributions
i-0.056
ppy-0.056
 felt-0.055
ie-0.055
s-0.054
Token Sue
Token ID26089
Feature activation-1.68e-01

Pos logit contributions
 felt+0.040
i+0.039
 crawled+0.039
ppy+0.039
s+0.038
Neg logit contributions
 yours-0.056
 our-0.051
 purpose-0.050
 pleasure-0.046
 your-0.046
Token said
Token ID531
Feature activation-4.30e-02

Pos logit contributions
oped+0.351
oping+0.334
pp+0.321
oise+0.309
ppers+0.308
Neg logit contributions
 your-0.263
 my-0.261
 our-0.243
 a-0.221
 an-0.217
Token.
Token ID13
Feature activation-6.74e-03

Pos logit contributions
ited+0.085
ppy+0.084
i+0.084
oped+0.083
ches+0.081
Neg logit contributions
 yours-0.082
 our-0.076
 purpose-0.072
 your-0.072
 pleasure-0.064
Token She
Token ID1375
Feature activation-7.11e-02

Pos logit contributions
i+0.011
ppy+0.011
 felt+0.011
owed+0.011
oped+0.011
Neg logit contributions
 yours-0.015
 our-0.014
 purpose-0.014
 your-0.013
 pleasure-0.013
Token took
Token ID1718
Feature activation-4.93e-04

Pos logit contributions
bi+0.094
ie+0.093
i+0.092
ony+0.092
ops+0.092
Neg logit contributions
 our-0.159
 my-0.158
 your-0.158
 yours-0.155
 magic-0.144
Token off
Token ID572
Feature activation-7.62e-03

Pos logit contributions
 felt+0.001
 climbed+0.001
i+0.001
 crawled+0.001
s+0.001
Neg logit contributions
 yours-0.001
 purpose-0.001
 our-0.001
 pleasure-0.001
 yourselves-0.001
Token some
Token ID617
Feature activation+6.22e-02

Pos logit contributions
 pleasure+0.135
 purpose+0.128
 yours+0.128
 inconvenience+0.119
 fault+0.114
Neg logit contributions
's-0.143
s-0.139
ie-0.133
ches-0.133
 said-0.132
Token papers
Token ID9473
Feature activation+2.83e-02

Pos logit contributions
 yours+0.121
 purpose+0.114
 pleasure+0.113
 our+0.105
 yourselves+0.103
Neg logit contributions
s-0.116
's-0.107
ie-0.107
 said-0.106
ches-0.106
Token too
Token ID1165
Feature activation+6.65e-02

Pos logit contributions
 yours+0.053
 purpose+0.051
 our+0.048
 pleasure+0.047
 electricity+0.045
Neg logit contributions
s-0.056
 felt-0.055
i-0.055
ppy-0.055
y-0.053
Token?"
Token ID1701
Feature activation-3.96e-02

Pos logit contributions
 yours+0.132
 pleasure+0.119
 our+0.118
 purpose+0.118
 electricity+0.111
Neg logit contributions
y-0.128
s-0.127
i-0.122
ie-0.121
 felt-0.121
Token the
Token ID262
Feature activation-1.02e-02

Pos logit contributions
ppy+0.066
owed+0.062
oped+0.062
i+0.060
ches+0.060
Neg logit contributions
 our-0.092
 yours-0.092
 your-0.088
 purpose-0.086
 my-0.083
Token prince
Token ID19716
Feature activation-1.67e-01

Pos logit contributions
ppy+0.018
i+0.018
oped+0.018
owed+0.017
 felt+0.017
Neg logit contributions
 yours-0.023
 our-0.021
 purpose-0.020
 your-0.019
 pleasure-0.019
Token asked
Token ID1965
Feature activation-4.03e-02

Pos logit contributions
oped+0.280
bi+0.250
mun+0.248
ops+0.247
ited+0.241
Neg logit contributions
 our-0.304
 my-0.299
 your-0.297
 purpose-0.276
 magic-0.274
Token.
Token ID13
Feature activation-1.22e-02

Pos logit contributions
ppy+0.074
oped+0.074
i+0.073
 crawled+0.071
ches+0.071
Neg logit contributions
 yours-0.084
 our-0.078
 purpose-0.075
 your-0.073
 my-0.067
Token His
Token ID2399
Feature activation-1.89e-02

Pos logit contributions
 felt+0.023
i+0.023
ppy+0.023
s+0.023
 crawled+0.022
Neg logit contributions
 yours-0.026
 our-0.023
 purpose-0.023
 your-0.021
 pleasure-0.021
Token dad
Token ID9955
Feature activation-1.31e-01

Pos logit contributions
ppy+0.043
i+0.042
oped+0.041
owed+0.041
ches+0.041
Neg logit contributions
 yours-0.033
 our-0.030
 purpose-0.030
 your-0.027
 pleasure-0.027
Token smiled
Token ID13541
Feature activation-3.10e-02

Pos logit contributions
 pads+0.226
ppy+0.188
bi+0.179
ony+0.177
ops+0.177
Neg logit contributions
 our-0.252
 yours-0.239
 gift-0.238
 your-0.237
 my-0.232
Token helping
Token ID5742
Feature activation+6.28e-02

Pos logit contributions
 purpose+0.112
 yours+0.106
 pleasure+0.098
 our+0.090
 fault+0.089
Neg logit contributions
 climbed-0.103
 felt-0.101
 crawled-0.100
 ran-0.100
 hopped-0.098
Token each
Token ID1123
Feature activation+3.99e-02

Pos logit contributions
 yours+0.108
 pleasure+0.105
 purpose+0.100
 our+0.088
 awesome+0.085
Neg logit contributions
s-0.133
's-0.132
ie-0.129
 felt-0.128
ches-0.127
Token other
Token ID584
Feature activation+8.51e-03

Pos logit contributions
 yours+0.089
 pleasure+0.082
 purpose+0.080
 our+0.078
 ours+0.072
Neg logit contributions
ie-0.068
s-0.066
 felt-0.065
y-0.065
i-0.064
Token."
Token ID526
Feature activation-1.52e-02

Pos logit contributions
 yours+0.017
 purpose+0.015
 our+0.015
 pleasure+0.014
 your+0.014
Neg logit contributions
ppy-0.017
 felt-0.017
i-0.016
s-0.016
ling-0.016
Token Big
Token ID4403
Feature activation-7.88e-02

Pos logit contributions
ppy+0.028
i+0.028
 felt+0.028
owed+0.027
 crawled+0.027
Neg logit contributions
 yours-0.032
 our-0.029
 purpose-0.029
 your-0.027
 pleasure-0.026
Tokengie
Token ID22699
Feature activation-1.67e-01

Pos logit contributions
ppy+0.106
cases+0.104
 crawled+0.104
owed+0.102
 skipped+0.100
Neg logit contributions
 yours-0.199
 our-0.191
 purpose-0.186
 your-0.176
 my-0.171
Token and
Token ID290
Feature activation-2.02e-02

Pos logit contributions
oped+0.249
cases+0.240
ited+0.239
ppy+0.228
oping+0.224
Neg logit contributions
 our-0.359
 my-0.343
 your-0.340
 purpose-0.318
 yours-0.312
Token his
Token ID465
Feature activation-2.60e-02

Pos logit contributions
i+0.045
ppy+0.045
 felt+0.044
 crawled+0.044
owed+0.043
Neg logit contributions
 yours-0.036
 our-0.032
 purpose-0.031
 your-0.028
 pleasure-0.028
Token friends
Token ID2460
Feature activation-1.03e-01

Pos logit contributions
i+0.047
ppy+0.046
 felt+0.046
 crawled+0.045
owed+0.045
Neg logit contributions
 yours-0.057
 our-0.052
 purpose-0.051
 your-0.048
 pleasure-0.047
Token listened
Token ID16399
Feature activation+3.06e-03

Pos logit contributions
ches+0.134
ony+0.130
ppy+0.129
alia+0.129
 pads+0.122
Neg logit contributions
 our-0.233
 yours-0.228
 respect-0.220
 your-0.216
 pleasure-0.207
Token to
Token ID284
Feature activation+9.87e-03

Pos logit contributions
 yours+0.006
 our+0.006
 purpose+0.006
 your+0.005
 pleasure+0.005
Neg logit contributions
i-0.006
ppy-0.006
 felt-0.006
 crawled-0.006
owed-0.006
Token and
Token ID290
Feature activation+4.15e-02

Pos logit contributions
 yours+0.029
 purpose+0.026
 our+0.026
 pleasure+0.024
 your+0.024
Neg logit contributions
ppy-0.018
i-0.018
 felt-0.017
owed-0.017
s-0.017
Token speak
Token ID2740
Feature activation+4.20e-02

Pos logit contributions
 yours+0.087
 purpose+0.083
 our+0.076
 pleasure+0.074
 electricity+0.072
Neg logit contributions
 felt-0.074
 hopped-0.073
 crawled-0.072
 climbed-0.071
i-0.071
Token to
Token ID284
Feature activation+6.20e-02

Pos logit contributions
 yours+0.085
 purpose+0.078
 pleasure+0.074
 electricity+0.073
 our+0.072
Neg logit contributions
s-0.076
y-0.075
's-0.074
i-0.074
ling-0.073
Token me
Token ID502
Feature activation+2.54e-03

Pos logit contributions
 yours+0.099
 purpose+0.087
 pleasure+0.087
 electricity+0.081
 common+0.080
Neg logit contributions
i-0.136
ie-0.128
's-0.128
 hopped-0.123
ppy-0.123
Token.'
Token ID2637
Feature activation-1.84e-02

Pos logit contributions
 yours+0.005
 our+0.005
 purpose+0.005
 pleasure+0.004
 your+0.004
Neg logit contributions
i-0.005
ppy-0.005
 felt-0.005
owed-0.005
s-0.005
Token Claw
Token ID30358
Feature activation-1.67e-01

Pos logit contributions
ppy+0.037
i+0.037
ie+0.036
ling+0.035
owed+0.035
Neg logit contributions
 yours-0.036
 our-0.033
 purpose-0.032
 your-0.031
 pleasure-0.029
Tokening
Token ID278
Feature activation-8.41e-02

Pos logit contributions
cases+0.217
ops+0.203
oped+0.199
atmeal+0.196
mun+0.189
Neg logit contributions
 our-0.347
 your-0.331
 my-0.323
 yours-0.319
 purpose-0.307
Token his
Token ID465
Feature activation-2.79e-02

Pos logit contributions
ll+0.098
 forgot+0.098
cases+0.098
oped+0.096
ppy+0.096
Neg logit contributions
 yours-0.208
 our-0.207
 purpose-0.198
 your-0.194
 my-0.184
Token way
Token ID835
Feature activation-7.47e-03

Pos logit contributions
i+0.053
ppy+0.052
 felt+0.052
 crawled+0.050
 hopped+0.050
Neg logit contributions
 yours-0.060
 our-0.054
 purpose-0.052
 your-0.049
 pleasure-0.047
Token out
Token ID503
Feature activation-2.46e-02

Pos logit contributions
ppy+0.015
i+0.015
oped+0.014
 felt+0.014
owed+0.014
Neg logit contributions
 yours-0.015
 our-0.014
 purpose-0.013
 your-0.013
 pleasure-0.012
Token of
Token ID286
Feature activation+1.47e-02

Pos logit contributions
ppy+0.042
i+0.041
owed+0.039
 felt+0.039
 forgot+0.038
Neg logit contributions
 yours-0.054
 our-0.052
 purpose-0.049
 your-0.048
 my-0.045
Token,
Token ID11
Feature activation+2.71e-02

Pos logit contributions
ppy+0.074
i+0.071
 crawled+0.071
oped+0.071
ling+0.070
Neg logit contributions
 yours-0.112
 our-0.105
 purpose-0.099
 your-0.098
 pleasure-0.094
Token what
Token ID644
Feature activation+5.88e-02

Pos logit contributions
 purpose+0.056
 yours+0.055
 electricity+0.047
 common+0.047
 pleasure+0.046
Neg logit contributions
 felt-0.051
 looked-0.050
ppy-0.050
 got-0.050
i-0.049
Token's
Token ID338
Feature activation+4.49e-02

Pos logit contributions
 yours+0.155
 our+0.143
 purpose+0.142
 your+0.133
 pleasure+0.133
Neg logit contributions
ppy-0.079
i-0.079
 felt-0.077
 crawled-0.075
s-0.075
Token that
Token ID326
Feature activation+7.49e-02

Pos logit contributions
 pleasure+0.095
 purpose+0.092
 yours+0.088
 yourselves+0.078
 diversity+0.078
Neg logit contributions
 looked-0.073
i-0.073
 got-0.071
 said-0.070
 felt-0.070
Token?"
Token ID1701
Feature activation-4.46e-02

Pos logit contributions
 yours+0.139
 purpose+0.127
 pleasure+0.121
 our+0.121
 yourselves+0.112
Neg logit contributions
 felt-0.151
s-0.148
i-0.147
's-0.146
 hopped-0.146
Token Timothy
Token ID22283
Feature activation-1.67e-01

Pos logit contributions
ppy+0.102
i+0.096
ches+0.096
owed+0.095
oped+0.093
Neg logit contributions
 yours-0.078
 our-0.074
 purpose-0.071
 your-0.069
 my-0.064
Token asked
Token ID1965
Feature activation-4.76e-02

Pos logit contributions
ops+0.336
cases+0.325
led+0.322
oped+0.316
umbs+0.297
Neg logit contributions
 my-0.265
 your-0.261
 our-0.235
 yours-0.227
 purpose-0.208
Token,
Token ID11
Feature activation-5.17e-02

Pos logit contributions
ited+0.095
ppy+0.095
ches+0.093
oped+0.092
ops+0.091
Neg logit contributions
 yours-0.088
 our-0.084
 purpose-0.083
 your-0.079
 my-0.072
Token pointing
Token ID10609
Feature activation-1.23e-02

Pos logit contributions
ppy+0.096
ches+0.094
ops+0.093
oped+0.091
s+0.091
Neg logit contributions
 yours-0.106
 our-0.101
 purpose-0.098
 your-0.095
 my-0.090
Token at
Token ID379
Feature activation+1.27e-02

Pos logit contributions
i+0.022
 felt+0.021
ppy+0.021
s+0.021
owed+0.021
Neg logit contributions
 yours-0.028
 our-0.025
 purpose-0.025
 pleasure-0.023
 your-0.023
Token the
Token ID262
Feature activation+1.45e-02

Pos logit contributions
 yours+0.025
 our+0.021
 purpose+0.021
 pleasure+0.020
 ours+0.019
Neg logit contributions
 felt-0.027
i-0.026
 hopped-0.026
 crawled-0.025
 walked-0.025
Token he
Token ID339
Feature activation-4.00e-02

Pos logit contributions
 yours+0.053
 our+0.049
 purpose+0.048
 your+0.046
 pleasure+0.045
Neg logit contributions
ppy-0.027
i-0.027
owed-0.026
oped-0.026
 felt-0.025
Token wanted
Token ID2227
Feature activation+4.82e-02

Pos logit contributions
cases+0.040
ppy+0.038
owed+0.037
oped+0.037
hetically+0.037
Neg logit contributions
 yours-0.112
 our-0.104
 your-0.101
 purpose-0.099
 pleasure-0.097
Token.
Token ID13
Feature activation-5.17e-03

Pos logit contributions
 yours+0.112
 our+0.105
 purpose+0.102
 pleasure+0.097
 ours+0.096
Neg logit contributions
 felt-0.076
i-0.075
 hopped-0.073
 crawled-0.073
ppy-0.072
Token<|endoftext|>
Token ID50256
Feature activation-1.95e-02

Pos logit contributions
i+0.009
ppy+0.009
ling+0.009
owed+0.009
 felt+0.009
Neg logit contributions
 yours-0.012
 our-0.011
 purpose-0.011
 pleasure-0.010
 your-0.010
TokenAk
Token ID33901
Feature activation-9.80e-02

Pos logit contributions
i+0.038
ppy+0.038
owed+0.036
ling+0.035
 felt+0.035
Neg logit contributions
 yours-0.039
 our-0.036
 purpose-0.035
 your-0.034
 pleasure-0.032
Tokenio
Token ID952
Feature activation-1.66e-01

Pos logit contributions
 crawled+0.113
 hopped+0.111
 felt+0.111
oped+0.111
 rushed+0.110
Neg logit contributions
 yours-0.272
 our-0.247
 your-0.239
 purpose-0.229
 my-0.224
Token was
Token ID373
Feature activation+5.57e-03

Pos logit contributions
ling+0.191
oped+0.191
cases+0.187
hetically+0.187
 crawled+0.183
Neg logit contributions
 yours-0.404
 our-0.391
 your-0.386
 purpose-0.372
 my-0.356
Token a
Token ID257
Feature activation+2.12e-02

Pos logit contributions
 yours+0.012
 our+0.011
 purpose+0.011
 your+0.010
 pleasure+0.010
Neg logit contributions
i-0.010
ppy-0.010
 felt-0.010
 crawled-0.009
oped-0.009
Token 3
Token ID513
Feature activation-4.82e-02

Pos logit contributions
 yours+0.043
 our+0.039
 purpose+0.038
 your+0.036
 pleasure+0.036
Neg logit contributions
ppy-0.041
i-0.041
owed-0.040
oped-0.040
 crawled-0.039
Token year
Token ID614
Feature activation+1.21e-03

Pos logit contributions
ppy+0.102
oped+0.102
ches+0.096
ll+0.095
owed+0.093
Neg logit contributions
 our-0.082
 yours-0.080
 purpose-0.080
 your-0.077
 my-0.072
Token old
Token ID1468
Feature activation-3.51e-02

Pos logit contributions
 yours+0.002
 our+0.002
 purpose+0.002
 pleasure+0.001
 your+0.001
Neg logit contributions
i-0.003
ppy-0.003
s-0.003
 felt-0.003
ling-0.003
Token to
Token ID284
Feature activation+8.58e-02

Pos logit contributions
 yours+0.207
 pleasure+0.206
 purpose+0.195
 ours+0.192
 yourselves+0.190
Neg logit contributions
s-0.124
's-0.122
 said-0.122
 sat-0.120
 felt-0.120
Token be
Token ID307
Feature activation+7.99e-02

Pos logit contributions
 yours+0.211
 electricity+0.194
 purpose+0.189
 pleasure+0.188
 yourselves+0.185
Neg logit contributions
 felt-0.126
 crawled-0.123
 hopped-0.122
 got-0.122
 climbed-0.121
Token my
Token ID616
Feature activation+3.80e-02

Pos logit contributions
 purpose+0.091
 yours+0.089
 electricity+0.087
 pleasure+0.087
 inconvenience+0.079
Neg logit contributions
 felt-0.205
ppy-0.201
i-0.201
 hopped-0.199
 crawled-0.198
Token friends
Token ID2460
Feature activation+1.98e-02

Pos logit contributions
 yours+0.101
 purpose+0.093
 our+0.093
 electricity+0.091
 yourselves+0.088
Neg logit contributions
i-0.049
ie-0.044
 felt-0.042
s-0.042
y-0.042
Token?"
Token ID1701
Feature activation-2.67e-02

Pos logit contributions
 yours+0.038
 purpose+0.035
 pleasure+0.033
 our+0.033
 electricity+0.032
Neg logit contributions
ppy-0.041
 felt-0.040
i-0.040
ie-0.040
y-0.039
Token Pip
Token ID25149
Feature activation-1.66e-01

Pos logit contributions
 felt+0.050
i+0.050
 crawled+0.049
 hopped+0.049
ppy+0.049
Neg logit contributions
 yours-0.057
 purpose-0.051
 our-0.050
 pleasure-0.047
 your-0.045
Token said
Token ID531
Feature activation-4.57e-02

Pos logit contributions
oped+0.291
cases+0.258
oping+0.256
ited+0.247
owed+0.233
Neg logit contributions
 my-0.331
 your-0.318
 our-0.318
 yours-0.314
 purpose-0.305
Token.
Token ID13
Feature activation-2.42e-03

Pos logit contributions
i+0.083
ppy+0.083
oped+0.083
ited+0.081
 forgot+0.080
Neg logit contributions
 yours-0.096
 our-0.089
 purpose-0.085
 your-0.083
 pleasure-0.077
Token
Token ID198
Feature activation-1.56e-03

Pos logit contributions
i+0.004
ppy+0.004
owed+0.004
 felt+0.004
ie+0.004
Neg logit contributions
 yours-0.005
 our-0.005
 purpose-0.005
 your-0.005
 pleasure-0.004
Token
Token ID198
Feature activation-9.28e-03

Pos logit contributions
ppy+0.002
i+0.002
 felt+0.002
s+0.002
 crawled+0.002
Neg logit contributions
 yours-0.004
 our-0.003
 purpose-0.003
 your-0.003
 pleasure-0.003
Token"
Token ID1
Feature activation+2.18e-02

Pos logit contributions
owed+0.016
i+0.016
ppy+0.015
 felt+0.015
oped+0.015
Neg logit contributions
 yours-0.021
 our-0.020
 purpose-0.019
 your-0.019
 my-0.018
Token consequences
Token ID6948
Feature activation+4.69e-02

Pos logit contributions
 yours+0.122
 purpose+0.109
 our+0.103
 pleasure+0.100
 ours+0.095
Neg logit contributions
 felt-0.147
 climbed-0.137
i-0.137
 forgot-0.137
 hopped-0.137
Token."
Token ID526
Feature activation-6.85e-03

Pos logit contributions
 yours+0.107
 purpose+0.102
 our+0.099
 pleasure+0.094
 your+0.090
Neg logit contributions
ppy-0.077
i-0.074
s-0.074
 felt-0.074
ling-0.073
Token
Token ID198
Feature activation+4.54e-03

Pos logit contributions
ppy+0.012
i+0.012
owed+0.012
oped+0.012
 crawled+0.012
Neg logit contributions
 yours-0.015
 our-0.014
 purpose-0.013
 your-0.013
 pleasure-0.013
Token
Token ID198
Feature activation-2.64e-03

Pos logit contributions
 yours+0.010
 our+0.009
 purpose+0.009
 your+0.009
 pleasure+0.008
Neg logit contributions
i-0.008
ppy-0.008
owed-0.008
 crawled-0.008
 felt-0.008
TokenD
Token ID35
Feature activation-6.07e-02

Pos logit contributions
owed+0.004
i+0.004
ppy+0.004
oped+0.004
 crawled+0.004
Neg logit contributions
 yours-0.006
 our-0.006
 purpose-0.006
 your-0.006
 yourselves-0.005
Tokenennis
Token ID10679
Feature activation-1.66e-01

Pos logit contributions
 crawled+0.091
 felt+0.090
oped+0.089
 skipped+0.088
owed+0.087
Neg logit contributions
 yours-0.149
 our-0.133
 purpose-0.131
 your-0.127
 pleasure-0.122
Token and
Token ID290
Feature activation-2.14e-02

Pos logit contributions
oped+0.192
cases+0.167
oping+0.163
ppy+0.162
mun+0.153
Neg logit contributions
 our-0.425
 purpose-0.414
 yours-0.408
 your-0.405
 my-0.401
Token Jenny
Token ID28437
Feature activation-1.31e-01

Pos logit contributions
ches+0.048
owed+0.048
oped+0.048
cases+0.047
ppy+0.047
Neg logit contributions
 yours-0.036
 our-0.034
 purpose-0.032
 your-0.032
 my-0.030
Token left
Token ID1364
Feature activation+1.16e-02

Pos logit contributions
ppy+0.138
oped+0.136
ches+0.135
ony+0.121
 pads+0.120
Neg logit contributions
 our-0.333
 yours-0.321
 respect-0.312
 your-0.310
 purpose-0.303
Token the
Token ID262
Feature activation+1.33e-02

Pos logit contributions
 yours+0.025
 our+0.022
 purpose+0.022
 your+0.020
 pleasure+0.020
Neg logit contributions
i-0.021
 felt-0.021
ppy-0.021
 crawled-0.020
s-0.020
Token restaurant
Token ID7072
Feature activation-5.02e-02

Pos logit contributions
 yours+0.030
 our+0.027
 purpose+0.026
 pleasure+0.025
 your+0.024
Neg logit contributions
 felt-0.023
ling-0.023
i-0.023
s-0.022
 crawled-0.022
Token to
Token ID284
Feature activation+7.50e-02

Pos logit contributions
 yours+0.099
 purpose+0.091
 our+0.091
 pleasure+0.088
 electricity+0.083
Neg logit contributions
y-0.110
i-0.108
s-0.107
 felt-0.107
ie-0.106
Token me
Token ID502
Feature activation+1.94e-02

Pos logit contributions
 yours+0.143
 pleasure+0.134
 purpose+0.125
 our+0.125
 electricity+0.117
Neg logit contributions
i-0.149
ie-0.146
 felt-0.144
ppy-0.141
 crawled-0.138
Token?"
Token ID1701
Feature activation-1.88e-02

Pos logit contributions
 yours+0.034
 our+0.030
 purpose+0.030
 pleasure+0.027
 your+0.026
Neg logit contributions
i-0.043
ppy-0.043
 felt-0.043
owed-0.042
s-0.042
Token
Token ID198
Feature activation-1.72e-03

Pos logit contributions
ppy+0.040
i+0.040
owed+0.038
oped+0.038
 felt+0.038
Neg logit contributions
 yours-0.035
 our-0.032
 purpose-0.031
 your-0.029
 pleasure-0.028
TokenThe
Token ID464
Feature activation-2.95e-03

Pos logit contributions
i+0.003
ppy+0.003
 felt+0.003
owed+0.003
 crawled+0.003
Neg logit contributions
 yours-0.004
 our-0.004
 purpose-0.004
 your-0.003
 pleasure-0.003
Token prince
Token ID19716
Feature activation-1.65e-01

Pos logit contributions
i+0.005
ppy+0.005
oped+0.005
owed+0.005
 felt+0.005
Neg logit contributions
 yours-0.007
 our-0.006
 purpose-0.006
 your-0.006
 pleasure-0.006
Token sn
Token ID3013
Feature activation-4.72e-02

Pos logit contributions
 pads+0.224
oped+0.213
cases+0.211
ops+0.202
ppy+0.201
Neg logit contributions
 our-0.364
 my-0.349
 your-0.349
 yours-0.338
 purpose-0.326
Tokenar
Token ID283
Feature activation-7.58e-02

Pos logit contributions
ppy+0.069
i+0.066
 crawled+0.066
 felt+0.066
ches+0.066
Neg logit contributions
 yours-0.116
 our-0.105
 purpose-0.103
 your-0.101
 yourselves-0.096
Tokenled
Token ID992
Feature activation-9.96e-03

Pos logit contributions
ppy+0.051
bi+0.050
oped+0.049
i+0.046
ony+0.045
Neg logit contributions
 yours-0.217
 our-0.211
 your-0.204
 purpose-0.197
 my-0.192
Token and
Token ID290
Feature activation-2.98e-02

Pos logit contributions
ppy+0.012
oped+0.012
i+0.012
ops+0.012
owed+0.011
Neg logit contributions
 yours-0.027
 our-0.026
 purpose-0.025
 your-0.024
 my-0.023
Token said
Token ID531
Feature activation-4.60e-03

Pos logit contributions
ppy+0.028
i+0.025
oped+0.025
owed+0.025
ops+0.024
Neg logit contributions
 yours-0.089
 our-0.087
 your-0.084
 purpose-0.082
 my-0.081
Token it
Token ID340
Feature activation+2.30e-02

Pos logit contributions
 yours+0.135
 purpose+0.126
 pleasure+0.120
 our+0.114
 gift+0.110
Neg logit contributions
i-0.118
s-0.116
 felt-0.115
 ran-0.115
 hopped-0.112
Token."
Token ID526
Feature activation-1.77e-02

Pos logit contributions
 yours+0.042
 our+0.038
 purpose+0.037
 pleasure+0.034
 your+0.034
Neg logit contributions
ppy-0.049
i-0.048
 felt-0.048
s-0.047
 crawled-0.047
Token
Token ID198
Feature activation-9.95e-04

Pos logit contributions
ppy+0.032
owed+0.031
ches+0.031
i+0.031
oped+0.031
Neg logit contributions
 yours-0.038
 our-0.036
 purpose-0.035
 your-0.033
 pleasure-0.033
Token
Token ID198
Feature activation-5.84e-03

Pos logit contributions
ppy+0.002
i+0.002
 felt+0.002
owed+0.002
 crawled+0.002
Neg logit contributions
 yours-0.002
 our-0.002
 purpose-0.002
 your-0.002
 pleasure-0.002
TokenG
Token ID38
Feature activation-6.91e-02

Pos logit contributions
ppy+0.010
owed+0.009
i+0.009
 felt+0.009
oped+0.009
Neg logit contributions
 yours-0.014
 our-0.013
 purpose-0.013
 your-0.012
 pleasure-0.011
Tokenavin
Token ID20637
Feature activation-1.65e-01

Pos logit contributions
 crawled+0.113
 hopped+0.111
 skipped+0.110
 felt+0.110
oped+0.108
Neg logit contributions
 yours-0.164
 our-0.146
 purpose-0.142
 your-0.137
 yourselves-0.132
Token thanked
Token ID26280
Feature activation-1.94e-03

Pos logit contributions
oped+0.171
cases+0.160
ppy+0.158
ited+0.158
mun+0.157
Neg logit contributions
 our-0.410
 your-0.389
 yours-0.387
 purpose-0.375
 my-0.374
Token the
Token ID262
Feature activation+1.11e-02

Pos logit contributions
i+0.004
ppy+0.004
 felt+0.004
 crawled+0.004
 hopped+0.004
Neg logit contributions
 yours-0.004
 our-0.003
 purpose-0.003
 pleasure-0.003
 your-0.003
Token woman
Token ID2415
Feature activation-9.56e-02

Pos logit contributions
 yours+0.031
 our+0.028
 purpose+0.027
 your+0.026
 pleasure+0.026
Neg logit contributions
i-0.014
 felt-0.014
 crawled-0.013
ppy-0.013
 hopped-0.013
Token and
Token ID290
Feature activation-4.02e-02

Pos logit contributions
oped+0.154
 forgot+0.149
 struggled+0.144
ppy+0.143
 crawled+0.140
Neg logit contributions
 yours-0.220
 our-0.218
 your-0.206
 my-0.194
 purpose-0.191
Token drove
Token ID10357
Feature activation-5.52e-03

Pos logit contributions
ppy+0.046
ops+0.043
ches+0.042
ony+0.041
oped+0.040
Neg logit contributions
 our-0.112
 yours-0.107
 your-0.107
 my-0.100
 pleasure-0.093
Token is
Token ID318
Feature activation+6.61e-02

Pos logit contributions
 yours+0.017
 our+0.016
 your+0.014
 purpose+0.014
 my+0.014
Neg logit contributions
oped-0.018
ppy-0.017
i-0.017
owed-0.017
 skipped-0.017
Token like
Token ID588
Feature activation+8.62e-02

Pos logit contributions
 purpose+0.120
 yours+0.117
 pleasure+0.110
 yourselves+0.104
 our+0.103
Neg logit contributions
i-0.124
 felt-0.123
ppy-0.117
 climbed-0.116
 crawled-0.116
Token this
Token ID428
Feature activation+3.93e-02

Pos logit contributions
 yours+0.133
 purpose+0.128
 pleasure+0.125
 common+0.116
 outer+0.116
Neg logit contributions
 felt-0.171
 ran-0.160
 got-0.158
 crawled-0.158
 hopped-0.158
Token pumpkin
Token ID30089
Feature activation-9.45e-03

Pos logit contributions
 yours+0.089
 our+0.082
 purpose+0.080
 pleasure+0.076
 your+0.075
Neg logit contributions
 felt-0.066
i-0.064
s-0.063
y-0.062
ie-0.062
Token,"
Token ID553
Feature activation-2.64e-02

Pos logit contributions
ppy+0.016
i+0.016
oped+0.016
 crawled+0.015
owed+0.015
Neg logit contributions
 yours-0.022
 our-0.020
 purpose-0.019
 your-0.018
 pleasure-0.018
Token Pam
Token ID24497
Feature activation-1.65e-01

Pos logit contributions
 felt+0.048
i+0.048
 hopped+0.047
 crawled+0.047
ppy+0.046
Neg logit contributions
 yours-0.058
 our-0.052
 purpose-0.052
 pleasure-0.048
 ours-0.047
Token says
Token ID1139
Feature activation-4.64e-02

Pos logit contributions
oped+0.382
oping+0.354
 mun+0.333
ppy+0.333
owed+0.327
Neg logit contributions
 our-0.273
 yours-0.270
 your-0.269
 my-0.259
 purpose-0.227
Token.
Token ID13
Feature activation-2.97e-03

Pos logit contributions
i+0.089
ppy+0.089
ited+0.088
oped+0.088
 forgot+0.086
Neg logit contributions
 yours-0.091
 our-0.084
 purpose-0.078
 your-0.078
 pleasure-0.072
Token
Token ID198
Feature activation-1.65e-03

Pos logit contributions
i+0.005
ppy+0.005
 felt+0.005
owed+0.005
 crawled+0.005
Neg logit contributions
 yours-0.007
 our-0.006
 purpose-0.006
 your-0.006
 pleasure-0.006
Token
Token ID198
Feature activation-7.82e-03

Pos logit contributions
ppy+0.003
i+0.003
 felt+0.003
s+0.002
 crawled+0.002
Neg logit contributions
 yours-0.004
 our-0.004
 purpose-0.004
 your-0.003
 pleasure-0.003
Token"
Token ID1
Feature activation+1.85e-02

Pos logit contributions
i+0.012
owed+0.012
ppy+0.012
 felt+0.012
oped+0.011
Neg logit contributions
 yours-0.019
 our-0.018
 purpose-0.017
 your-0.017
 yourselves-0.016
Token in
Token ID287
Feature activation+5.67e-02

Pos logit contributions
 yours+0.007
 our+0.006
 purpose+0.006
 pleasure+0.006
 your+0.006
Neg logit contributions
ppy-0.005
i-0.005
 felt-0.005
owed-0.004
s-0.004
Token the
Token ID262
Feature activation+3.01e-02

Pos logit contributions
 pleasure+0.092
 yours+0.089
 purpose+0.077
 diversity+0.074
 fault+0.073
Neg logit contributions
ie-0.107
 felt-0.107
 crawled-0.106
 hopped-0.105
i-0.105
Token mud
Token ID17492
Feature activation-7.14e-03

Pos logit contributions
 yours+0.074
 our+0.064
 purpose+0.064
 pleasure+0.061
 ours+0.061
Neg logit contributions
 felt-0.049
 crawled-0.048
 hopped-0.047
 climbed-0.046
 got-0.045
Token,"
Token ID553
Feature activation-2.82e-02

Pos logit contributions
ppy+0.013
oped+0.013
i+0.013
owed+0.012
 felt+0.012
Neg logit contributions
 yours-0.016
 our-0.014
 purpose-0.014
 your-0.013
 yourselves-0.013
Token the
Token ID262
Feature activation+4.20e-03

Pos logit contributions
i+0.042
 felt+0.042
ppy+0.041
 crawled+0.040
s+0.040
Neg logit contributions
 yours-0.071
 our-0.064
 purpose-0.064
 pleasure-0.060
 your-0.059
Token prince
Token ID19716
Feature activation-1.65e-01

Pos logit contributions
 yours+0.010
 our+0.009
 purpose+0.009
 pleasure+0.009
 your+0.009
Neg logit contributions
i-0.006
 felt-0.006
ppy-0.006
 crawled-0.006
 hopped-0.006
Token said
Token ID531
Feature activation-5.13e-02

Pos logit contributions
oped+0.264
ited+0.262
oping+0.244
acking+0.240
mun+0.240
Neg logit contributions
 our-0.289
 your-0.275
 yours-0.267
 my-0.263
 magic-0.257
Token.
Token ID13
Feature activation-8.26e-03

Pos logit contributions
i+0.099
ppy+0.098
ited+0.098
oped+0.097
 forgot+0.094
Neg logit contributions
 yours-0.099
 our-0.092
 purpose-0.088
 your-0.086
 pleasure-0.079
Token
Token ID198
Feature activation-5.21e-03

Pos logit contributions
i+0.014
ppy+0.014
owed+0.013
 felt+0.013
oped+0.013
Neg logit contributions
 yours-0.019
 our-0.018
 purpose-0.017
 your-0.016
 pleasure-0.016
Token
Token ID198
Feature activation-1.30e-02

Pos logit contributions
i+0.008
ppy+0.008
 felt+0.008
 crawled+0.008
owed+0.008
Neg logit contributions
 yours-0.012
 our-0.011
 purpose-0.011
 your-0.010
 pleasure-0.010
TokenThe
Token ID464
Feature activation-3.56e-03

Pos logit contributions
i+0.023
ppy+0.023
owed+0.022
 felt+0.022
oped+0.022
Neg logit contributions
 yours-0.029
 our-0.027
 purpose-0.027
 your-0.025
 pleasure-0.024
Token get
Token ID651
Feature activation+7.03e-02

Pos logit contributions
 yours+0.097
 purpose+0.089
 our+0.086
 pleasure+0.082
 your+0.080
Neg logit contributions
 felt-0.071
 crawled-0.070
i-0.070
ppy-0.070
 hopped-0.069
Token to
Token ID284
Feature activation+7.86e-02

Pos logit contributions
 purpose+0.148
 yours+0.135
 pleasure+0.130
 diversity+0.116
 common+0.114
Neg logit contributions
 ran-0.129
 cried-0.128
 walked-0.128
i-0.126
 felt-0.126
Token the
Token ID262
Feature activation+3.18e-02

Pos logit contributions
 yours+0.166
 purpose+0.158
 pleasure+0.157
 diversity+0.149
ags+0.144
Neg logit contributions
 climbed-0.123
 ran-0.121
 crawled-0.117
 felt-0.116
 walked-0.115
Token water
Token ID1660
Feature activation+1.35e-02

Pos logit contributions
 yours+0.077
 purpose+0.069
 our+0.066
 pleasure+0.066
 your+0.061
Neg logit contributions
i-0.050
 felt-0.050
 crawled-0.049
 hopped-0.047
 climbed-0.047
Token."
Token ID526
Feature activation-1.41e-02

Pos logit contributions
 yours+0.029
 our+0.027
 purpose+0.026
 your+0.025
 pleasure+0.024
Neg logit contributions
i-0.024
ppy-0.024
 felt-0.024
 crawled-0.023
s-0.023
Token Sam
Token ID3409
Feature activation-1.65e-01

Pos logit contributions
ppy+0.028
i+0.028
 felt+0.027
owed+0.027
oped+0.027
Neg logit contributions
 yours-0.028
 our-0.026
 purpose-0.025
 your-0.024
 pleasure-0.023
Token whispered
Token ID25029
Feature activation-4.28e-02

Pos logit contributions
oped+0.297
oping+0.295
ops+0.271
pp+0.264
atmeal+0.262
Neg logit contributions
 my-0.311
 our-0.310
 your-0.300
 an-0.260
 a-0.252
Token,
Token ID11
Feature activation-5.30e-02

Pos logit contributions
oped+0.074
ppy+0.074
i+0.073
ited+0.072
 felt+0.072
Neg logit contributions
 yours-0.091
 our-0.086
 purpose-0.083
 your-0.080
 pleasure-0.075
Token trying
Token ID2111
Feature activation+2.34e-02

Pos logit contributions
ops+0.104
ppy+0.100
oped+0.099
ie+0.099
s+0.098
Neg logit contributions
 our-0.102
 yours-0.100
 your-0.095
 my-0.090
 purpose-0.089
Token to
Token ID284
Feature activation+3.87e-02

Pos logit contributions
 yours+0.047
 our+0.041
 purpose+0.041
 pleasure+0.039
 your+0.037
Neg logit contributions
 felt-0.047
s-0.046
i-0.046
 crawled-0.046
owed-0.046
Token be
Token ID307
Feature activation+4.66e-02

Pos logit contributions
 yours+0.089
 purpose+0.080
 our+0.077
 pleasure+0.077
 ours+0.072
Neg logit contributions
 crawled-0.064
 felt-0.062
 ran-0.061
 hopped-0.061
i-0.061
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
ppy+0.098
i+0.094
oped+0.093
 felt+0.093
 crawled+0.092
Neg logit contributions
 yours-0.121
 our-0.111
 purpose-0.107
 your-0.104
 my-0.097
Token
Token ID220
Feature activation+8.08e-03

Pos logit contributions
ppy+0.024
i+0.024
owed+0.023
ches+0.023
ling+0.023
Neg logit contributions
 yours-0.025
 our-0.023
 purpose-0.023
 your-0.021
 pleasure-0.021
Token
Token ID198
Feature activation-1.39e-03

Pos logit contributions
 yours+0.018
 our+0.016
 purpose+0.016
 your+0.015
 pleasure+0.015
Neg logit contributions
ppy-0.015
i-0.015
 felt-0.014
 crawled-0.014
owed-0.014
Token
Token ID198
Feature activation-1.04e-02

Pos logit contributions
ppy+0.002
i+0.002
owed+0.002
 felt+0.002
oped+0.002
Neg logit contributions
 yours-0.003
 our-0.003
 purpose-0.003
 your-0.003
 pleasure-0.003
TokenJ
Token ID41
Feature activation-1.04e-01

Pos logit contributions
i+0.016
ppy+0.016
owed+0.015
 felt+0.015
oped+0.015
Neg logit contributions
 yours-0.025
 our-0.024
 purpose-0.024
 your-0.022
 pleasure-0.021
Tokenill
Token ID359
Feature activation-1.65e-01

Pos logit contributions
i+0.113
 hopped+0.112
 felt+0.112
ling+0.110
mun+0.110
Neg logit contributions
 yours-0.291
 our-0.257
 purpose-0.254
 your-0.250
 my-0.239
Token was
Token ID373
Feature activation-9.40e-03

Pos logit contributions
oped+0.227
cases+0.215
 pads+0.214
ppers+0.214
umbs+0.207
Neg logit contributions
 your-0.308
 my-0.299
 our-0.295
 purpose-0.282
 power-0.270
Token embarrassed
Token ID21100
Feature activation+1.18e-02

Pos logit contributions
ppy+0.015
i+0.015
s+0.014
oped+0.014
ie+0.014
Neg logit contributions
 yours-0.022
 our-0.021
 purpose-0.020
 your-0.020
 pleasure-0.019
Token that
Token ID326
Feature activation+2.76e-02

Pos logit contributions
 yours+0.030
 our+0.027
 purpose+0.027
 your+0.026
 pleasure+0.025
Neg logit contributions
ppy-0.017
i-0.017
oped-0.017
owed-0.017
 crawled-0.016
Token she
Token ID673
Feature activation-6.09e-02

Pos logit contributions
 yours+0.071
 our+0.066
 purpose+0.065
 pleasure+0.061
 your+0.060
Neg logit contributions
i-0.039
 felt-0.038
ppy-0.038
 crawled-0.037
s-0.037
Token had
Token ID550
Feature activation+4.64e-02

Pos logit contributions
ppy+0.074
hetically+0.072
mun+0.070
owed+0.069
ie+0.069
Neg logit contributions
 yours-0.142
 our-0.135
 your-0.130
 my-0.127
 purpose-0.126
Token going
Token ID1016
Feature activation+4.68e-02

Pos logit contributions
 purpose+0.136
 yours+0.136
 pleasure+0.128
 yourselves+0.125
 Enjoy+0.125
Neg logit contributions
's-0.113
 felt-0.113
 looked-0.108
 climbed-0.108
 got-0.107
Token to
Token ID284
Feature activation+7.73e-02

Pos logit contributions
 yours+0.098
 purpose+0.093
 pleasure+0.091
 our+0.086
 yourselves+0.085
Neg logit contributions
s-0.085
ches-0.081
y-0.080
 got-0.080
 felt-0.079
Token shoot
Token ID2686
Feature activation+4.21e-02

Pos logit contributions
 yours+0.182
 pleasure+0.166
 purpose+0.153
 our+0.151
 yourselves+0.151
Neg logit contributions
 felt-0.121
 crawled-0.117
 hopped-0.115
 got-0.115
 climbed-0.113
Token us
Token ID514
Feature activation+5.95e-03

Pos logit contributions
 yours+0.090
 pleasure+0.084
 purpose+0.082
 our+0.076
 ours+0.071
Neg logit contributions
s-0.075
y-0.074
i-0.073
's-0.073
ling-0.073
Token!"
Token ID2474
Feature activation-2.47e-02

Pos logit contributions
 yours+0.012
 purpose+0.011
 our+0.011
 pleasure+0.010
 your+0.010
Neg logit contributions
i-0.011
ppy-0.011
s-0.011
owed-0.011
 felt-0.011
Token Jill
Token ID22772
Feature activation-1.65e-01

Pos logit contributions
 felt+0.040
i+0.040
ppy+0.039
 crawled+0.039
s+0.038
Neg logit contributions
 yours-0.059
 our-0.054
 purpose-0.053
 pleasure-0.049
 your-0.049
Token screamed
Token ID25421
Feature activation-2.73e-02

Pos logit contributions
oped+0.267
bi+0.247
ll+0.246
pp+0.241
oise+0.240
Neg logit contributions
 our-0.317
 your-0.314
 my-0.310
 purpose-0.287
 an-0.269
Token,
Token ID11
Feature activation-4.21e-02

Pos logit contributions
oped+0.050
ppy+0.049
ited+0.049
i+0.049
ops+0.049
Neg logit contributions
 yours-0.054
 our-0.052
 purpose-0.051
 your-0.049
 pleasure-0.045
Token hiding
Token ID11816
Feature activation-1.79e-02

Pos logit contributions
ppy+0.080
ops+0.079
ie+0.078
oped+0.078
ony+0.077
Neg logit contributions
 our-0.082
 yours-0.082
 your-0.076
 purpose-0.075
 my-0.072
Token behind
Token ID2157
Feature activation+1.35e-02

Pos logit contributions
ppy+0.025
i+0.025
 felt+0.024
oped+0.023
owed+0.023
Neg logit contributions
 yours-0.047
 our-0.043
 purpose-0.043
 your-0.040
 pleasure-0.040
Token her
Token ID607
Feature activation-1.74e-02

Pos logit contributions
 yours+0.032
 ours+0.027
 our+0.027
 pleasure+0.026
 purpose+0.026
Neg logit contributions
 felt-0.023
 walked-0.023
 looked-0.023
 crawled-0.023
 ran-0.022
Token will
Token ID481
Feature activation+6.67e-02

Pos logit contributions
 yours+0.076
 our+0.070
 purpose+0.066
 your+0.064
 pleasure+0.062
Neg logit contributions
i-0.065
ppy-0.064
ling-0.062
owed-0.061
 felt-0.061
Token hurt
Token ID5938
Feature activation+7.10e-02

Pos logit contributions
 yours+0.152
 our+0.137
 pleasure+0.137
 purpose+0.133
 ours+0.130
Neg logit contributions
 felt-0.107
 crawled-0.106
 got-0.105
 hopped-0.105
 ran-0.104
Token you
Token ID345
Feature activation+2.92e-02

Pos logit contributions
 yours+0.115
 pleasure+0.112
 purpose+0.109
 gift+0.096
 common+0.095
Neg logit contributions
i-0.156
ie-0.149
ppy-0.148
's-0.147
 felt-0.143
Token!"
Token ID2474
Feature activation-2.62e-02

Pos logit contributions
 yours+0.052
 our+0.046
 purpose+0.046
 pleasure+0.041
 your+0.041
Neg logit contributions
ppy-0.064
i-0.064
 felt-0.063
owed-0.062
s-0.062
Token The
Token ID383
Feature activation-5.12e-04

Pos logit contributions
ppy+0.056
i+0.055
ling+0.053
oped+0.052
ie+0.052
Neg logit contributions
 yours-0.047
 our-0.046
 purpose-0.044
 your-0.043
 pleasure-0.041
Token prince
Token ID19716
Feature activation-1.64e-01

Pos logit contributions
ppy+0.001
 felt+0.001
i+0.001
 crawled+0.001
 hopped+0.001
Neg logit contributions
 yours-0.001
 our-0.001
 purpose-0.001
 pleasure-0.001
 your-0.001
Token was
Token ID373
Feature activation+4.33e-04

Pos logit contributions
cases+0.231
 pads+0.228
oped+0.219
oping+0.211
ppy+0.209
Neg logit contributions
 our-0.331
 my-0.318
 your-0.310
 purpose-0.308
 friendship-0.305
Token scared
Token ID12008
Feature activation+1.42e-02

Pos logit contributions
 yours+0.001
 our+0.001
 purpose+0.001
 your+0.001
 pleasure+0.001
Neg logit contributions
ppy-0.001
i-0.001
oped-0.001
s-0.001
owed-0.001
Token but
Token ID475
Feature activation-4.59e-02

Pos logit contributions
 yours+0.038
 our+0.037
 purpose+0.035
 your+0.035
 my+0.033
Neg logit contributions
ppy-0.018
owed-0.018
oped-0.017
i-0.017
 crawled-0.017
Token he
Token ID339
Feature activation-8.17e-02

Pos logit contributions
ops+0.083
ches+0.081
ppy+0.080
cases+0.080
oped+0.079
Neg logit contributions
 our-0.097
 yours-0.097
 your-0.096
 my-0.092
 pleasure-0.084
Token was
Token ID373
Feature activation+1.42e-07

Pos logit contributions
cases+0.124
ops+0.113
oped+0.111
ppy+0.109
oping+0.108
Neg logit contributions
 our-0.181
 yours-0.174
 my-0.173
 your-0.171
 purpose-0.164
Token you
Token ID345
Feature activation+6.15e-03

Pos logit contributions
 yours+0.070
 purpose+0.062
 our+0.062
 your+0.055
 pleasure+0.055
Neg logit contributions
ppy-0.088
i-0.088
 felt-0.086
 crawled-0.085
s-0.085
Token can
Token ID460
Feature activation+4.34e-02

Pos logit contributions
 yours+0.013
 purpose+0.012
 our+0.012
 your+0.011
 pleasure+0.011
Neg logit contributions
ppy-0.011
 felt-0.011
i-0.011
s-0.011
 crawled-0.011
Token play
Token ID711
Feature activation+4.84e-02

Pos logit contributions
 yours+0.093
 our+0.085
 purpose+0.084
 pleasure+0.078
 your+0.077
Neg logit contributions
i-0.079
ppy-0.079
 felt-0.078
s-0.077
 crawled-0.077
Token with
Token ID351
Feature activation+5.26e-02

Pos logit contributions
 yours+0.137
 pleasure+0.132
 purpose+0.129
 electricity+0.125
 ours+0.121
Neg logit contributions
es-0.052
s-0.051
's-0.050
 felt-0.047
y-0.046
Token me
Token ID502
Feature activation-7.56e-03

Pos logit contributions
 pleasure+0.088
 yours+0.085
 purpose+0.083
 electricity+0.076
 fault+0.070
Neg logit contributions
s-0.114
i-0.112
ie-0.111
ppy-0.110
's-0.109
Token.
Token ID13
Feature activation+5.16e-02

Pos logit contributions
i+0.013
ppy+0.013
 felt+0.013
oped+0.013
 crawled+0.013
Neg logit contributions
 yours-0.017
 our-0.016
 purpose-0.015
 your-0.014
 pleasure-0.014
Token This
Token ID770
Feature activation+3.86e-02

Pos logit contributions
 yours+0.093
 purpose+0.090
 our+0.083
 electricity+0.076
 ours+0.075
Neg logit contributions
ppy-0.108
 hopped-0.105
 crawled-0.104
oped-0.103
 skipped-0.103
Token is
Token ID318
Feature activation+7.10e-02

Pos logit contributions
 yours+0.084
 purpose+0.078
 our+0.078
 pleasure+0.074
 yourselves+0.071
Neg logit contributions
s-0.065
 felt-0.065
 crawled-0.063
's-0.063
y-0.063
Token Lily
Token ID20037
Feature activation-3.25e-02

Pos logit contributions
 purpose+0.072
 yours+0.061
 pleasure+0.061
 electricity+0.057
 yourselves+0.053
Neg logit contributions
ppy-0.190
i-0.188
ches-0.187
 climbed-0.187
 hopped-0.187
Token,
Token ID11
Feature activation+2.60e-02

Pos logit contributions
oped+0.055
i+0.054
ppy+0.053
 crawled+0.052
ited+0.052
Neg logit contributions
 yours-0.078
 our-0.071
 your-0.067
 purpose-0.065
 my-0.064
Token my
Token ID616
Feature activation+1.81e-02

Pos logit contributions
 yours+0.057
 purpose+0.053
 our+0.052
 pleasure+0.048
 your+0.047
Neg logit contributions
ppy-0.045
i-0.045
 felt-0.044
 crawled-0.043
 hopped-0.043
Token "
Token ID366
Feature activation+2.06e-02

Pos logit contributions
 felt+0.011
ppy+0.011
 crawled+0.011
s+0.011
i+0.011
Neg logit contributions
 yours-0.015
 our-0.014
 purpose-0.013
 your-0.012
 pleasure-0.012
TokenOh
Token ID5812
Feature activation+5.78e-02

Pos logit contributions
 yours+0.041
 our+0.037
 purpose+0.037
 your+0.034
 pleasure+0.033
Neg logit contributions
i-0.041
ppy-0.041
 felt-0.040
owed-0.040
oped-0.039
Token no
Token ID645
Feature activation+5.79e-02

Pos logit contributions
 yours+0.128
 purpose+0.119
 our+0.112
 yourselves+0.110
 electricity+0.110
Neg logit contributions
 felt-0.100
 looked-0.095
 got-0.095
i-0.093
 forgot-0.093
Token,
Token ID11
Feature activation+6.03e-02

Pos logit contributions
 yours+0.128
 our+0.115
 purpose+0.109
 pleasure+0.106
 ours+0.106
Neg logit contributions
i-0.105
ie-0.102
 felt-0.101
s-0.100
y-0.100
Token I
Token ID314
Feature activation+2.57e-02

Pos logit contributions
 yours+0.109
 purpose+0.106
 pleasure+0.096
ags+0.094
 common+0.093
Neg logit contributions
 felt-0.118
ppy-0.117
 crawled-0.116
i-0.115
 got-0.114
Token'm
Token ID1101
Feature activation+6.28e-02

Pos logit contributions
 yours+0.061
 our+0.057
 purpose+0.055
 your+0.053
 pleasure+0.051
Neg logit contributions
i-0.040
owed-0.038
ling-0.038
cases-0.038
ppy-0.038
Token shrinking
Token ID27382
Feature activation+4.82e-02

Pos logit contributions
 yours+0.144
 purpose+0.137
 our+0.130
 pleasure+0.126
 yourselves+0.122
Neg logit contributions
ppy-0.098
 felt-0.097
i-0.097
 crawled-0.094
 climbed-0.094
Token!"
Token ID2474
Feature activation-3.49e-02

Pos logit contributions
 yours+0.099
 purpose+0.089
 pleasure+0.088
 our+0.087
 electricity+0.079
Neg logit contributions
i-0.092
ppy-0.088
ling-0.088
ie-0.087
y-0.086
Token she
Token ID673
Feature activation-9.15e-02

Pos logit contributions
ppy+0.053
i+0.053
oped+0.051
ling+0.049
owed+0.049
Neg logit contributions
 yours-0.085
 our-0.081
 purpose-0.081
 your-0.079
 my-0.075
Token cried
Token ID16896
Feature activation-1.28e-02

Pos logit contributions
oped+0.113
ited+0.110
bi+0.103
ll+0.099
ops+0.098
Neg logit contributions
 our-0.238
 my-0.227
 your-0.226
 purpose-0.221
 yours-0.218
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
oped+0.023
i+0.023
ppy+0.022
owed+0.022
 skipped+0.022
Neg logit contributions
 yours-0.027
 our-0.025
 purpose-0.025
 your-0.023
 pleasure-0.022
TokenAnna
Token ID31160
Feature activation-1.21e-01

Pos logit contributions
ppy+0.012
i+0.012
 felt+0.012
s+0.012
 crawled+0.012
Neg logit contributions
 yours-0.018
 our-0.017
 purpose-0.017
 pleasure-0.016
 your-0.015
Token and
Token ID290
Feature activation-3.29e-02

Pos logit contributions
cases+0.160
hetically+0.150
ppy+0.145
ling+0.145
owed+0.144
Neg logit contributions
 yours-0.302
 our-0.302
 your-0.290
 purpose-0.282
 my-0.272
Token Ben
Token ID3932
Feature activation-1.02e-01

Pos logit contributions
i+0.069
ppy+0.069
 felt+0.068
 crawled+0.067
owed+0.067
Neg logit contributions
 yours-0.062
 our-0.056
 purpose-0.055
 your-0.050
 pleasure-0.049
Token like
Token ID588
Feature activation+4.45e-02

Pos logit contributions
ches+0.123
owed+0.122
ppy+0.122
i+0.122
ling+0.120
Neg logit contributions
 yours-0.276
 our-0.265
 your-0.249
 purpose-0.247
 pleasure-0.237
Token to
Token ID284
Feature activation+3.74e-02

Pos logit contributions
 yours+0.102
 purpose+0.092
 our+0.091
 pleasure+0.091
 yourselves+0.088
Neg logit contributions
s-0.073
 walked-0.071
 hopped-0.071
 skipped-0.070
 felt-0.070
Token play
Token ID711
Feature activation+4.31e-02

Pos logit contributions
 yours+0.084
 our+0.076
 purpose+0.073
 pleasure+0.070
 ours+0.069
Neg logit contributions
 felt-0.065
 crawled-0.064
i-0.064
ppy-0.064
 hopped-0.064
Token in
Token ID287
Feature activation+3.85e-02

Pos logit contributions
 yours+0.120
 pleasure+0.111
 purpose+0.109
 our+0.105
 ours+0.105
Neg logit contributions
s-0.052
ling-0.051
 hopped-0.051
 crawled-0.051
 felt-0.050
Token the
Token ID262
Feature activation+3.20e-02

Pos logit contributions
 yours+0.083
 pleasure+0.076
 purpose+0.070
 ours+0.066
 electricity+0.065
Neg logit contributions
 walked-0.070
 felt-0.069
 looked-0.068
 crawled-0.068
 hopped-0.068
Token pool
Token ID5933
Feature activation-1.47e-02

Pos logit contributions
 yours+0.080
 pleasure+0.073
 purpose+0.070
 our+0.067
 ours+0.066
Neg logit contributions
's-0.046
 hopped-0.045
 felt-0.045
s-0.045
 crawled-0.045
Token.
Token ID13
Feature activation+6.51e-03

Pos logit contributions
i+0.023
ppy+0.023
 felt+0.023
oped+0.022
owed+0.022
Neg logit contributions
 yours-0.035
 our-0.033
 purpose-0.032
 your-0.030
 pleasure-0.029
Token They
Token ID1119
Feature activation-4.31e-02

Pos logit contributions
 yours+0.015
 our+0.014
 purpose+0.014
 your+0.013
 pleasure+0.013
Neg logit contributions
ppy-0.011
i-0.011
 felt-0.011
 crawled-0.010
s-0.010
Token playful
Token ID34264
Feature activation+2.10e-02

Pos logit contributions
 yours+0.052
 purpose+0.044
 our+0.044
 yourselves+0.039
 your+0.039
Neg logit contributions
 felt-0.076
s-0.074
i-0.074
ling-0.073
 forgot-0.073
Token thing
Token ID1517
Feature activation-1.53e-02

Pos logit contributions
 yours+0.040
 our+0.036
 purpose+0.034
 ours+0.033
 your+0.032
Neg logit contributions
 felt-0.043
 hopped-0.043
i-0.042
ie-0.042
y-0.041
Token they
Token ID484
Feature activation-1.96e-02

Pos logit contributions
i+0.026
ppy+0.025
 felt+0.025
s+0.024
owed+0.024
Neg logit contributions
 yours-0.035
 our-0.032
 purpose-0.032
 your-0.029
 pleasure-0.029
Token had
Token ID550
Feature activation+4.71e-02

Pos logit contributions
i+0.020
ppy+0.020
owed+0.020
oped+0.019
s+0.019
Neg logit contributions
 yours-0.058
 our-0.054
 purpose-0.052
 your-0.051
 pleasure-0.050
Token ever
Token ID1683
Feature activation+2.24e-02

Pos logit contributions
 yours+0.104
 our+0.095
 purpose+0.094
 your+0.088
 pleasure+0.087
Neg logit contributions
i-0.083
ppy-0.082
 felt-0.081
 crawled-0.079
s-0.079
Token done
Token ID1760
Feature activation+4.99e-02

Pos logit contributions
 yours+0.054
 our+0.050
 purpose+0.049
 your+0.046
 pleasure+0.045
Neg logit contributions
ppy-0.034
i-0.034
oped-0.034
owed-0.033
ches-0.033
Token,
Token ID11
Feature activation+9.39e-03

Pos logit contributions
 yours+0.128
 our+0.117
 purpose+0.115
 pleasure+0.110
 your+0.109
Neg logit contributions
i-0.070
 felt-0.069
ppy-0.068
s-0.068
owed-0.066
Token and
Token ID290
Feature activation+1.41e-02

Pos logit contributions
 yours+0.026
 our+0.025
 purpose+0.024
 your+0.023
 pleasure+0.023
Neg logit contributions
i-0.011
ppy-0.011
 felt-0.011
 crawled-0.010
owed-0.010
Token it
Token ID340
Feature activation-1.93e-02

Pos logit contributions
 yours+0.031
 our+0.028
 purpose+0.028
 your+0.026
 pleasure+0.026
Neg logit contributions
i-0.025
 felt-0.025
ppy-0.024
s-0.024
 hopped-0.024
Token made
Token ID925
Feature activation+5.18e-02

Pos logit contributions
ppy+0.022
owed+0.021
oped+0.021
i+0.021
ony+0.020
Neg logit contributions
 yours-0.054
 our-0.053
 purpose-0.050
 your-0.049
 my-0.048
Token their
Token ID511
Feature activation+1.53e-02

Pos logit contributions
 yours+0.084
 our+0.071
 purpose+0.071
 pleasure+0.063
 your+0.063
Neg logit contributions
i-0.123
 felt-0.121
ppy-0.120
s-0.118
ie-0.118
Token learn
Token ID2193
Feature activation+7.39e-02

Pos logit contributions
 yours+0.083
 our+0.076
 purpose+0.074
 pleasure+0.071
 your+0.069
Neg logit contributions
ppy-0.060
i-0.059
 felt-0.059
 crawled-0.057
s-0.057
Token more
Token ID517
Feature activation+5.85e-02

Pos logit contributions
 yours+0.156
 pleasure+0.150
 purpose+0.146
 inconvenience+0.138
 electricity+0.138
Neg logit contributions
s-0.122
's-0.120
es-0.114
 felt-0.106
ches-0.104
Token math
Token ID10688
Feature activation+2.49e-02

Pos logit contributions
 yours+0.142
 purpose+0.128
 pleasure+0.126
 our+0.126
 yourselves+0.120
Neg logit contributions
s-0.086
es-0.083
's-0.079
ling-0.076
cases-0.076
Token and
Token ID290
Feature activation+4.51e-02

Pos logit contributions
 yours+0.055
 purpose+0.050
 our+0.050
 pleasure+0.047
 your+0.046
Neg logit contributions
s-0.042
ppy-0.041
es-0.041
 felt-0.041
i-0.040
Token have
Token ID423
Feature activation+8.82e-02

Pos logit contributions
 yours+0.094
 purpose+0.085
 our+0.083
 pleasure+0.077
 ours+0.075
Neg logit contributions
 felt-0.085
 crawled-0.081
i-0.080
 hopped-0.080
 forgot-0.079
Token fun
Token ID1257
Feature activation+4.68e-02

Pos logit contributions
 yours+0.143
arl+0.130
 purpose+0.130
 yourselves+0.129
 inconvenience+0.118
Neg logit contributions
 felt-0.198
's-0.194
 looked-0.190
 got-0.186
 climbed-0.183
Token too
Token ID1165
Feature activation+6.35e-02

Pos logit contributions
 yours+0.115
 purpose+0.106
 pleasure+0.104
 our+0.103
 electricity+0.100
Neg logit contributions
es-0.066
s-0.061
's-0.060
ie-0.059
i-0.059
Token!"
Token ID2474
Feature activation-2.11e-02

Pos logit contributions
 yours+0.125
 purpose+0.111
 our+0.109
 pleasure+0.105
 electricity+0.102
Neg logit contributions
s-0.123
i-0.120
y-0.118
ppy-0.115
 felt-0.114
Token Tom
Token ID4186
Feature activation-1.32e-01

Pos logit contributions
ppy+0.057
ches+0.053
ling+0.053
i+0.052
ie+0.051
Neg logit contributions
 yours-0.031
 our-0.030
 your-0.027
 purpose-0.027
 my-0.024
Token was
Token ID373
Feature activation+5.59e-03

Pos logit contributions
oped+0.190
ppy+0.182
oping+0.180
mun+0.177
ling+0.177
Neg logit contributions
 our-0.295
 my-0.290
 your-0.281
 yours-0.272
 pleasure-0.270
Token so
Token ID523
Feature activation+2.77e-02

Pos logit contributions
 yours+0.013
 purpose+0.012
 our+0.012
 your+0.011
 pleasure+0.011
Neg logit contributions
 felt-0.009
i-0.009
ppy-0.009
 crawled-0.009
owed-0.008
Token "
Token ID366
Feature activation+2.19e-02

Pos logit contributions
ppy+0.076
i+0.076
owed+0.073
oped+0.073
 crawled+0.072
Neg logit contributions
 yours-0.067
 our-0.061
 purpose-0.059
 your-0.056
 pleasure-0.053
TokenI
Token ID40
Feature activation+1.85e-02

Pos logit contributions
 yours+0.046
 our+0.043
 purpose+0.043
 your+0.040
 yourselves+0.038
Neg logit contributions
i-0.041
ppy-0.041
owed-0.040
 crawled-0.039
oped-0.038
Token lost
Token ID2626
Feature activation+6.17e-02

Pos logit contributions
 yours+0.038
 our+0.036
 purpose+0.034
 your+0.033
 pleasure+0.031
Neg logit contributions
i-0.034
ppy-0.034
owed-0.033
oped-0.032
 crawled-0.032
Token my
Token ID616
Feature activation+3.08e-02

Pos logit contributions
 pleasure+0.077
 yours+0.062
 inconvenience+0.062
 purpose+0.057
 Enjoy+0.057
Neg logit contributions
's-0.148
ie-0.148
y-0.144
i-0.142
my-0.141
Token favorite
Token ID4004
Feature activation+3.64e-02

Pos logit contributions
 yours+0.066
 our+0.061
 purpose+0.056
 pleasure+0.055
 your+0.054
Neg logit contributions
 felt-0.055
 crawled-0.052
i-0.052
 forgot-0.051
 skipped-0.051
Token toy
Token ID13373
Feature activation+2.23e-02

Pos logit contributions
 yours+0.074
 our+0.069
 pleasure+0.068
 purpose+0.064
 electricity+0.064
Neg logit contributions
ie-0.064
y-0.063
 felt-0.063
s-0.062
 crawled-0.062
Token."
Token ID526
Feature activation-1.85e-02

Pos logit contributions
 yours+0.051
 purpose+0.047
 our+0.047
 pleasure+0.045
 your+0.043
Neg logit contributions
ppy-0.037
i-0.036
s-0.036
 felt-0.036
y-0.035
Token Tim
Token ID5045
Feature activation-1.36e-01

Pos logit contributions
ppy+0.039
i+0.039
oped+0.038
owed+0.037
 crawled+0.037
Neg logit contributions
 yours-0.035
 our-0.032
 purpose-0.032
 your-0.029
 pleasure-0.028
Tokenmy
Token ID1820
Feature activation-1.21e-01

Pos logit contributions
cases+0.245
oped+0.233
op+0.208
oping+0.207
ops+0.198
Neg logit contributions
 yours-0.282
 my-0.278
 your-0.273
 our-0.269
 purpose-0.257
Token said
Token ID531
Feature activation-1.96e-02

Pos logit contributions
oped+0.199
cases+0.191
 pads+0.187
oping+0.181
ops+0.180
Neg logit contributions
 our-0.255
 my-0.245
 your-0.244
 purpose-0.215
 magic-0.214
Token,
Token ID11
Feature activation-2.40e-02

Pos logit contributions
 felt+0.026
i+0.025
ppy+0.025
s+0.025
owed+0.024
Neg logit contributions
 yours-0.053
 our-0.049
 purpose-0.048
 pleasure-0.045
 your-0.045
Token her
Token ID607
Feature activation-9.14e-03

Pos logit contributions
 yours+0.025
 our+0.023
 purpose+0.022
 your+0.021
 my+0.019
Neg logit contributions
oped-0.029
cases-0.028
ppy-0.028
i-0.027
 felt-0.027
Token read
Token ID1100
Feature activation+3.24e-02

Pos logit contributions
ppy+0.016
oped+0.016
i+0.016
 felt+0.015
ling+0.015
Neg logit contributions
 yours-0.021
 our-0.019
 purpose-0.019
 your-0.018
 pleasure-0.017
Token the
Token ID262
Feature activation+1.92e-02

Pos logit contributions
 yours+0.065
 pleasure+0.059
 purpose+0.059
 our+0.058
 yourselves+0.055
Neg logit contributions
 felt-0.060
s-0.059
i-0.058
ie-0.058
 looked-0.057
Token note
Token ID3465
Feature activation-2.51e-02

Pos logit contributions
 yours+0.047
 our+0.043
 purpose+0.041
 your+0.040
 pleasure+0.040
Neg logit contributions
 felt-0.029
 crawled-0.028
s-0.027
 looked-0.027
's-0.027
Token.
Token ID13
Feature activation-8.77e-03

Pos logit contributions
oped+0.044
i+0.044
ppy+0.044
 felt+0.042
 crawled+0.042
Neg logit contributions
 yours-0.057
 our-0.051
 purpose-0.050
 your-0.048
 my-0.045
Token Her
Token ID2332
Feature activation-2.27e-02

Pos logit contributions
i+0.014
ppy+0.014
 felt+0.014
oped+0.013
ling+0.013
Neg logit contributions
 yours-0.021
 our-0.019
 purpose-0.019
 your-0.018
 pleasure-0.018
Token mom
Token ID1995
Feature activation-1.17e-01

Pos logit contributions
 felt+0.045
i+0.044
 crawled+0.043
ppy+0.043
s+0.043
Neg logit contributions
 yours-0.047
 our-0.042
 purpose-0.040
 your-0.038
 pleasure-0.037
Token read
Token ID1100
Feature activation+8.46e-03

Pos logit contributions
 pads+0.183
ppy+0.176
ling+0.174
cases+0.165
oping+0.163
Neg logit contributions
 gift-0.219
 my-0.214
 our-0.214
 yours-0.209
 your-0.208
Token it
Token ID340
Feature activation-2.29e-02

Pos logit contributions
 yours+0.016
 purpose+0.014
 our+0.014
 pleasure+0.013
 yourselves+0.012
Neg logit contributions
 felt-0.018
s-0.017
i-0.017
 crawled-0.017
ppy-0.017
Token out
Token ID503
Feature activation+9.93e-04

Pos logit contributions
oped+0.041
i+0.041
ppy+0.041
ited+0.039
 forgot+0.038
Neg logit contributions
 yours-0.049
 our-0.046
 purpose-0.046
 your-0.043
 pleasure-0.041
Token loud
Token ID7812
Feature activation-9.64e-03

Pos logit contributions
 yours+0.002
 our+0.002
 purpose+0.002
 pleasure+0.002
 your+0.002
Neg logit contributions
 felt-0.002
i-0.002
 crawled-0.002
ppy-0.002
s-0.002
Token.
Token ID13
Feature activation+4.69e-03

Pos logit contributions
ppy+0.025
i+0.025
oped+0.024
owed+0.024
ches+0.023
Neg logit contributions
 yours-0.033
 our-0.030
 purpose-0.029
 your-0.028
 pleasure-0.027
Token She
Token ID1375
Feature activation-6.97e-02

Pos logit contributions
 yours+0.014
 our+0.013
 purpose+0.013
 your+0.012
 ours+0.012
Neg logit contributions
ppy-0.005
 felt-0.005
s-0.004
 crawled-0.004
i-0.004
Token said
Token ID531
Feature activation+5.05e-03

Pos logit contributions
hetically+0.101
ling+0.098
oped+0.096
oping+0.094
ops+0.088
Neg logit contributions
 yours-0.153
 our-0.152
 your-0.146
 my-0.143
 value-0.140
Token,
Token ID11
Feature activation-2.21e-02

Pos logit contributions
 yours+0.015
 purpose+0.014
 our+0.014
 ours+0.013
 pleasure+0.013
Neg logit contributions
s-0.005
 felt-0.005
es-0.005
i-0.005
owed-0.005
Token "
Token ID366
Feature activation+2.47e-02

Pos logit contributions
ppy+0.041
 felt+0.040
i+0.040
 crawled+0.039
s+0.039
Neg logit contributions
 yours-0.048
 our-0.043
 purpose-0.043
 your-0.039
 pleasure-0.039
TokenYes
Token ID5297
Feature activation+2.81e-02

Pos logit contributions
 yours+0.058
 our+0.053
 purpose+0.052
 your+0.049
 pleasure+0.048
Neg logit contributions
i-0.041
ppy-0.041
 felt-0.040
owed-0.039
oped-0.039
Token,
Token ID11
Feature activation+4.82e-02

Pos logit contributions
 yours+0.083
 purpose+0.079
 our+0.075
 electricity+0.075
 pleasure+0.075
Neg logit contributions
i-0.028
s-0.027
ie-0.027
ppy-0.026
 felt-0.025
Token Mom
Token ID11254
Feature activation-4.12e-02

Pos logit contributions
 purpose+0.094
 yours+0.091
 our+0.081
 yourselves+0.081
 electricity+0.080
Neg logit contributions
 felt-0.093
ppy-0.092
's-0.091
 crawled-0.090
i-0.090
Token.
Token ID13
Feature activation+4.77e-02

Pos logit contributions
ppy+0.059
i+0.059
 felt+0.058
oped+0.057
 crawled+0.057
Neg logit contributions
 yours-0.105
 our-0.097
 purpose-0.095
 your-0.090
 pleasure-0.088
Token That
Token ID1320
Feature activation+3.11e-02

Pos logit contributions
 yours+0.098
 purpose+0.093
 our+0.086
 ours+0.082
 electricity+0.079
Neg logit contributions
ppy-0.090
s-0.088
owed-0.086
 hopped-0.085
es-0.085
Token sounds
Token ID5238
Feature activation+4.18e-02

Pos logit contributions
 yours+0.071
 our+0.065
 purpose+0.064
 your+0.060
 pleasure+0.059
Neg logit contributions
ppy-0.053
i-0.053
 felt-0.051
oped-0.051
owed-0.051
Token They
Token ID1119
Feature activation-4.70e-02

Pos logit contributions
 yours+0.017
 purpose+0.016
 our+0.015
 yourselves+0.015
 pleasure+0.014
Neg logit contributions
s-0.010
 felt-0.009
 walked-0.009
 hopped-0.009
cases-0.009
Token wanted
Token ID2227
Feature activation+5.34e-02

Pos logit contributions
ling+0.066
int+0.062
oped+0.060
ches+0.058
oping+0.057
Neg logit contributions
 our-0.107
 yours-0.105
 your-0.099
 respect-0.093
 common-0.093
Token to
Token ID284
Feature activation+5.28e-02

Pos logit contributions
 yours+0.126
 purpose+0.114
 yourselves+0.112
 pleasure+0.111
 ours+0.109
Neg logit contributions
s-0.091
 felt-0.089
 got-0.085
 walked-0.085
 hopped-0.083
Token show
Token ID905
Feature activation+3.32e-02

Pos logit contributions
 yours+0.128
 purpose+0.111
 our+0.111
 pleasure+0.108
 electricity+0.107
Neg logit contributions
 crawled-0.082
 felt-0.081
 climbed-0.079
 ran-0.079
 got-0.078
Token him
Token ID683
Feature activation-1.07e-02

Pos logit contributions
 yours+0.069
 purpose+0.068
 pleasure+0.062
 yourselves+0.059
 electricity+0.058
Neg logit contributions
s-0.059
 felt-0.058
i-0.056
 ran-0.056
's-0.055
Token to
Token ID284
Feature activation+3.48e-02

Pos logit contributions
i+0.022
 felt+0.022
s+0.022
ppy+0.022
 crawled+0.022
Neg logit contributions
 yours-0.020
 purpose-0.018
 our-0.017
 pleasure-0.017
 electricity-0.016
Token their
Token ID511
Feature activation+1.87e-02

Pos logit contributions
 yours+0.070
 purpose+0.063
 our+0.059
 pleasure+0.059
 ours+0.058
Neg logit contributions
 felt-0.068
i-0.065
 hopped-0.063
 ran-0.063
 walked-0.062
Token mom
Token ID1995
Feature activation-6.40e-02

Pos logit contributions
 yours+0.044
 our+0.036
 purpose+0.036
 pleasure+0.035
 yourselves+0.034
Neg logit contributions
i-0.031
 crawled-0.030
s-0.030
 felt-0.030
 skipped-0.029
Token and
Token ID290
Feature activation-5.87e-03

Pos logit contributions
oped+0.099
 struggled+0.093
ited+0.091
 crawled+0.090
 screamed+0.090
Neg logit contributions
 yours-0.161
 our-0.152
 your-0.144
 purpose-0.136
 my-0.136
Token dad
Token ID9955
Feature activation-6.02e-02

Pos logit contributions
 felt+0.012
i+0.012
ppy+0.012
 crawled+0.011
 hopped+0.011
Neg logit contributions
 yours-0.012
 purpose-0.011
 our-0.010
 pleasure-0.010
 your-0.009
Token later
Token ID1568
Feature activation+5.82e-03

Pos logit contributions
oped+0.106
ppy+0.103
 crawled+0.100
i+0.100
 felt+0.099
Neg logit contributions
 yours-0.134
 our-0.126
 your-0.117
 purpose-0.115
 pleasure-0.108
Token liked
Token ID8288
Feature activation+4.67e-02

Pos logit contributions
int+0.055
owed+0.053
oped+0.051
i+0.050
cases+0.050
Neg logit contributions
 yours-0.127
 our-0.121
 your-0.114
 purpose-0.109
 electricity-0.106
Token to
Token ID284
Feature activation+4.57e-02

Pos logit contributions
 yours+0.101
 purpose+0.093
 pleasure+0.092
 our+0.087
 yourselves+0.084
Neg logit contributions
 felt-0.083
s-0.081
 climbed-0.081
 crawled-0.080
 ran-0.080
Token make
Token ID787
Feature activation+4.92e-02

Pos logit contributions
 yours+0.106
 our+0.097
 purpose+0.096
 pleasure+0.091
 your+0.089
Neg logit contributions
 felt-0.075
ppy-0.074
 crawled-0.074
i-0.073
 climbed-0.072
Token towers
Token ID18028
Feature activation-2.28e-02

Pos logit contributions
 yours+0.094
 purpose+0.083
 our+0.080
 pleasure+0.079
 your+0.071
Neg logit contributions
 felt-0.101
s-0.100
ie-0.098
ling-0.098
i-0.097
Token with
Token ID351
Feature activation+5.00e-02

Pos logit contributions
oped+0.031
ppy+0.030
i+0.029
ited+0.029
 hopped+0.028
Neg logit contributions
 yours-0.061
 our-0.059
 purpose-0.055
 your-0.055
 my-0.052
Token the
Token ID262
Feature activation+1.36e-02

Pos logit contributions
 yours+0.098
 pleasure+0.094
 purpose+0.089
 Goodbye+0.079
 yourselves+0.078
Neg logit contributions
 felt-0.096
ie-0.096
 looked-0.095
s-0.092
 ran-0.092
Token blocks
Token ID7021
Feature activation-2.76e-02

Pos logit contributions
 yours+0.029
 our+0.026
 purpose+0.026
 pleasure+0.024
 your+0.024
Neg logit contributions
 felt-0.026
 crawled-0.024
s-0.024
ppy-0.024
 climbed-0.024
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
i+0.044
oped+0.044
ppy+0.043
owed+0.042
 hopped+0.042
Neg logit contributions
 yours-0.067
 our-0.062
 purpose-0.059
 your-0.057
 ours-0.054
Token They
Token ID1119
Feature activation-4.45e-02

Pos logit contributions
 yours+0.032
 purpose+0.028
 our+0.028
 your+0.026
 ours+0.026
Neg logit contributions
s-0.021
 felt-0.020
ppy-0.020
i-0.020
 crawled-0.019
Token tried
Token ID3088
Feature activation+4.96e-02

Pos logit contributions
int+0.061
owed+0.060
oped+0.059
ling+0.058
ony+0.057
Neg logit contributions
 yours-0.110
 our-0.105
 your-0.098
 electricity-0.092
 purpose-0.092
Token to
Token ID284
Feature activation+4.29e-02

Pos logit contributions
 yours+0.099
 our+0.085
 pleasure+0.085
 purpose+0.084
 ours+0.078
Neg logit contributions
 felt-0.099
s-0.099
y-0.096
i-0.095
 crawled-0.095
Token,
Token ID11
Feature activation-9.86e-03

Pos logit contributions
ppy+0.060
oped+0.058
 forgot+0.057
 climbed+0.056
 crawled+0.056
Neg logit contributions
 yours-0.097
 our-0.089
 purpose-0.087
 your-0.084
 my-0.081
Token it
Token ID340
Feature activation-4.13e-02

Pos logit contributions
ppy+0.011
i+0.011
 felt+0.011
owed+0.011
oped+0.011
Neg logit contributions
 yours-0.028
 our-0.026
 purpose-0.026
 your-0.024
 pleasure-0.024
Token was
Token ID373
Feature activation+3.12e-02

Pos logit contributions
ppy+0.052
ony+0.047
ppa+0.047
oped+0.046
i+0.046
Neg logit contributions
 yours-0.095
 purpose-0.094
 our-0.093
 your-0.089
 power-0.089
Token a
Token ID257
Feature activation+3.43e-02

Pos logit contributions
 yours+0.063
 our+0.058
 purpose+0.057
 yourselves+0.052
 pleasure+0.052
Neg logit contributions
 felt-0.060
i-0.060
 crawled-0.059
 hopped-0.058
ppy-0.058
Token little
Token ID1310
Feature activation+5.87e-03

Pos logit contributions
 yours+0.074
 our+0.067
 yourselves+0.063
 purpose+0.062
 ours+0.060
Neg logit contributions
 felt-0.063
 hopped-0.063
 crawled-0.062
 skipped-0.060
i-0.060
Token bunny
Token ID44915
Feature activation-6.41e-02

Pos logit contributions
 yours+0.013
 our+0.012
 purpose+0.012
 your+0.011
 pleasure+0.011
Neg logit contributions
i-0.010
ppy-0.010
 felt-0.010
 crawled-0.010
s-0.010
Token.
Token ID13
Feature activation-6.98e-03

Pos logit contributions
cases+0.121
ited+0.115
 felt+0.115
oped+0.115
owed+0.114
Neg logit contributions
 yours-0.133
 our-0.121
 purpose-0.115
 your-0.112
 my-0.105
Token She
Token ID1375
Feature activation-6.76e-02

Pos logit contributions
ppy+0.012
i+0.012
 felt+0.012
s+0.012
 crawled+0.012
Neg logit contributions
 yours-0.016
 our-0.014
 purpose-0.014
 your-0.013
 pleasure-0.013
Token looked
Token ID3114
Feature activation+1.30e-02

Pos logit contributions
cases+0.078
ppy+0.075
hetically+0.075
ops+0.073
ited+0.073
Neg logit contributions
 our-0.160
 your-0.156
 my-0.155
 yours-0.154
 purpose-0.140
Token at
Token ID379
Feature activation+7.28e-03

Pos logit contributions
 yours+0.033
 our+0.030
 purpose+0.030
 your+0.028
 pleasure+0.028
Neg logit contributions
ppy-0.019
i-0.019
 felt-0.018
owed-0.018
oped-0.018
Token the
Token ID262
Feature activation+6.67e-03

Pos logit contributions
 yours+0.013
 purpose+0.012
 our+0.012
 pleasure+0.011
 your+0.010
Neg logit contributions
i-0.016
 felt-0.015
ie-0.015
ppy-0.015
s-0.015
Token something
Token ID1223
Feature activation+3.04e-02

Pos logit contributions
 yours+0.097
 purpose+0.083
 our+0.083
 pleasure+0.077
 ours+0.076
Neg logit contributions
 felt-0.102
i-0.101
s-0.099
ie-0.098
ling-0.098
Token beautiful
Token ID4950
Feature activation+1.66e-02

Pos logit contributions
 yours+0.053
 purpose+0.047
 our+0.045
 yourselves+0.042
 pleasure+0.041
Neg logit contributions
s-0.068
 felt-0.067
i-0.065
ling-0.065
 hopped-0.063
Token.
Token ID13
Feature activation+6.49e-03

Pos logit contributions
 yours+0.044
 purpose+0.040
 our+0.040
 pleasure+0.038
 your+0.038
Neg logit contributions
i-0.021
 felt-0.021
s-0.021
y-0.020
ppy-0.020
Token
Token ID198
Feature activation+1.09e-03

Pos logit contributions
 yours+0.015
 our+0.014
 purpose+0.014
 your+0.013
 pleasure+0.013
Neg logit contributions
ppy-0.011
i-0.011
 felt-0.010
owed-0.010
 crawled-0.010
Token
Token ID198
Feature activation-9.14e-03

Pos logit contributions
 yours+0.002
 our+0.002
 purpose+0.002
 your+0.002
 pleasure+0.002
Neg logit contributions
i-0.002
ppy-0.002
owed-0.002
 felt-0.002
 crawled-0.002
TokenHe
Token ID1544
Feature activation-5.56e-02

Pos logit contributions
i+0.017
ppy+0.017
owed+0.016
 felt+0.016
ie+0.016
Neg logit contributions
 yours-0.020
 our-0.019
 purpose-0.018
 your-0.017
 pleasure-0.016
Token knew
Token ID2993
Feature activation+2.67e-02

Pos logit contributions
ppy+0.063
cases+0.058
i+0.057
owed+0.057
oped+0.056
Neg logit contributions
 yours-0.149
 our-0.147
 your-0.139
 my-0.134
 purpose-0.134
Token that
Token ID326
Feature activation+4.40e-02

Pos logit contributions
 yours+0.059
 purpose+0.055
 our+0.054
 pleasure+0.049
 ours+0.049
Neg logit contributions
i-0.046
 felt-0.045
 hopped-0.044
 crawled-0.043
ppy-0.043
Token he
Token ID339
Feature activation-3.77e-02

Pos logit contributions
 yours+0.102
 our+0.092
 purpose+0.092
 your+0.084
 pleasure+0.084
Neg logit contributions
 felt-0.072
i-0.072
 hopped-0.070
ppy-0.069
 crawled-0.069
Token had
Token ID550
Feature activation+5.27e-02

Pos logit contributions
ppy+0.039
cases+0.038
oped+0.038
i+0.037
owed+0.036
Neg logit contributions
 yours-0.107
 our-0.098
 your-0.093
 pleasure-0.092
 purpose-0.091
Token made
Token ID925
Feature activation+4.78e-02

Pos logit contributions
 yours+0.143
 our+0.133
 purpose+0.132
 pleasure+0.123
 your+0.123
Neg logit contributions
 felt-0.070
 crawled-0.065
 climbed-0.064
 hopped-0.064
i-0.064
Token,
Token ID11
Feature activation+3.36e-03

Pos logit contributions
 felt+0.009
i+0.009
s+0.008
ppy+0.008
 crawled+0.008
Neg logit contributions
 yours-0.021
 purpose-0.019
 our-0.019
 pleasure-0.018
 ours-0.018
Token a
Token ID257
Feature activation-7.18e-04

Pos logit contributions
 yours+0.008
 purpose+0.007
 pleasure+0.007
 our+0.007
 ours+0.006
Neg logit contributions
 felt-0.006
 got-0.006
 walked-0.006
 started-0.006
 crawled-0.006
Token big
Token ID1263
Feature activation-7.25e-03

Pos logit contributions
 felt+0.001
 got+0.001
s+0.001
 looked+0.001
 hopped+0.001
Neg logit contributions
 yours-0.002
 our-0.001
 purpose-0.001
 ours-0.001
 yourselves-0.001
Token boy
Token ID2933
Feature activation-1.13e-01

Pos logit contributions
 felt+0.012
 got+0.012
 hopped+0.012
ie+0.012
i+0.011
Neg logit contributions
 yours-0.017
 purpose-0.015
 our-0.015
 pleasure-0.014
 ours-0.014
Token pushed
Token ID7121
Feature activation-4.02e-03

Pos logit contributions
ppy+0.170
cases+0.165
hetically+0.157
ops+0.156
ches+0.154
Neg logit contributions
 our-0.264
 yours-0.255
 your-0.249
 my-0.229
 gift-0.221
Token Lily
Token ID20037
Feature activation-7.84e-02

Pos logit contributions
ppy+0.008
i+0.008
owed+0.008
oped+0.008
 felt+0.008
Neg logit contributions
 yours-0.008
 our-0.007
 purpose-0.007
 your-0.007
 my-0.006
Token and
Token ID290
Feature activation-3.95e-02

Pos logit contributions
i+0.144
ppy+0.142
ited+0.141
mun+0.140
int+0.140
Neg logit contributions
 yours-0.170
 our-0.157
 your-0.150
 purpose-0.144
 my-0.136
Token she
Token ID673
Feature activation-7.10e-02

Pos logit contributions
ppy+0.049
i+0.048
owed+0.048
oped+0.046
ling+0.046
Neg logit contributions
 yours-0.108
 our-0.104
 your-0.099
 purpose-0.098
 my-0.095
Token fell
Token ID3214
Feature activation+1.73e-03

Pos logit contributions
int+0.081
ppy+0.081
hetically+0.076
i+0.075
ited+0.075
Neg logit contributions
 our-0.186
 yours-0.176
 your-0.175
 my-0.168
 purpose-0.164
Token down
Token ID866
Feature activation+2.01e-02

Pos logit contributions
 yours+0.004
 our+0.004
 purpose+0.004
 pleasure+0.004
 your+0.004
Neg logit contributions
i-0.003
ppy-0.003
 felt-0.003
s-0.003
 crawled-0.003
Token.
Token ID13
Feature activation-1.00e-02

Pos logit contributions
 yours+0.054
 our+0.049
 purpose+0.049
 pleasure+0.047
 your+0.046
Neg logit contributions
 felt-0.026
i-0.026
s-0.026
 crawled-0.026
ppy-0.026
Token �
Token ID6184
Feature activation-6.69e-03

Pos logit contributions
ppy+0.026
i+0.026
 felt+0.026
 crawled+0.026
 hopped+0.026
Neg logit contributions
 yours-0.034
 our-0.030
 purpose-0.030
 your-0.028
 pleasure-0.027
Token�
Token ID95
Feature activation-2.36e-02

Pos logit contributions
i+0.011
ppy+0.011
oped+0.011
owed+0.011
 crawled+0.011
Neg logit contributions
 yours-0.016
 our-0.014
 purpose-0.014
 your-0.013
 pleasure-0.013
Token€
Token ID26391
Feature activation-2.19e-02

Pos logit contributions
ppy+0.043
i+0.042
 felt+0.041
 crawled+0.041
owed+0.041
Neg logit contributions
 yours-0.050
 our-0.047
 purpose-0.045
 your-0.042
 pleasure-0.041
Token�
Token ID129
Feature activation-1.07e-02

Pos logit contributions
ppy+0.034
i+0.034
ling+0.033
ie+0.032
owed+0.032
Neg logit contributions
 yours-0.052
 our-0.049
 purpose-0.048
 your-0.046
 pleasure-0.045
Token�
Token ID241
Feature activation+1.23e-02

Pos logit contributions
i+0.019
ppy+0.018
ches+0.018
 felt+0.018
 crawled+0.018
Neg logit contributions
 yours-0.024
 our-0.022
 purpose-0.021
 pleasure-0.020
 your-0.020
TokenJane
Token ID41083
Feature activation-8.39e-02

Pos logit contributions
 yours+0.028
 our+0.026
 purpose+0.025
 your+0.023
 pleasure+0.023
Neg logit contributions
 felt-0.020
i-0.020
s-0.020
ppy-0.020
 crawled-0.020
Token,
Token ID11
Feature activation+1.77e-02

Pos logit contributions
ppy+0.106
oped+0.103
cases+0.101
 crawled+0.100
ling+0.099
Neg logit contributions
 yours-0.219
 our-0.207
 your-0.199
 purpose-0.197
 my-0.193
Token how
Token ID703
Feature activation+6.28e-02

Pos logit contributions
 yours+0.035
 purpose+0.032
 our+0.031
 pleasure+0.028
 ours+0.027
Neg logit contributions
i-0.035
 felt-0.035
ppy-0.034
 crawled-0.034
 hopped-0.034
Token can
Token ID460
Feature activation+5.88e-02

Pos logit contributions
 yours+0.147
 our+0.136
 purpose+0.136
 pleasure+0.124
 your+0.123
Neg logit contributions
 felt-0.101
ppy-0.101
i-0.100
 crawled-0.097
s-0.097
Token you
Token ID345
Feature activation+3.43e-02

Pos logit contributions
 yours+0.110
 our+0.099
 purpose+0.097
 your+0.090
 pleasure+0.087
Neg logit contributions
ppy-0.124
i-0.124
 felt-0.121
oped-0.120
owed-0.120
Token tell
Token ID1560
Feature activation+5.21e-02

Pos logit contributions
 yours+0.072
 our+0.065
 purpose+0.064
 your+0.059
 pleasure+0.058
Neg logit contributions
i-0.064
ppy-0.064
 felt-0.063
 crawled-0.062
owed-0.062
Token ran
Token ID4966
Feature activation-2.09e-02

Pos logit contributions
ppy+0.066
ll+0.065
i+0.064
s+0.063
ony+0.063
Neg logit contributions
 our-0.128
 yours-0.124
 electricity-0.124
 respect-0.122
 purpose-0.121
Token to
Token ID284
Feature activation+1.96e-02

Pos logit contributions
ppy+0.041
i+0.041
owed+0.040
oped+0.040
 felt+0.039
Neg logit contributions
 yours-0.042
 our-0.039
 purpose-0.038
 your-0.036
 pleasure-0.034
Token help
Token ID1037
Feature activation+1.09e-02

Pos logit contributions
 yours+0.039
 purpose+0.034
 our+0.033
 pleasure+0.031
 your+0.030
Neg logit contributions
 felt-0.040
 crawled-0.039
i-0.039
 ran-0.038
 climbed-0.038
Token her
Token ID607
Feature activation-3.25e-02

Pos logit contributions
 yours+0.021
 our+0.019
 purpose+0.018
 your+0.017
 my+0.016
Neg logit contributions
ppy-0.024
oped-0.023
i-0.023
ches-0.022
 hopped-0.022
Token and
Token ID290
Feature activation-2.88e-02

Pos logit contributions
oped+0.062
ppy+0.060
ited+0.060
 skipped+0.058
ops+0.057
Neg logit contributions
 our-0.067
 yours-0.065
 purpose-0.065
 your-0.060
 pleasure-0.056
Token they
Token ID484
Feature activation-5.55e-02

Pos logit contributions
ppy+0.036
ops+0.034
oped+0.034
i+0.033
ony+0.033
Neg logit contributions
 our-0.077
 yours-0.076
 your-0.071
 purpose-0.069
 my-0.068
Token got
Token ID1392
Feature activation+3.03e-02

Pos logit contributions
int+0.069
ony+0.067
i+0.066
ops+0.065
ppy+0.065
Neg logit contributions
 our-0.133
 yours-0.129
 your-0.119
 pleasure-0.117
 gift-0.117
Token pieces
Token ID5207
Feature activation-1.77e-02

Pos logit contributions
 yours+0.061
 our+0.055
 purpose+0.054
 your+0.050
 pleasure+0.049
Neg logit contributions
i-0.060
ppy-0.059
 felt-0.059
 crawled-0.058
s-0.058
Token of
Token ID286
Feature activation+3.77e-02

Pos logit contributions
i+0.030
ppy+0.030
oped+0.029
 felt+0.029
owed+0.029
Neg logit contributions
 yours-0.040
 our-0.037
 purpose-0.035
 your-0.034
 pleasure-0.032
Token water
Token ID1660
Feature activation-1.18e-02

Pos logit contributions
 yours+0.073
 purpose+0.063
 our+0.059
 yourselves+0.057
 pleasure+0.057
Neg logit contributions
 felt-0.076
 crawled-0.075
 climbed-0.074
 looked-0.073
 hopped-0.072
Token buckets
Token ID38674
Feature activation-3.89e-02

Pos logit contributions
ppy+0.020
i+0.019
oped+0.019
 felt+0.019
owed+0.018
Neg logit contributions
 yours-0.027
 our-0.025
 purpose-0.025
 your-0.023
 pleasure-0.022
Token deep
Token ID2769
Feature activation+2.54e-02

Pos logit contributions
 yours+0.123
 purpose+0.119
 pleasure+0.116
 our+0.108
 yourselves+0.103
Neg logit contributions
ppy-0.126
ches-0.123
 crawled-0.123
 climbed-0.122
i-0.121
Token."
Token ID526
Feature activation-2.32e-02

Pos logit contributions
 yours+0.074
 purpose+0.070
 our+0.068
 pleasure+0.067
 your+0.064
Neg logit contributions
ppy-0.027
i-0.026
y-0.025
ling-0.025
 felt-0.025
Token
Token ID198
Feature activation-7.90e-03

Pos logit contributions
i+0.051
oped+0.050
ppy+0.050
oping+0.049
ited+0.048
Neg logit contributions
 our-0.041
 yours-0.040
 purpose-0.038
 your-0.038
 my-0.035
Token
Token ID198
Feature activation-1.64e-02

Pos logit contributions
i+0.016
ppy+0.016
oped+0.015
owed+0.015
 crawled+0.015
Neg logit contributions
 yours-0.015
 our-0.015
 purpose-0.014
 your-0.013
 my-0.012
TokenL
Token ID43
Feature activation-6.77e-02

Pos logit contributions
owed+0.032
i+0.031
oped+0.031
oping+0.030
cases+0.028
Neg logit contributions
 our-0.033
 purpose-0.033
 yours-0.032
 your-0.031
 my-0.030
Tokenily
Token ID813
Feature activation-1.40e-01

Pos logit contributions
oped+0.147
cases+0.142
s+0.142
i+0.142
 felt+0.141
Neg logit contributions
 yours-0.133
 our-0.112
 your-0.106
 purpose-0.104
 my-0.101
Token was
Token ID373
Feature activation+6.76e-03

Pos logit contributions
oped+0.262
oping+0.254
ge+0.251
bars+0.249
cases+0.247
Neg logit contributions
 our-0.235
 my-0.211
 your-0.210
 friendship-0.192
 fun-0.183
Token happy
Token ID3772
Feature activation+3.63e-04

Pos logit contributions
 yours+0.015
 our+0.014
 purpose+0.014
 your+0.013
 pleasure+0.013
Neg logit contributions
i-0.012
ppy-0.012
oped-0.012
s-0.011
owed-0.011
Token and
Token ID290
Feature activation-2.36e-02

Pos logit contributions
 yours+0.001
 our+0.001
 purpose+0.001
 your+0.001
 pleasure+0.001
Neg logit contributions
ppy-0.001
i-0.001
oped-0.001
owed-0.001
ches-0.001
Token started
Token ID2067
Feature activation+1.74e-02

Pos logit contributions
ppy+0.027
oped+0.027
mo+0.025
y+0.024
oping+0.024
Neg logit contributions
 our-0.068
 yours-0.066
 your-0.064
 my-0.062
 pleasure-0.058
Token to
Token ID284
Feature activation+2.23e-02

Pos logit contributions
 yours+0.032
 our+0.029
 purpose+0.028
 your+0.026
 pleasure+0.025
Neg logit contributions
i-0.038
ppy-0.037
 felt-0.037
 crawled-0.036
owed-0.036
Token
Token ID198
Feature activation-2.96e-03

Pos logit contributions
s+0.007
 felt+0.007
ppy+0.007
i+0.007
 crawled+0.006
Neg logit contributions
 yours-0.012
 our-0.011
 purpose-0.011
 your-0.010
 pleasure-0.010
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
i+0.005
ppy+0.005
 felt+0.005
owed+0.005
 crawled+0.005
Neg logit contributions
 yours-0.007
 our-0.006
 purpose-0.006
 your-0.006
 pleasure-0.005
TokenOne
Token ID3198
Feature activation-3.92e-03

Pos logit contributions
i+0.026
ppy+0.026
 felt+0.026
owed+0.025
oped+0.025
Neg logit contributions
 yours-0.029
 our-0.027
 purpose-0.026
 your-0.024
 pleasure-0.024
Token day
Token ID1110
Feature activation+1.01e-02

Pos logit contributions
ppy+0.008
i+0.008
oped+0.008
 felt+0.008
owed+0.008
Neg logit contributions
 yours-0.008
 our-0.007
 purpose-0.007
 your-0.006
 pleasure-0.006
Token,
Token ID11
Feature activation+1.39e-02

Pos logit contributions
 yours+0.029
 purpose+0.026
 our+0.026
 pleasure+0.025
 ours+0.024
Neg logit contributions
 felt-0.012
s-0.012
i-0.012
ppy-0.012
 crawled-0.011
Token Lily
Token ID20037
Feature activation-1.13e-01

Pos logit contributions
 yours+0.031
 purpose+0.028
 our+0.027
 pleasure+0.026
 electricity+0.025
Neg logit contributions
 felt-0.025
s-0.024
 got-0.024
 crawled-0.024
i-0.024
Token's
Token ID338
Feature activation-6.32e-03

Pos logit contributions
hetically+0.169
owed+0.163
oped+0.152
ony+0.145
int+0.143
Neg logit contributions
 our-0.278
 your-0.277
 yours-0.262
 my-0.258
 purpose-0.244
Token friend
Token ID1545
Feature activation-7.29e-02

Pos logit contributions
 felt+0.011
s+0.011
i+0.011
ppy+0.011
 skipped+0.011
Neg logit contributions
 yours-0.014
 our-0.013
 purpose-0.012
 yourselves-0.011
 your-0.011
Token Tim
Token ID5045
Feature activation-1.16e-01

Pos logit contributions
oped+0.132
owed+0.132
ppy+0.131
ited+0.130
ony+0.130
Neg logit contributions
 yours-0.149
 our-0.145
 your-0.138
 purpose-0.131
 my-0.129
Tokenmy
Token ID1820
Feature activation-9.62e-02

Pos logit contributions
cases+0.143
oped+0.125
ops+0.123
ony+0.123
owed+0.121
Neg logit contributions
 yours-0.340
 our-0.313
 your-0.307
 my-0.287
 purpose-0.279
Token came
Token ID1625
Feature activation-1.47e-02

Pos logit contributions
hetically+0.121
owed+0.119
cases+0.117
ony+0.116
int+0.111
Neg logit contributions
 yours-0.255
 our-0.248
 your-0.238
 my-0.223
 purpose-0.219
Token free
Token ID1479
Feature activation+5.19e-02

Pos logit contributions
 yours+0.104
 pleasure+0.100
 purpose+0.100
 electricity+0.092
 our+0.091
Neg logit contributions
s-0.065
ches-0.063
 felt-0.063
y-0.063
ppy-0.063
Token."
Token ID526
Feature activation-2.75e-02

Pos logit contributions
 yours+0.107
 purpose+0.106
 pleasure+0.103
 our+0.096
 electricity+0.094
Neg logit contributions
ppy-0.090
y-0.089
i-0.088
s-0.087
ie-0.085
Token
Token ID198
Feature activation-7.23e-04

Pos logit contributions
ppy+0.060
i+0.059
oped+0.057
ling+0.056
owed+0.056
Neg logit contributions
 yours-0.048
 our-0.048
 purpose-0.047
 your-0.046
 my-0.042
Token
Token ID198
Feature activation-9.72e-03

Pos logit contributions
i+0.001
ppy+0.001
owed+0.001
oped+0.001
 crawled+0.001
Neg logit contributions
 yours-0.001
 our-0.001
 purpose-0.001
 your-0.001
 my-0.001
TokenL
Token ID43
Feature activation-6.76e-02

Pos logit contributions
owed+0.018
i+0.018
oped+0.018
ppy+0.017
cases+0.017
Neg logit contributions
 purpose-0.020
 our-0.020
 yours-0.020
 your-0.019
 my-0.018
Tokenily
Token ID813
Feature activation-1.38e-01

Pos logit contributions
oped+0.145
s+0.141
 hopped+0.141
i+0.140
 walked+0.138
Neg logit contributions
 yours-0.133
 our-0.110
 your-0.107
 purpose-0.106
 yourselves-0.100
Token was
Token ID373
Feature activation+1.74e-03

Pos logit contributions
oped+0.238
cases+0.233
oping+0.227
mo+0.225
ules+0.220
Neg logit contributions
 our-0.226
 purpose-0.226
 your-0.225
 my-0.220
 respect-0.204
Token sad
Token ID6507
Feature activation+2.41e-02

Pos logit contributions
 yours+0.004
 our+0.004
 purpose+0.003
 your+0.003
 my+0.003
Neg logit contributions
ppy-0.003
oped-0.003
i-0.003
ops-0.003
s-0.003
Token but
Token ID475
Feature activation-4.04e-02

Pos logit contributions
 yours+0.061
 our+0.057
 purpose+0.056
 your+0.054
 pleasure+0.051
Neg logit contributions
ppy-0.035
oped-0.034
i-0.034
owed-0.034
 crawled-0.033
Token understood
Token ID7247
Feature activation+4.36e-03

Pos logit contributions
oped+0.088
ches+0.083
ops+0.083
cases+0.082
os+0.081
Neg logit contributions
 your-0.075
 yours-0.074
 our-0.073
 my-0.070
 pleasure-0.061
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
 yours+0.009
 our+0.009
 purpose+0.008
 your+0.008
 my+0.007
Neg logit contributions
ppy-0.008
oped-0.008
i-0.008
owed-0.008
 felt-0.008
Token milk
Token ID7545
Feature activation+3.04e-02

Pos logit contributions
 yours+0.107
 purpose+0.096
 our+0.095
 pleasure+0.091
 yourselves+0.087
Neg logit contributions
s-0.093
ie-0.092
 felt-0.092
i-0.091
 started-0.089
Token please
Token ID3387
Feature activation+4.88e-02

Pos logit contributions
 yours+0.057
 purpose+0.052
 our+0.050
 pleasure+0.048
 electricity+0.047
Neg logit contributions
ppy-0.063
i-0.062
 felt-0.062
s-0.061
y-0.061
Token?"
Token ID1701
Feature activation-2.25e-02

Pos logit contributions
 yours+0.088
 purpose+0.084
 our+0.078
 pleasure+0.077
 electricity+0.075
Neg logit contributions
 felt-0.101
ppy-0.100
i-0.100
ie-0.098
 got-0.098
Token His
Token ID2399
Feature activation-1.52e-02

Pos logit contributions
ppy+0.041
i+0.041
oped+0.039
ches+0.038
owed+0.038
Neg logit contributions
 yours-0.050
 our-0.046
 purpose-0.046
 your-0.043
 pleasure-0.041
Token mom
Token ID1995
Feature activation-1.15e-01

Pos logit contributions
 felt+0.030
i+0.030
ppy+0.030
 crawled+0.029
s+0.029
Neg logit contributions
 yours-0.031
 our-0.027
 purpose-0.027
 your-0.025
 pleasure-0.024
Tokenmy
Token ID1820
Feature activation-1.16e-01

Pos logit contributions
cases+0.152
oped+0.139
ops+0.135
oping+0.133
ony+0.126
Neg logit contributions
 yours-0.300
 my-0.280
 our-0.278
 your-0.278
 gift-0.265
Token said
Token ID531
Feature activation-1.42e-02

Pos logit contributions
 pads+0.186
cases+0.171
oped+0.160
oping+0.160
ge+0.150
Neg logit contributions
 our-0.235
 yours-0.228
 your-0.227
 gift-0.226
 my-0.226
Token,
Token ID11
Feature activation-2.51e-02

Pos logit contributions
i+0.020
ppy+0.020
 felt+0.019
owed+0.019
 crawled+0.019
Neg logit contributions
 yours-0.037
 our-0.034
 purpose-0.034
 your-0.032
 pleasure-0.031
Token "
Token ID366
Feature activation+2.21e-02

Pos logit contributions
i+0.057
ppy+0.057
oped+0.055
owed+0.055
ches+0.054
Neg logit contributions
 yours-0.043
 our-0.039
 your-0.036
 purpose-0.036
 my-0.034
TokenSure
Token ID19457
Feature activation+2.35e-02

Pos logit contributions
 yours+0.046
 our+0.041
 purpose+0.041
 your+0.038
 pleasure+0.037
Neg logit contributions
i-0.042
ppy-0.042
 felt-0.042
 crawled-0.041
oped-0.041
Token,
Token ID11
Feature activation+4.69e-02

Pos logit contributions
 yours+0.070
 purpose+0.065
 our+0.063
 outer+0.063
 electricity+0.063
Neg logit contributions
i-0.022
ie-0.022
 felt-0.020
 got-0.020
 ran-0.020
Token nice
Token ID3621
Feature activation+1.59e-02

Pos logit contributions
 yours+0.110
 purpose+0.099
 our+0.096
 pleasure+0.089
 your+0.087
Neg logit contributions
 felt-0.110
i-0.109
s-0.108
ling-0.107
ppy-0.106
Token.
Token ID13
Feature activation+8.23e-03

Pos logit contributions
 yours+0.041
 our+0.038
 purpose+0.037
 your+0.035
 pleasure+0.035
Neg logit contributions
ppy-0.023
i-0.022
 felt-0.022
oped-0.022
 crawled-0.021
Token
Token ID198
Feature activation+6.15e-03

Pos logit contributions
 yours+0.020
 our+0.018
 purpose+0.018
 your+0.017
 pleasure+0.017
Neg logit contributions
ppy-0.013
i-0.013
 felt-0.013
s-0.013
 crawled-0.012
Token
Token ID198
Feature activation-7.86e-03

Pos logit contributions
 yours+0.013
 our+0.012
 purpose+0.012
 your+0.011
 pleasure+0.011
Neg logit contributions
i-0.011
ppy-0.011
owed-0.011
 felt-0.011
oped-0.011
TokenL
Token ID43
Feature activation-6.28e-02

Pos logit contributions
i+0.013
owed+0.013
ppy+0.013
oped+0.012
 felt+0.012
Neg logit contributions
 yours-0.018
 our-0.017
 purpose-0.017
 your-0.016
 yourselves-0.016
Tokenily
Token ID813
Feature activation-1.31e-01

Pos logit contributions
oped+0.139
s+0.134
i+0.133
oping+0.133
hetically+0.133
Neg logit contributions
 yours-0.120
 our-0.102
 your-0.097
 purpose-0.092
 yourselves-0.091
Token felt
Token ID2936
Feature activation+3.47e-02

Pos logit contributions
hetically+0.194
oped+0.192
int+0.189
bi+0.188
owed+0.184
Neg logit contributions
 our-0.259
 your-0.259
 my-0.250
 purpose-0.226
 history-0.223
Token sad
Token ID6507
Feature activation+2.74e-02

Pos logit contributions
 yours+0.091
 purpose+0.085
 our+0.081
 ours+0.076
 electricity+0.076
Neg logit contributions
 felt-0.056
 got-0.048
 crawled-0.047
 forgot-0.046
 started-0.046
Token.
Token ID13
Feature activation-1.62e-03

Pos logit contributions
 yours+0.071
 our+0.066
 purpose+0.065
 pleasure+0.061
 your+0.061
Neg logit contributions
i-0.038
ppy-0.038
 felt-0.038
s-0.037
 crawled-0.036
Token She
Token ID1375
Feature activation-6.07e-02

Pos logit contributions
i+0.002
ppy+0.002
 felt+0.002
oped+0.002
owed+0.002
Neg logit contributions
 yours-0.004
 our-0.004
 purpose-0.004
 your-0.004
 pleasure-0.004
Token thought
Token ID1807
Feature activation+4.57e-02

Pos logit contributions
oped+0.081
ops+0.075
owed+0.074
cases+0.073
es+0.073
Neg logit contributions
 yours-0.153
 our-0.146
 your-0.144
 my-0.138
 purpose-0.136