TOP ACTIVATIONS
MAX = 0.254

saw a big box of Leg os that he really wanted
. The great Zen o hugs Ben and says
a new tank out of Leg os . Tim
The end . <|endoftext|> The clown was very smart . He
happening . The se ag ull replied that it was a
The se ag ull s and the fish agreed
the creature . " Sk ull s are not meant to
was a clown . The clown was very funny . He
arms . The se ag ull flew away . Tommy laughed
park . D ull had an idea . He
one of the se ag ull s suddenly became very aggressive
nearby . The se ag ull swoop ed down to take
away all the se ag ull s . Suddenly
said . The maid smiled . " Yes ,
. The flaming o nodded its head and
, Sally the Se ag ull , heard him and came
came to town . The clown had a big red nose
to the noisy se ag ull s outside . Jimmy looked
and that the se ag ull s would try to take
S ue " on the veil . She was very happy

MIN ACTIVATIONS
MIN = -0.170

set the table and being such a good helper . Lily
as a reward for being such a good dog . Max
a thank you for being such a good friend . From
and thanked Lily for being such a good friend .
and praised them for being such best friends . From then
proud of her for being such an obedient and happy little
things and was no longer such a shy princess . <|endoftext|>
him a treat for being such a good listener . Max
c udd les for being such a clever girl .
know why they would say such things . The
and thanked Max for being such a good friend . From
to know where they got such a beautiful car cover .
proud of her for being such a good helper . They
and praised her for being such a good student .
proud of herself for being such a good helper . <|endoftext|>
all applauded him for being such a good pup . <|endoftext|>
joining him and for being such a great friend .
and thanked Benny for being such a good friend . From
and thanked Max for being such a good friend . The
God for letting him have such a modest and happy life

INTERVAL 0.148 - 0.254
CONTAINS 4.075%

they could fix anything . <|endoftext|> Once upon a time ,
angry and started shaking the radio . Then , out of
. Grand ma brought some tea , but it was too
nice , but my car machine is better . It makes
she saw a dove . <|endoftext|> The sun shone brightly in

INTERVAL 0.042 - 0.148
CONTAINS 27.416%

a frog too ?" " Sure , Max ,
the hole . Just then , a kind
a shiny , white bone . It looked very tasty and
walking , he heard a noise nearby . When he looked
who figured out the answer . And they all lived happily

INTERVAL -0.064 - 0.042
CONTAINS 45.139%

him find the lung , because it changed her life .
her very much . And I love my grandma too .
snack for the journey . When she was done , she
three , and they all jumped into the air , with
to bed that night and fell asleep right away . She

INTERVAL -0.170 - -0.064
CONTAINS 23.370%

my drawings . They are not food . They are art
Tim my 's dad smiled and said ,
a walk , when he heard a noise . He looked
. But Mom does not like it when they
and he liked the way the people smiled at him .
Token saw
Token ID2497
Feature activation-7.00e-02

Pos logit contributions
os+0.156
o+0.151
umping+0.145
oing+0.140
ark+0.139
Neg logit contributions
 chores-0.128
 plan-0.117
 same-0.117
 temptation-0.115
 urge-0.110
Token a
Token ID257
Feature activation-5.37e-02

Pos logit contributions
 plan+0.096
 chores+0.092
 urge+0.091
 temptation+0.085
 same+0.080
Neg logit contributions
os-0.208
o-0.206
umping-0.192
cream-0.189
ark-0.189
Token big
Token ID1263
Feature activation-9.79e-03

Pos logit contributions
 chores+0.063
 same+0.053
 urge+0.052
 plan+0.051
 temptation+0.050
Neg logit contributions
oing-0.168
os-0.165
o-0.164
ark-0.163
umping-0.160
Token box
Token ID3091
Feature activation+1.41e-01

Pos logit contributions
 chores+0.016
 same+0.015
 plan+0.015
 urge+0.014
 temptation+0.014
Neg logit contributions
os-0.025
oing-0.025
o-0.024
umping-0.024
ark-0.024
Token of
Token ID286
Feature activation-9.34e-02

Pos logit contributions
os+0.328
ark+0.308
oing+0.308
o+0.303
umping+0.297
Neg logit contributions
 chores-0.308
 plan-0.296
 same-0.295
 urge-0.280
 way-0.279
Token Leg
Token ID3564
Feature activation+2.54e-01

Pos logit contributions
 plan+0.135
 chores+0.133
 same+0.128
 urge+0.128
 way+0.126
Neg logit contributions
os-0.252
o-0.238
eddy-0.233
ark-0.232
oing-0.231
Tokenos
Token ID418
Feature activation+1.29e-01

Pos logit contributions
oing+0.316
ark+0.311
aron+0.310
pillar+0.301
round+0.268
Neg logit contributions
 plan-0.721
 start-0.714
 going-0.710
 work-0.696
 urge-0.682
Token that
Token ID326
Feature activation-5.75e-02

Pos logit contributions
os+0.313
ark+0.312
oing+0.306
o+0.294
umping+0.291
Neg logit contributions
 chores-0.265
 same-0.241
 start-0.238
 plan-0.237
 work-0.233
Token he
Token ID339
Feature activation-8.39e-04

Pos logit contributions
 chores+0.114
 plan+0.106
 same+0.102
 temptation+0.101
 urge+0.100
Neg logit contributions
os-0.149
o-0.142
ark-0.129
es-0.124
ne-0.124
Token really
Token ID1107
Feature activation-5.47e-02

Pos logit contributions
 chores+0.001
 plan+0.001
 same+0.001
 urge+0.001
 way+0.001
Neg logit contributions
os-0.002
ark-0.002
oing-0.002
o-0.002
umping-0.002
Token wanted
Token ID2227
Feature activation-8.43e-02

Pos logit contributions
 chores+0.098
 plan+0.093
 same+0.093
 urge+0.087
 way+0.086
Neg logit contributions
os-0.149
ark-0.141
oing-0.141
o-0.140
umping-0.137
Token.
Token ID13
Feature activation+7.61e-02

Pos logit contributions
 chores+0.003
 plan+0.002
 same+0.002
 urge+0.002
 start+0.002
Neg logit contributions
os-0.003
ark-0.003
umping-0.003
oing-0.003
o-0.003
Token
Token ID198
Feature activation+8.08e-02

Pos logit contributions
os+0.222
ark+0.213
oing+0.210
umping+0.207
o+0.202
Neg logit contributions
 chores-0.113
 same-0.112
 plan-0.111
 way-0.105
 bad-0.103
Token
Token ID198
Feature activation+8.06e-02

Pos logit contributions
os+0.233
umping+0.224
oing+0.221
ark+0.217
round+0.213
Neg logit contributions
 chores-0.124
 same-0.119
 plan-0.119
 bad-0.115
 other-0.114
TokenThe
Token ID464
Feature activation-5.11e-02

Pos logit contributions
os+0.210
umping+0.190
oing+0.187
ark+0.181
o+0.179
Neg logit contributions
 chores-0.154
 same-0.152
 plan-0.148
 bad-0.144
 other-0.142
Token great
Token ID1049
Feature activation-1.28e-02

Pos logit contributions
 chores+0.086
 plan+0.078
 same+0.077
 urge+0.073
 way+0.073
Neg logit contributions
os-0.142
ark-0.135
oing-0.135
o-0.134
umping-0.130
Token Zen
Token ID14760
Feature activation+2.53e-01

Pos logit contributions
 chores+0.022
 same+0.021
 plan+0.020
 urge+0.019
 way+0.019
Neg logit contributions
os-0.035
ark-0.033
oing-0.032
o-0.032
umping-0.032
Tokeno
Token ID78
Feature activation+1.22e-01

Pos logit contributions
umping+0.356
ark+0.327
umbs+0.326
oing+0.319
ud+0.303
Neg logit contributions
 plan-0.725
 not-0.697
 same-0.695
 start-0.695
 way-0.687
Token hugs
Token ID40657
Feature activation+2.52e-02

Pos logit contributions
ark+0.340
os+0.336
oing+0.334
and+0.323
umping+0.318
Neg logit contributions
 plan-0.195
 same-0.193
 chores-0.191
 start-0.186
 way-0.178
Token Ben
Token ID3932
Feature activation+1.05e-01

Pos logit contributions
os+0.090
oing+0.089
ark+0.087
o+0.085
umping+0.085
Neg logit contributions
 chores-0.023
 same-0.023
 plan-0.020
 start-0.019
 way-0.019
Token and
Token ID290
Feature activation-2.89e-02

Pos logit contributions
os+0.291
ark+0.285
oing+0.279
umping+0.271
lic+0.269
Neg logit contributions
 chores-0.180
 same-0.172
 plan-0.170
 way-0.169
 start-0.164
Token says
Token ID1139
Feature activation-1.48e-02

Pos logit contributions
 chores+0.058
 plan+0.054
 same+0.053
 urge+0.050
 way+0.049
Neg logit contributions
os-0.070
o-0.066
oing-0.063
ne-0.062
umping-0.062
Token a
Token ID257
Feature activation-5.95e-02

Pos logit contributions
os+0.190
oing+0.182
o+0.181
ark+0.180
umping+0.178
Neg logit contributions
 chores-0.064
 plan-0.059
 same-0.059
 way-0.053
 urge-0.053
Token new
Token ID649
Feature activation-1.92e-02

Pos logit contributions
 chores+0.064
 same+0.054
 urge+0.052
 temptation+0.050
 start+0.047
Neg logit contributions
o-0.185
oing-0.184
os-0.180
oc-0.176
ark-0.175
Token tank
Token ID6873
Feature activation+1.60e-01

Pos logit contributions
 chores+0.032
 same+0.029
 plan+0.029
 urge+0.027
 start+0.027
Neg logit contributions
os-0.054
o-0.052
ark-0.051
oing-0.051
umping-0.050
Token out
Token ID503
Feature activation-2.31e-02

Pos logit contributions
os+0.379
ark+0.357
oing+0.356
o+0.352
umping+0.345
Neg logit contributions
 chores-0.340
 plan-0.325
 same-0.324
 urge-0.307
 way-0.306
Token of
Token ID286
Feature activation-1.02e-01

Pos logit contributions
 chores+0.058
 plan+0.056
 same+0.056
 urge+0.054
 way+0.053
Neg logit contributions
os-0.046
ark-0.042
oing-0.042
o-0.042
umping-0.041
Token Leg
Token ID3564
Feature activation+2.53e-01

Pos logit contributions
 chores+0.128
 plan+0.114
 urge+0.106
 same+0.105
 start+0.091
Neg logit contributions
os-0.309
o-0.292
and-0.284
umping-0.279
eddy-0.277
Tokenos
Token ID418
Feature activation+1.30e-01

Pos logit contributions
oing+0.183
aron+0.163
ark+0.161
pillar+0.157
round+0.143
Neg logit contributions
 plan-0.859
 chores-0.830
 work-0.827
 first-0.821
 start-0.819
Token.
Token ID13
Feature activation+7.56e-02

Pos logit contributions
ark+0.313
oing+0.305
os+0.304
umping+0.293
enn+0.291
Neg logit contributions
 chores-0.269
 same-0.253
 plan-0.250
 start-0.243
 work-0.238
Token
Token ID198
Feature activation+7.64e-02

Pos logit contributions
os+0.219
ark+0.210
oing+0.209
umping+0.204
o+0.202
Neg logit contributions
 chores-0.121
 same-0.116
 plan-0.116
 way-0.109
 urge-0.107
Token
Token ID198
Feature activation+7.58e-02

Pos logit contributions
os+0.219
oing+0.214
umping+0.209
ark+0.205
aron+0.201
Neg logit contributions
 chores-0.113
 plan-0.111
 same-0.105
 bad-0.101
 way-0.100
TokenTim
Token ID14967
Feature activation+1.12e-01

Pos logit contributions
os+0.180
oing+0.164
ark+0.161
umping+0.159
o+0.159
Neg logit contributions
 chores-0.167
 plan-0.161
 same-0.159
 urge-0.150
 way-0.149
TokenThe
Token ID464
Feature activation-6.00e-02

Pos logit contributions
os+0.158
oing+0.147
umping+0.146
ark+0.139
o+0.138
Neg logit contributions
 chores-0.140
 plan-0.138
 same-0.132
 start-0.124
 urge-0.123
Token end
Token ID886
Feature activation+1.38e-02

Pos logit contributions
 chores+0.080
 plan+0.072
 same+0.071
 urge+0.068
 way+0.066
Neg logit contributions
os-0.178
o-0.174
oing-0.173
ark-0.173
ne-0.169
Token.
Token ID13
Feature activation+7.89e-02

Pos logit contributions
os+0.037
oing+0.036
ark+0.036
umping+0.036
umbs+0.034
Neg logit contributions
 plan-0.025
 chores-0.025
 bad-0.023
 way-0.023
 same-0.023
Token<|endoftext|>
Token ID50256
Feature activation+1.49e-01

Pos logit contributions
os+0.221
oing+0.217
umping+0.216
ark+0.214
v+0.204
Neg logit contributions
 same-0.108
 plan-0.107
 chores-0.107
 way-0.104
 start-0.100
TokenThe
Token ID464
Feature activation-5.08e-02

Pos logit contributions
oing+0.463
ark+0.459
round+0.442
ist+0.439
os+0.430
Neg logit contributions
 same-0.184
 the-0.168
 way-0.165
 other-0.165
 time-0.161
Token clown
Token ID25573
Feature activation+2.48e-01

Pos logit contributions
 chores+0.060
 plan+0.055
 same+0.053
 urge+0.049
 way+0.049
Neg logit contributions
os-0.166
o-0.160
oing-0.159
ark-0.159
umping-0.156
Token was
Token ID373
Feature activation-1.02e-01

Pos logit contributions
ark+0.696
oing+0.691
os+0.673
and+0.661
aron+0.656
Neg logit contributions
 chores-0.380
 plan-0.375
 same-0.372
 way-0.348
 start-0.346
Token very
Token ID845
Feature activation-4.43e-02

Pos logit contributions
 chores+0.156
 plan+0.146
 same+0.137
 urge+0.132
 way+0.129
Neg logit contributions
os-0.306
o-0.301
umping-0.289
ark-0.288
oing-0.288
Token smart
Token ID4451
Feature activation+1.47e-02

Pos logit contributions
 chores+0.066
 plan+0.059
 same+0.057
 way+0.057
 start+0.053
Neg logit contributions
os-0.136
o-0.133
oing-0.127
ark-0.124
umping-0.122
Token.
Token ID13
Feature activation+7.03e-02

Pos logit contributions
os+0.037
oing+0.035
ark+0.035
o+0.034
umping+0.033
Neg logit contributions
 chores-0.030
 same-0.029
 plan-0.028
 way-0.027
 urge-0.027
Token He
Token ID679
Feature activation+2.39e-03

Pos logit contributions
oing+0.201
os+0.201
ark+0.194
umping+0.190
o+0.187
Neg logit contributions
 chores-0.111
 same-0.110
 plan-0.105
 urge-0.099
 way-0.098
Token happening
Token ID5836
Feature activation-3.08e-02

Pos logit contributions
 chores+0.183
 plan+0.171
 urge+0.163
 same+0.155
 temptation+0.151
Neg logit contributions
os-0.298
o-0.284
ark-0.270
ne-0.265
oing-0.262
Token.
Token ID13
Feature activation+7.13e-02

Pos logit contributions
 chores+0.069
 plan+0.066
 same+0.065
 urge+0.062
 way+0.062
Neg logit contributions
os-0.070
ark-0.066
oing-0.065
o-0.065
umping-0.064
Token The
Token ID383
Feature activation-5.12e-02

Pos logit contributions
os+0.192
oing+0.188
ark+0.186
umping+0.182
o+0.178
Neg logit contributions
 chores-0.122
 same-0.118
 plan-0.116
 way-0.108
 urge-0.108
Token se
Token ID384
Feature activation+1.23e-01

Pos logit contributions
 chores+0.080
 plan+0.073
 same+0.072
 urge+0.069
 way+0.067
Neg logit contributions
os-0.143
o-0.137
oing-0.136
ark-0.136
ne-0.133
Tokenag
Token ID363
Feature activation+8.66e-02

Pos logit contributions
os+0.224
ark+0.210
oing+0.209
o+0.203
umping+0.203
Neg logit contributions
 chores-0.320
 plan-0.320
 same-0.313
 urge-0.303
 idea-0.295
Tokenull
Token ID724
Feature activation+2.44e-01

Pos logit contributions
os+0.129
oing+0.128
ark+0.125
umping+0.125
aron+0.122
Neg logit contributions
 plan-0.240
 same-0.235
 way-0.227
 urge-0.224
 chores-0.222
Token replied
Token ID8712
Feature activation-2.04e-02

Pos logit contributions
os+0.657
ark+0.642
oing+0.628
umping+0.619
o+0.616
Neg logit contributions
 chores-0.416
 plan-0.411
 same-0.405
 urge-0.388
 way-0.384
Token that
Token ID326
Feature activation-5.62e-02

Pos logit contributions
 chores+0.036
 plan+0.034
 same+0.034
 urge+0.032
 way+0.032
Neg logit contributions
os-0.056
ark-0.053
oing-0.053
o-0.053
umping-0.051
Token it
Token ID340
Feature activation+3.52e-02

Pos logit contributions
 chores+0.079
 same+0.073
 plan+0.073
 urge+0.069
 way+0.065
Neg logit contributions
os-0.176
o-0.165
ark-0.161
oing-0.160
umping-0.156
Token was
Token ID373
Feature activation-1.18e-01

Pos logit contributions
os+0.091
ark+0.086
oing+0.086
o+0.085
umping+0.083
Neg logit contributions
 chores-0.068
 plan-0.064
 same-0.064
 urge-0.061
 way-0.060
Token a
Token ID257
Feature activation-5.61e-02

Pos logit contributions
 chores+0.177
 plan+0.158
 same+0.151
 urge+0.147
 laundry+0.141
Neg logit contributions
o-0.326
os-0.314
oing-0.293
aron-0.293
umping-0.289
Token
Token ID198
Feature activation+7.39e-02

Pos logit contributions
os+0.205
ark+0.199
oing+0.198
umping+0.195
o+0.189
Neg logit contributions
 chores-0.157
 plan-0.151
 same-0.148
 urge-0.141
 way-0.140
Token
Token ID198
Feature activation+7.30e-02

Pos logit contributions
os+0.199
umping+0.193
oing+0.193
ark+0.185
round+0.181
Neg logit contributions
 chores-0.126
 plan-0.125
 same-0.115
 other-0.109
 bad-0.108
TokenThe
Token ID464
Feature activation-5.54e-02

Pos logit contributions
os+0.165
oing+0.154
umping+0.151
o+0.149
ark+0.148
Neg logit contributions
 chores-0.165
 plan-0.162
 same-0.154
 urge-0.146
 start-0.146
Token se
Token ID384
Feature activation+1.20e-01

Pos logit contributions
 chores+0.082
 plan+0.069
 same+0.067
 urge+0.065
 way+0.063
Neg logit contributions
os-0.159
ark-0.158
oing-0.155
ne-0.155
o-0.152
Tokenag
Token ID363
Feature activation+8.84e-02

Pos logit contributions
os+0.236
oing+0.228
ark+0.217
umping+0.216
o+0.211
Neg logit contributions
 plan-0.291
 chores-0.287
 same-0.280
 idea-0.272
 urge-0.271
Tokenull
Token ID724
Feature activation+2.44e-01

Pos logit contributions
os+0.136
oing+0.134
round+0.132
ark+0.131
umping+0.130
Neg logit contributions
 plan-0.242
 same-0.235
 way-0.227
 urge-0.223
 idea-0.218
Tokens
Token ID82
Feature activation-4.61e-02

Pos logit contributions
os+0.642
ark+0.642
oing+0.624
umping+0.613
and+0.609
Neg logit contributions
 plan-0.420
 chores-0.413
 same-0.400
 urge-0.388
 way-0.386
Token and
Token ID290
Feature activation-2.65e-02

Pos logit contributions
 chores+0.092
 same+0.085
 plan+0.084
 way+0.079
 urge+0.078
Neg logit contributions
os-0.115
o-0.108
oing-0.108
ark-0.106
umping-0.104
Token the
Token ID262
Feature activation-7.59e-02

Pos logit contributions
 chores+0.042
 same+0.038
 plan+0.036
 temptation+0.036
 urge+0.036
Neg logit contributions
os-0.077
o-0.073
es-0.070
ne-0.069
oing-0.069
Token fish
Token ID5916
Feature activation+2.04e-01

Pos logit contributions
 chores+0.118
 plan+0.103
 same+0.098
 urge+0.097
 start+0.092
Neg logit contributions
os-0.215
oing-0.208
o-0.208
ark-0.207
ne-0.204
Token agreed
Token ID4987
Feature activation-6.33e-02

Pos logit contributions
os+0.529
ark+0.510
oing+0.506
o+0.495
umping+0.493
Neg logit contributions
 chores-0.383
 plan-0.372
 same-0.366
 urge-0.347
 way-0.345
Token the
Token ID262
Feature activation-6.56e-02

Pos logit contributions
 chores+0.010
 plan+0.009
 same+0.009
 urge+0.008
 way+0.008
Neg logit contributions
os-0.022
ark-0.021
oing-0.021
o-0.021
umping-0.021
Token creature
Token ID7185
Feature activation+1.62e-01

Pos logit contributions
 chores+0.091
 plan+0.082
 same+0.079
 urge+0.077
 way+0.076
Neg logit contributions
os-0.197
o-0.190
ark-0.189
oing-0.189
ne-0.184
Token.
Token ID13
Feature activation+8.17e-02

Pos logit contributions
os+0.399
ark+0.381
oing+0.380
o+0.370
umping+0.370
Neg logit contributions
 chores-0.327
 plan-0.316
 same-0.313
 urge-0.298
 way-0.295
Token "
Token ID366
Feature activation+5.15e-02

Pos logit contributions
os+0.223
oing+0.215
ark+0.214
umping+0.211
o+0.201
Neg logit contributions
 same-0.129
 chores-0.128
 plan-0.126
 way-0.118
 start-0.118
TokenSk
Token ID15739
Feature activation+4.62e-02

Pos logit contributions
os+0.136
umping+0.128
oing+0.128
ark+0.127
o+0.125
Neg logit contributions
 same-0.091
 chores-0.088
 plan-0.085
 bad-0.082
 urge-0.080
Tokenull
Token ID724
Feature activation+2.43e-01

Pos logit contributions
oing+0.090
os+0.086
round+0.082
aron+0.082
umping+0.079
Neg logit contributions
 same-0.110
 plan-0.106
 chores-0.104
 urge-0.098
 way-0.097
Tokens
Token ID82
Feature activation-4.74e-02

Pos logit contributions
os+0.506
ark+0.491
oing+0.482
umping+0.476
and+0.461
Neg logit contributions
 chores-0.569
 same-0.556
 plan-0.556
 urge-0.528
 way-0.522
Token are
Token ID389
Feature activation-7.98e-02

Pos logit contributions
 chores+0.081
 plan+0.076
 same+0.076
 way+0.071
 urge+0.070
Neg logit contributions
os-0.133
o-0.125
oing-0.122
ark-0.121
umping-0.117
Token not
Token ID407
Feature activation-1.01e-01

Pos logit contributions
 chores+0.104
 plan+0.097
 same+0.089
 urge+0.087
 way+0.083
Neg logit contributions
o-0.237
os-0.234
aron-0.218
oing-0.218
ark-0.215
Token meant
Token ID4001
Feature activation-9.35e-02

Pos logit contributions
 chores+0.115
 plan+0.101
 same+0.100
 urge+0.093
 way+0.089
Neg logit contributions
os-0.332
o-0.324
ark-0.318
oing-0.317
umping-0.311
Token to
Token ID284
Feature activation-1.35e-01

Pos logit contributions
 chores+0.174
 plan+0.166
 same+0.164
 urge+0.157
 way+0.152
Neg logit contributions
os-0.241
ark-0.228
o-0.228
oing-0.226
umping-0.219
Token was
Token ID373
Feature activation-1.01e-01

Pos logit contributions
 chores+0.001
 plan+0.001
 same+0.001
 urge+0.001
 way+0.001
Neg logit contributions
os-0.001
ark-0.001
oing-0.001
o-0.001
umping-0.001
Token a
Token ID257
Feature activation-4.52e-02

Pos logit contributions
 chores+0.106
 plan+0.099
 same+0.092
 urge+0.086
 way+0.085
Neg logit contributions
os-0.345
o-0.334
ark-0.331
oing-0.330
umping-0.328
Token clown
Token ID25573
Feature activation+2.27e-01

Pos logit contributions
 chores+0.048
 same+0.041
 plan+0.041
 urge+0.037
 way+0.037
Neg logit contributions
os-0.154
oing-0.150
ark-0.150
o-0.149
umping-0.145
Token.
Token ID13
Feature activation+7.34e-02

Pos logit contributions
umbs+0.649
ives+0.639
aron+0.632
umping+0.628
and+0.613
Neg logit contributions
 other-0.294
 not-0.289
 plan-0.284
 only-0.282
 mom-0.276
Token The
Token ID383
Feature activation-4.78e-02

Pos logit contributions
os+0.215
oing+0.212
ark+0.210
umping+0.206
enn+0.203
Neg logit contributions
 same-0.105
 chores-0.102
 plan-0.097
 urge-0.095
 start-0.092
Token clown
Token ID25573
Feature activation+2.43e-01

Pos logit contributions
 chores+0.111
 plan+0.106
 same+0.106
 urge+0.101
 way+0.100
Neg logit contributions
os-0.106
ark-0.099
oing-0.099
o-0.098
umping-0.095
Token was
Token ID373
Feature activation-1.06e-01

Pos logit contributions
oing+0.792
umbs+0.786
umping+0.771
ark+0.765
and+0.762
Neg logit contributions
 plan-0.249
 same-0.239
 way-0.233
 other-0.232
 only-0.232
Token very
Token ID845
Feature activation-4.70e-02

Pos logit contributions
 chores+0.159
 plan+0.148
 temptation+0.132
 urge+0.130
 same+0.129
Neg logit contributions
o-0.320
os-0.314
ark-0.298
oing-0.294
umping-0.288
Token funny
Token ID8258
Feature activation+2.40e-02

Pos logit contributions
 chores+0.052
 plan+0.046
 same+0.045
 way+0.041
 urge+0.040
Neg logit contributions
os-0.161
o-0.155
ark-0.154
oing-0.154
umping-0.149
Token.
Token ID13
Feature activation+7.37e-02

Pos logit contributions
os+0.069
oing+0.068
ark+0.066
enn+0.066
oped+0.065
Neg logit contributions
 way-0.035
 chores-0.035
 plan-0.034
 same-0.034
 bad-0.033
Token He
Token ID679
Feature activation+2.01e-03

Pos logit contributions
os+0.210
oing+0.205
ark+0.203
umping+0.198
o+0.195
Neg logit contributions
 chores-0.120
 same-0.118
 plan-0.114
 urge-0.109
 way-0.108
Token arms
Token ID5101
Feature activation+8.80e-02

Pos logit contributions
 chores+0.124
 same+0.114
 plan+0.113
 urge+0.105
 way+0.104
Neg logit contributions
os-0.205
oing-0.195
o-0.195
ark-0.192
umping-0.187
Token.
Token ID13
Feature activation+7.77e-02

Pos logit contributions
os+0.221
ark+0.211
oing+0.209
o+0.206
umping+0.203
Neg logit contributions
 chores-0.173
 same-0.167
 plan-0.165
 urge-0.157
 way-0.156
Token The
Token ID383
Feature activation-4.66e-02

Pos logit contributions
os+0.204
o+0.193
oing+0.191
ark+0.191
umping+0.189
Neg logit contributions
 chores-0.145
 plan-0.138
 same-0.132
 urge-0.127
 way-0.125
Token se
Token ID384
Feature activation+1.27e-01

Pos logit contributions
 chores+0.078
 plan+0.071
 same+0.069
 urge+0.065
 way+0.064
Neg logit contributions
os-0.127
oing-0.125
o-0.123
umping-0.120
ne-0.119
Tokenag
Token ID363
Feature activation+8.87e-02

Pos logit contributions
os+0.281
oing+0.271
ark+0.261
umping+0.260
enn+0.257
Neg logit contributions
 plan-0.269
 chores-0.267
 same-0.261
 urge-0.257
 idea-0.253
Tokenull
Token ID724
Feature activation+2.42e-01

Pos logit contributions
os+0.138
ark+0.138
oing+0.137
aron+0.136
ives+0.133
Neg logit contributions
 plan-0.238
 same-0.229
 way-0.224
 urge-0.223
 grown-0.214
Token flew
Token ID13112
Feature activation-2.00e-02

Pos logit contributions
os+0.703
ark+0.679
oing+0.670
umping+0.659
o+0.659
Neg logit contributions
 chores-0.376
 plan-0.367
 same-0.360
 urge-0.342
 way-0.340
Token away
Token ID1497
Feature activation-4.20e-02

Pos logit contributions
 chores+0.037
 plan+0.035
 same+0.034
 urge+0.033
 way+0.032
Neg logit contributions
os-0.054
ark-0.051
o-0.051
oing-0.051
umping-0.049
Token.
Token ID13
Feature activation+7.91e-02

Pos logit contributions
 chores+0.093
 plan+0.088
 same+0.087
 urge+0.083
 way+0.082
Neg logit contributions
os-0.097
oing-0.092
ark-0.092
o-0.091
umping-0.088
Token Tommy
Token ID19919
Feature activation+1.05e-01

Pos logit contributions
os+0.209
o+0.197
ark+0.195
oing+0.194
umping+0.193
Neg logit contributions
 chores-0.147
 plan-0.142
 same-0.135
 urge-0.129
 way-0.126
Token laughed
Token ID13818
Feature activation+2.09e-03

Pos logit contributions
os+0.306
ark+0.296
oing+0.295
umping+0.284
and+0.281
Neg logit contributions
 plan-0.158
 same-0.153
 way-0.152
 chores-0.148
 start-0.147
Token park
Token ID3952
Feature activation+8.36e-02

Pos logit contributions
 chores+0.125
 plan+0.116
 urge+0.109
 same+0.107
 start+0.105
Neg logit contributions
os-0.234
o-0.231
ark-0.227
oing-0.224
ne-0.222
Token.
Token ID13
Feature activation+7.71e-02

Pos logit contributions
os+0.196
oing+0.185
ark+0.185
o+0.181
umping+0.179
Neg logit contributions
 chores-0.181
 plan-0.173
 same-0.173
 way-0.164
 urge-0.163
Token
Token ID198
Feature activation+7.65e-02

Pos logit contributions
os+0.216
oing+0.210
ark+0.209
umping+0.207
aron+0.197
Neg logit contributions
 chores-0.124
 same-0.121
 plan-0.118
 way-0.111
 urge-0.108
Token
Token ID198
Feature activation+7.49e-02

Pos logit contributions
os+0.207
oing+0.203
umping+0.203
ark+0.193
aron+0.192
Neg logit contributions
 chores-0.130
 plan-0.126
 same-0.120
 bad-0.114
 way-0.113
TokenD
Token ID35
Feature activation+8.07e-02

Pos logit contributions
os+0.184
umping+0.173
oing+0.171
aron+0.167
ark+0.163
Neg logit contributions
 chores-0.153
 plan-0.146
 same-0.142
 bad-0.134
 start-0.133
Tokenull
Token ID724
Feature activation+2.42e-01

Pos logit contributions
oing+0.134
os+0.131
aron+0.123
ark+0.122
round+0.121
Neg logit contributions
 same-0.219
 chores-0.214
 plan-0.208
 work-0.204
 start-0.200
Token had
Token ID550
Feature activation-8.02e-02

Pos logit contributions
os+0.605
ark+0.587
oing+0.574
umping+0.574
and+0.572
Neg logit contributions
 plan-0.442
 chores-0.434
 same-0.431
 way-0.420
 urge-0.416
Token an
Token ID281
Feature activation-6.60e-02

Pos logit contributions
 plan+0.120
 chores+0.119
 urge+0.115
 temptation+0.111
 same+0.109
Neg logit contributions
os-0.220
o-0.218
umping-0.196
v-0.195
ark-0.191
Token idea
Token ID2126
Feature activation+3.91e-02

Pos logit contributions
 same+0.059
 chores+0.057
 way+0.052
 start+0.051
 plan+0.044
Neg logit contributions
aron-0.193
umping-0.191
round-0.189
ark-0.188
os-0.188
Token.
Token ID13
Feature activation+7.51e-02

Pos logit contributions
os+0.100
oing+0.098
ark+0.096
umping+0.094
o+0.093
Neg logit contributions
 chores-0.073
 plan-0.072
 way-0.069
 urge-0.068
 same-0.067
Token He
Token ID679
Feature activation+1.89e-03

Pos logit contributions
os+0.200
ark+0.190
oing+0.190
o+0.187
umping+0.185
Neg logit contributions
 chores-0.136
 same-0.131
 plan-0.131
 urge-0.122
 way-0.122
Token one
Token ID530
Feature activation+2.50e-02

Pos logit contributions
 chores+0.059
 plan+0.055
 same+0.055
 urge+0.052
 way+0.050
Neg logit contributions
os-0.090
o-0.085
ark-0.084
oing-0.083
umping-0.082
Token of
Token ID286
Feature activation-7.90e-02

Pos logit contributions
os+0.075
o+0.071
oing+0.071
ark+0.070
umping+0.069
Neg logit contributions
 chores-0.038
 same-0.035
 plan-0.035
 urge-0.032
 way-0.032
Token the
Token ID262
Feature activation-7.37e-02

Pos logit contributions
 chores+0.093
 same+0.083
 plan+0.082
 urge+0.076
 way+0.073
Neg logit contributions
os-0.265
o-0.251
ark-0.246
ne-0.244
umping-0.243
Token se
Token ID384
Feature activation+1.16e-01

Pos logit contributions
 chores+0.090
 plan+0.082
 same+0.081
 urge+0.077
 start+0.076
Neg logit contributions
os-0.214
o-0.206
ne-0.204
oing-0.200
ark-0.199
Tokenag
Token ID363
Feature activation+8.70e-02

Pos logit contributions
os+0.238
oing+0.227
ark+0.226
umping+0.217
o+0.217
Neg logit contributions
 chores-0.273
 plan-0.270
 same-0.262
 urge-0.256
 idea-0.248
Tokenull
Token ID724
Feature activation+2.41e-01

Pos logit contributions
os+0.123
ark+0.121
oing+0.119
round+0.113
aron+0.113
Neg logit contributions
 plan-0.253
 same-0.246
 chores-0.244
 way-0.238
 urge-0.238
Tokens
Token ID82
Feature activation-5.97e-02

Pos logit contributions
os+0.666
ark+0.642
oing+0.632
o+0.624
umping+0.618
Neg logit contributions
 chores-0.411
 plan-0.396
 same-0.393
 urge-0.370
 way-0.369
Token suddenly
Token ID6451
Feature activation-4.20e-02

Pos logit contributions
 chores+0.113
 plan+0.106
 same+0.106
 urge+0.099
 way+0.099
Neg logit contributions
os-0.157
oing-0.148
ark-0.147
o-0.147
umping-0.143
Token became
Token ID2627
Feature activation-1.08e-01

Pos logit contributions
 chores+0.080
 plan+0.075
 same+0.075
 urge+0.071
 way+0.070
Neg logit contributions
os-0.110
oing-0.103
o-0.103
ark-0.102
umping-0.101
Token very
Token ID845
Feature activation-5.64e-02

Pos logit contributions
 chores+0.146
 plan+0.137
 urge+0.130
 same+0.126
 way+0.123
Neg logit contributions
os-0.311
o-0.309
umping-0.293
oing-0.287
es-0.277
Token aggressive
Token ID8361
Feature activation-2.93e-03

Pos logit contributions
 chores+0.077
 plan+0.070
 way+0.067
 urge+0.064
 same+0.064
Neg logit contributions
os-0.167
o-0.162
oing-0.158
ark-0.150
umping-0.148
Token nearby
Token ID6716
Feature activation-7.33e-03

Pos logit contributions
os+0.575
ark+0.570
umping+0.543
oing+0.539
enny+0.535
Neg logit contributions
 same-0.410
 chores-0.408
 plan-0.407
 start-0.404
 way-0.401
Token.
Token ID13
Feature activation+8.15e-02

Pos logit contributions
 chores+0.016
 plan+0.016
 same+0.016
 urge+0.015
 way+0.015
Neg logit contributions
os-0.017
ark-0.016
oing-0.016
o-0.015
umping-0.015
Token The
Token ID383
Feature activation-4.95e-02

Pos logit contributions
os+0.212
oing+0.199
ark+0.199
o+0.198
umping+0.195
Neg logit contributions
 chores-0.157
 plan-0.149
 same-0.146
 urge-0.139
 way-0.138
Token se
Token ID384
Feature activation+1.22e-01

Pos logit contributions
 chores+0.077
 plan+0.063
 same+0.060
 urge+0.058
 temptation+0.056
Neg logit contributions
os-0.136
ne-0.131
o-0.131
oing-0.129
v-0.127
Tokenag
Token ID363
Feature activation+8.57e-02

Pos logit contributions
os+0.253
oing+0.243
ark+0.232
enn+0.229
o+0.228
Neg logit contributions
 plan-0.277
 chores-0.273
 same-0.269
 urge-0.265
 idea-0.264
Tokenull
Token ID724
Feature activation+2.41e-01

Pos logit contributions
os+0.131
oing+0.130
ark+0.129
aron+0.123
umping+0.121
Neg logit contributions
 plan-0.233
 same-0.227
 way-0.220
 urge-0.219
 start-0.212
Token swoop
Token ID38527
Feature activation+7.92e-02

Pos logit contributions
ark+0.673
os+0.671
umping+0.644
oing+0.642
enny+0.638
Neg logit contributions
 plan-0.372
 same-0.356
 way-0.351
 urge-0.349
 start-0.349
Tokened
Token ID276
Feature activation-5.56e-02

Pos logit contributions
os+0.097
ark+0.091
aron+0.090
oing+0.089
ne+0.084
Neg logit contributions
 same-0.240
 start-0.236
 chores-0.229
 plan-0.229
 way-0.226
Token down
Token ID866
Feature activation-5.15e-02

Pos logit contributions
 chores+0.106
 plan+0.100
 same+0.098
 urge+0.093
 way+0.092
Neg logit contributions
os-0.146
oing-0.138
ark-0.138
o-0.137
umping-0.134
Token to
Token ID284
Feature activation-1.14e-01

Pos logit contributions
 chores+0.124
 plan+0.118
 same+0.116
 urge+0.111
 way+0.110
Neg logit contributions
os-0.110
oing-0.102
o-0.102
ark-0.102
umping-0.099
Token take
Token ID1011
Feature activation-7.74e-02

Pos logit contributions
 chores+0.141
 plan+0.125
 same+0.122
 way+0.111
 urge+0.109
Neg logit contributions
os-0.377
ark-0.360
oing-0.359
o-0.358
umping-0.347
Token away
Token ID1497
Feature activation-4.49e-02

Pos logit contributions
 chores+0.039
 same+0.037
 plan+0.036
 way+0.034
 start+0.034
Neg logit contributions
os-0.078
oing-0.077
ark-0.076
o-0.074
umping-0.074
Token all
Token ID477
Feature activation-1.86e-02

Pos logit contributions
 chores+0.094
 plan+0.091
 same+0.090
 way+0.086
 urge+0.085
Neg logit contributions
os-0.109
oing-0.103
ark-0.103
o-0.101
umping-0.100
Token the
Token ID262
Feature activation-7.84e-02

Pos logit contributions
 chores+0.033
 same+0.030
 plan+0.030
 urge+0.028
 way+0.027
Neg logit contributions
os-0.052
o-0.048
ark-0.048
oing-0.047
umping-0.046
Token se
Token ID384
Feature activation+1.16e-01

Pos logit contributions
 chores+0.088
 plan+0.075
 same+0.074
 urge+0.071
 start+0.070
Neg logit contributions
os-0.236
ne-0.235
ark-0.233
oing-0.228
o-0.228
Tokenag
Token ID363
Feature activation+8.84e-02

Pos logit contributions
os+0.257
oing+0.245
ark+0.240
umping+0.236
enn+0.233
Neg logit contributions
 chores-0.252
 plan-0.248
 same-0.243
 urge-0.233
 idea-0.229
Tokenull
Token ID724
Feature activation+2.41e-01

Pos logit contributions
oing+0.132
os+0.129
round+0.126
ark+0.125
aron+0.125
Neg logit contributions
 plan-0.242
 same-0.234
 chores-0.234
 way-0.233
 urge-0.224
Tokens
Token ID82
Feature activation-5.34e-02

Pos logit contributions
os+0.447
ark+0.444
umping+0.421
oing+0.415
enny+0.408
Neg logit contributions
 chores-0.595
 way-0.587
 plan-0.586
 same-0.586
 start-0.568
Token.
Token ID13
Feature activation+7.52e-02

Pos logit contributions
 chores+0.114
 plan+0.110
 same+0.109
 urge+0.103
 way+0.103
Neg logit contributions
os-0.126
ark-0.119
oing-0.119
o-0.117
umping-0.115
Token
Token ID198
Feature activation+7.50e-02

Pos logit contributions
os+0.208
oing+0.201
ark+0.200
umping+0.195
o+0.194
Neg logit contributions
 chores-0.129
 same-0.125
 plan-0.122
 way-0.117
 urge-0.113
Token
Token ID198
Feature activation+7.42e-02

Pos logit contributions
os+0.201
oing+0.197
umping+0.197
ark+0.190
round+0.184
Neg logit contributions
 chores-0.129
 plan-0.125
 same-0.120
 bad-0.114
 other-0.113
TokenSuddenly
Token ID38582
Feature activation-7.01e-03

Pos logit contributions
os+0.193
oing+0.181
umping+0.179
ark+0.177
o+0.175
Neg logit contributions
 chores-0.143
 plan-0.136
 same-0.133
 start-0.125
 bad-0.124
Token said
Token ID531
Feature activation-1.06e-02

Pos logit contributions
os+0.036
ark+0.034
o+0.034
oing+0.033
umping+0.032
Neg logit contributions
 chores-0.028
 same-0.026
 plan-0.026
 urge-0.024
 way-0.024
Token.
Token ID13
Feature activation+7.83e-02

Pos logit contributions
 chores+0.023
 plan+0.023
 same+0.022
 urge+0.021
 way+0.021
Neg logit contributions
os-0.024
ark-0.023
oing-0.023
o-0.023
umping-0.022
Token
Token ID198
Feature activation+8.03e-02

Pos logit contributions
os+0.228
ark+0.221
oing+0.219
umping+0.215
o+0.213
Neg logit contributions
 chores-0.120
 same-0.119
 plan-0.115
 urge-0.108
 way-0.108
Token
Token ID198
Feature activation+7.97e-02

Pos logit contributions
os+0.224
umping+0.220
oing+0.219
ark+0.211
round+0.207
Neg logit contributions
 same-0.126
 chores-0.124
 plan-0.121
 other-0.118
 way-0.114
TokenThe
Token ID464
Feature activation-4.81e-02

Pos logit contributions
os+0.206
o+0.191
oing+0.191
ark+0.189
umping+0.189
Neg logit contributions
 chores-0.155
 same-0.151
 plan-0.150
 urge-0.139
 start-0.139
Token maid
Token ID25920
Feature activation+2.40e-01

Pos logit contributions
 chores+0.085
 plan+0.076
 same+0.072
 urge+0.070
 start+0.069
Neg logit contributions
os-0.121
ark-0.119
ne-0.117
o-0.116
oing-0.116
Token smiled
Token ID13541
Feature activation+1.49e-03

Pos logit contributions
oing+0.665
ark+0.664
umping+0.634
round+0.631
os+0.630
Neg logit contributions
 plan-0.396
 chores-0.360
 same-0.352
 urge-0.348
 start-0.347
Token.
Token ID13
Feature activation+7.73e-02

Pos logit contributions
os+0.004
ark+0.004
oing+0.004
o+0.003
umping+0.003
Neg logit contributions
 same-0.003
 chores-0.003
 plan-0.003
 urge-0.003
 way-0.003
Token "
Token ID366
Feature activation+5.03e-02

Pos logit contributions
os+0.252
ark+0.244
oing+0.244
o+0.238
umping+0.237
Neg logit contributions
 chores-0.092
 same-0.090
 plan-0.087
 way-0.079
 urge-0.078
TokenYes
Token ID5297
Feature activation+2.84e-02

Pos logit contributions
os+0.139
oing+0.131
umping+0.128
ark+0.124
o+0.123
Neg logit contributions
 same-0.090
 chores-0.083
 plan-0.081
 urge-0.078
 other-0.077
Token,
Token ID11
Feature activation-3.48e-02

Pos logit contributions
os+0.079
ark+0.075
oing+0.075
o+0.074
umping+0.073
Neg logit contributions
 chores-0.049
 plan-0.047
 same-0.047
 urge-0.043
 way-0.043
Token.
Token ID13
Feature activation+7.61e-02

Pos logit contributions
os+0.026
oing+0.025
ark+0.025
o+0.025
umping+0.024
Neg logit contributions
 chores-0.016
 same-0.015
 plan-0.015
 urge-0.014
 way-0.014
Token
Token ID220
Feature activation+7.35e-02

Pos logit contributions
os+0.219
oing+0.215
ark+0.208
umping+0.205
o+0.195
Neg logit contributions
 same-0.109
 chores-0.107
 plan-0.101
 way-0.099
 start-0.097
Token
Token ID198
Feature activation+7.84e-02

Pos logit contributions
oing+0.195
os+0.188
ark+0.186
umping+0.184
round+0.179
Neg logit contributions
 chores-0.130
 same-0.123
 plan-0.115
 way-0.110
 start-0.105
Token
Token ID198
Feature activation+7.91e-02

Pos logit contributions
os+0.221
oing+0.214
umping+0.208
ark+0.202
ist+0.197
Neg logit contributions
 chores-0.124
 plan-0.116
 same-0.115
 other-0.109
 grown-0.106
TokenThe
Token ID464
Feature activation-4.85e-02

Pos logit contributions
os+0.201
oing+0.183
umping+0.176
o+0.172
ark+0.172
Neg logit contributions
 chores-0.158
 plan-0.148
 same-0.147
 bad-0.138
 start-0.136
Token flaming
Token ID39518
Feature activation+2.40e-01

Pos logit contributions
 chores+0.091
 plan+0.082
 same+0.079
 urge+0.076
 start+0.076
Neg logit contributions
os-0.127
o-0.122
oing-0.120
ark-0.119
ne-0.116
Tokeno
Token ID78
Feature activation+1.31e-01

Pos logit contributions
oing+0.196
ark+0.194
umping+0.183
and+0.179
round+0.177
Neg logit contributions
 plan-0.735
 same-0.697
 way-0.694
 other-0.686
 only-0.683
Token nodded
Token ID14464
Feature activation-2.34e-02

Pos logit contributions
os+0.359
ark+0.356
oing+0.355
and+0.347
umping+0.345
Neg logit contributions
 plan-0.210
 same-0.206
 chores-0.200
 way-0.196
 start-0.194
Token its
Token ID663
Feature activation-6.76e-02

Pos logit contributions
 chores+0.045
 plan+0.043
 same+0.043
 urge+0.040
 way+0.040
Neg logit contributions
os-0.061
ark-0.058
oing-0.058
o-0.057
umping-0.056
Token head
Token ID1182
Feature activation+6.27e-02

Pos logit contributions
 chores+0.098
 plan+0.077
 same+0.075
 laundry+0.075
 bad+0.070
Neg logit contributions
oing-0.198
ark-0.195
o-0.195
aron-0.193
os-0.192
Token and
Token ID290
Feature activation-4.10e-02

Pos logit contributions
os+0.157
ark+0.149
oing+0.146
o+0.145
umping+0.144
Neg logit contributions
 chores-0.121
 same-0.120
 plan-0.117
 urge-0.114
 start-0.113
Token,
Token ID11
Feature activation-1.76e-02

Pos logit contributions
os+0.323
ark+0.310
oing+0.309
o+0.302
umping+0.300
Neg logit contributions
 chores-0.182
 plan-0.174
 same-0.172
 urge-0.161
 way-0.161
Token Sally
Token ID25737
Feature activation+1.11e-01

Pos logit contributions
 chores+0.023
 plan+0.021
 same+0.021
 urge+0.019
 way+0.019
Neg logit contributions
os-0.057
ark-0.055
oing-0.054
o-0.054
umping-0.053
Token the
Token ID262
Feature activation-6.90e-02

Pos logit contributions
os+0.312
ark+0.295
o+0.295
oing+0.295
umping+0.288
Neg logit contributions
 chores-0.192
 same-0.180
 plan-0.179
 urge-0.167
 way-0.165
Token Se
Token ID1001
Feature activation+7.37e-02

Pos logit contributions
 chores+0.092
 her+0.084
 temptation+0.078
 plan+0.077
 avail+0.074
Neg logit contributions
os-0.178
o-0.177
oing-0.171
ne-0.166
ark-0.165
Tokenag
Token ID363
Feature activation+8.61e-02

Pos logit contributions
os+0.148
oing+0.140
ark+0.137
o+0.136
umping+0.133
Neg logit contributions
 chores-0.185
 same-0.181
 plan-0.178
 urge-0.170
 start-0.168
Tokenull
Token ID724
Feature activation+2.40e-01

Pos logit contributions
os+0.122
oing+0.117
ark+0.114
aron+0.111
umping+0.109
Neg logit contributions
 plan-0.254
 same-0.251
 chores-0.247
 way-0.242
 urge-0.242
Token,
Token ID11
Feature activation-1.81e-02

Pos logit contributions
os+0.648
ark+0.622
oing+0.614
umping+0.603
o+0.602
Neg logit contributions
 chores-0.429
 plan-0.413
 same-0.409
 way-0.389
 urge-0.388
Token heard
Token ID2982
Feature activation-6.05e-02

Pos logit contributions
 chores+0.043
 plan+0.040
 same+0.039
 task+0.037
 urge+0.037
Neg logit contributions
os-0.041
o-0.039
oing-0.036
umping-0.035
ark-0.035
Token him
Token ID683
Feature activation+1.94e-03

Pos logit contributions
 chores+0.085
 plan+0.080
 same+0.079
 urge+0.073
 way+0.072
Neg logit contributions
os-0.189
ark-0.180
oing-0.180
o-0.179
umping-0.176
Token and
Token ID290
Feature activation-4.18e-02

Pos logit contributions
os+0.006
ark+0.006
oing+0.006
o+0.006
umping+0.006
Neg logit contributions
 chores-0.003
 plan-0.003
 same-0.003
 urge-0.002
 way-0.002
Token came
Token ID1625
Feature activation-4.90e-02

Pos logit contributions
 chores+0.094
 same+0.085
 plan+0.081
 temptation+0.078
 way+0.077
Neg logit contributions
os-0.103
o-0.099
oing-0.092
ark-0.091
es-0.089
Token came
Token ID1625
Feature activation-3.78e-02

Pos logit contributions
aron+0.670
ark+0.653
umbs+0.650
oing+0.645
and+0.638
Neg logit contributions
 plan-0.300
 go-0.292
 urge-0.291
 same-0.290
 start-0.289
Token to
Token ID284
Feature activation-1.22e-01

Pos logit contributions
 same+0.060
 chores+0.059
 plan+0.058
 way+0.055
 urge+0.053
Neg logit contributions
os-0.108
oing-0.104
ark-0.104
umping-0.102
aron-0.101
Token town
Token ID3240
Feature activation+9.26e-02

Pos logit contributions
 chores+0.144
 plan+0.119
 same+0.118
 urge+0.117
 way+0.113
Neg logit contributions
os-0.394
o-0.384
ark-0.375
oing-0.365
ne-0.358
Token.
Token ID13
Feature activation+7.07e-02

Pos logit contributions
os+0.221
ark+0.209
oing+0.208
umping+0.206
o+0.200
Neg logit contributions
 chores-0.190
 plan-0.188
 same-0.187
 way-0.178
 urge-0.176
Token The
Token ID383
Feature activation-5.08e-02

Pos logit contributions
os+0.194
ark+0.188
oing+0.187
umping+0.183
o+0.180
Neg logit contributions
 chores-0.121
 same-0.118
 plan-0.116
 urge-0.110
 way-0.109
Token clown
Token ID25573
Feature activation+2.40e-01

Pos logit contributions
 chores+0.120
 plan+0.115
 same+0.113
 urge+0.108
 way+0.108
Neg logit contributions
os-0.109
ark-0.102
oing-0.102
o-0.101
umping-0.098
Token had
Token ID550
Feature activation-8.18e-02

Pos logit contributions
oing+0.752
ark+0.741
umbs+0.725
umping+0.724
and+0.717
Neg logit contributions
 plan-0.295
 same-0.287
 way-0.284
 bad-0.266
 chores-0.264
Token a
Token ID257
Feature activation-5.07e-02

Pos logit contributions
 chores+0.106
 plan+0.104
 urge+0.094
 temptation+0.091
 same+0.084
Neg logit contributions
os-0.249
o-0.247
ark-0.232
umping-0.231
v-0.230
Token big
Token ID1263
Feature activation-1.20e-02

Pos logit contributions
 chores+0.058
 temptation+0.052
 same+0.046
 urge+0.046
 start+0.046
Neg logit contributions
o-0.152
oing-0.148
os-0.145
ark-0.144
round-0.141
Token red
Token ID2266
Feature activation+4.24e-02

Pos logit contributions
 chores+0.020
 same+0.018
 temptation+0.017
 plan+0.016
 start+0.016
Neg logit contributions
os-0.030
o-0.030
oing-0.029
umping-0.029
round-0.029
Token nose
Token ID9686
Feature activation+1.32e-01

Pos logit contributions
os+0.115
oing+0.111
o+0.111
ark+0.110
umping+0.108
Neg logit contributions
 chores-0.072
 plan-0.067
 same-0.067
 way-0.062
 urge-0.062
Token to
Token ID284
Feature activation-1.24e-01

Pos logit contributions
 chores+0.070
 plan+0.066
 same+0.063
 urge+0.061
 way+0.058
Neg logit contributions
os-0.146
ark-0.138
o-0.137
oing-0.135
umping-0.134
Token the
Token ID262
Feature activation-8.53e-02

Pos logit contributions
 chores+0.146
 same+0.134
 plan+0.133
 urge+0.124
 way+0.115
Neg logit contributions
os-0.404
o-0.378
ark-0.375
oing-0.371
umping-0.364
Token noisy
Token ID31210
Feature activation+3.19e-02

Pos logit contributions
 chores+0.112
 plan+0.102
 same+0.098
 urge+0.096
 start+0.090
Neg logit contributions
os-0.252
oing-0.241
ark-0.241
o-0.241
ne-0.236
Token se
Token ID384
Feature activation+1.16e-01

Pos logit contributions
os+0.095
ark+0.090
oing+0.090
o+0.089
umping+0.088
Neg logit contributions
 chores-0.049
 plan-0.046
 same-0.045
 urge-0.042
 way-0.041
Tokenag
Token ID363
Feature activation+8.19e-02

Pos logit contributions
os+0.218
ark+0.206
oing+0.203
o+0.196
umping+0.195
Neg logit contributions
 chores-0.293
 plan-0.290
 same-0.286
 urge-0.274
 way-0.268
Tokenull
Token ID724
Feature activation+2.39e-01

Pos logit contributions
oing+0.129
os+0.128
ark+0.124
umping+0.123
round+0.120
Neg logit contributions
 plan-0.219
 same-0.216
 chores-0.214
 way-0.208
 urge-0.203
Tokens
Token ID82
Feature activation-5.84e-02

Pos logit contributions
os+0.512
ark+0.505
umping+0.486
oing+0.483
enny+0.472
Neg logit contributions
 chores-0.525
 same-0.521
 plan-0.515
 way-0.503
 start-0.492
Token outside
Token ID2354
Feature activation+2.17e-02

Pos logit contributions
 chores+0.121
 plan+0.115
 same+0.115
 urge+0.109
 way+0.108
Neg logit contributions
os-0.142
ark-0.134
oing-0.133
o-0.132
umping-0.129
Token.
Token ID13
Feature activation+7.34e-02

Pos logit contributions
os+0.050
ark+0.046
o+0.046
oing+0.046
umping+0.045
Neg logit contributions
 chores-0.048
 plan-0.046
 same-0.045
 urge-0.044
 way-0.043
Token Jimmy
Token ID12963
Feature activation+9.83e-02

Pos logit contributions
os+0.201
oing+0.197
ark+0.195
umping+0.189
o+0.185
Neg logit contributions
 chores-0.121
 same-0.117
 plan-0.114
 way-0.108
 urge-0.106
Token looked
Token ID3114
Feature activation-4.06e-02

Pos logit contributions
os+0.264
oing+0.257
ark+0.256
umping+0.245
o+0.245
Neg logit contributions
 plan-0.168
 chores-0.167
 same-0.166
 way-0.157
 urge-0.156
Token and
Token ID290
Feature activation-5.06e-02

Pos logit contributions
os+0.140
oing+0.133
ark+0.132
o+0.132
umping+0.129
Neg logit contributions
 chores-0.088
 plan-0.083
 same-0.082
 urge-0.077
 way-0.076
Token that
Token ID326
Feature activation-6.28e-02

Pos logit contributions
 chores+0.080
 plan+0.073
 same+0.073
 urge+0.070
 temptation+0.067
Neg logit contributions
os-0.144
o-0.137
ark-0.127
oing-0.127
ne-0.125
Token the
Token ID262
Feature activation-8.14e-02

Pos logit contributions
 chores+0.088
 same+0.085
 plan+0.082
 urge+0.081
 temptation+0.073
Neg logit contributions
os-0.187
o-0.177
ark-0.165
es-0.165
oing-0.165
Token se
Token ID384
Feature activation+1.12e-01

Pos logit contributions
 chores+0.117
 plan+0.107
 same+0.105
 urge+0.103
 start+0.099
Neg logit contributions
os-0.229
o-0.223
oing-0.219
ark-0.217
ne-0.215
Tokenag
Token ID363
Feature activation+8.12e-02

Pos logit contributions
os+0.209
ark+0.199
oing+0.195
o+0.190
umping+0.188
Neg logit contributions
 chores-0.292
 plan-0.286
 same-0.283
 urge-0.272
 way-0.266
Tokenull
Token ID724
Feature activation+2.39e-01

Pos logit contributions
os+0.118
oing+0.118
ark+0.115
umping+0.113
aron+0.108
Neg logit contributions
 plan-0.231
 same-0.226
 chores-0.221
 way-0.217
 urge-0.214
Tokens
Token ID82
Feature activation-5.54e-02

Pos logit contributions
os+0.545
ark+0.541
oing+0.525
umping+0.515
o+0.507
Neg logit contributions
 chores-0.501
 same-0.493
 plan-0.492
 way-0.467
 urge-0.467
Token would
Token ID561
Feature activation-1.07e-01

Pos logit contributions
 chores+0.098
 plan+0.092
 same+0.092
 urge+0.087
 way+0.085
Neg logit contributions
os-0.152
o-0.142
oing-0.142
ark-0.142
umping-0.138
Token try
Token ID1949
Feature activation-7.60e-02

Pos logit contributions
 chores+0.138
 same+0.122
 plan+0.116
 way+0.112
 urge+0.107
Neg logit contributions
os-0.328
ark-0.315
o-0.314
oing-0.313
ne-0.300
Token to
Token ID284
Feature activation-1.46e-01

Pos logit contributions
 chores+0.133
 plan+0.126
 same+0.126
 urge+0.117
 way+0.117
Neg logit contributions
os-0.210
ark-0.200
oing-0.199
o-0.197
umping-0.194
Token take
Token ID1011
Feature activation-8.77e-02

Pos logit contributions
 chores+0.189
 same+0.169
 plan+0.157
 way+0.149
 urge+0.147
Neg logit contributions
os-0.446
ark-0.428
oing-0.427
o-0.426
ne-0.417
TokenS
Token ID50
Feature activation+1.15e-01

Pos logit contributions
umping+0.089
oing+0.088
os+0.087
enn+0.087
ark+0.087
Neg logit contributions
 same-0.092
 her-0.086
 chores-0.084
 rain-0.083
 other-0.083
Tokenue
Token ID518
Feature activation+7.82e-02

Pos logit contributions
os+0.232
ark+0.221
oing+0.219
o+0.207
and+0.206
Neg logit contributions
 chores-0.282
 same-0.275
 plan-0.262
 start-0.256
 work-0.254
Token"
Token ID1
Feature activation+6.46e-02

Pos logit contributions
os+0.180
oing+0.176
ark+0.174
umping+0.171
o+0.164
Neg logit contributions
 chores-0.166
 same-0.161
 plan-0.158
 start-0.149
 way-0.148
Token on
Token ID319
Feature activation-7.34e-02

Pos logit contributions
os+0.178
oing+0.178
ark+0.172
umping+0.169
enn+0.167
Neg logit contributions
 same-0.105
 chores-0.104
 way-0.096
 plan-0.094
 bad-0.093
Token the
Token ID262
Feature activation-6.68e-02

Pos logit contributions
 chores+0.075
 plan+0.074
 same+0.068
 urge+0.066
 way+0.058
Neg logit contributions
os-0.247
ark-0.237
oing-0.234
umping-0.234
o-0.233
Token veil
Token ID30615
Feature activation+2.38e-01

Pos logit contributions
 chores+0.114
 plan+0.110
 urge+0.095
 idea+0.094
 same+0.091
Neg logit contributions
oing-0.156
ark-0.154
ne-0.154
and-0.152
aron-0.152
Token.
Token ID13
Feature activation+7.08e-02

Pos logit contributions
os+0.562
ark+0.531
oing+0.530
o+0.523
umping+0.514
Neg logit contributions
 chores-0.510
 plan-0.489
 same-0.487
 urge-0.462
 way-0.461
Token She
Token ID1375
Feature activation-3.55e-03

Pos logit contributions
os+0.199
ark+0.189
oing+0.189
o+0.187
umping+0.184
Neg logit contributions
 chores-0.122
 plan-0.116
 same-0.115
 urge-0.107
 way-0.107
Token was
Token ID373
Feature activation-1.13e-01

Pos logit contributions
 plan+0.006
 chores+0.006
 same+0.006
 start+0.006
 way+0.006
Neg logit contributions
oing-0.010
os-0.009
ark-0.009
umping-0.009
and-0.009
Token very
Token ID845
Feature activation-5.76e-02

Pos logit contributions
 chores+0.191
 plan+0.176
 task+0.166
 urge+0.164
 temptation+0.163
Neg logit contributions
o-0.298
os-0.290
aron-0.281
bug-0.274
and-0.270
Token happy
Token ID3772
Feature activation-1.51e-02

Pos logit contributions
 chores+0.095
 way+0.082
 same+0.082
 plan+0.081
 start+0.078
Neg logit contributions
os-0.158
o-0.156
oing-0.153
ark-0.149
and-0.147
Token set
Token ID900
Feature activation-5.64e-02

Pos logit contributions
 chores+0.066
 same+0.062
 plan+0.061
 urge+0.053
 way+0.050
Neg logit contributions
os-0.242
o-0.231
oing-0.229
ark-0.229
and-0.229
Token the
Token ID262
Feature activation-9.13e-02

Pos logit contributions
 plan+0.048
 same+0.046
 chores+0.046
 urge+0.043
 way+0.038
Neg logit contributions
os-0.201
o-0.192
oing-0.190
ark-0.190
umping-0.187
Token table
Token ID3084
Feature activation+1.02e-01

Pos logit contributions
 chores+0.118
 plan+0.111
 urge+0.111
 her+0.108
 sorry+0.108
Neg logit contributions
oing-0.226
and-0.224
eddy-0.223
o-0.216
umping-0.215
Token and
Token ID290
Feature activation-4.63e-02

Pos logit contributions
os+0.248
o+0.228
oing+0.223
ark+0.222
and+0.219
Neg logit contributions
 chores-0.205
 same-0.202
 plan-0.198
 urge-0.194
 way-0.186
Token being
Token ID852
Feature activation-6.94e-02

Pos logit contributions
 chores+0.091
 same+0.088
 plan+0.081
 temptation+0.081
 urge+0.081
Neg logit contributions
os-0.114
o-0.114
es-0.102
and-0.099
oing-0.099
Token such
Token ID884
Feature activation-1.61e-01

Pos logit contributions
 chores+0.088
 plan+0.082
 same+0.080
 urge+0.074
 way+0.073
Neg logit contributions
os-0.224
o-0.216
oing-0.214
ark-0.213
umping-0.211
Token a
Token ID257
Feature activation-7.05e-02

Pos logit contributions
 chores+0.214
 way+0.205
 plan+0.204
 same+0.195
 urge+0.187
Neg logit contributions
o-0.431
and-0.420
os-0.416
ark-0.416
sm-0.416
Token good
Token ID922
Feature activation-4.31e-02

Pos logit contributions
 chores+0.066
 same+0.058
 plan+0.053
 urge+0.050
 start+0.047
Neg logit contributions
os-0.237
o-0.237
oing-0.235
ark-0.232
umping-0.226
Token helper
Token ID31904
Feature activation+1.40e-01

Pos logit contributions
 chores+0.065
 same+0.059
 plan+0.058
 start+0.055
 way+0.054
Neg logit contributions
os-0.123
o-0.120
oing-0.119
umping-0.119
ark-0.119
Token.
Token ID13
Feature activation+7.79e-02

Pos logit contributions
os+0.343
oing+0.334
ark+0.328
umping+0.319
o+0.315
Neg logit contributions
 chores-0.282
 plan-0.272
 same-0.268
 way-0.260
 urge-0.255
Token Lily
Token ID20037
Feature activation+7.96e-02

Pos logit contributions
os+0.219
ark+0.214
oing+0.211
umping+0.205
o+0.202
Neg logit contributions
 chores-0.130
 same-0.128
 plan-0.121
 way-0.119
 urge-0.115
Token as
Token ID355
Feature activation-7.13e-02

Pos logit contributions
os+0.220
ark+0.212
oing+0.210
umping+0.206
o+0.198
Neg logit contributions
 chores-0.178
 plan-0.172
 same-0.165
 way-0.161
 urge-0.160
Token a
Token ID257
Feature activation-7.40e-02

Pos logit contributions
 chores+0.080
 plan+0.077
 same+0.075
 way+0.071
 urge+0.070
Neg logit contributions
os-0.236
o-0.228
ark-0.226
oing-0.224
umping-0.220
Token reward
Token ID6721
Feature activation+5.10e-02

Pos logit contributions
 chores+0.109
 same+0.101
 plan+0.093
 urge+0.091
 way+0.088
Neg logit contributions
os-0.216
oing-0.214
ark-0.210
o-0.210
umping-0.202
Token for
Token ID329
Feature activation-1.07e-01

Pos logit contributions
os+0.123
oing+0.117
ark+0.116
umping+0.113
o+0.113
Neg logit contributions
 chores-0.107
 plan-0.103
 same-0.101
 way-0.097
 urge-0.096
Token being
Token ID852
Feature activation-7.03e-02

Pos logit contributions
 same+0.174
 plan+0.166
 urge+0.159
 chores+0.154
 idea+0.148
Neg logit contributions
o-0.269
os-0.266
es-0.265
ark-0.257
med-0.251
Token such
Token ID884
Feature activation-1.61e-01

Pos logit contributions
 chores+0.098
 plan+0.091
 same+0.090
 urge+0.083
 way+0.082
Neg logit contributions
os-0.221
o-0.211
ark-0.210
oing-0.210
umping-0.206
Token a
Token ID257
Feature activation-6.85e-02

Pos logit contributions
 chores+0.231
 plan+0.211
 storm+0.205
 same+0.203
 urge+0.201
Neg logit contributions
o-0.417
ark-0.405
and-0.404
aron-0.398
os-0.392
Token good
Token ID922
Feature activation-3.89e-02

Pos logit contributions
 chores+0.081
 same+0.067
 plan+0.061
 urge+0.061
 start+0.060
Neg logit contributions
ark-0.217
o-0.213
oing-0.213
os-0.211
oc-0.206
Token dog
Token ID3290
Feature activation+1.39e-01

Pos logit contributions
 chores+0.069
 same+0.062
 plan+0.061
 start+0.058
 urge+0.057
Neg logit contributions
os-0.102
ark-0.101
o-0.100
oing-0.099
umping-0.097
Token.
Token ID13
Feature activation+7.58e-02

Pos logit contributions
os+0.354
oing+0.350
ark+0.340
umping+0.337
aron+0.326
Neg logit contributions
 chores-0.263
 same-0.248
 plan-0.246
 way-0.243
 bad-0.241
Token Max
Token ID5436
Feature activation+8.47e-02

Pos logit contributions
os+0.233
ark+0.227
oing+0.226
umping+0.220
o+0.215
Neg logit contributions
 chores-0.106
 same-0.101
 plan-0.098
 way-0.094
 start-0.089
Token a
Token ID257
Feature activation-7.74e-02

Pos logit contributions
 chores+0.088
 plan+0.086
 same+0.083
 urge+0.080
 way+0.076
Neg logit contributions
os-0.252
o-0.243
ark-0.241
oing-0.239
umping-0.235
Token thank
Token ID5875
Feature activation-3.45e-02

Pos logit contributions
 chores+0.110
 same+0.101
 plan+0.095
 urge+0.091
 start+0.086
Neg logit contributions
os-0.225
oing-0.223
o-0.220
ark-0.218
ne-0.211
Token you
Token ID345
Feature activation-2.11e-02

Pos logit contributions
 chores+0.048
 plan+0.046
 same+0.045
 urge+0.042
 way+0.042
Neg logit contributions
os-0.107
ark-0.103
oing-0.102
o-0.101
umping-0.099
Token for
Token ID329
Feature activation-1.11e-01

Pos logit contributions
 chores+0.038
 same+0.036
 plan+0.036
 way+0.033
 urge+0.033
Neg logit contributions
os-0.057
oing-0.055
ark-0.054
o-0.053
umping-0.053
Token being
Token ID852
Feature activation-7.12e-02

Pos logit contributions
 same+0.186
 chores+0.179
 plan+0.175
 urge+0.173
 task+0.164
Neg logit contributions
o-0.262
and-0.255
ark-0.255
os-0.253
es-0.250
Token such
Token ID884
Feature activation-1.60e-01

Pos logit contributions
 chores+0.102
 plan+0.095
 same+0.093
 urge+0.086
 way+0.086
Neg logit contributions
os-0.221
ark-0.211
o-0.210
oing-0.210
umping-0.206
Token a
Token ID257
Feature activation-6.81e-02

Pos logit contributions
 chores+0.214
 plan+0.197
 same+0.190
 way+0.188
 urge+0.187
Neg logit contributions
o-0.445
ark-0.441
os-0.432
aron-0.429
and-0.428
Token good
Token ID922
Feature activation-4.24e-02

Pos logit contributions
 chores+0.071
 same+0.060
 plan+0.057
 urge+0.054
 start+0.052
Neg logit contributions
os-0.223
o-0.223
ark-0.222
oing-0.220
umping-0.213
Token friend
Token ID1545
Feature activation+8.24e-02

Pos logit contributions
 chores+0.072
 same+0.066
 plan+0.065
 start+0.061
 urge+0.061
Neg logit contributions
os-0.116
ark-0.113
o-0.112
oing-0.111
umping-0.110
Token.
Token ID13
Feature activation+7.50e-02

Pos logit contributions
oing+0.223
os+0.217
ark+0.208
umping+0.207
umbs+0.197
Neg logit contributions
 chores-0.144
 way-0.139
 same-0.138
 plan-0.136
 bad-0.134
Token From
Token ID3574
Feature activation-6.05e-02

Pos logit contributions
os+0.207
oing+0.202
ark+0.202
umping+0.196
ne+0.191
Neg logit contributions
 chores-0.125
 same-0.121
 plan-0.119
 way-0.119
 urge-0.113
Token and
Token ID290
Feature activation-5.94e-02

Pos logit contributions
 chores+0.034
 plan+0.033
 same+0.033
 urge+0.033
 bad+0.032
Neg logit contributions
os-0.052
ark-0.048
o-0.047
oing-0.047
umping-0.046
Token thanked
Token ID26280
Feature activation-6.62e-02

Pos logit contributions
 chores+0.127
 same+0.114
 plan+0.111
 urge+0.106
 temptation+0.104
Neg logit contributions
o-0.142
os-0.139
es-0.130
oing-0.130
and-0.125
Token Lily
Token ID20037
Feature activation+7.11e-02

Pos logit contributions
 chores+0.094
 plan+0.088
 same+0.088
 urge+0.080
 way+0.080
Neg logit contributions
os-0.204
ark-0.195
o-0.194
oing-0.193
umping-0.189
Token for
Token ID329
Feature activation-9.98e-02

Pos logit contributions
os+0.205
oing+0.195
ark+0.195
umping+0.192
o+0.190
Neg logit contributions
 chores-0.119
 plan-0.112
 same-0.111
 way-0.106
 urge-0.103
Token being
Token ID852
Feature activation-6.60e-02

Pos logit contributions
 chores+0.176
 same+0.168
 plan+0.159
 urge+0.158
 task+0.155
Neg logit contributions
o-0.245
and-0.234
os-0.230
ark-0.228
es-0.227
Token such
Token ID884
Feature activation-1.60e-01

Pos logit contributions
 chores+0.085
 plan+0.079
 same+0.075
 way+0.070
 urge+0.070
Neg logit contributions
os-0.211
o-0.206
ark-0.201
oing-0.201
umping-0.201
Token a
Token ID257
Feature activation-6.72e-02

Pos logit contributions
 chores+0.235
 way+0.222
 plan+0.216
 same+0.208
 urge+0.201
Neg logit contributions
o-0.407
and-0.388
sm-0.386
it-0.384
ark-0.384
Token good
Token ID922
Feature activation-4.05e-02

Pos logit contributions
 chores+0.066
 same+0.056
 plan+0.049
 urge+0.048
 start+0.048
Neg logit contributions
oing-0.223
o-0.222
os-0.222
ark-0.220
umping-0.213
Token friend
Token ID1545
Feature activation+8.05e-02

Pos logit contributions
 chores+0.071
 same+0.066
 plan+0.065
 urge+0.060
 way+0.060
Neg logit contributions
os-0.111
oing-0.107
ark-0.107
o-0.106
umping-0.104
Token.
Token ID13
Feature activation+7.59e-02

Pos logit contributions
oing+0.226
os+0.225
ark+0.214
umping+0.211
round+0.207
Neg logit contributions
 chores-0.128
 way-0.124
 bad-0.124
 same-0.119
 plan-0.118
Token
Token ID198
Feature activation+8.07e-02

Pos logit contributions
os+0.187
ark+0.180
oing+0.179
umping+0.174
o+0.169
Neg logit contributions
 chores-0.151
 same-0.148
 plan-0.146
 way-0.140
 urge-0.138
Token and
Token ID290
Feature activation-4.36e-02

Pos logit contributions
os+0.058
oing+0.055
ark+0.055
o+0.054
umping+0.054
Neg logit contributions
 chores-0.038
 plan-0.036
 same-0.036
 way-0.034
 urge-0.034
Token praised
Token ID15342
Feature activation-2.93e-02

Pos logit contributions
 chores+0.086
 same+0.080
 plan+0.077
 urge+0.074
 temptation+0.071
Neg logit contributions
os-0.112
o-0.107
ark-0.103
oing-0.103
umping-0.099
Token them
Token ID606
Feature activation+1.06e-02

Pos logit contributions
 chores+0.038
 plan+0.036
 same+0.035
 urge+0.033
 way+0.032
Neg logit contributions
os-0.093
ark-0.089
oing-0.088
o-0.088
umping-0.087
Token for
Token ID329
Feature activation-1.06e-01

Pos logit contributions
os+0.033
oing+0.032
ark+0.032
umping+0.032
o+0.031
Neg logit contributions
 chores-0.015
 same-0.014
 plan-0.013
 way-0.012
 start-0.012
Token being
Token ID852
Feature activation-7.08e-02

Pos logit contributions
 chores+0.162
 same+0.159
 plan+0.150
 urge+0.150
 way+0.143
Neg logit contributions
os-0.289
o-0.284
ark-0.280
and-0.272
aron-0.270
Token such
Token ID884
Feature activation-1.60e-01

Pos logit contributions
 chores+0.104
 plan+0.097
 same+0.096
 urge+0.089
 way+0.088
Neg logit contributions
os-0.216
ark-0.206
o-0.205
oing-0.205
umping-0.201
Token best
Token ID1266
Feature activation-2.77e-02

Pos logit contributions
 chores+0.214
 plan+0.195
 same+0.192
 way+0.189
 urge+0.188
Neg logit contributions
o-0.439
ark-0.434
os-0.429
aron-0.422
and-0.420
Token friends
Token ID2460
Feature activation+9.56e-02

Pos logit contributions
 chores+0.045
 same+0.042
 plan+0.042
 urge+0.039
 way+0.038
Neg logit contributions
os-0.080
ark-0.077
oing-0.076
o-0.076
umping-0.075
Token.
Token ID13
Feature activation+7.64e-02

Pos logit contributions
oing+0.250
os+0.249
umping+0.239
ark+0.238
o+0.232
Neg logit contributions
 chores-0.174
 plan-0.162
 way-0.161
 same-0.159
 bad-0.158
Token From
Token ID3574
Feature activation-5.69e-02

Pos logit contributions
os+0.208
oing+0.200
ark+0.199
o+0.193
umping+0.193
Neg logit contributions
 chores-0.135
 same-0.130
 plan-0.129
 way-0.125
 urge-0.121
Token then
Token ID788
Feature activation-2.48e-02

Pos logit contributions
 chores+0.096
 plan+0.091
 same+0.089
 urge+0.085
 way+0.083
Neg logit contributions
os-0.161
ark-0.154
o-0.153
oing-0.151
umping-0.148
Token proud
Token ID6613
Feature activation-1.69e-02

Pos logit contributions
 chores+0.109
 plan+0.100
 same+0.100
 way+0.092
 urge+0.091
Neg logit contributions
os-0.251
oing-0.241
ark-0.240
o-0.239
umping-0.234
Token of
Token ID286
Feature activation-1.04e-01

Pos logit contributions
 chores+0.035
 plan+0.034
 same+0.034
 way+0.033
 urge+0.032
Neg logit contributions
os-0.041
oing-0.039
ark-0.039
o-0.038
umping-0.037
Token her
Token ID607
Feature activation-5.64e-02

Pos logit contributions
 chores+0.081
 plan+0.076
 same+0.074
 urge+0.066
 way+0.066
Neg logit contributions
os-0.385
ark-0.369
o-0.364
oing-0.362
and-0.356
Token for
Token ID329
Feature activation-9.95e-02

Pos logit contributions
 chores+0.107
 same+0.103
 plan+0.102
 urge+0.096
 way+0.095
Neg logit contributions
os-0.145
oing-0.138
ark-0.138
o-0.136
umping-0.134
Token being
Token ID852
Feature activation-6.72e-02

Pos logit contributions
 chores+0.162
 plan+0.159
 same+0.158
 urge+0.157
 way+0.149
Neg logit contributions
os-0.247
es-0.246
and-0.246
ark-0.245
o-0.238
Token such
Token ID884
Feature activation-1.60e-01

Pos logit contributions
 chores+0.092
 same+0.087
 plan+0.086
 urge+0.079
 way+0.079
Neg logit contributions
os-0.211
ark-0.203
oing-0.202
o-0.199
umping-0.196
Token an
Token ID281
Feature activation-7.15e-02

Pos logit contributions
 chores+0.214
 plan+0.195
 way+0.194
 same+0.193
 urge+0.187
Neg logit contributions
ark-0.432
o-0.429
os-0.420
oing-0.416
aron-0.415
Token obedient
Token ID41652
Feature activation-1.85e-02

Pos logit contributions
 chores+0.088
 plan+0.080
 same+0.079
 way+0.072
 start+0.071
Neg logit contributions
os-0.227
ark-0.223
o-0.218
oing-0.218
umping-0.216
Token and
Token ID290
Feature activation-4.67e-02

Pos logit contributions
 chores+0.034
 same+0.031
 plan+0.031
 start+0.029
 urge+0.029
Neg logit contributions
os-0.048
ark-0.047
o-0.046
oing-0.046
umping-0.045
Token happy
Token ID3772
Feature activation-1.94e-02

Pos logit contributions
 chores+0.073
 plan+0.066
 same+0.066
 urge+0.064
 way+0.061
Neg logit contributions
os-0.132
o-0.130
oing-0.125
ark-0.123
es-0.120
Token little
Token ID1310
Feature activation-1.81e-02

Pos logit contributions
 chores+0.033
 same+0.031
 plan+0.030
 urge+0.028
 start+0.028
Neg logit contributions
os-0.053
ark-0.050
o-0.050
oing-0.050
umping-0.049
Token things
Token ID1243
Feature activation+9.72e-02

Pos logit contributions
 chores+0.101
 same+0.094
 plan+0.093
 urge+0.086
 way+0.082
Neg logit contributions
os-0.289
oing-0.274
ark-0.269
o-0.269
umping-0.269
Token and
Token ID290
Feature activation-4.26e-02

Pos logit contributions
os+0.268
ark+0.257
oing+0.256
o+0.253
umping+0.249
Neg logit contributions
 chores-0.170
 same-0.160
 plan-0.160
 way-0.150
 urge-0.149
Token was
Token ID373
Feature activation-1.23e-01

Pos logit contributions
 chores+0.078
 plan+0.072
 same+0.071
 urge+0.068
 temptation+0.065
Neg logit contributions
os-0.115
o-0.104
oing-0.102
umping-0.101
ark-0.101
Token no
Token ID645
Feature activation-9.86e-02

Pos logit contributions
 chores+0.186
 plan+0.168
 same+0.168
 urge+0.159
 way+0.153
Neg logit contributions
os-0.354
o-0.350
aron-0.327
umping-0.325
oing-0.324
Token longer
Token ID2392
Feature activation-6.98e-02

Pos logit contributions
 chores+0.142
 plan+0.125
 same+0.124
 rain+0.113
 way+0.111
Neg logit contributions
os-0.276
ne-0.274
ark-0.269
aron-0.268
oing-0.266
Token such
Token ID884
Feature activation-1.60e-01

Pos logit contributions
 chores+0.091
 plan+0.083
 same+0.080
 urge+0.074
 way+0.072
Neg logit contributions
os-0.222
oing-0.210
o-0.210
umping-0.209
ark-0.207
Token a
Token ID257
Feature activation-7.02e-02

Pos logit contributions
 chores+0.214
 way+0.190
 plan+0.188
 hospital+0.182
 same+0.182
Neg logit contributions
and-0.418
ark-0.414
aron-0.411
ne-0.408
oing-0.405
Token shy
Token ID15800
Feature activation+1.23e-02

Pos logit contributions
 chores+0.078
 same+0.064
 plan+0.061
 urge+0.057
 start+0.056
Neg logit contributions
os-0.224
oing-0.223
ark-0.219
o-0.219
aron-0.216
Token princess
Token ID21752
Feature activation+1.62e-01

Pos logit contributions
os+0.037
ark+0.035
oing+0.035
o+0.035
umping+0.034
Neg logit contributions
 chores-0.019
 plan-0.018
 same-0.018
 urge-0.016
 way-0.016
Token.
Token ID13
Feature activation+8.00e-02

Pos logit contributions
os+0.402
oing+0.392
ark+0.388
umping+0.380
o+0.377
Neg logit contributions
 chores-0.318
 plan-0.310
 same-0.301
 way-0.301
 urge-0.290
Token<|endoftext|>
Token ID50256
Feature activation+1.62e-01

Pos logit contributions
oing+0.229
ark+0.228
os+0.220
umping+0.217
ist+0.216
Neg logit contributions
 same-0.119
 chores-0.116
 way-0.110
 plan-0.106
 other-0.105
Token him
Token ID683
Feature activation+7.39e-04

Pos logit contributions
 chores+0.087
 plan+0.085
 urge+0.080
 same+0.078
 way+0.074
Neg logit contributions
os-0.144
o-0.139
ark-0.133
umping-0.130
es-0.128
Token a
Token ID257
Feature activation-5.68e-02

Pos logit contributions
os+0.003
oing+0.003
ark+0.003
umping+0.003
o+0.003
Neg logit contributions
 chores-0.001
 same-0.001
 plan-0.001
 way-0.000
 urge-0.000
Token treat
Token ID2190
Feature activation+9.89e-02

Pos logit contributions
 chores+0.068
 same+0.061
 plan+0.060
 urge+0.056
 way+0.053
Neg logit contributions
os-0.183
oing-0.179
o-0.177
ark-0.175
umping-0.171
Token for
Token ID329
Feature activation-1.04e-01

Pos logit contributions
os+0.265
oing+0.262
ark+0.255
umping+0.251
enn+0.239
Neg logit contributions
 chores-0.172
 same-0.161
 plan-0.160
 start-0.157
 way-0.156
Token being
Token ID852
Feature activation-6.86e-02

Pos logit contributions
 same+0.163
 plan+0.161
 chores+0.157
 urge+0.152
 way+0.141
Neg logit contributions
os-0.281
o-0.273
ark-0.270
es-0.267
oing-0.261
Token such
Token ID884
Feature activation-1.60e-01

Pos logit contributions
 chores+0.088
 plan+0.081
 same+0.081
 urge+0.073
 way+0.073
Neg logit contributions
os-0.222
ark-0.213
oing-0.213
o-0.211
umping-0.208
Token a
Token ID257
Feature activation-6.83e-02

Pos logit contributions
 chores+0.216
 plan+0.201
 urge+0.188
 same+0.187
 start+0.183
Neg logit contributions
o-0.443
ark-0.440
os-0.432
aron-0.425
and-0.422
Token good
Token ID922
Feature activation-3.96e-02

Pos logit contributions
 chores+0.076
 same+0.063
 plan+0.061
 urge+0.059
 start+0.056
Neg logit contributions
ark-0.220
oing-0.219
os-0.219
o-0.218
umping-0.209
Token listener
Token ID24783
Feature activation+9.64e-02

Pos logit contributions
 chores+0.067
 same+0.062
 plan+0.061
 urge+0.057
 way+0.057
Neg logit contributions
os-0.111
ark-0.107
oing-0.107
o-0.106
umping-0.104
Token.
Token ID13
Feature activation+7.42e-02

Pos logit contributions
os+0.240
oing+0.233
ark+0.228
o+0.220
umping+0.219
Neg logit contributions
 chores-0.185
 plan-0.182
 same-0.179
 way-0.178
 urge-0.172
Token Max
Token ID5436
Feature activation+8.05e-02

Pos logit contributions
os+0.229
ark+0.224
oing+0.223
umping+0.218
ne+0.213
Neg logit contributions
 chores-0.102
 same-0.096
 plan-0.093
 way-0.090
 start-0.086
Token c
Token ID269
Feature activation+8.87e-02

Pos logit contributions
 chores+0.076
 same+0.073
 plan+0.072
 urge+0.068
 way+0.066
Neg logit contributions
os-0.103
o-0.097
oing-0.095
ark-0.094
umping-0.093
Tokenudd
Token ID4185
Feature activation+3.92e-02

Pos logit contributions
os+0.157
o+0.145
ark+0.144
ne+0.138
oing+0.134
Neg logit contributions
 chores-0.247
 same-0.227
 work-0.223
 plan-0.222
 start-0.214
Tokenles
Token ID829
Feature activation+9.36e-02

Pos logit contributions
os+0.066
o+0.060
oing+0.060
ark+0.059
umping+0.056
Neg logit contributions
 chores-0.114
 same-0.110
 plan-0.109
 urge-0.105
 way-0.105
Token for
Token ID329
Feature activation-9.85e-02

Pos logit contributions
os+0.239
ark+0.232
oing+0.230
o+0.224
umping+0.222
Neg logit contributions
 chores-0.181
 plan-0.171
 same-0.171
 way-0.160
 start-0.159
Token being
Token ID852
Feature activation-6.93e-02

Pos logit contributions
 chores+0.124
 same+0.123
 plan+0.122
 urge+0.114
 way+0.109
Neg logit contributions
os-0.306
o-0.288
oing-0.287
ark-0.286
and-0.282
Token such
Token ID884
Feature activation-1.59e-01

Pos logit contributions
 chores+0.096
 plan+0.090
 same+0.089
 urge+0.082
 way+0.081
Neg logit contributions
os-0.217
ark-0.207
oing-0.207
o-0.205
umping-0.202
Token a
Token ID257
Feature activation-6.79e-02

Pos logit contributions
 chores+0.204
 plan+0.191
 same+0.180
 urge+0.180
 way+0.177
Neg logit contributions
os-0.451
o-0.448
ark-0.447
aron-0.440
and-0.434
Token clever
Token ID14169
Feature activation-1.86e-02

Pos logit contributions
 chores+0.069
 same+0.058
 plan+0.058
 urge+0.053
 start+0.050
Neg logit contributions
os-0.229
ark-0.224
o-0.224
oing-0.223
umping-0.217
Token girl
Token ID2576
Feature activation+9.04e-02

Pos logit contributions
 chores+0.029
 same+0.026
 plan+0.026
 start+0.024
 urge+0.024
Neg logit contributions
os-0.053
ark-0.051
oing-0.051
o-0.051
umping-0.050
Token.
Token ID13
Feature activation+7.48e-02

Pos logit contributions
os+0.226
ark+0.223
oing+0.222
umping+0.214
round+0.208
Neg logit contributions
 chores-0.170
 plan-0.162
 same-0.160
 way-0.160
 start-0.153
Token
Token ID220
Feature activation+7.82e-02

Pos logit contributions
os+0.206
ark+0.205
oing+0.204
umping+0.197
o+0.192
Neg logit contributions
 chores-0.122
 same-0.118
 plan-0.116
 way-0.113
 start-0.109
Token know
Token ID760
Feature activation-5.33e-02

Pos logit contributions
 chores+0.175
 plan+0.168
 same+0.166
 urge+0.156
 way+0.154
Neg logit contributions
os-0.317
oing-0.301
ark-0.301
o-0.296
umping-0.295
Token why
Token ID1521
Feature activation-3.34e-02

Pos logit contributions
 chores+0.094
 plan+0.093
 same+0.092
 urge+0.087
 way+0.084
Neg logit contributions
os-0.144
ark-0.134
o-0.133
oing-0.132
umping-0.129
Token they
Token ID484
Feature activation-4.11e-02

Pos logit contributions
 same+0.052
 chores+0.051
 plan+0.050
 urge+0.048
 way+0.047
Neg logit contributions
os-0.096
ark-0.089
oing-0.088
o-0.088
umping-0.085
Token would
Token ID561
Feature activation-1.30e-01

Pos logit contributions
 chores+0.076
 plan+0.072
 same+0.072
 urge+0.068
 way+0.068
Neg logit contributions
os-0.109
ark-0.103
oing-0.103
o-0.102
umping-0.100
Token say
Token ID910
Feature activation-3.69e-02

Pos logit contributions
 chores+0.176
 same+0.164
 plan+0.143
 laundry+0.143
 way+0.139
Neg logit contributions
os-0.367
ark-0.361
o-0.355
oing-0.351
ne-0.338
Token such
Token ID884
Feature activation-1.59e-01

Pos logit contributions
 chores+0.051
 plan+0.050
 same+0.049
 urge+0.047
 way+0.045
Neg logit contributions
os-0.109
oing-0.101
ark-0.101
o-0.099
es-0.098
Token things
Token ID1243
Feature activation+7.10e-02

Pos logit contributions
 chores+0.191
 same+0.190
 way+0.187
 plan+0.186
 start+0.172
Neg logit contributions
ark-0.412
es-0.412
aron-0.409
os-0.403
umping-0.403
Token.
Token ID13
Feature activation+7.54e-02

Pos logit contributions
os+0.176
ark+0.164
oing+0.163
o+0.163
umping+0.159
Neg logit contributions
 chores-0.141
 plan-0.136
 same-0.135
 urge-0.128
 way-0.126
Token
Token ID198
Feature activation+8.23e-02

Pos logit contributions
os+0.192
o+0.178
ark+0.178
oing+0.174
and+0.173
Neg logit contributions
 chores-0.149
 plan-0.140
 same-0.131
 urge-0.129
 way-0.127
Token
Token ID198
Feature activation+8.24e-02

Pos logit contributions
os+0.217
oing+0.213
umping+0.209
aron+0.202
ark+0.202
Neg logit contributions
 chores-0.148
 same-0.146
 plan-0.144
 way-0.135
 bad-0.131
TokenThe
Token ID464
Feature activation-5.26e-02

Pos logit contributions
os+0.192
ark+0.182
oing+0.181
o+0.178
umping+0.175
Neg logit contributions
 chores-0.179
 plan-0.172
 same-0.171
 urge-0.163
 way-0.162
Token and
Token ID290
Feature activation-5.19e-02

Pos logit contributions
 chores+0.018
 plan+0.018
 same+0.018
 way+0.017
 urge+0.017
Neg logit contributions
os-0.029
oing-0.028
ark-0.028
umping-0.027
o-0.027
Token thanked
Token ID26280
Feature activation-6.58e-02

Pos logit contributions
 chores+0.117
 same+0.109
 plan+0.107
 temptation+0.101
 urge+0.100
Neg logit contributions
os-0.119
o-0.116
oing-0.110
ark-0.109
es-0.107
Token Max
Token ID5436
Feature activation+7.52e-02

Pos logit contributions
 chores+0.117
 plan+0.112
 same+0.110
 urge+0.105
 way+0.103
Neg logit contributions
os-0.178
ark-0.170
o-0.167
oing-0.166
umping-0.161
Token for
Token ID329
Feature activation-1.03e-01

Pos logit contributions
os+0.225
oing+0.216
ark+0.214
umping+0.213
o+0.207
Neg logit contributions
 chores-0.117
 plan-0.111
 same-0.109
 way-0.102
 urge-0.099
Token being
Token ID852
Feature activation-6.76e-02

Pos logit contributions
 chores+0.175
 same+0.174
 urge+0.165
 plan+0.160
 way+0.155
Neg logit contributions
o-0.252
and-0.249
os-0.246
es-0.246
ark-0.245
Token such
Token ID884
Feature activation-1.59e-01

Pos logit contributions
 chores+0.099
 plan+0.092
 same+0.091
 urge+0.085
 way+0.084
Neg logit contributions
os-0.207
ark-0.197
o-0.197
oing-0.196
umping-0.193
Token a
Token ID257
Feature activation-6.77e-02

Pos logit contributions
 chores+0.212
 plan+0.191
 same+0.188
 way+0.186
 urge+0.181
Neg logit contributions
o-0.448
ark-0.439
os-0.432
and-0.429
aron-0.422
Token good
Token ID922
Feature activation-3.99e-02

Pos logit contributions
 chores+0.057
 same+0.048
 plan+0.043
 urge+0.041
 start+0.039
Neg logit contributions
os-0.238
o-0.235
ark-0.234
oing-0.232
umping-0.225
Token friend
Token ID1545
Feature activation+8.10e-02

Pos logit contributions
 chores+0.065
 same+0.061
 plan+0.061
 urge+0.056
 way+0.056
Neg logit contributions
os-0.114
ark-0.109
oing-0.108
o-0.108
umping-0.106
Token.
Token ID13
Feature activation+7.56e-02

Pos logit contributions
oing+0.225
os+0.221
ark+0.213
umping+0.207
round+0.203
Neg logit contributions
 chores-0.133
 way-0.130
 same-0.127
 bad-0.124
 plan-0.123
Token From
Token ID3574
Feature activation-5.93e-02

Pos logit contributions
os+0.191
oing+0.182
ark+0.181
o+0.177
umping+0.176
Neg logit contributions
 chores-0.149
 same-0.143
 plan-0.142
 way-0.134
 urge-0.134
Token to
Token ID284
Feature activation-1.34e-01

Pos logit contributions
 chores+0.145
 plan+0.138
 same+0.135
 urge+0.132
 way+0.122
Neg logit contributions
os-0.228
o-0.213
ark-0.213
oing-0.211
umping-0.207
Token know
Token ID760
Feature activation-4.58e-02

Pos logit contributions
 chores+0.168
 same+0.153
 plan+0.149
 way+0.137
 urge+0.136
Neg logit contributions
os-0.427
ark-0.412
oing-0.407
o-0.405
umping-0.395
Token where
Token ID810
Feature activation-7.59e-02

Pos logit contributions
 chores+0.080
 plan+0.077
 same+0.077
 urge+0.073
 way+0.071
Neg logit contributions
os-0.125
ark-0.118
oing-0.117
o-0.116
umping-0.113
Token they
Token ID484
Feature activation-3.54e-02

Pos logit contributions
 chores+0.120
 urge+0.113
 same+0.113
 plan+0.113
 temptation+0.102
Neg logit contributions
os-0.218
o-0.198
ark-0.190
ne-0.190
oing-0.190
Token got
Token ID1392
Feature activation-9.15e-02

Pos logit contributions
 chores+0.069
 plan+0.066
 same+0.065
 urge+0.062
 way+0.061
Neg logit contributions
os-0.091
ark-0.086
oing-0.085
o-0.085
umping-0.083
Token such
Token ID884
Feature activation-1.59e-01

Pos logit contributions
 chores+0.167
 plan+0.161
 same+0.157
 urge+0.151
 way+0.147
Neg logit contributions
os-0.244
o-0.228
oing-0.225
ark-0.224
umping-0.219
Token a
Token ID257
Feature activation-6.63e-02

Pos logit contributions
 chores+0.196
 same+0.190
 plan+0.189
 urge+0.183
 start+0.179
Neg logit contributions
os-0.438
oing-0.426
ark-0.424
umping-0.418
o-0.418
Token beautiful
Token ID4950
Feature activation-4.13e-03

Pos logit contributions
 chores+0.074
 same+0.063
 plan+0.063
 urge+0.058
 start+0.055
Neg logit contributions
os-0.216
oing-0.213
o-0.210
ark-0.210
umping-0.205
Token car
Token ID1097
Feature activation+1.27e-01

Pos logit contributions
 chores+0.006
 same+0.005
 plan+0.005
 urge+0.005
 start+0.005
Neg logit contributions
os-0.012
oing-0.012
o-0.011
umping-0.011
ark-0.011
Token cover
Token ID3002
Feature activation+3.51e-02

Pos logit contributions
os+0.291
ark+0.272
oing+0.270
o+0.269
umping+0.262
Neg logit contributions
 chores-0.282
 plan-0.269
 same-0.268
 urge-0.255
 way-0.252
Token.
Token ID13
Feature activation+7.81e-02

Pos logit contributions
os+0.083
oing+0.080
ark+0.079
o+0.077
umping+0.076
Neg logit contributions
 chores-0.075
 same-0.072
 plan-0.071
 way-0.068
 urge-0.067
Token proud
Token ID6613
Feature activation-1.56e-02

Pos logit contributions
 chores+0.177
 plan+0.164
 same+0.163
 urge+0.151
 way+0.149
Neg logit contributions
os-0.359
o-0.349
oing-0.344
ark-0.339
umping-0.337
Token of
Token ID286
Feature activation-9.98e-02

Pos logit contributions
 chores+0.035
 plan+0.034
 same+0.034
 way+0.032
 urge+0.032
Neg logit contributions
os-0.036
ark-0.034
oing-0.034
o-0.033
umping-0.032
Token her
Token ID607
Feature activation-5.79e-02

Pos logit contributions
 plan+0.109
 chores+0.103
 way+0.099
 same+0.099
 urge+0.095
Neg logit contributions
os-0.333
ark-0.321
o-0.319
es-0.314
and-0.310
Token for
Token ID329
Feature activation-9.95e-02

Pos logit contributions
 chores+0.104
 same+0.102
 plan+0.100
 urge+0.093
 way+0.093
Neg logit contributions
os-0.151
oing-0.145
ark-0.143
o-0.142
umping-0.139
Token being
Token ID852
Feature activation-6.63e-02

Pos logit contributions
 same+0.182
 plan+0.180
 urge+0.175
 chores+0.171
 idea+0.170
Neg logit contributions
es-0.220
and-0.217
o-0.216
os-0.216
ark-0.215
Token such
Token ID884
Feature activation-1.59e-01

Pos logit contributions
 chores+0.090
 plan+0.084
 same+0.084
 urge+0.077
 way+0.076
Neg logit contributions
os-0.210
ark-0.201
oing-0.200
o-0.198
umping-0.196
Token a
Token ID257
Feature activation-6.73e-02

Pos logit contributions
 chores+0.204
 plan+0.189
 way+0.188
 same+0.188
 urge+0.181
Neg logit contributions
ark-0.433
o-0.426
aron-0.419
oing-0.417
it-0.416
Token good
Token ID922
Feature activation-3.94e-02

Pos logit contributions
 chores+0.067
 same+0.057
 plan+0.053
 urge+0.051
 start+0.049
Neg logit contributions
os-0.224
oing-0.223
ark-0.222
o-0.221
umping-0.214
Token helper
Token ID31904
Feature activation+1.46e-01

Pos logit contributions
 chores+0.062
 same+0.057
 plan+0.056
 start+0.052
 way+0.051
Neg logit contributions
os-0.112
ark-0.108
oing-0.108
o-0.107
umping-0.106
Token.
Token ID13
Feature activation+7.70e-02

Pos logit contributions
os+0.367
oing+0.356
ark+0.349
umping+0.340
o+0.337
Neg logit contributions
 chores-0.287
 plan-0.278
 same-0.274
 way-0.269
 urge-0.261
Token They
Token ID1119
Feature activation-2.06e-02

Pos logit contributions
os+0.203
ark+0.191
o+0.191
oing+0.190
umping+0.186
Neg logit contributions
 chores-0.146
 plan-0.139
 same-0.136
 urge-0.129
 way-0.128
Token and
Token ID290
Feature activation-4.38e-02

Pos logit contributions
 chores+0.106
 same+0.102
 plan+0.101
 urge+0.095
 way+0.095
Neg logit contributions
os-0.139
oing-0.132
ark-0.131
o-0.129
umping-0.128
Token praised
Token ID15342
Feature activation-2.77e-02

Pos logit contributions
 chores+0.083
 same+0.074
 plan+0.074
 urge+0.070
 temptation+0.068
Neg logit contributions
os-0.115
o-0.106
es-0.103
oing-0.102
ark-0.102
Token her
Token ID607
Feature activation-6.16e-02

Pos logit contributions
 chores+0.030
 plan+0.029
 same+0.027
 urge+0.026
 way+0.026
Neg logit contributions
os-0.090
ark-0.088
and-0.085
oing-0.085
enny-0.085
Token for
Token ID329
Feature activation-1.04e-01

Pos logit contributions
 chores+0.103
 same+0.097
 plan+0.097
 urge+0.091
 way+0.090
Neg logit contributions
os-0.172
ark-0.164
oing-0.163
o-0.161
umping-0.159
Token being
Token ID852
Feature activation-6.79e-02

Pos logit contributions
 chores+0.152
 same+0.151
 plan+0.146
 urge+0.143
 way+0.134
Neg logit contributions
os-0.280
and-0.270
ark-0.268
es-0.268
o-0.259
Token such
Token ID884
Feature activation-1.59e-01

Pos logit contributions
 chores+0.101
 plan+0.094
 same+0.091
 urge+0.086
 way+0.085
Neg logit contributions
os-0.205
ark-0.195
o-0.195
oing-0.194
umping-0.192
Token a
Token ID257
Feature activation-6.83e-02

Pos logit contributions
 chores+0.222
 plan+0.199
 same+0.191
 way+0.189
 urge+0.187
Neg logit contributions
ark-0.423
o-0.417
os-0.417
aron-0.415
and-0.414
Token good
Token ID922
Feature activation-3.94e-02

Pos logit contributions
 chores+0.076
 same+0.064
 plan+0.061
 urge+0.058
 start+0.055
Neg logit contributions
os-0.222
ark-0.219
oing-0.217
o-0.216
umping-0.210
Token student
Token ID3710
Feature activation+1.18e-01

Pos logit contributions
 chores+0.072
 same+0.066
 plan+0.065
 urge+0.061
 start+0.061
Neg logit contributions
os-0.103
ark-0.099
oing-0.098
o-0.097
umping-0.096
Token.
Token ID13
Feature activation+7.77e-02

Pos logit contributions
os+0.291
oing+0.279
ark+0.276
o+0.272
umping+0.269
Neg logit contributions
 chores-0.242
 plan-0.234
 same-0.229
 way-0.225
 urge-0.220
Token
Token ID198
Feature activation+8.28e-02

Pos logit contributions
os+0.199
ark+0.191
oing+0.190
o+0.184
umping+0.183
Neg logit contributions
 chores-0.152
 same-0.147
 plan-0.144
 way-0.138
 urge-0.135
Token proud
Token ID6613
Feature activation-2.01e-02

Pos logit contributions
 chores+0.144
 plan+0.134
 same+0.128
 way+0.123
 urge+0.116
Neg logit contributions
os-0.229
and-0.227
o-0.225
oing-0.224
umping-0.220
Token of
Token ID286
Feature activation-1.10e-01

Pos logit contributions
 chores+0.041
 plan+0.040
 same+0.040
 way+0.038
 urge+0.038
Neg logit contributions
os-0.050
ark-0.047
oing-0.047
o-0.046
umping-0.045
Token herself
Token ID5223
Feature activation-2.25e-02

Pos logit contributions
 plan+0.153
 same+0.153
 chores+0.151
 way+0.142
 urge+0.141
Neg logit contributions
os-0.312
ark-0.307
and-0.300
o-0.297
oing-0.294
Token for
Token ID329
Feature activation-1.02e-01

Pos logit contributions
 chores+0.032
 plan+0.030
 same+0.029
 urge+0.028
 way+0.028
Neg logit contributions
os-0.070
ark-0.067
oing-0.067
o-0.066
umping-0.066
Token being
Token ID852
Feature activation-6.72e-02

Pos logit contributions
 same+0.188
 plan+0.183
 chores+0.183
 urge+0.174
 task+0.167
Neg logit contributions
and-0.224
es-0.222
aron-0.218
o-0.218
ark-0.217
Token such
Token ID884
Feature activation-1.59e-01

Pos logit contributions
 chores+0.095
 plan+0.088
 same+0.087
 urge+0.080
 way+0.080
Neg logit contributions
os-0.209
ark-0.200
oing-0.200
o-0.199
umping-0.196
Token a
Token ID257
Feature activation-6.50e-02

Pos logit contributions
 chores+0.229
 plan+0.209
 way+0.207
 same+0.200
 urge+0.198
Neg logit contributions
ark-0.403
o-0.402
aron-0.401
and-0.399
it-0.395
Token good
Token ID922
Feature activation-4.06e-02

Pos logit contributions
 chores+0.073
 same+0.061
 plan+0.059
 urge+0.055
 start+0.054
Neg logit contributions
os-0.212
ark-0.210
oing-0.210
o-0.210
umping-0.203
Token helper
Token ID31904
Feature activation+1.43e-01

Pos logit contributions
 chores+0.070
 same+0.063
 plan+0.063
 start+0.059
 urge+0.058
Neg logit contributions
os-0.109
ark-0.107
o-0.106
oing-0.106
umping-0.104
Token.
Token ID13
Feature activation+7.99e-02

Pos logit contributions
os+0.368
oing+0.354
ark+0.347
umping+0.340
o+0.334
Neg logit contributions
 chores-0.274
 plan-0.265
 same-0.261
 way-0.258
 urge-0.250
Token<|endoftext|>
Token ID50256
Feature activation+1.52e-01

Pos logit contributions
os+0.198
oing+0.191
ark+0.190
umping+0.185
o+0.180
Neg logit contributions
 chores-0.157
 same-0.153
 plan-0.150
 way-0.145
 urge-0.142
Token all
Token ID477
Feature activation-2.58e-02

Pos logit contributions
 chores+0.034
 same+0.032
 plan+0.031
 urge+0.029
 way+0.029
Neg logit contributions
os-0.058
o-0.055
ark-0.055
oing-0.053
umping-0.052
Token applauded
Token ID38911
Feature activation+2.51e-02

Pos logit contributions
 chores+0.050
 same+0.048
 plan+0.048
 urge+0.045
 way+0.044
Neg logit contributions
os-0.066
ark-0.063
oing-0.062
o-0.062
umping-0.061
Token him
Token ID683
Feature activation-3.74e-03

Pos logit contributions
os+0.068
oing+0.065
ark+0.065
o+0.064
umping+0.063
Neg logit contributions
 chores-0.045
 same-0.043
 plan-0.043
 urge-0.040
 way-0.040
Token for
Token ID329
Feature activation-1.03e-01

Pos logit contributions
 chores+0.007
 plan+0.006
 same+0.006
 way+0.006
 urge+0.006
Neg logit contributions
os-0.010
oing-0.010
ark-0.010
umping-0.010
o-0.010
Token being
Token ID852
Feature activation-6.87e-02

Pos logit contributions
 chores+0.148
 same+0.146
 plan+0.139
 urge+0.134
 way+0.128
Neg logit contributions
os-0.296
ark-0.285
o-0.285
oing-0.279
and-0.275
Token such
Token ID884
Feature activation-1.59e-01

Pos logit contributions
 chores+0.095
 same+0.089
 plan+0.089
 urge+0.081
 way+0.081
Neg logit contributions
os-0.216
ark-0.207
oing-0.206
o-0.203
umping-0.201
Token a
Token ID257
Feature activation-6.79e-02

Pos logit contributions
 chores+0.210
 plan+0.187
 same+0.184
 way+0.179
 urge+0.177
Neg logit contributions
o-0.448
ark-0.447
os-0.444
aron-0.433
and-0.430
Token good
Token ID922
Feature activation-4.14e-02

Pos logit contributions
 chores+0.073
 same+0.062
 plan+0.060
 urge+0.056
 start+0.053
Neg logit contributions
os-0.228
ark-0.224
oing-0.223
o-0.222
umping-0.217
Token pup
Token ID15552
Feature activation+1.30e-01

Pos logit contributions
 chores+0.069
 same+0.064
 plan+0.063
 start+0.059
 urge+0.059
Neg logit contributions
os-0.115
ark-0.112
oing-0.111
o-0.110
umping-0.109
Token.
Token ID13
Feature activation+7.67e-02

Pos logit contributions
os+0.325
oing+0.315
ark+0.310
umping+0.304
o+0.295
Neg logit contributions
 chores-0.254
 plan-0.243
 same-0.242
 way-0.238
 urge-0.231
Token<|endoftext|>
Token ID50256
Feature activation+1.57e-01

Pos logit contributions
oing+0.217
ark+0.215
os+0.215
umping+0.207
round+0.201
Neg logit contributions
 chores-0.116
 same-0.115
 plan-0.112
 way-0.110
 urge-0.104
Token joining
Token ID9679
Feature activation-3.30e-02

Pos logit contributions
 same+0.174
 chores+0.171
 urge+0.157
 plan+0.152
 task+0.151
Neg logit contributions
o-0.236
os-0.235
and-0.229
es-0.226
ark-0.224
Token him
Token ID683
Feature activation+7.53e-03

Pos logit contributions
 chores+0.044
 plan+0.041
 same+0.041
 way+0.038
 urge+0.038
Neg logit contributions
os-0.105
ark-0.099
o-0.099
oing-0.098
umping-0.097
Token and
Token ID290
Feature activation-3.93e-02

Pos logit contributions
os+0.019
oing+0.018
ark+0.018
o+0.017
umping+0.017
Neg logit contributions
 chores-0.015
 plan-0.014
 same-0.014
 way-0.014
 urge-0.014
Token for
Token ID329
Feature activation-9.81e-02

Pos logit contributions
 chores+0.074
 same+0.070
 plan+0.064
 urge+0.064
 temptation+0.063
Neg logit contributions
os-0.106
o-0.102
oing-0.095
ark-0.094
es-0.092
Token being
Token ID852
Feature activation-6.81e-02

Pos logit contributions
 chores+0.151
 same+0.151
 urge+0.141
 plan+0.137
 way+0.128
Neg logit contributions
os-0.263
o-0.262
and-0.255
ark-0.249
es-0.247
Token such
Token ID884
Feature activation-1.59e-01

Pos logit contributions
 chores+0.087
 same+0.083
 plan+0.082
 way+0.074
 urge+0.074
Neg logit contributions
os-0.220
oing-0.212
ark-0.211
o-0.206
umping-0.205
Token a
Token ID257
Feature activation-6.77e-02

Pos logit contributions
 chores+0.208
 plan+0.191
 same+0.184
 urge+0.181
 way+0.180
Neg logit contributions
o-0.452
os-0.447
ark-0.445
aron-0.430
and-0.428
Token great
Token ID1049
Feature activation-5.46e-02

Pos logit contributions
 chores+0.066
 same+0.057
 plan+0.056
 urge+0.051
 way+0.048
Neg logit contributions
os-0.236
ark-0.229
oing-0.228
o-0.228
umping-0.223
Token friend
Token ID1545
Feature activation+8.26e-02

Pos logit contributions
 chores+0.097
 same+0.091
 plan+0.090
 urge+0.084
 way+0.084
Neg logit contributions
os-0.148
ark-0.141
oing-0.141
o-0.140
umping-0.138
Token.
Token ID13
Feature activation+7.79e-02

Pos logit contributions
oing+0.221
os+0.215
ark+0.209
umping+0.205
umbs+0.199
Neg logit contributions
 chores-0.144
 way-0.137
 plan-0.136
 same-0.134
 bad-0.132
Token
Token ID220
Feature activation+8.06e-02

Pos logit contributions
os+0.210
ark+0.207
oing+0.206
umping+0.199
ne+0.194
Neg logit contributions
 chores-0.133
 same-0.130
 plan-0.127
 way-0.124
 urge-0.119
Token and
Token ID290
Feature activation-5.04e-02

Pos logit contributions
 same+0.019
 way+0.019
 plan+0.019
 work+0.018
 chores+0.018
Neg logit contributions
os-0.040
ark-0.039
oing-0.038
umping-0.037
umbs-0.037
Token thanked
Token ID26280
Feature activation-6.57e-02

Pos logit contributions
 chores+0.103
 plan+0.094
 same+0.094
 urge+0.086
 start+0.084
Neg logit contributions
os-0.128
o-0.124
oing-0.123
ark-0.119
umping-0.116
Token Benny
Token ID44275
Feature activation+8.71e-02

Pos logit contributions
 chores+0.139
 plan+0.134
 same+0.131
 urge+0.126
 way+0.124
Neg logit contributions
os-0.152
o-0.145
ark-0.145
oing-0.142
umping-0.137
Token for
Token ID329
Feature activation-9.91e-02

Pos logit contributions
os+0.280
oing+0.266
ark+0.265
umping+0.263
o+0.258
Neg logit contributions
 chores-0.116
 plan-0.110
 same-0.109
 way-0.103
 urge-0.099
Token being
Token ID852
Feature activation-6.53e-02

Pos logit contributions
 chores+0.176
 same+0.168
 plan+0.157
 urge+0.157
 task+0.152
Neg logit contributions
o-0.250
and-0.238
os-0.237
es-0.235
ark-0.233
Token such
Token ID884
Feature activation-1.59e-01

Pos logit contributions
 chores+0.088
 plan+0.082
 same+0.082
 urge+0.075
 way+0.074
Neg logit contributions
os-0.208
ark-0.199
oing-0.199
o-0.197
umping-0.194
Token a
Token ID257
Feature activation-6.68e-02

Pos logit contributions
 chores+0.219
 plan+0.189
 same+0.189
 way+0.186
 urge+0.177
Neg logit contributions
o-0.451
ark-0.436
os-0.430
and-0.426
oing-0.422
Token good
Token ID922
Feature activation-3.89e-02

Pos logit contributions
 chores+0.067
 same+0.057
 plan+0.053
 urge+0.050
 start+0.048
Neg logit contributions
os-0.227
o-0.224
oing-0.224
ark-0.223
umping-0.216
Token friend
Token ID1545
Feature activation+7.88e-02

Pos logit contributions
 chores+0.067
 same+0.062
 plan+0.061
 urge+0.057
 start+0.057
Neg logit contributions
os-0.108
ark-0.103
oing-0.103
o-0.103
umping-0.101
Token.
Token ID13
Feature activation+7.55e-02

Pos logit contributions
oing+0.232
os+0.227
ark+0.216
umping+0.215
umbs+0.211
Neg logit contributions
 way-0.114
 bad-0.113
 chores-0.111
 the-0.107
 other-0.107
Token From
Token ID3574
Feature activation-5.87e-02

Pos logit contributions
os+0.184
ark+0.175
oing+0.174
umping+0.169
o+0.166
Neg logit contributions
 chores-0.153
 same-0.150
 plan-0.147
 way-0.141
 urge-0.139
Token and
Token ID290
Feature activation-5.58e-02

Pos logit contributions
os+0.017
ark+0.016
umping+0.015
oing+0.015
o+0.015
Neg logit contributions
 chores-0.011
 plan-0.011
 same-0.011
 way-0.011
 urge-0.011
Token thanked
Token ID26280
Feature activation-6.80e-02

Pos logit contributions
 chores+0.110
 plan+0.101
 same+0.100
 urge+0.094
 temptation+0.093
Neg logit contributions
os-0.147
o-0.142
oing-0.135
ark-0.132
umping-0.131
Token Max
Token ID5436
Feature activation+7.72e-02

Pos logit contributions
 plan+0.115
 chores+0.114
 same+0.111
 urge+0.108
 way+0.104
Neg logit contributions
os-0.184
o-0.173
ark-0.173
oing-0.170
ne-0.163
Token for
Token ID329
Feature activation-1.02e-01

Pos logit contributions
os+0.256
oing+0.247
ark+0.245
umping+0.244
o+0.237
Neg logit contributions
 chores-0.096
 plan-0.088
 same-0.087
 way-0.080
 start-0.076
Token being
Token ID852
Feature activation-6.62e-02

Pos logit contributions
 chores+0.176
 same+0.171
 plan+0.167
 urge+0.162
 task+0.154
Neg logit contributions
o-0.258
os-0.249
and-0.247
ark-0.245
es-0.238
Token such
Token ID884
Feature activation-1.58e-01

Pos logit contributions
 chores+0.083
 same+0.080
 plan+0.077
 urge+0.071
 way+0.070
Neg logit contributions
os-0.217
oing-0.208
ark-0.207
o-0.202
umping-0.201
Token a
Token ID257
Feature activation-6.73e-02

Pos logit contributions
 chores+0.220
 plan+0.201
 same+0.193
 way+0.193
 urge+0.189
Neg logit contributions
o-0.435
ark-0.427
os-0.415
aron-0.412
and-0.410
Token good
Token ID922
Feature activation-4.04e-02

Pos logit contributions
 chores+0.059
 same+0.051
 plan+0.049
 urge+0.044
 way+0.042
Neg logit contributions
os-0.240
oing-0.234
ark-0.234
o-0.234
umping-0.228
Token friend
Token ID1545
Feature activation+8.16e-02

Pos logit contributions
 chores+0.064
 plan+0.062
 same+0.061
 urge+0.057
 way+0.057
Neg logit contributions
os-0.120
ark-0.113
oing-0.113
o-0.112
umping-0.110
Token.
Token ID13
Feature activation+7.58e-02

Pos logit contributions
oing+0.233
os+0.229
ark+0.219
umping+0.216
umbs+0.212
Neg logit contributions
 chores-0.127
 way-0.124
 bad-0.122
 same-0.119
 her-0.118
Token The
Token ID383
Feature activation-5.22e-02

Pos logit contributions
os+0.196
oing+0.191
ark+0.190
umping+0.187
ne+0.182
Neg logit contributions
 chores-0.136
 same-0.135
 way-0.129
 plan-0.129
 urge-0.122
Token God
Token ID1793
Feature activation+1.05e-01

Pos logit contributions
 chores+0.081
 same+0.077
 plan+0.076
 urge+0.071
 way+0.069
Neg logit contributions
os-0.166
ark-0.161
oing-0.158
o-0.157
aron-0.156
Token for
Token ID329
Feature activation-1.02e-01

Pos logit contributions
os+0.339
ark+0.327
oing+0.325
umping+0.323
o+0.316
Neg logit contributions
 chores-0.128
 plan-0.121
 same-0.119
 way-0.111
 start-0.109
Token letting
Token ID9616
Feature activation-7.20e-02

Pos logit contributions
 same+0.150
 chores+0.148
 plan+0.135
 urge+0.135
 task+0.127
Neg logit contributions
os-0.280
o-0.273
ark-0.265
and-0.263
es-0.263
Token him
Token ID683
Feature activation-1.89e-02

Pos logit contributions
 chores+0.117
 same+0.113
 plan+0.112
 urge+0.106
 way+0.102
Neg logit contributions
os-0.200
ark-0.190
o-0.186
oing-0.185
aron-0.181
Token have
Token ID423
Feature activation-9.05e-02

Pos logit contributions
 chores+0.020
 plan+0.019
 same+0.018
 urge+0.017
 way+0.016
Neg logit contributions
os-0.065
ark-0.063
oing-0.063
o-0.062
umping-0.061
Token such
Token ID884
Feature activation-1.58e-01

Pos logit contributions
 chores+0.107
 plan+0.102
 same+0.100
 urge+0.095
 way+0.085
Neg logit contributions
os-0.291
o-0.280
ark-0.274
umping-0.274
oing-0.272
Token a
Token ID257
Feature activation-6.56e-02

Pos logit contributions
 urge+0.210
 plan+0.209
 same+0.207
 chores+0.203
 start+0.200
Neg logit contributions
aron-0.392
o-0.387
os-0.380
umping-0.377
ark-0.375
Token modest
Token ID12949
Feature activation-1.52e-02

Pos logit contributions
 chores+0.071
 same+0.064
 plan+0.061
 urge+0.057
 way+0.054
Neg logit contributions
os-0.214
oing-0.210
o-0.209
ark-0.207
umping-0.203
Token and
Token ID290
Feature activation-4.75e-02

Pos logit contributions
 chores+0.022
 same+0.020
 plan+0.018
 start+0.018
 urge+0.017
Neg logit contributions
os-0.043
oing-0.043
o-0.042
umping-0.042
ark-0.042
Token happy
Token ID3772
Feature activation-2.17e-02

Pos logit contributions
 chores+0.069
 same+0.064
 plan+0.059
 urge+0.058
 temptation+0.055
Neg logit contributions
os-0.131
o-0.130
oing-0.123
and-0.122
ark-0.119
Token life
Token ID1204
Feature activation+5.21e-02

Pos logit contributions
 chores+0.031
 same+0.028
 plan+0.027
 urge+0.025
 start+0.024
Neg logit contributions
os-0.064
oing-0.063
ark-0.062
o-0.062
umping-0.061
Token they
Token ID484
Feature activation-1.53e-02

Pos logit contributions
 chores+0.054
 same+0.052
 plan+0.052
 urge+0.049
 temptation+0.045
Neg logit contributions
os-0.108
o-0.101
oing-0.100
ark-0.100
umping-0.098
Token could
Token ID714
Feature activation-8.83e-02

Pos logit contributions
 chores+0.029
 plan+0.029
 same+0.028
 urge+0.027
 way+0.026
Neg logit contributions
os-0.039
oing-0.038
ark-0.038
umping-0.037
o-0.037
Token fix
Token ID4259
Feature activation-2.20e-02

Pos logit contributions
 chores+0.109
 same+0.106
 temptation+0.102
 way+0.101
 urge+0.097
Neg logit contributions
os-0.256
o-0.251
ark-0.246
ne-0.233
oing-0.232
Token anything
Token ID1997
Feature activation+2.37e-02

Pos logit contributions
 chores+0.026
 plan+0.024
 same+0.024
 urge+0.023
 way+0.021
Neg logit contributions
os-0.072
ark-0.069
o-0.068
oing-0.068
umping-0.067
Token.
Token ID13
Feature activation+7.49e-02

Pos logit contributions
os+0.058
ark+0.055
oing+0.055
o+0.054
umping+0.053
Neg logit contributions
 chores-0.048
 plan-0.046
 same-0.046
 urge-0.043
 way-0.043
Token<|endoftext|>
Token ID50256
Feature activation+1.53e-01

Pos logit contributions
oing+0.225
ark+0.222
os+0.221
umping+0.219
ist+0.206
Neg logit contributions
 chores-0.105
 same-0.102
 way-0.097
 plan-0.095
 bad-0.092
TokenOnce
Token ID7454
Feature activation+3.55e-02

Pos logit contributions
oing+0.470
ark+0.452
round+0.440
ist+0.437
os+0.436
Neg logit contributions
 same-0.217
 other-0.205
 chores-0.204
 the-0.197
 way-0.194
Token upon
Token ID2402
Feature activation-5.60e-02

Pos logit contributions
os+0.090
o+0.084
ark+0.084
oing+0.084
umping+0.081
Neg logit contributions
 chores-0.071
 plan-0.067
 same-0.067
 urge-0.064
 way-0.062
Token a
Token ID257
Feature activation-4.00e-02

Pos logit contributions
 chores+0.048
 same+0.043
 plan+0.043
 urge+0.036
 way+0.036
Neg logit contributions
os-0.205
ark-0.197
oing-0.197
o-0.194
umping-0.193
Token time
Token ID640
Feature activation+2.13e-02

Pos logit contributions
 chores+0.025
 plan+0.022
 same+0.021
 way+0.017
 urge+0.017
Neg logit contributions
os-0.158
oing-0.151
ark-0.151
o-0.148
umping-0.148
Token,
Token ID11
Feature activation-2.12e-02

Pos logit contributions
os+0.065
oing+0.064
ark+0.063
umping+0.062
aron+0.062
Neg logit contributions
 chores-0.030
 plan-0.028
 same-0.028
 way-0.026
 urge-0.026
Token angry
Token ID7954
Feature activation-9.61e-03

Pos logit contributions
 chores+0.095
 plan+0.095
 same+0.091
 way+0.088
 start+0.084
Neg logit contributions
os-0.205
o-0.197
oing-0.186
umping-0.186
iled-0.177
Token and
Token ID290
Feature activation-5.14e-02

Pos logit contributions
 chores+0.018
 same+0.018
 plan+0.017
 way+0.017
 urge+0.016
Neg logit contributions
os-0.025
oing-0.024
ark-0.024
o-0.023
enn-0.023
Token started
Token ID2067
Feature activation-8.59e-02

Pos logit contributions
 chores+0.104
 plan+0.096
 same+0.092
 way+0.086
 urge+0.085
Neg logit contributions
os-0.126
o-0.122
oing-0.111
ne-0.110
round-0.110
Token shaking
Token ID17275
Feature activation-1.59e-02

Pos logit contributions
 chores+0.170
 same+0.165
 plan+0.161
 urge+0.154
 way+0.149
Neg logit contributions
os-0.214
o-0.205
oing-0.200
umping-0.191
ark-0.191
Token the
Token ID262
Feature activation-8.85e-02

Pos logit contributions
 chores+0.031
 plan+0.029
 same+0.029
 urge+0.027
 way+0.027
Neg logit contributions
os-0.041
ark-0.039
oing-0.039
o-0.039
umping-0.038
Token radio
Token ID5243
Feature activation+1.61e-01

Pos logit contributions
 chores+0.142
 plan+0.133
 same+0.121
 urge+0.118
 way+0.111
Neg logit contributions
os-0.235
oing-0.233
ark-0.229
umping-0.225
eddy-0.224
Token.
Token ID13
Feature activation+7.83e-02

Pos logit contributions
os+0.362
oing+0.339
ark+0.337
o+0.334
umping+0.325
Neg logit contributions
 chores-0.355
 plan-0.339
 same-0.338
 urge-0.320
 way-0.315
Token Then
Token ID3244
Feature activation-8.02e-03

Pos logit contributions
os+0.205
ark+0.194
oing+0.194
o+0.191
umping+0.189
Neg logit contributions
 chores-0.149
 same-0.142
 plan-0.142
 urge-0.133
 way-0.133
Token,
Token ID11
Feature activation-2.33e-02

Pos logit contributions
 chores+0.014
 plan+0.014
 same+0.014
 urge+0.013
 temptation+0.012
Neg logit contributions
os-0.020
o-0.020
ark-0.019
aron-0.019
oing-0.019
Token out
Token ID503
Feature activation-1.51e-02

Pos logit contributions
 chores+0.038
 plan+0.037
 same+0.036
 urge+0.034
 way+0.033
Neg logit contributions
os-0.065
o-0.062
ark-0.062
oing-0.061
aron-0.060
Token of
Token ID286
Feature activation-8.75e-02

Pos logit contributions
 chores+0.036
 same+0.034
 plan+0.034
 urge+0.033
 way+0.033
Neg logit contributions
os-0.033
ark-0.030
oing-0.030
o-0.030
umping-0.029
Token.
Token ID13
Feature activation+7.47e-02

Pos logit contributions
os+0.107
oing+0.106
ark+0.104
umping+0.099
enn+0.098
Neg logit contributions
 chores-0.070
 same-0.069
 way-0.067
 start-0.065
 plan-0.064
Token Grand
Token ID5675
Feature activation+9.44e-02

Pos logit contributions
os+0.189
o+0.178
oing+0.176
ark+0.176
umping+0.173
Neg logit contributions
 chores-0.148
 plan-0.143
 same-0.137
 urge-0.130
 way-0.127
Tokenma
Token ID2611
Feature activation+1.31e-01

Pos logit contributions
os+0.232
oing+0.227
ark+0.223
o+0.214
umping+0.212
Neg logit contributions
 chores-0.194
 plan-0.186
 same-0.185
 way-0.173
 urge-0.170
Token brought
Token ID3181
Feature activation-5.37e-02

Pos logit contributions
ark+0.369
oing+0.365
os+0.364
umping+0.351
and+0.351
Neg logit contributions
 plan-0.204
 chores-0.194
 same-0.190
 way-0.190
 start-0.184
Token some
Token ID617
Feature activation-6.13e-02

Pos logit contributions
 chores+0.055
 plan+0.052
 same+0.046
 urge+0.046
 way+0.043
Neg logit contributions
os-0.188
o-0.179
oing-0.174
ark-0.174
umping-0.173
Token tea
Token ID8887
Feature activation+1.61e-01

Pos logit contributions
 chores+0.090
 plan+0.086
 same+0.085
 urge+0.079
 way+0.078
Neg logit contributions
os-0.184
oing-0.175
ark-0.174
o-0.173
umping-0.171
Token,
Token ID11
Feature activation-2.62e-02

Pos logit contributions
os+0.427
oing+0.424
ark+0.422
umping+0.405
o+0.399
Neg logit contributions
 chores-0.287
 same-0.271
 plan-0.267
 way-0.260
 start-0.255
Token but
Token ID475
Feature activation-1.87e-02

Pos logit contributions
 chores+0.046
 plan+0.044
 same+0.042
 urge+0.040
 way+0.039
Neg logit contributions
os-0.071
o-0.068
oing-0.067
umping-0.066
aron-0.065
Token it
Token ID340
Feature activation+3.26e-02

Pos logit contributions
 chores+0.036
 plan+0.033
 urge+0.032
 same+0.031
 temptation+0.029
Neg logit contributions
os-0.044
o-0.044
es-0.042
enny-0.041
ark-0.041
Token was
Token ID373
Feature activation-1.20e-01

Pos logit contributions
os+0.079
o+0.075
oing+0.071
ark+0.071
umping+0.070
Neg logit contributions
 chores-0.069
 plan-0.064
 same-0.063
 urge-0.060
 temptation-0.059
Token too
Token ID1165
Feature activation-3.03e-02

Pos logit contributions
 plan+0.203
 chores+0.199
 urge+0.191
 same+0.189
 temptation+0.177
Neg logit contributions
o-0.303
os-0.287
aron-0.273
umping-0.265
ark-0.258
Token nice
Token ID3621
Feature activation+2.68e-02

Pos logit contributions
 chores+0.093
 plan+0.091
 urge+0.089
 same+0.084
 way+0.079
Neg logit contributions
o-0.247
os-0.243
ark-0.241
aron-0.236
oing-0.229
Token,
Token ID11
Feature activation-2.37e-02

Pos logit contributions
os+0.071
ark+0.068
oing+0.068
o+0.066
umping+0.066
Neg logit contributions
 chores-0.050
 plan-0.048
 same-0.048
 urge-0.045
 way-0.045
Token but
Token ID475
Feature activation-2.18e-02

Pos logit contributions
 chores+0.039
 plan+0.035
 same+0.035
 urge+0.033
 way+0.032
Neg logit contributions
os-0.065
ark-0.062
o-0.062
aron-0.062
oing-0.061
Token my
Token ID616
Feature activation-6.80e-02

Pos logit contributions
 chores+0.041
 plan+0.037
 urge+0.036
 same+0.036
 way+0.034
Neg logit contributions
os-0.054
o-0.054
ark-0.051
ne-0.049
es-0.048
Token car
Token ID1097
Feature activation+1.43e-01

Pos logit contributions
 chores+0.096
 same+0.090
 plan+0.088
 way+0.084
 urge+0.083
Neg logit contributions
os-0.203
o-0.194
oing-0.194
ark-0.189
umping-0.187
Token machine
Token ID4572
Feature activation+1.57e-01

Pos logit contributions
os+0.400
o+0.378
ark+0.374
oing+0.373
umping+0.363
Neg logit contributions
 chores-0.242
 plan-0.227
 same-0.225
 urge-0.214
 way-0.210
Token is
Token ID318
Feature activation-1.10e-01

Pos logit contributions
os+0.443
ark+0.418
o+0.417
oing+0.416
umping+0.406
Neg logit contributions
 chores-0.262
 plan-0.248
 same-0.246
 urge-0.231
 way-0.228
Token better
Token ID1365
Feature activation-2.61e-02

Pos logit contributions
 chores+0.143
 plan+0.139
 urge+0.138
 same+0.129
 temptation+0.126
Neg logit contributions
o-0.326
os-0.315
aron-0.301
ark-0.291
oing-0.290
Token.
Token ID13
Feature activation+6.61e-02

Pos logit contributions
 chores+0.059
 plan+0.057
 same+0.057
 urge+0.054
 way+0.054
Neg logit contributions
os-0.058
ark-0.055
oing-0.055
o-0.054
umping-0.053
Token It
Token ID632
Feature activation+2.19e-02

Pos logit contributions
os+0.180
ark+0.174
oing+0.172
umping+0.170
o+0.165
Neg logit contributions
 chores-0.113
 same-0.110
 plan-0.104
 urge-0.102
 start-0.101
Token makes
Token ID1838
Feature activation-9.30e-02

Pos logit contributions
os+0.059
o+0.056
oing+0.055
ark+0.054
umping+0.053
Neg logit contributions
 chores-0.040
 plan-0.038
 same-0.037
 urge-0.035
 way-0.035
Token she
Token ID673
Feature activation-1.62e-02

Pos logit contributions
 plan+0.090
 chores+0.088
 urge+0.087
 same+0.083
 idea+0.079
Neg logit contributions
os-0.156
es-0.147
o-0.142
ark-0.140
and-0.139
Token saw
Token ID2497
Feature activation-8.45e-02

Pos logit contributions
 chores+0.024
 plan+0.024
 same+0.023
 way+0.022
 urge+0.022
Neg logit contributions
os-0.047
ark-0.047
oing-0.047
umping-0.045
aron-0.044
Token a
Token ID257
Feature activation-6.09e-02

Pos logit contributions
 chores+0.090
 plan+0.088
 urge+0.078
 same+0.077
 start+0.069
Neg logit contributions
os-0.288
o-0.271
ark-0.270
oing-0.266
umping-0.266
Token dove
Token ID38862
Feature activation+1.31e-01

Pos logit contributions
 chores+0.076
 same+0.068
 plan+0.068
 urge+0.062
 way+0.060
Neg logit contributions
os-0.197
oing-0.191
ark-0.190
o-0.189
umping-0.186
Token.
Token ID13
Feature activation+7.88e-02

Pos logit contributions
os+0.321
ark+0.303
oing+0.303
o+0.299
umping+0.293
Neg logit contributions
 chores-0.270
 plan-0.258
 same-0.257
 urge-0.243
 way-0.242
Token<|endoftext|>
Token ID50256
Feature activation+1.61e-01

Pos logit contributions
os+0.242
ark+0.240
oing+0.239
o+0.229
umping+0.229
Neg logit contributions
 chores-0.108
 same-0.106
 plan-0.097
 way-0.097
 start-0.095
TokenThe
Token ID464
Feature activation-4.11e-02

Pos logit contributions
oing+0.510
ark+0.489
ist+0.482
os+0.474
round+0.471
Neg logit contributions
 same-0.208
 way-0.206
 the-0.193
 other-0.192
 chores-0.184
Token sun
Token ID4252
Feature activation+1.68e-01

Pos logit contributions
 chores+0.044
 plan+0.040
 same+0.039
 urge+0.035
 way+0.034
Neg logit contributions
os-0.141
ark-0.136
oing-0.135
o-0.135
umping-0.133
Token shone
Token ID44193
Feature activation+2.35e-02

Pos logit contributions
os+0.464
ark+0.463
oing+0.447
and+0.443
umping+0.433
Neg logit contributions
 same-0.273
 chores-0.270
 plan-0.268
 way-0.260
 urge-0.253
Token brightly
Token ID35254
Feature activation+3.46e-02

Pos logit contributions
os+0.068
ark+0.066
oing+0.064
enn+0.063
o+0.063
Neg logit contributions
 same-0.039
 chores-0.035
 rain-0.035
 start-0.034
 bad-0.034
Token in
Token ID287
Feature activation-9.73e-02

Pos logit contributions
os+0.103
enny+0.099
ark+0.097
oing+0.095
and+0.094
Neg logit contributions
 same-0.055
 rain-0.051
 start-0.050
 chores-0.050
 way-0.049
Token a
Token ID257
Feature activation-4.91e-02

Pos logit contributions
os+0.142
oing+0.139
ark+0.138
umping+0.136
o+0.134
Neg logit contributions
 chores-0.045
 same-0.044
 plan-0.038
 bad-0.037
 start-0.037
Token frog
Token ID21264
Feature activation+1.50e-01

Pos logit contributions
 plan+0.077
 same+0.076
 chores+0.074
 way+0.072
 start+0.068
Neg logit contributions
os-0.147
ark-0.138
umping-0.138
oing-0.135
o-0.133
Token too
Token ID1165
Feature activation-1.17e-02

Pos logit contributions
os+0.415
ark+0.400
oing+0.397
umping+0.392
o+0.380
Neg logit contributions
 chores-0.255
 plan-0.247
 same-0.243
 start-0.226
 urge-0.223
Token?"
Token ID1701
Feature activation+8.84e-02

Pos logit contributions
 chores+0.021
 same+0.020
 plan+0.020
 urge+0.019
 way+0.019
Neg logit contributions
os-0.032
oing-0.030
ark-0.030
o-0.029
umping-0.029
Token
Token ID198
Feature activation+8.18e-02

Pos logit contributions
os+0.211
ark+0.199
oing+0.198
o+0.196
umping+0.192
Neg logit contributions
 chores-0.190
 plan-0.182
 same-0.181
 urge-0.171
 way-0.171
Token
Token ID198
Feature activation+8.11e-02

Pos logit contributions
os+0.223
oing+0.218
umping+0.217
ark+0.212
round+0.210
Neg logit contributions
 chores-0.138
 same-0.135
 plan-0.133
 bad-0.129
 other-0.124
Token"
Token ID1
Feature activation+5.93e-02

Pos logit contributions
os+0.181
oing+0.166
umping+0.165
ark+0.163
o+0.159
Neg logit contributions
 chores-0.186
 same-0.182
 plan-0.181
 bad-0.171
 start-0.170
TokenSure
Token ID19457
Feature activation-7.42e-03

Pos logit contributions
os+0.148
oing+0.136
umping+0.136
ark+0.136
o+0.135
Neg logit contributions
 chores-0.121
 same-0.120
 plan-0.117
 urge-0.110
 start-0.109
Token,
Token ID11
Feature activation-3.29e-02

Pos logit contributions
 chores+0.013
 same+0.013
 plan+0.012
 urge+0.012
 way+0.011
Neg logit contributions
os-0.020
ark-0.019
o-0.019
oing-0.019
umping-0.018
Token Max
Token ID5436
Feature activation+8.19e-02

Pos logit contributions
 chores+0.055
 plan+0.051
 same+0.051
 urge+0.048
 way+0.047
Neg logit contributions
os-0.090
ark-0.087
o-0.087
oing-0.085
ne-0.084
Token,
Token ID11
Feature activation-3.33e-02

Pos logit contributions
os+0.223
oing+0.212
ark+0.211
umping+0.206
o+0.206
Neg logit contributions
 chores-0.146
 plan-0.141
 same-0.141
 way-0.133
 urge-0.132
Token the
Token ID262
Feature activation-8.78e-02

Pos logit contributions
 chores+0.148
 plan+0.138
 same+0.123
 urge+0.121
 start+0.111
Neg logit contributions
os-0.321
o-0.319
aron-0.313
enn-0.306
umping-0.304
Token hole
Token ID7604
Feature activation+1.37e-01

Pos logit contributions
 chores+0.139
 plan+0.127
 same+0.118
 urge+0.116
 start+0.112
Neg logit contributions
o-0.238
os-0.235
oing-0.229
enn-0.228
ne-0.225
Token.
Token ID13
Feature activation+7.88e-02

Pos logit contributions
os+0.320
oing+0.306
ark+0.306
o+0.296
umping+0.293
Neg logit contributions
 chores-0.289
 same-0.281
 plan-0.279
 way-0.268
 urge-0.265
Token
Token ID220
Feature activation+8.04e-02

Pos logit contributions
os+0.210
ark+0.200
oing+0.199
umping+0.194
o+0.191
Neg logit contributions
 chores-0.146
 same-0.140
 plan-0.139
 way-0.133
 urge-0.131
Token
Token ID198
Feature activation+7.78e-02

Pos logit contributions
oing+0.219
os+0.214
ark+0.213
round+0.209
umping+0.209
Neg logit contributions
 chores-0.134
 same-0.128
 plan-0.123
 way-0.117
 not-0.114
Token
Token ID198
Feature activation+7.70e-02

Pos logit contributions
os+0.222
oing+0.207
umping+0.207
ark+0.199
ist+0.195
Neg logit contributions
 chores-0.124
 plan-0.119
 same-0.118
 bad-0.115
 way-0.113
TokenJust
Token ID5703
Feature activation-5.38e-02

Pos logit contributions
os+0.182
oing+0.162
umping+0.160
ark+0.160
o+0.155
Neg logit contributions
 chores-0.174
 plan-0.165
 same-0.164
 bad-0.157
 urge-0.157
Token then
Token ID788
Feature activation-2.80e-02

Pos logit contributions
 chores+0.107
 plan+0.100
 same+0.094
 way+0.089
 urge+0.088
Neg logit contributions
os-0.129
o-0.128
ark-0.125
ne-0.125
aron-0.124
Token,
Token ID11
Feature activation-2.11e-02

Pos logit contributions
 chores+0.047
 plan+0.045
 same+0.045
 urge+0.041
 way+0.041
Neg logit contributions
os-0.079
ark-0.076
oing-0.076
o-0.075
umping-0.073
Token a
Token ID257
Feature activation-4.93e-02

Pos logit contributions
 chores+0.038
 plan+0.036
 same+0.035
 way+0.033
 urge+0.033
Neg logit contributions
o-0.055
os-0.054
oing-0.053
ark-0.053
enn-0.051
Token kind
Token ID1611
Feature activation+3.01e-02

Pos logit contributions
 chores+0.062
 same+0.054
 plan+0.052
 urge+0.049
 way+0.048
Neg logit contributions
os-0.157
oing-0.155
o-0.153
ark-0.152
umping-0.148
Token a
Token ID257
Feature activation-4.75e-02

Pos logit contributions
os+0.320
ark+0.310
oing+0.309
o+0.304
umping+0.302
Neg logit contributions
 chores-0.089
 plan-0.080
 same-0.080
 way-0.069
 urge-0.068
Token shiny
Token ID22441
Feature activation+2.06e-02

Pos logit contributions
 chores+0.051
 same+0.044
 plan+0.043
 urge+0.041
 start+0.037
Neg logit contributions
os-0.159
oing-0.156
o-0.155
ark-0.155
umping-0.151
Token,
Token ID11
Feature activation-3.87e-02

Pos logit contributions
os+0.056
ark+0.055
oing+0.055
o+0.054
umping+0.053
Neg logit contributions
 chores-0.034
 plan-0.031
 same-0.031
 urge-0.029
 way-0.028
Token white
Token ID2330
Feature activation+5.08e-02

Pos logit contributions
 chores+0.050
 plan+0.046
 same+0.042
 urge+0.041
 way+0.039
Neg logit contributions
oing-0.116
os-0.115
umping-0.113
aron-0.113
o-0.113
Token bone
Token ID9970
Feature activation+1.63e-01

Pos logit contributions
os+0.129
ark+0.121
oing+0.121
o+0.119
umping+0.117
Neg logit contributions
 chores-0.101
 plan-0.097
 same-0.097
 urge-0.091
 way-0.091
Token.
Token ID13
Feature activation+7.41e-02

Pos logit contributions
os+0.396
oing+0.373
ark+0.373
o+0.366
umping+0.362
Neg logit contributions
 chores-0.338
 plan-0.325
 same-0.324
 way-0.307
 urge-0.306
Token It
Token ID632
Feature activation+2.66e-02

Pos logit contributions
os+0.220
ark+0.212
oing+0.211
o+0.207
umping+0.205
Neg logit contributions
 chores-0.113
 same-0.108
 plan-0.107
 way-0.099
 urge-0.098
Token looked
Token ID3114
Feature activation-4.41e-02

Pos logit contributions
os+0.072
oing+0.069
ark+0.069
umping+0.067
o+0.067
Neg logit contributions
 chores-0.048
 plan-0.046
 same-0.046
 way-0.043
 urge-0.043
Token very
Token ID845
Feature activation-4.87e-02

Pos logit contributions
 chores+0.087
 plan+0.081
 same+0.075
 way+0.075
 urge+0.074
Neg logit contributions
os-0.101
o-0.099
oing-0.098
aron-0.098
ark-0.097
Token tasty
Token ID25103
Feature activation+2.10e-02

Pos logit contributions
 chores+0.063
 plan+0.056
 same+0.054
 way+0.051
 urge+0.051
Neg logit contributions
os-0.149
ark-0.145
oing-0.145
o-0.145
aron-0.140
Token and
Token ID290
Feature activation-4.32e-02

Pos logit contributions
os+0.058
oing+0.056
ark+0.054
enn+0.052
o+0.052
Neg logit contributions
 chores-0.036
 same-0.036
 way-0.036
 plan-0.034
 start-0.033
Token walking
Token ID6155
Feature activation-7.72e-03

Pos logit contributions
 chores+0.197
 same+0.196
 plan+0.192
 temptation+0.188
 task+0.172
Neg logit contributions
o-0.298
os-0.298
umping-0.282
oing-0.278
ark-0.270
Token,
Token ID11
Feature activation-2.47e-02

Pos logit contributions
 chores+0.015
 plan+0.014
 same+0.014
 urge+0.013
 way+0.013
Neg logit contributions
os-0.020
o-0.019
ark-0.019
oing-0.019
umping-0.018
Token he
Token ID339
Feature activation+7.12e-03

Pos logit contributions
 chores+0.048
 plan+0.047
 same+0.045
 urge+0.042
 way+0.041
Neg logit contributions
os-0.060
o-0.058
oing-0.054
es-0.054
ark-0.053
Token heard
Token ID2982
Feature activation-5.80e-02

Pos logit contributions
os+0.019
o+0.018
oing+0.017
ark+0.017
umping+0.017
Neg logit contributions
 chores-0.014
 plan-0.013
 same-0.013
 urge-0.012
 temptation-0.012
Token a
Token ID257
Feature activation-5.58e-02

Pos logit contributions
 chores+0.051
 same+0.046
 plan+0.046
 urge+0.039
 way+0.039
Neg logit contributions
os-0.211
ark-0.203
oing-0.203
o-0.201
umping-0.198
Token noise
Token ID7838
Feature activation+8.85e-02

Pos logit contributions
 chores+0.062
 same+0.055
 plan+0.051
 way+0.048
 urge+0.047
Neg logit contributions
os-0.186
oing-0.185
ark-0.181
o-0.179
umping-0.174
Token nearby
Token ID6716
Feature activation-1.85e-02

Pos logit contributions
os+0.249
ark+0.243
oing+0.235
o+0.234
enny+0.234
Neg logit contributions
 chores-0.141
 start-0.138
 urge-0.137
 plan-0.137
 rain-0.135
Token.
Token ID13
Feature activation+8.03e-02

Pos logit contributions
 chores+0.038
 plan+0.037
 same+0.037
 urge+0.035
 start+0.035
Neg logit contributions
os-0.045
ark-0.043
oing-0.043
o-0.042
umping-0.041
Token When
Token ID1649
Feature activation-4.82e-02

Pos logit contributions
os+0.218
ark+0.209
oing+0.207
o+0.202
umping+0.201
Neg logit contributions
 chores-0.141
 same-0.138
 plan-0.137
 urge-0.129
 way-0.129
Token he
Token ID339
Feature activation-1.06e-02

Pos logit contributions
 chores+0.111
 plan+0.101
 same+0.101
 urge+0.095
 idea+0.094
Neg logit contributions
o-0.100
os-0.100
oing-0.092
es-0.090
ark-0.088
Token looked
Token ID3114
Feature activation-6.11e-02

Pos logit contributions
 plan+0.016
 start+0.015
 chores+0.015
 same+0.015
 urge+0.015
Neg logit contributions
os-0.031
ark-0.031
oing-0.030
umping-0.029
aron-0.029
Token who
Token ID508
Feature activation-2.38e-02

Pos logit contributions
os+0.355
oing+0.343
ark+0.342
umping+0.336
o+0.331
Neg logit contributions
 chores-0.273
 plan-0.263
 same-0.262
 way-0.254
 urge-0.247
Token figured
Token ID11638
Feature activation-2.80e-02

Pos logit contributions
 chores+0.045
 plan+0.043
 same+0.043
 urge+0.040
 way+0.040
Neg logit contributions
os-0.063
ark-0.059
oing-0.059
o-0.059
umping-0.057
Token out
Token ID503
Feature activation-4.15e-02

Pos logit contributions
 chores+0.044
 plan+0.042
 same+0.041
 urge+0.040
 way+0.038
Neg logit contributions
os-0.081
ark-0.076
o-0.076
oing-0.075
and-0.073
Token the
Token ID262
Feature activation-8.85e-02

Pos logit contributions
 chores+0.065
 same+0.062
 plan+0.062
 urge+0.059
 way+0.057
Neg logit contributions
os-0.120
ark-0.113
o-0.112
oing-0.112
umping-0.110
Token answer
Token ID3280
Feature activation+3.78e-02

Pos logit contributions
 chores+0.116
 her+0.091
 urge+0.086
 same+0.084
 plan+0.084
Neg logit contributions
ne-0.238
os-0.235
ark-0.234
o-0.234
oing-0.234
Token.
Token ID13
Feature activation+8.10e-02

Pos logit contributions
os+0.088
oing+0.084
ark+0.084
o+0.082
umping+0.081
Neg logit contributions
 chores-0.081
 plan-0.078
 same-0.078
 way-0.074
 urge-0.073
Token And
Token ID843
Feature activation+1.39e-03

Pos logit contributions
os+0.219
ark+0.217
oing+0.216
umping+0.210
o+0.206
Neg logit contributions
 chores-0.133
 same-0.129
 plan-0.128
 way-0.122
 start-0.118
Token they
Token ID484
Feature activation-1.91e-02

Pos logit contributions
os+0.004
ark+0.004
o+0.004
oing+0.004
umping+0.004
Neg logit contributions
 chores-0.002
 same-0.002
 plan-0.002
 urge-0.002
 way-0.002
Token all
Token ID477
Feature activation-2.10e-02

Pos logit contributions
 chores+0.030
 same+0.028
 plan+0.028
 urge+0.026
 way+0.026
Neg logit contributions
os-0.057
ark-0.054
oing-0.053
o-0.053
umping-0.052
Token lived
Token ID5615
Feature activation-3.51e-02

Pos logit contributions
 chores+0.039
 plan+0.038
 same+0.037
 urge+0.035
 way+0.035
Neg logit contributions
os-0.055
ark-0.053
oing-0.053
o-0.052
umping-0.051
Token happily
Token ID18177
Feature activation+6.31e-03

Pos logit contributions
 chores+0.077
 same+0.073
 plan+0.072
 urge+0.070
 way+0.069
Neg logit contributions
os-0.081
ark-0.076
o-0.076
oing-0.076
umping-0.073
Token him
Token ID683
Feature activation-1.16e-02

Pos logit contributions
 chores+0.129
 same+0.121
 plan+0.121
 urge+0.113
 way+0.111
Neg logit contributions
os-0.231
ark-0.219
oing-0.217
o-0.216
umping-0.211
Token find
Token ID1064
Feature activation-6.70e-02

Pos logit contributions
 chores+0.021
 same+0.020
 plan+0.019
 urge+0.018
 way+0.018
Neg logit contributions
os-0.032
ark-0.030
oing-0.030
o-0.030
umping-0.029
Token the
Token ID262
Feature activation-8.61e-02

Pos logit contributions
 chores+0.074
 plan+0.073
 same+0.067
 urge+0.065
 start+0.058
Neg logit contributions
os-0.219
o-0.206
ark-0.204
oing-0.202
umping-0.202
Token lung
Token ID12317
Feature activation+1.52e-01

Pos logit contributions
 chores+0.110
 plan+0.099
 start+0.094
 urge+0.093
 her+0.091
Neg logit contributions
oing-0.225
ne-0.225
os-0.222
eddy-0.221
ark-0.220
Token,
Token ID11
Feature activation-2.94e-02

Pos logit contributions
os+0.370
ark+0.347
oing+0.346
o+0.344
umping+0.335
Neg logit contributions
 chores-0.313
 plan-0.298
 same-0.298
 urge-0.282
 way-0.278
Token because
Token ID780
Feature activation-1.55e-02

Pos logit contributions
 chores+0.061
 same+0.057
 plan+0.057
 urge+0.054
 way+0.053
Neg logit contributions
os-0.071
o-0.066
oing-0.066
ark-0.066
umping-0.064
Token it
Token ID340
Feature activation+3.16e-02

Pos logit contributions
 chores+0.027
 plan+0.026
 same+0.026
 urge+0.025
 start+0.023
Neg logit contributions
os-0.041
o-0.039
ark-0.038
and-0.037
ne-0.036
Token changed
Token ID3421
Feature activation-2.20e-02

Pos logit contributions
os+0.082
o+0.077
ark+0.076
oing+0.076
umping+0.074
Neg logit contributions
 chores-0.061
 same-0.057
 plan-0.057
 urge-0.054
 way-0.053
Token her
Token ID607
Feature activation-5.66e-02

Pos logit contributions
 chores+0.039
 plan+0.037
 same+0.037
 urge+0.035
 way+0.034
Neg logit contributions
os-0.060
ark-0.056
o-0.056
oing-0.056
umping-0.055
Token life
Token ID1204
Feature activation+7.09e-02

Pos logit contributions
 chores+0.075
 same+0.069
 plan+0.069
 urge+0.064
 start+0.063
Neg logit contributions
os-0.171
oing-0.163
o-0.162
ark-0.161
umping-0.158
Token.
Token ID13
Feature activation+7.73e-02

Pos logit contributions
os+0.171
oing+0.164
ark+0.164
o+0.160
umping+0.158
Neg logit contributions
 chores-0.149
 same-0.142
 plan-0.142
 way-0.137
 urge-0.132
Token her
Token ID607
Feature activation-5.14e-02

Pos logit contributions
 chores+0.032
 plan+0.032
 same+0.030
 urge+0.029
 way+0.028
Neg logit contributions
os-0.079
o-0.076
ark-0.076
oing-0.075
umping-0.073
Token very
Token ID845
Feature activation-4.22e-02

Pos logit contributions
 chores+0.080
 plan+0.076
 same+0.074
 urge+0.071
 way+0.069
Neg logit contributions
os-0.145
ark-0.138
oing-0.138
o-0.137
umping-0.132
Token much
Token ID881
Feature activation-2.54e-03

Pos logit contributions
 chores+0.067
 same+0.063
 plan+0.063
 urge+0.058
 way+0.058
Neg logit contributions
os-0.124
oing-0.119
ark-0.118
o-0.117
umping-0.116
Token.
Token ID13
Feature activation+7.04e-02

Pos logit contributions
 chores+0.005
 same+0.005
 plan+0.005
 way+0.005
 bad+0.004
Neg logit contributions
os-0.007
oing-0.006
o-0.006
ark-0.006
umping-0.006
Token And
Token ID843
Feature activation-5.26e-03

Pos logit contributions
os+0.201
oing+0.194
ark+0.192
umping+0.188
o+0.187
Neg logit contributions
 chores-0.114
 same-0.112
 plan-0.106
 urge-0.103
 way-0.101
Token I
Token ID314
Feature activation-2.72e-02

Pos logit contributions
 chores+0.009
 plan+0.009
 same+0.009
 urge+0.008
 way+0.008
Neg logit contributions
os-0.013
ark-0.013
o-0.013
ne-0.012
oing-0.012
Token love
Token ID1842
Feature activation-2.69e-02

Pos logit contributions
 chores+0.041
 plan+0.040
 same+0.039
 urge+0.036
 way+0.036
Neg logit contributions
os-0.083
ark-0.079
oing-0.079
umping-0.078
o-0.077
Token my
Token ID616
Feature activation-7.59e-02

Pos logit contributions
 chores+0.032
 plan+0.032
 same+0.030
 urge+0.030
 way+0.027
Neg logit contributions
os-0.087
o-0.084
ark-0.083
oing-0.082
umping-0.079
Token grandma
Token ID49890
Feature activation+1.27e-01

Pos logit contributions
 chores+0.136
 plan+0.128
 same+0.124
 urge+0.122
 start+0.120
Neg logit contributions
os-0.190
oing-0.182
ark-0.181
o-0.180
umping-0.173
Token too
Token ID1165
Feature activation-7.71e-03

Pos logit contributions
ark+0.349
oing+0.349
os+0.347
umping+0.339
enn+0.333
Neg logit contributions
 chores-0.218
 same-0.214
 plan-0.204
 way-0.201
 bad-0.197
Token.
Token ID13
Feature activation+6.94e-02

Pos logit contributions
 chores+0.015
 same+0.015
 plan+0.014
 way+0.014
 bad+0.014
Neg logit contributions
os-0.020
oing-0.019
ark-0.019
umping-0.019
o-0.018
Token snack
Token ID26906
Feature activation+1.26e-01

Pos logit contributions
os+0.030
oing+0.029
o+0.028
ark+0.028
umping+0.028
Neg logit contributions
 chores-0.018
 same-0.016
 plan-0.016
 urge-0.015
 temptation-0.014
Token for
Token ID329
Feature activation-1.01e-01

Pos logit contributions
os+0.291
ark+0.274
oing+0.274
o+0.270
umping+0.265
Neg logit contributions
 chores-0.276
 plan-0.265
 same-0.264
 urge-0.250
 way-0.250
Token the
Token ID262
Feature activation-8.45e-02

Pos logit contributions
 plan+0.109
 same+0.108
 urge+0.104
 way+0.096
 chores+0.092
Neg logit contributions
os-0.314
es-0.300
and-0.291
ark-0.289
o-0.288
Token journey
Token ID7002
Feature activation+8.25e-02

Pos logit contributions
 chores+0.112
 plan+0.105
 same+0.094
 urge+0.094
 start+0.093
Neg logit contributions
os-0.226
ne-0.221
o-0.221
ark-0.219
oing-0.218
Token.
Token ID13
Feature activation+7.90e-02

Pos logit contributions
os+0.190
oing+0.179
ark+0.179
o+0.176
umping+0.173
Neg logit contributions
 chores-0.181
 plan-0.174
 same-0.174
 way-0.164
 urge-0.164
Token When
Token ID1649
Feature activation-5.19e-02

Pos logit contributions
ark+0.221
os+0.220
oing+0.219
umping+0.210
o+0.202
Neg logit contributions
 same-0.128
 chores-0.127
 plan-0.120
 way-0.120
 urge-0.114
Token she
Token ID673
Feature activation-7.26e-03

Pos logit contributions
 plan+0.099
 urge+0.095
 chores+0.093
 same+0.093
 idea+0.088
Neg logit contributions
os-0.115
o-0.108
es-0.103
ne-0.102
oing-0.099
Token was
Token ID373
Feature activation-1.20e-01

Pos logit contributions
 chores+0.014
 same+0.014
 plan+0.013
 urge+0.013
 temptation+0.012
Neg logit contributions
os-0.019
o-0.018
ark-0.017
oing-0.017
umping-0.017
Token done
Token ID1760
Feature activation-6.26e-02

Pos logit contributions
 chores+0.195
 same+0.188
 plan+0.184
 temptation+0.183
 urge+0.173
Neg logit contributions
os-0.332
o-0.322
umping-0.313
aron-0.307
ark-0.305
Token,
Token ID11
Feature activation-3.14e-02

Pos logit contributions
 same+0.123
 chores+0.121
 plan+0.121
 urge+0.115
 way+0.113
Neg logit contributions
os-0.149
oing-0.140
ark-0.140
o-0.140
umping-0.138
Token she
Token ID673
Feature activation+4.92e-03

Pos logit contributions
 chores+0.050
 plan+0.049
 same+0.046
 urge+0.043
 temptation+0.042
Neg logit contributions
os-0.083
es-0.076
o-0.076
ne-0.074
oing-0.074
Token three
Token ID1115
Feature activation+2.36e-02

Pos logit contributions
 chores+0.195
 plan+0.177
 same+0.172
 urge+0.172
 way+0.167
Neg logit contributions
os-0.343
o-0.321
ark-0.316
oing-0.314
ne-0.311
Token,
Token ID11
Feature activation-3.09e-02

Pos logit contributions
os+0.065
ark+0.062
oing+0.062
o+0.062
umping+0.060
Neg logit contributions
 chores-0.042
 plan-0.040
 same-0.039
 way-0.037
 urge-0.037
Token and
Token ID290
Feature activation-3.62e-02

Pos logit contributions
 chores+0.049
 plan+0.046
 same+0.044
 urge+0.042
 way+0.042
Neg logit contributions
os-0.090
o-0.086
oing-0.085
ark-0.084
umping-0.084
Token they
Token ID484
Feature activation-1.08e-02

Pos logit contributions
 chores+0.056
 plan+0.053
 same+0.052
 urge+0.049
 way+0.048
Neg logit contributions
os-0.108
o-0.102
ark-0.102
oing-0.102
umping-0.100
Token all
Token ID477
Feature activation-1.68e-02

Pos logit contributions
 chores+0.018
 plan+0.018
 same+0.017
 start+0.016
 urge+0.016
Neg logit contributions
os-0.030
ark-0.029
oing-0.029
umping-0.029
o-0.028
Token jumped
Token ID11687
Feature activation-2.21e-02

Pos logit contributions
 chores+0.030
 plan+0.030
 same+0.030
 way+0.028
 urge+0.028
Neg logit contributions
ark-0.043
os-0.043
oing-0.043
umping-0.041
o-0.041
Token into
Token ID656
Feature activation-1.10e-01

Pos logit contributions
 chores+0.039
 same+0.037
 plan+0.037
 way+0.034
 urge+0.034
Neg logit contributions
os-0.061
ark-0.058
oing-0.058
o-0.057
umping-0.056
Token the
Token ID262
Feature activation-8.42e-02

Pos logit contributions
 chores+0.102
 plan+0.091
 same+0.090
 urge+0.085
 way+0.078
Neg logit contributions
os-0.385
oing-0.364
o-0.362
ark-0.362
umping-0.361
Token air
Token ID1633
Feature activation+8.75e-02

Pos logit contributions
 chores+0.122
 plan+0.116
 same+0.108
 urge+0.104
 way+0.099
Neg logit contributions
os-0.250
oing-0.239
o-0.237
umping-0.235
ark-0.232
Token,
Token ID11
Feature activation-3.57e-02

Pos logit contributions
os+0.209
ark+0.201
oing+0.197
o+0.194
umping+0.191
Neg logit contributions
 chores-0.183
 same-0.178
 plan-0.177
 way-0.170
 urge-0.167
Token with
Token ID351
Feature activation-9.34e-02

Pos logit contributions
 chores+0.066
 plan+0.062
 same+0.058
 urge+0.055
 way+0.054
Neg logit contributions
os-0.095
oing-0.091
o-0.090
umping-0.087
aron-0.087
Token to
Token ID284
Feature activation-1.16e-01

Pos logit contributions
 chores+0.072
 plan+0.070
 same+0.068
 urge+0.066
 way+0.063
Neg logit contributions
os-0.096
o-0.093
ark-0.091
oing-0.090
umping-0.088
Token bed
Token ID3996
Feature activation+1.13e-01

Pos logit contributions
 chores+0.163
 same+0.160
 temptation+0.151
 urge+0.142
 plan+0.141
Neg logit contributions
os-0.307
o-0.296
and-0.283
enn-0.274
enny-0.268
Token that
Token ID326
Feature activation-6.21e-02

Pos logit contributions
os+0.274
oing+0.262
ark+0.262
o+0.257
umping+0.253
Neg logit contributions
 chores-0.232
 plan-0.222
 same-0.220
 urge-0.209
 way-0.205
Token night
Token ID1755
Feature activation+4.97e-02

Pos logit contributions
 chores+0.091
 plan+0.085
 same+0.076
 start+0.075
 temptation+0.074
Neg logit contributions
os-0.179
o-0.170
and-0.160
umping-0.160
ark-0.158
Token and
Token ID290
Feature activation-4.06e-02

Pos logit contributions
os+0.134
ark+0.128
oing+0.127
o+0.126
umping+0.124
Neg logit contributions
 chores-0.090
 same-0.086
 plan-0.086
 way-0.081
 urge-0.080
Token fell
Token ID3214
Feature activation-1.40e-03

Pos logit contributions
 chores+0.082
 plan+0.075
 temptation+0.071
 urge+0.071
 same+0.070
Neg logit contributions
os-0.100
o-0.095
es-0.090
enny-0.088
and-0.087
Token asleep
Token ID16039
Feature activation+6.20e-02

Pos logit contributions
 chores+0.003
 plan+0.003
 same+0.003
 urge+0.003
 way+0.003
Neg logit contributions
os-0.003
o-0.003
ark-0.003
oing-0.003
umping-0.003
Token right
Token ID826
Feature activation-7.55e-02

Pos logit contributions
os+0.149
oing+0.141
ark+0.141
o+0.139
umping+0.136
Neg logit contributions
 chores-0.129
 plan-0.123
 same-0.122
 urge-0.115
 way-0.114
Token away
Token ID1497
Feature activation-4.84e-02

Pos logit contributions
 chores+0.162
 plan+0.155
 same+0.144
 urge+0.144
 start+0.136
Neg logit contributions
os-0.177
o-0.168
ark-0.168
umping-0.163
aron-0.161
Token.
Token ID13
Feature activation+7.91e-02

Pos logit contributions
 chores+0.110
 plan+0.105
 same+0.105
 urge+0.100
 way+0.099
Neg logit contributions
os-0.108
ark-0.102
oing-0.102
o-0.100
umping-0.098
Token She
Token ID1375
Feature activation+1.22e-03

Pos logit contributions
os+0.203
ark+0.195
oing+0.193
umping+0.187
o+0.186
Neg logit contributions
 chores-0.152
 same-0.150
 plan-0.145
 way-0.141
 urge-0.139
Token my
Token ID616
Feature activation-7.96e-02

Pos logit contributions
 plan+0.057
 chores+0.056
 same+0.053
 urge+0.053
 way+0.049
Neg logit contributions
os-0.143
o-0.138
ark-0.138
oing-0.137
umping-0.133
Token drawings
Token ID23388
Feature activation+1.31e-01

Pos logit contributions
 chores+0.153
 plan+0.138
 same+0.137
 start+0.132
 urge+0.130
Neg logit contributions
oing-0.189
os-0.187
o-0.182
ark-0.170
oc-0.168
Token.
Token ID13
Feature activation+7.06e-02

Pos logit contributions
os+0.312
oing+0.300
ark+0.298
o+0.291
umping+0.289
Neg logit contributions
 chores-0.282
 same-0.268
 plan-0.268
 way-0.254
 start-0.251
Token They
Token ID1119
Feature activation-2.12e-02

Pos logit contributions
os+0.189
ark+0.182
oing+0.180
umping+0.179
o+0.173
Neg logit contributions
 chores-0.127
 same-0.120
 plan-0.116
 urge-0.114
 start-0.110
Token are
Token ID389
Feature activation-8.61e-02

Pos logit contributions
 chores+0.036
 same+0.034
 plan+0.033
 way+0.031
 urge+0.031
Neg logit contributions
os-0.059
o-0.055
oing-0.054
ark-0.053
ne-0.052
Token not
Token ID407
Feature activation-1.13e-01

Pos logit contributions
 plan+0.107
 chores+0.107
 urge+0.102
 same+0.097
 start+0.090
Neg logit contributions
o-0.256
os-0.244
aron-0.240
oing-0.229
ark-0.226
Token food
Token ID2057
Feature activation+9.44e-02

Pos logit contributions
 chores+0.145
 plan+0.131
 same+0.126
 urge+0.122
 start+0.116
Neg logit contributions
os-0.349
o-0.348
oing-0.334
ark-0.333
aron-0.331
Token.
Token ID13
Feature activation+6.62e-02

Pos logit contributions
os+0.230
oing+0.218
ark+0.217
o+0.214
umping+0.211
Neg logit contributions
 chores-0.197
 plan-0.188
 same-0.188
 way-0.178
 urge-0.177
Token They
Token ID1119
Feature activation-2.28e-02

Pos logit contributions
os+0.187
ark+0.183
umping+0.180
oing+0.180
o+0.172
Neg logit contributions
 chores-0.106
 same-0.100
 plan-0.093
 urge-0.092
 start-0.090
Token are
Token ID389
Feature activation-8.66e-02

Pos logit contributions
 chores+0.038
 same+0.036
 plan+0.036
 way+0.034
 urge+0.033
Neg logit contributions
os-0.063
o-0.059
oing-0.058
ark-0.057
ne-0.056
Token art
Token ID1242
Feature activation+9.46e-02

Pos logit contributions
 chores+0.111
 plan+0.109
 urge+0.103
 same+0.100
 start+0.093
Neg logit contributions
o-0.257
os-0.253
aron-0.241
ark-0.235
oing-0.234
Token
Token ID220
Feature activation+7.22e-02

Pos logit contributions
os+0.240
o+0.228
oing+0.226
ark+0.224
umping+0.220
Neg logit contributions
 chores-0.169
 plan-0.160
 same-0.159
 urge-0.148
 way-0.145
Token
Token ID198
Feature activation+7.79e-02

Pos logit contributions
oing+0.185
os+0.184
ark+0.181
umping+0.178
ne+0.175
Neg logit contributions
 chores-0.132
 same-0.127
 plan-0.125
 way-0.118
 start-0.112
Token
Token ID198
Feature activation+7.81e-02

Pos logit contributions
os+0.227
umping+0.219
oing+0.216
ark+0.213
ne+0.210
Neg logit contributions
 chores-0.115
 plan-0.115
 same-0.114
 way-0.109
 other-0.106
TokenTim
Token ID14967
Feature activation+1.08e-01

Pos logit contributions
os+0.191
oing+0.177
ark+0.175
o+0.173
umping+0.172
Neg logit contributions
 chores-0.167
 plan-0.161
 same-0.159
 urge-0.150
 way-0.150
Tokenmy
Token ID1820
Feature activation+6.56e-02

Pos logit contributions
oing+0.234
os+0.233
ark+0.228
and+0.214
uts+0.210
Neg logit contributions
 plan-0.246
 same-0.245
 way-0.238
 chores-0.236
 urge-0.229
Token's
Token ID338
Feature activation-8.05e-02

Pos logit contributions
os+0.186
ark+0.178
oing+0.178
umping+0.172
o+0.170
Neg logit contributions
 plan-0.108
 chores-0.105
 same-0.103
 way-0.099
 urge-0.099
Token dad
Token ID9955
Feature activation+1.40e-01

Pos logit contributions
 chores+0.095
 same+0.092
 plan+0.085
 temptation+0.077
 way+0.076
Neg logit contributions
os-0.242
o-0.236
oing-0.235
umping-0.232
ark-0.231
Token smiled
Token ID13541
Feature activation-1.09e-02

Pos logit contributions
os+0.377
ark+0.370
oing+0.367
umping+0.355
and+0.351
Neg logit contributions
 plan-0.244
 chores-0.240
 same-0.234
 urge-0.226
 start-0.226
Token and
Token ID290
Feature activation-5.11e-02

Pos logit contributions
 chores+0.022
 same+0.022
 plan+0.022
 urge+0.021
 start+0.020
Neg logit contributions
os-0.027
ark-0.025
oing-0.025
o-0.025
umping-0.024
Token said
Token ID531
Feature activation-2.99e-02

Pos logit contributions
 chores+0.102
 same+0.096
 plan+0.095
 urge+0.090
 way+0.089
Neg logit contributions
os-0.130
o-0.123
oing-0.122
ark-0.122
umping-0.118
Token,
Token ID11
Feature activation-3.04e-02

Pos logit contributions
 chores+0.050
 plan+0.048
 same+0.047
 urge+0.044
 way+0.044
Neg logit contributions
os-0.085
ark-0.081
oing-0.080
o-0.080
umping-0.078
Token a
Token ID257
Feature activation-5.64e-02

Pos logit contributions
 chores+0.060
 plan+0.057
 same+0.054
 urge+0.050
 way+0.046
Neg logit contributions
os-0.208
o-0.199
ark-0.197
oing-0.197
umping-0.195
Token walk
Token ID2513
Feature activation+3.32e-02

Pos logit contributions
 chores+0.069
 same+0.063
 plan+0.063
 urge+0.057
 way+0.056
Neg logit contributions
os-0.184
oing-0.177
ark-0.176
o-0.176
umping-0.172
Token,
Token ID11
Feature activation-3.38e-02

Pos logit contributions
os+0.086
oing+0.083
ark+0.082
umping+0.080
o+0.079
Neg logit contributions
 chores-0.063
 same-0.062
 plan-0.060
 way-0.058
 urge-0.057
Token when
Token ID618
Feature activation-5.10e-02

Pos logit contributions
 chores+0.074
 plan+0.068
 same+0.063
 urge+0.060
 idea+0.060
Neg logit contributions
os-0.078
o-0.077
umping-0.073
oing-0.073
and-0.072
Token he
Token ID339
Feature activation-1.50e-02

Pos logit contributions
 chores+0.090
 plan+0.088
 same+0.085
 urge+0.081
 idea+0.078
Neg logit contributions
os-0.129
o-0.125
oing-0.112
es-0.112
umping-0.108
Token heard
Token ID2982
Feature activation-6.91e-02

Pos logit contributions
 chores+0.022
 plan+0.022
 same+0.022
 way+0.021
 urge+0.020
Neg logit contributions
os-0.044
ark-0.043
oing-0.043
o-0.041
umping-0.041
Token a
Token ID257
Feature activation-5.97e-02

Pos logit contributions
 chores+0.075
 plan+0.070
 same+0.067
 urge+0.063
 way+0.060
Neg logit contributions
os-0.235
o-0.223
ark-0.222
oing-0.222
umping-0.219
Token noise
Token ID7838
Feature activation+8.26e-02

Pos logit contributions
 chores+0.063
 same+0.055
 plan+0.053
 way+0.048
 urge+0.048
Neg logit contributions
os-0.204
oing-0.201
ark-0.198
o-0.196
umping-0.192
Token.
Token ID13
Feature activation+7.96e-02

Pos logit contributions
os+0.222
ark+0.220
oing+0.213
o+0.211
enny+0.207
Neg logit contributions
 chores-0.141
 start-0.140
 plan-0.138
 urge-0.137
 rain-0.136
Token He
Token ID679
Feature activation-3.22e-03

Pos logit contributions
os+0.209
ark+0.199
oing+0.198
o+0.195
umping+0.192
Neg logit contributions
 chores-0.149
 same-0.144
 plan-0.143
 urge-0.135
 way-0.134
Token looked
Token ID3114
Feature activation-4.55e-02

Pos logit contributions
 plan+0.005
 same+0.004
 start+0.004
 way+0.004
 chores+0.004
Neg logit contributions
os-0.010
oing-0.009
ark-0.009
umping-0.009
and-0.009
Token.
Token ID13
Feature activation+6.95e-02

Pos logit contributions
os+0.125
oing+0.118
o+0.116
ark+0.116
enn+0.115
Neg logit contributions
 chores-0.087
 same-0.086
 plan-0.082
 start-0.079
 bad-0.078
Token
Token ID198
Feature activation+7.03e-02

Pos logit contributions
os+0.187
oing+0.182
ark+0.182
umping+0.178
enny+0.172
Neg logit contributions
 chores-0.122
 same-0.121
 plan-0.120
 way-0.115
 urge-0.113
Token
Token ID198
Feature activation+7.07e-02

Pos logit contributions
os+0.185
oing+0.184
umping+0.181
ark+0.176
aron+0.175
Neg logit contributions
 plan-0.121
 chores-0.121
 same-0.116
 way-0.112
 other-0.107
TokenBut
Token ID1537
Feature activation-1.86e-03

Pos logit contributions
os+0.184
oing+0.173
ark+0.173
umping+0.169
o+0.169
Neg logit contributions
 chores-0.136
 plan-0.130
 same-0.129
 urge-0.121
 way-0.121
Token Mom
Token ID11254
Feature activation+1.11e-01

Pos logit contributions
 chores+0.003
 plan+0.003
 same+0.003
 urge+0.003
 temptation+0.003
Neg logit contributions
o-0.005
os-0.005
ark-0.004
oing-0.004
round-0.004
Token does
Token ID857
Feature activation-1.02e-01

Pos logit contributions
os+0.303
o+0.286
ark+0.284
oing+0.283
umping+0.279
Neg logit contributions
 chores-0.198
 plan-0.187
 same-0.186
 urge-0.175
 way-0.172
Token not
Token ID407
Feature activation-9.47e-02

Pos logit contributions
 same+0.082
 chores+0.079
 plan+0.074
 urge+0.068
 way+0.068
Neg logit contributions
oing-0.346
ark-0.342
os-0.341
o-0.332
ne-0.327
Token like
Token ID588
Feature activation-9.72e-02

Pos logit contributions
 chores+0.120
 plan+0.112
 same+0.111
 urge+0.101
 way+0.100
Neg logit contributions
os-0.309
ark-0.296
oing-0.295
o-0.292
umping-0.289
Token it
Token ID340
Feature activation+3.23e-02

Pos logit contributions
 plan+0.144
 chores+0.142
 urge+0.141
 same+0.134
 idea+0.123
Neg logit contributions
os-0.277
and-0.254
o-0.253
umping-0.250
oing-0.244
Token when
Token ID618
Feature activation-5.61e-02

Pos logit contributions
os+0.071
ark+0.066
o+0.066
oing+0.065
umping+0.064
Neg logit contributions
 chores-0.074
 plan-0.071
 same-0.071
 urge-0.068
 way-0.067
Token they
Token ID484
Feature activation-2.87e-02

Pos logit contributions
 plan+0.086
 chores+0.086
 same+0.085
 urge+0.083
 way+0.074
Neg logit contributions
os-0.160
o-0.152
ark-0.140
umping-0.140
oing-0.139
Token and
Token ID290
Feature activation-4.94e-02

Pos logit contributions
 chores+0.018
 plan+0.017
 same+0.017
 urge+0.016
 way+0.016
Neg logit contributions
os-0.020
ark-0.018
oing-0.018
o-0.018
umping-0.018
Token he
Token ID339
Feature activation-1.74e-03

Pos logit contributions
 chores+0.064
 plan+0.060
 same+0.060
 urge+0.058
 temptation+0.052
Neg logit contributions
os-0.146
o-0.143
ark-0.133
and-0.131
oing-0.130
Token liked
Token ID8288
Feature activation-1.00e-01

Pos logit contributions
 chores+0.003
 plan+0.003
 same+0.003
 urge+0.003
 way+0.003
Neg logit contributions
os-0.005
ark-0.004
oing-0.004
o-0.004
umping-0.004
Token the
Token ID262
Feature activation-8.74e-02

Pos logit contributions
 chores+0.178
 plan+0.173
 same+0.171
 urge+0.166
 start+0.156
Neg logit contributions
os-0.268
o-0.252
ark-0.249
oing-0.244
umping-0.241
Token way
Token ID835
Feature activation-1.15e-02

Pos logit contributions
 chores+0.094
 plan+0.081
 start+0.074
 urge+0.070
 same+0.070
Neg logit contributions
os-0.273
o-0.261
umping-0.258
ark-0.257
and-0.256
Token the
Token ID262
Feature activation-8.62e-02

Pos logit contributions
 chores+0.020
 plan+0.019
 same+0.019
 urge+0.018
 way+0.018
Neg logit contributions
os-0.032
ark-0.030
oing-0.030
o-0.030
umping-0.029
Token people
Token ID661
Feature activation+1.20e-01

Pos logit contributions
 chores+0.140
 plan+0.129
 same+0.121
 start+0.118
 urge+0.116
Neg logit contributions
os-0.232
ark-0.220
o-0.219
oing-0.216
and-0.216
Token smiled
Token ID13541
Feature activation-1.88e-02

Pos logit contributions
os+0.310
ark+0.300
oing+0.298
o+0.291
umping+0.290
Neg logit contributions
 chores-0.228
 plan-0.218
 same-0.216
 urge-0.205
 way-0.204
Token at
Token ID379
Feature activation-1.03e-01

Pos logit contributions
 chores+0.040
 plan+0.039
 same+0.039
 urge+0.037
 way+0.036
Neg logit contributions
os-0.044
ark-0.042
oing-0.041
o-0.041
umping-0.040
Token him
Token ID683
Feature activation-4.35e-03

Pos logit contributions
 chores+0.195
 plan+0.186
 same+0.185
 urge+0.174
 way+0.173
Neg logit contributions
os-0.268
ark-0.254
oing-0.253
o-0.251
umping-0.246
Token.
Token ID13
Feature activation+7.59e-02

Pos logit contributions
 chores+0.009
 plan+0.009
 same+0.009
 way+0.008
 urge+0.008
Neg logit contributions
os-0.010
oing-0.010
ark-0.010
o-0.010
umping-0.010