TOP ACTIVATIONS
MAX = 19.426

The end . <|endoftext|> D ennis was feeling very lonely .
was a little fish named Nem o . Nem o loved
. The great Zen o hugs Ben and says
was a little fish named Nem o . Nem o liked
was a little fish named Nem o . Nem o loved
! â <|endoftext|> Pa olo was very excited . He
was a little fish named Nem o . Nem o loved
friends . One day , Nem o 's friend D ory
was a little fish named Nem o . Nem o loved
One day , the prince decided to go for a
sea . One day , Nem o saw a shiny er
One day , Nem o got caught on the
They were happy . <|endoftext|> Le o was a rare lion
there was a fish named Nem o . Nem o loved
friends . One day , Nem o saw an incredible sight
friends . One day , Nem o and his friends went
ringing sound . Le o looked around and saw
was a little fish named Nem o . Nem o lived
. <|endoftext|> One day , Milo and his Mom my were
ocean . One day , Nem o saw a plastic bag

MIN ACTIVATIONS
MIN = -14.835

, even when it differed from his own . From then
, it kept on flying err atically with inexplicable speed .
, they wouldn 't have gotten broken . Tim my realized
the green bubble just kept getting bigger and bigger .
, but his mustache kept getting in the way and preventing
kept helping out and kept getting rewarded , until one day
money and didn 't delay getting them ". Bob
so they wouldn 't have gotten in trouble . From then
, so he could avoid getting another band age . <|endoftext|>
, but it still kept getting in the way . Suddenly
the wet p uddle without getting wet . Max
sing and the branch kept getting bigger and stronger . Pretty
crib to prevent falling and getting hurt ." Tim looked at
or use it to sit with your friends ."
, and they kept on getting better and better .
't be fair to take from the generous tree . Now
't . It just kept getting deeper and deeper . Eventually
sharing toys was better than getting angry or hating anyone .
the magical force just kept getting stronger !
! We can start by getting a more comfortable seat !"

INTERVAL 10.861 - 19.426
CONTAINS 5.881%

gave Joe a poem . Joe was so happy to get
jungle ." The monkey was very excited and wanted
was a duck named D ucky . D ucky loved to
She looked at her t eddy bear . He was soft
was a little girl named Lily . One day , she

INTERVAL 2.295 - 10.861
CONTAINS 30.329%

and a smile y face . She was happy with her
humble and listen to me ." L ily nodded
to play with . Tim my was so excited to balance
Once upon a time , there was a girl named Lily
I 'm sorry , Ben . The tree is not mean

INTERVAL -6.270 - 2.295
CONTAINS 47.076%

successful . But then something happened that made her
bunny was so happy and thanked Fox y for being so
up with Jerry . So , he watched the space ship
and lay down . L ily played with her
, she wanted to wear a princess dress , but she

INTERVAL -14.835 - -6.270
CONTAINS 16.714%

a man . The man has a staff . A staff
a special uniform that he loved to wear . One day
and smart . We are very proud of you ," dad
, M addy tried to find something special and colorful every
help me . You can put some cheese on the burgers
TokenThe
Token ID464
Feature activation+7.25e-01

Pos logit contributions
ist+0.071
aron+0.070
ppy+0.068
enny+0.068
ne+0.067
Neg logit contributions
 chores-0.063
 going-0.061
 same-0.057
 temptation-0.055
 top-0.054
Token end
Token ID886
Feature activation+2.16e+00

Pos logit contributions
ne+0.021
ist+0.020
aron+0.020
pillar+0.020
jo+0.019
Neg logit contributions
 chores-0.014
 going-0.014
 temptation-0.013
 same-0.013
wered-0.012
Token.
Token ID13
Feature activation+4.97e+00

Pos logit contributions
enny+0.076
ppy+0.074
aron+0.073
ouse+0.073
pp+0.073
Neg logit contributions
 going-0.039
 chores-0.038
 good-0.034
 rain-0.034
 work-0.033
Token<|endoftext|>
Token ID50256
Feature activation+9.46e+00

Pos logit contributions
ist+0.172
ppy+0.165
aron+0.164
ne+0.161
enny+0.161
Neg logit contributions
 going-0.093
 chores-0.093
 same-0.085
 temptation-0.080
 top-0.079
TokenD
Token ID35
Feature activation+8.51e+00

Pos logit contributions
ist+0.358
aron+0.338
ppy+0.337
owed+0.329
enny+0.321
Neg logit contributions
 going-0.159
 same-0.147
 chores-0.142
 top-0.138
 rain-0.137
Tokenennis
Token ID10679
Feature activation+1.94e+01

Pos logit contributions
aron+0.183
ist+0.166
ppy+0.151
oc+0.149
ne+0.148
Neg logit contributions
 chores-0.276
 same-0.271
 going-0.268
 top-0.252
 work-0.252
Token was
Token ID373
Feature activation-6.55e+00

Pos logit contributions
enny+0.583
aron+0.567
ist+0.566
oing+0.565
itty+0.557
Neg logit contributions
 going-0.433
 same-0.410
 chores-0.398
 work-0.392
 her-0.391
Token feeling
Token ID4203
Feature activation-3.89e+00

Pos logit contributions
 chores+0.113
 temptation+0.111
 same+0.090
wered+0.089
 hurricane+0.088
Neg logit contributions
o-0.233
aron-0.221
ist-0.215
umping-0.214
itty-0.213
Token very
Token ID845
Feature activation-5.07e+00

Pos logit contributions
 chores+0.063
 temptation+0.059
 going+0.058
 avail+0.054
 same+0.053
Neg logit contributions
o-0.128
ppy-0.127
aron-0.127
enny-0.125
ist-0.124
Token lonely
Token ID21757
Feature activation+1.17e+00

Pos logit contributions
 chores+0.100
wered+0.088
 avail+0.087
 top+0.086
 same+0.085
Neg logit contributions
enny-0.145
o-0.143
ouse-0.141
owed-0.141
ry-0.138
Token.
Token ID13
Feature activation+5.01e+00

Pos logit contributions
enny+0.035
aron+0.034
ne+0.034
o+0.034
os+0.034
Neg logit contributions
 going-0.027
 chores-0.026
 same-0.025
 temptation-0.023
 rain-0.023
Token was
Token ID373
Feature activation-5.60e+00

Pos logit contributions
aron+0.119
ist+0.118
enny+0.116
ne+0.115
ppy+0.113
Neg logit contributions
 going-0.113
 chores-0.113
 same-0.108
 temptation-0.103
 rain-0.102
Token a
Token ID257
Feature activation+3.69e-01

Pos logit contributions
 chores+0.078
 temptation+0.073
 going+0.064
 same+0.061
 hurricane+0.057
Neg logit contributions
o-0.212
aron-0.208
enny-0.205
ne-0.203
ppy-0.202
Token little
Token ID1310
Feature activation+4.37e+00

Pos logit contributions
aron+0.015
ist+0.014
enny+0.014
ne+0.014
os+0.014
Neg logit contributions
 chores-0.005
 going-0.005
 same-0.004
 temptation-0.004
 rain-0.004
Token fish
Token ID5916
Feature activation+1.54e+01

Pos logit contributions
aron+0.147
ist+0.146
enny+0.146
owed+0.142
os+0.142
Neg logit contributions
 same-0.081
 going-0.080
 chores-0.079
 rain-0.078
 good-0.075
Token named
Token ID3706
Feature activation+1.96e+00

Pos logit contributions
aron+0.461
enny+0.458
ist+0.435
ppy+0.416
umping+0.416
Neg logit contributions
 going-0.368
 same-0.334
 chores-0.330
 her-0.323
 big-0.322
Token Nem
Token ID22547
Feature activation+1.94e+01

Pos logit contributions
aron+0.062
enny+0.062
ist+0.061
o+0.061
ne+0.061
Neg logit contributions
 chores-0.040
 going-0.039
 temptation-0.037
 same-0.037
 rain-0.034
Tokeno
Token ID78
Feature activation+1.42e+01

Pos logit contributions
ist+0.328
aron+0.310
ol+0.291
enny+0.263
pillar+0.259
Neg logit contributions
 going-0.691
 big-0.618
 their-0.618
 the-0.606
 not-0.599
Token.
Token ID13
Feature activation+6.10e+00

Pos logit contributions
enny+0.498
aron+0.493
ist+0.474
umping+0.467
ne+0.458
Neg logit contributions
 going-0.280
 chores-0.247
 same-0.241
 her-0.232
 top-0.226
Token Nem
Token ID22547
Feature activation+1.89e+01

Pos logit contributions
o+0.173
aron+0.171
enny+0.168
ne+0.160
ppy+0.160
Neg logit contributions
 chores-0.151
 temptation-0.137
 going-0.136
 same-0.130
 rain-0.124
Tokeno
Token ID78
Feature activation+1.39e+01

Pos logit contributions
ist+0.359
ol+0.321
aron+0.313
enny+0.311
ne+0.286
Neg logit contributions
 going-0.592
 big-0.580
 same-0.549
 not-0.548
 their-0.546
Token loved
Token ID6151
Feature activation-6.78e+00

Pos logit contributions
enny+0.490
aron+0.475
ist+0.473
umping+0.451
itty+0.444
Neg logit contributions
 going-0.277
 same-0.257
 big-0.242
 good-0.238
 chores-0.238
Token.
Token ID13
Feature activation+3.66e+00

Pos logit contributions
aron+0.026
enny+0.026
ist+0.026
ne+0.026
o+0.025
Neg logit contributions
 going-0.021
 chores-0.021
 temptation-0.019
 same-0.019
 rain-0.018
Token
Token ID198
Feature activation+1.81e+00

Pos logit contributions
ist+0.122
aron+0.119
enny+0.118
ne+0.118
ppy+0.117
Neg logit contributions
 going-0.070
 chores-0.069
 same-0.065
 temptation-0.063
 rain-0.060
Token
Token ID198
Feature activation+2.30e+00

Pos logit contributions
ist+0.065
aron+0.062
ppy+0.061
ne+0.060
enny+0.060
Neg logit contributions
 chores-0.031
 going-0.030
 same-0.029
 good-0.028
 temptation-0.028
TokenThe
Token ID464
Feature activation+3.87e-01

Pos logit contributions
ist+0.071
jo+0.069
owed+0.068
os+0.067
aron+0.066
Neg logit contributions
 chores-0.048
 good-0.047
 same-0.047
 temptation-0.046
 going-0.045
Token great
Token ID1049
Feature activation+5.92e-01

Pos logit contributions
ne+0.011
aron+0.010
enny+0.010
ist+0.010
o+0.010
Neg logit contributions
 chores-0.009
 going-0.009
 temptation-0.009
 same-0.008
 rain-0.008
Token Zen
Token ID14760
Feature activation+1.93e+01

Pos logit contributions
ne+0.017
enny+0.017
aron+0.017
ist+0.016
o+0.016
Neg logit contributions
 chores-0.013
 going-0.013
 temptation-0.012
 same-0.012
wered-0.011
Tokeno
Token ID78
Feature activation+1.24e+01

Pos logit contributions
umping+0.309
aron+0.306
enny+0.284
ud+0.279
ist+0.269
Neg logit contributions
 going-0.656
 help-0.626
 go-0.626
 the-0.623
 her-0.620
Token hugs
Token ID40657
Feature activation-1.56e+00

Pos logit contributions
enny+0.415
jo+0.398
aron+0.390
itty+0.385
ist+0.383
Neg logit contributions
 going-0.239
 same-0.233
 work-0.224
 chores-0.223
 rain-0.223
Token Ben
Token ID3932
Feature activation+1.31e+01

Pos logit contributions
 going+0.017
 chores+0.017
 same+0.016
 temptation+0.015
 rain+0.014
Neg logit contributions
ist-0.064
aron-0.064
enny-0.063
ne-0.062
jo-0.062
Token and
Token ID290
Feature activation-3.96e+00

Pos logit contributions
enny+0.443
ist+0.437
itty+0.422
ne+0.419
jo+0.417
Neg logit contributions
 going-0.243
 chores-0.242
 her-0.241
 good-0.232
 same-0.226
Token says
Token ID1139
Feature activation-6.73e-01

Pos logit contributions
 chores+0.100
 going+0.094
 temptation+0.092
 same+0.090
wered+0.086
Neg logit contributions
aron-0.104
ne-0.102
o-0.101
enny-0.099
ist-0.099
Token was
Token ID373
Feature activation-5.60e+00

Pos logit contributions
aron+0.115
ist+0.113
enny+0.110
ne+0.110
ppy+0.108
Neg logit contributions
 going-0.101
 chores-0.100
 same-0.097
 rain-0.091
 temptation-0.091
Token a
Token ID257
Feature activation+3.49e-01

Pos logit contributions
 chores+0.066
 temptation+0.060
 going+0.057
 same+0.053
 work+0.043
Neg logit contributions
o-0.224
aron-0.223
enny-0.222
ne-0.220
ppy-0.217
Token little
Token ID1310
Feature activation+4.55e+00

Pos logit contributions
ist+0.014
aron+0.014
enny+0.014
os+0.014
owed+0.014
Neg logit contributions
 going-0.004
 chores-0.004
 same-0.004
 rain-0.003
 good-0.003
Token fish
Token ID5916
Feature activation+1.54e+01

Pos logit contributions
aron+0.160
ist+0.158
enny+0.158
owed+0.157
os+0.155
Neg logit contributions
 same-0.080
 rain-0.078
 going-0.075
 good-0.075
 chores-0.073
Token named
Token ID3706
Feature activation+1.92e+00

Pos logit contributions
aron+0.474
enny+0.469
ist+0.447
umping+0.429
ppy+0.425
Neg logit contributions
 going-0.350
 same-0.317
 big-0.314
 her-0.314
 chores-0.311
Token Nem
Token ID22547
Feature activation+1.93e+01

Pos logit contributions
aron+0.061
enny+0.060
o+0.060
ist+0.060
ne+0.059
Neg logit contributions
 chores-0.040
 going-0.039
 temptation-0.037
 same-0.036
 rain-0.034
Tokeno
Token ID78
Feature activation+1.44e+01

Pos logit contributions
ist+0.310
aron+0.294
ol+0.285
co+0.250
pillar+0.247
Neg logit contributions
 going-0.697
 their-0.633
 big-0.631
 the-0.620
 her-0.618
Token.
Token ID13
Feature activation+6.17e+00

Pos logit contributions
enny+0.518
aron+0.511
ist+0.490
umping+0.489
ne+0.477
Neg logit contributions
 going-0.259
 chores-0.227
 same-0.223
 her-0.221
 the-0.219
Token Nem
Token ID22547
Feature activation+1.89e+01

Pos logit contributions
aron+0.174
o+0.173
enny+0.169
ne+0.162
ppy+0.162
Neg logit contributions
 chores-0.154
 temptation-0.140
 going-0.138
 same-0.134
 rain-0.127
Tokeno
Token ID78
Feature activation+1.40e+01

Pos logit contributions
ist+0.354
ol+0.327
aron+0.314
enny+0.311
co+0.290
Neg logit contributions
 going-0.584
 big-0.577
 not-0.549
 his-0.547
 same-0.547
Token liked
Token ID8288
Feature activation-7.19e+00

Pos logit contributions
enny+0.503
aron+0.486
ist+0.485
umping+0.464
itty+0.455
Neg logit contributions
 going-0.265
 same-0.247
 big-0.240
 good-0.233
 the-0.227
Token was
Token ID373
Feature activation-5.71e+00

Pos logit contributions
aron+0.116
ist+0.115
enny+0.113
ne+0.112
ppy+0.110
Neg logit contributions
 going-0.108
 chores-0.108
 same-0.104
 temptation-0.098
 rain-0.097
Token a
Token ID257
Feature activation+3.23e-01

Pos logit contributions
 chores+0.071
 temptation+0.065
 going+0.060
 same+0.056
 hurricane+0.047
Neg logit contributions
aron-0.224
o-0.224
enny-0.222
ne-0.220
ppy-0.217
Token little
Token ID1310
Feature activation+4.47e+00

Pos logit contributions
aron+0.013
ist+0.013
enny+0.013
ne+0.013
os+0.012
Neg logit contributions
 chores-0.004
 going-0.004
 same-0.004
 temptation-0.003
 rain-0.003
Token fish
Token ID5916
Feature activation+1.54e+01

Pos logit contributions
aron+0.154
ist+0.152
enny+0.152
owed+0.150
os+0.149
Neg logit contributions
 same-0.081
 rain-0.078
 going-0.077
 chores-0.076
 good-0.075
Token named
Token ID3706
Feature activation+1.83e+00

Pos logit contributions
aron+0.466
enny+0.466
ist+0.441
umping+0.425
ppy+0.422
Neg logit contributions
 going-0.355
 same-0.324
 chores-0.319
 big-0.318
 her-0.315
Token Nem
Token ID22547
Feature activation+1.92e+01

Pos logit contributions
aron+0.057
o+0.057
enny+0.057
ist+0.056
ne+0.056
Neg logit contributions
 chores-0.039
 going-0.037
 temptation-0.035
 same-0.035
 rain-0.033
Tokeno
Token ID78
Feature activation+1.44e+01

Pos logit contributions
ist+0.309
aron+0.289
ol+0.281
enny+0.255
pillar+0.252
Neg logit contributions
 going-0.702
 their-0.641
 big-0.635
 the-0.627
 his-0.618
Token.
Token ID13
Feature activation+6.05e+00

Pos logit contributions
enny+0.522
aron+0.510
ist+0.493
umping+0.489
pillar+0.477
Neg logit contributions
 going-0.263
 chores-0.227
 same-0.227
 the-0.222
 her-0.220
Token Nem
Token ID22547
Feature activation+1.88e+01

Pos logit contributions
o+0.170
aron+0.168
enny+0.164
ne+0.157
ppy+0.157
Neg logit contributions
 chores-0.152
 temptation-0.138
 going-0.135
 same-0.131
 work-0.125
Tokeno
Token ID78
Feature activation+1.41e+01

Pos logit contributions
ist+0.352
ol+0.319
enny+0.312
aron+0.305
pillar+0.281
Neg logit contributions
 going-0.592
 big-0.583
 their-0.555
 same-0.554
 not-0.553
Token loved
Token ID6151
Feature activation-6.92e+00

Pos logit contributions
enny+0.509
ist+0.489
aron+0.486
umping+0.467
itty+0.459
Neg logit contributions
 going-0.264
 same-0.246
 big-0.238
 good-0.231
 the-0.225
Token!
Token ID0
Feature activation+3.35e+00

Pos logit contributions
 going+0.051
 chores+0.050
 same+0.048
 temptation+0.046
 rain+0.044
Neg logit contributions
aron-0.060
ist-0.059
enny-0.059
ne-0.057
jo-0.057
Tokenâ
Token ID22940
Feature activation+5.49e+00

Pos logit contributions
ist+0.110
aron+0.106
ppy+0.103
enny+0.103
ne+0.102
Neg logit contributions
 going-0.065
 chores-0.064
 same-0.061
 temptation-0.060
 rain-0.056
Token
Token ID26391
Feature activation+9.78e+00

Pos logit contributions
enny+0.165
os+0.160
as+0.159
es+0.157
ppy+0.156
Neg logit contributions
 temptation-0.121
 same-0.120
 going-0.120
 chores-0.118
 rain-0.107
Token<|endoftext|>
Token ID50256
Feature activation+8.88e+00

Pos logit contributions
ist+0.329
ppy+0.307
aron+0.304
enny+0.303
ne+0.295
Neg logit contributions
 going-0.214
 chores-0.195
 same-0.171
 good-0.171
 top-0.171
TokenPa
Token ID28875
Feature activation+1.02e+01

Pos logit contributions
ist+0.323
aron+0.308
ppy+0.301
owed+0.298
enny+0.295
Neg logit contributions
 going-0.161
 same-0.145
 chores-0.144
 top-0.138
 rain-0.133
Tokenolo
Token ID14057
Feature activation+1.91e+01

Pos logit contributions
aron+0.219
ist+0.217
jo+0.202
os+0.200
enny+0.197
Neg logit contributions
 going-0.322
 chores-0.317
 same-0.316
 her-0.293
 temptation-0.293
Token was
Token ID373
Feature activation-7.27e+00

Pos logit contributions
aron+0.571
enny+0.562
ist+0.557
itty+0.542
ne+0.535
Neg logit contributions
 going-0.437
 chores-0.406
 same-0.400
 work-0.390
 rain-0.387
Token very
Token ID845
Feature activation-4.81e+00

Pos logit contributions
 chores+0.133
 temptation+0.129
 same+0.110
 rain+0.106
 work+0.104
Neg logit contributions
o-0.253
aron-0.243
ist-0.235
umping-0.232
itty-0.230
Token excited
Token ID6568
Feature activation-3.22e-01

Pos logit contributions
 chores+0.105
 temptation+0.097
 avail+0.092
 same+0.090
wered+0.089
Neg logit contributions
o-0.145
enny-0.133
os-0.132
as-0.128
ouse-0.126
Token.
Token ID13
Feature activation+4.98e+00

Pos logit contributions
 chores+0.008
 going+0.008
 same+0.008
 temptation+0.008
 rain+0.007
Neg logit contributions
aron-0.008
ist-0.008
enny-0.008
ne-0.008
o-0.008
Token He
Token ID679
Feature activation+5.51e+00

Pos logit contributions
aron+0.148
enny+0.146
ist+0.143
ne+0.142
o+0.142
Neg logit contributions
 chores-0.114
 going-0.111
 same-0.105
 temptation-0.103
 rain-0.099
Token was
Token ID373
Feature activation-5.68e+00

Pos logit contributions
aron+0.116
ist+0.115
enny+0.113
ne+0.112
ppy+0.110
Neg logit contributions
 chores-0.112
 going-0.112
 same-0.107
 temptation-0.103
 rain-0.100
Token a
Token ID257
Feature activation+2.97e-01

Pos logit contributions
 chores+0.078
 temptation+0.075
 going+0.065
 same+0.063
 hurricane+0.057
Neg logit contributions
o-0.215
aron-0.211
enny-0.210
ne-0.207
umping-0.205
Token little
Token ID1310
Feature activation+4.28e+00

Pos logit contributions
aron+0.012
ist+0.012
enny+0.012
ne+0.011
os+0.011
Neg logit contributions
 chores-0.004
 going-0.004
 same-0.003
 temptation-0.003
 rain-0.003
Token fish
Token ID5916
Feature activation+1.51e+01

Pos logit contributions
aron+0.145
ist+0.144
enny+0.141
owed+0.139
jo+0.138
Neg logit contributions
 same-0.079
 going-0.078
 chores-0.077
 rain-0.076
 good-0.074
Token named
Token ID3706
Feature activation+1.69e+00

Pos logit contributions
aron+0.445
enny+0.438
ist+0.422
ppy+0.403
umping+0.401
Neg logit contributions
 going-0.368
 chores-0.335
 same-0.334
 big-0.322
 her-0.321
Token Nem
Token ID22547
Feature activation+1.89e+01

Pos logit contributions
aron+0.054
enny+0.053
ist+0.053
o+0.053
ne+0.053
Neg logit contributions
 chores-0.035
 going-0.033
 temptation-0.032
 same-0.032
 rain-0.029
Tokeno
Token ID78
Feature activation+1.41e+01

Pos logit contributions
ist+0.302
aron+0.287
ol+0.266
enny+0.239
pillar+0.237
Neg logit contributions
 going-0.706
 big-0.636
 their-0.631
 same-0.620
 her-0.618
Token.
Token ID13
Feature activation+6.07e+00

Pos logit contributions
enny+0.489
aron+0.485
ist+0.466
umping+0.459
ne+0.449
Neg logit contributions
 going-0.278
 chores-0.248
 same-0.238
 her-0.228
 top-0.225
Token Nem
Token ID22547
Feature activation+1.88e+01

Pos logit contributions
o+0.170
aron+0.169
enny+0.164
ne+0.157
ppy+0.156
Neg logit contributions
 chores-0.151
 temptation-0.138
 going-0.136
 same-0.131
 rain-0.125
Tokeno
Token ID78
Feature activation+1.40e+01

Pos logit contributions
ist+0.361
ol+0.319
aron+0.314
enny+0.308
ne+0.289
Neg logit contributions
 going-0.580
 big-0.575
 same-0.544
 their-0.540
 good-0.539
Token loved
Token ID6151
Feature activation-6.88e+00

Pos logit contributions
enny+0.496
ist+0.480
aron+0.479
umping+0.458
itty+0.450
Neg logit contributions
 going-0.273
 same-0.253
 big-0.241
 good-0.235
 chores-0.232
Token friends
Token ID2460
Feature activation+7.67e+00

Pos logit contributions
 chores+0.012
 going+0.011
 temptation+0.010
 same+0.010
 rain+0.009
Neg logit contributions
o-0.025
aron-0.025
enny-0.025
ist-0.025
ne-0.025
Token.
Token ID13
Feature activation+5.30e+00

Pos logit contributions
aron+0.231
enny+0.227
ist+0.225
ne+0.221
jo+0.218
Neg logit contributions
 going-0.178
 chores-0.176
 same-0.163
 temptation-0.154
 rain-0.152
Token One
Token ID1881
Feature activation+9.32e-01

Pos logit contributions
aron+0.179
o+0.177
enny+0.176
ne+0.173
ist+0.172
Neg logit contributions
 chores-0.098
 going-0.095
 temptation-0.090
 same-0.087
 rain-0.080
Token day
Token ID1110
Feature activation+4.52e+00

Pos logit contributions
aron+0.041
ist+0.040
enny+0.040
ne+0.040
ppy+0.039
Neg logit contributions
 chores-0.008
 going-0.008
 same-0.007
 temptation-0.006
 rain-0.006
Token,
Token ID11
Feature activation+3.60e-01

Pos logit contributions
aron+0.128
enny+0.128
ne+0.126
o+0.126
ist+0.124
Neg logit contributions
 chores-0.106
 going-0.101
 same-0.099
 temptation-0.098
 rain-0.091
Token Nem
Token ID22547
Feature activation+1.89e+01

Pos logit contributions
o+0.009
enny+0.008
ne+0.008
y+0.008
aron+0.008
Neg logit contributions
 chores-0.010
 temptation-0.009
 trouble-0.008
 going-0.008
 same-0.008
Tokeno
Token ID78
Feature activation+1.38e+01

Pos logit contributions
ist+0.335
aron+0.320
owed+0.247
ol+0.247
co+0.243
Neg logit contributions
 going-0.675
 big-0.634
 same-0.632
 go-0.611
 not-0.596
Token's
Token ID338
Feature activation-1.34e+00

Pos logit contributions
aron+0.448
ist+0.438
enny+0.421
itty+0.406
ne+0.403
Neg logit contributions
 going-0.303
 same-0.285
 rain-0.276
 chores-0.271
 go-0.260
Token friend
Token ID1545
Feature activation+9.96e+00

Pos logit contributions
 chores+0.029
 going+0.029
 temptation+0.028
 same+0.026
 rain+0.025
Neg logit contributions
o-0.040
enny-0.040
aron-0.039
ne-0.039
ist-0.038
Token D
Token ID360
Feature activation+1.14e+01

Pos logit contributions
aron+0.358
ist+0.357
ne+0.336
jo+0.330
enny+0.329
Neg logit contributions
 going-0.175
 chores-0.160
 same-0.158
 rain-0.151
 her-0.149
Tokenory
Token ID652
Feature activation+1.51e+01

Pos logit contributions
aron+0.266
ist+0.254
jo+0.235
es+0.233
ne+0.229
Neg logit contributions
 same-0.336
 going-0.332
 chores-0.329
 temptation-0.317
 rain-0.312
Token was
Token ID373
Feature activation-5.91e+00

Pos logit contributions
aron+0.113
ist+0.112
enny+0.109
ne+0.108
ppy+0.107
Neg logit contributions
 going-0.099
 chores-0.098
 same-0.095
 rain-0.089
 temptation-0.089
Token a
Token ID257
Feature activation+1.52e-01

Pos logit contributions
 chores+0.070
 temptation+0.063
 going+0.061
 same+0.056
 work+0.045
Neg logit contributions
aron-0.236
o-0.236
enny-0.234
ne-0.232
ppy-0.229
Token little
Token ID1310
Feature activation+4.07e+00

Pos logit contributions
aron+0.006
ist+0.006
enny+0.006
os+0.006
ne+0.006
Neg logit contributions
 going-0.002
 chores-0.002
 same-0.002
 rain-0.002
 temptation-0.002
Token fish
Token ID5916
Feature activation+1.51e+01

Pos logit contributions
aron+0.138
ist+0.137
enny+0.136
owed+0.134
os+0.133
Neg logit contributions
 same-0.075
 going-0.073
 rain-0.072
 chores-0.072
 good-0.070
Token named
Token ID3706
Feature activation+1.62e+00

Pos logit contributions
aron+0.456
enny+0.451
ist+0.430
ppy+0.413
umping+0.411
Neg logit contributions
 going-0.355
 same-0.322
 chores-0.316
 big-0.314
 her-0.310
Token Nem
Token ID22547
Feature activation+1.89e+01

Pos logit contributions
aron+0.051
o+0.050
enny+0.050
ist+0.050
ne+0.050
Neg logit contributions
 chores-0.034
 going-0.033
 temptation-0.031
 same-0.031
 rain-0.029
Tokeno
Token ID78
Feature activation+1.41e+01

Pos logit contributions
ist+0.320
aron+0.300
ol+0.287
enny+0.258
pillar+0.256
Neg logit contributions
 going-0.681
 their-0.617
 big-0.615
 the-0.606
 not-0.598
Token.
Token ID13
Feature activation+5.63e+00

Pos logit contributions
enny+0.508
aron+0.502
ist+0.482
umping+0.478
pillar+0.467
Neg logit contributions
 going-0.259
 same-0.222
 chores-0.221
 the-0.218
 big-0.215
Token Nem
Token ID22547
Feature activation+1.87e+01

Pos logit contributions
o+0.161
aron+0.160
enny+0.157
ne+0.151
ppy+0.150
Neg logit contributions
 chores-0.137
 temptation-0.125
 going-0.124
 same-0.120
 work-0.113
Tokeno
Token ID78
Feature activation+1.40e+01

Pos logit contributions
ist+0.357
ol+0.321
enny+0.313
aron+0.309
ne+0.286
Neg logit contributions
 going-0.579
 big-0.571
 not-0.544
 their-0.543
 good-0.543
Token loved
Token ID6151
Feature activation-7.02e+00

Pos logit contributions
enny+0.513
ist+0.494
aron+0.490
umping+0.472
itty+0.463
Neg logit contributions
 going-0.251
 same-0.235
 big-0.231
 good-0.222
 the-0.218
Token
Token ID198
Feature activation+3.38e+00

Pos logit contributions
ist+0.099
aron+0.097
ppy+0.095
enny+0.092
ne+0.091
Neg logit contributions
 chores-0.052
 going-0.051
 same-0.049
 top-0.045
 temptation-0.045
TokenOne
Token ID3198
Feature activation+1.85e+00

Pos logit contributions
aron+0.119
ist+0.118
enny+0.117
ne+0.114
ppy+0.114
Neg logit contributions
 chores-0.059
 going-0.057
 same-0.054
 temptation-0.052
 rain-0.050
Token day
Token ID1110
Feature activation+4.78e+00

Pos logit contributions
aron+0.086
ist+0.085
enny+0.084
ne+0.082
ppy+0.082
Neg logit contributions
 chores-0.013
 going-0.013
 same-0.011
 rain-0.009
 temptation-0.009
Token,
Token ID11
Feature activation+4.84e-01

Pos logit contributions
aron+0.147
ist+0.145
enny+0.145
ne+0.143
o+0.141
Neg logit contributions
 chores-0.103
 going-0.101
 same-0.096
 temptation-0.094
 rain-0.089
Token the
Token ID262
Feature activation+1.68e+00

Pos logit contributions
o+0.017
ne+0.017
enny+0.017
aron+0.017
ppy+0.017
Neg logit contributions
 chores-0.007
 temptation-0.006
 going-0.006
 same-0.006
 rain-0.005
Token prince
Token ID19716
Feature activation+1.89e+01

Pos logit contributions
ne+0.039
aron+0.038
ppy+0.038
enny+0.037
ouse+0.036
Neg logit contributions
 chores-0.046
 temptation-0.043
 going-0.043
wered-0.039
 same-0.039
Token decided
Token ID3066
Feature activation-2.43e+00

Pos logit contributions
aron+0.624
ist+0.603
enny+0.588
itty+0.569
oing+0.558
Neg logit contributions
 going-0.405
 same-0.362
 rain-0.347
 chores-0.339
 work-0.329
Token to
Token ID284
Feature activation-5.98e+00

Pos logit contributions
 chores+0.060
 temptation+0.056
 same+0.053
wered+0.052
 going+0.051
Neg logit contributions
ne-0.064
o-0.064
enny-0.064
aron-0.062
ist-0.061
Token go
Token ID467
Feature activation-5.13e+00

Pos logit contributions
 temptation+0.086
 going+0.085
 chores+0.084
 same+0.079
 sorry+0.070
Neg logit contributions
ne-0.207
ist-0.202
os-0.196
aron-0.195
r-0.194
Token for
Token ID329
Feature activation-1.04e+01

Pos logit contributions
 chores+0.100
 temptation+0.096
 going+0.093
 same+0.093
 rain+0.085
Neg logit contributions
o-0.163
ist-0.161
ne-0.161
enny-0.160
aron-0.160
Token a
Token ID257
Feature activation-2.18e+00

Pos logit contributions
 same+0.163
 temptation+0.158
 hurricane+0.152
 going+0.145
 urge+0.143
Neg logit contributions
ne-0.312
es-0.311
aron-0.303
o-0.302
ppy-0.298
Token sea
Token ID5417
Feature activation+7.84e+00

Pos logit contributions
 chores+0.083
 going+0.079
 temptation+0.075
 work+0.064
 same+0.061
Neg logit contributions
o-0.157
ne-0.153
aron-0.151
enny-0.149
umping-0.147
Token.
Token ID13
Feature activation+5.10e+00

Pos logit contributions
enny+0.236
aron+0.236
ist+0.231
ne+0.228
jo+0.226
Neg logit contributions
 chores-0.177
 going-0.176
 same-0.170
 temptation-0.157
 rain-0.155
Token One
Token ID1881
Feature activation+8.61e-01

Pos logit contributions
aron+0.152
o+0.149
enny+0.148
ne+0.143
ist+0.142
Neg logit contributions
 chores-0.114
 going-0.110
 temptation-0.106
 same-0.102
 rain-0.096
Token day
Token ID1110
Feature activation+4.53e+00

Pos logit contributions
aron+0.038
ist+0.037
enny+0.037
ne+0.037
ppy+0.036
Neg logit contributions
 chores-0.008
 going-0.007
 same-0.006
 temptation-0.006
 rain-0.005
Token,
Token ID11
Feature activation+2.09e-01

Pos logit contributions
aron+0.128
enny+0.128
ne+0.126
o+0.126
ist+0.124
Neg logit contributions
 chores-0.107
 going-0.102
 same-0.099
 temptation-0.098
 rain-0.091
Token Nem
Token ID22547
Feature activation+1.89e+01

Pos logit contributions
o+0.005
enny+0.005
ne+0.005
aron+0.005
ppy+0.005
Neg logit contributions
 chores-0.005
 temptation-0.005
 trouble-0.005
 same-0.004
 going-0.004
Tokeno
Token ID78
Feature activation+1.38e+01

Pos logit contributions
ist+0.341
aron+0.324
owed+0.257
ol+0.248
co+0.248
Neg logit contributions
 going-0.663
 big-0.635
 same-0.630
 go-0.607
 good-0.595
Token saw
Token ID2497
Feature activation-5.64e+00

Pos logit contributions
aron+0.453
ist+0.440
enny+0.425
itty+0.408
ne+0.405
Neg logit contributions
 going-0.303
 same-0.289
 rain-0.278
 chores-0.278
 go-0.264
Token a
Token ID257
Feature activation-1.46e+00

Pos logit contributions
 chores+0.078
 temptation+0.073
 going+0.065
 same+0.059
 urge+0.053
Neg logit contributions
o-0.221
enny-0.209
aron-0.207
ne-0.206
ppy-0.204
Token shiny
Token ID22441
Feature activation+9.70e-01

Pos logit contributions
 chores+0.017
 going+0.016
 temptation+0.015
 same+0.013
wered+0.012
Neg logit contributions
ne-0.058
aron-0.057
enny-0.057
o-0.057
ist-0.057
Token er
Token ID1931
Feature activation+5.40e+00

Pos logit contributions
ist+0.029
aron+0.029
ne+0.029
enny+0.028
o+0.028
Neg logit contributions
 chores-0.021
 going-0.020
 temptation-0.019
 same-0.018
 rain-0.017
Token
Token ID198
Feature activation+2.82e+00

Pos logit contributions
ist+0.127
aron+0.124
ne+0.122
ppy+0.121
umping+0.118
Neg logit contributions
 chores-0.057
 going-0.057
 same-0.054
 her-0.049
 top-0.049
Token
Token ID198
Feature activation+3.34e+00

Pos logit contributions
ist+0.100
aron+0.097
ppy+0.095
enny+0.092
ne+0.092
Neg logit contributions
 going-0.049
 chores-0.048
 same-0.047
 top-0.043
 her-0.042
TokenOne
Token ID3198
Feature activation+1.84e+00

Pos logit contributions
aron+0.086
ist+0.085
enny+0.084
ne+0.083
o+0.081
Neg logit contributions
 chores-0.090
 going-0.088
 same-0.085
 temptation-0.083
 rain-0.080
Token day
Token ID1110
Feature activation+4.64e+00

Pos logit contributions
aron+0.080
ist+0.080
enny+0.079
ne+0.078
ppy+0.077
Neg logit contributions
 chores-0.016
 going-0.016
 same-0.014
 temptation-0.013
 rain-0.012
Token,
Token ID11
Feature activation+4.65e-01

Pos logit contributions
aron+0.138
enny+0.136
ist+0.135
ne+0.135
o+0.133
Neg logit contributions
 chores-0.105
 going-0.101
 same-0.096
 temptation-0.095
 rain-0.089
Token Nem
Token ID22547
Feature activation+1.89e+01

Pos logit contributions
o+0.012
enny+0.012
aron+0.012
ne+0.012
ppy+0.011
Neg logit contributions
 chores-0.012
 temptation-0.011
 going-0.010
 same-0.010
 plan-0.009
Tokeno
Token ID78
Feature activation+1.36e+01

Pos logit contributions
ist+0.340
aron+0.319
co+0.247
owed+0.243
ows+0.227
Neg logit contributions
 going-0.660
 big-0.648
 same-0.636
 go-0.614
 good-0.599
Token got
Token ID1392
Feature activation-7.84e+00

Pos logit contributions
aron+0.470
ist+0.459
enny+0.438
itty+0.427
jo+0.422
Neg logit contributions
 going-0.272
 same-0.257
 rain-0.252
 go-0.242
 big-0.237
Token caught
Token ID4978
Feature activation-4.54e+00

Pos logit contributions
 temptation+0.138
 chores+0.136
 hurricane+0.128
 same+0.125
 comparison+0.122
Neg logit contributions
o-0.270
os-0.234
enny-0.234
ppy-0.233
ne-0.229
Token on
Token ID319
Feature activation-8.95e+00

Pos logit contributions
 going+0.089
 chores+0.088
 same+0.083
 temptation+0.079
 rain+0.077
Neg logit contributions
enny-0.150
aron-0.150
ist-0.149
ne-0.146
jo-0.146
Token the
Token ID262
Feature activation-3.91e+00

Pos logit contributions
 chores+0.097
 temptation+0.093
 going+0.090
 same+0.083
 hurricane+0.071
Neg logit contributions
o-0.349
aron-0.347
ne-0.347
enny-0.336
ist-0.334
Token They
Token ID1119
Feature activation+2.14e+00

Pos logit contributions
aron+0.106
enny+0.103
o+0.102
ne+0.102
ist+0.101
Neg logit contributions
 chores-0.083
 going-0.081
 same-0.076
 temptation-0.076
 rain-0.070
Token were
Token ID547
Feature activation-8.98e+00

Pos logit contributions
enny+0.066
aron+0.065
ist+0.064
umping+0.062
ne+0.062
Neg logit contributions
 going-0.049
 chores-0.046
 same-0.045
 rain-0.044
 good-0.042
Token happy
Token ID3772
Feature activation-2.40e+00

Pos logit contributions
 chores+0.155
 temptation+0.150
 same+0.136
 top+0.131
wered+0.127
Neg logit contributions
aron-0.313
o-0.306
ist-0.294
as-0.286
os-0.283
Token.
Token ID13
Feature activation+3.23e+00

Pos logit contributions
 going+0.056
 chores+0.054
 same+0.051
 temptation+0.050
 rain+0.050
Neg logit contributions
aron-0.071
ist-0.071
enny-0.071
ne-0.069
ppy-0.068
Token<|endoftext|>
Token ID50256
Feature activation+8.32e+00

Pos logit contributions
aron+0.099
ist+0.099
enny+0.098
ne+0.096
ppy+0.095
Neg logit contributions
 chores-0.071
 going-0.070
 same-0.066
 temptation-0.064
 rain-0.062
TokenLe
Token ID3123
Feature activation+1.89e+01

Pos logit contributions
ist+0.288
aron+0.278
ppy+0.270
enny+0.269
owed+0.266
Neg logit contributions
 going-0.162
 chores-0.158
 same-0.151
 rain-0.141
 top-0.139
Tokeno
Token ID78
Feature activation+1.45e+01

Pos logit contributions
ist+0.303
enny+0.276
ouse+0.250
ave+0.235
lic+0.231
Neg logit contributions
 the-0.646
 their-0.646
 going-0.639
 big-0.629
 her-0.623
Token was
Token ID373
Feature activation-6.95e+00

Pos logit contributions
ist+0.466
enny+0.459
aron+0.451
ne+0.431
ppy+0.428
Neg logit contributions
 going-0.327
 same-0.289
 rain-0.279
 her-0.277
 chores-0.277
Token a
Token ID257
Feature activation-9.41e-01

Pos logit contributions
 temptation+0.146
 chores+0.141
 avail+0.121
 same+0.120
 hurricane+0.119
Neg logit contributions
o-0.229
aron-0.213
ist-0.201
umping-0.201
itty-0.199
Token rare
Token ID4071
Feature activation+2.13e+00

Pos logit contributions
 chores+0.013
 going+0.012
 same+0.011
 temptation+0.011
 rain+0.010
Neg logit contributions
aron-0.036
ist-0.036
enny-0.036
ne-0.036
o-0.035
Token lion
Token ID18744
Feature activation+1.38e+01

Pos logit contributions
aron+0.068
enny+0.067
ist+0.067
ne+0.066
ppy+0.065
Neg logit contributions
 chores-0.044
 going-0.043
 same-0.041
 temptation-0.039
 rain-0.039
Token there
Token ID612
Feature activation+4.24e+00

Pos logit contributions
aron+0.020
ist+0.020
enny+0.020
ne+0.020
o+0.019
Neg logit contributions
 chores-0.009
 going-0.009
 same-0.008
 temptation-0.008
 rain-0.008
Token was
Token ID373
Feature activation-5.77e+00

Pos logit contributions
aron+0.115
ist+0.114
enny+0.111
ne+0.110
ppy+0.108
Neg logit contributions
 going-0.108
 chores-0.108
 same-0.103
 temptation-0.099
 rain-0.097
Token a
Token ID257
Feature activation+1.35e-01

Pos logit contributions
 chores+0.079
 temptation+0.075
 going+0.066
 same+0.064
 hurricane+0.059
Neg logit contributions
o-0.219
aron-0.215
enny-0.212
ne-0.210
ppy-0.209
Token fish
Token ID5916
Feature activation+1.47e+01

Pos logit contributions
aron+0.005
ist+0.005
enny+0.005
ne+0.005
os+0.005
Neg logit contributions
 going-0.002
 chores-0.002
 same-0.002
 rain-0.001
 temptation-0.001
Token named
Token ID3706
Feature activation+1.44e+00

Pos logit contributions
aron+0.439
enny+0.429
ist+0.417
jo+0.397
ppy+0.393
Neg logit contributions
 going-0.350
 same-0.321
 chores-0.313
 big-0.312
 her-0.305
Token Nem
Token ID22547
Feature activation+1.89e+01

Pos logit contributions
aron+0.047
ist+0.046
enny+0.046
ne+0.045
o+0.045
Neg logit contributions
 chores-0.029
 going-0.028
 same-0.026
 temptation-0.026
 rain-0.025
Tokeno
Token ID78
Feature activation+1.39e+01

Pos logit contributions
ist+0.316
aron+0.296
ol+0.269
owed+0.240
ne+0.239
Neg logit contributions
 going-0.677
 big-0.617
 their-0.603
 same-0.596
 her-0.592
Token.
Token ID13
Feature activation+5.94e+00

Pos logit contributions
aron+0.480
enny+0.473
ist+0.457
umping+0.448
ne+0.441
Neg logit contributions
 going-0.272
 chores-0.246
 same-0.234
 her-0.223
 big-0.220
Token Nem
Token ID22547
Feature activation+1.88e+01

Pos logit contributions
o+0.165
aron+0.163
enny+0.161
ne+0.153
ppy+0.152
Neg logit contributions
 chores-0.150
 temptation-0.135
 going-0.133
 same-0.129
 rain-0.122
Tokeno
Token ID78
Feature activation+1.39e+01

Pos logit contributions
ist+0.361
aron+0.314
ol+0.311
enny+0.299
ne+0.288
Neg logit contributions
 going-0.575
 big-0.575
 same-0.542
 good-0.535
 his-0.534
Token loved
Token ID6151
Feature activation-6.78e+00

Pos logit contributions
enny+0.495
ist+0.483
aron+0.481
umping+0.457
itty+0.450
Neg logit contributions
 going-0.266
 same-0.246
 big-0.240
 good-0.231
 chores-0.226
Token friends
Token ID2460
Feature activation+8.02e+00

Pos logit contributions
aron+0.555
ist+0.549
enny+0.545
ne+0.529
jo+0.523
Neg logit contributions
 going-0.182
 chores-0.167
 same-0.150
 big-0.149
 good-0.144
Token.
Token ID13
Feature activation+5.32e+00

Pos logit contributions
aron+0.238
enny+0.234
ist+0.232
ne+0.228
jo+0.224
Neg logit contributions
 going-0.191
 chores-0.185
 same-0.174
 temptation-0.165
 rain-0.164
Token One
Token ID1881
Feature activation+1.00e+00

Pos logit contributions
aron+0.163
o+0.161
enny+0.157
ne+0.154
ist+0.150
Neg logit contributions
 chores-0.115
 going-0.107
 temptation-0.107
 same-0.099
 trouble-0.093
Token day
Token ID1110
Feature activation+4.63e+00

Pos logit contributions
aron+0.044
ist+0.044
enny+0.044
ne+0.043
ppy+0.043
Neg logit contributions
 chores-0.008
 going-0.008
 same-0.007
 temptation-0.006
 rain-0.006
Token,
Token ID11
Feature activation+4.51e-01

Pos logit contributions
enny+0.129
aron+0.129
ne+0.127
o+0.127
ist+0.125
Neg logit contributions
 chores-0.111
 going-0.105
 temptation-0.102
 same-0.102
 rain-0.094
Token Nem
Token ID22547
Feature activation+1.89e+01

Pos logit contributions
o+0.010
enny+0.010
ne+0.009
y+0.009
aron+0.009
Neg logit contributions
 chores-0.013
 temptation-0.012
 trouble-0.011
 same-0.011
 going-0.011
Tokeno
Token ID78
Feature activation+1.38e+01

Pos logit contributions
ist+0.347
aron+0.322
co+0.265
ol+0.256
owed+0.253
Neg logit contributions
 going-0.648
 big-0.618
 same-0.615
 go-0.593
 good-0.577
Token saw
Token ID2497
Feature activation-5.59e+00

Pos logit contributions
aron+0.453
ist+0.442
enny+0.424
ne+0.411
itty+0.410
Neg logit contributions
 going-0.299
 same-0.284
 rain-0.275
 chores-0.268
 go-0.258
Token an
Token ID281
Feature activation-2.37e+00

Pos logit contributions
 chores+0.078
 temptation+0.073
 going+0.066
 same+0.060
 urge+0.054
Neg logit contributions
o-0.217
enny-0.206
ne-0.203
aron-0.203
ppy-0.201
Token incredible
Token ID8082
Feature activation+8.70e-01

Pos logit contributions
 chores+0.045
 going+0.040
 temptation+0.040
 same+0.038
 top+0.034
Neg logit contributions
aron-0.077
ist-0.075
ne-0.075
ppy-0.073
o-0.073
Token sight
Token ID6504
Feature activation+6.37e+00

Pos logit contributions
aron+0.026
ist+0.025
ne+0.025
o+0.025
enny+0.024
Neg logit contributions
 chores-0.020
 going-0.018
 temptation-0.018
 same-0.017
wered-0.016
Token friends
Token ID2460
Feature activation+7.90e+00

Pos logit contributions
 chores+0.060
 temptation+0.057
wered+0.051
 same+0.050
 going+0.049
Neg logit contributions
o-0.108
aron-0.097
jo-0.096
os-0.096
ppy-0.096
Token.
Token ID13
Feature activation+5.40e+00

Pos logit contributions
aron+0.233
enny+0.230
ist+0.228
ne+0.224
ppy+0.218
Neg logit contributions
 going-0.187
 chores-0.184
 same-0.174
 temptation-0.167
 rain-0.163
Token One
Token ID1881
Feature activation+8.72e-01

Pos logit contributions
aron+0.160
o+0.159
enny+0.156
ne+0.152
ist+0.148
Neg logit contributions
 chores-0.122
 going-0.115
 temptation-0.114
 same-0.106
 work-0.100
Token day
Token ID1110
Feature activation+4.57e+00

Pos logit contributions
aron+0.038
ist+0.038
enny+0.038
ne+0.037
ppy+0.037
Neg logit contributions
 chores-0.008
 going-0.007
 same-0.006
 temptation-0.006
 rain-0.005
Token,
Token ID11
Feature activation+4.14e-01

Pos logit contributions
aron+0.129
enny+0.129
ne+0.127
o+0.127
ist+0.125
Neg logit contributions
 chores-0.108
 going-0.102
 temptation-0.099
 same-0.099
 rain-0.092
Token Nem
Token ID22547
Feature activation+1.89e+01

Pos logit contributions
o+0.010
enny+0.009
ne+0.009
y+0.009
aron+0.009
Neg logit contributions
 chores-0.012
 temptation-0.011
 trouble-0.010
 going-0.010
 same-0.010
Tokeno
Token ID78
Feature activation+1.39e+01

Pos logit contributions
ist+0.342
aron+0.314
co+0.253
owed+0.245
ol+0.243
Neg logit contributions
 going-0.662
 big-0.630
 same-0.630
 go-0.604
 good-0.592
Token and
Token ID290
Feature activation-4.30e-01

Pos logit contributions
aron+0.444
ist+0.434
enny+0.415
ne+0.404
itty+0.402
Neg logit contributions
 going-0.310
 same-0.295
 rain-0.285
 chores-0.284
 go-0.267
Token his
Token ID465
Feature activation+2.15e+00

Pos logit contributions
 chores+0.007
 temptation+0.006
 going+0.006
 same+0.006
 top+0.005
Neg logit contributions
o-0.015
enny-0.015
ppy-0.015
jo-0.014
aron-0.014
Token friends
Token ID2460
Feature activation+9.70e+00

Pos logit contributions
enny+0.076
aron+0.076
o+0.075
ist+0.075
ne+0.075
Neg logit contributions
 chores-0.035
 going-0.034
 temptation-0.032
 same-0.031
 rain-0.028
Token went
Token ID1816
Feature activation-5.60e+00

Pos logit contributions
aron+0.283
enny+0.282
ist+0.282
ne+0.278
o+0.276
Neg logit contributions
 chores-0.222
 going-0.214
 same-0.205
 temptation-0.203
 rain-0.190
Token ringing
Token ID32333
Feature activation+5.70e-01

Pos logit contributions
 chores+0.025
 going+0.023
 same+0.020
 temptation+0.019
 money+0.016
Neg logit contributions
ne-0.074
aron-0.074
ist-0.073
jo-0.072
o-0.072
Token sound
Token ID2128
Feature activation+1.89e+00

Pos logit contributions
aron+0.022
ist+0.022
enny+0.021
ne+0.021
o+0.021
Neg logit contributions
 chores-0.008
 going-0.008
 same-0.007
 temptation-0.007
 rain-0.006
Token.
Token ID13
Feature activation+5.78e+00

Pos logit contributions
aron+0.054
enny+0.054
ist+0.054
ne+0.053
o+0.052
Neg logit contributions
 going-0.044
 chores-0.044
 same-0.041
 temptation-0.040
 rain-0.039
Token
Token ID198
Feature activation+3.28e+00

Pos logit contributions
ist+0.192
aron+0.186
enny+0.185
ne+0.183
ppy+0.180
Neg logit contributions
 going-0.113
 chores-0.112
 same-0.107
 temptation-0.103
 top-0.098
Token
Token ID198
Feature activation+3.75e+00

Pos logit contributions
ist+0.121
aron+0.115
ppy+0.113
enny+0.111
owed+0.110
Neg logit contributions
 going-0.052
 chores-0.051
 same-0.049
 big-0.048
 top-0.047
TokenLe
Token ID3123
Feature activation+1.89e+01

Pos logit contributions
ist+0.106
aron+0.100
enny+0.099
owed+0.095
ne+0.095
Neg logit contributions
 chores-0.090
 going-0.090
 same-0.088
 top-0.087
 temptation-0.086
Tokeno
Token ID78
Feature activation+1.40e+01

Pos logit contributions
ist+0.280
round+0.278
aron+0.275
plets+0.239
ms+0.229
Neg logit contributions
 big-0.590
 his-0.573
 time-0.564
 an-0.564
 a-0.558
Token looked
Token ID3114
Feature activation-2.61e+00

Pos logit contributions
aron+0.471
enny+0.450
ist+0.450
umping+0.430
itty+0.427
Neg logit contributions
 going-0.272
 rain-0.253
 big-0.248
 same-0.247
 go-0.241
Token around
Token ID1088
Feature activation-1.94e+00

Pos logit contributions
 chores+0.045
 going+0.043
 same+0.041
 temptation+0.040
 rain+0.037
Neg logit contributions
aron-0.092
ist-0.091
enny-0.090
ne-0.089
o-0.088
Token and
Token ID290
Feature activation-4.31e+00

Pos logit contributions
 chores+0.055
 going+0.051
 same+0.050
 temptation+0.050
 rain+0.046
Neg logit contributions
aron-0.047
ist-0.046
ne-0.046
enny-0.045
o-0.045
Token saw
Token ID2497
Feature activation-6.04e+00

Pos logit contributions
 going+0.099
 same+0.092
 chores+0.089
 rain+0.087
 her+0.085
Neg logit contributions
aron-0.133
ist-0.130
enny-0.129
ne-0.126
umping-0.123
Token was
Token ID373
Feature activation-5.82e+00

Pos logit contributions
aron+0.117
ist+0.116
enny+0.114
ne+0.113
ppy+0.111
Neg logit contributions
 going-0.109
 chores-0.108
 same-0.104
 temptation-0.099
 rain-0.098
Token a
Token ID257
Feature activation+8.20e-02

Pos logit contributions
 chores+0.083
 temptation+0.080
 going+0.069
 same+0.067
 hurricane+0.063
Neg logit contributions
o-0.215
aron-0.212
enny-0.209
ne-0.206
umping-0.205
Token little
Token ID1310
Feature activation+3.98e+00

Pos logit contributions
aron+0.003
ist+0.003
enny+0.003
ne+0.003
os+0.003
Neg logit contributions
 chores-0.001
 going-0.001
 same-0.001
 temptation-0.001
 rain-0.001
Token fish
Token ID5916
Feature activation+1.49e+01

Pos logit contributions
aron+0.133
ist+0.132
enny+0.131
owed+0.128
os+0.127
Neg logit contributions
 same-0.075
 going-0.074
 chores-0.074
 rain-0.072
 temptation-0.069
Token named
Token ID3706
Feature activation+1.57e+00

Pos logit contributions
aron+0.440
enny+0.433
ist+0.416
ppy+0.396
jo+0.394
Neg logit contributions
 going-0.362
 same-0.331
 chores-0.327
 big-0.317
 her-0.315
Token Nem
Token ID22547
Feature activation+1.89e+01

Pos logit contributions
aron+0.050
o+0.049
enny+0.049
ist+0.049
ne+0.049
Neg logit contributions
 chores-0.033
 going-0.031
 temptation-0.030
 same-0.029
 rain-0.027
Tokeno
Token ID78
Feature activation+1.40e+01

Pos logit contributions
ist+0.316
aron+0.296
ol+0.275
pillar+0.245
ne+0.236
Neg logit contributions
 going-0.688
 big-0.617
 their-0.611
 her-0.603
 the-0.600
Token.
Token ID13
Feature activation+5.85e+00

Pos logit contributions
aron+0.486
enny+0.481
ist+0.462
umping+0.456
ne+0.447
Neg logit contributions
 going-0.275
 chores-0.248
 same-0.240
 her-0.225
 big-0.224
Token Nem
Token ID22547
Feature activation+1.88e+01

Pos logit contributions
o+0.164
aron+0.163
enny+0.159
ne+0.153
ppy+0.151
Neg logit contributions
 chores-0.144
 temptation-0.132
 going-0.131
 same-0.125
 work-0.118
Tokeno
Token ID78
Feature activation+1.39e+01

Pos logit contributions
ist+0.378
ol+0.334
aron+0.321
enny+0.317
ome+0.304
Neg logit contributions
 going-0.547
 big-0.547
 his-0.517
 same-0.515
 good-0.511
Token lived
Token ID5615
Feature activation-4.86e+00

Pos logit contributions
enny+0.498
aron+0.481
ist+0.479
umping+0.459
itty+0.450
Neg logit contributions
 going-0.264
 same-0.248
 big-0.237
 good-0.231
 chores-0.229
Token.
Token ID13
Feature activation+4.05e+00

Pos logit contributions
aron+0.070
enny+0.068
ist+0.068
ne+0.067
ppy+0.066
Neg logit contributions
 chores-0.055
 going-0.055
 same-0.052
 temptation-0.050
 rain-0.048
Token<|endoftext|>
Token ID50256
Feature activation+9.31e+00

Pos logit contributions
ist+0.140
ppy+0.138
aron+0.136
ne+0.135
enny+0.132
Neg logit contributions
 going-0.074
 chores-0.071
 same-0.066
 temptation-0.064
 top-0.062
TokenOne
Token ID3198
Feature activation+4.49e-01

Pos logit contributions
ist+0.357
aron+0.336
ppy+0.334
owed+0.329
lets+0.322
Neg logit contributions
 going-0.157
 same-0.144
 chores-0.137
 top-0.134
 the-0.131
Token day
Token ID1110
Feature activation+4.91e+00

Pos logit contributions
enny+0.018
aron+0.018
ne+0.018
o+0.018
ist+0.018
Neg logit contributions
 chores-0.005
 going-0.005
 same-0.004
 temptation-0.004
 rain-0.003
Token,
Token ID11
Feature activation+2.44e-01

Pos logit contributions
enny+0.139
o+0.136
ne+0.135
aron+0.135
ist+0.134
Neg logit contributions
 chores-0.117
 going-0.111
 temptation-0.110
 same-0.109
 rain-0.100
Token Milo
Token ID32662
Feature activation+1.89e+01

Pos logit contributions
aron+0.010
enny+0.010
ist+0.010
o+0.010
ne+0.010
Neg logit contributions
 chores-0.003
 going-0.002
 same-0.002
 temptation-0.002
 rain-0.002
Token and
Token ID290
Feature activation-1.19e+00

Pos logit contributions
aron+0.521
ist+0.520
enny+0.514
ne+0.501
ppy+0.485
Neg logit contributions
 going-0.474
 chores-0.461
 same-0.441
 rain-0.426
 temptation-0.420
Token his
Token ID465
Feature activation+2.63e+00

Pos logit contributions
 chores+0.017
 temptation+0.014
 same+0.014
 going+0.014
 top+0.014
Neg logit contributions
o-0.045
ppy-0.044
enny-0.043
aron-0.043
jo-0.043
Token Mom
Token ID11254
Feature activation+1.28e+01

Pos logit contributions
enny+0.095
o+0.094
ppy+0.093
ne+0.092
ist+0.092
Neg logit contributions
 chores-0.039
 going-0.038
 temptation-0.036
 same-0.033
wered-0.031
Tokenmy
Token ID1820
Feature activation+1.08e+01

Pos logit contributions
aron+0.348
enny+0.342
ist+0.341
o+0.339
ne+0.337
Neg logit contributions
 chores-0.321
 going-0.308
 same-0.299
 temptation-0.298
 rain-0.280
Token were
Token ID547
Feature activation-6.47e+00

Pos logit contributions
aron+0.335
enny+0.330
ist+0.329
ne+0.326
o+0.326
Neg logit contributions
 chores-0.230
 going-0.220
 same-0.211
 temptation-0.210
 rain-0.195
Token ocean
Token ID9151
Feature activation+9.46e+00

Pos logit contributions
aron+0.140
ist+0.139
enny+0.138
ne+0.136
ppy+0.135
Neg logit contributions
 chores-0.042
 going-0.041
 same-0.038
 temptation-0.035
 rain-0.033
Token.
Token ID13
Feature activation+5.10e+00

Pos logit contributions
aron+0.265
ist+0.261
enny+0.261
ne+0.257
o+0.253
Neg logit contributions
 chores-0.230
 going-0.227
 same-0.216
 temptation-0.211
 rain-0.203
Token One
Token ID1881
Feature activation+7.93e-01

Pos logit contributions
aron+0.167
o+0.163
enny+0.163
ne+0.160
ist+0.159
Neg logit contributions
 chores-0.100
 going-0.097
 temptation-0.092
 same-0.089
 rain-0.083
Token day
Token ID1110
Feature activation+4.47e+00

Pos logit contributions
aron+0.034
ist+0.034
enny+0.034
ne+0.034
ppy+0.033
Neg logit contributions
 chores-0.007
 going-0.007
 same-0.006
 temptation-0.006
 rain-0.005
Token,
Token ID11
Feature activation+1.84e-01

Pos logit contributions
aron+0.128
enny+0.127
ne+0.126
o+0.125
ist+0.124
Neg logit contributions
 chores-0.105
 going-0.100
 same-0.097
 temptation-0.096
 rain-0.089
Token Nem
Token ID22547
Feature activation+1.89e+01

Pos logit contributions
o+0.005
enny+0.004
ne+0.004
aron+0.004
ppy+0.004
Neg logit contributions
 chores-0.005
 temptation-0.004
 trouble-0.004
 same-0.004
 going-0.004
Tokeno
Token ID78
Feature activation+1.39e+01

Pos logit contributions
ist+0.352
aron+0.331
ol+0.259
owed+0.258
co+0.258
Neg logit contributions
 going-0.645
 big-0.612
 same-0.608
 go-0.588
 not-0.571
Token saw
Token ID2497
Feature activation-5.63e+00

Pos logit contributions
aron+0.460
ist+0.448
enny+0.431
itty+0.414
ne+0.413
Neg logit contributions
 going-0.298
 same-0.285
 rain-0.276
 chores-0.269
 go-0.262
Token a
Token ID257
Feature activation-1.44e+00

Pos logit contributions
 chores+0.074
 temptation+0.069
 going+0.063
 same+0.057
 urge+0.050
Neg logit contributions
o-0.223
enny-0.213
aron-0.211
ne-0.210
ppy-0.208
Token plastic
Token ID7309
Feature activation+7.15e+00

Pos logit contributions
 chores+0.016
 going+0.015
 temptation+0.014
 same+0.012
wered+0.011
Neg logit contributions
ne-0.058
enny-0.057
aron-0.057
o-0.057
ist-0.057
Token bag
Token ID6131
Feature activation+6.79e+00

Pos logit contributions
aron+0.218
ist+0.217
enny+0.215
ne+0.215
o+0.211
Neg logit contributions
 chores-0.153
 going-0.149
 same-0.141
 temptation-0.140
 rain-0.130
Token,
Token ID11
Feature activation-2.39e+00

Pos logit contributions
aron+0.005
enny+0.005
ist+0.005
ne+0.005
ppy+0.005
Neg logit contributions
 chores-0.005
 going-0.005
 same-0.004
 temptation-0.004
 rain-0.004
Token even
Token ID772
Feature activation-3.04e+00

Pos logit contributions
 chores+0.056
 going+0.053
 same+0.052
 temptation+0.051
 rain+0.049
Neg logit contributions
aron-0.068
enny-0.067
ist-0.067
ne-0.066
o-0.066
Token when
Token ID618
Feature activation-2.89e+00

Pos logit contributions
 chores+0.071
 going+0.068
 temptation+0.067
 same+0.066
 rain+0.066
Neg logit contributions
ne-0.084
aron-0.083
ist-0.083
enny-0.083
o-0.081
Token it
Token ID340
Feature activation+4.92e+00

Pos logit contributions
wered+0.047
 same+0.046
 chores+0.045
 temptation+0.044
 urge+0.042
Neg logit contributions
es-0.093
enny-0.091
ne-0.090
o-0.090
os-0.088
Token differed
Token ID36337
Feature activation-4.57e+00

Pos logit contributions
aron+0.133
enny+0.130
ist+0.130
ne+0.128
ppy+0.125
Neg logit contributions
 chores-0.126
 going-0.124
 same-0.119
 temptation-0.115
 rain-0.112
Token from
Token ID422
Feature activation-1.47e+01

Pos logit contributions
 chores+0.100
 going+0.099
 same+0.094
 temptation+0.090
 rain+0.086
Neg logit contributions
aron-0.138
enny-0.138
itty-0.135
o-0.135
ne-0.134
Token his
Token ID465
Feature activation-2.38e+00

Pos logit contributions
 chores+0.209
wered+0.173
 temptation+0.172
 rain+0.170
 going+0.165
Neg logit contributions
aron-0.535
ist-0.514
ne-0.507
o-0.503
ppy-0.501
Token own
Token ID898
Feature activation-6.33e+00

Pos logit contributions
 going+0.035
 chores+0.032
 same+0.030
 rain+0.029
 temptation+0.029
Neg logit contributions
aron-0.090
ist-0.088
ne-0.087
enny-0.087
jo-0.085
Token.
Token ID13
Feature activation+3.30e+00

Pos logit contributions
 chores+0.159
 going+0.157
 same+0.150
 temptation+0.147
 rain+0.143
Neg logit contributions
aron-0.171
ist-0.168
enny-0.166
ne-0.164
o-0.160
Token From
Token ID3574
Feature activation-5.18e+00

Pos logit contributions
ist+0.118
aron+0.114
ppy+0.111
owed+0.110
ne+0.109
Neg logit contributions
 chores-0.055
 going-0.054
 same-0.054
 temptation-0.051
 top-0.049
Token then
Token ID788
Feature activation-1.33e-01

Pos logit contributions
 chores+0.091
 going+0.090
 same+0.084
 temptation+0.080
 rain+0.076
Neg logit contributions
aron-0.181
enny-0.178
ist-0.178
ne-0.176
o-0.173
Token,
Token ID11
Feature activation-1.31e+00

Pos logit contributions
 chores+0.046
 going+0.045
 same+0.043
 temptation+0.042
 rain+0.040
Neg logit contributions
aron-0.061
ist-0.061
enny-0.061
ne-0.060
o-0.059
Token it
Token ID340
Feature activation+6.19e+00

Pos logit contributions
 chores+0.024
 temptation+0.022
 same+0.022
wered+0.020
 urge+0.019
Neg logit contributions
es-0.040
enny-0.039
ne-0.038
os-0.038
ist-0.038
Token kept
Token ID4030
Feature activation-1.12e+01

Pos logit contributions
aron+0.191
enny+0.190
ist+0.185
ne+0.183
jo+0.181
Neg logit contributions
 going-0.135
 chores-0.130
 same-0.125
 rain-0.121
 temptation-0.117
Token on
Token ID319
Feature activation-1.16e+01

Pos logit contributions
 chores+0.202
 temptation+0.179
 same+0.168
 urge+0.154
 plan+0.151
Neg logit contributions
o-0.384
ist-0.368
aron-0.363
os-0.351
ppy-0.350
Token flying
Token ID7348
Feature activation-3.27e+00

Pos logit contributions
 chores+0.218
 temptation+0.192
 same+0.175
 hurricane+0.175
wered+0.171
Neg logit contributions
aron-0.374
ist-0.361
o-0.357
umping-0.348
jo-0.347
Token err
Token ID11454
Feature activation-1.45e+01

Pos logit contributions
 going+0.067
 chores+0.066
 same+0.065
 temptation+0.061
 rain+0.060
Neg logit contributions
enny-0.105
aron-0.103
ne-0.101
ist-0.101
ppy-0.099
Tokenatically
Token ID4142
Feature activation-2.14e-01

Pos logit contributions
 chores+0.350
 going+0.346
 temptation+0.330
 same+0.323
 rain+0.312
Neg logit contributions
ist-0.391
aron-0.388
enny-0.385
o-0.380
ne-0.377
Token with
Token ID351
Feature activation-1.27e+01

Pos logit contributions
 chores+0.005
 going+0.005
 same+0.005
 temptation+0.005
 rain+0.005
Neg logit contributions
enny-0.006
aron-0.006
ist-0.006
ne-0.006
o-0.006
Token inexplicable
Token ID49602
Feature activation+1.04e+00

Pos logit contributions
 chores+0.177
 going+0.152
 temptation+0.134
 same+0.129
 hurricane+0.120
Neg logit contributions
ne-0.465
aron-0.465
ist-0.464
os-0.463
o-0.459
Token speed
Token ID2866
Feature activation+8.61e-01

Pos logit contributions
ist+0.035
aron+0.034
ne+0.034
jo+0.033
owed+0.032
Neg logit contributions
 chores-0.019
 going-0.019
 same-0.017
 temptation-0.017
wered-0.016
Token.
Token ID13
Feature activation+4.53e+00

Pos logit contributions
aron+0.026
enny+0.025
ist+0.024
ne+0.024
ppy+0.024
Neg logit contributions
 going-0.020
 chores-0.019
 same-0.019
 temptation-0.017
 rain-0.017
Token,
Token ID11
Feature activation-3.84e+00

Pos logit contributions
aron+0.112
ist+0.111
enny+0.111
ne+0.109
o+0.107
Neg logit contributions
 chores-0.081
 going-0.080
 same-0.076
 temptation-0.074
 rain-0.071
Token they
Token ID484
Feature activation+2.50e+00

Pos logit contributions
 chores+0.066
 same+0.065
 temptation+0.063
 going+0.062
wered+0.060
Neg logit contributions
enny-0.121
ist-0.121
aron-0.121
ne-0.120
o-0.119
Token wouldn
Token ID3636
Feature activation-7.24e+00

Pos logit contributions
ist+0.061
enny+0.060
ne+0.059
aron+0.059
o+0.058
Neg logit contributions
 chores-0.068
 going-0.067
 same-0.065
 temptation-0.064
 rain-0.061
Token't
Token ID470
Feature activation-8.40e+00

Pos logit contributions
 chores+0.089
 going+0.086
 same+0.079
 temptation+0.075
 rain+0.068
Neg logit contributions
aron-0.288
enny-0.286
ist-0.286
ne-0.283
o-0.280
Token have
Token ID423
Feature activation-1.02e+01

Pos logit contributions
 chores+0.134
 going+0.124
 same+0.118
 temptation+0.113
 top+0.105
Neg logit contributions
ist-0.303
aron-0.302
ne-0.300
enny-0.296
ppy-0.293
Token gotten
Token ID7891
Feature activation-1.43e+01

Pos logit contributions
 chores+0.177
 going+0.167
 temptation+0.163
 same+0.159
wered+0.154
Neg logit contributions
aron-0.339
enny-0.338
ne-0.338
ist-0.336
o-0.335
Token broken
Token ID5445
Feature activation-4.96e+00

Pos logit contributions
 chores+0.261
wered+0.245
 temptation+0.237
 going+0.232
 same+0.231
Neg logit contributions
o-0.463
ist-0.455
ne-0.455
enny-0.454
aron-0.449
Token.
Token ID13
Feature activation+4.46e+00

Pos logit contributions
 chores+0.127
 going+0.126
 same+0.120
 temptation+0.117
 rain+0.113
Neg logit contributions
aron-0.132
enny-0.130
ist-0.130
ne-0.128
o-0.126
Token Tim
Token ID5045
Feature activation+1.30e+01

Pos logit contributions
ist+0.157
aron+0.150
ppy+0.149
ne+0.147
owed+0.145
Neg logit contributions
 chores-0.076
 going-0.074
 same-0.071
 temptation-0.068
 her-0.068
Tokenmy
Token ID1820
Feature activation+9.14e+00

Pos logit contributions
aron+0.335
oing+0.335
ist+0.334
ne+0.324
enny+0.320
Neg logit contributions
 same-0.342
 going-0.322
 her-0.321
 chores-0.311
 work-0.303
Token realized
Token ID6939
Feature activation-4.65e+00

Pos logit contributions
ist+0.271
aron+0.270
enny+0.268
ne+0.261
os+0.259
Neg logit contributions
 going-0.202
 chores-0.200
 same-0.194
 her-0.184
 good-0.183
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
 chores+0.137
 going+0.128
 same+0.126
 temptation+0.122
 rain+0.116
Neg logit contributions
aron-0.158
ist-0.156
o-0.155
enny-0.154
ne-0.153
Token green
Token ID4077
Feature activation+4.58e+00

Pos logit contributions
 chores+0.028
 going+0.028
wered+0.026
 temptation+0.024
 hurricane+0.024
Neg logit contributions
ne-0.035
enny-0.033
ppy-0.032
ist-0.032
pillar-0.032
Token bubble
Token ID14310
Feature activation+8.61e+00

Pos logit contributions
aron+0.146
ist+0.144
enny+0.144
ne+0.142
o+0.140
Neg logit contributions
 chores-0.094
 going-0.092
 same-0.087
 temptation-0.085
 rain-0.081
Token just
Token ID655
Feature activation-3.77e+00

Pos logit contributions
aron+0.260
enny+0.257
ist+0.255
ne+0.252
o+0.249
Neg logit contributions
 chores-0.188
 going-0.187
 same-0.177
 temptation-0.171
 rain-0.168
Token kept
Token ID4030
Feature activation-1.25e+01

Pos logit contributions
 chores+0.090
 going+0.088
 same+0.084
 temptation+0.082
 rain+0.078
Neg logit contributions
aron-0.108
ist-0.106
enny-0.105
ne-0.104
o-0.103
Token getting
Token ID1972
Feature activation-1.42e+01

Pos logit contributions
 chores+0.223
 same+0.198
 temptation+0.198
wered+0.188
 hurricane+0.171
Neg logit contributions
o-0.419
aron-0.407
ist-0.405
os-0.389
umping-0.388
Token bigger
Token ID5749
Feature activation-2.71e+00

Pos logit contributions
 chores+0.256
 temptation+0.237
 same+0.224
wered+0.221
 going+0.218
Neg logit contributions
o-0.471
ist-0.455
aron-0.451
ppy-0.444
enny-0.444
Token and
Token ID290
Feature activation-6.99e+00

Pos logit contributions
 going+0.063
 chores+0.061
 same+0.058
 temptation+0.057
 rain+0.056
Neg logit contributions
enny-0.080
ist-0.078
aron-0.078
o-0.077
ne-0.077
Token bigger
Token ID5749
Feature activation-2.18e+00

Pos logit contributions
 chores+0.149
 temptation+0.125
wered+0.125
 going+0.122
 urge+0.113
Neg logit contributions
ist-0.208
o-0.208
ne-0.205
aron-0.197
bug-0.196
Token.
Token ID13
Feature activation+4.08e+00

Pos logit contributions
 going+0.052
 chores+0.052
 same+0.049
 temptation+0.048
 rain+0.047
Neg logit contributions
enny-0.061
aron-0.061
ist-0.060
ne-0.059
o-0.059
Token
Token ID198
Feature activation+1.69e+00

Pos logit contributions
ist+0.133
aron+0.132
ne+0.128
enny+0.128
ppy+0.128
Neg logit contributions
 going-0.082
 chores-0.081
 same-0.076
 temptation-0.075
 rain-0.071
Token,
Token ID11
Feature activation-2.58e+00

Pos logit contributions
 chores+0.065
 going+0.064
 same+0.061
 temptation+0.061
 rain+0.058
Neg logit contributions
aron-0.071
ist-0.071
enny-0.070
ne-0.069
o-0.068
Token but
Token ID475
Feature activation-1.82e+00

Pos logit contributions
 chores+0.053
 going+0.049
 same+0.048
 temptation+0.048
wered+0.044
Neg logit contributions
aron-0.081
ist-0.079
ne-0.077
o-0.077
enny-0.077
Token his
Token ID465
Feature activation+1.97e+00

Pos logit contributions
 chores+0.031
 temptation+0.028
 same+0.028
 going+0.028
wered+0.025
Neg logit contributions
o-0.062
ist-0.062
enny-0.061
ne-0.061
aron-0.060
Token mustache
Token ID49303
Feature activation+1.09e+01

Pos logit contributions
enny+0.067
ist+0.067
aron+0.067
ne+0.066
o+0.066
Neg logit contributions
 going-0.031
 chores-0.030
 temptation-0.028
 same-0.028
wered-0.027
Token kept
Token ID4030
Feature activation-1.24e+01

Pos logit contributions
aron+0.309
enny+0.309
ne+0.303
ist+0.302
ppy+0.297
Neg logit contributions
 chores-0.265
 going-0.265
 same-0.258
 temptation-0.238
 rain-0.236
Token getting
Token ID1972
Feature activation-1.42e+01

Pos logit contributions
 chores+0.177
wered+0.173
 temptation+0.167
 same+0.162
 urge+0.147
Neg logit contributions
aron-0.423
ist-0.419
o-0.411
umping-0.394
ne-0.387
Token in
Token ID287
Feature activation-1.09e+01

Pos logit contributions
 chores+0.237
wered+0.236
 temptation+0.232
 same+0.215
 going+0.213
Neg logit contributions
o-0.443
aron-0.435
ist-0.433
enny-0.432
ne-0.429
Token the
Token ID262
Feature activation-5.37e+00

Pos logit contributions
 chores+0.159
wered+0.156
 same+0.152
 temptation+0.149
 going+0.147
Neg logit contributions
aron-0.368
ist-0.337
ne-0.332
o-0.327
oc-0.326
Token way
Token ID835
Feature activation-3.87e+00

Pos logit contributions
 going+0.080
 chores+0.077
 temptation+0.069
wered+0.068
 her+0.063
Neg logit contributions
aron-0.184
ne-0.178
ist-0.177
o-0.175
ney-0.174
Token and
Token ID290
Feature activation-6.33e+00

Pos logit contributions
 chores+0.090
 going+0.085
 temptation+0.083
 same+0.082
 rain+0.077
Neg logit contributions
ist-0.110
enny-0.109
aron-0.108
ne-0.107
o-0.107
Token preventing
Token ID12174
Feature activation-1.17e+01

Pos logit contributions
 chores+0.124
 temptation+0.112
 same+0.109
wered+0.109
 going+0.108
Neg logit contributions
o-0.196
ist-0.195
aron-0.193
os-0.190
enny-0.188
Token kept
Token ID4030
Feature activation-1.11e+01

Pos logit contributions
aron+0.134
enny+0.134
ist+0.133
ne+0.129
ppy+0.127
Neg logit contributions
 going-0.109
 chores-0.107
 same-0.101
 work-0.096
 rain-0.096
Token helping
Token ID5742
Feature activation-6.94e+00

Pos logit contributions
 chores+0.173
 same+0.173
 temptation+0.170
wered+0.165
 rain+0.157
Neg logit contributions
aron-0.368
o-0.365
ist-0.352
es-0.348
umping-0.345
Token out
Token ID503
Feature activation-3.33e+00

Pos logit contributions
 chores+0.108
 going+0.106
 same+0.097
 temptation+0.095
 rain+0.089
Neg logit contributions
aron-0.254
ist-0.251
enny-0.250
ne-0.246
ppy-0.243
Token and
Token ID290
Feature activation-5.70e+00

Pos logit contributions
 chores+0.061
 going+0.060
 same+0.058
 temptation+0.056
 rain+0.052
Neg logit contributions
ist-0.110
aron-0.110
enny-0.108
ne-0.108
o-0.106
Token kept
Token ID4030
Feature activation-1.24e+01

Pos logit contributions
 chores+0.128
 same+0.118
 going+0.117
 temptation+0.115
wered+0.111
Neg logit contributions
ist-0.167
aron-0.165
o-0.163
enny-0.163
ne-0.162
Token getting
Token ID1972
Feature activation-1.42e+01

Pos logit contributions
 same+0.173
 chores+0.169
 temptation+0.164
wered+0.162
 rain+0.154
Neg logit contributions
aron-0.433
o-0.427
ist-0.419
es-0.411
ne-0.409
Token rewarded
Token ID20945
Feature activation-7.44e+00

Pos logit contributions
 chores+0.218
 temptation+0.217
wered+0.211
 same+0.206
 going+0.201
Neg logit contributions
o-0.465
aron-0.458
enny-0.457
ist-0.451
os-0.449
Token,
Token ID11
Feature activation-3.12e+00

Pos logit contributions
 chores+0.170
 going+0.168
 same+0.158
 temptation+0.155
 rain+0.148
Neg logit contributions
aron-0.218
enny-0.215
ist-0.215
ne-0.213
o-0.209
Token until
Token ID1566
Feature activation-3.14e+00

Pos logit contributions
 chores+0.075
 same+0.069
 going+0.069
 temptation+0.067
wered+0.065
Neg logit contributions
aron-0.089
ist-0.086
enny-0.083
ne-0.082
o-0.082
Token one
Token ID530
Feature activation+3.05e+00

Pos logit contributions
 temptation+0.057
 same+0.057
 chores+0.054
 going+0.050
wered+0.050
Neg logit contributions
enny-0.095
o-0.088
ne-0.088
ry-0.085
es-0.084
Token day
Token ID1110
Feature activation+2.15e+00

Pos logit contributions
aron+0.128
ist+0.127
enny+0.126
ne+0.124
ppy+0.123
Neg logit contributions
 chores-0.033
 going-0.032
 same-0.029
 temptation-0.027
 rain-0.025
Token money
Token ID1637
Feature activation+2.60e+00

Pos logit contributions
 chores+0.040
 going+0.040
 same+0.035
 temptation+0.035
 rain+0.033
Neg logit contributions
aron-0.100
ist-0.099
enny-0.099
ne-0.098
o-0.097
Token and
Token ID290
Feature activation-5.84e+00

Pos logit contributions
aron+0.075
enny+0.075
ist+0.074
ne+0.073
jo+0.073
Neg logit contributions
 going-0.060
 chores-0.060
 same-0.057
 temptation-0.054
 good-0.051
Token didn
Token ID1422
Feature activation-8.06e+00

Pos logit contributions
 chores+0.104
 temptation+0.099
wered+0.094
 going+0.093
 same+0.093
Neg logit contributions
aron-0.187
o-0.186
ist-0.186
enny-0.184
ne-0.183
Token't
Token ID470
Feature activation-7.79e+00

Pos logit contributions
 temptation+0.143
 chores+0.140
 same+0.139
 going+0.137
 rain+0.126
Neg logit contributions
ne-0.256
enny-0.251
o-0.249
os-0.242
aron-0.238
Token delay
Token ID5711
Feature activation-6.86e+00

Pos logit contributions
 chores+0.131
 going+0.122
 same+0.118
 temptation+0.112
 rain+0.107
Neg logit contributions
aron-0.274
ne-0.270
ist-0.269
ppy-0.264
enny-0.264
Token getting
Token ID1972
Feature activation-1.40e+01

Pos logit contributions
 chores+0.114
wered+0.111
 temptation+0.108
 same+0.105
 urge+0.100
Neg logit contributions
aron-0.227
ist-0.222
o-0.216
ne-0.214
ppy-0.211
Token them
Token ID606
Feature activation+2.91e+00

Pos logit contributions
wered+0.221
 chores+0.213
 temptation+0.206
 hurricane+0.193
 same+0.192
Neg logit contributions
o-0.465
ne-0.452
enny-0.450
ppy-0.445
os-0.432
Token".
Token ID1911
Feature activation+6.22e+00

Pos logit contributions
ist+0.090
aron+0.090
o+0.090
enny+0.089
ne+0.089
Neg logit contributions
 chores-0.057
 temptation-0.055
 going-0.055
 same-0.052
wered-0.051
Token
Token ID198
Feature activation+2.69e+00

Pos logit contributions
ist+0.229
enny+0.227
ppy+0.225
aron+0.220
ouse+0.216
Neg logit contributions
 going-0.100
 chores-0.099
 same-0.095
 temptation-0.091
 top-0.090
Token
Token ID198
Feature activation+3.07e+00

Pos logit contributions
ist+0.099
ppy+0.095
enny+0.095
aron+0.093
ouse+0.091
Neg logit contributions
 going-0.043
 chores-0.043
 top-0.040
 same-0.040
 good-0.038
TokenBob
Token ID18861
Feature activation+1.45e+01

Pos logit contributions
enny+0.082
ist+0.082
ppy+0.078
ouse+0.077
aron+0.076
Neg logit contributions
 chores-0.079
 going-0.079
 same-0.076
 top-0.076
 temptation-0.075
Token so
Token ID523
Feature activation-7.86e+00

Pos logit contributions
 chores+0.111
 going+0.109
 same+0.103
 temptation+0.102
 rain+0.096
Neg logit contributions
aron-0.181
ist-0.180
enny-0.178
ne-0.176
o-0.174
Token they
Token ID484
Feature activation+3.53e+00

Pos logit contributions
 going+0.118
 chores+0.118
 same+0.113
 temptation+0.103
 rain+0.096
Neg logit contributions
ist-0.283
aron-0.281
enny-0.281
ne-0.280
jo-0.274
Token wouldn
Token ID3636
Feature activation-6.87e+00

Pos logit contributions
aron+0.094
enny+0.092
ist+0.091
ne+0.089
o+0.087
Neg logit contributions
 going-0.093
 chores-0.093
 same-0.087
 temptation-0.084
 rain-0.084
Token't
Token ID470
Feature activation-7.94e+00

Pos logit contributions
 chores+0.078
 going+0.077
 same+0.067
 temptation+0.064
 rain+0.059
Neg logit contributions
aron-0.282
ist-0.279
enny-0.278
ne-0.275
o-0.272
Token have
Token ID423
Feature activation-9.91e+00

Pos logit contributions
 chores+0.114
 going+0.108
 same+0.101
 temptation+0.097
 rain+0.089
Neg logit contributions
aron-0.301
ist-0.299
ne-0.297
enny-0.296
o-0.292
Token gotten
Token ID7891
Feature activation-1.40e+01

Pos logit contributions
 chores+0.197
 temptation+0.186
 same+0.185
wered+0.179
 going+0.179
Neg logit contributions
o-0.296
ne-0.296
aron-0.293
ist-0.289
enny-0.288
Token in
Token ID287
Feature activation-1.06e+01

Pos logit contributions
 chores+0.254
wered+0.245
 temptation+0.231
 same+0.229
 going+0.219
Neg logit contributions
o-0.452
ne-0.436
ist-0.429
aron-0.428
enny-0.423
Token trouble
Token ID5876
Feature activation-3.70e+00

Pos logit contributions
 chores+0.126
wered+0.123
 same+0.118
 going+0.115
 temptation+0.107
Neg logit contributions
aron-0.376
ne-0.365
ist-0.362
es-0.361
o-0.354
Token.
Token ID13
Feature activation+4.21e+00

Pos logit contributions
 chores+0.095
 going+0.093
 same+0.090
 temptation+0.088
 rain+0.084
Neg logit contributions
aron-0.098
ist-0.096
enny-0.095
ne-0.094
o-0.093
Token From
Token ID3574
Feature activation-4.69e+00

Pos logit contributions
ist+0.139
aron+0.137
ppy+0.133
ne+0.133
enny+0.133
Neg logit contributions
 chores-0.081
 going-0.079
 same-0.077
 temptation-0.074
 rain-0.071
Token then
Token ID788
Feature activation+1.97e-01

Pos logit contributions
 chores+0.098
 going+0.093
 temptation+0.090
 same+0.089
 rain+0.082
Neg logit contributions
ist-0.146
aron-0.146
ne-0.144
o-0.142
enny-0.141
Token,
Token ID11
Feature activation-4.36e+00

Pos logit contributions
 chores+0.086
 going+0.084
 same+0.081
 temptation+0.080
 rain+0.076
Neg logit contributions
aron-0.085
ist-0.084
enny-0.084
ne-0.083
o-0.081
Token so
Token ID523
Feature activation-6.46e+00

Pos logit contributions
 chores+0.110
 going+0.101
 same+0.101
 temptation+0.099
wered+0.093
Neg logit contributions
aron-0.117
ist-0.114
enny-0.114
ne-0.113
o-0.112
Token he
Token ID339
Feature activation+5.70e+00

Pos logit contributions
 chores+0.106
 going+0.099
 same+0.094
 temptation+0.092
 rain+0.085
Neg logit contributions
enny-0.227
ist-0.226
ne-0.225
o-0.224
aron-0.224
Token could
Token ID714
Feature activation-5.88e+00

Pos logit contributions
aron+0.167
enny+0.167
ist+0.165
ne+0.161
umping+0.158
Neg logit contributions
 chores-0.131
 going-0.131
 same-0.124
 temptation-0.117
 rain-0.116
Token avoid
Token ID3368
Feature activation-8.45e+00

Pos logit contributions
 chores+0.083
 going+0.078
 same+0.076
 temptation+0.076
 rain+0.068
Neg logit contributions
aron-0.215
ist-0.215
o-0.212
ne-0.211
enny-0.210
Token getting
Token ID1972
Feature activation-1.39e+01

Pos logit contributions
 chores+0.109
wered+0.093
 same+0.090
 temptation+0.089
 going+0.087
Neg logit contributions
ne-0.301
aron-0.300
o-0.300
ist-0.298
jo-0.298
Token another
Token ID1194
Feature activation-2.95e+00

Pos logit contributions
 chores+0.247
 temptation+0.224
wered+0.223
 going+0.221
 same+0.220
Neg logit contributions
o-0.457
ist-0.440
ne-0.433
os-0.432
aron-0.428
Token band
Token ID4097
Feature activation+3.78e+00

Pos logit contributions
 chores+0.039
 going+0.037
 same+0.033
 temptation+0.032
wered+0.030
Neg logit contributions
ist-0.113
aron-0.112
ne-0.112
enny-0.110
o-0.109
Tokenage
Token ID496
Feature activation+2.43e+00

Pos logit contributions
aron+0.086
enny+0.086
ne+0.081
ist+0.080
ppy+0.080
Neg logit contributions
 going-0.114
 chores-0.113
 same-0.107
 temptation-0.105
 rain-0.102
Token.
Token ID13
Feature activation+4.30e+00

Pos logit contributions
aron+0.073
ist+0.073
enny+0.072
ne+0.071
o+0.070
Neg logit contributions
 chores-0.053
 going-0.051
 same-0.049
 temptation-0.048
 rain-0.046
Token<|endoftext|>
Token ID50256
Feature activation+8.55e+00

Pos logit contributions
ist+0.146
aron+0.141
ne+0.138
ppy+0.138
enny+0.136
Neg logit contributions
 chores-0.080
 going-0.079
 same-0.075
 temptation-0.072
 rain-0.069
Token,
Token ID11
Feature activation-3.51e+00

Pos logit contributions
aron+0.041
enny+0.041
ist+0.040
ne+0.040
o+0.040
Neg logit contributions
 chores-0.034
 going-0.034
 same-0.032
 temptation-0.031
 rain-0.030
Token but
Token ID475
Feature activation-2.19e+00

Pos logit contributions
 chores+0.093
 going+0.085
 same+0.084
 temptation+0.084
wered+0.081
Neg logit contributions
aron-0.092
ist-0.089
ne-0.083
o-0.083
ppy-0.082
Token it
Token ID340
Feature activation+5.33e+00

Pos logit contributions
 chores+0.037
 going+0.035
 same+0.034
 temptation+0.034
 rain+0.031
Neg logit contributions
enny-0.074
ist-0.074
o-0.074
ne-0.073
aron-0.073
Token still
Token ID991
Feature activation-4.61e+00

Pos logit contributions
aron+0.155
enny+0.151
ne+0.147
ist+0.146
jo+0.142
Neg logit contributions
 chores-0.126
 going-0.126
 same-0.120
 rain-0.114
 temptation-0.112
Token kept
Token ID4030
Feature activation-1.22e+01

Pos logit contributions
 chores+0.119
 going+0.117
 same+0.112
 temptation+0.110
 rain+0.105
Neg logit contributions
aron-0.122
ist-0.121
enny-0.120
ne-0.118
o-0.116
Token getting
Token ID1972
Feature activation-1.39e+01

Pos logit contributions
 chores+0.174
wered+0.160
 same+0.160
 temptation+0.157
 urge+0.144
Neg logit contributions
o-0.424
ist-0.421
aron-0.414
ppy-0.395
os-0.393
Token in
Token ID287
Feature activation-1.08e+01

Pos logit contributions
 chores+0.237
 temptation+0.228
wered+0.221
 same+0.216
 going+0.216
Neg logit contributions
o-0.446
ist-0.431
ppy-0.431
ne-0.424
enny-0.424
Token the
Token ID262
Feature activation-5.51e+00

Pos logit contributions
 chores+0.164
 going+0.153
 same+0.149
wered+0.149
 temptation+0.139
Neg logit contributions
aron-0.372
ist-0.346
ne-0.340
o-0.336
oc-0.330
Token way
Token ID835
Feature activation-3.95e+00

Pos logit contributions
 going+0.085
 chores+0.081
wered+0.070
 temptation+0.069
 her+0.065
Neg logit contributions
aron-0.191
ne-0.183
ist-0.182
ney-0.178
enny-0.178
Token.
Token ID13
Feature activation+4.40e+00

Pos logit contributions
 chores+0.095
 going+0.090
 same+0.087
 temptation+0.086
 rain+0.082
Neg logit contributions
ist-0.110
enny-0.109
aron-0.108
ne-0.107
o-0.107
Token Suddenly
Token ID24975
Feature activation-9.61e-01

Pos logit contributions
ist+0.149
aron+0.145
ne+0.142
ppy+0.142
enny+0.140
Neg logit contributions
 chores-0.083
 going-0.081
 same-0.076
 temptation-0.074
 top-0.071
Token the
Token ID262
Feature activation-3.76e+00

Pos logit contributions
 chores+0.153
 temptation+0.138
 going+0.133
 same+0.132
wered+0.121
Neg logit contributions
aron-0.345
ist-0.345
ne-0.344
o-0.339
os-0.335
Token wet
Token ID9583
Feature activation+1.36e+00

Pos logit contributions
 chores+0.060
 going+0.059
 temptation+0.054
 same+0.050
wered+0.046
Neg logit contributions
aron-0.133
ist-0.132
ne-0.132
enny-0.129
o-0.128
Token p
Token ID279
Feature activation+3.15e+00

Pos logit contributions
aron+0.048
ist+0.047
enny+0.047
ne+0.047
o+0.046
Neg logit contributions
 chores-0.023
 going-0.022
 same-0.021
 temptation-0.020
 rain-0.019
Tokenuddle
Token ID24500
Feature activation+5.38e+00

Pos logit contributions
aron+0.056
enny+0.051
jo+0.051
ne+0.051
ppy+0.051
Neg logit contributions
 chores-0.109
 same-0.109
 going-0.106
 rain-0.105
 temptation-0.102
Token without
Token ID1231
Feature activation-1.15e+01

Pos logit contributions
aron+0.146
ist+0.144
enny+0.144
ne+0.141
o+0.139
Neg logit contributions
 chores-0.135
 going-0.133
 same-0.127
 temptation-0.124
 rain-0.120
Token getting
Token ID1972
Feature activation-1.39e+01

Pos logit contributions
 chores+0.165
 same+0.154
wered+0.142
 hurricane+0.140
 temptation+0.134
Neg logit contributions
aron-0.393
o-0.392
ppy-0.390
ist-0.385
ne-0.376
Token wet
Token ID9583
Feature activation-2.53e+00

Pos logit contributions
 chores+0.261
 temptation+0.239
wered+0.238
 going+0.227
 same+0.226
Neg logit contributions
o-0.441
enny-0.413
ist-0.412
ne-0.410
os-0.410
Token.
Token ID13
Feature activation+4.51e+00

Pos logit contributions
 chores+0.066
 going+0.064
 same+0.061
 temptation+0.061
 rain+0.057
Neg logit contributions
aron-0.066
ist-0.065
enny-0.064
ne-0.064
o-0.062
Token
Token ID198
Feature activation+1.84e+00

Pos logit contributions
ist+0.149
aron+0.148
enny+0.144
ne+0.144
ppy+0.143
Neg logit contributions
 chores-0.086
 going-0.085
 same-0.080
 temptation-0.078
 rain-0.075
Token
Token ID198
Feature activation+2.05e+00

Pos logit contributions
ist+0.063
aron+0.062
ppy+0.060
ne+0.059
enny+0.059
Neg logit contributions
 chores-0.034
 going-0.033
 same-0.031
 temptation-0.030
 rain-0.029
TokenMax
Token ID11518
Feature activation+1.42e+01

Pos logit contributions
ist+0.058
aron+0.058
enny+0.055
ne+0.055
ppy+0.055
Neg logit contributions
 chores-0.049
 going-0.048
 same-0.046
 temptation-0.045
 rain-0.044
Token sing
Token ID1702
Feature activation-1.33e+00

Pos logit contributions
 chores+0.137
 going+0.127
 temptation+0.126
 same+0.123
 trouble+0.110
Neg logit contributions
ist-0.260
aron-0.255
ne-0.252
o-0.248
ppy-0.248
Token and
Token ID290
Feature activation-6.46e+00

Pos logit contributions
 going+0.028
 chores+0.028
 same+0.026
 rain+0.025
 temptation+0.025
Neg logit contributions
enny-0.042
aron-0.041
ne-0.041
ist-0.041
itty-0.040
Token the
Token ID262
Feature activation-1.87e+00

Pos logit contributions
 chores+0.124
 going+0.112
 temptation+0.111
 same+0.108
wered+0.098
Neg logit contributions
ist-0.209
ne-0.208
o-0.208
aron-0.206
os-0.203
Token branch
Token ID8478
Feature activation+6.02e+00

Pos logit contributions
 chores+0.035
 going+0.034
 temptation+0.030
 same+0.029
wered+0.028
Neg logit contributions
ne-0.061
aron-0.060
ist-0.060
o-0.059
itty-0.058
Token kept
Token ID4030
Feature activation-1.22e+01

Pos logit contributions
enny+0.190
aron+0.189
ne+0.180
ist+0.180
ppy+0.179
Neg logit contributions
 going-0.128
 chores-0.126
 same-0.122
 rain-0.115
 temptation-0.112
Token getting
Token ID1972
Feature activation-1.39e+01

Pos logit contributions
 chores+0.223
 temptation+0.197
 same+0.186
wered+0.179
 laundry+0.171
Neg logit contributions
o-0.401
aron-0.397
ist-0.395
os-0.380
es-0.375
Token bigger
Token ID5749
Feature activation-2.15e+00

Pos logit contributions
 chores+0.253
 temptation+0.232
 going+0.217
 same+0.214
wered+0.207
Neg logit contributions
o-0.452
ist-0.450
aron-0.448
ne-0.446
os-0.434
Token and
Token ID290
Feature activation-6.73e+00

Pos logit contributions
 going+0.049
 chores+0.047
 same+0.045
 rain+0.044
 temptation+0.043
Neg logit contributions
enny-0.067
ist-0.062
o-0.062
aron-0.062
itty-0.061
Token stronger
Token ID7387
Feature activation-1.85e+00

Pos logit contributions
 chores+0.137
 temptation+0.116
 going+0.111
wered+0.108
 same+0.102
Neg logit contributions
ist-0.212
ne-0.209
o-0.207
aron-0.204
ppy-0.199
Token.
Token ID13
Feature activation+4.30e+00

Pos logit contributions
 going+0.043
 chores+0.042
 same+0.040
 temptation+0.039
 rain+0.038
Neg logit contributions
enny-0.055
aron-0.054
ist-0.053
ne-0.052
o-0.052
Token Pretty
Token ID20090
Feature activation-4.03e+00

Pos logit contributions
aron+0.138
ist+0.136
enny+0.136
ne+0.134
o+0.132
Neg logit contributions
 chores-0.088
 going-0.086
 same-0.081
 temptation-0.079
 rain-0.075
Token crib
Token ID48083
Feature activation+8.80e+00

Pos logit contributions
 going+0.081
 chores+0.078
wered+0.077
 temptation+0.075
 same+0.071
Neg logit contributions
aron-0.109
ist-0.104
ne-0.101
jo-0.101
enny-0.099
Token to
Token ID284
Feature activation-8.22e+00

Pos logit contributions
aron+0.276
enny+0.276
ne+0.271
ist+0.271
o+0.265
Neg logit contributions
 chores-0.181
 going-0.177
 same-0.173
 temptation-0.161
 rain-0.156
Token prevent
Token ID2948
Feature activation-1.18e+01

Pos logit contributions
 chores+0.140
 going+0.134
 temptation+0.132
 same+0.122
 comparison+0.117
Neg logit contributions
aron-0.252
ist-0.249
ppy-0.247
ne-0.244
o-0.236
Token falling
Token ID7463
Feature activation-5.19e+00

Pos logit contributions
 chores+0.185
wered+0.164
 same+0.161
 temptation+0.149
 going+0.147
Neg logit contributions
o-0.406
ne-0.394
aron-0.391
ppy-0.384
ist-0.382
Token and
Token ID290
Feature activation-8.36e+00

Pos logit contributions
 chores+0.123
 going+0.114
 temptation+0.113
 same+0.111
wered+0.108
Neg logit contributions
aron-0.147
ist-0.145
o-0.142
ne-0.141
ppy-0.139
Token getting
Token ID1972
Feature activation-1.39e+01

Pos logit contributions
 chores+0.158
wered+0.136
 idea+0.135
 excitement+0.133
 same+0.132
Neg logit contributions
o-0.269
os-0.257
ist-0.254
ne-0.253
ppy-0.250
Token hurt
Token ID5938
Feature activation-5.52e+00

Pos logit contributions
 chores+0.260
wered+0.250
 temptation+0.243
 going+0.228
 same+0.226
Neg logit contributions
o-0.430
ppy-0.408
ne-0.407
ist-0.403
os-0.402
Token."
Token ID526
Feature activation+6.19e+00

Pos logit contributions
 going+0.140
 chores+0.137
 same+0.133
 temptation+0.129
 rain+0.127
Neg logit contributions
enny-0.150
aron-0.149
ist-0.147
ne-0.144
jo-0.142
Token Tim
Token ID5045
Feature activation+1.40e+01

Pos logit contributions
aron+0.222
ist+0.222
enny+0.219
ne+0.218
ppy+0.215
Neg logit contributions
 chores-0.102
 going-0.101
 same-0.093
 temptation-0.090
 rain-0.086
Token looked
Token ID3114
Feature activation-3.73e+00

Pos logit contributions
oing+0.412
rots+0.393
ne+0.387
ist+0.387
plets+0.386
Neg logit contributions
 her-0.329
 same-0.324
 first-0.303
 going-0.296
 other-0.292
Token at
Token ID379
Feature activation-9.15e+00

Pos logit contributions
 chores+0.077
 going+0.075
 same+0.071
 temptation+0.069
 rain+0.066
Neg logit contributions
aron-0.119
ist-0.117
enny-0.117
ne-0.116
o-0.114
Token or
Token ID393
Feature activation-7.64e+00

Pos logit contributions
aron+0.120
enny+0.118
ist+0.117
ne+0.116
o+0.114
Neg logit contributions
 chores-0.092
 going-0.090
 same-0.087
 temptation-0.082
 rain-0.081
Token use
Token ID779
Feature activation-9.27e+00

Pos logit contributions
 chores+0.116
 going+0.114
 temptation+0.111
 same+0.106
wered+0.099
Neg logit contributions
ist-0.269
ne-0.263
aron-0.262
ppy-0.260
o-0.256
Token it
Token ID340
Feature activation+3.01e+00

Pos logit contributions
 chores+0.116
 going+0.116
 temptation+0.108
 same+0.104
wered+0.098
Neg logit contributions
ist-0.354
aron-0.349
ne-0.345
enny-0.339
o-0.336
Token to
Token ID284
Feature activation-9.52e+00

Pos logit contributions
aron+0.095
enny+0.094
ist+0.092
ne+0.092
o+0.090
Neg logit contributions
 chores-0.063
 going-0.062
 same-0.059
 temptation-0.056
 rain-0.054
Token sit
Token ID1650
Feature activation-6.64e+00

Pos logit contributions
 going+0.142
 chores+0.140
 same+0.129
 temptation+0.125
 top+0.108
Neg logit contributions
ist-0.350
aron-0.344
ne-0.336
ppy-0.333
enny-0.329
Token with
Token ID351
Feature activation-1.38e+01

Pos logit contributions
 going+0.132
 chores+0.130
 same+0.122
 rain+0.113
 temptation+0.113
Neg logit contributions
enny-0.218
aron-0.215
ne-0.213
jo-0.210
ist-0.210
Token your
Token ID534
Feature activation-3.54e+00

Pos logit contributions
 chores+0.195
 going+0.190
 temptation+0.170
 same+0.169
wered+0.153
Neg logit contributions
ist-0.515
aron-0.515
ne-0.502
enny-0.499
ppy-0.494
Token friends
Token ID2460
Feature activation+5.00e+00

Pos logit contributions
 chores+0.064
 going+0.063
 same+0.058
 temptation+0.057
 rain+0.054
Neg logit contributions
aron-0.120
ist-0.119
enny-0.118
ne-0.117
o-0.115
Token."
Token ID526
Feature activation+5.86e+00

Pos logit contributions
enny+0.155
aron+0.151
jo+0.148
itty+0.148
umping+0.147
Neg logit contributions
 chores-0.108
 going-0.107
 same-0.099
 rain-0.091
 her-0.091
Token
Token ID198
Feature activation+2.30e+00

Pos logit contributions
ist+0.194
aron+0.188
ppy+0.187
enny+0.186
ne+0.184
Neg logit contributions
 going-0.114
 chores-0.114
 same-0.106
 temptation-0.104
 her-0.101
Token
Token ID198
Feature activation+2.70e+00

Pos logit contributions
ist+0.080
ppy+0.078
enny+0.077
aron+0.077
ne+0.076
Neg logit contributions
 chores-0.041
 going-0.040
 same-0.037
 her-0.036
 temptation-0.036
Token,
Token ID11
Feature activation-3.77e+00

Pos logit contributions
 chores+0.047
 going+0.047
 same+0.044
 temptation+0.043
 rain+0.042
Neg logit contributions
aron-0.049
enny-0.048
ist-0.048
ne-0.047
o-0.046
Token and
Token ID290
Feature activation-4.90e+00

Pos logit contributions
 chores+0.097
 going+0.093
 same+0.092
 temptation+0.091
 top+0.085
Neg logit contributions
aron-0.098
ist-0.098
ne-0.094
enny-0.094
o-0.092
Token they
Token ID484
Feature activation+2.88e+00

Pos logit contributions
 chores+0.089
 temptation+0.082
 same+0.080
 going+0.079
wered+0.077
Neg logit contributions
ist-0.158
ne-0.158
enny-0.157
aron-0.156
o-0.155
Token kept
Token ID4030
Feature activation-1.19e+01

Pos logit contributions
aron+0.096
enny+0.094
ist+0.093
ne+0.092
o+0.090
Neg logit contributions
 chores-0.057
 going-0.056
 same-0.053
 rain-0.049
 temptation-0.049
Token on
Token ID319
Feature activation-1.14e+01

Pos logit contributions
wered+0.177
 temptation+0.177
 chores+0.174
 hurricane+0.173
 same+0.164
Neg logit contributions
o-0.388
es-0.383
ist-0.382
aron-0.378
os-0.371
Token getting
Token ID1972
Feature activation-1.38e+01

Pos logit contributions
 temptation+0.176
wered+0.176
 same+0.175
 hurricane+0.171
 chores+0.167
Neg logit contributions
es-0.368
aron-0.363
ist-0.359
enny-0.358
umping-0.357
Token better
Token ID1365
Feature activation-5.74e+00

Pos logit contributions
 chores+0.198
 temptation+0.195
 going+0.191
 same+0.185
wered+0.185
Neg logit contributions
enny-0.486
ist-0.475
o-0.475
os-0.471
aron-0.471
Token and
Token ID290
Feature activation-7.43e+00

Pos logit contributions
 chores+0.143
 going+0.141
 same+0.134
 temptation+0.131
 rain+0.126
Neg logit contributions
aron-0.158
enny-0.157
ist-0.156
ne-0.154
o-0.153
Token better
Token ID1365
Feature activation-4.35e+00

Pos logit contributions
 chores+0.143
wered+0.132
 same+0.127
 temptation+0.124
 top+0.122
Neg logit contributions
ist-0.224
ne-0.218
o-0.217
enny-0.215
aron-0.214
Token.
Token ID13
Feature activation+4.03e+00

Pos logit contributions
 chores+0.115
 going+0.111
 same+0.109
 temptation+0.107
wered+0.104
Neg logit contributions
aron-0.110
ist-0.108
enny-0.106
ne-0.106
o-0.103
Token
Token ID198
Feature activation+1.57e+00

Pos logit contributions
ist+0.140
aron+0.137
ppy+0.132
ne+0.131
enny+0.129
Neg logit contributions
 chores-0.075
 going-0.073
 same-0.070
 temptation-0.067
 her-0.064
Token't
Token ID470
Feature activation-8.73e+00

Pos logit contributions
 chores+0.105
 going+0.097
 same+0.094
 temptation+0.092
 rain+0.080
Neg logit contributions
enny-0.273
ne-0.273
ist-0.271
aron-0.269
o-0.269
Token be
Token ID307
Feature activation-9.99e+00

Pos logit contributions
 chores+0.151
 going+0.126
 temptation+0.124
 same+0.122
wered+0.121
Neg logit contributions
ist-0.298
ne-0.290
ppy-0.286
o-0.280
aron-0.279
Token fair
Token ID3148
Feature activation-9.59e-01

Pos logit contributions
 chores+0.156
 going+0.146
 same+0.138
 temptation+0.136
 rain+0.124
Neg logit contributions
aron-0.366
ist-0.364
enny-0.358
ne-0.358
o-0.356
Token to
Token ID284
Feature activation-9.10e+00

Pos logit contributions
 going+0.021
 chores+0.021
 same+0.020
 temptation+0.019
 rain+0.018
Neg logit contributions
aron-0.029
enny-0.029
ist-0.028
ne-0.028
itty-0.028
Token take
Token ID1011
Feature activation-1.20e+01

Pos logit contributions
 chores+0.147
 going+0.134
 temptation+0.122
 same+0.120
wered+0.109
Neg logit contributions
ne-0.318
ist-0.317
o-0.310
aron-0.310
ppy-0.308
Token from
Token ID422
Feature activation-1.38e+01

Pos logit contributions
 chores+0.166
 going+0.163
 same+0.144
 temptation+0.140
wered+0.131
Neg logit contributions
aron-0.452
ist-0.448
ne-0.446
enny-0.441
o-0.436
Token the
Token ID262
Feature activation-5.43e+00

Pos logit contributions
 chores+0.196
 going+0.163
wered+0.160
 temptation+0.159
 same+0.155
Neg logit contributions
o-0.480
ist-0.480
ne-0.474
aron-0.467
ppy-0.467
Token generous
Token ID14431
Feature activation+1.70e+00

Pos logit contributions
 chores+0.097
 going+0.094
wered+0.081
 their+0.079
 temptation+0.078
Neg logit contributions
ne-0.173
aron-0.171
itty-0.170
ist-0.168
pillar-0.167
Token tree
Token ID5509
Feature activation+7.65e+00

Pos logit contributions
aron+0.054
ist+0.053
enny+0.053
ne+0.053
o+0.052
Neg logit contributions
 chores-0.035
 going-0.034
 same-0.032
 temptation-0.031
 rain-0.030
Token.
Token ID13
Feature activation+3.57e+00

Pos logit contributions
aron+0.223
enny+0.219
ist+0.216
ne+0.214
ppy+0.209
Neg logit contributions
 chores-0.179
 going-0.177
 same-0.169
 temptation-0.162
 rain-0.157
Token Now
Token ID2735
Feature activation+2.06e-01

Pos logit contributions
ist+0.118
aron+0.116
enny+0.114
ppy+0.114
ne+0.113
Neg logit contributions
 going-0.067
 chores-0.064
 same-0.062
 temptation-0.061
 rain-0.056
Token't
Token ID470
Feature activation-7.20e+00

Pos logit contributions
 chores+0.132
 temptation+0.131
 same+0.130
 going+0.130
 laundry+0.119
Neg logit contributions
ne-0.214
enny-0.210
ist-0.208
os-0.205
o-0.205
Token.
Token ID13
Feature activation+4.55e+00

Pos logit contributions
 chores+0.104
 going+0.090
 same+0.090
 top+0.086
 temptation+0.082
Neg logit contributions
ist-0.263
ppy-0.259
aron-0.258
ne-0.255
o-0.243
Token It
Token ID632
Feature activation+5.03e+00

Pos logit contributions
ist+0.159
aron+0.154
ne+0.151
ppy+0.151
enny+0.148
Neg logit contributions
 chores-0.077
 going-0.077
 same-0.074
 temptation-0.071
 rain-0.068
Token just
Token ID655
Feature activation-3.35e+00

Pos logit contributions
enny+0.143
aron+0.142
ist+0.139
ne+0.137
ppy+0.133
Neg logit contributions
 going-0.121
 chores-0.120
 same-0.116
 rain-0.111
 temptation-0.110
Token kept
Token ID4030
Feature activation-1.19e+01

Pos logit contributions
 chores+0.087
 going+0.082
 same+0.079
 temptation+0.079
 rain+0.073
Neg logit contributions
ist-0.090
aron-0.090
ne-0.087
o-0.086
ppy-0.086
Token getting
Token ID1972
Feature activation-1.37e+01

Pos logit contributions
 chores+0.197
 temptation+0.180
 same+0.174
wered+0.173
 urge+0.158
Neg logit contributions
o-0.408
ist-0.394
aron-0.387
os-0.377
ppy-0.373
Token deeper
Token ID9211
Feature activation-5.04e+00

Pos logit contributions
 chores+0.230
 temptation+0.226
wered+0.208
 same+0.206
 going+0.202
Neg logit contributions
o-0.458
ist-0.437
ppy-0.435
aron-0.431
os-0.427
Token and
Token ID290
Feature activation-7.30e+00

Pos logit contributions
 chores+0.133
 going+0.131
 same+0.125
 temptation+0.123
 rain+0.118
Neg logit contributions
aron-0.130
ist-0.128
enny-0.128
ne-0.126
o-0.124
Token deeper
Token ID9211
Feature activation-4.02e+00

Pos logit contributions
 chores+0.162
 temptation+0.138
 going+0.135
wered+0.133
 same+0.131
Neg logit contributions
o-0.219
ist-0.210
ne-0.203
os-0.203
ppy-0.201
Token.
Token ID13
Feature activation+4.29e+00

Pos logit contributions
 chores+0.098
 going+0.096
 same+0.092
 temptation+0.090
 rain+0.086
Neg logit contributions
aron-0.112
ist-0.111
enny-0.109
ne-0.108
o-0.106
Token Eventually
Token ID16178
Feature activation-1.12e+00

Pos logit contributions
ist+0.140
aron+0.139
ne+0.135
enny+0.134
ppy+0.133
Neg logit contributions
 chores-0.084
 going-0.083
 same-0.078
 temptation-0.076
 rain-0.073
Token sharing
Token ID7373
Feature activation-3.24e+00

Pos logit contributions
 chores+0.057
 same+0.056
wered+0.052
 temptation+0.051
 avail+0.050
Neg logit contributions
o-0.085
enny-0.085
ne-0.083
os-0.081
es-0.079
Token toys
Token ID14958
Feature activation+3.57e+00

Pos logit contributions
 chores+0.075
 going+0.074
 same+0.070
 temptation+0.068
 rain+0.066
Neg logit contributions
aron-0.094
ist-0.093
enny-0.093
ne-0.092
o-0.090
Token was
Token ID373
Feature activation-1.06e+01

Pos logit contributions
aron+0.099
enny+0.097
ist+0.096
jo+0.096
ne+0.095
Neg logit contributions
 chores-0.089
 going-0.088
 same-0.085
 temptation-0.080
 rain-0.079
Token better
Token ID1365
Feature activation-4.90e+00

Pos logit contributions
wered+0.155
 chores+0.153
 temptation+0.149
 top+0.141
 same+0.136
Neg logit contributions
ist-0.351
aron-0.350
o-0.347
enny-0.335
ppy-0.333
Token than
Token ID621
Feature activation-8.97e+00

Pos logit contributions
 going+0.135
 chores+0.134
 same+0.129
 temptation+0.126
 good+0.122
Neg logit contributions
aron-0.124
ist-0.122
enny-0.120
ne-0.119
jo-0.119
Token getting
Token ID1972
Feature activation-1.37e+01

Pos logit contributions
 chores+0.164
 top+0.145
wered+0.145
 temptation+0.142
 urge+0.139
Neg logit contributions
enny-0.279
ne-0.275
ist-0.273
o-0.272
ppy-0.271
Token angry
Token ID7954
Feature activation-2.34e+00

Pos logit contributions
 chores+0.207
 going+0.197
wered+0.190
 temptation+0.185
 same+0.177
Neg logit contributions
ist-0.482
enny-0.479
ne-0.475
ppy-0.474
aron-0.474
Token or
Token ID393
Feature activation-7.32e+00

Pos logit contributions
 going+0.055
 chores+0.055
 same+0.052
 temptation+0.051
 rain+0.049
Neg logit contributions
aron-0.067
enny-0.066
ist-0.066
ne-0.065
o-0.064
Token hating
Token ID45701
Feature activation-6.16e+00

Pos logit contributions
 chores+0.126
wered+0.111
 top+0.107
 temptation+0.105
 same+0.102
Neg logit contributions
o-0.250
ist-0.243
ppy-0.241
ne-0.237
enny-0.234
Token anyone
Token ID2687
Feature activation+5.62e+00

Pos logit contributions
 chores+0.097
 going+0.096
 same+0.089
 temptation+0.087
 rain+0.080
Neg logit contributions
aron-0.220
ist-0.218
enny-0.216
ne-0.215
o-0.212
Token.
Token ID13
Feature activation+4.24e+00

Pos logit contributions
aron+0.146
enny+0.142
ist+0.141
jo+0.135
ne+0.135
Neg logit contributions
 going-0.154
 chores-0.151
 same-0.144
 temptation-0.139
 rain-0.137
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
 chores+0.035
 going+0.031
 same+0.030
 temptation+0.030
 rain+0.026
Neg logit contributions
enny-0.081
ist-0.081
o-0.080
ne-0.080
os-0.079
Token magical
Token ID10883
Feature activation+3.40e+00

Pos logit contributions
 chores+0.041
 going+0.040
 their+0.038
wered+0.037
 comparison+0.035
Neg logit contributions
pillar-0.040
ppy-0.039
itty-0.038
ne-0.038
aron-0.038
Token force
Token ID2700
Feature activation+2.27e+00

Pos logit contributions
aron+0.105
ist+0.103
ne+0.102
enny+0.101
o+0.100
Neg logit contributions
 chores-0.074
 going-0.072
 same-0.067
 temptation-0.066
 rain-0.062
Token just
Token ID655
Feature activation-3.24e+00

Pos logit contributions
aron+0.063
enny+0.061
ist+0.061
ne+0.060
o+0.059
Neg logit contributions
 going-0.056
 chores-0.056
 same-0.053
 temptation-0.051
 rain-0.050
Token kept
Token ID4030
Feature activation-1.19e+01

Pos logit contributions
 chores+0.079
 going+0.073
 same+0.072
 temptation+0.071
 rain+0.067
Neg logit contributions
ist-0.093
aron-0.090
ne-0.089
ppy-0.088
enny-0.088
Token getting
Token ID1972
Feature activation-1.37e+01

Pos logit contributions
 chores+0.186
 same+0.177
wered+0.168
 temptation+0.158
 hurricane+0.154
Neg logit contributions
o-0.404
ist-0.397
aron-0.388
iled-0.381
es-0.377
Token stronger
Token ID7387
Feature activation-3.06e+00

Pos logit contributions
 chores+0.255
wered+0.238
 temptation+0.232
 same+0.232
 going+0.220
Neg logit contributions
o-0.429
ppy-0.396
os-0.394
ne-0.393
ist-0.392
Token!
Token ID0
Feature activation+3.72e+00

Pos logit contributions
 chores+0.073
 going+0.073
 same+0.068
 temptation+0.067
 rain+0.064
Neg logit contributions
enny-0.087
aron-0.086
ist-0.086
o-0.084
ne-0.084
Token
Token ID220
Feature activation+3.61e+00

Pos logit contributions
ist+0.132
aron+0.131
ppy+0.127
ne+0.126
enny+0.125
Neg logit contributions
 chores-0.062
 going-0.061
 same-0.059
 temptation-0.056
 top-0.055
Token
Token ID198
Feature activation+2.64e+00

Pos logit contributions
ist+0.122
aron+0.121
ne+0.118
ppy+0.117
jo+0.116
Neg logit contributions
 chores-0.068
 going-0.066
 same-0.061
 temptation-0.056
 top-0.056
Token
Token ID198
Feature activation+3.07e+00

Pos logit contributions
ist+0.093
aron+0.090
ppy+0.089
ne+0.087
owed+0.086
Neg logit contributions
 chores-0.046
 going-0.044
 same-0.042
 top-0.041
 temptation-0.039
Token!
Token ID0
Feature activation+2.57e+00

Pos logit contributions
 chores+0.076
 going+0.075
 same+0.070
 temptation+0.067
 rain+0.064
Neg logit contributions
aron-0.152
enny-0.150
ist-0.150
ne-0.148
o-0.146
Token We
Token ID775
Feature activation-4.07e-01

Pos logit contributions
aron+0.079
ist+0.078
enny+0.078
ne+0.077
o+0.075
Neg logit contributions
 chores-0.055
 going-0.054
 same-0.051
 temptation-0.050
 rain-0.048
Token can
Token ID460
Feature activation-5.52e+00

Pos logit contributions
 chores+0.009
 going+0.009
 same+0.009
 temptation+0.008
 rain+0.008
Neg logit contributions
aron-0.012
ist-0.012
enny-0.012
ne-0.012
o-0.012
Token start
Token ID923
Feature activation-6.80e+00

Pos logit contributions
 chores+0.072
 going+0.070
 temptation+0.066
 same+0.066
 rain+0.058
Neg logit contributions
ist-0.212
aron-0.208
ne-0.206
enny-0.205
ppy-0.201
Token by
Token ID416
Feature activation-7.00e+00

Pos logit contributions
 chores+0.105
 temptation+0.097
 going+0.097
 same+0.095
 rain+0.086
Neg logit contributions
ist-0.245
aron-0.244
ne-0.240
o-0.238
enny-0.238
Token getting
Token ID1972
Feature activation-1.37e+01

Pos logit contributions
 chores+0.108
 going+0.100
 same+0.097
 temptation+0.096
wered+0.090
Neg logit contributions
ist-0.251
aron-0.245
ne-0.241
enny-0.235
es-0.233
Token a
Token ID257
Feature activation-4.97e+00

Pos logit contributions
wered+0.229
 temptation+0.225
 chores+0.214
 urge+0.208
 going+0.201
Neg logit contributions
o-0.432
ist-0.431
os-0.428
ne-0.427
enny-0.424
Token more
Token ID517
Feature activation-6.29e+00

Pos logit contributions
 chores+0.066
 going+0.066
 temptation+0.060
 same+0.056
wered+0.054
Neg logit contributions
aron-0.187
ist-0.187
ne-0.185
enny-0.184
o-0.180
Token comfortable
Token ID6792
Feature activation-1.38e+00

Pos logit contributions
 chores+0.082
 going+0.081
 same+0.072
 temptation+0.072
wered+0.067
Neg logit contributions
aron-0.243
ist-0.239
ne-0.235
jo-0.231
enny-0.229
Token seat
Token ID5852
Feature activation+3.24e+00

Pos logit contributions
 chores+0.020
 going+0.019
 same+0.018
 temptation+0.017
 rain+0.016
Neg logit contributions
aron-0.052
ist-0.051
ne-0.050
enny-0.050
jo-0.049
Token!"
Token ID2474
Feature activation+5.34e+00

Pos logit contributions
enny+0.100
aron+0.098
ne+0.095
ist+0.095
ppy+0.095
Neg logit contributions
 going-0.071
 chores-0.071
 same-0.066
 temptation-0.061
 work-0.060
Token gave
Token ID2921
Feature activation-7.96e+00

Pos logit contributions
aron+0.468
enny+0.465
ppy+0.454
ist+0.449
umping+0.434
Neg logit contributions
 going-0.350
 chores-0.308
 same-0.302
 her-0.294
 rain-0.292
Token Joe
Token ID5689
Feature activation+1.29e+01

Pos logit contributions
 chores+0.125
 going+0.113
 temptation+0.105
 same+0.104
 rain+0.097
Neg logit contributions
o-0.293
ist-0.288
aron-0.288
enny-0.288
ne-0.286
Token a
Token ID257
Feature activation-2.16e+00

Pos logit contributions
aron+0.605
ist+0.601
enny+0.596
ne+0.588
jo+0.579
Neg logit contributions
 chores-0.084
 going-0.072
 same-0.066
 temptation-0.043
 her-0.042
Token poem
Token ID21247
Feature activation+8.26e+00

Pos logit contributions
 going+0.034
 chores+0.034
 temptation+0.031
 same+0.030
 rain+0.028
Neg logit contributions
enny-0.076
aron-0.076
ist-0.075
ne-0.075
o-0.075
Token.
Token ID13
Feature activation+5.11e+00

Pos logit contributions
aron+0.255
enny+0.252
ist+0.252
ne+0.249
ppy+0.245
Neg logit contributions
 going-0.174
 chores-0.174
 same-0.161
 temptation-0.154
 rain-0.151
Token Joe
Token ID5689
Feature activation+1.43e+01

Pos logit contributions
aron+0.166
ist+0.165
enny+0.163
ne+0.161
ppy+0.160
Neg logit contributions
 going-0.101
 chores-0.100
 same-0.095
 temptation-0.092
 rain-0.088
Token was
Token ID373
Feature activation-7.27e+00

Pos logit contributions
enny+0.462
aron+0.449
ist+0.447
ppy+0.434
umping+0.428
Neg logit contributions
 going-0.296
 same-0.272
 her-0.269
 the-0.258
 chores-0.258
Token so
Token ID523
Feature activation-5.84e+00

Pos logit contributions
 chores+0.135
 temptation+0.124
 same+0.115
 going+0.113
 rain+0.110
Neg logit contributions
aron-0.249
o-0.246
ist-0.240
ne-0.238
enny-0.236
Token happy
Token ID3772
Feature activation-1.04e+00

Pos logit contributions
 chores+0.113
 going+0.102
 temptation+0.101
 same+0.097
 rain+0.094
Neg logit contributions
aron-0.195
o-0.193
enny-0.189
ne-0.186
os-0.185
Token to
Token ID284
Feature activation-5.66e+00

Pos logit contributions
 going+0.025
 chores+0.023
 same+0.022
 temptation+0.022
 rain+0.021
Neg logit contributions
aron-0.031
ist-0.031
enny-0.030
ppy-0.030
ne-0.030
Token get
Token ID651
Feature activation-9.73e+00

Pos logit contributions
 chores+0.099
 going+0.090
 temptation+0.086
 same+0.084
 rain+0.074
Neg logit contributions
aron-0.196
ne-0.195
enny-0.193
ist-0.193
o-0.190
Token jungle
Token ID20712
Feature activation+8.49e+00

Pos logit contributions
 chores+0.062
 going+0.058
 temptation+0.055
 same+0.051
 her+0.049
Neg logit contributions
o-0.092
aron-0.091
ne-0.091
ist-0.089
os-0.088
Token."
Token ID526
Feature activation+6.63e+00

Pos logit contributions
enny+0.243
aron+0.243
ist+0.238
ppy+0.232
ne+0.231
Neg logit contributions
 going-0.206
 chores-0.195
 same-0.193
 temptation-0.180
 rain-0.178
Token
Token ID198
Feature activation+3.09e+00

Pos logit contributions
ist+0.261
ppy+0.256
enny+0.254
as+0.254
aron+0.243
Neg logit contributions
 going-0.092
 same-0.085
 top-0.083
 chores-0.081
 the-0.081
Token
Token ID198
Feature activation+3.49e+00

Pos logit contributions
ist+0.123
ppy+0.120
aron+0.118
umping+0.113
owed+0.113
Neg logit contributions
 big-0.038
 going-0.037
 chores-0.037
 same-0.036
 top-0.036
TokenThe
Token ID464
Feature activation+1.72e+00

Pos logit contributions
enny+0.121
ist+0.120
owed+0.119
aron+0.117
ppy+0.117
Neg logit contributions
 same-0.059
 top-0.059
 chores-0.059
 going-0.058
 rain-0.056
Token monkey
Token ID21657
Feature activation+1.55e+01

Pos logit contributions
o+0.038
ne+0.036
itty+0.036
aron+0.035
oc+0.034
Neg logit contributions
 chores-0.052
 going-0.049
 temptation-0.048
 same-0.044
 laundry-0.044
Token was
Token ID373
Feature activation-7.09e+00

Pos logit contributions
enny+0.542
aron+0.515
umping+0.502
itty+0.497
ppy+0.497
Neg logit contributions
 going-0.276
 top-0.255
 same-0.255
 rain-0.253
 her-0.251
Token very
Token ID845
Feature activation-5.34e+00

Pos logit contributions
 chores+0.146
 temptation+0.134
 same+0.123
 going+0.119
 rain+0.115
Neg logit contributions
aron-0.234
o-0.233
ist-0.220
ne-0.215
ppy-0.213
Token excited
Token ID6568
Feature activation-4.98e-01

Pos logit contributions
 chores+0.109
 going+0.094
 temptation+0.092
 same+0.089
 rain+0.086
Neg logit contributions
o-0.178
aron-0.173
ne-0.167
ist-0.167
enny-0.165
Token and
Token ID290
Feature activation-3.21e+00

Pos logit contributions
 going+0.011
 same+0.010
 chores+0.010
 her+0.009
 rain+0.009
Neg logit contributions
enny-0.016
aron-0.015
ist-0.015
ppy-0.015
itty-0.015
Token wanted
Token ID2227
Feature activation-6.48e+00

Pos logit contributions
 chores+0.084
 going+0.078
 temptation+0.077
 same+0.077
 rain+0.071
Neg logit contributions
aron-0.085
o-0.084
ne-0.083
ist-0.083
enny-0.083
Token was
Token ID373
Feature activation-5.86e+00

Pos logit contributions
aron+0.123
ist+0.122
enny+0.119
ne+0.118
ppy+0.117
Neg logit contributions
 going-0.109
 chores-0.108
 same-0.104
 temptation-0.098
 rain-0.098
Token a
Token ID257
Feature activation+1.82e-01

Pos logit contributions
 chores+0.080
 temptation+0.076
 going+0.068
 same+0.066
 hurricane+0.060
Neg logit contributions
o-0.219
aron-0.216
enny-0.214
ne-0.212
umping-0.210
Token duck
Token ID22045
Feature activation+1.35e+01

Pos logit contributions
aron+0.007
ist+0.007
enny+0.007
ne+0.007
os+0.007
Neg logit contributions
 chores-0.002
 going-0.002
 same-0.002
 temptation-0.002
 rain-0.002
Token named
Token ID3706
Feature activation+1.73e+00

Pos logit contributions
aron+0.378
enny+0.369
ist+0.352
jo+0.340
ne+0.340
Neg logit contributions
 going-0.352
 chores-0.328
 same-0.326
 rain-0.315
 her-0.309
Token D
Token ID360
Feature activation+1.17e+01

Pos logit contributions
o+0.052
aron+0.051
enny+0.051
ne+0.050
ppy+0.050
Neg logit contributions
 chores-0.039
 going-0.037
 temptation-0.036
 same-0.035
 rain-0.032
Tokenucky
Token ID5309
Feature activation+1.29e+01

Pos logit contributions
aron+0.318
ist+0.306
jo+0.289
os+0.285
es+0.285
Neg logit contributions
 same-0.293
 going-0.292
 chores-0.292
 rain-0.280
 temptation-0.277
Token.
Token ID13
Feature activation+6.10e+00

Pos logit contributions
enny+0.469
aron+0.453
ist+0.451
umping+0.448
os+0.441
Neg logit contributions
 going-0.223
 her-0.200
 same-0.199
 other-0.197
 chores-0.196
Token D
Token ID360
Feature activation+1.23e+01

Pos logit contributions
o+0.169
enny+0.164
aron+0.160
ppy+0.159
ne+0.155
Neg logit contributions
 chores-0.155
 going-0.144
 temptation-0.139
 same-0.135
 work-0.129
Tokenucky
Token ID5309
Feature activation+1.32e+01

Pos logit contributions
aron+0.341
ist+0.337
jo+0.328
ne+0.317
os+0.316
Neg logit contributions
 same-0.308
 going-0.302
 chores-0.302
 temptation-0.290
 rain-0.289
Token loved
Token ID6151
Feature activation-6.33e+00

Pos logit contributions
enny+0.474
ist+0.459
aron+0.443
umping+0.441
ne+0.431
Neg logit contributions
 going-0.242
 same-0.240
 good-0.239
 rain-0.233
 big-0.233
Token to
Token ID284
Feature activation-6.71e+00

Pos logit contributions
 chores+0.163
 temptation+0.163
 top+0.154
 avail+0.154
 same+0.151
Neg logit contributions
o-0.174
os-0.144
itty-0.138
ne-0.137
umping-0.134
Token She
Token ID1375
Feature activation+4.40e+00

Pos logit contributions
aron+0.126
enny+0.125
ist+0.125
ne+0.123
o+0.121
Neg logit contributions
 chores-0.075
 going-0.074
 same-0.069
 temptation-0.068
 rain-0.064
Token looked
Token ID3114
Feature activation-4.51e+00

Pos logit contributions
enny+0.137
ist+0.135
umping+0.132
os+0.132
aron+0.130
Neg logit contributions
 going-0.097
 chores-0.088
 same-0.088
 rain-0.087
 her-0.085
Token at
Token ID379
Feature activation-9.27e+00

Pos logit contributions
 chores+0.090
 going+0.088
 same+0.083
 temptation+0.081
 rain+0.077
Neg logit contributions
aron-0.147
ist-0.145
enny-0.145
ne-0.143
o-0.141
Token her
Token ID607
Feature activation+4.90e-01

Pos logit contributions
 chores+0.106
 going+0.104
 temptation+0.088
 same+0.084
 rain+0.074
Neg logit contributions
enny-0.370
ist-0.362
aron-0.360
ppy-0.359
ne-0.358
Token t
Token ID256
Feature activation+2.99e+00

Pos logit contributions
aron+0.018
ist+0.018
enny+0.018
ne+0.017
o+0.017
Neg logit contributions
 chores-0.008
 going-0.008
 same-0.007
 temptation-0.007
 rain-0.006
Tokeneddy
Token ID21874
Feature activation+1.13e+01

Pos logit contributions
aron+0.064
ne+0.054
ist+0.054
jo+0.052
ppy+0.050
Neg logit contributions
 chores-0.088
 same-0.083
 top-0.081
 temptation-0.080
 her-0.080
Token bear
Token ID6842
Feature activation+1.13e+01

Pos logit contributions
enny+0.376
ne+0.359
ist+0.358
aron+0.357
os+0.351
Neg logit contributions
 chores-0.237
 going-0.230
 same-0.224
 temptation-0.219
 her-0.209
Token.
Token ID13
Feature activation+3.40e+00

Pos logit contributions
enny+0.331
aron+0.320
ist+0.315
ne+0.315
jo+0.314
Neg logit contributions
 chores-0.269
 going-0.269
 same-0.242
 temptation-0.238
 top-0.235
Token He
Token ID679
Feature activation+3.74e+00

Pos logit contributions
ist+0.115
aron+0.114
ne+0.112
enny+0.112
ppy+0.110
Neg logit contributions
 chores-0.062
 going-0.062
 same-0.058
 temptation-0.056
 rain-0.054
Token was
Token ID373
Feature activation-8.85e+00

Pos logit contributions
enny+0.113
ist+0.112
aron+0.111
umping+0.109
ne+0.108
Neg logit contributions
 going-0.085
 same-0.078
 chores-0.077
 her-0.076
 rain-0.074
Token soft
Token ID2705
Feature activation-5.31e-01

Pos logit contributions
 chores+0.154
 temptation+0.136
 same+0.129
 going+0.128
 work+0.119
Neg logit contributions
aron-0.321
o-0.306
ist-0.305
ppy-0.297
ne-0.296
Token was
Token ID373
Feature activation-5.62e+00

Pos logit contributions
aron+0.116
ist+0.115
enny+0.113
ne+0.112
ppy+0.110
Neg logit contributions
 chores-0.114
 going-0.114
 same-0.109
 temptation-0.105
 rain-0.103
Token a
Token ID257
Feature activation+2.43e-01

Pos logit contributions
 chores+0.080
 temptation+0.079
 going+0.067
 same+0.067
 hurricane+0.064
Neg logit contributions
o-0.205
aron-0.199
enny-0.199
ne-0.196
itty-0.195
Token little
Token ID1310
Feature activation+4.04e+00

Pos logit contributions
aron+0.010
ist+0.010
enny+0.009
ne+0.009
os+0.009
Neg logit contributions
 chores-0.003
 going-0.003
 same-0.003
 temptation-0.003
 rain-0.003
Token girl
Token ID2576
Feature activation+1.40e+01

Pos logit contributions
aron+0.136
ist+0.134
enny+0.133
owed+0.130
jo+0.129
Neg logit contributions
 going-0.075
 chores-0.075
 same-0.075
 rain-0.073
 good-0.070
Token named
Token ID3706
Feature activation+2.55e+00

Pos logit contributions
aron+0.405
enny+0.401
ist+0.392
umping+0.378
jo+0.372
Neg logit contributions
 going-0.345
 her-0.316
 chores-0.305
 rain-0.299
 same-0.297
Token Lily
Token ID20037
Feature activation+1.44e+01

Pos logit contributions
enny+0.088
aron+0.087
ist+0.087
ne+0.087
o+0.086
Neg logit contributions
 chores-0.046
 going-0.043
 temptation-0.040
 same-0.040
 rain-0.036
Token.
Token ID13
Feature activation+6.27e+00

Pos logit contributions
enny+0.506
ist+0.498
aron+0.489
ouse+0.472
owed+0.472
Neg logit contributions
 going-0.292
 her-0.262
 same-0.236
 chores-0.229
 the-0.223
Token One
Token ID1881
Feature activation+1.15e+00

Pos logit contributions
aron+0.223
ist+0.222
enny+0.219
ne+0.219
o+0.214
Neg logit contributions
 chores-0.105
 going-0.104
 same-0.097
 temptation-0.094
 rain-0.090
Token day
Token ID1110
Feature activation+5.12e+00

Pos logit contributions
aron+0.054
ist+0.054
enny+0.052
ne+0.052
ppy+0.051
Neg logit contributions
 going-0.008
 chores-0.007
 same-0.007
 rain-0.006
 temptation-0.005
Token,
Token ID11
Feature activation+7.89e-01

Pos logit contributions
aron+0.167
ist+0.164
enny+0.161
ne+0.160
ppy+0.157
Neg logit contributions
 chores-0.103
 going-0.103
 same-0.096
 temptation-0.093
 rain-0.090
Token she
Token ID673
Feature activation+6.96e+00

Pos logit contributions
enny+0.025
o+0.025
itty+0.024
ppy+0.024
ne+0.023
Neg logit contributions
 chores-0.016
 going-0.014
 temptation-0.014
 same-0.014
 trouble-0.013
Token and
Token ID290
Feature activation-5.03e+00

Pos logit contributions
enny+0.287
aron+0.277
ist+0.273
ne+0.272
jo+0.271
Neg logit contributions
 going-0.171
 chores-0.165
 same-0.156
 rain-0.151
 top-0.143
Token a
Token ID257
Feature activation-2.43e+00

Pos logit contributions
 chores+0.072
 temptation+0.068
 going+0.062
 same+0.058
wered+0.054
Neg logit contributions
aron-0.185
ne-0.184
ist-0.181
es-0.179
o-0.179
Token smile
Token ID8212
Feature activation+3.74e+00

Pos logit contributions
 chores+0.047
 going+0.046
 temptation+0.043
 same+0.042
wered+0.039
Neg logit contributions
aron-0.079
ist-0.077
ne-0.077
enny-0.077
o-0.076
Tokeny
Token ID88
Feature activation+5.89e+00

Pos logit contributions
enny+0.115
aron+0.112
ist+0.112
ne+0.111
itty+0.107
Neg logit contributions
 going-0.084
 chores-0.079
 same-0.077
 rain-0.073
 her-0.073
Token face
Token ID1986
Feature activation+1.32e+00

Pos logit contributions
enny+0.203
ist+0.196
ne+0.195
aron+0.194
os+0.193
Neg logit contributions
 chores-0.107
 going-0.104
 same-0.102
 temptation-0.100
 rain-0.098
Token.
Token ID13
Feature activation+3.96e+00

Pos logit contributions
enny+0.043
ist+0.041
ne+0.041
aron+0.040
jo+0.040
Neg logit contributions
 going-0.028
 chores-0.026
 her-0.025
 same-0.025
 rain-0.024
Token She
Token ID1375
Feature activation+4.18e+00

Pos logit contributions
aron+0.134
enny+0.132
ist+0.132
ne+0.130
o+0.129
Neg logit contributions
 chores-0.074
 going-0.073
 same-0.068
 temptation-0.066
 rain-0.063
Token was
Token ID373
Feature activation-8.65e+00

Pos logit contributions
enny+0.130
aron+0.126
jo+0.125
itty+0.125
ist+0.125
Neg logit contributions
 going-0.088
 chores-0.081
 her-0.080
 same-0.080
 rain-0.078
Token happy
Token ID3772
Feature activation-1.97e+00

Pos logit contributions
 chores+0.166
 temptation+0.151
 same+0.138
 hurricane+0.136
wered+0.129
Neg logit contributions
aron-0.294
o-0.288
ist-0.274
ppy-0.269
bug-0.267
Token with
Token ID351
Feature activation-9.17e+00

Pos logit contributions
 going+0.044
 chores+0.043
 same+0.041
 temptation+0.040
 rain+0.039
Neg logit contributions
aron-0.060
ist-0.059
enny-0.059
ne-0.058
ppy-0.057
Token her
Token ID607
Feature activation+1.14e+00

Pos logit contributions
 chores+0.100
 same+0.099
 going+0.095
 hurricane+0.086
 temptation+0.084
Neg logit contributions
aron-0.347
enny-0.343
es-0.338
ist-0.337
ppy-0.334
Token humble
Token ID19695
Feature activation-2.32e+00

Pos logit contributions
 chores+0.167
 going+0.156
 temptation+0.156
 hurricane+0.151
 same+0.150
Neg logit contributions
aron-0.269
o-0.259
ist-0.259
ppy-0.253
ne-0.249
Token and
Token ID290
Feature activation-4.55e+00

Pos logit contributions
 chores+0.056
 going+0.056
 same+0.053
 temptation+0.051
 rain+0.049
Neg logit contributions
enny-0.065
ist-0.065
aron-0.065
ne-0.064
o-0.063
Token listen
Token ID6004
Feature activation-3.92e+00

Pos logit contributions
 chores+0.075
wered+0.069
 top+0.067
 same+0.066
 going+0.065
Neg logit contributions
ist-0.150
o-0.144
aron-0.143
ne-0.143
ppy-0.143
Token to
Token ID284
Feature activation-7.09e+00

Pos logit contributions
 chores+0.075
 going+0.073
 same+0.069
 temptation+0.067
 rain+0.064
Neg logit contributions
aron-0.130
ist-0.129
enny-0.128
ne-0.127
o-0.125
Token me
Token ID502
Feature activation+2.36e+00

Pos logit contributions
 chores+0.105
 going+0.098
 temptation+0.090
 hurricane+0.088
 same+0.084
Neg logit contributions
ist-0.247
ppy-0.239
aron-0.238
ne-0.238
enny-0.231
Token."
Token ID526
Feature activation+6.32e+00

Pos logit contributions
aron+0.072
enny+0.072
ist+0.071
ne+0.070
o+0.069
Neg logit contributions
 chores-0.052
 going-0.051
 same-0.048
 temptation-0.047
 rain-0.045
Token
Token ID198
Feature activation+2.76e+00

Pos logit contributions
ist+0.201
aron+0.199
ne+0.194
ppy+0.192
enny+0.192
Neg logit contributions
 chores-0.131
 going-0.128
 same-0.123
 temptation-0.118
 rain-0.114
Token
Token ID198
Feature activation+3.12e+00

Pos logit contributions
ist+0.096
aron+0.093
ppy+0.092
ne+0.090
umping+0.089
Neg logit contributions
 chores-0.050
 going-0.048
 same-0.047
 her-0.044
 temptation-0.043
TokenL
Token ID43
Feature activation+1.03e+01

Pos logit contributions
aron+0.064
ist+0.063
enny+0.062
o+0.061
ne+0.061
Neg logit contributions
 chores-0.100
 going-0.097
 same-0.095
 temptation-0.094
 rain-0.091
Tokenily
Token ID813
Feature activation+1.30e+01

Pos logit contributions
o+0.309
aron+0.307
jo+0.306
ist+0.302
os+0.302
Neg logit contributions
 chores-0.218
 going-0.216
 same-0.207
 temptation-0.199
 her-0.198
Token nodded
Token ID14464
Feature activation-3.16e-01

Pos logit contributions
itty+0.401
enny+0.399
jo+0.392
aron+0.388
oing+0.387
Neg logit contributions
 going-0.258
 same-0.253
 chores-0.244
 her-0.244
 top-0.241
Token to
Token ID284
Feature activation-7.83e+00

Pos logit contributions
enny+0.207
aron+0.207
ist+0.204
jo+0.200
ne+0.200
Neg logit contributions
 chores-0.151
 going-0.151
 same-0.144
 temptation-0.137
 rain-0.136
Token play
Token ID711
Feature activation-3.02e+00

Pos logit contributions
 chores+0.121
 going+0.120
 temptation+0.113
 same+0.112
 comparison+0.094
Neg logit contributions
ist-0.274
aron-0.272
ne-0.266
o-0.264
ppy-0.261
Token with
Token ID351
Feature activation-1.21e+01

Pos logit contributions
 chores+0.068
 going+0.067
 same+0.064
 temptation+0.061
 rain+0.060
Neg logit contributions
aron-0.090
enny-0.089
ist-0.089
ne-0.088
jo-0.087
Token.
Token ID13
Feature activation+4.44e+00

Pos logit contributions
 chores+0.257
 going+0.251
 same+0.239
 temptation+0.229
 rain+0.224
Neg logit contributions
aron-0.379
enny-0.377
ist-0.375
ne-0.371
ppy-0.364
Token Tim
Token ID5045
Feature activation+1.32e+01

Pos logit contributions
ist+0.152
aron+0.150
enny+0.147
ne+0.147
ppy+0.145
Neg logit contributions
 going-0.079
 chores-0.078
 same-0.074
 temptation-0.071
 rain-0.069
Tokenmy
Token ID1820
Feature activation+9.86e+00

Pos logit contributions
ist+0.326
aron+0.324
enny+0.321
ne+0.317
oing+0.317
Neg logit contributions
 going-0.375
 same-0.367
 chores-0.348
 her-0.339
 work-0.339
Token was
Token ID373
Feature activation-7.43e+00

Pos logit contributions
enny+0.296
ist+0.296
aron+0.293
ne+0.288
itty+0.284
Neg logit contributions
 going-0.225
 same-0.211
 chores-0.209
 her-0.197
 work-0.194
Token so
Token ID523
Feature activation-6.42e+00

Pos logit contributions
 chores+0.132
 temptation+0.120
 same+0.108
 hurricane+0.107
wered+0.104
Neg logit contributions
o-0.259
aron-0.256
ist-0.239
umping-0.237
enny-0.236
Token excited
Token ID6568
Feature activation-1.16e+00

Pos logit contributions
 chores+0.105
 going+0.094
 same+0.091
 temptation+0.090
 rain+0.084
Neg logit contributions
aron-0.234
o-0.232
enny-0.228
ist-0.225
ne-0.223
Token to
Token ID284
Feature activation-5.88e+00

Pos logit contributions
 going+0.024
 chores+0.023
 same+0.022
 temptation+0.021
 rain+0.020
Neg logit contributions
aron-0.037
enny-0.037
ist-0.037
ppy-0.037
ne-0.036
Token balance
Token ID5236
Feature activation-2.94e+00

Pos logit contributions
 chores+0.110
 temptation+0.108
 same+0.107
 going+0.098
 trouble+0.093
Neg logit contributions
o-0.174
enny-0.173
ne-0.172
ist-0.172
aron-0.171
TokenOnce
Token ID7454
Feature activation+1.31e+00

Pos logit contributions
ist+0.351
ppy+0.330
aron+0.328
owed+0.322
enny+0.321
Neg logit contributions
 going-0.149
 chores-0.138
 same-0.136
 top-0.129
 her-0.125
Token upon
Token ID2402
Feature activation-1.83e+00

Pos logit contributions
enny+0.036
aron+0.036
ist+0.035
ne+0.035
o+0.035
Neg logit contributions
 chores-0.032
 going-0.031
 same-0.030
 temptation-0.030
 rain-0.028
Token a
Token ID257
Feature activation+1.69e+00

Pos logit contributions
 chores+0.015
 going+0.015
 same+0.013
 temptation+0.012
 rain+0.010
Neg logit contributions
aron-0.080
ist-0.079
enny-0.079
ne-0.078
o-0.078
Token time
Token ID640
Feature activation+3.30e+00

Pos logit contributions
enny+0.077
ist+0.077
aron+0.077
ne+0.076
os+0.076
Neg logit contributions
 chores-0.011
 going-0.010
 same-0.009
 temptation-0.007
 rain-0.006
Token,
Token ID11
Feature activation+5.87e-01

Pos logit contributions
aron+0.100
enny+0.099
ist+0.099
ne+0.098
o+0.097
Neg logit contributions
 chores-0.072
 going-0.070
 same-0.067
 temptation-0.066
 rain-0.062
Token there
Token ID612
Feature activation+4.34e+00

Pos logit contributions
ist+0.020
enny+0.020
aron+0.020
ne+0.020
o+0.020
Neg logit contributions
 chores-0.011
 going-0.010
 temptation-0.010
 same-0.010
 rain-0.009
Token was
Token ID373
Feature activation-5.76e+00

Pos logit contributions
aron+0.120
ist+0.119
enny+0.117
ne+0.116
ppy+0.114
Neg logit contributions
 going-0.108
 chores-0.107
 same-0.103
 temptation-0.098
 rain-0.097
Token a
Token ID257
Feature activation+1.96e-01

Pos logit contributions
 chores+0.076
 temptation+0.074
 going+0.066
 same+0.063
 hurricane+0.057
Neg logit contributions
o-0.216
aron-0.214
enny-0.213
ne-0.211
umping-0.207
Token girl
Token ID2576
Feature activation+1.36e+01

Pos logit contributions
aron+0.008
ist+0.008
enny+0.008
ne+0.008
os+0.008
Neg logit contributions
 going-0.002
 chores-0.002
 same-0.002
 temptation-0.002
 rain-0.002
Token named
Token ID3706
Feature activation+2.42e+00

Pos logit contributions
aron+0.391
enny+0.389
ist+0.381
umping+0.364
ppy+0.360
Neg logit contributions
 going-0.339
 her-0.312
 chores-0.301
 rain-0.298
 same-0.291
Token Lily
Token ID20037
Feature activation+1.43e+01

Pos logit contributions
aron+0.089
ist+0.088
enny+0.088
ne+0.087
o+0.086
Neg logit contributions
 chores-0.038
 going-0.037
 same-0.034
 temptation-0.033
 rain-0.031
TokenI
Token ID40
Feature activation-8.04e-01

Pos logit contributions
ist+0.083
aron+0.083
enny+0.083
ne+0.080
jo+0.080
Neg logit contributions
 chores-0.064
 going-0.063
 same-0.060
 temptation-0.059
 rain-0.057
Token'm
Token ID1101
Feature activation-7.51e+00

Pos logit contributions
 going+0.015
 chores+0.015
 good+0.015
 same+0.015
 help+0.014
Neg logit contributions
aron-0.028
ist-0.027
umping-0.026
ne-0.026
jo-0.026
Token sorry
Token ID7926
Feature activation-1.38e+00

Pos logit contributions
 chores+0.092
 same+0.073
 temptation+0.072
 going+0.065
 work+0.064
Neg logit contributions
o-0.304
aron-0.302
ist-0.291
ppy-0.291
itty-0.290
Token,
Token ID11
Feature activation-2.25e+00

Pos logit contributions
 chores+0.027
 going+0.026
 same+0.025
 temptation+0.024
 rain+0.023
Neg logit contributions
aron-0.046
ist-0.046
enny-0.045
ne-0.045
umping-0.044
Token Ben
Token ID3932
Feature activation+1.28e+01

Pos logit contributions
 chores+0.043
 going+0.041
 same+0.038
 temptation+0.037
 rain+0.034
Neg logit contributions
aron-0.075
ppy-0.073
enny-0.073
ne-0.073
ist-0.073
Token.
Token ID13
Feature activation+3.07e+00

Pos logit contributions
enny+0.437
ist+0.420
os+0.411
aron+0.409
umping+0.409
Neg logit contributions
 going-0.251
 same-0.238
 chores-0.236
 temptation-0.229
 good-0.225
Token The
Token ID383
Feature activation-6.24e-01

Pos logit contributions
aron+0.110
ist+0.109
enny+0.108
ne+0.107
ppy+0.105
Neg logit contributions
 chores-0.051
 going-0.050
 same-0.048
 temptation-0.047
 rain-0.044
Token tree
Token ID5509
Feature activation+9.61e+00

Pos logit contributions
 chores+0.015
 going+0.014
 temptation+0.013
 avail+0.012
 same+0.012
Neg logit contributions
o-0.016
itty-0.016
aron-0.016
ne-0.016
ney-0.016
Token is
Token ID318
Feature activation-7.99e+00

Pos logit contributions
enny+0.307
aron+0.303
ist+0.292
ne+0.291
ppy+0.289
Neg logit contributions
 chores-0.202
 going-0.201
 same-0.193
 temptation-0.184
 rain-0.182
Token not
Token ID407
Feature activation-8.14e+00

Pos logit contributions
 chores+0.135
 temptation+0.122
 same+0.104
wered+0.104
 top+0.103
Neg logit contributions
o-0.282
aron-0.279
ist-0.276
ppy-0.264
ne-0.257
Token mean
Token ID1612
Feature activation-2.30e+00

Pos logit contributions
 chores+0.108
 going+0.094
 temptation+0.092
 same+0.089
 top+0.083
Neg logit contributions
aron-0.317
ist-0.314
o-0.311
ne-0.306
ppy-0.306
Token successful
Token ID4388
Feature activation-1.57e+00

Pos logit contributions
 chores+0.152
wered+0.134
 same+0.133
 going+0.130
 rain+0.128
Neg logit contributions
aron-0.199
ist-0.187
ry-0.185
o-0.183
enny-0.183
Token.
Token ID13
Feature activation+4.64e+00

Pos logit contributions
 chores+0.040
 going+0.040
 same+0.038
 temptation+0.037
 rain+0.036
Neg logit contributions
aron-0.042
ist-0.042
enny-0.042
ne-0.041
o-0.040
Token
Token ID198
Feature activation+2.46e+00

Pos logit contributions
aron+0.145
enny+0.144
ne+0.141
o+0.140
ist+0.140
Neg logit contributions
 chores-0.098
 going-0.096
 same-0.090
 temptation-0.089
 rain-0.085
Token
Token ID198
Feature activation+3.10e+00

Pos logit contributions
ist+0.081
aron+0.080
ppy+0.077
ne+0.077
enny+0.076
Neg logit contributions
 chores-0.049
 going-0.048
 same-0.045
 temptation-0.042
 top-0.042
TokenBut
Token ID1537
Feature activation+7.22e-01

Pos logit contributions
aron+0.086
ist+0.086
enny+0.084
ne+0.082
o+0.082
Neg logit contributions
 chores-0.077
 going-0.076
 same-0.072
 temptation-0.071
 rain-0.068
Token then
Token ID788
Feature activation+7.49e-01

Pos logit contributions
aron+0.022
ist+0.022
ne+0.022
enny+0.022
ppy+0.022
Neg logit contributions
 chores-0.015
 going-0.015
 temptation-0.014
 same-0.014
 rain-0.013
Token something
Token ID1223
Feature activation+3.76e+00

Pos logit contributions
aron+0.024
enny+0.024
ist+0.024
ne+0.024
o+0.023
Neg logit contributions
 chores-0.015
 going-0.015
 same-0.014
 temptation-0.013
 rain-0.013
Token happened
Token ID3022
Feature activation-1.57e+00

Pos logit contributions
ne+0.137
ist+0.135
enny+0.133
aron+0.133
ppy+0.130
Neg logit contributions
 chores-0.054
 going-0.050
 same-0.048
 temptation-0.048
 top-0.041
Token that
Token ID326
Feature activation-1.16e+00

Pos logit contributions
 chores+0.042
 going+0.041
 same+0.039
 temptation+0.039
 rain+0.037
Neg logit contributions
aron-0.040
ist-0.040
enny-0.039
ne-0.039
o-0.038
Token made
Token ID925
Feature activation-7.34e+00

Pos logit contributions
 chores+0.026
 going+0.024
 temptation+0.024
 same+0.023
wered+0.021
Neg logit contributions
ne-0.034
ist-0.034
aron-0.033
o-0.033
os-0.033
Token her
Token ID607
Feature activation+8.31e-01

Pos logit contributions
 chores+0.087
 going+0.086
 same+0.078
 temptation+0.077
 hurricane+0.069
Neg logit contributions
ppy-0.279
es-0.275
ne-0.273
enny-0.272
aron-0.271
Token bunny
Token ID44915
Feature activation+1.37e+01

Pos logit contributions
pillar+0.015
aron+0.015
itty+0.015
ppy+0.015
ne+0.014
Neg logit contributions
 going-0.022
 chores-0.022
wered-0.020
 their-0.019
 temptation-0.019
Token was
Token ID373
Feature activation-7.83e+00

Pos logit contributions
aron+0.430
enny+0.421
ist+0.420
ne+0.407
jo+0.404
Neg logit contributions
 going-0.277
 same-0.259
 her-0.254
 rain-0.252
 big-0.251
Token so
Token ID523
Feature activation-6.20e+00

Pos logit contributions
 chores+0.159
 temptation+0.144
 same+0.132
 going+0.132
 rain+0.123
Neg logit contributions
aron-0.261
o-0.255
ist-0.249
ppy-0.244
enny-0.244
Token happy
Token ID3772
Feature activation-1.01e+00

Pos logit contributions
 chores+0.110
 going+0.098
 temptation+0.094
 same+0.092
 rain+0.083
Neg logit contributions
aron-0.220
o-0.218
enny-0.215
ist-0.212
os-0.210
Token and
Token ID290
Feature activation-3.37e+00

Pos logit contributions
 going+0.021
 chores+0.018
 same+0.018
 her+0.017
 rain+0.017
Neg logit contributions
ist-0.034
aron-0.033
enny-0.033
ppy-0.033
ne-0.032
Token thanked
Token ID26280
Feature activation-5.26e+00

Pos logit contributions
 chores+0.094
 same+0.085
 going+0.085
 temptation+0.084
wered+0.080
Neg logit contributions
aron-0.085
o-0.084
enny-0.083
ist-0.081
ne-0.081
Token Fox
Token ID5426
Feature activation+1.36e+01

Pos logit contributions
 chores+0.128
 going+0.122
 same+0.118
 temptation+0.115
 rain+0.110
Neg logit contributions
aron-0.149
ne-0.145
ist-0.144
enny-0.142
o-0.141
Tokeny
Token ID88
Feature activation+6.61e+00

Pos logit contributions
aron+0.195
umping+0.179
ist+0.176
ouse+0.172
ushed+0.168
Neg logit contributions
 same-0.484
 going-0.483
 her-0.482
 chores-0.465
 good-0.462
Token for
Token ID329
Feature activation-8.91e+00

Pos logit contributions
aron+0.255
ist+0.251
enny+0.249
ne+0.248
umping+0.245
Neg logit contributions
 chores-0.091
 going-0.091
 same-0.085
 temptation-0.080
 work-0.075
Token being
Token ID852
Feature activation-6.56e+00

Pos logit contributions
 chores+0.175
 same+0.166
 top+0.155
 hurricane+0.146
 temptation+0.145
Neg logit contributions
o-0.273
es-0.268
aron-0.262
ppy-0.260
ist-0.254
Token so
Token ID523
Feature activation-7.08e+00

Pos logit contributions
 chores+0.103
 going+0.098
 same+0.090
 temptation+0.089
 rain+0.082
Neg logit contributions
aron-0.243
enny-0.239
ist-0.239
ne-0.237
o-0.235
Token up
Token ID510
Feature activation-5.29e+00

Pos logit contributions
 chores+0.132
 going+0.127
wered+0.122
 temptation+0.121
 same+0.116
Neg logit contributions
o-0.293
aron-0.289
enny-0.282
es-0.282
lier-0.281
Token with
Token ID351
Feature activation-1.09e+01

Pos logit contributions
 chores+0.099
 going+0.096
 same+0.096
 temptation+0.091
wered+0.086
Neg logit contributions
aron-0.166
ist-0.165
enny-0.162
o-0.161
ne-0.161
Token Jerry
Token ID13075
Feature activation+1.13e+01

Pos logit contributions
 chores+0.227
 going+0.212
 same+0.205
 rain+0.199
 hurricane+0.192
Neg logit contributions
aron-0.276
ist-0.272
o-0.269
lier-0.254
es-0.253
Token.
Token ID13
Feature activation+5.04e+00

Pos logit contributions
aron+0.329
ist+0.323
enny+0.321
ne+0.316
jo+0.307
Neg logit contributions
 chores-0.276
 going-0.273
 same-0.254
 temptation-0.247
 rain-0.236
Token So
Token ID1406
Feature activation+3.05e-02

Pos logit contributions
aron+0.160
ist+0.157
enny+0.157
ne+0.155
o+0.153
Neg logit contributions
 chores-0.104
 going-0.102
 same-0.096
 temptation-0.094
 rain-0.090
Token,
Token ID11
Feature activation-2.54e-01

Pos logit contributions
o+0.001
aron+0.001
ne+0.001
enny+0.001
oc+0.001
Neg logit contributions
 chores-0.001
 going-0.001
 same-0.001
 temptation-0.001
 rain-0.001
Token he
Token ID339
Feature activation+5.94e+00

Pos logit contributions
 chores+0.006
 same+0.005
 temptation+0.005
 going+0.005
 laundry+0.005
Neg logit contributions
o-0.007
ne-0.007
aron-0.007
enny-0.007
es-0.007
Token watched
Token ID7342
Feature activation-6.50e+00

Pos logit contributions
aron+0.200
enny+0.199
ist+0.195
jo+0.194
umping+0.192
Neg logit contributions
 going-0.116
 chores-0.104
 same-0.100
 rain-0.098
 go-0.094
Token the
Token ID262
Feature activation-2.51e+00

Pos logit contributions
 chores+0.144
 same+0.135
 going+0.128
 temptation+0.128
 laundry+0.122
Neg logit contributions
aron-0.191
ist-0.190
ne-0.180
enny-0.179
o-0.179
Token space
Token ID2272
Feature activation+5.50e+00

Pos logit contributions
 chores+0.055
 her+0.050
 going+0.049
 temptation+0.048
wered+0.043
Neg logit contributions
ne-0.070
ist-0.068
oc-0.067
aron-0.065
jo-0.065
Token ship
Token ID4074
Feature activation+1.12e+01

Pos logit contributions
ist+0.165
ne+0.160
aron+0.158
enny+0.157
os+0.157
Neg logit contributions
 chores-0.121
 going-0.116
 temptation-0.112
 same-0.112
 rain-0.103
Token and
Token ID290
Feature activation-4.90e+00

Pos logit contributions
aron+0.009
enny+0.009
ppy+0.009
ist+0.009
ne+0.009
Neg logit contributions
 going-0.008
 chores-0.007
 same-0.007
 temptation-0.007
 rain-0.007
Token lay
Token ID3830
Feature activation-4.78e+00

Pos logit contributions
 chores+0.134
 going+0.127
 temptation+0.125
 same+0.120
 rain+0.112
Neg logit contributions
ne-0.121
o-0.120
ist-0.120
enny-0.120
aron-0.119
Token down
Token ID866
Feature activation-4.41e+00

Pos logit contributions
 chores+0.084
 going+0.084
 same+0.079
 temptation+0.075
 rain+0.072
Neg logit contributions
aron-0.165
enny-0.165
ist-0.164
ne-0.163
jo-0.158
Token.
Token ID13
Feature activation+3.16e+00

Pos logit contributions
 chores+0.132
 temptation+0.123
 same+0.121
 going+0.118
 laundry+0.116
Neg logit contributions
aron-0.095
o-0.091
ist-0.090
os-0.088
ne-0.088
Token
Token ID198
Feature activation+1.12e+00

Pos logit contributions
aron+0.094
enny+0.093
ist+0.091
ne+0.091
o+0.090
Neg logit contributions
 chores-0.072
 going-0.071
 same-0.066
 temptation-0.065
 rain-0.062
Token
Token ID198
Feature activation+1.97e+00

Pos logit contributions
ist+0.037
aron+0.036
ne+0.035
ppy+0.035
enny+0.035
Neg logit contributions
 chores-0.023
 going-0.022
 same-0.021
 temptation-0.020
 rain-0.020
TokenL
Token ID43
Feature activation+9.03e+00

Pos logit contributions
ist+0.052
aron+0.052
ne+0.049
os+0.049
enny+0.049
Neg logit contributions
 chores-0.051
 going-0.050
 same-0.049
 temptation-0.048
 rain-0.046
Tokenily
Token ID813
Feature activation+1.17e+01

Pos logit contributions
o+0.279
aron+0.279
oing+0.277
jo+0.274
os+0.273
Neg logit contributions
 chores-0.181
 going-0.179
 same-0.174
 temptation-0.167
 her-0.162
Token played
Token ID2826
Feature activation-3.36e+00

Pos logit contributions
ist+0.409
enny+0.396
aron+0.391
jo+0.389
oing+0.385
Neg logit contributions
 going-0.220
 rain-0.197
 her-0.195
 big-0.192
 same-0.188
Token with
Token ID351
Feature activation-1.02e+01

Pos logit contributions
 going+0.094
 chores+0.094
 temptation+0.090
 same+0.088
 hurricane+0.087
Neg logit contributions
aron-0.078
es-0.075
ist-0.074
enny-0.073
o-0.068
Token her
Token ID607
Feature activation+1.88e-02

Pos logit contributions
 going+0.193
 chores+0.183
 temptation+0.177
 same+0.170
 hurricane+0.160
Neg logit contributions
enny-0.324
ppy-0.313
ist-0.309
o-0.309
es-0.307
Token,
Token ID11
Feature activation-7.24e-01

Pos logit contributions
aron+0.105
enny+0.105
ne+0.104
ist+0.104
o+0.102
Neg logit contributions
 chores-0.078
 going-0.076
 same-0.073
 temptation-0.072
 rain-0.067
Token she
Token ID673
Feature activation+6.24e+00

Pos logit contributions
 chores+0.013
 going+0.012
 temptation+0.011
 same+0.011
wered+0.010
Neg logit contributions
enny-0.025
ne-0.024
o-0.024
aron-0.024
ist-0.024
Token wanted
Token ID2227
Feature activation-6.43e+00

Pos logit contributions
enny+0.164
aron+0.163
ist+0.162
ne+0.161
o+0.159
Neg logit contributions
 chores-0.161
 going-0.155
 temptation-0.152
 same-0.148
 rain-0.140
Token to
Token ID284
Feature activation-7.91e+00

Pos logit contributions
 temptation+0.149
 chores+0.142
 going+0.135
 same+0.133
 urge+0.129
Neg logit contributions
enny-0.185
o-0.182
ne-0.181
os-0.174
ist-0.174
Token wear
Token ID5806
Feature activation-6.21e+00

Pos logit contributions
 temptation+0.138
 going+0.131
 chores+0.125
 same+0.124
 comparison+0.112
Neg logit contributions
ne-0.257
enny-0.251
ist-0.247
ouse-0.242
r-0.240
Token a
Token ID257
Feature activation-3.64e+00

Pos logit contributions
 chores+0.055
 temptation+0.054
 going+0.052
 same+0.045
 hurricane+0.037
Neg logit contributions
ist-0.258
enny-0.256
aron-0.255
ne-0.254
ppy-0.250
Token princess
Token ID21752
Feature activation+1.15e+01

Pos logit contributions
 going+0.060
 chores+0.060
 temptation+0.057
 same+0.054
wered+0.049
Neg logit contributions
ist-0.125
enny-0.124
ne-0.124
aron-0.123
o-0.121
Token dress
Token ID6576
Feature activation+4.04e+00

Pos logit contributions
aron+0.375
enny+0.368
os+0.358
umping+0.354
ist+0.353
Neg logit contributions
 going-0.239
 chores-0.231
 same-0.213
 her-0.206
 rain-0.201
Token,
Token ID11
Feature activation-3.23e+00

Pos logit contributions
aron+0.125
enny+0.123
ne+0.122
o+0.120
os+0.119
Neg logit contributions
 going-0.087
 chores-0.087
 same-0.082
 rain-0.075
 her-0.075
Token but
Token ID475
Feature activation-1.70e+00

Pos logit contributions
 chores+0.072
 going+0.069
 temptation+0.067
 same+0.067
 rain+0.062
Neg logit contributions
aron-0.096
ist-0.095
enny-0.094
ne-0.094
ppy-0.092
Token she
Token ID673
Feature activation+5.43e+00

Pos logit contributions
 temptation+0.024
 chores+0.024
 same+0.022
 going+0.021
wered+0.019
Neg logit contributions
enny-0.064
ppy-0.062
o-0.061
owed-0.060
es-0.060
Token a
Token ID257
Feature activation-2.02e+00

Pos logit contributions
 chores+0.079
 going+0.076
 temptation+0.073
 same+0.065
 urge+0.058
Neg logit contributions
o-0.242
enny-0.238
aron-0.238
os-0.236
ne-0.235
Token man
Token ID582
Feature activation+1.26e+01

Pos logit contributions
 chores+0.028
 going+0.028
 same+0.026
 temptation+0.024
 rain+0.023
Neg logit contributions
aron-0.078
ist-0.077
enny-0.077
ne-0.076
ppy-0.075
Token.
Token ID13
Feature activation+4.05e+00

Pos logit contributions
aron+0.394
enny+0.392
ist+0.388
ppy+0.377
ne+0.377
Neg logit contributions
 going-0.276
 chores-0.267
 same-0.247
 temptation-0.243
 rain-0.234
Token The
Token ID383
Feature activation+4.44e-01

Pos logit contributions
aron+0.129
ist+0.128
enny+0.127
ne+0.125
o+0.123
Neg logit contributions
 chores-0.083
 going-0.082
 same-0.078
 temptation-0.075
 rain-0.072
Token man
Token ID582
Feature activation+1.43e+01

Pos logit contributions
aron+0.014
ne+0.013
ist+0.013
enny+0.013
o+0.013
Neg logit contributions
 chores-0.009
 going-0.009
 temptation-0.008
 same-0.008
 rain-0.008
Token has
Token ID468
Feature activation-7.26e+00

Pos logit contributions
enny+0.464
aron+0.457
ist+0.450
ppy+0.445
ne+0.435
Neg logit contributions
 going-0.301
 chores-0.289
 same-0.275
 temptation-0.256
 rain-0.252
Token a
Token ID257
Feature activation-3.03e+00

Pos logit contributions
 temptation+0.099
 chores+0.095
 going+0.084
 hurricane+0.081
 same+0.076
Neg logit contributions
aron-0.267
o-0.265
ne-0.256
umping-0.251
os-0.248
Token staff
Token ID3085
Feature activation+9.07e+00

Pos logit contributions
 going+0.052
 chores+0.052
 temptation+0.050
 same+0.046
wered+0.045
Neg logit contributions
aron-0.101
o-0.100
ist-0.099
enny-0.099
ouse-0.099
Token.
Token ID13
Feature activation+3.78e+00

Pos logit contributions
aron+0.291
enny+0.289
ist+0.285
ne+0.279
ppy+0.279
Neg logit contributions
 going-0.189
 chores-0.186
 same-0.171
 temptation-0.164
 rain-0.159
Token A
Token ID317
Feature activation+6.11e-01

Pos logit contributions
ist+0.121
aron+0.121
enny+0.119
ne+0.118
ppy+0.116
Neg logit contributions
 chores-0.076
 going-0.075
 same-0.072
 temptation-0.069
 rain-0.066
Token staff
Token ID3085
Feature activation+1.06e+01

Pos logit contributions
aron+0.021
ist+0.021
enny+0.021
ne+0.020
ppy+0.020
Neg logit contributions
 chores-0.011
 going-0.011
 same-0.010
 temptation-0.010
 rain-0.009
Token a
Token ID257
Feature activation-2.58e+00

Pos logit contributions
 temptation+0.100
 chores+0.098
 hurricane+0.081
 going+0.080
wered+0.079
Neg logit contributions
o-0.270
aron-0.258
ne-0.251
enny-0.249
itty-0.246
Token special
Token ID2041
Feature activation+3.51e-01

Pos logit contributions
 temptation+0.037
 going+0.036
 chores+0.035
wered+0.034
 comparison+0.029
Neg logit contributions
o-0.092
ouse-0.091
itty-0.089
enny-0.087
aron-0.087
Token uniform
Token ID8187
Feature activation+1.03e+01

Pos logit contributions
aron+0.011
ist+0.010
o+0.010
ouse+0.010
enny+0.010
Neg logit contributions
 going-0.007
 chores-0.007
 temptation-0.007
 same-0.006
wered-0.006
Token that
Token ID326
Feature activation-2.38e+00

Pos logit contributions
aron+0.354
enny+0.350
ist+0.350
ne+0.345
ppy+0.344
Neg logit contributions
 going-0.191
 chores-0.187
 same-0.175
 temptation-0.162
 rain-0.158
Token he
Token ID339
Feature activation+5.21e+00

Pos logit contributions
 chores+0.046
 temptation+0.045
 going+0.041
 same+0.041
wered+0.039
Neg logit contributions
o-0.078
enny-0.076
os-0.074
jo-0.074
aron-0.074
Token loved
Token ID6151
Feature activation-7.58e+00

Pos logit contributions
aron+0.155
ist+0.152
enny+0.150
ne+0.148
umping+0.146
Neg logit contributions
 going-0.125
 chores-0.121
 same-0.115
 good-0.108
 rain-0.105
Token to
Token ID284
Feature activation-7.66e+00

Pos logit contributions
 chores+0.157
 temptation+0.150
 same+0.147
 going+0.142
wered+0.138
Neg logit contributions
o-0.233
aron-0.229
enny-0.226
jo-0.224
ne-0.222
Token wear
Token ID5806
Feature activation-6.21e+00

Pos logit contributions
 chores+0.134
 temptation+0.131
 going+0.129
 same+0.127
 trouble+0.115
Neg logit contributions
aron-0.249
ist-0.249
jo-0.245
enny-0.242
ne-0.241
Token.
Token ID13
Feature activation+4.79e+00

Pos logit contributions
 chores+0.145
 going+0.141
 same+0.136
 temptation+0.135
 rain+0.128
Neg logit contributions
aron-0.178
ist-0.175
enny-0.174
ne-0.172
o-0.169
Token One
Token ID1881
Feature activation+7.69e-01

Pos logit contributions
ist+0.169
aron+0.169
enny+0.167
ne+0.166
ppy+0.164
Neg logit contributions
 chores-0.082
 going-0.080
 same-0.076
 temptation-0.072
 rain-0.070
Token day
Token ID1110
Feature activation+4.32e+00

Pos logit contributions
aron+0.034
ist+0.034
enny+0.034
ne+0.033
ppy+0.033
Neg logit contributions
 chores-0.006
 going-0.006
 same-0.005
 temptation-0.005
 rain-0.005
Token and
Token ID290
Feature activation-3.94e+00

Pos logit contributions
 going+0.042
 chores+0.041
 same+0.040
 temptation+0.039
 rain+0.037
Neg logit contributions
enny-0.044
ist-0.044
aron-0.043
ne-0.042
o-0.042
Token smart
Token ID4451
Feature activation-6.05e-01

Pos logit contributions
 chores+0.070
 going+0.062
 temptation+0.060
 same+0.059
 top+0.058
Neg logit contributions
o-0.133
aron-0.131
ne-0.131
ist-0.131
ppy-0.130
Token.
Token ID13
Feature activation+2.67e+00

Pos logit contributions
 going+0.015
 chores+0.015
 same+0.014
 temptation+0.014
 rain+0.013
Neg logit contributions
enny-0.016
ist-0.016
aron-0.016
ne-0.016
ppy-0.016
Token We
Token ID775
Feature activation-5.82e-01

Pos logit contributions
aron+0.084
ist+0.084
enny+0.084
ne+0.082
ppy+0.081
Neg logit contributions
 chores-0.054
 going-0.054
 same-0.051
 temptation-0.050
 rain-0.047
Token are
Token ID389
Feature activation-8.32e+00

Pos logit contributions
 chores+0.010
 going+0.010
 same+0.010
 rain+0.009
 work+0.009
Neg logit contributions
enny-0.020
aron-0.020
ist-0.020
jo-0.020
umping-0.019
Token very
Token ID845
Feature activation-7.42e+00

Pos logit contributions
 chores+0.132
 temptation+0.127
 same+0.114
wered+0.111
 rain+0.104
Neg logit contributions
aron-0.293
o-0.286
ist-0.281
ppy-0.277
ne-0.275
Token proud
Token ID6613
Feature activation-3.87e+00

Pos logit contributions
 chores+0.125
 going+0.116
 same+0.111
 temptation+0.110
 rain+0.104
Neg logit contributions
aron-0.261
ist-0.259
o-0.257
ne-0.257
enny-0.256
Token of
Token ID286
Feature activation-7.74e+00

Pos logit contributions
 chores+0.099
 going+0.097
 same+0.093
 temptation+0.091
 rain+0.088
Neg logit contributions
aron-0.104
ist-0.102
enny-0.102
ne-0.101
o-0.099
Token you
Token ID345
Feature activation+3.14e+00

Pos logit contributions
 chores+0.128
 same+0.122
 going+0.120
 temptation+0.111
 urge+0.108
Neg logit contributions
ist-0.265
o-0.260
aron-0.260
ne-0.260
ppy-0.257
Token,"
Token ID553
Feature activation+3.19e+00

Pos logit contributions
enny+0.096
ist+0.093
aron+0.092
ne+0.092
itty+0.092
Neg logit contributions
 going-0.072
 chores-0.070
 same-0.066
 good-0.063
 temptation-0.063
Token dad
Token ID9955
Feature activation+1.22e+01

Pos logit contributions
o+0.106
enny+0.103
itty+0.100
ne+0.099
bug+0.098
Neg logit contributions
 chores-0.055
 going-0.054
 temptation-0.054
 same-0.053
wered-0.047
Token,
Token ID11
Feature activation-7.54e-01

Pos logit contributions
 chores+0.136
 going+0.134
 same+0.127
 temptation+0.126
 rain+0.118
Neg logit contributions
ist-0.172
aron-0.172
enny-0.172
ne-0.171
o-0.165
Token M
Token ID337
Feature activation+9.68e+00

Pos logit contributions
 chores+0.013
 going+0.013
 same+0.012
 temptation+0.012
 rain+0.011
Neg logit contributions
enny-0.026
aron-0.026
ne-0.026
ist-0.026
o-0.025
Tokenaddy
Token ID13218
Feature activation+1.48e+01

Pos logit contributions
aron+0.289
oc+0.287
ist+0.282
es+0.281
o+0.279
Neg logit contributions
 going-0.207
 chores-0.203
 same-0.194
 temptation-0.189
 work-0.187
Token tried
Token ID3088
Feature activation-6.67e+00

Pos logit contributions
enny+0.468
aron+0.459
ist+0.455
ne+0.437
itty+0.434
Neg logit contributions
 going-0.323
 chores-0.296
 same-0.288
 rain-0.283
 her-0.274
Token to
Token ID284
Feature activation-8.45e+00

Pos logit contributions
 chores+0.114
 temptation+0.111
 going+0.107
 same+0.105
wered+0.099
Neg logit contributions
aron-0.218
enny-0.217
ist-0.216
o-0.213
ne-0.213
Token find
Token ID1064
Feature activation-6.60e+00

Pos logit contributions
 chores+0.139
 going+0.133
 temptation+0.123
 same+0.123
 rain+0.109
Neg logit contributions
ist-0.302
aron-0.298
ne-0.292
ppy-0.287
enny-0.282
Token something
Token ID1223
Feature activation+2.73e-01

Pos logit contributions
 chores+0.084
 going+0.082
 temptation+0.074
 same+0.067
 rain+0.061
Neg logit contributions
ist-0.254
aron-0.250
ne-0.247
enny-0.247
os-0.241
Token special
Token ID2041
Feature activation-1.27e+00

Pos logit contributions
ist+0.010
aron+0.010
enny+0.010
ne+0.010
o+0.010
Neg logit contributions
 chores-0.004
 going-0.004
 same-0.004
 temptation-0.004
 rain-0.003
Token and
Token ID290
Feature activation-5.23e+00

Pos logit contributions
 going+0.026
 chores+0.026
 same+0.024
 temptation+0.023
 rain+0.022
Neg logit contributions
aron-0.040
enny-0.040
ist-0.039
ne-0.039
o-0.038
Token colorful
Token ID20239
Feature activation-2.81e-01

Pos logit contributions
 chores+0.096
 temptation+0.088
 going+0.087
 same+0.084
wered+0.083
Neg logit contributions
ist-0.170
enny-0.164
ne-0.164
owed-0.163
o-0.161
Token every
Token ID790
Feature activation-5.13e+00

Pos logit contributions
 chores+0.006
 going+0.006
 same+0.006
 temptation+0.006
 rain+0.006
Neg logit contributions
aron-0.008
ist-0.008
enny-0.008
ne-0.008
o-0.008
Token help
Token ID1037
Feature activation-5.43e+00

Pos logit contributions
 chores+0.091
 going+0.089
 same+0.084
 temptation+0.081
 rain+0.078
Neg logit contributions
aron-0.149
ist-0.147
enny-0.147
ne-0.145
o-0.143
Token me
Token ID502
Feature activation+1.59e+00

Pos logit contributions
 chores+0.084
 going+0.082
 same+0.076
 temptation+0.073
 rain+0.069
Neg logit contributions
aron-0.201
ist-0.199
enny-0.198
ne-0.196
o-0.194
Token.
Token ID13
Feature activation+3.05e+00

Pos logit contributions
aron+0.051
ist+0.051
enny+0.050
ne+0.050
o+0.050
Neg logit contributions
 chores-0.032
 going-0.031
 same-0.029
 temptation-0.029
 rain-0.027
Token You
Token ID921
Feature activation+1.17e+00

Pos logit contributions
aron+0.095
ist+0.095
enny+0.094
ne+0.093
jo+0.091
Neg logit contributions
 chores-0.063
 going-0.062
 same-0.058
 temptation-0.057
 rain-0.055
Token can
Token ID460
Feature activation-5.05e+00

Pos logit contributions
aron+0.034
ist+0.034
enny+0.034
ne+0.034
o+0.033
Neg logit contributions
 chores-0.026
 going-0.026
 same-0.024
 temptation-0.024
 rain-0.023
Token put
Token ID1234
Feature activation-9.06e+00

Pos logit contributions
 chores+0.066
 going+0.066
 temptation+0.061
 same+0.061
 rain+0.054
Neg logit contributions
aron-0.193
ist-0.192
o-0.189
ne-0.189
enny-0.188
Token some
Token ID617
Feature activation-6.60e+00

Pos logit contributions
 chores+0.103
 going+0.101
 temptation+0.091
 same+0.090
 rain+0.079
Neg logit contributions
aron-0.366
ist-0.361
o-0.357
ne-0.355
enny-0.352
Token cheese
Token ID9891
Feature activation+6.40e+00

Pos logit contributions
 chores+0.105
 going+0.103
 same+0.095
 temptation+0.092
 rain+0.086
Neg logit contributions
aron-0.240
ist-0.237
enny-0.236
ne-0.234
o-0.231
Token on
Token ID319
Feature activation-1.06e+01

Pos logit contributions
enny+0.207
aron+0.204
jo+0.202
ist+0.201
ne+0.198
Neg logit contributions
 chores-0.129
 going-0.126
 same-0.120
 rain-0.110
 good-0.109
Token the
Token ID262
Feature activation-4.83e+00

Pos logit contributions
 going+0.121
 chores+0.118
 temptation+0.104
 same+0.102
 rain+0.088
Neg logit contributions
aron-0.430
ist-0.420
ne-0.415
ppy-0.412
o-0.411
Token burgers
Token ID34623
Feature activation+5.43e+00

Pos logit contributions
 chores+0.090
 going+0.090
 temptation+0.082
 same+0.081
wered+0.076
Neg logit contributions
aron-0.159
ist-0.153
ne-0.152
o-0.151
ppy-0.151