TOP ACTIVATIONS
MAX = 28.838

they had reached an imp asse , the strong man bow
had become trapped in the bog . Rose thought and thought
the animal out of the bog . The animal thanked Rose
. The k idd o understood the message ,
Once there was a k idd o who loved to guess
one day , the k idd o â s
hard - and the k idd o soon began to do
grandma 's house for safe keeping . When
referee . The referee is calm and kind .
the ball back to the referee . The referee
bring the quarrel to a resolution , until everyone was happy
into the net ! The referee said it was a goal
s make a deal . You help me organize
. " Look , a mask !" Lucy exclaimed . Lucy
wet and the motorcycle sk idd ed . It tipped over
. He had a special mask of his own . He
saw a lady wearing a mask . The mask was deep
wearing a mask . The mask was deep blue and covered
melodies and even her own compositions . Emma was so proud
slippery and the car sk idd ed and hit a tree

MIN ACTIVATIONS
MIN = -32.044

. He pulls and twists the hose . He sees a
runs to them and grabs the ball . He says ,
works ! We can turn the screw with this screw driver
to fight . They pull the book from each other .
. We have to save the world from bad guys ."
and pulls . She opens the barrel . Tim comes out
okay to not always win the game . <|endoftext|> Once upon
sorry , but he lost the book . He says he
. They let go of the tire . It rolls fast
not jump as high as the others . One
happy . They forgot about the balloon . They
, you have to wind the string around the top ,
to pick good things from the refrigerator ." S
time to go home as the woods could be dangerous .
embarrassed . He had made such a big mess that he
never forget that story and the joy it brought her .
Let 's take turns wearing the sweater ," L ila said
lick and nib ble of the sweet treat . Rachel was
their mess . They saw the book and the rings .
and dots . They mixed the colors and made new ones

INTERVAL 13.618 - 28.838
CONTAINS 7.170%

says . She puts her tower next to Ben 's house
loved very much . Her doll was gifted to her by
safe place . No more shouting . â The rabbit
rock . They made more tunnels and found more worms .
bear , she began to smile . The

INTERVAL -1.603 - 13.618
CONTAINS 54.153%

read math ! She quickly sat down and
sad , but he didn â t give up
to feel the wet and heavy rain . She was so
, and started to chew it . The line
The sun rises in the morning and sets in the evening

INTERVAL -16.824 - -1.603
CONTAINS 33.153%

happy and content . Jimmy 's heart filled with joy ,
But one friend did not come . His name was
the bread cr umbs and had lots of fun .
I love this tree , I love this tree !
. Jerry realized that sometimes being tall can help

INTERVAL -32.044 - -16.824
CONTAINS 5.524%

like cake . Today is their birthday . Mom made a
easier way to go around the block . Joe
write a sorry letter to the teacher . They draw a
moved closer and closer to the tree . Then , a
difference between the lemon and the orange . Tom
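
NOTE ON THE INTERVALS

The four INTERVAL blocks above appear to split the range from MIN to MAX into equal-width bins. This is inferred from the reported numbers, not stated by the tool that produced the dump. The sketch below recomputes the edges from the rounded MIN and MAX values and lists the CONTAINS share for each bin. The names `interval_edges`, `MIN_ACT`, `MAX_ACT` and `shares` are illustrative only. Because the inputs are already rounded to three decimals, recomputed edges can differ from the reported ones in the last digit.

```python
# Reported extremes, copied from the MIN and MAX lines of the dump.
MIN_ACT = -32.044
MAX_ACT = 28.838

def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi.

    Assumption: the dashboard uses equal-width bins over [MIN, MAX].
    """
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

edges = interval_edges(MIN_ACT, MAX_ACT)

# CONTAINS percentages from the dump, ordered from lowest bin to highest.
shares = [5.524, 33.153, 54.153, 7.170]
total_share = sum(shares)

for left, right, share in zip(edges, edges[1:], shares):
    print(f"{left:8.3f} to {right:8.3f}: {share:.3f}%")
print(f"total: {total_share:.3f}%")
```

Under this reading, the two middle bins hold most of the token positions, and the top bin (the one the TOP ACTIVATIONS examples come from) holds roughly 7% of them.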
Token they
Token ID484
Feature activation-9.22e-01

Pos logit contributions
oped+0.214
uts+0.209
oled+0.201
es+0.201
oot+0.198
Neg logit contributions
 laundry-0.331
 same-0.326
 attic-0.308
 task-0.300
 table-0.292
Token had
Token ID550
Feature activation-2.44e+00

Pos logit contributions
 same+0.018
 laundry+0.018
 task+0.017
 way+0.017
 your+0.017
Neg logit contributions
unch-0.018
oped-0.018
uts-0.018
oled-0.017
oot-0.017
Token reached
Token ID4251
Feature activation+8.39e-01

Pos logit contributions
 laundry+0.050
 task+0.046
 same+0.046
 attic+0.046
 computer+0.044
Neg logit contributions
oped-0.045
uts-0.044
oled-0.042
es-0.041
owing-0.041
Token an
Token ID281
Feature activation-1.35e+01

Pos logit contributions
es+0.017
owing+0.015
oped+0.015
oot+0.015
uts+0.015
Neg logit contributions
 laundry-0.013
 task-0.013
 same-0.012
 attic-0.011
 proper-0.011
Token imp
Token ID848
Feature activation+1.28e+01

Pos logit contributions
 laundry+0.200
 same+0.171
 your+0.170
 homework+0.165
 proper+0.161
Neg logit contributions
oot-0.300
owing-0.298
unch-0.292
ring-0.290
aring-0.287
Tokenasse
Token ID21612
Feature activation+2.88e+01

Pos logit contributions
oped+0.212
uts+0.200
unch+0.198
oled+0.197
owing+0.189
Neg logit contributions
 laundry-0.299
 same-0.295
 task-0.280
 your-0.277
 homework-0.271
Token,
Token ID11
Feature activation-2.71e+00

Pos logit contributions
unch+0.386
uts+0.334
qu+0.332
oped+0.332
oled+0.326
Neg logit contributions
 laundry-0.670
 same-0.666
 way-0.646
 your-0.645
 computer-0.644
Token the
Token ID262
Feature activation-1.93e+01

Pos logit contributions
 laundry+0.044
 same+0.043
 task+0.041
 attic+0.039
 homework+0.039
Neg logit contributions
oped-0.061
uts-0.059
oled-0.059
es-0.058
unch-0.057
Token strong
Token ID1913
Feature activation+4.48e+00

Pos logit contributions
 laundry+0.275
 your+0.263
 proper+0.246
 attic+0.240
 our+0.239
Neg logit contributions
oped-0.428
oled-0.426
uts-0.416
unch-0.408
hed-0.408
Token man
Token ID582
Feature activation+1.72e+01

Pos logit contributions
oped+0.101
oled+0.097
hed+0.096
oot+0.095
uts+0.093
Neg logit contributions
 laundry-0.071
 your-0.063
 proper-0.062
 attic-0.062
 same-0.061
Token bow
Token ID9563
Feature activation+2.19e+01

Pos logit contributions
unch+0.367
uts+0.360
oped+0.352
oled+0.349
umbs+0.345
Neg logit contributions
 same-0.299
 laundry-0.295
 way-0.290
 paper-0.282
 your-0.275
Token had
Token ID550
Feature activation-1.94e+00

Pos logit contributions
 laundry+0.142
 same+0.137
 task+0.136
 homework+0.134
 attic+0.131
Neg logit contributions
oped-0.119
uts-0.111
es-0.108
oled-0.107
hed-0.103
Token become
Token ID1716
Feature activation-4.61e+00

Pos logit contributions
 laundry+0.035
 task+0.033
 same+0.032
 homework+0.032
 attic+0.031
Neg logit contributions
oped-0.041
uts-0.040
oled-0.039
unch-0.038
es-0.038
Token trapped
Token ID13640
Feature activation+9.92e+00

Pos logit contributions
 laundry+0.070
 same+0.067
 task+0.064
 homework+0.061
 attic+0.060
Neg logit contributions
oped-0.112
uts-0.108
oled-0.106
owing-0.105
unch-0.105
Token in
Token ID287
Feature activation-1.36e+01

Pos logit contributions
oped+0.158
unch+0.149
uts+0.147
oled+0.147
owing+0.136
Neg logit contributions
 laundry-0.239
 same-0.237
 your-0.227
 task-0.223
 way-0.219
Token the
Token ID262
Feature activation-1.94e+01

Pos logit contributions
 laundry+0.149
 same+0.144
 task+0.134
 homework+0.129
 proper+0.121
Neg logit contributions
oped-0.377
uts-0.367
oled-0.363
unch-0.358
es-0.355
Token bog
Token ID22922
Feature activation+2.87e+01

Pos logit contributions
 laundry+0.270
 homework+0.264
 your+0.246
 task+0.245
 same+0.238
Neg logit contributions
oped-0.452
oled-0.427
m-0.424
owing-0.423
unch-0.419
Token.
Token ID13
Feature activation+6.99e+00

Pos logit contributions
oped+0.488
unch+0.448
oled+0.446
qu+0.394
ann+0.393
Neg logit contributions
 same-0.662
 way-0.660
 laundry-0.650
 table-0.644
 your-0.609
Token Rose
Token ID8049
Feature activation+1.08e+01

Pos logit contributions
oped+0.159
uts+0.151
unch+0.151
oled+0.149
owing+0.148
Neg logit contributions
 laundry-0.120
 same-0.117
 your-0.109
 task-0.108
 attic-0.105
Token thought
Token ID1807
Feature activation+1.44e+01

Pos logit contributions
unch+0.228
oped+0.228
uts+0.223
oled+0.219
owing+0.217
Neg logit contributions
 laundry-0.193
 same-0.191
 way-0.179
 your-0.178
 task-0.176
Token and
Token ID290
Feature activation-8.87e+00

Pos logit contributions
oped+0.244
uts+0.243
oled+0.236
oot+0.235
es+0.230
Neg logit contributions
 same-0.305
 laundry-0.297
 task-0.287
 attic-0.285
 garage-0.281
Token thought
Token ID1807
Feature activation+1.43e+01

Pos logit contributions
 laundry+0.181
 same+0.177
 your+0.167
 task+0.165
 way+0.162
Neg logit contributions
oped-0.174
uts-0.166
unch-0.164
oled-0.163
owing-0.159
Token the
Token ID262
Feature activation-1.90e+01

Pos logit contributions
oped+0.278
uts+0.266
oled+0.264
unch+0.264
owing+0.257
Neg logit contributions
 laundry-0.186
 same-0.181
 task-0.167
 your-0.166
 way-0.160
Token animal
Token ID5044
Feature activation+1.33e+01

Pos logit contributions
 laundry+0.298
 same+0.276
 your+0.265
 homework+0.264
 task+0.261
Neg logit contributions
oped-0.456
uts-0.433
oled-0.433
unch-0.431
owing-0.425
Token out
Token ID503
Feature activation-2.63e+00

Pos logit contributions
oped+0.253
uts+0.241
oled+0.236
unch+0.236
owing+0.231
Neg logit contributions
 laundry-0.278
 same-0.270
 task-0.255
 your-0.254
 attic-0.248
Token of
Token ID286
Feature activation-1.48e+01

Pos logit contributions
 laundry+0.068
 same+0.068
 your+0.065
 task+0.064
 way+0.063
Neg logit contributions
oped-0.036
unch-0.034
oled-0.034
uts-0.034
owing-0.031
Token the
Token ID262
Feature activation-1.90e+01

Pos logit contributions
 laundry+0.185
 same+0.182
 homework+0.181
 task+0.178
 attic+0.162
Neg logit contributions
oped-0.366
oled-0.345
ann-0.343
es-0.338
unch-0.335
Token bog
Token ID22922
Feature activation+2.87e+01

Pos logit contributions
 laundry+0.270
 homework+0.266
 same+0.247
 your+0.241
 math+0.240
Neg logit contributions
oped-0.436
ann-0.414
owing-0.410
oled-0.406
unch-0.404
Token.
Token ID13
Feature activation+7.67e+00

Pos logit contributions
oped+0.498
oled+0.472
unch+0.471
ann+0.416
oling+0.409
Neg logit contributions
 way-0.644
 same-0.630
 laundry-0.620
 table-0.618
 paper-0.599
Token The
Token ID383
Feature activation-9.89e+00

Pos logit contributions
oped+0.161
uts+0.154
unch+0.153
oled+0.152
owing+0.151
Neg logit contributions
 laundry-0.144
 same-0.140
 your-0.131
 task-0.130
 attic-0.126
Token animal
Token ID5044
Feature activation+1.34e+01

Pos logit contributions
 laundry+0.151
 same+0.146
 task+0.135
 your+0.134
 attic+0.128
Neg logit contributions
oped-0.245
uts-0.235
oled-0.234
unch-0.233
owing-0.228
Token thanked
Token ID26280
Feature activation+6.23e+00

Pos logit contributions
oped+0.263
uts+0.249
oled+0.246
unch+0.244
owing+0.238
Neg logit contributions
 laundry-0.271
 same-0.264
 task-0.249
 your-0.248
 attic-0.242
Token Rose
Token ID8049
Feature activation+1.08e+01

Pos logit contributions
oped+0.161
oled+0.157
uts+0.154
es+0.153
oot+0.153
Neg logit contributions
 laundry-0.083
 same-0.079
 task-0.076
 attic-0.071
 way-0.069
Token.
Token ID13
Feature activation+7.26e+00

Pos logit contributions
oped+0.237
uts+0.227
oled+0.221
es+0.216
owing+0.212
Neg logit contributions
 laundry-0.342
 same-0.336
 task-0.323
 attic-0.309
 homework-0.309
Token
Token ID198
Feature activation+5.61e+00

Pos logit contributions
oped+0.146
owing+0.137
uts+0.137
oled+0.136
unch+0.135
Neg logit contributions
 laundry-0.142
 same-0.140
 your-0.133
 way-0.127
 task-0.127
Token
Token ID198
Feature activation+5.62e+00

Pos logit contributions
owing+0.122
ring+0.122
oped+0.113
ked+0.111
oled+0.108
Neg logit contributions
 laundry-0.098
 your-0.098
 same-0.093
 new-0.089
 way-0.089
TokenThe
Token ID464
Feature activation-1.15e+01

Pos logit contributions
owing+0.119
oped+0.116
uts+0.111
oot+0.110
oled+0.110
Neg logit contributions
 laundry-0.105
 same-0.098
 your-0.098
 task-0.093
 our-0.089
Token k
Token ID479
Feature activation+1.02e+01

Pos logit contributions
 laundry+0.168
 same+0.161
 your+0.155
 task+0.150
 attic+0.145
Neg logit contributions
oped-0.281
oled-0.269
uts-0.267
unch-0.267
owing-0.258
Tokenidd
Token ID1638
Feature activation+2.85e+01

Pos logit contributions
uts+0.118
oled+0.113
umbs+0.107
unch+0.104
ann+0.103
Neg logit contributions
 laundry-0.259
 same-0.256
 your-0.255
 attic-0.242
 garage-0.241
Tokeno
Token ID78
Feature activation+1.33e+01

Pos logit contributions
unch+0.307
oled+0.294
oped+0.292
uts+0.271
oling+0.263
Neg logit contributions
 same-0.773
 laundry-0.759
 computer-0.728
 way-0.724
 task-0.722
Token understood
Token ID7247
Feature activation+5.47e+00

Pos logit contributions
oped+0.269
uts+0.260
unch+0.260
oled+0.257
owing+0.249
Neg logit contributions
 laundry-0.253
 same-0.251
 task-0.234
 your-0.232
 way-0.229
Token the
Token ID262
Feature activation-1.86e+01

Pos logit contributions
oped+0.094
uts+0.089
oled+0.088
unch+0.088
owing+0.085
Neg logit contributions
 laundry-0.122
 same-0.121
 task-0.114
 your-0.112
 attic-0.111
Token message
Token ID3275
Feature activation+2.33e+01

Pos logit contributions
 laundry+0.234
 your+0.229
 same+0.219
 our+0.207
 attic+0.206
Neg logit contributions
oped-0.459
uts-0.450
oled-0.448
unch-0.443
owing-0.437
Token,
Token ID11
Feature activation-1.91e+00

Pos logit contributions
oped+0.384
uts+0.365
oled+0.357
owing+0.348
unch+0.345
Neg logit contributions
 laundry-0.536
 same-0.520
 task-0.495
 your-0.494
 attic-0.488
TokenOnce
Token ID7454
Feature activation+3.93e+00

Pos logit contributions
oot+0.298
owing+0.297
aring+0.297
uts+0.289
ring+0.288
Neg logit contributions
 same-0.227
 laundry-0.217
 way-0.210
 attic-0.209
 your-0.207
Token there
Token ID612
Feature activation+3.57e+00

Pos logit contributions
oped+0.091
uts+0.087
oled+0.086
unch+0.086
owing+0.084
Neg logit contributions
 laundry-0.065
 same-0.064
 your-0.060
 task-0.059
 way-0.058
Token was
Token ID373
Feature activation-6.13e+00

Pos logit contributions
oped+0.077
uts+0.075
oled+0.073
unch+0.073
owing+0.071
Neg logit contributions
 laundry-0.065
 same-0.065
 your-0.060
 way-0.060
 task-0.059
Token a
Token ID257
Feature activation-1.27e+01

Pos logit contributions
 laundry+0.079
 same+0.076
 task+0.071
 homework+0.067
 your+0.067
Neg logit contributions
oped-0.162
unch-0.158
uts-0.156
oled-0.155
owing-0.151
Token k
Token ID479
Feature activation+9.81e+00

Pos logit contributions
 laundry+0.151
 same+0.143
 task+0.128
 your+0.128
 homework+0.125
Neg logit contributions
oped-0.353
unch-0.342
oled-0.340
uts-0.338
owing-0.329
Tokenidd
Token ID1638
Feature activation+2.82e+01

Pos logit contributions
uts+0.139
oled+0.133
oped+0.121
qu+0.116
umbs+0.116
Neg logit contributions
 your-0.243
 same-0.235
 laundry-0.230
 attic-0.229
 my-0.222
Tokeno
Token ID78
Feature activation+1.24e+01

Pos logit contributions
oped+0.361
oled+0.358
unch+0.327
oling+0.325
ement+0.323
Neg logit contributions
 same-0.718
 computer-0.681
 your-0.680
 laundry-0.675
 job-0.668
Token who
Token ID508
Feature activation-5.20e-01

Pos logit contributions
oped+0.255
oled+0.248
uts+0.247
umbs+0.236
omed+0.225
Neg logit contributions
 your-0.247
 same-0.240
 laundry-0.227
 way-0.226
 paper-0.225
Token loved
Token ID6151
Feature activation+1.53e+00

Pos logit contributions
 laundry+0.009
 same+0.009
 task+0.008
 your+0.008
 homework+0.008
Neg logit contributions
oped-0.012
uts-0.011
oled-0.011
unch-0.011
owing-0.011
Token to
Token ID284
Feature activation-1.42e+01

Pos logit contributions
uts+0.026
unch+0.025
oped+0.025
owing+0.024
oot+0.023
Neg logit contributions
 same-0.031
 laundry-0.030
 task-0.030
 attic-0.029
 homework-0.029
Token guess
Token ID4724
Feature activation+1.46e+01

Pos logit contributions
 same+0.205
 laundry+0.203
 attic+0.186
 your+0.183
 proper+0.182
Neg logit contributions
oped-0.338
unch-0.335
uts-0.328
owing-0.317
oot-0.314
Token one
Token ID530
Feature activation+3.73e-02

Pos logit contributions
oped+0.067
uts+0.061
oled+0.061
oot+0.060
unch+0.059
Neg logit contributions
 laundry-0.056
 same-0.054
 task-0.052
 attic-0.052
 proper-0.051
Token day
Token ID1110
Feature activation+8.84e+00

Pos logit contributions
oped+0.001
unch+0.001
owing+0.001
uts+0.001
oled+0.001
Neg logit contributions
 laundry-0.001
 same-0.001
 your-0.001
 attic-0.001
 task-0.001
Token,
Token ID11
Feature activation-2.89e+00

Pos logit contributions
oped+0.171
uts+0.161
oled+0.161
unch+0.156
es+0.153
Neg logit contributions
 laundry-0.181
 same-0.178
 task-0.169
 your-0.164
 homework-0.164
Token the
Token ID262
Feature activation-1.94e+01

Pos logit contributions
 laundry+0.046
 same+0.046
 task+0.045
 homework+0.042
 proper+0.042
Neg logit contributions
oped-0.065
uts-0.063
oled-0.062
es-0.061
unch-0.061
Token k
Token ID479
Feature activation+9.56e+00

Pos logit contributions
 laundry+0.272
 same+0.259
 your+0.254
 task+0.243
 homework+0.237
Neg logit contributions
oped-0.476
oled-0.457
uts-0.455
unch-0.452
owing-0.438
Tokenidd
Token ID1638
Feature activation+2.81e+01

Pos logit contributions
uts+0.113
oled+0.113
unch+0.101
umbs+0.100
ann+0.100
Neg logit contributions
 same-0.235
 laundry-0.232
 your-0.230
 attic-0.218
 garage-0.217
Tokeno
Token ID78
Feature activation+1.27e+01

Pos logit contributions
unch+0.269
oped+0.247
oled+0.246
qu+0.227
uts+0.224
Neg logit contributions
 same-0.799
 laundry-0.777
 computer-0.748
 way-0.747
 task-0.737
Tokenâ
Token ID22940
Feature activation+4.94e+00

Pos logit contributions
oped+0.256
unch+0.247
uts+0.246
oled+0.243
owing+0.237
Neg logit contributions
 laundry-0.245
 same-0.242
 task-0.224
 your-0.223
 way-0.221
Token
Token ID26391
Feature activation+5.46e+00

Pos logit contributions
oped+0.109
uts+0.103
oled+0.102
unch+0.102
owing+0.100
Neg logit contributions
 laundry-0.089
 same-0.086
 task-0.081
 your-0.081
 homework-0.078
Token
Token ID8151
Feature activation+2.10e+01

Pos logit contributions
pt+0.086
oped+0.085
oot+0.085
ann+0.085
owing+0.085
Neg logit contributions
 same-0.112
 laundry-0.112
 way-0.110
 your-0.106
 homework-0.105
Tokens
Token ID82
Feature activation-7.38e+00

Pos logit contributions
qu+0.261
oing+0.252
amb+0.252
ring+0.250
ez+0.248
Neg logit contributions
 way-0.498
 same-0.494
 your-0.486
 laundry-0.475
 table-0.457
Token hard
Token ID1327
Feature activation+3.83e+00

Pos logit contributions
oped+0.230
uts+0.218
unch+0.216
oled+0.216
owing+0.209
Neg logit contributions
 laundry-0.226
 same-0.221
 task-0.207
 your-0.206
 homework-0.200
Token -
Token ID532
Feature activation-1.22e+00

Pos logit contributions
oped+0.065
uts+0.061
oled+0.060
unch+0.060
owing+0.058
Neg logit contributions
 laundry-0.088
 same-0.086
 task-0.082
 your-0.081
 attic-0.079
Token and
Token ID290
Feature activation-8.57e+00

Pos logit contributions
 laundry+0.024
 same+0.024
 task+0.023
 attic+0.022
 your+0.022
Neg logit contributions
oped-0.023
uts-0.022
oled-0.022
unch-0.022
owing-0.021
Token the
Token ID262
Feature activation-1.91e+01

Pos logit contributions
 laundry+0.165
 same+0.159
 task+0.154
 attic+0.154
 computer+0.148
Neg logit contributions
oped-0.160
uts-0.155
es-0.150
oled-0.150
owing-0.146
Token k
Token ID479
Feature activation+9.79e+00

Pos logit contributions
 laundry+0.246
 your+0.240
 same+0.230
 task+0.217
 our+0.216
Neg logit contributions
oped-0.464
oled-0.450
uts-0.444
unch-0.433
qu-0.429
Tokenidd
Token ID1638
Feature activation+2.79e+01

Pos logit contributions
oled+0.116
uts+0.113
unch+0.111
umbs+0.105
oped+0.103
Neg logit contributions
 same-0.247
 laundry-0.245
 your-0.238
 attic-0.224
 garage-0.223
Tokeno
Token ID78
Feature activation+1.29e+01

Pos logit contributions
unch+0.288
oled+0.269
oped+0.265
uts+0.236
oling+0.235
Neg logit contributions
 same-0.789
 laundry-0.766
 way-0.735
 task-0.730
 computer-0.728
Token soon
Token ID2582
Feature activation+5.72e+00

Pos logit contributions
oped+0.251
unch+0.242
uts+0.241
oled+0.239
owing+0.231
Neg logit contributions
 laundry-0.255
 same-0.253
 task-0.236
 your-0.234
 way-0.231
Token began
Token ID2540
Feature activation+3.48e+00

Pos logit contributions
oped+0.116
uts+0.111
oled+0.110
unch+0.107
owing+0.105
Neg logit contributions
 laundry-0.112
 same-0.108
 task-0.103
 your-0.102
 attic-0.100
Token to
Token ID284
Feature activation-1.39e+01

Pos logit contributions
oped+0.072
uts+0.070
oled+0.068
unch+0.067
owing+0.067
Neg logit contributions
 laundry-0.066
 same-0.063
 task-0.060
 your-0.059
 attic-0.058
Token do
Token ID466
Feature activation+8.91e-01

Pos logit contributions
 laundry+0.176
 same+0.165
 your+0.153
 task+0.147
 attic+0.146
Neg logit contributions
oped-0.367
oled-0.352
uts-0.351
unch-0.348
owing-0.345
Token grandma
Token ID49890
Feature activation+1.12e+01

Pos logit contributions
 laundry+0.220
 same+0.207
 your+0.200
 task+0.191
 attic+0.184
Neg logit contributions
oped-0.341
uts-0.328
owing-0.322
oot-0.321
unch-0.319
Token's
Token ID338
Feature activation-1.29e+01

Pos logit contributions
oped+0.205
uts+0.198
unch+0.196
oled+0.192
oot+0.182
Neg logit contributions
 laundry-0.246
 same-0.237
 your-0.230
 way-0.225
 task-0.225
Token house
Token ID2156
Feature activation+1.76e+01

Pos logit contributions
 laundry+0.162
 same+0.155
 your+0.150
 task+0.144
 homework+0.139
Neg logit contributions
oped-0.324
uts-0.315
owing-0.312
oled-0.312
unch-0.308
Token for
Token ID329
Feature activation-9.38e+00

Pos logit contributions
oped+0.273
uts+0.255
oled+0.252
unch+0.252
owing+0.241
Neg logit contributions
 laundry-0.428
 same-0.420
 task-0.399
 your-0.398
 way-0.389
Token safe
Token ID3338
Feature activation+8.09e+00

Pos logit contributions
 laundry+0.128
 same+0.124
 task+0.114
 your+0.111
 attic+0.108
Neg logit contributions
oped-0.246
uts-0.237
oled-0.235
unch-0.235
owing-0.229
Tokenkeeping
Token ID19934
Feature activation+2.78e+01

Pos logit contributions
oped+0.105
uts+0.097
unch+0.095
oled+0.095
owing+0.090
Neg logit contributions
 laundry-0.219
 same-0.215
 task-0.205
 your-0.205
 way-0.201
Token.
Token ID13
Feature activation+6.93e+00

Pos logit contributions
oped+0.513
unch+0.480
qu+0.449
oled+0.447
oing+0.446
Neg logit contributions
 way-0.613
 laundry-0.602
 table-0.585
 same-0.572
 homework-0.569
Token
Token ID220
Feature activation+1.30e+01

Pos logit contributions
oped+0.140
uts+0.133
oled+0.132
unch+0.132
owing+0.131
Neg logit contributions
 laundry-0.136
 same-0.134
 your-0.128
 way-0.124
 task-0.123
Token
Token ID198
Feature activation+5.45e+00

Pos logit contributions
oped+0.245
owing+0.242
ring+0.237
oing+0.228
oled+0.228
Neg logit contributions
 your-0.251
 same-0.243
 laundry-0.236
 our-0.225
 garage-0.220
Token
Token ID198
Feature activation+5.48e+00

Pos logit contributions
owing+0.123
ring+0.114
oped+0.113
ement+0.107
ified+0.107
Neg logit contributions
 your-0.097
 same-0.092
 laundry-0.090
 way-0.087
 new-0.086
TokenWhen
Token ID2215
Feature activation+2.31e+00

Pos logit contributions
oped+0.112
owing+0.110
uts+0.106
oled+0.105
unch+0.103
Neg logit contributions
 laundry-0.106
 same-0.102
 your-0.100
 task-0.098
 computer-0.094
Token referee
Token ID22957
Feature activation+2.65e+01

Pos logit contributions
 laundry+0.279
 your+0.262
 proper+0.258
 homework+0.246
 task+0.245
Neg logit contributions
oled-0.443
uts-0.436
unch-0.436
oped-0.433
m-0.424
Token.
Token ID13
Feature activation+5.92e+00

Pos logit contributions
oped+0.504
unch+0.476
oled+0.465
uts+0.457
oot+0.449
Neg logit contributions
 same-0.548
 laundry-0.545
 your-0.544
 paper-0.521
 homework-0.517
Token
Token ID198
Feature activation+4.16e+00

Pos logit contributions
oped+0.109
uts+0.106
unch+0.104
oled+0.103
oot+0.097
Neg logit contributions
 laundry-0.128
 same-0.124
 task-0.119
 your-0.114
 way-0.114
Token
Token ID198
Feature activation+4.22e+00

Pos logit contributions
oped+0.081
owing+0.080
ring+0.076
oled+0.074
uts+0.072
Neg logit contributions
 your-0.086
 same-0.086
 laundry-0.083
 our-0.080
 new-0.079
TokenThe
Token ID464
Feature activation-1.28e+01

Pos logit contributions
oped+0.085
uts+0.080
oled+0.080
unch+0.080
es+0.075
Neg logit contributions
 laundry-0.085
 same-0.082
 task-0.079
 way-0.076
 attic-0.076
Token referee
Token ID22957
Feature activation+2.65e+01

Pos logit contributions
 proper+0.199
 laundry+0.197
 your+0.197
 attic+0.180
 our+0.176
Neg logit contributions
oled-0.258
unch-0.254
m-0.253
oped-0.249
uts-0.248
Token is
Token ID318
Feature activation-6.76e+00

Pos logit contributions
oped+0.662
uts+0.632
oled+0.631
unch+0.626
owing+0.611
Neg logit contributions
 laundry-0.398
 same-0.380
 task-0.353
 your-0.348
 attic-0.340
Token calm
Token ID9480
Feature activation+9.14e+00

Pos logit contributions
 laundry+0.107
 same+0.103
 task+0.097
 attic+0.093
 your+0.093
Neg logit contributions
oped-0.163
uts-0.157
oled-0.156
unch-0.155
owing-0.152
Token and
Token ID290
Feature activation-9.48e+00

Pos logit contributions
oped+0.203
unch+0.193
oled+0.185
qu+0.184
uts+0.182
Neg logit contributions
 same-0.178
 laundry-0.170
 your-0.167
 way-0.163
 paper-0.156
Token kind
Token ID1611
Feature activation+8.57e+00

Pos logit contributions
 laundry+0.165
 attic+0.160
 same+0.149
 basement+0.148
 task+0.147
Neg logit contributions
oped-0.200
aring-0.188
oled-0.187
uts-0.187
unch-0.186
Token.
Token ID13
Feature activation+6.02e+00

Pos logit contributions
oped+0.164
unch+0.151
oled+0.144
ann+0.135
oot+0.133
Neg logit contributions
 same-0.194
 laundry-0.186
 paper-0.186
 your-0.185
 way-0.179
Token the
Token ID262
Feature activation-2.03e+01

Pos logit contributions
oped+0.095
es+0.093
ed+0.089
us+0.088
hed+0.087
Neg logit contributions
 laundry-0.106
 task-0.102
 attic-0.096
 same-0.095
 computer-0.095
Token ball
Token ID2613
Feature activation+1.93e+01

Pos logit contributions
 laundry+0.311
 your+0.310
 homework+0.292
 same+0.291
 proper+0.289
Neg logit contributions
oped-0.413
uts-0.411
unch-0.409
oled-0.409
m-0.395
Token back
Token ID736
Feature activation-4.05e+00

Pos logit contributions
uts+0.321
oped+0.310
owing+0.298
iled+0.296
es+0.294
Neg logit contributions
 laundry-0.434
 same-0.395
 task-0.394
 attic-0.392
 homework-0.384
Token to
Token ID284
Feature activation-1.48e+01

Pos logit contributions
 laundry+0.077
 same+0.075
 task+0.072
 homework+0.067
 attic+0.067
Neg logit contributions
oped-0.083
uts-0.081
owing-0.078
oled-0.077
unch-0.077
Token the
Token ID262
Feature activation-2.02e+01

Pos logit contributions
 laundry+0.206
 task+0.194
 same+0.186
 attic+0.179
 proper+0.177
Neg logit contributions
oped-0.363
oled-0.359
unch-0.357
es-0.355
uts-0.354
Token referee
Token ID22957
Feature activation+2.65e+01

Pos logit contributions
 laundry+0.279
 your+0.262
 proper+0.258
 homework+0.246
 task+0.245
Neg logit contributions
oled-0.443
uts-0.436
unch-0.436
oped-0.433
m-0.424
Token.
Token ID13
Feature activation+5.92e+00

Pos logit contributions
oped+0.504
unch+0.476
oled+0.465
uts+0.457
oot+0.449
Neg logit contributions
 same-0.548
 laundry-0.545
 your-0.544
 paper-0.521
 homework-0.517
Token
Token ID198
Feature activation+4.16e+00

Pos logit contributions
oped+0.109
uts+0.106
unch+0.104
oled+0.103
oot+0.097
Neg logit contributions
 laundry-0.128
 same-0.124
 task-0.119
 your-0.114
 way-0.114
Token
Token ID198
Feature activation+4.22e+00

Pos logit contributions
oped+0.081
owing+0.080
ring+0.076
oled+0.074
uts+0.072
Neg logit contributions
 your-0.086
 same-0.086
 laundry-0.083
 our-0.080
 new-0.079
TokenThe
Token ID464
Feature activation-1.28e+01

Pos logit contributions
oped+0.085
uts+0.080
oled+0.080
unch+0.080
es+0.075
Neg logit contributions
 laundry-0.085
 same-0.082
 task-0.079
 way-0.076
 attic-0.076
Token referee
Token ID22957
Feature activation+2.65e+01

Pos logit contributions
 proper+0.199
 laundry+0.197
 your+0.197
 attic+0.180
 our+0.176
Neg logit contributions
oled-0.258
unch-0.254
m-0.253
oped-0.249
uts-0.248
Token bring
Token ID2222
Feature activation+3.34e+00

Pos logit contributions
 laundry+0.205
 same+0.189
 attic+0.181
 task+0.179
 homework+0.176
Neg logit contributions
oped-0.363
uts-0.349
oot-0.347
owing-0.346
unch-0.345
Token the
Token ID262
Feature activation-2.00e+01

Pos logit contributions
oped+0.076
uts+0.075
es+0.075
owing+0.073
oot+0.072
Neg logit contributions
 laundry-0.049
 task-0.046
 same-0.045
 attic-0.045
 homework-0.044
Token quarrel
Token ID41998
Feature activation+2.00e+01

Pos logit contributions
 laundry+0.271
 your+0.245
 same+0.236
 homework+0.236
 attic+0.232
Neg logit contributions
oped-0.474
owing-0.465
uts-0.463
unch-0.454
oled-0.453
Token to
Token ID284
Feature activation-1.47e+01

Pos logit contributions
oped+0.304
uts+0.287
oled+0.280
unch+0.278
owing+0.269
Neg logit contributions
 laundry-0.493
 same-0.482
 task-0.460
 your-0.458
 homework-0.447
Token a
Token ID257
Feature activation-1.25e+01

Pos logit contributions
 laundry+0.201
 task+0.189
 same+0.189
 proper+0.175
 attic+0.174
Neg logit contributions
uts-0.347
oped-0.343
es-0.340
oot-0.336
oled-0.335
Token resolution
Token ID6323
Feature activation+2.63e+01

Pos logit contributions
 laundry+0.159
 same+0.142
 your+0.134
 homework+0.134
 attic+0.134
Neg logit contributions
oped-0.311
oled-0.305
unch-0.300
uts-0.299
qu-0.293
Token,
Token ID11
Feature activation-3.11e+00

Pos logit contributions
oped+0.441
uts+0.412
unch+0.409
oled+0.409
owing+0.392
Neg logit contributions
 laundry-0.607
 same-0.594
 your-0.565
 task-0.563
 way-0.547
Token until
Token ID1566
Feature activation+3.02e+00

Pos logit contributions
 laundry+0.074
 same+0.071
 task+0.070
 homework+0.068
 attic+0.067
Neg logit contributions
oped-0.049
oled-0.046
uts-0.046
unch-0.045
owing-0.045
Token everyone
Token ID2506
Feature activation+7.77e+00

Pos logit contributions
oped+0.059
oled+0.054
uts+0.054
es+0.053
hed+0.052
Neg logit contributions
 laundry-0.055
 same-0.054
 proper-0.052
 task-0.052
 computer-0.051
Token was
Token ID373
Feature activation-6.31e+00

Pos logit contributions
oped+0.157
unch+0.152
uts+0.152
oled+0.150
owing+0.146
Neg logit contributions
 laundry-0.148
 same-0.146
 your-0.138
 task-0.136
 way-0.135
Token happy
Token ID3772
Feature activation+5.45e+00

Pos logit contributions
 laundry+0.111
 same+0.110
 your+0.104
 way+0.101
 task+0.098
Neg logit contributions
oped-0.142
uts-0.135
oled-0.133
unch-0.133
oot-0.130
Token into
Token ID656
Feature activation-1.40e+01

Pos logit contributions
oped+0.068
uts+0.065
oled+0.064
unch+0.064
owing+0.062
Neg logit contributions
 laundry-0.070
 same-0.068
 task-0.064
 your-0.064
 attic-0.062
Token the
Token ID262
Feature activation-1.99e+01

Pos logit contributions
 laundry+0.163
 same+0.161
 task+0.147
 homework+0.140
 attic+0.136
Neg logit contributions
oped-0.365
oled-0.358
unch-0.357
uts-0.357
es-0.351
Token net
Token ID2010
Feature activation+1.93e+01

Pos logit contributions
 laundry+0.254
 your+0.243
 homework+0.229
 task+0.224
 brushes+0.213
Neg logit contributions
hed-0.455
oped-0.408
oled-0.405
m-0.396
umbs-0.393
Token!
Token ID0
Feature activation+4.35e+00

Pos logit contributions
oped+0.259
oled+0.239
unch+0.236
uts+0.231
owing+0.220
Neg logit contributions
 same-0.515
 laundry-0.508
 way-0.489
 your-0.489
 task-0.486
Token The
Token ID383
Feature activation-1.14e+01

Pos logit contributions
oped+0.091
owing+0.089
oled+0.085
uts+0.084
es+0.084
Neg logit contributions
 laundry-0.081
 same-0.081
 your-0.076
 task-0.074
 way-0.073
Token referee
Token ID22957
Feature activation+2.63e+01

Pos logit contributions
 laundry+0.180
 proper+0.172
 your+0.165
 attic+0.161
 homework+0.156
Neg logit contributions
uts-0.225
unch-0.218
hed-0.216
umbs-0.216
m-0.213
Token said
Token ID531
Feature activation+8.39e+00

Pos logit contributions
oped+0.547
unch+0.545
uts+0.539
owing+0.529
oled+0.519
Neg logit contributions
 same-0.466
 laundry-0.464
 way-0.434
 paper-0.430
 task-0.427
Token it
Token ID340
Feature activation+1.45e+00

Pos logit contributions
oped+0.154
uts+0.150
oled+0.147
unch+0.144
oot+0.140
Neg logit contributions
 laundry-0.181
 same-0.176
 task-0.168
 attic-0.168
 way-0.161
Token was
Token ID373
Feature activation-6.84e+00

Pos logit contributions
oped+0.029
uts+0.028
oled+0.027
unch+0.027
owing+0.026
Neg logit contributions
 laundry-0.029
 same-0.028
 task-0.026
 your-0.026
 attic-0.026
Token a
Token ID257
Feature activation-1.33e+01

Pos logit contributions
 laundry+0.117
 attic+0.108
 same+0.106
 task+0.101
 table+0.099
Neg logit contributions
oped-0.148
uts-0.146
oled-0.143
unch-0.141
oot-0.140
Token goal
Token ID3061
Feature activation+1.63e+01

Pos logit contributions
 laundry+0.199
 attic+0.177
 brushes+0.160
 your+0.160
 same+0.159
Neg logit contributions
oped-0.291
oled-0.286
unch-0.280
uts-0.273
oling-0.271
Token
Token ID26391
Feature activation+6.85e+00

Pos logit contributions
oped+0.131
uts+0.130
oled+0.126
unch+0.125
es+0.121
Neg logit contributions
 laundry-0.098
 same-0.096
 task-0.090
 your-0.089
 way-0.086
Token
Token ID8151
Feature activation+2.23e+01

Pos logit contributions
oped+0.111
oot+0.110
owing+0.107
oled+0.106
ann+0.106
Neg logit contributions
 laundry-0.150
 your-0.145
 same-0.141
 homework-0.139
 way-0.138
Tokens
Token ID82
Feature activation-6.49e+00

Pos logit contributions
ez+0.307
ring+0.299
qu+0.297
ann+0.296
oled+0.291
Neg logit contributions
 your-0.521
 way-0.494
 same-0.485
 our-0.481
 my-0.467
Token make
Token ID787
Feature activation-1.16e+00

Pos logit contributions
 laundry+0.087
 same+0.083
 task+0.076
 your+0.074
 attic+0.073
Neg logit contributions
oped-0.169
uts-0.162
oled-0.161
unch-0.160
owing-0.159
Token a
Token ID257
Feature activation-1.10e+01

Pos logit contributions
 laundry+0.019
 same+0.018
 task+0.017
 your+0.017
 attic+0.016
Neg logit contributions
oped-0.028
uts-0.026
unch-0.026
oled-0.026
owing-0.025
Token deal
Token ID1730
Feature activation+2.63e+01

Pos logit contributions
 attic+0.128
 same+0.126
 laundry+0.123
 your+0.123
 puzzles+0.112
Neg logit contributions
oped-0.256
unch-0.255
oing-0.243
m-0.240
owing-0.240
Token.
Token ID13
Feature activation+8.31e+00

Pos logit contributions
oped+0.537
oled+0.503
ann+0.457
unch+0.449
oling+0.442
Neg logit contributions
 your-0.580
 laundry-0.545
 our-0.541
 my-0.519
 computer-0.508
Token You
Token ID921
Feature activation+7.86e+00

Pos logit contributions
oped+0.178
owing+0.171
uts+0.169
unch+0.167
oot+0.167
Neg logit contributions
 laundry-0.150
 your-0.149
 same-0.143
 my-0.136
 our-0.135
Token help
Token ID1037
Feature activation+4.40e+00

Pos logit contributions
oped+0.178
uts+0.171
unch+0.170
oled+0.169
owing+0.164
Neg logit contributions
 laundry-0.136
 same-0.133
 task-0.124
 your-0.123
 way-0.119
Token me
Token ID502
Feature activation+2.97e-01

Pos logit contributions
oped+0.109
uts+0.105
unch+0.104
oled+0.104
owing+0.102
Neg logit contributions
 laundry-0.065
 same-0.064
 task-0.059
 your-0.057
 attic-0.057
Token organize
Token ID16481
Feature activation+7.89e+00

Pos logit contributions
oped+0.006
uts+0.006
oled+0.006
unch+0.006
owing+0.006
Neg logit contributions
 laundry-0.006
 same-0.006
 task-0.005
 your-0.005
 homework-0.005
Token.
Token ID13
Feature activation+6.93e+00

Pos logit contributions
oped+0.289
oled+0.289
ann+0.272
unch+0.268
oling+0.260
Neg logit contributions
 same-0.353
 laundry-0.351
 way-0.338
 your-0.332
 table-0.326
Token "
Token ID366
Feature activation+4.70e+00

Pos logit contributions
oped+0.144
uts+0.137
oled+0.135
unch+0.135
owing+0.131
Neg logit contributions
 laundry-0.133
 same-0.130
 task-0.122
 your-0.121
 homework-0.117
TokenLook
Token ID8567
Feature activation+1.24e+01

Pos logit contributions
uts+0.121
oot+0.118
owing+0.116
oped+0.114
unch+0.114
Neg logit contributions
 same-0.071
 laundry-0.071
 your-0.063
 attic-0.062
 sunny-0.059
Token,
Token ID11
Feature activation-2.13e+00

Pos logit contributions
oped+0.259
uts+0.249
oled+0.248
unch+0.243
oling+0.230
Neg logit contributions
 laundry-0.243
 same-0.236
 your-0.234
 task-0.220
 our-0.217
Token a
Token ID257
Feature activation-1.21e+01

Pos logit contributions
 laundry+0.029
 same+0.029
 task+0.028
 homework+0.027
 computer+0.026
Neg logit contributions
oped-0.050
uts-0.049
unch-0.049
oled-0.049
m-0.047
Token mask
Token ID9335
Feature activation+2.62e+01

Pos logit contributions
 laundry+0.155
 same+0.147
 homework+0.145
 your+0.139
 task+0.138
Neg logit contributions
oped-0.310
unch-0.297
oled-0.295
uts-0.289
m-0.282
Token!"
Token ID2474
Feature activation+1.00e+01

Pos logit contributions
oped+0.339
uts+0.314
oled+0.307
unch+0.303
owing+0.288
Neg logit contributions
 laundry-0.705
 same-0.687
 task-0.659
 your-0.657
 attic-0.646
Token Lucy
Token ID22162
Feature activation+5.63e+00

Pos logit contributions
oped+0.211
uts+0.201
oled+0.199
unch+0.199
owing+0.193
Neg logit contributions
 laundry-0.190
 same-0.185
 task-0.174
 your-0.173
 way-0.167
Token exclaimed
Token ID33552
Feature activation+1.23e+01

Pos logit contributions
oped+0.113
oled+0.113
unch+0.111
uts+0.109
umbs+0.106
Neg logit contributions
 laundry-0.107
 same-0.101
 paper-0.100
 way-0.100
 your-0.098
Token.
Token ID13
Feature activation+7.16e+00

Pos logit contributions
oped+0.207
unch+0.189
oled+0.187
umbs+0.183
qu+0.181
Neg logit contributions
 laundry-0.293
 same-0.277
 your-0.274
 task-0.267
 computer-0.265
Token Lucy
Token ID22162
Feature activation+5.63e+00

Pos logit contributions
oped+0.141
owing+0.132
oled+0.131
unch+0.131
uts+0.131
Neg logit contributions
 laundry-0.149
 same-0.144
 your-0.136
 way-0.134
 task-0.133
Token wet
Token ID9583
Feature activation+6.73e+00

Pos logit contributions
 laundry+0.034
 same+0.034
 your+0.031
 task+0.030
 way+0.030
Neg logit contributions
oped-0.049
uts-0.047
unch-0.047
oled-0.047
owing-0.046
Token and
Token ID290
Feature activation-1.03e+01

Pos logit contributions
oped+0.132
unch+0.126
uts+0.122
oled+0.120
oot+0.117
Neg logit contributions
 same-0.143
 laundry-0.143
 your-0.130
 way-0.130
 paper-0.129
Token the
Token ID262
Feature activation-2.08e+01

Pos logit contributions
 laundry+0.176
 task+0.169
 same+0.169
 attic+0.168
 homework+0.163
Neg logit contributions
oped-0.214
oled-0.197
uts-0.196
hed-0.190
m-0.189
Token motorcycle
Token ID18757
Feature activation+1.85e+01

Pos logit contributions
 laundry+0.296
 your+0.296
 homework+0.271
 task+0.271
 same+0.271
Neg logit contributions
oped-0.472
oled-0.461
hed-0.456
unch-0.448
uts-0.443
Token sk
Token ID1341
Feature activation+9.40e+00

Pos logit contributions
oped+0.405
unch+0.398
uts+0.391
oled+0.389
owing+0.382
Neg logit contributions
 laundry-0.323
 same-0.322
 task-0.298
 way-0.297
 paper-0.289
Tokenidd
Token ID1638
Feature activation+2.62e+01

Pos logit contributions
unch+0.130
oped+0.127
qu+0.127
uts+0.127
oled+0.124
Neg logit contributions
 laundry-0.218
 same-0.211
 homework-0.203
 attic-0.203
 garage-0.202
Tokened
Token ID276
Feature activation+2.41e+00

Pos logit contributions
unch+0.271
oled+0.230
ement+0.222
qu+0.220
oped+0.211
Neg logit contributions
 same-0.728
 laundry-0.678
 paper-0.677
 way-0.671
 computer-0.654
Token.
Token ID13
Feature activation+5.90e+00

Pos logit contributions
oped+0.043
unch+0.042
uts+0.041
oled+0.041
oot+0.039
Neg logit contributions
 same-0.053
 laundry-0.052
 your-0.049
 task-0.049
 way-0.049
Token It
Token ID632
Feature activation+3.25e+00

Pos logit contributions
oped+0.104
uts+0.097
unch+0.097
oled+0.096
owing+0.095
Neg logit contributions
 laundry-0.131
 same-0.130
 your-0.123
 task-0.122
 way-0.120
Token tipped
Token ID28395
Feature activation+6.33e+00

Pos logit contributions
oped+0.068
uts+0.064
oled+0.064
unch+0.063
owing+0.061
Neg logit contributions
 laundry-0.062
 same-0.061
 task-0.057
 your-0.057
 attic-0.055
Token over
Token ID625
Feature activation-7.77e+00

Pos logit contributions
oped+0.119
uts+0.113
unch+0.112
oled+0.112
owing+0.108
Neg logit contributions
 laundry-0.134
 same-0.131
 your-0.123
 task-0.123
 way-0.120
Token.
Token ID13
Feature activation+6.74e+00

Pos logit contributions
oped+0.329
oled+0.323
uts+0.317
unch+0.312
umbs+0.310
Neg logit contributions
 your-0.277
 laundry-0.259
 same-0.256
 paper-0.251
 computer-0.245
Token He
Token ID679
Feature activation+3.27e+00

Pos logit contributions
oped+0.123
uts+0.119
unch+0.118
oled+0.116
owing+0.113
Neg logit contributions
 laundry-0.145
 same-0.140
 task-0.134
 your-0.132
 homework-0.129
Token had
Token ID550
Feature activation-2.32e+00

Pos logit contributions
oped+0.051
es+0.048
uts+0.047
iled+0.047
oled+0.047
Neg logit contributions
 laundry-0.076
 same-0.069
 task-0.068
 port-0.067
 your-0.066
Token a
Token ID257
Feature activation-1.23e+01

Pos logit contributions
 task+0.045
 port+0.043
 laundry+0.042
 shelf+0.041
 tasks+0.040
Neg logit contributions
uts-0.040
ez-0.037
ed-0.036
es-0.036
ann-0.036
Token special
Token ID2041
Feature activation+2.26e+00

Pos logit contributions
 your+0.163
 laundry+0.163
 our+0.162
 fills+0.155
 tasks+0.154
Neg logit contributions
oped-0.212
 badly-0.210
pt-0.209
op-0.208
oc-0.205
Token mask
Token ID9335
Feature activation+2.61e+01

Pos logit contributions
oped+0.048
owing+0.048
unch+0.048
oot+0.047
uts+0.047
Neg logit contributions
 our-0.027
 your-0.026
 same-0.026
 laundry-0.025
 my-0.023
Token of
Token ID286
Feature activation-1.49e+01

Pos logit contributions
oped+0.444
uts+0.420
oled+0.414
unch+0.411
owing+0.398
Neg logit contributions
 laundry-0.598
 same-0.583
 task-0.555
 your-0.552
 attic-0.536
Token his
Token ID465
Feature activation-1.47e+01

Pos logit contributions
 laundry+0.179
 same+0.178
 task+0.177
 attic+0.162
 basement+0.159
Neg logit contributions
uts-0.357
unch-0.354
oled-0.349
oped-0.349
ann-0.347
Token own
Token ID898
Feature activation-4.77e+00

Pos logit contributions
 laundry+0.215
 same+0.202
 task+0.193
 your+0.192
 port+0.187
Neg logit contributions
uts-0.317
oped-0.312
owing-0.309
unch-0.302
es-0.298
Token.
Token ID13
Feature activation+6.96e+00

Pos logit contributions
 laundry+0.125
 same+0.122
 your+0.118
 task+0.117
 attic+0.114
Neg logit contributions
oped-0.059
uts-0.059
oled-0.055
owing-0.054
unch-0.054
Token He
Token ID679
Feature activation+3.33e+00

Pos logit contributions
oped+0.147
oled+0.138
uts+0.138
unch+0.135
owing+0.135
Neg logit contributions
 laundry-0.134
 same-0.131
 your-0.124
 task-0.123
 attic-0.120
Token saw
Token ID2497
Feature activation+4.93e-01

Pos logit contributions
oped+0.124
uts+0.120
unch+0.119
oled+0.118
oot+0.115
Neg logit contributions
 laundry-0.104
 same-0.102
 your-0.095
 task-0.094
 way-0.093
Token a
Token ID257
Feature activation-1.20e+01

Pos logit contributions
oped+0.011
unch+0.011
uts+0.011
oled+0.011
oot+0.011
Neg logit contributions
 task-0.007
 same-0.007
 laundry-0.007
 proper-0.007
 homework-0.007
Token lady
Token ID10846
Feature activation+1.74e+01

Pos logit contributions
 same+0.149
 laundry+0.147
 your+0.146
 homework+0.143
 proper+0.137
Neg logit contributions
oped-0.281
unch-0.278
oled-0.274
m-0.270
oing-0.270
Token wearing
Token ID5762
Feature activation+3.05e+00

Pos logit contributions
uts+0.372
oped+0.367
oled+0.355
umbs+0.342
unch+0.341
Neg logit contributions
 laundry-0.326
 same-0.308
 your-0.302
 paper-0.293
 attic-0.292
Token a
Token ID257
Feature activation-1.21e+01

Pos logit contributions
oped+0.072
uts+0.070
oled+0.069
unch+0.068
es+0.068
Neg logit contributions
 laundry-0.043
 task-0.042
 same-0.042
 way-0.039
 your-0.038
Token mask
Token ID9335
Feature activation+2.61e+01

Pos logit contributions
 laundry+0.143
 same+0.139
 tasks+0.132
 attic+0.129
 task+0.129
Neg logit contributions
oped-0.301
unch-0.285
uts-0.283
oled-0.281
oing-0.275
Token.
Token ID13
Feature activation+7.53e+00

Pos logit contributions
oped+0.505
unch+0.482
uts+0.471
oled+0.469
ann+0.467
Neg logit contributions
 same-0.540
 laundry-0.536
 attic-0.502
 computer-0.498
 paper-0.496
Token The
Token ID383
Feature activation-9.90e+00

Pos logit contributions
oped+0.133
uts+0.129
oled+0.127
unch+0.123
owing+0.119
Neg logit contributions
 laundry-0.163
 same-0.159
 task-0.153
 homework-0.151
 your-0.148
Token mask
Token ID9335
Feature activation+2.61e+01

Pos logit contributions
 laundry+0.156
 same+0.148
 your+0.147
 task+0.143
 our+0.142
Neg logit contributions
oped-0.222
uts-0.218
oled-0.216
unch-0.215
oot-0.210
Token was
Token ID373
Feature activation-5.73e+00

Pos logit contributions
oped+0.525
uts+0.499
oled+0.493
unch+0.489
owing+0.474
Neg logit contributions
 laundry-0.517
 same-0.502
 your-0.474
 task-0.474
 homework-0.458
Token deep
Token ID2769
Feature activation-1.16e+00

Pos logit contributions
 task+0.091
 laundry+0.090
 same+0.085
 tasks+0.084
 proper+0.083
Neg logit contributions
uts-0.121
oled-0.119
oped-0.119
unch-0.116
ann-0.115
Token wearing
Token ID5762
Feature activation+3.05e+00

Pos logit contributions
uts+0.372
oped+0.367
oled+0.355
umbs+0.342
unch+0.341
Neg logit contributions
 laundry-0.326
 same-0.308
 your-0.302
 paper-0.293
 attic-0.292
Token a
Token ID257
Feature activation-1.21e+01

Pos logit contributions
oped+0.072
uts+0.070
oled+0.069
unch+0.068
es+0.068
Neg logit contributions
 laundry-0.043
 task-0.042
 same-0.042
 way-0.039
 your-0.038
Token mask
Token ID9335
Feature activation+2.61e+01

Pos logit contributions
 laundry+0.143
 same+0.139
 tasks+0.132
 attic+0.129
 task+0.129
Neg logit contributions
oped-0.301
unch-0.285
uts-0.283
oled-0.281
oing-0.275
Token.
Token ID13
Feature activation+7.53e+00

Pos logit contributions
oped+0.505
unch+0.482
uts+0.471
oled+0.469
ann+0.467
Neg logit contributions
 same-0.540
 laundry-0.536
 attic-0.502
 computer-0.498
 paper-0.496
Token The
Token ID383
Feature activation-9.90e+00

Pos logit contributions
oped+0.133
uts+0.129
oled+0.127
unch+0.123
owing+0.119
Neg logit contributions
 laundry-0.163
 same-0.159
 task-0.153
 homework-0.151
 your-0.148
Token mask
Token ID9335
Feature activation+2.61e+01

Pos logit contributions
 laundry+0.156
 same+0.148
 your+0.147
 task+0.143
 our+0.142
Neg logit contributions
oped-0.222
uts-0.218
oled-0.216
unch-0.215
oot-0.210
Token was
Token ID373
Feature activation-5.73e+00

Pos logit contributions
oped+0.525
uts+0.499
oled+0.493
unch+0.489
owing+0.474
Neg logit contributions
 laundry-0.517
 same-0.502
 your-0.474
 task-0.474
 homework-0.458
Token deep
Token ID2769
Feature activation-1.16e+00

Pos logit contributions
 task+0.091
 laundry+0.090
 same+0.085
 tasks+0.084
 proper+0.083
Neg logit contributions
uts-0.121
oled-0.119
oped-0.119
unch-0.116
ann-0.115
Token blue
Token ID4171
Feature activation+5.34e+00

Pos logit contributions
 laundry+0.027
 same+0.027
 task+0.025
 your+0.025
 homework+0.025
Neg logit contributions
oped-0.019
uts-0.018
oled-0.018
unch-0.018
owing-0.017
Token and
Token ID290
Feature activation-8.20e+00

Pos logit contributions
oped+0.104
oled+0.096
uts+0.095
unch+0.094
qu+0.093
Neg logit contributions
 laundry-0.116
 same-0.113
 paper-0.105
 attic-0.105
 task-0.104
Token covered
Token ID5017
Feature activation+6.96e+00

Pos logit contributions
 laundry+0.144
 same+0.138
 task+0.137
 homework+0.132
 port+0.130
Neg logit contributions
oped-0.174
uts-0.165
oled-0.163
es-0.159
m-0.156
Token melodies
Token ID47077
Feature activation+1.86e+01

Pos logit contributions
 laundry+0.063
 same+0.062
 attic+0.059
 garage+0.058
 task+0.058
Neg logit contributions
es-0.072
oped-0.071
hed-0.068
oled-0.067
uts-0.067
Token and
Token ID290
Feature activation-9.38e+00

Pos logit contributions
oped+0.303
unch+0.282
uts+0.279
oled+0.277
owing+0.268
Neg logit contributions
 laundry-0.440
 same-0.432
 your-0.408
 task-0.408
 way-0.399
Token even
Token ID772
Feature activation-1.44e+00

Pos logit contributions
 laundry+0.154
 same+0.146
 task+0.142
 attic+0.142
 port+0.135
Neg logit contributions
oped-0.197
es-0.192
uts-0.191
oled-0.190
hed-0.183
Token her
Token ID607
Feature activation-1.39e+01

Pos logit contributions
 laundry+0.028
 attic+0.027
 task+0.026
 same+0.025
 basement+0.025
Neg logit contributions
oped-0.028
es-0.026
oled-0.026
hed-0.026
ed-0.025
Token own
Token ID898
Feature activation-5.69e+00

Pos logit contributions
 laundry+0.188
 same+0.174
 your+0.172
 attic+0.169
 task+0.165
Neg logit contributions
uts-0.337
oped-0.327
oled-0.313
es-0.310
owing-0.307
Token compositions
Token ID33543
Feature activation+2.61e+01

Pos logit contributions
 laundry+0.081
 same+0.077
 task+0.072
 your+0.071
 attic+0.071
Neg logit contributions
oped-0.141
uts-0.138
oled-0.135
unch-0.133
owing-0.131
Token.
Token ID13
Feature activation+6.49e+00

Pos logit contributions
oped+0.403
unch+0.386
oled+0.363
ann+0.353
owing+0.346
Neg logit contributions
 same-0.642
 laundry-0.638
 way-0.599
 task-0.598
 your-0.592
Token Emma
Token ID18966
Feature activation+4.72e+00

Pos logit contributions
oped+0.132
uts+0.125
unch+0.125
owing+0.123
oled+0.123
Neg logit contributions
 laundry-0.128
 same-0.126
 your-0.117
 task-0.117
 attic-0.114
Token was
Token ID373
Feature activation-6.68e+00

Pos logit contributions
oped+0.094
uts+0.088
oled+0.087
unch+0.086
owing+0.084
Neg logit contributions
 laundry-0.096
 same-0.093
 task-0.088
 your-0.087
 attic-0.085
Token so
Token ID523
Feature activation-9.35e+00

Pos logit contributions
 laundry+0.142
 task+0.133
 same+0.132
 attic+0.128
 homework+0.127
Neg logit contributions
oped-0.123
oled-0.117
uts-0.115
unch-0.112
owing-0.111
Token proud
Token ID6613
Feature activation+3.98e+00

Pos logit contributions
 laundry+0.192
 same+0.177
 task+0.173
 attic+0.172
 computer+0.168
Neg logit contributions
oped-0.184
uts-0.174
oled-0.173
unch-0.170
owing-0.165
Token slippery
Token ID32911
Feature activation+3.57e+00

Pos logit contributions
oped+0.326
uts+0.311
oled+0.308
unch+0.308
owing+0.300
Neg logit contributions
 laundry-0.267
 same-0.260
 task-0.243
 your-0.242
 attic-0.234
Token and
Token ID290
Feature activation-1.11e+01

Pos logit contributions
oped+0.057
uts+0.053
oled+0.052
unch+0.052
owing+0.050
Neg logit contributions
 laundry-0.086
 same-0.085
 task-0.081
 your-0.081
 way-0.078
Token the
Token ID262
Feature activation-2.15e+01

Pos logit contributions
 laundry+0.191
 attic+0.185
 computer+0.174
 same+0.174
 task+0.173
Neg logit contributions
oped-0.228
uts-0.226
oled-0.213
m-0.211
iled-0.209
Token car
Token ID1097
Feature activation+1.52e+01

Pos logit contributions
 your+0.260
 laundry+0.258
 same+0.244
 homework+0.243
 task+0.240
Neg logit contributions
oped-0.525
oled-0.507
m-0.498
uts-0.497
ann-0.491
Token sk
Token ID1341
Feature activation+8.49e+00

Pos logit contributions
oped+0.309
uts+0.294
oled+0.287
unch+0.280
owing+0.278
Neg logit contributions
 laundry-0.301
 same-0.288
 your-0.274
 task-0.273
 attic-0.268
Tokenidd
Token ID1638
Feature activation+2.61e+01

Pos logit contributions
oped+0.121
unch+0.118
uts+0.118
oled+0.116
qu+0.115
Neg logit contributions
 laundry-0.199
 same-0.190
 homework-0.183
 your-0.181
 port-0.178
Tokened
Token ID276
Feature activation+3.17e+00

Pos logit contributions
unch+0.257
oled+0.238
oped+0.238
ement+0.226
qu+0.208
Neg logit contributions
 same-0.744
 laundry-0.708
 your-0.684
 paper-0.680
 computer-0.675
Token and
Token ID290
Feature activation-1.01e+01

Pos logit contributions
oped+0.056
uts+0.053
oled+0.052
unch+0.052
owing+0.050
Neg logit contributions
 laundry-0.070
 same-0.069
 task-0.065
 your-0.065
 attic-0.063
Token hit
Token ID2277
Feature activation+9.07e+00

Pos logit contributions
 laundry+0.176
 same+0.172
 task+0.172
 attic+0.166
 homework+0.157
Neg logit contributions
oped-0.208
uts-0.200
es-0.196
oled-0.195
iled-0.187
Token a
Token ID257
Feature activation-1.36e+01

Pos logit contributions
oped+0.240
uts+0.230
oled+0.230
unch+0.226
es+0.224
Neg logit contributions
 laundry-0.116
 same-0.109
 task-0.106
 homework-0.099
 attic-0.099
Token tree
Token ID5509
Feature activation+1.41e+01

Pos logit contributions
 laundry+0.169
 same+0.162
 task+0.146
 your+0.146
 homework+0.145
Neg logit contributions
oped-0.366
uts-0.352
oled-0.351
unch-0.350
owing-0.337
Token.
Token ID13
Feature activation-3.27e+00

Pos logit contributions
oped+0.124
unch+0.113
oled+0.112
uts+0.111
oot+0.108
Neg logit contributions
 laundry-0.250
 same-0.246
 task-0.234
 your-0.234
 paper-0.230
Token He
Token ID679
Feature activation-5.75e+00

Pos logit contributions
 laundry+0.062
 same+0.061
 your+0.060
 attic+0.056
 task+0.055
Neg logit contributions
oped-0.067
owing-0.064
oled-0.063
unch-0.063
uts-0.063
Token pulls
Token ID16194
Feature activation+6.83e-01

Pos logit contributions
 laundry+0.124
 same+0.116
 task+0.115
 attic+0.110
 computer+0.108
Neg logit contributions
oped-0.108
es-0.098
oled-0.097
uts-0.095
hed-0.094
Token and
Token ID290
Feature activation-1.71e+01

Pos logit contributions
oped+0.017
uts+0.017
oled+0.016
unch+0.016
owing+0.016
Neg logit contributions
 laundry-0.010
 same-0.009
 task-0.009
 attic-0.009
 your-0.008
Token twists
Token ID30953
Feature activation+2.31e+00

Pos logit contributions
 laundry+0.325
 same+0.313
 attic+0.312
 task+0.309
 port+0.295
Neg logit contributions
oped-0.360
oled-0.334
es-0.328
uts-0.323
hed-0.322
Token the
Token ID262
Feature activation-2.75e+01

Pos logit contributions
oped+0.042
unch+0.041
owing+0.038
oled+0.038
uts+0.038
Neg logit contributions
 same-0.051
 laundry-0.050
 your-0.049
 paper-0.048
 way-0.047
Token hose
Token ID31489
Feature activation+1.34e+01

Pos logit contributions
 laundry+0.366
 same+0.334
 your+0.328
 attic+0.315
 task+0.310
Neg logit contributions
oped-0.705
uts-0.679
oled-0.673
owing-0.666
unch-0.665
Token.
Token ID13
Feature activation-1.75e+00

Pos logit contributions
oped+0.251
unch+0.246
oled+0.237
ann+0.220
uts+0.220
Neg logit contributions
 same-0.299
 laundry-0.285
 way-0.279
 paper-0.277
 task-0.273
Token He
Token ID679
Feature activation-3.65e+00

Pos logit contributions
 laundry+0.035
 same+0.034
 your+0.032
 task+0.031
 attic+0.031
Neg logit contributions
oped-0.035
owing-0.033
unch-0.033
uts-0.033
oled-0.033
Token sees
Token ID7224
Feature activation-5.62e+00

Pos logit contributions
 laundry+0.061
 task+0.055
 same+0.054
 attic+0.054
 computer+0.052
Neg logit contributions
oped-0.085
uts-0.080
es-0.080
oled-0.078
hed-0.077
Token a
Token ID257
Feature activation-1.76e+01

Pos logit contributions
 laundry+0.078
 task+0.073
 same+0.073
 homework+0.067
 way+0.066
Neg logit contributions
oped-0.141
uts-0.139
oled-0.137
unch-0.135
es-0.134
Token runs
Token ID4539
Feature activation-5.76e+00

Pos logit contributions
oped+0.013
owing+0.012
oled+0.012
uts+0.012
ring+0.012
Neg logit contributions
 laundry-0.011
 your-0.011
 same-0.010
 our-0.010
 paper-0.010
Token to
Token ID284
Feature activation-2.62e+01

Pos logit contributions
 same+0.110
 laundry+0.107
 your+0.104
 way+0.099
 task+0.098
Neg logit contributions
oped-0.123
unch-0.120
uts-0.117
oled-0.116
owing-0.113
Token them
Token ID606
Feature activation-7.53e+00

Pos logit contributions
 laundry+0.385
 attic+0.371
 task+0.364
 same+0.350
 proper+0.341
Neg logit contributions
oped-0.557
uts-0.544
oot-0.540
unch-0.529
qu-0.526
Token and
Token ID290
Feature activation-1.67e+01

Pos logit contributions
 same+0.165
 your+0.164
 way+0.163
 laundry+0.163
 table+0.156
Neg logit contributions
oped-0.144
oled-0.134
unch-0.131
ann-0.126
oling-0.121
Token grabs
Token ID22378
Feature activation-3.41e+00

Pos logit contributions
 task+0.366
 laundry+0.358
 attic+0.347
 same+0.342
 tasks+0.333
Neg logit contributions
oped-0.281
es-0.264
 stro-0.263
ed-0.246
uts-0.246
Token the
Token ID262
Feature activation-2.75e+01

Pos logit contributions
 laundry+0.056
 task+0.053
 same+0.053
 attic+0.051
 homework+0.050
Neg logit contributions
oped-0.077
uts-0.073
es-0.073
oled-0.073
hed-0.071
Token ball
Token ID2613
Feature activation+1.27e+01

Pos logit contributions
 laundry+0.352
 homework+0.330
 proper+0.329
 task+0.326
 same+0.324
Neg logit contributions
oped-0.616
uts-0.598
oled-0.595
unch-0.584
owing-0.577
Token.
Token ID13
Feature activation-1.67e+00

Pos logit contributions
oped+0.201
oled+0.187
unch+0.186
uts+0.177
ann+0.176
Neg logit contributions
 same-0.308
 laundry-0.304
 your-0.298
 task-0.289
 way-0.286
Token He
Token ID679
Feature activation-3.66e+00

Pos logit contributions
 laundry+0.032
 same+0.031
 your+0.031
 attic+0.028
 task+0.028
Neg logit contributions
oped-0.034
owing-0.033
oled-0.032
unch-0.032
uts-0.032
Token says
Token ID1139
Feature activation+4.50e+00

Pos logit contributions
 laundry+0.067
 attic+0.063
 task+0.062
 sunny+0.061
 job+0.060
Neg logit contributions
oped-0.078
es-0.073
hed-0.071
oled-0.070
uts-0.070
Token,
Token ID11
Feature activation-9.60e+00

Pos logit contributions
oped+0.080
uts+0.076
oled+0.075
unch+0.074
owing+0.071
Neg logit contributions
 laundry-0.100
 same-0.098
 task-0.093
 your-0.091
 attic-0.091
Token works
Token ID2499
Feature activation+5.35e-01

Pos logit contributions
uts+0.007
oped+0.007
unch+0.007
owing+0.007
oot+0.006
Neg logit contributions
 laundry-0.007
 same-0.007
 your-0.006
 task-0.006
 paper-0.006
Token!
Token ID0
Feature activation-7.68e+00

Pos logit contributions
oped+0.008
uts+0.008
oled+0.008
owing+0.007
es+0.007
Neg logit contributions
 laundry-0.013
 same-0.012
 task-0.012
 attic-0.012
 your-0.012
Token We
Token ID775
Feature activation-9.06e+00

Pos logit contributions
 laundry+0.142
 same+0.141
 your+0.139
 attic+0.128
 task+0.127
Neg logit contributions
oped-0.159
owing-0.152
unch-0.151
oled-0.150
uts-0.150
Token can
Token ID460
Feature activation-6.42e+00

Pos logit contributions
 laundry+0.155
 same+0.153
 task+0.141
 your+0.141
 homework+0.136
Neg logit contributions
oped-0.205
uts-0.199
unch-0.197
oled-0.195
owing-0.190
Token turn
Token ID1210
Feature activation+4.90e+00

Pos logit contributions
 laundry+0.076
 same+0.074
 your+0.066
 task+0.066
 homework+0.062
Neg logit contributions
oped-0.182
oled-0.175
uts-0.175
unch-0.174
owing-0.171
Token the
Token ID262
Feature activation-2.74e+01

Pos logit contributions
oped+0.094
uts+0.091
oled+0.088
es+0.086
oot+0.086
Neg logit contributions
 laundry-0.096
 same-0.092
 task-0.090
 attic-0.087
 homework-0.086
Token screw
Token ID9580
Feature activation+1.01e+01

Pos logit contributions
 laundry+0.338
 your+0.310
 same+0.295
 homework+0.293
 proper+0.293
Neg logit contributions
oped-0.654
owing-0.628
oled-0.620
m-0.619
uts-0.619
Token with
Token ID351
Feature activation-1.27e+01

Pos logit contributions
oped+0.180
uts+0.173
oled+0.166
unch+0.163
owing+0.163
Neg logit contributions
 laundry-0.222
 same-0.213
 task-0.203
 your-0.201
 attic-0.200
Token this
Token ID428
Feature activation-1.58e+01

Pos logit contributions
 laundry+0.179
 task+0.174
 homework+0.165
 attic+0.163
 same+0.162
Neg logit contributions
oped-0.284
uts-0.278
es-0.275
hed-0.271
ann-0.269
Token screw
Token ID9580
Feature activation+1.17e+01

Pos logit contributions
 laundry+0.265
 same+0.248
 your+0.233
 task+0.232
 attic+0.232
Neg logit contributions
oped-0.350
uts-0.335
unch-0.326
oled-0.326
owing-0.324
Tokendriver
Token ID26230
Feature activation+9.43e+00

Pos logit contributions
oped+0.180
uts+0.169
oled+0.166
unch+0.165
owing+0.160
Neg logit contributions
 laundry-0.286
 same-0.280
 task-0.267
 your-0.265
 attic-0.260
Token to
Token ID284
Feature activation-2.73e+01

Pos logit contributions
oped+0.059
owing+0.054
oled+0.053
unch+0.053
oot+0.052
Neg logit contributions
 your-0.041
 laundry-0.040
 same-0.040
 my-0.036
 our-0.036
Token fight
Token ID1907
Feature activation+5.76e+00

Pos logit contributions
 laundry+0.382
 same+0.374
 proper+0.366
 attic+0.365
 task+0.363
Neg logit contributions
uts-0.571
oped-0.562
unch-0.547
qu-0.541
oot-0.535
Token.
Token ID13
Feature activation-3.85e+00

Pos logit contributions
oped+0.072
uts+0.070
oled+0.067
es+0.065
unch+0.063
Neg logit contributions
 laundry-0.153
 same-0.148
 task-0.144
 attic-0.141
 proper-0.138
Token They
Token ID1119
Feature activation-5.73e+00

Pos logit contributions
 laundry+0.070
 same+0.069
 your+0.067
 attic+0.063
 task+0.063
Neg logit contributions
oped-0.082
owing-0.078
unch-0.077
oled-0.077
uts-0.077
Token pull
Token ID2834
Feature activation+3.10e+00

Pos logit contributions
 laundry+0.105
 same+0.097
 task+0.095
 your+0.093
 attic+0.092
Neg logit contributions
oped-0.124
oled-0.116
uts-0.113
owing-0.112
hed-0.110
Token the
Token ID262
Feature activation-2.73e+01

Pos logit contributions
oped+0.063
uts+0.060
oled+0.059
unch+0.058
owing+0.057
Neg logit contributions
 laundry-0.061
 same-0.059
 task-0.056
 your-0.055
 attic-0.054
Token book
Token ID1492
Feature activation+1.10e+01

Pos logit contributions
 laundry+0.377
 your+0.358
 attic+0.342
 proper+0.341
 homework+0.334
Neg logit contributions
oped-0.618
oled-0.604
uts-0.601
unch-0.592
umbs-0.589
Token from
Token ID422
Feature activation-1.95e+01

Pos logit contributions
oped+0.215
unch+0.205
oled+0.202
uts+0.197
owing+0.194
Neg logit contributions
 same-0.225
 laundry-0.225
 task-0.212
 your-0.210
 way-0.206
Token each
Token ID1123
Feature activation-1.32e+01

Pos logit contributions
 laundry+0.271
 task+0.262
 same+0.251
 proper+0.245
 homework+0.233
Neg logit contributions
oped-0.421
es-0.414
oled-0.409
owing-0.406
hed-0.404
Token other
Token ID584
Feature activation-6.85e+00

Pos logit contributions
 laundry+0.145
 same+0.134
 task+0.121
 your+0.119
 attic+0.115
Neg logit contributions
oped-0.381
oled-0.366
uts-0.366
unch-0.364
owing-0.358
Token.
Token ID13
Feature activation+1.68e-01

Pos logit contributions
 laundry+0.179
 same+0.175
 task+0.167
 your+0.167
 way+0.163
Neg logit contributions
oped-0.095
uts-0.088
oled-0.087
unch-0.087
owing-0.083
Token.
Token ID13
Feature activation-3.27e+00

Pos logit contributions
 same+0.198
 laundry+0.195
 attic+0.193
 port+0.192
 task+0.190
Neg logit contributions
uts-0.255
owing-0.255
aring-0.249
oped-0.249
m-0.241
Token We
Token ID775
Feature activation-9.59e+00

Pos logit contributions
 laundry+0.062
 same+0.061
 your+0.060
 attic+0.056
 task+0.055
Neg logit contributions
oped-0.067
owing-0.064
oled-0.063
unch-0.063
uts-0.063
Token have
Token ID423
Feature activation-1.09e+01

Pos logit contributions
 laundry+0.165
 same+0.163
 task+0.150
 your+0.150
 way+0.144
Neg logit contributions
oped-0.216
uts-0.212
unch-0.210
oled-0.207
owing-0.202
Token to
Token ID284
Feature activation-2.31e+01

Pos logit contributions
 laundry+0.197
 attic+0.194
 task+0.193
 same+0.186
 garage+0.181
Neg logit contributions
uts-0.199
oped-0.198
m-0.189
unch-0.188
qu-0.187
Token save
Token ID3613
Feature activation-2.39e+00

Pos logit contributions
 laundry+0.319
 same+0.315
 attic+0.296
 task+0.287
 computer+0.285
Neg logit contributions
oped-0.541
uts-0.526
unch-0.525
qu-0.521
oled-0.516
Token the
Token ID262
Feature activation-2.73e+01

Pos logit contributions
 laundry+0.030
 same+0.029
 task+0.029
 attic+0.028
 garage+0.026
Neg logit contributions
uts-0.060
oped-0.059
es-0.058
owing-0.058
oot-0.058
Token world
Token ID995
Feature activation+7.12e+00

Pos logit contributions
 your+0.331
 laundry+0.319
 same+0.303
 proper+0.301
 our+0.284
Neg logit contributions
unch-0.624
oped-0.617
m-0.610
qu-0.604
oled-0.603
Token from
Token ID422
Feature activation-2.03e+01

Pos logit contributions
oped+0.111
uts+0.110
oled+0.102
owing+0.102
unch+0.101
Neg logit contributions
 laundry-0.168
 same-0.163
 task-0.155
 attic-0.152
 homework-0.151
Token bad
Token ID2089
Feature activation-4.36e+00

Pos logit contributions
 laundry+0.270
 task+0.260
 same+0.240
 proper+0.234
 homework+0.228
Neg logit contributions
oped-0.461
owing-0.457
uts-0.456
es-0.453
oot-0.453
Token guys
Token ID3730
Feature activation+1.27e+01

Pos logit contributions
 laundry+0.060
 same+0.059
 proper+0.058
 homework+0.057
 your+0.057
Neg logit contributions
owing-0.095
es-0.092
oped-0.091
uts-0.089
hed-0.089
Token."
Token ID526
Feature activation+2.76e+00

Pos logit contributions
oped+0.209
oled+0.190
unch+0.190
uts+0.176
owing+0.173
Neg logit contributions
 laundry-0.303
 same-0.297
 your-0.294
 task-0.281
 way-0.279
Token and
Token ID290
Feature activation-1.72e+01

Pos logit contributions
 laundry+0.171
 same+0.160
 task+0.156
 homework+0.142
 attic+0.140
Neg logit contributions
oped-0.399
uts-0.388
oled-0.383
unch-0.382
es-0.376
Token pulls
Token ID16194
Feature activation-1.85e+00

Pos logit contributions
 laundry+0.309
 task+0.300
 same+0.294
 attic+0.284
 basement+0.275
Neg logit contributions
oped-0.339
uts-0.324
es-0.318
oled-0.310
hed-0.304
Token.
Token ID13
Feature activation-4.71e+00

Pos logit contributions
 laundry+0.030
 same+0.029
 task+0.027
 your+0.026
 attic+0.026
Neg logit contributions
oped-0.043
uts-0.042
oled-0.041
unch-0.041
owing-0.040
Token She
Token ID1375
Feature activation-7.60e+00

Pos logit contributions
 laundry+0.096
 same+0.095
 your+0.092
 task+0.087
 homework+0.086
Neg logit contributions
oped-0.090
owing-0.085
unch-0.084
oled-0.084
uts-0.083
Token opens
Token ID9808
Feature activation-8.02e+00

Pos logit contributions
 laundry+0.137
 task+0.129
 same+0.127
 attic+0.124
 job+0.121
Neg logit contributions
oped-0.168
es-0.156
oled-0.155
uts-0.153
hed-0.151
Token the
Token ID262
Feature activation-2.73e+01

Pos logit contributions
 laundry+0.085
 same+0.081
 task+0.073
 your+0.071
 homework+0.067
Neg logit contributions
oped-0.235
uts-0.227
oled-0.225
unch-0.225
owing-0.220
Token barrel
Token ID9036
Feature activation+1.30e+01

Pos logit contributions
 laundry+0.353
 homework+0.333
 your+0.322
 task+0.322
 same+0.319
Neg logit contributions
oped-0.623
m-0.600
oled-0.594
unch-0.584
umbs-0.582
Token.
Token ID13
Feature activation-1.83e+00

Pos logit contributions
oped+0.276
oled+0.262
unch+0.259
ann+0.258
oot+0.255
Neg logit contributions
 same-0.246
 table-0.242
 paper-0.232
 way-0.230
 laundry-0.229
Token Tim
Token ID5045
Feature activation+8.83e-01

Pos logit contributions
 laundry+0.036
 same+0.035
 your+0.034
 attic+0.032
 paper+0.032
Neg logit contributions
oped-0.037
owing-0.036
unch-0.035
oled-0.035
uts-0.034
Token comes
Token ID2058
Feature activation-1.73e+00

Pos logit contributions
oped+0.021
oled+0.020
uts+0.019
unch+0.019
owing+0.019
Neg logit contributions
 laundry-0.015
 same-0.014
 task-0.014
 your-0.013
 attic-0.013
Token out
Token ID503
Feature activation-7.88e+00

Pos logit contributions
 same+0.025
 your+0.024
 laundry+0.023
 way+0.023
 our+0.022
Neg logit contributions
oped-0.045
unch-0.044
oled-0.043
uts-0.043
qu-0.042
Token okay
Token ID8788
Feature activation-2.63e+00

Pos logit contributions
 laundry+0.078
 same+0.077
 attic+0.072
 computer+0.072
 homework+0.069
Neg logit contributions
uts-0.083
unch-0.078
oot-0.077
oped-0.076
owing-0.072
Token to
Token ID284
Feature activation-2.48e+01

Pos logit contributions
 same+0.059
 your+0.058
 laundry+0.058
 way+0.054
 task+0.054
Neg logit contributions
oped-0.050
oled-0.045
uts-0.043
unch-0.042
ann-0.042
Token not
Token ID407
Feature activation-1.22e+01

Pos logit contributions
 laundry+0.338
 same+0.323
 task+0.299
 your+0.289
 attic+0.288
Neg logit contributions
oped-0.644
uts-0.622
unch-0.619
oled-0.616
owing-0.607
Token always
Token ID1464
Feature activation-1.04e+01

Pos logit contributions
 laundry+0.176
 same+0.171
 your+0.158
 task+0.157
 way+0.150
Neg logit contributions
oped-0.314
uts-0.299
oled-0.299
unch-0.297
owing-0.290
Token win
Token ID1592
Feature activation+5.40e+00

Pos logit contributions
 laundry+0.165
 attic+0.152
 same+0.151
 task+0.146
 port+0.145
Neg logit contributions
oped-0.234
owing-0.232
uts-0.230
unch-0.225
oled-0.223
Token the
Token ID262
Feature activation-2.72e+01

Pos logit contributions
oped+0.096
uts+0.092
oled+0.090
unch+0.089
owing+0.087
Neg logit contributions
 laundry-0.119
 same-0.115
 task-0.110
 your-0.107
 attic-0.107
Token game
Token ID983
Feature activation+1.02e+01

Pos logit contributions
 laundry+0.320
 your+0.294
 proper+0.278
 attic+0.272
 our+0.270
Neg logit contributions
oped-0.648
unch-0.633
owing-0.629
uts-0.629
m-0.626
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
oped+0.136
uts+0.124
oled+0.124
unch+0.123
owing+0.117
Neg logit contributions
 laundry-0.270
 same-0.265
 task-0.253
 your-0.253
 way-0.247
Token<|endoftext|>
Token ID50256
Feature activation+4.92e+00

Pos logit contributions
 your+0.021
 same+0.021
 laundry+0.021
 attic+0.020
 my+0.019
Neg logit contributions
oped-0.021
owing-0.020
oled-0.019
uts-0.019
oot-0.019
TokenOnce
Token ID7454
Feature activation-3.56e+00

Pos logit contributions
aring+0.109
owing+0.108
oot+0.108
uts+0.106
ring+0.106
Neg logit contributions
 same-0.085
 your-0.084
 way-0.080
 laundry-0.080
 attic-0.079
Token upon
Token ID2402
Feature activation-1.42e+01

Pos logit contributions
 same+0.057
 laundry+0.056
 your+0.053
 task+0.051
 way+0.050
Neg logit contributions
oped-0.085
uts-0.081
oled-0.081
unch-0.080
owing-0.078
Token sorry
Token ID7926
Feature activation-2.55e-01

Pos logit contributions
 same+0.217
 task+0.216
 math+0.214
 laundry+0.209
 basement+0.206
Neg logit contributions
uts-0.398
oped-0.396
es-0.387
m-0.385
pt-0.384
Token,
Token ID11
Feature activation-1.63e+01

Pos logit contributions
 laundry+0.006
 same+0.006
 your+0.006
 task+0.006
 way+0.006
Neg logit contributions
oped-0.004
uts-0.004
unch-0.004
oled-0.004
owing-0.004
Token but
Token ID475
Feature activation-8.55e+00

Pos logit contributions
 laundry+0.295
 same+0.285
 task+0.281
 attic+0.272
 proper+0.270
Neg logit contributions
uts-0.312
oled-0.302
unch-0.296
oped-0.291
es-0.284
Token he
Token ID339
Feature activation-7.62e+00

Pos logit contributions
 laundry+0.169
 same+0.159
 task+0.157
 attic+0.157
 computer+0.154
Neg logit contributions
oped-0.156
uts-0.151
es-0.150
unch-0.148
owing-0.146
Token lost
Token ID2626
Feature activation-1.37e+00

Pos logit contributions
 laundry+0.154
 same+0.145
 task+0.139
 attic+0.137
 your+0.136
Neg logit contributions
oped-0.153
uts-0.143
oled-0.142
unch-0.140
es-0.139
Token the
Token ID262
Feature activation-2.72e+01

Pos logit contributions
 laundry+0.021
 same+0.021
 task+0.021
 homework+0.019
 attic+0.018
Neg logit contributions
oped-0.030
uts-0.030
es-0.029
oled-0.029
owing-0.029
Token book
Token ID1492
Feature activation+1.06e+01

Pos logit contributions
 laundry+0.343
 your+0.325
 same+0.313
 homework+0.303
 task+0.299
Neg logit contributions
oped-0.642
uts-0.624
oled-0.624
unch-0.623
m-0.620
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
oped+0.168
unch+0.156
uts+0.155
oled+0.155
owing+0.148
Neg logit contributions
 laundry-0.255
 same-0.252
 task-0.239
 your-0.238
 way-0.234
Token He
Token ID679
Feature activation-2.94e+00

Pos logit contributions
 laundry+0.021
 same+0.021
 your+0.020
 attic+0.019
 task+0.019
Neg logit contributions
oped-0.026
owing-0.024
unch-0.024
oled-0.024
uts-0.024
Token says
Token ID1139
Feature activation+4.35e+00

Pos logit contributions
 laundry+0.059
 task+0.054
 same+0.053
 computer+0.052
 job+0.052
Neg logit contributions
oped-0.059
es-0.056
oled-0.054
pt-0.054
hed-0.052
Token he
Token ID339
Feature activation-4.65e+00

Pos logit contributions
oped+0.078
uts+0.077
oled+0.074
unch+0.073
es+0.071
Neg logit contributions
 laundry-0.095
 same-0.092
 task-0.088
 attic-0.087
 way-0.085
Token.
Token ID13
Feature activation-3.27e+00

Pos logit contributions
 laundry+0.264
 same+0.263
 your+0.238
 attic+0.237
 task+0.227
Neg logit contributions
oped-0.465
uts-0.449
unch-0.438
owing-0.431
oled-0.425
Token They
Token ID1119
Feature activation-7.09e+00

Pos logit contributions
 laundry+0.062
 same+0.061
 your+0.060
 attic+0.056
 task+0.055
Neg logit contributions
oped-0.067
owing-0.064
oled-0.063
unch-0.063
uts-0.063
Token let
Token ID1309
Feature activation-5.38e+00

Pos logit contributions
 laundry+0.142
 same+0.135
 task+0.131
 your+0.129
 attic+0.126
Neg logit contributions
oped-0.141
oled-0.130
uts-0.128
owing-0.124
hed-0.124
Token go
Token ID467
Feature activation-6.02e+00

Pos logit contributions
 laundry+0.086
 same+0.084
 task+0.081
 attic+0.081
 homework+0.077
Neg logit contributions
oped-0.122
uts-0.114
oled-0.114
es-0.112
unch-0.112
Token of
Token ID286
Feature activation-2.33e+01

Pos logit contributions
 your+0.106
 same+0.099
 way+0.097
 laundry+0.094
 our+0.094
Neg logit contributions
oled-0.139
uts-0.134
qu-0.134
oped-0.134
oling-0.131
Token the
Token ID262
Feature activation-2.72e+01

Pos logit contributions
 same+0.281
 task+0.276
 laundry+0.262
 proper+0.261
 attic+0.260
Neg logit contributions
uts-0.574
oped-0.568
unch-0.561
es-0.554
oled-0.549
Token tire
Token ID15867
Feature activation+1.22e+01

Pos logit contributions
 your+0.335
 laundry+0.328
 same+0.324
 proper+0.315
 homework+0.304
Neg logit contributions
oped-0.616
m-0.594
unch-0.592
oled-0.585
qu-0.579
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
oped+0.221
oled+0.210
uts+0.207
unch+0.207
owing+0.197
Neg logit contributions
 laundry-0.264
 same-0.261
 your-0.247
 task-0.246
 way-0.239
Token It
Token ID632
Feature activation-8.45e-01

Pos logit contributions
 laundry+0.022
 same+0.021
 your+0.020
 task+0.020
 attic+0.019
Neg logit contributions
oped-0.029
uts-0.028
oled-0.028
unch-0.028
owing-0.027
Token rolls
Token ID14088
Feature activation+5.81e+00

Pos logit contributions
 laundry+0.015
 computer+0.013
 attic+0.013
 port+0.013
 task+0.013
Neg logit contributions
oped-0.019
es-0.017
m-0.017
uts-0.017
hed-0.017
Token fast
Token ID3049
Feature activation-2.06e-01

Pos logit contributions
oped+0.100
uts+0.092
oled+0.091
es+0.087
hed+0.087
Neg logit contributions
 laundry-0.132
 same-0.125
 task-0.122
 homework-0.119
 attic-0.116
Token not
Token ID407
Feature activation-1.61e+01

Pos logit contributions
oped+0.006
uts+0.006
oled+0.006
unch+0.006
owing+0.006
Neg logit contributions
 laundry-0.007
 same-0.007
 task-0.006
 your-0.006
 attic-0.006
Token jump
Token ID4391
Feature activation+1.46e+00

Pos logit contributions
 laundry+0.300
 same+0.282
 task+0.271
 attic+0.268
 homework+0.258
Neg logit contributions
oped-0.334
uts-0.329
unch-0.322
owing-0.321
oled-0.319
Token as
Token ID355
Feature activation-1.41e+01

Pos logit contributions
oped+0.022
uts+0.021
oled+0.021
unch+0.020
owing+0.020
Neg logit contributions
 laundry-0.036
 same-0.035
 task-0.033
 your-0.033
 attic-0.032
Token high
Token ID1029
Feature activation-8.38e+00

Pos logit contributions
 laundry+0.230
 same+0.204
 task+0.204
 computer+0.201
 attic+0.199
Neg logit contributions
oped-0.318
uts-0.314
unch-0.302
oled-0.298
oot-0.295
Token as
Token ID355
Feature activation-1.17e+01

Pos logit contributions
 laundry+0.207
 same+0.200
 task+0.193
 your+0.189
 attic+0.189
Neg logit contributions
oped-0.127
uts-0.120
oled-0.117
unch-0.116
owing-0.112
Token the
Token ID262
Feature activation-2.71e+01

Pos logit contributions
 laundry+0.223
 same+0.208
 task+0.203
 homework+0.196
 computer+0.196
Neg logit contributions
oped-0.241
uts-0.232
unch-0.222
oled-0.221
oot-0.220
Token others
Token ID1854
Feature activation+2.96e+00

Pos logit contributions
 laundry+0.357
 same+0.340
 your+0.336
 homework+0.316
 task+0.315
Neg logit contributions
oped-0.657
oled-0.633
uts-0.633
unch-0.629
owing-0.616
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
oped+0.051
unch+0.049
oled+0.048
uts+0.048
oot+0.046
Neg logit contributions
 laundry-0.065
 same-0.064
 your-0.061
 task-0.060
 way-0.060
Token
Token ID198
Feature activation-1.32e+00

Pos logit contributions
 same+0.022
 laundry+0.021
 your+0.021
 attic+0.020
 math+0.019
Neg logit contributions
oped-0.023
owing-0.022
oled-0.022
uts-0.022
unch-0.022
Token
Token ID198
Feature activation-6.11e-01

Pos logit contributions
 your+0.026
 same+0.025
 laundry+0.024
 attic+0.024
 math+0.024
Neg logit contributions
owing-0.027
oped-0.026
ring-0.025
oled-0.025
uts-0.024
TokenOne
Token ID3198
Feature activation+5.86e-02

Pos logit contributions
 laundry+0.012
 same+0.012
 your+0.012
 task+0.011
 attic+0.011
Neg logit contributions
oped-0.012
owing-0.012
uts-0.012
oled-0.011
unch-0.011
Token happy
Token ID3772
Feature activation-5.02e+00

Pos logit contributions
oped+0.053
oled+0.050
uts+0.050
unch+0.049
owing+0.049
Neg logit contributions
 laundry-0.046
 same-0.045
 your-0.042
 task-0.042
 attic-0.040
Token.
Token ID13
Feature activation-2.43e+00

Pos logit contributions
 laundry+0.126
 same+0.123
 task+0.119
 your+0.116
 attic+0.116
Neg logit contributions
oped-0.072
uts-0.069
unch-0.067
oled-0.066
owing-0.064
Token They
Token ID1119
Feature activation-6.51e+00

Pos logit contributions
 laundry+0.047
 same+0.047
 your+0.046
 attic+0.043
 task+0.043
Neg logit contributions
oped-0.049
owing-0.046
oled-0.046
uts-0.046
unch-0.046
Token forgot
Token ID16453
Feature activation-3.42e+00

Pos logit contributions
 laundry+0.134
 same+0.128
 task+0.125
 your+0.121
 homework+0.119
Neg logit contributions
oped-0.126
oled-0.116
uts-0.113
hed-0.110
unch-0.110
Token about
Token ID546
Feature activation-1.34e+01

Pos logit contributions
 laundry+0.074
 same+0.074
 task+0.072
 attic+0.069
 homework+0.067
Neg logit contributions
uts-0.057
oped-0.055
oled-0.053
unch-0.053
es-0.052
Token the
Token ID262
Feature activation-2.71e+01

Pos logit contributions
 laundry+0.175
 task+0.175
 same+0.172
 attic+0.157
 proper+0.147
Neg logit contributions
uts-0.319
owing-0.312
oped-0.312
unch-0.312
oled-0.306
Token balloon
Token ID21190
Feature activation+1.22e+01

Pos logit contributions
 laundry+0.326
 your+0.319
 proper+0.295
 same+0.291
 our+0.283
Neg logit contributions
oped-0.631
unch-0.625
m-0.623
owing-0.617
oled-0.616
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
oped+0.183
uts+0.169
oled+0.168
unch+0.168
owing+0.160
Neg logit contributions
 laundry-0.303
 same-0.298
 task-0.284
 your-0.283
 way-0.277
Token
Token ID198
Feature activation-1.76e+00

Pos logit contributions
 your+0.018
 laundry+0.018
 same+0.017
 attic+0.016
 our+0.016
Neg logit contributions
oped-0.030
owing-0.030
oled-0.028
unch-0.028
oot-0.028
Token
Token ID198
Feature activation-9.87e-01

Pos logit contributions
 your+0.031
 laundry+0.030
 same+0.030
 our+0.029
 new+0.028
Neg logit contributions
owing-0.040
oped-0.040
oled-0.037
ring-0.037
uts-0.036
TokenThey
Token ID2990
Feature activation-3.19e-01

Pos logit contributions
 laundry+0.020
 your+0.020
 same+0.019
 our+0.018
 attic+0.018
Neg logit contributions
owing-0.019
oped-0.019
uts-0.019
oled-0.018
ann-0.018
Token,
Token ID11
Feature activation-1.37e+01

Pos logit contributions
oped+0.143
oled+0.138
unch+0.137
oling+0.125
uts+0.123
Neg logit contributions
 laundry-0.229
 same-0.223
 paper-0.215
 task-0.214
 way-0.214
Token you
Token ID345
Feature activation-1.16e+01

Pos logit contributions
 laundry+0.253
 same+0.246
 task+0.243
 homework+0.234
 proper+0.234
Neg logit contributions
oped-0.268
uts-0.263
oled-0.260
unch-0.258
es-0.244
Token have
Token ID423
Feature activation-1.07e+01

Pos logit contributions
 laundry+0.192
 same+0.189
 task+0.175
 your+0.175
 way+0.169
Neg logit contributions
oped-0.269
uts-0.258
unch-0.256
oled-0.256
owing-0.249
Token to
Token ID284
Feature activation-2.27e+01

Pos logit contributions
 laundry+0.216
 attic+0.215
 task+0.212
 basement+0.205
 port+0.205
Neg logit contributions
uts-0.173
oped-0.167
es-0.158
hed-0.157
m-0.157
Token wind
Token ID2344
Feature activation+9.51e+00

Pos logit contributions
 laundry+0.312
 same+0.305
 attic+0.282
 task+0.277
 computer+0.274
Neg logit contributions
oped-0.548
unch-0.535
uts-0.535
oled-0.525
owing-0.522
Token the
Token ID262
Feature activation-2.71e+01

Pos logit contributions
oped+0.165
uts+0.155
oled+0.154
unch+0.154
owing+0.147
Neg logit contributions
 laundry-0.215
 same-0.211
 your-0.200
 task-0.199
 homework-0.194
Token string
Token ID4731
Feature activation+1.15e+01

Pos logit contributions
 laundry+0.322
 your+0.310
 homework+0.308
 proper+0.304
 task+0.285
Neg logit contributions
m-0.573
oped-0.565
owing-0.559
unch-0.555
oled-0.550
Token around
Token ID1088
Feature activation-5.11e+00

Pos logit contributions
oped+0.159
uts+0.156
owing+0.146
unch+0.145
oled+0.144
Neg logit contributions
 laundry-0.290
 same-0.274
 task-0.268
 homework-0.262
 attic-0.261
Token the
Token ID262
Feature activation-2.48e+01

Pos logit contributions
 laundry+0.070
 task+0.065
 same+0.065
 proper+0.061
 homework+0.061
Neg logit contributions
oped-0.123
uts-0.121
es-0.121
oled-0.118
unch-0.117
Token top
Token ID1353
Feature activation+6.20e+00

Pos logit contributions
 laundry+0.288
 homework+0.288
 task+0.273
 same+0.273
 your+0.272
Neg logit contributions
oped-0.555
uts-0.546
oled-0.544
hed-0.540
unch-0.540
Token,
Token ID11
Feature activation-7.80e+00

Pos logit contributions
oped+0.082
uts+0.076
unch+0.075
oled+0.074
owing+0.071
Neg logit contributions
 laundry-0.165
 same-0.161
 task-0.154
 your-0.153
 homework-0.150
Token to
Token ID284
Feature activation-2.68e+01

Pos logit contributions
 laundry+0.367
 same+0.344
 attic+0.338
 garage+0.336
 computer+0.335
Neg logit contributions
uts-0.367
oped-0.351
unch-0.347
owing-0.333
qu-0.332
Token pick
Token ID2298
Feature activation-4.26e+00

Pos logit contributions
 laundry+0.363
 same+0.356
 proper+0.343
 attic+0.333
 task+0.332
Neg logit contributions
uts-0.621
unch-0.619
oped-0.608
qu-0.605
owing-0.588
Token good
Token ID922
Feature activation-1.09e+01

Pos logit contributions
 laundry+0.063
 same+0.061
 task+0.058
 homework+0.055
 attic+0.055
Neg logit contributions
oped-0.104
uts-0.100
oled-0.098
unch-0.098
owing-0.097
Token things
Token ID1243
Feature activation-6.33e-02

Pos logit contributions
 laundry+0.121
 same+0.115
 task+0.107
 attic+0.103
 homework+0.100
Neg logit contributions
oped-0.304
uts-0.294
oled-0.291
unch-0.290
owing-0.290
Token from
Token ID422
Feature activation-2.21e+01

Pos logit contributions
 laundry+0.002
 same+0.002
 your+0.002
 task+0.001
 way+0.001
Neg logit contributions
oped-0.001
oled-0.001
unch-0.001
uts-0.001
owing-0.001
Token the
Token ID262
Feature activation-2.71e+01

Pos logit contributions
 laundry+0.332
 same+0.318
 task+0.310
 homework+0.284
 computer+0.284
Neg logit contributions
oped-0.512
uts-0.503
unch-0.494
owing-0.492
oled-0.490
Token refrigerator
Token ID30500
Feature activation+1.35e+01

Pos logit contributions
 laundry+0.336
 your+0.314
 same+0.305
 homework+0.299
 task+0.299
Neg logit contributions
oped-0.640
oled-0.623
unch-0.618
owing-0.615
uts-0.614
Token."
Token ID526
Feature activation+1.97e+00

Pos logit contributions
oped+0.168
oled+0.153
uts+0.152
unch+0.152
owing+0.143
Neg logit contributions
 laundry-0.370
 same-0.364
 task-0.348
 your-0.347
 way-0.341
Token
Token ID198
Feature activation-1.42e+00

Pos logit contributions
oped+0.033
uts+0.031
oled+0.030
owing+0.030
unch+0.030
Neg logit contributions
 laundry-0.046
 same-0.045
 your-0.044
 way-0.042
 task-0.041
Token
Token ID198
Feature activation-8.51e-01

Pos logit contributions
 your+0.030
 laundry+0.030
 same+0.029
 math+0.027
 our+0.027
Neg logit contributions
oped-0.026
owing-0.026
oled-0.025
uts-0.024
ring-0.024
TokenS
Token ID50
Feature activation+2.66e+00

Pos logit contributions
 laundry+0.017
 same+0.017
 your+0.016
 task+0.016
 attic+0.015
Neg logit contributions
oped-0.017
owing-0.016
oled-0.016
uts-0.016
unch-0.015
Token time
Token ID640
Feature activation-9.39e+00

Pos logit contributions
 laundry+0.289
 same+0.282
 task+0.269
 your+0.264
 homework+0.261
Neg logit contributions
oped-0.227
uts-0.214
oled-0.211
unch-0.210
owing-0.203
Token to
Token ID284
Feature activation-2.53e+01

Pos logit contributions
 laundry+0.224
 same+0.219
 task+0.210
 attic+0.204
 your+0.204
Neg logit contributions
oped-0.149
uts-0.141
oled-0.136
unch-0.135
owing-0.131
Token go
Token ID467
Feature activation-6.71e+00

Pos logit contributions
 same+0.357
 proper+0.342
 laundry+0.340
 attic+0.330
 task+0.323
Neg logit contributions
oped-0.550
uts-0.522
unch-0.520
qu-0.506
oot-0.505
Token home
Token ID1363
Feature activation+2.17e+00

Pos logit contributions
 laundry+0.131
 same+0.128
 task+0.122
 attic+0.117
 paper+0.116
Neg logit contributions
oped-0.133
uts-0.126
owing-0.124
unch-0.123
oot-0.123
Token as
Token ID355
Feature activation-1.16e+01

Pos logit contributions
oped+0.029
uts+0.027
oled+0.026
unch+0.026
owing+0.025
Neg logit contributions
 laundry-0.057
 same-0.056
 task-0.054
 your-0.054
 way-0.052
Token the
Token ID262
Feature activation-2.70e+01

Pos logit contributions
 laundry+0.193
 same+0.191
 task+0.185
 computer+0.177
 paper+0.171
Neg logit contributions
uts-0.241
oped-0.241
unch-0.233
us-0.223
owing-0.221
Token woods
Token ID16479
Feature activation+8.55e+00

Pos logit contributions
 laundry+0.356
 same+0.343
 your+0.338
 task+0.314
 proper+0.308
Neg logit contributions
oped-0.672
unch-0.647
uts-0.644
oled-0.644
owing-0.629
Token could
Token ID714
Feature activation-6.10e+00

Pos logit contributions
oped+0.148
uts+0.139
oled+0.137
unch+0.134
owing+0.133
Neg logit contributions
 laundry-0.192
 same-0.188
 task-0.179
 your-0.178
 homework-0.174
Token be
Token ID307
Feature activation-9.56e+00

Pos logit contributions
 laundry+0.125
 same+0.123
 way+0.116
 task+0.116
 your+0.116
Neg logit contributions
oped-0.119
oled-0.114
unch-0.110
uts-0.109
owing-0.107
Token dangerous
Token ID4923
Feature activation-1.44e+00

Pos logit contributions
 laundry+0.179
 same+0.173
 task+0.165
 your+0.161
 homework+0.159
Neg logit contributions
oped-0.200
uts-0.190
oled-0.187
owing-0.186
unch-0.185
Token.
Token ID13
Feature activation+7.48e-01

Pos logit contributions
 laundry+0.037
 same+0.036
 task+0.035
 your+0.035
 homework+0.034
Neg logit contributions
oped-0.020
uts-0.019
oled-0.019
unch-0.018
owing-0.018
Token embarrassed
Token ID21100
Feature activation+1.15e+00

Pos logit contributions
 laundry+0.310
 task+0.287
 same+0.278
 attic+0.275
 computer+0.271
Neg logit contributions
oped-0.317
unch-0.302
uts-0.299
oled-0.299
owing-0.297
Token.
Token ID13
Feature activation-2.06e+00

Pos logit contributions
oped+0.017
uts+0.016
oled+0.016
unch+0.016
owing+0.015
Neg logit contributions
 laundry-0.029
 same-0.028
 task-0.027
 your-0.026
 attic-0.026
Token He
Token ID679
Feature activation-4.03e+00

Pos logit contributions
 laundry+0.041
 same+0.040
 your+0.039
 task+0.037
 attic+0.037
Neg logit contributions
oped-0.041
unch-0.038
owing-0.038
uts-0.038
oled-0.038
Token had
Token ID550
Feature activation-9.12e+00

Pos logit contributions
 laundry+0.091
 same+0.087
 task+0.084
 attic+0.082
 your+0.081
Neg logit contributions
oped-0.071
oled-0.066
uts-0.065
es-0.062
unch-0.062
Token made
Token ID925
Feature activation-9.10e+00

Pos logit contributions
 laundry+0.182
 task+0.176
 attic+0.173
 same+0.168
 port+0.165
Neg logit contributions
oped-0.168
uts-0.166
oled-0.157
ann-0.153
pt-0.150
Token such
Token ID884
Feature activation-2.70e+01

Pos logit contributions
 laundry+0.134
 task+0.130
 attic+0.128
 same+0.127
 port+0.119
Neg logit contributions
uts-0.198
es-0.196
oped-0.193
unch-0.191
oled-0.191
Token a
Token ID257
Feature activation-1.79e+01

Pos logit contributions
 laundry+0.394
 same+0.363
 task+0.359
 homework+0.354
 attic+0.347
Neg logit contributions
uts-0.613
oped-0.600
oled-0.594
unch-0.590
owing-0.579
Token big
Token ID1263
Feature activation-1.02e+01

Pos logit contributions
 laundry+0.208
 your+0.191
 same+0.190
 homework+0.185
 attic+0.183
Neg logit contributions
oped-0.420
unch-0.417
oled-0.412
m-0.406
oing-0.403
Token mess
Token ID2085
Feature activation+1.21e+01

Pos logit contributions
 laundry+0.132
 same+0.126
 your+0.122
 homework+0.117
 proper+0.114
Neg logit contributions
oped-0.241
unch-0.235
owing-0.235
oled-0.232
m-0.226
Token that
Token ID326
Feature activation-1.20e+01

Pos logit contributions
oped+0.190
unch+0.176
oled+0.170
uts+0.168
owing+0.165
Neg logit contributions
 laundry-0.298
 same-0.289
 your-0.282
 task-0.275
 paper-0.272
Token he
Token ID339
Feature activation-4.41e+00

Pos logit contributions
 laundry+0.236
 same+0.230
 attic+0.218
 task+0.218
 your+0.214
Neg logit contributions
oped-0.205
uts-0.204
hed-0.193
oled-0.191
es-0.189
Token never
Token ID1239
Feature activation-1.20e+01

Pos logit contributions
oped+0.005
unch+0.005
uts+0.005
oled+0.005
oot+0.004
Neg logit contributions
 laundry-0.007
 same-0.007
 your-0.006
 way-0.006
 paper-0.006
Token forget
Token ID6044
Feature activation-1.14e+01

Pos logit contributions
 laundry+0.227
 same+0.211
 task+0.203
 attic+0.201
 homework+0.196
Neg logit contributions
oped-0.251
uts-0.240
owing-0.237
unch-0.235
oled-0.235
Token that
Token ID326
Feature activation-1.76e+01

Pos logit contributions
 laundry+0.209
 same+0.201
 task+0.196
 attic+0.187
 proper+0.183
Neg logit contributions
oped-0.229
uts-0.228
oled-0.218
owing-0.217
unch-0.217
Token story
Token ID1621
Feature activation+4.20e+00

Pos logit contributions
 laundry+0.271
 same+0.251
 task+0.233
 homework+0.233
 attic+0.232
Neg logit contributions
uts-0.399
oped-0.397
owing-0.385
unch-0.382
oled-0.372
Token and
Token ID290
Feature activation-1.77e+01

Pos logit contributions
oped+0.079
unch+0.075
uts+0.074
oled+0.074
owing+0.072
Neg logit contributions
 laundry-0.088
 same-0.087
 your-0.082
 task-0.082
 way-0.080
Token the
Token ID262
Feature activation-2.70e+01

Pos logit contributions
 laundry+0.305
 same+0.283
 task+0.280
 homework+0.266
 computer+0.264
Neg logit contributions
es-0.354
oped-0.352
owing-0.347
uts-0.342
hed-0.326
Token joy
Token ID8716
Feature activation+1.07e+01

Pos logit contributions
 laundry+0.366
 your+0.342
 same+0.339
 task+0.319
 homework+0.314
Neg logit contributions
oped-0.644
unch-0.624
oled-0.623
uts-0.620
owing-0.613
Token it
Token ID340
Feature activation-3.41e+00

Pos logit contributions
oped+0.198
uts+0.188
oled+0.184
unch+0.181
owing+0.178
Neg logit contributions
 laundry-0.228
 same-0.221
 task-0.211
 your-0.208
 homework-0.205
Token brought
Token ID3181
Feature activation+2.07e+00

Pos logit contributions
 laundry+0.066
 same+0.061
 task+0.059
 attic+0.058
 homework+0.058
Neg logit contributions
oped-0.070
uts-0.067
es-0.066
oled-0.065
hed-0.063
Token her
Token ID607
Feature activation-1.76e+01

Pos logit contributions
oped+0.045
uts+0.043
es+0.043
owing+0.042
oled+0.042
Neg logit contributions
 laundry-0.036
 same-0.033
 task-0.033
 homework-0.032
 attic-0.031
Token.
Token ID13
Feature activation+1.46e-01

Pos logit contributions
 laundry+0.299
 same+0.285
 task+0.267
 your+0.262
 attic+0.261
Neg logit contributions
oped-0.391
uts-0.382
unch-0.370
owing-0.370
oled-0.367
TokenLet
Token ID5756
Feature activation-1.16e+01

Pos logit contributions
 same+0.158
 laundry+0.157
 your+0.155
 homework+0.148
 task+0.148
Neg logit contributions
oped-0.094
unch-0.086
oled-0.083
uts-0.082
oot-0.080
Token's
Token ID338
Feature activation-2.39e+01

Pos logit contributions
 laundry+0.244
 attic+0.232
 same+0.230
 task+0.224
 homework+0.223
Neg logit contributions
oped-0.208
uts-0.197
unch-0.191
hed-0.189
oled-0.189
Token take
Token ID1011
Feature activation-9.60e+00

Pos logit contributions
 laundry+0.321
 same+0.309
 attic+0.285
 task+0.281
 proper+0.281
Neg logit contributions
oped-0.575
uts-0.572
unch-0.548
owing-0.536
oot-0.534
Token turns
Token ID4962
Feature activation-1.18e+00

Pos logit contributions
 laundry+0.138
 same+0.133
 task+0.129
 attic+0.128
 way+0.120
Neg logit contributions
oped-0.234
uts-0.224
oled-0.221
es-0.221
hed-0.216
Token wearing
Token ID5762
Feature activation-5.38e+00

Pos logit contributions
 laundry+0.022
 same+0.022
 your+0.021
 task+0.020
 way+0.020
Neg logit contributions
oped-0.025
unch-0.024
oled-0.024
uts-0.023
owing-0.023
Token the
Token ID262
Feature activation-2.69e+01

Pos logit contributions
 laundry+0.072
 same+0.069
 task+0.068
 attic+0.064
 way+0.063
Neg logit contributions
oped-0.132
uts-0.129
es-0.126
oled-0.124
hed-0.122
Token sweater
Token ID35685
Feature activation+1.51e+01

Pos logit contributions
 laundry+0.309
 task+0.297
 proper+0.291
 homework+0.286
 your+0.286
Neg logit contributions
oped-0.631
uts-0.608
hed-0.595
owing-0.593
m-0.592
Token,"
Token ID553
Feature activation+6.13e-01

Pos logit contributions
oped+0.240
unch+0.224
oled+0.223
uts+0.218
owing+0.213
Neg logit contributions
 laundry-0.365
 same-0.361
 your-0.342
 task-0.341
 way-0.332
Token L
Token ID406
Feature activation+2.33e+00

Pos logit contributions
oped+0.012
uts+0.012
unch+0.012
oled+0.012
es+0.011
Neg logit contributions
 laundry-0.012
 same-0.011
 task-0.011
 attic-0.011
 way-0.010
Tokenila
Token ID10102
Feature activation+7.31e-01

Pos logit contributions
oped+0.056
oing+0.054
uts+0.054
oled+0.054
umbs+0.054
Neg logit contributions
 your-0.031
 laundry-0.030
 same-0.030
 homework-0.029
 math-0.028
Token said
Token ID531
Feature activation+5.05e+00

Pos logit contributions
oped+0.014
unch+0.014
oled+0.014
uts+0.014
owing+0.013
Neg logit contributions
 same-0.015
 laundry-0.015
 your-0.014
 task-0.014
 homework-0.013
Token lick
Token ID36728
Feature activation-2.22e+00

Pos logit contributions
 laundry+0.119
 same+0.115
 attic+0.111
 task+0.111
 computer+0.107
Neg logit contributions
oped-0.123
uts-0.120
unch-0.116
oled-0.114
es-0.113
Token and
Token ID290
Feature activation-1.97e+01

Pos logit contributions
 laundry+0.035
 same+0.034
 your+0.031
 task+0.031
 way+0.030
Neg logit contributions
oped-0.054
uts-0.052
oled-0.051
unch-0.051
owing-0.050
Token nib
Token ID33272
Feature activation+2.06e+00

Pos logit contributions
 laundry+0.378
 task+0.364
 same+0.361
 attic+0.353
 port+0.340
Neg logit contributions
oped-0.373
es-0.363
uts-0.363
oled-0.348
hed-0.348
Tokenble
Token ID903
Feature activation+5.97e+00

Pos logit contributions
oped+0.037
oled+0.035
uts+0.034
owing+0.034
aring+0.034
Neg logit contributions
 same-0.044
 your-0.044
 laundry-0.043
 our-0.040
 task-0.039
Token of
Token ID286
Feature activation-2.31e+01

Pos logit contributions
oped+0.102
unch+0.096
oled+0.095
uts+0.093
owing+0.091
Neg logit contributions
 laundry-0.137
 same-0.136
 your-0.131
 task-0.126
 paper-0.125
Token the
Token ID262
Feature activation-2.69e+01

Pos logit contributions
 laundry+0.307
 task+0.296
 same+0.287
 computer+0.283
 attic+0.281
Neg logit contributions
oped-0.554
unch-0.544
uts-0.540
ann-0.527
es-0.525
Token sweet
Token ID6029
Feature activation-3.71e+00

Pos logit contributions
 laundry+0.357
 your+0.323
 same+0.321
 homework+0.313
 task+0.309
Neg logit contributions
oped-0.643
unch-0.626
uts-0.617
owing-0.615
oled-0.612
Token treat
Token ID2190
Feature activation+1.52e+01

Pos logit contributions
 laundry+0.056
 same+0.053
 attic+0.051
 homework+0.050
 your+0.050
Neg logit contributions
oped-0.086
unch-0.084
uts-0.083
owing-0.082
oled-0.082
Token.
Token ID13
Feature activation-3.21e-01

Pos logit contributions
oped+0.220
oled+0.204
unch+0.203
uts+0.199
owing+0.191
Neg logit contributions
 laundry-0.393
 same-0.383
 task-0.366
 your-0.363
 paper-0.361
Token Rachel
Token ID15984
Feature activation+1.55e+00

Pos logit contributions
 laundry+0.006
 same+0.006
 your+0.006
 attic+0.006
 task+0.006
Neg logit contributions
oped-0.006
owing-0.006
uts-0.006
oled-0.006
oot-0.006
Token was
Token ID373
Feature activation-1.05e+01

Pos logit contributions
oped+0.029
uts+0.027
oled+0.026
unch+0.026
es+0.026
Neg logit contributions
 laundry-0.034
 same-0.032
 task-0.031
 attic-0.030
 homework-0.030
Token their
Token ID511
Feature activation-2.40e+01

Pos logit contributions
 laundry+0.237
 task+0.233
 homework+0.232
 basement+0.225
 same+0.224
Neg logit contributions
oped-0.259
es-0.255
uts-0.252
ed-0.250
qu-0.240
Token mess
Token ID2085
Feature activation+6.37e+00

Pos logit contributions
 laundry+0.299
 task+0.274
 same+0.273
 your+0.261
 attic+0.255
Neg logit contributions
uts-0.542
oped-0.534
owing-0.530
oled-0.516
pt-0.513
Token.
Token ID13
Feature activation-3.50e+00

Pos logit contributions
oped+0.075
uts+0.069
oled+0.068
unch+0.067
owing+0.064
Neg logit contributions
 laundry-0.178
 same-0.175
 task-0.168
 your-0.167
 attic-0.164
Token They
Token ID1119
Feature activation-5.94e+00

Pos logit contributions
 laundry+0.060
 same+0.059
 your+0.058
 attic+0.054
 task+0.053
Neg logit contributions
oped-0.079
owing-0.075
unch-0.074
uts-0.074
oled-0.074
Token saw
Token ID2497
Feature activation-7.32e+00

Pos logit contributions
 laundry+0.108
 same+0.103
 task+0.100
 your+0.099
 way+0.097
Neg logit contributions
oped-0.123
oled-0.117
pt-0.112
uts-0.112
hed-0.111
Token the
Token ID262
Feature activation-2.69e+01

Pos logit contributions
 task+0.096
 laundry+0.093
 same+0.090
 way+0.083
 proper+0.082
Neg logit contributions
uts-0.180
oled-0.178
oped-0.177
es-0.177
unch-0.171
Token book
Token ID1492
Feature activation+1.15e+01

Pos logit contributions
 your+0.312
 laundry+0.311
 proper+0.306
 task+0.288
 same+0.284
Neg logit contributions
oled-0.644
oped-0.635
m-0.633
uts-0.629
unch-0.628
Token and
Token ID290
Feature activation-1.44e+01

Pos logit contributions
oped+0.199
uts+0.190
oled+0.186
unch+0.184
owing+0.178
Neg logit contributions
 laundry-0.260
 same-0.254
 your-0.241
 task-0.240
 attic-0.233
Token the
Token ID262
Feature activation-2.53e+01

Pos logit contributions
 task+0.267
 laundry+0.255
 same+0.246
 job+0.242
 attic+0.230
Neg logit contributions
es-0.289
oled-0.282
 cheered-0.281
oped-0.277
uts-0.263
Token rings
Token ID13917
Feature activation+1.39e+01

Pos logit contributions
 laundry+0.314
 task+0.289
 your+0.288
 proper+0.281
 same+0.280
Neg logit contributions
oled-0.601
oped-0.597
uts-0.590
es-0.576
m-0.574
Token.
Token ID13
Feature activation+4.03e-01

Pos logit contributions
oped+0.221
unch+0.211
uts+0.204
oled+0.204
owing+0.196
Neg logit contributions
 laundry-0.331
 same-0.328
 your-0.310
 paper-0.307
 way-0.305
Token and
Token ID290
Feature activation-1.72e+01

Pos logit contributions
oped+0.130
oled+0.123
unch+0.122
uts+0.122
oot+0.120
Neg logit contributions
 laundry-0.077
 same-0.076
 your-0.069
 task-0.068
 homework-0.067
Token dots
Token ID22969
Feature activation+5.50e+00

Pos logit contributions
 laundry+0.309
 task+0.300
 same+0.294
 attic+0.284
 basement+0.275
Neg logit contributions
oped-0.339
uts-0.324
es-0.318
oled-0.310
hed-0.304
Token.
Token ID13
Feature activation-3.51e+00

Pos logit contributions
oped+0.093
unch+0.087
oled+0.086
owing+0.083
uts+0.082
Neg logit contributions
 same-0.127
 laundry-0.125
 your-0.120
 paper-0.117
 way-0.115
Token They
Token ID1119
Feature activation-5.85e+00

Pos logit contributions
 laundry+0.067
 same+0.066
 your+0.065
 attic+0.061
 task+0.060
Neg logit contributions
oped-0.072
owing-0.068
oled-0.067
uts-0.067
unch-0.067
Token mixed
Token ID7668
Feature activation-9.54e-01

Pos logit contributions
 laundry+0.111
 same+0.104
 task+0.103
 your+0.099
 attic+0.098
Neg logit contributions
oped-0.118
oled-0.110
uts-0.109
es-0.108
hed-0.107
Token the
Token ID262
Feature activation-2.69e+01

Pos logit contributions
 laundry+0.016
 same+0.016
 task+0.015
 your+0.015
 attic+0.014
Neg logit contributions
oped-0.021
uts-0.021
unch-0.020
oled-0.020
owing-0.020
Token colors
Token ID7577
Feature activation+7.44e+00

Pos logit contributions
 laundry+0.328
 homework+0.306
 attic+0.303
 your+0.303
 basement+0.299
Neg logit contributions
m-0.642
oped-0.616
qu-0.610
hed-0.608
unch-0.605
Token and
Token ID290
Feature activation-1.49e+01

Pos logit contributions
oped+0.127
oled+0.118
unch+0.116
uts+0.115
owing+0.113
Neg logit contributions
 laundry-0.172
 same-0.170
 your-0.161
 task-0.160
 paper-0.156
Token made
Token ID925
Feature activation-7.69e+00

Pos logit contributions
 laundry+0.237
 task+0.229
 attic+0.225
 same+0.222
 basement+0.216
Neg logit contributions
oped-0.330
qu-0.324
es-0.324
uts-0.316
hed-0.314
Token new
Token ID649
Feature activation-7.94e+00

Pos logit contributions
 laundry+0.103
 task+0.095
 same+0.095
 attic+0.091
 homework+0.088
Neg logit contributions
oped-0.192
uts-0.191
unch-0.188
oled-0.186
es-0.185
Token ones
Token ID3392
Feature activation+6.10e+00

Pos logit contributions
 laundry+0.084
 same+0.082
 task+0.071
 your+0.071
 way+0.068
Neg logit contributions
oped-0.236
oled-0.226
uts-0.226
unch-0.225
owing-0.220
Token says
Token ID1139
Feature activation+9.13e+00

Pos logit contributions
oped+0.105
uts+0.102
oled+0.100
unch+0.098
es+0.098
Neg logit contributions
 laundry-0.103
 same-0.100
 task-0.096
 your-0.093
 attic-0.093
Token.
Token ID13
Feature activation+5.92e+00

Pos logit contributions
oped+0.177
ann+0.163
unch+0.160
rol+0.160
oled+0.157
Neg logit contributions
 your-0.188
 paper-0.185
 same-0.185
 laundry-0.184
 homework-0.179
Token She
Token ID1375
Feature activation+1.02e+00

Pos logit contributions
oped+0.151
owing+0.147
unch+0.141
oot+0.140
ann+0.139
Neg logit contributions
 same-0.086
 laundry-0.084
 your-0.082
 paper-0.076
 table-0.075
Token puts
Token ID7584
Feature activation+3.65e+00

Pos logit contributions
oped+0.024
es+0.023
oled+0.022
uts+0.022
hed+0.022
Neg logit contributions
 laundry-0.016
 task-0.015
 same-0.015
 job-0.015
 proper-0.014
Token her
Token ID607
Feature activation-1.38e+01

Pos logit contributions
oped+0.072
es+0.068
ed+0.067
hed+0.066
oled+0.065
Neg logit contributions
 task-0.068
 laundry-0.068
 way-0.065
 same-0.064
 attic-0.062
Token tower
Token ID10580
Feature activation+1.72e+01

Pos logit contributions
 laundry+0.197
 task+0.168
 same+0.165
 proper+0.162
 your+0.159
Neg logit contributions
owing-0.280
oped-0.276
uts-0.274
es-0.273
m-0.272
Token next
Token ID1306
Feature activation-2.56e+00

Pos logit contributions
oped+0.322
uts+0.304
oled+0.301
unch+0.300
owing+0.291
Neg logit contributions
 laundry-0.365
 same-0.356
 task-0.336
 your-0.335
 homework-0.325
Token to
Token ID284
Feature activation-1.48e+01

Pos logit contributions
 laundry+0.046
 same+0.045
 task+0.042
 your+0.042
 way+0.040
Neg logit contributions
oped-0.057
uts-0.054
oled-0.054
unch-0.054
owing-0.052
Token Ben
Token ID3932
Feature activation+7.33e+00

Pos logit contributions
 laundry+0.217
 task+0.216
 same+0.207
 proper+0.201
 way+0.197
Neg logit contributions
oped-0.342
oled-0.335
unch-0.331
uts-0.323
oot-0.321
Token's
Token ID338
Feature activation-1.32e+01

Pos logit contributions
oped+0.130
unch+0.120
oled+0.117
uts+0.111
oot+0.110
Neg logit contributions
 your-0.169
 laundry-0.169
 same-0.166
 paper-0.164
 table-0.163
Token house
Token ID2156
Feature activation+1.67e+01

Pos logit contributions
 laundry+0.176
 same+0.167
 your+0.155
 task+0.149
 homework+0.145
Neg logit contributions
uts-0.322
oped-0.317
owing-0.311
unch-0.311
es-0.304
Token loved
Token ID6151
Feature activation-8.58e-01

Pos logit contributions
 laundry+0.064
 same+0.061
 task+0.058
 your+0.056
 attic+0.055
Neg logit contributions
oped-0.073
uts-0.069
oled-0.068
unch-0.067
owing-0.066
Token very
Token ID845
Feature activation-9.26e+00

Pos logit contributions
 laundry+0.015
 same+0.015
 task+0.014
 your+0.013
 attic+0.013
Neg logit contributions
oped-0.019
uts-0.018
oled-0.018
unch-0.018
owing-0.018
Token much
Token ID881
Feature activation-2.45e+00

Pos logit contributions
 same+0.157
 laundry+0.154
 your+0.151
 task+0.148
 homework+0.147
Neg logit contributions
oped-0.218
oled-0.204
owing-0.197
unch-0.194
oling-0.193
Token.
Token ID13
Feature activation+4.28e+00

Pos logit contributions
 new+0.050
 your+0.049
 same+0.049
 way+0.049
 laundry+0.049
Neg logit contributions
oped-0.057
unch-0.050
oled-0.049
rol-0.047
oing-0.046
Token Her
Token ID2332
Feature activation-6.47e+00

Pos logit contributions
oped+0.093
uts+0.088
oled+0.087
owing+0.086
unch+0.086
Neg logit contributions
 laundry-0.080
 same-0.079
 your-0.074
 task-0.072
 attic-0.072
Token doll
Token ID3654
Feature activation+1.56e+01

Pos logit contributions
 laundry+0.083
 same+0.076
 task+0.074
 your+0.074
 port+0.071
Neg logit contributions
oped-0.161
uts-0.158
oled-0.154
unch-0.152
owing-0.152
Token was
Token ID373
Feature activation-7.57e+00

Pos logit contributions
oped+0.305
uts+0.289
oled+0.282
unch+0.278
owing+0.274
Neg logit contributions
 laundry-0.318
 same-0.306
 task-0.293
 your-0.289
 attic-0.280
Token gifted
Token ID22527
Feature activation+3.54e+00

Pos logit contributions
 laundry+0.135
 task+0.130
 same+0.124
 homework+0.119
 attic+0.118
Neg logit contributions
oped-0.155
uts-0.153
unch-0.151
oled-0.150
ann-0.146
Token to
Token ID284
Feature activation-1.58e+01

Pos logit contributions
oped+0.070
unch+0.066
oled+0.064
uts+0.063
qu+0.061
Neg logit contributions
 laundry-0.074
 same-0.073
 your-0.069
 task-0.067
 paper-0.067
Token her
Token ID607
Feature activation-1.46e+01

Pos logit contributions
 laundry+0.223
 same+0.199
 task+0.196
 attic+0.190
 computer+0.182
Neg logit contributions
oped-0.389
uts-0.384
oot-0.380
oled-0.372
owing-0.369
Token by
Token ID416
Feature activation-1.19e+01

Pos logit contributions
 laundry+0.318
 same+0.312
 task+0.295
 your+0.294
 way+0.286
Neg logit contributions
oped-0.268
uts-0.252
oled-0.250
unch-0.250
owing-0.240
Token safe
Token ID3338
Feature activation+6.83e+00

Pos logit contributions
 laundry+0.168
 same+0.157
 homework+0.145
 your+0.143
 task+0.137
Neg logit contributions
oped-0.357
unch-0.339
oled-0.338
uts-0.330
owing-0.329
Token place
Token ID1295
Feature activation+6.94e+00

Pos logit contributions
oped+0.210
uts+0.204
unch+0.203
oled+0.202
owing+0.200
Neg logit contributions
 laundry-0.056
 same-0.051
 homework-0.044
 task-0.043
 your-0.042
Token.
Token ID13
Feature activation+6.36e+00

Pos logit contributions
oped+0.092
uts+0.084
oled+0.083
unch+0.083
owing+0.079
Neg logit contributions
 laundry-0.185
 same-0.182
 task-0.174
 your-0.173
 way-0.169
Token No
Token ID1400
Feature activation-1.79e+00

Pos logit contributions
oped+0.138
owing+0.131
uts+0.130
oot+0.127
oled+0.126
Neg logit contributions
 laundry-0.115
 your-0.113
 same-0.113
 my-0.105
 way-0.103
Token more
Token ID517
Feature activation-7.32e+00

Pos logit contributions
 laundry+0.023
 same+0.022
 task+0.020
 your+0.019
 attic+0.019
Neg logit contributions
oped-0.049
unch-0.046
oled-0.046
uts-0.046
owing-0.046
Token shouting
Token ID19642
Feature activation+1.77e+01

Pos logit contributions
 same+0.105
 laundry+0.103
 task+0.094
 attic+0.091
 your+0.090
Neg logit contributions
uts-0.173
oped-0.170
unch-0.167
oled-0.166
owing-0.164
Token.
Token ID13
Feature activation+6.03e+00

Pos logit contributions
oped+0.295
uts+0.276
unch+0.274
oled+0.273
owing+0.263
Neg logit contributions
 laundry-0.410
 same-0.402
 your-0.381
 task-0.381
 homework-0.370
Tokenâ
Token ID22940
Feature activation+3.24e+00

Pos logit contributions
oped+0.135
owing+0.127
uts+0.126
oot+0.124
oled+0.123
Neg logit contributions
 laundry-0.105
 same-0.104
 your-0.102
 attic-0.094
 my-0.094
Token
Token ID26391
Feature activation+4.78e+00

Pos logit contributions
oped+0.077
uts+0.076
unch+0.074
oled+0.074
es+0.071
Neg logit contributions
 same-0.051
 laundry-0.051
 task-0.047
 your-0.047
 way-0.045
Token The
Token ID383
Feature activation-1.18e+01

Pos logit contributions
oped+0.087
ann+0.085
oot+0.084
oled+0.083
owing+0.082
Neg logit contributions
 your-0.093
 same-0.092
 laundry-0.092
 way-0.091
 our-0.088
Token rabbit
Token ID22746
Feature activation+1.61e+01

Pos logit contributions
 laundry+0.220
 same+0.201
 your+0.200
 homework+0.199
 task+0.193
Neg logit contributions
oped-0.244
oled-0.235
unch-0.234
uts-0.228
owing-0.223
Token rock
Token ID3881
Feature activation+1.78e+01

Pos logit contributions
 homework+0.283
 laundry+0.254
 your+0.242
 same+0.237
 math+0.237
Neg logit contributions
uts-0.467
ann-0.463
oped-0.462
pt-0.453
m-0.446
Token.
Token ID13
Feature activation+6.35e+00

Pos logit contributions
oped+0.317
oled+0.306
unch+0.290
oling+0.275
ann+0.274
Neg logit contributions
 laundry-0.395
 same-0.392
 way-0.390
 paper-0.384
 table-0.382
Token They
Token ID1119
Feature activation+2.79e+00

Pos logit contributions
oped+0.139
uts+0.133
oled+0.132
unch+0.131
owing+0.127
Neg logit contributions
 laundry-0.116
 same-0.112
 task-0.105
 your-0.104
 homework-0.101
Token made
Token ID925
Feature activation-2.74e+00

Pos logit contributions
oped+0.059
oled+0.054
uts+0.054
unch+0.053
es+0.053
Neg logit contributions
 laundry-0.054
 same-0.052
 your-0.049
 task-0.049
 homework-0.048
Token more
Token ID517
Feature activation-6.74e+00

Pos logit contributions
 laundry+0.041
 task+0.037
 attic+0.036
 same+0.036
 homework+0.036
Neg logit contributions
oped-0.062
es-0.062
unch-0.062
uts-0.061
qu-0.060
Token tunnels
Token ID22642
Feature activation+1.63e+01

Pos logit contributions
 laundry+0.078
 same+0.075
 your+0.067
 task+0.067
 homework+0.064
Neg logit contributions
oped-0.189
uts-0.184
unch-0.183
oled-0.181
owing-0.179
Token and
Token ID290
Feature activation-9.76e+00

Pos logit contributions
oped+0.269
uts+0.251
oled+0.250
unch+0.249
owing+0.239
Neg logit contributions
 laundry-0.381
 same-0.374
 task-0.355
 your-0.353
 way-0.346
Token found
Token ID1043
Feature activation+1.33e+00

Pos logit contributions
 laundry+0.171
 same+0.155
 task+0.155
 homework+0.153
 computer+0.149
Neg logit contributions
oped-0.200
es-0.196
qu-0.194
uts-0.191
oled-0.189
Token more
Token ID517
Feature activation-6.88e+00

Pos logit contributions
oped+0.033
es+0.032
uts+0.032
oled+0.031
unch+0.031
Neg logit contributions
 laundry-0.019
 task-0.018
 same-0.018
 homework-0.017
 your-0.017
Token worms
Token ID29174
Feature activation+1.69e+01

Pos logit contributions
 laundry+0.084
 same+0.080
 task+0.073
 homework+0.071
 your+0.071
Neg logit contributions
oped-0.186
uts-0.184
unch-0.179
oled-0.177
owing-0.176
Token.
Token ID13
Feature activation+5.99e+00

Pos logit contributions
oped+0.299
oled+0.282
unch+0.277
amb+0.262
ann+0.261
Neg logit contributions
 laundry-0.367
 same-0.364
 your-0.353
 way-0.353
 paper-0.350
Token bear
Token ID6842
Feature activation+1.79e+01

Pos logit contributions
 laundry+0.050
 same+0.048
 task+0.046
 homework+0.045
 your+0.045
Neg logit contributions
oped-0.053
oled-0.050
unch-0.050
uts-0.050
owing-0.050
Token,
Token ID11
Feature activation-2.81e+00

Pos logit contributions
oped+0.339
uts+0.333
oled+0.327
unch+0.322
oot+0.313
Neg logit contributions
 laundry-0.363
 same-0.361
 way-0.343
 your-0.341
 task-0.337
Token she
Token ID673
Feature activation-1.79e+00

Pos logit contributions
 laundry+0.060
 same+0.059
 task+0.059
 homework+0.058
 proper+0.057
Neg logit contributions
oped-0.048
es-0.046
uts-0.045
oled-0.045
unch-0.044
Token began
Token ID2540
Feature activation+2.03e+00

Pos logit contributions
 laundry+0.032
 same+0.032
 your+0.030
 task+0.029
 way+0.029
Neg logit contributions
oped-0.038
uts-0.037
unch-0.037
oled-0.036
owing-0.036
Token to
Token ID284
Feature activation-1.53e+01

Pos logit contributions
oped+0.049
oled+0.047
uts+0.047
unch+0.046
owing+0.045
Neg logit contributions
 laundry-0.032
 same-0.031
 your-0.029
 task-0.029
 way-0.028
Token smile
Token ID8212
Feature activation+1.63e+01

Pos logit contributions
 laundry+0.270
 same+0.255
 attic+0.246
 task+0.245
 your+0.244
Neg logit contributions
oped-0.325
unch-0.309
oled-0.309
owing-0.308
uts-0.301
Token.
Token ID13
Feature activation+5.97e+00

Pos logit contributions
oped+0.297
unch+0.286
umbs+0.271
oled+0.265
ann+0.259
Neg logit contributions
 same-0.377
 your-0.367
 laundry-0.360
 paper-0.347
 math-0.342
Token
Token ID220
Feature activation+1.17e+01

Pos logit contributions
oped+0.122
uts+0.116
oled+0.114
unch+0.114
owing+0.111
Neg logit contributions
 laundry-0.118
 same-0.115
 task-0.108
 your-0.107
 homework-0.104
Token
Token ID198
Feature activation+4.57e+00

Pos logit contributions
oped+0.263
ring+0.252
owing+0.248
oled+0.241
ann+0.239
Neg logit contributions
 your-0.206
 same-0.199
 new-0.187
 laundry-0.180
 math-0.177
Token
Token ID198
Feature activation+4.56e+00

Pos logit contributions
owing+0.106
oped+0.103
ring+0.101
oled+0.096
ked+0.095
Neg logit contributions
 your-0.080
 same-0.075
 new-0.071
 way-0.070
 laundry-0.069
TokenThe
Token ID464
Feature activation-1.31e+01

Pos logit contributions
oped+0.090
uts+0.084
oled+0.084
unch+0.084
owing+0.083
Neg logit contributions
 laundry-0.093
 same-0.091
 your-0.086
 task-0.086
 attic-0.083
Token read
Token ID1100
Feature activation+1.12e+01

Pos logit contributions
 laundry+0.079
 same+0.072
 port+0.070
 computer+0.067
 our+0.067
Neg logit contributions
owing-0.090
uts-0.089
oped-0.088
ring-0.085
es-0.084
Token math
Token ID10688
Feature activation+8.06e+00

Pos logit contributions
oped+0.284
uts+0.276
oled+0.273
es+0.268
unch+0.267
Neg logit contributions
 laundry-0.156
 same-0.149
 task-0.141
 your-0.134
 attic-0.131
Token!
Token ID0
Feature activation+3.08e+00

Pos logit contributions
oped+0.139
uts+0.131
oled+0.130
unch+0.130
owing+0.125
Neg logit contributions
 laundry-0.182
 same-0.178
 task-0.169
 your-0.168
 homework-0.164
Token
Token ID220
Feature activation+1.07e+01

Pos logit contributions
oped+0.052
uts+0.049
oled+0.049
unch+0.048
owing+0.046
Neg logit contributions
 laundry-0.071
 same-0.069
 task-0.066
 your-0.065
 homework-0.064
Token
Token ID198
Feature activation+3.66e+00

Pos logit contributions
oped+0.170
owing+0.156
oled+0.151
unch+0.151
uts+0.147
Neg logit contributions
 laundry-0.257
 same-0.254
 your-0.241
 attic-0.235
 math-0.235
Token
Token ID198
Feature activation+3.59e+00

Pos logit contributions
owing+0.071
oped+0.068
uts+0.066
ring+0.064
oled+0.063
Neg logit contributions
 laundry-0.078
 same-0.075
 your-0.073
 math-0.070
 task-0.070
TokenShe
Token ID3347
Feature activation+1.47e+00

Pos logit contributions
oped+0.063
uts+0.059
oled+0.059
unch+0.058
oot+0.055
Neg logit contributions
 laundry-0.080
 same-0.079
 task-0.074
 your-0.074
 way-0.073
Token quickly
Token ID2952
Feature activation+3.96e+00

Pos logit contributions
oped+0.028
uts+0.026
oled+0.026
unch+0.026
es+0.025
Neg logit contributions
 laundry-0.031
 same-0.030
 task-0.028
 your-0.028
 attic-0.027
Token sat
Token ID3332
Feature activation+4.45e+00

Pos logit contributions
oped+0.079
oled+0.074
uts+0.074
unch+0.072
es+0.071
Neg logit contributions
 laundry-0.079
 same-0.077
 task-0.073
 your-0.072
 attic-0.071
Token down
Token ID866
Feature activation-4.94e+00

Pos logit contributions
oped+0.079
oled+0.073
owing+0.073
uts+0.072
es+0.072
Neg logit contributions
 laundry-0.097
 same-0.094
 task-0.093
 way-0.089
 proper-0.088
Token and
Token ID290
Feature activation-1.14e+01

Pos logit contributions
 laundry+0.115
 same+0.111
 task+0.109
 proper+0.105
 your+0.104
Neg logit contributions
oped-0.082
oled-0.075
uts-0.075
es-0.073
owing-0.071
Token sad
Token ID6507
Feature activation+7.97e+00

Pos logit contributions
 laundry+0.109
 same+0.107
 your+0.100
 task+0.099
 way+0.096
Neg logit contributions
oped-0.130
uts-0.124
oled-0.123
unch-0.123
owing-0.119
Token,
Token ID11
Feature activation-3.28e+00

Pos logit contributions
oped+0.130
uts+0.121
oled+0.120
unch+0.120
owing+0.115
Neg logit contributions
 laundry-0.190
 same-0.186
 task-0.177
 your-0.176
 way-0.172
Token but
Token ID475
Feature activation+1.11e+00

Pos logit contributions
 laundry+0.074
 same+0.071
 task+0.071
 homework+0.068
 attic+0.068
Neg logit contributions
oped-0.055
uts-0.053
oled-0.052
unch-0.051
owing-0.049
Token he
Token ID339
Feature activation-5.14e-02

Pos logit contributions
oped+0.023
uts+0.021
unch+0.021
oled+0.021
oot+0.021
Neg logit contributions
 laundry-0.021
 same-0.021
 task-0.020
 attic-0.019
 computer-0.019
Token didn
Token ID1422
Feature activation+6.84e+00

Pos logit contributions
 laundry+0.001
 same+0.001
 task+0.001
 your+0.001
 way+0.001
Neg logit contributions
oped-0.001
uts-0.001
oled-0.001
unch-0.001
owing-0.001
Tokenâ
Token ID22940
Feature activation+4.48e+00

Pos logit contributions
oped+0.129
owing+0.123
oled+0.119
unch+0.115
oing+0.112
Neg logit contributions
 laundry-0.145
 same-0.144
 your-0.139
 task-0.138
 homework-0.133
Token
Token ID26391
Feature activation+4.95e+00

Pos logit contributions
oped+0.116
uts+0.113
unch+0.111
oled+0.111
es+0.108
Neg logit contributions
 laundry-0.062
 same-0.061
 task-0.056
 your-0.055
 homework-0.053
Token
Token ID8151
Feature activation+2.05e+01

Pos logit contributions
pt+0.071
ann+0.071
oped+0.069
oot+0.068
amb+0.068
Neg logit contributions
 laundry-0.107
 same-0.106
 homework-0.106
 your-0.104
 way-0.103
Tokent
Token ID83
Feature activation-2.44e+00

Pos logit contributions
ann+0.261
ez+0.254
qu+0.251
oing+0.251
oped+0.250
Neg logit contributions
 same-0.466
 your-0.457
 way-0.453
 laundry-0.439
 homework-0.430
Token give
Token ID1577
Feature activation+9.02e+00

Pos logit contributions
 same+0.039
 laundry+0.038
 your+0.037
 task+0.036
 way+0.036
Neg logit contributions
oped-0.061
unch-0.058
uts-0.056
oled-0.056
es-0.054
Token up
Token ID510
Feature activation-3.55e+00

Pos logit contributions
oped+0.206
uts+0.197
oled+0.194
unch+0.193
owing+0.192
Neg logit contributions
 laundry-0.149
 same-0.143
 task-0.137
 attic-0.133
 homework-0.128
Token to
Token ID284
Feature activation-1.56e+01

Pos logit contributions
oped+0.065
oled+0.062
unch+0.061
uts+0.060
oot+0.057
Neg logit contributions
 laundry-0.076
 same-0.075
 your-0.073
 way-0.071
 task-0.070
Token feel
Token ID1254
Feature activation-6.76e-01

Pos logit contributions
 laundry+0.213
 same+0.211
 homework+0.201
 task+0.197
 computer+0.191
Neg logit contributions
oped-0.383
owing-0.355
unch-0.353
oot-0.345
oled-0.345
Token the
Token ID262
Feature activation-2.10e+01

Pos logit contributions
 laundry+0.009
 same+0.009
 task+0.008
 homework+0.008
 computer+0.008
Neg logit contributions
oped-0.017
oled-0.017
unch-0.017
uts-0.017
owing-0.016
Token wet
Token ID9583
Feature activation+6.53e+00

Pos logit contributions
 laundry+0.223
 homework+0.221
 your+0.217
 math+0.211
 task+0.203
Neg logit contributions
oped-0.565
oled-0.540
unch-0.540
m-0.539
owing-0.529
Token and
Token ID290
Feature activation-1.04e+01

Pos logit contributions
oped+0.192
uts+0.184
oled+0.184
unch+0.183
owing+0.181
Neg logit contributions
 laundry-0.065
 same-0.061
 your-0.056
 task-0.055
 homework-0.054
Token heavy
Token ID4334
Feature activation+4.27e-01

Pos logit contributions
 homework+0.149
 computer+0.146
 laundry+0.145
 task+0.143
 math+0.140
Neg logit contributions
oped-0.222
m-0.219
owing-0.209
pt-0.206
unch-0.205
Token rain
Token ID6290
Feature activation+1.32e+01

Pos logit contributions
oped+0.012
owing+0.011
oled+0.011
uts+0.011
unch+0.011
Neg logit contributions
 laundry-0.005
 same-0.005
 your-0.004
 homework-0.004
 task-0.004
Token.
Token ID13
Feature activation+5.95e+00

Pos logit contributions
oped+0.214
unch+0.213
oled+0.199
uts+0.198
ann+0.194
Neg logit contributions
 same-0.314
 laundry-0.312
 task-0.293
 way-0.293
 paper-0.277
Token She
Token ID1375
Feature activation+1.52e-01

Pos logit contributions
oped+0.109
uts+0.103
oled+0.102
unch+0.102
owing+0.099
Neg logit contributions
 laundry-0.129
 same-0.126
 task-0.120
 your-0.119
 homework-0.116
Token was
Token ID373
Feature activation-7.53e+00

Pos logit contributions
oped+0.003
uts+0.003
oled+0.003
unch+0.003
es+0.003
Neg logit contributions
 laundry-0.003
 same-0.003
 task-0.003
 math-0.003
 attic-0.003
Token so
Token ID523
Feature activation-1.01e+01

Pos logit contributions
 laundry+0.141
 same+0.136
 task+0.130
 your+0.126
 attic+0.124
Neg logit contributions
oped-0.160
uts-0.152
oled-0.151
unch-0.151
owing-0.147
Token,
Token ID11
Feature activation-2.24e+00

Pos logit contributions
oped+0.086
uts+0.083
oled+0.083
unch+0.082
owing+0.081
Neg logit contributions
 laundry-0.076
 same-0.074
 task-0.070
 attic-0.068
 your-0.068
Token and
Token ID290
Feature activation-8.81e+00

Pos logit contributions
 laundry+0.051
 same+0.050
 task+0.049
 attic+0.047
 homework+0.047
Neg logit contributions
oped-0.037
uts-0.036
oled-0.035
unch-0.035
owing-0.034
Token started
Token ID2067
Feature activation+1.91e+00

Pos logit contributions
 laundry+0.197
 same+0.193
 task+0.187
 computer+0.179
 attic+0.178
Neg logit contributions
oped-0.149
es-0.145
uts-0.139
ed-0.137
oled-0.135
Token to
Token ID284
Feature activation-1.43e+01

Pos logit contributions
oped+0.041
uts+0.039
unch+0.039
oled+0.038
owing+0.038
Neg logit contributions
 laundry-0.035
 same-0.034
 task-0.032
 attic-0.032
 your-0.032
Token chew
Token ID34722
Feature activation+1.52e+01

Pos logit contributions
 laundry+0.238
 same+0.227
 your+0.223
 attic+0.220
 our+0.216
Neg logit contributions
oped-0.303
owing-0.296
unch-0.295
oled-0.286
m-0.275
Token it
Token ID340
Feature activation+2.19e+00

Pos logit contributions
oped+0.311
unch+0.295
oled+0.288
owing+0.286
uts+0.284
Neg logit contributions
 laundry-0.306
 same-0.298
 your-0.293
 paper-0.281
 task-0.278
Token.
Token ID13
Feature activation+6.68e+00

Pos logit contributions
oped+0.037
oled+0.035
unch+0.034
uts+0.032
ann+0.032
Neg logit contributions
 same-0.053
 laundry-0.052
 task-0.051
 way-0.050
 your-0.050
Token
Token ID198
Feature activation+5.39e+00

Pos logit contributions
oped+0.140
uts+0.132
oled+0.131
unch+0.131
owing+0.128
Neg logit contributions
 laundry-0.129
 same-0.126
 task-0.118
 your-0.118
 way-0.115
Token
Token ID198
Feature activation+5.44e+00

Pos logit contributions
owing+0.127
ring+0.126
oped+0.123
oled+0.120
ked+0.118
Neg logit contributions
 your-0.088
 same-0.084
 way-0.083
 new-0.081
 big-0.076
TokenThe
Token ID464
Feature activation-1.21e+01

Pos logit contributions
oped+0.100
owing+0.097
uts+0.094
unch+0.093
oled+0.093
Neg logit contributions
 laundry-0.115
 same-0.115
 your-0.110
 task-0.110
 attic-0.107
Token line
Token ID1627
Feature activation+1.09e+01

Pos logit contributions
 your+0.181
 laundry+0.175
 proper+0.174
 our+0.164
 same+0.162
Neg logit contributions
unch-0.268
oped-0.256
oled-0.254
m-0.254
uts-0.251
TokenThe
Token ID464
Feature activation-1.16e+01

Pos logit contributions
oped+0.087
oled+0.081
owing+0.080
unch+0.080
uts+0.079
Neg logit contributions
 laundry-0.105
 same-0.105
 your-0.097
 task-0.097
 attic-0.095
Token sun
Token ID4252
Feature activation+1.76e+01

Pos logit contributions
 laundry+0.148
 same+0.137
 your+0.133
 homework+0.133
 task+0.128
Neg logit contributions
oped-0.297
unch-0.294
uts-0.293
oled-0.284
owing-0.282
Token rises
Token ID16736
Feature activation+1.14e+01

Pos logit contributions
oped+0.442
uts+0.416
owing+0.415
unch+0.411
es+0.409
Neg logit contributions
 laundry-0.256
 same-0.240
 attic-0.225
 homework-0.224
 proper-0.224
Token in
Token ID287
Feature activation-1.36e+01

Pos logit contributions
oped+0.199
uts+0.187
unch+0.183
oled+0.182
owing+0.180
Neg logit contributions
 laundry-0.256
 same-0.244
 task-0.236
 your-0.232
 homework-0.231
Token the
Token ID262
Feature activation-1.93e+01

Pos logit contributions
 homework+0.172
 laundry+0.165
 task+0.164
 proper+0.159
 same+0.156
Neg logit contributions
unch-0.308
qu-0.302
oped-0.302
m-0.301
h-0.294
Token morning
Token ID3329
Feature activation+7.84e+00

Pos logit contributions
 laundry+0.356
 homework+0.336
 task+0.329
 same+0.326
 your+0.324
Neg logit contributions
oped-0.392
unch-0.369
uts-0.363
oled-0.363
hed-0.357
Token and
Token ID290
Feature activation-8.47e+00

Pos logit contributions
oped+0.128
uts+0.121
oled+0.119
unch+0.119
owing+0.115
Neg logit contributions
 laundry-0.183
 same-0.178
 task-0.170
 your-0.169
 homework-0.166
Token sets
Token ID5621
Feature activation+8.32e+00

Pos logit contributions
 laundry+0.160
 task+0.151
 same+0.148
 attic+0.146
 homework+0.146
Neg logit contributions
oped-0.168
uts-0.157
oled-0.155
qu-0.152
owing-0.151
Token in
Token ID287
Feature activation-1.37e+01

Pos logit contributions
oped+0.196
uts+0.190
unch+0.187
oled+0.185
owing+0.183
Neg logit contributions
 laundry-0.128
 same-0.121
 task-0.116
 homework-0.113
 attic-0.111
Token the
Token ID262
Feature activation-1.94e+01

Pos logit contributions
 laundry+0.170
 same+0.162
 task+0.153
 homework+0.148
 computer+0.143
Neg logit contributions
oped-0.345
unch-0.340
uts-0.337
owing-0.331
oled-0.329
Token evening
Token ID6180
Feature activation+1.29e+01

Pos logit contributions
 homework+0.265
 laundry+0.264
 your+0.247
 task+0.242
 math+0.237
Neg logit contributions
oped-0.417
unch-0.413
m-0.409
hed-0.402
umbs-0.393
Token happy
Token ID3772
Feature activation+9.12e-01

Pos logit contributions
 laundry+0.186
 task+0.183
 homework+0.175
 attic+0.172
 same+0.170
Neg logit contributions
oped-0.148
oled-0.143
unch-0.142
uts-0.140
owing-0.137
Token and
Token ID290
Feature activation-1.29e+01

Pos logit contributions
uts+0.017
oped+0.017
unch+0.017
owing+0.017
oled+0.016
Neg logit contributions
 laundry-0.018
 same-0.017
 task-0.017
 homework-0.016
 attic-0.016
Token content
Token ID2695
Feature activation+3.46e+00

Pos logit contributions
 laundry+0.245
 attic+0.228
 homework+0.226
 task+0.226
 computer+0.226
Neg logit contributions
oped-0.253
uts-0.250
owing-0.242
unch-0.240
oled-0.235
Token.
Token ID13
Feature activation+1.89e+00

Pos logit contributions
oped+0.054
oled+0.050
unch+0.048
uts+0.045
ann+0.045
Neg logit contributions
 same-0.087
 laundry-0.087
 your-0.084
 way-0.082
 task-0.081
Token Jimmy
Token ID12963
Feature activation+3.76e+00

Pos logit contributions
oped+0.034
owing+0.032
oled+0.032
uts+0.032
unch+0.031
Neg logit contributions
 laundry-0.040
 same-0.040
 your-0.039
 attic-0.037
 way-0.037
Token's
Token ID338
Feature activation-1.59e+01

Pos logit contributions
oped+0.070
oled+0.064
uts+0.064
unch+0.063
owing+0.062
Neg logit contributions
 laundry-0.081
 same-0.078
 task-0.076
 attic-0.074
 homework-0.073
Token heart
Token ID2612
Feature activation+1.55e+01

Pos logit contributions
 laundry+0.212
 same+0.200
 task+0.189
 your+0.188
 homework+0.181
Neg logit contributions
oped-0.399
unch-0.391
uts-0.383
owing-0.380
oled-0.378
Token filled
Token ID5901
Feature activation-2.55e+00

Pos logit contributions
oped+0.309
uts+0.297
oled+0.289
unch+0.282
es+0.278
Neg logit contributions
 laundry-0.314
 same-0.296
 task-0.286
 attic-0.284
 your-0.280
Token with
Token ID351
Feature activation-8.93e+00

Pos logit contributions
 laundry+0.055
 same+0.052
 task+0.052
 attic+0.050
 homework+0.049
Neg logit contributions
oped-0.045
uts-0.043
unch-0.042
oled-0.042
es-0.041
Token joy
Token ID8716
Feature activation+1.44e+01

Pos logit contributions
 laundry+0.137
 same+0.132
 task+0.130
 homework+0.128
 attic+0.123
Neg logit contributions
oped-0.206
unch-0.199
oled-0.196
uts-0.196
owing-0.193
Token,
Token ID11
Feature activation-5.95e+00

Pos logit contributions
oped+0.269
unch+0.250
oled+0.245
uts+0.240
oot+0.234
Neg logit contributions
 laundry-0.309
 same-0.303
 your-0.291
 way-0.281
 paper-0.279
Token
Token ID198
Feature activation+3.53e+00

Pos logit contributions
owing+0.075
oped+0.069
ring+0.068
oot+0.065
uts+0.064
Neg logit contributions
 same-0.070
 your-0.069
 laundry-0.068
 new-0.065
 attic-0.063
TokenBut
Token ID1537
Feature activation+2.19e+00

Pos logit contributions
oped+0.063
uts+0.059
oled+0.059
unch+0.058
owing+0.055
Neg logit contributions
 laundry-0.078
 same-0.076
 task-0.072
 your-0.072
 homework-0.070
Token one
Token ID530
Feature activation-1.69e+00

Pos logit contributions
oped+0.038
es+0.036
oled+0.036
oot+0.034
uts+0.034
Neg logit contributions
 laundry-0.045
 task-0.043
 same-0.043
 proper-0.041
 computer-0.040
Token friend
Token ID1545
Feature activation+1.26e+01

Pos logit contributions
 laundry+0.028
 same+0.025
 your+0.022
 attic+0.022
 tasks+0.022
Neg logit contributions
oped-0.037
m-0.037
owing-0.035
unch-0.035
es-0.035
Token did
Token ID750
Feature activation+1.29e+00

Pos logit contributions
oped+0.218
oled+0.203
es+0.203
uts+0.201
unch+0.193
Neg logit contributions
 laundry-0.279
 same-0.271
 task-0.265
 proper-0.255
 attic-0.251
Token not
Token ID407
Feature activation-5.44e+00

Pos logit contributions
oped+0.034
uts+0.033
oled+0.032
unch+0.032
es+0.032
Neg logit contributions
 laundry-0.017
 same-0.017
 task-0.016
 attic-0.015
 proper-0.014
Token come
Token ID1282
Feature activation+2.20e+00

Pos logit contributions
 laundry+0.095
 same+0.092
 task+0.086
 your+0.086
 homework+0.083
Neg logit contributions
oped-0.124
uts-0.118
oled-0.117
unch-0.117
owing-0.114
Token.
Token ID13
Feature activation+4.54e+00

Pos logit contributions
oped+0.048
oled+0.046
unch+0.045
hed+0.044
uts+0.043
Neg logit contributions
 same-0.042
 your-0.042
 laundry-0.041
 our-0.039
 task-0.038
Token His
Token ID2399
Feature activation-8.22e+00

Pos logit contributions
uts+0.092
oped+0.092
oled+0.091
unch+0.085
es+0.085
Neg logit contributions
 laundry-0.084
 same-0.078
 task-0.076
 your-0.075
 way-0.074
Token name
Token ID1438
Feature activation+1.44e+01

Pos logit contributions
 laundry+0.150
 your+0.141
 same+0.137
 attic+0.133
 task+0.131
Neg logit contributions
uts-0.157
oped-0.157
owing-0.152
unch-0.151
es-0.146
Token was
Token ID373
Feature activation-8.41e+00

Pos logit contributions
oped+0.282
uts+0.266
oled+0.264
unch+0.261
owing+0.253
Neg logit contributions
 laundry-0.293
 same-0.286
 task-0.270
 your-0.269
 attic-0.261
Token the
Token ID262
Feature activation-2.00e+01

Pos logit contributions
 laundry+0.165
 task+0.154
 attic+0.153
 same+0.151
 homework+0.143
Neg logit contributions
oped-0.255
unch-0.253
oled-0.253
uts-0.245
hed-0.242
Token bread
Token ID8509
Feature activation+1.19e+01

Pos logit contributions
 homework+0.263
 laundry+0.255
 math+0.249
 attic+0.238
 task+0.235
Neg logit contributions
oped-0.475
m-0.465
uts-0.465
unch-0.464
owing-0.458
Tokencr
Token ID6098
Feature activation+1.33e+01

Pos logit contributions
oped+0.210
oled+0.197
uts+0.195
unch+0.194
owing+0.187
Neg logit contributions
 laundry-0.265
 same-0.260
 your-0.246
 task-0.244
 way-0.242
Tokenumbs
Token ID18146
Feature activation+1.28e+01

Pos logit contributions
uts+0.134
oot+0.126
oped+0.119
owing+0.115
oled+0.114
Neg logit contributions
 same-0.348
 proper-0.340
 laundry-0.329
 your-0.327
 paper-0.322
Token and
Token ID290
Feature activation-9.48e+00

Pos logit contributions
oped+0.240
oled+0.227
uts+0.226
unch+0.226
oot+0.218
Neg logit contributions
 laundry-0.266
 same-0.266
 your-0.250
 way-0.248
 task-0.245
Token had
Token ID550
Feature activation-2.91e+00

Pos logit contributions
 laundry+0.191
 task+0.189
 same+0.188
 attic+0.184
 homework+0.181
Neg logit contributions
oped-0.178
hed-0.166
es-0.166
uts-0.165
oled-0.164
Token lots
Token ID6041
Feature activation+2.83e+00

Pos logit contributions
 laundry+0.050
 task+0.048
 attic+0.047
 same+0.046
 computer+0.046
Neg logit contributions
uts-0.060
oped-0.059
unch-0.059
umbs-0.058
ann-0.058
Token of
Token ID286
Feature activation-1.63e+01

Pos logit contributions
oped+0.050
oled+0.046
uts+0.044
owing+0.044
oling+0.044
Neg logit contributions
 laundry-0.065
 same-0.063
 your-0.062
 way-0.060
 paper-0.060
Token fun
Token ID1257
Feature activation+8.10e+00

Pos logit contributions
 laundry+0.135
 same+0.127
 your+0.109
 task+0.106
 way+0.098
Neg logit contributions
oped-0.529
oled-0.510
uts-0.509
unch-0.503
owing-0.498
Token.
Token ID13
Feature activation+6.62e+00

Pos logit contributions
oped+0.145
oled+0.139
uts+0.127
oot+0.125
unch+0.125
Neg logit contributions
 laundry-0.191
 same-0.179
 your-0.177
 task-0.175
 way-0.173
Token
Token ID198
Feature activation+4.89e+00

Pos logit contributions
oped+0.112
oled+0.106
uts+0.106
unch+0.105
owing+0.099
Neg logit contributions
 laundry-0.152
 same-0.148
 task-0.142
 your-0.140
 homework-0.137
Token
Token ID241
Feature activation+4.43e+00

Pos logit contributions
oped+0.028
oled+0.026
uts+0.026
unch+0.026
owing+0.025
Neg logit contributions
 same-0.030
 laundry-0.030
 your-0.028
 task-0.028
 homework-0.027
TokenI
Token ID40
Feature activation+1.31e+00

Pos logit contributions
oped+0.091
owing+0.089
oot+0.086
es+0.086
oled+0.085
Neg logit contributions
 same-0.084
 laundry-0.081
 your-0.077
 task-0.072
 sunny-0.072
Token love
Token ID1842
Feature activation+6.70e+00

Pos logit contributions
oped+0.027
uts+0.027
oled+0.027
unch+0.026
owing+0.026
Neg logit contributions
 same-0.024
 laundry-0.023
 task-0.023
 your-0.022
 way-0.022
Token this
Token ID428
Feature activation-9.90e+00

Pos logit contributions
uts+0.140
oped+0.140
unch+0.136
oot+0.133
owing+0.132
Neg logit contributions
 laundry-0.117
 same-0.115
 task-0.107
 attic-0.106
 port-0.105
Token tree
Token ID5509
Feature activation+1.57e+01

Pos logit contributions
 laundry+0.196
 same+0.190
 your+0.177
 homework+0.177
 task+0.177
Neg logit contributions
oped-0.189
uts-0.184
unch-0.181
oled-0.178
owing-0.176
Token,
Token ID11
Feature activation-2.15e+00

Pos logit contributions
oped+0.232
uts+0.220
unch+0.214
oled+0.212
owing+0.208
Neg logit contributions
 laundry-0.390
 same-0.381
 task-0.363
 your-0.359
 attic-0.355
Token I
Token ID314
Feature activation+2.06e+00

Pos logit contributions
 laundry+0.038
 same+0.037
 task+0.036
 homework+0.035
 attic+0.034
Neg logit contributions
oped-0.044
uts-0.044
unch-0.043
oled-0.042
oot-0.041
Token love
Token ID1842
Feature activation+6.70e+00

Pos logit contributions
oped+0.046
unch+0.046
uts+0.046
oled+0.045
oot+0.044
Neg logit contributions
 same-0.035
 laundry-0.034
 task-0.033
 your-0.032
 way-0.031
Token this
Token ID428
Feature activation-9.91e+00

Pos logit contributions
uts+0.141
oped+0.141
unch+0.137
owing+0.134
oot+0.133
Neg logit contributions
 laundry-0.117
 same-0.113
 task-0.108
 attic-0.104
 homework-0.103
Token tree
Token ID5509
Feature activation+1.57e+01

Pos logit contributions
 laundry+0.200
 same+0.194
 task+0.183
 your+0.182
 homework+0.178
Neg logit contributions
oped-0.194
uts-0.185
unch-0.183
oled-0.182
owing-0.177
Token!
Token ID0
Feature activation+5.01e+00

Pos logit contributions
oped+0.237
oled+0.218
uts+0.218
unch+0.216
owing+0.206
Neg logit contributions
 laundry-0.389
 same-0.383
 task-0.365
 your-0.365
 way-0.355
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
oped+0.160
uts+0.157
owing+0.145
unch+0.143
oled+0.141
Neg logit contributions
 laundry-0.348
 same-0.338
 task-0.325
 attic-0.322
 your-0.320
Token
Token ID198
Feature activation-1.10e+00

Pos logit contributions
 same+0.018
 laundry+0.018
 your+0.017
 way+0.016
 math+0.016
Neg logit contributions
oped-0.023
owing-0.022
oled-0.022
uts-0.022
unch-0.022
Token
Token ID198
Feature activation-2.06e-01

Pos logit contributions
 your+0.023
 same+0.022
 laundry+0.022
 attic+0.021
 math+0.021
Neg logit contributions
owing-0.021
oped-0.020
oled-0.019
ring-0.019
uts-0.019
TokenJerry
Token ID43462
Feature activation+7.48e+00

Pos logit contributions
 laundry+0.004
 your+0.004
 same+0.004
 task+0.004
 math+0.004
Neg logit contributions
oped-0.004
owing-0.004
oled-0.004
uts-0.004
ann-0.004
Token realized
Token ID6939
Feature activation-1.07e+00

Pos logit contributions
oped+0.151
uts+0.147
unch+0.146
oled+0.144
owing+0.140
Neg logit contributions
 same-0.143
 laundry-0.143
 your-0.133
 task-0.132
 way-0.131
Token that
Token ID326
Feature activation-1.08e+01

Pos logit contributions
 laundry+0.022
 same+0.022
 task+0.020
 your+0.020
 attic+0.020
Neg logit contributions
oped-0.020
uts-0.020
oled-0.019
unch-0.019
owing-0.019
Token sometimes
Token ID3360
Feature activation-2.76e+00

Pos logit contributions
 same+0.207
 laundry+0.207
 task+0.194
 proper+0.192
 attic+0.186
Neg logit contributions
uts-0.203
unch-0.193
oped-0.190
es-0.186
oled-0.185
Token being
Token ID852
Feature activation-5.16e+00

Pos logit contributions
 laundry+0.051
 same+0.049
 attic+0.047
 apr+0.046
 proper+0.046
Neg logit contributions
uts-0.051
es-0.050
owing-0.048
oped-0.048
aring-0.047
Token tall
Token ID7331
Feature activation-5.69e-01

Pos logit contributions
 laundry+0.093
 attic+0.087
 table+0.085
 task+0.084
 shelf+0.081
Neg logit contributions
owing-0.107
uts-0.106
aring-0.106
iled-0.103
es-0.102
Token can
Token ID460
Feature activation+2.81e-01

Pos logit contributions
 laundry+0.012
 same+0.011
 attic+0.011
 task+0.011
 your+0.011
Neg logit contributions
oped-0.010
uts-0.010
es-0.010
owing-0.010
oled-0.009
Token help
Token ID1037
Feature activation+2.91e-01

Pos logit contributions
oped+0.008
uts+0.007
unch+0.007
oled+0.007
owing+0.007
Neg logit contributions
 laundry-0.004
 same-0.003
 task-0.003
 your-0.003
 attic-0.003
Token like
Token ID588
Feature activation-9.70e+00

Pos logit contributions
 laundry+0.203
 task+0.183
 attic+0.180
 computer+0.177
 same+0.177
Neg logit contributions
oped-0.172
uts-0.162
oled-0.162
owing-0.162
aring-0.159
Token cake
Token ID12187
Feature activation+1.13e+01

Pos logit contributions
 task+0.220
 laundry+0.218
 same+0.211
 attic+0.201
 job+0.197
Neg logit contributions
uts-0.152
es-0.126
pt-0.123
oled-0.119
oot-0.119
Token.
Token ID13
Feature activation+1.82e-01

Pos logit contributions
oped+0.156
uts+0.152
unch+0.144
oled+0.144
owing+0.141
Neg logit contributions
 laundry-0.288
 same-0.284
 task-0.271
 your-0.267
 attic-0.265
Token Today
Token ID6288
Feature activation+6.09e+00

Pos logit contributions
oped+0.005
owing+0.004
oot+0.004
oled+0.004
uts+0.004
Neg logit contributions
 laundry-0.003
 your-0.003
 same-0.003
 our-0.002
 attic-0.002
Token is
Token ID318
Feature activation-1.11e+01

Pos logit contributions
uts+0.082
oped+0.076
es+0.073
m+0.072
unch+0.072
Neg logit contributions
 laundry-0.146
 same-0.144
 proper-0.138
 computer-0.138
 apr-0.137
Token their
Token ID511
Feature activation-2.03e+01

Pos logit contributions
 same+0.159
 garage+0.158
 laundry+0.157
 task+0.155
 attic+0.154
Neg logit contributions
m-0.191
.-0.188
pt-0.187
ful-0.186
uts-0.185
Token birthday
Token ID10955
Feature activation+8.21e+00

Pos logit contributions
 laundry+0.257
 your+0.235
 same+0.230
 proper+0.227
 attic+0.225
Neg logit contributions
uts-0.449
unch-0.426
m-0.422
owing-0.420
qu-0.408
Token.
Token ID13
Feature activation+1.84e+00

Pos logit contributions
oped+0.107
uts+0.103
oled+0.098
unch+0.097
owing+0.095
Neg logit contributions
 laundry-0.214
 same-0.211
 task-0.201
 your-0.199
 attic-0.197
Token Mom
Token ID11254
Feature activation+6.01e+00

Pos logit contributions
oped+0.042
owing+0.040
oot+0.039
unch+0.039
oled+0.039
Neg logit contributions
 laundry-0.032
 your-0.031
 same-0.030
 our-0.029
 task-0.028
Token made
Token ID925
Feature activation-5.62e+00

Pos logit contributions
oped+0.138
oled+0.131
uts+0.130
owing+0.128
unch+0.128
Neg logit contributions
 laundry-0.102
 same-0.096
 task-0.093
 attic-0.088
 your-0.087
Token a
Token ID257
Feature activation-1.57e+01

Pos logit contributions
 laundry+0.080
 same+0.077
 task+0.074
 attic+0.072
 way+0.069
Neg logit contributions
oped-0.137
uts-0.135
unch-0.135
oled-0.133
owing-0.130
Token easier
Token ID4577
Feature activation+5.30e+00

Pos logit contributions
 laundry+0.183
 same+0.173
 your+0.157
 task+0.152
 homework+0.148
Neg logit contributions
oped-0.448
uts-0.435
unch-0.434
oled-0.430
owing-0.429
Token way
Token ID835
Feature activation+2.36e+00

Pos logit contributions
uts+0.135
oot+0.134
owing+0.134
oped+0.133
unch+0.128
Neg logit contributions
 laundry-0.052
 same-0.051
 your-0.051
 our-0.048
 my-0.045
Token to
Token ID284
Feature activation-1.70e+01

Pos logit contributions
oped+0.045
oled+0.042
unch+0.042
uts+0.042
owing+0.040
Neg logit contributions
 laundry-0.050
 same-0.049
 your-0.046
 task-0.046
 way-0.045
Token go
Token ID467
Feature activation+1.94e-01

Pos logit contributions
 laundry+0.241
 same+0.232
 task+0.211
 attic+0.210
 your+0.209
Neg logit contributions
oped-0.427
uts-0.408
unch-0.407
oled-0.403
owing-0.401
Token around
Token ID1088
Feature activation-2.42e+00

Pos logit contributions
oped+0.003
oled+0.003
uts+0.003
unch+0.003
owing+0.003
Neg logit contributions
 laundry-0.004
 same-0.004
 your-0.004
 task-0.004
 way-0.004
Token the
Token ID262
Feature activation-2.23e+01

Pos logit contributions
 laundry+0.040
 same+0.039
 task+0.037
 your+0.035
 homework+0.035
Neg logit contributions
oped-0.055
oled-0.053
uts-0.053
unch-0.052
owing-0.051
Token block
Token ID2512
Feature activation+1.26e+01

Pos logit contributions
 laundry+0.284
 same+0.263
 task+0.247
 your+0.245
 homework+0.238
Neg logit contributions
oped-0.598
uts-0.574
unch-0.574
oled-0.571
owing-0.561
Token.
Token ID13
Feature activation+4.47e+00

Pos logit contributions
oped+0.199
oled+0.189
uts+0.184
unch+0.183
owing+0.171
Neg logit contributions
 laundry-0.302
 same-0.299
 task-0.285
 way-0.283
 your-0.282
Token
Token ID198
Feature activation+2.85e+00

Pos logit contributions
oped+0.096
oled+0.092
uts+0.092
owing+0.089
oot+0.088
Neg logit contributions
 same-0.081
 laundry-0.080
 your-0.076
 way-0.074
 task-0.074
Token
Token ID198
Feature activation+2.80e+00

Pos logit contributions
owing+0.062
ring+0.061
oped+0.060
oled+0.059
ked+0.057
Neg logit contributions
 your-0.049
 laundry-0.048
 same-0.048
 new-0.047
 our-0.045
TokenJoe
Token ID19585
Feature activation+9.93e+00

Pos logit contributions
oped+0.055
owing+0.054
oled+0.053
uts+0.052
oot+0.051
Neg logit contributions
 laundry-0.055
 same-0.053
 your-0.052
 task-0.052
 our-0.049
Token write
Token ID3551
Feature activation+1.73e+01

Pos logit contributions
oped+0.031
uts+0.030
oled+0.030
unch+0.029
owing+0.029
Neg logit contributions
 laundry-0.027
 same-0.026
 task-0.025
 your-0.025
 attic-0.024
Token a
Token ID257
Feature activation-1.41e+01

Pos logit contributions
oped+0.324
uts+0.323
es+0.320
oled+0.309
oot+0.296
Neg logit contributions
 laundry-0.358
 same-0.337
 task-0.332
 attic-0.320
 way-0.316
Token sorry
Token ID7926
Feature activation+1.09e+01

Pos logit contributions
 laundry+0.207
 same+0.187
 your+0.174
 attic+0.174
 task+0.171
Neg logit contributions
oped-0.334
uts-0.329
oled-0.326
unch-0.322
oing-0.313
Token letter
Token ID3850
Feature activation+1.89e+01

Pos logit contributions
uts+0.208
oot+0.197
oped+0.197
oled+0.195
es+0.194
Neg logit contributions
 laundry-0.211
 same-0.198
 task-0.188
 homework-0.185
 attic-0.184
Token to
Token ID284
Feature activation-1.60e+01

Pos logit contributions
oped+0.336
unch+0.322
oled+0.312
ann+0.295
uts+0.293
Neg logit contributions
 same-0.422
 laundry-0.418
 your-0.402
 paper-0.399
 homework-0.393
Token the
Token ID262
Feature activation-2.16e+01

Pos logit contributions
 laundry+0.239
 task+0.232
 same+0.227
 attic+0.213
 proper+0.205
Neg logit contributions
uts-0.343
es-0.337
oot-0.336
oped-0.331
qu-0.327
Token teacher
Token ID4701
Feature activation+1.70e+01

Pos logit contributions
 laundry+0.272
 proper+0.267
 your+0.258
 our+0.245
 task+0.241
Neg logit contributions
uts-0.494
m-0.493
oled-0.485
unch-0.472
qu-0.469
Token.
Token ID13
Feature activation+5.24e+00

Pos logit contributions
oped+0.271
unch+0.261
oled+0.249
uts+0.246
oot+0.237
Neg logit contributions
 laundry-0.408
 same-0.404
 your-0.396
 paper-0.387
 homework-0.386
Token They
Token ID1119
Feature activation+9.91e-01

Pos logit contributions
oped+0.116
uts+0.110
oled+0.110
unch+0.109
owing+0.107
Neg logit contributions
 laundry-0.094
 same-0.092
 your-0.086
 task-0.086
 attic-0.083
Token draw
Token ID3197
Feature activation+9.33e+00

Pos logit contributions
oped+0.023
uts+0.022
oled+0.022
unch+0.021
owing+0.021
Neg logit contributions
 laundry-0.016
 same-0.016
 task-0.015
 your-0.015
 attic-0.014
Token a
Token ID257
Feature activation-1.47e+01

Pos logit contributions
oped+0.188
uts+0.181
oled+0.178
unch+0.175
es+0.173
Neg logit contributions
 laundry-0.183
 same-0.176
 task-0.169
 your-0.163
 way-0.162
Token moved
Token ID3888
Feature activation+1.12e+01

Pos logit contributions
oped+0.313
uts+0.296
oled+0.293
unch+0.292
owing+0.283
Neg logit contributions
 laundry-0.335
 same-0.327
 task-0.308
 your-0.307
 attic-0.299
Token closer
Token ID5699
Feature activation+6.45e+00

Pos logit contributions
oped+0.224
unch+0.215
uts+0.214
oled+0.212
owing+0.203
Neg logit contributions
 laundry-0.225
 same-0.222
 your-0.209
 task-0.207
 paper-0.200
Token and
Token ID290
Feature activation-9.38e+00

Pos logit contributions
oped+0.141
oled+0.134
unch+0.131
uts+0.125
oling+0.122
Neg logit contributions
 same-0.123
 laundry-0.121
 your-0.116
 way-0.116
 paper-0.113
Token closer
Token ID5699
Feature activation+6.53e+00

Pos logit contributions
 laundry+0.234
 task+0.216
 homework+0.214
 attic+0.213
 same+0.212
Neg logit contributions
oped-0.147
oled-0.128
es-0.127
hed-0.125
ed-0.125
Token to
Token ID284
Feature activation-1.45e+01

Pos logit contributions
oped+0.131
unch+0.126
oled+0.126
umbs+0.120
uts+0.118
Neg logit contributions
 same-0.133
 your-0.129
 laundry-0.128
 way-0.123
 paper-0.122
Token the
Token ID262
Feature activation-1.96e+01

Pos logit contributions
 laundry+0.249
 homework+0.234
 task+0.229
 proper+0.217
 computer+0.216
Neg logit contributions
oped-0.291
hed-0.269
oled-0.267
es-0.267
bird-0.263
Token tree
Token ID5509
Feature activation+1.49e+01

Pos logit contributions
 laundry+0.279
 same+0.264
 your+0.262
 homework+0.252
 task+0.245
Neg logit contributions
oped-0.474
unch-0.459
oled-0.454
uts-0.452
owing-0.438
Token.
Token ID13
Feature activation+6.50e+00

Pos logit contributions
oped+0.239
oled+0.238
uts+0.231
unch+0.228
amb+0.223
Neg logit contributions
 same-0.345
 laundry-0.338
 way-0.337
 table-0.329
 paper-0.325
Token Then
Token ID3244
Feature activation+7.32e+00

Pos logit contributions
oped+0.131
uts+0.125
oled+0.123
unch+0.123
owing+0.118
Neg logit contributions
 laundry-0.129
 same-0.126
 task-0.119
 your-0.117
 homework-0.114
Token,
Token ID11
Feature activation-2.54e+00

Pos logit contributions
oped+0.116
es+0.114
uts+0.106
oled+0.103
ed+0.101
Neg logit contributions
 laundry-0.169
 same-0.163
 task-0.158
 attic-0.157
 proper-0.155
Token a
Token ID257
Feature activation-1.22e+01

Pos logit contributions
 laundry+0.047
 same+0.044
 task+0.043
 proper+0.042
 attic+0.042
Neg logit contributions
oped-0.051
es-0.050
uts-0.048
oled-0.047
ed-0.047
Token difference
Token ID3580
Feature activation+1.54e+01

Pos logit contributions
 your+0.256
 laundry+0.250
 attic+0.244
 basement+0.236
 proper+0.234
Neg logit contributions
m-0.479
unch-0.476
uts-0.466
oped-0.458
oled-0.453
Token between
Token ID1022
Feature activation-9.88e+00

Pos logit contributions
oped+0.249
uts+0.237
es+0.233
oled+0.232
owing+0.231
Neg logit contributions
 laundry-0.352
 same-0.342
 task-0.333
 proper-0.320
 attic-0.319
Token the
Token ID262
Feature activation-2.03e+01

Pos logit contributions
 laundry+0.136
 same+0.132
 your+0.125
 task+0.118
 paper+0.115
Neg logit contributions
oped-0.262
uts-0.249
oled-0.248
unch-0.247
owing-0.245
Token lemon
Token ID18873
Feature activation+1.25e+01

Pos logit contributions
 laundry+0.238
 your+0.226
 same+0.221
 proper+0.211
 homework+0.207
Neg logit contributions
unch-0.509
oped-0.507
m-0.498
oled-0.494
uts-0.492
Token and
Token ID290
Feature activation-9.36e+00

Pos logit contributions
oped+0.186
oled+0.171
unch+0.168
uts+0.166
owing+0.162
Neg logit contributions
 laundry-0.315
 same-0.309
 task-0.295
 your-0.294
 way-0.288
Token the
Token ID262
Feature activation-2.02e+01

Pos logit contributions
 task+0.154
 laundry+0.153
 attic+0.151
 basement+0.145
 port+0.144
Neg logit contributions
es-0.196
uts-0.186
oped-0.179
 stro-0.176
hed-0.176
Token orange
Token ID10912
Feature activation+7.41e+00

Pos logit contributions
 laundry+0.274
 your+0.259
 same+0.255
 task+0.249
 proper+0.245
Neg logit contributions
oped-0.476
unch-0.469
oled-0.466
uts-0.464
m-0.458
Token.
Token ID13
Feature activation+6.16e+00

Pos logit contributions
oped+0.124
oled+0.115
uts+0.114
unch+0.112
owing+0.110
Neg logit contributions
 laundry-0.175
 same-0.172
 task-0.163
 your-0.161
 way-0.158
Token
Token ID198
Feature activation+4.28e+00

Pos logit contributions
oped+0.137
owing+0.132
oled+0.126
unch+0.123
oot+0.123
Neg logit contributions
 same-0.111
 laundry-0.110
 your-0.107
 math-0.101
 paper-0.099
Token
Token ID198
Feature activation+4.30e+00

Pos logit contributions
oped+0.086
owing+0.085
ring+0.081
oled+0.080
ified+0.077
Neg logit contributions
 your-0.087
 same-0.085
 laundry-0.081
 new-0.081
 our-0.080
TokenTom
Token ID13787
Feature activation+1.03e+01

Pos logit contributions
oped+0.092
owing+0.091
oled+0.087
uts+0.086
ann+0.086
Neg logit contributions
 laundry-0.078
 your-0.078
 same-0.077
 math-0.071
 homework-0.071