TOP ACTIVATIONS
MAX = 0.268

his coins . She wished she could hug her mom and
and ashamed . She wished she had not che wed the
and lonely . She wished she never found the ring .
L ily wished she could own Max . She
and cold . She wished she had listened to her mom
and sad . She wished she had tid ied up her
was very sad and wished she had listened to the rules
ons , but she wished she had someone to share them
felt so sad and wished she had listened to her mom
and sad . She wished she had listened to her mom
hard lesson . She wished she had listened to her mom
princess es . She wished she could go to that world
very sorry , and wished she had not waved the purse
, the baby bear knew she could always rely on her
home . Her mom said she would accept if it was
and friends . She wished she had listened to her mom
too fast . She wished she was brave like Ben .
was very sad and wished she had known how to help
it too . She thought she could wash his eye with
and sad . She says she is sorry to her dad

MIN ACTIVATIONS
MIN = -0.284

sorts of adventures in the vast ocean . She was happy
the stars and planets in outer space . So one day
food for the rh in oc eros and his friends .
. She grabbed the fire extingu isher and put out the
what the rocket discovered in outer space . It was a
and took the rh in oc eros home with her .
quickly checked for a fire extingu isher , but there wasn
good to roll in the lush green grass and jump high
farmer who lived on a lush farm with many animals and
. It ran through a lush green me adow , with
have stopped the rh in oc eros . He is sleeping
trapped with the rh in oc eros . They looked for
to see the rh in oc eros anymore . He is
look for the rh in oc eros . They
kinds of wonders in the big woods . One
izz ed off back into outer space . Earth was an
dog who lived in a big house with her owner ,
where he belonged in the big ocean . The fish listened
it and posted it on social media . L
try to grab the av oc ados . But the rock

INTERVAL 0.130 - 0.268
CONTAINS 12.323%

had a toy microphone that she used to sing into .
felt proud of what she had done . When she went
buckle . Her mom my had bought it for her at
ran to the balloon . It was tied to a pole
at Tom and Sue . He nods his head . He

INTERVAL -0.008 - 0.130
CONTAINS 33.086%

that was much smaller . But the kitten was still sad
it was too late . Sara started to cry . She
. He saw Lily and said , " Sh oo ,
nose . She saw a bird who was fast and she
and walked away . She went inside and found her Mom

INTERVAL -0.146 - -0.008
CONTAINS 52.069%

bottle . It was red and had spark les all over
. He thinks Tim is mean . He goes to his
the rabbit forgot about the carrot and ran away as fast
them . They have each other . They are quiet and
and didn 't shine very much . Then one day ,

INTERVAL -0.284 - -0.146
CONTAINS 2.522%

. Whenever she was in the park she always spun ,
. It was a big , round , yellow ball .
He noticed another infant in the garden who was also playing
loved to play outside in the sun . One day ,
airplane . It took a little bit of practice , but
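
The four intervals above appear to be equal-width bins spanning the range from MIN to MAX. The dump does not state this, so treat it as an assumption. A minimal Python sketch that recomputes the bin edges from the two extremes and sums the reported shares (the function name `interval_edges` is hypothetical, not part of any tool):

```python
def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]

# Values copied from the MIN / MAX lines of this dump.
edges = interval_edges(-0.284, 0.268)

# Shares copied from the CONTAINS lines, highest interval first.
shares = [12.323, 33.086, 52.069, 2.522]
total = sum(shares)

print(edges)
print(round(total, 3))
```

Under the equal-width assumption the edges come out at -0.284, -0.146, -0.008, 0.130 and 0.268, which matches the INTERVAL lines, and the four shares add up to 100%.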

PER-TOKEN DETAILS

Token his
Token ID465
Feature activation+6.80e-02

Pos logit contributions
 managed+0.109
pped+0.102
'd+0.102
ched+0.101
cher+0.093
Neg logit contributions
 Grand-0.089
 festivals-0.088
 grand-0.088
ummy-0.087
 Aunt-0.087
Token coins
Token ID10796
Feature activation+8.55e-02

Pos logit contributions
 managed+0.151
'd+0.142
ched+0.139
pped+0.138
 stayed+0.131
Neg logit contributions
ummy-0.093
 festivals-0.092
igsaw-0.092
 Grand-0.092
 Aunt-0.091
Token.
Token ID13
Feature activation-2.37e-02

Pos logit contributions
 managed+0.182
pped+0.175
ched+0.170
'd+0.168
tted+0.156
Neg logit contributions
 grand-0.121
ummy-0.120
 Grand-0.116
 batch-0.112
igsaw-0.105
Token She
Token ID1375
Feature activation+1.99e-01

Pos logit contributions
 batch+0.038
igsaw+0.037
ummy+0.036
 festivals+0.036
 grand+0.035
Neg logit contributions
 managed-0.046
'd-0.044
 could-0.043
ched-0.043
pped-0.041
Token wished
Token ID16555
Feature activation+8.76e-02

Pos logit contributions
 managed+0.288
pped+0.285
ched+0.273
cher+0.264
ier+0.263
Neg logit contributions
 Grand-0.422
 grand-0.419
 batch-0.407
 Aunt-0.400
 festivals-0.385
Token she
Token ID673
Feature activation+2.57e-01

Pos logit contributions
 managed+0.172
'd+0.159
ched+0.157
pped+0.156
 stayed+0.144
Neg logit contributions
ummy-0.144
igsaw-0.143
 festivals-0.143
 Grand-0.143
 batch-0.142
Token could
Token ID714
Feature activation+1.64e-01

Pos logit contributions
pped+0.367
ched+0.366
ps+0.348
amed+0.339
ier+0.339
Neg logit contributions
 grand-0.548
 Grand-0.524
 Aunt-0.480
 Wednesday-0.457
 Sunday-0.444
Token hug
Token ID16225
Feature activation-3.32e-04

Pos logit contributions
 managed+0.317
pped+0.307
ched+0.299
'd+0.285
ier+0.279
Neg logit contributions
 Grand-0.265
 grand-0.264
 Aunt-0.253
 festivals-0.252
ummy-0.249
Token her
Token ID607
Feature activation+6.51e-02

Pos logit contributions
igsaw+0.001
 batch+0.001
ummy+0.001
 festivals+0.001
 Of+0.001
Neg logit contributions
 managed-0.001
pped-0.000
'd-0.000
ched-0.000
 wouldn-0.000
Token mom
Token ID1995
Feature activation+5.13e-02

Pos logit contributions
 managed+0.121
'd+0.114
ched+0.107
pped+0.107
cher+0.107
Neg logit contributions
igsaw-0.111
 festivals-0.110
ummy-0.108
 Of-0.107
 batch-0.107
Token and
Token ID290
Feature activation-7.17e-02

Pos logit contributions
 managed+0.102
ched+0.094
pped+0.094
'd+0.091
tted+0.085
Neg logit contributions
 Grand-0.088
 grand-0.086
 Aunt-0.086
ummy-0.085
 batch-0.084
Token and
Token ID290
Feature activation-5.78e-02

Pos logit contributions
 managed+0.011
pped+0.010
'd+0.010
ched+0.010
cher+0.009
Neg logit contributions
 Grand-0.007
ummy-0.007
 grand-0.007
 Aunt-0.007
 Of-0.007
Token ashamed
Token ID22461
Feature activation-2.48e-02

Pos logit contributions
 grand+0.109
 Grand+0.109
 Aunt+0.107
 Gig+0.100
 Of+0.096
Neg logit contributions
pped-0.108
 managed-0.098
'd-0.097
ched-0.095
ier-0.094
Token.
Token ID13
Feature activation-3.88e-02

Pos logit contributions
 Grand+0.037
ummy+0.036
 Aunt+0.036
 Of+0.036
 grand+0.035
Neg logit contributions
 managed-0.056
'd-0.052
pped-0.051
cher-0.050
ched-0.049
Token She
Token ID1375
Feature activation+1.91e-01

Pos logit contributions
 batch+0.084
igsaw+0.080
 festivals+0.078
 orig+0.075
 fireworks+0.075
Neg logit contributions
 managed-0.056
 could-0.055
ched-0.050
'd-0.049
pped-0.048
Token wished
Token ID16555
Feature activation+7.49e-02

Pos logit contributions
ier+0.321
cher+0.321
iers+0.315
pped+0.303
tted+0.301
Neg logit contributions
 grand-0.385
 Grand-0.367
 Aunt-0.345
 Gig-0.313
 Uncle-0.306
Token she
Token ID673
Feature activation+2.56e-01

Pos logit contributions
 managed+0.147
'd+0.136
ched+0.134
pped+0.134
 stayed+0.123
Neg logit contributions
ummy-0.124
 Grand-0.123
 festivals-0.123
igsaw-0.123
 grand-0.122
Token had
Token ID550
Feature activation+2.01e-01

Pos logit contributions
iers+0.434
pered+0.427
iece+0.415
ier+0.414
ched+0.405
Neg logit contributions
 grand-0.474
 Grand-0.455
 Aunt-0.423
 grandma-0.396
 Wednesday-0.385
Token not
Token ID407
Feature activation+1.16e-01

Pos logit contributions
ched+0.284
tted+0.273
ier+0.269
pped+0.268
'd+0.262
Neg logit contributions
 grand-0.439
 Grand-0.436
 Aunt-0.414
 Wednesday-0.397
 Sunday-0.395
Token che
Token ID1125
Feature activation-5.78e-02

Pos logit contributions
pped+0.134
'd+0.132
 managed+0.131
ched+0.130
cher+0.125
Neg logit contributions
 grand-0.267
 Grand-0.266
 Aunt-0.254
 festivals-0.249
 history-0.245
Tokenwed
Token ID19103
Feature activation+1.23e-02

Pos logit contributions
 batch+0.106
ummy+0.099
igsaw+0.099
 common+0.097
 Sunday+0.097
Neg logit contributions
'd-0.090
 managed-0.082
cher-0.081
tted-0.079
 could-0.076
Token the
Token ID262
Feature activation-1.20e-01

Pos logit contributions
 managed+0.027
'd+0.025
ched+0.024
pped+0.024
cher+0.023
Neg logit contributions
 Grand-0.018
ummy-0.018
 festivals-0.018
 grand-0.018
 Aunt-0.018
Token and
Token ID290
Feature activation-6.25e-02

Pos logit contributions
 Grand+0.074
ummy+0.074
 grand+0.073
 Aunt+0.072
 batch+0.070
Neg logit contributions
 managed-0.102
'd-0.095
pped-0.093
ched-0.092
cher-0.088
Token lonely
Token ID21757
Feature activation-6.08e-02

Pos logit contributions
 Grand+0.102
 grand+0.101
 Aunt+0.097
 Uncle+0.086
 sunset+0.086
Neg logit contributions
 managed-0.125
'd-0.124
pped-0.124
tted-0.119
ched-0.117
Token.
Token ID13
Feature activation-3.37e-02

Pos logit contributions
 Grand+0.069
 grand+0.066
 Aunt+0.064
 Sunday+0.062
 Wednesday+0.060
Neg logit contributions
 managed-0.156
pped-0.148
'd-0.147
ched-0.145
tted-0.137
Token She
Token ID1375
Feature activation+1.92e-01

Pos logit contributions
 batch+0.062
igsaw+0.059
 festivals+0.059
ummy+0.058
 grand+0.057
Neg logit contributions
 managed-0.059
 could-0.056
'd-0.055
ched-0.054
pped-0.053
Token wished
Token ID16555
Feature activation+7.49e-02

Pos logit contributions
cher+0.342
ier+0.333
iers+0.321
tted+0.315
pped+0.309
Neg logit contributions
 grand-0.387
 Grand-0.370
 Aunt-0.343
 solar-0.308
 T-0.302
Token she
Token ID673
Feature activation+2.56e-01

Pos logit contributions
 managed+0.145
'd+0.135
ched+0.133
pped+0.132
 stayed+0.122
Neg logit contributions
ummy-0.125
 festivals-0.124
 Grand-0.124
igsaw-0.124
 grand-0.124
Token never
Token ID1239
Feature activation+1.42e-01

Pos logit contributions
ier+0.402
iers+0.402
ched+0.396
amed+0.384
pered+0.384
Neg logit contributions
 grand-0.507
 Grand-0.490
 Aunt-0.464
 grandma-0.437
 Wednesday-0.422
Token found
Token ID1043
Feature activation+1.12e-01

Pos logit contributions
pped+0.177
ched+0.173
 managed+0.172
'd+0.158
tted+0.155
Neg logit contributions
 grand-0.328
 Grand-0.326
 Aunt-0.306
 festivals-0.305
 Of-0.296
Token the
Token ID262
Feature activation-9.70e-02

Pos logit contributions
 managed+0.249
'd+0.240
ched+0.228
pped+0.228
cher+0.215
Neg logit contributions
 Grand-0.168
 Aunt-0.164
 grand-0.161
 festivals-0.154
ummy-0.152
Token ring
Token ID5858
Feature activation+2.38e-03

Pos logit contributions
ummy+0.147
 Of+0.147
igsaw+0.146
 batch+0.143
 festivals+0.143
Neg logit contributions
 managed-0.196
'd-0.189
pped-0.174
 wouldn-0.172
ched-0.172
Token.
Token ID13
Feature activation-5.08e-02

Pos logit contributions
 managed+0.005
pped+0.005
ched+0.005
'd+0.004
tted+0.004
Neg logit contributions
 Grand-0.004
 Aunt-0.004
 grand-0.004
ummy-0.004
 Of-0.004
Token\n
Token ID198
Feature activation-2.60e-02

Pos logit contributions
 batch+0.052
igsaw+0.051
 festivals+0.050
ummy+0.050
 grand+0.048
Neg logit contributions
 managed-0.069
'd-0.065
ched-0.063
 could-0.063
pped-0.061
Token\n
Token ID198
Feature activation-3.22e-02

Pos logit contributions
 batch+0.041
igsaw+0.041
ummy+0.041
 festivals+0.041
 Of+0.040
Neg logit contributions
 managed-0.051
ched-0.048
pped-0.048
'd-0.047
 could-0.045
TokenL
Token ID43
Feature activation+1.70e-02

Pos logit contributions
ummy+0.059
 batch+0.058
 Grand+0.057
 Aunt+0.057
igsaw+0.057
Neg logit contributions
 managed-0.059
pped-0.054
ched-0.054
'd-0.053
 could-0.048
Tokenily
Token ID813
Feature activation+1.11e-01

Pos logit contributions
 managed+0.024
'd+0.021
ched+0.019
 wouldn+0.019
pped+0.018
Neg logit contributions
ummy-0.039
 Grand-0.038
igsaw-0.038
 batch-0.037
 grand-0.037
Token wished
Token ID16555
Feature activation+7.76e-02

Pos logit contributions
 managed+0.181
pped+0.176
ched+0.174
cher+0.173
ier+0.169
Neg logit contributions
 grand-0.213
 Grand-0.213
 Aunt-0.207
 batch-0.201
ummy-0.194
Token she
Token ID673
Feature activation+2.56e-01

Pos logit contributions
 managed+0.157
'd+0.145
ched+0.145
pped+0.144
cher+0.133
Neg logit contributions
 Grand-0.127
 grand-0.125
 Aunt-0.125
ummy-0.124
 festivals-0.123
Token could
Token ID714
Feature activation+1.59e-01

Pos logit contributions
pped+0.373
ched+0.361
iers+0.338
pered+0.332
ps+0.332
Neg logit contributions
 grand-0.573
 Grand-0.560
 Aunt-0.527
 Wednesday-0.491
 Sunday-0.478
Token own
Token ID898
Feature activation-3.36e-02

Pos logit contributions
 managed+0.331
pped+0.320
ched+0.308
ier+0.296
cher+0.286
Neg logit contributions
 Aunt-0.246
 Grand-0.246
 grand-0.245
 festivals-0.231
 Sunday-0.224
Token Max
Token ID5436
Feature activation+1.23e-01

Pos logit contributions
ummy+0.051
igsaw+0.050
 Grand+0.050
 festivals+0.050
 batch+0.050
Neg logit contributions
 managed-0.071
'd-0.066
ched-0.065
pped-0.065
 stayed-0.060
Token.
Token ID13
Feature activation-4.77e-02

Pos logit contributions
 managed+0.238
pped+0.235
ched+0.220
'd+0.211
tted+0.202
Neg logit contributions
 grand-0.206
 festivals-0.202
 Grand-0.201
 Aunt-0.197
 batch-0.192
Token She
Token ID1375
Feature activation+1.81e-01

Pos logit contributions
 batch+0.083
 festivals+0.082
igsaw+0.081
ummy+0.081
 Gig+0.076
Neg logit contributions
 managed-0.088
'd-0.082
 could-0.079
ched-0.079
pped-0.077
Token and
Token ID290
Feature activation-9.69e-02

Pos logit contributions
ummy+0.152
 Grand+0.146
 grand+0.143
 Aunt+0.143
 festivals+0.142
Neg logit contributions
 managed-0.199
'd-0.190
ched-0.183
cher-0.178
pped-0.177
Token cold
Token ID4692
Feature activation-8.70e-02

Pos logit contributions
 Grand+0.171
 Aunt+0.168
 grand+0.168
 batch+0.163
 Sunday+0.158
Neg logit contributions
 managed-0.176
pped-0.171
'd-0.169
ched-0.169
cher-0.165
Token.
Token ID13
Feature activation-3.21e-02

Pos logit contributions
ummy+0.140
 Grand+0.136
 Aunt+0.134
 festivals+0.132
 grand+0.131
Neg logit contributions
 managed-0.182
'd-0.174
ched-0.166
pped-0.163
cher-0.162
Token She
Token ID1375
Feature activation+1.92e-01

Pos logit contributions
igsaw+0.058
 festivals+0.056
 batch+0.055
ummy+0.053
 grand+0.053
Neg logit contributions
 managed-0.057
 could-0.054
ched-0.053
'd-0.051
pped-0.050
Token wished
Token ID16555
Feature activation+7.67e-02

Pos logit contributions
cher+0.293
ier+0.281
tted+0.276
pped+0.271
 managed+0.261
Neg logit contributions
 grand-0.409
 Grand-0.408
 Aunt-0.387
 batch-0.373
 Wednesday-0.358
Token she
Token ID673
Feature activation+2.56e-01

Pos logit contributions
 managed+0.147
'd+0.136
ched+0.135
pped+0.134
 stayed+0.124
Neg logit contributions
igsaw-0.129
 festivals-0.129
ummy-0.129
 Grand-0.128
 batch-0.128
Token had
Token ID550
Feature activation+2.00e-01

Pos logit contributions
ier+0.353
cher+0.335
ps+0.332
ched+0.331
iece+0.323
Neg logit contributions
 Grand-0.567
 grand-0.553
 Aunt-0.539
 Wednesday-0.514
 M-0.511
Token listened
Token ID16399
Feature activation+1.44e-03

Pos logit contributions
ched+0.287
cher+0.277
 managed+0.269
tted+0.269
pped+0.263
Neg logit contributions
 Grand-0.428
 grand-0.420
 Sunday-0.411
 Aunt-0.409
 Wednesday-0.402
Token to
Token ID284
Feature activation-1.67e-02

Pos logit contributions
 managed+0.003
pped+0.003
'd+0.003
ched+0.003
cher+0.003
Neg logit contributions
 grand-0.002
 Grand-0.002
 Aunt-0.002
ummy-0.002
 festivals-0.002
Token her
Token ID607
Feature activation+7.95e-02

Pos logit contributions
igsaw+0.030
 batch+0.029
ummy+0.029
 Of+0.029
 festivals+0.028
Neg logit contributions
 managed-0.030
pped-0.028
'd-0.028
ched-0.027
 stayed-0.027
Token mom
Token ID1995
Feature activation+4.59e-02

Pos logit contributions
 managed+0.151
 wouldn+0.145
 stayed+0.143
'd+0.141
 could+0.139
Neg logit contributions
igsaw-0.130
 Of-0.123
 festivals-0.122
 batch-0.119
 Gig-0.116
Token and
Token ID290
Feature activation-8.23e-02

Pos logit contributions
ummy+0.129
 Grand+0.124
 grand+0.123
 Aunt+0.121
 festivals+0.116
Neg logit contributions
 managed-0.195
'd-0.184
pped-0.182
ched-0.181
cher-0.175
Token sad
Token ID6507
Feature activation-5.15e-02

Pos logit contributions
 grand+0.131
 Grand+0.131
 Aunt+0.123
 early+0.117
 Gig+0.115
Neg logit contributions
pped-0.165
ier-0.150
ched-0.147
cher-0.146
'd-0.145
Token.
Token ID13
Feature activation-3.71e-02

Pos logit contributions
 Grand+0.088
ummy+0.088
 grand+0.087
 Aunt+0.087
 festivals+0.086
Neg logit contributions
 managed-0.100
'd-0.092
pped-0.091
ched-0.091
cher-0.084
Token She
Token ID1375
Feature activation+1.91e-01

Pos logit contributions
 batch+0.063
igsaw+0.062
 festivals+0.061
 grand+0.059
ummy+0.059
Neg logit contributions
 managed-0.072
 could-0.067
ched-0.066
'd-0.066
pped-0.062
Token wished
Token ID16555
Feature activation+7.47e-02

Pos logit contributions
ier+0.326
cher+0.311
pped+0.295
iers+0.295
tted+0.278
Neg logit contributions
 grand-0.399
 Grand-0.383
 Aunt-0.355
 T-0.320
 Uncle-0.319
Token she
Token ID673
Feature activation+2.56e-01

Pos logit contributions
 managed+0.142
'd+0.132
ched+0.129
pped+0.129
 stayed+0.120
Neg logit contributions
igsaw-0.127
 festivals-0.127
ummy-0.127
 batch-0.126
 grand-0.125
Token had
Token ID550
Feature activation+2.01e-01

Pos logit contributions
ier+0.418
iece+0.409
amed+0.399
iers+0.391
ps+0.378
Neg logit contributions
 grand-0.474
 Grand-0.458
 Aunt-0.430
 Wednesday-0.408
 Tuesday-0.385
Token tid
Token ID29770
Feature activation-8.33e-02

Pos logit contributions
ched+0.286
ier+0.277
tted+0.275
pped+0.270
cher+0.260
Neg logit contributions
 grand-0.435
 Grand-0.432
 Aunt-0.412
 Sunday-0.408
 Wednesday-0.408
Tokenied
Token ID798
Feature activation+1.13e-02

Pos logit contributions
ummy+0.129
 Grand+0.121
igsaw+0.110
 festivals+0.110
 grand+0.110
Neg logit contributions
 managed-0.182
'd-0.178
 actually-0.154
ched-0.154
 stayed-0.153
Token up
Token ID510
Feature activation-2.72e-02

Pos logit contributions
 managed+0.024
ched+0.022
pped+0.022
'd+0.022
 wouldn+0.021
Neg logit contributions
ummy-0.017
igsaw-0.017
 festivals-0.016
 Of-0.016
 batch-0.016
Token her
Token ID607
Feature activation+5.72e-02

Pos logit contributions
ummy+0.042
igsaw+0.042
 batch+0.042
 festivals+0.041
 Of+0.041
Neg logit contributions
 managed-0.056
ched-0.051
'd-0.051
pped-0.051
 stayed-0.048
Token was
Token ID373
Feature activation+1.61e-01

Pos logit contributions
 managed+0.238
ched+0.233
pped+0.228
tted+0.223
cher+0.220
Neg logit contributions
 Grand-0.302
 grand-0.301
 batch-0.294
 Aunt-0.293
ummy-0.280
Token very
Token ID845
Feature activation-1.29e-02

Pos logit contributions
 managed+0.287
'd+0.279
ched+0.277
pped+0.263
cher+0.250
Neg logit contributions
 Grand-0.300
 Aunt-0.293
 grand-0.293
 batch-0.292
 festivals-0.288
Token sad
Token ID6507
Feature activation-3.58e-02

Pos logit contributions
ummy+0.022
 Grand+0.022
 festivals+0.022
igsaw+0.022
 Aunt+0.022
Neg logit contributions
 managed-0.024
'd-0.022
ched-0.022
pped-0.022
 stayed-0.020
Token and
Token ID290
Feature activation-5.90e-02

Pos logit contributions
 festivals+0.061
 Grand+0.061
ummy+0.061
igsaw+0.061
 grand+0.060
Neg logit contributions
 managed-0.068
'd-0.063
ched-0.062
pped-0.062
 wouldn-0.057
Token wished
Token ID16555
Feature activation+7.71e-02

Pos logit contributions
 Grand+0.117
 grand+0.117
 Aunt+0.115
 batch+0.111
 festivals+0.107
Neg logit contributions
 managed-0.097
pped-0.096
ched-0.094
'd-0.091
tted-0.088
Token she
Token ID673
Feature activation+2.56e-01

Pos logit contributions
 managed+0.163
pped+0.151
'd+0.151
ched+0.151
cher+0.140
Neg logit contributions
 Grand-0.120
 Aunt-0.118
 grand-0.117
ummy-0.116
 batch-0.114
Token had
Token ID550
Feature activation+2.02e-01

Pos logit contributions
pped+0.343
ps+0.334
ier+0.330
pered+0.327
amed+0.323
Neg logit contributions
 grand-0.584
 Grand-0.581
 Aunt-0.542
 Wednesday-0.488
 grandma-0.486
Token listened
Token ID16399
Feature activation+1.28e-03

Pos logit contributions
ched+0.292
tted+0.282
pped+0.275
cher+0.267
 managed+0.263
Neg logit contributions
 grand-0.444
 Grand-0.440
 Aunt-0.415
 Sunday-0.398
 Wednesday-0.396
Token to
Token ID284
Feature activation-1.60e-02

Pos logit contributions
 managed+0.003
pped+0.003
'd+0.003
ched+0.002
cher+0.002
Neg logit contributions
 Grand-0.002
 grand-0.002
 Aunt-0.002
ummy-0.002
 festivals-0.002
Token the
Token ID262
Feature activation-8.02e-02

Pos logit contributions
ummy+0.027
igsaw+0.027
 batch+0.027
 festivals+0.027
 Of+0.026
Neg logit contributions
 managed-0.030
'd-0.028
pped-0.027
ched-0.027
 stayed-0.026
Token rules
Token ID3173
Feature activation-4.87e-03

Pos logit contributions
igsaw+0.114
ummy+0.114
 festivals+0.113
 Of+0.113
 batch+0.112
Neg logit contributions
 managed-0.170
'd-0.164
ched-0.156
pped-0.156
 stayed-0.150
Tokenons
Token ID684
Feature activation+6.61e-02

Pos logit contributions
ummy+0.247
 Grand+0.246
 grand+0.243
igsaw+0.243
 Aunt+0.243
Neg logit contributions
 managed-0.280
'd-0.258
ched-0.253
pped-0.253
 stayed-0.233
Token,
Token ID11
Feature activation-1.23e-01

Pos logit contributions
 managed+0.143
pped+0.135
'd+0.134
ched+0.133
cher+0.124
Neg logit contributions
ummy-0.099
 Grand-0.097
 grand-0.097
 batch-0.092
 Aunt-0.092
Token but
Token ID475
Feature activation+4.92e-02

Pos logit contributions
igsaw+0.216
ummy+0.211
 batch+0.204
 Of+0.201
 festivals+0.201
Neg logit contributions
 managed-0.232
'd-0.219
pped-0.211
ched-0.206
 stayed-0.202
Token she
Token ID673
Feature activation+2.26e-01

Pos logit contributions
 managed+0.096
'd+0.089
ched+0.088
pped+0.088
 stayed+0.081
Neg logit contributions
ummy-0.081
 Grand-0.081
 festivals-0.081
igsaw-0.081
 grand-0.080
Token wished
Token ID16555
Feature activation+7.38e-02

Pos logit contributions
cher+0.325
pped+0.319
tted+0.317
ched+0.309
ier+0.287
Neg logit contributions
 grand-0.494
 Grand-0.487
 Aunt-0.458
 batch-0.456
 Wednesday-0.427
Token she
Token ID673
Feature activation+2.56e-01

Pos logit contributions
 managed+0.138
'd+0.128
ched+0.125
pped+0.124
 wouldn+0.117
Neg logit contributions
igsaw-0.128
 festivals-0.127
 batch-0.126
ummy-0.126
 grand-0.124
Token had
Token ID550
Feature activation+1.95e-01

Pos logit contributions
ched+0.300
pped+0.298
ier+0.271
amed+0.266
tted+0.264
Neg logit contributions
 grand-0.634
 Grand-0.631
 Aunt-0.589
 Wednesday-0.555
 Sunday-0.541
Token someone
Token ID2130
Feature activation+1.14e-01

Pos logit contributions
ched+0.391
 managed+0.376
tted+0.373
pped+0.367
cher+0.364
Neg logit contributions
 Grand-0.324
 grand-0.322
 Aunt-0.296
 festivals-0.295
 Sunday-0.286
Token to
Token ID284
Feature activation-3.30e-02

Pos logit contributions
pped+0.268
 managed+0.263
ched+0.258
tted+0.248
'd+0.246
Neg logit contributions
 grand-0.149
 Grand-0.145
 Aunt-0.138
 festivals-0.133
 Of-0.127
Token share
Token ID2648
Feature activation+2.22e-02

Pos logit contributions
ummy+0.054
igsaw+0.054
 festivals+0.054
 batch+0.053
 Aunt+0.053
Neg logit contributions
 managed-0.066
'd-0.061
pped-0.060
ched-0.059
 stayed-0.056
Token them
Token ID606
Feature activation+1.51e-01

Pos logit contributions
 managed+0.038
pped+0.035
ched+0.035
'd+0.035
 could+0.032
Neg logit contributions
igsaw-0.041
ummy-0.041
 Of-0.040
 festivals-0.040
 batch-0.040
Token felt
Token ID2936
Feature activation+1.31e-01

Pos logit contributions
ier+0.247
pped+0.242
cher+0.238
tted+0.232
ched+0.230
Neg logit contributions
 grand-0.292
 Grand-0.278
 Aunt-0.262
 batch-0.251
 Uncle-0.241
Token so
Token ID523
Feature activation-6.27e-02

Pos logit contributions
'd+0.322
cher+0.319
 managed+0.314
pped+0.288
ried+0.276
Neg logit contributions
 Grand-0.150
 grand-0.147
 Aunt-0.144
 Gig-0.141
 Mr-0.138
Token sad
Token ID6507
Feature activation-3.39e-02

Pos logit contributions
ummy+0.084
 grand+0.084
 Grand+0.081
 Sunday+0.079
 sunset+0.078
Neg logit contributions
cher-0.140
 managed-0.138
'd-0.136
pped-0.133
ier-0.120
Token and
Token ID290
Feature activation-5.65e-02

Pos logit contributions
ummy+0.053
 Grand+0.052
 grand+0.052
 Aunt+0.052
 festivals+0.051
Neg logit contributions
 managed-0.072
'd-0.067
pped-0.066
ched-0.065
cher-0.061
Token wished
Token ID16555
Feature activation+7.50e-02

Pos logit contributions
 Grand+0.106
 grand+0.105
 Aunt+0.102
 Uncle+0.095
 Gig+0.095
Neg logit contributions
pped-0.099
tted-0.098
 managed-0.096
'd-0.096
cher-0.094
Token she
Token ID673
Feature activation+2.56e-01

Pos logit contributions
 managed+0.151
'd+0.140
ched+0.138
pped+0.138
cher+0.127
Neg logit contributions
 Grand-0.121
ummy-0.121
 festivals-0.120
 grand-0.120
 Aunt-0.120
Token had
Token ID550
Feature activation+1.98e-01

Pos logit contributions
ier+0.409
iers+0.402
ps+0.399
iece+0.398
amed+0.390
Neg logit contributions
 grand-0.499
 Grand-0.479
 Aunt-0.441
 grandma-0.401
 Wednesday-0.394
Token listened
Token ID16399
Feature activation+2.95e-04

Pos logit contributions
ched+0.266
tted+0.259
ier+0.254
pped+0.243
cher+0.240
Neg logit contributions
 grand-0.444
 Grand-0.439
 Aunt-0.416
 Sunday-0.400
 Wednesday-0.400
Token to
Token ID284
Feature activation-1.64e-02

Pos logit contributions
 managed+0.001
pped+0.001
'd+0.001
ched+0.001
cher+0.001
Neg logit contributions
 grand-0.000
 Grand-0.000
 Aunt-0.000
ummy-0.000
 Of-0.000
Token her
Token ID607
Feature activation+8.07e-02

Pos logit contributions
igsaw+0.028
 batch+0.028
ummy+0.028
 festivals+0.027
 Of+0.027
Neg logit contributions
 managed-0.031
'd-0.028
pped-0.028
ched-0.027
 stayed-0.027
Token mom
Token ID1995
Feature activation+4.41e-02

Pos logit contributions
 managed+0.154
'd+0.147
 wouldn+0.145
 stayed+0.141
 could+0.139
Neg logit contributions
igsaw-0.124
 festivals-0.123
 Of-0.119
 Gig-0.117
 batch-0.116
Token and
Token ID290
Feature activation-8.09e-02

Pos logit contributions
ummy+0.060
 Grand+0.059
 Aunt+0.059
 grand+0.058
 batch+0.055
Neg logit contributions
 managed-0.081
'd-0.078
ched-0.074
pped-0.073
cher-0.072
Token sad
Token ID6507
Feature activation-4.63e-02

Pos logit contributions
 grand+0.139
 Grand+0.139
 Aunt+0.136
 Uncle+0.127
 Gig+0.126
Neg logit contributions
pped-0.156
tted-0.153
 managed-0.150
cher-0.149
'd-0.146
Token.
Token ID13
Feature activation-4.33e-02

Pos logit contributions
 festivals+0.087
igsaw+0.086
ummy+0.085
 batch+0.085
 Grand+0.084
Neg logit contributions
 managed-0.081
ched-0.074
'd-0.074
pped-0.073
 wouldn-0.068
Token She
Token ID1375
Feature activation+1.90e-01

Pos logit contributions
 batch+0.082
 festivals+0.081
igsaw+0.080
ummy+0.077
 grand+0.076
Neg logit contributions
 managed-0.071
'd-0.065
 could-0.064
pped-0.063
ched-0.063
Token wished
Token ID16555
Feature activation+7.76e-02

Pos logit contributions
ier+0.361
cher+0.357
iers+0.335
tted+0.324
bridge+0.306
Neg logit contributions
 grand-0.385
 Grand-0.364
 Aunt-0.345
 Uncle-0.312
 Gig-0.302
Token she
Token ID673
Feature activation+2.56e-01

Pos logit contributions
 managed+0.140
'd+0.129
pped+0.126
ched+0.126
 wouldn+0.119
Neg logit contributions
igsaw-0.141
 festivals-0.140
 batch-0.139
ummy-0.138
 Of-0.135
Token had
Token ID550
Feature activation+2.02e-01

Pos logit contributions
ier+0.430
iers+0.413
iece+0.407
amed+0.397
bridge+0.396
Neg logit contributions
 grand-0.500
 Grand-0.479
 Aunt-0.463
 grandma-0.432
 Wednesday-0.416
Token listened
Token ID16399
Feature activation+1.34e-03

Pos logit contributions
ched+0.291
pped+0.283
'd+0.280
 managed+0.280
tted+0.272
Neg logit contributions
 grand-0.431
 Grand-0.427
 Aunt-0.415
 festivals-0.398
 Sunday-0.397
Token to
Token ID284
Feature activation-1.54e-02

Pos logit contributions
 managed+0.003
pped+0.003
'd+0.003
ched+0.003
cher+0.003
Neg logit contributions
 grand-0.002
 Grand-0.002
 Aunt-0.002
ummy-0.002
 Of-0.002
Token her
Token ID607
Feature activation+8.05e-02

Pos logit contributions
igsaw+0.029
 batch+0.029
ummy+0.028
 Of+0.028
 festivals+0.027
Neg logit contributions
 managed-0.026
pped-0.024
'd-0.024
 stayed-0.023
ched-0.023
Token mom
Token ID1995
Feature activation+4.38e-02

Pos logit contributions
 managed+0.138
 wouldn+0.135
'd+0.132
 could+0.130
 back+0.129
Neg logit contributions
igsaw-0.141
 festivals-0.138
 batch-0.133
 Of-0.133
 Gig-0.127
Token hard
Token ID1327
Feature activation-9.00e-02

Pos logit contributions
ummy+0.067
 festivals+0.066
 Of+0.066
 July+0.065
 Grand+0.065
Neg logit contributions
'd-0.079
 managed-0.073
 wouldn-0.071
 could-0.070
ched-0.065
Token lesson
Token ID11483
Feature activation-3.46e-02

Pos logit contributions
 festivals+0.124
 Grand+0.120
ummy+0.120
igsaw+0.118
 Aunt+0.117
Neg logit contributions
 managed-0.197
'd-0.191
ched-0.190
 wouldn-0.182
 stayed-0.172
Token.
Token ID13
Feature activation-3.59e-02

Pos logit contributions
 Grand+0.053
 grand+0.051
 Aunt+0.051
ummy+0.050
 Of+0.048
Neg logit contributions
 managed-0.077
'd-0.071
pped-0.070
ched-0.070
cher-0.066
Token She
Token ID1375
Feature activation+1.92e-01

Pos logit contributions
 batch+0.072
igsaw+0.067
 festivals+0.065
ummy+0.064
 fireworks+0.064
Neg logit contributions
 managed-0.057
 could-0.055
ched-0.052
pped-0.052
'd-0.052
Token wished
Token ID16555
Feature activation+7.39e-02

Pos logit contributions
cher+0.298
pped+0.279
ier+0.269
iers+0.257
tted+0.251
Neg logit contributions
 grand-0.420
 Grand-0.401
 Aunt-0.375
 batch-0.350
 Uncle-0.343
Token she
Token ID673
Feature activation+2.56e-01

Pos logit contributions
 managed+0.135
'd+0.125
pped+0.122
ched+0.122
 stayed+0.115
Neg logit contributions
igsaw-0.131
 festivals-0.130
 batch-0.130
ummy-0.128
 grand-0.127
Token had
Token ID550
Feature activation+2.01e-01

Pos logit contributions
amed+0.383
pered+0.380
ched+0.356
iece+0.355
iers+0.352
Neg logit contributions
 grand-0.519
 Grand-0.499
 Wednesday-0.456
 Aunt-0.453
 Sunday-0.440
Token listened
Token ID16399
Feature activation+3.38e-04

Pos logit contributions
ched+0.299
pped+0.285
tted+0.283
cher+0.278
 managed+0.273
Neg logit contributions
 Grand-0.422
 grand-0.421
 Sunday-0.400
 Aunt-0.397
 Wednesday-0.393
Token to
Token ID284
Feature activation-1.76e-02

Pos logit contributions
 managed+0.001
pped+0.001
'd+0.001
ched+0.001
cher+0.001
Neg logit contributions
 grand-0.000
 Grand-0.000
 Aunt-0.000
ummy-0.000
 Sunday-0.000
Token her
Token ID607
Feature activation+7.80e-02

Pos logit contributions
igsaw+0.033
 batch+0.032
ummy+0.031
 Of+0.031
 festivals+0.030
Neg logit contributions
 managed-0.030
pped-0.028
'd-0.028
 stayed-0.027
ched-0.027
Token mom
Token ID1995
Feature activation+4.49e-02

Pos logit contributions
 managed+0.141
 wouldn+0.136
'd+0.136
 stayed+0.134
 could+0.131
Neg logit contributions
igsaw-0.134
 Of-0.125
 batch-0.125
ummy-0.120
 Gig-0.118
Token princess
Token ID21752
Feature activation+6.57e-02

Pos logit contributions
 grand+0.151
 Grand+0.147
 festivals+0.144
 Aunt+0.143
 Gig+0.134
Neg logit contributions
pped-0.199
 managed-0.198
ched-0.193
'd-0.184
ier-0.172
Tokenes
Token ID274
Feature activation+6.10e-02

Pos logit contributions
 managed+0.110
pped+0.108
'd+0.102
ched+0.101
 wouldn+0.095
Neg logit contributions
 grand-0.123
 festivals-0.122
 Grand-0.118
 Aunt-0.117
 history-0.112
Token.
Token ID13
Feature activation-3.51e-02

Pos logit contributions
 managed+0.126
pped+0.119
'd+0.117
ched+0.115
 wouldn+0.106
Neg logit contributions
 grand-0.093
 Grand-0.092
 festivals-0.092
 Aunt-0.091
ummy-0.088
Token She
Token ID1375
Feature activation+1.94e-01

Pos logit contributions
 batch+0.057
igsaw+0.055
ummy+0.053
 orig+0.051
 perspective+0.051
Neg logit contributions
 managed-0.065
 could-0.064
'd-0.061
ched-0.059
pped-0.056
Token wished
Token ID16555
Feature activation+7.88e-02

Pos logit contributions
pped+0.365
 managed+0.342
ched+0.328
tted+0.315
ier+0.313
Neg logit contributions
 grand-0.345
 Grand-0.336
 festivals-0.324
 Aunt-0.315
 history-0.306
Token she
Token ID673
Feature activation+2.56e-01

Pos logit contributions
 managed+0.159
'd+0.148
ched+0.145
pped+0.145
 stayed+0.134
Neg logit contributions
ummy-0.125
igsaw-0.125
 festivals-0.125
 Grand-0.124
 batch-0.124
Token could
Token ID714
Feature activation+1.58e-01

Pos logit contributions
pped+0.392
ched+0.379
ier+0.342
pping+0.339
amed+0.337
Neg logit contributions
 grand-0.543
 Grand-0.504
 Aunt-0.480
 Wednesday-0.461
 festivals-0.456
Token go
Token ID467
Feature activation+6.81e-02

Pos logit contributions
 managed+0.308
pped+0.301
ched+0.291
'd+0.272
 wouldn+0.266
Neg logit contributions
 grand-0.263
 festivals-0.258
 Grand-0.257
 Aunt-0.256
 sunset-0.236
Token to
Token ID284
Feature activation-3.59e-02

Pos logit contributions
 managed+0.117
pped+0.107
ched+0.106
'd+0.104
 wouldn+0.101
Neg logit contributions
 festivals-0.136
 grand-0.131
 Aunt-0.130
 Grand-0.129
ummy-0.129
Token that
Token ID326
Feature activation-4.29e-02

Pos logit contributions
ummy+0.054
igsaw+0.052
 batch+0.052
 Of+0.051
 Grand+0.049
Neg logit contributions
 managed-0.077
'd-0.071
pped-0.069
ched-0.069
 stayed-0.066
Token world
Token ID995
Feature activation+4.31e-02

Pos logit contributions
ummy+0.059
igsaw+0.059
 batch+0.058
 Grand+0.058
 Of+0.058
Neg logit contributions
 managed-0.096
'd-0.090
ched-0.087
pped-0.087
 stayed-0.082
Token very
Token ID845
Feature activation-2.71e-02

Pos logit contributions
'd+0.305
 managed+0.298
cher+0.298
pped+0.282
ier+0.263
Neg logit contributions
 Grand-0.177
 grand-0.175
 Aunt-0.166
 summer-0.157
 Gig-0.157
Token sorry
Token ID7926
Feature activation+9.97e-03

Pos logit contributions
 grand+0.039
 Grand+0.038
ummy+0.037
 Aunt+0.036
 festivals+0.035
Neg logit contributions
 managed-0.058
pped-0.058
'd-0.057
cher-0.056
ched-0.053
Token,
Token ID11
Feature activation-1.04e-01

Pos logit contributions
 managed+0.022
pped+0.021
'd+0.020
ched+0.020
cher+0.019
Neg logit contributions
 Grand-0.016
ummy-0.015
 grand-0.015
 Aunt-0.015
 Of-0.014
Token and
Token ID290
Feature activation-4.21e-02

Pos logit contributions
 Grand+0.148
 Aunt+0.147
 grand+0.146
 festivals+0.141
 batch+0.140
Neg logit contributions
 managed-0.230
pped-0.218
ched-0.218
'd-0.214
cher-0.202
Token wished
Token ID16555
Feature activation+7.56e-02

Pos logit contributions
 Grand+0.064
 Aunt+0.061
 grand+0.061
 Gig+0.059
 Uncle+0.055
Neg logit contributions
pped-0.095
tted-0.088
ched-0.088
'd-0.087
 managed-0.087
Token she
Token ID673
Feature activation+2.56e-01

Pos logit contributions
 managed+0.153
pped+0.142
'd+0.142
ched+0.142
cher+0.130
Neg logit contributions
 Grand-0.125
 Aunt-0.122
 grand-0.121
ummy-0.121
 festivals-0.119
Token had
Token ID550
Feature activation+1.99e-01

Pos logit contributions
iers+0.467
iece+0.463
pered+0.450
ps+0.445
ier+0.437
Neg logit contributions
 grand-0.442
 Grand-0.401
 Aunt-0.376
 grandma-0.352
 Mom-0.335
Token not
Token ID407
Feature activation+1.16e-01

Pos logit contributions
tted+0.283
ched+0.277
ier+0.269
pped+0.260
cher+0.260
Neg logit contributions
 grand-0.434
 Grand-0.423
 Aunt-0.395
 Wednesday-0.389
 Sunday-0.386
Token waved
Token ID26834
Feature activation-5.02e-03

Pos logit contributions
pped+0.142
ched+0.141
 managed+0.138
'd+0.137
cher+0.133
Neg logit contributions
 grand-0.258
 Grand-0.255
 Aunt-0.244
 festivals-0.244
 history-0.239
Token the
Token ID262
Feature activation-1.09e-01

Pos logit contributions
 Grand+0.008
 grand+0.008
 Aunt+0.008
 Of+0.008
 festivals+0.007
Neg logit contributions
 managed-0.010
'd-0.010
pped-0.009
ched-0.009
cher-0.009
Token purse
Token ID27517
Feature activation+3.05e-02

Pos logit contributions
 grand+0.138
 Grand+0.136
 Aunt+0.136
 batch+0.134
igsaw+0.132
Neg logit contributions
 managed-0.260
ched-0.245
pped-0.242
'd-0.240
cher-0.225
Token,
Token ID11
Feature activation-1.04e-01

Pos logit contributions
 Grand+0.067
ummy+0.065
 Aunt+0.064
 grand+0.064
 festivals+0.061
Neg logit contributions
 managed-0.149
pped-0.143
'd-0.142
ched-0.142
cher-0.137
Token the
Token ID262
Feature activation-6.04e-02

Pos logit contributions
ummy+0.142
 batch+0.137
igsaw+0.136
 festivals+0.132
 grand+0.130
Neg logit contributions
 managed-0.239
'd-0.221
ched-0.213
pped-0.209
 could-0.207
Token baby
Token ID5156
Feature activation+2.96e-03

Pos logit contributions
ummy+0.082
 Grand+0.082
 festivals+0.081
igsaw+0.081
 Aunt+0.081
Neg logit contributions
 managed-0.136
'd-0.127
ched-0.126
pped-0.125
 stayed-0.117
Token bear
Token ID6842
Feature activation+9.36e-02

Pos logit contributions
 managed+0.004
pped+0.004
ched+0.004
tted+0.004
cher+0.004
Neg logit contributions
 Grand-0.006
 grand-0.006
 Aunt-0.006
ummy-0.006
 batch-0.006
Token knew
Token ID2993
Feature activation+9.34e-02

Pos logit contributions
pped+0.146
cher+0.139
 managed+0.138
ched+0.133
ier+0.126
Neg logit contributions
 grand-0.177
 Grand-0.171
 Aunt-0.168
 festivals-0.165
 batch-0.162
Token she
Token ID673
Feature activation+2.55e-01

Pos logit contributions
 managed+0.224
'd+0.211
pped+0.210
ched+0.209
tted+0.199
Neg logit contributions
 Grand-0.117
 grand-0.116
 Aunt-0.115
ummy-0.110
 Of-0.108
Token could
Token ID714
Feature activation+1.58e-01

Pos logit contributions
pped+0.475
iers+0.453
pered+0.433
bridge+0.421
ps+0.421
Neg logit contributions
 grand-0.430
 Grand-0.385
 Aunt-0.364
 sorts-0.360
 T-0.340
Token always
Token ID1464
Feature activation+8.86e-02

Pos logit contributions
 managed+0.285
pped+0.276
ched+0.269
cher+0.262
'd+0.253
Neg logit contributions
 grand-0.275
 Aunt-0.270
 Grand-0.270
 festivals-0.268
ummy-0.268
Token rely
Token ID8814
Feature activation+8.77e-02

Pos logit contributions
pped+0.193
ched+0.186
 managed+0.179
ed+0.176
ier+0.172
Neg logit contributions
 grand-0.129
 festivals-0.118
 Grand-0.116
 Sunday-0.112
 Aunt-0.111
Token on
Token ID319
Feature activation-5.86e-02

Pos logit contributions
 managed+0.159
'd+0.149
ched+0.133
ave+0.132
 succeeded+0.131
Neg logit contributions
 grand-0.153
 Grand-0.152
 Sunday-0.144
 Wednesday-0.143
 festivals-0.142
Token her
Token ID607
Feature activation+7.29e-02

Pos logit contributions
 Grand+0.089
 grand+0.088
 Aunt+0.088
 festivals+0.087
ummy+0.087
Neg logit contributions
 managed-0.124
'd-0.116
pped-0.114
ched-0.114
cher-0.106
Token home
Token ID1363
Feature activation-5.04e-03

Pos logit contributions
 grand+0.024
 Grand+0.024
ummy+0.024
 batch+0.023
 festivals+0.023
Neg logit contributions
 managed-0.028
pped-0.026
'd-0.026
ched-0.026
 stayed-0.023
Token.
Token ID13
Feature activation-2.96e-02

Pos logit contributions
 Grand+0.008
 grand+0.008
ummy+0.008
 batch+0.008
 Aunt+0.008
Neg logit contributions
 managed-0.010
pped-0.010
'd-0.010
ched-0.010
cher-0.009
Token Her
Token ID2332
Feature activation+7.76e-02

Pos logit contributions
 batch+0.043
igsaw+0.043
ummy+0.042
 festivals+0.042
 grand+0.040
Neg logit contributions
 managed-0.061
'd-0.058
 could-0.057
ched-0.056
pped-0.054
Token mom
Token ID1995
Feature activation+9.35e-02

Pos logit contributions
 managed+0.191
'd+0.180
ched+0.177
pped+0.177
 stayed+0.168
Neg logit contributions
igsaw-0.088
 festivals-0.088
ummy-0.088
 Grand-0.087
 batch-0.087
Token said
Token ID531
Feature activation+1.39e-01

Pos logit contributions
 managed+0.148
ched+0.144
pped+0.143
tted+0.135
'd+0.132
Neg logit contributions
 Grand-0.186
 grand-0.184
 batch-0.183
 Aunt-0.181
ummy-0.174
Token she
Token ID673
Feature activation+2.55e-01

Pos logit contributions
 managed+0.311
pped+0.295
ched+0.291
'd+0.289
tted+0.274
Neg logit contributions
 Grand-0.198
 grand-0.193
 Aunt-0.189
ummy-0.188
 Of-0.185
Token would
Token ID561
Feature activation+2.05e-01

Pos logit contributions
pped+0.429
pping+0.367
ched+0.353
ier+0.343
iece+0.336
Neg logit contributions
 grand-0.570
 Grand-0.557
 Aunt-0.504
 batch-0.463
 T-0.459
Token accept
Token ID2453
Feature activation+4.22e-02

Pos logit contributions
 managed+0.375
ched+0.351
pped+0.348
'd+0.336
cher+0.321
Neg logit contributions
 Grand-0.372
 grand-0.370
 Aunt-0.363
 festivals-0.356
ummy-0.355
Token if
Token ID611
Feature activation+8.33e-03

Pos logit contributions
 managed+0.090
'd+0.084
ched+0.083
pped+0.083
 wouldn+0.077
Neg logit contributions
ummy-0.062
igsaw-0.062
 festivals-0.062
 Grand-0.061
 batch-0.061
Token it
Token ID340
Feature activation+1.92e-01

Pos logit contributions
 managed+0.017
ched+0.016
'd+0.016
pped+0.016
cher+0.015
Neg logit contributions
 Grand-0.013
 grand-0.013
 Aunt-0.013
 festivals-0.013
ummy-0.013
Token was
Token ID373
Feature activation+1.68e-01

Pos logit contributions
pped+0.276
 managed+0.249
cher+0.225
ched+0.222
amed+0.216
Neg logit contributions
 grand-0.418
 Grand-0.417
 Aunt-0.405
 batch-0.401
 Of-0.394
Token and
Token ID290
Feature activation-7.42e-02

Pos logit contributions
 managed+0.038
pped+0.037
ched+0.036
'd+0.036
cher+0.034
Neg logit contributions
 Grand-0.033
 grand-0.033
 Aunt-0.033
 batch-0.032
ummy-0.032
Token friends
Token ID2460
Feature activation+5.61e-02

Pos logit contributions
 grand+0.165
 Grand+0.163
 Aunt+0.161
 Uncle+0.150
 festivals+0.150
Neg logit contributions
pped-0.110
tted-0.108
ched-0.107
 managed-0.103
cher-0.100
Token.
Token ID13
Feature activation-3.87e-02

Pos logit contributions
pped+0.122
 managed+0.121
ched+0.117
'd+0.113
cher+0.111
Neg logit contributions
 grand-0.074
 Grand-0.072
 Aunt-0.069
 batch-0.066
 festivals-0.065
Token She
Token ID1375
Feature activation+1.93e-01

Pos logit contributions
igsaw+0.064
 festivals+0.064
 batch+0.064
ummy+0.061
 grand+0.061
Neg logit contributions
 managed-0.071
'd-0.067
 could-0.065
ched-0.064
 wouldn-0.062
Token wished
Token ID16555
Feature activation+7.66e-02

Pos logit contributions
cher+0.316
ier+0.316
tted+0.298
iers+0.297
pped+0.287
Neg logit contributions
 grand-0.418
 Grand-0.394
 Aunt-0.377
 Uncle-0.340
 sunset-0.340
Token she
Token ID673
Feature activation+2.55e-01

Pos logit contributions
 managed+0.149
'd+0.138
ched+0.136
pped+0.136
cher+0.125
Neg logit contributions
 Grand-0.128
ummy-0.128
 festivals-0.127
 grand-0.127
 Aunt-0.126
Token had
Token ID550
Feature activation+2.00e-01

Pos logit contributions
ier+0.365
ps+0.361
iers+0.361
pered+0.356
ched+0.348
Neg logit contributions
 grand-0.535
 Grand-0.518
 Aunt-0.492
 Wednesday-0.475
 Sunday-0.455
Token listened
Token ID16399
Feature activation+8.97e-04

Pos logit contributions
ched+0.279
tted+0.277
pped+0.261
cher+0.258
ier+0.254
Neg logit contributions
 grand-0.441
 Grand-0.437
 Aunt-0.417
 Sunday-0.409
 Wednesday-0.406
Token to
Token ID284
Feature activation-1.89e-02

Pos logit contributions
 managed+0.002
pped+0.002
'd+0.002
ched+0.002
cher+0.002
Neg logit contributions
 grand-0.001
 Grand-0.001
 Aunt-0.001
ummy-0.001
 Sunday-0.001
Token her
Token ID607
Feature activation+7.53e-02

Pos logit contributions
igsaw+0.032
ummy+0.032
 batch+0.032
 festivals+0.031
 Of+0.031
Neg logit contributions
 managed-0.036
'd-0.033
pped-0.033
ched-0.033
 stayed-0.031
Token mom
Token ID1995
Feature activation+4.23e-02

Pos logit contributions
 managed+0.147
'd+0.143
 wouldn+0.138
 stayed+0.135
 could+0.133
Neg logit contributions
igsaw-0.118
 Of-0.112
 Gig-0.112
 festivals-0.111
 batch-0.110
Token too
Token ID1165
Feature activation+7.94e-02

Pos logit contributions
 grand+0.131
 Grand+0.129
 Aunt+0.127
 Of+0.119
 Gig+0.119
Neg logit contributions
pped-0.164
 managed-0.156
ched-0.155
cher-0.150
'd-0.148
Token fast
Token ID3049
Feature activation-5.80e-02

Pos logit contributions
 managed+0.182
'd+0.170
ched+0.168
pped+0.166
 wouldn+0.156
Neg logit contributions
ummy-0.108
 festivals-0.107
igsaw-0.106
 Grand-0.106
 batch-0.106
Token.
Token ID13
Feature activation-3.78e-02

Pos logit contributions
 Grand+0.076
 grand+0.075
ummy+0.072
 Aunt+0.072
 Of+0.070
Neg logit contributions
 managed-0.134
'd-0.129
pped-0.125
cher-0.119
ched-0.118
Token She
Token ID1375
Feature activation+1.92e-01

Pos logit contributions
igsaw+0.062
 batch+0.062
 festivals+0.061
ummy+0.059
 grand+0.058
Neg logit contributions
 managed-0.074
'd-0.068
ched-0.068
 could-0.067
pped-0.066
Token wished
Token ID16555
Feature activation+7.26e-02

Pos logit contributions
ier+0.325
cher+0.313
pped+0.308
tted+0.291
ched+0.283
Neg logit contributions
 grand-0.392
 Grand-0.377
 Aunt-0.358
 Uncle-0.328
 batch-0.321
Token she
Token ID673
Feature activation+2.55e-01

Pos logit contributions
 managed+0.134
'd+0.124
ched+0.122
pped+0.122
 wouldn+0.112
Neg logit contributions
 festivals-0.127
igsaw-0.127
ummy-0.127
 batch-0.126
 Grand-0.126
Token was
Token ID373
Feature activation+1.75e-01

Pos logit contributions
ier+0.336
ps+0.321
amed+0.319
pped+0.313
iece+0.311
Neg logit contributions
 grand-0.555
 Grand-0.544
 Aunt-0.514
 Wednesday-0.483
 Uncle-0.469
Token brave
Token ID14802
Feature activation-1.24e-01

Pos logit contributions
ched+0.346
ave+0.341
tted+0.323
'd+0.321
pped+0.319
Neg logit contributions
 Sunday-0.242
 Wednesday-0.239
 festivals-0.236
 summer-0.236
 new-0.232
Token like
Token ID588
Feature activation-5.24e-02

Pos logit contributions
 grand+0.175
 Grand+0.164
ummy+0.154
 Aunt+0.149
 Uncle+0.146
Neg logit contributions
 managed-0.282
pped-0.278
ched-0.271
tted-0.270
'd-0.265
Token Ben
Token ID3932
Feature activation+8.01e-02

Pos logit contributions
 Grand+0.069
 grand+0.068
 Aunt+0.068
 festivals+0.067
ummy+0.066
Neg logit contributions
 managed-0.122
'd-0.116
ched-0.114
pped-0.113
cher-0.106
Token.
Token ID13
Feature activation-5.17e-02

Pos logit contributions
 managed+0.171
pped+0.168
ched+0.160
'd+0.156
tted+0.156
Neg logit contributions
 grand-0.116
 Grand-0.113
 Aunt-0.108
 festivals-0.107
 batch-0.107
Token was
Token ID373
Feature activation+1.53e-01

Pos logit contributions
ched+0.208
tted+0.205
aces+0.204
 managed+0.198
cher+0.198
Neg logit contributions
 Grand-0.292
 grand-0.291
 Aunt-0.282
 batch-0.267
 Uncle-0.263
Token very
Token ID845
Feature activation-1.94e-02

Pos logit contributions
'd+0.307
 managed+0.295
ched+0.292
 wouldn+0.270
pped+0.262
Neg logit contributions
 Grand-0.263
 Aunt-0.254
 grand-0.254
 batch-0.245
 festivals-0.241
Token sad
Token ID6507
Feature activation-4.11e-02

Pos logit contributions
 Grand+0.037
ummy+0.037
 grand+0.037
 festivals+0.037
 Aunt+0.037
Neg logit contributions
 managed-0.033
'd-0.030
ched-0.030
pped-0.030
cher-0.027
Token and
Token ID290
Feature activation-6.43e-02

Pos logit contributions
 Grand+0.070
ummy+0.070
 grand+0.069
 Aunt+0.069
 batch+0.068
Neg logit contributions
 managed-0.081
'd-0.075
ched-0.074
pped-0.074
cher-0.068
Token wished
Token ID16555
Feature activation+7.17e-02

Pos logit contributions
 Grand+0.121
 grand+0.118
 Aunt+0.118
 batch+0.110
 Gig+0.107
Neg logit contributions
 managed-0.112
ched-0.112
pped-0.112
'd-0.110
tted-0.108
Token she
Token ID673
Feature activation+2.55e-01

Pos logit contributions
 managed+0.165
pped+0.155
ched+0.155
'd+0.154
cher+0.146
Neg logit contributions
 Grand-0.103
 Aunt-0.099
 grand-0.097
ummy-0.096
 Of-0.094
Token had
Token ID550
Feature activation+2.02e-01

Pos logit contributions
pped+0.339
ched+0.333
ier+0.325
amed+0.324
pping+0.324
Neg logit contributions
 Grand-0.574
 grand-0.571
 Aunt-0.524
 grandma-0.486
 Wednesday-0.463
Token known
Token ID1900
Feature activation+5.79e-02

Pos logit contributions
ched+0.275
tted+0.257
pped+0.256
'd+0.240
ier+0.240
Neg logit contributions
 grand-0.464
 Grand-0.461
 Aunt-0.432
 festivals-0.412
 Sunday-0.412
Token how
Token ID703
Feature activation-2.96e-02

Pos logit contributions
 managed+0.109
'd+0.101
ched+0.100
pped+0.099
cher+0.091
Neg logit contributions
 Grand-0.100
ummy-0.100
 festivals-0.099
igsaw-0.099
 grand-0.099
Token to
Token ID284
Feature activation-1.73e-02

Pos logit contributions
 Grand+0.047
 grand+0.046
 Aunt+0.045
ummy+0.045
 festivals+0.045
Neg logit contributions
 managed-0.061
'd-0.058
pped-0.057
ched-0.057
 stayed-0.052
Token help
Token ID1037
Feature activation-4.57e-02

Pos logit contributions
 Grand+0.030
 grand+0.030
 Aunt+0.030
 festivals+0.029
ummy+0.029
Neg logit contributions
 managed-0.033
ched-0.031
'd-0.030
pped-0.030
 wouldn-0.028
Token it
Token ID340
Feature activation+1.96e-01

Pos logit contributions
 managed+0.231
'd+0.228
ched+0.218
pped+0.217
 wouldn+0.204
Neg logit contributions
 Grand-0.227
 Aunt-0.225
 grand-0.221
 festivals-0.219
 Gig-0.209
Token too
Token ID1165
Feature activation+1.12e-01

Pos logit contributions
 managed+0.343
pped+0.342
cher+0.306
ched+0.303
'd+0.298
Neg logit contributions
ummy-0.337
 grand-0.334
 Grand-0.333
 batch-0.331
 festivals-0.319
Token.
Token ID13
Feature activation-1.56e-02

Pos logit contributions
 managed+0.239
'd+0.223
ched+0.220
pped+0.219
 wouldn+0.203
Neg logit contributions
ummy-0.166
 Grand-0.166
 festivals-0.166
igsaw-0.165
 Aunt-0.165
Token She
Token ID1375
Feature activation+2.02e-01

Pos logit contributions
igsaw+0.023
 batch+0.023
ummy+0.023
 festivals+0.023
 grand+0.022
Neg logit contributions
 managed-0.032
'd-0.030
ched-0.029
 could-0.029
pped-0.028
Token thought
Token ID1807
Feature activation+1.17e-01

Pos logit contributions
 managed+0.297
pped+0.290
cher+0.276
ched+0.274
ier+0.270
Neg logit contributions
 Grand-0.422
 grand-0.418
 batch-0.404
 Aunt-0.397
 festivals-0.381
Token she
Token ID673
Feature activation+2.55e-01

Pos logit contributions
 managed+0.237
'd+0.221
ched+0.218
pped+0.217
cher+0.201
Neg logit contributions
 Grand-0.187
ummy-0.185
 grand-0.184
 Aunt-0.184
 festivals-0.183
Token could
Token ID714
Feature activation+1.56e-01

Pos logit contributions
pped+0.419
ier+0.389
ched+0.381
cher+0.375
tted+0.374
Neg logit contributions
 grand-0.528
 Grand-0.510
 Aunt-0.481
 M-0.446
 T-0.439
Token wash
Token ID13502
Feature activation-5.20e-03

Pos logit contributions
 managed+0.289
pped+0.278
ched+0.270
'd+0.259
ier+0.257
Neg logit contributions
 Grand-0.277
 grand-0.272
 Aunt-0.268
 festivals-0.257
 Of-0.253
Token his
Token ID465
Feature activation+3.14e-02

Pos logit contributions
 Grand+0.008
ummy+0.008
 grand+0.008
 Aunt+0.008
igsaw+0.008
Neg logit contributions
 managed-0.011
'd-0.010
ched-0.010
pped-0.010
cher-0.009
Token eye
Token ID4151
Feature activation+1.74e-02

Pos logit contributions
 managed+0.072
'd+0.067
pped+0.067
ched+0.067
cher+0.062
Neg logit contributions
 Grand-0.043
 grand-0.043
ummy-0.043
 batch-0.042
igsaw-0.042
Token with
Token ID351
Feature activation-7.71e-02

Pos logit contributions
 managed+0.034
'd+0.031
pped+0.031
ched+0.030
cher+0.028
Neg logit contributions
 Grand-0.029
ummy-0.029
 grand-0.029
 Aunt-0.028
 batch-0.028
Token and
Token ID290
Feature activation-5.17e-02

Pos logit contributions
 managed+0.022
'd+0.020
ched+0.020
pped+0.020
 stayed+0.018
Neg logit contributions
 Grand-0.019
ummy-0.019
 festivals-0.019
igsaw-0.019
 grand-0.019
Token sad
Token ID6507
Feature activation-3.74e-02

Pos logit contributions
 Grand+0.082
 grand+0.081
 Aunt+0.079
 Of+0.076
 batch+0.075
Neg logit contributions
pped-0.108
 managed-0.106
'd-0.103
ched-0.101
tted-0.096
Token.
Token ID13
Feature activation-3.22e-02

Pos logit contributions
igsaw+0.069
 festivals+0.069
ummy+0.068
 Grand+0.068
 batch+0.068
Neg logit contributions
 managed-0.067
'd-0.061
ched-0.061
pped-0.060
 wouldn-0.055
Token She
Token ID1375
Feature activation+1.89e-01

Pos logit contributions
 batch+0.075
igsaw+0.073
 sorts+0.067
 orig+0.067
 festivals+0.067
Neg logit contributions
 could-0.041
 managed-0.037
ched-0.033
'd-0.033
pped-0.032
Token says
Token ID1139
Feature activation+1.22e-01

Pos logit contributions
 managed+0.445
cher+0.444
pped+0.430
ier+0.419
tted+0.411
Neg logit contributions
 grand-0.242
 Grand-0.235
 Aunt-0.216
 batch-0.193
 Gig-0.192
Token she
Token ID673
Feature activation+2.55e-01

Pos logit contributions
 managed+0.237
'd+0.215
ched+0.215
pped+0.210
 could+0.207
Neg logit contributions
igsaw-0.210
 batch-0.209
 festivals-0.204
ummy-0.201
 Of-0.196
Token is
Token ID318
Feature activation+1.48e-01

Pos logit contributions
pped+0.529
cher+0.516
 managed+0.498
ier+0.487
bridge+0.478
Neg logit contributions
 grand-0.383
 Grand-0.361
 Aunt-0.328
 batch-0.313
 history-0.310
Token sorry
Token ID7926
Feature activation+2.38e-02

Pos logit contributions
'd+0.315
 managed+0.313
ched+0.297
pped+0.287
 wouldn+0.270
Neg logit contributions
 Grand-0.243
 Aunt-0.230
 grand-0.229
 Uncle-0.210
 Gig-0.207
Token to
Token ID284
Feature activation-1.84e-02

Pos logit contributions
 managed+0.051
pped+0.048
ched+0.047
'd+0.046
cher+0.044
Neg logit contributions
 Grand-0.039
ummy-0.038
 grand-0.038
 Aunt-0.038
 Of-0.036
Token her
Token ID607
Feature activation+7.07e-02

Pos logit contributions
igsaw+0.029
 batch+0.029
ummy+0.028
 festivals+0.026
 Of+0.026
Neg logit contributions
 managed-0.038
pped-0.035
 stayed-0.035
'd-0.035
 could-0.034
Token dad
Token ID9955
Feature activation+6.59e-02

Pos logit contributions
 managed+0.129
'd+0.119
ched+0.113
pped+0.112
 wouldn+0.112
Neg logit contributions
igsaw-0.125
ummy-0.122
 festivals-0.121
 batch-0.121
 Of-0.120
Token sorts
Token ID10524
Feature activation-1.17e-01

Pos logit contributions
ummy+0.037
igsaw+0.035
 grand+0.035
 Of+0.035
 batch+0.035
Neg logit contributions
 managed-0.057
ched-0.054
'd-0.052
 could-0.051
 stayed-0.051
Token of
Token ID286
Feature activation-1.39e-01

Pos logit contributions
ummy+0.182
igsaw+0.174
 festivals+0.172
 Grand+0.172
 Of+0.172
Neg logit contributions
 managed-0.249
'd-0.235
pped-0.226
ched-0.221
cher-0.213
Token adventures
Token ID17545
Feature activation-4.21e-02

Pos logit contributions
igsaw+0.173
ummy+0.172
 Grand+0.170
 batch+0.169
 Aunt+0.169
Neg logit contributions
 managed-0.331
'd-0.311
ched-0.309
pped-0.306
 stayed-0.289
Token in
Token ID287
Feature activation-1.45e-01

Pos logit contributions
 Grand+0.065
 grand+0.065
 festivals+0.064
 Aunt+0.063
ummy+0.063
Neg logit contributions
 managed-0.089
pped-0.085
'd-0.082
ched-0.082
cher-0.075
Token the
Token ID262
Feature activation-1.68e-01

Pos logit contributions
ummy+0.220
 Of+0.218
 batch+0.217
igsaw+0.215
 festivals+0.208
Neg logit contributions
 managed-0.305
pped-0.281
'd-0.278
ched-0.270
 stayed-0.261
Token vast
Token ID5909
Feature activation-2.84e-01

Pos logit contributions
 festivals+0.170
 Grand+0.170
 grand+0.170
ummy+0.169
 Aunt+0.166
Neg logit contributions
 managed-0.440
'd-0.414
ched-0.412
pped-0.411
cher-0.389
Token ocean
Token ID9151
Feature activation-5.04e-02

Pos logit contributions
 Of+0.388
 batch+0.374
igsaw+0.368
 Gig+0.365
 Aunt+0.365
Neg logit contributions
 managed-0.629
'd-0.601
pped-0.578
ched-0.571
 stayed-0.561
Token.
Token ID13
Feature activation-5.88e-02

Pos logit contributions
 grand+0.080
 Grand+0.079
 festivals+0.079
ummy+0.077
 batch+0.076
Neg logit contributions
 managed-0.104
pped-0.100
ched-0.097
'd-0.095
cher-0.088
Token She
Token ID1375
Feature activation+1.76e-01

Pos logit contributions
 festivals+0.090
 batch+0.090
ummy+0.089
igsaw+0.088
 grand+0.088
Neg logit contributions
 managed-0.121
'd-0.114
ched-0.112
pped-0.109
 could-0.106
Token was
Token ID373
Feature activation+1.33e-01

Pos logit contributions
cher+0.237
pped+0.237
tted+0.229
ier+0.228
ched+0.226
Neg logit contributions
 grand-0.379
 Grand-0.351
 festivals-0.336
 orig-0.330
 landscape-0.330
Token happy
Token ID3772
Feature activation-5.80e-02

Pos logit contributions
'd+0.261
ched+0.258
pped+0.256
 managed+0.250
cher+0.244
Neg logit contributions
 Grand-0.212
 festivals-0.207
 Aunt-0.207
 grand-0.201
 Sunday-0.192
Token the
Token ID262
Feature activation-1.27e-01

Pos logit contributions
ummy+0.027
igsaw+0.026
 batch+0.026
 Of+0.025
 Aunt+0.024
Neg logit contributions
 managed-0.032
'd-0.030
pped-0.028
 could-0.028
 stayed-0.028
Token stars
Token ID5788
Feature activation+6.57e-04

Pos logit contributions
 batch+0.147
 Of+0.133
 Gig+0.133
igsaw+0.131
 Aunt+0.130
Neg logit contributions
 managed-0.319
'd-0.316
 could-0.296
pped-0.289
ched-0.289
Token and
Token ID290
Feature activation-9.79e-02

Pos logit contributions
 managed+0.001
ched+0.001
pped+0.001
'd+0.001
tted+0.001
Neg logit contributions
ummy-0.001
 grand-0.001
 festivals-0.001
 Grand-0.001
 batch-0.001
Token planets
Token ID14705
Feature activation+2.22e-02

Pos logit contributions
 grand+0.157
 Grand+0.156
 festivals+0.154
 Aunt+0.152
 Of+0.142
Neg logit contributions
 managed-0.202
ched-0.197
pped-0.195
'd-0.186
cher-0.185
Token in
Token ID287
Feature activation-1.44e-01

Pos logit contributions
 managed+0.050
pped+0.048
ched+0.048
tted+0.046
'd+0.045
Neg logit contributions
 festivals-0.028
 grand-0.027
 Grand-0.025
 sunrise-0.025
ummy-0.025
Token outer
Token ID12076
Feature activation-2.78e-01

Pos logit contributions
ummy+0.211
 batch+0.211
 Of+0.206
igsaw+0.204
 festivals+0.201
Neg logit contributions
 managed-0.308
'd-0.284
pped-0.283
ched-0.278
 could-0.263
Token space
Token ID2272
Feature activation-9.27e-02

Pos logit contributions
 batch+0.314
 Of+0.306
 Gig+0.293
ummy+0.288
 chip+0.279
Neg logit contributions
 managed-0.672
'd-0.630
 found-0.630
pped-0.621
 could-0.620
Token.
Token ID13
Feature activation-4.42e-02

Pos logit contributions
 grand+0.143
 Grand+0.142
 festivals+0.140
ummy+0.135
 Aunt+0.134
Neg logit contributions
 managed-0.197
pped-0.186
ched-0.186
'd-0.183
cher-0.171
Token So
Token ID1406
Feature activation-3.87e-02

Pos logit contributions
igsaw+0.077
 batch+0.075
 grand+0.073
 festivals+0.071
 orig+0.070
Neg logit contributions
 managed-0.080
'd-0.077
 could-0.075
 found-0.070
ched-0.070
Token one
Token ID530
Feature activation-9.76e-03

Pos logit contributions
igsaw+0.071
 festivals+0.069
 batch+0.068
 grand+0.066
ummy+0.066
Neg logit contributions
 managed-0.072
'd-0.065
 stayed-0.062
 could-0.061
pped-0.061
Token day
Token ID1110
Feature activation-1.71e-02

Pos logit contributions
igsaw+0.015
 festivals+0.014
ummy+0.014
 batch+0.014
 Gig+0.014
Neg logit contributions
 managed-0.020
'd-0.020
 stayed-0.019
 could-0.019
ched-0.018
Token food
Token ID2057
Feature activation-1.05e-02

Pos logit contributions
 managed+0.145
'd+0.138
pped+0.135
ched+0.134
cher+0.127
Neg logit contributions
 Grand-0.102
 Aunt-0.101
 grand-0.100
ummy-0.100
 festivals-0.100
Token for
Token ID329
Feature activation-3.45e-02

Pos logit contributions
 Grand+0.015
ummy+0.015
 grand+0.015
 batch+0.015
 Aunt+0.015
Neg logit contributions
 managed-0.023
pped-0.021
'd-0.021
ched-0.021
 wouldn-0.019
Token the
Token ID262
Feature activation-1.37e-01

Pos logit contributions
igsaw+0.054
ummy+0.052
 Of+0.051
 batch+0.051
 festivals+0.051
Neg logit contributions
 managed-0.072
'd-0.066
pped-0.066
ched-0.065
 stayed-0.063
Token rh
Token ID9529
Feature activation-1.48e-01

Pos logit contributions
igsaw+0.134
 festivals+0.127
ummy+0.126
 Grand+0.125
 batch+0.125
Neg logit contributions
 managed-0.364
'd-0.349
ched-0.338
pped-0.338
 stayed-0.327
Tokenin
Token ID259
Feature activation-5.68e-02

Pos logit contributions
ummy+0.272
 batch+0.269
 grand+0.260
 festivals+0.259
igsaw+0.254
Neg logit contributions
 managed-0.265
'd-0.242
ched-0.236
pped-0.230
 wouldn-0.221
Tokenoc
Token ID420
Feature activation-2.73e-01

Pos logit contributions
 Grand+0.048
ummy+0.047
 Aunt+0.047
 batch+0.047
 grand+0.047
Neg logit contributions
 managed-0.158
pped-0.151
'd-0.150
ched-0.149
 wouldn-0.140
Tokeneros
Token ID27498
Feature activation+6.50e-03

Pos logit contributions
 festivals+0.409
 grand+0.404
 Grand+0.402
igsaw+0.399
 Aunt+0.397
Neg logit contributions
 managed-0.573
pped-0.545
ched-0.536
'd-0.527
 stayed-0.497
Token and
Token ID290
Feature activation-9.68e-02

Pos logit contributions
pped+0.014
 managed+0.014
ched+0.013
'd+0.013
cher+0.012
Neg logit contributions
 grand-0.009
 Grand-0.009
ummy-0.009
 batch-0.009
 Gig-0.009
Token his
Token ID465
Feature activation+3.43e-02

Pos logit contributions
 Grand+0.161
 grand+0.160
 Aunt+0.156
 Gig+0.155
 Uncle+0.148
Neg logit contributions
pped-0.196
ched-0.190
 managed-0.188
'd-0.181
tted-0.180
Token friends
Token ID2460
Feature activation+3.71e-02

Pos logit contributions
 managed+0.072
'd+0.067
ched+0.067
pped+0.067
cher+0.062
Neg logit contributions
ummy-0.052
 Grand-0.052
 grand-0.052
 Aunt-0.052
 festivals-0.051
Token.
Token ID13
Feature activation-5.87e-02

Pos logit contributions
 managed+0.077
pped+0.076
ched+0.073
'd+0.070
cher+0.068
Neg logit contributions
 grand-0.054
 Grand-0.054
 festivals-0.053
 batch-0.052
ummy-0.051
Token.
Token ID13
Feature activation-5.98e-02

Pos logit contributions
 Grand+0.035
ummy+0.035
 festivals+0.035
 grand+0.035
 Aunt+0.035
Neg logit contributions
 managed-0.037
'd-0.034
ched-0.033
pped-0.033
cher-0.030
Token She
Token ID1375
Feature activation+1.76e-01

Pos logit contributions
 batch+0.096
igsaw+0.095
 festivals+0.094
ummy+0.091
 grand+0.091
Neg logit contributions
 managed-0.119
'd-0.112
ched-0.107
 could-0.105
pped-0.102
Token grabbed
Token ID13176
Feature activation-4.62e-03

Pos logit contributions
pped+0.272
ier+0.271
cher+0.257
iers+0.238
ched+0.233
Neg logit contributions
 grand-0.371
 Grand-0.361
 Aunt-0.354
 Uncle-0.323
 batch-0.313
Token the
Token ID262
Feature activation-1.31e-01

Pos logit contributions
ummy+0.008
 festivals+0.007
igsaw+0.007
 batch+0.007
 Of+0.007
Neg logit contributions
 managed-0.009
pped-0.009
ched-0.009
'd-0.008
 wouldn-0.008
Token fire
Token ID2046
Feature activation-8.95e-02

Pos logit contributions
igsaw+0.149
 festivals+0.149
ummy+0.148
 Grand+0.146
 batch+0.146
Neg logit contributions
 managed-0.324
'd-0.307
ched-0.303
pped-0.301
 stayed-0.285
Token extingu
Token ID24995
Feature activation-2.72e-01

Pos logit contributions
 Grand+0.147
ummy+0.146
 grand+0.145
 Aunt+0.145
igsaw+0.143
Neg logit contributions
 managed-0.178
'd-0.165
pped-0.164
ched-0.163
cher-0.149
Tokenisher
Token ID4828
Feature activation+3.53e-02

Pos logit contributions
 festivals+0.490
 Grand+0.485
 grand+0.483
igsaw+0.483
ummy+0.481
Neg logit contributions
 managed-0.492
'd-0.452
ched-0.448
pped-0.447
cher-0.411
Token and
Token ID290
Feature activation-8.57e-02

Pos logit contributions
 managed+0.072
'd+0.068
pped+0.067
ched+0.066
cher+0.061
Neg logit contributions
 grand-0.055
 Grand-0.055
ummy-0.054
igsaw-0.054
 Aunt-0.053
Token put
Token ID1234
Feature activation+9.40e-03

Pos logit contributions
 Grand+0.196
 grand+0.193
 Aunt+0.188
 Uncle+0.178
 batch+0.175
Neg logit contributions
'd-0.113
pped-0.109
 managed-0.108
ched-0.105
ier-0.103
Token out
Token ID503
Feature activation+7.33e-05

Pos logit contributions
'd+0.022
 managed+0.022
ave+0.020
cher+0.020
ched+0.019
Neg logit contributions
 Grand-0.013
 Aunt-0.012
 grand-0.011
 Sunday-0.011
 Of-0.011
Token the
Token ID262
Feature activation-1.14e-01

Pos logit contributions
 managed+0.000
'd+0.000
ched+0.000
pped+0.000
cher+0.000
Neg logit contributions
 Grand-0.000
 Aunt-0.000
 grand-0.000
 festivals-0.000
igsaw-0.000
Token what
Token ID644
Feature activation+1.84e-02

Pos logit contributions
ummy+0.175
igsaw+0.175
 batch+0.170
 Of+0.168
 festivals+0.160
Neg logit contributions
 managed-0.264
'd-0.245
pped-0.239
ched-0.237
 stayed-0.233
Token the
Token ID262
Feature activation-1.13e-01

Pos logit contributions
 managed+0.036
pped+0.035
ched+0.034
'd+0.033
cher+0.031
Neg logit contributions
 Grand-0.030
 grand-0.030
 Aunt-0.030
 festivals-0.030
ummy-0.029
Token rocket
Token ID10701
Feature activation-1.61e-03

Pos logit contributions
ummy+0.159
 Grand+0.158
igsaw+0.157
 batch+0.157
 festivals+0.157
Neg logit contributions
 managed-0.248
'd-0.233
ched-0.228
pped-0.228
 stayed-0.213
Token discovered
Token ID5071
Feature activation+1.80e-02

Pos logit contributions
 grand+0.004
 Grand+0.003
 Aunt+0.003
 landscape+0.003
 Of+0.003
Neg logit contributions
pped-0.002
ched-0.002
 managed-0.002
tted-0.002
amed-0.002
Token in
Token ID287
Feature activation-1.17e-01

Pos logit contributions
 managed+0.038
pped+0.035
ched+0.035
'd+0.035
ave+0.033
Neg logit contributions
 Grand-0.027
 grand-0.026
 festivals-0.026
ummy-0.025
 Aunt-0.025
Token outer
Token ID12076
Feature activation-2.71e-01

Pos logit contributions
ummy+0.182
 Of+0.179
 batch+0.177
igsaw+0.172
 Gig+0.164
Neg logit contributions
 managed-0.241
'd-0.220
pped-0.219
 stayed-0.206
ched-0.205
Token space
Token ID2272
Feature activation-8.30e-02

Pos logit contributions
 Of+0.265
 batch+0.263
ummy+0.253
 Gig+0.249
 festivals+0.247
Neg logit contributions
 managed-0.700
'd-0.662
pped-0.649
ched-0.641
 could-0.630
Token.
Token ID13
Feature activation-5.34e-02

Pos logit contributions
 grand+0.130
 Grand+0.129
 festivals+0.128
ummy+0.127
 Aunt+0.124
Neg logit contributions
 managed-0.170
pped-0.161
ched-0.161
'd-0.155
ave-0.145
Token It
Token ID632
Feature activation+1.94e-01

Pos logit contributions
 batch+0.076
ummy+0.075
 festivals+0.074
igsaw+0.074
 grand+0.073
Neg logit contributions
 managed-0.117
'd-0.110
ched-0.107
 could-0.104
pped-0.103
Token was
Token ID373
Feature activation+1.43e-01

Pos logit contributions
pped+0.308
ier+0.254
cher+0.253
 managed+0.248
ched+0.247
Neg logit contributions
 grand-0.398
 Grand-0.380
 sorts-0.379
 Aunt-0.377
 landscape-0.361
Token a
Token ID257
Feature activation-7.98e-02

Pos logit contributions
 managed+0.294
ched+0.278
pped+0.276
'd+0.274
cher+0.262
Neg logit contributions
 Grand-0.225
 grand-0.222
 Aunt-0.220
 festivals-0.215
ummy-0.212
Token and
Token ID290
Feature activation-9.31e-02

Pos logit contributions
 Of+0.183
igsaw+0.181
 Gig+0.179
 Aunt+0.178
ummy+0.178
Neg logit contributions
 managed-0.150
'd-0.135
ched-0.132
pped-0.130
 could-0.126
Token took
Token ID1718
Feature activation+2.72e-02

Pos logit contributions
 Grand+0.209
 grand+0.201
 Aunt+0.200
 festivals+0.188
 Uncle+0.186
Neg logit contributions
pped-0.136
ched-0.132
 managed-0.129
'd-0.128
ier-0.124
Token the
Token ID262
Feature activation-1.29e-01

Pos logit contributions
 managed+0.064
'd+0.063
pped+0.059
ched+0.058
cher+0.058
Neg logit contributions
 Grand-0.037
 Aunt-0.036
 grand-0.035
 festivals-0.034
 Of-0.033
Token rh
Token ID9529
Feature activation-1.43e-01

Pos logit contributions
igsaw+0.160
ummy+0.140
 festivals+0.139
 batch+0.135
 Of+0.135
Neg logit contributions
 managed-0.312
'd-0.305
ched-0.293
pped-0.291
 stayed-0.291
Tokenin
Token ID259
Feature activation-5.62e-02

Pos logit contributions
ummy+0.269
 batch+0.266
 festivals+0.262
 grand+0.260
 Grand+0.256
Neg logit contributions
 managed-0.252
'd-0.230
ched-0.223
pped-0.220
 wouldn-0.209
Tokenoc
Token ID420
Feature activation-2.70e-01

Pos logit contributions
igsaw+0.077
ummy+0.072
 festivals+0.072
 Grand+0.069
 batch+0.068
Neg logit contributions
 managed-0.127
ched-0.122
'd-0.120
pped-0.117
cher-0.115
Tokeneros
Token ID27498
Feature activation+5.20e-03

Pos logit contributions
 festivals+0.411
 grand+0.410
igsaw+0.409
 Grand+0.408
 Aunt+0.408
Neg logit contributions
 managed-0.555
pped-0.535
ched-0.522
'd-0.508
ier-0.483
Token home
Token ID1363
Feature activation-3.37e-02

Pos logit contributions
 managed+0.011
'd+0.010
pped+0.010
ched+0.010
cher+0.009
Neg logit contributions
 Grand-0.008
 grand-0.008
 festivals-0.008
ummy-0.008
 Aunt-0.008
Token with
Token ID351
Feature activation-1.16e-01

Pos logit contributions
 Grand+0.055
 grand+0.054
 Aunt+0.052
ummy+0.052
 batch+0.052
Neg logit contributions
 managed-0.069
'd-0.066
pped-0.065
ched-0.065
cher-0.058
Token her
Token ID607
Feature activation+3.27e-02

Pos logit contributions
 Grand+0.170
 grand+0.168
 Aunt+0.166
ummy+0.159
 batch+0.159
Neg logit contributions
 managed-0.256
'd-0.245
ched-0.240
pped-0.237
cher-0.227
Token.
Token ID13
Feature activation-6.01e-02

Pos logit contributions
 managed+0.068
'd+0.063
ched+0.062
pped+0.062
 stayed+0.057
Neg logit contributions
ummy-0.051
 festivals-0.051
 Grand-0.051
igsaw-0.051
 Aunt-0.050
Token quickly
Token ID2952
Feature activation+3.66e-02

Pos logit contributions
ier+0.312
pped+0.290
bridge+0.273
cher+0.271
iers+0.268
Neg logit contributions
 grand-0.377
 Grand-0.341
 Aunt-0.340
 Uncle-0.309
 M-0.306
Token checked
Token ID10667
Feature activation+3.28e-02

Pos logit contributions
pped+0.045
'd+0.045
ier+0.043
cher+0.042
 managed+0.042
Neg logit contributions
 grand-0.080
 Grand-0.077
 Gig-0.076
 Aunt-0.075
 batch-0.072
Token for
Token ID329
Feature activation-1.81e-02

Pos logit contributions
 managed+0.072
'd+0.067
pped+0.066
ched+0.066
 wouldn+0.062
Neg logit contributions
igsaw-0.048
ummy-0.048
 festivals-0.047
 batch-0.047
 grand-0.046
Token a
Token ID257
Feature activation-1.02e-01

Pos logit contributions
 Grand+0.025
 grand+0.024
 Aunt+0.024
 festivals+0.023
ummy+0.023
Neg logit contributions
 managed-0.042
'd-0.040
ched-0.040
pped-0.039
cher-0.036
Token fire
Token ID2046
Feature activation-8.84e-02

Pos logit contributions
ummy+0.132
 festivals+0.131
 Grand+0.129
igsaw+0.128
 Of+0.127
Neg logit contributions
 managed-0.242
'd-0.229
ched-0.221
pped-0.217
cher-0.210
Token extingu
Token ID24995
Feature activation-2.67e-01

Pos logit contributions
 grand+0.131
 Grand+0.131
 Aunt+0.129
ummy+0.127
 batch+0.122
Neg logit contributions
 managed-0.193
pped-0.183
ched-0.180
'd-0.176
cher-0.161
Tokenisher
Token ID4828
Feature activation+3.60e-02

Pos logit contributions
ummy+0.444
 Grand+0.438
 batch+0.437
igsaw+0.434
 grand+0.434
Neg logit contributions
 managed-0.525
'd-0.490
ched-0.482
pped-0.477
 stayed-0.440
Token,
Token ID11
Feature activation-1.34e-01

Pos logit contributions
 managed+0.081
pped+0.077
'd+0.076
ched+0.075
amed+0.071
Neg logit contributions
 grand-0.047
 Grand-0.046
 Aunt-0.043
ummy-0.042
 batch-0.041
Token but
Token ID475
Feature activation+2.66e-02

Pos logit contributions
 Grand+0.175
 grand+0.170
 Aunt+0.167
 Of+0.156
 festivals+0.155
Neg logit contributions
 managed-0.311
ched-0.300
'd-0.296
pped-0.294
tted-0.278
Token there
Token ID612
Feature activation+1.67e-02

Pos logit contributions
ched+0.063
tted+0.060
ave+0.059
pped+0.059
'd+0.059
Neg logit contributions
 Grand-0.037
 grand-0.034
 Uncle-0.032
 Aunt-0.032
 Wednesday-0.028
Token wasn
Token ID2492
Feature activation+1.35e-01

Pos logit contributions
pped+0.025
cher+0.025
 managed+0.025
ched+0.024
ave+0.023
Neg logit contributions
 Grand-0.034
 Aunt-0.033
 grand-0.033
 Uncle-0.032
ummy-0.031
Token good
Token ID922
Feature activation-7.24e-02

Pos logit contributions
ummy+0.108
 Grand+0.107
 grand+0.107
 festivals+0.106
igsaw+0.105
Neg logit contributions
 managed-0.173
pped-0.164
'd-0.164
ched-0.160
cher-0.155
Token to
Token ID284
Feature activation-3.84e-02

Pos logit contributions
ummy+0.095
 Grand+0.093
 grand+0.092
 Of+0.091
 festivals+0.090
Neg logit contributions
 managed-0.166
'd-0.161
pped-0.153
ched-0.151
cher-0.148
Token roll
Token ID4836
Feature activation+6.12e-03

Pos logit contributions
 Aunt+0.066
 Grand+0.066
 festivals+0.066
 grand+0.065
 Sunday+0.063
Neg logit contributions
 managed-0.072
ched-0.070
pped-0.069
'd-0.066
ave-0.064
Token in
Token ID287
Feature activation-1.24e-01

Pos logit contributions
 managed+0.011
'd+0.011
pped+0.010
ched+0.010
cher+0.009
Neg logit contributions
 Grand-0.011
ummy-0.011
 Aunt-0.011
 grand-0.011
 festivals-0.011
Token the
Token ID262
Feature activation-1.56e-01

Pos logit contributions
ummy+0.175
igsaw+0.174
 batch+0.173
 festivals+0.171
 Of+0.171
Neg logit contributions
 managed-0.274
'd-0.254
pped-0.252
ched-0.250
 stayed-0.235
Token lush
Token ID37408
Feature activation-2.65e-01

Pos logit contributions
 grand+0.173
 Grand+0.169
 Aunt+0.169
 festivals+0.167
ummy+0.166
Neg logit contributions
 managed-0.398
pped-0.378
ched-0.372
'd-0.370
cher-0.363
Token green
Token ID4077
Feature activation-2.11e-01

Pos logit contributions
 Of+0.420
 batch+0.407
igsaw+0.396
 Aunt+0.395
 Grand+0.392
Neg logit contributions
 managed-0.528
'd-0.490
ched-0.484
pped-0.480
 could-0.468
Token grass
Token ID8701
Feature activation-4.42e-02

Pos logit contributions
ummy+0.245
 festivals+0.244
 Grand+0.243
 grand+0.242
 Aunt+0.239
Neg logit contributions
 managed-0.522
'd-0.488
pped-0.486
ched-0.484
cher-0.460
Token and
Token ID290
Feature activation-1.25e-01

Pos logit contributions
ummy+0.073
 Grand+0.072
 grand+0.071
 festivals+0.070
 Aunt+0.070
Neg logit contributions
 managed-0.088
'd-0.081
pped-0.081
ched-0.080
cher-0.074
Token jump
Token ID4391
Feature activation-1.88e-03

Pos logit contributions
 Grand+0.199
 festivals+0.197
 grand+0.196
 Aunt+0.195
 Gig+0.188
Neg logit contributions
pped-0.246
 managed-0.245
ched-0.239
'd-0.231
ier-0.222
Token high
Token ID1029
Feature activation-1.44e-01

Pos logit contributions
ummy+0.003
 festivals+0.003
 grand+0.003
 Gig+0.003
 Grand+0.003
Neg logit contributions
 managed-0.004
'd-0.003
pped-0.003
cher-0.003
 remembered-0.003
Token farmer
Token ID18739
Feature activation+1.63e-02

Pos logit contributions
igsaw+0.251
 Of+0.250
 festivals+0.243
 outer+0.232
 perspective+0.229
Neg logit contributions
 managed-0.201
'd-0.196
 found-0.194
 had-0.194
 could-0.191
Token who
Token ID508
Feature activation-2.39e-02

Pos logit contributions
 managed+0.028
'd+0.026
ched+0.025
pped+0.025
 stayed+0.023
Neg logit contributions
igsaw-0.032
 festivals-0.031
ummy-0.031
 batch-0.030
 Grand-0.030
Token lived
Token ID5615
Feature activation+3.65e-02

Pos logit contributions
 grand+0.045
 Grand+0.045
 Aunt+0.043
 festivals+0.042
 Gig+0.042
Neg logit contributions
pped-0.042
 managed-0.042
ched-0.039
cher-0.039
ier-0.037
Token on
Token ID319
Feature activation-1.34e-01

Pos logit contributions
 managed+0.051
'd+0.046
ched+0.045
pped+0.044
 wouldn+0.039
Neg logit contributions
 Grand-0.081
ummy-0.081
 festivals-0.081
 grand-0.081
igsaw-0.080
Token a
Token ID257
Feature activation-1.44e-01

Pos logit contributions
 Of+0.287
igsaw+0.280
ummy+0.279
utes+0.272
 batch+0.268
Neg logit contributions
 managed-0.183
pped-0.173
'd-0.164
 stayed-0.163
ched-0.162
Token lush
Token ID37408
Feature activation-2.63e-01

Pos logit contributions
 Of+0.191
igsaw+0.165
ummy+0.155
 Grand+0.150
 Aunt+0.146
Neg logit contributions
'd-0.329
 managed-0.323
 stayed-0.303
 found-0.299
 could-0.296
Token farm
Token ID5318
Feature activation-8.72e-02

Pos logit contributions
 Of+0.471
utes+0.385
igsaw+0.381
 Sorry+0.376
 Aunt+0.374
Neg logit contributions
 found-0.454
 managed-0.439
 back-0.434
'd-0.432
 stayed-0.427
Token with
Token ID351
Feature activation-1.46e-01

Pos logit contributions
igsaw+0.201
ummy+0.188
 festivals+0.187
 Of+0.186
 Aunt+0.184
Neg logit contributions
 managed-0.115
'd-0.105
ched-0.103
pped-0.099
 could-0.098
Token many
Token ID867
Feature activation-1.83e-01

Pos logit contributions
igsaw+0.227
ummy+0.216
 Of+0.213
 batch+0.211
 festivals+0.210
Neg logit contributions
 managed-0.309
ched-0.283
pped-0.279
'd-0.277
 could-0.269
Token animals
Token ID4695
Feature activation+5.76e-03

Pos logit contributions
igsaw+0.318
 Of+0.304
ummy+0.296
 Grand+0.295
 batch+0.293
Neg logit contributions
 managed-0.355
'd-0.333
ched-0.318
pped-0.314
 could-0.307
Token and
Token ID290
Feature activation-1.27e-01

Pos logit contributions
 managed+0.011
'd+0.010
pped+0.010
ched+0.010
 wouldn+0.009
Neg logit contributions
 Grand-0.010
 festivals-0.010
ummy-0.010
 grand-0.010
 Aunt-0.010
Token.
Token ID13
Feature activation-6.61e-02

Pos logit contributions
ummy+0.141
 Grand+0.141
 grand+0.138
 festivals+0.137
 Aunt+0.137
Neg logit contributions
 managed-0.173
'd-0.157
pped-0.157
ched-0.156
cher-0.149
Token It
Token ID632
Feature activation+1.76e-01

Pos logit contributions
 batch+0.142
igsaw+0.130
ummy+0.128
 perspective+0.125
 festivals+0.124
Neg logit contributions
 could-0.102
 managed-0.102
'd-0.096
tted-0.091
ched-0.091
Token ran
Token ID4966
Feature activation+6.19e-02

Pos logit contributions
pped+0.287
cher+0.275
 managed+0.256
ier+0.252
ched+0.233
Neg logit contributions
 grand-0.340
 Grand-0.333
 landscape-0.333
 blue-0.328
 sunset-0.326
Token through
Token ID832
Feature activation-1.11e-01

Pos logit contributions
 managed+0.109
'd+0.100
ched+0.098
pped+0.098
 stayed+0.089
Neg logit contributions
ummy-0.115
 Grand-0.115
 festivals-0.115
igsaw-0.114
 Aunt-0.114
Token a
Token ID257
Feature activation-1.35e-01

Pos logit contributions
igsaw+0.202
 batch+0.195
ummy+0.193
 Of+0.192
 Aunt+0.187
Neg logit contributions
 managed-0.210
 could-0.193
pped-0.192
 found-0.188
'd-0.186
Token lush
Token ID37408
Feature activation-2.62e-01

Pos logit contributions
 Of+0.188
ummy+0.180
 Grand+0.179
igsaw+0.178
 Aunt+0.178
Neg logit contributions
 managed-0.307
'd-0.291
 could-0.275
ched-0.273
pped-0.268
Token green
Token ID4077
Feature activation-2.03e-01

Pos logit contributions
 Of+0.452
 Aunt+0.407
 batch+0.395
 Grand+0.394
igsaw+0.389
Neg logit contributions
 managed-0.493
 found-0.476
'd-0.456
 could-0.451
 stayed-0.449
Token me
Token ID502
Feature activation+3.30e-02

Pos logit contributions
 Of+0.286
 Grand+0.284
 Aunt+0.281
igsaw+0.277
 grand+0.268
Neg logit contributions
 managed-0.444
'd-0.429
pped-0.416
ched-0.414
 stayed-0.408
Tokenadow
Token ID4584
Feature activation-4.33e-03

Pos logit contributions
 managed+0.071
pped+0.067
ched+0.065
'd+0.064
cher+0.063
Neg logit contributions
 Grand-0.047
ummy-0.046
 Sunday-0.046
 festivals-0.045
 grand-0.045
Token,
Token ID11
Feature activation-1.61e-01

Pos logit contributions
ummy+0.007
 Grand+0.007
 grand+0.007
 festivals+0.007
 Aunt+0.007
Neg logit contributions
 managed-0.009
'd-0.008
pped-0.008
ched-0.008
cher-0.008
Token with
Token ID351
Feature activation-1.14e-01

Pos logit contributions
igsaw+0.292
ummy+0.288
 batch+0.283
 Grand+0.282
 Aunt+0.282
Neg logit contributions
 managed-0.297
'd-0.275
ched-0.269
pped-0.268
 stayed-0.251
Token have
Token ID423
Feature activation+1.33e-01

Pos logit contributions
 managed+0.128
ched+0.117
pped+0.117
'd+0.115
tted+0.105
Neg logit contributions
 Grand-0.138
 festivals-0.137
 grand-0.136
 Aunt-0.131
ummy-0.130
Token stopped
Token ID5025
Feature activation-6.93e-03

Pos logit contributions
 managed+0.265
ched+0.251
pped+0.250
'd+0.250
cher+0.232
Neg logit contributions
 grand-0.220
 Grand-0.220
 Aunt-0.209
 festivals-0.208
 Sunday-0.197
Token the
Token ID262
Feature activation-1.36e-01

Pos logit contributions
igsaw+0.012
 batch+0.012
 festivals+0.012
ummy+0.012
 Grand+0.012
Neg logit contributions
 managed-0.013
'd-0.012
pped-0.012
ched-0.012
 stayed-0.011
Token rh
Token ID9529
Feature activation-1.46e-01

Pos logit contributions
igsaw+0.177
 Of+0.165
ummy+0.165
 festivals+0.157
 batch+0.157
Neg logit contributions
 managed-0.311
'd-0.307
cher-0.289
pped-0.288
 stayed-0.286
Tokenin
Token ID259
Feature activation-5.43e-02

Pos logit contributions
ummy+0.279
 batch+0.277
 grand+0.271
 festivals+0.271
 Grand+0.270
Neg logit contributions
 managed-0.255
'd-0.233
ched-0.227
pped-0.219
 wouldn-0.209
Tokenoc
Token ID420
Feature activation-2.62e-01

Pos logit contributions
igsaw+0.060
 festivals+0.058
ummy+0.058
 Grand+0.057
 grand+0.056
Neg logit contributions
 managed-0.137
'd-0.130
ched-0.129
pped-0.127
cher-0.122
Tokeneros
Token ID27498
Feature activation+1.33e-03

Pos logit contributions
 festivals+0.405
igsaw+0.396
 grand+0.395
 Aunt+0.390
 Grand+0.389
Neg logit contributions
 managed-0.533
pped-0.508
ched-0.498
'd-0.490
ier-0.471
Token.
Token ID13
Feature activation-6.99e-02

Pos logit contributions
 managed+0.003
'd+0.002
pped+0.002
ched+0.002
cher+0.002
Neg logit contributions
 Grand-0.002
 grand-0.002
 festivals-0.002
ummy-0.002
 Aunt-0.002
Token He
Token ID679
Feature activation+1.67e-01

Pos logit contributions
 batch+0.122
igsaw+0.119
ummy+0.113
 festivals+0.110
 perspective+0.107
Neg logit contributions
 managed-0.132
'd-0.126
ched-0.125
 could-0.124
pped-0.120
Token is
Token ID318
Feature activation+1.20e-01

Pos logit contributions
pped+0.308
 managed+0.300
ched+0.282
ier+0.275
amed+0.264
Neg logit contributions
 grand-0.305
 Grand-0.303
 Aunt-0.280
 batch-0.266
 festivals-0.265
Token sleeping
Token ID11029
Feature activation-4.29e-02

Pos logit contributions
 managed+0.229
'd+0.216
ched+0.216
pped+0.213
ave+0.195
Neg logit contributions
 Grand-0.219
 grand-0.214
 Aunt-0.208
 festivals-0.203
 Uncle-0.197
Token trapped
Token ID13640
Feature activation-3.33e-02

Pos logit contributions
'd+0.208
ched+0.204
pped+0.203
 managed+0.203
ave+0.187
Neg logit contributions
 grand-0.229
 Grand-0.228
 Aunt-0.228
 Uncle-0.208
 Gig-0.201
Token with
Token ID351
Feature activation-1.10e-01

Pos logit contributions
 grand+0.053
 Grand+0.052
ummy+0.051
 Aunt+0.049
 festivals+0.048
Neg logit contributions
 managed-0.065
'd-0.064
ched-0.061
ave-0.059
pped-0.056
Token the
Token ID262
Feature activation-1.37e-01

Pos logit contributions
 Grand+0.132
 grand+0.130
 Aunt+0.129
 festivals+0.126
ummy+0.124
Neg logit contributions
 managed-0.269
'd-0.257
ched-0.252
pped-0.251
cher-0.239
Token rh
Token ID9529
Feature activation-1.41e-01

Pos logit contributions
igsaw+0.169
ummy+0.166
 festivals+0.165
 Grand+0.163
 batch+0.162
Neg logit contributions
 managed-0.332
'd-0.315
ched-0.308
pped-0.307
 stayed-0.291
Tokenin
Token ID259
Feature activation-5.75e-02

Pos logit contributions
ummy+0.262
 batch+0.258
 grand+0.255
 festivals+0.254
 Grand+0.253
Neg logit contributions
 managed-0.247
'd-0.227
ched-0.221
pped-0.215
 wouldn-0.205
Tokenoc
Token ID420
Feature activation-2.61e-01

Pos logit contributions
 Grand+0.048
 grand+0.048
 Aunt+0.048
ummy+0.048
 batch+0.047
Neg logit contributions
 managed-0.160
'd-0.152
pped-0.151
ched-0.150
 wouldn-0.141
Tokeneros
Token ID27498
Feature activation+1.48e-03

Pos logit contributions
 festivals+0.397
igsaw+0.394
 grand+0.389
 Grand+0.387
 batch+0.383
Neg logit contributions
 managed-0.543
pped-0.514
ched-0.508
'd-0.501
cher-0.475
Token.
Token ID13
Feature activation-6.98e-02

Pos logit contributions
 managed+0.003
pped+0.003
'd+0.003
ched+0.003
cher+0.003
Neg logit contributions
 grand-0.002
 Grand-0.002
 Aunt-0.002
ummy-0.002
 batch-0.002
Token They
Token ID1119
Feature activation+1.49e-01

Pos logit contributions
 batch+0.098
igsaw+0.096
ummy+0.095
 festivals+0.090
 orig+0.086
Neg logit contributions
 managed-0.153
ched-0.144
 could-0.143
'd-0.142
pped-0.136
Token looked
Token ID3114
Feature activation+9.78e-02

Pos logit contributions
ier+0.237
cher+0.209
pped+0.200
amed+0.198
pping+0.193
Neg logit contributions
 grand-0.296
 Grand-0.284
 Aunt-0.265
 Gig-0.255
 T-0.248
Token for
Token ID329
Feature activation-5.38e-03

Pos logit contributions
 managed+0.178
'd+0.166
pped+0.165
ched+0.160
cher+0.153
Neg logit contributions
 grand-0.179
 Grand-0.178
 Aunt-0.176
 festivals-0.171
ummy-0.170
Token to
Token ID284
Feature activation-2.95e-02

Pos logit contributions
 managed+0.095
 could+0.090
'd+0.088
 stayed+0.087
ched+0.084
Neg logit contributions
igsaw-0.169
 batch-0.155
 Of-0.151
ummy-0.147
utes-0.146
Token see
Token ID766
Feature activation+2.85e-02

Pos logit contributions
igsaw+0.062
ummy+0.061
 batch+0.061
 Of+0.060
 Grand+0.060
Neg logit contributions
 managed-0.046
'd-0.042
pped-0.041
ched-0.041
 stayed-0.037
Token the
Token ID262
Feature activation-1.27e-01

Pos logit contributions
 managed+0.052
pped+0.049
'd+0.048
 could+0.047
ched+0.047
Neg logit contributions
igsaw-0.052
ummy-0.048
 batch-0.047
 Of-0.047
 festivals-0.044
Token rh
Token ID9529
Feature activation-1.39e-01

Pos logit contributions
igsaw+0.177
 Of+0.164
 festivals+0.148
 batch+0.148
ummy+0.147
Neg logit contributions
'd-0.284
 managed-0.284
pped-0.270
ched-0.261
 stayed-0.261
Tokenin
Token ID259
Feature activation-5.00e-02

Pos logit contributions
ummy+0.270
 batch+0.267
 festivals+0.265
 grand+0.262
 Grand+0.261
Neg logit contributions
 managed-0.238
'd-0.216
ched-0.212
pped-0.206
 wouldn-0.193
Tokenoc
Token ID420
Feature activation-2.61e-01

Pos logit contributions
igsaw+0.051
ummy+0.050
 Grand+0.050
 festivals+0.050
 grand+0.050
Neg logit contributions
 managed-0.130
'd-0.123
ched-0.122
pped-0.121
cher-0.115
Tokeneros
Token ID27498
Feature activation+1.11e-02

Pos logit contributions
igsaw+0.424
 festivals+0.423
 grand+0.416
 Grand+0.408
 Aunt+0.407
Neg logit contributions
 managed-0.500
pped-0.488
ched-0.467
'd-0.459
cher-0.447
Token anymore
Token ID7471
Feature activation-2.94e-02

Pos logit contributions
 managed+0.023
pped+0.021
ched+0.021
'd+0.021
tted+0.020
Neg logit contributions
 festivals-0.017
 Grand-0.017
 grand-0.017
ummy-0.017
 Aunt-0.016
Token.
Token ID13
Feature activation-7.19e-02

Pos logit contributions
 Grand+0.048
 grand+0.047
ummy+0.047
 festivals+0.046
 Aunt+0.046
Neg logit contributions
 managed-0.061
'd-0.056
pped-0.056
ched-0.056
cher-0.051
Token He
Token ID679
Feature activation+1.62e-01

Pos logit contributions
 batch+0.120
igsaw+0.118
 festivals+0.112
ummy+0.111
 perspective+0.105
Neg logit contributions
 managed-0.141
'd-0.136
ched-0.132
 could-0.132
pped-0.126
Token is
Token ID318
Feature activation+1.14e-01

Pos logit contributions
pped+0.336
amed+0.302
 managed+0.300
ier+0.290
cher+0.288
Neg logit contributions
 grand-0.261
 Grand-0.245
 Aunt-0.224
 history-0.217
 batch-0.213
Token look
Token ID804
Feature activation+9.79e-02

Pos logit contributions
 Grand+0.137
 grand+0.134
 Aunt+0.132
 festivals+0.125
 Uncle+0.120
Neg logit contributions
pped-0.159
ched-0.156
ier-0.152
 managed-0.151
'd-0.140
Token for
Token ID329
Feature activation-3.64e-03

Pos logit contributions
 managed+0.172
'd+0.158
ched+0.156
pped+0.155
cher+0.141
Neg logit contributions
 Grand-0.183
 grand-0.182
ummy-0.182
 Aunt-0.181
 festivals-0.181
Token the
Token ID262
Feature activation-1.12e-01

Pos logit contributions
igsaw+0.007
ummy+0.006
 batch+0.006
 Of+0.006
 festivals+0.006
Neg logit contributions
 managed-0.007
pped-0.006
'd-0.006
ched-0.006
 stayed-0.006
Token rh
Token ID9529
Feature activation-1.30e-01

Pos logit contributions
igsaw+0.158
ummy+0.151
 festivals+0.147
 Of+0.146
 batch+0.146
Neg logit contributions
 managed-0.253
'd-0.244
pped-0.231
ched-0.231
 stayed-0.224
Tokenin
Token ID259
Feature activation-4.22e-02

Pos logit contributions
ummy+0.249
 batch+0.244
 festivals+0.241
 grand+0.241
 Grand+0.239
Neg logit contributions
 managed-0.227
'd-0.206
ched-0.202
pped-0.198
 wouldn-0.187
Tokenoc
Token ID420
Feature activation-2.61e-01

Pos logit contributions
ummy+0.041
 Grand+0.040
igsaw+0.040
 festivals+0.040
 grand+0.040
Neg logit contributions
 managed-0.112
'd-0.106
ched-0.105
pped-0.105
cher-0.099
Tokeneros
Token ID27498
Feature activation+1.86e-02

Pos logit contributions
 festivals+0.430
 grand+0.420
igsaw+0.419
 Aunt+0.411
 Grand+0.408
Neg logit contributions
 managed-0.493
pped-0.480
ched-0.463
'd-0.453
ier-0.445
Token.
Token ID13
Feature activation-5.82e-02

Pos logit contributions
pped+0.039
 managed+0.038
ched+0.037
'd+0.035
amed+0.034
Neg logit contributions
 grand-0.028
 Grand-0.027
 Aunt-0.026
 festivals-0.025
 batch-0.025
Token
Token ID198
Feature activation-4.66e-02

Pos logit contributions
igsaw+0.082
ummy+0.081
 festivals+0.080
 batch+0.080
 grand+0.079
Neg logit contributions
 managed-0.129
'd-0.120
ched-0.120
pped-0.116
 could-0.114
Token
Token ID198
Feature activation-5.27e-02

Pos logit contributions
igsaw+0.068
ummy+0.067
 batch+0.066
 festivals+0.066
 Aunt+0.064
Neg logit contributions
 managed-0.101
ched-0.095
'd-0.094
pped-0.093
 could-0.088
TokenThey
Token ID2990
Feature activation+1.77e-01

Pos logit contributions
 festivals+0.068
 grand+0.067
 Grand+0.065
 Sunday+0.064
 batch+0.064
Neg logit contributions
 managed-0.119
'd-0.113
pped-0.110
 wouldn-0.110
ched-0.110
Token kinds
Token ID6982
Feature activation-9.75e-02

Pos logit contributions
igsaw+0.044
ummy+0.044
 batch+0.043
 grand+0.042
 Grand+0.042
Neg logit contributions
 managed-0.102
'd-0.096
ched-0.096
 could-0.093
 stayed-0.092
Token of
Token ID286
Feature activation-1.48e-01

Pos logit contributions
ummy+0.139
 festivals+0.132
igsaw+0.130
 grand+0.130
 Of+0.129
Neg logit contributions
 managed-0.216
pped-0.206
'd-0.205
ched-0.198
 wouldn-0.190
Token wonders
Token ID19294
Feature activation-9.81e-03

Pos logit contributions
igsaw+0.199
ummy+0.194
 batch+0.193
 Grand+0.192
 Aunt+0.188
Neg logit contributions
 managed-0.336
'd-0.317
ched-0.312
pped-0.302
 could-0.297
Token in
Token ID287
Feature activation-1.36e-01

Pos logit contributions
 Grand+0.017
ummy+0.017
 festivals+0.017
 grand+0.017
 Aunt+0.017
Neg logit contributions
 managed-0.019
'd-0.017
ched-0.017
pped-0.017
cher-0.016
Token the
Token ID262
Feature activation-1.66e-01

Pos logit contributions
ummy+0.213
 batch+0.210
igsaw+0.206
 Of+0.205
 Aunt+0.187
Neg logit contributions
 managed-0.279
'd-0.255
pped-0.246
 could-0.243
 found-0.242
Token big
Token ID1263
Feature activation-2.60e-01

Pos logit contributions
 Grand+0.211
ummy+0.211
 festivals+0.210
 grand+0.209
 Aunt+0.208
Neg logit contributions
 managed-0.392
'd-0.367
ched-0.364
pped-0.364
cher-0.340
Token woods
Token ID16479
Feature activation-6.61e-03

Pos logit contributions
 batch+0.390
igsaw+0.388
 Of+0.386
 Grand+0.378
 Aunt+0.371
Neg logit contributions
 managed-0.541
'd-0.513
pped-0.480
ched-0.480
 could-0.475
Token.
Token ID13
Feature activation-5.95e-02

Pos logit contributions
 grand+0.011
 Grand+0.011
 festivals+0.011
ummy+0.011
 Aunt+0.010
Neg logit contributions
 managed-0.013
pped-0.012
ched-0.012
'd-0.012
cher-0.011
Token
Token ID198
Feature activation-4.29e-02

Pos logit contributions
ummy+0.084
 batch+0.083
igsaw+0.081
 festivals+0.080
 grand+0.078
Neg logit contributions
 managed-0.130
ched-0.124
'd-0.121
 could-0.118
pped-0.116
Token
Token ID198
Feature activation-4.90e-02

Pos logit contributions
igsaw+0.067
ummy+0.066
 batch+0.065
 festivals+0.065
 Grand+0.064
Neg logit contributions
 managed-0.088
ched-0.084
'd-0.083
pped-0.080
 could-0.078
TokenOne
Token ID3198
Feature activation-9.66e-02

Pos logit contributions
 festivals+0.068
 grand+0.067
igsaw+0.066
 Grand+0.066
ummy+0.066
Neg logit contributions
 managed-0.109
'd-0.102
pped-0.100
ched-0.100
 wouldn-0.095
Tokenizz
Token ID6457
Feature activation-1.03e-02

Pos logit contributions
ummy+0.111
igsaw+0.111
 festivals+0.110
 batch+0.110
 Grand+0.110
Neg logit contributions
 managed-0.098
'd-0.091
ched-0.088
pped-0.086
 wouldn-0.081
Tokened
Token ID276
Feature activation+6.26e-03

Pos logit contributions
ummy+0.019
 Grand+0.019
 Aunt+0.019
 grand+0.018
 festivals+0.018
Neg logit contributions
 managed-0.018
'd-0.016
pped-0.016
ched-0.015
 actually-0.014
Token off
Token ID572
Feature activation-3.54e-02

Pos logit contributions
 managed+0.009
ched+0.009
'd+0.008
pped+0.008
 stayed+0.008
Neg logit contributions
ummy-0.013
igsaw-0.013
 batch-0.013
 festivals-0.013
 grand-0.012
Token back
Token ID736
Feature activation+3.95e-03

Pos logit contributions
ummy+0.065
igsaw+0.064
 festivals+0.064
 batch+0.063
 Aunt+0.063
Neg logit contributions
 managed-0.063
'd-0.057
ched-0.057
pped-0.055
 stayed-0.052
Token into
Token ID656
Feature activation-1.25e-01

Pos logit contributions
 managed+0.007
'd+0.006
ched+0.006
pped+0.006
 stayed+0.006
Neg logit contributions
 batch-0.007
ummy-0.007
igsaw-0.007
 festivals-0.007
 Of-0.007
Token outer
Token ID12076
Feature activation-2.60e-01

Pos logit contributions
ummy+0.204
 Of+0.186
 batch+0.183
 Gig+0.183
igsaw+0.179
Neg logit contributions
 managed-0.255
 found-0.220
'd-0.217
pped-0.216
 stayed-0.211
Token space
Token ID2272
Feature activation-7.84e-02

Pos logit contributions
 Of+0.292
 batch+0.282
ummy+0.266
 Gig+0.263
igsaw+0.261
Neg logit contributions
 managed-0.625
'd-0.576
pped-0.569
 found-0.568
ched-0.563
Token.
Token ID13
Feature activation-4.88e-02

Pos logit contributions
 Grand+0.133
 grand+0.133
 festivals+0.132
ummy+0.131
igsaw+0.130
Neg logit contributions
 managed-0.151
pped-0.140
'd-0.139
ched-0.139
 wouldn-0.125
Token Earth
Token ID3668
Feature activation-1.24e-02

Pos logit contributions
 batch+0.077
ummy+0.074
igsaw+0.073
 grand+0.071
 festivals+0.071
Neg logit contributions
 managed-0.099
'd-0.092
ched-0.091
 could-0.091
tted-0.085
Token was
Token ID373
Feature activation+1.38e-01

Pos logit contributions
 grand+0.026
 Grand+0.025
 Aunt+0.025
 festivals+0.025
ummy+0.024
Neg logit contributions
pped-0.018
 managed-0.017
ched-0.016
cher-0.015
'd-0.015
Token an
Token ID281
Feature activation-1.25e-01

Pos logit contributions
 managed+0.257
pped+0.247
ched+0.246
'd+0.243
cher+0.234
Neg logit contributions
 Grand-0.242
 grand-0.238
 Aunt-0.237
 festivals-0.227
 Sunday-0.221
Token dog
Token ID3290
Feature activation+2.54e-02

Pos logit contributions
igsaw+0.170
 festivals+0.162
 batch+0.161
ummy+0.159
 Of+0.159
Neg logit contributions
 managed-0.179
'd-0.177
 could-0.164
 remembered-0.163
 found-0.155
Token who
Token ID508
Feature activation-1.76e-02

Pos logit contributions
 managed+0.039
'd+0.037
ched+0.036
pped+0.036
 could+0.034
Neg logit contributions
igsaw-0.052
 festivals-0.050
ummy-0.049
 batch-0.049
 Of-0.049
Token lived
Token ID5615
Feature activation+4.08e-02

Pos logit contributions
 grand+0.033
 Grand+0.032
 Aunt+0.031
 festivals+0.031
 Of+0.030
Neg logit contributions
 managed-0.031
pped-0.031
cher-0.029
ched-0.028
ier-0.028
Token in
Token ID287
Feature activation-1.71e-01

Pos logit contributions
 managed+0.057
'd+0.051
ched+0.050
pped+0.050
cher+0.044
Neg logit contributions
ummy-0.090
 Grand-0.090
igsaw-0.090
 festivals-0.090
 Aunt-0.089
Token a
Token ID257
Feature activation-1.64e-01

Pos logit contributions
ummy+0.338
 batch+0.332
igsaw+0.320
 festivals+0.316
 Of+0.316
Neg logit contributions
 managed-0.267
pped-0.248
ched-0.242
'd-0.236
 stayed-0.227
Token big
Token ID1263
Feature activation-2.60e-01

Pos logit contributions
 Of+0.253
ummy+0.243
igsaw+0.239
 Gig+0.239
 batch+0.230
Neg logit contributions
 managed-0.329
'd-0.306
 found-0.299
 could-0.296
pped-0.288
Token house
Token ID2156
Feature activation-2.67e-02

Pos logit contributions
 Of+0.496
utes+0.484
igsaw+0.438
 Gig+0.438
 batch+0.434
Neg logit contributions
 back-0.414
 found-0.412
 managed-0.400
 remembered-0.379
 had-0.371
Token with
Token ID351
Feature activation-1.46e-01

Pos logit contributions
igsaw+0.050
 festivals+0.049
 Aunt+0.049
ummy+0.048
 Grand+0.048
Neg logit contributions
 managed-0.046
'd-0.043
pped-0.042
ched-0.042
 stayed-0.039
Token her
Token ID607
Feature activation+1.09e-02

Pos logit contributions
igsaw+0.270
 batch+0.265
 festivals+0.263
ummy+0.263
 Of+0.257
Neg logit contributions
 managed-0.250
ched-0.232
pped-0.230
'd-0.224
 stayed-0.218
Token owner
Token ID4870
Feature activation+7.22e-03

Pos logit contributions
 managed+0.021
'd+0.020
 could+0.020
 stayed+0.020
 wouldn+0.019
Neg logit contributions
igsaw-0.017
 festivals-0.016
 batch-0.016
ummy-0.016
 Of-0.016
Token,
Token ID11
Feature activation-1.56e-01

Pos logit contributions
 managed+0.014
'd+0.014
ched+0.013
pped+0.013
 could+0.013
Neg logit contributions
igsaw-0.011
 festivals-0.011
 batch-0.011
ummy-0.011
 grand-0.011
Token where
Token ID810
Feature activation-2.40e-02

Pos logit contributions
 managed+0.141
'd+0.132
ched+0.130
pped+0.130
 stayed+0.120
Neg logit contributions
ummy-0.108
igsaw-0.108
 Grand-0.108
 festivals-0.107
 batch-0.107
Token he
Token ID339
Feature activation+1.90e-01

Pos logit contributions
 Grand+0.038
 grand+0.037
 festivals+0.037
ummy+0.037
 Aunt+0.037
Neg logit contributions
 managed-0.049
'd-0.046
pped-0.046
ched-0.046
cher-0.042
Token belonged
Token ID19611
Feature activation+9.63e-02

Pos logit contributions
cher+0.310
pped+0.299
ier+0.294
amed+0.292
bridge+0.279
Neg logit contributions
 grand-0.366
 Grand-0.346
 T-0.322
 Aunt-0.321
 Of-0.316
Token in
Token ID287
Feature activation-1.53e-01

Pos logit contributions
 managed+0.224
ched+0.210
'd+0.210
pped+0.209
cher+0.201
Neg logit contributions
ummy-0.137
 grand-0.124
 festivals-0.123
 Grand-0.122
igsaw-0.119
Token the
Token ID262
Feature activation-1.80e-01

Pos logit contributions
ummy+0.230
 batch+0.226
 Of+0.223
 Aunt+0.218
igsaw+0.217
Neg logit contributions
 managed-0.322
pped-0.296
ched-0.291
'd-0.289
 could-0.276
Token big
Token ID1263
Feature activation-2.59e-01

Pos logit contributions
ummy+0.167
 Grand+0.167
 festivals+0.166
 grand+0.165
igsaw+0.164
Neg logit contributions
 managed-0.486
'd-0.460
ched-0.456
pped-0.455
cher-0.429
Token ocean
Token ID9151
Feature activation-7.08e-02

Pos logit contributions
 Aunt+0.313
igsaw+0.313
 Grand+0.312
 batch+0.309
 Of+0.309
Neg logit contributions
 managed-0.621
'd-0.581
pped-0.566
ched-0.565
 stayed-0.541
Token.
Token ID13
Feature activation-6.30e-02

Pos logit contributions
 grand+0.108
 Grand+0.107
 festivals+0.105
ummy+0.104
 batch+0.101
Neg logit contributions
 managed-0.149
pped-0.145
ched-0.142
'd-0.137
cher-0.129
Token The
Token ID383
Feature activation-1.22e-01

Pos logit contributions
igsaw+0.092
 festivals+0.092
ummy+0.092
 batch+0.089
 sunrise+0.089
Neg logit contributions
 managed-0.133
 could-0.127
'd-0.118
ched-0.117
 found-0.114
Token fish
Token ID5916
Feature activation+1.71e-02

Pos logit contributions
 Grand+0.140
ummy+0.139
igsaw+0.138
 Aunt+0.138
 festivals+0.138
Neg logit contributions
 managed-0.303
'd-0.286
ched-0.282
pped-0.281
 stayed-0.265
Token listened
Token ID16399
Feature activation-1.43e-02

Pos logit contributions
 managed+0.026
pped+0.026
ched+0.024
'd+0.024
cher+0.023
Neg logit contributions
 grand-0.034
 batch-0.033
ummy-0.033
 Aunt-0.033
 Grand-0.033
Token it
Token ID340
Feature activation+1.19e-01

Pos logit contributions
ummy+0.215
igsaw+0.215
 festivals+0.212
 Of+0.211
 batch+0.209
Neg logit contributions
 managed-0.282
'd-0.260
pped-0.258
ched-0.254
 stayed-0.241
Token and
Token ID290
Feature activation-9.53e-02

Pos logit contributions
 managed+0.233
'd+0.216
pped+0.216
ched+0.215
cher+0.198
Neg logit contributions
ummy-0.198
 Grand-0.198
 grand-0.196
 batch-0.195
 Aunt-0.194
Token posted
Token ID4481
Feature activation-2.13e-02

Pos logit contributions
 Grand+0.195
 grand+0.192
 Aunt+0.187
 batch+0.186
 festivals+0.180
Neg logit contributions
 managed-0.149
ched-0.147
pped-0.146
'd-0.142
tted-0.136
Token it
Token ID340
Feature activation+1.34e-01

Pos logit contributions
ummy+0.036
 festivals+0.036
igsaw+0.036
 Grand+0.036
 grand+0.036
Neg logit contributions
 managed-0.040
'd-0.037
ched-0.037
pped-0.037
 wouldn-0.034
Token on
Token ID319
Feature activation-1.17e-01

Pos logit contributions
 managed+0.252
'd+0.237
pped+0.237
ched+0.227
cher+0.213
Neg logit contributions
 Grand-0.232
 grand-0.227
 Aunt-0.225
ummy-0.225
igsaw-0.224
Token social
Token ID1919
Feature activation-2.59e-01

Pos logit contributions
 Of+0.188
ummy+0.183
 festivals+0.183
igsaw+0.182
 batch+0.178
Neg logit contributions
 managed-0.228
pped-0.214
'd-0.206
ched-0.203
 could-0.194
Token media
Token ID2056
Feature activation-8.15e-02

Pos logit contributions
 Aunt+0.328
 Grand+0.327
 Of+0.324
ummy+0.324
igsaw+0.321
Neg logit contributions
 managed-0.601
ched-0.566
pped-0.564
'd-0.559
 stayed-0.519
Token.
Token ID13
Feature activation-5.71e-02

Pos logit contributions
 Grand+0.139
ummy+0.139
 festivals+0.138
igsaw+0.138
 grand+0.138
Neg logit contributions
 managed-0.154
'd-0.143
ched-0.141
pped-0.141
cher-0.129
Token\n
Token ID198
Feature activation-4.45e-02

Pos logit contributions
 festivals+0.095
 perspective+0.092
igsaw+0.091
 grand+0.090
 sunrise+0.089
Neg logit contributions
 managed-0.106
 could-0.103
ched-0.103
'd-0.094
pped-0.093
Token\n
Token ID198
Feature activation-5.20e-02

Pos logit contributions
igsaw+0.075
ummy+0.073
 festivals+0.073
 grand+0.071
 batch+0.071
Neg logit contributions
 managed-0.084
ched-0.083
pped-0.079
'd-0.078
 could-0.077
TokenL
Token ID43
Feature activation-7.06e-03

Pos logit contributions
 Grand+0.070
ummy+0.070
 festivals+0.069
 Aunt+0.069
 grand+0.069
Neg logit contributions
 managed-0.119
'd-0.112
ched-0.111
pped-0.111
 stayed-0.102
Token try
Token ID1949
Feature activation+1.10e-01

Pos logit contributions
 Grand+0.130
 grand+0.129
 Aunt+0.126
 festivals+0.126
 Of+0.124
Neg logit contributions
 managed-0.165
'd-0.156
pped-0.155
ched-0.153
cher-0.142
Token to
Token ID284
Feature activation-4.56e-03

Pos logit contributions
 managed+0.194
'd+0.178
ched+0.176
pped+0.170
 could+0.161
Neg logit contributions
igsaw-0.208
ummy-0.203
 batch-0.201
 Aunt-0.199
 Grand-0.199
Token grab
Token ID5552
Feature activation-1.46e-02

Pos logit contributions
 grand+0.007
 festivals+0.007
 Grand+0.007
 Aunt+0.007
igsaw+0.007
Neg logit contributions
 managed-0.009
'd-0.008
ched-0.008
pped-0.008
 wouldn-0.008
Token the
Token ID262
Feature activation-1.18e-01

Pos logit contributions
ummy+0.023
igsaw+0.023
 festivals+0.022
 batch+0.022
 Aunt+0.022
Neg logit contributions
 managed-0.031
ched-0.029
pped-0.028
'd-0.027
 stayed-0.026
Token av
Token ID1196
Feature activation-7.95e-02

Pos logit contributions
ummy+0.129
 Grand+0.126
 grand+0.126
igsaw+0.126
 batch+0.125
Neg logit contributions
 managed-0.298
'd-0.281
ched-0.279
pped-0.279
cher-0.261
Tokenoc
Token ID420
Feature activation-2.59e-01

Pos logit contributions
 Grand+0.082
 festivals+0.081
 Aunt+0.081
ummy+0.080
 grand+0.080
Neg logit contributions
 managed-0.206
'd-0.194
ched-0.193
pped-0.192
cher-0.181
Tokenados
Token ID22484
Feature activation+3.16e-02

Pos logit contributions
 festivals+0.316
 Grand+0.314
 grand+0.312
 Aunt+0.311
igsaw+0.300
Neg logit contributions
 managed-0.617
pped-0.578
ched-0.578
'd-0.572
cher-0.538
Token.
Token ID13
Feature activation-4.82e-02

Pos logit contributions
 managed+0.064
'd+0.060
pped+0.059
ched+0.059
cher+0.054
Neg logit contributions
ummy-0.050
 Grand-0.050
 grand-0.050
 festivals-0.049
igsaw-0.049
Token But
Token ID887
Feature activation+1.40e-02

Pos logit contributions
igsaw+0.091
ummy+0.080
 batch+0.079
 sorts+0.079
 festivals+0.077
Neg logit contributions
 managed-0.086
 could-0.085
ched-0.081
tted-0.076
'd-0.074
Token the
Token ID262
Feature activation-6.81e-02

Pos logit contributions
 managed+0.033
'd+0.031
ched+0.031
pped+0.031
cher+0.029
Neg logit contributions
 Grand-0.018
 Aunt-0.018
 grand-0.017
 festivals-0.017
ummy-0.017
Token rock
Token ID3881
Feature activation+4.35e-02

Pos logit contributions
ummy+0.073
 grand+0.072
 Grand+0.070
 batch+0.070
 festivals+0.069
Neg logit contributions
 managed-0.177
ched-0.170
pped-0.169
'd-0.164
cher-0.157
Token had
Token ID550
Feature activation+1.54e-01

Pos logit contributions
cher+0.295
pped+0.287
 managed+0.285
ier+0.277
ched+0.270
Neg logit contributions
 grand-0.361
 Grand-0.349
 Aunt-0.324
 Wednesday-0.323
 Sunday-0.322
Token a
Token ID257
Feature activation-9.13e-02

Pos logit contributions
 managed+0.314
'd+0.292
ched+0.289
pped+0.288
cher+0.266
Neg logit contributions
 Grand-0.243
ummy-0.241
 festivals-0.241
 grand-0.240
 Aunt-0.240
Token toy
Token ID13373
Feature activation-5.59e-02

Pos logit contributions
 festivals+0.126
 Of+0.125
ummy+0.125
igsaw+0.124
 Gig+0.122
Neg logit contributions
'd-0.189
 managed-0.187
 remembered-0.173
 could-0.169
 found-0.168
Token microphone
Token ID21822
Feature activation+5.92e-03

Pos logit contributions
igsaw+0.091
 festivals+0.089
 batch+0.086
ummy+0.086
 Grand+0.085
Neg logit contributions
 managed-0.110
'd-0.105
ched-0.101
pped-0.101
 could-0.097
Token that
Token ID326
Feature activation-6.45e-02

Pos logit contributions
 managed+0.013
pped+0.013
'd+0.012
ched+0.012
cher+0.012
Neg logit contributions
 Grand-0.008
 grand-0.008
ummy-0.008
 Aunt-0.008
 batch-0.008
Token she
Token ID673
Feature activation+1.89e-01

Pos logit contributions
 Grand+0.119
 grand+0.115
 Aunt+0.115
 Of+0.109
 festivals+0.109
Neg logit contributions
 managed-0.119
pped-0.117
ched-0.110
'd-0.110
cher-0.103
Token used
Token ID973
Feature activation-3.08e-03

Pos logit contributions
pped+0.357
ier+0.339
 managed+0.337
iece+0.324
ched+0.318
Neg logit contributions
 grand-0.328
 Grand-0.308
 Aunt-0.267
 jazz-0.265
 common-0.257
Token to
Token ID284
Feature activation-3.98e-02

Pos logit contributions
 Grand+0.004
 grand+0.004
ummy+0.004
 Aunt+0.004
 festivals+0.004
Neg logit contributions
 managed-0.007
pped-0.006
'd-0.006
ched-0.006
cher-0.006
Token sing
Token ID1702
Feature activation+2.06e-02

Pos logit contributions
 Grand+0.061
 grand+0.061
 festivals+0.061
 Aunt+0.060
ummy+0.060
Neg logit contributions
 managed-0.083
'd-0.077
pped-0.076
ched-0.076
cher-0.070
Token into
Token ID656
Feature activation-1.12e-01

Pos logit contributions
 managed+0.040
'd+0.037
pped+0.036
ched+0.036
cher+0.033
Neg logit contributions
 Grand-0.035
 grand-0.035
 Aunt-0.035
 festivals-0.034
ummy-0.034
Token.
Token ID13
Feature activation-5.57e-02

Pos logit contributions
ummy+0.176
igsaw+0.174
 Of+0.172
 batch+0.171
 festivals+0.168
Neg logit contributions
 managed-0.221
pped-0.205
'd-0.205
ched-0.203
 could-0.193
Token felt
Token ID2936
Feature activation+1.10e-01

Pos logit contributions
pped+0.278
ier+0.268
ched+0.258
cher+0.255
tted+0.243
Neg logit contributions
 grand-0.427
 Grand-0.417
 Aunt-0.390
 batch-0.379
 sorts-0.367
Token proud
Token ID6613
Feature activation-1.04e-01

Pos logit contributions
cher+0.233
'd+0.226
 managed+0.223
pped+0.219
oscope+0.204
Neg logit contributions
 Grand-0.150
 grand-0.145
 summer-0.143
 festivals-0.143
 sunset-0.139
Token of
Token ID286
Feature activation-9.43e-02

Pos logit contributions
ummy+0.159
 Grand+0.156
 grand+0.154
 Aunt+0.152
 Of+0.151
Neg logit contributions
 managed-0.223
pped-0.207
'd-0.207
ched-0.206
cher-0.195
Token what
Token ID644
Feature activation+1.72e-02

Pos logit contributions
 Grand+0.186
 festivals+0.183
 Aunt+0.183
ummy+0.182
 grand+0.181
Neg logit contributions
 managed-0.160
'd-0.149
ched-0.146
pped-0.141
cher-0.135
Token she
Token ID673
Feature activation+1.75e-01

Pos logit contributions
 managed+0.035
pped+0.034
ched+0.034
'd+0.032
cher+0.031
Neg logit contributions
 Grand-0.028
 Aunt-0.028
 grand-0.028
 festivals-0.027
 Of-0.027
Token had
Token ID550
Feature activation+1.41e-01

Pos logit contributions
ier+0.251
iers+0.249
ched+0.248
aces+0.246
tted+0.231
Neg logit contributions
 grand-0.344
 Grand-0.332
 sunrise-0.305
 sunset-0.304
 sorts-0.299
Token done
Token ID1760
Feature activation+4.85e-02

Pos logit contributions
paste+0.170
cher+0.170
bo+0.155
oscope+0.152
ched+0.152
Neg logit contributions
 grand-0.313
 Grand-0.299
 Sunday-0.292
 festivals-0.292
 Wednesday-0.291
Token.
Token ID13
Feature activation-6.29e-02

Pos logit contributions
 managed+0.105
'd+0.100
pped+0.097
ched+0.096
cher+0.093
Neg logit contributions
 Grand-0.071
 festivals-0.070
ummy-0.069
 grand-0.069
igsaw-0.068
Token When
Token ID1649
Feature activation-6.28e-03

Pos logit contributions
 festivals+0.095
 batch+0.095
 grand+0.094
igsaw+0.093
ummy+0.093
Neg logit contributions
 managed-0.132
'd-0.123
ched-0.121
pped-0.120
 could-0.113
Token she
Token ID673
Feature activation+1.97e-01

Pos logit contributions
igsaw+0.011
ummy+0.011
 festivals+0.011
 batch+0.011
 grand+0.011
Neg logit contributions
 managed-0.011
'd-0.010
pped-0.010
ched-0.010
 stayed-0.009
Token went
Token ID1816
Feature activation+7.02e-02

Pos logit contributions
cher+0.281
aces+0.267
ched+0.267
ier+0.265
iers+0.258
Neg logit contributions
 Grand-0.417
 grand-0.416
 Aunt-0.379
 batch-0.375
 M-0.370
Token buckle
Token ID44983
Feature activation+1.99e-02

Pos logit contributions
 Grand+0.228
 grand+0.227
ummy+0.225
 Aunt+0.224
 batch+0.223
Neg logit contributions
 managed-0.312
'd-0.288
ched-0.286
pped-0.284
cher-0.265
Token.
Token ID13
Feature activation-5.57e-02

Pos logit contributions
 managed+0.039
pped+0.034
ched+0.034
'd+0.034
 remembered+0.032
Neg logit contributions
 Grand-0.034
 grand-0.034
ummy-0.032
 Aunt-0.032
 batch-0.032
Token Her
Token ID2332
Feature activation+4.83e-02

Pos logit contributions
igsaw+0.094
 festivals+0.091
 grand+0.090
 batch+0.088
 July+0.088
Neg logit contributions
 could-0.098
 managed-0.095
'd-0.093
ched-0.092
 found-0.087
Token mom
Token ID1995
Feature activation+6.16e-02

Pos logit contributions
 managed+0.118
ched+0.113
pped+0.111
'd+0.109
cher+0.105
Neg logit contributions
 grand-0.066
 Grand-0.065
 Aunt-0.062
ummy-0.057
 Of-0.056
Tokenmy
Token ID1820
Feature activation+1.20e-01

Pos logit contributions
 managed+0.107
ched+0.106
pped+0.103
tted+0.102
cher+0.100
Neg logit contributions
 Grand-0.116
 grand-0.114
 Aunt-0.112
 batch-0.110
ummy-0.106
Token had
Token ID550
Feature activation+1.53e-01

Pos logit contributions
 managed+0.193
pped+0.187
ched+0.187
tted+0.178
cher+0.174
Neg logit contributions
 grand-0.240
 Grand-0.238
 Aunt-0.233
 batch-0.225
ummy-0.218
Token bought
Token ID5839
Feature activation+1.40e-02

Pos logit contributions
 managed+0.306
ched+0.297
'd+0.288
pped+0.285
cher+0.274
Neg logit contributions
 grand-0.263
 Grand-0.262
 Aunt-0.241
 batch-0.235
 festivals-0.231
Token it
Token ID340
Feature activation+1.43e-01

Pos logit contributions
 managed+0.030
'd+0.028
pped+0.027
ched+0.027
cher+0.026
Neg logit contributions
 Grand-0.022
 grand-0.021
ummy-0.021
 Aunt-0.021
 batch-0.021
Token for
Token ID329
Feature activation-2.07e-02

Pos logit contributions
 managed+0.280
'd+0.259
ched+0.256
pped+0.255
cher+0.234
Neg logit contributions
 Grand-0.239
ummy-0.239
 festivals-0.238
igsaw-0.237
 grand-0.237
Token her
Token ID607
Feature activation+4.38e-02

Pos logit contributions
 Grand+0.031
ummy+0.031
 grand+0.031
 festivals+0.030
 Aunt+0.030
Neg logit contributions
 managed-0.045
'd-0.042
ched-0.041
pped-0.041
cher-0.038
Token at
Token ID379
Feature activation-1.85e-01

Pos logit contributions
 managed+0.095
pped+0.091
ched+0.090
'd+0.085
cher+0.082
Neg logit contributions
 grand-0.069
 Grand-0.069
 Aunt-0.066
ummy-0.065
 Of-0.065
Token ran
Token ID4966
Feature activation+7.80e-02

Pos logit contributions
cher+0.241
ched+0.238
ier+0.237
 managed+0.225
pped+0.224
Neg logit contributions
 grand-0.408
 Grand-0.399
 Aunt-0.382
 Gig-0.371
 batch-0.368
Token to
Token ID284
Feature activation-3.50e-02

Pos logit contributions
 managed+0.156
'd+0.148
pped+0.145
ched+0.143
cher+0.135
Neg logit contributions
 Grand-0.127
 festivals-0.127
 Aunt-0.127
 grand-0.126
 Of-0.123
Token the
Token ID262
Feature activation-1.16e-01

Pos logit contributions
ummy+0.071
igsaw+0.070
 batch+0.064
 Of+0.060
utes+0.060
Neg logit contributions
 managed-0.059
 back-0.054
 found-0.053
pped-0.051
 stayed-0.050
Token balloon
Token ID21190
Feature activation-1.53e-02

Pos logit contributions
ummy+0.132
igsaw+0.132
 Grand+0.131
 festivals+0.130
 Aunt+0.129
Neg logit contributions
 managed-0.285
'd-0.269
pped-0.266
ched-0.266
 stayed-0.250
Token.
Token ID13
Feature activation-5.42e-02

Pos logit contributions
igsaw+0.029
ummy+0.029
 festivals+0.029
 Grand+0.028
 Aunt+0.028
Neg logit contributions
 managed-0.026
'd-0.024
ched-0.024
pped-0.023
 stayed-0.022
Token It
Token ID632
Feature activation+1.89e-01

Pos logit contributions
 perspective+0.100
igsaw+0.100
ummy+0.098
 batch+0.097
 festivals+0.092
Neg logit contributions
 could-0.092
 managed-0.089
ched-0.081
tted-0.079
 found-0.075
Token was
Token ID373
Feature activation+1.38e-01

Pos logit contributions
pped+0.278
ier+0.261
cher+0.261
 managed+0.259
ched+0.242
Neg logit contributions
 Grand-0.385
 batch-0.383
 grand-0.382
 Aunt-0.375
 chip-0.362
Token tied
Token ID8165
Feature activation+2.00e-02

Pos logit contributions
 managed+0.333
'd+0.313
ched+0.310
pped+0.309
 stayed+0.289
Neg logit contributions
ummy-0.167
 Grand-0.167
 festivals-0.166
igsaw-0.166
 grand-0.165
Token to
Token ID284
Feature activation-4.41e-02

Pos logit contributions
 managed+0.042
'd+0.039
pped+0.039
ched+0.038
cher+0.036
Neg logit contributions
 Grand-0.031
 grand-0.031
ummy-0.030
 Aunt-0.029
 Of-0.029
Token a
Token ID257
Feature activation-8.82e-02

Pos logit contributions
igsaw+0.075
ummy+0.075
 festivals+0.073
 batch+0.071
 Of+0.071
Neg logit contributions
 managed-0.087
'd-0.080
pped-0.078
ched-0.078
 stayed-0.074
Token pole
Token ID16825
Feature activation-1.46e-02

Pos logit contributions
ummy+0.116
igsaw+0.115
 festivals+0.115
 Aunt+0.113
 Of+0.113
Neg logit contributions
 managed-0.203
'd-0.194
ched-0.186
pped-0.184
 could-0.176
Token at
Token ID379
Feature activation-1.60e-01

Pos logit contributions
 managed+0.156
'd+0.145
pped+0.143
ched+0.142
cher+0.131
Neg logit contributions
 Grand-0.146
ummy-0.145
 grand-0.145
 Aunt-0.144
 festivals-0.144
Token Tom
Token ID4186
Feature activation+9.08e-02

Pos logit contributions
igsaw+0.272
ummy+0.264
 festivals+0.261
 batch+0.260
 Of+0.257
Neg logit contributions
 managed-0.314
pped-0.290
'd-0.288
ched-0.283
 stayed-0.269
Token and
Token ID290
Feature activation-9.15e-02

Pos logit contributions
 managed+0.201
'd+0.185
pped+0.180
ched+0.178
cher+0.174
Neg logit contributions
 Grand-0.131
ummy-0.131
 grand-0.128
 batch-0.127
 festivals-0.124
Token Sue
Token ID26089
Feature activation+1.17e-01

Pos logit contributions
ummy+0.123
igsaw+0.122
 festivals+0.115
 batch+0.109
utes+0.105
Neg logit contributions
 managed-0.220
'd-0.202
 stayed-0.198
ched-0.196
 could-0.193
Token.
Token ID13
Feature activation-5.92e-02

Pos logit contributions
 managed+0.238
'd+0.221
pped+0.221
ched+0.217
cher+0.209
Neg logit contributions
 Grand-0.185
 Of-0.179
 festivals-0.179
 grand-0.179
 Aunt-0.178
Token He
Token ID679
Feature activation+1.77e-01

Pos logit contributions
igsaw+0.120
ummy+0.117
 batch+0.114
 festivals+0.113
 perspective+0.108
Neg logit contributions
 managed-0.099
 could-0.089
ched-0.087
'd-0.087
tted-0.083
Token nods
Token ID34833
Feature activation+2.80e-02

Pos logit contributions
 managed+0.348
pped+0.328
ched+0.320
'd+0.316
cher+0.303
Neg logit contributions
 Grand-0.298
 grand-0.296
 Aunt-0.291
 batch-0.286
 festivals-0.284
Token his
Token ID465
Feature activation+5.34e-02

Pos logit contributions
 managed+0.063
pped+0.059
ched+0.059
'd+0.058
cher+0.056
Neg logit contributions
 Grand-0.040
 grand-0.040
 Aunt-0.038
 Sunday-0.038
 Of-0.037
Token head
Token ID1182
Feature activation+4.62e-02

Pos logit contributions
 managed+0.086
pped+0.079
'd+0.077
ched+0.077
cher+0.069
Neg logit contributions
 grand-0.110
 Grand-0.110
 Aunt-0.109
ummy-0.108
 Of-0.106
Token.
Token ID13
Feature activation-5.50e-02

Pos logit contributions
 managed+0.104
'd+0.096
pped+0.095
ched+0.091
cher+0.091
Neg logit contributions
 Grand-0.064
 grand-0.063
 Aunt-0.060
 Of-0.060
ummy-0.058
Token He
Token ID679
Feature activation+1.79e-01

Pos logit contributions
igsaw+0.112
 batch+0.108
ummy+0.107
 festivals+0.104
 perspective+0.101
Neg logit contributions
 managed-0.091
 could-0.084
'd-0.082
ched-0.082
pped-0.077
Token that
Token ID326
Feature activation-6.16e-02

Pos logit contributions
igsaw+0.067
 festivals+0.067
ummy+0.066
 Grand+0.066
 Aunt+0.065
Neg logit contributions
 managed-0.085
'd-0.079
ched-0.078
pped-0.076
 stayed-0.073
Token was
Token ID373
Feature activation+1.22e-01

Pos logit contributions
 Grand+0.135
 grand+0.134
 Aunt+0.132
 festivals+0.131
 Of+0.131
Neg logit contributions
pped-0.085
 managed-0.084
ched-0.078
'd-0.077
cher-0.072
Token much
Token ID881
Feature activation-5.43e-02

Pos logit contributions
 managed+0.222
'd+0.217
pped+0.214
ched+0.213
ave+0.200
Neg logit contributions
 grand-0.207
 Grand-0.207
 batch-0.198
 Aunt-0.197
 festivals-0.191
Token smaller
Token ID4833
Feature activation-9.64e-02

Pos logit contributions
 grand+0.099
 batch+0.098
 Grand+0.097
igsaw+0.095
 Aunt+0.094
Neg logit contributions
 managed-0.095
pped-0.093
'd-0.092
ched-0.090
cher-0.083
Token.
Token ID13
Feature activation-6.26e-02

Pos logit contributions
 grand+0.137
 Grand+0.132
ummy+0.130
 batch+0.126
 Of+0.122
Neg logit contributions
 managed-0.216
pped-0.208
'd-0.205
ched-0.202
cher-0.189
Token But
Token ID887
Feature activation-2.63e-03

Pos logit contributions
igsaw+0.086
ummy+0.085
 festivals+0.084
 batch+0.080
 perspective+0.078
Neg logit contributions
 managed-0.142
 could-0.132
ched-0.129
'd-0.127
 wouldn-0.125
Token the
Token ID262
Feature activation-8.78e-02

Pos logit contributions
igsaw+0.004
ummy+0.004
 festivals+0.004
 batch+0.004
 Grand+0.004
Neg logit contributions
 managed-0.006
'd-0.005
ched-0.005
pped-0.005
 stayed-0.005
Token kitten
Token ID42143
Feature activation+3.84e-02

Pos logit contributions
 grand+0.103
ummy+0.100
 batch+0.099
 Grand+0.099
 festivals+0.098
Neg logit contributions
 managed-0.218
pped-0.206
ched-0.206
'd-0.205
cher-0.191
Token was
Token ID373
Feature activation+1.36e-01

Pos logit contributions
pped+0.056
cher+0.049
ier+0.047
tted+0.046
ched+0.046
Neg logit contributions
 grand-0.083
 Aunt-0.079
 batch-0.079
 Grand-0.078
 Uncle-0.075
Token still
Token ID991
Feature activation+4.29e-02

Pos logit contributions
'd+0.244
pped+0.244
ched+0.238
 managed+0.232
cher+0.230
Neg logit contributions
 grand-0.222
 Grand-0.219
 batch-0.214
 Aunt-0.213
 festivals-0.211
Token sad
Token ID6507
Feature activation-6.06e-02

Pos logit contributions
 managed+0.078
'd+0.076
pped+0.075
ched+0.074
cher+0.068
Neg logit contributions
 Grand-0.076
 grand-0.076
 Aunt-0.074
 batch-0.074
 festivals-0.074
Token it
Token ID340
Feature activation+1.85e-01

Pos logit contributions
 managed+0.086
'd+0.084
pped+0.084
ched+0.083
ave+0.080
Neg logit contributions
 Grand-0.060
 grand-0.054
 Aunt-0.053
 Of-0.050
 Wednesday-0.048
Token was
Token ID373
Feature activation+1.49e-01

Pos logit contributions
cher+0.322
pped+0.314
ier+0.285
iers+0.284
pping+0.279
Neg logit contributions
 Grand-0.299
 Aunt-0.295
 chip-0.294
 grand-0.289
 batch-0.282
Token too
Token ID1165
Feature activation+8.31e-02

Pos logit contributions
 managed+0.226
'd+0.220
ched+0.219
pped+0.209
tted+0.205
Neg logit contributions
 Grand-0.310
 grand-0.307
 Aunt-0.293
ummy-0.282
 batch-0.282
Token late
Token ID2739
Feature activation-4.97e-02

Pos logit contributions
 managed+0.160
ched+0.143
pped+0.137
'd+0.135
 could+0.134
Neg logit contributions
 festivals-0.159
 Of-0.158
 Aunt-0.149
igsaw-0.148
 batch-0.148
Token.
Token ID13
Feature activation-5.50e-02

Pos logit contributions
 Grand+0.078
 grand+0.076
ummy+0.076
 Aunt+0.073
 Of+0.072
Neg logit contributions
 managed-0.107
'd-0.098
pped-0.098
ched-0.097
cher-0.090
Token Sara
Token ID24799
Feature activation+1.19e-01

Pos logit contributions
 batch+0.092
 festivals+0.091
 perspective+0.090
igsaw+0.089
 bou+0.088
Neg logit contributions
 managed-0.102
 could-0.097
ched-0.094
pped-0.091
'd-0.087
Token started
Token ID2067
Feature activation+8.13e-02

Pos logit contributions
cher+0.178
ier+0.168
wn+0.161
bridge+0.160
'd+0.159
Neg logit contributions
 grand-0.218
 Grand-0.214
 chip-0.188
 T-0.185
 batch-0.184
Token to
Token ID284
Feature activation-2.95e-02

Pos logit contributions
 managed+0.150
ched+0.136
pped+0.134
'd+0.133
 wouldn+0.124
Neg logit contributions
igsaw-0.148
ummy-0.148
 festivals-0.146
 Aunt-0.143
 batch-0.143
Token cry
Token ID3960
Feature activation+4.00e-02

Pos logit contributions
ummy+0.060
igsaw+0.059
 Of+0.059
 batch+0.058
 festivals+0.057
Neg logit contributions
 managed-0.049
'd-0.044
ched-0.044
pped-0.043
 stayed-0.041
Token.
Token ID13
Feature activation-5.25e-02

Pos logit contributions
 managed+0.080
'd+0.075
pped+0.073
ched+0.073
cher+0.068
Neg logit contributions
 Grand-0.067
ummy-0.067
 grand-0.066
 Aunt-0.066
 festivals-0.063
Token She
Token ID1375
Feature activation+1.78e-01

Pos logit contributions
 batch+0.117
igsaw+0.114
 bou+0.110
 sorts+0.107
 festivals+0.106
Neg logit contributions
 could-0.071
 managed-0.069
ched-0.061
pped-0.060
 missed-0.059
Token.
Token ID13
Feature activation-5.77e-02

Pos logit contributions
 Grand+0.133
ummy+0.133
 grand+0.132
 Aunt+0.132
 festivals+0.131
Neg logit contributions
 managed-0.143
'd-0.133
ched-0.130
pped-0.130
cher-0.120
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
ummy+0.087
 festivals+0.087
 batch+0.086
igsaw+0.086
 Grand+0.085
Neg logit contributions
 managed-0.122
'd-0.113
ched-0.112
pped-0.111
 could-0.103
Token saw
Token ID2497
Feature activation+7.61e-03

Pos logit contributions
ier+0.277
pped+0.275
 managed+0.269
cher+0.268
ched+0.266
Neg logit contributions
 grand-0.356
 Grand-0.348
 Aunt-0.332
 Uncle-0.304
 batch-0.303
Token Lily
Token ID20037
Feature activation+1.11e-01

Pos logit contributions
 managed+0.017
ched+0.016
pped+0.016
'd+0.015
 could+0.015
Neg logit contributions
ummy-0.011
 festivals-0.011
igsaw-0.010
 batch-0.010
 Of-0.010
Token and
Token ID290
Feature activation-8.79e-02

Pos logit contributions
 managed+0.197
'd+0.188
ched+0.176
pped+0.172
cher+0.167
Neg logit contributions
 Grand-0.200
 grand-0.197
 batch-0.197
 Aunt-0.196
ummy-0.194
Token said
Token ID531
Feature activation+9.90e-02

Pos logit contributions
 Grand+0.103
 grand+0.101
 Aunt+0.099
 Uncle+0.087
 batch+0.087
Neg logit contributions
 managed-0.222
'd-0.216
ched-0.215
pped-0.211
tted-0.206
Token,
Token ID11
Feature activation-9.44e-02

Pos logit contributions
 managed+0.223
'd+0.209
pped+0.207
ched+0.206
cher+0.192
Neg logit contributions
 Grand-0.135
ummy-0.135
 grand-0.134
 festivals-0.134
 Aunt-0.133
Token "
Token ID366
Feature activation+3.65e-02

Pos logit contributions
 Grand+0.116
 grand+0.114
 Aunt+0.114
ummy+0.113
 festivals+0.112
Neg logit contributions
 managed-0.227
'd-0.213
ched-0.211
pped-0.210
cher-0.197
TokenSh
Token ID2484
Feature activation-3.29e-02

Pos logit contributions
 managed+0.073
'd+0.067
ched+0.066
pped+0.066
cher+0.061
Neg logit contributions
 Grand-0.060
ummy-0.060
 festivals-0.060
igsaw-0.059
 grand-0.059
Tokenoo
Token ID2238
Feature activation+6.02e-03

Pos logit contributions
 festivals+0.064
 grand+0.064
 batch+0.063
 Grand+0.063
 Aunt+0.062
Neg logit contributions
 managed-0.054
pped-0.051
ched-0.050
'd-0.050
cher-0.045
Token,
Token ID11
Feature activation-1.06e-01

Pos logit contributions
 managed+0.012
'd+0.011
ched+0.011
pped+0.011
cher+0.010
Neg logit contributions
 festivals-0.010
ummy-0.009
 Grand-0.009
igsaw-0.009
 grand-0.009
Token nose
Token ID9686
Feature activation-3.55e-02

Pos logit contributions
 managed+0.015
'd+0.014
pped+0.014
ched+0.014
 wouldn+0.013
Neg logit contributions
 grand-0.010
 Grand-0.010
 Aunt-0.010
ummy-0.010
 batch-0.010
Token.
Token ID13
Feature activation-5.60e-02

Pos logit contributions
 Grand+0.064
ummy+0.063
 grand+0.063
 festivals+0.063
 Aunt+0.063
Neg logit contributions
 managed-0.065
'd-0.059
ched-0.058
pped-0.058
cher-0.053
Token She
Token ID1375
Feature activation+1.76e-01

Pos logit contributions
igsaw+0.096
 batch+0.094
ummy+0.091
 festivals+0.091
 perspective+0.086
Neg logit contributions
 managed-0.105
ched-0.098
pped-0.098
'd-0.097
 could-0.097
Token saw
Token ID2497
Feature activation+1.12e-02

Pos logit contributions
pped+0.267
 managed+0.258
ier+0.252
ched+0.251
cher+0.243
Neg logit contributions
 grand-0.375
 Grand-0.368
 Aunt-0.364
 Gig-0.339
 Uncle-0.332
Token a
Token ID257
Feature activation-9.06e-02

Pos logit contributions
 managed+0.022
ched+0.021
pped+0.021
'd+0.020
 could+0.020
Neg logit contributions
igsaw-0.018
ummy-0.018
 festivals-0.017
 batch-0.017
 Of-0.017
Token bird
Token ID6512
Feature activation+2.34e-02

Pos logit contributions
igsaw+0.125
ummy+0.125
 festivals+0.124
 Of+0.123
 Grand+0.120
Neg logit contributions
 managed-0.203
'd-0.195
ched-0.181
pped-0.179
 stayed-0.175
Token who
Token ID508
Feature activation-5.61e-03

Pos logit contributions
 managed+0.050
'd+0.046
ched+0.045
pped+0.045
cher+0.042
Neg logit contributions
 batch-0.035
 Aunt-0.034
ummy-0.034
 festivals-0.034
 Grand-0.034
Token was
Token ID373
Feature activation+1.15e-01

Pos logit contributions
 grand+0.013
 Grand+0.013
 Aunt+0.013
 festivals+0.013
 batch+0.013
Neg logit contributions
 managed-0.007
pped-0.006
ched-0.006
'd-0.006
cher-0.006
Token fast
Token ID3049
Feature activation-9.84e-02

Pos logit contributions
 managed+0.233
'd+0.217
pped+0.214
ched+0.213
cher+0.199
Neg logit contributions
 Grand-0.187
 Aunt-0.186
 grand-0.185
 festivals-0.184
 Gig-0.179
Token and
Token ID290
Feature activation-1.03e-01

Pos logit contributions
 grand+0.157
 Grand+0.154
 festivals+0.150
 Aunt+0.149
ummy+0.146
Neg logit contributions
 managed-0.202
'd-0.190
pped-0.183
ched-0.183
cher-0.174
Token she
Token ID673
Feature activation+1.86e-01

Pos logit contributions
 grand+0.181
 Grand+0.177
 Aunt+0.173
 Gig+0.167
 Of+0.163
Neg logit contributions
pped-0.192
 managed-0.184
ched-0.180
ier-0.176
tted-0.170
Token and
Token ID290
Feature activation-8.06e-02

Pos logit contributions
ummy+0.104
 Grand+0.102
 grand+0.101
 Aunt+0.101
 Of+0.100
Neg logit contributions
 managed-0.141
'd-0.131
pped-0.130
ched-0.129
cher-0.121
Token walked
Token ID6807
Feature activation+6.23e-02

Pos logit contributions
 Grand+0.157
 grand+0.154
 Aunt+0.152
 Gig+0.145
 Of+0.145
Neg logit contributions
pped-0.134
 managed-0.132
ched-0.129
'd-0.127
tted-0.125
Token away
Token ID1497
Feature activation-3.73e-02

Pos logit contributions
 managed+0.089
'd+0.085
pped+0.083
ched+0.079
cher+0.072
Neg logit contributions
 Aunt-0.138
 Grand-0.138
 grand-0.138
 festivals-0.135
 Of-0.133
Token.
Token ID13
Feature activation-4.59e-02

Pos logit contributions
 festivals+0.073
igsaw+0.073
 batch+0.073
ummy+0.073
 Of+0.071
Neg logit contributions
 managed-0.063
ched-0.055
'd-0.055
pped-0.053
 stayed-0.051
Token She
Token ID1375
Feature activation+1.85e-01

Pos logit contributions
igsaw+0.067
 batch+0.067
 festivals+0.067
 fireworks+0.062
 grand+0.062
Neg logit contributions
 managed-0.097
 could-0.090
'd-0.090
ched-0.086
 wouldn-0.085
Token went
Token ID1816
Feature activation+8.81e-02

Pos logit contributions
ier+0.321
pped+0.315
cher+0.305
ched+0.303
tted+0.293
Neg logit contributions
 Grand-0.374
 grand-0.374
 Aunt-0.360
 Uncle-0.329
 Gig-0.316
Token inside
Token ID2641
Feature activation-4.31e-02

Pos logit contributions
 managed+0.168
'd+0.163
pped+0.161
ched+0.158
 wouldn+0.143
Neg logit contributions
 Aunt-0.156
 Grand-0.153
 grand-0.150
 festivals-0.150
 batch-0.146
Token and
Token ID290
Feature activation-7.35e-02

Pos logit contributions
igsaw+0.080
 festivals+0.078
ummy+0.078
 Gig+0.077
 batch+0.077
Neg logit contributions
 managed-0.076
'd-0.068
pped-0.066
ched-0.066
 could-0.064
Token found
Token ID1043
Feature activation+9.10e-02

Pos logit contributions
 Grand+0.160
 grand+0.156
 Aunt+0.155
 festivals+0.146
 Of+0.145
Neg logit contributions
 managed-0.109
pped-0.108
ched-0.106
'd-0.104
ier-0.099
Token her
Token ID607
Feature activation+6.91e-02

Pos logit contributions
 managed+0.180
ched+0.166
'd+0.166
pped+0.165
 stayed+0.152
Neg logit contributions
 festivals-0.149
ummy-0.148
igsaw-0.148
 Grand-0.146
 batch-0.146
Token Mom
Token ID11254
Feature activation+3.53e-02

Pos logit contributions
 managed+0.153
'd+0.143
 stayed+0.136
 wouldn+0.136
ched+0.136
Neg logit contributions
 festivals-0.092
igsaw-0.091
 Of-0.089
ummy-0.089
 batch-0.087
Token bottle
Token ID9294
Feature activation+9.12e-03

Pos logit contributions
 Of+0.255
 festivals+0.228
igsaw+0.228
 disl+0.219
 personalities+0.216
Neg logit contributions
'd-0.260
 managed-0.256
 found-0.250
 had-0.238
 came-0.237
Token.
Token ID13
Feature activation-5.98e-02

Pos logit contributions
 managed+0.014
'd+0.013
ched+0.013
pped+0.013
 could+0.012
Neg logit contributions
igsaw-0.019
 festivals-0.019
 Grand-0.018
 Of-0.018
 grand-0.018
Token It
Token ID632
Feature activation+1.80e-01

Pos logit contributions
 batch+0.128
utes+0.122
 bou+0.122
aman+0.118
 perspective+0.117
Neg logit contributions
 could-0.084
 managed-0.082
'd-0.073
 found-0.072
tted-0.071
Token was
Token ID373
Feature activation+1.30e-01

Pos logit contributions
 managed+0.262
pped+0.250
cher+0.233
ier+0.231
ched+0.230
Neg logit contributions
 Grand-0.399
 grand-0.392
 Aunt-0.387
 batch-0.377
 Of-0.377
Token red
Token ID2266
Feature activation-1.26e-01

Pos logit contributions
 managed+0.239
'd+0.223
pped+0.214
ched+0.214
 stayed+0.206
Neg logit contributions
igsaw-0.236
 festivals-0.229
ummy-0.225
 Of-0.225
 batch-0.221
Token and
Token ID290
Feature activation-1.12e-01

Pos logit contributions
igsaw+0.267
 festivals+0.263
 Of+0.255
 batch+0.251
 July+0.249
Neg logit contributions
 managed-0.178
'd-0.172
pped-0.165
ched-0.161
 could-0.161
Token had
Token ID550
Feature activation+1.46e-01

Pos logit contributions
igsaw+0.176
 festivals+0.175
ummy+0.174
 batch+0.171
 Aunt+0.171
Neg logit contributions
 managed-0.232
'd-0.217
ched-0.211
pped-0.210
 stayed-0.199
Token spark
Token ID9009
Feature activation-5.71e-02

Pos logit contributions
 managed+0.350
'd+0.328
ched+0.326
pped+0.325
cher+0.309
Neg logit contributions
 Grand-0.183
 grand-0.180
ummy-0.178
 Aunt-0.178
 festivals-0.176
Tokenles
Token ID829
Feature activation+1.90e-02

Pos logit contributions
ummy+0.111
 grand+0.108
 Grand+0.107
 batch+0.107
igsaw+0.107
Neg logit contributions
 managed-0.099
'd-0.090
pped-0.084
cher-0.082
ched-0.082
Token all
Token ID477
Feature activation-5.44e-02

Pos logit contributions
 managed+0.036
'd+0.033
ched+0.031
pped+0.031
cher+0.029
Neg logit contributions
ummy-0.033
 Grand-0.032
 grand-0.032
 festivals-0.031
 Aunt-0.030
Token over
Token ID625
Feature activation-8.04e-02

Pos logit contributions
igsaw+0.110
 batch+0.109
 Aunt+0.109
 grand+0.109
 festivals+0.109
Neg logit contributions
 managed-0.086
'd-0.079
ched-0.078
pped-0.076
 could-0.071
Token.
Token ID13
Feature activation-5.40e-02

Pos logit contributions
ummy+0.123
 Grand+0.122
 grand+0.119
 Of+0.119
 Aunt+0.118
Neg logit contributions
 managed-0.149
'd-0.140
pped-0.135
ched-0.134
cher-0.126
Token He
Token ID679
Feature activation+1.80e-01

Pos logit contributions
 festivals+0.108
 batch+0.107
igsaw+0.106
ummy+0.104
 grand+0.104
Neg logit contributions
 managed-0.088
'd-0.079
ched-0.078
 could-0.076
pped-0.075
Token thinks
Token ID6834
Feature activation+6.87e-02

Pos logit contributions
 managed+0.384
pped+0.371
cher+0.370
ier+0.357
'd+0.351
Neg logit contributions
 grand-0.267
 Grand-0.258
 Aunt-0.241
 batch-0.235
 Of-0.232
Token Tim
Token ID5045
Feature activation+1.33e-01

Pos logit contributions
 managed+0.155
'd+0.143
ched+0.143
pped+0.143
 could+0.136
Neg logit contributions
 festivals-0.095
 batch-0.093
igsaw-0.092
 Aunt-0.092
ummy-0.091
Token is
Token ID318
Feature activation+1.39e-01

Pos logit contributions
cher+0.290
tted+0.276
 managed+0.276
pped+0.276
bridge+0.275
Neg logit contributions
 grand-0.155
 Grand-0.148
 batch-0.147
ummy-0.144
 sorts-0.138
Token mean
Token ID1612
Feature activation-6.84e-02

Pos logit contributions
 managed+0.281
'd+0.278
ched+0.267
pped+0.263
tted+0.246
Neg logit contributions
 Grand-0.218
 grand-0.214
 festivals-0.209
 history-0.203
 Of-0.200
Token.
Token ID13
Feature activation-5.58e-02

Pos logit contributions
ummy+0.088
 Grand+0.085
 grand+0.085
 Aunt+0.079
 Of+0.078
Neg logit contributions
 managed-0.168
pped-0.158
'd-0.158
cher-0.157
tted-0.154
Token He
Token ID679
Feature activation+1.78e-01

Pos logit contributions
 batch+0.113
 festivals+0.112
igsaw+0.111
ummy+0.108
 grand+0.107
Neg logit contributions
 managed-0.091
'd-0.081
ched-0.081
 could-0.080
pped-0.078
Token goes
Token ID2925
Feature activation+6.67e-02

Pos logit contributions
 managed+0.391
pped+0.375
cher+0.371
'd+0.358
ier+0.357
Neg logit contributions
 grand-0.253
 Grand-0.245
 Aunt-0.229
 batch-0.228
 Of-0.222
Token to
Token ID284
Feature activation-4.15e-02

Pos logit contributions
'd+0.141
 managed+0.140
ched+0.135
pped+0.134
 wouldn+0.124
Neg logit contributions
 festivals-0.096
 Grand-0.096
 Aunt-0.095
 grand-0.093
 batch-0.091
Token his
Token ID465
Feature activation+2.94e-02

Pos logit contributions
ummy+0.065
igsaw+0.062
 batch+0.061
 festivals+0.059
 Of+0.057
Neg logit contributions
 managed-0.086
pped-0.077
 found-0.077
ched-0.074
'd-0.074
Token the
Token ID262
Feature activation-8.77e-02

Pos logit contributions
ummy+0.105
igsaw+0.100
 batch+0.098
 festivals+0.097
 Of+0.093
Neg logit contributions
 managed-0.138
'd-0.125
 stayed-0.123
ched-0.122
 could-0.119
Token rabbit
Token ID22746
Feature activation+4.11e-02

Pos logit contributions
 Grand+0.148
ummy+0.148
 festivals+0.147
igsaw+0.147
 grand+0.147
Neg logit contributions
 managed-0.169
'd-0.157
ched-0.155
pped-0.154
cher-0.142
Token forgot
Token ID16453
Feature activation+1.23e-01

Pos logit contributions
pped+0.059
cher+0.055
ier+0.055
 managed+0.053
ched+0.052
Neg logit contributions
 Aunt-0.082
 grand-0.081
 Gig-0.079
 Grand-0.079
 batch-0.077
Token about
Token ID546
Feature activation-1.36e-02

Pos logit contributions
 managed+0.256
'd+0.239
pped+0.237
ched+0.237
cher+0.220
Neg logit contributions
 Grand-0.189
 grand-0.187
 festivals-0.187
 Aunt-0.187
ummy-0.187
Token the
Token ID262
Feature activation-1.26e-01

Pos logit contributions
ummy+0.022
igsaw+0.022
 batch+0.020
 Of+0.020
utes+0.020
Neg logit contributions
 managed-0.029
'd-0.026
 stayed-0.025
 wouldn-0.025
pped-0.025
Token carrot
Token ID40562
Feature activation-3.05e-02

Pos logit contributions
 Grand+0.177
ummy+0.177
 festivals+0.177
igsaw+0.176
 Aunt+0.175
Neg logit contributions
 managed-0.277
'd-0.260
ched-0.256
pped-0.256
 stayed-0.238
Token and
Token ID290
Feature activation-8.44e-02

Pos logit contributions
ummy+0.049
 grand+0.048
 batch+0.048
 Grand+0.048
 Aunt+0.047
Neg logit contributions
 managed-0.061
pped-0.058
ched-0.057
'd-0.057
cher-0.052
Token ran
Token ID4966
Feature activation+6.13e-02

Pos logit contributions
 grand+0.143
 Grand+0.142
 Aunt+0.138
 festivals+0.135
 Uncle+0.132
Neg logit contributions
pped-0.161
ched-0.157
 managed-0.151
'd-0.148
tted-0.147
Token away
Token ID1497
Feature activation-4.05e-02

Pos logit contributions
'd+0.087
 managed+0.085
pped+0.080
cher+0.075
ched+0.075
Neg logit contributions
 Aunt-0.132
 Grand-0.130
 festivals-0.130
 grand-0.130
 Gig-0.129
Token as
Token ID355
Feature activation-4.62e-02

Pos logit contributions
ummy+0.072
 Grand+0.072
 festivals+0.072
igsaw+0.072
 batch+0.071
Neg logit contributions
 managed-0.074
'd-0.068
ched-0.067
pped-0.067
 stayed-0.061
Token fast
Token ID3049
Feature activation-8.74e-02

Pos logit contributions
ummy+0.073
igsaw+0.073
 batch+0.073
 festivals+0.072
 Grand+0.072
Neg logit contributions
 managed-0.095
'd-0.088
ched-0.087
pped-0.086
 could-0.080
Token them
Token ID606
Feature activation+1.76e-01

Pos logit contributions
 managed+0.111
ched+0.102
'd+0.101
pped+0.101
 stayed+0.094
Neg logit contributions
igsaw-0.101
ummy-0.100
 batch-0.099
 festivals-0.099
 Of-0.097
Token.
Token ID13
Feature activation-5.67e-02

Pos logit contributions
 managed+0.371
pped+0.349
'd+0.346
ched+0.342
cher+0.315
Neg logit contributions
 grand-0.274
 Grand-0.271
 Aunt-0.264
 festivals-0.260
ummy-0.258
Token They
Token ID1119
Feature activation+1.57e-01

Pos logit contributions
 batch+0.096
igsaw+0.093
ummy+0.091
 bou+0.086
 perspective+0.086
Neg logit contributions
 managed-0.109
 could-0.107
'd-0.099
tted-0.098
ched-0.097
Token have
Token ID423
Feature activation+1.41e-01

Pos logit contributions
pped+0.311
 managed+0.310
ched+0.292
ier+0.288
cher+0.288
Neg logit contributions
 grand-0.256
 Grand-0.249
 Aunt-0.235
 festivals-0.232
 Gig-0.222
Token each
Token ID1123
Feature activation+1.39e-02

Pos logit contributions
 managed+0.318
pped+0.308
ched+0.304
'd+0.300
cher+0.290
Neg logit contributions
 grand-0.199
 Grand-0.196
 Aunt-0.186
 festivals-0.185
 batch-0.181
Token other
Token ID584
Feature activation-2.88e-02

Pos logit contributions
 managed+0.029
pped+0.027
'd+0.027
ched+0.026
cher+0.025
Neg logit contributions
 grand-0.022
 Grand-0.022
 Aunt-0.021
 Of-0.021
 festivals-0.021
Token.
Token ID13
Feature activation-5.46e-02

Pos logit contributions
 grand+0.045
 Grand+0.042
 Aunt+0.040
 festivals+0.039
 Sunday+0.038
Neg logit contributions
 managed-0.061
pped-0.060
'd-0.057
ched-0.056
cher-0.055
Token They
Token ID1119
Feature activation+1.58e-01

Pos logit contributions
 batch+0.099
igsaw+0.095
ummy+0.091
 bou+0.091
 perspective+0.090
Neg logit contributions
 could-0.099
 managed-0.096
tted-0.090
ched-0.088
'd-0.088
Token are
Token ID389
Feature activation+1.09e-01

Pos logit contributions
 managed+0.305
pped+0.300
ched+0.288
ier+0.276
cher+0.274
Neg logit contributions
 grand-0.269
 Grand-0.262
 Aunt-0.249
 festivals-0.249
 Gig-0.235
Token quiet
Token ID5897
Feature activation-1.06e-01

Pos logit contributions
 managed+0.225
pped+0.219
'd+0.217
ched+0.215
cher+0.197
Neg logit contributions
 grand-0.180
 Grand-0.180
 Aunt-0.170
 festivals-0.160
 Uncle-0.158
Token and
Token ID290
Feature activation-9.23e-02

Pos logit contributions
 Grand+0.148
 grand+0.146
ummy+0.138
 Aunt+0.137
 festivals+0.136
Neg logit contributions
 managed-0.244
ched-0.233
pped-0.228
'd-0.228
cher-0.216
Token and
Token ID290
Feature activation-8.29e-02

Pos logit contributions
igsaw+0.219
 Of+0.217
 festivals+0.217
 Grand+0.213
 grand+0.212
Neg logit contributions
 managed-0.175
'd-0.163
pped-0.161
ched-0.159
 could-0.150
Token didn
Token ID1422
Feature activation+1.60e-01

Pos logit contributions
 grand+0.142
 Grand+0.142
 Aunt+0.137
ummy+0.136
 batch+0.134
Neg logit contributions
 managed-0.161
pped-0.152
ched-0.151
'd-0.146
cher-0.139
Token't
Token ID470
Feature activation+1.10e-01

Pos logit contributions
 managed+0.285
'd+0.260
cher+0.243
pped+0.241
ched+0.237
Neg logit contributions
 Grand-0.300
ummy-0.299
 grand-0.295
 Aunt-0.294
igsaw-0.293
Token shine
Token ID18340
Feature activation+2.32e-02

Pos logit contributions
 managed+0.208
pped+0.195
'd+0.192
ched+0.190
cher+0.181
Neg logit contributions
 grand-0.186
 Grand-0.184
 festivals-0.180
 Aunt-0.178
 Gig-0.177
Token very
Token ID845
Feature activation-4.28e-02

Pos logit contributions
 managed+0.047
pped+0.043
'd+0.043
ched+0.042
cher+0.040
Neg logit contributions
 grand-0.039
 Grand-0.039
ummy-0.039
 Aunt-0.038
 festivals-0.037
Token much
Token ID881
Feature activation-4.85e-02

Pos logit contributions
 festivals+0.073
 Grand+0.073
igsaw+0.073
ummy+0.073
 batch+0.073
Neg logit contributions
 managed-0.082
'd-0.075
ched-0.074
pped-0.074
 wouldn-0.068
Token.
Token ID13
Feature activation-3.89e-02

Pos logit contributions
ummy+0.083
 Grand+0.082
 grand+0.082
 Aunt+0.080
 festivals+0.080
Neg logit contributions
 managed-0.095
'd-0.088
pped-0.087
ched-0.086
cher-0.081
Token Then
Token ID3244
Feature activation+4.68e-02

Pos logit contributions
igsaw+0.092
 batch+0.091
 grand+0.085
ummy+0.084
 festivals+0.084
Neg logit contributions
 could-0.046
 managed-0.042
'd-0.040
 stayed-0.036
ched-0.036
Token one
Token ID530
Feature activation-6.63e-03

Pos logit contributions
 managed+0.103
'd+0.096
pped+0.094
ched+0.094
 stayed+0.089
Neg logit contributions
igsaw-0.067
ummy-0.067
 batch-0.066
 festivals-0.066
 Grand-0.066
Token day
Token ID1110
Feature activation-1.68e-02

Pos logit contributions
igsaw+0.007
ummy+0.007
 festivals+0.007
 Grand+0.007
 batch+0.007
Neg logit contributions
 managed-0.017
'd-0.016
ched-0.016
pped-0.015
 stayed-0.015
Token,
Token ID11
Feature activation-1.00e-01

Pos logit contributions
ummy+0.023
 Grand+0.023
 festivals+0.023
igsaw+0.023
 grand+0.023
Neg logit contributions
 managed-0.037
'd-0.035
ched-0.035
pped-0.035
 stayed-0.032
Token.
Token ID13
Feature activation-5.59e-02

Pos logit contributions
 Grand+0.029
 Aunt+0.028
ummy+0.028
 grand+0.028
 festivals+0.028
Neg logit contributions
 managed-0.036
pped-0.034
ched-0.034
'd-0.034
cher-0.031
Token Whenever
Token ID21326
Feature activation+1.76e-02

Pos logit contributions
 batch+0.092
 festivals+0.089
ummy+0.088
 grand+0.088
igsaw+0.086
Neg logit contributions
 managed-0.111
'd-0.103
ched-0.102
 could-0.098
pped-0.098
Token she
Token ID673
Feature activation+2.00e-01

Pos logit contributions
 managed+0.035
ched+0.033
'd+0.032
pped+0.032
cher+0.030
Neg logit contributions
 Grand-0.030
 Aunt-0.029
 festivals-0.029
 grand-0.029
ummy-0.029
Token was
Token ID373
Feature activation+1.36e-01

Pos logit contributions
cher+0.287
pped+0.273
ier+0.272
ched+0.270
tted+0.270
Neg logit contributions
 grand-0.414
 Grand-0.399
 Aunt-0.375
 festivals-0.371
 sunset-0.359
Token in
Token ID287
Feature activation-1.18e-01

Pos logit contributions
'd+0.270
 managed+0.263
cher+0.258
ched+0.253
pped+0.240
Neg logit contributions
 festivals-0.213
 Grand-0.209
 Aunt-0.207
 grand-0.203
 Sunday-0.201
Token the
Token ID262
Feature activation-1.49e-01

Pos logit contributions
ummy+0.161
 Grand+0.159
 festivals+0.159
igsaw+0.158
 grand+0.157
Neg logit contributions
 managed-0.266
'd-0.248
ched-0.245
pped-0.245
 stayed-0.228
Token park
Token ID3952
Feature activation-1.02e-02

Pos logit contributions
 Grand+0.173
 grand+0.171
 festivals+0.171
 Aunt+0.170
ummy+0.170
Neg logit contributions
 managed-0.370
'd-0.347
ched-0.347
pped-0.345
cher-0.327
Token she
Token ID673
Feature activation+1.71e-01

Pos logit contributions
 Grand+0.012
 Aunt+0.012
 grand+0.012
 festivals+0.012
ummy+0.012
Neg logit contributions
 managed-0.025
pped-0.024
ched-0.024
'd-0.023
cher-0.023
Token always
Token ID1464
Feature activation+6.52e-02

Pos logit contributions
pped+0.235
cher+0.225
ched+0.217
tted+0.212
ier+0.211
Neg logit contributions
 grand-0.358
 Grand-0.351
 festivals-0.330
 Aunt-0.323
 sorts-0.317
Token spun
Token ID26843
Feature activation-5.84e-03

Pos logit contributions
pped+0.088
ched+0.084
tted+0.078
cher+0.076
 managed+0.074
Neg logit contributions
 grand-0.139
 Grand-0.138
 festivals-0.135
 sorts-0.129
 Aunt-0.129
Token,
Token ID11
Feature activation-1.21e-01

Pos logit contributions
 festivals+0.010
 Grand+0.010
 Aunt+0.010
igsaw+0.010
 grand+0.010
Neg logit contributions
 managed-0.011
'd-0.010
pped-0.010
ched-0.009
cher-0.009
Token.
Token ID13
Feature activation-6.66e-02

Pos logit contributions
igsaw+0.085
 festivals+0.082
 grand+0.081
 Aunt+0.081
 batch+0.081
Neg logit contributions
 managed-0.060
'd-0.056
pped-0.055
ched-0.055
 stayed-0.051
Token It
Token ID632
Feature activation+1.71e-01

Pos logit contributions
igsaw+0.097
ummy+0.090
 batch+0.090
 festivals+0.090
 orig+0.087
Neg logit contributions
 managed-0.145
'd-0.137
 could-0.134
ched-0.131
pped-0.131
Token was
Token ID373
Feature activation+1.22e-01

Pos logit contributions
 managed+0.226
pped+0.210
ched+0.199
cher+0.193
'd+0.188
Neg logit contributions
 Grand-0.397
 grand-0.392
 Aunt-0.389
ummy-0.382
 Of-0.379
Token a
Token ID257
Feature activation-1.01e-01

Pos logit contributions
 managed+0.244
'd+0.228
pped+0.224
ched+0.221
 stayed+0.211
Neg logit contributions
igsaw-0.203
 batch-0.198
 festivals-0.197
ummy-0.195
 Of-0.194
Token big
Token ID1263
Feature activation-1.98e-01

Pos logit contributions
igsaw+0.154
 festivals+0.154
 Of+0.149
 batch+0.149
ummy+0.142
Neg logit contributions
'd-0.206
 managed-0.203
 could-0.198
 had-0.190
 remembered-0.185
Token,
Token ID11
Feature activation-1.74e-01

Pos logit contributions
 festivals+0.337
 Of+0.325
igsaw+0.318
 batch+0.314
 Gig+0.310
Neg logit contributions
 managed-0.355
'd-0.351
 could-0.331
 found-0.326
 had-0.326
Token round
Token ID2835
Feature activation-1.15e-01

Pos logit contributions
ummy+0.225
igsaw+0.220
 batch+0.219
 festivals+0.217
 Of+0.206
Neg logit contributions
 managed-0.406
'd-0.393
ched-0.372
 could-0.370
pped-0.365
Token,
Token ID11
Feature activation-1.69e-01

Pos logit contributions
 festivals+0.191
 Aunt+0.180
 Grand+0.178
 Of+0.178
igsaw+0.177
Neg logit contributions
 managed-0.216
'd-0.212
pped-0.204
ched-0.200
 could-0.198
Token yellow
Token ID7872
Feature activation-1.50e-01

Pos logit contributions
igsaw+0.212
ummy+0.204
 festivals+0.202
 batch+0.200
 Aunt+0.188
Neg logit contributions
 managed-0.409
'd-0.393
ched-0.377
 could-0.375
pped-0.373
Token ball
Token ID2613
Feature activation+2.19e-02

Pos logit contributions
 festivals+0.245
igsaw+0.236
 batch+0.229
 Aunt+0.228
 Grand+0.226
Neg logit contributions
 managed-0.281
'd-0.281
pped-0.269
 could-0.264
ched-0.263
Token.
Token ID13
Feature activation-5.77e-02

Pos logit contributions
 managed+0.039
'd+0.036
ched+0.035
pped+0.035
 stayed+0.032
Neg logit contributions
 festivals-0.040
igsaw-0.040
 Grand-0.040
ummy-0.040
 grand-0.040
Token He
Token ID679
Feature activation+1.95e-01

Pos logit contributions
igsaw+0.080
 batch+0.076
ummy+0.072
 festivals+0.072
 orig+0.072
Neg logit contributions
 managed-0.065
 could-0.064
'd-0.061
ched-0.058
 found-0.056
Token noticed
Token ID6810
Feature activation+1.04e-02

Pos logit contributions
pped+0.293
ched+0.290
ier+0.286
cher+0.282
 managed+0.281
Neg logit contributions
 grand-0.414
 Grand-0.411
 Aunt-0.390
 batch-0.376
 festivals-0.373
Token another
Token ID1194
Feature activation-1.11e-01

Pos logit contributions
 managed+0.019
pped+0.018
ched+0.017
'd+0.017
 could+0.016
Neg logit contributions
igsaw-0.020
ummy-0.019
 batch-0.019
 Of-0.018
 festivals-0.018
Token infant
Token ID11212
Feature activation+1.97e-02

Pos logit contributions
igsaw+0.147
 festivals+0.145
 Of+0.144
 Grand+0.144
 grand+0.138
Neg logit contributions
 managed-0.246
'd-0.235
ched-0.224
pped-0.222
 could-0.218
Token in
Token ID287
Feature activation-1.26e-01

Pos logit contributions
 managed+0.040
ched+0.039
pped+0.037
tted+0.037
'd+0.036
Neg logit contributions
ummy-0.030
 Grand-0.030
 Aunt-0.030
 batch-0.029
 grand-0.029
Token the
Token ID262
Feature activation-1.53e-01

Pos logit contributions
ummy+0.210
 Of+0.205
 batch+0.201
utes+0.199
 festivals+0.195
Neg logit contributions
 managed-0.243
pped-0.226
ched-0.215
'd-0.213
 found-0.203
Token garden
Token ID11376
Feature activation-1.15e-02

Pos logit contributions
 Grand+0.190
ummy+0.190
 festivals+0.189
igsaw+0.188
 grand+0.188
Neg logit contributions
 managed-0.365
'd-0.343
ched-0.339
pped-0.338
cher-0.316
Token who
Token ID508
Feature activation-1.65e-03

Pos logit contributions
ummy+0.017
 Grand+0.017
 grand+0.017
 batch+0.016
 festivals+0.016
Neg logit contributions
 managed-0.025
ched-0.024
pped-0.023
'd-0.023
cher-0.022
Token was
Token ID373
Feature activation+1.26e-01

Pos logit contributions
 Grand+0.004
 Aunt+0.004
 grand+0.004
 batch+0.004
 festivals+0.004
Neg logit contributions
 managed-0.002
pped-0.002
ched-0.002
'd-0.002
cher-0.002
Token also
Token ID635
Feature activation+7.45e-02

Pos logit contributions
 managed+0.227
ched+0.214
'd+0.211
pped+0.210
cher+0.200
Neg logit contributions
 Grand-0.232
 grand-0.228
 Aunt-0.227
 batch-0.226
 festivals-0.224
Token playing
Token ID2712
Feature activation+1.96e-03

Pos logit contributions
 managed+0.148
'd+0.142
ched+0.140
pped+0.138
cher+0.129
Neg logit contributions
 Grand-0.122
 grand-0.120
 batch-0.119
 Aunt-0.117
 festivals-0.116
Token loved
Token ID6151
Feature activation-2.89e-02

Pos logit contributions
 managed+0.223
pped+0.213
cher+0.202
ier+0.202
ched+0.201
Neg logit contributions
 grand-0.272
 Grand-0.269
 Aunt-0.259
 festivals-0.252
ummy-0.250
Token to
Token ID284
Feature activation-3.73e-02

Pos logit contributions
igsaw+0.070
ummy+0.064
 Of+0.061
 batch+0.061
 perspective+0.060
Neg logit contributions
 managed-0.035
 stayed-0.034
 could-0.033
'd-0.032
 found-0.030
Token play
Token ID711
Feature activation-1.80e-04

Pos logit contributions
ummy+0.070
igsaw+0.069
 batch+0.067
 Of+0.066
 perspective+0.064
Neg logit contributions
 managed-0.065
'd-0.063
 stayed-0.059
ched-0.059
pped-0.059
Token outside
Token ID2354
Feature activation-5.43e-02

Pos logit contributions
igsaw+0.000
ummy+0.000
 Grand+0.000
 Aunt+0.000
 grand+0.000
Neg logit contributions
 managed-0.000
pped-0.000
ched-0.000
'd-0.000
 stayed-0.000
Token in
Token ID287
Feature activation-1.26e-01

Pos logit contributions
igsaw+0.125
 batch+0.120
 Aunt+0.119
ummy+0.119
 festivals+0.119
Neg logit contributions
 managed-0.073
'd-0.066
ched-0.065
pped-0.064
 could-0.062
Token the
Token ID262
Feature activation-1.50e-01

Pos logit contributions
 batch+0.212
ummy+0.211
 Of+0.209
igsaw+0.206
utes+0.193
Neg logit contributions
 managed-0.239
pped-0.227
 found-0.213
'd-0.210
 could-0.210
Token sun
Token ID4252
Feature activation-6.73e-02

Pos logit contributions
 batch+0.198
igsaw+0.197
 Of+0.197
 Grand+0.191
ummy+0.190
Neg logit contributions
 managed-0.338
pped-0.317
ched-0.317
'd-0.315
 stayed-0.303
Token.
Token ID13
Feature activation-4.36e-02

Pos logit contributions
ummy+0.119
 Grand+0.119
 festivals+0.118
 grand+0.118
 Aunt+0.118
Neg logit contributions
 managed-0.125
'd-0.115
ched-0.113
pped-0.113
cher-0.103
Token One
Token ID1881
Feature activation-5.26e-02

Pos logit contributions
igsaw+0.079
 orig+0.075
 batch+0.072
 perspective+0.071
ummy+0.071
Neg logit contributions
'd-0.076
 managed-0.074
 could-0.074
ched-0.071
 stayed-0.067
Token day
Token ID1110
Feature activation-1.78e-02

Pos logit contributions
igsaw+0.072
 festivals+0.069
 batch+0.069
 Grand+0.067
ummy+0.067
Neg logit contributions
 managed-0.120
'd-0.115
ched-0.111
 stayed-0.110
pped-0.108
Token,
Token ID11
Feature activation-1.13e-01

Pos logit contributions
igsaw+0.026
 festivals+0.026
 batch+0.026
 Grand+0.026
ummy+0.026
Neg logit contributions
 managed-0.038
ched-0.036
'd-0.036
pped-0.036
 stayed-0.033
Token airplane
Token ID19401
Feature activation+2.93e-02

Pos logit contributions
 grand+0.090
 Grand+0.090
ummy+0.087
 Aunt+0.087
 festivals+0.087
Neg logit contributions
 managed-0.117
pped-0.114
ched-0.110
'd-0.108
 wouldn-0.098
Token.
Token ID13
Feature activation-4.78e-02

Pos logit contributions
pped+0.061
 managed+0.060
ched+0.057
'd+0.055
ave+0.052
Neg logit contributions
 grand-0.045
 Grand-0.044
 festivals-0.043
 Aunt-0.043
igsaw-0.042
Token It
Token ID632
Feature activation+1.99e-01

Pos logit contributions
 batch+0.077
 festivals+0.077
 orig+0.077
 bou+0.076
 grand+0.076
Neg logit contributions
 managed-0.087
ched-0.083
 could-0.081
pped-0.077
'd-0.077
Token took
Token ID1718
Feature activation+5.25e-02

Pos logit contributions
pped+0.319
 managed+0.261
cher+0.257
ched+0.255
tted+0.242
Neg logit contributions
 grand-0.412
 Grand-0.403
 sorts-0.402
 Aunt-0.389
 batch-0.382
Token a
Token ID257
Feature activation-8.37e-02

Pos logit contributions
 managed+0.106
'd+0.100
pped+0.098
ched+0.096
cher+0.091
Neg logit contributions
 Grand-0.084
 grand-0.083
igsaw-0.082
 Aunt-0.082
ummy-0.082
Token little
Token ID1310
Feature activation-1.83e-01

Pos logit contributions
ummy+0.114
 festivals+0.108
 Of+0.108
 Gig+0.108
 Aunt+0.107
Neg logit contributions
 managed-0.196
'd-0.192
ched-0.181
pped-0.169
cher-0.169
Token bit
Token ID1643
Feature activation-1.21e-01

Pos logit contributions
 festivals+0.317
 Grand+0.310
 Aunt+0.309
ummy+0.307
 Gig+0.303
Neg logit contributions
 managed-0.351
'd-0.324
ched-0.318
pped-0.303
cher-0.294
Token of
Token ID286
Feature activation-1.20e-01

Pos logit contributions
 festivals+0.218
 Aunt+0.217
 Grand+0.217
igsaw+0.216
 grand+0.215
Neg logit contributions
 managed-0.223
'd-0.203
ched-0.203
pped-0.200
 stayed-0.183
Token practice
Token ID3357
Feature activation-4.19e-02

Pos logit contributions
ummy+0.186
 batch+0.178
 Of+0.177
 Gig+0.176
 Aunt+0.175
Neg logit contributions
 managed-0.250
ched-0.226
pped-0.225
'd-0.223
 stayed-0.208
Token,
Token ID11
Feature activation-1.28e-01

Pos logit contributions
 grand+0.065
 Grand+0.064
 festivals+0.064
igsaw+0.063
 batch+0.062
Neg logit contributions
 managed-0.087
'd-0.081
pped-0.081
ched-0.080
cher-0.075
Token but
Token ID475
Feature activation+3.57e-02

Pos logit contributions
 Grand+0.219
 grand+0.217
 festivals+0.214
 Aunt+0.214
 batch+0.212
Neg logit contributions
 managed-0.243
ched-0.228
'd-0.227
pped-0.227
cher-0.209