TOP ACTIVATIONS
MAX = 0.178

Ben had an idea . He wanted to shoot something with
Max had an idea . He showed Sam his ball and
Bob had an idea . He said , " Let 's
Zip had an idea . He thought , " I will
Tim had an idea . He said , " Let 's
Tim had an idea . He said to the tree ,
o had an idea . He found some tape and told
Tom had an idea . He started to whistle very loud
he had an idea . He decided to do a big
Bob had an idea . He went to the town and
George was confused . He said , " What 's
my had an idea . He asked them what they could
Tim my was sad . He really wanted to wear the
squirrel had an idea . He told the mouse : "
Tim had an idea . He brought his friends to see
Joe was sad . He shook his head .
rubber had an idea . He started to bounce and make
Max had an idea . He saw a small switch on
match had an idea . He asked a nearby bird to
Tom had an idea . He saw a swing nearby and

MIN ACTIVATIONS
MIN = -0.277

to take a journey to outer space . He hopped into
what the rocket discovered in outer space . It was a
the stars and planets in outer space . So one day
special star that comes from outer space ," she explained .
izz ed off back into outer space . Earth was an
looked like it came from outer space . L
Emma who loved learning about outer space . Everyday , she
. Soon they were in outer space and having fun .
, and even went to outer space ! It was the
that they were going to outer space . They watched out
mouse never suffered from the rud eness of other animals again
getting very annoyed with his rud eness and complained that he
fold the knob into an orig ami figure . But no
boat saved them from their dire situation and they were able
sailed for days , exploring unch arted waters until he came
she got rid of the intr uders . She did not
he couldn 't touch the v ase . Tim my felt
y cat knocked over the v ase . The flowers fell
brother accidentally knocked over the v ase and it broke into
he couldn 't catch the aer opl ane . Suddenly ,

INTERVAL 0.064 - 0.178
CONTAINS 15.387%

hurt and couldn 't help him . <|endoftext|> Once upon a
fish a nice home . He grabbed some seaw eed and
woman was still scared . She decided to call the police
his friend , a little girl named Sue . They wanted
thought marking things made him seem more important . One day

INTERVAL -0.049 - 0.064
CONTAINS 63.848%

bag . He shouted excited ly to his friends
named Jessica . She loved playing with glue and making things
, on top of trees , and near the river .
said , " Good job . Now we need to put
did that day . When she finished ,

INTERVAL -0.163 - -0.049
CONTAINS 20.564%

a little pig gy named Pe ppa . Pe ppa lived
closer family . The moral of this story is
a noise and saw a wild animal coming her way .
young kids playing nearby . The pirate decided to steal their
I won 't let anything bad happen to you ." So

INTERVAL -0.277 - -0.163
CONTAINS 0.201%

big , dark cloud in the sky . Tim felt upset
family about her adventure at the lovely cabin . <|endoftext|> Once
taking him on adventures around the garden . They would explore
He saw a fairy in the woods . The fairy was
planted some beautiful flowers around the edges to make it look
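
NOTE ON THE INTERVALS
The four INTERVAL blocks above appear to split the range from MIN (-0.277) to MAX (0.178) into equal-width bins of about 0.114 each, and the four CONTAINS percentages sum to 100%. The dump does not state how the bins were built, so the sketch below is an assumption, not the tool's actual method. The names interval_edges and interval_shares are hypothetical helpers introduced only for illustration, and the sample activations at the end are made up.

```python
# Sketch only: assumes the dashboard uses equal-width bins between MIN and MAX.
# interval_edges / interval_shares are hypothetical names, not part of any tool.

def interval_edges(lo, hi, n=4):
    """Return n+1 edges splitting [lo, hi] into n equal-width intervals."""
    width = (hi - lo) / n
    # Use hi itself as the last edge to avoid floating-point drift.
    return [lo + i * width for i in range(n)] + [hi]

def interval_shares(activations, edges):
    """Return the percentage of activations that fall in each interval."""
    counts = [0] * (len(edges) - 1)
    for a in activations:
        for i in range(len(counts)):
            is_last = i == len(counts) - 1
            # Intervals are half-open, except the last one, which includes hi.
            upper_ok = a <= edges[i + 1] if is_last else a < edges[i + 1]
            if edges[i] <= a and upper_ok:
                counts[i] += 1
                break
    return [100.0 * c / len(activations) for c in counts]

# Edges implied by the MIN and MAX values shown in this dump.
edges = interval_edges(-0.277, 0.178)

# Made-up activations, purely to show the call; not data from the dump.
shares = interval_shares([-0.277, -0.1, 0.0, 0.0, 0.178], edges)
```

The computed edges agree with the displayed boundaries (-0.163, -0.049, 0.064) to within about 0.001. That is consistent with equal-width binning of rounded MIN and MAX values, but it does not confirm it.
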
TokenBen
Token ID11696
Feature activation+5.47e-02

Pos logit contributions
igsaw+0.094
ummy+0.083
 types+0.080
opl+0.078
umm+0.078
Neg logit contributions
 managed-0.070
 tried-0.070
 decided-0.069
 remembered-0.068
 never-0.066
Token had
Token ID550
Feature activation+1.02e-01

Pos logit contributions
 unable+0.055
pped+0.053
 managed+0.048
 wouldn+0.044
'd+0.043
Neg logit contributions
 types-0.117
 new-0.110
 seats-0.109
aj-0.107
 art-0.106
Token an
Token ID281
Feature activation-1.31e-01

Pos logit contributions
 managed+0.131
 tried+0.126
 wouldn+0.125
 remembered+0.125
 decided+0.124
Neg logit contributions
igsaw-0.222
 types-0.203
ummy-0.198
 new-0.198
 art-0.195
Token idea
Token ID2126
Feature activation+5.26e-02

Pos logit contributions
igsaw+0.159
ummy+0.126
 types+0.119
umm+0.114
opl+0.113
Neg logit contributions
 managed-0.329
 tried-0.328
 decided-0.327
 remembered-0.323
 never-0.317
Token.
Token ID13
Feature activation+2.15e-02

Pos logit contributions
 managed+0.108
 remembered+0.107
 accidentally+0.102
'd+0.101
 stumbled+0.100
Neg logit contributions
igsaw-0.071
ummy-0.065
 types-0.065
 new-0.064
 art-0.060
Token He
Token ID679
Feature activation+1.78e-01

Pos logit contributions
 tried+0.027
 decided+0.026
 managed+0.026
 remembered+0.024
 could+0.024
Neg logit contributions
igsaw-0.054
opl-0.048
ummy-0.048
 bou-0.047
 types-0.047
Token wanted
Token ID2227
Feature activation+5.13e-02

Pos logit contributions
 unable+0.185
pped+0.169
 never+0.145
 managed+0.139
Please+0.137
Neg logit contributions
 types-0.404
 new-0.373
igsaw-0.370
aj-0.370
 bou-0.367
Token to
Token ID284
Feature activation+5.04e-02

Pos logit contributions
 tried+0.073
 managed+0.073
 decided+0.072
 remembered+0.070
 never+0.069
Neg logit contributions
igsaw-0.120
ummy-0.106
 types-0.103
umm-0.103
opl-0.102
Token shoot
Token ID2686
Feature activation-4.54e-02

Pos logit contributions
 managed+0.089
 remembered+0.089
 unable+0.089
 wouldn+0.087
 stumbled+0.085
Neg logit contributions
igsaw-0.093
 types-0.081
ummy-0.079
 art-0.077
 new-0.076
Token something
Token ID1223
Feature activation+8.73e-03

Pos logit contributions
igsaw+0.103
ummy+0.091
umm+0.089
opl+0.087
 types+0.086
Neg logit contributions
 tried-0.072
 decided-0.070
 managed-0.069
 remembered-0.068
 didn-0.065
Token with
Token ID351
Feature activation-3.33e-02

Pos logit contributions
 tried+0.013
 decided+0.013
 managed+0.012
 never+0.012
 remembered+0.012
Neg logit contributions
igsaw-0.021
ummy-0.018
umm-0.018
opl-0.018
 types-0.018
TokenMax
Token ID11518
Feature activation+8.46e-02

Pos logit contributions
igsaw+0.076
ay+0.067
ummy+0.065
 Other+0.063
 types+0.063
Neg logit contributions
 managed-0.080
 wouldn-0.079
 remembered-0.077
 tried-0.077
 dropped-0.076
Token had
Token ID550
Feature activation+1.05e-01

Pos logit contributions
pped+0.109
 unable+0.103
 managed+0.095
'd+0.085
 wouldn+0.084
Neg logit contributions
 new-0.159
 types-0.158
ummy-0.153
 coll-0.153
 bou-0.151
Token an
Token ID281
Feature activation-1.31e-01

Pos logit contributions
 managed+0.171
 tried+0.169
 decided+0.168
 remembered+0.165
 wouldn+0.159
Neg logit contributions
igsaw-0.216
ummy-0.189
 types-0.186
opl-0.179
umm-0.178
Token idea
Token ID2126
Feature activation+5.32e-02

Pos logit contributions
igsaw+0.163
ummy+0.130
 types+0.123
umm+0.120
opl+0.115
Neg logit contributions
 tried-0.323
 managed-0.322
 decided-0.321
 remembered-0.317
 never-0.312
Token.
Token ID13
Feature activation+2.05e-02

Pos logit contributions
 managed+0.113
 remembered+0.111
 accidentally+0.108
 stumbled+0.107
 unable+0.106
Neg logit contributions
igsaw-0.064
 new-0.061
 types-0.059
ummy-0.058
 art-0.055
Token He
Token ID679
Feature activation+1.77e-01

Pos logit contributions
 tried+0.029
 managed+0.028
 decided+0.028
 remembered+0.027
 never+0.026
Neg logit contributions
igsaw-0.048
ummy-0.043
opl-0.042
 types-0.042
umm-0.041
Token showed
Token ID3751
Feature activation-4.47e-02

Pos logit contributions
 unable+0.200
pped+0.197
 managed+0.161
 never+0.156
Please+0.152
Neg logit contributions
 types-0.378
 new-0.359
 bou-0.351
 coll-0.349
aj-0.349
Token Sam
Token ID3409
Feature activation+6.51e-02

Pos logit contributions
igsaw+0.105
umm+0.096
ummy+0.092
aj+0.091
opl+0.090
Neg logit contributions
 tried-0.068
 decided-0.064
 managed-0.064
 never-0.061
 remembered-0.061
Token his
Token ID465
Feature activation-4.47e-02

Pos logit contributions
 tried+0.105
 decided+0.104
 managed+0.103
 remembered+0.101
 never+0.100
Neg logit contributions
igsaw-0.142
umm-0.129
ummy-0.127
opl-0.121
 types-0.120
Token ball
Token ID2613
Feature activation+1.83e-03

Pos logit contributions
igsaw+0.078
umm+0.076
ummy+0.069
 types+0.066
Whe+0.065
Neg logit contributions
 tried-0.092
 remembered-0.088
 dropped-0.088
 didn-0.086
 never-0.086
Token and
Token ID290
Feature activation+1.13e-02

Pos logit contributions
 managed+0.003
 remembered+0.003
 tried+0.003
 decided+0.003
 stumbled+0.003
Neg logit contributions
igsaw-0.004
ummy-0.003
 types-0.003
 new-0.003
opl-0.003
TokenBob
Token ID18861
Feature activation+7.45e-02

Pos logit contributions
igsaw+0.108
ummy+0.097
 types+0.092
opl+0.091
umm+0.089
Neg logit contributions
 managed-0.086
 tried-0.085
 remembered-0.083
 wouldn-0.082
 decided-0.081
Token had
Token ID550
Feature activation+1.01e-01

Pos logit contributions
pped+0.092
 unable+0.091
 managed+0.086
'd+0.081
 wouldn+0.080
Neg logit contributions
igsaw-0.151
 types-0.150
 bou-0.143
ummy-0.142
 batch-0.142
Token an
Token ID281
Feature activation-1.31e-01

Pos logit contributions
 managed+0.171
 tried+0.168
 decided+0.167
 remembered+0.165
 wouldn+0.161
Neg logit contributions
igsaw-0.198
ummy-0.172
 types-0.170
opl-0.163
 batch-0.160
Token idea
Token ID2126
Feature activation+5.31e-02

Pos logit contributions
igsaw+0.174
ummy+0.138
umm+0.133
 types+0.131
opl+0.125
Neg logit contributions
 tried-0.316
 decided-0.314
 managed-0.311
 remembered-0.307
 never-0.304
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
 managed+0.110
 remembered+0.107
 accidentally+0.105
 stumbled+0.102
'd+0.102
Neg logit contributions
igsaw-0.076
ummy-0.069
 new-0.066
 types-0.066
ay-0.062
Token He
Token ID679
Feature activation+1.77e-01

Pos logit contributions
 tried+0.020
 decided+0.020
 managed+0.020
 could+0.019
 remembered+0.019
Neg logit contributions
igsaw-0.042
opl-0.038
ummy-0.036
 types-0.036
umm-0.036
Token said
Token ID531
Feature activation-8.85e-04

Pos logit contributions
 unable+0.204
pped+0.190
 managed+0.181
 never+0.181
 stumbled+0.171
Neg logit contributions
 types-0.376
igsaw-0.371
 bou-0.362
 new-0.353
 coll-0.352
Token,
Token ID11
Feature activation+3.05e-02

Pos logit contributions
igsaw+0.002
ummy+0.002
 types+0.002
umm+0.001
opl+0.001
Neg logit contributions
 managed-0.002
 tried-0.001
 decided-0.001
 remembered-0.001
 never-0.001
Token "
Token ID366
Feature activation+3.08e-02

Pos logit contributions
 managed+0.066
 remembered+0.064
 decided+0.064
 tried+0.064
 never+0.062
Neg logit contributions
igsaw-0.042
ummy-0.038
 types-0.035
 new-0.034
 bou-0.034
TokenLet
Token ID5756
Feature activation+6.71e-02

Pos logit contributions
 managed+0.045
 remembered+0.044
 tried+0.044
 dropped+0.043
 wouldn+0.043
Neg logit contributions
igsaw-0.065
 types-0.060
umm-0.058
ummy-0.057
Whe-0.055
Token's
Token ID338
Feature activation-1.27e-03

Pos logit contributions
 managed+0.114
 tried+0.109
 remembered+0.109
 decided+0.108
 unable+0.106
Neg logit contributions
igsaw-0.131
ummy-0.119
 types-0.115
 difference-0.110
opl-0.109
TokenZip
Token ID41729
Feature activation+3.37e-02

Pos logit contributions
igsaw+0.081
 types+0.072
ay+0.071
ummy+0.071
opl+0.070
Neg logit contributions
 managed-0.081
 remembered-0.079
 wouldn-0.078
 accidentally-0.078
 decided-0.077
Token had
Token ID550
Feature activation+9.88e-02

Pos logit contributions
 unable+0.031
 managed+0.029
'd+0.027
 wouldn+0.027
 never+0.026
Neg logit contributions
 types-0.081
igsaw-0.080
 coll-0.077
 batch-0.076
 seats-0.075
Token an
Token ID281
Feature activation-1.32e-01

Pos logit contributions
 tried+0.171
 managed+0.170
 decided+0.170
 remembered+0.166
 never+0.165
Neg logit contributions
igsaw-0.205
ummy-0.179
umm-0.172
 types-0.171
opl-0.168
Token idea
Token ID2126
Feature activation+5.12e-02

Pos logit contributions
igsaw+0.186
ummy+0.147
umm+0.144
 types+0.138
aj+0.135
Neg logit contributions
 tried-0.310
 decided-0.307
 managed-0.300
 remembered-0.298
 never-0.296
Token.
Token ID13
Feature activation+2.43e-02

Pos logit contributions
 managed+0.105
 remembered+0.104
 unable+0.099
 accidentally+0.098
'd+0.098
Neg logit contributions
igsaw-0.067
ummy-0.063
 types-0.062
 new-0.062
ay-0.059
Token He
Token ID679
Feature activation+1.77e-01

Pos logit contributions
 tried+0.036
 decided+0.035
 managed+0.034
 remembered+0.033
 never+0.033
Neg logit contributions
igsaw-0.057
ummy-0.050
umm-0.048
 types-0.048
opl-0.048
Token thought
Token ID1807
Feature activation+5.06e-02

Pos logit contributions
 unable+0.163
 managed+0.145
pped+0.141
 never+0.140
 stumbled+0.129
Neg logit contributions
 types-0.432
igsaw-0.428
 batch-0.406
 coll-0.405
aj-0.403
Token,
Token ID11
Feature activation+3.79e-02

Pos logit contributions
 tried+0.081
 managed+0.080
 decided+0.078
 remembered+0.076
 never+0.076
Neg logit contributions
igsaw-0.110
umm-0.096
ummy-0.096
 types-0.094
opl-0.093
Token "
Token ID366
Feature activation+3.37e-02

Pos logit contributions
 managed+0.073
 remembered+0.072
 decided+0.071
 tried+0.070
 never+0.068
Neg logit contributions
igsaw-0.064
ummy-0.057
 types-0.056
 batch-0.052
 coll-0.052
TokenI
Token ID40
Feature activation+1.34e-01

Pos logit contributions
 unable+0.052
 remembered+0.050
 managed+0.049
 accidentally+0.049
 dropped+0.049
Neg logit contributions
 types-0.063
igsaw-0.063
Whe-0.061
umm-0.059
ay-0.058
Token will
Token ID481
Feature activation+1.35e-01

Pos logit contributions
 unable+0.200
pped+0.180
 decided+0.169
 managed+0.169
 remembered+0.167
Neg logit contributions
igsaw-0.275
 types-0.251
ay-0.251
 difference-0.248
 sorts-0.244
TokenTim
Token ID14967
Feature activation+7.20e-02

Pos logit contributions
igsaw+0.080
ummy+0.071
 types+0.069
ay+0.069
opl+0.068
Neg logit contributions
 managed-0.087
 wouldn-0.086
 remembered-0.085
 unable-0.083
 tried-0.082
Token had
Token ID550
Feature activation+1.00e-01

Pos logit contributions
 unable+0.099
pped+0.090
'd+0.089
 managed+0.087
 wouldn+0.085
Neg logit contributions
 types-0.129
 seats-0.119
 batch-0.119
udd-0.118
 new-0.118
Token an
Token ID281
Feature activation-1.33e-01

Pos logit contributions
 managed+0.170
 tried+0.170
 decided+0.169
 remembered+0.165
 never+0.162
Neg logit contributions
igsaw-0.208
ummy-0.182
 types-0.176
umm-0.174
opl-0.172
Token idea
Token ID2126
Feature activation+4.88e-02

Pos logit contributions
igsaw+0.169
ummy+0.134
 types+0.127
umm+0.125
opl+0.120
Neg logit contributions
 tried-0.326
 managed-0.323
 decided-0.323
 remembered-0.318
 never-0.313
Token.
Token ID13
Feature activation+1.72e-02

Pos logit contributions
 managed+0.106
 remembered+0.104
 unable+0.099
 stumbled+0.099
'd+0.099
Neg logit contributions
igsaw-0.056
 new-0.054
ummy-0.052
 types-0.051
 art-0.050
Token He
Token ID679
Feature activation+1.76e-01

Pos logit contributions
 tried+0.024
 managed+0.024
 decided+0.024
 remembered+0.023
 never+0.023
Neg logit contributions
igsaw-0.040
ummy-0.035
 types-0.035
umm-0.034
opl-0.034
Token said
Token ID531
Feature activation-5.83e-04

Pos logit contributions
 unable+0.198
pped+0.178
 managed+0.154
 never+0.154
 stumbled+0.148
Neg logit contributions
 types-0.383
 new-0.364
aj-0.362
igsaw-0.359
 coll-0.356
Token,
Token ID11
Feature activation+2.92e-02

Pos logit contributions
igsaw+0.001
ummy+0.001
 types+0.001
umm+0.001
opl+0.001
Neg logit contributions
 managed-0.001
 tried-0.001
 decided-0.001
 remembered-0.001
 never-0.001
Token "
Token ID366
Feature activation+2.87e-02

Pos logit contributions
 managed+0.063
 decided+0.061
 remembered+0.061
 tried+0.060
 never+0.059
Neg logit contributions
igsaw-0.042
ummy-0.038
 types-0.034
 new-0.034
 batch-0.033
TokenLet
Token ID5756
Feature activation+6.74e-02

Pos logit contributions
 unable+0.049
 managed+0.046
 wouldn+0.046
 dropped+0.045
 remembered+0.045
Neg logit contributions
igsaw-0.050
 types-0.050
umm-0.048
Whe-0.047
ay-0.046
Token's
Token ID338
Feature activation-9.22e-04

Pos logit contributions
 managed+0.121
 unable+0.117
 remembered+0.115
 tried+0.114
'd+0.113
Neg logit contributions
igsaw-0.119
ummy-0.112
 types-0.106
ay-0.103
 new-0.102
TokenTim
Token ID14967
Feature activation+6.66e-02

Pos logit contributions
igsaw+0.088
ummy+0.079
 types+0.078
ay+0.075
 batch+0.074
Neg logit contributions
 managed-0.078
 remembered-0.077
 tried-0.075
 wouldn-0.075
 unable-0.074
Token had
Token ID550
Feature activation+9.71e-02

Pos logit contributions
 unable+0.073
 managed+0.063
 wouldn+0.063
pped+0.063
'd+0.060
Neg logit contributions
 types-0.152
igsaw-0.141
 batch-0.138
ummy-0.137
 new-0.135
Token an
Token ID281
Feature activation-1.34e-01

Pos logit contributions
 managed+0.171
 tried+0.171
 decided+0.170
 remembered+0.166
 never+0.163
Neg logit contributions
igsaw-0.194
ummy-0.169
 types-0.164
umm-0.160
opl-0.159
Token idea
Token ID2126
Feature activation+5.07e-02

Pos logit contributions
igsaw+0.183
ummy+0.145
umm+0.140
 types+0.137
aj+0.131
Neg logit contributions
 tried-0.320
 decided-0.317
 managed-0.314
 remembered-0.309
 never-0.306
Token.
Token ID13
Feature activation+2.08e-02

Pos logit contributions
 managed+0.106
 remembered+0.106
'd+0.102
 stumbled+0.100
 accidentally+0.100
Neg logit contributions
igsaw-0.065
ummy-0.062
 new-0.059
 types-0.059
 art-0.056
Token He
Token ID679
Feature activation+1.76e-01

Pos logit contributions
 tried+0.026
 decided+0.025
 managed+0.025
 could+0.023
 remembered+0.023
Neg logit contributions
igsaw-0.053
opl-0.046
ummy-0.046
 bou-0.046
 types-0.045
Token said
Token ID531
Feature activation+1.17e-03

Pos logit contributions
 unable+0.184
pped+0.162
 managed+0.150
 never+0.150
 stumbled+0.141
Neg logit contributions
 types-0.410
igsaw-0.397
 new-0.378
aj-0.378
 coll-0.376
Token to
Token ID284
Feature activation+4.23e-02

Pos logit contributions
 managed+0.002
 tried+0.002
 decided+0.002
 remembered+0.002
 never+0.002
Neg logit contributions
igsaw-0.002
ummy-0.002
 types-0.002
umm-0.002
opl-0.002
Token the
Token ID262
Feature activation-1.42e-01

Pos logit contributions
 tried+0.068
 decided+0.065
 never+0.065
 managed+0.065
 remembered+0.063
Neg logit contributions
igsaw-0.095
ummy-0.082
umm-0.082
 types-0.079
opl-0.079
Token tree
Token ID5509
Feature activation+3.30e-02

Pos logit contributions
igsaw+0.270
ummy+0.222
umm+0.212
 types+0.210
aj+0.210
Neg logit contributions
 tried-0.285
 decided-0.281
 managed-0.275
 remembered-0.274
 never-0.273
Token,
Token ID11
Feature activation+2.69e-02

Pos logit contributions
 managed+0.056
 tried+0.055
 decided+0.054
 remembered+0.054
 stumbled+0.052
Neg logit contributions
igsaw-0.066
ummy-0.058
 types-0.057
 batch-0.054
opl-0.053
Tokeno
Token ID78
Feature activation+6.51e-02

Pos logit contributions
 managed+0.074
 unable+0.072
 tried+0.071
 wouldn+0.070
 never+0.069
Neg logit contributions
igsaw-0.088
ummy-0.084
 types-0.082
aj-0.081
 batch-0.076
Token had
Token ID550
Feature activation+1.07e-01

Pos logit contributions
 unable+0.068
 managed+0.068
pped+0.066
'd+0.064
 wouldn+0.063
Neg logit contributions
igsaw-0.140
 types-0.133
ummy-0.132
 batch-0.131
 bou-0.129
Token an
Token ID281
Feature activation-1.30e-01

Pos logit contributions
 managed+0.111
 tried+0.108
 decided+0.108
 remembered+0.106
 wouldn+0.101
Neg logit contributions
igsaw-0.276
ummy-0.247
 types-0.245
 batch-0.236
opl-0.235
Token idea
Token ID2126
Feature activation+4.90e-02

Pos logit contributions
igsaw+0.172
ummy+0.138
umm+0.132
 types+0.130
opl+0.124
Neg logit contributions
 tried-0.314
 decided-0.311
 managed-0.311
 remembered-0.306
 never-0.302
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
 managed+0.101
 remembered+0.099
 accidentally+0.098
 stumbled+0.096
 decided+0.095
Neg logit contributions
igsaw-0.066
ummy-0.058
 new-0.058
 types-0.055
ay-0.052
Token He
Token ID679
Feature activation+1.76e-01

Pos logit contributions
 decided+0.015
 tried+0.015
 managed+0.014
 could+0.014
 remembered+0.014
Neg logit contributions
igsaw-0.035
opl-0.033
 bou-0.031
 coll-0.031
 batch-0.031
Token found
Token ID1043
Feature activation+2.26e-02

Pos logit contributions
 unable+0.183
pped+0.178
 never+0.153
Please+0.150
 stumbled+0.146
Neg logit contributions
 types-0.377
igsaw-0.361
 bou-0.355
 new-0.354
aj-0.348
Token some
Token ID617
Feature activation-8.94e-02

Pos logit contributions
 tried+0.032
 decided+0.030
 managed+0.029
 never+0.029
 remembered+0.028
Neg logit contributions
igsaw-0.058
umm-0.051
opl-0.051
ummy-0.050
aj-0.049
Token tape
Token ID9154
Feature activation+1.40e-02

Pos logit contributions
igsaw+0.156
umm+0.132
opl+0.130
ummy+0.128
aj+0.121
Neg logit contributions
 tried-0.188
 decided-0.184
 managed-0.177
 could-0.174
 remembered-0.174
Token and
Token ID290
Feature activation+1.11e-02

Pos logit contributions
 managed+0.020
 tried+0.019
 remembered+0.019
 decided+0.019
 stumbled+0.018
Neg logit contributions
igsaw-0.032
ummy-0.029
 types-0.028
opl-0.027
 batch-0.027
Token told
Token ID1297
Feature activation-3.38e-02

Pos logit contributions
 managed+0.013
 tried+0.013
 decided+0.013
 remembered+0.013
 never+0.012
Neg logit contributions
igsaw-0.028
ummy-0.025
 types-0.025
umm-0.024
opl-0.024
TokenTom
Token ID13787
Feature activation+5.80e-02

Pos logit contributions
igsaw+0.086
opl+0.074
ummy+0.072
 types+0.071
ay+0.070
Neg logit contributions
 managed-0.068
 tried-0.068
 remembered-0.067
 wouldn-0.067
 decided-0.066
Token had
Token ID550
Feature activation+1.03e-01

Pos logit contributions
 unable+0.074
pped+0.073
 managed+0.067
Please+0.067
'd+0.065
Neg logit contributions
 types-0.096
 new-0.095
 coll-0.091
aj-0.091
 seats-0.091
Token an
Token ID281
Feature activation-1.32e-01

Pos logit contributions
 managed+0.175
 tried+0.172
 decided+0.172
 remembered+0.169
 wouldn+0.166
Neg logit contributions
igsaw-0.199
ummy-0.171
 types-0.170
 batch-0.161
opl-0.161
Token idea
Token ID2126
Feature activation+5.12e-02

Pos logit contributions
igsaw+0.159
ummy+0.125
 types+0.119
umm+0.113
opl+0.112
Neg logit contributions
 managed-0.334
 tried-0.333
 decided-0.332
 remembered-0.328
 never-0.322
Token.
Token ID13
Feature activation+1.63e-02

Pos logit contributions
 managed+0.109
 remembered+0.108
 accidentally+0.106
 stumbled+0.104
'd+0.103
Neg logit contributions
igsaw-0.062
 new-0.058
ummy-0.057
 types-0.052
ay-0.051
Token He
Token ID679
Feature activation+1.76e-01

Pos logit contributions
 tried+0.019
 decided+0.018
 managed+0.018
 could+0.017
 remembered+0.017
Neg logit contributions
igsaw-0.042
 bou-0.038
opl-0.038
ummy-0.038
 types-0.037
Token started
Token ID2067
Feature activation+4.24e-02

Pos logit contributions
 unable+0.178
pped+0.168
Please+0.145
 managed+0.143
 stumbled+0.142
Neg logit contributions
 types-0.385
igsaw-0.363
 new-0.359
aj-0.355
 coll-0.355
Token to
Token ID284
Feature activation+4.89e-02

Pos logit contributions
 managed+0.071
 decided+0.069
'd+0.067
 remembered+0.067
 tried+0.066
Neg logit contributions
igsaw-0.084
ummy-0.073
 types-0.073
 coll-0.071
 batch-0.071
Token whistle
Token ID16121
Feature activation+5.53e-04

Pos logit contributions
 managed+0.081
 remembered+0.077
 decided+0.077
 unable+0.076
 stumbled+0.076
Neg logit contributions
igsaw-0.092
 difference-0.083
 types-0.083
 art-0.079
 bou-0.079
Token very
Token ID845
Feature activation+3.43e-02

Pos logit contributions
 tried+0.001
 managed+0.001
 decided+0.001
 remembered+0.001
 never+0.001
Neg logit contributions
igsaw-0.001
ummy-0.001
umm-0.001
 types-0.001
opl-0.001
Token loud
Token ID7812
Feature activation-6.94e-02

Pos logit contributions
 managed+0.064
 tried+0.063
 remembered+0.062
 decided+0.062
 stumbled+0.061
Neg logit contributions
igsaw-0.062
ummy-0.054
 types-0.051
 new-0.050
opl-0.049
Token he
Token ID339
Feature activation+1.33e-01

Pos logit contributions
 managed+0.021
'd+0.021
 never+0.020
 decided+0.020
 unable+0.020
Neg logit contributions
igsaw-0.015
ummy-0.013
 new-0.013
ay-0.012
 types-0.012
Token had
Token ID550
Feature activation+9.88e-02

Pos logit contributions
 unable+0.137
pped+0.119
pper+0.115
 wouldn+0.115
 managed+0.112
Neg logit contributions
 new-0.267
 types-0.265
igsaw-0.257
 gym-0.253
aj-0.249
Token an
Token ID281
Feature activation-1.28e-01

Pos logit contributions
 managed+0.179
 tried+0.177
 decided+0.176
 remembered+0.174
 wouldn+0.170
Neg logit contributions
igsaw-0.184
ummy-0.157
 types-0.153
opl-0.147
umm-0.145
Token idea
Token ID2126
Feature activation+4.59e-02

Pos logit contributions
igsaw+0.177
ummy+0.142
umm+0.137
 types+0.134
aj+0.131
Neg logit contributions
 tried-0.301
 decided-0.300
 managed-0.295
 remembered-0.292
 never-0.289
Token.
Token ID13
Feature activation+1.97e-02

Pos logit contributions
 managed+0.092
 remembered+0.092
'd+0.087
 decided+0.087
 unable+0.086
Neg logit contributions
igsaw-0.065
ummy-0.058
 new-0.057
 types-0.055
 art-0.053
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 tried+0.026
 decided+0.025
 managed+0.025
 remembered+0.024
 could+0.023
Neg logit contributions
igsaw-0.048
ummy-0.043
opl-0.042
 types-0.042
 bou-0.042
Token decided
Token ID3066
Feature activation+5.67e-02

Pos logit contributions
 unable+0.146
pped+0.132
 never+0.113
 managed+0.111
Please+0.096
Neg logit contributions
 types-0.427
igsaw-0.416
 new-0.392
 bou-0.391
aj-0.389
Token to
Token ID284
Feature activation+4.78e-02

Pos logit contributions
 managed+0.108
'd+0.104
 remembered+0.103
 tried+0.101
pped+0.101
Neg logit contributions
igsaw-0.082
 new-0.082
 art-0.072
ummy-0.072
 types-0.071
Token do
Token ID466
Feature activation+3.37e-02

Pos logit contributions
 remembered+0.080
 managed+0.079
 unable+0.077
 decided+0.076
 stumbled+0.074
Neg logit contributions
igsaw-0.096
ummy-0.082
 types-0.082
 difference-0.079
 art-0.078
Token a
Token ID257
Feature activation-8.38e-02

Pos logit contributions
 remembered+0.061
 managed+0.060
'd+0.060
 decided+0.060
 stumbled+0.057
Neg logit contributions
igsaw-0.058
 new-0.053
 art-0.050
 math-0.048
 types-0.048
Token big
Token ID1263
Feature activation-1.33e-01

Pos logit contributions
igsaw+0.115
ummy+0.091
 types+0.087
 batch+0.082
umm+0.082
Neg logit contributions
 managed-0.198
 decided-0.196
 tried-0.196
 remembered-0.194
 never-0.190
TokenBob
Token ID18861
Feature activation+7.18e-02

Pos logit contributions
igsaw+0.079
ummy+0.072
 types+0.070
opl+0.068
 Other+0.068
Neg logit contributions
 managed-0.085
 remembered-0.083
 wouldn-0.082
 tried-0.082
 accidentally-0.080
Token had
Token ID550
Feature activation+1.00e-01

Pos logit contributions
 unable+0.070
 managed+0.064
pped+0.063
 wouldn+0.059
'd+0.059
Neg logit contributions
 types-0.158
igsaw-0.151
 seats-0.149
 goals-0.148
aj-0.147
Token an
Token ID281
Feature activation-1.29e-01

Pos logit contributions
 managed+0.166
 tried+0.166
 decided+0.164
 remembered+0.161
 never+0.158
Neg logit contributions
igsaw-0.211
ummy-0.185
 types-0.180
umm-0.177
opl-0.175
Token idea
Token ID2126
Feature activation+5.27e-02

Pos logit contributions
igsaw+0.175
ummy+0.140
umm+0.134
 types+0.132
aj+0.125
Neg logit contributions
 tried-0.310
 decided-0.307
 managed-0.306
 remembered-0.301
 never-0.298
Token.
Token ID13
Feature activation+1.75e-02

Pos logit contributions
 managed+0.112
 remembered+0.110
 accidentally+0.107
 stumbled+0.105
 unable+0.105
Neg logit contributions
igsaw-0.060
 new-0.059
ummy-0.056
 types-0.055
 art-0.052
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 tried+0.022
 decided+0.021
 managed+0.021
 remembered+0.019
 could+0.019
Neg logit contributions
igsaw-0.045
opl-0.039
ummy-0.039
 bou-0.039
 coll-0.038
Token went
Token ID1816
Feature activation+1.58e-02

Pos logit contributions
 unable+0.153
pped+0.130
 managed+0.129
 never+0.127
 stumbled+0.114
Neg logit contributions
 types-0.427
igsaw-0.419
aj-0.400
 new-0.399
 bou-0.394
Token to
Token ID284
Feature activation+3.59e-02

Pos logit contributions
 managed+0.024
 tried+0.024
 decided+0.024
 remembered+0.024
 never+0.023
Neg logit contributions
igsaw-0.035
ummy-0.030
 types-0.030
opl-0.029
umm-0.029
Token the
Token ID262
Feature activation-1.44e-01

Pos logit contributions
 tried+0.051
 never+0.045
 decided+0.044
 accidentally+0.043
 managed+0.043
Neg logit contributions
igsaw-0.091
umm-0.089
ummy-0.085
aj-0.082
opl-0.079
Token town
Token ID3240
Feature activation-7.99e-03

Pos logit contributions
igsaw+0.257
ummy+0.218
 types+0.211
aj+0.210
umm+0.207
Neg logit contributions
 tried-0.292
 decided-0.287
 never-0.283
 managed-0.278
 remembered-0.277
Token and
Token ID290
Feature activation+2.33e-02

Pos logit contributions
igsaw+0.020
ummy+0.018
umm+0.018
opl+0.017
 types+0.017
Neg logit contributions
 tried-0.010
 decided-0.010
 managed-0.010
 never-0.010
 remembered-0.009
Token\n
Token ID198
Feature activation-4.59e-02

Pos logit contributions
igsaw+0.102
umm+0.089
 types+0.088
ummy+0.087
opl+0.086
Neg logit contributions
 tried-0.084
 decided-0.083
 managed-0.082
 never-0.081
 remembered-0.079
TokenGeorge
Token ID20191
Feature activation+9.19e-02

Pos logit contributions
igsaw+0.089
ummy+0.079
 types+0.075
 batch+0.072
umm+0.071
Neg logit contributions
 wouldn-0.080
 managed-0.079
 tried-0.078
 remembered-0.077
 dropped-0.074
Token was
Token ID373
Feature activation+7.84e-02

Pos logit contributions
 unable+0.103
 managed+0.098
pped+0.097
 never+0.095
 wouldn+0.094
Neg logit contributions
igsaw-0.211
 types-0.200
ummy-0.196
 batch-0.191
 bou-0.183
Token confused
Token ID10416
Feature activation-2.75e-02

Pos logit contributions
 managed+0.113
 decided+0.112
 tried+0.112
 remembered+0.110
 wouldn+0.108
Neg logit contributions
igsaw-0.170
ummy-0.150
 types-0.149
 batch-0.145
opl-0.144
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
igsaw+0.057
ummy+0.053
 types+0.049
 new+0.047
umm+0.046
Neg logit contributions
 managed-0.047
 tried-0.044
 decided-0.044
 remembered-0.044
 stumbled-0.043
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 tried+0.028
 managed+0.028
 decided+0.028
 remembered+0.027
 never+0.026
Neg logit contributions
igsaw-0.037
ummy-0.033
 types-0.032
opl-0.031
umm-0.031
Token said
Token ID531
Feature activation-2.49e-03

Pos logit contributions
pped+0.199
 unable+0.180
 managed+0.153
pper+0.151
 risking+0.150
Neg logit contributions
 types-0.366
igsaw-0.353
 new-0.346
 batch-0.338
 Grand-0.333
Token,
Token ID11
Feature activation+2.95e-02

Pos logit contributions
igsaw+0.005
ummy+0.004
 types+0.004
umm+0.004
opl+0.004
Neg logit contributions
 managed-0.004
 tried-0.004
 decided-0.004
 remembered-0.004
 never-0.004
Token "
Token ID366
Feature activation+2.90e-02

Pos logit contributions
 managed+0.068
 remembered+0.065
 decided+0.064
 tried+0.064
 stumbled+0.063
Neg logit contributions
igsaw-0.037
ummy-0.034
 new-0.031
 types-0.030
 bou-0.029
TokenWhat
Token ID2061
Feature activation+9.60e-02

Pos logit contributions
 managed+0.045
 remembered+0.044
 tried+0.044
 dropped+0.044
 unable+0.043
Neg logit contributions
igsaw-0.058
 types-0.054
umm-0.052
Whe-0.052
ummy-0.050
Token's
Token ID338
Feature activation+4.59e-03

Pos logit contributions
 managed+0.146
 tried+0.146
 decided+0.144
 remembered+0.141
 never+0.137
Neg logit contributions
igsaw-0.212
ummy-0.187
 types-0.183
umm-0.178
opl-0.178
Tokenmy
Token ID1820
Feature activation+1.04e-01

Pos logit contributions
 managed+0.048
 tried+0.047
 decided+0.047
 remembered+0.046
 never+0.045
Neg logit contributions
igsaw-0.060
ummy-0.054
 types-0.053
opl-0.050
 batch-0.050
Token had
Token ID550
Feature activation+9.74e-02

Pos logit contributions
 managed+0.112
 tried+0.106
 decided+0.103
 remembered+0.102
 unable+0.101
Neg logit contributions
igsaw-0.260
ummy-0.238
 types-0.238
 batch-0.227
 bou-0.225
Token an
Token ID281
Feature activation-1.35e-01

Pos logit contributions
 decided+0.150
 tried+0.149
 managed+0.147
 never+0.146
 remembered+0.142
Neg logit contributions
igsaw-0.233
ummy-0.209
umm-0.205
opl-0.195
 types-0.195
Token idea
Token ID2126
Feature activation+5.21e-02

Pos logit contributions
igsaw+0.199
ummy+0.159
umm+0.158
aj+0.154
 types+0.148
Neg logit contributions
 decided-0.308
 tried-0.305
 never-0.298
 managed-0.296
 remembered-0.296
Token.
Token ID13
Feature activation+1.62e-02

Pos logit contributions
 managed+0.101
 remembered+0.099
 accidentally+0.098
 stumbled+0.095
 dropped+0.094
Neg logit contributions
igsaw-0.080
ummy-0.074
 types-0.073
 new-0.069
 art-0.066
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 tried+0.016
 decided+0.016
 could+0.015
 managed+0.014
 accidentally+0.014
Neg logit contributions
igsaw-0.043
opl-0.041
 bou-0.041
 difference-0.039
aj-0.039
Token asked
Token ID1965
Feature activation+7.96e-03

Pos logit contributions
 unable+0.171
pped+0.156
 managed+0.154
 never+0.150
 stumbled+0.140
Neg logit contributions
igsaw-0.406
 types-0.405
ummy-0.378
aj-0.378
 batch-0.376
Token them
Token ID606
Feature activation+9.55e-02

Pos logit contributions
 decided+0.011
 tried+0.011
 managed+0.010
 remembered+0.010
 never+0.010
Neg logit contributions
igsaw-0.020
umm-0.018
opl-0.018
ummy-0.018
aj-0.017
Token what
Token ID644
Feature activation+4.80e-02

Pos logit contributions
 decided+0.137
 tried+0.136
 managed+0.135
 remembered+0.132
 never+0.128
Neg logit contributions
igsaw-0.223
ummy-0.195
umm-0.195
opl-0.194
 types-0.190
Token they
Token ID484
Feature activation+1.22e-01

Pos logit contributions
 tried+0.056
 managed+0.056
 decided+0.055
 remembered+0.053
 never+0.052
Neg logit contributions
igsaw-0.124
ummy-0.111
 types-0.108
umm-0.107
opl-0.107
Token could
Token ID714
Feature activation+1.18e-01

Pos logit contributions
 unable+0.130
 managed+0.110
pped+0.108
 accidentally+0.101
 never+0.101
Neg logit contributions
 types-0.292
 kinds-0.267
igsaw-0.260
 sorts-0.260
 parties-0.259
TokenTim
Token ID14967
Feature activation+5.25e-02

Pos logit contributions
igsaw+0.096
ummy+0.086
 types+0.083
 batch+0.080
ay+0.079
Neg logit contributions
 managed-0.084
 tried-0.082
 remembered-0.081
 wouldn-0.080
 stumbled-0.079
Tokenmy
Token ID1820
Feature activation+1.12e-01

Pos logit contributions
 unable+0.108
 managed+0.108
 stumbled+0.106
 possibly+0.103
 dropped+0.102
Neg logit contributions
igsaw-0.061
 types-0.061
ummy-0.058
aj-0.057
ay-0.053
Token was
Token ID373
Feature activation+7.10e-02

Pos logit contributions
pped+0.110
 unable+0.110
 risking+0.109
 managed+0.099
 possibly+0.095
Neg logit contributions
 batch-0.204
 types-0.200
 new-0.200
ummy-0.195
aj-0.195
Token sad
Token ID6507
Feature activation-2.19e-02

Pos logit contributions
 managed+0.114
'd+0.111
 decided+0.109
 wouldn+0.108
 tried+0.106
Neg logit contributions
igsaw-0.138
ummy-0.121
 new-0.121
 types-0.120
opl-0.119
Token.
Token ID13
Feature activation+1.35e-02

Pos logit contributions
igsaw+0.048
ummy+0.045
 types+0.041
umm+0.040
 new+0.040
Neg logit contributions
 managed-0.034
 decided-0.033
 tried-0.032
 remembered-0.032
'd-0.031
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 managed+0.025
 tried+0.024
 remembered+0.024
 decided+0.024
 dropped+0.023
Neg logit contributions
igsaw-0.024
ummy-0.022
 types-0.021
umm-0.020
opl-0.020
Token really
Token ID1107
Feature activation+8.33e-02

Pos logit contributions
pped+0.215
ppers+0.182
pper+0.181
iers+0.174
ging+0.173
Neg logit contributions
 new-0.333
 old-0.315
 favorite-0.313
 Grand-0.307
 original-0.299
Token wanted
Token ID2227
Feature activation+4.86e-02

Pos logit contributions
 managed+0.077
pped+0.076
 unable+0.065
 stumbled+0.062
'd+0.061
Neg logit contributions
igsaw-0.194
 types-0.190
 new-0.188
 goals-0.182
ummy-0.179
Token to
Token ID284
Feature activation+4.93e-02

Pos logit contributions
 managed+0.081
 remembered+0.078
 decided+0.078
 tried+0.078
'd+0.076
Neg logit contributions
igsaw-0.098
ummy-0.088
 types-0.085
 new-0.082
 batch-0.081
Token wear
Token ID5806
Feature activation+7.45e-04

Pos logit contributions
 managed+0.086
 remembered+0.083
 decided+0.083
 tried+0.081
 unable+0.081
Neg logit contributions
igsaw-0.096
ummy-0.083
 types-0.081
opl-0.079
 difference-0.079
Token the
Token ID262
Feature activation-1.53e-01

Pos logit contributions
 managed+0.001
 decided+0.001
 tried+0.001
 remembered+0.001
 never+0.001
Neg logit contributions
igsaw-0.002
ummy-0.001
 types-0.001
opl-0.001
umm-0.001
Token squirrel
Token ID33039
Feature activation+7.38e-02

Pos logit contributions
igsaw+0.279
ummy+0.243
 types+0.237
umm+0.235
opl+0.234
Neg logit contributions
 tried-0.176
 decided-0.172
 remembered-0.169
 managed-0.166
 never-0.162
Token had
Token ID550
Feature activation+1.02e-01

Pos logit contributions
 managed+0.096
 unable+0.092
 never+0.092
 decided+0.091
 stumbled+0.091
Neg logit contributions
igsaw-0.160
 types-0.145
ummy-0.145
 batch-0.143
 bou-0.138
Token an
Token ID281
Feature activation-1.32e-01

Pos logit contributions
 managed+0.169
 tried+0.169
 decided+0.167
 remembered+0.166
 wouldn+0.161
Neg logit contributions
igsaw-0.204
ummy-0.177
 types-0.176
opl-0.167
 batch-0.167
Token idea
Token ID2126
Feature activation+5.15e-02

Pos logit contributions
igsaw+0.184
ummy+0.145
umm+0.141
 types+0.137
aj+0.136
Neg logit contributions
 tried-0.310
 decided-0.309
 managed-0.305
 remembered-0.302
 never-0.298
Token.
Token ID13
Feature activation+1.50e-02

Pos logit contributions
 remembered+0.101
 managed+0.099
 accidentally+0.098
 stumbled+0.096
'd+0.095
Neg logit contributions
igsaw-0.079
ummy-0.073
 types-0.070
 new-0.066
 art-0.064
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 tried+0.023
 decided+0.022
 managed+0.022
 could+0.021
 remembered+0.021
Neg logit contributions
igsaw-0.034
opl-0.030
ummy-0.029
 types-0.029
umm-0.029
Token told
Token ID1297
Feature activation-2.37e-02

Pos logit contributions
 unable+0.176
pped+0.167
 never+0.165
 managed+0.158
 stumbled+0.152
Neg logit contributions
igsaw-0.404
 types-0.399
 batch-0.377
ummy-0.374
 bou-0.368
Token the
Token ID262
Feature activation-1.24e-01

Pos logit contributions
igsaw+0.054
umm+0.049
ummy+0.049
aj+0.049
opl+0.047
Neg logit contributions
 tried-0.036
 decided-0.034
 never-0.034
 managed-0.033
 didn-0.032
Token mouse
Token ID10211
Feature activation+7.53e-02

Pos logit contributions
igsaw+0.314
ummy+0.275
opl+0.269
umm+0.264
aj+0.264
Neg logit contributions
 tried-0.165
 decided-0.160
 remembered-0.153
 managed-0.153
 never-0.148
Token:
Token ID25
Feature activation+4.22e-02

Pos logit contributions
 managed+0.127
 tried+0.124
 remembered+0.123
pped+0.122
 stumbled+0.122
Neg logit contributions
igsaw-0.141
 types-0.127
ummy-0.124
 new-0.118
 art-0.114
Token "
Token ID366
Feature activation+2.47e-02

Pos logit contributions
 managed+0.078
 tried+0.078
 decided+0.077
 remembered+0.076
 never+0.074
Neg logit contributions
igsaw-0.079
ummy-0.068
 types-0.066
umm-0.064
opl-0.064
TokenTim
Token ID14967
Feature activation+7.04e-02

Pos logit contributions
igsaw+0.080
ummy+0.070
 types+0.067
ay+0.066
opl+0.066
Neg logit contributions
 managed-0.083
 wouldn-0.082
 remembered-0.081
 tried-0.078
'd-0.077
Token had
Token ID550
Feature activation+1.00e-01

Pos logit contributions
 unable+0.078
 managed+0.069
pped+0.068
'd+0.065
 wouldn+0.062
Neg logit contributions
 types-0.151
 seats-0.138
ummy-0.136
udd-0.136
 batch-0.136
Token an
Token ID281
Feature activation-1.30e-01

Pos logit contributions
 managed+0.155
 tried+0.155
 decided+0.153
 remembered+0.150
 never+0.146
Neg logit contributions
igsaw-0.222
ummy-0.196
 types-0.190
umm-0.188
opl-0.186
Token idea
Token ID2126
Feature activation+4.88e-02

Pos logit contributions
igsaw+0.165
ummy+0.131
 types+0.123
umm+0.121
opl+0.117
Neg logit contributions
 tried-0.321
 managed-0.319
 decided-0.319
 remembered-0.314
 never-0.309
Token.
Token ID13
Feature activation+1.52e-02

Pos logit contributions
 managed+0.101
 remembered+0.098
 accidentally+0.096
 unable+0.095
 stumbled+0.094
Neg logit contributions
igsaw-0.064
ummy-0.060
 types-0.059
 new-0.058
 art-0.056
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 tried+0.018
 decided+0.017
 managed+0.017
 could+0.016
 remembered+0.015
Neg logit contributions
igsaw-0.040
opl-0.035
 bou-0.035
ummy-0.034
aj-0.034
Token brought
Token ID3181
Feature activation+1.17e-02

Pos logit contributions
 unable+0.189
pped+0.170
 managed+0.161
 never+0.152
 stumbled+0.150
Neg logit contributions
 types-0.394
igsaw-0.380
 batch-0.367
 coll-0.363
 bou-0.362
Token his
Token ID465
Feature activation-3.08e-02

Pos logit contributions
 tried+0.018
 decided+0.017
 managed+0.017
 never+0.016
 dropped+0.016
Neg logit contributions
igsaw-0.028
umm-0.025
ummy-0.025
opl-0.025
aj-0.024
Token friends
Token ID2460
Feature activation+5.37e-02

Pos logit contributions
igsaw+0.054
ummy+0.046
 types+0.044
umm+0.043
opl+0.043
Neg logit contributions
 managed-0.061
 tried-0.061
 decided-0.061
 remembered-0.060
 never-0.059
Token to
Token ID284
Feature activation+3.29e-02

Pos logit contributions
 tried+0.070
 decided+0.068
 managed+0.067
 remembered+0.065
 never+0.065
Neg logit contributions
igsaw-0.133
umm-0.125
ummy-0.123
opl-0.120
 types-0.118
Token see
Token ID766
Feature activation+3.07e-02

Pos logit contributions
 tried+0.053
 decided+0.051
 managed+0.050
 never+0.050
 remembered+0.049
Neg logit contributions
igsaw-0.073
umm-0.065
ummy-0.065
opl-0.062
 types-0.061
Token\n
Token ID198
Feature activation-5.10e-02

Pos logit contributions
igsaw+0.102
ummy+0.087
 types+0.085
umm+0.084
opl+0.082
Neg logit contributions
 tried-0.098
 managed-0.098
 decided-0.097
 remembered-0.095
 never-0.094
TokenJoe
Token ID19585
Feature activation+8.10e-02

Pos logit contributions
igsaw+0.105
ummy+0.093
 types+0.090
ay+0.088
opl+0.086
Neg logit contributions
 wouldn-0.080
 tried-0.078
 managed-0.078
 unable-0.077
 remembered-0.077
Token was
Token ID373
Feature activation+7.13e-02

Pos logit contributions
 unable+0.099
pped+0.096
 managed+0.093
 never+0.090
'd+0.086
Neg logit contributions
igsaw-0.155
 types-0.151
ummy-0.150
 batch-0.150
 bou-0.144
Token sad
Token ID6507
Feature activation-2.52e-02

Pos logit contributions
 managed+0.106
'd+0.104
 decided+0.103
 wouldn+0.102
 tried+0.101
Neg logit contributions
igsaw-0.147
ummy-0.131
 new-0.130
 types-0.130
 batch-0.127
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
igsaw+0.052
ummy+0.050
 types+0.045
 new+0.045
 difference+0.044
Neg logit contributions
 managed-0.041
 decided-0.038
 tried-0.038
'd-0.038
 remembered-0.038
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 managed+0.027
 remembered+0.027
 tried+0.026
 decided+0.026
pped+0.026
Neg logit contributions
igsaw-0.023
ummy-0.021
 types-0.019
umm-0.018
 new-0.018
Token shook
Token ID14682
Feature activation+3.66e-02

Pos logit contributions
pped+0.213
 unable+0.166
ppers+0.164
ppings+0.164
pper+0.163
Neg logit contributions
 new-0.351
 old-0.326
 types-0.325
 Grand-0.311
 favorite-0.309
Token his
Token ID465
Feature activation-3.59e-02

Pos logit contributions
 managed+0.068
'd+0.066
 decided+0.062
 tried+0.062
 remembered+0.062
Neg logit contributions
igsaw-0.066
ummy-0.058
 types-0.057
 new-0.056
opl-0.054
Token head
Token ID1182
Feature activation+3.24e-02

Pos logit contributions
igsaw+0.085
ummy+0.075
 types+0.075
umm+0.073
opl+0.073
Neg logit contributions
 tried-0.050
 remembered-0.049
 managed-0.048
 decided-0.048
 never-0.047
Token.
Token ID13
Feature activation+5.20e-03

Pos logit contributions
 managed+0.049
 tried+0.048
 remembered+0.048
 decided+0.046
 stumbled+0.045
Neg logit contributions
igsaw-0.065
ummy-0.062
 new-0.057
 types-0.056
ay-0.056
Token\n
Token ID198
Feature activation-4.90e-02

Pos logit contributions
 managed+0.010
 remembered+0.010
 tried+0.010
 decided+0.010
 never+0.010
Neg logit contributions
igsaw-0.009
ummy-0.008
 types-0.007
umm-0.007
opl-0.007
Token rubber
Token ID14239
Feature activation-6.53e-02

Pos logit contributions
igsaw+0.209
ummy+0.176
 types+0.171
umm+0.167
opl+0.167
Neg logit contributions
 tried-0.235
 decided-0.232
 managed-0.231
 remembered-0.228
 never-0.224
Token had
Token ID550
Feature activation+1.00e-01

Pos logit contributions
 types+0.080
 new+0.076
 coll+0.074
 batch+0.073
 chip+0.072
Neg logit contributions
 managed-0.121
 unable-0.119
'd-0.118
 almost-0.113
 nearly-0.113
Token an
Token ID281
Feature activation-1.31e-01

Pos logit contributions
 managed+0.142
 tried+0.142
 remembered+0.140
 decided+0.140
 wouldn+0.137
Neg logit contributions
igsaw-0.211
 types-0.188
ummy-0.183
 new-0.178
 art-0.176
Token idea
Token ID2126
Feature activation+5.13e-02

Pos logit contributions
igsaw+0.166
ummy+0.131
 types+0.125
umm+0.121
opl+0.118
Neg logit contributions
 tried-0.321
 managed-0.320
 decided-0.319
 remembered-0.314
 never-0.310
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
 remembered+0.100
 managed+0.098
 accidentally+0.097
 stumbled+0.093
pped+0.093
Neg logit contributions
igsaw-0.061
 new-0.059
 types-0.058
ummy-0.056
 art-0.054
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 tried+0.025
 decided+0.024
 managed+0.024
 remembered+0.023
 could+0.022
Neg logit contributions
igsaw-0.045
ummy-0.040
opl-0.039
 types-0.039
 bou-0.039
Token started
Token ID2067
Feature activation+4.25e-02

Pos logit contributions
 unable+0.154
pped+0.146
 never+0.126
 managed+0.122
Please+0.113
Neg logit contributions
 types-0.416
igsaw-0.393
 bou-0.383
 new-0.383
 batch-0.378
Token to
Token ID284
Feature activation+4.74e-02

Pos logit contributions
 managed+0.070
 decided+0.069
'd+0.068
 remembered+0.068
 wouldn+0.067
Neg logit contributions
igsaw-0.083
 types-0.072
ummy-0.071
 coll-0.069
 new-0.069
Token bounce
Token ID20110
Feature activation-6.47e-04

Pos logit contributions
 managed+0.075
 remembered+0.072
 decided+0.072
 tried+0.070
 wouldn+0.069
Neg logit contributions
igsaw-0.098
 types-0.085
 difference-0.083
 bou-0.082
ummy-0.081
Token and
Token ID290
Feature activation+1.04e-02

Pos logit contributions
igsaw+0.001
ummy+0.001
 types+0.001
umm+0.001
opl+0.001
Neg logit contributions
 tried-0.001
 managed-0.001
 decided-0.001
 remembered-0.001
 never-0.001
Token make
Token ID787
Feature activation-1.25e-02

Pos logit contributions
 managed+0.015
 decided+0.015
 remembered+0.015
 tried+0.015
 never+0.014
Neg logit contributions
igsaw-0.023
 types-0.020
ummy-0.020
opl-0.019
 bou-0.019
TokenMax
Token ID11518
Feature activation+8.20e-02

Pos logit contributions
igsaw+0.078
ay+0.068
opl+0.068
ummy+0.067
 Other+0.067
Neg logit contributions
 managed-0.081
 decided-0.078
 wouldn-0.077
 remembered-0.076
 tried-0.075
Token had
Token ID550
Feature activation+1.01e-01

Pos logit contributions
 unable+0.118
pped+0.112
'd+0.104
 managed+0.098
 possibly+0.094
Neg logit contributions
 types-0.124
 new-0.120
udd-0.120
 seats-0.120
 gym-0.118
Token an
Token ID281
Feature activation-1.33e-01

Pos logit contributions
 managed+0.180
 decided+0.177
 tried+0.176
 remembered+0.174
'd+0.169
Neg logit contributions
igsaw-0.189
ummy-0.163
 types-0.162
 batch-0.154
opl-0.153
Token idea
Token ID2126
Feature activation+4.98e-02

Pos logit contributions
igsaw+0.164
ummy+0.130
 types+0.123
umm+0.119
opl+0.116
Neg logit contributions
 tried-0.330
 managed-0.329
 decided-0.328
 remembered-0.323
 never-0.318
Token.
Token ID13
Feature activation+1.76e-02

Pos logit contributions
 managed+0.107
 remembered+0.106
'd+0.103
 accidentally+0.100
 unable+0.100
Neg logit contributions
igsaw-0.056
 new-0.054
ummy-0.052
 types-0.051
 art-0.049
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 tried+0.025
 managed+0.025
 decided+0.024
 remembered+0.024
 never+0.023
Neg logit contributions
igsaw-0.041
ummy-0.036
 types-0.036
umm-0.035
opl-0.035
Token saw
Token ID2497
Feature activation-5.54e-03

Pos logit contributions
 unable+0.194
pped+0.168
 never+0.144
 managed+0.139
 stumbled+0.133
Neg logit contributions
 types-0.386
 new-0.360
aj-0.356
 coll-0.355
 batch-0.353
Token a
Token ID257
Feature activation-9.14e-02

Pos logit contributions
igsaw+0.014
umm+0.013
ummy+0.013
opl+0.012
 bou+0.012
Neg logit contributions
 tried-0.008
 decided-0.007
 accidentally-0.007
 managed-0.007
 remembered-0.007
Token small
Token ID1402
Feature activation-1.25e-01

Pos logit contributions
igsaw+0.146
ummy+0.124
 types+0.119
opl+0.117
umm+0.117
Neg logit contributions
 tried-0.195
 managed-0.194
 decided-0.192
 remembered-0.191
 accidentally-0.186
Token switch
Token ID5078
Feature activation-5.01e-03

Pos logit contributions
igsaw+0.235
ummy+0.203
 types+0.197
umm+0.193
opl+0.192
Neg logit contributions
 tried-0.230
 managed-0.230
 decided-0.228
 remembered-0.224
 never-0.218
Token on
Token ID319
Feature activation-3.87e-02

Pos logit contributions
igsaw+0.011
ummy+0.010
 types+0.010
opl+0.009
 batch+0.009
Neg logit contributions
 managed-0.007
 remembered-0.007
 tried-0.007
 decided-0.007
 stumbled-0.007
Token match
Token ID2872
Feature activation-8.88e-04

Pos logit contributions
igsaw+0.196
ummy+0.164
 types+0.159
 new+0.156
 art+0.153
Neg logit contributions
 managed-0.254
 decided-0.246
 remembered-0.242
 wouldn-0.240
 stumbled-0.239
Token had
Token ID550
Feature activation+1.02e-01

Pos logit contributions
igsaw+0.002
 types+0.002
ummy+0.002
 batch+0.002
 difference+0.002
Neg logit contributions
 managed-0.001
 unable-0.001
pped-0.001
'd-0.001
 remembered-0.001
Token an
Token ID281
Feature activation-1.32e-01

Pos logit contributions
 managed+0.154
 decided+0.152
 tried+0.152
 remembered+0.150
 wouldn+0.147
Neg logit contributions
igsaw-0.203
 types-0.184
ummy-0.180
 new-0.172
 coll-0.172
Token idea
Token ID2126
Feature activation+5.08e-02

Pos logit contributions
igsaw+0.162
ummy+0.128
 types+0.121
umm+0.117
opl+0.114
Neg logit contributions
 managed-0.328
 tried-0.328
 decided-0.326
 remembered-0.322
 never-0.317
Token.
Token ID13
Feature activation+1.97e-02

Pos logit contributions
 managed+0.103
 remembered+0.103
 accidentally+0.099
 unable+0.097
pped+0.096
Neg logit contributions
igsaw-0.058
 types-0.056
 new-0.056
ummy-0.056
 batch-0.051
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 tried+0.036
 managed+0.036
 decided+0.036
 remembered+0.035
 never+0.034
Neg logit contributions
igsaw-0.038
ummy-0.033
 types-0.031
opl-0.031
umm-0.031
Token asked
Token ID1965
Feature activation+7.47e-03

Pos logit contributions
 unable+0.174
pped+0.168
 managed+0.144
 never+0.142
 stumbled+0.134
Neg logit contributions
 types-0.398
igsaw-0.374
 batch-0.369
 bou-0.368
 new-0.367
Token a
Token ID257
Feature activation-8.37e-02

Pos logit contributions
 tried+0.012
 decided+0.012
 remembered+0.011
 managed+0.011
 never+0.011
Neg logit contributions
igsaw-0.018
umm-0.016
ummy-0.015
opl-0.015
aj-0.015
Token nearby
Token ID6716
Feature activation-4.78e-02

Pos logit contributions
igsaw+0.156
ummy+0.137
opl+0.134
 types+0.131
umm+0.131
Neg logit contributions
 tried-0.158
 managed-0.155
 decided-0.154
 remembered-0.152
 never-0.148
Token bird
Token ID6512
Feature activation+3.81e-02

Pos logit contributions
igsaw+0.097
opl+0.085
ummy+0.084
umm+0.084
 types+0.082
Neg logit contributions
 tried-0.083
 decided-0.082
 managed-0.078
 remembered-0.078
 accidentally-0.076
Token to
Token ID284
Feature activation+2.45e-02

Pos logit contributions
 managed+0.062
 tried+0.061
 decided+0.060
 remembered+0.060
 never+0.058
Neg logit contributions
igsaw-0.079
ummy-0.070
 types-0.069
opl-0.067
 batch-0.065
TokenTom
Token ID13787
Feature activation+4.66e-02

Pos logit contributions
igsaw+0.135
ummy+0.120
 types+0.118
opl+0.115
umm+0.115
Neg logit contributions
 managed-0.083
 tried-0.083
 decided-0.082
 remembered-0.080
 never-0.077
Token had
Token ID550
Feature activation+1.01e-01

Pos logit contributions
 managed+0.055
 unable+0.053
 never+0.051
 tried+0.050
 wouldn+0.050
Neg logit contributions
igsaw-0.107
 types-0.099
ummy-0.098
aj-0.096
 batch-0.093
Token an
Token ID281
Feature activation-1.37e-01

Pos logit contributions
 managed+0.169
 tried+0.168
 decided+0.167
 remembered+0.163
 never+0.160
Neg logit contributions
igsaw-0.210
ummy-0.184
 types-0.178
umm-0.176
opl-0.174
Token idea
Token ID2126
Feature activation+5.06e-02

Pos logit contributions
igsaw+0.199
umm+0.160
ummy+0.160
aj+0.150
 types+0.147
Neg logit contributions
 tried-0.314
 decided-0.312
 managed-0.305
 remembered-0.304
 never-0.299
Token.
Token ID13
Feature activation+7.33e-03

Pos logit contributions
 managed+0.098
 remembered+0.096
 accidentally+0.094
'd+0.092
 decided+0.090
Neg logit contributions
igsaw-0.086
ummy-0.079
 types-0.075
ay-0.071
 new-0.070
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 tried+0.010
 decided+0.010
 managed+0.010
 could+0.009
 remembered+0.009
Neg logit contributions
igsaw-0.018
ummy-0.016
opl-0.016
umm-0.015
 types-0.015
Token saw
Token ID2497
Feature activation-6.82e-03

Pos logit contributions
 unable+0.175
pped+0.163
 never+0.152
 managed+0.152
 stumbled+0.146
Neg logit contributions
igsaw-0.399
 types-0.396
aj-0.371
 batch-0.369
 bou-0.369
Token a
Token ID257
Feature activation-9.43e-02

Pos logit contributions
igsaw+0.016
umm+0.014
ummy+0.014
aj+0.013
opl+0.013
Neg logit contributions
 tried-0.011
 decided-0.011
 never-0.010
 managed-0.010
 could-0.010
Token swing
Token ID9628
Feature activation-1.90e-02

Pos logit contributions
igsaw+0.157
ummy+0.133
 types+0.128
umm+0.125
opl+0.124
Neg logit contributions
 managed-0.193
 tried-0.193
 decided-0.192
 remembered-0.189
 never-0.185
Token nearby
Token ID6716
Feature activation-5.43e-02

Pos logit contributions
igsaw+0.048
ummy+0.042
 types+0.041
umm+0.041
aj+0.040
Neg logit contributions
 tried-0.024
 decided-0.024
 managed-0.024
 never-0.023
 remembered-0.023
Token and
Token ID290
Feature activation+7.70e-03

Pos logit contributions
igsaw+0.129
ummy+0.115
 types+0.113
umm+0.110
opl+0.110
Neg logit contributions
 managed-0.073
 tried-0.072
 decided-0.071
 remembered-0.071
 never-0.068
Token to
Token ID284
Feature activation+3.65e-02

Pos logit contributions
 managed+0.074
 remembered+0.072
 tried+0.072
 accidentally+0.068
 decided+0.068
Neg logit contributions
igsaw-0.077
ummy-0.068
 new-0.068
 types-0.066
 art-0.064
Token take
Token ID1011
Feature activation-3.48e-02

Pos logit contributions
 managed+0.058
 remembered+0.057
 tried+0.057
 decided+0.056
 wouldn+0.055
Neg logit contributions
igsaw-0.076
ummy-0.066
 types-0.066
opl-0.062
umm-0.062
Token a
Token ID257
Feature activation-9.38e-02

Pos logit contributions
igsaw+0.074
ummy+0.066
 types+0.064
umm+0.064
opl+0.063
Neg logit contributions
 tried-0.055
 managed-0.055
 decided-0.055
 remembered-0.053
 never-0.052
Token journey
Token ID7002
Feature activation-3.80e-02

Pos logit contributions
igsaw+0.183
ummy+0.162
 types+0.160
umm+0.157
opl+0.156
Neg logit contributions
 tried-0.164
 managed-0.162
 decided-0.161
 remembered-0.157
 never-0.155
Token to
Token ID284
Feature activation+2.11e-02

Pos logit contributions
igsaw+0.090
ummy+0.080
 types+0.079
umm+0.078
opl+0.077
Neg logit contributions
 decided-0.052
 managed-0.051
 tried-0.051
 never-0.049
 remembered-0.048
Token outer
Token ID12076
Feature activation-2.77e-01

Pos logit contributions
 tried+0.030
 decided+0.030
 never+0.029
 managed+0.028
 remembered+0.027
Neg logit contributions
igsaw-0.050
ummy-0.047
umm-0.047
aj-0.045
 types-0.044
Token space
Token ID2272
Feature activation-5.30e-02

Pos logit contributions
igsaw+0.438
umm+0.387
opl+0.363
ummy+0.347
Whe+0.339
Neg logit contributions
 never-0.588
 decided-0.573
 tried-0.572
 remembered-0.552
 just-0.547
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
igsaw+0.127
ummy+0.113
umm+0.110
 types+0.109
opl+0.108
Neg logit contributions
 decided-0.071
 tried-0.070
 managed-0.070
 never-0.068
 remembered-0.067
Token He
Token ID679
Feature activation+1.53e-01

Pos logit contributions
igsaw+0.030
ummy+0.026
 types+0.025
umm+0.024
opl+0.024
Neg logit contributions
 tried-0.027
 managed-0.027
 decided-0.027
 remembered-0.026
 never-0.026
Token hopped
Token ID45230
Feature activation-6.90e-03

Pos logit contributions
 managed+0.134
 tried+0.128
 decided+0.125
 remembered+0.124
 accidentally+0.124
Neg logit contributions
igsaw-0.398
 types-0.377
ummy-0.363
aj-0.353
 seats-0.352
Token into
Token ID656
Feature activation-5.03e-02

Pos logit contributions
igsaw+0.015
ummy+0.014
 types+0.013
umm+0.013
opl+0.013
Neg logit contributions
 managed-0.010
 tried-0.010
 decided-0.010
 remembered-0.010
 never-0.010
Token what
Token ID644
Feature activation+3.59e-02

Pos logit contributions
igsaw+0.152
ummy+0.132
umm+0.129
aj+0.126
 types+0.122
Neg logit contributions
 tried-0.127
 never-0.126
 decided-0.124
 managed-0.123
 remembered-0.121
Token the
Token ID262
Feature activation-1.41e-01

Pos logit contributions
 managed+0.043
 tried+0.042
 decided+0.042
 remembered+0.041
 never+0.039
Neg logit contributions
igsaw-0.090
ummy-0.081
 types-0.080
umm-0.078
opl-0.078
Token rocket
Token ID10701
Feature activation-4.34e-04

Pos logit contributions
igsaw+0.250
ummy+0.215
 types+0.204
umm+0.200
 batch+0.199
Neg logit contributions
 tried-0.279
 decided-0.275
 managed-0.272
 never-0.271
 remembered-0.270
Token discovered
Token ID5071
Feature activation-1.67e-02

Pos logit contributions
igsaw+0.001
 types+0.001
ummy+0.001
opl+0.001
 difference+0.001
Neg logit contributions
 managed-0.000
 remembered-0.000
 tried-0.000
 accidentally-0.000
 decided-0.000
Token in
Token ID287
Feature activation-2.33e-02

Pos logit contributions
igsaw+0.040
ummy+0.035
 types+0.034
umm+0.034
opl+0.033
Neg logit contributions
 managed-0.023
 decided-0.023
 tried-0.023
 remembered-0.022
 never-0.022
Token outer
Token ID12076
Feature activation-2.75e-01

Pos logit contributions
igsaw+0.047
ummy+0.042
umm+0.042
aj+0.041
 types+0.040
Neg logit contributions
 never-0.040
 tried-0.040
 decided-0.039
 managed-0.039
 remembered-0.038
Token space
Token ID2272
Feature activation-5.77e-02

Pos logit contributions
igsaw+0.397
umm+0.347
ummy+0.336
 batch+0.334
 types+0.302
Neg logit contributions
 never-0.637
 remembered-0.594
 tried-0.593
 decided-0.582
 could-0.574
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
igsaw+0.131
ummy+0.116
 types+0.113
umm+0.111
opl+0.110
Neg logit contributions
 managed-0.083
 tried-0.083
 decided-0.083
 remembered-0.080
 never-0.079
Token It
Token ID632
Feature activation+1.42e-01

Pos logit contributions
igsaw+0.020
ummy+0.018
 types+0.018
 bou+0.017
umm+0.017
Neg logit contributions
 decided-0.017
 tried-0.017
 managed-0.017
 never-0.017
 could-0.017
Token was
Token ID373
Feature activation+6.30e-02

Pos logit contributions
 managed+0.137
 tried+0.137
 decided+0.132
 accidentally+0.130
 remembered+0.130
Neg logit contributions
igsaw-0.369
 types-0.341
ummy-0.334
opl-0.327
umm-0.324
Token a
Token ID257
Feature activation-8.25e-02

Pos logit contributions
 managed+0.100
 decided+0.099
 tried+0.099
 never+0.098
 remembered+0.097
Neg logit contributions
igsaw-0.139
ummy-0.122
umm-0.117
 types-0.117
opl-0.114
Token the
Token ID262
Feature activation-1.47e-01

Pos logit contributions
 tried+0.033
 decided+0.032
 never+0.031
 managed+0.031
 remembered+0.031
Neg logit contributions
igsaw-0.043
ummy-0.038
umm-0.038
aj-0.036
opl-0.036
Token stars
Token ID5788
Feature activation+9.71e-03

Pos logit contributions
igsaw+0.187
 batch+0.154
aj+0.153
ummy+0.145
 bou+0.138
Neg logit contributions
 decided-0.368
 tried-0.364
 never-0.363
 could-0.354
 didn-0.354
Token and
Token ID290
Feature activation+1.53e-03

Pos logit contributions
 managed+0.015
 remembered+0.015
 tried+0.015
 decided+0.014
 stumbled+0.014
Neg logit contributions
igsaw-0.020
ummy-0.018
 types-0.017
opl-0.017
 bou-0.017
Token planets
Token ID14705
Feature activation-1.64e-03

Pos logit contributions
 managed+0.002
 tried+0.002
 remembered+0.002
 decided+0.002
 wouldn+0.002
Neg logit contributions
igsaw-0.003
 types-0.003
ummy-0.003
 new-0.003
opl-0.003
Token in
Token ID287
Feature activation-3.88e-02

Pos logit contributions
igsaw+0.003
 types+0.003
ummy+0.003
 bou+0.003
 new+0.003
Neg logit contributions
 managed-0.003
 remembered-0.003
 tried-0.003
 accidentally-0.003
 decided-0.003
Token outer
Token ID12076
Feature activation-2.74e-01

Pos logit contributions
igsaw+0.087
umm+0.081
aj+0.080
ummy+0.080
 batch+0.077
Neg logit contributions
 never-0.057
 tried-0.057
 decided-0.056
 managed-0.053
 remembered-0.052
Token space
Token ID2272
Feature activation-5.60e-02

Pos logit contributions
igsaw+0.393
umm+0.382
 batch+0.355
ummy+0.343
Whe+0.319
Neg logit contributions
 never-0.625
 tried-0.598
 remembered-0.588
 decided-0.570
 could-0.566
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
igsaw+0.127
ummy+0.113
 types+0.111
opl+0.107
umm+0.107
Neg logit contributions
 managed-0.081
 tried-0.081
 decided-0.080
 remembered-0.078
 never-0.075
Token So
Token ID1406
Feature activation+1.60e-02

Pos logit contributions
igsaw+0.027
 coll+0.024
opl+0.024
umm+0.023
aj+0.023
Neg logit contributions
 decided-0.017
 tried-0.017
 could-0.016
 never-0.016
 didn-0.016
Token one
Token ID530
Feature activation-4.03e-02

Pos logit contributions
 managed+0.024
 tried+0.024
 decided+0.024
 remembered+0.023
 never+0.023
Neg logit contributions
igsaw-0.036
ummy-0.032
 types-0.031
umm-0.030
opl-0.030
Token day
Token ID1110
Feature activation+8.15e-03

Pos logit contributions
igsaw+0.067
opl+0.056
ummy+0.055
umm+0.055
 types+0.053
Neg logit contributions
 tried-0.088
 decided-0.087
 remembered-0.085
 didn-0.083
 managed-0.083
Token special
Token ID2041
Feature activation-1.19e-01

Pos logit contributions
igsaw+0.139
ummy+0.116
 types+0.111
umm+0.108
opl+0.106
Neg logit contributions
 managed-0.199
 tried-0.197
 decided-0.197
 remembered-0.193
 never-0.188
Token star
Token ID3491
Feature activation-4.23e-02

Pos logit contributions
igsaw+0.193
ummy+0.160
 types+0.154
umm+0.154
opl+0.154
Neg logit contributions
 tried-0.252
 decided-0.248
 managed-0.245
 remembered-0.245
 never-0.243
Token that
Token ID326
Feature activation-7.76e-03

Pos logit contributions
igsaw+0.090
ummy+0.079
 types+0.077
umm+0.075
opl+0.075
Neg logit contributions
 managed-0.067
 tried-0.067
 decided-0.066
 remembered-0.065
 never-0.063
Token comes
Token ID2058
Feature activation+4.70e-02

Pos logit contributions
igsaw+0.017
ummy+0.015
 types+0.014
umm+0.014
opl+0.014
Neg logit contributions
 tried-0.012
 managed-0.012
 decided-0.012
 remembered-0.012
 never-0.012
Token from
Token ID422
Feature activation-6.28e-02

Pos logit contributions
 decided+0.074
 tried+0.074
 managed+0.074
 never+0.072
 remembered+0.072
Neg logit contributions
igsaw-0.102
ummy-0.088
 types-0.087
umm-0.086
opl-0.084
Token outer
Token ID12076
Feature activation-2.65e-01

Pos logit contributions
igsaw+0.136
ummy+0.117
umm+0.115
 types+0.113
opl+0.110
Neg logit contributions
 tried-0.103
 decided-0.102
 never-0.100
 managed-0.099
 remembered-0.099
Token space
Token ID2272
Feature activation-5.47e-02

Pos logit contributions
igsaw+0.391
umm+0.341
ummy+0.320
 batch+0.315
Whe+0.306
Neg logit contributions
 never-0.610
 tried-0.596
 decided-0.579
 remembered-0.571
 could-0.557
Token,"
Token ID553
Feature activation-4.59e-02

Pos logit contributions
igsaw+0.119
ummy+0.105
 types+0.102
umm+0.101
opl+0.100
Neg logit contributions
 tried-0.085
 decided-0.084
 managed-0.084
 remembered-0.082
 never-0.081
Token she
Token ID673
Feature activation+1.27e-01

Pos logit contributions
igsaw+0.078
ummy+0.069
 types+0.067
 bou+0.063
umm+0.063
Neg logit contributions
 managed-0.091
 tried-0.088
 remembered-0.087
 decided-0.087
 accidentally-0.085
Token explained
Token ID4893
Feature activation-2.83e-02

Pos logit contributions
 managed+0.181
 tried+0.172
 decided+0.172
 never+0.170
 remembered+0.170
Neg logit contributions
igsaw-0.275
 types-0.247
ummy-0.245
 batch-0.234
 bou-0.233
Token.
Token ID13
Feature activation-4.75e-03

Pos logit contributions
igsaw+0.069
ummy+0.060
opl+0.060
 types+0.059
umm+0.059
Neg logit contributions
 decided-0.037
 tried-0.037
 managed-0.037
 remembered-0.036
 never-0.035
Tokenizz
Token ID6457
Feature activation-4.32e-02

Pos logit contributions
igsaw+0.185
ummy+0.159
 types+0.153
opl+0.152
 batch+0.146
Neg logit contributions
 managed-0.187
 tried-0.186
 decided-0.185
 remembered-0.181
 never-0.177
Tokened
Token ID276
Feature activation-1.60e-02

Pos logit contributions
igsaw+0.094
ummy+0.082
umm+0.080
 types+0.079
opl+0.078
Neg logit contributions
 managed-0.068
 tried-0.068
 decided-0.068
 remembered-0.066
 never-0.065
Token off
Token ID572
Feature activation-1.69e-02

Pos logit contributions
igsaw+0.038
ummy+0.035
umm+0.035
opl+0.034
 types+0.033
Neg logit contributions
 tried-0.022
 decided-0.022
 managed-0.021
 never-0.021
 remembered-0.020
Token back
Token ID736
Feature activation-9.74e-03

Pos logit contributions
igsaw+0.039
ummy+0.036
 types+0.035
umm+0.035
opl+0.035
Neg logit contributions
 tried-0.024
 decided-0.023
 managed-0.023
 never-0.023
 remembered-0.022
Token into
Token ID656
Feature activation-6.16e-02

Pos logit contributions
igsaw+0.021
umm+0.019
ummy+0.018
 types+0.018
 batch+0.018
Neg logit contributions
 tried-0.015
 decided-0.015
 never-0.014
 managed-0.014
 remembered-0.014
Token outer
Token ID12076
Feature activation-2.63e-01

Pos logit contributions
igsaw+0.134
ummy+0.130
aj+0.129
umm+0.129
opl+0.125
Neg logit contributions
 tried-0.095
 never-0.090
 decided-0.089
 managed-0.088
 remembered-0.088
Token space
Token ID2272
Feature activation-4.79e-02

Pos logit contributions
igsaw+0.383
umm+0.352
Whe+0.345
 batch+0.319
ummy+0.319
Neg logit contributions
 never-0.588
 remembered-0.549
 tried-0.547
 decided-0.540
 could-0.515
Token.
Token ID13
Feature activation-8.42e-03

Pos logit contributions
igsaw+0.115
ummy+0.103
umm+0.101
 types+0.100
opl+0.099
Neg logit contributions
 managed-0.063
 tried-0.063
 decided-0.063
 never-0.061
 remembered-0.060
Token Earth
Token ID3668
Feature activation-2.17e-02

Pos logit contributions
igsaw+0.018
 types+0.016
ummy+0.016
umm+0.016
 batch+0.016
Neg logit contributions
 decided-0.013
 could-0.013
 tried-0.013
 managed-0.013
 never-0.013
Token was
Token ID373
Feature activation+6.86e-02

Pos logit contributions
igsaw+0.054
 types+0.050
ummy+0.049
 bou+0.047
 difference+0.046
Neg logit contributions
 managed-0.023
 tried-0.022
 decided-0.022
 remembered-0.022
 accidentally-0.021
Token an
Token ID281
Feature activation-1.25e-01

Pos logit contributions
 managed+0.098
 decided+0.097
 never+0.096
 tried+0.096
 remembered+0.095
Neg logit contributions
igsaw-0.165
ummy-0.147
umm-0.143
 types-0.141
opl-0.137
Token looked
Token ID3114
Feature activation+5.89e-02

Pos logit contributions
pped+0.144
 unable+0.141
 managed+0.141
 remembered+0.135
ppa+0.130
Neg logit contributions
 new-0.152
 types-0.150
igsaw-0.147
ummy-0.146
 puzzle-0.143
Token like
Token ID588
Feature activation+2.30e-02

Pos logit contributions
 managed+0.103
 tried+0.100
 remembered+0.098
 decided+0.098
 never+0.097
Neg logit contributions
igsaw-0.110
 new-0.098
ummy-0.097
 types-0.096
 bou-0.092
Token it
Token ID340
Feature activation+7.46e-02

Pos logit contributions
 managed+0.038
 tried+0.038
 decided+0.037
 remembered+0.037
 never+0.036
Neg logit contributions
igsaw-0.047
ummy-0.041
 types-0.040
umm-0.039
opl-0.039
Token came
Token ID1625
Feature activation+4.11e-02

Pos logit contributions
 managed+0.094
 remembered+0.091
 decided+0.088
 tried+0.087
 stumbled+0.085
Neg logit contributions
igsaw-0.168
 types-0.157
ummy-0.157
 new-0.153
opl-0.148
Token from
Token ID422
Feature activation-5.83e-02

Pos logit contributions
 managed+0.070
 tried+0.070
 decided+0.069
 remembered+0.068
 never+0.066
Neg logit contributions
igsaw-0.083
ummy-0.073
 types-0.071
umm-0.069
opl-0.068
Token outer
Token ID12076
Feature activation-2.62e-01

Pos logit contributions
igsaw+0.117
ummy+0.102
 types+0.100
umm+0.097
opl+0.097
Neg logit contributions
 managed-0.100
 tried-0.099
 decided-0.099
 remembered-0.097
 never-0.094
Token space
Token ID2272
Feature activation-5.46e-02

Pos logit contributions
igsaw+0.386
umm+0.320
ummy+0.319
 batch+0.304
opl+0.290
Neg logit contributions
 tried-0.597
 never-0.591
 decided-0.577
 remembered-0.570
 managed-0.555
Token.
Token ID13
Feature activation-1.25e-02

Pos logit contributions
igsaw+0.122
ummy+0.109
 types+0.106
umm+0.103
opl+0.103
Neg logit contributions
 managed-0.080
 tried-0.079
 decided-0.078
 remembered-0.078
 never-0.073
Token\n
Token ID198
Feature activation-5.35e-02

Pos logit contributions
igsaw+0.027
 types+0.023
opl+0.023
umm+0.022
ummy+0.022
Neg logit contributions
 tried-0.021
 decided-0.021
 accidentally-0.020
 managed-0.020
 didn-0.020
Token\n
Token ID198
Feature activation-4.89e-02

Pos logit contributions
igsaw+0.118
umm+0.105
 types+0.100
ummy+0.099
opl+0.098
Neg logit contributions
 tried-0.086
 decided-0.083
 never-0.082
 managed-0.081
 could-0.081
TokenL
Token ID43
Feature activation-6.11e-02

Pos logit contributions
igsaw+0.081
ummy+0.073
 types+0.068
ay+0.067
 batch+0.067
Neg logit contributions
 managed-0.098
 remembered-0.094
 tried-0.094
 wouldn-0.093
 unable-0.091
Token Emma
Token ID18966
Feature activation+7.72e-02

Pos logit contributions
igsaw+0.058
umm+0.057
opl+0.056
 types+0.053
 difference+0.052
Neg logit contributions
 never-0.037
 tried-0.036
 accidentally-0.036
 decided-0.036
 didn-0.034
Token who
Token ID508
Feature activation+8.05e-02

Pos logit contributions
 managed+0.158
 tried+0.141
 wouldn+0.139
 stumbled+0.138
 accidentally+0.136
Neg logit contributions
ummy-0.114
igsaw-0.114
 types-0.108
opl-0.105
 new-0.104
Token loved
Token ID6151
Feature activation-2.71e-02

Pos logit contributions
 managed+0.142
pped+0.131
 unable+0.131
 possibly+0.125
 stumbled+0.124
Neg logit contributions
igsaw-0.125
 types-0.120
 sorts-0.120
 new-0.120
opl-0.119
Token learning
Token ID4673
Feature activation-1.58e-02

Pos logit contributions
igsaw+0.065
ummy+0.057
umm+0.056
 types+0.056
opl+0.055
Neg logit contributions
 tried-0.037
 decided-0.036
 managed-0.035
 remembered-0.035
 never-0.035
Token about
Token ID546
Feature activation+1.05e-02

Pos logit contributions
umm+0.024
igsaw+0.024
Whe+0.021
ummy+0.019
opl+0.019
Neg logit contributions
 decided-0.033
 could-0.033
 tried-0.032
 just-0.032
 never-0.032
Token outer
Token ID12076
Feature activation-2.60e-01

Pos logit contributions
 tried+0.020
 never+0.019
 decided+0.019
 accidentally+0.019
 remembered+0.018
Neg logit contributions
igsaw-0.020
umm-0.018
ummy-0.017
aj-0.017
ay-0.016
Token space
Token ID2272
Feature activation-5.63e-02

Pos logit contributions
igsaw+0.392
umm+0.332
Whe+0.313
 batch+0.312
aj+0.286
Neg logit contributions
 never-0.554
 could-0.537
 tried-0.533
 just-0.523
 decided-0.521
Token.
Token ID13
Feature activation-1.28e-02

Pos logit contributions
igsaw+0.132
ummy+0.117
 types+0.115
umm+0.112
opl+0.112
Neg logit contributions
 managed-0.078
 tried-0.077
 decided-0.077
 remembered-0.075
 never-0.073
Token Everyday
Token ID48154
Feature activation-1.90e-02

Pos logit contributions
igsaw+0.030
aj+0.028
 types+0.028
opl+0.028
umm+0.027
Neg logit contributions
 tried-0.015
 could-0.014
 accidentally-0.014
 decided-0.014
 never-0.014
Token,
Token ID11
Feature activation+1.91e-02

Pos logit contributions
igsaw+0.041
ummy+0.036
 types+0.035
opl+0.034
umm+0.034
Neg logit contributions
 tried-0.031
 decided-0.030
 managed-0.030
 remembered-0.030
 never-0.029
Token she
Token ID673
Feature activation+1.24e-01

Pos logit contributions
 tried+0.025
 decided+0.024
 managed+0.024
 remembered+0.024
 never+0.023
Neg logit contributions
igsaw-0.047
ummy-0.042
 types-0.041
opl-0.041
umm-0.040
Token.
Token ID13
Feature activation-2.65e-03

Pos logit contributions
igsaw+0.038
ummy+0.034
umm+0.034
 types+0.034
opl+0.033
Neg logit contributions
 tried-0.022
 managed-0.022
 decided-0.021
 never-0.021
 remembered-0.020
Token Soon
Token ID15894
Feature activation+1.88e-02

Pos logit contributions
igsaw+0.005
ummy+0.004
 types+0.004
umm+0.004
opl+0.004
Neg logit contributions
 tried-0.005
 managed-0.005
 decided-0.005
 never-0.005
 remembered-0.005
Token they
Token ID484
Feature activation+1.16e-01

Pos logit contributions
 tried+0.030
 managed+0.030
 decided+0.030
 remembered+0.029
 never+0.028
Neg logit contributions
igsaw-0.039
ummy-0.034
 types-0.033
umm-0.032
opl-0.031
Token were
Token ID547
Feature activation+5.78e-02

Pos logit contributions
 tried+0.092
 accidentally+0.090
 unable+0.089
 managed+0.087
 decided+0.086
Neg logit contributions
igsaw-0.296
 types-0.280
ay-0.274
 bou-0.271
ummy-0.270
Token in
Token ID287
Feature activation-1.94e-02

Pos logit contributions
 tried+0.087
 managed+0.087
 decided+0.087
 remembered+0.085
 never+0.082
Neg logit contributions
igsaw-0.127
ummy-0.112
 types-0.109
umm-0.107
opl-0.107
Token outer
Token ID12076
Feature activation-2.59e-01

Pos logit contributions
igsaw+0.039
ummy+0.036
umm+0.035
aj+0.034
opl+0.034
Neg logit contributions
 tried-0.033
 never-0.033
 managed-0.032
 remembered-0.032
 decided-0.031
Token space
Token ID2272
Feature activation-5.32e-02

Pos logit contributions
igsaw+0.389
umm+0.355
ummy+0.333
 batch+0.328
 types+0.292
Neg logit contributions
 never-0.593
 tried-0.564
 remembered-0.557
 decided-0.537
 managed-0.533
Token and
Token ID290
Feature activation+1.08e-03

Pos logit contributions
igsaw+0.123
ummy+0.109
 types+0.106
umm+0.105
opl+0.104
Neg logit contributions
 managed-0.076
 tried-0.076
 decided-0.075
 remembered-0.074
 never-0.073
Token having
Token ID1719
Feature activation+2.63e-02

Pos logit contributions
 managed+0.002
 tried+0.002
 decided+0.002
 remembered+0.002
 never+0.001
Neg logit contributions
igsaw-0.003
ummy-0.002
umm-0.002
 types-0.002
opl-0.002
Token fun
Token ID1257
Feature activation-2.43e-02

Pos logit contributions
 managed+0.053
 decided+0.052
 tried+0.052
 never+0.052
 remembered+0.051
Neg logit contributions
igsaw-0.047
ummy-0.042
umm-0.040
opl-0.038
 types-0.038
Token.
Token ID13
Feature activation-3.40e-03

Pos logit contributions
igsaw+0.053
ummy+0.047
 types+0.046
opl+0.045
umm+0.044
Neg logit contributions
 managed-0.037
 tried-0.037
 decided-0.037
 remembered-0.036
 accidentally-0.035
Token,
Token ID11
Feature activation+6.99e-05

Pos logit contributions
igsaw+0.083
ummy+0.074
 types+0.072
umm+0.071
opl+0.071
Neg logit contributions
 managed-0.045
 tried-0.045
 decided-0.045
 remembered-0.043
 never-0.043
Token and
Token ID290
Feature activation+4.22e-03

Pos logit contributions
 managed+0.000
 tried+0.000
 decided+0.000
 remembered+0.000
 never+0.000
Neg logit contributions
igsaw-0.000
ummy-0.000
 types-0.000
umm-0.000
opl-0.000
Token even
Token ID772
Feature activation+3.74e-02

Pos logit contributions
 tried+0.006
 decided+0.006
 managed+0.006
 remembered+0.006
 never+0.006
Neg logit contributions
igsaw-0.010
ummy-0.009
opl-0.008
umm-0.008
 types-0.008
Token went
Token ID1816
Feature activation+4.49e-03

Pos logit contributions
 tried+0.060
 decided+0.059
 managed+0.059
 remembered+0.057
 never+0.056
Neg logit contributions
igsaw-0.083
ummy-0.075
opl-0.071
umm-0.071
 types-0.069
Token to
Token ID284
Feature activation+2.01e-02

Pos logit contributions
 managed+0.007
 tried+0.007
 decided+0.007
 remembered+0.006
 never+0.006
Neg logit contributions
igsaw-0.010
ummy-0.009
 types-0.009
opl-0.008
umm-0.008
Token outer
Token ID12076
Feature activation-2.57e-01

Pos logit contributions
 never+0.034
 tried+0.032
 decided+0.030
 accidentally+0.030
 remembered+0.029
Neg logit contributions
igsaw-0.044
umm-0.044
ummy-0.042
ay-0.040
aj-0.040
Token space
Token ID2272
Feature activation-4.55e-02

Pos logit contributions
igsaw+0.424
umm+0.361
opl+0.338
ummy+0.327
 batch+0.321
Neg logit contributions
 never-0.566
 tried-0.546
 remembered-0.540
 decided-0.539
 just-0.526
Token!
Token ID0
Feature activation-1.40e-02

Pos logit contributions
igsaw+0.108
ummy+0.096
umm+0.095
opl+0.092
 types+0.092
Neg logit contributions
 decided-0.063
 never-0.063
 managed-0.062
 tried-0.062
 remembered-0.060
Token It
Token ID632
Feature activation+1.38e-01

Pos logit contributions
igsaw+0.030
 types+0.029
umm+0.028
 bou+0.027
opl+0.027
Neg logit contributions
 never-0.021
 could-0.021
 decided-0.021
 didn-0.020
 accidentally-0.020
Token was
Token ID373
Feature activation+5.90e-02

Pos logit contributions
 managed+0.137
 tried+0.136
 decided+0.133
 remembered+0.130
 accidentally+0.123
Neg logit contributions
igsaw-0.366
ummy-0.332
 types-0.332
umm-0.320
opl-0.319
Token the
Token ID262
Feature activation-1.24e-01

Pos logit contributions
 decided+0.093
 never+0.093
 managed+0.093
 tried+0.091
 remembered+0.090
Neg logit contributions
igsaw-0.132
ummy-0.116
umm-0.113
 types-0.111
opl-0.109
Token that
Token ID326
Feature activation+2.09e-03

Pos logit contributions
 managed+0.001
 tried+0.001
 decided+0.001
 remembered+0.001
 never+0.001
Neg logit contributions
igsaw-0.002
 types-0.002
ummy-0.002
umm-0.002
 batch-0.002
Token they
Token ID484
Feature activation+1.09e-01

Pos logit contributions
 decided+0.002
 tried+0.002
 managed+0.002
 remembered+0.002
 never+0.002
Neg logit contributions
igsaw-0.006
ummy-0.005
umm-0.005
opl-0.005
aj-0.005
Token were
Token ID547
Feature activation+5.60e-02

Pos logit contributions
 unable+0.079
 managed+0.078
 accidentally+0.077
 tried+0.077
 remembered+0.076
Neg logit contributions
igsaw-0.273
 types-0.268
 seats-0.257
ay-0.253
 art-0.248
Token going
Token ID1016
Feature activation-9.46e-03

Pos logit contributions
 managed+0.086
 tried+0.086
 decided+0.085
 remembered+0.083
 never+0.082
Neg logit contributions
igsaw-0.124
ummy-0.109
 types-0.106
umm-0.104
opl-0.103
Token to
Token ID284
Feature activation+3.66e-02

Pos logit contributions
igsaw+0.018
ummy+0.016
 types+0.016
 new+0.016
ay+0.015
Neg logit contributions
 managed-0.016
 remembered-0.016
 tried-0.016
 decided-0.015
 dropped-0.015
Token outer
Token ID12076
Feature activation-2.53e-01

Pos logit contributions
 never+0.051
 tried+0.051
 managed+0.049
 decided+0.049
 remembered+0.046
Neg logit contributions
igsaw-0.090
ummy-0.084
umm-0.081
aj-0.079
 types-0.078
Token space
Token ID2272
Feature activation-5.18e-02

Pos logit contributions
igsaw+0.383
umm+0.330
ummy+0.311
opl+0.302
 batch+0.297
Neg logit contributions
 never-0.598
 tried-0.558
 decided-0.550
 could-0.549
 remembered-0.544
Token.
Token ID13
Feature activation-2.07e-03

Pos logit contributions
igsaw+0.118
ummy+0.105
 types+0.102
umm+0.101
opl+0.100
Neg logit contributions
 managed-0.075
 tried-0.075
 decided-0.074
 remembered-0.072
 never-0.070
Token They
Token ID1119
Feature activation+1.50e-01

Pos logit contributions
igsaw+0.004
opl+0.004
 types+0.004
ummy+0.004
umm+0.004
Neg logit contributions
 tried-0.004
 decided-0.004
 managed-0.003
 accidentally-0.003
 could-0.003
Token watched
Token ID7342
Feature activation-4.19e-02

Pos logit contributions
 managed+0.133
 unable+0.127
 accidentally+0.127
 tried+0.125
 remembered+0.124
Neg logit contributions
igsaw-0.370
 types-0.357
 seats-0.340
 bou-0.331
ummy-0.330
Token out
Token ID503
Feature activation+8.69e-03

Pos logit contributions
igsaw+0.102
opl+0.090
umm+0.089
ummy+0.087
 types+0.086
Neg logit contributions
 tried-0.059
 decided-0.056
 managed-0.056
 never-0.055
 didn-0.055
Token mouse
Token ID10211
Feature activation+5.91e-02

Pos logit contributions
igsaw+0.257
ummy+0.223
 types+0.216
umm+0.212
opl+0.210
Neg logit contributions
 tried-0.231
 managed-0.231
 decided-0.229
 remembered-0.226
 never-0.220
Token never
Token ID1239
Feature activation+8.37e-02

Pos logit contributions
 managed+0.058
 tried+0.058
 decided+0.056
 accidentally+0.055
 stumbled+0.055
Neg logit contributions
igsaw-0.143
 types-0.134
ummy-0.133
opl-0.132
 difference-0.128
Token suffered
Token ID6989
Feature activation-2.43e-02

Pos logit contributions
 managed+0.050
 decided+0.048
 never+0.046
 tried+0.045
pped+0.045
Neg logit contributions
igsaw-0.239
 types-0.224
ummy-0.217
 difference-0.215
umm-0.212
Token from
Token ID422
Feature activation-6.40e-02

Pos logit contributions
igsaw+0.052
ummy+0.046
 types+0.045
umm+0.044
opl+0.043
Neg logit contributions
 managed-0.038
 tried-0.038
 decided-0.038
 remembered-0.037
 never-0.036
Token the
Token ID262
Feature activation-1.57e-01

Pos logit contributions
igsaw+0.124
ummy+0.108
 types+0.105
umm+0.102
opl+0.102
Neg logit contributions
 managed-0.114
 tried-0.114
 decided-0.113
 remembered-0.111
 never-0.108
Token rud
Token ID28906
Feature activation-2.31e-01

Pos logit contributions
igsaw+0.289
ummy+0.248
 types+0.241
umm+0.234
opl+0.234
Neg logit contributions
 tried-0.298
 managed-0.297
 decided-0.295
 remembered-0.291
 never-0.285
Tokeneness
Token ID9449
Feature activation+9.06e-03

Pos logit contributions
igsaw+0.396
ummy+0.346
umm+0.325
 types+0.324
opl+0.323
Neg logit contributions
 managed-0.469
 decided-0.467
 tried-0.467
 remembered-0.455
 never-0.448
Token of
Token ID286
Feature activation-8.02e-02

Pos logit contributions
 managed+0.015
 tried+0.015
 remembered+0.015
 decided+0.015
 stumbled+0.014
Neg logit contributions
igsaw-0.018
ummy-0.016
 types-0.015
umm-0.015
opl-0.015
Token other
Token ID584
Feature activation-8.36e-02

Pos logit contributions
igsaw+0.147
ummy+0.127
 types+0.124
opl+0.120
umm+0.119
Neg logit contributions
 managed-0.151
 tried-0.150
 decided-0.150
 remembered-0.147
 never-0.143
Token animals
Token ID4695
Feature activation+2.45e-02

Pos logit contributions
igsaw+0.150
ummy+0.129
 types+0.125
umm+0.121
opl+0.120
Neg logit contributions
 managed-0.163
 tried-0.162
 decided-0.161
 remembered-0.158
 never-0.155
Token again
Token ID757
Feature activation-1.27e-02

Pos logit contributions
 managed+0.039
 remembered+0.039
 tried+0.039
 decided+0.038
 stumbled+0.037
Neg logit contributions
igsaw-0.048
ummy-0.044
 difference-0.042
opl-0.041
 types-0.040
Token getting
Token ID1972
Feature activation+3.66e-03

Pos logit contributions
 managed+0.084
 tried+0.083
 decided+0.082
 remembered+0.080
 wouldn+0.077
Neg logit contributions
igsaw-0.172
ummy-0.156
 types-0.154
opl-0.151
 batch-0.148
Token very
Token ID845
Feature activation+4.61e-02

Pos logit contributions
 managed+0.006
'd+0.006
 remembered+0.006
 tried+0.005
 decided+0.005
Neg logit contributions
igsaw-0.007
 types-0.006
 new-0.006
ummy-0.006
 seats-0.006
Token annoyed
Token ID25602
Feature activation-3.10e-02

Pos logit contributions
 tried+0.070
 managed+0.070
 decided+0.070
 remembered+0.068
 never+0.066
Neg logit contributions
igsaw-0.104
ummy-0.091
 types-0.089
umm-0.088
opl-0.087
Token with
Token ID351
Feature activation-1.72e-02

Pos logit contributions
igsaw+0.066
ummy+0.061
 types+0.058
umm+0.055
opl+0.055
Neg logit contributions
 managed-0.050
 remembered-0.048
 tried-0.047
 decided-0.047
'd-0.046
Token his
Token ID465
Feature activation-4.53e-02

Pos logit contributions
igsaw+0.037
ummy+0.032
 types+0.032
opl+0.031
umm+0.030
Neg logit contributions
 managed-0.027
 decided-0.027
 tried-0.027
 remembered-0.027
 stumbled-0.025
Token rud
Token ID28906
Feature activation-2.24e-01

Pos logit contributions
igsaw+0.087
umm+0.073
ummy+0.072
 types+0.071
 bou+0.069
Neg logit contributions
 tried-0.086
 remembered-0.083
 never-0.081
 managed-0.081
 dropped-0.081
Tokeneness
Token ID9449
Feature activation+2.62e-02

Pos logit contributions
igsaw+0.421
ummy+0.365
 types+0.353
umm+0.345
opl+0.343
Neg logit contributions
 managed-0.415
 tried-0.414
 decided-0.411
 remembered-0.404
 never-0.395
Token and
Token ID290
Feature activation+4.41e-03

Pos logit contributions
 managed+0.041
 tried+0.040
 decided+0.040
 remembered+0.040
 stumbled+0.038
Neg logit contributions
igsaw-0.055
ummy-0.050
 types-0.048
opl-0.047
umm-0.046
Token complained
Token ID13832
Feature activation-2.23e-02

Pos logit contributions
 managed+0.005
pped+0.005
'd+0.005
 stumbled+0.004
 remembered+0.004
Neg logit contributions
 types-0.010
igsaw-0.010
 new-0.009
 coll-0.009
 seats-0.009
Token that
Token ID326
Feature activation+9.61e-03

Pos logit contributions
igsaw+0.046
ummy+0.041
 types+0.040
umm+0.039
opl+0.038
Neg logit contributions
 managed-0.036
 tried-0.036
 decided-0.035
 remembered-0.035
 never-0.034
Token he
Token ID339
Feature activation+1.25e-01

Pos logit contributions
 tried+0.011
 managed+0.011
 decided+0.011
 remembered+0.011
 never+0.011
Neg logit contributions
igsaw-0.025
ummy-0.022
 types-0.022
umm-0.021
opl-0.021
Token fold
Token ID5591
Feature activation-5.10e-02

Pos logit contributions
 managed+0.025
 remembered+0.025
 unable+0.025
 stumbled+0.024
'd+0.024
Neg logit contributions
igsaw-0.032
 new-0.028
 art-0.028
 types-0.027
ummy-0.026
Token the
Token ID262
Feature activation-1.53e-01

Pos logit contributions
igsaw+0.138
ummy+0.125
umm+0.124
opl+0.124
 types+0.120
Neg logit contributions
 tried-0.056
 decided-0.053
 managed-0.053
 remembered-0.049
 never-0.049
Token knob
Token ID36842
Feature activation-1.89e-03

Pos logit contributions
igsaw+0.287
ummy+0.249
 types+0.242
umm+0.234
opl+0.232
Neg logit contributions
 managed-0.285
 tried-0.283
 decided-0.282
 remembered-0.277
 never-0.271
Token into
Token ID656
Feature activation-6.14e-02

Pos logit contributions
igsaw+0.004
ummy+0.004
 types+0.004
umm+0.004
opl+0.004
Neg logit contributions
 managed-0.003
 tried-0.003
 decided-0.003
 remembered-0.003
 never-0.003
Token an
Token ID281
Feature activation-1.56e-01

Pos logit contributions
igsaw+0.130
opl+0.115
ummy+0.115
umm+0.113
aj+0.108
Neg logit contributions
 tried-0.104
 decided-0.100
 managed-0.100
 remembered-0.097
 never-0.097
Token orig
Token ID1796
Feature activation-2.18e-01

Pos logit contributions
igsaw+0.275
ummy+0.229
 types+0.223
umm+0.220
opl+0.216
Neg logit contributions
 decided-0.313
 tried-0.311
 managed-0.305
 remembered-0.301
 never-0.297
Tokenami
Token ID6277
Feature activation-4.79e-02

Pos logit contributions
igsaw+0.326
ummy+0.263
 types+0.261
 bou+0.245
opl+0.243
Neg logit contributions
 managed-0.478
 tried-0.477
 decided-0.476
 remembered-0.468
 never-0.460
Token figure
Token ID3785
Feature activation-2.03e-03

Pos logit contributions
igsaw+0.091
ummy+0.080
 types+0.079
 bou+0.077
umm+0.076
Neg logit contributions
 managed-0.084
 remembered-0.082
 tried-0.082
 decided-0.082
 accidentally-0.079
Token.
Token ID13
Feature activation-5.46e-03

Pos logit contributions
igsaw+0.004
ummy+0.004
 types+0.004
opl+0.004
umm+0.004
Neg logit contributions
 managed-0.003
 tried-0.003
 decided-0.003
 remembered-0.003
 never-0.003
Token But
Token ID887
Feature activation+4.99e-02

Pos logit contributions
igsaw+0.012
opl+0.011
aj+0.011
 coll+0.011
ummy+0.011
Neg logit contributions
 decided-0.008
 tried-0.008
 managed-0.008
 could-0.007
 never-0.007
Token no
Token ID645
Feature activation-1.56e-02

Pos logit contributions
 managed+0.072
 decided+0.070
 tried+0.070
 remembered+0.069
'd+0.067
Neg logit contributions
igsaw-0.109
ummy-0.099
 types-0.096
umm-0.092
 batch-0.092
Token boat
Token ID8848
Feature activation+1.83e-02

Pos logit contributions
igsaw+0.240
ummy+0.201
umm+0.194
 types+0.189
 batch+0.187
Neg logit contributions
 tried-0.236
 decided-0.233
 remembered-0.230
 managed-0.227
 never-0.224
Token saved
Token ID7448
Feature activation-3.92e-02

Pos logit contributions
 managed+0.019
 tried+0.018
 decided+0.018
 remembered+0.018
 accidentally+0.018
Neg logit contributions
igsaw-0.044
 types-0.041
ummy-0.041
 difference-0.039
 new-0.039
Token them
Token ID606
Feature activation+7.95e-02

Pos logit contributions
igsaw+0.075
ummy+0.066
 types+0.064
umm+0.061
opl+0.061
Neg logit contributions
 managed-0.070
 tried-0.070
 decided-0.069
 remembered-0.069
 never-0.067
Token from
Token ID422
Feature activation-6.24e-02

Pos logit contributions
 managed+0.114
 tried+0.114
 decided+0.113
 never+0.111
 remembered+0.108
Neg logit contributions
igsaw-0.184
ummy-0.161
 types-0.160
umm-0.160
opl-0.156
Token their
Token ID511
Feature activation-6.96e-02

Pos logit contributions
igsaw+0.119
ummy+0.103
 types+0.100
umm+0.097
opl+0.097
Neg logit contributions
 managed-0.114
 tried-0.114
 decided-0.113
 remembered-0.111
 never-0.108
Token dire
Token ID19958
Feature activation-2.16e-01

Pos logit contributions
igsaw+0.124
ummy+0.104
umm+0.101
 types+0.099
opl+0.099
Neg logit contributions
 tried-0.144
 never-0.142
 remembered-0.141
 decided-0.140
 managed-0.139
Token situation
Token ID3074
Feature activation-1.48e-02

Pos logit contributions
igsaw+0.381
ummy+0.317
 types+0.299
 bou+0.296
 batch+0.296
Neg logit contributions
 tried-0.433
 managed-0.423
 remembered-0.422
 decided-0.422
 never-0.419
Token and
Token ID290
Feature activation+1.21e-02

Pos logit contributions
igsaw+0.035
ummy+0.031
 types+0.030
umm+0.030
opl+0.030
Neg logit contributions
 managed-0.020
 tried-0.020
 decided-0.020
 remembered-0.019
 never-0.019
Token they
Token ID484
Feature activation+1.08e-01

Pos logit contributions
 managed+0.016
 tried+0.016
 decided+0.016
 remembered+0.016
'd+0.015
Neg logit contributions
igsaw-0.027
ummy-0.025
 types-0.025
umm-0.023
opl-0.023
Token were
Token ID547
Feature activation+6.12e-02

Pos logit contributions
 accidentally+0.112
 tried+0.108
 managed+0.107
 remembered+0.103
 decided+0.102
Neg logit contributions
igsaw-0.241
ay-0.224
 types-0.224
 seats-0.220
 bou-0.220
Token able
Token ID1498
Feature activation+2.09e-02

Pos logit contributions
 tried+0.084
 managed+0.082
 decided+0.082
 accidentally+0.080
 remembered+0.080
Neg logit contributions
igsaw-0.132
ummy-0.119
opl-0.117
 types-0.117
 difference-0.115
Token sailed
Token ID33835
Feature activation-4.35e-02

Pos logit contributions
 managed+0.069
 tried+0.068
 decided+0.068
 remembered+0.066
 never+0.063
Neg logit contributions
igsaw-0.158
ummy-0.144
 types-0.142
opl-0.138
umm-0.137
Token for
Token ID329
Feature activation-2.27e-02

Pos logit contributions
igsaw+0.100
ummy+0.088
umm+0.085
opl+0.085
 types+0.084
Neg logit contributions
 managed-0.066
 tried-0.066
 decided-0.066
 remembered-0.064
 never-0.063
Token days
Token ID1528
Feature activation-2.35e-02

Pos logit contributions
igsaw+0.047
ummy+0.042
umm+0.040
 types+0.040
opl+0.039
Neg logit contributions
 tried-0.039
 managed-0.038
 decided-0.038
 never-0.037
 remembered-0.037
Token,
Token ID11
Feature activation+9.68e-03

Pos logit contributions
igsaw+0.053
ummy+0.047
 types+0.045
umm+0.044
opl+0.044
Neg logit contributions
 managed-0.035
 tried-0.034
 decided-0.034
 remembered-0.034
 never-0.032
Token exploring
Token ID13504
Feature activation+9.29e-03

Pos logit contributions
 managed+0.014
 tried+0.014
 remembered+0.014
 decided+0.014
 accidentally+0.013
Neg logit contributions
igsaw-0.020
 types-0.018
ummy-0.018
opl-0.017
 bou-0.017
Token unch
Token ID16258
Feature activation-2.14e-01

Pos logit contributions
 decided+0.014
 tried+0.014
 managed+0.013
 never+0.013
 remembered+0.013
Neg logit contributions
igsaw-0.022
ummy-0.019
umm-0.019
aj-0.019
opl-0.019
Tokenarted
Token ID19112
Feature activation-6.43e-03

Pos logit contributions
igsaw+0.409
 bou+0.350
 difference+0.348
 batch+0.344
 types+0.343
Neg logit contributions
 decided-0.367
 tried-0.352
 managed-0.349
 never-0.348
 accidentally-0.340
Token waters
Token ID10150
Feature activation+3.47e-04

Pos logit contributions
igsaw+0.014
ummy+0.012
 types+0.012
umm+0.011
opl+0.011
Neg logit contributions
 tried-0.010
 managed-0.010
 decided-0.010
 remembered-0.010
 never-0.010
Token until
Token ID1566
Feature activation+4.98e-02

Pos logit contributions
 tried+0.000
 decided+0.000
 managed+0.000
 never+0.000
 remembered+0.000
Neg logit contributions
igsaw-0.001
ummy-0.001
umm-0.001
 types-0.001
opl-0.001
Token he
Token ID339
Feature activation+1.19e-01

Pos logit contributions
 managed+0.077
 tried+0.076
 decided+0.075
 remembered+0.075
 never+0.071
Neg logit contributions
igsaw-0.108
ummy-0.095
 types-0.093
umm-0.091
opl-0.091
Token came
Token ID1625
Feature activation+4.60e-02

Pos logit contributions
 tried+0.096
 managed+0.093
 remembered+0.092
 decided+0.091
 unable+0.090
Neg logit contributions
igsaw-0.315
 types-0.291
ummy-0.289
opl-0.282
ay-0.280
Token she
Token ID673
Feature activation+1.24e-01

Pos logit contributions
 tried+0.030
 decided+0.030
 managed+0.030
 remembered+0.029
 never+0.029
Neg logit contributions
igsaw-0.045
ummy-0.039
 types-0.038
umm-0.038
opl-0.037
Token got
Token ID1392
Feature activation+1.23e-02

Pos logit contributions
 unable+0.102
pped+0.087
Please+0.085
 managed+0.084
 stumbled+0.083
Neg logit contributions
 new-0.242
 elderly-0.231
 types-0.231
 seats-0.230
 art-0.227
Token rid
Token ID5755
Feature activation-6.90e-02

Pos logit contributions
 managed+0.021
'd+0.020
 remembered+0.020
 tried+0.020
 decided+0.020
Neg logit contributions
igsaw-0.023
 new-0.020
ummy-0.020
 types-0.020
 art-0.019
Token of
Token ID286
Feature activation-4.80e-02

Pos logit contributions
igsaw+0.097
ummy+0.082
 types+0.077
umm+0.074
opl+0.073
Neg logit contributions
 managed-0.158
 tried-0.156
 remembered-0.155
 decided-0.155
 never-0.152
Token the
Token ID262
Feature activation-1.54e-01

Pos logit contributions
igsaw+0.094
ummy+0.081
 types+0.079
umm+0.077
opl+0.077
Neg logit contributions
 managed-0.085
 tried-0.084
 decided-0.084
 remembered-0.083
 never-0.080
Token intr
Token ID9913
Feature activation-2.14e-01

Pos logit contributions
igsaw+0.271
ummy+0.235
 types+0.229
umm+0.223
opl+0.221
Neg logit contributions
 managed-0.300
 tried-0.294
 decided-0.293
 remembered-0.287
 never-0.280
Tokenuders
Token ID48739
Feature activation-1.43e-03

Pos logit contributions
igsaw+0.352
ummy+0.301
 types+0.289
umm+0.281
opl+0.280
Neg logit contributions
 managed-0.442
 tried-0.441
 decided-0.438
 remembered-0.430
 never-0.421
Token.
Token ID13
Feature activation-8.74e-03

Pos logit contributions
igsaw+0.003
ummy+0.003
 types+0.003
opl+0.002
 bou+0.002
Neg logit contributions
 managed-0.002
 remembered-0.002
 tried-0.002
 decided-0.002
 never-0.002
Token She
Token ID1375
Feature activation+1.50e-01

Pos logit contributions
igsaw+0.020
 types+0.018
ummy+0.018
umm+0.018
 bou+0.017
Neg logit contributions
 decided-0.012
 never-0.012
 tried-0.012
 managed-0.012
 could-0.012
Token did
Token ID750
Feature activation+7.25e-02

Pos logit contributions
pped+0.147
 unable+0.144
 managed+0.142
 stumbled+0.135
 accidentally+0.131
Neg logit contributions
 types-0.316
igsaw-0.311
 bou-0.308
 seats-0.307
 new-0.305
Token not
Token ID407
Feature activation+1.04e-01

Pos logit contributions
 decided+0.072
 managed+0.072
 tried+0.070
 remembered+0.068
 stumbled+0.067
Neg logit contributions
igsaw-0.191
 new-0.172
ummy-0.168
opl-0.168
 types-0.167
Token he
Token ID339
Feature activation+1.19e-01

Pos logit contributions
 managed+0.156
 remembered+0.152
 tried+0.150
'd+0.149
 stumbled+0.148
Neg logit contributions
igsaw-0.151
 new-0.140
ummy-0.137
 types-0.129
 art-0.123
Token couldn
Token ID3521
Feature activation+3.87e-02

Pos logit contributions
pped+0.164
Please+0.161
 unable+0.160
pper+0.158
ppa+0.146
Neg logit contributions
 new-0.202
 types-0.196
ummy-0.178
 old-0.172
udd-0.166
Token't
Token ID470
Feature activation+7.91e-02

Pos logit contributions
 managed+0.042
 remembered+0.041
 decided+0.040
 tried+0.040
 wouldn+0.037
Neg logit contributions
igsaw-0.103
ummy-0.095
opl-0.091
ay-0.089
 types-0.089
Token touch
Token ID3638
Feature activation-6.99e-02

Pos logit contributions
 unable+0.155
 managed+0.150
 remembered+0.149
 wouldn+0.143
 accidentally+0.143
Neg logit contributions
igsaw-0.120
ay-0.109
 new-0.104
 difference-0.103
ummy-0.102
Token the
Token ID262
Feature activation-1.67e-01

Pos logit contributions
igsaw+0.153
ummy+0.136
 types+0.133
opl+0.130
umm+0.129
Neg logit contributions
 managed-0.107
 tried-0.105
 decided-0.104
 remembered-0.103
 never-0.099
Token v
Token ID410
Feature activation-2.13e-01

Pos logit contributions
igsaw+0.265
ummy+0.216
 types+0.201
umm+0.199
aj+0.193
Neg logit contributions
 tried-0.381
 decided-0.368
 never-0.367
 dropped-0.363
 managed-0.361
Tokenase
Token ID589
Feature activation-1.47e-03

Pos logit contributions
igsaw+0.324
 types+0.278
ummy+0.263
 bou+0.261
 difference+0.258
Neg logit contributions
 managed-0.447
 tried-0.443
 decided-0.442
 never-0.441
 accidentally-0.424
Token.
Token ID13
Feature activation-1.52e-03

Pos logit contributions
igsaw+0.003
ummy+0.003
 types+0.003
opl+0.003
umm+0.003
Neg logit contributions
 managed-0.002
 tried-0.002
 remembered-0.002
 decided-0.002
 stumbled-0.002
Token Tim
Token ID5045
Feature activation+3.79e-02

Pos logit contributions
igsaw+0.003
ummy+0.003
 types+0.002
opl+0.002
umm+0.002
Neg logit contributions
 tried-0.003
 decided-0.003
 managed-0.003
 remembered-0.003
 never-0.003
Tokenmy
Token ID1820
Feature activation+1.04e-01

Pos logit contributions
 managed+0.077
 unable+0.074
 stumbled+0.074
 decided+0.073
 tried+0.072
Neg logit contributions
igsaw-0.054
ummy-0.052
 types-0.051
aj-0.047
ay-0.047
Token felt
Token ID2936
Feature activation+4.71e-02

Pos logit contributions
'd+0.138
pped+0.132
 unable+0.130
 managed+0.126
 wouldn+0.125
Neg logit contributions
 batch-0.179
 new-0.174
 types-0.172
ummy-0.166
 chip-0.165
Tokeny
Token ID88
Feature activation+1.02e-02

Pos logit contributions
igsaw+0.216
ummy+0.197
 types+0.191
umm+0.183
ay+0.182
Neg logit contributions
 managed-0.151
 tried-0.146
 decided-0.143
 remembered-0.142
 never-0.142
Token cat
Token ID3797
Feature activation+2.16e-02

Pos logit contributions
 managed+0.019
 tried+0.019
 decided+0.019
 remembered+0.019
 never+0.018
Neg logit contributions
igsaw-0.019
ummy-0.016
 types-0.016
umm-0.015
opl-0.015
Token knocked
Token ID13642
Feature activation-6.04e-02

Pos logit contributions
 managed+0.031
 tried+0.028
 unable+0.028
'd+0.028
 decided+0.027
Neg logit contributions
igsaw-0.045
ummy-0.042
 types-0.040
 batch-0.040
opl-0.039
Token over
Token ID625
Feature activation-2.39e-02

Pos logit contributions
igsaw+0.128
ummy+0.114
 types+0.110
opl+0.106
umm+0.106
Neg logit contributions
 managed-0.097
 tried-0.094
 remembered-0.094
 decided-0.094
 never-0.090
Token the
Token ID262
Feature activation-1.59e-01

Pos logit contributions
igsaw+0.054
umm+0.048
ummy+0.048
opl+0.047
 types+0.046
Neg logit contributions
 tried-0.036
 decided-0.036
 managed-0.034
 never-0.034
 remembered-0.033
Token v
Token ID410
Feature activation-2.12e-01

Pos logit contributions
igsaw+0.263
 types+0.225
ummy+0.221
opl+0.216
umm+0.210
Neg logit contributions
 tried-0.322
 decided-0.320
 dropped-0.308
 remembered-0.307
 accidentally-0.306
Tokenase
Token ID589
Feature activation-1.90e-03

Pos logit contributions
igsaw+0.338
 types+0.300
 difference+0.285
ummy+0.278
 bou+0.276
Neg logit contributions
 never-0.427
 tried-0.425
 decided-0.424
 managed-0.422
 accidentally-0.412
Token.
Token ID13
Feature activation-7.07e-03

Pos logit contributions
igsaw+0.005
ummy+0.004
 types+0.004
umm+0.004
opl+0.004
Neg logit contributions
 decided-0.002
 tried-0.002
 managed-0.002
 never-0.002
 remembered-0.002
Token The
Token ID383
Feature activation-1.19e-01

Pos logit contributions
igsaw+0.013
 types+0.011
ummy+0.011
umm+0.011
opl+0.011
Neg logit contributions
 tried-0.014
 decided-0.013
 accidentally-0.013
 managed-0.013
 remembered-0.013
Token flowers
Token ID12734
Feature activation+4.85e-03

Pos logit contributions
igsaw+0.237
ummy+0.202
 types+0.200
umm+0.194
opl+0.192
Neg logit contributions
 tried-0.214
 decided-0.212
 remembered-0.206
 managed-0.205
 never-0.203
Token fell
Token ID3214
Feature activation+7.85e-03

Pos logit contributions
 remembered+0.005
 managed+0.005
'd+0.005
Please+0.005
 unable+0.005
Neg logit contributions
 types-0.010
igsaw-0.010
 bou-0.010
ummy-0.010
 batch-0.010
Token brother
Token ID3956
Feature activation+5.66e-02

Pos logit contributions
igsaw+0.065
opl+0.058
 types+0.054
umm+0.053
Whe+0.050
Neg logit contributions
 tried-0.042
 never-0.041
 decided-0.040
 didn-0.040
 dropped-0.040
Token accidentally
Token ID14716
Feature activation+7.26e-02

Pos logit contributions
 unable+0.092
 managed+0.089
'd+0.086
 wouldn+0.081
 decided+0.081
Neg logit contributions
igsaw-0.099
ummy-0.096
 batch-0.092
 types-0.090
 Grand-0.089
Token knocked
Token ID13642
Feature activation-5.76e-02

Pos logit contributions
'd+0.068
 managed+0.065
 unable+0.063
 decided+0.063
 never+0.060
Neg logit contributions
igsaw-0.165
 art-0.153
 batch-0.152
 seats-0.151
 new-0.150
Token over
Token ID625
Feature activation-2.80e-02

Pos logit contributions
igsaw+0.123
ummy+0.109
 types+0.106
opl+0.102
 batch+0.102
Neg logit contributions
 managed-0.092
 tried-0.089
 decided-0.089
 remembered-0.089
 never-0.085
Token the
Token ID262
Feature activation-1.69e-01

Pos logit contributions
igsaw+0.060
ummy+0.054
umm+0.053
opl+0.052
 types+0.052
Neg logit contributions
 tried-0.044
 decided-0.044
 managed-0.043
 remembered-0.042
 never-0.042
Token v
Token ID410
Feature activation-2.12e-01

Pos logit contributions
igsaw+0.302
 types+0.255
opl+0.250
ummy+0.234
umm+0.230
Neg logit contributions
 tried-0.336
 dropped-0.330
 never-0.328
 accidentally-0.321
 decided-0.318
Tokenase
Token ID589
Feature activation+4.00e-03

Pos logit contributions
igsaw+0.350
 types+0.312
 bou+0.294
 difference+0.293
ummy+0.284
Neg logit contributions
 never-0.420
 tried-0.410
 decided-0.408
 managed-0.406
 accidentally-0.399
Token and
Token ID290
Feature activation+1.19e-02

Pos logit contributions
 managed+0.005
 tried+0.005
 decided+0.005
 remembered+0.005
 never+0.005
Neg logit contributions
igsaw-0.009
ummy-0.008
 types-0.008
opl-0.008
umm-0.008
Token it
Token ID340
Feature activation+6.73e-02

Pos logit contributions
'd+0.017
 managed+0.015
 unable+0.015
 stumbled+0.014
 remembered+0.014
Neg logit contributions
 types-0.022
 batch-0.022
 new-0.022
igsaw-0.022
 art-0.022
Token broke
Token ID6265
Feature activation-5.81e-02

Pos logit contributions
 managed+0.066
'd+0.063
 remembered+0.062
 decided+0.062
 unable+0.061
Neg logit contributions
igsaw-0.154
ummy-0.149
 new-0.147
ay-0.146
 types-0.145
Token into
Token ID656
Feature activation-6.81e-02

Pos logit contributions
igsaw+0.124
ummy+0.111
 types+0.107
umm+0.104
opl+0.102
Neg logit contributions
 managed-0.092
 tried-0.091
 decided-0.090
 remembered-0.090
 never-0.086
Token he
Token ID339
Feature activation+1.28e-01

Pos logit contributions
 managed+0.159
 tried+0.154
'd+0.153
 decided+0.153
 remembered+0.151
Neg logit contributions
igsaw-0.183
ummy-0.163
 types-0.154
 batch-0.153
opl-0.152
Token couldn
Token ID3521
Feature activation+4.93e-02

Pos logit contributions
pped+0.097
 unable+0.097
 prayed+0.091
 managed+0.089
Please+0.088
Neg logit contributions
 new-0.266
 seats-0.262
 gym-0.258
aj-0.255
ummy-0.252
Token't
Token ID470
Feature activation+9.21e-02

Pos logit contributions
 managed+0.054
 remembered+0.054
 decided+0.053
 tried+0.052
 wouldn+0.047
Neg logit contributions
igsaw-0.129
ummy-0.118
opl-0.114
ay-0.111
 types-0.109
Token catch
Token ID4929
Feature activation-4.63e-02

Pos logit contributions
 remembered+0.165
 wouldn+0.160
 managed+0.159
 unable+0.158
 accidentally+0.154
Neg logit contributions
igsaw-0.147
 difference-0.127
 new-0.126
 parties-0.124
Whe-0.123
Token the
Token ID262
Feature activation-1.49e-01

Pos logit contributions
igsaw+0.102
ummy+0.090
 types+0.088
opl+0.088
 new+0.088
Neg logit contributions
 remembered-0.069
 managed-0.068
'd-0.066
 decided-0.065
 tried-0.065
Token aer
Token ID9551
Feature activation-2.11e-01

Pos logit contributions
igsaw+0.274
ummy+0.233
 types+0.223
umm+0.220
opl+0.219
Neg logit contributions
 tried-0.287
 decided-0.284
 managed-0.279
 never-0.274
 dropped-0.274
Tokenopl
Token ID20106
Feature activation-1.79e-01

Pos logit contributions
igsaw+0.141
 types+0.100
ummy+0.088
umm+0.081
 bou+0.075
Neg logit contributions
 never-0.640
 managed-0.626
 tried-0.617
 decided-0.608
 didn-0.605
Tokenane
Token ID1531
Feature activation-2.90e-02

Pos logit contributions
igsaw+0.189
ummy+0.140
 types+0.136
umm+0.127
opl+0.121
Neg logit contributions
 managed-0.478
 tried-0.477
 decided-0.475
 remembered-0.469
 never-0.464
Token.
Token ID13
Feature activation-1.08e-02

Pos logit contributions
igsaw+0.061
ummy+0.054
 types+0.052
opl+0.052
umm+0.051
Neg logit contributions
 managed-0.046
 tried-0.045
 remembered-0.045
 decided-0.044
 stumbled-0.042
Token Suddenly
Token ID24975
Feature activation+1.13e-02

Pos logit contributions
igsaw+0.023
ummy+0.020
 types+0.019
umm+0.019
opl+0.019
Neg logit contributions
 tried-0.018
 decided-0.018
 managed-0.018
 remembered-0.017
 never-0.017
Token,
Token ID11
Feature activation+1.49e-02

Pos logit contributions
 managed+0.019
 decided+0.019
 tried+0.019
 remembered+0.018
 never+0.018
Neg logit contributions
igsaw-0.022
ummy-0.019
 types-0.018
opl-0.018
umm-0.018
Token hurt
Token ID5938
Feature activation-5.57e-02

Pos logit contributions
'd+0.021
 managed+0.020
 remembered+0.019
 stumbled+0.018
 decided+0.018
Neg logit contributions
 new-0.028
igsaw-0.028
ummy-0.026
 batch-0.025
 types-0.025
Token and
Token ID290
Feature activation+1.39e-02

Pos logit contributions
igsaw+0.112
ummy+0.107
 types+0.098
ay+0.094
 new+0.094
Neg logit contributions
 managed-0.091
 tried-0.087
 remembered-0.086
'd-0.084
 decided-0.083
Token couldn
Token ID3521
Feature activation+4.54e-02

Pos logit contributions
 managed+0.015
pped+0.014
'd+0.013
 stumbled+0.013
 remembered+0.013
Neg logit contributions
igsaw-0.031
 types-0.030
 new-0.030
ummy-0.030
 batch-0.029
Token't
Token ID470
Feature activation+9.00e-02

Pos logit contributions
 managed+0.047
 decided+0.045
 remembered+0.045
 tried+0.045
 stumbled+0.040
Neg logit contributions
igsaw-0.123
ummy-0.112
opl-0.108
 types-0.106
umm-0.106
Token help
Token ID1037
Feature activation-1.11e-02

Pos logit contributions
 managed+0.182
 accidentally+0.175
 remembered+0.173
 unable+0.166
 stumbled+0.164
Neg logit contributions
igsaw-0.124
 birthday-0.123
 difference-0.121
ummy-0.110
ay-0.109
Token him
Token ID683
Feature activation+7.92e-02

Pos logit contributions
ummy+0.020
igsaw+0.019
 new+0.019
 difference+0.017
 birthday+0.017
Neg logit contributions
 managed-0.021
 remembered-0.020
 tried-0.019
 stumbled-0.019
 decided-0.019
Token.
Token ID13
Feature activation-1.02e-02

Pos logit contributions
 managed+0.135
 remembered+0.133
 tried+0.129
 decided+0.127
 stumbled+0.126
Neg logit contributions
igsaw-0.156
ummy-0.142
 difference-0.137
 new-0.132
 types-0.130
Token<|endoftext|>
Token ID50256
Feature activation-1.07e-01

Pos logit contributions
igsaw+0.020
ummy+0.017
 types+0.017
umm+0.016
opl+0.016
Neg logit contributions
 tried-0.018
 decided-0.018
 managed-0.018
 remembered-0.018
 never-0.018
TokenOnce
Token ID7454
Feature activation-1.95e-02

Pos logit contributions
igsaw+0.210
ummy+0.180
 types+0.179
umm+0.176
opl+0.172
Neg logit contributions
 tried-0.194
 decided-0.190
 managed-0.190
 remembered-0.184
 never-0.184
Token upon
Token ID2402
Feature activation-2.13e-02

Pos logit contributions
igsaw+0.041
umm+0.038
opl+0.036
 types+0.035
 coll+0.035
Neg logit contributions
 tried-0.033
 could-0.032
 didn-0.032
 remembered-0.032
 accidentally-0.032
Token a
Token ID257
Feature activation-8.09e-02

Pos logit contributions
igsaw+0.034
 new+0.029
ummy+0.028
 types+0.028
 birthday+0.027
Neg logit contributions
 managed-0.043
 wouldn-0.042
 remembered-0.042
 tried-0.041
 decided-0.040
Tokenfish
Token ID11084
Feature activation+3.40e-02

Pos logit contributions
igsaw+0.127
ummy+0.112
 types+0.109
umm+0.107
opl+0.106
Neg logit contributions
 managed-0.093
 tried-0.093
 decided-0.092
 remembered-0.090
 never-0.088
Token a
Token ID257
Feature activation-8.39e-02

Pos logit contributions
 managed+0.054
 tried+0.054
 decided+0.053
 remembered+0.052
 never+0.051
Neg logit contributions
igsaw-0.072
ummy-0.063
 types-0.062
umm-0.060
opl-0.060
Token nice
Token ID3621
Feature activation-7.61e-02

Pos logit contributions
igsaw+0.147
ummy+0.126
 types+0.122
umm+0.119
opl+0.119
Neg logit contributions
 managed-0.166
 tried-0.166
 decided-0.164
 remembered-0.161
 never-0.158
Token home
Token ID1363
Feature activation-2.67e-02

Pos logit contributions
igsaw+0.145
ummy+0.125
opl+0.123
umm+0.121
 types+0.121
Neg logit contributions
 tried-0.140
 managed-0.137
 decided-0.137
 remembered-0.134
 never-0.131
Token.
Token ID13
Feature activation-1.71e-02

Pos logit contributions
igsaw+0.058
ummy+0.052
 types+0.050
umm+0.048
opl+0.048
Neg logit contributions
 managed-0.041
 tried-0.041
 remembered-0.040
 decided-0.040
 accidentally-0.038
Token He
Token ID679
Feature activation+1.52e-01

Pos logit contributions
igsaw+0.038
ummy+0.034
 types+0.034
umm+0.034
opl+0.033
Neg logit contributions
 tried-0.026
 decided-0.025
 managed-0.025
 could-0.024
 accidentally-0.024
Token grabbed
Token ID13176
Feature activation-2.25e-02

Pos logit contributions
 unable+0.141
 managed+0.139
pped+0.135
 stumbled+0.126
 remembered+0.123
Neg logit contributions
igsaw-0.358
 types-0.356
 batch-0.345
 bou-0.335
 seats-0.331
Token some
Token ID617
Feature activation-9.03e-02

Pos logit contributions
igsaw+0.051
ummy+0.047
opl+0.047
umm+0.047
aj+0.045
Neg logit contributions
 tried-0.036
 managed-0.033
 decided-0.032
 dropped-0.032
 never-0.031
Token seaw
Token ID32085
Feature activation-1.30e-01

Pos logit contributions
igsaw+0.169
opl+0.141
umm+0.140
ummy+0.140
 types+0.131
Neg logit contributions
 tried-0.178
 decided-0.173
 managed-0.169
 remembered-0.165
 dropped-0.163
Tokeneed
Token ID2308
Feature activation-1.11e-02

Pos logit contributions
igsaw+0.161
ummy+0.134
 types+0.119
umm+0.118
ay+0.113
Neg logit contributions
 decided-0.326
 tried-0.325
 managed-0.324
 remembered-0.322
 never-0.316
Token and
Token ID290
Feature activation+2.34e-03

Pos logit contributions
igsaw+0.026
ummy+0.024
 types+0.023
umm+0.022
opl+0.022
Neg logit contributions
 managed-0.015
 tried-0.015
 remembered-0.014
 decided-0.014
 accidentally-0.014
Token woman
Token ID2415
Feature activation+5.13e-02

Pos logit contributions
igsaw+0.185
 new+0.160
ummy+0.158
 types+0.157
 difference+0.157
Neg logit contributions
 managed-0.269
 accidentally-0.253
 stumbled-0.251
 tried-0.251
 decided-0.250
Token was
Token ID373
Feature activation+6.67e-02

Pos logit contributions
 unable+0.049
 stumbled+0.049
pped+0.049
 managed+0.049
 accidentally+0.046
Neg logit contributions
 batch-0.102
igsaw-0.102
 types-0.100
 difference-0.099
ummy-0.098
Token still
Token ID991
Feature activation+7.63e-02

Pos logit contributions
 wouldn+0.095
 decided+0.091
'd+0.091
 tried+0.090
 managed+0.090
Neg logit contributions
igsaw-0.133
 new-0.124
 batch-0.116
 difference-0.115
opl-0.112
Token scared
Token ID12008
Feature activation-3.62e-02

Pos logit contributions
 managed+0.109
 decided+0.107
 tried+0.107
'd+0.103
 remembered+0.102
Neg logit contributions
igsaw-0.170
ummy-0.149
 types-0.147
 new-0.142
opl-0.142
Token.
Token ID13
Feature activation-2.21e-03

Pos logit contributions
igsaw+0.072
ummy+0.068
 types+0.062
umm+0.061
 difference+0.061
Neg logit contributions
 managed-0.061
 remembered-0.058
 tried-0.058
 decided-0.057
'd-0.056
Token She
Token ID1375
Feature activation+1.57e-01

Pos logit contributions
igsaw+0.005
 types+0.004
ummy+0.004
opl+0.004
 bou+0.004
Neg logit contributions
 decided-0.003
 tried-0.003
 managed-0.003
 never-0.003
 remembered-0.003
Token decided
Token ID3066
Feature activation+5.27e-02

Pos logit contributions
pped+0.163
 unable+0.145
 managed+0.136
 stumbled+0.132
Please+0.124
Neg logit contributions
 new-0.307
 types-0.297
 batch-0.297
 seats-0.296
 bou-0.293
Token to
Token ID284
Feature activation+4.39e-02

Pos logit contributions
 managed+0.093
'd+0.090
 tried+0.089
 remembered+0.088
 stumbled+0.085
Neg logit contributions
igsaw-0.083
 new-0.080
ummy-0.075
 types-0.072
 art-0.072
Token call
Token ID869
Feature activation+2.14e-02

Pos logit contributions
 managed+0.068
 remembered+0.067
 unable+0.065
 decided+0.064
 accidentally+0.064
Neg logit contributions
igsaw-0.085
 difference-0.079
ummy-0.076
 types-0.074
opl-0.073
Token the
Token ID262
Feature activation-1.42e-01

Pos logit contributions
 tried+0.033
 decided+0.033
 never+0.032
 managed+0.032
 remembered+0.031
Neg logit contributions
igsaw-0.048
umm-0.042
ummy-0.042
opl-0.041
 types-0.041
Token police
Token ID1644
Feature activation-1.65e-02

Pos logit contributions
igsaw+0.223
 difference+0.201
 new+0.199
ummy+0.197
 types+0.192
Neg logit contributions
 managed-0.294
 tried-0.278
 decided-0.276
 stumbled-0.271
 accidentally-0.271
Token his
Token ID465
Feature activation-4.41e-02

Pos logit contributions
igsaw+0.059
ummy+0.053
 types+0.051
umm+0.051
opl+0.051
Neg logit contributions
 tried-0.034
 decided-0.033
 managed-0.033
 remembered-0.032
 never-0.032
Token friend
Token ID1545
Feature activation+1.60e-02

Pos logit contributions
igsaw+0.092
opl+0.078
 types+0.076
umm+0.076
ummy+0.073
Neg logit contributions
 tried-0.078
 remembered-0.075
 decided-0.075
 never-0.073
 accidentally-0.073
Token,
Token ID11
Feature activation+6.59e-03

Pos logit contributions
 tried+0.026
 decided+0.026
 managed+0.025
 remembered+0.025
 accidentally+0.024
Neg logit contributions
igsaw-0.035
ummy-0.030
 types-0.030
umm-0.030
opl-0.030
Token a
Token ID257
Feature activation-8.47e-02

Pos logit contributions
 managed+0.013
 tried+0.012
 decided+0.012
 remembered+0.012
 never+0.012
Neg logit contributions
igsaw-0.012
ummy-0.010
 types-0.010
 batch-0.009
 bou-0.009
Token little
Token ID1310
Feature activation-8.33e-02

Pos logit contributions
igsaw+0.133
ummy+0.112
 types+0.107
umm+0.103
opl+0.102
Neg logit contributions
 managed-0.182
 tried-0.181
 decided-0.180
 remembered-0.177
 never-0.174
Token girl
Token ID2576
Feature activation+8.45e-02

Pos logit contributions
igsaw+0.150
ummy+0.127
opl+0.124
 types+0.123
umm+0.121
Neg logit contributions
 tried-0.163
 decided-0.160
 managed-0.160
 remembered-0.158
 accidentally-0.153
Token named
Token ID3706
Feature activation-2.78e-02

Pos logit contributions
 managed+0.128
 wouldn+0.119
 unable+0.117
'd+0.113
 almost+0.111
Neg logit contributions
igsaw-0.160
ummy-0.160
 birthday-0.154
 batch-0.151
 bou-0.151
Token Sue
Token ID26089
Feature activation+5.56e-02

Pos logit contributions
igsaw+0.052
opl+0.044
ummy+0.044
 types+0.044
umm+0.043
Neg logit contributions
 tried-0.053
 decided-0.052
 managed-0.052
 remembered-0.051
 never-0.050
Token.
Token ID13
Feature activation-1.29e-02

Pos logit contributions
 managed+0.108
 unable+0.101
 wouldn+0.098
 remembered+0.096
pped+0.096
Neg logit contributions
igsaw-0.079
ummy-0.078
 new-0.075
 types-0.074
 bou-0.073
Token They
Token ID1119
Feature activation+1.44e-01

Pos logit contributions
igsaw+0.026
ummy+0.022
umm+0.022
 types+0.022
opl+0.022
Neg logit contributions
 tried-0.023
 decided-0.022
 managed-0.022
 remembered-0.021
 didn-0.021
Token wanted
Token ID2227
Feature activation+3.93e-02

Pos logit contributions
 unable+0.170
 managed+0.163
pped+0.157
 wouldn+0.144
 never+0.138
Neg logit contributions
 types-0.301
 bou-0.288
 batch-0.285
 parties-0.282
 Grand-0.280
Token thought
Token ID1807
Feature activation+4.66e-02

Pos logit contributions
 unable+0.108
pped+0.106
 managed+0.106
 stumbled+0.090
 remembered+0.089
Neg logit contributions
 types-0.368
igsaw-0.351
 batch-0.344
 seats-0.342
 goals-0.341
Token marking
Token ID18730
Feature activation-2.03e-02

Pos logit contributions
 managed+0.063
 tried+0.063
 decided+0.062
 remembered+0.060
 never+0.059
Neg logit contributions
igsaw-0.111
ummy-0.098
 types-0.097
umm-0.096
opl-0.095
Token things
Token ID1243
Feature activation+7.19e-02

Pos logit contributions
igsaw+0.044
ummy+0.038
umm+0.037
 types+0.037
opl+0.036
Neg logit contributions
 tried-0.032
 managed-0.032
 decided-0.032
 remembered-0.031
 never-0.030
Token made
Token ID925
Feature activation-1.45e-02

Pos logit contributions
 remembered+0.108
 managed+0.108
 decided+0.107
 tried+0.106
 stumbled+0.103
Neg logit contributions
igsaw-0.154
ummy-0.137
 types-0.134
opl-0.131
umm-0.127
Token him
Token ID683
Feature activation+9.05e-02

Pos logit contributions
igsaw+0.038
ummy+0.034
 types+0.034
umm+0.033
opl+0.033
Neg logit contributions
 managed-0.016
 tried-0.016
 decided-0.015
 remembered-0.015
 never-0.015
Token seem
Token ID1283
Feature activation+6.74e-02

Pos logit contributions
 managed+0.126
 tried+0.125
 remembered+0.124
 decided+0.123
 stumbled+0.119
Neg logit contributions
igsaw-0.204
ummy-0.179
 types-0.178
opl-0.174
 difference-0.172
Token more
Token ID517
Feature activation-5.41e-02

Pos logit contributions
 managed+0.114
 tried+0.113
 decided+0.112
 remembered+0.111
 never+0.108
Neg logit contributions
igsaw-0.135
ummy-0.118
 types-0.116
opl-0.111
umm-0.111
Token important
Token ID1593
Feature activation-6.07e-02

Pos logit contributions
igsaw+0.098
ummy+0.084
 types+0.083
opl+0.079
umm+0.078
Neg logit contributions
 managed-0.102
 tried-0.102
 decided-0.101
 remembered-0.100
 stumbled-0.097
Token.
Token ID13
Feature activation-1.08e-02

Pos logit contributions
igsaw+0.131
ummy+0.117
 types+0.114
umm+0.110
opl+0.109
Neg logit contributions
 managed-0.095
 tried-0.094
 decided-0.092
 remembered-0.092
 stumbled-0.088
Token One
Token ID1881
Feature activation-4.54e-02

Pos logit contributions
igsaw+0.024
 bou+0.021
 types+0.021
umm+0.021
ummy+0.021
Neg logit contributions
 decided-0.017
 tried-0.017
 could-0.016
 never-0.016
 accidentally-0.016
Token day
Token ID1110
Feature activation+1.57e-02

Pos logit contributions
igsaw+0.068
ummy+0.055
umm+0.053
opl+0.052
 types+0.052
Neg logit contributions
 tried-0.104
 decided-0.104
 managed-0.102
 remembered-0.102
 never-0.099
Token bag
Token ID6131
Feature activation-2.84e-02

Pos logit contributions
igsaw+0.236
ummy+0.198
aj+0.192
umm+0.190
opl+0.189
Neg logit contributions
 tried-0.288
 managed-0.282
 decided-0.278
 dropped-0.273
 never-0.270
Token.
Token ID13
Feature activation+5.36e-04

Pos logit contributions
igsaw+0.054
ummy+0.049
 types+0.045
umm+0.044
ay+0.044
Neg logit contributions
 tried-0.051
 remembered-0.051
 managed-0.050
 decided-0.049
 accidentally-0.048
Token\n
Token ID198
Feature activation-4.95e-02

Pos logit contributions
 tried+0.001
 decided+0.001
 managed+0.001
 could+0.001
 never+0.001
Neg logit contributions
igsaw-0.001
ummy-0.001
 bou-0.001
 coll-0.001
opl-0.001
Token\n
Token ID198
Feature activation-4.63e-02

Pos logit contributions
igsaw+0.103
umm+0.090
ummy+0.090
 types+0.088
 bou+0.085
Neg logit contributions
 tried-0.086
 managed-0.085
 decided-0.084
 never-0.083
 remembered-0.081
TokenHe
Token ID1544
Feature activation+1.51e-01

Pos logit contributions
igsaw+0.104
ummy+0.089
opl+0.088
 types+0.088
ay+0.086
Neg logit contributions
 managed-0.066
 tried-0.066
 remembered-0.065
 wouldn-0.064
 decided-0.064
Token shouted
Token ID17293
Feature activation-4.25e-03

Pos logit contributions
 unable+0.125
pped+0.119
 managed+0.117
'd+0.103
 remembered+0.103
Neg logit contributions
 types-0.350
 batch-0.346
igsaw-0.343
 seats-0.341
 new-0.328
Token excited
Token ID6568
Feature activation-6.90e-02

Pos logit contributions
igsaw+0.010
 types+0.009
umm+0.008
opl+0.008
ummy+0.008
Neg logit contributions
 tried-0.006
 decided-0.006
 managed-0.006
 remembered-0.006
 never-0.006
Tokenly
Token ID306
Feature activation+5.45e-02

Pos logit contributions
igsaw+0.152
ummy+0.141
 types+0.134
umm+0.129
 batch+0.126
Neg logit contributions
 managed-0.107
 tried-0.102
 remembered-0.101
 decided-0.100
 stumbled-0.097
Token to
Token ID284
Feature activation+3.35e-02

Pos logit contributions
 managed+0.101
 decided+0.098
 tried+0.097
 remembered+0.096
'd+0.095
Neg logit contributions
igsaw-0.102
ummy-0.091
 types-0.083
 batch-0.081
ay-0.080
Token his
Token ID465
Feature activation-4.76e-02

Pos logit contributions
 managed+0.057
 tried+0.056
 decided+0.056
 remembered+0.055
 stumbled+0.053
Neg logit contributions
igsaw-0.068
ummy-0.059
 types-0.058
opl-0.057
umm-0.055
Token friends
Token ID2460
Feature activation+4.69e-02

Pos logit contributions
igsaw+0.103
opl+0.092
Whe+0.091
umm+0.087
 types+0.086
Neg logit contributions
 remembered-0.076
 tried-0.075
 decided-0.073
 managed-0.069
 never-0.069
Token named
Token ID3706
Feature activation-2.32e-02

Pos logit contributions
 managed+0.117
 remembered+0.104
 tried+0.103
 wouldn+0.101
'd+0.099
Neg logit contributions
igsaw-0.220
ummy-0.211
 types-0.201
 batch-0.200
opl-0.195
Token Jessica
Token ID17352
Feature activation+6.10e-02

Pos logit contributions
igsaw+0.041
opl+0.036
 types+0.036
umm+0.036
ummy+0.035
Neg logit contributions
 tried-0.045
 decided-0.045
 managed-0.044
 accidentally-0.044
 never-0.044
Token.
Token ID13
Feature activation-3.18e-03

Pos logit contributions
 managed+0.103
 tried+0.097
 remembered+0.095
 decided+0.093
 wouldn+0.092
Neg logit contributions
igsaw-0.119
ummy-0.107
 types-0.105
 bou-0.100
opl-0.099
Token She
Token ID1375
Feature activation+1.46e-01

Pos logit contributions
igsaw+0.007
opl+0.006
ummy+0.006
umm+0.006
 batch+0.006
Neg logit contributions
 decided-0.005
 tried-0.005
 remembered-0.005
 managed-0.005
 never-0.005
Token loved
Token ID6151
Feature activation-2.13e-02

Pos logit contributions
 managed+0.183
 unable+0.174
pped+0.163
 remembered+0.161
 tried+0.159
Neg logit contributions
igsaw-0.317
 types-0.301
ummy-0.291
 batch-0.280
 art-0.277
Token playing
Token ID2712
Feature activation-2.44e-02

Pos logit contributions
igsaw+0.051
ummy+0.045
umm+0.044
opl+0.044
 types+0.043
Neg logit contributions
 tried-0.030
 decided-0.029
 never-0.028
 managed-0.027
 accidentally-0.027
Token with
Token ID351
Feature activation-3.46e-02

Pos logit contributions
igsaw+0.052
ummy+0.046
 types+0.045
umm+0.044
opl+0.043
Neg logit contributions
 managed-0.039
 tried-0.038
 decided-0.038
 remembered-0.038
 never-0.036
Token glue
Token ID22749
Feature activation+2.33e-02

Pos logit contributions
igsaw+0.088
umm+0.082
opl+0.080
ummy+0.079
 difference+0.079
Neg logit contributions
 tried-0.042
 decided-0.040
 accidentally-0.039
 dropped-0.037
 never-0.036
Token and
Token ID290
Feature activation-6.92e-03

Pos logit contributions
 managed+0.032
 tried+0.032
 decided+0.031
 remembered+0.031
 never+0.030
Neg logit contributions
igsaw-0.055
ummy-0.049
 types-0.048
umm-0.047
opl-0.047
Token making
Token ID1642
Feature activation-2.06e-02

Pos logit contributions
igsaw+0.010
 types+0.009
ummy+0.009
 coll+0.009
 art+0.009
Neg logit contributions
 managed-0.015
 remembered-0.014
 tried-0.014
 decided-0.014
 stumbled-0.014
Token things
Token ID1243
Feature activation+6.68e-02

Pos logit contributions
igsaw+0.034
opl+0.030
ummy+0.028
umm+0.028
aj+0.026
Neg logit contributions
 tried-0.044
 decided-0.043
 managed-0.042
 accidentally-0.042
 dropped-0.041
Token,
Token ID11
Feature activation-4.11e-03

Pos logit contributions
igsaw+0.000
ummy+0.000
 types+0.000
 bou+0.000
ay+0.000
Neg logit contributions
 remembered-0.000
 managed-0.000
 decided-0.000
 tried-0.000
 accidentally-0.000
Token on
Token ID319
Feature activation-3.43e-02

Pos logit contributions
igsaw+0.007
 bou+0.007
 new+0.007
 types+0.007
 seats+0.007
Neg logit contributions
 managed-0.006
 remembered-0.006
'd-0.006
 wouldn-0.006
 tried-0.006
Token top
Token ID1353
Feature activation-3.97e-02

Pos logit contributions
igsaw+0.063
ummy+0.054
 types+0.053
umm+0.052
opl+0.051
Neg logit contributions
 tried-0.065
 managed-0.065
 decided-0.064
 remembered-0.063
 never-0.062
Token of
Token ID286
Feature activation-6.98e-02

Pos logit contributions
igsaw+0.081
ummy+0.069
 types+0.068
umm+0.068
opl+0.066
Neg logit contributions
 tried-0.068
 managed-0.067
 decided-0.067
 never-0.064
 remembered-0.064
Token trees
Token ID7150
Feature activation+7.37e-03

Pos logit contributions
igsaw+0.136
ummy+0.118
 types+0.116
umm+0.112
opl+0.111
Neg logit contributions
 managed-0.123
 tried-0.122
 decided-0.121
 remembered-0.120
 never-0.116
Token,
Token ID11
Feature activation-1.13e-02

Pos logit contributions
 remembered+0.013
 managed+0.013
 decided+0.013
 tried+0.013
 accidentally+0.012
Neg logit contributions
igsaw-0.013
ummy-0.012
 types-0.011
opl-0.011
 bou-0.011
Token and
Token ID290
Feature activation-3.23e-03

Pos logit contributions
igsaw+0.023
 types+0.021
ummy+0.021
 bou+0.021
 coll+0.020
Neg logit contributions
 managed-0.017
 tried-0.017
 remembered-0.016
 decided-0.016
 accidentally-0.016
Token near
Token ID1474
Feature activation-5.25e-02

Pos logit contributions
igsaw+0.006
ummy+0.005
 types+0.005
umm+0.005
 bou+0.005
Neg logit contributions
 managed-0.006
 tried-0.006
 decided-0.006
 remembered-0.006
 never-0.006
Token the
Token ID262
Feature activation-1.54e-01

Pos logit contributions
igsaw+0.105
ummy+0.089
umm+0.086
opl+0.085
 types+0.085
Neg logit contributions
 tried-0.094
 decided-0.094
 managed-0.093
 remembered-0.091
 never-0.091
Token river
Token ID7850
Feature activation-8.43e-03

Pos logit contributions
igsaw+0.250
ummy+0.207
 types+0.201
opl+0.194
umm+0.193
Neg logit contributions
 decided-0.326
 tried-0.326
 managed-0.324
 remembered-0.319
 never-0.317
Token.
Token ID13
Feature activation-1.69e-02

Pos logit contributions
igsaw+0.019
ummy+0.017
 types+0.016
opl+0.016
umm+0.016
Neg logit contributions
 managed-0.012
 tried-0.012
 decided-0.012
 remembered-0.012
 never-0.011
Token said
Token ID531
Feature activation-9.60e-03

Pos logit contributions
 managed+0.067
 decided+0.065
 tried+0.065
 remembered+0.064
 stumbled+0.062
Neg logit contributions
igsaw-0.137
 types-0.126
ummy-0.126
 batch-0.122
 bou-0.118
Token,
Token ID11
Feature activation+1.21e-02

Pos logit contributions
igsaw+0.023
opl+0.021
umm+0.020
 types+0.020
ummy+0.020
Neg logit contributions
 tried-0.014
 decided-0.013
 managed-0.013
 remembered-0.013
 never-0.012
Token "
Token ID366
Feature activation+1.81e-02

Pos logit contributions
 managed+0.024
 decided+0.024
 tried+0.024
 remembered+0.023
 never+0.023
Neg logit contributions
igsaw-0.020
ummy-0.018
 types-0.016
umm-0.016
 batch-0.016
TokenGood
Token ID10248
Feature activation-4.47e-02

Pos logit contributions
 decided+0.029
 tried+0.028
 managed+0.028
 remembered+0.027
 never+0.027
Neg logit contributions
igsaw-0.040
ummy-0.036
 types-0.034
opl-0.034
aj-0.033
Token job
Token ID1693
Feature activation+7.37e-03

Pos logit contributions
igsaw+0.071
ummy+0.056
 types+0.056
opl+0.055
umm+0.052
Neg logit contributions
 tried-0.098
 decided-0.097
 managed-0.096
 remembered-0.095
 never-0.093
Token.
Token ID13
Feature activation-1.78e-02

Pos logit contributions
 tried+0.010
 managed+0.010
 decided+0.010
 didn+0.010
 remembered+0.010
Neg logit contributions
igsaw-0.018
opl-0.016
 types-0.015
umm-0.015
ummy-0.015
Token Now
Token ID2735
Feature activation+6.70e-02

Pos logit contributions
igsaw+0.031
 types+0.027
 bou+0.026
 coll+0.026
ummy+0.026
Neg logit contributions
 tried-0.034
 never-0.034
 managed-0.034
 decided-0.033
 could-0.033
Token we
Token ID356
Feature activation+1.04e-01

Pos logit contributions
 managed+0.091
 tried+0.091
 decided+0.090
 remembered+0.088
 never+0.085
Neg logit contributions
igsaw-0.159
ummy-0.142
 types-0.139
umm-0.136
opl-0.136
Token need
Token ID761
Feature activation+1.35e-02

Pos logit contributions
 unable+0.161
 managed+0.146
 stumbled+0.144
 decided+0.144
 remembered+0.144
Neg logit contributions
igsaw-0.198
ay-0.183
 difference-0.183
 art-0.177
ummy-0.176
Token to
Token ID284
Feature activation+3.21e-02

Pos logit contributions
 tried+0.021
 decided+0.020
 managed+0.020
 never+0.020
 remembered+0.020
Neg logit contributions
igsaw-0.031
opl-0.027
umm-0.026
ummy-0.026
 types-0.026
Token put
Token ID1234
Feature activation-4.70e-02

Pos logit contributions
 managed+0.056
 decided+0.055
 remembered+0.055
 tried+0.054
 stumbled+0.053
Neg logit contributions
igsaw-0.061
ummy-0.055
 types-0.052
 art-0.051
 difference-0.051
Token did
Token ID750
Feature activation+6.36e-02

Pos logit contributions
 unable+0.079
pped+0.070
 managed+0.069
 accidentally+0.065
 tried+0.064
Neg logit contributions
igsaw-0.281
 types-0.274
 art-0.261
 new-0.257
ummy-0.251
Token that
Token ID326
Feature activation+1.33e-02

Pos logit contributions
 managed+0.095
 tried+0.095
 remembered+0.094
 decided+0.093
 stumbled+0.089
Neg logit contributions
igsaw-0.138
ummy-0.122
 types-0.120
opl-0.116
 art-0.113
Token day
Token ID1110
Feature activation+1.64e-02

Pos logit contributions
 tried+0.024
 decided+0.024
 managed+0.024
 never+0.024
 remembered+0.024
Neg logit contributions
igsaw-0.025
ummy-0.022
 types-0.021
umm-0.021
aj-0.021
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
 managed+0.023
 tried+0.023
 decided+0.023
 remembered+0.022
 never+0.022
Neg logit contributions
igsaw-0.038
ummy-0.034
 types-0.033
umm-0.032
opl-0.032
Token␣
Token ID220
Feature activation-4.16e-03

Pos logit contributions
igsaw+0.029
 types+0.026
umm+0.026
 coll+0.025
ummy+0.025
Neg logit contributions
 tried-0.020
 decided-0.020
 could-0.019
 accidentally-0.019
 managed-0.019
Token\n
Token ID198
Feature activation-4.67e-02

Pos logit contributions
igsaw+0.010
umm+0.009
 types+0.009
 bou+0.009
 batch+0.008
Neg logit contributions
 could-0.006
 managed-0.006
 tried-0.006
 never-0.005
 decided-0.005
Token\n
Token ID198
Feature activation-4.43e-02

Pos logit contributions
igsaw+0.104
umm+0.098
ummy+0.093
 types+0.093
 difference+0.089
Neg logit contributions
 decided-0.071
 never-0.071
 tried-0.071
 managed-0.069
 could-0.066
TokenWhen
Token ID2215
Feature activation+4.01e-02

Pos logit contributions
igsaw+0.096
ummy+0.086
 types+0.084
umm+0.083
opl+0.081
Neg logit contributions
 decided-0.069
 managed-0.069
 tried-0.069
 never-0.067
 remembered-0.066
Token she
Token ID673
Feature activation+1.20e-01

Pos logit contributions
 tried+0.053
 managed+0.052
 decided+0.052
 remembered+0.050
 never+0.049
Neg logit contributions
igsaw-0.098
ummy-0.087
 types-0.086
opl-0.085
umm-0.084
Token finished
Token ID5201
Feature activation-6.20e-02

Pos logit contributions
 managed+0.123
 unable+0.121
 decided+0.119
 remembered+0.116
 tried+0.116
Neg logit contributions
igsaw-0.303
ummy-0.268
 types-0.267
 batch-0.261
ay-0.259
Token,
Token ID11
Feature activation+1.76e-02

Pos logit contributions
igsaw+0.126
umm+0.113
 types+0.111
ummy+0.110
opl+0.108
Neg logit contributions
 tried-0.105
 decided-0.102
 managed-0.101
 remembered-0.101
 accidentally-0.100
Token a
Token ID257
Feature activation-7.55e-02

Pos logit contributions
 decided+0.085
 tried+0.085
 never+0.082
 managed+0.080
 accidentally+0.080
Neg logit contributions
igsaw-0.157
 types-0.142
umm-0.141
opl-0.138
 difference-0.138
Token little
Token ID1310
Feature activation-7.95e-02

Pos logit contributions
igsaw+0.117
ummy+0.100
 types+0.093
umm+0.091
 batch+0.090
Neg logit contributions
 managed-0.164
 tried-0.161
 decided-0.160
 remembered-0.157
 never-0.155
Token pig
Token ID12967
Feature activation+3.25e-02

Pos logit contributions
igsaw+0.143
ummy+0.120
 types+0.118
opl+0.117
umm+0.115
Neg logit contributions
 tried-0.154
 decided-0.152
 managed-0.151
 remembered-0.150
 never-0.145
Tokengy
Token ID1360
Feature activation+3.38e-02

Pos logit contributions
 managed+0.043
 tried+0.039
 wouldn+0.037
 remembered+0.037
 decided+0.036
Neg logit contributions
igsaw-0.072
ummy-0.072
opl-0.069
 types-0.069
 batch-0.066
Token named
Token ID3706
Feature activation-2.84e-02

Pos logit contributions
 managed+0.042
 tried+0.041
 decided+0.041
 remembered+0.040
 never+0.038
Neg logit contributions
igsaw-0.084
ummy-0.075
 types-0.074
umm-0.072
opl-0.072
Token Pe
Token ID2631
Feature activation-8.11e-02

Pos logit contributions
igsaw+0.050
ummy+0.045
 types+0.042
umm+0.041
 batch+0.040
Neg logit contributions
 managed-0.055
 tried-0.054
 decided-0.054
 remembered-0.054
 stumbled-0.051
Tokenppa
Token ID44989
Feature activation+4.62e-02

Pos logit contributions
igsaw+0.181
ummy+0.174
aj+0.167
opl+0.166
 coll+0.163
Neg logit contributions
 managed-0.108
 tried-0.108
 decided-0.107
 stumbled-0.099
 wouldn-0.099
Token.
Token ID13
Feature activation-7.04e-03

Pos logit contributions
 managed+0.077
 tried+0.074
 decided+0.071
 remembered+0.070
 stumbled+0.070
Neg logit contributions
igsaw-0.089
ummy-0.084
 types-0.082
opl-0.079
 coll-0.077
Token Pe
Token ID2631
Feature activation-7.17e-02

Pos logit contributions
igsaw+0.013
ummy+0.011
 types+0.011
opl+0.011
umm+0.011
Neg logit contributions
 decided-0.013
 tried-0.013
 managed-0.013
 remembered-0.013
 never-0.012
Tokenppa
Token ID44989
Feature activation+5.40e-02

Pos logit contributions
ummy+0.168
igsaw+0.165
aj+0.161
opl+0.158
 types+0.157
Neg logit contributions
 managed-0.082
 tried-0.081
 decided-0.079
 stumbled-0.075
 wouldn-0.071
Token lived
Token ID5615
Feature activation-1.52e-02

Pos logit contributions
 managed+0.076
 unable+0.072
 tried+0.070
 wouldn+0.067
 accidentally+0.067
Neg logit contributions
igsaw-0.108
 types-0.108
ummy-0.106
opl-0.101
aj-0.097
Token closer
Token ID5699
Feature activation-2.03e-02

Pos logit contributions
 decided+0.010
 managed+0.010
 tried+0.010
 remembered+0.010
 never+0.010
Neg logit contributions
igsaw-0.020
ummy-0.018
opl-0.018
umm-0.018
 types-0.017
Token family
Token ID1641
Feature activation+3.07e-02

Pos logit contributions
igsaw+0.048
ummy+0.042
umm+0.041
 types+0.040
opl+0.040
Neg logit contributions
 tried-0.029
 decided-0.029
 managed-0.028
 never-0.028
 remembered-0.028
Token.
Token ID13
Feature activation-1.02e-02

Pos logit contributions
 managed+0.040
 tried+0.040
 decided+0.039
 remembered+0.039
 accidentally+0.037
Neg logit contributions
igsaw-0.072
ummy-0.065
 types-0.064
umm-0.062
opl-0.062
Token\n
Token ID198
Feature activation-5.68e-02

Pos logit contributions
igsaw+0.021
 types+0.018
ummy+0.018
umm+0.018
 bou+0.018
Neg logit contributions
 decided-0.017
 tried-0.017
 never-0.017
 managed-0.016
 stumbled-0.016
Token\n
Token ID198
Feature activation-5.25e-02

Pos logit contributions
igsaw+0.122
umm+0.115
ummy+0.106
 types+0.106
 bou+0.099
Neg logit contributions
 never-0.096
 decided-0.095
 tried-0.092
 managed-0.091
 could-0.088
TokenThe
Token ID464
Feature activation-1.23e-01

Pos logit contributions
igsaw+0.110
ummy+0.098
 types+0.095
umm+0.095
opl+0.090
Neg logit contributions
 decided-0.088
 managed-0.087
 tried-0.086
 never-0.085
 remembered-0.083
Token moral
Token ID6573
Feature activation-7.21e-02

Pos logit contributions
igsaw+0.252
ummy+0.207
opl+0.203
 types+0.203
umm+0.200
Neg logit contributions
 decided-0.219
 remembered-0.216
 tried-0.214
 never-0.209
 didn-0.204
Token of
Token ID286
Feature activation-2.62e-02

Pos logit contributions
igsaw+0.121
ummy+0.101
umm+0.097
 types+0.097
opl+0.095
Neg logit contributions
 decided-0.149
 tried-0.149
 managed-0.148
 never-0.145
 remembered-0.144
Token this
Token ID428
Feature activation-7.59e-02

Pos logit contributions
igsaw+0.056
umm+0.052
aj+0.049
ummy+0.048
 types+0.047
Neg logit contributions
 never-0.045
 tried-0.043
 decided-0.042
 didn-0.041
 remembered-0.041
Token story
Token ID1621
Feature activation+2.72e-02

Pos logit contributions
igsaw+0.103
umm+0.084
 types+0.082
Whe+0.081
aj+0.080
Neg logit contributions
 never-0.178
 decided-0.177
 tried-0.177
 didn-0.176
 remembered-0.175
Token is
Token ID318
Feature activation+6.94e-02

Pos logit contributions
 never+0.049
 tried+0.049
 decided+0.048
 didn+0.048
 managed+0.048
Neg logit contributions
igsaw-0.053
umm-0.045
 bou-0.044
ummy-0.044
 types-0.043
Token a
Token ID257
Feature activation-8.54e-02

Pos logit contributions
 decided+0.019
 tried+0.019
 managed+0.018
 never+0.018
 remembered+0.018
Neg logit contributions
igsaw-0.031
ummy-0.027
umm-0.027
opl-0.027
 types-0.026
Token noise
Token ID7838
Feature activation+1.46e-03

Pos logit contributions
igsaw+0.147
ummy+0.126
 types+0.120
 bou+0.119
 difference+0.119
Neg logit contributions
 managed-0.171
 decided-0.164
 tried-0.164
 remembered-0.163
 stumbled-0.160
Token and
Token ID290
Feature activation+1.69e-02

Pos logit contributions
 managed+0.003
 remembered+0.003
 accidentally+0.002
'd+0.002
 never+0.002
Neg logit contributions
igsaw-0.003
ummy-0.002
 new-0.002
 types-0.002
opl-0.002
Token saw
Token ID2497
Feature activation-2.07e-02

Pos logit contributions
 managed+0.017
'd+0.017
pped+0.016
 unable+0.016
 wouldn+0.015
Neg logit contributions
 new-0.035
igsaw-0.035
 types-0.035
 Grand-0.034
 amaz-0.034
Token a
Token ID257
Feature activation-9.68e-02

Pos logit contributions
igsaw+0.047
ummy+0.041
umm+0.040
opl+0.040
 types+0.040
Neg logit contributions
 tried-0.032
 decided-0.031
 managed-0.031
 remembered-0.030
 never-0.030
Token wild
Token ID4295
Feature activation-1.15e-01

Pos logit contributions
igsaw+0.157
ummy+0.132
 types+0.125
umm+0.122
 bou+0.121
Neg logit contributions
 managed-0.204
 tried-0.200
 decided-0.200
 remembered-0.195
 never-0.193
Token animal
Token ID5044
Feature activation-2.42e-03

Pos logit contributions
igsaw+0.221
ummy+0.191
 types+0.186
umm+0.181
opl+0.180
Neg logit contributions
 tried-0.208
 managed-0.208
 decided-0.207
 remembered-0.203
 never-0.198
Token coming
Token ID2406
Feature activation-2.14e-02

Pos logit contributions
igsaw+0.004
ummy+0.004
 new+0.004
 difference+0.004
 bou+0.004
Neg logit contributions
 managed-0.004
 remembered-0.004
 tried-0.004
'd-0.004
 wouldn-0.004
Token her
Token ID607
Feature activation-4.46e-02

Pos logit contributions
igsaw+0.040
ummy+0.035
 types+0.033
 bou+0.032
ay+0.032
Neg logit contributions
 managed-0.040
 remembered-0.039
 tried-0.039
 decided-0.038
 accidentally-0.038
Token way
Token ID835
Feature activation-1.81e-02

Pos logit contributions
igsaw+0.100
 types+0.090
Whe+0.086
 batch+0.086
umm+0.085
Neg logit contributions
 never-0.067
 decided-0.067
 tried-0.062
 dropped-0.061
 remembered-0.061
Token.
Token ID13
Feature activation+5.41e-03

Pos logit contributions
igsaw+0.034
ummy+0.032
 new+0.029
ay+0.028
opl+0.028
Neg logit contributions
 managed-0.032
 remembered-0.032
 tried-0.031
 decided-0.030
 accidentally-0.030
Token young
Token ID1862
Feature activation-9.93e-02

Pos logit contributions
igsaw+0.124
ummy+0.105
 types+0.101
umm+0.098
opl+0.097
Neg logit contributions
 managed-0.160
 tried-0.160
 decided-0.159
 remembered-0.156
 never-0.153
Token kids
Token ID3988
Feature activation+4.94e-02

Pos logit contributions
igsaw+0.184
ummy+0.155
opl+0.152
umm+0.148
ay+0.142
Neg logit contributions
 decided-0.194
 tried-0.194
 managed-0.190
 remembered-0.189
 never-0.181
Token playing
Token ID2712
Feature activation-2.61e-02

Pos logit contributions
 managed+0.099
 remembered+0.095
 wouldn+0.092
 never+0.091
'd+0.091
Neg logit contributions
igsaw-0.070
ummy-0.066
 types-0.065
 gym-0.061
 new-0.060
Token nearby
Token ID6716
Feature activation-4.63e-02

Pos logit contributions
igsaw+0.058
ummy+0.052
umm+0.050
 types+0.050
opl+0.049
Neg logit contributions
 tried-0.040
 managed-0.039
 decided-0.039
 remembered-0.038
 never-0.037
Token.
Token ID13
Feature activation+9.53e-03

Pos logit contributions
igsaw+0.111
ummy+0.099
umm+0.096
 types+0.096
opl+0.094
Neg logit contributions
 tried-0.062
 decided-0.062
 managed-0.061
 remembered-0.059
 never-0.058
Token The
Token ID383
Feature activation-1.20e-01

Pos logit contributions
 tried+0.015
 decided+0.015
 could+0.014
 accidentally+0.014
 managed+0.014
Neg logit contributions
igsaw-0.021
opl-0.019
ummy-0.018
aj-0.018
umm-0.018
Token pirate
Token ID25868
Feature activation+4.82e-05

Pos logit contributions
igsaw+0.243
opl+0.213
ummy+0.209
umm+0.205
 batch+0.197
Neg logit contributions
 decided-0.226
 tried-0.226
 remembered-0.214
 dropped-0.207
 had-0.207
Token decided
Token ID3066
Feature activation+5.41e-02

Pos logit contributions
pped+0.000
 managed+0.000
 unable+0.000
 tried+0.000
 never+0.000
Neg logit contributions
igsaw-0.000
 types-0.000
 new-0.000
ummy-0.000
 batch-0.000
Token to
Token ID284
Feature activation+4.61e-02

Pos logit contributions
 managed+0.104
pped+0.103
'd+0.098
 remembered+0.098
 unable+0.096
Neg logit contributions
 new-0.081
igsaw-0.073
 art-0.067
 types-0.067
ummy-0.066
Token steal
Token ID8711
Feature activation-3.29e-02

Pos logit contributions
 managed+0.076
 remembered+0.074
 decided+0.073
 tried+0.073
 stumbled+0.071
Neg logit contributions
igsaw-0.095
ummy-0.082
 types-0.082
 difference-0.077
opl-0.077
Token their
Token ID511
Feature activation-6.09e-02

Pos logit contributions
igsaw+0.080
umm+0.071
ummy+0.070
aj+0.070
opl+0.069
Neg logit contributions
 decided-0.048
 tried-0.047
 never-0.043
 managed-0.043
 remembered-0.042
Token I
Token ID314
Feature activation+1.19e-01

Pos logit contributions
igsaw+0.027
ummy+0.023
 types+0.023
umm+0.022
opl+0.022
Neg logit contributions
 tried-0.022
 decided-0.022
 managed-0.021
 remembered-0.021
 never-0.021
Token won
Token ID1839
Feature activation+3.68e-02

Pos logit contributions
 managed+0.160
 unable+0.159
 decided+0.151
 remembered+0.150
pped+0.149
Neg logit contributions
igsaw-0.243
 types-0.224
 difference-0.220
aj-0.216
 batch-0.215
Token't
Token ID470
Feature activation+8.47e-02

Pos logit contributions
 tried+0.031
 managed+0.031
 decided+0.031
 remembered+0.029
 never+0.028
Neg logit contributions
igsaw-0.106
ummy-0.096
 types-0.095
umm-0.094
opl-0.092
Token let
Token ID1309
Feature activation+1.60e-02

Pos logit contributions
 managed+0.121
 remembered+0.116
 unable+0.115
 decided+0.112
 stumbled+0.112
Neg logit contributions
igsaw-0.179
ummy-0.165
 difference-0.162
opl-0.157
 types-0.155
Token anything
Token ID1997
Feature activation+2.06e-02

Pos logit contributions
 tried+0.024
 managed+0.024
 decided+0.024
 remembered+0.023
 never+0.023
Neg logit contributions
igsaw-0.036
ummy-0.032
 types-0.031
umm-0.031
opl-0.030
Token bad
Token ID2089
Feature activation-6.31e-02

Pos logit contributions
 tried+0.032
 decided+0.032
 managed+0.032
 never+0.031
 remembered+0.031
Neg logit contributions
igsaw-0.046
ummy-0.040
 types-0.039
umm-0.039
opl-0.038
Token happen
Token ID1645
Feature activation-5.44e-02

Pos logit contributions
igsaw+0.122
ummy+0.107
 types+0.103
umm+0.101
opl+0.100
Neg logit contributions
 managed-0.112
 tried-0.111
 decided-0.110
 remembered-0.109
 never-0.106
Token to
Token ID284
Feature activation+2.61e-02

Pos logit contributions
igsaw+0.104
ummy+0.093
 types+0.088
umm+0.086
opl+0.085
Neg logit contributions
 managed-0.097
 tried-0.097
 decided-0.095
 remembered-0.094
 stumbled-0.090
Token you
Token ID345
Feature activation+8.04e-02

Pos logit contributions
 tried+0.041
 decided+0.040
 managed+0.040
 never+0.040
 remembered+0.039
Neg logit contributions
igsaw-0.057
ummy-0.049
 types-0.049
umm-0.048
opl-0.047
Token."
Token ID526
Feature activation-3.53e-02

Pos logit contributions
 tried+0.131
 managed+0.131
 decided+0.131
 remembered+0.127
 never+0.125
Neg logit contributions
igsaw-0.169
ummy-0.147
 types-0.144
umm-0.142
opl-0.140
Token So
Token ID1406
Feature activation+2.02e-02

Pos logit contributions
igsaw+0.066
ummy+0.056
 types+0.056
umm+0.055
opl+0.054
Neg logit contributions
 tried-0.067
 decided-0.066
 managed-0.065
 remembered-0.064
 never-0.063
Token big
Token ID1263
Feature activation-1.47e-01

Pos logit contributions
igsaw+0.142
ummy+0.117
 types+0.112
umm+0.108
opl+0.107
Neg logit contributions
 managed-0.223
 tried-0.222
 decided-0.221
 remembered-0.218
 never-0.214
Token,
Token ID11
Feature activation-2.16e-02

Pos logit contributions
igsaw+0.256
 types+0.215
ummy+0.214
umm+0.209
opl+0.206
Neg logit contributions
 tried-0.289
 decided-0.280
 remembered-0.279
 managed-0.279
 accidentally-0.273
Token dark
Token ID3223
Feature activation-1.05e-01

Pos logit contributions
igsaw+0.036
ummy+0.031
 types+0.030
umm+0.029
opl+0.029
Neg logit contributions
 tried-0.044
 decided-0.044
 managed-0.044
 remembered-0.043
 never-0.042
Token cloud
Token ID6279
Feature activation-1.28e-02

Pos logit contributions
igsaw+0.169
ummy+0.143
 types+0.138
umm+0.133
opl+0.132
Neg logit contributions
 managed-0.221
 tried-0.221
 decided-0.219
 remembered-0.216
 never-0.212
Token in
Token ID287
Feature activation-4.63e-02

Pos logit contributions
igsaw+0.025
ummy+0.024
ay+0.022
 batch+0.022
opl+0.022
Neg logit contributions
 managed-0.022
 remembered-0.021
'd-0.019
 decided-0.019
 wouldn-0.019
Token the
Token ID262
Feature activation-1.63e-01

Pos logit contributions
igsaw+0.103
aj+0.092
ummy+0.091
umm+0.090
 bou+0.087
Neg logit contributions
 tried-0.072
 decided-0.072
 never-0.071
 managed-0.066
 remembered-0.065
Token sky
Token ID6766
Feature activation+1.64e-02

Pos logit contributions
igsaw+0.286
 types+0.228
ummy+0.221
aj+0.220
 batch+0.218
Neg logit contributions
 decided-0.354
 tried-0.349
 never-0.349
 stumbled-0.338
 got-0.333
Token.
Token ID13
Feature activation-9.97e-03

Pos logit contributions
 managed+0.025
 tried+0.024
 decided+0.024
 remembered+0.024
 never+0.023
Neg logit contributions
igsaw-0.036
ummy-0.033
 types-0.032
opl-0.031
umm-0.031
Token Tim
Token ID5045
Feature activation+6.00e-02

Pos logit contributions
igsaw+0.021
ummy+0.018
 types+0.018
umm+0.018
 bou+0.018
Neg logit contributions
 tried-0.017
 decided-0.016
 managed-0.016
 didn-0.016
 could-0.016
Token felt
Token ID2936
Feature activation+4.42e-02

Pos logit contributions
pped+0.064
 unable+0.063
 managed+0.062
'd+0.053
 wouldn+0.052
Neg logit contributions
 types-0.129
 new-0.119
aj-0.116
 seats-0.115
ummy-0.113
Token upset
Token ID9247
Feature activation-3.16e-02

Pos logit contributions
 managed+0.075
'd+0.072
 remembered+0.069
 accidentally+0.067
 tried+0.067
Neg logit contributions
 new-0.080
igsaw-0.079
 biggest-0.075
 types-0.074
ummy-0.074
Token family
Token ID1641
Feature activation+3.08e-02

Pos logit contributions
igsaw+0.044
ummy+0.037
 types+0.036
umm+0.036
opl+0.035
Neg logit contributions
 managed-0.051
 decided-0.051
 tried-0.051
 remembered-0.050
 never-0.050
Token about
Token ID546
Feature activation+3.95e-03

Pos logit contributions
 decided+0.060
 tried+0.059
 managed+0.059
 remembered+0.058
 never+0.058
Neg logit contributions
igsaw-0.055
ummy-0.047
umm-0.047
 types-0.045
opl-0.045
Token her
Token ID607
Feature activation-4.49e-02

Pos logit contributions
 managed+0.007
 tried+0.007
 decided+0.007
 remembered+0.007
 stumbled+0.007
Neg logit contributions
igsaw-0.007
ummy-0.006
 types-0.006
opl-0.006
 batch-0.006
Token adventure
Token ID8855
Feature activation-7.52e-02

Pos logit contributions
igsaw+0.069
umm+0.053
ummy+0.053
opl+0.051
ay+0.051
Neg logit contributions
 tried-0.102
 decided-0.101
 remembered-0.101
 never-0.100
 didn-0.099
Token at
Token ID379
Feature activation-7.45e-02

Pos logit contributions
igsaw+0.166
ummy+0.148
 types+0.143
opl+0.139
umm+0.139
Neg logit contributions
 managed-0.113
 tried-0.112
 remembered-0.111
 decided-0.110
 accidentally-0.105
Token the
Token ID262
Feature activation-1.65e-01

Pos logit contributions
igsaw+0.152
umm+0.136
aj+0.135
ummy+0.132
 types+0.127
Neg logit contributions
 never-0.129
 tried-0.129
 decided-0.124
 managed-0.122
 remembered-0.121
Token lovely
Token ID14081
Feature activation-1.20e-01

Pos logit contributions
igsaw+0.290
 types+0.231
ummy+0.229
 batch+0.220
aj+0.220
Neg logit contributions
 never-0.346
 decided-0.341
 tried-0.339
 remembered-0.331
 managed-0.326
Token cabin
Token ID9351
Feature activation+8.78e-03

Pos logit contributions
igsaw+0.209
opl+0.165
 coll+0.161
ummy+0.161
 types+0.160
Neg logit contributions
 tried-0.246
 decided-0.244
 remembered-0.235
 never-0.234
 came-0.232
Token.
Token ID13
Feature activation-1.71e-02

Pos logit contributions
 managed+0.013
 tried+0.013
 decided+0.013
 remembered+0.013
 accidentally+0.012
Neg logit contributions
igsaw-0.020
ummy-0.017
 types-0.017
umm-0.016
opl-0.016
Token<|endoftext|>
Token ID50256
Feature activation-1.09e-01

Pos logit contributions
igsaw+0.041
umm+0.038
 types+0.037
opl+0.036
 styles+0.035
Neg logit contributions
 could-0.024
 decided-0.023
 never-0.023
 tried-0.023
 had-0.022
TokenOnce
Token ID7454
Feature activation-2.35e-02

Pos logit contributions
igsaw+0.230
umm+0.200
ummy+0.198
 types+0.198
opl+0.190
Neg logit contributions
 tried-0.181
 decided-0.176
 managed-0.175
 never-0.174
 stumbled-0.169
Token taking
Token ID2263
Feature activation-2.32e-02

Pos logit contributions
 managed+0.018
 decided+0.017
 remembered+0.017
 tried+0.017
 stumbled+0.016
Neg logit contributions
igsaw-0.026
 types-0.023
ummy-0.023
opl-0.023
 bou-0.022
Token him
Token ID683
Feature activation+9.11e-02

Pos logit contributions
igsaw+0.041
ummy+0.035
 types+0.034
umm+0.033
opl+0.033
Neg logit contributions
 tried-0.046
 managed-0.046
 decided-0.045
 remembered-0.044
 never-0.044
Token on
Token ID319
Feature activation-2.21e-02

Pos logit contributions
 tried+0.124
 managed+0.121
 decided+0.120
 never+0.117
 remembered+0.115
Neg logit contributions
igsaw-0.220
umm-0.201
ummy-0.201
 types-0.193
 batch-0.188
Token adventures
Token ID17545
Feature activation-2.06e-02

Pos logit contributions
igsaw+0.049
umm+0.045
ummy+0.045
 types+0.043
opl+0.042
Neg logit contributions
 never-0.034
 tried-0.033
 decided-0.032
 managed-0.032
 accidentally-0.031
Token around
Token ID1088
Feature activation+3.60e-03

Pos logit contributions
igsaw+0.050
ummy+0.044
 types+0.043
umm+0.043
opl+0.042
Neg logit contributions
 tried-0.027
 decided-0.027
 managed-0.027
 never-0.026
 remembered-0.026
Token the
Token ID262
Feature activation-1.65e-01

Pos logit contributions
 tried+0.005
 managed+0.005
 decided+0.005
 remembered+0.005
 never+0.005
Neg logit contributions
igsaw-0.008
ummy-0.007
 types-0.007
umm-0.007
opl-0.007
Token garden
Token ID11376
Feature activation+2.38e-02

Pos logit contributions
igsaw+0.293
umm+0.260
ummy+0.252
 types+0.251
 batch+0.232
Neg logit contributions
 never-0.328
 decided-0.323
 tried-0.321
 had-0.313
 stumbled-0.306
Token.
Token ID13
Feature activation-5.92e-03

Pos logit contributions
 decided+0.031
 tried+0.030
 managed+0.030
 never+0.029
 remembered+0.029
Neg logit contributions
igsaw-0.060
ummy-0.052
umm-0.051
 types-0.050
opl-0.050
Token They
Token ID1119
Feature activation+1.52e-01

Pos logit contributions
igsaw+0.013
 bou+0.012
ummy+0.012
 types+0.011
umm+0.011
Neg logit contributions
 tried-0.009
 never-0.009
 could-0.009
 managed-0.009
 accidentally-0.009
Token would
Token ID561
Feature activation+1.31e-01

Pos logit contributions
 unable+0.153
 managed+0.152
 accidentally+0.144
 tried+0.143
Please+0.140
Neg logit contributions
igsaw-0.351
 types-0.338
 bou-0.331
 parties-0.324
 seats-0.319
Token explore
Token ID7301
Feature activation-3.12e-03

Pos logit contributions
 managed+0.212
 decided+0.207
 tried+0.206
 remembered+0.206
 accidentally+0.197
Neg logit contributions
igsaw-0.266
ummy-0.230
 types-0.229
opl-0.228
 bou-0.226
Token He
Token ID679
Feature activation+1.49e-01

Pos logit contributions
igsaw+0.029
 types+0.025
 batch+0.025
ummy+0.025
 bou+0.024
Neg logit contributions
 decided-0.022
 tried-0.022
 never-0.021
 could-0.021
 managed-0.021
Token saw
Token ID2497
Feature activation-1.62e-02

Pos logit contributions
 managed+0.220
 unable+0.214
 accidentally+0.211
 decided+0.207
 tried+0.206
Neg logit contributions
igsaw-0.285
 types-0.280
ummy-0.263
opl-0.257
aj-0.255
Token a
Token ID257
Feature activation-1.01e-01

Pos logit contributions
igsaw+0.042
umm+0.034
ummy+0.033
opl+0.032
aj+0.032
Neg logit contributions
 decided-0.025
 tried-0.024
 never-0.023
 could-0.023
 remembered-0.022
Token fairy
Token ID25607
Feature activation+1.85e-02

Pos logit contributions
igsaw+0.166
ummy+0.141
 types+0.136
umm+0.131
opl+0.131
Neg logit contributions
 managed-0.209
 tried-0.209
 decided-0.207
 remembered-0.204
 never-0.200
Token in
Token ID287
Feature activation-4.37e-02

Pos logit contributions
 managed+0.028
 tried+0.027
 decided+0.026
 remembered+0.026
 stumbled+0.025
Neg logit contributions
igsaw-0.041
ummy-0.036
opl-0.035
 types-0.035
 bou-0.035
Token the
Token ID262
Feature activation-1.67e-01

Pos logit contributions
igsaw+0.095
umm+0.086
ummy+0.086
aj+0.083
 types+0.082
Neg logit contributions
 tried-0.068
 never-0.067
 decided-0.066
 managed-0.063
 remembered-0.063
Token woods
Token ID16479
Feature activation+5.61e-03

Pos logit contributions
igsaw+0.276
ummy+0.227
 types+0.219
umm+0.208
aj+0.204
Neg logit contributions
 decided-0.363
 tried-0.359
 remembered-0.358
 never-0.357
 managed-0.351
Token.
Token ID13
Feature activation-1.57e-02

Pos logit contributions
 tried+0.007
 decided+0.007
 managed+0.007
 never+0.007
 remembered+0.007
Neg logit contributions
igsaw-0.014
ummy-0.012
umm-0.012
 types-0.012
opl-0.011
Token The
Token ID383
Feature activation-1.31e-01

Pos logit contributions
igsaw+0.037
ummy+0.032
 types+0.032
 batch+0.031
umm+0.031
Neg logit contributions
 decided-0.023
 tried-0.022
 could-0.021
 remembered-0.021
 managed-0.021
Token fairy
Token ID25607
Feature activation+3.20e-02

Pos logit contributions
igsaw+0.266
ummy+0.244
 types+0.238
 coll+0.234
 bou+0.230
Neg logit contributions
 managed-0.219
 tried-0.210
 decided-0.204
 remembered-0.201
 accidentally-0.199
Token was
Token ID373
Feature activation+6.14e-02

Pos logit contributions
 managed+0.035
 unable+0.032
 decided+0.032
 tried+0.031
 accidentally+0.031
Neg logit contributions
igsaw-0.072
 types-0.070
ummy-0.068
 bou-0.068
 new-0.067
Token planted
Token ID18969
Feature activation-6.74e-02

Pos logit contributions
 managed+0.039
 tried+0.038
 never+0.036
 decided+0.036
 remembered+0.035
Neg logit contributions
igsaw-0.160
 types-0.148
ummy-0.147
 batch-0.142
 bou-0.140
Token some
Token ID617
Feature activation-8.78e-02

Pos logit contributions
igsaw+0.145
ummy+0.121
opl+0.118
umm+0.117
aj+0.116
Neg logit contributions
 decided-0.116
 tried-0.116
 managed-0.114
 never-0.109
 remembered-0.109
Token beautiful
Token ID4950
Feature activation-1.12e-01

Pos logit contributions
igsaw+0.170
opl+0.139
ummy+0.135
umm+0.130
aj+0.130
Neg logit contributions
 decided-0.173
 tried-0.171
 managed-0.166
 remembered-0.162
 could-0.161
Token flowers
Token ID12734
Feature activation+1.32e-03

Pos logit contributions
igsaw+0.180
ummy+0.149
 types+0.141
opl+0.141
umm+0.140
Neg logit contributions
 tried-0.242
 decided-0.241
 managed-0.240
 remembered-0.235
 never-0.229
Token around
Token ID1088
Feature activation-1.47e-03

Pos logit contributions
 managed+0.002
 remembered+0.002
 tried+0.002
 decided+0.002
 accidentally+0.002
Neg logit contributions
igsaw-0.003
ummy-0.003
 types-0.003
 bou-0.003
umm-0.002
Token the
Token ID262
Feature activation-1.65e-01

Pos logit contributions
igsaw+0.002
ummy+0.002
 types+0.002
 new+0.002
 art+0.002
Neg logit contributions
 managed-0.003
 remembered-0.003
 decided-0.003
 tried-0.003
'd-0.003
Token edges
Token ID13015
Feature activation+7.09e-05

Pos logit contributions
igsaw+0.222
ummy+0.187
 types+0.177
umm+0.170
 difference+0.165
Neg logit contributions
 managed-0.393
 tried-0.385
 decided-0.381
 remembered-0.381
 never-0.372
Token to
Token ID284
Feature activation+1.70e-02

Pos logit contributions
 managed+0.000
 tried+0.000
 decided+0.000
 remembered+0.000
 never+0.000
Neg logit contributions
igsaw-0.000
ummy-0.000
 types-0.000
umm-0.000
opl-0.000
Token make
Token ID787
Feature activation-2.12e-02

Pos logit contributions
 managed+0.027
 remembered+0.026
 decided+0.026
 stumbled+0.026
 tried+0.026
Neg logit contributions
igsaw-0.035
 bou-0.031
 types-0.031
ummy-0.031
 art-0.030
Token it
Token ID340
Feature activation+7.79e-02

Pos logit contributions
igsaw+0.051
ummy+0.044
umm+0.044
opl+0.044
aj+0.043
Neg logit contributions
 decided-0.030
 tried-0.030
 managed-0.029
 never-0.028
 didn-0.027
Token look
Token ID804
Feature activation+6.29e-02

Pos logit contributions
 decided+0.129
 managed+0.129
 tried+0.128
 remembered+0.123
 never+0.123
Neg logit contributions
igsaw-0.165
ummy-0.144
opl-0.140
umm-0.139
 types-0.137