TOP ACTIVATIONS
MAX = 12.964

grandma 's house for safe keeping . When
it to me for safe keeping . He told me to
." " Who would do that ?" Lily asks
s what fire places can do ! They can
asked Mike : " What would you like best in the
Tim my knew that fire places were not dull or boring
his friends , " Who can help me make something ?"
" Where can I find a hook ?"
and continued , " It 'll be fun . But first
." " Who can help them ?" Ben says
. " Where 'd you find that crown ?
the children , " What would you like to eat ?"
to the students " Who would like to bring an object
and said , " What would you like to eat ,
each other , " Who can fix the moon ?" They
Tim asked , " Where can I find the key ?"
Lily . " But how can we get it ? It
" But it would be fun !" Tom said
. " Where can we drink , Lia ?"
little boy , " It would be very hard for you

MIN ACTIVATIONS
MIN = -18.060

to take a journey to outer space . He hopped into
she got rid of the intr uders . She did not
away , a bright blue spe ck in the sky .
highest branches and grab the ju ic iest fruit . One
garden , and picked the ju ic iest carrots he could
farmer who lived on a lush farm with many animals and
branch , looking for the ju ic iest ones . Finally
play without going near the sm elly leaves . Tim and
She takes the v ase from them and puts
was running around on the lush grass , enjoying the sunshine
what the rocket discovered in outer space . It was a
Alice saw Lisa with the y ummy cand ies , she
the garden to pick the ju ic iest and sweet est
" I can wear the blue dress under the pink dress
had not fought over the blue t eddy bear . They
. It ran through a lush green me adow , with
owner always gave him the ju ic iest sa us ages
friend . They shared the y ummy paper together and had
a while , enjoying the y ummy treats .
it up and enjoy the y ummy taste . <|endoftext|> One

INTERVAL 5.208 - 12.964
CONTAINS 12.284%

Once upon a time , there was a happy little pengu
big and blue ! Can we go near it ?"
And every time she found it , she 'd wave and
in the garden . But she couldn 't find the black
saw a blue potato . She thought it was very pretty

INTERVAL -2.548 - 5.208
CONTAINS 63.184%

. Lily 's cube blinked too , which made her laugh
dish that she had just bought . It was shiny and
But when she got closer , she noticed the w affle
!" Tom said . " Do you want to be our
just his friend , Tim my , playing with some coc

INTERVAL -10.304 - -2.548
CONTAINS 19.156%

come relax with you ?" G iz mo answered
oph ie was a very successful swim mer . One day
't . You hid in a bad place . That is
all . It was actually kind of cute . From then
touched the jelly fish . O uch ! The

INTERVAL -18.060 - -10.304
CONTAINS 5.376%

made them last longer . The moral of the story is
saw a shiny penny on the ground and picked it up
crawling on the sand . The crab had a shiny shell
again . The end . <|endoftext|> Once upon
gathered her courage and asked the dolphin , " Are you
Token grandma
Token ID49890
Feature activation+1.03e+00

Pos logit contributions
igsaw+0.185
aj+0.156
Whe+0.153
udd+0.149
ushes+0.148
Neg logit contributions
 decided-0.181
 tried-0.162
 came-0.156
 wanted-0.153
 could-0.148
Token's
Token ID338
Feature activation-6.45e+00

Pos logit contributions
 decided+0.022
 tried+0.019
 came+0.019
 stumbled+0.019
pped+0.019
Neg logit contributions
igsaw-0.033
aj-0.031
 difference-0.030
 coll-0.029
 former-0.028
Token house
Token ID2156
Feature activation+5.00e+00

Pos logit contributions
igsaw+0.163
aj+0.138
Whe+0.128
udd+0.123
ushes+0.122
Neg logit contributions
 decided-0.197
 tried-0.176
 came-0.176
 wanted-0.170
 could-0.164
Token for
Token ID329
Feature activation+8.25e-02

Pos logit contributions
 decided+0.093
 tried+0.079
 came+0.079
pped+0.074
 stumbled+0.074
Neg logit contributions
igsaw-0.175
aj-0.167
 difference-0.158
 coll-0.155
 former-0.153
Token safe
Token ID3338
Feature activation-1.32e+00

Pos logit contributions
 decided+0.002
 could+0.002
 came+0.002
 stumbled+0.002
 tried+0.002
Neg logit contributions
igsaw-0.002
aj-0.002
 coll-0.002
 former-0.002
 new-0.002
Tokenkeeping
Token ID19934
Feature activation+1.30e+01

Pos logit contributions
igsaw+0.042
aj+0.041
 coll+0.038
 difference+0.038
 former+0.037
Neg logit contributions
 decided-0.027
pped-0.023
 tried-0.023
 came-0.023
 stumbled-0.022
Token.
Token ID13
Feature activation+3.77e-01

Pos logit contributions
 decided+0.297
Please+0.270
pped+0.261
 tried+0.258
 came+0.247
Neg logit contributions
aj-0.355
igsaw-0.348
 coll-0.318
 difference-0.316
 fault-0.315
Token
Token ID220
Feature activation-1.42e+00

Pos logit contributions
 decided+0.009
 came+0.008
 tried+0.008
 could+0.007
 stumbled+0.007
Neg logit contributions
igsaw-0.012
aj-0.011
 difference-0.010
 coll-0.010
 goals-0.010
Token
Token ID198
Feature activation-4.51e+00

Pos logit contributions
igsaw+0.047
aj+0.043
 difference+0.042
 coll+0.041
 former+0.041
Neg logit contributions
 decided-0.030
 came-0.026
 tried-0.026
 could-0.025
 stumbled-0.024
Token
Token ID198
Feature activation-4.69e+00

Pos logit contributions
igsaw+0.140
aj+0.131
 difference+0.124
 coll+0.120
 former+0.119
Neg logit contributions
 decided-0.106
 came-0.093
 tried-0.093
 stumbled-0.087
 could-0.087
TokenWhen
Token ID2215
Feature activation+4.27e+00

Pos logit contributions
igsaw+0.126
aj+0.116
udd+0.107
 difference+0.107
 coll+0.106
Neg logit contributions
 decided-0.117
 came-0.112
 tried-0.109
 stumbled-0.105
 could-0.102
Token it
Token ID340
Feature activation+5.03e+00

Pos logit contributions
igsaw+0.031
aj+0.030
 difference+0.029
 coll+0.028
 former+0.028
Neg logit contributions
 decided-0.009
 came-0.007
 tried-0.007
 could-0.006
 stumbled-0.006
Token to
Token ID284
Feature activation+2.57e+00

Pos logit contributions
 decided+0.079
 came+0.064
 tried+0.062
 could+0.058
 got+0.056
Neg logit contributions
igsaw-0.202
aj-0.189
 difference-0.186
 former-0.180
udd-0.176
Token me
Token ID502
Feature activation+1.02e+00

Pos logit contributions
 decided+0.061
 came+0.053
 tried+0.053
 stumbled+0.049
 could+0.049
Neg logit contributions
igsaw-0.080
aj-0.075
 difference-0.071
 coll-0.068
 former-0.068
Token for
Token ID329
Feature activation-5.93e-01

Pos logit contributions
 decided+0.019
 came+0.016
 tried+0.016
 could+0.015
 stumbled+0.015
Neg logit contributions
igsaw-0.037
aj-0.034
 difference-0.033
 coll-0.032
 former-0.032
Token safe
Token ID3338
Feature activation-2.25e+00

Pos logit contributions
igsaw+0.017
aj+0.016
 difference+0.015
 coll+0.015
 former+0.015
Neg logit contributions
 decided-0.015
 came-0.013
 tried-0.012
 stumbled-0.012
 could-0.012
Tokenkeeping
Token ID19934
Feature activation+1.24e+01

Pos logit contributions
igsaw+0.075
aj+0.072
 difference+0.068
 coll+0.067
 goals+0.066
Neg logit contributions
 decided-0.045
 tried-0.039
 came-0.039
pped-0.038
 stumbled-0.037
Token.
Token ID13
Feature activation-8.93e-01

Pos logit contributions
 decided+0.282
pped+0.244
 tried+0.239
 stumbled+0.238
Please+0.236
Neg logit contributions
aj-0.364
igsaw-0.343
 difference-0.325
 coll-0.325
 fault-0.315
Token He
Token ID679
Feature activation+6.86e+00

Pos logit contributions
igsaw+0.030
aj+0.028
 difference+0.026
 coll+0.026
 former+0.025
Neg logit contributions
 decided-0.019
 came-0.017
 tried-0.016
 could-0.015
 stumbled-0.015
Token told
Token ID1297
Feature activation+9.18e-01

Pos logit contributions
 decided+0.147
 came+0.128
 tried+0.126
pped+0.124
 stumbled+0.123
Neg logit contributions
igsaw-0.214
aj-0.208
 difference-0.195
 coll-0.191
 goals-0.187
Token me
Token ID502
Feature activation+2.21e+00

Pos logit contributions
 decided+0.017
 came+0.015
 tried+0.014
 could+0.013
 stumbled+0.013
Neg logit contributions
igsaw-0.034
aj-0.032
 difference-0.030
udd-0.029
 former-0.029
Token to
Token ID284
Feature activation+3.07e+00

Pos logit contributions
 decided+0.046
 tried+0.040
 came+0.040
 stumbled+0.038
pped+0.038
Neg logit contributions
igsaw-0.072
aj-0.069
 difference-0.065
 coll-0.063
 goals-0.063
Token."
Token ID526
Feature activation-6.36e-01

Pos logit contributions
 decided+0.001
pped+0.001
 stumbled+0.001
 tried+0.001
 rushed+0.001
Neg logit contributions
igsaw-0.002
aj-0.002
 difference-0.002
 coll-0.002
 goals-0.002
Token
Token ID198
Feature activation-5.12e+00

Pos logit contributions
igsaw+0.020
aj+0.019
 difference+0.018
 coll+0.017
 former+0.017
Neg logit contributions
 decided-0.015
 came-0.013
 tried-0.013
 could-0.012
 stumbled-0.012
Token
Token ID198
Feature activation-5.13e+00

Pos logit contributions
igsaw+0.188
aj+0.170
 difference+0.165
 former+0.164
 coll+0.160
Neg logit contributions
 decided-0.095
 came-0.083
 tried-0.079
 could-0.079
 stumbled-0.072
Token"
Token ID1
Feature activation+9.95e-01

Pos logit contributions
igsaw+0.181
aj+0.170
 difference+0.163
 coll+0.157
 former+0.157
Neg logit contributions
 decided-0.104
 tried-0.086
 came-0.086
 stumbled-0.080
 could-0.080
TokenWho
Token ID8241
Feature activation+8.53e+00

Pos logit contributions
 decided+0.021
 came+0.019
 tried+0.019
pped+0.018
 stumbled+0.018
Neg logit contributions
igsaw-0.031
aj-0.029
 goals-0.028
 former-0.028
 coll-0.027
Token would
Token ID561
Feature activation+1.16e+01

Pos logit contributions
 decided+0.199
 came+0.174
 tried+0.173
 stumbled+0.164
pped+0.163
Neg logit contributions
igsaw-0.261
aj-0.247
 difference-0.232
 coll-0.225
 former-0.222
Token do
Token ID466
Feature activation+4.70e+00

Pos logit contributions
pped+0.275
 decided+0.271
ged+0.260
 stumbled+0.250
 came+0.246
Neg logit contributions
igsaw-0.308
 difference-0.295
 coll-0.267
 goals-0.266
 former-0.266
Token that
Token ID326
Feature activation+2.53e+00

Pos logit contributions
 decided+0.104
 came+0.087
 tried+0.087
 stumbled+0.086
pped+0.085
Neg logit contributions
igsaw-0.148
aj-0.139
 difference-0.134
 coll-0.130
 goals-0.126
Token?"
Token ID1701
Feature activation-2.79e-01

Pos logit contributions
 decided+0.063
 tried+0.056
 came+0.056
pped+0.054
 stumbled+0.054
Neg logit contributions
igsaw-0.072
aj-0.068
 difference-0.065
 coll-0.063
 former-0.061
Token Lily
Token ID20037
Feature activation+1.97e+00

Pos logit contributions
igsaw+0.008
aj+0.007
 difference+0.007
 coll+0.007
 former+0.007
Neg logit contributions
 decided-0.007
 came-0.007
 tried-0.007
 stumbled-0.006
 could-0.006
Token asks
Token ID7893
Feature activation+2.47e+00

Pos logit contributions
 decided+0.054
 came+0.049
pped+0.049
 tried+0.047
 stumbled+0.046
Neg logit contributions
igsaw-0.048
aj-0.048
 difference-0.044
 goals-0.044
 coll-0.042
Token
Token ID26391
Feature activation-5.36e+00

Pos logit contributions
igsaw+0.126
aj+0.119
 difference+0.110
 coll+0.107
 former+0.106
Neg logit contributions
 decided-0.143
 came-0.128
 tried-0.128
 stumbled-0.122
 could-0.121
Token
Token ID8151
Feature activation+6.60e+00

Pos logit contributions
igsaw+0.151
aj+0.141
 difference+0.135
 coll+0.130
 former+0.130
Neg logit contributions
 decided-0.139
 came-0.125
 tried-0.123
 stumbled-0.119
 could-0.119
Tokens
Token ID82
Feature activation-2.88e+00

Pos logit contributions
 decided+0.137
 came+0.120
 tried+0.117
 could+0.112
 stumbled+0.109
Neg logit contributions
igsaw-0.223
aj-0.211
 difference-0.203
 former-0.198
 coll-0.197
Token what
Token ID644
Feature activation+3.91e+00

Pos logit contributions
igsaw+0.092
aj+0.087
 difference+0.080
 coll+0.080
 former+0.077
Neg logit contributions
 decided-0.067
 came-0.058
 tried-0.058
 could-0.056
 stumbled-0.054
Token fire
Token ID2046
Feature activation-3.92e-01

Pos logit contributions
 decided+0.086
 came+0.075
 tried+0.074
 stumbled+0.070
 could+0.070
Neg logit contributions
igsaw-0.127
aj-0.120
 difference-0.114
 coll-0.110
 former-0.109
Tokenplaces
Token ID23625
Feature activation+1.16e+01

Pos logit contributions
igsaw+0.011
aj+0.010
 difference+0.009
 coll+0.009
udd+0.009
Neg logit contributions
 decided-0.010
 tried-0.009
 came-0.009
pped-0.009
 stumbled-0.009
Token can
Token ID460
Feature activation+1.05e+01

Pos logit contributions
 decided+0.254
pped+0.230
 stumbled+0.216
 came+0.212
ged+0.212
Neg logit contributions
igsaw-0.338
 difference-0.318
aj-0.316
udd-0.293
 goals-0.292
Token do
Token ID466
Feature activation+3.57e+00

Pos logit contributions
 decided+0.240
pped+0.214
 stumbled+0.207
ged+0.207
 tried+0.206
Neg logit contributions
igsaw-0.308
 difference-0.286
aj-0.268
 former-0.266
 biggest-0.261
Token!
Token ID0
Feature activation+1.29e+00

Pos logit contributions
 decided+0.077
 tried+0.066
 came+0.066
 stumbled+0.062
pped+0.062
Neg logit contributions
igsaw-0.116
aj-0.109
 difference-0.104
 coll-0.100
 goals-0.099
Token They
Token ID1119
Feature activation+8.39e+00

Pos logit contributions
 decided+0.031
 came+0.027
 tried+0.027
 could+0.026
 stumbled+0.026
Neg logit contributions
igsaw-0.039
aj-0.037
 difference-0.034
 coll-0.034
 former-0.033
Token can
Token ID460
Feature activation+1.00e+01

Pos logit contributions
 decided+0.118
pped+0.108
Please+0.102
 came+0.101
 stumbled+0.098
Neg logit contributions
igsaw-0.294
aj-0.282
 difference-0.275
 former-0.269
 goals-0.269
Token asked
Token ID1965
Feature activation+3.32e+00

Pos logit contributions
 decided+0.049
pped+0.047
 came+0.040
 tried+0.040
ged+0.040
Neg logit contributions
aj-0.111
igsaw-0.110
udd-0.103
 difference-0.102
 coll-0.101
Token Mike
Token ID4995
Feature activation+2.92e+00

Pos logit contributions
 decided+0.064
 came+0.057
 tried+0.053
 could+0.049
 him+0.048
Neg logit contributions
igsaw-0.125
aj-0.118
 former-0.110
 difference-0.109
 coll-0.107
Token:
Token ID25
Feature activation+2.73e+00

Pos logit contributions
 decided+0.060
 came+0.051
 tried+0.051
 could+0.047
 stumbled+0.047
Neg logit contributions
igsaw-0.100
aj-0.094
 difference-0.089
 coll-0.087
 former-0.086
Token "
Token ID366
Feature activation+5.16e-01

Pos logit contributions
 decided+0.066
 tried+0.058
 came+0.058
pped+0.054
 stumbled+0.054
Neg logit contributions
igsaw-0.082
aj-0.077
 difference-0.073
 coll-0.070
 former-0.069
TokenWhat
Token ID2061
Feature activation+6.76e+00

Pos logit contributions
 decided+0.009
 tried+0.008
 came+0.008
 dropped+0.008
 rushed+0.008
Neg logit contributions
igsaw-0.016
aj-0.016
 former-0.015
 difference-0.015
 coll-0.015
Token would
Token ID561
Feature activation+1.15e+01

Pos logit contributions
 decided+0.174
 came+0.155
 tried+0.153
 could+0.147
 stumbled+0.144
Neg logit contributions
igsaw-0.198
aj-0.186
 difference-0.172
 former-0.168
 coll-0.166
Token you
Token ID345
Feature activation+7.97e+00

Pos logit contributions
 decided+0.279
pped+0.254
 came+0.247
 tried+0.238
 stumbled+0.238
Neg logit contributions
igsaw-0.331
 difference-0.295
aj-0.290
 coll-0.279
 goals-0.278
Token like
Token ID588
Feature activation+1.84e+00

Pos logit contributions
 decided+0.189
 came+0.166
pped+0.166
 tried+0.163
 stumbled+0.162
Neg logit contributions
igsaw-0.230
 difference-0.214
aj-0.209
 coll-0.201
 former-0.200
Token best
Token ID1266
Feature activation-2.11e+00

Pos logit contributions
 decided+0.028
 came+0.024
 tried+0.023
 could+0.021
 it+0.020
Neg logit contributions
igsaw-0.073
aj-0.071
udd-0.065
 difference-0.064
ushes-0.064
Token in
Token ID287
Feature activation-8.29e-01

Pos logit contributions
igsaw+0.070
aj+0.066
 difference+0.061
 former+0.060
 coll+0.060
Neg logit contributions
 decided-0.046
 came-0.040
 tried-0.039
 could-0.037
 stumbled-0.036
Token the
Token ID262
Feature activation-1.14e+01

Pos logit contributions
igsaw+0.026
aj+0.026
udd+0.023
 difference+0.023
 former+0.023
Neg logit contributions
 decided-0.019
 tried-0.017
 came-0.017
 could-0.015
 stumbled-0.015
Token Tim
Token ID5045
Feature activation+1.90e+00

Pos logit contributions
 decided+0.012
 came+0.010
 tried+0.010
 could+0.009
 stumbled+0.009
Neg logit contributions
igsaw-0.021
aj-0.020
 difference-0.019
 coll-0.019
 former-0.019
Tokenmy
Token ID1820
Feature activation+5.82e+00

Pos logit contributions
 decided+0.047
 tried+0.041
 came+0.041
pped+0.040
 stumbled+0.040
Neg logit contributions
igsaw-0.051
aj-0.050
udd-0.047
 difference-0.047
 coll-0.045
Token knew
Token ID2993
Feature activation+3.73e+00

Pos logit contributions
 decided+0.090
pped+0.079
Please+0.077
 tried+0.074
ged+0.074
Neg logit contributions
aj-0.191
udd-0.188
igsaw-0.186
 difference-0.182
 coll-0.178
Token that
Token ID326
Feature activation+2.86e+00

Pos logit contributions
 decided+0.072
 tried+0.063
 came+0.062
pped+0.060
 stumbled+0.059
Neg logit contributions
igsaw-0.123
aj-0.119
 difference-0.113
 former-0.110
 coll-0.109
Token fire
Token ID2046
Feature activation-8.32e-01

Pos logit contributions
 decided+0.049
 tried+0.040
 came+0.040
pped+0.039
 stumbled+0.037
Neg logit contributions
igsaw-0.103
aj-0.098
 difference-0.096
 coll-0.092
 former-0.092
Tokenplaces
Token ID23625
Feature activation+1.14e+01

Pos logit contributions
igsaw+0.027
aj+0.026
 difference+0.025
udd+0.024
 biggest+0.023
Neg logit contributions
 decided-0.016
pped-0.014
 tried-0.013
 stumbled-0.013
ged-0.013
Token were
Token ID547
Feature activation+3.35e+00

Pos logit contributions
 decided+0.212
pped+0.203
Please+0.184
ged+0.184
 stumbled+0.181
Neg logit contributions
 difference-0.334
igsaw-0.321
 new-0.308
 goals-0.307
udd-0.307
Token not
Token ID407
Feature activation+5.65e+00

Pos logit contributions
 decided+0.083
pped+0.079
ged+0.079
 stumbled+0.074
 tried+0.074
Neg logit contributions
 difference-0.082
 new-0.081
igsaw-0.078
 goals-0.076
aj-0.075
Token dull
Token ID19222
Feature activation-6.60e+00

Pos logit contributions
 decided+0.145
pped+0.136
 stumbled+0.130
ged+0.130
 tried+0.128
Neg logit contributions
igsaw-0.139
 difference-0.138
aj-0.132
 new-0.131
 coll-0.127
Token or
Token ID393
Feature activation+3.45e-01

Pos logit contributions
igsaw+0.216
aj+0.202
 difference+0.200
 goals+0.191
 coll+0.189
Neg logit contributions
 decided-0.135
 tried-0.116
pped-0.113
 came-0.113
 stumbled-0.112
Token boring
Token ID14262
Feature activation-5.35e+00

Pos logit contributions
 decided+0.009
pped+0.008
 came+0.008
 stumbled+0.007
 tried+0.007
Neg logit contributions
igsaw-0.009
aj-0.009
 difference-0.008
 goals-0.008
 former-0.008
Token his
Token ID465
Feature activation-5.34e+00

Pos logit contributions
 decided+0.055
 came+0.048
 tried+0.047
 could+0.043
 stumbled+0.042
Neg logit contributions
igsaw-0.108
aj-0.103
 difference-0.097
 former-0.094
 coll-0.094
Token friends
Token ID2460
Feature activation+3.76e+00

Pos logit contributions
igsaw+0.155
aj+0.139
udd+0.131
opl+0.130
Whe+0.127
Neg logit contributions
 decided-0.148
 came-0.136
 tried-0.130
 wanted-0.124
 could-0.122
Token,
Token ID11
Feature activation+8.21e-01

Pos logit contributions
 decided+0.082
 tried+0.071
 came+0.071
pped+0.068
 stumbled+0.068
Neg logit contributions
igsaw-0.120
aj-0.112
 difference-0.108
 coll-0.104
 goals-0.103
Token "
Token ID366
Feature activation-3.19e-01

Pos logit contributions
 decided+0.021
pped+0.019
 tried+0.019
 came+0.019
 stumbled+0.018
Neg logit contributions
igsaw-0.022
aj-0.021
 difference-0.020
 coll-0.019
 goals-0.018
TokenWho
Token ID8241
Feature activation+8.43e+00

Pos logit contributions
igsaw+0.009
 difference+0.008
 coll+0.008
 batch+0.008
aj+0.008
Neg logit contributions
 dropped-0.006
 decided-0.006
 tried-0.006
 stumbled-0.005
 rushed-0.005
Token can
Token ID460
Feature activation+1.14e+01

Pos logit contributions
 decided+0.199
 came+0.174
 tried+0.174
 stumbled+0.164
pped+0.163
Neg logit contributions
igsaw-0.256
aj-0.241
 difference-0.228
 coll-0.220
 former-0.217
Token help
Token ID1037
Feature activation+2.45e+00

Pos logit contributions
 decided+0.260
ged+0.241
pped+0.236
 stumbled+0.231
 came+0.224
Neg logit contributions
igsaw-0.331
 difference-0.285
 biggest-0.278
 art-0.276
 puzzle-0.275
Token me
Token ID502
Feature activation+2.41e+00

Pos logit contributions
 decided+0.061
 tried+0.052
 came+0.052
 stumbled+0.051
pped+0.050
Neg logit contributions
igsaw-0.071
aj-0.066
 difference-0.064
 coll-0.061
 goals-0.060
Token make
Token ID787
Feature activation+5.40e-01

Pos logit contributions
 decided+0.057
 came+0.050
 tried+0.049
 stumbled+0.049
pped+0.048
Neg logit contributions
igsaw-0.070
aj-0.066
 difference-0.065
 former-0.061
 coll-0.061
Token something
Token ID1223
Feature activation+2.89e-02

Pos logit contributions
 decided+0.010
 came+0.009
 tried+0.009
 could+0.008
 it+0.008
Neg logit contributions
igsaw-0.019
aj-0.019
 difference-0.017
 former-0.017
udd-0.017
Token?"
Token ID1701
Feature activation-6.49e-01

Pos logit contributions
 decided+0.001
 came+0.001
 tried+0.001
 could+0.001
 got+0.001
Neg logit contributions
igsaw-0.001
aj-0.001
 coll-0.001
 former-0.001
udd-0.001
Token
Token ID220
Feature activation-1.10e+00

Pos logit contributions
 decided+0.024
 came+0.021
 tried+0.021
 could+0.020
 stumbled+0.020
Neg logit contributions
igsaw-0.030
aj-0.028
 difference-0.026
 coll-0.025
 former-0.025
Token
Token ID198
Feature activation-4.06e+00

Pos logit contributions
igsaw+0.038
aj+0.035
 difference+0.034
 coll+0.034
 former+0.033
Neg logit contributions
 decided-0.022
 came-0.019
 could-0.019
 tried-0.019
 stumbled-0.018
Token
Token ID198
Feature activation-4.23e+00

Pos logit contributions
igsaw+0.121
aj+0.113
 difference+0.106
 coll+0.102
 former+0.101
Neg logit contributions
 decided-0.099
 came-0.088
 tried-0.087
 stumbled-0.082
pped-0.082
Token"
Token ID1
Feature activation+1.67e+00

Pos logit contributions
igsaw+0.119
aj+0.109
 difference+0.100
 former+0.099
udd+0.098
Neg logit contributions
 decided-0.105
 came-0.097
 tried-0.095
 could-0.090
pped-0.090
TokenWhere
Token ID8496
Feature activation+7.76e+00

Pos logit contributions
 decided+0.031
 came+0.030
 tried+0.028
 could+0.027
 him+0.027
Neg logit contributions
igsaw-0.049
aj-0.046
 former-0.045
Whe-0.044
 difference-0.043
Token can
Token ID460
Feature activation+1.14e+01

Pos logit contributions
 decided+0.131
 came+0.108
 tried+0.107
 could+0.098
 stumbled+0.098
Neg logit contributions
igsaw-0.294
aj-0.280
 difference-0.266
 coll-0.260
 former-0.259
Token I
Token ID314
Feature activation+9.66e+00

Pos logit contributions
 decided+0.283
pped+0.242
 came+0.238
 stumbled+0.235
ged+0.235
Neg logit contributions
igsaw-0.325
aj-0.287
 difference-0.282
 puzzle-0.270
 coll-0.262
Token find
Token ID1064
Feature activation+5.39e+00

Pos logit contributions
 decided+0.226
ged+0.220
 stumbled+0.193
 came+0.190
pped+0.188
Neg logit contributions
igsaw-0.273
 difference-0.245
 puzzle-0.229
 goal-0.219
 network-0.219
Token a
Token ID257
Feature activation-8.16e+00

Pos logit contributions
 decided+0.118
 came+0.104
 tried+0.103
pped+0.097
 could+0.096
Neg logit contributions
igsaw-0.177
aj-0.168
 difference-0.157
 coll-0.153
 former-0.152
Token hook
Token ID8011
Feature activation-6.67e-01

Pos logit contributions
igsaw+0.195
aj+0.184
 difference+0.159
udd+0.158
 coll+0.157
Neg logit contributions
 decided-0.251
 tried-0.229
 came-0.228
 stumbled-0.217
 could-0.216
Token?"
Token ID1701
Feature activation+1.69e-01

Pos logit contributions
igsaw+0.018
aj+0.017
 difference+0.017
 coll+0.015
 batch+0.015
Neg logit contributions
 decided-0.017
 tried-0.015
 came-0.015
 stumbled-0.014
pped-0.014
Token and
Token ID290
Feature activation+2.20e+00

Pos logit contributions
 decided+0.056
 came+0.047
 tried+0.047
pped+0.044
 stumbled+0.044
Neg logit contributions
igsaw-0.108
aj-0.103
 difference-0.098
 coll-0.095
 goals-0.094
Token continued
Token ID3767
Feature activation+3.84e+00

Pos logit contributions
 decided+0.036
pped+0.032
 came+0.030
 tried+0.030
 stumbled+0.029
Neg logit contributions
igsaw-0.077
aj-0.075
 difference-0.070
 coll-0.070
 former-0.068
Token,
Token ID11
Feature activation+6.75e-01

Pos logit contributions
 decided+0.081
 tried+0.069
 came+0.068
pped+0.067
 could+0.065
Neg logit contributions
igsaw-0.126
aj-0.120
 coll-0.112
 difference-0.111
 goals-0.108
Token "
Token ID366
Feature activation-2.99e-01

Pos logit contributions
 decided+0.015
 came+0.013
 tried+0.013
pped+0.012
 could+0.012
Neg logit contributions
igsaw-0.021
aj-0.020
 difference-0.019
 coll-0.019
 former-0.018
TokenIt
Token ID1026
Feature activation+8.55e+00

Pos logit contributions
igsaw+0.010
aj+0.009
 difference+0.009
 former+0.009
 coll+0.009
Neg logit contributions
 decided-0.006
 tried-0.005
 came-0.005
 could-0.005
 stumbled-0.005
Token'll
Token ID1183
Feature activation+1.14e+01

Pos logit contributions
 decided+0.209
 came+0.185
 tried+0.183
 could+0.174
 stumbled+0.171
Neg logit contributions
igsaw-0.263
aj-0.246
 difference-0.228
 coll-0.224
 former-0.222
Token be
Token ID307
Feature activation+3.28e+00

Pos logit contributions
 decided+0.250
 tried+0.218
pped+0.216
 came+0.213
ged+0.211
Neg logit contributions
igsaw-0.344
 difference-0.329
aj-0.323
 former-0.311
 coll-0.304
Token fun
Token ID1257
Feature activation+1.36e+00

Pos logit contributions
 decided+0.088
 came+0.079
 tried+0.078
 stumbled+0.074
 could+0.074
Neg logit contributions
igsaw-0.091
aj-0.085
 difference-0.079
 coll-0.076
 former-0.076
Token.
Token ID13
Feature activation+1.33e+00

Pos logit contributions
 decided+0.032
 came+0.028
 tried+0.028
 stumbled+0.027
pped+0.027
Neg logit contributions
igsaw-0.041
aj-0.039
 difference-0.037
 coll-0.035
 former-0.035
Token But
Token ID887
Feature activation+4.45e+00

Pos logit contributions
 decided+0.034
 came+0.030
 tried+0.029
 could+0.028
 stumbled+0.028
Neg logit contributions
igsaw-0.039
aj-0.037
 difference-0.034
 coll-0.034
udd-0.033
Token first
Token ID717
Feature activation+1.31e+00

Pos logit contributions
 decided+0.087
 came+0.074
 tried+0.073
pped+0.070
 stumbled+0.069
Neg logit contributions
igsaw-0.152
aj-0.144
 difference-0.139
 former-0.134
 coll-0.132
Token."
Token ID526
Feature activation-3.66e-01

Pos logit contributions
 decided+0.037
 came+0.032
 tried+0.032
 stumbled+0.031
pped+0.030
Neg logit contributions
igsaw-0.047
aj-0.045
 difference-0.043
 coll-0.041
 goals-0.040
Token
Token ID198
Feature activation-5.24e+00

Pos logit contributions
igsaw+0.013
aj+0.012
 difference+0.011
 former+0.011
 coll+0.011
Neg logit contributions
 decided-0.008
 came-0.007
 tried-0.007
 could-0.006
 stumbled-0.006
Token
Token ID198
Feature activation-5.28e+00

Pos logit contributions
igsaw+0.191
aj+0.174
 former+0.168
 difference+0.168
 coll+0.162
Neg logit contributions
 decided-0.100
 came-0.088
 tried-0.086
 could-0.084
 stumbled-0.077
Token"
Token ID1
Feature activation+8.38e-01

Pos logit contributions
igsaw+0.190
aj+0.179
 difference+0.171
 coll+0.166
 former+0.165
Neg logit contributions
 decided-0.102
 came-0.085
 tried-0.085
 stumbled-0.078
 could-0.078
TokenWho
Token ID8241
Feature activation+8.42e+00

Pos logit contributions
 decided+0.019
 came+0.018
pped+0.018
 stumbled+0.017
 tried+0.016
Neg logit contributions
igsaw-0.022
aj-0.021
 coll-0.021
 difference-0.020
 goals-0.020
Token can
Token ID460
Feature activation+1.14e+01

Pos logit contributions
 decided+0.178
 came+0.151
 tried+0.150
pped+0.146
 stumbled+0.144
Neg logit contributions
igsaw-0.267
aj-0.259
 difference-0.242
 coll-0.235
 goals-0.234
Token help
Token ID1037
Feature activation+2.58e+00

Pos logit contributions
 decided+0.236
ged+0.199
pped+0.196
 stumbled+0.193
 came+0.192
Neg logit contributions
igsaw-0.357
 difference-0.324
aj-0.316
 puzzle-0.309
 biggest-0.306
Token them
Token ID606
Feature activation+5.55e+00

Pos logit contributions
 decided+0.063
 came+0.055
 tried+0.055
 stumbled+0.053
pped+0.052
Neg logit contributions
igsaw-0.077
aj-0.072
 difference-0.068
 coll-0.066
 former-0.064
Token?"
Token ID1701
Feature activation-3.99e-01

Pos logit contributions
 decided+0.143
 came+0.126
pped+0.125
 tried+0.125
 stumbled+0.125
Neg logit contributions
igsaw-0.147
aj-0.138
 difference-0.136
 former-0.126
 coll-0.124
Token Ben
Token ID3932
Feature activation+2.25e+00

Pos logit contributions
igsaw+0.012
aj+0.011
 difference+0.010
 coll+0.010
 former+0.010
Neg logit contributions
 decided-0.010
 tried-0.009
 came-0.009
 stumbled-0.009
 could-0.009
Token says
Token ID1139
Feature activation+2.92e-01

Pos logit contributions
 decided+0.061
pped+0.055
 came+0.055
 tried+0.053
 stumbled+0.053
Neg logit contributions
igsaw-0.056
aj-0.055
 difference-0.051
 goals-0.050
 coll-0.049
Token.
Token ID13
Feature activation+6.95e-01

Pos logit contributions
igsaw+0.010
aj+0.010
 coll+0.010
 puzzle+0.009
 difference+0.009
Neg logit contributions
 decided-0.008
 tried-0.007
 stumbled-0.007
Please-0.007
 rushed-0.007
Token
Token ID198
Feature activation-4.52e+00

Pos logit contributions
 decided+0.016
 came+0.014
 tried+0.014
 could+0.013
 stumbled+0.013
Neg logit contributions
igsaw-0.022
aj-0.021
 difference-0.019
 coll-0.019
 former-0.019
Token
Token ID198
Feature activation-4.70e+00

Pos logit contributions
igsaw+0.140
aj+0.132
 difference+0.124
 coll+0.120
 former+0.119
Neg logit contributions
 decided-0.106
 came-0.093
 tried-0.093
 stumbled-0.087
 could-0.087
Token"
Token ID1
Feature activation+1.17e+00

Pos logit contributions
igsaw+0.136
aj+0.127
 difference+0.119
 coll+0.115
 former+0.113
Neg logit contributions
 decided-0.120
 came-0.106
 tried-0.106
 stumbled-0.100
 could-0.100
TokenWhere
Token ID8496
Feature activation+7.27e+00

Pos logit contributions
 decided+0.026
 came+0.024
 tried+0.024
pped+0.023
 got+0.022
Neg logit contributions
igsaw-0.035
aj-0.030
 difference-0.030
 former-0.029
 coll-0.029
Token'd
Token ID1549
Feature activation+1.14e+01

Pos logit contributions
 decided+0.171
 came+0.149
 tried+0.148
 stumbled+0.140
pped+0.139
Neg logit contributions
igsaw-0.225
aj-0.211
 difference-0.199
 coll-0.192
 former-0.190
Token you
Token ID345
Feature activation+8.18e+00

Pos logit contributions
pped+0.274
ged+0.269
 decided+0.263
 stumbled+0.238
 rushed+0.237
Neg logit contributions
igsaw-0.277
 new-0.261
 difference-0.256
 puzzle-0.244
 coll-0.243
Token find
Token ID1064
Feature activation+5.26e+00

Pos logit contributions
ged+0.190
Please+0.176
 decided+0.167
pped+0.159
Don+0.155
Neg logit contributions
igsaw-0.201
 difference-0.193
 biggest-0.175
 network-0.174
 art-0.174
Token that
Token ID326
Feature activation+9.03e-01

Pos logit contributions
 decided+0.119
 tried+0.102
 came+0.102
 stumbled+0.097
 could+0.095
Neg logit contributions
igsaw-0.166
aj-0.156
 difference-0.148
 coll-0.143
 former-0.141
Token crown
Token ID12389
Feature activation+2.90e+00

Pos logit contributions
 decided+0.022
 came+0.019
 tried+0.019
 could+0.018
 stumbled+0.017
Neg logit contributions
igsaw-0.028
aj-0.027
 difference-0.025
 coll-0.024
 former-0.024
Token?
Token ID30
Feature activation+1.48e+00

Pos logit contributions
 decided+0.075
 tried+0.066
 came+0.065
 stumbled+0.064
pped+0.064
Neg logit contributions
igsaw-0.079
aj-0.076
 difference-0.069
udd-0.067
 coll-0.066
Token the
Token ID262
Feature activation-1.05e+01

Pos logit contributions
 decided+0.080
 tried+0.071
pped+0.071
 stumbled+0.070
 came+0.068
Neg logit contributions
igsaw-0.081
aj-0.077
 difference-0.074
 coll-0.073
 batch-0.071
Token children
Token ID1751
Feature activation+2.83e+00

Pos logit contributions
igsaw+0.294
aj+0.264
 former+0.238
opl+0.234
 goals+0.231
Neg logit contributions
 decided-0.311
 came-0.283
 tried-0.273
 could-0.258
 wanted-0.256
Token,
Token ID11
Feature activation+1.05e+00

Pos logit contributions
 decided+0.061
 tried+0.053
 came+0.052
pped+0.051
 stumbled+0.050
Neg logit contributions
igsaw-0.091
aj-0.085
 difference-0.082
 coll-0.078
 goals-0.078
Token "
Token ID366
Feature activation-2.31e-01

Pos logit contributions
 decided+0.026
 came+0.023
 tried+0.023
pped+0.022
 stumbled+0.022
Neg logit contributions
igsaw-0.030
aj-0.028
 difference-0.027
 coll-0.026
 goals-0.025
TokenWhat
Token ID2061
Feature activation+6.38e+00

Pos logit contributions
igsaw+0.007
aj+0.007
 batch+0.007
 coll+0.007
 difference+0.007
Neg logit contributions
 decided-0.004
 tried-0.004
 dropped-0.004
 came-0.004
 stumbled-0.004
Token would
Token ID561
Feature activation+1.14e+01

Pos logit contributions
 decided+0.156
 came+0.138
 tried+0.136
 could+0.131
 stumbled+0.127
Neg logit contributions
igsaw-0.197
aj-0.185
 difference-0.171
 former-0.169
 coll-0.166
Token you
Token ID345
Feature activation+7.80e+00

Pos logit contributions
 decided+0.287
pped+0.274
 came+0.256
ged+0.256
 stumbled+0.254
Neg logit contributions
igsaw-0.308
 difference-0.275
aj-0.266
 coll-0.264
 new-0.262
Token like
Token ID588
Feature activation+1.62e+00

Pos logit contributions
 decided+0.160
pped+0.145
ged+0.137
 stumbled+0.137
 came+0.137
Neg logit contributions
igsaw-0.240
aj-0.225
 difference-0.224
 coll-0.216
 former-0.213
Token to
Token ID284
Feature activation+3.42e+00

Pos logit contributions
 decided+0.031
 came+0.028
 tried+0.026
 could+0.025
 got+0.024
Neg logit contributions
igsaw-0.058
aj-0.055
 difference-0.051
 former-0.051
udd-0.051
Token eat
Token ID4483
Feature activation+5.51e-01

Pos logit contributions
 decided+0.075
 came+0.065
 tried+0.064
 stumbled+0.061
 could+0.060
Neg logit contributions
igsaw-0.112
aj-0.105
 difference-0.100
 coll-0.097
 former-0.096
Token?"
Token ID1701
Feature activation-7.63e-01

Pos logit contributions
 decided+0.012
 came+0.011
 tried+0.011
 could+0.010
pped+0.010
Neg logit contributions
igsaw-0.018
aj-0.017
 difference-0.016
 former-0.016
 coll-0.015
Token to
Token ID284
Feature activation+3.59e+00

Pos logit contributions
 decided+0.035
 tried+0.032
pped+0.031
 stumbled+0.031
 came+0.030
Neg logit contributions
igsaw-0.039
aj-0.035
 difference-0.033
 coll-0.033
 batch-0.032
Token the
Token ID262
Feature activation-1.20e+01

Pos logit contributions
 decided+0.095
 stumbled+0.085
 tried+0.084
 could+0.084
pped+0.083
Neg logit contributions
igsaw-0.098
 difference-0.083
 former-0.078
 coll-0.077
aj-0.077
Token students
Token ID2444
Feature activation+2.35e+00

Pos logit contributions
igsaw+0.288
aj+0.269
 difference+0.246
 coll+0.238
 former+0.234
Neg logit contributions
 decided-0.371
 came-0.335
 tried-0.334
 stumbled-0.318
 could-0.317
Token "
Token ID366
Feature activation-8.04e-01

Pos logit contributions
 decided+0.050
pped+0.044
 tried+0.043
 came+0.043
 stumbled+0.042
Neg logit contributions
igsaw-0.073
aj-0.066
 difference-0.065
 goals-0.064
 coll-0.062
TokenWho
Token ID8241
Feature activation+7.81e+00

Pos logit contributions
igsaw+0.023
aj+0.021
 difference+0.021
 coll+0.020
 former+0.020
Neg logit contributions
 decided-0.017
 came-0.016
 tried-0.015
 dropped-0.015
 stumbled-0.015
Token would
Token ID561
Feature activation+1.14e+01

Pos logit contributions
 decided+0.182
 came+0.158
 tried+0.157
pped+0.152
 stumbled+0.151
Neg logit contributions
igsaw-0.233
aj-0.221
 difference-0.209
 goals-0.202
 coll-0.201
Token like
Token ID588
Feature activation+1.71e+00

Pos logit contributions
ged+0.265
 decided+0.260
pped+0.259
 stumbled+0.241
 came+0.232
Neg logit contributions
igsaw-0.295
 difference-0.293
 new-0.262
 former-0.258
 art-0.256
Token to
Token ID284
Feature activation+3.63e+00

Pos logit contributions
 decided+0.036
 came+0.031
 tried+0.031
 stumbled+0.029
pped+0.029
Neg logit contributions
igsaw-0.057
aj-0.053
 difference-0.051
 coll-0.049
 former-0.049
Token bring
Token ID2222
Feature activation+2.06e+00

Pos logit contributions
 decided+0.084
 stumbled+0.072
 came+0.072
ged+0.071
 tried+0.070
Neg logit contributions
igsaw-0.111
 difference-0.103
aj-0.096
 former-0.094
 coll-0.094
Token an
Token ID281
Feature activation-1.26e+01

Pos logit contributions
 decided+0.044
 came+0.039
 tried+0.038
 could+0.036
 stumbled+0.035
Neg logit contributions
igsaw-0.068
aj-0.065
 difference-0.061
 coll-0.059
 former-0.059
Token object
Token ID2134
Feature activation+4.41e-01

Pos logit contributions
igsaw+0.316
aj+0.311
udd+0.268
 coll+0.263
 difference+0.258
Neg logit contributions
 decided-0.381
 tried-0.334
 came-0.323
 wanted-0.310
 could-0.309
Token and
Token ID290
Feature activation+1.81e+00

Pos logit contributions
igsaw+0.080
aj+0.075
 difference+0.072
 coll+0.070
 former+0.069
Neg logit contributions
 decided-0.036
 came-0.029
 tried-0.029
 could-0.026
 stumbled-0.026
Token said
Token ID531
Feature activation+1.23e+00

Pos logit contributions
 decided+0.030
pped+0.025
 came+0.025
 tried+0.024
 stumbled+0.024
Neg logit contributions
igsaw-0.063
aj-0.062
 difference-0.057
 coll-0.057
udd-0.056
Token,
Token ID11
Feature activation+5.39e-01

Pos logit contributions
 decided+0.028
 tried+0.025
 came+0.025
 stumbled+0.023
pped+0.023
Neg logit contributions
igsaw-0.039
aj-0.036
 difference-0.034
 coll-0.033
 former-0.033
Token "
Token ID366
Feature activation-3.01e-01

Pos logit contributions
 decided+0.016
pped+0.014
 tried+0.014
 came+0.014
 stumbled+0.014
Neg logit contributions
igsaw-0.012
aj-0.012
 difference-0.010
 coll-0.010
 former-0.010
TokenWhat
Token ID2061
Feature activation+6.25e+00

Pos logit contributions
igsaw+0.010
aj+0.009
 difference+0.009
 coll+0.009
 former+0.009
Neg logit contributions
 decided-0.006
 came-0.006
 tried-0.006
 stumbled-0.005
 could-0.005
Token would
Token ID561
Feature activation+1.13e+01

Pos logit contributions
 decided+0.147
 came+0.128
 tried+0.127
 could+0.124
 stumbled+0.117
Neg logit contributions
igsaw-0.202
aj-0.192
 former-0.176
 difference-0.175
 coll-0.172
Token you
Token ID345
Feature activation+7.81e+00

Pos logit contributions
 decided+0.277
pped+0.257
 came+0.246
ged+0.244
 tried+0.240
Neg logit contributions
igsaw-0.321
 difference-0.286
aj-0.278
 coll-0.265
 former-0.265
Token like
Token ID588
Feature activation+1.50e+00

Pos logit contributions
 decided+0.174
pped+0.158
ged+0.157
 came+0.152
 tried+0.149
Neg logit contributions
igsaw-0.227
 difference-0.214
aj-0.214
 former-0.203
 biggest-0.193
Token to
Token ID284
Feature activation+3.27e+00

Pos logit contributions
 decided+0.031
 came+0.027
 tried+0.026
 could+0.024
 stumbled+0.024
Neg logit contributions
igsaw-0.052
aj-0.049
 difference-0.046
 former-0.045
 coll-0.045
Token eat
Token ID4483
Feature activation+5.19e-01

Pos logit contributions
 decided+0.076
 tried+0.066
 came+0.066
 stumbled+0.063
pped+0.063
Neg logit contributions
igsaw-0.102
aj-0.095
 difference-0.091
 coll-0.086
 former-0.086
Token,
Token ID11
Feature activation-2.99e-01

Pos logit contributions
 decided+0.012
 came+0.010
 tried+0.010
 could+0.010
pped+0.009
Neg logit contributions
igsaw-0.017
aj-0.016
 difference-0.015
 former-0.014
 coll-0.014
Token each
Token ID1123
Feature activation-9.24e-01

Pos logit contributions
 decided+0.064
 came+0.057
 tried+0.055
 could+0.051
 stumbled+0.049
Neg logit contributions
igsaw-0.104
aj-0.099
 difference-0.090
 former-0.088
udd-0.087
Token other
Token ID584
Feature activation-3.36e+00

Pos logit contributions
igsaw+0.020
 difference+0.018
aj+0.017
 goals+0.017
 coll+0.016
Neg logit contributions
 decided-0.029
 stumbled-0.026
pped-0.026
 came-0.026
 tried-0.026
Token,
Token ID11
Feature activation+5.10e-01

Pos logit contributions
igsaw+0.110
aj+0.104
 difference+0.097
 former+0.095
 coll+0.094
Neg logit contributions
 decided-0.075
 came-0.065
 tried-0.064
 could-0.061
 stumbled-0.059
Token "
Token ID366
Feature activation-4.74e-01

Pos logit contributions
 decided+0.014
pped+0.013
 came+0.012
 stumbled+0.012
 tried+0.012
Neg logit contributions
igsaw-0.012
aj-0.012
 difference-0.011
 coll-0.011
 new-0.011
TokenWho
Token ID8241
Feature activation+8.20e+00

Pos logit contributions
igsaw+0.012
 difference+0.012
aj+0.011
 former+0.011
 batch+0.011
Neg logit contributions
 decided-0.010
 could-0.009
 came-0.009
 dropped-0.009
 tried-0.009
Token can
Token ID460
Feature activation+1.13e+01

Pos logit contributions
 decided+0.174
 came+0.149
 tried+0.148
 stumbled+0.141
pped+0.140
Neg logit contributions
igsaw-0.262
aj-0.251
 difference-0.238
 coll-0.229
 goals-0.228
Token fix
Token ID4259
Feature activation-8.43e-02

Pos logit contributions
 decided+0.224
ged+0.191
 stumbled+0.183
pped+0.183
 came+0.180
Neg logit contributions
igsaw-0.370
 difference-0.335
aj-0.324
 puzzle-0.320
 biggest-0.313
Token the
Token ID262
Feature activation-1.36e+01

Pos logit contributions
igsaw+0.003
aj+0.003
 difference+0.003
 former+0.003
 coll+0.003
Neg logit contributions
 decided-0.001
 came-0.001
 tried-0.001
 could-0.001
pped-0.001
Token moon
Token ID8824
Feature activation-1.64e+00

Pos logit contributions
igsaw+0.302
aj+0.275
udd+0.228
 coll+0.228
 batch+0.228
Neg logit contributions
 decided-0.463
 came-0.428
 tried-0.425
 could-0.413
 stumbled-0.403
Token?"
Token ID1701
Feature activation-9.97e-01

Pos logit contributions
igsaw+0.049
aj+0.046
 difference+0.043
 coll+0.041
 former+0.041
Neg logit contributions
 decided-0.040
 tried-0.035
 came-0.035
 stumbled-0.033
pped-0.033
Token They
Token ID1119
Feature activation+7.32e+00

Pos logit contributions
igsaw+0.024
aj+0.022
 difference+0.021
 coll+0.020
 former+0.020
Neg logit contributions
 decided-0.029
 came-0.026
pped-0.026
 tried-0.026
 stumbled-0.025
Token Tim
Token ID5045
Feature activation+2.99e+00

Pos logit contributions
 decided+0.002
 tried+0.002
 came+0.002
 could+0.002
 stumbled+0.002
Neg logit contributions
igsaw-0.003
aj-0.002
 difference-0.002
 former-0.002
 coll-0.002
Token asked
Token ID1965
Feature activation+3.03e+00

Pos logit contributions
 decided+0.039
pped+0.034
 came+0.032
 stumbled+0.031
 tried+0.030
Neg logit contributions
igsaw-0.111
aj-0.109
udd-0.102
 difference-0.102
 coll-0.101
Token,
Token ID11
Feature activation+7.36e-01

Pos logit contributions
 decided+0.075
 tried+0.065
 came+0.064
 stumbled+0.063
pped+0.063
Neg logit contributions
igsaw-0.088
aj-0.081
 difference-0.079
 coll-0.076
 goals-0.075
Token "
Token ID366
Feature activation-3.65e-01

Pos logit contributions
 decided+0.022
pped+0.021
 stumbled+0.020
 tried+0.019
 rushed+0.019
Neg logit contributions
igsaw-0.014
 new-0.013
aj-0.013
 difference-0.012
 batch-0.012
TokenWhere
Token ID8496
Feature activation+7.97e+00

Pos logit contributions
igsaw+0.011
 difference+0.010
 former+0.010
aj+0.010
 coll+0.010
Neg logit contributions
 decided-0.007
 came-0.007
 dropped-0.006
 tried-0.006
 rushed-0.006
Token can
Token ID460
Feature activation+1.13e+01

Pos logit contributions
 decided+0.182
 came+0.159
 tried+0.158
 could+0.150
 stumbled+0.148
Neg logit contributions
igsaw-0.255
aj-0.240
 difference-0.226
 coll-0.220
 former-0.219
Token I
Token ID314
Feature activation+9.51e+00

Pos logit contributions
 decided+0.274
pped+0.238
 came+0.232
 stumbled+0.229
ged+0.228
Neg logit contributions
igsaw-0.327
aj-0.293
 difference-0.290
 puzzle-0.278
 coll-0.274
Token find
Token ID1064
Feature activation+5.18e+00

Pos logit contributions
 decided+0.255
ged+0.254
pped+0.233
 stumbled+0.226
 came+0.223
Neg logit contributions
igsaw-0.227
 puzzle-0.196
 difference-0.195
aj-0.184
 biggest-0.181
Token the
Token ID262
Feature activation-1.19e+01

Pos logit contributions
 decided+0.113
 came+0.099
 tried+0.098
 could+0.092
 stumbled+0.091
Neg logit contributions
igsaw-0.170
aj-0.161
 difference-0.151
 coll-0.147
 former-0.146
Token key
Token ID1994
Feature activation+1.86e+00

Pos logit contributions
igsaw+0.331
aj+0.319
udd+0.290
 difference+0.274
 coll+0.272
Neg logit contributions
 decided-0.323
 tried-0.291
 came-0.289
 could-0.273
 stumbled-0.271
Token?"
Token ID1701
Feature activation-5.54e-01

Pos logit contributions
 decided+0.048
 tried+0.043
 came+0.042
 stumbled+0.041
pped+0.041
Neg logit contributions
igsaw-0.051
aj-0.048
 difference-0.045
 coll-0.043
 former-0.042
Token Lily
Token ID20037
Feature activation+2.36e+00

Pos logit contributions
 decided+0.027
 came+0.025
 tried+0.024
 stumbled+0.023
 could+0.023
Neg logit contributions
igsaw-0.030
aj-0.028
 difference-0.027
 former-0.026
udd-0.026
Token.
Token ID13
Feature activation+1.12e-01

Pos logit contributions
 decided+0.046
pped+0.040
 came+0.038
 tried+0.038
 stumbled+0.037
Neg logit contributions
igsaw-0.077
aj-0.075
 difference-0.069
 coll-0.068
udd-0.067
Token "
Token ID366
Feature activation-4.54e-01

Pos logit contributions
 decided+0.003
 tried+0.003
pped+0.003
 came+0.003
 stumbled+0.003
Neg logit contributions
igsaw-0.003
aj-0.003
 difference-0.003
 coll-0.003
 goals-0.003
TokenBut
Token ID1537
Feature activation+4.31e+00

Pos logit contributions
igsaw+0.013
Whe+0.012
 goals+0.012
aj+0.012
 coll+0.012
Neg logit contributions
 came-0.009
 decided-0.009
 tried-0.009
 rushed-0.008
pped-0.008
Token how
Token ID703
Feature activation+2.88e+00

Pos logit contributions
 decided+0.091
 came+0.078
 tried+0.077
pped+0.074
 stumbled+0.074
Neg logit contributions
igsaw-0.139
aj-0.134
 difference-0.128
 coll-0.122
 goals-0.122
Token can
Token ID460
Feature activation+1.13e+01

Pos logit contributions
 decided+0.064
 could+0.056
 came+0.055
 tried+0.054
 wanted+0.050
Neg logit contributions
igsaw-0.098
aj-0.089
 difference-0.084
 former-0.084
 coll-0.083
Token we
Token ID356
Feature activation+9.34e+00

Pos logit contributions
 decided+0.259
 came+0.225
pped+0.223
 tried+0.223
 stumbled+0.218
Neg logit contributions
igsaw-0.336
aj-0.308
 difference-0.302
 coll-0.291
 goals-0.285
Token get
Token ID651
Feature activation+2.79e+00

Pos logit contributions
 decided+0.208
ged+0.204
pped+0.189
 came+0.184
 rushed+0.183
Neg logit contributions
igsaw-0.264
 difference-0.248
 former-0.240
aj-0.231
 tallest-0.227
Token it
Token ID340
Feature activation+6.49e+00

Pos logit contributions
 decided+0.057
 came+0.051
 tried+0.050
 rushed+0.049
 stumbled+0.049
Neg logit contributions
igsaw-0.085
aj-0.082
 difference-0.078
 coll-0.076
 biggest-0.076
Token?
Token ID30
Feature activation+1.63e+00

Pos logit contributions
 decided+0.150
 came+0.132
 tried+0.129
 could+0.123
 stumbled+0.120
Neg logit contributions
igsaw-0.207
aj-0.193
 difference-0.184
 coll-0.178
 former-0.177
Token It
Token ID632
Feature activation+8.33e+00

Pos logit contributions
 decided+0.040
 came+0.036
 tried+0.035
 stumbled+0.033
pped+0.033
Neg logit contributions
igsaw-0.048
aj-0.045
 difference-0.043
 coll-0.041
 former-0.041
Token
Token ID198
Feature activation-4.40e+00

Pos logit contributions
igsaw+0.000
aj+0.000
 difference+0.000
 former+0.000
 coll+0.000
Neg logit contributions
 decided-0.000
 came-0.000
 tried-0.000
 could-0.000
 stumbled-0.000
Token
Token ID198
Feature activation-4.63e+00

Pos logit contributions
igsaw+0.150
aj+0.140
 difference+0.133
 former+0.131
 coll+0.128
Neg logit contributions
 decided-0.093
 came-0.081
 tried-0.080
 could-0.076
 stumbled-0.074
Token"
Token ID1
Feature activation+9.90e-01

Pos logit contributions
igsaw+0.150
aj+0.142
 difference+0.133
 coll+0.131
 goals+0.130
Neg logit contributions
 decided-0.099
 came-0.088
 tried-0.086
 stumbled-0.082
 could-0.080
TokenBut
Token ID1537
Feature activation+4.10e+00

Pos logit contributions
 decided+0.023
 came+0.021
pped+0.020
 tried+0.020
 stumbled+0.020
Neg logit contributions
igsaw-0.028
aj-0.026
 coll-0.025
 difference-0.025
 goals-0.024
Token it
Token ID340
Feature activation+7.41e+00

Pos logit contributions
 decided+0.094
 came+0.081
 tried+0.080
 stumbled+0.077
pped+0.077
Neg logit contributions
igsaw-0.127
aj-0.120
 difference-0.115
 coll-0.109
 goals-0.109
Token would
Token ID561
Feature activation+1.13e+01

Pos logit contributions
 decided+0.209
 came+0.189
 tried+0.189
 could+0.179
 stumbled+0.177
Neg logit contributions
igsaw-0.198
aj-0.184
 difference-0.169
 former-0.165
 coll-0.164
Token be
Token ID307
Feature activation+3.51e+00

Pos logit contributions
 decided+0.242
 came+0.210
pped+0.205
 tried+0.204
 stumbled+0.200
Neg logit contributions
igsaw-0.358
aj-0.337
 difference-0.335
 coll-0.320
 former-0.316
Token fun
Token ID1257
Feature activation+1.48e+00

Pos logit contributions
 decided+0.106
 came+0.096
 tried+0.095
pped+0.092
 stumbled+0.092
Neg logit contributions
igsaw-0.082
aj-0.079
 difference-0.071
 coll-0.069
 goals-0.068
Token!"
Token ID2474
Feature activation-2.34e-01

Pos logit contributions
 decided+0.033
 came+0.029
 tried+0.029
pped+0.028
 stumbled+0.027
Neg logit contributions
igsaw-0.047
aj-0.044
 difference-0.042
 coll-0.040
 goals-0.040
Token Tom
Token ID4186
Feature activation+2.11e+00

Pos logit contributions
igsaw+0.007
aj+0.007
 difference+0.006
 former+0.006
udd+0.006
Neg logit contributions
 decided-0.006
 tried-0.005
 came-0.005
 could-0.005
 stumbled-0.005
Token said
Token ID531
Feature activation+3.92e-01

Pos logit contributions
 decided+0.039
pped+0.037
 came+0.034
 stumbled+0.034
 could+0.031
Neg logit contributions
aj-0.065
igsaw-0.064
udd-0.060
 coll-0.060
 difference-0.059
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
 decided+0.011
 tried+0.010
 came+0.010
pped+0.010
 stumbled+0.009
Neg logit contributions
igsaw-0.019
aj-0.018
 goals-0.017
 difference-0.017
 coll-0.016
Token
Token ID198
Feature activation-4.18e+00

Pos logit contributions
 decided+0.026
 tried+0.023
 came+0.023
 could+0.022
 stumbled+0.021
Neg logit contributions
igsaw-0.043
aj-0.040
 coll-0.038
 difference-0.038
 former-0.038
Token
Token ID198
Feature activation-4.41e+00

Pos logit contributions
igsaw+0.144
aj+0.133
 difference+0.128
 former+0.126
 coll+0.125
Neg logit contributions
 decided-0.087
 came-0.077
 tried-0.075
 could-0.073
 stumbled-0.070
Token"
Token ID1
Feature activation+1.26e+00

Pos logit contributions
igsaw+0.130
aj+0.123
 difference+0.114
 coll+0.111
 goals+0.110
Neg logit contributions
 decided-0.108
 came-0.096
 tried-0.096
 stumbled-0.090
pped-0.090
TokenWhere
Token ID8496
Feature activation+7.66e+00

Pos logit contributions
 decided+0.025
 came+0.022
 tried+0.022
 could+0.021
pped+0.021
Neg logit contributions
igsaw-0.039
aj-0.037
 difference-0.035
 coll-0.034
 former-0.034
Token can
Token ID460
Feature activation+1.13e+01

Pos logit contributions
 decided+0.184
 came+0.162
 tried+0.161
 could+0.152
 stumbled+0.151
Neg logit contributions
igsaw-0.236
aj-0.221
 difference-0.208
 coll-0.202
 former-0.201
Token we
Token ID356
Feature activation+9.71e+00

Pos logit contributions
 decided+0.279
pped+0.251
 tried+0.238
ged+0.236
 came+0.235
Neg logit contributions
igsaw-0.304
aj-0.274
 difference-0.273
 new-0.257
 puzzle-0.257
Token drink
Token ID4144
Feature activation+2.22e+00

Pos logit contributions
ged+0.268
pped+0.233
 decided+0.230
 rushed+0.213
ously+0.211
Neg logit contributions
 difference-0.232
igsaw-0.225
 biggest-0.209
 new-0.196
 goal-0.195
Token,
Token ID11
Feature activation+5.02e-01

Pos logit contributions
 decided+0.050
 came+0.044
 tried+0.043
 could+0.041
 stumbled+0.039
Neg logit contributions
igsaw-0.073
aj-0.068
 difference-0.064
 former-0.063
 coll-0.062
Token Lia
Token ID49520
Feature activation+1.29e+00

Pos logit contributions
 decided+0.015
 came+0.013
 tried+0.013
 could+0.013
 stumbled+0.012
Neg logit contributions
igsaw-0.013
aj-0.012
 difference-0.011
 coll-0.011
 former-0.011
Token?"
Token ID1701
Feature activation+2.20e-01

Pos logit contributions
 decided+0.036
pped+0.034
 tried+0.034
 rushed+0.033
ged+0.033
Neg logit contributions
aj-0.029
igsaw-0.026
 difference-0.025
udd-0.024
 coll-0.023
Token little
Token ID1310
Feature activation-9.07e+00

Pos logit contributions
igsaw+0.281
aj+0.273
 difference+0.260
 coll+0.253
udd+0.243
Neg logit contributions
 decided-0.309
 came-0.278
 tried-0.275
pped-0.275
 stumbled-0.268
Token boy
Token ID2933
Feature activation+4.58e+00

Pos logit contributions
igsaw+0.217
aj+0.200
 difference+0.185
 coll+0.177
 former+0.175
Neg logit contributions
 decided-0.278
 came-0.252
 tried-0.251
 stumbled-0.240
 could-0.239
Token,
Token ID11
Feature activation+1.21e+00

Pos logit contributions
 decided+0.118
pped+0.111
 tried+0.105
 came+0.103
 stumbled+0.102
Neg logit contributions
igsaw-0.119
aj-0.109
 difference-0.106
 goals-0.103
 coll-0.101
Token "
Token ID366
Feature activation-1.53e-01

Pos logit contributions
 decided+0.034
pped+0.033
 stumbled+0.030
 tried+0.030
 came+0.029
Neg logit contributions
igsaw-0.025
aj-0.024
 difference-0.023
 new-0.023
 coll-0.022
TokenIt
Token ID1026
Feature activation+8.77e+00

Pos logit contributions
igsaw+0.005
aj+0.004
Whe+0.004
 difference+0.004
 former+0.004
Neg logit contributions
 decided-0.003
 came-0.003
 tried-0.002
 dropped-0.002
pped-0.002
Token would
Token ID561
Feature activation+1.13e+01

Pos logit contributions
 decided+0.218
 came+0.194
 tried+0.193
 could+0.183
 stumbled+0.179
Neg logit contributions
igsaw-0.269
aj-0.250
 difference-0.230
 coll-0.227
 former-0.225
Token be
Token ID307
Feature activation+3.51e+00

Pos logit contributions
 decided+0.238
 came+0.205
 tried+0.203
pped+0.193
 stumbled+0.193
Neg logit contributions
igsaw-0.366
aj-0.345
 difference-0.338
 coll-0.324
 former-0.323
Token very
Token ID845
Feature activation+1.60e+00

Pos logit contributions
 decided+0.102
 came+0.091
 tried+0.091
 stumbled+0.087
 could+0.086
Neg logit contributions
igsaw-0.091
aj-0.084
 difference-0.078
 coll-0.075
 former-0.074
Token hard
Token ID1327
Feature activation-1.71e+00

Pos logit contributions
 decided+0.047
 came+0.042
 tried+0.042
 stumbled+0.040
 could+0.040
Neg logit contributions
igsaw-0.041
aj-0.037
 difference-0.035
 coll-0.033
 former-0.033
Token for
Token ID329
Feature activation+2.97e-01

Pos logit contributions
igsaw+0.054
aj+0.051
 difference+0.048
 coll+0.047
 goals+0.046
Neg logit contributions
 decided-0.038
 came-0.033
 tried-0.033
pped-0.032
 stumbled-0.031
Token you
Token ID345
Feature activation+5.72e+00

Pos logit contributions
 decided+0.007
 came+0.006
 tried+0.006
 stumbled+0.006
 could+0.006
Neg logit contributions
igsaw-0.009
aj-0.009
 difference-0.008
 coll-0.008
 former-0.008
Token to
Token ID284
Feature activation+3.86e+00

Pos logit contributions
pped+0.116
 decided+0.114
 tried+0.108
ged+0.105
 came+0.104
Neg logit contributions
igsaw-0.140
 new-0.136
aj-0.134
 difference-0.125
 math-0.124
Token take
Token ID1011
Feature activation+9.76e-01

Pos logit contributions
 decided+0.088
ged+0.081
pped+0.079
 came+0.078
 tried+0.077
Neg logit contributions
igsaw-0.116
 difference-0.105
aj-0.099
 former-0.099
 goals-0.097
Token a
Token ID257
Feature activation-9.12e+00

Pos logit contributions
 decided+0.022
 tried+0.019
 came+0.019
 could+0.018
 stumbled+0.018
Neg logit contributions
igsaw-0.031
aj-0.029
 difference-0.027
 coll-0.027
 former-0.026
Token journey
Token ID7002
Feature activation+1.55e+00

Pos logit contributions
igsaw+0.257
aj+0.249
 difference+0.229
 former+0.224
 goals+0.223
Neg logit contributions
 decided-0.232
 came-0.209
 tried-0.208
 stumbled-0.195
 could-0.193
Token to
Token ID284
Feature activation+1.62e+00

Pos logit contributions
 decided+0.029
 tried+0.024
 came+0.024
 stumbled+0.022
pped+0.022
Neg logit contributions
igsaw-0.055
aj-0.052
 difference-0.050
 coll-0.049
 former-0.048
Token outer
Token ID12076
Feature activation-1.81e+01

Pos logit contributions
 decided+0.034
 came+0.029
 tried+0.029
 stumbled+0.027
 could+0.027
Neg logit contributions
igsaw-0.055
aj-0.053
 difference-0.048
udd-0.048
 coll-0.047
Token space
Token ID2272
Feature activation-1.59e+00

Pos logit contributions
igsaw+0.420
aj+0.395
udd+0.356
uddle+0.336
 fault+0.332
Neg logit contributions
 decided-0.549
 it-0.522
 him-0.496
 asked-0.490
 tried-0.489
Token.
Token ID13
Feature activation+2.73e-01

Pos logit contributions
igsaw+0.058
aj+0.055
 difference+0.052
 coll+0.051
 former+0.050
Neg logit contributions
 decided-0.029
 came-0.024
 tried-0.024
 stumbled-0.022
pped-0.022
Token He
Token ID679
Feature activation+6.94e+00

Pos logit contributions
 decided+0.007
 came+0.006
 tried+0.006
pped+0.006
 stumbled+0.006
Neg logit contributions
igsaw-0.008
aj-0.008
 difference-0.007
 coll-0.007
 former-0.007
Token hopped
Token ID45230
Feature activation+2.00e+00

Pos logit contributions
 decided+0.094
pped+0.077
 came+0.074
 tried+0.073
 stumbled+0.066
Neg logit contributions
igsaw-0.259
aj-0.258
 difference-0.238
 coll-0.236
udd-0.235
Token into
Token ID656
Feature activation-3.47e+00

Pos logit contributions
 decided+0.041
 tried+0.035
 came+0.035
 stumbled+0.033
pped+0.033
Neg logit contributions
igsaw-0.067
aj-0.063
 difference-0.059
 coll-0.058
 goals-0.057
Token she
Token ID673
Feature activation+7.30e+00

Pos logit contributions
 decided+0.035
 tried+0.030
 came+0.030
 stumbled+0.028
pped+0.028
Neg logit contributions
igsaw-0.059
aj-0.056
 difference-0.053
 coll-0.051
 former-0.051
Token got
Token ID1392
Feature activation+2.28e+00

Pos logit contributions
ged+0.089
 decided+0.086
pped+0.085
 him+0.080
Please+0.076
Neg logit contributions
aj-0.225
udd-0.224
 new-0.203
 coll-0.201
 difference-0.197
Token rid
Token ID5755
Feature activation-3.80e+00

Pos logit contributions
 decided+0.048
pped+0.042
 came+0.041
 stumbled+0.041
 tried+0.041
Neg logit contributions
igsaw-0.068
aj-0.066
 difference-0.063
 new-0.061
 coll-0.061
Token of
Token ID286
Feature activation-3.59e+00

Pos logit contributions
igsaw+0.086
aj+0.075
udd+0.072
 difference+0.071
 goals+0.068
Neg logit contributions
 decided-0.114
 tried-0.103
 stumbled-0.102
 came-0.098
 could-0.096
Token the
Token ID262
Feature activation-1.28e+01

Pos logit contributions
igsaw+0.112
aj+0.104
 difference+0.100
 coll+0.097
 goals+0.096
Neg logit contributions
 decided-0.083
 came-0.071
 tried-0.071
 stumbled-0.068
pped-0.067
Token intr
Token ID9913
Feature activation-1.79e+01

Pos logit contributions
igsaw+0.331
aj+0.307
 difference+0.285
 coll+0.274
 former+0.271
Neg logit contributions
 decided-0.370
 came-0.332
 tried-0.330
 stumbled-0.315
 could-0.314
Tokenuders
Token ID48739
Feature activation+1.81e+00

Pos logit contributions
igsaw+0.407
aj+0.373
 difference+0.343
 coll+0.327
 former+0.322
Neg logit contributions
 decided-0.567
 came-0.515
 tried-0.513
 stumbled-0.492
 could-0.489
Token.
Token ID13
Feature activation-1.34e-01

Pos logit contributions
 decided+0.037
 tried+0.032
Please+0.030
 stumbled+0.030
pped+0.030
Neg logit contributions
igsaw-0.055
aj-0.052
 difference-0.051
 coll-0.049
udd-0.049
Token She
Token ID1375
Feature activation+7.02e+00

Pos logit contributions
igsaw+0.005
aj+0.004
 difference+0.004
 coll+0.004
 former+0.004
Neg logit contributions
 decided-0.003
 came-0.002
 could-0.002
 tried-0.002
 stumbled-0.002
Token did
Token ID750
Feature activation+5.70e+00

Pos logit contributions
pped+0.097
 decided+0.096
ged+0.091
 stumbled+0.080
Please+0.080
Neg logit contributions
aj-0.241
igsaw-0.225
udd-0.222
 coll-0.222
 former-0.222
Token not
Token ID407
Feature activation+5.73e+00

Pos logit contributions
 decided+0.095
 came+0.076
 tried+0.075
 stumbled+0.072
pped+0.071
Neg logit contributions
igsaw-0.213
aj-0.203
 difference-0.195
 goals-0.190
 coll-0.189
Token away
Token ID1497
Feature activation+4.02e-01

Pos logit contributions
 decided+0.026
 tried+0.022
 came+0.022
 stumbled+0.021
pped+0.021
Neg logit contributions
igsaw-0.043
aj-0.041
 difference-0.039
 coll-0.038
 goals-0.038
Token,
Token ID11
Feature activation-4.85e-01

Pos logit contributions
 decided+0.008
 tried+0.007
pped+0.007
 came+0.007
 stumbled+0.006
Neg logit contributions
igsaw-0.013
aj-0.012
 difference-0.012
 former-0.012
 coll-0.012
Token a
Token ID257
Feature activation-7.90e+00

Pos logit contributions
igsaw+0.015
aj+0.015
 difference+0.014
 coll+0.013
 former+0.013
Neg logit contributions
 decided-0.010
 came-0.009
pped-0.009
 tried-0.009
 stumbled-0.008
Token bright
Token ID6016
Feature activation-1.18e+01

Pos logit contributions
igsaw+0.194
aj+0.183
udd+0.159
 goals+0.157
 difference+0.157
Neg logit contributions
 decided-0.239
 came-0.218
 tried-0.216
 could-0.206
 stumbled-0.205
Token blue
Token ID4171
Feature activation-1.36e+01

Pos logit contributions
igsaw+0.293
aj+0.287
udd+0.247
 former+0.243
 goals+0.243
Neg logit contributions
 decided-0.349
 came-0.324
 tried-0.318
 stumbled-0.296
 got-0.296
Token spe
Token ID693
Feature activation-1.79e+01

Pos logit contributions
igsaw+0.329
aj+0.303
 difference+0.283
 coll+0.271
 former+0.267
Neg logit contributions
 decided-0.409
 came-0.367
 tried-0.367
 stumbled-0.351
 could-0.350
Tokenck
Token ID694
Feature activation-4.41e+00

Pos logit contributions
igsaw+0.656
aj+0.624
 difference+0.593
 coll+0.575
 former+0.572
Neg logit contributions
 decided-0.318
 came-0.264
 tried-0.263
 stumbled-0.241
pped-0.238
Token in
Token ID287
Feature activation-2.80e+00

Pos logit contributions
igsaw+0.126
aj+0.117
 difference+0.117
 goals+0.117
 coll+0.117
Neg logit contributions
 decided-0.095
 tried-0.081
 stumbled-0.078
 could-0.076
Please-0.076
Token the
Token ID262
Feature activation-1.22e+01

Pos logit contributions
igsaw+0.091
aj+0.090
udd+0.082
 difference+0.080
uddle+0.079
Neg logit contributions
 decided-0.063
 came-0.054
 tried-0.054
 stumbled-0.049
pped-0.049
Token sky
Token ID6766
Feature activation+2.50e+00

Pos logit contributions
igsaw+0.299
aj+0.278
udd+0.240
ushes+0.234
 former+0.234
Neg logit contributions
 decided-0.380
 came-0.350
 tried-0.339
 got-0.330
 stumbled-0.324
Token.
Token ID13
Feature activation+1.98e-01

Pos logit contributions
 decided+0.046
 tried+0.039
 came+0.038
pped+0.037
 stumbled+0.036
Neg logit contributions
igsaw-0.087
aj-0.083
 difference-0.080
 coll-0.078
 former-0.076
Token highest
Token ID4511
Feature activation-8.62e+00

Pos logit contributions
igsaw+0.310
aj+0.307
ushes+0.234
 coll+0.231
udd+0.229
Neg logit contributions
 decided-0.392
 came-0.337
 could-0.334
 tried-0.331
 never-0.321
Token branches
Token ID13737
Feature activation-5.16e-01

Pos logit contributions
igsaw+0.241
aj+0.240
udd+0.201
 former+0.198
 coll+0.196
Neg logit contributions
 decided-0.239
 tried-0.217
 came-0.217
 could-0.208
 got-0.203
Token and
Token ID290
Feature activation+1.75e+00

Pos logit contributions
igsaw+0.015
aj+0.014
 difference+0.014
udd+0.014
 goals+0.014
Neg logit contributions
 decided-0.011
 tried-0.010
pped-0.010
 stumbled-0.009
 came-0.009
Token grab
Token ID5552
Feature activation-1.85e-01

Pos logit contributions
 decided+0.038
pped+0.036
 came+0.034
 tried+0.033
ged+0.032
Neg logit contributions
igsaw-0.052
 difference-0.048
aj-0.047
 former-0.046
 goals-0.046
Token the
Token ID262
Feature activation-1.28e+01

Pos logit contributions
igsaw+0.006
aj+0.006
 difference+0.005
 coll+0.005
 former+0.005
Neg logit contributions
 decided-0.004
 came-0.003
 tried-0.003
 stumbled-0.003
 could-0.003
Token ju
Token ID7544
Feature activation-1.78e+01

Pos logit contributions
igsaw+0.331
aj+0.310
udd+0.253
 difference+0.249
 coll+0.248
Neg logit contributions
 decided-0.408
 tried-0.367
 came-0.360
 could-0.359
 wanted-0.348
Tokenic
Token ID291
Feature activation-7.42e+00

Pos logit contributions
igsaw+0.363
aj+0.328
 difference+0.272
 coll+0.271
 goals+0.269
Neg logit contributions
 decided-0.583
pped-0.537
 tried-0.530
 came-0.518
 got-0.506
Tokeniest
Token ID6386
Feature activation-8.46e+00

Pos logit contributions
igsaw+0.148
 coll+0.119
 difference+0.118
opl+0.118
 biggest+0.117
Neg logit contributions
 decided-0.238
 stumbled-0.220
 tried-0.212
Please-0.212
 could-0.205
Token fruit
Token ID8234
Feature activation-2.14e+00

Pos logit contributions
igsaw+0.198
aj+0.182
 difference+0.161
 coll+0.154
udd+0.153
Neg logit contributions
 decided-0.270
 came-0.246
 tried-0.245
 stumbled-0.233
 could-0.232
Token.
Token ID13
Feature activation+5.23e-01

Pos logit contributions
igsaw+0.065
 difference+0.058
 biggest+0.057
 coll+0.057
 goals+0.057
Neg logit contributions
 decided-0.043
pped-0.041
 tried-0.040
 stumbled-0.038
 rushed-0.038
Token One
Token ID1881
Feature activation-3.94e+00

Pos logit contributions
 decided+0.010
 came+0.009
 could+0.008
 tried+0.008
 stumbled+0.007
Neg logit contributions
igsaw-0.019
aj-0.019
 coll-0.017
 former-0.017
 difference-0.017
Token garden
Token ID11376
Feature activation+2.70e+00

Pos logit contributions
igsaw+0.297
aj+0.273
 difference+0.232
 coll+0.228
udd+0.222
Neg logit contributions
 decided-0.409
 came-0.366
 tried-0.359
 could-0.354
 stumbled-0.348
Token,
Token ID11
Feature activation-4.04e-02

Pos logit contributions
 decided+0.048
 came+0.041
 tried+0.039
 stumbled+0.036
 could+0.036
Neg logit contributions
igsaw-0.101
aj-0.096
 difference-0.090
 coll-0.088
 former-0.087
Token and
Token ID290
Feature activation+2.77e+00

Pos logit contributions
aj+0.001
igsaw+0.001
 difference+0.001
 new+0.001
 former+0.001
Neg logit contributions
 decided-0.001
pped-0.001
 came-0.001
 could-0.001
 tried-0.001
Token picked
Token ID6497
Feature activation+1.42e+00

Pos logit contributions
 decided+0.040
pped+0.037
 came+0.032
 tried+0.031
Please+0.031
Neg logit contributions
igsaw-0.093
aj-0.091
 former-0.086
 difference-0.086
 new-0.085
Token the
Token ID262
Feature activation-1.28e+01

Pos logit contributions
 decided+0.036
 tried+0.032
 came+0.031
 stumbled+0.030
pped+0.030
Neg logit contributions
igsaw-0.040
aj-0.037
 difference-0.036
 coll-0.035
 goals-0.035
Token ju
Token ID7544
Feature activation-1.78e+01

Pos logit contributions
igsaw+0.311
aj+0.284
 difference+0.252
 coll+0.242
 former+0.242
Neg logit contributions
 decided-0.401
 came-0.359
 tried-0.359
 could-0.348
 stumbled-0.340
Tokenic
Token ID291
Feature activation-7.89e+00

Pos logit contributions
igsaw+0.365
aj+0.318
 difference+0.274
 goals+0.266
 coll+0.262
Neg logit contributions
 decided-0.601
pped-0.545
 tried-0.544
 came-0.544
 got-0.520
Tokeniest
Token ID6386
Feature activation-8.78e+00

Pos logit contributions
igsaw+0.147
 coll+0.117
 batch+0.115
 difference+0.114
 biggest+0.111
Neg logit contributions
 decided-0.266
 stumbled-0.255
Please-0.243
 tried-0.240
 could-0.231
Token carrots
Token ID34397
Feature activation-8.77e-01

Pos logit contributions
igsaw+0.203
aj+0.187
 difference+0.176
 coll+0.169
 goals+0.164
Neg logit contributions
 decided-0.272
 came-0.246
 tried-0.245
 stumbled-0.238
pped-0.236
Token he
Token ID339
Feature activation+6.60e+00

Pos logit contributions
igsaw+0.020
 new+0.019
 difference+0.018
 biggest+0.018
 coll+0.018
Neg logit contributions
 decided-0.020
Please-0.020
pped-0.020
 rushed-0.020
 stumbled-0.019
Token could
Token ID714
Feature activation+7.59e+00

Pos logit contributions
pped+0.077
 decided+0.066
ged+0.063
 rushed+0.055
 tried+0.051
Neg logit contributions
igsaw-0.226
aj-0.223
 new-0.222
 biggest-0.217
 coll-0.216
Token farmer
Token ID18739
Feature activation+2.04e+00

Pos logit contributions
igsaw+0.262
aj+0.251
 goals+0.226
 difference+0.225
 former+0.225
Neg logit contributions
 decided-0.246
 came-0.222
 tried-0.215
 could-0.203
 got-0.201
Token who
Token ID508
Feature activation+4.51e+00

Pos logit contributions
 decided+0.038
pped+0.033
 tried+0.033
 came+0.031
 stumbled+0.031
Neg logit contributions
igsaw-0.065
aj-0.064
 difference-0.061
 coll-0.060
udd-0.059
Token lived
Token ID5615
Feature activation-1.80e+00

Pos logit contributions
pped+0.111
ged+0.100
 decided+0.097
 stumbled+0.091
 tried+0.091
Neg logit contributions
igsaw-0.109
 difference-0.104
udd-0.100
 coll-0.100
aj-0.099
Token on
Token ID319
Feature activation-2.34e+00

Pos logit contributions
igsaw+0.068
aj+0.066
 difference+0.062
 coll+0.061
 former+0.061
Neg logit contributions
 decided-0.028
 came-0.023
 tried-0.023
 stumbled-0.021
 could-0.021
Token a
Token ID257
Feature activation-1.05e+01

Pos logit contributions
igsaw+0.086
aj+0.084
 difference+0.078
udd+0.077
uddle+0.076
Neg logit contributions
 decided-0.043
 came-0.036
 tried-0.035
 him-0.032
 stumbled-0.032
Token lush
Token ID37408
Feature activation-1.75e+01

Pos logit contributions
igsaw+0.202
aj+0.190
 difference+0.162
ushes+0.159
 goals+0.159
Neg logit contributions
 decided-0.373
 came-0.342
 tried-0.337
 got-0.326
 could-0.324
Token farm
Token ID5318
Feature activation-1.53e+00

Pos logit contributions
igsaw+0.426
aj+0.424
ushes+0.418
 fault+0.393
Whe+0.378
Neg logit contributions
 decided-0.489
 came-0.457
 tried-0.435
 got-0.424
 wanted-0.416
Token with
Token ID351
Feature activation-3.40e+00

Pos logit contributions
igsaw+0.057
aj+0.054
 difference+0.052
 coll+0.050
 former+0.050
Neg logit contributions
 decided-0.026
 came-0.022
 tried-0.021
 stumbled-0.020
 could-0.019
Token many
Token ID867
Feature activation-8.96e+00

Pos logit contributions
igsaw+0.097
aj+0.093
 difference+0.086
udd+0.081
 former+0.081
Neg logit contributions
 decided-0.093
 came-0.084
 tried-0.081
 could-0.077
 stumbled-0.075
Token animals
Token ID4695
Feature activation+9.36e-01

Pos logit contributions
igsaw+0.255
aj+0.228
 difference+0.202
 former+0.200
udd+0.190
Neg logit contributions
 decided-0.260
 came-0.230
 tried-0.225
 could-0.215
 wanted-0.214
Token and
Token ID290
Feature activation+4.39e-01

Pos logit contributions
 decided+0.020
pped+0.019
 tried+0.018
 stumbled+0.017
 rushed+0.017
Neg logit contributions
igsaw-0.026
aj-0.026
 goals-0.025
 difference-0.024
 coll-0.024
Token branch
Token ID8478
Feature activation-3.30e-01

Pos logit contributions
 decided+0.044
 tried+0.038
 came+0.038
 stumbled+0.035
 could+0.035
Neg logit contributions
igsaw-0.064
aj-0.062
 difference-0.057
 coll-0.055
udd-0.055
Token,
Token ID11
Feature activation+1.85e-01

Pos logit contributions
igsaw+0.012
aj+0.012
 difference+0.011
 former+0.011
 coll+0.011
Neg logit contributions
 decided-0.006
 came-0.005
 tried-0.005
 could-0.004
 stumbled-0.004
Token looking
Token ID2045
Feature activation+5.98e+00

Pos logit contributions
 decided+0.004
 came+0.003
pped+0.003
 tried+0.003
 could+0.003
Neg logit contributions
igsaw-0.005
aj-0.005
 difference-0.005
 coll-0.005
 goals-0.005
Token for
Token ID329
Feature activation+5.93e-01

Pos logit contributions
 decided+0.159
 came+0.141
 tried+0.141
 stumbled+0.134
 could+0.133
Neg logit contributions
igsaw-0.167
aj-0.156
 difference-0.146
 coll-0.141
 former-0.139
Token the
Token ID262
Feature activation-1.20e+01

Pos logit contributions
 decided+0.018
 came+0.016
 stumbled+0.016
 tried+0.016
pped+0.015
Neg logit contributions
igsaw-0.013
 difference-0.013
aj-0.012
 goals-0.012
 coll-0.012
Token ju
Token ID7544
Feature activation-1.73e+01

Pos logit contributions
igsaw+0.298
aj+0.281
udd+0.237
 difference+0.236
 coll+0.235
Neg logit contributions
 decided-0.371
 tried-0.333
 came-0.329
 could-0.317
 stumbled-0.312
Tokenic
Token ID291
Feature activation-7.33e+00

Pos logit contributions
igsaw+0.326
aj+0.295
 difference+0.254
 coll+0.245
 goals+0.240
Neg logit contributions
 decided-0.605
 tried-0.555
 came-0.552
pped-0.550
 stumbled-0.529
Tokeniest
Token ID6386
Feature activation-8.31e+00

Pos logit contributions
igsaw+0.139
 difference+0.107
 puzzle+0.107
opl+0.106
 batch+0.103
Neg logit contributions
 decided-0.241
 stumbled-0.227
Please-0.219
 tried-0.215
 could-0.213
Token ones
Token ID3392
Feature activation+1.46e+00

Pos logit contributions
igsaw+0.183
aj+0.166
 difference+0.161
 coll+0.151
 goals+0.148
Neg logit contributions
 decided-0.264
 came-0.240
 tried-0.238
pped-0.232
 stumbled-0.232
Token.
Token ID13
Feature activation+3.29e-01

Pos logit contributions
pped+0.029
 decided+0.028
 rushed+0.026
 tried+0.025
 stumbled+0.025
Neg logit contributions
igsaw-0.041
 new-0.040
 difference-0.038
 goals-0.038
aj-0.038
Token Finally
Token ID9461
Feature activation+4.01e+00

Pos logit contributions
 decided+0.007
 tried+0.007
 came+0.007
 could+0.006
 stumbled+0.006
Neg logit contributions
igsaw-0.011
aj-0.010
 difference-0.010
 coll-0.009
udd-0.009
Token play
Token ID711
Feature activation-3.34e-01

Pos logit contributions
 decided+0.178
 came+0.156
 stumbled+0.151
pped+0.151
ged+0.150
Neg logit contributions
igsaw-0.238
 difference-0.227
aj-0.226
 former-0.216
 goals-0.211
Token without
Token ID1231
Feature activation-5.65e-01

Pos logit contributions
igsaw+0.012
aj+0.011
 difference+0.010
 former+0.010
 coll+0.010
Neg logit contributions
 decided-0.007
 came-0.006
 tried-0.006
 could-0.005
 stumbled-0.005
Token going
Token ID1016
Feature activation+1.89e+00

Pos logit contributions
igsaw+0.016
aj+0.015
 difference+0.015
 coll+0.014
 goals+0.014
Neg logit contributions
 decided-0.014
 came-0.012
 could-0.012
 stumbled-0.012
pped-0.012
Token near
Token ID1474
Feature activation-3.28e+00

Pos logit contributions
 decided+0.041
 tried+0.036
 came+0.035
pped+0.034
 stumbled+0.034
Neg logit contributions
igsaw-0.059
aj-0.055
 difference-0.053
 goals-0.052
 coll-0.052
Token the
Token ID262
Feature activation-1.48e+01

Pos logit contributions
igsaw+0.107
aj+0.101
 difference+0.093
 coll+0.091
 former+0.091
Neg logit contributions
 decided-0.074
 came-0.064
 tried-0.064
 could-0.060
 stumbled-0.059
Token sm
Token ID895
Feature activation-1.72e+01

Pos logit contributions
igsaw+0.360
aj+0.348
udd+0.274
 former+0.273
 difference+0.269
Neg logit contributions
 decided-0.470
 tried-0.426
 came-0.422
 could-0.410
 stumbled-0.399
Tokenelly
Token ID6148
Feature activation-1.23e+01

Pos logit contributions
igsaw+0.294
aj+0.267
 difference+0.235
 coll+0.221
 former+0.217
Neg logit contributions
 decided-0.642
 came-0.591
 tried-0.589
 stumbled-0.568
pped-0.567
Token leaves
Token ID5667
Feature activation-1.48e+00

Pos logit contributions
igsaw+0.272
aj+0.252
 difference+0.219
 former+0.212
 coll+0.211
Neg logit contributions
 decided-0.403
 tried-0.369
 came-0.369
 stumbled-0.352
 could-0.351
Token.
Token ID13
Feature activation-2.87e-01

Pos logit contributions
igsaw+0.051
aj+0.048
 difference+0.047
 coll+0.045
 goals+0.045
Neg logit contributions
 decided-0.028
 tried-0.023
 came-0.023
 stumbled-0.022
pped-0.022
Token Tim
Token ID5045
Feature activation+1.60e+00

Pos logit contributions
igsaw+0.010
aj+0.009
 coll+0.009
 difference+0.009
udd+0.008
Neg logit contributions
 decided-0.006
 came-0.005
 tried-0.005
 could-0.005
 stumbled-0.005
Token and
Token ID290
Feature activation+1.51e+00

Pos logit contributions
 decided+0.028
 came+0.023
 tried+0.022
pped+0.021
 stumbled+0.021
Neg logit contributions
igsaw-0.058
aj-0.056
 difference-0.053
 coll-0.052
udd-0.051
Token
Token ID198
Feature activation-3.05e+00

Pos logit contributions
 decided+0.038
 came+0.032
 tried+0.032
 could+0.029
 stumbled+0.029
Neg logit contributions
igsaw-0.080
aj-0.076
 difference-0.072
 coll-0.070
 former-0.070
Token
Token ID198
Feature activation-3.59e+00

Pos logit contributions
igsaw+0.096
aj+0.090
 difference+0.086
 coll+0.083
udd+0.082
Neg logit contributions
 decided-0.072
 tried-0.062
 came-0.062
 stumbled-0.059
 could-0.058
TokenShe
Token ID3347
Feature activation+6.59e+00

Pos logit contributions
igsaw+0.111
aj+0.105
 difference+0.100
 coll+0.096
 former+0.095
Neg logit contributions
 decided-0.085
 tried-0.073
 came-0.073
 stumbled-0.069
 could-0.068
Token takes
Token ID2753
Feature activation+1.05e+00

Pos logit contributions
 decided+0.095
 came+0.076
 tried+0.075
pped+0.071
 stumbled+0.069
Neg logit contributions
igsaw-0.253
aj-0.246
 difference-0.233
 coll-0.228
 goals-0.226
Token the
Token ID262
Feature activation-1.35e+01

Pos logit contributions
 decided+0.020
 came+0.018
 tried+0.017
pped+0.015
 got+0.015
Neg logit contributions
aj-0.037
igsaw-0.037
 difference-0.035
udd-0.034
 former-0.033
Token v
Token ID410
Feature activation-1.71e+01

Pos logit contributions
igsaw+0.340
aj+0.325
udd+0.299
 former+0.286
 difference+0.278
Neg logit contributions
 decided-0.382
 tried-0.354
 came-0.353
 dropped-0.329
 could-0.329
Tokenase
Token ID589
Feature activation+2.38e-01

Pos logit contributions
igsaw+0.396
 difference+0.376
aj+0.352
 goals+0.339
 former+0.333
Neg logit contributions
 decided-0.505
 came-0.482
pped-0.459
 got-0.457
 could-0.454
Token from
Token ID422
Feature activation-3.92e+00

Pos logit contributions
 decided+0.004
 came+0.004
 tried+0.004
 stumbled+0.003
 could+0.003
Neg logit contributions
igsaw-0.009
aj-0.008
 difference-0.008
 coll-0.008
 former-0.008
Token them
Token ID606
Feature activation+3.01e+00

Pos logit contributions
igsaw+0.129
aj+0.125
 difference+0.116
udd+0.113
 coll+0.111
Neg logit contributions
 decided-0.084
 came-0.073
 tried-0.073
 stumbled-0.067
pped-0.067
Token and
Token ID290
Feature activation+1.36e+00

Pos logit contributions
 decided+0.055
 tried+0.046
 came+0.046
 stumbled+0.043
pped+0.043
Neg logit contributions
igsaw-0.107
aj-0.102
 difference-0.097
 coll-0.095
 former-0.093
Token puts
Token ID7584
Feature activation+1.28e+00

Pos logit contributions
 decided+0.033
pped+0.029
 came+0.028
 stumbled+0.028
 tried+0.027
Neg logit contributions
igsaw-0.036
aj-0.036
 coll-0.034
 difference-0.033
 goals-0.032
Token was
Token ID373
Feature activation+3.64e+00

Pos logit contributions
 decided+0.091
 came+0.071
 tried+0.070
pped+0.069
 stumbled+0.063
Neg logit contributions
igsaw-0.277
aj-0.273
 difference-0.256
 coll-0.253
udd-0.253
Token running
Token ID2491
Feature activation+2.58e+00

Pos logit contributions
 decided+0.079
 came+0.068
 tried+0.068
 stumbled+0.063
 could+0.063
Neg logit contributions
igsaw-0.121
aj-0.114
 difference-0.108
 coll-0.104
 former-0.103
Token around
Token ID1088
Feature activation+2.62e+00

Pos logit contributions
 decided+0.055
 came+0.047
 tried+0.047
pped+0.044
 stumbled+0.044
Neg logit contributions
igsaw-0.084
aj-0.079
 difference-0.075
 coll-0.073
 goals-0.072
Token on
Token ID319
Feature activation-5.94e-01

Pos logit contributions
 decided+0.050
 came+0.042
 tried+0.042
 stumbled+0.039
 could+0.039
Neg logit contributions
igsaw-0.093
aj-0.088
 difference-0.083
 coll-0.081
 former-0.080
Token the
Token ID262
Feature activation-1.29e+01

Pos logit contributions
aj+0.021
igsaw+0.021
 difference+0.019
udd+0.019
 former+0.019
Neg logit contributions
 decided-0.011
 came-0.010
 tried-0.010
 it-0.009
 stumbled-0.009
Token lush
Token ID37408
Feature activation-1.71e+01

Pos logit contributions
igsaw+0.331
aj+0.305
 former+0.276
ushes+0.269
 difference+0.263
Neg logit contributions
 decided-0.396
 came-0.348
 tried-0.344
 got-0.335
 could-0.330
Token grass
Token ID8701
Feature activation-1.55e+00

Pos logit contributions
aj+0.454
igsaw+0.449
ushes+0.434
 former+0.425
 fault+0.424
Neg logit contributions
 decided-0.451
 came-0.423
 tried-0.402
 got-0.392
 it-0.378
Token,
Token ID11
Feature activation-4.66e-01

Pos logit contributions
igsaw+0.054
aj+0.051
 difference+0.048
 coll+0.047
 former+0.046
Neg logit contributions
 decided-0.030
 came-0.026
 tried-0.026
 stumbled-0.024
pped-0.024
Token enjoying
Token ID13226
Feature activation-1.16e+00

Pos logit contributions
igsaw+0.015
aj+0.014
 difference+0.014
 coll+0.013
udd+0.013
Neg logit contributions
 decided-0.010
 came-0.008
 tried-0.008
pped-0.008
 could-0.008
Token the
Token ID262
Feature activation-1.17e+01

Pos logit contributions
igsaw+0.039
aj+0.037
 difference+0.034
 former+0.033
 coll+0.033
Neg logit contributions
 decided-0.025
 came-0.022
 tried-0.022
 stumbled-0.020
pped-0.020
Token sunshine
Token ID34488
Feature activation-5.95e-01

Pos logit contributions
igsaw+0.284
aj+0.261
ushes+0.214
 former+0.209
 batch+0.206
Neg logit contributions
 decided-0.378
 came-0.335
 tried-0.327
 wanted-0.319
 stumbled-0.310
Token what
Token ID644
Feature activation+2.72e+00

Pos logit contributions
igsaw+0.154
aj+0.147
 difference+0.132
udd+0.131
 coll+0.129
Neg logit contributions
 decided-0.116
 came-0.103
 tried-0.102
 could-0.095
 stumbled-0.094
Token the
Token ID262
Feature activation-1.10e+01

Pos logit contributions
 decided+0.042
 tried+0.034
 came+0.034
pped+0.032
 stumbled+0.031
Neg logit contributions
igsaw-0.104
aj-0.098
 difference-0.096
 coll-0.093
 goals-0.092
Token rocket
Token ID10701
Feature activation+1.36e+00

Pos logit contributions
igsaw+0.283
aj+0.271
udd+0.238
 coll+0.234
 difference+0.229
Neg logit contributions
 decided-0.320
 came-0.290
 tried-0.289
 could-0.273
 stumbled-0.272
Token discovered
Token ID5071
Feature activation+1.28e+00

Pos logit contributions
 decided+0.019
pped+0.019
Please+0.016
ged+0.015
 stumbled+0.015
Neg logit contributions
igsaw-0.047
 difference-0.046
aj-0.044
 coll-0.044
 puzzle-0.043
Token in
Token ID287
Feature activation-1.70e+00

Pos logit contributions
 decided+0.024
 tried+0.021
 came+0.020
 stumbled+0.019
pped+0.019
Neg logit contributions
igsaw-0.044
aj-0.041
 difference-0.040
 coll-0.038
 goals-0.038
Token outer
Token ID12076
Feature activation-1.71e+01

Pos logit contributions
igsaw+0.053
aj+0.052
udd+0.047
 difference+0.047
 coll+0.046
Neg logit contributions
 decided-0.040
 came-0.035
 tried-0.034
 stumbled-0.032
 could-0.032
Token space
Token ID2272
Feature activation-1.11e+00

Pos logit contributions
igsaw+0.371
aj+0.357
uddle+0.318
 batch+0.316
udd+0.303
Neg logit contributions
 decided-0.521
 came-0.483
 it-0.481
 could-0.477
 never-0.476
Token.
Token ID13
Feature activation+3.10e-03

Pos logit contributions
igsaw+0.039
aj+0.037
 difference+0.035
 coll+0.034
 former+0.034
Neg logit contributions
 decided-0.021
 tried-0.018
 came-0.018
 stumbled-0.017
pped-0.017
Token It
Token ID632
Feature activation+8.04e+00

Pos logit contributions
 decided+0.000
 came+0.000
 tried+0.000
 could+0.000
 stumbled+0.000
Neg logit contributions
igsaw-0.000
aj-0.000
 difference-0.000
 coll-0.000
udd-0.000
Token was
Token ID373
Feature activation+3.44e+00

Pos logit contributions
 decided+0.116
 tried+0.093
 came+0.093
pped+0.091
 stumbled+0.085
Neg logit contributions
igsaw-0.306
aj-0.293
 difference-0.286
 former-0.276
 coll-0.275
Token a
Token ID257
Feature activation-7.33e+00

Pos logit contributions
 decided+0.077
 came+0.067
 tried+0.066
 stumbled+0.062
 could+0.062
Neg logit contributions
igsaw-0.113
aj-0.107
 difference-0.099
 coll-0.097
udd-0.096
Token Alice
Token ID14862
Feature activation+1.95e+00

Pos logit contributions
 decided+0.082
 tried+0.071
 came+0.071
pped+0.068
 stumbled+0.068
Neg logit contributions
igsaw-0.106
aj-0.099
 difference-0.094
 coll-0.091
 former-0.089
Token saw
Token ID2497
Feature activation-5.38e-01

Pos logit contributions
 decided+0.030
pped+0.025
Please+0.025
ged+0.025
 tried+0.021
Neg logit contributions
igsaw-0.065
aj-0.065
udd-0.061
 difference-0.060
 batch-0.059
Token Lisa
Token ID15378
Feature activation+1.19e+00

Pos logit contributions
igsaw+0.019
aj+0.018
 difference+0.017
udd+0.016
 former+0.016
Neg logit contributions
 decided-0.011
 came-0.009
 tried-0.009
 could-0.008
pped-0.008
Token with
Token ID351
Feature activation-2.77e+00

Pos logit contributions
 decided+0.026
 tried+0.022
 came+0.022
pped+0.021
 stumbled+0.021
Neg logit contributions
igsaw-0.038
aj-0.035
 difference-0.034
 coll-0.033
 former-0.032
Token the
Token ID262
Feature activation-1.28e+01

Pos logit contributions
igsaw+0.090
aj+0.088
 difference+0.081
udd+0.080
 former+0.079
Neg logit contributions
 decided-0.060
 tried-0.054
 came-0.053
 could-0.049
 stumbled-0.049
Token y
Token ID331
Feature activation-1.70e+01

Pos logit contributions
igsaw+0.301
aj+0.276
 difference+0.239
 former+0.237
udd+0.236
Neg logit contributions
 decided-0.408
 came-0.374
 tried-0.371
 could-0.350
 got-0.348
Tokenummy
Token ID13513
Feature activation-9.16e+00

Pos logit contributions
igsaw+0.168
aj+0.150
 difference+0.114
 former+0.104
 coll+0.100
Neg logit contributions
 decided-0.739
 came-0.700
 tried-0.690
pped-0.669
 stumbled-0.668
Token cand
Token ID2658
Feature activation-5.41e+00

Pos logit contributions
igsaw+0.187
aj+0.170
 difference+0.155
 coll+0.147
 former+0.144
Neg logit contributions
 decided-0.312
 came-0.284
 tried-0.284
 stumbled-0.273
 could-0.272
Tokenies
Token ID444
Feature activation+3.53e+00

Pos logit contributions
igsaw+0.125
aj+0.116
 coll+0.105
udd+0.104
 difference+0.101
Neg logit contributions
 decided-0.163
Please-0.153
 stumbled-0.150
 tried-0.149
pped-0.146
Token,
Token ID11
Feature activation-4.39e-01

Pos logit contributions
 decided+0.082
pped+0.074
Please+0.072
 stumbled+0.070
 tried+0.070
Neg logit contributions
igsaw-0.098
 difference-0.092
aj-0.087
 goals-0.084
 new-0.083
Token she
Token ID673
Feature activation+6.52e+00

Pos logit contributions
igsaw+0.016
aj+0.015
 difference+0.014
 coll+0.014
 goals+0.014
Neg logit contributions
 decided-0.008
 tried-0.006
 came-0.006
pped-0.006
 stumbled-0.006
Token the
Token ID262
Feature activation-1.13e+01

Pos logit contributions
 decided+0.057
 tried+0.055
 came+0.053
 it+0.049
 him+0.048
Neg logit contributions
aj-0.133
igsaw-0.129
udd-0.126
uddle-0.123
umm-0.121
Token garden
Token ID11376
Feature activation+3.52e+00

Pos logit contributions
igsaw+0.289
aj+0.270
ushes+0.240
 goals+0.233
 difference+0.231
Neg logit contributions
 decided-0.349
 came-0.314
 tried-0.297
 got-0.292
Please-0.286
Token to
Token ID284
Feature activation+3.45e+00

Pos logit contributions
 decided+0.057
 came+0.048
 tried+0.046
 could+0.041
 stumbled+0.040
Neg logit contributions
igsaw-0.140
aj-0.133
 difference-0.125
 former-0.121
 coll-0.121
Token pick
Token ID2298
Feature activation+6.03e-01

Pos logit contributions
 decided+0.079
 came+0.068
pped+0.067
 tried+0.067
 stumbled+0.066
Neg logit contributions
igsaw-0.108
aj-0.097
 difference-0.097
 former-0.092
 coll-0.091
Token the
Token ID262
Feature activation-1.22e+01

Pos logit contributions
 decided+0.013
 came+0.012
 tried+0.011
 could+0.011
 got+0.010
Neg logit contributions
igsaw-0.020
aj-0.020
 former-0.018
 difference-0.018
 goals-0.017
Token ju
Token ID7544
Feature activation-1.70e+01

Pos logit contributions
igsaw+0.319
aj+0.274
 difference+0.245
 former+0.245
opl+0.232
Neg logit contributions
 decided-0.391
 came-0.353
 tried-0.344
 could-0.341
 got-0.328
Tokenic
Token ID291
Feature activation-6.63e+00

Pos logit contributions
igsaw+0.384
aj+0.331
 goals+0.294
 difference+0.289
 former+0.267
Neg logit contributions
 decided-0.535
 came-0.478
 tried-0.478
pped-0.477
 got-0.459
Tokeniest
Token ID6386
Feature activation-7.49e+00

Pos logit contributions
igsaw+0.132
 coll+0.111
 batch+0.109
 difference+0.105
 puzzle+0.101
Neg logit contributions
 decided-0.214
 stumbled-0.208
 tried-0.195
pped-0.192
Please-0.191
Token and
Token ID290
Feature activation+1.29e-01

Pos logit contributions
igsaw+0.153
aj+0.140
 difference+0.127
 coll+0.120
 former+0.119
Neg logit contributions
 decided-0.256
 came-0.233
 tried-0.233
 stumbled-0.224
 could-0.223
Token sweet
Token ID6029
Feature activation-8.90e+00

Pos logit contributions
 decided+0.004
 came+0.004
 tried+0.004
 could+0.004
 stumbled+0.004
Neg logit contributions
igsaw-0.003
aj-0.003
 difference-0.002
 coll-0.002
 former-0.002
Tokenest
Token ID395
Feature activation-7.24e+00

Pos logit contributions
igsaw+0.196
aj+0.179
 difference+0.166
 coll+0.159
 former+0.154
Neg logit contributions
 decided-0.288
 tried-0.261
 came-0.261
 stumbled-0.252
pped-0.251
Token"
Token ID1
Feature activation+4.47e-01

Pos logit contributions
igsaw+0.193
aj+0.180
 difference+0.174
 former+0.167
 coll+0.166
Neg logit contributions
 decided-0.129
 tried-0.110
 came-0.109
 could-0.102
 stumbled-0.102
TokenI
Token ID40
Feature activation+6.77e+00

Pos logit contributions
 decided+0.009
 tried+0.007
 came+0.007
 could+0.007
 stumbled+0.007
Neg logit contributions
igsaw-0.016
aj-0.015
 difference-0.015
 coll-0.014
 former-0.014
Token can
Token ID460
Feature activation+9.28e+00

Pos logit contributions
pped+0.110
 decided+0.106
ged+0.100
Please+0.099
 him+0.091
Neg logit contributions
aj-0.216
igsaw-0.211
udd-0.205
 former-0.200
 difference-0.195
Token wear
Token ID5806
Feature activation+2.87e+00

Pos logit contributions
 decided+0.203
 came+0.175
 stumbled+0.170
 tried+0.170
pped+0.163
Neg logit contributions
igsaw-0.285
aj-0.275
 difference-0.257
 coll-0.251
 goals-0.247
Token the
Token ID262
Feature activation-1.44e+01

Pos logit contributions
 decided+0.063
 came+0.056
 tried+0.055
 could+0.052
pped+0.051
Neg logit contributions
igsaw-0.094
aj-0.087
 difference-0.086
 former-0.082
udd-0.080
Token blue
Token ID4171
Feature activation-1.68e+01

Pos logit contributions
igsaw+0.403
 former+0.329
 difference+0.311
aj+0.310
opl+0.309
Neg logit contributions
 decided-0.419
 came-0.404
 tried-0.391
 could-0.374
 got-0.373
Token dress
Token ID6576
Feature activation-1.79e+00

Pos logit contributions
igsaw+0.462
aj+0.404
 difference+0.394
 former+0.390
 goals+0.373
Neg logit contributions
 decided-0.474
 came-0.446
 tried-0.425
 could-0.409
 got-0.407
Token under
Token ID739
Feature activation-1.74e+00

Pos logit contributions
igsaw+0.063
aj+0.060
 difference+0.058
 former+0.056
 goals+0.056
Neg logit contributions
 decided-0.034
 came-0.029
 tried-0.028
 could-0.027
 got-0.025
Token the
Token ID262
Feature activation-1.46e+01

Pos logit contributions
igsaw+0.056
aj+0.054
 difference+0.052
udd+0.051
 former+0.050
Neg logit contributions
 decided-0.039
 came-0.034
 tried-0.034
 it-0.032
pped-0.032
Token pink
Token ID11398
Feature activation-1.46e+01

Pos logit contributions
igsaw+0.404
 former+0.353
aj+0.352
 difference+0.321
udd+0.319
Neg logit contributions
 decided-0.405
 came-0.386
 tried-0.360
 could-0.359
 got-0.352
Token dress
Token ID6576
Feature activation-1.99e+00

Pos logit contributions
igsaw+0.386
aj+0.348
 difference+0.334
 former+0.330
 goals+0.315
Neg logit contributions
 decided-0.423
 came-0.395
 tried-0.380
 got-0.364
 could-0.363
Token had
Token ID550
Feature activation+6.55e+00

Pos logit contributions
ged+0.131
pped+0.122
ging+0.109
 decided+0.104
dd+0.100
Neg logit contributions
 favorite-0.201
 beds-0.200
aj-0.200
 goals-0.199
 puzzle-0.195
Token not
Token ID407
Feature activation+4.52e+00

Pos logit contributions
 decided+0.108
 came+0.089
 tried+0.087
pped+0.085
 stumbled+0.083
Neg logit contributions
igsaw-0.231
aj-0.231
 difference-0.217
 coll-0.214
 goals-0.212
Token fought
Token ID8350
Feature activation+3.66e+00

Pos logit contributions
 decided+0.073
pped+0.061
 came+0.059
 tried+0.058
 stumbled+0.056
Neg logit contributions
igsaw-0.159
aj-0.159
 difference-0.147
 coll-0.146
 former-0.144
Token over
Token ID625
Feature activation-2.39e-02

Pos logit contributions
 decided+0.071
 tried+0.060
 came+0.059
 stumbled+0.056
 could+0.055
Neg logit contributions
igsaw-0.126
aj-0.120
 difference-0.114
 coll-0.111
 goals-0.111
Token the
Token ID262
Feature activation-1.38e+01

Pos logit contributions
igsaw+0.001
aj+0.001
 difference+0.001
 coll+0.001
 goals+0.001
Neg logit contributions
 decided-0.001
 tried-0.000
 came-0.000
 stumbled-0.000
pped-0.000
Token blue
Token ID4171
Feature activation-1.67e+01

Pos logit contributions
igsaw+0.391
aj+0.329
udd+0.321
ushes+0.310
rig+0.294
Neg logit contributions
 decided-0.366
 came-0.350
 tried-0.339
 wanted-0.319
 could-0.316
Token t
Token ID256
Feature activation-1.17e+01

Pos logit contributions
igsaw+0.494
aj+0.446
 difference+0.415
 former+0.410
udd+0.409
Neg logit contributions
 decided-0.422
 came-0.400
 tried-0.374
 could-0.358
 got-0.353
Tokeneddy
Token ID21874
Feature activation-6.80e+00

Pos logit contributions
igsaw+0.185
aj+0.164
 difference+0.144
 coll+0.134
 former+0.131
Neg logit contributions
 decided-0.454
 came-0.419
 tried-0.419
 stumbled-0.405
 could-0.403
Token bear
Token ID6842
Feature activation+3.29e+00

Pos logit contributions
igsaw+0.196
aj+0.185
 difference+0.178
 coll+0.173
 former+0.164
Neg logit contributions
 decided-0.169
 tried-0.147
 came-0.145
 stumbled-0.144
pped-0.143
Token.
Token ID13
Feature activation-5.56e-02

Pos logit contributions
 decided+0.065
 tried+0.056
 stumbled+0.055
 came+0.055
pped+0.054
Neg logit contributions
igsaw-0.110
aj-0.103
 difference-0.101
 coll-0.098
 goals-0.097
Token They
Token ID1119
Feature activation+7.77e+00

Pos logit contributions
igsaw+0.002
aj+0.002
 difference+0.002
 coll+0.002
 former+0.002
Neg logit contributions
 decided-0.001
 came-0.001
 tried-0.001
 could-0.001
 stumbled-0.001
Token.
Token ID13
Feature activation+9.16e-01

Pos logit contributions
igsaw+0.061
aj+0.060
 difference+0.057
 coll+0.057
udd+0.055
Neg logit contributions
 decided-0.037
 tried-0.031
pped-0.031
 stumbled-0.030
 rushed-0.029
Token It
Token ID632
Feature activation+7.67e+00

Pos logit contributions
 decided+0.021
 came+0.019
 tried+0.018
 could+0.017
 stumbled+0.017
Neg logit contributions
igsaw-0.029
aj-0.028
 difference-0.026
 coll-0.025
 former-0.025
Token ran
Token ID4966
Feature activation+3.39e+00

Pos logit contributions
 decided+0.105
 tried+0.082
 came+0.081
pped+0.077
 stumbled+0.073
Neg logit contributions
igsaw-0.303
aj-0.292
 difference-0.280
 coll-0.272
udd-0.272
Token through
Token ID832
Feature activation-1.87e+00

Pos logit contributions
 decided+0.073
 came+0.063
 tried+0.062
 stumbled+0.059
pped+0.058
Neg logit contributions
igsaw-0.112
aj-0.105
 difference-0.100
 coll-0.097
 former-0.096
Token a
Token ID257
Feature activation-9.26e+00

Pos logit contributions
igsaw+0.059
aj+0.056
 difference+0.052
 coll+0.051
 former+0.050
Neg logit contributions
 decided-0.044
 came-0.038
 tried-0.038
 stumbled-0.036
 could-0.035
Token lush
Token ID37408
Feature activation-1.66e+01

Pos logit contributions
igsaw+0.224
aj+0.219
 former+0.188
 coll+0.179
 difference+0.179
Neg logit contributions
 decided-0.293
 came-0.267
 tried-0.263
 could-0.257
 got-0.251
Token green
Token ID4077
Feature activation-1.39e+01

Pos logit contributions
igsaw+0.430
aj+0.427
ushes+0.377
 fault+0.371
 former+0.370
Neg logit contributions
 decided-0.479
 came-0.441
 tried-0.429
 got-0.403
 wanted-0.400
Token me
Token ID502
Feature activation-1.61e+00

Pos logit contributions
igsaw+0.306
aj+0.286
 difference+0.253
 former+0.240
 coll+0.239
Neg logit contributions
 decided-0.463
 came-0.424
 tried-0.419
 stumbled-0.402
 could-0.398
Tokenadow
Token ID4584
Feature activation+5.94e-01

Pos logit contributions
igsaw+0.028
aj+0.026
udd+0.022
 difference+0.021
 goals+0.020
Neg logit contributions
 decided-0.059
 tried-0.054
 came-0.054
pped-0.053
 stumbled-0.052
Token,
Token ID11
Feature activation-1.42e+00

Pos logit contributions
 decided+0.012
pped+0.011
 tried+0.010
 came+0.010
Please+0.010
Neg logit contributions
igsaw-0.018
aj-0.017
 difference-0.016
udd-0.016
 goals-0.016
Token with
Token ID351
Feature activation-1.83e+00

Pos logit contributions
igsaw+0.045
aj+0.043
 difference+0.041
 coll+0.040
 former+0.039
Neg logit contributions
 decided-0.030
 came-0.026
 tried-0.026
pped-0.025
 stumbled-0.025
Token owner
Token ID4870
Feature activation+3.94e+00

Pos logit contributions
igsaw+0.101
aj+0.092
 difference+0.082
 former+0.080
udd+0.080
Neg logit contributions
 decided-0.116
 tried-0.105
 came-0.105
 could-0.099
 stumbled-0.098
Token always
Token ID1464
Feature activation+6.81e+00

Pos logit contributions
 decided+0.060
ged+0.054
pped+0.053
Please+0.051
 stumbled+0.051
Neg logit contributions
igsaw-0.128
udd-0.125
 difference-0.125
aj-0.124
 coll-0.118
Token gave
Token ID2921
Feature activation-7.33e-02

Pos logit contributions
pped+0.097
 decided+0.094
 stumbled+0.079
 possibly+0.078
ged+0.077
Neg logit contributions
 difference-0.228
aj-0.226
igsaw-0.225
 coll-0.222
 new-0.214
Token him
Token ID683
Feature activation+3.53e+00

Pos logit contributions
igsaw+0.003
 difference+0.003
aj+0.003
 coll+0.003
 goals+0.003
Neg logit contributions
 decided-0.001
 came-0.001
 stumbled-0.001
 tried-0.001
pped-0.001
Token the
Token ID262
Feature activation-1.19e+01

Pos logit contributions
 decided+0.077
 came+0.066
 tried+0.064
 could+0.061
 got+0.060
Neg logit contributions
igsaw-0.119
aj-0.114
 difference-0.105
 former-0.104
 coll-0.101
Token ju
Token ID7544
Feature activation-1.66e+01

Pos logit contributions
igsaw+0.286
aj+0.265
 difference+0.229
 former+0.228
 coll+0.226
Neg logit contributions
 decided-0.368
 tried-0.334
 came-0.331
 could-0.317
 stumbled-0.313
Tokenic
Token ID291
Feature activation-6.58e+00

Pos logit contributions
igsaw+0.303
aj+0.266
 difference+0.227
 goals+0.222
 coll+0.221
Neg logit contributions
 decided-0.592
 tried-0.544
 came-0.540
pped-0.535
 could-0.516
Tokeniest
Token ID6386
Feature activation-7.13e+00

Pos logit contributions
igsaw+0.135
 difference+0.107
 coll+0.106
 batch+0.102
ay+0.096
Neg logit contributions
 decided-0.210
 stumbled-0.206
 tried-0.187
Please-0.186
 could-0.184
Token sa
Token ID473
Feature activation-8.94e+00

Pos logit contributions
igsaw+0.175
 difference+0.162
aj+0.161
 coll+0.152
 batch+0.147
Neg logit contributions
 decided-0.206
 came-0.185
 stumbled-0.183
 tried-0.182
pped-0.180
Tokenus
Token ID385
Feature activation-2.45e+00

Pos logit contributions
igsaw+0.259
aj+0.240
udd+0.223
ummy+0.219
uddle+0.216
Neg logit contributions
 decided-0.233
 came-0.206
 stumbled-0.203
 tried-0.203
 rushed-0.193
Tokenages
Token ID1095
Feature activation+2.68e+00

Pos logit contributions
igsaw+0.045
aj+0.044
 coll+0.040
 difference+0.037
 biggest+0.036
Neg logit contributions
 decided-0.078
pped-0.072
 stumbled-0.072
 tried-0.072
 came-0.070
Token friend
Token ID1545
Feature activation+1.47e+00

Pos logit contributions
igsaw+0.123
aj+0.113
 difference+0.105
 coll+0.100
 former+0.098
Neg logit contributions
 decided-0.170
 came-0.154
 tried-0.153
 stumbled-0.148
pped-0.147
Token.
Token ID13
Feature activation+3.19e-01

Pos logit contributions
 decided+0.028
 tried+0.023
 came+0.023
pped+0.023
 stumbled+0.022
Neg logit contributions
igsaw-0.050
aj-0.047
 difference-0.046
 goals-0.044
 coll-0.044
Token They
Token ID1119
Feature activation+7.39e+00

Pos logit contributions
 decided+0.007
 came+0.006
 tried+0.006
 could+0.006
 stumbled+0.006
Neg logit contributions
igsaw-0.011
aj-0.010
 difference-0.009
 former-0.009
 coll-0.009
Token shared
Token ID4888
Feature activation+8.87e-01

Pos logit contributions
 decided+0.129
 came+0.110
pped+0.109
 tried+0.108
Please+0.106
Neg logit contributions
igsaw-0.244
aj-0.229
 difference-0.224
 coll-0.223
 goals-0.222
Token the
Token ID262
Feature activation-1.31e+01

Pos logit contributions
 decided+0.017
 came+0.015
 tried+0.014
 it+0.014
 could+0.013
Neg logit contributions
igsaw-0.033
aj-0.032
 former-0.029
 difference-0.029
udd-0.028
Token y
Token ID331
Feature activation-1.66e+01

Pos logit contributions
aj+0.318
igsaw+0.307
 former+0.296
udd+0.272
uddle+0.260
Neg logit contributions
 decided-0.386
 tried-0.372
 came-0.360
 had-0.348
 wanted-0.341
Tokenummy
Token ID13513
Feature activation-8.89e+00

Pos logit contributions
aj+0.177
igsaw+0.172
 difference+0.131
 former+0.126
 coll+0.115
Neg logit contributions
 decided-0.685
 came-0.643
 tried-0.637
pped-0.620
 got-0.613
Token paper
Token ID3348
Feature activation-2.22e+00

Pos logit contributions
aj+0.197
igsaw+0.197
 former+0.188
 difference+0.173
udd+0.170
Neg logit contributions
 decided-0.272
 came-0.259
 tried-0.255
 got-0.242
 could-0.237
Token together
Token ID1978
Feature activation+1.27e+00

Pos logit contributions
igsaw+0.083
aj+0.083
 former+0.077
 difference+0.075
 goals+0.074
Neg logit contributions
 decided-0.034
 came-0.029
 tried-0.026
 could-0.025
 got-0.024
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
 decided+0.023
 came+0.019
 tried+0.019
 stumbled+0.017
 could+0.017
Neg logit contributions
igsaw-0.047
aj-0.044
 difference-0.042
 coll-0.041
 former-0.041
Token had
Token ID550
Feature activation+5.98e+00

Pos logit contributions
 decided+0.018
 came+0.015
 tried+0.015
 stumbled+0.014
pped+0.013
Neg logit contributions
igsaw-0.038
aj-0.036
 difference-0.035
 coll-0.034
 former-0.033
Token a
Token ID257
Feature activation-9.04e+00

Pos logit contributions
igsaw+0.004
aj+0.004
 difference+0.004
 coll+0.004
 former+0.004
Neg logit contributions
 decided-0.004
 came-0.003
 tried-0.003
 stumbled-0.003
 could-0.003
Token while
Token ID981
Feature activation+8.37e-01

Pos logit contributions
igsaw+0.276
aj+0.261
 difference+0.245
 coll+0.237
 former+0.236
Neg logit contributions
 decided-0.215
 came-0.189
 tried-0.188
 stumbled-0.177
 could-0.177
Token,
Token ID11
Feature activation-5.00e-01

Pos logit contributions
 decided+0.019
 tried+0.016
pped+0.016
 came+0.016
 stumbled+0.015
Neg logit contributions
igsaw-0.025
aj-0.023
 difference-0.023
 coll-0.022
 goals-0.022
Token enjoying
Token ID13226
Feature activation-1.92e+00

Pos logit contributions
igsaw+0.016
aj+0.015
 difference+0.015
 coll+0.014
 former+0.014
Neg logit contributions
 decided-0.010
 came-0.009
 tried-0.009
pped-0.009
 could-0.008
Token the
Token ID262
Feature activation-1.24e+01

Pos logit contributions
igsaw+0.061
aj+0.057
 difference+0.054
 coll+0.052
 former+0.051
Neg logit contributions
 decided-0.045
 came-0.039
 tried-0.039
 stumbled-0.036
 could-0.036
Token y
Token ID331
Feature activation-1.65e+01

Pos logit contributions
igsaw+0.337
aj+0.313
 former+0.267
ushes+0.266
 coll+0.265
Neg logit contributions
 decided-0.353
 came-0.318
 tried-0.315
 got-0.300
 wanted-0.294
Tokenummy
Token ID13513
Feature activation-8.74e+00

Pos logit contributions
igsaw+0.160
aj+0.140
 difference+0.102
 coll+0.096
 former+0.091
Neg logit contributions
 decided-0.725
 came-0.680
 tried-0.674
pped-0.654
 stumbled-0.653
Token treats
Token ID18432
Feature activation+1.87e+00

Pos logit contributions
igsaw+0.219
aj+0.203
 difference+0.190
 coll+0.181
 former+0.177
Neg logit contributions
 decided-0.257
 came-0.230
 tried-0.229
 stumbled-0.220
pped-0.219
Token.
Token ID13
Feature activation-8.47e-02

Pos logit contributions
 decided+0.038
 tried+0.034
pped+0.033
 stumbled+0.033
 came+0.032
Neg logit contributions
igsaw-0.060
aj-0.055
 difference-0.054
 coll-0.053
 batch-0.052
Token
Token ID220
Feature activation-2.22e+00

Pos logit contributions
igsaw+0.002
aj+0.002
 difference+0.002
 coll+0.002
 former+0.002
Neg logit contributions
 decided-0.002
 came-0.002
 tried-0.002
 stumbled-0.002
pped-0.002
Token
Token ID198
Feature activation-5.08e+00

Pos logit contributions
igsaw+0.077
aj+0.071
 difference+0.070
 coll+0.069
 former+0.068
Neg logit contributions
 decided-0.042
 came-0.037
 could-0.036
 tried-0.035
 stumbled-0.034
Token it
Token ID340
Feature activation+4.71e+00

Pos logit contributions
 decided+0.008
 tried+0.007
 came+0.007
 stumbled+0.007
 rushed+0.006
Neg logit contributions
igsaw-0.020
aj-0.019
 coll-0.019
 difference-0.018
 goals-0.018
Token up
Token ID510
Feature activation-1.46e+00

Pos logit contributions
 decided+0.094
 came+0.080
 tried+0.079
 could+0.074
 stumbled+0.073
Neg logit contributions
igsaw-0.163
aj-0.156
 difference-0.146
 coll-0.143
 former-0.141
Token and
Token ID290
Feature activation+1.98e+00

Pos logit contributions
igsaw+0.053
aj+0.050
 difference+0.048
 coll+0.046
 former+0.046
Neg logit contributions
 decided-0.027
 came-0.022
 tried-0.022
 stumbled-0.020
 could-0.020
Token enjoy
Token ID2883
Feature activation+8.25e-01

Pos logit contributions
 decided+0.038
pped+0.033
 came+0.033
 tried+0.032
 stumbled+0.032
Neg logit contributions
igsaw-0.065
aj-0.061
 difference-0.061
 coll-0.059
 former-0.058
Token the
Token ID262
Feature activation-1.24e+01

Pos logit contributions
 decided+0.013
 tried+0.010
 came+0.010
 stumbled+0.009
pped+0.009
Neg logit contributions
igsaw-0.032
aj-0.030
 difference-0.029
 coll-0.029
 goals-0.028
Token y
Token ID331
Feature activation-1.65e+01

Pos logit contributions
igsaw+0.305
aj+0.301
 former+0.251
udd+0.250
ushes+0.247
Neg logit contributions
 decided-0.368
 came-0.345
 tried-0.335
 had-0.316
 got-0.316
Tokenummy
Token ID13513
Feature activation-8.77e+00

Pos logit contributions
aj+0.163
igsaw+0.162
 difference+0.114
 former+0.108
 goals+0.104
Neg logit contributions
 decided-0.697
 came-0.658
 tried-0.648
pped-0.630
 got-0.628
Token taste
Token ID6938
Feature activation-6.99e-01

Pos logit contributions
igsaw+0.182
aj+0.177
 former+0.160
udd+0.150
 difference+0.149
Neg logit contributions
 decided-0.286
 came-0.272
 tried-0.267
 got-0.254
 could-0.249
Token.
Token ID13
Feature activation-6.67e-02

Pos logit contributions
igsaw+0.023
aj+0.021
 difference+0.020
 coll+0.020
 goals+0.019
Neg logit contributions
 decided-0.015
 tried-0.013
 came-0.013
 stumbled-0.013
pped-0.012
Token<|endoftext|>
Token ID50256
Feature activation-4.27e+00

Pos logit contributions
aj+0.002
igsaw+0.002
 coll+0.002
 difference+0.002
 former+0.002
Neg logit contributions
 decided-0.001
 could-0.001
 came-0.001
 tried-0.001
 stumbled-0.001
TokenOne
Token ID3198
Feature activation-5.11e+00

Pos logit contributions
igsaw+0.112
aj+0.104
 difference+0.097
 coll+0.092
 goals+0.091
Neg logit contributions
 decided-0.120
 tried-0.107
 came-0.106
 stumbled-0.101
pped-0.101
TokenOnce
Token ID7454
Feature activation+5.47e-01

Pos logit contributions
igsaw+0.107
aj+0.099
 difference+0.093
 coll+0.088
 goals+0.087
Neg logit contributions
 decided-0.114
 tried-0.102
 came-0.102
 stumbled-0.097
pped-0.096
Token upon
Token ID2402
Feature activation-2.42e+00

Pos logit contributions
 decided+0.013
 came+0.013
 could+0.012
 tried+0.012
 got+0.011
Neg logit contributions
igsaw-0.016
 former-0.015
aj-0.015
 coll-0.015
 difference-0.015
Token a
Token ID257
Feature activation-6.93e+00

Pos logit contributions
igsaw+0.061
aj+0.057
 goals+0.053
 new+0.053
 difference+0.053
Neg logit contributions
 decided-0.063
 came-0.056
 tried-0.056
pped-0.055
 stumbled-0.055
Token time
Token ID640
Feature activation+3.96e+00

Pos logit contributions
igsaw+0.214
aj+0.192
 former+0.188
 coll+0.187
 difference+0.187
Neg logit contributions
 decided-0.157
 came-0.146
 tried-0.145
 stumbled-0.134
 got-0.134
Token,
Token ID11
Feature activation+1.20e+00

Pos logit contributions
 decided+0.073
 came+0.068
 tried+0.064
 could+0.062
 stumbled+0.060
Neg logit contributions
igsaw-0.139
aj-0.134
 former-0.132
 difference-0.131
 coll-0.129
Token there
Token ID612
Feature activation+5.33e+00

Pos logit contributions
 decided+0.020
 came+0.016
 tried+0.016
pped+0.015
 stumbled+0.015
Neg logit contributions
igsaw-0.045
aj-0.043
 difference-0.040
 goals-0.039
 coll-0.039
Token was
Token ID373
Feature activation+2.78e+00

Pos logit contributions
 decided+0.081
 tried+0.064
 came+0.064
pped+0.059
 stumbled+0.058
Neg logit contributions
igsaw-0.207
aj-0.197
 difference-0.189
 coll-0.183
 goals-0.182
Token a
Token ID257
Feature activation-7.13e+00

Pos logit contributions
 decided+0.055
 tried+0.045
 came+0.045
 stumbled+0.041
pped+0.041
Neg logit contributions
igsaw-0.099
aj-0.096
 difference-0.092
 former-0.087
udd-0.087
Token happy
Token ID3772
Feature activation-6.55e+00

Pos logit contributions
igsaw+0.177
aj+0.164
 difference+0.149
 former+0.145
 goals+0.142
Neg logit contributions
 decided-0.214
 came-0.194
 tried-0.192
 could-0.183
 stumbled-0.181
Token little
Token ID1310
Feature activation-7.25e+00

Pos logit contributions
igsaw+0.184
aj+0.179
 goals+0.160
 difference+0.159
 former+0.159
Neg logit contributions
 decided-0.172
 came-0.160
 tried-0.151
 got-0.142
 wanted-0.141
Token pengu
Token ID38373
Feature activation-7.13e+00

Pos logit contributions
igsaw+0.198
aj+0.193
 former+0.173
 difference+0.171
 goals+0.168
Neg logit contributions
 decided-0.196
 came-0.179
 tried-0.175
 could-0.164
 got-0.162
Token big
Token ID1263
Feature activation-7.41e+00

Pos logit contributions
 decided+0.058
 came+0.053
 tried+0.052
 could+0.051
 stumbled+0.050
Neg logit contributions
igsaw-0.043
aj-0.040
 difference-0.036
 coll-0.035
 former-0.035
Token and
Token ID290
Feature activation-1.08e-01

Pos logit contributions
igsaw+0.205
aj+0.193
 difference+0.192
 goals+0.185
 coll+0.184
Neg logit contributions
 decided-0.172
 stumbled-0.152
 tried-0.151
 rushed-0.150
pped-0.148
Token blue
Token ID4171
Feature activation-1.07e+01

Pos logit contributions
igsaw+0.003
aj+0.003
 difference+0.002
 coll+0.002
 former+0.002
Neg logit contributions
 decided-0.003
 came-0.003
 tried-0.003
 stumbled-0.003
 could-0.003
Token!
Token ID0
Feature activation+6.88e-01

Pos logit contributions
igsaw+0.318
aj+0.299
 difference+0.294
 goals+0.290
 coll+0.278
Neg logit contributions
 decided-0.239
 tried-0.211
 came-0.206
 rushed-0.205
pped-0.203
Token Can
Token ID1680
Feature activation+4.29e+00

Pos logit contributions
 decided+0.015
 came+0.013
 tried+0.013
 could+0.013
 stumbled+0.012
Neg logit contributions
igsaw-0.022
aj-0.021
 difference-0.020
 coll-0.019
 former-0.019
Token we
Token ID356
Feature activation+9.11e+00

Pos logit contributions
 decided+0.101
 came+0.087
 tried+0.086
pped+0.083
 stumbled+0.083
Neg logit contributions
igsaw-0.128
aj-0.121
 difference-0.114
 coll-0.109
 goals-0.109
Token go
Token ID467
Feature activation+4.07e+00

Pos logit contributions
ged+0.237
 decided+0.231
pped+0.219
 stumbled+0.202
 rushed+0.201
Neg logit contributions
 difference-0.221
igsaw-0.213
 goal-0.194
 biggest-0.193
aj-0.192
Token near
Token ID1474
Feature activation-1.99e+00

Pos logit contributions
 decided+0.090
 rushed+0.079
pped+0.078
 tried+0.077
 stumbled+0.076
Neg logit contributions
aj-0.110
igsaw-0.106
 new-0.105
 difference-0.103
uddle-0.102
Token it
Token ID340
Feature activation+5.64e+00

Pos logit contributions
igsaw+0.070
aj+0.067
 difference+0.064
 coll+0.062
 goals+0.061
Neg logit contributions
 decided-0.038
 came-0.032
 tried-0.031
 stumbled-0.029
 could-0.029
Token?"
Token ID1701
Feature activation-2.55e-01

Pos logit contributions
 decided+0.131
 came+0.116
 tried+0.113
 could+0.108
 stumbled+0.106
Neg logit contributions
igsaw-0.180
aj-0.169
 difference-0.158
 coll-0.155
 former-0.154
Token
Token ID198
Feature activation-5.03e+00

Pos logit contributions
igsaw+0.008
aj+0.007
 difference+0.007
 coll+0.007
 former+0.007
Neg logit contributions
 decided-0.006
 came-0.005
 tried-0.005
 could-0.005
 stumbled-0.005
Token And
Token ID843
Feature activation+1.15e+00

Pos logit contributions
 decided+0.004
 came+0.003
 could+0.003
 tried+0.003
 stumbled+0.003
Neg logit contributions
aj-0.006
igsaw-0.006
 coll-0.005
 difference-0.005
udd-0.005
Token every
Token ID790
Feature activation-4.22e+00

Pos logit contributions
 decided+0.022
 came+0.018
 tried+0.018
 could+0.017
 stumbled+0.017
Neg logit contributions
igsaw-0.042
aj-0.040
 difference-0.037
 coll-0.036
udd-0.036
Token time
Token ID640
Feature activation+3.33e+00

Pos logit contributions
igsaw+0.104
aj+0.097
 difference+0.089
 coll+0.087
udd+0.085
Neg logit contributions
 decided-0.127
 came-0.116
 tried-0.115
 stumbled-0.110
 could-0.109
Token she
Token ID673
Feature activation+6.70e+00

Pos logit contributions
 decided+0.055
 tried+0.044
 came+0.044
pped+0.041
 stumbled+0.041
Neg logit contributions
igsaw-0.125
aj-0.119
 difference-0.114
 coll-0.111
 former-0.110
Token found
Token ID1043
Feature activation+2.50e+00

Pos logit contributions
 decided+0.099
pped+0.093
ged+0.084
Please+0.080
 tried+0.076
Neg logit contributions
aj-0.226
igsaw-0.224
 difference-0.216
 former-0.213
udd-0.212
Token it
Token ID340
Feature activation+5.29e+00

Pos logit contributions
 decided+0.053
 came+0.046
 tried+0.046
 could+0.043
 stumbled+0.042
Neg logit contributions
igsaw-0.084
aj-0.080
 difference-0.074
 coll-0.072
 former-0.072
Token,
Token ID11
Feature activation-4.28e-01

Pos logit contributions
 decided+0.112
 tried+0.096
 came+0.095
pped+0.092
 stumbled+0.090
Neg logit contributions
igsaw-0.171
aj-0.161
 difference-0.154
 coll-0.148
 goals-0.147
Token she
Token ID673
Feature activation+6.56e+00

Pos logit contributions
igsaw+0.016
aj+0.016
 difference+0.015
 coll+0.015
 former+0.015
Neg logit contributions
 decided-0.007
 tried-0.005
 came-0.005
pped-0.005
 stumbled-0.005
Token'd
Token ID1549
Feature activation+9.68e+00

Pos logit contributions
 decided+0.084
pped+0.081
ged+0.070
 tried+0.067
Please+0.066
Neg logit contributions
aj-0.233
igsaw-0.227
 difference-0.226
 former-0.222
 goals-0.220
Token wave
Token ID6769
Feature activation+1.32e+00

Pos logit contributions
 decided+0.152
pped+0.137
ged+0.131
Please+0.123
 tried+0.122
Neg logit contributions
igsaw-0.333
 difference-0.324
aj-0.311
 former-0.308
 goals-0.305
Token and
Token ID290
Feature activation+9.11e-01

Pos logit contributions
 decided+0.029
 tried+0.025
 came+0.025
pped+0.024
 stumbled+0.024
Neg logit contributions
igsaw-0.041
aj-0.039
 difference-0.037
 coll-0.036
 goals-0.036
Token in
Token ID287
Feature activation-1.61e+00

Pos logit contributions
 decided+0.053
pped+0.050
 tried+0.046
 stumbled+0.045
 came+0.043
Neg logit contributions
igsaw-0.078
aj-0.072
 difference-0.072
 coll-0.070
 new-0.070
Token the
Token ID262
Feature activation-1.18e+01

Pos logit contributions
igsaw+0.050
aj+0.047
 difference+0.044
 coll+0.043
 former+0.042
Neg logit contributions
 decided-0.038
 came-0.034
 tried-0.034
 stumbled-0.032
 could-0.031
Token garden
Token ID11376
Feature activation+2.91e+00

Pos logit contributions
igsaw+0.292
aj+0.276
 coll+0.235
 goals+0.233
 batch+0.229
Neg logit contributions
 decided-0.377
 came-0.347
 tried-0.324
 got-0.319
 stumbled-0.318
Token.
Token ID13
Feature activation+9.23e-01

Pos logit contributions
 decided+0.050
 came+0.041
 tried+0.041
 stumbled+0.037
 could+0.037
Neg logit contributions
igsaw-0.108
aj-0.103
 difference-0.098
 coll-0.096
 former-0.095
Token But
Token ID887
Feature activation+3.82e+00

Pos logit contributions
 decided+0.023
 came+0.020
 tried+0.020
 stumbled+0.019
 could+0.019
Neg logit contributions
igsaw-0.027
aj-0.026
 difference-0.024
 coll-0.024
 former-0.023
Token she
Token ID673
Feature activation+7.35e+00

Pos logit contributions
 decided+0.070
 came+0.059
 tried+0.057
pped+0.057
 rushed+0.054
Neg logit contributions
igsaw-0.126
aj-0.119
 difference-0.113
 former-0.111
 goals-0.111
Token couldn
Token ID3521
Feature activation+5.35e+00

Pos logit contributions
ged+0.104
pped+0.097
Please+0.096
 decided+0.091
 rushed+0.084
Neg logit contributions
aj-0.213
uddle-0.207
udd-0.204
 favourite-0.191
 favorite-0.189
Token't
Token ID470
Feature activation+5.41e+00

Pos logit contributions
 decided+0.084
 tried+0.064
 came+0.062
ged+0.060
 stumbled+0.058
Neg logit contributions
igsaw-0.209
aj-0.186
 difference-0.180
 puzzle-0.175
opl-0.174
Token find
Token ID1064
Feature activation+4.72e+00

Pos logit contributions
ged+0.155
pped+0.150
 decided+0.148
 came+0.139
dd+0.138
Neg logit contributions
igsaw-0.106
 difference-0.100
 former-0.086
 biggest-0.085
 new-0.085
Token the
Token ID262
Feature activation-1.17e+01

Pos logit contributions
 decided+0.097
 came+0.081
 tried+0.081
 stumbled+0.077
pped+0.076
Neg logit contributions
igsaw-0.160
aj-0.149
 difference-0.144
 coll-0.139
 goals-0.137
Token black
Token ID2042
Feature activation-1.43e+01

Pos logit contributions
aj+0.287
igsaw+0.282
udd+0.224
 former+0.219
 goals+0.214
Neg logit contributions
 decided-0.377
 came-0.352
 tried-0.341
 could-0.329
 wanted-0.325
Token saw
Token ID2497
Feature activation-5.00e-01

Pos logit contributions
 decided+0.035
pped+0.034
 ever+0.031
ged+0.031
Please+0.029
Neg logit contributions
aj-0.070
udd-0.066
igsaw-0.061
uddle-0.061
 coll-0.061
Token a
Token ID257
Feature activation-9.23e+00

Pos logit contributions
igsaw+0.017
aj+0.016
 difference+0.015
 former+0.015
 coll+0.015
Neg logit contributions
 decided-0.011
 came-0.009
 tried-0.009
 stumbled-0.008
 could-0.008
Token blue
Token ID4171
Feature activation-1.43e+01

Pos logit contributions
igsaw+0.206
aj+0.192
 difference+0.172
 former+0.167
 goals+0.166
Neg logit contributions
 decided-0.295
 came-0.272
 tried-0.268
 could-0.256
 stumbled-0.256
Token potato
Token ID21219
Feature activation-1.32e+00

Pos logit contributions
igsaw+0.309
aj+0.276
 difference+0.259
 coll+0.248
 former+0.237
Neg logit contributions
 decided-0.471
 tried-0.427
 came-0.424
 stumbled-0.412
 could-0.411
Token.
Token ID13
Feature activation+7.56e-01

Pos logit contributions
 puzzle+0.034
igsaw+0.034
 coll+0.033
 batch+0.033
 difference+0.033
Neg logit contributions
 decided-0.028
 tried-0.026
 managed-0.026
 rushed-0.025
 stumbled-0.025
Token She
Token ID1375
Feature activation+6.78e+00

Pos logit contributions
 decided+0.016
 came+0.014
 tried+0.014
 could+0.013
 stumbled+0.013
Neg logit contributions
igsaw-0.025
aj-0.024
 difference-0.023
 former-0.022
 coll-0.022
Token thought
Token ID1807
Feature activation+4.93e+00

Pos logit contributions
pped+0.081
 decided+0.068
ged+0.063
 ever+0.061
 came+0.057
Neg logit contributions
aj-0.246
 new-0.226
 coll-0.221
igsaw-0.221
udd-0.220
Token it
Token ID340
Feature activation+6.96e+00

Pos logit contributions
 decided+0.059
 tried+0.040
 stumbled+0.039
 came+0.038
pped+0.037
Neg logit contributions
igsaw-0.207
aj-0.193
 coll-0.187
 difference-0.186
 goals-0.185
Token was
Token ID373
Feature activation+3.99e+00

Pos logit contributions
 decided+0.121
pped+0.121
ged+0.105
 rushed+0.099
Please+0.098
Neg logit contributions
igsaw-0.206
 new-0.206
 biggest-0.205
 difference-0.200
udd-0.198
Token very
Token ID845
Feature activation+9.73e-01

Pos logit contributions
 decided+0.113
pped+0.108
ged+0.106
 stumbled+0.104
 tried+0.103
Neg logit contributions
igsaw-0.090
 new-0.084
aj-0.083
 difference-0.082
 coll-0.082
Token pretty
Token ID2495
Feature activation-6.58e+00

Pos logit contributions
 decided+0.038
pped+0.037
 came+0.036
 stumbled+0.036
 tried+0.036
Neg logit contributions
igsaw-0.013
aj-0.011
 new-0.010
 difference-0.010
 coll-0.009
Token.
Token ID13
Feature activation+6.23e-01

Pos logit contributions
 decided+0.024
 came+0.021
 tried+0.020
pped+0.020
 stumbled+0.020
Neg logit contributions
igsaw-0.037
aj-0.035
 coll-0.033
 difference-0.033
 goals-0.033
Token Lily
Token ID20037
Feature activation+2.48e+00

Pos logit contributions
 decided+0.014
 came+0.012
 tried+0.012
 could+0.012
 got+0.011
Neg logit contributions
igsaw-0.021
aj-0.019
 difference-0.019
 former-0.018
 coll-0.018
Token's
Token ID338
Feature activation-3.87e+00

Pos logit contributions
 decided+0.039
 tried+0.032
 came+0.032
 could+0.029
 stumbled+0.028
Neg logit contributions
igsaw-0.098
aj-0.093
 difference-0.089
 former-0.086
 coll-0.086
Token cube
Token ID23441
Feature activation+1.19e+00

Pos logit contributions
igsaw+0.106
aj+0.099
 difference+0.091
 former+0.091
 coll+0.089
Neg logit contributions
 decided-0.107
 tried-0.096
 came-0.096
 could-0.091
 got-0.089
Token blinked
Token ID40360
Feature activation-6.34e-01

Pos logit contributions
 decided+0.017
 came+0.014
 tried+0.014
pped+0.012
 stumbled+0.012
Neg logit contributions
igsaw-0.047
aj-0.045
 difference-0.043
 coll-0.042
udd-0.042
Token too
Token ID1165
Feature activation+4.76e+00

Pos logit contributions
igsaw+0.023
aj+0.022
 difference+0.021
 coll+0.020
 former+0.020
Neg logit contributions
 decided-0.011
 came-0.009
 tried-0.009
 stumbled-0.009
pped-0.009
Token,
Token ID11
Feature activation+4.23e-01

Pos logit contributions
 decided+0.110
 came+0.096
 tried+0.096
 stumbled+0.091
pped+0.091
Neg logit contributions
igsaw-0.149
aj-0.139
 difference-0.132
 coll-0.128
 goals-0.127
Token which
Token ID543
Feature activation+3.11e+00

Pos logit contributions
 decided+0.009
pped+0.008
 came+0.008
 stumbled+0.008
 tried+0.008
Neg logit contributions
igsaw-0.012
aj-0.012
 coll-0.011
 difference-0.011
udd-0.011
Token made
Token ID925
Feature activation-2.31e-01

Pos logit contributions
 decided+0.043
 came+0.034
 tried+0.034
 could+0.030
 stumbled+0.030
Neg logit contributions
igsaw-0.128
aj-0.123
 difference-0.117
 coll-0.114
 former-0.114
Token her
Token ID607
Feature activation-4.79e+00

Pos logit contributions
igsaw+0.009
aj+0.008
 difference+0.008
 former+0.008
 coll+0.008
Neg logit contributions
 decided-0.004
 came-0.003
 tried-0.003
 could-0.003
pped-0.003
Token laugh
Token ID6487
Feature activation-1.04e+00

Pos logit contributions
igsaw+0.143
aj+0.135
 difference+0.123
 former+0.121
 coll+0.120
Neg logit contributions
 decided-0.121
 tried-0.107
 came-0.107
 could-0.101
 stumbled-0.099
Token dish
Token ID9433
Feature activation-2.31e+00

Pos logit contributions
igsaw+0.260
aj+0.246
 difference+0.220
 former+0.214
 goals+0.213
Neg logit contributions
 decided-0.327
 came-0.300
 tried-0.298
 stumbled-0.279
 could-0.278
Token that
Token ID326
Feature activation-1.44e-01

Pos logit contributions
igsaw+0.064
aj+0.055
 difference+0.053
 batch+0.053
 puzzle+0.053
Neg logit contributions
 decided-0.058
 stumbled-0.054
 tried-0.054
pped-0.053
 rushed-0.051
Token she
Token ID673
Feature activation+6.75e+00

Pos logit contributions
igsaw+0.005
aj+0.005
 difference+0.005
 coll+0.005
 puzzle+0.005
Neg logit contributions
 decided-0.002
pped-0.002
 tried-0.002
 came-0.002
ged-0.002
Token had
Token ID550
Feature activation+6.17e+00

Pos logit contributions
pped+0.100
 decided+0.096
ged+0.092
 stumbled+0.089
 came+0.081
Neg logit contributions
aj-0.219
igsaw-0.212
udd-0.211
 batch-0.196
 new-0.195
Token just
Token ID655
Feature activation+2.99e+00

Pos logit contributions
 decided+0.106
ged+0.090
pped+0.088
Please+0.087
 came+0.086
Neg logit contributions
aj-0.193
igsaw-0.184
udd-0.175
 former-0.175
 new-0.174
Token bought
Token ID5839
Feature activation+1.77e+00

Pos logit contributions
 decided+0.045
pped+0.039
 came+0.038
Please+0.037
 stumbled+0.036
Neg logit contributions
igsaw-0.101
aj-0.098
 goals-0.092
 former-0.091
udd-0.091
Token.
Token ID13
Feature activation-7.55e-02

Pos logit contributions
 decided+0.035
 tried+0.031
 came+0.029
 stumbled+0.029
pped+0.028
Neg logit contributions
igsaw-0.059
aj-0.055
 difference-0.053
 coll-0.052
 goals-0.051
Token It
Token ID632
Feature activation+8.10e+00

Pos logit contributions
igsaw+0.002
aj+0.002
 difference+0.002
 coll+0.002
 goals+0.002
Neg logit contributions
 decided-0.002
 came-0.001
 tried-0.001
 could-0.001
 stumbled-0.001
Token was
Token ID373
Feature activation+3.68e+00

Pos logit contributions
 decided+0.124
pped+0.100
 tried+0.099
 came+0.099
 stumbled+0.095
Neg logit contributions
igsaw-0.305
aj-0.290
 difference-0.280
 coll-0.272
udd-0.271
Token shiny
Token ID22441
Feature activation-7.44e+00

Pos logit contributions
 decided+0.093
 came+0.082
 tried+0.082
 stumbled+0.078
pped+0.078
Neg logit contributions
igsaw-0.106
aj-0.100
 difference-0.094
 coll-0.091
 former-0.090
Token and
Token ID290
Feature activation+3.62e-01

Pos logit contributions
igsaw+0.267
aj+0.254
 difference+0.241
 coll+0.235
 former+0.232
Neg logit contributions
 decided-0.137
 came-0.115
 tried-0.114
 stumbled-0.107
pped-0.105
Token But
Token ID887
Feature activation+3.99e+00

Pos logit contributions
 decided+0.017
 came+0.015
 tried+0.015
 could+0.014
 stumbled+0.014
Neg logit contributions
igsaw-0.022
aj-0.020
 difference-0.020
 coll-0.019
 former-0.019
Token when
Token ID618
Feature activation+4.08e+00

Pos logit contributions
 decided+0.074
 came+0.063
 tried+0.062
pped+0.059
 stumbled+0.057
Neg logit contributions
igsaw-0.137
aj-0.131
 difference-0.123
 former-0.121
 coll-0.121
Token she
Token ID673
Feature activation+7.55e+00

Pos logit contributions
 decided+0.070
 tried+0.057
pped+0.057
 came+0.056
 stumbled+0.052
Neg logit contributions
igsaw-0.146
aj-0.138
 difference-0.132
 coll-0.129
 former-0.129
Token got
Token ID1392
Feature activation+2.67e+00

Pos logit contributions
 decided+0.082
pped+0.066
ged+0.062
 him+0.058
 came+0.056
Neg logit contributions
igsaw-0.281
aj-0.265
 difference-0.247
uddle-0.244
 goal-0.241
Token closer
Token ID5699
Feature activation-7.83e-01

Pos logit contributions
 decided+0.051
pped+0.044
 came+0.044
 tried+0.042
 could+0.042
Neg logit contributions
igsaw-0.086
aj-0.079
 difference-0.074
 new-0.074
 biggest-0.073
Token,
Token ID11
Feature activation+6.85e-01

Pos logit contributions
igsaw+0.023
aj+0.021
 difference+0.020
 coll+0.019
udd+0.019
Neg logit contributions
 decided-0.018
pped-0.016
 tried-0.016
 came-0.016
 stumbled-0.015
Token she
Token ID673
Feature activation+7.22e+00

Pos logit contributions
 decided+0.012
 came+0.010
 tried+0.010
pped+0.010
 stumbled+0.009
Neg logit contributions
igsaw-0.024
aj-0.023
 difference-0.022
 coll-0.021
 former-0.021
Token noticed
Token ID6810
Feature activation-6.90e-01

Pos logit contributions
 decided+0.093
pped+0.080
ged+0.079
 ever+0.073
Please+0.073
Neg logit contributions
aj-0.246
udd-0.229
igsaw-0.226
uddle-0.226
 goals-0.218
Token the
Token ID262
Feature activation-1.17e+01

Pos logit contributions
igsaw+0.021
aj+0.020
 difference+0.019
 coll+0.018
udd+0.018
Neg logit contributions
 decided-0.015
 tried-0.013
 came-0.013
 stumbled-0.012
 rushed-0.012
Token w
Token ID266
Feature activation-1.15e+01

Pos logit contributions
igsaw+0.346
aj+0.321
 difference+0.296
 coll+0.292
 former+0.287
Neg logit contributions
 decided-0.301
 tried-0.269
 came-0.268
 could-0.251
 stumbled-0.250
Tokenaffle
Token ID30697
Feature activation-8.75e-01

Pos logit contributions
igsaw+0.220
aj+0.199
 difference+0.177
 coll+0.167
udd+0.165
Neg logit contributions
 decided-0.412
 came-0.375
 tried-0.374
 stumbled-0.362
 could-0.360
Token!"
Token ID2474
Feature activation+3.85e-01

Pos logit contributions
 decided+0.006
pped+0.006
 came+0.005
 stumbled+0.005
 tried+0.005
Neg logit contributions
igsaw-0.012
aj-0.012
 coll-0.011
 difference-0.011
 goals-0.011
Token Tom
Token ID4186
Feature activation+2.16e+00

Pos logit contributions
 decided+0.009
pped+0.008
 came+0.008
 tried+0.008
 stumbled+0.008
Neg logit contributions
igsaw-0.011
aj-0.010
 difference-0.010
 coll-0.009
 goals-0.009
Token said
Token ID531
Feature activation+7.46e-01

Pos logit contributions
 decided+0.047
pped+0.041
 stumbled+0.040
 came+0.040
 tried+0.039
Neg logit contributions
aj-0.063
igsaw-0.063
 difference-0.058
udd-0.057
 coll-0.057
Token.
Token ID13
Feature activation-1.77e-01

Pos logit contributions
 decided+0.016
pped+0.015
 stumbled+0.014
 tried+0.014
Please+0.014
Neg logit contributions
aj-0.020
igsaw-0.020
 difference-0.019
 new-0.018
 goals-0.018
Token "
Token ID366
Feature activation-6.85e-01

Pos logit contributions
igsaw+0.005
aj+0.005
 difference+0.005
 coll+0.005
 goals+0.004
Neg logit contributions
 decided-0.004
 tried-0.004
 came-0.004
pped-0.004
 stumbled-0.004
TokenDo
Token ID5211
Feature activation+3.60e+00

Pos logit contributions
igsaw+0.023
aj+0.021
 difference+0.021
 goals+0.021
 coll+0.020
Neg logit contributions
 decided-0.013
 came-0.012
 tried-0.011
pped-0.011
 stumbled-0.011
Token you
Token ID345
Feature activation+8.11e+00

Pos logit contributions
 decided+0.077
 came+0.066
 tried+0.065
 stumbled+0.061
pped+0.061
Neg logit contributions
igsaw-0.120
aj-0.113
 difference-0.107
 coll-0.104
 former-0.103
Token want
Token ID765
Feature activation+5.85e+00

Pos logit contributions
 decided+0.150
pped+0.141
ged+0.137
 came+0.128
 stumbled+0.126
Neg logit contributions
igsaw-0.253
 difference-0.242
aj-0.241
 goals-0.232
 former-0.226
Token to
Token ID284
Feature activation+3.56e+00

Pos logit contributions
 decided+0.098
 came+0.082
 tried+0.082
 could+0.073
 stumbled+0.071
Neg logit contributions
igsaw-0.226
aj-0.216
 difference-0.205
 former-0.201
udd-0.199
Token be
Token ID307
Feature activation+2.02e+00

Pos logit contributions
 decided+0.079
 came+0.068
 tried+0.068
 stumbled+0.064
 could+0.063
Neg logit contributions
igsaw-0.115
aj-0.109
 difference-0.103
 coll-0.099
 former-0.099
Token our
Token ID674
Feature activation-9.16e+00

Pos logit contributions
 decided+0.060
 came+0.054
 tried+0.054
 could+0.050
 stumbled+0.050
Neg logit contributions
igsaw-0.053
aj-0.047
 difference-0.045
 former-0.042
udd-0.042
Token just
Token ID655
Feature activation+3.77e+00

Pos logit contributions
 decided+0.075
 tried+0.065
 came+0.064
 stumbled+0.061
pped+0.061
Neg logit contributions
igsaw-0.102
aj-0.097
 difference-0.092
 coll-0.089
 former-0.088
Token his
Token ID465
Feature activation-5.69e+00

Pos logit contributions
 decided+0.083
 came+0.073
 tried+0.072
 stumbled+0.067
 could+0.066
Neg logit contributions
igsaw-0.124
aj-0.118
 difference-0.111
 coll-0.107
udd-0.106
Token friend
Token ID1545
Feature activation+1.76e+00

Pos logit contributions
igsaw+0.143
aj+0.126
 difference+0.115
 coll+0.115
udd+0.112
Neg logit contributions
 decided-0.170
 came-0.157
 tried-0.157
 could-0.150
 stumbled-0.149
Token,
Token ID11
Feature activation+2.38e-01

Pos logit contributions
 decided+0.042
 tried+0.037
pped+0.036
 came+0.035
 stumbled+0.035
Neg logit contributions
igsaw-0.050
aj-0.048
 difference-0.046
 coll-0.044
udd-0.043
Token Tim
Token ID5045
Feature activation+1.53e+00

Pos logit contributions
 decided+0.006
 tried+0.005
 came+0.005
pped+0.005
 stumbled+0.005
Neg logit contributions
igsaw-0.007
aj-0.006
 difference-0.006
 coll-0.006
 former-0.006
Tokenmy
Token ID1820
Feature activation+5.17e+00

Pos logit contributions
 decided+0.039
pped+0.035
 stumbled+0.034
 tried+0.034
 came+0.033
Neg logit contributions
igsaw-0.040
aj-0.038
 difference-0.038
 coll-0.036
udd-0.035
Token,
Token ID11
Feature activation-6.61e-02

Pos logit contributions
pped+0.126
 decided+0.126
 stumbled+0.114
Please+0.113
 tried+0.110
Neg logit contributions
igsaw-0.117
 difference-0.115
 coll-0.113
aj-0.109
 new-0.106
Token playing
Token ID2712
Feature activation+1.80e-01

Pos logit contributions
aj+0.002
udd+0.002
 difference+0.002
 new+0.002
igsaw+0.002
Neg logit contributions
 decided-0.001
pped-0.001
 possibly-0.001
 tried-0.001
Please-0.001
Token with
Token ID351
Feature activation-1.62e+00

Pos logit contributions
 decided+0.004
 came+0.003
 tried+0.003
 stumbled+0.003
 could+0.003
Neg logit contributions
igsaw-0.006
aj-0.006
 difference-0.005
 coll-0.005
 former-0.005
Token some
Token ID617
Feature activation-6.66e+00

Pos logit contributions
igsaw+0.056
aj+0.053
 difference+0.050
 coll+0.049
 former+0.049
Neg logit contributions
 decided-0.032
 came-0.027
 tried-0.027
 stumbled-0.025
 could-0.025
Token coc
Token ID8954
Feature activation-2.72e+00

Pos logit contributions
igsaw+0.183
aj+0.168
 former+0.151
 difference+0.150
udd+0.143
Neg logit contributions
 decided-0.190
 tried-0.176
 came-0.174
 could-0.166
 wanted-0.159
Token come
Token ID1282
Feature activation+5.55e+00

Pos logit contributions
 decided+0.198
pped+0.176
ged+0.174
 tried+0.167
 stumbled+0.163
Neg logit contributions
igsaw-0.242
aj-0.237
 difference-0.227
 new-0.213
 coll-0.213
Token relax
Token ID8960
Feature activation+1.99e+00

Pos logit contributions
 decided+0.127
 tried+0.112
pped+0.108
 stumbled+0.108
 came+0.107
Neg logit contributions
igsaw-0.158
aj-0.158
 difference-0.146
 coll-0.143
 former-0.141
Token with
Token ID351
Feature activation-1.37e+00

Pos logit contributions
 decided+0.043
 came+0.037
 tried+0.036
 stumbled+0.034
 could+0.034
Neg logit contributions
igsaw-0.066
aj-0.062
 difference-0.059
 coll-0.057
 former-0.057
Token you
Token ID345
Feature activation+5.04e+00

Pos logit contributions
igsaw+0.041
aj+0.039
 difference+0.036
 coll+0.035
 former+0.035
Neg logit contributions
 decided-0.034
 came-0.030
 tried-0.030
 stumbled-0.028
 could-0.028
Token?"
Token ID1701
Feature activation-1.01e+00

Pos logit contributions
 decided+0.112
 came+0.100
 tried+0.097
 could+0.093
 stumbled+0.089
Neg logit contributions
igsaw-0.169
aj-0.153
 difference-0.148
 coll-0.144
 former-0.143
Token
Token ID198
Feature activation-5.84e+00

Pos logit contributions
igsaw+0.031
aj+0.029
 difference+0.028
 coll+0.027
 former+0.027
Neg logit contributions
 decided-0.025
 came-0.022
 tried-0.022
 could-0.020
 stumbled-0.020
Token
Token ID198
Feature activation-5.75e+00

Pos logit contributions
igsaw+0.216
aj+0.197
 difference+0.192
 former+0.190
 goals+0.186
Neg logit contributions
 decided-0.112
 came-0.101
 tried-0.096
 could-0.095
 stumbled-0.089
TokenG
Token ID38
Feature activation-5.79e+00

Pos logit contributions
igsaw+0.168
aj+0.156
 difference+0.147
 coll+0.140
 former+0.140
Neg logit contributions
 decided-0.152
 came-0.134
 tried-0.133
 stumbled-0.126
 could-0.126
Tokeniz
Token ID528
Feature activation-1.58e+00

Pos logit contributions
igsaw+0.126
aj+0.116
 difference+0.105
 coll+0.100
 former+0.098
Neg logit contributions
 decided-0.190
 came-0.173
 tried-0.173
 stumbled-0.166
 could-0.165
Tokenmo
Token ID5908
Feature activation+1.15e+00

Pos logit contributions
igsaw+0.048
aj+0.046
 difference+0.042
udd+0.041
 coll+0.040
Neg logit contributions
 decided-0.037
 came-0.032
 tried-0.032
pped-0.031
 stumbled-0.031
Token answered
Token ID9373
Feature activation-8.30e-01

Pos logit contributions
 decided+0.019
pped+0.018
 came+0.016
 stumbled+0.015
 tried+0.015
Neg logit contributions
aj-0.039
igsaw-0.038
 difference-0.036
udd-0.035
 goals-0.035
Tokenoph
Token ID2522
Feature activation-5.35e+00

Pos logit contributions
igsaw+0.104
aj+0.095
 difference+0.092
 goals+0.089
 coll+0.089
Neg logit contributions
 decided-0.129
 came-0.118
 tried-0.114
 could-0.110
 stumbled-0.110
Tokenie
Token ID494
Feature activation+4.63e+00

Pos logit contributions
igsaw+0.171
aj+0.162
 difference+0.152
 coll+0.147
 former+0.146
Neg logit contributions
 decided-0.119
 tried-0.103
 came-0.103
 stumbled-0.097
pped-0.096
Token was
Token ID373
Feature activation+3.17e+00

Pos logit contributions
 decided+0.073
 came+0.059
 tried+0.059
pped+0.056
 stumbled+0.054
Neg logit contributions
igsaw-0.171
aj-0.165
 difference-0.156
 coll-0.152
udd-0.152
Token a
Token ID257
Feature activation-6.88e+00

Pos logit contributions
 decided+0.068
 came+0.059
 tried+0.059
 stumbled+0.055
pped+0.054
Neg logit contributions
igsaw-0.109
aj-0.100
 difference-0.098
 coll-0.093
 goals-0.092
Token very
Token ID845
Feature activation-2.85e+00

Pos logit contributions
igsaw+0.179
aj+0.170
 difference+0.154
 coll+0.149
 former+0.149
Neg logit contributions
 decided-0.198
 came-0.178
 tried-0.176
 could-0.167
 stumbled-0.166
Token successful
Token ID4388
Feature activation-7.06e+00

Pos logit contributions
igsaw+0.076
aj+0.072
 difference+0.068
 coll+0.066
 former+0.064
Neg logit contributions
 decided-0.079
 came-0.071
 tried-0.070
 could-0.066
 stumbled-0.066
Token swim
Token ID9422
Feature activation-1.19e-01

Pos logit contributions
igsaw+0.176
aj+0.169
 difference+0.151
 former+0.147
 coll+0.147
Neg logit contributions
 decided-0.209
 came-0.190
 tried-0.187
 stumbled-0.176
 could-0.176
Tokenmer
Token ID647
Feature activation-4.09e+00

Pos logit contributions
igsaw+0.004
aj+0.004
 difference+0.004
 coll+0.004
 former+0.004
Neg logit contributions
 decided-0.002
 came-0.002
 tried-0.002
 could-0.002
 stumbled-0.002
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
igsaw+0.139
aj+0.132
 difference+0.126
 coll+0.123
 goals+0.122
Neg logit contributions
 decided-0.076
 tried-0.065
 came-0.064
pped-0.063
 stumbled-0.061
Token One
Token ID1881
Feature activation-3.79e+00

Pos logit contributions
 decided+0.033
 came+0.029
 tried+0.028
 stumbled+0.027
 could+0.027
Neg logit contributions
igsaw-0.043
aj-0.042
 difference-0.039
 coll-0.038
 former-0.037
Token day
Token ID1110
Feature activation+4.48e+00

Pos logit contributions
igsaw+0.090
aj+0.084
 difference+0.076
 coll+0.074
 former+0.073
Neg logit contributions
 decided-0.122
 came-0.110
 tried-0.109
 could-0.105
 stumbled-0.104
Token't
Token ID470
Feature activation+5.91e+00

Pos logit contributions
 decided+0.076
 tried+0.058
 came+0.058
 stumbled+0.056
 could+0.050
Neg logit contributions
igsaw-0.197
aj-0.182
 difference-0.177
 goals-0.174
 coll-0.172
Token.
Token ID13
Feature activation+9.70e-01

Pos logit contributions
 decided+0.128
pped+0.118
ged+0.110
 came+0.106
 stumbled+0.105
Neg logit contributions
igsaw-0.181
 difference-0.169
 goals-0.160
aj-0.156
udd-0.154
Token You
Token ID921
Feature activation+8.80e+00

Pos logit contributions
 decided+0.020
 could+0.018
 tried+0.018
 came+0.017
 stumbled+0.017
Neg logit contributions
igsaw-0.031
aj-0.030
 coll-0.029
 difference-0.029
 former-0.028
Token hid
Token ID24519
Feature activation+2.70e+00

Pos logit contributions
 decided+0.196
ged+0.190
pped+0.175
Please+0.173
 rushed+0.168
Neg logit contributions
igsaw-0.221
aj-0.218
 goals-0.213
 difference-0.212
udd-0.209
Token in
Token ID287
Feature activation-2.10e+00

Pos logit contributions
 decided+0.055
 came+0.047
 tried+0.047
 stumbled+0.044
 could+0.043
Neg logit contributions
igsaw-0.092
aj-0.087
 difference-0.082
 coll-0.080
 goals-0.079
Token a
Token ID257
Feature activation-9.55e+00

Pos logit contributions
igsaw+0.065
aj+0.064
udd+0.059
 difference+0.059
uddle+0.059
Neg logit contributions
 decided-0.047
 tried-0.044
 it-0.042
 never-0.041
pped-0.040
Token bad
Token ID2089
Feature activation-9.18e+00

Pos logit contributions
igsaw+0.217
aj+0.208
udd+0.181
 former+0.180
 difference+0.179
Neg logit contributions
 decided-0.300
 tried-0.277
 came-0.276
 could-0.261
 got-0.259
Token place
Token ID1295
Feature activation+2.14e-01

Pos logit contributions
igsaw+0.203
aj+0.193
 difference+0.169
 former+0.164
udd+0.164
Neg logit contributions
 decided-0.295
 came-0.273
 tried-0.271
 stumbled-0.258
 could-0.254
Token.
Token ID13
Feature activation+3.37e-02

Pos logit contributions
 decided+0.004
 came+0.003
 tried+0.003
 could+0.003
 stumbled+0.003
Neg logit contributions
igsaw-0.008
aj-0.007
 difference-0.007
 coll-0.007
 former-0.007
Token That
Token ID1320
Feature activation+3.56e-01

Pos logit contributions
 decided+0.001
 tried+0.001
 could+0.001
 came+0.001
 stumbled+0.001
Neg logit contributions
igsaw-0.001
aj-0.001
 difference-0.001
 coll-0.001
udd-0.001
Token is
Token ID318
Feature activation+3.18e+00

Pos logit contributions
 decided+0.010
 came+0.009
 tried+0.009
 stumbled+0.009
 could+0.009
Neg logit contributions
igsaw-0.009
aj-0.009
 difference-0.008
 coll-0.008
 former-0.008
Token all
Token ID477
Feature activation+8.06e-01

Pos logit contributions
igsaw+0.093
aj+0.083
 difference+0.081
 goals+0.078
 coll+0.077
Neg logit contributions
 decided-0.070
 came-0.060
 tried-0.059
 stumbled-0.057
pped-0.057
Token.
Token ID13
Feature activation+4.22e-01

Pos logit contributions
 decided+0.016
 came+0.014
 tried+0.014
pped+0.013
 stumbled+0.013
Neg logit contributions
igsaw-0.028
aj-0.025
 difference-0.025
 goals-0.024
 coll-0.024
Token It
Token ID632
Feature activation+8.19e+00

Pos logit contributions
 decided+0.009
 came+0.008
 tried+0.008
 could+0.007
 stumbled+0.007
Neg logit contributions
igsaw-0.014
aj-0.013
 difference-0.013
 coll-0.013
 goals-0.012
Token was
Token ID373
Feature activation+3.86e+00

Pos logit contributions
pped+0.131
ged+0.119
 decided+0.118
 him+0.099
 stumbled+0.098
Neg logit contributions
igsaw-0.247
 difference-0.242
aj-0.241
 prett-0.235
 new-0.235
Token actually
Token ID1682
Feature activation+4.72e+00

Pos logit contributions
 decided+0.087
 came+0.076
 tried+0.076
 stumbled+0.071
pped+0.071
Neg logit contributions
igsaw-0.121
aj-0.114
 difference-0.108
 coll-0.105
 former-0.105
Token kind
Token ID1611
Feature activation-6.29e+00

Pos logit contributions
 decided+0.110
pped+0.098
 came+0.097
 tried+0.096
 stumbled+0.093
Neg logit contributions
igsaw-0.137
aj-0.126
 difference-0.121
 coll-0.119
 former-0.119
Token of
Token ID286
Feature activation-4.32e+00

Pos logit contributions
igsaw+0.216
aj+0.204
 difference+0.195
 coll+0.189
 goals+0.187
Neg logit contributions
 decided-0.125
 tried-0.106
 came-0.106
 stumbled-0.100
pped-0.100
Token cute
Token ID13779
Feature activation-8.25e+00

Pos logit contributions
igsaw+0.108
aj+0.100
 difference+0.093
 coll+0.089
 former+0.088
Neg logit contributions
 decided-0.128
 came-0.115
 tried-0.114
 stumbled-0.109
 could-0.109
Token.
Token ID13
Feature activation+2.71e-01

Pos logit contributions
igsaw+0.224
aj+0.207
 difference+0.197
 coll+0.189
 goals+0.186
Neg logit contributions
 decided-0.223
 tried-0.197
 came-0.197
pped-0.190
 stumbled-0.189
Token From
Token ID3574
Feature activation-1.77e+00

Pos logit contributions
 decided+0.006
 came+0.005
 could+0.005
 tried+0.005
 stumbled+0.005
Neg logit contributions
igsaw-0.009
aj-0.009
 difference-0.008
 coll-0.008
 goals-0.008
Token then
Token ID788
Feature activation+6.48e+00

Pos logit contributions
igsaw+0.063
aj+0.060
 difference+0.057
 coll+0.055
 former+0.055
Neg logit contributions
 decided-0.033
 came-0.028
 tried-0.028
 stumbled-0.026
 could-0.025
Token touched
Token ID12615
Feature activation-2.80e+00

Pos logit contributions
pped+0.037
 decided+0.035
 ever+0.031
'd+0.030
can+0.029
Neg logit contributions
aj-0.074
 coll-0.069
 prett-0.069
 batch-0.068
 new-0.067
Token the
Token ID262
Feature activation-1.35e+01

Pos logit contributions
igsaw+0.097
aj+0.091
 difference+0.089
 goals+0.088
 coll+0.087
Neg logit contributions
 decided-0.052
 came-0.044
 tried-0.043
 stumbled-0.042
 rushed-0.041
Token jelly
Token ID27644
Feature activation-7.93e+00

Pos logit contributions
igsaw+0.322
aj+0.294
 difference+0.268
 former+0.259
 coll+0.256
Neg logit contributions
 decided-0.422
 came-0.382
 tried-0.382
 could-0.365
 stumbled-0.364
Tokenfish
Token ID11084
Feature activation+9.63e-01

Pos logit contributions
igsaw+0.204
aj+0.186
 goals+0.179
uddle+0.178
 difference+0.178
Neg logit contributions
 decided-0.213
 rushed-0.195
 tried-0.194
 stumbled-0.191
pped-0.190
Token.
Token ID13
Feature activation+2.93e-01

Pos logit contributions
 decided+0.020
 stumbled+0.017
 came+0.017
pped+0.017
 tried+0.017
Neg logit contributions
igsaw-0.029
aj-0.028
 difference-0.028
 goals-0.028
 coll-0.027
Token
Token ID198
Feature activation-4.71e+00

Pos logit contributions
 decided+0.006
 came+0.006
 tried+0.006
 could+0.005
 stumbled+0.005
Neg logit contributions
igsaw-0.010
aj-0.009
 difference-0.009
 former-0.009
 coll-0.009
Token
Token ID198
Feature activation-4.96e+00

Pos logit contributions
igsaw+0.150
aj+0.141
 difference+0.133
 coll+0.129
 former+0.129
Neg logit contributions
 decided-0.107
 came-0.094
 tried-0.093
 could-0.088
 stumbled-0.088
TokenO
Token ID46
Feature activation-6.69e+00

Pos logit contributions
igsaw+0.147
aj+0.139
 difference+0.126
udd+0.124
 coll+0.124
Neg logit contributions
 decided-0.120
 came-0.108
 tried-0.106
 stumbled-0.101
 could-0.100
Tokenuch
Token ID794
Feature activation-3.97e+00

Pos logit contributions
igsaw+0.167
aj+0.164
udd+0.149
 difference+0.141
 coll+0.139
Neg logit contributions
 decided-0.193
 came-0.174
 tried-0.172
 could-0.163
 stumbled-0.163
Token!
Token ID0
Feature activation+8.29e-01

Pos logit contributions
igsaw+0.094
aj+0.094
udd+0.092
uddle+0.084
 goals+0.082
Neg logit contributions
 decided-0.109
 tried-0.095
 came-0.094
 stumbled-0.093
Please-0.091
Token The
Token ID383
Feature activation-1.08e+01

Pos logit contributions
 decided+0.020
 came+0.017
 tried+0.017
 could+0.016
 stumbled+0.016
Neg logit contributions
igsaw-0.025
aj-0.024
 difference-0.023
 coll-0.022
 former-0.022
Token made
Token ID925
Feature activation-3.15e-01

Pos logit contributions
 decided+0.007
pped+0.006
 came+0.006
 stumbled+0.006
 tried+0.006
Neg logit contributions
igsaw-0.010
 difference-0.010
aj-0.009
 former-0.009
 goals-0.009
Token them
Token ID606
Feature activation+3.70e+00

Pos logit contributions
igsaw+0.012
aj+0.011
 difference+0.011
 coll+0.010
 former+0.010
Neg logit contributions
 decided-0.005
 came-0.005
 tried-0.005
 could-0.004
 stumbled-0.004
Token last
Token ID938
Feature activation-3.12e+00

Pos logit contributions
 decided+0.084
 tried+0.072
 came+0.072
 could+0.068
 stumbled+0.067
Neg logit contributions
igsaw-0.118
aj-0.112
 difference-0.104
 coll-0.102
 former-0.100
Token longer
Token ID2392
Feature activation+1.73e+00

Pos logit contributions
igsaw+0.086
aj+0.080
 difference+0.075
 coll+0.072
 former+0.071
Neg logit contributions
 decided-0.084
 came-0.075
 tried-0.074
 stumbled-0.071
 could-0.070
Token.
Token ID13
Feature activation-4.97e-01

Pos logit contributions
 decided+0.033
 came+0.028
 tried+0.028
 stumbled+0.026
 could+0.026
Neg logit contributions
igsaw-0.061
aj-0.058
 difference-0.055
 coll-0.053
 former-0.053
Token The
Token ID383
Feature activation-1.11e+01

Pos logit contributions
igsaw+0.016
aj+0.015
 difference+0.014
 coll+0.014
udd+0.014
Neg logit contributions
 decided-0.011
 came-0.009
 tried-0.009
 could-0.009
 stumbled-0.009
Token moral
Token ID6573
Feature activation-1.19e+00

Pos logit contributions
igsaw+0.284
aj+0.280
 coll+0.243
 difference+0.242
udd+0.241
Neg logit contributions
 decided-0.322
 tried-0.285
 came-0.281
 could-0.267
 stumbled-0.266
Token of
Token ID286
Feature activation-1.95e+00

Pos logit contributions
igsaw+0.036
aj+0.035
 difference+0.032
 former+0.031
 coll+0.031
Neg logit contributions
 decided-0.029
 came-0.026
 tried-0.025
 could-0.024
 stumbled-0.024
Token the
Token ID262
Feature activation-1.00e+01

Pos logit contributions
igsaw+0.064
aj+0.063
 difference+0.057
udd+0.056
 goals+0.056
Neg logit contributions
 decided-0.042
 came-0.037
 tried-0.037
 could-0.034
pped-0.034
Token story
Token ID1621
Feature activation+3.85e+00

Pos logit contributions
aj+0.187
igsaw+0.182
ushes+0.155
udd+0.154
 former+0.150
Neg logit contributions
 decided-0.353
 came-0.327
 tried-0.324
 could-0.307
 stumbled-0.306
Token is
Token ID318
Feature activation+3.75e+00

Pos logit contributions
 decided+0.111
 came+0.101
 tried+0.099
 could+0.098
 stumbled+0.093
Neg logit contributions
igsaw-0.101
aj-0.096
 coll-0.085
 former-0.085
 difference-0.084
Token saw
Token ID2497
Feature activation-1.53e-01

Pos logit contributions
 decided+0.083
pped+0.083
ged+0.079
Please+0.069
 tried+0.067
Neg logit contributions
aj-0.277
udd-0.259
igsaw-0.258
 coll-0.248
 difference-0.246
Token a
Token ID257
Feature activation-9.10e+00

Pos logit contributions
igsaw+0.005
aj+0.005
 difference+0.005
 former+0.005
 coll+0.005
Neg logit contributions
 decided-0.003
 came-0.003
 tried-0.003
pped-0.002
 could-0.002
Token shiny
Token ID22441
Feature activation-1.25e+01

Pos logit contributions
igsaw+0.207
aj+0.197
 former+0.172
 difference+0.171
 goals+0.171
Neg logit contributions
 decided-0.286
 came-0.266
 tried-0.261
 could-0.249
 got-0.247
Token penny
Token ID26369
Feature activation-1.35e+00

Pos logit contributions
igsaw+0.312
aj+0.292
 difference+0.267
 former+0.256
 coll+0.256
Neg logit contributions
 decided-0.369
 came-0.334
 tried-0.332
 stumbled-0.315
 could-0.313
Token on
Token ID319
Feature activation-2.28e+00

Pos logit contributions
igsaw+0.034
 puzzle+0.034
 coll+0.032
 new+0.032
 biggest+0.031
Neg logit contributions
 decided-0.030
 rushed-0.027
 stumbled-0.026
 managed-0.026
 came-0.026
Token the
Token ID262
Feature activation-1.31e+01

Pos logit contributions
igsaw+0.073
aj+0.071
 difference+0.066
 coll+0.064
udd+0.064
Neg logit contributions
 decided-0.051
 tried-0.044
 came-0.044
pped-0.041
 stumbled-0.041
Token ground
Token ID2323
Feature activation+1.73e+00

Pos logit contributions
igsaw+0.324
aj+0.310
 difference+0.276
 former+0.266
udd+0.262
Neg logit contributions
 decided-0.398
 came-0.353
 tried-0.352
 could-0.337
 stumbled-0.336
Token and
Token ID290
Feature activation+2.63e+00

Pos logit contributions
 decided+0.036
 tried+0.031
pped+0.030
 came+0.030
 rushed+0.030
Neg logit contributions
igsaw-0.051
aj-0.048
udd-0.046
 difference-0.045
 goals-0.045
Token picked
Token ID6497
Feature activation+1.36e+00

Pos logit contributions
pped+0.037
 ever+0.029
dd+0.026
pper+0.026
ged+0.025
Neg logit contributions
 new-0.087
 prett-0.083
aj-0.081
 coll-0.081
 types-0.080
Token it
Token ID340
Feature activation+5.29e+00

Pos logit contributions
 decided+0.016
 tried+0.012
 came+0.011
 stumbled+0.011
 rushed+0.011
Neg logit contributions
igsaw-0.054
 difference-0.051
 coll-0.050
 new-0.050
 biggest-0.050
Token up
Token ID510
Feature activation-6.19e-01

Pos logit contributions
 decided+0.168
 came+0.153
 tried+0.152
 stumbled+0.149
pped+0.148
Neg logit contributions
igsaw-0.114
aj-0.105
 difference-0.098
 coll-0.093
 goals-0.091
Token crawling
Token ID34499
Feature activation-9.75e-01

Pos logit contributions
igsaw+0.050
aj+0.049
 difference+0.046
 coll+0.046
 batch+0.045
Neg logit contributions
 decided-0.032
 tried-0.027
pped-0.027
 came-0.027
 rushed-0.026
Token on
Token ID319
Feature activation-1.38e+00

Pos logit contributions
igsaw+0.029
aj+0.027
 goals+0.027
 coll+0.026
udd+0.026
Neg logit contributions
 decided-0.019
 tried-0.017
 came-0.016
 rushed-0.016
 stumbled-0.016
Token the
Token ID262
Feature activation-1.37e+01

Pos logit contributions
igsaw+0.044
aj+0.040
 difference+0.039
 coll+0.038
 former+0.037
Neg logit contributions
 decided-0.031
 came-0.027
 tried-0.027
 stumbled-0.026
 could-0.026
Token sand
Token ID6450
Feature activation-4.66e+00

Pos logit contributions
igsaw+0.339
aj+0.320
 difference+0.289
 coll+0.279
 former+0.278
Neg logit contributions
 decided-0.415
 came-0.369
 tried-0.368
 stumbled-0.352
 could-0.351
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
igsaw+0.150
aj+0.146
 difference+0.137
 coll+0.134
udd+0.134
Neg logit contributions
 decided-0.091
 tried-0.082
pped-0.079
 came-0.078
 rushed-0.076
Token The
Token ID383
Feature activation-1.10e+01

Pos logit contributions
 decided+0.006
 came+0.006
 tried+0.006
 stumbled+0.005
 could+0.005
Neg logit contributions
igsaw-0.008
aj-0.008
 difference-0.007
 coll-0.007
 former-0.007
Token crab
Token ID32202
Feature activation+4.69e-01

Pos logit contributions
igsaw+0.339
aj+0.315
 difference+0.301
 coll+0.288
 goals+0.288
Neg logit contributions
 decided-0.255
 came-0.225
 tried-0.223
pped-0.214
 stumbled-0.211
Token had
Token ID550
Feature activation+6.71e+00

Pos logit contributions
 decided+0.007
pped+0.006
 came+0.005
 tried+0.005
 stumbled+0.005
Neg logit contributions
igsaw-0.017
aj-0.017
 difference-0.016
udd-0.016
 coll-0.016
Token a
Token ID257
Feature activation-8.17e+00

Pos logit contributions
 decided+0.146
 came+0.127
 tried+0.126
pped+0.125
 could+0.119
Neg logit contributions
igsaw-0.203
aj-0.195
 difference-0.184
 coll-0.183
 goals-0.181
Token shiny
Token ID22441
Feature activation-1.22e+01

Pos logit contributions
igsaw+0.176
aj+0.160
 difference+0.149
 coll+0.142
 former+0.139
Neg logit contributions
 decided-0.268
 came-0.243
 tried-0.242
pped-0.236
 stumbled-0.233
Token shell
Token ID7582
Feature activation-6.28e-01

Pos logit contributions
igsaw+0.285
aj+0.263
 difference+0.244
 coll+0.237
 goals+0.229
Neg logit contributions
 decided-0.372
 came-0.334
 tried-0.331
 could-0.322
 stumbled-0.321
Token again
Token ID757
Feature activation+3.45e+00

Pos logit contributions
 decided+0.051
 came+0.043
 tried+0.043
 could+0.041
 stumbled+0.041
Neg logit contributions
igsaw-0.079
aj-0.075
 difference-0.073
 coll-0.071
 former-0.070
Token.
Token ID13
Feature activation+2.17e-01

Pos logit contributions
 decided+0.069
 came+0.061
 tried+0.060
pped+0.059
 could+0.056
Neg logit contributions
igsaw-0.113
aj-0.106
 difference-0.103
 former-0.102
 goals-0.101
Token
Token ID220
Feature activation-1.53e+00

Pos logit contributions
 decided+0.004
 could+0.004
 came+0.004
 tried+0.004
 stumbled+0.004
Neg logit contributions
igsaw-0.007
aj-0.007
 coll-0.007
 difference-0.007
udd-0.006
Token
Token ID198
Feature activation-4.70e+00

Pos logit contributions
igsaw+0.053
aj+0.049
 difference+0.048
 coll+0.048
 former+0.047
Neg logit contributions
 decided-0.030
 came-0.026
 could-0.026
 tried-0.025
 stumbled-0.024
Token
Token ID198
Feature activation-4.84e+00

Pos logit contributions
igsaw+0.146
aj+0.137
 difference+0.129
 coll+0.127
 former+0.125
Neg logit contributions
 decided-0.114
 came-0.099
 tried-0.098
 could-0.096
 stumbled-0.094
TokenThe
Token ID464
Feature activation-1.12e+01

Pos logit contributions
igsaw+0.138
aj+0.129
 difference+0.121
 coll+0.117
 former+0.115
Neg logit contributions
 decided-0.128
 came-0.113
 tried-0.112
 stumbled-0.107
 could-0.106
Token end
Token ID886
Feature activation-1.54e+00

Pos logit contributions
igsaw+0.289
aj+0.278
 difference+0.246
udd+0.244
 coll+0.244
Neg logit contributions
 decided-0.325
 tried-0.291
 came-0.289
 stumbled-0.274
 could-0.272
Token.
Token ID13
Feature activation+3.51e-01

Pos logit contributions
igsaw+0.058
aj+0.056
 difference+0.053
 coll+0.052
 former+0.051
Neg logit contributions
 decided-0.026
 came-0.021
 tried-0.021
 stumbled-0.019
 could-0.019
Token<|endoftext|>
Token ID50256
Feature activation-4.00e+00

Pos logit contributions
 decided+0.008
 came+0.007
 tried+0.007
 could+0.006
 stumbled+0.006
Neg logit contributions
igsaw-0.011
aj-0.011
 difference-0.010
 coll-0.010
udd-0.010
TokenOnce
Token ID7454
Feature activation+6.85e-01

Pos logit contributions
igsaw+0.100
aj+0.092
 difference+0.085
 coll+0.081
 goals+0.080
Neg logit contributions
 decided-0.117
 tried-0.105
 came-0.105
 stumbled-0.100
pped-0.100
Token upon
Token ID2402
Feature activation-2.26e+00

Pos logit contributions
 decided+0.017
 came+0.016
 could+0.015
 tried+0.015
 got+0.014
Neg logit contributions
igsaw-0.020
aj-0.019
 former-0.019
 coll-0.019
 difference-0.018
Token gathered
Token ID9272
Feature activation+1.18e-01

Pos logit contributions
pped+0.102
 decided+0.079
ged+0.078
 ever+0.075
Please+0.071
Neg logit contributions
aj-0.224
 new-0.207
udd-0.206
 goals-0.197
 coll-0.196
Token her
Token ID607
Feature activation-4.15e+00

Pos logit contributions
 decided+0.003
 tried+0.002
 came+0.002
 could+0.002
 stumbled+0.002
Neg logit contributions
igsaw-0.004
aj-0.004
 difference-0.004
 former-0.004
udd-0.004
Token courage
Token ID11917
Feature activation-4.81e-01

Pos logit contributions
igsaw+0.102
aj+0.089
 difference+0.087
udd+0.083
 former+0.083
Neg logit contributions
 decided-0.129
 tried-0.119
 came-0.116
 could-0.113
 stumbled-0.110
Token and
Token ID290
Feature activation+1.64e+00

Pos logit contributions
igsaw+0.014
aj+0.014
 coll+0.013
 new+0.013
 difference+0.013
Neg logit contributions
 decided-0.010
pped-0.009
Please-0.009
 came-0.009
 tried-0.009
Token asked
Token ID1965
Feature activation+2.77e+00

Pos logit contributions
 decided+0.020
pped+0.018
 came+0.016
 tried+0.016
 stumbled+0.015
Neg logit contributions
igsaw-0.062
aj-0.061
 coll-0.058
udd-0.057
 difference-0.057
Token the
Token ID262
Feature activation-1.12e+01

Pos logit contributions
 decided+0.052
 came+0.045
 tried+0.044
 could+0.040
 stumbled+0.040
Neg logit contributions
igsaw-0.101
aj-0.096
 difference-0.090
 former-0.088
 coll-0.088
Token dolphin
Token ID44136
Feature activation+3.88e-01

Pos logit contributions
igsaw+0.309
aj+0.289
 difference+0.273
 coll+0.263
 goals+0.258
Neg logit contributions
 decided-0.299
 came-0.267
 tried-0.266
 stumbled-0.253
pped-0.253
Token,
Token ID11
Feature activation+3.69e-01

Pos logit contributions
 decided+0.008
 came+0.007
 tried+0.007
 stumbled+0.007
pped+0.007
Neg logit contributions
igsaw-0.013
aj-0.012
 difference-0.011
 coll-0.011
 former-0.011
Token "
Token ID366
Feature activation-8.02e-01

Pos logit contributions
 decided+0.010
pped+0.009
 tried+0.009
 came+0.009
 stumbled+0.009
Neg logit contributions
igsaw-0.009
aj-0.009
 difference-0.008
 coll-0.008
 new-0.008
TokenAre
Token ID8491
Feature activation+3.53e+00

Pos logit contributions
igsaw+0.024
aj+0.023
 coll+0.022
 difference+0.022
 former+0.022
Neg logit contributions
 decided-0.015
 tried-0.014
 came-0.013
 could-0.013
 dropped-0.013
Token you
Token ID345
Feature activation+7.72e+00

Pos logit contributions
 decided+0.085
 came+0.073
 tried+0.073
pped+0.070
 stumbled+0.069
Neg logit contributions
igsaw-0.105
aj-0.098
 difference-0.092
 goals-0.089
 coll-0.088