TOP ACTIVATIONS
MAX = 29.530

was the brightest it could possibly be . Ab bi had
wanted to do everything she possibly could . One
, " What could you possibly give me that 's better
as many treats as they possibly could , they stopped .
of his had passed away recently , and he was so
they were little . But recently Max and Sarah had begun
He has been so spoiled recently , so let 's pick
a long time , but recently they started to disagree about
laughed and said it was probably just the wind in the
resa replied , " I 'd much rather stay here and
" Of course , I 'd be happy to teach you
said , " We should definitely play hide and seek again
, " Sure , we 'd love to ."
. She said it had probably died of natural causes .
leaves , and this one probably used to belong to someone
. Jack knew he would definitely come back to slide again
the mixer . It was definitely sm elly .
Pick out whatever yogurt you 'd like for a treat ,
the garden . He was probably just looking for some food
amazed at how neatly he 'd done . "

MIN ACTIVATIONS
MIN = -30.994

wagon crashed into a d andel ion bush ! Jack was
, Jack held a d andel ion . He took a
a large field of d andel ions . It was so
spotted a patch of d andel ions . He ran to
to be okay ?" The paramed ic smiled and said ,
even spotted a big ch andel ier that spark led like
L ily asked the paramed ic , " Is the
the dirt , yellow d andel ions sprung up , and
a big , beautiful ch andel ier sparkling in the sunlight
. The driver and the paramed ic jumped out . They
shiny floor and a ch andel ier . L
tree , a yellow d andel ion and a little red
which was filled with d andel ions and butterflies .
garden and started picking d andel ions and other pretty flowers
little girl picked the d andel ion and took a deep
stop and smell the d andel ion too !"
the park and feed the pige ons . One day ,
that it was a ch andel ier , an old and
pick and pl uck d andel ions from the ground .
- it was bigger and ju ic ier than any he

INTERVAL 14.399 - 29.530
CONTAINS 11.155%

got angry . He threw the ball very hard
got louder and louder . He was sure that this flower
upon a time , there was a little girl named Lily
them look different . Everywhere she went she was in a
" It 's hurt ," she said sadly .

INTERVAL -0.732 - 14.399
CONTAINS 33.882%

feeling the warm sunshine in its fur . The
. Harmony slowly bent down and pe ered in - and
k ang aroo would be okay . They wished they could
me the hat !" He tries to pull the hat from
, " I don 't know , Lily . Maybe he

INTERVAL -15.863 - -0.732
CONTAINS 53.590%

it . It was a passport . It had a picture
so excited to wear her new boots outside . She put
noises like a car . But they pushed the
not to hurt your lip when you fall ."
friend ?" The cow replied , " No way

INTERVAL -30.994 - -15.863
CONTAINS 1.373%

the best of friends . <|endoftext|> Once upon a time ,
popcorn . It was so y ummy that everyone wanted to
Leo and Spike found a big tree with a twist y
, Mom gives them new v ests . They are blue
to keep it forever ! <|endoftext|> Once upon a time ,
Token was
Token ID373
Feature activation+1.82e+01

Pos logit contributions
 hadn+0.117
 wouldn+0.116
'd+0.112
 never+0.104
 probably+0.094
Neg logit contributions
igsaw-0.172
andel-0.153
 pige-0.153
ay-0.149
 mel-0.147
Token the
Token ID262
Feature activation-1.18e+01

Pos logit contributions
 wouldn+0.446
 hadn+0.437
'd+0.432
 never+0.392
 also+0.354
Neg logit contributions
igsaw-0.518
andel-0.460
ay-0.437
 pige-0.431
 bou-0.416
Token brightest
Token ID33871
Feature activation-1.47e+01

Pos logit contributions
igsaw+0.332
andel+0.293
ay+0.280
 pige+0.271
aff+0.262
Neg logit contributions
 wouldn-0.299
 hadn-0.294
'd-0.291
 never-0.272
 also-0.248
Token it
Token ID340
Feature activation+8.89e+00

Pos logit contributions
igsaw+0.317
andel+0.262
ay+0.248
 pige+0.231
aff+0.224
Neg logit contributions
 wouldn-0.477
 hadn-0.471
'd-0.468
 never-0.442
 also-0.415
Token could
Token ID714
Feature activation+1.57e+01

Pos logit contributions
 wouldn+0.128
 hadn+0.122
itates+0.119
'd+0.119
bled+0.111
Neg logit contributions
igsaw-0.282
 past-0.262
 mel-0.259
 pige-0.252
ay-0.248
Token possibly
Token ID5457
Feature activation+2.95e+01

Pos logit contributions
 wouldn+0.417
 hadn+0.408
'd+0.386
 never+0.350
 forgot+0.329
Neg logit contributions
igsaw-0.405
andel-0.350
ay-0.342
 bou-0.328
 purpose-0.327
Token be
Token ID307
Feature activation+1.19e+01

Pos logit contributions
 wouldn+0.712
 hadn+0.704
'd+0.675
 never+0.624
 also+0.573
Neg logit contributions
igsaw-0.817
andel-0.728
ay-0.708
 pige-0.701
 mel-0.673
Token.
Token ID13
Feature activation-2.70e+00

Pos logit contributions
 wouldn+0.333
 hadn+0.330
'd+0.326
 never+0.309
 also+0.283
Neg logit contributions
igsaw-0.304
andel-0.265
ay-0.252
 pige-0.239
aff-0.232
Token Ab
Token ID2275
Feature activation-5.98e+00

Pos logit contributions
igsaw+0.070
andel+0.062
ay+0.059
 pige+0.057
aff+0.055
Neg logit contributions
 wouldn-0.074
 hadn-0.072
'd-0.072
 never-0.067
 also-0.061
Tokenbi
Token ID8482
Feature activation+5.26e+00

Pos logit contributions
igsaw+0.157
andel+0.139
ay+0.132
 pige+0.128
aff+0.123
Neg logit contributions
 wouldn-0.161
 hadn-0.159
'd-0.157
 never-0.147
 also-0.135
Token had
Token ID550
Feature activation+1.86e+01

Pos logit contributions
 wouldn+0.096
 hadn+0.094
'd+0.089
 never+0.077
 probably+0.073
Neg logit contributions
igsaw-0.169
andel-0.151
ay-0.150
 pige-0.150
 mel-0.146
Token wanted
Token ID2227
Feature activation+1.01e+01

Pos logit contributions
bled+0.053
 hadn+0.053
 wouldn+0.052
'd+0.049
itates+0.046
Neg logit contributions
 past-0.065
igsaw-0.062
 oak-0.061
 blue-0.060
 mel-0.057
Token to
Token ID284
Feature activation+1.97e-01

Pos logit contributions
 wouldn+0.295
 hadn+0.289
'd+0.287
 never+0.281
 also+0.258
Neg logit contributions
igsaw-0.250
andel-0.217
ay-0.207
aff-0.191
 pige-0.189
Token do
Token ID466
Feature activation+4.89e+00

Pos logit contributions
'd+0.004
 never+0.004
 wouldn+0.004
 hadn+0.004
 also+0.004
Neg logit contributions
igsaw-0.006
andel-0.006
ay-0.005
aff-0.005
 pige-0.005
Token everything
Token ID2279
Feature activation+6.39e-01

Pos logit contributions
'd+0.158
 hadn+0.157
 wouldn+0.157
 never+0.142
 also+0.133
Neg logit contributions
igsaw-0.100
andel-0.078
 purpose-0.074
 pige-0.073
ay-0.072
Token she
Token ID673
Feature activation+1.79e+01

Pos logit contributions
 wouldn+0.018
 hadn+0.018
'd+0.018
 never+0.016
 also+0.015
Neg logit contributions
igsaw-0.016
andel-0.014
ay-0.013
 pige-0.013
 mel-0.012
Token possibly
Token ID5457
Feature activation+2.83e+01

Pos logit contributions
bled+0.272
 wouldn+0.255
 hadn+0.246
 never+0.230
 not+0.221
Neg logit contributions
 past-0.533
igsaw-0.533
ay-0.525
 par-0.521
oc-0.518
Token could
Token ID714
Feature activation+1.43e+01

Pos logit contributions
 hadn+0.478
 wouldn+0.470
bled+0.462
'd+0.429
 never+0.411
Neg logit contributions
igsaw-0.845
ay-0.798
 past-0.795
 mel-0.794
 oak-0.792
Token.
Token ID13
Feature activation-3.37e+00

Pos logit contributions
 wouldn+0.367
 hadn+0.363
'd+0.353
 never+0.322
 also+0.292
Neg logit contributions
igsaw-0.397
andel-0.344
ay-0.336
 pige-0.320
 mel-0.317
Token
Token ID198
Feature activation-5.17e+00

Pos logit contributions
igsaw+0.108
andel+0.101
 pige+0.092
 gol+0.090
ay+0.089
Neg logit contributions
 hadn-0.072
 wouldn-0.072
'd-0.070
 never-0.067
 also-0.063
Token
Token ID198
Feature activation-5.18e+00

Pos logit contributions
igsaw+0.140
andel+0.124
 pige+0.113
ay+0.112
 gol+0.109
Neg logit contributions
 hadn-0.136
 wouldn-0.135
'd-0.134
 never-0.129
 also-0.119
TokenOne
Token ID3198
Feature activation-7.90e+00

Pos logit contributions
igsaw+0.150
andel+0.135
ay+0.121
 pige+0.120
aff+0.119
Neg logit contributions
'd-0.133
 hadn-0.132
 never-0.131
 wouldn-0.127
 also-0.122
Token,
Token ID11
Feature activation-8.25e+00

Pos logit contributions
 wouldn+0.054
 hadn+0.053
'd+0.053
 never+0.050
 also+0.047
Neg logit contributions
igsaw-0.037
andel-0.031
ay-0.029
 pige-0.028
aff-0.027
Token "
Token ID366
Feature activation-2.00e+00

Pos logit contributions
igsaw+0.160
andel+0.147
 pige+0.138
 mel+0.134
ay+0.133
Neg logit contributions
 wouldn-0.250
 hadn-0.249
'd-0.248
 never-0.222
 forgot-0.220
TokenWhat
Token ID2061
Feature activation+5.35e+00

Pos logit contributions
igsaw+0.053
andel+0.048
 pige+0.047
ay+0.046
 perspective+0.046
Neg logit contributions
 wouldn-0.049
 hadn-0.048
'd-0.045
 never-0.043
 forgot-0.040
Token could
Token ID714
Feature activation+1.33e+01

Pos logit contributions
 hadn+0.116
 wouldn+0.116
'd+0.107
 never+0.101
bled+0.096
Neg logit contributions
igsaw-0.152
 pige-0.137
ay-0.137
 mel-0.135
 par-0.130
Token you
Token ID345
Feature activation+1.39e+01

Pos logit contributions
 wouldn+0.308
'd+0.295
 hadn+0.295
 never+0.261
 also+0.235
Neg logit contributions
igsaw-0.403
 pige-0.344
andel-0.339
 mel-0.333
ay-0.332
Token possibly
Token ID5457
Feature activation+2.71e+01

Pos logit contributions
 wouldn+0.340
 hadn+0.337
'd+0.326
 never+0.302
 also+0.270
Neg logit contributions
igsaw-0.394
andel-0.344
ay-0.334
 pige-0.321
 mel-0.316
Token give
Token ID1577
Feature activation-5.13e+00

Pos logit contributions
 wouldn+0.661
 hadn+0.658
'd+0.632
 never+0.584
bled+0.562
Neg logit contributions
igsaw-0.704
 pige-0.609
 mel-0.603
andel-0.594
ay-0.593
Token me
Token ID502
Feature activation+4.74e+00

Pos logit contributions
igsaw+0.143
andel+0.128
ay+0.119
 pige+0.116
aff+0.113
Neg logit contributions
 wouldn-0.134
 hadn-0.129
'd-0.128
 never-0.123
 also-0.112
Token that
Token ID326
Feature activation-1.28e+00

Pos logit contributions
 wouldn+0.129
'd+0.123
 never+0.122
 hadn+0.120
 also+0.111
Neg logit contributions
igsaw-0.125
andel-0.112
aff-0.104
ay-0.102
 gol-0.101
Token's
Token ID338
Feature activation+1.56e+00

Pos logit contributions
igsaw+0.026
andel+0.022
ay+0.020
aff+0.019
 pige+0.019
Neg logit contributions
 wouldn-0.044
 hadn-0.043
'd-0.042
 never-0.040
 also-0.038
Token better
Token ID1365
Feature activation-6.20e+00

Pos logit contributions
 wouldn+0.048
 hadn+0.047
'd+0.046
 never+0.045
 also+0.041
Neg logit contributions
igsaw-0.036
andel-0.032
ay-0.029
 pige-0.028
aff-0.027
Token as
Token ID355
Feature activation-7.19e-02

Pos logit contributions
igsaw+0.056
andel+0.049
ay+0.046
 pige+0.044
 gol+0.044
Neg logit contributions
 wouldn-0.058
 hadn-0.056
'd-0.055
 never-0.053
 also-0.050
Token many
Token ID867
Feature activation-9.19e+00

Pos logit contributions
igsaw+0.002
andel+0.002
ay+0.002
 pige+0.002
 mel+0.001
Neg logit contributions
 wouldn-0.002
 hadn-0.002
'd-0.002
 never-0.002
 probably-0.002
Token treats
Token ID18432
Feature activation-7.98e+00

Pos logit contributions
igsaw+0.200
andel+0.169
ay+0.158
 gol+0.149
 perspective+0.144
Neg logit contributions
 wouldn-0.291
 hadn-0.290
'd-0.285
 never-0.272
 also-0.259
Token as
Token ID355
Feature activation-1.68e-01

Pos logit contributions
igsaw+0.142
andel+0.114
ay+0.109
 pige+0.102
 mel+0.097
Neg logit contributions
 wouldn-0.279
 hadn-0.277
'd-0.275
 never-0.262
 also-0.245
Token they
Token ID484
Feature activation+1.52e+01

Pos logit contributions
igsaw+0.005
andel+0.005
ay+0.005
 pige+0.005
 mel+0.004
Neg logit contributions
 wouldn-0.004
 hadn-0.004
'd-0.004
 never-0.003
 probably-0.003
Token possibly
Token ID5457
Feature activation+2.68e+01

Pos logit contributions
 wouldn+0.253
 hadn+0.244
'd+0.238
 never+0.217
 probably+0.205
Neg logit contributions
igsaw-0.512
andel-0.464
 pige-0.461
ay-0.458
 mel-0.457
Token could
Token ID714
Feature activation+1.29e+01

Pos logit contributions
 wouldn+0.416
 hadn+0.398
'd+0.391
 never+0.361
 also+0.323
Neg logit contributions
igsaw-0.965
andel-0.871
ay-0.869
aff-0.847
 pige-0.846
Token,
Token ID11
Feature activation-8.73e+00

Pos logit contributions
 wouldn+0.391
 hadn+0.382
'd+0.367
 never+0.338
 probably+0.322
Neg logit contributions
igsaw-0.284
ay-0.238
andel-0.231
 bou-0.228
aff-0.221
Token they
Token ID484
Feature activation+1.51e+01

Pos logit contributions
igsaw+0.248
 pige+0.220
andel+0.218
 golf+0.209
 mel+0.208
Neg logit contributions
'd-0.208
 wouldn-0.206
 hadn-0.202
 never-0.183
 probably-0.168
Token stopped
Token ID5025
Feature activation-2.92e+00

Pos logit contributions
 wouldn+0.267
 hadn+0.262
'd+0.242
itates+0.235
 probably+0.231
Neg logit contributions
igsaw-0.427
 mel-0.421
 bou-0.420
 pige-0.418
 oak-0.402
Token.
Token ID13
Feature activation-4.80e+00

Pos logit contributions
igsaw+0.072
andel+0.062
ay+0.059
 pige+0.057
aff+0.054
Neg logit contributions
 wouldn-0.085
 hadn-0.084
'd-0.083
 never-0.078
 also-0.071
Token of
Token ID286
Feature activation-2.71e+00

Pos logit contributions
igsaw+0.198
ay+0.177
 pige+0.172
andel+0.168
 par+0.168
Neg logit contributions
 wouldn-0.148
 hadn-0.147
'd-0.145
 never-0.133
 almost-0.125
Token his
Token ID465
Feature activation+2.96e+00

Pos logit contributions
igsaw+0.061
andel+0.052
ay+0.049
 pige+0.047
aff+0.045
Neg logit contributions
 wouldn-0.084
 hadn-0.083
'd-0.083
 never-0.078
 also-0.072
Token had
Token ID550
Feature activation+1.91e+01

Pos logit contributions
 wouldn+0.091
 hadn+0.090
'd+0.088
 never+0.085
 also+0.078
Neg logit contributions
igsaw-0.069
andel-0.060
ay-0.055
aff-0.053
 pige-0.051
Token passed
Token ID3804
Feature activation-8.93e-01

Pos logit contributions
 wouldn+0.399
'd+0.392
 hadn+0.391
 never+0.348
 also+0.320
Neg logit contributions
igsaw-0.591
andel-0.531
ay-0.518
 pige-0.509
 mel-0.497
Token away
Token ID1497
Feature activation-3.96e+00

Pos logit contributions
igsaw+0.021
andel+0.018
ay+0.017
 pige+0.016
aff+0.016
Neg logit contributions
 wouldn-0.027
 hadn-0.026
'd-0.026
 never-0.025
 also-0.023
Token recently
Token ID2904
Feature activation+2.37e+01

Pos logit contributions
igsaw+0.080
andel+0.066
ay+0.062
 pige+0.059
 mel+0.056
Neg logit contributions
 wouldn-0.129
 hadn-0.128
'd-0.128
 never-0.119
 also-0.112
Token,
Token ID11
Feature activation-6.12e+00

Pos logit contributions
 wouldn+0.779
 hadn+0.769
'd+0.763
 never+0.723
 also+0.675
Neg logit contributions
igsaw-0.488
andel-0.409
ay-0.384
 pige-0.365
aff-0.347
Token and
Token ID290
Feature activation+3.36e+00

Pos logit contributions
igsaw+0.123
 pige+0.104
 golf+0.104
 mel+0.103
 past+0.101
Neg logit contributions
'd-0.180
 wouldn-0.179
 hadn-0.173
bled-0.158
itates-0.156
Token he
Token ID339
Feature activation+2.03e+01

Pos logit contributions
'd+0.093
 wouldn+0.088
 hadn+0.087
itates+0.081
 never+0.077
Neg logit contributions
igsaw-0.077
 mel-0.064
 golf-0.063
 sight-0.061
 past-0.060
Token was
Token ID373
Feature activation+1.84e+01

Pos logit contributions
 hadn+0.326
'd+0.323
 wouldn+0.313
bled+0.311
itates+0.303
Neg logit contributions
oc-0.534
 par-0.530
 k-0.525
 rh-0.518
igsaw-0.500
Token so
Token ID523
Feature activation-9.77e-01

Pos logit contributions
 wouldn+0.423
'd+0.410
 hadn+0.399
 forgot+0.322
 never+0.321
Neg logit contributions
igsaw-0.519
andel-0.466
ay-0.445
 pige-0.443
 k-0.437
Token they
Token ID484
Feature activation+1.79e+01

Pos logit contributions
 wouldn+0.023
 hadn+0.023
'd+0.023
 never+0.020
 probably+0.019
Neg logit contributions
igsaw-0.028
 pige-0.026
 k-0.025
 par-0.025
ay-0.025
Token were
Token ID547
Feature activation+1.65e+01

Pos logit contributions
 hadn+0.311
 wouldn+0.310
'd+0.287
bled+0.255
 probably+0.251
Neg logit contributions
igsaw-0.517
 pige-0.494
 past-0.480
 oak-0.478
 par-0.478
Token little
Token ID1310
Feature activation-1.13e+01

Pos logit contributions
 wouldn+0.407
 hadn+0.398
'd+0.395
 never+0.338
 could+0.312
Neg logit contributions
igsaw-0.451
andel-0.409
 pige-0.402
ay-0.390
 past-0.378
Token.
Token ID13
Feature activation-2.64e+00

Pos logit contributions
igsaw+0.277
andel+0.241
ay+0.221
 pige+0.205
aff+0.204
Neg logit contributions
 wouldn-0.336
 hadn-0.329
'd-0.325
 never-0.309
 also-0.290
Token But
Token ID887
Feature activation+6.29e+00

Pos logit contributions
igsaw+0.075
andel+0.070
 pige+0.064
ay+0.061
 bou+0.061
Neg logit contributions
 hadn-0.065
 wouldn-0.064
'd-0.063
 never-0.059
 also-0.056
Token recently
Token ID2904
Feature activation+2.35e+01

Pos logit contributions
 wouldn+0.154
 hadn+0.152
'd+0.151
 never+0.138
 also+0.126
Neg logit contributions
igsaw-0.180
andel-0.157
ay-0.153
 pige-0.147
 mel-0.141
Token Max
Token ID5436
Feature activation+5.72e+00

Pos logit contributions
 wouldn+0.886
 hadn+0.875
'd+0.869
 never+0.831
 also+0.782
Neg logit contributions
igsaw-0.375
andel-0.298
ay-0.270
 pige-0.252
aff-0.235
Token and
Token ID290
Feature activation+3.21e+00

Pos logit contributions
'd+0.099
 hadn+0.098
 wouldn+0.096
bled+0.089
Looks+0.082
Neg logit contributions
igsaw-0.163
 pige-0.153
 k-0.152
 par-0.151
 rh-0.150
Token Sarah
Token ID10490
Feature activation+6.90e+00

Pos logit contributions
 wouldn+0.081
 hadn+0.080
'd+0.078
 never+0.075
 also+0.068
Neg logit contributions
igsaw-0.087
andel-0.085
ay-0.078
 pige-0.077
aff-0.076
Token had
Token ID550
Feature activation+1.85e+01

Pos logit contributions
 hadn+0.108
 wouldn+0.105
'd+0.105
bled+0.098
 never+0.086
Neg logit contributions
igsaw-0.207
 pige-0.202
 oak-0.197
 past-0.195
 k-0.194
Token begun
Token ID9258
Feature activation+5.20e+00

Pos logit contributions
 wouldn+0.513
 hadn+0.498
'd+0.495
 never+0.440
 also+0.414
Neg logit contributions
igsaw-0.459
 pige-0.394
andel-0.390
ay-0.377
 purpose-0.366
Token He
Token ID679
Feature activation+1.92e+01

Pos logit contributions
igsaw+0.067
andel+0.059
ay+0.058
 pige+0.054
aff+0.052
Neg logit contributions
 wouldn-0.063
'd-0.061
 hadn-0.061
 never-0.057
 also-0.051
Token has
Token ID468
Feature activation+1.89e+01

Pos logit contributions
bled+0.355
 hadn+0.351
'd+0.322
 wouldn+0.320
itates+0.319
Neg logit contributions
igsaw-0.500
 k-0.500
ay-0.498
 par-0.485
 rh-0.477
Token been
Token ID587
Feature activation+1.64e+01

Pos logit contributions
 wouldn+0.563
 hadn+0.546
'd+0.539
itates+0.472
 never+0.465
Neg logit contributions
igsaw-0.406
 pige-0.356
ay-0.348
 mel-0.345
andel-0.341
Token so
Token ID523
Feature activation-1.19e+00

Pos logit contributions
 wouldn+0.415
'd+0.411
 hadn+0.405
 never+0.353
 could+0.328
Neg logit contributions
igsaw-0.442
andel-0.397
 pige-0.375
ay-0.374
 past-0.367
Token spoiled
Token ID34781
Feature activation-7.26e+00

Pos logit contributions
igsaw+0.027
andel+0.022
ay+0.021
 pige+0.021
 past+0.020
Neg logit contributions
 wouldn-0.037
'd-0.037
 hadn-0.036
 never-0.033
 probably-0.031
Token recently
Token ID2904
Feature activation+2.35e+01

Pos logit contributions
igsaw+0.152
andel+0.127
ay+0.123
 pige+0.116
aff+0.110
Neg logit contributions
 wouldn-0.231
 hadn-0.231
'd-0.230
 never-0.213
 also-0.200
Token,
Token ID11
Feature activation-6.18e+00

Pos logit contributions
 wouldn+0.764
 hadn+0.756
'd+0.750
 never+0.706
 also+0.663
Neg logit contributions
igsaw-0.485
andel-0.406
ay-0.384
 pige-0.363
aff-0.346
Token so
Token ID523
Feature activation-1.22e+00

Pos logit contributions
igsaw+0.121
ay+0.113
 pige+0.111
andel+0.110
 mel+0.109
Neg logit contributions
'd-0.179
 hadn-0.170
 wouldn-0.167
itates-0.158
 managed-0.154
Token let
Token ID1309
Feature activation+3.03e+00

Pos logit contributions
igsaw+0.028
ay+0.026
andel+0.025
 pige+0.024
 mel+0.024
Neg logit contributions
'd-0.034
 wouldn-0.034
 hadn-0.033
 never-0.029
 probably-0.029
Token's
Token ID338
Feature activation+4.13e+00

Pos logit contributions
 hadn+0.083
'd+0.082
 wouldn+0.077
 never+0.070
 also+0.067
Neg logit contributions
ay-0.070
igsaw-0.068
andel-0.066
 k-0.059
 par-0.059
Token pick
Token ID2298
Feature activation-3.75e+00

Pos logit contributions
 hadn+0.114
 wouldn+0.112
'd+0.111
itates+0.095
 never+0.094
Neg logit contributions
igsaw-0.098
ay-0.088
 purpose-0.079
 mel-0.078
andel-0.078
Token a
Token ID257
Feature activation-6.35e+00

Pos logit contributions
 never+0.044
 hadn+0.044
 wouldn+0.043
'd+0.042
 also+0.039
Neg logit contributions
igsaw-0.060
andel-0.054
ay-0.052
 gol-0.049
 bou-0.048
Token long
Token ID890
Feature activation-1.18e+01

Pos logit contributions
igsaw+0.150
andel+0.140
ay+0.130
 pige+0.120
aff+0.116
Neg logit contributions
'd-0.198
 never-0.190
 hadn-0.190
 wouldn-0.190
 also-0.171
Token time
Token ID640
Feature activation-4.28e+00

Pos logit contributions
igsaw+0.331
andel+0.297
ay+0.272
 pige+0.270
aff+0.263
Neg logit contributions
 never-0.315
'd-0.304
 wouldn-0.292
 hadn-0.289
 also-0.264
Token,
Token ID11
Feature activation-7.44e+00

Pos logit contributions
igsaw+0.098
andel+0.084
ay+0.078
 pige+0.074
aff+0.071
Neg logit contributions
 wouldn-0.133
 hadn-0.132
'd-0.130
 never-0.126
 also-0.117
Token but
Token ID475
Feature activation+8.34e+00

Pos logit contributions
igsaw+0.144
 pige+0.129
 golf+0.128
 past+0.126
 mel+0.121
Neg logit contributions
 wouldn-0.234
'd-0.227
 hadn-0.224
 never-0.199
itates-0.192
Token recently
Token ID2904
Feature activation+2.24e+01

Pos logit contributions
 wouldn+0.203
'd+0.202
 hadn+0.198
 never+0.175
 probably+0.162
Neg logit contributions
igsaw-0.229
 pige-0.197
andel-0.195
ay-0.190
 mel-0.188
Token they
Token ID484
Feature activation+1.63e+01

Pos logit contributions
 wouldn+0.737
 hadn+0.729
'd+0.723
 never+0.684
 also+0.640
Neg logit contributions
igsaw-0.456
andel-0.381
ay-0.358
 pige-0.343
aff-0.323
Token started
Token ID2067
Feature activation+7.02e+00

Pos logit contributions
 hadn+0.254
'd+0.248
 wouldn+0.241
Looks+0.224
bled+0.214
Neg logit contributions
 pige-0.452
igsaw-0.449
 oak-0.445
 k-0.439
 mel-0.439
Token to
Token ID284
Feature activation-3.04e-01

Pos logit contributions
 wouldn+0.216
 hadn+0.212
'd+0.210
 never+0.201
 also+0.186
Neg logit contributions
igsaw-0.162
andel-0.139
ay-0.133
 pige-0.124
aff-0.120
Token disagree
Token ID12546
Feature activation-3.97e+00

Pos logit contributions
igsaw+0.008
andel+0.007
 pige+0.006
ay+0.006
 purpose+0.006
Neg logit contributions
 wouldn-0.008
 hadn-0.008
'd-0.008
 never-0.007
 could-0.007
Token about
Token ID546
Feature activation+1.38e+00

Pos logit contributions
igsaw+0.091
andel+0.078
ay+0.075
 pige+0.070
aff+0.068
Neg logit contributions
 wouldn-0.121
 hadn-0.120
'd-0.119
 never-0.112
 also-0.103
Token laughed
Token ID13818
Feature activation+1.18e-01

Pos logit contributions
igsaw+0.070
ay+0.068
 mel+0.067
 batch+0.067
 k+0.066
Neg logit contributions
 wouldn-0.076
 hadn-0.075
'd-0.074
 probably-0.068
 doomed-0.067
Token and
Token ID290
Feature activation+3.25e+00

Pos logit contributions
 wouldn+0.004
 hadn+0.004
'd+0.004
 never+0.004
 also+0.003
Neg logit contributions
igsaw-0.002
andel-0.002
ay-0.002
 pige-0.002
 bou-0.002
Token said
Token ID531
Feature activation+6.29e+00

Pos logit contributions
'd+0.076
 wouldn+0.074
 hadn+0.073
 never+0.066
 probably+0.065
Neg logit contributions
igsaw-0.087
 pige-0.078
 bou-0.077
 mel-0.077
andel-0.075
Token it
Token ID340
Feature activation+9.39e+00

Pos logit contributions
 hadn+0.226
 wouldn+0.226
'd+0.226
 never+0.211
 also+0.201
Neg logit contributions
igsaw-0.105
andel-0.083
ay-0.078
 pige-0.076
 mel-0.069
Token was
Token ID373
Feature activation+1.85e+01

Pos logit contributions
'd+0.185
 hadn+0.185
bled+0.184
itates+0.169
 wouldn+0.168
Neg logit contributions
igsaw-0.218
 k-0.212
 past-0.209
ay-0.200
 video-0.200
Token probably
Token ID2192
Feature activation+2.22e+01

Pos logit contributions
'd+0.506
 wouldn+0.502
 hadn+0.495
 never+0.457
 also+0.419
Neg logit contributions
igsaw-0.469
andel-0.402
ay-0.391
 purpose-0.370
 mel-0.365
Token just
Token ID655
Feature activation+1.39e+01

Pos logit contributions
'd+0.589
 hadn+0.566
 wouldn+0.556
itates+0.536
 never+0.509
Neg logit contributions
igsaw-0.521
 sight-0.457
 past-0.451
 purpose-0.447
 mel-0.427
Token the
Token ID262
Feature activation-1.15e+01

Pos logit contributions
'd+0.399
 wouldn+0.399
 hadn+0.394
 never+0.362
 also+0.337
Neg logit contributions
igsaw-0.340
andel-0.285
ay-0.270
 pige-0.264
 mel-0.259
Token wind
Token ID2344
Feature activation-6.81e+00

Pos logit contributions
igsaw+0.295
andel+0.257
ay+0.244
 pige+0.236
aff+0.226
Neg logit contributions
 wouldn-0.320
 hadn-0.315
'd-0.311
 never-0.292
 also-0.269
Token in
Token ID287
Feature activation-3.60e+00

Pos logit contributions
igsaw+0.137
ay+0.115
andel+0.113
 pige+0.109
 past+0.105
Neg logit contributions
 hadn-0.205
 wouldn-0.203
'd-0.203
 never-0.193
 also-0.180
Token the
Token ID262
Feature activation-1.15e+01

Pos logit contributions
igsaw+0.097
andel+0.089
 pige+0.086
aff+0.084
ay+0.082
Neg logit contributions
 hadn-0.090
 wouldn-0.088
 never-0.088
'd-0.085
 also-0.080
Tokenresa
Token ID14625
Feature activation-5.70e+00

Pos logit contributions
igsaw+0.211
andel+0.182
ay+0.173
aff+0.167
round+0.157
Neg logit contributions
 hadn-0.282
 wouldn-0.279
'd-0.276
 never-0.270
 also-0.259
Token replied
Token ID8712
Feature activation+4.55e+00

Pos logit contributions
igsaw+0.166
ay+0.144
 pige+0.143
andel+0.143
 mel+0.142
Neg logit contributions
 wouldn-0.116
 hadn-0.113
'd-0.113
 never-0.101
 probably-0.095
Token,
Token ID11
Feature activation-6.32e+00

Pos logit contributions
 hadn+0.183
'd+0.183
 wouldn+0.181
 never+0.170
 also+0.163
Neg logit contributions
igsaw-0.055
andel-0.039
ay-0.036
 past-0.031
 pige-0.031
Token "
Token ID366
Feature activation+1.56e-01

Pos logit contributions
igsaw+0.134
andel+0.116
ay+0.110
 past+0.107
 pige+0.107
Neg logit contributions
 wouldn-0.186
 hadn-0.184
'd-0.184
 never-0.168
 forgot-0.161
TokenI
Token ID40
Feature activation+1.89e+01

Pos logit contributions
 wouldn+0.004
 hadn+0.003
'd+0.003
 never+0.003
 forgot+0.003
Neg logit contributions
igsaw-0.005
 pige-0.004
andel-0.004
ay-0.004
 perspective-0.004
Token'd
Token ID1549
Feature activation+2.21e+01

Pos logit contributions
bled+0.416
 hadn+0.396
 wouldn+0.382
'd+0.355
 never+0.343
Neg logit contributions
igsaw-0.496
ay-0.453
oc-0.447
 gol-0.431
andel-0.421
Token much
Token ID881
Feature activation-1.53e+00

Pos logit contributions
 wouldn+0.600
 hadn+0.596
'd+0.563
bled+0.522
 never+0.502
Neg logit contributions
igsaw-0.506
andel-0.439
 perspective-0.432
 past-0.430
 pige-0.430
Token rather
Token ID2138
Feature activation+5.37e+00

Pos logit contributions
igsaw+0.035
andel+0.028
ay+0.027
 pige+0.026
 mel+0.026
Neg logit contributions
 wouldn-0.048
 hadn-0.047
'd-0.046
 never-0.043
 probably-0.039
Token stay
Token ID2652
Feature activation+5.32e+00

Pos logit contributions
 wouldn+0.150
 hadn+0.148
'd+0.147
 never+0.130
 could+0.122
Neg logit contributions
igsaw-0.132
andel-0.112
 pige-0.109
 mel-0.107
ay-0.106
Token here
Token ID994
Feature activation+1.60e+00

Pos logit contributions
 wouldn+0.147
 hadn+0.145
'd+0.145
 never+0.132
 also+0.121
Neg logit contributions
igsaw-0.137
andel-0.119
ay-0.114
 pige-0.107
 bou-0.106
Token and
Token ID290
Feature activation+3.09e+00

Pos logit contributions
 wouldn+0.050
 hadn+0.049
'd+0.048
 never+0.045
 also+0.042
Neg logit contributions
igsaw-0.036
andel-0.030
ay-0.029
 pige-0.027
aff-0.026
Token "
Token ID366
Feature activation+2.42e-01

Pos logit contributions
igsaw+0.162
andel+0.142
ay+0.135
 pige+0.131
aff+0.128
Neg logit contributions
 wouldn-0.182
 hadn-0.179
'd-0.178
 never-0.165
 also-0.151
TokenOf
Token ID5189
Feature activation-1.44e+00

Pos logit contributions
 wouldn+0.007
 hadn+0.007
'd+0.007
 never+0.006
 also+0.006
Neg logit contributions
igsaw-0.006
andel-0.005
ay-0.005
 pige-0.005
aff-0.005
Token course
Token ID1781
Feature activation+3.67e+00

Pos logit contributions
igsaw+0.030
andel+0.026
 pige+0.022
aff+0.022
ay+0.021
Neg logit contributions
 wouldn-0.048
 never-0.048
'd-0.047
 hadn-0.047
 also-0.043
Token,
Token ID11
Feature activation-6.53e+00

Pos logit contributions
 wouldn+0.115
 hadn+0.114
'd+0.113
 never+0.106
 also+0.099
Neg logit contributions
igsaw-0.081
andel-0.068
ay-0.065
 pige-0.062
aff-0.059
Token I
Token ID314
Feature activation+1.99e+01

Pos logit contributions
igsaw+0.196
andel+0.174
ay+0.171
 pige+0.164
 mel+0.158
Neg logit contributions
 wouldn-0.150
 hadn-0.148
'd-0.146
 never-0.133
 also-0.120
Token'd
Token ID1549
Feature activation+2.21e+01

Pos logit contributions
 hadn+0.359
 wouldn+0.346
 never+0.301
bled+0.293
'd+0.288
Neg logit contributions
igsaw-0.624
ay-0.614
oc-0.553
 gol-0.548
andel-0.530
Token be
Token ID307
Feature activation+1.21e+01

Pos logit contributions
 wouldn+0.575
'd+0.566
 hadn+0.566
 never+0.530
 also+0.481
Neg logit contributions
igsaw-0.615
andel-0.544
ay-0.512
 pige-0.498
aff-0.481
Token happy
Token ID3772
Feature activation-6.60e+00

Pos logit contributions
 wouldn+0.290
 hadn+0.285
'd+0.285
 never+0.272
 also+0.247
Neg logit contributions
igsaw-0.362
andel-0.325
ay-0.303
 pige-0.294
aff-0.292
Token to
Token ID284
Feature activation+9.84e-01

Pos logit contributions
igsaw+0.150
andel+0.127
ay+0.121
 pige+0.117
aff+0.110
Neg logit contributions
 wouldn-0.202
 hadn-0.200
'd-0.198
 never-0.185
 also-0.173
Token teach
Token ID4545
Feature activation-7.96e+00

Pos logit contributions
'd+0.023
 wouldn+0.022
 never+0.022
 hadn+0.021
 also+0.020
Neg logit contributions
igsaw-0.031
andel-0.028
 pige-0.025
ay-0.025
 bou-0.024
Token you
Token ID345
Feature activation+1.64e+01

Pos logit contributions
igsaw+0.279
andel+0.255
ay+0.234
 gol+0.230
 pige+0.228
Neg logit contributions
 wouldn-0.148
 hadn-0.144
 never-0.140
'd-0.140
 also-0.119
Token said
Token ID531
Feature activation+6.25e+00

Pos logit contributions
'd+0.090
 wouldn+0.087
 hadn+0.084
itates+0.082
 probably+0.076
Neg logit contributions
 mel-0.067
 par-0.066
 past-0.065
 oak-0.065
 bou-0.065
Token,
Token ID11
Feature activation-6.03e+00

Pos logit contributions
 hadn+0.250
 wouldn+0.249
'd+0.247
 never+0.233
 also+0.225
Neg logit contributions
igsaw-0.079
andel-0.056
ay-0.053
 pige-0.046
 bou-0.043
Token "
Token ID366
Feature activation+3.52e-01

Pos logit contributions
igsaw+0.113
andel+0.100
 past+0.093
ay+0.092
 pige+0.091
Neg logit contributions
 wouldn-0.187
'd-0.184
 hadn-0.183
 never-0.166
 forgot-0.164
TokenWe
Token ID1135
Feature activation+2.00e+01

Pos logit contributions
 wouldn+0.009
 hadn+0.008
'd+0.008
 never+0.007
 forgot+0.007
Neg logit contributions
igsaw-0.010
andel-0.009
ay-0.009
 pige-0.009
 gol-0.008
Token should
Token ID815
Feature activation+1.23e+01

Pos logit contributions
 hadn+0.371
bled+0.368
 wouldn+0.354
'd+0.337
itates+0.320
Neg logit contributions
igsaw-0.536
ay-0.528
 k-0.487
 gol-0.483
 past-0.480
Token definitely
Token ID4753
Feature activation+2.21e+01

Pos logit contributions
 hadn+0.271
 wouldn+0.269
'd+0.260
itates+0.249
bled+0.238
Neg logit contributions
igsaw-0.314
 purpose-0.295
 oak-0.292
 k-0.291
ay-0.288
Token play
Token ID711
Feature activation-1.67e+00

Pos logit contributions
 wouldn+0.594
 hadn+0.589
'd+0.567
 never+0.516
 also+0.481
Neg logit contributions
igsaw-0.554
andel-0.473
ay-0.463
 pige-0.449
 mel-0.443
Token hide
Token ID7808
Feature activation-1.60e-01

Pos logit contributions
igsaw+0.040
andel+0.034
ay+0.032
 pige+0.031
aff+0.030
Neg logit contributions
 wouldn-0.050
 hadn-0.049
'd-0.049
 never-0.046
 also-0.043
Token and
Token ID290
Feature activation+3.46e+00

Pos logit contributions
igsaw+0.004
andel+0.003
ay+0.003
 pige+0.003
aff+0.003
Neg logit contributions
 wouldn-0.005
 hadn-0.005
'd-0.005
 never-0.004
 also-0.004
Token seek
Token ID5380
Feature activation-1.78e+00

Pos logit contributions
 wouldn+0.125
'd+0.122
 hadn+0.122
 never+0.109
 probably+0.109
Neg logit contributions
igsaw-0.054
 golf-0.041
 oak-0.040
 mel-0.040
 past-0.040
Token again
Token ID757
Feature activation+8.41e-01

Pos logit contributions
igsaw+0.047
andel+0.041
ay+0.039
 pige+0.038
aff+0.036
Neg logit contributions
 wouldn-0.049
 hadn-0.048
'd-0.048
 never-0.045
 also-0.041
Token,
Token ID11
Feature activation-6.33e+00

Pos logit contributions
 hadn+0.286
'd+0.280
 wouldn+0.280
 never+0.264
 probably+0.260
Neg logit contributions
igsaw-0.062
ay-0.044
andel-0.037
 pige-0.031
 past-0.031
Token "
Token ID366
Feature activation+4.80e-01

Pos logit contributions
igsaw+0.175
andel+0.154
ay+0.147
 pige+0.142
aff+0.137
Neg logit contributions
 wouldn-0.165
 hadn-0.162
'd-0.161
 never-0.150
 also-0.137
TokenSure
Token ID19457
Feature activation+2.19e+00

Pos logit contributions
'd+0.012
 hadn+0.012
 wouldn+0.012
 never+0.011
 also+0.010
Neg logit contributions
igsaw-0.014
andel-0.012
ay-0.011
 pige-0.011
 bou-0.010
Token,
Token ID11
Feature activation-6.35e+00

Pos logit contributions
 wouldn+0.076
 hadn+0.075
'd+0.075
 never+0.073
 also+0.067
Neg logit contributions
igsaw-0.043
andel-0.037
 pige-0.031
ay-0.030
aff-0.030
Token we
Token ID356
Feature activation+1.81e+01

Pos logit contributions
igsaw+0.196
andel+0.175
ay+0.170
 pige+0.165
aff+0.159
Neg logit contributions
 wouldn-0.143
 hadn-0.141
'd-0.139
 never-0.127
 also-0.114
Token'd
Token ID1549
Feature activation+2.20e+01

Pos logit contributions
 hadn+0.367
 wouldn+0.359
'd+0.328
 never+0.312
 probably+0.285
Neg logit contributions
igsaw-0.568
ay-0.541
andel-0.504
 gol-0.488
 mel-0.481
Token love
Token ID1842
Feature activation+9.45e-01

Pos logit contributions
 wouldn+0.593
 hadn+0.584
'd+0.576
 never+0.538
 also+0.495
Neg logit contributions
igsaw-0.586
andel-0.512
ay-0.490
 pige-0.473
aff-0.454
Token to
Token ID284
Feature activation+1.12e+00

Pos logit contributions
 never+0.025
 wouldn+0.024
 hadn+0.024
'd+0.024
 also+0.022
Neg logit contributions
igsaw-0.027
andel-0.025
ay-0.022
aff-0.021
 gol-0.020
Token."
Token ID526
Feature activation-4.57e+00

Pos logit contributions
'd+0.032
 never+0.031
 wouldn+0.031
 hadn+0.030
 also+0.028
Neg logit contributions
igsaw-0.030
andel-0.028
 pige-0.024
ay-0.023
aff-0.022
Token
Token ID198
Feature activation-3.94e+00

Pos logit contributions
igsaw+0.142
andel+0.130
ay+0.116
 perspective+0.116
 oak+0.113
Neg logit contributions
 hadn-0.100
 wouldn-0.100
'd-0.096
 never-0.095
 also-0.091
Token
Token ID198
Feature activation-3.95e+00

Pos logit contributions
igsaw+0.126
andel+0.112
ay+0.098
 gol+0.098
 perspective+0.097
Neg logit contributions
 never-0.086
 hadn-0.085
'd-0.083
 also-0.083
 wouldn-0.081
Token.
Token ID13
Feature activation-3.11e+00

Pos logit contributions
igsaw+0.198
andel+0.166
 pige+0.160
ay+0.159
 mel+0.152
Neg logit contributions
 wouldn-0.258
 hadn-0.256
'd-0.255
 never-0.237
 also-0.220
Token She
Token ID1375
Feature activation+1.82e+01

Pos logit contributions
igsaw+0.090
andel+0.081
ay+0.077
 pige+0.076
 mel+0.072
Neg logit contributions
 wouldn-0.074
 hadn-0.074
'd-0.072
 never-0.068
 also-0.061
Token said
Token ID531
Feature activation+6.11e+00

Pos logit contributions
 wouldn+0.394
 hadn+0.376
'd+0.376
itates+0.344
 probably+0.330
Neg logit contributions
igsaw-0.459
 par-0.431
 mel-0.427
 pige-0.422
 k-0.418
Token it
Token ID340
Feature activation+9.15e+00

Pos logit contributions
 hadn+0.196
'd+0.196
 wouldn+0.195
 never+0.176
 also+0.169
Neg logit contributions
igsaw-0.123
andel-0.102
 pige-0.099
ay-0.096
 mel-0.092
Token had
Token ID550
Feature activation+1.90e+01

Pos logit contributions
 hadn+0.154
'd+0.152
bled+0.148
itates+0.147
 wouldn+0.146
Neg logit contributions
igsaw-0.245
 k-0.226
 past-0.226
ay-0.224
 par-0.219
Token probably
Token ID2192
Feature activation+2.20e+01

Pos logit contributions
 wouldn+0.428
 hadn+0.418
'd+0.410
 never+0.355
itates+0.344
Neg logit contributions
igsaw-0.552
andel-0.485
 pige-0.478
ay-0.475
 mel-0.473
Token died
Token ID3724
Feature activation+4.14e+00

Pos logit contributions
'd+0.426
 hadn+0.425
 wouldn+0.416
itates+0.410
 also+0.357
Neg logit contributions
igsaw-0.625
 sight-0.561
 mel-0.560
 past-0.559
 purpose-0.556
Token of
Token ID286
Feature activation-2.88e+00

Pos logit contributions
'd+0.131
 hadn+0.131
 wouldn+0.130
 never+0.121
 also+0.116
Neg logit contributions
igsaw-0.085
andel-0.070
ay-0.067
 pige-0.062
 purpose-0.062
Token natural
Token ID3288
Feature activation-8.87e+00

Pos logit contributions
igsaw+0.070
andel+0.060
ay+0.057
 pige+0.055
aff+0.053
Neg logit contributions
 wouldn-0.084
 hadn-0.083
'd-0.082
 never-0.077
 also-0.071
Token causes
Token ID5640
Feature activation+3.83e+00

Pos logit contributions
igsaw+0.196
andel+0.166
ay+0.157
 pige+0.146
aff+0.141
Neg logit contributions
 wouldn-0.281
 hadn-0.277
'd-0.273
 never-0.262
 also-0.240
Token.
Token ID13
Feature activation-3.10e+00

Pos logit contributions
 wouldn+0.107
 hadn+0.105
'd+0.104
 never+0.098
 also+0.090
Neg logit contributions
igsaw-0.098
andel-0.085
ay-0.081
 pige-0.078
aff-0.075
Token leaves
Token ID5667
Feature activation-3.88e+00

Pos logit contributions
igsaw+0.231
andel+0.184
ay+0.176
 gol+0.168
 pige+0.160
Neg logit contributions
 wouldn-0.336
 hadn-0.334
'd-0.333
 never-0.323
 also-0.297
Token,
Token ID11
Feature activation-6.18e+00

Pos logit contributions
igsaw+0.084
andel+0.071
ay+0.067
 pige+0.064
aff+0.061
Neg logit contributions
 wouldn-0.122
 hadn-0.121
'd-0.120
 never-0.113
 also-0.105
Token and
Token ID290
Feature activation+3.40e+00

Pos logit contributions
igsaw+0.134
andel+0.116
 pige+0.112
ay+0.112
 mel+0.106
Neg logit contributions
 wouldn-0.186
 hadn-0.185
'd-0.183
 never-0.168
 forgot-0.157
Token this
Token ID428
Feature activation-7.22e+00

Pos logit contributions
 wouldn+0.092
 hadn+0.091
'd+0.091
 never+0.081
 probably+0.077
Neg logit contributions
igsaw-0.083
andel-0.072
 pige-0.071
ay-0.069
 mel-0.069
Token one
Token ID530
Feature activation-4.57e+00

Pos logit contributions
igsaw+0.151
andel+0.127
ay+0.119
 pige+0.114
aff+0.108
Neg logit contributions
 wouldn-0.235
 hadn-0.232
'd-0.229
 never-0.217
 also-0.203
Token probably
Token ID2192
Feature activation+2.20e+01

Pos logit contributions
igsaw+0.157
andel+0.142
ay+0.139
 pige+0.135
 mel+0.133
Neg logit contributions
 wouldn-0.083
 hadn-0.082
'd-0.080
 never-0.072
 also-0.062
Token used
Token ID973
Feature activation+2.99e+00

Pos logit contributions
 wouldn+0.520
 hadn+0.519
'd+0.494
 never+0.458
 also+0.431
Neg logit contributions
igsaw-0.574
andel-0.519
 past-0.499
 pige-0.493
ay-0.490
Token to
Token ID284
Feature activation+1.08e+00

Pos logit contributions
 wouldn+0.092
 hadn+0.091
'd+0.090
 never+0.084
 also+0.079
Neg logit contributions
igsaw-0.067
andel-0.058
ay-0.054
 pige-0.052
aff-0.049
Token belong
Token ID5594
Feature activation+3.37e+00

Pos logit contributions
 wouldn+0.028
 hadn+0.028
'd+0.027
 never+0.025
 also+0.023
Neg logit contributions
igsaw-0.029
andel-0.026
ay-0.025
 pige-0.024
 mel-0.023
Token to
Token ID284
Feature activation+1.08e+00

Pos logit contributions
 wouldn+0.108
 hadn+0.108
'd+0.105
 never+0.097
 also+0.093
Neg logit contributions
igsaw-0.065
andel-0.053
ay-0.052
 pige-0.052
 past-0.050
Token someone
Token ID2130
Feature activation+6.27e+00

Pos logit contributions
 wouldn+0.031
 hadn+0.031
'd+0.030
 never+0.029
 also+0.027
Neg logit contributions
igsaw-0.027
andel-0.023
ay-0.022
 pige-0.021
aff-0.020
Token.
Token ID13
Feature activation-2.90e+00

Pos logit contributions
 wouldn+0.524
 hadn+0.520
'd+0.511
 never+0.483
 also+0.452
Neg logit contributions
igsaw-0.345
andel-0.288
ay-0.275
 pige-0.260
aff-0.249
Token Jack
Token ID3619
Feature activation+6.65e+00

Pos logit contributions
igsaw+0.076
ay+0.063
andel+0.062
 pige+0.057
 gol+0.056
Neg logit contributions
 wouldn-0.082
'd-0.080
 hadn-0.079
 never-0.074
 also-0.068
Token knew
Token ID2993
Feature activation+6.70e+00

Pos logit contributions
 wouldn+0.126
 hadn+0.122
'd+0.119
 probably+0.103
 never+0.103
Neg logit contributions
igsaw-0.201
ay-0.184
 par-0.179
 k-0.179
andel-0.178
Token he
Token ID339
Feature activation+2.04e+01

Pos logit contributions
'd+0.196
 hadn+0.186
 wouldn+0.185
itates+0.181
 missed+0.168
Neg logit contributions
igsaw-0.143
 mel-0.117
 golf-0.116
 gol-0.110
 steel-0.107
Token would
Token ID561
Feature activation+1.92e+01

Pos logit contributions
bled+0.383
Acc+0.342
Looks+0.335
itates+0.324
bling+0.324
Neg logit contributions
oc-0.507
ay-0.493
 past-0.468
 par-0.461
 animal-0.457
Token definitely
Token ID4753
Feature activation+2.20e+01

Pos logit contributions
 wouldn+0.405
 hadn+0.401
'd+0.367
bled+0.323
itates+0.318
Neg logit contributions
igsaw-0.575
ay-0.518
andel-0.500
 oak-0.494
 purpose-0.493
Token come
Token ID1282
Feature activation+7.09e+00

Pos logit contributions
 wouldn+0.574
 hadn+0.570
'd+0.523
bled+0.496
 never+0.480
Neg logit contributions
igsaw-0.549
ay-0.453
 oak-0.451
 mel-0.447
 pige-0.443
Token back
Token ID736
Feature activation-2.25e+00

Pos logit contributions
 wouldn+0.187
 hadn+0.186
'd+0.179
 never+0.165
 forgot+0.151
Neg logit contributions
igsaw-0.188
andel-0.165
ay-0.164
 pige-0.152
 bou-0.150
Token to
Token ID284
Feature activation+8.62e-01

Pos logit contributions
igsaw+0.056
andel+0.048
ay+0.047
 pige+0.045
 mel+0.043
Neg logit contributions
 wouldn-0.064
 hadn-0.063
'd-0.062
 never-0.057
 also-0.053
Token slide
Token ID10649
Feature activation-5.02e+00

Pos logit contributions
 wouldn+0.026
 hadn+0.026
'd+0.025
 never+0.025
 also+0.023
Neg logit contributions
igsaw-0.020
andel-0.018
ay-0.016
 pige-0.015
aff-0.015
Token again
Token ID757
Feature activation+9.41e-01

Pos logit contributions
igsaw+0.140
andel+0.123
ay+0.117
 pige+0.113
aff+0.110
Neg logit contributions
 wouldn-0.130
 hadn-0.127
'd-0.126
 never-0.118
 also-0.108
Token the
Token ID262
Feature activation-1.14e+01

Pos logit contributions
igsaw+0.194
andel+0.168
ay+0.157
 pige+0.151
aff+0.146
Neg logit contributions
 wouldn-0.266
 hadn-0.262
'd-0.257
 never-0.248
 also-0.232
Token mixer
Token ID33938
Feature activation-7.94e+00

Pos logit contributions
igsaw+0.203
andel+0.166
ay+0.151
 pige+0.144
aff+0.136
Neg logit contributions
 wouldn-0.411
 hadn-0.406
'd-0.403
 never-0.383
 also-0.361
Token.
Token ID13
Feature activation-2.46e+00

Pos logit contributions
igsaw+0.150
andel+0.118
ay+0.116
 mel+0.111
 past+0.110
Neg logit contributions
 hadn-0.260
'd-0.258
 wouldn-0.256
 never-0.240
 forgot-0.226
Token It
Token ID632
Feature activation+1.49e+01

Pos logit contributions
igsaw+0.063
andel+0.051
ay+0.051
aff+0.047
 pige+0.045
Neg logit contributions
 wouldn-0.072
'd-0.069
 hadn-0.068
 never-0.064
 forgot-0.059
Token was
Token ID373
Feature activation+1.84e+01

Pos logit contributions
 wouldn+0.257
 hadn+0.252
'd+0.248
 never+0.225
itates+0.217
Neg logit contributions
igsaw-0.445
 k-0.414
ay-0.408
 mel-0.407
 past-0.407
Token definitely
Token ID4753
Feature activation+2.20e+01

Pos logit contributions
 wouldn+0.484
'd+0.475
 hadn+0.474
 never+0.434
 also+0.392
Neg logit contributions
igsaw-0.489
andel-0.433
ay-0.417
 pige-0.397
 mel-0.391
Token sm
Token ID895
Feature activation-1.51e+01

Pos logit contributions
'd+0.660
 wouldn+0.660
 hadn+0.650
 never+0.609
itates+0.579
Neg logit contributions
igsaw-0.458
 mel-0.368
 past-0.361
 sight-0.359
andel-0.355
Tokenelly
Token ID6148
Feature activation-7.71e+00

Pos logit contributions
igsaw+0.219
andel+0.172
 pige+0.158
 bou+0.148
ay+0.147
Neg logit contributions
 wouldn-0.569
 hadn-0.560
'd-0.558
 never-0.535
 also-0.503
Token.
Token ID13
Feature activation-2.56e+00

Pos logit contributions
igsaw+0.149
andel+0.122
ay+0.115
 pige+0.110
 mel+0.105
Neg logit contributions
 wouldn-0.258
 hadn-0.258
'd-0.257
 never-0.241
 also-0.227
Token
Token ID198
Feature activation-3.94e+00

Pos logit contributions
igsaw+0.062
ay+0.049
andel+0.048
aff+0.044
 pige+0.043
Neg logit contributions
 wouldn-0.079
'd-0.077
 hadn-0.074
 never-0.070
 forgot-0.065
Token
Token ID198
Feature activation-3.96e+00

Pos logit contributions
igsaw+0.081
ay+0.066
andel+0.066
aff+0.058
 pige+0.058
Neg logit contributions
 wouldn-0.133
'd-0.127
 hadn-0.126
 never-0.116
 forgot-0.110
TokenPick
Token ID31686
Feature activation+4.03e+00

Pos logit contributions
igsaw+0.004
andel+0.003
ay+0.003
 pige+0.003
aff+0.003
Neg logit contributions
 wouldn-0.003
 hadn-0.003
'd-0.003
 never-0.003
 also-0.003
Token out
Token ID503
Feature activation+1.07e+00

Pos logit contributions
 wouldn+0.111
'd+0.107
 hadn+0.107
 never+0.101
 also+0.093
Neg logit contributions
igsaw-0.105
andel-0.092
ay-0.087
 pige-0.084
 gol-0.082
Token whatever
Token ID4232
Feature activation-1.98e+00

Pos logit contributions
 wouldn+0.032
 hadn+0.032
'd+0.032
 never+0.030
 also+0.028
Neg logit contributions
igsaw-0.025
andel-0.021
ay-0.020
 pige-0.019
aff-0.018
Token yogurt
Token ID32132
Feature activation-5.20e+00

Pos logit contributions
igsaw+0.057
andel+0.050
 pige+0.050
ay+0.049
 mel+0.047
Neg logit contributions
 wouldn-0.046
 hadn-0.046
'd-0.045
 never-0.041
 also-0.037
Token you
Token ID345
Feature activation+1.63e+01

Pos logit contributions
igsaw+0.137
ay+0.118
andel+0.115
 pige+0.113
 mel+0.112
Neg logit contributions
 hadn-0.137
 wouldn-0.134
'd-0.132
 never-0.124
 also-0.113
Token'd
Token ID1549
Feature activation+2.20e+01

Pos logit contributions
 hadn+0.380
 wouldn+0.379
'd+0.349
 never+0.338
bled+0.308
Neg logit contributions
igsaw-0.455
andel-0.396
ay-0.393
 mel-0.381
 pige-0.380
Token like
Token ID588
Feature activation+2.77e+00

Pos logit contributions
 wouldn+0.548
 hadn+0.537
'd+0.512
 never+0.461
 also+0.434
Neg logit contributions
igsaw-0.598
andel-0.516
ay-0.505
 pige-0.501
 mel-0.497
Token for
Token ID329
Feature activation+2.87e+00

Pos logit contributions
 wouldn+0.083
 hadn+0.081
'd+0.081
 never+0.078
 also+0.071
Neg logit contributions
igsaw-0.067
andel-0.057
ay-0.053
 pige-0.050
 gol-0.049
Token a
Token ID257
Feature activation-5.77e+00

Pos logit contributions
 wouldn+0.071
 hadn+0.070
'd+0.069
 never+0.066
 also+0.059
Neg logit contributions
igsaw-0.083
andel-0.073
ay-0.070
 pige-0.067
aff-0.065
Token treat
Token ID2190
Feature activation-3.28e+00

Pos logit contributions
igsaw+0.115
andel+0.103
ay+0.096
 pige+0.093
aff+0.089
Neg logit contributions
'd-0.191
 wouldn-0.187
 hadn-0.186
 never-0.176
 also-0.164
Token,
Token ID11
Feature activation-6.57e+00

Pos logit contributions
igsaw+0.075
andel+0.065
ay+0.059
 pige+0.057
aff+0.055
Neg logit contributions
 wouldn-0.101
'd-0.099
 hadn-0.098
 never-0.094
 also-0.087
Token the
Token ID262
Feature activation-1.14e+01

Pos logit contributions
igsaw+0.087
andel+0.081
aff+0.079
round+0.074
 pige+0.074
Neg logit contributions
 never-0.091
 hadn-0.086
 also-0.083
 wouldn-0.082
'd-0.081
Token garden
Token ID11376
Feature activation-2.26e+00

Pos logit contributions
igsaw+0.252
andel+0.215
ay+0.198
aff+0.198
 pige+0.190
Neg logit contributions
 wouldn-0.351
 hadn-0.350
'd-0.348
 never-0.341
 also-0.317
Token.
Token ID13
Feature activation-2.28e+00

Pos logit contributions
igsaw+0.049
andel+0.041
ay+0.038
 pige+0.036
aff+0.035
Neg logit contributions
 wouldn-0.073
 hadn-0.072
'd-0.071
 never-0.068
 also-0.064
Token He
Token ID679
Feature activation+1.97e+01

Pos logit contributions
igsaw+0.061
ay+0.054
andel+0.052
oc+0.049
 pige+0.048
Neg logit contributions
 wouldn-0.060
 hadn-0.058
'd-0.057
 never-0.052
 forgot-0.048
Token was
Token ID373
Feature activation+1.86e+01

Pos logit contributions
bled+0.423
 hadn+0.401
 wouldn+0.373
'd+0.370
Looks+0.347
Neg logit contributions
 rh-0.479
 k-0.479
 par-0.478
oc-0.474
igsaw-0.461
Token probably
Token ID2192
Feature activation+2.20e+01

Pos logit contributions
 wouldn+0.526
'd+0.525
 hadn+0.502
 possibly+0.415
 could+0.413
Neg logit contributions
igsaw-0.422
 sight-0.373
andel-0.372
 past-0.371
 blue-0.364
Token just
Token ID655
Feature activation+1.42e+01

Pos logit contributions
'd+0.578
 wouldn+0.576
 hadn+0.565
 never+0.505
 also+0.473
Neg logit contributions
igsaw-0.544
andel-0.468
 pige-0.464
 past-0.460
 sight-0.456
Token looking
Token ID2045
Feature activation+8.54e+00

Pos logit contributions
 wouldn+0.376
'd+0.371
 hadn+0.369
 never+0.338
 also+0.309
Neg logit contributions
igsaw-0.377
andel-0.331
ay-0.308
 pige-0.308
 mel-0.296
Token for
Token ID329
Feature activation+3.19e+00

Pos logit contributions
 wouldn+0.232
 hadn+0.229
'd+0.227
 never+0.212
 also+0.195
Neg logit contributions
igsaw-0.224
andel-0.196
ay-0.187
 pige-0.180
aff-0.173
Token some
Token ID617
Feature activation-4.21e+00

Pos logit contributions
 wouldn+0.088
 hadn+0.087
'd+0.086
 never+0.081
 also+0.075
Neg logit contributions
igsaw-0.083
andel-0.072
ay-0.068
 pige-0.066
aff-0.063
Token food
Token ID2057
Feature activation-7.56e+00

Pos logit contributions
igsaw+0.095
andel+0.078
ay+0.073
aff+0.069
 pige+0.068
Neg logit contributions
 wouldn-0.133
 hadn-0.131
'd-0.130
 never-0.124
 also-0.116
Token amazed
Token ID24562
Feature activation-6.83e+00

Pos logit contributions
'd+0.487
 wouldn+0.486
 hadn+0.462
 possibly+0.381
 could+0.376
Neg logit contributions
igsaw-0.461
andel-0.419
 past-0.399
 pige-0.397
 blue-0.395
Token at
Token ID379
Feature activation-8.91e+00

Pos logit contributions
igsaw+0.185
andel+0.162
 pige+0.144
ay+0.142
 bou+0.141
Neg logit contributions
 wouldn-0.188
 hadn-0.182
'd-0.177
 never-0.175
 could-0.164
Token how
Token ID703
Feature activation-5.00e+00

Pos logit contributions
igsaw+0.202
andel+0.164
ay+0.158
 mel+0.148
 pige+0.147
Neg logit contributions
 wouldn-0.277
'd-0.276
 hadn-0.272
 never-0.251
 forgot-0.235
Token neatly
Token ID29776
Feature activation+6.48e+00

Pos logit contributions
igsaw+0.109
ay+0.089
andel+0.084
 mel+0.083
 pige+0.082
Neg logit contributions
 wouldn-0.158
'd-0.156
 hadn-0.154
 never-0.139
 probably-0.133
Token he
Token ID339
Feature activation+2.04e+01

Pos logit contributions
'd+0.156
 wouldn+0.155
 hadn+0.155
 never+0.145
 probably+0.133
Neg logit contributions
igsaw-0.177
andel-0.148
ay-0.145
 mel-0.139
 past-0.138
Token'd
Token ID1549
Feature activation+2.20e+01

Pos logit contributions
 wouldn+0.307
itates+0.305
bled+0.301
 not+0.284
'd+0.282
Neg logit contributions
igsaw-0.553
 mel-0.526
ay-0.512
 par-0.510
oc-0.507
Token done
Token ID1760
Feature activation+6.36e+00

Pos logit contributions
 wouldn+0.394
 hadn+0.351
'd+0.335
bled+0.318
itates+0.307
Neg logit contributions
igsaw-0.709
ay-0.619
 mel-0.619
 purpose-0.613
 view-0.612
Token.
Token ID13
Feature activation-3.08e+00

Pos logit contributions
'd+0.205
 wouldn+0.204
 hadn+0.203
 never+0.184
 forgot+0.176
Neg logit contributions
igsaw-0.132
ay-0.100
andel-0.098
 purpose-0.097
 golf-0.093
Token
Token ID198
Feature activation-3.90e+00

Pos logit contributions
igsaw+0.078
andel+0.065
ay+0.064
 pige+0.060
aff+0.058
Neg logit contributions
 wouldn-0.089
'd-0.087
 hadn-0.086
 never-0.080
 also-0.073
Token
Token ID198
Feature activation-3.86e+00

Pos logit contributions
igsaw+0.096
andel+0.083
ay+0.078
 pige+0.076
aff+0.072
Neg logit contributions
 wouldn-0.112
 hadn-0.111
'd-0.110
 never-0.104
 also-0.096
Token"
Token ID1
Feature activation-1.12e+00

Pos logit contributions
igsaw+0.098
andel+0.083
ay+0.082
 pige+0.076
 mel+0.074
Neg logit contributions
 wouldn-0.111
 hadn-0.107
'd-0.106
 never-0.098
 probably-0.090
Token wagon
Token ID31860
Feature activation-9.75e+00

Pos logit contributions
igsaw+0.444
andel+0.338
 pige+0.303
aff+0.287
 gol+0.283
Neg logit contributions
 wouldn-0.714
'd-0.696
 hadn-0.671
 never-0.655
 also-0.616
Token crashed
Token ID14997
Feature activation-7.48e+00

Pos logit contributions
 k+0.276
ay+0.274
 automobile+0.270
 past+0.269
 par+0.269
Neg logit contributions
'd-0.147
itates-0.144
 hadn-0.143
Acc-0.135
Looks-0.135
Token into
Token ID656
Feature activation-6.47e+00

Pos logit contributions
igsaw+0.164
ay+0.148
andel+0.144
 past+0.136
 sight+0.130
Neg logit contributions
'd-0.219
 hadn-0.217
 wouldn-0.215
 never-0.200
 also-0.190
Token a
Token ID257
Feature activation-7.21e+00

Pos logit contributions
igsaw+0.182
andel+0.168
aff+0.156
 pige+0.155
ay+0.154
Neg logit contributions
 hadn-0.158
 wouldn-0.154
 never-0.152
'd-0.147
 also-0.138
Token d
Token ID288
Feature activation-6.10e+00

Pos logit contributions
igsaw+0.153
andel+0.130
aff+0.121
 pige+0.116
ay+0.111
Neg logit contributions
 hadn-0.235
'd-0.234
 wouldn-0.232
 never-0.224
 also-0.209
Tokenandel
Token ID33134
Feature activation-3.10e+01

Pos logit contributions
igsaw+0.130
 pige+0.106
 bou+0.101
 purpose+0.097
andel+0.096
Neg logit contributions
 wouldn-0.191
'd-0.185
 hadn-0.184
 never-0.176
 also-0.166
Tokenion
Token ID295
Feature activation-8.41e+00

Pos logit contributions
igsaw+1.044
andel+0.924
ay+0.891
 pige+0.889
 bou+0.865
Neg logit contributions
 wouldn-0.617
 hadn-0.595
'd-0.593
 never-0.538
 also-0.475
Token bush
Token ID24484
Feature activation-9.51e+00

Pos logit contributions
igsaw+0.181
andel+0.157
ay+0.157
 pige+0.145
 past+0.144
Neg logit contributions
 hadn-0.247
'd-0.245
 wouldn-0.242
 never-0.228
 also-0.213
Token!
Token ID0
Feature activation-2.93e+00

Pos logit contributions
igsaw+0.172
ay+0.140
andel+0.140
 pige+0.124
 past+0.123
Neg logit contributions
 hadn-0.318
 wouldn-0.317
'd-0.317
 never-0.296
 also-0.282
Token Jack
Token ID3619
Feature activation+3.50e+00

Pos logit contributions
igsaw+0.074
andel+0.062
ay+0.060
 pige+0.055
 mel+0.054
Neg logit contributions
 wouldn-0.085
'd-0.084
 hadn-0.083
 never-0.078
 also-0.073
Token was
Token ID373
Feature activation+1.56e+01

Pos logit contributions
 wouldn+0.061
'd+0.059
 hadn+0.057
 probably+0.051
 never+0.047
Neg logit contributions
igsaw-0.098
oc-0.096
ay-0.096
 k-0.096
 par-0.094
Token,
Token ID11
Feature activation-8.15e+00

Pos logit contributions
igsaw+0.175
andel+0.146
ay+0.136
 pige+0.129
aff+0.123
Neg logit contributions
 wouldn-0.281
 hadn-0.277
'd-0.275
 never-0.261
 also-0.244
Token Jack
Token ID3619
Feature activation+4.07e+00

Pos logit contributions
igsaw+0.186
 golf+0.158
 mel+0.155
 past+0.151
ay+0.149
Neg logit contributions
'd-0.235
 wouldn-0.231
 hadn-0.226
 never-0.213
 probably-0.202
Token held
Token ID2714
Feature activation-1.60e-01

Pos logit contributions
 wouldn+0.094
 hadn+0.091
'd+0.090
 never+0.082
 probably+0.078
Neg logit contributions
igsaw-0.097
 pige-0.093
 mel-0.092
 par-0.091
 past-0.091
Token a
Token ID257
Feature activation-7.18e+00

Pos logit contributions
igsaw+0.005
andel+0.004
 pige+0.004
ay+0.004
aff+0.004
Neg logit contributions
 wouldn-0.004
 hadn-0.004
'd-0.004
 never-0.004
 also-0.003
Token d
Token ID288
Feature activation-5.87e+00

Pos logit contributions
igsaw+0.192
aff+0.167
andel+0.161
ican+0.156
 pige+0.154
Neg logit contributions
'd-0.194
 wouldn-0.188
 hadn-0.186
 also-0.179
 never-0.176
Tokenandel
Token ID33134
Feature activation-3.08e+01

Pos logit contributions
igsaw+0.083
 pige+0.058
 purpose+0.052
oc+0.051
ay+0.050
Neg logit contributions
 wouldn-0.223
'd-0.219
 hadn-0.216
 never-0.209
 also-0.202
Tokenion
Token ID295
Feature activation-8.12e+00

Pos logit contributions
igsaw+0.902
andel+0.783
ay+0.752
 pige+0.745
 purpose+0.715
Neg logit contributions
 wouldn-0.745
'd-0.724
 hadn-0.722
 never-0.672
 also-0.612
Token.
Token ID13
Feature activation-4.42e+00

Pos logit contributions
igsaw+0.196
andel+0.171
ay+0.164
 pige+0.157
 mel+0.152
Neg logit contributions
 wouldn-0.231
 hadn-0.231
'd-0.228
 never-0.213
 also-0.196
Token He
Token ID679
Feature activation+1.68e+01

Pos logit contributions
igsaw+0.129
andel+0.112
ay+0.110
 pige+0.103
aff+0.101
Neg logit contributions
 wouldn-0.110
 hadn-0.107
'd-0.106
 never-0.099
 also-0.089
Token took
Token ID1718
Feature activation+3.86e+00

Pos logit contributions
 wouldn+0.319
 hadn+0.306
'd+0.305
 never+0.265
 probably+0.260
Neg logit contributions
igsaw-0.463
 pige-0.455
 mel-0.451
 k-0.449
 par-0.442
Token a
Token ID257
Feature activation-7.45e+00

Pos logit contributions
 wouldn+0.094
 hadn+0.090
'd+0.090
 never+0.086
 also+0.077
Neg logit contributions
igsaw-0.115
andel-0.099
 pige-0.096
ay-0.093
aff-0.092
Token a
Token ID257
Feature activation-7.03e+00

Pos logit contributions
igsaw+0.161
andel+0.143
ay+0.138
 pige+0.133
aff+0.130
Neg logit contributions
 wouldn-0.137
 hadn-0.136
'd-0.134
 never-0.128
 also-0.116
Token large
Token ID1588
Feature activation-1.80e+01

Pos logit contributions
igsaw+0.164
andel+0.146
ay+0.139
aff+0.135
ites+0.129
Neg logit contributions
'd-0.209
 hadn-0.206
 wouldn-0.204
 never-0.196
 also-0.193
Token field
Token ID2214
Feature activation-1.23e+01

Pos logit contributions
igsaw+0.396
aff+0.368
andel+0.366
ites+0.342
 pige+0.335
Neg logit contributions
 wouldn-0.482
'd-0.481
 also-0.476
 hadn-0.474
 never-0.466
Token of
Token ID286
Feature activation-5.18e+00

Pos logit contributions
igsaw+0.276
andel+0.235
ay+0.221
 pige+0.212
aff+0.202
Neg logit contributions
 wouldn-0.383
 hadn-0.378
'd-0.375
 never-0.355
 also-0.330
Token d
Token ID288
Feature activation-5.99e+00

Pos logit contributions
igsaw+0.111
andel+0.088
ay+0.084
aff+0.079
 gol+0.073
Neg logit contributions
 wouldn-0.168
 hadn-0.167
'd-0.166
 never-0.160
 also-0.154
Tokenandel
Token ID33134
Feature activation-3.07e+01

Pos logit contributions
igsaw+0.115
 pige+0.090
 bou+0.086
oc+0.085
ay+0.085
Neg logit contributions
 wouldn-0.191
'd-0.187
 hadn-0.185
 never-0.180
 also-0.173
Tokenions
Token ID507
Feature activation+2.52e-01

Pos logit contributions
igsaw+0.737
andel+0.608
ay+0.591
 pige+0.573
 purpose+0.563
Neg logit contributions
 wouldn-0.898
'd-0.879
 hadn-0.878
 never-0.829
 also-0.769
Token.
Token ID13
Feature activation-4.56e+00

Pos logit contributions
 hadn+0.008
 wouldn+0.008
'd+0.008
 never+0.008
 forgot+0.007
Neg logit contributions
igsaw-0.004
andel-0.003
 past-0.003
 pige-0.003
ay-0.003
Token It
Token ID632
Feature activation+1.22e+01

Pos logit contributions
igsaw+0.108
andel+0.094
ay+0.087
 pige+0.084
aff+0.082
Neg logit contributions
 wouldn-0.137
 hadn-0.136
'd-0.133
 never-0.127
 could-0.119
Token was
Token ID373
Feature activation+1.56e+01

Pos logit contributions
 wouldn+0.235
 hadn+0.226
'd+0.223
bled+0.218
 never+0.201
Neg logit contributions
 blue-0.313
 k-0.309
 past-0.307
 oak-0.306
 pine-0.305
Token so
Token ID523
Feature activation-3.52e+00

Pos logit contributions
 wouldn+0.404
'd+0.398
 hadn+0.394
 never+0.348
 forgot+0.327
Neg logit contributions
igsaw-0.405
andel-0.372
ay-0.345
 pige-0.338
 past-0.338
Token spotted
Token ID13489
Feature activation+1.36e+00

Pos logit contributions
 wouldn+0.308
 hadn+0.295
'd+0.286
itates+0.279
 probably+0.272
Neg logit contributions
 mel-0.489
oc-0.486
 par-0.478
igsaw-0.472
ay-0.470
Token a
Token ID257
Feature activation-6.41e+00

Pos logit contributions
 wouldn+0.035
 hadn+0.034
'd+0.033
 never+0.032
 also+0.029
Neg logit contributions
igsaw-0.040
andel-0.035
ay-0.034
 pige-0.032
aff-0.032
Token patch
Token ID8529
Feature activation-9.36e+00

Pos logit contributions
igsaw+0.152
andel+0.134
ay+0.123
aff+0.122
 pige+0.116
Neg logit contributions
'd-0.196
 hadn-0.195
 wouldn-0.193
 never-0.180
 also-0.175
Token of
Token ID286
Feature activation-4.66e+00

Pos logit contributions
igsaw+0.172
andel+0.144
ay+0.140
 pige+0.125
 mel+0.122
Neg logit contributions
 wouldn-0.321
 hadn-0.319
'd-0.315
 never-0.296
 forgot-0.280
Token d
Token ID288
Feature activation-5.37e+00

Pos logit contributions
igsaw+0.091
andel+0.075
ay+0.070
 pige+0.066
aff+0.063
Neg logit contributions
 wouldn-0.159
 hadn-0.157
'd-0.156
 never-0.148
 also-0.140
Tokenandel
Token ID33134
Feature activation-3.02e+01

Pos logit contributions
igsaw+0.102
 pige+0.079
 bou+0.075
 purpose+0.073
ay+0.073
Neg logit contributions
 wouldn-0.178
'd-0.173
 hadn-0.172
 never-0.164
 also-0.156
Tokenions
Token ID507
Feature activation+7.69e-01

Pos logit contributions
igsaw+0.795
andel+0.672
ay+0.652
 pige+0.640
 bou+0.621
Neg logit contributions
 wouldn-0.818
'd-0.794
 hadn-0.794
 never-0.742
 also-0.684
Token.
Token ID13
Feature activation-3.90e+00

Pos logit contributions
'd+0.026
 hadn+0.026
 wouldn+0.025
 never+0.024
 forgot+0.024
Neg logit contributions
igsaw-0.011
 past-0.010
ay-0.010
 mel-0.009
andel-0.009
Token He
Token ID679
Feature activation+1.75e+01

Pos logit contributions
igsaw+0.131
andel+0.119
ay+0.113
 pige+0.113
 bou+0.109
Neg logit contributions
 wouldn-0.078
 hadn-0.076
'd-0.074
 never-0.068
 could-0.061
Token ran
Token ID4966
Feature activation+3.74e+00

Pos logit contributions
 wouldn+0.319
 hadn+0.302
'd+0.301
 never+0.263
 probably+0.247
Neg logit contributions
igsaw-0.542
 mel-0.503
andel-0.498
 pige-0.488
 par-0.485
Token to
Token ID284
Feature activation-6.35e-01

Pos logit contributions
 wouldn+0.106
 hadn+0.102
'd+0.101
 never+0.100
 also+0.090
Neg logit contributions
igsaw-0.095
andel-0.084
ay-0.079
 pige-0.078
aff-0.074
Token to
Token ID284
Feature activation-3.25e-01

Pos logit contributions
 hadn+0.048
 wouldn+0.048
'd+0.047
 never+0.044
 also+0.041
Neg logit contributions
igsaw-0.037
andel-0.032
ay-0.030
 pige-0.029
 mel-0.028
Token be
Token ID307
Feature activation+1.06e+01

Pos logit contributions
igsaw+0.008
andel+0.007
ay+0.007
 pige+0.007
 purpose+0.007
Neg logit contributions
 wouldn-0.009
 hadn-0.009
'd-0.009
 never-0.008
 could-0.007
Token okay
Token ID8788
Feature activation+1.57e+00

Pos logit contributions
'd+0.287
 hadn+0.281
 wouldn+0.280
 never+0.241
 could+0.230
Neg logit contributions
igsaw-0.252
andel-0.230
 pige-0.219
 past-0.218
 golf-0.216
Token?"
Token ID1701
Feature activation-6.40e+00

Pos logit contributions
 wouldn+0.045
'd+0.044
 hadn+0.044
 never+0.043
 also+0.041
Neg logit contributions
igsaw-0.041
andel-0.035
ay-0.031
 pige-0.031
 gol-0.030
Token The
Token ID383
Feature activation-1.16e+01

Pos logit contributions
igsaw+0.146
andel+0.127
ay+0.118
 pige+0.110
aff+0.105
Neg logit contributions
 wouldn-0.202
 hadn-0.199
'd-0.195
 never-0.188
 also-0.174
Token paramed
Token ID34581
Feature activation-3.01e+01

Pos logit contributions
igsaw+0.283
andel+0.235
ay+0.222
aff+0.204
 pige+0.204
Neg logit contributions
 wouldn-0.354
 hadn-0.348
'd-0.345
 never-0.332
 also-0.311
Tokenic
Token ID291
Feature activation-7.90e+00

Pos logit contributions
igsaw+0.534
andel+0.433
ay+0.415
 pige+0.386
aff+0.371
Neg logit contributions
 wouldn-1.064
 hadn-1.058
'd-1.053
 never-0.996
 also-0.938
Token smiled
Token ID13541
Feature activation-2.31e+00

Pos logit contributions
igsaw+0.171
ay+0.156
andel+0.149
 pige+0.142
 mel+0.141
Neg logit contributions
 hadn-0.229
'd-0.227
 wouldn-0.225
 never-0.211
 probably-0.199
Token and
Token ID290
Feature activation+1.42e+00

Pos logit contributions
igsaw+0.043
andel+0.036
ay+0.035
 pige+0.031
 bou+0.031
Neg logit contributions
 hadn-0.078
 wouldn-0.078
'd-0.077
 never-0.073
 probably-0.067
Token said
Token ID531
Feature activation+4.05e+00

Pos logit contributions
'd+0.036
 hadn+0.034
 wouldn+0.034
 never+0.031
 probably+0.029
Neg logit contributions
igsaw-0.036
 mel-0.032
 pige-0.032
andel-0.031
 par-0.031
Token,
Token ID11
Feature activation-8.13e+00

Pos logit contributions
 hadn+0.159
'd+0.158
 wouldn+0.158
 never+0.149
 also+0.143
Neg logit contributions
igsaw-0.053
andel-0.039
ay-0.037
 pige-0.034
 mel-0.032
Token even
Token ID772
Feature activation+8.45e+00

Pos logit contributions
 wouldn+0.315
 hadn+0.287
bled+0.282
'd+0.279
itates+0.268
Neg logit contributions
igsaw-0.453
 mel-0.445
 par-0.445
 past-0.435
 k-0.432
Token spotted
Token ID13489
Feature activation+1.77e+00

Pos logit contributions
 wouldn+0.174
 hadn+0.171
'd+0.165
 never+0.153
 probably+0.141
Neg logit contributions
igsaw-0.262
andel-0.238
 pige-0.226
ay-0.223
 mel-0.222
Token a
Token ID257
Feature activation-6.22e+00

Pos logit contributions
 wouldn+0.047
 hadn+0.046
'd+0.046
 never+0.043
 also+0.039
Neg logit contributions
igsaw-0.048
andel-0.042
ay-0.040
 pige-0.038
aff-0.037
Token big
Token ID1263
Feature activation-1.72e+01

Pos logit contributions
igsaw+0.138
andel+0.119
ay+0.108
aff+0.103
 pige+0.101
Neg logit contributions
 hadn-0.197
'd-0.197
 wouldn-0.194
 never-0.185
 also-0.177
Token ch
Token ID442
Feature activation-1.17e+01

Pos logit contributions
igsaw+0.355
andel+0.318
aff+0.280
ay+0.273
 pige+0.270
Neg logit contributions
 hadn-0.540
 wouldn-0.536
'd-0.534
 never-0.519
 also-0.492
Tokenandel
Token ID33134
Feature activation-2.99e+01

Pos logit contributions
igsaw+0.204
 pige+0.170
 mel+0.155
 bou+0.154
 purpose+0.149
Neg logit contributions
 wouldn-0.387
'd-0.382
 hadn-0.379
 never-0.365
 probably-0.348
Tokenier
Token ID959
Feature activation-2.37e-01

Pos logit contributions
igsaw+0.886
andel+0.779
ay+0.749
 pige+0.732
aff+0.703
Neg logit contributions
 wouldn-0.717
 hadn-0.701
'd-0.696
 never-0.645
 also-0.585
Token that
Token ID326
Feature activation-1.93e-01

Pos logit contributions
igsaw+0.005
andel+0.004
ay+0.004
 pige+0.004
aff+0.004
Neg logit contributions
 wouldn-0.007
 hadn-0.007
'd-0.007
 never-0.007
 also-0.006
Token spark
Token ID9009
Feature activation-8.45e+00

Pos logit contributions
igsaw+0.006
ay+0.005
 past+0.005
andel+0.005
 sight+0.005
Neg logit contributions
'd-0.004
 wouldn-0.004
 hadn-0.003
 never-0.003
 probably-0.003
Tokenled
Token ID992
Feature activation+3.13e+00

Pos logit contributions
igsaw+0.235
andel+0.213
ay+0.206
 pige+0.197
aff+0.195
Neg logit contributions
 hadn-0.205
'd-0.201
 wouldn-0.201
 never-0.192
 also-0.175
Token like
Token ID588
Feature activation+1.54e+00

Pos logit contributions
 wouldn+0.086
 hadn+0.085
'd+0.084
 never+0.077
 forgot+0.072
Neg logit contributions
igsaw-0.078
andel-0.069
ay-0.065
 pige-0.063
 past-0.062
Token
Token ID198
Feature activation-5.20e+00

Pos logit contributions
igsaw+0.140
andel+0.124
ay+0.113
 pige+0.113
 mel+0.107
Neg logit contributions
 hadn-0.137
 wouldn-0.136
'd-0.135
 never-0.129
 also-0.120
TokenL
Token ID43
Feature activation-2.24e+00

Pos logit contributions
igsaw+0.114
andel+0.096
ay+0.093
 pige+0.087
aff+0.083
Neg logit contributions
 wouldn-0.165
 hadn-0.161
'd-0.161
 never-0.150
 also-0.139
Tokenily
Token ID813
Feature activation+5.11e+00

Pos logit contributions
igsaw+0.076
andel+0.065
ay+0.064
aff+0.062
 gol+0.059
Neg logit contributions
 wouldn-0.049
'd-0.047
 hadn-0.046
 never-0.043
 almost-0.037
Token asked
Token ID1965
Feature activation+1.20e+00

Pos logit contributions
 wouldn+0.150
'd+0.149
 hadn+0.147
 never+0.132
 probably+0.130
Neg logit contributions
igsaw-0.100
 pige-0.087
 par-0.085
oc-0.085
ay-0.085
Token the
Token ID262
Feature activation-1.26e+01

Pos logit contributions
 wouldn+0.032
 hadn+0.031
'd+0.030
 never+0.030
 also+0.027
Neg logit contributions
igsaw-0.034
andel-0.030
ay-0.028
 pige-0.026
aff-0.026
Token paramed
Token ID34581
Feature activation-2.98e+01

Pos logit contributions
igsaw+0.300
andel+0.256
ay+0.242
 pige+0.232
aff+0.222
Neg logit contributions
 wouldn-0.382
 hadn-0.376
'd-0.373
 never-0.352
 also-0.327
Tokenic
Token ID291
Feature activation-7.55e+00

Pos logit contributions
igsaw+0.585
andel+0.487
ay+0.471
 pige+0.445
aff+0.432
Neg logit contributions
 wouldn-0.992
 hadn-0.991
'd-0.980
 never-0.923
 also-0.867
Token,
Token ID11
Feature activation-7.62e+00

Pos logit contributions
igsaw+0.140
andel+0.123
ay+0.122
 pige+0.114
aff+0.111
Neg logit contributions
 hadn-0.249
'd-0.241
 wouldn-0.240
 never-0.224
 probably-0.214
Token "
Token ID366
Feature activation-1.50e+00

Pos logit contributions
igsaw+0.166
andel+0.147
ay+0.138
 pige+0.138
 mel+0.134
Neg logit contributions
 wouldn-0.223
'd-0.221
 hadn-0.220
 never-0.202
 forgot-0.189
TokenIs
Token ID3792
Feature activation+4.60e+00

Pos logit contributions
igsaw+0.040
andel+0.036
 pige+0.035
 mel+0.034
ay+0.034
Neg logit contributions
 wouldn-0.038
 hadn-0.037
'd-0.037
 never-0.034
 forgot-0.031
Token the
Token ID262
Feature activation-1.28e+01

Pos logit contributions
'd+0.141
 hadn+0.139
 wouldn+0.134
 never+0.122
 probably+0.118
Neg logit contributions
igsaw-0.094
 pige-0.084
 mel-0.078
andel-0.077
ay-0.075
Token the
Token ID262
Feature activation-1.26e+01

Pos logit contributions
igsaw+0.103
andel+0.087
ay+0.083
 pige+0.080
aff+0.079
Neg logit contributions
 wouldn-0.121
 hadn-0.119
'd-0.115
 never-0.114
 also-0.106
Token dirt
Token ID13647
Feature activation-6.40e+00

Pos logit contributions
igsaw+0.257
andel+0.213
ay+0.199
 pige+0.187
aff+0.180
Neg logit contributions
 wouldn-0.421
 hadn-0.416
'd-0.412
 never-0.393
 also-0.366
Token,
Token ID11
Feature activation-7.43e+00

Pos logit contributions
igsaw+0.133
andel+0.112
ay+0.105
 pige+0.100
aff+0.095
Neg logit contributions
 wouldn-0.208
 hadn-0.206
'd-0.204
 never-0.193
 also-0.180
Token yellow
Token ID7872
Feature activation-1.85e+01

Pos logit contributions
igsaw+0.137
 pige+0.132
 bou+0.130
andel+0.129
 past+0.126
Neg logit contributions
 wouldn-0.227
 hadn-0.226
'd-0.226
 never-0.201
 probably-0.193
Token d
Token ID288
Feature activation-5.03e+00

Pos logit contributions
igsaw+0.388
andel+0.319
ay+0.298
 pige+0.279
 gol+0.269
Neg logit contributions
 wouldn-0.612
 hadn-0.602
'd-0.599
 never-0.568
 also-0.527
Tokenandel
Token ID33134
Feature activation-2.96e+01

Pos logit contributions
igsaw+0.086
 pige+0.060
andel+0.059
ay+0.058
 purpose+0.054
Neg logit contributions
 wouldn-0.183
 hadn-0.179
'd-0.178
 never-0.170
 also-0.160
Tokenions
Token ID507
Feature activation+1.42e+00

Pos logit contributions
igsaw+0.868
andel+0.769
ay+0.736
 pige+0.713
aff+0.690
Neg logit contributions
 wouldn-0.718
 hadn-0.706
'd-0.698
 never-0.649
 also-0.588
Token sprung
Token ID39388
Feature activation+2.01e+00

Pos logit contributions
 wouldn+0.051
 hadn+0.051
'd+0.051
 never+0.048
 also+0.045
Neg logit contributions
igsaw-0.022
andel-0.018
ay-0.017
 pige-0.016
 bou-0.015
Token up
Token ID510
Feature activation-1.63e+00

Pos logit contributions
 wouldn+0.049
 hadn+0.047
'd+0.047
 never+0.044
 also+0.040
Neg logit contributions
igsaw-0.059
andel-0.051
 pige-0.049
ay-0.049
 gol-0.047
Token,
Token ID11
Feature activation-7.60e+00

Pos logit contributions
igsaw+0.043
andel+0.037
ay+0.035
 pige+0.034
aff+0.033
Neg logit contributions
 wouldn-0.045
 hadn-0.044
'd-0.044
 never-0.041
 also-0.038
Token and
Token ID290
Feature activation+2.06e+00

Pos logit contributions
igsaw+0.124
andel+0.111
 bou+0.110
 pige+0.108
 past+0.107
Neg logit contributions
 hadn-0.255
 wouldn-0.255
'd-0.253
 never-0.228
 probably-0.218
Token a
Token ID257
Feature activation-5.76e+00

Pos logit contributions
 wouldn+0.071
 hadn+0.070
'd+0.070
 never+0.065
 also+0.061
Neg logit contributions
igsaw-0.057
andel-0.049
ay-0.046
 pige-0.044
aff-0.042
Token big
Token ID1263
Feature activation-1.67e+01

Pos logit contributions
igsaw+0.124
andel+0.109
ay+0.098
aff+0.094
 pige+0.094
Neg logit contributions
 hadn-0.186
'd-0.183
 wouldn-0.183
 never-0.173
 also-0.165
Token,
Token ID11
Feature activation-6.85e+00

Pos logit contributions
igsaw+0.351
andel+0.324
aff+0.290
 pige+0.273
ay+0.272
Neg logit contributions
 hadn-0.510
 wouldn-0.508
'd-0.493
 never-0.490
 also-0.472
Token beautiful
Token ID4950
Feature activation-1.46e+01

Pos logit contributions
igsaw+0.140
andel+0.118
ay+0.110
 pige+0.104
aff+0.101
Neg logit contributions
 wouldn-0.226
 hadn-0.224
'd-0.221
 never-0.210
 also-0.197
Token ch
Token ID442
Feature activation-1.13e+01

Pos logit contributions
igsaw+0.305
andel+0.272
ites+0.244
aff+0.234
 pige+0.231
Neg logit contributions
 hadn-0.444
 wouldn-0.429
'd-0.425
 never-0.422
 also-0.402
Tokenandel
Token ID33134
Feature activation-2.94e+01

Pos logit contributions
igsaw+0.205
 pige+0.171
 mel+0.160
 bou+0.157
 purpose+0.152
Neg logit contributions
 wouldn-0.363
 hadn-0.358
'd-0.352
 never-0.343
 also-0.324
Tokenier
Token ID959
Feature activation+9.61e-02

Pos logit contributions
igsaw+0.780
andel+0.678
ay+0.647
 pige+0.626
aff+0.600
Neg logit contributions
 wouldn-0.799
 hadn-0.785
'd-0.778
 never-0.729
 also-0.669
Token sparkling
Token ID39072
Feature activation-9.27e+00

Pos logit contributions
 wouldn+0.003
 hadn+0.003
'd+0.003
 never+0.003
 also+0.003
Neg logit contributions
igsaw-0.002
andel-0.002
ay-0.002
 pige-0.002
aff-0.002
Token in
Token ID287
Feature activation-4.50e+00

Pos logit contributions
igsaw+0.284
andel+0.235
oc+0.230
opl+0.227
 gol+0.225
Neg logit contributions
 wouldn-0.209
 hadn-0.200
'd-0.195
 never-0.181
 almost-0.177
Token the
Token ID262
Feature activation-1.25e+01

Pos logit contributions
igsaw+0.125
andel+0.112
aff+0.110
round+0.109
 gol+0.107
Neg logit contributions
 never-0.110
 hadn-0.109
 also-0.102
 wouldn-0.099
'd-0.096
Token sunlight
Token ID19606
Feature activation-9.39e-01

Pos logit contributions
igsaw+0.295
andel+0.249
 pige+0.241
aff+0.239
 gol+0.233
Neg logit contributions
 hadn-0.364
 wouldn-0.362
'd-0.354
 never-0.350
 could-0.319
Token.
Token ID13
Feature activation-3.35e+00

Pos logit contributions
 hadn+0.091
'd+0.090
 wouldn+0.090
 never+0.084
 also+0.079
Neg logit contributions
igsaw-0.041
andel-0.036
ay-0.034
 past-0.030
 purpose-0.029
Token The
Token ID383
Feature activation-1.07e+01

Pos logit contributions
igsaw+0.082
andel+0.071
ay+0.069
 pige+0.063
aff+0.062
Neg logit contributions
 wouldn-0.098
'd-0.097
 hadn-0.096
 never-0.087
 forgot-0.079
Token driver
Token ID4639
Feature activation-2.88e+00

Pos logit contributions
igsaw+0.203
andel+0.172
ay+0.162
 pige+0.154
 mel+0.142
Neg logit contributions
 wouldn-0.366
 hadn-0.361
'd-0.359
 never-0.337
 also-0.316
Token and
Token ID290
Feature activation+2.27e+00

Pos logit contributions
ay+0.077
igsaw+0.076
 pige+0.076
 past+0.075
 par+0.074
Neg logit contributions
 hadn-0.057
 wouldn-0.057
'd-0.056
 probably-0.049
 never-0.048
Token the
Token ID262
Feature activation-1.25e+01

Pos logit contributions
 hadn+0.066
'd+0.066
 wouldn+0.065
 never+0.058
 probably+0.055
Neg logit contributions
igsaw-0.052
andel-0.046
 pige-0.045
 past-0.043
 par-0.043
Token paramed
Token ID34581
Feature activation-2.91e+01

Pos logit contributions
igsaw+0.249
andel+0.207
ay+0.192
 pige+0.181
aff+0.174
Neg logit contributions
 wouldn-0.422
 hadn-0.417
'd-0.413
 never-0.393
 also-0.368
Tokenic
Token ID291
Feature activation-6.70e+00

Pos logit contributions
igsaw+0.570
andel+0.479
ay+0.470
 pige+0.437
aff+0.431
Neg logit contributions
 wouldn-0.968
 hadn-0.966
'd-0.960
 never-0.898
 also-0.851
Token jumped
Token ID11687
Feature activation+1.96e+00

Pos logit contributions
ay+0.153
igsaw+0.147
andel+0.145
 pige+0.138
aff+0.136
Neg logit contributions
 hadn-0.174
'd-0.171
 wouldn-0.169
 probably-0.153
 never-0.152
Token out
Token ID503
Feature activation+2.97e-01

Pos logit contributions
 wouldn+0.053
 hadn+0.053
'd+0.053
 never+0.048
 also+0.045
Neg logit contributions
igsaw-0.051
andel-0.045
ay-0.044
 pige-0.041
aff-0.040
Token.
Token ID13
Feature activation-3.54e+00

Pos logit contributions
 wouldn+0.009
 hadn+0.009
'd+0.008
 never+0.008
 also+0.008
Neg logit contributions
igsaw-0.007
andel-0.006
 pige-0.006
 gol-0.006
ay-0.005
Token They
Token ID1119
Feature activation+1.73e+01

Pos logit contributions
igsaw+0.095
andel+0.083
ay+0.080
 pige+0.076
aff+0.074
Neg logit contributions
 wouldn-0.096
 hadn-0.094
'd-0.094
 never-0.086
 also-0.078
Token shiny
Token ID22441
Feature activation-1.38e+01

Pos logit contributions
igsaw+0.108
andel+0.091
ay+0.081
 pige+0.077
aff+0.075
Neg logit contributions
 hadn-0.176
 wouldn-0.175
'd-0.175
 never-0.167
 also-0.159
Token floor
Token ID4314
Feature activation+2.23e-01

Pos logit contributions
igsaw+0.316
andel+0.255
 pige+0.233
ay+0.232
ites+0.218
Neg logit contributions
 wouldn-0.429
'd-0.413
 hadn-0.402
 never-0.388
 almost-0.370
Token and
Token ID290
Feature activation+3.39e+00

Pos logit contributions
 wouldn+0.007
 hadn+0.007
'd+0.007
 never+0.006
 also+0.006
Neg logit contributions
igsaw-0.005
andel-0.004
ay-0.004
 pige-0.004
aff-0.004
Token a
Token ID257
Feature activation-5.35e+00

Pos logit contributions
 wouldn+0.111
'd+0.107
 hadn+0.105
 never+0.096
 probably+0.091
Neg logit contributions
igsaw-0.066
 pige-0.058
 past-0.055
 mel-0.054
 par-0.054
Token ch
Token ID442
Feature activation-1.04e+01

Pos logit contributions
igsaw+0.107
andel+0.089
ay+0.082
 pige+0.077
aff+0.074
Neg logit contributions
 wouldn-0.179
 hadn-0.178
'd-0.177
 never-0.168
 also-0.159
Tokenandel
Token ID33134
Feature activation-2.83e+01

Pos logit contributions
igsaw+0.172
 pige+0.124
andel+0.123
ay+0.117
 mel+0.116
Neg logit contributions
 wouldn-0.375
'd-0.368
 hadn-0.368
 never-0.353
 also-0.331
Tokenier
Token ID959
Feature activation+1.24e+00

Pos logit contributions
igsaw+0.940
andel+0.850
ay+0.817
 pige+0.793
aff+0.774
Neg logit contributions
 wouldn-0.577
 hadn-0.567
'd-0.558
 never-0.511
 also-0.453
Token.
Token ID13
Feature activation-2.58e+00

Pos logit contributions
 wouldn+0.039
 hadn+0.039
'd+0.038
 never+0.037
 also+0.034
Neg logit contributions
igsaw-0.027
andel-0.023
ay-0.021
 pige-0.020
aff-0.019
Token
Token ID198
Feature activation-4.18e+00

Pos logit contributions
igsaw+0.083
andel+0.074
ay+0.071
 pige+0.069
aff+0.067
Neg logit contributions
 wouldn-0.055
 hadn-0.055
'd-0.053
 never-0.050
 also-0.045
Token
Token ID198
Feature activation-4.21e+00

Pos logit contributions
igsaw+0.098
andel+0.084
ay+0.080
 pige+0.077
aff+0.074
Neg logit contributions
 wouldn-0.126
 hadn-0.123
'd-0.122
 never-0.114
 also-0.106
TokenL
Token ID43
Feature activation-1.26e+00

Pos logit contributions
igsaw+0.098
andel+0.083
ay+0.079
 pige+0.076
aff+0.072
Neg logit contributions
 wouldn-0.129
 hadn-0.126
'd-0.126
 never-0.117
 also-0.108
Token tree
Token ID5509
Feature activation-7.41e+00

Pos logit contributions
igsaw+0.312
andel+0.259
ay+0.223
 purpose+0.213
 bou+0.207
Neg logit contributions
 wouldn-0.565
 hadn-0.563
 never-0.559
'd-0.559
 also-0.538
Token,
Token ID11
Feature activation-5.92e+00

Pos logit contributions
igsaw+0.161
andel+0.136
ay+0.128
 pige+0.122
aff+0.116
Neg logit contributions
 wouldn-0.236
 hadn-0.233
'd-0.231
 never-0.219
 also-0.204
Token a
Token ID257
Feature activation-5.21e+00

Pos logit contributions
igsaw+0.108
 pige+0.102
 past+0.095
 mel+0.095
 oak+0.095
Neg logit contributions
 wouldn-0.188
'd-0.185
 hadn-0.182
 never-0.163
 forgot-0.157
Token yellow
Token ID7872
Feature activation-1.75e+01

Pos logit contributions
igsaw+0.106
andel+0.089
ay+0.081
aff+0.077
 pige+0.072
Neg logit contributions
 hadn-0.175
'd-0.175
 wouldn-0.172
 never-0.164
 also-0.158
Token d
Token ID288
Feature activation-3.97e+00

Pos logit contributions
igsaw+0.376
andel+0.284
ay+0.245
aff+0.241
ites+0.240
Neg logit contributions
 wouldn-0.584
'd-0.582
 hadn-0.559
 never-0.550
 also-0.519
Tokenandel
Token ID33134
Feature activation-2.83e+01

Pos logit contributions
igsaw+0.073
 purpose+0.058
 pige+0.058
oc+0.057
 bou+0.055
Neg logit contributions
 wouldn-0.128
'd-0.126
 hadn-0.123
 never-0.120
 also-0.115
Tokenion
Token ID295
Feature activation-5.29e+00

Pos logit contributions
igsaw+0.821
andel+0.698
ay+0.674
 pige+0.666
 purpose+0.659
Neg logit contributions
 wouldn-0.697
'd-0.678
 hadn-0.672
 never-0.631
 also-0.576
Token and
Token ID290
Feature activation+3.46e+00

Pos logit contributions
igsaw+0.118
andel+0.101
ay+0.097
 pige+0.094
 mel+0.090
Neg logit contributions
 wouldn-0.159
 hadn-0.159
'd-0.157
 never-0.147
 also-0.136
Token a
Token ID257
Feature activation-5.27e+00

Pos logit contributions
 wouldn+0.087
'd+0.085
 hadn+0.084
 never+0.073
itates+0.072
Neg logit contributions
 pige-0.081
 mel-0.080
igsaw-0.078
 past-0.078
 oak-0.077
Token little
Token ID1310
Feature activation-1.13e+01

Pos logit contributions
igsaw+0.105
andel+0.086
ay+0.080
 pige+0.074
aff+0.073
Neg logit contributions
 wouldn-0.178
 hadn-0.177
'd-0.177
 never-0.167
 also-0.158
Token red
Token ID2266
Feature activation-1.85e+01

Pos logit contributions
igsaw+0.257
andel+0.214
ay+0.205
aff+0.188
 pige+0.179
Neg logit contributions
 wouldn-0.358
 hadn-0.348
'd-0.348
 never-0.325
 also-0.311
Token which
Token ID543
Feature activation+3.35e+00

Pos logit contributions
 past+0.095
 golf+0.087
 oak+0.087
 pige+0.087
 blue+0.086
Neg logit contributions
 hadn-0.194
 wouldn-0.193
'd-0.190
bled-0.169
 managed-0.168
Token was
Token ID373
Feature activation+1.86e+01

Pos logit contributions
 hadn+0.073
 wouldn+0.073
'd+0.072
 never+0.065
 probably+0.059
Neg logit contributions
igsaw-0.100
andel-0.087
ay-0.087
 pige-0.087
 oak-0.085
Token filled
Token ID5901
Feature activation-8.23e+00

Pos logit contributions
 wouldn+0.525
 hadn+0.517
'd+0.512
 never+0.466
 probably+0.425
Neg logit contributions
igsaw-0.460
andel-0.402
ay-0.382
 pige-0.370
 mel-0.361
Token with
Token ID351
Feature activation-1.61e+00

Pos logit contributions
igsaw+0.205
andel+0.180
ay+0.164
aff+0.159
 pige+0.158
Neg logit contributions
 wouldn-0.235
 never-0.232
 hadn-0.230
'd-0.228
 also-0.211
Token d
Token ID288
Feature activation-4.06e+00

Pos logit contributions
igsaw+0.037
andel+0.030
ay+0.029
aff+0.027
round+0.026
Neg logit contributions
 wouldn-0.049
 hadn-0.049
'd-0.048
 never-0.047
 also-0.044
Tokenandel
Token ID33134
Feature activation-2.81e+01

Pos logit contributions
igsaw+0.066
oc+0.053
 pige+0.051
 perspective+0.051
 purpose+0.050
Neg logit contributions
 wouldn-0.133
'd-0.129
 hadn-0.128
 never-0.124
 also-0.121
Tokenions
Token ID507
Feature activation+2.99e+00

Pos logit contributions
igsaw+0.749
andel+0.617
ay+0.603
 purpose+0.594
 pige+0.592
Neg logit contributions
 wouldn-0.744
'd-0.723
 hadn-0.719
 never-0.682
 also-0.622
Token and
Token ID290
Feature activation+3.50e+00

Pos logit contributions
 hadn+0.101
 wouldn+0.100
'd+0.100
 never+0.093
 probably+0.088
Neg logit contributions
igsaw-0.053
andel-0.044
ay-0.043
 pige-0.040
 past-0.039
Token butterflies
Token ID44571
Feature activation-6.11e+00

Pos logit contributions
 wouldn+0.107
 hadn+0.107
'd+0.106
 never+0.094
 probably+0.092
Neg logit contributions
igsaw-0.071
 pige-0.062
 mel-0.062
andel-0.062
 past-0.059
Token.
Token ID13
Feature activation-2.41e+00

Pos logit contributions
igsaw+0.118
andel+0.100
ay+0.095
 pige+0.092
aff+0.088
Neg logit contributions
 wouldn-0.199
 hadn-0.199
'd-0.195
 never-0.184
 also-0.173
Token
Token ID198
Feature activation-3.87e+00

Pos logit contributions
igsaw+0.075
andel+0.067
ay+0.063
 pige+0.062
 gol+0.061
Neg logit contributions
 hadn-0.054
 wouldn-0.054
'd-0.053
 never-0.050
 also-0.047
Token garden
Token ID11376
Feature activation-2.53e+00

Pos logit contributions
igsaw+0.259
andel+0.218
ay+0.205
 pige+0.196
aff+0.186
Neg logit contributions
 wouldn-0.389
 hadn-0.383
'd-0.380
 never-0.359
 also-0.334
Token and
Token ID290
Feature activation+3.25e+00

Pos logit contributions
igsaw+0.053
andel+0.044
ay+0.042
 pige+0.040
 bou+0.038
Neg logit contributions
 wouldn-0.082
 hadn-0.081
'd-0.080
 never-0.075
 also-0.070
Token started
Token ID2067
Feature activation+9.22e+00

Pos logit contributions
'd+0.070
 hadn+0.069
 wouldn+0.068
 probably+0.060
 never+0.058
Neg logit contributions
igsaw-0.089
 past-0.083
 mel-0.082
 bou-0.078
andel-0.078
Token picking
Token ID10868
Feature activation-3.86e+00

Pos logit contributions
 wouldn+0.275
 hadn+0.271
'd+0.269
 never+0.251
 also+0.233
Neg logit contributions
igsaw-0.218
andel-0.188
ay-0.176
 pige-0.171
 mel-0.163
Token d
Token ID288
Feature activation-4.05e+00

Pos logit contributions
igsaw+0.107
andel+0.091
ay+0.089
 gol+0.085
 pige+0.083
Neg logit contributions
 wouldn-0.099
 hadn-0.092
'd-0.091
 never-0.089
 also-0.086
Tokenandel
Token ID33134
Feature activation-2.80e+01

Pos logit contributions
igsaw+0.090
 pige+0.075
oc+0.071
 purpose+0.071
 billboard+0.070
Neg logit contributions
 wouldn-0.119
 hadn-0.114
'd-0.113
 never-0.106
 also-0.100
Tokenions
Token ID507
Feature activation+3.02e+00

Pos logit contributions
igsaw+0.640
andel+0.543
ay+0.515
 pige+0.494
aff+0.468
Neg logit contributions
 wouldn-0.862
 hadn-0.848
'd-0.842
 never-0.795
 also-0.738
Token and
Token ID290
Feature activation+3.40e+00

Pos logit contributions
 hadn+0.102
 wouldn+0.102
'd+0.101
 never+0.095
 probably+0.089
Neg logit contributions
igsaw-0.056
andel-0.046
ay-0.044
 pige-0.040
 bou-0.039
Token other
Token ID584
Feature activation-2.22e+00

Pos logit contributions
 wouldn+0.094
'd+0.094
 hadn+0.093
 never+0.085
 probably+0.079
Neg logit contributions
igsaw-0.083
andel-0.072
 mel-0.068
 pige-0.067
 bou-0.066
Token pretty
Token ID2495
Feature activation-1.40e+01

Pos logit contributions
igsaw+0.044
andel+0.037
ay+0.035
 pige+0.033
aff+0.031
Neg logit contributions
 wouldn-0.074
 hadn-0.073
'd-0.073
 never-0.069
 also-0.064
Token flowers
Token ID12734
Feature activation-5.10e+00

Pos logit contributions
igsaw+0.313
andel+0.246
ay+0.244
 gol+0.229
 pige+0.222
Neg logit contributions
 wouldn-0.449
 hadn-0.436
'd-0.432
 never-0.404
 could-0.390
Token little
Token ID1310
Feature activation-1.14e+01

Pos logit contributions
igsaw+0.230
andel+0.193
ay+0.179
 pige+0.172
aff+0.161
Neg logit contributions
 wouldn-0.382
 hadn-0.377
'd-0.374
 never-0.354
 also-0.330
Token girl
Token ID2576
Feature activation+4.34e+00

Pos logit contributions
igsaw+0.261
andel+0.222
ay+0.209
 pige+0.196
aff+0.192
Neg logit contributions
 wouldn-0.351
 hadn-0.345
'd-0.343
 never-0.323
 also-0.305
Token picked
Token ID6497
Feature activation+1.02e+00

Pos logit contributions
 hadn+0.085
 wouldn+0.082
'd+0.080
itates+0.070
 probably+0.069
Neg logit contributions
igsaw-0.125
 k-0.113
 pige-0.112
 par-0.112
andel-0.111
Token the
Token ID262
Feature activation-1.12e+01

Pos logit contributions
 wouldn+0.023
 hadn+0.022
'd+0.021
 never+0.020
 also+0.019
Neg logit contributions
igsaw-0.032
andel-0.028
ay-0.027
 pige-0.026
aff-0.025
Token d
Token ID288
Feature activation-4.71e+00

Pos logit contributions
igsaw+0.268
andel+0.207
ay+0.204
 gol+0.195
 pige+0.191
Neg logit contributions
 wouldn-0.354
 hadn-0.340
'd-0.340
 never-0.321
 also-0.309
Tokenandel
Token ID33134
Feature activation-2.80e+01

Pos logit contributions
igsaw+0.058
 pige+0.035
ay+0.032
andel+0.032
 purpose+0.030
Neg logit contributions
 wouldn-0.192
 hadn-0.186
'd-0.186
 never-0.178
 also-0.170
Tokenion
Token ID295
Feature activation-5.17e+00

Pos logit contributions
igsaw+0.531
andel+0.452
ay+0.414
 pige+0.387
aff+0.373
Neg logit contributions
 wouldn-0.965
 hadn-0.963
'd-0.951
 never-0.903
 also-0.844
Token and
Token ID290
Feature activation+3.04e+00

Pos logit contributions
igsaw+0.109
andel+0.092
ay+0.087
 pige+0.084
 mel+0.080
Neg logit contributions
 hadn-0.164
 wouldn-0.163
'd-0.162
 never-0.152
 also-0.142
Token took
Token ID1718
Feature activation+6.16e+00

Pos logit contributions
'd+0.065
 hadn+0.063
 wouldn+0.061
 never+0.054
itates+0.053
Neg logit contributions
igsaw-0.086
 mel-0.080
 pige-0.080
andel-0.078
 bou-0.078
Token a
Token ID257
Feature activation-5.40e+00

Pos logit contributions
 wouldn+0.159
 hadn+0.156
'd+0.155
 never+0.144
 also+0.132
Neg logit contributions
igsaw-0.171
andel-0.150
ay-0.144
 pige-0.139
aff-0.134
Token deep
Token ID2769
Feature activation-1.24e+01

Pos logit contributions
igsaw+0.132
 pige+0.119
andel+0.117
aff+0.114
opl+0.113
Neg logit contributions
'd-0.150
 wouldn-0.145
 hadn-0.142
 never-0.135
 also-0.125
Token stop
Token ID2245
Feature activation-1.19e+00

Pos logit contributions
 wouldn+0.002
 hadn+0.002
'd+0.002
 never+0.002
 could+0.002
Neg logit contributions
igsaw-0.003
andel-0.002
ay-0.002
 purpose-0.002
 pige-0.002
Token and
Token ID290
Feature activation+2.64e+00

Pos logit contributions
igsaw+0.028
andel+0.025
 pige+0.024
ay+0.024
 mel+0.022
Neg logit contributions
 wouldn-0.033
 hadn-0.033
'd-0.033
 never-0.030
 also-0.028
Token smell
Token ID8508
Feature activation+4.86e+00

Pos logit contributions
 hadn+0.069
'd+0.067
 wouldn+0.067
bled+0.056
 never+0.054
Neg logit contributions
igsaw-0.063
 past-0.055
 pige-0.055
 mel-0.055
 k-0.054
Token the
Token ID262
Feature activation-1.24e+01

Pos logit contributions
 wouldn+0.140
 hadn+0.140
'd+0.139
 never+0.128
 also+0.118
Neg logit contributions
igsaw-0.116
andel-0.101
ay-0.097
 pige-0.093
 mel-0.088
Token d
Token ID288
Feature activation-4.35e+00

Pos logit contributions
igsaw+0.283
andel+0.243
ay+0.229
 pige+0.221
 mel+0.209
Neg logit contributions
 wouldn-0.377
 hadn-0.373
'd-0.368
 never-0.347
 also-0.322
Tokenandel
Token ID33134
Feature activation-2.80e+01

Pos logit contributions
igsaw+0.094
andel+0.074
 pige+0.073
ay+0.071
 bou+0.069
Neg logit contributions
 wouldn-0.137
'd-0.132
 hadn-0.132
 never-0.124
 also-0.117
Tokenion
Token ID295
Feature activation-5.12e+00

Pos logit contributions
igsaw+0.739
andel+0.663
ay+0.624
 pige+0.600
aff+0.586
Neg logit contributions
 wouldn-0.754
 hadn-0.752
'd-0.738
 never-0.691
 also-0.634
Token too
Token ID1165
Feature activation+6.85e+00

Pos logit contributions
igsaw+0.110
andel+0.093
ay+0.092
 pige+0.090
 past+0.086
Neg logit contributions
 hadn-0.152
'd-0.148
 wouldn-0.146
 never-0.139
 forgot-0.128
Token!"
Token ID2474
Feature activation-4.02e+00

Pos logit contributions
 wouldn+0.221
 hadn+0.220
'd+0.218
 never+0.203
 also+0.189
Neg logit contributions
igsaw-0.137
andel-0.114
ay-0.111
 pige-0.103
 mel-0.097
Token
Token ID220
Feature activation+3.02e+00

Pos logit contributions
igsaw+0.101
andel+0.088
ay+0.083
 pige+0.081
aff+0.077
Neg logit contributions
 wouldn-0.114
 hadn-0.112
'd-0.111
 never-0.104
 also-0.096
Token
Token ID198
Feature activation-4.13e+00

Pos logit contributions
 wouldn+0.093
 hadn+0.091
'd+0.091
 never+0.085
 also+0.078
Neg logit contributions
igsaw-0.069
andel-0.058
ay-0.057
 pige-0.053
aff-0.051
Token the
Token ID262
Feature activation-1.35e+01

Pos logit contributions
igsaw+0.030
andel+0.030
ay+0.028
aff+0.027
oc+0.027
Neg logit contributions
 also-0.018
 not-0.017
 never-0.017
 almost-0.015
 going-0.014
Token park
Token ID3952
Feature activation-4.58e+00

Pos logit contributions
igsaw+0.329
andel+0.289
 tal+0.263
ay+0.263
oc+0.247
Neg logit contributions
 never-0.387
 also-0.385
'd-0.366
 wouldn-0.349
 not-0.347
Token and
Token ID290
Feature activation+1.17e+00

Pos logit contributions
igsaw+0.124
andel+0.111
ay+0.099
aff+0.094
 gol+0.093
Neg logit contributions
 wouldn-0.125
 never-0.125
'd-0.122
 hadn-0.120
 also-0.116
Token feed
Token ID3745
Feature activation-1.28e+01

Pos logit contributions
 hadn+0.042
 wouldn+0.042
'd+0.040
bled+0.038
 never+0.037
Neg logit contributions
igsaw-0.017
 oak-0.015
 past-0.015
 pige-0.014
 mel-0.014
Token the
Token ID262
Feature activation-1.37e+01

Pos logit contributions
igsaw+0.471
andel+0.406
 gol+0.385
 perspective+0.377
ay+0.371
Neg logit contributions
 wouldn-0.242
 hadn-0.222
 never-0.220
 also-0.218
'd-0.216
Token pige
Token ID32372
Feature activation-2.79e+01

Pos logit contributions
igsaw+0.345
andel+0.274
ay+0.240
 gol+0.228
 purpose+0.222
Neg logit contributions
'd-0.414
 hadn-0.408
 wouldn-0.404
 never-0.394
 also-0.390
Tokenons
Token ID684
Feature activation-4.26e+00

Pos logit contributions
igsaw+0.680
andel+0.577
ay+0.525
 purpose+0.491
 gol+0.489
Neg logit contributions
 wouldn-0.821
 hadn-0.819
'd-0.811
 never-0.792
 also-0.746
Token.
Token ID13
Feature activation-4.50e+00

Pos logit contributions
igsaw+0.116
andel+0.100
ay+0.093
 gol+0.088
 perspective+0.086
Neg logit contributions
 wouldn-0.115
'd-0.111
 never-0.110
 hadn-0.108
 also-0.105
Token One
Token ID1881
Feature activation-7.10e+00

Pos logit contributions
igsaw+0.129
andel+0.115
 pige+0.106
 gol+0.104
ay+0.103
Neg logit contributions
 wouldn-0.113
'd-0.111
 hadn-0.110
 never-0.103
 also-0.101
Token day
Token ID1110
Feature activation-6.41e+00

Pos logit contributions
igsaw+0.217
andel+0.180
aff+0.173
 bou+0.167
 gol+0.164
Neg logit contributions
 also-0.183
 never-0.180
'd-0.176
 wouldn-0.170
 hadn-0.165
Token,
Token ID11
Feature activation-8.59e+00

Pos logit contributions
igsaw+0.160
andel+0.135
 pige+0.124
ican+0.119
aff+0.118
Neg logit contributions
 wouldn-0.195
 hadn-0.186
 never-0.180
'd-0.180
 could-0.176
Token that
Token ID326
Feature activation+8.05e-01

Pos logit contributions
 wouldn+0.161
 hadn+0.158
'd+0.156
 never+0.147
 also+0.136
Neg logit contributions
igsaw-0.148
andel-0.128
ay-0.122
 pige-0.117
aff-0.113
Token it
Token ID340
Feature activation+9.28e+00

Pos logit contributions
 wouldn+0.022
 hadn+0.022
'd+0.022
 never+0.020
 also+0.019
Neg logit contributions
igsaw-0.021
andel-0.018
ay-0.017
 pige-0.017
 mel-0.016
Token was
Token ID373
Feature activation+1.84e+01

Pos logit contributions
'd+0.169
bled+0.168
 hadn+0.161
 wouldn+0.161
itates+0.148
Neg logit contributions
igsaw-0.250
 k-0.235
 par-0.230
 past-0.228
 billboard-0.227
Token a
Token ID257
Feature activation-5.34e+00

Pos logit contributions
'd+0.507
 wouldn+0.500
 hadn+0.494
 never+0.442
 forgot+0.416
Neg logit contributions
igsaw-0.439
andel-0.388
 sight-0.366
 purpose-0.364
 pige-0.362
Token ch
Token ID442
Feature activation-1.03e+01

Pos logit contributions
igsaw+0.109
andel+0.095
ay+0.086
aff+0.083
 pige+0.082
Neg logit contributions
 wouldn-0.178
 hadn-0.176
'd-0.175
 never-0.169
 also-0.157
Tokenandel
Token ID33134
Feature activation-2.79e+01

Pos logit contributions
igsaw+0.203
 pige+0.176
 bou+0.165
 mel+0.159
andel+0.152
Neg logit contributions
 wouldn-0.323
'd-0.314
 hadn-0.313
 never-0.299
 probably-0.285
Tokenier
Token ID959
Feature activation+1.37e+00

Pos logit contributions
igsaw+0.792
andel+0.691
ay+0.663
 pige+0.648
 bou+0.623
Neg logit contributions
 wouldn-0.704
 hadn-0.688
'd-0.684
 never-0.636
 also-0.580
Token,
Token ID11
Feature activation-6.20e+00

Pos logit contributions
 wouldn+0.046
 hadn+0.046
'd+0.045
 never+0.043
 also+0.040
Neg logit contributions
igsaw-0.027
andel-0.022
ay-0.021
 pige-0.020
aff-0.019
Token an
Token ID281
Feature activation-9.58e+00

Pos logit contributions
igsaw+0.114
 pige+0.102
 golf+0.098
andel+0.097
 mel+0.096
Neg logit contributions
'd-0.196
 hadn-0.190
 wouldn-0.189
 never-0.169
itates-0.162
Token old
Token ID1468
Feature activation-1.27e+01

Pos logit contributions
igsaw+0.208
ay+0.156
ites+0.154
andel+0.151
 pige+0.146
Neg logit contributions
 wouldn-0.309
 hadn-0.306
'd-0.296
 never-0.295
 also-0.282
Token and
Token ID290
Feature activation+3.37e+00

Pos logit contributions
igsaw+0.262
andel+0.220
ay+0.206
aff+0.184
 pige+0.182
Neg logit contributions
 wouldn-0.426
 hadn-0.419
'd-0.416
 never-0.398
 also-0.372
Token pick
Token ID2298
Feature activation-4.13e+00

Pos logit contributions
 wouldn+0.001
 hadn+0.001
'd+0.001
 never+0.001
 also+0.001
Neg logit contributions
igsaw-0.001
andel-0.001
ay-0.001
 pige-0.001
aff-0.001
Token and
Token ID290
Feature activation+2.83e+00

Pos logit contributions
igsaw+0.114
andel+0.100
ay+0.095
 pige+0.092
aff+0.089
Neg logit contributions
 wouldn-0.108
 hadn-0.105
'd-0.104
 never-0.098
 also-0.089
Token pl
Token ID458
Feature activation-5.25e+00

Pos logit contributions
'd+0.078
 hadn+0.078
 wouldn+0.076
 never+0.068
itates+0.066
Neg logit contributions
igsaw-0.068
 mel-0.058
 golf-0.055
 pige-0.055
 bou-0.055
Tokenuck
Token ID1347
Feature activation+6.11e+00

Pos logit contributions
igsaw+0.142
andel+0.125
ay+0.122
 pige+0.115
aff+0.112
Neg logit contributions
 wouldn-0.139
 hadn-0.139
'd-0.138
 never-0.127
 also-0.117
Token d
Token ID288
Feature activation-4.46e+00

Pos logit contributions
 hadn+0.160
 wouldn+0.159
'd+0.158
 never+0.146
 also+0.135
Neg logit contributions
igsaw-0.165
andel-0.146
ay-0.139
 pige-0.133
aff-0.129
Tokenandel
Token ID33134
Feature activation-2.79e+01

Pos logit contributions
igsaw+0.123
 pige+0.102
andel+0.101
ay+0.098
 purpose+0.096
Neg logit contributions
 wouldn-0.116
 hadn-0.111
'd-0.108
 never-0.102
 also-0.093
Tokenions
Token ID507
Feature activation+3.31e+00

Pos logit contributions
igsaw+0.675
andel+0.591
ay+0.555
 pige+0.530
aff+0.516
Neg logit contributions
 wouldn-0.817
 hadn-0.812
'd-0.802
 never-0.756
 also-0.701
Token from
Token ID422
Feature activation-7.65e+00

Pos logit contributions
 hadn+0.109
'd+0.109
 wouldn+0.105
 never+0.101
 probably+0.097
Neg logit contributions
igsaw-0.055
andel-0.046
ay-0.044
 past-0.043
 bou-0.043
Token the
Token ID262
Feature activation-1.17e+01

Pos logit contributions
igsaw+0.195
andel+0.164
ay+0.164
 pige+0.152
aff+0.150
Neg logit contributions
 wouldn-0.218
 hadn-0.211
 never-0.207
'd-0.200
 also-0.190
Token ground
Token ID2323
Feature activation+3.67e+00

Pos logit contributions
igsaw+0.250
andel+0.204
ay+0.194
 pige+0.180
aff+0.178
Neg logit contributions
 wouldn-0.381
 hadn-0.375
'd-0.374
 never-0.357
 also-0.330
Token.
Token ID13
Feature activation-2.92e+00

Pos logit contributions
 wouldn+0.118
 hadn+0.116
'd+0.115
 never+0.109
 also+0.102
Neg logit contributions
igsaw-0.078
andel-0.066
ay-0.062
 pige-0.059
aff-0.056
Token -
Token ID532
Feature activation+2.81e+00

Pos logit contributions
igsaw+0.279
andel+0.243
ay+0.234
 pige+0.225
 mel+0.223
Neg logit contributions
 wouldn-0.352
 hadn-0.351
'd-0.347
 never-0.325
 also-0.301
Token it
Token ID340
Feature activation+6.85e+00

Pos logit contributions
 wouldn+0.084
 hadn+0.084
'd+0.082
 never+0.075
 forgot+0.073
Neg logit contributions
igsaw-0.061
andel-0.052
 mel-0.051
 pige-0.050
ay-0.050
Token was
Token ID373
Feature activation+1.56e+01

Pos logit contributions
itates+0.145
 wouldn+0.134
 hadn+0.132
'd+0.131
 Deal+0.129
Neg logit contributions
 blue-0.158
 mel-0.158
 past-0.152
 lime-0.149
 yellow-0.148
Token bigger
Token ID5749
Feature activation-1.57e+01

Pos logit contributions
 wouldn+0.441
 hadn+0.435
'd+0.433
 never+0.391
 forgot+0.367
Neg logit contributions
igsaw-0.371
andel-0.330
ay-0.313
 mel-0.304
 pige-0.300
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
igsaw+0.337
andel+0.285
ay+0.260
 pige+0.241
aff+0.236
Neg logit contributions
 wouldn-0.509
 hadn-0.501
'd-0.498
 never-0.484
 also-0.447
Token ju
Token ID7544
Feature activation-2.78e+01

Pos logit contributions
'd+0.032
 hadn+0.032
 wouldn+0.032
 never+0.030
 probably+0.028
Neg logit contributions
 blue-0.018
 mel-0.018
igsaw-0.018
 past-0.017
 golf-0.017
Tokenic
Token ID291
Feature activation-8.68e+00

Pos logit contributions
igsaw+0.569
andel+0.488
ay+0.473
 pige+0.429
aff+0.420
Neg logit contributions
 wouldn-0.911
 hadn-0.904
'd-0.890
 never-0.838
 also-0.783
Tokenier
Token ID959
Feature activation-1.75e+00

Pos logit contributions
igsaw+0.252
andel+0.237
ay+0.231
 mel+0.215
aff+0.213
Neg logit contributions
 hadn-0.193
'd-0.187
 wouldn-0.182
 never-0.172
 also-0.165
Token than
Token ID621
Feature activation+8.44e-02

Pos logit contributions
igsaw+0.033
andel+0.027
ay+0.025
 pige+0.024
aff+0.022
Neg logit contributions
 wouldn-0.061
 hadn-0.060
'd-0.060
 never-0.056
 also-0.053
Token any
Token ID597
Feature activation-9.80e+00

Pos logit contributions
 hadn+0.002
'd+0.002
 wouldn+0.002
 never+0.002
itates+0.002
Neg logit contributions
igsaw-0.002
andel-0.002
 mel-0.002
 pige-0.002
 past-0.002
Token he
Token ID339
Feature activation+1.70e+01

Pos logit contributions
igsaw+0.231
andel+0.201
ay+0.191
 pige+0.184
 mel+0.175
Neg logit contributions
 wouldn-0.290
 hadn-0.286
'd-0.283
 never-0.267
 also-0.246
Token got
Token ID1392
Feature activation+7.56e+00

Pos logit contributions
'd+0.393
 hadn+0.382
 wouldn+0.365
 probably+0.336
bled+0.316
Neg logit contributions
 k-0.466
 par-0.463
 pige-0.451
 gol-0.450
 golf-0.443
Token angry
Token ID7954
Feature activation-1.09e+01

Pos logit contributions
'd+0.218
 hadn+0.212
 wouldn+0.207
 never+0.188
 probably+0.177
Neg logit contributions
igsaw-0.179
andel-0.159
 pige-0.154
ay-0.154
 golf-0.153
Token.
Token ID13
Feature activation-2.96e+00

Pos logit contributions
igsaw+0.226
andel+0.192
ay+0.182
 pige+0.173
aff+0.165
Neg logit contributions
 wouldn-0.352
 hadn-0.349
'd-0.347
 never-0.325
 also-0.302
Token
Token ID198
Feature activation-4.66e+00

Pos logit contributions
igsaw+0.086
andel+0.076
ay+0.073
 pige+0.071
aff+0.069
Neg logit contributions
 wouldn-0.073
 hadn-0.071
'd-0.071
 never-0.066
 also-0.060
Token
Token ID198
Feature activation-4.67e+00

Pos logit contributions
igsaw+0.118
andel+0.103
ay+0.096
 pige+0.094
aff+0.090
Neg logit contributions
 wouldn-0.131
 hadn-0.130
'd-0.128
 never-0.122
 also-0.113
TokenHe
Token ID1544
Feature activation+1.83e+01

Pos logit contributions
igsaw+0.123
andel+0.107
ay+0.102
 pige+0.098
aff+0.094
Neg logit contributions
 wouldn-0.130
 hadn-0.127
'd-0.126
 never-0.117
 also-0.107
Token threw
Token ID9617
Feature activation+5.32e-01

Pos logit contributions
'd+0.399
 hadn+0.386
 wouldn+0.368
bled+0.336
itates+0.329
Neg logit contributions
 par-0.439
 past-0.436
 k-0.427
 pige-0.420
 mel-0.412
Token the
Token ID262
Feature activation-1.24e+01

Pos logit contributions
 wouldn+0.014
 never+0.013
 hadn+0.013
 also+0.012
'd+0.012
Neg logit contributions
igsaw-0.016
andel-0.014
aff-0.013
 pige-0.013
ay-0.012
Token ball
Token ID2613
Feature activation-9.80e+00

Pos logit contributions
igsaw+0.190
andel+0.167
aff+0.142
 pige+0.140
ay+0.132
Neg logit contributions
 wouldn-0.462
 hadn-0.459
'd-0.451
 never-0.426
 also-0.415
Token very
Token ID845
Feature activation-7.42e-01

Pos logit contributions
igsaw+0.239
andel+0.208
ay+0.203
 pige+0.189
 past+0.183
Neg logit contributions
 hadn-0.277
 wouldn-0.274
'd-0.273
 never-0.248
 also-0.230
Token hard
Token ID1327
Feature activation-5.97e+00

Pos logit contributions
igsaw+0.019
andel+0.017
ay+0.016
 pige+0.015
aff+0.015
Neg logit contributions
 wouldn-0.020
 hadn-0.020
'd-0.020
 never-0.018
 probably-0.017
Token got
Token ID1392
Feature activation+6.58e+00

Pos logit contributions
igsaw+0.051
andel+0.047
ay+0.045
 pige+0.044
aff+0.043
Neg logit contributions
 wouldn-0.035
 hadn-0.035
'd-0.035
 never-0.032
 also-0.028
Token louder
Token ID27089
Feature activation-2.15e+00

Pos logit contributions
'd+0.184
 hadn+0.175
 wouldn+0.174
 never+0.154
 forgot+0.148
Neg logit contributions
 past-0.148
 blue-0.135
andel-0.135
 sight-0.131
 golf-0.129
Token and
Token ID290
Feature activation+2.17e+00

Pos logit contributions
igsaw+0.052
andel+0.044
ay+0.042
 pige+0.040
 gol+0.038
Neg logit contributions
 wouldn-0.064
 hadn-0.063
'd-0.062
 never-0.060
 also-0.055
Token louder
Token ID27089
Feature activation-2.20e+00

Pos logit contributions
'd+0.058
 hadn+0.057
 wouldn+0.056
 never+0.052
 probably+0.050
Neg logit contributions
igsaw-0.046
 past-0.046
 sight-0.043
 bou-0.043
 mel-0.043
Token.
Token ID13
Feature activation-3.59e+00

Pos logit contributions
igsaw+0.054
andel+0.045
ay+0.043
 pige+0.041
 gol+0.039
Neg logit contributions
 wouldn-0.065
 hadn-0.064
'd-0.063
 never-0.062
 also-0.056
Token He
Token ID679
Feature activation+1.79e+01

Pos logit contributions
igsaw+0.093
andel+0.081
ay+0.077
 pige+0.075
aff+0.072
Neg logit contributions
 wouldn-0.100
 hadn-0.099
'd-0.098
 never-0.092
 also-0.084
Token was
Token ID373
Feature activation+1.66e+01

Pos logit contributions
 hadn+0.334
 wouldn+0.333
'd+0.322
 probably+0.280
 never+0.278
Neg logit contributions
igsaw-0.478
 par-0.473
 k-0.468
 past-0.465
 pige-0.463
Token sure
Token ID1654
Feature activation+3.49e+00

Pos logit contributions
 wouldn+0.409
'd+0.408
 hadn+0.401
 never+0.324
 could+0.315
Neg logit contributions
igsaw-0.424
andel-0.396
 past-0.381
 pige-0.381
 sight-0.380
Token that
Token ID326
Feature activation-5.01e-01

Pos logit contributions
 hadn+0.103
'd+0.103
 wouldn+0.101
 probably+0.087
 never+0.085
Neg logit contributions
igsaw-0.072
 pige-0.066
 mel-0.063
 oak-0.061
 past-0.059
Token this
Token ID428
Feature activation-8.62e+00

Pos logit contributions
igsaw+0.011
andel+0.009
 pige+0.009
ay+0.009
 mel+0.008
Neg logit contributions
'd-0.015
 wouldn-0.015
 hadn-0.015
 never-0.014
 probably-0.013
Token flower
Token ID15061
Feature activation-8.74e+00

Pos logit contributions
igsaw+0.213
andel+0.184
ay+0.175
 pige+0.168
aff+0.162
Neg logit contributions
 wouldn-0.250
 hadn-0.247
'd-0.244
 never-0.230
 also-0.213
Token upon
Token ID2402
Feature activation+5.86e+00

Pos logit contributions
igsaw+0.351
andel+0.311
ay+0.303
 pige+0.288
aff+0.280
Neg logit contributions
 wouldn-0.277
 hadn-0.271
'd-0.269
 never-0.248
 also-0.222
Token a
Token ID257
Feature activation-6.14e+00

Pos logit contributions
 wouldn+0.211
 hadn+0.197
'd+0.194
 forgot+0.172
 never+0.171
Neg logit contributions
igsaw-0.094
 view-0.078
ay-0.077
 pige-0.077
 golf-0.075
Token time
Token ID640
Feature activation-4.13e+00

Pos logit contributions
igsaw+0.139
andel+0.120
ay+0.110
 pige+0.108
aff+0.104
Neg logit contributions
 wouldn-0.188
 hadn-0.187
'd-0.187
 never-0.178
 also-0.165
Token,
Token ID11
Feature activation-7.16e+00

Pos logit contributions
igsaw+0.078
andel+0.071
 pige+0.061
aff+0.059
ay+0.058
Neg logit contributions
 hadn-0.141
 wouldn-0.140
'd-0.137
 never-0.135
 also-0.127
Token there
Token ID612
Feature activation-3.12e+00

Pos logit contributions
igsaw+0.171
ay+0.150
 past+0.142
andel+0.142
 pige+0.133
Neg logit contributions
 wouldn-0.202
'd-0.195
 hadn-0.194
 never-0.177
 forgot-0.164
Token was
Token ID373
Feature activation+1.69e+01

Pos logit contributions
igsaw+0.105
ay+0.097
andel+0.092
 mel+0.090
 pige+0.090
Neg logit contributions
 wouldn-0.055
'd-0.053
 hadn-0.052
 never-0.045
 probably-0.039
Token a
Token ID257
Feature activation-6.27e+00

Pos logit contributions
 hadn+0.407
 wouldn+0.406
'd+0.400
 never+0.383
 also+0.354
Neg logit contributions
igsaw-0.491
andel-0.436
ay-0.412
 pige-0.398
aff-0.396
Token little
Token ID1310
Feature activation-1.26e+01

Pos logit contributions
igsaw+0.155
andel+0.132
ay+0.123
aff+0.116
 perspective+0.115
Neg logit contributions
'd-0.183
 hadn-0.181
 wouldn-0.180
 never-0.175
 also-0.175
Token girl
Token ID2576
Feature activation+3.36e+00

Pos logit contributions
igsaw+0.344
andel+0.294
aff+0.274
 perspective+0.272
ay+0.270
Neg logit contributions
 wouldn-0.300
'd-0.293
 also-0.288
 hadn-0.285
 never-0.276
Token named
Token ID3706
Feature activation-8.25e+00

Pos logit contributions
 never+0.073
 hadn+0.066
 wouldn+0.064
 also+0.064
'd+0.063
Neg logit contributions
igsaw-0.120
andel-0.103
ay-0.096
aff-0.095
 gol-0.092
Token Lily
Token ID20037
Feature activation+4.93e+00

Pos logit contributions
igsaw+0.224
andel+0.197
ay+0.187
 pige+0.182
aff+0.175
Neg logit contributions
 wouldn-0.219
 hadn-0.216
'd-0.214
 never-0.202
 also-0.185
Token them
Token ID606
Feature activation+1.05e+01

Pos logit contributions
 wouldn+0.049
 hadn+0.048
'd+0.048
 never+0.045
 also+0.041
Neg logit contributions
igsaw-0.041
andel-0.035
ay-0.033
 pige-0.032
aff-0.031
Token look
Token ID804
Feature activation+7.91e+00

Pos logit contributions
 hadn+0.273
 wouldn+0.272
'd+0.268
 never+0.244
 probably+0.224
Neg logit contributions
igsaw-0.282
andel-0.240
ay-0.233
 bou-0.222
 pige-0.221
Token different
Token ID1180
Feature activation-1.29e+01

Pos logit contributions
 wouldn+0.203
 hadn+0.199
'd+0.197
 never+0.184
 also+0.168
Neg logit contributions
igsaw-0.221
andel-0.195
ay-0.186
 pige-0.181
aff-0.174
Token.
Token ID13
Feature activation-3.27e+00

Pos logit contributions
igsaw+0.310
andel+0.270
ay+0.252
 pige+0.245
 gol+0.233
Neg logit contributions
 wouldn-0.385
 hadn-0.375
'd-0.370
 never-0.350
 also-0.328
Token Everywhere
Token ID48201
Feature activation-8.57e+00

Pos logit contributions
igsaw+0.096
andel+0.083
ay+0.081
aff+0.074
 pige+0.073
Neg logit contributions
 wouldn-0.082
'd-0.081
 hadn-0.080
 never-0.074
 also-0.068
Token she
Token ID673
Feature activation+1.81e+01

Pos logit contributions
igsaw+0.252
 pige+0.222
andel+0.220
ay+0.220
 oak+0.214
Neg logit contributions
'd-0.184
 wouldn-0.178
 hadn-0.175
 never-0.157
bled-0.146
Token went
Token ID1816
Feature activation+6.51e+00

Pos logit contributions
 not+0.334
 wouldn+0.324
 never+0.324
 hadn+0.323
'd+0.320
Neg logit contributions
igsaw-0.540
andel-0.483
ay-0.478
 mel-0.474
 gol-0.462
Token she
Token ID673
Feature activation+1.81e+01

Pos logit contributions
 wouldn+0.228
 hadn+0.227
'd+0.227
 never+0.213
 also+0.201
Neg logit contributions
igsaw-0.118
andel-0.094
ay-0.091
 pige-0.084
 past-0.078
Token was
Token ID373
Feature activation+1.71e+01

Pos logit contributions
bled+0.287
 hadn+0.278
itates+0.274
 wouldn+0.273
'd+0.265
Neg logit contributions
igsaw-0.524
 mel-0.488
 past-0.487
 par-0.486
ay-0.481
Token in
Token ID287
Feature activation-4.53e+00

Pos logit contributions
 wouldn+0.405
 hadn+0.397
'd+0.394
 never+0.350
 probably+0.317
Neg logit contributions
igsaw-0.495
andel-0.432
ay-0.417
 pige-0.409
 bou-0.395
Token a
Token ID257
Feature activation-5.93e+00

Pos logit contributions
igsaw+0.113
andel+0.100
 pige+0.094
aff+0.093
ay+0.093
Neg logit contributions
 hadn-0.127
 wouldn-0.126
'd-0.123
 never-0.121
 also-0.111
Token"
Token ID1
Feature activation-8.48e-01

Pos logit contributions
igsaw+0.088
ay+0.073
andel+0.073
 pige+0.067
 gol+0.065
Neg logit contributions
 wouldn-0.114
'd-0.108
 hadn-0.108
 never-0.099
 probably-0.092
TokenIt
Token ID1026
Feature activation+1.75e+01

Pos logit contributions
igsaw+0.025
andel+0.022
 pige+0.021
ay+0.021
 gol+0.020
Neg logit contributions
 wouldn-0.020
'd-0.020
 hadn-0.019
 never-0.018
 could-0.016
Token's
Token ID338
Feature activation+4.23e+00

Pos logit contributions
 hadn+0.447
'd+0.447
bled+0.406
 wouldn+0.400
 possibly+0.391
Neg logit contributions
igsaw-0.369
 k-0.352
ay-0.325
 past-0.320
 picture-0.311
Token hurt
Token ID5938
Feature activation-3.96e+00

Pos logit contributions
'd+0.080
 wouldn+0.078
 hadn+0.077
 never+0.067
 could+0.060
Neg logit contributions
igsaw-0.142
andel-0.129
ay-0.125
 pige-0.120
 mel-0.120
Token,"
Token ID553
Feature activation-6.21e+00

Pos logit contributions
igsaw+0.089
andel+0.076
 pige+0.068
ay+0.067
aff+0.065
Neg logit contributions
 wouldn-0.125
 hadn-0.123
'd-0.121
 never-0.119
 also-0.108
Token she
Token ID673
Feature activation+1.93e+01

Pos logit contributions
igsaw+0.171
 pige+0.148
andel+0.147
 mel+0.143
ay+0.142
Neg logit contributions
'd-0.158
 wouldn-0.153
 hadn-0.150
 never-0.139
 probably-0.125
Token said
Token ID531
Feature activation+6.36e+00

Pos logit contributions
'd+0.550
 hadn+0.521
 wouldn+0.505
 never+0.485
 probably+0.455
Neg logit contributions
igsaw-0.365
 par-0.359
 past-0.342
 k-0.337
andel-0.337
Token sadly
Token ID21098
Feature activation+9.22e+00

Pos logit contributions
'd+0.224
 hadn+0.220
 wouldn+0.219
 never+0.205
 also+0.194
Neg logit contributions
igsaw-0.113
andel-0.090
ay-0.083
 pige-0.079
 bou-0.076
Token.
Token ID13
Feature activation-2.58e+00

Pos logit contributions
 wouldn+0.256
 hadn+0.247
'd+0.235
 never+0.231
 also+0.215
Neg logit contributions
igsaw-0.244
andel-0.215
ay-0.209
 pige-0.201
aff-0.189
Token
Token ID198
Feature activation-3.87e+00

Pos logit contributions
igsaw+0.061
andel+0.051
ay+0.049
 pige+0.047
aff+0.047
Neg logit contributions
 wouldn-0.075
'd-0.075
 hadn-0.073
 never-0.068
 also-0.063
Token
Token ID198
Feature activation-3.89e+00

Pos logit contributions
igsaw+0.099
andel+0.087
ay+0.081
 pige+0.079
aff+0.075
Neg logit contributions
 hadn-0.107
 wouldn-0.107
'd-0.105
 never-0.101
 also-0.093
Token feeling
Token ID4203
Feature activation+7.56e+00

Pos logit contributions
 wouldn+0.096
'd+0.095
 hadn+0.093
 probably+0.080
 never+0.080
Neg logit contributions
igsaw-0.064
 past-0.062
 sight-0.061
 view-0.060
 oak-0.059
Token the
Token ID262
Feature activation-1.15e+01

Pos logit contributions
 wouldn+0.223
 hadn+0.219
'd+0.218
 never+0.189
 forgot+0.182
Neg logit contributions
igsaw-0.166
andel-0.150
 pige-0.140
ay-0.138
 sight-0.135
Token warm
Token ID5814
Feature activation-1.38e+01

Pos logit contributions
igsaw+0.212
andel+0.183
ay+0.172
 pige+0.166
 bou+0.158
Neg logit contributions
 wouldn-0.395
 hadn-0.388
'd-0.381
 never-0.357
 could-0.335
Token sunshine
Token ID34488
Feature activation-6.89e+00

Pos logit contributions
igsaw+0.279
andel+0.214
ay+0.194
 pige+0.193
 gol+0.187
Neg logit contributions
 wouldn-0.476
 hadn-0.469
'd-0.469
 never-0.447
 also-0.420
Token in
Token ID287
Feature activation-3.53e+00

Pos logit contributions
igsaw+0.137
ay+0.127
andel+0.121
 k+0.119
 past+0.118
Neg logit contributions
 hadn-0.199
 wouldn-0.199
'd-0.194
 never-0.180
 forgot-0.176
Token its
Token ID663
Feature activation-1.66e-01

Pos logit contributions
igsaw+0.094
andel+0.082
 pige+0.076
aff+0.076
ay+0.075
Neg logit contributions
 hadn-0.096
 never-0.096
 wouldn-0.094
'd-0.093
 also-0.086
Token fur
Token ID9230
Feature activation-6.70e+00

Pos logit contributions
igsaw+0.004
andel+0.003
 gol+0.003
 pige+0.003
ay+0.003
Neg logit contributions
 wouldn-0.005
 hadn-0.005
 never-0.005
'd-0.005
 also-0.004
Token.
Token ID13
Feature activation-2.60e+00

Pos logit contributions
igsaw+0.128
andel+0.108
ay+0.103
 pige+0.097
aff+0.093
Neg logit contributions
 wouldn-0.220
 hadn-0.220
'd-0.218
 never-0.204
 also-0.193
Token
Token ID198
Feature activation-3.89e+00

Pos logit contributions
igsaw+0.061
andel+0.052
ay+0.050
 pige+0.047
aff+0.046
Neg logit contributions
 wouldn-0.078
'd-0.077
 hadn-0.076
 never-0.071
 also-0.065
Token
Token ID198
Feature activation-3.89e+00

Pos logit contributions
igsaw+0.086
andel+0.072
ay+0.070
 pige+0.066
aff+0.063
Neg logit contributions
 wouldn-0.124
 hadn-0.119
'd-0.119
 never-0.110
 forgot-0.102
TokenThe
Token ID464
Feature activation-9.44e+00

Pos logit contributions
igsaw+0.088
andel+0.075
ay+0.071
 pige+0.070
 gol+0.066
Neg logit contributions
 wouldn-0.122
 hadn-0.116
'd-0.115
 never-0.105
 forgot-0.099
Token.
Token ID13
Feature activation-3.27e+00

Pos logit contributions
igsaw+0.136
 pige+0.108
andel+0.106
ay+0.106
 par+0.102
Neg logit contributions
 wouldn-0.281
 hadn-0.276
'd-0.275
 never-0.257
 also-0.243
Token Harmony
Token ID35088
Feature activation+7.68e-01

Pos logit contributions
igsaw+0.095
andel+0.083
ay+0.080
 pige+0.078
aff+0.075
Neg logit contributions
 wouldn-0.081
 hadn-0.080
'd-0.078
 never-0.074
 also-0.067
Token slowly
Token ID6364
Feature activation+4.69e+00

Pos logit contributions
 wouldn+0.014
 hadn+0.014
'd+0.014
itates+0.013
bled+0.013
Neg logit contributions
 past-0.018
 k-0.018
igsaw-0.018
 par-0.018
 oak-0.018
Token bent
Token ID17157
Feature activation+1.23e+00

Pos logit contributions
 wouldn+0.086
 hadn+0.083
'd+0.082
 never+0.074
 also+0.065
Neg logit contributions
igsaw-0.168
andel-0.151
ay-0.147
 pige-0.142
aff-0.138
Token down
Token ID866
Feature activation-1.54e+00

Pos logit contributions
 hadn+0.036
 wouldn+0.036
'd+0.036
 never+0.033
 also+0.030
Neg logit contributions
igsaw-0.029
andel-0.025
ay-0.024
 pige-0.022
 past-0.022
Token and
Token ID290
Feature activation+2.27e+00

Pos logit contributions
igsaw+0.033
andel+0.028
ay+0.026
 pige+0.025
aff+0.024
Neg logit contributions
 wouldn-0.049
 hadn-0.049
'd-0.048
 never-0.045
 also-0.042
Token pe
Token ID613
Feature activation-7.05e+00

Pos logit contributions
 hadn+0.051
'd+0.050
 wouldn+0.049
 never+0.046
 probably+0.042
Neg logit contributions
igsaw-0.062
 past-0.057
andel-0.055
 mel-0.054
 bou-0.053
Tokenered
Token ID1068
Feature activation+9.23e+00

Pos logit contributions
igsaw+0.200
andel+0.196
 past+0.188
ites+0.182
 mel+0.181
Neg logit contributions
 hadn-0.143
'd-0.137
 wouldn-0.135
 never-0.121
 forgot-0.110
Token in
Token ID287
Feature activation-4.82e+00

Pos logit contributions
 wouldn+0.238
 hadn+0.234
'd+0.232
 never+0.217
 also+0.198
Neg logit contributions
igsaw-0.257
andel-0.226
ay-0.216
 pige-0.209
aff-0.201
Token -
Token ID532
Feature activation+3.86e+00

Pos logit contributions
igsaw+0.126
andel+0.112
 pige+0.106
ay+0.104
aff+0.103
Neg logit contributions
 hadn-0.131
 wouldn-0.128
'd-0.128
 never-0.125
 also-0.115
Token and
Token ID290
Feature activation+2.18e+00

Pos logit contributions
'd+0.112
 wouldn+0.112
 hadn+0.111
 forgot+0.099
 never+0.098
Neg logit contributions
igsaw-0.079
 past-0.071
andel-0.068
 oak-0.068
 golf-0.068
Token k
Token ID479
Feature activation-1.60e+01

Pos logit contributions
igsaw+0.146
 pige+0.141
 k+0.135
andel+0.134
ay+0.131
Neg logit contributions
 hadn-0.299
 wouldn-0.295
'd-0.293
 never-0.274
 probably-0.255
Tokenang
Token ID648
Feature activation-1.18e+01

Pos logit contributions
igsaw+0.314
andel+0.232
 purpose+0.201
 bou+0.200
ay+0.198
Neg logit contributions
 wouldn-0.565
 hadn-0.552
'd-0.547
 never-0.534
 also-0.511
Tokenaroo
Token ID38049
Feature activation-5.49e+00

Pos logit contributions
igsaw+0.158
andel+0.137
ay+0.131
 pige+0.127
aff+0.111
Neg logit contributions
 hadn-0.443
'd-0.437
 wouldn-0.437
 never-0.410
 forgot-0.391
Token would
Token ID561
Feature activation+1.69e+01

Pos logit contributions
 k+0.170
 pige+0.165
 par+0.165
 rh+0.164
 past+0.163
Neg logit contributions
 hadn-0.076
bled-0.068
'd-0.067
Looks-0.064
 wouldn-0.063
Token be
Token ID307
Feature activation+9.99e+00

Pos logit contributions
 wouldn+0.363
 hadn+0.360
'd+0.341
bled+0.274
 could+0.273
Neg logit contributions
igsaw-0.485
andel-0.444
 oak-0.438
 bou-0.437
 purpose-0.436
Token okay
Token ID8788
Feature activation+1.28e+00

Pos logit contributions
 wouldn+0.250
'd+0.248
 hadn+0.246
 never+0.216
 could+0.200
Neg logit contributions
igsaw-0.268
andel-0.241
 pige-0.235
ay-0.231
 bou-0.225
Token.
Token ID13
Feature activation-4.61e+00

Pos logit contributions
 hadn+0.046
 wouldn+0.046
'd+0.046
 never+0.042
 forgot+0.041
Neg logit contributions
igsaw-0.017
ay-0.014
 past-0.013
andel-0.013
 purpose-0.012
Token They
Token ID1119
Feature activation+1.62e+01

Pos logit contributions
igsaw+0.126
andel+0.109
ay+0.106
 pige+0.102
 gol+0.098
Neg logit contributions
 wouldn-0.122
'd-0.119
 hadn-0.119
 never-0.109
 also-0.098
Token wished
Token ID16555
Feature activation-9.47e-01

Pos logit contributions
bled+0.276
itates+0.266
bling+0.262
 hadn+0.254
'd+0.243
Neg logit contributions
 k-0.415
 par-0.414
 oak-0.406
 mel-0.396
 pigeon-0.395
Token they
Token ID484
Feature activation+1.54e+01

Pos logit contributions
igsaw+0.023
 pige+0.021
andel+0.020
 past+0.020
 golf+0.020
Neg logit contributions
'd-0.025
 hadn-0.024
 wouldn-0.023
 managed-0.021
 never-0.020
Token could
Token ID714
Feature activation+1.33e+01

Pos logit contributions
Looks+0.255
bling+0.245
bled+0.242
 probably+0.220
itates+0.219
Neg logit contributions
 pige-0.357
 k-0.353
 past-0.345
 oak-0.342
 pigeon-0.340
Token me
Token ID502
Feature activation+6.97e+00

Pos logit contributions
 never+0.043
 wouldn+0.042
 almost+0.036
 hadn+0.034
 not+0.033
Neg logit contributions
igsaw-0.102
andel-0.090
 perspective-0.083
 bou-0.080
 gol-0.080
Token the
Token ID262
Feature activation-1.17e+01

Pos logit contributions
 never+0.176
 wouldn+0.170
 not+0.159
'd+0.157
 also+0.147
Neg logit contributions
igsaw-0.214
andel-0.188
 gol-0.176
aff-0.176
 perspective-0.175
Token hat
Token ID6877
Feature activation-5.97e+00

Pos logit contributions
igsaw+0.287
andel+0.235
ay+0.214
 pige+0.205
aff+0.203
Neg logit contributions
 wouldn-0.359
'd-0.356
 hadn-0.347
 never-0.333
 also-0.312
Token!"
Token ID2474
Feature activation-3.46e+00

Pos logit contributions
igsaw+0.125
andel+0.105
ay+0.094
 pige+0.093
aff+0.087
Neg logit contributions
 wouldn-0.198
 hadn-0.193
'd-0.193
 never-0.184
 also-0.171
Token He
Token ID679
Feature activation+1.92e+01

Pos logit contributions
igsaw+0.100
andel+0.089
ay+0.085
 pige+0.082
aff+0.079
Neg logit contributions
 wouldn-0.086
 hadn-0.084
'd-0.083
 never-0.078
 also-0.071
Token tries
Token ID8404
Feature activation+6.23e+00

Pos logit contributions
 hadn+0.528
'd+0.521
 wouldn+0.496
 never+0.465
 managed+0.413
Neg logit contributions
igsaw-0.449
 k-0.416
andel-0.411
ay-0.409
 past-0.405
Token to
Token ID284
Feature activation+9.48e-01

Pos logit contributions
 wouldn+0.183
 hadn+0.178
'd+0.174
 never+0.173
 also+0.162
Neg logit contributions
igsaw-0.157
andel-0.139
ay-0.128
 pige-0.124
aff-0.120
Token pull
Token ID2834
Feature activation+4.50e-01

Pos logit contributions
 wouldn+0.025
 hadn+0.024
'd+0.024
 never+0.023
 also+0.021
Neg logit contributions
igsaw-0.026
andel-0.023
ay-0.022
 pige-0.021
aff-0.021
Token the
Token ID262
Feature activation-1.17e+01

Pos logit contributions
 wouldn+0.012
 hadn+0.011
 never+0.011
'd+0.011
 also+0.010
Neg logit contributions
igsaw-0.013
andel-0.011
 pige-0.010
 bou-0.010
aff-0.010
Token hat
Token ID6877
Feature activation-6.08e+00

Pos logit contributions
igsaw+0.270
andel+0.222
ay+0.204
 pige+0.197
aff+0.191
Neg logit contributions
 wouldn-0.371
'd-0.363
 hadn-0.360
 never-0.342
 also-0.320
Token from
Token ID422
Feature activation-7.75e+00

Pos logit contributions
igsaw+0.140
andel+0.122
ay+0.120
 pige+0.107
aff+0.106
Neg logit contributions
 hadn-0.180
'd-0.177
 wouldn-0.176
 never-0.163
 also-0.151
Token,
Token ID11
Feature activation-7.80e+00

Pos logit contributions
 wouldn+0.177
 hadn+0.176
'd+0.174
 never+0.166
 also+0.157
Neg logit contributions
igsaw-0.065
andel-0.050
ay-0.045
 pige-0.042
aff-0.038
Token "
Token ID366
Feature activation-1.60e+00

Pos logit contributions
igsaw+0.184
andel+0.159
ay+0.154
 pige+0.148
 mel+0.144
Neg logit contributions
 wouldn-0.222
 hadn-0.219
'd-0.218
 never-0.204
 also-0.187
TokenI
Token ID40
Feature activation+1.68e+01

Pos logit contributions
igsaw+0.043
 pige+0.041
 par+0.041
andel+0.040
 k+0.039
Neg logit contributions
 wouldn-0.037
 hadn-0.036
'd-0.035
 never-0.031
 probably-0.030
Token don
Token ID836
Feature activation+9.09e+00

Pos logit contributions
bled+0.395
 hadn+0.377
 wouldn+0.373
'd+0.354
 never+0.315
Neg logit contributions
igsaw-0.423
oc-0.384
ay-0.380
 par-0.379
 past-0.360
Token't
Token ID470
Feature activation+1.35e+01

Pos logit contributions
 wouldn+0.183
'd+0.174
 hadn+0.174
 never+0.146
 forgot+0.140
Neg logit contributions
igsaw-0.305
ay-0.282
andel-0.272
 pige-0.249
aff-0.246
Token know
Token ID760
Feature activation+3.73e+00

Pos logit contributions
 wouldn+0.310
 hadn+0.306
'd+0.304
 never+0.286
 also+0.257
Neg logit contributions
igsaw-0.419
andel-0.374
ay-0.357
 pige-0.349
aff-0.339
Token,
Token ID11
Feature activation-7.99e+00

Pos logit contributions
 wouldn+0.135
 hadn+0.134
'd+0.134
 never+0.127
 also+0.120
Neg logit contributions
igsaw-0.062
andel-0.050
ay-0.046
 pige-0.044
aff-0.041
Token Lily
Token ID20037
Feature activation+4.46e+00

Pos logit contributions
igsaw+0.160
andel+0.136
ay+0.132
 bou+0.129
 pige+0.129
Neg logit contributions
'd-0.253
 wouldn-0.247
 hadn-0.246
 never-0.228
 managed-0.211
Token.
Token ID13
Feature activation-3.98e+00

Pos logit contributions
 hadn+0.162
'd+0.162
 wouldn+0.160
 never+0.149
 also+0.143
Neg logit contributions
igsaw-0.063
ay-0.049
andel-0.045
 purpose-0.042
 past-0.041
Token Maybe
Token ID6674
Feature activation+1.08e+01

Pos logit contributions
igsaw+0.092
andel+0.079
oc+0.078
ay+0.078
 rh+0.076
Neg logit contributions
'd-0.111
 hadn-0.109
 wouldn-0.105
 never-0.100
 managed-0.095
Token he
Token ID339
Feature activation+1.78e+01

Pos logit contributions
'd+0.291
 hadn+0.291
 wouldn+0.286
 never+0.258
 managed+0.246
Neg logit contributions
igsaw-0.243
 par-0.227
andel-0.224
 pige-0.222
 k-0.217
Token it
Token ID340
Feature activation+9.34e+00

Pos logit contributions
igsaw+0.243
andel+0.213
ay+0.209
 pige+0.199
 mel+0.194
Neg logit contributions
 wouldn-0.215
'd-0.214
 hadn-0.212
 never-0.189
 forgot-0.171
Token.
Token ID13
Feature activation-2.75e+00

Pos logit contributions
 hadn+0.328
'd+0.327
 wouldn+0.325
 never+0.300
 forgot+0.290
Neg logit contributions
igsaw-0.157
ay-0.125
 past-0.118
andel-0.118
 golf-0.111
Token It
Token ID632
Feature activation+1.51e+01

Pos logit contributions
igsaw+0.081
andel+0.071
ay+0.069
 pige+0.066
aff+0.065
Neg logit contributions
 wouldn-0.067
'd-0.065
 hadn-0.065
 never-0.060
 also-0.054
Token was
Token ID373
Feature activation+1.85e+01

Pos logit contributions
 wouldn+0.319
'd+0.319
 hadn+0.299
Acc+0.290
bled+0.281
Neg logit contributions
 blue-0.354
 k-0.341
 past-0.340
 par-0.330
 yellow-0.329
Token a
Token ID257
Feature activation-5.14e+00

Pos logit contributions
'd+0.571
 wouldn+0.569
 hadn+0.548
 never+0.507
 forgot+0.478
Neg logit contributions
igsaw-0.391
andel-0.337
 mel-0.320
ay-0.320
 golf-0.320
Token passport
Token ID17981
Feature activation-8.13e+00

Pos logit contributions
igsaw+0.106
andel+0.093
ay+0.084
aff+0.080
 pige+0.078
Neg logit contributions
 hadn-0.169
'd-0.168
 wouldn-0.167
 never-0.159
 also-0.151
Token.
Token ID13
Feature activation-2.64e+00

Pos logit contributions
igsaw+0.139
ay+0.114
andel+0.104
 pige+0.096
 past+0.095
Neg logit contributions
 wouldn-0.286
 hadn-0.283
'd-0.280
 never-0.258
 forgot-0.246
Token It
Token ID632
Feature activation+1.50e+01

Pos logit contributions
igsaw+0.077
andel+0.069
ay+0.065
 pige+0.064
aff+0.061
Neg logit contributions
 wouldn-0.064
 hadn-0.063
'd-0.062
 never-0.058
 also-0.053
Token had
Token ID550
Feature activation+1.91e+01

Pos logit contributions
 wouldn+0.287
'd+0.277
Acc+0.274
bled+0.267
 hadn+0.259
Neg logit contributions
 k-0.396
 par-0.386
 rh-0.379
 blue-0.379
 past-0.363
Token a
Token ID257
Feature activation-5.14e+00

Pos logit contributions
 wouldn+0.573
 hadn+0.556
'd+0.546
 never+0.481
 forgot+0.464
Neg logit contributions
igsaw-0.421
andel-0.378
ay-0.372
 pige-0.364
 mel-0.347
Token picture
Token ID4286
Feature activation-1.14e+01

Pos logit contributions
igsaw+0.093
andel+0.076
ay+0.070
 pige+0.066
aff+0.062
Neg logit contributions
 wouldn-0.182
 hadn-0.180
'd-0.179
 never-0.170
 also-0.160
Token so
Token ID523
Feature activation-1.01e+00

Pos logit contributions
 wouldn+0.450
'd+0.442
 hadn+0.438
 never+0.374
 could+0.349
Neg logit contributions
igsaw-0.512
andel-0.463
 pige-0.446
ay-0.435
 past-0.421
Token excited
Token ID6568
Feature activation-7.30e+00

Pos logit contributions
igsaw+0.028
andel+0.024
ay+0.024
 pige+0.023
 mel+0.022
Neg logit contributions
 wouldn-0.027
'd-0.026
 hadn-0.026
 never-0.024
 probably-0.021
Token to
Token ID284
Feature activation+1.19e+00

Pos logit contributions
igsaw+0.184
andel+0.161
ay+0.149
 pige+0.140
 gol+0.138
Neg logit contributions
 wouldn-0.212
 hadn-0.206
'd-0.203
 never-0.200
 also-0.181
Token wear
Token ID5806
Feature activation+1.34e+00

Pos logit contributions
 wouldn+0.033
 hadn+0.033
'd+0.033
 never+0.031
 also+0.029
Neg logit contributions
igsaw-0.030
andel-0.026
ay-0.025
 pige-0.024
aff-0.023
Token her
Token ID607
Feature activation+3.52e+00

Pos logit contributions
 wouldn+0.034
 hadn+0.034
'd+0.033
 never+0.031
 also+0.028
Neg logit contributions
igsaw-0.038
andel-0.033
ay-0.032
 pige-0.031
aff-0.030
Token new
Token ID649
Feature activation-1.20e+01

Pos logit contributions
 wouldn+0.119
 hadn+0.116
'd+0.116
 never+0.109
 also+0.103
Neg logit contributions
igsaw-0.073
andel-0.060
ay-0.055
 pige-0.054
aff-0.053
Token boots
Token ID14412
Feature activation-9.03e+00

Pos logit contributions
igsaw+0.319
andel+0.271
 pige+0.256
ay+0.249
aff+0.248
Neg logit contributions
 wouldn-0.337
'd-0.327
 hadn-0.317
 never-0.304
 also-0.280
Token outside
Token ID2354
Feature activation-3.88e+00

Pos logit contributions
igsaw+0.203
andel+0.173
ay+0.161
 pige+0.155
aff+0.148
Neg logit contributions
 wouldn-0.282
 hadn-0.278
'd-0.275
 never-0.261
 also-0.242
Token.
Token ID13
Feature activation-2.19e+00

Pos logit contributions
igsaw+0.076
andel+0.063
ay+0.061
 pige+0.057
 mel+0.054
Neg logit contributions
 wouldn-0.129
 hadn-0.128
'd-0.127
 never-0.119
 also-0.112
Token She
Token ID1375
Feature activation+1.82e+01

Pos logit contributions
igsaw+0.069
andel+0.062
ay+0.059
 pige+0.057
aff+0.056
Neg logit contributions
 wouldn-0.048
 hadn-0.048
'd-0.047
 never-0.043
 also-0.038
Token put
Token ID1234
Feature activation-8.88e-01

Pos logit contributions
 hadn+0.378
 wouldn+0.365
itates+0.363
 doomed+0.362
'd+0.358
Neg logit contributions
 k-0.437
 past-0.431
 par-0.423
 blue-0.421
 mel-0.412
Token noises
Token ID26782
Feature activation+1.84e+00

Pos logit contributions
 hadn+0.147
 wouldn+0.146
'd+0.145
 never+0.131
 also+0.122
Neg logit contributions
igsaw-0.106
andel-0.094
ay-0.091
 pige-0.087
 bou-0.083
Token like
Token ID588
Feature activation+2.94e+00

Pos logit contributions
 wouldn+0.054
 hadn+0.053
'd+0.053
 never+0.050
 also+0.047
Neg logit contributions
igsaw-0.046
andel-0.039
ay-0.037
 pige-0.035
aff-0.033
Token a
Token ID257
Feature activation-5.10e+00

Pos logit contributions
 wouldn+0.093
 hadn+0.092
'd+0.091
 never+0.087
 also+0.081
Neg logit contributions
igsaw-0.065
andel-0.055
ay-0.051
 pige-0.048
aff-0.047
Token car
Token ID1097
Feature activation-1.58e+00

Pos logit contributions
igsaw+0.106
andel+0.089
ay+0.079
 pige+0.074
aff+0.072
Neg logit contributions
 wouldn-0.169
'd-0.169
 hadn-0.168
 never-0.161
 also-0.151
Token.
Token ID13
Feature activation-2.53e+00

Pos logit contributions
igsaw+0.038
andel+0.032
ay+0.029
 pige+0.028
 bou+0.027
Neg logit contributions
 wouldn-0.049
 hadn-0.048
'd-0.047
 never-0.046
 also-0.042
Token
Token ID198
Feature activation-4.03e+00

Pos logit contributions
igsaw+0.084
andel+0.076
ay+0.069
 pige+0.069
 mel+0.069
Neg logit contributions
 hadn-0.051
 wouldn-0.051
'd-0.049
 never-0.048
 could-0.044
Token
Token ID198
Feature activation-4.06e+00

Pos logit contributions
igsaw+0.101
andel+0.087
ay+0.082
 pige+0.079
aff+0.076
Neg logit contributions
 wouldn-0.115
 hadn-0.114
'd-0.113
 never-0.107
 also-0.099
TokenBut
Token ID1537
Feature activation+4.40e+00

Pos logit contributions
igsaw+0.117
andel+0.103
ay+0.098
 pige+0.095
aff+0.092
Neg logit contributions
 wouldn-0.101
 hadn-0.100
'd-0.099
 never-0.093
 also-0.085
Token they
Token ID484
Feature activation+1.81e+01

Pos logit contributions
'd+0.106
 wouldn+0.106
 hadn+0.105
 never+0.094
 probably+0.085
Neg logit contributions
igsaw-0.125
andel-0.112
ay-0.107
 pige-0.105
 mel-0.102
Token pushed
Token ID7121
Feature activation+4.46e-01

Pos logit contributions
 hadn+0.265
 wouldn+0.265
'd+0.262
Looks+0.249
 probably+0.239
Neg logit contributions
 mel-0.507
 pige-0.500
 pigeon-0.481
 par-0.476
 past-0.472
Token the
Token ID262
Feature activation-1.12e+01

Pos logit contributions
 wouldn+0.012
 never+0.011
 hadn+0.011
'd+0.011
 also+0.010
Neg logit contributions
igsaw-0.013
andel-0.011
 pige-0.010
ay-0.010
aff-0.010
Token not
Token ID407
Feature activation+1.71e+01

Pos logit contributions
igsaw+0.125
ay+0.100
andel+0.095
 purpose+0.081
 past+0.079
Neg logit contributions
 hadn-0.322
'd-0.320
 wouldn-0.318
 never-0.287
 forgot-0.282
Token to
Token ID284
Feature activation+8.96e-01

Pos logit contributions
 wouldn+0.479
 hadn+0.454
'd+0.447
 never+0.445
 not+0.405
Neg logit contributions
igsaw-0.478
andel-0.413
ay-0.395
 pige-0.375
aff-0.370
Token hurt
Token ID5938
Feature activation-4.44e+00

Pos logit contributions
 wouldn+0.021
 never+0.020
 hadn+0.020
'd+0.020
 also+0.018
Neg logit contributions
igsaw-0.029
andel-0.025
 pige-0.024
ay-0.024
 bou-0.023
Token your
Token ID534
Feature activation-2.26e+00

Pos logit contributions
igsaw+0.133
andel+0.116
ay+0.112
 pige+0.107
 bou+0.106
Neg logit contributions
 wouldn-0.107
 hadn-0.104
 never-0.101
'd-0.100
 also-0.090
Token lip
Token ID10645
Feature activation-1.05e+01

Pos logit contributions
igsaw+0.058
andel+0.051
aff+0.048
ay+0.047
 pige+0.046
Neg logit contributions
 wouldn-0.066
 hadn-0.063
'd-0.060
 never-0.060
 not-0.055
Token when
Token ID618
Feature activation-1.47e+00

Pos logit contributions
igsaw+0.222
andel+0.187
ay+0.173
 pige+0.167
aff+0.159
Neg logit contributions
 wouldn-0.343
 hadn-0.336
'd-0.334
 never-0.317
 also-0.296
Token you
Token ID345
Feature activation+1.62e+01

Pos logit contributions
igsaw+0.038
andel+0.033
 pige+0.033
ay+0.032
 purpose+0.032
Neg logit contributions
'd-0.038
 wouldn-0.037
 hadn-0.036
 never-0.032
 also-0.030
Token fall
Token ID2121
Feature activation+1.72e+00

Pos logit contributions
 wouldn+0.409
 hadn+0.396
'd+0.393
 never+0.368
 also+0.335
Neg logit contributions
igsaw-0.469
andel-0.411
ay-0.388
 pige-0.382
aff-0.370
Token."
Token ID526
Feature activation-4.32e+00

Pos logit contributions
 wouldn+0.049
 hadn+0.047
'd+0.047
 never+0.045
 also+0.041
Neg logit contributions
igsaw-0.044
andel-0.038
ay-0.036
 pige-0.035
aff-0.034
Token
Token ID198
Feature activation-4.15e+00

Pos logit contributions
igsaw+0.109
andel+0.095
ay+0.089
 pige+0.087
aff+0.082
Neg logit contributions
 wouldn-0.123
 hadn-0.121
'd-0.119
 never-0.113
 also-0.104
Token
Token ID198
Feature activation-4.15e+00

Pos logit contributions
igsaw+0.111
andel+0.098
ay+0.090
 pige+0.090
 gol+0.085
Neg logit contributions
 hadn-0.111
 wouldn-0.110
'd-0.108
 never-0.105
 also-0.097
Token friend
Token ID1545
Feature activation-7.59e+00

Pos logit contributions
igsaw+0.086
andel+0.072
ay+0.067
 pige+0.061
aff+0.061
Neg logit contributions
 wouldn-0.124
'd-0.122
 hadn-0.121
 never-0.117
 also-0.110
Token?"
Token ID1701
Feature activation-4.87e+00

Pos logit contributions
igsaw+0.190
andel+0.164
ay+0.143
 gol+0.138
 perspective+0.135
Neg logit contributions
 wouldn-0.230
'd-0.227
 hadn-0.222
 never-0.215
 also-0.203
Token
Token ID220
Feature activation+2.93e+00

Pos logit contributions
igsaw+0.139
andel+0.123
ay+0.116
 pige+0.111
 gol+0.107
Neg logit contributions
 wouldn-0.124
 hadn-0.123
'd-0.119
 never-0.115
 also-0.105
Token
Token ID198
Feature activation-4.23e+00

Pos logit contributions
 wouldn+0.091
 hadn+0.090
'd+0.089
 never+0.083
 also+0.076
Neg logit contributions
igsaw-0.066
andel-0.056
ay-0.054
 pige-0.050
aff-0.049
TokenThe
Token ID464
Feature activation-9.67e+00

Pos logit contributions
igsaw+0.124
andel+0.112
 pige+0.101
ay+0.100
 gol+0.099
Neg logit contributions
 hadn-0.101
'd-0.100
 never-0.099
 wouldn-0.097
 also-0.090
Token cow
Token ID9875
Feature activation-9.33e+00

Pos logit contributions
igsaw+0.173
andel+0.132
ay+0.121
 pige+0.105
 gol+0.104
Neg logit contributions
 wouldn-0.350
'd-0.348
 hadn-0.347
 never-0.335
 also-0.323
Token replied
Token ID8712
Feature activation+4.48e+00

Pos logit contributions
igsaw+0.259
andel+0.228
 pige+0.226
 mel+0.222
ay+0.222
Neg logit contributions
 hadn-0.207
 wouldn-0.205
'd-0.202
 never-0.184
 probably-0.168
Token,
Token ID11
Feature activation-6.75e+00

Pos logit contributions
 wouldn+0.172
 hadn+0.171
'd+0.170
 never+0.161
 also+0.153
Neg logit contributions
igsaw-0.067
andel-0.052
ay-0.047
 pige-0.044
aff-0.040
Token "
Token ID366
Feature activation-1.46e-01

Pos logit contributions
igsaw+0.167
andel+0.147
ay+0.139
 pige+0.135
aff+0.131
Neg logit contributions
 wouldn-0.187
 hadn-0.185
'd-0.183
 never-0.170
 also-0.156
TokenNo
Token ID2949
Feature activation+2.96e+00

Pos logit contributions
igsaw+0.004
andel+0.003
ay+0.003
 pige+0.003
aff+0.003
Neg logit contributions
 wouldn-0.004
 hadn-0.004
'd-0.004
 never-0.004
 also-0.004
Token way
Token ID835
Feature activation-3.63e+00

Pos logit contributions
 wouldn+0.105
 hadn+0.103
'd+0.102
 never+0.099
 also+0.092
Neg logit contributions
igsaw-0.056
andel-0.047
ay-0.042
 pige-0.040
aff-0.038
Token the
Token ID262
Feature activation-1.13e+01

Pos logit contributions
 wouldn+0.237
 hadn+0.234
'd+0.231
 never+0.213
 also+0.195
Neg logit contributions
igsaw-0.241
andel-0.212
ay-0.202
 pige-0.197
aff-0.188
Token best
Token ID1266
Feature activation-3.23e+00

Pos logit contributions
igsaw+0.368
andel+0.332
ay+0.320
 pige+0.311
aff+0.301
Neg logit contributions
 wouldn-0.235
 hadn-0.231
'd-0.226
 never-0.206
 also-0.182
Token of
Token ID286
Feature activation-2.88e+00

Pos logit contributions
igsaw+0.072
andel+0.059
ay+0.055
 pige+0.055
aff+0.053
Neg logit contributions
 wouldn-0.101
 hadn-0.099
'd-0.098
 never-0.095
 also-0.089
Token friends
Token ID2460
Feature activation-2.34e+00

Pos logit contributions
igsaw+0.047
 pige+0.038
ay+0.038
andel+0.038
 k+0.035
Neg logit contributions
 wouldn-0.106
'd-0.104
 hadn-0.104
 never-0.092
 forgot-0.091
Token.
Token ID13
Feature activation-2.61e+00

Pos logit contributions
igsaw+0.045
andel+0.036
ay+0.036
 pige+0.034
aff+0.032
Neg logit contributions
 wouldn-0.078
 hadn-0.077
'd-0.077
 never-0.071
 also-0.067
Token<|endoftext|>
Token ID50256
Feature activation-1.73e+01

Pos logit contributions
igsaw+0.068
andel+0.060
ay+0.056
 pige+0.054
 gol+0.052
Neg logit contributions
 wouldn-0.071
 hadn-0.071
'd-0.070
 never-0.067
 also-0.061
TokenOnce
Token ID7454
Feature activation-1.11e+01

Pos logit contributions
igsaw+0.424
andel+0.368
ay+0.345
 pige+0.334
aff+0.320
Neg logit contributions
 wouldn-0.497
 hadn-0.492
'd-0.487
 never-0.460
 also-0.425
Token upon
Token ID2402
Feature activation+6.92e+00

Pos logit contributions
igsaw+0.350
andel+0.314
ay+0.299
 pige+0.293
aff+0.284
Neg logit contributions
 wouldn-0.242
 hadn-0.238
'd-0.234
 never-0.217
 also-0.196
Token a
Token ID257
Feature activation-5.16e+00

Pos logit contributions
 wouldn+0.238
 hadn+0.223
'd+0.219
 never+0.194
 forgot+0.193
Neg logit contributions
igsaw-0.129
 pige-0.106
ay-0.105
 view-0.103
 par-0.101
Token time
Token ID640
Feature activation-3.10e+00

Pos logit contributions
igsaw+0.124
andel+0.109
 pige+0.100
ay+0.099
aff+0.096
Neg logit contributions
'd-0.151
 hadn-0.149
 wouldn-0.148
 never-0.144
 also-0.133
Token,
Token ID11
Feature activation-6.18e+00

Pos logit contributions
igsaw+0.059
andel+0.054
 pige+0.047
aff+0.045
ay+0.044
Neg logit contributions
 hadn-0.105
 wouldn-0.103
'd-0.102
 never-0.100
 also-0.094
Token popcorn
Token ID38914
Feature activation-1.05e+01

Pos logit contributions
igsaw+0.084
andel+0.070
ay+0.064
aff+0.063
 gol+0.060
Neg logit contributions
 wouldn-0.097
 hadn-0.097
'd-0.095
 never-0.092
 also-0.087
Token.
Token ID13
Feature activation-3.72e+00

Pos logit contributions
igsaw+0.269
andel+0.230
ay+0.210
 gol+0.199
 pige+0.198
Neg logit contributions
 wouldn-0.301
'd-0.294
 hadn-0.291
 never-0.280
 also-0.264
Token It
Token ID632
Feature activation+1.30e+01

Pos logit contributions
igsaw+0.110
andel+0.106
 pige+0.099
 bou+0.098
aff+0.098
Neg logit contributions
 hadn-0.082
'd-0.076
 wouldn-0.073
 could-0.073
 never-0.073
Token was
Token ID373
Feature activation+1.65e+01

Pos logit contributions
 wouldn+0.250
 hadn+0.230
'd+0.221
bled+0.206
 never+0.195
Neg logit contributions
 blue-0.359
 rh-0.356
 k-0.354
igsaw-0.353
oc-0.352
Token so
Token ID523
Feature activation-2.75e+00

Pos logit contributions
 wouldn+0.446
 hadn+0.439
'd+0.434
 never+0.410
 also+0.378
Neg logit contributions
igsaw-0.447
andel-0.389
ay-0.368
 pige-0.356
aff-0.346
Token y
Token ID331
Feature activation-1.93e+01

Pos logit contributions
igsaw+0.061
andel+0.051
ay+0.048
 pige+0.046
aff+0.044
Neg logit contributions
 wouldn-0.087
 hadn-0.086
'd-0.085
 never-0.081
 also-0.075
Tokenummy
Token ID13513
Feature activation-7.97e+00

Pos logit contributions
igsaw+0.301
 pige+0.236
andel+0.236
 bou+0.219
 gol+0.209
Neg logit contributions
 wouldn-0.676
 hadn-0.668
'd-0.659
 never-0.644
 almost-0.602
Token that
Token ID326
Feature activation-6.33e-01

Pos logit contributions
igsaw+0.191
andel+0.167
ay+0.151
 pige+0.142
 bou+0.139
Neg logit contributions
 wouldn-0.240
'd-0.233
 hadn-0.233
 never-0.229
 also-0.208
Token everyone
Token ID2506
Feature activation+2.96e+00

Pos logit contributions
igsaw+0.013
 mel+0.012
ay+0.012
 blue+0.012
 k+0.012
Neg logit contributions
'd-0.017
 hadn-0.017
 wouldn-0.017
itates-0.015
 managed-0.015
Token wanted
Token ID2227
Feature activation+9.30e+00

Pos logit contributions
 wouldn+0.053
 hadn+0.053
'd+0.050
 never+0.047
 probably+0.043
Neg logit contributions
igsaw-0.088
 pige-0.083
 past-0.081
 mel-0.080
ay-0.080
Token to
Token ID284
Feature activation-5.53e-01

Pos logit contributions
 wouldn+0.301
 hadn+0.299
'd+0.294
 never+0.276
 also+0.258
Neg logit contributions
igsaw-0.192
andel-0.162
ay-0.154
 pige-0.150
 mel-0.141
Token Leo
Token ID19632
Feature activation+1.55e+00

Pos logit contributions
igsaw+0.175
andel+0.154
ay+0.147
 pige+0.142
aff+0.137
Neg logit contributions
 wouldn-0.165
 hadn-0.162
'd-0.160
 never-0.150
 also-0.138
Token and
Token ID290
Feature activation+3.18e+00

Pos logit contributions
 wouldn+0.039
 hadn+0.037
'd+0.036
 never+0.033
bled+0.032
Neg logit contributions
igsaw-0.038
 par-0.034
 pige-0.034
oc-0.034
andel-0.033
Token Spike
Token ID26309
Feature activation+4.16e+00

Pos logit contributions
 wouldn+0.085
 hadn+0.083
'd+0.083
 never+0.079
 also+0.073
Neg logit contributions
igsaw-0.086
andel-0.077
ay-0.073
 pige-0.069
aff-0.067
Token found
Token ID1043
Feature activation+6.87e+00

Pos logit contributions
 wouldn+0.079
 hadn+0.077
'd+0.073
 probably+0.072
bled+0.071
Neg logit contributions
igsaw-0.109
 pige-0.107
 par-0.106
 oak-0.105
 past-0.105
Token a
Token ID257
Feature activation-5.52e+00

Pos logit contributions
 wouldn+0.205
 hadn+0.204
'd+0.204
 never+0.183
 forgot+0.172
Neg logit contributions
igsaw-0.161
andel-0.137
ay-0.134
 pige-0.128
 mel-0.125
Token big
Token ID1263
Feature activation-1.60e+01

Pos logit contributions
igsaw+0.122
andel+0.107
ay+0.099
aff+0.094
 pige+0.093
Neg logit contributions
 wouldn-0.177
'd-0.176
 hadn-0.172
 never-0.163
 also-0.156
Token tree
Token ID5509
Feature activation-6.92e+00

Pos logit contributions
igsaw+0.357
andel+0.324
aff+0.303
 pige+0.291
ites+0.285
Neg logit contributions
 wouldn-0.492
'd-0.476
 hadn-0.464
 never-0.447
 also-0.430
Token with
Token ID351
Feature activation-1.77e+00

Pos logit contributions
igsaw+0.165
andel+0.143
ay+0.134
 pige+0.129
aff+0.125
Neg logit contributions
 wouldn-0.206
 hadn-0.203
'd-0.201
 never-0.191
 also-0.177
Token a
Token ID257
Feature activation-5.46e+00

Pos logit contributions
igsaw+0.044
andel+0.038
ay+0.036
 pige+0.035
 mel+0.034
Neg logit contributions
 wouldn-0.051
 hadn-0.050
'd-0.050
 never-0.046
 also-0.042
Token twist
Token ID14528
Feature activation-2.14e+00

Pos logit contributions
igsaw+0.112
andel+0.095
ay+0.086
 pige+0.083
aff+0.081
Neg logit contributions
 wouldn-0.182
 hadn-0.181
'd-0.180
 never-0.170
 also-0.160
Tokeny
Token ID88
Feature activation+5.75e+00

Pos logit contributions
igsaw+0.048
andel+0.041
ay+0.038
 pige+0.037
aff+0.035
Neg logit contributions
 wouldn-0.067
 hadn-0.066
'd-0.065
 never-0.061
 also-0.057
Token,
Token ID11
Feature activation-5.88e+00

Pos logit contributions
igsaw+0.055
andel+0.043
ay+0.039
 pige+0.036
aff+0.034
Neg logit contributions
 wouldn-0.132
 hadn-0.130
'd-0.129
 never-0.123
 also-0.117
Token Mom
Token ID11254
Feature activation-3.41e+00

Pos logit contributions
igsaw+0.144
andel+0.125
ay+0.120
 past+0.115
 mel+0.115
Neg logit contributions
 hadn-0.166
'd-0.164
 wouldn-0.164
 never-0.149
 probably-0.138
Token gives
Token ID3607
Feature activation-8.31e-01

Pos logit contributions
igsaw+0.082
ay+0.076
 pige+0.073
aff+0.071
 mel+0.069
Neg logit contributions
 wouldn-0.092
 hadn-0.092
'd-0.087
 never-0.080
bled-0.075
Token them
Token ID606
Feature activation+1.16e+01

Pos logit contributions
igsaw+0.021
andel+0.019
ay+0.018
 pige+0.018
 mel+0.017
Neg logit contributions
 hadn-0.024
 wouldn-0.023
'd-0.023
 never-0.021
 probably-0.019
Token new
Token ID649
Feature activation-1.19e+01

Pos logit contributions
 wouldn+0.249
 never+0.234
'd+0.224
 also+0.218
 hadn+0.218
Neg logit contributions
igsaw-0.363
andel-0.342
 perspective-0.328
oc-0.313
 gol-0.310
Token v
Token ID410
Feature activation-1.85e+01

Pos logit contributions
igsaw+0.315
andel+0.297
ay+0.265
aff+0.260
round+0.250
Neg logit contributions
 wouldn-0.305
'd-0.283
 hadn-0.282
 never-0.273
 not-0.257
Tokenests
Token ID3558
Feature activation-1.55e+00

Pos logit contributions
igsaw+0.364
andel+0.311
 pige+0.272
 bou+0.269
 perspective+0.266
Neg logit contributions
 wouldn-0.608
'd-0.594
 hadn-0.588
 never-0.575
 also-0.530
Token.
Token ID13
Feature activation-2.22e+00

Pos logit contributions
igsaw+0.032
andel+0.027
ay+0.026
 pige+0.024
aff+0.023
Neg logit contributions
 wouldn-0.050
 hadn-0.050
'd-0.050
 never-0.047
 also-0.043
Token They
Token ID1119
Feature activation+1.87e+01

Pos logit contributions
igsaw+0.064
andel+0.056
ay+0.054
 pige+0.052
aff+0.050
Neg logit contributions
 wouldn-0.056
 hadn-0.055
'd-0.054
 never-0.050
 also-0.045
Token are
Token ID389
Feature activation+1.58e+01

Pos logit contributions
 hadn+0.464
 wouldn+0.463
'd+0.448
 never+0.411
 probably+0.382
Neg logit contributions
igsaw-0.513
andel-0.445
ay-0.444
 pige-0.429
 bou-0.420
Token blue
Token ID4171
Feature activation-2.08e+01

Pos logit contributions
'd+0.375
 wouldn+0.372
 hadn+0.368
 never+0.318
 probably+0.290
Neg logit contributions
igsaw-0.437
andel-0.401
ay-0.398
 pige-0.388
 past-0.382
Token to
Token ID284
Feature activation-8.56e-01

Pos logit contributions
 wouldn+0.268
 hadn+0.263
'd+0.260
 never+0.250
 also+0.229
Neg logit contributions
igsaw-0.210
andel-0.181
ay-0.170
 pige-0.162
aff-0.156
Token keep
Token ID1394
Feature activation+1.87e+00

Pos logit contributions
igsaw+0.024
andel+0.021
ay+0.020
 pige+0.019
 mel+0.019
Neg logit contributions
 wouldn-0.022
 hadn-0.022
'd-0.021
 never-0.019
 also-0.018
Token it
Token ID340
Feature activation+7.18e+00

Pos logit contributions
'd+0.051
 hadn+0.050
 wouldn+0.050
 never+0.042
 forgot+0.041
Neg logit contributions
igsaw-0.046
 golf-0.040
 sight-0.040
 past-0.039
 mel-0.039
Token forever
Token ID8097
Feature activation+4.23e+00

Pos logit contributions
 hadn+0.194
 wouldn+0.193
'd+0.192
 never+0.173
 also+0.164
Neg logit contributions
igsaw-0.184
ay-0.152
andel-0.152
 golf-0.140
 mel-0.140
Token!
Token ID0
Feature activation-2.75e+00

Pos logit contributions
 wouldn+0.139
 hadn+0.137
'd+0.136
 never+0.129
 also+0.120
Neg logit contributions
igsaw-0.088
andel-0.073
ay-0.069
 pige-0.065
aff-0.062
Token<|endoftext|>
Token ID50256
Feature activation-1.91e+01

Pos logit contributions
igsaw+0.066
andel+0.056
ay+0.054
 pige+0.051
aff+0.049
Neg logit contributions
 wouldn-0.082
 hadn-0.080
'd-0.080
 never-0.075
 also-0.069
TokenOnce
Token ID7454
Feature activation-1.32e+01

Pos logit contributions
igsaw+0.389
ay+0.327
andel+0.311
 pige+0.282
aff+0.277
Neg logit contributions
 wouldn-0.635
'd-0.607
 hadn-0.607
 never-0.574
 forgot-0.538
Token upon
Token ID2402
Feature activation+4.43e+00

Pos logit contributions
igsaw+0.396
andel+0.348
ay+0.342
 pige+0.322
aff+0.315
Neg logit contributions
 wouldn-0.307
 hadn-0.302
'd-0.299
 never-0.276
 also-0.246
Token a
Token ID257
Feature activation-7.16e+00

Pos logit contributions
 wouldn+0.160
 hadn+0.148
'd+0.145
 forgot+0.130
 never+0.129
Neg logit contributions
igsaw-0.075
 view-0.060
 pige-0.058
ay-0.058
 golf-0.058
Token time
Token ID640
Feature activation-5.38e+00

Pos logit contributions
igsaw+0.166
andel+0.146
 pige+0.133
ay+0.132
aff+0.127
Neg logit contributions
 hadn-0.214
 wouldn-0.214
'd-0.213
 never-0.204
 also-0.189
Token,
Token ID11
Feature activation-8.37e+00

Pos logit contributions
igsaw+0.088
andel+0.073
ay+0.063
 pige+0.062
aff+0.058
Neg logit contributions
 wouldn-0.200
 hadn-0.199
'd-0.196
 never-0.188
 also-0.177