TOP ACTIVATIONS
MAX = 268.801

you go , my d arl ings . Enjoy your lunch
reward . It was ch ocol ates and sweets !
When Daisy put the magn ifying glass down , the cater
Everyday he would take the av oc ados and scatter them
't know the rh in oc eros was wild . Can
's wonderful , my d arl ings !" Mom my says
Ch ir py and Wh isk ers knew that poison was
persistent â so they gathered around the ladder
he would take the av oc ados and scatter them around
give to the rh in oc eros . Lily walked to
it to the rh in oc eros . When he tasted
am taking the rh in oc eros to a new home
there was a rh in oc eros who was always naughty
. She loved to eat av oc ados . One day
. One day , Wh isk ers woke up from his
Sally told him , Wh isk ers always wanted to help
and play with your or nam ents . Maybe you can
day , Spot and Wh isk ers went to the park
that the bad rh in oc eros was now nice .
" The rh in oc eros is in there ,

MIN ACTIVATIONS
MIN = -111.984

ex cellent â until everyone in
window . It said " Army ". " Wow
the ground that said " short cut ". She thought it
. It pays off or resolves the conflict by revealing his
King was cruel and had enslaved many of the people of
have a toy he could rely on all the time .
it knew it could always rely on the oxygen tree to
Remember that you can always rely on your inner strength and
too strong . They both drowned , and their bodies were
that the child l aug heed in delight .
The brave duck believed he could make it and
same veterinarian and chose to emulate her hard work . What
an independent girl . She believed in herself and always tried
, you can reach your goal . <|endoftext|> Once upon a
to wear it . He believed he would be too hot
soft pink color . He breathed a deep breath in and
little girl . She always believed she could do anything .
replied , " I have cancer and I lost my hair
They raced each other and competed to see who the fastest
happy that the coat had prevented the bad weather . Now

INTERVAL 173.605 to 268.801
CONTAINS 0.030%

there was a three - year - old who found a
grabbed each other â s hand and set off
was a big rh in oc eros . He was very
healthy . It â s full of life .
and watched as the wood pe cker worked away . He

INTERVAL 78.408 to 173.605
CONTAINS 7.083%

wide slide . <|endoftext|> Once upon a time , there was
. Finally , she felt it w iggle out of her
gently and put her on his palm . He said ,
her mum and asked : " M um , what 's
the book said , so she went to find her mom

INTERVAL -16.788 to 78.408
CONTAINS 50.764%

" B un - Cloud . I like it
. The girl laughed and suggested that they
Yes , please !" The fairy smiled and
borrow one . Here , take this one ."
knew they had been naughty . They said sorry to Mom

INTERVAL -111.984 to -16.788
CONTAINS 42.124%

Anna . I 'm sorry I yelled at you ." Ben
" But why ? It smells so nice ," said
L ily and Tom were scared . They ran to
lived in a small house with his mom and dad .
would read to her . But sometimes , she wanted to
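
The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX, each with the share of activations it CONTAINS. The following is a minimal Python sketch under that assumption; the helper name `bin_edges` is hypothetical and is not taken from the tool that produced this dump.

```python
def bin_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi (assumed binning)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# MIN and MAX as printed in the dump.
edges = bin_edges(-111.984, 268.801)

# CONTAINS percentages as printed, ordered from the lowest interval to the highest.
shares = [42.124, 50.764, 7.083, 0.030]

for left, right, share in zip(edges[:-1], edges[1:], shares):
    print(f"{left:9.3f} to {right:9.3f}: {share:6.3f}%")

print(f"total: {sum(shares):.3f}%")
```

Because MIN and MAX are themselves rounded to three decimals, the recomputed edges may differ from the printed ones in the last digit, and the shares add up to 100% only up to rounding.
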
Token you
Token ID345
Feature activation+1.61e+02

Pos logit contributions
uddled+0.261
:+0.193
 -+0.188
;+0.185
ered+0.179
Neg logit contributions
igsaw-0.295
 redd-0.288
 ey-0.280
Mic-0.267
nam-0.257
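
Each per-token record follows the layout of the one above: the token string fused to the `Token` label (a leading space belongs to the token), its ID, the feature activation, then five positive and five negative logit contributions written as token string plus signed value. The sketch below parses one such record. It assumes this layout holds throughout; `parse_record` and `CONTRIB` are hypothetical names introduced only for illustration.

```python
import re

# A contribution line is a token string followed by a signed decimal, e.g. " --0.499".
CONTRIB = re.compile(r"^(.*)([+-]\d+\.\d+)$")

def parse_record(lines):
    """Parse one per-token record into a dict (layout assumed from this dump)."""
    rec = {"pos": [], "neg": []}
    bucket = None  # which contribution list we are currently filling
    for line in lines:
        if line == "Pos logit contributions":
            bucket = "pos"
        elif line == "Neg logit contributions":
            bucket = "neg"
        elif bucket is not None:
            m = CONTRIB.match(line)
            if m:  # skip blank or malformed lines
                rec[bucket].append((m.group(1), float(m.group(2))))
        elif line.startswith("Token ID"):
            rec["token_id"] = int(line[len("Token ID"):])
        elif line.startswith("Token"):
            rec["token"] = line[len("Token"):]  # keeps any leading space
        elif line.startswith("Feature activation"):
            rec["activation"] = float(line[len("Feature activation"):])
    return rec

record = """Token you
Token ID345
Feature activation+1.61e+02

Pos logit contributions
uddled+0.261
:+0.193
 -+0.188
;+0.185
ered+0.179
Neg logit contributions
igsaw-0.295
 redd-0.288
 ey-0.280
Mic-0.267
nam-0.257""".splitlines()

rec = parse_record(record)
print(rec["token_id"], repr(rec["token"]), rec["activation"])
```

Matching the signed value at the end of the line keeps tokens that are themselves punctuation, such as ` -` or `:`, separate from the sign of the contribution.
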
Token go
Token ID467
Feature activation+1.20e+02

Pos logit contributions
 welcome+0.438
igator+0.434
 universe+0.421
 Avery+0.416
acy+0.409
Neg logit contributions
uddled-0.710
cream-0.469
apy-0.463
aged-0.462
oving-0.462
Token,
Token ID11
Feature activation+9.73e+01

Pos logit contributions
 distance+0.335
|+0.319
 seeker+0.307
igsaw+0.304
 begg+0.301
Neg logit contributions
uddled-0.520
icked-0.414
owed-0.395
der-0.388
rums-0.377
Token my
Token ID616
Feature activation+1.23e+02

Pos logit contributions
 redd+0.325
igsaw+0.318
 ey+0.301
 ju+0.296
Mic+0.289
Neg logit contributions
uddled-0.390
icked-0.288
ered-0.272
:-0.266
 died-0.264
Token d
Token ID288
Feature activation+1.34e+02

Pos logit contributions
igsaw+0.321
 redd+0.290
 ey+0.286
 gol+0.272
 universe+0.267
Neg logit contributions
uddled-0.563
icked-0.470
ered-0.468
olded-0.429
ged-0.429
Tokenarl
Token ID7063
Feature activation+2.35e+02

Pos logit contributions
igsaw+0.258
 ey+0.204
aph+0.190
 redd+0.165
 gol+0.164
Neg logit contributions
uddled-0.701
 listened-0.630
 watched-0.627
ered-0.623
 told-0.621
Tokenings
Token ID654
Feature activation+9.52e+01

Pos logit contributions
igsaw+0.942
 ey+0.872
 redd+0.869
Mic+0.826
 gol+0.817
Neg logit contributions
uddled-0.684
:-0.510
ered-0.506
 --0.499
 kissed-0.496
Token.
Token ID13
Feature activation+6.87e+01

Pos logit contributions
igator+0.311
igsaw+0.280
ji+0.273
 sill+0.271
 Prince+0.266
Neg logit contributions
uddled-0.342
ered-0.273
aged-0.269
oving-0.268
ided-0.260
Token Enjoy
Token ID18179
Feature activation+9.07e+01

Pos logit contributions
igsaw+0.239
 redd+0.222
 ey+0.220
Mic+0.211
 yak+0.193
Neg logit contributions
uddled-0.257
:-0.206
 --0.198
;-0.193
ered-0.191
Token your
Token ID534
Feature activation+1.07e+02

Pos logit contributions
igsaw+0.250
 ey+0.235
 redd+0.235
 angles+0.217
 yak+0.214
Neg logit contributions
uddled-0.394
olded-0.304
 softened-0.302
 died-0.298
 calmed-0.297
Token lunch
Token ID9965
Feature activation+9.06e+01

Pos logit contributions
igsaw+0.270
 ey+0.266
 redd+0.244
 ju+0.216
 universe+0.208
Neg logit contributions
uddled-0.504
Because-0.428
icked-0.422
olded-0.409
:-0.398
Token reward
Token ID6721
Feature activation+1.21e+02

Pos logit contributions
igsaw+0.508
 ey+0.483
 redd+0.460
 universe+0.419
 angles+0.417
Neg logit contributions
uddled-0.849
icked-0.666
ered-0.657
olded-0.648
ves-0.646
Token.
Token ID13
Feature activation+8.49e+01

Pos logit contributions
igsaw+0.401
igator+0.389
keeping+0.388
 redd+0.356
 angles+0.352
Neg logit contributions
uddled-0.422
 calmed-0.307
wr-0.296
oving-0.293
 blamed-0.293
Token It
Token ID632
Feature activation+1.02e+02

Pos logit contributions
igsaw+0.279
Mic+0.248
 redd+0.238
 ey+0.230
|+0.228
Neg logit contributions
uddled-0.362
ered-0.277
ves-0.262
 died-0.256
icked-0.255
Token was
Token ID373
Feature activation+8.27e+01

Pos logit contributions
 redd+0.343
igator+0.331
 ju+0.322
die+0.318
 sill+0.310
Neg logit contributions
uddled-0.398
ged-0.274
;-0.271
okes-0.268
anas-0.265
Token ch
Token ID442
Feature activation+1.21e+02

Pos logit contributions
 redd+0.264
igsaw+0.260
die+0.236
 ju+0.232
 ey+0.229
Neg logit contributions
uddled-0.357
Because-0.258
okes-0.248
ves-0.244
wr-0.242
Tokenocol
Token ID4668
Feature activation+2.27e+02

Pos logit contributions
igsaw+0.249
 ey+0.210
 redd+0.209
Mic+0.188
 angles+0.160
Neg logit contributions
uddled-0.636
:-0.551
 --0.534
;-0.526
ered-0.517
Tokenates
Token ID689
Feature activation+9.07e+01

Pos logit contributions
igsaw+0.862
 ey+0.807
 redd+0.798
Mic+0.779
 angles+0.720
Neg logit contributions
uddled-0.750
:-0.593
 --0.561
;-0.551
 kissed-0.534
Token and
Token ID290
Feature activation+8.07e+01

Pos logit contributions
igsaw+0.366
 redd+0.349
 sill+0.342
 ju+0.333
acy+0.331
Neg logit contributions
uddled-0.257
 blamed-0.191
 calmed-0.184
okes-0.177
rums-0.172
Token sweets
Token ID42402
Feature activation+8.63e+01

Pos logit contributions
 redd+0.269
igsaw+0.266
nam+0.255
 ey+0.248
die+0.248
Neg logit contributions
uddled-0.318
Because-0.219
okes-0.216
olded-0.213
:-0.211
Token!
Token ID0
Feature activation+5.99e+01

Pos logit contributions
 ju+0.323
 redd+0.319
igsaw+0.312
 sill+0.304
hel+0.295
Neg logit contributions
uddled-0.257
okes-0.227
rums-0.202
olded-0.198
 blamed-0.187
Token\n
Token ID198
Feature activation+8.03e+01

Pos logit contributions
igsaw+0.200
Mic+0.180
 redd+0.174
 ey+0.168
 gol+0.161
Neg logit contributions
uddled-0.250
ered-0.189
:-0.178
 --0.175
ves-0.175
Token When
Token ID1649
Feature activation+9.85e+01

Pos logit contributions
igsaw+0.285
 ey+0.255
 redd+0.251
Mic+0.246
die+0.229
Neg logit contributions
uddled-0.442
ered-0.337
 died-0.318
ves-0.314
icked-0.309
Token Daisy
Token ID40355
Feature activation+1.16e+02

Pos logit contributions
igsaw+0.327
 redd+0.286
 ey+0.279
 avail+0.276
 Avery+0.273
Neg logit contributions
uddled-0.410
Because-0.285
der-0.283
icked-0.280
anas-0.278
Token put
Token ID1234
Feature activation+7.63e+01

Pos logit contributions
acy+0.353
 Avery+0.349
cy+0.348
|+0.343
agu+0.343
Neg logit contributions
uddled-0.446
oving-0.368
ikes-0.357
der-0.342
ves-0.319
Token the
Token ID262
Feature activation+1.11e+02

Pos logit contributions
igsaw+0.308
 redd+0.290
 ey+0.288
Mic+0.266
 angles+0.257
Neg logit contributions
uddled-0.254
:-0.180
 --0.171
;-0.171
Because-0.170
Token magn
Token ID7842
Feature activation+1.05e+02

Pos logit contributions
igsaw+0.311
 redd+0.301
 ey+0.296
 ju+0.252
Mic+0.240
Neg logit contributions
uddled-0.501
:-0.412
icked-0.402
ered-0.393
;-0.392
Tokenifying
Token ID4035
Feature activation+2.26e+02

Pos logit contributions
igsaw+0.232
 redd+0.193
 ey+0.189
Mic+0.166
acy+0.154
Neg logit contributions
uddled-0.529
:-0.446
Because-0.434
 calmed-0.430
 died-0.428
Token glass
Token ID5405
Feature activation+9.87e+01

Pos logit contributions
igsaw+0.697
 redd+0.636
 ey+0.630
Mic+0.592
 yak+0.537
Neg logit contributions
uddled-0.930
:-0.775
 --0.745
;-0.731
ered-0.717
Token down
Token ID866
Feature activation+9.61e+01

Pos logit contributions
igsaw+0.458
 ey+0.405
 redd+0.401
igator+0.390
 sill+0.383
Neg logit contributions
uddled-0.287
oving-0.191
anas-0.188
ered-0.182
Because-0.178
Token,
Token ID11
Feature activation+7.06e+01

Pos logit contributions
igsaw+0.391
 ey+0.368
 redd+0.358
igator+0.338
 sill+0.338
Neg logit contributions
uddled-0.330
anas-0.247
oving-0.230
Because-0.210
okes-0.209
Token the
Token ID262
Feature activation+1.01e+02

Pos logit contributions
igsaw+0.283
 redd+0.265
 ey+0.263
Mic+0.253
 angles+0.235
Neg logit contributions
uddled-0.222
:-0.177
 --0.167
;-0.162
ered-0.157
Token cater
Token ID20825
Feature activation+1.12e+02

Pos logit contributions
igsaw+0.262
 ey+0.247
 redd+0.237
Mic+0.214
 ju+0.198
Neg logit contributions
uddled-0.467
:-0.385
ered-0.382
icked-0.381
;-0.370
Token Everyday
Token ID48154
Feature activation+1.30e+02

Pos logit contributions
uddled+0.235
okes+0.196
rums+0.191
anas+0.182
OP+0.180
Neg logit contributions
igator-0.172
 sorts-0.167
acy-0.165
 universe-0.164
 Avery-0.160
Token he
Token ID339
Feature activation+1.22e+02

Pos logit contributions
igsaw+0.373
 universe+0.341
 Avery+0.326
 yak+0.324
 avail+0.322
Neg logit contributions
uddled-0.581
icked-0.457
okes-0.425
olded-0.419
ided-0.413
Token would
Token ID561
Feature activation+1.05e+02

Pos logit contributions
die+0.351
igator+0.343
 redd+0.325
|+0.305
 universe+0.301
Neg logit contributions
uddled-0.544
anked-0.376
der-0.369
okes-0.358
ended-0.353
Token take
Token ID1011
Feature activation+9.26e+01

Pos logit contributions
 universe+0.250
igator+0.244
 sill+0.238
igsaw+0.226
 redd+0.219
Neg logit contributions
uddled-0.566
okes-0.400
icked-0.394
awed-0.387
ounced-0.381
Token the
Token ID262
Feature activation+1.16e+02

Pos logit contributions
igsaw+0.356
 redd+0.311
Mic+0.309
 ey+0.307
 yak+0.285
Neg logit contributions
uddled-0.343
:-0.243
Because-0.232
:"-0.227
;-0.223
Token av
Token ID1196
Feature activation+2.26e+02

Pos logit contributions
igsaw+0.349
 redd+0.324
 ey+0.314
Mic+0.296
 ju+0.275
Neg logit contributions
uddled-0.500
:-0.414
 --0.390
icked-0.390
ered-0.387
Tokenoc
Token ID420
Feature activation+2.19e+02

Pos logit contributions
 ey+0.962
igsaw+0.918
 microsc+0.875
 seeker+0.859
 redd+0.845
Neg logit contributions
 and-0.522
 --0.513
uddled-0.492
;-0.483
:-0.479
Tokenados
Token ID22484
Feature activation+9.09e+01

Pos logit contributions
igsaw+0.779
 redd+0.717
 ey+0.713
Mic+0.688
 angles+0.631
Neg logit contributions
uddled-0.793
:-0.646
 --0.615
;-0.600
ered-0.585
Token and
Token ID290
Feature activation+7.87e+01

Pos logit contributions
igsaw+0.364
 redd+0.329
 ju+0.321
Mic+0.320
igator+0.314
Neg logit contributions
uddled-0.306
 calmed-0.209
ered-0.208
 died-0.205
 kissed-0.203
Token scatter
Token ID41058
Feature activation+6.74e+01

Pos logit contributions
igsaw+0.278
 redd+0.276
Mic+0.258
 ey+0.254
 ju+0.247
Neg logit contributions
uddled-0.301
:-0.218
Because-0.209
icked-0.206
;-0.205
Token them
Token ID606
Feature activation+8.94e+01

Pos logit contributions
igsaw+0.284
 redd+0.275
 ey+0.255
Mic+0.253
 ju+0.250
Neg logit contributions
uddled-0.227
:-0.144
Because-0.141
owed-0.141
 kissed-0.137
Token't
Token ID470
Feature activation+1.52e+02

Pos logit contributions
uddled+0.307
oving+0.253
opes+0.232
anas+0.229
rolled+0.219
Neg logit contributions
 redd-0.270
igsaw-0.255
 ju-0.250
 Po-0.236
nam-0.235
Token know
Token ID760
Feature activation+1.06e+02

Pos logit contributions
igator+0.447
 gol+0.389
igsaw+0.387
 possible+0.370
aph+0.369
Neg logit contributions
uddled-0.751
ipped-0.489
Because-0.489
ves-0.476
icked-0.469
Token the
Token ID262
Feature activation+1.23e+02

Pos logit contributions
 microsc+0.345
nam+0.335
igsaw+0.313
 universe+0.311
 angles+0.309
Neg logit contributions
uddled-0.430
orted-0.288
ged-0.287
oped-0.286
icked-0.279
Token rh
Token ID9529
Feature activation+1.53e+02

Pos logit contributions
igsaw+0.355
 redd+0.339
 ey+0.330
Mic+0.297
 ju+0.285
Neg logit contributions
uddled-0.543
:-0.437
icked-0.425
ered-0.418
 --0.417
Tokenin
Token ID259
Feature activation+1.55e+02

Pos logit contributions
igsaw+0.480
 redd+0.440
 ey+0.435
Mic+0.415
 angles+0.376
Neg logit contributions
uddled-0.616
:-0.516
 --0.495
;-0.485
ered-0.473
Tokenoc
Token ID420
Feature activation+2.25e+02

Pos logit contributions
igsaw+0.235
 redd+0.210
igator+0.195
 ju+0.189
 yak+0.184
Neg logit contributions
uddled-0.880
icked-0.729
okes-0.711
ered-0.699
ided-0.692
Tokeneros
Token ID27498
Feature activation+9.95e+01

Pos logit contributions
igsaw+0.676
 ey+0.605
 redd+0.593
Mic+0.572
umm+0.570
Neg logit contributions
uddled-0.906
 kissed-0.750
:-0.749
 --0.714
 calmed-0.708
Token was
Token ID373
Feature activation+8.01e+01

Pos logit contributions
igator+0.301
 vill+0.266
 kingdom+0.251
 yak+0.250
 goblin+0.243
Neg logit contributions
uddled-0.385
apy-0.306
okes-0.303
ues-0.276
inches-0.274
Token wild
Token ID4295
Feature activation+1.16e+02

Pos logit contributions
 redd+0.233
igsaw+0.227
die+0.219
 sill+0.217
chard+0.213
Neg logit contributions
uddled-0.346
Because-0.244
okes-0.243
owed-0.242
anas-0.239
Token.
Token ID13
Feature activation+6.48e+01

Pos logit contributions
igsaw+0.506
keeping+0.469
 redd+0.466
 ey+0.464
 sill+0.462
Neg logit contributions
uddled-0.314
ves-0.221
icked-0.216
ered-0.209
 kissed-0.198
Token Can
Token ID1680
Feature activation+8.21e+01

Pos logit contributions
igsaw+0.207
Mic+0.184
 ey+0.179
 redd+0.177
|+0.172
Neg logit contributions
uddled-0.278
ered-0.211
icked-0.201
ves-0.200
 --0.192
Token's
Token ID338
Feature activation+1.21e+02

Pos logit contributions
 redd+0.355
 ju+0.319
igsaw+0.318
die+0.309
 drawer+0.286
Neg logit contributions
uddled-0.461
icked-0.379
ged-0.359
olded-0.347
ided-0.334
Token wonderful
Token ID7932
Feature activation+1.22e+02

Pos logit contributions
 redd+0.337
nam+0.308
igsaw+0.304
die+0.299
 ju+0.287
Neg logit contributions
uddled-0.585
ves-0.438
oped-0.425
okes-0.421
ipped-0.418
Token,
Token ID11
Feature activation+8.47e+01

Pos logit contributions
igsaw+0.578
 redd+0.536
 ey+0.523
die+0.514
 angles+0.487
Neg logit contributions
uddled-0.353
 died-0.224
icked-0.220
 tipped-0.212
 slid-0.208
Token my
Token ID616
Feature activation+1.12e+02

Pos logit contributions
igsaw+0.241
 redd+0.227
 ey+0.200
 ju+0.198
Mic+0.194
Neg logit contributions
uddled-0.395
icked-0.306
ered-0.298
owed-0.297
 died-0.295
Token d
Token ID288
Feature activation+1.23e+02

Pos logit contributions
igsaw+0.302
 redd+0.261
 ey+0.257
 gol+0.244
 universe+0.235
Neg logit contributions
uddled-0.517
ered-0.447
icked-0.435
ves-0.425
ched-0.421
Tokenarl
Token ID7063
Feature activation+2.24e+02

Pos logit contributions
igsaw+0.282
 ey+0.225
 redd+0.196
aph+0.194
acy+0.179
Neg logit contributions
uddled-0.616
 listened-0.553
 watched-0.551
ered-0.546
 died-0.546
Tokenings
Token ID654
Feature activation+8.55e+01

Pos logit contributions
igsaw+0.929
 ey+0.865
 redd+0.854
Mic+0.809
 gol+0.796
Neg logit contributions
uddled-0.641
:-0.474
 --0.463
ered-0.457
;-0.453
Token!"
Token ID2474
Feature activation+6.06e+01

Pos logit contributions
igsaw+0.276
igator+0.274
ji+0.266
 welcome+0.247
 Penny+0.246
Neg logit contributions
uddled-0.327
ided-0.256
aged-0.255
oving-0.249
rolled-0.238
Token Mom
Token ID11254
Feature activation+8.73e+01

Pos logit contributions
igsaw+0.164
 Prince+0.156
 Avery+0.155
 Po+0.154
Mic+0.152
Neg logit contributions
uddled-0.275
ered-0.205
owed-0.196
oving-0.193
ided-0.190
Tokenmy
Token ID1820
Feature activation+1.26e+02

Pos logit contributions
 ey+0.280
igsaw+0.278
die+0.277
 Alexa+0.264
 angles+0.259
Neg logit contributions
uddled-0.331
ves-0.259
wr-0.226
 died-0.225
Because-0.225
Token says
Token ID1139
Feature activation+6.36e+01

Pos logit contributions
 Alexa+0.440
 Avery+0.420
die+0.413
 Justin+0.406
 Prince+0.400
Neg logit contributions
uddled-0.456
ves-0.318
 badly-0.311
isters-0.310
Because-0.310
Token Ch
Token ID609
Feature activation+1.30e+02

Pos logit contributions
igsaw+0.270
Mic+0.256
 ey+0.232
 redd+0.222
|+0.222
Neg logit contributions
uddled-0.371
ered-0.279
icked-0.271
ves-0.261
der-0.258
Tokenir
Token ID343
Feature activation+1.90e+02

Pos logit contributions
igsaw+0.336
 ey+0.286
Mic+0.277
 redd+0.263
|+0.244
Neg logit contributions
uddled-0.609
:-0.502
 --0.495
;-0.486
 kissed-0.482
Tokenpy
Token ID9078
Feature activation+1.22e+02

Pos logit contributions
igsaw+0.518
 ey+0.511
aph+0.493
Mic+0.480
 redd+0.463
Neg logit contributions
uddled-0.780
 kissed-0.582
 calmed-0.569
Because-0.562
anas-0.561
Token and
Token ID290
Feature activation+8.44e+01

Pos logit contributions
die+0.412
igator+0.389
Prin+0.365
 angles+0.351
|+0.335
Neg logit contributions
uddled-0.447
OP-0.293
anas-0.291
ended-0.290
 afford-0.285
Token Wh
Token ID854
Feature activation+1.19e+02

Pos logit contributions
igsaw+0.257
 redd+0.242
 ey+0.240
Mic+0.235
die+0.222
Neg logit contributions
uddled-0.350
:-0.270
 --0.261
;-0.258
ered-0.254
Tokenisk
Token ID1984
Feature activation+2.20e+02

Pos logit contributions
igsaw+0.232
Mic+0.208
 ey+0.201
die+0.184
|+0.183
Neg logit contributions
uddled-0.607
:-0.504
 kissed-0.497
 calmed-0.493
 --0.488
Tokeners
Token ID364
Feature activation+9.97e+01

Pos logit contributions
igsaw+0.802
 redd+0.741
Mic+0.735
 ey+0.726
|+0.675
Neg logit contributions
uddled-0.757
:-0.595
 --0.573
;-0.565
 kissed-0.542
Token knew
Token ID2993
Feature activation+6.09e+01

Pos logit contributions
Prin+0.261
igator+0.253
ho+0.250
 distance+0.245
 sill+0.242
Neg logit contributions
uddled-0.372
okes-0.328
ours-0.283
ues-0.280
inches-0.277
Token that
Token ID326
Feature activation+8.41e+01

Pos logit contributions
 redd+0.190
igsaw+0.183
 microsc+0.183
Mic+0.182
 Po+0.182
Neg logit contributions
uddled-0.189
okes-0.151
inches-0.134
anas-0.134
zes-0.128
Token poison
Token ID8764
Feature activation+8.31e+01

Pos logit contributions
 redd+0.260
 microsc+0.245
Mic+0.242
igsaw+0.240
 ju+0.240
Neg logit contributions
uddled-0.323
icked-0.239
okes-0.237
:-0.220
 :-0.217
Token was
Token ID373
Feature activation+6.24e+01

Pos logit contributions
igator+0.267
 mushroom+0.220
keeping+0.220
|+0.219
 goblin+0.217
Neg logit contributions
uddled-0.261
okes-0.253
anas-0.231
mmm-0.227
Because-0.222
Token persistent
Token ID16218
Feature activation+1.19e+02

Pos logit contributions
 redd+0.300
igsaw+0.272
nam+0.252
 angles+0.231
igator+0.228
Neg logit contributions
uddled-0.647
;-0.515
 --0.513
 –-0.509
wr-0.508
Tokenâ
Token ID22940
Feature activation+1.32e+02

Pos logit contributions
keeping+0.394
 angles+0.392
igsaw+0.376
 student+0.352
 sill+0.349
Neg logit contributions
uddled-0.398
icked-0.307
wr-0.304
ered-0.300
OP-0.293
Token
Token ID26391
Feature activation+1.53e+02

Pos logit contributions
igsaw+0.265
 redd+0.218
 ey+0.206
Mic+0.200
|+0.185
Neg logit contributions
uddled-0.687
:-0.588
 --0.571
 how-0.561
;-0.561
Token
Token ID447
Feature activation+1.43e+02

Pos logit contributions
igsaw+0.487
|+0.475
keeping+0.429
 gol+0.412
agu+0.407
Neg logit contributions
uddled-0.486
ered-0.452
der-0.432
icked-0.415
 how-0.394
Token
Token ID251
Feature activation+7.91e+01

Pos logit contributions
Mic+0.340
igsaw+0.336
 gol+0.295
 redd+0.284
 ju+0.274
Neg logit contributions
uddled-0.618
 about-0.614
ered-0.596
 how-0.585
:-0.566
Tokenso
Token ID568
Feature activation+2.20e+02

Pos logit contributions
 Avery+0.188
 gol+0.185
 Pre+0.184
acy+0.179
igsaw+0.178
Neg logit contributions
uddled-0.360
icked-0.265
okes-0.260
opes-0.254
owed-0.254
Token they
Token ID484
Feature activation+9.12e+01

Pos logit contributions
igsaw+0.769
 redd+0.767
 gol+0.726
igator+0.699
acy+0.691
Neg logit contributions
uddled-0.713
opes-0.482
der-0.457
icked-0.455
 died-0.440
Token gathered
Token ID9272
Feature activation+6.98e+01

Pos logit contributions
igator+0.328
acy+0.327
 redd+0.325
nam+0.323
 Avery+0.317
Neg logit contributions
uddled-0.321
okes-0.217
ves-0.196
anas-0.188
wr-0.188
Token around
Token ID1088
Feature activation+8.38e+01

Pos logit contributions
igsaw+0.239
 redd+0.235
 ey+0.210
 angles+0.207
 ju+0.204
Neg logit contributions
uddled-0.250
anas-0.200
wr-0.190
:-0.189
ves-0.185
Token the
Token ID262
Feature activation+1.01e+02

Pos logit contributions
igsaw+0.216
 microsc+0.188
 Avery+0.186
 Pre+0.182
 gol+0.181
Neg logit contributions
uddled-0.373
icked-0.319
okes-0.304
wr-0.289
ered-0.285
Token ladder
Token ID18002
Feature activation+8.38e+01

Pos logit contributions
igsaw+0.286
 redd+0.260
 ey+0.260
Mic+0.235
 gol+0.220
Neg logit contributions
uddled-0.447
icked-0.374
:-0.371
ered-0.366
 --0.354
Token he
Token ID339
Feature activation+1.22e+02

Pos logit contributions
igsaw+0.373
 universe+0.341
 Avery+0.326
 yak+0.324
 avail+0.322
Neg logit contributions
uddled-0.581
icked-0.457
okes-0.425
olded-0.419
ided-0.413
Token would
Token ID561
Feature activation+1.05e+02

Pos logit contributions
die+0.351
igator+0.343
 redd+0.325
|+0.305
 universe+0.301
Neg logit contributions
uddled-0.544
anked-0.376
der-0.369
okes-0.358
ended-0.353
Token take
Token ID1011
Feature activation+9.26e+01

Pos logit contributions
 universe+0.250
igator+0.244
 sill+0.238
igsaw+0.226
 redd+0.219
Neg logit contributions
uddled-0.566
okes-0.400
icked-0.394
awed-0.387
ounced-0.381
Token the
Token ID262
Feature activation+1.16e+02

Pos logit contributions
igsaw+0.356
 redd+0.311
Mic+0.309
 ey+0.307
 yak+0.285
Neg logit contributions
uddled-0.343
:-0.243
Because-0.232
:"-0.227
;-0.223
Token av
Token ID1196
Feature activation+2.26e+02

Pos logit contributions
igsaw+0.349
 redd+0.324
 ey+0.314
Mic+0.296
 ju+0.275
Neg logit contributions
uddled-0.500
:-0.414
 --0.390
icked-0.390
ered-0.387
Tokenoc
Token ID420
Feature activation+2.19e+02

Pos logit contributions
 ey+0.962
igsaw+0.918
 microsc+0.875
 seeker+0.859
 redd+0.845
Neg logit contributions
 and-0.522
 --0.513
uddled-0.492
;-0.483
:-0.479
Tokenados
Token ID22484
Feature activation+9.09e+01

Pos logit contributions
igsaw+0.779
 redd+0.717
 ey+0.713
Mic+0.688
 angles+0.631
Neg logit contributions
uddled-0.793
:-0.646
 --0.615
;-0.600
ered-0.585
Token and
Token ID290
Feature activation+7.87e+01

Pos logit contributions
igsaw+0.364
 redd+0.329
 ju+0.321
Mic+0.320
igator+0.314
Neg logit contributions
uddled-0.306
 calmed-0.209
ered-0.208
 died-0.205
 kissed-0.203
Token scatter
Token ID41058
Feature activation+6.74e+01

Pos logit contributions
igsaw+0.278
 redd+0.276
Mic+0.258
 ey+0.254
 ju+0.247
Neg logit contributions
uddled-0.301
:-0.218
Because-0.209
icked-0.206
;-0.205
Token them
Token ID606
Feature activation+8.94e+01

Pos logit contributions
igsaw+0.284
 redd+0.275
 ey+0.255
Mic+0.253
 ju+0.250
Neg logit contributions
uddled-0.227
:-0.144
Because-0.141
owed-0.141
 kissed-0.137
Token around
Token ID1088
Feature activation+8.49e+01

Pos logit contributions
igsaw+0.278
 redd+0.260
nam+0.255
igator+0.253
 ju+0.246
Neg logit contributions
uddled-0.378
Because-0.296
okes-0.280
anas-0.269
wr-0.268
Token give
Token ID1577
Feature activation+9.02e+01

Pos logit contributions
igsaw+0.379
 redd+0.364
 ey+0.355
Mic+0.339
igator+0.335
Neg logit contributions
uddled-0.486
:-0.337
icked-0.332
;-0.327
olded-0.324
Token to
Token ID284
Feature activation+1.04e+02

Pos logit contributions
 redd+0.302
igsaw+0.274
 ju+0.266
nam+0.265
die+0.263
Neg logit contributions
uddled-0.341
Because-0.252
anas-0.246
:-0.245
;-0.239
Token the
Token ID262
Feature activation+1.20e+02

Pos logit contributions
igsaw+0.319
 redd+0.291
 ey+0.282
Mic+0.265
die+0.255
Neg logit contributions
uddled-0.451
:-0.342
Because-0.332
 :-0.332
icked-0.329
Token rh
Token ID9529
Feature activation+1.49e+02

Pos logit contributions
 redd+0.319
igsaw+0.295
 ey+0.290
 ju+0.280
Mic+0.272
Neg logit contributions
uddled-0.561
icked-0.464
ered-0.457
:-0.455
owed-0.438
Tokenin
Token ID259
Feature activation+1.48e+02

Pos logit contributions
igsaw+0.509
 redd+0.475
 ey+0.470
Mic+0.435
 ju+0.403
Neg logit contributions
uddled-0.568
:-0.469
 --0.448
;-0.435
ered-0.420
Tokenoc
Token ID420
Feature activation+2.19e+02

Pos logit contributions
igsaw+0.270
 redd+0.205
 yak+0.198
igator+0.197
Mic+0.183
Neg logit contributions
uddled-0.862
ered-0.700
icked-0.690
pped-0.668
oving-0.668
Tokeneros
Token ID27498
Feature activation+9.52e+01

Pos logit contributions
igsaw+0.666
 ey+0.600
 redd+0.577
Mic+0.572
 gol+0.545
Neg logit contributions
uddled-0.880
:-0.722
 kissed-0.721
 --0.699
;-0.690
Token.
Token ID13
Feature activation+6.53e+01

Pos logit contributions
igator+0.333
igsaw+0.306
 redd+0.277
 yak+0.272
die+0.272
Neg logit contributions
uddled-0.358
pped-0.247
oving-0.240
Because-0.238
wr-0.238
Token Lily
Token ID20037
Feature activation+1.00e+02

Pos logit contributions
igsaw+0.214
Mic+0.196
 redd+0.192
 ey+0.182
die+0.174
Neg logit contributions
uddled-0.273
ered-0.216
ves-0.196
icked-0.195
 --0.195
Token walked
Token ID6807
Feature activation+6.56e+01

Pos logit contributions
die+0.379
ji+0.337
pert+0.314
 Alexa+0.314
 yak+0.312
Neg logit contributions
uddled-0.321
apy-0.264
ves-0.246
icked-0.242
ided-0.237
Token to
Token ID284
Feature activation+7.85e+01

Pos logit contributions
igsaw+0.291
 redd+0.274
 ey+0.271
Mic+0.263
 yak+0.247
Neg logit contributions
uddled-0.182
:-0.137
 --0.128
;-0.124
ered-0.119
Token it
Token ID340
Feature activation+1.12e+02

Pos logit contributions
igsaw+0.341
 redd+0.329
 ey+0.309
Mic+0.300
 angles+0.298
Neg logit contributions
uddled-0.327
:-0.246
;-0.236
Because-0.235
 --0.234
Token to
Token ID284
Feature activation+1.01e+02

Pos logit contributions
igsaw+0.318
 redd+0.297
 sill+0.278
 ju+0.276
igator+0.273
Neg logit contributions
uddled-0.483
Because-0.423
 :-0.382
anas-0.382
oving-0.381
Token the
Token ID262
Feature activation+1.16e+02

Pos logit contributions
igsaw+0.321
Mic+0.283
 redd+0.278
 ey+0.274
 avail+0.265
Neg logit contributions
uddled-0.432
:-0.311
Because-0.309
;-0.303
 --0.302
Token rh
Token ID9529
Feature activation+1.46e+02

Pos logit contributions
igsaw+0.305
 redd+0.294
 ey+0.284
Mic+0.268
 gol+0.261
Neg logit contributions
uddled-0.527
icked-0.431
ered-0.430
:-0.418
owed-0.415
Tokenin
Token ID259
Feature activation+1.48e+02

Pos logit contributions
igsaw+0.502
 redd+0.469
 ey+0.462
Mic+0.430
 ju+0.400
Neg logit contributions
uddled-0.556
:-0.457
 --0.437
;-0.424
 died-0.413
Tokenoc
Token ID420
Feature activation+2.18e+02

Pos logit contributions
igsaw+0.220
 yak+0.144
Mic+0.143
|+0.137
 sill+0.134
Neg logit contributions
uddled-0.862
icked-0.740
ered-0.722
okes-0.695
 calmed-0.694
Tokeneros
Token ID27498
Feature activation+9.46e+01

Pos logit contributions
igsaw+0.666
 ey+0.606
 redd+0.583
Mic+0.573
umm+0.547
Neg logit contributions
uddled-0.866
:-0.709
 kissed-0.703
 --0.684
;-0.674
Token.
Token ID13
Feature activation+6.27e+01

Pos logit contributions
igator+0.318
igsaw+0.305
 sill+0.293
 kingdom+0.284
|+0.281
Neg logit contributions
uddled-0.323
pped-0.236
ged-0.222
Because-0.218
okes-0.218
Token When
Token ID1649
Feature activation+7.53e+01

Pos logit contributions
igsaw+0.209
Mic+0.188
 redd+0.179
 gol+0.169
 ey+0.169
Neg logit contributions
uddled-0.261
ered-0.210
 --0.188
ves-0.187
owed-0.186
Token he
Token ID339
Feature activation+8.70e+01

Pos logit contributions
igsaw+0.249
 redd+0.227
 avail+0.217
 yak+0.214
 Prince+0.212
Neg logit contributions
uddled-0.311
owed-0.227
Because-0.221
okes-0.221
icked-0.215
Token tasted
Token ID29187
Feature activation+6.94e+01

Pos logit contributions
|+0.263
 begg+0.258
 gol+0.241
 Prince+0.238
 yak+0.238
Neg logit contributions
uddled-0.319
ikes-0.285
der-0.267
oving-0.260
rums-0.257
Token am
Token ID716
Feature activation+1.07e+02

Pos logit contributions
|+0.238
 redd+0.231
igator+0.226
 ju+0.221
Prin+0.197
Neg logit contributions
uddled-0.604
opes-0.477
rums-0.454
anas-0.451
ves-0.445
Token taking
Token ID2263
Feature activation+9.36e+01

Pos logit contributions
 redd+0.266
igsaw+0.266
 Avery+0.262
 Alexa+0.259
nam+0.257
Neg logit contributions
uddled-0.513
owed-0.351
ipped-0.348
ged-0.347
anas-0.344
Token the
Token ID262
Feature activation+1.19e+02

Pos logit contributions
igsaw+0.293
Mic+0.241
 redd+0.239
die+0.231
 ey+0.228
Neg logit contributions
uddled-0.428
owed-0.302
Because-0.298
:"-0.298
ided-0.288
Token rh
Token ID9529
Feature activation+1.47e+02

Pos logit contributions
igsaw+0.354
 redd+0.325
 ey+0.311
Mic+0.295
 ju+0.277
Neg logit contributions
uddled-0.522
:-0.428
icked-0.415
ered-0.411
 --0.408
Tokenin
Token ID259
Feature activation+1.47e+02

Pos logit contributions
igsaw+0.518
 ey+0.483
 redd+0.482
Mic+0.438
 ju+0.412
Neg logit contributions
uddled-0.544
:-0.442
 --0.426
;-0.412
 died-0.400
Tokenoc
Token ID420
Feature activation+2.18e+02

Pos logit contributions
igsaw+0.269
Mic+0.218
 redd+0.201
igator+0.196
 yak+0.192
Neg logit contributions
uddled-0.812
ered-0.674
icked-0.665
 softened-0.654
 calmed-0.651
Tokeneros
Token ID27498
Feature activation+9.47e+01

Pos logit contributions
igsaw+0.656
 ey+0.591
 redd+0.575
Mic+0.559
umm+0.557
Neg logit contributions
uddled-0.875
 kissed-0.723
:-0.720
 --0.687
 calmed-0.684
Token to
Token ID284
Feature activation+8.65e+01

Pos logit contributions
igator+0.320
igsaw+0.310
 distance+0.292
keeping+0.291
|+0.290
Neg logit contributions
uddled-0.354
icked-0.253
 softened-0.250
apy-0.244
 calmed-0.243
Token a
Token ID257
Feature activation+1.00e+02

Pos logit contributions
igsaw+0.298
 redd+0.270
Mic+0.264
 ey+0.263
 yak+0.242
Neg logit contributions
uddled-0.349
:-0.258
icked-0.246
 --0.242
Because-0.242
Token new
Token ID649
Feature activation+1.15e+02

Pos logit contributions
igsaw+0.310
 ey+0.291
 redd+0.287
Mic+0.266
 angles+0.248
Neg logit contributions
uddled-0.388
:-0.341
 --0.329
;-0.319
 –-0.308
Token home
Token ID1363
Feature activation+7.89e+01

Pos logit contributions
igsaw+0.239
 angle+0.206
 redd+0.204
 universe+0.202
die+0.192
Neg logit contributions
uddled-0.625
icked-0.538
olded-0.503
owed-0.492
ered-0.490
Token there
Token ID612
Feature activation+1.14e+02

Pos logit contributions
igsaw+0.477
 gol+0.428
 ey+0.408
aph+0.406
 angles+0.400
Neg logit contributions
uddled-0.390
icked-0.265
 died-0.251
anas-0.250
 kissed-0.240
Token was
Token ID373
Feature activation+9.29e+01

Pos logit contributions
 day+0.321
cy+0.319
 distance+0.318
 Rachel+0.316
 night+0.313
Neg logit contributions
uddled-0.446
der-0.356
anas-0.352
rums-0.344
aged-0.343
Token a
Token ID257
Feature activation+1.11e+02

Pos logit contributions
igsaw+0.325
 redd+0.304
Mic+0.292
 ey+0.283
 sill+0.272
Neg logit contributions
uddled-0.347
icked-0.261
ered-0.253
:-0.251
okes-0.251
Token rh
Token ID9529
Feature activation+1.43e+02

Pos logit contributions
igsaw+0.357
 ey+0.328
 redd+0.327
Mic+0.305
 angles+0.291
Neg logit contributions
uddled-0.427
:-0.371
 --0.354
;-0.344
 and-0.334
Tokenin
Token ID259
Feature activation+1.47e+02

Pos logit contributions
igsaw+0.615
 redd+0.578
 ey+0.572
 ju+0.519
 microsc+0.501
Neg logit contributions
uddled-0.445
:-0.349
 --0.339
 and-0.333
 that-0.322
Tokenoc
Token ID420
Feature activation+2.17e+02

Pos logit contributions
igsaw+0.275
Mic+0.246
 redd+0.246
 ey+0.224
 yak+0.224
Neg logit contributions
uddled-0.772
ered-0.639
icked-0.632
:-0.620
 calmed-0.601
Tokeneros
Token ID27498
Feature activation+9.32e+01

Pos logit contributions
igsaw+0.721
 ey+0.663
 redd+0.660
Mic+0.636
 gol+0.590
Neg logit contributions
uddled-0.816
:-0.667
 --0.636
 kissed-0.627
;-0.624
Token who
Token ID508
Feature activation+9.09e+01

Pos logit contributions
igator+0.269
 yak+0.264
 oct+0.258
 gol+0.254
 jungle+0.247
Neg logit contributions
uddled-0.355
ered-0.259
okes-0.251
 wipes-0.251
icked-0.249
Token was
Token ID373
Feature activation+7.20e+01

Pos logit contributions
igator+0.299
|+0.299
Mic+0.283
die+0.280
igsaw+0.280
Neg logit contributions
uddled-0.337
okes-0.249
der-0.228
ided-0.219
rums-0.216
Token always
Token ID1464
Feature activation+7.37e+01

Pos logit contributions
igsaw+0.250
 redd+0.238
 ey+0.230
Mic+0.227
die+0.209
Neg logit contributions
uddled-0.268
:-0.206
 --0.198
;-0.194
ered-0.194
Token naughty
Token ID45128
Feature activation+1.06e+02

Pos logit contributions
igsaw+0.250
 redd+0.246
 ey+0.230
Mic+0.221
 angles+0.218
Neg logit contributions
uddled-0.279
:-0.213
Because-0.199
 --0.199
icked-0.197
Token.
Token ID13
Feature activation+7.86e+01

Pos logit contributions
die+0.398
Prin+0.362
 angles+0.361
my+0.360
Mic+0.358
Neg logit contributions
uddled-0.411
ves-0.322
apy-0.314
OP-0.299
ered-0.293
Token She
Token ID1375
Feature activation+9.25e+01

Pos logit contributions
igsaw+0.296
 redd+0.284
 ey+0.279
Mic+0.263
 angles+0.251
Neg logit contributions
uddled-0.254
:-0.217
 --0.201
;-0.198
Because-0.187
Token loved
Token ID6151
Feature activation+7.46e+01

Pos logit contributions
die+0.218
Mic+0.193
Prin+0.192
|+0.191
ji+0.186
Neg logit contributions
uddled-0.375
ided-0.342
okes-0.330
wr-0.327
der-0.322
Token to
Token ID284
Feature activation+8.53e+01

Pos logit contributions
igsaw+0.364
 redd+0.344
 ey+0.342
Mic+0.332
 yak+0.313
Neg logit contributions
uddled-0.171
:-0.122
 --0.112
;-0.107
ered-0.101
Token eat
Token ID4483
Feature activation+7.85e+01

Pos logit contributions
igsaw+0.246
 ey+0.236
 redd+0.232
Mic+0.219
 universe+0.212
Neg logit contributions
uddled-0.396
olded-0.297
icked-0.288
der-0.285
:-0.281
Token av
Token ID1196
Feature activation+2.16e+02

Pos logit contributions
igsaw+0.237
 redd+0.235
 ey+0.225
Mic+0.213
 ju+0.205
Neg logit contributions
uddled-0.322
:-0.254
ered-0.248
 died-0.243
 --0.242
Tokenoc
Token ID420
Feature activation+2.08e+02

Pos logit contributions
igsaw+0.976
 ey+0.945
 Avery+0.902
acy+0.894
aph+0.879
Neg logit contributions
 and-0.513
 --0.467
,-0.464
!-0.452
;-0.439
Tokenados
Token ID22484
Feature activation+8.24e+01

Pos logit contributions
igsaw+0.695
 ey+0.632
 redd+0.630
Mic+0.618
 angles+0.553
Neg logit contributions
uddled-0.787
:-0.647
 --0.621
;-0.606
 kissed-0.604
Token.
Token ID13
Feature activation+5.97e+01

Pos logit contributions
igsaw+0.376
 redd+0.362
Mic+0.351
 ey+0.344
 ju+0.339
Neg logit contributions
uddled-0.214
ered-0.135
 died-0.133
 calmed-0.133
 kissed-0.127
Token One
Token ID1881
Feature activation+7.41e+01

Pos logit contributions
igsaw+0.267
 ey+0.250
 redd+0.250
Mic+0.239
 angles+0.228
Neg logit contributions
uddled-0.156
:-0.127
 --0.116
;-0.114
 –-0.105
Token day
Token ID1110
Feature activation+7.35e+01

Pos logit contributions
igsaw+0.207
 ey+0.194
Mic+0.182
 angles+0.176
 redd+0.173
Neg logit contributions
uddled-0.304
:-0.302
 and-0.288
 --0.288
;-0.282
Token.
Token ID13
Feature activation+8.69e+01

Pos logit contributions
igsaw+0.424
 sunny+0.371
 day+0.363
 week+0.358
agu+0.353
Neg logit contributions
uddled-0.384
Because-0.336
okes-0.299
 calmed-0.291
ered-0.289
Token One
Token ID1881
Feature activation+9.38e+01

Pos logit contributions
igsaw+0.307
Mic+0.275
 redd+0.262
 ey+0.257
|+0.253
Neg logit contributions
uddled-0.347
ered-0.267
 calmed-0.245
icked-0.245
ves-0.244
Token day
Token ID1110
Feature activation+9.49e+01

Pos logit contributions
igsaw+0.145
 redd+0.122
 ey+0.114
Mic+0.097
 yak+0.082
Neg logit contributions
uddled-0.533
:-0.462
 --0.448
;-0.443
ered-0.440
Token,
Token ID11
Feature activation+8.14e+01

Pos logit contributions
igsaw+0.325
 Po+0.311
 Prince+0.301
 Avery+0.301
 Penny+0.295
Neg logit contributions
uddled-0.359
okes-0.265
der-0.257
oving-0.247
anas-0.246
Token Wh
Token ID854
Feature activation+1.12e+02

Pos logit contributions
igsaw+0.278
 redd+0.262
Mic+0.255
nam+0.242
 ey+0.239
Neg logit contributions
uddled-0.312
:-0.243
icked-0.233
ered-0.226
okes-0.221
Tokenisk
Token ID1984
Feature activation+2.15e+02

Pos logit contributions
igsaw+0.219
 ey+0.185
Mic+0.184
 redd+0.170
|+0.158
Neg logit contributions
uddled-0.566
:-0.493
 --0.479
;-0.476
 kissed-0.474
Tokeners
Token ID364
Feature activation+9.63e+01

Pos logit contributions
igsaw+0.779
 redd+0.719
Mic+0.709
 ey+0.702
|+0.651
Neg logit contributions
uddled-0.734
:-0.584
 --0.564
;-0.561
 kissed-0.533
Token woke
Token ID19092
Feature activation+6.75e+01

Pos logit contributions
ji+0.293
 yak+0.269
|+0.267
Mic+0.267
 vill+0.262
Neg logit contributions
uddled-0.348
apy-0.294
der-0.281
ided-0.274
ves-0.269
Token up
Token ID510
Feature activation+1.04e+02

Pos logit contributions
igsaw+0.185
 redd+0.168
Mic+0.162
 ey+0.159
die+0.151
Neg logit contributions
uddled-0.286
icked-0.231
:-0.231
Because-0.231
 calmed-0.228
Token from
Token ID422
Feature activation+8.04e+01

Pos logit contributions
igsaw+0.359
 ey+0.341
 distance+0.312
 sill+0.312
 redd+0.310
Neg logit contributions
uddled-0.386
ves-0.329
anas-0.311
icked-0.299
Because-0.294
Token his
Token ID465
Feature activation+1.04e+02

Pos logit contributions
igsaw+0.273
 ey+0.252
 redd+0.244
Mic+0.232
 angles+0.218
Neg logit contributions
uddled-0.317
:-0.248
 --0.237
;-0.234
Because-0.232
Token Sally
Token ID25737
Feature activation+1.16e+02

Pos logit contributions
igsaw+0.375
 ey+0.356
 universe+0.348
 angles+0.327
acy+0.319
Neg logit contributions
uddled-0.468
anas-0.368
icked-0.337
oving-0.324
 overheard-0.318
Token told
Token ID1297
Feature activation+7.98e+01

Pos logit contributions
igsaw+0.415
 ey+0.370
 redd+0.361
ji+0.358
acy+0.356
Neg logit contributions
uddled-0.399
icked-0.298
apy-0.291
ided-0.290
ves-0.284
Token him
Token ID683
Feature activation+1.06e+02

Pos logit contributions
igsaw+0.247
 ey+0.228
 sill+0.221
 angles+0.218
 redd+0.217
Neg logit contributions
uddled-0.335
icked-0.246
wr-0.243
okes-0.236
Because-0.228
Token,
Token ID11
Feature activation+8.09e+01

Pos logit contributions
igsaw+0.362
 microsc+0.322
 sill+0.322
 angles+0.317
 Alexa+0.313
Neg logit contributions
uddled-0.372
wr-0.257
Sorry-0.247
ged-0.247
okes-0.243
Token Wh
Token ID854
Feature activation+1.15e+02

Pos logit contributions
igsaw+0.254
 redd+0.249
 ey+0.241
 ju+0.233
Mic+0.232
Neg logit contributions
uddled-0.335
icked-0.256
owed-0.238
okes-0.236
ered-0.233
Tokenisk
Token ID1984
Feature activation+2.15e+02

Pos logit contributions
igsaw+0.178
Mic+0.136
 ey+0.131
die+0.121
aph+0.116
Neg logit contributions
uddled-0.610
apy-0.542
 kissed-0.535
ered-0.529
:-0.528
Tokeners
Token ID364
Feature activation+9.52e+01

Pos logit contributions
igsaw+0.766
 redd+0.702
Mic+0.693
 ey+0.672
|+0.635
Neg logit contributions
uddled-0.749
:-0.595
 --0.575
;-0.572
 kissed-0.550
Token always
Token ID1464
Feature activation+7.29e+01

Pos logit contributions
ji+0.283
igator+0.269
pert+0.254
ho+0.253
igsaw+0.253
Neg logit contributions
uddled-0.305
apy-0.261
icked-0.252
okes-0.250
ided-0.242
Token wanted
Token ID2227
Feature activation+5.89e+01

Pos logit contributions
die+0.250
 angles+0.229
 sill+0.217
 redd+0.214
nam+0.214
Neg logit contributions
uddled-0.280
ves-0.197
ades-0.196
der-0.182
resp-0.177
Token to
Token ID284
Feature activation+7.51e+01

Pos logit contributions
igsaw+0.185
 redd+0.181
 microsc+0.180
Mic+0.174
 Po+0.171
Neg logit contributions
uddled-0.234
anas-0.164
owed-0.164
:"-0.164
icked-0.161
Token help
Token ID1037
Feature activation+6.24e+01

Pos logit contributions
igsaw+0.189
 sill+0.178
 avail+0.171
igator+0.170
 universe+0.167
Neg logit contributions
uddled-0.405
icked-0.288
ged-0.286
ipped-0.270
awed-0.270
Token and
Token ID290
Feature activation+9.81e+01

Pos logit contributions
keeping+0.382
igsaw+0.374
 sill+0.366
 distance+0.359
|+0.353
Neg logit contributions
uddled-0.376
icked-0.320
okes-0.293
ered-0.280
 softened-0.279
Token play
Token ID711
Feature activation+9.55e+01

Pos logit contributions
igsaw+0.377
 redd+0.365
 ey+0.354
|+0.339
Mic+0.335
Neg logit contributions
uddled-0.320
:-0.229
;-0.218
 --0.217
Because-0.212
Token with
Token ID351
Feature activation+9.23e+01

Pos logit contributions
|+0.278
igsaw+0.270
 sunny+0.259
 Princess+0.258
 chamber+0.254
Neg logit contributions
uddled-0.360
olded-0.322
 calmed-0.301
ounced-0.301
okes-0.299
Token your
Token ID534
Feature activation+1.16e+02

Pos logit contributions
igsaw+0.266
 ey+0.236
 redd+0.233
Mic+0.223
 angles+0.217
Neg logit contributions
uddled-0.410
olded-0.315
:-0.314
 :-0.304
icked-0.304
Token or
Token ID393
Feature activation+9.22e+01

Pos logit contributions
igsaw+0.382
 ey+0.358
 redd+0.349
Mic+0.329
 universe+0.305
Neg logit contributions
uddled-0.451
:-0.375
icked-0.358
ered-0.357
 --0.356
Tokennam
Token ID7402
Feature activation+2.14e+02

Pos logit contributions
igsaw+0.356
 redd+0.323
 ey+0.322
Mic+0.308
 yak+0.287
Neg logit contributions
uddled-0.301
:-0.251
 --0.239
 how-0.232
;-0.231
Tokenents
Token ID658
Feature activation+1.06e+02

Pos logit contributions
igsaw+1.095
 ey+1.079
 yak+1.050
 redd+1.041
 goblin+1.021
Neg logit contributions
uddled-0.418
:-0.276
 --0.274
 and-0.246
;-0.241
Token.
Token ID13
Feature activation+6.28e+01

Pos logit contributions
igsaw+0.455
igator+0.435
keeping+0.428
 redd+0.421
 angles+0.413
Neg logit contributions
uddled-0.279
Because-0.173
 softened-0.172
ered-0.172
 calmed-0.167
Token Maybe
Token ID6674
Feature activation+6.92e+01

Pos logit contributions
igsaw+0.288
 ey+0.272
 redd+0.268
Mic+0.251
 angles+0.250
Neg logit contributions
uddled-0.138
:-0.131
 --0.117
;-0.115
 and-0.112
Token you
Token ID345
Feature activation+9.34e+01

Pos logit contributions
 redd+0.274
igsaw+0.273
Mic+0.256
 ey+0.254
nam+0.246
Neg logit contributions
uddled-0.220
:-0.148
ered-0.143
icked-0.142
 --0.139
Token can
Token ID460
Feature activation+7.62e+01

Pos logit contributions
 universe+0.163
|+0.148
 seeker+0.138
 knight+0.131
igator+0.130
Neg logit contributions
uddled-0.527
oving-0.413
anked-0.406
aged-0.404
rums-0.397
Token day
Token ID1110
Feature activation+9.81e+01

Pos logit contributions
igsaw+0.102
 ey+0.085
 sunny+0.051
 day+0.045
 universe+0.045
Neg logit contributions
uddled-0.930
olded-0.821
icked-0.815
der-0.798
anas-0.780
Token,
Token ID11
Feature activation+8.23e+01

Pos logit contributions
igsaw+0.388
 Po+0.342
 Prince+0.340
 Avery+0.339
 Amber+0.335
Neg logit contributions
uddled-0.339
okes-0.239
oving-0.223
der-0.221
ered-0.216
Token Spot
Token ID15899
Feature activation+9.70e+01

Pos logit contributions
igsaw+0.262
 redd+0.240
Mic+0.231
 ey+0.227
 yak+0.215
Neg logit contributions
uddled-0.334
:-0.266
ered-0.251
icked-0.249
 --0.248
Token and
Token ID290
Feature activation+8.17e+01

Pos logit contributions
die+0.380
 yak+0.354
ji+0.347
|+0.332
ny+0.330
Neg logit contributions
uddled-0.338
apy-0.264
ves-0.262
der-0.243
ered-0.238
Token Wh
Token ID854
Feature activation+1.10e+02

Pos logit contributions
igsaw+0.353
 ey+0.333
 redd+0.320
umm+0.306
 angles+0.304
Neg logit contributions
uddled-0.215
:-0.208
 and-0.193
 --0.191
 how-0.190
Tokenisk
Token ID1984
Feature activation+2.12e+02

Pos logit contributions
igsaw+0.144
Mic+0.137
|+0.129
 ey+0.111
die+0.105
Neg logit contributions
uddled-0.609
 kissed-0.522
apy-0.521
 how-0.519
Because-0.517
Tokeners
Token ID364
Feature activation+9.31e+01

Pos logit contributions
igsaw+0.741
 redd+0.690
Mic+0.678
 ey+0.663
|+0.636
Neg logit contributions
uddled-0.741
:-0.584
 --0.567
;-0.566
Because-0.542
Token went
Token ID1816
Feature activation+5.78e+01

Pos logit contributions
 distance+0.266
|+0.231
chard+0.214
hel+0.214
ji+0.212
Neg logit contributions
uddled-0.380
okes-0.339
rums-0.335
ues-0.325
der-0.324
Token to
Token ID284
Feature activation+7.38e+01

Pos logit contributions
igsaw+0.248
 ey+0.227
 redd+0.225
Mic+0.220
 yak+0.218
Neg logit contributions
uddled-0.163
ered-0.112
Because-0.108
 calmed-0.106
:-0.106
Token the
Token ID262
Feature activation+9.68e+01

Pos logit contributions
igsaw+0.318
 redd+0.307
 ey+0.303
Mic+0.291
umm+0.279
Neg logit contributions
uddled-0.192
:-0.167
 --0.155
;-0.147
 how-0.145
Token park
Token ID3952
Feature activation+6.63e+01

Pos logit contributions
igsaw+0.319
 ey+0.295
 redd+0.289
Mic+0.277
 angles+0.264
Neg logit contributions
uddled-0.364
:-0.321
 --0.303
 that-0.292
;-0.290
Token that
Token ID326
Feature activation+9.62e+01

Pos logit contributions
igsaw+0.354
 sill+0.347
 day+0.335
keeping+0.330
 gol+0.329
Neg logit contributions
uddled-0.425
icked-0.388
ered-0.353
OP-0.346
ves-0.334
Token the
Token ID262
Feature activation+1.12e+02

Pos logit contributions
igsaw+0.331
 redd+0.294
 universe+0.276
 avail+0.275
 Avery+0.273
Neg logit contributions
uddled-0.394
icked-0.287
okes-0.283
anas-0.263
wr-0.262
Token bad
Token ID2089
Feature activation+1.15e+02

Pos logit contributions
igsaw+0.307
 redd+0.267
 ey+0.259
 gol+0.239
Mic+0.232
Neg logit contributions
uddled-0.521
icked-0.431
:-0.425
ered-0.413
 --0.413
Token rh
Token ID9529
Feature activation+1.38e+02

Pos logit contributions
igsaw+0.259
 universe+0.204
 ey+0.198
 gol+0.196
 redd+0.193
Neg logit contributions
uddled-0.612
icked-0.545
ered-0.493
olded-0.492
okes-0.489
Tokenin
Token ID259
Feature activation+1.41e+02

Pos logit contributions
igsaw+0.317
Mic+0.286
 redd+0.281
 ey+0.262
 gol+0.261
Neg logit contributions
uddled-0.626
:-0.540
 --0.526
;-0.525
ered-0.516
Tokenoc
Token ID420
Feature activation+2.12e+02

Pos logit contributions
igsaw+0.202
Mic+0.194
 gol+0.192
 yak+0.182
igator+0.179
Neg logit contributions
uddled-0.749
icked-0.648
okes-0.635
ered-0.626
ided-0.603
Tokeneros
Token ID27498
Feature activation+8.77e+01

Pos logit contributions
igsaw+0.561
umm+0.506
 gol+0.488
 ey+0.486
 redd+0.470
Neg logit contributions
uddled-0.877
 kissed-0.770
:-0.737
 calmed-0.719
ves-0.718
Token was
Token ID373
Feature activation+6.89e+01

Pos logit contributions
igator+0.240
 kingdom+0.209
 vill+0.200
 gol+0.200
 Prince+0.199
Neg logit contributions
uddled-0.333
okes-0.289
apy-0.272
ues-0.262
anked-0.259
Token now
Token ID783
Feature activation+9.12e+01

Pos logit contributions
 redd+0.188
 sill+0.183
igsaw+0.176
 sunny+0.175
 ju+0.173
Neg logit contributions
uddled-0.299
okes-0.244
wr-0.222
anas-0.221
owed-0.220
Token nice
Token ID3621
Feature activation+1.06e+02

Pos logit contributions
igsaw+0.266
 sunny+0.248
 distance+0.239
 possible+0.234
 day+0.228
Neg logit contributions
uddled-0.405
okes-0.338
icked-0.313
owed-0.311
ves-0.300
Token.
Token ID13
Feature activation+5.38e+01

Pos logit contributions
igsaw+0.424
 sill+0.391
 gol+0.387
keeping+0.382
 world+0.380
Neg logit contributions
uddled-0.331
ered-0.260
icked-0.252
ves-0.246
okes-0.241
Token
Token ID198
Feature activation+9.92e+01

Pos logit contributions
igsaw+0.490
 redd+0.466
 ey+0.458
Mic+0.443
 angles+0.418
Neg logit contributions
uddled-0.262
:-0.204
 --0.185
;-0.178
ered-0.167
Token"
Token ID1
Feature activation+8.50e+01

Pos logit contributions
igsaw+0.417
 redd+0.391
 ey+0.388
Mic+0.375
 yak+0.349
Neg logit contributions
uddled-0.294
:-0.229
 --0.215
;-0.209
ered-0.201
TokenThe
Token ID464
Feature activation+1.07e+02

Pos logit contributions
 redd+0.224
Mic+0.224
igsaw+0.222
 ey+0.202
Prin+0.193
Neg logit contributions
uddled-0.376
:-0.325
 --0.315
;-0.311
ered-0.305
Token rh
Token ID9529
Feature activation+1.37e+02

Pos logit contributions
igsaw+0.293
 redd+0.276
 ey+0.271
Mic+0.247
 yak+0.241
Neg logit contributions
uddled-0.482
:-0.381
icked-0.379
ered-0.372
 --0.370
Tokenin
Token ID259
Feature activation+1.37e+02

Pos logit contributions
igsaw+0.389
 redd+0.354
 ey+0.353
Mic+0.341
 yak+0.306
Neg logit contributions
uddled-0.581
:-0.490
 --0.474
;-0.468
ered-0.457
Tokenoc
Token ID420
Feature activation+2.11e+02

Pos logit contributions
igator+0.160
igsaw+0.155
 yak+0.135
 page+0.130
 vill+0.123
Neg logit contributions
uddled-0.827
der-0.668
icked-0.665
okes-0.631
pped-0.630
Tokeneros
Token ID27498
Feature activation+8.81e+01

Pos logit contributions
igsaw+0.658
 ey+0.611
 redd+0.598
Mic+0.573
 angles+0.543
Neg logit contributions
uddled-0.828
:-0.680
 kissed-0.661
 --0.658
;-0.650
Token is
Token ID318
Feature activation+7.14e+01

Pos logit contributions
igator+0.266
 page+0.189
 brav+0.188
 yak+0.187
 redd+0.186
Neg logit contributions
uddled-0.411
der-0.295
ged-0.273
ided-0.273
iked-0.271
Token in
Token ID287
Feature activation+8.96e+01

Pos logit contributions
 redd+0.186
igator+0.176
die+0.173
 Africa+0.172
 sill+0.171
Neg logit contributions
uddled-0.357
ipped-0.256
Because-0.256
owed-0.242
der-0.239
Token there
Token ID612
Feature activation+8.52e+01

Pos logit contributions
igsaw+0.319
 redd+0.290
 ey+0.287
Mic+0.274
 yak+0.256
Neg logit contributions
uddled-0.346
:-0.265
 --0.250
;-0.247
ered-0.243
Token,
Token ID11
Feature activation+6.43e+01

Pos logit contributions
igsaw+0.320
igator+0.298
 sill+0.296
 angles+0.296
 redd+0.295
Neg logit contributions
uddled-0.332
icked-0.234
aged-0.215
olded-0.211
ves-0.210
Token
Token ID95
Feature activation-5.99e+01

Pos logit contributions
uddled+0.172
 died+0.143
icked+0.137
aged+0.135
 checked+0.135
Neg logit contributions
igsaw-0.136
agu-0.113
acy-0.108
 gol-0.102
aph-0.102
Token
Token ID26391
Feature activation-2.55e+00

Pos logit contributions
uddled+0.299
icked+0.246
 died+0.239
:+0.238
ered+0.236
Neg logit contributions
igsaw-0.149
 redd-0.115
 ey-0.114
|-0.109
 yak-0.101
Token
Token ID129
Feature activation-4.29e-01

Pos logit contributions
uddled+0.008
ered+0.006
der+0.006
icked+0.006
:+0.006
Neg logit contributions
igsaw-0.010
 ey-0.009
 yak-0.009
 redd-0.009
Mic-0.009
Token
Token ID241
Feature activation-4.41e+01

Pos logit contributions
uddled+0.002
icked+0.001
 kissed+0.001
 died+0.001
 checked+0.001
Neg logit contributions
igsaw-0.002
 redd-0.001
 ey-0.001
 yak-0.001
|-0.001
Tokenex
Token ID1069
Feature activation+3.89e+01

Pos logit contributions
uddled+0.167
:+0.132
ered+0.125
 -+0.125
 died+0.122
Neg logit contributions
igsaw-0.150
 redd-0.142
 ey-0.136
Mic-0.130
 angles-0.122
Tokencellent
Token ID5666
Feature activation-1.12e+02

Pos logit contributions
igsaw+0.121
die+0.110
 redd+0.109
 yak+0.108
ny+0.100
Neg logit contributions
uddled-0.163
 died-0.115
icked-0.113
pt-0.110
 calmed-0.110
Tokenâ
Token ID22940
Feature activation-1.81e+01

Pos logit contributions
uddled+0.409
 died+0.305
ered+0.296
 attacked+0.291
anas+0.289
Neg logit contributions
igsaw-0.416
 redd-0.372
 ey-0.365
 sill-0.339
acy-0.335
Token
Token ID26391
Feature activation-5.59e+00

Pos logit contributions
uddled+0.099
:+0.084
 -+0.082
;+0.081
icked+0.080
Neg logit contributions
igsaw-0.033
 redd-0.026
 ey-0.026
Mic-0.022
|-0.021
Token until
Token ID1566
Feature activation-8.04e+01

Pos logit contributions
uddled+0.019
ered+0.015
:+0.014
icked+0.014
 died+0.013
Neg logit contributions
igsaw-0.022
 redd-0.019
 ey-0.019
 yak-0.018
Mic-0.018
Token everyone
Token ID2506
Feature activation-7.06e+01

Pos logit contributions
uddled+0.401
anas+0.325
icked+0.313
oving+0.306
OP+0.304
Neg logit contributions
 redd-0.186
 Avery-0.171
igsaw-0.170
nam-0.166
 ju-0.161
Token in
Token ID287
Feature activation-5.00e+01

Pos logit contributions
uddled+0.290
OP+0.197
pped+0.193
icked+0.192
 injuries+0.191
Neg logit contributions
 sorts-0.221
igator-0.216
acy-0.214
igsaw-0.212
 Alexa-0.207
Token window
Token ID4324
Feature activation-6.16e+01

Pos logit contributions
uddled+0.149
:+0.124
 -+0.119
;+0.117
ered+0.117
Neg logit contributions
igsaw-0.101
 redd-0.094
 ey-0.093
Mic-0.088
 angles-0.079
Token.
Token ID13
Feature activation-7.81e+01

Pos logit contributions
uddled+0.168
oving+0.096
Because+0.089
 kissed+0.088
 slapped+0.088
Neg logit contributions
 sill-0.289
igsaw-0.257
 angles-0.256
 ey-0.254
 redd-0.246
Token It
Token ID632
Feature activation-6.01e+01

Pos logit contributions
uddled+0.251
ered+0.184
:+0.184
 -+0.181
;+0.175
Neg logit contributions
igsaw-0.317
 redd-0.294
 ey-0.289
Mic-0.279
 gol-0.263
Token said
Token ID531
Feature activation-7.58e+01

Pos logit contributions
uddled+0.224
;+0.168
oving+0.164
:"+0.157
 –+0.156
Neg logit contributions
 redd-0.200
 sill-0.188
igator-0.187
 ey-0.181
die-0.178
Token "
Token ID366
Feature activation-6.20e+01

Pos logit contributions
uddled+0.276
oving+0.235
 stamped+0.201
Because+0.201
 argued+0.196
Neg logit contributions
 Princess-0.246
 Prince-0.244
igsaw-0.241
 hex-0.241
 Po-0.241
TokenArmy
Token ID45272
Feature activation-1.11e+02

Pos logit contributions
uddled+0.278
:+0.245
ered+0.227
 -+0.227
;+0.226
Neg logit contributions
igsaw-0.165
Mic-0.159
 redd-0.159
|-0.143
 gol-0.142
Token".
Token ID1911
Feature activation-8.51e+01

Pos logit contributions
uddled+0.336
 calmed+0.238
Because+0.238
 softened+0.234
apy+0.228
Neg logit contributions
igsaw-0.452
 yak-0.429
acy-0.411
 page-0.410
keeping-0.397
Token
Token ID198
Feature activation-5.82e+01

Pos logit contributions
uddled+0.295
ered+0.226
 -+0.212
:+0.210
owed+0.209
Neg logit contributions
igsaw-0.316
 redd-0.289
 ey-0.288
Mic-0.276
 gol-0.263
Token
Token ID198
Feature activation-5.88e+01

Pos logit contributions
uddled+0.179
:+0.130
 -+0.128
;+0.126
ered+0.126
Neg logit contributions
igsaw-0.235
 ey-0.218
 redd-0.218
Mic-0.217
|-0.199
Token"
Token ID1
Feature activation-6.67e+01

Pos logit contributions
uddled+0.180
:+0.139
 -+0.132
;+0.129
ered+0.126
Neg logit contributions
igsaw-0.243
 redd-0.228
 ey-0.226
Mic-0.221
 yak-0.204
TokenWow
Token ID22017
Feature activation-6.19e+01

Pos logit contributions
uddled+0.264
:+0.220
 -+0.211
;+0.206
ered+0.201
Neg logit contributions
igsaw-0.216
 redd-0.198
 ey-0.196
Mic-0.185
 angles-0.171
Token the
Token ID262
Feature activation-2.89e+01

Pos logit contributions
uddled+0.109
:+0.096
 -+0.090
;+0.086
 and+0.084
Neg logit contributions
igsaw-0.172
 ey-0.162
 redd-0.161
Mic-0.156
die-0.147
Token ground
Token ID2323
Feature activation-6.24e+01

Pos logit contributions
uddled+0.109
:+0.092
 -+0.087
;+0.085
ered+0.083
Neg logit contributions
igsaw-0.097
 redd-0.089
 ey-0.089
Mic-0.085
 yak-0.078
Token that
Token ID326
Feature activation-5.17e+01

Pos logit contributions
uddled+0.194
rums+0.131
 kissed+0.128
anas+0.126
icked+0.124
Neg logit contributions
 sill-0.238
igsaw-0.224
 ey-0.223
 angles-0.219
 angle-0.210
Token said
Token ID531
Feature activation-7.08e+01

Pos logit contributions
uddled+0.146
:+0.102
;+0.097
 -+0.096
 –+0.091
Neg logit contributions
 redd-0.220
igsaw-0.217
 ey-0.206
Mic-0.202
nam-0.200
Token "
Token ID366
Feature activation-5.59e+01

Pos logit contributions
uddled+0.257
oving+0.195
zes+0.171
cream+0.167
okes+0.163
Neg logit contributions
igsaw-0.256
 redd-0.240
 Po-0.228
 Alexa-0.227
 Princess-0.222
Tokenshort
Token ID19509
Feature activation-1.11e+02

Pos logit contributions
uddled+0.241
:+0.210
 -+0.190
;+0.189
der+0.186
Neg logit contributions
igsaw-0.161
Mic-0.155
 redd-0.155
|-0.140
 ey-0.136
Tokencut
Token ID8968
Feature activation+1.37e+00

Pos logit contributions
uddled+0.423
 softened+0.307
ered+0.302
 calmed+0.295
 died+0.289
Neg logit contributions
igsaw-0.432
 yak-0.371
 gol-0.364
 student-0.359
 page-0.358
Token".
Token ID1911
Feature activation-7.86e+01

Pos logit contributions
igsaw+0.004
 redd+0.004
 yak+0.004
Mic+0.004
 gol+0.004
Neg logit contributions
uddled-0.006
 kissed-0.004
Because-0.004
 softened-0.004
ered-0.004
Token She
Token ID1375
Feature activation-5.49e+01

Pos logit contributions
uddled+0.296
:+0.237
 -+0.231
ered+0.227
;+0.226
Neg logit contributions
igsaw-0.273
 redd-0.244
 ey-0.243
Mic-0.233
|-0.212
Token thought
Token ID1807
Feature activation-7.88e+01

Pos logit contributions
uddled+0.164
;+0.137
oor+0.134
 anymore+0.131
,"+0.129
Neg logit contributions
die-0.192
Mic-0.177
 Gn-0.170
 begg-0.166
Ru-0.165
Token it
Token ID340
Feature activation-5.07e+01

Pos logit contributions
uddled+0.358
Sorry+0.271
oving+0.263
Because+0.261
 overheard+0.255
Neg logit contributions
igsaw-0.225
igator-0.190
 redd-0.190
 Alexa-0.183
 page-0.183
Token.
Token ID13
Feature activation-4.85e+01

Pos logit contributions
uddled+0.151
apy+0.108
Because+0.100
okes+0.098
oving+0.097
Neg logit contributions
 sill-0.143
keeping-0.133
 distance-0.131
 angles-0.128
 night-0.124
Token It
Token ID632
Feature activation-2.89e+01

Pos logit contributions
uddled+0.168
:+0.132
 -+0.127
;+0.124
ered+0.123
Neg logit contributions
igsaw-0.185
 redd-0.169
 ey-0.168
Mic-0.162
 yak-0.149
Token pays
Token ID13831
Feature activation-3.46e+01

Pos logit contributions
uddled+0.106
;+0.075
ged+0.072
 -+0.068
aged+0.068
Neg logit contributions
igator-0.099
 redd-0.097
 sill-0.097
|-0.094
 ey-0.094
Token off
Token ID572
Feature activation-2.97e+01

Pos logit contributions
uddled+0.150
Because+0.123
apy+0.104
der+0.104
 softened+0.102
Neg logit contributions
igsaw-0.093
die-0.085
 angles-0.084
 Africa-0.083
 redd-0.082
Token or
Token ID393
Feature activation-2.63e+01

Pos logit contributions
uddled+0.097
icked+0.068
okes+0.066
oving+0.066
ered+0.063
Neg logit contributions
igsaw-0.109
 distance-0.100
keeping-0.099
 sill-0.099
 angles-0.098
Token resolves
Token ID38709
Feature activation-1.10e+02

Pos logit contributions
uddled+0.113
ves+0.081
;+0.081
Because+0.081
:+0.080
Neg logit contributions
nam-0.083
 redd-0.080
chard-0.077
igsaw-0.074
 angles-0.073
Token the
Token ID262
Feature activation-8.35e+00

Pos logit contributions
uddled+0.550
 slid+0.391
Because+0.385
icked+0.378
olded+0.377
Neg logit contributions
igsaw-0.304
 angles-0.267
 ey-0.256
 student-0.248
 universe-0.245
Token conflict
Token ID5358
Feature activation-5.15e+01

Pos logit contributions
uddled+0.035
:+0.028
icked+0.027
 -+0.026
;+0.026
Neg logit contributions
igsaw-0.026
 ey-0.024
 redd-0.023
Mic-0.022
 angles-0.021
Token by
Token ID416
Feature activation-2.21e+01

Pos logit contributions
uddled+0.160
 died+0.105
 calmed+0.105
 slid+0.104
icked+0.102
Neg logit contributions
igsaw-0.211
 angles-0.206
 ey-0.198
 microsc-0.193
nam-0.187
Token revealing
Token ID13477
Feature activation-4.21e+01

Pos logit contributions
uddled+0.076
:+0.058
 -+0.055
;+0.054
ered+0.052
Neg logit contributions
igsaw-0.084
 ey-0.078
 redd-0.078
Mic-0.074
 angles-0.071
Token his
Token ID465
Feature activation-8.10e+00

Pos logit contributions
uddled+0.173
:+0.124
Because+0.120
 -+0.118
;+0.117
Neg logit contributions
igsaw-0.141
 redd-0.133
 ey-0.130
 angles-0.125
die-0.117
Token King
Token ID2677
Feature activation-5.82e+01

Pos logit contributions
uddled+0.082
icked+0.068
ided+0.066
 kissed+0.063
owed+0.063
Neg logit contributions
igsaw-0.041
 ey-0.038
 redd-0.037
 yak-0.033
 universe-0.033
Token was
Token ID373
Feature activation-7.09e+01

Pos logit contributions
uddled+0.199
okes+0.144
icked+0.141
wr+0.140
oving+0.137
Neg logit contributions
igsaw-0.190
igator-0.179
 angles-0.178
 yak-0.174
 vill-0.171
Token cruel
Token ID12177
Feature activation-3.82e+01

Pos logit contributions
uddled+0.294
okes+0.207
Because+0.207
ves+0.206
wr+0.206
Neg logit contributions
igsaw-0.215
 redd-0.209
 sill-0.204
 ey-0.202
 angles-0.196
Token and
Token ID290
Feature activation-6.25e+01

Pos logit contributions
uddled+0.117
ered+0.084
rolled+0.079
oving+0.078
apy+0.077
Neg logit contributions
 sill-0.141
igsaw-0.141
 yak-0.137
 goblin-0.137
 angles-0.136
Token had
Token ID550
Feature activation-6.75e+01

Pos logit contributions
uddled+0.232
:+0.168
 -+0.164
;+0.164
ves+0.159
Neg logit contributions
 redd-0.215
igsaw-0.211
 ey-0.201
Mic-0.193
nam-0.192
Token enslaved
Token ID39216
Feature activation-1.08e+02

Pos logit contributions
uddled+0.270
owed+0.194
anas+0.190
 :+0.188
zes+0.188
Neg logit contributions
 redd-0.218
 ju-0.202
 sill-0.201
 microsc-0.190
igsaw-0.188
Token many
Token ID867
Feature activation-4.14e+01

Pos logit contributions
uddled+0.419
Because+0.302
okes+0.298
 calmed+0.296
 slid+0.293
Neg logit contributions
igsaw-0.370
acy-0.340
 yak-0.336
 Prince-0.330
 Avery-0.320
Token of
Token ID286
Feature activation-3.04e+01

Pos logit contributions
uddled+0.186
icked+0.142
:+0.139
owed+0.138
 :+0.135
Neg logit contributions
igsaw-0.119
 ey-0.110
 angles-0.109
 redd-0.100
 microsc-0.098
Token the
Token ID262
Feature activation-3.65e+01

Pos logit contributions
uddled+0.124
:+0.088
icked+0.087
Because+0.085
owed+0.085
Neg logit contributions
igsaw-0.103
 ey-0.096
 redd-0.091
 angles-0.091
 microsc-0.087
Token people
Token ID661
Feature activation-6.71e+01

Pos logit contributions
uddled+0.171
:+0.138
icked+0.134
ered+0.133
 -+0.132
Neg logit contributions
igsaw-0.097
 ey-0.089
 redd-0.088
Mic-0.080
 angles-0.077
Token of
Token ID286
Feature activation-3.18e+01

Pos logit contributions
uddled+0.216
icked+0.143
wr+0.131
ered+0.131
OP+0.125
Neg logit contributions
igsaw-0.272
igator-0.253
 angles-0.250
 ey-0.244
 distance-0.239
Token have
Token ID423
Feature activation-6.46e+01

Pos logit contributions
uddled+0.210
:+0.157
 -+0.148
;+0.146
icked+0.143
Neg logit contributions
igsaw-0.239
 redd-0.220
 ey-0.219
Mic-0.211
 yak-0.196
Token a
Token ID257
Feature activation-4.52e+01

Pos logit contributions
uddled+0.229
:+0.170
owed+0.159
 -+0.156
icked+0.156
Neg logit contributions
igsaw-0.247
 redd-0.236
 ey-0.220
Mic-0.219
 ju-0.214
Token toy
Token ID13373
Feature activation-6.21e+01

Pos logit contributions
uddled+0.163
:+0.136
 -+0.130
;+0.126
 died+0.121
Neg logit contributions
igsaw-0.156
 ey-0.146
 redd-0.145
Mic-0.139
 angles-0.132
Token he
Token ID339
Feature activation-5.42e+01

Pos logit contributions
uddled+0.233
ered+0.187
 calmed+0.172
owed+0.167
ves+0.166
Neg logit contributions
igsaw-0.209
 gol-0.194
 redd-0.194
Mic-0.192
 yak-0.187
Token could
Token ID714
Feature activation-6.30e+01

Pos logit contributions
uddled+0.182
der+0.141
:+0.129
;+0.125
ered+0.125
Neg logit contributions
 redd-0.196
Mic-0.192
igsaw-0.182
 ju-0.177
|-0.177
Token rely
Token ID8814
Feature activation-1.06e+02

Pos logit contributions
uddled+0.297
 calmed+0.230
der+0.227
wr+0.226
 kissed+0.224
Neg logit contributions
igsaw-0.157
 sill-0.138
 redd-0.133
 ey-0.129
 sunny-0.128
Token on
Token ID319
Feature activation-5.16e+01

Pos logit contributions
uddled+0.415
der+0.351
 calmed+0.295
Because+0.287
oving+0.282
Neg logit contributions
Mic-0.304
igsaw-0.297
nam-0.290
Ru-0.285
 redd-0.285
Token all
Token ID477
Feature activation-5.44e+01

Pos logit contributions
uddled+0.222
icked+0.168
 calmed+0.158
olded+0.157
owed+0.154
Neg logit contributions
igsaw-0.162
 ey-0.148
nam-0.145
 redd-0.143
 avail-0.142
Token the
Token ID262
Feature activation-4.08e+01

Pos logit contributions
:+0.136
uddled+0.133
 -+0.130
 how+0.126
.+0.125
Neg logit contributions
igsaw-0.231
Mic-0.227
 redd-0.222
 ey-0.215
 ju-0.202
Token time
Token ID640
Feature activation-6.50e+01

Pos logit contributions
uddled+0.154
:+0.131
 -+0.127
;+0.124
 and+0.119
Neg logit contributions
igsaw-0.137
 ey-0.122
 redd-0.121
Mic-0.116
 yak-0.106
Token.
Token ID13
Feature activation-8.47e+01

Pos logit contributions
uddled+0.226
ered+0.152
icked+0.151
 kissed+0.149
Because+0.148
Neg logit contributions
igsaw-0.233
 redd-0.218
keeping-0.215
nam-0.214
 sill-0.214
Token it
Token ID340
Feature activation-5.26e+01

Pos logit contributions
uddled+0.247
icked+0.180
ered+0.177
:+0.172
Because+0.169
Neg logit contributions
 redd-0.268
igsaw-0.251
 ey-0.250
 ju-0.242
 sill-0.234
Token knew
Token ID2993
Feature activation-8.52e+01

Pos logit contributions
uddled+0.216
okes+0.178
ikes+0.174
oving+0.165
anas+0.163
Neg logit contributions
igator-0.141
die-0.138
 sill-0.137
 redd-0.136
 ju-0.135
Token it
Token ID340
Feature activation-5.37e+01

Pos logit contributions
uddled+0.297
anas+0.221
okes+0.210
olded+0.205
ged+0.201
Neg logit contributions
 redd-0.286
 angles-0.285
 possible-0.268
 ey-0.267
igsaw-0.264
Token could
Token ID714
Feature activation-6.12e+01

Pos logit contributions
uddled+0.220
okes+0.186
inches+0.165
ipped+0.162
ooming+0.162
Neg logit contributions
 page-0.147
 universe-0.146
 redd-0.146
die-0.146
 sorts-0.143
Token always
Token ID1464
Feature activation-7.12e+01

Pos logit contributions
uddled+0.296
ged+0.219
okes+0.217
olded+0.213
cases+0.202
Neg logit contributions
 universe-0.161
 sill-0.159
igsaw-0.155
 ey-0.151
 redd-0.149
Token rely
Token ID8814
Feature activation-1.05e+02

Pos logit contributions
uddled+0.321
olded+0.223
okes+0.219
icked+0.215
Because+0.214
Neg logit contributions
 redd-0.209
 ey-0.206
 sill-0.205
 distance-0.200
 avail-0.198
Token on
Token ID319
Feature activation-5.08e+01

Pos logit contributions
uddled+0.479
Because+0.372
der+0.363
olded+0.356
 calmed+0.333
Neg logit contributions
Ru-0.264
igsaw-0.247
igator-0.239
 page-0.237
 Africa-0.235
Token the
Token ID262
Feature activation-3.96e+01

Pos logit contributions
uddled+0.221
Because+0.159
olded+0.158
icked+0.153
 slid+0.152
Neg logit contributions
igsaw-0.151
 redd-0.144
 angles-0.142
 ey-0.140
die-0.134
Token oxygen
Token ID11863
Feature activation-6.74e+01

Pos logit contributions
uddled+0.204
:+0.163
icked+0.163
olded+0.159
 -+0.157
Neg logit contributions
 redd-0.094
igsaw-0.087
 ey-0.086
 angles-0.076
 angle-0.074
Token tree
Token ID5509
Feature activation-7.47e+01

Pos logit contributions
uddled+0.259
okes+0.182
olded+0.180
icked+0.176
ered+0.174
Neg logit contributions
igsaw-0.230
igator-0.230
 distance-0.224
 redd-0.222
 universe-0.221
Token to
Token ID284
Feature activation-6.57e+01

Pos logit contributions
uddled+0.217
Because+0.133
icked+0.126
ered+0.122
okes+0.122
Neg logit contributions
igsaw-0.331
 redd-0.313
 ju-0.297
igator-0.297
 sill-0.296
Token Remember
Token ID11436
Feature activation-5.40e+01

Pos logit contributions
uddled+0.288
 died+0.211
icked+0.208
ves+0.207
ered+0.207
Neg logit contributions
igsaw-0.217
 ey-0.202
 redd-0.196
 angles-0.186
acy-0.176
Token that
Token ID326
Feature activation-5.78e+01

Pos logit contributions
uddled+0.165
anas+0.120
 kissed+0.120
ves+0.118
 tipped+0.118
Neg logit contributions
igsaw-0.208
 ey-0.201
aph-0.189
 angles-0.184
 microsc-0.181
Token you
Token ID345
Feature activation-4.84e+01

Pos logit contributions
uddled+0.218
icked+0.163
:+0.152
ered+0.150
owed+0.149
Neg logit contributions
igsaw-0.198
 ey-0.191
 redd-0.188
 angles-0.177
 universe-0.173
Token can
Token ID460
Feature activation-6.32e+01

Pos logit contributions
uddled+0.206
ged+0.141
okes+0.139
 :+0.137
wr+0.137
Neg logit contributions
igsaw-0.146
 ey-0.135
 redd-0.131
acy-0.131
 universe-0.130
Token always
Token ID1464
Feature activation-7.00e+01

Pos logit contributions
uddled+0.272
icked+0.202
 kissed+0.192
ged+0.191
olded+0.186
Neg logit contributions
igsaw-0.189
 ey-0.175
Mic-0.170
 redd-0.167
 universe-0.166
Token rely
Token ID8814
Feature activation-1.04e+02

Pos logit contributions
uddled+0.294
icked+0.206
olded+0.188
 died+0.188
:+0.187
Neg logit contributions
igsaw-0.233
 ey-0.224
 redd-0.215
Mic-0.208
 avail-0.207
Token on
Token ID319
Feature activation-5.06e+01

Pos logit contributions
uddled+0.431
der+0.332
 died+0.324
 calmed+0.323
OP+0.300
Neg logit contributions
Ru-0.301
igsaw-0.297
Mic-0.270
acy-0.265
nam-0.259
Token your
Token ID534
Feature activation-4.06e+01

Pos logit contributions
uddled+0.227
icked+0.174
 tipped+0.170
 calmed+0.170
 died+0.169
Neg logit contributions
igsaw-0.142
 ey-0.128
 angles-0.121
 redd-0.118
die-0.115
Token inner
Token ID8434
Feature activation-5.08e+01

Pos logit contributions
uddled+0.201
icked+0.164
owed+0.157
ered+0.156
 kissed+0.154
Neg logit contributions
igsaw-0.101
 ey-0.098
 angles-0.085
 redd-0.085
 universe-0.083
Token strength
Token ID4202
Feature activation-7.49e+01

Pos logit contributions
uddled+0.253
icked+0.202
owed+0.186
ered+0.183
 kissed+0.182
Neg logit contributions
igsaw-0.134
 ey-0.129
 redd-0.120
 angles-0.111
 angle-0.111
Token and
Token ID290
Feature activation-6.93e+01

Pos logit contributions
uddled+0.217
icked+0.139
ered+0.133
wr+0.131
 died+0.128
Neg logit contributions
igsaw-0.334
 angles-0.300
 ey-0.291
 redd-0.287
 sill-0.283
Token too
Token ID1165
Feature activation-4.42e+01

Pos logit contributions
uddled+0.220
:+0.158
anas+0.154
;+0.154
Because+0.152
Neg logit contributions
 redd-0.242
igsaw-0.240
Mic-0.223
 ey-0.221
 ju-0.216
Token strong
Token ID1913
Feature activation-3.26e+01

Pos logit contributions
uddled+0.188
:+0.158
 -+0.152
;+0.149
ered+0.147
Neg logit contributions
igsaw-0.129
 redd-0.119
 ey-0.114
Mic-0.112
die-0.099
Token.
Token ID13
Feature activation-7.96e+01

Pos logit contributions
uddled+0.067
ered+0.038
Because+0.035
 kissed+0.034
:+0.033
Neg logit contributions
igsaw-0.162
 redd-0.152
Mic-0.146
 ey-0.146
 gol-0.144
Token They
Token ID1119
Feature activation-6.73e+01

Pos logit contributions
uddled+0.352
ves+0.288
 -+0.287
ered+0.282
:+0.281
Neg logit contributions
igsaw-0.223
 redd-0.207
 ey-0.188
 sill-0.185
 ju-0.176
Token both
Token ID1111
Feature activation-4.43e+01

Pos logit contributions
uddled+0.199
:+0.138
Because+0.135
:"+0.133
;+0.133
Neg logit contributions
Mic-0.257
 redd-0.255
 distance-0.248
nam-0.246
|-0.243
Token drowned
Token ID30859
Feature activation-1.04e+02

Pos logit contributions
uddled+0.158
ikes+0.116
Because+0.113
:"+0.110
okes+0.108
Neg logit contributions
 distance-0.151
igator-0.145
|-0.140
 redd-0.137
 gol-0.135
Token,
Token ID11
Feature activation-7.20e+01

Pos logit contributions
uddled+0.326
Because+0.324
okes+0.295
OP+0.276
:"+0.276
Neg logit contributions
keeping-0.342
 distance-0.304
|-0.301
 night-0.284
 angles-0.277
Token and
Token ID290
Feature activation-6.73e+01

Pos logit contributions
uddled+0.217
icked+0.171
ered+0.165
okes+0.159
owed+0.154
Neg logit contributions
 distance-0.290
 redd-0.270
|-0.259
nam-0.256
 ey-0.254
Token their
Token ID511
Feature activation-3.58e+01

Pos logit contributions
uddled+0.180
:+0.142
 -+0.132
 died+0.132
;+0.126
Neg logit contributions
igsaw-0.311
 ey-0.291
 redd-0.286
Mic-0.281
 yak-0.258
Token bodies
Token ID5920
Feature activation-6.29e+01

Pos logit contributions
uddled+0.142
:+0.119
 -+0.115
;+0.114
icked+0.114
Neg logit contributions
igsaw-0.108
 redd-0.102
 ey-0.100
Mic-0.092
 angles-0.089
Token were
Token ID547
Feature activation-7.20e+01

Pos logit contributions
mmm+0.189
Because+0.179
okes+0.171
mm+0.167
 :+0.166
Neg logit contributions
 distance-0.192
keeping-0.173
|-0.168
ho-0.163
igator-0.162
Token that
Token ID326
Feature activation-4.01e+01

Pos logit contributions
uddled+0.042
olded+0.035
icked+0.033
ered+0.033
OP+0.032
Neg logit contributions
igsaw-0.071
 redd-0.065
 ju-0.064
umm-0.063
 yak-0.062
Token the
Token ID262
Feature activation-1.94e+01

Pos logit contributions
uddled+0.110
:+0.081
 -+0.075
;+0.073
ered+0.070
Neg logit contributions
igsaw-0.178
 redd-0.169
 ey-0.165
Mic-0.158
 angles-0.152
Token child
Token ID1200
Feature activation-4.00e+01

Pos logit contributions
uddled+0.086
:+0.073
 -+0.070
;+0.069
ered+0.067
Neg logit contributions
igsaw-0.054
 redd-0.049
 ey-0.048
Mic-0.046
 angles-0.041
Token l
Token ID300
Feature activation-4.00e+01

Pos logit contributions
uddled+0.159
wr+0.130
oving+0.125
resp+0.123
icked+0.123
Neg logit contributions
 sill-0.107
acy-0.105
 Avery-0.104
 universe-0.103
 redd-0.101
Tokenaug
Token ID7493
Feature activation+1.00e+01

Pos logit contributions
uddled+0.149
:+0.119
 -+0.117
;+0.116
 how+0.116
Neg logit contributions
igsaw-0.137
 ey-0.127
acy-0.116
 redd-0.114
Mic-0.108
Tokenheed
Token ID23616
Feature activation-1.03e+02

Pos logit contributions
igsaw+0.032
 ey+0.027
Mic+0.027
|+0.026
 redd+0.025
Neg logit contributions
uddled-0.040
:-0.033
 --0.033
;-0.032
 how-0.032
Token in
Token ID287
Feature activation-3.54e+01

Pos logit contributions
uddled+0.323
Because+0.241
ves+0.223
 died+0.219
:+0.219
Neg logit contributions
igsaw-0.417
 redd-0.388
 ey-0.380
 angles-0.359
die-0.357
Token delight
Token ID10974
Feature activation-4.17e+01

Pos logit contributions
uddled+0.154
:+0.123
 -+0.118
;+0.116
ered+0.115
Neg logit contributions
igsaw-0.106
 ey-0.097
 redd-0.096
Mic-0.089
 angles-0.082
Token.
Token ID13
Feature activation-6.57e+01

Pos logit contributions
uddled+0.110
Because+0.089
 died+0.080
 slid+0.078
okes+0.076
Neg logit contributions
igsaw-0.179
 sill-0.167
 angles-0.166
 ey-0.164
 redd-0.159
Token
Token ID198
Feature activation-4.24e+01

Pos logit contributions
uddled+0.277
oving+0.218
ered+0.216
owed+0.204
eding+0.200
Neg logit contributions
igsaw-0.216
 Avery-0.201
 Alexa-0.184
 Amber-0.183
 sill-0.181
Token
Token ID198
Feature activation-4.31e+01

Pos logit contributions
uddled+0.129
:+0.090
 -+0.089
;+0.087
 died+0.086
Neg logit contributions
igsaw-0.175
 ey-0.162
 redd-0.159
Mic-0.158
 angles-0.148
Token
Token ID198
Feature activation-5.49e+01

Pos logit contributions
uddled+0.360
ered+0.321
pped+0.296
ves+0.293
owed+0.293
Neg logit contributions
igsaw-0.173
 gol-0.171
 Po-0.164
aph-0.163
 Prince-0.161
Token
Token ID198
Feature activation-5.56e+01

Pos logit contributions
uddled+0.228
der+0.185
icked+0.184
owed+0.183
 stamped+0.180
Neg logit contributions
Mic-0.161
igsaw-0.152
Prin-0.139
|-0.137
Ru-0.127
TokenThe
Token ID464
Feature activation-3.71e+01

Pos logit contributions
uddled+0.224
icked+0.188
owed+0.182
 died+0.177
okes+0.177
Neg logit contributions
Mic-0.179
igsaw-0.171
Prin-0.156
 gol-0.145
Ru-0.142
Token brave
Token ID14802
Feature activation-4.43e+01

Pos logit contributions
uddled+0.140
:+0.116
 -+0.111
;+0.109
ered+0.105
Neg logit contributions
igsaw-0.126
 redd-0.118
 ey-0.116
Mic-0.111
 yak-0.102
Token duck
Token ID22045
Feature activation-6.79e+01

Pos logit contributions
uddled+0.158
:+0.126
 -+0.121
;+0.119
ered+0.116
Neg logit contributions
igsaw-0.158
 redd-0.151
 ey-0.146
Mic-0.140
 yak-0.132
Token believed
Token ID4762
Feature activation-1.02e+02

Pos logit contributions
uddled+0.220
apy+0.180
okes+0.176
wr+0.169
ikes+0.161
Neg logit contributions
die-0.231
 yak-0.210
ji-0.206
igator-0.205
 tad-0.200
Token he
Token ID339
Feature activation-5.50e+01

Pos logit contributions
uddled+0.286
 calmed+0.180
 died+0.180
anas+0.180
 kissed+0.179
Neg logit contributions
igsaw-0.435
 ey-0.392
 redd-0.391
Mic-0.387
 angles-0.378
Token could
Token ID714
Feature activation-6.34e+01

Pos logit contributions
uddled+0.218
apy+0.177
okes+0.162
wr+0.158
 s+0.157
Neg logit contributions
 angles-0.141
 brav-0.139
die-0.137
 redd-0.133
 angle-0.127
Token make
Token ID787
Feature activation-8.26e+01

Pos logit contributions
uddled+0.323
okes+0.236
ounced+0.229
olded+0.229
 kissed+0.226
Neg logit contributions
 sill-0.135
 avail-0.131
 universe-0.130
igsaw-0.128
 angle-0.126
Token it
Token ID340
Feature activation-5.70e+01

Pos logit contributions
uddled+0.253
:+0.192
 -+0.179
;+0.175
Because+0.167
Neg logit contributions
igsaw-0.344
 redd-0.323
 ey-0.317
Mic-0.306
 yak-0.290
Token and
Token ID290
Feature activation-6.99e+01

Pos logit contributions
uddled+0.255
Because+0.187
olded+0.176
anas+0.173
icked+0.173
Neg logit contributions
igsaw-0.186
 redd-0.166
 sill-0.161
 universe-0.153
 ju-0.151
Token same
Token ID976
Feature activation-2.38e+01

Pos logit contributions
uddled+0.037
:+0.031
icked+0.030
 -+0.029
ered+0.029
Neg logit contributions
 redd-0.025
igsaw-0.024
Mic-0.022
 ju-0.021
 ey-0.021
Token veterinarian
Token ID41404
Feature activation-4.81e+01

Pos logit contributions
uddled+0.106
icked+0.093
okes+0.089
:+0.086
 slid+0.083
Neg logit contributions
 redd-0.062
igsaw-0.062
Mic-0.055
 distance-0.053
 avail-0.052
Token and
Token ID290
Feature activation-3.92e+01

Pos logit contributions
uddled+0.112
ered+0.068
icked+0.064
:+0.064
Because+0.064
Neg logit contributions
igsaw-0.234
 redd-0.213
Mic-0.205
 ey-0.204
 yak-0.202
Token chose
Token ID7690
Feature activation-5.61e+01

Pos logit contributions
uddled+0.117
:+0.078
okes+0.075
 -+0.073
;+0.073
Neg logit contributions
igsaw-0.160
 redd-0.154
 ey-0.149
Mic-0.147
nam-0.145
Token to
Token ID284
Feature activation-3.68e+01

Pos logit contributions
uddled+0.204
:+0.145
wr+0.143
okes+0.142
ered+0.142
Neg logit contributions
igsaw-0.201
 redd-0.193
 ju-0.174
Mic-0.173
die-0.169
Token emulate
Token ID33836
Feature activation-1.01e+02

Pos logit contributions
uddled+0.148
icked+0.099
okes+0.099
:+0.097
Because+0.093
Neg logit contributions
igsaw-0.132
 ey-0.119
 redd-0.118
Mic-0.118
 sill-0.113
Token her
Token ID607
Feature activation-1.50e+01

Pos logit contributions
uddled+0.458
okes+0.323
Because+0.314
:+0.314
icked+0.314
Neg logit contributions
igsaw-0.306
Mic-0.254
 redd-0.251
nam-0.249
 angles-0.248
Token hard
Token ID1327
Feature activation-1.40e+01

Pos logit contributions
uddled+0.052
:+0.043
 -+0.041
;+0.040
ered+0.038
Neg logit contributions
igsaw-0.054
 redd-0.051
 ey-0.051
Mic-0.048
 angles-0.045
Token work
Token ID670
Feature activation-5.10e+01

Pos logit contributions
uddled+0.057
:+0.043
ves+0.041
Because+0.041
 kissed+0.041
Neg logit contributions
igsaw-0.046
 ey-0.040
 angles-0.037
 gol-0.036
acy-0.036
Token.
Token ID13
Feature activation-5.77e+01

Pos logit contributions
uddled+0.127
okes+0.080
Because+0.075
icked+0.073
wr+0.067
Neg logit contributions
igsaw-0.249
 redd-0.216
 angles-0.215
 sill-0.212
 ey-0.207
Token What
Token ID1867
Feature activation-5.68e+01

Pos logit contributions
uddled+0.226
ered+0.169
 -+0.158
 stamped+0.157
 slapped+0.156
Neg logit contributions
igsaw-0.213
Mic-0.182
 ey-0.177
 redd-0.167
die-0.164
Token an
Token ID281
Feature activation-4.46e+01

Pos logit contributions
uddled+0.068
:+0.064
 -+0.063
 how+0.060
 as+0.059
Neg logit contributions
igsaw-0.218
 ey-0.213
Mic-0.199
 redd-0.199
|-0.198
Token independent
Token ID4795
Feature activation-4.79e+01

Pos logit contributions
uddled+0.150
:+0.140
 -+0.135
;+0.132
 –+0.126
Neg logit contributions
igsaw-0.149
 ey-0.144
Mic-0.142
 redd-0.140
die-0.131
Token girl
Token ID2576
Feature activation-5.19e+01

Pos logit contributions
uddled+0.218
:+0.188
 -+0.182
;+0.179
ered+0.174
Neg logit contributions
igsaw-0.124
 redd-0.112
 ey-0.111
Mic-0.105
 angles-0.092
Token.
Token ID13
Feature activation-8.00e+01

Pos logit contributions
uddled+0.130
ered+0.073
icked+0.070
wr+0.066
 died+0.066
Neg logit contributions
igsaw-0.246
 redd-0.230
 angles-0.220
 student-0.220
 ey-0.220
Token She
Token ID1375
Feature activation-5.89e+01

Pos logit contributions
uddled+0.299
:+0.225
 -+0.223
ered+0.220
;+0.217
Neg logit contributions
igsaw-0.283
 ey-0.258
 redd-0.256
Mic-0.243
 angles-0.231
Token believed
Token ID4762
Feature activation-1.01e+02

Pos logit contributions
uddled+0.219
wr+0.201
okes+0.183
oned+0.177
oving+0.176
Neg logit contributions
die-0.161
 sorts-0.154
 student-0.153
 redd-0.146
acy-0.146
Token in
Token ID287
Feature activation-5.31e+01

Pos logit contributions
uddled+0.282
anas+0.197
 died+0.182
OP+0.180
 calmed+0.178
Neg logit contributions
igsaw-0.439
acy-0.392
 ey-0.391
 redd-0.388
 angles-0.385
Token herself
Token ID5223
Feature activation-5.54e+01

Pos logit contributions
uddled+0.192
:+0.137
icked+0.132
 died+0.131
 -+0.131
Neg logit contributions
igsaw-0.202
 ey-0.187
 redd-0.181
Mic-0.172
 angles-0.171
Token and
Token ID290
Feature activation-6.81e+01

Pos logit contributions
uddled+0.175
icked+0.131
 died+0.127
okes+0.124
 calmed+0.123
Neg logit contributions
igsaw-0.228
 student-0.209
 page-0.201
 angles-0.197
 sill-0.196
Token always
Token ID1464
Feature activation-7.36e+01

Pos logit contributions
uddled+0.186
:+0.118
;+0.112
 -+0.112
wr+0.110
Neg logit contributions
igsaw-0.298
 redd-0.283
 ey-0.279
 angles-0.270
nam-0.260
Token tried
Token ID3088
Feature activation-8.77e+01

Pos logit contributions
uddled+0.226
ves+0.143
:+0.132
;+0.129
 -+0.129
Neg logit contributions
igsaw-0.315
 ey-0.302
 redd-0.294
 angles-0.289
nam-0.274
Token,
Token ID11
Feature activation-6.41e+01

Pos logit contributions
uddled+0.179
okes+0.120
Because+0.110
 knocked+0.109
anas+0.109
Neg logit contributions
igsaw-0.274
 angles-0.258
 ey-0.239
die-0.235
Ru-0.235
Token you
Token ID345
Feature activation-4.13e+01

Pos logit contributions
uddled+0.219
:+0.164
 -+0.152
icked+0.151
;+0.149
Neg logit contributions
igsaw-0.243
 redd-0.233
 ey-0.228
Mic-0.216
 angles-0.208
Token can
Token ID460
Feature activation-5.64e+01

Pos logit contributions
uddled+0.190
okes+0.141
oving+0.133
 :+0.130
:"+0.128
Neg logit contributions
igsaw-0.103
acy-0.095
 universe-0.094
|-0.094
aph-0.094
Token reach
Token ID3151
Feature activation-7.72e+01

Pos logit contributions
uddled+0.238
icked+0.175
okes+0.173
olded+0.173
 kissed+0.168
Neg logit contributions
igsaw-0.156
Mic-0.151
 redd-0.146
aph-0.142
Ru-0.140
Token your
Token ID534
Feature activation-3.20e+01

Pos logit contributions
uddled+0.317
olded+0.221
Because+0.221
icked+0.219
 calmed+0.215
Neg logit contributions
igsaw-0.262
 redd-0.250
 ey-0.243
Mic-0.227
 angles-0.226
Tokengoal
Token ID35231
Feature activation-1.01e+02

Pos logit contributions
uddled+0.136
:+0.109
ered+0.107
icked+0.107
Because+0.104
Neg logit contributions
igsaw-0.096
 redd-0.091
 ey-0.086
Mic-0.082
 ju-0.078
Token.
Token ID13
Feature activation-7.57e+01

Pos logit contributions
uddled+0.335
 died+0.198
 kissed+0.193
 checked+0.190
icked+0.188
Neg logit contributions
igsaw-0.434
 redd-0.386
aph-0.385
 angles-0.384
acy-0.384
Token<|endoftext|>
Token ID50256
Feature activation-6.47e+01

Pos logit contributions
uddled+0.314
ves+0.228
ered+0.227
:+0.221
 stamped+0.221
Neg logit contributions
igsaw-0.254
Mic-0.219
 ey-0.218
 redd-0.215
aph-0.203
TokenOnce
Token ID7454
Feature activation-5.49e+01

Pos logit contributions
uddled+0.191
:+0.142
 -+0.134
ered+0.131
;+0.131
Neg logit contributions
igsaw-0.273
 redd-0.252
 ey-0.252
Mic-0.245
|-0.230
Token upon
Token ID2402
Feature activation-1.70e+01

Pos logit contributions
uddled+0.211
:+0.197
 -+0.182
;+0.177
 and+0.173
Neg logit contributions
igsaw-0.179
 redd-0.171
Mic-0.170
 ey-0.164
umm-0.150
Token a
Token ID257
Feature activation-4.22e+01

Pos logit contributions
uddled+0.039
:+0.030
 -+0.028
;+0.026
 and+0.025
Neg logit contributions
igsaw-0.083
 redd-0.079
 ey-0.078
Mic-0.076
 angles-0.072
Token to
Token ID284
Feature activation-5.91e+01

Pos logit contributions
uddled+0.263
owed+0.196
anas+0.188
+0.188
:"+0.184
Neg logit contributions
igsaw-0.209
 redd-0.204
 ey-0.197
 microsc-0.190
 sill-0.189
Token wear
Token ID5806
Feature activation-6.97e+01

Pos logit contributions
uddled+0.236
icked+0.168
:"+0.158
ves+0.154
der+0.153
Neg logit contributions
igsaw-0.195
 redd-0.185
 avail-0.183
|-0.174
 gol-0.173
Token it
Token ID340
Feature activation-5.11e+01

Pos logit contributions
uddled+0.301
owed+0.228
:"+0.227
icked+0.223
anas+0.221
Neg logit contributions
igsaw-0.190
 redd-0.187
 ey-0.168
nam-0.166
 avail-0.164
Token.
Token ID13
Feature activation-7.95e+01

Pos logit contributions
uddled+0.184
okes+0.132
apy+0.130
anas+0.130
icked+0.126
Neg logit contributions
igsaw-0.183
nam-0.175
 redd-0.172
 day-0.160
 sill-0.159
Token He
Token ID679
Feature activation-5.80e+01

Pos logit contributions
uddled+0.203
:+0.164
 -+0.147
;+0.144
 and+0.136
Neg logit contributions
igsaw-0.369
 ey-0.343
 redd-0.342
Mic-0.331
 yak-0.318
Token believed
Token ID4762
Feature activation-1.01e+02

Pos logit contributions
uddled+0.175
oor+0.139
;+0.134
 -+0.129
wr+0.128
Neg logit contributions
 redd-0.215
die-0.208
Mic-0.196
ji-0.191
 ju-0.184
Token he
Token ID339
Feature activation-5.28e+01

Pos logit contributions
uddled+0.326
anas+0.255
OP+0.236
ves+0.227
 calmed+0.227
Neg logit contributions
igsaw-0.368
nam-0.334
 ey-0.324
 redd-0.323
Mic-0.319
Token would
Token ID561
Feature activation-6.89e+01

Pos logit contributions
uddled+0.165
:+0.118
der+0.118
;+0.117
wr+0.117
Neg logit contributions
 redd-0.182
Mic-0.173
igsaw-0.172
die-0.170
|-0.166
Token be
Token ID307
Feature activation-7.06e+01

Pos logit contributions
uddled+0.352
icked+0.280
okes+0.265
:"+0.258
cream+0.256
Neg logit contributions
 sill-0.168
 distance-0.160
|-0.158
 universe-0.154
 seeker-0.150
Token too
Token ID1165
Feature activation-4.78e+01

Pos logit contributions
uddled+0.289
:+0.233
;+0.224
 -+0.223
der+0.219
Neg logit contributions
 redd-0.211
igsaw-0.206
 ey-0.187
Mic-0.186
nam-0.184
Token hot
Token ID3024
Feature activation-3.93e+01

Pos logit contributions
uddled+0.247
icked+0.208
ered+0.202
:"+0.199
:+0.198
Neg logit contributions
igsaw-0.089
 redd-0.085
nam-0.069
die-0.067
 ey-0.066
Token soft
Token ID2705
Feature activation-2.14e+01

Pos logit contributions
uddled+0.188
:+0.155
icked+0.149
ered+0.149
 -+0.148
Neg logit contributions
igsaw-0.117
 redd-0.106
 ey-0.106
Mic-0.098
 angles-0.089
Token pink
Token ID11398
Feature activation-3.07e+01

Pos logit contributions
uddled+0.084
icked+0.067
olded+0.066
Because+0.063
:+0.061
Neg logit contributions
igsaw-0.075
 ey-0.069
 redd-0.066
 universe-0.060
 angle-0.060
Token color
Token ID3124
Feature activation-6.41e+01

Pos logit contributions
uddled+0.152
olded+0.139
icked+0.132
oving+0.125
ered+0.125
Neg logit contributions
igsaw-0.073
 universe-0.067
 ey-0.065
 page-0.064
 sill-0.058
Token.
Token ID13
Feature activation-7.97e+01

Pos logit contributions
uddled+0.147
olded+0.088
icked+0.083
ered+0.081
 died+0.080
Neg logit contributions
igsaw-0.322
 redd-0.299
 ey-0.296
 sill-0.284
 angles-0.276
Token He
Token ID679
Feature activation-5.81e+01

Pos logit contributions
uddled+0.251
:+0.215
 -+0.195
;+0.191
 –+0.180
Neg logit contributions
igsaw-0.311
 redd-0.298
 ey-0.291
Mic-0.283
 angles-0.266
Token breathed
Token ID39545
Feature activation-1.01e+02

Pos logit contributions
uddled+0.193
ikes+0.158
wr+0.155
oses+0.150
oor+0.149
Neg logit contributions
die-0.192
 ey-0.184
Rachel-0.176
 redd-0.173
Mic-0.171
Token a
Token ID257
Feature activation-4.58e+01

Pos logit contributions
uddled+0.332
Because+0.249
:+0.230
 died+0.224
olded+0.223
Neg logit contributions
igsaw-0.390
 ey-0.373
 redd-0.369
Mic-0.347
 ju-0.326
Token deep
Token ID2769
Feature activation-2.75e+01

Pos logit contributions
uddled+0.174
:+0.138
 -+0.133
;+0.130
ered+0.129
Neg logit contributions
igsaw-0.161
 ey-0.147
 redd-0.146
Mic-0.136
 ju-0.127
Token breath
Token ID8033
Feature activation-6.76e+01

Pos logit contributions
uddled+0.084
:+0.069
 -+0.065
;+0.063
ered+0.061
Neg logit contributions
igsaw-0.111
 redd-0.105
 ey-0.104
Mic-0.102
 angles-0.095
Token in
Token ID287
Feature activation-5.39e+01

Pos logit contributions
uddled+0.192
okes+0.129
ered+0.126
icked+0.123
rums+0.123
Neg logit contributions
igsaw-0.291
 sill-0.272
 redd-0.259
 distance-0.258
 gol-0.255
Token and
Token ID290
Feature activation-6.80e+01

Pos logit contributions
uddled+0.192
:+0.155
 -+0.147
;+0.143
ered+0.139
Neg logit contributions
igsaw-0.198
 redd-0.184
 ey-0.183
Mic-0.174
 angles-0.162
Token little
Token ID1310
Feature activation-2.19e+01

Pos logit contributions
uddled+0.126
:+0.105
 -+0.101
;+0.099
ered+0.097
Neg logit contributions
igsaw-0.097
 redd-0.088
 ey-0.087
Mic-0.083
 yak-0.076
Token girl
Token ID2576
Feature activation-5.05e+01

Pos logit contributions
uddled+0.097
:+0.085
 -+0.083
 and+0.082
;+0.081
Neg logit contributions
igsaw-0.058
 ey-0.053
 redd-0.052
Mic-0.049
 angles-0.047
Token.
Token ID13
Feature activation-7.87e+01

Pos logit contributions
uddled+0.127
:+0.090
 -+0.084
;+0.081
ered+0.080
Neg logit contributions
igsaw-0.234
 redd-0.221
 ey-0.220
Mic-0.214
 yak-0.201
Token She
Token ID1375
Feature activation-5.83e+01

Pos logit contributions
uddled+0.247
:+0.214
 -+0.199
;+0.196
 –+0.183
Neg logit contributions
igsaw-0.304
 redd-0.284
 ey-0.281
Mic-0.278
 yak-0.259
Token always
Token ID1464
Feature activation-7.03e+01

Pos logit contributions
uddled+0.193
ided+0.157
der+0.148
ered+0.146
okes+0.145
Neg logit contributions
|-0.194
 redd-0.191
 universe-0.190
 ey-0.187
Mic-0.182
Token believed
Token ID4762
Feature activation-1.00e+02

Pos logit contributions
uddled+0.241
ves+0.183
der+0.167
:+0.166
ades+0.164
Neg logit contributions
|-0.252
 redd-0.250
die-0.239
 ey-0.233
nam-0.232
Token she
Token ID673
Feature activation-5.39e+01

Pos logit contributions
uddled+0.338
anas+0.255
 calmed+0.240
OP+0.237
 died+0.236
Neg logit contributions
igsaw-0.373
 ey-0.350
 redd-0.348
nam-0.335
 angles-0.317
Token could
Token ID714
Feature activation-6.34e+01

Pos logit contributions
uddled+0.128
:+0.083
;+0.077
 -+0.077
ered+0.072
Neg logit contributions
igsaw-0.250
 redd-0.241
 ey-0.238
Mic-0.230
 angles-0.223
Token do
Token ID466
Feature activation-5.98e+01

Pos logit contributions
uddled+0.312
ged+0.230
icked+0.227
olded+0.226
okes+0.223
Neg logit contributions
 sill-0.177
 universe-0.162
 ey-0.153
igsaw-0.151
 redd-0.150
Token anything
Token ID1997
Feature activation-6.42e+01

Pos logit contributions
uddled+0.252
icked+0.193
olded+0.185
 kissed+0.179
 :+0.178
Neg logit contributions
igsaw-0.183
 ey-0.161
 redd-0.160
 angles-0.157
 microsc-0.156
Token.
Token ID13
Feature activation-8.27e+01

Pos logit contributions
uddled+0.197
icked+0.127
 died+0.119
ered+0.117
olded+0.113
Neg logit contributions
igsaw-0.281
 redd-0.271
 ju-0.257
 ey-0.247
 angles-0.242
Token replied
Token ID8712
Feature activation-6.77e+01

Pos logit contributions
uddled+0.146
oor+0.129
der+0.114
ided+0.113
ves+0.113
Neg logit contributions
die-0.125
 redd-0.120
 Po-0.115
 ey-0.115
chard-0.113
Token,
Token ID11
Feature activation-6.24e+01

Pos logit contributions
uddled+0.192
anas+0.160
OP+0.148
Because+0.147
ered+0.144
Neg logit contributions
igsaw-0.230
 redd-0.224
 Po-0.218
chard-0.213
Mic-0.211
Token "
Token ID366
Feature activation-5.40e+01

Pos logit contributions
uddled+0.255
ered+0.234
ves+0.222
isters+0.218
OP+0.216
Neg logit contributions
chard-0.156
 redd-0.140
nam-0.137
Mic-0.136
 Prince-0.136
TokenI
Token ID40
Feature activation-5.41e+01

Pos logit contributions
uddled+0.158
Because+0.123
:+0.122
 -+0.114
;+0.112
Neg logit contributions
igsaw-0.241
 ey-0.217
 redd-0.209
Mic-0.205
 angles-0.200
Token have
Token ID423
Feature activation-5.70e+01

Pos logit contributions
uddled+0.184
:+0.140
 -+0.130
;+0.128
ered+0.128
Neg logit contributions
igsaw-0.206
 redd-0.199
 ey-0.192
Mic-0.184
 gol-0.174
Token cancer
Token ID4890
Feature activation-1.00e+02

Pos logit contributions
uddled+0.225
owed+0.170
ered+0.168
:"+0.167
:+0.166
Neg logit contributions
 redd-0.198
igsaw-0.185
 ju-0.178
nam-0.173
 ey-0.172
Token and
Token ID290
Feature activation-6.02e+01

Pos logit contributions
uddled+0.310
oving+0.242
 softened+0.227
ered+0.223
Because+0.223
Neg logit contributions
igsaw-0.363
 sill-0.350
 redd-0.343
ro-0.337
acy-0.335
Token I
Token ID314
Feature activation-5.52e+01

Pos logit contributions
uddled+0.221
:+0.162
 -+0.158
;+0.158
:"+0.156
Neg logit contributions
igsaw-0.213
 redd-0.204
 ey-0.202
nam-0.195
Mic-0.187
Token lost
Token ID2626
Feature activation-6.12e+01

Pos logit contributions
uddled+0.228
:+0.180
:"+0.173
;+0.172
 :+0.170
Neg logit contributions
 redd-0.165
|-0.155
 ey-0.153
igsaw-0.149
 sill-0.145
Token my
Token ID616
Feature activation-3.56e+01

Pos logit contributions
uddled+0.299
:+0.227
Because+0.224
 calmed+0.219
:"+0.218
Neg logit contributions
igsaw-0.154
 redd-0.148
 ey-0.142
nam-0.131
Mic-0.126
Token hair
Token ID4190
Feature activation-6.37e+01

Pos logit contributions
uddled+0.142
icked+0.113
:+0.111
 -+0.106
:"+0.106
Neg logit contributions
 redd-0.115
 ey-0.114
 sill-0.104
igsaw-0.102
 angles-0.098
Token They
Token ID1119
Feature activation-5.95e+01

Pos logit contributions
uddled+0.290
 died+0.220
 kissed+0.219
:+0.217
 calmed+0.216
Neg logit contributions
igsaw-0.230
 redd-0.210
 ey-0.206
Mic-0.199
 yak-0.195
Token raced
Token ID32897
Feature activation-8.30e+01

Pos logit contributions
uddled+0.192
der+0.147
ves+0.146
:+0.145
wr+0.142
Neg logit contributions
nam-0.221
 redd-0.216
Mic-0.215
chard-0.208
 gol-0.197
Token each
Token ID1123
Feature activation-4.87e+01

Pos logit contributions
uddled+0.258
icked+0.176
:+0.170
 died+0.169
der+0.168
Neg logit contributions
igsaw-0.342
 redd-0.316
Mic-0.308
 ey-0.307
 yak-0.299
Token other
Token ID584
Feature activation-1.90e+01

Pos logit contributions
 and+0.145
uddled+0.142
:+0.134
 -+0.124
,+0.123
Neg logit contributions
igsaw-0.211
 ey-0.198
 redd-0.181
umm-0.172
 ju-0.169
Token and
Token ID290
Feature activation-5.93e+01

Pos logit contributions
uddled+0.049
icked+0.031
ered+0.030
 died+0.030
 kissed+0.030
Neg logit contributions
igsaw-0.087
 redd-0.080
Mic-0.078
 angles-0.078
 ey-0.077
Token competed
Token ID32440
Feature activation-1.00e+02

Pos logit contributions
uddled+0.129
:+0.090
 -+0.081
;+0.078
ered+0.073
Neg logit contributions
igsaw-0.297
 redd-0.282
 ey-0.280
Mic-0.273
 angles-0.257
Token to
Token ID284
Feature activation-5.74e+01

Pos logit contributions
uddled+0.359
 died+0.281
 kissed+0.280
 wipes+0.277
icked+0.271
Neg logit contributions
igsaw-0.344
 gol-0.314
 student-0.303
aph-0.294
 sill-0.290
Token see
Token ID766
Feature activation-7.10e+01

Pos logit contributions
uddled+0.171
:+0.138
 -+0.130
;+0.126
ered+0.123
Neg logit contributions
igsaw-0.237
 ey-0.222
 redd-0.221
Mic-0.212
 yak-0.198
Token who
Token ID508
Feature activation-5.58e+01

Pos logit contributions
uddled+0.190
 died+0.117
:+0.112
icked+0.106
wr+0.103
Neg logit contributions
igsaw-0.315
 redd-0.313
Mic-0.299
 angles-0.298
 ey-0.294
Token the
Token ID262
Feature activation-3.44e+01

Pos logit contributions
uddled+0.194
der+0.133
ered+0.127
 :+0.126
;+0.126
Neg logit contributions
igator-0.193
 redd-0.192
igsaw-0.187
nam-0.186
 gol-0.186
Token fastest
Token ID14162
Feature activation-5.05e+01

Pos logit contributions
uddled+0.134
:+0.112
 -+0.107
;+0.105
ered+0.102
Neg logit contributions
igsaw-0.113
 redd-0.104
 ey-0.103
Mic-0.098
 angles-0.089
Token happy
Token ID3772
Feature activation-4.34e+01

Pos logit contributions
uddled+0.181
:+0.151
 -+0.145
;+0.142
ered+0.138
Neg logit contributions
igsaw-0.147
 redd-0.138
 ey-0.134
Mic-0.126
 ju-0.116
Token that
Token ID326
Feature activation-5.81e+01

Pos logit contributions
uddled+0.153
icked+0.142
ered+0.141
ided+0.136
OP+0.129
Neg logit contributions
igsaw-0.125
keeping-0.117
 sill-0.113
 ey-0.112
 gol-0.110
Token the
Token ID262
Feature activation-3.79e+01

Pos logit contributions
uddled+0.196
:+0.155
 -+0.146
;+0.143
ered+0.142
Neg logit contributions
igsaw-0.220
 redd-0.208
 ey-0.203
Mic-0.196
 angles-0.182
Token coat
Token ID13209
Feature activation-6.87e+01

Pos logit contributions
uddled+0.182
icked+0.160
:+0.157
owed+0.149
ered+0.147
Neg logit contributions
 redd-0.091
 ey-0.085
igsaw-0.078
 ju-0.074
 distance-0.073
Token had
Token ID550
Feature activation-7.15e+01

Pos logit contributions
okes+0.211
rums+0.208
uddled+0.201
oving+0.197
wr+0.188
Neg logit contributions
 sill-0.218
 redd-0.200
 ju-0.195
 distance-0.195
hel-0.187
Token prevented
Token ID13351
Feature activation-1.00e+02

Pos logit contributions
uddled+0.261
okes+0.242
Because+0.233
:"+0.230
oving+0.224
Neg logit contributions
 redd-0.210
 sill-0.207
 distance-0.205
 ju-0.203
keeping-0.199
Token the
Token ID262
Feature activation-3.99e+01

Pos logit contributions
uddled+0.389
:+0.301
Because+0.291
icked+0.290
okes+0.283
Neg logit contributions
igsaw-0.328
 redd-0.289
Mic-0.286
 ey-0.273
 avail-0.267
Token bad
Token ID2089
Feature activation-4.11e+01

Pos logit contributions
uddled+0.180
:+0.155
icked+0.151
ered+0.145
;+0.141
Neg logit contributions
 redd-0.106
igsaw-0.099
 ey-0.098
 ju-0.089
Mic-0.086
Token weather
Token ID6193
Feature activation-7.13e+01

Pos logit contributions
uddled+0.178
:+0.145
icked+0.142
olded+0.136
Because+0.136
Neg logit contributions
 redd-0.114
igsaw-0.112
 ey-0.108
 avail-0.096
 yak-0.094
Token.
Token ID13
Feature activation-8.40e+01

Pos logit contributions
uddled+0.186
icked+0.128
:+0.127
ered+0.120
Because+0.117
Neg logit contributions
igsaw-0.327
 redd-0.300
 ey-0.285
 sill-0.284
 ju-0.278
Token Now
Token ID2735
Feature activation-8.34e+01

Pos logit contributions
uddled+0.312
:+0.252
ered+0.247
 -+0.245
;+0.238
Neg logit contributions
igsaw-0.291
 redd-0.266
 ey-0.261
Mic-0.246
|-0.229
Token there
Token ID612
Feature activation+1.09e+02

Pos logit contributions
igsaw+0.351
 redd+0.331
 ey+0.317
Mic+0.314
umm+0.287
Neg logit contributions
uddled-0.427
:-0.388
 --0.357
;-0.349
 and-0.338
Token was
Token ID373
Feature activation+8.92e+01

Pos logit contributions
igsaw+0.519
 ey+0.508
 redd+0.505
Mic+0.476
 angles+0.473
Neg logit contributions
uddled-0.238
:-0.124
 kissed-0.123
ered-0.123
der-0.122
Token a
Token ID257
Feature activation+1.09e+02

Pos logit contributions
igsaw+0.471
 redd+0.447
 ey+0.445
 angles+0.437
Mic+0.431
Neg logit contributions
uddled-0.169
:-0.130
 --0.117
 and-0.112
;-0.108
Token three
Token ID1115
Feature activation+1.26e+02

Pos logit contributions
igsaw+0.521
keeping+0.518
 angles+0.502
 ey+0.496
 redd+0.455
Neg logit contributions
 and-0.252
uddled-0.251
,-0.233
 s-0.230
:-0.225
Token-
Token ID12
Feature activation+1.80e+02

Pos logit contributions
igsaw+0.451
 redd+0.421
 ey+0.419
Mic+0.401
 yak+0.376
Neg logit contributions
uddled-0.447
:-0.354
 --0.342
;-0.334
ered-0.327
Tokenyear
Token ID1941
Feature activation+1.83e+02

Pos logit contributions
igsaw+0.623
 ey+0.557
 avail+0.545
 redd+0.512
umm+0.510
Neg logit contributions
uddled-0.688
:-0.619
ered-0.594
icked-0.586
 --0.570
Token-
Token ID12
Feature activation+1.77e+02

Pos logit contributions
igsaw+1.012
 ey+0.968
Mic+0.965
 redd+0.961
 avail+0.942
Neg logit contributions
uddled-0.303
:-0.226
 and-0.174
 --0.174
;-0.161
Tokenold
Token ID727
Feature activation+1.09e+02

Pos logit contributions
igsaw+0.746
 ey+0.697
 redd+0.688
 avail+0.654
Mic+0.636
Neg logit contributions
uddled-0.541
:-0.446
 --0.415
ered-0.409
;-0.393
Token who
Token ID508
Feature activation+8.99e+01

Pos logit contributions
igsaw+0.377
 redd+0.365
Mic+0.355
 ey+0.352
 yak+0.345
Neg logit contributions
uddled-0.380
ered-0.278
:-0.277
icked-0.265
Because-0.264
Token found
Token ID1043
Feature activation+6.55e+01

Pos logit contributions
|+0.279
 redd+0.276
 sill+0.275
igsaw+0.270
Mic+0.267
Neg logit contributions
uddled-0.326
okes-0.259
der-0.252
inches-0.232
wr-0.226
Token a
Token ID257
Feature activation+9.63e+01

Pos logit contributions
igsaw+0.268
 redd+0.256
 ey+0.250
Mic+0.245
 ju+0.226
Neg logit contributions
uddled-0.200
:-0.149
 --0.139
;-0.137
ered-0.134
Token grabbed
Token ID13176
Feature activation+9.34e+01

Pos logit contributions
igator+0.434
chard+0.380
 Avery+0.357
 redd+0.357
nam+0.350
Neg logit contributions
uddled-0.505
okes-0.358
rums-0.357
anas-0.351
anked-0.339
Token each
Token ID1123
Feature activation+1.09e+02

Pos logit contributions
igsaw+0.388
 redd+0.359
 ey+0.352
Mic+0.337
 angles+0.325
Neg logit contributions
uddled-0.302
:-0.220
 --0.205
;-0.204
Because-0.201
Token other
Token ID584
Feature activation+1.43e+02

Pos logit contributions
igsaw+0.266
 redd+0.253
 ey+0.235
 angles+0.214
Mic+0.208
Neg logit contributions
uddled-0.537
:-0.419
icked-0.415
Because-0.410
;-0.403
Tokenâ
Token ID22940
Feature activation+1.38e+02

Pos logit contributions
igsaw+0.617
 angles+0.577
 ey+0.570
 redd+0.569
keeping+0.535
Neg logit contributions
uddled-0.437
Because-0.299
icked-0.299
wr-0.277
ered-0.264
Token
Token ID26391
Feature activation+1.58e+02

Pos logit contributions
igsaw+0.330
|+0.261
Mic+0.260
 redd+0.258
 ey+0.247
Neg logit contributions
uddled-0.684
:-0.554
icked-0.545
 how-0.544
 --0.537
Token
Token ID8151
Feature activation+1.81e+02

Pos logit contributions
igsaw+0.506
|+0.470
keeping+0.440
Mic+0.430
 gol+0.409
Neg logit contributions
uddled-0.540
ered-0.484
der-0.472
icked-0.444
 softened-0.436
Tokens
Token ID82
Feature activation+1.33e+02

Pos logit contributions
igsaw+0.523
|+0.483
 redd+0.449
 gol+0.446
Mic+0.435
Neg logit contributions
uddled-0.615
ered-0.572
 wipes-0.530
 swallowed-0.523
 how-0.523
Token hand
Token ID1021
Feature activation+9.06e+01

Pos logit contributions
igsaw+0.442
 ey+0.408
 sill+0.400
 angles+0.397
keeping+0.382
Neg logit contributions
uddled-0.514
Because-0.377
wr-0.365
 overheard-0.362
okes-0.348
Token and
Token ID290
Feature activation+7.86e+01

Pos logit contributions
 sill+0.274
ny+0.236
 ju+0.234
 redd+0.230
keeping+0.229
Neg logit contributions
uddled-0.273
okes-0.259
Because-0.254
mmm-0.240
ves-0.238
Token set
Token ID900
Feature activation+8.31e+01

Pos logit contributions
 redd+0.301
igsaw+0.292
nam+0.288
 ju+0.285
 gol+0.277
Neg logit contributions
uddled-0.228
:-0.174
;-0.164
 :-0.162
Because-0.160
Token off
Token ID572
Feature activation+8.64e+01

Pos logit contributions
igsaw+0.295
 redd+0.256
 ey+0.245
agu+0.240
 gol+0.239
Neg logit contributions
uddled-0.300
Because-0.235
 overheard-0.217
der-0.212
 hated-0.212
Token was
Token ID373
Feature activation+6.89e+01

Pos logit contributions
igsaw+0.501
 redd+0.478
 ey+0.476
Mic+0.463
 angles+0.440
Neg logit contributions
uddled-0.159
:-0.097
 --0.084
;-0.079
ered-0.072
Token a
Token ID257
Feature activation+9.46e+01

Pos logit contributions
 angles+0.457
igsaw+0.442
 ey+0.439
 avail+0.424
 angle+0.421
Neg logit contributions
 and-0.046
uddled-0.038
:-0.032
 that-0.028
 some-0.022
Token big
Token ID1263
Feature activation+1.16e+02

Pos logit contributions
igsaw+0.447
keeping+0.442
 angles+0.429
 ey+0.427
 redd+0.388
Neg logit contributions
uddled-0.223
 and-0.220
,-0.204
 s-0.201
:-0.201
Token rh
Token ID9529
Feature activation+1.24e+02

Pos logit contributions
igsaw+0.437
 redd+0.410
 ey+0.402
Mic+0.392
 angles+0.367
Neg logit contributions
uddled-0.379
:-0.323
 --0.302
;-0.294
 and-0.285
Tokenin
Token ID259
Feature activation+1.27e+02

Pos logit contributions
igsaw+0.535
 redd+0.498
 ey+0.486
 ju+0.452
 avail+0.436
Neg logit contributions
uddled-0.399
:-0.320
 and-0.300
 --0.295
Because-0.294
Tokenoc
Token ID420
Feature activation+2.00e+02

Pos logit contributions
igsaw+0.276
Mic+0.251
 redd+0.248
 ey+0.237
 yak+0.224
Neg logit contributions
uddled-0.621
ered-0.516
:-0.506
icked-0.500
 --0.493
Tokeneros
Token ID27498
Feature activation+7.49e+01

Pos logit contributions
igsaw+0.747
 redd+0.697
 ey+0.686
Mic+0.660
 angles+0.609
Neg logit contributions
uddled-0.689
:-0.560
 --0.531
;-0.518
ered-0.498
Token.
Token ID13
Feature activation+5.04e+01

Pos logit contributions
igator+0.160
 day+0.151
 oct+0.148
 yak+0.147
 universe+0.145
Neg logit contributions
uddled-0.346
ered-0.279
uing-0.264
ided-0.262
oor-0.262
Token He
Token ID679
Feature activation+7.01e+01

Pos logit contributions
igsaw+0.202
 redd+0.188
 ey+0.186
Mic+0.180
 yak+0.167
Neg logit contributions
uddled-0.162
:-0.126
 --0.120
;-0.116
ered-0.114
Token was
Token ID373
Feature activation+5.23e+01

Pos logit contributions
Mic+0.264
|+0.257
 yak+0.244
 redd+0.240
die+0.237
Neg logit contributions
uddled-0.225
okes-0.164
der-0.159
olded-0.157
ided-0.153
Token very
Token ID845
Feature activation+9.54e+01

Pos logit contributions
igsaw+0.295
 ey+0.266
 avail+0.253
umm+0.252
 angles+0.251
Neg logit contributions
:-0.097
 that-0.094
 and-0.088
,-0.083
 s-0.080
Token healthy
Token ID5448
Feature activation+1.40e+02

Pos logit contributions
 redd+0.442
die+0.415
 angles+0.411
 sill+0.411
nam+0.388
Neg logit contributions
uddled-0.559
wr-0.373
ves-0.373
Because-0.365
;-0.361
Token.
Token ID13
Feature activation+8.26e+01

Pos logit contributions
keeping+0.514
 sill+0.490
 angles+0.487
 night+0.484
igsaw+0.482
Neg logit contributions
uddled-0.455
ves-0.319
ided-0.317
ered-0.305
rolled-0.301
Token It
Token ID632
Feature activation+9.65e+01

Pos logit contributions
igsaw+0.289
 redd+0.260
 ey+0.258
Mic+0.257
die+0.239
Neg logit contributions
uddled-0.324
ered-0.243
 --0.229
:-0.228
 died-0.227
Tokenâ
Token ID22940
Feature activation+1.24e+02

Pos logit contributions
 redd+0.303
igator+0.299
 ju+0.285
die+0.274
 sill+0.256
Neg logit contributions
uddled-0.393
;-0.299
iked-0.294
ged-0.292
rolled-0.289
Token
Token ID26391
Feature activation+1.50e+02

Pos logit contributions
igsaw+0.357
 redd+0.318
 ey+0.315
Mic+0.301
|+0.271
Neg logit contributions
uddled-0.539
:-0.452
 --0.437
;-0.429
 –-0.414
Token
Token ID8151
Feature activation+1.75e+02

Pos logit contributions
igsaw+0.505
|+0.449
 redd+0.412
Mic+0.410
keeping+0.409
Neg logit contributions
uddled-0.528
ered-0.442
der-0.440
pt-0.403
 softened-0.397
Tokens
Token ID82
Feature activation+1.28e+02

Pos logit contributions
|+0.544
igsaw+0.541
 redd+0.476
aph+0.452
igator+0.448
Neg logit contributions
uddled-0.596
ered-0.515
 stamped-0.479
aged-0.476
 how-0.470
Token full
Token ID1336
Feature activation+9.29e+01

Pos logit contributions
 redd+0.374
 ju+0.368
die+0.357
 sill+0.353
igator+0.348
Neg logit contributions
uddled-0.525
ged-0.444
ipped-0.394
ooming-0.391
iked-0.381
Token of
Token ID286
Feature activation+1.15e+02

Pos logit contributions
 sill+0.176
igator+0.171
igsaw+0.167
 redd+0.160
 ey+0.159
Neg logit contributions
uddled-0.471
oving-0.403
aged-0.396
ged-0.394
ered-0.390
Token life
Token ID1204
Feature activation+8.84e+01

Pos logit contributions
 ey+0.313
igsaw+0.308
 redd+0.293
 angles+0.281
 microsc+0.256
Neg logit contributions
uddled-0.529
owed-0.402
icked-0.394
wr-0.393
ided-0.390
Token.
Token ID13
Feature activation+6.03e+01

Pos logit contributions
igsaw+0.399
 redd+0.330
igator+0.327
 ey+0.325
 ju+0.324
Neg logit contributions
uddled-0.254
icked-0.206
ered-0.176
okes-0.175
wr-0.166
Token and
Token ID290
Feature activation+1.17e+02

Pos logit contributions
uddled+0.157
anas+0.123
icked+0.117
olded+0.114
cream+0.113
Neg logit contributions
igsaw-0.150
 sill-0.148
 angles-0.146
 distance-0.141
aph-0.139
Token watched
Token ID7342
Feature activation+8.76e+01

Pos logit contributions
 redd+0.433
igsaw+0.413
 ey+0.411
nam+0.408
die+0.386
Neg logit contributions
uddled-0.406
:-0.278
;-0.272
 --0.269
Because-0.267
Token as
Token ID355
Feature activation+9.82e+01

Pos logit contributions
igsaw+0.266
 Avery+0.233
 angles+0.221
 sill+0.220
 ey+0.219
Neg logit contributions
uddled-0.363
Because-0.282
 calmed-0.267
icked-0.262
olded-0.262
Token the
Token ID262
Feature activation+1.19e+02

Pos logit contributions
igsaw+0.343
 redd+0.319
 ey+0.307
Mic+0.290
 angles+0.280
Neg logit contributions
uddled-0.370
:-0.288
 --0.275
;-0.272
icked-0.264
Token wood
Token ID4898
Feature activation+1.12e+02

Pos logit contributions
igsaw+0.336
 redd+0.306
 ey+0.304
Mic+0.278
 gol+0.270
Neg logit contributions
uddled-0.521
icked-0.432
:-0.427
ered-0.418
;-0.410
Tokenpe
Token ID431
Feature activation+1.86e+02

Pos logit contributions
igator+0.356
 sill+0.352
 distance+0.341
 angles+0.333
 redd+0.328
Neg logit contributions
uddled-0.370
okes-0.308
icked-0.282
anas-0.282
Because-0.275
Tokencker
Token ID15280
Feature activation+9.89e+01

Pos logit contributions
 angles+0.426
 sill+0.424
 distance+0.422
 redd+0.393
 blu+0.370
Neg logit contributions
uddled-0.793
:"-0.637
icked-0.591
Because-0.590
:-0.576
Token worked
Token ID3111
Feature activation+7.34e+01

Pos logit contributions
 distance+0.285
 sill+0.282
 angles+0.274
 night+0.254
 blu+0.248
Neg logit contributions
uddled-0.385
:"-0.277
Because-0.276
okes-0.268
resp-0.264
Token away
Token ID1497
Feature activation+9.68e+01

Pos logit contributions
igsaw+0.293
 redd+0.261
 angles+0.252
 ju+0.243
die+0.233
Neg logit contributions
uddled-0.259
okes-0.183
icked-0.178
Because-0.172
 kissed-0.166
Token.
Token ID13
Feature activation+6.34e+01

Pos logit contributions
igsaw+0.372
 angles+0.337
 sill+0.334
 redd+0.332
 microsc+0.313
Neg logit contributions
uddled-0.362
okes-0.251
Because-0.244
icked-0.236
olded-0.235
Token He
Token ID679
Feature activation+8.14e+01

Pos logit contributions
igsaw+0.205
Mic+0.186
 redd+0.182
 ey+0.174
die+0.171
Neg logit contributions
uddled-0.261
ered-0.199
ves-0.188
icked-0.185
 --0.185
Token wide
Token ID3094
Feature activation+1.31e+02

Pos logit contributions
igsaw+0.245
 ey+0.218
 redd+0.217
Mic+0.210
 angles+0.191
Neg logit contributions
uddled-0.346
:-0.296
 --0.286
;-0.280
 and-0.271
Token slide
Token ID10649
Feature activation+7.83e+01

Pos logit contributions
igsaw+0.320
 ey+0.286
 redd+0.271
 angles+0.233
 yak+0.231
Neg logit contributions
uddled-0.664
icked-0.551
olded-0.533
ered-0.527
Because-0.527
Token.
Token ID13
Feature activation+6.62e+01

Pos logit contributions
 redd+0.310
igsaw+0.301
 sill+0.300
 distance+0.289
igator+0.285
Neg logit contributions
uddled-0.225
icked-0.174
rums-0.166
anas-0.163
ered-0.155
Token<|endoftext|>
Token ID50256
Feature activation+6.96e+01

Pos logit contributions
igsaw+0.238
 redd+0.218
 ey+0.213
Mic+0.211
|+0.191
Neg logit contributions
uddled-0.248
ered-0.188
:-0.188
 --0.182
 died-0.179
TokenOnce
Token ID7454
Feature activation+7.82e+01

Pos logit contributions
igsaw+0.358
 redd+0.351
 ey+0.336
Mic+0.328
nam+0.319
Neg logit contributions
uddled-0.135
:-0.116
 --0.099
 and-0.091
;-0.090
Token upon
Token ID2402
Feature activation+1.19e+02

Pos logit contributions
 redd+0.303
igsaw+0.303
Mic+0.299
umm+0.292
 ey+0.292
Neg logit contributions
:-0.256
uddled-0.243
,-0.226
 --0.213
 and-0.213
Token a
Token ID257
Feature activation+9.07e+01

Pos logit contributions
igsaw+0.793
umm+0.749
 ey+0.740
 angles+0.736
 redd+0.711
Neg logit contributions
 and-0.083
.-0.069
uddled-0.063
 that-0.053
 how-0.047
Token time
Token ID640
Feature activation+7.69e+01

Pos logit contributions
igsaw+0.350
 ey+0.334
 redd+0.333
keeping+0.323
nam+0.321
Neg logit contributions
uddled-0.284
:-0.262
 and-0.250
 --0.237
;-0.233
Token,
Token ID11
Feature activation+6.44e+01

Pos logit contributions
igsaw+0.492
 redd+0.487
 ey+0.475
Mic+0.465
 angles+0.451
Neg logit contributions
uddled-0.055
:-0.030
 --0.013
;-0.009
,-0.000
Token there
Token ID612
Feature activation+7.75e+01

Pos logit contributions
igsaw+0.363
 ey+0.350
umm+0.332
 redd+0.330
 avail+0.323
Neg logit contributions
uddled-0.109
:-0.097
 and-0.084
 at-0.078
,-0.076
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
igsaw+0.441
 redd+0.420
 ey+0.415
Mic+0.408
 yak+0.386
Neg logit contributions
uddled-0.119
:-0.073
 --0.061
;-0.055
ered-0.047
Token.
Token ID13
Feature activation+7.90e+01

Pos logit contributions
 sill+0.467
 angles+0.457
keeping+0.428
ro+0.419
 ey+0.414
Neg logit contributions
uddled-0.452
Because-0.318
 blamed-0.291
icked-0.290
ves-0.276
Token Finally
Token ID9461
Feature activation+9.17e+01

Pos logit contributions
igsaw+0.248
Mic+0.215
 redd+0.203
 ey+0.199
 gol+0.196
Neg logit contributions
uddled-0.344
ered-0.274
ves-0.255
 died-0.249
icked-0.248
Token,
Token ID11
Feature activation+7.76e+01

Pos logit contributions
igsaw+0.298
 Po+0.264
 Avery+0.255
 Penny+0.252
 recess+0.247
Neg logit contributions
uddled-0.407
anas-0.296
ikes-0.272
oving-0.271
owed-0.269
Token she
Token ID673
Feature activation+9.45e+01

Pos logit contributions
igsaw+0.264
 redd+0.242
Mic+0.225
 gol+0.224
 ju+0.224
Neg logit contributions
uddled-0.310
:-0.226
icked-0.225
ered-0.213
okes-0.212
Token felt
Token ID2936
Feature activation+6.55e+01

Pos logit contributions
 gol+0.303
acy+0.302
 Avery+0.300
 ey+0.294
agu+0.293
Neg logit contributions
uddled-0.327
ikes-0.236
oving-0.230
rums-0.228
;-0.222
Token it
Token ID340
Feature activation+8.87e+01

Pos logit contributions
 redd+0.168
 ju+0.161
igsaw+0.154
 sill+0.151
 distance+0.144
Neg logit contributions
uddled-0.319
anas-0.268
ikes-0.250
opes-0.248
ves-0.237
Token w
Token ID266
Feature activation+1.03e+02

Pos logit contributions
 sill+0.266
igsaw+0.250
 redd+0.246
 distance+0.239
 sunny+0.236
Neg logit contributions
uddled-0.327
anas-0.313
okes-0.304
rums-0.276
Because-0.267
Tokeniggle
Token ID24082
Feature activation+8.98e+01

Pos logit contributions
igsaw+0.311
acy+0.246
umm+0.240
 ey+0.240
agu+0.234
Neg logit contributions
 knocked-0.391
uddled-0.380
 which-0.379
 how-0.372
 rang-0.369
Token out
Token ID503
Feature activation+9.44e+01

Pos logit contributions
igsaw+0.326
 sill+0.320
 redd+0.285
ji+0.285
 page+0.283
Neg logit contributions
uddled-0.283
okes-0.236
Because-0.219
anas-0.207
cream-0.205
Token of
Token ID286
Feature activation+1.09e+02

Pos logit contributions
 sill+0.324
igsaw+0.307
 ey+0.286
 distance+0.275
 angles+0.275
Neg logit contributions
uddled-0.393
anas-0.298
Because-0.284
okes-0.273
ves-0.272
Token her
Token ID607
Feature activation+9.89e+01

Pos logit contributions
igsaw+0.402
 redd+0.373
 ey+0.370
Mic+0.356
 angles+0.328
Neg logit contributions
uddled-0.377
:-0.307
 --0.291
;-0.284
ered-0.275
Token gently
Token ID15165
Feature activation+7.00e+01

Pos logit contributions
igsaw+0.395
 ey+0.363
 redd+0.361
 sill+0.345
 angles+0.340
Neg logit contributions
uddled-0.236
icked-0.145
wr-0.138
 died-0.130
Because-0.129
Token and
Token ID290
Feature activation+5.26e+01

Pos logit contributions
igsaw+0.296
 ey+0.276
 redd+0.272
 sill+0.267
 angles+0.259
Neg logit contributions
uddled-0.236
Because-0.149
icked-0.146
olded-0.138
ves-0.131
Token put
Token ID1234
Feature activation+3.83e+01

Pos logit contributions
 redd+0.209
 ju+0.196
igsaw+0.188
die+0.187
nam+0.183
Neg logit contributions
uddled-0.155
;-0.120
 --0.113
:-0.112
 –-0.112
Token her
Token ID607
Feature activation+7.67e+01

Pos logit contributions
igsaw+0.146
 redd+0.137
 ey+0.134
Mic+0.128
 angles+0.120
Neg logit contributions
uddled-0.132
:-0.102
 --0.097
;-0.096
Because-0.091
Token on
Token ID319
Feature activation+6.51e+01

Pos logit contributions
igsaw+0.266
 redd+0.251
 ey+0.247
nam+0.242
 ju+0.222
Neg logit contributions
uddled-0.296
Because-0.223
:-0.221
der-0.211
anas-0.210
Token his
Token ID465
Feature activation+8.03e+01

Pos logit contributions
igsaw+0.248
 redd+0.230
 ey+0.227
Mic+0.219
 angles+0.203
Neg logit contributions
uddled-0.221
:-0.176
 --0.167
;-0.163
ered-0.159
Token palm
Token ID18057
Feature activation+7.16e+01

Pos logit contributions
igsaw+0.216
 ey+0.201
 redd+0.179
 sill+0.166
Mic+0.165
Neg logit contributions
uddled-0.385
Because-0.323
icked-0.314
ered-0.294
 calmed-0.291
Token.
Token ID13
Feature activation+3.09e+01

Pos logit contributions
igsaw+0.282
 ey+0.264
 redd+0.254
Mic+0.236
 sill+0.236
Neg logit contributions
uddled-0.237
ered-0.174
:-0.168
icked-0.167
 calmed-0.163
Token He
Token ID679
Feature activation+4.96e+01

Pos logit contributions
igsaw+0.101
 redd+0.088
 ey+0.085
Mic+0.083
 sill+0.077
Neg logit contributions
uddled-0.128
ered-0.107
 --0.098
owed-0.097
ves-0.096
Token said
Token ID531
Feature activation+3.46e+01

Pos logit contributions
 redd+0.152
 ju+0.145
 begg+0.145
die+0.142
Mic+0.137
Neg logit contributions
uddled-0.149
;-0.144
wr-0.136
oor-0.135
ikes-0.133
Token,
Token ID11
Feature activation+3.65e+01

Pos logit contributions
igsaw+0.128
 redd+0.126
 Po+0.118
 Avery+0.116
 ju+0.116
Neg logit contributions
uddled-0.111
oving-0.078
anas-0.076
opes-0.076
Because-0.072
Token her
Token ID607
Feature activation+1.23e+02

Pos logit contributions
igsaw+0.367
 redd+0.338
 ey+0.336
Mic+0.322
 angles+0.298
Neg logit contributions
uddled-0.402
:-0.310
 --0.298
;-0.295
icked-0.287
Token mum
Token ID25682
Feature activation+1.04e+02

Pos logit contributions
igsaw+0.428
 ey+0.392
 redd+0.383
Mic+0.366
die+0.351
Neg logit contributions
uddled-0.464
ered-0.354
:-0.352
icked-0.347
Because-0.344
Token and
Token ID290
Feature activation+8.53e+01

Pos logit contributions
igsaw+0.535
 redd+0.505
 ey+0.504
Mic+0.489
die+0.476
Neg logit contributions
uddled-0.216
Because-0.105
ered-0.103
 died-0.103
icked-0.102
Token asked
Token ID1965
Feature activation+6.97e+01

Pos logit contributions
nam+0.327
chard+0.321
 redd+0.315
 Avery+0.314
 ey+0.306
Neg logit contributions
uddled-0.265
der-0.196
ves-0.187
oving-0.181
;-0.179
Token:
Token ID25
Feature activation+7.42e+01

Pos logit contributions
igsaw+0.242
 redd+0.235
nam+0.223
 Prince+0.220
 Avery+0.219
Neg logit contributions
uddled-0.249
Because-0.179
 died-0.175
 tipped-0.168
 slid-0.167
Token "
Token ID366
Feature activation+7.98e+01

Pos logit contributions
igsaw+0.262
 redd+0.238
 ey+0.234
chard+0.228
Mic+0.226
Neg logit contributions
uddled-0.270
ered-0.211
ves-0.195
 died-0.195
 stamped-0.186
TokenM
Token ID44
Feature activation+1.10e+02

Pos logit contributions
igsaw+0.266
 redd+0.254
Mic+0.249
 ey+0.246
 ju+0.221
Neg logit contributions
uddled-0.302
:-0.249
 --0.234
ered-0.231
;-0.231
Tokenum
Token ID388
Feature activation+1.03e+02

Pos logit contributions
igsaw+0.329
 ey+0.298
acy+0.271
 redd+0.267
umm+0.260
Neg logit contributions
uddled-0.455
 kissed-0.371
 died-0.367
 how-0.365
 stamped-0.365
Token,
Token ID11
Feature activation+6.76e+01

Pos logit contributions
igsaw+0.397
die+0.357
 ey+0.355
ji+0.349
 yak+0.345
Neg logit contributions
uddled-0.361
oving-0.243
ves-0.240
ided-0.231
der-0.230
Token what
Token ID644
Feature activation+7.61e+01

Pos logit contributions
igsaw+0.243
 redd+0.229
 ey+0.226
 ju+0.215
igator+0.200
Neg logit contributions
uddled-0.274
oving-0.201
ves-0.188
ided-0.187
ered-0.184
Token's
Token ID338
Feature activation+9.24e+01

Pos logit contributions
igsaw+0.331
 redd+0.315
 ey+0.313
Mic+0.294
 yak+0.282
Neg logit contributions
uddled-0.219
:-0.150
 --0.144
;-0.142
ered-0.141
Token the
Token ID262
Feature activation+1.09e+02

Pos logit contributions
 redd+0.302
igsaw+0.289
 microsc+0.288
 page+0.284
 universe+0.278
Neg logit contributions
uddled-0.368
der-0.252
olded-0.247
ided-0.244
oor-0.244
Token book
Token ID1492
Feature activation+9.22e+01

Pos logit contributions
igsaw+0.340
 redd+0.313
 ey+0.309
Mic+0.294
 yak+0.267
Neg logit contributions
uddled-0.443
:-0.370
 --0.355
;-0.348
ered-0.339
Token said
Token ID531
Feature activation+7.55e+01

Pos logit contributions
igator+0.286
 page+0.255
 sill+0.251
igsaw+0.247
 universe+0.246
Neg logit contributions
uddled-0.353
oving-0.291
ged-0.287
pped-0.282
okes-0.275
Token,
Token ID11
Feature activation+7.33e+01

Pos logit contributions
 sill+0.244
 page+0.215
 angles+0.198
 drawer+0.197
 sunny+0.196
Neg logit contributions
uddled-0.278
oving-0.230
okes-0.229
ounced-0.221
Sorry-0.215
Token so
Token ID523
Feature activation+9.49e+01

Pos logit contributions
 redd+0.223
 ju+0.215
 ey+0.194
 sill+0.188
 yak+0.181
Neg logit contributions
uddled-0.318
 died-0.251
icked-0.239
ered-0.230
ipped-0.228
Token she
Token ID673
Feature activation+8.61e+01

Pos logit contributions
igsaw+0.297
 redd+0.286
Mic+0.263
 ey+0.262
nam+0.249
Neg logit contributions
uddled-0.371
:-0.280
 died-0.266
ered-0.265
 --0.264
Token went
Token ID1816
Feature activation+5.56e+01

Pos logit contributions
die+0.268
 redd+0.258
Mic+0.251
acy+0.243
 yak+0.242
Neg logit contributions
uddled-0.311
;-0.250
oor-0.239
oving-0.235
ooming-0.232
Token to
Token ID284
Feature activation+7.44e+01

Pos logit contributions
igsaw+0.275
 redd+0.261
 ey+0.257
Mic+0.255
umm+0.236
Neg logit contributions
uddled-0.124
:-0.098
 --0.088
;-0.082
 and-0.078
Token find
Token ID1064
Feature activation+5.91e+01

Pos logit contributions
igsaw+0.303
 redd+0.287
 ey+0.280
Mic+0.274
 yak+0.253
Neg logit contributions
uddled-0.226
:-0.187
 --0.176
;-0.170
ered-0.167
Token her
Token ID607
Feature activation+9.41e+01

Pos logit contributions
igsaw+0.208
 redd+0.205
 ey+0.192
 angles+0.179
Mic+0.178
Neg logit contributions
uddled-0.223
Because-0.158
:-0.156
anas-0.151
;-0.149
Token mom
Token ID1995
Feature activation+7.69e+01

Pos logit contributions
igsaw+0.282
 redd+0.259
 ey+0.252
 angles+0.234
nam+0.232
Neg logit contributions
uddled-0.415
icked-0.305
ered-0.301
 calmed-0.301
Because-0.300
Token
Token ID198
Feature activation-9.42e+00

Pos logit contributions
uddled+0.127
ered+0.114
oving+0.098
owed+0.097
 died+0.096
Neg logit contributions
 redd-0.078
 Rib-0.076
 sill-0.075
Mic-0.074
igsaw-0.073
Token
Token ID198
Feature activation-1.01e+01

Pos logit contributions
uddled+0.029
:+0.022
 -+0.021
;+0.021
ered+0.021
Neg logit contributions
igsaw-0.037
 ey-0.034
 redd-0.034
Mic-0.034
die-0.032
Token"
Token ID1
Feature activation-1.79e+01

Pos logit contributions
uddled+0.025
:+0.019
 -+0.017
;+0.016
ered+0.015
Neg logit contributions
igsaw-0.049
 ey-0.046
 redd-0.045
Mic-0.044
 angles-0.041
TokenB
Token ID33
Feature activation+2.65e+01

Pos logit contributions
uddled+0.073
:+0.064
 -+0.062
;+0.061
ered+0.059
Neg logit contributions
 redd-0.051
igsaw-0.051
Mic-0.050
 ey-0.047
 ju-0.044
Tokenun
Token ID403
Feature activation+1.85e+01

Pos logit contributions
igsaw+0.078
aph+0.063
|+0.062
agu+0.059
 ey+0.057
Neg logit contributions
uddled-0.111
 explained-0.099
 softened-0.098
 died-0.097
:-0.097
Token-
Token ID12
Feature activation+6.60e+01

Pos logit contributions
igator+0.074
igsaw+0.070
|+0.066
 yak+0.064
ny+0.063
Neg logit contributions
uddled-0.060
der-0.038
oving-0.037
anas-0.036
icked-0.036
TokenCloud
Token ID18839
Feature activation-2.25e+01

Pos logit contributions
igsaw+0.225
 redd+0.209
 ey+0.202
Mic+0.199
 yak+0.181
Neg logit contributions
uddled-0.247
:-0.201
 --0.193
;-0.189
ered-0.185
Token.
Token ID13
Feature activation-3.59e+01

Pos logit contributions
uddled+0.098
ered+0.085
:"+0.079
oving+0.078
Just+0.077
Neg logit contributions
igator-0.053
igsaw-0.053
cy-0.051
able-0.048
|-0.047
Token I
Token ID314
Feature activation-1.95e+01

Pos logit contributions
uddled+0.159
ered+0.127
ves+0.123
 died+0.119
:+0.119
Neg logit contributions
igsaw-0.096
 redd-0.094
 ey-0.093
 sill-0.093
aph-0.084
Token like
Token ID588
Feature activation-2.76e+01

Pos logit contributions
uddled+0.090
der+0.071
;+0.064
:+0.064
 :+0.063
Neg logit contributions
|-0.055
 redd-0.048
cy-0.046
die-0.046
igsaw-0.045
Token it
Token ID340
Feature activation-1.31e+01

Pos logit contributions
uddled+0.132
owed+0.101
ered+0.100
olded+0.098
icked+0.097
Neg logit contributions
 universe-0.072
 redd-0.069
igsaw-0.068
 ey-0.067
 microsc-0.067
Token.
Token ID13
Feature activation-3.61e+01

Pos logit contributions
uddled+0.027
 kissed+0.018
olded+0.016
 calmed+0.016
Because+0.015
Neg logit contributions
igsaw-0.037
 redd-0.036
 sill-0.035
 angles-0.033
aph-0.033
Token
Token ID220
Feature activation+1.87e+01

Pos logit contributions
uddled+0.140
:+0.111
 -+0.109
ered+0.106
;+0.106
Neg logit contributions
igsaw-0.123
 redd-0.113
 ey-0.110
Mic-0.106
 ju-0.097
Token
Token ID198
Feature activation-1.39e+01

Pos logit contributions
igsaw+0.064
 redd+0.056
Mic+0.056
 ey+0.055
aph+0.053
Neg logit contributions
uddled-0.067
 stamped-0.051
 died-0.051
ered-0.051
 kissed-0.051
Token
Token ID198
Feature activation-1.48e+01

Pos logit contributions
uddled+0.035
:+0.027
 -+0.025
;+0.024
ered+0.022
Neg logit contributions
igsaw-0.065
 redd-0.061
 ey-0.061
Mic-0.058
 yak-0.055
TokenThe
Token ID464
Feature activation+4.66e+00

Pos logit contributions
uddled+0.048
:+0.037
 -+0.036
;+0.035
icked+0.035
Neg logit contributions
igsaw-0.059
Mic-0.055
 redd-0.054
 ey-0.053
die-0.049
Token girl
Token ID2576
Feature activation-1.00e+01

Pos logit contributions
igsaw+0.012
 ey+0.011
 redd+0.010
Mic+0.010
 yak+0.009
Neg logit contributions
uddled-0.021
:-0.019
 --0.018
;-0.017
 and-0.017
Token laughed
Token ID13818
Feature activation-3.35e+01

Pos logit contributions
uddled+0.033
ikes+0.027
icked+0.027
wr+0.025
oor+0.025
Neg logit contributions
 redd-0.029
die-0.029
 tad-0.028
 ju-0.028
 gol-0.028
Token and
Token ID290
Feature activation-2.60e+01

Pos logit contributions
uddled+0.120
OP+0.109
ves+0.109
ered+0.105
okes+0.104
Neg logit contributions
 sill-0.092
 Prince-0.091
 downward-0.090
 Po-0.089
die-0.087
Token suggested
Token ID5220
Feature activation-3.61e+01

Pos logit contributions
uddled+0.080
ades+0.056
ves+0.055
:+0.054
okes+0.053
Neg logit contributions
 redd-0.098
 gol-0.095
nam-0.093
die-0.092
 ju-0.091
Token that
Token ID326
Feature activation-2.00e+01

Pos logit contributions
uddled+0.120
okes+0.094
anas+0.092
icked+0.091
 checked+0.088
Neg logit contributions
igsaw-0.118
nam-0.106
 redd-0.104
 gol-0.104
 ju-0.101
Token they
Token ID484
Feature activation-1.78e+01

Pos logit contributions
uddled+0.067
:+0.051
 -+0.048
;+0.047
ered+0.047
Neg logit contributions
igsaw-0.077
 redd-0.074
 ey-0.072
Mic-0.069
 angles-0.066
TokenYes
Token ID5297
Feature activation+3.79e+01

Pos logit contributions
 redd+0.138
Mic+0.137
igsaw+0.135
 ju+0.121
|+0.114
Neg logit contributions
uddled-0.222
:-0.195
ered-0.179
icked-0.178
 --0.173
Token,
Token ID11
Feature activation+4.20e+01

Pos logit contributions
igsaw+0.142
 Avery+0.131
acy+0.127
 Alexa+0.124
 Amber+0.124
Neg logit contributions
uddled-0.135
okes-0.097
aged-0.097
 blamed-0.096
oving-0.095
Token please
Token ID3387
Feature activation+6.04e+01

Pos logit contributions
igsaw+0.168
 redd+0.159
 ey+0.155
Mic+0.152
 yak+0.142
Neg logit contributions
uddled-0.137
:-0.101
 --0.096
;-0.093
ered-0.093
Token!"
Token ID2474
Feature activation+2.72e+01

Pos logit contributions
 welcome+0.182
igsaw+0.178
 yak+0.162
 sill+0.162
 page+0.161
Neg logit contributions
uddled-0.260
okes-0.190
ged-0.184
 sank-0.182
 blamed-0.182
Token
Token ID220
Feature activation+8.37e+01

Pos logit contributions
igsaw+0.069
 Avery+0.065
 Alexa+0.065
 Princess+0.064
 Prince+0.064
Neg logit contributions
uddled-0.125
oving-0.102
okes-0.101
ered-0.100
ves-0.099
Token
Token ID198
Feature activation+4.87e+01

Pos logit contributions
igsaw+0.268
Mic+0.231
 redd+0.224
aph+0.219
acy+0.219
Neg logit contributions
uddled-0.323
 stamped-0.251
 listened-0.247
 calmed-0.242
 watched-0.241
Token
Token ID198
Feature activation+4.71e+01

Pos logit contributions
Mic+0.162
igsaw+0.158
Prin+0.147
|+0.140
Beck+0.139
Neg logit contributions
uddled-0.186
icked-0.144
ered-0.141
okes-0.140
 tipped-0.139
TokenThe
Token ID464
Feature activation+6.51e+01

Pos logit contributions
Mic+0.101
Prin+0.090
igsaw+0.087
Beck+0.086
Rachel+0.077
Neg logit contributions
uddled-0.246
icked-0.215
okes-0.213
:-0.205
 died-0.204
Token fairy
Token ID25607
Feature activation+3.74e+01

Pos logit contributions
igsaw+0.188
 redd+0.169
 ey+0.167
Mic+0.154
|+0.148
Neg logit contributions
uddled-0.268
:-0.247
 --0.233
 died-0.225
 kissed-0.223
Token smiled
Token ID13541
Feature activation+2.92e+01

Pos logit contributions
die+0.113
igator+0.105
Prin+0.103
 Alexa+0.103
 page+0.102
Neg logit contributions
uddled-0.124
ikes-0.108
okes-0.108
wr-0.102
 NOT-0.101
Token and
Token ID290
Feature activation+3.45e+01

Pos logit contributions
 sill+0.123
 redd+0.123
die+0.118
igsaw+0.116
 ey+0.113
Neg logit contributions
uddled-0.077
ves-0.067
ered-0.057
okes-0.056
OP-0.054
Token borrow
Token ID8804
Feature activation+4.90e+01

Pos logit contributions
igsaw+0.136
agu+0.119
acy+0.119
 redd+0.119
Mic+0.118
Neg logit contributions
uddled-0.284
der-0.221
icked-0.208
ounced-0.207
ged-0.206
Token one
Token ID530
Feature activation+6.70e+01

Pos logit contributions
igsaw+0.148
 redd+0.127
Mic+0.126
 ey+0.117
 microsc+0.116
Neg logit contributions
uddled-0.224
owed-0.169
ered-0.162
icked-0.160
 calmed-0.159
Token.
Token ID13
Feature activation+3.62e+01

Pos logit contributions
igsaw+0.266
 redd+0.251
 ey+0.236
Mic+0.229
igator+0.227
Neg logit contributions
uddled-0.223
ered-0.161
 softened-0.154
 calmed-0.153
icked-0.153
Token Here
Token ID3423
Feature activation+5.13e+01

Pos logit contributions
igsaw+0.158
 ey+0.149
 redd+0.149
Mic+0.141
 angles+0.138
Neg logit contributions
uddled-0.096
:-0.080
 --0.073
;-0.071
 how-0.069
Token,
Token ID11
Feature activation+4.33e+01

Pos logit contributions
 Po+0.119
 Penny+0.119
ji+0.114
 Elsa+0.109
 universe+0.109
Neg logit contributions
uddled-0.247
der-0.194
oving-0.192
ided-0.186
overed-0.182
Token take
Token ID1011
Feature activation+3.89e+01

Pos logit contributions
igsaw+0.162
 redd+0.155
 ey+0.148
Mic+0.147
 ju+0.141
Neg logit contributions
uddled-0.152
:-0.112
;-0.106
 --0.106
icked-0.103
Token this
Token ID428
Feature activation+6.50e+01

Pos logit contributions
igsaw+0.157
 redd+0.141
 ey+0.139
Mic+0.138
 yak+0.128
Neg logit contributions
uddled-0.129
:-0.096
 --0.092
;-0.091
Because-0.087
Token one
Token ID530
Feature activation+5.89e+01

Pos logit contributions
igsaw+0.165
 redd+0.153
 gol+0.131
 ju+0.129
 ey+0.127
Neg logit contributions
uddled-0.304
icked-0.267
anas-0.249
Because-0.243
:-0.238
Token."
Token ID526
Feature activation+2.67e+01

Pos logit contributions
igsaw+0.241
 redd+0.225
 ey+0.210
Mic+0.207
igator+0.202
Neg logit contributions
uddled-0.196
icked-0.136
ered-0.134
 softened-0.133
 calmed-0.130
Token\n
Token ID198
Feature activation+4.96e+01

Pos logit contributions
igsaw+0.094
Mic+0.084
 redd+0.083
 ey+0.083
die+0.075
Neg logit contributions
uddled-0.100
ered-0.079
:-0.078
 --0.076
;-0.074
Token\n
Token ID198
Feature activation+4.81e+01

Pos logit contributions
igsaw+0.203
Mic+0.189
 ey+0.185
 redd+0.183
die+0.172
Neg logit contributions
uddled-0.153
:-0.113
ered-0.111
 --0.111
;-0.108
Token knew
Token ID2993
Feature activation-1.19e+01

Pos logit contributions
Mic+0.029
 redd+0.028
nam+0.026
Prin+0.025
ji+0.025
Neg logit contributions
uddled-0.032
ves-0.025
okes-0.025
ided-0.024
ooming-0.024
Token they
Token ID484
Feature activation+1.83e+01

Pos logit contributions
uddled+0.034
ered+0.022
:+0.021
 -+0.021
wr+0.021
Neg logit contributions
igsaw-0.049
 redd-0.047
 ey-0.045
nam-0.045
 microsc-0.043
Token had
Token ID550
Feature activation+3.02e+00

Pos logit contributions
|+0.046
 redd+0.046
cy+0.043
igator+0.042
Mic+0.040
Neg logit contributions
uddled-0.068
wr-0.062
okes-0.061
ues-0.058
ided-0.058
Token been
Token ID587
Feature activation+1.00e+01

Pos logit contributions
nam+0.008
 redd+0.008
 moderation+0.008
 ju+0.007
die+0.007
Neg logit contributions
uddled-0.012
ided-0.010
ered-0.010
owed-0.010
ves-0.010
Token naughty
Token ID45128
Feature activation+3.34e+01

Pos logit contributions
 redd+0.023
nam+0.021
igsaw+0.021
Mic+0.019
 yak+0.019
Neg logit contributions
uddled-0.048
Because-0.037
der-0.037
;-0.037
owed-0.037
Token.
Token ID13
Feature activation-8.67e+00

Pos logit contributions
 sill+0.131
 angles+0.130
igsaw+0.129
keeping+0.126
 student+0.124
Neg logit contributions
uddled-0.092
ered-0.071
ves-0.071
apy-0.064
wr-0.064
Token They
Token ID1119
Feature activation+3.38e+00

Pos logit contributions
uddled+0.025
:+0.019
 -+0.018
;+0.017
ered+0.016
Neg logit contributions
igsaw-0.038
 redd-0.036
 ey-0.035
Mic-0.034
 yak-0.032
Token said
Token ID531
Feature activation-5.95e+00

Pos logit contributions
Mic+0.011
 redd+0.011
 ju+0.010
Prin+0.010
 gol+0.010
Neg logit contributions
uddled-0.012
okes-0.009
wr-0.009
ooming-0.009
ves-0.009
Token sorry
Token ID7926
Feature activation+1.68e+01

Pos logit contributions
uddled+0.019
anas+0.014
Because+0.014
ered+0.013
oving+0.013
Neg logit contributions
igsaw-0.021
 redd-0.019
 Po-0.019
 microsc-0.018
 gol-0.018
Token to
Token ID284
Feature activation+5.95e+00

Pos logit contributions
 night+0.035
 sill+0.034
die+0.034
 girl+0.033
 day+0.032
Neg logit contributions
uddled-0.070
ered-0.066
pped-0.064
rolled-0.062
oving-0.061
Token Mom
Token ID11254
Feature activation+1.03e+01

Pos logit contributions
igsaw+0.024
 redd+0.023
 ey+0.022
Mic+0.022
|+0.020
Neg logit contributions
uddled-0.019
:-0.014
 --0.013
;-0.013
ered-0.013
Token Anna
Token ID11735
Feature activation-5.97e+00

Pos logit contributions
uddled+0.127
:+0.111
 -+0.106
;+0.104
ered+0.101
Neg logit contributions
igsaw-0.079
 redd-0.071
 ey-0.070
Mic-0.067
 angles-0.059
Token.
Token ID13
Feature activation-3.86e+01

Pos logit contributions
uddled+0.013
ered+0.007
Because+0.006
icked+0.006
ves+0.006
Neg logit contributions
igsaw-0.029
 redd-0.028
die-0.028
Mic-0.028
 ey-0.028
Token I
Token ID314
Feature activation-2.34e+01

Pos logit contributions
uddled+0.109
:+0.100
 -+0.091
;+0.090
icked+0.086
Neg logit contributions
igsaw-0.156
 redd-0.148
Mic-0.144
 ey-0.141
 angles-0.136
Token'm
Token ID1101
Feature activation-2.22e+01

Pos logit contributions
uddled+0.077
:+0.057
;+0.054
 -+0.053
ered+0.053
Neg logit contributions
igsaw-0.087
 redd-0.085
 ey-0.083
Mic-0.081
|-0.081
Token sorry
Token ID7926
Feature activation-1.27e+01

Pos logit contributions
uddled+0.044
:+0.038
 that+0.032
 -+0.032
 how+0.030
Neg logit contributions
igsaw-0.118
 ey-0.110
 redd-0.107
Mic-0.105
 angles-0.102
Token I
Token ID314
Feature activation-2.50e+01

Pos logit contributions
uddled+0.043
ered+0.031
 died+0.029
rolled+0.028
anas+0.028
Neg logit contributions
igsaw-0.046
 redd-0.044
 ey-0.044
die-0.042
 Po-0.041
Token yelled
Token ID22760
Feature activation-4.62e+01

Pos logit contributions
uddled+0.078
;+0.066
der+0.064
:"+0.064
 :+0.062
Neg logit contributions
|-0.093
 gol-0.076
ott-0.074
Mic-0.073
 Fro-0.072
Token at
Token ID379
Feature activation-1.99e+01

Pos logit contributions
uddled+0.174
anas+0.136
Because+0.130
rolled+0.127
ves+0.124
Neg logit contributions
igsaw-0.146
 sill-0.144
 Po-0.142
 Alexa-0.137
 Prince-0.136
Token you
Token ID345
Feature activation-1.37e+01

Pos logit contributions
uddled+0.085
:+0.066
 -+0.064
;+0.064
ered+0.063
Neg logit contributions
igsaw-0.060
 ey-0.054
Mic-0.054
 redd-0.053
 yak-0.048
Token."
Token ID526
Feature activation-4.44e+01

Pos logit contributions
uddled+0.047
Because+0.034
ered+0.032
apy+0.032
ves+0.031
Neg logit contributions
 sill-0.047
 angles-0.045
igsaw-0.045
 redd-0.044
 microsc-0.043
Token Ben
Token ID3932
Feature activation-1.63e+01

Pos logit contributions
uddled+0.205
:+0.171
 -+0.167
ered+0.164
;+0.164
Neg logit contributions
igsaw-0.116
 ey-0.104
 redd-0.104
Mic-0.097
 yak-0.087
Token\n
Token ID198
Feature activation-2.86e+01

Pos logit contributions
uddled+0.093
 died+0.067
ered+0.067
 -+0.066
:+0.066
Neg logit contributions
igsaw-0.107
 redd-0.099
Mic-0.098
 ey-0.096
die-0.090
Token"
Token ID1
Feature activation-3.55e+01

Pos logit contributions
uddled+0.103
 died+0.078
:+0.077
icked+0.076
ered+0.076
Neg logit contributions
igsaw-0.104
 redd-0.100
Mic-0.094
 ey-0.093
 ju-0.089
TokenBut
Token ID1537
Feature activation-4.50e+01

Pos logit contributions
uddled+0.116
:+0.093
 -+0.088
;+0.086
Because+0.085
Neg logit contributions
igsaw-0.140
 ey-0.128
 redd-0.127
Mic-0.122
 angles-0.115
Token why
Token ID1521
Feature activation-4.53e+01

Pos logit contributions
uddled+0.183
 died+0.132
ves+0.131
icked+0.125
Because+0.121
Neg logit contributions
 redd-0.149
igsaw-0.142
 ey-0.137
 ju-0.134
 angles-0.126
Token?
Token ID30
Feature activation-5.18e+01

Pos logit contributions
uddled+0.176
icked+0.135
ged+0.131
ided+0.128
ered+0.123
Neg logit contributions
 redd-0.138
igsaw-0.138
 ey-0.133
nam-0.132
 universe-0.125
Token It
Token ID632
Feature activation-3.52e+01

Pos logit contributions
uddled+0.226
ered+0.176
ves+0.176
 died+0.165
:+0.163
Neg logit contributions
igsaw-0.167
 redd-0.149
 ey-0.144
Mic-0.135
acy-0.133
Token smells
Token ID25760
Feature activation-3.67e+01

Pos logit contributions
uddled+0.122
:+0.089
;+0.086
 -+0.085
 :+0.082
Neg logit contributions
 redd-0.130
igsaw-0.129
 ey-0.121
 ju-0.116
die-0.114
Token so
Token ID523
Feature activation-2.25e+01

Pos logit contributions
uddled+0.214
ided+0.172
okes+0.171
rolled+0.170
ipped+0.168
Neg logit contributions
 redd-0.074
 ju-0.073
 sunny-0.064
igsaw-0.061
igator-0.059
Token nice
Token ID3621
Feature activation-1.01e+01

Pos logit contributions
uddled+0.105
ered+0.081
ided+0.080
:+0.079
 -+0.079
Neg logit contributions
igsaw-0.062
 redd-0.058
 ey-0.051
 ju-0.049
 microsc-0.047
Token,"
Token ID553
Feature activation-5.45e+01

Pos logit contributions
uddled+0.035
ided+0.025
 died+0.024
icked+0.024
rolled+0.024
Neg logit contributions
igsaw-0.038
 redd-0.038
 ju-0.036
igator-0.035
die-0.033
Token said
Token ID531
Feature activation-5.41e+01

Pos logit contributions
uddled+0.171
anas+0.107
 died+0.106
owed+0.105
icked+0.105
Neg logit contributions
 redd-0.226
igsaw-0.215
 ju-0.208
 ey-0.202
 gol-0.197
Token\n
Token ID198
Feature activation-1.57e+01

Pos logit contributions
uddled+0.045
:+0.033
 -+0.032
;+0.031
ered+0.031
Neg logit contributions
igsaw-0.061
 redd-0.057
Mic-0.056
 ey-0.056
|-0.052
TokenL
Token ID43
Feature activation+3.72e+01

Pos logit contributions
uddled+0.053
:+0.041
 -+0.040
icked+0.039
;+0.039
Neg logit contributions
igsaw-0.060
 redd-0.057
Mic-0.057
 ey-0.052
 gol-0.051
Tokenily
Token ID813
Feature activation+6.33e+00

Pos logit contributions
igsaw+0.129
 redd+0.120
 ey+0.118
Mic+0.114
 angles+0.104
Neg logit contributions
uddled-0.139
:-0.115
 --0.109
;-0.107
ered-0.104
Token and
Token ID290
Feature activation-2.51e+01

Pos logit contributions
die+0.023
ji+0.022
 ey+0.021
Mic+0.021
 yak+0.021
Neg logit contributions
uddled-0.020
apy-0.016
ves-0.015
icked-0.014
ered-0.014
Token Tom
Token ID4186
Feature activation-9.80e+00

Pos logit contributions
uddled+0.072
:+0.059
 -+0.055
;+0.053
 and+0.051
Neg logit contributions
igsaw-0.109
 redd-0.102
 ey-0.101
Mic-0.097
 angles-0.092
Token were
Token ID547
Feature activation-2.94e+01

Pos logit contributions
uddled+0.043
ves+0.036
oses+0.033
rums+0.032
mmm+0.032
Neg logit contributions
 distance-0.026
 tad-0.024
ny-0.022
 seaw-0.022
 mer-0.022
Token scared
Token ID12008
Feature activation-2.41e+01

Pos logit contributions
uddled+0.120
 -+0.090
:+0.089
;+0.089
Because+0.088
Neg logit contributions
 redd-0.085
igsaw-0.084
 ey-0.082
Mic-0.077
die-0.076
Token.
Token ID13
Feature activation-4.33e+01

Pos logit contributions
uddled+0.065
ered+0.041
ves+0.039
anas+0.038
OP+0.036
Neg logit contributions
igsaw-0.104
 ey-0.102
 redd-0.096
 angles-0.096
 sill-0.094
Token They
Token ID1119
Feature activation-3.04e+01

Pos logit contributions
uddled+0.186
ered+0.143
 kissed+0.142
pt+0.141
icked+0.140
Neg logit contributions
igsaw-0.122
 redd-0.116
 sill-0.107
 ey-0.102
|-0.098
Token ran
Token ID4966
Feature activation-4.65e+01

Pos logit contributions
uddled+0.110
ves+0.074
Because+0.072
;+0.068
okes+0.067
Neg logit contributions
 redd-0.101
 ey-0.101
Mic-0.101
nam-0.099
die-0.096
Token to
Token ID284
Feature activation-2.57e+01

Pos logit contributions
uddled+0.122
:+0.091
 -+0.084
;+0.081
ered+0.078
Neg logit contributions
igsaw-0.212
 redd-0.200
 ey-0.198
Mic-0.192
 angles-0.180
Token lived
Token ID5615
Feature activation-3.05e+01

Pos logit contributions
uddled+0.073
rums+0.059
okes+0.059
icked+0.058
wr+0.058
Neg logit contributions
 universe-0.043
|-0.038
ji-0.038
 sunny-0.037
 day-0.036
Token in
Token ID287
Feature activation-8.84e+00

Pos logit contributions
uddled+0.075
:+0.054
 -+0.051
;+0.050
ered+0.047
Neg logit contributions
igsaw-0.141
 redd-0.135
 ey-0.133
Mic-0.129
die-0.121
Token a
Token ID257
Feature activation-2.94e+00

Pos logit contributions
 and+0.008
 that+0.008
 how+0.008
 -+0.008
:+0.007
Neg logit contributions
 redd-0.049
igsaw-0.048
Mic-0.048
 ey-0.047
aph-0.046
Token small
Token ID1402
Feature activation+1.95e+01

Pos logit contributions
uddled+0.009
:+0.009
 -+0.008
,+0.008
 and+0.008
Neg logit contributions
igsaw-0.010
 ey-0.010
 redd-0.010
Mic-0.009
keeping-0.009
Token house
Token ID2156
Feature activation-2.63e+01

Pos logit contributions
igsaw+0.062
 redd+0.059
Mic+0.057
 ey+0.057
 yak+0.050
Neg logit contributions
uddled-0.072
:-0.064
 --0.063
;-0.061
 and-0.060
Token with
Token ID351
Feature activation-2.68e+01

Pos logit contributions
uddled+0.097
icked+0.073
wr+0.070
 calmed+0.070
rums+0.069
Neg logit contributions
 sill-0.091
igsaw-0.086
 redd-0.082
 sunny-0.080
 distance-0.080
Token his
Token ID465
Feature activation+5.49e+00

Pos logit contributions
uddled+0.086
:+0.070
 -+0.066
;+0.064
ered+0.062
Neg logit contributions
igsaw-0.105
 redd-0.098
 ey-0.097
Mic-0.094
 yak-0.087
Token mom
Token ID1995
Feature activation-1.90e+01

Pos logit contributions
igsaw+0.018
 ey+0.016
 redd+0.016
Mic+0.015
 universe+0.014
Neg logit contributions
uddled-0.022
ered-0.018
icked-0.017
:-0.017
 calmed-0.017
Token and
Token ID290
Feature activation-2.66e+01

Pos logit contributions
uddled+0.064
ves+0.047
ered+0.046
wr+0.043
icked+0.043
Neg logit contributions
igsaw-0.074
 sill-0.071
 redd-0.070
ji-0.068
die-0.068
Token dad
Token ID9955
Feature activation-2.02e+01

Pos logit contributions
uddled+0.108
:+0.081
 -+0.080
ered+0.080
;+0.079
Neg logit contributions
 redd-0.081
igsaw-0.078
nam-0.077
 ey-0.076
Mic-0.070
Token.
Token ID13
Feature activation-4.31e+01

Pos logit contributions
uddled+0.087
wr+0.069
ves+0.069
okes+0.066
 tipped+0.063
Neg logit contributions
 day-0.062
nam-0.060
 student-0.056
 week-0.056
 sorts-0.056
Token would
Token ID561
Feature activation-9.02e+00

Pos logit contributions
 sill+0.011
 student+0.011
nam+0.011
 Prince+0.011
 day+0.010
Neg logit contributions
uddled-0.013
apy-0.010
ered-0.010
ided-0.010
oving-0.010
Token read
Token ID1100
Feature activation-2.20e+01

Pos logit contributions
uddled+0.046
icked+0.032
ed+0.032
ipped+0.031
okes+0.031
Neg logit contributions
 sill-0.026
 redd-0.024
 universe-0.024
nam-0.024
 ey-0.024
Token to
Token ID284
Feature activation-2.68e+00

Pos logit contributions
uddled+0.070
icked+0.046
okes+0.045
ered+0.044
 slapped+0.043
Neg logit contributions
igsaw-0.089
 redd-0.082
nam-0.079
 universe-0.079
Mic-0.079
Token her
Token ID607
Feature activation+1.86e+01

Pos logit contributions
uddled+0.010
:+0.007
icked+0.007
 -+0.007
;+0.007
Neg logit contributions
igsaw-0.009
 redd-0.009
 ey-0.009
Mic-0.008
nam-0.008
Token.
Token ID13
Feature activation-2.39e+01

Pos logit contributions
igsaw+0.063
nam+0.057
 redd+0.056
 student+0.055
 ey+0.055
Neg logit contributions
uddled-0.074
 calmed-0.055
icked-0.054
 died-0.054
ered-0.054
Token But
Token ID887
Feature activation-2.03e+01

Pos logit contributions
uddled+0.070
:+0.056
 -+0.052
;+0.051
 –+0.048
Neg logit contributions
igsaw-0.101
 redd-0.095
 ey-0.094
Mic-0.091
 yak-0.086
Token sometimes
Token ID3360
Feature activation-3.65e-01

Pos logit contributions
uddled+0.071
:+0.053
 -+0.051
;+0.050
ered+0.050
Neg logit contributions
igsaw-0.075
 redd-0.071
 ey-0.069
Mic-0.065
nam-0.063
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
uddled+0.002
icked+0.001
ided+0.001
olded+0.001
OP+0.001
Neg logit contributions
 universe-0.001
 avail-0.001
nam-0.001
igsaw-0.001
 Avery-0.001
Token she
Token ID673
Feature activation+2.08e-01

Pos logit contributions
uddled+0.048
:+0.038
 -+0.036
;+0.035
ered+0.033
Neg logit contributions
igsaw-0.065
 redd-0.060
 ey-0.060
Mic-0.059
 yak-0.054
Token wanted
Token ID2227
Feature activation-2.98e+01

Pos logit contributions
 student+0.001
acy+0.001
die+0.001
 redd+0.001
 universe+0.001
Neg logit contributions
uddled-0.001
ided-0.001
ered-0.001
wr-0.001
:"-0.001
Token to
Token ID284
Feature activation-8.22e+00

Pos logit contributions
uddled+0.123
ided+0.091
oving+0.090
icked+0.088
owed+0.086
Neg logit contributions
igsaw-0.096
 microsc-0.091
 redd-0.087
nam-0.086
 sill-0.085