TOP ACTIVATIONS
MAX = 37.492

My dear , without taking a
valuable ?" " My secret ," Lily explained .
boy said sadly , " My mum 's poorly , and
smiled and said , " My name is Billy . Nice
!" Lily cried . " My swing is broken . I
said the creature . " My name is Yoga ! I
agu ar replied , " My name is John . I
My hair is rainbow coloured and
, Bob said , " My prints look great ! I
closer and said , " My name is Fl uffy !"
My magic is in the salt
," he said . " My name is King . What
little girl said , " My name is Sophia . What
little girl replied , " My room is so empty ,
The boy replied : " My mum gave it to me
smiled and said , " My pleasure . Enjoy your new
Frank said sadly , " My fountain is broken and I
?" Lily replied , " My thumb hurts , but it
My name is Thunder . I
Tim my replied , " My car broke . Can you

MIN ACTIVATIONS
MIN = -43.607

. She took a deep breath and shouted " ST OP
, Lily took a deep breath and went into the woods
Tim my took a deep breath and opened the rocket .
she was almost out of breath . Finally she stopped .
L ily took a deep breath and started to climb .
. He took a deep breath and started to yell .
a cave with my fire breath ." They go
Tom took a deep breath and said , " I
. She took a deep breath , and slowly stepped closer
torch , took a deep breath and started on her adventure
He took a deep breath . He said , "
so he took a deep breath and kn elt down on
mom my took a deep breath and smiled . She said
Mark took a big breath and dove down to the
. He took a big breath and then carefully walked up
So he took a deep breath and whist led as loud
her to take a deep breath . Sarah closed her eyes
. She took a deep breath and decided to go inside
Jack took a deep breath and was relieved . They
. She took a deep breath and said , " Please

INTERVAL 17.217 - 37.492
CONTAINS 9.798%

better than before . The End <|endoftext|> Once
angry and very worried . She called the doctor and took
and sad . The king realized that he had
where you walk ." " Yes , mom ,
the wagon with Spot . They are very excited . They

INTERVAL -3.057 - 17.217
CONTAINS 46.959%

. They said that being quiet and kind is better than
for a while . Suddenly the new car blinked its lights
fire !" they shouted . Their mom and dad
up with dirt and after a few minutes the mouse and
to bed , hoping that maybe he would find his toy

INTERVAL -23.332 - -3.057
CONTAINS 41.763%

wanted to have their own . One day ,
who liked to learn new things . She had an exam
Sarah heard someone calling her name . It was her mom
hugs the doll and says , " Wow , Tom !
!" she said , her eyes wide with amaz ement .

INTERVAL -43.607 - -23.332
CONTAINS 1.480%

. It was a nice image of a castle , with
started to fold pieces of paper . The goose admired the
all about the great trade tree in his garden and how
. She felt the comb brush brush brush her hair .
he saw a brightly wrapped present sitting on the floor .
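The four INTERVAL boundaries above are consistent with splitting the range from MIN (-43.607) to MAX (37.492) into four equal-width bins, and the four CONTAINS percentages sum to 100%. The sketch below reproduces that arithmetic. It assumes equal-width binning, which is inferred from the numbers shown and is not a description of the dashboard's actual code; `interval_edges` is a hypothetical helper name.

```python
def interval_edges(lo, hi, n_bins):
    """Return n_bins + 1 equally spaced edges from lo to hi (assumed binning)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# MIN and MAX as reported in the dump (rounded to three decimals there).
edges = interval_edges(-43.607, 37.492, 4)

# Pair consecutive edges into intervals, listed highest first like the dump.
intervals = list(zip(edges[:-1], edges[1:]))[::-1]
for low, high in intervals:
    print(f"INTERVAL {low:.3f} - {high:.3f}")

# CONTAINS percentages copied from the dump, highest interval first.
shares = [9.798, 46.959, 41.763, 1.480]
total = sum(shares)
print(f"TOTAL {total:.3f}%")
```

Because the reported MIN and MAX are already rounded, the computed edges can differ from the listed ones in the last decimal place.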
Token
Token ID6184
Feature activation+5.77e+00

Pos logit contributions
-+0.700
ies+0.698
ish+0.587
ig+0.582
 window+0.578
Neg logit contributions
Don-0.474
Here-0.453
My-0.453
Well-0.437
Would-0.435
Token
Token ID95
Feature activation+4.55e-01

Pos logit contributions
Don+0.214
Well+0.197
Would+0.197
My+0.196
It+0.185
Neg logit contributions
ies-0.244
--0.220
ish-0.208
ishing-0.208
ig-0.204
Token
Token ID26391
Feature activation+1.09e+01

Pos logit contributions
Don+0.015
Would+0.014
My+0.013
Well+0.013
 Did+0.013
Neg logit contributions
--0.020
ies-0.018
an-0.018
ishing-0.018
ig-0.018
Token
Token ID129
Feature activation+7.09e+00

Pos logit contributions
Don+0.377
Would+0.355
My+0.346
Well+0.345
Here+0.327
Neg logit contributions
ies-0.478
--0.451
ig-0.413
ish-0.410
ishing-0.407
Token
Token ID241
Feature activation+1.25e+00

Pos logit contributions
Don+0.252
Would+0.233
Well+0.232
My+0.230
 Did+0.218
Neg logit contributions
ies-0.304
--0.286
ishing-0.265
ish-0.265
an-0.262
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Did+0.022
Would+0.022
Don+0.021
Here+0.021
My+0.020
Neg logit contributions
--0.063
ies-0.059
.-0.054
ater-0.054
!-0.053
Token dear
Token ID13674
Feature activation-4.28e+00

Pos logit contributions
Don+1.769
My+1.682
Well+1.640
It+1.635
 It+1.633
Neg logit contributions
ies-1.047
ishing-0.899
 island-0.883
ish-0.851
--0.842
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
ies+0.092
ishing+0.085
ish+0.076
an+0.075
 island+0.074
Neg logit contributions
Don-0.226
Well-0.216
My-0.214
It-0.213
I-0.211
Token without
Token ID1231
Feature activation-2.12e+00

Pos logit contributions
ies+0.675
-+0.632
ish+0.582
ig+0.578
ishing+0.573
Neg logit contributions
Don-0.561
Would-0.527
Well-0.514
My-0.513
 Did-0.484
Token taking
Token ID2263
Feature activation+2.44e+00

Pos logit contributions
ies+0.081
-+0.072
ish+0.070
an+0.069
ishing+0.068
Neg logit contributions
Don-0.087
My-0.081
Would-0.080
Well-0.080
It-0.076
Token a
Token ID257
Feature activation+1.19e+01

Pos logit contributions
Don+0.096
My+0.089
Well+0.088
Would+0.088
It+0.084
Neg logit contributions
ies-0.097
--0.083
ish-0.083
ishing-0.082
an-0.081
Token valuable
Token ID8119
Feature activation+5.40e+00

Pos logit contributions
Don+0.297
My+0.279
Would+0.274
Well+0.273
It+0.264
Neg logit contributions
ies-0.246
--0.216
ish-0.205
ishing-0.204
ig-0.202
Token?"
Token ID1701
Feature activation-9.11e+00

Pos logit contributions
Don+0.210
 Did+0.197
 It+0.197
My+0.196
It+0.195
Neg logit contributions
ies-0.182
ishing-0.180
ig-0.173
 breath-0.171
ish-0.168
Token
Token ID198
Feature activation+2.08e+01

Pos logit contributions
-+0.381
ies+0.355
.+0.347
,+0.337
 fort+0.310
Neg logit contributions
Don-0.262
Even-0.254
Take-0.252
My-0.252
Me-0.246
Token
Token ID198
Feature activation+2.07e+01

Pos logit contributions
Don+0.532
Well+0.481
Here+0.466
My+0.463
 Did+0.455
Neg logit contributions
--1.119
ies-1.055
ig-0.940
.-0.933
ie-0.916
Token"
Token ID1
Feature activation+5.06e+00

Pos logit contributions
Don+0.801
Well+0.780
Would+0.768
My+0.765
 Did+0.760
Neg logit contributions
--0.802
ies-0.757
ig-0.645
an-0.630
 fort-0.611
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Everything+0.099
 Eventually+0.096
 Finally+0.094
 From+0.093
 Afterwards+0.083
Neg logit contributions
ater-0.211
--0.203
ies-0.198
ute-0.183
an-0.181
Token secret
Token ID3200
Feature activation-2.62e+00

Pos logit contributions
 It+1.238
My+1.229
 Something+1.205
Don+1.186
 Who+1.161
Neg logit contributions
ies-1.495
ishing-1.458
ig-1.298
ie-1.287
--1.272
Token,"
Token ID553
Feature activation-1.18e+01

Pos logit contributions
ishing+0.088
ies+0.087
ater+0.080
ish+0.080
ig+0.077
Neg logit contributions
 It-0.095
Don-0.093
It-0.092
I-0.091
 Did-0.090
Token Lily
Token ID20037
Feature activation+1.37e+01

Pos logit contributions
ies+0.444
-+0.420
 mess+0.363
ie+0.360
 choice+0.360
Neg logit contributions
Don-0.426
Would-0.414
Well-0.400
Even-0.399
My-0.396
Token explained
Token ID4893
Feature activation+3.57e+00

Pos logit contributions
Don+0.492
Would+0.459
Well+0.449
My+0.447
Here+0.432
Neg logit contributions
ies-0.591
--0.547
ish-0.507
ishing-0.496
ig-0.495
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
Don+0.175
My+0.167
 She+0.164
 It+0.161
Well+0.157
Neg logit contributions
ies-0.088
ig-0.086
an-0.083
ishing-0.083
ish-0.081
Token boy
Token ID2933
Feature activation-1.01e+01

Pos logit contributions
Don+1.289
Well+1.215
Would+1.203
My+1.194
That+1.149
Neg logit contributions
ies-1.111
ishing-0.988
--0.949
ish-0.938
ig-0.917
Token said
Token ID531
Feature activation+7.54e+00

Pos logit contributions
ies+0.448
-+0.417
ish+0.392
ishing+0.388
ig+0.385
Neg logit contributions
Don-0.345
Would-0.317
Well-0.314
My-0.312
 Did-0.294
Token sadly
Token ID21098
Feature activation+4.67e+00

Pos logit contributions
Don+0.410
Would+0.387
My+0.387
Well+0.385
It+0.372
Neg logit contributions
ies-0.186
--0.151
ishing-0.146
ish-0.144
ig-0.139
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.244
Would+0.226
Well+0.224
My+0.221
 It+0.221
Neg logit contributions
ies-0.114
ishing-0.101
an-0.098
ater-0.093
ish-0.093
Token "
Token ID366
Feature activation+1.33e+01

Pos logit contributions
ies+0.715
-+0.715
ish+0.612
ishing+0.606
ig+0.602
Neg logit contributions
Don-0.457
Here-0.421
Would-0.419
My-0.418
There-0.416
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Did+0.198
 Everything+0.188
 Their+0.173
 From+0.166
 There+0.164
Neg logit contributions
--0.695
ater-0.675
ies-0.665
an-0.645
ishing-0.626
Token mum
Token ID25682
Feature activation-1.46e+01

Pos logit contributions
Don+1.712
My+1.610
It+1.606
 It+1.600
Well+1.573
Neg logit contributions
ies-1.007
ishing-0.960
ater-0.809
ish-0.804
--0.788
Token's
Token ID338
Feature activation+6.83e+00

Pos logit contributions
ies+0.604
-+0.538
ishing+0.534
an+0.519
ig+0.514
Neg logit contributions
Don-0.520
Well-0.484
My-0.482
Would-0.468
 Did-0.460
Token poorly
Token ID13455
Feature activation+6.73e+00

Pos logit contributions
Don+0.310
It+0.289
Well+0.288
My+0.285
 It+0.284
Neg logit contributions
ies-0.197
ishing-0.185
ish-0.164
--0.158
an-0.155
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.280
Well+0.258
My+0.255
 Did+0.251
I+0.250
Neg logit contributions
ies-0.230
ishing-0.204
--0.196
ish-0.196
ig-0.195
Token and
Token ID290
Feature activation-1.04e+01

Pos logit contributions
ies+0.648
-+0.597
ish+0.558
ishing+0.552
ig+0.546
Neg logit contributions
Don-0.593
Would-0.553
Well-0.548
My-0.545
 Did-0.513
Token smiled
Token ID13541
Feature activation+2.92e+00

Pos logit contributions
ies+0.438
-+0.412
ish+0.381
ishing+0.374
ig+0.374
Neg logit contributions
Don-0.357
Would-0.326
Well-0.325
My-0.317
There-0.309
Token and
Token ID290
Feature activation-1.04e+01

Pos logit contributions
Don+0.149
My+0.140
It+0.138
Would+0.136
Well+0.135
Neg logit contributions
ies-0.078
ig-0.067
 decision-0.065
an-0.065
ishing-0.064
Token said
Token ID531
Feature activation+7.55e+00

Pos logit contributions
ies+0.399
-+0.371
ish+0.322
ishing+0.315
 airplane+0.306
Neg logit contributions
Don-0.411
Would-0.397
Well-0.386
Here-0.384
 Did-0.380
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.457
My+0.437
Would+0.437
Well+0.433
 Did+0.423
Neg logit contributions
ies-0.134
--0.102
ishing-0.096
ish-0.092
an-0.089
Token "
Token ID366
Feature activation+1.34e+01

Pos logit contributions
-+0.641
ies+0.640
 window+0.543
,+0.541
 head+0.536
Neg logit contributions
Don-0.520
Here-0.492
Well-0.487
Ah-0.478
There-0.472
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Sadly+0.203
 Everything+0.202
 Finally+0.190
 Did+0.177
 Eventually+0.174
Neg logit contributions
--0.652
ies-0.633
an-0.606
 hat-0.569
ie-0.568
Token name
Token ID1438
Feature activation-1.41e+01

Pos logit contributions
Don+1.273
My+1.246
Would+1.189
 It+1.160
Well+1.145
Neg logit contributions
ies-1.543
ishing-1.452
ig-1.418
--1.389
ish-1.339
Token is
Token ID318
Feature activation+8.23e+00

Pos logit contributions
ies+0.543
-+0.493
ish+0.462
ishing+0.462
ig+0.454
Neg logit contributions
Don-0.566
Would-0.526
Well-0.524
My-0.523
 Did-0.493
Token Billy
Token ID15890
Feature activation+6.65e-01

Pos logit contributions
Don+0.271
Well+0.244
Would+0.234
Here+0.227
There+0.227
Neg logit contributions
ies-0.376
--0.368
ish-0.327
ie-0.318
 home-0.318
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
Don+0.033
My+0.032
Would+0.031
 Did+0.030
Well+0.030
Neg logit contributions
ies-0.017
ishing-0.015
--0.015
ig-0.014
 breath-0.014
Token Nice
Token ID18460
Feature activation+1.62e+01

Pos logit contributions
-+0.435
ies+0.417
,+0.406
ish+0.381
 ship+0.377
Neg logit contributions
Don-0.426
Here-0.382
There-0.381
Well-0.376
Ah-0.373
Token!"
Token ID2474
Feature activation-1.88e+01

Pos logit contributions
Don+0.671
Well+0.648
It+0.647
See+0.645
That+0.626
Neg logit contributions
an-0.514
 island-0.482
ies-0.478
ishing-0.476
ish-0.441
Token Lily
Token ID20037
Feature activation+1.37e+01

Pos logit contributions
ies+0.752
-+0.741
 eye+0.635
 mess+0.622
 breath+0.622
Neg logit contributions
Would-0.694
Don-0.664
Well-0.648
Here-0.607
Did-0.606
Token cried
Token ID16896
Feature activation+6.64e+00

Pos logit contributions
Don+0.429
Would+0.393
Well+0.389
My+0.385
 Did+0.358
Neg logit contributions
ies-0.654
--0.608
ish-0.574
ishing-0.571
ig-0.566
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
Don+0.352
My+0.333
It+0.316
 She+0.316
Well+0.314
Neg logit contributions
ies-0.172
an-0.154
ishing-0.146
ig-0.145
 island-0.144
Token "
Token ID366
Feature activation+1.33e+01

Pos logit contributions
-+0.535
ies+0.535
,+0.519
 head+0.516
 window+0.502
Neg logit contributions
Looks-0.413
Don-0.394
Ah-0.379
Would-0.378
Well-0.375
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Did+0.192
 From+0.186
 Everything+0.184
Would+0.182
 Finally+0.174
Neg logit contributions
--0.694
ies-0.680
ater-0.673
 mess-0.666
an-0.652
Token swing
Token ID9628
Feature activation-2.23e+01

Pos logit contributions
Don+1.698
My+1.620
It+1.611
I+1.600
Well+1.581
Neg logit contributions
ies-1.052
ishing-0.996
 island-0.908
le-0.893
an-0.887
Token is
Token ID318
Feature activation+8.22e+00

Pos logit contributions
ies+0.912
ishing+0.818
-+0.786
ish+0.772
ig+0.771
Neg logit contributions
Don-0.831
My-0.763
Well-0.746
It-0.733
 Did-0.726
Token broken
Token ID5445
Feature activation+1.41e+00

Pos logit contributions
Don+0.349
My+0.328
It+0.322
There+0.317
Well+0.308
Neg logit contributions
ies-0.299
an-0.262
ish-0.253
ishing-0.251
--0.247
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
Don+0.051
 She+0.050
It+0.049
I+0.048
 He+0.047
Neg logit contributions
an-0.045
ies-0.045
ig-0.044
 eye-0.043
 decision-0.043
Token I
Token ID314
Feature activation+1.64e+01

Pos logit contributions
-+0.491
 head+0.474
 eye+0.468
ish+0.448
 house+0.448
Neg logit contributions
Don-0.416
Well-0.381
Here-0.372
Looks-0.367
Would-0.359
Token said
Token ID531
Feature activation+7.55e+00

Pos logit contributions
ies+0.440
-+0.406
 breath+0.366
 eye+0.347
 island+0.345
Neg logit contributions
Don-0.459
Would-0.449
Here-0.436
Even-0.423
 Did-0.417
Token the
Token ID262
Feature activation+7.96e+00

Pos logit contributions
Don+0.265
Would+0.245
Well+0.236
Here+0.232
My+0.230
Neg logit contributions
ies-0.328
--0.316
 breath-0.286
ig-0.284
ish-0.283
Token creature
Token ID7185
Feature activation-1.43e+01

Pos logit contributions
Don+0.418
Would+0.398
My+0.397
Well+0.396
It+0.380
Neg logit contributions
ies-0.218
--0.184
ishing-0.173
ish-0.171
ig-0.167
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
ies+0.335
ishing+0.290
 mess+0.272
ater+0.260
ish+0.256
Neg logit contributions
Don-0.758
My-0.755
Would-0.737
Well-0.700
It-0.695
Token "
Token ID366
Feature activation+1.33e+01

Pos logit contributions
,+0.476
-+0.466
.+0.456
ies+0.445
!+0.427
Neg logit contributions
Here-0.496
Don-0.449
Take-0.419
There-0.416
My-0.405
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Finally+0.264
 Sadly+0.257
 From+0.238
 Eventually+0.236
 Everything+0.235
Neg logit contributions
ater-0.586
--0.583
ies-0.572
an-0.546
le-0.512
Token name
Token ID1438
Feature activation-1.41e+01

Pos logit contributions
Don+1.163
My+1.147
 It+1.107
Would+1.082
 Something+1.067
Neg logit contributions
ies-1.526
ishing-1.461
--1.332
ig-1.327
ish-1.312
Token is
Token ID318
Feature activation+8.22e+00

Pos logit contributions
ies+0.580
-+0.537
ish+0.499
ishing+0.494
ig+0.489
Neg logit contributions
Don-0.528
Would-0.491
Well-0.487
My-0.480
 Did-0.453
Token Yoga
Token ID32856
Feature activation+4.00e+00

Pos logit contributions
Don+0.277
Well+0.251
Would+0.243
There+0.232
Here+0.232
Neg logit contributions
ies-0.370
--0.360
ish-0.325
an-0.317
ig-0.315
Token!
Token ID0
Feature activation-1.75e+01

Pos logit contributions
Don+0.185
 It+0.182
Well+0.178
I+0.178
It+0.177
Neg logit contributions
ies-0.100
ishing-0.100
ig-0.090
 breath-0.089
ish-0.088
Token I
Token ID314
Feature activation+1.64e+01

Pos logit contributions
-+0.655
ies+0.552
ish+0.548
 eye+0.548
ig+0.543
Neg logit contributions
Don-0.503
Here-0.446
Well-0.436
There-0.431
Over-0.427
Tokenagu
Token ID11433
Feature activation-1.05e+01

Pos logit contributions
Don+0.101
Well+0.099
My+0.095
Would+0.095
There+0.093
Neg logit contributions
ies-0.059
--0.048
ish-0.047
ishing-0.046
 mess-0.045
Tokenar
Token ID283
Feature activation+7.44e-01

Pos logit contributions
ies+0.352
-+0.315
ish+0.292
ishing+0.291
ig+0.285
Neg logit contributions
Don-0.477
Would-0.448
Well-0.446
My-0.444
 Did-0.422
Token replied
Token ID8712
Feature activation+8.10e-01

Pos logit contributions
Don+0.024
Well+0.022
Would+0.022
My+0.021
It+0.020
Neg logit contributions
ies-0.035
--0.032
ishing-0.030
ish-0.030
ig-0.030
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.044
My+0.042
Would+0.042
 It+0.041
 Did+0.041
Neg logit contributions
ies-0.019
ishing-0.015
an-0.014
 eye-0.014
ish-0.014
Token "
Token ID366
Feature activation+1.33e+01

Pos logit contributions
ies+0.713
-+0.704
ish+0.607
 breath+0.600
an+0.597
Neg logit contributions
Don-0.475
Here-0.446
Well-0.445
My-0.439
There-0.435
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Finally+0.253
 Everything+0.246
 Their+0.241
 Sadly+0.236
 Something+0.236
Neg logit contributions
--0.582
ater-0.543
an-0.540
ies-0.539
er-0.508
Token name
Token ID1438
Feature activation-1.41e+01

Pos logit contributions
Don+1.320
My+1.302
Would+1.252
Well+1.237
 It+1.227
Neg logit contributions
ies-1.462
ishing-1.354
--1.269
ig-1.245
an-1.229
Token is
Token ID318
Feature activation+8.22e+00

Pos logit contributions
ies+0.549
-+0.501
ish+0.469
ishing+0.467
ig+0.460
Neg logit contributions
Don-0.560
Would-0.520
Well-0.517
My-0.515
 Did-0.486
Token John
Token ID1757
Feature activation+6.82e+00

Pos logit contributions
Don+0.276
Well+0.246
Would+0.241
Here+0.232
There+0.229
Neg logit contributions
ies-0.370
--0.359
ish-0.324
ig-0.314
an-0.313
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
Don+0.351
My+0.351
Would+0.334
Well+0.328
It+0.327
Neg logit contributions
ies-0.157
ishing-0.136
 breath-0.125
ig-0.125
--0.122
Token I
Token ID314
Feature activation+1.64e+01

Pos logit contributions
-+0.427
,+0.393
ies+0.390
ish+0.381
 ship+0.372
Neg logit contributions
Don-0.459
There-0.395
Here-0.394
See-0.389
Well-0.383
Token
Token ID6184
Feature activation+5.76e+00

Pos logit contributions
ies+0.714
-+0.711
ish+0.605
ig+0.602
 eye+0.602
Neg logit contributions
Don-0.440
Here-0.430
My-0.426
Well-0.415
Would-0.413
Token
Token ID95
Feature activation+4.74e-01

Pos logit contributions
Don+0.215
Would+0.198
Well+0.197
My+0.196
 Did+0.184
Neg logit contributions
ies-0.240
--0.221
ish-0.208
ishing-0.207
ig-0.204
Token
Token ID26391
Feature activation+1.09e+01

Pos logit contributions
Don+0.015
Would+0.014
My+0.014
Well+0.014
 Did+0.013
Neg logit contributions
--0.020
ies-0.019
an-0.019
ig-0.018
ishing-0.018
Token
Token ID129
Feature activation+7.11e+00

Pos logit contributions
Don+0.377
Would+0.362
My+0.348
Well+0.346
Here+0.335
Neg logit contributions
ies-0.474
--0.454
ig-0.417
ish-0.410
an-0.399
Token
Token ID241
Feature activation+1.22e+00

Pos logit contributions
Don+0.268
Would+0.249
Well+0.248
My+0.246
 Did+0.234
Neg logit contributions
ies-0.290
--0.272
ish-0.251
ishing-0.251
an-0.249
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Did+0.019
Here+0.018
Would+0.018
Don+0.018
My+0.016
Neg logit contributions
--0.065
ies-0.063
.-0.058
!-0.057
ater-0.057
Token hair
Token ID4190
Feature activation-2.12e+01

Pos logit contributions
Don+1.510
My+1.474
 It+1.464
It+1.453
Well+1.434
Neg logit contributions
ies-1.256
ishing-1.098
 island-1.065
an-1.039
ie-1.024
Token is
Token ID318
Feature activation+8.22e+00

Pos logit contributions
ies+0.938
ishing+0.845
-+0.841
ish+0.818
an+0.809
Neg logit contributions
Don-0.697
My-0.653
Well-0.646
It-0.618
Would-0.612
Token rainbow
Token ID27223
Feature activation-1.65e+01

Pos logit contributions
Don+0.331
My+0.308
Would+0.303
Well+0.300
It+0.293
Neg logit contributions
ies-0.320
--0.284
ish-0.277
ishing-0.275
ig-0.270
Token coloured
Token ID34746
Feature activation-1.03e+01

Pos logit contributions
ies+0.353
ishing+0.309
 eye+0.303
 island+0.286
an+0.283
Neg logit contributions
Don-0.817
It-0.808
My-0.806
I-0.789
That-0.784
Token and
Token ID290
Feature activation-1.04e+01

Pos logit contributions
ies+0.277
ishing+0.234
an+0.230
 decision+0.223
ish+0.218
Neg logit contributions
Don-0.513
My-0.499
It-0.494
I-0.488
 It-0.485
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
ies+0.022
an+0.021
 eye+0.019
ish+0.019
ater+0.018
Neg logit contributions
Don-0.037
My-0.036
It-0.036
Well-0.036
Would-0.035
Token Bob
Token ID5811
Feature activation+4.44e+00

Pos logit contributions
ies+0.664
-+0.618
 breath+0.572
ish+0.559
 home+0.556
Neg logit contributions
Don-0.551
Would-0.534
 Did-0.503
Well-0.503
My-0.489
Token said
Token ID531
Feature activation+7.57e+00

Pos logit contributions
Don+0.146
Well+0.133
Would+0.133
My+0.133
It+0.124
Neg logit contributions
ies-0.203
--0.186
ishing-0.178
ish-0.177
ig-0.175
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.428
Would+0.409
My+0.407
Well+0.405
 Did+0.395
Neg logit contributions
ies-0.165
ishing-0.129
--0.129
ish-0.125
an-0.123
Token "
Token ID366
Feature activation+1.33e+01

Pos logit contributions
-+0.648
ies+0.624
,+0.537
.+0.530
 for+0.526
Neg logit contributions
Don-0.519
Here-0.506
There-0.485
My-0.479
Well-0.475
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Their+0.249
 Did+0.242
 Sadly+0.231
 There+0.223
 Something+0.208
Neg logit contributions
ater-0.594
--0.589
ies-0.582
est-0.580
an-0.565
Token prints
Token ID20842
Feature activation-1.82e+01

Pos logit contributions
Don+1.529
 It+1.515
 Something+1.478
My+1.474
It+1.448
Neg logit contributions
ies-1.092
ishing-0.996
 island-0.960
--0.946
ie-0.916
Token look
Token ID804
Feature activation-6.14e+00

Pos logit contributions
ies+0.728
-+0.654
ishing+0.648
ish+0.628
an+0.624
Neg logit contributions
Don-0.678
My-0.638
Well-0.629
It-0.617
Would-0.606
Token great
Token ID1049
Feature activation+5.63e+00

Pos logit contributions
ies+0.248
an+0.222
ig+0.221
ishing+0.216
ish+0.216
Neg logit contributions
Don-0.228
My-0.215
Would-0.210
It-0.210
Well-0.204
Token!
Token ID0
Feature activation-1.75e+01

Pos logit contributions
It+0.231
Don+0.231
 It+0.229
My+0.228
I+0.227
Neg logit contributions
an-0.152
ies-0.147
 decision-0.132
ig-0.129
 eye-0.128
Token I
Token ID314
Feature activation+1.64e+01

Pos logit contributions
-+0.632
ish+0.604
 eye+0.573
!"+0.571
ig+0.569
Neg logit contributions
Don-0.580
Would-0.470
There-0.462
Over-0.460
Here-0.459
Token closer
Token ID5699
Feature activation-5.21e-01

Pos logit contributions
ies+0.177
ishing+0.154
ig+0.151
ish+0.145
 eye+0.141
Neg logit contributions
Don-0.268
Would-0.248
My-0.246
It-0.241
Well-0.238
Token and
Token ID290
Feature activation-1.04e+01

Pos logit contributions
ies+0.017
an+0.015
ishing+0.015
ish+0.015
ig+0.015
Neg logit contributions
Don-0.022
 It-0.021
It-0.021
 She-0.021
My-0.021
Token said
Token ID531
Feature activation+7.57e+00

Pos logit contributions
ies+0.379
-+0.345
ish+0.319
ishing+0.314
 breath+0.296
Neg logit contributions
Don-0.413
Well-0.405
My-0.387
Here-0.387
Would-0.386
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.447
Would+0.426
My+0.424
Well+0.424
 Did+0.409
Neg logit contributions
ies-0.149
--0.121
ishing-0.108
ish-0.107
ig-0.102
Token "
Token ID366
Feature activation+1.33e+01

Pos logit contributions
ies+0.699
-+0.685
ish+0.596
an+0.583
 window+0.583
Neg logit contributions
Don-0.476
Here-0.461
Well-0.447
My-0.443
There-0.432
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Sadly+0.240
 Their+0.236
 Did+0.233
 From+0.219
 Everything+0.214
Neg logit contributions
ater-0.616
an-0.597
--0.576
ies-0.571
est-0.555
Token name
Token ID1438
Feature activation-1.41e+01

Pos logit contributions
Don+1.430
My+1.381
 It+1.354
Would+1.353
 Something+1.310
Neg logit contributions
ies-1.220
ishing-1.151
ig-1.131
--1.101
ater-1.039
Token is
Token ID318
Feature activation+8.22e+00

Pos logit contributions
ies+0.543
-+0.492
ishing+0.465
ish+0.463
ig+0.456
Neg logit contributions
Don-0.563
Would-0.522
My-0.522
Well-0.521
It-0.491
Token Fl
Token ID1610
Feature activation+1.34e+01

Pos logit contributions
Don+0.259
Well+0.235
Would+0.232
My+0.225
There+0.213
Neg logit contributions
ies-0.391
--0.368
ish-0.346
ig-0.338
ishing-0.338
Tokenuffy
Token ID15352
Feature activation-4.13e-01

Pos logit contributions
Don+0.536
My+0.507
Well+0.506
I+0.505
Would+0.499
Neg logit contributions
ies-0.460
 decision-0.409
 airplane-0.406
 eye-0.404
 mess-0.400
Token!"
Token ID2474
Feature activation-1.89e+01

Pos logit contributions
ies+0.009
ishing+0.009
 eye+0.008
ig+0.008
-+0.008
Neg logit contributions
Don-0.020
My-0.020
Would-0.019
 Did-0.019
 It-0.019
Token
Token ID6184
Feature activation+5.76e+00

Pos logit contributions
ies+0.732
-+0.730
ish+0.625
 mess+0.620
an+0.619
Neg logit contributions
Don-0.451
My-0.422
Would-0.412
Well-0.411
Here-0.409
Token
Token ID95
Feature activation+4.74e-01

Pos logit contributions
Don+0.225
Well+0.208
Would+0.208
My+0.207
It+0.196
Neg logit contributions
ies-0.233
--0.208
ishing-0.196
ish-0.196
ig-0.192
Token
Token ID26391
Feature activation+1.09e+01

Pos logit contributions
Don+0.017
Would+0.015
Well+0.015
My+0.015
There+0.014
Neg logit contributions
--0.019
ies-0.019
an-0.018
ishing-0.018
ish-0.018
Token
Token ID129
Feature activation+7.11e+00

Pos logit contributions
Don+0.394
Would+0.371
My+0.362
Well+0.361
Here+0.346
Neg logit contributions
ies-0.459
--0.441
ig-0.398
ish-0.393
an-0.387
Token
Token ID241
Feature activation+1.22e+00

Pos logit contributions
Don+0.268
Would+0.249
Well+0.249
My+0.247
 Did+0.236
Neg logit contributions
ies-0.286
--0.275
an-0.253
ishing-0.251
ish-0.250
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
Don+0.020
Would+0.019
 Did+0.018
Here+0.017
My+0.017
Neg logit contributions
--0.065
ies-0.062
ater-0.058
.-0.056
!-0.055
Token magic
Token ID5536
Feature activation-3.50e+00

Pos logit contributions
Don+1.284
 It+1.248
My+1.246
 Something+1.217
 Did+1.204
Neg logit contributions
ies-1.496
ishing-1.345
--1.261
ish-1.245
an-1.242
Token is
Token ID318
Feature activation+8.22e+00

Pos logit contributions
ies+0.102
ishing+0.093
ish+0.085
-+0.081
an+0.080
Neg logit contributions
Don-0.152
Well-0.149
 It-0.149
It-0.149
My-0.146
Token in
Token ID287
Feature activation-2.28e+00

Pos logit contributions
Don+0.325
Would+0.301
My+0.301
Well+0.300
It+0.285
Neg logit contributions
ies-0.321
--0.289
ishing-0.276
ish-0.276
ig-0.271
Token the
Token ID262
Feature activation+7.94e+00

Pos logit contributions
ies+0.096
-+0.086
ish+0.084
ishing+0.084
ig+0.082
Neg logit contributions
Don-0.084
My-0.077
Would-0.077
Well-0.077
It-0.073
Token salt
Token ID8268
Feature activation-1.34e+01

Pos logit contributions
Don+0.369
My+0.351
Would+0.347
Ah+0.341
Here+0.341
Neg logit contributions
ies-0.269
ishing-0.220
--0.216
ish-0.204
ig-0.198
Token,"
Token ID553
Feature activation-1.18e+01

Pos logit contributions
Would+1.022
Don+1.013
My+0.992
Well+0.984
It+0.977
Neg logit contributions
ies-0.458
ishing-0.438
 eye-0.428
an-0.420
ig-0.417
Token he
Token ID339
Feature activation+1.64e+01

Pos logit contributions
ies+0.501
-+0.482
ish+0.424
 airplane+0.419
 breath+0.414
Neg logit contributions
Don-0.396
Would-0.374
Well-0.360
Even-0.352
My-0.351
Token said
Token ID531
Feature activation+7.56e+00

Pos logit contributions
Don+0.598
Would+0.554
Well+0.546
My+0.545
Here+0.529
Neg logit contributions
ies-0.714
--0.667
ish-0.600
an-0.590
ishing-0.584
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
Don+0.402
My+0.391
Would+0.385
 It+0.375
 Did+0.375
Neg logit contributions
ies-0.180
ig-0.159
ishing-0.157
an-0.149
ish-0.148
Token "
Token ID366
Feature activation+1.33e+01

Pos logit contributions
-+0.447
.+0.430
,+0.429
ies+0.391
 window+0.379
Neg logit contributions
Don-0.513
Looks-0.441
Here-0.439
Well-0.435
My-0.433
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Sadly+0.213
 Everything+0.202
Don+0.197
 Did+0.195
 From+0.194
Neg logit contributions
--0.637
ater-0.635
ies-0.619
ishing-0.560
an-0.555
Token name
Token ID1438
Feature activation-1.41e+01

Pos logit contributions
Would+1.450
 Who+1.425
Don+1.423
 It+1.411
My+1.405
Neg logit contributions
ishing-1.121
ies-1.109
ig-1.075
ish-0.996
--0.980
Token is
Token ID318
Feature activation+8.22e+00

Pos logit contributions
ies+0.551
-+0.499
ishing+0.474
ish+0.472
ig+0.465
Neg logit contributions
Don-0.555
My-0.515
Would-0.514
Well-0.513
It-0.484
Token King
Token ID2677
Feature activation-4.94e+00

Pos logit contributions
Don+0.278
Well+0.247
Would+0.241
My+0.234
There+0.228
Neg logit contributions
ies-0.377
--0.359
ish-0.326
ishing-0.316
an-0.315
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
ishing+0.122
ies+0.120
 eye+0.113
 breath+0.112
ig+0.112
Neg logit contributions
Don-0.230
My-0.229
Would-0.224
 Did-0.222
Well-0.219
Token What
Token ID1867
Feature activation+1.65e+01

Pos logit contributions
-+0.510
ies+0.486
,+0.462
.+0.457
!"+0.454
Neg logit contributions
Don-0.418
Here-0.349
Well-0.344
There-0.333
Looks-0.330
Token little
Token ID1310
Feature activation+1.38e+00

Pos logit contributions
Don+1.226
My+1.182
Would+1.160
I+1.142
It+1.113
Neg logit contributions
ies-1.062
ishing-1.017
ig-0.992
ish-0.966
--0.906
Token girl
Token ID2576
Feature activation-8.95e+00

Pos logit contributions
Don+0.050
My+0.047
I+0.047
Would+0.045
 She+0.045
Neg logit contributions
ies-0.051
ishing-0.051
ig-0.049
an-0.047
--0.047
Token said
Token ID531
Feature activation+7.55e+00

Pos logit contributions
ies+0.375
-+0.354
ish+0.320
ig+0.310
an+0.307
Neg logit contributions
Don-0.333
Would-0.309
Well-0.306
There-0.299
Here-0.299
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.443
My+0.426
Would+0.423
Well+0.415
 Did+0.411
Neg logit contributions
ies-0.148
ishing-0.117
--0.110
ish-0.108
an-0.107
Token "
Token ID366
Feature activation+1.33e+01

Pos logit contributions
-+0.647
ies+0.633
,+0.550
.+0.533
ish+0.529
Neg logit contributions
Don-0.515
Here-0.495
Well-0.494
There-0.492
Ah-0.475
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Finally+0.282
 Eventually+0.267
 Sadly+0.264
 Everything+0.251
 Hours+0.237
Neg logit contributions
--0.561
ies-0.525
an-0.503
flower-0.497
ater-0.496
Token name
Token ID1438
Feature activation-1.41e+01

Pos logit contributions
Don+1.247
My+1.214
Would+1.155
 It+1.134
 Did+1.106
Neg logit contributions
ies-1.580
ishing-1.491
ig-1.431
--1.420
ish-1.379
Token is
Token ID318
Feature activation+8.23e+00

Pos logit contributions
ies+0.551
-+0.498
ishing+0.473
ish+0.471
ig+0.463
Neg logit contributions
Don-0.559
My-0.518
Would-0.517
Well-0.516
It-0.488
Token Sophia
Token ID39953
Feature activation+3.17e+00

Pos logit contributions
Don+0.277
Well+0.253
Would+0.244
Here+0.234
There+0.231
Neg logit contributions
ies-0.369
--0.360
ish-0.314
ie-0.313
 home-0.313
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
My+0.162
Don+0.159
It+0.153
Would+0.153
I+0.151
Neg logit contributions
ishing-0.066
ies-0.065
 breath-0.058
 decision-0.058
 eye-0.057
Token What
Token ID1867
Feature activation+1.65e+01

Pos logit contributions
-+0.456
ies+0.452
,+0.451
 eye+0.417
.+0.415
Neg logit contributions
Don-0.404
Well-0.360
There-0.360
Here-0.355
Even-0.354
Token little
Token ID1310
Feature activation+1.39e+00

Pos logit contributions
Don+1.121
My+1.071
Would+1.065
Well+1.056
I+1.020
Neg logit contributions
ies-0.958
ishing-0.889
ig-0.881
ish-0.857
--0.825
Token girl
Token ID2576
Feature activation-8.96e+00

Pos logit contributions
Don+0.050
My+0.047
I+0.045
Would+0.045
Let+0.045
Neg logit contributions
ies-0.051
ishing-0.050
ater-0.050
ig-0.050
an-0.049
Token replied
Token ID8712
Feature activation+8.12e-01

Pos logit contributions
ies+0.379
-+0.359
ish+0.328
ishing+0.314
ig+0.314
Neg logit contributions
Don-0.329
Would-0.308
There-0.302
Well-0.301
Here-0.299
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.043
My+0.041
Would+0.040
 It+0.040
It+0.040
Neg logit contributions
ies-0.020
an-0.017
 island-0.017
ishing-0.017
 eye-0.016
Token "
Token ID366
Feature activation+1.33e+01

Pos logit contributions
ies+0.741
-+0.728
ish+0.640
ishing+0.627
an+0.614
Neg logit contributions
Don-0.453
Well-0.426
There-0.416
Here-0.414
Would-0.410
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
Would+0.244
 From+0.231
 Everything+0.231
 Higher+0.230
Here+0.224
Neg logit contributions
--0.694
ies-0.677
ater-0.619
an-0.606
ish-0.564
Token room
Token ID2119
Feature activation-2.32e+01

Pos logit contributions
Don+1.336
 It+1.309
 Did+1.279
 Something+1.277
 Who+1.260
Neg logit contributions
ies-1.344
ishing-1.306
ig-1.283
an-1.229
ater-1.181
Token is
Token ID318
Feature activation+8.20e+00

Pos logit contributions
ies+1.011
ishing+0.952
ig+0.906
-+0.906
ish+0.905
Neg logit contributions
Don-0.723
My-0.673
Well-0.666
It-0.654
I-0.654
Token so
Token ID523
Feature activation+2.80e+00

Pos logit contributions
Don+0.320
My+0.292
Would+0.289
Well+0.283
It+0.282
Neg logit contributions
ies-0.327
ish-0.291
an-0.289
ishing-0.288
ig-0.287
Token empty
Token ID6565
Feature activation+1.92e+00

Pos logit contributions
Don+0.144
Would+0.137
Well+0.136
My+0.136
 Did+0.129
Neg logit contributions
ies-0.077
--0.069
ish-0.059
ishing-0.059
ig-0.057
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.085
 It+0.081
It+0.080
I+0.080
 She+0.080
Neg logit contributions
ies-0.059
an-0.053
ish-0.051
ig-0.051
ishing-0.049
TokenThe
Token ID464
Feature activation+2.99e+01

Pos logit contributions
 Did+0.421
Don+0.413
Well+0.374
Would+0.371
 Everything+0.322
Neg logit contributions
--1.088
ies-1.007
ater-1.007
 mess-0.943
an-0.942
Token boy
Token ID2933
Feature activation-1.01e+01

Pos logit contributions
Don+1.230
Would+1.165
My+1.165
Well+1.137
I+1.117
Neg logit contributions
ies-1.141
ishing-0.991
--0.977
ig-0.973
ish-0.973
Token replied
Token ID8712
Feature activation+8.12e-01

Pos logit contributions
ies+0.415
-+0.404
ish+0.360
ig+0.347
 eye+0.346
Neg logit contributions
Don-0.376
Would-0.350
There-0.338
Here-0.337
Well-0.335
Token:
Token ID25
Feature activation+9.88e-02

Pos logit contributions
Don+0.034
Would+0.032
My+0.032
Well+0.032
It+0.030
Neg logit contributions
ies-0.030
--0.026
ishing-0.026
ish-0.025
ig-0.025
Token "
Token ID366
Feature activation+1.33e+01

Pos logit contributions
Don+0.003
Well+0.003
There+0.003
Here+0.003
Looks+0.003
Neg logit contributions
--0.004
ies-0.004
,-0.003
.-0.003
 castle-0.003
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Their+0.219
 Everything+0.215
 Sadly+0.205
 From+0.193
 Did+0.186
Neg logit contributions
ater-0.648
--0.643
ies-0.608
an-0.592
ish-0.574
Token mum
Token ID25682
Feature activation-1.46e+01

Pos logit contributions
My+1.750
Don+1.712
It+1.645
Would+1.638
I+1.638
Neg logit contributions
ies-1.057
ishing-0.887
 island-0.871
an-0.854
--0.845
Token gave
Token ID2921
Feature activation+5.91e+00

Pos logit contributions
ies+0.502
-+0.439
ishing+0.436
an+0.426
 breath+0.423
Neg logit contributions
Don-0.609
My-0.583
Well-0.581
Would-0.568
 Did-0.562
Token it
Token ID340
Feature activation-5.81e-01

Pos logit contributions
Don+0.232
Would+0.216
Well+0.215
My+0.213
 Did+0.201
Neg logit contributions
ies-0.236
--0.217
ish-0.202
ishing-0.201
ig-0.198
Token to
Token ID284
Feature activation-3.31e+00

Pos logit contributions
ies+0.021
ishing+0.018
ig+0.018
ish+0.018
an+0.018
Neg logit contributions
Don-0.023
My-0.023
Would-0.022
It-0.021
Well-0.021
Token me
Token ID502
Feature activation-1.66e+00

Pos logit contributions
ies+0.116
-+0.105
ish+0.097
ishing+0.097
ig+0.095
Neg logit contributions
Don-0.145
Would-0.136
Well-0.135
My-0.135
 Did-0.128
Token smiled
Token ID13541
Feature activation+2.95e+00

Pos logit contributions
ies+0.436
-+0.401
ishing+0.386
ish+0.385
ig+0.377
Neg logit contributions
Don-0.303
Well-0.276
Would-0.276
My-0.275
It-0.255
Token and
Token ID290
Feature activation-1.04e+01

Pos logit contributions
Don+0.155
My+0.146
It+0.143
Would+0.143
Well+0.143
Neg logit contributions
ies-0.075
ig-0.061
ishing-0.061
an-0.060
ish-0.060
Token said
Token ID531
Feature activation+7.57e+00

Pos logit contributions
ies+0.413
-+0.397
ish+0.336
ig+0.335
 breath+0.333
Neg logit contributions
Don-0.393
Would-0.385
 Did-0.365
My-0.360
Here-0.359
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.468
Would+0.447
Well+0.445
My+0.444
 Did+0.429
Neg logit contributions
ies-0.131
--0.103
ish-0.088
ishing-0.087
ig-0.082
Token "
Token ID366
Feature activation+1.33e+01

Pos logit contributions
-+0.671
ies+0.659
 airplane+0.563
 window+0.556
ig+0.556
Neg logit contributions
Don-0.483
Here-0.458
My-0.451
Well-0.439
Ah-0.437
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Did+0.242
 Their+0.229
 Something+0.222
 Sadly+0.211
 There+0.205
Neg logit contributions
--0.672
ies-0.662
ishing-0.615
ater-0.606
ie-0.602
Token pleasure
Token ID9476
Feature activation-8.85e+00

Pos logit contributions
Don+1.291
 It+1.271
 Something+1.242
It+1.231
My+1.226
Neg logit contributions
ies-1.461
ishing-1.304
an-1.206
ish-1.203
ater-1.200
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
ies+0.208
ishing+0.184
an+0.180
ish+0.175
ig+0.168
Neg logit contributions
Don-0.453
Well-0.439
It-0.430
I-0.428
My-0.425
Token Enjoy
Token ID18179
Feature activation+1.07e+01

Pos logit contributions
-+0.491
ig+0.454
 eye+0.451
ish+0.451
,+0.434
Neg logit contributions
Don-0.414
My-0.352
Over-0.351
Here-0.350
Ah-0.340
Token your
Token ID534
Feature activation+9.10e+00

Pos logit contributions
Don+0.391
Would+0.374
It+0.369
My+0.364
Well+0.363
Neg logit contributions
ies-0.387
ish-0.371
an-0.352
ishing-0.349
 eye-0.343
Token new
Token ID649
Feature activation+1.44e+00

Pos logit contributions
Don+0.342
Would+0.331
Here+0.327
My+0.320
Me+0.320
Neg logit contributions
ies-0.344
ish-0.291
ishing-0.288
an-0.285
ater-0.282
TokenFrank
Token ID17439
Feature activation+6.35e+00

Pos logit contributions
Don+0.581
 Did+0.569
Well+0.552
Would+0.547
My+0.522
Neg logit contributions
--0.958
ies-0.866
an-0.825
ater-0.820
ish-0.774
Token said
Token ID531
Feature activation+7.56e+00

Pos logit contributions
Don+0.221
Would+0.204
Well+0.202
My+0.202
 Did+0.188
Neg logit contributions
ies-0.280
--0.258
ish-0.243
ishing-0.242
ig-0.239
Token sadly
Token ID21098
Feature activation+4.65e+00

Pos logit contributions
Don+0.437
My+0.417
Would+0.415
Well+0.409
 Did+0.406
Neg logit contributions
ies-0.154
ishing-0.122
an-0.113
ish-0.112
 eye-0.112
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.235
 It+0.219
Would+0.219
 Did+0.216
 She+0.213
Neg logit contributions
ies-0.118
ishing-0.109
an-0.107
ater-0.103
ig-0.101
Token "
Token ID366
Feature activation+1.33e+01

Pos logit contributions
-+0.674
ies+0.661
,+0.564
ish+0.559
.+0.558
Neg logit contributions
Don-0.490
Here-0.465
There-0.465
Well-0.456
My-0.452
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Everything+0.208
 Finally+0.206
 Higher+0.194
 From+0.190
 Did+0.185
Neg logit contributions
--0.628
an-0.628
ater-0.619
ies-0.619
ish-0.571
Token fountain
Token ID29420
Feature activation-1.40e+01

Pos logit contributions
Don+1.581
 It+1.562
My+1.540
It+1.526
 Something+1.524
Neg logit contributions
ies-1.086
ishing-1.008
--0.879
ater-0.868
ie-0.854
Token is
Token ID318
Feature activation+8.23e+00

Pos logit contributions
ies+0.573
ishing+0.531
-+0.519
ig+0.510
ater+0.504
Neg logit contributions
Don-0.464
Well-0.424
My-0.423
It-0.415
I-0.409
Token broken
Token ID5445
Feature activation+1.42e+00

Pos logit contributions
Don+0.340
My+0.315
It+0.310
Would+0.306
Well+0.300
Neg logit contributions
ies-0.308
ig-0.264
an-0.262
ish-0.261
--0.261
Token and
Token ID290
Feature activation-1.04e+01

Pos logit contributions
Don+0.061
 It+0.060
 He+0.059
 She+0.058
It+0.058
Neg logit contributions
ies-0.041
ig-0.037
an-0.037
ishing-0.035
ish-0.034
Token I
Token ID314
Feature activation+1.64e+01

Pos logit contributions
ies+0.495
-+0.461
ish+0.437
ishing+0.431
ig+0.425
Neg logit contributions
Don-0.320
Would-0.299
Well-0.293
My-0.290
Here-0.270
Token?"
Token ID1701
Feature activation-9.12e+00

Pos logit contributions
Don+0.540
I+0.494
Would+0.485
My+0.485
 Did+0.483
Neg logit contributions
ies-0.474
 breath-0.463
ig-0.452
ishing-0.452
 decision-0.428
Token Lily
Token ID20037
Feature activation+1.37e+01

Pos logit contributions
-+0.365
ies+0.341
,+0.313
.+0.313
 mess+0.294
Neg logit contributions
Don-0.327
Well-0.308
Ah-0.306
Over-0.304
Would-0.304
Token replied
Token ID8712
Feature activation+7.88e-01

Pos logit contributions
Don+0.495
Would+0.464
Well+0.457
My+0.448
There+0.441
Neg logit contributions
ies-0.582
--0.544
ish-0.508
ishing-0.492
ig-0.490
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.045
My+0.043
Would+0.043
Well+0.042
It+0.042
Neg logit contributions
ies-0.017
ishing-0.013
an-0.013
ish-0.012
 eye-0.012
Token "
Token ID366
Feature activation+1.33e+01

Pos logit contributions
ies+0.721
-+0.716
ish+0.625
ishing+0.616
 mess+0.603
Neg logit contributions
Don-0.449
Well-0.433
Ah-0.415
Here-0.415
There-0.414
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Everything+0.257
 From+0.223
Would+0.223
 Finally+0.218
 Did+0.215
Neg logit contributions
--0.684
ies-0.676
ater-0.619
 eye-0.592
ie-0.590
Token thumb
Token ID15683
Feature activation-2.11e+01

Pos logit contributions
Don+1.730
My+1.703
It+1.664
I+1.628
Well+1.602
Neg logit contributions
ies-1.035
 island-0.928
ishing-0.907
--0.858
an-0.825
Token hurts
Token ID20406
Feature activation+2.47e+00

Pos logit contributions
ies+0.942
ishing+0.865
-+0.843
 island+0.838
ig+0.818
Neg logit contributions
Don-0.670
My-0.622
Well-0.595
It-0.589
I-0.586
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.128
My+0.122
It+0.119
Well+0.117
Would+0.117
Neg logit contributions
ies-0.065
ishing-0.055
ig-0.053
ish-0.052
--0.051
Token but
Token ID475
Feature activation-1.97e+00

Pos logit contributions
ies+0.612
-+0.558
ish+0.521
ishing+0.518
ig+0.511
Neg logit contributions
Don-0.636
Would-0.592
Well-0.589
My-0.586
 Did-0.553
Token it
Token ID340
Feature activation-5.85e-01

Pos logit contributions
ies+0.094
-+0.087
ish+0.082
ishing+0.081
ig+0.081
Neg logit contributions
Don-0.062
Would-0.057
Well-0.057
My-0.055
There-0.052
Token
Token ID6184
Feature activation+5.77e+00

Pos logit contributions
ies+0.635
-+0.602
ish+0.535
 eye+0.534
ishing+0.523
Neg logit contributions
Don-0.570
Would-0.542
Well-0.540
My-0.524
Here-0.516
Token
Token ID95
Feature activation+4.84e-01

Pos logit contributions
Don+0.210
Would+0.193
Well+0.193
My+0.192
It+0.180
Neg logit contributions
ies-0.248
--0.226
ish-0.214
ishing-0.213
ig-0.209
Token
Token ID26391
Feature activation+1.09e+01

Pos logit contributions
Don+0.016
Would+0.015
Well+0.015
My+0.014
 Did+0.014
Neg logit contributions
--0.021
ies-0.020
an-0.019
ig-0.019
 eye-0.019
Token
Token ID129
Feature activation+7.10e+00

Pos logit contributions
Don+0.426
Would+0.407
My+0.396
Well+0.394
Here+0.381
Neg logit contributions
ies-0.423
--0.405
ig-0.368
ish-0.359
ishing-0.353
Token
Token ID241
Feature activation+1.25e+00

Pos logit contributions
Don+0.270
Well+0.252
Would+0.252
My+0.249
 Did+0.240
Neg logit contributions
ies-0.281
--0.270
an-0.248
ishing-0.245
ish-0.244
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
Don+0.021
 Did+0.020
Would+0.020
Here+0.019
 Everything+0.017
Neg logit contributions
--0.066
ies-0.063
ater-0.059
.-0.058
!-0.057
Token name
Token ID1438
Feature activation-1.40e+01

Pos logit contributions
Don+1.359
My+1.293
Would+1.248
 It+1.242
Well+1.238
Neg logit contributions
ies-1.478
ishing-1.340
--1.272
ish-1.259
ig-1.247
Token is
Token ID318
Feature activation+8.21e+00

Pos logit contributions
ies+0.508
-+0.479
ish+0.429
ig+0.421
ishing+0.418
Neg logit contributions
Don-0.594
Would-0.559
Well-0.550
My-0.540
There-0.519
Token Thunder
Token ID9850
Feature activation+7.97e+00

Pos logit contributions
Don+0.280
Would+0.250
Well+0.249
Here+0.241
There+0.236
Neg logit contributions
--0.355
ies-0.355
ig-0.310
ish-0.310
an-0.303
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
Don+0.436
My+0.417
Well+0.416
Would+0.413
It+0.405
Neg logit contributions
ies-0.178
ishing-0.146
ish-0.135
--0.135
 eye-0.131
Token I
Token ID314
Feature activation+1.64e+01

Pos logit contributions
-+0.555
.+0.506
 eye+0.505
ies+0.503
!+0.500
Neg logit contributions
Don-0.323
Over-0.282
Here-0.277
There-0.254
Well-0.254
Token Tim
Token ID5045
Feature activation+9.58e+00

Pos logit contributions
-+0.428
,+0.405
.+0.397
ies+0.389
!+0.361
Neg logit contributions
Ah-0.237
Well-0.217
Don-0.217
Looks-0.214
Take-0.213
Tokenmy
Token ID1820
Feature activation-1.40e+00

Pos logit contributions
Don+0.346
My+0.317
Well+0.315
Would+0.310
It+0.298
Neg logit contributions
ies-0.395
--0.367
ishing-0.349
ig-0.343
ish-0.340
Token replied
Token ID8712
Feature activation+8.12e-01

Pos logit contributions
ies+0.063
-+0.059
ish+0.055
ishing+0.054
ig+0.053
Neg logit contributions
Don-0.048
Would-0.044
Well-0.044
My-0.043
There-0.041
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.047
My+0.045
Would+0.044
Well+0.043
It+0.043
Neg logit contributions
ies-0.017
ishing-0.013
ish-0.013
an-0.013
ig-0.013
Token "
Token ID366
Feature activation+1.34e+01

Pos logit contributions
-+0.699
ies+0.694
 mess+0.591
 head+0.584
ish+0.583
Neg logit contributions
Don-0.455
Well-0.451
Ah-0.447
There-0.440
Here-0.440
TokenMy
Token ID3666
Feature activation+3.75e+01

Pos logit contributions
 Everything+0.205
Would+0.203
 Finally+0.202
 Would+0.185
 Higher+0.184
Neg logit contributions
--0.700
ies-0.700
an-0.676
ater-0.643
 mess-0.638
Token car
Token ID1097
Feature activation-1.03e+01

Pos logit contributions
Don+1.714
My+1.698
 It+1.666
It+1.627
I+1.600
Neg logit contributions
ies-1.105
ishing-0.947
ig-0.879
--0.862
ish-0.850
Token broke
Token ID6265
Feature activation+7.02e+00

Pos logit contributions
ies+0.480
ishing+0.435
ig+0.418
-+0.414
ish+0.414
Neg logit contributions
Don-0.319
My-0.293
Well-0.279
I-0.277
 Did-0.276
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
Don+0.295
My+0.291
It+0.289
 It+0.280
I+0.276
Neg logit contributions
ies-0.219
 decision-0.201
an-0.190
ish-0.187
 eye-0.187
Token Can
Token ID1680
Feature activation+9.29e+00

Pos logit contributions
-+0.537
 eye+0.499
ish+0.498
 head+0.492
 ship+0.482
Neg logit contributions
Don-0.383
Well-0.359
Would-0.349
Looks-0.335
Here-0.334
Token you
Token ID345
Feature activation+9.86e+00

Pos logit contributions
Would+0.408
Don+0.387
Ah+0.384
Well+0.383
See+0.382
Neg logit contributions
--0.271
ies-0.264
ie-0.246
 airplane-0.230
ig-0.222
Token.
Token ID13
Feature activation-2.03e+01

Pos logit contributions
ies+0.653
-+0.610
 breath+0.544
 home+0.536
 decision+0.529
Neg logit contributions
Don-0.583
Would-0.549
Well-0.547
My-0.526
 Did-0.525
Token She
Token ID1375
Feature activation+1.85e+01

Pos logit contributions
ies+0.930
-+0.916
.+0.878
,+0.869
 house+0.845
Neg logit contributions
Don-0.450
Well-0.388
Ah-0.364
Would-0.361
Here-0.361
Token took
Token ID1718
Feature activation+3.09e+00

Pos logit contributions
Don+0.640
Would+0.589
Well+0.584
My+0.581
There+0.546
Neg logit contributions
ies-0.825
--0.764
ish-0.715
ishing-0.707
ig-0.703
Token a
Token ID257
Feature activation+4.28e+00

Pos logit contributions
Don+0.122
Would+0.113
Well+0.112
My+0.111
 Did+0.106
Neg logit contributions
ies-0.123
--0.113
ish-0.105
ishing-0.105
ig-0.103
Token deep
Token ID2769
Feature activation+1.57e-01

Pos logit contributions
Don+0.187
My+0.178
It+0.177
Would+0.174
There+0.171
Neg logit contributions
ies-0.144
ishing-0.128
an-0.121
ish-0.116
--0.115
Token breath
Token ID8033
Feature activation-4.21e+01

Pos logit contributions
Don+0.008
It+0.008
There+0.008
 It+0.008
 She+0.007
Neg logit contributions
ies-0.003
ishing-0.002
ish-0.002
ys-0.002
 decision-0.002
Token and
Token ID290
Feature activation-1.71e+01

Pos logit contributions
ies+1.136
ishing+0.927
ish+0.911
an+0.898
 decision+0.893
Neg logit contributions
Don-2.209
My-2.085
Well-2.067
It-2.034
Would-2.011
Token shouted
Token ID17293
Feature activation-1.37e+00

Pos logit contributions
ies+0.695
-+0.657
ig+0.585
ishing+0.584
 eye+0.584
Neg logit contributions
Don-0.629
Would-0.609
Well-0.603
My-0.583
 Did-0.571
Token "
Token ID366
Feature activation+7.07e+00

Pos logit contributions
ies+0.038
ishing+0.032
ish+0.031
an+0.031
 island+0.031
Neg logit contributions
Don-0.068
My-0.065
 It-0.064
Would-0.064
It-0.062
TokenST
Token ID2257
Feature activation+3.40e+00

Pos logit contributions
Don+0.139
 Did+0.135
Would+0.132
Well+0.127
My+0.120
Neg logit contributions
ies-0.370
--0.369
ishing-0.340
ater-0.339
ish-0.337
TokenOP
Token ID3185
Feature activation+3.71e+00

Pos logit contributions
Don+0.106
Would+0.096
My+0.094
Well+0.094
There+0.086
Neg logit contributions
ies-0.161
--0.155
ish-0.147
ig-0.146
ishing-0.145
Token,
Token ID11
Feature activation-2.29e+01

Pos logit contributions
Don+1.015
My+0.958
Would+0.955
Well+0.950
There+0.942
Neg logit contributions
ies-0.329
--0.266
ishing-0.264
an-0.259
ish-0.252
Token Lily
Token ID20037
Feature activation+5.17e+00

Pos logit contributions
ies+0.998
-+0.916
ish+0.853
ishing+0.839
ig+0.833
Neg logit contributions
Don-0.801
Well-0.746
Would-0.741
My-0.733
There-0.686
Token took
Token ID1718
Feature activation+3.09e+00

Pos logit contributions
Don+0.239
My+0.223
Would+0.219
Well+0.218
 Did+0.212
Neg logit contributions
ies-0.159
--0.143
ishing-0.141
ish-0.135
an-0.134
Token a
Token ID257
Feature activation+4.28e+00

Pos logit contributions
Don+0.104
My+0.096
Would+0.095
Well+0.094
There+0.090
Neg logit contributions
ies-0.140
--0.126
ishing-0.122
ish-0.121
an-0.120
Token deep
Token ID2769
Feature activation+1.57e-01

Pos logit contributions
Don+0.186
My+0.178
It+0.177
Would+0.174
Here+0.172
Neg logit contributions
ies-0.139
ishing-0.127
an-0.119
--0.110
ising-0.110
Token breath
Token ID8033
Feature activation-4.21e+01

Pos logit contributions
Don+0.008
There+0.008
It+0.008
 It+0.008
 She+0.007
Neg logit contributions
ies-0.003
 decision-0.002
ishing-0.002
ish-0.002
ys-0.002
Token and
Token ID290
Feature activation-1.71e+01

Pos logit contributions
ies+1.165
ishing+0.949
 decision+0.946
ish+0.940
an+0.926
Neg logit contributions
Don-2.169
My-2.057
Well-2.030
It-2.002
Would-1.955
Token went
Token ID1816
Feature activation-2.43e+00

Pos logit contributions
ies+0.706
-+0.658
ishing+0.591
ig+0.591
ish+0.590
Neg logit contributions
Don-0.633
Would-0.604
Well-0.601
My-0.583
 Did-0.560
Token into
Token ID656
Feature activation-7.17e+00

Pos logit contributions
ies+0.081
ish+0.073
ater+0.073
an+0.071
 decision+0.070
Neg logit contributions
Don-0.103
My-0.100
Would-0.099
 She-0.096
There-0.096
Token the
Token ID262
Feature activation+1.88e+00

Pos logit contributions
ies+0.318
-+0.290
ish+0.281
ishing+0.276
an+0.274
Neg logit contributions
Don-0.252
Would-0.232
My-0.231
Well-0.228
There-0.216
Token woods
Token ID16479
Feature activation-1.02e+01

Pos logit contributions
Don+0.095
My+0.090
Would+0.089
Well+0.089
It+0.087
Neg logit contributions
ies-0.055
--0.047
ishing-0.043
ish-0.042
ig-0.041
Token Tim
Token ID5045
Feature activation+1.08e+00

Pos logit contributions
Don+0.082
Would+0.076
Well+0.075
My+0.075
 Did+0.070
Neg logit contributions
ies-0.090
--0.082
ish-0.076
ishing-0.076
ig-0.075
Tokenmy
Token ID1820
Feature activation-9.36e+00

Pos logit contributions
Don+0.038
Well+0.035
My+0.035
Would+0.034
 Did+0.033
Neg logit contributions
ies-0.044
--0.040
ishing-0.039
ig-0.038
 breath-0.038
Token took
Token ID1718
Feature activation+3.45e+00

Pos logit contributions
ies+0.434
-+0.398
ishing+0.393
ig+0.382
ish+0.382
Neg logit contributions
Don-0.287
My-0.259
Well-0.259
Would-0.256
 Did-0.242
Token a
Token ID257
Feature activation+4.60e+00

Pos logit contributions
Don+0.119
Would+0.109
Well+0.108
My+0.108
There+0.101
Neg logit contributions
ies-0.154
--0.141
ish-0.134
ishing-0.134
ig-0.132
Token deep
Token ID2769
Feature activation+4.47e-01

Pos logit contributions
Don+0.204
My+0.193
It+0.191
Would+0.190
There+0.189
Neg logit contributions
ies-0.150
ishing-0.132
an-0.125
--0.118
ish-0.114
Token breath
Token ID8033
Feature activation-4.18e+01

Pos logit contributions
Don+0.021
 It+0.021
There+0.021
 He+0.020
It+0.020
Neg logit contributions
ies-0.009
 decision-0.007
ys-0.007
ishing-0.006
owing-0.006
Token and
Token ID290
Feature activation-1.69e+01

Pos logit contributions
ies+1.267
 decision+1.039
an+1.028
ishing+1.021
ish+1.018
Neg logit contributions
Don-2.094
My-1.960
Well-1.940
It-1.917
There-1.871
Token opened
Token ID4721
Feature activation+2.03e+00

Pos logit contributions
ies+0.703
-+0.655
ish+0.594
ishing+0.590
ig+0.589
Neg logit contributions
Don-0.618
Would-0.590
Well-0.586
My-0.568
 Did-0.548
Token the
Token ID262
Feature activation+1.88e+00

Pos logit contributions
Don+0.064
My+0.059
Would+0.057
Well+0.057
It+0.055
Neg logit contributions
ies-0.096
--0.087
ish-0.084
an-0.084
ishing-0.084
Token rocket
Token ID10701
Feature activation-2.10e+01

Pos logit contributions
Don+0.090
It+0.088
 It+0.086
My+0.086
Here+0.085
Neg logit contributions
ies-0.057
ishing-0.048
--0.045
ie-0.040
ig-0.039
Token.
Token ID13
Feature activation-1.89e+01

Pos logit contributions
ies+0.652
ishing+0.560
 choice+0.517
ater+0.506
ig+0.503
Neg logit contributions
 It-0.891
My-0.889
It-0.888
Don-0.870
 He-0.860
Token she
Token ID673
Feature activation+9.54e+00

Pos logit contributions
ies+0.507
-+0.472
ish+0.452
ishing+0.452
an+0.445
Neg logit contributions
Don-0.295
Would-0.265
Well-0.264
My-0.264
There-0.242
Token was
Token ID373
Feature activation+8.52e+00

Pos logit contributions
Don+0.311
Would+0.284
Well+0.282
My+0.280
 Did+0.260
Neg logit contributions
ies-0.442
--0.410
ish-0.387
ishing-0.385
ig-0.381
Token almost
Token ID2048
Feature activation+2.01e+00

Pos logit contributions
Don+0.318
My+0.307
It+0.297
Would+0.295
There+0.293
Neg logit contributions
ies-0.331
ish-0.294
an-0.293
--0.291
ishing-0.284
Token out
Token ID503
Feature activation-1.01e+01

Pos logit contributions
Don+0.080
My+0.074
Would+0.073
There+0.072
It+0.072
Neg logit contributions
ies-0.072
an-0.068
--0.066
ish-0.066
ig-0.066
Token of
Token ID286
Feature activation-2.09e+00

Pos logit contributions
an+0.296
ies+0.294
ish+0.294
ig+0.272
ishing+0.262
Neg logit contributions
Don-0.460
My-0.431
Would-0.431
Well-0.429
There-0.422
Token breath
Token ID8033
Feature activation-4.18e+01

Pos logit contributions
ies+0.091
an+0.084
ish+0.083
-+0.081
ig+0.080
Neg logit contributions
Don-0.076
Would-0.070
Well-0.068
My-0.068
There-0.067
Token.
Token ID13
Feature activation-1.97e+01

Pos logit contributions
ies+0.993
ish+0.816
an+0.800
ishing+0.799
ig+0.785
Neg logit contributions
Don-2.307
My-2.187
Well-2.173
Would-2.137
It-2.123
Token Finally
Token ID9461
Feature activation+1.34e+01

Pos logit contributions
-+0.808
ies+0.805
 home+0.799
.+0.795
,+0.773
Neg logit contributions
Don-0.486
Listen-0.403
Over-0.398
Take-0.393
My-0.390
Token she
Token ID673
Feature activation+1.20e+01

Pos logit contributions
Don+0.772
Well+0.735
My+0.732
Would+0.731
There+0.707
Neg logit contributions
ies-0.282
--0.231
ishing-0.216
ish-0.216
an-0.210
Token stopped
Token ID5025
Feature activation-2.27e+00

Pos logit contributions
Don+0.391
Would+0.360
Well+0.353
My+0.353
 Did+0.337
Neg logit contributions
ies-0.553
--0.522
ish-0.481
ig-0.479
ishing-0.476
Token.
Token ID13
Feature activation-1.89e+01

Pos logit contributions
ies+0.071
an+0.066
ish+0.062
ishing+0.061
ig+0.060
Neg logit contributions
Don-0.107
My-0.100
Would-0.096
It-0.095
Well-0.095
TokenL
Token ID43
Feature activation+1.08e+01

Pos logit contributions
 Did+0.362
Don+0.341
Well+0.320
Would+0.312
 Everything+0.301
Neg logit contributions
--0.570
ies-0.559
 home-0.521
ig-0.521
an-0.511
Tokenily
Token ID813
Feature activation+1.20e+01

Pos logit contributions
Don+0.342
Would+0.310
My+0.304
Well+0.304
 Did+0.287
Neg logit contributions
ies-0.515
--0.480
ig-0.448
ishing-0.446
ish-0.444
Token took
Token ID1718
Feature activation+3.45e+00

Pos logit contributions
Don+0.486
Well+0.450
Would+0.450
My+0.447
 Did+0.422
Neg logit contributions
ies-0.454
--0.412
ishing-0.391
ish-0.386
ig-0.380
Token a
Token ID257
Feature activation+4.60e+00

Pos logit contributions
Don+0.116
My+0.107
Would+0.106
Well+0.105
There+0.100
Neg logit contributions
ies-0.156
--0.141
ishing-0.136
ish-0.135
an-0.135
Token deep
Token ID2769
Feature activation+4.47e-01

Pos logit contributions
Don+0.208
My+0.197
It+0.192
Would+0.192
There+0.188
Neg logit contributions
ies-0.151
ishing-0.133
an-0.125
--0.125
ish-0.122
Token breath
Token ID8033
Feature activation-4.18e+01

Pos logit contributions
Don+0.025
There+0.023
It+0.023
Well+0.023
My+0.023
Neg logit contributions
ies-0.008
ishing-0.006
ish-0.005
 decision-0.005
--0.005
Token and
Token ID290
Feature activation-1.69e+01

Pos logit contributions
ies+1.157
ishing+0.952
ish+0.935
an+0.925
 decision+0.915
Neg logit contributions
Don-2.168
My-2.039
Well-2.026
It-1.976
Would-1.961
Token started
Token ID2067
Feature activation-4.19e+00

Pos logit contributions
ies+0.699
-+0.655
ig+0.587
ishing+0.583
ish+0.581
Neg logit contributions
Don-0.612
Would-0.592
Well-0.590
My-0.564
 Did-0.551
Token to
Token ID284
Feature activation-9.39e+00

Pos logit contributions
ies+0.154
an+0.139
ish+0.138
-+0.133
ishing+0.132
Neg logit contributions
Don-0.177
My-0.165
Would-0.162
Well-0.157
It-0.154
Token climb
Token ID12080
Feature activation-1.46e+01

Pos logit contributions
ies+0.362
-+0.329
ishing+0.315
an+0.307
ish+0.304
Neg logit contributions
Don-0.381
My-0.362
Would-0.342
Well-0.333
It-0.333
Token.
Token ID13
Feature activation-1.89e+01

Pos logit contributions
ies+0.507
ishing+0.433
-+0.430
ish+0.430
an+0.424
Neg logit contributions
Don-0.658
My-0.613
Well-0.605
Would-0.604
It-0.588
Token.
Token ID13
Feature activation-2.11e+01

Pos logit contributions
ies+0.792
ishing+0.697
ish+0.680
ig+0.641
an+0.633
Neg logit contributions
Don-1.783
Well-1.689
My-1.683
Would-1.661
It-1.639
Token He
Token ID679
Feature activation+1.76e+01

Pos logit contributions
ies+0.922
-+0.899
.+0.888
 home+0.860
,+0.857
Neg logit contributions
Don-0.508
My-0.412
Well-0.407
Here-0.406
Listen-0.402
Token took
Token ID1718
Feature activation+3.77e+00

Pos logit contributions
Don+0.595
Would+0.545
Well+0.542
My+0.540
 Did+0.502
Neg logit contributions
ies-0.797
--0.738
ish-0.696
ishing-0.692
ig-0.685
Token a
Token ID257
Feature activation+4.89e+00

Pos logit contributions
Don+0.140
Would+0.129
Well+0.129
My+0.129
There+0.121
Neg logit contributions
ies-0.158
--0.143
ish-0.136
ishing-0.135
ig-0.133
Token deep
Token ID2769
Feature activation+7.06e-01

Pos logit contributions
Don+0.212
My+0.200
It+0.199
Would+0.196
Here+0.192
Neg logit contributions
ies-0.169
ishing-0.150
an-0.139
ish-0.136
--0.135
Token breath
Token ID8033
Feature activation-4.16e+01

Pos logit contributions
Don+0.035
It+0.033
 It+0.033
There+0.033
Ah+0.033
Neg logit contributions
ies-0.015
ishing-0.011
ish-0.010
ys-0.010
owing-0.010
Token and
Token ID290
Feature activation-1.67e+01

Pos logit contributions
ies+1.137
ishing+0.908
ish+0.900
an+0.884
 decision+0.871
Neg logit contributions
Don-2.184
My-2.046
Well-2.043
It-2.004
Would-1.986
Token started
Token ID2067
Feature activation-4.00e+00

Pos logit contributions
ies+0.691
-+0.652
ishing+0.588
ish+0.585
ig+0.584
Neg logit contributions
Don-0.604
Would-0.576
Well-0.572
My-0.560
 Did-0.540
Token to
Token ID284
Feature activation-9.19e+00

Pos logit contributions
ies+0.146
ish+0.129
an+0.128
-+0.126
ishing+0.124
Neg logit contributions
Don-0.170
My-0.158
Would-0.157
Well-0.153
It-0.149
Token yell
Token ID28946
Feature activation-1.50e+01

Pos logit contributions
ies+0.346
-+0.311
ishing+0.299
ish+0.294
an+0.290
Neg logit contributions
Don-0.382
My-0.358
Would-0.347
Well-0.338
It-0.336
Token.
Token ID13
Feature activation-1.87e+01

Pos logit contributions
ies+0.417
ishing+0.361
ig+0.348
ish+0.344
an+0.344
Neg logit contributions
Don-0.747
 It-0.708
My-0.701
Would-0.688
It-0.684
Token a
Token ID257
Feature activation+3.92e+00

Pos logit contributions
ies+0.463
-+0.420
ish+0.409
ishing+0.405
ig+0.398
Neg logit contributions
Don-0.385
Would-0.356
My-0.355
Well-0.349
There-0.333
Token cave
Token ID11527
Feature activation-2.97e+01

Pos logit contributions
Don+0.182
Would+0.175
It+0.173
My+0.171
I+0.168
Neg logit contributions
ies-0.126
ishing-0.113
ish-0.103
--0.101
an-0.097
Token with
Token ID351
Feature activation-1.21e+01

Pos logit contributions
ies+0.738
ish+0.666
ishing+0.648
ig+0.633
an+0.603
Neg logit contributions
Don-1.491
It-1.396
Well-1.395
My-1.393
Would-1.391
Token my
Token ID616
Feature activation+4.72e-01

Pos logit contributions
ies+0.480
-+0.440
ish+0.393
ishing+0.392
 eye+0.391
Neg logit contributions
Don-0.479
Well-0.451
Would-0.445
My-0.431
 Did-0.415
Token fire
Token ID2046
Feature activation-1.82e+01

Pos logit contributions
Don+0.022
My+0.021
Would+0.020
Well+0.020
Here+0.020
Neg logit contributions
ies-0.013
ishing-0.012
--0.011
ig-0.011
ish-0.011
Token breath
Token ID8033
Feature activation-4.16e+01

Pos logit contributions
ishing+0.442
ies+0.423
 decision+0.411
ig+0.411
 breath+0.390
Neg logit contributions
 Did-0.792
 It-0.781
Don-0.767
I-0.764
 He-0.754
Token."
Token ID526
Feature activation-1.59e+01

Pos logit contributions
ies+0.938
-+0.763
ishing+0.731
ish+0.728
ig+0.723
Neg logit contributions
Don-2.321
My-2.199
Well-2.192
Would-2.177
It-2.127
Token\n
Token ID198
Feature activation+1.47e+01

Pos logit contributions
ies+0.737
-+0.724
.+0.675
 home+0.666
,+0.659
Neg logit contributions
Don-0.401
Would-0.354
Well-0.348
Over-0.336
There-0.336
Token\n
Token ID198
Feature activation+1.49e+01

Pos logit contributions
 Did+0.414
Don+0.389
Well+0.362
My+0.348
Looks+0.342
Neg logit contributions
--0.672
ies-0.628
.-0.579
an-0.576
 home-0.573
TokenThey
Token ID2990
Feature activation+1.08e+01

Pos logit contributions
 Did+0.447
Don+0.382
Well+0.378
 Everything+0.373
 It+0.363
Neg logit contributions
--0.631
ies-0.606
ater-0.574
an-0.558
ig-0.529
Token go
Token ID467
Feature activation-1.12e+01

Pos logit contributions
Don+0.408
Would+0.378
Well+0.376
My+0.373
 Did+0.351
Neg logit contributions
ies-0.445
--0.407
ish-0.383
ishing-0.380
ig-0.376
Token\n
Token ID198
Feature activation+1.35e+01

Pos logit contributions
 Did+0.357
Don+0.351
My+0.328
Well+0.314
Here+0.311
Neg logit contributions
--0.592
ies-0.539
.-0.528
 home-0.513
ig-0.500
TokenTom
Token ID13787
Feature activation+3.29e+00

Pos logit contributions
 Did+0.450
Don+0.404
Would+0.401
Well+0.388
Here+0.373
Neg logit contributions
--0.516
ies-0.470
ater-0.435
ig-0.433
an-0.426
Token took
Token ID1718
Feature activation+4.32e+00

Pos logit contributions
Don+0.121
Would+0.113
Well+0.112
My+0.111
 Did+0.105
Neg logit contributions
ies-0.139
--0.129
ish-0.120
ishing-0.119
ig-0.118
Token a
Token ID257
Feature activation+5.38e+00

Pos logit contributions
Don+0.158
Would+0.146
My+0.145
Well+0.145
There+0.136
Neg logit contributions
ies-0.183
--0.167
ish-0.158
ishing-0.158
ig-0.156
Token deep
Token ID2769
Feature activation+1.16e+00

Pos logit contributions
Don+0.233
My+0.216
It+0.213
Would+0.212
Well+0.204
Neg logit contributions
ies-0.190
ishing-0.171
--0.159
ish-0.158
an-0.156
Token breath
Token ID8033
Feature activation-4.12e+01

Pos logit contributions
Don+0.063
It+0.059
 It+0.059
There+0.057
 He+0.057
Neg logit contributions
ies-0.023
ishing-0.016
ish-0.016
ater-0.014
 decision-0.013
Token and
Token ID290
Feature activation-1.63e+01

Pos logit contributions
ies+1.160
ishing+0.960
ish+0.931
an+0.900
 decision+0.895
Neg logit contributions
Don-2.131
My-1.995
Well-1.976
It-1.942
Would-1.916
Token said
Token ID531
Feature activation+1.87e+00

Pos logit contributions
ies+0.647
-+0.625
 eye+0.549
 breath+0.544
ig+0.543
Neg logit contributions
Don-0.590
Would-0.583
Well-0.581
My-0.553
 Did-0.549
Token,
Token ID11
Feature activation-2.13e+01

Pos logit contributions
Don+0.116
Would+0.111
Well+0.111
My+0.110
 Did+0.106
Neg logit contributions
ies-0.031
--0.025
ish-0.021
ishing-0.020
ig-0.020
Token "
Token ID366
Feature activation+7.94e+00

Pos logit contributions
ies+0.929
-+0.919
ish+0.800
ig+0.787
 window+0.773
Neg logit contributions
Don-0.635
Here-0.614
Well-0.608
My-0.604
There-0.590
TokenI
Token ID40
Feature activation+2.17e+01

Pos logit contributions
 Did+0.134
 From+0.125
 Their+0.120
 Everywhere+0.104
 Everything+0.101
Neg logit contributions
an-0.405
--0.403
ater-0.396
ies-0.393
ishing-0.382
Token.
Token ID13
Feature activation-2.05e+01

Pos logit contributions
ies+0.177
-+0.151
ishing+0.147
an+0.142
ig+0.140
Neg logit contributions
Don-0.442
My-0.422
Would-0.415
Well-0.415
It-0.409
Token She
Token ID1375
Feature activation+2.00e+01

Pos logit contributions
ies+0.882
-+0.876
.+0.855
,+0.836
 home+0.824
Neg logit contributions
Don-0.486
Well-0.407
Listen-0.400
Take-0.385
Even-0.383
Token took
Token ID1718
Feature activation+4.32e+00

Pos logit contributions
Don+0.693
Would+0.632
Well+0.630
My+0.625
There+0.596
Neg logit contributions
ies-0.896
--0.833
ish-0.777
ishing-0.762
ig-0.759
Token a
Token ID257
Feature activation+5.38e+00

Pos logit contributions
Don+0.167
Well+0.154
Would+0.154
My+0.151
 Did+0.145
Neg logit contributions
ies-0.174
--0.163
ish-0.151
ishing-0.150
ig-0.147
Token deep
Token ID2769
Feature activation+1.16e+00

Pos logit contributions
Don+0.233
My+0.224
It+0.223
Would+0.219
There+0.216
Neg logit contributions
ies-0.179
ishing-0.159
an-0.150
--0.144
ig-0.142
Token breath
Token ID8033
Feature activation-4.12e+01

Pos logit contributions
Don+0.060
It+0.058
 It+0.058
There+0.057
Well+0.055
Neg logit contributions
ies-0.022
ishing-0.015
ys-0.015
ish-0.015
 decision-0.014
Token,
Token ID11
Feature activation-2.17e+01

Pos logit contributions
ies+1.166
ishing+0.947
ish+0.946
an+0.940
 decision+0.915
Neg logit contributions
Don-2.119
My-2.001
Well-1.978
It-1.960
Would-1.934
Token and
Token ID290
Feature activation-1.61e+01

Pos logit contributions
ies+0.622
-+0.565
ishing+0.504
ish+0.484
 eye+0.483
Neg logit contributions
Don-1.046
Well-1.009
Would-1.007
My-0.982
 Did-0.973
Token slowly
Token ID6364
Feature activation-2.42e+00

Pos logit contributions
ies+0.666
-+0.627
ishing+0.565
ish+0.562
ig+0.558
Neg logit contributions
Don-0.582
Well-0.551
Would-0.550
My-0.532
 Did-0.519
Token stepped
Token ID10764
Feature activation-6.63e-01

Pos logit contributions
ies+0.116
ish+0.104
an+0.104
 decision+0.102
-+0.102
Neg logit contributions
Don-0.074
My-0.066
Would-0.065
Well-0.065
There-0.063
Token closer
Token ID5699
Feature activation-5.76e+00

Pos logit contributions
ies+0.024
 decision+0.019
 choice+0.019
an+0.019
-+0.019
Neg logit contributions
Don-0.031
Would-0.029
My-0.028
It-0.028
There-0.028
Token torch
Token ID28034
Feature activation-2.22e+01

Pos logit contributions
Don+0.017
My+0.017
Here+0.017
Would+0.016
I+0.016
Neg logit contributions
ies-0.011
ishing-0.009
--0.008
ish-0.008
ig-0.008
Token,
Token ID11
Feature activation-2.28e+01

Pos logit contributions
ies+0.630
ishing+0.550
an+0.538
ish+0.537
 decision+0.531
Neg logit contributions
Don-1.058
My-0.996
Well-0.976
It-0.972
 It-0.958
Token took
Token ID1718
Feature activation+4.32e+00

Pos logit contributions
ies+0.775
-+0.701
ishing+0.636
ish+0.635
ig+0.624
Neg logit contributions
Don-1.009
Well-0.957
Would-0.957
My-0.939
 Did-0.900
Token a
Token ID257
Feature activation+5.38e+00

Pos logit contributions
Don+0.172
Would+0.160
My+0.159
Well+0.159
There+0.150
Neg logit contributions
ies-0.168
--0.152
ish-0.144
ishing-0.143
ig-0.141
Token deep
Token ID2769
Feature activation+1.16e+00

Pos logit contributions
Don+0.212
It+0.210
My+0.202
Ah+0.196
There+0.196
Neg logit contributions
ies-0.200
ishing-0.185
an-0.167
ish-0.164
--0.155
Token breath
Token ID8033
Feature activation-4.12e+01

Pos logit contributions
 It+0.052
It+0.051
Don+0.050
 He+0.049
There+0.049
Neg logit contributions
ies-0.028
ishing-0.022
ish-0.021
ys-0.020
owing-0.020
Token and
Token ID290
Feature activation-1.63e+01

Pos logit contributions
ies+1.050
ishing+0.879
ish+0.853
an+0.827
 decision+0.815
Neg logit contributions
Don-2.178
My-2.065
Well-2.048
It-2.021
Would-1.975
Token started
Token ID2067
Feature activation-3.61e+00

Pos logit contributions
ies+0.662
-+0.616
ishing+0.560
ish+0.559
ig+0.556
Neg logit contributions
Don-0.614
Would-0.580
Well-0.576
My-0.564
 Did-0.540
Token on
Token ID319
Feature activation-7.58e+00

Pos logit contributions
ies+0.132
ish+0.120
an+0.118
ishing+0.115
-+0.114
Neg logit contributions
Don-0.151
My-0.140
Would-0.138
Well-0.135
It-0.132
Token her
Token ID607
Feature activation+9.05e+00

Pos logit contributions
ies+0.328
ish+0.300
an+0.295
ishing+0.289
-+0.283
Neg logit contributions
Don-0.277
My-0.254
Would-0.250
Well-0.244
It-0.240
Token adventure
Token ID8855
Feature activation-3.22e+01

Pos logit contributions
My+0.393
Don+0.375
There+0.358
 +0.350
It+0.349
Neg logit contributions
ies-0.292
ishing-0.256
ish-0.235
ater-0.230
an-0.217
Token
Token ID198
Feature activation+1.35e+01

Pos logit contributions
 Did+0.381
Don+0.377
My+0.359
Well+0.345
Looks+0.324
Neg logit contributions
--0.542
ies-0.500
.-0.492
 home-0.489
an-0.472
TokenHe
Token ID1544
Feature activation+1.70e+01

Pos logit contributions
 Did+0.397
Don+0.344
 Everything+0.332
Well+0.330
Would+0.321
Neg logit contributions
--0.542
ies-0.502
ater-0.501
an-0.486
ig-0.472
Token took
Token ID1718
Feature activation+4.32e+00

Pos logit contributions
Don+0.608
Would+0.558
Well+0.556
My+0.554
 Did+0.522
Neg logit contributions
ies-0.736
--0.686
ish-0.639
ishing-0.632
ig-0.630
Token a
Token ID257
Feature activation+5.38e+00

Pos logit contributions
Don+0.158
Would+0.146
My+0.145
Well+0.145
There+0.136
Neg logit contributions
ies-0.183
--0.166
ish-0.158
ishing-0.158
ig-0.155
Token deep
Token ID2769
Feature activation+1.16e+00

Pos logit contributions
Don+0.229
It+0.216
My+0.216
Would+0.213
There+0.206
Neg logit contributions
ies-0.186
ishing-0.170
ish-0.153
an-0.153
--0.150
Token breath
Token ID8033
Feature activation-4.12e+01

Pos logit contributions
Don+0.057
It+0.055
 It+0.055
There+0.054
 He+0.053
Neg logit contributions
ies-0.025
ishing-0.018
ish-0.018
ys-0.017
ater-0.016
Token.
Token ID13
Feature activation-1.91e+01

Pos logit contributions
ies+1.143
ishing+0.937
ish+0.912
an+0.889
 decision+0.875
Neg logit contributions
Don-2.146
My-2.019
Well-2.003
It-1.965
Would-1.950
Token He
Token ID679
Feature activation+1.95e+01

Pos logit contributions
-+0.908
 eye+0.869
ies+0.852
 window+0.827
.+0.823
Neg logit contributions
Don-0.471
My-0.410
Well-0.381
Would-0.378
Here-0.369
Token said
Token ID531
Feature activation+2.03e+00

Pos logit contributions
Don+0.695
Would+0.648
Well+0.640
My+0.635
 Did+0.608
Neg logit contributions
ies-0.845
--0.792
ish-0.729
ig-0.723
ishing-0.722
Token,
Token ID11
Feature activation-2.12e+01

Pos logit contributions
Don+0.113
Would+0.107
My+0.107
Well+0.107
 Did+0.103
Neg logit contributions
ies-0.047
--0.038
ishing-0.036
ish-0.036
an-0.034
Token "
Token ID366
Feature activation+8.10e+00

Pos logit contributions
ies+0.947
-+0.934
ish+0.817
 eye+0.812
ig+0.809
Neg logit contributions
Don-0.623
Well-0.593
My-0.589
Here-0.579
Would-0.564
Token so
Token ID523
Feature activation-4.48e+00

Pos logit contributions
ies+0.874
-+0.797
ish+0.734
ishing+0.723
ig+0.718
Neg logit contributions
Don-0.956
Would-0.896
Well-0.894
My-0.888
 Did-0.844
Token he
Token ID339
Feature activation+9.44e+00

Pos logit contributions
ies+0.216
-+0.200
ish+0.183
ig+0.181
ishing+0.180
Neg logit contributions
Don-0.139
Would-0.130
Well-0.127
My-0.126
 Did-0.119
Token took
Token ID1718
Feature activation+4.32e+00

Pos logit contributions
Don+0.313
Would+0.287
Well+0.285
My+0.284
 Did+0.264
Neg logit contributions
ies-0.432
--0.400
ish-0.377
ishing-0.374
ig-0.371
Token a
Token ID257
Feature activation+5.38e+00

Pos logit contributions
Don+0.157
My+0.144
Would+0.144
Well+0.144
There+0.136
Neg logit contributions
ies-0.184
--0.166
ish-0.159
ishing-0.159
ig-0.156
Token deep
Token ID2769
Feature activation+1.16e+00

Pos logit contributions
Don+0.209
It+0.201
My+0.195
There+0.191
Would+0.190
Neg logit contributions
ies-0.201
ishing-0.187
an-0.169
ish-0.165
--0.163
Token breath
Token ID8033
Feature activation-4.12e+01

Pos logit contributions
 It+0.051
 He+0.051
Don+0.051
It+0.050
There+0.049
Neg logit contributions
ies-0.025
ys-0.019
ishing-0.019
ater-0.018
ish-0.018
Token and
Token ID290
Feature activation-1.63e+01

Pos logit contributions
ies+1.086
ishing+0.871
ish+0.863
an+0.838
ig+0.819
Neg logit contributions
Don-2.195
My-2.063
Well-2.054
It-2.018
Would-2.003
Token kn
Token ID638
Feature activation-6.00e+00

Pos logit contributions
ies+0.680
-+0.637
ish+0.578
ishing+0.576
ig+0.571
Neg logit contributions
Don-0.591
Would-0.560
Well-0.553
My-0.546
 Did-0.522
Tokenelt
Token ID2120
Feature activation+4.32e+00

Pos logit contributions
ies+0.309
-+0.293
ish+0.277
ishing+0.277
ig+0.272
Neg logit contributions
Don-0.165
Would-0.150
Well-0.147
My-0.146
 Did-0.133
Token down
Token ID866
Feature activation-1.05e+01

Pos logit contributions
Don+0.205
Well+0.186
My+0.185
Would+0.184
It+0.182
Neg logit contributions
ies-0.132
--0.114
ig-0.111
an-0.109
ish-0.109
Token on
Token ID319
Feature activation-7.29e+00

Pos logit contributions
ies+0.305
ish+0.255
ig+0.248
an+0.246
-+0.246
Neg logit contributions
Don-0.534
Would-0.495
Well-0.495
My-0.495
It-0.480
Token mom
Token ID1995
Feature activation-2.00e+01

Pos logit contributions
ies+0.028
ishing+0.026
-+0.024
ish+0.024
an+0.023
Neg logit contributions
Don-0.028
My-0.027
Well-0.027
Would-0.026
It-0.026
Tokenmy
Token ID1820
Feature activation-8.39e+00

Pos logit contributions
ies+0.886
-+0.818
ish+0.771
ishing+0.768
ig+0.758
Neg logit contributions
Don-0.697
Would-0.640
Well-0.637
My-0.634
 Did-0.591
Token took
Token ID1718
Feature activation+4.32e+00

Pos logit contributions
ies+0.376
-+0.346
ishing+0.330
ish+0.329
ig+0.326
Neg logit contributions
Don-0.284
Would-0.261
Well-0.258
My-0.258
 Did-0.240
Token a
Token ID257
Feature activation+5.38e+00

Pos logit contributions
Don+0.147
Would+0.135
Well+0.134
My+0.134
 Did+0.125
Neg logit contributions
ies-0.194
--0.178
ish-0.169
ishing-0.169
ig-0.166
Token deep
Token ID2769
Feature activation+1.16e+00

Pos logit contributions
Don+0.227
My+0.217
It+0.217
Would+0.214
Here+0.210
Neg logit contributions
ies-0.186
ishing-0.167
an-0.159
ish-0.152
ater-0.145
Token breath
Token ID8033
Feature activation-4.12e+01

Pos logit contributions
Don+0.059
There+0.057
 It+0.056
It+0.055
 She+0.055
Neg logit contributions
ies-0.023
ishing-0.017
 decision-0.017
ish-0.017
ys-0.016
Token and
Token ID290
Feature activation-1.63e+01

Pos logit contributions
ies+1.092
ishing+0.893
an+0.879
ish+0.879
 decision+0.861
Neg logit contributions
Don-2.156
Well-2.033
My-2.030
It-1.978
Would-1.966
Token smiled
Token ID13541
Feature activation-2.75e+00

Pos logit contributions
ies+0.673
-+0.624
ishing+0.559
ish+0.554
ig+0.552
Neg logit contributions
Don-0.603
Well-0.578
Would-0.571
My-0.558
 Did-0.538
Token.
Token ID13
Feature activation-1.87e+01

Pos logit contributions
ies+0.066
ig+0.064
an+0.060
ish+0.058
 decision+0.056
Neg logit contributions
Don-0.143
My-0.135
Would-0.132
It-0.131
 She-0.129
Token She
Token ID1375
Feature activation+2.16e+01

Pos logit contributions
 window+0.720
ies+0.700
.+0.676
,+0.671
-+0.665
Neg logit contributions
Don-0.525
Ah-0.518
My-0.508
Take-0.499
Over-0.474
Token said
Token ID531
Feature activation+2.32e+00

Pos logit contributions
Don+0.803
Would+0.749
My+0.736
Well+0.729
 Did+0.725
Neg logit contributions
ies-0.914
--0.875
ig-0.765
ish-0.761
an-0.746
Token
Token ID198
Feature activation+1.35e+01

Pos logit contributions
 Did+0.373
Don+0.361
My+0.352
Well+0.339
Would+0.308
Neg logit contributions
--0.554
ies-0.526
.-0.504
 home-0.501
 mistake-0.469
TokenMark
Token ID9704
Feature activation+9.65e+00

Pos logit contributions
 Did+0.457
Don+0.403
Well+0.400
 Everything+0.396
Would+0.394
Neg logit contributions
--0.491
ies-0.465
ater-0.443
an-0.420
ig-0.417
Token took
Token ID1718
Feature activation+4.32e+00

Pos logit contributions
Don+0.407
Well+0.380
My+0.379
Would+0.378
It+0.357
Neg logit contributions
ies-0.345
--0.305
ishing-0.294
ish-0.290
ig-0.286
Token a
Token ID257
Feature activation+5.38e+00

Pos logit contributions
Don+0.148
My+0.137
Would+0.136
Well+0.135
There+0.128
Neg logit contributions
ies-0.191
--0.170
ishing-0.166
ish-0.166
ig-0.164
Token big
Token ID1263
Feature activation-1.52e+00

Pos logit contributions
Don+0.235
My+0.225
It+0.223
Would+0.218
There+0.212
Neg logit contributions
ies-0.171
ishing-0.157
an-0.143
--0.138
ish-0.138
Token breath
Token ID8033
Feature activation-4.12e+01

Pos logit contributions
ies+0.050
ishing+0.045
an+0.039
ater+0.038
 surroundings+0.038
Neg logit contributions
 It-0.060
Don-0.056
Grab-0.056
 -0.056
Be-0.055
Token and
Token ID290
Feature activation-1.63e+01

Pos logit contributions
ies+1.237
ishing+1.041
ish+1.001
an+0.985
 decision+0.971
Neg logit contributions
Don-2.045
My-1.933
Well-1.902
It-1.890
That-1.839
Token dove
Token ID38862
Feature activation-1.90e+01

Pos logit contributions
ies+0.658
-+0.635
ishing+0.551
ig+0.549
 eye+0.548
Neg logit contributions
Don-0.592
Would-0.575
Well-0.568
My-0.543
 Did-0.541
Token down
Token ID866
Feature activation-1.06e+01

Pos logit contributions
ies+0.610
 decision+0.549
 mess+0.534
 choice+0.527
ishing+0.516
Neg logit contributions
Don-0.864
My-0.834
Would-0.799
It-0.769
Come-0.759
Token to
Token ID284
Feature activation-8.71e+00

Pos logit contributions
ies+0.366
ish+0.314
-+0.303
ishing+0.303
ig+0.300
Neg logit contributions
Don-0.490
Would-0.454
My-0.454
Well-0.449
There-0.434
Token the
Token ID262
Feature activation+2.72e+00

Pos logit contributions
ies+0.339
-+0.320
ish+0.293
ishing+0.292
 eye+0.290
Neg logit contributions
Don-0.342
Well-0.319
Would-0.314
My-0.308
 Did-0.304
Token.
Token ID13
Feature activation-2.05e+01

Pos logit contributions
ies+0.643
-+0.558
ishing+0.537
ish+0.535
an+0.522
Neg logit contributions
Don-1.098
My-1.046
Would-1.033
Well-1.028
It-0.997
Token He
Token ID679
Feature activation+1.82e+01

Pos logit contributions
ies+0.833
-+0.824
.+0.817
,+0.796
 house+0.795
Neg logit contributions
Don-0.494
Listen-0.415
Well-0.411
Over-0.407
Here-0.392
Token took
Token ID1718
Feature activation+4.32e+00

Pos logit contributions
Don+0.655
Would+0.605
Well+0.598
My+0.595
There+0.565
Neg logit contributions
ies-0.785
--0.732
ish-0.681
ig-0.668
ishing-0.662
Token a
Token ID257
Feature activation+5.38e+00

Pos logit contributions
Don+0.170
Would+0.158
My+0.158
Well+0.157
There+0.149
Neg logit contributions
ies-0.170
--0.153
ish-0.145
ishing-0.145
ig-0.142
Token big
Token ID1263
Feature activation-1.52e+00

Pos logit contributions
Don+0.222
It+0.215
My+0.214
Would+0.206
There+0.205
Neg logit contributions
ies-0.188
ishing-0.169
an-0.160
ish-0.153
--0.149
Token breath
Token ID8033
Feature activation-4.12e+01

Pos logit contributions
ies+0.055
ishing+0.048
an+0.047
ans+0.044
ater+0.043
Neg logit contributions
 It-0.057
Grab-0.052
It-0.052
 Higher-0.052
Don-0.052
Token and
Token ID290
Feature activation-1.63e+01

Pos logit contributions
ies+1.167
ishing+0.955
ish+0.932
an+0.921
 decision+0.913
Neg logit contributions
Don-2.097
My-1.988
Well-1.971
It-1.952
Would-1.907
Token then
Token ID788
Feature activation-4.01e+00

Pos logit contributions
ies+0.666
-+0.642
 eye+0.564
ishing+0.559
ish+0.555
Neg logit contributions
Don-0.587
Would-0.569
Well-0.560
My-0.540
 Did-0.529
Token carefully
Token ID7773
Feature activation-8.30e-01

Pos logit contributions
ies+0.181
-+0.167
ish+0.158
ishing+0.158
ig+0.156
Neg logit contributions
Don-0.136
Would-0.124
Well-0.123
My-0.123
It-0.114
Token walked
Token ID6807
Feature activation+2.09e+00

Pos logit contributions
ies+0.039
ish+0.034
an+0.034
 decision+0.034
ishing+0.033
Neg logit contributions
Don-0.027
My-0.024
Would-0.024
Well-0.024
There-0.023
Token up
Token ID510
Feature activation-1.26e+01

Pos logit contributions
Don+0.103
Would+0.095
My+0.095
Well+0.093
It+0.092
Neg logit contributions
ies-0.064
ish-0.055
ig-0.054
ishing-0.052
--0.051
Token So
Token ID1406
Feature activation-2.31e+00

Pos logit contributions
-+0.856
ies+0.848
.+0.832
,+0.813
 window+0.792
Neg logit contributions
Don-0.486
Well-0.436
Over-0.428
Listen-0.409
Here-0.395
Token he
Token ID339
Feature activation+9.70e+00

Pos logit contributions
ies+0.083
-+0.075
ish+0.069
ishing+0.069
ig+0.068
Neg logit contributions
Don-0.100
Would-0.094
Well-0.093
My-0.092
 Did-0.088
Token took
Token ID1718
Feature activation+4.55e+00

Pos logit contributions
Don+0.360
Would+0.333
Well+0.331
My+0.330
 Did+0.309
Neg logit contributions
ies-0.404
--0.370
ish-0.349
ishing-0.349
ig-0.343
Token a
Token ID257
Feature activation+5.60e+00

Pos logit contributions
Don+0.164
Would+0.151
My+0.150
Well+0.150
There+0.140
Neg logit contributions
ies-0.196
--0.179
ish-0.169
ishing-0.169
ig-0.167
Token deep
Token ID2769
Feature activation+1.35e+00

Pos logit contributions
Don+0.238
It+0.226
My+0.224
Would+0.218
There+0.216
Neg logit contributions
ies-0.191
ishing-0.178
an-0.160
ish-0.157
 size-0.151
Token breath
Token ID8033
Feature activation-4.10e+01

Pos logit contributions
Don+0.064
 It+0.062
It+0.062
There+0.060
 She+0.060
Neg logit contributions
ies-0.029
ishing-0.022
ys-0.021
castle-0.021
ish-0.020
Token and
Token ID290
Feature activation-1.61e+01

Pos logit contributions
ies+1.143
ishing+0.927
ish+0.906
an+0.891
 decision+0.884
Neg logit contributions
Don-2.139
My-2.019
Well-1.989
It-1.975
Would-1.943
Token whist
Token ID32262
Feature activation-1.47e+01

Pos logit contributions
ies+0.654
-+0.617
ishing+0.550
ish+0.547
ig+0.544
Neg logit contributions
Don-0.593
Would-0.569
Well-0.569
My-0.550
 Did-0.539
Tokenled
Token ID992
Feature activation+9.71e+00

Pos logit contributions
ies+0.436
-+0.385
ish+0.353
ishing+0.351
ig+0.343
Neg logit contributions
Don-0.727
Would-0.684
Well-0.682
My-0.680
 Did-0.648
Token as
Token ID355
Feature activation-1.79e+01

Pos logit contributions
Don+0.434
My+0.398
Would+0.397
Well+0.384
It+0.383
Neg logit contributions
ies-0.337
ish-0.291
ig-0.289
an-0.285
ishing-0.284
Token loud
Token ID7812
Feature activation+2.15e+00

Pos logit contributions
ies+0.810
-+0.764
ig+0.674
 breath+0.670
ish+0.669
Neg logit contributions
Don-0.585
Would-0.563
Well-0.554
My-0.537
 Did-0.515
Token her
Token ID607
Feature activation+7.72e+00

Pos logit contributions
ies+0.083
-+0.077
ish+0.072
ishing+0.072
ig+0.071
Neg logit contributions
Don-0.062
Would-0.057
Well-0.057
My-0.057
 Did-0.053
Token to
Token ID284
Feature activation-9.80e+00

Pos logit contributions
Don+0.317
Would+0.296
Well+0.294
My+0.292
 Did+0.276
Neg logit contributions
ies-0.295
--0.268
ish-0.249
ishing-0.247
ig-0.244
Token take
Token ID1011
Feature activation-2.26e+00

Pos logit contributions
ies+0.360
-+0.324
ishing+0.315
ish+0.309
ig+0.303
Neg logit contributions
Don-0.408
My-0.385
Well-0.373
Would-0.372
It-0.367
Token a
Token ID257
Feature activation+5.79e+00

Pos logit contributions
ies+0.097
-+0.088
ish+0.084
ishing+0.083
ig+0.082
Neg logit contributions
Don-0.082
Would-0.076
Well-0.075
My-0.075
It-0.070
Token deep
Token ID2769
Feature activation+1.55e+00

Pos logit contributions
Don+0.243
My+0.236
It+0.233
Would+0.231
There+0.224
Neg logit contributions
ies-0.201
ishing-0.180
an-0.172
ish-0.164
ater-0.153
Token breath
Token ID8033
Feature activation-4.08e+01

Pos logit contributions
Don+0.067
There+0.064
Be+0.064
It+0.064
 It+0.063
Neg logit contributions
ies-0.036
ys-0.029
ish-0.027
ishing-0.027
an-0.026
Token.
Token ID13
Feature activation-1.87e+01

Pos logit contributions
ies+1.017
ishing+0.838
ish+0.820
an+0.816
ig+0.781
Neg logit contributions
Don-2.185
My-2.086
Well-2.049
It-2.011
Would-2.008
Token Sarah
Token ID10490
Feature activation-3.12e-01

Pos logit contributions
-+0.740
.+0.721
ies+0.707
,+0.700
 home+0.698
Neg logit contributions
Don-0.508
Looks-0.461
Take-0.460
Listen-0.447
Over-0.446
Token closed
Token ID4838
Feature activation-6.01e+00

Pos logit contributions
ies+0.014
-+0.013
ish+0.012
ishing+0.012
ig+0.012
Neg logit contributions
Don-0.011
Would-0.010
Well-0.010
My-0.010
 Did-0.009
Token her
Token ID607
Feature activation+9.36e+00

Pos logit contributions
ies+0.286
-+0.261
ish+0.258
an+0.252
ishing+0.250
Neg logit contributions
Don-0.189
My-0.172
Would-0.169
Well-0.168
It-0.157
Token eyes
Token ID2951
Feature activation-2.54e+01

Pos logit contributions
Don+0.394
My+0.384
 It+0.375
 Something+0.364
It+0.364
Neg logit contributions
ies-0.306
ishing-0.262
an-0.248
ish-0.247
 island-0.242
Token.
Token ID13
Feature activation-1.99e+01

Pos logit contributions
ies+0.227
ishing+0.216
ig+0.212
an+0.212
ater+0.207
Neg logit contributions
Don-0.366
 It-0.357
 She-0.345
Would-0.339
My-0.337
Token She
Token ID1375
Feature activation+2.05e+01

Pos logit contributions
ies+0.839
 house+0.835
 window+0.825
 home+0.823
-+0.822
Neg logit contributions
Don-0.485
My-0.417
Ah-0.413
Take-0.413
Here-0.412
Token took
Token ID1718
Feature activation+4.77e+00

Pos logit contributions
Don+0.685
Well+0.633
Would+0.632
My+0.629
Here+0.610
Neg logit contributions
ies-0.930
--0.883
ish-0.797
ig-0.788
 eye-0.783
Token a
Token ID257
Feature activation+5.79e+00

Pos logit contributions
Don+0.189
Would+0.176
Well+0.175
My+0.174
 Did+0.164
Neg logit contributions
ies-0.189
--0.172
ish-0.161
ishing-0.161
ig-0.158
Token deep
Token ID2769
Feature activation+1.55e+00

Pos logit contributions
Don+0.232
It+0.222
My+0.220
Would+0.215
There+0.208
Neg logit contributions
ies-0.209
ishing-0.193
an-0.180
ish-0.173
 size-0.167
Token breath
Token ID8033
Feature activation-4.08e+01

Pos logit contributions
Don+0.076
 It+0.074
It+0.073
 She+0.072
There+0.071
Neg logit contributions
ies-0.031
ishing-0.023
ish-0.022
ater-0.021
ys-0.021
Token and
Token ID290
Feature activation-1.59e+01

Pos logit contributions
ies+1.110
ishing+0.911
ish+0.898
an+0.893
 decision+0.878
Neg logit contributions
Don-2.137
My-2.009
Well-1.994
It-1.955
Would-1.940
Token decided
Token ID3066
Feature activation+4.83e+00

Pos logit contributions
ies+0.665
-+0.629
ishing+0.562
 eye+0.562
ish+0.561
Neg logit contributions
Don-0.573
Would-0.547
Well-0.544
My-0.527
 Did-0.512
Token to
Token ID284
Feature activation-8.55e+00

Pos logit contributions
Don+0.224
Would+0.209
My+0.208
Well+0.208
It+0.198
Neg logit contributions
ies-0.159
--0.142
ish-0.133
ishing-0.132
ig-0.130
Token go
Token ID467
Feature activation-1.08e+01

Pos logit contributions
ies+0.333
-+0.299
ishing+0.290
ish+0.281
ig+0.277
Neg logit contributions
Don-0.342
My-0.325
Would-0.316
Well-0.313
It-0.308
Token inside
Token ID2641
Feature activation-1.97e+01

Pos logit contributions
ies+0.346
ater+0.297
ishing+0.297
ish+0.289
an+0.283
Neg logit contributions
Don-0.464
My-0.454
Would-0.443
It-0.441
There-0.433
Token
Token ID198
Feature activation+1.40e+01

Pos logit contributions
 Did+0.370
Don+0.355
Well+0.329
Would+0.313
My+0.310
Neg logit contributions
--0.644
ies-0.612
.-0.570
an-0.554
 home-0.548
TokenJack
Token ID14295
Feature activation+7.15e+00

Pos logit contributions
 Did+0.422
Don+0.367
 Everything+0.362
Well+0.356
Would+0.354
Neg logit contributions
--0.596
ies-0.579
ater-0.531
an-0.522
ig-0.503
Token took
Token ID1718
Feature activation+4.77e+00

Pos logit contributions
Don+0.249
Well+0.228
Would+0.227
My+0.226
It+0.211
Neg logit contributions
ies-0.315
--0.289
ishing-0.274
ish-0.273
ig-0.270
Token a
Token ID257
Feature activation+5.79e+00

Pos logit contributions
Don+0.155
My+0.143
Would+0.142
Well+0.141
There+0.133
Neg logit contributions
ies-0.220
--0.196
ish-0.190
ishing-0.190
ig-0.189
Token deep
Token ID2769
Feature activation+1.55e+00

Pos logit contributions
Don+0.267
My+0.252
It+0.246
Would+0.246
Well+0.241
Neg logit contributions
ies-0.185
ishing-0.160
--0.153
an-0.151
ish-0.148
Token breath
Token ID8033
Feature activation-4.08e+01

Pos logit contributions
Don+0.087
It+0.082
 It+0.081
Well+0.081
My+0.080
Neg logit contributions
ies-0.028
ish-0.019
ishing-0.019
 island-0.017
an-0.016
Token and
Token ID290
Feature activation-1.59e+01

Pos logit contributions
ies+1.219
ishing+1.014
ish+0.988
an+0.982
ig+0.932
Neg logit contributions
Don-2.057
My-1.926
Well-1.910
It-1.891
Would-1.846
Token was
Token ID373
Feature activation+1.11e+01

Pos logit contributions
ies+0.657
-+0.631
ishing+0.557
ish+0.553
ig+0.551
Neg logit contributions
Don-0.571
Would-0.553
Well-0.542
My-0.529
 Did-0.521
Token relieved
Token ID22598
Feature activation+6.76e+00

Pos logit contributions
Don+0.441
Would+0.410
Well+0.408
My+0.406
 Did+0.383
Neg logit contributions
ies-0.435
--0.398
ish-0.371
ishing-0.370
ig-0.365
Token.
Token ID13
Feature activation-1.83e+01

Pos logit contributions
Don+0.337
Well+0.310
It+0.309
My+0.308
Would+0.306
Neg logit contributions
ies-0.195
ish-0.170
ig-0.168
an-0.168
ishing-0.162
Token They
Token ID1119
Feature activation+1.50e+01

Pos logit contributions
-+0.781
 home+0.741
ies+0.740
.+0.730
 window+0.727
Neg logit contributions
Don-0.477
My-0.422
Here-0.388
Would-0.382
There-0.379
Token.
Token ID13
Feature activation-1.99e+01

Pos logit contributions
ies+0.102
ig+0.100
an+0.095
ishing+0.093
 island+0.090
Neg logit contributions
Don-0.194
Well-0.185
 Did-0.184
I-0.182
 It-0.181
Token She
Token ID1375
Feature activation+2.05e+01

Pos logit contributions
-+0.695
.+0.665
,+0.654
ies+0.648
ish+0.636
Neg logit contributions
Don-0.635
Over-0.541
There-0.525
Well-0.515
Ah-0.514
Token took
Token ID1718
Feature activation+4.77e+00

Pos logit contributions
Don+0.747
Well+0.690
Would+0.687
My+0.684
It+0.643
Neg logit contributions
ies-0.869
--0.797
ishing-0.758
ish-0.748
ig-0.745
Token a
Token ID257
Feature activation+5.79e+00

Pos logit contributions
Don+0.171
Would+0.157
Well+0.156
My+0.156
It+0.146
Neg logit contributions
ies-0.206
--0.189
ish-0.179
ishing-0.178
ig-0.176
Token deep
Token ID2769
Feature activation+1.55e+00

Pos logit contributions
Don+0.265
It+0.251
My+0.250
Would+0.246
Here+0.243
Neg logit contributions
ies-0.183
ishing-0.164
an-0.154
--0.149
ish-0.148
Token breath
Token ID8033
Feature activation-4.08e+01

Pos logit contributions
Don+0.073
 It+0.071
It+0.071
Look+0.069
Be+0.068
Neg logit contributions
ies-0.031
ishing-0.024
ys-0.024
ish-0.023
 island-0.022
Token and
Token ID290
Feature activation-1.59e+01

Pos logit contributions
ies+0.978
ishing+0.781
-+0.779
ish+0.764
ig+0.736
Neg logit contributions
Don-2.240
My-2.116
Well-2.116
Would-2.081
It-2.067
Token said
Token ID531
Feature activation+2.16e+00

Pos logit contributions
ies+0.634
-+0.588
ish+0.540
ishing+0.539
ig+0.530
Neg logit contributions
Don-0.611
Would-0.576
Well-0.572
My-0.563
 Did-0.535
Token,
Token ID11
Feature activation-2.10e+01

Pos logit contributions
Don+0.120
Would+0.113
Well+0.113
My+0.113
 Did+0.109
Neg logit contributions
ies-0.051
--0.043
ishing-0.039
ish-0.039
ig-0.037
Token "
Token ID366
Feature activation+8.25e+00

Pos logit contributions
ies+0.920
-+0.891
ish+0.801
ig+0.791
 mess+0.790
Neg logit contributions
Don-0.597
Ah-0.582
Well-0.580
My-0.569
Would-0.564
TokenPlease
Token ID5492
Feature activation+1.39e+01

Pos logit contributions
 Did+0.153
 Everything+0.151
 From+0.134
 Who+0.129
Would+0.128
Neg logit contributions
ater-0.412
--0.406
ies-0.384
ishing-0.383
an-0.366
Token better
Token ID1365
Feature activation-1.88e-01

Pos logit contributions
Don+0.560
Would+0.514
My+0.494
There+0.490
It+0.488
Neg logit contributions
ies-0.645
an-0.566
ish-0.550
--0.550
ig-0.545
Token than
Token ID621
Feature activation-1.55e+01

Pos logit contributions
ies+0.006
an+0.005
ish+0.005
 fort+0.005
ig+0.005
Neg logit contributions
Don-0.009
My-0.008
Well-0.008
It-0.008
That-0.008
Token before
Token ID878
Feature activation-3.97e+00

Pos logit contributions
ies+0.571
-+0.517
ish+0.485
ishing+0.481
ig+0.474
Neg logit contributions
Don-0.653
Would-0.609
Well-0.606
My-0.605
 Did-0.571
Token.
Token ID13
Feature activation-1.34e+01

Pos logit contributions
ies+0.094
ish+0.083
an+0.080
-+0.079
ishing+0.078
Neg logit contributions
Don-0.219
Well-0.206
Would-0.205
My-0.205
It-0.200
Token
Token ID220
Feature activation+1.37e+01

Pos logit contributions
,+0.541
-+0.536
.+0.535
 window+0.523
 home+0.520
Neg logit contributions
Don-0.387
Get-0.384
Ah-0.358
Here-0.349
Looks-0.344
Token
Token ID198
Feature activation+2.06e+01

Pos logit contributions
Don+0.376
Would+0.340
Well+0.329
Ah+0.328
Looks+0.308
Neg logit contributions
ies-0.648
--0.643
 home-0.603
.-0.593
ig-0.574
Token
Token ID198
Feature activation+2.06e+01

Pos logit contributions
Don+0.622
 Did+0.599
Looks+0.585
Well+0.548
Get+0.539
Neg logit contributions
--0.907
 home-0.811
ies-0.807
.-0.767
,-0.755
TokenThe
Token ID464
Feature activation+2.98e+01

Pos logit contributions
 Did+0.524
Don+0.487
Well+0.448
Would+0.436
Looks+0.412
Neg logit contributions
--1.019
ies-0.931
 airplane-0.852
 home-0.847
ig-0.845
Token End
Token ID5268
Feature activation-6.11e-01

Pos logit contributions
Don+1.262
Well+1.197
Would+1.182
My+1.157
That+1.113
Neg logit contributions
ies-1.144
--0.987
ishing-0.974
ish-0.952
ig-0.940
Token<|endoftext|>
Token ID50256
Feature activation-7.45e-01

Pos logit contributions
ies+0.014
-+0.013
ig+0.010
ish+0.010
ishing+0.010
Neg logit contributions
Don-0.034
Would-0.032
My-0.032
Well-0.032
 Did-0.031
TokenOnce
Token ID7454
Feature activation+9.69e+00

Pos logit contributions
-+0.032
ies+0.030
.+0.029
ig+0.029
 home+0.028
Neg logit contributions
Don-0.023
Well-0.020
Would-0.020
My-0.020
Even-0.019
Token angry
Token ID7954
Feature activation+6.92e-01

Pos logit contributions
Don+0.119
Would+0.110
My+0.109
Well+0.109
It+0.104
Neg logit contributions
ies-0.106
--0.096
ishing-0.091
ish-0.091
ig-0.090
Token and
Token ID290
Feature activation-1.09e+01

Pos logit contributions
Don+0.032
 She+0.030
 Why+0.029
It+0.029
 It+0.029
Neg logit contributions
ies-0.022
ish-0.020
an-0.020
ig-0.020
 island-0.020
Token very
Token ID845
Feature activation+2.88e+00

Pos logit contributions
ies+0.389
-+0.354
 mess+0.314
ish+0.313
 breath+0.299
Neg logit contributions
Would-0.449
Don-0.448
Well-0.443
My-0.435
There-0.423
Token worried
Token ID7960
Feature activation+1.61e+00

Pos logit contributions
Don+0.134
Would+0.127
Well+0.126
My+0.125
 Did+0.119
Neg logit contributions
ies-0.095
--0.085
ish-0.077
ishing-0.076
ig-0.075
Token.
Token ID13
Feature activation-1.36e+01

Pos logit contributions
Don+0.089
My+0.084
Well+0.083
It+0.082
That+0.081
Neg logit contributions
ies-0.037
ish-0.032
ishing-0.032
an-0.031
ig-0.031
Token She
Token ID1375
Feature activation+2.66e+01

Pos logit contributions
 window+0.482
,+0.479
.+0.471
 house+0.457
 eye+0.425
Neg logit contributions
Would-0.403
Take-0.401
Over-0.398
Looks-0.396
Ah-0.381
Token called
Token ID1444
Feature activation-5.04e+00

Pos logit contributions
Don+0.975
Would+0.943
My+0.916
Well+0.911
There+0.882
Neg logit contributions
ies-1.120
--1.061
ish-0.939
 eye-0.932
 mess-0.932
Token the
Token ID262
Feature activation+7.58e+00

Pos logit contributions
ies+0.210
-+0.187
ishing+0.186
ish+0.182
ig+0.181
Neg logit contributions
Don-0.188
My-0.173
Would-0.171
Well-0.166
 Did-0.163
Token doctor
Token ID6253
Feature activation-6.69e+00

Pos logit contributions
Don+0.361
Would+0.340
My+0.330
Here+0.328
Well+0.327
Neg logit contributions
ies-0.251
ishing-0.220
--0.214
ig-0.211
ish-0.210
Token and
Token ID290
Feature activation-1.07e+01

Pos logit contributions
ies+0.188
ishing+0.162
ig+0.157
ish+0.154
an+0.154
Neg logit contributions
Don-0.332
Would-0.316
My-0.309
 Did-0.308
Well-0.304
Token took
Token ID1718
Feature activation+1.07e+01

Pos logit contributions
ies+0.427
-+0.404
 eye+0.351
ish+0.349
 mess+0.347
Neg logit contributions
Don-0.405
Would-0.397
Well-0.390
My-0.370
That-0.363
Token and
Token ID290
Feature activation-1.04e+01

Pos logit contributions
Don+0.057
My+0.053
Would+0.053
 Why+0.052
 It+0.052
Neg logit contributions
ies-0.035
an-0.031
ig-0.030
ish-0.030
ishing-0.030
Token sad
Token ID6507
Feature activation+4.04e+00

Pos logit contributions
ies+0.332
-+0.306
ish+0.269
 mess+0.265
ishing+0.265
Neg logit contributions
Don-0.459
Would-0.449
Well-0.448
My-0.438
Here-0.421
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
Don+0.208
Well+0.197
Would+0.191
 It+0.191
My+0.190
Neg logit contributions
ies-0.098
an-0.088
ish-0.088
ishing-0.087
ig-0.087
Token\n
Token ID198
Feature activation+2.07e+01

Pos logit contributions
.+0.487
-+0.444
,+0.444
 home+0.442
 house+0.435
Neg logit contributions
Don-0.394
Over-0.368
Take-0.363
Looks-0.360
Grab-0.345
Token\n
Token ID198
Feature activation+2.08e+01

Pos logit contributions
Don+0.719
 Did+0.689
Looks+0.670
Well+0.638
Get+0.622
Neg logit contributions
--0.814
 home-0.743
.-0.720
ies-0.719
an-0.700
TokenThe
Token ID464
Feature activation+2.99e+01

Pos logit contributions
 Did+0.630
 It+0.504
Don+0.489
 Oh+0.478
 Everything+0.475
Neg logit contributions
--0.888
an-0.802
 mistake-0.789
ies-0.774
ater-0.763
Token king
Token ID5822
Feature activation-1.24e+01

Pos logit contributions
Don+1.311
Would+1.245
My+1.213
Well+1.189
There+1.182
Neg logit contributions
ies-1.020
ishing-0.960
ish-0.922
ig-0.896
--0.868
Token realized
Token ID6939
Feature activation+9.69e+00

Pos logit contributions
ies+0.568
ishing+0.516
-+0.512
ish+0.507
ig+0.490
Neg logit contributions
Don-0.400
My-0.368
Well-0.365
Would-0.360
It-0.340
Token that
Token ID326
Feature activation-2.27e+00

Pos logit contributions
Don+0.371
Would+0.343
Well+0.341
My+0.340
 Did+0.320
Neg logit contributions
ies-0.397
--0.365
ish-0.340
ishing-0.339
ig-0.335
Token he
Token ID339
Feature activation+1.64e+01

Pos logit contributions
ies+0.103
-+0.097
ish+0.089
ishing+0.088
ig+0.087
Neg logit contributions
Don-0.074
Well-0.068
Would-0.068
My-0.067
Here-0.063
Token had
Token ID550
Feature activation+1.73e+01

Pos logit contributions
Don+0.460
Would+0.418
Well+0.412
My+0.409
 Did+0.377
Neg logit contributions
ies-0.838
--0.786
ish-0.744
ishing-0.738
ig-0.731
Token where
Token ID810
Feature activation-4.64e+00

Pos logit contributions
ies+0.389
ishing+0.343
-+0.336
an+0.332
ig+0.331
Neg logit contributions
Don-0.345
My-0.330
It-0.314
Would-0.312
 It-0.310
Token you
Token ID345
Feature activation+9.87e+00

Pos logit contributions
ies+0.207
-+0.190
ish+0.180
ishing+0.179
ig+0.177
Neg logit contributions
Don-0.160
Would-0.148
Well-0.147
My-0.146
 Did-0.136
Token walk
Token ID2513
Feature activation-1.36e+01

Pos logit contributions
Don+0.371
My+0.346
Well+0.332
Would+0.332
There+0.328
Neg logit contributions
ies-0.397
--0.359
ishing-0.352
ig-0.334
an-0.332
Token."
Token ID526
Feature activation-9.64e+00

Pos logit contributions
ies+0.335
ishing+0.275
ig+0.270
ish+0.270
 eye+0.264
Neg logit contributions
Don-0.715
My-0.665
Would-0.654
It-0.649
I-0.648
Token\n
Token ID198
Feature activation+2.07e+01

Pos logit contributions
ies+0.426
.+0.399
-+0.399
,+0.382
 home+0.364
Neg logit contributions
Don-0.277
Looks-0.264
Ah-0.260
Well-0.258
Would-0.251
Token\n
Token ID198
Feature activation+2.07e+01

Pos logit contributions
Don+0.542
Well+0.511
 Did+0.499
Would+0.490
My+0.482
Neg logit contributions
ies-1.086
--1.047
an-0.946
 eye-0.925
 home-0.914
Token"
Token ID1
Feature activation+5.03e+00

Pos logit contributions
Don+0.543
 Did+0.537
Would+0.526
Well+0.521
My+0.493
Neg logit contributions
ies-1.079
--1.015
an-0.936
ig-0.915
ater-0.899
TokenYes
Token ID5297
Feature activation+6.59e+00

Pos logit contributions
 Did+0.131
 Everything+0.127
 It+0.123
 From+0.122
 Where+0.119
Neg logit contributions
ies-0.205
ater-0.202
--0.200
 size-0.186
ishing-0.183
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
Don+0.291
 It+0.287
It+0.286
 +0.275
Well+0.275
Neg logit contributions
ishing-0.180
ies-0.165
ater-0.151
an-0.150
ish-0.147
Token mom
Token ID1995
Feature activation-1.27e+01

Pos logit contributions
ies+0.650
-+0.594
ishing+0.566
ish+0.562
ig+0.550
Neg logit contributions
Don-0.606
Would-0.559
Well-0.556
My-0.555
It-0.524
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
ishing+0.352
ies+0.325
an+0.323
 breath+0.321
ater+0.319
Neg logit contributions
Don-0.539
 Did-0.525
 She-0.520
-0.517
Well-0.516
Token the
Token ID262
Feature activation+7.95e+00

Pos logit contributions
ies+0.107
ish+0.098
-+0.097
ishing+0.096
an+0.096
Neg logit contributions
Don-0.074
Well-0.067
My-0.066
Would-0.066
It-0.063
Token wagon
Token ID31860
Feature activation-2.12e+01

Pos logit contributions
Don+0.466
Would+0.448
My+0.440
Well+0.440
 Did+0.427
Neg logit contributions
ies-0.158
--0.135
ish-0.122
ig-0.115
ishing-0.113
Token with
Token ID351
Feature activation-4.84e+00

Pos logit contributions
ies+0.582
ishing+0.528
an+0.473
ig+0.473
ish+0.469
Neg logit contributions
Don-1.046
My-0.994
 She-0.994
 It-0.966
It-0.965
Token Spot
Token ID15899
Feature activation+7.78e+00

Pos logit contributions
ies+0.202
-+0.186
ish+0.174
ishing+0.172
ig+0.170
Neg logit contributions
Don-0.181
Would-0.170
Well-0.168
My-0.164
 Did-0.159
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
Don+0.369
My+0.359
 It+0.342
 She+0.338
It+0.337
Neg logit contributions
ies-0.221
ishing-0.193
an-0.186
 choice-0.182
 decision-0.181
Token They
Token ID1119
Feature activation+1.99e+01

Pos logit contributions
ies+0.650
-+0.623
 fort+0.602
 eye+0.601
ish+0.583
Neg logit contributions
Don-0.335
Well-0.318
Would-0.310
My-0.307
Ah-0.283
Token are
Token ID389
Feature activation+1.25e+01

Pos logit contributions
Don+0.832
My+0.777
Well+0.772
Would+0.769
There+0.742
Neg logit contributions
ies-0.740
--0.674
ishing-0.640
ish-0.610
ig-0.608
Token very
Token ID845
Feature activation+3.38e+00

Pos logit contributions
Don+0.478
My+0.444
Would+0.434
Well+0.422
It+0.418
Neg logit contributions
ies-0.498
--0.457
ish-0.447
ig-0.446
ishing-0.440
Token excited
Token ID6568
Feature activation+6.10e+00

Pos logit contributions
Don+0.141
My+0.130
Well+0.130
Would+0.129
It+0.125
Neg logit contributions
ies-0.123
--0.111
ish-0.107
ishing-0.107
ig-0.105
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
 She+0.208
+0.204
Don+0.201
 It+0.192
 He+0.191
Neg logit contributions
ig-0.250
ish-0.239
an-0.230
ishing-0.219
ater-0.217
Token They
Token ID1119
Feature activation+1.99e+01

Pos logit contributions
ies+0.522
 eye+0.509
-+0.505
 fort+0.488
,+0.478
Neg logit contributions
Don-0.391
Would-0.375
My-0.374
Well-0.362
Looks-0.356
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
Don+0.118
Would+0.109
Well+0.109
 Did+0.107
My+0.107
Neg logit contributions
ig-0.057
an-0.053
ishing-0.053
ies-0.051
ish-0.051
Token They
Token ID1119
Feature activation+1.99e+01

Pos logit contributions
ies+0.561
.+0.534
 eye+0.533
-+0.526
 home+0.481
Neg logit contributions
Don-0.370
My-0.330
Over-0.330
Would-0.326
Ah-0.324
Token said
Token ID531
Feature activation+7.54e+00

Pos logit contributions
Don+0.722
Would+0.678
My+0.653
Well+0.646
 Did+0.641
Neg logit contributions
ies-0.858
--0.768
ish-0.733
 eye-0.723
ig-0.697
Token that
Token ID326
Feature activation-2.30e+00

Pos logit contributions
Don+0.294
Would+0.273
My+0.271
Well+0.271
 Did+0.255
Neg logit contributions
ies-0.306
--0.284
ish-0.254
ishing-0.251
ig-0.250
Token being
Token ID852
Feature activation+9.61e-02

Pos logit contributions
ies+0.105
-+0.096
ish+0.088
ishing+0.086
 mess+0.086
Neg logit contributions
Don-0.076
Well-0.069
Would-0.069
My-0.067
There-0.066
Token quiet
Token ID5897
Feature activation+3.31e+00

Pos logit contributions
Don+0.005
Would+0.004
Well+0.004
My+0.004
 Did+0.004
Neg logit contributions
ies-0.003
--0.003
ish-0.002
ishing-0.002
an-0.002
Token and
Token ID290
Feature activation-1.04e+01

Pos logit contributions
Don+0.160
Well+0.150
My+0.149
Would+0.148
 Did+0.143
Neg logit contributions
ies-0.098
--0.085
ishing-0.083
ish-0.082
ig-0.082
Token kind
Token ID1611
Feature activation-1.27e+00

Pos logit contributions
ies+0.377
-+0.341
ish+0.319
ishing+0.317
ig+0.315
Neg logit contributions
Don-0.449
Would-0.417
Well-0.415
My-0.414
It-0.393
Token is
Token ID318
Feature activation+8.17e+00

Pos logit contributions
ies+0.033
ishing+0.029
ig+0.029
ish+0.028
-+0.028
Neg logit contributions
Don-0.065
Well-0.061
Would-0.060
My-0.060
 Did-0.059
Token better
Token ID1365
Feature activation+3.41e-02

Pos logit contributions
Don+0.355
Would+0.332
Well+0.330
My+0.329
It+0.312
Neg logit contributions
ies-0.292
--0.263
ish-0.245
ishing-0.245
ig-0.241
Token than
Token ID621
Feature activation-1.53e+01

Pos logit contributions
Don+0.002
Well+0.002
I+0.002
It+0.002
My+0.002
Neg logit contributions
ies-0.001
ig-0.001
ish-0.001
an-0.001
ishing-0.001
Token for
Token ID329
Feature activation-6.63e+00

Pos logit contributions
ies+0.325
ishing+0.272
an+0.265
ater+0.263
ish+0.255
Neg logit contributions
Don-0.466
My-0.456
It-0.440
Well-0.438
 It-0.435
Token a
Token ID257
Feature activation+1.18e+01

Pos logit contributions
ies+0.303
-+0.273
ish+0.271
ishing+0.265
ig+0.264
Neg logit contributions
Don-0.226
Would-0.209
My-0.207
Well-0.205
It-0.194
Token while
Token ID981
Feature activation-7.92e+00

Pos logit contributions
Don+0.461
Would+0.437
My+0.423
It+0.412
Well+0.410
Neg logit contributions
ies-0.481
ishing-0.437
ish-0.424
an-0.421
--0.413
Token.
Token ID13
Feature activation-1.33e+01

Pos logit contributions
ies+0.208
ish+0.177
an+0.176
ishing+0.173
ig+0.160
Neg logit contributions
Don-0.416
My-0.394
Would-0.392
Well-0.390
It-0.378
Token Suddenly
Token ID24975
Feature activation+1.77e+01

Pos logit contributions
-+0.652
 eye+0.624
 decision+0.605
ies+0.603
 window+0.603
Neg logit contributions
Don-0.299
My-0.263
Well-0.242
Would-0.237
There-0.225
Token the
Token ID262
Feature activation+7.87e+00

Pos logit contributions
Don+0.994
My+0.940
Well+0.936
Would+0.928
There+0.916
Neg logit contributions
ies-0.390
ishing-0.312
--0.308
ish-0.306
an-0.304
Token new
Token ID649
Feature activation+1.33e+00

Pos logit contributions
Don+0.373
My+0.360
Well+0.352
Would+0.351
There+0.349
Neg logit contributions
ies-0.256
ishing-0.214
ish-0.212
--0.202
ig-0.198
Token car
Token ID1097
Feature activation-1.04e+01

Pos logit contributions
My+0.053
Don+0.052
 It+0.051
It+0.051
I+0.051
Neg logit contributions
ies-0.047
ishing-0.042
ish-0.040
ater-0.039
an-0.038
Token blinked
Token ID40360
Feature activation+4.20e+00

Pos logit contributions
ies+0.489
ishing+0.440
ish+0.437
-+0.435
ig+0.423
Neg logit contributions
Don-0.333
My-0.302
Well-0.299
Would-0.292
It-0.283
Token its
Token ID663
Feature activation+1.36e+01

Pos logit contributions
Don+0.212
My+0.200
It+0.196
 It+0.192
Would+0.192
Neg logit contributions
ies-0.113
ig-0.093
ish-0.092
an-0.091
ishing-0.090
Token lights
Token ID7588
Feature activation-1.22e+01

Pos logit contributions
 It+0.556
My+0.553
Don+0.545
 Something+0.540
 He+0.534
Neg logit contributions
ies-0.420
ishing-0.362
ie-0.347
ig-0.344
--0.336
Token fire
Token ID2046
Feature activation-1.75e+01

Pos logit contributions
ies+0.371
ish+0.348
ig+0.347
an+0.334
ishing+0.334
Neg logit contributions
Don-0.286
Would-0.255
My-0.255
It-0.254
There-0.254
Token!"
Token ID2474
Feature activation-2.47e+01

Pos logit contributions
ig+0.437
ies+0.427
ishing+0.393
ish+0.372
 decision+0.366
Neg logit contributions
Don-0.794
 It-0.778
It-0.758
 He-0.746
 They-0.745
Token they
Token ID484
Feature activation+9.81e+00

Pos logit contributions
-+1.048
ies+1.040
.+0.986
,+0.966
 home+0.934
Neg logit contributions
Don-0.662
Would-0.649
Over-0.601
Well-0.586
My-0.586
Token shouted
Token ID17293
Feature activation-4.05e-01

Pos logit contributions
Don+0.398
Would+0.375
Well+0.372
My+0.367
There+0.350
Neg logit contributions
ies-0.379
--0.342
ish-0.319
 eye-0.310
ishing-0.308
Token.
Token ID13
Feature activation-1.86e+01

Pos logit contributions
ies+0.011
ig+0.009
ishing+0.009
an+0.009
 island+0.009
Neg logit contributions
Don-0.021
 It-0.020
My-0.019
Would-0.019
It-0.019
Token\n
Token ID198
Feature activation+1.55e+01

Pos logit contributions
.+0.726
-+0.690
,+0.689
 home+0.630
!+0.629
Neg logit contributions
Don-0.522
My-0.470
Looks-0.460
Here-0.459
Would-0.451
Token\n
Token ID198
Feature activation+1.57e+01

Pos logit contributions
 Did+0.471
Don+0.447
My+0.420
Well+0.410
Would+0.392
Neg logit contributions
--0.656
.-0.614
ies-0.587
,-0.557
!-0.556
TokenTheir
Token ID14574
Feature activation+1.95e+01

Pos logit contributions
 Did+0.502
Don+0.447
Would+0.430
Well+0.425
My+0.414
Neg logit contributions
--0.651
ies-0.618
ater-0.575
an-0.541
 breath-0.528
Token mom
Token ID1995
Feature activation-1.75e+01

Pos logit contributions
Don+0.675
Well+0.636
Would+0.617
Me+0.611
My+0.606
Neg logit contributions
ies-0.739
ishing-0.728
ig-0.690
--0.679
ish-0.674
Token and
Token ID290
Feature activation-1.51e+01

Pos logit contributions
ies+0.690
-+0.623
ish+0.595
ishing+0.594
ig+0.584
Neg logit contributions
Don-0.694
Would-0.642
Well-0.640
My-0.637
It-0.602
Token dad
Token ID9955
Feature activation-2.00e+01

Pos logit contributions
ies+0.452
-+0.398
ish+0.357
ishing+0.353
 eye+0.345
Neg logit contributions
Don-0.744
Would-0.708
Well-0.703
My-0.698
 Did-0.670
Token up
Token ID510
Feature activation-7.35e+00

Pos logit contributions
ies+0.890
an+0.793
ig+0.792
ishing+0.780
ish+0.776
Neg logit contributions
Don-1.012
My-0.981
It-0.976
 It-0.946
 She-0.928
Token with
Token ID351
Feature activation-4.84e+00

Pos logit contributions
ies+0.252
ig+0.219
an+0.218
ish+0.214
ishing+0.209
Neg logit contributions
Don-0.326
My-0.303
It-0.296
Would-0.295
Well-0.289
Token dirt
Token ID13647
Feature activation-1.30e+01

Pos logit contributions
ies+0.210
-+0.191
ishing+0.183
ish+0.182
ig+0.182
Neg logit contributions
Don-0.173
My-0.159
Would-0.159
Well-0.156
It-0.150
Token and
Token ID290
Feature activation-1.04e+01

Pos logit contributions
ies+0.376
ig+0.336
ishing+0.331
an+0.330
 choice+0.325
Neg logit contributions
Don-0.576
 It-0.545
It-0.543
There-0.532
 She-0.532
Token after
Token ID706
Feature activation-5.54e+00

Pos logit contributions
ies+0.384
-+0.350
ishing+0.328
ish+0.327
ig+0.327
Neg logit contributions
Don-0.437
Would-0.408
My-0.404
Well-0.401
It-0.381
Token a
Token ID257
Feature activation+1.19e+01

Pos logit contributions
ies+0.237
-+0.214
ish+0.210
ishing+0.208
an+0.206
Neg logit contributions
Don-0.198
Would-0.183
Well-0.183
My-0.182
It-0.172
Token few
Token ID1178
Feature activation+1.99e+00

Pos logit contributions
Don+0.434
It+0.420
Would+0.415
There+0.395
See+0.392
Neg logit contributions
ies-0.493
ishing-0.460
an-0.447
ish-0.431
ig-0.410
Token minutes
Token ID2431
Feature activation-1.13e+01

Pos logit contributions
Don+0.078
Well+0.075
There+0.074
See+0.073
Would+0.072
Neg logit contributions
ies-0.071
an-0.064
ish-0.064
ishing-0.061
 airplane-0.058
Token the
Token ID262
Feature activation+7.96e+00

Pos logit contributions
ies+0.291
-+0.245
ish+0.230
an+0.227
ishing+0.226
Neg logit contributions
Don-0.597
Would-0.567
Well-0.566
My-0.563
There-0.543
Token mouse
Token ID10211
Feature activation-6.12e+00

Pos logit contributions
Don+0.341
Well+0.318
Would+0.313
I+0.313
Ah+0.309
Neg logit contributions
ies-0.271
ishing-0.255
ig-0.242
--0.238
ish-0.227
Token and
Token ID290
Feature activation-1.04e+01

Pos logit contributions
ies+0.286
ishing+0.258
-+0.256
ish+0.250
ig+0.247
Neg logit contributions
Don-0.194
Would-0.176
My-0.175
Well-0.170
It-0.164
Token to
Token ID284
Feature activation-3.40e+00

Pos logit contributions
Don+0.177
Would+0.171
My+0.167
There+0.163
It+0.161
Neg logit contributions
ies-0.131
ish-0.119
ishing-0.112
ater-0.112
an-0.110
Token bed
Token ID3996
Feature activation-2.15e+01

Pos logit contributions
ies+0.139
-+0.127
ish+0.119
ishing+0.119
ig+0.117
Neg logit contributions
Don-0.131
Would-0.122
My-0.121
Well-0.121
 Did-0.113
Token,
Token ID11
Feature activation-1.59e+01

Pos logit contributions
ies+0.636
ish+0.587
an+0.541
ishing+0.538
ig+0.532
Neg logit contributions
Don-1.029
My-0.968
Well-0.953
Would-0.950
It-0.929
Token hoping
Token ID7725
Feature activation-1.71e+00

Pos logit contributions
ies+0.489
-+0.443
 eye+0.391
ishing+0.387
 mess+0.381
Neg logit contributions
Don-0.734
Well-0.711
Would-0.710
My-0.692
Here-0.683
Token that
Token ID326
Feature activation-2.42e+00

Pos logit contributions
ies+0.077
ish+0.070
ishing+0.070
an+0.068
-+0.068
Neg logit contributions
Don-0.057
My-0.054
Would-0.052
Well-0.051
It-0.050
Token maybe
Token ID3863
Feature activation+1.57e+01

Pos logit contributions
ies+0.106
-+0.099
ish+0.090
ishing+0.089
 decision+0.089
Neg logit contributions
Don-0.083
Well-0.078
Would-0.076
My-0.074
There-0.072
Token he
Token ID339
Feature activation+1.63e+01

Pos logit contributions
Don+0.498
Would+0.458
My+0.455
Well+0.446
 Did+0.417
Neg logit contributions
ies-0.744
--0.680
ish-0.664
ishing-0.657
an-0.649
Token would
Token ID561
Feature activation+1.04e+01

Pos logit contributions
Don+0.419
Would+0.373
Well+0.370
My+0.368
 Did+0.333
Neg logit contributions
ies-0.867
--0.811
ish-0.774
ishing-0.773
ig-0.764
Token find
Token ID1064
Feature activation-5.79e+00

Pos logit contributions
Don+0.381
My+0.358
Would+0.351
It+0.345
Well+0.338
Neg logit contributions
ies-0.427
ishing-0.399
--0.394
ish-0.385
ig-0.373
Token his
Token ID465
Feature activation+1.33e+01

Pos logit contributions
ies+0.268
-+0.241
ish+0.232
ishing+0.232
ig+0.230
Neg logit contributions
Don-0.189
My-0.177
Would-0.173
Well-0.170
It-0.162
Token toy
Token ID13373
Feature activation-1.35e+01

Pos logit contributions
My+0.536
Don+0.509
 It+0.499
I+0.494
 Who+0.491
Neg logit contributions
ies-0.464
ishing-0.380
le-0.369
ie-0.367
--0.357
Token wanted
Token ID2227
Feature activation+9.68e+00

Pos logit contributions
Don+0.467
Would+0.425
Well+0.423
My+0.422
 Did+0.390
Neg logit contributions
ies-0.705
--0.655
ishing-0.622
ish-0.620
ig-0.613
Token to
Token ID284
Feature activation-3.88e+00

Pos logit contributions
Don+0.379
Would+0.367
My+0.363
Well+0.347
It+0.341
Neg logit contributions
ies-0.366
an-0.333
ish-0.331
ig-0.331
ishing-0.329
Token have
Token ID423
Feature activation+1.10e+01

Pos logit contributions
ies+0.164
-+0.150
ishing+0.149
ish+0.145
ig+0.145
Neg logit contributions
Don-0.137
My-0.134
Would-0.129
It-0.127
Well-0.126
Token their
Token ID511
Feature activation+1.31e+01

Pos logit contributions
Don+0.461
Would+0.438
My+0.436
Well+0.427
There+0.419
Neg logit contributions
ies-0.402
--0.359
ish-0.349
an-0.345
ishing-0.344
Token own
Token ID898
Feature activation+2.01e+00

Pos logit contributions
My+0.488
Me+0.483
Don+0.476
Well+0.467
Here+0.463
Neg logit contributions
ies-0.446
--0.408
ishing-0.406
an-0.406
ig-0.376
Token.
Token ID13
Feature activation-1.37e+01

Pos logit contributions
 It+0.079
My+0.078
 Did+0.077
+0.077
 The+0.076
Neg logit contributions
ies-0.058
an-0.056
ishing-0.054
ish-0.052
ig-0.052
Token\n
Token ID198
Feature activation+2.03e+01

Pos logit contributions
.+0.509
ies+0.503
 tent+0.493
-+0.488
,+0.487
Neg logit contributions
Don-0.443
My-0.368
Looks-0.361
Get-0.360
Well-0.359
Token\n
Token ID198
Feature activation+2.03e+01

Pos logit contributions
Don+0.641
Well+0.535
Looks+0.534
 Did+0.530
Get+0.526
Neg logit contributions
--0.938
ies-0.896
.-0.832
 home-0.818
,-0.797
TokenOne
Token ID3198
Feature activation+1.71e+01

Pos logit contributions
Don+0.632
 Did+0.581
Well+0.571
Would+0.549
My+0.537
Neg logit contributions
ies-0.919
--0.905
ig-0.802
ater-0.785
an-0.779
Token day
Token ID1110
Feature activation-1.01e+01

Pos logit contributions
Don+0.638
My+0.619
Me+0.604
Well+0.599
That+0.598
Neg logit contributions
ies-0.619
ishing-0.608
ish-0.576
an-0.566
ig-0.555
Token,
Token ID11
Feature activation-1.62e+01

Pos logit contributions
ies+0.162
-+0.128
ish+0.105
ishing+0.104
ig+0.099
Neg logit contributions
Don-0.637
Would-0.608
Well-0.607
My-0.605
 Did-0.584
Token who
Token ID508
Feature activation+2.09e+00

Pos logit contributions
ies+0.291
 decision+0.262
ishing+0.245
 breath+0.243
 choice+0.240
Neg logit contributions
My-0.438
Don-0.428
Would-0.422
I-0.414
Well-0.403
Token liked
Token ID8288
Feature activation+1.40e+01

Pos logit contributions
Don+0.072
Would+0.068
Well+0.066
There+0.063
 Did+0.062
Neg logit contributions
ies-0.089
--0.087
ish-0.077
 eye-0.076
an-0.074
Token to
Token ID284
Feature activation-3.56e+00

Pos logit contributions
Don+0.529
My+0.501
Would+0.497
Well+0.468
I+0.461
Neg logit contributions
ies-0.554
an-0.527
ish-0.519
ig-0.519
ishing-0.509
Token learn
Token ID2193
Feature activation-4.48e+00

Pos logit contributions
ies+0.133
-+0.117
ishing+0.117
ish+0.114
an+0.111
Neg logit contributions
Don-0.145
My-0.140
Would-0.136
Well-0.135
It-0.130
Token new
Token ID649
Feature activation+1.15e+00

Pos logit contributions
ies+0.146
ish+0.140
an+0.139
ishing+0.138
ig+0.134
Neg logit contributions
Don-0.192
My-0.186
Would-0.186
Well-0.179
It-0.176
Token things
Token ID1243
Feature activation-2.31e+01

Pos logit contributions
Don+0.057
Well+0.055
Would+0.055
My+0.054
 It+0.053
Neg logit contributions
ies-0.027
ish-0.027
ishing-0.027
ig-0.026
an-0.026
Token.
Token ID13
Feature activation-1.35e+01

Pos logit contributions
ies+0.553
ishing+0.480
an+0.473
ish+0.467
ig+0.458
Neg logit contributions
Don-1.212
My-1.176
Well-1.156
It-1.139
Would-1.139
Token She
Token ID1375
Feature activation+2.67e+01

Pos logit contributions
ies+0.539
 eye+0.527
-+0.525
.+0.511
ish+0.495
Neg logit contributions
Don-0.428
Ah-0.386
Well-0.383
See-0.355
Get-0.353
Token had
Token ID550
Feature activation+1.70e+01

Pos logit contributions
Don+0.880
Would+0.797
Well+0.784
My+0.779
Here+0.756
Neg logit contributions
ies-1.229
--1.163
ish-1.053
ig-1.049
 eye-1.041
Token an
Token ID281
Feature activation+1.53e+01

Pos logit contributions
Don+0.679
Well+0.632
Would+0.627
 Did+0.595
My+0.586
Neg logit contributions
ies-0.641
--0.609
 eye-0.538
 home-0.528
 castle-0.521
Token exam
Token ID2814
Feature activation-1.89e+01

Pos logit contributions
My+0.580
Don+0.568
 It+0.565
Well+0.560
 Everywhere+0.557
Neg logit contributions
ies-0.565
ishing-0.466
ig-0.438
yt-0.432
--0.423
Token Sarah
Token ID10490
Feature activation+4.03e+00

Pos logit contributions
ies+0.733
-+0.701
 airplane+0.619
 eye+0.612
ishing+0.606
Neg logit contributions
Don-0.559
Would-0.533
Well-0.519
My-0.500
Here-0.483
Token heard
Token ID2982
Feature activation+4.51e+00

Pos logit contributions
Don+0.132
My+0.122
Would+0.120
Well+0.120
It+0.111
Neg logit contributions
ies-0.184
--0.169
ishing-0.163
ish-0.163
ig-0.160
Token someone
Token ID2130
Feature activation+1.04e+01

Pos logit contributions
Don+0.169
Well+0.156
Would+0.156
My+0.153
 Did+0.147
Neg logit contributions
ies-0.186
--0.174
ishing-0.156
ish-0.155
 eye-0.155
Token calling
Token ID4585
Feature activation-1.44e+01

Pos logit contributions
Don+0.440
My+0.425
Would+0.417
Well+0.395
It+0.387
Neg logit contributions
ies-0.367
ig-0.338
ishing-0.337
ish-0.333
an-0.326
Token her
Token ID607
Feature activation+1.35e+01

Pos logit contributions
ies+0.629
-+0.566
ish+0.563
ishing+0.563
ig+0.556
Neg logit contributions
Don-0.515
Would-0.469
My-0.465
Well-0.457
It-0.440
Token name
Token ID1438
Feature activation-1.50e+01

Pos logit contributions
Don+0.502
Would+0.469
My+0.458
Well+0.457
 It+0.448
Neg logit contributions
ies-0.521
ishing-0.489
ig-0.476
ish-0.464
--0.462
Token.
Token ID13
Feature activation-1.41e+01

Pos logit contributions
ies+0.358
ishing+0.290
ish+0.283
-+0.277
ig+0.273
Neg logit contributions
Don-0.833
My-0.792
Would-0.788
Well-0.776
It-0.765
Token It
Token ID632
Feature activation+2.11e+01

Pos logit contributions
-+0.573
.+0.571
,+0.563
 home+0.551
 window+0.541
Neg logit contributions
Don-0.343
Ah-0.326
Get-0.317
Take-0.317
My-0.307
Token was
Token ID373
Feature activation+1.57e+01

Pos logit contributions
Don+0.738
Would+0.697
Here+0.692
Well+0.683
My+0.675
Neg logit contributions
ies-0.908
--0.878
 mess-0.753
an-0.752
ish-0.744
Token her
Token ID607
Feature activation+1.37e+01

Pos logit contributions
Don+0.531
Would+0.487
Well+0.483
My+0.482
 Did+0.448
Neg logit contributions
ies-0.707
--0.652
ish-0.619
ishing-0.617
ig-0.609
Token mom
Token ID1995
Feature activation-1.35e+01

Pos logit contributions
Don+0.544
My+0.539
Well+0.519
Would+0.506
 It+0.498
Neg logit contributions
ies-0.435
ishing-0.432
ig-0.396
ish-0.389
ater-0.383
Token hugs
Token ID40657
Feature activation-1.06e+01

Pos logit contributions
Don+0.788
Would+0.736
My+0.728
Well+0.726
 Did+0.683
Neg logit contributions
ies-0.845
--0.768
ishing-0.736
ish-0.727
an-0.713
Token the
Token ID262
Feature activation+5.44e+00

Pos logit contributions
ies+0.483
-+0.445
ish+0.424
ishing+0.424
ig+0.418
Neg logit contributions
Don-0.359
Would-0.329
My-0.327
Well-0.325
It-0.303
Token doll
Token ID3654
Feature activation-1.75e+01

Pos logit contributions
Don+0.268
My+0.263
Would+0.258
Here+0.251
Well+0.251
Neg logit contributions
ies-0.161
ishing-0.147
ish-0.142
--0.137
ig-0.132
Token and
Token ID290
Feature activation-1.28e+01

Pos logit contributions
ishing+0.559
ies+0.552
ater+0.498
ig+0.482
an+0.476
Neg logit contributions
My-0.754
 She-0.737
Don-0.722
 It-0.720
Would-0.701
Token says
Token ID1139
Feature activation-2.76e+00

Pos logit contributions
ies+0.494
-+0.447
ish+0.417
ishing+0.412
ig+0.406
Neg logit contributions
Don-0.520
Would-0.486
Well-0.483
My-0.477
 Did-0.454
Token,
Token ID11
Feature activation-1.81e+01

Pos logit contributions
ies+0.054
-+0.044
ish+0.038
ishing+0.038
ig+0.036
Neg logit contributions
Don-0.165
Would-0.157
Well-0.156
My-0.156
 Did-0.151
Token "
Token ID366
Feature activation+1.11e+01

Pos logit contributions
ies+0.726
-+0.717
 window+0.625
,+0.614
 castle+0.604
Neg logit contributions
Don-0.584
Well-0.557
Ah-0.554
My-0.552
Here-0.546
TokenWow
Token ID22017
Feature activation+8.53e+00

Pos logit contributions
 Everything+0.212
 Did+0.208
 Sadly+0.206
 From+0.200
 Everywhere+0.196
Neg logit contributions
ies-0.532
--0.516
ater-0.479
 hat-0.470
an-0.470
Token,
Token ID11
Feature activation-1.79e+01

Pos logit contributions
It+0.373
My+0.366
 It+0.364
 She+0.363
There+0.359
Neg logit contributions
an-0.214
ishing-0.206
ies-0.203
ater-0.199
 island-0.197
Token Tom
Token ID4186
Feature activation+6.74e+00

Pos logit contributions
ies+0.708
-+0.659
ish+0.600
ig+0.600
 eye+0.591
Neg logit contributions
Don-0.697
Would-0.652
Well-0.642
My-0.623
 Did-0.600
Token!
Token ID0
Feature activation-1.95e+01

Pos logit contributions
Don+0.295
 She+0.282
It+0.282
 It+0.281
My+0.276
Neg logit contributions
ies-0.170
ishing-0.165
 island-0.165
an-0.162
 breath-0.151
Token!"
Token ID2474
Feature activation-2.19e+01

Pos logit contributions
ishing+0.183
ies+0.170
 eye+0.166
ig+0.165
an+0.160
Neg logit contributions
Don-0.366
Well-0.356
It-0.354
I-0.347
 She-0.343
Token she
Token ID673
Feature activation+1.50e+01

Pos logit contributions
ies+0.847
-+0.834
.+0.726
 home+0.719
,+0.716
Neg logit contributions
Would-0.768
Don-0.743
Did-0.701
 Did-0.698
Even-0.690
Token said
Token ID531
Feature activation+4.63e+00

Pos logit contributions
Don+0.532
Would+0.498
Well+0.491
My+0.477
Here+0.472
Neg logit contributions
ies-0.669
--0.620
ish-0.558
an-0.547
ig-0.546
Token,
Token ID11
Feature activation-1.86e+01

Pos logit contributions
Don+0.251
My+0.243
Would+0.235
 It+0.231
It+0.231
Neg logit contributions
ies-0.109
ishing-0.092
ig-0.089
an-0.088
ish-0.088
Token her
Token ID607
Feature activation+1.17e+01

Pos logit contributions
ies+0.869
-+0.815
ish+0.757
ishing+0.754
ig+0.740
Neg logit contributions
Don-0.591
Well-0.542
Would-0.539
My-0.531
There-0.499
Token eyes
Token ID2951
Feature activation-2.32e+01

Pos logit contributions
Don+0.315
Well+0.309
 It+0.304
Me+0.299
 From+0.298
Neg logit contributions
ishing-0.480
ies-0.458
ater-0.433
le-0.427
ie-0.424
Token wide
Token ID3094
Feature activation+5.83e+00

Pos logit contributions
ies+1.114
ish+1.024
ig+1.014
ishing+1.003
an+1.001
Neg logit contributions
Don-0.735
My-0.672
Would-0.654
It-0.642
Well-0.639
Token with
Token ID351
Feature activation-7.45e+00

Pos logit contributions
Don+0.266
My+0.257
 It+0.254
 She+0.250
Would+0.248
Neg logit contributions
ies-0.169
ig-0.163
an-0.147
ish-0.143
 choice-0.140
Token amaz
Token ID40642
Feature activation-1.24e+01

Pos logit contributions
ies+0.264
-+0.240
ish+0.221
ishing+0.219
 eye+0.216
Neg logit contributions
Don-0.323
Would-0.303
Well-0.302
My-0.298
 Did-0.286
Tokenement
Token ID972
Feature activation-5.65e+00

Pos logit contributions
ies+0.440
 fort+0.382
ish+0.376
 eye+0.369
an+0.367
Neg logit contributions
It-0.461
Don-0.459
Well-0.457
My-0.450
 It-0.433
Token.
Token ID13
Feature activation-1.56e+01

Pos logit contributions
ig+0.147
an+0.142
ish+0.138
ishing+0.135
 choice+0.133
Neg logit contributions
Don-0.244
 She-0.235
It-0.234
My-0.233
Well-0.231
Token.
Token ID13
Feature activation-1.33e+01

Pos logit contributions
ies+0.750
ish+0.704
ishing+0.694
 decision+0.627
ig+0.623
Neg logit contributions
There-1.052
It-1.045
Don-1.038
I-1.022
 It-1.008
Token It
Token ID632
Feature activation+2.19e+01

Pos logit contributions
.+0.512
 eye+0.493
,+0.480
 window+0.474
 fort+0.467
Neg logit contributions
Don-0.403
My-0.357
Looks-0.354
Well-0.346
Get-0.320
Token was
Token ID373
Feature activation+1.64e+01

Pos logit contributions
Don+0.813
Would+0.742
Well+0.723
Here+0.716
My+0.704
Neg logit contributions
ies-0.922
--0.886
 castle-0.776
 mess-0.770
 island-0.758
Token a
Token ID257
Feature activation+1.19e+01

Pos logit contributions
Don+0.587
Would+0.546
My+0.544
It+0.523
Well+0.520
Neg logit contributions
ies-0.692
ish-0.634
--0.627
ishing-0.623
ig-0.617
Token nice
Token ID3621
Feature activation+2.33e+00

Pos logit contributions
Don+0.562
It+0.547
Would+0.539
My+0.536
See+0.524
Neg logit contributions
ies-0.356
ishing-0.344
ish-0.311
an-0.285
--0.277
Token image
Token ID2939
Feature activation-2.59e+01

Pos logit contributions
 It+0.103
It+0.096
 She+0.096
+0.090
Don+0.090
Neg logit contributions
ishing-0.076
ish-0.067
ies-0.067
immers-0.062
rying-0.061
Token of
Token ID286
Feature activation+4.86e+00

Pos logit contributions
ies+1.162
ishing+1.135
 decision+1.090
an+1.055
ish+1.055
Neg logit contributions
 It-0.767
It-0.762
My-0.762
I-0.726
 She-0.723
Token a
Token ID257
Feature activation+1.19e+01

Pos logit contributions
Don+0.176
Would+0.162
Well+0.161
My+0.160
 Did+0.150
Neg logit contributions
ies-0.208
--0.192
ish-0.180
ishing-0.180
ig-0.177
Token castle
Token ID16669
Feature activation-2.41e+01

Pos logit contributions
Don+0.551
It+0.541
My+0.536
Would+0.530
See+0.527
Neg logit contributions
ies-0.359
ishing-0.353
ish-0.311
an-0.299
--0.295
Token,
Token ID11
Feature activation-1.58e+01

Pos logit contributions
ies+0.868
ish+0.821
ishing+0.817
ig+0.738
 decision+0.726
Neg logit contributions
It-0.904
There-0.897
I-0.891
Don-0.889
 She-0.873
Token with
Token ID351
Feature activation-4.86e+00

Pos logit contributions
ies+0.602
-+0.544
ish+0.522
ishing+0.518
ig+0.511
Neg logit contributions
Don-0.652
Would-0.609
My-0.609
Well-0.598
It-0.572
Token started
Token ID2067
Feature activation-4.41e+00

Pos logit contributions
ies+0.704
-+0.642
ishing+0.623
ish+0.617
an+0.611
Neg logit contributions
Don-0.528
My-0.487
Would-0.481
Well-0.479
It-0.445
Token to
Token ID284
Feature activation-9.58e+00

Pos logit contributions
ies+0.159
ish+0.149
an+0.148
ishing+0.145
ig+0.142
Neg logit contributions
Don-0.181
My-0.169
Would-0.162
Well-0.160
It-0.160
Token fold
Token ID5591
Feature activation-1.52e+01

Pos logit contributions
ies+0.351
-+0.322
ishing+0.308
ish+0.300
ig+0.299
Neg logit contributions
Don-0.409
My-0.384
Would-0.375
Well-0.367
It-0.364
Token pieces
Token ID5207
Feature activation-2.20e+01

Pos logit contributions
ies+0.624
an+0.553
-+0.553
ishing+0.552
ish+0.548
Neg logit contributions
Don-0.549
My-0.511
It-0.493
Well-0.490
Would-0.489
Token of
Token ID286
Feature activation-8.02e-01

Pos logit contributions
ies+0.644
an+0.609
 decision+0.579
ish+0.579
ishing+0.579
Neg logit contributions
Don-0.954
It-0.926
There-0.917
My-0.909
Well-0.890
Token paper
Token ID3348
Feature activation-2.46e+01

Pos logit contributions
ies+0.026
an+0.023
-+0.023
ishing+0.022
ig+0.022
Neg logit contributions
Don-0.036
My-0.034
Would-0.033
It-0.033
See-0.033
Token.
Token ID13
Feature activation-1.86e+01

Pos logit contributions
ies+0.620
an+0.602
 decision+0.596
ishing+0.572
ish+0.545
Neg logit contributions
Don-1.081
There-1.059
It-1.039
My-1.035
 It-1.033
Token The
Token ID383
Feature activation+2.14e+01

Pos logit contributions
-+0.739
.+0.735
ies+0.733
 home+0.726
 house+0.702
Neg logit contributions
Don-0.470
Listen-0.382
Over-0.381
My-0.377
Take-0.374
Token goose
Token ID37246
Feature activation-2.18e+01

Pos logit contributions
Don+0.895
Would+0.832
My+0.824
Well+0.820
Here+0.809
Neg logit contributions
ies-0.706
ishing-0.668
--0.628
ig-0.611
ish-0.598
Token admired
Token ID29382
Feature activation-2.68e+00

Pos logit contributions
ies+1.007
-+0.934
ish+0.884
ishing+0.880
ig+0.870
Neg logit contributions
Don-0.707
Would-0.647
Well-0.641
My-0.638
 Did-0.594
Token the
Token ID262
Feature activation+3.15e+00

Pos logit contributions
ies+0.114
-+0.105
ish+0.099
ishing+0.099
ig+0.097
Neg logit contributions
Don-0.098
Would-0.090
Well-0.090
My-0.089
 Did-0.084
Token all
Token ID477
Feature activation-9.95e+00

Pos logit contributions
Don+0.026
My+0.025
Would+0.024
Well+0.024
 Did+0.023
Neg logit contributions
ies-0.026
--0.024
ishing-0.023
ish-0.023
ig-0.023
Token about
Token ID546
Feature activation-9.50e+00

Pos logit contributions
ies+0.378
-+0.345
ishing+0.330
ish+0.329
ig+0.328
Neg logit contributions
Don-0.406
Would-0.374
My-0.373
Well-0.371
It-0.352
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
ies+0.408
-+0.375
ish+0.362
ishing+0.361
ig+0.359
Neg logit contributions
Don-0.343
Would-0.317
My-0.314
Well-0.309
It-0.292
Token great
Token ID1049
Feature activation-8.43e-01

Pos logit contributions
Don+0.055
Would+0.053
Here+0.052
I+0.049
My+0.049
Neg logit contributions
ies-0.037
ishing-0.035
ig-0.034
--0.033
ish-0.031
Token trade
Token ID3292
Feature activation-1.79e+01

Pos logit contributions
ishing+0.020
ater+0.020
ies+0.020
ig+0.019
an+0.019
Neg logit contributions
Don-0.031
Would-0.031
 It-0.030
Let-0.030
Come-0.030
Token tree
Token ID5509
Feature activation-2.58e+01

Pos logit contributions
ies+0.420
ishing+0.361
ish+0.338
ig+0.334
 breath+0.334
Neg logit contributions
Don-0.918
My-0.880
Well-0.874
Would-0.870
I-0.861
Token in
Token ID287
Feature activation-8.14e+00

Pos logit contributions
ies+0.706
ishing+0.633
ig+0.599
ish+0.581
 breath+0.580
Neg logit contributions
Don-1.223
My-1.169
Would-1.134
Well-1.126
 It-1.110
Token his
Token ID465
Feature activation+7.81e+00

Pos logit contributions
ies+0.332
-+0.309
ishing+0.285
ish+0.284
ig+0.280
Neg logit contributions
Don-0.310
Would-0.286
Well-0.286
My-0.282
 Did-0.269
Token garden
Token ID11376
Feature activation-2.56e+01

Pos logit contributions
Don+0.390
My+0.378
Would+0.373
Well+0.369
Here+0.365
Neg logit contributions
ies-0.213
--0.166
ishing-0.164
ish-0.152
ater-0.151
Token and
Token ID290
Feature activation-1.58e+01

Pos logit contributions
ies+0.607
ishing+0.505
-+0.492
ish+0.491
ig+0.491
Neg logit contributions
Don-1.349
Would-1.281
My-1.280
Well-1.252
It-1.238
Token how
Token ID703
Feature activation-1.36e+01

Pos logit contributions
ies+0.712
-+0.655
ish+0.612
ishing+0.607
ig+0.596
Neg logit contributions
Don-0.535
Well-0.491
Would-0.490
My-0.485
There-0.454
Token.
Token ID13
Feature activation-1.83e+01

Pos logit contributions
ies+0.759
ishing+0.696
 island+0.695
an+0.689
 decision+0.675
Neg logit contributions
Don-1.158
My-1.157
There-1.117
It-1.108
I-1.071
Token She
Token ID1375
Feature activation+2.21e+01

Pos logit contributions
ies+0.727
.+0.725
-+0.706
,+0.702
 home+0.686
Neg logit contributions
Don-0.461
Over-0.444
Take-0.418
Ah-0.418
Looks-0.417
Token felt
Token ID2936
Feature activation+6.21e+00

Pos logit contributions
Don+0.797
Would+0.740
Well+0.730
My+0.719
 Did+0.703
Neg logit contributions
ies-0.951
--0.890
ish-0.807
ig-0.806
ishing-0.799
Token the
Token ID262
Feature activation+3.27e+00

Pos logit contributions
Don+0.268
Well+0.261
Would+0.256
 Did+0.249
My+0.248
Neg logit contributions
ies-0.206
--0.198
 eye-0.174
ish-0.172
ishing-0.170
Token comb
Token ID1974
Feature activation-2.54e+01

Pos logit contributions
Don+0.147
My+0.141
Well+0.136
I+0.135
Here+0.134
Neg logit contributions
ies-0.109
ishing-0.098
--0.095
ish-0.089
an-0.086
Token brush
Token ID14093
Feature activation-2.46e+01

Pos logit contributions
ies+0.868
ishing+0.783
 island+0.776
an+0.761
ish+0.742
Neg logit contributions
Don-1.071
My-1.058
It-0.985
Well-0.977
That-0.956
Token brush
Token ID14093
Feature activation-2.45e+01

Pos logit contributions
ies+0.711
ishing+0.613
an+0.609
ish+0.604
-+0.593
Neg logit contributions
Don-1.188
My-1.142
It-1.093
Well-1.090
Would-1.077
Token brush
Token ID14093
Feature activation-2.44e+01

Pos logit contributions
ies+0.748
-+0.641
ishing+0.629
ish+0.626
an+0.618
Neg logit contributions
Don-1.167
My-1.104
Well-1.081
Would-1.077
It-1.056
Token her
Token ID607
Feature activation+1.03e+01

Pos logit contributions
ies+0.745
-+0.644
ishing+0.624
ish+0.621
an+0.611
Neg logit contributions
Don-1.161
My-1.096
Well-1.081
Would-1.077
It-1.048
Token hair
Token ID4190
Feature activation-2.52e+01

Pos logit contributions
Don+0.484
My+0.474
Well+0.456
 It+0.452
It+0.449
Neg logit contributions
ies-0.284
 island-0.239
ishing-0.235
--0.226
an-0.221
Token.
Token ID13
Feature activation-1.71e+01

Pos logit contributions
ies+0.610
ishing+0.504
an+0.495
ish+0.488
 island+0.486
Neg logit contributions
Don-1.338
My-1.299
Well-1.267
It-1.248
Would-1.227
Token he
Token ID339
Feature activation+1.64e+01

Pos logit contributions
Don+1.213
Would+1.152
Well+1.151
My+1.126
Here+1.085
Neg logit contributions
ies-0.750
--0.680
ish-0.598
ig-0.591
ishing-0.588
Token saw
Token ID2497
Feature activation+7.59e+00

Pos logit contributions
Don+0.653
Well+0.627
Would+0.599
Here+0.580
My+0.580
Neg logit contributions
ies-0.622
--0.601
 eye-0.557
ig-0.545
 castle-0.531
Token a
Token ID257
Feature activation+1.18e+01

Pos logit contributions
Don+0.296
Well+0.277
Would+0.273
My+0.264
 Did+0.257
Neg logit contributions
ies-0.301
--0.282
 eye-0.252
ish-0.252
an-0.250
Token brightly
Token ID35254
Feature activation+8.96e+00

Pos logit contributions
Don+0.491
It+0.480
Would+0.467
My+0.458
I+0.445
Neg logit contributions
ies-0.435
ishing-0.427
ish-0.393
ig-0.377
an-0.369
Token wrapped
Token ID12908
Feature activation-6.26e+00

Pos logit contributions
Don+0.472
It+0.449
My+0.449
Would+0.448
That+0.445
Neg logit contributions
ies-0.213
ish-0.196
ig-0.179
ishing-0.179
 decision-0.161
Token present
Token ID1944
Feature activation-2.40e+01

Pos logit contributions
ies+0.186
ishing+0.161
ish+0.152
ig+0.145
ans+0.135
Neg logit contributions
My-0.275
Don-0.272
It-0.264
I-0.259
 It-0.256
Token sitting
Token ID5586
Feature activation-3.99e+00

Pos logit contributions
ies+0.776
ishing+0.773
ater+0.711
ish+0.705
ig+0.693
Neg logit contributions
My-1.006
 It-0.976
Don-0.967
It-0.959
 She-0.936
Token on
Token ID319
Feature activation-2.10e+00

Pos logit contributions
ies+0.133
 decision+0.111
ish+0.110
 choice+0.108
an+0.105
Neg logit contributions
Don-0.185
My-0.173
It-0.168
Well-0.166
Would-0.166
Token the
Token ID262
Feature activation+7.87e+00

Pos logit contributions
ies+0.102
ish+0.090
ishing+0.090
-+0.090
ig+0.089
Neg logit contributions
Don-0.067
My-0.062
Would-0.061
Well-0.059
It-0.058
Token floor
Token ID4314
Feature activation-1.47e+01

Pos logit contributions
My+0.382
Don+0.371
I+0.358
It+0.358
That+0.355
Neg logit contributions
ies-0.252
ishing-0.219
ig-0.217
ish-0.204
--0.192
Token.
Token ID13
Feature activation-1.33e+01

Pos logit contributions
ies+0.429
ish+0.393
ishing+0.376
an+0.373
ig+0.365
Neg logit contributions
Don-0.676
My-0.663
 She-0.652
It-0.650
Would-0.644