TOP ACTIVATIONS
MAX = 3.191

!" L ily and Tom met at the bottom
to be kind . That 's what makes you happy and
families happy too . And so , the whole town became
ummy cube . But the cube is not a
day ever ! But soon it was time for
and free . But then , he hears a
" You are making too much noise and you are being
noises . They had so much fun with the car .
day , she ate too much candy and her stomach hurt
it . She had so much fun that she didn 't
They are happy . And so are Mom and Dad .
. You are making too much noise . You are in
aren and Ann had so much fun playing together ! Ann
love this movie !" and sn ugg led up with her
my . You ate too much candy , didn 't you
her new home . Sara and Tom smiled as they watched
One day , Sara and Tom want to do art
to cry . She regretted not being more careful . Her
After he was done chewing , Tim my
, " I gained so much confidence !" But then ,

MIN ACTIVATIONS
MIN = -4.022

that was broken . The little boat felt sad for the
very sad and missed the little bird ie . <|endoftext|> Once
boat . So , the little boat offered to help the
. " How can we get the jet back
sad and scared . How can they get their rocket back
are you ?" But the little boat said nothing , only
?" asked Sammy in a tiny voice . "
and wanted to know why the airplane could fly but not
is getting wet . What can we do ?" Cow said
pigeon wanted to help the little bird , so he told
said . " But how can we fly in a jet
is not happy . What can I do ?" She thought
and our corn . What can we do ?"
the bird , " What can I do to get to
" OK , what can we make ?" Sam asks
boat . The little boat breathed a sigh of
bird came and scared the little bird ie . The little
it was small . The little boat was mo ored in
One day , the tiny pink boat had to move
We are not going on a trip today . We are

INTERVAL 1.388 - 3.191
CONTAINS 2.240%

many pictures . He sees a picture of a shoe .
Mom my ." Her mom hugged her and said
are not friends . A lady sees them .
my loved to look at his calendar every day . His
had protected his honey . From that day on , the

INTERVAL -0.416 - 1.388
CONTAINS 70.855%

very messy boy . His toys were everywhere in his room
angry and took the axe away from the boy .
. The shark saw them and sw am towards
the two clean beans . John was very happy
to heal itself . The tree and the wild animal became

INTERVAL -2.219 - -0.416
CONTAINS 26.829%

for a better couch ," his mom said . " This
and Jen slide down the slide . The rope catches them
, as she took Ben 's hand . They
watched the little duck for a while , licking his lips
there sweet ie ? Can I help ?" The

INTERVAL -4.022 - -2.219
CONTAINS 0.076%

of the desert . How can there be an o asis
was wrong , but the little duck could not answer .
little bird gasped . How could this be happening ?
strong , but the two could not get along .
Tim my said , " Sure , I 'll show
Token!"
Token ID2474
Feature activation+7.74e-01

Pos logit contributions
reating+0.642
ward+0.637
ained+0.559
tr+0.551
live+0.550
Neg logit contributions
 spe-0.809
mer-0.779
chie-0.761
 se-0.711
 en-0.664
Token
Token ID198
Feature activation+1.34e+00

Pos logit contributions
reating+1.215
ward+1.215
live+1.121
tr+1.098
ained+1.080
Neg logit contributions
 spe-0.961
mer-0.919
chie-0.876
 se-0.858
 en-0.815
Token
Token ID198
Feature activation+2.25e+00

Pos logit contributions
reating+2.165
ward+2.160
tr+1.959
live+1.931
ained+1.930
Neg logit contributions
 spe-1.554
mer-1.507
 se-1.453
chie-1.452
 en-1.351
TokenL
Token ID43
Feature activation+6.11e-01

Pos logit contributions
ward+4.148
reating+4.049
icated+3.761
ained+3.694
inea+3.651
Neg logit contributions
mer-2.239
chie-2.108
 se-2.100
 spe-2.045
 en-1.949
Tokenily
Token ID813
Feature activation+6.49e-01

Pos logit contributions
ward+0.851
reating+0.816
tr+0.750
ained+0.732
live+0.729
Neg logit contributions
 spe-0.867
mer-0.850
chie-0.844
 se-0.775
 en-0.729
Token and
Token ID290
Feature activation+3.19e+00

Pos logit contributions
reating+1.029
ward+1.012
ained+0.922
live+0.900
tr+0.898
Neg logit contributions
 spe-0.834
mer-0.788
chie-0.752
 se-0.742
 en-0.673
Token Tom
Token ID4186
Feature activation+1.22e+00

Pos logit contributions
ward+5.812
reating+5.781
reetings+5.387
ained+5.355
icated+5.330
Neg logit contributions
 se-3.321
 spe-3.310
 low-3.131
 shy-3.095
 en-3.069
Token met
Token ID1138
Feature activation+1.68e+00

Pos logit contributions
reating+2.019
ward+1.922
ained+1.802
live+1.773
tr+1.739
Neg logit contributions
 spe-1.558
mer-1.443
 se-1.429
chie-1.402
 en-1.254
Token at
Token ID379
Feature activation+5.46e-01

Pos logit contributions
ward+2.783
reating+2.737
live+2.637
tr+2.629
icated+2.524
Neg logit contributions
 spe-1.882
mer-1.865
 se-1.713
 en-1.710
chie-1.655
Token the
Token ID262
Feature activation+1.26e+00

Pos logit contributions
ward+0.916
reating+0.906
tr+0.851
live+0.838
ained+0.835
Neg logit contributions
 spe-0.641
mer-0.598
chie-0.580
 se-0.545
 en-0.488
Token bottom
Token ID4220
Feature activation-5.01e-01

Pos logit contributions
ward+1.906
reating+1.858
live+1.766
ained+1.737
icated+1.728
Neg logit contributions
 spe-1.659
chie-1.543
mer-1.541
 se-1.513
 en-1.396
Token to
Token ID284
Feature activation+1.96e+00

Pos logit contributions
ward+2.576
reating+2.495
ained+2.411
live+2.368
ouncing+2.313
Neg logit contributions
 spe-1.663
 se-1.531
chie-1.468
mer-1.452
 en-1.299
Token be
Token ID307
Feature activation+6.26e-01

Pos logit contributions
ward+3.089
reating+2.963
tr+2.900
ained+2.834
live+2.771
Neg logit contributions
 spe-2.510
chie-2.426
 se-2.375
 en-2.274
mer-2.218
Token kind
Token ID1611
Feature activation+3.99e-01

Pos logit contributions
ward+1.133
reating+1.124
tr+1.034
live+1.025
icated+1.018
Neg logit contributions
 spe-0.656
mer-0.609
chie-0.603
 se-0.548
 en-0.519
Token.
Token ID13
Feature activation+5.73e-01

Pos logit contributions
ward+0.643
reating+0.629
live+0.582
tr+0.578
ained+0.574
Neg logit contributions
 spe-0.490
mer-0.471
chie-0.454
 se-0.415
 en-0.383
Token That
Token ID1320
Feature activation+6.48e-01

Pos logit contributions
ward+0.864
reating+0.855
live+0.766
tr+0.766
ained+0.761
Neg logit contributions
 spe-0.765
mer-0.714
chie-0.702
 se-0.644
 en-0.612
Token's
Token ID338
Feature activation+3.18e+00

Pos logit contributions
ward+1.055
reating+1.039
tr+0.942
live+0.939
ained+0.935
Neg logit contributions
 spe-0.787
mer-0.741
chie-0.698
 se-0.686
 en-0.620
Token what
Token ID644
Feature activation+5.15e-01

Pos logit contributions
ops+5.147
reating+5.044
ward+4.979
ugg+4.929
opl+4.602
Neg logit contributions
 spe-4.206
 shy-4.146
 en-4.132
chie-3.808
 boss-3.681
Token makes
Token ID1838
Feature activation+6.35e-02

Pos logit contributions
ward+0.899
reating+0.885
tr+0.823
live+0.813
ained+0.811
Neg logit contributions
 spe-0.565
mer-0.516
chie-0.510
 se-0.482
 en-0.437
Token you
Token ID345
Feature activation+3.85e-01

Pos logit contributions
ward+0.115
reating+0.114
tr+0.105
live+0.104
ained+0.104
Neg logit contributions
 spe-0.066
mer-0.062
chie-0.060
 se-0.054
 en-0.048
Token happy
Token ID3772
Feature activation+6.39e-01

Pos logit contributions
ward+0.670
reating+0.665
tr+0.615
live+0.614
ained+0.603
Neg logit contributions
 spe-0.419
mer-0.392
chie-0.386
 se-0.355
 en-0.329
Token and
Token ID290
Feature activation+1.31e+00

Pos logit contributions
ward+1.041
reating+1.017
live+0.946
tr+0.934
ained+0.916
Neg logit contributions
 spe-0.773
mer-0.745
chie-0.708
 se-0.671
 en-0.623
Token families
Token ID4172
Feature activation-3.26e-01

Pos logit contributions
 spe+0.858
chie+0.820
mer+0.814
 se+0.734
 en+0.675
Neg logit contributions
ward-0.917
reating-0.904
 about-0.824
tr-0.815
ained-0.813
Token happy
Token ID3772
Feature activation+4.55e-01

Pos logit contributions
 spe+0.416
mer+0.396
chie+0.386
 se+0.347
 en+0.317
Neg logit contributions
ward-0.511
reating-0.502
 about-0.461
tr-0.453
ained-0.450
Token too
Token ID1165
Feature activation-3.61e-01

Pos logit contributions
ward+0.672
reating+0.665
live+0.607
tr+0.602
ained+0.594
Neg logit contributions
 spe-0.620
mer-0.593
chie-0.572
 se-0.539
 en-0.500
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
 spe+0.529
chie+0.501
mer+0.500
 se+0.459
 en+0.426
Neg logit contributions
ward-0.503
reating-0.485
 about-0.451
tr-0.436
ained-0.432
Token And
Token ID843
Feature activation+1.16e+00

Pos logit contributions
reating+2.291
ward+2.216
ained+2.050
icated+1.986
live+1.981
Neg logit contributions
 spe-1.716
mer-1.551
chie-1.473
 se-1.381
 en-1.347
Token so
Token ID523
Feature activation+3.17e+00

Pos logit contributions
ward+1.916
reating+1.900
tr+1.794
ained+1.778
live+1.746
Neg logit contributions
 spe-1.391
mer-1.278
chie-1.203
 se-1.170
 en-1.104
Token,
Token ID11
Feature activation+1.72e+00

Pos logit contributions
reating+5.839
ward+5.394
affle+5.013
icated+5.012
ruck+4.985
Neg logit contributions
 spe-3.708
mer-3.664
 se-3.223
 shy-3.066
 en-3.059
Token the
Token ID262
Feature activation+4.84e-01

Pos logit contributions
reating+3.122
ward+3.110
tr+2.932
ained+2.913
icated+2.882
Neg logit contributions
 spe-1.827
mer-1.585
 se-1.395
 en-1.370
chie-1.344
Token whole
Token ID2187
Feature activation+1.93e-01

Pos logit contributions
ward+0.677
reating+0.673
tr+0.606
live+0.606
ained+0.595
Neg logit contributions
 spe-0.712
mer-0.667
chie-0.656
 se-0.630
 en-0.585
Token town
Token ID3240
Feature activation-5.04e-02

Pos logit contributions
ward+0.311
reating+0.306
tr+0.281
live+0.277
ained+0.276
Neg logit contributions
 spe-0.242
mer-0.229
chie-0.223
 se-0.207
 en-0.190
Token became
Token ID2627
Feature activation+1.55e-02

Pos logit contributions
 spe+0.075
mer+0.071
chie+0.071
 se+0.065
 en+0.061
Neg logit contributions
ward-0.068
reating-0.067
tr-0.060
 about-0.060
live-0.059
Tokenummy
Token ID13513
Feature activation+1.15e+00

Pos logit contributions
 spe+0.111
mer+0.106
chie+0.105
 se+0.098
 en+0.092
Neg logit contributions
ward-0.080
reating-0.078
tr-0.069
live-0.068
 about-0.068
Token cube
Token ID23441
Feature activation-6.77e-02

Pos logit contributions
ward+1.661
reating+1.641
live+1.538
ained+1.537
tr+1.447
Neg logit contributions
 spe-1.677
mer-1.600
 se-1.525
chie-1.477
 en-1.367
Token.
Token ID13
Feature activation+8.30e-01

Pos logit contributions
 spe+0.097
mer+0.092
chie+0.091
 se+0.084
 en+0.078
Neg logit contributions
ward-0.095
reating-0.094
tr-0.085
 about-0.084
live-0.083
Token
Token ID198
Feature activation+1.21e+00

Pos logit contributions
reating+1.358
ward+1.352
live+1.204
ained+1.195
tr+1.194
Neg logit contributions
 spe-1.003
mer-0.946
chie-0.908
 se-0.867
 en-0.825
Token
Token ID198
Feature activation+2.18e+00

Pos logit contributions
reating+1.930
ward+1.897
tr+1.741
ained+1.705
icated+1.688
Neg logit contributions
 spe-1.478
mer-1.380
chie-1.349
 se-1.331
 en-1.231
TokenBut
Token ID1537
Feature activation+3.15e+00

Pos logit contributions
ward+3.531
reating+3.487
icated+3.154
tr+3.119
inea+3.099
Neg logit contributions
mer-2.654
 spe-2.548
chie-2.474
 se-2.418
 en-2.342
Token the
Token ID262
Feature activation-1.36e-01

Pos logit contributions
reating+5.871
ugg+5.706
ward+5.559
icated+5.554
live+5.495
Neg logit contributions
 spe-3.305
 en-3.035
 se-2.967
 shy-2.932
 low-2.923
Token cube
Token ID23441
Feature activation-8.94e-02

Pos logit contributions
 spe+0.175
mer+0.165
chie+0.163
 se+0.146
 en+0.134
Neg logit contributions
ward-0.213
reating-0.210
tr-0.191
 about-0.190
ained-0.189
Token is
Token ID318
Feature activation+8.02e-01

Pos logit contributions
 spe+0.145
mer+0.138
chie+0.137
 se+0.127
 en+0.119
Neg logit contributions
ward-0.109
reating-0.106
tr-0.095
 about-0.094
live-0.094
Token not
Token ID407
Feature activation-3.80e-01

Pos logit contributions
reating+1.292
ward+1.282
live+1.157
tr+1.155
icated+1.155
Neg logit contributions
 spe-1.024
mer-0.930
chie-0.911
 se-0.863
 en-0.826
Token a
Token ID257
Feature activation-6.22e-01

Pos logit contributions
 spe+0.504
mer+0.486
chie+0.477
 se+0.429
 en+0.392
Neg logit contributions
ward-0.576
reating-0.564
 about-0.522
ained-0.508
tr-0.507
Token day
Token ID1110
Feature activation-8.45e-01

Pos logit contributions
ward+0.549
reating+0.539
live+0.486
ained+0.483
tr+0.482
Neg logit contributions
 spe-0.539
mer-0.512
chie-0.499
 se-0.472
 en-0.434
Token ever
Token ID1683
Feature activation+8.55e-01

Pos logit contributions
 spe+1.284
chie+1.245
mer+1.232
 se+1.100
 en+1.062
Neg logit contributions
ward-1.125
reating-1.066
 about-1.002
icated-0.945
live-0.943
Token!
Token ID0
Feature activation+1.86e+00

Pos logit contributions
ward+1.231
reating+1.209
tr+1.142
live+1.107
ained+1.096
Neg logit contributions
 spe-1.188
mer-1.177
chie-1.095
 se-1.056
 en-0.963
Token
Token ID198
Feature activation+1.16e+00

Pos logit contributions
reating+3.257
ward+3.209
icated+2.867
live+2.831
tr+2.825
Neg logit contributions
 spe-1.900
mer-1.812
 Int-1.719
 en-1.716
chie-1.683
Token
Token ID198
Feature activation+2.24e+00

Pos logit contributions
ward+1.822
reating+1.822
tr+1.662
icated+1.622
ained+1.610
Neg logit contributions
 spe-1.389
mer-1.328
chie-1.269
 se-1.242
 en-1.160
TokenBut
Token ID1537
Feature activation+3.14e+00

Pos logit contributions
ward+3.436
reating+3.282
inea+3.102
icated+3.035
ugg+2.979
Neg logit contributions
mer-2.696
 spe-2.680
 se-2.573
 en-2.550
 low-2.503
Token soon
Token ID2582
Feature activation+2.16e+00

Pos logit contributions
reating+5.400
ugg+5.355
ward+5.159
tr+5.013
live+5.007
Neg logit contributions
 spe-3.440
 en-3.352
mer-3.331
 shy-3.182
 but-3.175
Token it
Token ID340
Feature activation-1.83e-01

Pos logit contributions
tr+3.575
reating+3.532
ward+3.470
ugg+3.379
live+3.284
Neg logit contributions
 spe-2.519
mer-2.456
 se-2.285
 en-2.243
 shy-2.186
Token was
Token ID373
Feature activation+3.52e-01

Pos logit contributions
 spe+0.289
mer+0.274
chie+0.273
 se+0.251
 en+0.237
Neg logit contributions
ward-0.232
reating-0.225
tr-0.204
live-0.203
ained-0.202
Token time
Token ID640
Feature activation+9.11e-01

Pos logit contributions
ward+0.483
reating+0.481
tr+0.430
live+0.428
icated+0.421
Neg logit contributions
 spe-0.522
mer-0.490
chie-0.484
 se-0.453
 en-0.427
Token for
Token ID329
Feature activation+1.14e+00

Pos logit contributions
reating+1.594
ward+1.581
live+1.501
tr+1.485
ained+1.442
Neg logit contributions
 spe-0.940
mer-0.880
chie-0.877
 se-0.807
 en-0.707
Token and
Token ID290
Feature activation+7.42e-01

Pos logit contributions
ward+1.815
reating+1.787
live+1.737
tr+1.681
ained+1.674
Neg logit contributions
 spe-1.340
mer-1.243
 se-1.196
chie-1.142
 en-1.072
Token free
Token ID1479
Feature activation+7.23e-01

Pos logit contributions
reating+1.329
ward+1.297
tr+1.214
live+1.204
ained+1.186
Neg logit contributions
 spe-0.802
mer-0.742
chie-0.713
 se-0.680
 en-0.633
Token.
Token ID13
Feature activation+1.83e+00

Pos logit contributions
ward+1.118
reating+1.098
live+1.038
tr+1.008
ained+1.005
Neg logit contributions
 spe-0.922
mer-0.888
chie-0.841
 se-0.791
 en-0.734
Token
Token ID198
Feature activation+1.40e+00

Pos logit contributions
reating+3.376
ward+3.313
icated+2.951
live+2.931
tr+2.911
Neg logit contributions
 spe-1.843
mer-1.707
 en-1.643
chie-1.568
 skinny-1.561
Token
Token ID198
Feature activation+2.38e+00

Pos logit contributions
reating+2.195
ward+2.152
tr+1.990
ained+1.936
icated+1.909
Neg logit contributions
 spe-1.739
mer-1.630
 se-1.563
chie-1.561
 en-1.473
TokenBut
Token ID1537
Feature activation+3.13e+00

Pos logit contributions
ward+4.015
reating+3.902
inea+3.648
icated+3.627
tr+3.563
Neg logit contributions
mer-2.684
 spe-2.564
 se-2.438
 en-2.394
chie-2.384
Token then
Token ID788
Feature activation+1.94e+00

Pos logit contributions
reating+5.735
ward+5.439
icated+5.386
ugg+5.301
ouncing+5.201
Neg logit contributions
 spe-3.436
mer-3.149
 boat-3.048
 en-3.037
 se-3.024
Token,
Token ID11
Feature activation+2.39e+00

Pos logit contributions
reating+3.186
ward+3.070
tr+2.929
ained+2.923
ugg+2.868
Neg logit contributions
 spe-2.516
mer-2.218
 se-2.159
 boat-2.066
chie-2.058
Token he
Token ID339
Feature activation+3.22e-01

Pos logit contributions
reating+4.122
ward+4.073
live+3.897
ugg+3.892
ouncing+3.867
Neg logit contributions
 spe-2.665
 se-2.368
mer-2.347
 en-2.280
 shy-2.251
Token hears
Token ID23180
Feature activation+1.12e+00

Pos logit contributions
ward+0.537
reating+0.530
tr+0.482
live+0.481
ained+0.479
Neg logit contributions
 spe-0.381
mer-0.357
chie-0.347
 se-0.324
 en-0.295
Token a
Token ID257
Feature activation-9.34e-01

Pos logit contributions
ward+1.806
reating+1.778
live+1.696
ouncing+1.662
ained+1.661
Neg logit contributions
 spe-1.330
mer-1.219
 se-1.208
chie-1.194
 en-1.100
Token "
Token ID366
Feature activation-3.32e-01

Pos logit contributions
 spe+2.112
chie+2.016
mer+1.974
 se+1.754
 blu+1.683
Neg logit contributions
ward-2.056
 about-1.982
reating-1.926
tr-1.915
ouncing-1.828
TokenYou
Token ID1639
Feature activation+3.63e-01

Pos logit contributions
 spe+0.535
mer+0.504
chie+0.497
 se+0.459
 en+0.430
Neg logit contributions
ward-0.414
reating-0.412
tr-0.362
ained-0.359
icated-0.357
Token are
Token ID389
Feature activation+9.29e-01

Pos logit contributions
ward+0.464
reating+0.458
live+0.403
ained+0.401
tr+0.399
Neg logit contributions
 spe-0.575
mer-0.546
chie-0.543
 se-0.507
 en-0.474
Token making
Token ID1642
Feature activation+9.92e-01

Pos logit contributions
ward+1.372
reating+1.355
tr+1.234
ugg+1.226
live+1.220
Neg logit contributions
 spe-1.289
chie-1.208
mer-1.196
 se-1.116
 en-1.076
Token too
Token ID1165
Feature activation+6.02e-02

Pos logit contributions
ward+1.579
reating+1.540
live+1.485
tr+1.470
ained+1.451
Neg logit contributions
 spe-1.205
chie-1.106
mer-1.095
 se-1.073
 en-1.000
Token much
Token ID881
Feature activation+3.11e+00

Pos logit contributions
ward+0.066
reating+0.064
tr+0.056
live+0.055
ained+0.055
Neg logit contributions
 spe-0.106
mer-0.102
chie-0.101
 se-0.094
 en-0.089
Token noise
Token ID7838
Feature activation+1.50e-01

Pos logit contributions
live+6.265
ward+5.841
inea+5.734
ugg+5.684
ained+5.666
Neg logit contributions
 spe-3.429
 se-3.242
 en-2.622
 skinny-2.604
mer-2.566
Token and
Token ID290
Feature activation+6.06e-01

Pos logit contributions
ward+0.224
reating+0.220
tr+0.200
live+0.199
ained+0.198
Neg logit contributions
 spe-0.203
mer-0.193
chie-0.189
 se-0.174
 en-0.161
Token you
Token ID345
Feature activation+1.03e+00

Pos logit contributions
ward+0.945
reating+0.938
live+0.868
tr+0.855
ained+0.849
Neg logit contributions
 spe-0.785
mer-0.725
chie-0.716
 se-0.675
 en-0.626
Token are
Token ID389
Feature activation+6.97e-01

Pos logit contributions
reating+1.578
ward+1.551
live+1.369
ained+1.357
 Afterwards+1.352
Neg logit contributions
 spe-1.427
mer-1.360
chie-1.325
 se-1.245
 en-1.137
Token being
Token ID852
Feature activation-5.50e-01

Pos logit contributions
ward+1.044
reating+1.034
tr+0.946
live+0.941
ained+0.920
Neg logit contributions
 spe-0.952
chie-0.885
mer-0.884
 se-0.822
 en-0.790
Token noises
Token ID26782
Feature activation+3.09e-01

Pos logit contributions
ward+1.459
reating+1.420
live+1.397
ained+1.324
tr+1.298
Neg logit contributions
 spe-1.108
mer-1.023
chie-1.000
 se-0.949
 en-0.887
Token.
Token ID13
Feature activation-8.15e-01

Pos logit contributions
ward+0.465
reating+0.456
live+0.421
tr+0.416
ained+0.415
Neg logit contributions
 spe-0.415
mer-0.394
chie-0.382
 se-0.357
 en-0.331
Token They
Token ID1119
Feature activation+1.51e-01

Pos logit contributions
 spe+1.308
chie+1.272
mer+1.243
 se+1.141
 skinny+1.036
Neg logit contributions
ward-1.052
reating-0.981
 about-0.961
ained-0.928
live-0.911
Token had
Token ID550
Feature activation-8.02e-01

Pos logit contributions
ward+0.211
reating+0.208
tr+0.186
live+0.186
ained+0.185
Neg logit contributions
 spe-0.219
mer-0.208
chie-0.205
 se-0.190
 en-0.177
Token so
Token ID523
Feature activation+1.36e+00

Pos logit contributions
mer+1.353
 spe+1.334
chie+1.313
 se+1.174
 en+1.102
Neg logit contributions
ward-0.962
reating-0.913
 about-0.866
tr-0.828
ained-0.811
Token much
Token ID881
Feature activation+3.09e+00

Pos logit contributions
reating+1.675
ward+1.655
live+1.439
tr+1.409
ained+1.364
Neg logit contributions
 spe-2.311
mer-2.109
chie-2.029
 se-1.961
 en-1.944
Token fun
Token ID1257
Feature activation+7.75e-01

Pos logit contributions
live+6.075
inea+5.978
reating+5.881
ward+5.837
ugg+5.779
Neg logit contributions
 se-3.155
 spe-3.033
 en-2.537
 skinny-2.454
 boat-2.429
Token with
Token ID351
Feature activation-1.84e-01

Pos logit contributions
reating+1.056
ward+1.044
live+0.959
tr+0.953
ained+0.931
Neg logit contributions
 spe-1.151
mer-1.094
chie-1.044
 se-1.016
 en-0.940
Token the
Token ID262
Feature activation-6.43e-01

Pos logit contributions
 spe+0.246
mer+0.237
chie+0.235
 se+0.211
 en+0.194
Neg logit contributions
ward-0.279
reating-0.274
 about-0.249
tr-0.247
ained-0.245
Token car
Token ID1097
Feature activation+4.89e-01

Pos logit contributions
 spe+0.809
mer+0.784
chie+0.758
 se+0.663
 dunno+0.608
Neg logit contributions
ward-1.021
reating-1.010
tr-0.932
 about-0.917
ained-0.916
Token.
Token ID13
Feature activation+1.29e-01

Pos logit contributions
ward+0.686
reating+0.682
live+0.621
ained+0.607
tr+0.601
Neg logit contributions
 spe-0.698
mer-0.667
chie-0.647
 se-0.607
 en-0.567
Token day
Token ID1110
Feature activation+1.72e+00

Pos logit contributions
ward+0.799
reating+0.786
tr+0.642
live+0.634
ained+0.630
Neg logit contributions
 spe-1.424
mer-1.384
chie-1.339
 se-1.304
 en-1.236
Token,
Token ID11
Feature activation-4.44e-01

Pos logit contributions
reating+2.636
ward+2.493
tr+2.366
ained+2.322
icated+2.294
Neg logit contributions
 spe-2.389
mer-2.151
 se-2.074
chie-2.036
 en-1.910
Token she
Token ID673
Feature activation-9.96e-01

Pos logit contributions
 spe+0.570
chie+0.549
mer+0.548
 se+0.474
 en+0.436
Neg logit contributions
ward-0.706
reating-0.688
 about-0.631
tr-0.626
live-0.620
Token ate
Token ID15063
Feature activation+1.82e+00

Pos logit contributions
 spe+1.349
chie+1.343
mer+1.256
 se+1.027
rier+1.024
Neg logit contributions
ward-1.478
reating-1.462
 about-1.399
live-1.381
ained-1.378
Token too
Token ID1165
Feature activation+9.09e-01

Pos logit contributions
reating+3.073
ward+3.068
ouncing+2.953
tr+2.920
live+2.891
Neg logit contributions
 spe-2.104
 se-1.950
 en-1.705
mer-1.699
 shy-1.611
Token much
Token ID881
Feature activation+3.09e+00

Pos logit contributions
ward+0.975
reating+0.969
tr+0.894
ained+0.832
live+0.815
Neg logit contributions
 spe-1.593
mer-1.488
chie-1.484
 se-1.436
 en-1.362
Token candy
Token ID18550
Feature activation+7.66e-01

Pos logit contributions
ward+4.706
tr+4.572
reating+4.552
live+4.498
nce+4.460
Neg logit contributions
 spe-4.523
 se-4.360
 skinny-3.841
 en-3.699
 shy-3.657
Token and
Token ID290
Feature activation+1.39e+00

Pos logit contributions
reating+1.096
ward+1.092
ained+0.983
tr+0.979
icated+0.973
Neg logit contributions
 spe-1.061
mer-1.023
chie-0.949
 se-0.924
 en-0.849
Token her
Token ID607
Feature activation+5.34e-01

Pos logit contributions
reating+2.287
ward+2.278
ained+2.158
tr+2.151
ouncing+2.083
Neg logit contributions
 spe-1.677
 se-1.502
mer-1.423
chie-1.314
 en-1.312
Token stomach
Token ID11384
Feature activation+3.02e-01

Pos logit contributions
ward+0.762
reating+0.760
icated+0.670
ained+0.669
tr+0.667
Neg logit contributions
 spe-0.766
mer-0.715
chie-0.699
 se-0.672
 en-0.627
Token hurt
Token ID5938
Feature activation+9.47e-01

Pos logit contributions
ward+0.462
reating+0.459
ained+0.410
live+0.409
tr+0.408
Neg logit contributions
 spe-0.401
mer-0.377
chie-0.367
 se-0.343
 en-0.315
Token it
Token ID340
Feature activation+3.25e-01

Pos logit contributions
 spe+1.062
mer+1.037
chie+1.021
 se+0.927
 skinny+0.870
Neg logit contributions
ward-0.745
reating-0.741
 about-0.658
ained-0.640
tr-0.636
Token.
Token ID13
Feature activation+6.70e-01

Pos logit contributions
ward+0.393
reating+0.391
live+0.342
tr+0.341
ained+0.339
Neg logit contributions
 spe-0.527
mer-0.506
chie-0.495
 se-0.465
 en-0.442
Token She
Token ID1375
Feature activation+2.38e-01

Pos logit contributions
reating+1.055
ward+1.046
live+0.949
nce+0.933
icated+0.932
Neg logit contributions
 spe-0.842
mer-0.778
chie-0.763
 se-0.713
 en-0.690
Token had
Token ID550
Feature activation-1.75e-01

Pos logit contributions
ward+0.348
reating+0.345
tr+0.309
ained+0.308
live+0.307
Neg logit contributions
 spe-0.329
mer-0.311
chie-0.304
 se-0.286
 en-0.264
Token so
Token ID523
Feature activation+1.52e+00

Pos logit contributions
 spe+0.212
mer+0.203
chie+0.198
 se+0.176
 en+0.161
Neg logit contributions
ward-0.290
reating-0.284
 about-0.262
tr-0.260
ained-0.259
Token much
Token ID881
Feature activation+3.09e+00

Pos logit contributions
reating+1.953
ward+1.917
tr+1.650
live+1.614
icated+1.550
Neg logit contributions
 spe-2.479
mer-2.239
chie-2.177
 se-2.156
 en-2.112
Token fun
Token ID1257
Feature activation+9.00e-01

Pos logit contributions
ward+5.953
inea+5.941
tr+5.917
reating+5.875
live+5.735
Neg logit contributions
 se-3.105
 spe-2.825
 en-2.563
 shy-2.428
 skinny-2.404
Token that
Token ID326
Feature activation+2.03e-01

Pos logit contributions
reating+1.139
ward+1.129
tr+1.037
live+1.023
ained+0.999
Neg logit contributions
 spe-1.428
mer-1.341
chie-1.302
 se-1.277
 en-1.190
Token she
Token ID673
Feature activation-5.81e-01

Pos logit contributions
ward+0.371
reating+0.366
tr+0.339
live+0.337
ained+0.335
Neg logit contributions
 spe-0.208
mer-0.192
chie-0.187
 se-0.170
 en-0.153
Token didn
Token ID1422
Feature activation-3.47e-01

Pos logit contributions
 spe+0.919
chie+0.879
mer+0.861
 se+0.771
 skinny+0.735
Neg logit contributions
ward-0.738
reating-0.708
 about-0.658
live-0.653
ained-0.641
Token't
Token ID470
Feature activation-2.53e-01

Pos logit contributions
 spe+0.440
mer+0.418
chie+0.405
 se+0.366
 en+0.335
Neg logit contributions
ward-0.556
reating-0.546
 about-0.500
tr-0.498
ained-0.497
Token They
Token ID1119
Feature activation+3.87e-01

Pos logit contributions
 spe+0.070
mer+0.067
chie+0.066
 se+0.060
 en+0.056
Neg logit contributions
ward-0.071
reating-0.070
tr-0.064
ained-0.063
 about-0.063
Token are
Token ID389
Feature activation+1.03e+00

Pos logit contributions
ward+0.561
reating+0.555
live+0.495
ained+0.494
tr+0.491
Neg logit contributions
 spe-0.548
mer-0.514
chie-0.506
 se-0.474
 en-0.440
Token happy
Token ID3772
Feature activation+7.24e-01

Pos logit contributions
reating+1.892
ward+1.880
live+1.734
icated+1.716
tr+1.712
Neg logit contributions
 spe-1.097
mer-0.968
chie-0.933
 se-0.895
 en-0.875
Token.
Token ID13
Feature activation-2.35e-02

Pos logit contributions
ward+1.165
reating+1.145
live+1.078
tr+1.066
ained+1.048
Neg logit contributions
 spe-0.908
mer-0.852
chie-0.801
 se-0.792
 en-0.715
Token And
Token ID843
Feature activation+7.14e-01

Pos logit contributions
 spe+0.034
mer+0.033
chie+0.032
 se+0.030
 en+0.028
Neg logit contributions
ward-0.033
reating-0.032
tr-0.029
ained-0.029
 about-0.029
Token so
Token ID523
Feature activation+3.09e+00

Pos logit contributions
reating+1.236
ward+1.233
live+1.143
tr+1.140
ained+1.137
Neg logit contributions
 spe-0.806
mer-0.731
chie-0.697
 se-0.677
 en-0.623
Token are
Token ID389
Feature activation+5.59e-01

Pos logit contributions
reating+5.951
ward+5.240
live+5.019
ruck+4.925
affle+4.903
Neg logit contributions
 spe-3.696
mer-3.458
 shy-3.262
 se-3.185
 en-3.125
Token Mom
Token ID11254
Feature activation-1.09e+00

Pos logit contributions
reating+0.913
ward+0.908
tr+0.836
live+0.820
ained+0.818
Neg logit contributions
 spe-0.677
mer-0.627
chie-0.611
 se-0.575
 en-0.534
Token and
Token ID290
Feature activation-1.15e+00

Pos logit contributions
 spe+1.730
chie+1.697
mer+1.624
 se+1.473
 blu+1.431
Neg logit contributions
ward-1.397
reating-1.394
 about-1.352
 done-1.240
tr-1.231
Token Dad
Token ID17415
Feature activation+2.31e-01

Pos logit contributions
 spe+1.559
mer+1.549
chie+1.537
 se+1.282
 en+1.217
Neg logit contributions
ward-1.779
reating-1.717
 about-1.559
 done-1.508
icated-1.507
Token.
Token ID13
Feature activation+2.79e-01

Pos logit contributions
ward+0.343
reating+0.338
tr+0.308
live+0.305
ained+0.304
Neg logit contributions
 spe-0.312
mer-0.297
chie-0.290
 se-0.268
 en-0.249
Token.
Token ID13
Feature activation+3.26e-01

Pos logit contributions
reating+1.628
ward+1.601
tr+1.489
live+1.460
ained+1.442
Neg logit contributions
 spe-1.380
mer-1.356
chie-1.286
 se-1.208
 en-1.134
Token You
Token ID921
Feature activation+5.03e-01

Pos logit contributions
ward+0.492
reating+0.489
live+0.437
tr+0.436
ained+0.435
Neg logit contributions
 spe-0.432
mer-0.409
chie-0.403
 se-0.369
 en-0.346
Token are
Token ID389
Feature activation+9.63e-01

Pos logit contributions
ward+0.673
reating+0.669
live+0.587
tr+0.584
ained+0.583
Neg logit contributions
 spe-0.766
mer-0.724
chie-0.721
 se-0.679
 en-0.630
Token making
Token ID1642
Feature activation+1.11e+00

Pos logit contributions
reating+1.504
ward+1.491
tr+1.364
ugg+1.354
live+1.350
Neg logit contributions
 spe-1.256
chie-1.183
mer-1.149
 se-1.096
 en-1.054
Token too
Token ID1165
Feature activation+4.51e-01

Pos logit contributions
ward+1.780
reating+1.759
live+1.689
tr+1.680
ugg+1.661
Neg logit contributions
 spe-1.301
chie-1.216
 se-1.202
mer-1.180
 en-1.124
Token much
Token ID881
Feature activation+3.09e+00

Pos logit contributions
ward+0.562
reating+0.554
tr+0.495
live+0.488
ained+0.479
Neg logit contributions
 spe-0.726
mer-0.693
chie-0.685
 se-0.640
 en-0.609
Token noise
Token ID7838
Feature activation+4.04e-01

Pos logit contributions
live+6.380
ward+6.111
inea+6.062
nce+5.964
ugg+5.911
Neg logit contributions
 se-3.138
 spe-2.869
 en-2.594
 skinny-2.476
 shy-2.386
Token.
Token ID13
Feature activation+2.02e-01

Pos logit contributions
ward+0.627
reating+0.615
live+0.565
tr+0.563
ained+0.556
Neg logit contributions
 spe-0.519
mer-0.499
chie-0.482
 se-0.443
 en-0.411
Token You
Token ID921
Feature activation+3.88e-01

Pos logit contributions
ward+0.307
reating+0.302
tr+0.272
live+0.272
ained+0.271
Neg logit contributions
 spe-0.267
mer-0.252
chie-0.248
 se-0.227
 en-0.212
Token are
Token ID389
Feature activation+9.16e-01

Pos logit contributions
ward+0.489
reating+0.483
live+0.421
tr+0.420
ained+0.418
Neg logit contributions
 spe-0.621
mer-0.591
chie-0.586
 se-0.551
 en-0.515
Token in
Token ID287
Feature activation-2.43e-01

Pos logit contributions
reating+1.462
ward+1.453
tr+1.327
live+1.322
ugg+1.311
Neg logit contributions
 spe-1.164
chie-1.084
mer-1.059
 se-1.007
 en-0.965
Tokenaren
Token ID5757
Feature activation+7.44e-01

Pos logit contributions
ward+0.374
reating+0.368
tr+0.326
ained+0.324
 about+0.322
Neg logit contributions
 spe-0.458
mer-0.439
chie-0.433
 se-0.400
 en-0.373
Token and
Token ID290
Feature activation+1.64e+00

Pos logit contributions
reating+1.075
ward+1.053
ained+0.953
tr+0.932
live+0.920
Neg logit contributions
 spe-1.060
mer-1.011
chie-0.983
 se-0.937
 en-0.874
Token Ann
Token ID5506
Feature activation+1.58e+00

Pos logit contributions
ward+2.582
reating+2.510
ained+2.430
tr+2.305
live+2.288
Neg logit contributions
 spe-2.177
chie-1.956
mer-1.914
 se-1.833
 en-1.741
Token had
Token ID550
Feature activation-2.31e-01

Pos logit contributions
reating+2.362
ward+2.271
ained+2.159
live+2.107
tr+2.051
Neg logit contributions
 spe-2.301
mer-2.134
chie-2.125
 se-2.022
 en-1.837
Token so
Token ID523
Feature activation+2.39e+00

Pos logit contributions
 spe+0.331
mer+0.321
chie+0.313
 se+0.284
 en+0.265
Neg logit contributions
ward-0.329
reating-0.321
 about-0.292
tr-0.290
ained-0.286
Token much
Token ID881
Feature activation+3.08e+00

Pos logit contributions
reating+3.509
ward+3.250
live+3.098
tr+3.069
ained+2.863
Neg logit contributions
 spe-3.774
 en-3.215
 se-3.180
 shy-3.128
mer-3.096
Token fun
Token ID1257
Feature activation+1.24e+00

Pos logit contributions
reating+5.697
tr+5.628
inea+5.613
ward+5.548
live+5.508
Neg logit contributions
 spe-3.323
 se-3.307
 en-2.650
 shy-2.596
 nervous-2.513
Token playing
Token ID2712
Feature activation-1.77e-02

Pos logit contributions
reating+1.601
ward+1.555
tr+1.459
live+1.442
inea+1.365
Neg logit contributions
 spe-1.925
mer-1.812
chie-1.761
 se-1.752
 en-1.616
Token together
Token ID1978
Feature activation+6.47e-01

Pos logit contributions
 spe+0.031
mer+0.030
chie+0.030
 se+0.028
 en+0.026
Neg logit contributions
ward-0.019
reating-0.019
tr-0.016
 about-0.016
ained-0.016
Token!
Token ID0
Feature activation+1.45e+00

Pos logit contributions
ward+0.968
reating+0.966
tr+0.877
live+0.869
ained+0.865
Neg logit contributions
 spe-0.868
mer-0.832
chie-0.791
 se-0.742
 en-0.691
Token Ann
Token ID5506
Feature activation+2.89e-01

Pos logit contributions
reating+2.321
ward+2.241
icated+2.041
live+2.041
 done+1.986
Neg logit contributions
 spe-1.851
mer-1.649
chie-1.629
 en-1.464
 Int-1.436
Token love
Token ID1842
Feature activation+9.69e-02

Pos logit contributions
reating+1.265
ward+1.254
ained+1.097
tr+1.096
live+1.089
Neg logit contributions
 spe-1.113
mer-1.063
chie-1.012
 se-0.975
 en-0.883
Token this
Token ID428
Feature activation+1.76e-01

Pos logit contributions
ward+0.154
reating+0.152
tr+0.139
live+0.138
ained+0.138
Neg logit contributions
 spe-0.122
mer-0.115
chie-0.113
 se-0.103
 en-0.094
Token movie
Token ID3807
Feature activation+1.63e-01

Pos logit contributions
ward+0.242
reating+0.238
tr+0.213
ained+0.213
live+0.213
Neg logit contributions
 spe-0.259
mer-0.247
chie-0.243
 se-0.225
 en-0.210
Token!"
Token ID2474
Feature activation+6.24e-01

Pos logit contributions
ward+0.211
reating+0.207
tr+0.185
ained+0.184
live+0.183
Neg logit contributions
 spe-0.252
mer-0.241
chie-0.237
 se-0.220
 en-0.206
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
reating+0.979
ward+0.970
live+0.888
tr+0.875
ained+0.872
Neg logit contributions
 spe-0.776
mer-0.727
chie-0.708
 se-0.684
 en-0.637
Token sn
Token ID3013
Feature activation+3.04e+00

Pos logit contributions
reating+1.694
ward+1.656
ained+1.550
live+1.537
tr+1.537
Neg logit contributions
 spe-1.185
mer-1.077
 se-1.041
chie-1.027
 en-0.965
Tokenugg
Token ID6837
Feature activation+2.26e-01

Pos logit contributions
reating+2.788
icated+2.550
ruck+1.993
tr+1.976
ward+1.964
Neg logit contributions
mer-6.509
 spe-6.213
chie-5.925
 se-5.866
 boat-5.663
Tokenled
Token ID992
Feature activation+9.31e-01

Pos logit contributions
ward+0.294
reating+0.290
tr+0.259
live+0.254
ained+0.254
Neg logit contributions
 spe-0.351
mer-0.339
chie-0.330
 se-0.309
 en-0.289
Token up
Token ID510
Feature activation+7.80e-01

Pos logit contributions
reating+1.319
ward+1.319
live+1.201
ained+1.188
tr+1.178
Neg logit contributions
 spe-1.351
mer-1.277
chie-1.213
 se-1.173
 en-1.103
Token with
Token ID351
Feature activation-1.35e-01

Pos logit contributions
reating+1.097
ward+1.086
live+0.990
icated+0.978
ained+0.975
Neg logit contributions
 spe-1.144
mer-1.077
chie-1.037
 se-1.000
 en-0.925
Token her
Token ID607
Feature activation-6.01e-01

Pos logit contributions
 spe+0.188
mer+0.181
chie+0.178
 se+0.161
 en+0.150
Neg logit contributions
ward-0.196
reating-0.193
 about-0.173
tr-0.172
ained-0.172
Tokenmy
Token ID1820
Feature activation-3.84e-01

Pos logit contributions
ward+1.982
reating+1.721
tr+1.542
 Enjoy+1.540
live+1.517
Neg logit contributions
mer-3.630
 spe-3.623
chie-3.556
 se-3.492
 skinny-3.290
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
 spe+0.507
mer+0.480
chie+0.476
 se+0.430
 en+0.396
Neg logit contributions
ward-0.582
reating-0.575
tr-0.526
 about-0.526
ained-0.517
Token You
Token ID921
Feature activation+1.60e-02

Pos logit contributions
 spe+0.181
mer+0.172
chie+0.168
 se+0.152
 en+0.137
Neg logit contributions
ward-0.263
reating-0.258
tr-0.240
 about-0.237
ained-0.235
Token ate
Token ID15063
Feature activation+1.43e+00

Pos logit contributions
ward+0.022
reating+0.021
tr+0.019
live+0.019
ained+0.019
Neg logit contributions
 spe-0.024
mer-0.023
chie-0.022
 se-0.021
 en-0.019
Token too
Token ID1165
Feature activation+9.10e-01

Pos logit contributions
ward+2.450
reating+2.407
live+2.306
tr+2.288
ouncing+2.267
Neg logit contributions
 spe-1.639
mer-1.449
chie-1.396
 se-1.340
 en-1.255
Token much
Token ID881
Feature activation+3.04e+00

Pos logit contributions
ward+1.027
reating+1.016
tr+0.930
live+0.871
ained+0.866
Neg logit contributions
 spe-1.562
mer-1.457
chie-1.453
 se-1.373
 skinny-1.326
Token candy
Token ID18550
Feature activation+6.46e-01

Pos logit contributions
live+5.516
ward+5.455
tr+5.316
reating+5.247
nce+5.231
Neg logit contributions
 spe-3.645
 se-3.180
 skinny-2.753
 shy-2.596
mer-2.578
Token,
Token ID11
Feature activation-3.66e-01

Pos logit contributions
ward+0.952
reating+0.940
ained+0.851
tr+0.848
live+0.847
Neg logit contributions
 spe-0.874
mer-0.842
chie-0.798
 se-0.754
 en-0.691
Token didn
Token ID1422
Feature activation-3.98e-01

Pos logit contributions
 spe+0.426
mer+0.410
chie+0.402
 se+0.350
 en+0.315
Neg logit contributions
ward-0.611
reating-0.607
 about-0.563
tr-0.556
ained-0.547
Token't
Token ID470
Feature activation+1.22e+00

Pos logit contributions
 spe+0.505
mer+0.489
chie+0.468
 se+0.423
 en+0.390
Neg logit contributions
ward-0.635
reating-0.623
tr-0.571
 about-0.567
live-0.563
Token you
Token ID345
Feature activation+1.17e+00

Pos logit contributions
ward+1.956
reating+1.846
ained+1.797
live+1.791
 Afterwards+1.728
Neg logit contributions
 spe-1.623
chie-1.460
 se-1.417
mer-1.415
 en-1.301
Token her
Token ID607
Feature activation-1.96e-01

Pos logit contributions
reating+1.664
ward+1.658
tr+1.602
ained+1.553
ouncing+1.540
Neg logit contributions
 spe-1.179
mer-1.032
chie-1.019
 se-0.958
 en-0.898
Token new
Token ID649
Feature activation-2.38e-01

Pos logit contributions
 spe+0.256
mer+0.245
chie+0.243
 se+0.219
 en+0.202
Neg logit contributions
ward-0.303
reating-0.295
tr-0.273
 about-0.271
live-0.268
Token home
Token ID1363
Feature activation+1.39e-01

Pos logit contributions
 spe+0.336
mer+0.321
chie+0.320
 se+0.290
 en+0.271
Neg logit contributions
ward-0.339
reating-0.332
tr-0.302
 about-0.300
ained-0.297
Token.
Token ID13
Feature activation+1.91e+00

Pos logit contributions
ward+0.197
reating+0.195
tr+0.175
live+0.174
ained+0.174
Neg logit contributions
 spe-0.197
mer-0.187
chie-0.184
 se-0.170
 en-0.158
Token Sara
Token ID24799
Feature activation+1.24e-01

Pos logit contributions
reating+3.507
ward+3.308
icated+3.067
ruck+3.050
ained+3.001
Neg logit contributions
 spe-2.074
mer-1.807
 en-1.723
 Int-1.635
 se-1.625
Token and
Token ID290
Feature activation+3.03e+00

Pos logit contributions
ward+0.172
reating+0.170
tr+0.152
ained+0.151
live+0.151
Neg logit contributions
 spe-0.179
mer-0.170
chie-0.167
 se-0.156
 en-0.145
Token Tom
Token ID4186
Feature activation+1.44e+00

Pos logit contributions
reating+5.074
ward+4.999
ruck+4.791
inea+4.660
ained+4.613
Neg logit contributions
 spe-3.853
 en-3.425
 se-3.380
 shy-3.340
 nervous-3.177
Token smiled
Token ID13541
Feature activation+4.34e-01

Pos logit contributions
reating+2.160
ward+2.018
ained+1.919
tr+1.855
live+1.829
Neg logit contributions
 spe-2.116
mer-1.910
 se-1.884
chie-1.838
 boat-1.662
Token as
Token ID355
Feature activation+1.21e-02

Pos logit contributions
reating+0.647
ward+0.645
live+0.587
tr+0.580
ained+0.576
Neg logit contributions
 spe-0.587
mer-0.555
chie-0.539
 se-0.506
 en-0.470
Token they
Token ID484
Feature activation+1.60e+00

Pos logit contributions
ward+0.021
reating+0.020
tr+0.019
live+0.019
 about+0.019
Neg logit contributions
 spe-0.014
mer-0.013
chie-0.013
 se-0.012
 en-0.010
Token watched
Token ID7342
Feature activation+8.38e-01

Pos logit contributions
reating+2.488
ward+2.357
ained+2.172
 about+2.144
ouncing+2.131
Neg logit contributions
 spe-2.367
mer-2.118
 se-2.069
chie-2.011
 boat-1.745
Token
Token ID198
Feature activation+1.41e+00

Pos logit contributions
ward+0.735
reating+0.724
tr+0.657
ained+0.643
live+0.643
Neg logit contributions
 spe-0.687
mer-0.654
chie-0.637
 se-0.595
 en-0.557
TokenOne
Token ID3198
Feature activation+1.22e+00

Pos logit contributions
ward+2.235
reating+2.172
icated+2.000
tr+1.985
 about+1.917
Neg logit contributions
 spe-1.812
mer-1.789
chie-1.680
 se-1.544
 en-1.518
Token day
Token ID1110
Feature activation+2.12e+00

Pos logit contributions
ward+1.329
reating+1.285
live+1.121
ained+1.082
 Afterwards+1.078
Neg logit contributions
 spe-2.185
mer-2.136
 se-2.043
chie-2.031
 en-1.918
Token,
Token ID11
Feature activation+1.65e+00

Pos logit contributions
reating+3.373
ward+3.209
tr+3.197
icated+3.089
ained+3.061
Neg logit contributions
 spe-2.865
mer-2.627
 se-2.467
chie-2.316
 en-2.283
Token Sara
Token ID24799
Feature activation-6.18e-01

Pos logit contributions
reating+2.761
ward+2.758
tr+2.588
icated+2.574
ained+2.534
Neg logit contributions
 spe-1.952
mer-1.757
 se-1.611
 en-1.567
chie-1.548
Token and
Token ID290
Feature activation+3.01e+00

Pos logit contributions
 spe+0.902
chie+0.863
mer+0.847
 se+0.737
 skinny+0.705
Neg logit contributions
ward-0.849
reating-0.845
 about-0.783
tr-0.779
ained-0.767
Token Tom
Token ID4186
Feature activation+5.66e-01

Pos logit contributions
ward+5.400
reating+4.974
icated+4.917
tr+4.842
ained+4.784
Neg logit contributions
 spe-3.303
 se-3.113
 shy-3.076
 en-2.945
 low-2.891
Token want
Token ID765
Feature activation-4.28e-02

Pos logit contributions
ward+0.925
reating+0.900
live+0.833
ained+0.819
tr+0.818
Neg logit contributions
 spe-0.694
mer-0.653
chie-0.642
 se-0.609
 en-0.556
Token to
Token ID284
Feature activation-1.07e+00

Pos logit contributions
 spe+0.045
mer+0.043
chie+0.042
 se+0.037
 en+0.033
Neg logit contributions
ward-0.077
reating-0.076
tr-0.070
 about-0.070
ained-0.069
Token do
Token ID466
Feature activation-1.77e-01

Pos logit contributions
mer+1.478
 spe+1.468
chie+1.378
 se+1.146
 en+1.131
Neg logit contributions
ward-1.580
reating-1.571
 about-1.423
tr-1.419
 done-1.386
Token art
Token ID1242
Feature activation-6.77e-01

Pos logit contributions
 spe+0.276
mer+0.265
chie+0.259
 se+0.237
 en+0.222
Neg logit contributions
ward-0.232
reating-0.227
 about-0.203
tr-0.200
ained-0.199
Token to
Token ID284
Feature activation+7.88e-01

Pos logit contributions
ward+0.767
reating+0.767
live+0.706
tr+0.703
icated+0.701
Neg logit contributions
 spe-0.413
chie-0.371
mer-0.371
 se-0.337
 en-0.303
Token cry
Token ID3960
Feature activation+1.18e+00

Pos logit contributions
reating+1.304
ward+1.300
live+1.179
ained+1.173
tr+1.160
Neg logit contributions
 spe-0.953
chie-0.865
mer-0.854
 se-0.830
 en-0.760
Token.
Token ID13
Feature activation+8.41e-02

Pos logit contributions
reating+1.996
ward+1.989
live+1.921
ained+1.825
nce+1.812
Neg logit contributions
 spe-1.325
mer-1.249
chie-1.228
 se-1.144
 en-1.078
Token She
Token ID1375
Feature activation+7.24e-01

Pos logit contributions
ward+0.145
reating+0.143
tr+0.131
live+0.130
ained+0.130
Neg logit contributions
 spe-0.094
mer-0.088
chie-0.087
 se-0.078
 en-0.071
Token regretted
Token ID44652
Feature activation+1.08e+00

Pos logit contributions
reating+1.211
ward+1.193
ained+1.075
live+1.052
icated+1.049
Neg logit contributions
 spe-0.887
mer-0.809
chie-0.801
 se-0.779
 en-0.706
Token not
Token ID407
Feature activation+3.00e+00

Pos logit contributions
reating+1.524
ward+1.477
tr+1.397
live+1.376
ained+1.369
Neg logit contributions
 spe-1.548
chie-1.401
mer-1.367
 se-1.356
 en-1.301
Token being
Token ID852
Feature activation-2.01e-01

Pos logit contributions
ward+5.071
ops+5.051
live+5.023
mm+5.008
 Afterwards+4.889
Neg logit contributions
 spe-3.467
chie-3.466
 se-3.401
 en-3.359
 shy-3.253
Token more
Token ID517
Feature activation+9.07e-01

Pos logit contributions
 spe+0.208
mer+0.193
chie+0.186
 se+0.166
 en+0.145
Neg logit contributions
ward-0.366
reating-0.360
 about-0.336
ained-0.334
tr-0.333
Token careful
Token ID8161
Feature activation+4.78e-01

Pos logit contributions
ward+1.676
reating+1.655
tr+1.527
icated+1.504
live+1.491
Neg logit contributions
 spe-0.937
chie-0.864
mer-0.855
 se-0.783
 en-0.748
Token.
Token ID13
Feature activation+5.58e-01

Pos logit contributions
ward+0.674
reating+0.657
tr+0.597
live+0.595
ained+0.586
Neg logit contributions
 spe-0.677
mer-0.648
chie-0.635
 se-0.587
 en-0.555
Token Her
Token ID2332
Feature activation+4.16e-01

Pos logit contributions
reating+0.981
ward+0.973
live+0.885
ained+0.878
tr+0.875
Neg logit contributions
 spe-0.599
mer-0.556
chie-0.548
 se-0.498
 en-0.477
Token
Token ID220
Feature activation+3.83e-01

Pos logit contributions
reating+0.942
ward+0.929
live+0.836
icated+0.827
tr+0.823
Neg logit contributions
 spe-0.758
mer-0.711
chie-0.695
 se-0.657
 en-0.623
Token
Token ID198
Feature activation+4.22e-01

Pos logit contributions
ward+0.544
reating+0.538
tr+0.482
live+0.477
ained+0.472
Neg logit contributions
 spe-0.529
mer-0.507
chie-0.501
 se-0.462
 en-0.436
Token
Token ID198
Feature activation+1.35e+00

Pos logit contributions
ward+0.603
reating+0.597
tr+0.536
live+0.529
ained+0.529
Neg logit contributions
 spe-0.587
mer-0.564
chie-0.556
 se-0.516
 en-0.481
TokenAfter
Token ID3260
Feature activation+2.11e+00

Pos logit contributions
ward+1.754
reating+1.746
 about+1.520
tr+1.495
icated+1.493
Neg logit contributions
 spe-2.010
chie-1.988
mer-1.983
 se-1.862
 skinny-1.810
Token he
Token ID339
Feature activation+2.32e+00

Pos logit contributions
reating+3.391
ained+3.206
tr+3.182
ouncing+3.130
live+3.096
Neg logit contributions
 spe-2.896
 se-2.711
chie-2.283
 en-2.262
 skinny-2.246
Token was
Token ID373
Feature activation+3.00e+00

Pos logit contributions
reating+3.356
ward+3.099
ained+3.063
ouncing+2.994
asc+2.779
Neg logit contributions
 spe-3.783
 se-3.470
mer-3.311
 en-3.029
 fill-3.007
Token done
Token ID1760
Feature activation+1.50e+00

Pos logit contributions
reating+5.086
tr+4.614
ouncing+4.552
icated+4.467
ward+4.455
Neg logit contributions
 spe-4.042
 shy-3.681
 low-3.680
 en-3.669
 nervous-3.632
Token chewing
Token ID36615
Feature activation+8.89e-01

Pos logit contributions
reating+1.802
ward+1.722
ained+1.711
tr+1.693
live+1.634
Neg logit contributions
 spe-2.528
 se-2.285
mer-2.229
 en-2.142
chie-2.137
Token,
Token ID11
Feature activation+8.18e-01

Pos logit contributions
reating+1.499
ward+1.491
tr+1.412
ained+1.376
live+1.372
Neg logit contributions
 spe-1.037
mer-0.974
chie-0.904
 se-0.858
 en-0.822
Token Tim
Token ID5045
Feature activation+6.94e-01

Pos logit contributions
reating+1.169
ward+1.144
live+1.058
tr+1.044
ained+1.032
Neg logit contributions
 spe-1.161
mer-1.062
chie-1.047
 se-1.041
 en-0.975
Tokenmy
Token ID1820
Feature activation+9.11e-02

Pos logit contributions
ward+0.548
reating+0.511
tr+0.407
ained+0.406
live+0.405
Neg logit contributions
 spe-1.414
mer-1.384
chie-1.368
 se-1.331
 en-1.257
Token,
Token ID11
Feature activation-3.24e-03

Pos logit contributions
reating+0.683
ward+0.681
live+0.610
ained+0.610
tr+0.609
Neg logit contributions
 spe-0.902
chie-0.843
mer-0.842
 se-0.799
 en-0.756
Token "
Token ID366
Feature activation-6.17e-01

Pos logit contributions
 spe+0.004
mer+0.003
chie+0.003
 se+0.003
 en+0.003
Neg logit contributions
ward-0.006
reating-0.006
tr-0.005
 about-0.005
ained-0.005
TokenI
Token ID40
Feature activation+1.18e+00

Pos logit contributions
 spe+1.067
mer+1.026
chie+1.025
 se+0.936
 en+0.895
Neg logit contributions
ward-0.691
reating-0.687
live-0.609
ained-0.607
tr-0.606
Token gained
Token ID8618
Feature activation+1.06e+00

Pos logit contributions
reating+1.877
ward+1.855
ained+1.653
icated+1.617
tr+1.613
Neg logit contributions
 spe-1.584
mer-1.467
chie-1.410
 se-1.404
 dunno-1.243
Token so
Token ID523
Feature activation+1.31e+00

Pos logit contributions
ward+1.684
reating+1.680
live+1.594
tr+1.570
ained+1.555
Neg logit contributions
 spe-1.318
chie-1.219
mer-1.146
 se-1.117
 en-1.064
Token much
Token ID881
Feature activation+3.00e+00

Pos logit contributions
reating+1.553
ward+1.523
tr+1.409
live+1.313
ained+1.296
Neg logit contributions
 spe-2.225
chie-2.075
mer-2.023
 se-1.976
 en-1.957
Token confidence
Token ID6628
Feature activation+1.76e-01

Pos logit contributions
live+5.476
nce+5.421
ward+5.415
tr+5.343
reating+5.314
Neg logit contributions
 spe-3.366
 se-3.162
 en-2.909
 skinny-2.866
 shy-2.731
Token!"
Token ID2474
Feature activation+1.76e+00

Pos logit contributions
ward+0.238
reating+0.235
tr+0.211
live+0.209
ained+0.209
Neg logit contributions
 spe-0.262
mer-0.249
chie-0.246
 se-0.227
 en-0.213
Token But
Token ID887
Feature activation+1.95e+00

Pos logit contributions
reating+3.015
ward+2.845
inea+2.674
live+2.654
nce+2.607
Neg logit contributions
 spe-1.970
 se-1.864
mer-1.804
 en-1.778
chie-1.718
Token then
Token ID788
Feature activation+1.64e+00

Pos logit contributions
reating+3.123
ward+2.989
tr+2.873
live+2.870
icated+2.847
Neg logit contributions
 spe-2.472
 en-2.171
mer-2.170
chie-2.164
 se-2.164
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
reating+2.678
ward+2.570
ained+2.465
tr+2.464
live+2.418
Neg logit contributions
 spe-2.085
 se-1.879
chie-1.787
mer-1.771
 en-1.726
Token that
Token ID326
Feature activation-8.32e-01

Pos logit contributions
 spe+0.546
chie+0.519
mer+0.516
 se+0.465
 skinny+0.442
Neg logit contributions
ward-0.468
reating-0.454
 about-0.415
tr-0.408
ained-0.408
Token was
Token ID373
Feature activation-5.24e-01

Pos logit contributions
chie+1.273
 spe+1.269
mer+1.230
 se+1.080
 en+1.030
Neg logit contributions
ward-1.073
reating-1.004
tr-0.966
 about-0.962
ained-0.953
Token broken
Token ID5445
Feature activation-3.56e-01

Pos logit contributions
 spe+0.660
mer+0.643
chie+0.630
 se+0.548
 dunno+0.506
Neg logit contributions
ward-0.828
reating-0.799
 about-0.754
ained-0.747
tr-0.734
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
 spe+0.515
chie+0.490
mer+0.489
 se+0.444
 en+0.416
Neg logit contributions
ward-0.497
reating-0.488
 about-0.446
tr-0.440
ained-0.437
Token The
Token ID383
Feature activation-2.57e+00

Pos logit contributions
 spe+2.366
chie+2.360
mer+2.312
 se+2.015
 blu+1.834
Neg logit contributions
ward-2.421
 about-2.282
 Afterwards-2.198
 After-2.159
tr-2.156
Token little
Token ID1310
Feature activation-3.69e+00

Pos logit contributions
 spe+3.172
chie+2.980
mer+2.972
 dunno+2.425
 Int+2.416
Neg logit contributions
ward-4.241
reating-4.020
 stories-3.892
 done-3.853
 about-3.759
Token boat
Token ID8848
Feature activation-1.05e+00

Pos logit contributions
chie+3.791
 spe+3.728
mer+3.366
 Int+3.351
ihu+3.176
Neg logit contributions
ward-6.344
reating-6.221
 end-5.991
ruck-5.976
inea-5.855
Token felt
Token ID2936
Feature activation-6.70e-01

Pos logit contributions
 spe+1.628
chie+1.550
mer+1.502
 se+1.334
 blu+1.323
Neg logit contributions
ward-1.355
 about-1.249
reating-1.226
tr-1.212
 done-1.193
Token sad
Token ID6507
Feature activation+3.48e-01

Pos logit contributions
 spe+0.756
mer+0.731
chie+0.707
 se+0.602
 dunno+0.545
Neg logit contributions
ward-1.155
reating-1.103
 about-1.077
ained-1.048
tr-1.019
Token for
Token ID329
Feature activation-8.45e-01

Pos logit contributions
ward+0.515
reating+0.509
live+0.466
tr+0.462
ained+0.455
Neg logit contributions
 spe-0.473
mer-0.450
chie-0.438
 se-0.411
 en-0.382
Token the
Token ID262
Feature activation-2.15e+00

Pos logit contributions
mer+1.171
 spe+1.152
chie+1.114
 se+0.979
 dunno+0.911
Neg logit contributions
ward-1.268
reating-1.216
 about-1.135
icated-1.094
 what-1.079
Token very
Token ID845
Feature activation-1.60e-01

Pos logit contributions
 spe+0.162
mer+0.154
chie+0.149
 se+0.134
 en+0.120
Neg logit contributions
ward-0.242
reating-0.237
 about-0.220
tr-0.218
ained-0.218
Token sad
Token ID6507
Feature activation+8.01e-01

Pos logit contributions
 spe+0.150
mer+0.140
chie+0.136
 se+0.118
 en+0.103
Neg logit contributions
ward-0.304
reating-0.298
 about-0.277
tr-0.277
ained-0.277
Token and
Token ID290
Feature activation-4.70e-01

Pos logit contributions
reating+1.280
ward+1.280
live+1.194
tr+1.178
ained+1.147
Neg logit contributions
 spe-1.000
mer-0.942
chie-0.899
 se-0.862
 en-0.792
Token missed
Token ID6825
Feature activation+2.01e-02

Pos logit contributions
 spe+0.624
mer+0.595
chie+0.585
 se+0.526
 blu+0.488
Neg logit contributions
ward-0.714
reating-0.688
 about-0.645
ained-0.635
tr-0.625
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
ward+0.030
reating+0.029
tr+0.027
 about+0.027
ained+0.026
Neg logit contributions
 spe-0.027
mer-0.026
chie-0.026
 se-0.023
 en-0.022
Token little
Token ID1310
Feature activation-3.64e+00

Pos logit contributions
mer+1.577
 spe+1.500
chie+1.495
rily+1.102
 Int+1.059
Neg logit contributions
ward-3.089
reating-2.998
tr-2.915
 about-2.897
 done-2.841
Token bird
Token ID6512
Feature activation-3.03e-01

Pos logit contributions
chie+4.646
mer+4.300
rily+3.952
 Int+3.924
venth+3.687
Neg logit contributions
ward-5.437
reating-5.350
 about-5.307
tr-5.182
inea-5.079
Tokenie
Token ID494
Feature activation-1.10e-01

Pos logit contributions
 spe+0.487
mer+0.463
chie+0.457
 se+0.427
 en+0.398
Neg logit contributions
ward-0.377
reating-0.371
 about-0.333
tr-0.329
ained-0.326
Token.
Token ID13
Feature activation-2.65e-01

Pos logit contributions
 spe+0.158
mer+0.150
chie+0.148
 se+0.136
 en+0.126
Neg logit contributions
ward-0.154
reating-0.151
tr-0.137
 about-0.136
ained-0.135
Token<|endoftext|>
Token ID50256
Feature activation+2.22e-01

Pos logit contributions
 spe+0.383
mer+0.363
chie+0.359
 se+0.331
 skinny+0.302
Neg logit contributions
ward-0.376
reating-0.363
 about-0.335
tr-0.334
ained-0.331
TokenOnce
Token ID7454
Feature activation-4.72e-01

Pos logit contributions
ward+0.310
reating+0.307
tr+0.275
live+0.272
ained+0.271
Neg logit contributions
 spe-0.317
mer-0.302
chie-0.295
 se-0.276
 en-0.256
Token boat
Token ID8848
Feature activation-9.82e-01

Pos logit contributions
 spe+1.587
chie+1.532
mer+1.424
 Int+1.097
 dunno+1.058
Neg logit contributions
ward-4.174
reating-4.163
 about-3.900
ained-3.855
ruck-3.803
Token.
Token ID13
Feature activation-9.11e-01

Pos logit contributions
 spe+1.579
chie+1.512
mer+1.479
 se+1.348
 skinny+1.278
Neg logit contributions
ward-1.256
reating-1.199
 about-1.164
tr-1.096
ained-1.066
Token So
Token ID1406
Feature activation+6.72e-01

Pos logit contributions
 spe+1.500
chie+1.441
mer+1.428
 se+1.307
 blu+1.199
Neg logit contributions
ward-1.138
 about-1.022
reating-1.021
 Afterwards-0.990
tr-0.988
Token,
Token ID11
Feature activation-7.55e-01

Pos logit contributions
reating+0.902
ward+0.853
tr+0.775
live+0.765
ained+0.764
Neg logit contributions
 spe-1.085
mer-1.013
chie-0.983
 se-0.944
 en-0.889
Token the
Token ID262
Feature activation-2.72e+00

Pos logit contributions
 spe+1.007
chie+1.002
mer+0.993
 se+0.851
 dunno+0.793
Neg logit contributions
ward-1.154
reating-1.069
 about-1.018
ained-0.988
icated-0.977
Token little
Token ID1310
Feature activation-3.52e+00

Pos logit contributions
 spe+3.087
chie+3.020
mer+2.998
 dunno+2.311
 Int+2.305
Neg logit contributions
ward-4.784
reating-4.390
 stories-4.382
 done-4.335
nce-4.294
Token boat
Token ID8848
Feature activation-1.22e+00

Pos logit contributions
chie+3.664
 spe+3.436
mer+3.128
 Int+3.015
 dunno+2.810
Neg logit contributions
ward-6.249
reating-5.950
 end-5.841
ruck-5.782
ained-5.763
Token offered
Token ID4438
Feature activation-1.82e-01

Pos logit contributions
 spe+1.859
chie+1.774
mer+1.693
 skinny+1.488
 blu+1.485
Neg logit contributions
ward-1.632
 about-1.506
reating-1.493
tr-1.465
 done-1.448
Token to
Token ID284
Feature activation-1.27e+00

Pos logit contributions
 spe+0.203
mer+0.192
chie+0.188
 se+0.165
 en+0.150
Neg logit contributions
ward-0.319
reating-0.315
 about-0.292
tr-0.288
ained-0.287
Token help
Token ID1037
Feature activation-1.16e+00

Pos logit contributions
mer+1.722
 spe+1.701
chie+1.572
 skinny+1.332
 se+1.324
Neg logit contributions
ward-1.973
reating-1.910
 about-1.811
 done-1.718
tr-1.697
Token the
Token ID262
Feature activation-2.80e+00

Pos logit contributions
 spe+1.769
mer+1.699
chie+1.693
 se+1.467
 blu+1.432
Neg logit contributions
ward-1.562
reating-1.557
 about-1.475
icated-1.364
tr-1.333
Token.
Token ID13
Feature activation-1.57e+00

Pos logit contributions
 spe+1.150
mer+1.103
chie+1.098
 se+0.997
 en+0.930
Neg logit contributions
ward-1.162
reating-1.142
 about-1.067
ained-1.033
icated-1.004
Token
Token ID198
Feature activation+3.18e-01

Pos logit contributions
chie+2.647
 spe+2.645
mer+2.573
 se+2.281
sie+2.169
Neg logit contributions
ward-1.938
 about-1.823
ained-1.728
tr-1.714
 Afterwards-1.686
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
ward+0.451
reating+0.449
tr+0.403
live+0.398
ained+0.397
Neg logit contributions
 spe-0.450
mer-0.427
chie-0.420
 se-0.393
 en-0.365
Token"
Token ID1
Feature activation-1.42e+00

Pos logit contributions
ward+0.181
reating+0.179
tr+0.161
ained+0.159
 about+0.159
Neg logit contributions
 spe-0.181
mer-0.173
chie-0.170
 se-0.157
 en-0.146
TokenHow
Token ID2437
Feature activation+3.83e-01

Pos logit contributions
 spe+2.364
chie+2.320
mer+2.269
 se+2.000
 en+1.999
Neg logit contributions
ward-1.674
reating-1.588
tr-1.523
live-1.459
ained-1.456
Token can
Token ID460
Feature activation-3.49e+00

Pos logit contributions
ward+0.500
reating+0.495
live+0.438
tr+0.435
ained+0.432
Neg logit contributions
 spe-0.590
mer-0.557
chie-0.550
 se-0.521
 en-0.486
Token we
Token ID356
Feature activation-9.68e-01

Pos logit contributions
mer+4.836
 spe+4.375
 redd+4.281
chie+4.220
 blu+3.917
Neg logit contributions
ward-5.884
 what-5.447
 about-5.304
reating-5.112
icated-4.990
Token get
Token ID651
Feature activation+3.95e-01

Pos logit contributions
 spe+1.330
mer+1.292
chie+1.235
 se+1.099
 skinny+1.076
Neg logit contributions
ward-1.392
reating-1.343
tr-1.260
 about-1.256
 done-1.212
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
ward+0.624
reating+0.618
live+0.572
tr+0.572
ained+0.559
Neg logit contributions
 spe-0.501
mer-0.468
chie-0.464
 se-0.432
 en-0.395
Token jet
Token ID12644
Feature activation-6.54e-01

Pos logit contributions
 spe+1.054
mer+1.043
chie+0.968
 se+0.742
 skinny+0.711
Neg logit contributions
ward-2.497
reating-2.448
tr-2.330
ained-2.301
 about-2.245
Token back
Token ID736
Feature activation+1.93e-01

Pos logit contributions
 spe+1.035
mer+0.983
chie+0.966
 se+0.887
 en+0.849
Neg logit contributions
ward-0.824
reating-0.819
tr-0.743
 about-0.735
ained-0.717
Token sad
Token ID6507
Feature activation+1.23e+00

Pos logit contributions
reating+1.419
ward+1.399
live+1.311
icated+1.294
tr+1.293
Neg logit contributions
 spe-0.651
chie-0.566
mer-0.560
 se-0.495
 en-0.477
Token and
Token ID290
Feature activation+2.04e-01

Pos logit contributions
ward+2.074
reating+2.056
live+1.998
tr+1.914
ained+1.892
Neg logit contributions
 spe-1.436
mer-1.326
 se-1.245
chie-1.231
 en-1.108
Token scared
Token ID12008
Feature activation+3.24e-02

Pos logit contributions
ward+0.397
reating+0.394
tr+0.365
live+0.365
ained+0.362
Neg logit contributions
 spe-0.186
mer-0.169
chie-0.166
 se-0.147
 en-0.131
Token.
Token ID13
Feature activation-1.24e+00

Pos logit contributions
ward+0.048
reating+0.047
tr+0.043
ained+0.042
 about+0.042
Neg logit contributions
 spe-0.045
mer-0.042
chie-0.042
 se-0.038
 en-0.036
Token How
Token ID1374
Feature activation-1.06e+00

Pos logit contributions
 spe+2.146
chie+2.101
mer+2.037
 se+1.911
 blu+1.763
Neg logit contributions
ward-1.427
 about-1.269
reating-1.252
tr-1.249
ained-1.233
Token can
Token ID460
Feature activation-3.45e+00

Pos logit contributions
 spe+1.843
mer+1.829
chie+1.817
 se+1.605
 skinny+1.527
Neg logit contributions
ward-1.194
 about-1.185
reating-1.153
tr-1.078
icated-1.030
Token they
Token ID484
Feature activation-1.57e-01

Pos logit contributions
mer+5.194
 spe+4.996
 redd+4.657
chie+4.635
 blu+4.247
Neg logit contributions
ward-5.456
 about-4.995
 what-4.816
reating-4.657
nce-4.592
Token get
Token ID651
Feature activation+2.66e-01

Pos logit contributions
 spe+0.211
mer+0.200
chie+0.196
 se+0.179
 en+0.166
Neg logit contributions
ward-0.236
reating-0.232
tr-0.212
 about-0.210
live-0.209
Token their
Token ID511
Feature activation-7.86e-01

Pos logit contributions
ward+0.381
reating+0.376
live+0.341
tr+0.341
ained+0.336
Neg logit contributions
 spe-0.377
mer-0.356
chie-0.353
 se-0.328
 en-0.303
Token rocket
Token ID10701
Feature activation-5.74e-01

Pos logit contributions
 spe+0.927
mer+0.908
chie+0.897
 se+0.779
 en+0.732
Neg logit contributions
ward-1.302
reating-1.253
tr-1.184
 about-1.159
ained-1.144
Token back
Token ID736
Feature activation+6.48e-02

Pos logit contributions
 spe+0.870
mer+0.826
chie+0.817
 se+0.743
 en+0.712
Neg logit contributions
ward-0.761
reating-0.753
tr-0.699
 about-0.677
ained-0.666
Token are
Token ID389
Feature activation-7.10e-01

Pos logit contributions
 spe+0.642
mer+0.617
chie+0.609
 se+0.558
 en+0.528
Neg logit contributions
ward-0.496
reating-0.484
 about-0.442
tr-0.437
live-0.431
Token you
Token ID345
Feature activation+7.62e-01

Pos logit contributions
 spe+0.848
mer+0.810
chie+0.774
 se+0.686
 skinny+0.631
Neg logit contributions
ward-1.193
reating-1.162
 about-1.083
ained-1.055
icated-1.052
Token?"
Token ID1701
Feature activation-1.70e+00

Pos logit contributions
reating+1.087
ward+1.082
tr+0.993
live+0.977
ained+0.946
Neg logit contributions
 spe-1.068
mer-1.027
chie-1.010
 se-0.950
 en-0.892
Token But
Token ID887
Feature activation-5.76e-01

Pos logit contributions
 spe+2.434
chie+2.409
mer+2.331
 blu+2.002
 se+1.962
Neg logit contributions
ward-2.432
 about-2.259
reating-2.104
 Afterwards-2.098
tr-2.091
Token the
Token ID262
Feature activation-3.16e+00

Pos logit contributions
 spe+0.758
mer+0.718
chie+0.709
 se+0.641
 skinny+0.580
Neg logit contributions
ward-0.898
reating-0.840
 about-0.800
ained-0.776
tr-0.765
Token little
Token ID1310
Feature activation-3.37e+00

Pos logit contributions
 spe+3.975
mer+3.712
chie+3.601
 dunno+3.073
 Int+2.947
Neg logit contributions
ward-5.399
 stories-4.915
 End-4.864
 end-4.780
nce-4.749
Token boat
Token ID8848
Feature activation-4.89e-01

Pos logit contributions
chie+3.229
 spe+3.219
 Int+2.761
mer+2.673
 dunno+2.656
Neg logit contributions
ward-6.155
reating-5.718
ruck-5.541
 end-5.534
 luck-5.376
Token said
Token ID531
Feature activation-7.45e-01

Pos logit contributions
 spe+0.761
chie+0.712
mer+0.709
 se+0.647
 skinny+0.610
Neg logit contributions
ward-0.629
reating-0.596
 about-0.558
tr-0.553
live-0.544
Token nothing
Token ID2147
Feature activation-9.22e-01

Pos logit contributions
 spe+1.314
mer+1.261
chie+1.248
 se+1.149
 blu+1.097
Neg logit contributions
ward-0.839
reating-0.765
 about-0.745
tr-0.692
ained-0.663
Token,
Token ID11
Feature activation-5.47e-01

Pos logit contributions
 spe+1.431
chie+1.356
mer+1.356
 se+1.228
 blu+1.165
Neg logit contributions
ward-1.204
reating-1.163
 about-1.126
 done-1.027
icated-1.025
Token only
Token ID691
Feature activation+5.18e-01

Pos logit contributions
 spe+0.779
mer+0.746
chie+0.744
 se+0.658
 skinny+0.612
Neg logit contributions
ward-0.788
reating-0.763
 about-0.721
ained-0.689
icated-0.680
Token?"
Token ID1701
Feature activation-1.36e+00

Pos logit contributions
ward+0.512
reating+0.505
tr+0.451
live+0.447
ained+0.443
Neg logit contributions
 spe-0.551
mer-0.527
chie-0.518
 se-0.485
 en-0.456
Token asked
Token ID1965
Feature activation-4.67e-01

Pos logit contributions
 spe+1.906
chie+1.812
mer+1.791
 se+1.450
 blu+1.420
Neg logit contributions
ward-2.014
reating-1.851
 about-1.841
 Afterwards-1.751
ained-1.746
Token Sammy
Token ID44067
Feature activation-7.04e-01

Pos logit contributions
 spe+0.676
mer+0.642
chie+0.635
 se+0.577
 en+0.534
Neg logit contributions
ward-0.654
reating-0.637
 about-0.580
live-0.574
ained-0.572
Token in
Token ID287
Feature activation-9.40e-01

Pos logit contributions
 spe+1.061
mer+1.006
chie+0.989
 se+0.896
 blu+0.835
Neg logit contributions
ward-0.952
reating-0.923
 about-0.872
tr-0.825
live-0.814
Token a
Token ID257
Feature activation-2.61e+00

Pos logit contributions
 spe+1.362
mer+1.354
chie+1.345
 se+1.186
 blu+1.108
Neg logit contributions
ward-1.298
reating-1.248
 about-1.174
tr-1.112
live-1.090
Token tiny
Token ID7009
Feature activation-3.36e+00

Pos logit contributions
mer+3.133
 spe+2.955
chie+2.893
rily+2.556
rier+2.451
Neg logit contributions
ward-4.170
 about-3.943
 done-3.899
 stories-3.851
tr-3.841
Token voice
Token ID3809
Feature activation-2.29e-01

Pos logit contributions
mer+3.918
 spe+3.910
chie+3.690
 redd+3.394
 blu+3.264
Neg logit contributions
 about-5.043
reating-4.829
 stories-4.817
 what-4.795
ward-4.794
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
 spe+0.329
mer+0.311
chie+0.309
 se+0.280
 en+0.261
Neg logit contributions
ward-0.322
reating-0.318
 about-0.289
tr-0.287
ained-0.281
Token
Token ID198
Feature activation-8.50e-01

Pos logit contributions
 spe+1.807
mer+1.762
chie+1.740
 se+1.552
 blu+1.463
Neg logit contributions
ward-1.324
reating-1.182
 about-1.149
tr-1.098
 Afterwards-1.084
Token
Token ID198
Feature activation-1.10e+00

Pos logit contributions
 spe+1.436
mer+1.365
chie+1.343
 se+1.226
 blu+1.171
Neg logit contributions
ward-1.010
reating-0.964
 about-0.893
live-0.877
tr-0.865
Token"
Token ID1
Feature activation-1.39e+00

Pos logit contributions
 spe+1.827
mer+1.669
chie+1.635
 se+1.509
 blu+1.457
Neg logit contributions
ward-1.366
reating-1.296
live-1.216
tr-1.182
nce-1.163
Token and
Token ID290
Feature activation+6.42e-02

Pos logit contributions
ward+0.858
reating+0.849
live+0.783
tr+0.775
ained+0.757
Neg logit contributions
 spe-0.771
mer-0.735
chie-0.711
 se-0.680
 en-0.634
Token wanted
Token ID2227
Feature activation+2.98e-01

Pos logit contributions
ward+0.107
reating+0.105
tr+0.097
live+0.096
ained+0.096
Neg logit contributions
 spe-0.076
mer-0.072
chie-0.070
 se-0.064
 en-0.058
Token to
Token ID284
Feature activation-4.33e-01

Pos logit contributions
ward+0.554
reating+0.547
tr+0.508
live+0.508
ained+0.506
Neg logit contributions
 spe-0.296
mer-0.268
chie-0.265
 se-0.239
 en-0.214
Token know
Token ID760
Feature activation+1.26e+00

Pos logit contributions
 spe+0.446
mer+0.434
chie+0.411
 se+0.352
 en+0.320
Neg logit contributions
ward-0.783
reating-0.771
 about-0.716
tr-0.713
ained-0.702
Token why
Token ID1521
Feature activation-4.09e-01

Pos logit contributions
ward+1.358
live+1.302
reating+1.270
ained+1.254
ugg+1.220
Neg logit contributions
 spe-2.205
chie-2.021
 se-1.986
mer-1.960
 en-1.925
Token the
Token ID262
Feature activation-3.33e+00

Pos logit contributions
 spe+0.552
mer+0.535
chie+0.529
 se+0.479
 en+0.441
Neg logit contributions
ward-0.607
reating-0.592
 about-0.547
tr-0.539
ained-0.533
Token airplane
Token ID19401
Feature activation-5.23e-01

Pos logit contributions
mer+5.337
chie+4.700
 spe+4.495
rily+4.033
 dunno+3.784
Neg logit contributions
reating-5.010
ward-5.005
 stories-4.968
 done-4.927
ruck-4.672
Token could
Token ID714
Feature activation-1.35e+00

Pos logit contributions
 spe+0.760
chie+0.730
mer+0.726
 se+0.652
 skinny+0.615
Neg logit contributions
ward-0.707
reating-0.687
tr-0.648
 about-0.642
ained-0.631
Token fly
Token ID6129
Feature activation-1.20e+00

Pos logit contributions
mer+1.629
 spe+1.564
chie+1.540
 se+1.286
 blu+1.242
Neg logit contributions
ward-2.213
reating-2.141
 about-2.058
tr-2.041
live-1.939
Token but
Token ID475
Feature activation+5.12e-02

Pos logit contributions
 spe+1.950
chie+1.851
mer+1.826
 se+1.705
 dunno+1.631
Neg logit contributions
reating-1.442
ward-1.428
 about-1.366
ained-1.296
tr-1.277
Token not
Token ID407
Feature activation-3.71e-01

Pos logit contributions
ward+0.082
reating+0.081
tr+0.074
live+0.074
ained+0.074
Neg logit contributions
 spe-0.064
mer-0.060
chie-0.059
 se-0.054
 en-0.049
Token is
Token ID318
Feature activation-2.61e-01

Pos logit contributions
 spe+0.465
mer+0.445
chie+0.441
 se+0.411
 en+0.388
Neg logit contributions
ward-0.336
reating-0.323
tr-0.295
 about-0.291
ained-0.287
Token getting
Token ID1972
Feature activation-3.81e-01

Pos logit contributions
 spe+0.357
mer+0.344
chie+0.335
 se+0.308
 en+0.284
Neg logit contributions
ward-0.383
reating-0.373
 about-0.341
tr-0.340
ained-0.339
Token wet
Token ID9583
Feature activation-2.91e-01

Pos logit contributions
 spe+0.474
mer+0.459
chie+0.444
 se+0.395
 en+0.367
Neg logit contributions
ward-0.605
reating-0.594
ained-0.544
 about-0.540
tr-0.538
Token.
Token ID13
Feature activation-7.36e-01

Pos logit contributions
 spe+0.445
chie+0.423
mer+0.423
 se+0.386
 en+0.365
Neg logit contributions
ward-0.380
reating-0.376
 about-0.335
ained-0.334
tr-0.333
Token What
Token ID1867
Feature activation-7.54e-01

Pos logit contributions
 spe+0.761
mer+0.729
chie+0.707
 se+0.634
 en+0.550
Neg logit contributions
ward-1.325
reating-1.277
tr-1.234
 about-1.215
ained-1.200
Token can
Token ID460
Feature activation-3.33e+00

Pos logit contributions
 spe+1.182
mer+1.164
chie+1.159
 se+1.023
 en+0.963
Neg logit contributions
ward-0.958
reating-0.925
 about-0.890
tr-0.856
ained-0.839
Token we
Token ID356
Feature activation-1.02e+00

Pos logit contributions
mer+4.562
 spe+4.203
chie+3.806
 redd+3.513
 blu+3.429
Neg logit contributions
ward-5.765
nce-5.072
reating-5.057
 about-5.056
mm-5.042
Token do
Token ID466
Feature activation-1.16e+00

Pos logit contributions
 spe+1.496
mer+1.459
chie+1.387
 se+1.232
 skinny+1.206
Neg logit contributions
ward-1.391
reating-1.334
 about-1.258
tr-1.237
 done-1.231
Token?"
Token ID1701
Feature activation-7.41e-01

Pos logit contributions
 spe+2.080
chie+1.993
mer+1.952
 se+1.783
 blu+1.724
Neg logit contributions
ward-1.320
reating-1.265
 about-1.257
ained-1.129
 done-1.121
Token Cow
Token ID10417
Feature activation+6.36e-01

Pos logit contributions
 spe+1.131
chie+1.111
mer+1.105
 se+0.959
 blu+0.904
Neg logit contributions
ward-0.991
reating-0.917
tr-0.862
 about-0.861
ained-0.852
Token said
Token ID531
Feature activation-5.90e-01

Pos logit contributions
reating+0.900
ward+0.887
ained+0.799
tr+0.785
live+0.770
Neg logit contributions
 spe-0.917
mer-0.886
chie-0.857
 se-0.812
 en-0.750
Token pigeon
Token ID46387
Feature activation+7.47e-02

Pos logit contributions
chie+2.802
 spe+2.768
mer+2.731
 dunno+2.322
 Int+2.297
Neg logit contributions
ward-3.707
reating-3.655
 about-3.511
 done-3.478
 stories-3.388
Token wanted
Token ID2227
Feature activation-3.66e-01

Pos logit contributions
ward+0.120
reating+0.119
tr+0.108
live+0.107
ained+0.107
Neg logit contributions
 spe-0.092
mer-0.087
chie-0.086
 se-0.078
 en-0.072
Token to
Token ID284
Feature activation-2.96e-01

Pos logit contributions
 spe+0.412
mer+0.397
chie+0.389
 se+0.333
 en+0.307
Neg logit contributions
ward-0.632
reating-0.628
 about-0.584
tr-0.568
ained-0.563
Token help
Token ID1037
Feature activation-1.16e+00

Pos logit contributions
 spe+0.332
mer+0.319
chie+0.306
 se+0.268
 en+0.247
Neg logit contributions
ward-0.510
reating-0.504
 about-0.463
tr-0.462
live-0.457
Token the
Token ID262
Feature activation-2.55e+00

Pos logit contributions
 spe+1.831
chie+1.762
mer+1.760
 se+1.544
 dunno+1.484
Neg logit contributions
reating-1.499
ward-1.460
 about-1.419
icated-1.294
 what-1.290
Token little
Token ID1310
Feature activation-3.32e+00

Pos logit contributions
mer+3.172
 spe+3.160
chie+3.059
 dunno+2.515
ihu+2.369
Neg logit contributions
reating-4.069
ward-4.058
 done-3.914
 about-3.890
 stories-3.845
Token bird
Token ID6512
Feature activation-5.95e-01

Pos logit contributions
chie+4.741
 spe+4.289
mer+4.191
 dunno+3.906
ihu+3.783
Neg logit contributions
reating-4.850
ward-4.740
 about-4.384
 done-4.331
 what-4.312
Token,
Token ID11
Feature activation-1.52e+00

Pos logit contributions
 spe+0.968
mer+0.928
chie+0.910
 se+0.835
 skinny+0.783
Neg logit contributions
ward-0.732
reating-0.724
 about-0.668
tr-0.633
icated-0.621
Token so
Token ID523
Feature activation-3.32e-01

Pos logit contributions
mer+2.151
 spe+2.085
chie+1.987
 se+1.613
ihu+1.554
Neg logit contributions
 about-2.408
ward-2.374
reating-2.348
icated-2.162
 what-2.114
Token he
Token ID339
Feature activation-7.95e-01

Pos logit contributions
 spe+0.455
mer+0.440
chie+0.439
 se+0.385
 en+0.359
Neg logit contributions
ward-0.494
reating-0.476
 about-0.437
tr-0.431
ained-0.430
Token told
Token ID1297
Feature activation-7.94e-01

Pos logit contributions
 spe+1.104
mer+1.061
chie+1.052
 se+0.890
 blu+0.876
Neg logit contributions
ward-1.141
reating-1.084
tr-1.023
 about-1.014
live-1.001
Token said
Token ID531
Feature activation-1.39e+00

Pos logit contributions
 spe+1.063
chie+1.009
mer+0.997
 se+0.903
 blu+0.863
Neg logit contributions
ward-0.723
reating-0.676
 about-0.643
tr-0.627
live-0.617
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
 spe+2.351
chie+2.204
mer+2.182
 se+2.002
 blu+1.928
Neg logit contributions
ward-1.711
 about-1.623
reating-1.562
tr-1.437
 done-1.380
Token "
Token ID366
Feature activation-1.67e+00

Pos logit contributions
 spe+1.611
chie+1.516
mer+1.484
 se+1.342
 blu+1.253
Neg logit contributions
ward-1.740
 about-1.606
reating-1.564
tr-1.523
 Afterwards-1.509
TokenBut
Token ID1537
Feature activation-2.52e-01

Pos logit contributions
 spe+2.366
chie+2.238
mer+2.180
 se+1.933
ihu+1.829
Neg logit contributions
ward-2.435
reating-2.364
tr-2.273
ained-2.157
After-2.149
Token how
Token ID703
Feature activation-5.70e-01

Pos logit contributions
 spe+0.280
mer+0.266
chie+0.259
 se+0.230
 en+0.209
Neg logit contributions
ward-0.437
reating-0.427
tr-0.396
 about-0.395
ained-0.389
Token can
Token ID460
Feature activation-3.31e+00

Pos logit contributions
 spe+0.921
mer+0.890
chie+0.878
 se+0.803
 en+0.749
Neg logit contributions
ward-0.706
reating-0.690
 about-0.654
tr-0.622
ained-0.621
Token we
Token ID356
Feature activation-1.08e+00

Pos logit contributions
mer+4.384
 spe+3.875
chie+3.806
 redd+3.501
rily+3.305
Neg logit contributions
ward-5.628
reating-5.167
 what-5.027
 about-4.961
icated-4.958
Token fly
Token ID6129
Feature activation-4.69e-01

Pos logit contributions
 spe+1.330
mer+1.290
chie+1.210
 se+1.079
 skinny+1.037
Neg logit contributions
ward-1.717
reating-1.647
tr-1.563
 about-1.548
ained-1.523
Token in
Token ID287
Feature activation-1.16e+00

Pos logit contributions
 spe+0.678
mer+0.641
chie+0.637
 se+0.581
 en+0.545
Neg logit contributions
ward-0.653
reating-0.643
 about-0.589
tr-0.584
ained-0.581
Token a
Token ID257
Feature activation-2.07e+00

Pos logit contributions
mer+1.620
 spe+1.602
chie+1.579
 se+1.386
 dunno+1.296
Neg logit contributions
ward-1.699
reating-1.654
 about-1.523
ained-1.466
icated-1.458
Token jet
Token ID12644
Feature activation-7.10e-01

Pos logit contributions
mer+2.409
 spe+2.359
chie+2.279
rily+1.983
 dunno+1.935
Neg logit contributions
ward-3.380
reating-3.281
tr-3.189
THE-3.004
 done-2.962
Token is
Token ID318
Feature activation-6.77e-01

Pos logit contributions
 spe+1.897
chie+1.833
mer+1.809
 se+1.682
 blu+1.600
Neg logit contributions
ward-1.096
reating-1.013
 about-1.003
tr-0.977
live-0.961
Token not
Token ID407
Feature activation-1.01e-01

Pos logit contributions
 spe+0.896
mer+0.891
chie+0.859
 se+0.777
 dunno+0.710
Neg logit contributions
ward-1.014
reating-0.975
 about-0.928
tr-0.899
ained-0.895
Token happy
Token ID3772
Feature activation+1.43e-01

Pos logit contributions
 spe+0.127
mer+0.121
chie+0.118
 se+0.107
 en+0.098
Neg logit contributions
ward-0.160
reating-0.157
 about-0.144
tr-0.143
ained-0.143
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
ward+0.218
reating+0.214
tr+0.195
live+0.194
ained+0.193
Neg logit contributions
 spe-0.190
mer-0.180
chie-0.176
 se-0.162
 en-0.150
Token What
Token ID1867
Feature activation-3.05e-01

Pos logit contributions
 spe+1.252
mer+1.225
chie+1.213
 se+1.103
 en+0.938
Neg logit contributions
ward-1.904
tr-1.813
reating-1.800
 about-1.764
icated-1.701
Token can
Token ID460
Feature activation-3.30e+00

Pos logit contributions
 spe+0.483
mer+0.468
chie+0.464
 se+0.420
 en+0.395
Neg logit contributions
ward-0.383
reating-0.377
 about-0.343
tr-0.338
live-0.332
Token I
Token ID314
Feature activation-5.12e-01

Pos logit contributions
mer+5.072
 spe+4.598
chie+4.173
 redd+3.746
 blu+3.713
Neg logit contributions
ward-5.142
 about-4.799
reating-4.712
icated-4.681
 what-4.666
Token do
Token ID466
Feature activation-1.13e+00

Pos logit contributions
 spe+0.787
mer+0.757
chie+0.746
 se+0.668
 skinny+0.634
Neg logit contributions
ward-0.666
reating-0.652
 about-0.598
tr-0.594
live-0.577
Token?"
Token ID1701
Feature activation-8.46e-02

Pos logit contributions
 spe+2.050
chie+1.935
mer+1.901
 se+1.735
 blu+1.673
Neg logit contributions
ward-1.270
reating-1.264
 about-1.239
 done-1.089
ained-1.077
Token She
Token ID1375
Feature activation-1.03e+00

Pos logit contributions
 spe+0.107
mer+0.101
chie+0.100
 se+0.089
 en+0.082
Neg logit contributions
ward-0.136
reating-0.133
tr-0.122
 about-0.121
live-0.121
Token thought
Token ID1807
Feature activation-7.76e-01

Pos logit contributions
 spe+1.506
mer+1.470
chie+1.463
 blu+1.196
 se+1.169
Neg logit contributions
ward-1.441
reating-1.330
 about-1.319
tr-1.313
live-1.264
Token and
Token ID290
Feature activation+2.05e-01

Pos logit contributions
ward+0.127
reating+0.125
tr+0.113
live+0.113
ained+0.113
Neg logit contributions
 spe-0.118
mer-0.112
chie-0.110
 se-0.101
 en-0.094
Token our
Token ID674
Feature activation-4.53e-01

Pos logit contributions
ward+0.318
reating+0.314
tr+0.285
live+0.284
ained+0.284
Neg logit contributions
 spe-0.265
mer-0.249
chie-0.245
 se-0.227
 en-0.207
Token corn
Token ID11676
Feature activation-2.33e-01

Pos logit contributions
 spe+0.585
mer+0.561
chie+0.551
 se+0.496
 en+0.465
Neg logit contributions
ward-0.693
reating-0.679
tr-0.630
 about-0.616
ained-0.610
Token.
Token ID13
Feature activation-7.67e-02

Pos logit contributions
 spe+0.292
mer+0.275
chie+0.273
 se+0.247
 en+0.230
Neg logit contributions
ward-0.369
reating-0.360
tr-0.333
 about-0.328
ained-0.326
Token What
Token ID1867
Feature activation-4.33e-01

Pos logit contributions
 spe+0.099
mer+0.094
chie+0.093
 se+0.084
 en+0.077
Neg logit contributions
ward-0.119
reating-0.117
tr-0.107
ained-0.106
 about-0.106
Token can
Token ID460
Feature activation-3.30e+00

Pos logit contributions
 spe+0.725
mer+0.706
chie+0.703
 se+0.634
 en+0.602
Neg logit contributions
ward-0.506
reating-0.495
 about-0.455
tr-0.441
icated-0.435
Token we
Token ID356
Feature activation-4.72e-01

Pos logit contributions
mer+5.172
 spe+5.040
chie+4.621
 redd+4.199
 blu+4.087
Neg logit contributions
ward-4.823
 about-4.505
reating-4.448
mm-4.363
 what-4.322
Token do
Token ID466
Feature activation-2.85e-01

Pos logit contributions
 spe+0.727
mer+0.697
chie+0.687
 se+0.623
 en+0.594
Neg logit contributions
ward-0.612
reating-0.596
 about-0.542
tr-0.541
ained-0.530
Token?"
Token ID1701
Feature activation-1.03e+00

Pos logit contributions
 spe+0.463
mer+0.440
chie+0.439
 se+0.400
 en+0.378
Neg logit contributions
ward-0.355
reating-0.347
 about-0.312
tr-0.305
ained-0.305
Token
Token ID198
Feature activation-4.75e-01

Pos logit contributions
 spe+1.756
chie+1.713
mer+1.702
 se+1.485
 blu+1.434
Neg logit contributions
ward-1.235
reating-1.133
 about-1.095
 Afterwards-1.029
live-1.028
Token
Token ID198
Feature activation-3.48e-01

Pos logit contributions
 spe+0.763
mer+0.731
chie+0.722
 se+0.656
 en+0.622
Neg logit contributions
ward-0.595
reating-0.578
live-0.520
 about-0.517
tr-0.510
Token the
Token ID262
Feature activation-1.54e+00

Pos logit contributions
 spe+0.806
mer+0.787
chie+0.760
 se+0.696
 skinny+0.647
Neg logit contributions
reating-0.780
ward-0.778
 about-0.706
live-0.675
tr-0.672
Token bird
Token ID6512
Feature activation+3.62e-01

Pos logit contributions
mer+1.267
 spe+1.251
chie+1.141
 dunno+0.882
 se+0.877
Neg logit contributions
ward-3.057
reating-2.850
tr-2.824
nce-2.708
 about-2.706
Token,
Token ID11
Feature activation-4.46e-01

Pos logit contributions
ward+0.446
reating+0.441
ained+0.391
live+0.390
tr+0.389
Neg logit contributions
 spe-0.583
mer-0.556
chie-0.550
 se-0.514
 en-0.485
Token "
Token ID366
Feature activation-3.03e-01

Pos logit contributions
 spe+0.543
mer+0.521
chie+0.507
 se+0.454
 skinny+0.406
Neg logit contributions
ward-0.733
reating-0.700
 about-0.664
tr-0.649
ained-0.638
TokenWhat
Token ID2061
Feature activation-5.02e-01

Pos logit contributions
 spe+0.427
mer+0.405
chie+0.397
 se+0.365
 en+0.343
Neg logit contributions
ward-0.437
reating-0.433
tr-0.390
live-0.386
ained-0.385
Token can
Token ID460
Feature activation-3.30e+00

Pos logit contributions
 spe+0.838
mer+0.823
chie+0.806
 se+0.734
 en+0.693
Neg logit contributions
ward-0.588
reating-0.575
 about-0.531
tr-0.513
icated-0.509
Token I
Token ID314
Feature activation-3.29e-01

Pos logit contributions
mer+4.717
 spe+4.516
chie+3.842
 redd+3.715
rily+3.695
Neg logit contributions
ward-5.226
 about-5.039
 what-4.810
 done-4.694
reating-4.670
Token do
Token ID466
Feature activation-9.70e-01

Pos logit contributions
 spe+0.516
mer+0.495
chie+0.484
 se+0.444
 en+0.419
Neg logit contributions
ward-0.421
reating-0.411
 about-0.374
tr-0.371
live-0.365
Token to
Token ID284
Feature activation-4.96e-01

Pos logit contributions
 spe+1.640
chie+1.554
mer+1.539
 se+1.408
 blu+1.338
Neg logit contributions
ward-1.166
reating-1.134
 about-1.095
live-0.978
ained-0.977
Token get
Token ID651
Feature activation+3.98e-01

Pos logit contributions
 spe+0.686
mer+0.669
chie+0.642
 se+0.575
 en+0.543
Neg logit contributions
ward-0.727
reating-0.715
 about-0.650
live-0.633
icated-0.628
Token to
Token ID284
Feature activation-4.58e-02

Pos logit contributions
ward+0.639
reating+0.634
tr+0.587
live+0.583
ained+0.571
Neg logit contributions
 spe-0.493
mer-0.462
chie-0.461
 se-0.423
 en-0.386
Token
Token ID198
Feature activation-1.13e+00

Pos logit contributions
 spe+2.463
chie+2.342
mer+2.289
 se+2.066
 blu+2.048
Neg logit contributions
ward-1.435
reating-1.358
 about-1.312
live-1.292
tr-1.247
Token"
Token ID1
Feature activation-1.85e+00

Pos logit contributions
 spe+1.923
chie+1.784
mer+1.739
 se+1.614
 en+1.560
Neg logit contributions
ward-1.360
reating-1.312
live-1.214
tr-1.186
 about-1.176
TokenOK
Token ID11380
Feature activation-7.81e-01

Pos logit contributions
 spe+2.987
chie+2.816
mer+2.793
 se+2.487
rily+2.468
Neg logit contributions
ward-2.272
reating-2.153
tr-2.085
After-2.058
live-1.977
Token,
Token ID11
Feature activation-4.53e-01

Pos logit contributions
 spe+1.414
mer+1.364
chie+1.347
 se+1.254
 en+1.181
Neg logit contributions
ward-0.837
reating-0.789
 about-0.732
tr-0.676
icated-0.659
Token what
Token ID644
Feature activation+1.05e+00

Pos logit contributions
 spe+0.422
mer+0.406
chie+0.392
 se+0.328
 en+0.291
Neg logit contributions
ward-0.868
reating-0.854
 about-0.807
tr-0.787
icated-0.785
Token can
Token ID460
Feature activation-3.29e+00

Pos logit contributions
ward+1.536
reating+1.505
live+1.396
ugg+1.360
ained+1.356
Neg logit contributions
 spe-1.472
mer-1.314
 se-1.305
chie-1.297
 en-1.200
Token we
Token ID356
Feature activation-1.40e+00

Pos logit contributions
mer+4.379
 spe+4.085
chie+3.725
 redd+3.475
rier+3.290
Neg logit contributions
ward-5.656
 what-5.438
 about-5.433
icated-5.325
reating-5.200
Token make
Token ID787
Feature activation-2.25e-01

Pos logit contributions
 spe+2.197
mer+2.146
chie+2.026
rily+1.766
 se+1.750
Neg logit contributions
ward-1.768
reating-1.745
tr-1.628
 about-1.626
icated-1.597
Token?"
Token ID1701
Feature activation-1.44e+00

Pos logit contributions
 spe+0.334
mer+0.321
chie+0.315
 se+0.285
 en+0.269
Neg logit contributions
ward-0.309
reating-0.304
 about-0.274
tr-0.271
ained-0.268
Token Sam
Token ID3409
Feature activation-8.52e-01

Pos logit contributions
 spe+2.657
chie+2.597
mer+2.568
 se+2.225
rily+2.181
Neg logit contributions
reating-1.492
ward-1.473
 about-1.431
ained-1.258
 done-1.255
Token asks
Token ID7893
Feature activation-6.33e-01

Pos logit contributions
 spe+1.126
chie+1.047
mer+1.045
 se+0.894
 blu+0.818
Neg logit contributions
reating-1.328
ward-1.307
 about-1.259
icated-1.194
tr-1.190
Token boat
Token ID8848
Feature activation-3.88e-01

Pos logit contributions
 spe+1.120
chie+1.000
mer+0.988
 skinny+0.711
 Int+0.693
Neg logit contributions
reating-2.814
ward-2.762
 about-2.644
 stories-2.525
ained-2.516
Token.
Token ID13
Feature activation-5.55e-01

Pos logit contributions
 spe+0.567
mer+0.533
chie+0.530
 se+0.484
 en+0.450
Neg logit contributions
ward-0.541
reating-0.516
 about-0.481
tr-0.477
ained-0.462
Token
Token ID198
Feature activation+5.58e-01

Pos logit contributions
 spe+0.822
mer+0.783
chie+0.782
 se+0.705
 en+0.648
Neg logit contributions
ward-0.781
reating-0.726
 about-0.690
tr-0.680
ained-0.666
Token
Token ID198
Feature activation-2.21e-01

Pos logit contributions
reating+0.803
ward+0.797
tr+0.716
ained+0.710
live+0.700
Neg logit contributions
 spe-0.770
mer-0.735
chie-0.716
 se-0.680
 en-0.628
TokenThe
Token ID464
Feature activation-2.00e+00

Pos logit contributions
 spe+0.324
mer+0.305
chie+0.301
 se+0.277
 en+0.258
Neg logit contributions
ward-0.312
reating-0.304
live-0.274
tr-0.274
 about-0.271
Token little
Token ID1310
Feature activation-3.29e+00

Pos logit contributions
 spe+2.650
mer+2.496
chie+2.488
 dunno+2.062
 Int+1.976
Neg logit contributions
ward-3.164
reating-2.956
 stories-2.822
nce-2.771
 about-2.742
Token boat
Token ID8848
Feature activation-3.24e-01

Pos logit contributions
 spe+3.370
chie+3.359
mer+2.919
 Int+2.600
 dunno+2.540
Neg logit contributions
ward-5.984
reating-5.729
 luck-5.449
ained-5.420
ruck-5.414
Token breathed
Token ID39545
Feature activation-3.50e-01

Pos logit contributions
 spe+0.480
chie+0.454
mer+0.453
 se+0.409
 en+0.386
Neg logit contributions
ward-0.441
reating-0.421
 about-0.388
tr-0.387
live-0.380
Token a
Token ID257
Feature activation+4.36e-01

Pos logit contributions
 spe+0.518
mer+0.499
chie+0.496
 se+0.448
 en+0.419
Neg logit contributions
ward-0.474
reating-0.456
 about-0.416
ained-0.412
tr-0.407
Token sigh
Token ID19680
Feature activation+1.35e-01

Pos logit contributions
reating+0.656
ward+0.653
ained+0.600
live+0.599
tr+0.590
Neg logit contributions
 spe-0.593
mer-0.549
chie-0.542
 se-0.512
 en-0.468
Token of
Token ID286
Feature activation+1.07e+00

Pos logit contributions
ward+0.179
reating+0.177
tr+0.158
live+0.157
ained+0.156
Neg logit contributions
 spe-0.204
mer-0.195
chie-0.191
 se-0.178
 en-0.166
Token bird
Token ID6512
Feature activation-1.55e-01

Pos logit contributions
 spe+0.898
mer+0.896
chie+0.882
 se+0.730
 blu+0.692
Neg logit contributions
ward-1.350
reating-1.344
 about-1.266
tr-1.237
icated-1.207
Token came
Token ID1625
Feature activation-5.26e-01

Pos logit contributions
 spe+0.232
mer+0.221
chie+0.219
 se+0.200
 en+0.189
Neg logit contributions
ward-0.207
reating-0.204
tr-0.184
ained-0.182
 about-0.182
Token and
Token ID290
Feature activation-6.98e-01

Pos logit contributions
 spe+0.787
mer+0.747
chie+0.727
 se+0.670
 en+0.626
Neg logit contributions
ward-0.723
reating-0.700
ained-0.646
 about-0.646
tr-0.633
Token scared
Token ID12008
Feature activation-6.60e-01

Pos logit contributions
 spe+0.992
chie+0.962
mer+0.959
 se+0.831
 blu+0.794
Neg logit contributions
ward-0.982
reating-0.929
tr-0.864
 about-0.857
icated-0.850
Token the
Token ID262
Feature activation-1.81e+00

Pos logit contributions
 spe+0.928
mer+0.888
chie+0.865
 se+0.792
 en+0.730
Neg logit contributions
reating-0.944
ward-0.942
 about-0.875
ained-0.837
icated-0.837
Token little
Token ID1310
Feature activation-3.28e+00

Pos logit contributions
mer+1.590
chie+1.513
 spe+1.435
 Int+1.109
 dunno+1.094
Neg logit contributions
ward-3.500
tr-3.412
reating-3.361
 about-3.215
 done-3.200
Token bird
Token ID6512
Feature activation-8.08e-01

Pos logit contributions
chie+3.735
mer+3.724
 dunno+3.216
 Int+3.213
rily+3.186
Neg logit contributions
tr-5.162
ward-5.139
reating-4.994
inea-4.865
 about-4.701
Tokenie
Token ID494
Feature activation-6.92e-01

Pos logit contributions
 spe+1.390
mer+1.305
chie+1.292
 se+1.203
 skinny+1.147
Neg logit contributions
ward-0.922
reating-0.913
 about-0.829
ained-0.808
tr-0.808
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
 spe+1.023
mer+0.959
chie+0.955
 se+0.883
 blu+0.816
Neg logit contributions
ward-0.956
reating-0.913
 about-0.861
tr-0.844
ained-0.832
Token The
Token ID383
Feature activation-1.56e+00

Pos logit contributions
 spe+1.802
mer+1.768
chie+1.724
 se+1.530
 blu+1.376
Neg logit contributions
ward-2.264
 about-2.147
reating-2.046
tr-2.029
 Afterwards-2.016
Token little
Token ID1310
Feature activation-2.86e+00

Pos logit contributions
mer+1.359
 spe+1.226
chie+1.221
 dunno+0.898
rily+0.876
Neg logit contributions
ward-3.076
reating-2.950
tr-2.947
 done-2.808
 about-2.802
Token it
Token ID340
Feature activation-1.52e+00

Pos logit contributions
 spe+1.187
mer+1.163
chie+1.150
 se+0.988
 dunno+0.895
Neg logit contributions
ward-1.445
reating-1.342
 about-1.336
ained-1.275
 done-1.252
Token was
Token ID373
Feature activation+3.19e-02

Pos logit contributions
 spe+2.616
chie+2.580
mer+2.436
 se+2.224
rily+2.207
Neg logit contributions
ward-1.653
 about-1.529
tr-1.517
ained-1.463
 done-1.450
Token small
Token ID1402
Feature activation-3.88e-01

Pos logit contributions
ward+0.050
reating+0.049
tr+0.045
ained+0.045
live+0.045
Neg logit contributions
 spe-0.040
mer-0.038
chie-0.038
 se-0.034
 en-0.032
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
 spe+0.561
chie+0.538
mer+0.536
 se+0.486
 en+0.452
Neg logit contributions
ward-0.541
reating-0.531
 about-0.489
tr-0.476
ained-0.472
Token The
Token ID383
Feature activation-2.47e+00

Pos logit contributions
 spe+1.530
mer+1.526
chie+1.485
 se+1.308
 en+1.119
Neg logit contributions
ward-1.969
reating-1.870
 about-1.844
ained-1.777
 Afterwards-1.761
Token little
Token ID1310
Feature activation-3.28e+00

Pos logit contributions
 spe+2.648
chie+2.514
mer+2.406
 dunno+1.865
 Int+1.864
Neg logit contributions
ward-4.582
reating-4.339
 stories-4.225
 done-4.160
ained-4.142
Token boat
Token ID8848
Feature activation-6.36e-01

Pos logit contributions
 spe+3.351
chie+3.248
mer+2.895
 Int+2.694
 dunno+2.587
Neg logit contributions
ward-5.922
reating-5.776
ruck-5.458
ained-5.431
icated-5.408
Token was
Token ID373
Feature activation-5.31e-01

Pos logit contributions
 spe+1.040
chie+0.996
mer+0.972
 se+0.886
 skinny+0.843
Neg logit contributions
ward-0.769
reating-0.717
 about-0.689
tr-0.681
ained-0.679
Token mo
Token ID6941
Feature activation-1.61e-01

Pos logit contributions
 spe+0.651
mer+0.634
chie+0.612
 se+0.540
 dunno+0.489
Neg logit contributions
ward-0.861
reating-0.823
 about-0.793
ained-0.782
tr-0.765
Tokenored
Token ID1850
Feature activation+2.67e-01

Pos logit contributions
 spe+0.291
mer+0.277
chie+0.273
 se+0.258
 en+0.245
Neg logit contributions
ward-0.170
reating-0.166
ained-0.144
live-0.144
tr-0.143
Token in
Token ID287
Feature activation-9.91e-01

Pos logit contributions
ward+0.389
reating+0.386
tr+0.349
live+0.347
ained+0.343
Neg logit contributions
 spe-0.372
mer-0.353
chie-0.345
 se-0.320
 en-0.299
Token
Token ID198
Feature activation-1.16e-01

Pos logit contributions
 spe+0.598
mer+0.569
chie+0.564
 se+0.519
 en+0.485
Neg logit contributions
ward-0.485
reating-0.471
 about-0.428
live-0.422
ained-0.419
TokenOne
Token ID3198
Feature activation+4.73e-02

Pos logit contributions
 spe+0.173
mer+0.165
chie+0.162
 se+0.150
 en+0.140
Neg logit contributions
ward-0.158
reating-0.155
tr-0.139
live-0.138
ained-0.138
Token day
Token ID1110
Feature activation+1.54e+00

Pos logit contributions
ward+0.042
reating+0.041
tr+0.034
live+0.034
ained+0.034
Neg logit contributions
 spe-0.093
mer-0.090
chie-0.089
 se-0.084
 en-0.080
Token,
Token ID11
Feature activation-4.24e-01

Pos logit contributions
reating+2.396
ward+2.179
tr+2.108
live+2.075
ained+2.057
Neg logit contributions
 spe-2.283
mer-2.082
 se-1.914
chie-1.894
 en-1.796
Token the
Token ID262
Feature activation-2.59e+00

Pos logit contributions
 spe+0.581
chie+0.562
mer+0.560
 se+0.504
 en+0.462
Neg logit contributions
ward-0.629
reating-0.593
 about-0.554
tr-0.546
ained-0.544
Token tiny
Token ID7009
Feature activation-3.27e+00

Pos logit contributions
 spe+3.030
chie+3.018
mer+2.776
 Int+2.361
 dunno+2.361
Neg logit contributions
ward-4.406
reating-4.062
nce-3.983
 stories-3.959
 done-3.907
Token pink
Token ID11398
Feature activation-2.51e+00

Pos logit contributions
 spe+3.015
chie+2.914
 Int+2.700
mer+2.350
 dunno+2.259
Neg logit contributions
ward-5.967
reating-5.938
ruck-5.920
nce-5.690
inea-5.605
Token boat
Token ID8848
Feature activation-8.40e-01

Pos logit contributions
 spe+2.129
chie+2.024
mer+1.675
 Int+1.505
 skinny+1.492
Neg logit contributions
ward-5.008
reating-4.965
nce-4.840
ained-4.774
ruck-4.674
Token had
Token ID550
Feature activation-1.09e+00

Pos logit contributions
 spe+1.192
chie+1.120
mer+1.072
 se+0.956
 skinny+0.930
Neg logit contributions
ward-1.202
reating-1.129
 about-1.096
ained-1.092
tr-1.077
Token to
Token ID284
Feature activation-9.39e-01

Pos logit contributions
mer+1.665
 spe+1.646
chie+1.563
 se+1.369
 dunno+1.316
Neg logit contributions
ward-1.483
reating-1.388
 about-1.339
ained-1.293
 done-1.277
Token move
Token ID1445
Feature activation-1.78e-01

Pos logit contributions
 spe+1.260
mer+1.246
chie+1.188
 se+1.018
 skinny+0.987
Neg logit contributions
ward-1.421
reating-1.397
 about-1.262
ained-1.242
tr-1.233
Token We
Token ID775
Feature activation-2.69e-02

Pos logit contributions
 spe+1.543
mer+1.434
chie+1.375
 se+1.341
 skinny+1.171
Neg logit contributions
ward-2.035
reating-1.895
tr-1.888
ained-1.844
ouncing-1.809
Token are
Token ID389
Feature activation-1.79e-02

Pos logit contributions
 spe+0.035
mer+0.033
chie+0.033
 se+0.030
 en+0.028
Neg logit contributions
ward-0.042
reating-0.041
tr-0.037
ained-0.037
 about-0.037
Token not
Token ID407
Feature activation-6.60e-01

Pos logit contributions
 spe+0.028
mer+0.027
chie+0.027
 se+0.025
 en+0.023
Neg logit contributions
ward-0.023
reating-0.023
tr-0.020
 about-0.020
ained-0.020
Token going
Token ID1016
Feature activation-6.77e-01

Pos logit contributions
 spe+0.995
mer+0.966
chie+0.924
 se+0.861
 blu+0.799
Neg logit contributions
ward-0.878
reating-0.867
 about-0.798
ained-0.775
 done-0.763
Token on
Token ID319
Feature activation-1.13e+00

Pos logit contributions
 spe+0.876
mer+0.824
chie+0.810
 se+0.733
 en+0.669
Neg logit contributions
ward-1.060
reating-1.057
 about-0.977
ained-0.967
tr-0.943
Token a
Token ID257
Feature activation-3.27e+00

Pos logit contributions
mer+1.684
 spe+1.635
chie+1.623
 se+1.425
 en+1.358
Neg logit contributions
ward-1.622
reating-1.574
 about-1.431
ained-1.376
icated-1.355
Token trip
Token ID5296
Feature activation-6.76e-01

Pos logit contributions
chie+5.052
mer+5.005
 spe+4.664
rily+4.263
rier+4.199
Neg logit contributions
reating-4.758
ward-4.677
 about-4.365
tr-4.287
 stories-4.165
Token today
Token ID1909
Feature activation-1.71e+00

Pos logit contributions
 spe+1.011
chie+0.942
mer+0.923
 se+0.865
 en+0.807
Neg logit contributions
ward-0.936
reating-0.934
 about-0.866
ained-0.844
tr-0.827
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
 spe+2.828
chie+2.752
mer+2.608
 se+2.482
 en+2.330
Neg logit contributions
ward-2.168
reating-2.131
 about-2.108
ained-1.898
 done-1.875
Token We
Token ID775
Feature activation+2.19e-01

Pos logit contributions
 spe+1.162
mer+1.116
chie+1.082
 se+0.998
 skinny+0.839
Neg logit contributions
ward-1.762
reating-1.700
tr-1.656
 about-1.633
ained-1.603
Token are
Token ID389
Feature activation-4.93e-02

Pos logit contributions
ward+0.346
reating+0.341
tr+0.309
live+0.308
ained+0.306
Neg logit contributions
 spe-0.278
mer-0.262
chie-0.258
 se-0.235
 en-0.217
Token many
Token ID867
Feature activation+8.05e-01

Pos logit contributions
ward+1.425
reating+1.395
live+1.347
ained+1.324
tr+1.297
Neg logit contributions
 spe-1.403
mer-1.272
chie-1.247
 se-1.247
 en-1.173
Token pictures
Token ID5986
Feature activation+1.60e+00

Pos logit contributions
ward+1.032
reating+0.970
tr+0.917
ained+0.912
live+0.901
Neg logit contributions
 spe-1.303
mer-1.219
chie-1.177
 se-1.125
 en-1.082
Token.
Token ID13
Feature activation+4.11e-01

Pos logit contributions
ward+2.559
live+2.494
reating+2.491
ained+2.391
oof+2.381
Neg logit contributions
 spe-1.953
mer-1.768
 se-1.687
chie-1.666
 en-1.510
Token He
Token ID679
Feature activation+3.83e-01

Pos logit contributions
ward+0.644
reating+0.635
live+0.577
tr+0.571
ained+0.570
Neg logit contributions
 spe-0.530
mer-0.499
chie-0.486
 se-0.454
 en-0.429
Token sees
Token ID7224
Feature activation+3.95e-01

Pos logit contributions
ward+0.596
reating+0.587
ained+0.534
live+0.532
tr+0.530
Neg logit contributions
 spe-0.500
mer-0.464
chie-0.455
 se-0.430
 en-0.399
Token a
Token ID257
Feature activation+1.41e+00

Pos logit contributions
ward+0.547
reating+0.534
live+0.493
ained+0.488
tr+0.488
Neg logit contributions
 spe-0.577
mer-0.541
chie-0.533
 se-0.507
 en-0.474
Token picture
Token ID4286
Feature activation+1.55e+00

Pos logit contributions
ward+2.546
reating+2.524
ained+2.487
live+2.381
 Afterwards+2.317
Neg logit contributions
 spe-1.614
mer-1.371
 se-1.329
chie-1.271
 skinny-1.198
Token of
Token ID286
Feature activation+4.22e-01

Pos logit contributions
ward+2.292
reating+2.211
live+2.155
ained+2.097
oof+2.069
Neg logit contributions
 spe-2.098
mer-1.939
 se-1.885
chie-1.808
 en-1.694
Token a
Token ID257
Feature activation+9.75e-01

Pos logit contributions
ward+0.628
reating+0.616
live+0.573
ained+0.571
tr+0.571
Neg logit contributions
 spe-0.569
mer-0.523
chie-0.519
 se-0.492
 en-0.454
Token shoe
Token ID17292
Feature activation+9.28e-01

Pos logit contributions
ward+1.691
reating+1.672
ained+1.607
live+1.559
ouncing+1.519
Neg logit contributions
 spe-1.131
mer-1.008
 se-0.944
chie-0.938
 skinny-0.842
Token.
Token ID13
Feature activation+3.56e-01

Pos logit contributions
ward+1.307
reating+1.297
ained+1.188
live+1.178
icated+1.155
Neg logit contributions
 spe-1.319
mer-1.254
chie-1.173
 se-1.170
 en-1.064
Token Mom
Token ID11254
Feature activation+2.35e-01

Pos logit contributions
 spe+0.539
mer+0.534
chie+0.515
 se+0.458
 en+0.423
Neg logit contributions
ward-0.585
reating-0.579
tr-0.526
 about-0.526
icated-0.511
Tokenmy
Token ID1820
Feature activation-3.50e-01

Pos logit contributions
ward+0.174
reating+0.168
tr+0.135
live+0.134
ained+0.134
Neg logit contributions
 spe-0.493
mer-0.476
chie-0.471
 se-0.449
 en-0.428
Token."
Token ID526
Feature activation+4.23e-01

Pos logit contributions
 spe+0.538
chie+0.513
mer+0.513
 se+0.469
 en+0.441
Neg logit contributions
ward-0.455
reating-0.442
 about-0.402
tr-0.400
live-0.394
Token
Token ID198
Feature activation+1.10e+00

Pos logit contributions
reating+0.672
ward+0.668
live+0.599
tr+0.598
ained+0.596
Neg logit contributions
 spe-0.523
mer-0.493
chie-0.481
 se-0.453
 en-0.422
Token
Token ID198
Feature activation+1.25e+00

Pos logit contributions
reating+1.736
ward+1.726
tr+1.557
ained+1.540
icated+1.526
Neg logit contributions
 spe-1.302
mer-1.251
chie-1.229
 se-1.188
 en-1.105
TokenHer
Token ID9360
Feature activation+1.53e+00

Pos logit contributions
reating+2.065
ward+2.042
ained+1.842
 about+1.842
inea+1.826
Neg logit contributions
 spe-1.384
chie-1.352
mer-1.344
 se-1.292
 en-1.199
Token mom
Token ID1995
Feature activation-3.43e-02

Pos logit contributions
reating+2.995
ward+2.990
ained+2.853
icated+2.817
live+2.774
Neg logit contributions
 spe-1.498
 se-1.324
mer-1.193
chie-1.165
 en-1.143
Token hugged
Token ID41130
Feature activation-1.58e-01

Pos logit contributions
 spe+0.054
mer+0.051
chie+0.051
 se+0.047
 en+0.044
Neg logit contributions
ward-0.044
reating-0.043
tr-0.038
 about-0.038
live-0.038
Token her
Token ID607
Feature activation+6.65e-01

Pos logit contributions
 spe+0.219
mer+0.208
chie+0.205
 se+0.186
 en+0.172
Neg logit contributions
ward-0.233
reating-0.232
 about-0.210
tr-0.208
ained-0.207
Token and
Token ID290
Feature activation+3.58e-01

Pos logit contributions
ward+1.062
reating+1.055
live+0.975
ained+0.974
tr+0.946
Neg logit contributions
 spe-0.827
mer-0.785
chie-0.765
 se-0.715
 en-0.679
Token said
Token ID531
Feature activation+1.51e-01

Pos logit contributions
ward+0.480
reating+0.476
ained+0.427
tr+0.425
live+0.422
Neg logit contributions
 spe-0.538
mer-0.505
chie-0.496
 se-0.481
 en-0.444
Token are
Token ID389
Feature activation+1.07e+00

Pos logit contributions
reating+1.210
ward+1.204
ained+1.068
tr+1.063
live+1.061
Neg logit contributions
 spe-1.062
mer-0.986
chie-0.977
 se-0.913
 en-0.857
Token not
Token ID407
Feature activation+4.92e-01

Pos logit contributions
reating+1.896
ward+1.881
tr+1.731
live+1.721
icated+1.694
Neg logit contributions
 spe-1.174
mer-1.060
chie-1.054
 se-1.002
 en-0.989
Token friends
Token ID2460
Feature activation-8.96e-01

Pos logit contributions
ward+0.776
reating+0.765
tr+0.706
live+0.701
ained+0.685
Neg logit contributions
 spe-0.631
mer-0.593
chie-0.589
 se-0.546
 en-0.515
Token.
Token ID13
Feature activation-1.99e-01

Pos logit contributions
 spe+1.505
chie+1.484
mer+1.413
 se+1.336
 blu+1.278
Neg logit contributions
reating-1.045
ward-1.035
 about-1.001
tr-0.895
ained-0.885
Token
Token ID198
Feature activation+1.33e+00

Pos logit contributions
 spe+0.285
mer+0.271
chie+0.270
 se+0.247
 en+0.227
Neg logit contributions
ward-0.282
reating-0.275
tr-0.251
 about-0.250
ained-0.250
Token
Token ID198
Feature activation+1.47e+00

Pos logit contributions
reating+2.171
ward+2.096
live+1.922
tr+1.910
ained+1.885
Neg logit contributions
 spe-1.612
mer-1.507
 se-1.463
chie-1.452
 en-1.343
TokenA
Token ID32
Feature activation-1.56e-01

Pos logit contributions
reating+2.435
ward+2.300
tr+2.149
 about+2.120
ained+2.114
Neg logit contributions
mer-1.788
 spe-1.773
chie-1.689
 se-1.599
 skinny-1.472
Token lady
Token ID10846
Feature activation-2.21e-01

Pos logit contributions
 spe+0.173
mer+0.162
chie+0.160
 se+0.141
 en+0.129
Neg logit contributions
ward-0.271
reating-0.267
tr-0.246
 about-0.244
live-0.244
Token sees
Token ID7224
Feature activation-4.55e-01

Pos logit contributions
 spe+0.279
mer+0.264
chie+0.264
 se+0.235
 en+0.218
Neg logit contributions
ward-0.346
reating-0.342
 about-0.314
tr-0.312
live-0.309
Token them
Token ID606
Feature activation+1.51e+00

Pos logit contributions
 spe+0.510
mer+0.478
chie+0.478
 se+0.409
 skinny+0.375
Neg logit contributions
ward-0.803
reating-0.798
 about-0.737
icated-0.714
ained-0.708
Token.
Token ID13
Feature activation-6.21e-01

Pos logit contributions
ward+2.575
reating+2.526
live+2.437
tr+2.372
ugg+2.323
Neg logit contributions
 spe-1.707
mer-1.599
chie-1.513
 se-1.508
 en-1.484
Tokenmy
Token ID1820
Feature activation+6.51e-01

Pos logit contributions
ward+0.861
reating+0.816
tr+0.632
live+0.606
 Afterwards+0.603
Neg logit contributions
 spe-1.997
mer-1.954
chie-1.904
 se-1.863
 en-1.781
Token loved
Token ID6151
Feature activation+1.61e-01

Pos logit contributions
reating+0.898
ward+0.882
ained+0.761
tr+0.751
live+0.747
Neg logit contributions
 spe-0.977
mer-0.927
chie-0.891
 se-0.871
 en-0.813
Token to
Token ID284
Feature activation-5.43e-01

Pos logit contributions
ward+0.279
reating+0.274
tr+0.253
live+0.252
ained+0.251
Neg logit contributions
 spe-0.180
mer-0.166
chie-0.164
 se-0.150
 en-0.135
Token look
Token ID804
Feature activation+1.02e+00

Pos logit contributions
 spe+0.853
mer+0.849
chie+0.825
 se+0.718
 en+0.698
Neg logit contributions
ward-0.702
reating-0.682
tr-0.612
 about-0.607
icated-0.597
Token at
Token ID379
Feature activation-2.04e-01

Pos logit contributions
ward+1.499
reating+1.402
live+1.373
tr+1.360
ained+1.316
Neg logit contributions
 spe-1.414
mer-1.375
chie-1.333
 se-1.288
 en-1.242
Token his
Token ID465
Feature activation+1.54e+00

Pos logit contributions
 spe+0.273
mer+0.263
chie+0.257
 se+0.231
 en+0.214
Neg logit contributions
ward-0.309
reating-0.307
 about-0.277
tr-0.274
ained-0.273
Token calendar
Token ID11845
Feature activation+9.77e-02

Pos logit contributions
ward+2.756
reating+2.653
live+2.514
icated+2.436
ained+2.434
Neg logit contributions
 spe-1.692
mer-1.603
 se-1.565
chie-1.505
 boat-1.428
Token every
Token ID790
Feature activation-1.41e+00

Pos logit contributions
ward+0.133
reating+0.130
tr+0.117
live+0.116
ained+0.116
Neg logit contributions
 spe-0.145
mer-0.138
chie-0.136
 se-0.126
 en-0.118
Token day
Token ID1110
Feature activation-2.60e-01

Pos logit contributions
 spe+3.259
chie+3.227
mer+3.155
 se+2.963
 en+2.886
Neg logit contributions
ward-0.773
reating-0.734
 about-0.624
tr-0.557
 done-0.527
Token.
Token ID13
Feature activation-5.64e-01

Pos logit contributions
 spe+0.373
mer+0.355
chie+0.353
 se+0.320
 en+0.297
Neg logit contributions
ward-0.370
reating-0.362
 about-0.330
tr-0.323
ained-0.323
Token His
Token ID2399
Feature activation+2.93e-01

Pos logit contributions
 spe+0.768
mer+0.742
chie+0.729
 se+0.653
 en+0.592
Neg logit contributions
ward-0.847
reating-0.843
 about-0.781
tr-0.776
ained-0.762
Token had
Token ID550
Feature activation+5.16e-02

Pos logit contributions
 spe+1.763
chie+1.690
mer+1.616
 se+1.495
 blu+1.473
Neg logit contributions
ward-1.407
reating-1.309
live-1.292
 about-1.277
 done-1.250
Token protected
Token ID6861
Feature activation+1.98e-01

Pos logit contributions
ward+0.072
reating+0.071
tr+0.064
live+0.063
ained+0.063
Neg logit contributions
 spe-0.075
mer-0.072
chie-0.071
 se-0.065
 en-0.061
Token his
Token ID465
Feature activation+6.28e-01

Pos logit contributions
ward+0.283
reating+0.277
tr+0.252
live+0.250
ained+0.249
Neg logit contributions
 spe-0.279
mer-0.266
chie-0.261
 se-0.241
 en-0.225
Token honey
Token ID12498
Feature activation+2.85e-01

Pos logit contributions
ward+0.767
reating+0.757
ained+0.670
tr+0.654
live+0.651
Neg logit contributions
 spe-1.042
mer-0.990
chie-0.963
 se-0.912
 skinny-0.864
Token.
Token ID13
Feature activation+5.53e-01

Pos logit contributions
ward+0.388
reating+0.384
ained+0.343
tr+0.342
live+0.341
Neg logit contributions
 spe-0.417
mer-0.399
chie-0.391
 se-0.363
 en-0.338
Token From
Token ID3574
Feature activation+1.64e+00

Pos logit contributions
reating+0.831
ward+0.817
tr+0.742
ained+0.730
live+0.728
Neg logit contributions
 spe-0.748
mer-0.707
chie-0.694
 se-0.636
 en-0.608
Token that
Token ID326
Feature activation+1.07e+00

Pos logit contributions
reating+2.313
ward+2.162
tr+2.095
icated+1.986
ained+1.985
Neg logit contributions
 spe-2.466
mer-2.381
 se-2.306
chie-2.217
 boat-2.112
Token day
Token ID1110
Feature activation+1.37e+00

Pos logit contributions
reating+1.119
ward+1.070
tr+0.931
ained+0.901
live+0.896
Neg logit contributions
 spe-1.968
mer-1.884
chie-1.837
 se-1.759
 en-1.689
Token on
Token ID319
Feature activation+1.54e-01

Pos logit contributions
reating+1.404
ward+1.271
icated+1.133
tr+1.129
live+1.096
Neg logit contributions
 spe-2.575
chie-2.493
mer-2.458
 se-2.295
 en-2.214
Token,
Token ID11
Feature activation-1.17e+00

Pos logit contributions
ward+0.177
reating+0.175
tr+0.153
ained+0.151
live+0.150
Neg logit contributions
 spe-0.263
mer-0.251
chie-0.248
 se-0.232
 en-0.219
Token the
Token ID262
Feature activation+1.33e+00

Pos logit contributions
chie+1.639
 spe+1.623
mer+1.579
 se+1.426
 dunno+1.321
Neg logit contributions
ward-1.688
reating-1.576
 about-1.512
ained-1.440
icated-1.427
Token very
Token ID845
Feature activation-1.17e+00

Pos logit contributions
chie+2.443
mer+2.325
 spe+2.257
rily+2.036
 Int+1.895
Neg logit contributions
ward-3.164
tr-3.050
reating-2.859
 about-2.857
ugg-2.811
Token messy
Token ID24097
Feature activation-3.99e-01

Pos logit contributions
 spe+1.347
chie+1.263
mer+1.223
 se+1.050
 blu+0.968
Neg logit contributions
ward-1.946
reating-1.848
 about-1.826
live-1.786
 done-1.781
Token boy
Token ID2933
Feature activation-4.11e-01

Pos logit contributions
 spe+0.440
chie+0.412
mer+0.405
 se+0.349
 en+0.321
Neg logit contributions
ward-0.690
reating-0.687
tr-0.638
 about-0.634
ained-0.622
Token.
Token ID13
Feature activation+1.98e-01

Pos logit contributions
 spe+0.548
chie+0.524
mer+0.517
 se+0.464
 blu+0.431
Neg logit contributions
ward-0.617
reating-0.606
 about-0.566
tr-0.558
live-0.546
Token His
Token ID2399
Feature activation+1.29e+00

Pos logit contributions
ward+0.307
reating+0.303
tr+0.274
live+0.274
ained+0.271
Neg logit contributions
 spe-0.258
mer-0.244
chie-0.239
 se-0.220
 en-0.205
Token toys
Token ID14958
Feature activation+9.67e-02

Pos logit contributions
reating+2.418
ward+2.365
icated+2.161
live+2.150
nce+2.115
Neg logit contributions
 spe-1.283
mer-1.204
 se-1.150
chie-1.132
 en-1.054
Token were
Token ID547
Feature activation+2.39e-01

Pos logit contributions
ward+0.120
reating+0.118
tr+0.104
live+0.103
ained+0.103
Neg logit contributions
 spe-0.155
mer-0.148
chie-0.146
 se-0.136
 en-0.128
Token everywhere
Token ID8347
Feature activation-4.59e-01

Pos logit contributions
ward+0.374
reating+0.369
tr+0.336
live+0.335
ained+0.331
Neg logit contributions
 spe-0.307
mer-0.287
chie-0.284
 se-0.262
 en-0.244
Token in
Token ID287
Feature activation-7.85e-01

Pos logit contributions
 spe+0.744
mer+0.712
chie+0.707
 se+0.641
 en+0.610
Neg logit contributions
ward-0.578
reating-0.554
 about-0.511
ained-0.494
tr-0.494
Token his
Token ID465
Feature activation-9.78e-01

Pos logit contributions
 spe+1.151
mer+1.119
chie+1.089
 se+0.974
 en+0.917
Neg logit contributions
ward-1.110
reating-1.105
ained-0.970
 about-0.958
icated-0.955
Token room
Token ID2119
Feature activation+5.53e-02

Pos logit contributions
 spe+1.523
mer+1.453
chie+1.444
 se+1.270
 en+1.173
Neg logit contributions
ward-1.323
reating-1.247
tr-1.236
 about-1.177
ained-1.159
Token angry
Token ID7954
Feature activation+6.16e-01

Pos logit contributions
 spe+0.274
mer+0.257
chie+0.253
 se+0.224
 en+0.199
Neg logit contributions
ward-0.422
reating-0.412
 about-0.383
tr-0.379
live-0.378
Token and
Token ID290
Feature activation-4.96e-01

Pos logit contributions
ward+0.950
reating+0.928
live+0.861
tr+0.860
ained+0.849
Neg logit contributions
 spe-0.792
mer-0.768
chie-0.735
 se-0.692
 en-0.646
Token took
Token ID1718
Feature activation-1.95e-02

Pos logit contributions
 spe+0.662
mer+0.626
chie+0.620
 se+0.555
 blu+0.518
Neg logit contributions
ward-0.744
reating-0.716
 about-0.667
tr-0.659
live-0.655
Token the
Token ID262
Feature activation-6.76e-01

Pos logit contributions
 spe+0.026
mer+0.025
chie+0.024
 se+0.022
 en+0.021
Neg logit contributions
ward-0.029
reating-0.029
tr-0.026
 about-0.026
live-0.026
Token axe
Token ID23272
Feature activation-1.48e-01

Pos logit contributions
 spe+0.909
mer+0.881
chie+0.855
 se+0.764
 en+0.719
Neg logit contributions
ward-1.000
reating-0.988
tr-0.917
 about-0.894
ained-0.869
Token away
Token ID1497
Feature activation+5.53e-01

Pos logit contributions
 spe+0.222
mer+0.210
chie+0.208
 se+0.191
 en+0.179
Neg logit contributions
ward-0.198
reating-0.196
tr-0.176
 about-0.174
ained-0.173
Token from
Token ID422
Feature activation+5.03e-01

Pos logit contributions
ward+0.782
reating+0.781
live+0.707
ained+0.702
icated+0.693
Neg logit contributions
 spe-0.784
mer-0.755
chie-0.735
 se-0.681
 en-0.632
Token the
Token ID262
Feature activation+1.98e-01

Pos logit contributions
ward+0.782
reating+0.771
tr+0.717
live+0.708
ained+0.706
Neg logit contributions
 spe-0.653
mer-0.614
chie-0.604
 se-0.556
 en-0.518
Token boy
Token ID2933
Feature activation+9.15e-02

Pos logit contributions
ward+0.341
reating+0.337
live+0.309
ained+0.309
tr+0.308
Neg logit contributions
 spe-0.224
mer-0.209
chie-0.205
 se-0.186
 en-0.168
Token.
Token ID13
Feature activation-3.38e-01

Pos logit contributions
ward+0.141
reating+0.139
tr+0.126
live+0.126
ained+0.126
Neg logit contributions
 spe-0.119
mer-0.112
chie-0.110
 se-0.101
 en-0.093
Token
Token ID198
Feature activation+3.67e-01

Pos logit contributions
 spe+0.479
mer+0.451
chie+0.448
 se+0.411
 blu+0.372
Neg logit contributions
ward-0.489
reating-0.470
 about-0.435
tr-0.430
ained-0.428
Token.
Token ID13
Feature activation-5.06e-01

Pos logit contributions
 spe+0.913
chie+0.864
mer+0.842
 se+0.767
 skinny+0.729
Neg logit contributions
ward-0.769
reating-0.760
 about-0.719
ained-0.676
live-0.666
Token
Token ID198
Feature activation-2.77e-01

Pos logit contributions
 spe+0.758
chie+0.712
mer+0.705
 se+0.646
 blu+0.588
Neg logit contributions
ward-0.701
reating-0.671
 about-0.635
ained-0.624
tr-0.621
Token
Token ID198
Feature activation+4.39e-01

Pos logit contributions
 spe+0.434
mer+0.409
chie+0.408
 se+0.374
 en+0.351
Neg logit contributions
ward-0.358
reating-0.349
 about-0.317
live-0.313
tr-0.312
TokenThe
Token ID464
Feature activation-6.44e-01

Pos logit contributions
ward+0.601
reating+0.597
tr+0.530
ained+0.520
live+0.517
Neg logit contributions
 spe-0.641
mer-0.626
chie-0.608
 se-0.566
 en-0.535
Token shark
Token ID21027
Feature activation+1.16e-01

Pos logit contributions
 spe+0.797
chie+0.740
mer+0.725
 se+0.615
 en+0.579
Neg logit contributions
ward-1.060
reating-1.033
 about-0.947
nce-0.940
ained-0.937
Token saw
Token ID2497
Feature activation+3.65e-01

Pos logit contributions
ward+0.174
reating+0.172
tr+0.155
ained+0.154
live+0.154
Neg logit contributions
 spe-0.156
mer-0.148
chie-0.145
 se-0.134
 en-0.124
Token them
Token ID606
Feature activation+1.29e+00

Pos logit contributions
ward+0.466
reating+0.457
tr+0.414
live+0.410
ained+0.407
Neg logit contributions
 spe-0.572
mer-0.546
chie-0.536
 se-0.513
 en-0.481
Token and
Token ID290
Feature activation+5.48e-02

Pos logit contributions
reating+2.156
ward+2.103
live+2.045
tr+2.020
ugg+1.935
Neg logit contributions
 spe-1.505
mer-1.411
 se-1.383
chie-1.360
 en-1.311
Token sw
Token ID1509
Feature activation+4.38e-01

Pos logit contributions
ward+0.086
reating+0.085
tr+0.077
live+0.076
ained+0.076
Neg logit contributions
 spe-0.070
mer-0.066
chie-0.065
 se-0.059
 en-0.055
Tokenam
Token ID321
Feature activation-6.33e-01

Pos logit contributions
ward+0.540
reating+0.539
tr+0.465
icated+0.457
live+0.455
Neg logit contributions
 spe-0.704
mer-0.691
chie-0.680
 se-0.626
 en-0.593
Token towards
Token ID3371
Feature activation-1.02e+00

Pos logit contributions
 spe+0.933
chie+0.855
mer+0.842
 se+0.778
 skinny+0.720
Neg logit contributions
ward-0.877
reating-0.857
 about-0.808
ained-0.788
tr-0.767
Token the
Token ID262
Feature activation-1.03e+00

Pos logit contributions
 spe+0.591
mer+0.572
chie+0.564
 se+0.505
 en+0.474
Neg logit contributions
ward-0.530
reating-0.520
 about-0.466
ained-0.453
tr-0.452
Token two
Token ID734
Feature activation-1.17e+00

Pos logit contributions
 spe+1.421
chie+1.368
mer+1.347
 se+1.170
 dunno+1.109
Neg logit contributions
ward-1.530
reating-1.480
 about-1.338
tr-1.325
ained-1.318
Token clean
Token ID3424
Feature activation-1.20e+00

Pos logit contributions
 spe+1.937
mer+1.865
chie+1.861
 se+1.644
 dunno+1.597
Neg logit contributions
ward-1.375
reating-1.359
 about-1.229
 stories-1.167
 done-1.151
Token beans
Token ID16567
Feature activation+7.34e-02

Pos logit contributions
 spe+1.774
chie+1.709
mer+1.627
 se+1.426
 en+1.350
Neg logit contributions
reating-1.724
ward-1.718
 about-1.552
ained-1.527
 stories-1.509
Token.
Token ID13
Feature activation+9.40e-01

Pos logit contributions
ward+0.103
reating+0.101
tr+0.091
live+0.090
ained+0.090
Neg logit contributions
 spe-0.105
mer-0.100
chie-0.099
 se-0.091
 en-0.085
Token
Token ID198
Feature activation+1.37e+00

Pos logit contributions
reating+1.480
ward+1.464
live+1.337
tr+1.331
ained+1.323
Neg logit contributions
 spe-1.172
mer-1.102
chie-1.086
 se-0.979
 en-0.950
Token
Token ID198
Feature activation+1.27e+00

Pos logit contributions
reating+1.503
ward+1.443
tr+1.333
ained+1.288
icated+1.267
Neg logit contributions
 spe-2.333
chie-2.199
mer-2.198
 se-2.161
 en-2.044
TokenJohn
Token ID7554
Feature activation-2.00e-02

Pos logit contributions
ward+1.794
reating+1.758
tr+1.643
ained+1.627
icated+1.588
Neg logit contributions
 spe-1.753
chie-1.672
mer-1.668
 se-1.580
 en-1.457
Token was
Token ID373
Feature activation+2.78e-01

Pos logit contributions
 spe+0.028
mer+0.027
chie+0.027
 se+0.024
 en+0.023
Neg logit contributions
ward-0.028
reating-0.028
tr-0.025
live-0.025
 about-0.025
Token very
Token ID845
Feature activation-2.39e-01

Pos logit contributions
ward+0.459
reating+0.457
tr+0.419
live+0.415
ained+0.412
Neg logit contributions
 spe-0.331
mer-0.308
chie-0.306
 se-0.277
 en-0.259
Token happy
Token ID3772
Feature activation+9.60e-01

Pos logit contributions
 spe+0.299
mer+0.281
chie+0.277
 se+0.249
 skinny+0.224
Neg logit contributions
ward-0.384
reating-0.374
 about-0.346
ained-0.342
live-0.341
Token to
Token ID284
Feature activation-1.39e-01

Pos logit contributions
 spe+0.291
mer+0.278
chie+0.272
 se+0.239
 en+0.218
Neg logit contributions
ward-0.456
reating-0.442
 about-0.411
tr-0.410
ained-0.408
Token heal
Token ID12035
Feature activation-6.60e-02

Pos logit contributions
 spe+0.165
mer+0.158
chie+0.154
 se+0.138
 en+0.127
Neg logit contributions
ward-0.231
reating-0.226
tr-0.208
 about-0.206
ained-0.206
Token itself
Token ID2346
Feature activation+2.71e-01

Pos logit contributions
 spe+0.091
mer+0.087
chie+0.086
 se+0.079
 en+0.073
Neg logit contributions
ward-0.096
reating-0.094
tr-0.085
 about-0.085
ained-0.085
Token.
Token ID13
Feature activation-1.78e-01

Pos logit contributions
ward+0.399
reating+0.396
live+0.357
tr+0.356
ained+0.355
Neg logit contributions
 spe-0.370
mer-0.351
chie-0.346
 se-0.317
 en-0.295
Token The
Token ID383
Feature activation+7.29e-01

Pos logit contributions
 spe+0.228
mer+0.216
chie+0.213
 se+0.194
 en+0.177
Neg logit contributions
ward-0.283
reating-0.273
tr-0.252
 about-0.252
ained-0.250
Token tree
Token ID5509
Feature activation-2.13e-01

Pos logit contributions
reating+1.454
ward+1.440
ained+1.334
live+1.329
tr+1.322
Neg logit contributions
 spe-0.682
mer-0.597
chie-0.587
 se-0.529
 en-0.479
Token and
Token ID290
Feature activation+8.43e-01

Pos logit contributions
 spe+0.317
mer+0.302
chie+0.300
 se+0.273
 en+0.256
Neg logit contributions
ward-0.286
reating-0.276
tr-0.254
 about-0.252
live-0.250
Token the
Token ID262
Feature activation-1.79e-02

Pos logit contributions
reating+1.498
ward+1.495
live+1.392
tr+1.387
ained+1.373
Neg logit contributions
 spe-0.955
mer-0.853
chie-0.837
 se-0.775
 en-0.723
Token wild
Token ID4295
Feature activation+5.79e-01

Pos logit contributions
 spe+0.017
mer+0.016
chie+0.016
 se+0.014
 en+0.012
Neg logit contributions
ward-0.034
reating-0.034
tr-0.031
live-0.031
 about-0.031
Token animal
Token ID5044
Feature activation-4.28e-01

Pos logit contributions
reating+1.029
ward+1.028
icated+0.925
live+0.924
ained+0.921
Neg logit contributions
 spe-0.651
mer-0.597
chie-0.582
 se-0.513
 en-0.477
Token became
Token ID2627
Feature activation-1.00e-01

Pos logit contributions
 spe+0.665
chie+0.637
mer+0.637
 se+0.579
 en+0.547
Neg logit contributions
ward-0.544
reating-0.514
 about-0.477
tr-0.473
live-0.469
Token for
Token ID329
Feature activation-8.26e-01

Pos logit contributions
ward+0.062
reating+0.061
tr+0.054
live+0.053
ained+0.053
Neg logit contributions
 spe-0.087
mer-0.083
chie-0.082
 se-0.076
 en-0.072
Token a
Token ID257
Feature activation-1.07e-01

Pos logit contributions
mer+1.251
 spe+1.239
chie+1.218
 se+1.094
 en+1.027
Neg logit contributions
ward-1.120
reating-1.102
 about-1.000
icated-0.947
 stories-0.938
Token better
Token ID1365
Feature activation-5.52e-01

Pos logit contributions
 spe+0.133
mer+0.127
chie+0.125
 se+0.113
 en+0.104
Neg logit contributions
ward-0.171
reating-0.168
tr-0.154
 about-0.153
live-0.152
Token couch
Token ID18507
Feature activation-1.61e-01

Pos logit contributions
 spe+0.734
chie+0.717
mer+0.707
 se+0.637
 en+0.592
Neg logit contributions
reating-0.831
ward-0.825
 about-0.759
tr-0.753
ained-0.728
Token,"
Token ID553
Feature activation+6.06e-01

Pos logit contributions
 spe+0.246
mer+0.233
chie+0.232
 se+0.214
 en+0.201
Neg logit contributions
ward-0.211
reating-0.207
tr-0.185
 about-0.185
ained-0.182
Token his
Token ID465
Feature activation-4.74e-01

Pos logit contributions
ward+0.833
reating+0.831
ained+0.734
tr+0.732
live+0.727
Neg logit contributions
 spe-0.899
mer-0.836
chie-0.825
 se-0.809
 en-0.754
Token mom
Token ID1995
Feature activation+4.90e-01

Pos logit contributions
 spe+0.653
mer+0.626
chie+0.615
 se+0.549
 en+0.507
Neg logit contributions
ward-0.701
reating-0.682
tr-0.645
 about-0.632
live-0.621
Token said
Token ID531
Feature activation-1.39e+00

Pos logit contributions
ward+0.654
reating+0.650
ained+0.582
tr+0.570
live+0.566
Neg logit contributions
 spe-0.735
mer-0.709
chie-0.692
 se-0.663
 en-0.614
Token.
Token ID13
Feature activation+8.75e-02

Pos logit contributions
 spe+2.322
chie+2.207
mer+2.194
 se+1.976
 blu+1.926
Neg logit contributions
ward-1.762
 about-1.647
reating-1.584
tr-1.454
 done-1.420
Token "
Token ID366
Feature activation-1.53e+00

Pos logit contributions
ward+0.135
reating+0.133
tr+0.121
live+0.120
ained+0.120
Neg logit contributions
 spe-0.114
mer-0.108
chie-0.106
 se-0.097
 en-0.090
TokenThis
Token ID1212
Feature activation-5.48e-01

Pos logit contributions
 spe+2.452
mer+2.272
chie+2.266
 se+2.032
 en+1.963
Neg logit contributions
reating-2.036
ward-1.995
tr-1.800
icated-1.778
After-1.755
Token and
Token ID290
Feature activation+1.59e-01

Pos logit contributions
 spe+0.426
chie+0.406
mer+0.406
 se+0.369
 en+0.346
Neg logit contributions
ward-0.357
reating-0.345
 about-0.315
tr-0.315
live-0.312
Token Jen
Token ID13364
Feature activation+1.30e-01

Pos logit contributions
ward+0.199
reating+0.195
tr+0.172
live+0.171
ained+0.171
Neg logit contributions
 spe-0.256
mer-0.244
chie-0.242
 se-0.225
 en-0.212
Token slide
Token ID10649
Feature activation-3.13e-01

Pos logit contributions
ward+0.157
reating+0.154
tr+0.135
live+0.135
ained+0.134
Neg logit contributions
 spe-0.214
mer-0.205
chie-0.202
 se-0.189
 en-0.178
Token down
Token ID866
Feature activation-6.11e-01

Pos logit contributions
 spe+0.511
mer+0.486
chie+0.484
 se+0.449
 en+0.419
Neg logit contributions
ward-0.379
reating-0.373
ained-0.335
 about-0.333
tr-0.331
Token the
Token ID262
Feature activation-7.15e-01

Pos logit contributions
 spe+0.909
chie+0.879
mer+0.867
 se+0.793
 blu+0.753
Neg logit contributions
ward-0.825
reating-0.797
 about-0.737
ained-0.722
tr-0.710
Token slide
Token ID10649
Feature activation-4.88e-01

Pos logit contributions
 spe+1.094
chie+1.041
mer+1.039
 se+0.923
 en+0.883
Neg logit contributions
ward-0.951
reating-0.921
tr-0.837
live-0.822
ained-0.812
Token.
Token ID13
Feature activation-1.64e-01

Pos logit contributions
 spe+0.718
chie+0.677
mer+0.671
 se+0.616
 en+0.571
Neg logit contributions
ward-0.676
reating-0.662
 about-0.608
tr-0.607
ained-0.606
Token The
Token ID383
Feature activation+1.77e-01

Pos logit contributions
 spe+0.222
mer+0.211
chie+0.209
 se+0.190
 en+0.175
Neg logit contributions
ward-0.247
reating-0.241
tr-0.221
 about-0.220
ained-0.219
Token rope
Token ID17182
Feature activation-7.99e-01

Pos logit contributions
ward+0.322
reating+0.319
tr+0.293
live+0.292
ained+0.291
Neg logit contributions
 spe-0.184
mer-0.171
chie-0.167
 se-0.150
 en-0.135
Token catches
Token ID17591
Feature activation-2.16e-01

Pos logit contributions
 spe+1.317
mer+1.278
chie+1.263
 se+1.148
 skinny+1.091
Neg logit contributions
ward-0.937
reating-0.901
tr-0.844
 about-0.842
ained-0.824
Token them
Token ID606
Feature activation+2.55e-01

Pos logit contributions
 spe+0.322
mer+0.306
chie+0.305
 se+0.279
 en+0.261
Neg logit contributions
ward-0.293
reating-0.287
tr-0.258
 about-0.257
ained-0.256
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 spe+1.959
mer+1.883
chie+1.872
 se+1.717
 blu+1.652
Neg logit contributions
ward-0.991
reating-0.873
 about-0.858
tr-0.773
ouncing-0.757
Token as
Token ID355
Feature activation-1.66e+00

Pos logit contributions
 spe+1.622
mer+1.607
chie+1.601
 se+1.394
 dunno+1.337
Neg logit contributions
ward-1.266
reating-1.142
 about-1.096
tr-1.041
nce-1.002
Token she
Token ID673
Feature activation-2.13e-01

Pos logit contributions
chie+2.383
mer+2.266
 spe+2.209
 se+1.947
 Int+1.811
Neg logit contributions
ward-2.547
reating-2.443
 about-2.335
live-2.200
 what-2.191
Token took
Token ID1718
Feature activation-1.12e-01

Pos logit contributions
 spe+0.292
mer+0.278
chie+0.277
 se+0.247
 en+0.234
Neg logit contributions
ward-0.314
reating-0.305
tr-0.282
live-0.278
 about-0.278
Token Ben
Token ID3932
Feature activation+4.89e-01

Pos logit contributions
 spe+0.159
mer+0.152
chie+0.150
 se+0.137
 en+0.128
Neg logit contributions
ward-0.161
reating-0.159
tr-0.143
 about-0.142
live-0.142
Token's
Token ID338
Feature activation-1.36e+00

Pos logit contributions
ward+0.865
reating+0.856
live+0.788
ained+0.788
tr+0.784
Neg logit contributions
 spe-0.516
mer-0.491
chie-0.477
 se-0.429
 en-0.390
Token hand
Token ID1021
Feature activation+6.00e-02

Pos logit contributions
 spe+1.891
mer+1.823
chie+1.768
 se+1.581
 en+1.517
Neg logit contributions
ward-1.965
reating-1.887
tr-1.855
 about-1.771
 stories-1.732
Token.
Token ID13
Feature activation-5.53e-01

Pos logit contributions
ward+0.086
reating+0.085
tr+0.077
live+0.076
ained+0.076
Neg logit contributions
 spe-0.085
mer-0.080
chie-0.079
 se-0.073
 en-0.068
Token
Token ID198
Feature activation+7.47e-01

Pos logit contributions
 spe+0.894
chie+0.866
mer+0.861
 se+0.784
 en+0.726
Neg logit contributions
ward-0.699
reating-0.661
 about-0.616
tr-0.607
ained-0.598
Token
Token ID198
Feature activation+7.67e-01

Pos logit contributions
reating+1.119
ward+1.118
tr+1.022
ained+1.007
live+0.996
Neg logit contributions
 spe-0.967
mer-0.909
chie-0.891
 se-0.855
 en-0.780
TokenThey
Token ID2990
Feature activation-2.72e-01

Pos logit contributions
ward+1.048
reating+1.042
tr+0.952
ained+0.949
 about+0.942
Neg logit contributions
 spe-1.097
mer-1.074
chie-1.043
 se-0.990
 skinny-0.908
Token watched
Token ID7342
Feature activation-5.14e-01

Pos logit contributions
 spe+0.500
chie+0.471
mer+0.468
 se+0.418
 blu+0.389
Neg logit contributions
ward-0.549
reating-0.529
 about-0.497
tr-0.495
live-0.491
Token the
Token ID262
Feature activation+6.18e-01

Pos logit contributions
 spe+0.760
mer+0.730
chie+0.722
 se+0.653
 blu+0.608
Neg logit contributions
ward-0.712
reating-0.691
 about-0.644
ained-0.605
icated-0.604
Token little
Token ID1310
Feature activation-1.13e+00

Pos logit contributions
ward+1.140
reating+1.133
live+1.050
ained+1.043
 about+1.041
Neg logit contributions
 spe-0.625
mer-0.590
chie-0.562
 se-0.533
 en-0.467
Token duck
Token ID22045
Feature activation+2.52e-01

Pos logit contributions
chie+1.132
 spe+1.108
mer+1.046
 se+0.881
 dunno+0.849
Neg logit contributions
ward-2.037
reating-2.002
tr-1.891
ained-1.813
icated-1.812
Token for
Token ID329
Feature activation+1.00e+00

Pos logit contributions
ward+0.361
reating+0.357
tr+0.324
live+0.323
ained+0.320
Neg logit contributions
 spe-0.351
mer-0.336
chie-0.330
 se-0.306
 en-0.285
Token a
Token ID257
Feature activation-4.42e-01

Pos logit contributions
reating+1.629
ward+1.568
ouncing+1.527
live+1.521
tr+1.521
Neg logit contributions
 spe-1.256
chie-1.125
mer-1.075
 se-1.063
 en-0.956
Token while
Token ID981
Feature activation-1.27e+00

Pos logit contributions
 spe+0.546
mer+0.525
chie+0.516
 se+0.460
 en+0.434
Neg logit contributions
ward-0.713
reating-0.682
tr-0.636
 about-0.628
live-0.621
Token,
Token ID11
Feature activation-5.82e-01

Pos logit contributions
 spe+2.172
chie+2.139
mer+2.127
 se+1.919
 redd+1.854
Neg logit contributions
ward-1.472
reating-1.330
 about-1.307
 what-1.149
icated-1.131
Token licking
Token ID43390
Feature activation+1.11e-01

Pos logit contributions
 spe+0.670
mer+0.644
chie+0.643
 se+0.550
 skinny+0.496
Neg logit contributions
ward-0.998
reating-0.955
 about-0.908
ained-0.877
tr-0.877
Token his
Token ID465
Feature activation-2.48e-01

Pos logit contributions
ward+0.185
reating+0.183
tr+0.167
live+0.167
ained+0.166
Neg logit contributions
 spe-0.132
mer-0.124
chie-0.121
 se-0.110
 en-0.101
Token lips
Token ID11914
Feature activation+7.20e-01

Pos logit contributions
 spe+0.350
mer+0.335
chie+0.329
 se+0.300
 en+0.279
Neg logit contributions
ward-0.356
reating-0.344
tr-0.319
 about-0.313
ained-0.311
Token there
Token ID612
Feature activation-1.21e-01

Pos logit contributions
ward+0.317
reating+0.314
tr+0.286
live+0.283
ained+0.282
Neg logit contributions
 spe-0.279
mer-0.263
chie-0.259
 se-0.239
 en-0.221
Token sweet
Token ID6029
Feature activation+1.97e-01

Pos logit contributions
 spe+0.193
mer+0.185
chie+0.182
 se+0.169
 en+0.159
Neg logit contributions
ward-0.150
reating-0.147
 about-0.131
tr-0.130
ained-0.129
Tokenie
Token ID494
Feature activation-8.32e-01

Pos logit contributions
ward+0.261
reating+0.254
tr+0.230
live+0.227
ained+0.227
Neg logit contributions
 spe-0.302
mer-0.288
chie-0.282
 se-0.265
 en-0.247
Token?
Token ID30
Feature activation-1.56e-01

Pos logit contributions
 spe+1.329
chie+1.248
mer+1.244
 se+1.137
 en+1.064
Neg logit contributions
ward-1.056
reating-1.027
 about-0.955
 done-0.909
live-0.907
Token Can
Token ID1680
Feature activation-1.44e+00

Pos logit contributions
 spe+0.191
mer+0.181
chie+0.177
 se+0.162
 en+0.147
Neg logit contributions
ward-0.252
reating-0.246
tr-0.227
 about-0.225
ained-0.224
Token I
Token ID314
Feature activation-7.59e-01

Pos logit contributions
 spe+1.655
mer+1.635
chie+1.473
 se+1.295
 skinny+1.224
Neg logit contributions
ward-2.521
reating-2.436
 about-2.277
tr-2.266
live-2.232
Token help
Token ID1037
Feature activation-5.99e-01

Pos logit contributions
 spe+1.042
mer+1.018
chie+0.983
 se+0.870
 en+0.839
Neg logit contributions
ward-1.108
reating-1.069
 about-0.994
tr-0.993
live-0.970
Token?"
Token ID1701
Feature activation-1.49e+00

Pos logit contributions
 spe+0.726
mer+0.682
chie+0.656
 se+0.580
 en+0.540
Neg logit contributions
reating-1.001
ward-0.994
 about-0.912
icated-0.892
live-0.886
Token
Token ID198
Feature activation-7.38e-01

Pos logit contributions
 spe+2.137
chie+2.102
mer+2.078
 se+1.718
 blu+1.713
Neg logit contributions
ward-2.194
 about-2.042
reating-2.020
 Afterwards-1.930
live-1.917
Token
Token ID198
Feature activation-8.64e-01

Pos logit contributions
 spe+1.228
mer+1.177
chie+1.158
 se+1.052
 blu+1.009
Neg logit contributions
ward-0.899
reating-0.859
live-0.790
 about-0.785
tr-0.764
TokenThe
Token ID464
Feature activation+6.37e-01

Pos logit contributions
 spe+1.405
mer+1.322
chie+1.279
 se+1.190
 blu+1.144
Neg logit contributions
ward-1.102
reating-1.044
live-0.991
tr-0.950
nce-0.937
Token of
Token ID286
Feature activation-4.17e-01

Pos logit contributions
 spe+0.686
chie+0.657
mer+0.656
 se+0.605
 en+0.569
Neg logit contributions
ward-0.521
reating-0.514
 about-0.460
tr-0.449
ained-0.446
Token the
Token ID262
Feature activation+8.77e-01

Pos logit contributions
 spe+0.538
mer+0.525
chie+0.511
 se+0.452
 en+0.427
Neg logit contributions
ward-0.654
reating-0.643
 about-0.572
ained-0.569
icated-0.567
Token desert
Token ID10326
Feature activation-7.36e-01

Pos logit contributions
ward+1.459
reating+1.435
 about+1.327
ained+1.322
live+1.321
Neg logit contributions
 spe-1.054
mer-0.978
chie-0.933
 se-0.905
 skinny-0.809
Token.
Token ID13
Feature activation+4.30e-02

Pos logit contributions
 spe+1.138
chie+1.090
mer+1.056
 se+0.991
 en+0.927
Neg logit contributions
ward-0.971
reating-0.932
 about-0.871
ained-0.838
tr-0.834
Token How
Token ID1374
Feature activation-8.71e-01

Pos logit contributions
ward+0.074
reating+0.073
tr+0.067
live+0.067
ained+0.067
Neg logit contributions
 spe-0.048
mer-0.045
chie-0.044
 se-0.040
 en-0.036
Token can
Token ID460
Feature activation-2.53e+00

Pos logit contributions
 spe+1.456
mer+1.435
chie+1.408
 se+1.274
 en+1.203
Neg logit contributions
ward-1.026
reating-1.001
 about-0.984
tr-0.920
icated-0.895
Token there
Token ID612
Feature activation-1.09e+00

Pos logit contributions
mer+2.914
 spe+2.692
chie+2.517
 redd+2.084
 blu+2.028
Neg logit contributions
ward-4.703
reating-4.466
 about-4.206
 what-4.201
icated-4.175
Token be
Token ID307
Feature activation-9.32e-01

Pos logit contributions
 spe+1.594
mer+1.550
chie+1.517
 se+1.356
 skinny+1.293
Neg logit contributions
reating-1.515
ward-1.514
 about-1.416
tr-1.344
 done-1.322
Token an
Token ID281
Feature activation+2.03e-01

Pos logit contributions
 spe+1.306
mer+1.266
chie+1.244
 se+1.093
 skinny+1.044
Neg logit contributions
ward-1.327
reating-1.315
 about-1.228
ained-1.174
tr-1.160
Token o
Token ID267
Feature activation+9.16e-02

Pos logit contributions
ward+0.332
reating+0.325
tr+0.300
live+0.298
ained+0.296
Neg logit contributions
 spe-0.247
mer-0.232
chie-0.228
 se-0.208
 en-0.192
Tokenasis
Token ID17765
Feature activation-1.97e-01

Pos logit contributions
ward+0.092
reating+0.090
tr+0.077
live+0.076
ained+0.076
Neg logit contributions
 spe-0.169
mer-0.163
chie-0.161
 se-0.151
 en-0.143
Token was
Token ID373
Feature activation+2.33e-01

Pos logit contributions
chie+2.227
 spe+2.171
mer+2.163
rily+1.875
 blu+1.865
Neg logit contributions
ward-1.395
 about-1.321
reating-1.297
live-1.208
tr-1.161
Token wrong
Token ID2642
Feature activation+1.05e-01

Pos logit contributions
ward+0.287
reating+0.283
tr+0.251
live+0.248
ained+0.246
Neg logit contributions
 spe-0.376
mer-0.360
chie-0.354
 se-0.332
 en-0.314
Token,
Token ID11
Feature activation-1.19e+00

Pos logit contributions
ward+0.149
reating+0.146
tr+0.132
live+0.131
ained+0.131
Neg logit contributions
 spe-0.150
mer-0.142
chie-0.140
 se-0.129
 en-0.121
Token but
Token ID475
Feature activation-7.96e-02

Pos logit contributions
 spe+1.608
mer+1.603
chie+1.573
 se+1.318
 blu+1.246
Neg logit contributions
ward-1.824
reating-1.765
 about-1.745
icated-1.585
ained-1.584
Token the
Token ID262
Feature activation-1.17e+00

Pos logit contributions
 spe+0.096
mer+0.091
chie+0.090
 se+0.081
 en+0.074
Neg logit contributions
ward-0.131
reating-0.128
tr-0.117
 about-0.117
ained-0.116
Token little
Token ID1310
Feature activation-2.55e+00

Pos logit contributions
 spe+1.004
chie+0.967
mer+0.942
 se+0.705
 dunno+0.695
Neg logit contributions
ward-2.303
reating-2.174
tr-2.087
 about-2.075
ained-2.075
Token duck
Token ID22045
Feature activation-2.70e-01

Pos logit contributions
chie+2.738
 spe+2.431
mer+2.302
 Int+2.194
 dunno+2.098
Neg logit contributions
ward-4.376
reating-4.056
ained-4.006
 about-3.919
nce-3.859
Token could
Token ID714
Feature activation-1.17e+00

Pos logit contributions
 spe+0.414
chie+0.389
mer+0.389
 se+0.355
 en+0.331
Neg logit contributions
ward-0.353
reating-0.339
 about-0.312
live-0.310
tr-0.310
Token not
Token ID407
Feature activation-1.07e+00

Pos logit contributions
 spe+1.973
mer+1.937
chie+1.888
 se+1.658
 blu+1.635
Neg logit contributions
ward-1.426
reating-1.312
 about-1.289
live-1.223
tr-1.213
Token answer
Token ID3280
Feature activation-1.28e-01

Pos logit contributions
 spe+1.492
mer+1.399
chie+1.381
 se+1.206
 blu+1.187
Neg logit contributions
ward-1.533
reating-1.514
 about-1.432
tr-1.378
live-1.343
Token.
Token ID13
Feature activation-9.70e-01

Pos logit contributions
 spe+0.175
mer+0.166
chie+0.164
 se+0.150
 en+0.139
Neg logit contributions
ward-0.188
reating-0.185
 about-0.168
tr-0.167
ained-0.166
Token little
Token ID1310
Feature activation-1.46e+00

Pos logit contributions
 spe+0.113
mer+0.105
chie+0.103
 se+0.087
 en+0.076
Neg logit contributions
ward-0.262
reating-0.258
tr-0.241
 about-0.239
live-0.238
Token bird
Token ID6512
Feature activation-1.19e-01

Pos logit contributions
mer+1.129
chie+1.116
 spe+1.112
 dunno+0.818
 se+0.808
Neg logit contributions
ward-2.853
reating-2.760
tr-2.691
 about-2.645
icated-2.632
Token gasped
Token ID45236
Feature activation-5.50e-01

Pos logit contributions
 spe+0.161
mer+0.152
chie+0.151
 se+0.137
 en+0.127
Neg logit contributions
ward-0.176
reating-0.172
tr-0.158
 about-0.156
live-0.156
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
 spe+0.856
chie+0.813
mer+0.803
 se+0.736
 blu+0.685
Neg logit contributions
ward-0.716
reating-0.688
 about-0.649
ained-0.616
tr-0.612
Token How
Token ID1374
Feature activation-1.34e+00

Pos logit contributions
ward+0.287
reating+0.285
tr+0.261
live+0.261
ained+0.260
Neg logit contributions
 spe-0.177
mer-0.165
chie-0.162
 se-0.146
 en-0.133
Token could
Token ID714
Feature activation-2.51e+00

Pos logit contributions
 spe+1.897
mer+1.839
chie+1.816
 se+1.543
 skinny+1.500
Neg logit contributions
 about-1.979
ward-1.962
reating-1.863
tr-1.799
icated-1.793
Token this
Token ID428
Feature activation-1.98e-01

Pos logit contributions
mer+4.234
 spe+4.018
chie+3.774
 redd+3.524
rily+3.433
Neg logit contributions
ward-3.355
reating-3.058
 about-3.010
 what-2.904
 done-2.830
Token be
Token ID307
Feature activation-9.56e-01

Pos logit contributions
 spe+0.296
mer+0.285
chie+0.282
 se+0.257
 en+0.241
Neg logit contributions
ward-0.266
reating-0.262
tr-0.237
 about-0.235
ained-0.231
Token happening
Token ID5836
Feature activation-5.53e-01

Pos logit contributions
 spe+1.508
mer+1.443
chie+1.428
 se+1.293
 skinny+1.216
Neg logit contributions
ward-1.212
reating-1.200
 about-1.147
ained-1.094
tr-1.090
Token?
Token ID30
Feature activation-4.81e-01

Pos logit contributions
 spe+0.904
chie+0.852
mer+0.847
 se+0.780
 en+0.738
Neg logit contributions
ward-0.679
reating-0.666
 about-0.613
ained-0.584
tr-0.580
Token
Token ID198
Feature activation+9.98e-02

Pos logit contributions
 spe+0.624
mer+0.591
chie+0.579
 se+0.526
 en+0.481
Neg logit contributions
ward-0.757
reating-0.723
tr-0.677
 about-0.675
icated-0.663
Token strong
Token ID1913
Feature activation-4.75e-01

Pos logit contributions
 spe+0.456
mer+0.416
chie+0.413
 se+0.354
 blu+0.303
Neg logit contributions
ward-0.891
reating-0.873
 about-0.821
ained-0.817
live-0.814
Token,
Token ID11
Feature activation-1.51e+00

Pos logit contributions
 spe+0.692
chie+0.670
mer+0.654
 se+0.599
 en+0.558
Neg logit contributions
ward-0.645
reating-0.637
 about-0.589
tr-0.569
ained-0.563
Token but
Token ID475
Feature activation-1.75e-01

Pos logit contributions
chie+1.940
 spe+1.867
mer+1.810
 se+1.517
 dunno+1.451
Neg logit contributions
ward-2.400
 about-2.374
reating-2.320
icated-2.192
ained-2.169
Token the
Token ID262
Feature activation-5.21e-01

Pos logit contributions
 spe+0.220
mer+0.209
chie+0.207
 se+0.186
 en+0.169
Neg logit contributions
ward-0.278
reating-0.272
 about-0.250
tr-0.248
ained-0.247
Token two
Token ID734
Feature activation-1.55e+00

Pos logit contributions
 spe+0.645
mer+0.612
chie+0.605
 se+0.528
 en+0.482
Neg logit contributions
ward-0.838
reating-0.827
 about-0.751
tr-0.748
ained-0.741
Token could
Token ID714
Feature activation-2.32e+00

Pos logit contributions
 spe+2.955
mer+2.801
chie+2.778
 se+2.484
 blu+2.458
Neg logit contributions
ward-1.408
reating-1.376
 about-1.355
 stories-1.230
 done-1.210
Token not
Token ID407
Feature activation-4.66e-01

Pos logit contributions
 spe+3.431
mer+3.208
chie+3.154
 redd+2.745
 se+2.709
Neg logit contributions
ward-3.302
 about-3.129
tr-2.958
reating-2.947
 done-2.910
Token get
Token ID651
Feature activation-6.79e-01

Pos logit contributions
 spe+0.628
mer+0.598
chie+0.591
 se+0.535
 en+0.494
Neg logit contributions
ward-0.691
reating-0.678
 about-0.626
tr-0.618
ained-0.605
Token along
Token ID1863
Feature activation-2.94e-01

Pos logit contributions
 spe+0.883
mer+0.847
chie+0.818
 se+0.737
 blu+0.680
Neg logit contributions
ward-1.042
reating-1.025
 about-0.944
ained-0.933
 done-0.907
Token.
Token ID13
Feature activation-5.48e-01

Pos logit contributions
 spe+0.418
mer+0.396
chie+0.396
 se+0.363
 en+0.335
Neg logit contributions
ward-0.417
reating-0.409
 about-0.372
tr-0.366
ained-0.364
Token
Token ID198
Feature activation-7.44e-01

Pos logit contributions
 spe+0.766
mer+0.723
chie+0.721
 se+0.655
 en+0.594
Neg logit contributions
ward-0.808
reating-0.786
 about-0.730
tr-0.724
ained-0.718
Token
Token ID198
Feature activation+9.14e-01

Pos logit contributions
 spe+0.221
mer+0.211
chie+0.208
 se+0.191
 en+0.179
Neg logit contributions
ward-0.191
reating-0.187
tr-0.168
 about-0.167
live-0.167
TokenTim
Token ID14967
Feature activation+1.01e+00

Pos logit contributions
reating+1.487
ward+1.480
ained+1.331
 about+1.317
icated+1.308
Neg logit contributions
 spe-1.115
mer-1.054
chie-1.038
 se-0.988
 en-0.910
Tokenmy
Token ID1820
Feature activation-2.53e-02

Pos logit contributions
ward+0.898
reating+0.856
ained+0.677
tr+0.658
live+0.637
Neg logit contributions
 spe-1.973
mer-1.972
chie-1.952
 se-1.859
 en-1.765
Token said
Token ID531
Feature activation-6.20e-01

Pos logit contributions
 spe+0.038
mer+0.036
chie+0.035
 se+0.033
 en+0.030
Neg logit contributions
ward-0.034
reating-0.034
tr-0.030
live-0.030
 about-0.030
Token,
Token ID11
Feature activation-6.42e-01

Pos logit contributions
 spe+1.147
mer+1.115
chie+1.097
 se+1.014
 blu+0.968
Neg logit contributions
ward-0.632
reating-0.579
 about-0.546
tr-0.519
live-0.500
Token "
Token ID366
Feature activation-2.33e+00

Pos logit contributions
 spe+0.796
mer+0.767
chie+0.760
 se+0.657
 blu+0.607
Neg logit contributions
ward-1.052
reating-0.987
 about-0.951
tr-0.933
live-0.903
TokenSure
Token ID19457
Feature activation-3.70e-01

Pos logit contributions
 spe+3.728
chie+3.478
mer+3.415
rily+3.116
ihu+3.039
Neg logit contributions
ward-2.931
After-2.824
reating-2.770
tr-2.755
live-2.724
Token,
Token ID11
Feature activation-4.01e-01

Pos logit contributions
 spe+0.658
mer+0.639
chie+0.626
 se+0.587
 en+0.559
Neg logit contributions
ward-0.394
reating-0.378
 about-0.336
tr-0.333
live-0.328
Token I
Token ID314
Feature activation+4.90e-01

Pos logit contributions
 spe+0.454
mer+0.442
chie+0.426
 se+0.371
 en+0.339
Neg logit contributions
ward-0.688
reating-0.682
 about-0.635
tr-0.621
icated-0.614
Token'll
Token ID1183
Feature activation-5.45e-01

Pos logit contributions
ward+0.824
reating+0.815
live+0.732
tr+0.731
icated+0.730
Neg logit contributions
 spe-0.580
mer-0.548
chie-0.535
 se-0.496
 en-0.449
Token show
Token ID905
Feature activation+1.33e-01

Pos logit contributions
 spe+0.713
mer+0.687
chie+0.663
 se+0.597
 en+0.559
Neg logit contributions
ward-0.839
reating-0.810
 about-0.759
tr-0.759
live-0.742