TOP ACTIVATIONS
MAX = 3.191

!" L ily and Tom met at the bottom
to be kind . That 's what makes you happy and
families happy too . And so , the whole town became
ummy cube . But the cube is not a
day ever ! But soon it was time for
and free . But then , he hears a
" You are making too much noise and you are being
noises . They had so much fun with the car .
day , she ate too much candy and her stomach hurt
it . She had so much fun that she didn 't
They are happy . And so are Mom and Dad .
. You are making too much noise . You are in
aren and Ann had so much fun playing together ! Ann
love this movie !" and sn ugg led up with her
my . You ate too much candy , didn 't you
her new home . Sara and Tom smiled as they watched
One day , Sara and Tom want to do art
to cry . She regretted not being more careful . Her
After he was done chewing , Tim my
, " I gained so much confidence !" But then ,

MIN ACTIVATIONS
MIN = -4.022

that was broken . The little boat felt sad for the
very sad and missed the little bird ie . <|endoftext|> Once
boat . So , the little boat offered to help the
. " How can we get the jet back
sad and scared . How can they get their rocket back
are you ?" But the little boat said nothing , only
?" asked Sammy in a tiny voice . "
and wanted to know why the airplane could fly but not
is getting wet . What can we do ?" Cow said
pigeon wanted to help the little bird , so he told
said . " But how can we fly in a jet
is not happy . What can I do ?" She thought
and our corn . What can we do ?"
the bird , " What can I do to get to
" OK , what can we make ?" Sam asks
boat . The little boat breathed a sigh of
bird came and scared the little bird ie . The little
it was small . The little boat was mo ored in
One day , the tiny pink boat had to move
We are not going on a trip today . We are

INTERVAL 1.388 - 3.191
CONTAINS 2.240%

said . " You should be more careful . You could
something !" E lla decided to solve the
and have fun . One day , Sam saw a big
grass y hills . One day , the cow climbed higher
was showing off his moves and he accidentally broke his toe

INTERVAL -0.416 - 1.388
CONTAINS 70.855%

feel happier . Everyone agreed . Taking the medicine felt nice
All her friends had different ideas , but she couldn 't
He w agged his tail and bark ed , " Come
" No , Lily . I want some red
to share the toy . They played together and laughed a

INTERVAL -2.219 - -0.416
CONTAINS 26.829%

after the environment ! <|endoftext|> Once upon a time , there
. He was the store owner . He ran after them
when she opened the box . She quickly opened it and
the symbol . They were no longer scared , but instead
. No one understood where it came from , but it

INTERVAL -4.022 - -2.219
CONTAINS 0.076%

was a mighty pot . The pot was so big and
said , " Mom , can we fix the pigeon ?
A girl came over to them . She had a
woman wanted to help the little girl , so she decided
One day , he took his boat out on the water
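
The four INTERVAL ranges above look like an equal-width split of the span between MIN = -4.022 and MAX = 3.191. This is inferred from the listed bounds, not stated in the dump. A minimal sketch of that arithmetic follows; the helper name `interval_edges` and the bin count of 4 are assumptions made for illustration.

```python
def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi (hypothetical helper)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX exactly as listed in the dump above.
edges = interval_edges(-4.022, 3.191)

# Pair each edge with its successor, then list the bins from highest
# to lowest so the order matches the INTERVAL sections in the dump.
bins = list(zip(edges[:-1], edges[1:]))
for lower, upper in reversed(bins):
    print(f"INTERVAL {lower:.3f} - {upper:.3f}")
```

Each bin is (3.191 - (-4.022)) / 4 = 1.80325 wide, which gives edges that agree with the listed bounds -2.219, -0.416, and 1.388 to within rounding. The four CONTAINS percentages (2.240, 70.855, 26.829, 0.076) sum to 100.000, as expected if the bins cover the full range.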
Token!"
Token ID2474
Feature activation+7.74e-01

Pos logit contributions
reating+0.642
ward+0.637
ained+0.559
tr+0.551
live+0.550
Neg logit contributions
 spe-0.809
mer-0.779
chie-0.761
 se-0.711
 en-0.664
Token
Token ID198
Feature activation+1.34e+00

Pos logit contributions
reating+1.215
ward+1.215
live+1.121
tr+1.098
ained+1.080
Neg logit contributions
 spe-0.961
mer-0.919
chie-0.876
 se-0.858
 en-0.815
Token
Token ID198
Feature activation+2.25e+00

Pos logit contributions
reating+2.165
ward+2.160
tr+1.959
live+1.931
ained+1.930
Neg logit contributions
 spe-1.554
mer-1.507
 se-1.453
chie-1.452
 en-1.351
TokenL
Token ID43
Feature activation+6.11e-01

Pos logit contributions
ward+4.148
reating+4.049
icated+3.761
ained+3.694
inea+3.651
Neg logit contributions
mer-2.239
chie-2.108
 se-2.100
 spe-2.045
 en-1.949
Tokenily
Token ID813
Feature activation+6.49e-01

Pos logit contributions
ward+0.851
reating+0.816
tr+0.750
ained+0.732
live+0.729
Neg logit contributions
 spe-0.867
mer-0.850
chie-0.844
 se-0.775
 en-0.729
Token and
Token ID290
Feature activation+3.19e+00

Pos logit contributions
reating+1.029
ward+1.012
ained+0.922
live+0.900
tr+0.898
Neg logit contributions
 spe-0.834
mer-0.788
chie-0.752
 se-0.742
 en-0.673
Token Tom
Token ID4186
Feature activation+1.22e+00

Pos logit contributions
ward+5.812
reating+5.781
reetings+5.387
ained+5.355
icated+5.330
Neg logit contributions
 se-3.321
 spe-3.310
 low-3.131
 shy-3.095
 en-3.069
Token met
Token ID1138
Feature activation+1.68e+00

Pos logit contributions
reating+2.019
ward+1.922
ained+1.802
live+1.773
tr+1.739
Neg logit contributions
 spe-1.558
mer-1.443
 se-1.429
chie-1.402
 en-1.254
Token at
Token ID379
Feature activation+5.46e-01

Pos logit contributions
ward+2.783
reating+2.737
live+2.637
tr+2.629
icated+2.524
Neg logit contributions
 spe-1.882
mer-1.865
 se-1.713
 en-1.710
chie-1.655
Token the
Token ID262
Feature activation+1.26e+00

Pos logit contributions
ward+0.916
reating+0.906
tr+0.851
live+0.838
ained+0.835
Neg logit contributions
 spe-0.641
mer-0.598
chie-0.580
 se-0.545
 en-0.488
Token bottom
Token ID4220
Feature activation-5.01e-01

Pos logit contributions
ward+1.906
reating+1.858
live+1.766
ained+1.737
icated+1.728
Neg logit contributions
 spe-1.659
chie-1.543
mer-1.541
 se-1.513
 en-1.396
Token to
Token ID284
Feature activation+1.96e+00

Pos logit contributions
ward+2.576
reating+2.495
ained+2.411
live+2.368
ouncing+2.313
Neg logit contributions
 spe-1.663
 se-1.531
chie-1.468
mer-1.452
 en-1.299
Token be
Token ID307
Feature activation+6.26e-01

Pos logit contributions
ward+3.089
reating+2.963
tr+2.900
ained+2.834
live+2.771
Neg logit contributions
 spe-2.510
chie-2.426
 se-2.375
 en-2.274
mer-2.218
Token kind
Token ID1611
Feature activation+3.99e-01

Pos logit contributions
ward+1.133
reating+1.124
tr+1.034
live+1.025
icated+1.018
Neg logit contributions
 spe-0.656
mer-0.609
chie-0.603
 se-0.548
 en-0.519
Token.
Token ID13
Feature activation+5.73e-01

Pos logit contributions
ward+0.643
reating+0.629
live+0.582
tr+0.578
ained+0.574
Neg logit contributions
 spe-0.490
mer-0.471
chie-0.454
 se-0.415
 en-0.383
Token That
Token ID1320
Feature activation+6.48e-01

Pos logit contributions
ward+0.864
reating+0.855
live+0.766
tr+0.766
ained+0.761
Neg logit contributions
 spe-0.765
mer-0.714
chie-0.702
 se-0.644
 en-0.612
Token's
Token ID338
Feature activation+3.18e+00

Pos logit contributions
ward+1.055
reating+1.039
tr+0.942
live+0.939
ained+0.935
Neg logit contributions
 spe-0.787
mer-0.741
chie-0.698
 se-0.686
 en-0.620
Token what
Token ID644
Feature activation+5.15e-01

Pos logit contributions
ops+5.147
reating+5.044
ward+4.979
ugg+4.929
opl+4.602
Neg logit contributions
 spe-4.206
 shy-4.146
 en-4.132
chie-3.808
 boss-3.681
Token makes
Token ID1838
Feature activation+6.35e-02

Pos logit contributions
ward+0.899
reating+0.885
tr+0.823
live+0.813
ained+0.811
Neg logit contributions
 spe-0.565
mer-0.516
chie-0.510
 se-0.482
 en-0.437
Token you
Token ID345
Feature activation+3.85e-01

Pos logit contributions
ward+0.115
reating+0.114
tr+0.105
live+0.104
ained+0.104
Neg logit contributions
 spe-0.066
mer-0.062
chie-0.060
 se-0.054
 en-0.048
Token happy
Token ID3772
Feature activation+6.39e-01

Pos logit contributions
ward+0.670
reating+0.665
tr+0.615
live+0.614
ained+0.603
Neg logit contributions
 spe-0.419
mer-0.392
chie-0.386
 se-0.355
 en-0.329
Token and
Token ID290
Feature activation+1.31e+00

Pos logit contributions
ward+1.041
reating+1.017
live+0.946
tr+0.934
ained+0.916
Neg logit contributions
 spe-0.773
mer-0.745
chie-0.708
 se-0.671
 en-0.623
Token families
Token ID4172
Feature activation-3.26e-01

Pos logit contributions
 spe+0.858
chie+0.820
mer+0.814
 se+0.734
 en+0.675
Neg logit contributions
ward-0.917
reating-0.904
 about-0.824
tr-0.815
ained-0.813
Token happy
Token ID3772
Feature activation+4.55e-01

Pos logit contributions
 spe+0.416
mer+0.396
chie+0.386
 se+0.347
 en+0.317
Neg logit contributions
ward-0.511
reating-0.502
 about-0.461
tr-0.453
ained-0.450
Token too
Token ID1165
Feature activation-3.61e-01

Pos logit contributions
ward+0.672
reating+0.665
live+0.607
tr+0.602
ained+0.594
Neg logit contributions
 spe-0.620
mer-0.593
chie-0.572
 se-0.539
 en-0.500
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
 spe+0.529
chie+0.501
mer+0.500
 se+0.459
 en+0.426
Neg logit contributions
ward-0.503
reating-0.485
 about-0.451
tr-0.436
ained-0.432
Token And
Token ID843
Feature activation+1.16e+00

Pos logit contributions
reating+2.291
ward+2.216
ained+2.050
icated+1.986
live+1.981
Neg logit contributions
 spe-1.716
mer-1.551
chie-1.473
 se-1.381
 en-1.347
Token so
Token ID523
Feature activation+3.17e+00

Pos logit contributions
ward+1.916
reating+1.900
tr+1.794
ained+1.778
live+1.746
Neg logit contributions
 spe-1.391
mer-1.278
chie-1.203
 se-1.170
 en-1.104
Token,
Token ID11
Feature activation+1.72e+00

Pos logit contributions
reating+5.839
ward+5.394
affle+5.013
icated+5.012
ruck+4.985
Neg logit contributions
 spe-3.708
mer-3.664
 se-3.223
 shy-3.066
 en-3.059
Token the
Token ID262
Feature activation+4.84e-01

Pos logit contributions
reating+3.122
ward+3.110
tr+2.932
ained+2.913
icated+2.882
Neg logit contributions
 spe-1.827
mer-1.585
 se-1.395
 en-1.370
chie-1.344
Token whole
Token ID2187
Feature activation+1.93e-01

Pos logit contributions
ward+0.677
reating+0.673
tr+0.606
live+0.606
ained+0.595
Neg logit contributions
 spe-0.712
mer-0.667
chie-0.656
 se-0.630
 en-0.585
Token town
Token ID3240
Feature activation-5.04e-02

Pos logit contributions
ward+0.311
reating+0.306
tr+0.281
live+0.277
ained+0.276
Neg logit contributions
 spe-0.242
mer-0.229
chie-0.223
 se-0.207
 en-0.190
Token became
Token ID2627
Feature activation+1.55e-02

Pos logit contributions
 spe+0.075
mer+0.071
chie+0.071
 se+0.065
 en+0.061
Neg logit contributions
ward-0.068
reating-0.067
tr-0.060
 about-0.060
live-0.059
Tokenummy
Token ID13513
Feature activation+1.15e+00

Pos logit contributions
 spe+0.111
mer+0.106
chie+0.105
 se+0.098
 en+0.092
Neg logit contributions
ward-0.080
reating-0.078
tr-0.069
live-0.068
 about-0.068
Token cube
Token ID23441
Feature activation-6.77e-02

Pos logit contributions
ward+1.661
reating+1.641
live+1.538
ained+1.537
tr+1.447
Neg logit contributions
 spe-1.677
mer-1.600
 se-1.525
chie-1.477
 en-1.367
Token.
Token ID13
Feature activation+8.30e-01

Pos logit contributions
 spe+0.097
mer+0.092
chie+0.091
 se+0.084
 en+0.078
Neg logit contributions
ward-0.095
reating-0.094
tr-0.085
 about-0.084
live-0.083
Token
Token ID198
Feature activation+1.21e+00

Pos logit contributions
reating+1.358
ward+1.352
live+1.204
ained+1.195
tr+1.194
Neg logit contributions
 spe-1.003
mer-0.946
chie-0.908
 se-0.867
 en-0.825
Token
Token ID198
Feature activation+2.18e+00

Pos logit contributions
reating+1.930
ward+1.897
tr+1.741
ained+1.705
icated+1.688
Neg logit contributions
 spe-1.478
mer-1.380
chie-1.349
 se-1.331
 en-1.231
TokenBut
Token ID1537
Feature activation+3.15e+00

Pos logit contributions
ward+3.531
reating+3.487
icated+3.154
tr+3.119
inea+3.099
Neg logit contributions
mer-2.654
 spe-2.548
chie-2.474
 se-2.418
 en-2.342
Token the
Token ID262
Feature activation-1.36e-01

Pos logit contributions
reating+5.871
ugg+5.706
ward+5.559
icated+5.554
live+5.495
Neg logit contributions
 spe-3.305
 en-3.035
 se-2.967
 shy-2.932
 low-2.923
Token cube
Token ID23441
Feature activation-8.94e-02

Pos logit contributions
 spe+0.175
mer+0.165
chie+0.163
 se+0.146
 en+0.134
Neg logit contributions
ward-0.213
reating-0.210
tr-0.191
 about-0.190
ained-0.189
Token is
Token ID318
Feature activation+8.02e-01

Pos logit contributions
 spe+0.145
mer+0.138
chie+0.137
 se+0.127
 en+0.119
Neg logit contributions
ward-0.109
reating-0.106
tr-0.095
 about-0.094
live-0.094
Token not
Token ID407
Feature activation-3.80e-01

Pos logit contributions
reating+1.292
ward+1.282
live+1.157
tr+1.155
icated+1.155
Neg logit contributions
 spe-1.024
mer-0.930
chie-0.911
 se-0.863
 en-0.826
Token a
Token ID257
Feature activation-6.22e-01

Pos logit contributions
 spe+0.504
mer+0.486
chie+0.477
 se+0.429
 en+0.392
Neg logit contributions
ward-0.576
reating-0.564
 about-0.522
ained-0.508
tr-0.507
Token day
Token ID1110
Feature activation-8.45e-01

Pos logit contributions
ward+0.549
reating+0.539
live+0.486
ained+0.483
tr+0.482
Neg logit contributions
 spe-0.539
mer-0.512
chie-0.499
 se-0.472
 en-0.434
Token ever
Token ID1683
Feature activation+8.55e-01

Pos logit contributions
 spe+1.284
chie+1.245
mer+1.232
 se+1.100
 en+1.062
Neg logit contributions
ward-1.125
reating-1.066
 about-1.002
icated-0.945
live-0.943
Token!
Token ID0
Feature activation+1.86e+00

Pos logit contributions
ward+1.231
reating+1.209
tr+1.142
live+1.107
ained+1.096
Neg logit contributions
 spe-1.188
mer-1.177
chie-1.095
 se-1.056
 en-0.963
Token
Token ID198
Feature activation+1.16e+00

Pos logit contributions
reating+3.257
ward+3.209
icated+2.867
live+2.831
tr+2.825
Neg logit contributions
 spe-1.900
mer-1.812
 Int-1.719
 en-1.716
chie-1.683
Token
Token ID198
Feature activation+2.24e+00

Pos logit contributions
ward+1.822
reating+1.822
tr+1.662
icated+1.622
ained+1.610
Neg logit contributions
 spe-1.389
mer-1.328
chie-1.269
 se-1.242
 en-1.160
TokenBut
Token ID1537
Feature activation+3.14e+00

Pos logit contributions
ward+3.436
reating+3.282
inea+3.102
icated+3.035
ugg+2.979
Neg logit contributions
mer-2.696
 spe-2.680
 se-2.573
 en-2.550
 low-2.503
Token soon
Token ID2582
Feature activation+2.16e+00

Pos logit contributions
reating+5.400
ugg+5.355
ward+5.159
tr+5.013
live+5.007
Neg logit contributions
 spe-3.440
 en-3.352
mer-3.331
 shy-3.182
 but-3.175
Token it
Token ID340
Feature activation-1.83e-01

Pos logit contributions
tr+3.575
reating+3.532
ward+3.470
ugg+3.379
live+3.284
Neg logit contributions
 spe-2.519
mer-2.456
 se-2.285
 en-2.243
 shy-2.186
Token was
Token ID373
Feature activation+3.52e-01

Pos logit contributions
 spe+0.289
mer+0.274
chie+0.273
 se+0.251
 en+0.237
Neg logit contributions
ward-0.232
reating-0.225
tr-0.204
live-0.203
ained-0.202
Token time
Token ID640
Feature activation+9.11e-01

Pos logit contributions
ward+0.483
reating+0.481
tr+0.430
live+0.428
icated+0.421
Neg logit contributions
 spe-0.522
mer-0.490
chie-0.484
 se-0.453
 en-0.427
Token for
Token ID329
Feature activation+1.14e+00

Pos logit contributions
reating+1.594
ward+1.581
live+1.501
tr+1.485
ained+1.442
Neg logit contributions
 spe-0.940
mer-0.880
chie-0.877
 se-0.807
 en-0.707
Token and
Token ID290
Feature activation+7.42e-01

Pos logit contributions
ward+1.815
reating+1.787
live+1.737
tr+1.681
ained+1.674
Neg logit contributions
 spe-1.340
mer-1.243
 se-1.196
chie-1.142
 en-1.072
Token free
Token ID1479
Feature activation+7.23e-01

Pos logit contributions
reating+1.329
ward+1.297
tr+1.214
live+1.204
ained+1.186
Neg logit contributions
 spe-0.802
mer-0.742
chie-0.713
 se-0.680
 en-0.633
Token.
Token ID13
Feature activation+1.83e+00

Pos logit contributions
ward+1.118
reating+1.098
live+1.038
tr+1.008
ained+1.005
Neg logit contributions
 spe-0.922
mer-0.888
chie-0.841
 se-0.791
 en-0.734
Token
Token ID198
Feature activation+1.40e+00

Pos logit contributions
reating+3.376
ward+3.313
icated+2.951
live+2.931
tr+2.911
Neg logit contributions
 spe-1.843
mer-1.707
 en-1.643
chie-1.568
 skinny-1.561
Token
Token ID198
Feature activation+2.38e+00

Pos logit contributions
reating+2.195
ward+2.152
tr+1.990
ained+1.936
icated+1.909
Neg logit contributions
 spe-1.739
mer-1.630
 se-1.563
chie-1.561
 en-1.473
TokenBut
Token ID1537
Feature activation+3.13e+00

Pos logit contributions
ward+4.015
reating+3.902
inea+3.648
icated+3.627
tr+3.563
Neg logit contributions
mer-2.684
 spe-2.564
 se-2.438
 en-2.394
chie-2.384
Token then
Token ID788
Feature activation+1.94e+00

Pos logit contributions
reating+5.735
ward+5.439
icated+5.386
ugg+5.301
ouncing+5.201
Neg logit contributions
 spe-3.436
mer-3.149
 boat-3.048
 en-3.037
 se-3.024
Token,
Token ID11
Feature activation+2.39e+00

Pos logit contributions
reating+3.186
ward+3.070
tr+2.929
ained+2.923
ugg+2.868
Neg logit contributions
 spe-2.516
mer-2.218
 se-2.159
 boat-2.066
chie-2.058
Token he
Token ID339
Feature activation+3.22e-01

Pos logit contributions
reating+4.122
ward+4.073
live+3.897
ugg+3.892
ouncing+3.867
Neg logit contributions
 spe-2.665
 se-2.368
mer-2.347
 en-2.280
 shy-2.251
Token hears
Token ID23180
Feature activation+1.12e+00

Pos logit contributions
ward+0.537
reating+0.530
tr+0.482
live+0.481
ained+0.479
Neg logit contributions
 spe-0.381
mer-0.357
chie-0.347
 se-0.324
 en-0.295
Token a
Token ID257
Feature activation-9.34e-01

Pos logit contributions
ward+1.806
reating+1.778
live+1.696
ouncing+1.662
ained+1.661
Neg logit contributions
 spe-1.330
mer-1.219
 se-1.208
chie-1.194
 en-1.100
Token "
Token ID366
Feature activation-3.32e-01

Pos logit contributions
 spe+2.112
chie+2.016
mer+1.974
 se+1.754
 blu+1.683
Neg logit contributions
ward-2.056
 about-1.982
reating-1.926
tr-1.915
ouncing-1.828
TokenYou
Token ID1639
Feature activation+3.63e-01

Pos logit contributions
 spe+0.535
mer+0.504
chie+0.497
 se+0.459
 en+0.430
Neg logit contributions
ward-0.414
reating-0.412
tr-0.362
ained-0.359
icated-0.357
Token are
Token ID389
Feature activation+9.29e-01

Pos logit contributions
ward+0.464
reating+0.458
live+0.403
ained+0.401
tr+0.399
Neg logit contributions
 spe-0.575
mer-0.546
chie-0.543
 se-0.507
 en-0.474
Token making
Token ID1642
Feature activation+9.92e-01

Pos logit contributions
ward+1.372
reating+1.355
tr+1.234
ugg+1.226
live+1.220
Neg logit contributions
 spe-1.289
chie-1.208
mer-1.196
 se-1.116
 en-1.076
Token too
Token ID1165
Feature activation+6.02e-02

Pos logit contributions
ward+1.579
reating+1.540
live+1.485
tr+1.470
ained+1.451
Neg logit contributions
 spe-1.205
chie-1.106
mer-1.095
 se-1.073
 en-1.000
Token much
Token ID881
Feature activation+3.11e+00

Pos logit contributions
ward+0.066
reating+0.064
tr+0.056
live+0.055
ained+0.055
Neg logit contributions
 spe-0.106
mer-0.102
chie-0.101
 se-0.094
 en-0.089
Token noise
Token ID7838
Feature activation+1.50e-01

Pos logit contributions
live+6.265
ward+5.841
inea+5.734
ugg+5.684
ained+5.666
Neg logit contributions
 spe-3.429
 se-3.242
 en-2.622
 skinny-2.604
mer-2.566
Token and
Token ID290
Feature activation+6.06e-01

Pos logit contributions
ward+0.224
reating+0.220
tr+0.200
live+0.199
ained+0.198
Neg logit contributions
 spe-0.203
mer-0.193
chie-0.189
 se-0.174
 en-0.161
Token you
Token ID345
Feature activation+1.03e+00

Pos logit contributions
ward+0.945
reating+0.938
live+0.868
tr+0.855
ained+0.849
Neg logit contributions
 spe-0.785
mer-0.725
chie-0.716
 se-0.675
 en-0.626
Token are
Token ID389
Feature activation+6.97e-01

Pos logit contributions
reating+1.578
ward+1.551
live+1.369
ained+1.357
 Afterwards+1.352
Neg logit contributions
 spe-1.427
mer-1.360
chie-1.325
 se-1.245
 en-1.137
Token being
Token ID852
Feature activation-5.50e-01

Pos logit contributions
ward+1.044
reating+1.034
tr+0.946
live+0.941
ained+0.920
Neg logit contributions
 spe-0.952
chie-0.885
mer-0.884
 se-0.822
 en-0.790
Token noises
Token ID26782
Feature activation+3.09e-01

Pos logit contributions
ward+1.459
reating+1.420
live+1.397
ained+1.324
tr+1.298
Neg logit contributions
 spe-1.108
mer-1.023
chie-1.000
 se-0.949
 en-0.887
Token.
Token ID13
Feature activation-8.15e-01

Pos logit contributions
ward+0.465
reating+0.456
live+0.421
tr+0.416
ained+0.415
Neg logit contributions
 spe-0.415
mer-0.394
chie-0.382
 se-0.357
 en-0.331
Token They
Token ID1119
Feature activation+1.51e-01

Pos logit contributions
 spe+1.308
chie+1.272
mer+1.243
 se+1.141
 skinny+1.036
Neg logit contributions
ward-1.052
reating-0.981
 about-0.961
ained-0.928
live-0.911
Token had
Token ID550
Feature activation-8.02e-01

Pos logit contributions
ward+0.211
reating+0.208
tr+0.186
live+0.186
ained+0.185
Neg logit contributions
 spe-0.219
mer-0.208
chie-0.205
 se-0.190
 en-0.177
Token so
Token ID523
Feature activation+1.36e+00

Pos logit contributions
mer+1.353
 spe+1.334
chie+1.313
 se+1.174
 en+1.102
Neg logit contributions
ward-0.962
reating-0.913
 about-0.866
tr-0.828
ained-0.811
Token much
Token ID881
Feature activation+3.09e+00

Pos logit contributions
reating+1.675
ward+1.655
live+1.439
tr+1.409
ained+1.364
Neg logit contributions
 spe-2.311
mer-2.109
chie-2.029
 se-1.961
 en-1.944
Token fun
Token ID1257
Feature activation+7.75e-01

Pos logit contributions
live+6.075
inea+5.978
reating+5.881
ward+5.837
ugg+5.779
Neg logit contributions
 se-3.155
 spe-3.033
 en-2.537
 skinny-2.454
 boat-2.429
Token with
Token ID351
Feature activation-1.84e-01

Pos logit contributions
reating+1.056
ward+1.044
live+0.959
tr+0.953
ained+0.931
Neg logit contributions
 spe-1.151
mer-1.094
chie-1.044
 se-1.016
 en-0.940
Token the
Token ID262
Feature activation-6.43e-01

Pos logit contributions
 spe+0.246
mer+0.237
chie+0.235
 se+0.211
 en+0.194
Neg logit contributions
ward-0.279
reating-0.274
 about-0.249
tr-0.247
ained-0.245
Token car
Token ID1097
Feature activation+4.89e-01

Pos logit contributions
 spe+0.809
mer+0.784
chie+0.758
 se+0.663
 dunno+0.608
Neg logit contributions
ward-1.021
reating-1.010
tr-0.932
 about-0.917
ained-0.916
Token.
Token ID13
Feature activation+1.29e-01

Pos logit contributions
ward+0.686
reating+0.682
live+0.621
ained+0.607
tr+0.601
Neg logit contributions
 spe-0.698
mer-0.667
chie-0.647
 se-0.607
 en-0.567
Token day
Token ID1110
Feature activation+1.72e+00

Pos logit contributions
ward+0.799
reating+0.786
tr+0.642
live+0.634
ained+0.630
Neg logit contributions
 spe-1.424
mer-1.384
chie-1.339
 se-1.304
 en-1.236
Token,
Token ID11
Feature activation-4.44e-01

Pos logit contributions
reating+2.636
ward+2.493
tr+2.366
ained+2.322
icated+2.294
Neg logit contributions
 spe-2.389
mer-2.151
 se-2.074
chie-2.036
 en-1.910
Token she
Token ID673
Feature activation-9.96e-01

Pos logit contributions
 spe+0.570
chie+0.549
mer+0.548
 se+0.474
 en+0.436
Neg logit contributions
ward-0.706
reating-0.688
 about-0.631
tr-0.626
live-0.620
Token ate
Token ID15063
Feature activation+1.82e+00

Pos logit contributions
 spe+1.349
chie+1.343
mer+1.256
 se+1.027
rier+1.024
Neg logit contributions
ward-1.478
reating-1.462
 about-1.399
live-1.381
ained-1.378
Token too
Token ID1165
Feature activation+9.09e-01

Pos logit contributions
reating+3.073
ward+3.068
ouncing+2.953
tr+2.920
live+2.891
Neg logit contributions
 spe-2.104
 se-1.950
 en-1.705
mer-1.699
 shy-1.611
Token much
Token ID881
Feature activation+3.09e+00

Pos logit contributions
ward+0.975
reating+0.969
tr+0.894
ained+0.832
live+0.815
Neg logit contributions
 spe-1.593
mer-1.488
chie-1.484
 se-1.436
 en-1.362
Token candy
Token ID18550
Feature activation+7.66e-01

Pos logit contributions
ward+4.706
tr+4.572
reating+4.552
live+4.498
nce+4.460
Neg logit contributions
 spe-4.523
 se-4.360
 skinny-3.841
 en-3.699
 shy-3.657
Token and
Token ID290
Feature activation+1.39e+00

Pos logit contributions
reating+1.096
ward+1.092
ained+0.983
tr+0.979
icated+0.973
Neg logit contributions
 spe-1.061
mer-1.023
chie-0.949
 se-0.924
 en-0.849
Token her
Token ID607
Feature activation+5.34e-01

Pos logit contributions
reating+2.287
ward+2.278
ained+2.158
tr+2.151
ouncing+2.083
Neg logit contributions
 spe-1.677
 se-1.502
mer-1.423
chie-1.314
 en-1.312
Token stomach
Token ID11384
Feature activation+3.02e-01

Pos logit contributions
ward+0.762
reating+0.760
icated+0.670
ained+0.669
tr+0.667
Neg logit contributions
 spe-0.766
mer-0.715
chie-0.699
 se-0.672
 en-0.627
Token hurt
Token ID5938
Feature activation+9.47e-01

Pos logit contributions
ward+0.462
reating+0.459
ained+0.410
live+0.409
tr+0.408
Neg logit contributions
 spe-0.401
mer-0.377
chie-0.367
 se-0.343
 en-0.315
Token it
Token ID340
Feature activation+3.25e-01

Pos logit contributions
 spe+1.062
mer+1.037
chie+1.021
 se+0.927
 skinny+0.870
Neg logit contributions
ward-0.745
reating-0.741
 about-0.658
ained-0.640
tr-0.636
Token.
Token ID13
Feature activation+6.70e-01

Pos logit contributions
ward+0.393
reating+0.391
live+0.342
tr+0.341
ained+0.339
Neg logit contributions
 spe-0.527
mer-0.506
chie-0.495
 se-0.465
 en-0.442
Token She
Token ID1375
Feature activation+2.38e-01

Pos logit contributions
reating+1.055
ward+1.046
live+0.949
nce+0.933
icated+0.932
Neg logit contributions
 spe-0.842
mer-0.778
chie-0.763
 se-0.713
 en-0.690
Token had
Token ID550
Feature activation-1.75e-01

Pos logit contributions
ward+0.348
reating+0.345
tr+0.309
ained+0.308
live+0.307
Neg logit contributions
 spe-0.329
mer-0.311
chie-0.304
 se-0.286
 en-0.264
Token so
Token ID523
Feature activation+1.52e+00

Pos logit contributions
 spe+0.212
mer+0.203
chie+0.198
 se+0.176
 en+0.161
Neg logit contributions
ward-0.290
reating-0.284
 about-0.262
tr-0.260
ained-0.259
Token much
Token ID881
Feature activation+3.09e+00

Pos logit contributions
reating+1.953
ward+1.917
tr+1.650
live+1.614
icated+1.550
Neg logit contributions
 spe-2.479
mer-2.239
chie-2.177
 se-2.156
 en-2.112
Token fun
Token ID1257
Feature activation+9.00e-01

Pos logit contributions
ward+5.953
inea+5.941
tr+5.917
reating+5.875
live+5.735
Neg logit contributions
 se-3.105
 spe-2.825
 en-2.563
 shy-2.428
 skinny-2.404
Token that
Token ID326
Feature activation+2.03e-01

Pos logit contributions
reating+1.139
ward+1.129
tr+1.037
live+1.023
ained+0.999
Neg logit contributions
 spe-1.428
mer-1.341
chie-1.302
 se-1.277
 en-1.190
Token she
Token ID673
Feature activation-5.81e-01

Pos logit contributions
ward+0.371
reating+0.366
tr+0.339
live+0.337
ained+0.335
Neg logit contributions
 spe-0.208
mer-0.192
chie-0.187
 se-0.170
 en-0.153
Token didn
Token ID1422
Feature activation-3.47e-01

Pos logit contributions
 spe+0.919
chie+0.879
mer+0.861
 se+0.771
 skinny+0.735
Neg logit contributions
ward-0.738
reating-0.708
 about-0.658
live-0.653
ained-0.641
Token't
Token ID470
Feature activation-2.53e-01

Pos logit contributions
 spe+0.440
mer+0.418
chie+0.405
 se+0.366
 en+0.335
Neg logit contributions
ward-0.556
reating-0.546
 about-0.500
tr-0.498
ained-0.497
Token They
Token ID1119
Feature activation+3.87e-01

Pos logit contributions
 spe+0.070
mer+0.067
chie+0.066
 se+0.060
 en+0.056
Neg logit contributions
ward-0.071
reating-0.070
tr-0.064
ained-0.063
 about-0.063
Token are
Token ID389
Feature activation+1.03e+00

Pos logit contributions
ward+0.561
reating+0.555
live+0.495
ained+0.494
tr+0.491
Neg logit contributions
 spe-0.548
mer-0.514
chie-0.506
 se-0.474
 en-0.440
Token happy
Token ID3772
Feature activation+7.24e-01

Pos logit contributions
reating+1.892
ward+1.880
live+1.734
icated+1.716
tr+1.712
Neg logit contributions
 spe-1.097
mer-0.968
chie-0.933
 se-0.895
 en-0.875
Token.
Token ID13
Feature activation-2.35e-02

Pos logit contributions
ward+1.165
reating+1.145
live+1.078
tr+1.066
ained+1.048
Neg logit contributions
 spe-0.908
mer-0.852
chie-0.801
 se-0.792
 en-0.715
Token And
Token ID843
Feature activation+7.14e-01

Pos logit contributions
 spe+0.034
mer+0.033
chie+0.032
 se+0.030
 en+0.028
Neg logit contributions
ward-0.033
reating-0.032
tr-0.029
ained-0.029
 about-0.029
Token so
Token ID523
Feature activation+3.09e+00

Pos logit contributions
reating+1.236
ward+1.233
live+1.143
tr+1.140
ained+1.137
Neg logit contributions
 spe-0.806
mer-0.731
chie-0.697
 se-0.677
 en-0.623
Token are
Token ID389
Feature activation+5.59e-01

Pos logit contributions
reating+5.951
ward+5.240
live+5.019
ruck+4.925
affle+4.903
Neg logit contributions
 spe-3.696
mer-3.458
 shy-3.262
 se-3.185
 en-3.125
Token Mom
Token ID11254
Feature activation-1.09e+00

Pos logit contributions
reating+0.913
ward+0.908
tr+0.836
live+0.820
ained+0.818
Neg logit contributions
 spe-0.677
mer-0.627
chie-0.611
 se-0.575
 en-0.534
Token and
Token ID290
Feature activation-1.15e+00

Pos logit contributions
 spe+1.730
chie+1.697
mer+1.624
 se+1.473
 blu+1.431
Neg logit contributions
ward-1.397
reating-1.394
 about-1.352
 done-1.240
tr-1.231
Token Dad
Token ID17415
Feature activation+2.31e-01

Pos logit contributions
 spe+1.559
mer+1.549
chie+1.537
 se+1.282
 en+1.217
Neg logit contributions
ward-1.779
reating-1.717
 about-1.559
 done-1.508
icated-1.507
Token.
Token ID13
Feature activation+2.79e-01

Pos logit contributions
ward+0.343
reating+0.338
tr+0.308
live+0.305
ained+0.304
Neg logit contributions
 spe-0.312
mer-0.297
chie-0.290
 se-0.268
 en-0.249
Token.
Token ID13
Feature activation+3.26e-01

Pos logit contributions
reating+1.628
ward+1.601
tr+1.489
live+1.460
ained+1.442
Neg logit contributions
 spe-1.380
mer-1.356
chie-1.286
 se-1.208
 en-1.134
Token You
Token ID921
Feature activation+5.03e-01

Pos logit contributions
ward+0.492
reating+0.489
live+0.437
tr+0.436
ained+0.435
Neg logit contributions
 spe-0.432
mer-0.409
chie-0.403
 se-0.369
 en-0.346
Token are
Token ID389
Feature activation+9.63e-01

Pos logit contributions
ward+0.673
reating+0.669
live+0.587
tr+0.584
ained+0.583
Neg logit contributions
 spe-0.766
mer-0.724
chie-0.721
 se-0.679
 en-0.630
Token making
Token ID1642
Feature activation+1.11e+00

Pos logit contributions
reating+1.504
ward+1.491
tr+1.364
ugg+1.354
live+1.350
Neg logit contributions
 spe-1.256
chie-1.183
mer-1.149
 se-1.096
 en-1.054
Token too
Token ID1165
Feature activation+4.51e-01

Pos logit contributions
ward+1.780
reating+1.759
live+1.689
tr+1.680
ugg+1.661
Neg logit contributions
 spe-1.301
chie-1.216
 se-1.202
mer-1.180
 en-1.124
Token much
Token ID881
Feature activation+3.09e+00

Pos logit contributions
ward+0.562
reating+0.554
tr+0.495
live+0.488
ained+0.479
Neg logit contributions
 spe-0.726
mer-0.693
chie-0.685
 se-0.640
 en-0.609
Token noise
Token ID7838
Feature activation+4.04e-01

Pos logit contributions
live+6.380
ward+6.111
inea+6.062
nce+5.964
ugg+5.911
Neg logit contributions
 se-3.138
 spe-2.869
 en-2.594
 skinny-2.476
 shy-2.386
Token.
Token ID13
Feature activation+2.02e-01

Pos logit contributions
ward+0.627
reating+0.615
live+0.565
tr+0.563
ained+0.556
Neg logit contributions
 spe-0.519
mer-0.499
chie-0.482
 se-0.443
 en-0.411
Token You
Token ID921
Feature activation+3.88e-01

Pos logit contributions
ward+0.307
reating+0.302
tr+0.272
live+0.272
ained+0.271
Neg logit contributions
 spe-0.267
mer-0.252
chie-0.248
 se-0.227
 en-0.212
Token are
Token ID389
Feature activation+9.16e-01

Pos logit contributions
ward+0.489
reating+0.483
live+0.421
tr+0.420
ained+0.418
Neg logit contributions
 spe-0.621
mer-0.591
chie-0.586
 se-0.551
 en-0.515
Token in
Token ID287
Feature activation-2.43e-01

Pos logit contributions
reating+1.462
ward+1.453
tr+1.327
live+1.322
ugg+1.311
Neg logit contributions
 spe-1.164
chie-1.084
mer-1.059
 se-1.007
 en-0.965
Tokenaren
Token ID5757
Feature activation+7.44e-01

Pos logit contributions
ward+0.374
reating+0.368
tr+0.326
ained+0.324
 about+0.322
Neg logit contributions
 spe-0.458
mer-0.439
chie-0.433
 se-0.400
 en-0.373
Token and
Token ID290
Feature activation+1.64e+00

Pos logit contributions
reating+1.075
ward+1.053
ained+0.953
tr+0.932
live+0.920
Neg logit contributions
 spe-1.060
mer-1.011
chie-0.983
 se-0.937
 en-0.874
Token Ann
Token ID5506
Feature activation+1.58e+00

Pos logit contributions
ward+2.582
reating+2.510
ained+2.430
tr+2.305
live+2.288
Neg logit contributions
 spe-2.177
chie-1.956
mer-1.914
 se-1.833
 en-1.741
Token had
Token ID550
Feature activation-2.31e-01

Pos logit contributions
reating+2.362
ward+2.271
ained+2.159
live+2.107
tr+2.051
Neg logit contributions
 spe-2.301
mer-2.134
chie-2.125
 se-2.022
 en-1.837
Token so
Token ID523
Feature activation+2.39e+00

Pos logit contributions
 spe+0.331
mer+0.321
chie+0.313
 se+0.284
 en+0.265
Neg logit contributions
ward-0.329
reating-0.321
 about-0.292
tr-0.290
ained-0.286
Token much
Token ID881
Feature activation+3.08e+00

Pos logit contributions
reating+3.509
ward+3.250
live+3.098
tr+3.069
ained+2.863
Neg logit contributions
 spe-3.774
 en-3.215
 se-3.180
 shy-3.128
mer-3.096
Token fun
Token ID1257
Feature activation+1.24e+00

Pos logit contributions
reating+5.697
tr+5.628
inea+5.613
ward+5.548
live+5.508
Neg logit contributions
 spe-3.323
 se-3.307
 en-2.650
 shy-2.596
 nervous-2.513
Token playing
Token ID2712
Feature activation-1.77e-02

Pos logit contributions
reating+1.601
ward+1.555
tr+1.459
live+1.442
inea+1.365
Neg logit contributions
 spe-1.925
mer-1.812
chie-1.761
 se-1.752
 en-1.616
Token together
Token ID1978
Feature activation+6.47e-01

Pos logit contributions
 spe+0.031
mer+0.030
chie+0.030
 se+0.028
 en+0.026
Neg logit contributions
ward-0.019
reating-0.019
tr-0.016
 about-0.016
ained-0.016
Token!
Token ID0
Feature activation+1.45e+00

Pos logit contributions
ward+0.968
reating+0.966
tr+0.877
live+0.869
ained+0.865
Neg logit contributions
 spe-0.868
mer-0.832
chie-0.791
 se-0.742
 en-0.691
Token Ann
Token ID5506
Feature activation+2.89e-01

Pos logit contributions
reating+2.321
ward+2.241
icated+2.041
live+2.041
 done+1.986
Neg logit contributions
 spe-1.851
mer-1.649
chie-1.629
 en-1.464
 Int-1.436
Token love
Token ID1842
Feature activation+9.69e-02

Pos logit contributions
reating+1.265
ward+1.254
ained+1.097
tr+1.096
live+1.089
Neg logit contributions
 spe-1.113
mer-1.063
chie-1.012
 se-0.975
 en-0.883
Token this
Token ID428
Feature activation+1.76e-01

Pos logit contributions
ward+0.154
reating+0.152
tr+0.139
live+0.138
ained+0.138
Neg logit contributions
 spe-0.122
mer-0.115
chie-0.113
 se-0.103
 en-0.094
Token movie
Token ID3807
Feature activation+1.63e-01

Pos logit contributions
ward+0.242
reating+0.238
tr+0.213
ained+0.213
live+0.213
Neg logit contributions
 spe-0.259
mer-0.247
chie-0.243
 se-0.225
 en-0.210
Token!"
Token ID2474
Feature activation+6.24e-01

Pos logit contributions
ward+0.211
reating+0.207
tr+0.185
ained+0.184
live+0.183
Neg logit contributions
 spe-0.252
mer-0.241
chie-0.237
 se-0.220
 en-0.206
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
reating+0.979
ward+0.970
live+0.888
tr+0.875
ained+0.872
Neg logit contributions
 spe-0.776
mer-0.727
chie-0.708
 se-0.684
 en-0.637
Token sn
Token ID3013
Feature activation+3.04e+00

Pos logit contributions
reating+1.694
ward+1.656
ained+1.550
live+1.537
tr+1.537
Neg logit contributions
 spe-1.185
mer-1.077
 se-1.041
chie-1.027
 en-0.965
Tokenugg
Token ID6837
Feature activation+2.26e-01

Pos logit contributions
reating+2.788
icated+2.550
ruck+1.993
tr+1.976
ward+1.964
Neg logit contributions
mer-6.509
 spe-6.213
chie-5.925
 se-5.866
 boat-5.663
Tokenled
Token ID992
Feature activation+9.31e-01

Pos logit contributions
ward+0.294
reating+0.290
tr+0.259
live+0.254
ained+0.254
Neg logit contributions
 spe-0.351
mer-0.339
chie-0.330
 se-0.309
 en-0.289
Token up
Token ID510
Feature activation+7.80e-01

Pos logit contributions
reating+1.319
ward+1.319
live+1.201
ained+1.188
tr+1.178
Neg logit contributions
 spe-1.351
mer-1.277
chie-1.213
 se-1.173
 en-1.103
Token with
Token ID351
Feature activation-1.35e-01

Pos logit contributions
reating+1.097
ward+1.086
live+0.990
icated+0.978
ained+0.975
Neg logit contributions
 spe-1.144
mer-1.077
chie-1.037
 se-1.000
 en-0.925
Token her
Token ID607
Feature activation-6.01e-01

Pos logit contributions
 spe+0.188
mer+0.181
chie+0.178
 se+0.161
 en+0.150
Neg logit contributions
ward-0.196
reating-0.193
 about-0.173
tr-0.172
ained-0.172
Tokenmy
Token ID1820
Feature activation-3.84e-01

Pos logit contributions
ward+1.982
reating+1.721
tr+1.542
 Enjoy+1.540
live+1.517
Neg logit contributions
mer-3.630
 spe-3.623
chie-3.556
 se-3.492
 skinny-3.290
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
 spe+0.507
mer+0.480
chie+0.476
 se+0.430
 en+0.396
Neg logit contributions
ward-0.582
reating-0.575
tr-0.526
 about-0.526
ained-0.517
Token You
Token ID921
Feature activation+1.60e-02

Pos logit contributions
 spe+0.181
mer+0.172
chie+0.168
 se+0.152
 en+0.137
Neg logit contributions
ward-0.263
reating-0.258
tr-0.240
 about-0.237
ained-0.235
Token ate
Token ID15063
Feature activation+1.43e+00

Pos logit contributions
ward+0.022
reating+0.021
tr+0.019
live+0.019
ained+0.019
Neg logit contributions
 spe-0.024
mer-0.023
chie-0.022
 se-0.021
 en-0.019
Token too
Token ID1165
Feature activation+9.10e-01

Pos logit contributions
ward+2.450
reating+2.407
live+2.306
tr+2.288
ouncing+2.267
Neg logit contributions
 spe-1.639
mer-1.449
chie-1.396
 se-1.340
 en-1.255
Token much
Token ID881
Feature activation+3.04e+00

Pos logit contributions
ward+1.027
reating+1.016
tr+0.930
live+0.871
ained+0.866
Neg logit contributions
 spe-1.562
mer-1.457
chie-1.453
 se-1.373
 skinny-1.326
Token candy
Token ID18550
Feature activation+6.46e-01

Pos logit contributions
live+5.516
ward+5.455
tr+5.316
reating+5.247
nce+5.231
Neg logit contributions
 spe-3.645
 se-3.180
 skinny-2.753
 shy-2.596
mer-2.578
Token,
Token ID11
Feature activation-3.66e-01

Pos logit contributions
ward+0.952
reating+0.940
ained+0.851
tr+0.848
live+0.847
Neg logit contributions
 spe-0.874
mer-0.842
chie-0.798
 se-0.754
 en-0.691
Token didn
Token ID1422
Feature activation-3.98e-01

Pos logit contributions
 spe+0.426
mer+0.410
chie+0.402
 se+0.350
 en+0.315
Neg logit contributions
ward-0.611
reating-0.607
 about-0.563
tr-0.556
ained-0.547
Token't
Token ID470
Feature activation+1.22e+00

Pos logit contributions
 spe+0.505
mer+0.489
chie+0.468
 se+0.423
 en+0.390
Neg logit contributions
ward-0.635
reating-0.623
tr-0.571
 about-0.567
live-0.563
Token you
Token ID345
Feature activation+1.17e+00

Pos logit contributions
ward+1.956
reating+1.846
ained+1.797
live+1.791
 Afterwards+1.728
Neg logit contributions
 spe-1.623
chie-1.460
 se-1.417
mer-1.415
 en-1.301
Token her
Token ID607
Feature activation-1.96e-01

Pos logit contributions
reating+1.664
ward+1.658
tr+1.602
ained+1.553
ouncing+1.540
Neg logit contributions
 spe-1.179
mer-1.032
chie-1.019
 se-0.958
 en-0.898
Token new
Token ID649
Feature activation-2.38e-01

Pos logit contributions
 spe+0.256
mer+0.245
chie+0.243
 se+0.219
 en+0.202
Neg logit contributions
ward-0.303
reating-0.295
tr-0.273
 about-0.271
live-0.268
Token home
Token ID1363
Feature activation+1.39e-01

Pos logit contributions
 spe+0.336
mer+0.321
chie+0.320
 se+0.290
 en+0.271
Neg logit contributions
ward-0.339
reating-0.332
tr-0.302
 about-0.300
ained-0.297
Token.
Token ID13
Feature activation+1.91e+00

Pos logit contributions
ward+0.197
reating+0.195
tr+0.175
live+0.174
ained+0.174
Neg logit contributions
 spe-0.197
mer-0.187
chie-0.184
 se-0.170
 en-0.158
Token Sara
Token ID24799
Feature activation+1.24e-01

Pos logit contributions
reating+3.507
ward+3.308
icated+3.067
ruck+3.050
ained+3.001
Neg logit contributions
 spe-2.074
mer-1.807
 en-1.723
 Int-1.635
 se-1.625
Token and
Token ID290
Feature activation+3.03e+00

Pos logit contributions
ward+0.172
reating+0.170
tr+0.152
ained+0.151
live+0.151
Neg logit contributions
 spe-0.179
mer-0.170
chie-0.167
 se-0.156
 en-0.145
Token Tom
Token ID4186
Feature activation+1.44e+00

Pos logit contributions
reating+5.074
ward+4.999
ruck+4.791
inea+4.660
ained+4.613
Neg logit contributions
 spe-3.853
 en-3.425
 se-3.380
 shy-3.340
 nervous-3.177
Token smiled
Token ID13541
Feature activation+4.34e-01

Pos logit contributions
reating+2.160
ward+2.018
ained+1.919
tr+1.855
live+1.829
Neg logit contributions
 spe-2.116
mer-1.910
 se-1.884
chie-1.838
 boat-1.662
Token as
Token ID355
Feature activation+1.21e-02

Pos logit contributions
reating+0.647
ward+0.645
live+0.587
tr+0.580
ained+0.576
Neg logit contributions
 spe-0.587
mer-0.555
chie-0.539
 se-0.506
 en-0.470
Token they
Token ID484
Feature activation+1.60e+00

Pos logit contributions
ward+0.021
reating+0.020
tr+0.019
live+0.019
 about+0.019
Neg logit contributions
 spe-0.014
mer-0.013
chie-0.013
 se-0.012
 en-0.010
Token watched
Token ID7342
Feature activation+8.38e-01

Pos logit contributions
reating+2.488
ward+2.357
ained+2.172
 about+2.144
ouncing+2.131
Neg logit contributions
 spe-2.367
mer-2.118
 se-2.069
chie-2.011
 boat-1.745
Token
Token ID198
Feature activation+1.41e+00

Pos logit contributions
ward+0.735
reating+0.724
tr+0.657
ained+0.643
live+0.643
Neg logit contributions
 spe-0.687
mer-0.654
chie-0.637
 se-0.595
 en-0.557
TokenOne
Token ID3198
Feature activation+1.22e+00

Pos logit contributions
ward+2.235
reating+2.172
icated+2.000
tr+1.985
 about+1.917
Neg logit contributions
 spe-1.812
mer-1.789
chie-1.680
 se-1.544
 en-1.518
Token day
Token ID1110
Feature activation+2.12e+00

Pos logit contributions
ward+1.329
reating+1.285
live+1.121
ained+1.082
 Afterwards+1.078
Neg logit contributions
 spe-2.185
mer-2.136
 se-2.043
chie-2.031
 en-1.918
Token,
Token ID11
Feature activation+1.65e+00

Pos logit contributions
reating+3.373
ward+3.209
tr+3.197
icated+3.089
ained+3.061
Neg logit contributions
 spe-2.865
mer-2.627
 se-2.467
chie-2.316
 en-2.283
Token Sara
Token ID24799
Feature activation-6.18e-01

Pos logit contributions
reating+2.761
ward+2.758
tr+2.588
icated+2.574
ained+2.534
Neg logit contributions
 spe-1.952
mer-1.757
 se-1.611
 en-1.567
chie-1.548
Token and
Token ID290
Feature activation+3.01e+00

Pos logit contributions
 spe+0.902
chie+0.863
mer+0.847
 se+0.737
 skinny+0.705
Neg logit contributions
ward-0.849
reating-0.845
 about-0.783
tr-0.779
ained-0.767
Token Tom
Token ID4186
Feature activation+5.66e-01

Pos logit contributions
ward+5.400
reating+4.974
icated+4.917
tr+4.842
ained+4.784
Neg logit contributions
 spe-3.303
 se-3.113
 shy-3.076
 en-2.945
 low-2.891
Token want
Token ID765
Feature activation-4.28e-02

Pos logit contributions
ward+0.925
reating+0.900
live+0.833
ained+0.819
tr+0.818
Neg logit contributions
 spe-0.694
mer-0.653
chie-0.642
 se-0.609
 en-0.556
Token to
Token ID284
Feature activation-1.07e+00

Pos logit contributions
 spe+0.045
mer+0.043
chie+0.042
 se+0.037
 en+0.033
Neg logit contributions
ward-0.077
reating-0.076
tr-0.070
 about-0.070
ained-0.069
Token do
Token ID466
Feature activation-1.77e-01

Pos logit contributions
mer+1.478
 spe+1.468
chie+1.378
 se+1.146
 en+1.131
Neg logit contributions
ward-1.580
reating-1.571
 about-1.423
tr-1.419
 done-1.386
Token art
Token ID1242
Feature activation-6.77e-01

Pos logit contributions
 spe+0.276
mer+0.265
chie+0.259
 se+0.237
 en+0.222
Neg logit contributions
ward-0.232
reating-0.227
 about-0.203
tr-0.200
ained-0.199
Token to
Token ID284
Feature activation+7.88e-01

Pos logit contributions
ward+0.767
reating+0.767
live+0.706
tr+0.703
icated+0.701
Neg logit contributions
 spe-0.413
chie-0.371
mer-0.371
 se-0.337
 en-0.303
Token cry
Token ID3960
Feature activation+1.18e+00

Pos logit contributions
reating+1.304
ward+1.300
live+1.179
ained+1.173
tr+1.160
Neg logit contributions
 spe-0.953
chie-0.865
mer-0.854
 se-0.830
 en-0.760
Token.
Token ID13
Feature activation+8.41e-02

Pos logit contributions
reating+1.996
ward+1.989
live+1.921
ained+1.825
nce+1.812
Neg logit contributions
 spe-1.325
mer-1.249
chie-1.228
 se-1.144
 en-1.078
Token She
Token ID1375
Feature activation+7.24e-01

Pos logit contributions
ward+0.145
reating+0.143
tr+0.131
live+0.130
ained+0.130
Neg logit contributions
 spe-0.094
mer-0.088
chie-0.087
 se-0.078
 en-0.071
Token regretted
Token ID44652
Feature activation+1.08e+00

Pos logit contributions
reating+1.211
ward+1.193
ained+1.075
live+1.052
icated+1.049
Neg logit contributions
 spe-0.887
mer-0.809
chie-0.801
 se-0.779
 en-0.706
Token not
Token ID407
Feature activation+3.00e+00

Pos logit contributions
reating+1.524
ward+1.477
tr+1.397
live+1.376
ained+1.369
Neg logit contributions
 spe-1.548
chie-1.401
mer-1.367
 se-1.356
 en-1.301
Token being
Token ID852
Feature activation-2.01e-01

Pos logit contributions
ward+5.071
ops+5.051
live+5.023
mm+5.008
 Afterwards+4.889
Neg logit contributions
 spe-3.467
chie-3.466
 se-3.401
 en-3.359
 shy-3.253
Token more
Token ID517
Feature activation+9.07e-01

Pos logit contributions
 spe+0.208
mer+0.193
chie+0.186
 se+0.166
 en+0.145
Neg logit contributions
ward-0.366
reating-0.360
 about-0.336
ained-0.334
tr-0.333
Token careful
Token ID8161
Feature activation+4.78e-01

Pos logit contributions
ward+1.676
reating+1.655
tr+1.527
icated+1.504
live+1.491
Neg logit contributions
 spe-0.937
chie-0.864
mer-0.855
 se-0.783
 en-0.748
Token.
Token ID13
Feature activation+5.58e-01

Pos logit contributions
ward+0.674
reating+0.657
tr+0.597
live+0.595
ained+0.586
Neg logit contributions
 spe-0.677
mer-0.648
chie-0.635
 se-0.587
 en-0.555
Token Her
Token ID2332
Feature activation+4.16e-01

Pos logit contributions
reating+0.981
ward+0.973
live+0.885
ained+0.878
tr+0.875
Neg logit contributions
 spe-0.599
mer-0.556
chie-0.548
 se-0.498
 en-0.477
Token
Token ID220
Feature activation+3.83e-01

Pos logit contributions
reating+0.942
ward+0.929
live+0.836
icated+0.827
tr+0.823
Neg logit contributions
 spe-0.758
mer-0.711
chie-0.695
 se-0.657
 en-0.623
Token
Token ID198
Feature activation+4.22e-01

Pos logit contributions
ward+0.544
reating+0.538
tr+0.482
live+0.477
ained+0.472
Neg logit contributions
 spe-0.529
mer-0.507
chie-0.501
 se-0.462
 en-0.436
Token
Token ID198
Feature activation+1.35e+00

Pos logit contributions
ward+0.603
reating+0.597
tr+0.536
live+0.529
ained+0.529
Neg logit contributions
 spe-0.587
mer-0.564
chie-0.556
 se-0.516
 en-0.481
TokenAfter
Token ID3260
Feature activation+2.11e+00

Pos logit contributions
ward+1.754
reating+1.746
 about+1.520
tr+1.495
icated+1.493
Neg logit contributions
 spe-2.010
chie-1.988
mer-1.983
 se-1.862
 skinny-1.810
Token he
Token ID339
Feature activation+2.32e+00

Pos logit contributions
reating+3.391
ained+3.206
tr+3.182
ouncing+3.130
live+3.096
Neg logit contributions
 spe-2.896
 se-2.711
chie-2.283
 en-2.262
 skinny-2.246
Token was
Token ID373
Feature activation+3.00e+00

Pos logit contributions
reating+3.356
ward+3.099
ained+3.063
ouncing+2.994
asc+2.779
Neg logit contributions
 spe-3.783
 se-3.470
mer-3.311
 en-3.029
 fill-3.007
Token done
Token ID1760
Feature activation+1.50e+00

Pos logit contributions
reating+5.086
tr+4.614
ouncing+4.552
icated+4.467
ward+4.455
Neg logit contributions
 spe-4.042
 shy-3.681
 low-3.680
 en-3.669
 nervous-3.632
Token chewing
Token ID36615
Feature activation+8.89e-01

Pos logit contributions
reating+1.802
ward+1.722
ained+1.711
tr+1.693
live+1.634
Neg logit contributions
 spe-2.528
 se-2.285
mer-2.229
 en-2.142
chie-2.137
Token,
Token ID11
Feature activation+8.18e-01

Pos logit contributions
reating+1.499
ward+1.491
tr+1.412
ained+1.376
live+1.372
Neg logit contributions
 spe-1.037
mer-0.974
chie-0.904
 se-0.858
 en-0.822
Token Tim
Token ID5045
Feature activation+6.94e-01

Pos logit contributions
reating+1.169
ward+1.144
live+1.058
tr+1.044
ained+1.032
Neg logit contributions
 spe-1.161
mer-1.062
chie-1.047
 se-1.041
 en-0.975
Tokenmy
Token ID1820
Feature activation+9.11e-02

Pos logit contributions
ward+0.548
reating+0.511
tr+0.407
ained+0.406
live+0.405
Neg logit contributions
 spe-1.414
mer-1.384
chie-1.368
 se-1.331
 en-1.257
Token,
Token ID11
Feature activation-3.24e-03

Pos logit contributions
reating+0.683
ward+0.681
live+0.610
ained+0.610
tr+0.609
Neg logit contributions
 spe-0.902
chie-0.843
mer-0.842
 se-0.799
 en-0.756
Token "
Token ID366
Feature activation-6.17e-01

Pos logit contributions
 spe+0.004
mer+0.003
chie+0.003
 se+0.003
 en+0.003
Neg logit contributions
ward-0.006
reating-0.006
tr-0.005
 about-0.005
ained-0.005
TokenI
Token ID40
Feature activation+1.18e+00

Pos logit contributions
 spe+1.067
mer+1.026
chie+1.025
 se+0.936
 en+0.895
Neg logit contributions
ward-0.691
reating-0.687
live-0.609
ained-0.607
tr-0.606
Token gained
Token ID8618
Feature activation+1.06e+00

Pos logit contributions
reating+1.877
ward+1.855
ained+1.653
icated+1.617
tr+1.613
Neg logit contributions
 spe-1.584
mer-1.467
chie-1.410
 se-1.404
 dunno-1.243
Token so
Token ID523
Feature activation+1.31e+00

Pos logit contributions
ward+1.684
reating+1.680
live+1.594
tr+1.570
ained+1.555
Neg logit contributions
 spe-1.318
chie-1.219
mer-1.146
 se-1.117
 en-1.064
Token much
Token ID881
Feature activation+3.00e+00

Pos logit contributions
reating+1.553
ward+1.523
tr+1.409
live+1.313
ained+1.296
Neg logit contributions
 spe-2.225
chie-2.075
mer-2.023
 se-1.976
 en-1.957
Token confidence
Token ID6628
Feature activation+1.76e-01

Pos logit contributions
live+5.476
nce+5.421
ward+5.415
tr+5.343
reating+5.314
Neg logit contributions
 spe-3.366
 se-3.162
 en-2.909
 skinny-2.866
 shy-2.731
Token!"
Token ID2474
Feature activation+1.76e+00

Pos logit contributions
ward+0.238
reating+0.235
tr+0.211
live+0.209
ained+0.209
Neg logit contributions
 spe-0.262
mer-0.249
chie-0.246
 se-0.227
 en-0.213
Token But
Token ID887
Feature activation+1.95e+00

Pos logit contributions
reating+3.015
ward+2.845
inea+2.674
live+2.654
nce+2.607
Neg logit contributions
 spe-1.970
 se-1.864
mer-1.804
 en-1.778
chie-1.718
Token then
Token ID788
Feature activation+1.64e+00

Pos logit contributions
reating+3.123
ward+2.989
tr+2.873
live+2.870
icated+2.847
Neg logit contributions
 spe-2.472
 en-2.171
mer-2.170
chie-2.164
 se-2.164
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
reating+2.678
ward+2.570
ained+2.465
tr+2.464
live+2.418
Neg logit contributions
 spe-2.085
 se-1.879
chie-1.787
mer-1.771
 en-1.726
Token that
Token ID326
Feature activation-8.32e-01

Pos logit contributions
 spe+0.546
chie+0.519
mer+0.516
 se+0.465
 skinny+0.442
Neg logit contributions
ward-0.468
reating-0.454
 about-0.415
tr-0.408
ained-0.408
Token was
Token ID373
Feature activation-5.24e-01

Pos logit contributions
chie+1.273
 spe+1.269
mer+1.230
 se+1.080
 en+1.030
Neg logit contributions
ward-1.073
reating-1.004
tr-0.966
 about-0.962
ained-0.953
Token broken
Token ID5445
Feature activation-3.56e-01

Pos logit contributions
 spe+0.660
mer+0.643
chie+0.630
 se+0.548
 dunno+0.506
Neg logit contributions
ward-0.828
reating-0.799
 about-0.754
ained-0.747
tr-0.734
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
 spe+0.515
chie+0.490
mer+0.489
 se+0.444
 en+0.416
Neg logit contributions
ward-0.497
reating-0.488
 about-0.446
tr-0.440
ained-0.437
Token The
Token ID383
Feature activation-2.57e+00

Pos logit contributions
 spe+2.366
chie+2.360
mer+2.312
 se+2.015
 blu+1.834
Neg logit contributions
ward-2.421
 about-2.282
 Afterwards-2.198
 After-2.159
tr-2.156
Token little
Token ID1310
Feature activation-3.69e+00

Pos logit contributions
 spe+3.172
chie+2.980
mer+2.972
 dunno+2.425
 Int+2.416
Neg logit contributions
ward-4.241
reating-4.020
 stories-3.892
 done-3.853
 about-3.759
Token boat
Token ID8848
Feature activation-1.05e+00

Pos logit contributions
chie+3.791
 spe+3.728
mer+3.366
 Int+3.351
ihu+3.176
Neg logit contributions
ward-6.344
reating-6.221
 end-5.991
ruck-5.976
inea-5.855
Token felt
Token ID2936
Feature activation-6.70e-01

Pos logit contributions
 spe+1.628
chie+1.550
mer+1.502
 se+1.334
 blu+1.323
Neg logit contributions
ward-1.355
 about-1.249
reating-1.226
tr-1.212
 done-1.193
Token sad
Token ID6507
Feature activation+3.48e-01

Pos logit contributions
 spe+0.756
mer+0.731
chie+0.707
 se+0.602
 dunno+0.545
Neg logit contributions
ward-1.155
reating-1.103
 about-1.077
ained-1.048
tr-1.019
Token for
Token ID329
Feature activation-8.45e-01

Pos logit contributions
ward+0.515
reating+0.509
live+0.466
tr+0.462
ained+0.455
Neg logit contributions
 spe-0.473
mer-0.450
chie-0.438
 se-0.411
 en-0.382
Token the
Token ID262
Feature activation-2.15e+00

Pos logit contributions
mer+1.171
 spe+1.152
chie+1.114
 se+0.979
 dunno+0.911
Neg logit contributions
ward-1.268
reating-1.216
 about-1.135
icated-1.094
 what-1.079
Token very
Token ID845
Feature activation-1.60e-01

Pos logit contributions
 spe+0.162
mer+0.154
chie+0.149
 se+0.134
 en+0.120
Neg logit contributions
ward-0.242
reating-0.237
 about-0.220
tr-0.218
ained-0.218
Token sad
Token ID6507
Feature activation+8.01e-01

Pos logit contributions
 spe+0.150
mer+0.140
chie+0.136
 se+0.118
 en+0.103
Neg logit contributions
ward-0.304
reating-0.298
 about-0.277
tr-0.277
ained-0.277
Token and
Token ID290
Feature activation-4.70e-01

Pos logit contributions
reating+1.280
ward+1.280
live+1.194
tr+1.178
ained+1.147
Neg logit contributions
 spe-1.000
mer-0.942
chie-0.899
 se-0.862
 en-0.792
Token missed
Token ID6825
Feature activation+2.01e-02

Pos logit contributions
 spe+0.624
mer+0.595
chie+0.585
 se+0.526
 blu+0.488
Neg logit contributions
ward-0.714
reating-0.688
 about-0.645
ained-0.635
tr-0.625
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
ward+0.030
reating+0.029
tr+0.027
 about+0.027
ained+0.026
Neg logit contributions
 spe-0.027
mer-0.026
chie-0.026
 se-0.023
 en-0.022
Token little
Token ID1310
Feature activation-3.64e+00

Pos logit contributions
mer+1.577
 spe+1.500
chie+1.495
rily+1.102
 Int+1.059
Neg logit contributions
ward-3.089
reating-2.998
tr-2.915
 about-2.897
 done-2.841
Token bird
Token ID6512
Feature activation-3.03e-01

Pos logit contributions
chie+4.646
mer+4.300
rily+3.952
 Int+3.924
venth+3.687
Neg logit contributions
ward-5.437
reating-5.350
 about-5.307
tr-5.182
inea-5.079
Tokenie
Token ID494
Feature activation-1.10e-01

Pos logit contributions
 spe+0.487
mer+0.463
chie+0.457
 se+0.427
 en+0.398
Neg logit contributions
ward-0.377
reating-0.371
 about-0.333
tr-0.329
ained-0.326
Token.
Token ID13
Feature activation-2.65e-01

Pos logit contributions
 spe+0.158
mer+0.150
chie+0.148
 se+0.136
 en+0.126
Neg logit contributions
ward-0.154
reating-0.151
tr-0.137
 about-0.136
ained-0.135
Token<|endoftext|>
Token ID50256
Feature activation+2.22e-01

Pos logit contributions
 spe+0.383
mer+0.363
chie+0.359
 se+0.331
 skinny+0.302
Neg logit contributions
ward-0.376
reating-0.363
 about-0.335
tr-0.334
ained-0.331
TokenOnce
Token ID7454
Feature activation-4.72e-01

Pos logit contributions
ward+0.310
reating+0.307
tr+0.275
live+0.272
ained+0.271
Neg logit contributions
 spe-0.317
mer-0.302
chie-0.295
 se-0.276
 en-0.256
Token boat
Token ID8848
Feature activation-9.82e-01

Pos logit contributions
 spe+1.587
chie+1.532
mer+1.424
 Int+1.097
 dunno+1.058
Neg logit contributions
ward-4.174
reating-4.163
 about-3.900
ained-3.855
ruck-3.803
Token.
Token ID13
Feature activation-9.11e-01

Pos logit contributions
 spe+1.579
chie+1.512
mer+1.479
 se+1.348
 skinny+1.278
Neg logit contributions
ward-1.256
reating-1.199
 about-1.164
tr-1.096
ained-1.066
Token So
Token ID1406
Feature activation+6.72e-01

Pos logit contributions
 spe+1.500
chie+1.441
mer+1.428
 se+1.307
 blu+1.199
Neg logit contributions
ward-1.138
 about-1.022
reating-1.021
 Afterwards-0.990
tr-0.988
Token,
Token ID11
Feature activation-7.55e-01

Pos logit contributions
reating+0.902
ward+0.853
tr+0.775
live+0.765
ained+0.764
Neg logit contributions
 spe-1.085
mer-1.013
chie-0.983
 se-0.944
 en-0.889
Token the
Token ID262
Feature activation-2.72e+00

Pos logit contributions
 spe+1.007
chie+1.002
mer+0.993
 se+0.851
 dunno+0.793
Neg logit contributions
ward-1.154
reating-1.069
 about-1.018
ained-0.988
icated-0.977
Token little
Token ID1310
Feature activation-3.52e+00

Pos logit contributions
 spe+3.087
chie+3.020
mer+2.998
 dunno+2.311
 Int+2.305
Neg logit contributions
ward-4.784
reating-4.390
 stories-4.382
 done-4.335
nce-4.294
Token boat
Token ID8848
Feature activation-1.22e+00

Pos logit contributions
chie+3.664
 spe+3.436
mer+3.128
 Int+3.015
 dunno+2.810
Neg logit contributions
ward-6.249
reating-5.950
 end-5.841
ruck-5.782
ained-5.763
Token offered
Token ID4438
Feature activation-1.82e-01

Pos logit contributions
 spe+1.859
chie+1.774
mer+1.693
 skinny+1.488
 blu+1.485
Neg logit contributions
ward-1.632
 about-1.506
reating-1.493
tr-1.465
 done-1.448
Token to
Token ID284
Feature activation-1.27e+00

Pos logit contributions
 spe+0.203
mer+0.192
chie+0.188
 se+0.165
 en+0.150
Neg logit contributions
ward-0.319
reating-0.315
 about-0.292
tr-0.288
ained-0.287
Token help
Token ID1037
Feature activation-1.16e+00

Pos logit contributions
mer+1.722
 spe+1.701
chie+1.572
 skinny+1.332
 se+1.324
Neg logit contributions
ward-1.973
reating-1.910
 about-1.811
 done-1.718
tr-1.697
Token the
Token ID262
Feature activation-2.80e+00

Pos logit contributions
 spe+1.769
mer+1.699
chie+1.693
 se+1.467
 blu+1.432
Neg logit contributions
ward-1.562
reating-1.557
 about-1.475
icated-1.364
tr-1.333
Token.
Token ID13
Feature activation-1.57e+00

Pos logit contributions
 spe+1.150
mer+1.103
chie+1.098
 se+0.997
 en+0.930
Neg logit contributions
ward-1.162
reating-1.142
 about-1.067
ained-1.033
icated-1.004
Token
Token ID198
Feature activation+3.18e-01

Pos logit contributions
chie+2.647
 spe+2.645
mer+2.573
 se+2.281
sie+2.169
Neg logit contributions
ward-1.938
 about-1.823
ained-1.728
tr-1.714
 Afterwards-1.686
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
ward+0.451
reating+0.449
tr+0.403
live+0.398
ained+0.397
Neg logit contributions
 spe-0.450
mer-0.427
chie-0.420
 se-0.393
 en-0.365
Token"
Token ID1
Feature activation-1.42e+00

Pos logit contributions
ward+0.181
reating+0.179
tr+0.161
ained+0.159
 about+0.159
Neg logit contributions
 spe-0.181
mer-0.173
chie-0.170
 se-0.157
 en-0.146
TokenHow
Token ID2437
Feature activation+3.83e-01

Pos logit contributions
 spe+2.364
chie+2.320
mer+2.269
 se+2.000
 en+1.999
Neg logit contributions
ward-1.674
reating-1.588
tr-1.523
live-1.459
ained-1.456
Token can
Token ID460
Feature activation-3.49e+00

Pos logit contributions
ward+0.500
reating+0.495
live+0.438
tr+0.435
ained+0.432
Neg logit contributions
 spe-0.590
mer-0.557
chie-0.550
 se-0.521
 en-0.486
Token we
Token ID356
Feature activation-9.68e-01

Pos logit contributions
mer+4.836
 spe+4.375
 redd+4.281
chie+4.220
 blu+3.917
Neg logit contributions
ward-5.884
 what-5.447
 about-5.304
reating-5.112
icated-4.990
Token get
Token ID651
Feature activation+3.95e-01

Pos logit contributions
 spe+1.330
mer+1.292
chie+1.235
 se+1.099
 skinny+1.076
Neg logit contributions
ward-1.392
reating-1.343
tr-1.260
 about-1.256
 done-1.212
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
ward+0.624
reating+0.618
live+0.572
tr+0.572
ained+0.559
Neg logit contributions
 spe-0.501
mer-0.468
chie-0.464
 se-0.432
 en-0.395
Token jet
Token ID12644
Feature activation-6.54e-01

Pos logit contributions
 spe+1.054
mer+1.043
chie+0.968
 se+0.742
 skinny+0.711
Neg logit contributions
ward-2.497
reating-2.448
tr-2.330
ained-2.301
 about-2.245
Token back
Token ID736
Feature activation+1.93e-01

Pos logit contributions
 spe+1.035
mer+0.983
chie+0.966
 se+0.887
 en+0.849
Neg logit contributions
ward-0.824
reating-0.819
tr-0.743
 about-0.735
ained-0.717
Token sad
Token ID6507
Feature activation+1.23e+00

Pos logit contributions
reating+1.419
ward+1.399
live+1.311
icated+1.294
tr+1.293
Neg logit contributions
 spe-0.651
chie-0.566
mer-0.560
 se-0.495
 en-0.477
Token and
Token ID290
Feature activation+2.04e-01

Pos logit contributions
ward+2.074
reating+2.056
live+1.998
tr+1.914
ained+1.892
Neg logit contributions
 spe-1.436
mer-1.326
 se-1.245
chie-1.231
 en-1.108
Token scared
Token ID12008
Feature activation+3.24e-02

Pos logit contributions
ward+0.397
reating+0.394
tr+0.365
live+0.365
ained+0.362
Neg logit contributions
 spe-0.186
mer-0.169
chie-0.166
 se-0.147
 en-0.131
Token.
Token ID13
Feature activation-1.24e+00

Pos logit contributions
ward+0.048
reating+0.047
tr+0.043
ained+0.042
 about+0.042
Neg logit contributions
 spe-0.045
mer-0.042
chie-0.042
 se-0.038
 en-0.036
Token How
Token ID1374
Feature activation-1.06e+00

Pos logit contributions
 spe+2.146
chie+2.101
mer+2.037
 se+1.911
 blu+1.763
Neg logit contributions
ward-1.427
 about-1.269
reating-1.252
tr-1.249
ained-1.233
Token can
Token ID460
Feature activation-3.45e+00

Pos logit contributions
 spe+1.843
mer+1.829
chie+1.817
 se+1.605
 skinny+1.527
Neg logit contributions
ward-1.194
 about-1.185
reating-1.153
tr-1.078
icated-1.030
Token they
Token ID484
Feature activation-1.57e-01

Pos logit contributions
mer+5.194
 spe+4.996
 redd+4.657
chie+4.635
 blu+4.247
Neg logit contributions
ward-5.456
 about-4.995
 what-4.816
reating-4.657
nce-4.592
Token get
Token ID651
Feature activation+2.66e-01

Pos logit contributions
 spe+0.211
mer+0.200
chie+0.196
 se+0.179
 en+0.166
Neg logit contributions
ward-0.236
reating-0.232
tr-0.212
 about-0.210
live-0.209
Token their
Token ID511
Feature activation-7.86e-01

Pos logit contributions
ward+0.381
reating+0.376
live+0.341
tr+0.341
ained+0.336
Neg logit contributions
 spe-0.377
mer-0.356
chie-0.353
 se-0.328
 en-0.303
Token rocket
Token ID10701
Feature activation-5.74e-01

Pos logit contributions
 spe+0.927
mer+0.908
chie+0.897
 se+0.779
 en+0.732
Neg logit contributions
ward-1.302
reating-1.253
tr-1.184
 about-1.159
ained-1.144
Token back
Token ID736
Feature activation+6.48e-02

Pos logit contributions
 spe+0.870
mer+0.826
chie+0.817
 se+0.743
 en+0.712
Neg logit contributions
ward-0.761
reating-0.753
tr-0.699
 about-0.677
ained-0.666
Token are
Token ID389
Feature activation-7.10e-01

Pos logit contributions
 spe+0.642
mer+0.617
chie+0.609
 se+0.558
 en+0.528
Neg logit contributions
ward-0.496
reating-0.484
 about-0.442
tr-0.437
live-0.431
Token you
Token ID345
Feature activation+7.62e-01

Pos logit contributions
 spe+0.848
mer+0.810
chie+0.774
 se+0.686
 skinny+0.631
Neg logit contributions
ward-1.193
reating-1.162
 about-1.083
ained-1.055
icated-1.052
Token?"
Token ID1701
Feature activation-1.70e+00

Pos logit contributions
reating+1.087
ward+1.082
tr+0.993
live+0.977
ained+0.946
Neg logit contributions
 spe-1.068
mer-1.027
chie-1.010
 se-0.950
 en-0.892
Token But
Token ID887
Feature activation-5.76e-01

Pos logit contributions
 spe+2.434
chie+2.409
mer+2.331
 blu+2.002
 se+1.962
Neg logit contributions
ward-2.432
 about-2.259
reating-2.104
 Afterwards-2.098
tr-2.091
Token the
Token ID262
Feature activation-3.16e+00

Pos logit contributions
 spe+0.758
mer+0.718
chie+0.709
 se+0.641
 skinny+0.580
Neg logit contributions
ward-0.898
reating-0.840
 about-0.800
ained-0.776
tr-0.765
Token little
Token ID1310
Feature activation-3.37e+00

Pos logit contributions
 spe+3.975
mer+3.712
chie+3.601
 dunno+3.073
 Int+2.947
Neg logit contributions
ward-5.399
 stories-4.915
 End-4.864
 end-4.780
nce-4.749
Token boat
Token ID8848
Feature activation-4.89e-01

Pos logit contributions
chie+3.229
 spe+3.219
 Int+2.761
mer+2.673
 dunno+2.656
Neg logit contributions
ward-6.155
reating-5.718
ruck-5.541
 end-5.534
 luck-5.376
Token said
Token ID531
Feature activation-7.45e-01

Pos logit contributions
 spe+0.761
chie+0.712
mer+0.709
 se+0.647
 skinny+0.610
Neg logit contributions
ward-0.629
reating-0.596
 about-0.558
tr-0.553
live-0.544
Token nothing
Token ID2147
Feature activation-9.22e-01

Pos logit contributions
 spe+1.314
mer+1.261
chie+1.248
 se+1.149
 blu+1.097
Neg logit contributions
ward-0.839
reating-0.765
 about-0.745
tr-0.692
ained-0.663
Token,
Token ID11
Feature activation-5.47e-01

Pos logit contributions
 spe+1.431
chie+1.356
mer+1.356
 se+1.228
 blu+1.165
Neg logit contributions
ward-1.204
reating-1.163
 about-1.126
 done-1.027
icated-1.025
Token only
Token ID691
Feature activation+5.18e-01

Pos logit contributions
 spe+0.779
mer+0.746
chie+0.744
 se+0.658
 skinny+0.612
Neg logit contributions
ward-0.788
reating-0.763
 about-0.721
ained-0.689
icated-0.680
Token?"
Token ID1701
Feature activation-1.36e+00

Pos logit contributions
ward+0.512
reating+0.505
tr+0.451
live+0.447
ained+0.443
Neg logit contributions
 spe-0.551
mer-0.527
chie-0.518
 se-0.485
 en-0.456
Token asked
Token ID1965
Feature activation-4.67e-01

Pos logit contributions
 spe+1.906
chie+1.812
mer+1.791
 se+1.450
 blu+1.420
Neg logit contributions
ward-2.014
reating-1.851
 about-1.841
 Afterwards-1.751
ained-1.746
Token Sammy
Token ID44067
Feature activation-7.04e-01

Pos logit contributions
 spe+0.676
mer+0.642
chie+0.635
 se+0.577
 en+0.534
Neg logit contributions
ward-0.654
reating-0.637
 about-0.580
live-0.574
ained-0.572
Token in
Token ID287
Feature activation-9.40e-01

Pos logit contributions
 spe+1.061
mer+1.006
chie+0.989
 se+0.896
 blu+0.835
Neg logit contributions
ward-0.952
reating-0.923
 about-0.872
tr-0.825
live-0.814
Token a
Token ID257
Feature activation-2.61e+00

Pos logit contributions
 spe+1.362
mer+1.354
chie+1.345
 se+1.186
 blu+1.108
Neg logit contributions
ward-1.298
reating-1.248
 about-1.174
tr-1.112
live-1.090
Token tiny
Token ID7009
Feature activation-3.36e+00

Pos logit contributions
mer+3.133
 spe+2.955
chie+2.893
rily+2.556
rier+2.451
Neg logit contributions
ward-4.170
 about-3.943
 done-3.899
 stories-3.851
tr-3.841
Token voice
Token ID3809
Feature activation-2.29e-01

Pos logit contributions
mer+3.918
 spe+3.910
chie+3.690
 redd+3.394
 blu+3.264
Neg logit contributions
 about-5.043
reating-4.829
 stories-4.817
 what-4.795
ward-4.794
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
 spe+0.329
mer+0.311
chie+0.309
 se+0.280
 en+0.261
Neg logit contributions
ward-0.322
reating-0.318
 about-0.289
tr-0.287
ained-0.281
Token
Token ID198
Feature activation-8.50e-01

Pos logit contributions
 spe+1.807
mer+1.762
chie+1.740
 se+1.552
 blu+1.463
Neg logit contributions
ward-1.324
reating-1.182
 about-1.149
tr-1.098
 Afterwards-1.084
Token
Token ID198
Feature activation-1.10e+00

Pos logit contributions
 spe+1.436
mer+1.365
chie+1.343
 se+1.226
 blu+1.171
Neg logit contributions
ward-1.010
reating-0.964
 about-0.893
live-0.877
tr-0.865
Token"
Token ID1
Feature activation-1.39e+00

Pos logit contributions
 spe+1.827
mer+1.669
chie+1.635
 se+1.509
 blu+1.457
Neg logit contributions
ward-1.366
reating-1.296
live-1.216
tr-1.182
nce-1.163
Token and
Token ID290
Feature activation+6.42e-02

Pos logit contributions
ward+0.858
reating+0.849
live+0.783
tr+0.775
ained+0.757
Neg logit contributions
 spe-0.771
mer-0.735
chie-0.711
 se-0.680
 en-0.634
Token wanted
Token ID2227
Feature activation+2.98e-01

Pos logit contributions
ward+0.107
reating+0.105
tr+0.097
live+0.096
ained+0.096
Neg logit contributions
 spe-0.076
mer-0.072
chie-0.070
 se-0.064
 en-0.058
Token to
Token ID284
Feature activation-4.33e-01

Pos logit contributions
ward+0.554
reating+0.547
tr+0.508
live+0.508
ained+0.506
Neg logit contributions
 spe-0.296
mer-0.268
chie-0.265
 se-0.239
 en-0.214
Token know
Token ID760
Feature activation+1.26e+00

Pos logit contributions
 spe+0.446
mer+0.434
chie+0.411
 se+0.352
 en+0.320
Neg logit contributions
ward-0.783
reating-0.771
 about-0.716
tr-0.713
ained-0.702
Token why
Token ID1521
Feature activation-4.09e-01

Pos logit contributions
ward+1.358
live+1.302
reating+1.270
ained+1.254
ugg+1.220
Neg logit contributions
 spe-2.205
chie-2.021
 se-1.986
mer-1.960
 en-1.925
Token the
Token ID262
Feature activation-3.33e+00

Pos logit contributions
 spe+0.552
mer+0.535
chie+0.529
 se+0.479
 en+0.441
Neg logit contributions
ward-0.607
reating-0.592
 about-0.547
tr-0.539
ained-0.533
Token airplane
Token ID19401
Feature activation-5.23e-01

Pos logit contributions
mer+5.337
chie+4.700
 spe+4.495
rily+4.033
 dunno+3.784
Neg logit contributions
reating-5.010
ward-5.005
 stories-4.968
 done-4.927
ruck-4.672
Token could
Token ID714
Feature activation-1.35e+00

Pos logit contributions
 spe+0.760
chie+0.730
mer+0.726
 se+0.652
 skinny+0.615
Neg logit contributions
ward-0.707
reating-0.687
tr-0.648
 about-0.642
ained-0.631
Token fly
Token ID6129
Feature activation-1.20e+00

Pos logit contributions
mer+1.629
 spe+1.564
chie+1.540
 se+1.286
 blu+1.242
Neg logit contributions
ward-2.213
reating-2.141
 about-2.058
tr-2.041
live-1.939
Token but
Token ID475
Feature activation+5.12e-02

Pos logit contributions
 spe+1.950
chie+1.851
mer+1.826
 se+1.705
 dunno+1.631
Neg logit contributions
reating-1.442
ward-1.428
 about-1.366
ained-1.296
tr-1.277
Token not
Token ID407
Feature activation-3.71e-01

Pos logit contributions
ward+0.082
reating+0.081
tr+0.074
live+0.074
ained+0.074
Neg logit contributions
 spe-0.064
mer-0.060
chie-0.059
 se-0.054
 en-0.049
Token is
Token ID318
Feature activation-2.61e-01

Pos logit contributions
 spe+0.465
mer+0.445
chie+0.441
 se+0.411
 en+0.388
Neg logit contributions
ward-0.336
reating-0.323
tr-0.295
 about-0.291
ained-0.287
Token getting
Token ID1972
Feature activation-3.81e-01

Pos logit contributions
 spe+0.357
mer+0.344
chie+0.335
 se+0.308
 en+0.284
Neg logit contributions
ward-0.383
reating-0.373
 about-0.341
tr-0.340
ained-0.339
Token wet
Token ID9583
Feature activation-2.91e-01

Pos logit contributions
 spe+0.474
mer+0.459
chie+0.444
 se+0.395
 en+0.367
Neg logit contributions
ward-0.605
reating-0.594
ained-0.544
 about-0.540
tr-0.538
Token.
Token ID13
Feature activation-7.36e-01

Pos logit contributions
 spe+0.445
chie+0.423
mer+0.423
 se+0.386
 en+0.365
Neg logit contributions
ward-0.380
reating-0.376
 about-0.335
ained-0.334
tr-0.333
Token What
Token ID1867
Feature activation-7.54e-01

Pos logit contributions
 spe+0.761
mer+0.729
chie+0.707
 se+0.634
 en+0.550
Neg logit contributions
ward-1.325
reating-1.277
tr-1.234
 about-1.215
ained-1.200
Token can
Token ID460
Feature activation-3.33e+00

Pos logit contributions
 spe+1.182
mer+1.164
chie+1.159
 se+1.023
 en+0.963
Neg logit contributions
ward-0.958
reating-0.925
 about-0.890
tr-0.856
ained-0.839
Token we
Token ID356
Feature activation-1.02e+00

Pos logit contributions
mer+4.562
 spe+4.203
chie+3.806
 redd+3.513
 blu+3.429
Neg logit contributions
ward-5.765
nce-5.072
reating-5.057
 about-5.056
mm-5.042
Token do
Token ID466
Feature activation-1.16e+00

Pos logit contributions
 spe+1.496
mer+1.459
chie+1.387
 se+1.232
 skinny+1.206
Neg logit contributions
ward-1.391
reating-1.334
 about-1.258
tr-1.237
 done-1.231
Token?"
Token ID1701
Feature activation-7.41e-01

Pos logit contributions
 spe+2.080
chie+1.993
mer+1.952
 se+1.783
 blu+1.724
Neg logit contributions
ward-1.320
reating-1.265
 about-1.257
ained-1.129
 done-1.121
Token Cow
Token ID10417
Feature activation+6.36e-01

Pos logit contributions
 spe+1.131
chie+1.111
mer+1.105
 se+0.959
 blu+0.904
Neg logit contributions
ward-0.991
reating-0.917
tr-0.862
 about-0.861
ained-0.852
Token said
Token ID531
Feature activation-5.90e-01

Pos logit contributions
reating+0.900
ward+0.887
ained+0.799
tr+0.785
live+0.770
Neg logit contributions
 spe-0.917
mer-0.886
chie-0.857
 se-0.812
 en-0.750
Token pigeon
Token ID46387
Feature activation+7.47e-02

Pos logit contributions
chie+2.802
 spe+2.768
mer+2.731
 dunno+2.322
 Int+2.297
Neg logit contributions
ward-3.707
reating-3.655
 about-3.511
 done-3.478
 stories-3.388
Token wanted
Token ID2227
Feature activation-3.66e-01

Pos logit contributions
ward+0.120
reating+0.119
tr+0.108
live+0.107
ained+0.107
Neg logit contributions
 spe-0.092
mer-0.087
chie-0.086
 se-0.078
 en-0.072
Token to
Token ID284
Feature activation-2.96e-01

Pos logit contributions
 spe+0.412
mer+0.397
chie+0.389
 se+0.333
 en+0.307
Neg logit contributions
ward-0.632
reating-0.628
 about-0.584
tr-0.568
ained-0.563
Token help
Token ID1037
Feature activation-1.16e+00

Pos logit contributions
 spe+0.332
mer+0.319
chie+0.306
 se+0.268
 en+0.247
Neg logit contributions
ward-0.510
reating-0.504
 about-0.463
tr-0.462
live-0.457
Token the
Token ID262
Feature activation-2.55e+00

Pos logit contributions
 spe+1.831
chie+1.762
mer+1.760
 se+1.544
 dunno+1.484
Neg logit contributions
reating-1.499
ward-1.460
 about-1.419
icated-1.294
 what-1.290
Token little
Token ID1310
Feature activation-3.32e+00

Pos logit contributions
mer+3.172
 spe+3.160
chie+3.059
 dunno+2.515
ihu+2.369
Neg logit contributions
reating-4.069
ward-4.058
 done-3.914
 about-3.890
 stories-3.845
Token bird
Token ID6512
Feature activation-5.95e-01

Pos logit contributions
chie+4.741
 spe+4.289
mer+4.191
 dunno+3.906
ihu+3.783
Neg logit contributions
reating-4.850
ward-4.740
 about-4.384
 done-4.331
 what-4.312
Token,
Token ID11
Feature activation-1.52e+00

Pos logit contributions
 spe+0.968
mer+0.928
chie+0.910
 se+0.835
 skinny+0.783
Neg logit contributions
ward-0.732
reating-0.724
 about-0.668
tr-0.633
icated-0.621
Token so
Token ID523
Feature activation-3.32e-01

Pos logit contributions
mer+2.151
 spe+2.085
chie+1.987
 se+1.613
ihu+1.554
Neg logit contributions
 about-2.408
ward-2.374
reating-2.348
icated-2.162
 what-2.114
Token he
Token ID339
Feature activation-7.95e-01

Pos logit contributions
 spe+0.455
mer+0.440
chie+0.439
 se+0.385
 en+0.359
Neg logit contributions
ward-0.494
reating-0.476
 about-0.437
tr-0.431
ained-0.430
Token told
Token ID1297
Feature activation-7.94e-01

Pos logit contributions
 spe+1.104
mer+1.061
chie+1.052
 se+0.890
 blu+0.876
Neg logit contributions
ward-1.141
reating-1.084
tr-1.023
 about-1.014
live-1.001
Token said
Token ID531
Feature activation-1.39e+00

Pos logit contributions
 spe+1.063
chie+1.009
mer+0.997
 se+0.903
 blu+0.863
Neg logit contributions
ward-0.723
reating-0.676
 about-0.643
tr-0.627
live-0.617
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
 spe+2.351
chie+2.204
mer+2.182
 se+2.002
 blu+1.928
Neg logit contributions
ward-1.711
 about-1.623
reating-1.562
tr-1.437
 done-1.380
Token "
Token ID366
Feature activation-1.67e+00

Pos logit contributions
 spe+1.611
chie+1.516
mer+1.484
 se+1.342
 blu+1.253
Neg logit contributions
ward-1.740
 about-1.606
reating-1.564
tr-1.523
 Afterwards-1.509
TokenBut
Token ID1537
Feature activation-2.52e-01

Pos logit contributions
 spe+2.366
chie+2.238
mer+2.180
 se+1.933
ihu+1.829
Neg logit contributions
ward-2.435
reating-2.364
tr-2.273
ained-2.157
After-2.149
Token how
Token ID703
Feature activation-5.70e-01

Pos logit contributions
 spe+0.280
mer+0.266
chie+0.259
 se+0.230
 en+0.209
Neg logit contributions
ward-0.437
reating-0.427
tr-0.396
 about-0.395
ained-0.389
Token can
Token ID460
Feature activation-3.31e+00

Pos logit contributions
 spe+0.921
mer+0.890
chie+0.878
 se+0.803
 en+0.749
Neg logit contributions
ward-0.706
reating-0.690
 about-0.654
tr-0.622
ained-0.621
Token we
Token ID356
Feature activation-1.08e+00

Pos logit contributions
mer+4.384
 spe+3.875
chie+3.806
 redd+3.501
rily+3.305
Neg logit contributions
ward-5.628
reating-5.167
 what-5.027
 about-4.961
icated-4.958
Token fly
Token ID6129
Feature activation-4.69e-01

Pos logit contributions
 spe+1.330
mer+1.290
chie+1.210
 se+1.079
 skinny+1.037
Neg logit contributions
ward-1.717
reating-1.647
tr-1.563
 about-1.548
ained-1.523
Token in
Token ID287
Feature activation-1.16e+00

Pos logit contributions
 spe+0.678
mer+0.641
chie+0.637
 se+0.581
 en+0.545
Neg logit contributions
ward-0.653
reating-0.643
 about-0.589
tr-0.584
ained-0.581
Token a
Token ID257
Feature activation-2.07e+00

Pos logit contributions
mer+1.620
 spe+1.602
chie+1.579
 se+1.386
 dunno+1.296
Neg logit contributions
ward-1.699
reating-1.654
 about-1.523
ained-1.466
icated-1.458
Token jet
Token ID12644
Feature activation-7.10e-01

Pos logit contributions
mer+2.409
 spe+2.359
chie+2.279
rily+1.983
 dunno+1.935
Neg logit contributions
ward-3.380
reating-3.281
tr-3.189
THE-3.004
 done-2.962
Token is
Token ID318
Feature activation-6.77e-01

Pos logit contributions
 spe+1.897
chie+1.833
mer+1.809
 se+1.682
 blu+1.600
Neg logit contributions
ward-1.096
reating-1.013
 about-1.003
tr-0.977
live-0.961
Token not
Token ID407
Feature activation-1.01e-01

Pos logit contributions
 spe+0.896
mer+0.891
chie+0.859
 se+0.777
 dunno+0.710
Neg logit contributions
ward-1.014
reating-0.975
 about-0.928
tr-0.899
ained-0.895
Token happy
Token ID3772
Feature activation+1.43e-01

Pos logit contributions
 spe+0.127
mer+0.121
chie+0.118
 se+0.107
 en+0.098
Neg logit contributions
ward-0.160
reating-0.157
 about-0.144
tr-0.143
ained-0.143
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
ward+0.218
reating+0.214
tr+0.195
live+0.194
ained+0.193
Neg logit contributions
 spe-0.190
mer-0.180
chie-0.176
 se-0.162
 en-0.150
Token What
Token ID1867
Feature activation-3.05e-01

Pos logit contributions
 spe+1.252
mer+1.225
chie+1.213
 se+1.103
 en+0.938
Neg logit contributions
ward-1.904
tr-1.813
reating-1.800
 about-1.764
icated-1.701
Token can
Token ID460
Feature activation-3.30e+00

Pos logit contributions
 spe+0.483
mer+0.468
chie+0.464
 se+0.420
 en+0.395
Neg logit contributions
ward-0.383
reating-0.377
 about-0.343
tr-0.338
live-0.332
Token I
Token ID314
Feature activation-5.12e-01

Pos logit contributions
mer+5.072
 spe+4.598
chie+4.173
 redd+3.746
 blu+3.713
Neg logit contributions
ward-5.142
 about-4.799
reating-4.712
icated-4.681
 what-4.666
Token do
Token ID466
Feature activation-1.13e+00

Pos logit contributions
 spe+0.787
mer+0.757
chie+0.746
 se+0.668
 skinny+0.634
Neg logit contributions
ward-0.666
reating-0.652
 about-0.598
tr-0.594
live-0.577
Token?"
Token ID1701
Feature activation-8.46e-02

Pos logit contributions
 spe+2.050
chie+1.935
mer+1.901
 se+1.735
 blu+1.673
Neg logit contributions
ward-1.270
reating-1.264
 about-1.239
 done-1.089
ained-1.077
Token She
Token ID1375
Feature activation-1.03e+00

Pos logit contributions
 spe+0.107
mer+0.101
chie+0.100
 se+0.089
 en+0.082
Neg logit contributions
ward-0.136
reating-0.133
tr-0.122
 about-0.121
live-0.121
Token thought
Token ID1807
Feature activation-7.76e-01

Pos logit contributions
 spe+1.506
mer+1.470
chie+1.463
 blu+1.196
 se+1.169
Neg logit contributions
ward-1.441
reating-1.330
 about-1.319
tr-1.313
live-1.264
Token and
Token ID290
Feature activation+2.05e-01

Pos logit contributions
ward+0.127
reating+0.125
tr+0.113
live+0.113
ained+0.113
Neg logit contributions
 spe-0.118
mer-0.112
chie-0.110
 se-0.101
 en-0.094
Token our
Token ID674
Feature activation-4.53e-01

Pos logit contributions
ward+0.318
reating+0.314
tr+0.285
live+0.284
ained+0.284
Neg logit contributions
 spe-0.265
mer-0.249
chie-0.245
 se-0.227
 en-0.207
Token corn
Token ID11676
Feature activation-2.33e-01

Pos logit contributions
 spe+0.585
mer+0.561
chie+0.551
 se+0.496
 en+0.465
Neg logit contributions
ward-0.693
reating-0.679
tr-0.630
 about-0.616
ained-0.610
Token.
Token ID13
Feature activation-7.67e-02

Pos logit contributions
 spe+0.292
mer+0.275
chie+0.273
 se+0.247
 en+0.230
Neg logit contributions
ward-0.369
reating-0.360
tr-0.333
 about-0.328
ained-0.326
Token What
Token ID1867
Feature activation-4.33e-01

Pos logit contributions
 spe+0.099
mer+0.094
chie+0.093
 se+0.084
 en+0.077
Neg logit contributions
ward-0.119
reating-0.117
tr-0.107
ained-0.106
 about-0.106
Token can
Token ID460
Feature activation-3.30e+00

Pos logit contributions
 spe+0.725
mer+0.706
chie+0.703
 se+0.634
 en+0.602
Neg logit contributions
ward-0.506
reating-0.495
 about-0.455
tr-0.441
icated-0.435
Token we
Token ID356
Feature activation-4.72e-01

Pos logit contributions
mer+5.172
 spe+5.040
chie+4.621
 redd+4.199
 blu+4.087
Neg logit contributions
ward-4.823
 about-4.505
reating-4.448
mm-4.363
 what-4.322
Token do
Token ID466
Feature activation-2.85e-01

Pos logit contributions
 spe+0.727
mer+0.697
chie+0.687
 se+0.623
 en+0.594
Neg logit contributions
ward-0.612
reating-0.596
 about-0.542
tr-0.541
ained-0.530
Token?"
Token ID1701
Feature activation-1.03e+00

Pos logit contributions
 spe+0.463
mer+0.440
chie+0.439
 se+0.400
 en+0.378
Neg logit contributions
ward-0.355
reating-0.347
 about-0.312
tr-0.305
ained-0.305
Token
Token ID198
Feature activation-4.75e-01

Pos logit contributions
 spe+1.756
chie+1.713
mer+1.702
 se+1.485
 blu+1.434
Neg logit contributions
ward-1.235
reating-1.133
 about-1.095
 Afterwards-1.029
live-1.028
Token
Token ID198
Feature activation-3.48e-01

Pos logit contributions
 spe+0.763
mer+0.731
chie+0.722
 se+0.656
 en+0.622
Neg logit contributions
ward-0.595
reating-0.578
live-0.520
 about-0.517
tr-0.510
Token the
Token ID262
Feature activation-1.54e+00

Pos logit contributions
 spe+0.806
mer+0.787
chie+0.760
 se+0.696
 skinny+0.647
Neg logit contributions
reating-0.780
ward-0.778
 about-0.706
live-0.675
tr-0.672
Token bird
Token ID6512
Feature activation+3.62e-01

Pos logit contributions
mer+1.267
 spe+1.251
chie+1.141
 dunno+0.882
 se+0.877
Neg logit contributions
ward-3.057
reating-2.850
tr-2.824
nce-2.708
 about-2.706
Token,
Token ID11
Feature activation-4.46e-01

Pos logit contributions
ward+0.446
reating+0.441
ained+0.391
live+0.390
tr+0.389
Neg logit contributions
 spe-0.583
mer-0.556
chie-0.550
 se-0.514
 en-0.485
Token "
Token ID366
Feature activation-3.03e-01

Pos logit contributions
 spe+0.543
mer+0.521
chie+0.507
 se+0.454
 skinny+0.406
Neg logit contributions
ward-0.733
reating-0.700
 about-0.664
tr-0.649
ained-0.638
TokenWhat
Token ID2061
Feature activation-5.02e-01

Pos logit contributions
 spe+0.427
mer+0.405
chie+0.397
 se+0.365
 en+0.343
Neg logit contributions
ward-0.437
reating-0.433
tr-0.390
live-0.386
ained-0.385
Token can
Token ID460
Feature activation-3.30e+00

Pos logit contributions
 spe+0.838
mer+0.823
chie+0.806
 se+0.734
 en+0.693
Neg logit contributions
ward-0.588
reating-0.575
 about-0.531
tr-0.513
icated-0.509
Token I
Token ID314
Feature activation-3.29e-01

Pos logit contributions
mer+4.717
 spe+4.516
chie+3.842
 redd+3.715
rily+3.695
Neg logit contributions
ward-5.226
 about-5.039
 what-4.810
 done-4.694
reating-4.670
Token do
Token ID466
Feature activation-9.70e-01

Pos logit contributions
 spe+0.516
mer+0.495
chie+0.484
 se+0.444
 en+0.419
Neg logit contributions
ward-0.421
reating-0.411
 about-0.374
tr-0.371
live-0.365
Token to
Token ID284
Feature activation-4.96e-01

Pos logit contributions
 spe+1.640
chie+1.554
mer+1.539
 se+1.408
 blu+1.338
Neg logit contributions
ward-1.166
reating-1.134
 about-1.095
live-0.978
ained-0.977
Token get
Token ID651
Feature activation+3.98e-01

Pos logit contributions
 spe+0.686
mer+0.669
chie+0.642
 se+0.575
 en+0.543
Neg logit contributions
ward-0.727
reating-0.715
 about-0.650
live-0.633
icated-0.628
Token to
Token ID284
Feature activation-4.58e-02

Pos logit contributions
ward+0.639
reating+0.634
tr+0.587
live+0.583
ained+0.571
Neg logit contributions
 spe-0.493
mer-0.462
chie-0.461
 se-0.423
 en-0.386
Token
Token ID198
Feature activation-1.13e+00

Pos logit contributions
 spe+2.463
chie+2.342
mer+2.289
 se+2.066
 blu+2.048
Neg logit contributions
ward-1.435
reating-1.358
 about-1.312
live-1.292
tr-1.247
Token"
Token ID1
Feature activation-1.85e+00

Pos logit contributions
 spe+1.923
chie+1.784
mer+1.739
 se+1.614
 en+1.560
Neg logit contributions
ward-1.360
reating-1.312
live-1.214
tr-1.186
 about-1.176
TokenOK
Token ID11380
Feature activation-7.81e-01

Pos logit contributions
 spe+2.987
chie+2.816
mer+2.793
 se+2.487
rily+2.468
Neg logit contributions
ward-2.272
reating-2.153
tr-2.085
After-2.058
live-1.977
Token,
Token ID11
Feature activation-4.53e-01

Pos logit contributions
 spe+1.414
mer+1.364
chie+1.347
 se+1.254
 en+1.181
Neg logit contributions
ward-0.837
reating-0.789
 about-0.732
tr-0.676
icated-0.659
Token what
Token ID644
Feature activation+1.05e+00

Pos logit contributions
 spe+0.422
mer+0.406
chie+0.392
 se+0.328
 en+0.291
Neg logit contributions
ward-0.868
reating-0.854
 about-0.807
tr-0.787
icated-0.785
Token can
Token ID460
Feature activation-3.29e+00

Pos logit contributions
ward+1.536
reating+1.505
live+1.396
ugg+1.360
ained+1.356
Neg logit contributions
 spe-1.472
mer-1.314
 se-1.305
chie-1.297
 en-1.200
Token we
Token ID356
Feature activation-1.40e+00

Pos logit contributions
mer+4.379
 spe+4.085
chie+3.725
 redd+3.475
rier+3.290
Neg logit contributions
ward-5.656
 what-5.438
 about-5.433
icated-5.325
reating-5.200
Token make
Token ID787
Feature activation-2.25e-01

Pos logit contributions
 spe+2.197
mer+2.146
chie+2.026
rily+1.766
 se+1.750
Neg logit contributions
ward-1.768
reating-1.745
tr-1.628
 about-1.626
icated-1.597
Token?"
Token ID1701
Feature activation-1.44e+00

Pos logit contributions
 spe+0.334
mer+0.321
chie+0.315
 se+0.285
 en+0.269
Neg logit contributions
ward-0.309
reating-0.304
 about-0.274
tr-0.271
ained-0.268
Token Sam
Token ID3409
Feature activation-8.52e-01

Pos logit contributions
 spe+2.657
chie+2.597
mer+2.568
 se+2.225
rily+2.181
Neg logit contributions
reating-1.492
ward-1.473
 about-1.431
ained-1.258
 done-1.255
Token asks
Token ID7893
Feature activation-6.33e-01

Pos logit contributions
 spe+1.126
chie+1.047
mer+1.045
 se+0.894
 blu+0.818
Neg logit contributions
reating-1.328
ward-1.307
 about-1.259
icated-1.194
tr-1.190
Token boat
Token ID8848
Feature activation-3.88e-01

Pos logit contributions
 spe+1.120
chie+1.000
mer+0.988
 skinny+0.711
 Int+0.693
Neg logit contributions
reating-2.814
ward-2.762
 about-2.644
 stories-2.525
ained-2.516
Token.
Token ID13
Feature activation-5.55e-01

Pos logit contributions
 spe+0.567
mer+0.533
chie+0.530
 se+0.484
 en+0.450
Neg logit contributions
ward-0.541
reating-0.516
 about-0.481
tr-0.477
ained-0.462
Token
Token ID198
Feature activation+5.58e-01

Pos logit contributions
 spe+0.822
mer+0.783
chie+0.782
 se+0.705
 en+0.648
Neg logit contributions
ward-0.781
reating-0.726
 about-0.690
tr-0.680
ained-0.666
Token
Token ID198
Feature activation-2.21e-01

Pos logit contributions
reating+0.803
ward+0.797
tr+0.716
ained+0.710
live+0.700
Neg logit contributions
 spe-0.770
mer-0.735
chie-0.716
 se-0.680
 en-0.628
TokenThe
Token ID464
Feature activation-2.00e+00

Pos logit contributions
 spe+0.324
mer+0.305
chie+0.301
 se+0.277
 en+0.258
Neg logit contributions
ward-0.312
reating-0.304
live-0.274
tr-0.274
 about-0.271
Token little
Token ID1310
Feature activation-3.29e+00

Pos logit contributions
 spe+2.650
mer+2.496
chie+2.488
 dunno+2.062
 Int+1.976
Neg logit contributions
ward-3.164
reating-2.956
 stories-2.822
nce-2.771
 about-2.742
Token boat
Token ID8848
Feature activation-3.24e-01

Pos logit contributions
 spe+3.370
chie+3.359
mer+2.919
 Int+2.600
 dunno+2.540
Neg logit contributions
ward-5.984
reating-5.729
 luck-5.449
ained-5.420
ruck-5.414
Token breathed
Token ID39545
Feature activation-3.50e-01

Pos logit contributions
 spe+0.480
chie+0.454
mer+0.453
 se+0.409
 en+0.386
Neg logit contributions
ward-0.441
reating-0.421
 about-0.388
tr-0.387
live-0.380
Token a
Token ID257
Feature activation+4.36e-01

Pos logit contributions
 spe+0.518
mer+0.499
chie+0.496
 se+0.448
 en+0.419
Neg logit contributions
ward-0.474
reating-0.456
 about-0.416
ained-0.412
tr-0.407
Token sigh
Token ID19680
Feature activation+1.35e-01

Pos logit contributions
reating+0.656
ward+0.653
ained+0.600
live+0.599
tr+0.590
Neg logit contributions
 spe-0.593
mer-0.549
chie-0.542
 se-0.512
 en-0.468
Token of
Token ID286
Feature activation+1.07e+00

Pos logit contributions
ward+0.179
reating+0.177
tr+0.158
live+0.157
ained+0.156
Neg logit contributions
 spe-0.204
mer-0.195
chie-0.191
 se-0.178
 en-0.166
Token bird
Token ID6512
Feature activation-1.55e-01

Pos logit contributions
 spe+0.898
mer+0.896
chie+0.882
 se+0.730
 blu+0.692
Neg logit contributions
ward-1.350
reating-1.344
 about-1.266
tr-1.237
icated-1.207
Token came
Token ID1625
Feature activation-5.26e-01

Pos logit contributions
 spe+0.232
mer+0.221
chie+0.219
 se+0.200
 en+0.189
Neg logit contributions
ward-0.207
reating-0.204
tr-0.184
ained-0.182
 about-0.182
Token and
Token ID290
Feature activation-6.98e-01

Pos logit contributions
 spe+0.787
mer+0.747
chie+0.727
 se+0.670
 en+0.626
Neg logit contributions
ward-0.723
reating-0.700
ained-0.646
 about-0.646
tr-0.633
Token scared
Token ID12008
Feature activation-6.60e-01

Pos logit contributions
 spe+0.992
chie+0.962
mer+0.959
 se+0.831
 blu+0.794
Neg logit contributions
ward-0.982
reating-0.929
tr-0.864
 about-0.857
icated-0.850
Token the
Token ID262
Feature activation-1.81e+00

Pos logit contributions
 spe+0.928
mer+0.888
chie+0.865
 se+0.792
 en+0.730
Neg logit contributions
reating-0.944
ward-0.942
 about-0.875
ained-0.837
icated-0.837
Token little
Token ID1310
Feature activation-3.28e+00

Pos logit contributions
mer+1.590
chie+1.513
 spe+1.435
 Int+1.109
 dunno+1.094
Neg logit contributions
ward-3.500
tr-3.412
reating-3.361
 about-3.215
 done-3.200
Token bird
Token ID6512
Feature activation-8.08e-01

Pos logit contributions
chie+3.735
mer+3.724
 dunno+3.216
 Int+3.213
rily+3.186
Neg logit contributions
tr-5.162
ward-5.139
reating-4.994
inea-4.865
 about-4.701
Tokenie
Token ID494
Feature activation-6.92e-01

Pos logit contributions
 spe+1.390
mer+1.305
chie+1.292
 se+1.203
 skinny+1.147
Neg logit contributions
ward-0.922
reating-0.913
 about-0.829
ained-0.808
tr-0.808
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
 spe+1.023
mer+0.959
chie+0.955
 se+0.883
 blu+0.816
Neg logit contributions
ward-0.956
reating-0.913
 about-0.861
tr-0.844
ained-0.832
Token The
Token ID383
Feature activation-1.56e+00

Pos logit contributions
 spe+1.802
mer+1.768
chie+1.724
 se+1.530
 blu+1.376
Neg logit contributions
ward-2.264
 about-2.147
reating-2.046
tr-2.029
 Afterwards-2.016
Token little
Token ID1310
Feature activation-2.86e+00

Pos logit contributions
mer+1.359
 spe+1.226
chie+1.221
 dunno+0.898
rily+0.876
Neg logit contributions
ward-3.076
reating-2.950
tr-2.947
 done-2.808
 about-2.802
Token it
Token ID340
Feature activation-1.52e+00

Pos logit contributions
 spe+1.187
mer+1.163
chie+1.150
 se+0.988
 dunno+0.895
Neg logit contributions
ward-1.445
reating-1.342
 about-1.336
ained-1.275
 done-1.252
Token was
Token ID373
Feature activation+3.19e-02

Pos logit contributions
 spe+2.616
chie+2.580
mer+2.436
 se+2.224
rily+2.207
Neg logit contributions
ward-1.653
 about-1.529
tr-1.517
ained-1.463
 done-1.450
Token small
Token ID1402
Feature activation-3.88e-01

Pos logit contributions
ward+0.050
reating+0.049
tr+0.045
ained+0.045
live+0.045
Neg logit contributions
 spe-0.040
mer-0.038
chie-0.038
 se-0.034
 en-0.032
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
 spe+0.561
chie+0.538
mer+0.536
 se+0.486
 en+0.452
Neg logit contributions
ward-0.541
reating-0.531
 about-0.489
tr-0.476
ained-0.472
Token The
Token ID383
Feature activation-2.47e+00

Pos logit contributions
 spe+1.530
mer+1.526
chie+1.485
 se+1.308
 en+1.119
Neg logit contributions
ward-1.969
reating-1.870
 about-1.844
ained-1.777
 Afterwards-1.761
Token little
Token ID1310
Feature activation-3.28e+00

Pos logit contributions
 spe+2.648
chie+2.514
mer+2.406
 dunno+1.865
 Int+1.864
Neg logit contributions
ward-4.582
reating-4.339
 stories-4.225
 done-4.160
ained-4.142
Token boat
Token ID8848
Feature activation-6.36e-01

Pos logit contributions
 spe+3.351
chie+3.248
mer+2.895
 Int+2.694
 dunno+2.587
Neg logit contributions
ward-5.922
reating-5.776
ruck-5.458
ained-5.431
icated-5.408
Token was
Token ID373
Feature activation-5.31e-01

Pos logit contributions
 spe+1.040
chie+0.996
mer+0.972
 se+0.886
 skinny+0.843
Neg logit contributions
ward-0.769
reating-0.717
 about-0.689
tr-0.681
ained-0.679
Token mo
Token ID6941
Feature activation-1.61e-01

Pos logit contributions
 spe+0.651
mer+0.634
chie+0.612
 se+0.540
 dunno+0.489
Neg logit contributions
ward-0.861
reating-0.823
 about-0.793
ained-0.782
tr-0.765
Tokenored
Token ID1850
Feature activation+2.67e-01

Pos logit contributions
 spe+0.291
mer+0.277
chie+0.273
 se+0.258
 en+0.245
Neg logit contributions
ward-0.170
reating-0.166
ained-0.144
live-0.144
tr-0.143
Token in
Token ID287
Feature activation-9.91e-01

Pos logit contributions
ward+0.389
reating+0.386
tr+0.349
live+0.347
ained+0.343
Neg logit contributions
 spe-0.372
mer-0.353
chie-0.345
 se-0.320
 en-0.299
Token
Token ID198
Feature activation-1.16e-01

Pos logit contributions
 spe+0.598
mer+0.569
chie+0.564
 se+0.519
 en+0.485
Neg logit contributions
ward-0.485
reating-0.471
 about-0.428
live-0.422
ained-0.419
TokenOne
Token ID3198
Feature activation+4.73e-02

Pos logit contributions
 spe+0.173
mer+0.165
chie+0.162
 se+0.150
 en+0.140
Neg logit contributions
ward-0.158
reating-0.155
tr-0.139
live-0.138
ained-0.138
Token day
Token ID1110
Feature activation+1.54e+00

Pos logit contributions
ward+0.042
reating+0.041
tr+0.034
live+0.034
ained+0.034
Neg logit contributions
 spe-0.093
mer-0.090
chie-0.089
 se-0.084
 en-0.080
Token,
Token ID11
Feature activation-4.24e-01

Pos logit contributions
reating+2.396
ward+2.179
tr+2.108
live+2.075
ained+2.057
Neg logit contributions
 spe-2.283
mer-2.082
 se-1.914
chie-1.894
 en-1.796
Token the
Token ID262
Feature activation-2.59e+00

Pos logit contributions
 spe+0.581
chie+0.562
mer+0.560
 se+0.504
 en+0.462
Neg logit contributions
ward-0.629
reating-0.593
 about-0.554
tr-0.546
ained-0.544
Token tiny
Token ID7009
Feature activation-3.27e+00

Pos logit contributions
 spe+3.030
chie+3.018
mer+2.776
 Int+2.361
 dunno+2.361
Neg logit contributions
ward-4.406
reating-4.062
nce-3.983
 stories-3.959
 done-3.907
Token pink
Token ID11398
Feature activation-2.51e+00

Pos logit contributions
 spe+3.015
chie+2.914
 Int+2.700
mer+2.350
 dunno+2.259
Neg logit contributions
ward-5.967
reating-5.938
ruck-5.920
nce-5.690
inea-5.605
Token boat
Token ID8848
Feature activation-8.40e-01

Pos logit contributions
 spe+2.129
chie+2.024
mer+1.675
 Int+1.505
 skinny+1.492
Neg logit contributions
ward-5.008
reating-4.965
nce-4.840
ained-4.774
ruck-4.674
Token had
Token ID550
Feature activation-1.09e+00

Pos logit contributions
 spe+1.192
chie+1.120
mer+1.072
 se+0.956
 skinny+0.930
Neg logit contributions
ward-1.202
reating-1.129
 about-1.096
ained-1.092
tr-1.077
Token to
Token ID284
Feature activation-9.39e-01

Pos logit contributions
mer+1.665
 spe+1.646
chie+1.563
 se+1.369
 dunno+1.316
Neg logit contributions
ward-1.483
reating-1.388
 about-1.339
ained-1.293
 done-1.277
Token move
Token ID1445
Feature activation-1.78e-01

Pos logit contributions
 spe+1.260
mer+1.246
chie+1.188
 se+1.018
 skinny+0.987
Neg logit contributions
ward-1.421
reating-1.397
 about-1.262
ained-1.242
tr-1.233
Token We
Token ID775
Feature activation-2.69e-02

Pos logit contributions
 spe+1.543
mer+1.434
chie+1.375
 se+1.341
 skinny+1.171
Neg logit contributions
ward-2.035
reating-1.895
tr-1.888
ained-1.844
ouncing-1.809
Token are
Token ID389
Feature activation-1.79e-02

Pos logit contributions
 spe+0.035
mer+0.033
chie+0.033
 se+0.030
 en+0.028
Neg logit contributions
ward-0.042
reating-0.041
tr-0.037
ained-0.037
 about-0.037
Token not
Token ID407
Feature activation-6.60e-01

Pos logit contributions
 spe+0.028
mer+0.027
chie+0.027
 se+0.025
 en+0.023
Neg logit contributions
ward-0.023
reating-0.023
tr-0.020
 about-0.020
ained-0.020
Token going
Token ID1016
Feature activation-6.77e-01

Pos logit contributions
 spe+0.995
mer+0.966
chie+0.924
 se+0.861
 blu+0.799
Neg logit contributions
ward-0.878
reating-0.867
 about-0.798
ained-0.775
 done-0.763
Token on
Token ID319
Feature activation-1.13e+00

Pos logit contributions
 spe+0.876
mer+0.824
chie+0.810
 se+0.733
 en+0.669
Neg logit contributions
ward-1.060
reating-1.057
 about-0.977
ained-0.967
tr-0.943
Token a
Token ID257
Feature activation-3.27e+00

Pos logit contributions
mer+1.684
 spe+1.635
chie+1.623
 se+1.425
 en+1.358
Neg logit contributions
ward-1.622
reating-1.574
 about-1.431
ained-1.376
icated-1.355
Token trip
Token ID5296
Feature activation-6.76e-01

Pos logit contributions
chie+5.052
mer+5.005
 spe+4.664
rily+4.263
rier+4.199
Neg logit contributions
reating-4.758
ward-4.677
 about-4.365
tr-4.287
 stories-4.165
Token today
Token ID1909
Feature activation-1.71e+00

Pos logit contributions
 spe+1.011
chie+0.942
mer+0.923
 se+0.865
 en+0.807
Neg logit contributions
ward-0.936
reating-0.934
 about-0.866
ained-0.844
tr-0.827
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
 spe+2.828
chie+2.752
mer+2.608
 se+2.482
 en+2.330
Neg logit contributions
ward-2.168
reating-2.131
 about-2.108
ained-1.898
 done-1.875
Token We
Token ID775
Feature activation+2.19e-01

Pos logit contributions
 spe+1.162
mer+1.116
chie+1.082
 se+0.998
 skinny+0.839
Neg logit contributions
ward-1.762
reating-1.700
tr-1.656
 about-1.633
ained-1.603
Token are
Token ID389
Feature activation-4.93e-02

Pos logit contributions
ward+0.346
reating+0.341
tr+0.309
live+0.308
ained+0.306
Neg logit contributions
 spe-0.278
mer-0.262
chie-0.258
 se-0.235
 en-0.217
Token said
Token ID531
Feature activation-1.12e+00

Pos logit contributions
 spe+0.526
mer+0.498
chie+0.495
 se+0.448
 blu+0.423
Neg logit contributions
ward-0.392
reating-0.380
 about-0.347
tr-0.340
live-0.337
Token.
Token ID13
Feature activation-4.78e-01

Pos logit contributions
 spe+1.840
chie+1.737
mer+1.735
 se+1.542
 blu+1.523
Neg logit contributions
ward-1.412
 about-1.347
reating-1.336
tr-1.193
ouncing-1.158
Token "
Token ID366
Feature activation-7.99e-01

Pos logit contributions
 spe+0.590
chie+0.557
mer+0.552
 se+0.485
 blu+0.449
Neg logit contributions
ward-0.784
reating-0.759
 about-0.719
tr-0.713
ained-0.689
TokenYou
Token ID1639
Feature activation+2.89e-01

Pos logit contributions
 spe+1.191
chie+1.119
mer+1.099
 se+0.990
 blu+0.921
Neg logit contributions
ward-1.086
reating-1.072
tr-0.992
ained-0.990
icated-0.971
Token should
Token ID815
Feature activation+9.99e-01

Pos logit contributions
ward+0.379
reating+0.375
tr+0.330
live+0.329
ained+0.328
Neg logit contributions
 spe-0.448
mer-0.426
chie-0.422
 se-0.396
 en-0.370
Token be
Token ID307
Feature activation+1.85e+00

Pos logit contributions
ward+1.534
reating+1.517
tr+1.375
ained+1.367
live+1.328
Neg logit contributions
 spe-1.309
chie-1.214
mer-1.191
 se-1.176
 en-1.078
Token more
Token ID517
Feature activation+6.14e-01

Pos logit contributions
reating+3.580
ward+3.464
tr+3.261
ops+3.246
icated+3.210
Neg logit contributions
 en-1.782
 spe-1.780
chie-1.725
 se-1.661
 shy-1.614
Token careful
Token ID8161
Feature activation-2.68e-01

Pos logit contributions
ward+1.089
reating+1.084
tr+0.998
live+0.971
icated+0.969
Neg logit contributions
 spe-0.677
mer-0.631
chie-0.607
 se-0.578
 en-0.533
Token.
Token ID13
Feature activation+2.11e-02

Pos logit contributions
 spe+0.409
mer+0.385
chie+0.383
 se+0.354
 en+0.329
Neg logit contributions
ward-0.357
reating-0.353
 about-0.320
tr-0.313
ained-0.313
Token You
Token ID921
Feature activation+1.41e-01

Pos logit contributions
ward+0.030
reating+0.030
tr+0.027
live+0.027
ained+0.027
Neg logit contributions
 spe-0.030
mer-0.028
chie-0.028
 se-0.026
 en-0.024
Token could
Token ID714
Feature activation+9.20e-01

Pos logit contributions
ward+0.196
reating+0.193
tr+0.173
live+0.172
ained+0.171
Neg logit contributions
 spe-0.207
mer-0.197
chie-0.194
 se-0.180
 en-0.168
Token something
Token ID1223
Feature activation-9.08e-01

Pos logit contributions
 spe+0.482
mer+0.462
chie+0.456
 se+0.421
 en+0.388
Neg logit contributions
ward-0.506
reating-0.495
 about-0.450
tr-0.443
ained-0.443
Token!"
Token ID2474
Feature activation+1.76e+00

Pos logit contributions
 spe+1.273
chie+1.219
mer+1.204
 se+1.100
 blu+1.003
Neg logit contributions
ward-1.314
reating-1.283
 about-1.246
tr-1.148
ained-1.144
Token
Token ID220
Feature activation-1.11e-01

Pos logit contributions
reating+3.204
ward+3.034
live+2.849
icated+2.826
ained+2.760
Neg logit contributions
 spe-1.786
mer-1.637
chie-1.621
 se-1.615
 en-1.548
Token
Token ID198
Feature activation+4.13e-01

Pos logit contributions
 spe+0.168
mer+0.160
chie+0.157
 se+0.146
 en+0.136
Neg logit contributions
ward-0.147
reating-0.144
tr-0.130
 about-0.129
ained-0.129
Token
Token ID198
Feature activation+1.48e+00

Pos logit contributions
ward+0.593
reating+0.586
tr+0.530
ained+0.522
live+0.522
Neg logit contributions
 spe-0.568
mer-0.542
chie-0.535
 se-0.498
 en-0.465
TokenE
Token ID36
Feature activation+1.64e+00

Pos logit contributions
ward+2.532
reating+2.518
ained+2.316
icated+2.276
 about+2.259
Neg logit contributions
 spe-1.637
chie-1.579
mer-1.538
 se-1.441
 en-1.371
Tokenlla
Token ID8466
Feature activation-2.03e-01

Pos logit contributions
reating+2.408
ward+2.261
tr+2.120
ained+2.039
 about+2.037
Neg logit contributions
 spe-2.480
mer-2.474
chie-2.293
 se-2.086
 en-1.878
Token decided
Token ID3066
Feature activation-1.59e-01

Pos logit contributions
 spe+0.279
mer+0.262
chie+0.260
 se+0.235
 en+0.220
Neg logit contributions
ward-0.300
reating-0.290
tr-0.269
 about-0.267
live-0.266
Token to
Token ID284
Feature activation+2.64e-01

Pos logit contributions
 spe+0.175
mer+0.164
chie+0.160
 se+0.141
 en+0.128
Neg logit contributions
ward-0.282
reating-0.278
 about-0.257
tr-0.256
live-0.253
Token solve
Token ID8494
Feature activation+7.05e-02

Pos logit contributions
ward+0.436
reating+0.431
ained+0.394
tr+0.392
live+0.391
Neg logit contributions
 spe-0.313
mer-0.292
chie-0.291
 se-0.265
 en-0.241
Token the
Token ID262
Feature activation+3.73e-01

Pos logit contributions
ward+0.109
reating+0.108
tr+0.098
live+0.097
ained+0.097
Neg logit contributions
 spe-0.091
mer-0.086
chie-0.084
 se-0.077
 en-0.071
Token and
Token ID290
Feature activation+8.72e-01

Pos logit contributions
 spe+0.591
chie+0.563
mer+0.555
 se+0.516
 en+0.481
Neg logit contributions
ward-0.483
reating-0.465
 about-0.428
tr-0.422
ained-0.415
Token have
Token ID423
Feature activation-8.68e-01

Pos logit contributions
ward+1.426
reating+1.421
ained+1.314
live+1.310
tr+1.281
Neg logit contributions
 spe-1.076
mer-0.997
chie-0.952
 se-0.909
 en-0.836
Token fun
Token ID1257
Feature activation+2.88e-01

Pos logit contributions
mer+1.491
 spe+1.483
chie+1.456
 se+1.288
 en+1.231
Neg logit contributions
ward-1.022
reating-0.961
 about-0.904
ained-0.841
icated-0.831
Token.
Token ID13
Feature activation+6.34e-01

Pos logit contributions
ward+0.435
reating+0.431
tr+0.392
live+0.389
ained+0.384
Neg logit contributions
 spe-0.387
mer-0.370
chie-0.360
 se-0.333
 en-0.309
Token One
Token ID1881
Feature activation+7.10e-01

Pos logit contributions
reating+1.006
ward+0.999
live+0.889
tr+0.883
icated+0.876
Neg logit contributions
 spe-0.789
mer-0.748
chie-0.721
 se-0.670
 en-0.634
Token day
Token ID1110
Feature activation+1.64e+00

Pos logit contributions
ward+0.714
reating+0.710
tr+0.593
live+0.574
icated+0.570
Neg logit contributions
 spe-1.304
mer-1.281
chie-1.230
 se-1.191
 en-1.142
Token,
Token ID11
Feature activation+5.68e-01

Pos logit contributions
reating+2.551
ward+2.322
tr+2.248
live+2.170
ugg+2.138
Neg logit contributions
 spe-2.289
mer-2.169
 se-2.033
chie-2.018
 boat-1.877
Token Sam
Token ID3409
Feature activation-1.04e+00

Pos logit contributions
reating+0.965
ward+0.958
tr+0.877
live+0.869
ained+0.863
Neg logit contributions
 spe-0.650
mer-0.608
chie-0.582
 se-0.551
 en-0.498
Token saw
Token ID2497
Feature activation+7.04e-01

Pos logit contributions
 spe+1.427
chie+1.389
mer+1.281
 se+1.117
 blu+1.087
Neg logit contributions
ward-1.541
reating-1.496
 about-1.474
live-1.440
tr-1.405
Token a
Token ID257
Feature activation+3.25e-01

Pos logit contributions
ward+1.083
reating+1.082
tr+0.994
live+0.985
ained+0.982
Neg logit contributions
 spe-0.890
mer-0.825
chie-0.808
 se-0.803
 en-0.728
Token big
Token ID1263
Feature activation+5.15e-01

Pos logit contributions
ward+0.621
reating+0.614
ained+0.571
live+0.567
tr+0.566
Neg logit contributions
 spe-0.307
mer-0.279
chie-0.270
 se-0.247
 en-0.215
Token grass
Token ID8701
Feature activation-6.80e-01

Pos logit contributions
 spe+0.010
mer+0.009
chie+0.009
 se+0.008
 en+0.007
Neg logit contributions
ward-0.016
reating-0.016
tr-0.014
ained-0.014
live-0.014
Tokeny
Token ID88
Feature activation-4.79e-01

Pos logit contributions
 spe+1.001
chie+0.961
mer+0.934
 se+0.862
 en+0.808
Neg logit contributions
ward-0.951
reating-0.902
 about-0.852
ained-0.836
tr-0.835
Token hills
Token ID18639
Feature activation-1.21e+00

Pos logit contributions
 spe+0.624
chie+0.593
mer+0.585
 se+0.532
 en+0.496
Neg logit contributions
ward-0.733
reating-0.716
tr-0.660
 about-0.657
ained-0.653
Token.
Token ID13
Feature activation-8.23e-01

Pos logit contributions
 spe+1.912
chie+1.875
mer+1.799
 se+1.662
 dunno+1.607
Neg logit contributions
ward-1.560
reating-1.504
 about-1.448
ained-1.341
tr-1.298
Token One
Token ID1881
Feature activation+2.27e-02

Pos logit contributions
 spe+1.062
mer+1.041
chie+1.027
 se+0.930
 en+0.813
Neg logit contributions
ward-1.288
reating-1.237
 about-1.193
tr-1.182
ained-1.143
Token day
Token ID1110
Feature activation+1.49e+00

Pos logit contributions
ward+0.020
reating+0.019
tr+0.016
live+0.016
ained+0.016
Neg logit contributions
 spe-0.045
mer-0.044
chie-0.043
 se-0.041
 en-0.039
Token,
Token ID11
Feature activation-5.83e-01

Pos logit contributions
reating+2.297
ward+2.114
tr+2.035
ained+1.996
live+1.977
Neg logit contributions
 spe-2.184
mer-2.018
 se-1.842
chie-1.836
 en-1.708
Token the
Token ID262
Feature activation-4.89e-01

Pos logit contributions
 spe+0.797
chie+0.785
mer+0.775
 se+0.690
 en+0.637
Neg logit contributions
ward-0.866
reating-0.811
 about-0.763
tr-0.747
ained-0.739
Token cow
Token ID9875
Feature activation-1.80e-01

Pos logit contributions
 spe+0.502
chie+0.486
mer+0.480
 se+0.417
 en+0.372
Neg logit contributions
ward-0.882
reating-0.855
tr-0.813
live-0.806
 about-0.805
Token climbed
Token ID19952
Feature activation+3.91e-01

Pos logit contributions
 spe+0.218
mer+0.204
chie+0.203
 se+0.180
 en+0.166
Neg logit contributions
ward-0.294
reating-0.287
tr-0.268
 about-0.265
live-0.264
Token higher
Token ID2440
Feature activation-7.15e-02

Pos logit contributions
ward+0.562
reating+0.561
tr+0.506
live+0.504
icated+0.497
Neg logit contributions
 spe-0.549
mer-0.518
chie-0.506
 se-0.471
 en-0.439
Token was
Token ID373
Feature activation-2.37e-01

Pos logit contributions
 spe+1.329
chie+1.279
mer+1.226
 se+1.057
 skinny+1.041
Neg logit contributions
ward-1.283
reating-1.246
 about-1.178
ained-1.177
live-1.158
Token showing
Token ID4478
Feature activation+1.11e+00

Pos logit contributions
 spe+0.382
mer+0.367
chie+0.360
 se+0.332
 en+0.308
Neg logit contributions
ward-0.297
reating-0.286
 about-0.260
ained-0.258
tr-0.255
Token off
Token ID572
Feature activation+1.48e+00

Pos logit contributions
ward+1.362
reating+1.276
live+1.244
ained+1.197
tr+1.195
Neg logit contributions
 spe-1.737
chie-1.716
mer-1.661
 se-1.641
 en-1.566
Token his
Token ID465
Feature activation-5.61e-01

Pos logit contributions
reating+2.434
ward+2.412
live+2.331
tr+2.293
ained+2.235
Neg logit contributions
 spe-1.738
mer-1.564
chie-1.542
 se-1.541
 en-1.439
Token moves
Token ID6100
Feature activation-1.14e+00

Pos logit contributions
 spe+0.815
mer+0.773
chie+0.760
 se+0.675
 en+0.636
Neg logit contributions
ward-0.790
reating-0.763
 about-0.694
tr-0.690
ained-0.685
Token and
Token ID290
Feature activation+1.46e+00

Pos logit contributions
 spe+1.830
chie+1.655
mer+1.646
 se+1.501
 dunno+1.423
Neg logit contributions
ward-1.506
reating-1.482
 about-1.402
ained-1.322
tr-1.311
Token he
Token ID339
Feature activation-5.54e-01

Pos logit contributions
reating+2.356
ward+2.347
live+2.245
tr+2.170
oof+2.155
Neg logit contributions
 spe-1.783
mer-1.672
 se-1.672
chie-1.592
 en-1.521
Token accidentally
Token ID14716
Feature activation+5.01e-01

Pos logit contributions
 spe+0.787
chie+0.738
mer+0.737
 se+0.649
 skinny+0.613
Neg logit contributions
ward-0.796
reating-0.763
 about-0.712
ained-0.705
tr-0.703
Token broke
Token ID6265
Feature activation+5.12e-01

Pos logit contributions
reating+0.812
ward+0.808
live+0.734
ained+0.733
tr+0.728
Neg logit contributions
 spe-0.608
mer-0.564
chie-0.558
 se-0.527
 en-0.473
Token his
Token ID465
Feature activation-2.32e-01

Pos logit contributions
ward+0.727
reating+0.714
live+0.661
tr+0.657
ained+0.643
Neg logit contributions
 spe-0.724
mer-0.686
chie-0.681
 se-0.637
 en-0.591
Token toe
Token ID21189
Feature activation-9.46e-02

Pos logit contributions
 spe+0.286
mer+0.270
chie+0.265
 se+0.238
 en+0.219
Neg logit contributions
ward-0.374
reating-0.365
tr-0.336
 about-0.334
ained-0.332
Token feel
Token ID1254
Feature activation-5.76e-01

Pos logit contributions
reating+1.461
ward+1.454
tr+1.362
live+1.352
ained+1.298
Neg logit contributions
 spe-1.221
mer-1.143
chie-1.134
 se-1.068
 en-1.010
Token happier
Token ID23030
Feature activation-1.37e-02

Pos logit contributions
 spe+0.879
chie+0.852
mer+0.847
 se+0.759
 skinny+0.704
Neg logit contributions
ward-0.758
reating-0.724
 about-0.680
ained-0.654
live-0.632
Token.
Token ID13
Feature activation+6.65e-02

Pos logit contributions
 spe+0.019
mer+0.018
chie+0.018
 se+0.017
 en+0.016
Neg logit contributions
ward-0.020
reating-0.019
tr-0.017
 about-0.017
ained-0.017
Token Everyone
Token ID11075
Feature activation+4.25e-01

Pos logit contributions
ward+0.094
reating+0.093
tr+0.084
live+0.083
ained+0.083
Neg logit contributions
 spe-0.095
mer-0.090
chie-0.089
 se-0.082
 en-0.077
Token agreed
Token ID4987
Feature activation+2.78e-01

Pos logit contributions
ward+0.590
reating+0.588
tr+0.529
ained+0.524
live+0.520
Neg logit contributions
 spe-0.618
mer-0.592
chie-0.578
 se-0.546
 en-0.507
Token.
Token ID13
Feature activation+6.01e-01

Pos logit contributions
ward+0.409
reating+0.406
tr+0.371
ained+0.367
live+0.367
Neg logit contributions
 spe-0.378
mer-0.357
chie-0.351
 se-0.327
 en-0.302
Token Taking
Token ID20879
Feature activation-3.17e-01

Pos logit contributions
reating+0.870
ward+0.856
live+0.780
tr+0.778
ained+0.766
Neg logit contributions
 spe-0.832
mer-0.785
chie-0.775
 se-0.716
 en-0.676
Token the
Token ID262
Feature activation+5.68e-01

Pos logit contributions
 spe+0.463
mer+0.440
chie+0.438
 se+0.399
 en+0.369
Neg logit contributions
ward-0.438
reating-0.430
 about-0.391
ained-0.386
live-0.383
Token medicine
Token ID9007
Feature activation-4.03e-01

Pos logit contributions
reating+0.739
ward+0.739
ained+0.650
live+0.650
tr+0.648
Neg logit contributions
 spe-0.884
mer-0.841
chie-0.822
 se-0.780
 en-0.732
Token felt
Token ID2936
Feature activation-2.70e-01

Pos logit contributions
 spe+0.618
chie+0.586
mer+0.583
 se+0.534
 en+0.506
Neg logit contributions
ward-0.532
reating-0.512
 about-0.464
tr-0.463
live-0.456
Token nice
Token ID3621
Feature activation+7.05e-01

Pos logit contributions
 spe+0.415
mer+0.395
chie+0.394
 se+0.358
 skinny+0.332
Neg logit contributions
ward-0.355
reating-0.343
 about-0.314
ained-0.306
tr-0.303
Token All
Token ID1439
Feature activation-1.07e+00

Pos logit contributions
 spe+2.193
mer+2.076
chie+2.020
 se+1.902
 skinny+1.747
Neg logit contributions
ward-1.547
reating-1.386
 Afterwards-1.377
 about-1.315
 After-1.280
Token her
Token ID607
Feature activation-7.76e-01

Pos logit contributions
 spe+1.677
mer+1.587
chie+1.568
 se+1.438
 skinny+1.345
Neg logit contributions
ward-1.389
reating-1.371
 about-1.287
ained-1.210
 done-1.206
Token friends
Token ID2460
Feature activation-2.61e-01

Pos logit contributions
 spe+1.321
mer+1.276
chie+1.241
 se+1.137
 skinny+1.096
Neg logit contributions
ward-0.885
reating-0.833
 about-0.756
 done-0.730
nce-0.728
Token had
Token ID550
Feature activation-6.31e-01

Pos logit contributions
 spe+0.396
mer+0.377
chie+0.373
 se+0.343
 en+0.322
Neg logit contributions
ward-0.346
reating-0.335
 about-0.304
tr-0.301
live-0.298
Token different
Token ID1180
Feature activation+3.10e-01

Pos logit contributions
 spe+0.915
mer+0.896
chie+0.864
 se+0.780
 en+0.729
Neg logit contributions
ward-0.890
reating-0.856
 about-0.796
ained-0.759
tr-0.759
Token ideas
Token ID4213
Feature activation-1.14e-01

Pos logit contributions
ward+0.444
reating+0.435
live+0.397
tr+0.397
ained+0.392
Neg logit contributions
 spe-0.441
mer-0.418
chie-0.413
 se-0.382
 en-0.358
Token,
Token ID11
Feature activation-6.39e-01

Pos logit contributions
 spe+0.173
mer+0.165
chie+0.163
 se+0.151
 en+0.141
Neg logit contributions
ward-0.151
reating-0.148
 about-0.133
tr-0.132
ained-0.131
Token but
Token ID475
Feature activation+2.26e-01

Pos logit contributions
 spe+0.814
mer+0.792
chie+0.784
 se+0.684
 skinny+0.626
Neg logit contributions
ward-0.999
reating-0.982
 about-0.917
tr-0.880
ained-0.875
Token she
Token ID673
Feature activation-2.31e-01

Pos logit contributions
ward+0.367
reating+0.364
tr+0.334
live+0.331
ained+0.330
Neg logit contributions
 spe-0.274
mer-0.257
chie-0.252
 se-0.230
 en-0.212
Token couldn
Token ID3521
Feature activation-1.03e+00

Pos logit contributions
 spe+0.345
mer+0.326
chie+0.324
 se+0.297
 en+0.278
Neg logit contributions
ward-0.313
reating-0.303
live-0.277
 about-0.276
tr-0.275
Token't
Token ID470
Feature activation-9.74e-01

Pos logit contributions
 spe+1.469
mer+1.401
chie+1.329
 se+1.216
 blu+1.161
Neg logit contributions
ward-1.544
reating-1.469
 about-1.366
ained-1.349
live-1.349
Token He
Token ID679
Feature activation-6.57e-01

Pos logit contributions
ward+0.336
reating+0.335
tr+0.299
ained+0.298
live+0.298
Neg logit contributions
 spe-0.312
mer-0.295
chie-0.288
 se-0.265
 en-0.249
Token w
Token ID266
Feature activation+1.12e+00

Pos logit contributions
 spe+0.908
chie+0.887
mer+0.870
 se+0.767
 en+0.730
Neg logit contributions
ward-0.940
reating-0.896
tr-0.851
live-0.849
 about-0.848
Tokenagged
Token ID14655
Feature activation+1.11e+00

Pos logit contributions
reating+1.372
ward+1.366
tr+1.214
icated+1.148
ained+1.082
Neg logit contributions
 spe-1.841
chie-1.809
mer-1.769
 se-1.644
 en-1.579
Token his
Token ID465
Feature activation+2.10e-01

Pos logit contributions
reating+1.675
ward+1.666
tr+1.481
live+1.467
icated+1.464
Neg logit contributions
 spe-1.532
mer-1.427
chie-1.401
 se-1.313
 en-1.267
Token tail
Token ID7894
Feature activation+6.54e-01

Pos logit contributions
ward+0.316
reating+0.315
tr+0.280
ained+0.279
live+0.279
Neg logit contributions
 spe-0.285
mer-0.270
chie-0.266
 se-0.243
 en-0.227
Token and
Token ID290
Feature activation+8.29e-01

Pos logit contributions
reating+1.032
ward+1.008
ained+0.917
icated+0.914
live+0.913
Neg logit contributions
 spe-0.841
mer-0.794
chie-0.756
 se-0.719
 en-0.677
Token bark
Token ID21405
Feature activation-2.02e-01

Pos logit contributions
reating+1.329
ward+1.295
ained+1.195
tr+1.164
icated+1.159
Neg logit contributions
 spe-1.093
mer-1.009
chie-0.951
 se-0.942
 en-0.867
Tokened
Token ID276
Feature activation-2.99e-01

Pos logit contributions
 spe+0.389
mer+0.372
chie+0.372
 se+0.349
 en+0.331
Neg logit contributions
ward-0.185
reating-0.177
 about-0.153
tr-0.152
ained-0.149
Token,
Token ID11
Feature activation+9.46e-02

Pos logit contributions
 spe+0.447
chie+0.426
mer+0.424
 se+0.391
 en+0.359
Neg logit contributions
ward-0.402
reating-0.392
 about-0.360
tr-0.353
ained-0.346
Token "
Token ID366
Feature activation+3.88e-01

Pos logit contributions
ward+0.154
reating+0.152
tr+0.139
live+0.138
ained+0.138
Neg logit contributions
 spe-0.116
mer-0.109
chie-0.107
 se-0.097
 en-0.090
TokenCome
Token ID16773
Feature activation+7.00e-01

Pos logit contributions
ward+0.589
reating+0.576
 about+0.525
ained+0.518
tr+0.516
Neg logit contributions
 spe-0.523
mer-0.500
chie-0.490
 se-0.448
 en-0.414
Token
Token ID198
Feature activation+6.57e-01

Pos logit contributions
 spe+0.734
mer+0.708
chie+0.705
 se+0.635
 en+0.584
Neg logit contributions
ward-0.719
reating-0.688
tr-0.640
 about-0.636
ained-0.624
Token
Token ID198
Feature activation+6.35e-01

Pos logit contributions
ward+0.973
reating+0.963
tr+0.882
ained+0.864
live+0.863
Neg logit contributions
 spe-0.873
mer-0.817
chie-0.812
 se-0.768
 en-0.709
Token"
Token ID1
Feature activation-8.66e-01

Pos logit contributions
ward+0.914
reating+0.904
tr+0.822
 about+0.822
ained+0.814
Neg logit contributions
 spe-0.878
mer-0.838
chie-0.838
 se-0.776
 en-0.716
TokenNo
Token ID2949
Feature activation+8.93e-01

Pos logit contributions
 spe+1.258
chie+1.200
mer+1.190
 se+1.061
 en+1.013
Neg logit contributions
ward-1.219
reating-1.166
tr-1.108
live-1.074
ained-1.055
Token,
Token ID11
Feature activation-7.79e-01

Pos logit contributions
reating+1.139
ward+1.130
ained+1.045
tr+1.029
live+1.017
Neg logit contributions
 spe-1.418
chie-1.316
mer-1.292
 se-1.253
 en-1.152
Token Lily
Token ID20037
Feature activation-2.53e-01

Pos logit contributions
mer+1.066
 spe+1.052
chie+1.024
 se+0.900
 skinny+0.845
Neg logit contributions
ward-1.143
reating-1.118
 about-1.030
tr-1.019
icated-0.987
Token.
Token ID13
Feature activation-3.17e-01

Pos logit contributions
 spe+0.436
mer+0.417
chie+0.416
 se+0.387
 en+0.365
Neg logit contributions
ward-0.286
reating-0.274
 about-0.246
tr-0.245
live-0.243
Token I
Token ID314
Feature activation+4.24e-01

Pos logit contributions
 spe+0.367
mer+0.347
chie+0.339
 se+0.310
 en+0.276
Neg logit contributions
ward-0.536
reating-0.521
tr-0.494
ained-0.481
 about-0.481
Token want
Token ID765
Feature activation+2.22e-01

Pos logit contributions
ward+0.658
reating+0.652
tr+0.577
icated+0.572
ained+0.572
Neg logit contributions
 spe-0.563
mer-0.536
chie-0.517
 se-0.482
 en-0.445
Token some
Token ID617
Feature activation+1.01e+00

Pos logit contributions
ward+0.392
reating+0.387
tr+0.358
live+0.356
ained+0.356
Neg logit contributions
 spe-0.241
mer-0.222
chie-0.220
 se-0.200
 en-0.180
Token red
Token ID2266
Feature activation+7.21e-01

Pos logit contributions
ward+1.182
reating+1.128
live+1.064
ained+1.047
tr+1.026
Neg logit contributions
 spe-1.734
mer-1.603
chie-1.578
 se-1.557
 en-1.458
Token to
Token ID284
Feature activation+8.25e-01

Pos logit contributions
ward+0.782
reating+0.775
live+0.723
tr+0.722
ained+0.719
Neg logit contributions
 spe-0.422
mer-0.389
chie-0.383
 se-0.350
 en-0.318
Token share
Token ID2648
Feature activation+8.69e-01

Pos logit contributions
ward+1.216
reating+1.203
ained+1.101
live+1.075
tr+1.074
Neg logit contributions
 spe-1.118
chie-1.055
mer-1.053
 se-1.006
 en-0.915
Token the
Token ID262
Feature activation+4.23e-01

Pos logit contributions
ward+1.422
reating+1.381
live+1.301
tr+1.301
ained+1.298
Neg logit contributions
 spe-1.051
mer-0.981
chie-0.928
 se-0.918
 en-0.864
Token toy
Token ID13373
Feature activation-8.90e-01

Pos logit contributions
ward+0.673
reating+0.659
live+0.603
tr+0.598
ained+0.596
Neg logit contributions
 spe-0.534
mer-0.504
chie-0.492
 se-0.463
 en-0.430
Token.
Token ID13
Feature activation+4.97e-01

Pos logit contributions
 spe+1.493
chie+1.394
mer+1.356
 se+1.253
 blu+1.203
Neg logit contributions
ward-1.094
reating-1.087
 about-0.975
tr-0.967
icated-0.920
Token They
Token ID1119
Feature activation+1.32e+00

Pos logit contributions
reating+0.747
ward+0.745
live+0.665
tr+0.663
icated+0.656
Neg logit contributions
 spe-0.654
mer-0.621
chie-0.606
 se-0.568
 en-0.539
Token played
Token ID2826
Feature activation+3.17e-01

Pos logit contributions
reating+1.887
ward+1.819
ained+1.679
tr+1.632
live+1.619
Neg logit contributions
 spe-1.948
 se-1.806
mer-1.786
chie-1.731
 en-1.615
Token together
Token ID1978
Feature activation+7.75e-01

Pos logit contributions
ward+0.371
reating+0.368
tr+0.323
live+0.320
ained+0.319
Neg logit contributions
 spe-0.528
mer-0.509
chie-0.497
 se-0.471
 en-0.447
Token and
Token ID290
Feature activation+1.79e+00

Pos logit contributions
reating+1.151
ward+1.143
tr+1.042
ained+1.038
live+1.032
Neg logit contributions
 spe-1.049
mer-1.007
chie-0.946
 se-0.907
 en-0.857
Token laughed
Token ID13818
Feature activation+5.25e-01

Pos logit contributions
reating+2.801
ward+2.667
ained+2.456
tr+2.442
live+2.405
Neg logit contributions
 spe-2.564
mer-2.335
 se-2.235
chie-2.109
 en-2.071
Token a
Token ID257
Feature activation+5.45e-01

Pos logit contributions
reating+0.711
ward+0.705
live+0.641
tr+0.632
ained+0.626
Neg logit contributions
 spe-0.774
mer-0.741
chie-0.715
 se-0.681
 en-0.648
Token after
Token ID706
Feature activation-4.83e-01

Pos logit contributions
 spe+0.384
mer+0.366
chie+0.362
 se+0.336
 en+0.313
Neg logit contributions
ward-0.281
reating-0.277
 about-0.247
tr-0.240
ained-0.239
Token the
Token ID262
Feature activation+1.28e-01

Pos logit contributions
 spe+0.683
mer+0.656
chie+0.642
 se+0.581
 en+0.539
Neg logit contributions
ward-0.696
reating-0.686
 about-0.621
ained-0.599
live-0.596
Token environment
Token ID2858
Feature activation-4.73e-01

Pos logit contributions
ward+0.180
reating+0.177
tr+0.159
live+0.158
ained+0.158
Neg logit contributions
 spe-0.185
mer-0.176
chie-0.174
 se-0.160
 en-0.150
Token!
Token ID0
Feature activation+9.15e-01

Pos logit contributions
 spe+0.719
chie+0.679
mer+0.678
 se+0.621
 en+0.581
Neg logit contributions
ward-0.629
reating-0.615
 about-0.558
tr-0.543
ained-0.542
Token<|endoftext|>
Token ID50256
Feature activation+8.65e-01

Pos logit contributions
reating+1.396
ward+1.377
ained+1.253
tr+1.251
icated+1.230
Neg logit contributions
 spe-1.175
mer-1.107
chie-1.079
 se-0.984
 en-0.958
TokenOnce
Token ID7454
Feature activation-5.90e-01

Pos logit contributions
reating+1.108
ward+1.087
tr+0.967
live+0.945
icated+0.940
Neg logit contributions
 spe-1.346
mer-1.321
chie-1.245
 se-1.195
 en-1.109
Token upon
Token ID2402
Feature activation-2.38e-01

Pos logit contributions
 spe+1.059
mer+1.009
chie+0.998
 se+0.930
 skinny+0.877
Neg logit contributions
ward-0.631
reating-0.619
 about-0.570
ained-0.539
live-0.533
Token a
Token ID257
Feature activation-7.75e-01

Pos logit contributions
 spe+0.347
mer+0.336
chie+0.329
 se+0.298
 en+0.278
Neg logit contributions
ward-0.330
reating-0.323
 about-0.293
tr-0.289
ained-0.287
Token time
Token ID640
Feature activation-6.72e-02

Pos logit contributions
 spe+1.328
chie+1.304
mer+1.286
 se+1.192
 en+1.118
Neg logit contributions
ward-0.867
reating-0.810
 about-0.788
live-0.785
tr-0.762
Token,
Token ID11
Feature activation+1.74e-01

Pos logit contributions
 spe+0.117
mer+0.113
chie+0.111
 se+0.104
 en+0.098
Neg logit contributions
ward-0.074
reating-0.072
tr-0.063
 about-0.063
live-0.062
Token there
Token ID612
Feature activation+1.96e-01

Pos logit contributions
ward+0.307
reating+0.305
tr+0.279
live+0.277
ained+0.277
Neg logit contributions
 spe-0.189
mer-0.177
chie-0.173
 se-0.155
 en-0.141
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
ward+0.227
reating+0.224
tr+0.200
live+0.200
ained+0.199
Neg logit contributions
 spe-0.253
mer-0.242
chie-0.238
 se-0.221
 en-0.208
Token He
Token ID679
Feature activation+2.68e-01

Pos logit contributions
 spe+1.990
chie+1.948
mer+1.921
 se+1.694
 blu+1.655
Neg logit contributions
ward-1.636
reating-1.544
 about-1.527
tr-1.434
 Afterwards-1.395
Token was
Token ID373
Feature activation+9.63e-02

Pos logit contributions
ward+0.383
reating+0.379
ained+0.339
tr+0.336
live+0.335
Neg logit contributions
 spe-0.386
mer-0.365
chie-0.358
 se-0.337
 en-0.311
Token the
Token ID262
Feature activation-6.83e-02

Pos logit contributions
ward+0.160
reating+0.158
tr+0.145
live+0.144
ained+0.144
Neg logit contributions
 spe-0.114
mer-0.107
chie-0.105
 se-0.095
 en-0.088
Token store
Token ID3650
Feature activation-1.31e+00

Pos logit contributions
 spe+0.090
mer+0.085
chie+0.084
 se+0.076
 en+0.070
Neg logit contributions
ward-0.105
reating-0.104
tr-0.095
 about-0.093
ained-0.093
Token owner
Token ID4870
Feature activation-5.69e-01

Pos logit contributions
 spe+1.896
chie+1.801
mer+1.728
 se+1.581
 dunno+1.517
Neg logit contributions
ward-1.808
reating-1.787
 about-1.725
tr-1.691
 done-1.587
Token.
Token ID13
Feature activation-9.73e-01

Pos logit contributions
 spe+0.881
chie+0.826
mer+0.824
 se+0.758
 skinny+0.710
Neg logit contributions
ward-0.747
reating-0.743
 about-0.677
tr-0.649
live-0.644
Token He
Token ID679
Feature activation+2.99e-01

Pos logit contributions
 spe+1.486
chie+1.421
mer+1.413
 se+1.286
 blu+1.199
Neg logit contributions
ward-1.298
reating-1.245
 about-1.199
tr-1.141
ained-1.123
Token ran
Token ID4966
Feature activation+1.90e-01

Pos logit contributions
ward+0.416
reating+0.412
ained+0.367
tr+0.364
live+0.362
Neg logit contributions
 spe-0.439
mer-0.417
chie-0.408
 se-0.384
 en-0.357
Token after
Token ID706
Feature activation-9.17e-01

Pos logit contributions
ward+0.317
reating+0.311
tr+0.286
live+0.285
ained+0.284
Neg logit contributions
 spe-0.223
mer-0.211
chie-0.206
 se-0.187
 en-0.171
Token them
Token ID606
Feature activation+6.01e-01

Pos logit contributions
 spe+1.128
mer+1.073
chie+1.043
 se+0.905
 blu+0.865
Neg logit contributions
ward-1.520
reating-1.506
 about-1.359
icated-1.322
live-1.301
Token when
Token ID618
Feature activation-8.28e-01

Pos logit contributions
 spe+0.490
mer+0.464
chie+0.464
 se+0.423
 en+0.397
Neg logit contributions
ward-0.390
reating-0.387
 about-0.350
ained-0.337
live-0.335
Token she
Token ID673
Feature activation-9.00e-01

Pos logit contributions
 spe+0.996
mer+0.951
chie+0.924
 se+0.785
 skinny+0.727
Neg logit contributions
ward-1.409
reating-1.373
 about-1.267
ained-1.233
live-1.211
Token opened
Token ID4721
Feature activation-5.87e-01

Pos logit contributions
 spe+1.053
chie+1.011
mer+0.995
 se+0.829
 skinny+0.814
Neg logit contributions
ward-1.496
reating-1.414
 about-1.350
tr-1.333
live-1.329
Token the
Token ID262
Feature activation+2.79e-02

Pos logit contributions
 spe+0.714
mer+0.692
chie+0.673
 se+0.595
 en+0.549
Neg logit contributions
ward-0.962
reating-0.943
 about-0.873
ained-0.840
tr-0.838
Token box
Token ID3091
Feature activation-8.94e-01

Pos logit contributions
ward+0.044
reating+0.044
tr+0.040
ained+0.039
live+0.039
Neg logit contributions
 spe-0.036
mer-0.034
chie-0.033
 se-0.030
 en-0.028
Token.
Token ID13
Feature activation-5.45e-01

Pos logit contributions
 spe+1.406
chie+1.334
mer+1.317
 se+1.185
 en+1.135
Neg logit contributions
ward-1.194
reating-1.141
 about-1.090
tr-1.025
ained-1.002
Token She
Token ID1375
Feature activation-9.73e-01

Pos logit contributions
 spe+0.823
mer+0.783
chie+0.768
 se+0.700
 en+0.638
Neg logit contributions
ward-0.759
reating-0.730
 about-0.673
ained-0.661
tr-0.659
Token quickly
Token ID2952
Feature activation+4.12e-02

Pos logit contributions
 spe+1.364
mer+1.334
chie+1.329
 se+1.132
 skinny+1.096
Neg logit contributions
ward-1.370
reating-1.281
 about-1.250
tr-1.224
live-1.193
Token opened
Token ID4721
Feature activation-5.07e-01

Pos logit contributions
ward+0.065
reating+0.064
tr+0.059
live+0.058
ained+0.058
Neg logit contributions
 spe-0.052
mer-0.049
chie-0.048
 se-0.044
 en-0.040
Token it
Token ID340
Feature activation+3.73e-01

Pos logit contributions
 spe+0.634
mer+0.619
chie+0.601
 se+0.531
 en+0.493
Neg logit contributions
ward-0.814
reating-0.796
 about-0.736
tr-0.708
ained-0.707
Token and
Token ID290
Feature activation+2.20e-01

Pos logit contributions
ward+0.534
reating+0.534
ained+0.480
live+0.478
tr+0.476
Neg logit contributions
 spe-0.518
mer-0.495
chie-0.482
 se-0.451
 en-0.416
Token the
Token ID262
Feature activation+4.67e-01

Pos logit contributions
ward+0.028
reating+0.028
tr+0.025
 about+0.025
ained+0.025
Neg logit contributions
 spe-0.028
mer-0.027
chie-0.027
 se-0.025
 en-0.023
Token symbol
Token ID6194
Feature activation-2.17e-01

Pos logit contributions
ward+0.812
reating+0.806
live+0.744
ained+0.737
tr+0.736
Neg logit contributions
 spe-0.530
mer-0.497
chie-0.483
 se-0.443
 en-0.402
Token.
Token ID13
Feature activation+5.60e-01

Pos logit contributions
 spe+0.329
mer+0.313
chie+0.311
 se+0.285
 en+0.268
Neg logit contributions
ward-0.289
reating-0.283
 about-0.255
tr-0.253
ained-0.250
Token They
Token ID1119
Feature activation+1.15e+00

Pos logit contributions
reating+0.858
ward+0.852
live+0.766
tr+0.758
ained+0.755
Neg logit contributions
 spe-0.726
mer-0.694
chie-0.678
 se-0.616
 en-0.587
Token were
Token ID547
Feature activation+5.61e-01

Pos logit contributions
reating+1.646
ward+1.615
tr+1.514
ained+1.510
live+1.470
Neg logit contributions
 spe-1.670
mer-1.567
 se-1.528
chie-1.527
 en-1.380
Token no
Token ID645
Feature activation-4.59e-01

Pos logit contributions
reating+0.953
ward+0.948
tr+0.882
live+0.870
ained+0.860
Neg logit contributions
 spe-0.648
mer-0.590
chie-0.587
 se-0.536
 en-0.504
Token longer
Token ID2392
Feature activation-5.85e-01

Pos logit contributions
 spe+0.467
mer+0.439
chie+0.435
 se+0.376
 skinny+0.337
Neg logit contributions
ward-0.826
reating-0.823
 about-0.762
ained-0.747
icated-0.742
Token scared
Token ID12008
Feature activation-8.96e-01

Pos logit contributions
 spe+0.539
mer+0.500
chie+0.495
 se+0.421
 dunno+0.361
Neg logit contributions
ward-1.111
reating-1.090
 about-1.033
ained-1.016
 done-0.994
Token,
Token ID11
Feature activation-6.89e-01

Pos logit contributions
 spe+1.469
chie+1.392
mer+1.372
 se+1.276
 skinny+1.204
Neg logit contributions
reating-1.091
ward-1.071
 about-1.012
ained-0.917
icated-0.912
Token but
Token ID475
Feature activation+2.87e-01

Pos logit contributions
 spe+0.773
chie+0.739
mer+0.738
 se+0.638
 skinny+0.577
Neg logit contributions
ward-1.183
reating-1.156
 about-1.099
ained-1.051
tr-1.050
Token instead
Token ID2427
Feature activation+3.60e-01

Pos logit contributions
ward+0.469
reating+0.466
tr+0.428
live+0.424
ained+0.423
Neg logit contributions
 spe-0.348
mer-0.327
chie-0.319
 se-0.291
 en-0.270
Token.
Token ID13
Feature activation+3.64e-01

Pos logit contributions
ward+0.158
reating+0.156
tr+0.141
live+0.140
ained+0.139
Neg logit contributions
 spe-0.150
mer-0.143
chie-0.140
 se-0.129
 en-0.120
Token No
Token ID1400
Feature activation+3.26e-01

Pos logit contributions
ward+0.672
reating+0.668
live+0.613
tr+0.611
ained+0.609
Neg logit contributions
 spe-0.368
mer-0.334
chie-0.329
 se-0.291
 en-0.268
Token one
Token ID530
Feature activation-6.09e-01

Pos logit contributions
ward+0.496
reating+0.484
live+0.444
tr+0.442
ained+0.440
Neg logit contributions
 spe-0.439
mer-0.414
chie-0.406
 se-0.376
 en-0.348
Token understood
Token ID7247
Feature activation+9.62e-01

Pos logit contributions
 spe+0.903
chie+0.883
mer+0.862
 se+0.781
 en+0.735
Neg logit contributions
ward-0.804
reating-0.784
 about-0.742
tr-0.727
live-0.714
Token where
Token ID810
Feature activation-7.98e-01

Pos logit contributions
ward+1.451
reating+1.433
tr+1.336
live+1.319
ained+1.303
Neg logit contributions
 spe-1.277
mer-1.182
chie-1.134
 se-1.094
 en-1.029
Token it
Token ID340
Feature activation-7.93e-01

Pos logit contributions
 spe+0.968
mer+0.956
chie+0.904
 se+0.791
 en+0.729
Neg logit contributions
ward-1.309
reating-1.283
 about-1.203
tr-1.163
ained-1.153
Token came
Token ID1625
Feature activation-3.31e-01

Pos logit contributions
 spe+1.146
chie+1.107
mer+1.096
 se+0.973
 skinny+0.922
Neg logit contributions
ward-1.083
reating-1.047
 about-1.009
tr-0.996
ained-0.968
Token from
Token ID422
Feature activation-1.91e-01

Pos logit contributions
 spe+0.694
mer+0.674
chie+0.661
 se+0.626
 en+0.594
Neg logit contributions
ward-0.250
reating-0.248
 about-0.205
ained-0.197
tr-0.195
Token,
Token ID11
Feature activation-6.55e-01

Pos logit contributions
 spe+0.265
mer+0.252
chie+0.250
 se+0.227
 en+0.211
Neg logit contributions
ward-0.277
reating-0.273
 about-0.247
tr-0.245
ained-0.245
Token but
Token ID475
Feature activation+5.74e-01

Pos logit contributions
 spe+0.819
mer+0.797
chie+0.783
 se+0.682
 skinny+0.622
Neg logit contributions
ward-1.043
reating-1.018
 about-0.958
tr-0.931
ained-0.926
Token it
Token ID340
Feature activation-5.77e-01

Pos logit contributions
reating+1.041
ward+1.038
live+0.959
tr+0.958
ained+0.945
Neg logit contributions
 spe-0.600
mer-0.545
chie-0.537
 se-0.491
 en-0.451
Token was
Token ID373
Feature activation+4.61e-02

Pos logit contributions
ward+0.422
reating+0.419
tr+0.371
ained+0.367
live+0.365
Neg logit contributions
 spe-0.460
mer-0.438
chie-0.427
 se-0.401
 en-0.375
Token a
Token ID257
Feature activation-7.82e-01

Pos logit contributions
ward+0.067
reating+0.066
tr+0.059
live+0.059
ained+0.059
Neg logit contributions
 spe-0.065
mer-0.061
chie-0.060
 se-0.056
 en-0.052
Token mighty
Token ID18680
Feature activation+2.01e-01

Pos logit contributions
 spe+0.693
chie+0.685
mer+0.659
 se+0.523
 blu+0.474
Neg logit contributions
ward-1.487
reating-1.434
tr-1.373
live-1.355
 about-1.352
Token pot
Token ID1787
Feature activation+1.84e-01

Pos logit contributions
ward+0.353
reating+0.349
ained+0.321
tr+0.321
live+0.319
Neg logit contributions
 spe-0.218
mer-0.203
chie-0.198
 se-0.180
 en-0.162
Token.
Token ID13
Feature activation-9.83e-02

Pos logit contributions
ward+0.258
reating+0.254
tr+0.227
live+0.226
ained+0.226
Neg logit contributions
 spe-0.264
mer-0.252
chie-0.247
 se-0.230
 en-0.214
Token The
Token ID383
Feature activation-2.38e+00

Pos logit contributions
 spe+0.107
mer+0.101
chie+0.099
 se+0.089
 en+0.080
Neg logit contributions
ward-0.173
reating-0.170
tr-0.157
 about-0.156
ained-0.156
Token pot
Token ID1787
Feature activation+1.61e-01

Pos logit contributions
 spe+3.122
chie+3.083
mer+2.965
 Int+2.268
venth+2.234
Neg logit contributions
ward-3.902
 done-3.655
 stories-3.566
reating-3.489
 about-3.469
Token was
Token ID373
Feature activation+1.00e-01

Pos logit contributions
ward+0.208
reating+0.206
tr+0.181
live+0.180
ained+0.179
Neg logit contributions
 spe-0.250
mer-0.239
chie-0.235
 se-0.220
 en-0.206
Token so
Token ID523
Feature activation-8.32e-01

Pos logit contributions
ward+0.162
reating+0.160
tr+0.145
live+0.145
ained+0.144
Neg logit contributions
 spe-0.125
mer-0.118
chie-0.116
 se-0.106
 en-0.097
Token big
Token ID1263
Feature activation-4.63e-01

Pos logit contributions
 spe+0.870
mer+0.849
chie+0.844
 se+0.694
 Int+0.628
Neg logit contributions
ward-1.480
reating-1.393
 about-1.361
ained-1.328
live-1.328
Token and
Token ID290
Feature activation-1.31e+00

Pos logit contributions
 spe+0.798
chie+0.785
mer+0.766
 se+0.703
 en+0.669
Neg logit contributions
ward-0.514
reating-0.500
 about-0.459
tr-0.431
ained-0.430
Token said
Token ID531
Feature activation+8.79e-02

Pos logit contributions
 spe+2.409
mer+2.365
chie+2.312
 blu+1.993
 skinny+1.962
Neg logit contributions
ward-1.586
reating-1.437
 about-1.413
 what-1.369
 done-1.339
Token,
Token ID11
Feature activation+1.13e-01

Pos logit contributions
ward+0.101
reating+0.099
tr+0.087
live+0.086
ained+0.086
Neg logit contributions
 spe-0.149
mer-0.143
chie-0.141
 se-0.132
 en-0.125
Token "
Token ID366
Feature activation-1.54e+00

Pos logit contributions
ward+0.199
reating+0.197
tr+0.181
live+0.181
ained+0.180
Neg logit contributions
 spe-0.123
mer-0.114
chie-0.112
 se-0.101
 en-0.091
TokenMom
Token ID29252
Feature activation+1.27e+00

Pos logit contributions
 spe+2.877
chie+2.816
mer+2.787
 en+2.444
 skinny+2.424
Neg logit contributions
reating-1.632
ward-1.510
tr-1.405
live-1.296
ugg-1.274
Token,
Token ID11
Feature activation-1.99e-02

Pos logit contributions
ward+1.492
reating+1.479
live+1.328
ained+1.325
nce+1.283
Neg logit contributions
 spe-2.086
chie-1.966
mer-1.947
 se-1.879
 en-1.755
Token can
Token ID460
Feature activation-2.54e+00

Pos logit contributions
 spe+0.024
mer+0.023
chie+0.023
 se+0.021
 en+0.019
Neg logit contributions
ward-0.032
reating-0.032
tr-0.029
 about-0.029
live-0.029
Token we
Token ID356
Feature activation-1.04e+00

Pos logit contributions
mer+3.248
chie+2.901
 spe+2.861
 abrupt+2.308
rily+2.268
Neg logit contributions
ward-4.430
 what-4.197
mm-3.974
reating-3.970
 about-3.952
Token fix
Token ID4259
Feature activation-2.01e-01

Pos logit contributions
mer+1.419
 spe+1.419
chie+1.359
 se+1.147
 skinny+1.136
Neg logit contributions
ward-1.513
reating-1.462
tr-1.387
 about-1.361
live-1.314
Token the
Token ID262
Feature activation-2.03e+00

Pos logit contributions
 spe+0.259
mer+0.247
chie+0.243
 se+0.218
 en+0.201
Neg logit contributions
ward-0.312
reating-0.309
 about-0.278
tr-0.278
ained-0.276
Token pigeon
Token ID46387
Feature activation-3.19e-01

Pos logit contributions
mer+2.410
chie+2.392
 spe+2.364
 Int+1.996
 dunno+1.946
Neg logit contributions
ward-3.282
reating-3.220
tr-3.062
 done-3.048
ained-2.953
Token?
Token ID30
Feature activation-1.16e+00

Pos logit contributions
 spe+0.508
mer+0.480
chie+0.478
 se+0.438
 en+0.411
Neg logit contributions
ward-0.404
reating-0.399
tr-0.359
 about-0.358
ained-0.352
Token\n
Token ID198
Feature activation+1.53e+00

Pos logit contributions
reating+1.553
ward+1.500
tr+1.385
ained+1.365
live+1.361
Neg logit contributions
 spe-1.228
mer-1.155
chie-1.140
 se-1.093
 en-1.011
TokenA
Token ID32
Feature activation+2.99e-01

Pos logit contributions
reating+2.772
ward+2.532
 about+2.416
tr+2.402
ained+2.401
Neg logit contributions
 spe-1.698
mer-1.665
chie-1.611
 se-1.577
 skinny-1.383
Token girl
Token ID2576
Feature activation-1.16e+00

Pos logit contributions
ward+0.533
reating+0.529
ained+0.487
live+0.485
tr+0.484
Neg logit contributions
 spe-0.323
mer-0.299
chie-0.293
 se-0.268
 en-0.237
Token came
Token ID1625
Feature activation-1.39e+00

Pos logit contributions
chie+1.471
mer+1.445
 spe+1.443
 blu+1.209
 skinny+1.205
Neg logit contributions
ward-1.784
reating-1.742
 about-1.709
tr-1.638
 what-1.606
Token over
Token ID625
Feature activation-6.66e-01

Pos logit contributions
 spe+2.365
mer+2.195
chie+2.117
 se+1.932
 redd+1.932
Neg logit contributions
reating-1.710
ward-1.665
 about-1.567
ained-1.460
 what-1.457
Token to
Token ID284
Feature activation-2.23e+00

Pos logit contributions
 spe+1.012
chie+0.981
mer+0.962
 se+0.852
 blu+0.841
Neg logit contributions
ward-0.898
reating-0.875
 about-0.807
ained-0.756
live-0.749
Token them
Token ID606
Feature activation+2.72e-01

Pos logit contributions
 spe+3.736
chie+3.668
mer+3.633
 skinny+3.212
 redd+3.170
Neg logit contributions
ward-2.869
 what-2.809
reating-2.753
 Enjoy-2.509
 about-2.436
Token.
Token ID13
Feature activation-9.64e-01

Pos logit contributions
ward+0.415
reating+0.409
live+0.372
tr+0.372
ained+0.371
Neg logit contributions
 spe-0.359
mer-0.341
chie-0.333
 se-0.309
 en-0.287
Token She
Token ID1375
Feature activation-7.15e-01

Pos logit contributions
 spe+1.344
chie+1.313
mer+1.311
 se+1.117
 blu+1.035
Neg logit contributions
ward-1.413
reating-1.368
 about-1.345
tr-1.281
 Afterwards-1.242
Token had
Token ID550
Feature activation-1.44e+00

Pos logit contributions
 spe+0.911
mer+0.902
chie+0.900
 se+0.752
 blu+0.735
Neg logit contributions
ward-1.091
reating-1.057
 about-1.014
tr-1.008
live-0.983
Token a
Token ID257
Feature activation-6.25e-01

Pos logit contributions
mer+2.343
 spe+2.111
chie+2.086
rily+1.799
rier+1.768
Neg logit contributions
ward-2.012
reating-1.886
 done-1.859
 about-1.793
tr-1.741
Token woman
Token ID2415
Feature activation-5.81e-01

Pos logit contributions
 spe+0.679
mer+0.651
chie+0.648
 se+0.535
 en+0.484
Neg logit contributions
ward-1.196
reating-1.192
tr-1.112
 about-1.112
live-1.089
Token wanted
Token ID2227
Feature activation-3.68e-01

Pos logit contributions
 spe+0.805
chie+0.774
mer+0.764
 se+0.664
 blu+0.636
Neg logit contributions
ward-0.836
reating-0.812
 about-0.774
tr-0.759
live-0.745
Token to
Token ID284
Feature activation-8.60e-01

Pos logit contributions
 spe+0.419
mer+0.408
chie+0.398
 se+0.339
 en+0.317
Neg logit contributions
ward-0.633
reating-0.629
 about-0.581
tr-0.571
live-0.567
Token help
Token ID1037
Feature activation-1.38e+00

Pos logit contributions
 spe+1.032
mer+1.018
chie+0.960
 se+0.809
 en+0.781
Neg logit contributions
ward-1.417
reating-1.402
 about-1.305
tr-1.280
live-1.268
Token the
Token ID262
Feature activation-1.98e+00

Pos logit contributions
 spe+2.148
mer+2.114
chie+2.107
 se+1.836
 blu+1.784
Neg logit contributions
reating-1.832
ward-1.778
 about-1.725
 what-1.581
 stories-1.525
Token little
Token ID1310
Feature activation-2.63e+00

Pos logit contributions
 spe+2.322
chie+2.300
mer+2.282
 se+1.700
 dunno+1.660
Neg logit contributions
reating-3.484
 about-3.329
ward-3.222
 stories-3.137
tr-3.082
Token girl
Token ID2576
Feature activation-9.94e-02

Pos logit contributions
chie+3.646
mer+3.282
 spe+3.260
 se+2.813
 Int+2.797
Neg logit contributions
reating-4.122
ward-3.747
 about-3.697
tr-3.565
 what-3.516
Token,
Token ID11
Feature activation-1.45e+00

Pos logit contributions
 spe+0.134
mer+0.127
chie+0.126
 se+0.115
 en+0.105
Neg logit contributions
ward-0.149
reating-0.146
 about-0.133
tr-0.132
ained-0.131
Token so
Token ID523
Feature activation-8.81e-01

Pos logit contributions
mer+2.020
 spe+1.926
chie+1.862
 se+1.533
ihu+1.443
Neg logit contributions
 about-2.346
ward-2.309
reating-2.248
 what-2.072
 done-2.051
Token she
Token ID673
Feature activation-8.90e-01

Pos logit contributions
chie+1.051
mer+1.050
 spe+1.049
 se+0.873
 blu+0.800
Neg logit contributions
ward-1.473
reating-1.400
 about-1.342
live-1.296
ained-1.269
Token decided
Token ID3066
Feature activation-4.70e-01

Pos logit contributions
 spe+1.301
chie+1.246
mer+1.243
 se+1.052
 blu+1.049
Neg logit contributions
ward-1.225
reating-1.163
 about-1.106
tr-1.102
live-1.088
Token One
Token ID1881
Feature activation+2.76e-02

Pos logit contributions
 spe+0.020
mer+0.019
chie+0.019
 se+0.017
 en+0.016
Neg logit contributions
ward-0.026
reating-0.025
tr-0.023
 about-0.023
ained-0.023
Token day
Token ID1110
Feature activation+1.86e+00

Pos logit contributions
ward+0.023
reating+0.022
tr+0.018
live+0.018
ained+0.018
Neg logit contributions
 spe-0.056
mer-0.054
chie-0.054
 se-0.051
 en-0.048
Token,
Token ID11
Feature activation-2.69e-01

Pos logit contributions
reating+3.061
ward+2.813
tr+2.719
live+2.681
ained+2.677
Neg logit contributions
 spe-2.557
mer-2.332
 se-2.161
chie-2.087
 en-2.009
Token he
Token ID339
Feature activation-8.46e-01

Pos logit contributions
 spe+0.364
chie+0.348
mer+0.348
 se+0.310
 en+0.286
Neg logit contributions
ward-0.405
reating-0.394
 about-0.359
tr-0.357
ained-0.353
Token took
Token ID1718
Feature activation+1.37e+00

Pos logit contributions
 spe+1.170
chie+1.138
mer+1.077
 se+0.926
 skinny+0.911
Neg logit contributions
ward-1.229
reating-1.187
 about-1.119
ained-1.113
tr-1.099
Token his
Token ID465
Feature activation-2.24e+00

Pos logit contributions
ward+2.250
reating+2.211
live+2.133
tr+2.093
ouncing+2.072
Neg logit contributions
 spe-1.698
chie-1.519
mer-1.519
 se-1.512
 en-1.401
Token boat
Token ID8848
Feature activation-2.61e-02

Pos logit contributions
 spe+2.022
chie+1.897
mer+1.854
 dunno+1.402
 Int+1.361
Neg logit contributions
ward-4.533
reating-4.144
 done-4.057
 stories-4.039
tr-4.009
Token out
Token ID503
Feature activation-2.33e-01

Pos logit contributions
 spe+0.034
mer+0.032
chie+0.031
 se+0.028
 en+0.026
Neg logit contributions
ward-0.041
reating-0.040
tr-0.036
 about-0.036
ained-0.036
Token on
Token ID319
Feature activation-1.04e-02

Pos logit contributions
 spe+0.360
mer+0.345
chie+0.340
 se+0.312
 en+0.293
Neg logit contributions
ward-0.305
reating-0.298
 about-0.269
tr-0.265
ained-0.265
Token the
Token ID262
Feature activation-5.97e-01

Pos logit contributions
 spe+0.014
mer+0.013
chie+0.013
 se+0.012
 en+0.011
Neg logit contributions
ward-0.016
reating-0.016
tr-0.014
 about-0.014
ained-0.014
Token water
Token ID1660
Feature activation-9.48e-01

Pos logit contributions
 spe+0.766
mer+0.725
chie+0.722
 se+0.618
 en+0.591
Neg logit contributions
ward-0.946
reating-0.927
tr-0.854
 about-0.838
ained-0.828