TOP ACTIVATIONS
MAX = 4.658

and yawn ed . He was hungry and wanted to find
." Tom nods . He is sleepy . He says ,
Anna nodded . She was a brave girl . She
pretty , but Tweet ie was scared it would hurt her
. L ila was very happy . She thanked
?" L ila was surprised . She did not
?" S ara is surprised . She says ,
. S ara is happy and hopeful . She
sky got dark and he was tired . So he decided
?" L ily was happy to have a new
L ily nodded . She was sorry for making Mom my
things ." Anna was eager to hear the story
?" L ily was curious and said , "
." S ara is sad . She says ,
goodbye to Anna ." Ben is sad . He does not
asleep . The turkey was very happy . <|endoftext|> One sunny
." S ara is not happy . She likes
?" L ila was excited . She said ,
. L ily is curious . She wants to
?" S ara is happy . She gives Ben

MIN ACTIVATIONS
MIN = -4.724

!" Mia says . She follows Sam to the cave .
!" Lily says . She follows him with her sled .
of the hill . Ben follows her . They look down
to the box . Jen follows him . But
Sam tries to scare things with the lion ,
let 's go ." He followed Lily up the ladder and
cave . Sam follows him . He does not
." M ia chased after him . She wanted
knees ." Sam followed them into the tunnel .
." L ily followed Sam . She was happy
!" Sam says . He follows Tom and slides down .
wrong . They tried to comfort them and explain that sometimes
cries . Mom hugs Sam and tells him it
house . Mom hugs them and says she is
She looked anxious . She hugged them and took them to
!" Sam says . He follows Tim and slides down .
again ." Mom hugs them . " That 's
Sam smiled . He followed Tom and they looked for
pull some beans . Sam followed him , but he felt
if it fits !" She helped Sil via put on the

INTERVAL 2.312 - 4.658
CONTAINS 2.699%

she imagined . The flowers were huge and so colorful !
R avi , your bird is very pretty !" Mir a
are right . The sea is not for me . The
on a bench . She was reading a book . She
a new way . <|endoftext|> Once upon a time there were

INTERVAL -0.033 - 2.312
CONTAINS 48.748%

upon a time , there was a little boy named Ben
learned something new . From that day on , Tim my
? The air outside is not clear , remember to stay
a way to care for them all so he could keep
sc ooter , but Emily said no . Lily felt sad

INTERVAL -2.378 - -0.033
CONTAINS 47.279%

walking . As they walked , they saw more animals and
The end . <|endoftext|> L ily and Ben are friends .
sad . They did not want to miss their holiday .
But then , Tom swung his stick too hard and
gifts that he has been given . Fred die

INTERVAL -4.724 - -2.378
CONTAINS 1.274%

said to Ben . " Okay , that sounds
nicely why Ell a couldn 't have all the books but
touch Max . " Y uck , I don
for some . He gave them each a cone with a
just couldn 't decide . Then Bill had a mighty idea
Token and
Token ID290
Feature activation+1.55e-01

Pos logit contributions
 glad+0.815
 lucky+0.710
 important+0.687
 supposed+0.668
 pleased+0.667
Neg logit contributions
mine-0.502
al-0.465
talk-0.439
als-0.429
ted-0.426
Token yawn
Token ID40908
Feature activation+1.32e-01

Pos logit contributions
 glad+0.386
 lucky+0.331
 important+0.324
 supposed+0.322
 pleased+0.314
Neg logit contributions
mine-0.262
al-0.248
talk-0.235
ted-0.234
als-0.234
Tokened
Token ID276
Feature activation+3.71e-01

Pos logit contributions
 glad+0.375
 lucky+0.327
 important+0.318
 supposed+0.313
 pleased+0.310
Neg logit contributions
mine-0.184
al-0.169
als-0.156
ted-0.155
talk-0.155
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
 glad+0.897
 lucky+0.762
 supposed+0.745
 important+0.733
 pleased+0.720
Neg logit contributions
mine-0.659
al-0.628
ted-0.589
als-0.586
talk-0.583
Token He
Token ID679
Feature activation+1.26e+00

Pos logit contributions
 glad+3.534
 supposed+3.071
 lucky+2.984
 important+2.966
 pleased+2.925
Neg logit contributions
mine-2.733
al-2.611
ted-2.594
als-2.537
of-2.494
Token was
Token ID373
Feature activation+4.66e+00

Pos logit contributions
 glad+3.150
 supposed+2.702
 lucky+2.648
 important+2.569
 pleased+2.549
Neg logit contributions
mine-2.196
al-2.117
ted-2.015
talk-1.980
of-1.974
Token hungry
Token ID14720
Feature activation-1.56e-01

Pos logit contributions
 fault+10.133
 valuable+9.800
 glad+9.626
 rewarding+9.530
 pleasure+9.326
Neg logit contributions
mine-8.601
al-8.160
ting-8.123
ted-7.648
 him-7.618
Token and
Token ID290
Feature activation+2.40e+00

Pos logit contributions
mine+0.257
al+0.243
talk+0.227
als+0.225
of+0.223
Neg logit contributions
 glad-0.394
 lucky-0.339
 important-0.328
 pleased-0.322
 supposed-0.321
Token wanted
Token ID2227
Feature activation+6.86e-01

Pos logit contributions
 glad+5.639
 fault+5.220
 supposed+5.197
 lucky+4.802
 important+4.694
Neg logit contributions
mine-4.005
ted-3.870
al-3.724
Give-3.679
als-3.565
Token to
Token ID284
Feature activation-3.44e-01

Pos logit contributions
 glad+1.547
 lucky+1.276
 pleased+1.260
 supposed+1.210
 important+1.208
Neg logit contributions
mine-1.305
al-1.261
talk-1.237
ted-1.225
of-1.197
Token find
Token ID1064
Feature activation-7.44e-02

Pos logit contributions
mine+0.526
al+0.485
talk+0.457
of+0.442
als+0.442
Neg logit contributions
 glad-0.927
 lucky-0.807
 important-0.785
 supposed-0.767
 pleased-0.767
Token."
Token ID526
Feature activation-2.68e-01

Pos logit contributions
 glad+3.131
 lucky+2.671
 supposed+2.591
 pleased+2.586
 important+2.572
Neg logit contributions
mine-2.097
al-2.036
talk-1.875
als-1.797
ted-1.770
Token Tom
Token ID4186
Feature activation-1.03e-01

Pos logit contributions
mine+0.592
al+0.559
talk+0.545
of+0.533
als+0.532
Neg logit contributions
 glad-0.533
 lucky-0.445
 important-0.429
 supposed-0.418
 pleased-0.414
Token nods
Token ID34833
Feature activation+4.21e-01

Pos logit contributions
mine+0.186
al+0.176
talk+0.168
of+0.165
als+0.164
Neg logit contributions
 glad-0.245
 lucky-0.210
 important-0.204
 supposed-0.201
 pleased-0.198
Token.
Token ID13
Feature activation+1.35e+00

Pos logit contributions
 glad+1.033
 lucky+0.892
 supposed+0.876
 important+0.859
 pleased+0.834
Neg logit contributions
mine-0.741
al-0.680
talk-0.652
of-0.634
ted-0.624
Token He
Token ID679
Feature activation+9.45e-01

Pos logit contributions
 glad+3.369
 lucky+2.942
 supposed+2.929
 important+2.849
 pleased+2.838
Neg logit contributions
mine-2.139
talk-2.099
al-2.056
ted-2.004
of-2.001
Token is
Token ID318
Feature activation+4.64e+00

Pos logit contributions
 glad+2.315
 lucky+1.957
 supposed+1.936
 important+1.915
 pleased+1.908
Neg logit contributions
al-1.594
mine-1.587
als-1.496
ted-1.483
of-1.478
Token sleepy
Token ID42023
Feature activation+6.54e-01

Pos logit contributions
 fault+10.366
 valuable+10.178
 tempting+10.172
 rewarding+9.328
 important+9.163
Neg logit contributions
mine-8.532
al-7.816
Give-7.009
 cash-6.853
 her-6.850
Token.
Token ID13
Feature activation+1.89e+00

Pos logit contributions
 glad+1.584
 lucky+1.345
 important+1.288
 pleased+1.286
 supposed+1.285
Neg logit contributions
mine-1.186
al-1.108
talk-1.050
als-1.013
of-1.010
Token He
Token ID679
Feature activation+1.64e+00

Pos logit contributions
 glad+4.561
 supposed+4.022
 lucky+3.970
 important+3.885
 pleased+3.870
Neg logit contributions
mine-3.007
talk-2.967
of-2.919
al-2.907
ted-2.841
Token says
Token ID1139
Feature activation+8.90e-01

Pos logit contributions
 glad+3.962
 supposed+3.329
 lucky+3.307
 pleased+3.287
 important+3.227
Neg logit contributions
al-2.846
mine-2.787
als-2.704
ted-2.697
of-2.653
Token,
Token ID11
Feature activation+1.51e+00

Pos logit contributions
 glad+2.249
 supposed+1.974
 lucky+1.928
 important+1.884
 pleased+1.843
Neg logit contributions
mine-1.424
al-1.349
als-1.288
of-1.272
ted-1.254
Token
Token ID198
Feature activation-1.10e+00

Pos logit contributions
 glad+1.984
 supposed+1.653
 lucky+1.645
 important+1.606
 pleased+1.525
Neg logit contributions
mine-2.531
al-2.389
of-2.380
talk-2.304
ted-2.251
TokenAnna
Token ID31160
Feature activation-3.57e-01

Pos logit contributions
mine+1.475
al+1.346
talk+1.268
als+1.230
ted+1.227
Neg logit contributions
 glad-3.183
 lucky-2.810
 important-2.723
 pleased-2.667
 supposed-2.625
Token nodded
Token ID14464
Feature activation+4.39e-01

Pos logit contributions
mine+0.652
al+0.614
talk+0.581
als+0.569
of+0.565
Neg logit contributions
 glad-0.852
 lucky-0.730
 important-0.707
 supposed-0.694
 pleased-0.681
Token.
Token ID13
Feature activation+6.36e-01

Pos logit contributions
 glad+1.146
 lucky+0.998
 supposed+0.984
 important+0.966
 pleased+0.934
Neg logit contributions
mine-0.709
al-0.649
talk-0.593
ted-0.590
of-0.586
Token She
Token ID1375
Feature activation+1.98e-01

Pos logit contributions
 glad+1.687
 lucky+1.485
 important+1.451
 supposed+1.447
 pleased+1.405
Neg logit contributions
mine-0.974
al-0.942
als-0.894
ted-0.851
talk-0.845
Token was
Token ID373
Feature activation+4.50e+00

Pos logit contributions
 glad+0.468
 lucky+0.397
 supposed+0.384
 important+0.382
 pleased+0.372
Neg logit contributions
mine-0.360
al-0.344
talk-0.322
ted-0.319
als-0.319
Token a
Token ID257
Feature activation+1.57e+00

Pos logit contributions
 fault+9.978
 valuable+9.412
 glad+9.324
 tempting+9.211
 rewarding+8.860
Neg logit contributions
mine-8.193
al-7.972
Give-6.784
 cash-6.647
 her-6.458
Token brave
Token ID14802
Feature activation-8.25e-01

Pos logit contributions
 glad+3.209
 supposed+2.838
 important+2.604
 lucky+2.583
 fault+2.533
Neg logit contributions
mine-3.077
al-2.986
 cash-2.862
of-2.753
talk-2.699
Token girl
Token ID2576
Feature activation+1.81e-02

Pos logit contributions
mine+1.527
al+1.427
talk+1.365
als+1.336
ted+1.327
Neg logit contributions
 glad-1.953
 lucky-1.677
 important-1.617
 pleased-1.563
 supposed-1.562
Token.
Token ID13
Feature activation+9.18e-01

Pos logit contributions
 glad+0.042
 lucky+0.036
 important+0.035
 supposed+0.034
 pleased+0.034
Neg logit contributions
mine-0.034
al-0.032
talk-0.030
als-0.030
ted-0.029
Token She
Token ID1375
Feature activation+2.67e-01

Pos logit contributions
 glad+2.498
 lucky+2.229
 supposed+2.211
 important+2.193
 pleased+2.105
Neg logit contributions
mine-1.324
al-1.280
als-1.216
ted-1.154
of-1.148
Token pretty
Token ID2495
Feature activation-2.20e-02

Pos logit contributions
 fault+6.023
 glad+6.007
 supposed+5.661
 pleasure+4.677
 lucky+4.410
Neg logit contributions
mine-7.782
al-7.440
als-7.145
of-7.008
Give-6.918
Token,
Token ID11
Feature activation+7.94e-01

Pos logit contributions
mine+0.036
al+0.034
als+0.032
talk+0.031
ted+0.031
Neg logit contributions
 glad-0.057
 lucky-0.049
 important-0.047
 pleased-0.047
 supposed-0.046
Token but
Token ID475
Feature activation+8.62e-01

Pos logit contributions
 glad+2.032
 lucky+1.768
 supposed+1.688
 important+1.683
 pleased+1.674
Neg logit contributions
mine-1.252
al-1.153
ted-1.101
als-1.072
talk-1.038
Token Tweet
Token ID18752
Feature activation+6.32e-01

Pos logit contributions
 glad+2.358
 supposed+2.072
 lucky+2.031
 pleased+1.993
 important+1.970
Neg logit contributions
mine-1.176
al-1.123
ted-1.062
als-1.035
of-0.990
Tokenie
Token ID494
Feature activation+1.12e+00

Pos logit contributions
 glad+2.214
 lucky+1.978
 important+1.917
 pleased+1.915
 supposed+1.903
Neg logit contributions
mine-0.505
al-0.433
of-0.361
x-0.344
ted-0.329
Token was
Token ID373
Feature activation+4.50e+00

Pos logit contributions
 glad+2.613
 lucky+2.170
 supposed+2.121
 important+2.088
 pleased+2.081
Neg logit contributions
mine-2.148
al-2.097
als-1.920
ted-1.911
of-1.873
Token scared
Token ID12008
Feature activation+1.16e+00

Pos logit contributions
 glad+8.288
 fault+8.171
 pleasure+7.682
aj+7.539
 tempting+7.364
Neg logit contributions
al-9.574
mine-9.430
ting-8.077
 each-8.064
als-7.812
Token it
Token ID340
Feature activation+1.06e+00

Pos logit contributions
 glad+2.876
 lucky+2.456
 pleased+2.434
 supposed+2.409
 important+2.343
Neg logit contributions
mine-1.881
al-1.799
ted-1.729
of-1.712
als-1.664
Token would
Token ID561
Feature activation+8.05e-01

Pos logit contributions
 glad+2.593
 lucky+2.133
 pleased+2.120
 important+2.021
 supposed+2.017
Neg logit contributions
mine-1.971
al-1.829
ted-1.732
of-1.731
als-1.724
Token hurt
Token ID5938
Feature activation-9.04e-01

Pos logit contributions
 glad+2.023
 lucky+1.736
 important+1.685
 pleased+1.655
 supposed+1.641
Neg logit contributions
mine-1.386
al-1.237
of-1.196
talk-1.192
als-1.149
Token her
Token ID607
Feature activation+1.40e+00

Pos logit contributions
mine+1.688
al+1.580
talk+1.520
als+1.479
ted+1.469
Neg logit contributions
 glad-2.129
 lucky-1.829
 important-1.765
 supposed-1.709
 pleased-1.690
Token.
Token ID13
Feature activation+1.07e+00

Pos logit contributions
 glad+1.132
 lucky+0.966
 supposed+0.919
 important+0.917
 pleased+0.911
Neg logit contributions
mine-0.809
al-0.759
talk-0.712
als-0.708
of-0.695
Token
Token ID198
Feature activation+1.09e+00

Pos logit contributions
 glad+2.611
 supposed+2.300
 lucky+2.299
 important+2.284
 pleased+2.205
Neg logit contributions
mine-1.772
al-1.691
ted-1.624
als-1.624
of-1.571
Token
Token ID198
Feature activation-2.26e+00

Pos logit contributions
 glad+2.009
 lucky+1.684
 important+1.642
 supposed+1.635
 pleased+1.567
Neg logit contributions
mine-2.564
of-2.447
al-2.378
talk-2.337
ted-2.317
TokenL
Token ID43
Feature activation-7.61e-01

Pos logit contributions
mine+3.115
al+2.857
talk+2.680
x+2.594
als+2.580
Neg logit contributions
 glad-6.846
 lucky-6.031
 important-5.830
 pleased-5.673
 tired-5.570
Tokenila
Token ID10102
Feature activation+7.34e-02

Pos logit contributions
mine+1.185
al+1.093
talk+1.037
als+1.005
ted+0.997
Neg logit contributions
 glad-2.030
 lucky-1.774
 important-1.718
 supposed-1.679
 pleased-1.669
Token was
Token ID373
Feature activation+4.50e+00

Pos logit contributions
 glad+0.178
 lucky+0.153
 important+0.149
 supposed+0.147
 pleased+0.143
Neg logit contributions
mine-0.130
al-0.124
talk-0.116
als-0.114
of-0.114
Token very
Token ID845
Feature activation+3.63e+00

Pos logit contributions
 fault+9.209
 valuable+8.747
 glad+8.455
 tempting+8.445
 rewarding+8.300
Neg logit contributions
mine-9.124
al-8.690
Give-7.770
 cash-7.641
talk-7.366
Token happy
Token ID3772
Feature activation+1.56e-01

Pos logit contributions
 fault+4.467
 supposed+3.969
 glad+3.820
 important+3.413
 pleasure+3.265
Neg logit contributions
mine-9.810
al-9.444
of-9.036
Give-8.886
talk-8.856
Token.
Token ID13
Feature activation+6.69e-01

Pos logit contributions
 glad+0.378
 lucky+0.326
 important+0.316
 supposed+0.309
 pleased+0.308
Neg logit contributions
mine-0.273
al-0.258
talk-0.241
als-0.239
ted-0.238
Token She
Token ID1375
Feature activation+8.44e-01

Pos logit contributions
 glad+1.840
 lucky+1.629
 important+1.592
 pleased+1.582
 supposed+1.576
Neg logit contributions
mine-0.932
al-0.893
als-0.850
ted-0.824
talk-0.795
Token thanked
Token ID26280
Feature activation-2.27e+00

Pos logit contributions
 glad+2.080
 lucky+1.773
 important+1.728
 supposed+1.725
 pleased+1.686
Neg logit contributions
mine-1.416
al-1.363
talk-1.290
als-1.276
of-1.268
Token?"
Token ID1701
Feature activation-5.79e-01

Pos logit contributions
 glad+2.203
 lucky+1.851
 supposed+1.735
 pleased+1.727
 important+1.725
Neg logit contributions
mine-2.313
al-2.226
talk-2.044
als-2.033
of-2.027
Token
Token ID198
Feature activation-2.46e-02

Pos logit contributions
mine+1.318
al+1.250
talk+1.202
als+1.179
ted+1.175
Neg logit contributions
 glad-1.119
 lucky-0.926
 important-0.884
 supposed-0.855
 pleased-0.848
Token
Token ID198
Feature activation-2.04e+00

Pos logit contributions
mine+0.063
al+0.060
talk+0.058
of+0.058
ted+0.057
Neg logit contributions
 glad-0.040
 lucky-0.032
 important-0.031
 supposed-0.030
 pleased-0.029
TokenL
Token ID43
Feature activation-9.27e-01

Pos logit contributions
mine+2.971
al+2.793
ted+2.654
als+2.646
talk+2.587
Neg logit contributions
 glad-5.765
 lucky-5.019
 important-4.823
 pleased-4.781
 happy-4.596
Tokenila
Token ID10102
Feature activation-1.13e-01

Pos logit contributions
mine+1.383
al+1.270
talk+1.207
als+1.180
ted+1.172
Neg logit contributions
 glad-2.532
 lucky-2.213
 important-2.144
 supposed-2.102
 pleased-2.087
Token was
Token ID373
Feature activation+4.49e+00

Pos logit contributions
mine+0.202
al+0.191
talk+0.179
als+0.177
ted+0.175
Neg logit contributions
 glad-0.274
 lucky-0.236
 important-0.228
 supposed-0.225
 pleased-0.221
Token surprised
Token ID6655
Feature activation-8.50e-01

Pos logit contributions
 glad+8.050
 fault+7.737
 valuable+7.686
 rewarding+7.460
 pleasure+7.371
Neg logit contributions
mine-9.929
al-9.585
Give-9.001
 cash-8.154
ting-8.103
Token.
Token ID13
Feature activation-4.09e-01

Pos logit contributions
mine+1.635
al+1.533
talk+1.469
als+1.432
x+1.423
Neg logit contributions
 glad-1.965
 lucky-1.678
 important-1.617
 pleased-1.565
 supposed-1.560
Token She
Token ID1375
Feature activation+7.53e-01

Pos logit contributions
mine+0.684
al+0.640
talk+0.600
als+0.592
ted+0.583
Neg logit contributions
 glad-1.035
 lucky-0.900
 important-0.872
 supposed-0.853
 pleased-0.847
Token did
Token ID750
Feature activation+7.47e-01

Pos logit contributions
 glad+1.815
 lucky+1.553
 supposed+1.523
 important+1.512
 pleased+1.435
Neg logit contributions
mine-1.326
al-1.281
als-1.180
ted-1.178
of-1.176
Token not
Token ID407
Feature activation-7.88e-02

Pos logit contributions
 glad+1.127
 lucky+0.846
 important+0.797
 supposed+0.796
 pleased+0.776
Neg logit contributions
mine-1.954
al-1.841
of-1.795
talk-1.773
ede-1.771
Token?"
Token ID1701
Feature activation-2.32e-01

Pos logit contributions
 glad+1.701
 lucky+1.389
 supposed+1.339
 pleased+1.306
 important+1.298
Neg logit contributions
mine-1.915
al-1.802
talk-1.694
of-1.693
als-1.690
Token
Token ID198
Feature activation+8.83e-01

Pos logit contributions
mine+0.555
al+0.527
talk+0.512
of+0.502
ted+0.501
Neg logit contributions
 glad-0.420
 lucky-0.343
 important-0.329
 supposed-0.323
 pleased-0.313
Token
Token ID198
Feature activation-2.06e+00

Pos logit contributions
 glad+1.600
 supposed+1.332
 lucky+1.314
 important+1.266
 pleased+1.220
Neg logit contributions
mine-2.081
of-1.989
al-1.985
talk-1.947
ted-1.887
TokenS
Token ID50
Feature activation-2.37e+00

Pos logit contributions
mine+3.220
al+2.891
als+2.711
x+2.711
ted+2.696
Neg logit contributions
 glad-5.750
 lucky-5.069
 important-4.890
 pleased-4.752
 tired-4.666
Tokenara
Token ID3301
Feature activation-4.29e-01

Pos logit contributions
mine+3.156
al+2.681
talk+2.657
x+2.549
als+2.496
Neg logit contributions
 glad-6.851
 lucky-6.101
 important-5.891
 pleased-5.707
 tired-5.677
Token is
Token ID318
Feature activation+4.47e+00

Pos logit contributions
mine+0.781
al+0.737
talk+0.699
als+0.685
of+0.681
Neg logit contributions
 glad-1.023
 lucky-0.878
 important-0.849
 supposed-0.829
 pleased-0.822
Token surprised
Token ID6655
Feature activation-6.43e-01

Pos logit contributions
 pleasure+7.899
 fault+7.846
 valuable+7.755
 glad+7.665
 lifetime+7.591
Neg logit contributions
Give-9.682
mine-9.513
al-9.255
 cash-8.156
Tom-8.034
Token.
Token ID13
Feature activation-7.66e-01

Pos logit contributions
mine+1.306
al+1.232
talk+1.179
als+1.154
ted+1.148
Neg logit contributions
 glad-1.411
 lucky-1.194
 important-1.146
 supposed-1.112
 pleased-1.108
Token She
Token ID1375
Feature activation+1.07e+00

Pos logit contributions
mine+1.334
al+1.244
talk+1.185
als+1.150
ted+1.143
Neg logit contributions
 glad-1.897
 lucky-1.636
 important-1.579
 supposed-1.535
 pleased-1.533
Token says
Token ID1139
Feature activation-2.35e-01

Pos logit contributions
 glad+2.648
 lucky+2.256
 supposed+2.233
 important+2.222
 pleased+2.173
Neg logit contributions
mine-1.767
al-1.758
ted-1.641
als-1.636
of-1.628
Token,
Token ID11
Feature activation+1.39e+00

Pos logit contributions
mine+0.427
al+0.402
talk+0.378
als+0.377
ted+0.373
Neg logit contributions
 glad-0.559
 lucky-0.480
 important-0.465
 supposed-0.460
 pleased-0.450
Token.
Token ID13
Feature activation+1.84e+00

Pos logit contributions
 glad+1.142
 lucky+0.973
 important+0.928
 pleased+0.918
 supposed+0.909
Neg logit contributions
mine-0.895
al-0.845
talk-0.801
of-0.784
als-0.783
Token
Token ID198
Feature activation+1.56e+00

Pos logit contributions
 glad+4.461
 supposed+4.090
 lucky+3.944
 important+3.922
 pleased+3.776
Neg logit contributions
mine-3.043
al-2.912
talk-2.824
of-2.777
ted-2.757
Token
Token ID198
Feature activation-1.07e+00

Pos logit contributions
 glad+2.915
 supposed+2.534
 lucky+2.484
 important+2.406
 pleased+2.281
Neg logit contributions
mine-3.538
of-3.442
talk-3.319
al-3.290
ted-3.136
TokenS
Token ID50
Feature activation-1.73e+00

Pos logit contributions
mine+1.471
al+1.353
talk+1.247
als+1.243
ted+1.222
Neg logit contributions
 glad-3.094
 lucky-2.721
 important-2.644
 pleased-2.583
 supposed-2.546
Tokenara
Token ID3301
Feature activation-3.04e-02

Pos logit contributions
mine+2.075
al+1.823
talk+1.743
als+1.685
x+1.675
Neg logit contributions
 glad-5.224
 lucky-4.638
 important-4.513
 pleased-4.397
 supposed-4.374
Token is
Token ID318
Feature activation+4.46e+00

Pos logit contributions
mine+0.055
al+0.052
talk+0.049
als+0.048
of+0.048
Neg logit contributions
 glad-0.073
 lucky-0.063
 important-0.061
 supposed-0.060
 pleased-0.059
Token happy
Token ID3772
Feature activation+6.04e-01

Pos logit contributions
 tempting+7.718
 fault+7.511
 valuable+7.444
 important+7.137
 rewarding+7.004
Neg logit contributions
mine-10.382
al-9.654
Give-8.955
 cash-8.502
ting-8.184
Token and
Token ID290
Feature activation+1.77e+00

Pos logit contributions
 glad+1.438
 lucky+1.248
 important+1.217
 supposed+1.213
 pleased+1.176
Neg logit contributions
mine-1.074
al-1.011
talk-0.953
of-0.947
ted-0.938
Token hopeful
Token ID17836
Feature activation-5.15e-02

Pos logit contributions
 glad+3.595
 supposed+3.164
 important+3.092
 lucky+2.979
 fault+2.950
Neg logit contributions
mine-3.339
al-3.336
als-3.248
ted-3.237
talk-3.135
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
mine+0.099
al+0.093
talk+0.089
of+0.088
als+0.087
Neg logit contributions
 glad-0.117
 lucky-0.100
 important-0.096
 supposed-0.095
 pleased-0.094
Token She
Token ID1375
Feature activation+6.34e-01

Pos logit contributions
 glad+3.314
 lucky+2.974
 important+2.923
 supposed+2.900
 pleased+2.876
Neg logit contributions
mine-1.551
al-1.475
als-1.383
talk-1.335
ted-1.335
Token sky
Token ID6766
Feature activation+8.58e-03

Pos logit contributions
 glad+1.102
 lucky+0.908
 supposed+0.905
 important+0.876
 pleased+0.870
Neg logit contributions
mine-0.965
al-0.934
talk-0.887
als-0.881
ted-0.868
Token got
Token ID1392
Feature activation+1.29e+00

Pos logit contributions
 glad+0.021
 lucky+0.017
 important+0.017
 supposed+0.017
 pleased+0.016
Neg logit contributions
mine-0.016
al-0.015
als-0.014
talk-0.014
ted-0.014
Token dark
Token ID3223
Feature activation-8.30e-02

Pos logit contributions
 glad+2.251
 lucky+1.741
 supposed+1.647
 pleased+1.603
 important+1.597
Neg logit contributions
mine-3.186
al-3.023
talk-2.958
ted-2.921
als-2.896
Token and
Token ID290
Feature activation+9.80e-01

Pos logit contributions
mine+0.147
al+0.139
talk+0.131
als+0.129
ted+0.129
Neg logit contributions
 glad-0.203
 lucky-0.173
 important-0.166
 pleased-0.164
 supposed-0.162
Token he
Token ID339
Feature activation+1.44e+00

Pos logit contributions
 glad+2.109
 supposed+1.734
 lucky+1.731
 important+1.665
 pleased+1.644
Neg logit contributions
mine-1.983
al-1.881
ted-1.832
als-1.823
talk-1.773
Token was
Token ID373
Feature activation+4.46e+00

Pos logit contributions
 glad+3.659
 supposed+3.093
 lucky+3.047
 important+3.010
 pleased+2.881
Neg logit contributions
mine-2.500
al-2.429
als-2.255
ted-2.238
talk-2.226
Token tired
Token ID10032
Feature activation-4.06e-01

Pos logit contributions
 glad+10.480
 valuable+10.065
 important+9.748
 tempting+9.579
 rewarding+9.504
Neg logit contributions
mine-8.585
al-7.817
ting-7.372
ted-6.747
talk-6.685
Token.
Token ID13
Feature activation+9.86e-01

Pos logit contributions
mine+0.800
al+0.753
talk+0.719
als+0.705
of+0.701
Neg logit contributions
 glad-0.908
 lucky-0.768
 important-0.737
 pleased-0.718
 supposed-0.717
Token So
Token ID1406
Feature activation-1.30e+00

Pos logit contributions
 glad+2.407
 lucky+2.048
 supposed+2.001
 pleased+1.955
 important+1.942
Neg logit contributions
mine-1.713
al-1.654
talk-1.591
als-1.580
ted-1.574
Token he
Token ID339
Feature activation-1.59e-01

Pos logit contributions
mine+2.472
al+2.280
talk+2.203
x+2.143
als+2.125
Neg logit contributions
 glad-3.132
 lucky-2.716
 important-2.599
 pleased-2.494
 tired-2.458
Token decided
Token ID3066
Feature activation+4.31e-01

Pos logit contributions
mine+0.253
al+0.238
talk+0.225
als+0.218
ted+0.218
Neg logit contributions
 glad-0.417
 lucky-0.360
 important-0.348
 supposed-0.345
 pleased-0.340
Token?"
Token ID1701
Feature activation+2.91e-01

Pos logit contributions
 glad+1.349
 lucky+1.130
 supposed+1.071
 important+1.068
 pleased+1.065
Neg logit contributions
mine-1.310
al-1.238
talk-1.156
of-1.154
x-1.147
Token
Token ID198
Feature activation+9.21e-01

Pos logit contributions
 glad+0.601
 lucky+0.508
 supposed+0.494
 important+0.493
 pleased+0.475
Neg logit contributions
mine-0.628
al-0.581
talk-0.559
of-0.555
ted-0.555
Token
Token ID198
Feature activation-1.82e+00

Pos logit contributions
 glad+1.687
 lucky+1.389
 supposed+1.379
 important+1.326
 pleased+1.294
Neg logit contributions
mine-2.177
of-2.061
al-2.031
talk-1.993
ted-1.953
TokenL
Token ID43
Feature activation-4.97e-01

Pos logit contributions
mine+1.947
al+1.755
als+1.583
talk+1.569
ted+1.547
Neg logit contributions
 glad-5.892
 lucky-5.280
 important-5.172
 pleased-5.014
 happy-4.925
Tokenily
Token ID813
Feature activation-9.62e-01

Pos logit contributions
mine+0.752
al+0.695
talk+0.652
als+0.631
of+0.628
Neg logit contributions
 glad-1.352
 lucky-1.183
 important-1.145
 pleased-1.119
 supposed-1.116
Token was
Token ID373
Feature activation+4.44e+00

Pos logit contributions
mine+1.852
al+1.734
talk+1.663
als+1.622
x+1.613
Neg logit contributions
 glad-2.193
 lucky-1.874
 important-1.800
 pleased-1.748
 supposed-1.730
Token happy
Token ID3772
Feature activation+4.91e-01

Pos logit contributions
 glad+8.465
 fault+8.414
 pleasure+7.945
 valuable+7.680
 tempting+7.651
Neg logit contributions
mine-9.404
al-9.095
Give-8.740
 cash-8.303
Wh-7.760
Token to
Token ID284
Feature activation-1.04e+00

Pos logit contributions
 glad+1.214
 lucky+1.054
 important+1.026
 supposed+1.012
 pleased+0.994
Neg logit contributions
mine-0.833
al-0.785
talk-0.743
ted-0.738
of-0.730
Token have
Token ID423
Feature activation+2.51e-01

Pos logit contributions
mine+1.949
al+1.843
talk+1.748
als+1.729
ted+1.725
Neg logit contributions
 glad-2.419
 lucky-2.070
 important-1.984
 supposed-1.931
 pleased-1.919
Token a
Token ID257
Feature activation+1.51e+00

Pos logit contributions
 glad+0.619
 lucky+0.526
 pleased+0.507
 important+0.506
 supposed+0.501
Neg logit contributions
mine-0.433
al-0.410
talk-0.389
ted-0.383
als-0.380
Token new
Token ID649
Feature activation+4.50e-01

Pos logit contributions
 glad+2.723
 supposed+2.260
 lucky+2.096
 pleased+2.049
 important+2.041
Neg logit contributions
mine-3.459
al-3.302
of-3.259
 cash-3.211
talk-3.181
TokenL
Token ID43
Feature activation-9.63e-01

Pos logit contributions
mine+1.185
al+1.033
talk+0.932
als+0.905
ted+0.896
Neg logit contributions
 glad-4.128
 lucky-3.696
 important-3.576
 pleased-3.519
 supposed-3.467
Tokenily
Token ID813
Feature activation-3.26e-02

Pos logit contributions
mine+1.524
al+1.408
talk+1.348
als+1.310
ted+1.304
Neg logit contributions
 glad-2.537
 lucky-2.209
 important-2.141
 supposed-2.103
 pleased-2.074
Token nodded
Token ID14464
Feature activation+6.93e-01

Pos logit contributions
mine+0.056
al+0.053
talk+0.050
als+0.049
of+0.049
Neg logit contributions
 glad-0.081
 lucky-0.070
 important-0.068
 supposed-0.067
 pleased-0.065
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
 glad+1.747
 lucky+1.509
 supposed+1.491
 important+1.471
 pleased+1.415
Neg logit contributions
mine-1.188
al-1.087
talk-1.002
ted-0.987
of-0.986
Token She
Token ID1375
Feature activation+1.83e-01

Pos logit contributions
 glad+3.097
 lucky+2.754
 important+2.744
 supposed+2.711
 pleased+2.672
Neg logit contributions
mine-1.660
al-1.579
als-1.510
ted-1.428
talk-1.411
Token was
Token ID373
Feature activation+4.43e+00

Pos logit contributions
 glad+0.428
 lucky+0.364
 important+0.351
 supposed+0.350
 pleased+0.339
Neg logit contributions
mine-0.337
al-0.321
talk-0.301
ted-0.299
of-0.298
Token sorry
Token ID7926
Feature activation+1.18e+00

Pos logit contributions
 valuable+8.465
 fault+8.347
 glad+8.195
 pleasure+8.067
 tempting+8.027
Neg logit contributions
mine-8.704
al-8.682
Give-7.906
 cash-7.704
ting-7.355
Token for
Token ID329
Feature activation+3.13e-01

Pos logit contributions
 glad+2.872
 supposed+2.465
 lucky+2.461
 pleased+2.411
 important+2.346
Neg logit contributions
mine-1.957
al-1.811
of-1.810
als-1.738
ted-1.707
Token making
Token ID1642
Feature activation+2.92e-01

Pos logit contributions
 glad+0.623
 lucky+0.524
 important+0.505
 pleased+0.494
 supposed+0.485
Neg logit contributions
mine-0.685
al-0.644
ted-0.601
x-0.599
of-0.594
Token Mom
Token ID11254
Feature activation+1.54e-01

Pos logit contributions
 glad+0.749
 lucky+0.646
 important+0.625
 pleased+0.619
 supposed+0.615
Neg logit contributions
mine-0.478
al-0.441
talk-0.408
of-0.402
ted-0.399
Tokenmy
Token ID1820
Feature activation+1.72e-01

Pos logit contributions
 glad+0.403
 lucky+0.353
 important+0.344
 supposed+0.335
 pleased+0.331
Neg logit contributions
mine-0.246
al-0.222
talk-0.212
of-0.208
als-0.202
Token things
Token ID1243
Feature activation+9.45e-01

Pos logit contributions
 glad+0.688
 lucky+0.586
 important+0.564
 supposed+0.560
 pleased+0.560
Neg logit contributions
mine-0.498
al-0.475
als-0.449
talk-0.447
ted-0.442
Token."
Token ID526
Feature activation+3.81e-01

Pos logit contributions
 glad+2.091
 lucky+1.737
 pleased+1.675
 important+1.647
 supposed+1.631
Neg logit contributions
mine-1.935
al-1.839
als-1.746
talk-1.725
of-1.712
Token
Token ID198
Feature activation+1.46e+00

Pos logit contributions
 glad+0.795
 lucky+0.677
 important+0.671
 supposed+0.664
 pleased+0.634
Neg logit contributions
mine-0.797
al-0.754
talk-0.725
of-0.722
als-0.720
Token
Token ID198
Feature activation-9.37e-01

Pos logit contributions
 glad+2.659
 supposed+2.340
 lucky+2.247
 important+2.205
 pleased+2.067
Neg logit contributions
mine-3.391
of-3.284
al-3.174
talk-3.077
ted-3.044
TokenAnna
Token ID31160
Feature activation-4.56e-01

Pos logit contributions
mine+1.236
al+1.131
talk+1.052
als+1.023
ted+1.014
Neg logit contributions
 glad-2.741
 lucky-2.418
 important-2.349
 pleased-2.297
 supposed-2.278
Token was
Token ID373
Feature activation+4.43e+00

Pos logit contributions
mine+0.829
al+0.780
talk+0.740
als+0.725
ted+0.719
Neg logit contributions
 glad-1.089
 lucky-0.935
 important-0.904
 supposed-0.884
 pleased-0.874
Token eager
Token ID11069
Feature activation+6.77e-02

Pos logit contributions
 tempting+8.948
 fault+8.766
 valuable+8.579
 important+8.173
 glad+8.131
Neg logit contributions
mine-9.675
al-8.439
 cash-7.592
Give-7.589
 her-7.190
Token to
Token ID284
Feature activation+3.48e-01

Pos logit contributions
 glad+0.144
 lucky+0.120
 important+0.114
 pleased+0.113
 supposed+0.112
Neg logit contributions
mine-0.141
al-0.133
talk-0.129
als-0.126
of-0.126
Token hear
Token ID3285
Feature activation+1.78e-01

Pos logit contributions
 glad+0.958
 lucky+0.837
 important+0.821
 pleased+0.808
 supposed+0.802
Neg logit contributions
mine-0.498
al-0.459
talk-0.428
of-0.420
als-0.404
Token the
Token ID262
Feature activation+8.32e-01

Pos logit contributions
 glad+0.472
 lucky+0.412
 important+0.399
 supposed+0.399
 pleased+0.392
Neg logit contributions
mine-0.271
al-0.253
talk-0.234
ted-0.233
als-0.230
Token story
Token ID1621
Feature activation-3.77e-01

Pos logit contributions
 glad+2.255
 lucky+1.937
 supposed+1.919
 pleased+1.892
 important+1.874
Neg logit contributions
mine-1.213
al-1.143
talk-1.055
 cash-1.043
ted-1.033
Token?"
Token ID1701
Feature activation-5.27e-01

Pos logit contributions
 glad+0.738
 lucky+0.652
 important+0.637
 supposed+0.635
 pleased+0.632
Neg logit contributions
mine-0.267
al-0.249
ted-0.220
als-0.219
talk-0.217
Token
Token ID198
Feature activation+2.38e-01

Pos logit contributions
mine+1.394
al+1.332
talk+1.288
als+1.269
ted+1.265
Neg logit contributions
 glad-0.826
 lucky-0.651
 important-0.616
 supposed-0.590
 pleased-0.581
Token
Token ID198
Feature activation-2.52e+00

Pos logit contributions
 glad+0.397
 lucky+0.322
 important+0.309
 supposed+0.306
 pleased+0.291
Neg logit contributions
mine-0.598
al-0.567
of-0.555
talk-0.551
ted-0.535
TokenL
Token ID43
Feature activation-1.46e+00

Pos logit contributions
mine+3.801
al+3.549
ted+3.422
als+3.388
talk+3.265
Neg logit contributions
 glad-7.195
 lucky-6.236
 pleased-5.957
 important-5.947
 happy-5.792
Tokenily
Token ID813
Feature activation-9.66e-01

Pos logit contributions
mine+2.432
al+2.233
talk+2.182
ted+2.145
als+2.144
Neg logit contributions
 glad-3.747
 lucky-3.246
 important-3.144
 supposed-3.103
 pleased-3.027
Token was
Token ID373
Feature activation+4.43e+00

Pos logit contributions
mine+1.722
al+1.603
talk+1.533
als+1.490
x+1.485
Neg logit contributions
 glad-2.336
 lucky-2.012
 important-1.937
 pleased-1.892
 supposed-1.870
Token curious
Token ID11040
Feature activation+9.58e-01

Pos logit contributions
 fault+7.763
 glad+7.622
 valuable+7.376
 tempting+7.130
 important+7.109
Neg logit contributions
mine-10.064
al-9.883
Give-8.911
 cash-8.289
talk-8.126
Token and
Token ID290
Feature activation+1.44e+00

Pos logit contributions
 glad+2.597
 lucky+2.267
 supposed+2.215
 important+2.165
 pleased+2.133
Neg logit contributions
mine-1.436
al-1.341
of-1.243
ted-1.238
als-1.235
Token said
Token ID531
Feature activation+3.73e-01

Pos logit contributions
 glad+2.413
 supposed+1.996
 fault+1.902
 lucky+1.880
 important+1.849
Neg logit contributions
al-3.308
mine-3.291
ted-3.197
als-3.182
Give-3.168
Token,
Token ID11
Feature activation+5.90e-01

Pos logit contributions
 glad+0.923
 lucky+0.795
 supposed+0.787
 important+0.775
 pleased+0.747
Neg logit contributions
mine-0.638
al-0.599
als-0.564
of-0.553
talk-0.550
Token "
Token ID366
Feature activation-2.86e+00

Pos logit contributions
 glad+1.029
 lucky+0.846
 important+0.837
 supposed+0.828
 pleased+0.761
Neg logit contributions
mine-1.421
al-1.356
als-1.311
ted-1.292
of-1.274
Token."
Token ID526
Feature activation+3.18e-01

Pos logit contributions
mine+1.255
al+1.181
talk+1.128
als+1.104
ted+1.097
Neg logit contributions
 glad-1.456
 lucky-1.238
 important-1.191
 supposed-1.157
 pleased-1.153
Token
Token ID198
Feature activation+1.24e+00

Pos logit contributions
 glad+0.541
 lucky+0.443
 important+0.434
 supposed+0.431
 pleased+0.407
Neg logit contributions
mine-0.787
al-0.744
talk-0.722
of-0.712
als-0.705
Token
Token ID198
Feature activation-1.58e+00

Pos logit contributions
 glad+2.298
 supposed+1.961
 lucky+1.927
 important+1.906
 pleased+1.800
Neg logit contributions
mine-2.852
of-2.707
al-2.676
talk-2.609
ted-2.525
TokenS
Token ID50
Feature activation-1.54e+00

Pos logit contributions
mine+2.308
al+2.140
als+1.998
ted+1.990
talk+1.990
Neg logit contributions
 glad-4.526
 lucky-3.969
 important-3.823
 pleased-3.748
 tired-3.647
Tokenara
Token ID3301
Feature activation-2.90e-01

Pos logit contributions
mine+1.823
al+1.585
talk+1.531
x+1.455
ted+1.442
Neg logit contributions
 glad-4.661
 lucky-4.173
 important-4.040
 supposed-3.942
 pleased-3.929
Token is
Token ID318
Feature activation+4.42e+00

Pos logit contributions
mine+0.568
al+0.539
talk+0.514
als+0.503
ted+0.500
Neg logit contributions
 glad-0.652
 lucky-0.554
 important-0.534
 supposed-0.522
 pleased-0.517
Token sad
Token ID6507
Feature activation+6.40e-01

Pos logit contributions
 pleasure+7.547
 glad+6.977
 valuable+6.953
 lifetime+6.940
 rewarding+6.864
Neg logit contributions
mine-10.353
Give-10.089
al-9.898
 cash-9.289
ting-8.844
Token.
Token ID13
Feature activation+3.56e-01

Pos logit contributions
 glad+1.567
 lucky+1.352
 pleased+1.291
 supposed+1.282
 important+1.282
Neg logit contributions
mine-1.116
al-1.030
talk-1.001
of-0.986
ted-0.950
Token She
Token ID1375
Feature activation+1.49e+00

Pos logit contributions
 glad+0.959
 lucky+0.847
 important+0.825
 supposed+0.810
 pleased+0.810
Neg logit contributions
mine-0.533
al-0.501
als-0.465
talk-0.456
ted-0.448
Token says
Token ID1139
Feature activation+2.08e-01

Pos logit contributions
 glad+3.558
 lucky+2.990
 pleased+2.983
 supposed+2.910
 important+2.876
Neg logit contributions
mine-2.612
al-2.559
ted-2.414
Give-2.412
talk-2.383
Token,
Token ID11
Feature activation+1.69e+00

Pos logit contributions
 glad+0.507
 lucky+0.436
 supposed+0.426
 important+0.423
 pleased+0.413
Neg logit contributions
mine-0.361
al-0.339
als-0.320
talk-0.315
ted-0.314
Token goodbye
Token ID24829
Feature activation+1.08e+00

Pos logit contributions
 glad+0.409
 lucky+0.345
 important+0.339
 supposed+0.334
 pleased+0.325
Neg logit contributions
mine-0.360
al-0.337
talk-0.321
ted-0.318
als-0.317
Token to
Token ID284
Feature activation-2.51e-01

Pos logit contributions
 glad+2.421
 lucky+2.048
 important+1.998
 supposed+1.980
 pleased+1.975
Neg logit contributions
mine-2.039
al-1.979
talk-1.935
als-1.912
ted-1.857
Token Anna
Token ID11735
Feature activation+2.89e-01

Pos logit contributions
mine+0.428
al+0.400
talk+0.376
of+0.369
ted+0.368
Neg logit contributions
 glad-0.626
 lucky-0.541
 important-0.525
 supposed-0.514
 pleased-0.514
Token."
Token ID526
Feature activation+5.86e-01

Pos logit contributions
 glad+0.759
 lucky+0.662
 important+0.642
 pleased+0.631
 supposed+0.630
Neg logit contributions
mine-0.463
al-0.427
talk-0.395
of-0.389
als-0.389
Token Ben
Token ID3932
Feature activation-1.02e-01

Pos logit contributions
 glad+1.399
 lucky+1.230
 important+1.207
 supposed+1.201
 pleased+1.151
Neg logit contributions
mine-1.062
al-0.964
talk-0.934
of-0.933
ted-0.910
Token is
Token ID318
Feature activation+4.42e+00

Pos logit contributions
mine+0.194
al+0.182
talk+0.174
of+0.171
als+0.170
Neg logit contributions
 glad-0.236
 lucky-0.202
 important-0.195
 supposed-0.193
 pleased-0.189
Token sad
Token ID6507
Feature activation+7.46e-01

Pos logit contributions
 valuable+8.799
 fault+8.411
 tempting+8.345
 rewarding+8.173
 important+8.129
Neg logit contributions
mine-9.255
Give-7.995
al-7.858
 cash-7.790
talk-7.276
Token.
Token ID13
Feature activation+1.96e+00

Pos logit contributions
 glad+1.855
 lucky+1.600
 supposed+1.546
 pleased+1.538
 important+1.535
Neg logit contributions
mine-1.251
al-1.152
talk-1.131
of-1.120
ted-1.082
Token He
Token ID679
Feature activation+1.78e+00

Pos logit contributions
 glad+5.097
 supposed+4.675
 lucky+4.575
 important+4.540
 pleased+4.286
Neg logit contributions
mine-2.977
al-2.692
of-2.692
ted-2.621
Give-2.610
Token does
Token ID857
Feature activation+1.34e+00

Pos logit contributions
 glad+4.179
 pleased+3.454
 lucky+3.447
 supposed+3.447
 important+3.330
Neg logit contributions
mine-3.177
al-3.123
talk-3.026
ted-3.021
Give-2.951
Token not
Token ID407
Feature activation+1.52e-01

Pos logit contributions
 glad+2.074
 lucky+1.619
 supposed+1.558
 important+1.555
 pleased+1.522
Neg logit contributions
mine-3.395
talk-3.118
al-3.110
of-3.062
Give-3.030
Token asleep
Token ID16039
Feature activation+6.42e-01

Pos logit contributions
 glad+2.906
 lucky+2.428
 important+2.288
 pleased+2.262
 supposed+2.229
Neg logit contributions
mine-3.231
ted-3.043
al-3.037
als-2.984
talk-2.935
Token.
Token ID13
Feature activation+2.00e+00

Pos logit contributions
 glad+1.508
 lucky+1.291
 supposed+1.255
 important+1.254
 pleased+1.207
Neg logit contributions
mine-1.185
al-1.117
talk-1.033
ted-1.016
als-1.014
Token The
Token ID383
Feature activation+9.18e-01

Pos logit contributions
 glad+4.381
 supposed+3.968
 important+3.932
 lucky+3.871
 pleased+3.684
Neg logit contributions
mine-3.756
al-3.609
of-3.509
ted-3.506
talk-3.500
Token turkey
Token ID26876
Feature activation+1.11e+00

Pos logit contributions
 glad+2.329
 supposed+2.001
 lucky+1.968
 important+1.919
 pleased+1.913
Neg logit contributions
mine-1.476
al-1.417
als-1.309
ted-1.269
talk-1.251
Token was
Token ID373
Feature activation+4.17e+00

Pos logit contributions
 glad+2.706
 supposed+2.313
 lucky+2.281
 important+2.246
 pleased+2.166
Neg logit contributions
mine-2.008
al-1.960
als-1.822
ted-1.786
talk-1.784
Token very
Token ID845
Feature activation+4.41e+00

Pos logit contributions
 glad+8.260
 fault+7.952
 tempting+7.926
 valuable+7.653
 supposed+7.621
Neg logit contributions
mine-8.706
al-7.853
talk-6.993
ted-6.920
ting-6.746
Token happy
Token ID3772
Feature activation+1.19e+00

Pos logit contributions
 fault+6.847
 supposed+6.787
 glad+5.633
 pleasure+5.074
 tempting+4.905
Neg logit contributions
mine-11.453
of-10.610
al-10.500
talk-10.069
ted-9.900
Token.
Token ID13
Feature activation+1.52e+00

Pos logit contributions
 glad+3.168
 lucky+2.770
 important+2.690
 supposed+2.674
 pleased+2.649
Neg logit contributions
mine-1.762
al-1.701
ted-1.583
als-1.572
of-1.504
Token<|endoftext|>
Token ID50256
Feature activation+1.46e+00

Pos logit contributions
 glad+3.173
 supposed+2.815
 important+2.810
 lucky+2.795
 pleased+2.644
Neg logit contributions
mine-3.067
al-2.953
ted-2.822
of-2.810
als-2.802
TokenOne
Token ID3198
Feature activation+2.37e+00

Pos logit contributions
 glad+2.949
 lucky+2.436
 important+2.417
 pleased+2.387
 supposed+2.287
Neg logit contributions
mine-3.388
al-3.232
of-3.046
talk-3.038
ted-3.034
Token sunny
Token ID27737
Feature activation+3.92e-01

Pos logit contributions
 glad+5.103
 supposed+4.226
 pleased+4.106
 lucky+4.061
 fault+4.001
Neg logit contributions
mine-4.831
of-4.717
al-4.479
talk-4.115
ted-4.077
Token."
Token ID526
Feature activation+1.26e+00

Pos logit contributions
mine+1.597
al+1.507
talk+1.450
als+1.418
ted+1.415
Neg logit contributions
 glad-1.507
 lucky-1.261
 important-1.207
 supposed-1.167
 pleased-1.161
Token
Token ID198
Feature activation+1.13e+00

Pos logit contributions
 glad+2.391
 supposed+2.128
 important+1.996
 lucky+1.984
 pleased+1.875
Neg logit contributions
mine-2.778
al-2.662
of-2.646
talk-2.593
ted-2.570
Token
Token ID198
Feature activation-1.87e+00

Pos logit contributions
 glad+2.124
 supposed+1.779
 lucky+1.763
 important+1.689
 pleased+1.650
Neg logit contributions
mine-2.594
of-2.527
al-2.479
talk-2.431
ted-2.367
TokenS
Token ID50
Feature activation-1.91e+00

Pos logit contributions
mine+3.207
al+2.871
x+2.720
talk+2.716
ted+2.704
Neg logit contributions
 glad-4.905
 lucky-4.299
 important-4.168
 pleased-4.015
 tired-3.952
Tokenara
Token ID3301
Feature activation-4.21e-01

Pos logit contributions
mine+2.444
al+2.059
talk+2.029
x+1.993
als+1.888
Neg logit contributions
 glad-5.615
 lucky-5.033
 important-4.899
 supposed-4.713
 pleased-4.702
Token is
Token ID318
Feature activation+4.41e+00

Pos logit contributions
mine+0.818
al+0.776
talk+0.737
als+0.724
ted+0.718
Neg logit contributions
 glad-0.951
 lucky-0.808
 important-0.778
 supposed-0.758
 pleased-0.754
Token not
Token ID407
Feature activation+2.63e+00

Pos logit contributions
 fault+7.759
 lifetime+7.538
 pleasure+7.433
 glad+7.355
 valuable+7.223
Neg logit contributions
Give-9.687
mine-9.271
al-8.915
 cash-8.401
ting-7.807
Token happy
Token ID3772
Feature activation+5.29e-01

Pos logit contributions
 glad+3.845
 important+3.068
 tempting+2.980
 lucky+2.911
 cozy+2.816
Neg logit contributions
mine-7.153
al-6.445
talk-6.126
of-5.944
x-5.787
Token.
Token ID13
Feature activation+3.54e-01

Pos logit contributions
 glad+1.279
 lucky+1.091
 important+1.044
 pleased+1.042
 supposed+1.021
Neg logit contributions
mine-0.942
al-0.892
of-0.833
ted-0.828
talk-0.827
Token She
Token ID1375
Feature activation+3.78e-01

Pos logit contributions
 glad+0.941
 lucky+0.833
 important+0.819
 supposed+0.804
 pleased+0.792
Neg logit contributions
mine-0.534
al-0.505
als-0.466
ted-0.456
talk-0.451
Token likes
Token ID7832
Feature activation+3.40e-01

Pos logit contributions
 glad+0.811
 lucky+0.675
 important+0.652
 pleased+0.641
 supposed+0.638
Neg logit contributions
mine-0.762
al-0.748
als-0.696
talk-0.693
ted-0.692
Token?"
Token ID1701
Feature activation-1.44e-01

Pos logit contributions
 glad+2.275
 lucky+1.843
 supposed+1.779
 important+1.738
 pleased+1.732
Neg logit contributions
mine-2.576
al-2.507
als-2.342
of-2.326
x-2.284
Token
Token ID198
Feature activation+6.20e-01

Pos logit contributions
mine+0.368
al+0.351
talk+0.342
of+0.337
ted+0.336
Neg logit contributions
 glad-0.236
 lucky-0.188
 important-0.179
 supposed-0.177
 pleased-0.170
Token
Token ID198
Feature activation-2.25e+00

Pos logit contributions
 glad+1.097
 supposed+0.894
 lucky+0.893
 important+0.858
 pleased+0.818
Neg logit contributions
mine-1.496
of-1.423
al-1.419
talk-1.390
ted-1.354
TokenL
Token ID43
Feature activation-1.56e+00

Pos logit contributions
mine+2.756
al+2.420
ted+2.202
als+2.201
talk+2.177
Neg logit contributions
 glad-7.068
 lucky-6.331
 important-6.137
 pleased-6.045
 tired-5.979
Tokenila
Token ID10102
Feature activation-1.31e+00

Pos logit contributions
mine+2.502
al+2.252
talk+2.210
als+2.162
x+2.141
Neg logit contributions
 glad-4.112
 lucky-3.596
 important-3.470
 supposed-3.419
 pleased-3.358
Token was
Token ID373
Feature activation+4.40e+00

Pos logit contributions
mine+2.546
al+2.370
talk+2.302
x+2.246
ted+2.220
Neg logit contributions
 glad-2.961
 lucky-2.528
 important-2.426
 pleased-2.373
 supposed-2.303
Token excited
Token ID6568
Feature activation-3.10e-01

Pos logit contributions
 fault+7.523
 glad+7.417
 pleasure+7.202
 valuable+7.195
 tempting+6.936
Neg logit contributions
mine-10.086
Give-9.604
al-9.512
 cash-8.808
er-8.158
Token.
Token ID13
Feature activation-3.14e-01

Pos logit contributions
mine+0.600
al+0.563
talk+0.540
als+0.527
of+0.527
Neg logit contributions
 glad-0.708
 lucky-0.602
 important-0.577
 supposed-0.564
 pleased-0.562
Token She
Token ID1375
Feature activation-2.92e-01

Pos logit contributions
mine+0.513
al+0.480
talk+0.449
als+0.447
ted+0.441
Neg logit contributions
 glad-0.805
 lucky-0.702
 important-0.682
 supposed-0.667
 pleased-0.661
Token said
Token ID531
Feature activation-3.98e-01

Pos logit contributions
mine+0.533
al+0.503
talk+0.476
als+0.469
ted+0.465
Neg logit contributions
 glad-0.695
 lucky-0.595
 important-0.574
 supposed-0.563
 pleased-0.555
Token,
Token ID11
Feature activation+9.66e-01

Pos logit contributions
mine+0.735
al+0.691
talk+0.654
als+0.646
ted+0.640
Neg logit contributions
 glad-0.935
 lucky-0.801
 important-0.773
 supposed-0.760
 pleased-0.749
Token.
Token ID13
Feature activation+6.41e-01

Pos logit contributions
 glad+1.401
 lucky+1.209
 important+1.174
 supposed+1.167
 pleased+1.117
Neg logit contributions
mine-1.068
al-0.991
talk-0.943
ted-0.912
of-0.910
Token
Token ID198
Feature activation+1.13e+00

Pos logit contributions
 glad+1.136
 lucky+0.936
 supposed+0.922
 important+0.909
 pleased+0.846
Neg logit contributions
mine-1.537
al-1.465
talk-1.410
ted-1.396
of-1.396
Token
Token ID198
Feature activation-2.38e+00

Pos logit contributions
 glad+2.080
 lucky+1.742
 supposed+1.724
 important+1.686
 pleased+1.625
Neg logit contributions
mine-2.611
of-2.478
al-2.434
talk-2.422
ted-2.324
TokenL
Token ID43
Feature activation-1.21e+00

Pos logit contributions
mine+3.992
al+3.737
als+3.594
ted+3.567
talk+3.418
Neg logit contributions
 glad-6.429
 lucky-5.552
 important-5.288
 pleased-5.246
 happy-5.101
Tokenily
Token ID813
Feature activation-4.69e-01

Pos logit contributions
mine+1.980
al+1.822
talk+1.757
als+1.727
ted+1.722
Neg logit contributions
 glad-3.131
 lucky-2.720
 important-2.645
 supposed-2.599
 pleased-2.543
Token is
Token ID318
Feature activation+4.40e+00

Pos logit contributions
mine+0.919
al+0.869
talk+0.828
als+0.813
ted+0.807
Neg logit contributions
 glad-1.054
 lucky-0.895
 important-0.862
 supposed-0.839
 pleased-0.834
Token curious
Token ID11040
Feature activation+5.30e-01

Pos logit contributions
 fault+7.696
 pleasure+7.342
 glad+7.334
 valuable+7.112
 tempting+7.029
Neg logit contributions
mine-9.765
al-9.648
Give-9.124
er-8.183
 cash-8.162
Token.
Token ID13
Feature activation-8.03e-02

Pos logit contributions
 glad+1.286
 lucky+1.101
 supposed+1.061
 important+1.049
 pleased+1.034
Neg logit contributions
mine-0.950
al-0.887
of-0.840
talk-0.840
ted-0.827
Token She
Token ID1375
Feature activation+3.72e-01

Pos logit contributions
mine+0.130
al+0.122
talk+0.113
als+0.112
of+0.111
Neg logit contributions
 glad-0.207
 lucky-0.181
 important-0.176
 supposed-0.173
 pleased-0.171
Token wants
Token ID3382
Feature activation-6.02e-01

Pos logit contributions
 glad+0.827
 lucky+0.690
 important+0.669
 supposed+0.661
 pleased+0.656
Neg logit contributions
mine-0.713
al-0.703
talk-0.663
als-0.656
ted-0.655
Token to
Token ID284
Feature activation-1.01e+00

Pos logit contributions
mine+1.350
al+1.283
talk+1.234
als+1.211
ted+1.205
Neg logit contributions
 glad-1.191
 lucky-0.986
 important-0.941
 supposed-0.910
 pleased-0.908
Token?"
Token ID1701
Feature activation+2.52e-01

Pos logit contributions
 glad+1.007
 lucky+0.822
 supposed+0.785
 important+0.781
 pleased+0.774
Neg logit contributions
mine-1.047
al-0.991
of-0.929
als-0.919
x-0.912
Token
Token ID198
Feature activation+1.05e+00

Pos logit contributions
 glad+0.434
 lucky+0.352
 important+0.343
 supposed+0.342
 pleased+0.321
Neg logit contributions
mine-0.624
al-0.592
talk-0.581
of-0.573
ted-0.569
Token
Token ID198
Feature activation-1.99e+00

Pos logit contributions
 glad+1.951
 supposed+1.638
 lucky+1.619
 important+1.563
 pleased+1.494
Neg logit contributions
mine-2.448
of-2.348
al-2.317
talk-2.284
ted-2.192
TokenS
Token ID50
Feature activation-2.24e+00

Pos logit contributions
mine+3.085
al+2.832
als+2.676
ted+2.664
x+2.621
Neg logit contributions
 glad-5.512
 lucky-4.839
 important-4.673
 pleased-4.582
 tired-4.452
Tokenara
Token ID3301
Feature activation-7.44e-01

Pos logit contributions
mine+2.950
talk+2.499
al+2.490
x+2.401
als+2.314
Neg logit contributions
 glad-6.480
 lucky-5.778
 important-5.597
 tired-5.426
 pleased-5.402
Token is
Token ID318
Feature activation+4.39e+00

Pos logit contributions
mine+1.488
al+1.397
talk+1.339
als+1.309
ted+1.302
Neg logit contributions
 glad-1.641
 lucky-1.390
 important-1.335
 supposed-1.293
 pleased-1.291
Token happy
Token ID3772
Feature activation+2.81e-01

Pos logit contributions
 fault+7.725
 pleasure+7.531
 valuable+7.528
 lifetime+7.424
 glad+7.390
Neg logit contributions
mine-9.469
Give-9.408
al-8.772
 cash-8.277
Tom-7.723
Token.
Token ID13
Feature activation+9.16e-02

Pos logit contributions
 glad+0.669
 lucky+0.576
 important+0.560
 supposed+0.553
 pleased+0.541
Neg logit contributions
mine-0.507
al-0.480
talk-0.452
of-0.448
ted-0.448
Token She
Token ID1375
Feature activation+4.79e-01

Pos logit contributions
 glad+0.241
 lucky+0.213
 important+0.208
 supposed+0.204
 pleased+0.201
Neg logit contributions
mine-0.141
al-0.133
als-0.124
ted-0.122
talk-0.122
Token gives
Token ID3607
Feature activation-2.22e+00

Pos logit contributions
 glad+1.122
 lucky+0.949
 important+0.929
 supposed+0.919
 pleased+0.906
Neg logit contributions
mine-0.858
al-0.841
als-0.790
talk-0.783
ted-0.782
Token Ben
Token ID3932
Feature activation-1.04e+00

Pos logit contributions
mine+3.955
talk+3.626
al+3.546
ede+3.463
x+3.421
Neg logit contributions
 glad-5.503
 lucky-4.811
 tired-4.485
 important-4.466
 hungry-4.328
Token!"
Token ID2474
Feature activation-1.35e+00

Pos logit contributions
 glad+2.179
 lucky+1.811
 supposed+1.810
 pleased+1.796
 important+1.791
Neg logit contributions
al-1.613
mine-1.604
als-1.472
of-1.453
ted-1.446
Token Mia
Token ID32189
Feature activation-1.26e+00

Pos logit contributions
mine+2.348
al+2.204
x+2.051
talk+2.049
als+2.011
Neg logit contributions
 glad-3.431
 lucky-2.970
 important-2.831
 pleased-2.767
 tired-2.726
Token says
Token ID1139
Feature activation-7.33e-01

Pos logit contributions
mine+2.560
al+2.403
talk+2.292
x+2.265
als+2.263
Neg logit contributions
 glad-2.756
 lucky-2.331
 important-2.231
 pleased-2.158
 supposed-2.141
Token.
Token ID13
Feature activation-8.02e-01

Pos logit contributions
mine+1.500
al+1.416
talk+1.355
als+1.328
ted+1.320
Neg logit contributions
 glad-1.593
 lucky-1.345
 important-1.290
 supposed-1.250
 pleased-1.247
Token She
Token ID1375
Feature activation-9.75e-02

Pos logit contributions
mine+1.890
al+1.795
talk+1.730
als+1.693
x+1.688
Neg logit contributions
 glad-1.505
 lucky-1.230
 important-1.168
 supposed-1.123
 pleased-1.121
Token follows
Token ID5679
Feature activation-4.72e+00

Pos logit contributions
mine+0.172
al+0.165
talk+0.154
als+0.153
ted+0.152
Neg logit contributions
 glad-0.235
 lucky-0.201
 important-0.196
 supposed-0.191
 pleased-0.190
Token Sam
Token ID3409
Feature activation-6.71e-01

Pos logit contributions
mine+9.224
al+9.022
x+8.793
als+8.745
talk+8.700
Neg logit contributions
 glad-11.416
 lucky-9.740
 tired-9.509
 hungry-9.470
 happy-9.401
Token to
Token ID284
Feature activation-1.72e+00

Pos logit contributions
mine+1.324
al+1.246
talk+1.192
als+1.166
ted+1.159
Neg logit contributions
 glad-1.500
 lucky-1.274
 important-1.224
 supposed-1.188
 pleased-1.184
Token the
Token ID262
Feature activation+2.69e-01

Pos logit contributions
mine+3.334
al+3.147
als+3.033
talk+3.012
ted+2.948
Neg logit contributions
 glad-4.085
 lucky-3.495
 important-3.255
 supposed-3.200
 pleased-3.171
Token cave
Token ID11527
Feature activation-3.84e-01

Pos logit contributions
 glad+0.637
 lucky+0.531
 supposed+0.525
 pleased+0.519
 important+0.519
Neg logit contributions
mine-0.489
al-0.471
talk-0.446
als-0.440
of-0.428
Token.
Token ID13
Feature activation-1.78e-01

Pos logit contributions
mine+0.696
al+0.651
talk+0.622
als+0.606
of+0.601
Neg logit contributions
 glad-0.915
 lucky-0.787
 important-0.757
 supposed-0.741
 pleased-0.735
Token!"
Token ID2474
Feature activation-2.13e+00

Pos logit contributions
mine+0.396
al+0.374
talk+0.356
of+0.349
als+0.347
Neg logit contributions
 glad-0.444
 lucky-0.375
 important-0.359
 pleased-0.351
 supposed-0.350
Token Lily
Token ID20037
Feature activation-1.29e+00

Pos logit contributions
mine+4.209
al+3.991
x+3.762
talk+3.718
als+3.679
Neg logit contributions
 glad-5.114
 lucky-4.454
 important-4.251
 pleased-4.083
 tired-4.075
Token says
Token ID1139
Feature activation-5.60e-01

Pos logit contributions
mine+2.638
al+2.484
talk+2.362
x+2.340
als+2.336
Neg logit contributions
 glad-2.811
 lucky-2.386
 important-2.296
 pleased-2.202
 supposed-2.189
Token.
Token ID13
Feature activation-8.03e-01

Pos logit contributions
mine+1.128
al+1.061
talk+1.017
als+0.994
ted+0.989
Neg logit contributions
 glad-1.231
 lucky-1.043
 important-1.002
 supposed-0.972
 pleased-0.968
Token She
Token ID1375
Feature activation+2.67e-01

Pos logit contributions
mine+1.556
al+1.463
talk+1.398
als+1.363
x+1.356
Neg logit contributions
 glad-1.838
 lucky-1.563
 important-1.501
 supposed-1.458
 pleased-1.455
Token follows
Token ID5679
Feature activation-4.65e+00

Pos logit contributions
 glad+0.658
 lucky+0.565
 important+0.553
 supposed+0.544
 pleased+0.540
Neg logit contributions
mine-0.450
al-0.433
ted-0.402
als-0.400
talk-0.399
Token him
Token ID683
Feature activation-9.65e-02

Pos logit contributions
mine+8.526
al+8.494
als+8.492
talk+8.107
x+7.754
Neg logit contributions
 glad-11.956
 lucky-10.314
 tired-10.106
 sad-9.947
 hungry-9.877
Token with
Token ID351
Feature activation-1.16e+00

Pos logit contributions
mine+0.186
al+0.173
talk+0.168
of+0.163
als+0.162
Neg logit contributions
 glad-0.220
 lucky-0.186
 important-0.180
 pleased-0.175
 supposed-0.174
Token her
Token ID607
Feature activation+8.10e-01

Pos logit contributions
mine+2.215
al+2.083
talk+1.999
als+1.947
x+1.944
Neg logit contributions
 glad-2.742
 lucky-2.349
 important-2.246
 pleased-2.170
 supposed-2.161
Token sled
Token ID46822
Feature activation-1.07e+00

Pos logit contributions
 glad+2.078
 lucky+1.785
 important+1.757
 supposed+1.756
 pleased+1.730
Neg logit contributions
mine-1.341
al-1.187
of-1.156
talk-1.149
ted-1.126
Token.
Token ID13
Feature activation+8.80e-02

Pos logit contributions
mine+2.204
al+2.095
talk+1.984
als+1.948
of+1.929
Neg logit contributions
 glad-2.297
 lucky-1.941
 important-1.875
 supposed-1.806
 pleased-1.785
Token of
Token ID286
Feature activation-1.66e+00

Pos logit contributions
mine+2.825
al+2.670
talk+2.574
ted+2.523
x+2.512
Neg logit contributions
 glad-2.822
 lucky-2.379
 important-2.305
 pleased-2.200
 supposed-2.196
Token the
Token ID262
Feature activation+2.70e-01

Pos logit contributions
mine+3.139
talk+2.962
al+2.946
als+2.800
of+2.765
Neg logit contributions
 glad-3.988
 lucky-3.404
 important-3.200
 pleased-3.114
 supposed-3.110
Token hill
Token ID12788
Feature activation-6.42e-01

Pos logit contributions
 glad+0.798
 lucky+0.698
 supposed+0.685
 important+0.684
 pleased+0.676
Neg logit contributions
mine-0.341
al-0.306
of-0.281
als-0.281
talk-0.278
Token.
Token ID13
Feature activation-1.65e-01

Pos logit contributions
mine+1.298
al+1.224
talk+1.172
als+1.147
ted+1.140
Neg logit contributions
 glad-1.401
 lucky-1.185
 important-1.138
 supposed-1.104
 pleased-1.100
Token Ben
Token ID3932
Feature activation-1.71e-01

Pos logit contributions
mine+0.324
al+0.305
talk+0.291
als+0.288
ted+0.287
Neg logit contributions
 glad-0.369
 lucky-0.316
 important-0.307
 supposed-0.298
 pleased-0.296
Token follows
Token ID5679
Feature activation-4.58e+00

Pos logit contributions
mine+0.302
al+0.286
talk+0.268
als+0.265
ted+0.264
Neg logit contributions
 glad-0.413
 lucky-0.355
 important-0.346
 supposed-0.338
 pleased-0.335
Token her
Token ID607
Feature activation-7.17e-02

Pos logit contributions
als+9.888
al+9.615
mine+9.581
x+9.436
ted+9.101
Neg logit contributions
 glad-10.822
 lucky-9.381
 happy-9.064
 sad-8.885
 tired-8.624
Token.
Token ID13
Feature activation+1.20e-01

Pos logit contributions
mine+0.136
al+0.126
talk+0.121
of+0.119
als+0.118
Neg logit contributions
 glad-0.167
 lucky-0.142
 important-0.138
 supposed-0.135
 pleased-0.134
Token They
Token ID1119
Feature activation-1.78e+00

Pos logit contributions
 glad+0.254
 lucky+0.216
 important+0.209
 supposed+0.206
 pleased+0.201
Neg logit contributions
mine-0.247
al-0.232
talk-0.226
ted-0.222
als-0.221
Token look
Token ID804
Feature activation+7.80e-01

Pos logit contributions
mine+3.108
al+3.005
talk+2.845
ted+2.836
als+2.766
Neg logit contributions
 glad-4.417
 lucky-3.807
 tired-3.571
 important-3.528
 pleased-3.510
Token down
Token ID866
Feature activation+1.28e-01

Pos logit contributions
 glad+1.684
 supposed+1.426
 lucky+1.410
 important+1.388
 pleased+1.335
Neg logit contributions
mine-1.507
al-1.405
ted-1.373
talk-1.336
of-1.335
Token to
Token ID284
Feature activation-2.17e+00

Pos logit contributions
mine+6.835
al+6.533
als+6.115
x+6.063
ede+6.055
Neg logit contributions
 glad-4.254
 lucky-3.485
 important-3.199
 tired-3.084
 supposed-3.062
Token the
Token ID262
Feature activation-7.00e-01

Pos logit contributions
mine+4.262
al+4.101
als+4.020
talk+4.005
ted+3.928
Neg logit contributions
 glad-5.110
 lucky-4.365
 happy-3.973
 important-3.961
 sad-3.954
Token box
Token ID3091
Feature activation-1.40e+00

Pos logit contributions
mine+1.133
al+1.050
talk+0.994
als+0.966
ted+0.960
Neg logit contributions
 glad-1.830
 lucky-1.595
 important-1.542
 supposed-1.503
 pleased-1.499
Token.
Token ID13
Feature activation-2.50e-01

Pos logit contributions
mine+2.550
al+2.412
talk+2.275
ted+2.253
als+2.248
Neg logit contributions
 glad-3.343
 lucky-2.853
 important-2.772
 pleased-2.693
 supposed-2.655
Token Jen
Token ID13364
Feature activation-1.12e+00

Pos logit contributions
mine+0.444
al+0.415
talk+0.397
als+0.389
ted+0.388
Neg logit contributions
 glad-0.604
 lucky-0.522
 important-0.505
 supposed-0.495
 pleased-0.489
Token follows
Token ID5679
Feature activation-4.56e+00

Pos logit contributions
mine+2.151
al+1.989
talk+1.920
als+1.858
x+1.855
Neg logit contributions
 glad-2.583
 lucky-2.207
 important-2.113
 supposed-2.050
 pleased-2.041
Token him
Token ID683
Feature activation-7.91e-01

Pos logit contributions
als+9.397
al+9.216
mine+9.043
ede+8.936
talk+8.831
Neg logit contributions
 glad-10.902
 lucky-9.342
 tired-8.915
 happy-8.858
 sad-8.839
Token.
Token ID13
Feature activation-6.37e-01

Pos logit contributions
mine+1.572
al+1.481
talk+1.413
als+1.386
ted+1.380
Neg logit contributions
 glad-1.763
 lucky-1.498
 important-1.437
 supposed-1.399
 pleased-1.388
Token
Token ID198
Feature activation+2.60e-01

Pos logit contributions
mine+1.282
al+1.208
talk+1.156
als+1.132
ted+1.126
Neg logit contributions
 glad-1.404
 lucky-1.190
 important-1.143
 supposed-1.109
 pleased-1.105
Token
Token ID198
Feature activation-3.39e+00

Pos logit contributions
 glad+0.442
 lucky+0.359
 important+0.342
 supposed+0.331
 pleased+0.327
Neg logit contributions
mine-0.649
al-0.606
talk-0.603
of-0.601
ted-0.579
TokenBut
Token ID1537
Feature activation-1.99e+00

Pos logit contributions
mine+6.720
al+6.541
x+6.190
als+6.155
ted+6.014
Neg logit contributions
 glad-8.456
 lucky-7.303
 important-7.018
 pleased-6.873
 happy-6.719
Token
Token ID198
Feature activation+1.02e+00

Pos logit contributions
 glad+0.092
 lucky+0.077
 important+0.073
 supposed+0.072
 pleased+0.070
Neg logit contributions
mine-0.113
al-0.106
talk-0.104
ted-0.102
of-0.101
Token
Token ID198
Feature activation-2.03e+00

Pos logit contributions
 glad+1.847
 lucky+1.545
 supposed+1.477
 important+1.475
 pleased+1.428
Neg logit contributions
mine-2.382
of-2.241
talk-2.214
al-2.205
ted-2.132
TokenSam
Token ID16305
Feature activation-7.71e-01

Pos logit contributions
mine+2.975
al+2.724
als+2.558
talk+2.431
ted+2.405
Neg logit contributions
 glad-5.966
 lucky-5.254
 important-5.133
 pleased-4.952
 supposed-4.849
Token tries
Token ID8404
Feature activation-4.18e-01

Pos logit contributions
mine+1.592
al+1.498
talk+1.436
als+1.407
ted+1.398
Neg logit contributions
 glad-1.655
 lucky-1.395
 important-1.338
 supposed-1.295
 pleased-1.289
Token to
Token ID284
Feature activation-3.30e+00

Pos logit contributions
mine+0.921
al+0.868
talk+0.843
of+0.821
ted+0.819
Neg logit contributions
 glad-0.841
 lucky-0.700
 important-0.666
 pleased-0.647
 supposed-0.645
Token scare
Token ID19437
Feature activation-4.56e+00

Pos logit contributions
als+6.682
mine+6.501
ted+6.464
al+6.463
talk+5.944
Neg logit contributions
 glad-7.632
 lucky-6.719
 happy-6.439
 sad-6.393
 important-6.353
Token things
Token ID1243
Feature activation-1.84e+00

Pos logit contributions
mine+10.239
talk+9.271
al+9.269
als+9.260
ede+8.902
Neg logit contributions
 glad-10.547
 lucky-9.151
 sad-8.849
 tired-8.523
 important-8.255
Token with
Token ID351
Feature activation-1.33e+00

Pos logit contributions
mine+3.706
al+3.529
talk+3.456
als+3.353
x+3.310
Neg logit contributions
 glad-4.091
 lucky-3.453
 important-3.313
 sad-3.209
 pleased-3.174
Token the
Token ID262
Feature activation+1.59e-01

Pos logit contributions
mine+2.291
al+2.139
talk+2.050
als+1.995
of+1.948
Neg logit contributions
 glad-3.347
 lucky-2.901
 important-2.810
 supposed-2.702
 pleased-2.685
Token lion
Token ID18744
Feature activation-4.80e-01

Pos logit contributions
 glad+0.427
 lucky+0.368
 pleased+0.355
 important+0.355
 supposed+0.352
Neg logit contributions
mine-0.241
al-0.226
talk-0.213
x-0.208
als-0.204
Token,
Token ID11
Feature activation-6.77e-01

Pos logit contributions
mine+0.915
al+0.862
talk+0.821
als+0.807
ted+0.800
Neg logit contributions
 glad-1.103
 lucky-0.939
 important-0.904
 supposed-0.880
 pleased-0.879
Token let
Token ID1309
Feature activation-1.44e+00

Pos logit contributions
mine+2.599
al+2.413
talk+2.353
ted+2.263
als+2.246
Neg logit contributions
 glad-3.018
 lucky-2.597
 important-2.476
 pleased-2.382
 supposed-2.330
Token's
Token ID338
Feature activation-8.66e-01

Pos logit contributions
mine+2.909
al+2.697
talk+2.646
x+2.565
als+2.552
Neg logit contributions
 glad-3.180
 lucky-2.768
 important-2.649
 supposed-2.530
 pleased-2.482
Token go
Token ID467
Feature activation-1.09e+00

Pos logit contributions
mine+1.287
al+1.190
talk+1.121
als+1.088
ted+1.086
Neg logit contributions
 glad-2.368
 lucky-2.079
 important-2.012
 supposed-1.964
 pleased-1.954
Token."
Token ID526
Feature activation-9.90e-01

Pos logit contributions
mine+2.377
al+2.253
talk+2.154
als+2.127
ted+2.116
Neg logit contributions
 glad-2.234
 lucky-1.869
 important-1.774
 supposed-1.722
 pleased-1.701
Token He
Token ID679
Feature activation-1.44e+00

Pos logit contributions
mine+2.468
al+2.360
talk+2.260
als+2.227
x+2.221
Neg logit contributions
 glad-1.741
 lucky-1.391
 important-1.307
 pleased-1.263
 supposed-1.244
Token followed
Token ID3940
Feature activation-4.52e+00

Pos logit contributions
mine+2.529
al+2.320
talk+2.205
x+2.148
als+2.130
Neg logit contributions
 glad-3.610
 lucky-3.137
 important-3.025
 pleased-2.957
 tired-2.884
Token Lily
Token ID20037
Feature activation-6.58e-01

Pos logit contributions
mine+10.307
al+9.791
als+9.523
talk+9.185
Give+9.011
Neg logit contributions
 glad-10.158
 lucky-8.757
 sad-8.471
 happy-8.312
 tired-8.302
Token up
Token ID510
Feature activation-1.41e+00

Pos logit contributions
mine+1.285
al+1.209
talk+1.156
als+1.131
ted+1.124
Neg logit contributions
 glad-1.482
 lucky-1.261
 important-1.213
 supposed-1.177
 pleased-1.173
Token the
Token ID262
Feature activation-2.32e-01

Pos logit contributions
mine+2.712
al+2.548
talk+2.434
als+2.412
x+2.389
Neg logit contributions
 glad-3.290
 lucky-2.797
 important-2.682
 pleased-2.631
 tired-2.572
Token ladder
Token ID18002
Feature activation-1.68e+00

Pos logit contributions
mine+0.271
al+0.242
talk+0.226
of+0.218
als+0.217
Neg logit contributions
 glad-0.708
 lucky-0.627
 important-0.613
 supposed-0.604
 pleased-0.599
Token and
Token ID290
Feature activation-1.93e+00

Pos logit contributions
mine+3.194
al+3.007
talk+2.822
als+2.800
ted+2.791
Neg logit contributions
 glad-3.932
 lucky-3.326
 important-3.225
 pleased-3.128
 tired-3.117
Token cave
Token ID11527
Feature activation+2.39e-02

Pos logit contributions
 glad+2.291
 supposed+1.822
 pleased+1.798
 lucky+1.726
 important+1.649
Neg logit contributions
al-2.789
mine-2.755
talk-2.702
als-2.652
ted-2.616
Token.
Token ID13
Feature activation+7.98e-01

Pos logit contributions
 glad+0.059
 lucky+0.051
 important+0.049
 supposed+0.049
 pleased+0.048
Neg logit contributions
mine-0.041
al-0.038
talk-0.036
als-0.035
ted-0.035
Token
Token ID198
Feature activation+5.12e-02

Pos logit contributions
 glad+1.761
 lucky+1.519
 important+1.483
 supposed+1.483
 pleased+1.421
Neg logit contributions
mine-1.521
al-1.448
ted-1.426
talk-1.402
als-1.392
Token
Token ID198
Feature activation-2.89e+00

Pos logit contributions
 glad+0.085
 lucky+0.069
 important+0.065
 supposed+0.064
 pleased+0.062
Neg logit contributions
mine-0.129
al-0.122
talk-0.121
of-0.119
ted-0.117
TokenSam
Token ID16305
Feature activation-8.34e-01

Pos logit contributions
mine+5.603
al+5.213
als+4.864
talk+4.753
ted+4.739
Neg logit contributions
 glad-7.460
 lucky-6.506
 important-6.309
 pleased-6.116
 happy-5.882
Token follows
Token ID5679
Feature activation-4.46e+00

Pos logit contributions
mine+1.589
al+1.483
talk+1.418
als+1.383
x+1.373
Neg logit contributions
 glad-1.926
 lucky-1.646
 important-1.582
 supposed-1.534
 pleased-1.531
Token him
Token ID683
Feature activation-5.84e-02

Pos logit contributions
mine+9.634
als+9.617
al+9.416
of+8.818
talk+8.688
Neg logit contributions
 glad-10.098
 lucky-8.675
 sad-8.193
 hungry-8.105
 important-8.099
Token.
Token ID13
Feature activation+9.32e-02

Pos logit contributions
mine+0.107
al+0.101
talk+0.097
of+0.094
als+0.094
Neg logit contributions
 glad-0.138
 lucky-0.117
 important-0.114
 pleased-0.111
 supposed-0.111
Token He
Token ID679
Feature activation+3.59e-01

Pos logit contributions
 glad+0.215
 lucky+0.185
 important+0.179
 supposed+0.176
 pleased+0.173
Neg logit contributions
mine-0.173
al-0.164
talk-0.158
ted-0.157
als-0.156
Token does
Token ID857
Feature activation+8.12e-01

Pos logit contributions
 glad+0.825
 lucky+0.699
 important+0.685
 supposed+0.675
 pleased+0.664
Neg logit contributions
mine-0.670
al-0.653
talk-0.617
als-0.611
ted-0.608
Token not
Token ID407
Feature activation-9.24e-01

Pos logit contributions
 glad+1.267
 lucky+0.982
 important+0.950
 supposed+0.935
 pleased+0.910
Neg logit contributions
mine-2.056
al-1.934
of-1.884
talk-1.874
ede-1.855
Token."
Token ID526
Feature activation-2.22e-01

Pos logit contributions
mine+1.082
al+1.023
talk+0.977
als+0.960
of+0.954
Neg logit contributions
 glad-1.085
 lucky-0.911
 important-0.873
 pleased-0.846
 supposed-0.845
Token
Token ID198
Feature activation+8.28e-01

Pos logit contributions
mine+0.563
al+0.535
talk+0.525
of+0.513
als+0.510
Neg logit contributions
 glad-0.370
 lucky-0.297
 important-0.283
 supposed-0.275
 pleased-0.268
Token
Token ID198
Feature activation-1.65e+00

Pos logit contributions
 glad+1.476
 lucky+1.223
 supposed+1.206
 important+1.189
 pleased+1.122
Neg logit contributions
mine-1.946
of-1.865
talk-1.837
al-1.833
ted-1.761
TokenM
Token ID44
Feature activation-1.30e+00

Pos logit contributions
mine+2.556
al+2.359
als+2.181
ted+2.150
talk+2.119
Neg logit contributions
 glad-4.606
 lucky-4.021
 important-3.889
 pleased-3.802
 supposed-3.709
Tokenia
Token ID544
Feature activation-1.81e+00

Pos logit contributions
mine+2.771
al+2.598
talk+2.471
ted+2.449
als+2.446
Neg logit contributions
 glad-2.777
 lucky-2.348
 important-2.262
 pleased-2.163
 supposed-2.152
Token chased
Token ID26172
Feature activation-4.45e+00

Pos logit contributions
mine+3.765
al+3.492
talk+3.387
x+3.378
als+3.290
Neg logit contributions
 glad-3.852
 lucky-3.264
 important-3.071
 sad-3.043
 pleased-3.022
Token after
Token ID706
Feature activation-3.05e+00

Pos logit contributions
mine+9.414
al+8.819
ede+8.474
x+8.417
als+8.339
Neg logit contributions
 glad-10.728
 lucky-9.443
 tired-8.888
 happy-8.881
 hungry-8.764
Token him
Token ID683
Feature activation-9.59e-01

Pos logit contributions
talk+5.526
mine+5.502
al+5.325
x+5.111
als+4.990
Neg logit contributions
 glad-7.810
 lucky-6.684
 sad-6.258
 tired-6.254
 happy-6.252
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
mine+1.948
al+1.846
talk+1.763
als+1.733
ted+1.717
Neg logit contributions
 glad-2.099
 lucky-1.777
 important-1.705
 supposed-1.653
 pleased-1.641
Token She
Token ID1375
Feature activation-1.38e+00

Pos logit contributions
mine+1.870
al+1.748
talk+1.658
x+1.613
of+1.608
Neg logit contributions
 glad-2.562
 lucky-2.194
 important-2.109
 supposed-2.057
 pleased-2.047
Token wanted
Token ID2227
Feature activation-1.54e+00

Pos logit contributions
mine+2.696
al+2.505
talk+2.403
als+2.336
x+2.334
Neg logit contributions
 glad-3.151
 lucky-2.702
 important-2.577
 pleased-2.503
 supposed-2.470
Token knees
Token ID14475
Feature activation+6.98e-01

Pos logit contributions
mine+1.573
al+1.459
talk+1.374
als+1.340
ted+1.339
Neg logit contributions
 glad-2.423
 lucky-2.110
 important-2.037
 supposed-1.975
 pleased-1.970
Token."
Token ID526
Feature activation-8.87e-01

Pos logit contributions
 glad+1.679
 lucky+1.446
 supposed+1.399
 important+1.387
 pleased+1.382
Neg logit contributions
mine-1.256
al-1.155
talk-1.086
of-1.086
als-1.085
Token
Token ID198
Feature activation+6.13e-01

Pos logit contributions
mine+2.423
al+2.320
talk+2.240
als+2.207
x+2.198
Neg logit contributions
 glad-1.348
 lucky-1.041
 important-0.968
 pleased-0.919
 supposed-0.912
Token
Token ID198
Feature activation-2.94e+00

Pos logit contributions
 glad+1.073
 lucky+0.883
 supposed+0.857
 important+0.851
 pleased+0.809
Neg logit contributions
mine-1.479
of-1.395
al-1.394
talk-1.371
ted-1.337
TokenSam
Token ID16305
Feature activation-1.39e+00

Pos logit contributions
mine+5.094
al+4.655
als+4.333
ted+4.280
talk+4.110
Neg logit contributions
 glad-8.203
 lucky-7.092
 important-6.853
 pleased-6.748
 happy-6.593
Token followed
Token ID3940
Feature activation-4.44e+00

Pos logit contributions
mine+2.623
al+2.425
talk+2.328
x+2.273
als+2.242
Neg logit contributions
 glad-3.262
 lucky-2.802
 important-2.684
 pleased-2.621
 supposed-2.572
Token them
Token ID606
Feature activation-5.11e-01

Pos logit contributions
mine+9.464
al+8.856
als+8.766
ede+8.413
talk+8.294
Neg logit contributions
 glad-10.415
 lucky-9.086
 important-8.677
 sad-8.582
 tired-8.496
Token into
Token ID656
Feature activation-1.18e+00

Pos logit contributions
mine+0.931
al+0.873
talk+0.832
als+0.814
ted+0.809
Neg logit contributions
 glad-1.216
 lucky-1.042
 important-1.006
 supposed-0.979
 pleased-0.978
Token the
Token ID262
Feature activation-2.28e-01

Pos logit contributions
mine+2.146
al+1.999
talk+1.922
als+1.884
x+1.850
Neg logit contributions
 glad-2.890
 lucky-2.490
 important-2.388
 pleased-2.321
 supposed-2.300
Token tunnel
Token ID13275
Feature activation-6.24e-01

Pos logit contributions
mine+0.467
al+0.447
talk+0.428
als+0.419
of+0.416
Neg logit contributions
 glad-0.488
 lucky-0.407
 important-0.392
 supposed-0.388
 pleased-0.384
Token.
Token ID13
Feature activation-3.77e-01

Pos logit contributions
mine+1.171
al+1.099
talk+1.049
als+1.025
ted+1.019
Neg logit contributions
 glad-1.452
 lucky-1.242
 important-1.196
 supposed-1.164
 pleased-1.159
Token."
Token ID526
Feature activation+5.88e-01

Pos logit contributions
 glad+0.227
 lucky+0.191
 important+0.184
 pleased+0.181
 supposed+0.178
Neg logit contributions
mine-0.219
al-0.211
talk-0.200
als-0.198
ted-0.195
Token
Token ID198
Feature activation+1.33e+00

Pos logit contributions
 glad+1.031
 lucky+0.858
 supposed+0.852
 important+0.835
 pleased+0.771
Neg logit contributions
mine-1.400
al-1.327
talk-1.307
of-1.297
ted-1.279
Token
Token ID198
Feature activation-2.05e+00

Pos logit contributions
 glad+2.489
 supposed+2.105
 lucky+2.104
 important+2.008
 pleased+1.937
Neg logit contributions
mine-2.989
of-2.900
al-2.815
talk-2.794
ted-2.701
TokenL
Token ID43
Feature activation-2.12e+00

Pos logit contributions
mine+3.268
al+2.920
als+2.732
ted+2.717
talk+2.681
Neg logit contributions
 glad-5.737
 lucky-5.024
 important-4.930
 pleased-4.782
 happy-4.628
Tokenily
Token ID813
Feature activation-1.05e+00

Pos logit contributions
mine+3.841
talk+3.522
al+3.472
als+3.439
ted+3.364
Neg logit contributions
 glad-5.211
 lucky-4.485
 important-4.371
 supposed-4.354
 pleased-4.133
Token followed
Token ID3940
Feature activation-4.41e+00

Pos logit contributions
mine+1.927
al+1.787
talk+1.718
x+1.665
als+1.660
Neg logit contributions
 glad-2.500
 lucky-2.147
 important-2.054
 pleased-2.013
 supposed-1.984
Token Sam
Token ID3409
Feature activation-1.78e-01

Pos logit contributions
mine+8.750
al+8.238
als+8.075
talk+7.957
x+7.470
Neg logit contributions
 glad-11.091
 lucky-9.614
 tired-9.276
 sad-9.140
 hungry-8.947
Token.
Token ID13
Feature activation-5.03e-01

Pos logit contributions
mine+0.334
al+0.315
talk+0.302
of+0.295
als+0.294
Neg logit contributions
 glad-0.414
 lucky-0.353
 important-0.342
 supposed-0.333
 pleased-0.332
Token She
Token ID1375
Feature activation-2.20e-01

Pos logit contributions
mine+0.862
al+0.805
talk+0.763
als+0.748
ted+0.743
Neg logit contributions
 glad-1.254
 lucky-1.088
 important-1.053
 supposed-1.026
 pleased-1.021
Token was
Token ID373
Feature activation+2.92e+00

Pos logit contributions
mine+0.422
al+0.402
talk+0.380
als+0.375
ted+0.374
Neg logit contributions
 glad-0.499
 lucky-0.425
 important-0.412
 supposed-0.403
 pleased-0.396
Token happy
Token ID3772
Feature activation+4.63e-01

Pos logit contributions
 glad+5.217
 fault+4.743
 important+4.686
 valuable+4.631
 tempting+4.543
Neg logit contributions
mine-5.989
al-5.910
ting-5.382
Give-5.302
ted-5.262
Token!"
Token ID2474
Feature activation-1.97e+00

Pos logit contributions
 glad+1.991
 lucky+1.691
 important+1.639
 supposed+1.624
 pleased+1.610
Neg logit contributions
mine-1.529
al-1.403
of-1.350
talk-1.345
als-1.328
Token Sam
Token ID3409
Feature activation-9.01e-01

Pos logit contributions
mine+3.524
al+3.355
x+3.126
talk+3.045
als+3.034
Neg logit contributions
 glad-5.059
 lucky-4.398
 important-4.204
 tired-4.124
 pleased-4.116
Token says
Token ID1139
Feature activation-3.61e-01

Pos logit contributions
mine+1.755
al+1.652
talk+1.578
als+1.542
x+1.538
Neg logit contributions
 glad-2.043
 lucky-1.740
 important-1.671
 pleased-1.619
 supposed-1.617
Token.
Token ID13
Feature activation-3.44e-02

Pos logit contributions
mine+0.714
al+0.667
talk+0.642
als+0.624
of+0.622
Neg logit contributions
 glad-0.804
 lucky-0.683
 important-0.657
 supposed-0.640
 pleased-0.634
Token He
Token ID679
Feature activation+1.32e-01

Pos logit contributions
mine+0.076
al+0.071
talk+0.069
ted+0.068
als+0.068
Neg logit contributions
 glad-0.068
 lucky-0.057
 important-0.055
 supposed-0.054
 pleased-0.053
Token follows
Token ID5679
Feature activation-4.36e+00

Pos logit contributions
 glad+0.316
 lucky+0.271
 important+0.263
 supposed+0.257
 pleased+0.256
Neg logit contributions
mine-0.234
al-0.221
talk-0.208
als-0.207
of-0.207
Token Tom
Token ID4186
Feature activation-2.17e-01

Pos logit contributions
als+9.334
al+9.262
mine+9.247
x+8.577
ede+8.461
Neg logit contributions
 glad-9.866
 lucky-8.416
 sad-8.171
 happy-7.986
 tired-7.982
Token and
Token ID290
Feature activation-1.22e+00

Pos logit contributions
mine+0.396
al+0.369
talk+0.354
of+0.345
als+0.344
Neg logit contributions
 glad-0.514
 lucky-0.441
 important-0.427
 pleased-0.414
 supposed-0.413
Token slides
Token ID19392
Feature activation-1.22e+00

Pos logit contributions
mine+2.284
al+2.117
talk+2.029
x+1.971
als+1.961
Neg logit contributions
 glad-2.922
 lucky-2.505
 important-2.382
 pleased-2.328
 supposed-2.309
Token down
Token ID866
Feature activation-1.05e+00

Pos logit contributions
mine+2.464
al+2.355
talk+2.238
als+2.178
x+2.174
Neg logit contributions
 glad-2.691
 lucky-2.275
 important-2.175
 pleased-2.108
 supposed-2.107
Token.
Token ID13
Feature activation+5.22e-01

Pos logit contributions
mine+2.246
al+2.129
talk+2.044
als+2.008
x+2.000
Neg logit contributions
 glad-2.193
 lucky-1.835
 important-1.758
 pleased-1.694
 supposed-1.690
Token wrong
Token ID2642
Feature activation-3.53e-01

Pos logit contributions
 glad+1.738
 pleased+1.304
 lucky+1.294
 important+1.229
 valuable+1.173
Neg logit contributions
mine-3.176
al-2.976
talk-2.854
x-2.819
ted-2.806
Token.
Token ID13
Feature activation-8.73e-01

Pos logit contributions
mine+0.676
al+0.637
talk+0.607
als+0.592
ted+0.589
Neg logit contributions
 glad-0.808
 lucky-0.687
 important-0.658
 pleased-0.643
 supposed-0.643
Token They
Token ID1119
Feature activation-9.80e-01

Pos logit contributions
mine+1.638
al+1.529
talk+1.452
of+1.420
als+1.419
Neg logit contributions
 glad-2.067
 lucky-1.769
 important-1.704
 supposed-1.651
 pleased-1.644
Token tried
Token ID3088
Feature activation-3.69e-01

Pos logit contributions
mine+2.092
al+1.983
talk+1.904
als+1.872
x+1.853
Neg logit contributions
 glad-2.053
 lucky-1.717
 important-1.630
 pleased-1.583
 supposed-1.574
Token to
Token ID284
Feature activation-3.31e+00

Pos logit contributions
mine+0.810
al+0.763
talk+0.741
ted+0.721
of+0.721
Neg logit contributions
 glad-0.743
 lucky-0.620
 important-0.592
 pleased-0.573
 supposed-0.572
Token comfort
Token ID4467
Feature activation-4.36e+00

Pos logit contributions
als+7.629
mine+7.475
ted+7.421
al+7.375
x+6.882
Neg logit contributions
 glad-7.048
 lucky-6.078
 happy-5.768
 important-5.619
 sad-5.600
Token them
Token ID606
Feature activation-1.49e+00

Pos logit contributions
ede+9.038
mine+9.012
 cash+8.407
al+7.926
talk+7.859
Neg logit contributions
 glad-10.377
 lucky-9.040
 important-8.529
 sad-8.146
 happy-8.041
Token and
Token ID290
Feature activation-1.40e+00

Pos logit contributions
mine+3.055
al+2.886
talk+2.800
als+2.726
ted+2.688
Neg logit contributions
 glad-3.295
 lucky-2.805
 important-2.706
 supposed-2.553
 pleased-2.551
Token explain
Token ID4727
Feature activation-7.59e-01

Pos logit contributions
mine+2.516
al+2.346
talk+2.190
als+2.187
x+2.164
Neg logit contributions
 glad-3.474
 lucky-2.988
 important-2.889
 pleased-2.805
 supposed-2.770
Token that
Token ID326
Feature activation-6.25e-01

Pos logit contributions
mine+1.576
al+1.485
talk+1.424
als+1.394
ted+1.384
Neg logit contributions
 glad-1.631
 lucky-1.375
 important-1.323
 supposed-1.276
 pleased-1.271
Token sometimes
Token ID3360
Feature activation+6.70e-01

Pos logit contributions
mine+0.979
al+0.906
talk+0.855
als+0.832
ted+0.826
Neg logit contributions
 glad-1.650
 lucky-1.439
 important-1.393
 supposed-1.361
 pleased-1.357
Token cries
Token ID24691
Feature activation-5.19e-01

Pos logit contributions
 glad+0.062
 lucky+0.053
 important+0.051
 supposed+0.050
 pleased+0.049
Neg logit contributions
mine-0.054
al-0.051
talk-0.049
als-0.048
ted-0.048
Token.
Token ID13
Feature activation+3.60e-02

Pos logit contributions
mine+1.042
al+0.984
talk+0.940
als+0.924
ted+0.918
Neg logit contributions
 glad-1.140
 lucky-0.964
 important-0.926
 supposed-0.902
 pleased-0.898
Token
Token ID198
Feature activation+9.26e-01

Pos logit contributions
 glad+0.081
 lucky+0.068
 important+0.066
 supposed+0.066
 pleased+0.064
Neg logit contributions
mine-0.070
al-0.067
talk-0.064
ted-0.063
als-0.063
Token
Token ID198
Feature activation-2.13e+00

Pos logit contributions
 glad+1.689
 lucky+1.407
 important+1.355
 supposed+1.346
 pleased+1.304
Neg logit contributions
mine-2.172
of-2.059
talk-2.029
al-2.019
ted-1.942
TokenMom
Token ID29252
Feature activation-6.78e-01

Pos logit contributions
mine+3.305
al+2.955
als+2.731
talk+2.722
x+2.685
Neg logit contributions
 glad-6.064
 lucky-5.372
 important-5.223
 pleased-5.027
 tired-4.959
Token hugs
Token ID40657
Feature activation-4.33e+00

Pos logit contributions
mine+1.231
al+1.152
talk+1.097
als+1.071
ted+1.064
Neg logit contributions
 glad-1.626
 lucky-1.397
 important-1.347
 supposed-1.311
 pleased-1.307
Token Sam
Token ID3409
Feature activation+4.96e-01

Pos logit contributions
mine+8.672
talk+8.433
al+8.022
ede+7.924
 cash+7.783
Neg logit contributions
 glad-10.662
 lucky-8.896
 tired-8.668
 sad-8.437
 happy-8.351
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
 glad+1.318
 lucky+1.156
 important+1.119
 supposed+1.117
 pleased+1.102
Neg logit contributions
mine-0.764
al-0.709
of-0.657
talk-0.651
ted-0.646
Token tells
Token ID4952
Feature activation-2.98e+00

Pos logit contributions
mine+1.865
al+1.723
talk+1.641
als+1.590
x+1.586
Neg logit contributions
 glad-2.553
 lucky-2.201
 important-2.114
 supposed-2.049
 pleased-2.042
Token him
Token ID683
Feature activation+8.36e-01

Pos logit contributions
mine+4.151
al+3.844
talk+3.755
x+3.575
als+3.545
Neg logit contributions
 glad-8.858
 lucky-7.857
 important-7.624
 sad-7.493
 tired-7.418
Token it
Token ID340
Feature activation+1.34e+00

Pos logit contributions
 glad+1.848
 lucky+1.551
 supposed+1.508
 important+1.499
 pleased+1.494
Neg logit contributions
mine-1.570
al-1.477
of-1.433
als-1.425
talk-1.415
Token house
Token ID2156
Feature activation-8.19e-01

Pos logit contributions
 glad+2.192
 lucky+1.824
 pleased+1.817
 supposed+1.765
 important+1.747
Neg logit contributions
mine-1.583
al-1.490
talk-1.402
als-1.392
x-1.364
Token.
Token ID13
Feature activation+9.16e-01

Pos logit contributions
mine+1.670
al+1.578
talk+1.511
als+1.480
ted+1.474
Neg logit contributions
 glad-1.770
 lucky-1.496
 important-1.438
 supposed-1.391
 pleased-1.385
Token
Token ID198
Feature activation+1.24e+00

Pos logit contributions
 glad+2.287
 supposed+2.012
 lucky+2.009
 important+1.949
 pleased+1.922
Neg logit contributions
mine-1.455
talk-1.358
al-1.353
ted-1.305
als-1.276
Token
Token ID198
Feature activation-1.93e+00

Pos logit contributions
 glad+2.259
 lucky+1.907
 supposed+1.907
 important+1.839
 pleased+1.792
Neg logit contributions
mine-2.795
talk-2.654
of-2.645
al-2.611
ted-2.540
TokenMom
Token ID29252
Feature activation-9.67e-01

Pos logit contributions
mine+3.312
al+3.035
als+2.874
ted+2.776
x+2.754
Neg logit contributions
 glad-5.287
 lucky-4.575
 important-4.420
 pleased-4.277
 supposed-4.119
Token hugs
Token ID40657
Feature activation-4.32e+00

Pos logit contributions
mine+1.969
al+1.849
talk+1.768
als+1.737
x+1.724
Neg logit contributions
 glad-2.122
 lucky-1.787
 important-1.708
 supposed-1.658
 pleased-1.653
Token them
Token ID606
Feature activation-1.49e+00

Pos logit contributions
ede+8.303
al+8.168
mine+8.061
als+7.999
talk+7.952
Neg logit contributions
 glad-11.136
 lucky-9.159
 sad-8.880
 happy-8.758
 important-8.720
Token and
Token ID290
Feature activation-8.65e-01

Pos logit contributions
mine+2.855
al+2.714
talk+2.604
x+2.543
als+2.516
Neg logit contributions
 glad-3.497
 lucky-2.962
 important-2.816
 tired-2.745
 pleased-2.743
Token says
Token ID1139
Feature activation-7.31e-02

Pos logit contributions
mine+1.715
al+1.605
talk+1.534
als+1.500
x+1.491
Neg logit contributions
 glad-1.964
 lucky-1.668
 important-1.600
 supposed-1.547
 pleased-1.546
Token she
Token ID673
Feature activation+1.02e+00

Pos logit contributions
mine+0.108
al+0.099
ted+0.091
talk+0.091
als+0.090
Neg logit contributions
 glad-0.198
 lucky-0.174
 important-0.169
 supposed-0.168
 pleased-0.166
Token is
Token ID318
Feature activation+2.60e+00

Pos logit contributions
 glad+1.874
 lucky+1.506
 pleased+1.496
 important+1.429
 supposed+1.386
Neg logit contributions
mine-2.346
al-2.304
talk-2.210
ted-2.207
x-2.163
Token She
Token ID1375
Feature activation+1.73e-01

Pos logit contributions
 glad+0.239
 lucky+0.208
 important+0.203
 supposed+0.197
 pleased+0.195
Neg logit contributions
mine-0.174
al-0.163
ted-0.156
talk-0.155
als-0.155
Token looked
Token ID3114
Feature activation+1.56e+00

Pos logit contributions
 glad+0.395
 lucky+0.336
 important+0.330
 supposed+0.323
 pleased+0.315
Neg logit contributions
mine-0.325
al-0.313
talk-0.294
als-0.292
ted-0.291
Token anxious
Token ID18116
Feature activation-1.77e-01

Pos logit contributions
 glad+3.609
 supposed+3.173
 important+3.083
 lucky+3.061
 fault+3.004
Neg logit contributions
mine-2.711
al-2.555
ted-2.551
als-2.421
talk-2.382
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
mine+0.341
al+0.320
talk+0.306
als+0.301
ted+0.300
Neg logit contributions
 glad-0.405
 lucky-0.344
 important-0.329
 pleased-0.323
 supposed-0.321
Token She
Token ID1375
Feature activation+2.68e-01

Pos logit contributions
 glad+0.271
 lucky+0.226
 important+0.220
 supposed+0.214
 pleased+0.210
Neg logit contributions
mine-0.308
al-0.293
talk-0.281
ted-0.281
als-0.281
Token hugged
Token ID41130
Feature activation-4.29e+00

Pos logit contributions
 glad+0.619
 lucky+0.526
 important+0.512
 supposed+0.506
 pleased+0.496
Neg logit contributions
mine-0.492
al-0.472
talk-0.445
of-0.441
ted-0.440
Token them
Token ID606
Feature activation-1.71e+00

Pos logit contributions
talk+8.388
ede+8.320
mine+8.320
al+8.236
als+7.869
Neg logit contributions
 glad-10.886
 lucky-9.216
 sad-8.734
 happy-8.628
 tired-8.531
Token and
Token ID290
Feature activation-1.30e+00

Pos logit contributions
mine+3.376
al+3.260
talk+3.090
als+3.013
ted+2.993
Neg logit contributions
 glad-3.871
 lucky-3.291
 important-3.162
 tired-3.048
 supposed-3.004
Token took
Token ID1718
Feature activation-3.20e+00

Pos logit contributions
mine+2.716
al+2.548
talk+2.431
als+2.387
x+2.385
Neg logit contributions
 glad-2.846
 lucky-2.401
 important-2.292
 pleased-2.220
 supposed-2.187
Token them
Token ID606
Feature activation-9.49e-01

Pos logit contributions
mine+7.518
al+6.848
ede+6.827
x+6.605
of+6.529
Neg logit contributions
 glad-6.577
 lucky-5.865
 important-5.526
 supposed-5.138
 tired-5.088
Token to
Token ID284
Feature activation-2.44e-01

Pos logit contributions
mine+2.098
al+1.996
talk+1.894
als+1.876
ted+1.860
Neg logit contributions
 glad-1.904
 lucky-1.593
 important-1.523
 supposed-1.476
 pleased-1.446
Token!"
Token ID2474
Feature activation-2.70e+00

Pos logit contributions
 glad+1.018
 lucky+0.838
 important+0.789
 pleased+0.784
 supposed+0.763
Neg logit contributions
mine-1.110
al-1.042
talk-1.034
of-1.000
als-0.975
Token Sam
Token ID3409
Feature activation-1.06e+00

Pos logit contributions
mine+5.030
al+4.816
x+4.473
talk+4.342
als+4.313
Neg logit contributions
 glad-6.963
 lucky-6.107
 important-5.832
 tired-5.770
 sad-5.759
Token says
Token ID1139
Feature activation-6.67e-01

Pos logit contributions
mine+2.100
al+1.979
talk+1.886
als+1.848
x+1.847
Neg logit contributions
 glad-2.376
 lucky-2.019
 important-1.942
 pleased-1.876
 supposed-1.872
Token.
Token ID13
Feature activation-8.79e-02

Pos logit contributions
mine+1.355
al+1.278
talk+1.224
als+1.198
ted+1.191
Neg logit contributions
 glad-1.454
 lucky-1.229
 important-1.180
 supposed-1.145
 pleased-1.141
Token He
Token ID679
Feature activation+4.06e-02

Pos logit contributions
mine+0.180
al+0.169
talk+0.163
ted+0.160
als+0.159
Neg logit contributions
 glad-0.189
 lucky-0.160
 important-0.154
 supposed-0.152
 pleased-0.148
Token follows
Token ID5679
Feature activation-4.29e+00

Pos logit contributions
 glad+0.098
 lucky+0.084
 important+0.082
 supposed+0.080
 pleased+0.080
Neg logit contributions
mine-0.072
al-0.068
talk-0.064
als-0.063
ted-0.063
Token Tim
Token ID5045
Feature activation-5.04e-01

Pos logit contributions
als+9.283
mine+9.266
al+9.191
of+8.503
talk+8.474
Neg logit contributions
 glad-9.662
 lucky-8.228
 sad-8.061
 happy-7.837
 important-7.832
Token and
Token ID290
Feature activation-1.21e+00

Pos logit contributions
mine+0.924
al+0.863
talk+0.824
als+0.803
of+0.800
Neg logit contributions
 glad-1.197
 lucky-1.028
 important-0.992
 supposed-0.962
 pleased-0.962
Token slides
Token ID19392
Feature activation-1.04e+00

Pos logit contributions
mine+2.284
al+2.114
talk+2.028
x+1.970
als+1.962
Neg logit contributions
 glad-2.867
 lucky-2.456
 important-2.334
 pleased-2.276
 supposed-2.265
Token down
Token ID866
Feature activation-9.30e-01

Pos logit contributions
mine+2.044
al+1.943
talk+1.851
als+1.801
x+1.793
Neg logit contributions
 glad-2.315
 lucky-1.965
 important-1.884
 supposed-1.827
 pleased-1.825
Token.
Token ID13
Feature activation+6.78e-01

Pos logit contributions
mine+1.960
al+1.855
talk+1.780
als+1.747
x+1.736
Neg logit contributions
 glad-1.967
 lucky-1.651
 important-1.583
 pleased-1.527
 supposed-1.527
Token again
Token ID757
Feature activation+5.74e-01

Pos logit contributions
mine+0.011
al+0.010
talk+0.010
als+0.009
of+0.009
Neg logit contributions
 glad-0.013
 lucky-0.011
 important-0.010
 supposed-0.010
 pleased-0.010
Token."
Token ID526
Feature activation-9.74e-02

Pos logit contributions
 glad+1.388
 lucky+1.182
 supposed+1.134
 pleased+1.124
 important+1.123
Neg logit contributions
mine-1.020
al-0.977
talk-0.907
of-0.901
als-0.894
Token
Token ID198
Feature activation+1.45e+00

Pos logit contributions
mine+0.249
al+0.236
talk+0.231
of+0.226
ted+0.226
Neg logit contributions
 glad-0.158
 lucky-0.128
 important-0.123
 supposed-0.119
 pleased-0.115
Token
Token ID198
Feature activation-1.83e+00

Pos logit contributions
 glad+2.675
 lucky+2.278
 supposed+2.244
 important+2.204
 pleased+2.121
Neg logit contributions
mine-3.284
of-3.148
al-3.062
talk-3.037
ted-2.928
TokenMom
Token ID29252
Feature activation-9.86e-01

Pos logit contributions
mine+2.517
al+2.322
als+2.187
ted+2.114
talk+2.072
Neg logit contributions
 glad-5.460
 lucky-4.788
 important-4.661
 pleased-4.543
 supposed-4.460
Token hugs
Token ID40657
Feature activation-4.29e+00

Pos logit contributions
mine+2.145
al+2.014
talk+1.944
als+1.903
x+1.896
Neg logit contributions
 glad-2.021
 lucky-1.678
 important-1.596
 pleased-1.549
 supposed-1.549
Token them
Token ID606
Feature activation-1.41e+00

Pos logit contributions
talk+8.408
mine+8.386
ede+8.372
al+8.210
als+8.069
Neg logit contributions
 glad-10.440
 lucky-8.666
 sad-8.150
 important-8.147
 happy-7.984
Token.
Token ID13
Feature activation+2.34e-01

Pos logit contributions
mine+2.694
al+2.534
talk+2.451
x+2.385
als+2.345
Neg logit contributions
 glad-3.302
 lucky-2.812
 important-2.678
 pleased-2.606
 tired-2.595
Token "
Token ID366
Feature activation-1.96e+00

Pos logit contributions
 glad+0.532
 lucky+0.458
 important+0.442
 supposed+0.438
 pleased+0.433
Neg logit contributions
mine-0.435
al-0.412
ted-0.395
als-0.394
talk-0.394
TokenThat
Token ID2504
Feature activation-4.19e-01

Pos logit contributions
mine+3.474
al+3.190
talk+2.986
als+2.955
x+2.955
Neg logit contributions
 glad-5.129
 lucky-4.407
 important-4.262
 pleased-4.100
 supposed-4.009
Token's
Token ID338
Feature activation+7.45e-01

Pos logit contributions
mine+0.815
al+0.773
talk+0.734
als+0.722
ted+0.715
Neg logit contributions
 glad-0.958
 lucky-0.811
 important-0.774
 pleased-0.761
 supposed-0.758
Token
Token ID198
Feature activation-2.93e+00

Pos logit contributions
 glad+0.440
 lucky+0.359
 supposed+0.345
 important+0.344
 pleased+0.323
Neg logit contributions
mine-0.654
al-0.619
of-0.614
talk-0.607
ted-0.592
TokenSam
Token ID16305
Feature activation-2.26e+00

Pos logit contributions
mine+5.253
al+4.879
als+4.549
ted+4.539
x+4.329
Neg logit contributions
 glad-8.052
 lucky-6.898
 important-6.701
 pleased-6.671
 happy-6.504
Token smiled
Token ID13541
Feature activation+8.50e-01

Pos logit contributions
mine+4.625
al+4.211
talk+4.063
x+4.016
als+3.916
Neg logit contributions
 glad-4.962
 lucky-4.238
 important-4.025
 pleased-3.987
 amazed-3.889
Token.
Token ID13
Feature activation-1.43e-02

Pos logit contributions
 glad+2.206
 supposed+1.938
 lucky+1.902
 important+1.862
 pleased+1.815
Neg logit contributions
mine-1.296
al-1.252
als-1.160
ted-1.155
talk-1.143
Token He
Token ID679
Feature activation-3.95e-01

Pos logit contributions
mine+0.027
al+0.026
talk+0.025
als+0.025
ted+0.025
Neg logit contributions
 glad-0.032
 lucky-0.028
 important-0.027
 supposed-0.026
 pleased-0.026
Token followed
Token ID3940
Feature activation-4.28e+00

Pos logit contributions
mine+0.737
al+0.698
talk+0.663
als+0.652
ted+0.649
Neg logit contributions
 glad-0.922
 lucky-0.785
 important-0.758
 supposed-0.743
 pleased-0.734
Token Tom
Token ID4186
Feature activation-4.69e-01

Pos logit contributions
mine+9.433
al+9.100
als+8.810
talk+8.471
ede+8.453
Neg logit contributions
 glad-9.669
 lucky-8.220
 sad-7.955
 important-7.804
 tired-7.744
Token and
Token ID290
Feature activation-1.84e+00

Pos logit contributions
mine+0.898
al+0.846
talk+0.808
als+0.790
of+0.786
Neg logit contributions
 glad-1.076
 lucky-0.916
 important-0.883
 supposed-0.858
 pleased-0.857
Token they
Token ID484
Feature activation-2.00e+00

Pos logit contributions
mine+3.432
al+3.087
talk+2.964
x+2.904
of+2.885
Neg logit contributions
 glad-4.542
 lucky-3.918
 important-3.691
 pleased-3.657
 tired-3.591
Token looked
Token ID3114
Feature activation+1.03e+00

Pos logit contributions
mine+3.541
al+3.316
talk+3.106
x+3.047
of+3.026
Neg logit contributions
 glad-4.943
 lucky-4.286
 tired-4.064
 important-4.012
 pleased-3.962
Token for
Token ID329
Feature activation-1.82e+00

Pos logit contributions
 glad+2.260
 supposed+1.949
 lucky+1.885
 important+1.861
 pleased+1.815
Neg logit contributions
mine-1.937
al-1.885
ted-1.816
als-1.809
talk-1.765
Token pull
Token ID2834
Feature activation-2.68e+00

Pos logit contributions
mine+1.843
al+1.717
talk+1.644
als+1.606
ted+1.594
Neg logit contributions
 glad-2.740
 lucky-2.381
 important-2.294
 supposed-2.234
 pleased-2.227
Token some
Token ID617
Feature activation-3.95e-01

Pos logit contributions
mine+5.495
talk+5.080
al+4.994
x+4.774
als+4.747
Neg logit contributions
 glad-6.148
 lucky-5.413
 important-5.163
 sad-4.996
 tired-4.942
Token beans
Token ID16567
Feature activation-1.07e+00

Pos logit contributions
mine+0.716
al+0.677
talk+0.637
als+0.631
ted+0.626
Neg logit contributions
 glad-0.947
 lucky-0.808
 important-0.777
 supposed-0.765
 pleased-0.759
Token.
Token ID13
Feature activation-4.67e-01

Pos logit contributions
mine+2.152
al+2.025
talk+1.945
ted+1.894
als+1.894
Neg logit contributions
 glad-2.348
 lucky-2.000
 important-1.920
 supposed-1.862
 pleased-1.846
Token Sam
Token ID3409
Feature activation-1.24e+00

Pos logit contributions
mine+0.892
al+0.838
talk+0.802
als+0.783
ted+0.782
Neg logit contributions
 glad-1.075
 lucky-0.918
 important-0.884
 supposed-0.863
 pleased-0.856
Token followed
Token ID3940
Feature activation-4.28e+00

Pos logit contributions
mine+2.454
al+2.280
talk+2.224
x+2.148
of+2.134
Neg logit contributions
 glad-2.763
 lucky-2.359
 important-2.255
 pleased-2.189
 supposed-2.172
Token him
Token ID683
Feature activation-1.27e+00

Pos logit contributions
mine+9.526
al+9.237
als+9.183
talk+8.768
ede+8.582
Neg logit contributions
 glad-9.927
 lucky-8.582
 sad-8.343
 happy-8.121
 important-8.004
Token,
Token ID11
Feature activation+1.74e-01

Pos logit contributions
mine+2.555
al+2.397
talk+2.296
als+2.250
ted+2.233
Neg logit contributions
 glad-2.823
 lucky-2.407
 important-2.302
 supposed-2.249
 pleased-2.206
Token but
Token ID475
Feature activation-1.75e+00

Pos logit contributions
 glad+0.400
 lucky+0.348
 important+0.341
 supposed+0.329
 pleased+0.322
Neg logit contributions
mine-0.321
al-0.302
talk-0.283
ted-0.283
als-0.278
Token he
Token ID339
Feature activation-9.68e-01

Pos logit contributions
mine+3.259
al+3.021
talk+2.966
of+2.828
als+2.812
Neg logit contributions
 glad-4.357
 lucky-3.758
 important-3.596
 pleased-3.482
 tired-3.462
Token felt
Token ID2936
Feature activation+2.11e+00

Pos logit contributions
mine+2.083
al+1.959
talk+1.895
als+1.846
of+1.836
Neg logit contributions
 glad-2.001
 lucky-1.678
 important-1.604
 supposed-1.546
 pleased-1.546
Token if
Token ID611
Feature activation-7.87e-02

Pos logit contributions
mine+0.772
al+0.733
talk+0.693
als+0.686
ted+0.681
Neg logit contributions
 glad-0.887
 lucky-0.750
 important-0.722
 pleased-0.706
 supposed-0.705
Token it
Token ID340
Feature activation+3.46e-01

Pos logit contributions
mine+0.119
al+0.112
als+0.104
talk+0.103
ted+0.102
Neg logit contributions
 glad-0.210
 lucky-0.181
 important-0.175
 pleased-0.175
 supposed-0.174
Token fits
Token ID11414
Feature activation-6.85e-01

Pos logit contributions
 glad+0.819
 lucky+0.682
 pleased+0.658
 important+0.652
 supposed+0.642
Neg logit contributions
mine-0.656
al-0.627
als-0.592
talk-0.587
of-0.583
Token!"
Token ID2474
Feature activation-2.76e+00

Pos logit contributions
mine+1.405
al+1.325
talk+1.271
als+1.244
ted+1.237
Neg logit contributions
 glad-1.480
 lucky-1.250
 important-1.200
 supposed-1.162
 pleased-1.157
Token She
Token ID1375
Feature activation-1.36e+00

Pos logit contributions
mine+6.045
al+5.549
x+5.419
talk+5.351
of+5.340
Neg logit contributions
 glad-5.758
 lucky-4.853
 important-4.515
 amazed-4.331
 supposed-4.303
Token helped
Token ID4193
Feature activation-4.28e+00

Pos logit contributions
mine+2.415
al+2.225
talk+2.145
x+2.068
als+2.061
Neg logit contributions
 glad-3.347
 lucky-2.905
 important-2.809
 pleased-2.743
 supposed-2.696
Token Sil
Token ID4243
Feature activation-5.31e-01

Pos logit contributions
mine+8.493
ede+8.308
al+7.883
talk+7.850
als+7.685
Neg logit contributions
 glad-10.173
 lucky-9.009
 happy-8.561
 sad-8.457
 important-8.435
Tokenvia
Token ID8869
Feature activation-5.00e-01

Pos logit contributions
mine+1.114
al+1.050
talk+1.009
als+0.983
of+0.983
Neg logit contributions
 glad-1.123
 lucky-0.945
 important-0.903
 supposed-0.873
 pleased-0.872
Token put
Token ID1234
Feature activation-2.21e+00

Pos logit contributions
mine+0.752
al+0.694
talk+0.652
als+0.633
of+0.628
Neg logit contributions
 glad-1.351
 lucky-1.181
 important-1.147
 supposed-1.118
 pleased-1.116
Token on
Token ID319
Feature activation-2.03e+00

Pos logit contributions
mine+4.519
talk+4.264
al+4.174
x+4.016
of+4.015
Neg logit contributions
 glad-4.906
 lucky-4.246
 important-4.002
 tired-3.812
 cozy-3.767
Token the
Token ID262
Feature activation-1.24e-01

Pos logit contributions
mine+3.901
al+3.614
talk+3.582
x+3.375
als+3.358
Neg logit contributions
 glad-4.818
 lucky-4.224
 important-4.034
 cozy-3.861
 tired-3.827
Token she
Token ID673
Feature activation+7.20e-01

Pos logit contributions
 glad+0.125
 lucky+0.107
 important+0.104
 pleased+0.101
 supposed+0.101
Neg logit contributions
mine-0.092
al-0.086
als-0.081
talk-0.081
ted-0.080
Token imagined
Token ID15758
Feature activation+1.03e+00

Pos logit contributions
 glad+1.659
 lucky+1.370
 important+1.330
 pleased+1.318
 supposed+1.280
Neg logit contributions
mine-1.405
al-1.377
als-1.271
talk-1.255
of-1.252
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
 glad+2.534
 lucky+2.150
 important+2.051
 pleased+2.044
 supposed+2.019
Neg logit contributions
mine-1.825
al-1.728
als-1.574
of-1.546
ted-1.535
Token The
Token ID383
Feature activation+1.58e-01

Pos logit contributions
mine+0.639
al+0.601
talk+0.568
als+0.562
ted+0.553
Neg logit contributions
 glad-0.890
 lucky-0.766
 important-0.739
 supposed-0.725
 pleased-0.720
Token flowers
Token ID12734
Feature activation+2.12e-01

Pos logit contributions
 glad+0.407
 lucky+0.347
 supposed+0.342
 important+0.337
 pleased+0.334
Neg logit contributions
mine-0.252
al-0.243
als-0.228
talk-0.221
ted-0.216
Token were
Token ID547
Feature activation+2.39e+00

Pos logit contributions
 glad+0.533
 lucky+0.462
 important+0.449
 supposed+0.443
 pleased+0.431
Neg logit contributions
mine-0.359
al-0.337
als-0.315
ted-0.308
talk-0.305
Token huge
Token ID3236
Feature activation+1.97e-02

Pos logit contributions
 glad+4.640
 lucky+3.783
 supposed+3.705
 important+3.695
 fault+3.638
Neg logit contributions
mine-5.155
al-5.053
als-4.804
ted-4.441
x-4.355
Token and
Token ID290
Feature activation+1.40e+00

Pos logit contributions
 glad+0.050
 lucky+0.043
 important+0.042
 supposed+0.041
 pleased+0.041
Neg logit contributions
mine-0.032
al-0.031
als-0.029
ted-0.029
talk-0.028
Token so
Token ID523
Feature activation+1.94e+00

Pos logit contributions
 glad+2.884
 supposed+2.398
 lucky+2.341
 fault+2.296
 important+2.237
Neg logit contributions
mine-2.888
al-2.810
als-2.763
ted-2.604
of-2.562
Token colorful
Token ID20239
Feature activation-7.53e-01

Pos logit contributions
 glad+3.361
 supposed+2.843
 fault+2.572
 lucky+2.552
 important+2.443
Neg logit contributions
mine-4.607
al-4.563
als-4.557
of-4.293
ted-4.208
Token!
Token ID0
Feature activation-9.77e-01

Pos logit contributions
mine+1.468
al+1.374
talk+1.320
als+1.281
of+1.278
Neg logit contributions
 glad-1.705
 lucky-1.453
 important-1.399
 supposed-1.357
 pleased-1.350
Token R
Token ID371
Feature activation-5.30e-01

Pos logit contributions
mine+2.239
al+2.062
talk+2.007
x+1.939
ted+1.922
Neg logit contributions
 glad-3.089
 lucky-2.690
 important-2.535
 supposed-2.457
 pleased-2.454
Tokenavi
Token ID15820
Feature activation+3.43e-01

Pos logit contributions
mine+0.718
al+0.662
talk+0.615
als+0.596
of+0.590
Neg logit contributions
 glad-1.513
 lucky-1.333
 important-1.296
 supposed-1.268
 pleased-1.266
Token,
Token ID11
Feature activation-9.21e-01

Pos logit contributions
 glad+0.820
 lucky+0.699
 important+0.680
 supposed+0.667
 pleased+0.664
Neg logit contributions
mine-0.617
al-0.598
talk-0.547
als-0.546
of-0.544
Token your
Token ID534
Feature activation+4.23e-01

Pos logit contributions
mine+1.684
al+1.565
talk+1.518
ted+1.458
als+1.455
Neg logit contributions
 glad-2.212
 lucky-1.911
 important-1.829
 supposed-1.775
 pleased-1.767
Token bird
Token ID6512
Feature activation+1.17e+00

Pos logit contributions
 glad+1.261
 lucky+1.095
 important+1.078
 supposed+1.072
 pleased+1.069
Neg logit contributions
mine-0.515
al-0.489
als-0.439
of-0.423
talk-0.421
Token is
Token ID318
Feature activation+2.73e+00

Pos logit contributions
 glad+2.706
 lucky+2.239
 important+2.190
 supposed+2.171
 pleased+2.165
Neg logit contributions
mine-2.258
al-2.219
of-2.059
als-2.050
x-1.963
Token very
Token ID845
Feature activation+1.56e+00

Pos logit contributions
 glad+5.158
 fault+3.941
 pleased+3.929
 tempting+3.870
 lucky+3.840
Neg logit contributions
al-6.214
mine-6.015
als-5.590
x-5.211
of-5.101
Token pretty
Token ID2495
Feature activation-3.26e-02

Pos logit contributions
 glad+2.128
 supposed+1.573
 fault+1.572
 lucky+1.454
 important+1.431
Neg logit contributions
mine-4.194
al-4.105
of-3.989
als-3.883
talk-3.811
Token!"
Token ID2474
Feature activation-1.45e+00

Pos logit contributions
mine+0.065
al+0.061
talk+0.058
als+0.057
of+0.056
Neg logit contributions
 glad-0.073
 lucky-0.061
 important-0.059
 supposed-0.058
 pleased-0.057
Token Mir
Token ID7381
Feature activation-5.23e-01

Pos logit contributions
mine+2.594
al+2.408
x+2.294
talk+2.285
als+2.226
Neg logit contributions
 glad-3.572
 lucky-3.086
 important-2.926
 pleased-2.865
 tired-2.811
Tokena
Token ID64
Feature activation-4.57e-01

Pos logit contributions
mine+1.086
al+1.027
talk+0.985
als+0.962
of+0.959
Neg logit contributions
 glad-1.113
 lucky-0.937
 important-0.900
 supposed-0.873
 pleased-0.867
Token are
Token ID389
Feature activation+1.69e+00

Pos logit contributions
mine+0.143
al+0.132
talk+0.124
als+0.122
of+0.120
Neg logit contributions
 glad-0.241
 lucky-0.208
 important-0.204
 pleased-0.200
 supposed-0.200
Token right
Token ID826
Feature activation+1.16e+00

Pos logit contributions
 glad+3.492
 pleased+2.956
 tempting+2.882
 important+2.878
 supposed+2.853
Neg logit contributions
al-3.193
mine-3.178
x-2.763
als-2.702
 cash-2.685
Token.
Token ID13
Feature activation-2.70e-01

Pos logit contributions
 glad+2.849
 lucky+2.408
 supposed+2.392
 pleased+2.382
 important+2.371
Neg logit contributions
mine-1.968
al-1.951
als-1.752
ted-1.738
talk-1.706
Token The
Token ID383
Feature activation+8.64e-01

Pos logit contributions
mine+0.544
al+0.521
talk+0.495
als+0.487
ted+0.484
Neg logit contributions
 glad-0.583
 lucky-0.495
 important-0.477
 supposed-0.466
 pleased-0.464
Token sea
Token ID5417
Feature activation+7.57e-01

Pos logit contributions
 glad+2.109
 supposed+1.792
 lucky+1.764
 pleased+1.726
 important+1.699
Neg logit contributions
mine-1.453
al-1.421
als-1.302
talk-1.281
of-1.259
Token is
Token ID318
Feature activation+3.15e+00

Pos logit contributions
 glad+1.571
 lucky+1.288
 supposed+1.265
 important+1.231
 pleased+1.219
Neg logit contributions
mine-1.575
al-1.555
als-1.458
of-1.436
talk-1.425
Token not
Token ID407
Feature activation+2.01e+00

Pos logit contributions
 glad+6.803
 pleased+5.621
 lucky+5.588
 fault+5.537
 supposed+5.510
Neg logit contributions
al-5.572
mine-5.340
x-5.006
 course-4.823
 ribbon-4.750
Token for
Token ID329
Feature activation+1.59e+00

Pos logit contributions
 glad+4.213
 lucky+3.374
 pleased+3.374
 important+3.122
 supposed+3.105
Neg logit contributions
mine-3.986
al-3.847
x-3.554
talk-3.493
als-3.492
Token me
Token ID502
Feature activation+1.43e+00

Pos logit contributions
 glad+4.185
 lucky+3.589
 pleased+3.572
 important+3.516
 supposed+3.451
Neg logit contributions
mine-2.419
al-2.298
ted-2.107
als-2.057
of-2.050
Token.
Token ID13
Feature activation-3.96e-01

Pos logit contributions
 glad+3.579
 lucky+3.081
 pleased+2.998
 important+2.980
 supposed+2.837
Neg logit contributions
mine-2.453
al-2.286
talk-2.181
of-2.161
als-2.124
Token The
Token ID383
Feature activation+9.98e-01

Pos logit contributions
mine+0.807
al+0.768
talk+0.732
als+0.720
ted+0.713
Neg logit contributions
 glad-0.852
 lucky-0.720
 important-0.694
 supposed-0.675
 pleased-0.673
Token on
Token ID319
Feature activation-1.53e+00

Pos logit contributions
 glad+3.252
 fault+2.792
 important+2.742
 lucky+2.687
 valuable+2.686
Neg logit contributions
mine-5.533
al-5.168
 cash-4.899
talk-4.859
Give-4.857
Token a
Token ID257
Feature activation+6.13e-01

Pos logit contributions
mine+2.888
al+2.719
talk+2.637
als+2.608
of+2.541
Neg logit contributions
 glad-3.633
 lucky-3.109
 important-3.006
 supposed-2.892
 pleased-2.858
Token bench
Token ID7624
Feature activation-5.61e-01

Pos logit contributions
 glad+1.391
 lucky+1.164
 supposed+1.139
 important+1.126
 pleased+1.105
Neg logit contributions
mine-1.154
al-1.048
of-1.031
talk-1.021
 cash-0.985
Token.
Token ID13
Feature activation+6.95e-01

Pos logit contributions
mine+1.063
al+0.999
talk+0.952
als+0.931
ted+0.925
Neg logit contributions
 glad-1.294
 lucky-1.106
 important-1.064
 supposed-1.035
 pleased-1.032
Token She
Token ID1375
Feature activation+2.62e-01

Pos logit contributions
 glad+1.841
 lucky+1.625
 supposed+1.597
 important+1.576
 pleased+1.553
Neg logit contributions
mine-1.051
al-0.970
ted-0.937
als-0.933
talk-0.925
Token was
Token ID373
Feature activation+2.56e+00

Pos logit contributions
 glad+0.614
 lucky+0.525
 important+0.508
 supposed+0.506
 pleased+0.492
Neg logit contributions
mine-0.476
al-0.455
talk-0.428
ted-0.424
of-0.424
Token reading
Token ID3555
Feature activation-6.68e-01

Pos logit contributions
 glad+4.451
 valuable+3.851
 fault+3.813
 important+3.805
 lucky+3.774
Neg logit contributions
mine-5.600
al-5.434
 cash-5.057
Give-4.980
talk-4.879
Token a
Token ID257
Feature activation+8.94e-01

Pos logit contributions
mine+1.429
al+1.351
talk+1.297
als+1.272
ted+1.265
Neg logit contributions
 glad-1.377
 lucky-1.152
 important-1.104
 supposed-1.068
 pleased-1.064
Token book
Token ID1492
Feature activation-3.44e-01

Pos logit contributions
 glad+2.159
 lucky+1.823
 supposed+1.778
 important+1.763
 pleased+1.716
Neg logit contributions
mine-1.578
of-1.448
al-1.421
talk-1.407
als-1.345
Token.
Token ID13
Feature activation+5.53e-01

Pos logit contributions
mine+0.768
al+0.727
talk+0.701
als+0.687
of+0.683
Neg logit contributions
 glad-0.678
 lucky-0.561
 important-0.532
 pleased-0.516
 supposed-0.516
Token She
Token ID1375
Feature activation+2.60e-02

Pos logit contributions
 glad+1.342
 lucky+1.166
 supposed+1.135
 important+1.127
 pleased+1.110
Neg logit contributions
mine-0.969
al-0.897
ted-0.864
talk-0.862
als-0.858
Token a
Token ID257
Feature activation+1.32e+00

Pos logit contributions
 glad+1.776
 lucky+1.512
 pleased+1.470
 important+1.460
 supposed+1.451
Neg logit contributions
mine-1.261
al-1.213
talk-1.096
of-1.096
x-1.082
Token new
Token ID649
Feature activation+6.36e-01

Pos logit contributions
 glad+2.313
 supposed+1.827
 lucky+1.746
 pleased+1.647
 important+1.643
Neg logit contributions
mine-3.138
al-3.048
of-2.952
talk-2.909
als-2.870
Token way
Token ID835
Feature activation+7.24e-01

Pos logit contributions
 glad+1.326
 lucky+1.076
 supposed+1.040
 pleased+1.024
 important+1.003
Neg logit contributions
mine-1.352
al-1.302
talk-1.256
of-1.219
als-1.215
Token.
Token ID13
Feature activation+7.20e-01

Pos logit contributions
 glad+1.715
 lucky+1.454
 pleased+1.397
 important+1.389
 supposed+1.370
Neg logit contributions
mine-1.338
al-1.260
of-1.176
als-1.176
talk-1.171
Token<|endoftext|>
Token ID50256
Feature activation+1.13e+00

Pos logit contributions
 glad+1.257
 lucky+1.049
 important+1.011
 supposed+0.995
 pleased+0.968
Neg logit contributions
mine-1.736
al-1.662
of-1.596
talk-1.595
als-1.558
TokenOnce
Token ID7454
Feature activation+2.61e+00

Pos logit contributions
 glad+2.279
 lucky+1.882
 important+1.855
 pleased+1.806
 supposed+1.779
Neg logit contributions
mine-2.602
al-2.484
of-2.348
ted-2.334
als-2.318
Token upon
Token ID2402
Feature activation+9.66e-01

Pos logit contributions
 glad+4.373
 supposed+3.905
 important+3.568
 lucky+3.561
 pleased+3.318
Neg logit contributions
mine-6.881
of-6.488
x-5.899
al-5.879
als-5.664
Token a
Token ID257
Feature activation+1.56e+00

Pos logit contributions
 glad+2.175
 lucky+1.810
 important+1.771
 pleased+1.766
 supposed+1.753
Neg logit contributions
mine-1.861
al-1.695
of-1.664
talk-1.614
ted-1.580
Token time
Token ID640
Feature activation+1.41e+00

Pos logit contributions
 glad+2.497
 lucky+1.947
 supposed+1.853
 pleased+1.843
 important+1.790
Neg logit contributions
mine-4.022
of-3.891
al-3.707
talk-3.661
ede-3.561
Token there
Token ID612
Feature activation+1.23e+00

Pos logit contributions
 glad+3.768
 important+3.336
 supposed+3.326
 lucky+3.300
 pleased+3.215
Neg logit contributions
mine-2.150
of-1.895
al-1.821
ted-1.689
als-1.683
Token were
Token ID547
Feature activation+3.25e-01

Pos logit contributions
 glad+3.100
 lucky+2.635
 supposed+2.589
 important+2.564
 pleased+2.545
Neg logit contributions
mine-2.189
al-1.975
of-1.891
ted-1.827
Give-1.774
Token upon
Token ID2402
Feature activation+1.11e+00

Pos logit contributions
 glad+4.597
 important+3.972
 supposed+3.956
 lucky+3.838
 valuable+3.683
Neg logit contributions
mine-7.219
of-6.425
al-6.226
x-6.162
ted-5.797
Token a
Token ID257
Feature activation+9.03e-01

Pos logit contributions
 glad+2.537
 lucky+2.106
 pleased+2.080
 important+2.072
 supposed+2.034
Neg logit contributions
mine-2.125
al-1.978
of-1.854
talk-1.851
ted-1.822
Token time
Token ID640
Feature activation+1.25e+00

Pos logit contributions
 glad+1.330
 lucky+1.011
 pleased+0.956
 supposed+0.947
 important+0.940
Neg logit contributions
mine-2.411
of-2.274
al-2.273
talk-2.218
ede-2.172
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
 glad+3.272
 important+2.903
 lucky+2.854
 supposed+2.827
 pleased+2.816
Neg logit contributions
mine-1.932
al-1.728
of-1.611
als-1.569
ted-1.568
Token there
Token ID612
Feature activation+9.97e-01

Pos logit contributions
 glad+2.629
 important+2.191
 lucky+2.176
 supposed+2.163
 pleased+2.141
Neg logit contributions
mine-2.878
al-2.730
x-2.543
als-2.451
of-2.427
Token was
Token ID373
Feature activation+6.12e-01

Pos logit contributions
 glad+2.444
 lucky+2.076
 important+2.031
 pleased+2.024
 supposed+2.008
Neg logit contributions
mine-1.821
al-1.692
of-1.572
ted-1.546
als-1.513
Token a
Token ID257
Feature activation+2.51e-01

Pos logit contributions
 glad+1.319
 lucky+1.097
 pleased+1.072
 important+1.068
 supposed+1.038
Neg logit contributions
mine-1.226
al-1.157
ted-1.089
talk-1.075
als-1.071
Token little
Token ID1310
Feature activation-1.35e+00

Pos logit contributions
 glad+0.422
 supposed+0.331
 lucky+0.329
 important+0.320
 pleased+0.311
Neg logit contributions
mine-0.609
al-0.587
of-0.550
als-0.548
talk-0.547
Token boy
Token ID2933
Feature activation-8.80e-01

Pos logit contributions
mine+2.330
al+2.131
talk+2.110
ted+2.014
of+1.988
Neg logit contributions
 glad-3.402
 lucky-2.970
 important-2.843
 pleased-2.736
 tired-2.708
Token named
Token ID3706
Feature activation-3.66e-01

Pos logit contributions
mine+1.637
al+1.530
talk+1.462
als+1.438
ted+1.424
Neg logit contributions
 glad-2.083
 lucky-1.782
 important-1.720
 supposed-1.671
 pleased-1.656
Token Ben
Token ID3932
Feature activation+2.74e-01

Pos logit contributions
mine+0.501
al+0.455
talk+0.426
als+0.412
of+0.411
Neg logit contributions
 glad-1.050
 lucky-0.925
 important-0.902
 pleased-0.884
 supposed-0.880
Token learned
Token ID4499
Feature activation+2.46e-01

Pos logit contributions
 glad+1.583
 lucky+1.296
 pleased+1.237
 important+1.223
 supposed+1.204
Neg logit contributions
mine-1.501
al-1.463
talk-1.362
x-1.354
ted-1.350
Token something
Token ID1223
Feature activation+1.25e+00

Pos logit contributions
 glad+0.578
 lucky+0.487
 pleased+0.468
 supposed+0.461
 important+0.458
Neg logit contributions
mine-0.442
al-0.421
talk-0.402
ted-0.396
of-0.392
Token new
Token ID649
Feature activation+5.04e-01

Pos logit contributions
 glad+1.695
 pleased+1.128
 lucky+1.126
 supposed+1.064
 tired+1.025
Neg logit contributions
mine-3.349
al-3.306
of-3.199
ted-3.134
als-3.119
Token.
Token ID13
Feature activation+8.18e-01

Pos logit contributions
 glad+1.251
 lucky+1.065
 pleased+1.021
 supposed+1.007
 important+1.002
Neg logit contributions
mine-0.847
al-0.814
talk-0.759
als-0.748
ted-0.743
Token From
Token ID3574
Feature activation-5.37e-01

Pos logit contributions
 glad+1.675
 lucky+1.434
 supposed+1.369
 important+1.363
 pleased+1.348
Neg logit contributions
mine-1.688
al-1.594
ted-1.555
talk-1.546
of-1.539
Token that
Token ID326
Feature activation+7.56e-01

Pos logit contributions
mine+0.838
al+0.781
talk+0.733
als+0.714
ted+0.713
Neg logit contributions
 glad-1.424
 lucky-1.242
 important-1.202
 supposed-1.179
 pleased-1.173
Token day
Token ID1110
Feature activation-9.66e-01

Pos logit contributions
 glad+1.380
 lucky+1.114
 pleased+1.064
 supposed+1.057
 important+1.015
Neg logit contributions
mine-1.791
al-1.661
of-1.617
talk-1.583
x-1.582
Token on
Token ID319
Feature activation-5.38e-01

Pos logit contributions
mine+1.983
al+1.876
talk+1.803
als+1.765
of+1.747
Neg logit contributions
 glad-2.105
 lucky-1.779
 important-1.702
 supposed-1.650
 pleased-1.633
Token,
Token ID11
Feature activation-6.23e-01

Pos logit contributions
mine+1.011
al+0.947
talk+0.901
als+0.881
ted+0.879
Neg logit contributions
 glad-1.263
 lucky-1.080
 important-1.039
 supposed-1.015
 pleased-1.011
Token Tim
Token ID5045
Feature activation+1.65e-01

Pos logit contributions
mine+0.888
al+0.814
talk+0.763
als+0.740
ted+0.735
Neg logit contributions
 glad-1.746
 lucky-1.535
 important-1.489
 supposed-1.457
 pleased-1.453
Tokenmy
Token ID1820
Feature activation+2.07e-01

Pos logit contributions
 glad+0.441
 lucky+0.383
 pleased+0.367
 important+0.367
 supposed+0.359
Neg logit contributions
mine-0.258
al-0.238
x-0.222
talk-0.221
of-0.218
Token?
Token ID30
Feature activation+7.14e-02

Pos logit contributions
mine+1.501
al+1.421
talk+1.366
als+1.339
ted+1.332
Neg logit contributions
 glad-1.399
 lucky-1.167
 important-1.116
 supposed-1.079
 pleased-1.075
Token The
Token ID383
Feature activation+9.76e-01

Pos logit contributions
 glad+0.159
 lucky+0.136
 important+0.133
 supposed+0.129
 pleased+0.128
Neg logit contributions
mine-0.139
al-0.133
talk-0.124
als-0.124
ted-0.123
Token air
Token ID1633
Feature activation+7.03e-01

Pos logit contributions
 glad+2.262
 lucky+1.874
 supposed+1.867
 pleased+1.846
 important+1.804
Neg logit contributions
mine-1.798
al-1.734
als-1.592
 cash-1.559
ted-1.556
Token outside
Token ID2354
Feature activation+7.34e-02

Pos logit contributions
 glad+1.554
 lucky+1.298
 important+1.234
 supposed+1.227
 pleased+1.209
Neg logit contributions
mine-1.423
al-1.354
als-1.266
of-1.251
ted-1.234
Token is
Token ID318
Feature activation+2.97e+00

Pos logit contributions
 glad+0.168
 lucky+0.142
 important+0.136
 supposed+0.135
 pleased+0.133
Neg logit contributions
mine-0.142
al-0.134
als-0.126
talk-0.124
of-0.124
Token not
Token ID407
Feature activation+2.05e+00

Pos logit contributions
 glad+6.515
 lucky+5.237
 pleased+5.108
 supposed+4.688
 impressed+4.599
Neg logit contributions
mine-5.808
al-5.621
als-5.171
of-5.129
ted-5.124
Token clear
Token ID1598
Feature activation+1.16e+00

Pos logit contributions
 glad+4.030
 lucky+3.144
 pleased+3.106
 important+2.851
 supposed+2.837
Neg logit contributions
mine-4.489
al-4.218
of-3.932
x-3.899
als-3.862
Token,
Token ID11
Feature activation+4.02e-01

Pos logit contributions
 glad+2.909
 lucky+2.450
 pleased+2.349
 important+2.290
 supposed+2.290
Neg logit contributions
mine-2.069
al-1.988
als-1.852
ted-1.801
of-1.770
Token remember
Token ID3505
Feature activation+1.34e+00

Pos logit contributions
 glad+0.984
 lucky+0.843
 supposed+0.815
 important+0.808
 pleased+0.798
Neg logit contributions
mine-0.687
al-0.640
als-0.598
of-0.596
ted-0.585
Token to
Token ID284
Feature activation+3.11e-01

Pos logit contributions
 glad+3.402
 lucky+2.910
 pleased+2.860
 important+2.775
 supposed+2.761
Neg logit contributions
mine-2.192
al-2.167
als-2.014
ted-1.978
of-1.968
Token stay
Token ID2652
Feature activation+1.83e+00

Pos logit contributions
 glad+0.787
 lucky+0.679
 important+0.657
 pleased+0.651
 supposed+0.647
Neg logit contributions
mine-0.513
al-0.478
of-0.442
talk-0.441
als-0.426
Token a
Token ID257
Feature activation+1.49e+00

Pos logit contributions
mine+0.361
al+0.346
talk+0.331
als+0.326
ted+0.325
Neg logit contributions
 glad-0.351
 lucky-0.290
 important-0.278
 supposed-0.276
 pleased-0.273
Token way
Token ID835
Feature activation+2.88e-01

Pos logit contributions
 glad+3.173
 pleased+2.516
 supposed+2.501
 lucky+2.496
 important+2.391
Neg logit contributions
mine-2.910
al-2.841
of-2.721
talk-2.679
 cash-2.672
Token to
Token ID284
Feature activation-1.22e+00

Pos logit contributions
 glad+0.617
 lucky+0.520
 important+0.500
 pleased+0.496
 supposed+0.486
Neg logit contributions
mine-0.590
al-0.544
talk-0.525
of-0.520
als-0.514
Token care
Token ID1337
Feature activation+1.61e-01

Pos logit contributions
mine+2.132
al+1.998
talk+1.910
als+1.892
ted+1.883
Neg logit contributions
 glad-3.041
 lucky-2.652
 important-2.559
 supposed-2.460
 pleased-2.458
Token for
Token ID329
Feature activation-5.17e-01

Pos logit contributions
 glad+0.337
 lucky+0.282
 important+0.265
 supposed+0.265
 pleased+0.263
Neg logit contributions
mine-0.338
al-0.316
of-0.301
talk-0.299
x-0.296
Token them
Token ID606
Feature activation+7.16e-01

Pos logit contributions
mine+0.914
al+0.855
talk+0.811
als+0.790
ted+0.787
Neg logit contributions
 glad-1.266
 lucky-1.087
 important-1.049
 supposed-1.026
 pleased-1.024
Token all
Token ID477
Feature activation+6.80e-01

Pos logit contributions
 glad+1.708
 lucky+1.437
 pleased+1.385
 important+1.373
 supposed+1.369
Neg logit contributions
mine-1.317
al-1.217
of-1.144
als-1.143
talk-1.129
Token so
Token ID523
Feature activation-1.61e-01

Pos logit contributions
 glad+1.655
 lucky+1.423
 pleased+1.356
 important+1.349
 supposed+1.336
Neg logit contributions
mine-1.204
al-1.111
of-1.052
talk-1.041
ted-1.020
Token he
Token ID339
Feature activation+1.31e-01

Pos logit contributions
mine+0.270
al+0.249
talk+0.235
of+0.232
als+0.231
Neg logit contributions
 glad-0.406
 lucky-0.350
 important-0.337
 supposed-0.335
 pleased-0.331
Token could
Token ID714
Feature activation-8.45e-01

Pos logit contributions
 glad+0.294
 lucky+0.245
 supposed+0.235
 important+0.234
 pleased+0.231
Neg logit contributions
mine-0.259
al-0.248
talk-0.235
als-0.232
ted-0.231
Token keep
Token ID1394
Feature activation+5.96e-01

Pos logit contributions
mine+1.467
al+1.377
talk+1.305
als+1.279
ted+1.276
Neg logit contributions
 glad-2.090
 lucky-1.807
 important-1.744
 supposed-1.696
 pleased-1.690
Token sc
Token ID629
Feature activation+1.25e-01

Pos logit contributions
mine+0.236
al+0.216
talk+0.203
als+0.196
x+0.195
Neg logit contributions
 glad-0.461
 lucky-0.402
 important-0.389
 supposed-0.385
 pleased-0.383
Tokenooter
Token ID25141
Feature activation-7.89e-01

Pos logit contributions
 glad+0.304
 lucky+0.262
 important+0.254
 supposed+0.248
 pleased+0.247
Neg logit contributions
mine-0.219
al-0.205
talk-0.198
of-0.195
x-0.192
Token,
Token ID11
Feature activation-4.27e-01

Pos logit contributions
mine+1.557
al+1.468
talk+1.405
als+1.373
ted+1.367
Neg logit contributions
 glad-1.749
 lucky-1.485
 important-1.430
 supposed-1.384
 pleased-1.379
Token but
Token ID475
Feature activation-1.38e+00

Pos logit contributions
mine+0.796
al+0.745
talk+0.707
als+0.695
of+0.692
Neg logit contributions
 glad-0.993
 lucky-0.851
 important-0.820
 supposed-0.800
 pleased-0.793
Token Emily
Token ID17608
Feature activation+3.50e-01

Pos logit contributions
mine+2.271
al+2.130
talk+2.071
als+1.961
ted+1.948
Neg logit contributions
 glad-3.572
 lucky-3.109
 important-3.010
 pleased-2.920
 supposed-2.888
Token said
Token ID531
Feature activation+3.47e-01

Pos logit contributions
 glad+0.836
 lucky+0.710
 supposed+0.679
 important+0.676
 pleased+0.662
Neg logit contributions
mine-0.642
al-0.594
talk-0.553
of-0.552
ted-0.545
Token no
Token ID645
Feature activation-8.52e-01

Pos logit contributions
 glad+0.822
 lucky+0.702
 supposed+0.685
 important+0.679
 pleased+0.659
Neg logit contributions
mine-0.630
al-0.587
als-0.549
ted-0.542
of-0.540
Token.
Token ID13
Feature activation-3.75e-01

Pos logit contributions
mine+1.750
al+1.652
talk+1.586
als+1.553
ted+1.542
Neg logit contributions
 glad-1.858
 lucky-1.574
 important-1.521
 supposed-1.463
 pleased-1.457
Token Lily
Token ID20037
Feature activation-1.29e+00

Pos logit contributions
mine+0.703
al+0.657
talk+0.626
als+0.616
ted+0.611
Neg logit contributions
 glad-0.873
 lucky-0.749
 important-0.722
 supposed-0.706
 pleased-0.698
Token felt
Token ID2936
Feature activation+2.19e+00

Pos logit contributions
mine+2.679
al+2.540
talk+2.448
als+2.408
ted+2.374
Neg logit contributions
 glad-2.742
 lucky-2.322
 important-2.242
 pleased-2.159
 supposed-2.124
Token sad
Token ID6507
Feature activation+3.96e-01

Pos logit contributions
 glad+2.734
 supposed+2.212
 lucky+1.916
 fault+1.829
 important+1.798
Neg logit contributions
mine-6.088
al-5.910
ted-5.737
Give-5.559
of-5.437
Token walking
Token ID6155
Feature activation-9.07e-01

Pos logit contributions
 glad+1.184
 lucky+0.997
 important+0.993
 pleased+0.963
 supposed+0.952
Neg logit contributions
mine-1.077
al-1.032
ted-0.968
talk-0.946
als-0.939
Token.
Token ID13
Feature activation+5.10e-01

Pos logit contributions
mine+1.853
al+1.750
talk+1.677
als+1.643
ted+1.637
Neg logit contributions
 glad-1.970
 lucky-1.666
 important-1.600
 pleased-1.542
 supposed-1.534
Token As
Token ID1081
Feature activation-2.94e-01

Pos logit contributions
 glad+1.228
 lucky+1.080
 important+1.058
 supposed+1.017
 pleased+1.007
Neg logit contributions
mine-0.895
al-0.844
talk-0.819
als-0.805
ted-0.790
Token they
Token ID484
Feature activation-1.45e+00

Pos logit contributions
mine+0.438
al+0.406
talk+0.381
als+0.372
of+0.372
Neg logit contributions
 glad-0.792
 lucky-0.691
 important-0.671
 supposed-0.660
 pleased-0.656
Token walked
Token ID6807
Feature activation-1.52e+00

Pos logit contributions
mine+2.308
al+2.165
talk+2.040
x+1.988
als+1.969
Neg logit contributions
 glad-3.798
 lucky-3.314
 important-3.149
 pleased-3.111
 tired-3.070
Token,
Token ID11
Feature activation-8.08e-01

Pos logit contributions
mine+3.146
al+2.962
talk+2.824
x+2.795
als+2.750
Neg logit contributions
 glad-3.365
 lucky-2.853
 important-2.714
 pleased-2.639
 tired-2.580
Token they
Token ID484
Feature activation-4.96e-01

Pos logit contributions
mine+1.310
al+1.214
talk+1.152
als+1.115
x+1.110
Neg logit contributions
 glad-2.099
 lucky-1.827
 important-1.764
 pleased-1.717
 supposed-1.717
Token saw
Token ID2497
Feature activation+5.61e-01

Pos logit contributions
mine+0.946
al+0.888
talk+0.847
als+0.830
ted+0.825
Neg logit contributions
 glad-1.143
 lucky-0.975
 important-0.943
 supposed-0.917
 pleased-0.909
Token more
Token ID517
Feature activation+9.73e-01

Pos logit contributions
 glad+1.249
 lucky+1.046
 supposed+1.031
 important+1.012
 pleased+0.997
Neg logit contributions
mine-1.069
al-1.047
als-0.987
ted-0.985
talk-0.966
Token animals
Token ID4695
Feature activation+5.07e-01

Pos logit contributions
 glad+2.246
 lucky+1.874
 supposed+1.854
 important+1.802
 pleased+1.758
Neg logit contributions
mine-1.773
al-1.739
als-1.640
ted-1.604
talk-1.595
Token and
Token ID290
Feature activation+1.03e-01

Pos logit contributions
 glad+1.163
 lucky+0.994
 important+0.975
 supposed+0.941
 pleased+0.928
Neg logit contributions
mine-0.949
al-0.903
als-0.846
of-0.841
ted-0.838
Token The
Token ID383
Feature activation+2.18e-01

Pos logit contributions
 glad+1.895
 lucky+1.610
 important+1.553
 supposed+1.541
 pleased+1.538
Neg logit contributions
mine-2.090
al-1.980
talk-1.941
of-1.872
ted-1.854
Token end
Token ID886
Feature activation+1.04e+00

Pos logit contributions
 glad+0.560
 lucky+0.482
 supposed+0.470
 important+0.465
 pleased+0.463
Neg logit contributions
mine-0.341
al-0.325
talk-0.300
als-0.296
ted-0.291
Token.
Token ID13
Feature activation+1.35e+00

Pos logit contributions
 glad+2.649
 lucky+2.254
 pleased+2.176
 supposed+2.140
 important+2.099
Neg logit contributions
mine-1.676
al-1.655
talk-1.631
of-1.546
als-1.543
Token<|endoftext|>
Token ID50256
Feature activation+1.06e+00

Pos logit contributions
 glad+2.504
 lucky+2.140
 important+2.116
 supposed+2.076
 pleased+2.033
Neg logit contributions
mine-3.042
al-2.893
talk-2.811
of-2.749
ted-2.705
TokenL
Token ID43
Feature activation-1.27e+00

Pos logit contributions
 glad+2.205
 lucky+1.825
 important+1.782
 pleased+1.781
 supposed+1.718
Neg logit contributions
mine-2.331
al-2.284
talk-2.115
ted-2.112
als-2.105
Tokenily
Token ID813
Feature activation-1.53e-01

Pos logit contributions
mine+1.961
al+1.834
talk+1.743
als+1.726
ted+1.701
Neg logit contributions
 glad-3.403
 lucky-2.976
 important-2.871
 supposed-2.826
 pleased-2.771
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
mine+0.258
al+0.245
talk+0.230
ted+0.224
als+0.224
Neg logit contributions
 glad-0.382
 lucky-0.330
 important-0.319
 pleased-0.313
 supposed-0.311
Token Ben
Token ID3932
Feature activation-7.08e-01

Pos logit contributions
mine+1.853
al+1.741
talk+1.669
als+1.620
x+1.614
Neg logit contributions
 glad-2.607
 lucky-2.244
 important-2.135
 supposed-2.082
 pleased-2.076
Token are
Token ID389
Feature activation+2.87e+00

Pos logit contributions
mine+1.208
al+1.125
talk+1.069
als+1.041
ted+1.035
Neg logit contributions
 glad-1.766
 lucky-1.529
 important-1.475
 supposed-1.438
 pleased-1.432
Token friends
Token ID2460
Feature activation+7.69e-01

Pos logit contributions
 glad+5.028
 valuable+4.843
 tempting+4.641
 pleased+4.620
 important+4.570
Neg logit contributions
mine-6.116
al-5.495
talk-5.263
 cash-5.209
ted-5.185
Token.
Token ID13
Feature activation-4.51e-01

Pos logit contributions
 glad+1.824
 lucky+1.603
 supposed+1.574
 important+1.564
 pleased+1.499
Neg logit contributions
mine-1.404
al-1.277
talk-1.208
of-1.186
ted-1.147
Token sad
Token ID6507
Feature activation+8.11e-01

Pos logit contributions
 glad+2.956
 supposed+2.543
 fault+2.170
 pleased+2.113
 lucky+2.086
Neg logit contributions
mine-5.769
al-5.625
ted-5.493
Give-5.379
talk-5.365
Token.
Token ID13
Feature activation+7.87e-01

Pos logit contributions
 glad+2.010
 lucky+1.731
 supposed+1.680
 pleased+1.663
 important+1.650
Neg logit contributions
mine-1.363
al-1.272
ted-1.203
talk-1.193
als-1.177
Token They
Token ID1119
Feature activation-1.63e-01

Pos logit contributions
 glad+2.084
 lucky+1.832
 important+1.795
 supposed+1.792
 pleased+1.750
Neg logit contributions
mine-1.182
al-1.117
talk-1.049
als-1.040
ted-1.004
Token did
Token ID750
Feature activation+1.19e+00

Pos logit contributions
mine+0.338
al+0.318
talk+0.303
als+0.299
ted+0.296
Neg logit contributions
 glad-0.347
 lucky-0.292
 important-0.283
 supposed-0.276
 pleased-0.273
Token not
Token ID407
Feature activation-4.47e-01

Pos logit contributions
 glad+1.871
 lucky+1.424
 pleased+1.401
 important+1.327
 supposed+1.314
Neg logit contributions
mine-3.020
al-2.794
talk-2.792
ede-2.742
Give-2.726
Token want
Token ID765
Feature activation-1.63e-01

Pos logit contributions
mine+0.991
al+0.936
talk+0.905
of+0.879
als+0.878
Neg logit contributions
 glad-0.895
 lucky-0.741
 important-0.707
 pleased-0.687
 supposed-0.682
Token to
Token ID284
Feature activation-1.70e+00

Pos logit contributions
mine+0.344
al+0.328
talk+0.317
ted+0.309
als+0.309
Neg logit contributions
 glad-0.338
 lucky-0.281
 important-0.266
 pleased-0.265
 supposed-0.260
Token miss
Token ID2051
Feature activation-7.53e-03

Pos logit contributions
mine+3.022
al+2.887
ted+2.768
als+2.765
talk+2.672
Neg logit contributions
 glad-4.180
 lucky-3.651
 important-3.540
 supposed-3.448
 sad-3.354
Token their
Token ID511
Feature activation+8.29e-01

Pos logit contributions
mine+0.012
al+0.012
talk+0.011
als+0.011
ted+0.011
Neg logit contributions
 glad-0.019
 lucky-0.016
 important-0.016
 pleased-0.016
 supposed-0.016
Token holiday
Token ID9912
Feature activation-1.08e-01

Pos logit contributions
 glad+1.992
 supposed+1.653
 lucky+1.646
 pleased+1.633
 important+1.606
Neg logit contributions
mine-1.436
al-1.351
talk-1.314
x-1.287
als-1.282
Token.
Token ID13
Feature activation+1.38e+00

Pos logit contributions
mine+0.209
al+0.195
talk+0.188
als+0.182
of+0.182
Neg logit contributions
 glad-0.247
 lucky-0.208
 important-0.198
 pleased-0.196
 supposed-0.195
Token
Token ID198
Feature activation-1.08e+00

Pos logit contributions
 glad+3.785
 lucky+3.256
 supposed+3.195
 important+3.113
 pleased+3.056
Neg logit contributions
mine-4.118
of-3.958
talk-3.862
al-3.720
ted-3.663
TokenBut
Token ID1537
Feature activation-1.34e+00

Pos logit contributions
mine+1.920
al+1.798
talk+1.689
als+1.677
ted+1.653
Neg logit contributions
 glad-2.690
 lucky-2.322
 important-2.247
 pleased-2.176
 supposed-2.170
Token then
Token ID788
Feature activation-1.71e+00

Pos logit contributions
mine+1.973
al+1.797
talk+1.735
als+1.657
of+1.655
Neg logit contributions
 glad-3.793
 lucky-3.336
 important-3.219
 pleased-3.116
 supposed-3.106
Token,
Token ID11
Feature activation-1.79e+00

Pos logit contributions
mine+3.647
al+3.394
talk+3.320
x+3.247
of+3.163
Neg logit contributions
 glad-3.754
 lucky-3.240
 important-3.067
 tired-2.911
 sad-2.890
Token Tom
Token ID4186
Feature activation-2.97e-01

Pos logit contributions
mine+3.868
al+3.647
talk+3.544
x+3.474
of+3.441
Neg logit contributions
 glad-3.880
 lucky-3.325
 important-3.134
 tired-2.986
 sad-2.964
Token swung
Token ID28507
Feature activation-8.97e-01

Pos logit contributions
mine+0.588
al+0.558
talk+0.530
als+0.522
ted+0.520
Neg logit contributions
 glad-0.659
 lucky-0.555
 important-0.537
 supposed-0.521
 pleased-0.519
Token his
Token ID465
Feature activation+1.56e-01

Pos logit contributions
mine+2.023
al+1.920
talk+1.850
als+1.808
x+1.799
Neg logit contributions
 glad-1.767
 lucky-1.466
 important-1.402
 pleased-1.346
 supposed-1.340
Token stick
Token ID4859
Feature activation-6.78e-01

Pos logit contributions
 glad+0.402
 lucky+0.343
 important+0.335
 supposed+0.334
 pleased+0.331
Neg logit contributions
mine-0.254
al-0.237
talk-0.222
als-0.222
x-0.220
Token too
Token ID1165
Feature activation-5.87e-01

Pos logit contributions
mine+1.348
al+1.270
talk+1.215
als+1.189
ted+1.182
Neg logit contributions
 glad-1.498
 lucky-1.270
 important-1.221
 supposed-1.184
 pleased-1.180
Token hard
Token ID1327
Feature activation-1.81e-01

Pos logit contributions
mine+1.571
al+1.502
talk+1.455
als+1.432
of+1.427
Neg logit contributions
 glad-0.895
 lucky-0.697
 important-0.654
 supposed-0.624
 pleased-0.621
Token and
Token ID290
Feature activation-2.71e-01

Pos logit contributions
mine+0.322
al+0.299
talk+0.284
ted+0.280
als+0.278
Neg logit contributions
 glad-0.441
 lucky-0.380
 important-0.362
 pleased-0.357
 supposed-0.351
Token gifts
Token ID13201
Feature activation-2.75e-01

Pos logit contributions
mine+0.034
al+0.032
talk+0.030
als+0.030
of+0.029
Neg logit contributions
 glad-0.049
 lucky-0.042
 important-0.040
 pleased-0.040
 supposed-0.040
Token that
Token ID326
Feature activation+4.85e-01

Pos logit contributions
mine+0.508
al+0.476
talk+0.448
als+0.443
of+0.438
Neg logit contributions
 glad-0.645
 lucky-0.551
 important-0.530
 supposed-0.518
 pleased-0.517
Token he
Token ID339
Feature activation+7.51e-01

Pos logit contributions
 glad+1.196
 lucky+1.019
 pleased+0.977
 important+0.972
 supposed+0.965
Neg logit contributions
mine-0.834
al-0.781
talk-0.729
als-0.729
ted-0.717
Token has
Token ID468
Feature activation+1.19e+00

Pos logit contributions
 glad+1.604
 lucky+1.315
 pleased+1.267
 important+1.245
 supposed+1.224
Neg logit contributions
mine-1.530
al-1.501
talk-1.400
x-1.385
of-1.381
Token been
Token ID587
Feature activation+2.08e+00

Pos logit contributions
 glad+2.644
 lucky+2.187
 pleased+2.116
 important+2.074
 supposed+2.018
Neg logit contributions
mine-2.308
al-2.228
talk-2.122
als-2.065
ted-2.062
Token given
Token ID1813
Feature activation-1.47e-01

Pos logit contributions
 glad+4.142
 lucky+3.371
 pleased+3.262
 important+3.246
 cozy+3.213
Neg logit contributions
al-4.305
mine-4.287
talk-3.861
x-3.813
ted-3.739
Token.
Token ID13
Feature activation+1.19e+00

Pos logit contributions
mine+0.288
al+0.272
talk+0.259
als+0.254
of+0.253
Neg logit contributions
 glad-0.331
 lucky-0.278
 important-0.265
 pleased-0.262
 supposed-0.260
Token
Token ID220
Feature activation+2.00e+00

Pos logit contributions
 glad+2.499
 lucky+2.174
 supposed+2.147
 important+2.111
 pleased+1.990
Neg logit contributions
mine-2.457
al-2.309
of-2.228
ted-2.220
talk-2.172
Token
Token ID198
Feature activation+9.74e-01

Pos logit contributions
 glad+3.872
 lucky+3.345
 important+3.287
 supposed+3.132
 pleased+3.103
Neg logit contributions
mine-4.590
of-4.214
al-4.138
talk-4.035
x-3.999
TokenFred
Token ID30847
Feature activation+3.25e-01

Pos logit contributions
 glad+1.728
 lucky+1.444
 supposed+1.401
 important+1.375
 pleased+1.288
Neg logit contributions
mine-2.370
of-2.199
al-2.181
talk-2.127
ted-2.111
Tokendie
Token ID11979
Feature activation+5.89e-02

Pos logit contributions
 glad+0.724
 lucky+0.611
 important+0.578
 supposed+0.578
 pleased+0.566
Neg logit contributions
mine-0.651
al-0.619
of-0.587
talk-0.583
x-0.577
Token said
Token ID531
Feature activation-2.78e-01

Pos logit contributions
mine+1.185
al+1.110
talk+1.059
als+1.034
ted+1.027
Neg logit contributions
 glad-1.511
 lucky-1.295
 important-1.248
 supposed-1.214
 pleased-1.210
Token to
Token ID284
Feature activation-1.60e+00

Pos logit contributions
mine+0.549
al+0.514
talk+0.492
als+0.480
ted+0.478
Neg logit contributions
 glad-0.622
 lucky-0.530
 important-0.510
 supposed-0.496
 pleased-0.491
Token Ben
Token ID3932
Feature activation-4.89e-01

Pos logit contributions
mine+2.625
al+2.468
talk+2.465
als+2.358
x+2.314
Neg logit contributions
 glad-4.162
 lucky-3.592
 important-3.405
 supposed-3.344
 pleased-3.338
Token.
Token ID13
Feature activation-5.36e-01

Pos logit contributions
mine+0.976
al+0.917
talk+0.877
als+0.856
of+0.853
Neg logit contributions
 glad-1.079
 lucky-0.915
 important-0.881
 supposed-0.854
 pleased-0.850
Token
Token ID198
Feature activation-4.11e-02

Pos logit contributions
mine+0.886
al+0.824
talk+0.778
als+0.761
ted+0.757
Neg logit contributions
 glad-1.370
 lucky-1.191
 important-1.154
 supposed-1.126
 pleased-1.119
Token
Token ID198
Feature activation-2.86e+00

Pos logit contributions
mine+0.105
al+0.100
of+0.097
talk+0.097
ted+0.095
Neg logit contributions
 glad-0.068
 lucky-0.054
 important-0.051
 supposed-0.050
 pleased-0.049
Token"
Token ID1
Feature activation-2.03e+00

Pos logit contributions
mine+5.515
al+5.156
talk+5.033
ted+4.988
x+4.925
Neg logit contributions
 glad-6.901
 lucky-6.037
 important-5.835
 pleased-5.747
 happy-5.502
TokenOkay
Token ID16454
Feature activation-3.57e-01

Pos logit contributions
mine+2.850
al+2.628
x+2.398
talk+2.389
als+2.361
Neg logit contributions
 glad-5.847
 lucky-5.182
 important-5.071
 pleased-4.917
 happy-4.751
Token,
Token ID11
Feature activation-1.23e+00

Pos logit contributions
mine+0.671
al+0.630
talk+0.596
als+0.587
of+0.585
Neg logit contributions
 glad-0.831
 lucky-0.708
 important-0.683
 supposed-0.673
 pleased-0.665
Token that
Token ID326
Feature activation+3.21e-01

Pos logit contributions
mine+2.259
al+2.088
talk+2.037
ted+1.938
x+1.935
Neg logit contributions
 glad-2.973
 lucky-2.575
 important-2.467
 pleased-2.387
 supposed-2.343
Token sounds
Token ID5238
Feature activation+2.19e+00

Pos logit contributions
 glad+0.793
 lucky+0.671
 pleased+0.645
 supposed+0.642
 important+0.636
Neg logit contributions
mine-0.562
al-0.532
als-0.503
of-0.500
talk-0.497
Token nicely
Token ID16576
Feature activation-1.66e-01

Pos logit contributions
mine+1.969
al+1.850
talk+1.781
als+1.730
x+1.726
Neg logit contributions
 glad-2.199
 lucky-1.872
 important-1.809
 pleased-1.733
 supposed-1.730
Token why
Token ID1521
Feature activation+2.40e-01

Pos logit contributions
mine+0.309
al+0.292
talk+0.275
als+0.273
of+0.270
Neg logit contributions
 glad-0.387
 lucky-0.329
 important-0.315
 supposed-0.310
 pleased-0.309
Token Ell
Token ID7122
Feature activation+3.30e-01

Pos logit contributions
 glad+0.623
 lucky+0.541
 important+0.523
 supposed+0.518
 pleased+0.514
Neg logit contributions
mine-0.384
al-0.348
of-0.320
talk-0.319
als-0.318
Tokena
Token ID64
Feature activation-1.26e-01

Pos logit contributions
 glad+0.762
 lucky+0.646
 important+0.620
 pleased+0.601
 supposed+0.599
Neg logit contributions
mine-0.647
al-0.608
of-0.572
talk-0.569
x-0.563
Token couldn
Token ID3521
Feature activation-1.50e+00

Pos logit contributions
mine+0.257
al+0.243
talk+0.230
als+0.228
ted+0.226
Neg logit contributions
 glad-0.273
 lucky-0.229
 important-0.219
 supposed-0.215
 pleased-0.214
Token't
Token ID470
Feature activation-2.99e+00

Pos logit contributions
mine+2.917
al+2.797
talk+2.738
als+2.644
x+2.631
Neg logit contributions
 glad-3.325
 lucky-2.849
 important-2.766
 supposed-2.690
 pleased-2.598
Token have
Token ID423
Feature activation+4.73e-02

Pos logit contributions
ted+5.872
al+5.794
mine+5.772
als+5.759
x+5.513
Neg logit contributions
 glad-6.815
 lucky-6.035
 important-5.926
 supposed-5.599
 sad-5.547
Token all
Token ID477
Feature activation-1.52e-01

Pos logit contributions
 glad+0.116
 lucky+0.098
 important+0.095
 pleased+0.094
 supposed+0.093
Neg logit contributions
mine-0.083
al-0.077
talk-0.074
of-0.072
ted-0.072
Token the
Token ID262
Feature activation+3.99e-01

Pos logit contributions
mine+0.250
al+0.232
talk+0.219
of+0.214
als+0.214
Neg logit contributions
 glad-0.387
 lucky-0.336
 important-0.323
 pleased-0.317
 supposed-0.315
Token books
Token ID3835
Feature activation-6.06e-01

Pos logit contributions
 glad+1.110
 lucky+0.955
 pleased+0.931
 important+0.917
 supposed+0.914
Neg logit contributions
mine-0.565
al-0.512
talk-0.491
of-0.473
x-0.466
Token but
Token ID475
Feature activation+5.23e-01

Pos logit contributions
mine+1.144
al+1.073
talk+1.023
als+1.001
ted+0.994
Neg logit contributions
 glad-1.399
 lucky-1.195
 important-1.150
 supposed-1.118
 pleased-1.115
Token touch
Token ID3638
Feature activation-8.38e-01

Pos logit contributions
mine+2.478
al+2.361
ted+2.234
als+2.229
talk+2.223
Neg logit contributions
 glad-3.688
 lucky-3.217
 important-3.100
 supposed-3.027
 pleased-2.969
Token Max
Token ID5436
Feature activation-6.70e-01

Pos logit contributions
mine+1.092
al+0.995
talk+0.935
als+0.894
of+0.883
Neg logit contributions
 glad-2.425
 lucky-2.146
 important-2.085
 supposed-2.034
 pleased-2.027
Token.
Token ID13
Feature activation+6.33e-02

Pos logit contributions
mine+1.309
al+1.232
talk+1.178
als+1.152
ted+1.145
Neg logit contributions
 glad-1.507
 lucky-1.282
 important-1.233
 supposed-1.197
 pleased-1.193
Token
Token ID198
Feature activation+7.75e-01

Pos logit contributions
 glad+0.135
 lucky+0.115
 important+0.111
 supposed+0.109
 pleased+0.108
Neg logit contributions
mine-0.130
al-0.122
talk-0.117
als-0.116
ted-0.116
Token
Token ID198
Feature activation-2.43e+00

Pos logit contributions
 glad+1.362
 lucky+1.133
 important+1.104
 supposed+1.074
 pleased+1.047
Neg logit contributions
mine-1.863
of-1.740
al-1.729
talk-1.710
ted-1.647
Token"
Token ID1
Feature activation-2.41e+00

Pos logit contributions
mine+4.347
al+4.045
als+3.849
ted+3.763
talk+3.739
Neg logit contributions
 glad-6.443
 lucky-5.526
 important-5.295
 pleased-5.131
 happy-4.987
TokenY
Token ID56
Feature activation-2.48e+00

Pos logit contributions
mine+3.495
al+3.199
x+2.881
als+2.877
talk+2.798
Neg logit contributions
 glad-6.873
 lucky-6.093
 important-6.058
 pleased-5.773
 supposed-5.636
Tokenuck
Token ID1347
Feature activation-6.03e-01

Pos logit contributions
mine+4.553
al+4.319
talk+4.106
x+4.012
als+3.926
Neg logit contributions
 glad-5.838
 lucky-5.114
 important-5.026
 supposed-4.870
 tired-4.753
Token,
Token ID11
Feature activation-1.70e+00

Pos logit contributions
mine+1.164
al+1.096
talk+1.044
als+1.023
ted+1.017
Neg logit contributions
 glad-1.374
 lucky-1.170
 important-1.126
 supposed-1.094
 pleased-1.091
Token I
Token ID314
Feature activation+3.16e-01

Pos logit contributions
mine+3.382
al+3.165
talk+3.128
x+2.941
als+2.939
Neg logit contributions
 glad-3.940
 lucky-3.411
 important-3.248
 pleased-3.071
 supposed-3.061
Token don
Token ID836
Feature activation-8.45e-01

Pos logit contributions
 glad+0.744
 lucky+0.626
 important+0.606
 pleased+0.598
 supposed+0.585
Neg logit contributions
mine-0.588
al-0.550
talk-0.523
of-0.522
ted-0.507
Token for
Token ID329
Feature activation-1.34e+00

Pos logit contributions
mine+0.655
al+0.602
talk+0.563
als+0.546
ted+0.544
Neg logit contributions
 glad-1.279
 lucky-1.123
 important-1.089
 supposed-1.071
 pleased-1.065
Token some
Token ID617
Feature activation+3.84e-01

Pos logit contributions
mine+2.531
al+2.361
talk+2.242
als+2.229
x+2.211
Neg logit contributions
 glad-3.229
 lucky-2.787
 important-2.684
 pleased-2.570
 supposed-2.547
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
 glad+0.977
 lucky+0.837
 supposed+0.810
 pleased+0.802
 important+0.800
Neg logit contributions
mine-0.613
al-0.577
talk-0.543
ted-0.540
of-0.535
Token He
Token ID679
Feature activation-1.00e+00

Pos logit contributions
mine+0.740
al+0.700
talk+0.669
als+0.657
ted+0.656
Neg logit contributions
 glad-0.785
 lucky-0.664
 important-0.642
 supposed-0.629
 pleased-0.616
Token gave
Token ID2921
Feature activation-1.79e+00

Pos logit contributions
mine+2.006
al+1.876
talk+1.805
als+1.766
of+1.747
Neg logit contributions
 glad-2.239
 lucky-1.903
 important-1.830
 pleased-1.771
 supposed-1.756
Token them
Token ID606
Feature activation-2.73e+00

Pos logit contributions
mine+2.502
al+2.257
talk+2.188
ede+2.143
x+2.117
Neg logit contributions
 glad-5.010
 lucky-4.470
 important-4.269
 pleased-4.109
 tired-4.108
Token each
Token ID1123
Feature activation-1.25e+00

Pos logit contributions
mine+6.477
al+6.011
ede+5.868
talk+5.837
x+5.825
Neg logit contributions
 glad-5.444
 lucky-4.718
 important-4.406
 tired-4.158
 happy-4.005
Token a
Token ID257
Feature activation+1.08e+00

Pos logit contributions
mine+2.748
al+2.601
talk+2.486
x+2.471
als+2.459
Neg logit contributions
 glad-2.556
 lucky-2.151
 important-2.053
 pleased-1.951
 supposed-1.940
Token cone
Token ID27763
Feature activation-1.45e+00

Pos logit contributions
 glad+2.532
 supposed+2.133
 lucky+2.120
 important+2.067
 pleased+1.978
Neg logit contributions
mine-1.937
al-1.788
of-1.780
talk-1.710
 cash-1.692
Token with
Token ID351
Feature activation-7.69e-01

Pos logit contributions
mine+2.913
al+2.729
talk+2.645
ted+2.576
x+2.559
Neg logit contributions
 glad-3.231
 lucky-2.747
 important-2.641
 pleased-2.546
 supposed-2.529
Token a
Token ID257
Feature activation+1.12e+00

Pos logit contributions
mine+1.616
al+1.525
talk+1.466
als+1.433
x+1.424
Neg logit contributions
 glad-1.624
 lucky-1.366
 important-1.309
 supposed-1.265
 pleased-1.262
Token just
Token ID655
Feature activation-2.95e-01

Pos logit contributions
mine+2.447
al+2.331
talk+2.246
als+2.192
ted+2.190
Neg logit contributions
 glad-2.415
 lucky-2.032
 important-1.945
 pleased-1.873
 supposed-1.868
Token couldn
Token ID3521
Feature activation-8.91e-01

Pos logit contributions
mine+0.621
al+0.590
talk+0.561
als+0.555
ted+0.551
Neg logit contributions
 glad-0.621
 lucky-0.521
 important-0.500
 supposed-0.487
 pleased-0.483
Token't
Token ID470
Feature activation-2.10e+00

Pos logit contributions
mine+1.646
al+1.553
talk+1.486
als+1.450
ted+1.441
Neg logit contributions
 glad-2.088
 lucky-1.792
 important-1.731
 supposed-1.682
 pleased-1.666
Token decide
Token ID5409
Feature activation-3.43e-01

Pos logit contributions
mine+3.936
al+3.804
ted+3.795
als+3.686
x+3.612
Neg logit contributions
 glad-4.816
 lucky-4.203
 important-4.052
 supposed-3.928
 pleased-3.820
Token.
Token ID13
Feature activation-6.44e-01

Pos logit contributions
mine+0.692
al+0.654
talk+0.625
of+0.612
als+0.611
Neg logit contributions
 glad-0.745
 lucky-0.627
 important-0.602
 pleased-0.586
 supposed-0.583
Token Then
Token ID3244
Feature activation-2.65e+00

Pos logit contributions
mine+1.267
al+1.193
talk+1.141
als+1.116
ted+1.109
Neg logit contributions
 glad-1.443
 lucky-1.227
 important-1.180
 supposed-1.145
 pleased-1.141
Token Bill
Token ID3941
Feature activation-9.30e-01

Pos logit contributions
mine+5.863
al+5.414
talk+5.242
x+5.182
of+5.113
Neg logit contributions
 glad-5.887
 lucky-5.134
 important-4.776
 tired-4.584
 pleased-4.503
Token had
Token ID550
Feature activation+3.69e-01

Pos logit contributions
mine+1.767
al+1.652
talk+1.583
als+1.544
ted+1.535
Neg logit contributions
 glad-2.141
 lucky-1.833
 important-1.764
 pleased-1.711
 supposed-1.709
Token a
Token ID257
Feature activation+1.52e+00

Pos logit contributions
 glad+0.765
 lucky+0.632
 important+0.594
 pleased+0.590
 supposed+0.584
Neg logit contributions
mine-0.784
al-0.746
talk-0.724
of-0.702
ted-0.699
Token mighty
Token ID18680
Feature activation+4.46e-01

Pos logit contributions
 glad+3.157
 supposed+2.616
 lucky+2.540
 important+2.475
 pleased+2.421
Neg logit contributions
mine-3.039
al-2.845
of-2.775
ede-2.723
talk-2.714
Token idea
Token ID2126
Feature activation-1.29e+00

Pos logit contributions
 glad+0.938
 lucky+0.781
 supposed+0.750
 important+0.745
 pleased+0.729
Neg logit contributions
mine-0.908
al-0.861
talk-0.818
of-0.798
als-0.795