TOP ACTIVATIONS
MAX = 4.658

and yawn ed . He was hungry and wanted to find
." Tom nods . He is sleepy . He says ,
Anna nodded . She was a brave girl . She
pretty , but Tweet ie was scared it would hurt her
. L ila was very happy . She thanked
?" L ila was surprised . She did not
?" S ara is surprised . She says ,
. S ara is happy and hopeful . She
sky got dark and he was tired . So he decided
?" L ily was happy to have a new
L ily nodded . She was sorry for making Mom my
things ." Anna was eager to hear the story
?" L ily was curious and said , "
." S ara is sad . She says ,
goodbye to Anna ." Ben is sad . He does not
asleep . The turkey was very happy . <|endoftext|> One sunny
." S ara is not happy . She likes
?" L ila was excited . She said ,
. L ily is curious . She wants to
?" S ara is happy . She gives Ben

MIN ACTIVATIONS
MIN = -4.724

!" Mia says . She follows Sam to the cave .
!" Lily says . She follows him with her sled .
of the hill . Ben follows her . They look down
to the box . Jen follows him . But
Sam tries to scare things with the lion ,
let 's go ." He followed Lily up the ladder and
cave . Sam follows him . He does not
." M ia chased after him . She wanted
knees ." Sam followed them into the tunnel .
." L ily followed Sam . She was happy
!" Sam says . He follows Tom and slides down .
wrong . They tried to comfort them and explain that sometimes
cries . Mom hugs Sam and tells him it
house . Mom hugs them and says she is
She looked anxious . She hugged them and took them to
!" Sam says . He follows Tim and slides down .
again ." Mom hugs them . " That 's
Sam smiled . He followed Tom and they looked for
pull some beans . Sam followed him , but he felt
if it fits !" She helped Sil via put on the

INTERVAL 2.312 - 4.658
CONTAINS 2.699%

John 's heart . He was so happy and thanked his
okay ." Tim felt a little better . He
the queen again . <|endoftext|> Once there was a messy piece
he liked it . He was very happy .
Jane 's feet . Jane was very happy and thanked the

INTERVAL -0.033 - 2.312
CONTAINS 48.748%

girl named Lily . She had a big sun flower in
water for the cat . He put the water in a
the dog and they played together for a while .
. <|endoftext|> Once upon a time , there was a little
home , but when she tried it on she realized it

INTERVAL -2.378 - -0.033
CONTAINS 47.279%

chew it ." She hugs Tim and says sorry
his books . He told them stories and jokes . They
It was small , but smart . It said it knew
upon a time , there was a little girl named Lily
everyone laughed so hard that they forgot about the trumpet .

INTERVAL -4.724 - -2.378
CONTAINS 1.274%

L ily said , " No , Tom . Stop
go ," she said , pat ting the little girl 's
Flowers are boring ." " But flowers are pretty
the boy and said , " Give me back my tiger
with other toys ." Tom listened to Sue .
Token and
Token ID290
Feature activation+1.55e-01

Pos logit contributions
 glad+0.815
 lucky+0.710
 important+0.687
 supposed+0.668
 pleased+0.667
Neg logit contributions
mine-0.502
al-0.465
talk-0.439
als-0.429
ted-0.426
Token yawn
Token ID40908
Feature activation+1.32e-01

Pos logit contributions
 glad+0.386
 lucky+0.331
 important+0.324
 supposed+0.322
 pleased+0.314
Neg logit contributions
mine-0.262
al-0.248
talk-0.235
ted-0.234
als-0.234
Tokened
Token ID276
Feature activation+3.71e-01

Pos logit contributions
 glad+0.375
 lucky+0.327
 important+0.318
 supposed+0.313
 pleased+0.310
Neg logit contributions
mine-0.184
al-0.169
als-0.156
ted-0.155
talk-0.155
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
 glad+0.897
 lucky+0.762
 supposed+0.745
 important+0.733
 pleased+0.720
Neg logit contributions
mine-0.659
al-0.628
ted-0.589
als-0.586
talk-0.583
Token He
Token ID679
Feature activation+1.26e+00

Pos logit contributions
 glad+3.534
 supposed+3.071
 lucky+2.984
 important+2.966
 pleased+2.925
Neg logit contributions
mine-2.733
al-2.611
ted-2.594
als-2.537
of-2.494
Token was
Token ID373
Feature activation+4.66e+00

Pos logit contributions
 glad+3.150
 supposed+2.702
 lucky+2.648
 important+2.569
 pleased+2.549
Neg logit contributions
mine-2.196
al-2.117
ted-2.015
talk-1.980
of-1.974
Token hungry
Token ID14720
Feature activation-1.56e-01

Pos logit contributions
 fault+10.133
 valuable+9.800
 glad+9.626
 rewarding+9.530
 pleasure+9.326
Neg logit contributions
mine-8.601
al-8.160
ting-8.123
ted-7.648
 him-7.618
Token and
Token ID290
Feature activation+2.40e+00

Pos logit contributions
mine+0.257
al+0.243
talk+0.227
als+0.225
of+0.223
Neg logit contributions
 glad-0.394
 lucky-0.339
 important-0.328
 pleased-0.322
 supposed-0.321
Token wanted
Token ID2227
Feature activation+6.86e-01

Pos logit contributions
 glad+5.639
 fault+5.220
 supposed+5.197
 lucky+4.802
 important+4.694
Neg logit contributions
mine-4.005
ted-3.870
al-3.724
Give-3.679
als-3.565
Token to
Token ID284
Feature activation-3.44e-01

Pos logit contributions
 glad+1.547
 lucky+1.276
 pleased+1.260
 supposed+1.210
 important+1.208
Neg logit contributions
mine-1.305
al-1.261
talk-1.237
ted-1.225
of-1.197
Token find
Token ID1064
Feature activation-7.44e-02

Pos logit contributions
mine+0.526
al+0.485
talk+0.457
of+0.442
als+0.442
Neg logit contributions
 glad-0.927
 lucky-0.807
 important-0.785
 supposed-0.767
 pleased-0.767
Token."
Token ID526
Feature activation-2.68e-01

Pos logit contributions
 glad+3.131
 lucky+2.671
 supposed+2.591
 pleased+2.586
 important+2.572
Neg logit contributions
mine-2.097
al-2.036
talk-1.875
als-1.797
ted-1.770
Token Tom
Token ID4186
Feature activation-1.03e-01

Pos logit contributions
mine+0.592
al+0.559
talk+0.545
of+0.533
als+0.532
Neg logit contributions
 glad-0.533
 lucky-0.445
 important-0.429
 supposed-0.418
 pleased-0.414
Token nods
Token ID34833
Feature activation+4.21e-01

Pos logit contributions
mine+0.186
al+0.176
talk+0.168
of+0.165
als+0.164
Neg logit contributions
 glad-0.245
 lucky-0.210
 important-0.204
 supposed-0.201
 pleased-0.198
Token.
Token ID13
Feature activation+1.35e+00

Pos logit contributions
 glad+1.033
 lucky+0.892
 supposed+0.876
 important+0.859
 pleased+0.834
Neg logit contributions
mine-0.741
al-0.680
talk-0.652
of-0.634
ted-0.624
Token He
Token ID679
Feature activation+9.45e-01

Pos logit contributions
 glad+3.369
 lucky+2.942
 supposed+2.929
 important+2.849
 pleased+2.838
Neg logit contributions
mine-2.139
talk-2.099
al-2.056
ted-2.004
of-2.001
Token is
Token ID318
Feature activation+4.64e+00

Pos logit contributions
 glad+2.315
 lucky+1.957
 supposed+1.936
 important+1.915
 pleased+1.908
Neg logit contributions
al-1.594
mine-1.587
als-1.496
ted-1.483
of-1.478
Token sleepy
Token ID42023
Feature activation+6.54e-01

Pos logit contributions
 fault+10.366
 valuable+10.178
 tempting+10.172
 rewarding+9.328
 important+9.163
Neg logit contributions
mine-8.532
al-7.816
Give-7.009
 cash-6.853
 her-6.850
Token.
Token ID13
Feature activation+1.89e+00

Pos logit contributions
 glad+1.584
 lucky+1.345
 important+1.288
 pleased+1.286
 supposed+1.285
Neg logit contributions
mine-1.186
al-1.108
talk-1.050
als-1.013
of-1.010
Token He
Token ID679
Feature activation+1.64e+00

Pos logit contributions
 glad+4.561
 supposed+4.022
 lucky+3.970
 important+3.885
 pleased+3.870
Neg logit contributions
mine-3.007
talk-2.967
of-2.919
al-2.907
ted-2.841
Token says
Token ID1139
Feature activation+8.90e-01

Pos logit contributions
 glad+3.962
 supposed+3.329
 lucky+3.307
 pleased+3.287
 important+3.227
Neg logit contributions
al-2.846
mine-2.787
als-2.704
ted-2.697
of-2.653
Token,
Token ID11
Feature activation+1.51e+00

Pos logit contributions
 glad+2.249
 supposed+1.974
 lucky+1.928
 important+1.884
 pleased+1.843
Neg logit contributions
mine-1.424
al-1.349
als-1.288
of-1.272
ted-1.254
Token
Token ID198
Feature activation-1.10e+00

Pos logit contributions
 glad+1.984
 supposed+1.653
 lucky+1.645
 important+1.606
 pleased+1.525
Neg logit contributions
mine-2.531
al-2.389
of-2.380
talk-2.304
ted-2.251
TokenAnna
Token ID31160
Feature activation-3.57e-01

Pos logit contributions
mine+1.475
al+1.346
talk+1.268
als+1.230
ted+1.227
Neg logit contributions
 glad-3.183
 lucky-2.810
 important-2.723
 pleased-2.667
 supposed-2.625
Token nodded
Token ID14464
Feature activation+4.39e-01

Pos logit contributions
mine+0.652
al+0.614
talk+0.581
als+0.569
of+0.565
Neg logit contributions
 glad-0.852
 lucky-0.730
 important-0.707
 supposed-0.694
 pleased-0.681
Token.
Token ID13
Feature activation+6.36e-01

Pos logit contributions
 glad+1.146
 lucky+0.998
 supposed+0.984
 important+0.966
 pleased+0.934
Neg logit contributions
mine-0.709
al-0.649
talk-0.593
ted-0.590
of-0.586
Token She
Token ID1375
Feature activation+1.98e-01

Pos logit contributions
 glad+1.687
 lucky+1.485
 important+1.451
 supposed+1.447
 pleased+1.405
Neg logit contributions
mine-0.974
al-0.942
als-0.894
ted-0.851
talk-0.845
Token was
Token ID373
Feature activation+4.50e+00

Pos logit contributions
 glad+0.468
 lucky+0.397
 supposed+0.384
 important+0.382
 pleased+0.372
Neg logit contributions
mine-0.360
al-0.344
talk-0.322
ted-0.319
als-0.319
Token a
Token ID257
Feature activation+1.57e+00

Pos logit contributions
 fault+9.978
 valuable+9.412
 glad+9.324
 tempting+9.211
 rewarding+8.860
Neg logit contributions
mine-8.193
al-7.972
Give-6.784
 cash-6.647
 her-6.458
Token brave
Token ID14802
Feature activation-8.25e-01

Pos logit contributions
 glad+3.209
 supposed+2.838
 important+2.604
 lucky+2.583
 fault+2.533
Neg logit contributions
mine-3.077
al-2.986
 cash-2.862
of-2.753
talk-2.699
Token girl
Token ID2576
Feature activation+1.81e-02

Pos logit contributions
mine+1.527
al+1.427
talk+1.365
als+1.336
ted+1.327
Neg logit contributions
 glad-1.953
 lucky-1.677
 important-1.617
 pleased-1.563
 supposed-1.562
Token.
Token ID13
Feature activation+9.18e-01

Pos logit contributions
 glad+0.042
 lucky+0.036
 important+0.035
 supposed+0.034
 pleased+0.034
Neg logit contributions
mine-0.034
al-0.032
talk-0.030
als-0.030
ted-0.029
Token She
Token ID1375
Feature activation+2.67e-01

Pos logit contributions
 glad+2.498
 lucky+2.229
 supposed+2.211
 important+2.193
 pleased+2.105
Neg logit contributions
mine-1.324
al-1.280
als-1.216
ted-1.154
of-1.148
Token pretty
Token ID2495
Feature activation-2.20e-02

Pos logit contributions
 fault+6.023
 glad+6.007
 supposed+5.661
 pleasure+4.677
 lucky+4.410
Neg logit contributions
mine-7.782
al-7.440
als-7.145
of-7.008
Give-6.918
Token,
Token ID11
Feature activation+7.94e-01

Pos logit contributions
mine+0.036
al+0.034
als+0.032
talk+0.031
ted+0.031
Neg logit contributions
 glad-0.057
 lucky-0.049
 important-0.047
 pleased-0.047
 supposed-0.046
Token but
Token ID475
Feature activation+8.62e-01

Pos logit contributions
 glad+2.032
 lucky+1.768
 supposed+1.688
 important+1.683
 pleased+1.674
Neg logit contributions
mine-1.252
al-1.153
ted-1.101
als-1.072
talk-1.038
Token Tweet
Token ID18752
Feature activation+6.32e-01

Pos logit contributions
 glad+2.358
 supposed+2.072
 lucky+2.031
 pleased+1.993
 important+1.970
Neg logit contributions
mine-1.176
al-1.123
ted-1.062
als-1.035
of-0.990
Tokenie
Token ID494
Feature activation+1.12e+00

Pos logit contributions
 glad+2.214
 lucky+1.978
 important+1.917
 pleased+1.915
 supposed+1.903
Neg logit contributions
mine-0.505
al-0.433
of-0.361
x-0.344
ted-0.329
Token was
Token ID373
Feature activation+4.50e+00

Pos logit contributions
 glad+2.613
 lucky+2.170
 supposed+2.121
 important+2.088
 pleased+2.081
Neg logit contributions
mine-2.148
al-2.097
als-1.920
ted-1.911
of-1.873
Token scared
Token ID12008
Feature activation+1.16e+00

Pos logit contributions
 glad+8.288
 fault+8.171
 pleasure+7.682
aj+7.539
 tempting+7.364
Neg logit contributions
al-9.574
mine-9.430
ting-8.077
 each-8.064
als-7.812
Token it
Token ID340
Feature activation+1.06e+00

Pos logit contributions
 glad+2.876
 lucky+2.456
 pleased+2.434
 supposed+2.409
 important+2.343
Neg logit contributions
mine-1.881
al-1.799
ted-1.729
of-1.712
als-1.664
Token would
Token ID561
Feature activation+8.05e-01

Pos logit contributions
 glad+2.593
 lucky+2.133
 pleased+2.120
 important+2.021
 supposed+2.017
Neg logit contributions
mine-1.971
al-1.829
ted-1.732
of-1.731
als-1.724
Token hurt
Token ID5938
Feature activation-9.04e-01

Pos logit contributions
 glad+2.023
 lucky+1.736
 important+1.685
 pleased+1.655
 supposed+1.641
Neg logit contributions
mine-1.386
al-1.237
of-1.196
talk-1.192
als-1.149
Token her
Token ID607
Feature activation+1.40e+00

Pos logit contributions
mine+1.688
al+1.580
talk+1.520
als+1.479
ted+1.469
Neg logit contributions
 glad-2.129
 lucky-1.829
 important-1.765
 supposed-1.709
 pleased-1.690
Token.
Token ID13
Feature activation+1.07e+00

Pos logit contributions
 glad+1.132
 lucky+0.966
 supposed+0.919
 important+0.917
 pleased+0.911
Neg logit contributions
mine-0.809
al-0.759
talk-0.712
als-0.708
of-0.695
Token
Token ID198
Feature activation+1.09e+00

Pos logit contributions
 glad+2.611
 supposed+2.300
 lucky+2.299
 important+2.284
 pleased+2.205
Neg logit contributions
mine-1.772
al-1.691
ted-1.624
als-1.624
of-1.571
Token
Token ID198
Feature activation-2.26e+00

Pos logit contributions
 glad+2.009
 lucky+1.684
 important+1.642
 supposed+1.635
 pleased+1.567
Neg logit contributions
mine-2.564
of-2.447
al-2.378
talk-2.337
ted-2.317
TokenL
Token ID43
Feature activation-7.61e-01

Pos logit contributions
mine+3.115
al+2.857
talk+2.680
x+2.594
als+2.580
Neg logit contributions
 glad-6.846
 lucky-6.031
 important-5.830
 pleased-5.673
 tired-5.570
Tokenila
Token ID10102
Feature activation+7.34e-02

Pos logit contributions
mine+1.185
al+1.093
talk+1.037
als+1.005
ted+0.997
Neg logit contributions
 glad-2.030
 lucky-1.774
 important-1.718
 supposed-1.679
 pleased-1.669
Token was
Token ID373
Feature activation+4.50e+00

Pos logit contributions
 glad+0.178
 lucky+0.153
 important+0.149
 supposed+0.147
 pleased+0.143
Neg logit contributions
mine-0.130
al-0.124
talk-0.116
als-0.114
of-0.114
Token very
Token ID845
Feature activation+3.63e+00

Pos logit contributions
 fault+9.209
 valuable+8.747
 glad+8.455
 tempting+8.445
 rewarding+8.300
Neg logit contributions
mine-9.124
al-8.690
Give-7.770
 cash-7.641
talk-7.366
Token happy
Token ID3772
Feature activation+1.56e-01

Pos logit contributions
 fault+4.467
 supposed+3.969
 glad+3.820
 important+3.413
 pleasure+3.265
Neg logit contributions
mine-9.810
al-9.444
of-9.036
Give-8.886
talk-8.856
Token.
Token ID13
Feature activation+6.69e-01

Pos logit contributions
 glad+0.378
 lucky+0.326
 important+0.316
 supposed+0.309
 pleased+0.308
Neg logit contributions
mine-0.273
al-0.258
talk-0.241
als-0.239
ted-0.238
Token She
Token ID1375
Feature activation+8.44e-01

Pos logit contributions
 glad+1.840
 lucky+1.629
 important+1.592
 pleased+1.582
 supposed+1.576
Neg logit contributions
mine-0.932
al-0.893
als-0.850
ted-0.824
talk-0.795
Token thanked
Token ID26280
Feature activation-2.27e+00

Pos logit contributions
 glad+2.080
 lucky+1.773
 important+1.728
 supposed+1.725
 pleased+1.686
Neg logit contributions
mine-1.416
al-1.363
talk-1.290
als-1.276
of-1.268
Token?"
Token ID1701
Feature activation-5.79e-01

Pos logit contributions
 glad+2.203
 lucky+1.851
 supposed+1.735
 pleased+1.727
 important+1.725
Neg logit contributions
mine-2.313
al-2.226
talk-2.044
als-2.033
of-2.027
Token
Token ID198
Feature activation-2.46e-02

Pos logit contributions
mine+1.318
al+1.250
talk+1.202
als+1.179
ted+1.175
Neg logit contributions
 glad-1.119
 lucky-0.926
 important-0.884
 supposed-0.855
 pleased-0.848
Token
Token ID198
Feature activation-2.04e+00

Pos logit contributions
mine+0.063
al+0.060
talk+0.058
of+0.058
ted+0.057
Neg logit contributions
 glad-0.040
 lucky-0.032
 important-0.031
 supposed-0.030
 pleased-0.029
TokenL
Token ID43
Feature activation-9.27e-01

Pos logit contributions
mine+2.971
al+2.793
ted+2.654
als+2.646
talk+2.587
Neg logit contributions
 glad-5.765
 lucky-5.019
 important-4.823
 pleased-4.781
 happy-4.596
Tokenila
Token ID10102
Feature activation-1.13e-01

Pos logit contributions
mine+1.383
al+1.270
talk+1.207
als+1.180
ted+1.172
Neg logit contributions
 glad-2.532
 lucky-2.213
 important-2.144
 supposed-2.102
 pleased-2.087
Token was
Token ID373
Feature activation+4.49e+00

Pos logit contributions
mine+0.202
al+0.191
talk+0.179
als+0.177
ted+0.175
Neg logit contributions
 glad-0.274
 lucky-0.236
 important-0.228
 supposed-0.225
 pleased-0.221
Token surprised
Token ID6655
Feature activation-8.50e-01

Pos logit contributions
 glad+8.050
 fault+7.737
 valuable+7.686
 rewarding+7.460
 pleasure+7.371
Neg logit contributions
mine-9.929
al-9.585
Give-9.001
 cash-8.154
ting-8.103
Token.
Token ID13
Feature activation-4.09e-01

Pos logit contributions
mine+1.635
al+1.533
talk+1.469
als+1.432
x+1.423
Neg logit contributions
 glad-1.965
 lucky-1.678
 important-1.617
 pleased-1.565
 supposed-1.560
Token She
Token ID1375
Feature activation+7.53e-01

Pos logit contributions
mine+0.684
al+0.640
talk+0.600
als+0.592
ted+0.583
Neg logit contributions
 glad-1.035
 lucky-0.900
 important-0.872
 supposed-0.853
 pleased-0.847
Token did
Token ID750
Feature activation+7.47e-01

Pos logit contributions
 glad+1.815
 lucky+1.553
 supposed+1.523
 important+1.512
 pleased+1.435
Neg logit contributions
mine-1.326
al-1.281
als-1.180
ted-1.178
of-1.176
Token not
Token ID407
Feature activation-7.88e-02

Pos logit contributions
 glad+1.127
 lucky+0.846
 important+0.797
 supposed+0.796
 pleased+0.776
Neg logit contributions
mine-1.954
al-1.841
of-1.795
talk-1.773
ede-1.771
Token?"
Token ID1701
Feature activation-2.32e-01

Pos logit contributions
 glad+1.701
 lucky+1.389
 supposed+1.339
 pleased+1.306
 important+1.298
Neg logit contributions
mine-1.915
al-1.802
talk-1.694
of-1.693
als-1.690
Token
Token ID198
Feature activation+8.83e-01

Pos logit contributions
mine+0.555
al+0.527
talk+0.512
of+0.502
ted+0.501
Neg logit contributions
 glad-0.420
 lucky-0.343
 important-0.329
 supposed-0.323
 pleased-0.313
Token
Token ID198
Feature activation-2.06e+00

Pos logit contributions
 glad+1.600
 supposed+1.332
 lucky+1.314
 important+1.266
 pleased+1.220
Neg logit contributions
mine-2.081
of-1.989
al-1.985
talk-1.947
ted-1.887
TokenS
Token ID50
Feature activation-2.37e+00

Pos logit contributions
mine+3.220
al+2.891
als+2.711
x+2.711
ted+2.696
Neg logit contributions
 glad-5.750
 lucky-5.069
 important-4.890
 pleased-4.752
 tired-4.666
Tokenara
Token ID3301
Feature activation-4.29e-01

Pos logit contributions
mine+3.156
al+2.681
talk+2.657
x+2.549
als+2.496
Neg logit contributions
 glad-6.851
 lucky-6.101
 important-5.891
 pleased-5.707
 tired-5.677
Token is
Token ID318
Feature activation+4.47e+00

Pos logit contributions
mine+0.781
al+0.737
talk+0.699
als+0.685
of+0.681
Neg logit contributions
 glad-1.023
 lucky-0.878
 important-0.849
 supposed-0.829
 pleased-0.822
Token surprised
Token ID6655
Feature activation-6.43e-01

Pos logit contributions
 pleasure+7.899
 fault+7.846
 valuable+7.755
 glad+7.665
 lifetime+7.591
Neg logit contributions
Give-9.682
mine-9.513
al-9.255
 cash-8.156
Tom-8.034
Token.
Token ID13
Feature activation-7.66e-01

Pos logit contributions
mine+1.306
al+1.232
talk+1.179
als+1.154
ted+1.148
Neg logit contributions
 glad-1.411
 lucky-1.194
 important-1.146
 supposed-1.112
 pleased-1.108
Token She
Token ID1375
Feature activation+1.07e+00

Pos logit contributions
mine+1.334
al+1.244
talk+1.185
als+1.150
ted+1.143
Neg logit contributions
 glad-1.897
 lucky-1.636
 important-1.579
 supposed-1.535
 pleased-1.533
Token says
Token ID1139
Feature activation-2.35e-01

Pos logit contributions
 glad+2.648
 lucky+2.256
 supposed+2.233
 important+2.222
 pleased+2.173
Neg logit contributions
mine-1.767
al-1.758
ted-1.641
als-1.636
of-1.628
Token,
Token ID11
Feature activation+1.39e+00

Pos logit contributions
mine+0.427
al+0.402
talk+0.378
als+0.377
ted+0.373
Neg logit contributions
 glad-0.559
 lucky-0.480
 important-0.465
 supposed-0.460
 pleased-0.450
Token.
Token ID13
Feature activation+1.84e+00

Pos logit contributions
 glad+1.142
 lucky+0.973
 important+0.928
 pleased+0.918
 supposed+0.909
Neg logit contributions
mine-0.895
al-0.845
talk-0.801
of-0.784
als-0.783
Token
Token ID198
Feature activation+1.56e+00

Pos logit contributions
 glad+4.461
 supposed+4.090
 lucky+3.944
 important+3.922
 pleased+3.776
Neg logit contributions
mine-3.043
al-2.912
talk-2.824
of-2.777
ted-2.757
Token
Token ID198
Feature activation-1.07e+00

Pos logit contributions
 glad+2.915
 supposed+2.534
 lucky+2.484
 important+2.406
 pleased+2.281
Neg logit contributions
mine-3.538
of-3.442
talk-3.319
al-3.290
ted-3.136
TokenS
Token ID50
Feature activation-1.73e+00

Pos logit contributions
mine+1.471
al+1.353
talk+1.247
als+1.243
ted+1.222
Neg logit contributions
 glad-3.094
 lucky-2.721
 important-2.644
 pleased-2.583
 supposed-2.546
Tokenara
Token ID3301
Feature activation-3.04e-02

Pos logit contributions
mine+2.075
al+1.823
talk+1.743
als+1.685
x+1.675
Neg logit contributions
 glad-5.224
 lucky-4.638
 important-4.513
 pleased-4.397
 supposed-4.374
Token is
Token ID318
Feature activation+4.46e+00

Pos logit contributions
mine+0.055
al+0.052
talk+0.049
als+0.048
of+0.048
Neg logit contributions
 glad-0.073
 lucky-0.063
 important-0.061
 supposed-0.060
 pleased-0.059
Token happy
Token ID3772
Feature activation+6.04e-01

Pos logit contributions
 tempting+7.718
 fault+7.511
 valuable+7.444
 important+7.137
 rewarding+7.004
Neg logit contributions
mine-10.382
al-9.654
Give-8.955
 cash-8.502
ting-8.184
Token and
Token ID290
Feature activation+1.77e+00

Pos logit contributions
 glad+1.438
 lucky+1.248
 important+1.217
 supposed+1.213
 pleased+1.176
Neg logit contributions
mine-1.074
al-1.011
talk-0.953
of-0.947
ted-0.938
Token hopeful
Token ID17836
Feature activation-5.15e-02

Pos logit contributions
 glad+3.595
 supposed+3.164
 important+3.092
 lucky+2.979
 fault+2.950
Neg logit contributions
mine-3.339
al-3.336
als-3.248
ted-3.237
talk-3.135
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
mine+0.099
al+0.093
talk+0.089
of+0.088
als+0.087
Neg logit contributions
 glad-0.117
 lucky-0.100
 important-0.096
 supposed-0.095
 pleased-0.094
Token She
Token ID1375
Feature activation+6.34e-01

Pos logit contributions
 glad+3.314
 lucky+2.974
 important+2.923
 supposed+2.900
 pleased+2.876
Neg logit contributions
mine-1.551
al-1.475
als-1.383
talk-1.335
ted-1.335
Token sky
Token ID6766
Feature activation+8.58e-03

Pos logit contributions
 glad+1.102
 lucky+0.908
 supposed+0.905
 important+0.876
 pleased+0.870
Neg logit contributions
mine-0.965
al-0.934
talk-0.887
als-0.881
ted-0.868
Token got
Token ID1392
Feature activation+1.29e+00

Pos logit contributions
 glad+0.021
 lucky+0.017
 important+0.017
 supposed+0.017
 pleased+0.016
Neg logit contributions
mine-0.016
al-0.015
als-0.014
talk-0.014
ted-0.014
Token dark
Token ID3223
Feature activation-8.30e-02

Pos logit contributions
 glad+2.251
 lucky+1.741
 supposed+1.647
 pleased+1.603
 important+1.597
Neg logit contributions
mine-3.186
al-3.023
talk-2.958
ted-2.921
als-2.896
Token and
Token ID290
Feature activation+9.80e-01

Pos logit contributions
mine+0.147
al+0.139
talk+0.131
als+0.129
ted+0.129
Neg logit contributions
 glad-0.203
 lucky-0.173
 important-0.166
 pleased-0.164
 supposed-0.162
Token he
Token ID339
Feature activation+1.44e+00

Pos logit contributions
 glad+2.109
 supposed+1.734
 lucky+1.731
 important+1.665
 pleased+1.644
Neg logit contributions
mine-1.983
al-1.881
ted-1.832
als-1.823
talk-1.773
Token was
Token ID373
Feature activation+4.46e+00

Pos logit contributions
 glad+3.659
 supposed+3.093
 lucky+3.047
 important+3.010
 pleased+2.881
Neg logit contributions
mine-2.500
al-2.429
als-2.255
ted-2.238
talk-2.226
Token tired
Token ID10032
Feature activation-4.06e-01

Pos logit contributions
 glad+10.480
 valuable+10.065
 important+9.748
 tempting+9.579
 rewarding+9.504
Neg logit contributions
mine-8.585
al-7.817
ting-7.372
ted-6.747
talk-6.685
Token.
Token ID13
Feature activation+9.86e-01

Pos logit contributions
mine+0.800
al+0.753
talk+0.719
als+0.705
of+0.701
Neg logit contributions
 glad-0.908
 lucky-0.768
 important-0.737
 pleased-0.718
 supposed-0.717
Token So
Token ID1406
Feature activation-1.30e+00

Pos logit contributions
 glad+2.407
 lucky+2.048
 supposed+2.001
 pleased+1.955
 important+1.942
Neg logit contributions
mine-1.713
al-1.654
talk-1.591
als-1.580
ted-1.574
Token he
Token ID339
Feature activation-1.59e-01

Pos logit contributions
mine+2.472
al+2.280
talk+2.203
x+2.143
als+2.125
Neg logit contributions
 glad-3.132
 lucky-2.716
 important-2.599
 pleased-2.494
 tired-2.458
Token decided
Token ID3066
Feature activation+4.31e-01

Pos logit contributions
mine+0.253
al+0.238
talk+0.225
als+0.218
ted+0.218
Neg logit contributions
 glad-0.417
 lucky-0.360
 important-0.348
 supposed-0.345
 pleased-0.340
Token?"
Token ID1701
Feature activation+2.91e-01

Pos logit contributions
 glad+1.349
 lucky+1.130
 supposed+1.071
 important+1.068
 pleased+1.065
Neg logit contributions
mine-1.310
al-1.238
talk-1.156
of-1.154
x-1.147
Token
Token ID198
Feature activation+9.21e-01

Pos logit contributions
 glad+0.601
 lucky+0.508
 supposed+0.494
 important+0.493
 pleased+0.475
Neg logit contributions
mine-0.628
al-0.581
talk-0.559
of-0.555
ted-0.555
Token
Token ID198
Feature activation-1.82e+00

Pos logit contributions
 glad+1.687
 lucky+1.389
 supposed+1.379
 important+1.326
 pleased+1.294
Neg logit contributions
mine-2.177
of-2.061
al-2.031
talk-1.993
ted-1.953
TokenL
Token ID43
Feature activation-4.97e-01

Pos logit contributions
mine+1.947
al+1.755
als+1.583
talk+1.569
ted+1.547
Neg logit contributions
 glad-5.892
 lucky-5.280
 important-5.172
 pleased-5.014
 happy-4.925
Tokenily
Token ID813
Feature activation-9.62e-01

Pos logit contributions
mine+0.752
al+0.695
talk+0.652
als+0.631
of+0.628
Neg logit contributions
 glad-1.352
 lucky-1.183
 important-1.145
 pleased-1.119
 supposed-1.116
Token was
Token ID373
Feature activation+4.44e+00

Pos logit contributions
mine+1.852
al+1.734
talk+1.663
als+1.622
x+1.613
Neg logit contributions
 glad-2.193
 lucky-1.874
 important-1.800
 pleased-1.748
 supposed-1.730
Token happy
Token ID3772
Feature activation+4.91e-01

Pos logit contributions
 glad+8.465
 fault+8.414
 pleasure+7.945
 valuable+7.680
 tempting+7.651
Neg logit contributions
mine-9.404
al-9.095
Give-8.740
 cash-8.303
Wh-7.760
Token to
Token ID284
Feature activation-1.04e+00

Pos logit contributions
 glad+1.214
 lucky+1.054
 important+1.026
 supposed+1.012
 pleased+0.994
Neg logit contributions
mine-0.833
al-0.785
talk-0.743
ted-0.738
of-0.730
Token have
Token ID423
Feature activation+2.51e-01

Pos logit contributions
mine+1.949
al+1.843
talk+1.748
als+1.729
ted+1.725
Neg logit contributions
 glad-2.419
 lucky-2.070
 important-1.984
 supposed-1.931
 pleased-1.919
Token a
Token ID257
Feature activation+1.51e+00

Pos logit contributions
 glad+0.619
 lucky+0.526
 pleased+0.507
 important+0.506
 supposed+0.501
Neg logit contributions
mine-0.433
al-0.410
talk-0.389
ted-0.383
als-0.380
Token new
Token ID649
Feature activation+4.50e-01

Pos logit contributions
 glad+2.723
 supposed+2.260
 lucky+2.096
 pleased+2.049
 important+2.041
Neg logit contributions
mine-3.459
al-3.302
of-3.259
 cash-3.211
talk-3.181
TokenL
Token ID43
Feature activation-9.63e-01

Pos logit contributions
mine+1.185
al+1.033
talk+0.932
als+0.905
ted+0.896
Neg logit contributions
 glad-4.128
 lucky-3.696
 important-3.576
 pleased-3.519
 supposed-3.467
Tokenily
Token ID813
Feature activation-3.26e-02

Pos logit contributions
mine+1.524
al+1.408
talk+1.348
als+1.310
ted+1.304
Neg logit contributions
 glad-2.537
 lucky-2.209
 important-2.141
 supposed-2.103
 pleased-2.074
Token nodded
Token ID14464
Feature activation+6.93e-01

Pos logit contributions
mine+0.056
al+0.053
talk+0.050
als+0.049
of+0.049
Neg logit contributions
 glad-0.081
 lucky-0.070
 important-0.068
 supposed-0.067
 pleased-0.065
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
 glad+1.747
 lucky+1.509
 supposed+1.491
 important+1.471
 pleased+1.415
Neg logit contributions
mine-1.188
al-1.087
talk-1.002
ted-0.987
of-0.986
Token She
Token ID1375
Feature activation+1.83e-01

Pos logit contributions
 glad+3.097
 lucky+2.754
 important+2.744
 supposed+2.711
 pleased+2.672
Neg logit contributions
mine-1.660
al-1.579
als-1.510
ted-1.428
talk-1.411
Token was
Token ID373
Feature activation+4.43e+00

Pos logit contributions
 glad+0.428
 lucky+0.364
 important+0.351
 supposed+0.350
 pleased+0.339
Neg logit contributions
mine-0.337
al-0.321
talk-0.301
ted-0.299
of-0.298
Token sorry
Token ID7926
Feature activation+1.18e+00

Pos logit contributions
 valuable+8.465
 fault+8.347
 glad+8.195
 pleasure+8.067
 tempting+8.027
Neg logit contributions
mine-8.704
al-8.682
Give-7.906
 cash-7.704
ting-7.355
Token for
Token ID329
Feature activation+3.13e-01

Pos logit contributions
 glad+2.872
 supposed+2.465
 lucky+2.461
 pleased+2.411
 important+2.346
Neg logit contributions
mine-1.957
al-1.811
of-1.810
als-1.738
ted-1.707
Token making
Token ID1642
Feature activation+2.92e-01

Pos logit contributions
 glad+0.623
 lucky+0.524
 important+0.505
 pleased+0.494
 supposed+0.485
Neg logit contributions
mine-0.685
al-0.644
ted-0.601
x-0.599
of-0.594
Token Mom
Token ID11254
Feature activation+1.54e-01

Pos logit contributions
 glad+0.749
 lucky+0.646
 important+0.625
 pleased+0.619
 supposed+0.615
Neg logit contributions
mine-0.478
al-0.441
talk-0.408
of-0.402
ted-0.399
Tokenmy
Token ID1820
Feature activation+1.72e-01

Pos logit contributions
 glad+0.403
 lucky+0.353
 important+0.344
 supposed+0.335
 pleased+0.331
Neg logit contributions
mine-0.246
al-0.222
talk-0.212
of-0.208
als-0.202
Token things
Token ID1243
Feature activation+9.45e-01

Pos logit contributions
 glad+0.688
 lucky+0.586
 important+0.564
 supposed+0.560
 pleased+0.560
Neg logit contributions
mine-0.498
al-0.475
als-0.449
talk-0.447
ted-0.442
Token."
Token ID526
Feature activation+3.81e-01

Pos logit contributions
 glad+2.091
 lucky+1.737
 pleased+1.675
 important+1.647
 supposed+1.631
Neg logit contributions
mine-1.935
al-1.839
als-1.746
talk-1.725
of-1.712
Token
Token ID198
Feature activation+1.46e+00

Pos logit contributions
 glad+0.795
 lucky+0.677
 important+0.671
 supposed+0.664
 pleased+0.634
Neg logit contributions
mine-0.797
al-0.754
talk-0.725
of-0.722
als-0.720
Token
Token ID198
Feature activation-9.37e-01

Pos logit contributions
 glad+2.659
 supposed+2.340
 lucky+2.247
 important+2.205
 pleased+2.067
Neg logit contributions
mine-3.391
of-3.284
al-3.174
talk-3.077
ted-3.044
TokenAnna
Token ID31160
Feature activation-4.56e-01

Pos logit contributions
mine+1.236
al+1.131
talk+1.052
als+1.023
ted+1.014
Neg logit contributions
 glad-2.741
 lucky-2.418
 important-2.349
 pleased-2.297
 supposed-2.278
Token was
Token ID373
Feature activation+4.43e+00

Pos logit contributions
mine+0.829
al+0.780
talk+0.740
als+0.725
ted+0.719
Neg logit contributions
 glad-1.089
 lucky-0.935
 important-0.904
 supposed-0.884
 pleased-0.874
Token eager
Token ID11069
Feature activation+6.77e-02

Pos logit contributions
 tempting+8.948
 fault+8.766
 valuable+8.579
 important+8.173
 glad+8.131
Neg logit contributions
mine-9.675
al-8.439
 cash-7.592
Give-7.589
 her-7.190
Token to
Token ID284
Feature activation+3.48e-01

Pos logit contributions
 glad+0.144
 lucky+0.120
 important+0.114
 pleased+0.113
 supposed+0.112
Neg logit contributions
mine-0.141
al-0.133
talk-0.129
als-0.126
of-0.126
Token hear
Token ID3285
Feature activation+1.78e-01

Pos logit contributions
 glad+0.958
 lucky+0.837
 important+0.821
 pleased+0.808
 supposed+0.802
Neg logit contributions
mine-0.498
al-0.459
talk-0.428
of-0.420
als-0.404
Token the
Token ID262
Feature activation+8.32e-01

Pos logit contributions
 glad+0.472
 lucky+0.412
 important+0.399
 supposed+0.399
 pleased+0.392
Neg logit contributions
mine-0.271
al-0.253
talk-0.234
ted-0.233
als-0.230
Token story
Token ID1621
Feature activation-3.77e-01

Pos logit contributions
 glad+2.255
 lucky+1.937
 supposed+1.919
 pleased+1.892
 important+1.874
Neg logit contributions
mine-1.213
al-1.143
talk-1.055
 cash-1.043
ted-1.033
Token?"
Token ID1701
Feature activation-5.27e-01

Pos logit contributions
 glad+0.738
 lucky+0.652
 important+0.637
 supposed+0.635
 pleased+0.632
Neg logit contributions
mine-0.267
al-0.249
ted-0.220
als-0.219
talk-0.217
Token
Token ID198
Feature activation+2.38e-01

Pos logit contributions
mine+1.394
al+1.332
talk+1.288
als+1.269
ted+1.265
Neg logit contributions
 glad-0.826
 lucky-0.651
 important-0.616
 supposed-0.590
 pleased-0.581
Token
Token ID198
Feature activation-2.52e+00

Pos logit contributions
 glad+0.397
 lucky+0.322
 important+0.309
 supposed+0.306
 pleased+0.291
Neg logit contributions
mine-0.598
al-0.567
of-0.555
talk-0.551
ted-0.535
TokenL
Token ID43
Feature activation-1.46e+00

Pos logit contributions
mine+3.801
al+3.549
ted+3.422
als+3.388
talk+3.265
Neg logit contributions
 glad-7.195
 lucky-6.236
 pleased-5.957
 important-5.947
 happy-5.792
Tokenily
Token ID813
Feature activation-9.66e-01

Pos logit contributions
mine+2.432
al+2.233
talk+2.182
ted+2.145
als+2.144
Neg logit contributions
 glad-3.747
 lucky-3.246
 important-3.144
 supposed-3.103
 pleased-3.027
Token was
Token ID373
Feature activation+4.43e+00

Pos logit contributions
mine+1.722
al+1.603
talk+1.533
als+1.490
x+1.485
Neg logit contributions
 glad-2.336
 lucky-2.012
 important-1.937
 pleased-1.892
 supposed-1.870
Token curious
Token ID11040
Feature activation+9.58e-01

Pos logit contributions
 fault+7.763
 glad+7.622
 valuable+7.376
 tempting+7.130
 important+7.109
Neg logit contributions
mine-10.064
al-9.883
Give-8.911
 cash-8.289
talk-8.126
Token and
Token ID290
Feature activation+1.44e+00

Pos logit contributions
 glad+2.597
 lucky+2.267
 supposed+2.215
 important+2.165
 pleased+2.133
Neg logit contributions
mine-1.436
al-1.341
of-1.243
ted-1.238
als-1.235
Token said
Token ID531
Feature activation+3.73e-01

Pos logit contributions
 glad+2.413
 supposed+1.996
 fault+1.902
 lucky+1.880
 important+1.849
Neg logit contributions
al-3.308
mine-3.291
ted-3.197
als-3.182
Give-3.168
Token,
Token ID11
Feature activation+5.90e-01

Pos logit contributions
 glad+0.923
 lucky+0.795
 supposed+0.787
 important+0.775
 pleased+0.747
Neg logit contributions
mine-0.638
al-0.599
als-0.564
of-0.553
talk-0.550
Token "
Token ID366
Feature activation-2.86e+00

Pos logit contributions
 glad+1.029
 lucky+0.846
 important+0.837
 supposed+0.828
 pleased+0.761
Neg logit contributions
mine-1.421
al-1.356
als-1.311
ted-1.292
of-1.274
Token."
Token ID526
Feature activation+3.18e-01

Pos logit contributions
mine+1.255
al+1.181
talk+1.128
als+1.104
ted+1.097
Neg logit contributions
 glad-1.456
 lucky-1.238
 important-1.191
 supposed-1.157
 pleased-1.153
Token
Token ID198
Feature activation+1.24e+00

Pos logit contributions
 glad+0.541
 lucky+0.443
 important+0.434
 supposed+0.431
 pleased+0.407
Neg logit contributions
mine-0.787
al-0.744
talk-0.722
of-0.712
als-0.705
Token
Token ID198
Feature activation-1.58e+00

Pos logit contributions
 glad+2.298
 supposed+1.961
 lucky+1.927
 important+1.906
 pleased+1.800
Neg logit contributions
mine-2.852
of-2.707
al-2.676
talk-2.609
ted-2.525
TokenS
Token ID50
Feature activation-1.54e+00

Pos logit contributions
mine+2.308
al+2.140
als+1.998
ted+1.990
talk+1.990
Neg logit contributions
 glad-4.526
 lucky-3.969
 important-3.823
 pleased-3.748
 tired-3.647
Tokenara
Token ID3301
Feature activation-2.90e-01

Pos logit contributions
mine+1.823
al+1.585
talk+1.531
x+1.455
ted+1.442
Neg logit contributions
 glad-4.661
 lucky-4.173
 important-4.040
 supposed-3.942
 pleased-3.929
Token is
Token ID318
Feature activation+4.42e+00

Pos logit contributions
mine+0.568
al+0.539
talk+0.514
als+0.503
ted+0.500
Neg logit contributions
 glad-0.652
 lucky-0.554
 important-0.534
 supposed-0.522
 pleased-0.517
Token sad
Token ID6507
Feature activation+6.40e-01

Pos logit contributions
 pleasure+7.547
 glad+6.977
 valuable+6.953
 lifetime+6.940
 rewarding+6.864
Neg logit contributions
mine-10.353
Give-10.089
al-9.898
 cash-9.289
ting-8.844
Token.
Token ID13
Feature activation+3.56e-01

Pos logit contributions
 glad+1.567
 lucky+1.352
 pleased+1.291
 supposed+1.282
 important+1.282
Neg logit contributions
mine-1.116
al-1.030
talk-1.001
of-0.986
ted-0.950
Token She
Token ID1375
Feature activation+1.49e+00

Pos logit contributions
 glad+0.959
 lucky+0.847
 important+0.825
 supposed+0.810
 pleased+0.810
Neg logit contributions
mine-0.533
al-0.501
als-0.465
talk-0.456
ted-0.448
Token says
Token ID1139
Feature activation+2.08e-01

Pos logit contributions
 glad+3.558
 lucky+2.990
 pleased+2.983
 supposed+2.910
 important+2.876
Neg logit contributions
mine-2.612
al-2.559
ted-2.414
Give-2.412
talk-2.383
Token,
Token ID11
Feature activation+1.69e+00

Pos logit contributions
 glad+0.507
 lucky+0.436
 supposed+0.426
 important+0.423
 pleased+0.413
Neg logit contributions
mine-0.361
al-0.339
als-0.320
talk-0.315
ted-0.314
Token goodbye
Token ID24829
Feature activation+1.08e+00

Pos logit contributions
 glad+0.409
 lucky+0.345
 important+0.339
 supposed+0.334
 pleased+0.325
Neg logit contributions
mine-0.360
al-0.337
talk-0.321
ted-0.318
als-0.317
Token to
Token ID284
Feature activation-2.51e-01

Pos logit contributions
 glad+2.421
 lucky+2.048
 important+1.998
 supposed+1.980
 pleased+1.975
Neg logit contributions
mine-2.039
al-1.979
talk-1.935
als-1.912
ted-1.857
Token Anna
Token ID11735
Feature activation+2.89e-01

Pos logit contributions
mine+0.428
al+0.400
talk+0.376
of+0.369
ted+0.368
Neg logit contributions
 glad-0.626
 lucky-0.541
 important-0.525
 supposed-0.514
 pleased-0.514
Token."
Token ID526
Feature activation+5.86e-01

Pos logit contributions
 glad+0.759
 lucky+0.662
 important+0.642
 pleased+0.631
 supposed+0.630
Neg logit contributions
mine-0.463
al-0.427
talk-0.395
of-0.389
als-0.389
Token Ben
Token ID3932
Feature activation-1.02e-01

Pos logit contributions
 glad+1.399
 lucky+1.230
 important+1.207
 supposed+1.201
 pleased+1.151
Neg logit contributions
mine-1.062
al-0.964
talk-0.934
of-0.933
ted-0.910
Token is
Token ID318
Feature activation+4.42e+00

Pos logit contributions
mine+0.194
al+0.182
talk+0.174
of+0.171
als+0.170
Neg logit contributions
 glad-0.236
 lucky-0.202
 important-0.195
 supposed-0.193
 pleased-0.189
Token sad
Token ID6507
Feature activation+7.46e-01

Pos logit contributions
 valuable+8.799
 fault+8.411
 tempting+8.345
 rewarding+8.173
 important+8.129
Neg logit contributions
mine-9.255
Give-7.995
al-7.858
 cash-7.790
talk-7.276
Token.
Token ID13
Feature activation+1.96e+00

Pos logit contributions
 glad+1.855
 lucky+1.600
 supposed+1.546
 pleased+1.538
 important+1.535
Neg logit contributions
mine-1.251
al-1.152
talk-1.131
of-1.120
ted-1.082
Token He
Token ID679
Feature activation+1.78e+00

Pos logit contributions
 glad+5.097
 supposed+4.675
 lucky+4.575
 important+4.540
 pleased+4.286
Neg logit contributions
mine-2.977
al-2.692
of-2.692
ted-2.621
Give-2.610
Token does
Token ID857
Feature activation+1.34e+00

Pos logit contributions
 glad+4.179
 pleased+3.454
 lucky+3.447
 supposed+3.447
 important+3.330
Neg logit contributions
mine-3.177
al-3.123
talk-3.026
ted-3.021
Give-2.951
Token not
Token ID407
Feature activation+1.52e-01

Pos logit contributions
 glad+2.074
 lucky+1.619
 supposed+1.558
 important+1.555
 pleased+1.522
Neg logit contributions
mine-3.395
talk-3.118
al-3.110
of-3.062
Give-3.030
Token asleep
Token ID16039
Feature activation+6.42e-01

Pos logit contributions
 glad+2.906
 lucky+2.428
 important+2.288
 pleased+2.262
 supposed+2.229
Neg logit contributions
mine-3.231
ted-3.043
al-3.037
als-2.984
talk-2.935
Token.
Token ID13
Feature activation+2.00e+00

Pos logit contributions
 glad+1.508
 lucky+1.291
 supposed+1.255
 important+1.254
 pleased+1.207
Neg logit contributions
mine-1.185
al-1.117
talk-1.033
ted-1.016
als-1.014
Token The
Token ID383
Feature activation+9.18e-01

Pos logit contributions
 glad+4.381
 supposed+3.968
 important+3.932
 lucky+3.871
 pleased+3.684
Neg logit contributions
mine-3.756
al-3.609
of-3.509
ted-3.506
talk-3.500
Token turkey
Token ID26876
Feature activation+1.11e+00

Pos logit contributions
 glad+2.329
 supposed+2.001
 lucky+1.968
 important+1.919
 pleased+1.913
Neg logit contributions
mine-1.476
al-1.417
als-1.309
ted-1.269
talk-1.251
Token was
Token ID373
Feature activation+4.17e+00

Pos logit contributions
 glad+2.706
 supposed+2.313
 lucky+2.281
 important+2.246
 pleased+2.166
Neg logit contributions
mine-2.008
al-1.960
als-1.822
ted-1.786
talk-1.784
Token very
Token ID845
Feature activation+4.41e+00

Pos logit contributions
 glad+8.260
 fault+7.952
 tempting+7.926
 valuable+7.653
 supposed+7.621
Neg logit contributions
mine-8.706
al-7.853
talk-6.993
ted-6.920
ting-6.746
Token happy
Token ID3772
Feature activation+1.19e+00

Pos logit contributions
 fault+6.847
 supposed+6.787
 glad+5.633
 pleasure+5.074
 tempting+4.905
Neg logit contributions
mine-11.453
of-10.610
al-10.500
talk-10.069
ted-9.900
Token.
Token ID13
Feature activation+1.52e+00

Pos logit contributions
 glad+3.168
 lucky+2.770
 important+2.690
 supposed+2.674
 pleased+2.649
Neg logit contributions
mine-1.762
al-1.701
ted-1.583
als-1.572
of-1.504
Token<|endoftext|>
Token ID50256
Feature activation+1.46e+00

Pos logit contributions
 glad+3.173
 supposed+2.815
 important+2.810
 lucky+2.795
 pleased+2.644
Neg logit contributions
mine-3.067
al-2.953
ted-2.822
of-2.810
als-2.802
TokenOne
Token ID3198
Feature activation+2.37e+00

Pos logit contributions
 glad+2.949
 lucky+2.436
 important+2.417
 pleased+2.387
 supposed+2.287
Neg logit contributions
mine-3.388
al-3.232
of-3.046
talk-3.038
ted-3.034
Token sunny
Token ID27737
Feature activation+3.92e-01

Pos logit contributions
 glad+5.103
 supposed+4.226
 pleased+4.106
 lucky+4.061
 fault+4.001
Neg logit contributions
mine-4.831
of-4.717
al-4.479
talk-4.115
ted-4.077
Token."
Token ID526
Feature activation+1.26e+00

Pos logit contributions
mine+1.597
al+1.507
talk+1.450
als+1.418
ted+1.415
Neg logit contributions
 glad-1.507
 lucky-1.261
 important-1.207
 supposed-1.167
 pleased-1.161
Token
Token ID198
Feature activation+1.13e+00

Pos logit contributions
 glad+2.391
 supposed+2.128
 important+1.996
 lucky+1.984
 pleased+1.875
Neg logit contributions
mine-2.778
al-2.662
of-2.646
talk-2.593
ted-2.570
Token
Token ID198
Feature activation-1.87e+00

Pos logit contributions
 glad+2.124
 supposed+1.779
 lucky+1.763
 important+1.689
 pleased+1.650
Neg logit contributions
mine-2.594
of-2.527
al-2.479
talk-2.431
ted-2.367
TokenS
Token ID50
Feature activation-1.91e+00

Pos logit contributions
mine+3.207
al+2.871
x+2.720
talk+2.716
ted+2.704
Neg logit contributions
 glad-4.905
 lucky-4.299
 important-4.168
 pleased-4.015
 tired-3.952
Tokenara
Token ID3301
Feature activation-4.21e-01

Pos logit contributions
mine+2.444
al+2.059
talk+2.029
x+1.993
als+1.888
Neg logit contributions
 glad-5.615
 lucky-5.033
 important-4.899
 supposed-4.713
 pleased-4.702
Token is
Token ID318
Feature activation+4.41e+00

Pos logit contributions
mine+0.818
al+0.776
talk+0.737
als+0.724
ted+0.718
Neg logit contributions
 glad-0.951
 lucky-0.808
 important-0.778
 supposed-0.758
 pleased-0.754
Token not
Token ID407
Feature activation+2.63e+00

Pos logit contributions
 fault+7.759
 lifetime+7.538
 pleasure+7.433
 glad+7.355
 valuable+7.223
Neg logit contributions
Give-9.687
mine-9.271
al-8.915
 cash-8.401
ting-7.807
Token happy
Token ID3772
Feature activation+5.29e-01

Pos logit contributions
 glad+3.845
 important+3.068
 tempting+2.980
 lucky+2.911
 cozy+2.816
Neg logit contributions
mine-7.153
al-6.445
talk-6.126
of-5.944
x-5.787
Token.
Token ID13
Feature activation+3.54e-01

Pos logit contributions
 glad+1.279
 lucky+1.091
 important+1.044
 pleased+1.042
 supposed+1.021
Neg logit contributions
mine-0.942
al-0.892
of-0.833
ted-0.828
talk-0.827
Token She
Token ID1375
Feature activation+3.78e-01

Pos logit contributions
 glad+0.941
 lucky+0.833
 important+0.819
 supposed+0.804
 pleased+0.792
Neg logit contributions
mine-0.534
al-0.505
als-0.466
ted-0.456
talk-0.451
Token likes
Token ID7832
Feature activation+3.40e-01

Pos logit contributions
 glad+0.811
 lucky+0.675
 important+0.652
 pleased+0.641
 supposed+0.638
Neg logit contributions
mine-0.762
al-0.748
als-0.696
talk-0.693
ted-0.692
Token?"
Token ID1701
Feature activation-1.44e-01

Pos logit contributions
 glad+2.275
 lucky+1.843
 supposed+1.779
 important+1.738
 pleased+1.732
Neg logit contributions
mine-2.576
al-2.507
als-2.342
of-2.326
x-2.284
Token
Token ID198
Feature activation+6.20e-01

Pos logit contributions
mine+0.368
al+0.351
talk+0.342
of+0.337
ted+0.336
Neg logit contributions
 glad-0.236
 lucky-0.188
 important-0.179
 supposed-0.177
 pleased-0.170
Token
Token ID198
Feature activation-2.25e+00

Pos logit contributions
 glad+1.097
 supposed+0.894
 lucky+0.893
 important+0.858
 pleased+0.818
Neg logit contributions
mine-1.496
of-1.423
al-1.419
talk-1.390
ted-1.354
TokenL
Token ID43
Feature activation-1.56e+00

Pos logit contributions
mine+2.756
al+2.420
ted+2.202
als+2.201
talk+2.177
Neg logit contributions
 glad-7.068
 lucky-6.331
 important-6.137
 pleased-6.045
 tired-5.979
Tokenila
Token ID10102
Feature activation-1.31e+00

Pos logit contributions
mine+2.502
al+2.252
talk+2.210
als+2.162
x+2.141
Neg logit contributions
 glad-4.112
 lucky-3.596
 important-3.470
 supposed-3.419
 pleased-3.358
Token was
Token ID373
Feature activation+4.40e+00

Pos logit contributions
mine+2.546
al+2.370
talk+2.302
x+2.246
ted+2.220
Neg logit contributions
 glad-2.961
 lucky-2.528
 important-2.426
 pleased-2.373
 supposed-2.303
Token excited
Token ID6568
Feature activation-3.10e-01

Pos logit contributions
 fault+7.523
 glad+7.417
 pleasure+7.202
 valuable+7.195
 tempting+6.936
Neg logit contributions
mine-10.086
Give-9.604
al-9.512
 cash-8.808
er-8.158
Token.
Token ID13
Feature activation-3.14e-01

Pos logit contributions
mine+0.600
al+0.563
talk+0.540
als+0.527
of+0.527
Neg logit contributions
 glad-0.708
 lucky-0.602
 important-0.577
 supposed-0.564
 pleased-0.562
Token She
Token ID1375
Feature activation-2.92e-01

Pos logit contributions
mine+0.513
al+0.480
talk+0.449
als+0.447
ted+0.441
Neg logit contributions
 glad-0.805
 lucky-0.702
 important-0.682
 supposed-0.667
 pleased-0.661
Token said
Token ID531
Feature activation-3.98e-01

Pos logit contributions
mine+0.533
al+0.503
talk+0.476
als+0.469
ted+0.465
Neg logit contributions
 glad-0.695
 lucky-0.595
 important-0.574
 supposed-0.563
 pleased-0.555
Token,
Token ID11
Feature activation+9.66e-01

Pos logit contributions
mine+0.735
al+0.691
talk+0.654
als+0.646
ted+0.640
Neg logit contributions
 glad-0.935
 lucky-0.801
 important-0.773
 supposed-0.760
 pleased-0.749
Token.
Token ID13
Feature activation+6.41e-01

Pos logit contributions
 glad+1.401
 lucky+1.209
 important+1.174
 supposed+1.167
 pleased+1.117
Neg logit contributions
mine-1.068
al-0.991
talk-0.943
ted-0.912
of-0.910
Token
Token ID198
Feature activation+1.13e+00

Pos logit contributions
 glad+1.136
 lucky+0.936
 supposed+0.922
 important+0.909
 pleased+0.846
Neg logit contributions
mine-1.537
al-1.465
talk-1.410
ted-1.396
of-1.396
Token
Token ID198
Feature activation-2.38e+00

Pos logit contributions
 glad+2.080
 lucky+1.742
 supposed+1.724
 important+1.686
 pleased+1.625
Neg logit contributions
mine-2.611
of-2.478
al-2.434
talk-2.422
ted-2.324
TokenL
Token ID43
Feature activation-1.21e+00

Pos logit contributions
mine+3.992
al+3.737
als+3.594
ted+3.567
talk+3.418
Neg logit contributions
 glad-6.429
 lucky-5.552
 important-5.288
 pleased-5.246
 happy-5.101
Tokenily
Token ID813
Feature activation-4.69e-01

Pos logit contributions
mine+1.980
al+1.822
talk+1.757
als+1.727
ted+1.722
Neg logit contributions
 glad-3.131
 lucky-2.720
 important-2.645
 supposed-2.599
 pleased-2.543
Token is
Token ID318
Feature activation+4.40e+00

Pos logit contributions
mine+0.919
al+0.869
talk+0.828
als+0.813
ted+0.807
Neg logit contributions
 glad-1.054
 lucky-0.895
 important-0.862
 supposed-0.839
 pleased-0.834
Token curious
Token ID11040
Feature activation+5.30e-01

Pos logit contributions
 fault+7.696
 pleasure+7.342
 glad+7.334
 valuable+7.112
 tempting+7.029
Neg logit contributions
mine-9.765
al-9.648
Give-9.124
er-8.183
 cash-8.162
Token.
Token ID13
Feature activation-8.03e-02

Pos logit contributions
 glad+1.286
 lucky+1.101
 supposed+1.061
 important+1.049
 pleased+1.034
Neg logit contributions
mine-0.950
al-0.887
of-0.840
talk-0.840
ted-0.827
Token She
Token ID1375
Feature activation+3.72e-01

Pos logit contributions
mine+0.130
al+0.122
talk+0.113
als+0.112
of+0.111
Neg logit contributions
 glad-0.207
 lucky-0.181
 important-0.176
 supposed-0.173
 pleased-0.171
Token wants
Token ID3382
Feature activation-6.02e-01

Pos logit contributions
 glad+0.827
 lucky+0.690
 important+0.669
 supposed+0.661
 pleased+0.656
Neg logit contributions
mine-0.713
al-0.703
talk-0.663
als-0.656
ted-0.655
Token to
Token ID284
Feature activation-1.01e+00

Pos logit contributions
mine+1.350
al+1.283
talk+1.234
als+1.211
ted+1.205
Neg logit contributions
 glad-1.191
 lucky-0.986
 important-0.941
 supposed-0.910
 pleased-0.908
Token?"
Token ID1701
Feature activation+2.52e-01

Pos logit contributions
 glad+1.007
 lucky+0.822
 supposed+0.785
 important+0.781
 pleased+0.774
Neg logit contributions
mine-1.047
al-0.991
of-0.929
als-0.919
x-0.912
Token
Token ID198
Feature activation+1.05e+00

Pos logit contributions
 glad+0.434
 lucky+0.352
 important+0.343
 supposed+0.342
 pleased+0.321
Neg logit contributions
mine-0.624
al-0.592
talk-0.581
of-0.573
ted-0.569
Token
Token ID198
Feature activation-1.99e+00

Pos logit contributions
 glad+1.951
 supposed+1.638
 lucky+1.619
 important+1.563
 pleased+1.494
Neg logit contributions
mine-2.448
of-2.348
al-2.317
talk-2.284
ted-2.192
TokenS
Token ID50
Feature activation-2.24e+00

Pos logit contributions
mine+3.085
al+2.832
als+2.676
ted+2.664
x+2.621
Neg logit contributions
 glad-5.512
 lucky-4.839
 important-4.673
 pleased-4.582
 tired-4.452
Tokenara
Token ID3301
Feature activation-7.44e-01

Pos logit contributions
mine+2.950
talk+2.499
al+2.490
x+2.401
als+2.314
Neg logit contributions
 glad-6.480
 lucky-5.778
 important-5.597
 tired-5.426
 pleased-5.402
Token is
Token ID318
Feature activation+4.39e+00

Pos logit contributions
mine+1.488
al+1.397
talk+1.339
als+1.309
ted+1.302
Neg logit contributions
 glad-1.641
 lucky-1.390
 important-1.335
 supposed-1.293
 pleased-1.291
Token happy
Token ID3772
Feature activation+2.81e-01

Pos logit contributions
 fault+7.725
 pleasure+7.531
 valuable+7.528
 lifetime+7.424
 glad+7.390
Neg logit contributions
mine-9.469
Give-9.408
al-8.772
 cash-8.277
Tom-7.723
Token.
Token ID13
Feature activation+9.16e-02

Pos logit contributions
 glad+0.669
 lucky+0.576
 important+0.560
 supposed+0.553
 pleased+0.541
Neg logit contributions
mine-0.507
al-0.480
talk-0.452
of-0.448
ted-0.448
Token She
Token ID1375
Feature activation+4.79e-01

Pos logit contributions
 glad+0.241
 lucky+0.213
 important+0.208
 supposed+0.204
 pleased+0.201
Neg logit contributions
mine-0.141
al-0.133
als-0.124
ted-0.122
talk-0.122
Token gives
Token ID3607
Feature activation-2.22e+00

Pos logit contributions
 glad+1.122
 lucky+0.949
 important+0.929
 supposed+0.919
 pleased+0.906
Neg logit contributions
mine-0.858
al-0.841
als-0.790
talk-0.783
ted-0.782
Token Ben
Token ID3932
Feature activation-1.04e+00

Pos logit contributions
mine+3.955
talk+3.626
al+3.546
ede+3.463
x+3.421
Neg logit contributions
 glad-5.503
 lucky-4.811
 tired-4.485
 important-4.466
 hungry-4.328
Token!"
Token ID2474
Feature activation-1.35e+00

Pos logit contributions
 glad+2.179
 lucky+1.811
 supposed+1.810
 pleased+1.796
 important+1.791
Neg logit contributions
al-1.613
mine-1.604
als-1.472
of-1.453
ted-1.446
Token Mia
Token ID32189
Feature activation-1.26e+00

Pos logit contributions
mine+2.348
al+2.204
x+2.051
talk+2.049
als+2.011
Neg logit contributions
 glad-3.431
 lucky-2.970
 important-2.831
 pleased-2.767
 tired-2.726
Token says
Token ID1139
Feature activation-7.33e-01

Pos logit contributions
mine+2.560
al+2.403
talk+2.292
x+2.265
als+2.263
Neg logit contributions
 glad-2.756
 lucky-2.331
 important-2.231
 pleased-2.158
 supposed-2.141
Token.
Token ID13
Feature activation-8.02e-01

Pos logit contributions
mine+1.500
al+1.416
talk+1.355
als+1.328
ted+1.320
Neg logit contributions
 glad-1.593
 lucky-1.345
 important-1.290
 supposed-1.250
 pleased-1.247
Token She
Token ID1375
Feature activation-9.75e-02

Pos logit contributions
mine+1.890
al+1.795
talk+1.730
als+1.693
x+1.688
Neg logit contributions
 glad-1.505
 lucky-1.230
 important-1.168
 supposed-1.123
 pleased-1.121
Token follows
Token ID5679
Feature activation-4.72e+00

Pos logit contributions
mine+0.172
al+0.165
talk+0.154
als+0.153
ted+0.152
Neg logit contributions
 glad-0.235
 lucky-0.201
 important-0.196
 supposed-0.191
 pleased-0.190
Token Sam
Token ID3409
Feature activation-6.71e-01

Pos logit contributions
mine+9.224
al+9.022
x+8.793
als+8.745
talk+8.700
Neg logit contributions
 glad-11.416
 lucky-9.740
 tired-9.509
 hungry-9.470
 happy-9.401
Token to
Token ID284
Feature activation-1.72e+00

Pos logit contributions
mine+1.324
al+1.246
talk+1.192
als+1.166
ted+1.159
Neg logit contributions
 glad-1.500
 lucky-1.274
 important-1.224
 supposed-1.188
 pleased-1.184
Token the
Token ID262
Feature activation+2.69e-01

Pos logit contributions
mine+3.334
al+3.147
als+3.033
talk+3.012
ted+2.948
Neg logit contributions
 glad-4.085
 lucky-3.495
 important-3.255
 supposed-3.200
 pleased-3.171
Token cave
Token ID11527
Feature activation-3.84e-01

Pos logit contributions
 glad+0.637
 lucky+0.531
 supposed+0.525
 pleased+0.519
 important+0.519
Neg logit contributions
mine-0.489
al-0.471
talk-0.446
als-0.440
of-0.428
Token.
Token ID13
Feature activation-1.78e-01

Pos logit contributions
mine+0.696
al+0.651
talk+0.622
als+0.606
of+0.601
Neg logit contributions
 glad-0.915
 lucky-0.787
 important-0.757
 supposed-0.741
 pleased-0.735
Token!"
Token ID2474
Feature activation-2.13e+00

Pos logit contributions
mine+0.396
al+0.374
talk+0.356
of+0.349
als+0.347
Neg logit contributions
 glad-0.444
 lucky-0.375
 important-0.359
 pleased-0.351
 supposed-0.350
Token Lily
Token ID20037
Feature activation-1.29e+00

Pos logit contributions
mine+4.209
al+3.991
x+3.762
talk+3.718
als+3.679
Neg logit contributions
 glad-5.114
 lucky-4.454
 important-4.251
 pleased-4.083
 tired-4.075
Token says
Token ID1139
Feature activation-5.60e-01

Pos logit contributions
mine+2.638
al+2.484
talk+2.362
x+2.340
als+2.336
Neg logit contributions
 glad-2.811
 lucky-2.386
 important-2.296
 pleased-2.202
 supposed-2.189
Token.
Token ID13
Feature activation-8.03e-01

Pos logit contributions
mine+1.128
al+1.061
talk+1.017
als+0.994
ted+0.989
Neg logit contributions
 glad-1.231
 lucky-1.043
 important-1.002
 supposed-0.972
 pleased-0.968
Token She
Token ID1375
Feature activation+2.67e-01

Pos logit contributions
mine+1.556
al+1.463
talk+1.398
als+1.363
x+1.356
Neg logit contributions
 glad-1.838
 lucky-1.563
 important-1.501
 supposed-1.458
 pleased-1.455
Token follows
Token ID5679
Feature activation-4.65e+00

Pos logit contributions
 glad+0.658
 lucky+0.565
 important+0.553
 supposed+0.544
 pleased+0.540
Neg logit contributions
mine-0.450
al-0.433
ted-0.402
als-0.400
talk-0.399
Token him
Token ID683
Feature activation-9.65e-02

Pos logit contributions
mine+8.526
al+8.494
als+8.492
talk+8.107
x+7.754
Neg logit contributions
 glad-11.956
 lucky-10.314
 tired-10.106
 sad-9.947
 hungry-9.877
Token with
Token ID351
Feature activation-1.16e+00

Pos logit contributions
mine+0.186
al+0.173
talk+0.168
of+0.163
als+0.162
Neg logit contributions
 glad-0.220
 lucky-0.186
 important-0.180
 pleased-0.175
 supposed-0.174
Token her
Token ID607
Feature activation+8.10e-01

Pos logit contributions
mine+2.215
al+2.083
talk+1.999
als+1.947
x+1.944
Neg logit contributions
 glad-2.742
 lucky-2.349
 important-2.246
 pleased-2.170
 supposed-2.161
Token sled
Token ID46822
Feature activation-1.07e+00

Pos logit contributions
 glad+2.078
 lucky+1.785
 important+1.757
 supposed+1.756
 pleased+1.730
Neg logit contributions
mine-1.341
al-1.187
of-1.156
talk-1.149
ted-1.126
Token.
Token ID13
Feature activation+8.80e-02

Pos logit contributions
mine+2.204
al+2.095
talk+1.984
als+1.948
of+1.929
Neg logit contributions
 glad-2.297
 lucky-1.941
 important-1.875
 supposed-1.806
 pleased-1.785
Token of
Token ID286
Feature activation-1.66e+00

Pos logit contributions
mine+2.825
al+2.670
talk+2.574
ted+2.523
x+2.512
Neg logit contributions
 glad-2.822
 lucky-2.379
 important-2.305
 pleased-2.200
 supposed-2.196
Token the
Token ID262
Feature activation+2.70e-01

Pos logit contributions
mine+3.139
talk+2.962
al+2.946
als+2.800
of+2.765
Neg logit contributions
 glad-3.988
 lucky-3.404
 important-3.200
 pleased-3.114
 supposed-3.110
Token hill
Token ID12788
Feature activation-6.42e-01

Pos logit contributions
 glad+0.798
 lucky+0.698
 supposed+0.685
 important+0.684
 pleased+0.676
Neg logit contributions
mine-0.341
al-0.306
of-0.281
als-0.281
talk-0.278
Token.
Token ID13
Feature activation-1.65e-01

Pos logit contributions
mine+1.298
al+1.224
talk+1.172
als+1.147
ted+1.140
Neg logit contributions
 glad-1.401
 lucky-1.185
 important-1.138
 supposed-1.104
 pleased-1.100
Token Ben
Token ID3932
Feature activation-1.71e-01

Pos logit contributions
mine+0.324
al+0.305
talk+0.291
als+0.288
ted+0.287
Neg logit contributions
 glad-0.369
 lucky-0.316
 important-0.307
 supposed-0.298
 pleased-0.296
Token follows
Token ID5679
Feature activation-4.58e+00

Pos logit contributions
mine+0.302
al+0.286
talk+0.268
als+0.265
ted+0.264
Neg logit contributions
 glad-0.413
 lucky-0.355
 important-0.346
 supposed-0.338
 pleased-0.335
Token her
Token ID607
Feature activation-7.17e-02

Pos logit contributions
als+9.888
al+9.615
mine+9.581
x+9.436
ted+9.101
Neg logit contributions
 glad-10.822
 lucky-9.381
 happy-9.064
 sad-8.885
 tired-8.624
Token.
Token ID13
Feature activation+1.20e-01

Pos logit contributions
mine+0.136
al+0.126
talk+0.121
of+0.119
als+0.118
Neg logit contributions
 glad-0.167
 lucky-0.142
 important-0.138
 supposed-0.135
 pleased-0.134
Token They
Token ID1119
Feature activation-1.78e+00

Pos logit contributions
 glad+0.254
 lucky+0.216
 important+0.209
 supposed+0.206
 pleased+0.201
Neg logit contributions
mine-0.247
al-0.232
talk-0.226
ted-0.222
als-0.221
Token look
Token ID804
Feature activation+7.80e-01

Pos logit contributions
mine+3.108
al+3.005
talk+2.845
ted+2.836
als+2.766
Neg logit contributions
 glad-4.417
 lucky-3.807
 tired-3.571
 important-3.528
 pleased-3.510
Token down
Token ID866
Feature activation+1.28e-01

Pos logit contributions
 glad+1.684
 supposed+1.426
 lucky+1.410
 important+1.388
 pleased+1.335
Neg logit contributions
mine-1.507
al-1.405
ted-1.373
talk-1.336
of-1.335
Token to
Token ID284
Feature activation-2.17e+00

Pos logit contributions
mine+6.835
al+6.533
als+6.115
x+6.063
ede+6.055
Neg logit contributions
 glad-4.254
 lucky-3.485
 important-3.199
 tired-3.084
 supposed-3.062
Token the
Token ID262
Feature activation-7.00e-01

Pos logit contributions
mine+4.262
al+4.101
als+4.020
talk+4.005
ted+3.928
Neg logit contributions
 glad-5.110
 lucky-4.365
 happy-3.973
 important-3.961
 sad-3.954
Token box
Token ID3091
Feature activation-1.40e+00

Pos logit contributions
mine+1.133
al+1.050
talk+0.994
als+0.966
ted+0.960
Neg logit contributions
 glad-1.830
 lucky-1.595
 important-1.542
 supposed-1.503
 pleased-1.499
Token.
Token ID13
Feature activation-2.50e-01

Pos logit contributions
mine+2.550
al+2.412
talk+2.275
ted+2.253
als+2.248
Neg logit contributions
 glad-3.343
 lucky-2.853
 important-2.772
 pleased-2.693
 supposed-2.655
Token Jen
Token ID13364
Feature activation-1.12e+00

Pos logit contributions
mine+0.444
al+0.415
talk+0.397
als+0.389
ted+0.388
Neg logit contributions
 glad-0.604
 lucky-0.522
 important-0.505
 supposed-0.495
 pleased-0.489
Token follows
Token ID5679
Feature activation-4.56e+00

Pos logit contributions
mine+2.151
al+1.989
talk+1.920
als+1.858
x+1.855
Neg logit contributions
 glad-2.583
 lucky-2.207
 important-2.113
 supposed-2.050
 pleased-2.041
Token him
Token ID683
Feature activation-7.91e-01

Pos logit contributions
als+9.397
al+9.216
mine+9.043
ede+8.936
talk+8.831
Neg logit contributions
 glad-10.902
 lucky-9.342
 tired-8.915
 happy-8.858
 sad-8.839
Token.
Token ID13
Feature activation-6.37e-01

Pos logit contributions
mine+1.572
al+1.481
talk+1.413
als+1.386
ted+1.380
Neg logit contributions
 glad-1.763
 lucky-1.498
 important-1.437
 supposed-1.399
 pleased-1.388
Token
Token ID198
Feature activation+2.60e-01

Pos logit contributions
mine+1.282
al+1.208
talk+1.156
als+1.132
ted+1.126
Neg logit contributions
 glad-1.404
 lucky-1.190
 important-1.143
 supposed-1.109
 pleased-1.105
Token
Token ID198
Feature activation-3.39e+00

Pos logit contributions
 glad+0.442
 lucky+0.359
 important+0.342
 supposed+0.331
 pleased+0.327
Neg logit contributions
mine-0.649
al-0.606
talk-0.603
of-0.601
ted-0.579
TokenBut
Token ID1537
Feature activation-1.99e+00

Pos logit contributions
mine+6.720
al+6.541
x+6.190
als+6.155
ted+6.014
Neg logit contributions
 glad-8.456
 lucky-7.303
 important-7.018
 pleased-6.873
 happy-6.719
Token
Token ID198
Feature activation+1.02e+00

Pos logit contributions
 glad+0.092
 lucky+0.077
 important+0.073
 supposed+0.072
 pleased+0.070
Neg logit contributions
mine-0.113
al-0.106
talk-0.104
ted-0.102
of-0.101
Token
Token ID198
Feature activation-2.03e+00

Pos logit contributions
 glad+1.847
 lucky+1.545
 supposed+1.477
 important+1.475
 pleased+1.428
Neg logit contributions
mine-2.382
of-2.241
talk-2.214
al-2.205
ted-2.132
TokenSam
Token ID16305
Feature activation-7.71e-01

Pos logit contributions
mine+2.975
al+2.724
als+2.558
talk+2.431
ted+2.405
Neg logit contributions
 glad-5.966
 lucky-5.254
 important-5.133
 pleased-4.952
 supposed-4.849
Token tries
Token ID8404
Feature activation-4.18e-01

Pos logit contributions
mine+1.592
al+1.498
talk+1.436
als+1.407
ted+1.398
Neg logit contributions
 glad-1.655
 lucky-1.395
 important-1.338
 supposed-1.295
 pleased-1.289
Token to
Token ID284
Feature activation-3.30e+00

Pos logit contributions
mine+0.921
al+0.868
talk+0.843
of+0.821
ted+0.819
Neg logit contributions
 glad-0.841
 lucky-0.700
 important-0.666
 pleased-0.647
 supposed-0.645
Token scare
Token ID19437
Feature activation-4.56e+00

Pos logit contributions
als+6.682
mine+6.501
ted+6.464
al+6.463
talk+5.944
Neg logit contributions
 glad-7.632
 lucky-6.719
 happy-6.439
 sad-6.393
 important-6.353
Token things
Token ID1243
Feature activation-1.84e+00

Pos logit contributions
mine+10.239
talk+9.271
al+9.269
als+9.260
ede+8.902
Neg logit contributions
 glad-10.547
 lucky-9.151
 sad-8.849
 tired-8.523
 important-8.255
Token with
Token ID351
Feature activation-1.33e+00

Pos logit contributions
mine+3.706
al+3.529
talk+3.456
als+3.353
x+3.310
Neg logit contributions
 glad-4.091
 lucky-3.453
 important-3.313
 sad-3.209
 pleased-3.174
Token the
Token ID262
Feature activation+1.59e-01

Pos logit contributions
mine+2.291
al+2.139
talk+2.050
als+1.995
of+1.948
Neg logit contributions
 glad-3.347
 lucky-2.901
 important-2.810
 supposed-2.702
 pleased-2.685
Token lion
Token ID18744
Feature activation-4.80e-01

Pos logit contributions
 glad+0.427
 lucky+0.368
 pleased+0.355
 important+0.355
 supposed+0.352
Neg logit contributions
mine-0.241
al-0.226
talk-0.213
x-0.208
als-0.204
Token,
Token ID11
Feature activation-6.77e-01

Pos logit contributions
mine+0.915
al+0.862
talk+0.821
als+0.807
ted+0.800
Neg logit contributions
 glad-1.103
 lucky-0.939
 important-0.904
 supposed-0.880
 pleased-0.879
Token let
Token ID1309
Feature activation-1.44e+00

Pos logit contributions
mine+2.599
al+2.413
talk+2.353
ted+2.263
als+2.246
Neg logit contributions
 glad-3.018
 lucky-2.597
 important-2.476
 pleased-2.382
 supposed-2.330
Token's
Token ID338
Feature activation-8.66e-01

Pos logit contributions
mine+2.909
al+2.697
talk+2.646
x+2.565
als+2.552
Neg logit contributions
 glad-3.180
 lucky-2.768
 important-2.649
 supposed-2.530
 pleased-2.482
Token go
Token ID467
Feature activation-1.09e+00

Pos logit contributions
mine+1.287
al+1.190
talk+1.121
als+1.088
ted+1.086
Neg logit contributions
 glad-2.368
 lucky-2.079
 important-2.012
 supposed-1.964
 pleased-1.954
Token."
Token ID526
Feature activation-9.90e-01

Pos logit contributions
mine+2.377
al+2.253
talk+2.154
als+2.127
ted+2.116
Neg logit contributions
 glad-2.234
 lucky-1.869
 important-1.774
 supposed-1.722
 pleased-1.701
Token He
Token ID679
Feature activation-1.44e+00

Pos logit contributions
mine+2.468
al+2.360
talk+2.260
als+2.227
x+2.221
Neg logit contributions
 glad-1.741
 lucky-1.391
 important-1.307
 pleased-1.263
 supposed-1.244
Token followed
Token ID3940
Feature activation-4.52e+00

Pos logit contributions
mine+2.529
al+2.320
talk+2.205
x+2.148
als+2.130
Neg logit contributions
 glad-3.610
 lucky-3.137
 important-3.025
 pleased-2.957
 tired-2.884
Token Lily
Token ID20037
Feature activation-6.58e-01

Pos logit contributions
mine+10.307
al+9.791
als+9.523
talk+9.185
Give+9.011
Neg logit contributions
 glad-10.158
 lucky-8.757
 sad-8.471
 happy-8.312
 tired-8.302
Token up
Token ID510
Feature activation-1.41e+00

Pos logit contributions
mine+1.285
al+1.209
talk+1.156
als+1.131
ted+1.124
Neg logit contributions
 glad-1.482
 lucky-1.261
 important-1.213
 supposed-1.177
 pleased-1.173
Token the
Token ID262
Feature activation-2.32e-01

Pos logit contributions
mine+2.712
al+2.548
talk+2.434
als+2.412
x+2.389
Neg logit contributions
 glad-3.290
 lucky-2.797
 important-2.682
 pleased-2.631
 tired-2.572
Token ladder
Token ID18002
Feature activation-1.68e+00

Pos logit contributions
mine+0.271
al+0.242
talk+0.226
of+0.218
als+0.217
Neg logit contributions
 glad-0.708
 lucky-0.627
 important-0.613
 supposed-0.604
 pleased-0.599
Token and
Token ID290
Feature activation-1.93e+00

Pos logit contributions
mine+3.194
al+3.007
talk+2.822
als+2.800
ted+2.791
Neg logit contributions
 glad-3.932
 lucky-3.326
 important-3.225
 pleased-3.128
 tired-3.117
Token cave
Token ID11527
Feature activation+2.39e-02

Pos logit contributions
 glad+2.291
 supposed+1.822
 pleased+1.798
 lucky+1.726
 important+1.649
Neg logit contributions
al-2.789
mine-2.755
talk-2.702
als-2.652
ted-2.616
Token.
Token ID13
Feature activation+7.98e-01

Pos logit contributions
 glad+0.059
 lucky+0.051
 important+0.049
 supposed+0.049
 pleased+0.048
Neg logit contributions
mine-0.041
al-0.038
talk-0.036
als-0.035
ted-0.035
Token
Token ID198
Feature activation+5.12e-02

Pos logit contributions
 glad+1.761
 lucky+1.519
 important+1.483
 supposed+1.483
 pleased+1.421
Neg logit contributions
mine-1.521
al-1.448
ted-1.426
talk-1.402
als-1.392
Token
Token ID198
Feature activation-2.89e+00

Pos logit contributions
 glad+0.085
 lucky+0.069
 important+0.065
 supposed+0.064
 pleased+0.062
Neg logit contributions
mine-0.129
al-0.122
talk-0.121
of-0.119
ted-0.117
TokenSam
Token ID16305
Feature activation-8.34e-01

Pos logit contributions
mine+5.603
al+5.213
als+4.864
talk+4.753
ted+4.739
Neg logit contributions
 glad-7.460
 lucky-6.506
 important-6.309
 pleased-6.116
 happy-5.882
Token follows
Token ID5679
Feature activation-4.46e+00

Pos logit contributions
mine+1.589
al+1.483
talk+1.418
als+1.383
x+1.373
Neg logit contributions
 glad-1.926
 lucky-1.646
 important-1.582
 supposed-1.534
 pleased-1.531
Token him
Token ID683
Feature activation-5.84e-02

Pos logit contributions
mine+9.634
als+9.617
al+9.416
of+8.818
talk+8.688
Neg logit contributions
 glad-10.098
 lucky-8.675
 sad-8.193
 hungry-8.105
 important-8.099
Token.
Token ID13
Feature activation+9.32e-02

Pos logit contributions
mine+0.107
al+0.101
talk+0.097
of+0.094
als+0.094
Neg logit contributions
 glad-0.138
 lucky-0.117
 important-0.114
 pleased-0.111
 supposed-0.111
Token He
Token ID679
Feature activation+3.59e-01

Pos logit contributions
 glad+0.215
 lucky+0.185
 important+0.179
 supposed+0.176
 pleased+0.173
Neg logit contributions
mine-0.173
al-0.164
talk-0.158
ted-0.157
als-0.156
Token does
Token ID857
Feature activation+8.12e-01

Pos logit contributions
 glad+0.825
 lucky+0.699
 important+0.685
 supposed+0.675
 pleased+0.664
Neg logit contributions
mine-0.670
al-0.653
talk-0.617
als-0.611
ted-0.608
Token not
Token ID407
Feature activation-9.24e-01

Pos logit contributions
 glad+1.267
 lucky+0.982
 important+0.950
 supposed+0.935
 pleased+0.910
Neg logit contributions
mine-2.056
al-1.934
of-1.884
talk-1.874
ede-1.855
Token."
Token ID526
Feature activation-2.22e-01

Pos logit contributions
mine+1.082
al+1.023
talk+0.977
als+0.960
of+0.954
Neg logit contributions
 glad-1.085
 lucky-0.911
 important-0.873
 pleased-0.846
 supposed-0.845
Token
Token ID198
Feature activation+8.28e-01

Pos logit contributions
mine+0.563
al+0.535
talk+0.525
of+0.513
als+0.510
Neg logit contributions
 glad-0.370
 lucky-0.297
 important-0.283
 supposed-0.275
 pleased-0.268
Token
Token ID198
Feature activation-1.65e+00

Pos logit contributions
 glad+1.476
 lucky+1.223
 supposed+1.206
 important+1.189
 pleased+1.122
Neg logit contributions
mine-1.946
of-1.865
talk-1.837
al-1.833
ted-1.761
TokenM
Token ID44
Feature activation-1.30e+00

Pos logit contributions
mine+2.556
al+2.359
als+2.181
ted+2.150
talk+2.119
Neg logit contributions
 glad-4.606
 lucky-4.021
 important-3.889
 pleased-3.802
 supposed-3.709
Tokenia
Token ID544
Feature activation-1.81e+00

Pos logit contributions
mine+2.771
al+2.598
talk+2.471
ted+2.449
als+2.446
Neg logit contributions
 glad-2.777
 lucky-2.348
 important-2.262
 pleased-2.163
 supposed-2.152
Token chased
Token ID26172
Feature activation-4.45e+00

Pos logit contributions
mine+3.765
al+3.492
talk+3.387
x+3.378
als+3.290
Neg logit contributions
 glad-3.852
 lucky-3.264
 important-3.071
 sad-3.043
 pleased-3.022
Token after
Token ID706
Feature activation-3.05e+00

Pos logit contributions
mine+9.414
al+8.819
ede+8.474
x+8.417
als+8.339
Neg logit contributions
 glad-10.728
 lucky-9.443
 tired-8.888
 happy-8.881
 hungry-8.764
Token him
Token ID683
Feature activation-9.59e-01

Pos logit contributions
talk+5.526
mine+5.502
al+5.325
x+5.111
als+4.990
Neg logit contributions
 glad-7.810
 lucky-6.684
 sad-6.258
 tired-6.254
 happy-6.252
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
mine+1.948
al+1.846
talk+1.763
als+1.733
ted+1.717
Neg logit contributions
 glad-2.099
 lucky-1.777
 important-1.705
 supposed-1.653
 pleased-1.641
Token She
Token ID1375
Feature activation-1.38e+00

Pos logit contributions
mine+1.870
al+1.748
talk+1.658
x+1.613
of+1.608
Neg logit contributions
 glad-2.562
 lucky-2.194
 important-2.109
 supposed-2.057
 pleased-2.047
Token wanted
Token ID2227
Feature activation-1.54e+00

Pos logit contributions
mine+2.696
al+2.505
talk+2.403
als+2.336
x+2.334
Neg logit contributions
 glad-3.151
 lucky-2.702
 important-2.577
 pleased-2.503
 supposed-2.470
Token knees
Token ID14475
Feature activation+6.98e-01

Pos logit contributions
mine+1.573
al+1.459
talk+1.374
als+1.340
ted+1.339
Neg logit contributions
 glad-2.423
 lucky-2.110
 important-2.037
 supposed-1.975
 pleased-1.970
Token."
Token ID526
Feature activation-8.87e-01

Pos logit contributions
 glad+1.679
 lucky+1.446
 supposed+1.399
 important+1.387
 pleased+1.382
Neg logit contributions
mine-1.256
al-1.155
talk-1.086
of-1.086
als-1.085
Token
Token ID198
Feature activation+6.13e-01

Pos logit contributions
mine+2.423
al+2.320
talk+2.240
als+2.207
x+2.198
Neg logit contributions
 glad-1.348
 lucky-1.041
 important-0.968
 pleased-0.919
 supposed-0.912
Token
Token ID198
Feature activation-2.94e+00

Pos logit contributions
 glad+1.073
 lucky+0.883
 supposed+0.857
 important+0.851
 pleased+0.809
Neg logit contributions
mine-1.479
of-1.395
al-1.394
talk-1.371
ted-1.337
TokenSam
Token ID16305
Feature activation-1.39e+00

Pos logit contributions
mine+5.094
al+4.655
als+4.333
ted+4.280
talk+4.110
Neg logit contributions
 glad-8.203
 lucky-7.092
 important-6.853
 pleased-6.748
 happy-6.593
Token followed
Token ID3940
Feature activation-4.44e+00

Pos logit contributions
mine+2.623
al+2.425
talk+2.328
x+2.273
als+2.242
Neg logit contributions
 glad-3.262
 lucky-2.802
 important-2.684
 pleased-2.621
 supposed-2.572
Token them
Token ID606
Feature activation-5.11e-01

Pos logit contributions
mine+9.464
al+8.856
als+8.766
ede+8.413
talk+8.294
Neg logit contributions
 glad-10.415
 lucky-9.086
 important-8.677
 sad-8.582
 tired-8.496
Token into
Token ID656
Feature activation-1.18e+00

Pos logit contributions
mine+0.931
al+0.873
talk+0.832
als+0.814
ted+0.809
Neg logit contributions
 glad-1.216
 lucky-1.042
 important-1.006
 supposed-0.979
 pleased-0.978
Token the
Token ID262
Feature activation-2.28e-01

Pos logit contributions
mine+2.146
al+1.999
talk+1.922
als+1.884
x+1.850
Neg logit contributions
 glad-2.890
 lucky-2.490
 important-2.388
 pleased-2.321
 supposed-2.300
Token tunnel
Token ID13275
Feature activation-6.24e-01

Pos logit contributions
mine+0.467
al+0.447
talk+0.428
als+0.419
of+0.416
Neg logit contributions
 glad-0.488
 lucky-0.407
 important-0.392
 supposed-0.388
 pleased-0.384
Token.
Token ID13
Feature activation-3.77e-01

Pos logit contributions
mine+1.171
al+1.099
talk+1.049
als+1.025
ted+1.019
Neg logit contributions
 glad-1.452
 lucky-1.242
 important-1.196
 supposed-1.164
 pleased-1.159
Token."
Token ID526
Feature activation+5.88e-01

Pos logit contributions
 glad+0.227
 lucky+0.191
 important+0.184
 pleased+0.181
 supposed+0.178
Neg logit contributions
mine-0.219
al-0.211
talk-0.200
als-0.198
ted-0.195
Token
Token ID198
Feature activation+1.33e+00

Pos logit contributions
 glad+1.031
 lucky+0.858
 supposed+0.852
 important+0.835
 pleased+0.771
Neg logit contributions
mine-1.400
al-1.327
talk-1.307
of-1.297
ted-1.279
Token
Token ID198
Feature activation-2.05e+00

Pos logit contributions
 glad+2.489
 supposed+2.105
 lucky+2.104
 important+2.008
 pleased+1.937
Neg logit contributions
mine-2.989
of-2.900
al-2.815
talk-2.794
ted-2.701
TokenL
Token ID43
Feature activation-2.12e+00

Pos logit contributions
mine+3.268
al+2.920
als+2.732
ted+2.717
talk+2.681
Neg logit contributions
 glad-5.737
 lucky-5.024
 important-4.930
 pleased-4.782
 happy-4.628
Tokenily
Token ID813
Feature activation-1.05e+00

Pos logit contributions
mine+3.841
talk+3.522
al+3.472
als+3.439
ted+3.364
Neg logit contributions
 glad-5.211
 lucky-4.485
 important-4.371
 supposed-4.354
 pleased-4.133
Token followed
Token ID3940
Feature activation-4.41e+00

Pos logit contributions
mine+1.927
al+1.787
talk+1.718
x+1.665
als+1.660
Neg logit contributions
 glad-2.500
 lucky-2.147
 important-2.054
 pleased-2.013
 supposed-1.984
Token Sam
Token ID3409
Feature activation-1.78e-01

Pos logit contributions
mine+8.750
al+8.238
als+8.075
talk+7.957
x+7.470
Neg logit contributions
 glad-11.091
 lucky-9.614
 tired-9.276
 sad-9.140
 hungry-8.947
Token.
Token ID13
Feature activation-5.03e-01

Pos logit contributions
mine+0.334
al+0.315
talk+0.302
of+0.295
als+0.294
Neg logit contributions
 glad-0.414
 lucky-0.353
 important-0.342
 supposed-0.333
 pleased-0.332
Token She
Token ID1375
Feature activation-2.20e-01

Pos logit contributions
mine+0.862
al+0.805
talk+0.763
als+0.748
ted+0.743
Neg logit contributions
 glad-1.254
 lucky-1.088
 important-1.053
 supposed-1.026
 pleased-1.021
Token was
Token ID373
Feature activation+2.92e+00

Pos logit contributions
mine+0.422
al+0.402
talk+0.380
als+0.375
ted+0.374
Neg logit contributions
 glad-0.499
 lucky-0.425
 important-0.412
 supposed-0.403
 pleased-0.396
Token happy
Token ID3772
Feature activation+4.63e-01

Pos logit contributions
 glad+5.217
 fault+4.743
 important+4.686
 valuable+4.631
 tempting+4.543
Neg logit contributions
mine-5.989
al-5.910
ting-5.382
Give-5.302
ted-5.262
Token!"
Token ID2474
Feature activation-1.97e+00

Pos logit contributions
 glad+1.991
 lucky+1.691
 important+1.639
 supposed+1.624
 pleased+1.610
Neg logit contributions
mine-1.529
al-1.403
of-1.350
talk-1.345
als-1.328
Token Sam
Token ID3409
Feature activation-9.01e-01

Pos logit contributions
mine+3.524
al+3.355
x+3.126
talk+3.045
als+3.034
Neg logit contributions
 glad-5.059
 lucky-4.398
 important-4.204
 tired-4.124
 pleased-4.116
Token says
Token ID1139
Feature activation-3.61e-01

Pos logit contributions
mine+1.755
al+1.652
talk+1.578
als+1.542
x+1.538
Neg logit contributions
 glad-2.043
 lucky-1.740
 important-1.671
 pleased-1.619
 supposed-1.617
Token.
Token ID13
Feature activation-3.44e-02

Pos logit contributions
mine+0.714
al+0.667
talk+0.642
als+0.624
of+0.622
Neg logit contributions
 glad-0.804
 lucky-0.683
 important-0.657
 supposed-0.640
 pleased-0.634
Token He
Token ID679
Feature activation+1.32e-01

Pos logit contributions
mine+0.076
al+0.071
talk+0.069
ted+0.068
als+0.068
Neg logit contributions
 glad-0.068
 lucky-0.057
 important-0.055
 supposed-0.054
 pleased-0.053
Token follows
Token ID5679
Feature activation-4.36e+00

Pos logit contributions
 glad+0.316
 lucky+0.271
 important+0.263
 supposed+0.257
 pleased+0.256
Neg logit contributions
mine-0.234
al-0.221
talk-0.208
als-0.207
of-0.207
Token Tom
Token ID4186
Feature activation-2.17e-01

Pos logit contributions
als+9.334
al+9.262
mine+9.247
x+8.577
ede+8.461
Neg logit contributions
 glad-9.866
 lucky-8.416
 sad-8.171
 happy-7.986
 tired-7.982
Token and
Token ID290
Feature activation-1.22e+00

Pos logit contributions
mine+0.396
al+0.369
talk+0.354
of+0.345
als+0.344
Neg logit contributions
 glad-0.514
 lucky-0.441
 important-0.427
 pleased-0.414
 supposed-0.413
Token slides
Token ID19392
Feature activation-1.22e+00

Pos logit contributions
mine+2.284
al+2.117
talk+2.029
x+1.971
als+1.961
Neg logit contributions
 glad-2.922
 lucky-2.505
 important-2.382
 pleased-2.328
 supposed-2.309
Token down
Token ID866
Feature activation-1.05e+00

Pos logit contributions
mine+2.464
al+2.355
talk+2.238
als+2.178
x+2.174
Neg logit contributions
 glad-2.691
 lucky-2.275
 important-2.175
 pleased-2.108
 supposed-2.107
Token.
Token ID13
Feature activation+5.22e-01

Pos logit contributions
mine+2.246
al+2.129
talk+2.044
als+2.008
x+2.000
Neg logit contributions
 glad-2.193
 lucky-1.835
 important-1.758
 pleased-1.694
 supposed-1.690
Token wrong
Token ID2642
Feature activation-3.53e-01

Pos logit contributions
 glad+1.738
 pleased+1.304
 lucky+1.294
 important+1.229
 valuable+1.173
Neg logit contributions
mine-3.176
al-2.976
talk-2.854
x-2.819
ted-2.806
Token.
Token ID13
Feature activation-8.73e-01

Pos logit contributions
mine+0.676
al+0.637
talk+0.607
als+0.592
ted+0.589
Neg logit contributions
 glad-0.808
 lucky-0.687
 important-0.658
 pleased-0.643
 supposed-0.643
Token They
Token ID1119
Feature activation-9.80e-01

Pos logit contributions
mine+1.638
al+1.529
talk+1.452
of+1.420
als+1.419
Neg logit contributions
 glad-2.067
 lucky-1.769
 important-1.704
 supposed-1.651
 pleased-1.644
Token tried
Token ID3088
Feature activation-3.69e-01

Pos logit contributions
mine+2.092
al+1.983
talk+1.904
als+1.872
x+1.853
Neg logit contributions
 glad-2.053
 lucky-1.717
 important-1.630
 pleased-1.583
 supposed-1.574
Token to
Token ID284
Feature activation-3.31e+00

Pos logit contributions
mine+0.810
al+0.763
talk+0.741
ted+0.721
of+0.721
Neg logit contributions
 glad-0.743
 lucky-0.620
 important-0.592
 pleased-0.573
 supposed-0.572
Token comfort
Token ID4467
Feature activation-4.36e+00

Pos logit contributions
als+7.629
mine+7.475
ted+7.421
al+7.375
x+6.882
Neg logit contributions
 glad-7.048
 lucky-6.078
 happy-5.768
 important-5.619
 sad-5.600
Token them
Token ID606
Feature activation-1.49e+00

Pos logit contributions
ede+9.038
mine+9.012
 cash+8.407
al+7.926
talk+7.859
Neg logit contributions
 glad-10.377
 lucky-9.040
 important-8.529
 sad-8.146
 happy-8.041
Token and
Token ID290
Feature activation-1.40e+00

Pos logit contributions
mine+3.055
al+2.886
talk+2.800
als+2.726
ted+2.688
Neg logit contributions
 glad-3.295
 lucky-2.805
 important-2.706
 supposed-2.553
 pleased-2.551
Token explain
Token ID4727
Feature activation-7.59e-01

Pos logit contributions
mine+2.516
al+2.346
talk+2.190
als+2.187
x+2.164
Neg logit contributions
 glad-3.474
 lucky-2.988
 important-2.889
 pleased-2.805
 supposed-2.770
Token that
Token ID326
Feature activation-6.25e-01

Pos logit contributions
mine+1.576
al+1.485
talk+1.424
als+1.394
ted+1.384
Neg logit contributions
 glad-1.631
 lucky-1.375
 important-1.323
 supposed-1.276
 pleased-1.271
Token sometimes
Token ID3360
Feature activation+6.70e-01

Pos logit contributions
mine+0.979
al+0.906
talk+0.855
als+0.832
ted+0.826
Neg logit contributions
 glad-1.650
 lucky-1.439
 important-1.393
 supposed-1.361
 pleased-1.357
Token cries
Token ID24691
Feature activation-5.19e-01

Pos logit contributions
 glad+0.062
 lucky+0.053
 important+0.051
 supposed+0.050
 pleased+0.049
Neg logit contributions
mine-0.054
al-0.051
talk-0.049
als-0.048
ted-0.048
Token.
Token ID13
Feature activation+3.60e-02

Pos logit contributions
mine+1.042
al+0.984
talk+0.940
als+0.924
ted+0.918
Neg logit contributions
 glad-1.140
 lucky-0.964
 important-0.926
 supposed-0.902
 pleased-0.898
Token
Token ID198
Feature activation+9.26e-01

Pos logit contributions
 glad+0.081
 lucky+0.068
 important+0.066
 supposed+0.066
 pleased+0.064
Neg logit contributions
mine-0.070
al-0.067
talk-0.064
ted-0.063
als-0.063
Token
Token ID198
Feature activation-2.13e+00

Pos logit contributions
 glad+1.689
 lucky+1.407
 important+1.355
 supposed+1.346
 pleased+1.304
Neg logit contributions
mine-2.172
of-2.059
talk-2.029
al-2.019
ted-1.942
TokenMom
Token ID29252
Feature activation-6.78e-01

Pos logit contributions
mine+3.305
al+2.955
als+2.731
talk+2.722
x+2.685
Neg logit contributions
 glad-6.064
 lucky-5.372
 important-5.223
 pleased-5.027
 tired-4.959
Token hugs
Token ID40657
Feature activation-4.33e+00

Pos logit contributions
mine+1.231
al+1.152
talk+1.097
als+1.071
ted+1.064
Neg logit contributions
 glad-1.626
 lucky-1.397
 important-1.347
 supposed-1.311
 pleased-1.307
Token Sam
Token ID3409
Feature activation+4.96e-01

Pos logit contributions
mine+8.672
talk+8.433
al+8.022
ede+7.924
 cash+7.783
Neg logit contributions
 glad-10.662
 lucky-8.896
 tired-8.668
 sad-8.437
 happy-8.351
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
 glad+1.318
 lucky+1.156
 important+1.119
 supposed+1.117
 pleased+1.102
Neg logit contributions
mine-0.764
al-0.709
of-0.657
talk-0.651
ted-0.646
Token tells
Token ID4952
Feature activation-2.98e+00

Pos logit contributions
mine+1.865
al+1.723
talk+1.641
als+1.590
x+1.586
Neg logit contributions
 glad-2.553
 lucky-2.201
 important-2.114
 supposed-2.049
 pleased-2.042
Token him
Token ID683
Feature activation+8.36e-01

Pos logit contributions
mine+4.151
al+3.844
talk+3.755
x+3.575
als+3.545
Neg logit contributions
 glad-8.858
 lucky-7.857
 important-7.624
 sad-7.493
 tired-7.418
Token it
Token ID340
Feature activation+1.34e+00

Pos logit contributions
 glad+1.848
 lucky+1.551
 supposed+1.508
 important+1.499
 pleased+1.494
Neg logit contributions
mine-1.570
al-1.477
of-1.433
als-1.425
talk-1.415
Token house
Token ID2156
Feature activation-8.19e-01

Pos logit contributions
 glad+2.192
 lucky+1.824
 pleased+1.817
 supposed+1.765
 important+1.747
Neg logit contributions
mine-1.583
al-1.490
talk-1.402
als-1.392
x-1.364
Token.
Token ID13
Feature activation+9.16e-01

Pos logit contributions
mine+1.670
al+1.578
talk+1.511
als+1.480
ted+1.474
Neg logit contributions
 glad-1.770
 lucky-1.496
 important-1.438
 supposed-1.391
 pleased-1.385
Token
Token ID198
Feature activation+1.24e+00

Pos logit contributions
 glad+2.287
 supposed+2.012
 lucky+2.009
 important+1.949
 pleased+1.922
Neg logit contributions
mine-1.455
talk-1.358
al-1.353
ted-1.305
als-1.276
Token
Token ID198
Feature activation-1.93e+00

Pos logit contributions
 glad+2.259
 lucky+1.907
 supposed+1.907
 important+1.839
 pleased+1.792
Neg logit contributions
mine-2.795
talk-2.654
of-2.645
al-2.611
ted-2.540
TokenMom
Token ID29252
Feature activation-9.67e-01

Pos logit contributions
mine+3.312
al+3.035
als+2.874
ted+2.776
x+2.754
Neg logit contributions
 glad-5.287
 lucky-4.575
 important-4.420
 pleased-4.277
 supposed-4.119
Token hugs
Token ID40657
Feature activation-4.32e+00

Pos logit contributions
mine+1.969
al+1.849
talk+1.768
als+1.737
x+1.724
Neg logit contributions
 glad-2.122
 lucky-1.787
 important-1.708
 supposed-1.658
 pleased-1.653
Token them
Token ID606
Feature activation-1.49e+00

Pos logit contributions
ede+8.303
al+8.168
mine+8.061
als+7.999
talk+7.952
Neg logit contributions
 glad-11.136
 lucky-9.159
 sad-8.880
 happy-8.758
 important-8.720
Token and
Token ID290
Feature activation-8.65e-01

Pos logit contributions
mine+2.855
al+2.714
talk+2.604
x+2.543
als+2.516
Neg logit contributions
 glad-3.497
 lucky-2.962
 important-2.816
 tired-2.745
 pleased-2.743
Token says
Token ID1139
Feature activation-7.31e-02

Pos logit contributions
mine+1.715
al+1.605
talk+1.534
als+1.500
x+1.491
Neg logit contributions
 glad-1.964
 lucky-1.668
 important-1.600
 supposed-1.547
 pleased-1.546
Token she
Token ID673
Feature activation+1.02e+00

Pos logit contributions
mine+0.108
al+0.099
ted+0.091
talk+0.091
als+0.090
Neg logit contributions
 glad-0.198
 lucky-0.174
 important-0.169
 supposed-0.168
 pleased-0.166
Token is
Token ID318
Feature activation+2.60e+00

Pos logit contributions
 glad+1.874
 lucky+1.506
 pleased+1.496
 important+1.429
 supposed+1.386
Neg logit contributions
mine-2.346
al-2.304
talk-2.210
ted-2.207
x-2.163
Token She
Token ID1375
Feature activation+1.73e-01

Pos logit contributions
 glad+0.239
 lucky+0.208
 important+0.203
 supposed+0.197
 pleased+0.195
Neg logit contributions
mine-0.174
al-0.163
ted-0.156
talk-0.155
als-0.155
Token looked
Token ID3114
Feature activation+1.56e+00

Pos logit contributions
 glad+0.395
 lucky+0.336
 important+0.330
 supposed+0.323
 pleased+0.315
Neg logit contributions
mine-0.325
al-0.313
talk-0.294
als-0.292
ted-0.291
Token anxious
Token ID18116
Feature activation-1.77e-01

Pos logit contributions
 glad+3.609
 supposed+3.173
 important+3.083
 lucky+3.061
 fault+3.004
Neg logit contributions
mine-2.711
al-2.555
ted-2.551
als-2.421
talk-2.382
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
mine+0.341
al+0.320
talk+0.306
als+0.301
ted+0.300
Neg logit contributions
 glad-0.405
 lucky-0.344
 important-0.329
 pleased-0.323
 supposed-0.321
Token She
Token ID1375
Feature activation+2.68e-01

Pos logit contributions
 glad+0.271
 lucky+0.226
 important+0.220
 supposed+0.214
 pleased+0.210
Neg logit contributions
mine-0.308
al-0.293
talk-0.281
ted-0.281
als-0.281
Token hugged
Token ID41130
Feature activation-4.29e+00

Pos logit contributions
 glad+0.619
 lucky+0.526
 important+0.512
 supposed+0.506
 pleased+0.496
Neg logit contributions
mine-0.492
al-0.472
talk-0.445
of-0.441
ted-0.440
Token them
Token ID606
Feature activation-1.71e+00

Pos logit contributions
talk+8.388
ede+8.320
mine+8.320
al+8.236
als+7.869
Neg logit contributions
 glad-10.886
 lucky-9.216
 sad-8.734
 happy-8.628
 tired-8.531
Token and
Token ID290
Feature activation-1.30e+00

Pos logit contributions
mine+3.376
al+3.260
talk+3.090
als+3.013
ted+2.993
Neg logit contributions
 glad-3.871
 lucky-3.291
 important-3.162
 tired-3.048
 supposed-3.004
Token took
Token ID1718
Feature activation-3.20e+00

Pos logit contributions
mine+2.716
al+2.548
talk+2.431
als+2.387
x+2.385
Neg logit contributions
 glad-2.846
 lucky-2.401
 important-2.292
 pleased-2.220
 supposed-2.187
Token them
Token ID606
Feature activation-9.49e-01

Pos logit contributions
mine+7.518
al+6.848
ede+6.827
x+6.605
of+6.529
Neg logit contributions
 glad-6.577
 lucky-5.865
 important-5.526
 supposed-5.138
 tired-5.088
Token to
Token ID284
Feature activation-2.44e-01

Pos logit contributions
mine+2.098
al+1.996
talk+1.894
als+1.876
ted+1.860
Neg logit contributions
 glad-1.904
 lucky-1.593
 important-1.523
 supposed-1.476
 pleased-1.446
Token!"
Token ID2474
Feature activation-2.70e+00

Pos logit contributions
 glad+1.018
 lucky+0.838
 important+0.789
 pleased+0.784
 supposed+0.763
Neg logit contributions
mine-1.110
al-1.042
talk-1.034
of-1.000
als-0.975
Token Sam
Token ID3409
Feature activation-1.06e+00

Pos logit contributions
mine+5.030
al+4.816
x+4.473
talk+4.342
als+4.313
Neg logit contributions
 glad-6.963
 lucky-6.107
 important-5.832
 tired-5.770
 sad-5.759
Token says
Token ID1139
Feature activation-6.67e-01

Pos logit contributions
mine+2.100
al+1.979
talk+1.886
als+1.848
x+1.847
Neg logit contributions
 glad-2.376
 lucky-2.019
 important-1.942
 pleased-1.876
 supposed-1.872
Token.
Token ID13
Feature activation-8.79e-02

Pos logit contributions
mine+1.355
al+1.278
talk+1.224
als+1.198
ted+1.191
Neg logit contributions
 glad-1.454
 lucky-1.229
 important-1.180
 supposed-1.145
 pleased-1.141
Token He
Token ID679
Feature activation+4.06e-02

Pos logit contributions
mine+0.180
al+0.169
talk+0.163
ted+0.160
als+0.159
Neg logit contributions
 glad-0.189
 lucky-0.160
 important-0.154
 supposed-0.152
 pleased-0.148
Token follows
Token ID5679
Feature activation-4.29e+00

Pos logit contributions
 glad+0.098
 lucky+0.084
 important+0.082
 supposed+0.080
 pleased+0.080
Neg logit contributions
mine-0.072
al-0.068
talk-0.064
als-0.063
ted-0.063
Token Tim
Token ID5045
Feature activation-5.04e-01

Pos logit contributions
als+9.283
mine+9.266
al+9.191
of+8.503
talk+8.474
Neg logit contributions
 glad-9.662
 lucky-8.228
 sad-8.061
 happy-7.837
 important-7.832
Token and
Token ID290
Feature activation-1.21e+00

Pos logit contributions
mine+0.924
al+0.863
talk+0.824
als+0.803
of+0.800
Neg logit contributions
 glad-1.197
 lucky-1.028
 important-0.992
 supposed-0.962
 pleased-0.962
Token slides
Token ID19392
Feature activation-1.04e+00

Pos logit contributions
mine+2.284
al+2.114
talk+2.028
x+1.970
als+1.962
Neg logit contributions
 glad-2.867
 lucky-2.456
 important-2.334
 pleased-2.276
 supposed-2.265
Token down
Token ID866
Feature activation-9.30e-01

Pos logit contributions
mine+2.044
al+1.943
talk+1.851
als+1.801
x+1.793
Neg logit contributions
 glad-2.315
 lucky-1.965
 important-1.884
 supposed-1.827
 pleased-1.825
Token.
Token ID13
Feature activation+6.78e-01

Pos logit contributions
mine+1.960
al+1.855
talk+1.780
als+1.747
x+1.736
Neg logit contributions
 glad-1.967
 lucky-1.651
 important-1.583
 pleased-1.527
 supposed-1.527
Token again
Token ID757
Feature activation+5.74e-01

Pos logit contributions
mine+0.011
al+0.010
talk+0.010
als+0.009
of+0.009
Neg logit contributions
 glad-0.013
 lucky-0.011
 important-0.010
 supposed-0.010
 pleased-0.010
Token."
Token ID526
Feature activation-9.74e-02

Pos logit contributions
 glad+1.388
 lucky+1.182
 supposed+1.134
 pleased+1.124
 important+1.123
Neg logit contributions
mine-1.020
al-0.977
talk-0.907
of-0.901
als-0.894
Token
Token ID198
Feature activation+1.45e+00

Pos logit contributions
mine+0.249
al+0.236
talk+0.231
of+0.226
ted+0.226
Neg logit contributions
 glad-0.158
 lucky-0.128
 important-0.123
 supposed-0.119
 pleased-0.115
Token
Token ID198
Feature activation-1.83e+00

Pos logit contributions
 glad+2.675
 lucky+2.278
 supposed+2.244
 important+2.204
 pleased+2.121
Neg logit contributions
mine-3.284
of-3.148
al-3.062
talk-3.037
ted-2.928
TokenMom
Token ID29252
Feature activation-9.86e-01

Pos logit contributions
mine+2.517
al+2.322
als+2.187
ted+2.114
talk+2.072
Neg logit contributions
 glad-5.460
 lucky-4.788
 important-4.661
 pleased-4.543
 supposed-4.460
Token hugs
Token ID40657
Feature activation-4.29e+00

Pos logit contributions
mine+2.145
al+2.014
talk+1.944
als+1.903
x+1.896
Neg logit contributions
 glad-2.021
 lucky-1.678
 important-1.596
 pleased-1.549
 supposed-1.549
Token them
Token ID606
Feature activation-1.41e+00

Pos logit contributions
talk+8.408
mine+8.386
ede+8.372
al+8.210
als+8.069
Neg logit contributions
 glad-10.440
 lucky-8.666
 sad-8.150
 important-8.147
 happy-7.984
Token.
Token ID13
Feature activation+2.34e-01

Pos logit contributions
mine+2.694
al+2.534
talk+2.451
x+2.385
als+2.345
Neg logit contributions
 glad-3.302
 lucky-2.812
 important-2.678
 pleased-2.606
 tired-2.595
Token "
Token ID366
Feature activation-1.96e+00

Pos logit contributions
 glad+0.532
 lucky+0.458
 important+0.442
 supposed+0.438
 pleased+0.433
Neg logit contributions
mine-0.435
al-0.412
ted-0.395
als-0.394
talk-0.394
TokenThat
Token ID2504
Feature activation-4.19e-01

Pos logit contributions
mine+3.474
al+3.190
talk+2.986
als+2.955
x+2.955
Neg logit contributions
 glad-5.129
 lucky-4.407
 important-4.262
 pleased-4.100
 supposed-4.009
Token's
Token ID338
Feature activation+7.45e-01

Pos logit contributions
mine+0.815
al+0.773
talk+0.734
als+0.722
ted+0.715
Neg logit contributions
 glad-0.958
 lucky-0.811
 important-0.774
 pleased-0.761
 supposed-0.758
Token
Token ID198
Feature activation-2.93e+00

Pos logit contributions
 glad+0.440
 lucky+0.359
 supposed+0.345
 important+0.344
 pleased+0.323
Neg logit contributions
mine-0.654
al-0.619
of-0.614
talk-0.607
ted-0.592
TokenSam
Token ID16305
Feature activation-2.26e+00

Pos logit contributions
mine+5.253
al+4.879
als+4.549
ted+4.539
x+4.329
Neg logit contributions
 glad-8.052
 lucky-6.898
 important-6.701
 pleased-6.671
 happy-6.504
Token smiled
Token ID13541
Feature activation+8.50e-01

Pos logit contributions
mine+4.625
al+4.211
talk+4.063
x+4.016
als+3.916
Neg logit contributions
 glad-4.962
 lucky-4.238
 important-4.025
 pleased-3.987
 amazed-3.889
Token.
Token ID13
Feature activation-1.43e-02

Pos logit contributions
 glad+2.206
 supposed+1.938
 lucky+1.902
 important+1.862
 pleased+1.815
Neg logit contributions
mine-1.296
al-1.252
als-1.160
ted-1.155
talk-1.143
Token He
Token ID679
Feature activation-3.95e-01

Pos logit contributions
mine+0.027
al+0.026
talk+0.025
als+0.025
ted+0.025
Neg logit contributions
 glad-0.032
 lucky-0.028
 important-0.027
 supposed-0.026
 pleased-0.026
Token followed
Token ID3940
Feature activation-4.28e+00

Pos logit contributions
mine+0.737
al+0.698
talk+0.663
als+0.652
ted+0.649
Neg logit contributions
 glad-0.922
 lucky-0.785
 important-0.758
 supposed-0.743
 pleased-0.734
Token Tom
Token ID4186
Feature activation-4.69e-01

Pos logit contributions
mine+9.433
al+9.100
als+8.810
talk+8.471
ede+8.453
Neg logit contributions
 glad-9.669
 lucky-8.220
 sad-7.955
 important-7.804
 tired-7.744
Token and
Token ID290
Feature activation-1.84e+00

Pos logit contributions
mine+0.898
al+0.846
talk+0.808
als+0.790
of+0.786
Neg logit contributions
 glad-1.076
 lucky-0.916
 important-0.883
 supposed-0.858
 pleased-0.857
Token they
Token ID484
Feature activation-2.00e+00

Pos logit contributions
mine+3.432
al+3.087
talk+2.964
x+2.904
of+2.885
Neg logit contributions
 glad-4.542
 lucky-3.918
 important-3.691
 pleased-3.657
 tired-3.591
Token looked
Token ID3114
Feature activation+1.03e+00

Pos logit contributions
mine+3.541
al+3.316
talk+3.106
x+3.047
of+3.026
Neg logit contributions
 glad-4.943
 lucky-4.286
 tired-4.064
 important-4.012
 pleased-3.962
Token for
Token ID329
Feature activation-1.82e+00

Pos logit contributions
 glad+2.260
 supposed+1.949
 lucky+1.885
 important+1.861
 pleased+1.815
Neg logit contributions
mine-1.937
al-1.885
ted-1.816
als-1.809
talk-1.765
Token pull
Token ID2834
Feature activation-2.68e+00

Pos logit contributions
mine+1.843
al+1.717
talk+1.644
als+1.606
ted+1.594
Neg logit contributions
 glad-2.740
 lucky-2.381
 important-2.294
 supposed-2.234
 pleased-2.227
Token some
Token ID617
Feature activation-3.95e-01

Pos logit contributions
mine+5.495
talk+5.080
al+4.994
x+4.774
als+4.747
Neg logit contributions
 glad-6.148
 lucky-5.413
 important-5.163
 sad-4.996
 tired-4.942
Token beans
Token ID16567
Feature activation-1.07e+00

Pos logit contributions
mine+0.716
al+0.677
talk+0.637
als+0.631
ted+0.626
Neg logit contributions
 glad-0.947
 lucky-0.808
 important-0.777
 supposed-0.765
 pleased-0.759
Token.
Token ID13
Feature activation-4.67e-01

Pos logit contributions
mine+2.152
al+2.025
talk+1.945
ted+1.894
als+1.894
Neg logit contributions
 glad-2.348
 lucky-2.000
 important-1.920
 supposed-1.862
 pleased-1.846
Token Sam
Token ID3409
Feature activation-1.24e+00

Pos logit contributions
mine+0.892
al+0.838
talk+0.802
als+0.783
ted+0.782
Neg logit contributions
 glad-1.075
 lucky-0.918
 important-0.884
 supposed-0.863
 pleased-0.856
Token followed
Token ID3940
Feature activation-4.28e+00

Pos logit contributions
mine+2.454
al+2.280
talk+2.224
x+2.148
of+2.134
Neg logit contributions
 glad-2.763
 lucky-2.359
 important-2.255
 pleased-2.189
 supposed-2.172
Token him
Token ID683
Feature activation-1.27e+00

Pos logit contributions
mine+9.526
al+9.237
als+9.183
talk+8.768
ede+8.582
Neg logit contributions
 glad-9.927
 lucky-8.582
 sad-8.343
 happy-8.121
 important-8.004
Token,
Token ID11
Feature activation+1.74e-01

Pos logit contributions
mine+2.555
al+2.397
talk+2.296
als+2.250
ted+2.233
Neg logit contributions
 glad-2.823
 lucky-2.407
 important-2.302
 supposed-2.249
 pleased-2.206
Token but
Token ID475
Feature activation-1.75e+00

Pos logit contributions
 glad+0.400
 lucky+0.348
 important+0.341
 supposed+0.329
 pleased+0.322
Neg logit contributions
mine-0.321
al-0.302
talk-0.283
ted-0.283
als-0.278
Token he
Token ID339
Feature activation-9.68e-01

Pos logit contributions
mine+3.259
al+3.021
talk+2.966
of+2.828
als+2.812
Neg logit contributions
 glad-4.357
 lucky-3.758
 important-3.596
 pleased-3.482
 tired-3.462
Token felt
Token ID2936
Feature activation+2.11e+00

Pos logit contributions
mine+2.083
al+1.959
talk+1.895
als+1.846
of+1.836
Neg logit contributions
 glad-2.001
 lucky-1.678
 important-1.604
 supposed-1.546
 pleased-1.546
Token if
Token ID611
Feature activation-7.87e-02

Pos logit contributions
mine+0.772
al+0.733
talk+0.693
als+0.686
ted+0.681
Neg logit contributions
 glad-0.887
 lucky-0.750
 important-0.722
 pleased-0.706
 supposed-0.705
Token it
Token ID340
Feature activation+3.46e-01

Pos logit contributions
mine+0.119
al+0.112
als+0.104
talk+0.103
ted+0.102
Neg logit contributions
 glad-0.210
 lucky-0.181
 important-0.175
 pleased-0.175
 supposed-0.174
Token fits
Token ID11414
Feature activation-6.85e-01

Pos logit contributions
 glad+0.819
 lucky+0.682
 pleased+0.658
 important+0.652
 supposed+0.642
Neg logit contributions
mine-0.656
al-0.627
als-0.592
talk-0.587
of-0.583
Token!"
Token ID2474
Feature activation-2.76e+00

Pos logit contributions
mine+1.405
al+1.325
talk+1.271
als+1.244
ted+1.237
Neg logit contributions
 glad-1.480
 lucky-1.250
 important-1.200
 supposed-1.162
 pleased-1.157
Token She
Token ID1375
Feature activation-1.36e+00

Pos logit contributions
mine+6.045
al+5.549
x+5.419
talk+5.351
of+5.340
Neg logit contributions
 glad-5.758
 lucky-4.853
 important-4.515
 amazed-4.331
 supposed-4.303
Token helped
Token ID4193
Feature activation-4.28e+00

Pos logit contributions
mine+2.415
al+2.225
talk+2.145
x+2.068
als+2.061
Neg logit contributions
 glad-3.347
 lucky-2.905
 important-2.809
 pleased-2.743
 supposed-2.696
Token Sil
Token ID4243
Feature activation-5.31e-01

Pos logit contributions
mine+8.493
ede+8.308
al+7.883
talk+7.850
als+7.685
Neg logit contributions
 glad-10.173
 lucky-9.009
 happy-8.561
 sad-8.457
 important-8.435
Tokenvia
Token ID8869
Feature activation-5.00e-01

Pos logit contributions
mine+1.114
al+1.050
talk+1.009
als+0.983
of+0.983
Neg logit contributions
 glad-1.123
 lucky-0.945
 important-0.903
 supposed-0.873
 pleased-0.872
Token put
Token ID1234
Feature activation-2.21e+00

Pos logit contributions
mine+0.752
al+0.694
talk+0.652
als+0.633
of+0.628
Neg logit contributions
 glad-1.351
 lucky-1.181
 important-1.147
 supposed-1.118
 pleased-1.116
Token on
Token ID319
Feature activation-2.03e+00

Pos logit contributions
mine+4.519
talk+4.264
al+4.174
x+4.016
of+4.015
Neg logit contributions
 glad-4.906
 lucky-4.246
 important-4.002
 tired-3.812
 cozy-3.767
Token the
Token ID262
Feature activation-1.24e-01

Pos logit contributions
mine+3.901
al+3.614
talk+3.582
x+3.375
als+3.358
Neg logit contributions
 glad-4.818
 lucky-4.224
 important-4.034
 cozy-3.861
 tired-3.827
Token John
Token ID1757
Feature activation+8.29e-01

Pos logit contributions
mine+1.093
al+1.027
talk+0.977
als+0.957
ted+0.951
Neg logit contributions
 glad-1.353
 lucky-1.156
 important-1.112
 supposed-1.083
 pleased-1.080
Token's
Token ID338
Feature activation+1.55e+00

Pos logit contributions
 glad+2.073
 lucky+1.764
 important+1.714
 supposed+1.679
 pleased+1.670
Neg logit contributions
mine-1.450
al-1.344
of-1.264
talk-1.224
als-1.216
Token heart
Token ID2612
Feature activation+1.05e+00

Pos logit contributions
 glad+3.403
 supposed+2.773
 lucky+2.718
 pleased+2.665
 important+2.605
Neg logit contributions
mine-2.995
al-2.972
talk-2.674
als-2.672
x-2.610
Token.
Token ID13
Feature activation+9.50e-01

Pos logit contributions
 glad+2.556
 lucky+2.134
 important+2.036
 supposed+2.027
 pleased+1.994
Neg logit contributions
mine-1.912
al-1.790
als-1.680
of-1.665
talk-1.634
Token He
Token ID679
Feature activation+2.33e-01

Pos logit contributions
 glad+2.352
 lucky+2.092
 important+2.019
 supposed+1.966
 pleased+1.949
Neg logit contributions
mine-1.599
al-1.451
talk-1.413
ted-1.361
of-1.351
Token was
Token ID373
Feature activation+3.04e+00

Pos logit contributions
 glad+0.533
 lucky+0.452
 important+0.434
 supposed+0.434
 pleased+0.420
Neg logit contributions
mine-0.446
al-0.426
talk-0.397
als-0.396
of-0.393
Token so
Token ID523
Feature activation+3.47e+00

Pos logit contributions
 glad+5.444
 tempting+4.890
 important+4.850
 fault+4.710
 supposed+4.705
Neg logit contributions
mine-6.924
al-6.468
talk-5.504
of-5.461
ted-5.455
Token happy
Token ID3772
Feature activation+1.08e-03

Pos logit contributions
 fault+3.542
 supposed+3.307
 glad+3.167
 important+2.609
 tempting+2.596
Neg logit contributions
mine-10.499
al-9.896
of-9.320
als-9.068
Give-9.057
Token and
Token ID290
Feature activation+1.02e+00

Pos logit contributions
 glad+0.003
 lucky+0.002
 important+0.002
 supposed+0.002
 pleased+0.002
Neg logit contributions
mine-0.002
al-0.002
talk-0.002
ted-0.002
als-0.002
Token thanked
Token ID26280
Feature activation-2.28e+00

Pos logit contributions
 glad+2.286
 lucky+1.942
 supposed+1.941
 important+1.912
 fault+1.842
Neg logit contributions
mine-1.844
al-1.780
ted-1.701
als-1.685
of-1.633
Token his
Token ID465
Feature activation+7.06e-01

Pos logit contributions
mine+4.352
al+4.211
als+4.090
talk+4.090
ede+3.996
Neg logit contributions
 glad-5.410
 lucky-4.610
 important-4.433
 pleased-4.241
 happy-4.165
Token okay
Token ID8788
Feature activation+1.90e+00

Pos logit contributions
 glad+5.302
 pleased+4.451
 lucky+4.280
 amazed+4.046
 impressed+4.012
Neg logit contributions
mine-4.577
al-4.573
x-4.381
talk-4.108
 ribbon-4.087
Token."
Token ID526
Feature activation-3.44e-01

Pos logit contributions
 glad+4.655
 pleased+3.968
 lucky+3.910
 amazed+3.690
 impressed+3.688
Neg logit contributions
mine-3.406
al-3.165
of-3.038
als-2.986
talk-2.982
Token
Token ID198
Feature activation+9.64e-01

Pos logit contributions
mine+0.899
al+0.855
talk+0.831
als+0.816
ted+0.814
Neg logit contributions
 glad-0.550
 lucky-0.437
 important-0.417
 supposed-0.402
 pleased-0.391
Token
Token ID198
Feature activation-2.29e+00

Pos logit contributions
 glad+1.739
 lucky+1.459
 supposed+1.445
 important+1.424
 pleased+1.336
Neg logit contributions
mine-2.276
al-2.128
of-2.126
talk-2.083
ted-2.035
TokenTim
Token ID14967
Feature activation-1.18e+00

Pos logit contributions
mine+2.846
al+2.604
als+2.479
ted+2.357
talk+2.334
Neg logit contributions
 glad-7.298
 lucky-6.424
 important-6.257
 pleased-6.139
 happy-5.997
Token felt
Token ID2936
Feature activation+2.55e+00

Pos logit contributions
mine+2.247
al+2.097
talk+2.008
als+1.965
x+1.961
Neg logit contributions
 glad-2.725
 lucky-2.324
 important-2.219
 pleased-2.180
 supposed-2.144
Token a
Token ID257
Feature activation+9.20e-02

Pos logit contributions
 glad+4.477
 supposed+4.269
 important+3.736
 fault+3.660
 lucky+3.560
Neg logit contributions
mine-5.595
al-5.455
ted-5.288
Give-5.138
 cash-4.959
Token little
Token ID1310
Feature activation+7.11e-01

Pos logit contributions
 glad+0.161
 lucky+0.127
 supposed+0.124
 important+0.123
 pleased+0.117
Neg logit contributions
mine-0.223
al-0.212
talk-0.202
of-0.202
als-0.199
Token better
Token ID1365
Feature activation-3.55e-01

Pos logit contributions
 glad+1.214
 supposed+0.982
 lucky+0.955
 important+0.940
 pleased+0.883
Neg logit contributions
mine-1.682
al-1.651
talk-1.561
als-1.543
ted-1.536
Token.
Token ID13
Feature activation+1.51e-01

Pos logit contributions
mine+0.700
al+0.660
talk+0.631
als+0.612
of+0.610
Neg logit contributions
 glad-0.790
 lucky-0.668
 important-0.646
 supposed-0.630
 pleased-0.624
Token He
Token ID679
Feature activation+3.00e-01

Pos logit contributions
 glad+0.346
 lucky+0.297
 important+0.284
 supposed+0.283
 pleased+0.278
Neg logit contributions
mine-0.283
al-0.271
talk-0.260
als-0.254
ted-0.252
Token the
Token ID262
Feature activation+3.22e-01

Pos logit contributions
 glad+0.415
 lucky+0.356
 important+0.346
 supposed+0.342
 pleased+0.342
Neg logit contributions
mine-0.274
al-0.260
als-0.242
talk-0.242
ted-0.241
Token queen
Token ID16599
Feature activation-2.96e-01

Pos logit contributions
 glad+0.697
 lucky+0.577
 supposed+0.560
 important+0.559
 pleased+0.551
Neg logit contributions
mine-0.639
al-0.615
als-0.576
talk-0.574
x-0.562
Token again
Token ID757
Feature activation+2.25e-01

Pos logit contributions
mine+0.492
al+0.458
talk+0.427
als+0.422
of+0.419
Neg logit contributions
 glad-0.750
 lucky-0.650
 important-0.633
 supposed-0.617
 pleased-0.610
Token.
Token ID13
Feature activation+7.23e-01

Pos logit contributions
 glad+0.547
 lucky+0.471
 important+0.459
 supposed+0.452
 pleased+0.441
Neg logit contributions
mine-0.397
al-0.375
talk-0.346
of-0.346
als-0.343
Token<|endoftext|>
Token ID50256
Feature activation+8.32e-01

Pos logit contributions
 glad+1.396
 lucky+1.185
 important+1.175
 supposed+1.143
 pleased+1.112
Neg logit contributions
mine-1.601
al-1.505
talk-1.457
of-1.442
als-1.432
TokenOnce
Token ID7454
Feature activation+2.49e+00

Pos logit contributions
 glad+1.596
 lucky+1.306
 important+1.287
 pleased+1.245
 supposed+1.226
Neg logit contributions
mine-1.987
al-1.903
of-1.790
talk-1.786
ted-1.782
Token there
Token ID612
Feature activation+9.86e-01

Pos logit contributions
 glad+4.080
 supposed+3.632
 important+3.316
 lucky+3.293
 pleased+3.097
Neg logit contributions
mine-6.658
of-6.140
x-5.739
al-5.726
ted-5.468
Token was
Token ID373
Feature activation+4.78e-01

Pos logit contributions
 glad+2.348
 supposed+2.012
 lucky+2.001
 important+1.989
 pleased+1.920
Neg logit contributions
mine-1.862
al-1.675
of-1.588
ted-1.538
als-1.514
Token a
Token ID257
Feature activation+1.92e-01

Pos logit contributions
 glad+0.995
 lucky+0.829
 important+0.802
 pleased+0.801
 supposed+0.791
Neg logit contributions
mine-0.986
al-0.928
ted-0.874
talk-0.871
of-0.859
Token messy
Token ID24097
Feature activation-5.62e-01

Pos logit contributions
 glad+0.331
 lucky+0.262
 supposed+0.259
 important+0.255
 pleased+0.245
Neg logit contributions
mine-0.460
al-0.442
of-0.421
talk-0.415
x-0.414
Token piece
Token ID3704
Feature activation-3.38e-01

Pos logit contributions
mine+1.060
al+0.997
talk+0.947
als+0.929
ted+0.921
Neg logit contributions
 glad-1.301
 lucky-1.110
 important-1.069
 supposed-1.043
 pleased-1.038
Token he
Token ID339
Feature activation+5.38e-01

Pos logit contributions
 glad+2.774
 lucky+2.347
 supposed+2.339
 important+2.250
 pleased+2.204
Neg logit contributions
mine-2.010
al-1.909
ted-1.841
als-1.825
of-1.765
Token liked
Token ID8288
Feature activation+1.02e+00

Pos logit contributions
 glad+1.260
 lucky+1.068
 supposed+1.028
 important+1.027
 pleased+0.988
Neg logit contributions
mine-1.002
al-0.951
als-0.896
ted-0.882
of-0.879
Token it
Token ID340
Feature activation+1.03e+00

Pos logit contributions
 glad+2.902
 lucky+2.535
 pleased+2.445
 important+2.430
 supposed+2.422
Neg logit contributions
mine-1.344
al-1.245
ted-1.191
als-1.136
of-1.110
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
 glad+2.424
 lucky+1.990
 pleased+1.928
 important+1.871
 supposed+1.871
Neg logit contributions
mine-1.939
al-1.818
als-1.687
of-1.683
ted-1.658
Token He
Token ID679
Feature activation-1.06e-01

Pos logit contributions
 glad+2.520
 lucky+2.188
 important+2.113
 supposed+2.108
 pleased+2.056
Neg logit contributions
mine-1.911
talk-1.732
al-1.729
ted-1.683
als-1.636
Token was
Token ID373
Feature activation+2.86e+00

Pos logit contributions
mine+0.187
al+0.176
talk+0.165
als+0.164
ted+0.163
Neg logit contributions
 glad-0.257
 lucky-0.221
 important-0.212
 supposed-0.210
 pleased-0.206
Token very
Token ID845
Feature activation+3.13e+00

Pos logit contributions
 glad+4.897
 important+4.284
 lucky+4.190
 valuable+4.118
 fault+4.091
Neg logit contributions
mine-6.211
al-5.951
ted-5.562
Give-5.538
ting-5.507
Token happy
Token ID3772
Feature activation+2.25e-01

Pos logit contributions
 glad+3.194
 fault+3.183
 supposed+2.991
 important+2.446
 tempting+2.210
Neg logit contributions
mine-8.509
Give-8.222
al-8.171
 cash-7.948
of-7.856
Token.
Token ID13
Feature activation+1.49e+00

Pos logit contributions
 glad+0.548
 lucky+0.471
 important+0.454
 pleased+0.444
 supposed+0.443
Neg logit contributions
mine-0.394
al-0.373
ted-0.348
talk-0.348
als-0.346
Token
Token ID198
Feature activation+1.24e+00

Pos logit contributions
 glad+3.084
 lucky+2.663
 important+2.585
 supposed+2.568
 pleased+2.495
Neg logit contributions
mine-3.129
talk-2.881
al-2.843
ted-2.800
of-2.728
Token
Token ID198
Feature activation-6.32e-01

Pos logit contributions
 glad+2.271
 lucky+1.911
 important+1.841
 supposed+1.799
 lonely+1.741
Neg logit contributions
mine-2.921
of-2.679
talk-2.679
al-2.610
ted-2.528
Token Jane
Token ID12091
Feature activation+2.00e-01

Pos logit contributions
mine+2.345
al+2.188
talk+2.142
x+2.066
als+2.059
Neg logit contributions
 glad-2.895
 lucky-2.487
 important-2.399
 supposed-2.307
 pleased-2.305
Token's
Token ID338
Feature activation+5.97e-01

Pos logit contributions
 glad+0.487
 lucky+0.417
 important+0.407
 pleased+0.393
 supposed+0.389
Neg logit contributions
mine-0.359
al-0.337
of-0.315
talk-0.314
als-0.313
Token feet
Token ID3625
Feature activation-6.46e-02

Pos logit contributions
 glad+1.417
 lucky+1.182
 supposed+1.155
 important+1.144
 pleased+1.138
Neg logit contributions
mine-1.079
al-1.046
als-0.967
talk-0.961
of-0.947
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
mine+0.125
al+0.118
talk+0.111
als+0.110
of+0.109
Neg logit contributions
 glad-0.146
 lucky-0.125
 important-0.120
 supposed-0.116
 pleased-0.116
Token Jane
Token ID12091
Feature activation-1.30e-01

Pos logit contributions
 glad+2.585
 lucky+2.312
 important+2.271
 supposed+2.225
 pleased+2.120
Neg logit contributions
mine-1.610
al-1.488
als-1.406
of-1.367
talk-1.357
Token was
Token ID373
Feature activation+2.85e+00

Pos logit contributions
mine+0.246
al+0.232
talk+0.217
als+0.216
of+0.215
Neg logit contributions
 glad-0.300
 lucky-0.257
 important-0.249
 supposed-0.244
 pleased-0.238
Token very
Token ID845
Feature activation+3.03e+00

Pos logit contributions
 glad+5.028
 important+4.607
 tempting+4.359
 fault+4.286
 lucky+4.257
Neg logit contributions
mine-6.700
al-6.392
als-5.571
of-5.558
talk-5.513
Token happy
Token ID3772
Feature activation+2.48e-01

Pos logit contributions
 glad+2.853
 fault+2.831
 supposed+2.732
 important+2.321
 tempting+2.029
Neg logit contributions
mine-8.837
al-8.524
of-8.053
als-7.890
talk-7.838
Token and
Token ID290
Feature activation+5.48e-01

Pos logit contributions
 glad+0.621
 lucky+0.539
 important+0.524
 supposed+0.517
 pleased+0.506
Neg logit contributions
mine-0.408
al-0.385
als-0.358
ted-0.355
talk-0.354
Token thanked
Token ID26280
Feature activation-1.76e+00

Pos logit contributions
 glad+1.301
 lucky+1.118
 important+1.098
 supposed+1.097
 pleased+1.039
Neg logit contributions
mine-0.942
al-0.903
als-0.856
ted-0.847
talk-0.827
Token the
Token ID262
Feature activation+6.26e-01

Pos logit contributions
mine+3.155
talk+2.974
al+2.969
x+2.871
ted+2.816
Neg logit contributions
 glad-4.341
 lucky-3.717
 important-3.524
 pleased-3.467
 supposed-3.355
Token girl
Token ID2576
Feature activation-1.07e+00

Pos logit contributions
mine+3.449
talk+3.168
al+3.116
ted+2.980
x+2.961
Neg logit contributions
 glad-4.578
 lucky-4.000
 important-3.800
 tired-3.660
 sad-3.636
Token named
Token ID3706
Feature activation+1.74e-01

Pos logit contributions
mine+2.044
al+1.894
talk+1.828
als+1.794
ted+1.780
Neg logit contributions
 glad-2.512
 lucky-2.142
 important-2.065
 supposed-1.997
 pleased-1.976
Token Lily
Token ID20037
Feature activation-2.17e-01

Pos logit contributions
 glad+0.475
 lucky+0.414
 important+0.405
 pleased+0.401
 supposed+0.395
Neg logit contributions
mine-0.257
al-0.234
als-0.216
of-0.215
talk-0.215
Token.
Token ID13
Feature activation-1.57e-01

Pos logit contributions
mine+0.420
al+0.395
talk+0.377
of+0.366
x+0.365
Neg logit contributions
 glad-0.492
 lucky-0.419
 important-0.402
 pleased-0.396
 supposed-0.389
Token She
Token ID1375
Feature activation+1.10e+00

Pos logit contributions
mine+0.256
al+0.235
talk+0.221
als+0.219
of+0.217
Neg logit contributions
 glad-0.408
 lucky-0.356
 important-0.342
 supposed-0.337
 pleased-0.337
Token had
Token ID550
Feature activation+1.02e+00

Pos logit contributions
 glad+1.987
 lucky+1.545
 pleased+1.540
 fault+1.462
 supposed+1.447
Neg logit contributions
mine-2.625
al-2.504
ted-2.372
talk-2.370
Give-2.360
Token a
Token ID257
Feature activation+1.13e+00

Pos logit contributions
 glad+2.246
 pleased+1.889
 lucky+1.844
 important+1.803
 tempting+1.721
Neg logit contributions
mine-1.980
al-1.857
talk-1.763
als-1.749
ted-1.743
Token big
Token ID1263
Feature activation+3.18e-01

Pos logit contributions
 glad+2.166
 supposed+1.729
 pleased+1.710
 lucky+1.682
 important+1.618
Neg logit contributions
mine-2.451
al-2.342
 cash-2.237
als-2.188
x-2.175
Token sun
Token ID4252
Feature activation-1.38e+00

Pos logit contributions
 glad+0.720
 lucky+0.608
 supposed+0.589
 important+0.585
 pleased+0.585
Neg logit contributions
mine-0.605
al-0.567
talk-0.526
als-0.522
x-0.520
Tokenflower
Token ID25547
Feature activation-2.72e-01

Pos logit contributions
mine+2.467
al+2.294
talk+2.252
of+2.168
ted+2.139
Neg logit contributions
 glad-3.318
 lucky-2.848
 important-2.726
 supposed-2.658
 pleased-2.629
Token in
Token ID287
Feature activation-1.41e+00

Pos logit contributions
mine+0.486
al+0.460
talk+0.431
als+0.427
ted+0.420
Neg logit contributions
 glad-0.654
 lucky-0.562
 important-0.541
 pleased-0.531
 supposed-0.530
Token water
Token ID1660
Feature activation-6.42e-01

Pos logit contributions
 glad+2.466
 lucky+2.098
 supposed+2.097
 pleased+2.068
 important+2.038
Neg logit contributions
mine-1.552
al-1.473
ted-1.401
talk-1.374
als-1.336
Token for
Token ID329
Feature activation+2.70e-01

Pos logit contributions
mine+1.248
al+1.174
talk+1.122
als+1.097
ted+1.091
Neg logit contributions
 glad-1.456
 lucky-1.239
 important-1.192
 supposed-1.158
 pleased-1.154
Token the
Token ID262
Feature activation+6.04e-01

Pos logit contributions
 glad+0.668
 lucky+0.571
 important+0.553
 supposed+0.553
 pleased+0.551
Neg logit contributions
mine-0.452
al-0.426
ted-0.401
talk-0.397
of-0.389
Token cat
Token ID3797
Feature activation+5.53e-01

Pos logit contributions
 glad+1.620
 lucky+1.381
 supposed+1.369
 pleased+1.360
 important+1.341
Neg logit contributions
mine-0.899
al-0.835
talk-0.801
als-0.799
ted-0.796
Token.
Token ID13
Feature activation+2.03e-01

Pos logit contributions
 glad+1.334
 lucky+1.127
 pleased+1.091
 important+1.077
 supposed+1.057
Neg logit contributions
mine-0.995
al-0.953
als-0.899
talk-0.890
ted-0.879
Token He
Token ID679
Feature activation+1.16e-01

Pos logit contributions
 glad+0.468
 lucky+0.403
 important+0.388
 supposed+0.385
 pleased+0.377
Neg logit contributions
mine-0.376
al-0.360
talk-0.349
als-0.345
ted-0.344
Token put
Token ID1234
Feature activation-5.37e-01

Pos logit contributions
 glad+0.289
 lucky+0.248
 important+0.240
 supposed+0.239
 pleased+0.234
Neg logit contributions
mine-0.199
al-0.189
talk-0.177
als-0.175
ted-0.175
Token the
Token ID262
Feature activation+8.77e-01

Pos logit contributions
mine+0.975
al+0.915
talk+0.867
als+0.851
ted+0.848
Neg logit contributions
 glad-1.291
 lucky-1.108
 important-1.069
 supposed-1.045
 pleased-1.040
Token water
Token ID1660
Feature activation-3.21e-01

Pos logit contributions
 glad+2.523
 supposed+2.200
 lucky+2.157
 pleased+2.142
 important+2.126
Neg logit contributions
mine-1.126
al-1.088
als-1.027
ted-0.966
talk-0.961
Token in
Token ID287
Feature activation-2.02e-01

Pos logit contributions
mine+0.616
al+0.583
talk+0.550
als+0.548
ted+0.544
Neg logit contributions
 glad-0.731
 lucky-0.622
 important-0.599
 supposed-0.588
 pleased-0.582
Token a
Token ID257
Feature activation+5.51e-01

Pos logit contributions
mine+0.338
al+0.313
talk+0.293
ted+0.287
of+0.286
Neg logit contributions
 glad-0.513
 lucky-0.444
 important-0.430
 supposed-0.426
 pleased-0.420
Token the
Token ID262
Feature activation+6.70e-01

Pos logit contributions
mine+4.622
talk+4.385
ede+4.237
al+4.173
x+4.020
Neg logit contributions
 glad-7.522
 lucky-6.655
 tired-6.374
 sad-6.327
 important-6.281
Token dog
Token ID3290
Feature activation+3.37e-01

Pos logit contributions
 glad+1.729
 lucky+1.480
 supposed+1.446
 pleased+1.439
 important+1.432
Neg logit contributions
mine-1.071
al-1.000
talk-0.931
ted-0.928
als-0.916
Token and
Token ID290
Feature activation-1.21e+00

Pos logit contributions
 glad+0.826
 lucky+0.708
 important+0.689
 supposed+0.689
 pleased+0.665
Neg logit contributions
mine-0.588
al-0.557
ted-0.525
als-0.524
of-0.519
Token they
Token ID484
Feature activation-1.74e+00

Pos logit contributions
mine+2.192
al+2.050
talk+1.962
als+1.893
x+1.888
Neg logit contributions
 glad-2.937
 lucky-2.524
 important-2.425
 pleased-2.361
 supposed-2.323
Token played
Token ID2826
Feature activation-4.07e-01

Pos logit contributions
mine+3.203
al+3.052
talk+2.963
x+2.844
als+2.793
Neg logit contributions
 glad-4.106
 lucky-3.499
 important-3.299
 tired-3.263
 pleased-3.255
Token together
Token ID1978
Feature activation+7.42e-02

Pos logit contributions
mine+0.771
al+0.722
talk+0.688
als+0.673
ted+0.672
Neg logit contributions
 glad-0.938
 lucky-0.802
 important-0.773
 supposed-0.757
 pleased-0.748
Token for
Token ID329
Feature activation-6.85e-02

Pos logit contributions
 glad+0.178
 lucky+0.153
 important+0.147
 supposed+0.145
 pleased+0.144
Neg logit contributions
mine-0.135
al-0.126
talk-0.118
als-0.116
of-0.116
Token a
Token ID257
Feature activation+3.96e-01

Pos logit contributions
mine+0.140
al+0.133
talk+0.127
ted+0.125
of+0.124
Neg logit contributions
 glad-0.146
 lucky-0.122
 important-0.118
 pleased-0.116
 supposed-0.115
Token while
Token ID981
Feature activation-1.69e-01

Pos logit contributions
 glad+0.979
 lucky+0.833
 supposed+0.812
 important+0.811
 pleased+0.791
Neg logit contributions
mine-0.676
of-0.605
al-0.600
talk-0.575
x-0.565
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
mine+0.315
al+0.295
talk+0.278
ted+0.275
als+0.274
Neg logit contributions
 glad-0.395
 lucky-0.339
 important-0.327
 supposed-0.321
 pleased-0.317
Token
Token ID198
Feature activation+1.73e+00

Pos logit contributions
 glad+3.484
 lucky+3.064
 supposed+3.003
 pleased+2.982
 important+2.974
Neg logit contributions
mine-2.678
ted-2.531
al-2.420
of-2.385
talk-2.372
Token.
Token ID13
Feature activation+1.19e+00

Pos logit contributions
mine+0.777
al+0.732
talk+0.697
als+0.684
ted+0.679
Neg logit contributions
 glad-0.909
 lucky-0.771
 important-0.739
 supposed-0.722
 pleased-0.722
Token<|endoftext|>
Token ID50256
Feature activation+7.18e-01

Pos logit contributions
 glad+2.387
 lucky+2.041
 important+1.987
 pleased+1.937
 supposed+1.913
Neg logit contributions
mine-2.524
al-2.362
of-2.350
talk-2.307
als-2.291
TokenOnce
Token ID7454
Feature activation+2.18e+00

Pos logit contributions
 glad+1.372
 lucky+1.121
 important+1.097
 pleased+1.070
 supposed+1.049
Neg logit contributions
mine-1.709
al-1.632
ted-1.547
of-1.539
talk-1.535
Token upon
Token ID2402
Feature activation+1.09e+00

Pos logit contributions
 glad+3.591
 supposed+3.013
 lucky+2.903
 important+2.902
 pleased+2.760
Neg logit contributions
mine-5.702
of-5.322
al-4.912
x-4.891
ted-4.782
Token a
Token ID257
Feature activation+1.60e+00

Pos logit contributions
 glad+2.466
 lucky+2.055
 important+2.020
 pleased+2.017
 supposed+1.971
Neg logit contributions
mine-2.077
al-1.861
of-1.837
talk-1.784
Give-1.752
Token time
Token ID640
Feature activation+1.09e+00

Pos logit contributions
 glad+2.595
 lucky+2.006
 pleased+1.940
 supposed+1.902
 important+1.843
Neg logit contributions
mine-4.070
of-3.936
al-3.735
talk-3.715
ede-3.603
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
 glad+2.832
 important+2.472
 lucky+2.463
 supposed+2.451
 pleased+2.399
Neg logit contributions
mine-1.723
of-1.508
al-1.505
ted-1.393
als-1.380
Token there
Token ID612
Feature activation+9.77e-01

Pos logit contributions
 glad+2.787
 supposed+2.339
 lucky+2.274
 pleased+2.267
 important+2.265
Neg logit contributions
mine-2.985
al-2.765
of-2.606
ted-2.543
als-2.532
Token was
Token ID373
Feature activation+1.33e-01

Pos logit contributions
 glad+2.402
 lucky+2.037
 supposed+1.981
 pleased+1.979
 important+1.976
Neg logit contributions
mine-1.776
al-1.629
of-1.539
ted-1.514
als-1.468
Token a
Token ID257
Feature activation+2.90e-01

Pos logit contributions
 glad+0.279
 lucky+0.232
 important+0.223
 pleased+0.221
 supposed+0.218
Neg logit contributions
mine-0.278
al-0.262
talk-0.246
ted-0.246
als-0.243
Token little
Token ID1310
Feature activation-1.73e+00

Pos logit contributions
 glad+0.500
 supposed+0.397
 lucky+0.393
 important+0.385
 pleased+0.370
Neg logit contributions
mine-0.684
al-0.659
of-0.619
als-0.617
talk-0.613
Token home
Token ID1363
Feature activation-9.89e-01

Pos logit contributions
mine+5.093
al+4.937
talk+4.758
Give+4.672
ted+4.626
Neg logit contributions
 glad-5.402
 lucky-4.642
 important-4.406
 pleased-4.282
 tired-4.249
Token,
Token ID11
Feature activation-2.30e-01

Pos logit contributions
mine+1.776
al+1.672
talk+1.593
ted+1.554
als+1.553
Neg logit contributions
 glad-2.388
 lucky-2.055
 important-1.981
 pleased-1.929
 supposed-1.921
Token but
Token ID475
Feature activation-6.65e-01

Pos logit contributions
mine+0.412
al+0.384
talk+0.363
als+0.357
ted+0.356
Neg logit contributions
 glad-0.551
 lucky-0.476
 important-0.462
 supposed-0.451
 pleased-0.442
Token when
Token ID618
Feature activation-6.18e-01

Pos logit contributions
mine+1.146
al+1.069
talk+1.015
als+0.989
ted+0.983
Neg logit contributions
 glad-1.655
 lucky-1.430
 important-1.382
 supposed-1.346
 pleased-1.342
Token she
Token ID673
Feature activation-4.66e-01

Pos logit contributions
mine+0.958
al+0.886
talk+0.835
als+0.813
ted+0.807
Neg logit contributions
 glad-1.641
 lucky-1.433
 important-1.388
 supposed-1.356
 pleased-1.351
Token tried
Token ID3088
Feature activation+2.89e-02

Pos logit contributions
mine+0.833
al+0.780
talk+0.736
als+0.721
ted+0.719
Neg logit contributions
 glad-1.136
 lucky-0.976
 important-0.945
 supposed-0.920
 pleased-0.914
Token it
Token ID340
Feature activation-5.85e-01

Pos logit contributions
 glad+0.062
 lucky+0.052
 important+0.050
 pleased+0.049
 supposed+0.049
Neg logit contributions
mine-0.059
al-0.055
talk-0.053
ted-0.052
als-0.052
Token on
Token ID319
Feature activation-1.38e+00

Pos logit contributions
mine+1.159
al+1.090
talk+1.041
als+1.020
ted+1.015
Neg logit contributions
 glad-1.303
 lucky-1.104
 important-1.060
 supposed-1.030
 pleased-1.028
Token she
Token ID673
Feature activation-2.93e-01

Pos logit contributions
mine+2.598
al+2.452
talk+2.398
als+2.331
ted+2.319
Neg logit contributions
 glad-3.188
 lucky-2.735
 important-2.662
 supposed-2.553
 pleased-2.535
Token realized
Token ID6939
Feature activation-5.06e-01

Pos logit contributions
mine+0.581
al+0.548
talk+0.521
als+0.511
of+0.508
Neg logit contributions
 glad-0.656
 lucky-0.554
 important-0.533
 pleased-0.518
 supposed-0.517
Token it
Token ID340
Feature activation-5.86e-01

Pos logit contributions
mine+0.819
al+0.762
talk+0.715
als+0.700
ted+0.696
Neg logit contributions
 glad-1.304
 lucky-1.132
 important-1.094
 supposed-1.070
 pleased-1.068
Token chew
Token ID34722
Feature activation-9.39e-01

Pos logit contributions
mine+2.139
al+2.000
talk+1.888
als+1.861
ted+1.846
Neg logit contributions
 glad-3.412
 lucky-2.966
 important-2.872
 supposed-2.777
 pleased-2.765
Token it
Token ID340
Feature activation-2.77e-01

Pos logit contributions
mine+1.811
al+1.692
talk+1.635
als+1.581
of+1.571
Neg logit contributions
 glad-2.137
 lucky-1.831
 important-1.771
 supposed-1.700
 pleased-1.693
Token."
Token ID526
Feature activation+6.74e-01

Pos logit contributions
mine+0.562
al+0.529
talk+0.506
als+0.498
of+0.494
Neg logit contributions
 glad-0.609
 lucky-0.511
 important-0.485
 supposed-0.480
 pleased-0.477
Token
Token ID198
Feature activation+8.92e-01

Pos logit contributions
 glad+1.219
 supposed+1.006
 lucky+1.005
 important+0.998
 pleased+0.942
Neg logit contributions
mine-1.565
al-1.473
of-1.443
talk-1.429
ted-1.424
Token
Token ID198
Feature activation-2.22e+00

Pos logit contributions
 glad+1.573
 lucky+1.322
 important+1.287
 supposed+1.266
 pleased+1.218
Neg logit contributions
mine-2.128
al-2.011
of-1.995
talk-1.945
ted-1.918
TokenShe
Token ID3347
Feature activation-7.24e-01

Pos logit contributions
mine+3.291
al+3.029
talk+2.876
als+2.869
ted+2.819
Neg logit contributions
 glad-6.424
 lucky-5.545
 important-5.335
 pleased-5.251
 happy-5.125
Token hugs
Token ID40657
Feature activation-3.68e+00

Pos logit contributions
mine+1.331
al+1.243
talk+1.188
als+1.157
ted+1.149
Neg logit contributions
 glad-1.719
 lucky-1.476
 important-1.422
 supposed-1.382
 pleased-1.378
Token Tim
Token ID5045
Feature activation-7.78e-01

Pos logit contributions
mine+7.456
talk+7.302
al+6.902
x+6.833
ede+6.680
Neg logit contributions
 glad-8.284
 lucky-6.903
 happy-6.650
 tired-6.648
 sad-6.629
Token and
Token ID290
Feature activation-9.26e-01

Pos logit contributions
mine+1.377
al+1.284
talk+1.224
als+1.193
ted+1.184
Neg logit contributions
 glad-1.903
 lucky-1.639
 important-1.580
 supposed-1.535
 pleased-1.533
Token says
Token ID1139
Feature activation+9.88e-01

Pos logit contributions
mine+1.844
al+1.722
talk+1.654
x+1.607
als+1.606
Neg logit contributions
 glad-2.088
 lucky-1.774
 important-1.697
 pleased-1.642
 supposed-1.641
Token sorry
Token ID7926
Feature activation+5.09e-02

Pos logit contributions
 glad+2.248
 lucky+1.961
 supposed+1.951
 important+1.899
 pleased+1.833
Neg logit contributions
mine-1.858
al-1.715
of-1.629
ted-1.624
als-1.600
Token his
Token ID465
Feature activation+1.23e+00

Pos logit contributions
mine+0.721
al+0.678
talk+0.644
als+0.631
ted+0.627
Neg logit contributions
 glad-1.027
 lucky-0.885
 important-0.854
 supposed-0.835
 pleased-0.831
Token books
Token ID3835
Feature activation-1.69e-01

Pos logit contributions
 glad+2.987
 supposed+2.513
 lucky+2.461
 pleased+2.446
 important+2.419
Neg logit contributions
mine-2.080
al-2.030
als-1.953
talk-1.899
x-1.877
Token.
Token ID13
Feature activation+5.56e-01

Pos logit contributions
mine+0.326
al+0.305
talk+0.290
als+0.285
of+0.283
Neg logit contributions
 glad-0.387
 lucky-0.329
 important-0.316
 supposed-0.308
 pleased-0.307
Token He
Token ID679
Feature activation-6.03e-01

Pos logit contributions
 glad+1.257
 lucky+1.083
 supposed+1.059
 important+1.045
 pleased+1.016
Neg logit contributions
mine-1.042
al-0.983
talk-0.954
ted-0.944
als-0.937
Token told
Token ID1297
Feature activation-2.00e+00

Pos logit contributions
mine+1.185
al+1.117
talk+1.068
als+1.045
ted+1.039
Neg logit contributions
 glad-1.351
 lucky-1.147
 important-1.103
 supposed-1.072
 pleased-1.067
Token them
Token ID606
Feature activation-2.14e-01

Pos logit contributions
mine+2.867
al+2.642
talk+2.450
als+2.409
x+2.346
Neg logit contributions
 glad-5.714
 lucky-5.092
 important-5.017
 sad-4.753
 pleased-4.713
Token stories
Token ID3923
Feature activation+4.52e-02

Pos logit contributions
mine+0.424
al+0.399
talk+0.383
als+0.378
of+0.376
Neg logit contributions
 glad-0.471
 lucky-0.396
 important-0.379
 supposed-0.373
 pleased-0.372
Token and
Token ID290
Feature activation-5.67e-02

Pos logit contributions
 glad+0.102
 lucky+0.087
 important+0.083
 supposed+0.081
 pleased+0.081
Neg logit contributions
mine-0.088
al-0.083
talk-0.079
als-0.078
of-0.078
Token jokes
Token ID14532
Feature activation-1.87e-01

Pos logit contributions
mine+0.105
al+0.100
talk+0.096
als+0.094
ted+0.094
Neg logit contributions
 glad-0.130
 lucky-0.111
 important-0.106
 supposed-0.104
 pleased-0.104
Token.
Token ID13
Feature activation+8.50e-01

Pos logit contributions
mine+0.352
al+0.332
talk+0.313
als+0.312
ted+0.310
Neg logit contributions
 glad-0.430
 lucky-0.368
 important-0.353
 supposed-0.345
 pleased-0.342
Token They
Token ID1119
Feature activation-1.20e+00

Pos logit contributions
 glad+1.666
 lucky+1.407
 supposed+1.396
 important+1.349
 pleased+1.305
Neg logit contributions
mine-1.843
al-1.737
talk-1.702
ted-1.692
als-1.668
Token It
Token ID632
Feature activation+9.22e-01

Pos logit contributions
 glad+1.043
 lucky+0.892
 supposed+0.866
 important+0.858
 pleased+0.835
Neg logit contributions
mine-0.817
al-0.788
ted-0.747
als-0.742
talk-0.736
Token was
Token ID373
Feature activation+2.43e+00

Pos logit contributions
 glad+2.277
 lucky+1.911
 supposed+1.866
 important+1.840
 pleased+1.822
Neg logit contributions
mine-1.586
al-1.563
ted-1.485
als-1.443
of-1.401
Token small
Token ID1402
Feature activation-3.88e-02

Pos logit contributions
 glad+5.542
 lucky+4.642
 pleased+4.535
 fault+4.382
 supposed+4.380
Neg logit contributions
al-4.175
mine-4.058
ted-3.887
 cash-3.776
Give-3.699
Token,
Token ID11
Feature activation+8.07e-01

Pos logit contributions
mine+0.060
al+0.058
als+0.053
talk+0.053
ted+0.053
Neg logit contributions
 glad-0.102
 lucky-0.088
 important-0.085
 pleased-0.084
 supposed-0.083
Token but
Token ID475
Feature activation+1.85e+00

Pos logit contributions
 glad+1.575
 lucky+1.284
 supposed+1.236
 pleased+1.204
 important+1.195
Neg logit contributions
mine-1.725
al-1.675
ted-1.607
als-1.597
talk-1.549
Token smart
Token ID4451
Feature activation-4.35e-01

Pos logit contributions
 glad+4.898
 supposed+4.366
 lucky+4.208
 fault+4.194
 important+4.116
Neg logit contributions
mine-2.620
al-2.592
ted-2.456
als-2.401
Give-2.358
Token.
Token ID13
Feature activation+5.62e-01

Pos logit contributions
mine+0.858
al+0.815
talk+0.773
als+0.765
ted+0.762
Neg logit contributions
 glad-0.974
 lucky-0.824
 important-0.791
 pleased-0.771
 supposed-0.770
Token It
Token ID632
Feature activation+5.86e-01

Pos logit contributions
 glad+1.326
 lucky+1.135
 supposed+1.116
 important+1.097
 pleased+1.072
Neg logit contributions
mine-1.005
al-0.979
ted-0.932
als-0.931
talk-0.909
Token said
Token ID531
Feature activation+3.94e-01

Pos logit contributions
 glad+1.381
 lucky+1.160
 supposed+1.128
 important+1.118
 pleased+1.096
Neg logit contributions
mine-1.058
al-1.033
ted-0.969
als-0.962
talk-0.938
Token it
Token ID340
Feature activation+5.36e-01

Pos logit contributions
 glad+0.934
 lucky+0.792
 supposed+0.787
 important+0.767
 pleased+0.749
Neg logit contributions
mine-0.700
al-0.670
als-0.632
ted-0.621
of-0.604
Token knew
Token ID2993
Feature activation+2.92e-01

Pos logit contributions
 glad+1.244
 lucky+1.022
 pleased+0.992
 supposed+0.967
 important+0.952
Neg logit contributions
mine-1.021
al-0.993
als-0.939
ted-0.924
talk-0.910
Token upon
Token ID2402
Feature activation+9.55e-01

Pos logit contributions
 glad+4.062
 supposed+3.451
 lucky+3.309
 important+3.250
 pleased+3.164
Neg logit contributions
mine-6.165
of-5.846
x-5.313
al-5.291
ted-5.213
Token a
Token ID257
Feature activation+1.56e+00

Pos logit contributions
 glad+2.143
 lucky+1.788
 important+1.755
 pleased+1.745
 supposed+1.714
Neg logit contributions
mine-1.850
al-1.669
of-1.627
talk-1.605
Give-1.566
Token time
Token ID640
Feature activation+1.36e+00

Pos logit contributions
 glad+2.486
 lucky+1.941
 pleased+1.836
 supposed+1.821
 important+1.792
Neg logit contributions
mine-4.025
of-3.887
al-3.672
talk-3.658
ede-3.572
Token,
Token ID11
Feature activation+1.63e+00

Pos logit contributions
 glad+3.624
 important+3.207
 supposed+3.172
 lucky+3.160
 pleased+3.110
Neg logit contributions
mine-2.080
of-1.807
al-1.762
ted-1.633
als-1.621
Token there
Token ID612
Feature activation+1.47e+00

Pos logit contributions
 glad+3.351
 supposed+2.845
 pleased+2.745
 lucky+2.734
 important+2.726
Neg logit contributions
mine-3.444
al-3.174
of-2.968
x-2.937
ted-2.893
Token was
Token ID373
Feature activation-3.32e-02

Pos logit contributions
 glad+3.757
 lucky+3.195
 pleased+3.155
 important+3.135
 supposed+3.135
Neg logit contributions
mine-2.536
al-2.282
of-2.212
ted-2.119
Give-2.079
Token a
Token ID257
Feature activation+1.66e-01

Pos logit contributions
mine+0.070
al+0.066
talk+0.063
ted+0.062
als+0.062
Neg logit contributions
 glad-0.068
 lucky-0.056
 important-0.054
 pleased-0.054
 supposed-0.053
Token little
Token ID1310
Feature activation-1.88e+00

Pos logit contributions
 glad+0.283
 lucky+0.223
 supposed+0.221
 important+0.217
 pleased+0.208
Neg logit contributions
mine-0.399
al-0.383
of-0.361
als-0.360
talk-0.360
Token girl
Token ID2576
Feature activation-1.07e+00

Pos logit contributions
mine+3.516
talk+3.231
al+3.189
ted+3.048
x+3.021
Neg logit contributions
 glad-4.615
 lucky-4.034
 important-3.828
 tired-3.691
 sad-3.660
Token named
Token ID3706
Feature activation+4.00e-02

Pos logit contributions
mine+2.045
al+1.896
talk+1.829
als+1.797
ted+1.784
Neg logit contributions
 glad-2.516
 lucky-2.147
 important-2.070
 supposed-2.002
 pleased-1.979
Token Lily
Token ID20037
Feature activation-1.08e-01

Pos logit contributions
 glad+0.108
 lucky+0.094
 important+0.092
 pleased+0.091
 supposed+0.090
Neg logit contributions
mine-0.060
al-0.055
talk-0.051
als-0.051
of-0.051
Token everyone
Token ID2506
Feature activation+1.83e-01

Pos logit contributions
mine+1.524
al+1.418
talk+1.350
als+1.313
of+1.306
Neg logit contributions
 glad-2.142
 lucky-1.850
 important-1.788
 pleased-1.735
 supposed-1.734
Token laughed
Token ID13818
Feature activation+2.36e-01

Pos logit contributions
 glad+0.416
 lucky+0.354
 important+0.338
 supposed+0.333
 pleased+0.326
Neg logit contributions
mine-0.354
al-0.332
talk-0.313
ted-0.309
als-0.308
Token so
Token ID523
Feature activation-4.39e-01

Pos logit contributions
 glad+0.557
 lucky+0.473
 supposed+0.456
 important+0.452
 pleased+0.442
Neg logit contributions
mine-0.435
al-0.408
ted-0.383
als-0.382
talk-0.377
Token hard
Token ID1327
Feature activation-6.87e-01

Pos logit contributions
mine+1.047
al+0.997
talk+0.955
als+0.944
ted+0.943
Neg logit contributions
 glad-0.798
 lucky-0.649
 important-0.614
 supposed-0.601
 pleased-0.592
Token that
Token ID326
Feature activation-3.27e-01

Pos logit contributions
mine+1.344
al+1.264
talk+1.208
als+1.182
ted+1.174
Neg logit contributions
 glad-1.551
 lucky-1.320
 important-1.271
 supposed-1.233
 pleased-1.228
Token they
Token ID484
Feature activation-3.63e-01

Pos logit contributions
mine+0.525
al+0.487
talk+0.455
als+0.449
ted+0.446
Neg logit contributions
 glad-0.849
 lucky-0.737
 important-0.712
 supposed-0.698
 pleased-0.696
Token forgot
Token ID16453
Feature activation+9.97e-01

Pos logit contributions
mine+0.739
al+0.692
talk+0.660
als+0.651
ted+0.647
Neg logit contributions
 glad-0.796
 lucky-0.672
 important-0.645
 supposed-0.628
 pleased-0.624
Token about
Token ID546
Feature activation+7.23e-01

Pos logit contributions
 glad+2.192
 lucky+1.836
 pleased+1.728
 important+1.725
 supposed+1.718
Neg logit contributions
mine-1.950
al-1.829
ted-1.765
als-1.752
of-1.741
Token the
Token ID262
Feature activation+9.78e-01

Pos logit contributions
 glad+1.745
 lucky+1.508
 important+1.450
 pleased+1.430
 supposed+1.414
Neg logit contributions
mine-1.265
al-1.178
ted-1.093
x-1.083
of-1.082
Token trumpet
Token ID40334
Feature activation-4.65e-01

Pos logit contributions
 glad+2.294
 lucky+1.900
 pleased+1.839
 supposed+1.836
 important+1.816
Neg logit contributions
mine-1.789
al-1.706
talk-1.588
als-1.573
ted-1.566
Token.
Token ID13
Feature activation+5.36e-01

Pos logit contributions
mine+0.913
al+0.859
talk+0.818
als+0.805
ted+0.799
Neg logit contributions
 glad-1.041
 lucky-0.883
 important-0.848
 supposed-0.824
 pleased-0.822
Token
Token ID198
Feature activation-3.29e+00

Pos logit contributions
mine+0.923
al+0.878
talk+0.855
of+0.844
ted+0.833
Neg logit contributions
 glad-0.562
 lucky-0.447
 important-0.421
 supposed-0.408
 pleased-0.398
TokenL
Token ID43
Feature activation-1.73e+00

Pos logit contributions
mine+5.446
al+5.389
als+5.334
ted+5.262
x+4.717
Neg logit contributions
 glad-9.204
 lucky-7.884
 important-7.694
 pleased-7.598
 happy-7.482
Tokenily
Token ID813
Feature activation-1.40e+00

Pos logit contributions
mine+2.930
al+2.707
als+2.666
talk+2.644
ted+2.628
Neg logit contributions
 glad-4.380
 lucky-3.795
 important-3.694
 supposed-3.637
 pleased-3.508
Token said
Token ID531
Feature activation+3.22e-01

Pos logit contributions
mine+2.721
al+2.532
talk+2.432
x+2.395
als+2.387
Neg logit contributions
 glad-3.210
 lucky-2.733
 important-2.640
 pleased-2.563
 supposed-2.503
Token,
Token ID11
Feature activation+2.63e-01

Pos logit contributions
 glad+0.793
 lucky+0.686
 supposed+0.675
 important+0.672
 pleased+0.645
Neg logit contributions
mine-0.554
al-0.513
of-0.480
als-0.479
talk-0.477
Token "
Token ID366
Feature activation-2.47e+00

Pos logit contributions
 glad+0.445
 lucky+0.366
 important+0.358
 supposed+0.347
 pleased+0.325
Neg logit contributions
mine-0.652
al-0.619
als-0.592
ted-0.589
talk-0.582
TokenNo
Token ID2949
Feature activation-1.19e+00

Pos logit contributions
mine+3.802
al+3.509
x+3.258
talk+3.185
als+3.163
Neg logit contributions
 glad-6.955
 lucky-6.066
 important-5.810
 pleased-5.713
 happy-5.552
Token,
Token ID11
Feature activation-1.49e+00

Pos logit contributions
mine+2.428
al+2.302
talk+2.214
als+2.160
x+2.134
Neg logit contributions
 glad-2.674
 lucky-2.281
 important-2.202
 supposed-2.106
 pleased-2.081
Token Tom
Token ID4186
Feature activation+1.61e-01

Pos logit contributions
mine+3.032
talk+2.854
al+2.828
x+2.659
ted+2.655
Neg logit contributions
 glad-3.402
 lucky-2.918
 important-2.752
 pleased-2.637
 sad-2.607
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
 glad+0.384
 lucky+0.327
 important+0.314
 pleased+0.313
 supposed+0.308
Neg logit contributions
mine-0.290
al-0.277
talk-0.257
of-0.255
als-0.252
Token Stop
Token ID13707
Feature activation+7.54e-02

Pos logit contributions
mine+2.582
al+2.404
talk+2.346
of+2.278
ted+2.265
Neg logit contributions
 glad-2.516
 lucky-2.109
 important-1.987
 supposed-1.909
 pleased-1.898
Token go
Token ID467
Feature activation+6.80e-02

Pos logit contributions
mine+1.155
al+1.077
talk+1.022
als+0.996
ted+0.990
Neg logit contributions
 glad-1.682
 lucky-1.456
 important-1.406
 supposed-1.370
 pleased-1.366
Token,"
Token ID553
Feature activation-1.47e+00

Pos logit contributions
 glad+0.153
 lucky+0.129
 important+0.124
 pleased+0.122
 supposed+0.121
Neg logit contributions
mine-0.133
al-0.126
talk-0.119
als-0.118
of-0.117
Token she
Token ID673
Feature activation-1.33e+00

Pos logit contributions
mine+2.405
al+2.285
talk+2.148
als+2.092
x+2.090
Neg logit contributions
 glad-3.895
 lucky-3.384
 important-3.277
 pleased-3.203
 sad-3.115
Token said
Token ID531
Feature activation-3.61e-01

Pos logit contributions
mine+2.608
al+2.457
talk+2.340
x+2.302
als+2.288
Neg logit contributions
 glad-3.027
 lucky-2.595
 important-2.509
 pleased-2.451
 supposed-2.382
Token,
Token ID11
Feature activation+5.50e-01

Pos logit contributions
mine+0.708
al+0.662
talk+0.633
als+0.617
of+0.613
Neg logit contributions
 glad-0.810
 lucky-0.690
 important-0.665
 supposed-0.647
 pleased-0.639
Token pat
Token ID1458
Feature activation-2.53e+00

Pos logit contributions
 glad+1.143
 lucky+0.987
 important+0.973
 supposed+0.949
 pleased+0.879
Neg logit contributions
mine-1.138
al-1.057
talk-0.979
ted-0.978
als-0.971
Tokenting
Token ID889
Feature activation-2.87e+00

Pos logit contributions
mine+4.341
talk+4.311
al+4.111
of+4.064
als+3.964
Neg logit contributions
 glad-6.216
 lucky-5.294
 important-5.032
 pleased-4.991
 sad-4.991
Token the
Token ID262
Feature activation-6.64e-01

Pos logit contributions
mine+5.551
al+5.293
talk+5.277
of+5.073
x+5.063
Neg logit contributions
 glad-6.820
 lucky-5.777
 sad-5.563
 important-5.511
 pleased-5.390
Token little
Token ID1310
Feature activation-1.76e+00

Pos logit contributions
mine+1.176
al+1.098
talk+1.044
als+1.018
ted+1.012
Neg logit contributions
 glad-1.629
 lucky-1.404
 important-1.356
 supposed-1.320
 pleased-1.316
Token girl
Token ID2576
Feature activation+3.38e-02

Pos logit contributions
mine+3.139
talk+2.832
al+2.797
ted+2.727
x+2.717
Neg logit contributions
 glad-4.320
 lucky-3.819
 important-3.659
 pleased-3.515
 sad-3.501
Token's
Token ID338
Feature activation-2.91e-02

Pos logit contributions
 glad+0.078
 lucky+0.067
 important+0.065
 supposed+0.063
 pleased+0.062
Neg logit contributions
mine-0.063
al-0.059
talk-0.056
als-0.055
of-0.055
Token Flowers
Token ID30036
Feature activation+3.66e-01

Pos logit contributions
mine+2.858
al+2.663
talk+2.575
of+2.514
ted+2.511
Neg logit contributions
 glad-2.815
 lucky-2.355
 important-2.260
 supposed-2.147
 pleased-2.139
Token are
Token ID389
Feature activation+1.46e+00

Pos logit contributions
 glad+0.957
 lucky+0.827
 important+0.800
 pleased+0.796
 supposed+0.780
Neg logit contributions
mine-0.585
al-0.556
als-0.520
of-0.504
talk-0.504
Token boring
Token ID14262
Feature activation+6.25e-01

Pos logit contributions
 glad+2.663
 lucky+2.091
 pleased+2.014
 supposed+1.938
 important+1.890
Neg logit contributions
mine-3.299
al-3.253
als-3.105
x-3.026
of-3.007
Token."
Token ID526
Feature activation+8.55e-02

Pos logit contributions
 glad+1.385
 lucky+1.156
 pleased+1.109
 important+1.076
 supposed+1.056
Neg logit contributions
mine-1.247
al-1.204
als-1.143
talk-1.142
x-1.115
Token
Token ID198
Feature activation+1.22e+00

Pos logit contributions
 glad+0.143
 lucky+0.115
 important+0.111
 supposed+0.110
 pleased+0.104
Neg logit contributions
mine-0.215
al-0.204
talk-0.199
of-0.197
ted-0.195
Token
Token ID198
Feature activation-2.81e+00

Pos logit contributions
 glad+2.296
 lucky+1.937
 supposed+1.900
 important+1.843
 pleased+1.803
Neg logit contributions
mine-2.756
of-2.655
talk-2.571
al-2.560
ted-2.458
Token"
Token ID1
Feature activation-2.15e+00

Pos logit contributions
mine+4.361
al+3.919
ted+3.730
als+3.722
talk+3.422
Neg logit contributions
 glad-8.265
 lucky-7.269
 important-7.240
 pleased-6.892
 happy-6.785
TokenBut
Token ID1537
Feature activation-1.37e+00

Pos logit contributions
mine+3.294
al+3.065
talk+2.876
als+2.831
x+2.812
Neg logit contributions
 glad-5.921
 lucky-5.226
 important-5.158
 pleased-4.911
 supposed-4.835
Token flowers
Token ID12734
Feature activation-2.30e-01

Pos logit contributions
mine+2.831
talk+2.616
al+2.588
of+2.469
ted+2.464
Neg logit contributions
 glad-3.020
 lucky-2.588
 important-2.463
 pleased-2.326
 supposed-2.315
Token are
Token ID389
Feature activation+1.49e+00

Pos logit contributions
mine+0.351
al+0.325
talk+0.299
als+0.299
of+0.296
Neg logit contributions
 glad-0.613
 lucky-0.535
 important-0.518
 pleased-0.508
 supposed-0.508
Token pretty
Token ID2495
Feature activation+2.97e-01

Pos logit contributions
 glad+2.497
 lucky+1.882
 pleased+1.859
 supposed+1.736
 important+1.655
Neg logit contributions
mine-3.580
al-3.547
als-3.420
of-3.319
x-3.308
Token the
Token ID262
Feature activation+5.88e-01

Pos logit contributions
mine+3.578
al+3.554
als+3.435
talk+3.398
x+3.248
Neg logit contributions
 glad-6.484
 lucky-5.574
 sad-5.221
 tired-5.181
 happy-5.162
Token boy
Token ID2933
Feature activation-1.10e+00

Pos logit contributions
 glad+1.633
 supposed+1.416
 lucky+1.412
 important+1.411
 pleased+1.377
Neg logit contributions
mine-0.816
al-0.750
talk-0.704
als-0.700
ted-0.697
Token and
Token ID290
Feature activation-1.60e+00

Pos logit contributions
mine+1.943
al+1.822
talk+1.713
als+1.696
ted+1.668
Neg logit contributions
 glad-2.717
 lucky-2.333
 important-2.233
 supposed-2.183
 pleased-2.176
Token said
Token ID531
Feature activation+2.01e-01

Pos logit contributions
mine+3.342
al+3.132
talk+3.029
als+2.955
x+2.935
Neg logit contributions
 glad-3.540
 lucky-2.979
 important-2.787
 pleased-2.748
 tired-2.711
Token,
Token ID11
Feature activation+1.34e+00

Pos logit contributions
 glad+0.481
 lucky+0.416
 important+0.405
 supposed+0.405
 pleased+0.392
Neg logit contributions
mine-0.357
al-0.331
ted-0.313
talk-0.309
als-0.308
Token "
Token ID366
Feature activation-2.84e+00

Pos logit contributions
 glad+2.422
 important+2.110
 lucky+2.085
 supposed+2.055
 pleased+1.899
Neg logit contributions
mine-3.093
al-2.898
ted-2.797
ting-2.712
als-2.704
TokenGive
Token ID23318
Feature activation-2.12e+00

Pos logit contributions
mine+4.907
al+4.800
talk+4.617
als+4.580
x+4.478
Neg logit contributions
 glad-7.238
 lucky-6.195
 important-5.782
 pleased-5.762
 tired-5.629
Token me
Token ID502
Feature activation-1.56e+00

Pos logit contributions
mine+4.038
al+3.735
talk+3.647
als+3.538
x+3.533
Neg logit contributions
 glad-4.990
 lucky-4.395
 important-4.175
 supposed-4.060
 sad-3.901
Token back
Token ID736
Feature activation-1.17e+00

Pos logit contributions
mine+3.590
al+3.379
talk+3.279
als+3.194
x+3.178
Neg logit contributions
 glad-3.090
 lucky-2.599
 important-2.439
 supposed-2.324
 pleased-2.272
Token my
Token ID616
Feature activation+1.19e+00

Pos logit contributions
mine+2.612
al+2.453
talk+2.363
als+2.307
x+2.284
Neg logit contributions
 glad-2.343
 lucky-1.968
 important-1.872
 supposed-1.793
 pleased-1.761
Token tiger
Token ID26241
Feature activation+1.38e-01

Pos logit contributions
 glad+3.646
 supposed+3.193
 pleased+3.180
 lucky+3.120
 important+3.072
Neg logit contributions
mine-1.296
al-1.186
ted-1.095
x-1.079
of-1.076
Token with
Token ID351
Feature activation-9.17e-01

Pos logit contributions
mine+0.145
al+0.137
talk+0.129
als+0.125
ted+0.125
Neg logit contributions
 glad-0.198
 lucky-0.169
 important-0.162
 supposed-0.162
 pleased-0.160
Token other
Token ID584
Feature activation-8.94e-02

Pos logit contributions
mine+1.601
al+1.487
talk+1.428
als+1.380
of+1.369
Neg logit contributions
 glad-2.285
 lucky-1.981
 important-1.909
 pleased-1.843
 supposed-1.843
Token toys
Token ID14958
Feature activation-5.04e-01

Pos logit contributions
mine+0.150
al+0.141
talk+0.133
als+0.131
ted+0.129
Neg logit contributions
 glad-0.224
 lucky-0.191
 important-0.184
 supposed-0.184
 pleased-0.183
Token."
Token ID526
Feature activation-3.29e-01

Pos logit contributions
mine+1.062
al+1.004
talk+0.962
als+0.945
ted+0.938
Neg logit contributions
 glad-1.057
 lucky-0.886
 important-0.848
 supposed-0.823
 pleased-0.823
Token
Token ID198
Feature activation+8.85e-01

Pos logit contributions
mine+0.640
al+0.599
talk+0.578
ted+0.567
als+0.566
Neg logit contributions
 glad-0.741
 lucky-0.634
 important-0.612
 pleased-0.593
 supposed-0.593
Token
Token ID198
Feature activation-2.67e+00

Pos logit contributions
 glad+1.568
 lucky+1.295
 important+1.228
 pleased+1.212
 supposed+1.209
Neg logit contributions
mine-2.099
of-1.988
talk-1.967
al-1.962
ted-1.924
TokenTom
Token ID13787
Feature activation-1.14e+00

Pos logit contributions
mine+4.678
al+4.455
als+4.083
x+3.954
ted+3.917
Neg logit contributions
 glad-7.176
 lucky-6.292
 important-6.236
 supposed-5.883
 pleased-5.831
Token listened
Token ID16399
Feature activation+4.37e-01

Pos logit contributions
mine+2.157
al+2.012
talk+1.919
als+1.882
ted+1.865
Neg logit contributions
 glad-2.667
 lucky-2.291
 important-2.214
 pleased-2.135
 supposed-2.128
Token to
Token ID284
Feature activation-1.55e+00

Pos logit contributions
 glad+0.975
 lucky+0.826
 supposed+0.782
 important+0.766
 pleased+0.761
Neg logit contributions
mine-0.852
al-0.788
talk-0.782
of-0.762
ted-0.761
Token Sue
Token ID26089
Feature activation-8.96e-01

Pos logit contributions
mine+2.936
al+2.793
talk+2.661
als+2.658
x+2.580
Neg logit contributions
 glad-3.713
 lucky-3.165
 important-3.039
 supposed-2.925
 pleased-2.909
Token.
Token ID13
Feature activation-4.68e-01

Pos logit contributions
mine+1.706
al+1.604
talk+1.533
als+1.501
ted+1.490
Neg logit contributions
 glad-2.070
 lucky-1.770
 important-1.706
 supposed-1.653
 pleased-1.645