TOP ACTIVATIONS
MAX = 4.469

The mom my heard her daughter scream and came running .
raven wanted to help the girl . He tried to talk
He soon heard the girl 's voice . She said
didn 't mind helping the bird . They walked and walked
's wrong ?" asked the bear . The
car ?" His mom heard him and came to help .
the truck heard the little girl 's voice . He looked
felt better and went back to the barn to play with
The farmer heard the cow and came to help .
kind farmer came to the pig and said , " Don
little boy named Tim heard Kitty complain . He came to
hugged each other and went to play somewhere else . <|endoftext|>
when he heard his friend Lucy crying . "
ily felt better and went to play with her other friends
sad . Max went to Lucy and asked , " Why
you going ?" asked the bear . " I
how the farmer helped the cow . They knew he was
crying ?" asked the rich man . The
tears . Tim went to Sue and asked , " Why
The dog heard the man and bark ed . It

MIN ACTIVATIONS
MIN = -5.588

at the starting line . " Ready , set , go
sign that said " RE VERS E ". L ila and
and John chased Jenny . " You can 't catch me
reached its top speed ! " Whe e , this is
" ¡ M ira mi mu ñ
and look around . " Wow , we are
. The vet showed Wh isk ers the x - ray
half for himself . They both take a big
and the flowers . " You are a great
it on the castle . " There , now it 's
. Whe e ! " Wow , Tom ,
like a clown . They ran around the room
in laughter and said , ' These new clothes look incredible
in the desert . " Wow , look at
does . " They kept walking until Jack
to the park . At the park , they
and a grape . But Tom wanted more .
and won the race . " Great job , Tim my
on the surface . " Wow , it is
fast and careful . At the hospital , Lily

INTERVAL 1.955 - 4.469
CONTAINS 2.474%

. He is not going to hurt us ."
. They apologized to Mr . Brown and promised to never
Then , they sneak behind her and tick le her .
explore the world . <|endoftext|> Once upon a time , there
of the day looking up at the cloud ' Fl uffy

INTERVAL -0.560 - 1.955
CONTAINS 71.313%

smile . They are not anxious anymore . They are brave
Hello , gir affe ," Anna said . " You are
, stop !" Ben says . " The car is too
had an idea . He asked the girl if they could
his snacks with his friends . One day , Tim found

INTERVAL -3.074 - -0.560
CONTAINS 24.719%

" Hello there ! What brings you here ?" asked the
a safe place to put it . He saw
Mama Bird ! " M ama Bird
, Lily . You were a good customer . You have
laugh . They feel the water on their feet and hands

INTERVAL -5.588 - -3.074
CONTAINS 1.494%

a dragon . " Hey , that 's not
wizard 's work . The vine thanked the wizard
he said . M ummy smiled and said
for the cat . The pretty mouse made it
Dad said to her , " You know Ab ig ail
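
The four INTERVAL ranges above look like equal-width bins spanning MIN = -5.588 to MAX = 4.469 (each about 2.514 wide), and the four CONTAINS percentages sum to 100%. A minimal Python sketch of that binning follows. It assumes the edges are equally spaced, and the function name `equal_width_intervals` is hypothetical, not taken from the tool that produced this output.

```python
def equal_width_intervals(lo, hi, n_bins):
    """Split [lo, hi] into n_bins equal-width (low, high) intervals."""
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins + 1)]
    edges[-1] = hi  # pin the last edge so rounding error cannot shift it
    return list(zip(edges[:-1], edges[1:]))


# MIN and MAX as reported in this dump.
intervals = equal_width_intervals(-5.588, 4.469, 4)

# The dump lists intervals from highest to lowest, so print in that order.
for low, high in reversed(intervals):
    print(f"INTERVAL {low:.3f} - {high:.3f}")

# CONTAINS percentages as reported, highest interval first.
contains = [2.474, 71.313, 24.719, 1.494]
total = sum(contains)
```

The computed edges fall within rounding distance of the reported ones (-3.074, -0.560, 1.955), which supports the equal-width reading. It does not confirm how the original tool chose its bins.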
TokenThe
Token ID464
Feature activation+4.55e-01

Pos logit contributions
P+0.068
sc+0.066
ww+0.066
bly+0.064
lish+0.063
Neg logit contributions
 cry-0.117
 tell-0.113
 call-0.111
 smile-0.111
 crying-0.110
Token mom
Token ID1995
Feature activation+3.36e-01

Pos logit contributions
 smile+0.009
 cry+0.009
 tell+0.008
 stomach+0.007
 gaze+0.007
Neg logit contributions
P-0.035
Now-0.033
ww-0.033
bly-0.032
lish-0.031
Tokenmy
Token ID1820
Feature activation-3.50e-02

Pos logit contributions
 smile+0.012
 cry+0.012
 bed+0.012
 stomach+0.012
 tell+0.012
Neg logit contributions
P-0.021
Now-0.019
ww-0.019
bly-0.018
lish-0.018
Token heard
Token ID2982
Feature activation+3.29e-01

Pos logit contributions
P+0.002
Now+0.002
ww+0.002
bly+0.002
lish+0.002
Neg logit contributions
 smile-0.001
 cry-0.001
 tell-0.001
 bed-0.001
 stomach-0.001
Token her
Token ID607
Feature activation+2.65e+00

Pos logit contributions
 smile+0.011
 cry+0.011
 tell+0.011
 stomach+0.010
 call+0.010
Neg logit contributions
P-0.021
Now-0.020
ww-0.019
bly-0.019
lish-0.018
Token daughter
Token ID4957
Feature activation+4.47e+00

Pos logit contributions
 stomach+0.097
 bed+0.090
 tum+0.086
 collection+0.083
 throat+0.082
Neg logit contributions
Just-0.117
lish-0.116
ww-0.112
ing-0.110
bly-0.109
Token scream
Token ID8196
Feature activation+8.35e-01

Pos logit contributions
ses+0.139
 fault+0.129
sed+0.126
 stomach+0.122
 throat+0.119
Neg logit contributions
it-0.204
aming-0.201
lish-0.199
sc-0.195
bly-0.194
Token and
Token ID290
Feature activation+1.36e+00

Pos logit contributions
 smile+0.029
 tell+0.029
 cry+0.028
 stomach+0.028
 gaze+0.027
Neg logit contributions
P-0.048
lish-0.046
Now-0.046
bly-0.046
ww-0.045
Token came
Token ID1625
Feature activation+7.79e-01

Pos logit contributions
 stomach+0.058
 bed+0.057
 fault+0.057
 smile+0.053
 question+0.052
Neg logit contributions
P-0.069
ww-0.061
bly-0.061
Now-0.060
ha-0.059
Token running
Token ID2491
Feature activation-5.10e-01

Pos logit contributions
 stomach+0.034
 smile+0.033
 cry+0.033
 tell+0.032
 tum+0.031
Neg logit contributions
P-0.042
Now-0.040
Spl-0.037
bly-0.037
Soon-0.036
Token.
Token ID13
Feature activation+7.00e-01

Pos logit contributions
P+0.032
Now+0.030
ww+0.029
bly+0.028
lish+0.027
Neg logit contributions
 smile-0.020
 cry-0.019
 tell-0.019
 call-0.018
 bed-0.018
Token raven
Token ID37735
Feature activation+6.88e-01

Pos logit contributions
P+0.019
Now+0.018
ww+0.018
bly+0.017
lish+0.017
Neg logit contributions
 smile-0.014
 cry-0.014
 tell-0.014
 call-0.013
 bed-0.013
Token wanted
Token ID2227
Feature activation+1.69e+00

Pos logit contributions
 smile+0.028
 bed+0.028
 cry+0.027
 tell+0.026
 stomach+0.025
Neg logit contributions
P-0.041
ww-0.037
Now-0.036
lish-0.034
ha-0.034
Token to
Token ID284
Feature activation+2.64e+00

Pos logit contributions
 smile+0.076
 cry+0.076
 gaze+0.073
sing+0.073
 grin+0.072
Neg logit contributions
P-0.078
A-0.070
Now-0.068
B-0.064
Soon-0.064
Token help
Token ID1037
Feature activation+2.22e+00

Pos logit contributions
sing+0.089
 eyes+0.087
 fault+0.086
 weeping+0.086
ses+0.085
Neg logit contributions
P-0.149
Now-0.136
A-0.130
O-0.129
Hey-0.128
Token the
Token ID262
Feature activation+1.72e-02

Pos logit contributions
ses+0.106
 fault+0.105
sing+0.101
sed+0.101
 gaze+0.097
Neg logit contributions
P-0.092
Now-0.082
lish-0.080
L-0.079
Hey-0.078
Token girl
Token ID2576
Feature activation+4.46e+00

Pos logit contributions
 smile+0.001
 cry+0.001
 tell+0.001
 call+0.001
 bed+0.001
Neg logit contributions
P-0.001
Now-0.001
ww-0.001
bly-0.001
lish-0.001
Token.
Token ID13
Feature activation+2.40e-01

Pos logit contributions
ses+0.224
sed+0.223
sing+0.178
 beard+0.172
ric+0.172
Neg logit contributions
sc-0.141
"-0.122
P-0.117
hest-0.117
it-0.116
Token He
Token ID679
Feature activation+2.21e-01

Pos logit contributions
 cry+0.009
 smile+0.009
 tell+0.009
 bed+0.008
 crying+0.008
Neg logit contributions
P-0.015
Now-0.014
ww-0.014
lish-0.013
Soon-0.013
Token tried
Token ID3088
Feature activation+2.04e+00

Pos logit contributions
 smile+0.008
 cry+0.008
 bed+0.008
 tell+0.008
 stomach+0.008
Neg logit contributions
P-0.014
Now-0.013
ww-0.013
bly-0.012
lish-0.012
Token to
Token ID284
Feature activation+1.39e+00

Pos logit contributions
 gaze+0.096
 stomach+0.095
 stare+0.094
 comment+0.093
 call+0.093
Neg logit contributions
B-0.073
P-0.070
Soon-0.065
sc-0.064
Now-0.064
Token talk
Token ID1561
Feature activation+3.98e-02

Pos logit contributions
 stomach+0.040
 eyes+0.036
 tum+0.036
 fault+0.036
 mouth+0.035
Neg logit contributions
P-0.092
Now-0.084
bly-0.083
A-0.078
sc-0.078
Token
Token ID198
Feature activation-3.28e+00

Pos logit contributions
 smile+0.013
 cry+0.013
 tell+0.013
 call+0.012
 stomach+0.012
Neg logit contributions
P-0.030
Now-0.029
ww-0.028
Soon-0.027
Spl-0.026
TokenHe
Token ID1544
Feature activation+2.24e-01

Pos logit contributions
iest+0.137
ww+0.132
bly+0.132
w+0.128
ier+0.127
Neg logit contributions
 tell-0.167
 cry-0.165
 know-0.161
 find-0.158
 call-0.156
Token soon
Token ID2582
Feature activation+4.08e-01

Pos logit contributions
 smile+0.008
 cry+0.008
 tell+0.008
 bed+0.008
 crying+0.008
Neg logit contributions
P-0.014
Now-0.013
ww-0.012
bly-0.012
Spl-0.012
Token heard
Token ID2982
Feature activation+2.90e-01

Pos logit contributions
 cry+0.018
 smile+0.017
 tell+0.017
 call+0.017
 question+0.017
Neg logit contributions
P-0.023
Now-0.021
ww-0.020
Spl-0.019
lish-0.019
Token the
Token ID262
Feature activation+6.76e-01

Pos logit contributions
 smile+0.012
 cry+0.012
 tell+0.012
 call+0.011
 bed+0.011
Neg logit contributions
P-0.016
Now-0.015
ww-0.015
bly-0.014
lish-0.014
Token girl
Token ID2576
Feature activation+4.38e+00

Pos logit contributions
 tell+0.030
 smile+0.029
 cry+0.029
 bed+0.028
 call+0.028
Neg logit contributions
P-0.035
Now-0.033
ww-0.032
lish-0.032
bly-0.029
Token's
Token ID338
Feature activation+3.05e+00

Pos logit contributions
 fault+0.188
ses+0.186
sed+0.173
 stomach+0.173
 throat+0.165
Neg logit contributions
lish-0.156
aming-0.152
it-0.150
sc-0.150
med-0.140
Token voice
Token ID3809
Feature activation+1.98e+00

Pos logit contributions
 tell+0.092
ric+0.086
 +0.084
 stare+0.083
 haven+0.083
Neg logit contributions
lish-0.137
P-0.118
 brightly-0.117
ww-0.117
Days-0.116
Token.
Token ID13
Feature activation+4.97e-01

Pos logit contributions
 stomach+0.081
 bed+0.078
 throat+0.077
 smile+0.075
ses+0.074
Neg logit contributions
lish-0.092
Now-0.090
bly-0.087
sc-0.086
P-0.086
Token She
Token ID1375
Feature activation-2.79e-02

Pos logit contributions
 cry+0.018
 smile+0.018
 tell+0.018
 bed+0.017
 call+0.016
Neg logit contributions
P-0.030
Now-0.029
ww-0.028
lish-0.026
Soon-0.026
Token said
Token ID531
Feature activation-6.79e-01

Pos logit contributions
P+0.002
Now+0.002
ww+0.002
bly+0.002
lish+0.002
Neg logit contributions
 cry-0.001
 smile-0.001
 bed-0.001
 tell-0.001
 stomach-0.001
Token didn
Token ID1422
Feature activation-4.27e-01

Pos logit contributions
 bed+0.034
 cry+0.034
 smile+0.033
 tell+0.032
 call+0.031
Neg logit contributions
P-0.052
Now-0.047
ww-0.046
lish-0.045
Spl-0.043
Token't
Token ID470
Feature activation+1.18e+00

Pos logit contributions
P+0.024
Now+0.022
ww+0.022
bly+0.021
lish+0.021
Neg logit contributions
 smile-0.019
 cry-0.018
 tell-0.018
 call-0.017
 bed-0.017
Token mind
Token ID2000
Feature activation+1.25e+00

Pos logit contributions
 bed+0.030
 stomach+0.029
 eyes+0.028
 mouth+0.028
 crying+0.027
Neg logit contributions
P-0.081
ww-0.075
Now-0.075
ha-0.072
lish-0.072
Token helping
Token ID5742
Feature activation+1.67e+00

Pos logit contributions
 tell+0.042
 gaze+0.041
 smile+0.041
 call+0.040
 search+0.040
Neg logit contributions
P-0.071
lish-0.069
bly-0.067
Now-0.066
ww-0.066
Token the
Token ID262
Feature activation+1.18e+00

Pos logit contributions
 gaze+0.073
 cry+0.072
 smile+0.072
 weeping+0.069
 tell+0.069
Neg logit contributions
P-0.073
Now-0.067
lish-0.065
ww-0.063
A-0.063
Token bird
Token ID6512
Feature activation+4.23e+00

Pos logit contributions
 smile+0.057
 tell+0.057
 cry+0.055
 call+0.054
 gaze+0.054
Neg logit contributions
P-0.054
ww-0.051
Now-0.049
A-0.044
lish-0.044
Token.
Token ID13
Feature activation+2.56e-01

Pos logit contributions
ses+0.200
sed+0.196
 grin+0.170
 eyes+0.170
 horror+0.169
Neg logit contributions
sc-0.159
P-0.135
hest-0.132
it-0.131
"-0.128
Token They
Token ID1119
Feature activation+1.59e+00

Pos logit contributions
 cry+0.009
 smile+0.009
 tell+0.009
 bed+0.008
 call+0.008
Neg logit contributions
P-0.016
Now-0.015
ww-0.015
Soon-0.014
Spl-0.014
Token walked
Token ID6807
Feature activation-5.62e-01

Pos logit contributions
 bed+0.069
 cry+0.069
 stomach+0.065
 crying+0.065
 smile+0.065
Neg logit contributions
P-0.083
Now-0.072
bly-0.071
sc-0.068
lish-0.067
Token and
Token ID290
Feature activation+1.16e+00

Pos logit contributions
P+0.034
Now+0.032
ww+0.032
bly+0.030
lish+0.029
Neg logit contributions
 smile-0.023
 cry-0.022
 tell-0.022
 call-0.021
 bed-0.021
Token walked
Token ID6807
Feature activation-3.58e-01

Pos logit contributions
 cry+0.048
 smile+0.046
 stomach+0.046
 question+0.045
 tell+0.043
Neg logit contributions
P-0.068
Now-0.060
Spl-0.057
ww-0.057
sc-0.057
Token's
Token ID338
Feature activation-2.04e-01

Pos logit contributions
 smile+0.017
 cry+0.016
 bed+0.016
 tell+0.016
 call+0.016
Neg logit contributions
P-0.024
ww-0.022
Now-0.021
bly-0.020
Spl-0.020
Token wrong
Token ID2642
Feature activation-1.41e-01

Pos logit contributions
P+0.011
Now+0.010
ww+0.010
bly+0.009
lish+0.009
Neg logit contributions
 smile-0.010
 cry-0.009
 tell-0.009
 call-0.009
 bed-0.009
Token?"
Token ID1701
Feature activation+9.49e-01

Pos logit contributions
P+0.009
ww+0.008
Now+0.008
bly+0.008
lish+0.008
Neg logit contributions
 smile-0.005
 cry-0.005
 tell-0.005
 call-0.005
 bed-0.005
Token asked
Token ID1965
Feature activation+6.66e-01

Pos logit contributions
 smile+0.034
 cry+0.032
 stomach+0.032
 bed+0.031
 weeping+0.031
Neg logit contributions
P-0.058
Now-0.054
ww-0.053
Spl-0.051
bly-0.051
Token the
Token ID262
Feature activation+6.92e-01

Pos logit contributions
 cry+0.026
 smile+0.026
 tell+0.025
 weeping+0.024
 bed+0.024
Neg logit contributions
P-0.040
Now-0.036
ww-0.036
bly-0.035
lish-0.034
Token bear
Token ID6842
Feature activation+4.19e+00

Pos logit contributions
 cry+0.025
 smile+0.024
 tell+0.024
 gaze+0.023
 bed+0.022
Neg logit contributions
P-0.044
Now-0.039
lish-0.039
ww-0.038
bly-0.038
Token.
Token ID13
Feature activation+1.43e+00

Pos logit contributions
ses+0.191
 bed+0.179
 fault+0.177
sed+0.171
 imagination+0.168
Neg logit contributions
it-0.178
P-0.174
lish-0.169
bly-0.164
Just-0.163
Token
Token ID220
Feature activation+2.17e+00

Pos logit contributions
 cry+0.058
 smile+0.058
 tell+0.055
 weep+0.053
 bed+0.053
Neg logit contributions
P-0.083
Now-0.076
A-0.073
Spl-0.073
O-0.072
Token
Token ID198
Feature activation-2.00e-01

Pos logit contributions
 smile+0.093
 cry+0.089
 gaze+0.087
 weep+0.086
 tell+0.085
Neg logit contributions
P-0.123
Now-0.116
Soon-0.112
it-0.108
CR-0.106
Token
Token ID198
Feature activation-2.60e+00

Pos logit contributions
P+0.015
Now+0.014
ww+0.014
bly+0.013
Soon+0.013
Neg logit contributions
 smile-0.005
 cry-0.005
 tell-0.005
 call-0.005
 bed-0.005
TokenThe
Token ID464
Feature activation+9.85e-02

Pos logit contributions
bly+0.098
ww+0.097
lish+0.095
P+0.094
Now+0.092
Neg logit contributions
 call-0.141
 tell-0.141
 smile-0.139
 question-0.138
 cry-0.135
Token car
Token ID1097
Feature activation+1.15e-01

Pos logit contributions
 cry+0.037
 tell+0.037
 smile+0.036
 weep+0.034
 question+0.034
Neg logit contributions
P-0.042
ww-0.040
Now-0.039
bly-0.037
sc-0.036
Token?"
Token ID1701
Feature activation+1.86e+00

Pos logit contributions
 cry+0.004
 smile+0.004
 tell+0.004
 bed+0.004
 call+0.004
Neg logit contributions
P-0.007
Now-0.006
ww-0.006
bly-0.006
lish-0.006
Token His
Token ID2399
Feature activation+1.93e+00

Pos logit contributions
 smile+0.068
 cry+0.066
 tell+0.063
 die+0.062
 question+0.061
Neg logit contributions
P-0.106
Now-0.103
Soon-0.097
Just-0.096
A-0.095
Token mom
Token ID1995
Feature activation+1.40e+00

Pos logit contributions
 stare+0.057
 tell+0.054
 gaze+0.054
 know+0.053
 cry+0.051
Neg logit contributions
P-0.113
ww-0.102
Now-0.101
lish-0.100
A-0.099
Token heard
Token ID2982
Feature activation+5.84e-01

Pos logit contributions
 bed+0.058
 stomach+0.057
 fault+0.055
 cry+0.055
ses+0.053
Neg logit contributions
P-0.074
Now-0.067
bly-0.067
lish-0.066
ha-0.063
Token him
Token ID683
Feature activation+4.14e+00

Pos logit contributions
 smile+0.020
 cry+0.020
 tell+0.019
 question+0.019
 stomach+0.019
Neg logit contributions
P-0.035
Now-0.035
ww-0.033
bly-0.032
Hey-0.031
Token and
Token ID290
Feature activation+2.37e-01

Pos logit contributions
 tum+0.205
 stomach+0.200
ses+0.200
sed+0.199
 fault+0.199
Neg logit contributions
it-0.116
lish-0.113
sc-0.109
ing-0.109
aming-0.109
Token came
Token ID1625
Feature activation+1.04e+00

Pos logit contributions
 cry+0.009
 smile+0.009
 bed+0.008
 tell+0.008
 stomach+0.008
Neg logit contributions
P-0.015
Now-0.014
ww-0.013
bly-0.013
lish-0.013
Token to
Token ID284
Feature activation+1.48e+00

Pos logit contributions
 stomach+0.045
 cry+0.044
 smile+0.043
 tum+0.042
 tell+0.041
Neg logit contributions
P-0.055
Now-0.052
Spl-0.049
bly-0.048
Soon-0.048
Token help
Token ID1037
Feature activation+1.41e+00

Pos logit contributions
 stomach+0.046
 mouth+0.041
 eyes+0.041
 throat+0.040
 tum+0.040
Neg logit contributions
P-0.098
Now-0.094
Hey-0.090
Good-0.089
O-0.087
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
 stomach+0.053
 smile+0.052
 question+0.051
 throat+0.050
ses+0.050
Neg logit contributions
P-0.076
Now-0.072
bly-0.070
lish-0.069
CR-0.068
Token the
Token ID262
Feature activation-2.15e-01

Pos logit contributions
P+0.004
Now+0.004
ww+0.004
bly+0.004
lish+0.004
Neg logit contributions
 cry-0.002
 smile-0.002
 tell-0.002
 call-0.002
 bed-0.002
Token truck
Token ID7779
Feature activation+1.28e+00

Pos logit contributions
P+0.013
Now+0.013
ww+0.012
bly+0.012
lish+0.012
Neg logit contributions
 smile-0.008
 cry-0.008
 tell-0.008
 call-0.007
 bed-0.007
Token heard
Token ID2982
Feature activation+5.19e-01

Pos logit contributions
 cry+0.053
 bed+0.052
 smile+0.051
 weeping+0.051
 tell+0.050
Neg logit contributions
P-0.065
Now-0.064
sc-0.059
Spl-0.058
Soon-0.057
Token the
Token ID262
Feature activation+8.44e-01

Pos logit contributions
 smile+0.019
 cry+0.019
 tell+0.019
 stomach+0.018
 gaze+0.018
Neg logit contributions
P-0.030
Now-0.029
ww-0.028
bly-0.027
lish-0.027
Token little
Token ID1310
Feature activation-4.20e-01

Pos logit contributions
 tell+0.034
 smile+0.034
 cry+0.032
 stomach+0.032
 gaze+0.032
Neg logit contributions
P-0.044
Now-0.043
ww-0.042
lish-0.041
sc-0.040
Token girl
Token ID2576
Feature activation+4.14e+00

Pos logit contributions
P+0.023
Now+0.021
ww+0.021
bly+0.020
lish+0.020
Neg logit contributions
 smile-0.019
 cry-0.019
 tell-0.018
 call-0.018
 bed-0.018
Token's
Token ID338
Feature activation+2.65e+00

Pos logit contributions
ses+0.170
 fault+0.169
 stomach+0.164
 imagination+0.163
sed+0.163
Neg logit contributions
lish-0.149
sc-0.147
it-0.146
aw-0.143
aming-0.140
Token voice
Token ID3809
Feature activation+1.52e+00

Pos logit contributions
 haven+0.093
 die+0.093
oppy+0.092
 bed+0.092
 stomach+0.091
Neg logit contributions
lish-0.102
Now-0.096
Just-0.095
aming-0.093
Soon-0.092
Token.
Token ID13
Feature activation+8.30e-01

Pos logit contributions
 stomach+0.052
 bed+0.051
 smile+0.050
 gaze+0.050
 cry+0.049
Neg logit contributions
lish-0.082
Now-0.080
P-0.079
bly-0.077
sc-0.075
Token He
Token ID679
Feature activation-5.74e-02

Pos logit contributions
 smile+0.037
 cry+0.037
 tell+0.037
 bed+0.035
 stomach+0.035
Neg logit contributions
Now-0.040
P-0.040
ww-0.037
Spl-0.036
Hey-0.036
Token looked
Token ID3114
Feature activation+2.94e-01

Pos logit contributions
P+0.004
Now+0.003
ww+0.003
bly+0.003
lish+0.003
Neg logit contributions
 cry-0.002
 smile-0.002
 tell-0.002
 bed-0.002
 stomach-0.002
Token felt
Token ID2936
Feature activation+2.39e-01

Pos logit contributions
 cry+0.012
 bed+0.012
 smile+0.012
 tell+0.012
 call+0.011
Neg logit contributions
P-0.020
Now-0.019
ww-0.018
lish-0.017
bly-0.017
Token better
Token ID1365
Feature activation+6.83e-01

Pos logit contributions
 smile+0.009
 bed+0.008
 tell+0.008
 cry+0.008
 call+0.008
Neg logit contributions
P-0.015
Now-0.014
ww-0.013
bly-0.013
lish-0.013
Token and
Token ID290
Feature activation+9.17e-01

Pos logit contributions
 cry+0.028
 smile+0.027
 question+0.027
 call+0.027
 bed+0.026
Neg logit contributions
P-0.039
Now-0.037
ww-0.034
lish-0.033
bly-0.033
Token went
Token ID1816
Feature activation+9.54e-01

Pos logit contributions
 question+0.038
 cry+0.038
 smile+0.037
 bed+0.036
 call+0.036
Neg logit contributions
P-0.049
Now-0.047
ww-0.045
Soon-0.043
bly-0.042
Token back
Token ID736
Feature activation+2.22e+00

Pos logit contributions
 cry+0.037
 smile+0.034
 stomach+0.033
 question+0.033
 weep+0.032
Neg logit contributions
P-0.059
Now-0.053
Spl-0.049
bly-0.049
ww-0.049
Token to
Token ID284
Feature activation+4.14e+00

Pos logit contributions
 stomach+0.109
 cry+0.109
 eyes+0.106
 smile+0.104
 horror+0.103
Neg logit contributions
P-0.101
Now-0.086
B-0.082
Spl-0.079
CR-0.077
Token the
Token ID262
Feature activation-1.39e-01

Pos logit contributions
 eye+0.182
 question+0.177
engu+0.176
ses+0.176
 eyes+0.175
Neg logit contributions
Now-0.118
P-0.101
Good-0.095
But-0.093
 late-0.090
Token barn
Token ID25203
Feature activation+3.30e+00

Pos logit contributions
P+0.009
Now+0.009
ww+0.008
bly+0.008
lish+0.008
Neg logit contributions
 smile-0.005
 cry-0.005
 tell-0.005
 call-0.004
 bed-0.004
Token to
Token ID284
Feature activation+2.39e+00

Pos logit contributions
 cry+0.144
 question+0.137
 stare+0.130
 smile+0.129
 frown+0.127
Neg logit contributions
Now-0.141
sc-0.132
P-0.132
it-0.128
Soon-0.126
Token play
Token ID711
Feature activation-5.06e-01

Pos logit contributions
 fault+0.077
 eyes+0.076
 cries+0.073
ses+0.073
sed+0.072
Neg logit contributions
Now-0.135
P-0.131
bly-0.122
Good-0.118
A-0.117
Token with
Token ID351
Feature activation+1.01e+00

Pos logit contributions
P+0.029
Now+0.027
ww+0.027
bly+0.026
lish+0.025
Neg logit contributions
 smile-0.021
 cry-0.021
 tell-0.021
 call-0.020
 bed-0.020
Token
Token ID198
Feature activation-2.21e+00

Pos logit contributions
 smile+0.078
 cry+0.073
 stomach+0.073
 eyes+0.071
 throat+0.071
Neg logit contributions
P-0.109
Now-0.105
Soon-0.101
A-0.095
ww-0.094
TokenThe
Token ID464
Feature activation-7.04e-01

Pos logit contributions
ww+0.069
bly+0.069
sc+0.069
P+0.067
iest+0.065
Neg logit contributions
 cry-0.138
 tell-0.132
 crying-0.130
 call-0.130
 Her-0.129
Token farmer
Token ID18739
Feature activation-1.41e-02

Pos logit contributions
P+0.043
Now+0.040
ww+0.040
bly+0.039
lish+0.037
Neg logit contributions
 smile-0.027
 tell-0.027
 cry-0.027
 call-0.026
 bed-0.025
Token heard
Token ID2982
Feature activation+6.22e-01

Pos logit contributions
P+0.001
Now+0.001
ww+0.001
bly+0.001
lish+0.001
Neg logit contributions
 smile-0.001
 cry-0.001
 tell-0.001
 bed-0.001
 call-0.000
Token the
Token ID262
Feature activation+3.33e-01

Pos logit contributions
 smile+0.023
 cry+0.022
 tell+0.021
 stomach+0.021
 gaze+0.021
Neg logit contributions
P-0.036
Now-0.035
ww-0.034
bly-0.032
Soon-0.032
Token cow
Token ID9875
Feature activation+4.11e+00

Pos logit contributions
 smile+0.014
 cry+0.014
 tell+0.013
 bed+0.013
 stomach+0.013
Neg logit contributions
P-0.019
Now-0.018
ww-0.018
Soon-0.016
lish-0.016
Token and
Token ID290
Feature activation+2.17e-01

Pos logit contributions
 stomach+0.141
 throat+0.138
ses+0.137
 imagination+0.137
 fault+0.136
Neg logit contributions
Now-0.168
sc-0.166
lish-0.161
it-0.161
bly-0.161
Token came
Token ID1625
Feature activation+8.86e-01

Pos logit contributions
 smile+0.008
 cry+0.008
 bed+0.008
 tell+0.008
 stomach+0.008
Neg logit contributions
P-0.013
Now-0.012
ww-0.012
bly-0.011
lish-0.011
Token to
Token ID284
Feature activation+1.29e+00

Pos logit contributions
 stomach+0.036
 smile+0.035
 cry+0.034
 bed+0.033
 tum+0.033
Neg logit contributions
P-0.050
Now-0.047
Spl-0.045
Soon-0.044
sc-0.044
Token help
Token ID1037
Feature activation+1.22e+00

Pos logit contributions
 stomach+0.037
 eyes+0.034
 mouth+0.033
 throat+0.032
 smile+0.032
Neg logit contributions
P-0.087
Now-0.085
Good-0.080
Hey-0.079
O-0.077
Token.
Token ID13
Feature activation+4.28e-01

Pos logit contributions
 smile+0.043
 stomach+0.042
 eyes+0.040
 tell+0.039
 question+0.039
Neg logit contributions
P-0.071
Now-0.068
lish-0.065
bly-0.064
CR-0.064
Token kind
Token ID1611
Feature activation+7.86e-01

Pos logit contributions
P+0.014
ww+0.013
Now+0.013
bly+0.012
lish+0.012
Neg logit contributions
 cry-0.008
 smile-0.008
 tell-0.008
 call-0.008
 bed-0.008
Token farmer
Token ID18739
Feature activation+1.85e-01

Pos logit contributions
 cry+0.034
 tell+0.033
 smile+0.032
 gaze+0.031
 bed+0.031
Neg logit contributions
P-0.043
ww-0.041
Now-0.039
lish-0.036
bly-0.036
Token came
Token ID1625
Feature activation+3.88e-01

Pos logit contributions
 cry+0.007
 smile+0.007
 bed+0.007
 tell+0.006
 stomach+0.006
Neg logit contributions
P-0.012
Now-0.011
ww-0.011
bly-0.010
Spl-0.010
Token to
Token ID284
Feature activation+1.50e+00

Pos logit contributions
 smile+0.014
 cry+0.014
 stomach+0.013
 tell+0.013
 bed+0.013
Neg logit contributions
P-0.025
Now-0.024
ww-0.022
bly-0.022
sc-0.021
Token the
Token ID262
Feature activation-4.70e-02

Pos logit contributions
 stomach+0.047
 cry+0.047
 smile+0.046
 eyes+0.046
 weeping+0.046
Neg logit contributions
P-0.094
Now-0.089
Hey-0.082
A-0.082
Soon-0.081
Token pig
Token ID12967
Feature activation+4.11e+00

Pos logit contributions
P+0.003
Now+0.003
ww+0.003
bly+0.003
lish+0.003
Neg logit contributions
 smile-0.002
 cry-0.002
 tell-0.002
 call-0.001
 bed-0.001
Token and
Token ID290
Feature activation+4.20e-01

Pos logit contributions
 cry+0.210
ses+0.209
 smile+0.206
 eyes+0.202
 grin+0.199
Neg logit contributions
P-0.152
Now-0.143
it-0.124
sc-0.121
They-0.117
Token said
Token ID531
Feature activation-2.78e-02

Pos logit contributions
 cry+0.014
 smile+0.014
 bed+0.013
 stomach+0.013
 tell+0.013
Neg logit contributions
P-0.027
Now-0.025
ww-0.025
bly-0.023
Soon-0.023
Token,
Token ID11
Feature activation+5.10e-01

Pos logit contributions
P+0.002
ww+0.002
Now+0.002
bly+0.002
lish+0.002
Neg logit contributions
 cry-0.001
 smile-0.001
 tell-0.001
 call-0.001
 bed-0.001
Token "
Token ID366
Feature activation-3.02e+00

Pos logit contributions
 cry+0.019
 smile+0.019
 tell+0.018
 stomach+0.017
 bed+0.017
Neg logit contributions
P-0.030
Now-0.029
ww-0.028
bly-0.028
Soon-0.027
TokenDon
Token ID3987
Feature activation-1.70e+00

Pos logit contributions
bly+0.112
ww+0.111
br+0.105
lish+0.103
hest+0.102
Neg logit contributions
 cry-0.154
 tell-0.154
 crying-0.153
 call-0.152
 say-0.151
Token little
Token ID1310
Feature activation+1.26e+00

Pos logit contributions
 tell+0.039
 cry+0.038
 bed+0.037
 smile+0.037
 gaze+0.036
Neg logit contributions
P-0.049
ww-0.048
Now-0.043
ha-0.042
lish-0.042
Token boy
Token ID2933
Feature activation+1.49e+00

Pos logit contributions
 bed+0.057
 tell+0.056
 cry+0.054
 gaze+0.053
 stomach+0.053
Neg logit contributions
P-0.058
ww-0.055
lish-0.054
sc-0.050
Spl-0.049
Token named
Token ID3706
Feature activation+5.56e-01

Pos logit contributions
 bed+0.065
 stomach+0.060
 smile+0.060
 cry+0.059
 tell+0.055
Neg logit contributions
P-0.080
Now-0.070
ww-0.069
lish-0.069
Hey-0.068
Token Tim
Token ID5045
Feature activation+1.46e+00

Pos logit contributions
 smile+0.018
 cry+0.017
 tell+0.016
 bed+0.016
 stomach+0.016
Neg logit contributions
P-0.037
Now-0.035
ww-0.035
bly-0.033
Hey-0.033
Token heard
Token ID2982
Feature activation+8.69e-01

Pos logit contributions
 bed+0.062
 cry+0.059
 stomach+0.058
 smile+0.058
 tum+0.056
Neg logit contributions
P-0.079
Now-0.072
Hey-0.069
lish-0.068
ha-0.067
Token Kitty
Token ID21168
Feature activation+4.11e+00

Pos logit contributions
 smile+0.033
 stomach+0.031
 cry+0.031
 gaze+0.030
 tell+0.030
Neg logit contributions
P-0.051
Now-0.050
ww-0.048
bly-0.046
O-0.046
Token complain
Token ID13121
Feature activation+3.04e-01

Pos logit contributions
ses+0.103
 stomach+0.102
 tum+0.101
 fault+0.100
 imagination+0.100
Neg logit contributions
lish-0.210
sc-0.202
it-0.202
bly-0.198
B-0.193
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
 smile+0.009
 cry+0.009
 tell+0.009
 bed+0.008
 call+0.008
Neg logit contributions
P-0.020
Now-0.019
lish-0.019
ww-0.019
bly-0.019
Token He
Token ID679
Feature activation+7.81e-02

Pos logit contributions
 smile+0.046
 tell+0.044
 cry+0.044
 gaze+0.042
 weep+0.042
Neg logit contributions
P-0.061
Now-0.061
ww-0.060
Soon-0.056
Spl-0.055
Token came
Token ID1625
Feature activation+5.54e-01

Pos logit contributions
 smile+0.003
 cry+0.003
 bed+0.003
 tell+0.003
 stomach+0.002
Neg logit contributions
P-0.005
Now-0.005
ww-0.005
bly-0.004
lish-0.004
Token to
Token ID284
Feature activation+1.75e+00

Pos logit contributions
 smile+0.024
 cry+0.023
 stomach+0.023
 tell+0.023
 bed+0.023
Neg logit contributions
P-0.030
Now-0.028
ww-0.026
Spl-0.026
sc-0.026
Token hugged
Token ID41130
Feature activation+1.68e-01

Pos logit contributions
 cry+0.057
 bed+0.057
 crying+0.056
 weeping+0.055
 question+0.054
Neg logit contributions
P-0.063
Now-0.060
ww-0.052
Soon-0.050
Good-0.049
Token each
Token ID1123
Feature activation-5.91e-02

Pos logit contributions
 cry+0.007
 tell+0.006
 smile+0.006
 call+0.006
 bed+0.006
Neg logit contributions
P-0.010
Now-0.009
ww-0.009
bly-0.009
lish-0.009
Token other
Token ID584
Feature activation-8.74e-01

Pos logit contributions
P+0.004
Now+0.003
ww+0.003
bly+0.003
lish+0.003
Neg logit contributions
 cry-0.002
 smile-0.002
 tell-0.002
 bed-0.002
 call-0.002
Token and
Token ID290
Feature activation+1.43e+00

Pos logit contributions
P+0.061
Now+0.058
ww+0.058
bly+0.055
Spl+0.054
Neg logit contributions
 smile-0.027
 cry-0.026
 tell-0.026
 bed-0.024
 call-0.024
Token went
Token ID1816
Feature activation+1.09e+00

Pos logit contributions
 question+0.057
 cry+0.056
 weeping+0.056
 bed+0.055
 call+0.054
Neg logit contributions
P-0.078
Now-0.072
ww-0.071
Sorry-0.065
bly-0.065
Token to
Token ID284
Feature activation+4.11e+00

Pos logit contributions
 cry+0.040
 question+0.037
 smile+0.037
 tell+0.036
 weep+0.035
Neg logit contributions
P-0.068
Now-0.062
ww-0.059
bly-0.058
Spl-0.058
Token play
Token ID711
Feature activation-4.66e-01

Pos logit contributions
 grin+0.169
 stomach+0.169
 cheeks+0.165
owed+0.163
 eyes+0.162
Neg logit contributions
Soon-0.117
Now-0.115
Happy-0.110
Good-0.109
 late-0.109
Token somewhere
Token ID7382
Feature activation+3.02e-02

Pos logit contributions
P+0.026
Now+0.024
ww+0.024
bly+0.023
lish+0.022
Neg logit contributions
 smile-0.020
 cry-0.020
 tell-0.020
 call-0.019
 bed-0.019
Token else
Token ID2073
Feature activation+8.88e-01

Pos logit contributions
 cry+0.002
 smile+0.002
 tell+0.002
 call+0.002
 bed+0.002
Neg logit contributions
P-0.001
Now-0.001
ww-0.001
bly-0.001
lish-0.001
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
 cry+0.029
 smile+0.028
 tell+0.028
 call+0.027
 bed+0.026
Neg logit contributions
P-0.057
Now-0.053
lish-0.052
bly-0.050
ww-0.050
Token<|endoftext|>
Token ID50256
Feature activation-2.71e-01

Pos logit contributions
 smile+0.043
 cry+0.043
 tell+0.041
 weep+0.039
 call+0.038
Neg logit contributions
P-0.058
Now-0.058
ww-0.055
Spl-0.053
Soon-0.053
Token when
Token ID618
Feature activation-1.44e-01

Pos logit contributions
 cry+0.028
 smile+0.028
 tell+0.027
 question+0.025
 weeping+0.025
Neg logit contributions
P-0.045
Now-0.041
ww-0.040
sc-0.039
Spl-0.038
Token he
Token ID339
Feature activation-7.97e-02

Pos logit contributions
P+0.008
Now+0.008
ww+0.008
bly+0.007
Spl+0.007
Neg logit contributions
 smile-0.006
 cry-0.006
 tell-0.006
 call-0.006
 question-0.006
Token heard
Token ID2982
Feature activation+4.13e-01

Pos logit contributions
P+0.005
Now+0.005
ww+0.005
lish+0.004
bly+0.004
Neg logit contributions
 smile-0.003
 cry-0.003
 tell-0.003
 call-0.003
 bed-0.003
Token his
Token ID465
Feature activation+2.36e+00

Pos logit contributions
 smile+0.019
 cry+0.018
 tell+0.018
 call+0.017
 question+0.017
Neg logit contributions
P-0.021
Now-0.020
ww-0.020
bly-0.019
lish-0.018
Token friend
Token ID1545
Feature activation+3.75e+00

Pos logit contributions
 smile+0.079
 stare+0.079
 weep+0.078
 cry+0.077
 tell+0.076
Neg logit contributions
P-0.138
ww-0.123
A-0.114
lish-0.114
W-0.110
Token Lucy
Token ID22162
Feature activation+4.10e+00

Pos logit contributions
 fault+0.133
ses+0.131
 throat+0.125
 stomach+0.124
 imagination+0.122
Neg logit contributions
P-0.182
lish-0.168
B-0.161
Hey-0.159
it-0.156
Token crying
Token ID13774
Feature activation+6.63e-01

Pos logit contributions
 stomach+0.132
 fault+0.130
 tum+0.127
ses+0.125
sed+0.122
Neg logit contributions
P-0.200
sc-0.196
B-0.195
lish-0.186
bly-0.184
Token.
Token ID13
Feature activation+1.39e+00

Pos logit contributions
 smile+0.021
 cry+0.020
 tell+0.020
 gaze+0.019
 stomach+0.019
Neg logit contributions
P-0.044
lish-0.041
Now-0.040
ww-0.039
bly-0.039
Token
Token ID198
Feature activation+5.36e-01

Pos logit contributions
 cry+0.053
 smile+0.052
 tell+0.052
 weep+0.049
 stomach+0.049
Neg logit contributions
P-0.080
Now-0.077
ww-0.073
Soon-0.071
Spl-0.071
Token
Token ID198
Feature activation-3.00e+00

Pos logit contributions
 smile+0.017
 cry+0.017
 stomach+0.016
 tell+0.016
 call+0.015
Neg logit contributions
P-0.037
Now-0.035
ww-0.033
Soon-0.033
Spl-0.032
Token"
Token ID1
Feature activation-1.78e+00

Pos logit contributions
ier+0.114
w+0.112
iest+0.109
bridge+0.106
ww+0.105
Neg logit contributions
 play-0.157
 tell-0.157
 search-0.156
 find-0.155
 call-0.155
Tokenily
Token ID813
Feature activation+1.35e-01

Pos logit contributions
P+0.080
ww+0.077
Now+0.072
bly+0.072
Spl+0.068
Neg logit contributions
 cry-0.092
 tell-0.090
 smile-0.090
 crying-0.089
 bed-0.087
Token felt
Token ID2936
Feature activation+2.34e-02

Pos logit contributions
 cry+0.005
 smile+0.005
 tell+0.005
 bed+0.005
 call+0.005
Neg logit contributions
P-0.008
Now-0.008
ww-0.007
bly-0.007
lish-0.007
Token better
Token ID1365
Feature activation+1.63e-01

Pos logit contributions
 smile+0.001
 cry+0.001
 tell+0.001
 bed+0.001
 call+0.001
Neg logit contributions
P-0.002
Now-0.001
ww-0.001
bly-0.001
lish-0.001
Token and
Token ID290
Feature activation+6.44e-01

Pos logit contributions
 smile+0.007
 cry+0.007
 tell+0.006
 call+0.006
 bed+0.006
Neg logit contributions
P-0.010
Now-0.009
ww-0.009
bly-0.008
lish-0.008
Token went
Token ID1816
Feature activation+6.56e-01

Pos logit contributions
 cry+0.027
 smile+0.027
 question+0.027
 call+0.026
 bed+0.026
Neg logit contributions
P-0.034
Now-0.033
ww-0.031
bly-0.030
Soon-0.029
Token to
Token ID284
Feature activation+4.10e+00

Pos logit contributions
 cry+0.025
 smile+0.023
 tell+0.022
 stomach+0.022
 question+0.022
Neg logit contributions
P-0.041
Now-0.038
bly-0.035
ww-0.035
Spl-0.035
Token play
Token ID711
Feature activation-2.49e-01

Pos logit contributions
sed+0.169
 mouths+0.163
ses+0.160
 stomach+0.160
 tum+0.159
Neg logit contributions
Now-0.126
But-0.114
 your-0.112
Soon-0.111
P-0.109
Token with
Token ID351
Feature activation+1.60e+00

Pos logit contributions
P+0.013
Now+0.012
ww+0.012
bly+0.011
lish+0.011
Neg logit contributions
 cry-0.012
 smile-0.012
 tell-0.011
 call-0.011
 bed-0.011
Token her
Token ID607
Feature activation+1.56e+00

Pos logit contributions
 cry+0.071
 question+0.070
 gaze+0.070
 smile+0.069
 tell+0.069
Neg logit contributions
P-0.078
Now-0.069
bly-0.067
lish-0.067
O-0.064
Token other
Token ID584
Feature activation+1.14e-01

Pos logit contributions
 gaze+0.053
 cry+0.053
 stare+0.052
 tell+0.051
 know+0.051
Neg logit contributions
P-0.091
Now-0.082
bly-0.082
lish-0.082
ww-0.077
Token friends
Token ID2460
Feature activation+9.91e-01

Pos logit contributions
 cry+0.004
 smile+0.004
 tell+0.004
 call+0.004
 gaze+0.004
Neg logit contributions
P-0.007
Now-0.007
ww-0.007
lish-0.006
bly-0.006
Token sad
Token ID6507
Feature activation+1.68e+00

Pos logit contributions
 smile+0.033
 cry+0.033
 tell+0.032
 bed+0.031
 gaze+0.030
Neg logit contributions
P-0.053
bly-0.048
Now-0.048
ww-0.046
Spl-0.045
Token.
Token ID13
Feature activation+1.25e+00

Pos logit contributions
 gaze+0.074
 cry+0.074
 call+0.072
 weep+0.071
 tell+0.071
Neg logit contributions
lish-0.069
P-0.069
ww-0.068
Soon-0.065
Now-0.065
Token Max
Token ID5436
Feature activation+8.66e-01

Pos logit contributions
 tell+0.055
 cry+0.055
 smile+0.053
 call+0.053
 die+0.051
Neg logit contributions
Now-0.061
P-0.060
ww-0.056
Soon-0.055
Spl-0.055
Token went
Token ID1816
Feature activation+6.25e-01

Pos logit contributions
 cry+0.034
 smile+0.033
 bed+0.033
 tell+0.032
 stomach+0.031
Neg logit contributions
P-0.050
Now-0.046
ww-0.045
lish-0.043
Spl-0.042
Token to
Token ID284
Feature activation+2.93e+00

Pos logit contributions
 cry+0.024
 stomach+0.023
 smile+0.023
 tell+0.022
 eyes+0.021
Neg logit contributions
P-0.037
Now-0.034
sc-0.033
Soon-0.032
ww-0.032
Token Lucy
Token ID22162
Feature activation+4.10e+00

Pos logit contributions
 stomach+0.145
 throat+0.143
 tum+0.140
 mouths+0.138
sed+0.137
Neg logit contributions
Now-0.105
Hey-0.098
Go-0.096
P-0.095
Good-0.095
Token and
Token ID290
Feature activation+1.67e+00

Pos logit contributions
ses+0.223
sed+0.216
 stomach+0.195
 eyes+0.192
arter+0.190
Neg logit contributions
sc-0.128
P-0.121
it-0.115
When-0.109
"-0.109
Token asked
Token ID1965
Feature activation+1.13e+00

Pos logit contributions
 cry+0.063
 stomach+0.062
 question+0.061
 smile+0.061
 eyes+0.061
Neg logit contributions
P-0.092
ww-0.082
Now-0.081
ha-0.079
Sorry-0.076
Token,
Token ID11
Feature activation+5.10e-01

Pos logit contributions
 cry+0.044
 gaze+0.043
 smile+0.042
 call+0.042
 weep+0.042
Neg logit contributions
P-0.059
lish-0.055
ww-0.055
Now-0.054
bly-0.053
Token "
Token ID366
Feature activation-2.82e+00

Pos logit contributions
 cry+0.019
 smile+0.019
 tell+0.018
 stomach+0.018
 bed+0.018
Neg logit contributions
P-0.030
Now-0.029
ww-0.028
bly-0.028
Soon-0.027
TokenWhy
Token ID5195
Feature activation+6.78e-01

Pos logit contributions
ww+0.115
bly+0.100
P+0.097
hest+0.097
br+0.097
Neg logit contributions
 tell-0.158
 crying-0.155
 call-0.154
 smile-0.153
 cry-0.152
Token you
Token ID345
Feature activation+1.31e+00

Pos logit contributions
P+0.044
Now+0.044
ww+0.040
bly+0.039
lish+0.038
Neg logit contributions
 cry-0.053
 tell-0.053
 smile-0.053
 bed-0.051
 call-0.051
Token going
Token ID1016
Feature activation+5.35e-01

Pos logit contributions
 smile+0.046
 cry+0.044
 stomach+0.042
 call+0.041
 question+0.041
Neg logit contributions
P-0.078
ww-0.071
Now-0.067
bly-0.067
lish-0.064
Token?"
Token ID1701
Feature activation+1.19e+00

Pos logit contributions
 cry+0.023
 smile+0.022
 tell+0.021
 stomach+0.020
 call+0.020
Neg logit contributions
P-0.030
Now-0.028
ww-0.028
bly-0.026
Soon-0.026
Token asked
Token ID1965
Feature activation+1.01e+00

Pos logit contributions
 smile+0.044
 fault+0.043
 stomach+0.042
 bed+0.041
 cry+0.040
Neg logit contributions
P-0.070
Now-0.064
ww-0.062
Hey-0.061
A-0.061
Token the
Token ID262
Feature activation+1.11e+00

Pos logit contributions
 cry+0.042
 smile+0.041
 tell+0.040
 weeping+0.040
 gaze+0.039
Neg logit contributions
P-0.054
Now-0.050
ww-0.048
bly-0.047
Hey-0.046
Token bear
Token ID6842
Feature activation+4.10e+00

Pos logit contributions
 tell+0.047
 smile+0.047
 cry+0.046
 know+0.043
 gaze+0.043
Neg logit contributions
P-0.061
ww-0.052
Now-0.052
A-0.052
bly-0.050
Token.
Token ID13
Feature activation+1.78e+00

Pos logit contributions
ses+0.200
sed+0.183
 bed+0.177
 fault+0.177
 imagination+0.175
Neg logit contributions
it-0.160
P-0.145
lish-0.141
bly-0.140
Just-0.138
Token
Token ID198
Feature activation+3.52e-01

Pos logit contributions
 cry+0.077
 smile+0.075
 regret+0.072
 weep+0.071
 tell+0.071
Neg logit contributions
P-0.092
Now-0.088
A-0.086
Spl-0.084
O-0.083
Token
Token ID198
Feature activation-2.59e+00

Pos logit contributions
 smile+0.011
 cry+0.011
 tell+0.010
 stomach+0.010
 bed+0.010
Neg logit contributions
P-0.025
Now-0.023
ww-0.022
Soon-0.022
Spl-0.021
Token"
Token ID1
Feature activation-1.65e+00

Pos logit contributions
bly+0.107
ww+0.101
lish+0.098
P+0.096
iest+0.096
Neg logit contributions
 tell-0.140
 call-0.136
 smile-0.134
 find-0.134
 cry-0.133
TokenI
Token ID40
Feature activation+1.11e+00

Pos logit contributions
ww+0.051
P+0.049
lish+0.047
Now+0.047
bly+0.046
Neg logit contributions
 cry-0.107
 crying-0.105
 tell-0.103
 smile-0.102
 call-0.101
Token how
Token ID703
Feature activation-2.71e-01

Pos logit contributions
 smile+0.029
 cry+0.029
 tell+0.027
 call+0.027
 gaze+0.026
Neg logit contributions
P-0.039
Now-0.037
ww-0.035
bly-0.034
Soon-0.034
Token the
Token ID262
Feature activation+1.88e-01

Pos logit contributions
P+0.015
Now+0.014
ww+0.014
bly+0.014
lish+0.013
Neg logit contributions
 cry-0.012
 smile-0.012
 tell-0.011
 call-0.011
 bed-0.011
Token farmer
Token ID18739
Feature activation+1.30e+00

Pos logit contributions
 cry+0.009
 smile+0.009
 tell+0.008
 bed+0.008
 weep+0.008
Neg logit contributions
P-0.010
Now-0.010
ww-0.009
lish-0.009
bly-0.009
Token helped
Token ID4193
Feature activation+1.81e+00

Pos logit contributions
 cry+0.057
 bed+0.056
 smile+0.055
 tell+0.054
 gaze+0.052
Neg logit contributions
P-0.067
Now-0.065
ww-0.059
bly-0.059
lish-0.058
Token the
Token ID262
Feature activation+3.97e-01

Pos logit contributions
 smile+0.069
 cry+0.067
 weeping+0.067
 bed+0.065
ses+0.065
Neg logit contributions
P-0.094
Now-0.085
bly-0.083
lish-0.082
ww-0.080
Token cow
Token ID9875
Feature activation+4.05e+00

Pos logit contributions
 cry+0.018
 smile+0.018
 tell+0.017
 bed+0.017
 weep+0.016
Neg logit contributions
P-0.021
Now-0.020
ww-0.020
lish-0.018
Soon-0.018
Token.
Token ID13
Feature activation-5.52e-02

Pos logit contributions
ses+0.130
sed+0.126
 weeping+0.116
 eyes+0.115
 smile+0.114
Neg logit contributions
Now-0.191
sc-0.186
bly-0.183
P-0.181
wing-0.180
Token They
Token ID1119
Feature activation+1.48e+00

Pos logit contributions
P+0.004
Now+0.003
ww+0.003
bly+0.003
lish+0.003
Neg logit contributions
 cry-0.002
 smile-0.002
 tell-0.002
 crying-0.002
 call-0.002
Token knew
Token ID2993
Feature activation+5.11e-01

Pos logit contributions
 cry+0.063
 question+0.057
 crying+0.057
 weeping+0.057
 gaze+0.057
Neg logit contributions
P-0.080
Now-0.074
bly-0.069
ww-0.068
ha-0.066
Token he
Token ID339
Feature activation+1.62e-01

Pos logit contributions
 cry+0.023
 smile+0.023
 weep+0.022
 call+0.021
 tell+0.021
Neg logit contributions
P-0.027
ww-0.025
Now-0.025
bly-0.024
Soon-0.023
Token was
Token ID373
Feature activation-6.86e-04

Pos logit contributions
 cry+0.007
 smile+0.007
 bed+0.007
 tell+0.007
 call+0.007
Neg logit contributions
P-0.009
Now-0.008
ww-0.008
bly-0.008
lish-0.008
Token crying
Token ID13774
Feature activation-1.90e-01

Pos logit contributions
 smile+0.034
 gaze+0.033
 tell+0.031
 cry+0.031
 bed+0.031
Neg logit contributions
P-0.067
bly-0.065
ww-0.064
Now-0.062
it-0.059
Token?"
Token ID1701
Feature activation+8.92e-01

Pos logit contributions
P+0.011
Now+0.011
ww+0.010
bly+0.010
lish+0.010
Neg logit contributions
 smile-0.008
 cry-0.008
 tell-0.007
 call-0.007
 bed-0.007
Token asked
Token ID1965
Feature activation+1.08e+00

Pos logit contributions
 smile+0.033
 stomach+0.032
 bed+0.032
 cry+0.032
 tell+0.031
Neg logit contributions
P-0.053
Now-0.049
ww-0.047
Hey-0.046
lish-0.045
Token the
Token ID262
Feature activation+6.74e-01

Pos logit contributions
 cry+0.045
 smile+0.043
 bed+0.042
 tell+0.042
 stomach+0.042
Neg logit contributions
P-0.060
Now-0.054
Hey-0.051
lish-0.051
bly-0.051
Token rich
Token ID5527
Feature activation+1.01e+00

Pos logit contributions
 cry+0.031
 tell+0.031
 bed+0.030
 call+0.029
 smile+0.029
Neg logit contributions
P-0.035
Now-0.030
lish-0.029
ww-0.029
Hey-0.028
Token man
Token ID582
Feature activation+4.05e+00

Pos logit contributions
 cry+0.050
 bed+0.049
 tell+0.048
 call+0.048
 search+0.047
Neg logit contributions
P-0.046
ww-0.041
Now-0.041
lish-0.040
ha-0.037
Token.
Token ID13
Feature activation+1.81e+00

Pos logit contributions
ses+0.193
sed+0.179
 bed+0.177
 fault+0.175
 imagination+0.166
Neg logit contributions
lish-0.168
it-0.164
P-0.158
bly-0.153
year-0.149
Token
Token ID220
Feature activation+2.13e+00

Pos logit contributions
 cry+0.069
 bed+0.066
 tell+0.066
 stomach+0.065
 smile+0.064
Neg logit contributions
P-0.100
Now-0.098
Spl-0.090
A-0.090
O-0.089
Token
Token ID198
Feature activation-6.89e-01

Pos logit contributions
 smile+0.092
 cry+0.088
 gaze+0.086
 weep+0.086
 tell+0.085
Neg logit contributions
P-0.118
Now-0.114
Soon-0.109
it-0.106
CR-0.105
Token
Token ID198
Feature activation-2.99e+00

Pos logit contributions
P+0.050
Now+0.048
ww+0.048
bly+0.047
lish+0.046
Neg logit contributions
 smile-0.018
 cry-0.018
 tell-0.017
 call-0.016
 bed-0.016
TokenThe
Token ID464
Feature activation-1.78e-01

Pos logit contributions
bly+0.123
dd+0.116
w+0.115
ww+0.114
lish+0.110
Neg logit contributions
 tell-0.161
 smile-0.160
 cry-0.153
 call-0.152
 question-0.150
Token tears
Token ID10953
Feature activation+2.57e-01

Pos logit contributions
 tell+0.029
 smile+0.028
 cry+0.028
 question+0.027
 call+0.027
Neg logit contributions
P-0.041
Now-0.039
ww-0.039
bly-0.038
lish-0.036
Token.
Token ID13
Feature activation+7.08e-01

Pos logit contributions
 cry+0.008
 tell+0.008
 smile+0.008
 call+0.008
 question+0.008
Neg logit contributions
P-0.017
Now-0.016
ww-0.016
bly-0.016
lish-0.015
Token Tim
Token ID5045
Feature activation+8.51e-01

Pos logit contributions
 cry+0.027
 smile+0.027
 tell+0.027
 call+0.026
 weep+0.025
Neg logit contributions
P-0.043
Now-0.042
ww-0.039
Soon-0.039
Spl-0.038
Token went
Token ID1816
Feature activation+2.45e-01

Pos logit contributions
 bed+0.033
 cry+0.032
 smile+0.032
 tell+0.031
 stomach+0.031
Neg logit contributions
P-0.050
Now-0.045
ww-0.044
Spl-0.041
lish-0.041
Token to
Token ID284
Feature activation+2.82e+00

Pos logit contributions
 cry+0.009
 smile+0.009
 tell+0.008
 stomach+0.008
 bed+0.008
Neg logit contributions
P-0.015
Now-0.014
ww-0.013
sc-0.013
bly-0.013
Token Sue
Token ID26089
Feature activation+4.05e+00

Pos logit contributions
 throat+0.135
 stomach+0.129
 tum+0.127
 fault+0.126
 mouths+0.123
Neg logit contributions
Hey-0.113
P-0.107
Now-0.105
Go-0.104
O-0.104
Token and
Token ID290
Feature activation+1.99e+00

Pos logit contributions
ses+0.253
sed+0.248
 stomach+0.222
 throat+0.217
 eyes+0.214
Neg logit contributions
sc-0.105
P-0.095
"-0.093
it-0.086
-0.081
Token asked
Token ID1965
Feature activation+1.24e+00

Pos logit contributions
 bed+0.082
 cry+0.082
 stomach+0.081
 weeping+0.081
 fault+0.080
Neg logit contributions
P-0.106
ww-0.088
Now-0.086
Spl-0.084
Sorry-0.083
Token,
Token ID11
Feature activation+9.75e-01

Pos logit contributions
 cry+0.051
 weep+0.049
 gaze+0.048
 smile+0.047
 bed+0.047
Neg logit contributions
P-0.067
Now-0.060
lish-0.058
ww-0.058
bly-0.057
Token "
Token ID366
Feature activation-2.37e+00

Pos logit contributions
 cry+0.040
 smile+0.038
 tell+0.037
 bed+0.037
 stomach+0.037
Neg logit contributions
P-0.054
Now-0.052
bly-0.050
Soon-0.049
ww-0.049
TokenWhy
Token ID5195
Feature activation+8.04e-01

Pos logit contributions
ww+0.095
bly+0.086
sc+0.079
P+0.079
br+0.079
Neg logit contributions
 tell-0.137
 call-0.132
 crying-0.130
 cry-0.130
 smile-0.129
Token
Token ID198
Feature activation-2.95e+00

Pos logit contributions
P+0.002
Now+0.002
ww+0.002
bly+0.002
Soon+0.002
Neg logit contributions
 smile-0.001
 cry-0.001
 tell-0.001
 call-0.001
 stomach-0.001
TokenThe
Token ID464
Feature activation-3.23e-01

Pos logit contributions
bly+0.116
ww+0.109
dd+0.102
P+0.100
iest+0.099
Neg logit contributions
 tell-0.157
 call-0.157
 cry-0.154
 smile-0.150
 mom-0.149
Token dog
Token ID3290
Feature activation+2.58e-01

Pos logit contributions
P+0.019
Now+0.018
ww+0.018
bly+0.017
lish+0.017
Neg logit contributions
 smile-0.013
 cry-0.013
 tell-0.013
 call-0.012
 bed-0.012
Token heard
Token ID2982
Feature activation+3.90e-01

Pos logit contributions
 cry+0.009
 smile+0.009
 tell+0.009
 bed+0.008
 call+0.008
Neg logit contributions
P-0.016
Now-0.015
ww-0.015
lish-0.014
Spl-0.014
Token the
Token ID262
Feature activation+7.41e-01

Pos logit contributions
 smile+0.014
 cry+0.014
 tell+0.014
 weep+0.013
 stomach+0.013
Neg logit contributions
P-0.024
Now-0.023
ww-0.022
bly-0.021
lish-0.021
Token man
Token ID582
Feature activation+4.04e+00

Pos logit contributions
 tell+0.030
 smile+0.029
 cry+0.028
 weeping+0.027
 bed+0.027
Neg logit contributions
P-0.040
Now-0.039
ww-0.038
lish-0.036
Too-0.035
Token and
Token ID290
Feature activation+7.18e-01

Pos logit contributions
 fault+0.135
sed+0.134
ses+0.132
 stomach+0.130
 tum+0.128
Neg logit contributions
sc-0.182
it-0.181
lish-0.180
aming-0.178
bly-0.178
Token bark
Token ID21405
Feature activation-1.39e+00

Pos logit contributions
 bed+0.028
 cry+0.027
 smile+0.027
 tell+0.027
 stomach+0.026
Neg logit contributions
P-0.039
ww-0.036
Now-0.036
Spl-0.035
Hey-0.035
Tokened
Token ID276
Feature activation-5.40e-02

Pos logit contributions
P+0.074
Now+0.072
ww+0.071
Soon+0.064
bly+0.064
Neg logit contributions
 cry-0.065
 smile-0.065
 crying-0.062
 tell-0.062
 call-0.060
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
P+0.004
Now+0.004
ww+0.004
bly+0.003
lish+0.003
Neg logit contributions
 smile-0.001
 cry-0.001
 tell-0.001
 bed-0.001
 call-0.001
Token It
Token ID632
Feature activation+1.77e-02

Pos logit contributions
 smile+0.047
 cry+0.046
 tell+0.045
 weep+0.044
 weeping+0.043
Neg logit contributions
P-0.050
Now-0.049
ww-0.047
Spl-0.046
Soon-0.045
Token at
Token ID379
Feature activation+8.80e-01

Pos logit contributions
P+0.036
Now+0.034
ww+0.034
bly+0.032
lish+0.032
Neg logit contributions
 smile-0.014
 cry-0.014
 tell-0.014
 call-0.013
 bed-0.013
Token the
Token ID262
Feature activation-1.02e+00

Pos logit contributions
 cry+0.035
 smile+0.035
 stomach+0.034
 tell+0.034
 weeping+0.032
Neg logit contributions
P-0.050
Now-0.049
Soon-0.045
bly-0.045
ww-0.044
Token starting
Token ID3599
Feature activation-4.31e-01

Pos logit contributions
P+0.066
ww+0.061
Now+0.061
bly+0.060
lish+0.058
Neg logit contributions
 smile-0.037
 tell-0.036
 cry-0.036
 call-0.036
 bed-0.036
Token line
Token ID1627
Feature activation-1.55e+00

Pos logit contributions
P+0.026
Now+0.025
ww+0.024
bly+0.023
lish+0.023
Neg logit contributions
 smile-0.016
 cry-0.016
 tell-0.016
 call-0.015
 bed-0.015
Token.
Token ID13
Feature activation+1.75e-01

Pos logit contributions
P+0.123
ww+0.119
Now+0.116
lish+0.112
bly+0.112
Neg logit contributions
 smile-0.033
 tell-0.031
 bed-0.030
 cry-0.029
 call-0.029
Token "
Token ID366
Feature activation-5.59e+00

Pos logit contributions
 cry+0.007
 smile+0.006
 tell+0.006
 crying+0.006
 call+0.006
Neg logit contributions
P-0.011
Now-0.010
ww-0.010
Soon-0.010
sc-0.010
TokenReady
Token ID35474
Feature activation-1.23e+00

Pos logit contributions
lish+0.259
led+0.237
izing+0.230
bly+0.223
ww+0.218
Neg logit contributions
 play-0.249
 call-0.244
 gaze-0.243
 visit-0.243
 find-0.242
Token,
Token ID11
Feature activation-2.36e+00

Pos logit contributions
P+0.082
Now+0.077
ww+0.075
bly+0.073
lish+0.072
Neg logit contributions
 smile-0.044
 tell-0.043
 cry-0.042
 call-0.041
 bed-0.040
Token set
Token ID900
Feature activation-1.82e+00

Pos logit contributions
P+0.176
lish+0.171
bly+0.166
Now+0.164
ww+0.162
Neg logit contributions
 tell-0.069
 smile-0.065
 call-0.064
 find-0.061
 bed-0.059
Token,
Token ID11
Feature activation-2.81e+00

Pos logit contributions
P+0.145
Now+0.141
bly+0.133
lish+0.133
ww+0.131
Neg logit contributions
 tell-0.040
 smile-0.040
 call-0.037
 bed-0.036
 cry-0.036
Token go
Token ID467
Feature activation-2.87e+00

Pos logit contributions
P+0.233
lish+0.227
bly+0.219
Now+0.217
ww+0.214
Neg logit contributions
 tell-0.061
 smile-0.057
 call-0.056
 find-0.053
 say-0.051
Token sign
Token ID1051
Feature activation-7.54e-01

Pos logit contributions
 tell+0.049
 cry+0.048
 weeping+0.047
 smile+0.046
 weep+0.046
Neg logit contributions
ww-0.059
P-0.058
Now-0.054
Soon-0.051
Spl-0.051
Token that
Token ID326
Feature activation+2.43e-01

Pos logit contributions
P+0.047
Now+0.044
ww+0.044
bly+0.042
lish+0.041
Neg logit contributions
 smile-0.029
 cry-0.028
 tell-0.028
 call-0.027
 bed-0.026
Token said
Token ID531
Feature activation-5.03e-01

Pos logit contributions
 smile+0.008
 cry+0.007
 tell+0.007
 bed+0.007
 stomach+0.007
Neg logit contributions
P-0.016
Now-0.016
ww-0.015
bly-0.015
Spl-0.014
Token "
Token ID366
Feature activation-3.07e+00

Pos logit contributions
P+0.035
Now+0.033
ww+0.033
bly+0.032
lish+0.031
Neg logit contributions
 smile-0.015
 cry-0.015
 tell-0.014
 call-0.014
 bed-0.013
TokenRE
Token ID2200
Feature activation-1.98e+00

Pos logit contributions
P+0.103
bly+0.099
ww+0.096
lish+0.093
Now+0.091
Neg logit contributions
 cry-0.179
 call-0.177
 tell-0.173
 smile-0.172
 crying-0.171
TokenVERS
Token ID28884
Feature activation-5.46e+00

Pos logit contributions
P+0.101
Now+0.095
ww+0.091
bly+0.089
Soon+0.084
Neg logit contributions
 call-0.092
 cry-0.092
 smile-0.092
 tell-0.092
 bed-0.091
TokenE
Token ID36
Feature activation-3.38e+00

Pos logit contributions
ww+0.181
Now+0.170
iest+0.157
P+0.152
lish+0.147
Neg logit contributions
 cry-0.286
 crying-0.283
 weep-0.270
 weeping-0.264
 call-0.262
Token".
Token ID1911
Feature activation-1.63e-01

Pos logit contributions
Now+0.160
P+0.159
Soon+0.154
Sorry+0.151
Looks+0.146
Neg logit contributions
 stomach-0.136
 smile-0.133
 mom-0.132
 tell-0.131
 bed-0.130
Token L
Token ID406
Feature activation-1.27e+00

Pos logit contributions
P+0.011
Now+0.010
ww+0.010
bly+0.010
Spl+0.010
Neg logit contributions
 smile-0.005
 cry-0.005
 tell-0.005
 call-0.005
 crying-0.005
Tokenila
Token ID10102
Feature activation-1.84e-01

Pos logit contributions
P+0.057
Now+0.055
ww+0.053
bly+0.050
lish+0.050
Neg logit contributions
 smile-0.067
 cry-0.067
 tell-0.066
 crying-0.063
 call-0.063
Token and
Token ID290
Feature activation+1.06e+00

Pos logit contributions
P+0.012
Now+0.011
ww+0.011
bly+0.010
Spl+0.010
Neg logit contributions
 smile-0.006
 cry-0.006
 tell-0.006
 bed-0.006
 call-0.006
Token and
Token ID290
Feature activation-2.08e-01

Pos logit contributions
 cry+0.058
 tell+0.055
 weep+0.052
 weeping+0.052
 question+0.052
Neg logit contributions
P-0.088
Now-0.088
sc-0.086
ww-0.083
Spl-0.082
Token John
Token ID1757
Feature activation+4.69e-01

Pos logit contributions
P+0.013
Now+0.012
ww+0.012
bly+0.011
lish+0.011
Neg logit contributions
 cry-0.008
 smile-0.008
 tell-0.008
 call-0.008
 bed-0.007
Token chased
Token ID26172
Feature activation-1.00e+00

Pos logit contributions
 cry+0.019
 smile+0.018
 tell+0.018
 bed+0.018
 call+0.017
Neg logit contributions
P-0.027
Now-0.025
ww-0.025
bly-0.024
sc-0.024
Token Jenny
Token ID28437
Feature activation+6.23e-01

Pos logit contributions
P+0.065
Now+0.062
ww+0.061
bly+0.059
lish+0.056
Neg logit contributions
 smile-0.037
 cry-0.036
 tell-0.035
 call-0.033
 bed-0.032
Token.
Token ID13
Feature activation+7.47e-01

Pos logit contributions
 tell+0.024
 cry+0.024
 smile+0.024
 call+0.023
 question+0.023
Neg logit contributions
P-0.036
Now-0.033
ww-0.033
sc-0.032
bly-0.032
Token "
Token ID366
Feature activation-5.40e+00

Pos logit contributions
 cry+0.030
 tell+0.029
 smile+0.028
 call+0.027
 weep+0.027
Neg logit contributions
Now-0.043
P-0.042
Soon-0.039
ww-0.039
Spl-0.038
TokenYou
Token ID1639
Feature activation-8.82e-01

Pos logit contributions
bly+0.209
lish+0.209
izing+0.197
ier+0.197
iest+0.195
Neg logit contributions
 gaze-0.239
 smile-0.238
 play-0.236
 -0.235
 cry-0.233
Token can
Token ID460
Feature activation-6.78e-02

Pos logit contributions
P+0.059
Now+0.056
ww+0.054
bly+0.053
lish+0.052
Neg logit contributions
 smile-0.031
 cry-0.030
 tell-0.030
 call-0.028
 bed-0.027
Token't
Token ID470
Feature activation-1.55e-01

Pos logit contributions
P+0.005
Now+0.004
ww+0.004
bly+0.004
Spl+0.004
Neg logit contributions
 smile-0.002
 cry-0.002
 tell-0.002
 bed-0.002
 crying-0.002
Token catch
Token ID4929
Feature activation-1.48e+00

Pos logit contributions
P+0.011
Now+0.011
ww+0.010
bly+0.010
lish+0.010
Neg logit contributions
 smile-0.004
 cry-0.004
 tell-0.004
 bed-0.004
 crying-0.004
Token me
Token ID502
Feature activation-1.11e+00

Pos logit contributions
P+0.087
Now+0.085
ww+0.080
bly+0.079
Soon+0.076
Neg logit contributions
 cry-0.061
 tell-0.059
 smile-0.059
 crying-0.055
 call-0.055
Token reached
Token ID4251
Feature activation-4.52e-01

Pos logit contributions
P+0.088
Now+0.082
ww+0.082
bly+0.079
lish+0.076
Neg logit contributions
 smile-0.049
 cry-0.048
 tell-0.047
 call-0.047
 crying-0.044
Token its
Token ID663
Feature activation+8.42e-01

Pos logit contributions
P+0.030
Now+0.028
ww+0.028
bly+0.027
lish+0.026
Neg logit contributions
 smile-0.015
 cry-0.015
 tell-0.015
 call-0.014
 bed-0.014
Token top
Token ID1353
Feature activation-5.44e-01

Pos logit contributions
 cry+0.030
 tell+0.029
 smile+0.029
 crying+0.028
 weeping+0.028
Neg logit contributions
P-0.049
Now-0.047
ww-0.047
Soon-0.045
lish-0.044
Token speed
Token ID2866
Feature activation-2.16e+00

Pos logit contributions
P+0.037
Now+0.035
ww+0.034
bly+0.033
lish+0.032
Neg logit contributions
 smile-0.017
 cry-0.017
 tell-0.017
 call-0.016
 bed-0.016
Token!
Token ID0
Feature activation+1.59e-01

Pos logit contributions
P+0.163
Now+0.152
ww+0.152
Spl+0.142
bly+0.142
Neg logit contributions
 smile-0.059
 cry-0.055
 call-0.052
 crying-0.051
 tell-0.050
Token "
Token ID366
Feature activation-5.36e+00

Pos logit contributions
 cry+0.005
 smile+0.005
 tell+0.005
 call+0.004
 question+0.004
Neg logit contributions
P-0.011
Now-0.011
ww-0.010
bly-0.010
Soon-0.010
TokenWhe
Token ID10842
Feature activation-1.31e+00

Pos logit contributions
lish+0.203
bly+0.184
led+0.176
ww+0.172
izing+0.172
Neg logit contributions
 smile-0.254
 gaze-0.252
 -0.244
 tell-0.239
 stare-0.237
Tokene
Token ID68
Feature activation-2.34e+00

Pos logit contributions
P+0.059
Now+0.056
ww+0.053
lish+0.051
bly+0.050
Neg logit contributions
 smile-0.070
 cry-0.068
 tell-0.068
 bed-0.065
 crying-0.064
Token,
Token ID11
Feature activation-1.68e+00

Pos logit contributions
P+0.134
Now+0.124
bly+0.115
lish+0.114
ww+0.112
Neg logit contributions
 smile-0.108
 cry-0.106
 tell-0.098
 call-0.095
 bed-0.095
Token this
Token ID428
Feature activation-1.70e+00

Pos logit contributions
P+0.110
Now+0.100
ww+0.096
bly+0.095
lish+0.094
Neg logit contributions
 smile-0.067
 cry-0.065
 tell-0.064
 call-0.060
 bed-0.060
Token is
Token ID318
Feature activation-1.77e+00

Pos logit contributions
P+0.113
Now+0.105
bly+0.104
lish+0.102
ww+0.099
Neg logit contributions
 smile-0.062
 cry-0.059
 tell-0.057
 call-0.054
 feel-0.053
Token
Token ID198
Feature activation+3.04e-01

Pos logit contributions
 cry+0.019
 tell+0.019
 smile+0.018
 call+0.017
 weep+0.017
Neg logit contributions
P-0.032
Now-0.030
ww-0.028
Spl-0.028
Soon-0.028
Token
Token ID198
Feature activation-4.23e+00

Pos logit contributions
 cry+0.009
 smile+0.009
 tell+0.009
 call+0.008
 weeping+0.008
Neg logit contributions
P-0.021
Now-0.020
ww-0.019
Soon-0.019
Spl-0.019
Token"
Token ID1
Feature activation-3.57e+00

Pos logit contributions
rob+0.214
iest+0.209
ww+0.202
bly+0.199
plets+0.199
Neg logit contributions
 mom-0.164
 find-0.152
-0.151
 -0.145
 ask-0.140
TokenÂ
Token ID5523
Feature activation-4.70e-01

Pos logit contributions
bly+0.139
lish+0.136
dd+0.134
wy+0.131
ww+0.128
Neg logit contributions
 smile-0.189
 gaze-0.178
 call-0.175
 question-0.175
 cry-0.174
Token
Token ID126
Feature activation-1.42e+00

Pos logit contributions
P+0.027
Now+0.025
ww+0.025
bly+0.024
lish+0.023
Neg logit contributions
 smile-0.020
 cry-0.020
 tell-0.020
 call-0.019
 bed-0.019
Token
Token ID94
Feature activation-5.35e+00

Pos logit contributions
P+0.075
ww+0.067
Now+0.065
bly+0.065
lish+0.062
Neg logit contributions
 cry-0.068
 smile-0.068
 tell-0.068
 call-0.066
 bed-0.064
TokenM
Token ID44
Feature activation-1.71e+00

Pos logit contributions
ww+0.195
lish+0.192
bly+0.188
iest+0.180
dd+0.156
Neg logit contributions
 cry-0.239
 tell-0.239
 call-0.236
 play-0.228
 question-0.227
Tokenira
Token ID8704
Feature activation+1.47e-01

Pos logit contributions
Now+0.086
P+0.084
bly+0.079
sc+0.077
ww+0.074
Neg logit contributions
 smile-0.079
 question-0.078
 tell-0.078
 stomach-0.078
 call-0.077
Token mi
Token ID21504
Feature activation-1.23e+00

Pos logit contributions
 cry+0.005
 smile+0.005
 tell+0.004
 bed+0.004
 call+0.004
Neg logit contributions
P-0.010
Now-0.009
ww-0.009
bly-0.009
lish-0.009
Token mu
Token ID38779
Feature activation-2.61e+00

Pos logit contributions
P+0.075
Now+0.070
ww+0.068
bly+0.065
lish+0.064
Neg logit contributions
 smile-0.050
 cry-0.049
 tell-0.049
 call-0.047
 bed-0.045
TokenÃ
Token ID5746
Feature activation-1.07e+00

Pos logit contributions
P+0.114
Now+0.111
bly+0.109
dd+0.101
ww+0.100
Neg logit contributions
 call-0.139
 smile-0.135
 tell-0.134
 question-0.128
 cry-0.128
Token and
Token ID290
Feature activation+1.17e+00

Pos logit contributions
 cry+0.023
 smile+0.023
 tell+0.022
 question+0.021
 weeping+0.021
Neg logit contributions
P-0.041
Now-0.039
ww-0.038
sc-0.037
bly-0.036
Token look
Token ID804
Feature activation-3.86e-01

Pos logit contributions
 weeping+0.033
 question+0.031
 crying+0.031
 stomach+0.030
 bed+0.028
Neg logit contributions
P-0.074
Now-0.073
Spl-0.069
ww-0.068
Soon-0.067
Token around
Token ID1088
Feature activation+7.92e-01

Pos logit contributions
P+0.022
Now+0.021
ww+0.021
bly+0.020
lish+0.019
Neg logit contributions
 smile-0.016
 cry-0.016
 tell-0.016
 call-0.015
 bed-0.015
Token.
Token ID13
Feature activation+8.20e-02

Pos logit contributions
 cry+0.026
 tell+0.025
 call+0.024
 stomach+0.024
 question+0.024
Neg logit contributions
P-0.049
Now-0.047
lish-0.045
bly-0.044
Spl-0.044
Token
Token ID198
Feature activation-6.96e-01

Pos logit contributions
 cry+0.003
 smile+0.003
 tell+0.003
 call+0.002
 weep+0.002
Neg logit contributions
P-0.006
Now-0.005
ww-0.005
Soon-0.005
Spl-0.005
Token
Token ID198
Feature activation-5.29e+00

Pos logit contributions
P+0.054
ww+0.051
Now+0.051
bly+0.049
lish+0.049
Neg logit contributions
 smile-0.015
 cry-0.015
 tell-0.015
 call-0.013
 bed-0.013
Token"
Token ID1
Feature activation-3.35e+00

Pos logit contributions
rob+0.263
iest+0.259
isher+0.256
bly+0.248
ka+0.246
Neg logit contributions
-0.175
 -0.173
 mom-0.167
 find-0.162
 play-0.160
TokenWow
Token ID22017
Feature activation-7.43e-01

Pos logit contributions
ww+0.126
bly+0.125
lish+0.123
ross+0.120
izing+0.116
Neg logit contributions
 smile-0.164
 gaze-0.159
 stare-0.158
 call-0.158
 say-0.156
Token,
Token ID11
Feature activation-6.48e-01

Pos logit contributions
P+0.055
Now+0.053
ww+0.051
bly+0.050
lish+0.048
Neg logit contributions
 smile-0.020
 cry-0.019
 tell-0.019
 bed-0.018
 call-0.018
Token we
Token ID356
Feature activation-8.11e-01

Pos logit contributions
P+0.042
Now+0.040
ww+0.038
bly+0.038
lish+0.037
Neg logit contributions
 cry-0.023
 smile-0.023
 tell-0.023
 call-0.022
 bed-0.021
Token are
Token ID389
Feature activation+6.09e-02

Pos logit contributions
P+0.052
ww+0.049
Now+0.049
bly+0.047
lish+0.046
Neg logit contributions
 smile-0.029
 cry-0.029
 tell-0.029
 call-0.027
 bed-0.026
Token.
Token ID13
Feature activation+8.95e-02

Pos logit contributions
P+0.009
Now+0.009
ww+0.009
bly+0.008
lish+0.008
Neg logit contributions
 smile-0.004
 cry-0.004
 tell-0.004
 bed-0.003
 call-0.003
Token The
Token ID383
Feature activation-2.53e-01

Pos logit contributions
 smile+0.003
 cry+0.003
 tell+0.003
 call+0.002
 bed+0.002
Neg logit contributions
P-0.006
Now-0.006
ww-0.006
bly-0.005
Soon-0.005
Token vet
Token ID20202
Feature activation-1.05e-01

Pos logit contributions
P+0.016
Now+0.016
ww+0.015
bly+0.015
lish+0.014
Neg logit contributions
 smile-0.009
 cry-0.009
 tell-0.009
 call-0.008
 bed-0.008
Token showed
Token ID3751
Feature activation-4.04e-01

Pos logit contributions
P+0.007
Now+0.006
ww+0.006
bly+0.006
lish+0.006
Neg logit contributions
 cry-0.004
 smile-0.004
 tell-0.004
 bed-0.004
 call-0.003
Token Wh
Token ID854
Feature activation-1.33e+00

Pos logit contributions
P+0.027
Now+0.026
ww+0.025
bly+0.024
lish+0.024
Neg logit contributions
 smile-0.013
 cry-0.013
 tell-0.013
 call-0.012
 bed-0.012
Tokenisk
Token ID1984
Feature activation-5.23e+00

Pos logit contributions
P+0.097
ww+0.092
Now+0.091
bly+0.088
Spl+0.086
Neg logit contributions
 cry-0.034
 crying-0.033
 smile-0.032
 tell-0.032
 call-0.032
Tokeners
Token ID364
Feature activation-1.21e-01

Pos logit contributions
ww+0.200
P+0.191
Sorry+0.174
lish+0.171
Uh+0.164
Neg logit contributions
 find-0.260
 feel-0.254
 crying-0.252
 tell-0.250
 smile-0.249
Token the
Token ID262
Feature activation+4.87e-01

Pos logit contributions
P+0.007
Now+0.007
ww+0.007
bly+0.007
lish+0.006
Neg logit contributions
 cry-0.004
 smile-0.004
 tell-0.004
 call-0.004
 bed-0.004
Token x
Token ID2124
Feature activation-3.41e-01

Pos logit contributions
 smile+0.019
 tell+0.018
 cry+0.018
 bed+0.017
 call+0.017
Neg logit contributions
P-0.029
Now-0.027
ww-0.027
bly-0.025
lish-0.025
Token-
Token ID12
Feature activation-2.96e+00

Pos logit contributions
P+0.014
Now+0.013
ww+0.012
bly+0.011
lish+0.011
Neg logit contributions
 smile-0.020
 cry-0.020
 tell-0.020
 call-0.019
 bed-0.019
Tokenray
Token ID2433
Feature activation-1.79e-01

Pos logit contributions
P+0.164
ww+0.153
Now+0.152
bly+0.149
Spl+0.145
Neg logit contributions
 call-0.115
 cry-0.113
 tell-0.113
 crying-0.110
 question-0.109
Token half
Token ID2063
Feature activation-8.16e-01

Pos logit contributions
P+0.060
Now+0.057
ww+0.056
bly+0.055
sc+0.052
Neg logit contributions
 smile-0.046
 cry-0.044
 tell-0.043
 bed-0.041
 stomach-0.041
Token for
Token ID329
Feature activation+2.09e-01

Pos logit contributions
P+0.051
Now+0.049
ww+0.048
bly+0.045
lish+0.045
Neg logit contributions
 smile-0.030
 cry-0.030
 tell-0.029
 bed-0.028
 call-0.027
Token himself
Token ID2241
Feature activation+1.42e+00

Pos logit contributions
 smile+0.008
 cry+0.008
 tell+0.008
 call+0.008
 question+0.008
Neg logit contributions
P-0.012
Now-0.012
bly-0.011
ww-0.011
lish-0.011
Token.
Token ID13
Feature activation+2.75e-01

Pos logit contributions
 cry+0.047
 tell+0.045
 call+0.045
 smile+0.044
 gaze+0.044
Neg logit contributions
P-0.088
Now-0.080
ww-0.080
lish-0.079
bly-0.079
Token
Token ID198
Feature activation-3.73e-01

Pos logit contributions
 cry+0.009
 smile+0.009
 tell+0.009
 call+0.008
 weep+0.008
Neg logit contributions
P-0.019
Now-0.018
ww-0.017
Soon-0.017
Spl-0.016
Token
Token ID198
Feature activation-5.20e+00

Pos logit contributions
P+0.028
Now+0.027
ww+0.026
bly+0.025
lish+0.025
Neg logit contributions
 smile-0.009
 cry-0.009
 tell-0.009
 call-0.008
 bed-0.008
TokenThey
Token ID2990
Feature activation+1.11e+00

Pos logit contributions
rob+0.269
iest+0.252
w+0.252
ier+0.243
ka+0.240
Neg logit contributions
 find-0.167
-0.157
 play-0.155
 ask-0.154
 -0.153
Token both
Token ID1111
Feature activation+1.35e+00

Pos logit contributions
 crying+0.040
 weeping+0.040
 bed+0.039
 cry+0.038
 question+0.036
Neg logit contributions
P-0.065
Now-0.061
ww-0.056
bly-0.056
Hey-0.056
Token take
Token ID1011
Feature activation-6.72e-01

Pos logit contributions
 bed+0.039
 weeping+0.039
 crying+0.039
 eyes+0.035
 stomach+0.035
Neg logit contributions
P-0.085
Now-0.082
ww-0.078
Spl-0.076
Hey-0.076
Token a
Token ID257
Feature activation+1.35e+00

Pos logit contributions
P+0.040
Now+0.038
ww+0.037
bly+0.036
lish+0.035
Neg logit contributions
 smile-0.027
 cry-0.027
 tell-0.026
 call-0.025
 bed-0.025
Token big
Token ID1263
Feature activation+1.00e+00

Pos logit contributions
 tell+0.044
 weeping+0.042
 call+0.042
 cry+0.042
 eyes+0.041
Neg logit contributions
P-0.086
ww-0.084
sc-0.077
ha-0.077
Now-0.075
Token and
Token ID290
Feature activation+8.72e-01

Pos logit contributions
 cry+0.064
 question+0.061
 stomach+0.061
 tell+0.060
 regret+0.059
Neg logit contributions
P-0.085
bly-0.079
Now-0.078
sc-0.078
ww-0.078
Token the
Token ID262
Feature activation-2.10e-01

Pos logit contributions
 cry+0.033
 smile+0.031
 weeping+0.031
 tell+0.031
 crying+0.031
Neg logit contributions
P-0.050
Now-0.047
ww-0.046
bly-0.043
Spl-0.043
Token flowers
Token ID12734
Feature activation+1.65e+00

Pos logit contributions
P+0.012
Now+0.011
ww+0.011
bly+0.010
lish+0.010
Neg logit contributions
 cry-0.009
 smile-0.009
 tell-0.009
 call-0.008
 bed-0.008
Token.
Token ID13
Feature activation-7.90e-02

Pos logit contributions
 cry+0.061
 question+0.057
 tell+0.056
 weep+0.055
 stomach+0.054
Neg logit contributions
P-0.095
Now-0.091
bly-0.090
sc-0.086
ww-0.085
Token
Token ID198
Feature activation-5.56e-01

Pos logit contributions
P+0.005
Now+0.005
ww+0.005
Soon+0.005
bly+0.005
Neg logit contributions
 cry-0.003
 smile-0.002
 tell-0.002
 call-0.002
 crying-0.002
Token
Token ID198
Feature activation-5.20e+00

Pos logit contributions
P+0.042
Now+0.040
ww+0.040
bly+0.039
lish+0.038
Neg logit contributions
 smile-0.013
 cry-0.013
 tell-0.012
 call-0.011
 bed-0.011
Token"
Token ID1
Feature activation-3.26e+00

Pos logit contributions
iest+0.240
bly+0.238
ymes+0.234
w+0.230
ww+0.228
Neg logit contributions
 -0.183
-0.182
 find-0.177
 mom-0.174
 play-0.171
TokenYou
Token ID1639
Feature activation-4.98e-01

Pos logit contributions
bly+0.138
lish+0.133
ww+0.129
w+0.125
dd+0.122
Neg logit contributions
 smile-0.158
 gaze-0.152
 tell-0.149
 call-0.148
 say-0.148
Token are
Token ID389
Feature activation-6.28e-01

Pos logit contributions
P+0.036
Now+0.035
ww+0.034
bly+0.033
lish+0.032
Neg logit contributions
 smile-0.013
 cry-0.013
 tell-0.013
 call-0.012
 bed-0.012
Token a
Token ID257
Feature activation-1.32e+00

Pos logit contributions
P+0.038
Now+0.036
ww+0.035
bly+0.034
lish+0.033
Neg logit contributions
 smile-0.025
 cry-0.025
 tell-0.024
 call-0.023
 bed-0.023
Token great
Token ID1049
Feature activation+1.47e-01

Pos logit contributions
P+0.078
Now+0.076
bly+0.072
ww+0.071
lish+0.070
Neg logit contributions
 smile-0.055
 tell-0.052
 cry-0.052
 call-0.050
 question-0.048
Token it
Token ID340
Feature activation-1.85e+00

Pos logit contributions
P+0.109
ww+0.105
Now+0.103
lish+0.098
Too+0.096
Neg logit contributions
 smile-0.054
 bed-0.050
 cry-0.049
 tell-0.049
 eyes-0.046
Token on
Token ID319
Feature activation-3.03e-01

Pos logit contributions
P+0.111
ww+0.108
Now+0.106
Too+0.097
lish+0.096
Neg logit contributions
 smile-0.077
 cry-0.073
 tell-0.072
 bed-0.070
 crying-0.070
Token the
Token ID262
Feature activation-6.88e-01

Pos logit contributions
P+0.020
Now+0.019
ww+0.018
bly+0.018
lish+0.017
Neg logit contributions
 smile-0.010
 cry-0.010
 tell-0.010
 call-0.010
 crying-0.009
Token castle
Token ID16669
Feature activation+3.80e-02

Pos logit contributions
P+0.044
Now+0.041
ww+0.041
bly+0.039
lish+0.038
Neg logit contributions
 smile-0.026
 cry-0.025
 tell-0.024
 bed-0.024
 call-0.024
Token.
Token ID13
Feature activation+5.60e-02

Pos logit contributions
 cry+0.001
 smile+0.001
 tell+0.001
 call+0.001
 crying+0.001
Neg logit contributions
P-0.003
Now-0.003
ww-0.003
bly-0.002
sc-0.002
Token "
Token ID366
Feature activation-5.19e+00

Pos logit contributions
 cry+0.002
 smile+0.002
 tell+0.002
 call+0.002
 weeping+0.002
Neg logit contributions
P-0.004
Now-0.003
ww-0.003
Spl-0.003
Soon-0.003
TokenThere
Token ID1858
Feature activation-4.56e-01

Pos logit contributions
ier+0.186
lish+0.182
iest+0.180
ing+0.179
izing+0.178
Neg logit contributions
 cry-0.231
 wonder-0.228
 gaze-0.228
 know-0.228
 search-0.227
Token,
Token ID11
Feature activation-2.23e+00

Pos logit contributions
P+0.030
Now+0.028
ww+0.028
bly+0.027
lish+0.026
Neg logit contributions
 smile-0.015
 cry-0.015
 tell-0.015
 call-0.014
 bed-0.014
Token now
Token ID783
Feature activation-1.53e+00

Pos logit contributions
P+0.153
lish+0.143
Now+0.139
ww+0.136
bly+0.136
Neg logit contributions
 tell-0.080
 smile-0.077
 bed-0.076
 cry-0.071
 call-0.071
Token it
Token ID340
Feature activation-1.37e+00

Pos logit contributions
P+0.091
Now+0.086
bly+0.082
ww+0.079
lish+0.079
Neg logit contributions
 tell-0.065
 smile-0.065
 cry-0.063
 bed-0.060
 call-0.059
Token's
Token ID338
Feature activation-7.89e-01

Pos logit contributions
P+0.093
Now+0.088
ww+0.086
bly+0.084
lish+0.081
Neg logit contributions
 cry-0.046
 smile-0.045
 tell-0.045
 call-0.043
 crying-0.040
Token.
Token ID13
Feature activation+2.83e-01

Pos logit contributions
P+0.116
ww+0.112
Now+0.109
bly+0.106
lish+0.105
Neg logit contributions
 smile-0.025
 cry-0.023
 tell-0.021
 bed-0.020
 crying-0.020
Token Whe
Token ID5792
Feature activation-6.35e-01

Pos logit contributions
 cry+0.010
 smile+0.010
 tell+0.010
 weep+0.010
 weeping+0.010
Neg logit contributions
P-0.018
Now-0.017
ww-0.016
Soon-0.016
Spl-0.016
Tokene
Token ID68
Feature activation-1.54e+00

Pos logit contributions
P+0.030
Now+0.028
ww+0.027
bly+0.026
lish+0.026
Neg logit contributions
 smile-0.033
 cry-0.033
 tell-0.033
 call-0.031
 bed-0.031
Token!
Token ID0
Feature activation+1.30e-01

Pos logit contributions
P+0.096
Now+0.091
ww+0.086
bly+0.085
Soon+0.083
Neg logit contributions
 smile-0.060
 cry-0.059
 tell-0.056
 bed-0.054
 call-0.054
Token
Token ID198
Feature activation+5.42e-01

Pos logit contributions
 smile+0.004
 cry+0.004
 tell+0.004
 call+0.004
 weeping+0.004
Neg logit contributions
P-0.009
Now-0.009
ww-0.008
Spl-0.008
Soon-0.008
Token
Token ID198
Feature activation-5.18e+00

Pos logit contributions
 smile+0.017
 cry+0.017
 tell+0.016
 weeping+0.016
 stomach+0.016
Neg logit contributions
P-0.037
Now-0.036
ww-0.033
Soon-0.033
Spl-0.032
Token"
Token ID1
Feature activation-3.50e+00

Pos logit contributions
rob+0.254
w+0.235
ymes+0.234
plets+0.232
iest+0.232
Neg logit contributions
 find-0.193
 ask-0.175
 mom-0.174
 call-0.173
 play-0.172
TokenWow
Token ID22017
Feature activation-7.62e-01

Pos logit contributions
bly+0.151
lish+0.142
izing+0.133
ww+0.129
bling+0.128
Neg logit contributions
 smile-0.167
 gaze-0.164
 stare-0.160
 -0.156
 cry-0.155
Token,
Token ID11
Feature activation-5.09e-01

Pos logit contributions
P+0.054
Now+0.052
ww+0.050
bly+0.049
lish+0.048
Neg logit contributions
 smile-0.023
 cry-0.022
 tell-0.022
 bed-0.020
 call-0.020
Token Tom
Token ID4186
Feature activation-1.60e-01

Pos logit contributions
P+0.034
Now+0.033
ww+0.032
bly+0.031
lish+0.030
Neg logit contributions
 smile-0.017
 cry-0.017
 tell-0.016
 call-0.015
 bed-0.015
Token,
Token ID11
Feature activation-1.22e+00

Pos logit contributions
P+0.009
ww+0.009
Now+0.009
bly+0.008
lish+0.008
Neg logit contributions
 cry-0.006
 smile-0.006
 tell-0.006
 call-0.006
 bed-0.006
Token like
Token ID588
Feature activation-9.33e-01

Pos logit contributions
P+0.044
Now+0.041
ww+0.041
bly+0.039
lish+0.038
Neg logit contributions
 smile-0.028
 cry-0.027
 tell-0.027
 call-0.025
 bed-0.025
Token a
Token ID257
Feature activation-2.50e-01

Pos logit contributions
P+0.058
Now+0.054
ww+0.054
bly+0.051
lish+0.051
Neg logit contributions
 smile-0.036
 cry-0.035
 tell-0.034
 bed-0.033
 call-0.033
Token clown
Token ID25573
Feature activation+3.26e-01

Pos logit contributions
P+0.015
Now+0.014
ww+0.014
bly+0.013
lish+0.013
Neg logit contributions
 cry-0.010
 smile-0.010
 tell-0.010
 call-0.009
 bed-0.009
Token.
Token ID13
Feature activation-2.40e-01

Pos logit contributions
 cry+0.011
 smile+0.011
 tell+0.011
 bed+0.010
 question+0.010
Neg logit contributions
P-0.021
ww-0.020
Now-0.020
bly-0.019
lish-0.019
Token
Token ID198
Feature activation+9.34e-03

Pos logit contributions
P+0.017
Now+0.016
ww+0.016
bly+0.015
lish+0.015
Neg logit contributions
 cry-0.007
 smile-0.007
 tell-0.007
 call-0.006
 bed-0.006
Token
Token ID198
Feature activation-5.15e+00

Pos logit contributions
 smile+0.000
 cry+0.000
 tell+0.000
 call+0.000
 stomach+0.000
Neg logit contributions
P-0.001
Now-0.001
ww-0.001
Soon-0.001
bly-0.001
TokenThey
Token ID2990
Feature activation-1.25e-01

Pos logit contributions
ka+0.256
iest+0.254
ww+0.251
umm+0.243
ymes+0.243
Neg logit contributions
 find-0.170
-0.163
 -0.156
 play-0.149
 mom-0.145
Token ran
Token ID4966
Feature activation-7.14e-01

Pos logit contributions
P+0.008
Now+0.007
ww+0.007
bly+0.007
Spl+0.007
Neg logit contributions
 cry-0.005
 smile-0.005
 tell-0.005
 bed-0.004
 call-0.004
Token around
Token ID1088
Feature activation+8.38e-01

Pos logit contributions
P+0.047
Now+0.043
ww+0.043
bly+0.042
lish+0.040
Neg logit contributions
 smile-0.026
 cry-0.025
 tell-0.025
 call-0.024
 bed-0.024
Token the
Token ID262
Feature activation-6.78e-01

Pos logit contributions
 cry+0.034
 stomach+0.033
 smile+0.033
 tell+0.032
 call+0.031
Neg logit contributions
Now-0.045
P-0.044
Spl-0.040
Soon-0.040
sc-0.039
Token room
Token ID2119
Feature activation+1.07e+00

Pos logit contributions
P+0.053
ww+0.052
Now+0.051
bly+0.050
lish+0.049
Neg logit contributions
 smile-0.014
 cry-0.014
 tell-0.014
 bed-0.013
 call-0.013
Token in
Token ID287
Feature activation-1.44e-02

Pos logit contributions
P+0.079
ww+0.074
Now+0.074
bly+0.069
Spl+0.068
Neg logit contributions
 smile-0.049
 cry-0.047
 tell-0.047
 crying-0.045
 bed-0.045
Token laughter
Token ID20263
Feature activation-8.91e-01

Pos logit contributions
P+0.001
Now+0.001
ww+0.001
bly+0.001
Spl+0.001
Neg logit contributions
 smile-0.000
 cry-0.000
 tell-0.000
 call-0.000
 stomach-0.000
Token and
Token ID290
Feature activation-4.45e-02

Pos logit contributions
P+0.062
Now+0.059
ww+0.058
bly+0.056
Spl+0.054
Neg logit contributions
 smile-0.028
 cry-0.027
 tell-0.026
 call-0.025
 bed-0.025
Token said
Token ID531
Feature activation-1.28e+00

Pos logit contributions
P+0.003
Now+0.003
ww+0.003
bly+0.002
lish+0.002
Neg logit contributions
 cry-0.002
 smile-0.002
 tell-0.002
 call-0.002
 bed-0.002
Token,
Token ID11
Feature activation-1.10e+00

Pos logit contributions
P+0.097
Now+0.092
ww+0.091
bly+0.086
Spl+0.086
Neg logit contributions
 smile-0.033
 cry-0.031
 tell-0.031
 crying-0.029
 call-0.028
Token '
Token ID705
Feature activation-5.15e+00

Pos logit contributions
P+0.076
ww+0.071
Now+0.070
bly+0.067
lish+0.067
Neg logit contributions
 smile-0.035
 cry-0.034
 tell-0.034
 call-0.032
 bed-0.031
TokenThese
Token ID4711
Feature activation-1.62e+00

Pos logit contributions
bly+0.150
ross+0.140
lish+0.137
bridge+0.136
CR+0.135
Neg logit contributions
 smile-0.283
 tell-0.268
 -0.264
 say-0.264
 feel-0.262
Token new
Token ID649
Feature activation-1.42e+00

Pos logit contributions
P+0.116
Now+0.112
ww+0.108
bly+0.106
lish+0.104
Neg logit contributions
 smile-0.049
 cry-0.047
 tell-0.046
 call-0.044
 bed-0.042
Token clothes
Token ID8242
Feature activation-2.27e+00

Pos logit contributions
P+0.087
Now+0.083
ww+0.081
bly+0.079
lish+0.077
Neg logit contributions
 smile-0.057
 tell-0.055
 cry-0.054
 call-0.052
 bed-0.052
Token look
Token ID804
Feature activation-3.47e+00

Pos logit contributions
P+0.153
Now+0.142
ww+0.138
Spl+0.133
lish+0.133
Neg logit contributions
 smile-0.081
 tell-0.077
 cry-0.076
 call-0.072
 feel-0.070
Token incredible
Token ID8082
Feature activation-2.97e+00

Pos logit contributions
P+0.222
Now+0.210
Spl+0.196
ww+0.194
CR+0.187
Neg logit contributions
 smile-0.140
 tell-0.133
 cry-0.125
 call-0.125
 feel-0.123
Token in
Token ID287
Feature activation-9.16e-01

Pos logit contributions
P+0.074
Now+0.071
ww+0.069
bly+0.067
lish+0.066
Neg logit contributions
 smile-0.024
 cry-0.022
 tell-0.022
 call-0.021
 bed-0.021
Token the
Token ID262
Feature activation-1.14e+00

Pos logit contributions
P+0.064
ww+0.061
Now+0.059
lish+0.058
bly+0.057
Neg logit contributions
 smile-0.028
 cry-0.028
 tell-0.027
 bed-0.027
 call-0.025
Token desert
Token ID10326
Feature activation-4.45e-01

Pos logit contributions
P+0.070
ww+0.067
Now+0.066
bly+0.066
lish+0.063
Neg logit contributions
 smile-0.044
 bed-0.041
 cry-0.041
 tell-0.040
 stomach-0.040
Token.
Token ID13
Feature activation-6.10e-01

Pos logit contributions
P+0.034
Now+0.032
ww+0.032
bly+0.031
lish+0.030
Neg logit contributions
 smile-0.011
 cry-0.011
 tell-0.010
 call-0.010
 bed-0.009
Token
Token ID198
Feature activation-2.87e-01

Pos logit contributions
P+0.047
Now+0.044
ww+0.044
bly+0.043
lish+0.042
Neg logit contributions
 smile-0.014
 cry-0.014
 tell-0.014
 call-0.013
 bed-0.013
Token
Token ID198
Feature activation-5.15e+00

Pos logit contributions
P+0.021
Now+0.020
ww+0.020
bly+0.019
lish+0.019
Neg logit contributions
 smile-0.007
 cry-0.007
 tell-0.007
 call-0.007
 bed-0.006
Token"
Token ID1
Feature activation-3.37e+00

Pos logit contributions
iest+0.260
bly+0.252
w+0.246
ww+0.243
ka+0.235
Neg logit contributions
-0.175
 -0.174
 find-0.170
 mom-0.168
 see-0.155
TokenWow
Token ID22017
Feature activation-6.44e-01

Pos logit contributions
bly+0.142
lish+0.139
ww+0.133
ier+0.132
dd+0.131
Neg logit contributions
 gaze-0.153
 question-0.152
 call-0.148
 search-0.145
 stare-0.144
Token,
Token ID11
Feature activation-8.65e-01

Pos logit contributions
P+0.045
Now+0.043
ww+0.042
bly+0.041
lish+0.040
Neg logit contributions
 smile-0.020
 cry-0.019
 tell-0.019
 call-0.018
 bed-0.018
Token look
Token ID804
Feature activation-1.56e+00

Pos logit contributions
P+0.057
Now+0.055
ww+0.052
bly+0.052
lish+0.051
Neg logit contributions
 smile-0.030
 cry-0.030
 tell-0.030
 call-0.028
 bed-0.028
Token at
Token ID379
Feature activation-5.59e-01

Pos logit contributions
P+0.101
Now+0.100
bly+0.093
ww+0.092
sc+0.089
Neg logit contributions
 smile-0.057
 tell-0.055
 cry-0.053
 call-0.049
 bed-0.048
Token does
Token ID857
Feature activation-5.08e-01

Pos logit contributions
P+0.055
Now+0.051
ww+0.050
bly+0.049
lish+0.048
Neg logit contributions
 smile-0.030
 cry-0.030
 tell-0.030
 call-0.029
 bed-0.027
Token.
Token ID13
Feature activation-1.65e+00

Pos logit contributions
P+0.033
Now+0.031
ww+0.031
bly+0.029
lish+0.029
Neg logit contributions
 smile-0.018
 cry-0.018
 tell-0.017
 call-0.016
 bed-0.016
Tokenâ
Token ID22940
Feature activation-2.32e+00

Pos logit contributions
P+0.102
ww+0.095
Now+0.094
bly+0.092
lish+0.091
Neg logit contributions
 smile-0.062
 tell-0.061
 cry-0.060
 call-0.059
 bed-0.059
Token
Token ID26391
Feature activation-6.03e-01

Pos logit contributions
P+0.116
bly+0.106
ww+0.104
Now+0.098
Soon+0.098
Neg logit contributions
 tell-0.109
 smile-0.108
 cry-0.106
 call-0.106
 say-0.104
Token
Token ID198
Feature activation-1.69e+00

Pos logit contributions
P+0.036
Now+0.034
ww+0.033
bly+0.032
lish+0.031
Neg logit contributions
 smile-0.024
 cry-0.024
 tell-0.024
 call-0.023
 bed-0.023
Token
Token ID198
Feature activation-5.15e+00

Pos logit contributions
P+0.136
ww+0.133
lish+0.131
bly+0.130
Now+0.127
Neg logit contributions
 tell-0.024
 smile-0.024
 bed-0.023
 cry-0.023
 call-0.021
TokenThey
Token ID2990
Feature activation-6.79e-01

Pos logit contributions
ier+0.219
w+0.218
iest+0.211
rier+0.210
lish+0.208
Neg logit contributions
 -0.222
-0.204
 tell-0.196
 search-0.191
 know-0.188
Token kept
Token ID4030
Feature activation+2.17e-01

Pos logit contributions
P+0.042
Now+0.039
ww+0.039
bly+0.037
lish+0.036
Neg logit contributions
 smile-0.026
 cry-0.026
 tell-0.025
 call-0.024
 bed-0.024
Token walking
Token ID6155
Feature activation-1.74e+00

Pos logit contributions
 cry+0.008
 smile+0.007
 tell+0.007
 stomach+0.007
 call+0.007
Neg logit contributions
P-0.014
Now-0.013
ww-0.013
bly-0.012
Spl-0.012
Token until
Token ID1566
Feature activation-1.15e+00

Pos logit contributions
P+0.128
ww+0.124
Now+0.120
bly+0.116
lish+0.114
Neg logit contributions
 smile-0.052
 tell-0.047
 bed-0.046
 cry-0.045
 call-0.044
Token Jack
Token ID3619
Feature activation+2.34e-01

Pos logit contributions
P+0.076
Now+0.073
ww+0.071
bly+0.069
lish+0.068
Neg logit contributions
 smile-0.040
 bed-0.038
 cry-0.038
 tell-0.038
 call-0.036
Token to
Token ID284
Feature activation+1.54e+00

Pos logit contributions
P+0.111
ww+0.110
Now+0.106
bly+0.104
lish+0.102
Neg logit contributions
 smile-0.045
 tell-0.042
 call-0.040
 bed-0.040
 cry-0.039
Token the
Token ID262
Feature activation-6.82e-01

Pos logit contributions
 weeping+0.073
 stomach+0.073
 cry+0.070
 throat+0.069
 smile+0.069
Neg logit contributions
P-0.067
Now-0.064
A-0.062
Soon-0.061
Good-0.058
Token park
Token ID3952
Feature activation+1.15e+00

Pos logit contributions
P+0.045
Now+0.043
ww+0.043
bly+0.041
lish+0.040
Neg logit contributions
 smile-0.023
 cry-0.023
 tell-0.022
 call-0.022
 bed-0.021
Token.
Token ID13
Feature activation+1.81e-02

Pos logit contributions
 cry+0.041
 weeping+0.038
 stomach+0.038
 smile+0.037
 regret+0.037
Neg logit contributions
P-0.067
Now-0.065
sc-0.064
Spl-0.061
Soon-0.061
Token
Token ID198
Feature activation-4.04e-01

Pos logit contributions
 cry+0.001
 smile+0.001
 tell+0.001
 call+0.001
 crying+0.001
Neg logit contributions
P-0.001
Now-0.001
ww-0.001
Spl-0.001
bly-0.001
Token
Token ID198
Feature activation-5.14e+00

Pos logit contributions
P+0.030
Now+0.029
ww+0.029
bly+0.028
lish+0.027
Neg logit contributions
 smile-0.010
 cry-0.010
 tell-0.009
 call-0.009
 bed-0.009
TokenAt
Token ID2953
Feature activation+5.63e-01

Pos logit contributions
iest+0.250
rob+0.246
ka+0.240
w+0.239
ww+0.238
Neg logit contributions
 find-0.204
-0.188
 -0.182
 play-0.171
 mom-0.169
Token the
Token ID262
Feature activation-5.79e-01

Pos logit contributions
 cry+0.022
 smile+0.022
 stomach+0.021
 tell+0.021
 weeping+0.020
Neg logit contributions
P-0.033
Now-0.031
ww-0.030
Soon-0.030
bly-0.029
Token park
Token ID3952
Feature activation-1.67e-01

Pos logit contributions
P+0.041
Now+0.039
ww+0.038
bly+0.037
lish+0.036
Neg logit contributions
 smile-0.017
 cry-0.017
 tell-0.017
 call-0.016
 bed-0.016
Token,
Token ID11
Feature activation-1.77e+00

Pos logit contributions
P+0.013
Now+0.012
ww+0.012
bly+0.011
lish+0.011
Neg logit contributions
 cry-0.004
 smile-0.004
 tell-0.004
 call-0.004
 bed-0.003
Token they
Token ID484
Feature activation+7.50e-02

Pos logit contributions
P+0.126
Now+0.121
ww+0.120
bly+0.113
lish+0.112
Neg logit contributions
 smile-0.053
 bed-0.051
 tell-0.051
 mom-0.050
 cry-0.049
Token and
Token ID290
Feature activation-5.68e-01

Pos logit contributions
P+0.020
Now+0.019
ww+0.019
bly+0.018
lish+0.017
Neg logit contributions
 smile-0.011
 cry-0.011
 tell-0.010
 call-0.010
 bed-0.010
Token a
Token ID257
Feature activation+4.42e-01

Pos logit contributions
P+0.035
Now+0.033
ww+0.032
bly+0.031
lish+0.030
Neg logit contributions
 smile-0.022
 cry-0.022
 tell-0.021
 call-0.020
 bed-0.020
Token grape
Token ID30777
Feature activation+1.54e-01

Pos logit contributions
 tell+0.017
 cry+0.017
 smile+0.016
 call+0.016
 weeping+0.016
Neg logit contributions
P-0.026
ww-0.025
Now-0.024
bly-0.023
sc-0.022
Token.
Token ID13
Feature activation+1.95e-01

Pos logit contributions
 cry+0.005
 tell+0.005
 smile+0.005
 call+0.005
 question+0.005
Neg logit contributions
P-0.010
Now-0.009
ww-0.009
bly-0.009
lish-0.009
Token
Token ID198
Feature activation+3.97e-01

Pos logit contributions
 cry+0.007
 smile+0.007
 tell+0.006
 weeping+0.006
 weep+0.006
Neg logit contributions
P-0.013
Now-0.012
ww-0.012
Soon-0.011
Spl-0.011
Token
Token ID198
Feature activation-5.12e+00

Pos logit contributions
 smile+0.012
 cry+0.012
 tell+0.012
 stomach+0.011
 call+0.011
Neg logit contributions
P-0.027
Now-0.027
ww-0.025
Soon-0.025
Spl-0.024
TokenBut
Token ID1537
Feature activation-1.18e+00

Pos logit contributions
iest+0.242
ier+0.229
lish+0.219
isher+0.218
rob+0.210
Neg logit contributions
 play-0.200
 call-0.193
 tell-0.192
 find-0.189
 buy-0.186
Token Tom
Token ID4186
Feature activation+3.82e-01

Pos logit contributions
P+0.067
Now+0.062
ww+0.061
bly+0.058
sc+0.057
Neg logit contributions
 smile-0.052
 tell-0.051
 cry-0.051
 bed-0.050
 call-0.049
Token wanted
Token ID2227
Feature activation+6.82e-01

Pos logit contributions
 smile+0.015
 cry+0.015
 tell+0.015
 bed+0.015
 call+0.014
Neg logit contributions
P-0.022
Now-0.021
ww-0.020
lish-0.019
Spl-0.019
Token more
Token ID517
Feature activation+6.72e-01

Pos logit contributions
 cry+0.026
 smile+0.026
 gaze+0.025
 tell+0.025
 call+0.024
Neg logit contributions
P-0.040
Now-0.037
ww-0.036
Spl-0.034
lish-0.034
Token.
Token ID13
Feature activation-1.74e-01

Pos logit contributions
 cry+0.028
 tell+0.028
 smile+0.027
 gaze+0.027
 weeping+0.027
Neg logit contributions
P-0.036
ww-0.035
Now-0.033
lish-0.032
sc-0.031
Token and
Token ID290
Feature activation-2.48e-01

Pos logit contributions
 tell+0.041
 cry+0.040
 smile+0.039
 gaze+0.038
 weep+0.037
Neg logit contributions
P-0.052
Now-0.051
ww-0.049
Soon-0.048
Spl-0.048
Token won
Token ID1839
Feature activation-1.05e+00

Pos logit contributions
P+0.015
Now+0.014
ww+0.014
bly+0.013
lish+0.013
Neg logit contributions
 cry-0.009
 smile-0.009
 tell-0.009
 call-0.009
 bed-0.009
Token the
Token ID262
Feature activation-1.61e+00

Pos logit contributions
P+0.075
ww+0.070
Now+0.069
bly+0.066
lish+0.066
Neg logit contributions
 cry-0.032
 smile-0.031
 tell-0.031
 call-0.029
 bed-0.029
Token race
Token ID3234
Feature activation-1.08e+00

Pos logit contributions
P+0.106
Now+0.100
ww+0.096
bly+0.095
lish+0.093
Neg logit contributions
 cry-0.057
 call-0.057
 tell-0.057
 smile-0.056
 bed-0.053
Token.
Token ID13
Feature activation+2.57e-01

Pos logit contributions
P+0.082
Now+0.078
ww+0.077
bly+0.075
lish+0.073
Neg logit contributions
 smile-0.027
 cry-0.026
 tell-0.026
 call-0.025
 bed-0.024
Token "
Token ID366
Feature activation-5.11e+00

Pos logit contributions
 smile+0.009
 cry+0.009
 tell+0.009
 weep+0.008
 crying+0.008
Neg logit contributions
P-0.016
Now-0.016
ww-0.015
Soon-0.014
Spl-0.014
TokenGreat
Token ID13681
Feature activation-1.26e+00

Pos logit contributions
lish+0.211
bly+0.202
led+0.192
ww+0.178
iders+0.177
Neg logit contributions
 play-0.231
 call-0.227
 tell-0.225
 -0.224
 visit-0.221
Token job
Token ID1693
Feature activation-5.12e-01

Pos logit contributions
P+0.084
Now+0.082
bly+0.076
ww+0.074
lish+0.074
Neg logit contributions
 smile-0.044
 cry-0.042
 tell-0.042
 call-0.040
 bed-0.038
Token,
Token ID11
Feature activation-1.04e+00

Pos logit contributions
P+0.032
Now+0.030
ww+0.030
bly+0.028
lish+0.028
Neg logit contributions
 smile-0.019
 cry-0.019
 tell-0.019
 call-0.018
 bed-0.017
Token Tim
Token ID5045
Feature activation-7.34e-01

Pos logit contributions
P+0.087
Now+0.082
ww+0.080
bly+0.079
lish+0.079
Neg logit contributions
 smile-0.020
 tell-0.019
 cry-0.019
 call-0.017
 bed-0.017
Tokenmy
Token ID1820
Feature activation-1.02e+00

Pos logit contributions
P+0.051
Now+0.049
ww+0.048
bly+0.047
Spl+0.045
Neg logit contributions
 smile-0.022
 cry-0.022
 tell-0.021
 call-0.020
 bed-0.019
Token on
Token ID319
Feature activation+7.52e-01

Pos logit contributions
P+0.060
Now+0.058
ww+0.057
Spl+0.052
Soon+0.051
Neg logit contributions
 smile-0.041
 cry-0.039
 tell-0.038
 bed-0.037
 call-0.037
Token the
Token ID262
Feature activation-2.59e-02

Pos logit contributions
 tell+0.029
 cry+0.029
 smile+0.028
 weep+0.026
 call+0.026
Neg logit contributions
P-0.043
Now-0.041
bly-0.039
ww-0.039
lish-0.038
Token surface
Token ID4417
Feature activation-4.77e-01

Pos logit contributions
P+0.001
Now+0.001
ww+0.001
bly+0.001
lish+0.001
Neg logit contributions
 cry-0.001
 tell-0.001
 smile-0.001
 call-0.001
 crying-0.001
Token.
Token ID13
Feature activation-6.23e-02

Pos logit contributions
P+0.036
Now+0.034
ww+0.034
bly+0.032
lish+0.032
Neg logit contributions
 smile-0.012
 cry-0.012
 tell-0.011
 call-0.011
 bed-0.010
Token
Token ID198
Feature activation-5.22e-01

Pos logit contributions
P+0.004
Now+0.004
ww+0.004
bly+0.004
Spl+0.004
Neg logit contributions
 cry-0.002
 smile-0.002
 tell-0.002
 call-0.002
 crying-0.002
Token
Token ID198
Feature activation-5.11e+00

Pos logit contributions
P+0.040
Now+0.038
ww+0.038
bly+0.036
lish+0.036
Neg logit contributions
 smile-0.012
 cry-0.012
 tell-0.012
 call-0.011
 bed-0.011
Token"
Token ID1
Feature activation-3.07e+00

Pos logit contributions
rob+0.263
w+0.262
ww+0.250
ymes+0.248
ka+0.246
Neg logit contributions
-0.150
 find-0.147
 -0.144
 mom-0.143
 play-0.141
TokenWow
Token ID22017
Feature activation-6.60e-01

Pos logit contributions
lish+0.117
ross+0.109
bly+0.108
ww+0.105
sc+0.103
Neg logit contributions
 smile-0.162
 gaze-0.159
 stare-0.156
 tell-0.156
 say-0.155
Token,
Token ID11
Feature activation-2.57e-01

Pos logit contributions
P+0.048
Now+0.045
ww+0.044
bly+0.043
lish+0.042
Neg logit contributions
 smile-0.019
 cry-0.018
 tell-0.018
 call-0.017
 bed-0.017
Token it
Token ID340
Feature activation+3.58e-01

Pos logit contributions
P+0.017
ww+0.016
Now+0.016
bly+0.015
lish+0.015
Neg logit contributions
 smile-0.008
 cry-0.008
 tell-0.008
 call-0.008
 bed-0.008
Token is
Token ID318
Feature activation-3.51e-02

Pos logit contributions
 smile+0.014
 cry+0.014
 bed+0.013
 tell+0.013
 crying+0.013
Neg logit contributions
P-0.021
ww-0.020
Now-0.020
bly-0.018
lish-0.018
Token fast
Token ID3049
Feature activation-8.65e-02

Pos logit contributions
P+0.065
ww+0.061
Now+0.061
bly+0.058
lish+0.057
Neg logit contributions
 smile-0.031
 cry-0.030
 tell-0.030
 bed-0.029
 call-0.028
Token and
Token ID290
Feature activation-9.44e-01

Pos logit contributions
P+0.005
Now+0.005
ww+0.005
bly+0.005
lish+0.005
Neg logit contributions
 smile-0.003
 cry-0.003
 tell-0.003
 call-0.003
 bed-0.003
Token careful
Token ID8161
Feature activation-1.54e-01

Pos logit contributions
P+0.057
Now+0.054
ww+0.053
bly+0.051
lish+0.049
Neg logit contributions
 smile-0.038
 cry-0.037
 tell-0.037
 call-0.035
 bed-0.035
Token.
Token ID13
Feature activation+1.68e-01

Pos logit contributions
P+0.009
Now+0.009
ww+0.009
bly+0.009
lish+0.008
Neg logit contributions
 smile-0.006
 cry-0.006
 tell-0.005
 call-0.005
 bed-0.005
Token
Token ID198
Feature activation+1.34e-01

Pos logit contributions
 cry+0.006
 smile+0.006
 tell+0.006
 call+0.005
 weep+0.005
Neg logit contributions
P-0.011
Now-0.011
ww-0.010
Soon-0.010
Spl-0.010
Token
Token ID198
Feature activation-5.11e+00

Pos logit contributions
 smile+0.004
 cry+0.004
 tell+0.004
 call+0.003
 stomach+0.003
Neg logit contributions
P-0.010
Now-0.009
ww-0.009
Soon-0.009
bly-0.008
TokenAt
Token ID2953
Feature activation+8.76e-01

Pos logit contributions
bly+0.219
ka+0.217
isher+0.216
w+0.215
iest+0.207
Neg logit contributions
 find-0.199
 call-0.196
 -0.195
-0.190
 Her-0.190
Token the
Token ID262
Feature activation+1.33e-01

Pos logit contributions
 cry+0.037
 smile+0.036
 stomach+0.034
 question+0.034
 weep+0.034
Neg logit contributions
P-0.048
Now-0.047
Soon-0.045
ww-0.045
bly-0.042
Token hospital
Token ID4436
Feature activation+1.09e+00

Pos logit contributions
 smile+0.004
 cry+0.004
 tell+0.004
 weeping+0.003
 crying+0.003
Neg logit contributions
P-0.009
Now-0.009
ww-0.009
Soon-0.008
lish-0.008
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
 smile+0.032
 cry+0.032
 tell+0.029
 weeping+0.029
 stomach+0.028
Neg logit contributions
P-0.068
Now-0.066
ww-0.066
Soon-0.064
lish-0.063
Token Lily
Token ID20037
Feature activation-1.10e-01

Pos logit contributions
P+0.078
Now+0.073
ww+0.073
bly+0.070
lish+0.068
Neg logit contributions
 smile-0.032
 cry-0.031
 tell-0.031
 bed-0.030
 call-0.029
Token.
Token ID13
Feature activation-6.60e-01

Pos logit contributions
P+0.044
Now+0.041
ww+0.041
bly+0.039
lish+0.038
Neg logit contributions
 smile-0.020
 cry-0.020
 tell-0.019
 call-0.018
 bed-0.018
Token He
Token ID679
Feature activation+1.23e-01

Pos logit contributions
P+0.044
Now+0.042
ww+0.041
bly+0.040
lish+0.039
Neg logit contributions
 smile-0.021
 cry-0.021
 tell-0.021
 call-0.020
 bed-0.020
Token is
Token ID318
Feature activation+1.73e-01

Pos logit contributions
 smile+0.005
 cry+0.005
 bed+0.005
 tell+0.005
 stomach+0.005
Neg logit contributions
P-0.007
Now-0.007
ww-0.007
bly-0.006
lish-0.006
Token not
Token ID407
Feature activation+3.97e-01

Pos logit contributions
 cry+0.007
 smile+0.007
 tell+0.007
 stomach+0.007
 gaze+0.007
Neg logit contributions
P-0.010
Now-0.009
bly-0.009
ww-0.009
Soon-0.008
Token going
Token ID1016
Feature activation-1.54e-02

Pos logit contributions
 smile+0.017
 cry+0.017
 tell+0.016
 gaze+0.016
 stomach+0.016
Neg logit contributions
P-0.021
ww-0.020
Now-0.020
bly-0.019
lish-0.018
Token to
Token ID284
Feature activation+2.21e+00

Pos logit contributions
P+0.001
Now+0.001
ww+0.001
bly+0.001
lish+0.001
Neg logit contributions
 smile-0.000
 cry-0.000
 tell-0.000
 call-0.000
 bed-0.000
Token hurt
Token ID5938
Feature activation-4.86e-01

Pos logit contributions
 stomach+0.067
 smile+0.064
 mouth+0.063
 eyes+0.063
sed+0.062
Neg logit contributions
P-0.123
ww-0.120
Now-0.120
A-0.118
CR-0.115
Token us
Token ID514
Feature activation+3.02e-01

Pos logit contributions
P+0.028
Now+0.027
ww+0.026
bly+0.025
lish+0.024
Neg logit contributions
 smile-0.020
 cry-0.020
 tell-0.020
 call-0.019
 bed-0.019
Token."
Token ID526
Feature activation+1.04e+00

Pos logit contributions
 smile+0.009
 cry+0.009
 tell+0.008
 gaze+0.008
 call+0.008
Neg logit contributions
P-0.020
Now-0.019
ww-0.019
bly-0.019
lish-0.018
Token\n
Token ID198
Feature activation+1.20e+00

Pos logit contributions
 smile+0.038
 cry+0.038
 tell+0.037
 call+0.037
 weeping+0.037
Neg logit contributions
P-0.062
Now-0.060
Soon-0.055
A-0.054
Spl-0.054
Token\n
Token ID198
Feature activation-1.76e+00

Pos logit contributions
 smile+0.045
 stomach+0.044
 cry+0.044
 gaze+0.043
 tell+0.043
Neg logit contributions
P-0.071
Now-0.071
Soon-0.066
A-0.063
ww-0.062
Token.
Token ID13
Feature activation+3.93e-01

Pos logit contributions
 smile+0.017
 cry+0.017
 tell+0.017
 bed+0.017
 gaze+0.017
Neg logit contributions
P-0.037
lish-0.036
Now-0.035
ww-0.035
bly-0.034
Token They
Token ID1119
Feature activation+8.99e-01

Pos logit contributions
 smile+0.017
 tell+0.016
 cry+0.016
 stomach+0.015
 call+0.015
Neg logit contributions
P-0.022
Now-0.022
ww-0.021
Soon-0.020
Spl-0.020
Token apologized
Token ID22893
Feature activation+3.64e-01

Pos logit contributions
 bed+0.039
 smile+0.037
 question+0.037
 cry+0.037
 stomach+0.037
Neg logit contributions
P-0.048
Now-0.044
ww-0.042
bly-0.040
Hey-0.040
Token to
Token ID284
Feature activation+1.50e+00

Pos logit contributions
 cry+0.013
 smile+0.012
 tell+0.012
 call+0.012
 gaze+0.012
Neg logit contributions
P-0.022
Now-0.021
ww-0.021
lish-0.020
Soon-0.020
Token Mr
Token ID1770
Feature activation+1.09e+00

Pos logit contributions
 smile+0.060
 cry+0.058
 question+0.058
 weep+0.056
 weeping+0.056
Neg logit contributions
P-0.080
Now-0.078
O-0.073
ww-0.072
Soon-0.071
Token.
Token ID13
Feature activation+2.34e+00

Pos logit contributions
 cry+0.037
 tell+0.037
 smile+0.037
 bed+0.034
 gaze+0.034
Neg logit contributions
P-0.071
lish-0.065
ww-0.064
Now-0.063
bly-0.062
Token Brown
Token ID4373
Feature activation+2.86e+00

Pos logit contributions
 smile+0.085
 cry+0.083
 tell+0.082
 know+0.077
 weep+0.076
Neg logit contributions
P-0.121
Now-0.115
ww-0.114
bly-0.111
Good-0.110
Token and
Token ID290
Feature activation+7.38e-01

Pos logit contributions
 gaze+0.113
 question+0.111
 bed+0.110
 stare+0.107
 search+0.103
Neg logit contributions
lish-0.129
ers-0.121
bly-0.117
P-0.116
in-0.112
Token promised
Token ID8072
Feature activation+1.09e+00

Pos logit contributions
 cry+0.028
 bed+0.028
 smile+0.028
 stomach+0.027
 weeping+0.026
Neg logit contributions
P-0.043
ww-0.040
Now-0.039
bly-0.036
Spl-0.036
Token to
Token ID284
Feature activation+1.39e+00

Pos logit contributions
 smile+0.039
 cry+0.039
 stomach+0.038
 tell+0.038
 bed+0.037
Neg logit contributions
P-0.061
Now-0.059
Soon-0.057
ww-0.055
A-0.055
Token never
Token ID1239
Feature activation+2.24e+00

Pos logit contributions
 bed+0.046
 stomach+0.045
 weeping+0.044
 eyes+0.044
 crying+0.043
Neg logit contributions
P-0.083
Now-0.080
ww-0.075
bly-0.075
Soon-0.073
Token Then
Token ID3244
Feature activation+2.12e-01

Pos logit contributions
 smile+0.010
 cry+0.010
 tell+0.010
 call+0.009
 weep+0.009
Neg logit contributions
P-0.015
Now-0.015
ww-0.014
Soon-0.014
Spl-0.014
Token,
Token ID11
Feature activation-3.72e-01

Pos logit contributions
 cry+0.008
 smile+0.008
 tell+0.008
 call+0.008
 weeping+0.008
Neg logit contributions
P-0.012
ww-0.012
Now-0.011
lish-0.011
bly-0.011
Token they
Token ID484
Feature activation+1.52e+00

Pos logit contributions
P+0.022
Now+0.021
ww+0.020
bly+0.019
lish+0.019
Neg logit contributions
 smile-0.015
 cry-0.015
 tell-0.015
 call-0.014
 bed-0.014
Token sneak
Token ID20528
Feature activation-4.24e-01

Pos logit contributions
 weeping+0.044
 crying+0.043
 stomach+0.040
 bed+0.040
 eyes+0.038
Neg logit contributions
P-0.096
Now-0.090
sc-0.086
Hey-0.086
Spl-0.085
Token behind
Token ID2157
Feature activation+5.32e-01

Pos logit contributions
P+0.026
Now+0.024
ww+0.024
bly+0.023
lish+0.022
Neg logit contributions
 smile-0.017
 cry-0.017
 tell-0.016
 call-0.016
 bed-0.015
Token her
Token ID607
Feature activation+2.27e+00

Pos logit contributions
 cry+0.021
 smile+0.021
 tell+0.021
 call+0.020
 question+0.020
Neg logit contributions
P-0.030
Now-0.029
ww-0.027
bly-0.026
Soon-0.026
Token and
Token ID290
Feature activation+1.17e+00

Pos logit contributions
 regret+0.092
 stare+0.087
 gaze+0.087
 weeping+0.087
 weep+0.087
Neg logit contributions
P-0.097
sc-0.095
Now-0.091
br-0.089
it-0.088
Token tick
Token ID4378
Feature activation-1.44e+00

Pos logit contributions
 stomach+0.033
 weeping+0.033
 bed+0.030
 smile+0.030
 question+0.030
Neg logit contributions
P-0.072
Now-0.069
sc-0.068
CR-0.067
ww-0.066
Tokenle
Token ID293
Feature activation-1.45e-01

Pos logit contributions
P+0.063
Now+0.062
ww+0.060
Soon+0.055
Sorry+0.054
Neg logit contributions
 cry-0.077
 smile-0.077
 eyes-0.076
 bed-0.075
 tell-0.074
Token her
Token ID607
Feature activation+7.68e-01

Pos logit contributions
P+0.010
Now+0.010
ww+0.009
bly+0.009
lish+0.009
Neg logit contributions
 smile-0.004
 cry-0.004
 tell-0.004
 call-0.004
 bed-0.004
Token.
Token ID13
Feature activation+3.34e-01

Pos logit contributions
 tell+0.025
 cry+0.024
 weeping+0.024
 regret+0.024
 crying+0.023
Neg logit contributions
P-0.046
Now-0.044
bly-0.043
ww-0.043
sc-0.042
Token explore
Token ID7301
Feature activation-4.59e-01

Pos logit contributions
 cry+0.024
 smile+0.023
 crying+0.023
 weeping+0.023
 tell+0.022
Neg logit contributions
P-0.059
Now-0.058
ww-0.056
bly-0.054
Soon-0.053
Token the
Token ID262
Feature activation-9.80e-01

Pos logit contributions
P+0.032
Now+0.030
ww+0.030
bly+0.028
lish+0.028
Neg logit contributions
 smile-0.014
 cry-0.014
 tell-0.014
 call-0.013
 bed-0.013
Token world
Token ID995
Feature activation-1.12e+00

Pos logit contributions
P+0.060
Now+0.056
ww+0.056
bly+0.054
lish+0.052
Neg logit contributions
 smile-0.038
 cry-0.037
 tell-0.037
 call-0.036
 bed-0.036
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
P+0.074
ww+0.069
Now+0.069
bly+0.065
lish+0.064
Neg logit contributions
 smile-0.040
 cry-0.038
 tell-0.038
 call-0.036
 bed-0.036
Token<|endoftext|>
Token ID50256
Feature activation+8.57e-01

Pos logit contributions
 cry+0.036
 smile+0.036
 tell+0.034
 call+0.033
 crying+0.032
Neg logit contributions
P-0.064
Now-0.064
ww-0.060
Soon-0.057
Spl-0.056
TokenOnce
Token ID7454
Feature activation+2.64e+00

Pos logit contributions
 cry+0.032
 smile+0.031
 tell+0.031
 call+0.029
 weep+0.029
Neg logit contributions
P-0.055
Now-0.053
ww-0.049
Spl-0.047
sc-0.047
Token upon
Token ID2402
Feature activation-8.69e-01

Pos logit contributions
 call+0.152
 cry+0.151
 tell+0.146
 scream+0.144
 gaze+0.143
Neg logit contributions
P-0.100
it-0.086
ha-0.085
ww-0.084
Soon-0.081
Token a
Token ID257
Feature activation+2.19e-01

Pos logit contributions
P+0.053
ww+0.050
Now+0.050
bly+0.048
lish+0.047
Neg logit contributions
 smile-0.033
 cry-0.033
 tell-0.033
 bed-0.031
 call-0.031
Token time
Token ID640
Feature activation+6.96e-01

Pos logit contributions
 cry+0.010
 smile+0.010
 tell+0.010
 call+0.009
 bed+0.009
Neg logit contributions
P-0.012
Now-0.011
ww-0.011
lish-0.010
Spl-0.010
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
 cry+0.020
 smile+0.020
 tell+0.020
 call+0.019
 bed+0.018
Neg logit contributions
P-0.048
ww-0.046
Now-0.044
Spl-0.043
bly-0.042
Token there
Token ID612
Feature activation+1.45e+00

Pos logit contributions
P+0.056
Now+0.054
ww+0.053
bly+0.048
lish+0.048
Neg logit contributions
 smile-0.051
 cry-0.050
 tell-0.049
 crying-0.048
 bed-0.047
Token of
Token ID286
Feature activation+1.54e+00

Pos logit contributions
P+0.072
ww+0.067
Now+0.066
Too+0.060
Spl+0.060
Neg logit contributions
 smile-0.053
 tell-0.051
 cry-0.051
 crying-0.050
 bed-0.050
Token the
Token ID262
Feature activation-3.99e-01

Pos logit contributions
 smile+0.072
 stomach+0.071
 cry+0.069
 question+0.066
 tell+0.065
Neg logit contributions
Now-0.077
P-0.073
Soon-0.070
Spl-0.068
bly-0.067
Token day
Token ID1110
Feature activation+7.66e-01

Pos logit contributions
P+0.024
Now+0.023
ww+0.022
bly+0.021
lish+0.021
Neg logit contributions
 smile-0.016
 cry-0.016
 tell-0.015
 call-0.015
 bed-0.015
Token looking
Token ID2045
Feature activation-8.32e-02

Pos logit contributions
 cry+0.028
 smile+0.026
 question+0.026
 tell+0.026
 stomach+0.025
Neg logit contributions
P-0.047
Now-0.044
ww-0.042
bly-0.042
Spl-0.041
Token up
Token ID510
Feature activation-5.10e-01

Pos logit contributions
P+0.005
Now+0.005
ww+0.005
bly+0.004
lish+0.004
Neg logit contributions
 cry-0.003
 smile-0.003
 tell-0.003
 call-0.003
 bed-0.003
Token at
Token ID379
Feature activation+2.37e+00

Pos logit contributions
P+0.030
Now+0.028
ww+0.028
bly+0.027
lish+0.026
Neg logit contributions
 smile-0.021
 cry-0.021
 tell-0.020
 call-0.019
 bed-0.019
Token the
Token ID262
Feature activation+2.56e-03

Pos logit contributions
 smile+0.122
 cry+0.121
 tell+0.116
 comment+0.115
 grin+0.113
Neg logit contributions
Now-0.086
bly-0.081
Soon-0.072
P-0.071
A-0.071
Token cloud
Token ID6279
Feature activation+1.03e+00

Pos logit contributions
 cry+0.000
 smile+0.000
 tell+0.000
 call+0.000
 crying+0.000
Neg logit contributions
P-0.000
Now-0.000
ww-0.000
bly-0.000
Soon-0.000
Token '
Token ID705
Feature activation-2.93e+00

Pos logit contributions
 cry+0.038
 tell+0.036
 smile+0.035
 question+0.033
 call+0.033
Neg logit contributions
P-0.061
Now-0.059
ww-0.057
bly-0.057
lish-0.056
TokenFl
Token ID7414
Feature activation-2.10e-01

Pos logit contributions
P+0.121
ww+0.116
Now+0.108
bly+0.107
lish+0.104
Neg logit contributions
 cry-0.152
 smile-0.152
 tell-0.149
 call-0.148
 crying-0.147
Tokenuffy
Token ID15352
Feature activation-5.85e-02

Pos logit contributions
P+0.013
Now+0.012
ww+0.012
bly+0.012
lish+0.011
Neg logit contributions
 smile-0.008
 cry-0.008
 tell-0.008
 call-0.007
 bed-0.007
Token smile
Token ID8212
Feature activation-2.05e-01

Pos logit contributions
 tell+0.028
 cry+0.028
 weeping+0.027
 crying+0.026
 weep+0.026
Neg logit contributions
P-0.068
ww-0.066
Now-0.061
sc-0.061
ha-0.060
Token.
Token ID13
Feature activation+2.42e-01

Pos logit contributions
P+0.013
Now+0.013
ww+0.013
bly+0.012
lish+0.012
Neg logit contributions
 cry-0.007
 smile-0.007
 tell-0.007
 call-0.006
 bed-0.006
Token They
Token ID1119
Feature activation+1.06e+00

Pos logit contributions
 cry+0.009
 smile+0.009
 tell+0.009
 call+0.008
 crying+0.008
Neg logit contributions
P-0.015
Now-0.015
ww-0.014
Spl-0.013
Soon-0.013
Token are
Token ID389
Feature activation-3.25e-02

Pos logit contributions
 crying+0.029
 weeping+0.029
 cry+0.026
 bed+0.026
 question+0.026
Neg logit contributions
P-0.072
Now-0.068
ww-0.065
bly-0.063
Spl-0.062
Token not
Token ID407
Feature activation+4.79e-01

Pos logit contributions
P+0.002
Now+0.002
ww+0.002
bly+0.002
lish+0.002
Neg logit contributions
 cry-0.001
 smile-0.001
 tell-0.001
 call-0.001
 question-0.001
Token anxious
Token ID18116
Feature activation+2.30e-01

Pos logit contributions
 tell+0.018
 cry+0.018
 smile+0.018
 bed+0.018
 gaze+0.017
Neg logit contributions
P-0.028
ww-0.026
Now-0.026
bly-0.025
lish-0.025
Token anymore
Token ID7471
Feature activation+9.46e-01

Pos logit contributions
 tell+0.007
 smile+0.007
 cry+0.007
 bed+0.007
 call+0.007
Neg logit contributions
P-0.015
ww-0.014
Now-0.014
lish-0.014
bly-0.013
Token.
Token ID13
Feature activation-8.45e-02

Pos logit contributions
 cry+0.031
 smile+0.031
 gaze+0.030
 call+0.029
 tell+0.029
Neg logit contributions
P-0.057
Now-0.057
ww-0.055
lish-0.054
bly-0.053
Token They
Token ID1119
Feature activation+1.23e+00

Pos logit contributions
P+0.006
Now+0.006
ww+0.005
bly+0.005
Spl+0.005
Neg logit contributions
 smile-0.003
 cry-0.003
 tell-0.003
 call-0.002
 bed-0.002
Token are
Token ID389
Feature activation+2.92e-01

Pos logit contributions
 crying+0.041
 bed+0.040
 weeping+0.040
 eyes+0.038
 stomach+0.038
Neg logit contributions
P-0.074
Now-0.072
ww-0.067
Hey-0.065
bly-0.064
Token brave
Token ID14802
Feature activation-3.16e-01

Pos logit contributions
 cry+0.011
 smile+0.010
 tell+0.010
 gaze+0.010
 call+0.010
Neg logit contributions
P-0.017
Now-0.016
bly-0.016
ww-0.016
lish-0.015
TokenHello
Token ID15496
Feature activation-2.44e-01

Pos logit contributions
ww+0.123
izing+0.123
hest+0.120
ross+0.120
ier+0.115
Neg logit contributions
 smile-0.160
 call-0.159
 gaze-0.158
 say-0.156
 cry-0.156
Token,
Token ID11
Feature activation+1.56e-01

Pos logit contributions
P+0.018
ww+0.017
Now+0.017
bly+0.017
lish+0.016
Neg logit contributions
 cry-0.006
 smile-0.006
 tell-0.006
 call-0.005
 bed-0.005
Token gir
Token ID37370
Feature activation-8.43e-01

Pos logit contributions
 cry+0.006
 smile+0.006
 tell+0.005
 bed+0.005
 call+0.005
Neg logit contributions
P-0.010
ww-0.009
Now-0.009
bly-0.008
lish-0.008
Tokenaffe
Token ID21223
Feature activation-4.57e-01

Pos logit contributions
P+0.040
Now+0.037
ww+0.036
bly+0.035
Spl+0.033
Neg logit contributions
 smile-0.044
 cry-0.043
 tell-0.043
 call-0.042
 bed-0.041
Token,"
Token ID553
Feature activation+3.14e-01

Pos logit contributions
P+0.029
Now+0.027
ww+0.027
bly+0.026
lish+0.025
Neg logit contributions
 smile-0.017
 cry-0.017
 tell-0.016
 call-0.016
 bed-0.015
Token Anna
Token ID11735
Feature activation+1.32e+00

Pos logit contributions
 cry+0.010
 smile+0.010
 tell+0.010
 stomach+0.010
 call+0.010
Neg logit contributions
P-0.020
Now-0.019
ww-0.019
bly-0.018
Soon-0.017
Token said
Token ID531
Feature activation+1.57e+00

Pos logit contributions
 bed+0.038
 stomach+0.037
 cry+0.036
 weeping+0.036
 smile+0.036
Neg logit contributions
P-0.085
Now-0.079
ww-0.077
sc-0.077
Hey-0.075
Token.
Token ID13
Feature activation+7.22e-01

Pos logit contributions
 tell+0.054
 cry+0.054
 bed+0.053
 question+0.052
 call+0.050
Neg logit contributions
P-0.090
bly-0.089
lish-0.085
it-0.081
sc-0.079
Token "
Token ID366
Feature activation-2.78e+00

Pos logit contributions
 cry+0.028
 tell+0.027
 smile+0.027
 weep+0.026
 stomach+0.025
Neg logit contributions
P-0.043
Now-0.041
Soon-0.038
ww-0.038
bly-0.037
TokenYou
Token ID1639
Feature activation+3.68e-01

Pos logit contributions
ww+0.108
lish+0.096
P+0.095
ha+0.095
bly+0.091
Neg logit contributions
 cry-0.160
 smile-0.157
 gaze-0.155
 tell-0.155
 crying-0.154
Token are
Token ID389
Feature activation-1.05e-01

Pos logit contributions
 smile+0.012
 cry+0.011
 bed+0.011
 crying+0.011
 weeping+0.011
Neg logit contributions
P-0.024
ww-0.022
Now-0.022
bly-0.021
Spl-0.021
Token,
Token ID11
Feature activation-1.13e+00

Pos logit contributions
 smile+0.009
 cry+0.009
 tell+0.008
 call+0.008
 gaze+0.008
Neg logit contributions
P-0.013
ww-0.012
Now-0.012
lish-0.011
bly-0.011
Token stop
Token ID2245
Feature activation-1.10e+00

Pos logit contributions
P+0.081
Now+0.075
ww+0.074
bly+0.071
lish+0.071
Neg logit contributions
 cry-0.037
 tell-0.037
 smile-0.036
 call-0.034
 bed-0.033
Token!"
Token ID2474
Feature activation+1.54e-01

Pos logit contributions
P+0.066
Now+0.061
ww+0.059
bly+0.057
sc+0.056
Neg logit contributions
 cry-0.046
 smile-0.046
 tell-0.045
 crying-0.043
 call-0.043
Token Ben
Token ID3932
Feature activation+3.93e-01

Pos logit contributions
 smile+0.005
 cry+0.005
 tell+0.004
 call+0.004
 weeping+0.004
Neg logit contributions
P-0.011
Now-0.010
ww-0.010
lish-0.009
Spl-0.009
Token says
Token ID1139
Feature activation+1.54e+00

Pos logit contributions
 smile+0.010
 bed+0.010
 stomach+0.010
 cry+0.009
 tell+0.009
Neg logit contributions
P-0.028
Now-0.026
ww-0.025
lish-0.025
sc-0.025
Token.
Token ID13
Feature activation+7.35e-01

Pos logit contributions
 question+0.052
 tell+0.052
 bed+0.051
 stomach+0.051
 gaze+0.050
Neg logit contributions
bly-0.084
lish-0.083
P-0.079
sc-0.077
it-0.077
Token "
Token ID366
Feature activation-3.68e+00

Pos logit contributions
 cry+0.031
 tell+0.031
 smile+0.030
 weep+0.028
 bed+0.028
Neg logit contributions
P-0.040
Now-0.038
Spl-0.036
lish-0.036
Soon-0.036
TokenThe
Token ID464
Feature activation-6.58e-01

Pos logit contributions
ww+0.139
ha+0.126
P+0.117
lish+0.112
iest+0.112
Neg logit contributions
 cry-0.236
 feel-0.217
 scream-0.215
 weep-0.213
 crying-0.212
Token car
Token ID1097
Feature activation-1.43e+00

Pos logit contributions
P+0.045
Now+0.042
ww+0.042
bly+0.040
lish+0.040
Neg logit contributions
 cry-0.022
 smile-0.022
 tell-0.021
 call-0.020
 bed-0.020
Token is
Token ID318
Feature activation-8.11e-01

Pos logit contributions
P+0.094
Now+0.088
ww+0.086
bly+0.085
lish+0.082
Neg logit contributions
 cry-0.052
 smile-0.051
 tell-0.051
 call-0.049
 crying-0.045
Token too
Token ID1165
Feature activation-2.01e+00

Pos logit contributions
P+0.051
Now+0.048
ww+0.047
bly+0.045
lish+0.044
Neg logit contributions
 smile-0.031
 cry-0.031
 tell-0.031
 call-0.030
 bed-0.029
Token had
Token ID550
Feature activation-3.61e-01

Pos logit contributions
 cry+0.020
 smile+0.020
 tell+0.019
 bed+0.018
 weeping+0.018
Neg logit contributions
P-0.032
Now-0.030
ww-0.029
lish-0.027
ha-0.027
Token an
Token ID281
Feature activation+7.10e-01

Pos logit contributions
P+0.022
Now+0.021
ww+0.020
bly+0.019
lish+0.019
Neg logit contributions
 smile-0.014
 cry-0.014
 tell-0.014
 call-0.013
 bed-0.013
Token idea
Token ID2126
Feature activation+4.41e-01

Pos logit contributions
 cry+0.023
 smile+0.023
 crying+0.021
 tell+0.021
 bed+0.021
Neg logit contributions
P-0.045
Now-0.043
ww-0.043
bly-0.040
lish-0.039
Token.
Token ID13
Feature activation+3.45e-01

Pos logit contributions
 cry+0.013
 smile+0.013
 stomach+0.012
 tell+0.012
 weep+0.011
Neg logit contributions
P-0.030
Now-0.029
ww-0.028
bly-0.027
Spl-0.027
Token He
Token ID679
Feature activation-1.94e-01

Pos logit contributions
 cry+0.013
 smile+0.012
 tell+0.012
 call+0.012
 bed+0.011
Neg logit contributions
P-0.022
Now-0.021
ww-0.020
Soon-0.019
Spl-0.019
Token asked
Token ID1965
Feature activation+2.02e-01

Pos logit contributions
P+0.013
Now+0.012
ww+0.012
bly+0.011
lish+0.011
Neg logit contributions
 cry-0.006
 smile-0.006
 tell-0.006
 bed-0.006
 call-0.006
Token the
Token ID262
Feature activation-6.27e-01

Pos logit contributions
 cry+0.007
 smile+0.007
 tell+0.007
 call+0.007
 bed+0.006
Neg logit contributions
P-0.013
Now-0.012
ww-0.012
bly-0.011
lish-0.011
Token girl
Token ID2576
Feature activation+7.19e-01

Pos logit contributions
P+0.035
Now+0.033
ww+0.032
bly+0.031
lish+0.030
Neg logit contributions
 smile-0.028
 cry-0.027
 tell-0.027
 call-0.026
 bed-0.026
Token if
Token ID611
Feature activation-9.13e-01

Pos logit contributions
 cry+0.029
 smile+0.027
 bed+0.027
 tell+0.026
 weep+0.026
Neg logit contributions
P-0.041
ww-0.038
Now-0.038
bly-0.036
Soon-0.036
Token they
Token ID484
Feature activation+2.96e-01

Pos logit contributions
P+0.057
Now+0.054
ww+0.053
bly+0.051
lish+0.050
Neg logit contributions
 smile-0.034
 tell-0.034
 cry-0.034
 call-0.031
 bed-0.031
Token could
Token ID714
Feature activation+1.55e+00

Pos logit contributions
 cry+0.014
 bed+0.014
 smile+0.014
 tell+0.013
 crying+0.013
Neg logit contributions
P-0.015
Now-0.014
ww-0.014
bly-0.013
lish-0.013
Token his
Token ID465
Feature activation-1.63e-01

Pos logit contributions
P+0.006
Now+0.006
ww+0.006
bly+0.006
lish+0.005
Neg logit contributions
 cry-0.004
 smile-0.004
 tell-0.004
 call-0.003
 crying-0.003
Token snacks
Token ID28487
Feature activation-2.07e-01

Pos logit contributions
P+0.010
Now+0.009
ww+0.009
bly+0.008
lish+0.008
Neg logit contributions
 cry-0.007
 smile-0.007
 tell-0.007
 call-0.006
 crying-0.006
Token with
Token ID351
Feature activation-3.15e-03

Pos logit contributions
P+0.011
Now+0.010
ww+0.010
bly+0.009
lish+0.009
Neg logit contributions
 cry-0.010
 smile-0.010
 tell-0.010
 call-0.009
 crying-0.009
Token his
Token ID465
Feature activation+1.84e+00

Pos logit contributions
P+0.000
Now+0.000
ww+0.000
bly+0.000
lish+0.000
Neg logit contributions
 cry-0.000
 smile-0.000
 tell-0.000
 call-0.000
 crying-0.000
Token friends
Token ID2460
Feature activation+1.03e-01

Pos logit contributions
 cry+0.086
 stare+0.082
 gaze+0.081
 weep+0.080
 tell+0.075
Neg logit contributions
P-0.101
ww-0.086
Now-0.082
B-0.077
A-0.076
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
 cry+0.003
 smile+0.003
 tell+0.003
 call+0.003
 bed+0.003
Neg logit contributions
P-0.007
Now-0.007
ww-0.007
bly-0.007
lish-0.006
Token One
Token ID1881
Feature activation-1.30e-01

Pos logit contributions
 cry+0.057
 weep+0.053
 weeping+0.053
 smile+0.052
 crying+0.052
Neg logit contributions
P-0.073
Now-0.072
Soon-0.067
Spl-0.066
sc-0.065
Token day
Token ID1110
Feature activation-2.31e-01

Pos logit contributions
P+0.008
Now+0.007
ww+0.007
bly+0.007
lish+0.006
Neg logit contributions
 cry-0.005
 smile-0.005
 tell-0.005
 call-0.005
 crying-0.005
Token,
Token ID11
Feature activation-5.64e-01

Pos logit contributions
P+0.018
Now+0.017
ww+0.017
bly+0.016
lish+0.016
Neg logit contributions
 smile-0.005
 cry-0.005
 tell-0.005
 call-0.005
 bed-0.004
Token Tim
Token ID5045
Feature activation+2.67e-02

Pos logit contributions
P+0.042
Now+0.041
ww+0.040
bly+0.039
lish+0.038
Neg logit contributions
 smile-0.014
 cry-0.014
 tell-0.013
 call-0.012
 bed-0.012
Token found
Token ID1043
Feature activation-2.02e-01

Pos logit contributions
 smile+0.001
 cry+0.001
 tell+0.001
 call+0.001
 bed+0.001
Neg logit contributions
P-0.002
Now-0.002
ww-0.001
Spl-0.001
bly-0.001
Token "
Token ID366
Feature activation-3.42e+00

Pos logit contributions
 cry+0.032
 smile+0.031
 tell+0.029
 weeping+0.029
 weep+0.029
Neg logit contributions
P-0.046
Now-0.045
Soon-0.043
ww-0.041
Spl-0.041
TokenHello
Token ID15496
Feature activation-2.15e-01

Pos logit contributions
ww+0.151
bly+0.146
lish+0.142
ross+0.139
ier+0.138
Neg logit contributions
 call-0.157
 gaze-0.157
 search-0.156
 question-0.155
 tell-0.154
Token there
Token ID612
Feature activation+6.10e-01

Pos logit contributions
P+0.015
ww+0.015
Now+0.015
bly+0.014
lish+0.014
Neg logit contributions
 cry-0.006
 smile-0.006
 tell-0.006
 call-0.005
 bed-0.005
Token!
Token ID0
Feature activation-6.15e-01

Pos logit contributions
 smile+0.020
 cry+0.020
 tell+0.020
 bed+0.019
 gaze+0.019
Neg logit contributions
P-0.038
ww-0.037
Now-0.036
lish-0.034
bly-0.034
Token What
Token ID1867
Feature activation+6.48e-02

Pos logit contributions
P+0.033
Now+0.030
ww+0.030
bly+0.028
lish+0.028
Neg logit contributions
 smile-0.029
 cry-0.029
 tell-0.028
 call-0.027
 bed-0.027
Token brings
Token ID6774
Feature activation-9.71e-01

Pos logit contributions
 smile+0.002
 cry+0.002
 tell+0.002
 bed+0.002
 call+0.002
Neg logit contributions
P-0.004
ww-0.004
Now-0.004
bly-0.004
Spl-0.004
Token you
Token ID345
Feature activation-4.88e-01

Pos logit contributions
P+0.045
Now+0.043
bly+0.041
ww+0.040
lish+0.038
Neg logit contributions
 smile-0.051
 cry-0.051
 tell-0.051
 call-0.048
 crying-0.048
Token here
Token ID994
Feature activation-1.75e-01

Pos logit contributions
P+0.029
Now+0.027
ww+0.027
bly+0.025
lish+0.025
Neg logit contributions
 smile-0.020
 cry-0.020
 tell-0.019
 call-0.019
 bed-0.018
Token?"
Token ID1701
Feature activation+1.32e+00

Pos logit contributions
P+0.010
Now+0.009
ww+0.009
bly+0.009
lish+0.008
Neg logit contributions
 cry-0.008
 smile-0.008
 tell-0.007
 call-0.007
 bed-0.007
Token asked
Token ID1965
Feature activation+5.83e-01

Pos logit contributions
 smile+0.050
 stomach+0.050
 cry+0.048
 fault+0.048
 bed+0.047
Neg logit contributions
P-0.080
Now-0.073
Soon-0.068
ww-0.068
A-0.068
Token the
Token ID262
Feature activation+8.67e-01

Pos logit contributions
 cry+0.023
 smile+0.022
 tell+0.022
 weeping+0.021
 bed+0.021
Neg logit contributions
P-0.034
Now-0.032
bly-0.030
ww-0.030
lish-0.029
Token a
Token ID257
Feature activation+7.00e-01

Pos logit contributions
 cry+0.019
 smile+0.019
 tell+0.018
 weeping+0.018
 call+0.018
Neg logit contributions
P-0.023
Now-0.022
bly-0.021
Spl-0.020
Soon-0.020
Token safe
Token ID3338
Feature activation+1.18e+00

Pos logit contributions
 tell+0.029
 cry+0.029
 smile+0.028
 call+0.026
 stomach+0.026
Neg logit contributions
P-0.039
ww-0.037
Now-0.035
sc-0.034
ha-0.033
Token place
Token ID1295
Feature activation+1.30e+00

Pos logit contributions
 cry+0.044
 tell+0.044
 smile+0.042
 weeping+0.041
 crying+0.040
Neg logit contributions
P-0.069
Now-0.064
ww-0.064
sc-0.062
ha-0.059
Token to
Token ID284
Feature activation+1.04e+00

Pos logit contributions
 cry+0.045
 tell+0.042
 smile+0.041
 weep+0.040
 stomach+0.040
Neg logit contributions
P-0.077
Now-0.073
sc-0.073
ww-0.072
bly-0.071
Token put
Token ID1234
Feature activation-8.07e-01

Pos logit contributions
 weeping+0.027
 crying+0.027
 stomach+0.026
 cry+0.026
 eyes+0.025
Neg logit contributions
P-0.069
Now-0.068
bly-0.064
Spl-0.062
Soon-0.062
Token it
Token ID340
Feature activation-1.17e+00

Pos logit contributions
P+0.053
ww+0.050
Now+0.050
bly+0.047
lish+0.046
Neg logit contributions
 smile-0.028
 cry-0.028
 tell-0.027
 bed-0.026
 call-0.026
Token.
Token ID13
Feature activation+3.50e-01

Pos logit contributions
P+0.066
Now+0.064
ww+0.063
bly+0.057
Too+0.057
Neg logit contributions
 smile-0.051
 cry-0.049
 tell-0.049
 bed-0.048
 call-0.047
Token\n
Token ID198
Feature activation-2.16e-01

Pos logit contributions
 cry+0.015
 smile+0.015
 tell+0.014
 call+0.014
 weep+0.014
Neg logit contributions
P-0.021
Now-0.019
ww-0.018
Spl-0.018
Soon-0.018
Token\n
Token ID198
Feature activation-3.55e+00

Pos logit contributions
P+0.016
Now+0.015
ww+0.015
bly+0.014
Soon+0.014
Neg logit contributions
 smile-0.006
 cry-0.006
 tell-0.005
 call-0.005
 bed-0.005
TokenHe
Token ID1544
Feature activation-7.74e-01

Pos logit contributions
ww+0.124
ka+0.124
rob+0.122
bridge+0.120
bly+0.120
Neg logit contributions
 find-0.184
 search-0.180
 tell-0.180
 question-0.175
 smile-0.172
Token saw
Token ID2497
Feature activation+2.94e-01

Pos logit contributions
P+0.052
Now+0.050
ww+0.050
bly+0.048
lish+0.047
Neg logit contributions
 smile-0.025
 cry-0.024
 tell-0.024
 call-0.023
 bed-0.023
Token Mama
Token ID40084
Feature activation-5.35e-01

Pos logit contributions
P+0.144
Now+0.134
ww+0.129
bly+0.129
lish+0.127
Neg logit contributions
 tell-0.086
 smile-0.084
 call-0.079
 cry-0.077
 bed-0.076
Token Bird
Token ID14506
Feature activation-4.16e-01

Pos logit contributions
P+0.034
Now+0.033
ww+0.032
bly+0.031
lish+0.030
Neg logit contributions
 smile-0.019
 cry-0.019
 tell-0.018
 call-0.017
 bed-0.017
Token!
Token ID0
Feature activation-1.93e+00

Pos logit contributions
P+0.025
Now+0.024
ww+0.023
bly+0.022
lish+0.022
Neg logit contributions
 smile-0.016
 cry-0.016
 tell-0.016
 call-0.015
 bed-0.015
Tokenâ
Token ID22940
Feature activation-2.18e+00

Pos logit contributions
P+0.123
Now+0.114
ww+0.113
lish+0.111
bly+0.111
Neg logit contributions
 smile-0.065
 tell-0.065
 call-0.063
 cry-0.063
 bed-0.061
Token
Token ID26391
Feature activation-2.37e-01

Pos logit contributions
P+0.100
bly+0.091
ww+0.090
Now+0.088
Soon+0.086
Neg logit contributions
 tell-0.111
 smile-0.109
 cry-0.106
 call-0.106
 say-0.104
Token
Token ID220
Feature activation-1.05e+00

Pos logit contributions
P+0.012
Now+0.011
ww+0.011
bly+0.010
lish+0.010
Neg logit contributions
 smile-0.012
 cry-0.012
 tell-0.011
 call-0.011
 bed-0.011
Token\n
Token ID198
Feature activation-1.43e+00

Pos logit contributions
P+0.075
Now+0.070
ww+0.070
bly+0.068
lish+0.067
Neg logit contributions
 smile-0.029
 cry-0.029
 tell-0.029
 call-0.027
 bed-0.027
Token\n
Token ID198
Feature activation-4.32e+00

Pos logit contributions
P+0.111
ww+0.108
lish+0.108
bly+0.106
Now+0.104
Neg logit contributions
 smile-0.025
 tell-0.025
 cry-0.023
 bed-0.023
 call-0.021
TokenM
Token ID44
Feature activation-5.37e-01

Pos logit contributions
led+0.189
ier+0.188
lish+0.187
ka+0.180
isher+0.179
Neg logit contributions
 tell-0.181
 -0.176
-0.166
 know-0.161
 say-0.157
Tokenama
Token ID1689
Feature activation-8.61e-01

Pos logit contributions
P+0.033
Now+0.031
ww+0.030
bly+0.029
lish+0.028
Neg logit contributions
 smile-0.021
 cry-0.021
 tell-0.020
 call-0.020
 bed-0.019
Token Bird
Token ID14506
Feature activation-3.51e-01

Pos logit contributions
P+0.058
Now+0.055
ww+0.054
bly+0.053
lish+0.051
Neg logit contributions
 smile-0.028
 cry-0.027
 tell-0.027
 call-0.025
 crying-0.024
Token,
Token ID11
Feature activation+5.03e-02

Pos logit contributions
 cry+0.003
 smile+0.003
 tell+0.003
 call+0.002
 bed+0.002
Neg logit contributions
P-0.008
ww-0.008
Now-0.007
lish-0.007
bly-0.007
Token Lily
Token ID20037
Feature activation+1.63e-01

Pos logit contributions
 cry+0.001
 smile+0.001
 tell+0.001
 call+0.001
 crying+0.001
Neg logit contributions
P-0.004
Now-0.003
ww-0.003
bly-0.003
lish-0.003
Token.
Token ID13
Feature activation-1.51e+00

Pos logit contributions
 cry+0.004
 smile+0.004
 tell+0.004
 call+0.004
 gaze+0.004
Neg logit contributions
P-0.012
Now-0.011
ww-0.011
lish-0.010
bly-0.010
Token You
Token ID921
Feature activation-3.76e-01

Pos logit contributions
P+0.080
Now+0.073
ww+0.072
bly+0.072
lish+0.069
Neg logit contributions
 smile-0.070
 tell-0.070
 cry-0.069
 bed-0.067
 call-0.067
Token were
Token ID547
Feature activation-9.22e-01

Pos logit contributions
P+0.025
Now+0.024
ww+0.024
bly+0.023
lish+0.022
Neg logit contributions
 smile-0.012
 cry-0.012
 tell-0.012
 call-0.011
 bed-0.011
Token a
Token ID257
Feature activation-1.15e+00

Pos logit contributions
P+0.056
Now+0.053
ww+0.053
bly+0.050
lish+0.049
Neg logit contributions
 smile-0.037
 tell-0.036
 cry-0.035
 call-0.034
 bed-0.034
Token good
Token ID922
Feature activation+2.08e-01

Pos logit contributions
P+0.061
Now+0.059
ww+0.056
bly+0.055
sc+0.055
Neg logit contributions
 smile-0.055
 tell-0.052
 cry-0.052
 call-0.050
 bed-0.049
Token customer
Token ID6491
Feature activation-9.28e-01

Pos logit contributions
 cry+0.007
 smile+0.007
 tell+0.007
 weeping+0.006
 bed+0.006
Neg logit contributions
P-0.014
ww-0.013
Now-0.013
lish-0.012
bly-0.012
Token.
Token ID13
Feature activation-2.00e+00

Pos logit contributions
P+0.072
Now+0.069
ww+0.068
bly+0.066
Soon+0.064
Neg logit contributions
 smile-0.021
 tell-0.021
 cry-0.020
 call-0.019
 bed-0.018
Token You
Token ID921
Feature activation-4.21e-01

Pos logit contributions
P+0.112
bly+0.103
ww+0.102
Now+0.102
lish+0.099
Neg logit contributions
 tell-0.086
 smile-0.084
 call-0.082
 cry-0.082
 bed-0.082
Token have
Token ID423
Feature activation-1.14e+00

Pos logit contributions
P+0.027
Now+0.025
ww+0.025
bly+0.024
lish+0.023
Neg logit contributions
 smile-0.015
 cry-0.015
 tell-0.015
 call-0.014
 bed-0.014
Token laugh
Token ID6487
Feature activation-1.62e+00

Pos logit contributions
 weeping+0.009
 tell+0.008
 crying+0.008
 cry+0.008
 smile+0.008
Neg logit contributions
P-0.025
Now-0.024
ww-0.023
Spl-0.023
bly-0.023
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
P+0.142
ww+0.135
Now+0.134
Too+0.127
bly+0.127
Neg logit contributions
 smile-0.026
 tell-0.022
 cry-0.022
 bed-0.021
 call-0.019
Token They
Token ID1119
Feature activation+1.29e+00

Pos logit contributions
P+0.001
Now+0.001
ww+0.001
Spl+0.001
Soon+0.001
Neg logit contributions
 cry-0.000
 smile-0.000
 tell-0.000
 call-0.000
 crying-0.000
Token feel
Token ID1254
Feature activation-3.33e-01

Pos logit contributions
 weeping+0.038
 crying+0.037
 eyes+0.035
 stomach+0.033
 bed+0.032
Neg logit contributions
P-0.083
Now-0.081
ww-0.076
Spl-0.076
bly-0.075
Token the
Token ID262
Feature activation-8.53e-01

Pos logit contributions
P+0.021
Now+0.020
ww+0.020
bly+0.019
lish+0.018
Neg logit contributions
 smile-0.012
 cry-0.012
 tell-0.012
 call-0.011
 bed-0.011
Token water
Token ID1660
Feature activation-6.78e-01

Pos logit contributions
P+0.054
Now+0.051
ww+0.050
bly+0.048
lish+0.047
Neg logit contributions
 smile-0.033
 cry-0.032
 tell-0.031
 bed-0.030
 call-0.030
Token on
Token ID319
Feature activation-3.77e-02

Pos logit contributions
P+0.043
Now+0.041
ww+0.040
bly+0.038
lish+0.038
Neg logit contributions
 smile-0.025
 cry-0.025
 tell-0.024
 call-0.023
 bed-0.023
Token their
Token ID511
Feature activation+1.04e+00

Pos logit contributions
P+0.002
Now+0.002
ww+0.002
bly+0.002
lish+0.002
Neg logit contributions
 smile-0.001
 cry-0.001
 tell-0.001
 call-0.001
 crying-0.001
Token feet
Token ID3625
Feature activation+4.78e-02

Pos logit contributions
 tell+0.034
 question+0.032
 cry+0.031
 weeping+0.031
 call+0.031
Neg logit contributions
P-0.065
Now-0.060
ww-0.060
bly-0.059
Spl-0.059
Token and
Token ID290
Feature activation+1.47e+00

Pos logit contributions
 smile+0.001
 cry+0.001
 tell+0.001
 call+0.001
 question+0.001
Neg logit contributions
P-0.003
Now-0.003
ww-0.003
bly-0.003
lish-0.003
Token hands
Token ID2832
Feature activation+7.13e-01

Pos logit contributions
 question+0.048
 weeping+0.048
 tell+0.045
 call+0.045
 crying+0.045
Neg logit contributions
Now-0.080
P-0.080
Spl-0.077
ww-0.076
bly-0.074
Token a
Token ID257
Feature activation+5.06e-01

Pos logit contributions
P+0.036
Now+0.034
ww+0.033
bly+0.032
lish+0.031
Neg logit contributions
 smile-0.015
 cry-0.015
 tell-0.015
 call-0.014
 bed-0.014
Token dragon
Token ID10441
Feature activation+9.13e-01

Pos logit contributions
 tell+0.021
 cry+0.021
 smile+0.020
 call+0.019
 bed+0.019
Neg logit contributions
P-0.028
ww-0.027
Now-0.026
sc-0.024
lish-0.024
Token.
Token ID13
Feature activation+7.46e-01

Pos logit contributions
 smile+0.027
 cry+0.027
 stomach+0.026
 tell+0.026
 bed+0.025
Neg logit contributions
P-0.059
Now-0.057
ww-0.056
sc-0.054
bly-0.054
Token
Token ID198
Feature activation+1.06e+00

Pos logit contributions
 cry+0.029
 smile+0.028
 tell+0.027
 weep+0.027
 call+0.025
Neg logit contributions
P-0.046
Now-0.045
Soon-0.042
Spl-0.041
ww-0.041
Token
Token ID198
Feature activation-3.76e+00

Pos logit contributions
 smile+0.041
 cry+0.040
 stomach+0.037
 gaze+0.037
 weep+0.037
Neg logit contributions
P-0.066
Now-0.066
Soon-0.062
ww-0.059
Spl-0.059
Token"
Token ID1
Feature activation-3.49e+00

Pos logit contributions
iest+0.175
ka+0.169
ww+0.169
bly+0.165
lish+0.159
Neg logit contributions
-0.157
 find-0.152
 -0.150
 mom-0.145
 tell-0.139
TokenHey
Token ID10814
Feature activation-3.88e-01

Pos logit contributions
ww+0.121
bly+0.115
lish+0.114
sc+0.106
ing+0.104
Neg logit contributions
 cry-0.175
 tell-0.173
 call-0.173
 frown-0.169
 say-0.169
Token,
Token ID11
Feature activation+1.89e-02

Pos logit contributions
P+0.030
Now+0.029
ww+0.028
bly+0.027
lish+0.027
Neg logit contributions
 smile-0.008
 cry-0.008
 tell-0.008
 call-0.007
 bed-0.007
Token that
Token ID326
Feature activation-4.82e-01

Pos logit contributions
 smile+0.001
 cry+0.001
 tell+0.001
 bed+0.001
 call+0.001
Neg logit contributions
P-0.001
ww-0.001
Now-0.001
bly-0.001
lish-0.001
Token's
Token ID338
Feature activation-7.51e-01

Pos logit contributions
P+0.031
Now+0.030
ww+0.029
bly+0.028
lish+0.028
Neg logit contributions
 smile-0.017
 cry-0.017
 tell-0.016
 call-0.015
 bed-0.015
Token not
Token ID407
Feature activation-3.45e-01

Pos logit contributions
P+0.043
Now+0.041
ww+0.039
bly+0.038
lish+0.037
Neg logit contributions
 cry-0.032
 smile-0.032
 tell-0.032
 call-0.031
 bed-0.030
Token wizard
Token ID18731
Feature activation+1.42e+00

Pos logit contributions
P+0.018
Now+0.017
ww+0.017
bly+0.016
lish+0.015
Neg logit contributions
 cry-0.014
 smile-0.014
 tell-0.014
 call-0.013
 bed-0.013
Token's
Token ID338
Feature activation+1.74e+00

Pos logit contributions
 cry+0.058
 bed+0.056
 smile+0.054
 tell+0.054
 stomach+0.053
Neg logit contributions
P-0.077
Now-0.075
ww-0.072
bly-0.070
lish-0.067
Token work
Token ID670
Feature activation-7.27e-01

Pos logit contributions
 cry+0.064
 tell+0.062
 bed+0.060
 weep+0.058
 call+0.056
Neg logit contributions
Now-0.092
P-0.091
ww-0.085
bly-0.085
Soon-0.082
Token.
Token ID13
Feature activation+1.60e-01

Pos logit contributions
P+0.045
Now+0.042
ww+0.041
bly+0.039
lish+0.039
Neg logit contributions
 smile-0.028
 cry-0.028
 tell-0.027
 call-0.026
 bed-0.026
Token
Token ID198
Feature activation+1.34e-01

Pos logit contributions
 cry+0.006
 smile+0.006
 tell+0.006
 call+0.005
 crying+0.005
Neg logit contributions
P-0.010
Now-0.010
ww-0.009
bly-0.009
Soon-0.009
Token
Token ID198
Feature activation-3.61e+00

Pos logit contributions
 smile+0.004
 cry+0.004
 tell+0.004
 call+0.004
 stomach+0.004
Neg logit contributions
P-0.009
Now-0.009
ww-0.009
Soon-0.008
bly-0.008
TokenThe
Token ID464
Feature activation-8.11e-01

Pos logit contributions
lish+0.154
ww+0.145
ka+0.128
w+0.128
bly+0.125
Neg logit contributions
 tell-0.182
 know-0.176
 -0.171
 smile-0.166
 find-0.166
Token vine
Token ID17793
Feature activation+1.53e-01

Pos logit contributions
P+0.047
ww+0.044
Now+0.044
bly+0.042
lish+0.041
Neg logit contributions
 smile-0.034
 tell-0.033
 cry-0.033
 call-0.032
 question-0.032
Token thanked
Token ID26280
Feature activation+2.22e-01

Pos logit contributions
 cry+0.006
 smile+0.006
 tell+0.006
 bed+0.006
 call+0.006
Neg logit contributions
P-0.009
Now-0.009
ww-0.008
bly-0.008
lish-0.008
Token the
Token ID262
Feature activation-4.82e-01

Pos logit contributions
 cry+0.008
 smile+0.008
 tell+0.008
 call+0.007
 crying+0.007
Neg logit contributions
P-0.014
Now-0.013
ww-0.013
bly-0.012
lish-0.012
Token wizard
Token ID18731
Feature activation+7.44e-01

Pos logit contributions
P+0.029
Now+0.027
ww+0.027
bly+0.026
lish+0.025
Neg logit contributions
 smile-0.019
 cry-0.019
 tell-0.019
 call-0.018
 bed-0.018
Token he
Token ID339
Feature activation-2.03e-02

Pos logit contributions
 cry+0.020
 smile+0.020
 tell+0.019
 call+0.019
 weeping+0.019
Neg logit contributions
P-0.036
Now-0.034
ww-0.033
bly-0.031
Soon-0.031
Token said
Token ID531
Feature activation+1.09e+00

Pos logit contributions
P+0.001
Now+0.001
ww+0.001
bly+0.001
lish+0.001
Neg logit contributions
 cry-0.001
 smile-0.001
 tell-0.001
 bed-0.001
 crying-0.001
Token.
Token ID13
Feature activation+8.63e-01

Pos logit contributions
 cry+0.038
 tell+0.037
 question+0.035
 call+0.035
 bed+0.035
Neg logit contributions
P-0.063
bly-0.062
Now-0.059
lish-0.058
ww-0.057
Token
Token ID220
Feature activation+1.04e+00

Pos logit contributions
 cry+0.025
 tell+0.024
 smile+0.024
 call+0.022
 weep+0.022
Neg logit contributions
P-0.059
Now-0.058
ww-0.054
Soon-0.054
Spl-0.053
Token
Token ID198
Feature activation-1.33e+00

Pos logit contributions
 cry+0.035
 smile+0.035
 tell+0.034
 weep+0.033
 call+0.032
Neg logit contributions
P-0.067
Now-0.065
Soon-0.061
bly-0.061
ww-0.061
Token
Token ID198
Feature activation-3.47e+00

Pos logit contributions
P+0.101
ww+0.100
Now+0.096
lish+0.096
bly+0.095
Neg logit contributions
 smile-0.028
 tell-0.027
 cry-0.026
 call-0.024
 bed-0.023
TokenM
Token ID44
Feature activation-6.90e-01

Pos logit contributions
ww+0.142
w+0.138
lish+0.131
plets+0.126
sc+0.123
Neg logit contributions
 tell-0.177
 smile-0.177
 find-0.169
 -0.168
 question-0.168
Tokenummy
Token ID13513
Feature activation-3.76e-01

Pos logit contributions
P+0.036
Now+0.033
ww+0.033
bly+0.031
Spl+0.030
Neg logit contributions
 smile-0.033
 cry-0.033
 tell-0.032
 call-0.031
 bed-0.031
Token smiled
Token ID13541
Feature activation-5.82e-02

Pos logit contributions
P+0.024
Now+0.023
ww+0.023
bly+0.022
lish+0.021
Neg logit contributions
 cry-0.013
 smile-0.013
 tell-0.013
 call-0.012
 bed-0.012
Token and
Token ID290
Feature activation-9.91e-02

Pos logit contributions
P+0.004
Now+0.004
ww+0.003
bly+0.003
lish+0.003
Neg logit contributions
 cry-0.002
 tell-0.002
 smile-0.002
 call-0.002
 bed-0.002
Token said
Token ID531
Feature activation-9.76e-01

Pos logit contributions
P+0.007
Now+0.006
ww+0.006
bly+0.006
lish+0.006
Neg logit contributions
 cry-0.003
 smile-0.003
 tell-0.003
 bed-0.003
 call-0.003
Token for
Token ID329
Feature activation+1.17e+00

Pos logit contributions
 smile+0.009
 cry+0.009
 tell+0.009
 call+0.008
 stomach+0.008
Neg logit contributions
P-0.020
Now-0.019
ww-0.019
sc-0.018
lish-0.018
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 smile+0.048
 cry+0.045
 tell+0.044
 question+0.044
 stomach+0.042
Neg logit contributions
P-0.064
Now-0.062
bly-0.059
Soon-0.058
ww-0.058
Token cat
Token ID3797
Feature activation+1.08e+00

Pos logit contributions
 smile+0.040
 tell+0.040
 cry+0.038
 gaze+0.037
 stare+0.036
Neg logit contributions
P-0.053
ww-0.051
Now-0.050
lish-0.049
sc-0.048
Token.
Token ID13
Feature activation+8.66e-01

Pos logit contributions
 smile+0.034
 cry+0.032
 tell+0.032
 gaze+0.031
 question+0.030
Neg logit contributions
P-0.070
ww-0.066
Now-0.066
sc-0.065
lish-0.062
Token
Token ID198
Feature activation+9.08e-01

Pos logit contributions
 smile+0.032
 tell+0.032
 cry+0.032
 call+0.030
 question+0.029
Neg logit contributions
Now-0.054
P-0.054
Soon-0.049
ww-0.049
Spl-0.048
Token
Token ID198
Feature activation-3.46e+00

Pos logit contributions
 smile+0.033
 cry+0.032
 stomach+0.030
 tell+0.030
 call+0.030
Neg logit contributions
P-0.059
Now-0.058
Soon-0.055
ww-0.053
Spl-0.052
TokenThe
Token ID464
Feature activation+2.35e-01

Pos logit contributions
iest+0.152
ka+0.144
ww+0.141
ier+0.139
sc+0.137
Neg logit contributions
 cry-0.156
 tell-0.154
 know-0.151
 find-0.148
 call-0.147
Token pretty
Token ID2495
Feature activation+9.27e-01

Pos logit contributions
 smile+0.009
 cry+0.009
 tell+0.009
 gaze+0.008
 bed+0.008
Neg logit contributions
P-0.014
Now-0.013
ww-0.013
lish-0.012
bly-0.012
Token mouse
Token ID10211
Feature activation+1.02e-01

Pos logit contributions
 tell+0.037
 cry+0.037
 smile+0.037
 gaze+0.035
 stare+0.035
Neg logit contributions
P-0.049
ww-0.046
Now-0.045
lish-0.044
bly-0.043
Token made
Token ID925
Feature activation+5.77e-01

Pos logit contributions
 smile+0.004
 cry+0.004
 tell+0.004
 bed+0.004
 call+0.004
Neg logit contributions
P-0.006
Now-0.006
ww-0.005
bly-0.005
lish-0.005
Token it
Token ID340
Feature activation+8.06e-01

Pos logit contributions
 smile+0.026
 tell+0.025
 cry+0.025
 question+0.025
 call+0.024
Neg logit contributions
P-0.029
Now-0.029
ww-0.027
bly-0.026
Soon-0.026
Token Dad
Token ID17415
Feature activation+1.04e-01

Pos logit contributions
P+0.034
Now+0.033
ww+0.032
bly+0.031
lish+0.030
Neg logit contributions
 smile-0.016
 cry-0.016
 tell-0.016
 call-0.015
 bed-0.015
Token said
Token ID531
Feature activation-5.43e-01

Pos logit contributions
 cry+0.004
 smile+0.004
 tell+0.003
 bed+0.003
 call+0.003
Neg logit contributions
P-0.007
Now-0.006
ww-0.006
bly-0.006
lish-0.006
Token to
Token ID284
Feature activation+1.02e+00

Pos logit contributions
P+0.040
Now+0.038
ww+0.037
bly+0.036
lish+0.035
Neg logit contributions
 smile-0.015
 cry-0.014
 tell-0.014
 call-0.013
 bed-0.013
Token her
Token ID607
Feature activation+1.54e+00

Pos logit contributions
 cry+0.038
 smile+0.038
 stomach+0.036
 tell+0.035
 crying+0.035
Neg logit contributions
P-0.058
Now-0.057
bly-0.052
ww-0.052
Hey-0.051
Token,
Token ID11
Feature activation-3.36e-01

Pos logit contributions
 cry+0.050
 bed+0.049
 stomach+0.048
 tell+0.046
 weep+0.046
Neg logit contributions
P-0.083
bly-0.080
Now-0.078
lish-0.078
it-0.076
Token "
Token ID366
Feature activation-3.90e+00

Pos logit contributions
P+0.022
Now+0.021
ww+0.021
bly+0.020
lish+0.020
Neg logit contributions
 cry-0.011
 smile-0.011
 tell-0.011
 call-0.010
 bed-0.010
TokenYou
Token ID1639
Feature activation-5.64e-01

Pos logit contributions
bly+0.136
br+0.128
lish+0.127
ww+0.123
gers+0.120
Neg logit contributions
 say-0.192
 tell-0.191
 gaze-0.190
 smile-0.188
 -0.187
Token know
Token ID760
Feature activation-8.44e-01

Pos logit contributions
P+0.039
Now+0.037
ww+0.036
bly+0.035
lish+0.034
Neg logit contributions
 smile-0.018
 cry-0.018
 tell-0.017
 call-0.016
 bed-0.016
Token Ab
Token ID2275
Feature activation-1.54e-01

Pos logit contributions
P+0.060
Now+0.057
ww+0.055
bly+0.053
Spl+0.052
Neg logit contributions
 smile-0.026
 cry-0.025
 tell-0.025
 bed-0.023
 call-0.023
Tokenig
Token ID328
Feature activation-1.64e-01

Pos logit contributions
P+0.008
Now+0.007
ww+0.007
bly+0.007
lish+0.007
Neg logit contributions
 cry-0.007
 smile-0.007
 tell-0.007
 call-0.007
 bed-0.007
Tokenail
Token ID603
Feature activation+3.07e-01

Pos logit contributions
P+0.011
Now+0.010
ww+0.010
bly+0.009
lish+0.009
Neg logit contributions
 smile-0.006
 cry-0.006
 tell-0.006
 call-0.005
 bed-0.005