TOP ACTIVATIONS
MAX = 0.109

raven wanted to help the girl . He tried to talk
The mom my heard her daughter scream and came running .
He soon heard the girl 's voice . She said
didn 't mind helping the bird . They walked and walked
kind farmer came to the pig and said , " Don
felt better and went back to the barn to play with
hugged each other and went to play somewhere else . <|endoftext|>
sad . Max went to Lucy and asked , " Why
ily felt better and went to play with her other friends
finished her sandwich and went to play outside . She saw
how the farmer helped the cow . They knew he was
car ?" His mom heard him and came to help .
the truck heard the little girl 's voice . He looked
's wrong ?" asked the bear . The
tears . Tim went to Sue and asked , " Why
The farmer heard the cow and came to help .
felt better and went back to playing at the park .
little boy named Tim heard Kitty complain . He came to
felt much better and went to sleep with a smile on
D ucky 's mom heard him cry and came running .

MIN ACTIVATIONS
MIN = -0.137

sign that said " RE VERS E ". L ila and
at the starting line . " Ready , set , go
. The vet showed Wh isk ers the x - ray
and John chased Jenny . " You can 't catch me
reached its top speed ! " Whe e , this is
" Â M ira mi mu Ã
and look around . " Wow , we are
for playing a game called ' skip rope '. Both Sus
and the flowers . " You are a great
half for himself . They both take a big
. Whe e ! " Wow , Tom ,
in laughter and said , ' These new clothes look incredible
in the desert . " Wow , look at
like a clown . They ran around the room
does . â They kept walking until Jack
to the park . At the park , they
on the surface . " Wow , it is
and opened it . They saw many pages with
nod his head . L ily looked around for
to the middle . " I do , I

INTERVAL 0.048 - 0.109
CONTAINS 2.517%

small opening . He remembered his mom 's necklace and thought
a brave boy who wanted to solve a problem . He
stop the robot . But they do not know how .
The lizard quickly moved back to its home under a big
girl offered to help the bird get warm . She took

INTERVAL -0.014 - 0.048
CONTAINS 71.551%

, she smiled , knowing she had done it all by
Suddenly , there was a knock on the door .
animal ?" Her mom says , " That is
icked Anna 's hand . Anna gigg led and stro ked
She lived in a cave underground . Every day she would

INTERVAL -0.075 - -0.014
CONTAINS 24.414%

But as the months passed , something strange started happening .
. " Help , help !" they cry .
, so he asked , " What is it ?" The
couldn 't decide who was right and who was wrong .
come inside now , it 's time for dinner !" Annie

INTERVAL -0.137 - -0.075
CONTAINS 1.519%

much more . The dragon and Princess stayed
my 's mom said , " Let 's go to the
?" Lily asks . " It is a tail
of her work . L ily saw that there
strong . â G ian na thought this
Token raven
Token ID37735
Feature activation+1.48e-02

Pos logit contributions
P+0.019
ww+0.017
Now+0.017
bly+0.017
lish+0.016
Neg logit contributions
 smile-0.013
 cry-0.013
 tell-0.013
 bed-0.013
 call-0.013
Token wanted
Token ID2227
Feature activation+4.21e-02

Pos logit contributions
 bed+0.024
 smile+0.024
 cry+0.023
 tell+0.022
 stomach+0.022
Neg logit contributions
P-0.036
ww-0.033
ha-0.031
lish-0.031
Now-0.031
Token to
Token ID284
Feature activation+6.35e-02

Pos logit contributions
 smile+0.076
 cry+0.075
 gaze+0.074
sing+0.074
 comment+0.071
Neg logit contributions
P-0.080
A-0.071
ha-0.068
ww-0.066
Now-0.066
Token help
Token ID1037
Feature activation+5.17e-02

Pos logit contributions
sing+0.089
 eyes+0.087
ses+0.086
 fault+0.085
 weeping+0.083
Neg logit contributions
P-0.146
Now-0.129
O-0.127
A-0.126
Hey-0.126
Token the
Token ID262
Feature activation-3.93e-05

Pos logit contributions
ses+0.100
 fault+0.098
sing+0.095
sed+0.093
 gaze+0.091
Neg logit contributions
P-0.091
lish-0.080
Hey-0.077
L-0.077
Now-0.077
Token girl
Token ID2576
Feature activation+1.09e-01

Pos logit contributions
P+0.000
ww+0.000
Now+0.000
bly+0.000
lish+0.000
Neg logit contributions
 smile-0.000
 cry-0.000
 tell-0.000
 bed-0.000
 call-0.000
Token.
Token ID13
Feature activation+7.59e-03

Pos logit contributions
ses+0.227
sed+0.222
sing+0.178
 eyes+0.171
 mouths+0.170
Neg logit contributions
sc-0.140
"-0.128
it-0.121
P-0.120
wing-0.120
Token He
Token ID679
Feature activation+4.19e-03

Pos logit contributions
 cry+0.011
 smile+0.011
 bed+0.011
 tell+0.011
 call+0.010
Neg logit contributions
P-0.019
ww-0.018
Now-0.018
lish-0.017
bly-0.017
Token tried
Token ID3088
Feature activation+5.09e-02

Pos logit contributions
 smile+0.006
 cry+0.006
 bed+0.006
 tell+0.006
 stomach+0.006
Neg logit contributions
P-0.011
ww-0.010
Now-0.010
ha-0.009
bly-0.009
Token to
Token ID284
Feature activation+3.31e-02

Pos logit contributions
 gaze+0.099
 stomach+0.097
ses+0.097
 stare+0.096
sing+0.095
Neg logit contributions
B-0.075
P-0.072
Soon-0.066
Sp-0.065
sc-0.064
Token talk
Token ID1561
Feature activation+2.20e-03

Pos logit contributions
 stomach+0.039
 tum+0.036
 eyes+0.036
 mouth+0.035
 throat+0.035
Neg logit contributions
P-0.089
bly-0.082
Now-0.079
ww-0.076
B-0.076
TokenThe
Token ID464
Feature activation+1.15e-02

Pos logit contributions
P+0.070
ww+0.069
bly+0.067
sc+0.067
lish+0.066
Neg logit contributions
 cry-0.119
 tell-0.115
 call-0.114
 smile-0.113
 Her-0.112
Token mom
Token ID1995
Feature activation+5.02e-03

Pos logit contributions
 smile+0.009
 cry+0.009
 tell+0.009
 stomach+0.008
 gaze+0.008
Neg logit contributions
P-0.035
ww-0.034
Now-0.033
bly-0.032
lish-0.032
Tokenmy
Token ID1820
Feature activation-5.23e-03

Pos logit contributions
 bed+0.007
 smile+0.007
 cry+0.007
 stomach+0.007
 tell+0.007
Neg logit contributions
P-0.013
ww-0.012
Now-0.012
bly-0.011
ha-0.011
Token heard
Token ID2982
Feature activation+6.64e-03

Pos logit contributions
P+0.014
ww+0.014
Now+0.013
bly+0.013
lish+0.013
Neg logit contributions
 smile-0.007
 cry-0.007
 bed-0.007
 tell-0.007
 call-0.006
Token her
Token ID607
Feature activation+6.23e-02

Pos logit contributions
 smile+0.009
 cry+0.009
 tell+0.009
 stomach+0.008
 bed+0.008
Neg logit contributions
P-0.017
ww-0.016
Now-0.016
bly-0.016
lish-0.015
Token daughter
Token ID4957
Feature activation+1.06e-01

Pos logit contributions
 stomach+0.092
 bed+0.088
 tum+0.082
 throat+0.079
 collection+0.076
Neg logit contributions
lish-0.116
ww-0.113
Just-0.113
ing-0.111
bly-0.110
Token scream
Token ID8196
Feature activation+1.92e-02

Pos logit contributions
ses+0.137
 fault+0.125
sed+0.121
 stomach+0.119
 throat+0.118
Neg logit contributions
it-0.204
lish-0.200
bly-0.196
aming-0.195
ing-0.192
Token and
Token ID290
Feature activation+2.88e-02

Pos logit contributions
 smile+0.026
 tell+0.026
 cry+0.026
 stomach+0.025
 gaze+0.025
Neg logit contributions
P-0.046
lish-0.045
bly-0.045
ww-0.044
Now-0.043
Token came
Token ID1625
Feature activation+1.72e-02

Pos logit contributions
 bed+0.050
 stomach+0.049
 fault+0.047
 smile+0.045
 question+0.044
Neg logit contributions
P-0.062
ww-0.056
bly-0.056
ha-0.055
lish-0.054
Token running
Token ID2491
Feature activation-1.37e-02

Pos logit contributions
 stomach+0.030
 smile+0.029
 cry+0.029
 tum+0.028
 tell+0.028
Neg logit contributions
P-0.039
Now-0.035
bly-0.034
ww-0.034
Spl-0.033
Token.
Token ID13
Feature activation+1.86e-02

Pos logit contributions
P+0.035
ww+0.033
Now+0.032
bly+0.032
lish+0.031
Neg logit contributions
 smile-0.021
 cry-0.021
 tell-0.020
 bed-0.020
 call-0.020
Token
Token ID198
Feature activation-8.10e-02

Pos logit contributions
 smile+0.009
 cry+0.009
 tell+0.009
 bed+0.009
 stomach+0.009
Neg logit contributions
P-0.023
Now-0.022
ww-0.022
Soon-0.021
bly-0.021
TokenHe
Token ID1544
Feature activation+5.53e-03

Pos logit contributions
iest+0.144
bly+0.138
ww+0.137
ier+0.135
w+0.134
Neg logit contributions
 tell-0.165
 cry-0.163
 know-0.159
 find-0.158
 call-0.155
Token soon
Token ID2582
Feature activation+8.33e-03

Pos logit contributions
 smile+0.008
 cry+0.008
 bed+0.008
 tell+0.008
 stomach+0.008
Neg logit contributions
P-0.014
ww-0.013
Now-0.013
bly-0.012
ha-0.012
Token heard
Token ID2982
Feature activation+5.34e-03

Pos logit contributions
 cry+0.014
 smile+0.014
 tell+0.014
 bed+0.014
 call+0.014
Neg logit contributions
P-0.019
ww-0.017
Now-0.017
ha-0.017
bly-0.016
Token the
Token ID262
Feature activation+1.58e-02

Pos logit contributions
 smile+0.009
 cry+0.009
 tell+0.009
 bed+0.008
 call+0.008
Neg logit contributions
P-0.012
ww-0.012
Now-0.011
bly-0.011
lish-0.011
Token girl
Token ID2576
Feature activation+1.06e-01

Pos logit contributions
 tell+0.028
 bed+0.027
 smile+0.027
 cry+0.027
 call+0.027
Neg logit contributions
P-0.034
ww-0.032
lish-0.031
Now-0.030
bly-0.029
Token's
Token ID338
Feature activation+7.39e-02

Pos logit contributions
ses+0.186
 fault+0.184
 stomach+0.170
sed+0.169
 throat+0.164
Neg logit contributions
lish-0.160
it-0.155
aming-0.150
sc-0.148
ing-0.145
Token voice
Token ID3809
Feature activation+4.68e-02

Pos logit contributions
 tell+0.090
ric+0.085
 stare+0.083
 +0.082
 ask+0.082
Neg logit contributions
lish-0.139
ww-0.119
P-0.118
bly-0.116
Days-0.116
Token.
Token ID13
Feature activation+1.27e-02

Pos logit contributions
 stomach+0.077
 bed+0.076
 throat+0.075
ses+0.072
 smile+0.071
Neg logit contributions
lish-0.092
bly-0.088
P-0.086
ww-0.085
Now-0.085
Token She
Token ID1375
Feature activation-2.64e-03

Pos logit contributions
 cry+0.018
 smile+0.018
 tell+0.018
 bed+0.017
 call+0.016
Neg logit contributions
P-0.032
ww-0.030
Now-0.030
lish-0.029
bly-0.029
Token said
Token ID531
Feature activation-2.07e-02

Pos logit contributions
P+0.007
ww+0.006
Now+0.006
bly+0.006
lish+0.006
Neg logit contributions
 cry-0.004
 smile-0.004
 bed-0.004
 tell-0.004
 stomach-0.004
Token didn
Token ID1422
Feature activation-1.14e-02

Pos logit contributions
 bed+0.035
 cry+0.033
 smile+0.033
 tell+0.032
 stomach+0.031
Neg logit contributions
P-0.052
ww-0.047
Now-0.046
lish-0.046
ha-0.045
Token't
Token ID470
Feature activation+2.80e-02

Pos logit contributions
P+0.026
ww+0.024
Now+0.023
bly+0.023
lish+0.023
Neg logit contributions
 smile-0.020
 cry-0.020
 tell-0.020
 bed-0.020
 call-0.019
Token mind
Token ID2000
Feature activation+2.81e-02

Pos logit contributions
 bed+0.030
 stomach+0.028
 eyes+0.028
 mouth+0.027
 throat+0.026
Neg logit contributions
P-0.079
ww-0.074
ha-0.073
lish-0.071
Now-0.071
Token helping
Token ID5742
Feature activation+3.91e-02

Pos logit contributions
 gaze+0.037
 tell+0.037
 smile+0.036
 bed+0.036
 search+0.036
Neg logit contributions
P-0.066
lish-0.065
bly-0.064
ww-0.063
it-0.062
Token the
Token ID262
Feature activation+3.03e-02

Pos logit contributions
 gaze+0.069
 smile+0.068
 cry+0.068
 tell+0.065
 weeping+0.065
Neg logit contributions
P-0.071
lish-0.064
ww-0.064
Now-0.063
bly-0.062
Token bird
Token ID6512
Feature activation+1.03e-01

Pos logit contributions
 smile+0.059
 tell+0.058
 gaze+0.056
 bed+0.056
 cry+0.056
Neg logit contributions
P-0.057
ww-0.055
Now-0.049
lish-0.047
ha-0.047
Token.
Token ID13
Feature activation+9.05e-03

Pos logit contributions
ses+0.203
sed+0.195
 eyes+0.171
 throat+0.167
 bed+0.167
Neg logit contributions
sc-0.155
P-0.136
it-0.134
"-0.133
hest-0.130
Token They
Token ID1119
Feature activation+3.94e-02

Pos logit contributions
 cry+0.013
 smile+0.013
 tell+0.012
 bed+0.012
 call+0.012
Neg logit contributions
P-0.023
Now-0.022
ww-0.022
Soon-0.021
lish-0.020
Token walked
Token ID6807
Feature activation-1.41e-02

Pos logit contributions
 bed+0.071
 cry+0.068
 stomach+0.066
 smile+0.064
 crying+0.064
Neg logit contributions
P-0.085
bly-0.074
ha-0.072
Now-0.070
lish-0.070
Token and
Token ID290
Feature activation+2.76e-02

Pos logit contributions
P+0.035
ww+0.033
Now+0.032
bly+0.032
lish+0.031
Neg logit contributions
 smile-0.023
 cry-0.022
 tell-0.022
 bed-0.022
 call-0.021
Token walked
Token ID6807
Feature activation-9.51e-03

Pos logit contributions
 cry+0.045
 stomach+0.044
 smile+0.044
 question+0.043
 bed+0.041
Neg logit contributions
P-0.066
ww-0.057
Now-0.057
ha-0.057
bly-0.055
Token kind
Token ID1611
Feature activation+1.73e-02

Pos logit contributions
P+0.021
ww+0.020
Now+0.019
bly+0.019
lish+0.019
Neg logit contributions
 smile-0.012
 cry-0.012
 tell-0.012
 bed-0.012
 call-0.012
Token farmer
Token ID18739
Feature activation+2.97e-03

Pos logit contributions
 cry+0.030
 tell+0.029
 smile+0.029
 bed+0.029
 gaze+0.028
Neg logit contributions
P-0.039
ww-0.037
Now-0.034
ha-0.034
lish-0.033
Token came
Token ID1625
Feature activation+9.61e-03

Pos logit contributions
 smile+0.004
 cry+0.004
 bed+0.004
 tell+0.004
 stomach+0.004
Neg logit contributions
P-0.008
ww-0.007
Now-0.007
bly-0.007
ha-0.007
Token to
Token ID284
Feature activation+3.60e-02

Pos logit contributions
 smile+0.014
 cry+0.013
 stomach+0.013
 bed+0.013
 tell+0.013
Neg logit contributions
P-0.025
Now-0.023
ww-0.023
bly-0.022
Soon-0.022
Token the
Token ID262
Feature activation-7.42e-04

Pos logit contributions
 stomach+0.046
 eyes+0.045
 cry+0.045
 smile+0.044
 weeping+0.044
Neg logit contributions
P-0.093
Now-0.086
ww-0.081
Hey-0.081
Soon-0.080
Token pig
Token ID12967
Feature activation+1.02e-01

Pos logit contributions
P+0.002
ww+0.002
Now+0.002
bly+0.002
lish+0.002
Neg logit contributions
 smile-0.001
 cry-0.001
 tell-0.001
 bed-0.001
 call-0.001
Token and
Token ID290
Feature activation+9.38e-03

Pos logit contributions
ses+0.218
 cry+0.210
 eyes+0.208
 smile+0.207
sed+0.203
Neg logit contributions
P-0.155
Now-0.138
it-0.129
"-0.122
sc-0.120
Token said
Token ID531
Feature activation-3.59e-03

Pos logit contributions
 cry+0.013
 smile+0.013
 bed+0.012
 stomach+0.012
 tell+0.012
Neg logit contributions
P-0.025
ww-0.023
Now-0.022
bly-0.022
ha-0.022
Token,
Token ID11
Feature activation+1.12e-02

Pos logit contributions
P+0.011
ww+0.010
bly+0.010
Now+0.010
lish+0.010
Neg logit contributions
 smile-0.004
 cry-0.004
 tell-0.003
 bed-0.003
 call-0.003
Token "
Token ID366
Feature activation-7.44e-02

Pos logit contributions
 cry+0.016
 smile+0.016
 tell+0.016
 bed+0.015
 stomach+0.015
Neg logit contributions
P-0.028
ww-0.027
Now-0.026
bly-0.026
lish-0.025
TokenDon
Token ID3987
Feature activation-4.45e-02

Pos logit contributions
bly+0.119
ww+0.117
br+0.110
lish+0.109
hest+0.104
Neg logit contributions
 cry-0.150
 tell-0.150
 call-0.149
 crying-0.149
 say-0.147
Token felt
Token ID2936
Feature activation+8.05e-03

Pos logit contributions
 bed+0.013
 cry+0.013
 smile+0.013
 tell+0.013
 call+0.012
Neg logit contributions
P-0.022
Now-0.020
ww-0.020
bly-0.019
lish-0.019
Token better
Token ID1365
Feature activation+1.67e-02

Pos logit contributions
 bed+0.012
 smile+0.012
 tell+0.012
 call+0.012
 cry+0.011
Neg logit contributions
P-0.020
Now-0.018
ww-0.018
bly-0.018
lish-0.018
Token and
Token ID290
Feature activation+2.27e-02

Pos logit contributions
 cry+0.027
 bed+0.027
 smile+0.027
 call+0.026
 question+0.026
Neg logit contributions
P-0.039
Now-0.036
ww-0.035
lish-0.034
bly-0.034
Token went
Token ID1816
Feature activation+2.27e-02

Pos logit contributions
 question+0.038
 cry+0.038
 bed+0.038
 smile+0.037
 call+0.036
Neg logit contributions
P-0.050
ww-0.046
Now-0.046
bly-0.044
lish-0.043
Token back
Token ID736
Feature activation+5.50e-02

Pos logit contributions
 cry+0.035
 smile+0.033
 stomach+0.032
 eyes+0.032
 question+0.032
Neg logit contributions
P-0.057
Now-0.050
bly-0.049
ww-0.049
Soon-0.047
Token to
Token ID284
Feature activation+1.02e-01

Pos logit contributions
 stomach+0.110
 cry+0.108
 eyes+0.107
 smile+0.104
 tongue+0.104
Neg logit contributions
P-0.103
B-0.084
Now-0.083
Spl-0.078
CR-0.077
Token the
Token ID262
Feature activation-2.95e-03

Pos logit contributions
 eye+0.186
ses+0.182
 eyes+0.178
 question+0.176
 fault+0.175
Neg logit contributions
Now-0.113
P-0.104
Good-0.094
But-0.092
bly-0.089
Token barn
Token ID25203
Feature activation+8.18e-02

Pos logit contributions
P+0.008
ww+0.007
Now+0.007
bly+0.007
lish+0.007
Neg logit contributions
 smile-0.004
 cry-0.004
 tell-0.004
 bed-0.004
 call-0.004
Token to
Token ID284
Feature activation+6.01e-02

Pos logit contributions
 cry+0.143
 question+0.137
 stare+0.131
 gaze+0.129
 smile+0.128
Neg logit contributions
Now-0.137
P-0.135
it-0.134
sc-0.132
Soon-0.127
Token play
Token ID711
Feature activation-1.07e-02

Pos logit contributions
 fault+0.080
 eyes+0.079
ses+0.078
sed+0.075
 cries+0.074
Neg logit contributions
P-0.134
Now-0.134
bly-0.127
Soon-0.119
Good-0.119
Token with
Token ID351
Feature activation+2.46e-02

Pos logit contributions
P+0.025
ww+0.024
Now+0.023
bly+0.023
lish+0.022
Neg logit contributions
 smile-0.018
 cry-0.018
 tell-0.018
 bed-0.018
 call-0.017
Token hugged
Token ID41130
Feature activation+3.84e-03

Pos logit contributions
 bed+0.062
 cry+0.060
 eyes+0.059
 weeping+0.059
 crying+0.058
Neg logit contributions
P-0.069
Now-0.062
ww-0.057
ha-0.054
bly-0.054
Token each
Token ID1123
Feature activation-2.41e-03

Pos logit contributions
 cry+0.006
 smile+0.006
 tell+0.006
 bed+0.006
 call+0.006
Neg logit contributions
P-0.009
ww-0.009
Now-0.009
bly-0.009
lish-0.008
Token other
Token ID584
Feature activation-2.06e-02

Pos logit contributions
P+0.006
ww+0.005
Now+0.005
bly+0.005
lish+0.005
Neg logit contributions
 smile-0.004
 cry-0.004
 tell-0.004
 bed-0.004
 call-0.004
Token and
Token ID290
Feature activation+3.64e-02

Pos logit contributions
P+0.060
ww+0.057
Now+0.055
bly+0.055
lish+0.053
Neg logit contributions
 smile-0.025
 cry-0.024
 tell-0.024
 bed-0.023
 call-0.022
Token went
Token ID1816
Feature activation+2.67e-02

Pos logit contributions
 question+0.059
 bed+0.058
 cry+0.058
 weeping+0.057
 gaze+0.057
Neg logit contributions
P-0.081
ww-0.075
Now-0.072
bly-0.069
Sorry-0.068
Token to
Token ID284
Feature activation+1.01e-01

Pos logit contributions
 cry+0.040
 question+0.037
 smile+0.037
 tell+0.036
 gaze+0.036
Neg logit contributions
P-0.068
ww-0.060
Now-0.059
bly-0.059
lish-0.058
Token play
Token ID711
Feature activation-9.59e-03

Pos logit contributions
 stomach+0.172
 grin+0.168
 cheeks+0.167
 eyes+0.166
 tum+0.163
Neg logit contributions
Soon-0.116
 our-0.110
Happy-0.110
Now-0.108
P-0.108
Token somewhere
Token ID7382
Feature activation+6.35e-04

Pos logit contributions
P+0.022
ww+0.020
Now+0.020
bly+0.020
lish+0.019
Neg logit contributions
 smile-0.017
 cry-0.017
 tell-0.017
 bed-0.016
 call-0.016
Token else
Token ID2073
Feature activation+2.24e-02

Pos logit contributions
 cry+0.001
 smile+0.001
 bed+0.001
 tell+0.001
 call+0.001
Neg logit contributions
P-0.001
ww-0.001
Now-0.001
bly-0.001
lish-0.001
Token.
Token ID13
Feature activation+2.77e-02

Pos logit contributions
 cry+0.029
 smile+0.028
 bed+0.028
 tell+0.028
 call+0.027
Neg logit contributions
P-0.059
lish-0.055
bly-0.053
ww-0.053
Now-0.053
Token<|endoftext|>
Token ID50256
Feature activation-4.38e-03

Pos logit contributions
 smile+0.047
 cry+0.046
 tell+0.044
 weep+0.043
 stomach+0.042
Neg logit contributions
P-0.064
Now-0.062
ww-0.061
Soon-0.058
Spl-0.058
Token sad
Token ID6507
Feature activation+4.26e-02

Pos logit contributions
 smile+0.032
 cry+0.032
 bed+0.031
 tell+0.031
 gaze+0.030
Neg logit contributions
P-0.053
bly-0.049
ww-0.047
Now-0.047
lish-0.046
Token.
Token ID13
Feature activation+3.22e-02

Pos logit contributions
 gaze+0.077
 cry+0.075
 call+0.073
 weep+0.073
 stare+0.073
Neg logit contributions
lish-0.073
ww-0.072
P-0.071
Soon-0.066
it-0.066
Token Max
Token ID5436
Feature activation+2.09e-02

Pos logit contributions
 tell+0.056
 cry+0.055
 smile+0.054
 call+0.054
 die+0.052
Neg logit contributions
P-0.065
Now-0.063
ww-0.062
lish-0.059
Soon-0.059
Token went
Token ID1816
Feature activation+1.43e-02

Pos logit contributions
 bed+0.033
 cry+0.032
 smile+0.032
 tell+0.031
 stomach+0.030
Neg logit contributions
P-0.050
ww-0.046
Now-0.044
lish-0.043
ha-0.043
Token to
Token ID284
Feature activation+7.00e-02

Pos logit contributions
 cry+0.022
 stomach+0.021
 smile+0.021
 eyes+0.020
 tell+0.020
Neg logit contributions
P-0.035
Now-0.032
ww-0.031
sc-0.031
bly-0.030
Token Lucy
Token ID22162
Feature activation+1.01e-01

Pos logit contributions
 stomach+0.141
 throat+0.140
 tum+0.136
 mouths+0.135
 mouth+0.133
Neg logit contributions
Now-0.100
Hey-0.098
P-0.096
Go-0.094
O-0.093
Token and
Token ID290
Feature activation+3.86e-02

Pos logit contributions
ses+0.227
sed+0.216
 stomach+0.195
 eyes+0.193
 fault+0.190
Neg logit contributions
sc-0.127
P-0.124
it-0.121
"-0.116
When-0.110
Token asked
Token ID1965
Feature activation+2.55e-02

Pos logit contributions
 cry+0.058
 stomach+0.058
 eyes+0.057
 question+0.056
 smile+0.056
Neg logit contributions
P-0.088
ww-0.080
ha-0.078
Now-0.075
Sorry-0.073
Token,
Token ID11
Feature activation+1.05e-02

Pos logit contributions
 cry+0.040
 gaze+0.040
 smile+0.038
 stomach+0.038
 call+0.038
Neg logit contributions
P-0.056
ww-0.052
lish-0.052
bly-0.051
ha-0.051
Token "
Token ID366
Feature activation-7.10e-02

Pos logit contributions
 cry+0.015
 smile+0.015
 bed+0.015
 tell+0.014
 stomach+0.014
Neg logit contributions
P-0.026
ww-0.025
Now-0.024
bly-0.024
lish-0.023
TokenWhy
Token ID5195
Feature activation+1.60e-02

Pos logit contributions
ww+0.125
bly+0.110
br+0.104
P+0.103
hest+0.103
Neg logit contributions
 tell-0.158
 crying-0.154
 call-0.154
 search-0.152
 smile-0.151
Tokenily
Token ID813
Feature activation+3.53e-03

Pos logit contributions
P+0.081
ww+0.080
bly+0.074
Now+0.069
Sorry+0.068
Neg logit contributions
 cry-0.091
 tell-0.090
 smile-0.090
 bed-0.089
 crying-0.089
Token felt
Token ID2936
Feature activation+2.79e-03

Pos logit contributions
 bed+0.005
 cry+0.005
 smile+0.005
 tell+0.005
 call+0.005
Neg logit contributions
P-0.009
ww-0.008
Now-0.008
bly-0.008
lish-0.008
Token better
Token ID1365
Feature activation+3.98e-03

Pos logit contributions
 bed+0.004
 smile+0.004
 cry+0.004
 tell+0.004
 call+0.004
Neg logit contributions
P-0.007
ww-0.007
Now-0.007
bly-0.007
lish-0.007
Token and
Token ID290
Feature activation+1.66e-02

Pos logit contributions
 smile+0.006
 cry+0.006
 bed+0.006
 tell+0.006
 call+0.006
Neg logit contributions
P-0.010
ww-0.009
Now-0.009
bly-0.009
lish-0.009
Token went
Token ID1816
Feature activation+1.55e-02

Pos logit contributions
 cry+0.028
 bed+0.028
 question+0.028
 smile+0.028
 call+0.027
Neg logit contributions
P-0.036
ww-0.033
Now-0.033
bly-0.032
lish-0.031
Token to
Token ID284
Feature activation+1.00e-01

Pos logit contributions
 cry+0.023
 smile+0.022
 stomach+0.021
 tell+0.021
 question+0.021
Neg logit contributions
P-0.040
Now-0.035
bly-0.035
ww-0.034
lish-0.033
Token play
Token ID711
Feature activation-4.61e-03

Pos logit contributions
sed+0.168
 mouths+0.165
ses+0.164
 tum+0.160
 stomach+0.159
Neg logit contributions
Now-0.121
 your-0.115
But-0.113
P-0.112
Soon-0.112
Token with
Token ID351
Feature activation+3.87e-02

Pos logit contributions
P+0.010
ww+0.009
Now+0.009
bly+0.009
lish+0.009
Neg logit contributions
 cry-0.009
 smile-0.009
 tell-0.008
 bed-0.008
 call-0.008
Token her
Token ID607
Feature activation+3.82e-02

Pos logit contributions
 gaze+0.070
 question+0.069
 cry+0.069
 smile+0.068
 tell+0.067
Neg logit contributions
P-0.078
bly-0.068
lish-0.068
Now-0.067
O-0.064
Token other
Token ID584
Feature activation+2.33e-03

Pos logit contributions
 gaze+0.053
 cry+0.052
 stare+0.052
 tell+0.051
 know+0.050
Neg logit contributions
P-0.091
bly-0.083
lish-0.083
Now-0.079
ww-0.079
Token friends
Token ID2460
Feature activation+2.45e-02

Pos logit contributions
 cry+0.003
 smile+0.003
 tell+0.003
 bed+0.003
 call+0.003
Neg logit contributions
P-0.006
ww-0.006
Now-0.006
bly-0.006
lish-0.006
Token finished
Token ID5201
Feature activation+4.06e-02

Pos logit contributions
 bed+0.007
 smile+0.007
 cry+0.007
 tell+0.007
 call+0.006
Neg logit contributions
P-0.011
ww-0.010
Now-0.010
lish-0.010
bly-0.010
Token her
Token ID607
Feature activation+5.61e-02

Pos logit contributions
 call+0.065
 gaze+0.065
 question+0.064
 smile+0.062
 cry+0.062
Neg logit contributions
P-0.089
Now-0.082
bly-0.078
ww-0.076
Soon-0.074
Token sandwich
Token ID20433
Feature activation+1.24e-02

Pos logit contributions
 smile+0.086
 cry+0.085
 call+0.084
 stare+0.083
 weep+0.082
Neg logit contributions
P-0.123
ww-0.109
w-0.101
lish-0.098
Now-0.096
Token and
Token ID290
Feature activation+2.66e-02

Pos logit contributions
 cry+0.020
 tell+0.019
 smile+0.019
 call+0.019
 question+0.019
Neg logit contributions
P-0.030
Now-0.028
ww-0.027
bly-0.027
lish-0.026
Token went
Token ID1816
Feature activation+3.56e-02

Pos logit contributions
 question+0.045
 cry+0.043
 call+0.042
 gaze+0.042
 smile+0.041
Neg logit contributions
P-0.059
ww-0.055
Now-0.054
bly-0.051
ha-0.050
Token to
Token ID284
Feature activation+1.00e-01

Pos logit contributions
 cry+0.060
 question+0.055
 stomach+0.055
 frown+0.054
 eyes+0.054
Neg logit contributions
P-0.082
Now-0.070
ww-0.070
bly-0.068
Spl-0.067
Token play
Token ID711
Feature activation+4.09e-03

Pos logit contributions
sed+0.162
 fault+0.161
ses+0.157
owing+0.149
ric+0.145
Neg logit contributions
 my-0.149
Happy-0.140
Now-0.140
 your-0.139
P-0.137
Token outside
Token ID2354
Feature activation+9.50e-03

Pos logit contributions
 cry+0.008
 smile+0.008
 tell+0.007
 call+0.007
 gaze+0.007
Neg logit contributions
P-0.009
Now-0.008
ww-0.008
bly-0.008
lish-0.008
Token.
Token ID13
Feature activation+2.03e-02

Pos logit contributions
 cry+0.013
 smile+0.013
 tell+0.012
 call+0.012
 stomach+0.012
Neg logit contributions
P-0.025
Now-0.023
ww-0.022
bly-0.022
lish-0.022
Token She
Token ID1375
Feature activation+1.66e-02

Pos logit contributions
 smile+0.033
 tell+0.032
 cry+0.032
 call+0.030
 stomach+0.030
Neg logit contributions
Now-0.044
P-0.044
ww-0.043
Soon-0.042
CR-0.041
Token saw
Token ID2497
Feature activation+9.36e-03

Pos logit contributions
 cry+0.027
 smile+0.027
 question+0.026
 bed+0.026
 call+0.026
Neg logit contributions
P-0.039
Now-0.035
ww-0.035
ha-0.033
Spl-0.033
Token how
Token ID703
Feature activation-7.66e-03

Pos logit contributions
 smile+0.026
 cry+0.026
 bed+0.024
 gaze+0.024
 call+0.024
Neg logit contributions
P-0.036
ww-0.034
Now-0.033
bly-0.032
Hey-0.031
Token the
Token ID262
Feature activation+4.89e-03

Pos logit contributions
P+0.018
ww+0.017
Now+0.016
bly+0.016
lish+0.016
Neg logit contributions
 smile-0.013
 cry-0.013
 tell-0.013
 bed-0.013
 call-0.012
Token farmer
Token ID18739
Feature activation+3.33e-02

Pos logit contributions
 cry+0.009
 smile+0.009
 bed+0.009
 tell+0.009
 gaze+0.008
Neg logit contributions
P-0.011
ww-0.010
Now-0.010
bly-0.009
lish-0.009
Token helped
Token ID4193
Feature activation+4.11e-02

Pos logit contributions
 bed+0.059
 cry+0.058
 smile+0.056
 tell+0.055
 gaze+0.054
Neg logit contributions
P-0.071
Now-0.066
ww-0.064
bly-0.063
lish-0.062
Token the
Token ID262
Feature activation+9.25e-03

Pos logit contributions
 smile+0.063
 bed+0.062
 cry+0.061
 weeping+0.061
ses+0.060
Neg logit contributions
P-0.089
bly-0.080
lish-0.079
ww-0.078
Now-0.078
Token cow
Token ID9875
Feature activation+1.00e-01

Pos logit contributions
 cry+0.017
 smile+0.017
 bed+0.016
 tell+0.016
 gaze+0.016
Neg logit contributions
P-0.020
ww-0.019
Now-0.018
lish-0.017
bly-0.017
Token.
Token ID13
Feature activation+1.94e-03

Pos logit contributions
ses+0.134
sed+0.126
 eyes+0.116
 weeping+0.115
 smile+0.112
Neg logit contributions
bly-0.190
wing-0.188
Now-0.188
sc-0.186
P-0.186
Token They
Token ID1119
Feature activation+3.69e-02

Pos logit contributions
 cry+0.003
 smile+0.003
 tell+0.003
 bed+0.002
 call+0.002
Neg logit contributions
P-0.005
ww-0.005
Now-0.005
bly-0.005
lish-0.005
Token knew
Token ID2993
Feature activation+1.13e-02

Pos logit contributions
 cry+0.063
 gaze+0.058
 bed+0.058
 question+0.057
 stomach+0.057
Neg logit contributions
P-0.082
Now-0.073
bly-0.072
ww-0.071
ha-0.071
Token he
Token ID339
Feature activation+4.97e-03

Pos logit contributions
 cry+0.021
 smile+0.020
 call+0.019
 weep+0.019
 tell+0.019
Neg logit contributions
P-0.024
ww-0.023
bly-0.022
Now-0.022
lish-0.021
Token was
Token ID373
Feature activation+3.38e-04

Pos logit contributions
 bed+0.009
 cry+0.009
 smile+0.009
 tell+0.008
 stomach+0.008
Neg logit contributions
P-0.011
ww-0.010
Now-0.010
bly-0.010
lish-0.010
Token car
Token ID1097
Feature activation+1.82e-03

Pos logit contributions
 cry+0.036
 tell+0.036
 smile+0.036
 gaze+0.034
 stomach+0.034
Neg logit contributions
P-0.042
ww-0.040
bly-0.038
Now-0.038
ha-0.037
Token?"
Token ID1701
Feature activation+4.62e-02

Pos logit contributions
 cry+0.003
 smile+0.003
 tell+0.003
 bed+0.003
 call+0.003
Neg logit contributions
P-0.004
ww-0.004
Now-0.004
bly-0.004
lish-0.004
Token His
Token ID2399
Feature activation+4.61e-02

Pos logit contributions
 smile+0.067
 cry+0.065
 tell+0.062
 die+0.062
 question+0.060
Neg logit contributions
P-0.109
Now-0.102
Soon-0.098
Just-0.098
A-0.097
Token mom
Token ID1995
Feature activation+3.15e-02

Pos logit contributions
 stare+0.057
 gaze+0.054
 tell+0.053
 know+0.052
 feel+0.051
Neg logit contributions
P-0.110
ww-0.100
lish-0.099
bly-0.098
B-0.096
Token heard
Token ID2982
Feature activation+1.23e-02

Pos logit contributions
 bed+0.054
 stomach+0.052
 fault+0.049
 cry+0.049
ses+0.049
Neg logit contributions
P-0.069
bly-0.064
lish-0.062
ha-0.061
Now-0.061
Token him
Token ID683
Feature activation+1.00e-01

Pos logit contributions
 smile+0.017
 cry+0.017
 stomach+0.016
 tell+0.016
 question+0.016
Neg logit contributions
P-0.031
Now-0.029
ww-0.029
bly-0.028
lish-0.028
Token and
Token ID290
Feature activation+1.23e-03

Pos logit contributions
 tum+0.201
ses+0.198
 stomach+0.195
 fault+0.194
sed+0.194
Neg logit contributions
it-0.122
lish-0.119
ing-0.119
bly-0.110
aming-0.109
Token came
Token ID1625
Feature activation+2.37e-02

Pos logit contributions
 cry+0.002
 smile+0.002
 bed+0.002
 tell+0.002
 stomach+0.002
Neg logit contributions
P-0.003
ww-0.003
bly-0.003
Now-0.003
lish-0.003
Token to
Token ID284
Feature activation+3.43e-02

Pos logit contributions
 stomach+0.041
 cry+0.039
 smile+0.039
 tum+0.038
 throat+0.038
Neg logit contributions
P-0.052
Now-0.047
bly-0.046
Spl-0.045
Soon-0.045
Token help
Token ID1037
Feature activation+3.23e-02

Pos logit contributions
 stomach+0.043
 mouth+0.039
 eyes+0.039
 throat+0.039
 tum+0.038
Neg logit contributions
P-0.093
Now-0.087
Hey-0.086
Good-0.084
O-0.082
Token.
Token ID13
Feature activation+3.58e-02

Pos logit contributions
 stomach+0.049
 smile+0.047
 throat+0.047
ses+0.047
 question+0.046
Neg logit contributions
P-0.072
bly-0.068
lish-0.066
Now-0.066
CR-0.064
Token the
Token ID262
Feature activation-6.48e-03

Pos logit contributions
P+0.011
ww+0.011
Now+0.011
bly+0.010
lish+0.010
Neg logit contributions
 smile-0.006
 cry-0.006
 tell-0.006
 bed-0.006
 call-0.005
Token truck
Token ID7779
Feature activation+3.08e-02

Pos logit contributions
P+0.017
ww+0.016
Now+0.015
bly+0.015
lish+0.015
Neg logit contributions
 smile-0.010
 cry-0.009
 tell-0.009
 bed-0.009
 call-0.009
Token heard
Token ID2982
Feature activation+9.34e-03

Pos logit contributions
 bed+0.052
 cry+0.051
 smile+0.049
 weeping+0.049
 tell+0.049
Neg logit contributions
P-0.064
Now-0.061
ww-0.058
sc-0.058
lish-0.057
Token the
Token ID262
Feature activation+1.90e-02

Pos logit contributions
 smile+0.014
 cry+0.014
 tell+0.013
 stomach+0.013
 gaze+0.013
Neg logit contributions
P-0.023
ww-0.022
Now-0.021
bly-0.021
lish-0.020
Token little
Token ID1310
Feature activation-1.30e-02

Pos logit contributions
 tell+0.031
 smile+0.031
 bed+0.029
 cry+0.029
 gaze+0.029
Neg logit contributions
P-0.041
ww-0.040
Now-0.039
lish-0.038
bly-0.037
Token girl
Token ID2576
Feature activation+9.94e-02

Pos logit contributions
P+0.030
ww+0.028
Now+0.027
bly+0.026
lish+0.025
Neg logit contributions
 smile-0.023
 cry-0.023
 tell-0.023
 bed-0.022
 call-0.022
Token's
Token ID338
Feature activation+6.36e-02

Pos logit contributions
ses+0.168
 fault+0.164
 imagination+0.161
 bed+0.161
 stomach+0.160
Neg logit contributions
lish-0.153
it-0.150
sc-0.145
aw-0.144
bly-0.139
Token voice
Token ID3809
Feature activation+3.64e-02

Pos logit contributions
 bed+0.093
 haven+0.091
 stomach+0.091
 die+0.090
 tell+0.087
Neg logit contributions
lish-0.102
bly-0.093
Just-0.092
aming-0.090
Now-0.090
Token.
Token ID13
Feature activation+2.09e-02

Pos logit contributions
 bed+0.050
 stomach+0.050
 gaze+0.048
 smile+0.047
 cry+0.046
Neg logit contributions
lish-0.083
P-0.079
bly-0.078
Now-0.076
ww-0.076
Token He
Token ID679
Feature activation-3.15e-03

Pos logit contributions
 smile+0.037
 cry+0.036
 tell+0.036
 bed+0.036
 stomach+0.035
Neg logit contributions
P-0.042
Now-0.041
ww-0.040
Hey-0.038
lish-0.038
Token looked
Token ID3114
Feature activation+6.17e-03

Pos logit contributions
P+0.008
ww+0.008
Now+0.007
bly+0.007
lish+0.007
Neg logit contributions
 cry-0.004
 smile-0.004
 bed-0.004
 tell-0.004
 stomach-0.004
Token's
Token ID338
Feature activation-5.03e-03

Pos logit contributions
 smile+0.017
 bed+0.017
 cry+0.017
 tell+0.016
 call+0.016
Neg logit contributions
P-0.025
ww-0.024
bly-0.022
Now-0.022
ha-0.022
Token wrong
Token ID2642
Feature activation-5.54e-03

Pos logit contributions
P+0.011
ww+0.010
bly+0.010
Now+0.010
lish+0.009
Neg logit contributions
 smile-0.010
 cry-0.009
 tell-0.009
 bed-0.009
 call-0.009
Token?"
Token ID1701
Feature activation+2.26e-02

Pos logit contributions
P+0.014
ww+0.013
Now+0.013
bly+0.013
lish+0.013
Neg logit contributions
 smile-0.008
 cry-0.008
 tell-0.008
 bed-0.008
 call-0.008
Token asked
Token ID1965
Feature activation+1.35e-02

Pos logit contributions
 smile+0.033
 bed+0.031
 stomach+0.031
 cry+0.030
 tell+0.030
Neg logit contributions
P-0.057
ww-0.053
Now-0.051
bly-0.051
lish-0.050
Token the
Token ID262
Feature activation+1.65e-02

Pos logit contributions
 cry+0.021
 smile+0.021
 bed+0.020
 tell+0.020
 stomach+0.019
Neg logit contributions
P-0.033
ww-0.031
bly-0.030
Now-0.029
lish-0.029
Token bear
Token ID6842
Feature activation+9.93e-02

Pos logit contributions
 cry+0.024
 smile+0.024
 tell+0.023
 bed+0.023
 gaze+0.023
Neg logit contributions
P-0.043
lish-0.038
ww-0.038
bly-0.037
Now-0.037
Token.
Token ID13
Feature activation+3.54e-02

Pos logit contributions
ses+0.185
 bed+0.174
 fault+0.168
 imagination+0.163
sed+0.162
Neg logit contributions
it-0.179
P-0.174
lish-0.171
bly-0.167
Just-0.159
Token
Token ID220
Feature activation+5.00e-02

Pos logit contributions
 cry+0.057
 smile+0.057
 tell+0.054
 bed+0.054
 weep+0.053
Neg logit contributions
P-0.085
Now-0.075
A-0.074
ww-0.074
O-0.074
Token
Token ID198
Feature activation-7.60e-03

Pos logit contributions
 smile+0.083
 cry+0.080
 gaze+0.079
 stomach+0.078
 bed+0.077
Neg logit contributions
P-0.119
Now-0.109
Soon-0.108
it-0.106
bly-0.103
Token
Token ID198
Feature activation-6.53e-02

Pos logit contributions
P+0.023
ww+0.022
Now+0.022
bly+0.021
lish+0.021
Neg logit contributions
 smile-0.008
 cry-0.008
 tell-0.007
 bed-0.007
 call-0.007
TokenThe
Token ID464
Feature activation+2.05e-03

Pos logit contributions
bly+0.102
ww+0.100
lish+0.098
iest+0.096
P+0.095
Neg logit contributions
 call-0.146
 tell-0.144
 smile-0.143
 question-0.142
 find-0.140
Token tears
Token ID10953
Feature activation+8.46e-03

Pos logit contributions
 tell+0.031
 smile+0.030
 cry+0.030
 call+0.030
 question+0.030
Neg logit contributions
P-0.046
ww-0.044
bly-0.043
Now-0.042
lish-0.041
Token.
Token ID13
Feature activation+1.81e-02

Pos logit contributions
 tell+0.011
 cry+0.011
 smile+0.011
 bed+0.010
 call+0.010
Neg logit contributions
P-0.023
ww-0.021
bly-0.021
lish-0.021
Now-0.021
Token Tim
Token ID5045
Feature activation+2.01e-02

Pos logit contributions
 cry+0.027
 smile+0.027
 tell+0.027
 bed+0.026
 call+0.026
Neg logit contributions
P-0.046
Now-0.043
ww-0.042
Soon-0.040
Hey-0.039
Token went
Token ID1816
Feature activation+4.93e-03

Pos logit contributions
 bed+0.033
 cry+0.030
 smile+0.030
 stomach+0.030
 tell+0.029
Neg logit contributions
P-0.049
ww-0.043
Now-0.043
lish-0.041
ha-0.041
Token to
Token ID284
Feature activation+6.74e-02

Pos logit contributions
 cry+0.007
 smile+0.007
 stomach+0.007
 tell+0.007
 bed+0.007
Neg logit contributions
P-0.013
Now-0.011
ww-0.011
bly-0.011
lish-0.011
Token Sue
Token ID26089
Feature activation+9.91e-02

Pos logit contributions
 throat+0.132
 stomach+0.125
 tum+0.124
 fault+0.123
 mouths+0.121
Neg logit contributions
Hey-0.112
P-0.107
O-0.104
Go-0.103
Now-0.100
Token and
Token ID290
Feature activation+4.67e-02

Pos logit contributions
ses+0.256
sed+0.247
 stomach+0.221
 throat+0.218
 eyes+0.214
Neg logit contributions
sc-0.104
"-0.100
P-0.098
it-0.091
-0.084
Token asked
Token ID1965
Feature activation+2.74e-02

Pos logit contributions
 bed+0.080
 stomach+0.077
 cry+0.077
 eyes+0.076
 fault+0.076
Neg logit contributions
P-0.102
ww-0.087
ha-0.083
Sorry-0.081
Now-0.081
Token,
Token ID11
Feature activation+2.21e-02

Pos logit contributions
 cry+0.045
 weep+0.043
 bed+0.043
 gaze+0.043
 smile+0.042
Neg logit contributions
P-0.062
ww-0.055
lish-0.054
ha-0.054
bly-0.054
Token "
Token ID366
Feature activation-5.96e-02

Pos logit contributions
 cry+0.035
 bed+0.034
 smile+0.033
 stomach+0.033
 tell+0.033
Neg logit contributions
P-0.052
bly-0.048
Now-0.048
ww-0.047
Soon-0.046
TokenWhy
Token ID5195
Feature activation+1.94e-02

Pos logit contributions
ww+0.103
bly+0.093
br+0.085
lish+0.084
P+0.084
Neg logit contributions
 tell-0.137
 call-0.132
 crying-0.129
 cry-0.129
 find-0.128
Token
Token ID198
Feature activation-5.43e-02

Pos logit contributions
 smile+0.072
 stomach+0.068
 cry+0.068
 throat+0.067
 eyes+0.067
Neg logit contributions
P-0.107
Now-0.100
Soon-0.098
ww-0.094
ha-0.092
TokenThe
Token ID464
Feature activation-1.68e-02

Pos logit contributions
bly+0.072
ww+0.071
iest+0.068
sc+0.068
P+0.068
Neg logit contributions
 cry-0.137
 tell-0.131
 Her-0.130
 call-0.129
 crying-0.128
Token farmer
Token ID18739
Feature activation-3.25e-03

Pos logit contributions
P+0.041
ww+0.039
bly+0.038
Now+0.037
lish+0.036
Neg logit contributions
 smile-0.027
 tell-0.027
 cry-0.026
 call-0.026
 bed-0.026
Token heard
Token ID2982
Feature activation+1.40e-02

Pos logit contributions
P+0.008
ww+0.008
Now+0.008
bly+0.007
lish+0.007
Neg logit contributions
 smile-0.005
 cry-0.005
 bed-0.005
 tell-0.005
 call-0.005
Token the
Token ID262
Feature activation+7.41e-03

Pos logit contributions
 smile+0.021
 cry+0.020
 stomach+0.019
 gaze+0.019
 tell+0.019
Neg logit contributions
P-0.034
ww-0.032
Now-0.032
bly-0.031
lish-0.030
Token cow
Token ID9875
Feature activation+9.89e-02

Pos logit contributions
 smile+0.013
 bed+0.012
 cry+0.012
 tell+0.012
 stomach+0.012
Neg logit contributions
P-0.017
ww-0.016
Now-0.016
lish-0.015
bly-0.015
Token and
Token ID290
Feature activation+2.34e-03

Pos logit contributions
 stomach+0.138
ses+0.137
 throat+0.137
 imagination+0.137
 fault+0.133
Neg logit contributions
lish-0.165
bly-0.165
ing-0.165
it-0.164
wing-0.164
Token came
Token ID1625
Feature activation+2.00e-02

Pos logit contributions
 smile+0.004
 cry+0.004
 bed+0.004
 tell+0.003
 stomach+0.003
Neg logit contributions
P-0.006
ww-0.005
Now-0.005
bly-0.005
lish-0.005
Token to
Token ID284
Feature activation+3.05e-02

Pos logit contributions
 stomach+0.032
 smile+0.031
 bed+0.030
 cry+0.030
 tum+0.030
Neg logit contributions
P-0.047
Now-0.043
bly-0.042
Soon-0.041
Spl-0.041
Token help
Token ID1037
Feature activation+2.75e-02

Pos logit contributions
 stomach+0.036
 eyes+0.033
 mouth+0.033
 throat+0.032
 bed+0.031
Neg logit contributions
P-0.084
Now-0.080
Hey-0.077
Good-0.076
O-0.075
Token.
Token ID13
Feature activation+1.34e-02

Pos logit contributions
 smile+0.038
 stomach+0.037
 eyes+0.035
 gaze+0.035
 bed+0.035
Neg logit contributions
P-0.067
Now-0.062
lish-0.062
bly-0.061
ww-0.060
Token felt
Token ID2936
Feature activation+3.79e-03

Pos logit contributions
 bed+0.017
 cry+0.017
 smile+0.017
 tell+0.016
 call+0.016
Neg logit contributions
P-0.028
ww-0.026
Now-0.025
bly-0.025
lish-0.024
Token better
Token ID1365
Feature activation+1.49e-03

Pos logit contributions
 bed+0.005
 smile+0.005
 cry+0.005
 tell+0.005
 call+0.005
Neg logit contributions
P-0.010
ww-0.009
Now-0.009
bly-0.009
lish-0.009
Token and
Token ID290
Feature activation+1.23e-02

Pos logit contributions
 cry+0.002
 smile+0.002
 bed+0.002
 tell+0.002
 call+0.002
Neg logit contributions
P-0.004
ww-0.003
Now-0.003
bly-0.003
lish-0.003
Token went
Token ID1816
Feature activation+2.03e-02

Pos logit contributions
 cry+0.020
 smile+0.020
 question+0.020
 bed+0.020
 call+0.019
Neg logit contributions
P-0.028
ww-0.026
Now-0.025
bly-0.025
lish-0.024
Token back
Token ID736
Feature activation+5.32e-02

Pos logit contributions
 cry+0.032
 smile+0.029
 stomach+0.029
 weep+0.028
 question+0.028
Neg logit contributions
P-0.051
Now-0.045
bly-0.044
ww-0.044
Spl-0.043
Token to
Token ID284
Feature activation+9.89e-02

Pos logit contributions
 stomach+0.104
 cry+0.102
 eyes+0.099
 tongue+0.099
 smile+0.099
Neg logit contributions
P-0.101
B-0.083
Now-0.082
Spl-0.080
CR-0.078
Token playing
Token ID2712
Feature activation-3.20e-03

Pos logit contributions
 eye+0.172
ses+0.167
 stomach+0.164
 tum+0.162
 eyes+0.162
Neg logit contributions
Now-0.116
P-0.106
Spl-0.100
But-0.096
Soon-0.096
Token at
Token ID379
Feature activation+4.73e-02

Pos logit contributions
P+0.007
ww+0.006
Now+0.006
bly+0.006
lish+0.006
Neg logit contributions
 cry-0.006
 smile-0.006
 tell-0.006
 bed-0.006
 call-0.006
Token the
Token ID262
Feature activation-6.69e-03

Pos logit contributions
 eyes+0.085
 smile+0.084
 stomach+0.083
 cry+0.083
 question+0.083
Neg logit contributions
P-0.099
Now-0.098
Soon-0.084
bly-0.084
Spl-0.080
Token park
Token ID3952
Feature activation+2.36e-02

Pos logit contributions
P+0.018
ww+0.017
Now+0.017
bly+0.016
lish+0.016
Neg logit contributions
 smile-0.009
 cry-0.009
 tell-0.009
 bed-0.008
 call-0.008
Token.
Token ID13
Feature activation+5.13e-02

Pos logit contributions
 cry+0.034
 smile+0.032
 tell+0.032
 question+0.030
 call+0.030
Neg logit contributions
P-0.058
Now-0.054
ww-0.053
lish-0.053
bly-0.052
Token little
Token ID1310
Feature activation+2.99e-02

Pos logit contributions
 bed+0.033
 tell+0.033
 cry+0.033
 smile+0.032
 gaze+0.032
Neg logit contributions
P-0.044
ww-0.042
ha-0.038
lish-0.037
Now-0.037
Token boy
Token ID2933
Feature activation+3.45e-02

Pos logit contributions
 bed+0.057
 tell+0.054
 gaze+0.052
 stomach+0.052
 cry+0.052
Neg logit contributions
P-0.056
ww-0.055
lish-0.053
ha-0.048
Hey-0.048
Token named
Token ID3706
Feature activation+9.99e-03

Pos logit contributions
 bed+0.061
 stomach+0.056
 smile+0.055
 cry+0.055
 gaze+0.052
Neg logit contributions
P-0.077
ww-0.069
lish-0.068
ha-0.066
Hey-0.066
Token Tim
Token ID5045
Feature activation+3.30e-02

Pos logit contributions
 smile+0.013
 cry+0.012
 bed+0.012
 tell+0.012
 stomach+0.011
Neg logit contributions
P-0.027
ww-0.026
Now-0.025
bly-0.025
lish-0.025
Token heard
Token ID2982
Feature activation+1.86e-02

Pos logit contributions
 bed+0.058
 stomach+0.053
 cry+0.053
 smile+0.052
 tum+0.051
Neg logit contributions
P-0.075
Hey-0.065
ha-0.065
Now-0.065
lish-0.065
Token Kitty
Token ID21168
Feature activation+9.88e-02

Pos logit contributions
 smile+0.028
 stomach+0.027
 cry+0.026
 gaze+0.026
 tell+0.025
Neg logit contributions
P-0.045
ww-0.044
Now-0.043
bly-0.042
ha-0.041
Token complain
Token ID13121
Feature activation+7.82e-03

Pos logit contributions
ses+0.105
 imagination+0.102
 tum+0.102
 stomach+0.101
 fault+0.099
Neg logit contributions
lish-0.211
it-0.203
bly-0.200
sc-0.197
ing-0.194
Token.
Token ID13
Feature activation+2.73e-02

Pos logit contributions
 smile+0.009
 bed+0.009
 cry+0.009
 tell+0.009
 gaze+0.008
Neg logit contributions
P-0.022
lish-0.020
ww-0.020
Now-0.020
bly-0.020
Token He
Token ID679
Feature activation-2.84e-04

Pos logit contributions
 smile+0.044
 tell+0.042
 cry+0.042
 bed+0.042
 gaze+0.041
Neg logit contributions
P-0.062
ww-0.062
Now-0.059
O-0.056
lish-0.056
Token came
Token ID1625
Feature activation+1.13e-02

Pos logit contributions
P+0.001
ww+0.001
Now+0.001
bly+0.001
lish+0.001
Neg logit contributions
 bed-0.000
 smile-0.000
 cry-0.000
 tell-0.000
 stomach-0.000
Token to
Token ID284
Feature activation+4.09e-02

Pos logit contributions
 smile+0.019
 stomach+0.019
 bed+0.019
 cry+0.019
 tell+0.018
Neg logit contributions
P-0.026
ww-0.023
Now-0.023
bly-0.022
Soon-0.022
Token felt
Token ID2936
Feature activation-6.16e-03

Pos logit contributions
 cry+0.007
 bed+0.007
 smile+0.007
 tell+0.006
 call+0.006
Neg logit contributions
P-0.012
ww-0.011
Now-0.011
bly-0.011
lish-0.010
Token much
Token ID881
Feature activation+1.55e-03

Pos logit contributions
P+0.015
ww+0.014
Now+0.013
bly+0.013
lish+0.013
Neg logit contributions
 smile-0.010
 cry-0.010
 tell-0.010
 bed-0.010
 call-0.010
Token better
Token ID1365
Feature activation+1.82e-04

Pos logit contributions
 smile+0.003
 cry+0.003
 bed+0.003
 tell+0.003
 call+0.003
Neg logit contributions
P-0.003
ww-0.003
Now-0.003
bly-0.003
lish-0.003
Token and
Token ID290
Feature activation+1.32e-02

Pos logit contributions
 smile+0.000
 cry+0.000
 bed+0.000
 tell+0.000
 call+0.000
Neg logit contributions
P-0.000
ww-0.000
Now-0.000
bly-0.000
lish-0.000
Token went
Token ID1816
Feature activation+1.67e-02

Pos logit contributions
 cry+0.022
 question+0.022
 smile+0.022
 call+0.021
 bed+0.021
Neg logit contributions
P-0.030
ww-0.028
Now-0.028
bly-0.027
lish-0.026
Token to
Token ID284
Feature activation+9.86e-02

Pos logit contributions
 cry+0.026
 smile+0.025
 question+0.024
 stomach+0.024
 tell+0.024
Neg logit contributions
P-0.041
Now-0.037
ww-0.037
bly-0.036
Soon-0.035
Token sleep
Token ID3993
Feature activation+1.45e-02

Pos logit contributions
sed+0.140
ses+0.134
 mouths+0.128
 stomach+0.127
bi+0.127
Neg logit contributions
Now-0.161
Good-0.152
 my-0.149
Soon-0.149
CR-0.147
Token with
Token ID351
Feature activation+4.33e-02

Pos logit contributions
 cry+0.024
 smile+0.023
 tell+0.023
 question+0.023
 call+0.023
Neg logit contributions
P-0.033
bly-0.031
Now-0.031
lish-0.030
ww-0.030
Token a
Token ID257
Feature activation+4.63e-02

Pos logit contributions
 question+0.079
 gaze+0.075
 call+0.074
sing+0.073
 cry+0.073
Neg logit contributions
P-0.084
bly-0.077
Now-0.073
lish-0.072
A-0.071
Token smile
Token ID8212
Feature activation-5.20e-03

Pos logit contributions
ses+0.060
 tell+0.059
 gaze+0.059
 calls+0.058
 cry+0.057
Neg logit contributions
P-0.108
ww-0.103
br-0.097
ha-0.096
Happy-0.094
Token on
Token ID319
Feature activation+3.65e-02

Pos logit contributions
P+0.010
ww+0.009
Now+0.009
bly+0.009
lish+0.009
Neg logit contributions
 cry-0.011
 smile-0.011
 tell-0.011
 bed-0.011
 call-0.010
TokenD
Token ID35
Feature activation-2.60e-02

Pos logit contributions
bly+0.062
bridge+0.061
ww+0.060
sc+0.057
ha+0.057
Neg logit contributions
 tell-0.152
 cry-0.149
 stomach-0.148
 call-0.148
 search-0.147
Tokenucky
Token ID5309
Feature activation+1.51e-02

Pos logit contributions
P+0.053
ww+0.050
Now+0.050
bly+0.048
Soon+0.045
Neg logit contributions
 cry-0.050
 smile-0.050
 stomach-0.048
 tell-0.048
 bed-0.048
Token's
Token ID338
Feature activation+3.50e-02

Pos logit contributions
 bed+0.024
 smile+0.024
 cry+0.023
 tell+0.023
 gaze+0.023
Neg logit contributions
P-0.036
ww-0.032
Now-0.032
lish-0.031
ha-0.030
Token mom
Token ID1995
Feature activation+2.63e-02

Pos logit contributions
 tell+0.048
 stare+0.048
 gaze+0.047
 cry+0.047
 smile+0.046
Neg logit contributions
P-0.086
ww-0.077
lish-0.075
bly-0.073
w-0.070
Token heard
Token ID2982
Feature activation+1.87e-02

Pos logit contributions
 bed+0.045
 stomach+0.044
 gaze+0.043
 cry+0.043
 smile+0.043
Neg logit contributions
P-0.060
Now-0.053
ha-0.052
lish-0.051
ww-0.051
Token him
Token ID683
Feature activation+9.85e-02

Pos logit contributions
 smile+0.029
 cry+0.028
 question+0.026
 tell+0.026
 gaze+0.026
Neg logit contributions
P-0.047
ww-0.043
Now-0.043
lish-0.041
bly-0.041
Token cry
Token ID3960
Feature activation+3.63e-02

Pos logit contributions
sed+0.153
ses+0.152
 stomach+0.150
 tum+0.150
 fault+0.149
Neg logit contributions
lish-0.157
it-0.153
B-0.151
ing-0.148
Just-0.147
Token and
Token ID290
Feature activation+3.63e-02

Pos logit contributions
 gaze+0.062
 tell+0.062
 question+0.062
 smile+0.061
 bed+0.060
Neg logit contributions
lish-0.072
P-0.069
bly-0.067
ww-0.065
it-0.064
Token came
Token ID1625
Feature activation+3.32e-02

Pos logit contributions
 fault+0.069
 bed+0.068
 stomach+0.068
sing+0.065
ses+0.064
Neg logit contributions
P-0.071
ha-0.062
ww-0.060
lish-0.059
bly-0.059
Token running
Token ID2491
Feature activation-1.57e-03

Pos logit contributions
 stomach+0.057
 smile+0.053
 tum+0.052
 cry+0.052
 gaze+0.052
Neg logit contributions
P-0.076
Now-0.067
Spl-0.066
B-0.066
bly-0.065
Token.
Token ID13
Feature activation+2.83e-02

Pos logit contributions
P+0.004
ww+0.004
Now+0.004
bly+0.004
lish+0.004
Neg logit contributions
 smile-0.002
 cry-0.002
 tell-0.002
 bed-0.002
 stomach-0.002
Token sign
Token ID1051
Feature activation-1.85e-02

Pos logit contributions
 tell+0.047
 cry+0.046
 weeping+0.045
 smile+0.045
 bed+0.044
Neg logit contributions
ww-0.059
P-0.058
ha-0.052
Now-0.051
Soon-0.050
Token that
Token ID326
Feature activation+7.17e-03

Pos logit contributions
P+0.047
ww+0.044
Now+0.043
bly+0.042
lish+0.041
Neg logit contributions
 smile-0.028
 tell-0.028
 cry-0.028
 bed-0.027
 call-0.027
Token said
Token ID531
Feature activation-1.41e-02

Pos logit contributions
 smile+0.009
 cry+0.009
 bed+0.009
 tell+0.008
 stomach+0.008
Neg logit contributions
P-0.020
ww-0.019
Now-0.018
bly-0.018
ha-0.017
Token "
Token ID366
Feature activation-7.69e-02

Pos logit contributions
P+0.041
ww+0.039
Now+0.038
bly+0.037
lish+0.037
Neg logit contributions
 smile-0.016
 cry-0.016
 tell-0.016
 bed-0.015
 call-0.015
TokenRE
Token ID2200
Feature activation-4.76e-02

Pos logit contributions
P+0.107
bly+0.106
ww+0.102
lish+0.099
Now+0.088
Neg logit contributions
 cry-0.179
 call-0.179
 search-0.175
 tell-0.174
 smile-0.172
TokenVERS
Token ID28884
Feature activation-1.37e-01

Pos logit contributions
P+0.100
ww+0.092
bly+0.090
Now+0.090
lish+0.085
Neg logit contributions
 bed-0.090
 call-0.089
 cry-0.089
 smile-0.088
 tell-0.088
TokenE
Token ID36
Feature activation-8.48e-02

Pos logit contributions
ww+0.189
iest+0.168
Now+0.163
P+0.155
lish+0.155
Neg logit contributions
 cry-0.289
 crying-0.285
 weep-0.274
 weeping-0.267
 call-0.266
Token".
Token ID1911
Feature activation-4.59e-03

Pos logit contributions
P+0.162
Now+0.158
Soon+0.156
Sorry+0.155
Looks+0.149
Neg logit contributions
 stomach-0.141
 bed-0.137
 smile-0.134
 tell-0.132
 mom-0.132
Token L
Token ID406
Feature activation-3.43e-02

Pos logit contributions
P+0.013
ww+0.012
Now+0.012
bly+0.012
lish+0.011
Neg logit contributions
 smile-0.006
 cry-0.006
 tell-0.005
 bed-0.005
 call-0.005
Tokenila
Token ID10102
Feature activation-3.50e-03

Pos logit contributions
P+0.062
ww+0.059
Now+0.057
bly+0.056
lish+0.056
Neg logit contributions
 smile-0.073
 cry-0.073
 tell-0.073
 bed-0.070
 call-0.070
Token and
Token ID290
Feature activation+2.37e-02

Pos logit contributions
P+0.009
ww+0.009
Now+0.009
bly+0.008
lish+0.008
Neg logit contributions
 smile-0.005
 cry-0.005
 bed-0.005
 tell-0.005
 call-0.005
Token at
Token ID379
Feature activation+2.15e-02

Pos logit contributions
P+0.038
ww+0.036
Now+0.035
bly+0.034
lish+0.034
Neg logit contributions
 smile-0.014
 cry-0.014
 tell-0.013
 bed-0.013
 call-0.013
Token the
Token ID262
Feature activation-2.44e-02

Pos logit contributions
 cry+0.035
 smile+0.035
 stomach+0.034
 tell+0.033
 gaze+0.032
Neg logit contributions
P-0.050
Now-0.047
bly-0.045
ww-0.045
Soon-0.044
Token starting
Token ID3599
Feature activation-1.11e-02

Pos logit contributions
P+0.065
ww+0.061
bly+0.060
Now+0.059
lish+0.058
Neg logit contributions
 bed-0.035
 smile-0.035
 call-0.034
 tell-0.034
 cry-0.034
Token line
Token ID1627
Feature activation-3.76e-02

Pos logit contributions
P+0.028
ww+0.026
Now+0.026
bly+0.025
lish+0.025
Neg logit contributions
 smile-0.017
 cry-0.017
 tell-0.016
 bed-0.016
 call-0.016
Token.
Token ID13
Feature activation+5.87e-03

Pos logit contributions
P+0.123
ww+0.120
lish+0.113
Now+0.113
bly+0.113
Neg logit contributions
 smile-0.031
 bed-0.030
 tell-0.029
 call-0.028
 cry-0.027
Token "
Token ID366
Feature activation-1.35e-01

Pos logit contributions
 cry+0.009
 smile+0.008
 tell+0.008
 call+0.008
 weep+0.008
Neg logit contributions
P-0.015
Now-0.014
ww-0.014
bly-0.014
Soon-0.013
TokenReady
Token ID35474
Feature activation-3.22e-02

Pos logit contributions
lish+0.263
led+0.238
izing+0.233
bly+0.229
ww+0.223
Neg logit contributions
 play-0.247
 visit-0.240
 gaze-0.239
 search-0.237
 call-0.236
Token,
Token ID11
Feature activation-5.87e-02

Pos logit contributions
P+0.088
ww+0.082
Now+0.080
bly+0.079
lish+0.078
Neg logit contributions
 smile-0.046
 tell-0.045
 cry-0.044
 bed-0.044
 call-0.043
Token set
Token ID900
Feature activation-4.91e-02

Pos logit contributions
P+0.181
lish+0.178
bly+0.172
ww+0.168
Now+0.164
Neg logit contributions
 tell-0.067
 call-0.063
 smile-0.063
 find-0.061
 bed-0.060
Token,
Token ID11
Feature activation-6.98e-02

Pos logit contributions
P+0.161
Now+0.153
lish+0.149
bly+0.149
ww+0.146
Neg logit contributions
 tell-0.043
 bed-0.042
 smile-0.042
 call-0.039
 cry-0.037
Token go
Token ID467
Feature activation-7.28e-02

Pos logit contributions
P+0.238
lish+0.235
bly+0.226
ww+0.221
dd+0.220
Neg logit contributions
 tell-0.060
 smile-0.055
 call-0.055
 find-0.053
 bed-0.052
Token.
Token ID13
Feature activation+2.54e-03

Pos logit contributions
P+0.008
ww+0.008
Now+0.007
bly+0.007
lish+0.007
Neg logit contributions
 smile-0.003
 cry-0.003
 tell-0.003
 bed-0.003
 call-0.003
Token The
Token ID383
Feature activation-7.21e-03

Pos logit contributions
 smile+0.003
 cry+0.003
 tell+0.003
 bed+0.003
 call+0.003
Neg logit contributions
P-0.007
ww-0.007
Now-0.007
bly-0.007
lish-0.006
Token vet
Token ID20202
Feature activation-5.02e-03

Pos logit contributions
P+0.019
ww+0.018
Now+0.018
bly+0.017
lish+0.017
Neg logit contributions
 smile-0.010
 cry-0.010
 tell-0.010
 bed-0.010
 call-0.009
Token showed
Token ID3751
Feature activation-1.37e-02

Pos logit contributions
P+0.013
ww+0.012
Now+0.012
bly+0.012
lish+0.012
Neg logit contributions
 smile-0.007
 cry-0.007
 bed-0.007
 tell-0.007
 call-0.007
Token Wh
Token ID854
Feature activation-3.48e-02

Pos logit contributions
P+0.038
ww+0.036
Now+0.035
bly+0.035
lish+0.034
Neg logit contributions
 smile-0.017
 cry-0.017
 tell-0.017
 bed-0.017
 call-0.016
Tokenisk
Token ID1984
Feature activation-1.32e-01

Pos logit contributions
P+0.104
ww+0.100
bly+0.096
Now+0.095
Spl+0.091
Neg logit contributions
 cry-0.035
 call-0.034
 crying-0.034
 tell-0.033
 smile-0.033
Tokeners
Token ID364
Feature activation-4.27e-03

Pos logit contributions
ww+0.215
P+0.200
Sorry+0.185
lish+0.184
Uh+0.173
Neg logit contributions
 find-0.265
 feel-0.258
 know-0.251
 crying-0.251
 tell-0.249
Token the
Token ID262
Feature activation+1.17e-02

Pos logit contributions
P+0.011
ww+0.010
Now+0.010
bly+0.010
lish+0.010
Neg logit contributions
 cry-0.006
 smile-0.006
 tell-0.006
 bed-0.006
 call-0.006
Token x
Token ID2124
Feature activation-1.21e-02

Pos logit contributions
 smile+0.018
 tell+0.017
 cry+0.017
 bed+0.017
 gaze+0.016
Neg logit contributions
P-0.028
ww-0.027
Now-0.026
bly-0.025
lish-0.025
Token-
Token ID12
Feature activation-7.50e-02

Pos logit contributions
P+0.020
ww+0.018
Now+0.017
bly+0.017
lish+0.016
Neg logit contributions
 smile-0.029
 cry-0.029
 tell-0.029
 bed-0.028
 call-0.028
Tokenray
Token ID2433
Feature activation-3.81e-03

Pos logit contributions
P+0.171
ww+0.163
bly+0.159
Now+0.153
lish+0.151
Neg logit contributions
 call-0.116
 cry-0.113
 tell-0.113
 crying-0.110
 question-0.110
Token and
Token ID290
Feature activation-7.65e-03

Pos logit contributions
 cry+0.055
 tell+0.052
 bed+0.050
 weep+0.049
 smile+0.049
Neg logit contributions
P-0.087
Now-0.083
ww-0.083
sc-0.082
bly-0.081
Token John
Token ID1757
Feature activation+8.46e-03

Pos logit contributions
P+0.019
ww+0.018
Now+0.018
bly+0.017
lish+0.017
Neg logit contributions
 smile-0.012
 cry-0.012
 tell-0.011
 bed-0.011
 call-0.011
Token chased
Token ID26172
Feature activation-2.73e-02

Pos logit contributions
 cry+0.013
 bed+0.013
 smile+0.013
 tell+0.013
 call+0.012
Neg logit contributions
P-0.020
ww-0.019
Now-0.018
bly-0.018
lish-0.018
Token Jenny
Token ID28437
Feature activation+1.28e-02

Pos logit contributions
P+0.073
ww+0.070
bly+0.069
Now+0.068
lish+0.065
Neg logit contributions
 smile-0.039
 cry-0.039
 tell-0.038
 bed-0.036
 call-0.035
Token.
Token ID13
Feature activation+1.96e-02

Pos logit contributions
 tell+0.019
 smile+0.019
 cry+0.019
 call+0.019
 bed+0.018
Neg logit contributions
P-0.031
ww-0.029
bly-0.028
Now-0.028
lish-0.027
Token "
Token ID366
Feature activation-1.32e-01

Pos logit contributions
 cry+0.030
 tell+0.030
 smile+0.029
 call+0.028
 weep+0.028
Neg logit contributions
P-0.046
Now-0.046
ww-0.043
Soon-0.043
bly-0.041
TokenYou
Token ID1639
Feature activation-2.22e-02

Pos logit contributions
bly+0.218
lish+0.217
ier+0.208
iest+0.205
izing+0.205
Neg logit contributions
 gaze-0.236
 play-0.235
 smile-0.230
 -0.229
 search-0.225
Token can
Token ID460
Feature activation-4.24e-03

Pos logit contributions
P+0.061
ww+0.057
Now+0.056
bly+0.056
lish+0.055
Neg logit contributions
 smile-0.031
 cry-0.030
 tell-0.030
 call-0.029
 bed-0.028
Token't
Token ID470
Feature activation-6.33e-03

Pos logit contributions
P+0.012
ww+0.011
Now+0.011
bly+0.011
lish+0.011
Neg logit contributions
 smile-0.005
 cry-0.005
 bed-0.005
 tell-0.005
 call-0.005
Token catch
Token ID4929
Feature activation-4.00e-02

Pos logit contributions
P+0.019
ww+0.018
Now+0.018
bly+0.017
lish+0.017
Neg logit contributions
 smile-0.007
 cry-0.006
 bed-0.006
 tell-0.006
 call-0.006
Token me
Token ID502
Feature activation-3.05e-02

Pos logit contributions
P+0.097
Now+0.092
ww+0.091
bly+0.090
Sorry+0.085
Neg logit contributions
 cry-0.066
 tell-0.064
 smile-0.063
 bed-0.062
 call-0.060
Token reached
Token ID4251
Feature activation-1.19e-02

Pos logit contributions
P+0.095
ww+0.089
bly+0.086
Now+0.085
lish+0.083
Neg logit contributions
 smile-0.050
 cry-0.049
 call-0.048
 tell-0.048
 bed-0.046
Token its
Token ID663
Feature activation+2.13e-02

Pos logit contributions
P+0.033
ww+0.031
Now+0.030
bly+0.030
lish+0.029
Neg logit contributions
 smile-0.016
 cry-0.016
 tell-0.015
 bed-0.015
 call-0.015
Token top
Token ID1353
Feature activation-1.34e-02

Pos logit contributions
 cry+0.030
 tell+0.030
 smile+0.030
 weeping+0.029
 crying+0.029
Neg logit contributions
P-0.051
ww-0.049
Now-0.047
bly-0.046
Soon-0.046
Token speed
Token ID2866
Feature activation-5.59e-02

Pos logit contributions
P+0.037
ww+0.035
Now+0.034
bly+0.034
lish+0.033
Neg logit contributions
 smile-0.017
 cry-0.017
 tell-0.017
 bed-0.016
 call-0.016
Token!
Token ID0
Feature activation+5.07e-03

Pos logit contributions
P+0.175
ww+0.164
Now+0.159
bly+0.154
lish+0.150
Neg logit contributions
 smile-0.059
 cry-0.054
 call-0.052
 bed-0.051
 crying-0.051
Token "
Token ID366
Feature activation-1.31e-01

Pos logit contributions
 cry+0.006
 smile+0.006
 tell+0.006
 call+0.005
 bed+0.005
Neg logit contributions
P-0.014
Now-0.014
ww-0.014
bly-0.013
Soon-0.013
TokenWhe
Token ID10842
Feature activation-3.38e-02

Pos logit contributions
lish+0.211
bly+0.193
led+0.182
iest+0.181
ier+0.181
Neg logit contributions
 gaze-0.250
 smile-0.247
 -0.238
 stare-0.233
 tell-0.232
Tokene
Token ID68
Feature activation-6.09e-02

Pos logit contributions
P+0.062
ww+0.057
Now+0.056
lish+0.054
bly+0.054
Neg logit contributions
 smile-0.073
 cry-0.071
 tell-0.071
 bed-0.071
 stomach-0.068
Token,
Token ID11
Feature activation-4.41e-02

Pos logit contributions
P+0.144
Now+0.129
bly+0.125
lish+0.124
ww+0.121
Neg logit contributions
 smile-0.113
 cry-0.110
 bed-0.102
 tell-0.102
 feel-0.100
Token this
Token ID428
Feature activation-4.29e-02

Pos logit contributions
P+0.120
Now+0.106
ww+0.105
bly+0.105
lish+0.104
Neg logit contributions
 smile-0.071
 cry-0.069
 tell-0.068
 bed-0.066
 call-0.064
Token is
Token ID318
Feature activation-4.33e-02

Pos logit contributions
P+0.118
bly+0.111
lish+0.108
Now+0.107
ww+0.104
Neg logit contributions
 smile-0.062
 cry-0.059
 tell-0.057
 bed-0.055
 feel-0.054
Token
Token ID198
Feature activation+5.91e-03

Pos logit contributions
 cry+0.021
 tell+0.020
 smile+0.020
 call+0.019
 weep+0.019
Neg logit contributions
P-0.036
Now-0.032
ww-0.032
bly-0.031
lish-0.031
Token
Token ID198
Feature activation-1.05e-01

Pos logit contributions
 cry+0.007
 smile+0.007
 tell+0.007
 bed+0.006
 call+0.006
Neg logit contributions
P-0.017
Now-0.016
ww-0.016
bly-0.015
Soon-0.015
Token"
Token ID1
Feature activation-8.61e-02

Pos logit contributions
iest+0.214
rob+0.214
ww+0.205
bly+0.202
w+0.200
Neg logit contributions
 mom-0.162
-0.158
 find-0.156
 -0.148
 play-0.147
TokenÂ
Token ID5523
Feature activation-1.18e-02

Pos logit contributions
bly+0.141
lish+0.138
dd+0.137
ww+0.130
wy+0.129
Neg logit contributions
 smile-0.183
 gaze-0.176
 call-0.171
 question-0.170
 play-0.170
Token
Token ID126
Feature activation-3.19e-02

Pos logit contributions
P+0.028
ww+0.026
Now+0.025
bly+0.025
lish+0.024
Neg logit contributions
 smile-0.021
 cry-0.020
 tell-0.020
 bed-0.020
 call-0.019
Token
Token ID94
Feature activation-1.30e-01

Pos logit contributions
P+0.068
ww+0.063
bly+0.061
lish+0.058
Now+0.058
Neg logit contributions
 smile-0.062
 cry-0.062
 tell-0.062
 call-0.060
 bed-0.060
TokenM
Token ID44
Feature activation-4.41e-02

Pos logit contributions
ww+0.197
lish+0.194
bly+0.190
iest+0.183
dd+0.160
Neg logit contributions
 tell-0.236
 cry-0.235
 call-0.234
 play-0.231
 sleep-0.227
Tokenira
Token ID8704
Feature activation+2.11e-04

Pos logit contributions
P+0.087
Now+0.086
bly+0.085
sc+0.080
ww+0.078
Neg logit contributions
 stomach-0.083
 smile-0.083
 question-0.083
 tell-0.082
 eyes-0.082
Token mi
Token ID21504
Feature activation-3.33e-02

Pos logit contributions
 cry+0.000
 smile+0.000
 bed+0.000
 tell+0.000
 call+0.000
Neg logit contributions
P-0.001
ww-0.001
Now-0.001
bly-0.001
lish-0.001
Token mu
Token ID38779
Feature activation-6.64e-02

Pos logit contributions
P+0.083
ww+0.076
Now+0.075
bly+0.073
lish+0.072
Neg logit contributions
 smile-0.055
 tell-0.054
 cry-0.054
 call-0.052
 bed-0.051
TokenÃ
Token ID5746
Feature activation-3.12e-02

Pos logit contributions
P+0.117
bly+0.115
Now+0.109
dd+0.108
ww+0.105
Neg logit contributions
 call-0.145
 smile-0.140
 tell-0.139
 question-0.133
 hug-0.133
Token and
Token ID290
Feature activation+2.98e-02

Pos logit contributions
 cry+0.023
 smile+0.023
 tell+0.023
 stomach+0.022
 question+0.022
Neg logit contributions
P-0.043
ww-0.041
Now-0.040
bly-0.039
sc-0.039
Token look
Token ID804
Feature activation-1.06e-02

Pos logit contributions
 weeping+0.034
 question+0.032
 crying+0.031
 stomach+0.031
 bed+0.031
Neg logit contributions
P-0.077
Now-0.074
ww-0.072
Spl-0.070
bly-0.070
Token around
Token ID1088
Feature activation+1.94e-02

Pos logit contributions
P+0.025
ww+0.024
Now+0.023
bly+0.023
lish+0.022
Neg logit contributions
 smile-0.018
 cry-0.018
 tell-0.017
 bed-0.017
 call-0.017
Token.
Token ID13
Feature activation+4.42e-03

Pos logit contributions
 cry+0.025
 tell+0.024
 stomach+0.024
 call+0.024
 question+0.023
Neg logit contributions
P-0.049
Now-0.046
lish-0.046
bly-0.046
ww-0.044
Token
Token ID198
Feature activation-1.74e-02

Pos logit contributions
 cry+0.006
 smile+0.005
 tell+0.005
 call+0.005
 weep+0.005
Neg logit contributions
P-0.012
Now-0.012
ww-0.011
Soon-0.011
bly-0.011
Token
Token ID198
Feature activation-1.30e-01

Pos logit contributions
P+0.056
ww+0.054
bly+0.052
Now+0.051
lish+0.051
Neg logit contributions
 smile-0.014
 cry-0.014
 tell-0.014
 bed-0.014
 call-0.013
Token"
Token ID1
Feature activation-7.97e-02

Pos logit contributions
iest+0.263
rob+0.260
isher+0.258
bly+0.250
ka+0.247
Neg logit contributions
-0.181
 -0.175
 play-0.168
 find-0.165
 mom-0.163
TokenWow
Token ID22017
Feature activation-2.22e-02

Pos logit contributions
ww+0.126
bly+0.125
lish+0.123
ross+0.119
dd+0.116
Neg logit contributions
 smile-0.157
 gaze-0.155
 stare-0.153
 call-0.152
 tell-0.150
Token,
Token ID11
Feature activation-1.84e-02

Pos logit contributions
P+0.067
Now+0.063
ww+0.063
bly+0.062
lish+0.060
Neg logit contributions
 smile-0.024
 cry-0.023
 tell-0.023
 bed-0.022
 call-0.021
Token we
Token ID356
Feature activation-1.68e-02

Pos logit contributions
P+0.049
Now+0.045
ww+0.045
bly+0.045
lish+0.044
Neg logit contributions
 cry-0.026
 smile-0.026
 tell-0.026
 bed-0.025
 call-0.024
Token are
Token ID389
Feature activation+2.90e-03

Pos logit contributions
P+0.044
ww+0.042
bly+0.040
Now+0.040
lish+0.039
Neg logit contributions
 smile-0.024
 cry-0.024
 tell-0.024
 bed-0.023
 call-0.023
Token for
Token ID329
Feature activation+5.52e-05

Pos logit contributions
P+0.008
ww+0.007
Now+0.007
bly+0.007
lish+0.007
Neg logit contributions
 smile-0.006
 cry-0.005
 tell-0.005
 bed-0.005
 call-0.005
Token playing
Token ID2712
Feature activation-5.87e-02

Pos logit contributions
 smile+0.000
 cry+0.000
 tell+0.000
 bed+0.000
 call+0.000
Neg logit contributions
P-0.000
ww-0.000
Now-0.000
bly-0.000
lish-0.000
Token a
Token ID257
Feature activation-4.37e-02

Pos logit contributions
ww+0.163
P+0.159
ha+0.144
Now+0.144
lish+0.143
Neg logit contributions
 bed-0.081
 smile-0.075
 tell-0.075
 sleep-0.072
 cry-0.071
Token game
Token ID983
Feature activation-3.85e-02

Pos logit contributions
P+0.104
bly+0.100
Now+0.099
ww+0.096
lish+0.095
Neg logit contributions
 bed-0.073
 smile-0.071
 tell-0.070
 cry-0.069
 call-0.069
Token called
Token ID1444
Feature activation-5.80e-02

Pos logit contributions
P+0.117
ww+0.113
Now+0.109
bly+0.108
lish+0.107
Neg logit contributions
 bed-0.039
 tell-0.039
 smile-0.038
 call-0.037
 cry-0.036
Token '
Token ID705
Feature activation-1.28e-01

Pos logit contributions
P+0.165
ww+0.155
Now+0.154
bly+0.149
lish+0.149
Neg logit contributions
 bed-0.071
 call-0.069
 tell-0.068
 smile-0.068
 cry-0.066
Tokenskip
Token ID48267
Feature activation-8.21e-02

Pos logit contributions
bly+0.154
ww+0.149
lish+0.144
P+0.135
ross+0.128
Neg logit contributions
 smile-0.288
 search-0.287
 tell-0.283
 call-0.282
 question-0.280
Token rope
Token ID17182
Feature activation-7.67e-02

Pos logit contributions
P+0.166
Now+0.158
bly+0.154
ww+0.150
Soon+0.149
Neg logit contributions
 smile-0.162
 cry-0.149
 tell-0.148
 bed-0.148
 question-0.145
Token'.
Token ID4458
Feature activation-2.46e-02

Pos logit contributions
P+0.189
ww+0.175
Now+0.173
lish+0.167
bly+0.166
Neg logit contributions
 smile-0.125
 cry-0.119
 tell-0.119
 call-0.117
 bed-0.117
Token Both
Token ID5747
Feature activation-1.04e-02

Pos logit contributions
P+0.070
ww+0.067
bly+0.064
Now+0.064
lish+0.063
Neg logit contributions
 smile-0.029
 cry-0.029
 tell-0.028
 bed-0.028
 call-0.027
Token Sus
Token ID8932
Feature activation-3.07e-02

Pos logit contributions
P+0.028
ww+0.026
Now+0.026
bly+0.025
lish+0.025
Neg logit contributions
 smile-0.014
 cry-0.014
 tell-0.014
 bed-0.014
 call-0.013
Token and
Token ID290
Feature activation+2.09e-02

Pos logit contributions
 cry+0.060
 stomach+0.059
 question+0.058
 tell+0.057
 bed+0.056
Neg logit contributions
P-0.084
bly-0.080
ww-0.078
sc-0.076
lish-0.076
Token the
Token ID262
Feature activation-6.26e-03

Pos logit contributions
 cry+0.032
 stomach+0.030
 smile+0.030
 bed+0.030
 tell+0.030
Neg logit contributions
P-0.050
ww-0.046
Now-0.045
bly-0.043
lish-0.043
Token flowers
Token ID12734
Feature activation+4.18e-02

Pos logit contributions
P+0.015
ww+0.014
Now+0.013
bly+0.013
lish+0.013
Neg logit contributions
 smile-0.011
 cry-0.011
 tell-0.010
 bed-0.010
 call-0.010
Token.
Token ID13
Feature activation-8.11e-04

Pos logit contributions
 cry+0.062
 question+0.058
 tell+0.056
 stomach+0.056
 weep+0.055
Neg logit contributions
P-0.098
bly-0.095
Now-0.091
ww-0.089
sc-0.088
Token
Token ID198
Feature activation-1.45e-02

Pos logit contributions
P+0.002
ww+0.002
Now+0.002
bly+0.002
lish+0.002
Neg logit contributions
 cry-0.001
 smile-0.001
 tell-0.001
 bed-0.001
 call-0.001
Token
Token ID198
Feature activation-1.28e-01

Pos logit contributions
P+0.046
ww+0.044
Now+0.042
bly+0.042
lish+0.042
Neg logit contributions
 smile-0.013
 cry-0.012
 tell-0.012
 bed-0.012
 call-0.011
Token"
Token ID1
Feature activation-7.81e-02

Pos logit contributions
iest+0.247
bly+0.243
ymes+0.237
w+0.235
ww+0.232
Neg logit contributions
-0.187
 -0.183
 find-0.178
 play-0.177
 mom-0.169
TokenYou
Token ID1639
Feature activation-1.21e-02

Pos logit contributions
bly+0.139
lish+0.134
ww+0.130
w+0.126
dd+0.124
Neg logit contributions
 smile-0.152
 gaze-0.149
 tell-0.144
 call-0.143
 stare-0.142
Token are
Token ID389
Feature activation-1.54e-02

Pos logit contributions
P+0.037
ww+0.035
Now+0.034
bly+0.034
lish+0.033
Neg logit contributions
 smile-0.013
 cry-0.013
 tell-0.013
 bed-0.012
 call-0.012
Token a
Token ID257
Feature activation-3.37e-02

Pos logit contributions
P+0.038
ww+0.036
Now+0.035
bly+0.035
lish+0.034
Neg logit contributions
 smile-0.024
 tell-0.024
 cry-0.024
 bed-0.024
 call-0.023
Token great
Token ID1049
Feature activation+2.60e-03

Pos logit contributions
P+0.081
bly+0.077
Now+0.077
ww+0.076
lish+0.075
Neg logit contributions
 smile-0.056
 tell-0.053
 cry-0.052
 call-0.052
 bed-0.051
Token half
Token ID2063
Feature activation-2.14e-02

Pos logit contributions
P+0.058
ww+0.054
bly+0.054
Now+0.053
lish+0.050
Neg logit contributions
 smile-0.043
 cry-0.042
 tell-0.041
 bed-0.040
 stomach-0.039
Token for
Token ID329
Feature activation+3.68e-03

Pos logit contributions
P+0.055
ww+0.053
Now+0.051
bly+0.050
lish+0.049
Neg logit contributions
 smile-0.032
 cry-0.031
 tell-0.030
 bed-0.030
 call-0.029
Token himself
Token ID2241
Feature activation+3.47e-02

Pos logit contributions
 smile+0.006
 cry+0.006
 tell+0.006
 call+0.005
 bed+0.005
Neg logit contributions
P-0.009
ww-0.008
bly-0.008
Now-0.008
lish-0.008
Token.
Token ID13
Feature activation+7.70e-03

Pos logit contributions
 cry+0.045
 tell+0.044
 call+0.044
 gaze+0.043
 smile+0.043
Neg logit contributions
P-0.089
ww-0.082
bly-0.081
lish-0.081
Now-0.079
Token
Token ID198
Feature activation-1.03e-02

Pos logit contributions
 cry+0.010
 smile+0.010
 tell+0.010
 call+0.009
 weep+0.009
Neg logit contributions
P-0.022
Now-0.020
ww-0.020
bly-0.019
Soon-0.019
Token
Token ID198
Feature activation-1.28e-01

Pos logit contributions
P+0.032
ww+0.030
Now+0.030
bly+0.029
lish+0.029
Neg logit contributions
 smile-0.010
 cry-0.010
 tell-0.009
 bed-0.009
 call-0.009
TokenThey
Token ID2990
Feature activation+2.77e-02

Pos logit contributions
rob+0.266
iest+0.256
w+0.254
ier+0.248
ka+0.242
Neg logit contributions
 find-0.171
-0.164
 play-0.163
 ask-0.157
 -0.156
Token both
Token ID1111
Feature activation+3.45e-02

Pos logit contributions
 bed+0.040
 weeping+0.040
 crying+0.040
 cry+0.038
 stomach+0.037
Neg logit contributions
P-0.066
Now-0.061
ww-0.059
bly-0.059
Hey-0.057
Token take
Token ID1011
Feature activation-1.72e-02

Pos logit contributions
 bed+0.043
 weeping+0.040
 crying+0.039
 eyes+0.037
 stomach+0.036
Neg logit contributions
P-0.089
Now-0.084
ww-0.083
lish-0.081
bly-0.081
Token a
Token ID257
Feature activation+3.06e-02

Pos logit contributions
P+0.043
ww+0.040
Now+0.039
bly+0.039
lish+0.037
Neg logit contributions
 smile-0.028
 cry-0.027
 tell-0.027
 bed-0.026
 call-0.026
Token big
Token ID1263
Feature activation+2.31e-02

Pos logit contributions
 tell+0.039
 weeping+0.038
 call+0.038
 eyes+0.037
 cry+0.037
Neg logit contributions
P-0.081
ww-0.080
ha-0.074
sc-0.071
Soon-0.070
Token.
Token ID13
Feature activation+8.05e-03

Pos logit contributions
P+0.114
ww+0.111
bly+0.106
Now+0.104
lish+0.104
Neg logit contributions
 smile-0.024
 cry-0.022
 bed-0.021
 tell-0.020
 call-0.019
Token Whe
Token ID5792
Feature activation-1.66e-02

Pos logit contributions
 cry+0.012
 smile+0.012
 tell+0.012
 weep+0.011
 weeping+0.011
Neg logit contributions
P-0.021
Now-0.019
ww-0.019
Soon-0.019
Spl-0.018
Tokene
Token ID68
Feature activation-3.95e-02

Pos logit contributions
P+0.032
ww+0.030
Now+0.029
bly+0.028
lish+0.028
Neg logit contributions
 smile-0.035
 cry-0.035
 tell-0.034
 bed-0.034
 call-0.033
Token!
Token ID0
Feature activation+4.41e-03

Pos logit contributions
P+0.101
Now+0.093
ww+0.091
bly+0.091
lish+0.087
Neg logit contributions
 smile-0.063
 cry-0.061
 bed-0.059
 tell-0.057
 stomach-0.057
Token
Token ID198
Feature activation+1.27e-02

Pos logit contributions
 smile+0.005
 cry+0.005
 tell+0.005
 bed+0.005
 call+0.005
Neg logit contributions
P-0.012
ww-0.012
Now-0.012
bly-0.011
Soon-0.011
Token
Token ID198
Feature activation-1.27e-01

Pos logit contributions
 smile+0.016
 cry+0.016
 tell+0.015
 stomach+0.015
 weeping+0.015
Neg logit contributions
P-0.035
Now-0.034
ww-0.033
Soon-0.032
bly-0.031
Token"
Token ID1
Feature activation-8.38e-02

Pos logit contributions
rob+0.252
w+0.239
iest+0.237
ymes+0.236
ka+0.233
Neg logit contributions
 find-0.196
 play-0.180
 ask-0.178
 call-0.174
 be-0.173
TokenWow
Token ID22017
Feature activation-2.32e-02

Pos logit contributions
bly+0.151
lish+0.142
izing+0.132
ww+0.131
bling+0.130
Neg logit contributions
 gaze-0.161
 smile-0.161
 stare-0.156
 play-0.152
 -0.151
Token,
Token ID11
Feature activation-1.57e-02

Pos logit contributions
P+0.068
Now+0.064
ww+0.063
bly+0.062
lish+0.060
Neg logit contributions
 smile-0.028
 cry-0.027
 tell-0.026
 bed-0.026
 call-0.024
Token Tom
Token ID4186
Feature activation-6.14e-03

Pos logit contributions
P+0.044
ww+0.041
Now+0.041
bly+0.040
lish+0.039
Neg logit contributions
 smile-0.021
 cry-0.020
 tell-0.020
 bed-0.020
 call-0.019
Token,
Token ID11
Feature activation-3.13e-02

Pos logit contributions
P+0.015
ww+0.014
Now+0.013
bly+0.013
lish+0.013
Neg logit contributions
 smile-0.010
 cry-0.010
 tell-0.010
 bed-0.010
 call-0.009
Token in
Token ID287
Feature activation+3.69e-04

Pos logit contributions
P+0.082
ww+0.077
Now+0.073
bly+0.072
lish+0.070
Neg logit contributions
 smile-0.049
 cry-0.047
 tell-0.046
 bed-0.046
 call-0.044
Token laughter
Token ID20263
Feature activation-2.10e-02

Pos logit contributions
 smile+0.001
 cry+0.001
 tell+0.001
 bed+0.000
 call+0.000
Neg logit contributions
P-0.001
ww-0.001
Now-0.001
bly-0.001
lish-0.001
Token and
Token ID290
Feature activation-2.46e-03

Pos logit contributions
P+0.061
ww+0.057
Now+0.056
bly+0.055
lish+0.053
Neg logit contributions
 smile-0.026
 cry-0.024
 bed-0.024
 tell-0.024
 call-0.023
Token said
Token ID531
Feature activation-3.36e-02

Pos logit contributions
P+0.006
ww+0.006
Now+0.006
bly+0.006
lish+0.005
Neg logit contributions
 smile-0.004
 cry-0.004
 tell-0.004
 bed-0.004
 call-0.003
Token,
Token ID11
Feature activation-2.79e-02

Pos logit contributions
P+0.104
ww+0.098
Now+0.097
bly+0.093
lish+0.092
Neg logit contributions
 smile-0.035
 cry-0.032
 tell-0.032
 bed-0.030
 crying-0.029
Token '
Token ID705
Feature activation-1.27e-01

Pos logit contributions
P+0.079
ww+0.075
Now+0.071
bly+0.071
lish+0.070
Neg logit contributions
 smile-0.036
 cry-0.034
 tell-0.034
 bed-0.033
 call-0.033
TokenThese
Token ID4711
Feature activation-4.06e-02

Pos logit contributions
bly+0.159
ross+0.150
bling+0.147
lish+0.146
ww+0.142
Neg logit contributions
 smile-0.278
 tell-0.263
 -0.261
 feel-0.261
 say-0.258
Token new
Token ID649
Feature activation-3.53e-02

Pos logit contributions
P+0.120
ww+0.113
Now+0.113
bly+0.111
lish+0.109
Neg logit contributions
 smile-0.048
 cry-0.046
 tell-0.046
 bed-0.043
 call-0.043
Token clothes
Token ID8242
Feature activation-5.69e-02

Pos logit contributions
P+0.090
ww+0.084
Now+0.083
bly+0.083
lish+0.081
Neg logit contributions
 smile-0.057
 tell-0.054
 cry-0.053
 bed-0.053
 call-0.052
Token look
Token ID804
Feature activation-8.59e-02

Pos logit contributions
P+0.158
ww+0.144
Now+0.142
lish+0.139
bly+0.137
Neg logit contributions
 smile-0.082
 tell-0.077
 cry-0.076
 call-0.073
 bed-0.072
Token incredible
Token ID8082
Feature activation-7.35e-02

Pos logit contributions
P+0.227
Now+0.209
ww+0.202
Spl+0.198
CR+0.192
Neg logit contributions
 smile-0.137
 tell-0.130
 feel-0.123
 call-0.123
 cry-0.121
Token in
Token ID287
Feature activation-2.10e-02

Pos logit contributions
P+0.070
ww+0.066
Now+0.065
bly+0.064
lish+0.063
Neg logit contributions
 smile-0.022
 cry-0.020
 tell-0.020
 bed-0.020
 call-0.019
Token the
Token ID262
Feature activation-2.63e-02

Pos logit contributions
P+0.059
ww+0.057
lish+0.054
bly+0.054
Now+0.054
Neg logit contributions
 bed-0.027
 smile-0.027
 cry-0.026
 tell-0.026
 call-0.024
Token desert
Token ID10326
Feature activation-8.83e-03

Pos logit contributions
P+0.067
ww+0.065
bly+0.065
Now+0.062
lish+0.061
Neg logit contributions
 smile-0.039
 bed-0.039
 cry-0.037
 stomach-0.037
 tell-0.037
Token.
Token ID13
Feature activation-1.40e-02

Pos logit contributions
P+0.027
ww+0.026
Now+0.025
bly+0.025
lish+0.025
Neg logit contributions
 smile-0.008
 cry-0.008
 tell-0.008
 bed-0.008
 call-0.008
Token
Token ID198
Feature activation-7.57e-03

Pos logit contributions
P+0.044
ww+0.042
bly+0.041
Now+0.041
lish+0.040
Neg logit contributions
 smile-0.013
 cry-0.013
 tell-0.012
 bed-0.012
 call-0.012
Token
Token ID198
Feature activation-1.27e-01

Pos logit contributions
P+0.023
ww+0.022
Now+0.022
bly+0.021
lish+0.021
Neg logit contributions
 smile-0.007
 cry-0.007
 tell-0.007
 bed-0.007
 call-0.007
Token"
Token ID1
Feature activation-8.01e-02

Pos logit contributions
iest+0.266
bly+0.256
w+0.250
ww+0.246
ka+0.238
Neg logit contributions
-0.181
 -0.175
 find-0.173
 mom-0.164
 see-0.158
TokenWow
Token ID22017
Feature activation-2.08e-02

Pos logit contributions
bly+0.142
lish+0.139
ww+0.134
ier+0.132
dd+0.132
Neg logit contributions
 gaze-0.149
 question-0.146
 call-0.142
 search-0.142
 stare-0.139
Token,
Token ID11
Feature activation-2.30e-02

Pos logit contributions
P+0.060
Now+0.056
ww+0.056
bly+0.055
lish+0.054
Neg logit contributions
 smile-0.025
 cry-0.024
 tell-0.024
 bed-0.024
 call-0.023
Token look
Token ID804
Feature activation-3.94e-02

Pos logit contributions
P+0.063
Now+0.058
bly+0.058
ww+0.057
lish+0.057
Neg logit contributions
 tell-0.032
 smile-0.032
 cry-0.032
 bed-0.031
 call-0.030
Token at
Token ID379
Feature activation-1.42e-02

Pos logit contributions
P+0.105
Now+0.102
bly+0.098
ww+0.096
Too+0.092
Neg logit contributions
 smile-0.057
 tell-0.056
 cry-0.054
 bed-0.051
 feel-0.050
Token like
Token ID588
Feature activation-2.17e-02

Pos logit contributions
P+0.044
ww+0.042
Now+0.040
bly+0.040
lish+0.038
Neg logit contributions
 smile-0.027
 cry-0.026
 tell-0.026
 bed-0.026
 call-0.025
Token a
Token ID257
Feature activation-7.77e-03

Pos logit contributions
P+0.055
ww+0.052
Now+0.050
bly+0.050
lish+0.049
Neg logit contributions
 smile-0.033
 bed-0.032
 cry-0.032
 tell-0.032
 call-0.031
Token clown
Token ID25573
Feature activation+7.05e-03

Pos logit contributions
P+0.019
ww+0.018
Now+0.017
bly+0.017
lish+0.016
Neg logit contributions
 smile-0.013
 cry-0.012
 tell-0.012
 bed-0.012
 call-0.012
Token.
Token ID13
Feature activation-5.09e-03

Pos logit contributions
 cry+0.010
 smile+0.009
 tell+0.009
 bed+0.009
 stomach+0.009
Neg logit contributions
P-0.019
ww-0.018
Now-0.017
bly-0.017
lish-0.017
Token
Token ID198
Feature activation-2.36e-04

Pos logit contributions
P+0.015
ww+0.014
Now+0.014
bly+0.014
lish+0.013
Neg logit contributions
 cry-0.006
 smile-0.006
 tell-0.005
 bed-0.005
 call-0.005
Token
Token ID198
Feature activation-1.27e-01

Pos logit contributions
P+0.001
ww+0.001
Now+0.001
bly+0.001
Soon+0.001
Neg logit contributions
 smile-0.000
 cry-0.000
 tell-0.000
 bed-0.000
 call-0.000
TokenThey
Token ID2990
Feature activation-3.65e-03

Pos logit contributions
iest+0.259
ka+0.258
ww+0.254
bly+0.244
ymes+0.244
Neg logit contributions
 find-0.173
-0.169
 -0.158
 play-0.156
 see-0.146
Token ran
Token ID4966
Feature activation-1.77e-02

Pos logit contributions
P+0.009
ww+0.009
Now+0.008
bly+0.008
lish+0.008
Neg logit contributions
 cry-0.005
 smile-0.005
 tell-0.005
 bed-0.005
 call-0.005
Token around
Token ID1088
Feature activation+2.11e-02

Pos logit contributions
P+0.048
ww+0.045
bly+0.043
Now+0.043
lish+0.042
Neg logit contributions
 smile-0.026
 cry-0.025
 bed-0.025
 tell-0.025
 call-0.024
Token the
Token ID262
Feature activation-1.55e-02

Pos logit contributions
 cry+0.035
 stomach+0.034
 smile+0.033
 tell+0.033
 call+0.032
Neg logit contributions
P-0.045
Now-0.044
bly-0.041
Soon-0.040
Spl-0.040
Token room
Token ID2119
Feature activation+2.90e-02

Pos logit contributions
P+0.050
ww+0.049
bly+0.048
Now+0.047
lish+0.046
Neg logit contributions
 smile-0.013
 bed-0.012
 cry-0.012
 tell-0.012
 call-0.011
Token does
Token ID857
Feature activation-1.34e-02

Pos logit contributions
P+0.055
ww+0.051
Now+0.050
bly+0.049
lish+0.048
Neg logit contributions
 smile-0.029
 cry-0.029
 tell-0.029
 call-0.028
 bed-0.027
Token.
Token ID13
Feature activation-3.90e-02

Pos logit contributions
P+0.036
ww+0.033
Now+0.033
bly+0.032
lish+0.032
Neg logit contributions
 smile-0.019
 cry-0.019
 tell-0.018
 bed-0.018
 call-0.018
Tokenâ
Token ID22940
Feature activation-5.63e-02

Pos logit contributions
P+0.100
ww+0.094
bly+0.091
lish+0.090
Now+0.089
Neg logit contributions
 smile-0.057
 tell-0.057
 bed-0.057
 cry-0.056
 call-0.056
Token
Token ID26391
Feature activation-1.50e-02

Pos logit contributions
P+0.115
bly+0.106
ww+0.105
lish+0.098
Soon+0.096
Neg logit contributions
 tell-0.107
 smile-0.106
 call-0.104
 cry-0.104
 say-0.101
Token
Token ID198
Feature activation-4.19e-02

Pos logit contributions
P+0.036
ww+0.034
bly+0.033
Now+0.033
lish+0.032
Neg logit contributions
 smile-0.024
 cry-0.024
 tell-0.024
 bed-0.024
 call-0.023
Token
Token ID198
Feature activation-1.26e-01

Pos logit contributions
P+0.139
ww+0.137
lish+0.136
bly+0.134
w+0.128
Neg logit contributions
 bed-0.024
 tell-0.022
 smile-0.022
 cry-0.021
 call-0.019
TokenThey
Token ID2990
Feature activation-1.79e-02

Pos logit contributions
ier+0.225
w+0.222
rier+0.217
iest+0.216
lish+0.212
Neg logit contributions
 -0.221
-0.207
 search-0.195
 tell-0.193
 know-0.186
Token kept
Token ID4030
Feature activation+5.32e-03

Pos logit contributions
P+0.045
ww+0.043
Now+0.042
bly+0.041
lish+0.041
Neg logit contributions
 smile-0.027
 cry-0.027
 tell-0.027
 bed-0.026
 call-0.025
Token walking
Token ID6155
Feature activation-4.40e-02

Pos logit contributions
 cry+0.007
 smile+0.007
 stomach+0.007
 tell+0.007
 bed+0.007
Neg logit contributions
P-0.014
ww-0.013
Now-0.013
bly-0.013
lish-0.012
Token until
Token ID1566
Feature activation-3.01e-02

Pos logit contributions
P+0.135
ww+0.131
bly+0.124
Now+0.122
lish+0.121
Neg logit contributions
 smile-0.052
 bed-0.048
 tell-0.046
 feel-0.044
 call-0.044
Token Jack
Token ID3619
Feature activation+4.35e-03

Pos logit contributions
P+0.081
ww+0.077
Now+0.076
bly+0.075
lish+0.074
Neg logit contributions
 bed-0.043
 smile-0.041
 cry-0.039
 tell-0.039
 call-0.038
Token to
Token ID284
Feature activation+3.65e-02

Pos logit contributions
ww+0.117
P+0.117
bly+0.110
Now+0.108
lish+0.108
Neg logit contributions
 smile-0.045
 tell-0.042
 bed-0.042
 call-0.040
 search-0.039
Token the
Token ID262
Feature activation-1.72e-02

Pos logit contributions
 stomach+0.070
 weeping+0.069
 throat+0.067
 cry+0.066
 smile+0.065
Neg logit contributions
P-0.066
Now-0.060
Soon-0.059
A-0.059
Hey-0.057
Token park
Token ID3952
Feature activation+2.80e-02

Pos logit contributions
P+0.048
ww+0.045
bly+0.044
Now+0.044
lish+0.043
Neg logit contributions
 smile-0.023
 cry-0.022
 bed-0.022
 tell-0.022
 call-0.021
Token.
Token ID13
Feature activation+3.06e-03

Pos logit contributions
 cry+0.040
 stomach+0.037
 weeping+0.036
 smile+0.036
 regret+0.035
Neg logit contributions
P-0.067
sc-0.064
Now-0.063
ww-0.061
it-0.060
Token
Token ID198
Feature activation-1.06e-02

Pos logit contributions
 cry+0.004
 smile+0.004
 tell+0.004
 call+0.004
 bed+0.004
Neg logit contributions
P-0.008
ww-0.007
Now-0.007
bly-0.007
Soon-0.007
Token
Token ID198
Feature activation-1.26e-01

Pos logit contributions
P+0.033
ww+0.031
Now+0.031
bly+0.030
lish+0.030
Neg logit contributions
 smile-0.010
 cry-0.010
 tell-0.010
 bed-0.009
 call-0.009
TokenAt
Token ID2953
Feature activation+1.26e-02

Pos logit contributions
iest+0.255
rob+0.244
ka+0.243
w+0.243
ww+0.242
Neg logit contributions
 find-0.204
-0.191
 -0.181
 play-0.175
 see-0.169
Token the
Token ID262
Feature activation-1.41e-02

Pos logit contributions
 cry+0.020
 smile+0.019
 stomach+0.019
 tell+0.019
 call+0.019
Neg logit contributions
P-0.030
ww-0.028
Now-0.028
Soon-0.027
bly-0.027
Token park
Token ID3952
Feature activation-6.27e-03

Pos logit contributions
P+0.042
ww+0.040
bly+0.039
Now+0.038
lish+0.038
Neg logit contributions
 smile-0.016
 cry-0.016
 tell-0.016
 bed-0.015
 call-0.015
Token,
Token ID11
Feature activation-4.46e-02

Pos logit contributions
P+0.019
ww+0.018
Now+0.018
bly+0.018
lish+0.018
Neg logit contributions
 cry-0.006
 smile-0.006
 tell-0.006
 bed-0.006
 call-0.005
Token they
Token ID484
Feature activation+2.58e-04

Pos logit contributions
P+0.130
ww+0.126
Now+0.122
bly+0.118
lish+0.117
Neg logit contributions
 bed-0.055
 smile-0.054
 tell-0.051
 feel-0.050
 mom-0.050
Token on
Token ID319
Feature activation+1.83e-02

Pos logit contributions
P+0.069
ww+0.065
Now+0.064
bly+0.059
lish+0.058
Neg logit contributions
 smile-0.045
 cry-0.042
 bed-0.042
 tell-0.041
 call-0.040
Token the
Token ID262
Feature activation+3.05e-04

Pos logit contributions
 tell+0.028
 cry+0.028
 smile+0.027
 weep+0.026
 call+0.026
Neg logit contributions
P-0.043
Now-0.040
ww-0.040
bly-0.040
lish-0.039
Token surface
Token ID4417
Feature activation-1.04e-02

Pos logit contributions
 cry+0.001
 tell+0.001
 smile+0.001
 bed+0.000
 call+0.000
Neg logit contributions
P-0.001
ww-0.001
Now-0.001
bly-0.001
lish-0.001
Token.
Token ID13
Feature activation+4.46e-04

Pos logit contributions
P+0.032
ww+0.030
Now+0.030
bly+0.029
lish+0.029
Neg logit contributions
 smile-0.010
 cry-0.010
 tell-0.010
 bed-0.009
 call-0.009
Token
Token ID198
Feature activation-1.34e-02

Pos logit contributions
 cry+0.001
 smile+0.001
 tell+0.001
 bed+0.001
 call+0.001
Neg logit contributions
P-0.001
ww-0.001
Now-0.001
bly-0.001
lish-0.001
Token
Token ID198
Feature activation-1.26e-01

Pos logit contributions
P+0.042
ww+0.041
Now+0.039
bly+0.039
lish+0.039
Neg logit contributions
 smile-0.012
 cry-0.012
 tell-0.012
 bed-0.011
 call-0.011
Token"
Token ID1
Feature activation-7.31e-02

Pos logit contributions
w+0.264
rob+0.259
ww+0.251
ymes+0.248
iest+0.247
Neg logit contributions
-0.157
 find-0.151
 play-0.149
 -0.147
 be-0.142
TokenWow
Token ID22017
Feature activation-2.14e-02

Pos logit contributions
lish+0.117
bly+0.109
ross+0.108
ww+0.106
dd+0.102
Neg logit contributions
 smile-0.155
 gaze-0.155
 stare-0.152
 tell-0.150
 question-0.149
Token,
Token ID11
Feature activation-9.77e-03

Pos logit contributions
P+0.064
Now+0.060
ww+0.060
bly+0.058
lish+0.057
Neg logit contributions
 smile-0.024
 cry-0.023
 tell-0.023
 bed-0.022
 call-0.021
Token it
Token ID340
Feature activation+9.42e-03

Pos logit contributions
P+0.027
ww+0.026
Now+0.025
bly+0.025
lish+0.024
Neg logit contributions
 smile-0.013
 cry-0.013
 tell-0.012
 bed-0.012
 call-0.012
Token is
Token ID318
Feature activation+1.14e-03

Pos logit contributions
 bed+0.015
 smile+0.015
 cry+0.015
 tell+0.014
 stomach+0.014
Neg logit contributions
P-0.023
ww-0.022
Now-0.021
bly-0.020
lish-0.020
Token and
Token ID290
Feature activation+1.84e-02

Pos logit contributions
 cry+0.050
 stomach+0.048
 smile+0.048
 tell+0.047
 gaze+0.046
Neg logit contributions
P-0.064
sc-0.060
Now-0.059
bly-0.057
ww-0.056
Token opened
Token ID4721
Feature activation-5.82e-03

Pos logit contributions
 cry+0.031
 smile+0.030
 stomach+0.030
 bed+0.030
 weeping+0.029
Neg logit contributions
P-0.041
ww-0.036
Now-0.036
bly-0.036
ha-0.036
Token it
Token ID340
Feature activation+7.50e-03

Pos logit contributions
P+0.014
ww+0.013
Now+0.013
bly+0.013
lish+0.012
Neg logit contributions
 smile-0.009
 cry-0.009
 tell-0.009
 bed-0.009
 call-0.009
Token.
Token ID13
Feature activation+7.83e-03

Pos logit contributions
 cry+0.009
 smile+0.009
 tell+0.008
 stomach+0.008
 bed+0.008
Neg logit contributions
P-0.021
ww-0.020
Now-0.019
bly-0.019
sc-0.019
Token
Token ID198
Feature activation-1.59e-02

Pos logit contributions
 cry+0.012
 smile+0.012
 tell+0.011
 weep+0.011
 call+0.011
Neg logit contributions
P-0.020
ww-0.018
Now-0.018
bly-0.018
Soon-0.017
Token
Token ID198
Feature activation-1.25e-01

Pos logit contributions
P+0.050
ww+0.048
bly+0.047
Now+0.047
lish+0.046
Neg logit contributions
 smile-0.014
 cry-0.013
 tell-0.013
 bed-0.013
 call-0.012
TokenThey
Token ID2990
Feature activation+1.22e-02

Pos logit contributions
rob+0.261
iest+0.250
ka+0.248
w+0.243
ww+0.241
Neg logit contributions
 find-0.190
 ask-0.169
 play-0.166
 see-0.165
 call-0.163
Token saw
Token ID2497
Feature activation+2.38e-02

Pos logit contributions
 cry+0.020
 bed+0.020
 smile+0.019
 weeping+0.019
 tell+0.019
Neg logit contributions
P-0.029
Now-0.025
ww-0.025
bly-0.025
ha-0.025
Token many
Token ID867
Feature activation-1.23e-02

Pos logit contributions
 smile+0.048
 cry+0.047
 tell+0.047
 question+0.046
 call+0.046
Neg logit contributions
P-0.044
ww-0.041
bly-0.040
Now-0.038
lish-0.037
Token pages
Token ID5468
Feature activation+1.34e-02

Pos logit contributions
P+0.028
ww+0.026
Now+0.025
bly+0.025
lish+0.024
Neg logit contributions
 smile-0.022
 cry-0.022
 tell-0.022
 bed-0.021
 call-0.021
Token with
Token ID351
Feature activation+2.24e-02

Pos logit contributions
 cry+0.020
 smile+0.019
 tell+0.019
 stomach+0.019
 bed+0.019
Neg logit contributions
P-0.033
bly-0.031
ww-0.030
Now-0.030
lish-0.029
Token nod
Token ID18666
Feature activation+8.83e-03

Pos logit contributions
 bed+0.077
 cry+0.075
 tell+0.073
 weeping+0.071
 regret+0.070
Neg logit contributions
bly-0.115
P-0.114
ww-0.110
lish-0.109
Now-0.105
Token his
Token ID465
Feature activation+1.48e-02

Pos logit contributions
 cry+0.012
 tell+0.012
 bed+0.012
 smile+0.011
 call+0.011
Neg logit contributions
P-0.022
bly-0.021
ww-0.021
lish-0.021
Now-0.021
Token head
Token ID1182
Feature activation+3.34e-03

Pos logit contributions
 tell+0.013
 bed+0.012
 cry+0.012
 weeping+0.012
 smile+0.012
Neg logit contributions
P-0.044
ww-0.042
bly-0.042
lish-0.041
Now-0.039
Token.
Token ID13
Feature activation+8.73e-03

Pos logit contributions
 cry+0.004
 tell+0.004
 smile+0.004
 bed+0.004
 call+0.004
Neg logit contributions
P-0.009
ww-0.009
bly-0.008
Now-0.008
lish-0.008
Token
Token ID198
Feature activation+9.46e-04

Pos logit contributions
 cry+0.011
 smile+0.011
 tell+0.011
 gaze+0.010
 weep+0.010
Neg logit contributions
P-0.024
ww-0.023
Now-0.022
bly-0.021
Soon-0.021
Token
Token ID198
Feature activation-1.25e-01

Pos logit contributions
 smile+0.001
 cry+0.001
 tell+0.001
 bed+0.001
 stomach+0.001
Neg logit contributions
P-0.003
ww-0.003
Now-0.003
bly-0.003
Soon-0.002
TokenL
Token ID43
Feature activation-4.50e-02

Pos logit contributions
w+0.259
bly+0.251
dd+0.250
iest+0.250
ww+0.229
Neg logit contributions
 -0.179
-0.177
 find-0.174
 call-0.174
 play-0.173
Tokenily
Token ID813
Feature activation-3.81e-03

Pos logit contributions
P+0.089
ww+0.087
bly+0.078
Sorry+0.075
Now+0.075
Neg logit contributions
 tell-0.091
 call-0.088
 cry-0.087
 smile-0.086
 question-0.086
Token looked
Token ID3114
Feature activation+1.72e-03

Pos logit contributions
P+0.010
ww+0.010
Now+0.009
bly+0.009
lish+0.009
Neg logit contributions
 cry-0.005
 smile-0.005
 tell-0.005
 bed-0.005
 call-0.005
Token around
Token ID1088
Feature activation+2.59e-02

Pos logit contributions
 cry+0.003
 bed+0.003
 smile+0.003
 tell+0.003
 call+0.003
Neg logit contributions
P-0.004
ww-0.003
bly-0.003
lish-0.003
Now-0.003
Token for
Token ID329
Feature activation+1.54e-02

Pos logit contributions
 cry+0.044
 tell+0.041
 call+0.041
 stomach+0.040
 weep+0.040
Neg logit contributions
P-0.056
Now-0.053
lish-0.051
Soon-0.050
bly-0.049
Token to
Token ID284
Feature activation+2.25e-02

Pos logit contributions
 cry+0.025
 smile+0.024
 tell+0.023
 stomach+0.023
 weep+0.022
Neg logit contributions
P-0.046
sc-0.042
ww-0.042
Now-0.042
bly-0.041
Token the
Token ID262
Feature activation-5.69e-03

Pos logit contributions
 cry+0.027
 weeping+0.026
 crying+0.026
 stomach+0.026
 smile+0.025
Neg logit contributions
P-0.062
Now-0.057
ww-0.056
bly-0.055
ha-0.054
Token middle
Token ID3504
Feature activation+1.94e-02

Pos logit contributions
P+0.016
ww+0.015
Now+0.014
bly+0.014
lish+0.014
Neg logit contributions
 smile-0.008
 cry-0.008
 tell-0.007
 bed-0.007
 call-0.007
Token.
Token ID13
Feature activation+1.90e-03

Pos logit contributions
 cry+0.038
 smile+0.037
 tell+0.036
 weeping+0.036
 weep+0.036
Neg logit contributions
P-0.039
ww-0.036
Now-0.035
bly-0.033
lish-0.033
Token
Token ID198
Feature activation-2.24e-02

Pos logit contributions
 cry+0.002
 smile+0.002
 tell+0.002
 call+0.002
 bed+0.002
Neg logit contributions
P-0.005
ww-0.005
Now-0.005
bly-0.005
Soon-0.005
Token
Token ID198
Feature activation-1.25e-01

Pos logit contributions
P+0.072
ww+0.070
bly+0.068
lish+0.067
Now+0.067
Neg logit contributions
 smile-0.017
 tell-0.017
 cry-0.017
 bed-0.016
 call-0.016
Token"
Token ID1
Feature activation-7.91e-02

Pos logit contributions
iest+0.233
bly+0.229
w+0.228
isher+0.227
ymes+0.217
Neg logit contributions
 -0.200
-0.198
 play-0.195
 find-0.187
 have-0.185
TokenI
Token ID40
Feature activation-1.20e-02

Pos logit contributions
bly+0.146
ww+0.138
lish+0.136
dd+0.131
bling+0.126
Neg logit contributions
 question-0.145
 gaze-0.144
 call-0.144
 smile-0.142
 search-0.141
Token do
Token ID466
Feature activation-1.83e-02

Pos logit contributions
P+0.033
ww+0.032
Now+0.031
bly+0.031
lish+0.030
Neg logit contributions
 smile-0.015
 cry-0.015
 tell-0.015
 bed-0.014
 call-0.014
Token,
Token ID11
Feature activation-2.18e-02

Pos logit contributions
P+0.042
ww+0.039
Now+0.038
bly+0.038
lish+0.037
Neg logit contributions
 smile-0.033
 tell-0.032
 cry-0.032
 bed-0.032
 call-0.031
Token I
Token ID314
Feature activation-9.09e-03

Pos logit contributions
P+0.063
ww+0.059
Now+0.058
bly+0.058
lish+0.056
Neg logit contributions
 smile-0.027
 tell-0.027
 cry-0.027
 bed-0.026
 call-0.025
Token small
Token ID1402
Feature activation+8.51e-03

Pos logit contributions
P+0.009
ww+0.008
Now+0.008
bly+0.008
lish+0.008
Neg logit contributions
 cry-0.005
 smile-0.005
 tell-0.005
 bed-0.005
 call-0.005
Token opening
Token ID4756
Feature activation-4.47e-02

Pos logit contributions
 tell+0.012
 cry+0.012
 smile+0.011
 call+0.011
 bed+0.011
Neg logit contributions
P-0.022
ww-0.021
Now-0.019
bly-0.019
lish-0.019
Token.
Token ID13
Feature activation+8.33e-03

Pos logit contributions
P+0.116
ww+0.110
Now+0.107
lish+0.101
bly+0.101
Neg logit contributions
 smile-0.068
 bed-0.067
 cry-0.064
 eyes-0.064
 call-0.063
Token He
Token ID679
Feature activation-1.23e-02

Pos logit contributions
 smile+0.012
 cry+0.012
 tell+0.012
 call+0.011
 bed+0.011
Neg logit contributions
P-0.021
ww-0.020
Now-0.020
bly-0.019
Spl-0.018
Token remembered
Token ID12086
Feature activation+1.85e-02

Pos logit contributions
P+0.033
ww+0.031
Now+0.030
bly+0.030
lish+0.029
Neg logit contributions
 smile-0.017
 cry-0.017
 tell-0.017
 bed-0.016
 call-0.016
Token his
Token ID465
Feature activation+5.15e-02

Pos logit contributions
 cry+0.031
 smile+0.030
 call+0.029
 weep+0.028
 gaze+0.028
Neg logit contributions
P-0.042
ww-0.038
Now-0.037
bly-0.037
ha-0.036
Token mom
Token ID1995
Feature activation+4.18e-02

Pos logit contributions
 cry+0.053
 tell+0.053
 Her+0.051
 call+0.051
 weep+0.050
Neg logit contributions
P-0.131
ww-0.124
ha-0.115
Soon-0.112
A-0.111
Token's
Token ID338
Feature activation+4.91e-02

Pos logit contributions
 bed+0.076
 cry+0.073
 stomach+0.073
ses+0.071
 gaze+0.069
Neg logit contributions
P-0.079
Now-0.073
lish-0.071
it-0.071
bly-0.070
Token necklace
Token ID38987
Feature activation+1.06e-02

Pos logit contributions
 cry+0.070
 bed+0.069
 tell+0.068
 stomach+0.067
 weep+0.067
Neg logit contributions
P-0.099
Now-0.093
ww-0.092
ha-0.089
bly-0.087
Token and
Token ID290
Feature activation+1.17e-02

Pos logit contributions
 cry+0.014
 smile+0.014
 tell+0.013
 bed+0.013
 gaze+0.013
Neg logit contributions
P-0.028
ww-0.026
bly-0.026
Now-0.025
lish-0.025
Token thought
Token ID1807
Feature activation+1.06e-03

Pos logit contributions
 cry+0.019
 smile+0.019
 bed+0.018
 tell+0.018
 call+0.018
Neg logit contributions
P-0.027
ww-0.025
Now-0.025
bly-0.024
ha-0.023
Token a
Token ID257
Feature activation-8.47e-03

Pos logit contributions
P+0.053
ww+0.051
bly+0.049
Now+0.049
lish+0.049
Neg logit contributions
 smile-0.032
 bed-0.032
 cry-0.032
 tell-0.031
 call-0.030
Token brave
Token ID14802
Feature activation-5.71e-03

Pos logit contributions
P+0.022
ww+0.021
Now+0.020
bly+0.020
lish+0.019
Neg logit contributions
 smile-0.012
 cry-0.012
 tell-0.012
 bed-0.012
 call-0.012
Token boy
Token ID2933
Feature activation+2.71e-02

Pos logit contributions
P+0.014
ww+0.014
Now+0.013
bly+0.013
lish+0.013
Neg logit contributions
 cry-0.009
 smile-0.009
 tell-0.008
 bed-0.008
 call-0.008
Token who
Token ID508
Feature activation-9.82e-03

Pos logit contributions
 smile+0.048
 cry+0.048
 bed+0.045
 stomach+0.044
 eyes+0.043
Neg logit contributions
P-0.065
ww-0.056
Now-0.053
lish-0.052
Spl-0.052
Token wanted
Token ID2227
Feature activation+4.59e-03

Pos logit contributions
P+0.025
ww+0.024
Now+0.023
bly+0.023
lish+0.022
Neg logit contributions
 smile-0.015
 cry-0.015
 tell-0.014
 bed-0.014
 call-0.014
Token to
Token ID284
Feature activation+5.25e-02

Pos logit contributions
 smile+0.006
 cry+0.006
 tell+0.006
 call+0.005
 bed+0.005
Neg logit contributions
P-0.013
ww-0.012
Now-0.011
bly-0.011
ha-0.011
Token solve
Token ID8494
Feature activation-5.50e-03

Pos logit contributions
 weeping+0.073
 crying+0.069
ses+0.066
sed+0.063
 eyes+0.062
Neg logit contributions
P-0.132
ww-0.117
A-0.115
Now-0.113
Soon-0.109
Token a
Token ID257
Feature activation+1.43e-02

Pos logit contributions
P+0.014
ww+0.013
Now+0.013
bly+0.013
lish+0.012
Neg logit contributions
 smile-0.008
 cry-0.008
 tell-0.008
 bed-0.008
 call-0.008
Token problem
Token ID1917
Feature activation+1.33e-02

Pos logit contributions
 smile+0.024
 cry+0.024
 tell+0.024
 bed+0.023
 call+0.022
Neg logit contributions
P-0.034
ww-0.032
ha-0.029
Now-0.028
lish-0.028
Token.
Token ID13
Feature activation+2.05e-02

Pos logit contributions
 smile+0.016
 cry+0.016
 stomach+0.015
 tell+0.014
 gaze+0.014
Neg logit contributions
P-0.038
ww-0.035
Now-0.034
bly-0.034
lish-0.033
Token He
Token ID679
Feature activation-1.64e-02

Pos logit contributions
 cry+0.040
 smile+0.039
 tell+0.038
 weeping+0.038
 crying+0.037
Neg logit contributions
P-0.041
Now-0.039
ww-0.037
Spl-0.036
Soon-0.035
Token stop
Token ID2245
Feature activation+2.83e-02

Pos logit contributions
 stomach+0.044
 eyes+0.042
 weeping+0.040
 tum+0.040
 throat+0.040
Neg logit contributions
P-0.076
bly-0.074
Now-0.070
ww-0.068
wing-0.068
Token the
Token ID262
Feature activation-9.86e-03

Pos logit contributions
 stomach+0.043
 question+0.043
 smile+0.042
 gaze+0.041
 tell+0.041
Neg logit contributions
P-0.060
bly-0.060
Now-0.059
lish-0.058
ww-0.056
Token robot
Token ID9379
Feature activation+1.66e-02

Pos logit contributions
P+0.026
ww+0.024
Now+0.024
bly+0.023
lish+0.023
Neg logit contributions
 smile-0.014
 cry-0.014
 tell-0.014
 bed-0.014
 call-0.013
Token.
Token ID13
Feature activation+3.56e-03

Pos logit contributions
 smile+0.021
 stomach+0.020
 cry+0.020
 tell+0.020
 bed+0.020
Neg logit contributions
P-0.043
ww-0.041
bly-0.039
Now-0.039
sc-0.039
Token But
Token ID887
Feature activation-1.68e-02

Pos logit contributions
 smile+0.005
 cry+0.005
 tell+0.005
 bed+0.004
 weeping+0.004
Neg logit contributions
P-0.010
Now-0.009
ww-0.009
bly-0.009
Soon-0.009
Token they
Token ID484
Feature activation+5.45e-02

Pos logit contributions
P+0.047
ww+0.044
Now+0.043
bly+0.042
lish+0.041
Neg logit contributions
 smile-0.022
 cry-0.022
 tell-0.022
 bed-0.022
 call-0.021
Token do
Token ID466
Feature activation-6.67e-03

Pos logit contributions
 weeping+0.090
 bed+0.089
 crying+0.084
 stomach+0.081
 Her+0.080
Neg logit contributions
P-0.104
w-0.102
Now-0.098
bly-0.096
ww-0.095
Token not
Token ID407
Feature activation+8.88e-03

Pos logit contributions
P+0.019
ww+0.018
Now+0.017
bly+0.017
lish+0.017
Neg logit contributions
 smile-0.008
 cry-0.008
 tell-0.008
 bed-0.008
 call-0.008
Token know
Token ID760
Feature activation+1.49e-03

Pos logit contributions
 bed+0.012
 stomach+0.012
 eyes+0.011
 weeping+0.011
 smile+0.011
Neg logit contributions
P-0.023
ww-0.022
Now-0.021
bly-0.021
lish-0.020
Token how
Token ID703
Feature activation-7.74e-03

Pos logit contributions
 smile+0.002
 cry+0.002
 tell+0.002
 bed+0.002
 call+0.002
Neg logit contributions
P-0.004
ww-0.004
bly-0.004
Now-0.004
lish-0.003
Token.
Token ID13
Feature activation+8.60e-03

Pos logit contributions
P+0.023
ww+0.022
Now+0.021
bly+0.021
lish+0.021
Neg logit contributions
 smile-0.008
 cry-0.008
 tell-0.008
 bed-0.008
 call-0.008
Token The
Token ID383
Feature activation-3.01e-04

Pos logit contributions
 smile+0.013
 cry+0.012
 tell+0.012
 call+0.012
 bed+0.011
Neg logit contributions
P-0.025
ww-0.023
Now-0.023
Spl-0.022
ha-0.022
Token lizard
Token ID42406
Feature activation-4.38e-03

Pos logit contributions
P+0.001
ww+0.001
Now+0.001
bly+0.001
lish+0.001
Neg logit contributions
 smile-0.001
 cry-0.001
 tell-0.001
 bed-0.001
 call-0.001
Token quickly
Token ID2952
Feature activation-1.20e-02

Pos logit contributions
P+0.011
ww+0.011
Now+0.010
bly+0.010
lish+0.010
Neg logit contributions
 smile-0.006
 cry-0.006
 bed-0.006
 tell-0.006
 call-0.006
Token moved
Token ID3888
Feature activation-4.09e-02

Pos logit contributions
P+0.029
ww+0.028
Now+0.027
bly+0.027
lish+0.026
Neg logit contributions
 smile-0.019
 cry-0.019
 tell-0.019
 bed-0.019
 call-0.018
Token back
Token ID736
Feature activation+1.80e-02

Pos logit contributions
P+0.115
ww+0.113
Now+0.103
bly+0.102
lish+0.099
Neg logit contributions
 smile-0.061
 cry-0.055
 tell-0.054
 feel-0.053
 search-0.052
Token to
Token ID284
Feature activation+4.94e-02

Pos logit contributions
 cry+0.028
 tell+0.027
 smile+0.027
 stomach+0.026
 call+0.026
Neg logit contributions
P-0.043
ww-0.040
Now-0.040
bly-0.038
Soon-0.038
Token its
Token ID663
Feature activation+4.22e-02

Pos logit contributions
 stomach+0.097
 throat+0.094
 cry+0.094
 tum+0.092
 smile+0.091
Neg logit contributions
Now-0.073
Soon-0.068
Good-0.065
P-0.064
Spl-0.061
Token home
Token ID1363
Feature activation+4.37e-02

Pos logit contributions
 cry+0.056
 tell+0.054
 call+0.053
 stare+0.052
 weep+0.051
Neg logit contributions
P-0.096
ww-0.091
Soon-0.086
Now-0.085
lish-0.085
Token under
Token ID739
Feature activation+2.14e-02

Pos logit contributions
 cry+0.071
 question+0.068
 tell+0.067
 call+0.066
 weep+0.065
Neg logit contributions
it-0.082
bly-0.081
Now-0.081
P-0.079
Soon-0.078
Token a
Token ID257
Feature activation+1.36e-02

Pos logit contributions
 cry+0.039
 smile+0.038
 tell+0.038
 call+0.036
 question+0.036
Neg logit contributions
P-0.045
Now-0.042
ww-0.040
bly-0.039
A-0.038
Token big
Token ID1263
Feature activation+3.54e-02

Pos logit contributions
 tell+0.020
 cry+0.020
 smile+0.020
 weeping+0.019
 call+0.019
Neg logit contributions
P-0.033
ww-0.032
ha-0.030
Now-0.030
bly-0.028
Token girl
Token ID2576
Feature activation+5.25e-03

Pos logit contributions
P+0.008
ww+0.008
Now+0.007
bly+0.007
lish+0.007
Neg logit contributions
 smile-0.008
 cry-0.008
 tell-0.008
 bed-0.008
 call-0.007
Token offered
Token ID4438
Feature activation+3.18e-02

Pos logit contributions
 cry+0.008
 bed+0.008
 smile+0.008
 tell+0.008
 call+0.007
Neg logit contributions
P-0.013
ww-0.012
Now-0.012
bly-0.011
lish-0.011
Token to
Token ID284
Feature activation+4.24e-02

Pos logit contributions
 cry+0.058
 smile+0.055
 gaze+0.055
 weep+0.054
 bed+0.054
Neg logit contributions
P-0.065
Now-0.056
ww-0.055
S-0.054
Soon-0.054
Token help
Token ID1037
Feature activation+2.53e-02

Pos logit contributions
 weeping+0.052
 cry+0.051
 fault+0.050
 crying+0.049
 horror+0.049
Neg logit contributions
P-0.106
Now-0.097
bly-0.094
ww-0.092
A-0.092
Token the
Token ID262
Feature activation+2.21e-02

Pos logit contributions
 smile+0.040
 cry+0.039
 gaze+0.038
 bed+0.038
 stomach+0.037
Neg logit contributions
P-0.059
Now-0.053
bly-0.052
lish-0.051
ww-0.051
Token bird
Token ID6512
Feature activation+6.39e-02

Pos logit contributions
 tell+0.040
 bed+0.039
 smile+0.039
 cry+0.039
 call+0.038
Neg logit contributions
P-0.049
ww-0.044
Now-0.041
Soon-0.039
bly-0.039
Token get
Token ID651
Feature activation-3.84e-03

Pos logit contributions
ses+0.108
sed+0.104
 bed+0.098
 horror+0.098
 cry+0.096
Neg logit contributions
P-0.122
bly-0.111
sc-0.109
it-0.107
wing-0.106
Token warm
Token ID5814
Feature activation+5.65e-03

Pos logit contributions
P+0.009
ww+0.009
bly+0.008
Now+0.008
lish+0.008
Neg logit contributions
 cry-0.006
 smile-0.006
 tell-0.006
 bed-0.006
 call-0.006
Token.
Token ID13
Feature activation+2.17e-02

Pos logit contributions
 cry+0.008
 tell+0.007
 smile+0.007
 call+0.007
 question+0.007
Neg logit contributions
P-0.015
ww-0.014
bly-0.014
Now-0.014
lish-0.014
Token She
Token ID1375
Feature activation+1.45e-03

Pos logit contributions
 cry+0.030
 smile+0.029
 tell+0.028
 bed+0.028
 crying+0.027
Neg logit contributions
P-0.054
Now-0.053
ww-0.052
Soon-0.049
bly-0.049
Token took
Token ID1718
Feature activation-1.89e-02

Pos logit contributions
 cry+0.002
 smile+0.002
 bed+0.002
 tell+0.002
 call+0.002
Neg logit contributions
P-0.004
ww-0.003
Now-0.003
bly-0.003
lish-0.003
Token,
Token ID11
Feature activation-3.41e-03

Pos logit contributions
P+0.003
ww+0.003
Now+0.003
bly+0.003
lish+0.003
Neg logit contributions
 cry-0.001
 smile-0.001
 tell-0.001
 bed-0.001
 call-0.001
Token she
Token ID673
Feature activation+9.76e-03

Pos logit contributions
P+0.008
ww+0.008
Now+0.007
bly+0.007
lish+0.007
Neg logit contributions
 smile-0.006
 cry-0.005
 tell-0.005
 bed-0.005
 call-0.005
Token smiled
Token ID13541
Feature activation-1.46e-02

Pos logit contributions
 cry+0.017
 bed+0.017
 smile+0.016
 tell+0.016
 stomach+0.016
Neg logit contributions
P-0.022
Now-0.020
ww-0.020
bly-0.019
lish-0.019
Token,
Token ID11
Feature activation+8.98e-03

Pos logit contributions
P+0.039
ww+0.037
Now+0.036
bly+0.035
lish+0.035
Neg logit contributions
 smile-0.021
 cry-0.020
 tell-0.020
 bed-0.019
 call-0.019
Token knowing
Token ID6970
Feature activation+9.55e-03

Pos logit contributions
 cry+0.014
 bed+0.013
 tell+0.013
 smile+0.013
 call+0.013
Neg logit contributions
P-0.022
Now-0.021
ww-0.020
bly-0.020
Soon-0.019
Token she
Token ID673
Feature activation-6.28e-03

Pos logit contributions
 cry+0.017
 smile+0.017
 call+0.016
 tell+0.016
 bed+0.016
Neg logit contributions
P-0.021
ww-0.019
Now-0.019
bly-0.019
ha-0.018
Token had
Token ID550
Feature activation-7.87e-03

Pos logit contributions
P+0.015
ww+0.014
Now+0.014
bly+0.013
lish+0.013
Neg logit contributions
 cry-0.010
 smile-0.010
 bed-0.010
 tell-0.010
 call-0.010
Token done
Token ID1760
Feature activation-2.97e-02

Pos logit contributions
P+0.019
ww+0.018
Now+0.017
bly+0.017
lish+0.016
Neg logit contributions
 smile-0.013
 cry-0.013
 tell-0.013
 bed-0.013
 call-0.012
Token it
Token ID340
Feature activation-2.91e-02

Pos logit contributions
P+0.078
ww+0.075
Now+0.070
bly+0.069
lish+0.068
Neg logit contributions
 smile-0.045
 bed-0.043
 tell-0.043
 cry-0.042
 call-0.040
Token all
Token ID477
Feature activation-6.27e-03

Pos logit contributions
P+0.089
ww+0.086
Now+0.082
bly+0.081
lish+0.080
Neg logit contributions
 smile-0.031
 tell-0.029
 bed-0.029
 cry-0.028
 call-0.027
Token by
Token ID416
Feature activation+2.34e-02

Pos logit contributions
P+0.015
ww+0.014
Now+0.013
bly+0.013
lish+0.013
Neg logit contributions
 cry-0.011
 smile-0.011
 tell-0.010
 bed-0.010
 call-0.010
Token
Token ID198
Feature activation-8.46e-02

Pos logit contributions
P+0.076
ww+0.074
bly+0.071
lish+0.071
Now+0.070
Neg logit contributions
 smile-0.017
 tell-0.017
 cry-0.017
 bed-0.017
 call-0.016
TokenSuddenly
Token ID38582
Feature activation-4.00e-02

Pos logit contributions
P+0.139
bly+0.139
sc+0.138
lish+0.136
w+0.136
Neg logit contributions
 tell-0.170
 -0.165
 question-0.160
 call-0.160
 know-0.159
Token,
Token ID11
Feature activation-3.97e-02

Pos logit contributions
P+0.141
Now+0.134
ww+0.132
bly+0.131
lish+0.128
Neg logit contributions
 smile-0.024
 bed-0.024
 cry-0.021
 tell-0.021
 call-0.019
Token there
Token ID612
Feature activation-4.39e-03

Pos logit contributions
P+0.117
ww+0.110
Now+0.107
lish+0.106
sc+0.105
Neg logit contributions
 bed-0.048
 smile-0.047
 tell-0.046
 cry-0.045
 call-0.043
Token was
Token ID373
Feature activation-3.58e-02

Pos logit contributions
P+0.011
ww+0.011
Now+0.010
bly+0.010
lish+0.010
Neg logit contributions
 smile-0.007
 cry-0.006
 tell-0.006
 bed-0.006
 call-0.006
Token a
Token ID257
Feature activation+2.10e-04

Pos logit contributions
P+0.096
ww+0.092
Now+0.088
bly+0.087
lish+0.085
Neg logit contributions
 bed-0.051
 smile-0.049
 cry-0.048
 stomach-0.048
 tell-0.047
Token knock
Token ID10643
Feature activation-6.58e-02

Pos logit contributions
 cry+0.000
 smile+0.000
 tell+0.000
 bed+0.000
 call+0.000
Neg logit contributions
P-0.000
ww-0.000
Now-0.000
bly-0.000
lish-0.000
Token on
Token ID319
Feature activation-9.51e-03

Pos logit contributions
P+0.152
ww+0.146
Now+0.145
bly+0.134
Soon+0.134
Neg logit contributions
 crying-0.109
 call-0.103
 tell-0.102
 smile-0.102
 cry-0.100
Token the
Token ID262
Feature activation+2.28e-02

Pos logit contributions
P+0.026
ww+0.025
Now+0.024
bly+0.024
lish+0.023
Neg logit contributions
 smile-0.012
 cry-0.012
 tell-0.012
 bed-0.012
 call-0.011
Token door
Token ID3420
Feature activation+2.90e-02

Pos logit contributions
 tell+0.032
 cry+0.031
 smile+0.030
 weep+0.028
 weeping+0.028
Neg logit contributions
P-0.055
ww-0.052
Now-0.052
bly-0.050
lish-0.048
Token.
Token ID13
Feature activation-3.54e-03

Pos logit contributions
 stomach+0.038
 cry+0.037
 smile+0.036
 weeping+0.035
 weep+0.034
Neg logit contributions
P-0.070
Now-0.068
ww-0.067
bly-0.067
it-0.065
Token animal
Token ID5044
Feature activation+5.39e-03

Pos logit contributions
 smile+0.020
 cry+0.019
 bed+0.019
 tell+0.018
 gaze+0.018
Neg logit contributions
P-0.029
ww-0.028
Now-0.025
bly-0.025
lish-0.025
Token?"
Token ID1701
Feature activation+5.08e-02

Pos logit contributions
 smile+0.009
 cry+0.008
 bed+0.008
 tell+0.008
 gaze+0.008
Neg logit contributions
P-0.013
ww-0.012
bly-0.012
Now-0.012
lish-0.011
Token
Token ID198
Feature activation+2.21e-02

Pos logit contributions
 cry+0.073
 smile+0.070
 fault+0.069
 weeping+0.069
 regret+0.069
Neg logit contributions
P-0.124
S-0.111
A-0.110
Now-0.109
Soon-0.107
Token
Token ID198
Feature activation-4.91e-02

Pos logit contributions
 cry+0.031
 smile+0.031
 weeping+0.030
 bed+0.030
 tell+0.029
Neg logit contributions
P-0.059
Now-0.054
ww-0.053
Soon-0.052
O-0.050
TokenHer
Token ID9360
Feature activation+5.59e-02

Pos logit contributions
ww+0.095
sc+0.095
lish+0.094
bly+0.092
P+0.092
Neg logit contributions
 smile-0.097
 tell-0.095
 cry-0.094
 call-0.092
 gaze-0.092
Token mom
Token ID1995
Feature activation+3.25e-02

Pos logit contributions
 stare+0.085
 weep+0.085
 gaze+0.083
 die+0.081
 sleep+0.080
Neg logit contributions
P-0.107
ww-0.099
S-0.098
Mom-0.096
A-0.095
Token says
Token ID1139
Feature activation+9.85e-03

Pos logit contributions
 bed+0.042
 stomach+0.040
 cry+0.038
 throat+0.037
 tum+0.037
Neg logit contributions
P-0.083
ww-0.079
Hey-0.078
Now-0.078
ha-0.077
Token,
Token ID11
Feature activation+3.49e-03

Pos logit contributions
 cry+0.012
 tell+0.011
 bed+0.011
 smile+0.011
 call+0.011
Neg logit contributions
P-0.027
ww-0.026
bly-0.025
lish-0.025
ha-0.025
Token "
Token ID366
Feature activation-7.02e-02

Pos logit contributions
 cry+0.005
 smile+0.005
 tell+0.005
 bed+0.005
 call+0.004
Neg logit contributions
P-0.009
ww-0.009
Now-0.008
bly-0.008
lish-0.008
TokenThat
Token ID2504
Feature activation-6.74e-03

Pos logit contributions
bly+0.108
lish+0.099
ross+0.098
P+0.092
ww+0.090
Neg logit contributions
 crying-0.151
 say-0.150
 cry-0.150
 call-0.149
 gaze-0.149
Token is
Token ID318
Feature activation-9.18e-03

Pos logit contributions
P+0.017
ww+0.016
Now+0.016
bly+0.015
lish+0.015
Neg logit contributions
 smile-0.010
 cry-0.010
 tell-0.010
 bed-0.010
 call-0.009
Tokenicked
Token ID9484
Feature activation-2.06e-02

Pos logit contributions
P+0.016
ww+0.015
Now+0.015
bly+0.015
lish+0.014
Neg logit contributions
 smile-0.012
 cry-0.012
 tell-0.012
 bed-0.012
 call-0.011
Token Anna
Token ID11735
Feature activation+1.78e-02

Pos logit contributions
P+0.060
ww+0.057
Now+0.055
bly+0.054
lish+0.053
Neg logit contributions
 smile-0.025
 cry-0.025
 tell-0.024
 bed-0.024
 call-0.023
Token's
Token ID338
Feature activation+1.64e-02

Pos logit contributions
 tell+0.027
 bed+0.027
 question+0.027
 smile+0.027
 call+0.026
Neg logit contributions
P-0.041
ww-0.038
lish-0.037
bly-0.037
Now-0.037
Token hand
Token ID1021
Feature activation+2.58e-02

Pos logit contributions
 tell+0.013
 cry+0.011
 bed+0.011
 weeping+0.011
 question+0.011
Neg logit contributions
P-0.051
ww-0.048
bly-0.048
Now-0.047
lish-0.047
Token.
Token ID13
Feature activation+3.73e-04

Pos logit contributions
 tell+0.032
 cry+0.031
 bed+0.030
 question+0.030
 smile+0.029
Neg logit contributions
P-0.065
lish-0.064
bly-0.063
ww-0.061
Now-0.060
Token Anna
Token ID11735
Feature activation+3.93e-03

Pos logit contributions
 cry+0.001
 smile+0.001
 tell+0.001
 bed+0.001
 call+0.001
Neg logit contributions
P-0.001
ww-0.001
Now-0.001
bly-0.001
lish-0.001
Token gigg
Token ID30442
Feature activation-4.57e-02

Pos logit contributions
 cry+0.006
 bed+0.006
 smile+0.006
 tell+0.005
 call+0.005
Neg logit contributions
P-0.010
ww-0.009
Now-0.009
bly-0.009
lish-0.009
Tokenled
Token ID992
Feature activation-1.71e-02

Pos logit contributions
ww+0.067
P+0.065
Now+0.059
Soon+0.056
sc+0.056
Neg logit contributions
 smile-0.115
 cry-0.110
 eyes-0.107
 tell-0.106
 feel-0.106
Token and
Token ID290
Feature activation+1.80e-02

Pos logit contributions
P+0.049
ww+0.047
Now+0.046
bly+0.045
ha+0.043
Neg logit contributions
 smile-0.021
 cry-0.020
 tell-0.020
 bed-0.020
 call-0.019
Token stro
Token ID9660
Feature activation-2.21e-02

Pos logit contributions
 cry+0.025
 bed+0.025
 question+0.025
 tell+0.024
 smile+0.024
Neg logit contributions
P-0.046
ww-0.043
Now-0.041
bly-0.041
lish-0.040
Tokenked
Token ID9091
Feature activation+1.02e-02

Pos logit contributions
P+0.045
ww+0.043
Now+0.041
bly+0.038
ha+0.038
Neg logit contributions
 smile-0.045
 cry-0.044
 tell-0.043
 call-0.042
 crying-0.042
Token She
Token ID1375
Feature activation-6.78e-03

Pos logit contributions
 tell+0.032
 smile+0.031
 cry+0.031
 call+0.030
 bed+0.030
Neg logit contributions
P-0.049
Now-0.047
ww-0.047
Hey-0.043
Spl-0.043
Token lived
Token ID5615
Feature activation-1.81e-02

Pos logit contributions
P+0.017
ww+0.016
Now+0.016
bly+0.016
lish+0.015
Neg logit contributions
 smile-0.010
 cry-0.010
 tell-0.010
 bed-0.010
 call-0.009
Token in
Token ID287
Feature activation+5.80e-03

Pos logit contributions
P+0.040
ww+0.039
Now+0.036
bly+0.036
lish+0.035
Neg logit contributions
 smile-0.033
 tell-0.032
 cry-0.032
 bed-0.032
 call-0.031
Token a
Token ID257
Feature activation+1.33e-02

Pos logit contributions
 smile+0.011
 cry+0.010
 tell+0.010
 call+0.010
 stomach+0.010
Neg logit contributions
P-0.013
ww-0.012
Now-0.012
bly-0.011
lish-0.011
Token cave
Token ID11527
Feature activation-4.78e-03

Pos logit contributions
 smile+0.022
 cry+0.022
 tell+0.022
 call+0.021
 bed+0.020
Neg logit contributions
P-0.033
ww-0.030
Now-0.028
Soon-0.027
lish-0.027
Token underground
Token ID11447
Feature activation-7.54e-03

Pos logit contributions
P+0.011
ww+0.010
Now+0.010
bly+0.010
lish+0.009
Neg logit contributions
 cry-0.008
 smile-0.008
 tell-0.008
 bed-0.008
 call-0.008
Token.
Token ID13
Feature activation+3.97e-02

Pos logit contributions
P+0.020
ww+0.019
Now+0.018
bly+0.018
lish+0.018
Neg logit contributions
 smile-0.011
 cry-0.011
 tell-0.010
 bed-0.010
 call-0.010
Token Every
Token ID3887
Feature activation+9.63e-03

Pos logit contributions
 cry+0.071
 smile+0.071
 tell+0.069
 die+0.066
 call+0.066
Neg logit contributions
Now-0.078
P-0.077
A-0.073
Soon-0.070
Spl-0.069
Token day
Token ID1110
Feature activation-2.00e-02

Pos logit contributions
 cry+0.017
 smile+0.017
 tell+0.017
 call+0.016
 bed+0.016
Neg logit contributions
P-0.022
ww-0.021
Now-0.019
ha-0.019
Spl-0.018
Token she
Token ID673
Feature activation+2.02e-03

Pos logit contributions
P+0.067
Now+0.064
bly+0.064
ww+0.063
lish+0.061
Neg logit contributions
 smile-0.015
 cry-0.014
 bed-0.014
 tell-0.014
 call-0.012
Token would
Token ID561
Feature activation+5.44e-02

Pos logit contributions
 cry+0.004
 smile+0.003
 bed+0.003
 tell+0.003
 call+0.003
Neg logit contributions
P-0.005
ww-0.004
Now-0.004
bly-0.004
lish-0.004
TokenBut
Token ID1537
Feature activation-1.40e-02

Pos logit contributions
ww+0.106
lish+0.089
ha+0.088
w+0.086
bly+0.084
Neg logit contributions
 smile-0.175
 tell-0.174
 find-0.167
 feel-0.165
 question-0.163
Token as
Token ID355
Feature activation-2.25e-02

Pos logit contributions
P+0.034
ww+0.032
Now+0.031
bly+0.030
lish+0.029
Neg logit contributions
 smile-0.023
 cry-0.023
 tell-0.023
 bed-0.022
 call-0.022
Token the
Token ID262
Feature activation-5.18e-03

Pos logit contributions
P+0.057
ww+0.053
Now+0.052
bly+0.052
lish+0.050
Neg logit contributions
 smile-0.035
 tell-0.034
 bed-0.034
 cry-0.034
 call-0.032
Token months
Token ID1933
Feature activation-4.58e-02

Pos logit contributions
P+0.013
ww+0.012
Now+0.011
bly+0.011
lish+0.011
Neg logit contributions
 cry-0.008
 smile-0.008
 tell-0.008
 bed-0.008
 call-0.008
Token passed
Token ID3804
Feature activation-2.89e-02

Pos logit contributions
P+0.108
ww+0.103
lish+0.097
bly+0.096
Now+0.095
Neg logit contributions
 smile-0.079
 tell-0.077
 bed-0.075
 call-0.073
 cry-0.073
Token,
Token ID11
Feature activation-2.43e-02

Pos logit contributions
P+0.089
ww+0.085
Now+0.083
bly+0.083
lish+0.079
Neg logit contributions
 smile-0.030
 cry-0.027
 bed-0.027
 tell-0.026
 stomach-0.025
Token something
Token ID1223
Feature activation-1.57e-02

Pos logit contributions
P+0.062
ww+0.059
Now+0.057
bly+0.056
lish+0.055
Neg logit contributions
 smile-0.038
 tell-0.036
 cry-0.036
 bed-0.036
 stomach-0.034
Token strange
Token ID6283
Feature activation-8.02e-03

Pos logit contributions
P+0.036
ww+0.034
Now+0.033
bly+0.032
lish+0.031
Neg logit contributions
 smile-0.028
 cry-0.028
 tell-0.027
 bed-0.027
 call-0.026
Token started
Token ID2067
Feature activation+6.86e-03

Pos logit contributions
P+0.020
ww+0.019
Now+0.018
bly+0.018
lish+0.018
Neg logit contributions
 smile-0.012
 cry-0.012
 tell-0.012
 bed-0.012
 call-0.011
Token happening
Token ID5836
Feature activation-3.01e-02

Pos logit contributions
 cry+0.009
 smile+0.008
 tell+0.008
 bed+0.008
 call+0.008
Neg logit contributions
P-0.019
ww-0.018
bly-0.017
Now-0.017
lish-0.017
Token.
Token ID13
Feature activation-4.81e-03

Pos logit contributions
P+0.098
ww+0.094
Now+0.091
bly+0.090
lish+0.087
Neg logit contributions
 smile-0.025
 cry-0.024
 bed-0.023
 tell-0.023
 call-0.022
Token.
Token ID13
Feature activation+1.64e-02

Pos logit contributions
 smile+0.021
 cry+0.020
 tell+0.020
 stomach+0.019
 gaze+0.019
Neg logit contributions
P-0.045
ww-0.043
bly-0.042
Now-0.042
sc-0.041
Token
Token ID198
Feature activation+3.20e-02

Pos logit contributions
 smile+0.025
 tell+0.024
 cry+0.024
 call+0.022
 weeping+0.022
Neg logit contributions
P-0.040
Now-0.038
ww-0.038
bly-0.037
Spl-0.037
Token
Token ID198
Feature activation-7.09e-02

Pos logit contributions
 smile+0.052
 cry+0.049
 gaze+0.048
 tell+0.048
 stomach+0.047
Neg logit contributions
P-0.078
Now-0.075
Soon-0.074
ww-0.070
A-0.069
Token"
Token ID1
Feature activation-7.63e-02

Pos logit contributions
iest+0.121
bridge+0.109
ww+0.105
sc+0.105
lish+0.105
Neg logit contributions
 mom-0.153
 cry-0.144
 call-0.143
 bed-0.142
 tell-0.141
TokenHelp
Token ID22087
Feature activation-2.21e-02

Pos logit contributions
ww+0.131
bly+0.124
lish+0.122
ross+0.121
hest+0.121
Neg logit contributions
 cry-0.157
 mom-0.147
 frown-0.144
 scream-0.144
 call-0.144
Token,
Token ID11
Feature activation-2.17e-02

Pos logit contributions
P+0.060
ww+0.057
Now+0.056
bly+0.056
lish+0.053
Neg logit contributions
 smile-0.029
 cry-0.029
 tell-0.029
 bed-0.028
 call-0.028
Token help
Token ID1037
Feature activation-4.29e-03

Pos logit contributions
P+0.073
ww+0.069
Now+0.069
bly+0.067
lish+0.067
Neg logit contributions
 cry-0.016
 tell-0.016
 smile-0.016
 bed-0.016
 call-0.015
Token!"
Token ID2474
Feature activation+2.04e-02

Pos logit contributions
P+0.009
ww+0.008
Now+0.008
bly+0.008
lish+0.008
Neg logit contributions
 smile-0.008
 cry-0.008
 tell-0.008
 bed-0.008
 call-0.008
Token they
Token ID484
Feature activation+3.14e-02

Pos logit contributions
 smile+0.031
 tell+0.030
 question+0.030
 weeping+0.030
 cry+0.030
Neg logit contributions
P-0.049
ww-0.045
Now-0.043
bly-0.042
Spl-0.042
Token cry
Token ID3960
Feature activation+2.36e-02

Pos logit contributions
 weeping+0.026
 stomach+0.026
 smile+0.024
 bed+0.023
 crying+0.022
Neg logit contributions
P-0.093
ww-0.088
bly-0.085
lish-0.084
ha-0.084
Token.
Token ID13
Feature activation+6.07e-03

Pos logit contributions
 tell+0.030
 question+0.030
 smile+0.030
 gaze+0.029
 bed+0.029
Neg logit contributions
P-0.059
lish-0.059
bly-0.057
ww-0.056
Spl-0.053
Token,
Token ID11
Feature activation-2.12e-02

Pos logit contributions
 cry+0.047
 tell+0.045
 gaze+0.044
 call+0.044
 smile+0.043
Neg logit contributions
lish-0.066
ww-0.065
P-0.065
Now-0.062
ha-0.061
Token so
Token ID523
Feature activation-9.68e-03

Pos logit contributions
P+0.055
ww+0.052
bly+0.050
Now+0.050
lish+0.048
Neg logit contributions
 smile-0.032
 cry-0.031
 tell-0.031
 bed-0.030
 call-0.030
Token he
Token ID339
Feature activation-3.10e-02

Pos logit contributions
P+0.024
ww+0.022
Now+0.021
bly+0.021
lish+0.021
Neg logit contributions
 smile-0.016
 cry-0.016
 tell-0.015
 bed-0.015
 call-0.015
Token asked
Token ID1965
Feature activation+1.12e-02

Pos logit contributions
P+0.083
ww+0.081
Now+0.079
bly+0.079
lish+0.076
Neg logit contributions
 smile-0.042
 tell-0.041
 call-0.040
 cry-0.039
 bed-0.037
Token,
Token ID11
Feature activation+6.04e-03

Pos logit contributions
 cry+0.017
 smile+0.016
 bed+0.015
 tell+0.015
 weep+0.015
Neg logit contributions
P-0.028
ww-0.026
bly-0.025
Now-0.025
lish-0.025
Token "
Token ID366
Feature activation-7.08e-02

Pos logit contributions
 cry+0.008
 smile+0.008
 bed+0.008
 tell+0.008
 stomach+0.008
Neg logit contributions
P-0.016
ww-0.015
Now-0.014
bly-0.014
lish-0.014
TokenWhat
Token ID2061
Feature activation+6.80e-03

Pos logit contributions
ww+0.139
bly+0.120
dd+0.115
br+0.113
ha+0.113
Neg logit contributions
 question-0.146
 tell-0.144
 call-0.143
 search-0.142
 gaze-0.140
Token is
Token ID318
Feature activation-1.99e-02

Pos logit contributions
 smile+0.011
 cry+0.011
 bed+0.011
 tell+0.010
 call+0.010
Neg logit contributions
P-0.016
ww-0.016
Now-0.014
bly-0.014
ha-0.014
Token it
Token ID340
Feature activation+1.93e-02

Pos logit contributions
P+0.051
ww+0.047
Now+0.047
bly+0.045
lish+0.045
Neg logit contributions
 bed-0.030
 tell-0.030
 smile-0.030
 cry-0.029
 call-0.029
Token?"
Token ID1701
Feature activation+1.85e-02

Pos logit contributions
 cry+0.036
 smile+0.035
 weep+0.033
 bed+0.033
 gaze+0.033
Neg logit contributions
P-0.041
ww-0.039
bly-0.037
Now-0.036
lish-0.036
Token The
Token ID383
Feature activation+7.82e-03

Pos logit contributions
 cry+0.025
 smile+0.024
 tell+0.023
 bed+0.023
 weeping+0.022
Neg logit contributions
P-0.052
Now-0.047
ww-0.046
Soon-0.044
bly-0.044
Token couldn
Token ID3521
Feature activation-2.80e-02

Pos logit contributions
 bed+0.048
 cry+0.047
 stomach+0.046
 call+0.045
 smile+0.045
Neg logit contributions
bly-0.041
P-0.040
lish-0.040
ww-0.039
Now-0.038
Token't
Token ID470
Feature activation+3.29e-02

Pos logit contributions
P+0.068
ww+0.063
Now+0.061
bly+0.060
lish+0.060
Neg logit contributions
 cry-0.047
 smile-0.046
 tell-0.045
 bed-0.045
 call-0.044
Token decide
Token ID5409
Feature activation-1.51e-03

Pos logit contributions
 bed+0.041
 eyes+0.040
 stomach+0.040
 weeping+0.039
 throat+0.038
Neg logit contributions
P-0.082
bly-0.078
Now-0.077
Soon-0.074
ww-0.073
Token who
Token ID508
Feature activation-8.10e-03

Pos logit contributions
P+0.004
ww+0.004
Now+0.004
bly+0.004
lish+0.003
Neg logit contributions
 cry-0.002
 smile-0.002
 bed-0.002
 tell-0.002
 call-0.002
Token was
Token ID373
Feature activation+4.02e-03

Pos logit contributions
P+0.020
ww+0.019
Now+0.018
bly+0.018
lish+0.017
Neg logit contributions
 smile-0.013
 cry-0.013
 tell-0.013
 bed-0.012
 call-0.012
Token right
Token ID826
Feature activation-2.91e-02

Pos logit contributions
 cry+0.006
 smile+0.006
 bed+0.006
 tell+0.006
 call+0.006
Neg logit contributions
P-0.010
ww-0.009
bly-0.009
Now-0.009
lish-0.009
Token and
Token ID290
Feature activation+1.50e-02

Pos logit contributions
P+0.097
ww+0.094
Now+0.090
bly+0.089
lish+0.086
Neg logit contributions
 smile-0.022
 tell-0.021
 cry-0.020
 bed-0.020
 call-0.019
Token who
Token ID508
Feature activation-2.57e-03

Pos logit contributions
 bed+0.024
 cry+0.023
 smile+0.023
 stomach+0.023
 tell+0.023
Neg logit contributions
P-0.034
ww-0.032
bly-0.032
Now-0.032
lish-0.030
Token was
Token ID373
Feature activation-6.85e-03

Pos logit contributions
P+0.006
ww+0.006
Now+0.006
bly+0.006
lish+0.005
Neg logit contributions
 smile-0.004
 cry-0.004
 bed-0.004
 tell-0.004
 call-0.004
Token wrong
Token ID2642
Feature activation-1.24e-02

Pos logit contributions
P+0.017
ww+0.016
Now+0.016
bly+0.015
lish+0.015
Neg logit contributions
 smile-0.011
 cry-0.011
 tell-0.010
 bed-0.010
 call-0.010
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
P+0.039
ww+0.038
Now+0.037
bly+0.036
lish+0.036
Neg logit contributions
 smile-0.011
 cry-0.011
 tell-0.010
 bed-0.010
 call-0.010
Token come
Token ID1282
Feature activation-1.76e-02

Pos logit contributions
P+0.049
ww+0.046
Now+0.045
bly+0.044
lish+0.044
Neg logit contributions
 smile-0.028
 tell-0.028
 cry-0.028
 bed-0.027
 call-0.027
Token inside
Token ID2641
Feature activation-1.32e-02

Pos logit contributions
P+0.043
ww+0.040
Now+0.039
bly+0.038
lish+0.038
Neg logit contributions
 smile-0.029
 tell-0.028
 cry-0.028
 bed-0.028
 call-0.028
Token now
Token ID783
Feature activation-3.10e-02

Pos logit contributions
P+0.036
ww+0.034
Now+0.033
bly+0.033
lish+0.032
Neg logit contributions
 smile-0.017
 cry-0.017
 tell-0.017
 bed-0.017
 call-0.016
Token,
Token ID11
Feature activation-2.81e-02

Pos logit contributions
P+0.080
ww+0.073
Now+0.072
bly+0.071
lish+0.069
Neg logit contributions
 smile-0.048
 tell-0.047
 bed-0.047
 cry-0.047
 call-0.045
Token it
Token ID340
Feature activation-2.16e-02

Pos logit contributions
P+0.070
ww+0.064
Now+0.062
bly+0.060
lish+0.060
Neg logit contributions
 smile-0.048
 tell-0.047
 cry-0.047
 bed-0.047
 call-0.046
Token's
Token ID338
Feature activation-1.81e-02

Pos logit contributions
P+0.058
ww+0.054
Now+0.053
bly+0.053
lish+0.051
Neg logit contributions
 cry-0.030
 smile-0.030
 tell-0.030
 call-0.029
 bed-0.028
Token time
Token ID640
Feature activation+2.82e-03

Pos logit contributions
P+0.046
ww+0.043
Now+0.042
bly+0.042
lish+0.041
Neg logit contributions
 smile-0.028
 bed-0.027
 cry-0.027
 tell-0.027
 call-0.026
Token for
Token ID329
Feature activation+9.68e-03

Pos logit contributions
 cry+0.003
 smile+0.003
 tell+0.003
 bed+0.003
 call+0.003
Neg logit contributions
P-0.008
ww-0.007
Now-0.007
bly-0.007
lish-0.007
Token dinner
Token ID8073
Feature activation-2.51e-02

Pos logit contributions
 smile+0.013
 cry+0.013
 tell+0.012
 gaze+0.012
 call+0.012
Neg logit contributions
P-0.026
ww-0.024
Now-0.023
bly-0.023
lish-0.022
Token!"
Token ID2474
Feature activation+3.98e-02

Pos logit contributions
P+0.060
ww+0.056
Now+0.055
bly+0.054
lish+0.053
Neg logit contributions
 smile-0.042
 bed-0.042
 tell-0.041
 cry-0.041
 call-0.040
Token Annie
Token ID25860
Feature activation+1.87e-02

Pos logit contributions
 smile+0.058
 cry+0.057
 stomach+0.056
 weeping+0.054
 tell+0.054
Neg logit contributions
Now-0.091
P-0.089
Soon-0.083
ww-0.082
bly-0.081
Token much
Token ID881
Feature activation+1.23e-03

Pos logit contributions
 cry+0.034
 stomach+0.033
 bed+0.033
 crying+0.032
 weeping+0.032
Neg logit contributions
P-0.060
Now-0.059
ww-0.058
bly-0.056
Soon-0.056
Token more
Token ID517
Feature activation-9.26e-03

Pos logit contributions
 cry+0.002
 tell+0.002
 smile+0.002
 bed+0.002
 call+0.002
Neg logit contributions
P-0.003
ww-0.002
Now-0.002
bly-0.002
lish-0.002
Token.
Token ID13
Feature activation+1.50e-02

Pos logit contributions
P+0.021
ww+0.019
Now+0.019
bly+0.019
lish+0.018
Neg logit contributions
 smile-0.017
 cry-0.017
 tell-0.016
 bed-0.016
 call-0.016
Token
Token ID220
Feature activation+2.18e-02

Pos logit contributions
 cry+0.018
 smile+0.017
 tell+0.017
 crying+0.016
 bed+0.016
Neg logit contributions
P-0.043
Now-0.040
ww-0.040
sc-0.038
bly-0.038
Token
Token ID198
Feature activation-3.73e-02

Pos logit contributions
 smile+0.029
 cry+0.029
 tell+0.027
 bed+0.027
 call+0.026
Neg logit contributions
P-0.059
Now-0.056
ww-0.054
bly-0.054
Soon-0.053
Token
Token ID198
Feature activation-9.88e-02

Pos logit contributions
ww+0.122
P+0.121
lish+0.118
bly+0.117
w+0.111
Neg logit contributions
 smile-0.025
 tell-0.023
 cry-0.022
 bed-0.021
 call-0.020
TokenThe
Token ID464
Feature activation-2.47e-02

Pos logit contributions
ww+0.176
iest+0.171
lish+0.167
w+0.163
bly+0.155
Neg logit contributions
 find-0.194
 tell-0.184
 smile-0.183
 -0.183
 feel-0.180
Token dragon
Token ID10441
Feature activation+2.97e-03

Pos logit contributions
P+0.067
ww+0.064
bly+0.062
Now+0.062
lish+0.059
Neg logit contributions
 smile-0.034
 tell-0.033
 cry-0.033
 bed-0.032
 call-0.032
Token and
Token ID290
Feature activation-3.09e-03

Pos logit contributions
 cry+0.004
 smile+0.004
 bed+0.004
 tell+0.004
 call+0.004
Neg logit contributions
P-0.007
ww-0.007
Now-0.007
bly-0.007
lish-0.007
Token Princess
Token ID8449
Feature activation-1.68e-02

Pos logit contributions
P+0.008
ww+0.007
Now+0.007
bly+0.007
lish+0.007
Neg logit contributions
 cry-0.005
 smile-0.005
 tell-0.005
 bed-0.004
 call-0.004
Token stayed
Token ID9658
Feature activation-1.92e-02

Pos logit contributions
P+0.045
ww+0.042
Now+0.041
bly+0.041
lish+0.040
Neg logit contributions
 smile-0.023
 cry-0.023
 tell-0.023
 bed-0.022
 call-0.022
Tokenmy
Token ID1820
Feature activation+4.66e-02

Pos logit contributions
 smile+0.005
 cry+0.005
 bed+0.005
 tell+0.005
 stomach+0.004
Neg logit contributions
P-0.008
ww-0.007
Now-0.007
lish-0.007
bly-0.007
Token's
Token ID338
Feature activation+4.02e-02

Pos logit contributions
 bed+0.069
 question+0.065
 cry+0.065
 stomach+0.064
 gaze+0.063
Neg logit contributions
P-0.108
ww-0.101
bly-0.099
Now-0.099
lish-0.096
Token mom
Token ID1995
Feature activation+3.43e-02

Pos logit contributions
 cry+0.044
 smile+0.044
 tell+0.044
 gaze+0.044
 stare+0.042
Neg logit contributions
P-0.104
ww-0.101
bly-0.097
Now-0.097
lish-0.091
Token said
Token ID531
Feature activation-7.69e-03

Pos logit contributions
 bed+0.059
 cry+0.056
 stomach+0.055
 gaze+0.054
 smile+0.054
Neg logit contributions
P-0.074
Now-0.070
ww-0.069
bly-0.068
lish-0.066
Token,
Token ID11
Feature activation-2.32e-02

Pos logit contributions
P+0.020
ww+0.019
Now+0.018
bly+0.018
lish+0.018
Neg logit contributions
 smile-0.011
 cry-0.011
 tell-0.011
 bed-0.010
 call-0.010
Token "
Token ID366
Feature activation-9.92e-02

Pos logit contributions
P+0.071
ww+0.066
Now+0.064
bly+0.064
lish+0.063
Neg logit contributions
 smile-0.026
 tell-0.025
 cry-0.025
 bed-0.024
 call-0.024
TokenLet
Token ID5756
Feature activation-3.36e-02

Pos logit contributions
lish+0.145
ww+0.144
bly+0.142
iest+0.141
br+0.138
Neg logit contributions
 tell-0.190
 gaze-0.188
 call-0.187
 crying-0.187
 visit-0.186
Token's
Token ID338
Feature activation+2.63e-02

Pos logit contributions
P+0.099
bly+0.096
Now+0.094
ww+0.093
lish+0.092
Neg logit contributions
 cry-0.040
 tell-0.038
 smile-0.038
 call-0.036
 bed-0.034
Token go
Token ID467
Feature activation-9.40e-03

Pos logit contributions
 weeping+0.029
 smile+0.029
 crying+0.028
 cry+0.028
 stomach+0.027
Neg logit contributions
P-0.073
bly-0.068
ww-0.068
Now-0.066
Soon-0.062
Token to
Token ID284
Feature activation+2.62e-02

Pos logit contributions
P+0.027
ww+0.025
Now+0.025
bly+0.024
lish+0.024
Neg logit contributions
 smile-0.011
 cry-0.011
 tell-0.011
 bed-0.011
 call-0.010
Token the
Token ID262
Feature activation-1.84e-02

Pos logit contributions
 smile+0.043
 stomach+0.041
 cry+0.041
 weeping+0.039
 question+0.039
Neg logit contributions
P-0.059
ww-0.055
Now-0.054
bly-0.052
ha-0.052
Token?"
Token ID1701
Feature activation-1.81e-02

Pos logit contributions
P+0.009
ww+0.008
Now+0.008
bly+0.008
lish+0.008
Neg logit contributions
 smile-0.006
 cry-0.006
 tell-0.006
 bed-0.006
 call-0.006
Token Lily
Token ID20037
Feature activation+3.45e-02

Pos logit contributions
P+0.053
ww+0.050
bly+0.049
Now+0.049
lish+0.048
Neg logit contributions
 smile-0.021
 cry-0.021
 tell-0.020
 bed-0.020
 call-0.019
Token asks
Token ID7893
Feature activation+1.77e-02

Pos logit contributions
 stomach+0.043
 smile+0.042
 bed+0.041
 tum+0.038
 cry+0.038
Neg logit contributions
P-0.094
Now-0.085
ww-0.083
Hey-0.081
CR-0.080
Token.
Token ID13
Feature activation+2.12e-02

Pos logit contributions
 smile+0.022
 tell+0.022
 cry+0.022
 bed+0.021
 stomach+0.021
Neg logit contributions
P-0.048
lish-0.044
bly-0.043
CR-0.042
ww-0.042
Token
Token ID198
Feature activation+2.50e-03

Pos logit contributions
 smile+0.030
 cry+0.029
 tell+0.029
 bed+0.029
 stomach+0.028
Neg logit contributions
P-0.056
Now-0.052
ww-0.050
Soon-0.050
Spl-0.048
Token
Token ID198
Feature activation-7.95e-02

Pos logit contributions
 smile+0.003
 cry+0.003
 tell+0.003
 bed+0.003
 stomach+0.003
Neg logit contributions
P-0.007
ww-0.007
Now-0.007
bly-0.007
Soon-0.006
Token"
Token ID1
Feature activation-7.17e-02

Pos logit contributions
iest+0.157
bly+0.153
lish+0.146
sc+0.141
ww+0.141
Neg logit contributions
 tell-0.140
 find-0.139
 play-0.138
 mom-0.138
 call-0.138
TokenIt
Token ID1026
Feature activation+6.42e-04

Pos logit contributions
lish+0.103
ross+0.101
ww+0.096
ier+0.093
bly+0.092
Neg logit contributions
 cry-0.161
 gaze-0.157
 call-0.155
 question-0.155
 stare-0.154
Token is
Token ID318
Feature activation-3.05e-04

Pos logit contributions
 smile+0.001
 cry+0.001
 bed+0.001
 tell+0.001
 stomach+0.001
Neg logit contributions
P-0.002
ww-0.001
Now-0.001
bly-0.001
lish-0.001
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
P+0.001
ww+0.001
Now+0.001
bly+0.001
lish+0.001
Neg logit contributions
 cry-0.001
 smile-0.001
 tell-0.001
 bed-0.000
 call-0.000
Token tail
Token ID7894
Feature activation-1.71e-02

Pos logit contributions
 cry+0.005
 smile+0.005
 tell+0.005
 bed+0.005
 call+0.005
Neg logit contributions
P-0.009
ww-0.008
Now-0.008
bly-0.008
lish-0.007
Token of
Token ID286
Feature activation+2.14e-02

Pos logit contributions
P+0.010
ww+0.009
Now+0.009
bly+0.009
lish+0.009
Neg logit contributions
 cry-0.009
 smile-0.009
 tell-0.008
 bed-0.008
 call-0.008
Token her
Token ID607
Feature activation+4.26e-02

Pos logit contributions
 smile+0.040
 gaze+0.040
 stomach+0.039
 call+0.039
 cry+0.039
Neg logit contributions
Now-0.044
P-0.043
bly-0.042
Soon-0.039
lish-0.038
Token work
Token ID670
Feature activation+2.77e-03

Pos logit contributions
 stare+0.051
 gaze+0.051
 cry+0.051
 tell+0.051
 weep+0.050
Neg logit contributions
bly-0.096
P-0.095
lish-0.095
ww-0.092
Now-0.090
Token.
Token ID13
Feature activation+3.44e-02

Pos logit contributions
 cry+0.003
 tell+0.003
 smile+0.003
 bed+0.003
 call+0.003
Neg logit contributions
P-0.008
ww-0.008
Now-0.007
bly-0.007
lish-0.007
Token
Token ID198
Feature activation+5.15e-02

Pos logit contributions
 smile+0.059
 cry+0.058
 tell+0.055
 call+0.055
 gaze+0.054
Neg logit contributions
P-0.081
Now-0.079
ww-0.073
Soon-0.072
O-0.068
Token
Token ID198
Feature activation-8.01e-02

Pos logit contributions
 smile+0.088
 stomach+0.087
 throat+0.086
 cry+0.086
 eyes+0.085
Neg logit contributions
P-0.122
Now-0.120
Soon-0.115
ww-0.106
A-0.105
TokenL
Token ID43
Feature activation-4.50e-02

Pos logit contributions
iest+0.132
ww+0.126
lish+0.122
bly+0.121
ier+0.121
Neg logit contributions
 tell-0.169
 cry-0.162
 bed-0.162
 find-0.161
 play-0.159
Tokenily
Token ID813
Feature activation-7.97e-03

Pos logit contributions
P+0.084
ww+0.082
bly+0.078
Now+0.073
lish+0.072
Neg logit contributions
 tell-0.092
 cry-0.091
 bed-0.090
 smile-0.089
 call-0.087
Token saw
Token ID2497
Feature activation+2.27e-02

Pos logit contributions
P+0.021
ww+0.020
Now+0.019
bly+0.019
lish+0.018
Neg logit contributions
 smile-0.012
 cry-0.011
 tell-0.011
 bed-0.011
 call-0.011
Token that
Token ID326
Feature activation+9.84e-03

Pos logit contributions
 smile+0.042
 gaze+0.042
 search+0.041
 cry+0.041
 question+0.040
Neg logit contributions
P-0.046
bly-0.043
Now-0.043
Soon-0.041
ww-0.040
Token there
Token ID612
Feature activation+3.01e-02

Pos logit contributions
 cry+0.014
 smile+0.014
 call+0.014
 gaze+0.014
 tell+0.014
Neg logit contributions
P-0.024
Now-0.022
ww-0.022
bly-0.022
lish-0.021
Token strong
Token ID1913
Feature activation-2.53e-02

Pos logit contributions
 cry+0.003
 smile+0.003
 bed+0.003
 tell+0.003
 call+0.002
Neg logit contributions
P-0.005
ww-0.005
Now-0.004
bly-0.004
lish-0.004
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
P+0.080
ww+0.075
Now+0.073
bly+0.072
lish+0.070
Neg logit contributions
 smile-0.024
 cry-0.023
 tell-0.023
 bed-0.022
 call-0.021
Tokenâ
Token ID22940
Feature activation-4.89e-02

Pos logit contributions
 smile+0.015
 cry+0.015
 tell+0.014
 bed+0.014
 call+0.014
Neg logit contributions
P-0.035
ww-0.034
Now-0.033
bly-0.031
Soon-0.031
Token
Token ID26391
Feature activation+1.02e-02

Pos logit contributions
P+0.095
ww+0.087
bly+0.083
lish+0.079
br+0.079
Neg logit contributions
 tell-0.099
 cry-0.099
 smile-0.097
 call-0.096
 say-0.093
Token
Token ID198
Feature activation-1.67e-02

Pos logit contributions
 smile+0.013
 cry+0.013
 tell+0.013
 bed+0.013
 call+0.013
Neg logit contributions
P-0.028
ww-0.027
Now-0.026
bly-0.025
ha-0.024
Token
Token ID198
Feature activation-9.85e-02

Pos logit contributions
P+0.052
ww+0.050
bly+0.049
Now+0.048
lish+0.048
Neg logit contributions
 smile-0.015
 cry-0.015
 tell-0.014
 bed-0.014
 call-0.013
TokenG
Token ID38
Feature activation-4.78e-02

Pos logit contributions
lish+0.150
led+0.143
bly+0.138
ka+0.134
ww+0.134
Neg logit contributions
 tell-0.204
 know-0.199
 -0.196
 call-0.192
 search-0.187
Tokenian
Token ID666
Feature activation-3.16e-02

Pos logit contributions
P+0.091
ww+0.086
bly+0.084
Now+0.081
lish+0.080
Neg logit contributions
 smile-0.094
 tell-0.092
 cry-0.090
 bed-0.089
 gaze-0.089
Tokenna
Token ID2616
Feature activation-2.17e-03

Pos logit contributions
P+0.080
ww+0.076
bly+0.072
Now+0.072
lish+0.070
Neg logit contributions
 smile-0.049
 tell-0.048
 cry-0.048
 call-0.046
 bed-0.045
Token thought
Token ID1807
Feature activation-1.38e-03

Pos logit contributions
P+0.005
ww+0.005
Now+0.005
bly+0.005
lish+0.005
Neg logit contributions
 smile-0.003
 cry-0.003
 bed-0.003
 tell-0.003
 call-0.003
Token this
Token ID428
Feature activation-1.26e-02

Pos logit contributions
P+0.003
ww+0.003
Now+0.003
bly+0.003
lish+0.003
Neg logit contributions
 smile-0.002
 cry-0.002
 tell-0.002
 bed-0.002
 call-0.002