TOP ACTIVATIONS
MAX = 9.065

Hi , I am Max . Yes , I want to
" Hello , children . Do you like the famous
Hi , I 'm Tom . Do you like my k
, this is my photo . This is my family .
" Hello , little ones . Do you want to see
, " Hello , children . What a nice ball you
" Hello , class . Today I will teach you
" Hello , kids . Don 't be afraid .
. " Hi , Tom . I cannot play with you
Hello , I 'm Anna . This is my dog ,
" Hello , kids . Do you want your ball
Hi , I 'm Lily . What 's your name ?"
" Hello , little ones . Welcome to my magic shop
" Come on , Mia . The slide is fun .
, " Hey , you . What are you doing here
Hi , I 'm Tom . Can I play with you
" Hi , Sam . What are you doing ?"
It says : " No ! You are bad ! You
, that 's not nice . That 's our ball .
Hello , I am Max . What is your name ?"

MIN ACTIVATIONS
MIN = -11.684

as he opened the ch ocol ates and started eating them
he ate all the ch ocol ates and sweets . Then
and started eating the ch ocol ates . The shop keeper
bled up all the ch ocol ates , and she was
giving him too many ch ocol ates ." The king learned
and laughed and ate ch ocol ates . They had a
and they shared the ch ocol ates with everyone in the
lived happily ever after . The moral of the
son is eating our ch ocol ates !"
't have all these ch ocol ates ! Now , go
never felt guilty again . The moral of the
Lucy was so happy ! The next day ,
had been resolved . They both left the
his hard work . And they lived happily
apped and cheered . The k itty had
took some lovely bows and rib bons and used them to
lived happily ever after . The moral of the
the bear again . The boy also remembers
not eat too many ch ocol ates and his friend learned
, feeling important . M ia loved being

INTERVAL 3.878 - 9.065
CONTAINS 5.137%

you out of the hole . Just stay calm and settle
" What 's wrong , sweet ie ?" Mom
angry and said ' Why did you say no ?' The
came to see what was wrong . She saw Max on
They were too far away . They did not see him

INTERVAL -1.309 - 3.878
CONTAINS 67.480%

is to care for something , so he said that she
He was 3 years old and loved going outside in the
and started to seem less miserable . Suddenly , she
Once upon a time , there was a child who loved
run away . Now he could eat one of them .

INTERVAL -6.496 - -1.309
CONTAINS 25.688%

not know . They take the book and say thank you
led as they ran around the beach . They lay down
cat was not available after all . He thought he had
it felt better . She was so proud of
, they hugged each other to make themselves feel better .

INTERVAL -11.684 - -6.496
CONTAINS 1.695%

arms above the post . " What are you
and pet ted Spot . " That 's okay
was already walking away . Mom my ran back
the tigers were safe . As Bob walked through
little fish was safe . The little fish was
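
The four INTERVAL bands above appear to be equal-width bins spanning MIN to MAX. The sketch below is a minimal check of that reading, not the dashboard's own code. The bin count of four and the helper name `equal_width_edges` are assumptions made for illustration. It recomputes the boundaries from the reported MIN and MAX, compares them with the listed ones, and sums the CONTAINS shares.

```python
# Sketch only: assumes the dashboard splits [MIN, MAX] into equal-width bins.
def equal_width_edges(lo: float, hi: float, n_bins: int) -> list[float]:
    """Return the n_bins + 1 boundaries of equal-width bins over [lo, hi]."""
    step = (hi - lo) / n_bins
    return [lo + i * step for i in range(n_bins + 1)]

MIN_ACT, MAX_ACT = -11.684, 9.065  # values reported above
edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)  # 4 bins is an assumption

# Boundaries listed in the INTERVAL headers, lowest to highest.
reported = [-11.684, -6.496, -1.309, 3.878, 9.065]
max_gap = max(abs(e - r) for e, r in zip(edges, reported))

# Shares from the CONTAINS lines, lowest bin to highest.
shares = [1.695, 25.688, 67.480, 5.137]
total_share = sum(shares)

print([round(e, 3) for e in edges], round(max_gap, 4), round(total_share, 3))
```

Any small gap between computed and listed boundaries is consistent with MIN and MAX having been rounded to three decimals before display.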
TokenHi
Token ID17250
Feature activation+4.52e+00

Pos logit contributions
aches+0.001
cking+0.001
cks+0.001
ck+0.001
els+0.001
Neg logit contributions
 Can-0.001
 Please-0.001
 sorry-0.001
 You-0.001
 Come-0.001
Token,
Token ID11
Feature activation+5.52e+00

Pos logit contributions
 Please+0.080
 Can+0.078
 sorry+0.067
 guilty+0.065
 You+0.064
Neg logit contributions
aches-0.119
els-0.103
cks-0.100
ries-0.100
ck-0.099
Token I
Token ID314
Feature activation+1.83e+00

Pos logit contributions
 Please+0.100
 Can+0.097
 sorry+0.083
 guilty+0.082
 You+0.080
Neg logit contributions
aches-0.144
ck-0.119
els-0.117
cking-0.117
cks-0.117
Token am
Token ID716
Feature activation+3.59e+00

Pos logit contributions
 Can+0.040
 Please+0.039
 sorry+0.034
 You+0.034
 Come+0.033
Neg logit contributions
aches-0.042
ck-0.035
cks-0.034
cking-0.034
ric-0.033
Token Max
Token ID5436
Feature activation+3.97e+00

Pos logit contributions
 Please+0.073
 Can+0.072
 Come+0.065
 Give+0.061
 guy+0.060
Neg logit contributions
aches-0.075
cks-0.067
cking-0.061
ric-0.060
ck-0.060
Token.
Token ID13
Feature activation+9.06e+00

Pos logit contributions
 Please+0.102
 Can+0.102
 Come+0.093
 guilty+0.091
 unfair+0.090
Neg logit contributions
aches-0.057
cks-0.048
els-0.046
ck-0.043
ries-0.043
Token Yes
Token ID3363
Feature activation+4.45e+00

Pos logit contributions
 guilty+0.109
 unfair+0.103
 impossible+0.099
 wrong+0.098
 foolish+0.097
Neg logit contributions
aches-0.202
ric-0.201
of-0.198
cks-0.196
els-0.182
Token,
Token ID11
Feature activation+4.20e+00

Pos logit contributions
 Can+0.089
 Please+0.088
 guilty+0.082
 unfair+0.080
 War+0.079
Neg logit contributions
aches-0.088
ries-0.077
els-0.075
cks-0.075
ck-0.069
Token I
Token ID314
Feature activation+1.73e+00

Pos logit contributions
 Please+0.046
 Can+0.044
 guilty+0.043
 sorry+0.039
 unfair+0.036
Neg logit contributions
aches-0.128
ck-0.109
of-0.102
els-0.101
cking-0.100
Token want
Token ID765
Feature activation+2.49e+00

Pos logit contributions
 Please+0.032
 Can+0.032
 sorry+0.028
 You+0.027
 Come+0.026
Neg logit contributions
aches-0.044
ck-0.036
cks-0.034
ric-0.033
cking-0.033
Token to
Token ID284
Feature activation-1.25e+00

Pos logit contributions
 Please+0.045
 Can+0.044
 sorry+0.044
 guilty+0.042
 You+0.040
Neg logit contributions
aches-0.056
ries-0.046
els-0.045
ck-0.044
of-0.043
Token\n
Token ID198
Feature activation-5.75e+00

Pos logit contributions
aches+0.212
cking+0.188
els+0.183
ries+0.181
cks+0.179
Neg logit contributions
 Can-0.109
 Please-0.105
 You-0.093
 Come-0.089
 sorry-0.088
Token"
Token ID1
Feature activation-1.38e+00

Pos logit contributions
aches+0.138
cking+0.116
cks+0.113
els+0.113
ck+0.110
Neg logit contributions
 Can-0.127
 Please-0.125
 You-0.111
 sorry-0.107
 Come-0.107
TokenHello
Token ID15496
Feature activation+4.28e+00

Pos logit contributions
aches+0.043
cking+0.037
cks+0.037
els+0.036
ric+0.036
Neg logit contributions
 Can-0.023
 Please-0.022
 You-0.018
 Come-0.018
 I-0.016
Token,
Token ID11
Feature activation+5.44e+00

Pos logit contributions
 Please+0.072
 Can+0.070
 You+0.061
 sorry+0.060
 Come+0.060
Neg logit contributions
aches-0.114
ries-0.100
els-0.098
cks-0.095
ck-0.094
Token children
Token ID1751
Feature activation+1.89e+00

Pos logit contributions
 Please+0.090
 Can+0.088
 Come+0.076
 Just+0.075
 You+0.073
Neg logit contributions
aches-0.147
ries-0.122
els-0.122
ck-0.122
cking-0.121
Token.
Token ID13
Feature activation+9.05e+00

Pos logit contributions
 Can+0.038
 Please+0.037
 Come+0.032
 sorry+0.032
 You+0.031
Neg logit contributions
aches-0.047
cks-0.040
cking-0.039
ck-0.039
els-0.039
Token Do
Token ID2141
Feature activation+3.19e+00

Pos logit contributions
 guilty+0.107
 impossible+0.103
 unfair+0.101
 ok+0.099
 stupid+0.093
Neg logit contributions
aches-0.217
ric-0.216
cks-0.209
els-0.199
ries-0.197
Token you
Token ID345
Feature activation+1.71e+00

Pos logit contributions
 Can+0.045
 Please+0.043
 You+0.035
 sorry+0.035
 Come+0.034
Neg logit contributions
aches-0.100
cking-0.087
cks-0.086
ck-0.085
els-0.084
Token like
Token ID588
Feature activation+2.45e+00

Pos logit contributions
 Can+0.027
 Please+0.026
 You+0.021
 Come+0.021
 sorry+0.020
Neg logit contributions
aches-0.051
ck-0.045
cks-0.041
cking-0.041
els-0.039
Token the
Token ID262
Feature activation+4.54e-01

Pos logit contributions
 Can+0.046
 Please+0.045
 You+0.039
 sorry+0.039
 Come+0.038
Neg logit contributions
aches-0.065
cking-0.055
cks-0.055
els-0.054
ck-0.054
Token famous
Token ID5863
Feature activation+2.60e+00

Pos logit contributions
 Please+0.009
 Can+0.009
 Come+0.008
 You+0.008
 Just+0.008
Neg logit contributions
aches-0.011
cks-0.009
cking-0.009
els-0.009
ries-0.008
TokenHi
Token ID17250
Feature activation+5.19e+00

Pos logit contributions
aches+0.015
cking+0.013
els+0.013
cks+0.013
ries+0.013
Neg logit contributions
 Please-0.007
 Can-0.007
 You-0.006
 Come-0.005
 I-0.005
Token,
Token ID11
Feature activation+6.01e+00

Pos logit contributions
 Please+0.082
 Can+0.081
 sorry+0.069
 You+0.069
 guilty+0.068
Neg logit contributions
aches-0.145
ries-0.124
els-0.123
cks-0.123
cking-0.123
Token I
Token ID314
Feature activation+2.26e+00

Pos logit contributions
 Please+0.065
 Can+0.063
 Come+0.049
 guilty+0.046
 Just+0.046
Neg logit contributions
aches-0.196
cking-0.170
ck-0.170
cks-0.169
ries-0.167
Token'm
Token ID1101
Feature activation+4.31e+00

Pos logit contributions
 Can+0.042
 Please+0.042
 You+0.036
 sorry+0.036
 Come+0.035
Neg logit contributions
aches-0.061
cking-0.052
cks-0.051
ck-0.051
els-0.050
Token Tom
Token ID4186
Feature activation+2.78e+00

Pos logit contributions
 Can+0.076
 Please+0.074
 Come+0.063
 guilty+0.059
 Give+0.058
Neg logit contributions
aches-0.108
cks-0.099
cking-0.095
ck-0.093
ric-0.091
Token.
Token ID13
Feature activation+9.03e+00

Pos logit contributions
 Can+0.070
 Please+0.068
 sorry+0.061
 Come+0.061
 Yes+0.061
Neg logit contributions
aches-0.048
cks-0.041
els-0.039
ck-0.039
cking-0.039
Token Do
Token ID2141
Feature activation+2.96e+00

Pos logit contributions
 guilty+0.090
 impossible+0.087
 unfair+0.083
 ok+0.080
 foolish+0.079
Neg logit contributions
aches-0.245
cks-0.239
ric-0.233
els-0.224
ries-0.220
Token you
Token ID345
Feature activation+1.86e+00

Pos logit contributions
 Can+0.040
 Please+0.039
 You+0.031
 sorry+0.031
 Come+0.030
Neg logit contributions
aches-0.094
cking-0.082
cks-0.082
ck-0.081
els-0.080
Token like
Token ID588
Feature activation+2.38e+00

Pos logit contributions
 Can+0.027
 Please+0.026
 You+0.021
 sorry+0.020
 Come+0.020
Neg logit contributions
aches-0.058
ck-0.051
cks-0.048
cking-0.048
ric-0.047
Token my
Token ID616
Feature activation+2.15e+00

Pos logit contributions
 Can+0.042
 Please+0.040
 You+0.035
 I+0.034
 Come+0.032
Neg logit contributions
aches-0.069
cking-0.061
cks-0.058
ck-0.058
els-0.057
Token k
Token ID479
Feature activation+7.54e-01

Pos logit contributions
 Can+0.051
 Please+0.050
 You+0.045
 Come+0.044
 sorry+0.044
Neg logit contributions
aches-0.047
cking-0.038
cks-0.037
els-0.037
ck-0.037
Token,
Token ID11
Feature activation+4.62e+00

Pos logit contributions
 Can+0.053
 Please+0.050
 Yes+0.044
 Come+0.044
 You+0.042
Neg logit contributions
aches-0.055
cks-0.047
ck-0.046
cking-0.044
els-0.043
Token this
Token ID428
Feature activation+3.36e+00

Pos logit contributions
 Please+0.058
 Can+0.056
 Come+0.046
 guilty+0.045
 You+0.045
Neg logit contributions
aches-0.145
cks-0.121
ck-0.121
els-0.120
ries-0.120
Token is
Token ID318
Feature activation+2.74e+00

Pos logit contributions
 Please+0.061
 Can+0.059
 sorry+0.051
 Come+0.050
 guilty+0.050
Neg logit contributions
aches-0.091
els-0.075
cks-0.074
ries-0.071
ck-0.071
Token my
Token ID616
Feature activation+2.21e+00

Pos logit contributions
 Can+0.046
 Please+0.046
 Come+0.039
 You+0.039
 sorry+0.037
Neg logit contributions
aches-0.077
cks-0.066
cking-0.065
els-0.064
ck-0.064
Token photo
Token ID4590
Feature activation+1.37e+00

Pos logit contributions
 Please+0.052
 Can+0.050
 Come+0.047
 You+0.044
 Just+0.042
Neg logit contributions
aches-0.046
els-0.039
ries-0.036
ck-0.035
ric-0.035
Token.
Token ID13
Feature activation+8.99e+00

Pos logit contributions
 Can+0.027
 Please+0.025
 Come+0.023
 sorry+0.023
 You+0.022
Neg logit contributions
aches-0.034
cks-0.028
els-0.028
ries-0.028
cking-0.028
Token This
Token ID770
Feature activation+2.30e+00

Pos logit contributions
 guilty+0.114
 wrong+0.104
 impossible+0.103
 unfair+0.102
 foolish+0.101
Neg logit contributions
aches-0.212
els-0.211
cks-0.210
ric-0.201
ries-0.193
Token is
Token ID318
Feature activation+1.96e+00

Pos logit contributions
 Please+0.039
 Can+0.039
 Come+0.033
 sorry+0.033
 You+0.031
Neg logit contributions
aches-0.065
els-0.055
cks-0.055
cking-0.053
ck-0.053
Token my
Token ID616
Feature activation+1.62e+00

Pos logit contributions
 Can+0.032
 Please+0.031
 You+0.026
 sorry+0.026
 Come+0.026
Neg logit contributions
aches-0.057
cking-0.049
cks-0.049
els-0.048
ck-0.048
Token family
Token ID1641
Feature activation+3.19e-01

Pos logit contributions
 Please+0.037
 Can+0.036
 Come+0.034
 You+0.032
 sorry+0.031
Neg logit contributions
aches-0.034
els-0.029
ries-0.027
cks-0.026
ric-0.026
Token.
Token ID13
Feature activation+8.32e+00

Pos logit contributions
 Can+0.006
 Please+0.006
 You+0.005
 sorry+0.005
 Come+0.005
Neg logit contributions
aches-0.009
cking-0.007
cks-0.007
els-0.007
ck-0.007
Token "
Token ID366
Feature activation-7.11e-01

Pos logit contributions
 Please+0.037
 Can+0.037
 You+0.031
 Come+0.030
 sorry+0.028
Neg logit contributions
aches-0.084
cking-0.073
cks-0.072
ric-0.071
els-0.071
TokenHello
Token ID15496
Feature activation+4.08e+00

Pos logit contributions
aches+0.023
cking+0.020
els+0.020
cks+0.019
ric+0.019
Neg logit contributions
 Can-0.011
 Please-0.011
 You-0.009
 Come-0.009
 I-0.008
Token,
Token ID11
Feature activation+5.38e+00

Pos logit contributions
 Can+0.061
 Please+0.061
 sorry+0.050
 You+0.050
 Come+0.049
Neg logit contributions
aches-0.121
els-0.104
cks-0.104
cking-0.104
ries-0.103
Token little
Token ID1310
Feature activation+3.07e+00

Pos logit contributions
 Can+0.090
 Please+0.089
 You+0.075
 Come+0.074
 sorry+0.073
Neg logit contributions
aches-0.152
cking-0.130
cks-0.129
ck-0.127
els-0.127
Token ones
Token ID3392
Feature activation+9.98e-01

Pos logit contributions
 Can+0.053
 Please+0.053
 sorry+0.048
 You+0.045
 Come+0.045
Neg logit contributions
aches-0.081
ries-0.070
els-0.069
cking-0.068
cks-0.068
Token.
Token ID13
Feature activation+8.98e+00

Pos logit contributions
 Can+0.019
 Please+0.019
 sorry+0.016
 You+0.016
 Come+0.016
Neg logit contributions
aches-0.026
cking-0.022
cks-0.022
els-0.022
ck-0.022
Token Do
Token ID2141
Feature activation+2.71e+00

Pos logit contributions
 guilty+0.095
 impossible+0.089
 unfair+0.086
 ok+0.086
 sorry+0.085
Neg logit contributions
aches-0.242
cks-0.232
ric-0.230
els-0.222
ries-0.218
Token you
Token ID345
Feature activation+1.74e+00

Pos logit contributions
 Can+0.035
 Please+0.034
 You+0.028
 sorry+0.027
 Come+0.026
Neg logit contributions
aches-0.089
cking-0.078
cks-0.077
els-0.076
ck-0.076
Token want
Token ID765
Feature activation+1.79e+00

Pos logit contributions
 Can+0.025
 Please+0.024
 You+0.020
 Come+0.019
 sorry+0.019
Neg logit contributions
aches-0.055
ck-0.048
cks-0.045
cking-0.045
els-0.044
Token to
Token ID284
Feature activation-5.75e-01

Pos logit contributions
 Can+0.027
 Please+0.027
 You+0.023
 sorry+0.023
 Come+0.022
Neg logit contributions
aches-0.053
cking-0.044
cks-0.044
ck-0.044
els-0.044
Token see
Token ID766
Feature activation+4.25e-01

Pos logit contributions
aches+0.016
cking+0.016
cks+0.015
els+0.015
ries+0.015
Neg logit contributions
 Can-0.009
 Please-0.009
 I-0.008
 sorry-0.008
 You-0.007
Token,
Token ID11
Feature activation+1.78e+00

Pos logit contributions
 Can+0.020
 Please+0.020
 You+0.017
 Come+0.017
 sorry+0.016
Neg logit contributions
aches-0.031
els-0.026
cks-0.026
cking-0.026
ck-0.026
Token "
Token ID366
Feature activation-1.27e-01

Pos logit contributions
 Can+0.023
 Please+0.023
 You+0.018
 Come+0.018
 sorry+0.017
Neg logit contributions
aches-0.055
cking-0.049
cks-0.049
ck-0.048
els-0.048
TokenHello
Token ID15496
Feature activation+3.83e+00

Pos logit contributions
aches+0.004
cking+0.003
cks+0.003
els+0.003
ries+0.003
Neg logit contributions
 Can-0.002
 Please-0.002
 You-0.002
 Come-0.002
 Give-0.002
Token,
Token ID11
Feature activation+5.07e+00

Pos logit contributions
 Can+0.061
 Please+0.061
 You+0.051
 sorry+0.050
 Come+0.050
Neg logit contributions
aches-0.108
ries-0.092
els-0.092
cks-0.091
cking-0.090
Token children
Token ID1751
Feature activation+1.72e+00

Pos logit contributions
 Can+0.080
 Please+0.080
 Come+0.065
 Just+0.064
 You+0.064
Neg logit contributions
aches-0.145
cking-0.118
cks-0.117
ck-0.117
ries-0.116
Token.
Token ID13
Feature activation+8.98e+00

Pos logit contributions
 Can+0.034
 Please+0.034
 sorry+0.029
 Come+0.029
 You+0.028
Neg logit contributions
aches-0.043
cks-0.036
cking-0.036
els-0.036
ck-0.035
Token What
Token ID1867
Feature activation+2.83e+00

Pos logit contributions
 guilty+0.089
 impossible+0.088
 unfair+0.085
 stupid+0.078
 foolish+0.078
Neg logit contributions
aches-0.241
ric-0.236
cks-0.230
els-0.221
ries-0.214
Token a
Token ID257
Feature activation+3.19e+00

Pos logit contributions
 Please+0.064
 Can+0.064
 sorry+0.056
 guilty+0.054
 You+0.054
Neg logit contributions
aches-0.063
cks-0.049
els-0.046
ck-0.046
ics-0.045
Token nice
Token ID3621
Feature activation+1.84e+00

Pos logit contributions
 Can+0.056
 Just+0.054
 Please+0.053
 Come+0.053
 You+0.052
Neg logit contributions
aches-0.071
cking-0.057
ck-0.056
cks-0.052
ind-0.051
Token ball
Token ID2613
Feature activation+2.48e+00

Pos logit contributions
 Can+0.033
 Please+0.033
 Just+0.028
 You+0.028
 Come+0.027
Neg logit contributions
aches-0.048
ck-0.037
els-0.036
ries-0.036
cking-0.035
Token you
Token ID345
Feature activation+2.70e+00

Pos logit contributions
 Please+0.047
 Can+0.046
 sorry+0.041
 Come+0.040
 Yes+0.038
Neg logit contributions
aches-0.062
cks-0.053
els-0.051
ries-0.050
cking-0.050
Token\n
Token ID198
Feature activation-5.11e+00

Pos logit contributions
aches+0.198
cking+0.176
els+0.173
ries+0.171
cks+0.168
Neg logit contributions
 Can-0.103
 Please-0.099
 You-0.086
 Come-0.083
 But-0.078
Token"
Token ID1
Feature activation-1.05e+00

Pos logit contributions
aches+0.114
cking+0.095
cks+0.093
els+0.093
ric+0.092
Neg logit contributions
 Please-0.127
 Can-0.126
 You-0.110
 Come-0.106
 I-0.105
TokenHello
Token ID15496
Feature activation+4.85e+00

Pos logit contributions
aches+0.034
cking+0.030
cks+0.030
ck+0.029
els+0.029
Neg logit contributions
 Can-0.017
 Please-0.016
 Come-0.013
 You-0.013
 Yes-0.012
Token,
Token ID11
Feature activation+5.83e+00

Pos logit contributions
 Please+0.079
 Can+0.079
 sorry+0.069
 guilty+0.068
 You+0.068
Neg logit contributions
aches-0.130
ries-0.114
els-0.112
cks-0.108
ck-0.107
Token class
Token ID1398
Feature activation+1.98e+00

Pos logit contributions
 Please+0.092
 Can+0.090
 Come+0.078
 Just+0.077
 You+0.075
Neg logit contributions
aches-0.159
els-0.133
ries-0.130
ck-0.129
cks-0.128
Token.
Token ID13
Feature activation+8.96e+00

Pos logit contributions
 Can+0.036
 Please+0.035
 sorry+0.031
 Come+0.031
 guilty+0.030
Neg logit contributions
aches-0.051
cks-0.043
els-0.042
ries-0.042
cking-0.042
Token Today
Token ID6288
Feature activation+3.03e+00

Pos logit contributions
 guilty+0.100
 unfair+0.090
 impossible+0.089
 trouble+0.086
 ok+0.086
Neg logit contributions
ric-0.226
aches-0.224
cks-0.209
els-0.208
ries-0.204
Token I
Token ID314
Feature activation+1.23e+00

Pos logit contributions
 Can+0.052
 Please+0.052
 sorry+0.046
 Come+0.045
 guilty+0.045
Neg logit contributions
aches-0.078
ries-0.062
ide-0.061
of-0.060
cks-0.060
Token will
Token ID481
Feature activation+2.46e+00

Pos logit contributions
 Can+0.024
 Please+0.024
 You+0.021
 sorry+0.021
 Come+0.020
Neg logit contributions
aches-0.031
ck-0.026
cks-0.025
cking-0.025
els-0.024
Token teach
Token ID4545
Feature activation+8.51e-01

Pos logit contributions
 Can+0.050
 Please+0.049
 You+0.043
 sorry+0.043
 Come+0.042
Neg logit contributions
aches-0.062
cking-0.053
cks-0.052
els-0.051
ck-0.050
Token you
Token ID345
Feature activation+5.87e-01

Pos logit contributions
 Can+0.011
 Please+0.011
 You+0.009
 sorry+0.009
 Come+0.008
Neg logit contributions
aches-0.028
cking-0.024
cks-0.024
els-0.024
ck-0.024
Token\n
Token ID198
Feature activation-5.20e+00

Pos logit contributions
aches+0.191
cking+0.169
els+0.167
ries+0.164
cks+0.163
Neg logit contributions
 Can-0.104
 Please-0.101
 You-0.090
 Come-0.086
 sorry-0.085
Token"
Token ID1
Feature activation-1.35e+00

Pos logit contributions
aches+0.122
cking+0.102
els+0.101
cks+0.101
ries+0.099
Neg logit contributions
 Can-0.119
 Please-0.118
 You-0.104
 Come-0.101
 sorry-0.100
TokenHello
Token ID15496
Feature activation+4.08e+00

Pos logit contributions
aches+0.039
cking+0.033
cks+0.033
els+0.032
ck+0.032
Neg logit contributions
 Can-0.025
 Please-0.024
 You-0.021
 Come-0.020
 sorry-0.019
Token,
Token ID11
Feature activation+4.99e+00

Pos logit contributions
 Please+0.072
 Can+0.070
 You+0.062
 sorry+0.061
 guilty+0.061
Neg logit contributions
aches-0.103
ries-0.092
els-0.090
cks-0.087
ck-0.086
Token kids
Token ID3988
Feature activation+1.76e+00

Pos logit contributions
 Please+0.084
 Can+0.082
 Just+0.070
 Come+0.070
 You+0.066
Neg logit contributions
aches-0.133
els-0.112
ck-0.111
cks-0.110
ries-0.110
Token.
Token ID13
Feature activation+8.96e+00

Pos logit contributions
 Can+0.035
 Please+0.034
 sorry+0.029
 Come+0.029
 You+0.029
Neg logit contributions
aches-0.045
cking-0.038
cks-0.038
ck-0.037
els-0.037
Token Don
Token ID2094
Feature activation+3.23e+00

Pos logit contributions
 impossible+0.113
 guilty+0.113
 unfair+0.113
 ok+0.102
 stupid+0.102
Neg logit contributions
ric-0.209
aches-0.204
cks-0.199
els-0.190
ries-0.186
Token't
Token ID470
Feature activation+1.41e+00

Pos logit contributions
 Can+0.057
 Please+0.056
 sorry+0.048
 You+0.047
 Come+0.047
Neg logit contributions
aches-0.089
cking-0.075
cks-0.075
ck-0.074
els-0.074
Token be
Token ID307
Feature activation+1.83e-01

Pos logit contributions
 Can+0.025
 Please+0.024
 You+0.021
 Come+0.021
 sorry+0.021
Neg logit contributions
aches-0.039
cking-0.033
ck-0.033
cks-0.032
els-0.031
Token afraid
Token ID7787
Feature activation+3.53e+00

Pos logit contributions
 Can+0.002
 Please+0.002
 Come+0.002
 You+0.002
 Yes+0.002
Neg logit contributions
aches-0.006
cking-0.005
cks-0.005
ck-0.005
els-0.005
Token.
Token ID13
Feature activation+8.75e+00

Pos logit contributions
 Can+0.079
 Please+0.077
 Come+0.071
 Just+0.070
 Yes+0.070
Neg logit contributions
aches-0.070
ries-0.057
els-0.057
cks-0.056
cking-0.055
Token.
Token ID13
Feature activation+4.83e+00

Pos logit contributions
 Can+0.025
 Please+0.024
 You+0.021
 sorry+0.021
 Come+0.020
Neg logit contributions
aches-0.033
cks-0.028
els-0.027
cking-0.027
ck-0.027
Token "
Token ID366
Feature activation-1.74e+00

Pos logit contributions
 Can+0.074
 Please+0.073
 sorry+0.062
 guilty+0.062
 You+0.060
Neg logit contributions
aches-0.132
cks-0.117
els-0.112
cking-0.111
ries-0.110
TokenHi
Token ID17250
Feature activation+5.64e+00

Pos logit contributions
aches+0.045
els+0.038
cking+0.037
cks+0.037
ries+0.036
Neg logit contributions
 Please-0.037
 Can-0.036
 You-0.032
 Come-0.032
 I-0.029
Token,
Token ID11
Feature activation+6.21e+00

Pos logit contributions
 Can+0.101
 Please+0.100
 guilty+0.090
 sorry+0.087
 You+0.084
Neg logit contributions
aches-0.140
ries-0.121
cks-0.119
els-0.117
ck-0.116
Token Tom
Token ID4186
Feature activation+2.50e+00

Pos logit contributions
 Please+0.101
 Can+0.101
 Just+0.086
 Come+0.083
 guilty+0.082
Neg logit contributions
aches-0.165
ck-0.133
cks-0.133
ries-0.132
els-0.130
Token.
Token ID13
Feature activation+8.94e+00

Pos logit contributions
 Can+0.055
 Please+0.053
 Yes+0.048
 sorry+0.048
 Come+0.047
Neg logit contributions
aches-0.054
cks-0.048
cking-0.046
ck-0.046
els-0.046
Token I
Token ID314
Feature activation+1.25e+00

Pos logit contributions
 guilty+0.097
 impossible+0.096
 unfair+0.092
 ok+0.084
 nervous+0.083
Neg logit contributions
ric-0.232
aches-0.230
cks-0.224
ries-0.214
illas-0.213
Token cannot
Token ID2314
Feature activation+2.41e+00

Pos logit contributions
 Can+0.026
 Please+0.026
 You+0.022
 sorry+0.022
 Come+0.021
Neg logit contributions
aches-0.030
ck-0.026
cking-0.025
cks-0.025
els-0.024
Token play
Token ID711
Feature activation+1.20e+00

Pos logit contributions
 Can+0.046
 Please+0.046
 You+0.040
 sorry+0.039
 Come+0.039
Neg logit contributions
aches-0.063
cking-0.054
cks-0.053
els-0.052
ck-0.052
Token with
Token ID351
Feature activation-2.31e-01

Pos logit contributions
 Can+0.027
 Please+0.026
 Come+0.024
 You+0.023
 guilty+0.023
Neg logit contributions
aches-0.025
cks-0.021
cking-0.021
ck-0.020
els-0.020
Token you
Token ID345
Feature activation+7.61e-01

Pos logit contributions
aches+0.007
cking+0.006
cks+0.006
els+0.006
ck+0.006
Neg logit contributions
 Can-0.004
 Please-0.004
 You-0.003
 Come-0.003
 sorry-0.003
TokenHello
Token ID15496
Feature activation+3.71e+00

Pos logit contributions
aches+0.032
cks+0.029
cking+0.028
els+0.028
ries+0.027
Neg logit contributions
 Can-0.012
 Please-0.012
 You-0.010
 Come-0.010
 I-0.008
Token,
Token ID11
Feature activation+5.45e+00

Pos logit contributions
 Can+0.052
 Please+0.052
 You+0.043
 sorry+0.042
 Come+0.041
Neg logit contributions
aches-0.115
cking-0.100
cks-0.099
els-0.099
ck-0.098
Token I
Token ID314
Feature activation+1.66e+00

Pos logit contributions
 Can+0.099
 Please+0.098
 You+0.084
 sorry+0.082
 Come+0.082
Neg logit contributions
aches-0.148
cking-0.124
cks-0.122
ck-0.122
els-0.121
Token'm
Token ID1101
Feature activation+3.90e+00

Pos logit contributions
 Can+0.032
 Please+0.031
 You+0.027
 sorry+0.027
 Come+0.026
Neg logit contributions
aches-0.043
cking-0.037
cks-0.036
ck-0.036
els-0.036
Token Anna
Token ID11735
Feature activation+1.97e+00

Pos logit contributions
 Can+0.073
 Please+0.070
 Come+0.061
 Let+0.057
 Give+0.056
Neg logit contributions
aches-0.096
cks-0.085
cking-0.082
ck-0.081
ric-0.077
Token.
Token ID13
Feature activation+8.94e+00

Pos logit contributions
 Can+0.045
 Please+0.043
 Come+0.040
 sorry+0.038
 Yes+0.038
Neg logit contributions
aches-0.040
cks-0.034
els-0.033
ries-0.032
cking-0.032
Token This
Token ID770
Feature activation+2.74e+00

Pos logit contributions
 guilty+0.085
 impossible+0.076
 unfair+0.075
 sorry+0.075
 stupid+0.073
Neg logit contributions
aches-0.255
cks-0.233
ric-0.231
els-0.229
ries-0.228
Token is
Token ID318
Feature activation+2.42e+00

Pos logit contributions
 Please+0.049
 Can+0.048
 sorry+0.043
 guilty+0.040
 Come+0.040
Neg logit contributions
aches-0.071
cks-0.057
els-0.056
ries-0.056
ics-0.055
Token my
Token ID616
Feature activation+2.16e+00

Pos logit contributions
 Can+0.044
 Please+0.043
 Come+0.037
 You+0.036
 sorry+0.035
Neg logit contributions
aches-0.063
cks-0.054
ck-0.053
cking-0.052
els-0.051
Token dog
Token ID3290
Feature activation+3.30e+00

Pos logit contributions
 Please+0.043
 Can+0.043
 Come+0.037
 You+0.036
 sorry+0.035
Neg logit contributions
aches-0.052
els-0.042
ries-0.041
ric-0.040
cks-0.039
Token,
Token ID11
Feature activation+3.20e+00

Pos logit contributions
 Can+0.069
 Please+0.068
 Come+0.060
 sorry+0.058
 Yes+0.057
Neg logit contributions
aches-0.077
cks-0.064
ck-0.063
cking-0.063
els-0.062
Token\n
Token ID198
Feature activation-5.84e+00

Pos logit contributions
aches+0.195
cking+0.170
els+0.164
ries+0.163
cks+0.161
Neg logit contributions
 Can-0.118
 Please-0.114
 You-0.102
 sorry-0.098
 Come-0.098
Token"
Token ID1
Feature activation-1.52e+00

Pos logit contributions
aches+0.144
cking+0.121
els+0.118
cks+0.116
ries+0.116
Neg logit contributions
 Can-0.126
 Please-0.124
 You-0.110
 Come-0.106
 sorry-0.105
TokenHello
Token ID15496
Feature activation+4.09e+00

Pos logit contributions
aches+0.043
cking+0.036
cks+0.036
els+0.035
ric+0.035
Neg logit contributions
 Can-0.029
 Please-0.028
 Come-0.024
 You-0.024
 sorry-0.022
Token,
Token ID11
Feature activation+5.38e+00

Pos logit contributions
 Please+0.072
 Can+0.070
 You+0.060
 guilty+0.059
 sorry+0.059
Neg logit contributions
aches-0.105
ries-0.093
els-0.092
cks-0.089
ck-0.088
Token kids
Token ID3988
Feature activation+1.69e+00

Pos logit contributions
 Please+0.103
 Can+0.101
 Just+0.091
 Come+0.086
 Yes+0.082
Neg logit contributions
aches-0.127
els-0.107
ck-0.106
cks-0.106
cking-0.104
Token.
Token ID13
Feature activation+8.93e+00

Pos logit contributions
 Can+0.033
 Please+0.032
 sorry+0.028
 Come+0.028
 You+0.028
Neg logit contributions
aches-0.043
cking-0.037
cks-0.037
els-0.036
ck-0.036
Token Do
Token ID2141
Feature activation+2.97e+00

Pos logit contributions
 impossible+0.117
 guilty+0.115
 unfair+0.114
 ok+0.111
 stupid+0.104
Neg logit contributions
ric-0.201
cks-0.200
aches-0.197
els-0.187
illas-0.185
Token you
Token ID345
Feature activation+1.46e+00

Pos logit contributions
 Can+0.042
 Please+0.041
 You+0.033
 sorry+0.032
 Come+0.032
Neg logit contributions
aches-0.093
cking-0.081
cks-0.081
ck-0.080
els-0.079
Token want
Token ID765
Feature activation+1.55e+00

Pos logit contributions
 Can+0.024
 Please+0.022
 You+0.019
 Come+0.018
 Oh+0.017
Neg logit contributions
aches-0.043
ck-0.038
cks-0.035
cking-0.035
els-0.033
Token your
Token ID534
Feature activation+1.04e+00

Pos logit contributions
 Please+0.026
 Can+0.026
 sorry+0.023
 You+0.023
 Come+0.022
Neg logit contributions
aches-0.042
els-0.036
ck-0.035
ries-0.035
cks-0.035
Token ball
Token ID2613
Feature activation+5.34e-01

Pos logit contributions
 Please+0.023
 Can+0.022
 Come+0.019
 You+0.019
 sorry+0.018
Neg logit contributions
aches-0.024
cks-0.020
els-0.020
cking-0.020
ck-0.019
TokenHi
Token ID17250
Feature activation+5.20e+00

Pos logit contributions
aches+0.012
cking+0.010
cks+0.010
els+0.010
ck+0.010
Neg logit contributions
 Can-0.006
 Please-0.005
 You-0.005
 Come-0.004
 sorry-0.004
Token,
Token ID11
Feature activation+5.74e+00

Pos logit contributions
 Please+0.095
 Can+0.091
 guilty+0.080
 Come+0.078
 You+0.077
Neg logit contributions
aches-0.127
ries-0.113
els-0.112
cks-0.110
ck-0.106
Token I
Token ID314
Feature activation+2.17e+00

Pos logit contributions
 Please+0.063
 Can+0.056
 Just+0.045
 Come+0.044
 guilty+0.042
Neg logit contributions
aches-0.184
els-0.157
cks-0.157
ck-0.156
ries-0.155
Token'm
Token ID1101
Feature activation+4.27e+00

Pos logit contributions
 Can+0.048
 Please+0.048
 You+0.041
 sorry+0.041
 Come+0.040
Neg logit contributions
aches-0.050
ck-0.041
cking-0.040
cks-0.040
els-0.039
Token Lily
Token ID20037
Feature activation+3.28e+00

Pos logit contributions
 Can+0.076
 Please+0.074
 Come+0.065
 Give+0.060
 Let+0.060
Neg logit contributions
aches-0.103
cks-0.092
cking-0.087
ck-0.087
ries-0.083
Token.
Token ID13
Feature activation+8.93e+00

Pos logit contributions
 Can+0.084
 Please+0.082
 Come+0.076
 Yes+0.073
 unfair+0.073
Neg logit contributions
aches-0.051
cks-0.043
els-0.041
cking-0.041
ries-0.040
Token What
Token ID1867
Feature activation+3.17e+00

Pos logit contributions
 guilty+0.100
 impossible+0.093
 unfair+0.091
 foolish+0.087
 stupid+0.084
Neg logit contributions
aches-0.225
ric-0.220
cks-0.216
els-0.207
ries-0.201
Token's
Token ID338
Feature activation+4.42e+00

Pos logit contributions
 Please+0.064
 Can+0.063
 sorry+0.056
 guilty+0.054
 You+0.053
Neg logit contributions
aches-0.078
cks-0.062
els-0.061
ck-0.060
ries-0.059
Token your
Token ID534
Feature activation+2.73e+00

Pos logit contributions
 Can+0.091
 Please+0.089
 Come+0.078
 sorry+0.076
 guilty+0.076
Neg logit contributions
aches-0.110
cks-0.093
ck-0.090
cking-0.090
els-0.087
Token name
Token ID1438
Feature activation+3.43e+00

Pos logit contributions
 Please+0.047
 Can+0.046
 Come+0.039
 You+0.039
 sorry+0.038
Neg logit contributions
aches-0.073
els-0.060
cks-0.060
ric-0.058
ck-0.058
Token?"
Token ID1701
Feature activation-4.85e-01

Pos logit contributions
 Can+0.061
 Please+0.060
 guilty+0.054
 sorry+0.053
 Come+0.053
Neg logit contributions
aches-0.086
els-0.072
ries-0.071
cks-0.071
cking-0.069
Token"
Token ID1
Feature activation-1.61e+00

Pos logit contributions
aches+0.148
cking+0.124
cks+0.123
els+0.122
ries+0.119
Neg logit contributions
 Can-0.133
 Please-0.131
 You-0.115
 Come-0.111
 sorry-0.111
TokenHello
Token ID15496
Feature activation+4.20e+00

Pos logit contributions
aches+0.053
cking+0.047
cks+0.047
ric+0.046
els+0.045
Neg logit contributions
 Can-0.025
 Please-0.024
 Come-0.019
 You-0.019
 Yes-0.017
Token,
Token ID11
Feature activation+5.30e+00

Pos logit contributions
 Please+0.068
 Can+0.066
 sorry+0.058
 You+0.057
 guilty+0.056
Neg logit contributions
aches-0.117
ries-0.102
els-0.101
cks-0.098
ck-0.097
Token little
Token ID1310
Feature activation+3.02e+00

Pos logit contributions
 Please+0.087
 Can+0.085
 Come+0.073
 Just+0.072
 You+0.071
Neg logit contributions
aches-0.144
els-0.119
ck-0.119
ries-0.118
cking-0.118
Token ones
Token ID3392
Feature activation+6.81e-01

Pos logit contributions
 Please+0.057
 Can+0.057
 sorry+0.055
 guilty+0.051
 Just+0.050
Neg logit contributions
aches-0.067
ries-0.060
els-0.059
ck-0.054
cking-0.054
Token.
Token ID13
Feature activation+8.93e+00

Pos logit contributions
 Can+0.011
 Please+0.011
 You+0.009
 sorry+0.009
 Come+0.009
Neg logit contributions
aches-0.020
cking-0.017
cks-0.017
els-0.017
ck-0.017
Token Welcome
Token ID19134
Feature activation+2.49e+00

Pos logit contributions
 guilty+0.104
 unfair+0.100
 impossible+0.098
 stupid+0.092
 foolish+0.092
Neg logit contributions
aches-0.217
ric-0.215
cks-0.204
els-0.195
ries-0.195
Token to
Token ID284
Feature activation-1.99e+00

Pos logit contributions
 sorry+0.042
 Can+0.042
 Please+0.040
 guilty+0.039
 You+0.036
Neg logit contributions
aches-0.063
ries-0.053
els-0.053
cks-0.053
ck-0.051
Token my
Token ID616
Feature activation+5.18e-01

Pos logit contributions
aches+0.060
cking+0.054
els+0.052
cks+0.052
ck+0.051
Neg logit contributions
 Can-0.032
 Please-0.031
 You-0.027
 sorry-0.026
 I-0.026
Token magic
Token ID5536
Feature activation+2.07e+00

Pos logit contributions
 Can+0.011
 Please+0.011
 You+0.010
 Come+0.010
 sorry+0.010
Neg logit contributions
aches-0.012
cking-0.010
els-0.010
cks-0.010
ries-0.009
Token shop
Token ID6128
Feature activation+1.60e+00

Pos logit contributions
 Can+0.041
 Please+0.040
 sorry+0.037
 You+0.036
 Come+0.036
Neg logit contributions
aches-0.051
els-0.041
ries-0.039
cks-0.039
ck-0.039
Token "
Token ID366
Feature activation+5.20e-02

Pos logit contributions
 Please+0.034
 Can+0.032
 guilty+0.030
 unfair+0.029
 impossible+0.029
Neg logit contributions
aches-0.055
cks-0.045
ric-0.045
ries-0.044
cking-0.044
TokenCome
Token ID16773
Feature activation+3.77e+00

Pos logit contributions
 Can+0.001
 Please+0.001
 You+0.001
 sorry+0.001
 Come+0.001
Neg logit contributions
aches-0.001
cking-0.001
cks-0.001
ck-0.001
els-0.001
Token on
Token ID319
Feature activation-1.39e+00

Pos logit contributions
 Can+0.084
 Please+0.084
 You+0.073
 guilty+0.072
 sorry+0.072
Neg logit contributions
aches-0.083
cking-0.068
cks-0.068
els-0.068
ck-0.067
Token,
Token ID11
Feature activation+5.40e+00

Pos logit contributions
aches+0.031
ries+0.026
cks+0.025
els+0.025
ck+0.025
Neg logit contributions
 Can-0.029
 Please-0.027
 Come-0.025
 Yes-0.024
 guilty-0.024
Token Mia
Token ID32189
Feature activation+2.17e+00

Pos logit contributions
 Can+0.118
 Please+0.115
 guilty+0.101
 unfair+0.097
 Just+0.097
Neg logit contributions
aches-0.116
ck-0.094
cks-0.084
cking-0.082
of-0.080
Token.
Token ID13
Feature activation+8.91e+00

Pos logit contributions
 Can+0.049
 Please+0.048
 sorry+0.042
 Come+0.041
 guilty+0.041
Neg logit contributions
aches-0.047
cks-0.040
els-0.038
ck-0.038
cking-0.038
Token The
Token ID383
Feature activation+1.18e+00

Pos logit contributions
 unfair+0.100
 guilty+0.095
 impossible+0.093
 ok+0.084
 mean+0.077
Neg logit contributions
ric-0.231
aches-0.218
cks-0.211
els-0.204
ites-0.202
Token slide
Token ID10649
Feature activation+6.19e-01

Pos logit contributions
 Can+0.030
 Please+0.030
 You+0.027
 Come+0.026
 sorry+0.026
Neg logit contributions
aches-0.024
cking-0.018
cks-0.018
els-0.018
ck-0.017
Token is
Token ID318
Feature activation+1.22e+00

Pos logit contributions
 Can+0.010
 Please+0.010
 sorry+0.008
 Come+0.008
 You+0.008
Neg logit contributions
aches-0.018
cks-0.016
cking-0.016
els-0.015
ck-0.015
Token fun
Token ID1257
Feature activation+3.11e+00

Pos logit contributions
 Can+0.020
 Please+0.020
 You+0.017
 Come+0.017
 Yes+0.016
Neg logit contributions
aches-0.036
cking-0.031
cks-0.030
ck-0.030
els-0.029
Token.
Token ID13
Feature activation+7.92e+00

Pos logit contributions
 Can+0.084
 Please+0.084
 sorry+0.080
 guilty+0.077
 Oh+0.076
Neg logit contributions
aches-0.046
cks-0.038
cking-0.035
ck-0.034
els-0.033
Token,
Token ID11
Feature activation+2.01e+00

Pos logit contributions
 Can+0.079
 Please+0.077
 You+0.069
 Come+0.066
 guilty+0.061
Neg logit contributions
aches-0.122
ries-0.097
els-0.096
ck-0.094
cking-0.093
Token "
Token ID366
Feature activation-4.07e-01

Pos logit contributions
 Can+0.025
 Please+0.025
 You+0.020
 Come+0.019
 sorry+0.019
Neg logit contributions
aches-0.063
cking-0.056
cks-0.055
els-0.055
ck-0.054
TokenHey
Token ID10814
Feature activation+4.75e+00

Pos logit contributions
aches+0.012
cking+0.010
cks+0.010
els+0.010
ric+0.010
Neg logit contributions
 Please-0.008
 Can-0.008
 You-0.007
 Come-0.006
 I-0.006
Token,
Token ID11
Feature activation+5.36e+00

Pos logit contributions
 Please+0.087
 Can+0.086
 You+0.075
 Come+0.074
 Just+0.073
Neg logit contributions
aches-0.118
ries-0.101
ck-0.101
els-0.099
cking-0.097
Token you
Token ID345
Feature activation+1.70e+00

Pos logit contributions
 Can+0.100
 Please+0.098
 Come+0.082
 You+0.080
 Just+0.078
Neg logit contributions
aches-0.141
ck-0.117
cking-0.117
cks-0.116
els-0.114
Token.
Token ID13
Feature activation+8.91e+00

Pos logit contributions
 Can+0.030
 Please+0.030
 Come+0.025
 You+0.025
 Yes+0.024
Neg logit contributions
aches-0.045
cks-0.038
els-0.038
cking-0.038
ck-0.037
Token What
Token ID1867
Feature activation+2.75e+00

Pos logit contributions
 Please+0.071
 guilty+0.071
 impossible+0.071
 Can+0.069
 unfair+0.066
Neg logit contributions
aches-0.270
els-0.255
cks-0.251
ric-0.251
ries-0.248
Token are
Token ID389
Feature activation+3.77e+00

Pos logit contributions
 Please+0.058
 Can+0.058
 sorry+0.051
 guilty+0.050
 You+0.049
Neg logit contributions
aches-0.064
cks-0.051
els-0.051
ck-0.049
ries-0.048
Token you
Token ID345
Feature activation+3.36e+00

Pos logit contributions
 Can+0.048
 Please+0.047
 You+0.038
 sorry+0.037
 Come+0.036
Neg logit contributions
aches-0.123
cking-0.109
cks-0.108
els-0.106
ck-0.106
Token doing
Token ID1804
Feature activation+3.49e+00

Pos logit contributions
 Can+0.069
 Please+0.067
 Come+0.057
 You+0.057
 sorry+0.055
Neg logit contributions
aches-0.085
cking-0.073
cks-0.072
els-0.070
ck-0.069
Token here
Token ID994
Feature activation+1.64e+00

Pos logit contributions
 Can+0.076
 Please+0.073
 guilty+0.069
 Come+0.068
 sorry+0.065
Neg logit contributions
aches-0.072
els-0.060
cks-0.059
ating-0.057
cking-0.056
TokenHi
Token ID17250
Feature activation+5.06e+00

Pos logit contributions
aches+0.012
cking+0.011
cks+0.011
els+0.011
ck+0.010
Neg logit contributions
 Can-0.007
 Please-0.007
 You-0.006
 Come-0.006
 sorry-0.006
Token,
Token ID11
Feature activation+5.61e+00

Pos logit contributions
 Can+0.084
 Please+0.084
 You+0.070
 sorry+0.069
 guilty+0.069
Neg logit contributions
aches-0.139
ries-0.118
els-0.116
cks-0.115
ck-0.114
Token I
Token ID314
Feature activation+1.98e+00

Pos logit contributions
 Can+0.070
 Please+0.070
 You+0.053
 Come+0.052
 sorry+0.052
Neg logit contributions
aches-0.182
cking-0.156
cks-0.155
ck-0.155
els-0.153
Token'm
Token ID1101
Feature activation+3.95e+00

Pos logit contributions
 Can+0.040
 Please+0.040
 You+0.034
 sorry+0.034
 Come+0.033
Neg logit contributions
aches-0.049
cking-0.041
ck-0.041
cks-0.040
els-0.039
Token Tom
Token ID4186
Feature activation+3.12e+00

Pos logit contributions
 Can+0.073
 Please+0.069
 Come+0.061
 Let+0.058
 Give+0.058
Neg logit contributions
aches-0.092
cks-0.083
cking-0.078
ck-0.076
ries-0.075
Token.
Token ID13
Feature activation+8.90e+00

Pos logit contributions
 Can+0.077
 Please+0.074
 Come+0.067
 Yes+0.067
 guilty+0.066
Neg logit contributions
aches-0.056
cks-0.048
cking-0.046
ck-0.046
els-0.045
Token Can
Token ID1680
Feature activation+4.56e+00

Pos logit contributions
 guilty+0.099
 impossible+0.092
 unfair+0.091
 foolish+0.085
 stupid+0.083
Neg logit contributions
aches-0.230
ric-0.215
cks-0.212
els-0.204
of-0.202
Token I
Token ID314
Feature activation+2.86e+00

Pos logit contributions
 Can+0.042
 Please+0.038
 guilty+0.032
 sorry+0.031
 Come+0.028
Neg logit contributions
aches-0.156
ck-0.135
cks-0.134
cking-0.133
els-0.133
Token play
Token ID711
Feature activation+1.13e+00

Pos logit contributions
 Can+0.059
 Please+0.058
 You+0.051
 sorry+0.050
 Come+0.050
Neg logit contributions
aches-0.071
cking-0.059
cks-0.058
ck-0.058
els-0.057
Token with
Token ID351
Feature activation-5.14e-01

Pos logit contributions
 Can+0.026
 Please+0.026
 You+0.023
 Come+0.023
 sorry+0.023
Neg logit contributions
aches-0.023
cking-0.019
ck-0.019
cks-0.019
els-0.018
Token you
Token ID345
Feature activation+1.31e+00

Pos logit contributions
aches+0.017
cking+0.015
cks+0.015
ck+0.014
els+0.014
Neg logit contributions
 Can-0.007
 Please-0.007
 You-0.005
 sorry-0.005
 Come-0.005
Token\n
Token ID198
Feature activation-5.94e+00

Pos logit contributions
aches+0.226
cking+0.200
ries+0.200
els+0.196
cks+0.185
Neg logit contributions
 Can-0.091
 Please-0.085
 You-0.078
 Come-0.076
 I-0.072
Token"
Token ID1
Feature activation-1.57e+00

Pos logit contributions
aches+0.161
ries+0.140
els+0.135
cking+0.135
ric+0.133
Neg logit contributions
 Can-0.115
 Please-0.112
 You-0.100
 Come-0.098
 I-0.098
TokenHi
Token ID17250
Feature activation+4.86e+00

Pos logit contributions
aches+0.044
cking+0.038
cks+0.037
els+0.035
ck+0.035
Neg logit contributions
 Can-0.031
 Please-0.030
 Come-0.026
 You-0.026
 Yes-0.024
Token,
Token ID11
Feature activation+5.45e+00

Pos logit contributions
 Please+0.080
 Can+0.079
 sorry+0.065
 You+0.065
 guilty+0.063
Neg logit contributions
aches-0.138
els-0.119
cks-0.118
ries-0.117
ck-0.117
Token Sam
Token ID3409
Feature activation+2.44e+00

Pos logit contributions
 Please+0.106
 Can+0.105
 You+0.087
 sorry+0.087
 Come+0.086
Neg logit contributions
aches-0.138
cks-0.114
cking-0.114
els-0.114
ck-0.113
Token.
Token ID13
Feature activation+8.89e+00

Pos logit contributions
 Can+0.049
 Please+0.047
 sorry+0.042
 Come+0.041
 Yes+0.041
Neg logit contributions
aches-0.060
ck-0.051
cks-0.051
cking-0.051
els-0.050
Token What
Token ID1867
Feature activation+3.23e+00

Pos logit contributions
 unfair+0.113
 guilty+0.111
 impossible+0.111
 ok+0.103
 sorry+0.099
Neg logit contributions
ric-0.214
aches-0.209
cks-0.206
els-0.196
ries-0.195
Token are
Token ID389
Feature activation+4.08e+00

Pos logit contributions
 Please+0.075
 Can+0.073
 sorry+0.066
 guilty+0.064
 You+0.063
Neg logit contributions
aches-0.069
cks-0.055
els-0.053
ries-0.051
ck-0.051
Token you
Token ID345
Feature activation+3.62e+00

Pos logit contributions
 Can+0.060
 Please+0.058
 sorry+0.049
 Come+0.047
 You+0.047
Neg logit contributions
aches-0.124
cks-0.106
cking-0.106
ck-0.104
els-0.104
Token doing
Token ID1804
Feature activation+3.58e+00

Pos logit contributions
 Can+0.072
 Please+0.071
 Come+0.060
 You+0.060
 sorry+0.058
Neg logit contributions
aches-0.094
cking-0.081
cks-0.079
els-0.078
ck-0.077
Token?"
Token ID1701
Feature activation-1.67e+00

Pos logit contributions
 Can+0.073
 Please+0.069
 guilty+0.068
 Come+0.065
 Just+0.064
Neg logit contributions
aches-0.077
els-0.067
cks-0.065
cking-0.064
ries-0.064
Token It
Token ID632
Feature activation-2.34e+00

Pos logit contributions
 Please+0.067
 Can+0.067
 Come+0.051
 guilty+0.051
 sorry+0.051
Neg logit contributions
aches-0.135
cks-0.127
els-0.122
ck-0.121
cking-0.119
Token says
Token ID1139
Feature activation+4.27e+00

Pos logit contributions
aches+0.065
cks+0.057
cking+0.056
ck+0.056
els+0.055
Neg logit contributions
 Can-0.041
 Please-0.040
 You-0.035
 Come-0.033
 sorry-0.033
Token:
Token ID25
Feature activation+2.34e+00

Pos logit contributions
 Can+0.090
 Please+0.087
 You+0.079
 Come+0.077
 guilty+0.071
Neg logit contributions
aches-0.103
els-0.082
cks-0.081
cking-0.081
ck-0.081
Token "
Token ID366
Feature activation-5.15e-01

Pos logit contributions
 Can+0.030
 Please+0.030
 You+0.024
 Come+0.023
 sorry+0.023
Neg logit contributions
aches-0.072
cks-0.063
cking-0.062
els-0.061
ck-0.061
TokenNo
Token ID2949
Feature activation+1.60e+00

Pos logit contributions
aches+0.014
cking+0.012
cks+0.011
ries+0.011
els+0.011
Neg logit contributions
 Please-0.010
 Can-0.010
 You-0.009
 Come-0.009
 War-0.008
Token!
Token ID0
Feature activation+8.89e+00

Pos logit contributions
 Please+0.034
 Can+0.033
 Come+0.029
 Goodbye+0.027
 crank+0.027
Neg logit contributions
aches-0.037
els-0.031
ries-0.031
cks-0.029
ric-0.029
Token You
Token ID921
Feature activation+1.69e+00

Pos logit contributions
 Can+0.098
 Please+0.097
 guilty+0.086
 sorry+0.082
 impossible+0.077
Neg logit contributions
aches-0.259
els-0.245
ric-0.241
cks-0.238
ries-0.233
Token are
Token ID389
Feature activation+7.88e-01

Pos logit contributions
 Can+0.029
 Please+0.028
 You+0.024
 Come+0.024
 sorry+0.023
Neg logit contributions
aches-0.048
cking-0.041
cks-0.041
ck-0.040
els-0.040
Token bad
Token ID2089
Feature activation+2.98e+00

Pos logit contributions
 Can+0.015
 Please+0.014
 Come+0.013
 Yes+0.012
 You+0.011
Neg logit contributions
aches-0.022
cks-0.019
cking-0.019
ck-0.018
els-0.018
Token!
Token ID0
Feature activation+8.38e+00

Pos logit contributions
 Can+0.065
 Please+0.062
 sorry+0.056
 unfair+0.054
 impossible+0.054
Neg logit contributions
aches-0.059
els-0.055
ries-0.053
ins-0.050
cks-0.050
Token You
Token ID921
Feature activation+1.38e+00

Pos logit contributions
 Please+0.059
 Can+0.059
 guilty+0.059
 impossible+0.055
 sorry+0.053
Neg logit contributions
aches-0.256
els-0.251
cks-0.247
ric-0.244
ries-0.238
Token,
Token ID11
Feature activation+5.17e+00

Pos logit contributions
 Please+0.086
 Can+0.086
 Just+0.078
 Come+0.078
 You+0.074
Neg logit contributions
aches-0.098
ries-0.085
cking-0.083
ck-0.083
els-0.083
Token that
Token ID326
Feature activation+3.50e+00

Pos logit contributions
 Can+0.101
 Please+0.099
 Come+0.085
 Just+0.081
 You+0.080
Neg logit contributions
aches-0.130
cking-0.109
ck-0.108
cks-0.107
els-0.105
Token's
Token ID338
Feature activation+1.48e+00

Pos logit contributions
 Can+0.066
 Please+0.065
 Come+0.055
 sorry+0.054
 Just+0.053
Neg logit contributions
aches-0.089
cks-0.075
cking-0.073
els-0.071
ries-0.071
Token not
Token ID407
Feature activation+1.99e+00

Pos logit contributions
 Can+0.030
 Please+0.028
 Come+0.025
 You+0.024
 Yes+0.023
Neg logit contributions
aches-0.039
cks-0.034
cking-0.033
ck-0.033
els-0.032
Token nice
Token ID3621
Feature activation+1.47e+00

Pos logit contributions
 Can+0.034
 Please+0.032
 You+0.027
 Come+0.027
 sorry+0.026
Neg logit contributions
aches-0.057
cking-0.049
cks-0.048
ck-0.048
els-0.046
Token.
Token ID13
Feature activation+8.88e+00

Pos logit contributions
 Can+0.031
 Please+0.030
 sorry+0.027
 Come+0.026
 You+0.026
Neg logit contributions
aches-0.035
els-0.029
ries-0.028
cks-0.028
cking-0.026
Token That
Token ID1320
Feature activation+3.33e+00

Pos logit contributions
 impossible+0.095
 guilty+0.091
 ok+0.088
 Can+0.087
 Please+0.087
Neg logit contributions
aches-0.237
els-0.234
cks-0.234
ric-0.223
ries-0.222
Token's
Token ID338
Feature activation+7.01e-01

Pos logit contributions
 Can+0.062
 Please+0.062
 sorry+0.056
 Come+0.056
 Yes+0.052
Neg logit contributions
aches-0.080
els-0.066
cks-0.066
ries-0.064
ric-0.062
Token our
Token ID674
Feature activation+1.05e+00

Pos logit contributions
 Can+0.013
 Please+0.012
 Come+0.010
 You+0.010
 sorry+0.010
Neg logit contributions
aches-0.020
cks-0.017
cking-0.017
ck-0.017
els-0.016
Token ball
Token ID2613
Feature activation+2.20e-01

Pos logit contributions
 Can+0.022
 Please+0.022
 Come+0.019
 You+0.018
 sorry+0.018
Neg logit contributions
aches-0.026
cks-0.021
els-0.021
cking-0.021
ck-0.020
Token.
Token ID13
Feature activation+7.68e+00

Pos logit contributions
 Can+0.005
 Please+0.005
 Come+0.004
 sorry+0.004
 Yes+0.004
Neg logit contributions
aches-0.005
cks-0.004
els-0.004
cking-0.004
ck-0.004
TokenHello
Token ID15496
Feature activation+4.60e+00

Pos logit contributions
aches+0.031
cking+0.027
ck+0.026
cks+0.026
els+0.026
Neg logit contributions
 Can-0.023
 Please-0.023
 sorry-0.021
 You-0.020
 guilty-0.019
Token,
Token ID11
Feature activation+5.36e+00

Pos logit contributions
 Please+0.081
 Can+0.079
 guilty+0.070
 sorry+0.069
 You+0.068
Neg logit contributions
aches-0.115
ries-0.103
els-0.100
cks-0.097
ck-0.096
Token I
Token ID314
Feature activation+2.17e+00

Pos logit contributions
 Please+0.084
 Can+0.081
 Just+0.071
 Come+0.069
 You+0.065
Neg logit contributions
aches-0.144
ck-0.120
els-0.118
cks-0.117
ries-0.117
Token am
Token ID716
Feature activation+4.12e+00

Pos logit contributions
 Can+0.046
 Please+0.046
 You+0.040
 sorry+0.039
 Come+0.039
Neg logit contributions
aches-0.051
ck-0.042
cking-0.042
cks-0.042
els-0.040
Token Max
Token ID5436
Feature activation+4.19e+00

Pos logit contributions
 Can+0.071
 Please+0.069
 Come+0.062
 guilty+0.058
 Give+0.055
Neg logit contributions
aches-0.101
cks-0.089
ck-0.083
cking-0.082
ries-0.082
Token.
Token ID13
Feature activation+8.87e+00

Pos logit contributions
 Can+0.096
 Please+0.089
 Yes+0.086
 Come+0.083
 impossible+0.082
Neg logit contributions
aches-0.083
cks-0.071
ck-0.070
cking-0.068
ries-0.067
Token What
Token ID1867
Feature activation+3.30e+00

Pos logit contributions
 guilty+0.098
 impossible+0.094
 unfair+0.089
 foolish+0.086
 wrong+0.085
Neg logit contributions
aches-0.217
ric-0.213
cks-0.207
isk-0.198
els-0.197
Token is
Token ID318
Feature activation+5.46e+00

Pos logit contributions
 Please+0.070
 Can+0.069
 sorry+0.061
 guilty+0.060
 You+0.059
Neg logit contributions
aches-0.078
cks-0.061
els-0.059
ck-0.058
ries-0.057
Token your
Token ID534
Feature activation+2.75e+00

Pos logit contributions
 Can+0.119
 Please+0.119
 Come+0.112
 guilty+0.108
 ok+0.107
Neg logit contributions
aches-0.115
cks-0.099
ck-0.091
cking-0.091
els-0.088
Token name
Token ID1438
Feature activation+3.17e+00

Pos logit contributions
 Please+0.046
 Can+0.046
 sorry+0.038
 Come+0.038
 guilty+0.037
Neg logit contributions
aches-0.076
els-0.064
cks-0.062
ric-0.062
ck-0.060
Token?"
Token ID1701
Feature activation-1.00e+00

Pos logit contributions
 Can+0.057
 Please+0.055
 guilty+0.050
 sorry+0.050
 Come+0.049
Neg logit contributions
aches-0.080
els-0.068
ries-0.067
cks-0.067
ck-0.065
Token as
Token ID355
Feature activation-9.37e-01

Pos logit contributions
 Can+0.018
 Please+0.018
 Come+0.016
 You+0.016
 sorry+0.015
Neg logit contributions
aches-0.022
cking-0.019
cks-0.018
ck-0.018
els-0.018
Token he
Token ID339
Feature activation+6.52e-01

Pos logit contributions
aches+0.024
cking+0.020
ck+0.019
cks+0.019
els+0.019
Neg logit contributions
 Can-0.019
 Please-0.019
 Come-0.016
 You-0.016
 sorry-0.015
Token opened
Token ID4721
Feature activation-2.27e+00

Pos logit contributions
 Can+0.012
 Please+0.012
 You+0.010
 sorry+0.010
 Come+0.010
Neg logit contributions
aches-0.018
cking-0.015
cks-0.015
els-0.015
ck-0.015
Token the
Token ID262
Feature activation-7.81e-01

Pos logit contributions
aches+0.054
cking+0.045
cks+0.044
ck+0.043
els+0.043
Neg logit contributions
 Can-0.049
 Please-0.048
 You-0.043
 sorry-0.042
 Come-0.041
Token ch
Token ID442
Feature activation-1.68e+00

Pos logit contributions
aches+0.020
els+0.017
cking+0.017
ries+0.016
ric+0.016
Neg logit contributions
 Can-0.014
 Please-0.014
 Come-0.013
 You-0.012
 sorry-0.012
Tokenocol
Token ID4668
Feature activation-1.05e+01

Pos logit contributions
aches+0.031
cking+0.024
cks+0.024
ck+0.023
els+0.022
Neg logit contributions
 Please-0.045
 Can-0.045
 sorry-0.041
 You-0.041
 Come-0.039
Tokenates
Token ID689
Feature activation-3.16e+00

Pos logit contributions
aches+0.175
cking+0.156
glers+0.153
ck+0.143
ge+0.140
Neg logit contributions
 Please-0.218
 Can-0.199
 Come-0.197
 Just-0.195
 You-0.191
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
aches+0.086
cking+0.074
cks+0.073
els+0.072
ck+0.072
Neg logit contributions
 Can-0.057
 Please-0.056
 You-0.049
 sorry-0.048
 Come-0.047
Token started
Token ID2067
Feature activation+1.36e+00

Pos logit contributions
aches+0.028
cking+0.024
cks+0.024
els+0.023
ck+0.023
Neg logit contributions
 Can-0.023
 Please-0.022
 You-0.020
 sorry-0.019
 Come-0.019
Token eating
Token ID6600
Feature activation-2.76e+00

Pos logit contributions
 Can+0.021
 Please+0.020
 sorry+0.017
 You+0.017
 Come+0.016
Neg logit contributions
aches-0.040
cking-0.035
cks-0.035
els-0.035
ck-0.034
Token them
Token ID606
Feature activation-1.40e+00

Pos logit contributions
aches+0.067
cking+0.058
cks+0.057
ck+0.056
els+0.055
Neg logit contributions
 Can-0.057
 Please-0.055
 You-0.050
 sorry-0.049
 Come-0.047
Token he
Token ID339
Feature activation-5.36e-02

Pos logit contributions
 Can+0.021
 Please+0.021
 You+0.018
 Come+0.017
 sorry+0.017
Neg logit contributions
aches-0.036
cking-0.031
cks-0.030
ck-0.030
els-0.030
Token ate
Token ID15063
Feature activation-2.40e+00

Pos logit contributions
aches+0.001
ck+0.001
cking+0.001
cks+0.001
els+0.001
Neg logit contributions
 Can-0.001
 Please-0.001
 You-0.001
 sorry-0.001
 Come-0.001
Token all
Token ID477
Feature activation-4.45e+00

Pos logit contributions
aches+0.062
cking+0.056
cks+0.055
els+0.053
ck+0.053
Neg logit contributions
 Can-0.044
 Please-0.043
 You-0.039
 sorry-0.038
 I-0.036
Token the
Token ID262
Feature activation-2.41e+00

Pos logit contributions
aches+0.117
cking+0.107
cks+0.104
els+0.102
ck+0.101
Neg logit contributions
 Can-0.078
 Please-0.072
 You-0.072
 I-0.067
 Come-0.064
Token ch
Token ID442
Feature activation-1.78e+00

Pos logit contributions
aches+0.061
cking+0.056
cks+0.056
els+0.054
ck+0.053
Neg logit contributions
 Can-0.045
 Please-0.043
 You-0.039
 sorry-0.037
 guilty-0.036
Tokenocol
Token ID4668
Feature activation-1.02e+01

Pos logit contributions
aches+0.033
cking+0.030
ics+0.029
Back+0.029
cks+0.028
Neg logit contributions
 Please-0.037
 Can-0.036
 sorry-0.034
 You-0.034
 guilty-0.032
Tokenates
Token ID689
Feature activation-3.96e+00

Pos logit contributions
aches+0.176
cking+0.172
glers+0.165
ck+0.155
ge+0.153
Neg logit contributions
 Please-0.177
 Come-0.168
 Just-0.167
 Can-0.164
 You-0.164
Token and
Token ID290
Feature activation-1.74e+00

Pos logit contributions
aches+0.111
cking+0.100
cks+0.099
els+0.096
ck+0.096
Neg logit contributions
 Can-0.062
 Please-0.062
 You-0.054
 sorry-0.051
 I-0.051
Token sweets
Token ID42402
Feature activation-3.07e+00

Pos logit contributions
aches+0.043
cking+0.037
cks+0.037
els+0.036
ck+0.035
Neg logit contributions
 Can-0.035
 Please-0.035
 You-0.030
 sorry-0.030
 Come-0.029
Token.
Token ID13
Feature activation+1.75e+00

Pos logit contributions
aches+0.089
cks+0.083
cking+0.082
els+0.081
ck+0.080
Neg logit contributions
 Can-0.043
 Please-0.042
 You-0.035
 I-0.034
 sorry-0.034
Token Then
Token ID3244
Feature activation+1.55e+00

Pos logit contributions
 Please+0.023
 Can+0.022
 sorry+0.021
 mean+0.019
 guilty+0.019
Neg logit contributions
aches-0.049
ric-0.042
els-0.042
ries-0.042
cks-0.041
Token and
Token ID290
Feature activation-1.08e+00

Pos logit contributions
aches+0.078
cking+0.068
cks+0.067
els+0.066
ck+0.066
Neg logit contributions
 Can-0.045
 Please-0.044
 You-0.038
 sorry-0.037
 Come-0.036
Token started
Token ID2067
Feature activation+1.44e+00

Pos logit contributions
aches+0.028
cking+0.025
cks+0.024
els+0.024
ck+0.023
Neg logit contributions
 Can-0.021
 Please-0.021
 You-0.018
 sorry-0.017
 Come-0.017
Token eating
Token ID6600
Feature activation-2.17e+00

Pos logit contributions
 Can+0.017
 Please+0.017
 sorry+0.013
 You+0.013
 Come+0.013
Neg logit contributions
aches-0.048
cking-0.043
cks-0.042
ck-0.042
els-0.041
Token the
Token ID262
Feature activation-1.23e+00

Pos logit contributions
aches+0.058
cking+0.051
cks+0.050
ck+0.049
els+0.049
Neg logit contributions
 Can-0.040
 Please-0.039
 You-0.034
 sorry-0.034
 Come-0.032
Token ch
Token ID442
Feature activation-7.99e-01

Pos logit contributions
aches+0.028
cking+0.022
ries+0.022
els+0.022
ck+0.021
Neg logit contributions
 Please-0.028
 Can-0.028
 Come-0.024
 sorry-0.024
 You-0.024
Tokenocol
Token ID4668
Feature activation-9.58e+00

Pos logit contributions
aches+0.013
cking+0.010
cks+0.010
ck+0.009
ric+0.009
Neg logit contributions
 Can-0.023
 Please-0.023
 You-0.021
 sorry-0.021
 Come-0.020
Tokenates
Token ID689
Feature activation-3.00e+00

Pos logit contributions
cking+0.169
aches+0.168
ge+0.155
ck+0.152
glers+0.152
Neg logit contributions
 Please-0.168
 Come-0.163
 Just-0.158
 Can-0.156
 You-0.156
Token.
Token ID13
Feature activation+2.02e+00

Pos logit contributions
aches+0.086
cking+0.077
cks+0.076
ck+0.074
els+0.073
Neg logit contributions
 Can-0.046
 Please-0.046
 You-0.040
 sorry-0.037
 I-0.036
Token The
Token ID383
Feature activation-6.59e-01

Pos logit contributions
 Can+0.025
 Please+0.025
 sorry+0.022
 You+0.021
 guilty+0.020
Neg logit contributions
aches-0.059
cks-0.051
ries-0.050
els-0.050
cking-0.050
Token shop
Token ID6128
Feature activation+3.77e-01

Pos logit contributions
aches+0.015
cking+0.013
cks+0.013
ck+0.012
els+0.012
Neg logit contributions
 Can-0.015
 Please-0.015
 You-0.013
 sorry-0.013
 Come-0.013
Tokenkeeper
Token ID13884
Feature activation-1.73e+00

Pos logit contributions
 Can+0.007
 Please+0.007
 You+0.006
 sorry+0.006
 Come+0.006
Neg logit contributions
aches-0.010
cking-0.008
cks-0.008
els-0.008
ck-0.008
Tokenbled
Token ID9342
Feature activation-3.90e+00

Pos logit contributions
aches+0.052
cking+0.041
cks+0.040
ck+0.039
els+0.038
Neg logit contributions
 Can-0.086
 Please-0.086
 You-0.080
 sorry-0.079
 Come-0.078
Token up
Token ID510
Feature activation-2.75e+00

Pos logit contributions
aches+0.119
cking+0.108
cks+0.108
ric+0.107
ge+0.101
Neg logit contributions
 Can-0.048
 Please-0.045
 You-0.041
 sorry-0.038
 Come-0.037
Token all
Token ID477
Feature activation-4.85e+00

Pos logit contributions
aches+0.073
cks+0.069
cking+0.069
ric+0.067
ck+0.065
Neg logit contributions
 Can-0.044
 Please-0.042
 sorry-0.041
 You-0.039
 guilty-0.038
Token the
Token ID262
Feature activation-2.82e+00

Pos logit contributions
aches+0.131
cking+0.118
cks+0.116
ric+0.114
ck+0.111
Neg logit contributions
 Can-0.082
 You-0.074
 Please-0.074
 I-0.069
 But-0.067
Token ch
Token ID442
Feature activation-1.54e+00

Pos logit contributions
aches+0.071
cking+0.065
ric+0.064
cks+0.064
els+0.061
Neg logit contributions
 Can-0.052
 Please-0.049
 You-0.044
 sorry-0.042
 guilty-0.042
Tokenocol
Token ID4668
Feature activation-9.53e+00

Pos logit contributions
aches+0.028
cking+0.025
ric+0.024
ics+0.024
glers+0.023
Neg logit contributions
 Please-0.033
 sorry-0.032
 Can-0.032
 You-0.032
 guilty-0.030
Tokenates
Token ID689
Feature activation-4.50e+00

Pos logit contributions
aches+0.163
cking+0.158
ge+0.153
glers+0.149
ck+0.142
Neg logit contributions
 Come-0.164
 Please-0.159
 You-0.151
 Can-0.151
 Just-0.148
Token,
Token ID11
Feature activation+9.12e-01

Pos logit contributions
aches+0.136
cks+0.123
cking+0.122
els+0.118
ric+0.118
Neg logit contributions
 Can-0.059
 Please-0.059
 You-0.051
 sorry-0.049
 I-0.048
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
 Can+0.016
 Please+0.015
 You+0.013
 sorry+0.013
 Come+0.013
Neg logit contributions
aches-0.026
cking-0.022
cks-0.022
els-0.022
ck-0.022
Token she
Token ID673
Feature activation-1.50e+00

Pos logit contributions
aches+0.027
cks+0.024
cking+0.024
els+0.023
ric+0.023
Neg logit contributions
 Can-0.020
 Please-0.020
 sorry-0.017
 You-0.017
 Come-0.016
Token was
Token ID373
Feature activation+1.02e+00

Pos logit contributions
aches+0.042
cking+0.038
cks+0.038
els+0.038
ries+0.037
Neg logit contributions
 Please-0.023
 Can-0.023
 sorry-0.019
 You-0.019
 I-0.019
Token giving
Token ID3501
Feature activation-1.57e+00

Pos logit contributions
aches+0.086
els+0.076
cking+0.075
ries+0.073
cks+0.072
Neg logit contributions
 Please-0.061
 Can-0.061
 You-0.057
 sorry-0.056
 I-0.052
Token him
Token ID683
Feature activation-1.45e+00

Pos logit contributions
aches+0.042
cking+0.035
cks+0.035
els+0.035
ck+0.034
Neg logit contributions
 Can-0.030
 Please-0.029
 You-0.025
 sorry-0.025
 Come-0.025
Token too
Token ID1165
Feature activation-9.22e-01

Pos logit contributions
aches+0.040
cking+0.037
cks+0.036
els+0.035
ck+0.035
Neg logit contributions
 Can-0.023
 Please-0.023
 sorry-0.020
 You-0.020
 Come-0.019
Token many
Token ID867
Feature activation-3.90e+00

Pos logit contributions
aches+0.026
cking+0.023
cks+0.022
ck+0.022
els+0.022
Neg logit contributions
 Can-0.016
 Please-0.015
 You-0.013
 Come-0.013
 sorry-0.013
Token ch
Token ID442
Feature activation-1.48e+00

Pos logit contributions
aches+0.102
cking+0.085
els+0.084
cks+0.084
ck+0.083
Neg logit contributions
 Can-0.075
 Please-0.074
 You-0.065
 Come-0.064
 sorry-0.064
Tokenocol
Token ID4668
Feature activation-9.34e+00

Pos logit contributions
aches+0.030
cking+0.025
cks+0.024
ck+0.023
ries+0.022
Neg logit contributions
 Please-0.035
 Can-0.034
 sorry-0.032
 You-0.031
 guilty-0.030
Tokenates
Token ID689
Feature activation-2.78e+00

Pos logit contributions
aches+0.149
cking+0.133
ck+0.121
glers+0.121
ge+0.112
Neg logit contributions
 Please-0.219
 Can-0.204
 Come-0.198
 You-0.198
 Just-0.188
Token."
Token ID526
Feature activation-1.14e+00

Pos logit contributions
aches+0.079
cking+0.068
cks+0.067
els+0.066
ck+0.066
Neg logit contributions
 Can-0.047
 Please-0.046
 You-0.040
 sorry-0.039
 Come-0.038
Token The
Token ID383
Feature activation-1.11e+00

Pos logit contributions
aches+0.030
cking+0.025
cks+0.025
ck+0.024
els+0.024
Neg logit contributions
 Can-0.022
 Please-0.021
 You-0.019
 sorry-0.018
 Come-0.018
Token king
Token ID5822
Feature activation-3.00e+00

Pos logit contributions
aches+0.031
cking+0.026
cks+0.026
ck+0.025
els+0.025
Neg logit contributions
 Can-0.020
 Please-0.020
 You-0.017
 Come-0.017
 sorry-0.016
Token learned
Token ID4499
Feature activation-3.88e-01

Pos logit contributions
aches+0.083
cking+0.076
cks+0.074
els+0.072
irs+0.071
Neg logit contributions
 Please-0.049
 Can-0.047
 sorry-0.041
 You-0.041
 guilty-0.038
Token and
Token ID290
Feature activation-2.33e+00

Pos logit contributions
 Can+0.003
 Please+0.003
 You+0.003
 Come+0.002
 sorry+0.002
Neg logit contributions
aches-0.003
cking-0.002
cks-0.002
ck-0.002
els-0.002
Token laughed
Token ID13818
Feature activation+3.22e-01

Pos logit contributions
aches+0.064
els+0.054
cks+0.054
cking+0.053
ries+0.052
Neg logit contributions
 Can-0.043
 Please-0.043
 sorry-0.038
 You-0.036
 Come-0.035
Token and
Token ID290
Feature activation-2.32e+00

Pos logit contributions
 Can+0.007
 Please+0.007
 Come+0.006
 You+0.006
 Give+0.006
Neg logit contributions
aches-0.007
cking-0.006
ck-0.006
cks-0.006
els-0.006
Token ate
Token ID15063
Feature activation-2.65e+00

Pos logit contributions
aches+0.059
cking+0.050
cks+0.049
els+0.049
ck+0.048
Neg logit contributions
 Can-0.047
 Please-0.046
 You-0.040
 sorry-0.040
 Come-0.039
Token ch
Token ID442
Feature activation-2.51e+00

Pos logit contributions
aches+0.067
cking+0.057
cks+0.056
els+0.055
ck+0.055
Neg logit contributions
 Can-0.054
 Please-0.053
 You-0.047
 sorry-0.046
 Come-0.045
Tokenocol
Token ID4668
Feature activation-9.28e+00

Pos logit contributions
aches+0.049
cking+0.037
Back+0.037
ge+0.037
ics+0.036
Neg logit contributions
 sorry-0.059
 Can-0.056
 Please-0.056
 You-0.053
 guilty-0.051
Tokenates
Token ID689
Feature activation-4.63e+00

Pos logit contributions
aches+0.168
cking+0.146
ge+0.138
ck+0.131
eling+0.125
Neg logit contributions
 Come-0.188
 Please-0.185
 You-0.183
 Can-0.182
 Just-0.163
Token.
Token ID13
Feature activation+3.19e+00

Pos logit contributions
aches+0.140
cking+0.126
cks+0.125
els+0.121
ric+0.121
Neg logit contributions
 Can-0.061
 Please-0.060
 sorry-0.051
 You-0.051
 Come-0.046
Token They
Token ID1119
Feature activation-1.74e+00

Pos logit contributions
 Please+0.056
 Can+0.054
 You+0.046
 sorry+0.046
 guilty+0.045
Neg logit contributions
aches-0.079
cks-0.070
els-0.068
cking-0.068
ck-0.067
Token had
Token ID550
Feature activation+1.70e-01

Pos logit contributions
aches+0.051
cks+0.047
cking+0.047
els+0.047
ries+0.046
Neg logit contributions
 Please-0.025
 Can-0.024
 sorry-0.022
 guilty-0.021
 mean-0.021
Token a
Token ID257
Feature activation+1.21e+00

Pos logit contributions
 Can+0.002
 Please+0.002
 You+0.002
 sorry+0.002
 Come+0.002
Neg logit contributions
aches-0.005
cking-0.005
cks-0.005
els-0.005
ck-0.005
Token and
Token ID290
Feature activation-1.69e+00

Pos logit contributions
 Can+0.010
 Please+0.009
 You+0.008
 Come+0.008
 sorry+0.008
Neg logit contributions
aches-0.015
cking-0.012
ck-0.012
els-0.012
cks-0.012
Token they
Token ID484
Feature activation-2.04e+00

Pos logit contributions
aches+0.042
cking+0.036
cks+0.036
els+0.035
ck+0.035
Neg logit contributions
 Can-0.034
 Please-0.034
 You-0.030
 sorry-0.029
 Come-0.029
Token shared
Token ID4888
Feature activation-3.44e+00

Pos logit contributions
aches+0.049
cks+0.045
cking+0.044
ric+0.043
els+0.042
Neg logit contributions
 Please-0.039
 Can-0.037
 sorry-0.034
 You-0.032
 guilty-0.032
Token the
Token ID262
Feature activation-2.07e+00

Pos logit contributions
aches+0.101
ric+0.097
cking+0.097
igi+0.095
rim+0.094
Neg logit contributions
 Can-0.040
 sorry-0.038
 right-0.035
 Please-0.035
 But-0.033
Token ch
Token ID442
Feature activation-1.15e+00

Pos logit contributions
aches+0.054
cking+0.046
cks+0.045
els+0.044
ck+0.044
Neg logit contributions
 Can-0.040
 Please-0.040
 You-0.035
 sorry-0.034
 Come-0.034
Tokenocol
Token ID4668
Feature activation-9.22e+00

Pos logit contributions
aches+0.019
cking+0.017
cks+0.016
ck+0.015
ics+0.014
Neg logit contributions
 Can-0.030
 Please-0.030
 You-0.029
 sorry-0.028
 Come-0.027
Tokenates
Token ID689
Feature activation-4.67e+00

Pos logit contributions
cking+0.164
ge+0.151
aches+0.149
glers+0.146
eling+0.146
Neg logit contributions
 Come-0.150
 Please-0.145
 You-0.142
 Can-0.140
 Just-0.137
Token with
Token ID351
Feature activation-3.46e+00

Pos logit contributions
aches+0.128
cks+0.119
cking+0.119
ric+0.116
els+0.114
Neg logit contributions
 Please-0.063
 Can-0.063
 You-0.058
 sorry-0.056
 I-0.054
Token everyone
Token ID2506
Feature activation-1.97e+00

Pos logit contributions
aches+0.091
cking+0.083
cks+0.079
ric+0.078
ge+0.077
Neg logit contributions
 Please-0.058
 Can-0.058
 sorry-0.054
 You-0.053
 guilty-0.049
Token in
Token ID287
Feature activation-4.64e+00

Pos logit contributions
aches+0.053
cking+0.045
cks+0.045
els+0.044
ck+0.044
Neg logit contributions
 Can-0.037
 Please-0.036
 You-0.032
 sorry-0.032
 Come-0.030
Token the
Token ID262
Feature activation-2.22e+00

Pos logit contributions
aches+0.130
cking+0.109
ck+0.103
cks+0.102
ants+0.101
Neg logit contributions
 Can-0.090
 Please-0.087
 sorry-0.078
 You-0.077
 I-0.072
Token lived
Token ID5615
Feature activation-2.62e+00

Pos logit contributions
aches+0.050
cking+0.043
cks+0.042
els+0.042
ck+0.041
Neg logit contributions
 Can-0.035
 Please-0.034
 sorry-0.030
 You-0.030
 Come-0.028
Token happily
Token ID18177
Feature activation-1.01e+00

Pos logit contributions
aches+0.063
cking+0.054
cks+0.054
els+0.054
ck+0.052
Neg logit contributions
 Please-0.054
 Can-0.054
 sorry-0.049
 You-0.048
 Come-0.046
Token ever
Token ID1683
Feature activation-1.92e+00

Pos logit contributions
aches+0.030
cking+0.026
cks+0.026
els+0.025
ck+0.025
Neg logit contributions
 Can-0.016
 Please-0.016
 You-0.014
 sorry-0.013
 Come-0.013
Token after
Token ID706
Feature activation-3.35e+00

Pos logit contributions
aches+0.055
cking+0.049
ck+0.048
cks+0.047
els+0.046
Neg logit contributions
 Can-0.031
 Please-0.029
 You-0.025
 sorry-0.025
 Come-0.025
Token.
Token ID13
Feature activation-1.15e-01

Pos logit contributions
aches+0.097
cking+0.085
cks+0.084
els+0.083
ck+0.082
Neg logit contributions
 Can-0.055
 Please-0.053
 You-0.047
 sorry-0.046
 Come-0.043
Token
Token ID198
Feature activation-8.94e+00

Pos logit contributions
aches+0.004
cking+0.003
els+0.003
ck+0.003
cks+0.003
Neg logit contributions
 Can-0.002
 Please-0.002
 You-0.002
 Come-0.001
 I-0.001
Token
Token ID198
Feature activation-7.59e+00

Pos logit contributions
aches+0.255
els+0.242
ge+0.231
ries+0.230
cking+0.220
Neg logit contributions
 Can-0.130
 sorry-0.129
 Please-0.123
 You-0.122
 It-0.114
TokenThe
Token ID464
Feature activation-2.57e+00

Pos logit contributions
aches+0.182
els+0.182
cks+0.170
ries+0.166
ric+0.161
Neg logit contributions
 Can-0.142
 Please-0.141
 You-0.128
 Come-0.120
 sorry-0.120
Token moral
Token ID6573
Feature activation-1.33e+00

Pos logit contributions
aches+0.065
cks+0.055
cking+0.055
els+0.054
ck+0.054
Neg logit contributions
 Can-0.052
 Please-0.051
 sorry-0.046
 You-0.045
 Come-0.043
Token of
Token ID286
Feature activation-7.22e+00

Pos logit contributions
aches+0.025
ck+0.019
els+0.018
cking+0.018
cks+0.018
Neg logit contributions
 Can-0.034
 Please-0.033
 sorry-0.031
 You-0.030
 Come-0.030
Token the
Token ID262
Feature activation-2.38e+00

Pos logit contributions
aches+0.204
cking+0.174
cks+0.172
els+0.172
ge+0.165
Neg logit contributions
 Can-0.125
 Please-0.118
 You-0.116
 sorry-0.106
 I-0.104
Token son
Token ID3367
Feature activation+1.91e+00

Pos logit contributions
 Can+0.019
 Please+0.019
 You+0.017
 Come+0.017
 sorry+0.016
Neg logit contributions
aches-0.024
els-0.019
cking-0.019
cks-0.019
ck-0.019
Token is
Token ID318
Feature activation+1.34e+00

Pos logit contributions
 Can+0.032
 Please+0.031
 Come+0.026
 sorry+0.026
 You+0.025
Neg logit contributions
aches-0.054
cks-0.044
cking-0.043
ck-0.043
els-0.043
Token eating
Token ID6600
Feature activation-1.58e+00

Pos logit contributions
 Can+0.024
 Please+0.023
 Come+0.021
 You+0.020
 Yes+0.019
Neg logit contributions
aches-0.037
cking-0.032
cks-0.031
ck-0.030
els-0.030
Token our
Token ID674
Feature activation-1.20e+00

Pos logit contributions
aches+0.042
cking+0.036
cks+0.035
ck+0.035
els+0.035
Neg logit contributions
 Can-0.030
 Please-0.029
 You-0.026
 sorry-0.025
 Come-0.025
Token ch
Token ID442
Feature activation-9.10e-01

Pos logit contributions
aches+0.031
cking+0.027
cks+0.026
ck+0.026
els+0.026
Neg logit contributions
 Can-0.023
 Please-0.023
 You-0.020
 sorry-0.019
 Come-0.019
Tokenocol
Token ID4668
Feature activation-8.89e+00

Pos logit contributions
aches+0.016
cking+0.014
ck+0.013
cks+0.013
ics+0.013
Neg logit contributions
 Please-0.023
 Can-0.022
 You-0.021
 sorry-0.020
 Come-0.019
Tokenates
Token ID689
Feature activation-3.11e+00

Pos logit contributions
aches+0.158
cking+0.156
glers+0.143
ck+0.139
ge+0.138
Neg logit contributions
 Please-0.166
 Come-0.154
 Just-0.151
 You-0.150
 Can-0.149
Token!"
Token ID2474
Feature activation-1.45e+00

Pos logit contributions
aches+0.091
cking+0.079
cks+0.078
els+0.077
ck+0.077
Neg logit contributions
 Can-0.050
 Please-0.049
 You-0.042
 sorry-0.041
 Come-0.040
Token
Token ID220
Feature activation-1.96e+00

Pos logit contributions
aches+0.038
cking+0.032
cks+0.032
ries+0.031
ck+0.031
Neg logit contributions
 Can-0.026
 Please-0.025
 sorry-0.022
 You-0.021
 Come-0.021
Token
Token ID198
Feature activation-7.18e+00

Pos logit contributions
aches+0.056
cking+0.049
els+0.047
ck+0.047
cks+0.047
Neg logit contributions
 Please-0.034
 Can-0.033
 You-0.030
 Come-0.029
 sorry-0.028
Token
Token ID198
Feature activation-6.07e+00

Pos logit contributions
aches+0.210
els+0.187
cking+0.185
cks+0.182
ries+0.180
Neg logit contributions
 Please-0.126
 Can-0.124
 You-0.107
 Come-0.103
 sorry-0.101
Token't
Token ID470
Feature activation+6.74e-02

Pos logit contributions
 Please+0.018
 Can+0.018
 You+0.016
 I+0.015
 sorry+0.015
Neg logit contributions
aches-0.035
cking-0.033
cks-0.033
ries-0.031
els-0.031
Token have
Token ID423
Feature activation+6.63e-01

Pos logit contributions
 Please+0.001
 Can+0.001
 You+0.001
 sorry+0.001
 Come+0.001
Neg logit contributions
aches-0.002
cking-0.002
cks-0.002
els-0.001
ries-0.001
Token all
Token ID477
Feature activation-1.99e+00

Pos logit contributions
 Please+0.011
 Can+0.011
 sorry+0.009
 You+0.009
 I+0.009
Neg logit contributions
aches-0.019
cking-0.017
cks-0.017
els-0.016
ck-0.016
Token these
Token ID777
Feature activation-9.47e-01

Pos logit contributions
aches+0.050
cking+0.043
cks+0.043
ck+0.042
els+0.042
Neg logit contributions
 Can-0.040
 Please-0.039
 You-0.035
 sorry-0.034
 Come-0.033
Token ch
Token ID442
Feature activation-8.22e-01

Pos logit contributions
aches+0.025
cking+0.021
cks+0.021
els+0.020
ck+0.020
Neg logit contributions
 Can-0.018
 Please-0.018
 You-0.016
 sorry-0.015
 Come-0.015
Tokenocol
Token ID4668
Feature activation-8.78e+00

Pos logit contributions
aches+0.014
cking+0.012
ck+0.011
cks+0.011
ics+0.010
Neg logit contributions
 Please-0.021
 Can-0.021
 You-0.020
 sorry-0.020
 Come-0.019
Tokenates
Token ID689
Feature activation-2.39e+00

Pos logit contributions
aches+0.149
cking+0.142
glers+0.129
ck+0.128
ge+0.125
Neg logit contributions
 Please-0.167
 Come-0.161
 Can-0.158
 You-0.157
 Just-0.151
Token!
Token ID0
Feature activation+3.65e+00

Pos logit contributions
aches+0.068
cking+0.059
cks+0.058
els+0.057
ck+0.057
Neg logit contributions
 Can-0.040
 Please-0.040
 You-0.034
 sorry-0.033
 Come-0.033
Token Now
Token ID2735
Feature activation+2.46e+00

Pos logit contributions
 guilty+0.028
 sorry+0.027
 Can+0.027
 Please+0.026
 guy+0.025
Neg logit contributions
aches-0.111
ric-0.103
els-0.102
cks-0.100
ries-0.098
Token,
Token ID11
Feature activation+1.30e+00

Pos logit contributions
 Can+0.037
 Please+0.035
 You+0.029
 Come+0.029
 sorry+0.029
Neg logit contributions
aches-0.075
cking-0.064
cks-0.063
ck-0.063
els-0.062
Token go
Token ID467
Feature activation+1.17e+00

Pos logit contributions
 Can+0.020
 Please+0.019
 You+0.016
 sorry+0.016
 Come+0.015
Neg logit contributions
aches-0.039
cking-0.033
ck-0.032
els-0.032
cks-0.032
Token never
Token ID1239
Feature activation-2.02e+00

Pos logit contributions
aches+0.047
cking+0.040
cks+0.039
ck+0.039
els+0.039
Neg logit contributions
 Can-0.033
 Please-0.032
 You-0.028
 sorry-0.027
 Come-0.027
Token felt
Token ID2936
Feature activation+1.89e+00

Pos logit contributions
aches+0.052
cking+0.044
cks+0.043
ck+0.043
els+0.043
Neg logit contributions
 Can-0.040
 Please-0.039
 You-0.035
 sorry-0.034
 Come-0.033
Token guilty
Token ID6717
Feature activation+3.48e+00

Pos logit contributions
 Please+0.031
 Can+0.031
 Come+0.028
 You+0.027
 Give+0.025
Neg logit contributions
aches-0.053
cking-0.045
ck-0.043
cks-0.041
els-0.040
Token again
Token ID757
Feature activation-1.13e+00

Pos logit contributions
 Please+0.077
 Can+0.074
 Come+0.068
 You+0.067
 sorry+0.066
Neg logit contributions
aches-0.068
els-0.055
ge-0.055
cking-0.054
ries-0.054
Token.
Token ID13
Feature activation+3.72e-01

Pos logit contributions
aches+0.032
cking+0.028
cks+0.027
els+0.027
ck+0.027
Neg logit contributions
 Can-0.019
 Please-0.019
 You-0.016
 sorry-0.016
 Come-0.015
Token
Token ID198
Feature activation-8.78e+00

Pos logit contributions
 Can+0.006
 Please+0.006
 You+0.006
 Come+0.005
 I+0.005
Neg logit contributions
aches-0.011
cking-0.010
els-0.010
ck-0.010
cks-0.010
Token
Token ID198
Feature activation-7.46e+00

Pos logit contributions
aches+0.247
els+0.238
ries+0.234
ge+0.229
cking+0.217
Neg logit contributions
 sorry-0.126
 Can-0.125
 You-0.122
 Please-0.117
 Come-0.117
TokenThe
Token ID464
Feature activation-1.94e+00

Pos logit contributions
els+0.179
aches+0.175
ries+0.173
cks+0.158
ge+0.156
Neg logit contributions
 Please-0.144
 Can-0.143
 You-0.141
 Come-0.128
 I-0.126
Token moral
Token ID6573
Feature activation-1.36e+00

Pos logit contributions
aches+0.045
cking+0.038
cks+0.037
ck+0.036
els+0.036
Neg logit contributions
 Can-0.043
 Please-0.043
 You-0.038
 sorry-0.037
 Come-0.037
Token of
Token ID286
Feature activation-7.23e+00

Pos logit contributions
aches+0.026
ck+0.020
els+0.019
cking+0.019
cks+0.018
Neg logit contributions
 Can-0.034
 Please-0.033
 sorry-0.031
 You-0.030
 Come-0.030
Token the
Token ID262
Feature activation-2.02e+00

Pos logit contributions
aches+0.207
cking+0.177
els+0.174
cks+0.172
ge+0.172
Neg logit contributions
 Can-0.123
 Please-0.115
 You-0.114
 sorry-0.105
 Come-0.103
Token Lucy
Token ID22162
Feature activation-3.22e+00

Pos logit contributions
aches+0.005
cking+0.004
els+0.004
cks+0.004
ck+0.004
Neg logit contributions
 Can-0.003
 Please-0.003
 You-0.003
 Come-0.002
 sorry-0.002
Token was
Token ID373
Feature activation+4.78e-01

Pos logit contributions
aches+0.090
cking+0.078
cks+0.078
els+0.076
ries+0.075
Neg logit contributions
 Can-0.055
 Please-0.055
 You-0.047
 sorry-0.047
 Come-0.045
Token so
Token ID523
Feature activation+5.72e-01

Pos logit contributions
 Can+0.006
 Please+0.006
 You+0.005
 Come+0.004
 sorry+0.004
Neg logit contributions
aches-0.016
cking-0.014
cks-0.014
ck-0.014
els-0.013
Token happy
Token ID3772
Feature activation+1.27e+00

Pos logit contributions
 Can+0.009
 Please+0.009
 sorry+0.008
 You+0.007
 guilty+0.007
Neg logit contributions
aches-0.017
els-0.015
cking-0.015
cks-0.015
ries-0.015
Token!
Token ID0
Feature activation-6.50e-01

Pos logit contributions
 Can+0.026
 Please+0.025
 sorry+0.022
 You+0.022
 Come+0.022
Neg logit contributions
aches-0.031
cking-0.027
cks-0.026
ck-0.026
ries-0.025
Token
Token ID198
Feature activation-8.78e+00

Pos logit contributions
aches+0.018
cking+0.015
cks+0.015
els+0.015
ck+0.015
Neg logit contributions
 Can-0.012
 Please-0.012
 You-0.010
 sorry-0.010
 Come-0.010
Token
Token ID198
Feature activation-7.43e+00

Pos logit contributions
aches+0.252
els+0.238
ries+0.234
cking+0.228
ge+0.220
Neg logit contributions
 Can-0.125
 You-0.121
 sorry-0.119
 Come-0.117
 Please-0.117
TokenThe
Token ID464
Feature activation-2.15e+00

Pos logit contributions
els+0.175
aches+0.172
ries+0.169
cks+0.166
ishes+0.165
Neg logit contributions
 Please-0.148
 Can-0.144
 You-0.132
 Come-0.129
�-0.126
Token next
Token ID1306
Feature activation+6.56e-02

Pos logit contributions
aches+0.054
cking+0.045
cks+0.045
els+0.044
ck+0.044
Neg logit contributions
 Can-0.044
 Please-0.044
 You-0.039
 sorry-0.038
 Come-0.037
Token day
Token ID1110
Feature activation-2.69e-02

Pos logit contributions
 Can+0.001
 Please+0.001
 You+0.001
 sorry+0.001
 Come+0.001
Neg logit contributions
aches-0.002
cking-0.002
ck-0.002
els-0.001
cks-0.001
Token,
Token ID11
Feature activation-1.45e-01

Pos logit contributions
aches+0.001
cking+0.001
els+0.001
cks+0.001
ck+0.001
Neg logit contributions
 Can-0.000
 Please-0.000
 You-0.000
 sorry-0.000
 Come-0.000
Token had
Token ID550
Feature activation-6.83e-02

Pos logit contributions
aches+0.014
cking+0.012
cks+0.012
ck+0.012
els+0.012
Neg logit contributions
 Can-0.009
 Please-0.009
 You-0.008
 sorry-0.008
 Come-0.008
Token been
Token ID587
Feature activation+1.45e+00

Pos logit contributions
aches+0.002
cking+0.002
cks+0.002
els+0.002
ck+0.001
Neg logit contributions
 Can-0.001
 Please-0.001
 You-0.001
 sorry-0.001
 Come-0.001
Token resolved
Token ID12939
Feature activation+6.99e-01

Pos logit contributions
 Can+0.026
 Please+0.026
 You+0.022
 sorry+0.022
 Come+0.022
Neg logit contributions
aches-0.040
cking-0.034
cks-0.033
ck-0.033
els-0.033
Token.
Token ID13
Feature activation-1.51e-01

Pos logit contributions
 Can+0.013
 Please+0.013
 sorry+0.011
 Come+0.011
 You+0.011
Neg logit contributions
aches-0.018
cking-0.016
els-0.016
cks-0.015
ck-0.015
Token
Token ID220
Feature activation-2.46e+00

Pos logit contributions
aches+0.005
cking+0.004
els+0.004
ck+0.004
cks+0.004
Neg logit contributions
 Can-0.003
 Please-0.002
 You-0.002
 Come-0.002
 I-0.002
Token
Token ID198
Feature activation-8.77e+00

Pos logit contributions
aches+0.074
cking+0.066
ck+0.065
els+0.064
cks+0.063
Neg logit contributions
 Please-0.039
 Can-0.038
 You-0.037
 Come-0.034
 sorry-0.033
Token
Token ID198
Feature activation-7.47e+00

Pos logit contributions
aches+0.314
els+0.310
ries+0.295
cking+0.285
ishes+0.284
Neg logit contributions
 Can-0.086
 You-0.079
 Please-0.079
 sorry-0.063
 I-0.063
TokenThey
Token ID2990
Feature activation-2.11e+00

Pos logit contributions
aches+0.179
els+0.175
cks+0.169
ries+0.166
ishes+0.161
Neg logit contributions
 Please-0.155
 Can-0.153
 You-0.142
 I-0.131
 Come-0.126
Token both
Token ID1111
Feature activation-2.66e+00

Pos logit contributions
aches+0.056
cks+0.053
cking+0.052
ries+0.051
els+0.050
Neg logit contributions
 Please-0.033
 Can-0.032
 sorry-0.031
 guilty-0.030
 I-0.029
Token left
Token ID1364
Feature activation-2.87e+00

Pos logit contributions
aches+0.072
cking+0.064
cks+0.063
els+0.062
ries+0.061
Neg logit contributions
 Please-0.048
 Can-0.047
 sorry-0.042
 You-0.041
 guilty-0.040
Token the
Token ID262
Feature activation-2.54e+00

Pos logit contributions
aches+0.070
cking+0.060
cks+0.059
ck+0.058
els+0.057
Neg logit contributions
 Can-0.060
 Please-0.059
 You-0.053
 sorry-0.053
 Come-0.051
Token his
Token ID465
Feature activation-1.77e+00

Pos logit contributions
aches+0.093
cking+0.078
cks+0.077
els+0.075
ck+0.075
Neg logit contributions
 Can-0.089
 Please-0.088
 You-0.079
 sorry-0.077
 Come-0.076
Token hard
Token ID1327
Feature activation-3.15e-01

Pos logit contributions
aches+0.043
cking+0.035
ck+0.035
els+0.034
cks+0.034
Neg logit contributions
 Can-0.037
 Please-0.036
 Come-0.031
 You-0.031
 Give-0.029
Token work
Token ID670
Feature activation-2.07e+00

Pos logit contributions
aches+0.006
cking+0.005
els+0.005
ck+0.005
cks+0.005
Neg logit contributions
 Can-0.008
 Please-0.008
 Come-0.007
 You-0.007
 sorry-0.007
Token.
Token ID13
Feature activation-8.22e-01

Pos logit contributions
aches+0.051
cking+0.043
cks+0.042
ck+0.041
els+0.041
Neg logit contributions
 Can-0.043
 Please-0.041
 Come-0.036
 You-0.036
 sorry-0.035
Token
Token ID220
Feature activation-2.70e+00

Pos logit contributions
aches+0.023
cking+0.020
cks+0.019
ck+0.019
els+0.019
Neg logit contributions
 Can-0.015
 Please-0.014
 You-0.013
 Come-0.012
 sorry-0.012
Token
Token ID198
Feature activation-8.76e+00

Pos logit contributions
aches+0.075
cking+0.065
ck+0.063
els+0.063
cks+0.062
Neg logit contributions
 Can-0.048
 Please-0.047
 You-0.044
 Come-0.041
 sorry-0.041
Token
Token ID198
Feature activation-7.37e+00

Pos logit contributions
aches+0.294
els+0.270
ries+0.266
cking+0.261
ge+0.249
Neg logit contributions
 Can-0.101
 You-0.101
 Please-0.094
 Come-0.089
 sorry-0.088
TokenAnd
Token ID1870
Feature activation-1.64e-01

Pos logit contributions
aches+0.171
ries+0.158
els+0.156
cks+0.148
cking+0.143
Neg logit contributions
 Please-0.154
 Can-0.153
 You-0.149
 Come-0.138
 Yes-0.133
Token they
Token ID484
Feature activation-2.08e+00

Pos logit contributions
aches+0.004
cking+0.004
cks+0.004
els+0.004
ck+0.004
Neg logit contributions
 Can-0.003
 Please-0.003
 You-0.003
 sorry-0.003
 Come-0.003
Token lived
Token ID5615
Feature activation-2.68e+00

Pos logit contributions
aches+0.053
cking+0.050
ries+0.049
cks+0.049
els+0.048
Neg logit contributions
 Please-0.035
 Can-0.034
 You-0.033
 sorry-0.033
 I-0.031
Token happily
Token ID18177
Feature activation-1.49e+00

Pos logit contributions
aches+0.061
cking+0.051
cks+0.050
els+0.049
ck+0.049
Neg logit contributions
 Can-0.061
 Please-0.060
 You-0.054
 sorry-0.053
 Come-0.052
Tokenapped
Token ID6320
Feature activation-2.11e+00

Pos logit contributions
aches+0.031
cking+0.026
cks+0.026
ck+0.025
ries+0.024
Neg logit contributions
 Please-0.021
 Can-0.021
 You-0.019
 sorry-0.018
 Come-0.018
Token and
Token ID290
Feature activation-2.44e+00

Pos logit contributions
aches+0.057
cking+0.048
cks+0.048
els+0.047
ck+0.047
Neg logit contributions
 Can-0.039
 Please-0.039
 You-0.034
 sorry-0.033
 Come-0.033
Token cheered
Token ID34767
Feature activation+3.41e-01

Pos logit contributions
aches+0.060
cking+0.051
cks+0.050
ck+0.049
els+0.049
Neg logit contributions
 Can-0.051
 Please-0.050
 You-0.044
 sorry-0.043
 Come-0.043
Token.
Token ID13
Feature activation-5.05e-01

Pos logit contributions
 Can+0.007
 Please+0.007
 Come+0.006
 sorry+0.006
 You+0.006
Neg logit contributions
aches-0.008
cking-0.007
ck-0.007
cks-0.007
els-0.007
Token
Token ID220
Feature activation-2.55e+00

Pos logit contributions
aches+0.017
cking+0.015
ck+0.015
els+0.015
cks+0.015
Neg logit contributions
 Can-0.007
 You-0.007
 Please-0.007
 Come-0.006
 But-0.006
Token
Token ID198
Feature activation-8.74e+00

Pos logit contributions
aches+0.078
cking+0.069
els+0.067
ck+0.067
ries+0.065
Neg logit contributions
 Can-0.038
 Please-0.038
 You-0.038
 Come-0.035
 sorry-0.033
Token
Token ID198
Feature activation-7.39e+00

Pos logit contributions
aches+0.313
els+0.304
ries+0.298
cking+0.283
ishes+0.276
Neg logit contributions
 You-0.079
 Can-0.077
 But-0.066
 Come-0.066
 It-0.066
TokenThe
Token ID464
Feature activation-1.89e+00

Pos logit contributions
ries+0.188
els+0.187
aches+0.187
ishes+0.174
cks+0.172
Neg logit contributions
 Can-0.129
 Please-0.129
 You-0.125
�-0.117
 Come-0.111
Token k
Token ID479
Feature activation-1.16e+00

Pos logit contributions
aches+0.050
cks+0.043
cking+0.043
els+0.041
ck+0.041
Neg logit contributions
 Can-0.037
 Please-0.037
 You-0.032
 sorry-0.032
 Come-0.031
Tokenitty
Token ID9760
Feature activation-1.05e+00

Pos logit contributions
aches+0.032
cks+0.027
cking+0.026
ck+0.025
els+0.024
Neg logit contributions
 Please-0.023
 Can-0.023
 You-0.021
 Come-0.019
 sorry-0.018
Token had
Token ID550
Feature activation-4.67e-02

Pos logit contributions
aches+0.025
ck+0.021
cking+0.021
els+0.020
cks+0.020
Neg logit contributions
 Can-0.022
 Please-0.021
 sorry-0.019
 You-0.018
 Come-0.018
Token took
Token ID1718
Feature activation-2.70e+00

Pos logit contributions
aches+0.002
cking+0.002
cks+0.002
ck+0.002
els+0.002
Neg logit contributions
 Can-0.001
 Please-0.001
 You-0.001
 sorry-0.001
 Come-0.001
Token some
Token ID617
Feature activation-3.27e+00

Pos logit contributions
aches+0.067
cking+0.057
cks+0.056
ck+0.056
els+0.056
Neg logit contributions
 Can-0.055
 Please-0.054
 You-0.048
 sorry-0.047
 Come-0.046
Token lovely
Token ID14081
Feature activation-1.18e+00

Pos logit contributions
aches+0.076
cking+0.064
cks+0.062
els+0.062
ck+0.062
Neg logit contributions
 Can-0.072
 Please-0.071
 You-0.064
 sorry-0.062
 Come-0.062
Token bows
Token ID37469
Feature activation-6.48e-01

Pos logit contributions
aches+0.026
cking+0.022
els+0.021
cks+0.021
ck+0.021
Neg logit contributions
 Can-0.027
 Please-0.027
 You-0.024
 sorry-0.024
 Come-0.023
Token and
Token ID290
Feature activation-1.36e+00

Pos logit contributions
aches+0.017
cking+0.015
els+0.015
cks+0.015
ck+0.014
Neg logit contributions
 Can-0.012
 Please-0.012
 You-0.010
 sorry-0.010
 Come-0.010
Token rib
Token ID12183
Feature activation-8.74e+00

Pos logit contributions
aches+0.033
cking+0.028
cks+0.027
els+0.027
ck+0.026
Neg logit contributions
 Can-0.029
 Please-0.028
 You-0.025
 sorry-0.025
 Come-0.024
Tokenbons
Token ID23461
Feature activation-4.28e+00

Pos logit contributions
aches+0.195
cking+0.162
ge+0.150
cks+0.143
ries+0.142
Neg logit contributions
 Can-0.193
 Please-0.186
 You-0.180
 Come-0.168
 It-0.167
Token and
Token ID290
Feature activation-1.11e+00

Pos logit contributions
aches+0.127
cking+0.113
cks+0.111
ck+0.108
els+0.107
Neg logit contributions
 Can-0.064
 Please-0.062
 You-0.054
 sorry-0.051
 Come-0.050
Token used
Token ID973
Feature activation-2.68e+00

Pos logit contributions
aches+0.025
cking+0.022
cks+0.021
els+0.020
ries+0.020
Neg logit contributions
 Can-0.025
 Please-0.024
 You-0.022
 sorry-0.022
 I-0.021
Token them
Token ID606
Feature activation-1.87e+00

Pos logit contributions
aches+0.058
cking+0.049
cks+0.046
ck+0.044
ries+0.044
Neg logit contributions
 Can-0.063
 Please-0.060
 You-0.056
 sorry-0.055
 Yes-0.053
Token to
Token ID284
Feature activation-3.95e+00

Pos logit contributions
aches+0.059
cking+0.054
cks+0.053
ck+0.051
els+0.051
Neg logit contributions
 Can-0.024
 Please-0.023
 You-0.019
 sorry-0.019
 Come-0.018
Token lived
Token ID5615
Feature activation-2.72e+00

Pos logit contributions
aches+0.066
els+0.061
cking+0.060
ries+0.060
cks+0.059
Neg logit contributions
 Can-0.042
 Please-0.041
 sorry-0.039
 You-0.038
 I-0.036
Token happily
Token ID18177
Feature activation-6.75e-01

Pos logit contributions
aches+0.068
cking+0.060
cks+0.059
els+0.059
ric+0.057
Neg logit contributions
 Can-0.053
 Please-0.053
 sorry-0.048
 You-0.047
 Come-0.046
Token ever
Token ID1683
Feature activation-1.40e+00

Pos logit contributions
aches+0.022
cking+0.019
els+0.019
cks+0.019
ck+0.019
Neg logit contributions
 Please-0.009
 Can-0.009
 You-0.007
 sorry-0.007
 Come-0.007
Token after
Token ID706
Feature activation-3.20e+00

Pos logit contributions
aches+0.044
cking+0.038
cks+0.038
ck+0.037
els+0.037
Neg logit contributions
 Can-0.020
 Please-0.020
 You-0.016
 sorry-0.016
 Come-0.016
Token.
Token ID13
Feature activation-6.20e-02

Pos logit contributions
aches+0.098
cking+0.086
els+0.084
cks+0.084
ck+0.083
Neg logit contributions
 Can-0.046
 Please-0.044
 You-0.040
 sorry-0.040
 I-0.036
Token
Token ID198
Feature activation-8.74e+00

Pos logit contributions
aches+0.002
cking+0.002
els+0.002
ge+0.002
ck+0.002
Neg logit contributions
 Can-0.001
 You-0.001
 Please-0.001
 But-0.001
 Come-0.001
Token
Token ID198
Feature activation-7.40e+00

Pos logit contributions
aches+0.246
els+0.237
ries+0.231
ge+0.231
cking+0.214
Neg logit contributions
 sorry-0.124
 Can-0.122
 You-0.117
 Please-0.116
 It-0.111
TokenThe
Token ID464
Feature activation-1.74e+00

Pos logit contributions
els+0.178
aches+0.175
ries+0.170
ge+0.164
cks+0.161
Neg logit contributions
 Can-0.130
 Please-0.128
 You-0.122
 mean-0.116
 Come-0.114
Token moral
Token ID6573
Feature activation-9.22e-01

Pos logit contributions
aches+0.047
cks+0.041
cking+0.040
els+0.039
ries+0.038
Neg logit contributions
 Can-0.033
 Please-0.033
 sorry-0.029
 You-0.029
 Come-0.027
Token of
Token ID286
Feature activation-7.13e+00

Pos logit contributions
aches+0.019
ck+0.015
cking+0.015
cks+0.014
els+0.014
Neg logit contributions
 Can-0.022
 Please-0.022
 sorry-0.020
 You-0.019
 Come-0.019
Token the
Token ID262
Feature activation-1.81e+00

Pos logit contributions
aches+0.204
cking+0.172
els+0.171
ge+0.169
cks+0.168
Neg logit contributions
 Can-0.122
 Please-0.115
 You-0.113
 sorry-0.105
 Come-0.102
Token the
Token ID262
Feature activation-2.29e+00

Pos logit contributions
aches+0.074
ric+0.067
cks+0.066
cking+0.066
els+0.065
Neg logit contributions
 Please-0.031
 Can-0.029
 You-0.026
 sorry-0.025
 I-0.023
Token bear
Token ID6842
Feature activation-2.74e+00

Pos logit contributions
aches+0.055
cking+0.045
ck+0.044
els+0.043
ts+0.043
Neg logit contributions
 Please-0.044
 Can-0.043
 You-0.038
 Come-0.038
 gonna-0.035
Token again
Token ID757
Feature activation-2.29e+00

Pos logit contributions
aches+0.081
cking+0.071
cks+0.070
els+0.069
ck+0.068
Neg logit contributions
 Can-0.043
 Please-0.042
 You-0.037
 sorry-0.036
 Come-0.035
Token.
Token ID13
Feature activation-4.14e-01

Pos logit contributions
aches+0.066
cking+0.057
cks+0.057
els+0.056
ck+0.055
Neg logit contributions
 Can-0.038
 Please-0.038
 You-0.033
 sorry-0.032
 Come-0.031
Token
Token ID220
Feature activation-2.82e+00

Pos logit contributions
aches+0.012
cking+0.011
els+0.011
cks+0.011
ck+0.010
Neg logit contributions
 Can-0.007
 Please-0.007
 You-0.006
 Come-0.006
 sorry-0.006
Token
Token ID198
Feature activation-8.72e+00

Pos logit contributions
aches+0.081
cking+0.071
els+0.069
ck+0.069
cks+0.067
Neg logit contributions
 Can-0.048
 Please-0.047
 You-0.044
 Come-0.042
 sorry-0.041
Token
Token ID198
Feature activation-7.25e+00

Pos logit contributions
aches+0.308
els+0.297
ries+0.286
cking+0.279
ck+0.265
Neg logit contributions
 Can-0.086
 You-0.085
 Please-0.078
 Come-0.075
 But-0.073
TokenThe
Token ID464
Feature activation-2.76e+00

Pos logit contributions
aches+0.178
els+0.174
ries+0.172
cks+0.157
ishes+0.153
Neg logit contributions
 Can-0.143
 Please-0.143
 You-0.137
 Come-0.126
 I-0.123
Token boy
Token ID2933
Feature activation-2.15e+00

Pos logit contributions
aches+0.066
cking+0.055
ck+0.054
els+0.053
cks+0.051
Neg logit contributions
 Please-0.057
 Can-0.056
 You-0.050
 Come-0.049
 Yes-0.045
Token also
Token ID635
Feature activation-5.33e-01

Pos logit contributions
aches+0.057
cking+0.049
cks+0.048
els+0.048
ck+0.048
Neg logit contributions
 Can-0.040
 Please-0.039
 You-0.034
 sorry-0.034
 Come-0.033
Token remembers
Token ID18140
Feature activation+9.70e-01

Pos logit contributions
aches+0.014
cking+0.012
cks+0.012
ck+0.011
els+0.011
Neg logit contributions
 Can-0.010
 Please-0.010
 You-0.009
 sorry-0.009
 Come-0.009
Token not
Token ID407
Feature activation-4.31e-01

Pos logit contributions
aches+0.003
cking+0.003
cks+0.003
els+0.003
ck+0.003
Neg logit contributions
 Can-0.002
 Please-0.002
 You-0.002
 sorry-0.002
 Come-0.002
Token eat
Token ID4483
Feature activation-1.03e+00

Pos logit contributions
aches+0.011
cking+0.011
cks+0.011
els+0.010
ries+0.010
Neg logit contributions
 Please-0.008
 Can-0.008
 You-0.007
 I-0.006
 sorry-0.006
Token too
Token ID1165
Feature activation-6.72e-01

Pos logit contributions
aches+0.026
cking+0.023
cks+0.022
ck+0.022
els+0.022
Neg logit contributions
 Can-0.020
 Please-0.019
 sorry-0.017
 You-0.017
 Come-0.016
Token many
Token ID867
Feature activation-2.84e+00

Pos logit contributions
aches+0.020
cking+0.018
cks+0.018
els+0.018
ck+0.017
Neg logit contributions
 Can-0.010
 Please-0.010
 sorry-0.008
 You-0.008
 guilty-0.008
Token ch
Token ID442
Feature activation-1.06e+00

Pos logit contributions
aches+0.074
cking+0.066
cks+0.064
ck+0.062
ric+0.061
Neg logit contributions
 Can-0.053
 Please-0.051
 You-0.045
 sorry-0.044
 guilty-0.042
Tokenocol
Token ID4668
Feature activation-8.72e+00

Pos logit contributions
aches+0.018
cking+0.015
ck+0.014
Back+0.014
cks+0.013
Neg logit contributions
 Please-0.026
 Can-0.026
 sorry-0.025
 You-0.024
 guilty-0.023
Tokenates
Token ID689
Feature activation-2.14e+00

Pos logit contributions
aches+0.145
cking+0.134
ck+0.121
glers+0.118
Back+0.113
Neg logit contributions
 Please-0.191
 Can-0.180
 You-0.176
 Come-0.175
 Just-0.169
Token and
Token ID290
Feature activation-1.37e+00

Pos logit contributions
aches+0.060
cking+0.052
cks+0.052
ck+0.051
els+0.051
Neg logit contributions
 Can-0.037
 Please-0.036
 You-0.031
 sorry-0.030
 Come-0.030
Token his
Token ID465
Feature activation+1.04e+00

Pos logit contributions
aches+0.037
cking+0.031
cks+0.031
ck+0.030
els+0.030
Neg logit contributions
 Can-0.025
 Please-0.025
 You-0.022
 Come-0.021
 sorry-0.021
Token friend
Token ID1545
Feature activation-1.63e-01

Pos logit contributions
 Can+0.021
 Please+0.020
 You+0.018
 sorry+0.018
 Come+0.017
Neg logit contributions
aches-0.026
cking-0.022
cks-0.022
els-0.022
ck-0.022
Token learned
Token ID4499
Feature activation-5.97e-01

Pos logit contributions
aches+0.004
cking+0.004
ck+0.004
els+0.004
cks+0.004
Neg logit contributions
 Can-0.003
 Please-0.003
 You-0.003
 Come-0.002
 sorry-0.002
Token,
Token ID11
Feature activation-8.32e-01

Pos logit contributions
aches+0.003
cking+0.003
ck+0.003
els+0.002
cks+0.002
Neg logit contributions
 Can-0.003
 Please-0.002
 You-0.002
 Come-0.002
 Give-0.002
Token feeling
Token ID4203
Feature activation+8.84e-01

Pos logit contributions
aches+0.024
cking+0.021
cks+0.021
els+0.021
ck+0.020
Neg logit contributions
 Can-0.014
 Please-0.014
 You-0.012
 sorry-0.012
 Come-0.011
Token important
Token ID1593
Feature activation+1.62e+00

Pos logit contributions
 Can+0.015
 Please+0.015
 You+0.013
 Come+0.013
 Yes+0.012
Neg logit contributions
aches-0.024
cking-0.021
cks-0.020
ck-0.020
els-0.019
Token.
Token ID13
Feature activation-5.15e-01

Pos logit contributions
 Can+0.037
 Please+0.035
 Come+0.033
 You+0.032
 sorry+0.030
Neg logit contributions
aches-0.033
cking-0.027
ge-0.027
els-0.026
ck-0.025
Token
Token ID220
Feature activation-2.27e+00

Pos logit contributions
aches+0.015
cking+0.014
ck+0.013
cks+0.013
els+0.013
Neg logit contributions
 Can-0.009
 Please-0.008
 You-0.008
 Come-0.007
 sorry-0.007
Token
Token ID198
Feature activation-8.71e+00

Pos logit contributions
aches+0.068
cking+0.061
ck+0.059
els+0.058
cks+0.058
Neg logit contributions
 Please-0.035
 Can-0.035
 You-0.033
 Come-0.031
 sorry-0.031
Token
Token ID198
Feature activation-7.39e+00

Pos logit contributions
aches+0.322
els+0.318
ries+0.309
cking+0.303
ge+0.298
Neg logit contributions
 But-0.058
 You-0.057
 Can-0.055
 Come-0.054
 Please-0.050
TokenM
Token ID44
Feature activation-1.86e+00

Pos logit contributions
aches+0.174
els+0.174
ries+0.172
ishes+0.162
cks+0.161
Neg logit contributions
 Please-0.151
 Can-0.143
 Come-0.135
 You-0.128
 I-0.126
Tokenia
Token ID544
Feature activation-2.90e+00

Pos logit contributions
aches+0.037
cking+0.032
cks+0.032
ck+0.029
els+0.029
Neg logit contributions
 Please-0.046
 Can-0.046
 sorry-0.041
 You-0.041
 Come-0.041
Token loved
Token ID6151
Feature activation+7.75e-01

Pos logit contributions
aches+0.079
cking+0.069
cks+0.069
els+0.066
ck+0.065
Neg logit contributions
 Please-0.052
 Can-0.051
 You-0.044
 sorry-0.043
 Come-0.043
Token being
Token ID852
Feature activation+5.49e-01

Pos logit contributions
 Can+0.016
 Please+0.015
 You+0.014
 sorry+0.013
 Come+0.013
Neg logit contributions
aches-0.020
cking-0.017
cks-0.017
ck-0.016
els-0.016
Token you
Token ID345
Feature activation+5.05e-01

Pos logit contributions
 Can+0.004
 Please+0.003
 You+0.003
 Come+0.003
 Yes+0.003
Neg logit contributions
aches-0.006
ck-0.005
cking-0.005
ries-0.005
cks-0.005
Token out
Token ID503
Feature activation-5.80e-01

Pos logit contributions
 Can+0.009
 Please+0.009
 You+0.007
 sorry+0.007
 Come+0.007
Neg logit contributions
aches-0.014
cking-0.012
cks-0.012
ck-0.012
els-0.012
Token of
Token ID286
Feature activation-3.50e+00

Pos logit contributions
aches+0.014
cking+0.012
cks+0.012
ck+0.012
els+0.012
Neg logit contributions
 Can-0.011
 Please-0.011
 You-0.010
 Come-0.010
 sorry-0.010
Token the
Token ID262
Feature activation-8.88e-01

Pos logit contributions
aches+0.088
cking+0.075
cks+0.074
els+0.073
ck+0.072
Neg logit contributions
 Can-0.070
 Please-0.070
 You-0.061
 sorry-0.060
 Come-0.060
Token hole
Token ID7604
Feature activation-1.16e+00

Pos logit contributions
aches+0.017
cking+0.014
els+0.014
cks+0.014
ries+0.013
Neg logit contributions
 Can-0.021
 Please-0.021
 Come-0.020
 You-0.019
 Yes-0.019
Token.
Token ID13
Feature activation+5.78e+00

Pos logit contributions
aches+0.030
cks+0.025
cking+0.025
els+0.025
ries+0.024
Neg logit contributions
 Can-0.022
 Please-0.022
 Come-0.019
 sorry-0.019
 You-0.019
Token Just
Token ID2329
Feature activation+2.20e+00

Pos logit contributions
 guilty+0.059
 unfair+0.057
 wrong+0.056
 mean+0.053
 impossible+0.053
Neg logit contributions
aches-0.149
ric-0.147
cks-0.145
asted-0.139
els-0.139
Token stay
Token ID2652
Feature activation+2.63e-01

Pos logit contributions
 Can+0.045
 Please+0.044
 You+0.039
 Come+0.038
 sorry+0.038
Neg logit contributions
aches-0.055
cking-0.046
cks-0.046
ck-0.045
els-0.045
Token calm
Token ID9480
Feature activation+7.23e-01

Pos logit contributions
 Can+0.005
 Please+0.004
 You+0.004
 Come+0.004
 Yes+0.004
Neg logit contributions
aches-0.007
cking-0.007
ck-0.006
cks-0.006
els-0.006
Token and
Token ID290
Feature activation-1.32e+00

Pos logit contributions
 Can+0.015
 Please+0.014
 sorry+0.014
 Yes+0.014
 guilty+0.014
Neg logit contributions
aches-0.013
els-0.012
ries-0.012
cking-0.012
cks-0.011
Token settle
Token ID12259
Feature activation-1.31e-01

Pos logit contributions
aches+0.035
cking+0.030
ck+0.030
cks+0.029
ric+0.027
Neg logit contributions
 Can-0.024
 Please-0.023
 You-0.021
 Come-0.020
 Yes-0.019
Token\n
Token ID198
Feature activation-7.12e+00

Pos logit contributions
 Please+0.056
 Can+0.056
 You+0.048
 sorry+0.047
 Come+0.046
Neg logit contributions
aches-0.081
els-0.069
cks-0.069
cking-0.068
ries-0.067
Token\n
Token ID198
Feature activation-6.12e+00

Pos logit contributions
aches+0.229
cking+0.202
ries+0.193
els+0.193
cks+0.191
Neg logit contributions
 Can-0.102
 Please-0.098
 You-0.088
 Come-0.088
 sorry-0.084
Token"
Token ID1
Feature activation-1.56e+00

Pos logit contributions
aches+0.151
cking+0.126
cks+0.124
els+0.122
ries+0.120
Neg logit contributions
 Please-0.133
 Can-0.132
 You-0.116
 Come-0.114
 sorry-0.113
TokenWhat
Token ID2061
Feature activation+3.47e+00

Pos logit contributions
aches+0.041
cking+0.035
cks+0.034
els+0.034
ck+0.033
Neg logit contributions
 Can-0.031
 Please-0.031
 You-0.027
 sorry-0.026
 Come-0.026
Token's
Token ID338
Feature activation+4.12e+00

Pos logit contributions
 Please+0.069
 Can+0.067
 Just+0.058
 guilty+0.058
 You+0.058
Neg logit contributions
aches-0.080
els-0.063
ries-0.061
ons-0.061
cks-0.060
Token wrong
Token ID2642
Feature activation+5.71e+00

Pos logit contributions
 Can+0.059
 Please+0.057
 Come+0.046
 You+0.046
 sorry+0.041
Neg logit contributions
aches-0.132
cking-0.115
cks-0.114
ck-0.113
els-0.111
Token,
Token ID11
Feature activation+4.74e+00

Pos logit contributions
 Can+0.107
 Please+0.102
 You+0.095
 Goodbye+0.093
 guilty+0.093
Neg logit contributions
aches-0.139
els-0.117
ries-0.115
ck-0.109
ran-0.106
Token sweet
Token ID6029
Feature activation+1.06e+00

Pos logit contributions
 Can+0.080
 Please+0.079
 Come+0.069
 You+0.065
 Just+0.064
Neg logit contributions
aches-0.132
els-0.112
ries-0.111
ck-0.110
cking-0.108
Tokenie
Token ID494
Feature activation+2.31e+00

Pos logit contributions
 Can+0.036
 Please+0.036
 You+0.033
 sorry+0.033
 Come+0.033
Neg logit contributions
aches-0.012
cking-0.008
cks-0.007
els-0.007
ck-0.007
Token?"
Token ID1701
Feature activation-1.75e+00

Pos logit contributions
 Can+0.046
 Please+0.045
 Yes+0.040
 You+0.040
 Come+0.040
Neg logit contributions
aches-0.057
ck-0.047
cking-0.047
els-0.046
cks-0.045
Token Mom
Token ID11254
Feature activation-2.40e+00

Pos logit contributions
aches+0.043
ck+0.036
els+0.036
cking+0.036
cks+0.036
Neg logit contributions
 Can-0.034
 Please-0.033
 You-0.029
 Come-0.029
 Yes-0.026
Token angry
Token ID7954
Feature activation+3.34e+00

Pos logit contributions
 Can+0.023
 Please+0.022
 You+0.019
 Come+0.018
 sorry+0.016
Neg logit contributions
aches-0.050
cking-0.044
cks-0.043
ck-0.043
els-0.042
Token and
Token ID290
Feature activation-2.35e-01

Pos logit contributions
 Can+0.072
 Please+0.070
 sorry+0.062
 Come+0.062
 Goodbye+0.061
Neg logit contributions
aches-0.066
els-0.059
ries-0.058
cking-0.056
cks-0.055
Token said
Token ID531
Feature activation+3.91e+00

Pos logit contributions
aches+0.007
cking+0.006
ck+0.006
cks+0.005
els+0.005
Neg logit contributions
 Can-0.004
 Please-0.004
 Come-0.003
 You-0.003
 Yes-0.003
Token '
Token ID705
Feature activation-2.39e+00

Pos logit contributions
 Can+0.069
 Please+0.066
 You+0.059
 Come+0.058
 sorry+0.051
Neg logit contributions
aches-0.108
cking-0.087
els-0.087
ries-0.086
ck-0.085
TokenWhy
Token ID5195
Feature activation+2.53e+00

Pos logit contributions
aches+0.063
cks+0.054
cking+0.053
els+0.052
ric+0.051
Neg logit contributions
 Please-0.048
 Can-0.047
 You-0.041
 Come-0.039
 I-0.036
Token did
Token ID750
Feature activation+5.04e+00

Pos logit contributions
 Can+0.040
 Please+0.039
 You+0.033
 sorry+0.032
 Come+0.032
Neg logit contributions
aches-0.076
cking-0.065
cks-0.064
els-0.064
ck-0.063
Token you
Token ID345
Feature activation+2.13e+00

Pos logit contributions
 Can+0.072
 Please+0.068
 You+0.057
 Come+0.056
 sorry+0.055
Neg logit contributions
aches-0.157
cking-0.135
cks-0.134
ck-0.133
els-0.132
Token say
Token ID910
Feature activation+3.97e+00

Pos logit contributions
 Please+0.039
 Can+0.038
 You+0.034
 sorry+0.034
 I+0.032
Neg logit contributions
aches-0.055
cking-0.050
cks-0.050
els-0.048
ries-0.047
Token no
Token ID645
Feature activation+1.09e+00

Pos logit contributions
 Can+0.070
 Please+0.067
 Come+0.059
 You+0.058
 sorry+0.054
Neg logit contributions
aches-0.109
cking-0.090
cks-0.089
els-0.088
ck-0.088
Token?'
Token ID8348
Feature activation-7.83e-01

Pos logit contributions
 Can+0.020
 Please+0.019
 sorry+0.017
 You+0.017
 Come+0.016
Neg logit contributions
aches-0.029
els-0.024
cking-0.024
ck-0.024
ries-0.024
Token The
Token ID383
Feature activation-1.29e+00

Pos logit contributions
aches+0.021
cking+0.018
els+0.018
cks+0.018
ck+0.018
Neg logit contributions
 Can-0.014
 Please-0.013
 You-0.011
 sorry-0.011
 Come-0.011
Token came
Token ID1625
Feature activation-8.62e-02

Pos logit contributions
aches+0.056
cking+0.048
cks+0.048
els+0.047
ck+0.047
Neg logit contributions
 Can-0.034
 Please-0.034
 sorry-0.029
 You-0.029
 Come-0.028
Token to
Token ID284
Feature activation-4.59e+00

Pos logit contributions
aches+0.002
cking+0.002
cks+0.002
els+0.002
ck+0.002
Neg logit contributions
 Can-0.001
 Please-0.001
 You-0.001
 sorry-0.001
 Come-0.001
Token see
Token ID766
Feature activation-1.18e+00

Pos logit contributions
aches+0.124
cking+0.106
cks+0.104
els+0.103
ck+0.100
Neg logit contributions
 Can-0.088
 Please-0.087
 You-0.075
 sorry-0.074
 Come-0.074
Token what
Token ID644
Feature activation+1.42e+00

Pos logit contributions
aches+0.029
cking+0.024
cks+0.023
ck+0.023
els+0.022
Neg logit contributions
 Can-0.026
 Please-0.025
 You-0.024
 sorry-0.023
 Come-0.022
Token was
Token ID373
Feature activation+4.40e+00

Pos logit contributions
 Can+0.025
 Please+0.025
 You+0.021
 sorry+0.021
 Come+0.021
Neg logit contributions
aches-0.039
cking-0.033
cks-0.033
els-0.032
ck-0.032
Token wrong
Token ID2642
Feature activation+4.21e+00

Pos logit contributions
 Can+0.049
 Please+0.046
 Come+0.034
 You+0.033
 Yes+0.029
Neg logit contributions
aches-0.157
cking-0.142
cks-0.140
els-0.135
ck-0.135
Token.
Token ID13
Feature activation+4.23e+00

Pos logit contributions
 Can+0.095
 Please+0.092
 sorry+0.080
 Come+0.079
 You+0.079
Neg logit contributions
aches-0.093
els-0.079
ries-0.075
cks-0.075
cking-0.075
Token She
Token ID1375
Feature activation-2.07e+00

Pos logit contributions
 ok+0.082
 Can+0.081
 Please+0.079
 pleasure+0.079
 brav+0.077
Neg logit contributions
aches-0.066
els-0.064
ries-0.061
cks-0.057
Her-0.056
Token saw
Token ID2497
Feature activation-2.00e+00

Pos logit contributions
aches+0.065
cking+0.056
cks+0.056
els+0.054
ric+0.054
Neg logit contributions
 Can-0.031
 Please-0.031
 sorry-0.025
 Come-0.025
 You-0.024
Token Max
Token ID5436
Feature activation+1.46e+00

Pos logit contributions
aches+0.064
cking+0.055
cks+0.054
ck+0.054
els+0.052
Neg logit contributions
 Can-0.029
 Please-0.027
 You-0.024
 sorry-0.024
 Come-0.022
Token on
Token ID319
Feature activation-3.31e+00

Pos logit contributions
 Can+0.024
 Please+0.024
 You+0.020
 sorry+0.020
 Come+0.019
Neg logit contributions
aches-0.042
cking-0.036
cks-0.036
els-0.035
ck-0.035
Token They
Token ID1119
Feature activation-2.57e+00

Pos logit contributions
 Please+0.084
 Can+0.083
 You+0.069
 sorry+0.068
 Come+0.068
Neg logit contributions
aches-0.098
cks-0.090
els-0.088
cking-0.084
ric-0.082
Token were
Token ID547
Feature activation+1.52e-01

Pos logit contributions
aches+0.073
cks+0.066
cking+0.065
ries+0.064
els+0.063
Neg logit contributions
 Please-0.037
 Can-0.036
 sorry-0.035
 guilty-0.034
 mean-0.033
Token too
Token ID1165
Feature activation+1.57e+00

Pos logit contributions
 Can+0.002
 Please+0.002
 sorry+0.002
 You+0.002
 Come+0.002
Neg logit contributions
aches-0.005
cking-0.004
cks-0.004
els-0.004
ck-0.004
Token far
Token ID1290
Feature activation-1.75e-01

Pos logit contributions
 Please+0.018
 Can+0.017
 sorry+0.015
 guilty+0.013
 You+0.013
Neg logit contributions
aches-0.054
els-0.048
cks-0.047
cking-0.047
ries-0.047
Token away
Token ID1497
Feature activation+4.38e-01

Pos logit contributions
aches+0.004
cking+0.003
cks+0.003
ck+0.003
els+0.003
Neg logit contributions
 Can-0.004
 Please-0.004
 You-0.003
 Come-0.003
 sorry-0.003
Token.
Token ID13
Feature activation+4.62e+00

Pos logit contributions
 Can+0.008
 Please+0.008
 You+0.007
 sorry+0.007
 Come+0.007
Neg logit contributions
aches-0.012
cking-0.010
cks-0.010
els-0.010
ck-0.010
Token They
Token ID1119
Feature activation-1.61e+00

Pos logit contributions
 Please+0.087
 Can+0.087
 sorry+0.073
 You+0.071
 Come+0.070
Neg logit contributions
aches-0.101
cks-0.094
els-0.091
cking-0.088
ric-0.087
Token did
Token ID750
Feature activation+8.47e-01

Pos logit contributions
aches+0.045
cking+0.039
cks+0.039
els+0.038
ries+0.038
Neg logit contributions
 Can-0.027
 Please-0.027
 sorry-0.024
 guilty-0.022
 You-0.022
Token not
Token ID407
Feature activation-4.66e-01

Pos logit contributions
 Can+0.013
 Please+0.013
 sorry+0.011
 You+0.011
 Come+0.010
Neg logit contributions
aches-0.025
cking-0.022
els-0.022
cks-0.022
ries-0.021
Token see
Token ID766
Feature activation+1.19e-01

Pos logit contributions
aches+0.013
cking+0.011
ck+0.011
cks+0.011
els+0.010
Neg logit contributions
 Can-0.009
 Please-0.009
 You-0.008
 Come-0.007
 sorry-0.007
Token him
Token ID683
Feature activation+1.80e+00

Pos logit contributions
 Can+0.002
 Please+0.002
 sorry+0.002
 You+0.002
 guilty+0.002
Neg logit contributions
aches-0.003
cking-0.003
cks-0.002
ck-0.002
els-0.002
Token is
Token ID318
Feature activation+1.69e+00

Pos logit contributions
 Can+0.000
 Please+0.000
 You+0.000
 sorry+0.000
 Come+0.000
Neg logit contributions
aches-0.001
cking-0.001
cks-0.000
els-0.000
ck-0.000
Token to
Token ID284
Feature activation-2.65e+00

Pos logit contributions
 Can+0.030
 Please+0.030
 You+0.026
 Come+0.025
 sorry+0.025
Neg logit contributions
aches-0.047
cking-0.039
cks-0.038
ck-0.038
els-0.038
Token care
Token ID1337
Feature activation-7.25e-02

Pos logit contributions
aches+0.066
cking+0.062
cks+0.061
els+0.060
ries+0.059
Neg logit contributions
 Please-0.049
 Can-0.048
 sorry-0.044
 You-0.043
 I-0.042
Token for
Token ID329
Feature activation-4.13e+00

Pos logit contributions
aches+0.002
cking+0.002
cks+0.002
ck+0.002
els+0.002
Neg logit contributions
 Can-0.001
 Please-0.001
 You-0.001
 sorry-0.001
 Come-0.001
Token something
Token ID1223
Feature activation+1.52e+00

Pos logit contributions
aches+0.104
cks+0.094
cking+0.094
ric+0.088
ck+0.087
Neg logit contributions
 Can-0.075
 Please-0.073
 sorry-0.069
 You-0.067
 guilty-0.063
Token,
Token ID11
Feature activation+1.63e+00

Pos logit contributions
 Can+0.028
 Please+0.028
 You+0.024
 Come+0.024
 sorry+0.023
Neg logit contributions
aches-0.041
cking-0.034
cks-0.034
ck-0.033
els-0.033
Token so
Token ID523
Feature activation+5.90e-01

Pos logit contributions
 Can+0.023
 Please+0.023
 Come+0.020
 You+0.019
 sorry+0.019
Neg logit contributions
aches-0.048
cking-0.041
ck-0.041
els-0.040
cks-0.040
Token he
Token ID339
Feature activation-5.20e-01

Pos logit contributions
 Can+0.010
 Please+0.009
 You+0.008
 Come+0.008
 sorry+0.008
Neg logit contributions
aches-0.017
cking-0.015
cks-0.015
ck-0.015
els-0.014
Token said
Token ID531
Feature activation+3.18e+00

Pos logit contributions
aches+0.013
cking+0.011
ck+0.011
cks+0.010
els+0.010
Neg logit contributions
 Can-0.011
 Please-0.010
 You-0.010
 sorry-0.009
 Come-0.009
Token that
Token ID326
Feature activation+8.11e-01

Pos logit contributions
 Can+0.056
 Please+0.054
 Come+0.049
 You+0.048
 guy+0.043
Neg logit contributions
aches-0.085
cking-0.066
ck-0.066
els-0.063
cks-0.063
Token she
Token ID673
Feature activation+4.83e-01

Pos logit contributions
 Can+0.014
 Please+0.013
 You+0.012
 sorry+0.011
 Come+0.011
Neg logit contributions
aches-0.023
cking-0.020
cks-0.020
els-0.020
ck-0.020
Token He
Token ID679
Feature activation+4.46e-01

Pos logit contributions
 Please+0.057
 Can+0.055
 sorry+0.048
 You+0.043
 guilty+0.043
Neg logit contributions
aches-0.124
cks-0.111
cking-0.107
els-0.107
ck-0.105
Token was
Token ID373
Feature activation+4.49e+00

Pos logit contributions
 Can+0.009
 Please+0.009
 You+0.008
 sorry+0.007
 Come+0.007
Neg logit contributions
aches-0.011
cking-0.010
cks-0.010
ck-0.010
els-0.009
Token 3
Token ID513
Feature activation+1.35e+00

Pos logit contributions
 Can+0.077
 Please+0.076
 Come+0.064
 You+0.062
 Give+0.057
Neg logit contributions
aches-0.123
cking-0.108
cks-0.105
ck-0.104
els-0.102
Token years
Token ID812
Feature activation+4.40e+00

Pos logit contributions
 Can+0.028
 Please+0.028
 sorry+0.025
 You+0.025
 guilty+0.024
Neg logit contributions
aches-0.032
cking-0.027
cks-0.027
ck-0.026
els-0.026
Token old
Token ID1468
Feature activation+2.14e+00

Pos logit contributions
 Can+0.069
 Please+0.068
 You+0.059
 Come+0.058
 sorry+0.053
Neg logit contributions
aches-0.129
els-0.110
cking-0.109
ries-0.108
ck-0.107
Token and
Token ID290
Feature activation+1.26e+00

Pos logit contributions
 Can+0.041
 Please+0.041
 Come+0.036
 sorry+0.036
 You+0.036
Neg logit contributions
aches-0.053
els-0.045
ries-0.044
cking-0.044
cks-0.043
Token loved
Token ID6151
Feature activation+3.37e+00

Pos logit contributions
 Can+0.024
 Please+0.024
 Come+0.021
 You+0.021
 sorry+0.020
Neg logit contributions
aches-0.033
cking-0.027
cks-0.027
ck-0.026
els-0.026
Token going
Token ID1016
Feature activation+4.74e+00

Pos logit contributions
 Can+0.046
 Please+0.045
 You+0.037
 sorry+0.035
 Come+0.035
Neg logit contributions
aches-0.108
cking-0.095
cks-0.094
ck-0.093
els-0.093
Token outside
Token ID2354
Feature activation+6.05e-01

Pos logit contributions
 Can+0.083
 Please+0.082
 You+0.070
 sorry+0.069
 Come+0.067
Neg logit contributions
aches-0.132
cking-0.113
cks-0.111
els-0.110
ck-0.109
Token in
Token ID287
Feature activation-2.46e+00

Pos logit contributions
 Can+0.012
 Please+0.011
 sorry+0.010
 You+0.010
 Come+0.010
Neg logit contributions
aches-0.015
cking-0.013
cks-0.013
els-0.013
ries-0.013
Token the
Token ID262
Feature activation-5.17e-02

Pos logit contributions
aches+0.067
cking+0.056
ck+0.055
cks+0.055
els+0.054
Neg logit contributions
 Can-0.049
 Please-0.046
 You-0.042
 I-0.040
 Come-0.040
Token and
Token ID290
Feature activation-8.05e-01

Pos logit contributions
aches+0.081
cking+0.070
cks+0.069
els+0.068
ck+0.068
Neg logit contributions
 Can-0.052
 Please-0.051
 You-0.044
 sorry-0.043
 Come-0.042
Token started
Token ID2067
Feature activation+1.96e+00

Pos logit contributions
aches+0.020
cking+0.017
cks+0.017
ck+0.017
els+0.017
Neg logit contributions
 Can-0.016
 Please-0.016
 You-0.014
 sorry-0.014
 Come-0.014
Token to
Token ID284
Feature activation-2.67e+00

Pos logit contributions
 Can+0.026
 Please+0.026
 You+0.021
 sorry+0.021
 Come+0.020
Neg logit contributions
aches-0.063
cking-0.055
cks-0.055
els-0.054
ck-0.054
Token seem
Token ID1283
Feature activation+2.71e+00

Pos logit contributions
aches+0.071
cking+0.062
cks+0.062
els+0.061
ries+0.060
Neg logit contributions
 Can-0.048
 Please-0.048
 sorry-0.043
 You-0.042
 Come-0.040
Token less
Token ID1342
Feature activation+2.05e+00

Pos logit contributions
 Can+0.043
 Please+0.043
 You+0.038
 Come+0.037
 sorry+0.033
Neg logit contributions
aches-0.078
cking-0.068
ck-0.066
cks-0.066
els-0.064
Token miserable
Token ID22444
Feature activation+1.42e+00

Pos logit contributions
 Can+0.027
 Please+0.025
 Come+0.022
 You+0.022
 sorry+0.019
Neg logit contributions
aches-0.066
cking-0.059
cks-0.057
ck-0.057
els-0.057
Token.
Token ID13
Feature activation+7.95e-01

Pos logit contributions
 Can+0.030
 Please+0.030
 sorry+0.026
 Come+0.026
 You+0.026
Neg logit contributions
aches-0.033
cking-0.028
els-0.028
cks-0.028
ries-0.027
Token\n
Token ID198
Feature activation-7.09e+00

Pos logit contributions
 Can+0.016
 Please+0.016
 sorry+0.015
 You+0.014
 Come+0.014
Neg logit contributions
aches-0.019
cks-0.015
cking-0.015
els-0.015
ck-0.015
TokenSuddenly
Token ID38582
Feature activation+1.81e+00

Pos logit contributions
aches+0.222
cking+0.198
els+0.197
ries+0.197
cks+0.192
Neg logit contributions
 Can-0.103
 Please-0.101
 You-0.088
 Come-0.085
 But-0.084
Token,
Token ID11
Feature activation+8.34e-01

Pos logit contributions
 Can+0.036
 Please+0.035
 You+0.032
 Come+0.031
 Yes+0.029
Neg logit contributions
aches-0.044
ries-0.037
els-0.036
ck-0.035
cking-0.035
Token she
Token ID673
Feature activation+2.39e-01

Pos logit contributions
 Can+0.017
 Please+0.016
 You+0.014
 Come+0.014
 sorry+0.014
Neg logit contributions
aches-0.021
cking-0.018
ck-0.017
els-0.017
ries-0.017
TokenOnce
Token ID7454
Feature activation+1.83e-01

Pos logit contributions
aches+0.035
cking+0.030
cks+0.029
els+0.029
ck+0.029
Neg logit contributions
 Can-0.033
 Please-0.033
 sorry-0.029
 You-0.029
 Come-0.028
Token upon
Token ID2402
Feature activation-8.14e-01

Pos logit contributions
 Can+0.004
 Please+0.004
 You+0.004
 sorry+0.004
 I+0.003
Neg logit contributions
aches-0.004
cking-0.004
cks-0.004
ck-0.003
els-0.003
Token a
Token ID257
Feature activation+3.70e+00

Pos logit contributions
aches+0.023
cking+0.020
cks+0.020
ck+0.020
els+0.019
Neg logit contributions
 Can-0.013
 Please-0.013
 You-0.011
 sorry-0.011
 Come-0.011
Token time
Token ID640
Feature activation+5.61e+00

Pos logit contributions
 Can+0.061
 Please+0.059
 You+0.051
 sorry+0.049
 Come+0.048
Neg logit contributions
aches-0.107
cking-0.093
ck-0.092
cks-0.092
els-0.088
Token,
Token ID11
Feature activation+3.89e+00

Pos logit contributions
 Please+0.098
 Can+0.097
 You+0.078
 Come+0.077
 sorry+0.076
Neg logit contributions
aches-0.151
cks-0.128
cking-0.126
els-0.126
ries-0.125
Token there
Token ID612
Feature activation-1.69e-02

Pos logit contributions
 Can+0.074
 Please+0.074
 You+0.063
 sorry+0.062
 Come+0.062
Neg logit contributions
aches-0.101
cking-0.086
cks-0.086
ck-0.084
els-0.084
Token was
Token ID373
Feature activation+5.87e+00

Pos logit contributions
aches+0.000
cking+0.000
cks+0.000
ck+0.000
els+0.000
Neg logit contributions
 Can-0.000
 Please-0.000
 You-0.000
 sorry-0.000
 Come-0.000
Token a
Token ID257
Feature activation+4.74e+00

Pos logit contributions
 Please+0.108
 Can+0.107
 You+0.089
 sorry+0.088
 Come+0.087
Neg logit contributions
aches-0.154
cking-0.132
ck-0.131
cks-0.130
els-0.129
Token child
Token ID1200
Feature activation+3.32e+00

Pos logit contributions
 Please+0.085
 Can+0.084
 Come+0.080
 You+0.078
 Just+0.071
Neg logit contributions
aches-0.117
cking-0.099
ck-0.098
cks-0.090
els-0.090
Token who
Token ID508
Feature activation+2.36e+00

Pos logit contributions
 Can+0.060
 Please+0.058
 You+0.051
 sorry+0.049
 Come+0.048
Neg logit contributions
aches-0.092
cking-0.079
cks-0.078
els-0.076
ck-0.076
Token loved
Token ID6151
Feature activation+3.56e+00

Pos logit contributions
 Can+0.051
 Please+0.050
 sorry+0.044
 You+0.044
 Come+0.043
Neg logit contributions
aches-0.055
ck-0.044
cking-0.044
cks-0.044
els-0.041
Token run
Token ID1057
Feature activation+7.21e-02

Pos logit contributions
 Can+0.004
 Please+0.003
 You+0.003
 sorry+0.003
 Come+0.003
Neg logit contributions
aches-0.005
cking-0.004
cks-0.004
els-0.004
ck-0.004
Token away
Token ID1497
Feature activation-1.75e+00

Pos logit contributions
 Can+0.002
 Please+0.002
 You+0.001
 Come+0.001
 sorry+0.001
Neg logit contributions
aches-0.001
cking-0.001
ck-0.001
ge-0.001
cks-0.001
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
aches+0.047
cking+0.041
cks+0.040
ck+0.039
els+0.039
Neg logit contributions
 Can-0.032
 Please-0.031
 You-0.027
 sorry-0.026
 Come-0.026
Token Now
Token ID2735
Feature activation+1.40e+00

Pos logit contributions
 Can+0.024
 Please+0.023
 sorry+0.020
 Come+0.018
 guilty+0.018
Neg logit contributions
aches-0.040
cks-0.036
els-0.035
ric-0.035
cking-0.034
Token he
Token ID339
Feature activation-8.16e-01

Pos logit contributions
 Can+0.025
 Please+0.025
 You+0.021
 sorry+0.021
 Come+0.021
Neg logit contributions
aches-0.038
cking-0.033
cks-0.032
ck-0.032
els-0.032
Token could
Token ID714
Feature activation+1.21e+00

Pos logit contributions
aches+0.023
ck+0.019
cks+0.019
cking+0.019
els+0.019
Neg logit contributions
 Can-0.014
 Please-0.014
 sorry-0.012
 You-0.012
 Come-0.011
Token eat
Token ID4483
Feature activation-2.04e+00

Pos logit contributions
 Can+0.022
 Please+0.022
 sorry+0.019
 You+0.019
 Come+0.018
Neg logit contributions
aches-0.032
cking-0.029
cks-0.028
els-0.028
ries-0.027
Token one
Token ID530
Feature activation-1.61e+00

Pos logit contributions
aches+0.050
cking+0.043
cks+0.042
els+0.041
ck+0.041
Neg logit contributions
 Can-0.042
 Please-0.041
 You-0.036
 sorry-0.036
 Come-0.035
Token of
Token ID286
Feature activation-5.10e+00

Pos logit contributions
aches+0.042
cking+0.035
cks+0.034
ck+0.034
els+0.034
Neg logit contributions
 Can-0.031
 Please-0.031
 You-0.027
 sorry-0.027
 Come-0.027
Token them
Token ID606
Feature activation-5.46e-01

Pos logit contributions
aches+0.123
cking+0.106
cks+0.102
els+0.101
ck+0.101
Neg logit contributions
 Can-0.104
 Please-0.101
 sorry-0.095
 You-0.094
 I-0.087
Token.
Token ID13
Feature activation+1.47e+00

Pos logit contributions
aches+0.016
cking+0.014
cks+0.013
els+0.013
ck+0.013
Neg logit contributions
 Can-0.009
 Please-0.009
 You-0.008
 sorry-0.007
 Come-0.007
Token not
Token ID407
Feature activation-8.30e-01

Pos logit contributions
 Can+0.004
 Please+0.004
 You+0.003
 sorry+0.003
 Come+0.003
Neg logit contributions
aches-0.006
cking-0.005
cks-0.005
ck-0.005
els-0.005
Token know
Token ID760
Feature activation+1.48e+00

Pos logit contributions
aches+0.024
ck+0.021
cks+0.020
cking+0.020
els+0.019
Neg logit contributions
 Can-0.014
 Please-0.013
 You-0.012
 Come-0.011
 sorry-0.011
Token.
Token ID13
Feature activation+3.53e+00

Pos logit contributions
 Can+0.032
 Please+0.031
 You+0.028
 sorry+0.028
 Come+0.027
Neg logit contributions
aches-0.035
cking-0.030
cks-0.029
ck-0.029
els-0.028
Token They
Token ID1119
Feature activation-2.00e+00

Pos logit contributions
 Can+0.066
 Please+0.066
 sorry+0.061
 guilty+0.060
 You+0.058
Neg logit contributions
aches-0.078
cks-0.069
els-0.066
cking-0.065
ck-0.064
Token take
Token ID1011
Feature activation-2.31e+00

Pos logit contributions
aches+0.055
ck+0.045
cking+0.045
cks+0.045
els+0.044
Neg logit contributions
 Can-0.038
 Please-0.036
 You-0.032
 Come-0.031
 sorry-0.030
Token the
Token ID262
Feature activation-2.24e+00

Pos logit contributions
aches+0.057
cking+0.048
cks+0.048
els+0.047
ck+0.047
Neg logit contributions
 Can-0.049
 Please-0.048
 You-0.042
 Come-0.041
 sorry-0.041
Token book
Token ID1492
Feature activation-2.25e+00

Pos logit contributions
aches+0.058
cking+0.049
cks+0.048
els+0.048
ck+0.047
Neg logit contributions
 Can-0.044
 Please-0.044
 You-0.039
 Come-0.038
 sorry-0.037
Token and
Token ID290
Feature activation-1.41e+00

Pos logit contributions
aches+0.061
cking+0.052
cks+0.051
els+0.051
ck+0.051
Neg logit contributions
 Can-0.041
 Please-0.041
 You-0.035
 sorry-0.034
 Come-0.034
Token say
Token ID910
Feature activation+2.41e+00

Pos logit contributions
aches+0.036
cking+0.032
cks+0.030
els+0.029
ric+0.029
Neg logit contributions
 Please-0.028
 Can-0.028
 sorry-0.024
 You-0.024
 I-0.023
Token thank
Token ID5875
Feature activation-1.05e+00

Pos logit contributions
 Can+0.030
 Please+0.030
 You+0.024
 Come+0.024
 guilty+0.020
Neg logit contributions
aches-0.078
cks-0.065
els-0.064
cking-0.064
ck-0.064
Token you
Token ID345
Feature activation+1.70e-04

Pos logit contributions
aches+0.036
cking+0.032
cks+0.031
els+0.030
ric+0.030
Neg logit contributions
 Please-0.013
 Can-0.012
 You-0.010
 sorry-0.010
 Come-0.009
Tokenled
Token ID992
Feature activation-1.07e+00

Pos logit contributions
aches+0.014
cks+0.011
cking+0.011
els+0.011
ric+0.011
Neg logit contributions
 Please-0.019
 Can-0.019
 sorry-0.017
 You-0.017
 guilty-0.017
Token as
Token ID355
Feature activation-7.70e-01

Pos logit contributions
aches+0.023
cking+0.020
ck+0.019
els+0.019
cks+0.019
Neg logit contributions
 Can-0.023
 Please-0.022
 Come-0.020
 You-0.020
 sorry-0.019
Token they
Token ID484
Feature activation-9.52e-01

Pos logit contributions
aches+0.018
cking+0.015
cks+0.014
ck+0.014
els+0.014
Neg logit contributions
 Can-0.017
 Please-0.017
 You-0.015
 Come-0.015
 sorry-0.014
Token ran
Token ID4966
Feature activation+9.49e-01

Pos logit contributions
aches+0.025
cks+0.023
els+0.022
cking+0.022
ries+0.022
Neg logit contributions
 Please-0.015
 Can-0.015
 sorry-0.013
 guilty-0.013
 I-0.013
Token around
Token ID1088
Feature activation-4.96e+00

Pos logit contributions
 Can+0.021
 Please+0.020
 You+0.018
 sorry+0.017
 Come+0.017
Neg logit contributions
aches-0.022
cking-0.018
ck-0.018
cks-0.018
els-0.017
Token the
Token ID262
Feature activation-1.73e+00

Pos logit contributions
aches+0.115
cking+0.097
cks+0.095
els+0.093
ck+0.093
Neg logit contributions
 Can-0.109
 Please-0.108
 You-0.096
 sorry-0.094
 Come-0.093
Token beach
Token ID10481
Feature activation-8.49e-01

Pos logit contributions
aches+0.040
cking+0.033
els+0.032
cks+0.032
ck+0.031
Neg logit contributions
 Can-0.039
 Please-0.038
 You-0.034
 Come-0.033
 sorry-0.033
Token.
Token ID13
Feature activation+1.59e+00

Pos logit contributions
aches+0.024
cking+0.021
cks+0.020
els+0.020
ck+0.020
Neg logit contributions
 Can-0.014
 Please-0.014
 You-0.012
 sorry-0.012
 Come-0.012
Token They
Token ID1119
Feature activation-1.61e+00

Pos logit contributions
 Please+0.027
 Can+0.026
 sorry+0.024
 guilty+0.022
 mean+0.022
Neg logit contributions
aches-0.039
ric-0.034
cks-0.034
els-0.034
cking-0.033
Token lay
Token ID3830
Feature activation-6.21e-01

Pos logit contributions
aches+0.043
cks+0.039
cking+0.038
els+0.038
ries+0.038
Neg logit contributions
 Can-0.027
 Please-0.027
 sorry-0.024
 guilty-0.023
 You-0.023
Token down
Token ID866
Feature activation-2.34e+00

Pos logit contributions
aches+0.018
cking+0.015
cks+0.015
ck+0.015
els+0.015
Neg logit contributions
 Can-0.010
 Please-0.010
 sorry-0.009
 You-0.009
 Come-0.008
Token cat
Token ID3797
Feature activation+1.06e+00

Pos logit contributions
 Please+0.025
 Can+0.024
 Come+0.021
 You+0.021
 sorry+0.021
Neg logit contributions
aches-0.026
ck-0.020
els-0.020
cking-0.020
ric-0.019
Token was
Token ID373
Feature activation+1.37e+00

Pos logit contributions
 Can+0.024
 Please+0.024
 Come+0.021
 You+0.021
 Yes+0.020
Neg logit contributions
aches-0.024
ck-0.019
els-0.019
cking-0.018
cks-0.018
Token not
Token ID407
Feature activation+1.32e+00

Pos logit contributions
 Can+0.024
 Please+0.023
 Come+0.020
 You+0.019
 Yes+0.018
Neg logit contributions
aches-0.039
cking-0.034
ck-0.033
cks-0.033
els-0.031
Token available
Token ID1695
Feature activation+1.46e+00

Pos logit contributions
 Can+0.022
 Please+0.022
 Come+0.018
 You+0.018
 sorry+0.016
Neg logit contributions
aches-0.039
cking-0.033
ck-0.032
cks-0.032
els-0.031
Token after
Token ID706
Feature activation-2.66e+00

Pos logit contributions
 Please+0.030
 Can+0.029
 Come+0.025
 You+0.025
 sorry+0.024
Neg logit contributions
aches-0.036
ge-0.029
els-0.029
ries-0.028
cking-0.028
Token all
Token ID477
Feature activation-1.61e+00

Pos logit contributions
aches+0.066
cking+0.058
cks+0.057
ck+0.056
els+0.055
Neg logit contributions
 Can-0.052
 Please-0.051
 sorry-0.047
 You-0.047
 I-0.044
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
aches+0.039
cking+0.033
cks+0.033
els+0.032
ries+0.032
Neg logit contributions
 Please-0.034
 Can-0.034
 sorry-0.029
 Come-0.029
 You-0.029
Token He
Token ID679
Feature activation-1.77e+00

Pos logit contributions
 Can+0.021
 Please+0.020
 sorry+0.017
 You+0.017
 Come+0.016
Neg logit contributions
aches-0.037
cks-0.032
cking-0.032
ck-0.031
els-0.031
Token thought
Token ID1807
Feature activation+3.80e+00

Pos logit contributions
aches+0.046
cking+0.040
cks+0.039
ck+0.039
els+0.038
Neg logit contributions
 Can-0.034
 Please-0.033
 You-0.030
 sorry-0.029
 Come-0.029
Token he
Token ID339
Feature activation-1.09e+00

Pos logit contributions
 Can+0.076
 Please+0.075
 You+0.065
 Come+0.064
 sorry+0.060
Neg logit contributions
aches-0.099
cking-0.081
ck-0.081
cks-0.080
els-0.079
Token had
Token ID550
Feature activation+3.04e-01

Pos logit contributions
aches+0.027
ck+0.023
cking+0.023
els+0.022
cks+0.022
Neg logit contributions
 Can-0.023
 Please-0.022
 You-0.019
 Come-0.019
 sorry-0.019
Token it
Token ID340
Feature activation-7.84e-03

Pos logit contributions
 Can+0.062
 Please+0.062
 You+0.053
 Come+0.052
 guilty+0.048
Neg logit contributions
aches-0.102
els-0.080
cking-0.079
ck-0.079
ries-0.078
Token felt
Token ID2936
Feature activation+2.03e+00

Pos logit contributions
aches+0.000
els+0.000
ck+0.000
cking+0.000
cks+0.000
Neg logit contributions
 Can-0.000
 Please-0.000
 You-0.000
 sorry-0.000
 Come-0.000
Token better
Token ID1365
Feature activation+8.94e-01

Pos logit contributions
 Please+0.036
 Can+0.036
 Come+0.032
 You+0.032
 Give+0.029
Neg logit contributions
aches-0.054
ck-0.045
cking-0.044
cks-0.042
els-0.039
Token.
Token ID13
Feature activation+1.82e+00

Pos logit contributions
 Can+0.020
 Please+0.019
 Come+0.017
 You+0.017
 sorry+0.017
Neg logit contributions
aches-0.021
cking-0.017
ck-0.017
cks-0.017
els-0.016
Token\n
Token ID198
Feature activation-7.19e+00

Pos logit contributions
 Can+0.029
 Please+0.028
 You+0.023
 sorry+0.023
 Come+0.023
Neg logit contributions
aches-0.051
cks-0.044
els-0.043
cking-0.043
ck-0.042
Token\n
Token ID198
Feature activation-6.18e+00

Pos logit contributions
aches+0.245
cking+0.219
ries+0.219
els+0.216
cks+0.209
Neg logit contributions
 Can-0.084
 Please-0.079
 You-0.074
 Come-0.073
 But-0.072
TokenShe
Token ID3347
Feature activation-2.15e+00

Pos logit contributions
aches+0.146
ries+0.130
cks+0.126
els+0.125
cking+0.124
Neg logit contributions
 Please-0.140
 Can-0.137
 You-0.123
 Come-0.120
 I-0.116
Token was
Token ID373
Feature activation+8.91e-01

Pos logit contributions
aches+0.057
cks+0.050
cking+0.050
els+0.048
ric+0.048
Neg logit contributions
 Please-0.039
 Can-0.039
 sorry-0.033
 You-0.032
 Come-0.032
Token so
Token ID523
Feature activation+8.61e-01

Pos logit contributions
 Can+0.012
 Please+0.012
 sorry+0.010
 You+0.010
 Come+0.009
Neg logit contributions
aches-0.028
cking-0.025
cks-0.025
els-0.024
ck-0.024
Token proud
Token ID6613
Feature activation+1.50e+00

Pos logit contributions
 Please+0.011
 Can+0.011
 sorry+0.011
 guilty+0.009
 You+0.009
Neg logit contributions
aches-0.027
els-0.024
cks-0.024
ries-0.024
cking-0.024
Token of
Token ID286
Feature activation-2.68e+00

Pos logit contributions
 Can+0.032
 Please+0.032
 You+0.028
 sorry+0.028
 Come+0.028
Neg logit contributions
aches-0.036
cking-0.030
cks-0.029
els-0.029
ck-0.029
Token,
Token ID11
Feature activation+1.24e-02

Pos logit contributions
 Can+0.042
 Please+0.041
 You+0.036
 Come+0.035
 sorry+0.035
Neg logit contributions
aches-0.059
cking-0.051
els-0.050
cks-0.050
ries-0.049
Token they
Token ID484
Feature activation-4.00e-01

Pos logit contributions
 Can+0.000
 Please+0.000
 You+0.000
 sorry+0.000
 Come+0.000
Neg logit contributions
aches-0.000
cking-0.000
cks-0.000
els-0.000
ck-0.000
Token hugged
Token ID41130
Feature activation-1.60e+00

Pos logit contributions
aches+0.010
cking+0.009
cks+0.009
els+0.009
ries+0.009
Neg logit contributions
 Please-0.008
 Can-0.007
 sorry-0.007
 You-0.006
 guilty-0.006
Token each
Token ID1123
Feature activation-5.48e+00

Pos logit contributions
aches+0.032
cking+0.026
cks+0.025
els+0.025
ck+0.025
Neg logit contributions
 Can-0.040
 Please-0.040
 You-0.036
 sorry-0.036
 Come-0.035
Token other
Token ID584
Feature activation-3.53e+00

Pos logit contributions
aches+0.132
cking+0.110
cks+0.108
ck+0.107
els+0.107
Neg logit contributions
 Can-0.117
 Please-0.116
 You-0.102
 sorry-0.101
 Come-0.100
Token to
Token ID284
Feature activation-5.09e+00

Pos logit contributions
aches+0.101
cking+0.088
cks+0.087
els+0.085
ck+0.085
Neg logit contributions
 Can-0.059
 Please-0.058
 You-0.050
 sorry-0.049
 Come-0.047
Token make
Token ID787
Feature activation-3.10e+00

Pos logit contributions
aches+0.122
cking+0.103
els+0.102
cks+0.102
ries+0.100
Neg logit contributions
 Please-0.108
 Can-0.108
 sorry-0.099
 You-0.097
 Come-0.092
Token themselves
Token ID2405
Feature activation-1.90e+00

Pos logit contributions
aches+0.080
cking+0.068
cks+0.067
ck+0.066
els+0.066
Neg logit contributions
 Can-0.061
 Please-0.060
 You-0.052
 Come-0.051
 sorry-0.051
Token feel
Token ID1254
Feature activation+4.26e-01

Pos logit contributions
aches+0.052
cks+0.047
els+0.047
cking+0.046
ries+0.045
Neg logit contributions
 Please-0.030
 Can-0.030
 sorry-0.028
 You-0.026
 guilty-0.025
Token better
Token ID1365
Feature activation+1.33e-01

Pos logit contributions
 Can+0.008
 Please+0.008
 Come+0.007
 You+0.007
 Yes+0.006
Neg logit contributions
aches-0.011
cking-0.010
ck-0.010
cks-0.009
els-0.008
Token.
Token ID13
Feature activation+7.73e-01

Pos logit contributions
 Can+0.003
 Please+0.003
 Come+0.002
 You+0.002
 sorry+0.002
Neg logit contributions
aches-0.003
cking-0.003
ck-0.003
cks-0.003
els-0.003
Token arms
Token ID5101
Feature activation-2.92e+00

Pos logit contributions
aches+0.001
cking+0.001
cks+0.001
ck+0.001
els+0.001
Neg logit contributions
 Can-0.001
 Please-0.001
 You-0.001
 Come-0.001
 sorry-0.001
Token above
Token ID2029
Feature activation-5.25e+00

Pos logit contributions
aches+0.081
cking+0.070
cks+0.068
ck+0.068
els+0.067
Neg logit contributions
 Can-0.052
 Please-0.051
 You-0.044
 Come-0.043
 sorry-0.042
Token the
Token ID262
Feature activation-1.35e+00

Pos logit contributions
aches+0.119
cking+0.099
cks+0.097
ck+0.095
els+0.094
Neg logit contributions
 Can-0.120
 Please-0.118
 You-0.106
 sorry-0.105
 Come-0.103
Token post
Token ID1281
Feature activation-2.69e+00

Pos logit contributions
aches+0.031
cking+0.026
cks+0.025
els+0.025
ck+0.025
Neg logit contributions
 Can-0.030
 Please-0.030
 You-0.027
 Come-0.026
 sorry-0.026
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
aches+0.070
cking+0.060
cks+0.059
ck+0.058
els+0.058
Neg logit contributions
 Can-0.050
 Please-0.050
 You-0.044
 sorry-0.044
 Come-0.042
Token\n
Token ID198
Feature activation-6.99e+00

Pos logit contributions
 Can+0.020
 Please+0.020
 sorry+0.017
 You+0.017
 Come+0.016
Neg logit contributions
aches-0.030
cking-0.025
cks-0.025
els-0.025
ries-0.025
Token\n
Token ID198
Feature activation-5.83e+00

Pos logit contributions
aches+0.208
cking+0.186
els+0.183
cks+0.183
ries+0.182
Neg logit contributions
 Can-0.111
 Please-0.110
 You-0.095
 Come-0.094
 sorry-0.090
Token"
Token ID1
Feature activation-2.63e+00

Pos logit contributions
aches+0.123
els+0.106
cks+0.106
ries+0.105
cking+0.102
Neg logit contributions
 Please-0.147
 Can-0.144
 You-0.128
 Come-0.127
 sorry-0.122
TokenWhat
Token ID2061
Feature activation+3.15e+00

Pos logit contributions
aches+0.075
cks+0.065
cking+0.064
els+0.064
ries+0.063
Neg logit contributions
 Can-0.047
 Please-0.047
 You-0.039
 Come-0.038
 Yes-0.036
Token are
Token ID389
Feature activation+3.96e+00

Pos logit contributions
 Can+0.063
 Please+0.062
 Come+0.053
 sorry+0.052
 You+0.052
Neg logit contributions
aches-0.078
cks-0.060
els-0.060
ck-0.058
cking-0.057
Token you
Token ID345
Feature activation+3.86e+00

Pos logit contributions
 Can+0.059
 Please+0.057
 Come+0.047
 You+0.047
 sorry+0.047
Neg logit contributions
aches-0.122
cking-0.104
cks-0.103
ries-0.102
ck-0.102
Token and
Token ID290
Feature activation-2.24e+00

Pos logit contributions
aches+0.003
cking+0.003
ck+0.003
els+0.003
cks+0.003
Neg logit contributions
 Can-0.004
 Please-0.004
 You-0.003
 Come-0.003
 gonna-0.003
Token pet
Token ID4273
Feature activation-1.88e+00

Pos logit contributions
aches+0.060
cking+0.051
els+0.051
cks+0.051
ries+0.049
Neg logit contributions
 Can-0.042
 Please-0.042
 sorry-0.037
 You-0.036
 Come-0.035
Tokented
Token ID1513
Feature activation-2.90e+00

Pos logit contributions
aches+0.039
cking+0.032
cks+0.032
els+0.031
ck+0.031
Neg logit contributions
 Can-0.046
 Please-0.045
 You-0.041
 sorry-0.040
 Come-0.040
Token Spot
Token ID15899
Feature activation-1.64e+00

Pos logit contributions
aches+0.064
cking+0.053
cks+0.052
els+0.052
ck+0.051
Neg logit contributions
 Can-0.067
 Please-0.066
 You-0.059
 sorry-0.058
 Come-0.057
Token.
Token ID13
Feature activation+3.61e+00

Pos logit contributions
aches+0.044
cking+0.038
cks+0.037
ck+0.037
els+0.037
Neg logit contributions
 Can-0.031
 Please-0.030
 You-0.026
 sorry-0.025
 Come-0.025
Token\n
Token ID198
Feature activation-6.68e+00

Pos logit contributions
 Please+0.050
 Can+0.048
 Come+0.039
 You+0.039
 ok+0.037
Neg logit contributions
aches-0.095
cks-0.085
els-0.081
ric-0.080
cking-0.080
Token\n
Token ID198
Feature activation-5.89e+00

Pos logit contributions
aches+0.212
cking+0.186
ries+0.182
els+0.182
cks+0.178
Neg logit contributions
 Can-0.101
 Please-0.096
 You-0.085
 Come-0.082
 sorry-0.080
Token"
Token ID1
Feature activation-2.16e+00

Pos logit contributions
aches+0.154
cking+0.130
els+0.129
ries+0.128
cks+0.127
Neg logit contributions
 Can-0.122
 Please-0.121
 You-0.106
 Come-0.103
 sorry-0.100
TokenThat
Token ID2504
Feature activation+1.89e+00

Pos logit contributions
aches+0.058
cking+0.049
els+0.049
cks+0.049
ck+0.047
Neg logit contributions
 Can-0.043
 Please-0.042
 You-0.037
 sorry-0.035
 Come-0.035
Token's
Token ID338
Feature activation-1.05e-02

Pos logit contributions
 Please+0.044
 Can+0.044
 sorry+0.042
 Come+0.038
 Yes+0.038
Neg logit contributions
aches-0.037
ric-0.028
ics-0.027
ries-0.027
els-0.026
Token okay
Token ID8788
Feature activation+4.64e+00

Pos logit contributions
aches+0.000
cking+0.000
cks+0.000
ck+0.000
els+0.000
Neg logit contributions
 Can-0.000
 Please-0.000
 Come-0.000
 You-0.000
 sorry-0.000
Token was
Token ID373
Feature activation+1.18e+00

Pos logit contributions
 Please+0.005
 Can+0.005
 You+0.004
 sorry+0.004
 Come+0.004
Neg logit contributions
aches-0.007
cking-0.006
cks-0.006
els-0.006
ck-0.006
Token already
Token ID1541
Feature activation+1.36e+00

Pos logit contributions
 Can+0.019
 Please+0.019
 You+0.016
 sorry+0.016
 Come+0.016
Neg logit contributions
aches-0.034
cking-0.030
cks-0.029
els-0.029
ck-0.029
Token walking
Token ID6155
Feature activation+3.23e-01

Pos logit contributions
 Can+0.025
 Please+0.025
 sorry+0.022
 You+0.021
 Come+0.020
Neg logit contributions
aches-0.036
cking-0.031
cks-0.031
els-0.031
ck-0.030
Token away
Token ID1497
Feature activation-9.37e-01

Pos logit contributions
 Can+0.007
 Please+0.006
 Come+0.006
 sorry+0.006
 You+0.006
Neg logit contributions
aches-0.008
cking-0.006
els-0.006
cks-0.006
ck-0.006
Token.
Token ID13
Feature activation+1.44e+00

Pos logit contributions
aches+0.025
cking+0.022
cks+0.022
els+0.021
ck+0.021
Neg logit contributions
 Can-0.017
 Please-0.017
 You-0.015
 sorry-0.014
 Come-0.014
Token\n
Token ID198
Feature activation-6.69e+00

Pos logit contributions
 Can+0.028
 Please+0.027
 sorry+0.024
 You+0.023
 Come+0.023
Neg logit contributions
aches-0.035
cks-0.029
cking-0.029
els-0.029
ck-0.028
Token\n
Token ID198
Feature activation-5.56e+00

Pos logit contributions
aches+0.194
cking+0.173
els+0.170
ries+0.169
cks+0.166
Neg logit contributions
 Please-0.109
 Can-0.109
 You-0.097
 Come-0.094
 sorry-0.091
TokenMom
Token ID29252
Feature activation-2.28e+00

Pos logit contributions
aches+0.113
els+0.097
ries+0.097
cking+0.096
cks+0.095
Neg logit contributions
 Please-0.141
 Can-0.137
 You-0.126
 Come-0.123
 sorry-0.119
Tokenmy
Token ID1820
Feature activation-1.09e+00

Pos logit contributions
aches+0.059
cking+0.050
cks+0.050
ck+0.048
els+0.048
Neg logit contributions
 Can-0.045
 Please-0.045
 You-0.039
 sorry-0.039
 Come-0.038
Token ran
Token ID4966
Feature activation+2.82e-01

Pos logit contributions
aches+0.028
cking+0.024
cks+0.024
ck+0.023
els+0.023
Neg logit contributions
 Can-0.021
 Please-0.021
 You-0.018
 sorry-0.018
 Come-0.017
Token back
Token ID736
Feature activation-2.44e+00

Pos logit contributions
 Can+0.006
 Please+0.006
 You+0.005
 sorry+0.005
 Come+0.005
Neg logit contributions
aches-0.006
ck-0.005
cking-0.005
ge-0.005
els-0.005
Token the
Token ID262
Feature activation-1.50e+00

Pos logit contributions
aches+0.021
cking+0.018
cks+0.018
els+0.018
ck+0.017
Neg logit contributions
 Can-0.015
 Please-0.015
 sorry-0.013
 You-0.013
 Come-0.013
Token tigers
Token ID43114
Feature activation-2.79e+00

Pos logit contributions
aches+0.036
cking+0.030
ck+0.028
els+0.028
cks+0.028
Neg logit contributions
 Can-0.031
 Please-0.030
 Come-0.027
 You-0.027
 sorry-0.026
Token were
Token ID547
Feature activation+4.00e-01

Pos logit contributions
aches+0.069
cking+0.059
cks+0.059
els+0.058
ck+0.057
Neg logit contributions
 Can-0.056
 Please-0.055
 You-0.049
 sorry-0.048
 Come-0.046
Token safe
Token ID3338
Feature activation+1.36e+00

Pos logit contributions
 Please+0.006
 Can+0.006
 sorry+0.006
 You+0.005
 guilty+0.005
Neg logit contributions
aches-0.012
els-0.010
cks-0.010
cking-0.010
ries-0.010
Token.
Token ID13
Feature activation+6.58e-01

Pos logit contributions
 Can+0.028
 Please+0.027
 You+0.024
 Come+0.024
 sorry+0.023
Neg logit contributions
aches-0.033
cking-0.029
cks-0.027
els-0.027
ries-0.027
Token\n
Token ID198
Feature activation-7.45e+00

Pos logit contributions
 Can+0.012
 Please+0.012
 You+0.010
 sorry+0.010
 Come+0.010
Neg logit contributions
aches-0.018
cking-0.015
cks-0.015
els-0.015
ck-0.015
Token\n
Token ID198
Feature activation-6.18e+00

Pos logit contributions
aches+0.204
ries+0.187
els+0.186
cking+0.180
cks+0.175
Neg logit contributions
 Please-0.121
 Can-0.120
 You-0.114
 Come-0.108
 sorry-0.107
TokenAs
Token ID1722
Feature activation-2.69e-01

Pos logit contributions
aches+0.126
ries+0.116
els+0.114
cks+0.111
cking+0.106
Neg logit contributions
 Please-0.149
 Can-0.144
 You-0.135
 Come-0.129
 Yes-0.125
Token Bob
Token ID5811
Feature activation-2.12e+00

Pos logit contributions
aches+0.007
cking+0.006
cks+0.006
els+0.006
ck+0.006
Neg logit contributions
 Can-0.005
 Please-0.005
 You-0.005
 sorry-0.005
 Come-0.004
Token walked
Token ID6807
Feature activation-1.16e+00

Pos logit contributions
aches+0.061
cks+0.054
cking+0.054
els+0.053
ries+0.051
Neg logit contributions
 Please-0.033
 Can-0.033
 You-0.028
 Come-0.028
 guilty-0.026
Token through
Token ID832
Feature activation-4.71e+00

Pos logit contributions
aches+0.033
cking+0.028
cks+0.028
ck+0.027
els+0.027
Neg logit contributions
 Can-0.020
 Please-0.020
 You-0.017
 sorry-0.017
 Come-0.017
Token little
Token ID1310
Feature activation-5.23e-02

Pos logit contributions
aches+0.061
cking+0.053
ck+0.051
cks+0.050
els+0.047
Neg logit contributions
 Can-0.049
 Please-0.048
 Come-0.044
 You-0.044
 Just-0.040
Token fish
Token ID5916
Feature activation-4.40e+00

Pos logit contributions
aches+0.001
cking+0.001
ck+0.001
ries+0.001
els+0.001
Neg logit contributions
 Please-0.001
 Can-0.001
 Come-0.001
 You-0.001
 Just-0.001
Token was
Token ID373
Feature activation+7.33e-01

Pos logit contributions
aches+0.119
cking+0.102
els+0.102
cks+0.102
ries+0.100
Neg logit contributions
 Please-0.078
 Can-0.078
 sorry-0.069
 You-0.067
 Come-0.064
Token safe
Token ID3338
Feature activation+1.77e+00

Pos logit contributions
 Can+0.010
 Please+0.010
 sorry+0.008
 You+0.008
 Come+0.007
Neg logit contributions
aches-0.023
cking-0.021
cks-0.020
els-0.020
ck-0.020
Token.
Token ID13
Feature activation+7.11e-01

Pos logit contributions
 Can+0.039
 Please+0.038
 Come+0.033
 You+0.033
 sorry+0.031
Neg logit contributions
aches-0.040
cking-0.036
cks-0.034
ck-0.033
ge-0.033
Token\n
Token ID198
Feature activation-8.08e+00

Pos logit contributions
 Can+0.013
 Please+0.013
 sorry+0.011
 You+0.010
 Come+0.010
Neg logit contributions
aches-0.019
cks-0.016
cking-0.016
ck-0.016
els-0.015
Token\n
Token ID198
Feature activation-6.69e+00

Pos logit contributions
aches+0.269
els+0.255
ries+0.254
cking+0.239
cks+0.230
Neg logit contributions
 Can-0.093
 Please-0.087
 But-0.080
 You-0.080
 Come-0.077
TokenThe
Token ID464
Feature activation-2.37e+00

Pos logit contributions
aches+0.151
ries+0.149
els+0.144
cks+0.129
packs+0.125
Neg logit contributions
 Please-0.152
 Can-0.151
 You-0.135
 Come-0.126
 I-0.126
Token little
Token ID1310
Feature activation-5.02e-02

Pos logit contributions
aches+0.062
cking+0.054
ck+0.053
cks+0.052
els+0.051
Neg logit contributions
 Can-0.045
 Please-0.045
 You-0.040
 Come-0.039
 sorry-0.036
Token fish
Token ID5916
Feature activation-4.10e+00

Pos logit contributions
aches+0.001
cking+0.001
ck+0.001
cks+0.001
els+0.001
Neg logit contributions
 Please-0.001
 Can-0.001
 Come-0.001
 You-0.001
 sorry-0.001
Token was
Token ID373
Feature activation+8.42e-01

Pos logit contributions
aches+0.115
cks+0.100
els+0.100
cking+0.099
ries+0.098
Neg logit contributions
 Please-0.068
 Can-0.068
 sorry-0.060
 You-0.058
 guilty-0.056