TOP ACTIVATIONS
MAX = 6.303

Where do you want to go ?" The goose hon
Where do you need to go ?" The animal pointed a
" Do you want to go play at the gym ?"
Where do you want to go , Blank et ?"
. Would you like to go on an adventure with me
uffy , where do you go when you disappear ?"
Shadow , where did you go ?" She looked around ,
" Would you like to go for a picnic today ?"
, " What is that wreck ?". Jack shrugged
, " Where did you go Nate ? I was looking
fear . " What 's jail ?" asked Billy . "
my , where do I go when I disappear ?" His
, " Where did you go ?". The rabbit said he
Where do you want to go ?" Tim
, " Where are you going ?". The pear didn 't
" Do you want to go on an adventure ?" Joe
Smith ! Where are you going ?" asked Lily .
friends , " Can I go fly by myself ?"
asked " Where did you go , daughter ? I miss
" Do you want to go down the slide ?"

MIN ACTIVATIONS
MIN = -6.979

fun !" Anna says . " Look at my bubble ,
Amazing ," Ben said . " This is a treasure ."
pretty !" Ben says . " You can wear it ,
Tom ," Lily said . " You are my friend ."
friends ," Lily says . " But you have to wait
lucky !" Mia says . " But where is our rocket
is ," Ben said . " And the rain is a
Tom ," Lily says . " But maybe it is someone
first ," Ben says . " But I dare you to
Sara and Ben say . " We are ready !"
dog !" Ben says . " I made a flower .
it !" Ben says . " We are the best spies
did !" Lily said . " But we have to remember
fish !" Ben said . " You are very smart and
Ben and Lily say . " We took pictures of many
fish !" Lily said . " You are very kind and
names !" Lily says . " He is very smart and
Wow !" Ben says . " This is a sharp pearl
," L ila says . " Now you are warm and
mm !" Sue said . " It is nice !"

INTERVAL 2.983 - 6.303
CONTAINS 0.993%

is cool . Let me help you ." He
" Can you help me break the eggs and put them
friend , Sally . " Hi Sally , do
to go play at the gym ?" Jack said
on the door . When his mom answered ,

INTERVAL -0.338 - 2.983
CONTAINS 62.271%

Okay , you can have the shell if you give me
But then , she saw some funny - looking
enjoyed the lemon juice together . <|endoftext|> One day , a
we love you , we love you ." They decided to
took out Spot . He was happy to see them and

INTERVAL -3.658 - -0.338
CONTAINS 35.903%

he and Amy had lots of fun playing together in the
on a trip with her mom and dad . They had
" Yes , please . I want to play with the
They raced the car around the garden , but then Sam
Emily 's m ittens . They promised each other that they

INTERVAL -6.979 - -3.658
CONTAINS 0.833%

animals happy . <|endoftext|> Once upon a time , there was
Ben nods and smiles . He and his dad
to hope . <|endoftext|> Once upon a time , there was
back home . <|endoftext|> Once upon a time , there was
toys ," Mom said . " That 's not nice .

TokenWhere
Token ID8496
Feature activation-1.12e-02

Pos logit contributions
 de+0.063
 lear+0.052
ives+0.049
bows+0.049
rav+0.046
Neg logit contributions
lf-0.048
 me-0.042
 site-0.040
 first-0.037
izing-0.037
Token do
Token ID466
Feature activation+3.82e+00

Pos logit contributions
 de+0.000
 lear+0.000
ives+0.000
 love+0.000
rav+0.000
Neg logit contributions
lf-0.000
 site-0.000
ered-0.000
izing-0.000
 avail-0.000
Token you
Token ID345
Feature activation-1.09e+00

Pos logit contributions
lf+0.072
ered+0.065
izing+0.063
 site+0.062
aned+0.061
Neg logit contributions
 de-0.090
 love-0.079
 lear-0.076
rav-0.075
By-0.075
Token want
Token ID765
Feature activation+4.06e+00

Pos logit contributions
 de+0.020
 lear+0.018
ives+0.016
 love+0.015
 hope+0.014
Neg logit contributions
lf-0.030
 site-0.027
izing-0.025
ered-0.025
 avail-0.025
Token to
Token ID284
Feature activation+2.87e+00

Pos logit contributions
lf+0.120
izing+0.112
ering+0.107
 avail+0.106
ths+0.105
Neg logit contributions
 love-0.056
 de-0.050
 like-0.048
ives-0.047
're-0.046
Token go
Token ID467
Feature activation+6.30e+00

Pos logit contributions
lf+0.087
izing+0.085
ering+0.083
ths+0.082
web+0.081
Neg logit contributions
 love-0.037
 like-0.029
're-0.029
 lear-0.027
 hope-0.026
Token?"
Token ID1701
Feature activation+1.79e+00

Pos logit contributions
lf+0.139
izing+0.116
 avail+0.112
ered+0.108
 site+0.107
Neg logit contributions
 like-0.145
 love-0.133
 de-0.128
're-0.123
ives-0.122
Token\n
Token ID198
Feature activation+2.12e+00

Pos logit contributions
lf+0.053
 avail+0.047
ered+0.047
 site+0.047
izing+0.046
Neg logit contributions
 de-0.030
 lear-0.025
ives-0.021
 love-0.019
bows-0.018
TokenThe
Token ID464
Feature activation+3.49e-01

Pos logit contributions
lf+0.066
 avail+0.061
ered+0.061
 site+0.061
izing+0.060
Neg logit contributions
 de-0.029
 lear-0.023
ives-0.021
 love-0.019
rav-0.016
Token goose
Token ID37246
Feature activation+2.13e+00

Pos logit contributions
lf+0.009
ered+0.008
 site+0.008
 avail+0.008
izing+0.007
Neg logit contributions
 de-0.007
 lear-0.006
 love-0.006
ives-0.005
 hope-0.005
Token hon
Token ID3032
Feature activation-6.34e-01

Pos logit contributions
lf+0.044
 avail+0.043
 site+0.041
izing+0.039
 happen+0.038
Neg logit contributions
 de-0.051
 lear-0.044
 love-0.039
ives-0.038
 hope-0.037
TokenWhere
Token ID8496
Feature activation+1.45e-01

Pos logit contributions
 de+0.064
 lear+0.053
ives+0.051
bows+0.050
rav+0.048
Neg logit contributions
lf-0.051
 me-0.046
 site-0.044
izing-0.043
ered-0.043
Token do
Token ID466
Feature activation+3.50e+00

Pos logit contributions
lf+0.004
 site+0.004
ered+0.004
izing+0.004
 avail+0.004
Neg logit contributions
 de-0.002
 lear-0.002
ives-0.002
 love-0.001
rav-0.001
Token you
Token ID345
Feature activation-1.15e+00

Pos logit contributions
lf+0.061
 site+0.056
ered+0.053
izing+0.051
ering+0.050
Neg logit contributions
 de-0.089
 lear-0.079
 love-0.079
rav-0.072
 rib-0.070
Token need
Token ID761
Feature activation+2.91e+00

Pos logit contributions
 de+0.021
 lear+0.019
ives+0.018
 love+0.017
 hope+0.016
Neg logit contributions
lf-0.031
 site-0.028
izing-0.027
 avail-0.027
ered-0.025
Token to
Token ID284
Feature activation+2.51e+00

Pos logit contributions
lf+0.069
 site+0.061
izing+0.059
 avail+0.059
ths+0.055
Neg logit contributions
 de-0.058
 love-0.057
 lear-0.055
ives-0.053
 like-0.049
Token go
Token ID467
Feature activation+6.19e+00

Pos logit contributions
lf+0.064
izing+0.061
ths+0.059
 site+0.058
ering+0.057
Neg logit contributions
 love-0.043
 lear-0.036
 like-0.035
 hope-0.035
 de-0.034
Token?"
Token ID1701
Feature activation+1.55e+00

Pos logit contributions
lf+0.129
izing+0.105
 avail+0.104
ered+0.098
 site+0.098
Neg logit contributions
 like-0.140
 de-0.135
 love-0.134
 lear-0.132
ives-0.123
Token The
Token ID383
Feature activation+7.30e-01

Pos logit contributions
lf+0.040
 site+0.036
ered+0.035
 avail+0.033
izing+0.033
Neg logit contributions
 de-0.031
 lear-0.027
ives-0.025
 love-0.022
rav-0.022
Token animal
Token ID5044
Feature activation+2.25e+00

Pos logit contributions
lf+0.017
ered+0.016
 avail+0.015
ering+0.014
angle+0.014
Neg logit contributions
 de-0.016
 lear-0.014
 love-0.014
Thank-0.012
 hope-0.012
Token pointed
Token ID6235
Feature activation-2.43e-01

Pos logit contributions
lf+0.045
 avail+0.041
 site+0.039
izing+0.038
 happen+0.037
Neg logit contributions
 de-0.057
 lear-0.050
 love-0.044
ives-0.044
 hope-0.042
Token a
Token ID257
Feature activation+5.65e-01

Pos logit contributions
 de+0.006
 lear+0.006
ives+0.005
bows+0.005
rav+0.005
Neg logit contributions
lf-0.004
ered-0.004
 me-0.004
 site-0.004
izing-0.004
Token "
Token ID366
Feature activation-2.04e+00

Pos logit contributions
 de+0.009
 lear+0.008
ives+0.008
bows+0.007
rav+0.007
Neg logit contributions
lf-0.009
 site-0.008
ered-0.008
 me-0.008
izing-0.008
TokenDo
Token ID5211
Feature activation+7.46e-01

Pos logit contributions
 de+0.052
 lear+0.041
ives+0.039
bows+0.038
isc+0.037
Neg logit contributions
lf-0.048
 me-0.041
 site-0.039
 avail-0.038
 first-0.037
Token you
Token ID345
Feature activation-2.43e+00

Pos logit contributions
lf+0.011
 site+0.009
ered+0.009
 avail+0.008
izing+0.008
Neg logit contributions
 de-0.023
 lear-0.021
ives-0.020
rav-0.020
 love-0.020
Token want
Token ID765
Feature activation+3.66e+00

Pos logit contributions
 de+0.046
 lear+0.035
joy+0.032
ives+0.032
bows+0.031
Neg logit contributions
lf-0.061
ered-0.061
 me-0.060
?"-0.059
 prove-0.056
Token to
Token ID284
Feature activation+2.59e+00

Pos logit contributions
lf+0.117
izing+0.113
ered+0.109
ering+0.106
 site+0.106
Neg logit contributions
 love-0.043
 lear-0.037
 de-0.037
ives-0.034
 like-0.033
Token go
Token ID467
Feature activation+6.06e+00

Pos logit contributions
lf+0.069
izing+0.063
 site+0.063
ered+0.060
ering+0.058
Neg logit contributions
 lear-0.044
 de-0.043
 love-0.042
ives-0.038
 like-0.036
Token play
Token ID711
Feature activation+2.44e+00

Pos logit contributions
lf+0.179
izing+0.164
 site+0.156
ered+0.155
 avail+0.153
Neg logit contributions
 like-0.087
 love-0.083
 lear-0.083
 de-0.083
ives-0.073
Token at
Token ID379
Feature activation+1.02e+00

Pos logit contributions
lf+0.055
ered+0.055
 avail+0.051
 site+0.050
izing+0.050
Neg logit contributions
 lear-0.046
 de-0.044
 like-0.042
 love-0.039
ives-0.039
Token the
Token ID262
Feature activation+3.13e-01

Pos logit contributions
ered+0.026
lf+0.026
izing+0.026
ering+0.025
eking+0.024
Neg logit contributions
 de-0.016
 lear-0.015
 love-0.014
 Whe-0.013
 hope-0.013
Token gym
Token ID11550
Feature activation+4.17e+00

Pos logit contributions
lf+0.009
ered+0.009
 avail+0.008
izing+0.008
ering+0.008
Neg logit contributions
 de-0.005
 lear-0.005
ives-0.004
 love-0.004
Whe-0.004
Token?"
Token ID1701
Feature activation+9.53e-01

Pos logit contributions
lf+0.080
ered+0.076
 avail+0.072
izing+0.067
 site+0.066
Neg logit contributions
 de-0.104
 lear-0.099
ives-0.089
bows-0.084
 like-0.084
TokenWhere
Token ID8496
Feature activation-1.97e-01

Pos logit contributions
 de+0.069
ives+0.055
 lear+0.054
bows+0.053
rav+0.052
Neg logit contributions
lf-0.050
 me-0.045
izing-0.040
 site-0.040
 first-0.040
Token do
Token ID466
Feature activation+3.45e+00

Pos logit contributions
 de+0.004
 lear+0.004
ives+0.003
 love+0.003
rav+0.003
Neg logit contributions
lf-0.005
 site-0.004
ered-0.004
izing-0.004
 avail-0.004
Token you
Token ID345
Feature activation-1.58e+00

Pos logit contributions
lf+0.059
 site+0.058
izing+0.056
ered+0.056
ering+0.052
Neg logit contributions
 de-0.087
 love-0.074
 lear-0.072
rav-0.071
 rib-0.068
Token want
Token ID765
Feature activation+3.88e+00

Pos logit contributions
 de+0.027
 lear+0.023
ives+0.020
joy+0.018
bows+0.018
Neg logit contributions
lf-0.045
ered-0.041
 site-0.040
 me-0.039
 happen-0.039
Token to
Token ID284
Feature activation+2.69e+00

Pos logit contributions
lf+0.082
izing+0.075
 avail+0.073
 site+0.071
ered+0.070
Neg logit contributions
 de-0.085
 love-0.081
 lear-0.074
 like-0.074
ives-0.074
Token go
Token ID467
Feature activation+6.06e+00

Pos logit contributions
lf+0.079
 site+0.075
izing+0.074
ths+0.071
ering+0.070
Neg logit contributions
 love-0.035
 de-0.033
 lear-0.031
 like-0.028
're-0.027
Token,
Token ID11
Feature activation+3.92e-01

Pos logit contributions
lf+0.106
izing+0.089
 avail+0.088
 site+0.086
ered+0.085
Neg logit contributions
 like-0.153
 de-0.151
 love-0.147
 lear-0.140
're-0.136
Token Blank
Token ID31990
Feature activation-1.37e+00

Pos logit contributions
lf+0.008
 site+0.007
ered+0.007
 avail+0.006
izing+0.006
Neg logit contributions
 de-0.010
 lear-0.009
ives-0.008
 love-0.008
bows-0.008
Tokenet
Token ID316
Feature activation+2.75e+00

Pos logit contributions
 de+0.047
 lear+0.044
ives+0.041
 love+0.039
bows+0.039
Neg logit contributions
lf-0.016
 site-0.012
ered-0.011
 avail-0.010
izing-0.010
Token?"
Token ID1701
Feature activation+1.51e+00

Pos logit contributions
lf+0.034
 site+0.029
 avail+0.028
izing+0.026
ered+0.025
Neg logit contributions
 de-0.090
 lear-0.082
ives-0.078
 love-0.075
bows-0.075
Token␣
Token ID220
Feature activation-1.32e+00

Pos logit contributions
lf+0.036
 site+0.031
 me+0.031
ered+0.031
 avail+0.029
Neg logit contributions
 de-0.033
 lear-0.029
ives-0.028
rav-0.026
bows-0.025
Token.
Token ID13
Feature activation-2.32e-01

Pos logit contributions
lf+0.039
 site+0.034
ered+0.034
 avail+0.032
izing+0.031
Neg logit contributions
 de-0.052
 lear-0.047
ives-0.043
 love-0.041
joy-0.040
Token Would
Token ID10928
Feature activation+2.33e+00

Pos logit contributions
 de+0.006
 lear+0.005
ives+0.005
bows+0.005
joy+0.005
Neg logit contributions
lf-0.004
 site-0.004
 me-0.004
ered-0.004
 my-0.003
Token you
Token ID345
Feature activation-2.51e+00

Pos logit contributions
lf+0.042
 site+0.040
 avail+0.037
ered+0.036
izing+0.036
Neg logit contributions
 de-0.060
 love-0.053
rav-0.052
By-0.049
 lear-0.049
Token like
Token ID588
Feature activation+2.50e+00

Pos logit contributions
 de+0.037
 lear+0.029
ives+0.023
joy+0.023
bows+0.022
Neg logit contributions
lf-0.072
?"-0.072
 me-0.072
ered-0.071
 first-0.069
Token to
Token ID284
Feature activation+3.25e+00

Pos logit contributions
lf+0.065
 site+0.059
ered+0.058
izing+0.057
 avail+0.057
Neg logit contributions
 de-0.047
 lear-0.042
 love-0.042
ives-0.038
 like-0.036
Token go
Token ID467
Feature activation+6.05e+00

Pos logit contributions
lf+0.086
 site+0.078
izing+0.077
ered+0.073
 avail+0.071
Neg logit contributions
 de-0.056
 lear-0.055
 love-0.054
ives-0.048
 hope-0.047
Token on
Token ID319
Feature activation+4.40e+00

Pos logit contributions
lf+0.170
izing+0.150
 site+0.145
 avail+0.143
ered+0.139
Neg logit contributions
 love-0.099
 like-0.098
 lear-0.094
 de-0.093
ives-0.087
Token an
Token ID281
Feature activation+2.39e+00

Pos logit contributions
lf+0.132
izing+0.123
ered+0.122
ering+0.116
 site+0.115
Neg logit contributions
 love-0.057
 de-0.056
joy-0.051
 like-0.051
 lear-0.050
Token adventure
Token ID8855
Feature activation+4.85e+00

Pos logit contributions
izing+0.055
lf+0.054
ize+0.050
 avail+0.050
 site+0.048
Neg logit contributions
 lear-0.039
 de-0.039
 love-0.038
 like-0.035
 hope-0.032
Token with
Token ID351
Feature activation+1.85e+00

Pos logit contributions
lf+0.068
ered+0.062
 site+0.054
 avail+0.053
izing+0.053
Neg logit contributions
 de-0.141
 lear-0.136
 like-0.129
 love-0.126
ives-0.124
Token me
Token ID502
Feature activation+1.82e+00

Pos logit contributions
lf+0.021
 site+0.019
izing+0.018
ered+0.016
 avail+0.015
Neg logit contributions
 de-0.057
 love-0.054
 lear-0.051
ives-0.049
 hope-0.049
Tokenuffy
Token ID15352
Feature activation+2.26e+00

Pos logit contributions
lf+0.014
 site+0.012
ered+0.012
 avail+0.011
 happen+0.011
Neg logit contributions
 de-0.015
 lear-0.013
ives-0.013
rav-0.012
bows-0.012
Token,
Token ID11
Feature activation-5.07e-01

Pos logit contributions
lf+0.052
 site+0.048
 avail+0.045
izing+0.044
ered+0.043
Neg logit contributions
 de-0.054
 lear-0.045
 love-0.043
 hope-0.041
 like-0.040
Token where
Token ID810
Feature activation+1.56e+00

Pos logit contributions
 de+0.012
 lear+0.011
ives+0.010
bows+0.010
joy+0.009
Neg logit contributions
lf-0.011
 me-0.009
 site-0.009
ered-0.009
 happen-0.009
Token do
Token ID466
Feature activation+3.90e+00

Pos logit contributions
lf+0.045
 site+0.041
ered+0.040
izing+0.039
 avail+0.039
Neg logit contributions
 de-0.026
 lear-0.022
ives-0.020
 love-0.018
rav-0.018
Token you
Token ID345
Feature activation-7.55e-02

Pos logit contributions
lf+0.070
 site+0.065
ered+0.062
izing+0.062
 avail+0.061
Neg logit contributions
 de-0.102
rav-0.087
 love-0.085
 lear-0.083
ives-0.081
Token go
Token ID467
Feature activation+6.05e+00

Pos logit contributions
 de+0.002
 lear+0.001
 love+0.001
ives+0.001
're+0.001
Neg logit contributions
lf-0.002
 site-0.002
izing-0.002
 avail-0.002
ths-0.002
Token when
Token ID618
Feature activation+3.32e+00

Pos logit contributions
lf+0.161
 avail+0.149
ths+0.145
izing+0.144
ered+0.133
Neg logit contributions
 like-0.114
 love-0.099
 de-0.096
ives-0.091
're-0.091
Token you
Token ID345
Feature activation-2.19e-01

Pos logit contributions
lf+0.071
izing+0.070
ered+0.067
ths+0.066
aned+0.063
Neg logit contributions
 de-0.073
 love-0.062
 like-0.058
're-0.056
 hope-0.054
Token disappear
Token ID10921
Feature activation+3.50e+00

Pos logit contributions
 de+0.005
 lear+0.004
ives+0.004
joy+0.003
bows+0.003
Neg logit contributions
lf-0.005
 site-0.005
ered-0.005
 avail-0.004
 me-0.004
Token?"
Token ID1701
Feature activation+2.35e+00

Pos logit contributions
lf+0.054
 site+0.045
ered+0.040
 avail+0.040
izing+0.039
Neg logit contributions
 de-0.110
 like-0.091
 love-0.091
 lear-0.091
bows-0.089
Token\n
Token ID198
Feature activation+2.06e+00

Pos logit contributions
lf+0.061
 site+0.056
ered+0.054
 avail+0.054
izing+0.053
Neg logit contributions
 de-0.047
 lear-0.040
 love-0.034
ives-0.034
Thank-0.033
TokenShadow
Token ID27447
Feature activation+2.39e+00

Pos logit contributions
 de+0.057
ives+0.050
 lear+0.050
joy+0.042
bows+0.041
Neg logit contributions
lf-0.072
 me-0.064
izing-0.062
 avail-0.061
 site-0.060
Token,
Token ID11
Feature activation-5.97e-01

Pos logit contributions
lf+0.046
 site+0.042
ered+0.039
izing+0.038
 avail+0.038
Neg logit contributions
 de-0.064
 lear-0.055
 love-0.050
bows-0.049
ives-0.049
Token where
Token ID810
Feature activation+1.50e+00

Pos logit contributions
 de+0.017
 lear+0.015
ives+0.015
bows+0.014
joy+0.013
Neg logit contributions
lf-0.011
 site-0.009
ered-0.009
 avail-0.008
izing-0.008
Token did
Token ID750
Feature activation+3.67e+00

Pos logit contributions
lf+0.044
 site+0.041
izing+0.039
ered+0.038
 avail+0.038
Neg logit contributions
 de-0.025
 lear-0.020
ives-0.019
 love-0.018
rav-0.017
Token you
Token ID345
Feature activation+6.85e-01

Pos logit contributions
 site+0.071
lf+0.069
izing+0.066
bing+0.065
roads+0.065
Neg logit contributions
 de-0.089
 love-0.075
rav-0.067
 hope-0.067
 lear-0.067
Token go
Token ID467
Feature activation+6.04e+00

Pos logit contributions
lf+0.019
 site+0.018
izing+0.018
ths+0.017
 avail+0.017
Neg logit contributions
 love-0.011
 like-0.010
're-0.010
 de-0.010
 lear-0.010
Token?"
Token ID1701
Feature activation+1.95e+00

Pos logit contributions
lf+0.144
izing+0.137
 avail+0.131
ths+0.128
iture+0.124
Neg logit contributions
 like-0.135
 love-0.115
 and-0.104
 hope-0.100
 de-0.095
Token She
Token ID1375
Feature activation+6.77e-01

Pos logit contributions
lf+0.058
 site+0.053
 avail+0.052
izing+0.052
ered+0.051
Neg logit contributions
 de-0.031
 lear-0.027
 love-0.023
ives-0.022
 hope-0.021
Token looked
Token ID3114
Feature activation-1.56e+00

Pos logit contributions
lf+0.016
 site+0.015
izing+0.014
 avail+0.014
ered+0.014
Neg logit contributions
 de-0.015
 lear-0.013
 love-0.011
ives-0.011
 hope-0.011
Token around
Token ID1088
Feature activation-1.65e+00

Pos logit contributions
 de+0.038
 lear+0.035
ives+0.032
bows+0.030
rav+0.030
Neg logit contributions
lf-0.032
 site-0.028
ered-0.028
 me-0.026
izing-0.026
Token,
Token ID11
Feature activation-1.65e+00

Pos logit contributions
 de+0.037
 lear+0.033
ives+0.032
bows+0.029
rav+0.029
Neg logit contributions
lf-0.037
 site-0.033
 me-0.032
ered-0.032
 avail-0.030
Token "
Token ID366
Feature activation-1.90e+00

Pos logit contributions
lf+0.011
 site+0.010
ered+0.010
 me+0.009
izing+0.009
Neg logit contributions
 de-0.010
 lear-0.009
ives-0.009
bows-0.008
rav-0.008
TokenWould
Token ID17353
Feature activation+1.90e+00

Pos logit contributions
 de+0.054
 lear+0.046
ives+0.045
isc+0.043
rav+0.042
Neg logit contributions
lf-0.036
 me-0.032
 site-0.029
izing-0.028
 first-0.028
Token you
Token ID345
Feature activation-3.27e+00

Pos logit contributions
lf+0.031
 site+0.029
 avail+0.026
ered+0.026
izing+0.024
Neg logit contributions
 de-0.053
ives-0.047
 lear-0.047
rav-0.047
 love-0.046
Token like
Token ID588
Feature activation+1.74e+00

Pos logit contributions
 de+0.055
 lear+0.046
joy+0.041
bows+0.036
pre+0.035
Neg logit contributions
?"-0.088
 me-0.085
 first-0.084
ered-0.081
 my-0.080
Token to
Token ID284
Feature activation+2.78e+00

Pos logit contributions
lf+0.041
 site+0.037
ered+0.036
izing+0.034
 avail+0.034
Neg logit contributions
 de-0.038
 lear-0.034
ives-0.031
 love-0.030
joy-0.028
Token go
Token ID467
Feature activation+5.98e+00

Pos logit contributions
lf+0.068
 site+0.062
izing+0.060
ered+0.059
 avail+0.057
Neg logit contributions
 de-0.055
 lear-0.051
 love-0.048
ives-0.047
bows-0.043
Token for
Token ID329
Feature activation+1.83e+00

Pos logit contributions
lf+0.176
izing+0.162
 site+0.155
 avail+0.153
ered+0.151
Neg logit contributions
 love-0.086
 de-0.086
 lear-0.082
 like-0.082
ives-0.077
Token a
Token ID257
Feature activation+1.89e+00

Pos logit contributions
lf+0.058
izing+0.056
 site+0.054
ering+0.053
ered+0.052
Neg logit contributions
 love-0.021
 de-0.020
 lear-0.017
 hope-0.016
 like-0.014
Token picnic
Token ID35715
Feature activation+3.51e+00

Pos logit contributions
lf+0.049
ered+0.047
 site+0.045
izing+0.045
 avail+0.043
Neg logit contributions
 de-0.036
 lear-0.033
 love-0.028
ives-0.027
rav-0.026
Token today
Token ID1909
Feature activation+3.98e+00

Pos logit contributions
lf+0.050
ered+0.044
 site+0.041
izing+0.039
 avail+0.038
Neg logit contributions
 de-0.104
 lear-0.098
 love-0.091
ives-0.090
 like-0.088
Token?"
Token ID1701
Feature activation+1.31e+00

Pos logit contributions
lf+0.055
ered+0.052
izing+0.046
 site+0.044
 avail+0.043
Neg logit contributions
 de-0.119
 lear-0.107
 like-0.106
 love-0.105
ives-0.101
Token,
Token ID11
Feature activation-4.82e-01

Pos logit contributions
lf+0.000
 site+0.000
 avail+0.000
izing+0.000
ered+0.000
Neg logit contributions
 de-0.001
 lear-0.000
ives-0.000
 love-0.000
joy-0.000
Token "
Token ID366
Feature activation-2.06e+00

Pos logit contributions
 de+0.010
 lear+0.009
ives+0.008
bows+0.007
rav+0.007
Neg logit contributions
lf-0.012
 site-0.011
ered-0.010
izing-0.010
 avail-0.010
TokenWhat
Token ID2061
Feature activation-1.58e+00

Pos logit contributions
 de+0.054
 lear+0.045
ives+0.043
bows+0.041
rav+0.040
Neg logit contributions
lf-0.046
 site-0.038
 me-0.037
ered-0.036
izing-0.035
Token is
Token ID318
Feature activation+1.21e+00

Pos logit contributions
 de+0.040
 lear+0.036
ives+0.033
joy+0.031
bows+0.030
Neg logit contributions
lf-0.033
 site-0.028
ered-0.028
 happen-0.026
 avail-0.026
Token that
Token ID326
Feature activation+2.85e+00

Pos logit contributions
lf+0.023
 avail+0.020
ered+0.020
ation+0.019
 site+0.019
Neg logit contributions
 de-0.033
 lear-0.028
 like-0.026
 love-0.025
 hope-0.024
Token wreck
Token ID15228
Feature activation+5.96e+00

Pos logit contributions
lf+0.053
 avail+0.045
ths+0.045
ered+0.043
izing+0.041
Neg logit contributions
 de-0.071
 like-0.064
 love-0.062
're-0.061
 are-0.056
Token?".
Token ID43634
Feature activation+1.59e+00

Pos logit contributions
lf+0.085
 avail+0.084
ggy+0.067
 site+0.066
ered+0.065
Neg logit contributions
 de-0.164
 like-0.146
're-0.144
bows-0.143
ives-0.142
Token\n
Token ID198
Feature activation+2.58e+00

Pos logit contributions
lf+0.031
 site+0.027
ered+0.027
izing+0.025
 avail+0.025
Neg logit contributions
 de-0.041
 lear-0.037
ives-0.035
joy-0.033
 love-0.033
Token\n
Token ID198
Feature activation+3.50e+00

Pos logit contributions
lf+0.081
ered+0.075
 site+0.075
 avail+0.073
 happen+0.073
Neg logit contributions
 de-0.034
 lear-0.028
ives-0.026
joy-0.023
 love-0.022
TokenJack
Token ID14295
Feature activation+1.68e+00

Pos logit contributions
lf+0.082
ered+0.077
 site+0.076
 avail+0.075
 happen+0.073
Neg logit contributions
 de-0.070
 lear-0.064
ives-0.061
joy-0.056
 love-0.055
Token shrugged
Token ID28271
Feature activation+2.42e-01

Pos logit contributions
lf+0.030
 site+0.030
 avail+0.029
izing+0.027
 happen+0.026
Neg logit contributions
 de-0.044
 lear-0.038
 love-0.035
Thank-0.033
 like-0.033
Token,
Token ID11
Feature activation-6.72e-01

Pos logit contributions
lf+0.003
 site+0.003
ered+0.002
 avail+0.002
izing+0.002
Neg logit contributions
 de-0.006
 lear-0.005
ives-0.005
 love-0.005
bows-0.005
Token "
Token ID366
Feature activation-3.04e+00

Pos logit contributions
 de+0.015
 lear+0.014
ives+0.013
bows+0.012
rav+0.012
Neg logit contributions
lf-0.015
 site-0.013
ered-0.013
 me-0.012
izing-0.012
TokenWhere
Token ID8496
Feature activation-4.76e-01

Pos logit contributions
 de+0.075
 lear+0.067
bows+0.062
ives+0.060
ushes+0.056
Neg logit contributions
 me-0.061
lf-0.056
izing-0.054
 avail-0.053
ered-0.051
Token did
Token ID750
Feature activation+3.91e+00

Pos logit contributions
 de+0.009
 lear+0.008
ives+0.007
 love+0.007
rav+0.007
Neg logit contributions
lf-0.013
 site-0.011
ered-0.011
izing-0.011
 avail-0.011
Token you
Token ID345
Feature activation-3.57e-01

Pos logit contributions
lf+0.069
 site+0.065
izing+0.062
ered+0.060
ering+0.058
Neg logit contributions
 de-0.099
 love-0.083
 lear-0.081
 rib-0.074
're-0.074
Token go
Token ID467
Feature activation+5.94e+00

Pos logit contributions
 love+0.006
're+0.006
 de+0.006
 like+0.005
 lear+0.005
Neg logit contributions
lf-0.010
 site-0.009
izing-0.009
 avail-0.008
ths-0.008
Token Nate
Token ID23486
Feature activation+1.70e+00

Pos logit contributions
lf+0.134
izing+0.125
 avail+0.119
 site+0.110
rums+0.109
Neg logit contributions
 like-0.118
 de-0.105
 love-0.103
ives-0.101
're-0.101
Token?
Token ID30
Feature activation+1.71e-01

Pos logit contributions
lf+0.027
 site+0.023
izing+0.022
 avail+0.022
ered+0.021
Neg logit contributions
 de-0.049
 lear-0.045
ives-0.045
 love-0.042
're-0.041
Token I
Token ID314
Feature activation-1.25e+00

Pos logit contributions
lf+0.003
 site+0.003
ered+0.003
 me+0.003
 avail+0.002
Neg logit contributions
 de-0.005
 lear-0.004
ives-0.004
bows-0.004
joy-0.004
Token was
Token ID373
Feature activation+2.26e+00

Pos logit contributions
 de+0.030
 lear+0.027
ives+0.025
joy+0.024
bows+0.024
Neg logit contributions
lf-0.026
ered-0.024
 site-0.022
 me-0.022
 happen-0.021
Token looking
Token ID2045
Feature activation-3.52e-01

Pos logit contributions
lf+0.048
 site+0.041
izing+0.040
 avail+0.040
ered+0.040
Neg logit contributions
 de-0.056
 lear-0.050
ives-0.045
 love-0.044
joy-0.044
Token fear
Token ID3252
Feature activation+1.07e-01

Pos logit contributions
 de+0.042
 lear+0.038
ives+0.038
rav+0.036
bows+0.035
Neg logit contributions
lf-0.027
 site-0.022
 avail-0.022
 me-0.021
ered-0.021
Token.
Token ID13
Feature activation-8.71e-02

Pos logit contributions
lf+0.003
 site+0.002
 avail+0.002
ered+0.002
izing+0.002
Neg logit contributions
 de-0.002
 lear-0.002
ives-0.002
 love-0.002
bows-0.002
Token "
Token ID366
Feature activation-2.82e+00

Pos logit contributions
 de+0.001
 lear+0.001
ives+0.001
 love+0.001
bows+0.001
Neg logit contributions
lf-0.003
 site-0.002
ered-0.002
 avail-0.002
izing-0.002
TokenWhat
Token ID2061
Feature activation-1.12e+00

Pos logit contributions
 de+0.067
ives+0.056
 lear+0.056
joy+0.051
isc+0.050
Neg logit contributions
lf-0.063
izing-0.056
 me-0.052
 site-0.052
ths-0.051
Token's
Token ID338
Feature activation+8.12e-01

Pos logit contributions
 de+0.030
 lear+0.027
ives+0.025
joy+0.024
bows+0.023
Neg logit contributions
lf-0.022
 site-0.018
ered-0.018
 avail-0.017
izing-0.017
Token jail
Token ID7356
Feature activation+5.94e+00

Pos logit contributions
lf+0.011
 site+0.009
 avail+0.008
ered+0.008
izing+0.007
Neg logit contributions
 de-0.027
 lear-0.023
 love-0.022
 like-0.022
 hope-0.021
Token?"
Token ID1701
Feature activation+1.15e+00

Pos logit contributions
 avail+0.088
lf+0.084
ered+0.077
izing+0.073
 site+0.072
Neg logit contributions
 de-0.164
bows-0.138
ives-0.137
 like-0.134
're-0.133
Token asked
Token ID1965
Feature activation+7.42e-01

Pos logit contributions
lf+0.028
 site+0.026
izing+0.026
ered+0.025
 happen+0.024
Neg logit contributions
 de-0.023
 lear-0.019
 love-0.018
Thank-0.018
Let-0.017
Token Billy
Token ID15890
Feature activation+1.15e-01

Pos logit contributions
lf+0.017
 site+0.015
izing+0.014
ered+0.014
 avail+0.013
Neg logit contributions
 de-0.017
 lear-0.015
 love-0.013
ives-0.013
joy-0.013
Token.
Token ID13
Feature activation+9.38e-01

Pos logit contributions
lf+0.003
 site+0.003
 avail+0.002
izing+0.002
ered+0.002
Neg logit contributions
 de-0.003
 lear-0.002
 love-0.002
 like-0.002
ives-0.002
Token "
Token ID366
Feature activation-2.89e+00

Pos logit contributions
lf+0.026
 site+0.023
ered+0.023
 avail+0.022
izing+0.022
Neg logit contributions
 de-0.017
 lear-0.015
ives-0.014
 love-0.012
bows-0.012
Tokenmy
Token ID1820
Feature activation+1.11e+00

Pos logit contributions
lf+0.037
 site+0.034
ered+0.033
izing+0.032
 avail+0.031
Neg logit contributions
 de-0.030
 lear-0.026
ives-0.024
 love-0.022
bows-0.021
Token,
Token ID11
Feature activation-7.10e-01

Pos logit contributions
lf+0.019
 site+0.016
ered+0.015
 avail+0.014
izing+0.014
Neg logit contributions
 de-0.032
 lear-0.029
ives-0.028
 love-0.026
bows-0.026
Token where
Token ID810
Feature activation+1.16e+00

Pos logit contributions
 de+0.017
 lear+0.016
ives+0.015
bows+0.013
joy+0.013
Neg logit contributions
lf-0.015
 site-0.013
ered-0.013
 me-0.013
 happen-0.012
Token do
Token ID466
Feature activation+4.53e+00

Pos logit contributions
lf+0.031
 site+0.028
ered+0.027
izing+0.027
 avail+0.027
Neg logit contributions
 de-0.022
 lear-0.018
ives-0.017
 love-0.015
rav-0.015
Token I
Token ID314
Feature activation+4.11e-01

Pos logit contributions
izing+0.087
aned+0.083
ered+0.082
ering+0.082
bing+0.082
Neg logit contributions
 de-0.097
 love-0.085
rav-0.080
By-0.076
 lear-0.076
Token go
Token ID467
Feature activation+5.91e+00

Pos logit contributions
lf+0.010
 site+0.009
izing+0.009
 avail+0.008
ered+0.008
Neg logit contributions
 de-0.009
 lear-0.008
 love-0.008
ives-0.008
 hope-0.007
Token when
Token ID618
Feature activation+2.96e+00

Pos logit contributions
lf+0.151
izing+0.135
 avail+0.132
ths+0.125
 site+0.125
Neg logit contributions
 like-0.108
 de-0.103
 love-0.102
 lear-0.094
ives-0.092
Token I
Token ID314
Feature activation-3.95e-01

Pos logit contributions
lf+0.057
izing+0.056
ered+0.055
ering+0.050
ation+0.049
Neg logit contributions
 de-0.070
 love-0.059
 lear-0.053
're-0.053
 hope-0.052
Token disappear
Token ID10921
Feature activation+3.18e+00

Pos logit contributions
 de+0.009
 lear+0.008
ives+0.008
joy+0.007
bows+0.007
Neg logit contributions
lf-0.009
ered-0.008
 site-0.008
 me-0.007
 happen-0.007
Token?"
Token ID1701
Feature activation+1.97e+00

Pos logit contributions
lf+0.041
 site+0.032
ered+0.030
 avail+0.029
izing+0.029
Neg logit contributions
 de-0.105
 lear-0.092
 love-0.089
 like-0.087
bows-0.086
Token His
Token ID2399
Feature activation+7.06e-01

Pos logit contributions
lf+0.047
 site+0.044
izing+0.044
 avail+0.043
ered+0.043
Neg logit contributions
 de-0.040
 lear-0.035
 love-0.032
Thank-0.030
ives-0.029
Token,
Token ID11
Feature activation+4.48e-01

Pos logit contributions
lf+0.028
 site+0.024
izing+0.023
 avail+0.023
ered+0.022
Neg logit contributions
 de-0.037
 lear-0.033
ives-0.030
 love-0.030
joy-0.029
Token "
Token ID366
Feature activation-1.96e+00

Pos logit contributions
lf+0.010
 site+0.009
ered+0.009
 me+0.009
izing+0.008
Neg logit contributions
 de-0.010
 lear-0.009
ives-0.009
bows-0.008
rav-0.008
TokenWhere
Token ID8496
Feature activation+4.29e-04

Pos logit contributions
 de+0.054
 lear+0.043
ives+0.043
bows+0.042
isc+0.040
Neg logit contributions
lf-0.041
 me-0.036
 site-0.033
izing-0.031
 avail-0.031
Token did
Token ID750
Feature activation+3.78e+00

Pos logit contributions
lf+0.000
 site+0.000
ered+0.000
izing+0.000
 avail+0.000
Neg logit contributions
 de-0.000
 lear-0.000
ives-0.000
 love-0.000
rav-0.000
Token you
Token ID345
Feature activation-2.85e-01

Pos logit contributions
lf+0.068
 site+0.068
izing+0.066
aned+0.065
ered+0.065
Neg logit contributions
 de-0.089
 love-0.075
 lear-0.071
 rib-0.069
 like-0.067
Token go
Token ID467
Feature activation+5.91e+00

Pos logit contributions
 love+0.005
 lear+0.005
're+0.005
 like+0.004
ives+0.004
Neg logit contributions
lf-0.008
 site-0.007
izing-0.007
 avail-0.007
ths-0.006
Token?".
Token ID43634
Feature activation+2.45e+00

Pos logit contributions
lf+0.120
 avail+0.110
izing+0.106
ered+0.099
 site+0.098
Neg logit contributions
 like-0.135
 love-0.118
ives-0.114
 de-0.112
 lear-0.112
Token The
Token ID383
Feature activation+1.05e+00

Pos logit contributions
lf+0.066
izing+0.062
ered+0.060
 happen+0.060
 site+0.059
Neg logit contributions
 de-0.042
 lear-0.038
ives-0.031
 love-0.031
joy-0.031
Token rabbit
Token ID22746
Feature activation+2.18e+00

Pos logit contributions
lf+0.032
ered+0.032
 site+0.032
izing+0.029
 happen+0.029
Neg logit contributions
 de-0.015
 lear-0.012
 love-0.012
Thank-0.011
ives-0.010
Token said
Token ID531
Feature activation-5.29e-02

Pos logit contributions
lf+0.039
 site+0.037
 avail+0.035
izing+0.034
ered+0.032
Neg logit contributions
 de-0.058
 lear-0.052
 love-0.047
ives-0.046
joy-0.044
Token he
Token ID339
Feature activation-5.41e-01

Pos logit contributions
 de+0.001
 lear+0.001
ives+0.001
 love+0.001
joy+0.001
Neg logit contributions
lf-0.001
 site-0.001
 avail-0.001
ered-0.001
izing-0.001
Token Where
Token ID6350
Feature activation+1.02e-01

Pos logit contributions
 de+0.029
 lear+0.026
ives+0.024
bows+0.023
joy+0.023
Neg logit contributions
lf-0.020
 me-0.018
 site-0.018
ered-0.017
 myself-0.017
Token do
Token ID466
Feature activation+3.65e+00

Pos logit contributions
lf+0.003
 site+0.002
ered+0.002
 avail+0.002
izing+0.002
Neg logit contributions
 de-0.002
 lear-0.002
ives-0.002
 love-0.001
bows-0.001
Token you
Token ID345
Feature activation-4.75e-01

Pos logit contributions
lf+0.069
 site+0.067
ered+0.065
aned+0.064
izing+0.062
Neg logit contributions
 de-0.076
rav-0.069
 love-0.068
By-0.066
 lear-0.065
Token want
Token ID765
Feature activation+4.15e+00

Pos logit contributions
 de+0.009
 lear+0.008
ives+0.007
 love+0.007
 hope+0.007
Neg logit contributions
lf-0.013
 site-0.011
izing-0.011
 avail-0.011
ered-0.011
Token to
Token ID284
Feature activation+2.63e+00

Pos logit contributions
lf+0.130
izing+0.127
ering+0.121
 avail+0.119
ered+0.118
Neg logit contributions
 love-0.046
 like-0.041
 de-0.037
ives-0.037
're-0.035
Token go
Token ID467
Feature activation+5.89e+00

Pos logit contributions
izing+0.082
lf+0.081
ering+0.080
web+0.079
ths+0.077
Neg logit contributions
 love-0.030
 like-0.023
're-0.023
 lear-0.021
 hope-0.020
Token?"
Token ID1701
Feature activation+1.20e+00

Pos logit contributions
lf+0.126
izing+0.110
 site+0.102
 avail+0.101
ered+0.099
Neg logit contributions
 like-0.139
 love-0.126
 de-0.117
ives-0.115
're-0.114
Token
Token ID220
Feature activation-1.63e+00

Pos logit contributions
lf+0.032
 site+0.028
ered+0.028
 avail+0.027
izing+0.027
Neg logit contributions
 de-0.023
 lear-0.020
ives-0.019
bows-0.017
 love-0.017
Token
Token ID198
Feature activation+1.32e+00

Pos logit contributions
 de+0.031
 lear+0.028
ives+0.024
bows+0.023
joy+0.022
Neg logit contributions
lf-0.043
 site-0.039
ered-0.038
 me-0.036
izing-0.036
Token
Token ID198
Feature activation+2.90e+00

Pos logit contributions
lf+0.037
 site+0.034
ered+0.033
 avail+0.032
izing+0.032
Neg logit contributions
 de-0.023
 lear-0.020
ives-0.018
 love-0.016
bows-0.016
TokenTim
Token ID14967
Feature activation+1.79e-01

Pos logit contributions
lf+0.067
 site+0.059
ered+0.056
izing+0.055
 avail+0.054
Neg logit contributions
 de-0.067
 lear-0.060
ives-0.055
bows-0.051
joy-0.050
Token,
Token ID11
Feature activation-2.72e-01

Pos logit contributions
lf+0.037
 avail+0.034
 site+0.032
izing+0.031
ered+0.031
Neg logit contributions
 de-0.035
 lear-0.030
ives-0.030
bows-0.026
 love-0.026
Token "
Token ID366
Feature activation-2.17e+00

Pos logit contributions
 de+0.006
 lear+0.005
ives+0.005
bows+0.005
rav+0.005
Neg logit contributions
lf-0.006
 site-0.006
ered-0.005
 me-0.005
izing-0.005
TokenWhere
Token ID8496
Feature activation+5.06e-02

Pos logit contributions
 de+0.059
 lear+0.049
ives+0.046
rav+0.046
bows+0.045
Neg logit contributions
lf-0.044
 me-0.039
 site-0.037
ered-0.036
izing-0.035
Token are
Token ID389
Feature activation+3.17e+00

Pos logit contributions
lf+0.001
 site+0.001
ered+0.001
izing+0.001
 avail+0.001
Neg logit contributions
 de-0.001
 lear-0.001
ives-0.001
 love-0.001
rav-0.001
Token you
Token ID345
Feature activation-3.00e-01

Pos logit contributions
lf+0.059
 site+0.052
izing+0.046
 avail+0.046
ered+0.046
Neg logit contributions
 de-0.085
 lear-0.069
 love-0.069
 rib-0.068
 ju-0.067
Token going
Token ID1016
Feature activation+5.84e+00

Pos logit contributions
 de+0.006
 lear+0.005
 love+0.005
 like+0.005
're+0.005
Neg logit contributions
lf-0.008
 site-0.007
izing-0.006
 avail-0.006
ths-0.006
Token?".
Token ID43634
Feature activation+2.41e+00

Pos logit contributions
lf+0.116
izing+0.099
 site+0.094
ered+0.093
 avail+0.092
Neg logit contributions
 de-0.136
 like-0.134
 lear-0.133
 love-0.125
ives-0.124
Token The
Token ID383
Feature activation+8.71e-01

Pos logit contributions
lf+0.061
 site+0.054
ered+0.053
izing+0.053
 avail+0.052
Neg logit contributions
 de-0.049
 lear-0.044
ives-0.040
 love-0.036
bows-0.036
Token pear
Token ID25286
Feature activation+1.82e+00

Pos logit contributions
lf+0.023
 site+0.021
ered+0.020
 avail+0.020
izing+0.020
Neg logit contributions
 de-0.017
 lear-0.015
ives-0.013
 love-0.012
joy-0.012
Token didn
Token ID1422
Feature activation+2.64e+00

Pos logit contributions
 avail+0.043
 site+0.041
lf+0.039
 myself+0.038
ation+0.037
Neg logit contributions
 de-0.036
 lear-0.027
 were-0.027
ives-0.025
 love-0.025
Token't
Token ID470
Feature activation-7.90e-01

Pos logit contributions
lf+0.049
 avail+0.044
 site+0.043
izing+0.041
ering+0.040
Neg logit contributions
 de-0.064
 lear-0.060
ives-0.057
 love-0.053
Che-0.050
Token "
Token ID366
Feature activation-2.07e+00

Pos logit contributions
lf+0.043
 site+0.037
ered+0.036
izing+0.035
 avail+0.035
Neg logit contributions
 de-0.044
 lear-0.039
ives-0.036
joy-0.033
 love-0.033
TokenDo
Token ID5211
Feature activation+1.03e+00

Pos logit contributions
 de+0.051
 lear+0.043
ives+0.040
joy+0.037
rav+0.037
Neg logit contributions
lf-0.047
 site-0.041
 me-0.039
ered-0.039
izing-0.037
Token you
Token ID345
Feature activation-1.84e+00

Pos logit contributions
lf+0.015
 site+0.013
ered+0.012
 avail+0.012
izing+0.011
Neg logit contributions
 de-0.031
 lear-0.028
rav-0.027
 love-0.027
ives-0.027
Token want
Token ID765
Feature activation+3.44e+00

Pos logit contributions
 de+0.031
 lear+0.026
ives+0.023
joy+0.021
rav+0.021
Neg logit contributions
lf-0.053
ered-0.049
 site-0.047
 me-0.046
 happen-0.046
Token to
Token ID284
Feature activation+2.55e+00

Pos logit contributions
lf+0.106
izing+0.097
ered+0.094
 avail+0.093
ering+0.092
Neg logit contributions
 de-0.046
 love-0.044
 lear-0.039
ives-0.037
 names-0.037
Token go
Token ID467
Feature activation+5.84e+00

Pos logit contributions
lf+0.066
izing+0.062
 site+0.060
ering+0.060
ths+0.057
Neg logit contributions
 love-0.045
 lear-0.042
 like-0.039
 de-0.039
ives-0.037
Token on
Token ID319
Feature activation+3.48e+00

Pos logit contributions
lf+0.163
izing+0.146
 avail+0.140
 site+0.140
ered+0.138
Neg logit contributions
 de-0.092
 like-0.090
 love-0.087
 lear-0.085
ives-0.078
Token an
Token ID281
Feature activation+2.69e+00

Pos logit contributions
lf+0.105
ered+0.099
izing+0.098
ering+0.095
izes+0.091
Neg logit contributions
 de-0.047
 lear-0.036
joy-0.036
 love-0.035
 hope-0.032
Token adventure
Token ID8855
Feature activation+4.80e+00

Pos logit contributions
izing+0.062
lf+0.056
 avail+0.055
ize+0.053
izes+0.053
Neg logit contributions
 lear-0.048
 de-0.047
 love-0.042
 like-0.041
 hope-0.036
Token?"
Token ID1701
Feature activation+6.78e-01

Pos logit contributions
lf+0.065
ered+0.058
 avail+0.051
izing+0.051
 site+0.050
Neg logit contributions
 de-0.141
 lear-0.133
 like-0.126
 love-0.123
ives-0.119
Token Joe
Token ID5689
Feature activation+1.40e+00

Pos logit contributions
lf+0.014
 site+0.012
ered+0.012
izing+0.012
 avail+0.011
Neg logit contributions
 de-0.017
 lear-0.015
ives-0.014
joy-0.013
 love-0.013
Token Smith
Token ID4176
Feature activation+1.91e+00

Pos logit contributions
 de+0.030
 lear+0.026
ives+0.024
 love+0.022
bows+0.021
Neg logit contributions
lf-0.036
 site-0.032
ered-0.030
 avail-0.030
 me-0.029
Token!
Token ID0
Feature activation-9.67e-01

Pos logit contributions
lf+0.038
 site+0.033
ered+0.033
 avail+0.032
izing+0.030
Neg logit contributions
 de-0.052
 lear-0.043
ives-0.041
 love-0.041
bows-0.039
Token Where
Token ID6350
Feature activation+1.11e-01

Pos logit contributions
 de+0.024
 lear+0.022
ives+0.021
bows+0.020
 hope+0.019
Neg logit contributions
lf-0.018
 me-0.015
 site-0.015
?"-0.014
 my-0.014
Token are
Token ID389
Feature activation+2.97e+00

Pos logit contributions
lf+0.003
 site+0.003
ered+0.003
 avail+0.003
 happen+0.003
Neg logit contributions
 de-0.002
 lear-0.001
ives-0.001
joy-0.001
bows-0.001
Token you
Token ID345
Feature activation+2.57e-02

Pos logit contributions
lf+0.049
 site+0.042
ered+0.039
izing+0.036
 avail+0.035
Neg logit contributions
 de-0.086
 lear-0.075
rav-0.072
 love-0.071
ives-0.070
Token going
Token ID1016
Feature activation+5.82e+00

Pos logit contributions
lf+0.001
 site+0.001
 avail+0.001
izing+0.001
ered+0.001
Neg logit contributions
 de-0.001
 lear-0.000
 love-0.000
joy-0.000
ives-0.000
Token?"
Token ID1701
Feature activation+1.52e+00

Pos logit contributions
lf+0.095
izing+0.081
 site+0.074
ered+0.073
 avail+0.072
Neg logit contributions
 de-0.155
 like-0.152
 lear-0.148
 love-0.146
ives-0.140
Token asked
Token ID1965
Feature activation+8.17e-01

Pos logit contributions
 site+0.034
lf+0.034
izing+0.033
ths+0.033
ered+0.033
Neg logit contributions
 de-0.034
Thank-0.028
 lear-0.027
 love-0.026
Let-0.024
Token Lily
Token ID20037
Feature activation-1.21e+00

Pos logit contributions
lf+0.021
 site+0.019
izing+0.018
ered+0.018
 avail+0.017
Neg logit contributions
 de-0.017
 lear-0.014
ives-0.012
 love-0.012
joy-0.012
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
 de+0.023
 lear+0.020
 love+0.018
joy+0.018
ives+0.017
Neg logit contributions
lf-0.031
 site-0.029
izing-0.028
 avail-0.027
ered-0.026
Token
Token ID198
Feature activation+1.74e+00

Pos logit contributions
lf+0.030
 site+0.027
ered+0.027
 avail+0.026
izing+0.026
Neg logit contributions
 de-0.021
 lear-0.018
ives-0.017
 love-0.015
bows-0.015
Token friends
Token ID2460
Feature activation-9.63e-01

Pos logit contributions
lf+0.035
 site+0.030
izing+0.027
 avail+0.027
ered+0.027
Neg logit contributions
 de-0.056
 lear-0.051
ives-0.048
 love-0.048
 hope-0.046
Token,
Token ID11
Feature activation+2.56e-01

Pos logit contributions
 de+0.022
 lear+0.020
ives+0.018
 love+0.017
bows+0.017
Neg logit contributions
lf-0.022
 site-0.019
ered-0.019
 avail-0.018
izing-0.018
Token "
Token ID366
Feature activation-1.94e+00

Pos logit contributions
lf+0.005
 me+0.005
 site+0.005
 wrong+0.005
ered+0.005
Neg logit contributions
 de-0.006
 lear-0.005
ives-0.005
bows-0.005
rav-0.005
TokenCan
Token ID6090
Feature activation+2.44e+00

Pos logit contributions
 de+0.045
 lear+0.038
ives+0.036
joy+0.033
bows+0.033
Neg logit contributions
lf-0.046
 me-0.042
 site-0.040
ered-0.039
izing-0.039
Token I
Token ID314
Feature activation+1.70e+00

Pos logit contributions
lf+0.050
 site+0.047
ered+0.042
 avail+0.041
izing+0.041
Neg logit contributions
 de-0.059
 lear-0.052
 love-0.051
rav-0.051
Thank-0.047
Token go
Token ID467
Feature activation+5.82e+00

Pos logit contributions
lf+0.041
 site+0.038
izing+0.035
ered+0.034
 avail+0.033
Neg logit contributions
 de-0.036
 lear-0.033
ives-0.031
 love-0.031
rav-0.029
Token fly
Token ID6129
Feature activation+3.78e+00

Pos logit contributions
lf+0.157
 site+0.141
izing+0.134
ered+0.131
 avail+0.131
Neg logit contributions
 de-0.104
 lear-0.097
 love-0.096
 like-0.092
ives-0.088
Token by
Token ID416
Feature activation+2.20e+00

Pos logit contributions
lf+0.089
 site+0.079
ered+0.076
 avail+0.073
izing+0.072
Neg logit contributions
 de-0.084
 lear-0.075
ives-0.068
 love-0.068
 like-0.065
Token myself
Token ID3589
Feature activation+4.51e+00

Pos logit contributions
lf+0.043
 site+0.037
 avail+0.035
ered+0.033
ths+0.032
Neg logit contributions
 de-0.057
 lear-0.051
 love-0.049
 like-0.046
 hope-0.045
Token?"
Token ID1701
Feature activation+2.13e+00

Pos logit contributions
lf+0.080
ered+0.068
 site+0.066
 avail+0.062
izing+0.061
Neg logit contributions
 de-0.119
 lear-0.112
 like-0.110
ives-0.108
 love-0.104
Token
Token ID198
Feature activation+1.62e+00

Pos logit contributions
lf+0.065
 site+0.059
ered+0.057
 avail+0.057
izing+0.056
Neg logit contributions
 de-0.033
 lear-0.028
ives-0.024
 love-0.022
bows-0.021
Token asked
Token ID1965
Feature activation+3.47e-01

Pos logit contributions
lf+0.052
 avail+0.050
 site+0.048
izing+0.047
 happen+0.045
Neg logit contributions
 de-0.050
 lear-0.044
ives-0.040
 love-0.038
 hope-0.035
Token "
Token ID366
Feature activation-2.09e+00

Pos logit contributions
lf+0.006
 site+0.005
ered+0.005
 me+0.005
izing+0.005
Neg logit contributions
 de-0.010
 lear-0.009
ives-0.008
bows-0.008
rav-0.008
TokenWhere
Token ID8496
Feature activation-7.90e-02

Pos logit contributions
 de+0.057
 lear+0.048
ives+0.047
bows+0.044
isc+0.044
Neg logit contributions
lf-0.043
 me-0.036
 site-0.035
izing-0.033
ered-0.032
Token did
Token ID750
Feature activation+3.51e+00

Pos logit contributions
 de+0.001
 lear+0.001
ives+0.001
joy+0.001
 love+0.001
Neg logit contributions
lf-0.002
 site-0.002
ered-0.002
 avail-0.002
izing-0.002
Token you
Token ID345
Feature activation-4.49e-01

Pos logit contributions
lf+0.063
 site+0.062
izing+0.058
ered+0.058
ering+0.057
Neg logit contributions
 de-0.086
 love-0.074
 lear-0.071
 like-0.066
rav-0.065
Token go
Token ID467
Feature activation+5.81e+00

Pos logit contributions
 de+0.007
 lear+0.007
 love+0.007
're+0.007
ives+0.007
Neg logit contributions
lf-0.012
 site-0.011
izing-0.011
 avail-0.010
ths-0.010
Token,
Token ID11
Feature activation-2.51e-02

Pos logit contributions
lf+0.130
 avail+0.114
izing+0.110
 site+0.107
ered+0.105
Neg logit contributions
 like-0.123
 love-0.114
 de-0.111
're-0.105
 lear-0.104
Token daughter
Token ID4957
Feature activation+2.74e+00

Pos logit contributions
 de+0.001
 lear+0.001
ives+0.000
 love+0.000
rav+0.000
Neg logit contributions
lf-0.001
 site-0.000
ered-0.000
 avail-0.000
izing-0.000
Token?
Token ID30
Feature activation+5.29e-01

Pos logit contributions
lf+0.033
 site+0.026
izing+0.024
ered+0.023
 avail+0.022
Neg logit contributions
 de-0.093
 lear-0.086
ives-0.081
 love-0.080
 hope-0.077
Token I
Token ID314
Feature activation-1.17e+00

Pos logit contributions
lf+0.011
 site+0.010
ered+0.009
 me+0.009
izing+0.009
Neg logit contributions
 de-0.013
 lear-0.011
ives-0.011
joy-0.010
bows-0.010
Token miss
Token ID2051
Feature activation+3.08e+00

Pos logit contributions
 de+0.027
 lear+0.024
ives+0.023
joy+0.022
bows+0.021
Neg logit contributions
lf-0.025
ered-0.023
 me-0.021
 site-0.021
 happen-0.021
Token "
Token ID366
Feature activation-2.04e+00

Pos logit contributions
 de+0.013
 lear+0.011
ives+0.011
bows+0.010
rav+0.010
Neg logit contributions
lf-0.014
 site-0.013
ered-0.012
izing-0.012
 me-0.012
TokenDo
Token ID5211
Feature activation+7.00e-01

Pos logit contributions
 de+0.053
 lear+0.044
ives+0.043
bows+0.041
isc+0.039
Neg logit contributions
lf-0.045
 me-0.038
 site-0.037
 avail-0.035
izing-0.035
Token you
Token ID345
Feature activation-2.44e+00

Pos logit contributions
lf+0.010
 site+0.009
ered+0.008
 avail+0.008
izing+0.007
Neg logit contributions
 de-0.022
 lear-0.019
rav-0.018
ives-0.018
 love-0.018
Token want
Token ID765
Feature activation+3.38e+00

Pos logit contributions
 de+0.041
 lear+0.033
ives+0.029
joy+0.027
bows+0.027
Neg logit contributions
lf-0.068
ered-0.067
 me-0.063
 happen-0.062
?"-0.061
Token to
Token ID284
Feature activation+2.70e+00

Pos logit contributions
lf+0.109
izing+0.103
 site+0.099
 avail+0.096
ered+0.095
Neg logit contributions
 love-0.042
 de-0.038
 lear-0.034
 like-0.032
 rib-0.031
Token go
Token ID467
Feature activation+5.80e+00

Pos logit contributions
lf+0.077
 site+0.070
izing+0.069
ering+0.063
 avail+0.063
Neg logit contributions
 love-0.042
 lear-0.041
 de-0.040
 like-0.035
 hope-0.035
Token down
Token ID866
Feature activation+4.07e+00

Pos logit contributions
lf+0.164
izing+0.147
 site+0.143
 avail+0.140
ered+0.134
Neg logit contributions
 de-0.089
 love-0.088
 like-0.087
 lear-0.085
ives-0.075
Token the
Token ID262
Feature activation-1.89e-01

Pos logit contributions
lf+0.099
ered+0.089
izing+0.089
ering+0.087
 site+0.085
Neg logit contributions
 de-0.076
 like-0.072
 lear-0.070
 love-0.069
 hope-0.063
Token slide
Token ID10649
Feature activation+3.82e+00

Pos logit contributions
 de+0.004
 lear+0.004
ives+0.003
 love+0.003
rav+0.003
Neg logit contributions
lf-0.004
ered-0.004
 site-0.004
 avail-0.004
izing-0.004
Token?"
Token ID1701
Feature activation+4.29e-01

Pos logit contributions
lf+0.059
ered+0.053
 avail+0.050
 site+0.049
izing+0.046
Neg logit contributions
 de-0.111
 lear-0.103
ives-0.095
 like-0.093
bows-0.093
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
lf+0.010
 site+0.009
ered+0.008
 me+0.008
 avail+0.008
Neg logit contributions
 de-0.010
 lear-0.009
ives-0.008
bows-0.007
rav-0.007
Token fun
Token ID1257
Feature activation+1.20e+00

Pos logit contributions
lf+0.006
 site+0.005
ered+0.005
 avail+0.005
izing+0.005
Neg logit contributions
 de-0.005
 lear-0.004
 love-0.004
ives-0.004
bows-0.004
Token!"
Token ID2474
Feature activation-6.51e-01

Pos logit contributions
lf+0.021
ered+0.018
 site+0.018
 avail+0.017
izing+0.017
Neg logit contributions
 de-0.034
 lear-0.030
ives-0.028
 love-0.027
bows-0.027
Token Anna
Token ID11735
Feature activation-1.52e+00

Pos logit contributions
 de+0.015
 lear+0.013
ives+0.012
 love+0.011
joy+0.011
Neg logit contributions
lf-0.015
 site-0.013
ered-0.013
izing-0.013
 avail-0.013
Token says
Token ID1139
Feature activation-1.92e+00

Pos logit contributions
 de+0.045
 lear+0.041
ives+0.039
rav+0.037
joy+0.037
Neg logit contributions
lf-0.024
 site-0.020
ered-0.019
 me-0.018
 avail-0.017
Token.
Token ID13
Feature activation-3.64e+00

Pos logit contributions
 de+0.048
 lear+0.043
ives+0.040
 love+0.037
rav+0.037
Neg logit contributions
lf-0.040
 site-0.034
ered-0.033
 avail-0.032
izing-0.032
Token "
Token ID366
Feature activation-6.98e+00

Pos logit contributions
 de+0.087
 lear+0.078
ives+0.074
bows+0.072
rav+0.070
Neg logit contributions
lf-0.070
 me-0.064
 site-0.061
 first-0.057
?"-0.056
TokenLook
Token ID8567
Feature activation+1.85e+00

Pos logit contributions
 de+0.167
isc+0.153
joy+0.146
 lear+0.144
ives+0.131
Neg logit contributions
 myself-0.129
lf-0.128
izing-0.115
 site-0.111
 happen-0.108
Token at
Token ID379
Feature activation+1.19e+00

Pos logit contributions
lf+0.043
 site+0.042
 avail+0.037
roads+0.037
izing+0.036
Neg logit contributions
 de-0.042
 lear-0.034
 love-0.032
rav-0.031
 like-0.030
Token my
Token ID616
Feature activation+4.99e-01

Pos logit contributions
lf+0.021
 avail+0.017
 site+0.017
ered+0.017
roads+0.016
Neg logit contributions
 de-0.035
 lear-0.030
 love-0.029
 hope-0.027
 like-0.027
Token bubble
Token ID14310
Feature activation+1.28e+00

Pos logit contributions
lf+0.012
ered+0.011
 site+0.011
izing+0.010
 avail+0.010
Neg logit contributions
 de-0.011
 lear-0.010
ives-0.009
 love-0.009
joy-0.008
Token,
Token ID11
Feature activation-1.51e+00

Pos logit contributions
lf+0.025
 site+0.023
ered+0.022
 avail+0.021
izing+0.020
Neg logit contributions
 de-0.033
 lear-0.029
ives-0.028
bows-0.026
 love-0.026
TokenAmazing
Token ID42770
Feature activation+2.68e+00

Pos logit contributions
 de+0.083
ives+0.071
isc+0.069
 lear+0.068
joy+0.068
Neg logit contributions
lf-0.097
izing-0.090
 myself-0.087
 site-0.086
 prove-0.083
Token,"
Token ID553
Feature activation-1.48e+00

Pos logit contributions
lf+0.049
 site+0.041
 avail+0.039
ered+0.037
izing+0.036
Neg logit contributions
 de-0.076
 lear-0.064
bows-0.058
ives-0.058
 love-0.057
Token Ben
Token ID3932
Feature activation-1.32e+00

Pos logit contributions
 de+0.036
 lear+0.033
ives+0.032
rav+0.030
bows+0.029
Neg logit contributions
lf-0.030
 site-0.026
 me-0.025
 avail-0.024
ered-0.024
Token said
Token ID531
Feature activation-1.27e+00

Pos logit contributions
 de+0.038
 lear+0.034
ives+0.032
 love+0.030
bows+0.030
Neg logit contributions
lf-0.023
 site-0.020
ered-0.019
 avail-0.018
izing-0.018
Token.
Token ID13
Feature activation-3.15e+00

Pos logit contributions
 de+0.031
 lear+0.028
ives+0.026
 love+0.024
joy+0.024
Neg logit contributions
lf-0.028
 site-0.024
ered-0.023
izing-0.022
 avail-0.022
Token "
Token ID366
Feature activation-6.84e+00

Pos logit contributions
 de+0.071
 lear+0.065
ives+0.059
bows+0.056
rav+0.056
Neg logit contributions
lf-0.071
 site-0.064
 me-0.058
ered-0.057
izing-0.056
TokenThis
Token ID1212
Feature activation+7.27e-01

Pos logit contributions
 de+0.176
isc+0.152
 lear+0.150
joy+0.147
ives+0.132
Neg logit contributions
 myself-0.131
lf-0.128
 site-0.115
ize-0.110
ble-0.110
Token is
Token ID318
Feature activation+8.26e-01

Pos logit contributions
lf+0.013
 site+0.011
ered+0.011
 happen+0.010
 me+0.010
Neg logit contributions
 de-0.020
 lear-0.019
ives-0.018
joy-0.016
rav-0.016
Token a
Token ID257
Feature activation+4.64e-02

Pos logit contributions
lf+0.018
 site+0.016
ered+0.015
izing+0.014
 avail+0.014
Neg logit contributions
 de-0.021
 lear-0.018
ives-0.016
 love-0.016
bows-0.015
Token treasure
Token ID14068
Feature activation+6.16e-01

Pos logit contributions
lf+0.001
 site+0.001
ered+0.001
 avail+0.001
izing+0.001
Neg logit contributions
 de-0.001
 lear-0.001
ives-0.001
bows-0.001
rav-0.001
Token."
Token ID526
Feature activation-3.17e+00

Pos logit contributions
lf+0.013
ered+0.012
 avail+0.012
 site+0.012
izing+0.011
Neg logit contributions
 de-0.015
 lear-0.013
ives-0.012
 love-0.011
bows-0.011
Token pretty
Token ID2495
Feature activation-8.57e-01

Pos logit contributions
lf+0.007
 site+0.006
ered+0.006
izing+0.006
 avail+0.006
Neg logit contributions
 de-0.008
 lear-0.007
ives-0.006
joy-0.006
bows-0.006
Token!"
Token ID2474
Feature activation-8.62e-01

Pos logit contributions
 de+0.025
 lear+0.022
ives+0.021
bows+0.020
 love+0.020
Neg logit contributions
lf-0.014
 site-0.012
ered-0.012
 avail-0.011
izing-0.011
Token Ben
Token ID3932
Feature activation-1.29e+00

Pos logit contributions
 de+0.019
 lear+0.016
ives+0.015
 love+0.014
 hope+0.013
Neg logit contributions
lf-0.021
 site-0.019
ered-0.018
izing-0.018
 avail-0.018
Token says
Token ID1139
Feature activation-1.79e+00

Pos logit contributions
 de+0.039
 lear+0.036
ives+0.033
rav+0.032
joy+0.032
Neg logit contributions
lf-0.020
 site-0.016
ered-0.016
 me-0.015
 avail-0.014
Token.
Token ID13
Feature activation-3.29e+00

Pos logit contributions
 de+0.041
 lear+0.036
ives+0.033
 love+0.031
joy+0.031
Neg logit contributions
lf-0.043
 site-0.037
ered-0.035
izing-0.035
 avail-0.035
Token "
Token ID366
Feature activation-6.84e+00

Pos logit contributions
 de+0.074
 lear+0.067
ives+0.063
bows+0.060
rav+0.057
Neg logit contributions
lf-0.075
 site-0.067
 me-0.062
ered-0.060
izing-0.059
TokenYou
Token ID1639
Feature activation-2.68e+00

Pos logit contributions
 de+0.164
isc+0.152
 lear+0.148
joy+0.148
ives+0.136
Neg logit contributions
 myself-0.128
lf-0.126
izing-0.112
 site-0.107
 happen-0.105
Token can
Token ID460
Feature activation+1.22e+00

Pos logit contributions
 de+0.054
 lear+0.048
joy+0.044
ives+0.041
rav+0.040
Neg logit contributions
lf-0.066
ered-0.063
 me-0.060
 happen-0.058
 site-0.058
Token wear
Token ID5806
Feature activation-9.77e-02

Pos logit contributions
lf+0.028
 site+0.025
ered+0.023
izing+0.023
 avail+0.023
Neg logit contributions
 de-0.028
 lear-0.025
ives-0.023
 love-0.022
bows-0.021
Token it
Token ID340
Feature activation+5.79e-01

Pos logit contributions
 de+0.003
 lear+0.002
ives+0.002
 love+0.002
bows+0.002
Neg logit contributions
lf-0.002
 site-0.002
ered-0.001
 avail-0.001
izing-0.001
Token,
Token ID11
Feature activation-1.13e+00

Pos logit contributions
lf+0.012
 site+0.011
ered+0.011
 avail+0.010
izing+0.010
Neg logit contributions
 de-0.014
 lear-0.013
ives-0.012
 love-0.011
bows-0.011
Token Tom
Token ID4186
Feature activation+4.25e-01

Pos logit contributions
 de+0.024
 lear+0.021
ives+0.020
 love+0.019
rav+0.018
Neg logit contributions
lf-0.025
 site-0.022
ered-0.022
izing-0.021
 avail-0.021
Token,"
Token ID553
Feature activation-6.96e-01

Pos logit contributions
lf+0.008
 site+0.007
ered+0.006
izing+0.006
 avail+0.006
Neg logit contributions
 de-0.012
 lear-0.010
ives-0.010
 love-0.009
bows-0.009
Token Lily
Token ID20037
Feature activation-1.40e+00

Pos logit contributions
 de+0.020
ives+0.018
 lear+0.018
bows+0.017
joy+0.016
Neg logit contributions
lf-0.011
 me-0.009
 my-0.009
 first-0.009
 site-0.009
Token said
Token ID531
Feature activation-1.87e+00

Pos logit contributions
 de+0.044
 lear+0.041
ives+0.039
joy+0.037
rav+0.037
Neg logit contributions
lf-0.019
 site-0.015
ered-0.014
 me-0.014
 avail-0.012
Token.
Token ID13
Feature activation-3.78e+00

Pos logit contributions
 de+0.050
 lear+0.045
ives+0.042
bows+0.039
rav+0.039
Neg logit contributions
lf-0.035
 site-0.030
ered-0.030
izing-0.027
 avail-0.027
Token "
Token ID366
Feature activation-6.83e+00

Pos logit contributions
 de+0.089
 lear+0.082
ives+0.074
rav+0.072
bows+0.070
Neg logit contributions
lf-0.078
 site-0.070
 me-0.066
izing-0.064
 wrong-0.061
TokenYou
Token ID1639
Feature activation-2.54e+00

Pos logit contributions
 de+0.199
isc+0.178
 lear+0.162
joy+0.159
 spe+0.151
Neg logit contributions
lf-0.103
 myself-0.102
 first-0.096
izing-0.087
 me-0.086
Token are
Token ID389
Feature activation-6.14e-01

Pos logit contributions
 de+0.054
joy+0.048
 lear+0.047
ives+0.039
rav+0.038
Neg logit contributions
ered-0.057
 me-0.057
?"-0.056
lf-0.055
 happen-0.053
Token my
Token ID616
Feature activation-1.95e+00

Pos logit contributions
 de+0.017
 lear+0.015
ives+0.014
 love+0.013
bows+0.013
Neg logit contributions
lf-0.012
 site-0.010
ered-0.010
izing-0.009
 avail-0.009
Token friend
Token ID1545
Feature activation-6.56e-01

Pos logit contributions
 de+0.047
 lear+0.042
ives+0.040
bows+0.036
rav+0.036
Neg logit contributions
lf-0.042
 site-0.036
ered-0.035
 avail-0.034
izing-0.034
Token."
Token ID526
Feature activation-2.53e+00

Pos logit contributions
 de+0.016
 lear+0.015
ives+0.014
 love+0.013
bows+0.013
Neg logit contributions
lf-0.014
 site-0.012
ered-0.012
 avail-0.011
izing-0.011
Token friends
Token ID2460
Feature activation-8.35e-01

Pos logit contributions
lf+0.002
 site+0.002
ered+0.002
izing+0.002
 avail+0.002
Neg logit contributions
 de-0.002
 lear-0.002
 love-0.002
ives-0.002
 hope-0.002
Token,"
Token ID553
Feature activation-1.18e+00

Pos logit contributions
 de+0.023
 lear+0.021
ives+0.020
 love+0.019
bows+0.019
Neg logit contributions
lf-0.015
 site-0.013
ered-0.013
 avail-0.012
izing-0.011
Token Lily
Token ID20037
Feature activation-1.47e+00

Pos logit contributions
 de+0.030
 lear+0.028
ives+0.027
bows+0.025
rav+0.024
Neg logit contributions
lf-0.022
 me-0.019
 avail-0.018
 site-0.018
ered-0.018
Token says
Token ID1139
Feature activation-2.50e+00

Pos logit contributions
 de+0.048
 lear+0.045
free+0.042
rav+0.042
joy+0.041
Neg logit contributions
lf-0.015
 me-0.014
 site-0.012
ered-0.012
 happen-0.011
Token.
Token ID13
Feature activation-3.78e+00

Pos logit contributions
 de+0.066
 lear+0.060
ives+0.056
bows+0.052
joy+0.052
Neg logit contributions
lf-0.047
 site-0.040
ered-0.039
izing-0.037
 avail-0.037
Token "
Token ID366
Feature activation-6.82e+00

Pos logit contributions
 de+0.090
 lear+0.083
bows+0.075
ives+0.075
isc+0.074
Neg logit contributions
lf-0.069
 me-0.066
 site-0.062
 first-0.059
izing-0.059
TokenBut
Token ID1537
Feature activation+4.51e-01

Pos logit contributions
 de+0.176
isc+0.159
 lear+0.140
joy+0.130
rol+0.124
Neg logit contributions
 myself-0.127
 first-0.118
lf-0.117
izing-0.112
 me-0.107
Token you
Token ID345
Feature activation-2.36e+00

Pos logit contributions
lf+0.008
 site+0.007
ered+0.006
 avail+0.006
izing+0.006
Neg logit contributions
 de-0.013
 lear-0.012
 love-0.011
rav-0.011
ives-0.011
Token have
Token ID423
Feature activation-4.03e-01

Pos logit contributions
 de+0.055
 lear+0.049
joy+0.043
ives+0.042
rav+0.041
Neg logit contributions
lf-0.054
ered-0.047
 site-0.047
 me-0.046
 happen-0.045
Token to
Token ID284
Feature activation+5.12e-01

Pos logit contributions
 de+0.010
 lear+0.009
ives+0.008
bows+0.007
rav+0.007
Neg logit contributions
lf-0.009
 site-0.008
ered-0.007
 avail-0.007
izing-0.007
Token wait
Token ID4043
Feature activation+1.28e+00

Pos logit contributions
lf+0.008
 site+0.007
ered+0.007
 avail+0.006
 me+0.006
Neg logit contributions
 de-0.015
 lear-0.014
ives-0.013
joy-0.012
rav-0.012
Token lucky
Token ID9670
Feature activation+1.70e+00

Pos logit contributions
 de+0.005
 lear+0.005
 love+0.004
ives+0.004
rav+0.004
Neg logit contributions
lf-0.007
ered-0.006
 site-0.006
 avail-0.006
izing-0.006
Token!"
Token ID2474
Feature activation-6.79e-01

Pos logit contributions
lf+0.029
 site+0.025
ered+0.024
 avail+0.023
izing+0.023
Neg logit contributions
 de-0.049
 lear-0.044
ives-0.041
 love-0.039
rav-0.038
Token Mia
Token ID32189
Feature activation-8.60e-01

Pos logit contributions
 de+0.020
 lear+0.018
ives+0.018
bows+0.017
rav+0.016
Neg logit contributions
lf-0.011
 site-0.009
ered-0.009
 avail-0.008
 me-0.008
Token says
Token ID1139
Feature activation-2.10e+00

Pos logit contributions
 de+0.026
 lear+0.024
ives+0.022
joy+0.022
rav+0.022
Neg logit contributions
lf-0.012
 me-0.010
ered-0.010
 site-0.010
 happen-0.009
Token.
Token ID13
Feature activation-3.41e+00

Pos logit contributions
 de+0.056
 lear+0.052
ives+0.048
bows+0.046
rav+0.045
Neg logit contributions
lf-0.038
 site-0.033
ered-0.033
 happen-0.030
izing-0.030
Token "
Token ID366
Feature activation-6.82e+00

Pos logit contributions
 de+0.074
 lear+0.069
ives+0.064
rav+0.060
bows+0.059
Neg logit contributions
lf-0.076
 site-0.071
 me-0.065
ered-0.061
izing-0.060
TokenBut
Token ID1537
Feature activation+8.86e-01

Pos logit contributions
 de+0.164
isc+0.145
 lear+0.139
ives+0.131
joy+0.124
Neg logit contributions
lf-0.149
 myself-0.136
 site-0.129
izing-0.124
ize-0.120
Token where
Token ID810
Feature activation+1.26e+00

Pos logit contributions
lf+0.018
 site+0.015
 avail+0.014
ered+0.014
izing+0.013
Neg logit contributions
 de-0.023
 lear-0.020
 love-0.020
 hope-0.019
rav-0.018
Token is
Token ID318
Feature activation+2.70e+00

Pos logit contributions
lf+0.033
 site+0.029
izing+0.028
ered+0.028
 avail+0.028
Neg logit contributions
 de-0.025
 lear-0.021
 love-0.019
ives-0.019
 hope-0.018
Token our
Token ID674
Feature activation+4.11e-01

Pos logit contributions
lf+0.067
 site+0.055
ered+0.054
 avail+0.053
izing+0.053
Neg logit contributions
 de-0.063
 love-0.049
 lear-0.048
 like-0.046
 hope-0.044
Token rocket
Token ID10701
Feature activation+2.87e+00

Pos logit contributions
lf+0.009
ered+0.008
 site+0.008
izing+0.007
 avail+0.007
Neg logit contributions
 de-0.010
 lear-0.009
 love-0.008
ives-0.008
joy-0.008
Token is
Token ID318
Feature activation+2.97e-01

Pos logit contributions
 de+0.012
 lear+0.010
ives+0.010
bows+0.009
 love+0.009
Neg logit contributions
lf-0.010
 site-0.009
 avail-0.009
izing-0.008
ered-0.008
Token,"
Token ID553
Feature activation-9.41e-01

Pos logit contributions
lf+0.006
 site+0.006
 avail+0.005
izing+0.005
ths+0.005
Neg logit contributions
 de-0.008
 love-0.006
 lear-0.006
bows-0.006
 like-0.006
Token Ben
Token ID3932
Feature activation-1.30e+00

Pos logit contributions
 de+0.022
 lear+0.020
ives+0.019
bows+0.017
rav+0.017
Neg logit contributions
lf-0.020
 site-0.018
 me-0.017
 avail-0.017
ered-0.017
Token said
Token ID531
Feature activation-1.62e+00

Pos logit contributions
 de+0.039
 lear+0.036
ives+0.034
rav+0.032
joy+0.032
Neg logit contributions
lf-0.020
 site-0.016
ered-0.016
izing-0.014
 me-0.014
Token.
Token ID13
Feature activation-3.30e+00

Pos logit contributions
 de+0.040
 lear+0.036
ives+0.033
 love+0.031
bows+0.030
Neg logit contributions
lf-0.035
 site-0.030
ered-0.029
 avail-0.028
izing-0.028
Token "
Token ID366
Feature activation-6.79e+00

Pos logit contributions
 de+0.078
 lear+0.072
ives+0.066
bows+0.064
rav+0.064
Neg logit contributions
lf-0.068
 site-0.062
 me-0.057
izing-0.054
ered-0.053
TokenAnd
Token ID1870
Feature activation+1.02e+00

Pos logit contributions
 de+0.169
isc+0.150
 lear+0.148
joy+0.144
ives+0.133
Neg logit contributions
lf-0.127
 myself-0.125
 site-0.115
ble-0.108
web-0.104
Token the
Token ID262
Feature activation+1.92e-01

Pos logit contributions
lf+0.018
 site+0.015
ered+0.014
 avail+0.014
izing+0.014
Neg logit contributions
 de-0.029
 lear-0.026
ives-0.025
 love-0.023
rav-0.023
Token rain
Token ID6290
Feature activation-2.25e-02

Pos logit contributions
lf+0.004
 site+0.003
ered+0.003
 me+0.003
 happen+0.003
Neg logit contributions
 de-0.005
 lear-0.004
ives-0.004
joy-0.004
rav-0.004
Token is
Token ID318
Feature activation+1.10e+00

Pos logit contributions
 de+0.001
 lear+0.001
joy+0.000
ives+0.000
Che+0.000
Neg logit contributions
lf-0.000
 happen-0.000
 site-0.000
ered-0.000
 me-0.000
Token a
Token ID257
Feature activation-1.61e-01

Pos logit contributions
lf+0.025
 site+0.022
 avail+0.021
ered+0.021
izing+0.021
Neg logit contributions
 de-0.026
 lear-0.022
 love-0.021
ives-0.019
 hope-0.019
Token Tom
Token ID4186
Feature activation-2.76e-02

Pos logit contributions
 de+0.043
 lear+0.039
ives+0.036
bows+0.033
joy+0.033
Neg logit contributions
lf-0.043
 site-0.037
ered-0.036
 me-0.036
izing-0.035
Token,"
Token ID553
Feature activation-8.16e-01

Pos logit contributions
 de+0.001
 lear+0.001
ives+0.001
 love+0.001
bows+0.001
Neg logit contributions
lf-0.000
 site-0.000
ered-0.000
 avail-0.000
izing-0.000
Token Lily
Token ID20037
Feature activation-1.53e+00

Pos logit contributions
 de+0.021
 lear+0.019
ives+0.019
bows+0.017
rav+0.017
Neg logit contributions
lf-0.016
 site-0.014
ered-0.013
 me-0.013
 avail-0.013
Token says
Token ID1139
Feature activation-2.18e+00

Pos logit contributions
 de+0.049
 lear+0.046
joy+0.043
rav+0.043
ives+0.043
Neg logit contributions
lf-0.018
 me-0.015
 site-0.014
ered-0.014
 happen-0.013
Token.
Token ID13
Feature activation-3.99e+00

Pos logit contributions
 de+0.059
 lear+0.054
ives+0.050
bows+0.047
rav+0.047
Neg logit contributions
lf-0.040
 site-0.034
ered-0.034
 happen-0.031
izing-0.031
Token "
Token ID366
Feature activation-6.79e+00

Pos logit contributions
 de+0.094
 lear+0.087
ives+0.079
rav+0.077
bows+0.076
Neg logit contributions
lf-0.082
 site-0.073
 me-0.070
izing-0.067
ered-0.064
TokenBut
Token ID1537
Feature activation+6.32e-01

Pos logit contributions
 de+0.184
isc+0.165
 lear+0.161
joy+0.159
ives+0.147
Neg logit contributions
lf-0.108
 myself-0.105
 site-0.091
 first-0.091
izing-0.090
Token maybe
Token ID3863
Feature activation+1.33e+00

Pos logit contributions
lf+0.011
 site+0.010
ered+0.009
 avail+0.009
izing+0.009
Neg logit contributions
 de-0.018
 lear-0.016
 love-0.014
ives-0.014
rav-0.014
Token it
Token ID340
Feature activation+1.05e+00

Pos logit contributions
lf+0.027
 site+0.024
ered+0.023
izing+0.022
ation+0.021
Neg logit contributions
 de-0.034
 lear-0.029
 love-0.028
 hope-0.026
ives-0.026
Token is
Token ID318
Feature activation+1.46e+00

Pos logit contributions
lf+0.022
 site+0.019
ered+0.019
 avail+0.018
izing+0.018
Neg logit contributions
 de-0.026
 lear-0.023
ives-0.021
 love-0.020
bows-0.020
Token someone
Token ID2130
Feature activation+7.20e-01

Pos logit contributions
lf+0.032
 site+0.027
ered+0.027
izing+0.026
 avail+0.025
Neg logit contributions
 de-0.037
 lear-0.031
 love-0.029
ives-0.028
bows-0.027
Token first
Token ID717
Feature activation+2.71e+00

Pos logit contributions
lf+0.088
izing+0.078
 avail+0.077
ered+0.074
 site+0.074
Neg logit contributions
 de-0.074
 love-0.067
 lear-0.067
 like-0.065
 hope-0.063
Token,"
Token ID553
Feature activation-6.96e-01

Pos logit contributions
lf+0.053
ered+0.046
 site+0.045
izing+0.045
 avail+0.043
Neg logit contributions
 de-0.068
 lear-0.061
 love-0.059
 like-0.056
 hope-0.055
Token Ben
Token ID3932
Feature activation-1.01e+00

Pos logit contributions
 de+0.018
 lear+0.016
ives+0.015
bows+0.014
joy+0.014
Neg logit contributions
lf-0.014
 site-0.012
ered-0.012
 avail-0.011
 me-0.011
Token says
Token ID1139
Feature activation-1.75e+00

Pos logit contributions
 de+0.031
 lear+0.028
ives+0.027
joy+0.025
rav+0.025
Neg logit contributions
lf-0.015
 site-0.012
ered-0.012
 me-0.011
 avail-0.011
Token.
Token ID13
Feature activation-3.54e+00

Pos logit contributions
 de+0.043
 lear+0.039
ives+0.036
 love+0.034
bows+0.033
Neg logit contributions
lf-0.037
 site-0.032
ered-0.031
izing-0.030
 avail-0.030
Token "
Token ID366
Feature activation-6.77e+00

Pos logit contributions
 de+0.072
 lear+0.065
ives+0.060
bows+0.056
rav+0.055
Neg logit contributions
lf-0.088
 site-0.079
 me-0.074
ered-0.073
 avail-0.071
TokenBut
Token ID1537
Feature activation+9.85e-01

Pos logit contributions
 de+0.182
isc+0.153
 lear+0.143
bows+0.134
joy+0.134
Neg logit contributions
lf-0.121
 first-0.118
 myself-0.117
 site-0.108
 me-0.104
Token I
Token ID314
Feature activation-2.41e+00

Pos logit contributions
lf+0.016
 site+0.014
ered+0.013
izing+0.013
 avail+0.012
Neg logit contributions
 de-0.028
 lear-0.026
 love-0.024
 hope-0.023
rav-0.023
Token dare
Token ID16498
Feature activation+3.38e+00

Pos logit contributions
 de+0.057
 lear+0.047
bows+0.044
joy+0.044
ives+0.044
Neg logit contributions
lf-0.046
 first-0.046
?"-0.045
 prove-0.044
 me-0.043
Token you
Token ID345
Feature activation-1.36e-01

Pos logit contributions
lf+0.069
 site+0.062
izing+0.057
 avail+0.055
ered+0.055
Neg logit contributions
 de-0.082
 love-0.075
 lear-0.074
 like-0.068
rav-0.067
Token to
Token ID284
Feature activation+1.72e+00

Pos logit contributions
 de+0.003
 lear+0.002
ives+0.002
 love+0.002
 like+0.002
Neg logit contributions
lf-0.004
 site-0.003
izing-0.003
 avail-0.003
ered-0.003
Token Sara
Token ID24799
Feature activation-2.35e+00

Pos logit contributions
 de+0.032
 lear+0.031
ives+0.031
bows+0.030
free+0.029
Neg logit contributions
lf-0.023
 me-0.022
 first-0.020
 my-0.020
 site-0.020
Token and
Token ID290
Feature activation-1.81e+00

Pos logit contributions
 de+0.048
 lear+0.043
ives+0.038
rav+0.037
joy+0.036
Neg logit contributions
lf-0.056
 site-0.050
 me-0.050
ered-0.049
 happen-0.046
Token Ben
Token ID3932
Feature activation-2.21e+00

Pos logit contributions
 de+0.045
 lear+0.040
ives+0.039
bows+0.037
rav+0.036
Neg logit contributions
lf-0.035
 me-0.032
 avail-0.031
 site-0.029
ered-0.028
Token say
Token ID910
Feature activation-2.30e+00

Pos logit contributions
 de+0.064
 lear+0.057
joy+0.055
ives+0.053
rav+0.053
Neg logit contributions
lf-0.035
 me-0.030
ered-0.030
 happen-0.029
 site-0.028
Token.
Token ID13
Feature activation-3.65e+00

Pos logit contributions
 de+0.065
 lear+0.060
ives+0.056
joy+0.054
bows+0.054
Neg logit contributions
lf-0.037
ered-0.032
 site-0.029
 happen-0.029
 me-0.027
Token "
Token ID366
Feature activation-6.76e+00

Pos logit contributions
 de+0.083
 lear+0.079
ives+0.073
bows+0.071
isc+0.066
Neg logit contributions
lf-0.072
 me-0.067
 site-0.065
?"-0.061
 first-0.060
TokenWe
Token ID1135
Feature activation-3.39e+00

Pos logit contributions
 de+0.150
isc+0.128
 lear+0.121
joy+0.119
ige+0.109
Neg logit contributions
lf-0.146
 myself-0.145
ggy-0.130
 site-0.124
 first-0.123
Token are
Token ID389
Feature activation-5.67e-02

Pos logit contributions
 de+0.076
joy+0.066
ksh+0.059
 lear+0.057
pill+0.056
Neg logit contributions
?"-0.074
ered-0.073
 me-0.070
lf-0.068
 first-0.066
Token ready
Token ID3492
Feature activation+1.77e+00

Pos logit contributions
 de+0.001
 lear+0.001
ives+0.001
 love+0.001
bows+0.001
Neg logit contributions
lf-0.001
 site-0.001
ered-0.001
izing-0.001
 avail-0.001
Token!"
Token ID2474
Feature activation-2.34e+00

Pos logit contributions
lf+0.039
 site+0.034
ered+0.033
izing+0.032
 avail+0.032
Neg logit contributions
 de-0.042
 lear-0.038
ives-0.034
 love-0.033
rav-0.032
Token
Token ID198
Feature activation-4.21e-01

Pos logit contributions
 lear+0.058
bows+0.055
 de+0.054
ives+0.053
rav+0.049
Neg logit contributions
 me-0.040
lf-0.039
?"-0.039
 my-0.036
 site-0.034
Token dog
Token ID3290
Feature activation+7.70e-01

Pos logit contributions
lf+0.000
 site+0.000
 me+0.000
 avail+0.000
 happen+0.000
Neg logit contributions
 de-0.000
ives-0.000
 lear-0.000
joy-0.000
rav-0.000
Token!"
Token ID2474
Feature activation-1.07e+00

Pos logit contributions
lf+0.014
 site+0.012
ered+0.012
 avail+0.012
izing+0.011
Neg logit contributions
 de-0.022
 lear-0.019
ives-0.018
 love-0.017
bows-0.017
Token Ben
Token ID3932
Feature activation-9.53e-01

Pos logit contributions
 de+0.022
 lear+0.019
ives+0.017
 love+0.016
joy+0.016
Neg logit contributions
lf-0.027
 site-0.025
ered-0.024
izing-0.023
 avail-0.023
Token says
Token ID1139
Feature activation-1.44e+00

Pos logit contributions
 de+0.029
 lear+0.027
ives+0.026
joy+0.025
rav+0.025
Neg logit contributions
lf-0.013
 site-0.011
ered-0.010
 me-0.010
 happen-0.009
Token.
Token ID13
Feature activation-3.58e+00

Pos logit contributions
 de+0.038
 lear+0.034
ives+0.031
 love+0.029
joy+0.029
Neg logit contributions
lf-0.030
 site-0.026
 avail-0.024
ered-0.024
izing-0.023
Token "
Token ID366
Feature activation-6.75e+00

Pos logit contributions
 de+0.083
 lear+0.080
bows+0.073
ives+0.073
rav+0.073
Neg logit contributions
lf-0.071
 site-0.065
 me-0.063
?"-0.057
 first-0.056
TokenI
Token ID40
Feature activation-2.72e+00

Pos logit contributions
 de+0.169
isc+0.155
 lear+0.150
joy+0.143
ives+0.140
Neg logit contributions
lf-0.122
 myself-0.116
 me-0.102
izing-0.100
 happen-0.098
Token made
Token ID925
Feature activation+1.30e+00

Pos logit contributions
 de+0.050
 lear+0.041
joy+0.039
ives+0.038
rav+0.034
Neg logit contributions
lf-0.069
ered-0.066
 happen-0.064
 prove-0.063
 site-0.062
Token a
Token ID257
Feature activation+1.31e+00

Pos logit contributions
lf+0.032
 site+0.028
 avail+0.027
ered+0.027
izing+0.027
Neg logit contributions
 de-0.029
 lear-0.024
 love-0.022
ives-0.022
 hope-0.020
Token flower
Token ID15061
Feature activation+1.91e-01

Pos logit contributions
lf+0.027
 site+0.024
ered+0.023
 avail+0.022
izing+0.022
Neg logit contributions
 de-0.033
 lear-0.029
ives-0.028
bows-0.025
 love-0.025
Token.
Token ID13
Feature activation-1.79e+00

Pos logit contributions
lf+0.004
 site+0.004
ered+0.003
 avail+0.003
izing+0.003
Neg logit contributions
 de-0.005
 lear-0.004
ives-0.004
 love-0.004
bows-0.004
Token it
Token ID340
Feature activation+6.13e-01

Pos logit contributions
lf+0.010
 site+0.009
izing+0.009
ered+0.008
 avail+0.008
Neg logit contributions
 de-0.013
 love-0.011
 lear-0.011
 hope-0.010
 like-0.010
Token!"
Token ID2474
Feature activation-1.32e+00

Pos logit contributions
lf+0.013
 site+0.012
 avail+0.011
izing+0.011
roads+0.011
Neg logit contributions
 de-0.015
bows-0.012
 lear-0.012
 love-0.012
 like-0.011
Token Ben
Token ID3932
Feature activation-1.25e+00

Pos logit contributions
 de+0.029
 lear+0.026
ives+0.024
bows+0.022
rav+0.022
Neg logit contributions
lf-0.031
 site-0.027
ered-0.027
 avail-0.026
izing-0.025
Token says
Token ID1139
Feature activation-2.08e+00

Pos logit contributions
 de+0.037
 lear+0.034
ives+0.032
rav+0.030
joy+0.030
Neg logit contributions
lf-0.020
 site-0.016
ered-0.016
 avail-0.014
izing-0.014
Token.
Token ID13
Feature activation-3.35e+00

Pos logit contributions
 de+0.051
 lear+0.045
ives+0.042
 love+0.039
joy+0.039
Neg logit contributions
lf-0.046
 site-0.040
ered-0.038
izing-0.037
 avail-0.037
Token "
Token ID366
Feature activation-6.75e+00

Pos logit contributions
 de+0.072
 lear+0.065
ives+0.061
bows+0.059
rav+0.056
Neg logit contributions
lf-0.078
 site-0.071
 me-0.067
ered-0.064
izing-0.062
TokenWe
Token ID1135
Feature activation-3.28e+00

Pos logit contributions
 de+0.149
 lear+0.127
isc+0.126
joy+0.125
ives+0.111
Neg logit contributions
lf-0.155
 myself-0.142
 site-0.135
izing-0.130
 happen-0.129
Token are
Token ID389
Feature activation+2.45e-01

Pos logit contributions
 de+0.080
joy+0.068
 lear+0.065
rav+0.062
ives+0.059
Neg logit contributions
lf-0.069
ered-0.068
 me-0.061
?"-0.061
 happen-0.059
Token the
Token ID262
Feature activation-6.06e-01

Pos logit contributions
lf+0.006
 site+0.005
ered+0.005
izing+0.005
 avail+0.005
Neg logit contributions
 de-0.006
 lear-0.005
ives-0.005
 love-0.004
joy-0.004
Token best
Token ID1266
Feature activation+5.21e-01

Pos logit contributions
 de+0.016
ives+0.014
 lear+0.014
rav+0.013
joy+0.012
Neg logit contributions
lf-0.012
 site-0.010
 me-0.010
 wrong-0.010
 avail-0.010
Token spies
Token ID28801
Feature activation-1.25e+00

Pos logit contributions
lf+0.011
 site+0.009
ered+0.009
izing+0.009
 avail+0.009
Neg logit contributions
 de-0.013
 lear-0.012
ives-0.011
 love-0.010
bows-0.010
Token did
Token ID750
Feature activation+8.75e-01

Pos logit contributions
 de+0.094
 lear+0.084
joy+0.076
rav+0.073
ives+0.073
Neg logit contributions
lf-0.078
ered-0.072
 me-0.068
 happen-0.066
 site-0.066
Token!"
Token ID2474
Feature activation-9.72e-01

Pos logit contributions
lf+0.017
 site+0.015
 avail+0.014
izing+0.014
ths+0.014
Neg logit contributions
 de-0.023
 love-0.019
 lear-0.019
 hope-0.018
 like-0.017
Token Lily
Token ID20037
Feature activation-1.89e+00

Pos logit contributions
 de+0.026
 lear+0.024
ives+0.023
rav+0.022
bows+0.022
Neg logit contributions
lf-0.017
 site-0.015
 me-0.014
ered-0.014
 avail-0.013
Token said
Token ID531
Feature activation-1.74e+00

Pos logit contributions
 de+0.054
 lear+0.050
ives+0.046
rav+0.045
joy+0.044
Neg logit contributions
lf-0.032
 site-0.026
ered-0.025
 me-0.024
izing-0.023
Token.
Token ID13
Feature activation-3.50e+00

Pos logit contributions
 de+0.046
 lear+0.042
ives+0.039
rav+0.036
bows+0.036
Neg logit contributions
lf-0.033
 site-0.028
ered-0.028
 avail-0.026
izing-0.026
Token "
Token ID366
Feature activation-6.74e+00

Pos logit contributions
 de+0.083
 lear+0.077
ives+0.069
bows+0.068
rav+0.068
Neg logit contributions
lf-0.070
 site-0.064
 me-0.060
izing-0.055
 first-0.055
TokenBut
Token ID1537
Feature activation+8.63e-01

Pos logit contributions
 de+0.157
isc+0.135
joy+0.134
 lear+0.134
rav+0.119
Neg logit contributions
lf-0.148
 myself-0.140
 site-0.133
izing-0.127
web-0.124
Token we
Token ID356
Feature activation-2.71e+00

Pos logit contributions
lf+0.017
 site+0.014
 avail+0.014
ered+0.014
izing+0.013
Neg logit contributions
 de-0.022
 lear-0.019
 love-0.019
 hope-0.018
 rib-0.017
Token have
Token ID423
Feature activation-6.03e-01

Pos logit contributions
 de+0.065
 lear+0.058
joy+0.055
rav+0.052
ives+0.050
Neg logit contributions
lf-0.057
ered-0.054
 me-0.049
 site-0.048
 happen-0.048
Token to
Token ID284
Feature activation+3.28e-01

Pos logit contributions
 de+0.014
 lear+0.013
ives+0.012
 love+0.011
bows+0.011
Neg logit contributions
lf-0.013
 site-0.012
ered-0.011
 avail-0.011
izing-0.011
Token remember
Token ID3505
Feature activation+3.03e+00

Pos logit contributions
lf+0.006
 site+0.006
izing+0.005
ered+0.005
 avail+0.005
Neg logit contributions
 de-0.008
 lear-0.008
 love-0.007
ives-0.007
 hope-0.007
Token fish
Token ID5916
Feature activation-6.68e-01

Pos logit contributions
 de+0.038
 lear+0.034
ives+0.033
bows+0.031
rav+0.030
Neg logit contributions
lf-0.036
 site-0.031
 me-0.030
ered-0.029
izing-0.029
Token!"
Token ID2474
Feature activation-1.78e+00

Pos logit contributions
 de+0.020
 lear+0.019
ives+0.017
joy+0.016
rav+0.016
Neg logit contributions
lf-0.010
 site-0.008
ered-0.008
izing-0.008
 me-0.007
Token Ben
Token ID3932
Feature activation-1.54e+00

Pos logit contributions
 de+0.043
 lear+0.041
ives+0.040
rav+0.039
bows+0.038
Neg logit contributions
lf-0.033
 me-0.030
 site-0.029
 avail-0.026
?"-0.026
Token said
Token ID531
Feature activation-1.78e+00

Pos logit contributions
 de+0.048
 lear+0.045
ives+0.042
rav+0.041
joy+0.041
Neg logit contributions
lf-0.021
 me-0.016
 site-0.016
ered-0.015
?"-0.013
Token.
Token ID13
Feature activation-3.38e+00

Pos logit contributions
 de+0.048
 lear+0.044
ives+0.041
bows+0.039
rav+0.038
Neg logit contributions
lf-0.032
ered-0.028
 site-0.027
 happen-0.025
izing-0.025
Token "
Token ID366
Feature activation-6.73e+00

Pos logit contributions
 de+0.078
 lear+0.073
bows+0.069
ives+0.068
rav+0.066
Neg logit contributions
lf-0.070
 site-0.064
 me-0.060
?"-0.055
 my-0.054
TokenYou
Token ID1639
Feature activation-3.11e+00

Pos logit contributions
 de+0.154
isc+0.144
 lear+0.131
ives+0.122
joy+0.122
Neg logit contributions
lf-0.149
 myself-0.146
izing-0.122
ggy-0.121
 site-0.120
Token are
Token ID389
Feature activation-9.39e-01

Pos logit contributions
 de+0.058
 lear+0.050
joy+0.049
rav+0.039
ksh+0.039
Neg logit contributions
ered-0.078
lf-0.078
 me-0.076
 happen-0.075
?"-0.075
Token very
Token ID845
Feature activation-3.46e-01

Pos logit contributions
 de+0.023
 lear+0.022
ives+0.020
rav+0.018
bows+0.018
Neg logit contributions
lf-0.019
 site-0.017
ered-0.016
 me-0.016
 avail-0.015
Token smart
Token ID4451
Feature activation-7.84e-01

Pos logit contributions
 de+0.008
 lear+0.008
ives+0.007
joy+0.007
bows+0.007
Neg logit contributions
lf-0.007
 site-0.006
ered-0.006
 me-0.006
izing-0.006
Token and
Token ID290
Feature activation-4.88e-02

Pos logit contributions
 de+0.017
 lear+0.016
ives+0.015
rav+0.014
Che+0.014
Neg logit contributions
lf-0.017
 site-0.015
 me-0.015
ered-0.014
?"-0.014
Token Ben
Token ID3932
Feature activation-1.29e+00

Pos logit contributions
 de+0.027
 lear+0.025
ives+0.024
bows+0.023
rav+0.022
Neg logit contributions
lf-0.023
 me-0.019
 site-0.019
 avail-0.018
ered-0.018
Token and
Token ID290
Feature activation-6.80e-01

Pos logit contributions
 de+0.027
 lear+0.024
rav+0.021
ives+0.021
free+0.021
Neg logit contributions
lf-0.030
 me-0.027
 site-0.026
ered-0.025
?"-0.025
Token Lily
Token ID20037
Feature activation-2.80e+00

Pos logit contributions
 de+0.017
 lear+0.015
ives+0.015
bows+0.014
rav+0.013
Neg logit contributions
lf-0.014
 site-0.012
ered-0.012
 me-0.011
 avail-0.011
Token say
Token ID910
Feature activation-2.29e+00

Pos logit contributions
 de+0.083
 lear+0.075
joy+0.074
rav+0.071
pre+0.071
Neg logit contributions
lf-0.038
 me-0.036
 happen-0.034
ered-0.034
 first-0.034
Token.
Token ID13
Feature activation-3.56e+00

Pos logit contributions
 de+0.066
 lear+0.061
ives+0.058
bows+0.055
joy+0.055
Neg logit contributions
lf-0.035
ered-0.031
 happen-0.028
 site-0.028
 wrong-0.027
Token "
Token ID366
Feature activation-6.73e+00

Pos logit contributions
 de+0.080
 lear+0.077
ives+0.070
bows+0.070
rav+0.067
Neg logit contributions
lf-0.071
 me-0.067
 site-0.065
?"-0.061
 first-0.060
TokenWe
Token ID1135
Feature activation-3.82e+00

Pos logit contributions
 de+0.159
isc+0.139
 lear+0.133
joy+0.129
ashed+0.113
Neg logit contributions
lf-0.146
 myself-0.137
 site-0.123
ggy-0.120
 happen-0.116
Token took
Token ID1718
Feature activation+1.34e-01

Pos logit contributions
 de+0.101
joy+0.084
 lear+0.081
ives+0.077
rav+0.075
Neg logit contributions
ered-0.073
lf-0.069
?"-0.067
 me-0.066
 first-0.059
Token pictures
Token ID5986
Feature activation-1.29e+00

Pos logit contributions
lf+0.003
 site+0.002
ered+0.002
 me+0.002
 avail+0.002
Neg logit contributions
 de-0.003
 lear-0.003
ives-0.003
bows-0.003
joy-0.003
Token of
Token ID286
Feature activation-1.58e+00

Pos logit contributions
 de+0.029
 lear+0.026
ives+0.023
joy+0.023
rav+0.022
Neg logit contributions
lf-0.030
 site-0.026
ered-0.025
 me-0.024
izing-0.024
Token many
Token ID867
Feature activation-1.05e-01

Pos logit contributions
 de+0.043
 lear+0.039
ives+0.036
 love+0.034
bows+0.034
Neg logit contributions
lf-0.030
 site-0.025
ered-0.024
 avail-0.023
izing-0.023
Token fish
Token ID5916
Feature activation-3.45e-01

Pos logit contributions
 de+0.028
 lear+0.025
ives+0.023
bows+0.021
rav+0.021
Neg logit contributions
lf-0.028
 site-0.024
ered-0.023
 avail-0.022
izing-0.022
Token!"
Token ID2474
Feature activation-9.77e-01

Pos logit contributions
 de+0.010
 lear+0.009
ives+0.009
 love+0.008
bows+0.008
Neg logit contributions
lf-0.006
 site-0.005
ered-0.005
 avail-0.004
izing-0.004
Token Lily
Token ID20037
Feature activation-1.77e+00

Pos logit contributions
 de+0.026
 lear+0.024
ives+0.023
bows+0.022
rav+0.022
Neg logit contributions
lf-0.018
 site-0.015
 me-0.015
 avail-0.014
ered-0.014
Token said
Token ID531
Feature activation-2.05e+00

Pos logit contributions
 de+0.052
 lear+0.048
ives+0.045
rav+0.044
joy+0.044
Neg logit contributions
lf-0.028
 site-0.022
ered-0.021
 me-0.021
izing-0.019
Token.
Token ID13
Feature activation-3.87e+00

Pos logit contributions
 de+0.055
 lear+0.050
ives+0.046
bows+0.044
rav+0.043
Neg logit contributions
lf-0.038
 site-0.032
ered-0.032
izing-0.030
 avail-0.030
Token "
Token ID366
Feature activation-6.72e+00

Pos logit contributions
 de+0.090
 lear+0.087
bows+0.083
rav+0.082
ives+0.079
Neg logit contributions
lf-0.069
 me-0.067
 site-0.063
 first-0.058
?"-0.057
TokenYou
Token ID1639
Feature activation-2.96e+00

Pos logit contributions
 de+0.153
isc+0.140
 lear+0.133
joy+0.126
ives+0.120
Neg logit contributions
lf-0.148
 myself-0.142
ggy-0.122
 site-0.119
ble-0.118
Token are
Token ID389
Feature activation-8.31e-01

Pos logit contributions
 de+0.055
 lear+0.049
joy+0.047
rav+0.038
ksh+0.037
Neg logit contributions
ered-0.076
lf-0.075
 me-0.073
 happen-0.072
?"-0.071
Token very
Token ID845
Feature activation-1.27e-01

Pos logit contributions
 de+0.020
 lear+0.019
ives+0.018
rav+0.016
bows+0.016
Neg logit contributions
lf-0.017
 site-0.015
ered-0.015
 me-0.014
 avail-0.014
Token kind
Token ID1611
Feature activation+3.02e-02

Pos logit contributions
 de+0.003
 lear+0.003
ives+0.003
joy+0.002
bows+0.002
Neg logit contributions
lf-0.003
 site-0.002
ered-0.002
izing-0.002
 avail-0.002
Token and
Token ID290
Feature activation+2.39e-01

Pos logit contributions
lf+0.001
 site+0.001
 me+0.001
ered+0.001
izing+0.001
Neg logit contributions
 de-0.001
 lear-0.001
ives-0.001
rav-0.001
Whe-0.001
Token names
Token ID3891
Feature activation-2.82e-01

Pos logit contributions
lf+0.025
 site+0.022
ered+0.022
 avail+0.021
izing+0.021
Neg logit contributions
 de-0.018
 lear-0.015
ives-0.014
 love-0.013
bows-0.012
Token!"
Token ID2474
Feature activation-1.17e+00

Pos logit contributions
 de+0.008
 lear+0.007
ives+0.007
 love+0.006
bows+0.006
Neg logit contributions
lf-0.005
 site-0.005
ered-0.004
 avail-0.004
izing-0.004
Token Lily
Token ID20037
Feature activation-1.51e+00

Pos logit contributions
 de+0.031
 lear+0.028
ives+0.027
rav+0.026
bows+0.026
Neg logit contributions
lf-0.021
 site-0.018
 me-0.017
 avail-0.017
ered-0.016
Token says
Token ID1139
Feature activation-2.42e+00

Pos logit contributions
 de+0.046
 lear+0.042
ives+0.040
joy+0.038
rav+0.038
Neg logit contributions
lf-0.022
 site-0.018
ered-0.017
 me-0.017
 happen-0.016
Token.
Token ID13
Feature activation-3.87e+00

Pos logit contributions
 de+0.064
 lear+0.058
ives+0.054
bows+0.050
rav+0.050
Neg logit contributions
lf-0.046
 site-0.039
ered-0.039
izing-0.036
 avail-0.036
Token "
Token ID366
Feature activation-6.72e+00

Pos logit contributions
 de+0.090
 lear+0.084
bows+0.079
ives+0.075
rav+0.075
Neg logit contributions
 me-0.069
lf-0.069
 site-0.063
?"-0.062
 first-0.061
TokenHe
Token ID1544
Feature activation+3.49e-01

Pos logit contributions
 de+0.173
isc+0.149
joy+0.140
 lear+0.137
ige+0.131
Neg logit contributions
lf-0.120
 myself-0.119
 me-0.109
 first-0.106
 prove-0.101
Token is
Token ID318
Feature activation+2.76e-01

Pos logit contributions
lf+0.007
 site+0.006
ered+0.006
izing+0.006
 avail+0.006
Neg logit contributions
 de-0.009
 lear-0.008
ives-0.007
joy-0.007
rav-0.007
Token very
Token ID845
Feature activation+8.83e-01

Pos logit contributions
lf+0.006
 site+0.005
ered+0.005
izing+0.005
 avail+0.005
Neg logit contributions
 de-0.007
 lear-0.006
ives-0.006
 love-0.005
joy-0.005
Token smart
Token ID4451
Feature activation-5.41e-01

Pos logit contributions
lf+0.021
 site+0.019
ered+0.019
 avail+0.018
izing+0.018
Neg logit contributions
 de-0.020
 lear-0.016
 love-0.015
ives-0.015
rav-0.014
Token and
Token ID290
Feature activation-2.39e-02

Pos logit contributions
 de+0.010
 lear+0.009
ives+0.008
rav+0.007
joy+0.007
Neg logit contributions
lf-0.014
 site-0.013
ered-0.012
izing-0.012
 me-0.012
TokenWow
Token ID22017
Feature activation+6.24e-01

Pos logit contributions
 de+0.077
ives+0.067
 lear+0.064
isc+0.063
joy+0.059
Neg logit contributions
lf-0.101
izing-0.098
 myself-0.090
 site-0.090
 prove-0.088
Token!"
Token ID2474
Feature activation-2.17e+00

Pos logit contributions
lf+0.010
 site+0.008
ered+0.008
 avail+0.008
izing+0.008
Neg logit contributions
 de-0.019
 lear-0.017
ives-0.016
 love-0.015
rav-0.015
Token Ben
Token ID3932
Feature activation-1.14e+00

Pos logit contributions
 de+0.050
 lear+0.048
ives+0.045
rav+0.043
bows+0.042
Neg logit contributions
lf-0.044
 site-0.039
 me-0.039
ered-0.037
 avail-0.036
Token says
Token ID1139
Feature activation-2.13e+00

Pos logit contributions
 de+0.035
 lear+0.033
ives+0.031
joy+0.030
rav+0.030
Neg logit contributions
lf-0.015
 site-0.012
ered-0.012
 me-0.012
 happen-0.011
Token.
Token ID13
Feature activation-3.50e+00

Pos logit contributions
 de+0.055
 lear+0.050
ives+0.047
bows+0.043
 love+0.043
Neg logit contributions
lf-0.042
 site-0.037
ered-0.036
izing-0.034
 avail-0.034
Token "
Token ID366
Feature activation-6.72e+00

Pos logit contributions
 de+0.086
 lear+0.080
ives+0.075
bows+0.073
 love+0.070
Neg logit contributions
lf-0.068
 site-0.063
 me-0.056
izing-0.052
?"-0.051
TokenThis
Token ID1212
Feature activation+4.86e-01

Pos logit contributions
 de+0.165
 lear+0.152
isc+0.144
joy+0.141
ives+0.135
Neg logit contributions
 myself-0.125
lf-0.122
izing-0.113
 site-0.110
ize-0.106
Token is
Token ID318
Feature activation+2.38e-01

Pos logit contributions
lf+0.009
 site+0.007
ered+0.007
 avail+0.007
 me+0.007
Neg logit contributions
 de-0.013
 lear-0.012
ives-0.012
bows-0.011
rav-0.011
Token a
Token ID257
Feature activation-5.81e-01

Pos logit contributions
lf+0.005
 site+0.005
ered+0.005
 avail+0.004
izing+0.004
Neg logit contributions
 de-0.006
 lear-0.005
ives-0.004
 love-0.004
bows-0.004
Token sharp
Token ID7786
Feature activation-2.93e-01

Pos logit contributions
 de+0.015
 lear+0.014
ives+0.014
bows+0.013
rav+0.012
Neg logit contributions
lf-0.011
 site-0.009
ered-0.009
 me-0.009
 avail-0.009
Token pearl
Token ID43836
Feature activation-2.66e-01

Pos logit contributions
 de+0.008
 lear+0.007
ives+0.007
 love+0.006
joy+0.006
Neg logit contributions
lf-0.005
 site-0.005
ered-0.004
 avail-0.004
izing-0.004
Token,"
Token ID553
Feature activation-5.86e-01

Pos logit contributions
lf+0.026
 site+0.023
 avail+0.023
ered+0.022
izing+0.022
Neg logit contributions
 de-0.037
 lear-0.033
 love-0.030
ives-0.030
 hope-0.029
Token L
Token ID406
Feature activation-7.86e-01

Pos logit contributions
 de+0.012
 lear+0.011
ives+0.010
bows+0.009
rav+0.009
Neg logit contributions
lf-0.015
 site-0.013
ered-0.013
 avail-0.012
 me-0.012
Tokenila
Token ID10102
Feature activation-1.07e+00

Pos logit contributions
 de+0.017
 lear+0.015
ives+0.014
rav+0.013
 love+0.013
Neg logit contributions
lf-0.019
 site-0.017
ered-0.016
 avail-0.016
izing-0.016
Token says
Token ID1139
Feature activation-2.20e+00

Pos logit contributions
 de+0.031
 lear+0.028
ives+0.027
 love+0.025
bows+0.025
Neg logit contributions
lf-0.018
 site-0.015
ered-0.014
 avail-0.013
izing-0.013
Token.
Token ID13
Feature activation-3.59e+00

Pos logit contributions
 de+0.058
 lear+0.053
ives+0.050
bows+0.047
rav+0.046
Neg logit contributions
lf-0.040
 site-0.035
ered-0.034
izing-0.032
 avail-0.032
Token "
Token ID366
Feature activation-6.72e+00

Pos logit contributions
 de+0.082
 lear+0.074
ives+0.070
bows+0.066
rav+0.065
Neg logit contributions
lf-0.078
 site-0.070
 me-0.067
izing-0.063
ered-0.063
TokenNow
Token ID3844
Feature activation+1.52e+00

Pos logit contributions
 de+0.155
isc+0.141
 lear+0.138
joy+0.127
ige+0.122
Neg logit contributions
lf-0.138
 myself-0.132
 site-0.117
ble-0.117
 happen-0.115
Token you
Token ID345
Feature activation-1.81e+00

Pos logit contributions
lf+0.027
 site+0.023
ered+0.022
izing+0.021
 avail+0.021
Neg logit contributions
 de-0.043
 lear-0.038
ives-0.035
 love-0.035
bows-0.034
Token are
Token ID389
Feature activation+9.39e-01

Pos logit contributions
 de+0.040
 lear+0.035
joy+0.033
rav+0.030
ives+0.030
Neg logit contributions
lf-0.041
ered-0.040
 me-0.038
 site-0.037
 happen-0.037
Token warm
Token ID5814
Feature activation-1.14e+00

Pos logit contributions
lf+0.022
 site+0.019
izing+0.019
 avail+0.018
ered+0.018
Neg logit contributions
 de-0.021
 lear-0.018
 love-0.017
 hope-0.016
ives-0.016
Token and
Token ID290
Feature activation+5.44e-02

Pos logit contributions
 de+0.023
 lear+0.021
ives+0.019
rav+0.017
bows+0.016
Neg logit contributions
lf-0.029
 site-0.026
 me-0.025
ered-0.025
izing-0.024
Tokenmm
Token ID3020
Feature activation+1.31e+00

Pos logit contributions
lf+0.027
 site+0.023
ered+0.022
 avail+0.021
izing+0.021
Neg logit contributions
 de-0.036
 lear-0.032
ives-0.030
rav-0.028
bows-0.028
Token!"
Token ID2474
Feature activation-1.10e+00

Pos logit contributions
lf+0.025
 site+0.023
ered+0.021
 avail+0.020
roads+0.020
Neg logit contributions
 de-0.036
 lear-0.029
 love-0.027
rav-0.026
ives-0.026
Token Sue
Token ID26089
Feature activation-1.13e+00

Pos logit contributions
 de+0.030
 lear+0.029
ives+0.027
rav+0.026
bows+0.026
Neg logit contributions
lf-0.019
 site-0.016
ered-0.016
 me-0.015
 avail-0.014
Token said
Token ID531
Feature activation-1.96e+00

Pos logit contributions
 de+0.035
 lear+0.033
ives+0.031
joy+0.030
rav+0.030
Neg logit contributions
lf-0.016
 me-0.012
ered-0.012
 site-0.012
izing-0.010
Token.
Token ID13
Feature activation-3.21e+00

Pos logit contributions
 de+0.054
 lear+0.050
ives+0.047
bows+0.044
rav+0.043
Neg logit contributions
lf-0.033
ered-0.029
 site-0.028
izing-0.026
 happen-0.026
Token "
Token ID366
Feature activation-6.72e+00

Pos logit contributions
 de+0.074
 lear+0.069
ives+0.062
rav+0.060
bows+0.059
Neg logit contributions
lf-0.066
 site-0.061
 me-0.056
izing-0.053
ered-0.052
TokenIt
Token ID1026
Feature activation-2.22e-01

Pos logit contributions
 de+0.163
isc+0.149
joy+0.148
 lear+0.145
ashed+0.130
Neg logit contributions
 myself-0.128
lf-0.125
izing-0.105
 happen-0.099
 site-0.099
Token is
Token ID318
Feature activation-9.73e-02

Pos logit contributions
 de+0.006
 lear+0.005
ives+0.005
 love+0.005
bows+0.005
Neg logit contributions
lf-0.004
 site-0.004
ered-0.004
 avail-0.004
izing-0.004
Token nice
Token ID3621
Feature activation-4.83e-01

Pos logit contributions
 de+0.002
 lear+0.002
ives+0.002
rav+0.002
joy+0.002
Neg logit contributions
lf-0.002
 site-0.002
 me-0.002
ered-0.002
 happen-0.002
Token!"
Token ID2474
Feature activation-2.73e+00

Pos logit contributions
 de+0.012
 lear+0.011
ives+0.010
joy+0.009
rav+0.009
Neg logit contributions
lf-0.010
 site-0.009
 me-0.008
ered-0.008
 happen-0.008
Token
Token ID198
Feature activation-9.38e-01

Pos logit contributions
 de+0.061
 lear+0.060
ives+0.053
bows+0.052
rav+0.051
Neg logit contributions
lf-0.056
 me-0.051
 site-0.050
?"-0.048
ered-0.047
Token is
Token ID318
Feature activation+1.85e+00

Pos logit contributions
lf+0.012
 avail+0.010
 site+0.010
izing+0.010
ered+0.010
Neg logit contributions
 de-0.012
 lear-0.011
ives-0.010
 love-0.009
bows-0.009
Token cool
Token ID3608
Feature activation+3.54e-01

Pos logit contributions
lf+0.038
 site+0.033
ered+0.033
izing+0.031
 avail+0.031
Neg logit contributions
 de-0.047
 lear-0.041
 love-0.037
ives-0.035
 like-0.035
Token.
Token ID13
Feature activation-7.44e-01

Pos logit contributions
lf+0.009
 site+0.008
ered+0.008
 avail+0.008
izing+0.007
Neg logit contributions
 de-0.007
 lear-0.006
ives-0.006
 love-0.005
bows-0.005
Token Let
Token ID3914
Feature activation+1.75e+00

Pos logit contributions
 de+0.019
 lear+0.017
ives+0.016
joy+0.015
bows+0.015
Neg logit contributions
lf-0.015
 site-0.013
ered-0.012
 me-0.012
 avail-0.012
Token me
Token ID502
Feature activation+1.08e+00

Pos logit contributions
lf+0.039
 site+0.036
izing+0.034
web+0.032
ered+0.032
Neg logit contributions
 de-0.037
 love-0.034
're-0.032
 were-0.031
 lear-0.031
Token help
Token ID1037
Feature activation+3.12e+00

Pos logit contributions
lf+0.028
 site+0.025
izing+0.024
 avail+0.024
ered+0.023
Neg logit contributions
 de-0.021
 lear-0.020
 love-0.019
ives-0.018
 hope-0.017
Token you
Token ID345
Feature activation+1.38e+00

Pos logit contributions
lf+0.058
 site+0.058
ered+0.056
roads+0.052
 avail+0.051
Neg logit contributions
 de-0.071
 love-0.071
 lear-0.065
ives-0.061
 like-0.061
Token."
Token ID526
Feature activation-7.86e-01

Pos logit contributions
lf+0.034
 site+0.030
 avail+0.030
ths+0.028
ered+0.028
Neg logit contributions
 de-0.028
 lear-0.026
 love-0.026
 like-0.024
ives-0.024
Token
Token ID198
Feature activation-1.52e-01

Pos logit contributions
 de+0.015
 lear+0.014
ives+0.012
rav+0.011
bows+0.011
Neg logit contributions
lf-0.021
 site-0.019
ered-0.018
 avail-0.017
izing-0.017
Token
Token ID198
Feature activation+1.40e+00

Pos logit contributions
 de+0.003
 lear+0.002
ives+0.002
 love+0.002
rav+0.002
Neg logit contributions
lf-0.004
 site-0.004
ered-0.004
 avail-0.004
izing-0.004
TokenHe
Token ID1544
Feature activation+7.98e-01

Pos logit contributions
lf+0.040
 site+0.036
ered+0.035
izing+0.034
 me+0.034
Neg logit contributions
 de-0.025
 lear-0.021
ives-0.019
bows-0.017
joy-0.017
Token "
Token ID366
Feature activation-4.93e+00

Pos logit contributions
 de+0.047
 lear+0.044
ives+0.039
bows+0.039
rav+0.038
Neg logit contributions
lf-0.046
 me-0.041
 site-0.041
 first-0.037
 wrong-0.037
TokenCan
Token ID6090
Feature activation+1.89e+00

Pos logit contributions
 de+0.118
isc+0.106
joy+0.104
're+0.095
 lear+0.094
Neg logit contributions
lf-0.089
 myself-0.085
 site-0.083
 first-0.079
izing-0.078
Token you
Token ID345
Feature activation+1.50e-01

Pos logit contributions
lf+0.040
ered+0.036
 site+0.036
 avail+0.034
izing+0.033
Neg logit contributions
 de-0.045
 lear-0.039
 love-0.037
rav-0.035
 rib-0.035
Token help
Token ID1037
Feature activation+3.64e+00

Pos logit contributions
lf+0.004
 site+0.003
izing+0.003
 avail+0.003
ered+0.003
Neg logit contributions
 de-0.003
 lear-0.003
 love-0.003
 like-0.003
 hope-0.003
Token me
Token ID502
Feature activation+3.61e+00

Pos logit contributions
lf+0.056
roads+0.054
 site+0.052
ered+0.050
 avail+0.048
Neg logit contributions
 love-0.091
 de-0.088
 like-0.086
 lear-0.075
 hope-0.074
Token break
Token ID2270
Feature activation+3.46e+00

Pos logit contributions
lf+0.084
 avail+0.076
 site+0.074
izing+0.074
ths+0.068
Neg logit contributions
 love-0.069
 de-0.069
 like-0.067
 lear-0.062
 hope-0.060
Token the
Token ID262
Feature activation+4.18e-02

Pos logit contributions
lf+0.075
 site+0.064
 avail+0.062
ered+0.061
izing+0.058
Neg logit contributions
 de-0.082
 love-0.065
 lear-0.065
ives-0.062
 like-0.061
Token eggs
Token ID9653
Feature activation+1.18e+00

Pos logit contributions
lf+0.001
 site+0.001
ered+0.001
 me+0.001
 happen+0.001
Neg logit contributions
 de-0.001
 lear-0.001
ives-0.001
rav-0.001
joy-0.001
Token and
Token ID290
Feature activation+1.60e+00

Pos logit contributions
lf+0.022
 site+0.018
 avail+0.018
ered+0.018
ggy+0.017
Neg logit contributions
 de-0.030
 like-0.027
're-0.027
 love-0.026
ives-0.025
Token put
Token ID1234
Feature activation-1.95e+00

Pos logit contributions
lf+0.035
 site+0.031
ered+0.030
izing+0.029
 avail+0.029
Neg logit contributions
 de-0.038
 lear-0.033
 love-0.030
ives-0.030
 hope-0.028
Token them
Token ID606
Feature activation-1.51e+00

Pos logit contributions
 de+0.051
 lear+0.049
ives+0.046
rav+0.043
joy+0.043
Neg logit contributions
lf-0.035
ered-0.031
 site-0.031
 me-0.030
izing-0.028
Token friend
Token ID1545
Feature activation+2.07e+00

Pos logit contributions
lf+0.027
ered+0.025
 site+0.024
izing+0.024
 avail+0.023
Neg logit contributions
 de-0.028
 lear-0.024
 love-0.022
ives-0.021
joy-0.021
Token,
Token ID11
Feature activation-3.73e-02

Pos logit contributions
lf+0.050
 site+0.046
izing+0.045
ered+0.044
 avail+0.042
Neg logit contributions
 de-0.044
 lear-0.038
ives-0.035
 love-0.034
joy-0.033
Token Sally
Token ID25737
Feature activation-1.53e-01

Pos logit contributions
 de+0.001
 lear+0.001
ives+0.001
rav+0.001
bows+0.001
Neg logit contributions
lf-0.001
 site-0.001
 me-0.001
ered-0.001
 avail-0.001
Token.
Token ID13
Feature activation+8.52e-01

Pos logit contributions
 de+0.004
 lear+0.004
ives+0.004
 love+0.003
joy+0.003
Neg logit contributions
lf-0.003
 site-0.002
ered-0.002
 avail-0.002
izing-0.002
Token
Token ID198
Feature activation+1.63e+00

Pos logit contributions
lf+0.020
 site+0.018
ered+0.017
 me+0.017
 avail+0.017
Neg logit contributions
 de-0.018
 lear-0.017
ives-0.015
rav-0.014
bows-0.014
Token
Token ID198
Feature activation+3.20e+00

Pos logit contributions
lf+0.047
 site+0.043
ered+0.042
 avail+0.041
izing+0.041
Neg logit contributions
 de-0.027
 lear-0.023
ives-0.021
 love-0.018
rav-0.018
Token"
Token ID1
Feature activation-1.72e+00

Pos logit contributions
lf+0.061
 site+0.052
ered+0.050
izing+0.047
 avail+0.047
Neg logit contributions
 de-0.086
 lear-0.078
ives-0.073
bows-0.068
joy-0.068
TokenHi
Token ID17250
Feature activation+1.16e+00

Pos logit contributions
 de+0.046
 lear+0.039
ives+0.036
bows+0.034
ushes+0.034
Neg logit contributions
lf-0.036
 me-0.032
 avail-0.029
 site-0.027
?"-0.027
Token Sally
Token ID25737
Feature activation-2.46e-01

Pos logit contributions
lf+0.024
 site+0.021
ered+0.020
izing+0.020
 avail+0.019
Neg logit contributions
 de-0.029
 lear-0.025
ives-0.024
joy-0.023
 love-0.023
Token,
Token ID11
Feature activation-1.11e+00

Pos logit contributions
 de+0.007
 lear+0.007
ives+0.007
rav+0.006
bows+0.006
Neg logit contributions
lf-0.004
 site-0.003
ered-0.003
izing-0.003
 me-0.003
Token do
Token ID466
Feature activation+1.28e+00

Pos logit contributions
 de+0.030
 lear+0.028
ives+0.026
bows+0.025
joy+0.025
Neg logit contributions
lf-0.020
 me-0.018
 happen-0.017
 my-0.016
ered-0.016
Token to
Token ID284
Feature activation+2.59e+00

Pos logit contributions
lf+0.117
izing+0.113
ered+0.109
ering+0.106
 site+0.106
Neg logit contributions
 love-0.043
 lear-0.037
 de-0.037
ives-0.034
 like-0.033
Token go
Token ID467
Feature activation+6.06e+00

Pos logit contributions
lf+0.069
izing+0.063
 site+0.063
ered+0.060
ering+0.058
Neg logit contributions
 lear-0.044
 de-0.043
 love-0.042
ives-0.038
 like-0.036
Token play
Token ID711
Feature activation+2.44e+00

Pos logit contributions
lf+0.179
izing+0.164
 site+0.156
ered+0.155
 avail+0.153
Neg logit contributions
 like-0.087
 love-0.083
 lear-0.083
 de-0.083
ives-0.073
Token at
Token ID379
Feature activation+1.02e+00

Pos logit contributions
lf+0.055
ered+0.055
 avail+0.051
 site+0.050
izing+0.050
Neg logit contributions
 lear-0.046
 de-0.044
 like-0.042
 love-0.039
ives-0.039
Token the
Token ID262
Feature activation+3.13e-01

Pos logit contributions
ered+0.026
lf+0.026
izing+0.026
ering+0.025
eking+0.024
Neg logit contributions
 de-0.016
 lear-0.015
 love-0.014
 Whe-0.013
 hope-0.013
Token gym
Token ID11550
Feature activation+4.17e+00

Pos logit contributions
lf+0.009
ered+0.009
 avail+0.008
izing+0.008
ering+0.008
Neg logit contributions
 de-0.005
 lear-0.005
ives-0.004
 love-0.004
Whe-0.004
Token?"
Token ID1701
Feature activation+9.53e-01

Pos logit contributions
lf+0.080
ered+0.076
 avail+0.072
izing+0.067
 site+0.066
Neg logit contributions
 de-0.104
 lear-0.099
ives-0.089
bows-0.084
 like-0.084
Token
Token ID198
Feature activation+2.21e+00

Pos logit contributions
lf+0.018
 site+0.016
ered+0.015
izing+0.015
 avail+0.014
Neg logit contributions
 de-0.025
 lear-0.023
ives-0.022
joy-0.020
 love-0.020
Token
Token ID198
Feature activation+3.42e+00

Pos logit contributions
lf+0.066
 site+0.061
ered+0.061
izing+0.058
 avail+0.058
Neg logit contributions
 de-0.034
 lear-0.028
ives-0.026
 love-0.023
joy-0.022
TokenJack
Token ID14295
Feature activation+1.04e+00

Pos logit contributions
lf+0.076
 site+0.068
ered+0.067
 avail+0.064
izing+0.063
Neg logit contributions
 de-0.078
 lear-0.071
ives-0.066
 love-0.061
joy-0.060
Token said
Token ID531
Feature activation-2.74e-01

Pos logit contributions
lf+0.021
 site+0.020
 avail+0.019
izing+0.018
 happen+0.017
Neg logit contributions
 de-0.026
 lear-0.023
 love-0.021
ives-0.021
 hope-0.020
Token on
Token ID319
Feature activation+1.22e-02

Pos logit contributions
 de+0.012
 lear+0.011
ives+0.010
bows+0.009
rav+0.009
Neg logit contributions
lf-0.012
 site-0.010
ered-0.010
 avail-0.009
izing-0.009
Token the
Token ID262
Feature activation+3.91e-01

Pos logit contributions
lf+0.000
 site+0.000
ered+0.000
izing+0.000
 avail+0.000
Neg logit contributions
 de-0.000
 lear-0.000
ives-0.000
 love-0.000
joy-0.000
Token door
Token ID3420
Feature activation+2.41e+00

Pos logit contributions
lf+0.005
 site+0.003
ered+0.003
 avail+0.003
izing+0.003
Neg logit contributions
 de-0.013
 lear-0.012
ives-0.012
 love-0.011
joy-0.011
Token.
Token ID13
Feature activation+8.03e-01

Pos logit contributions
lf+0.063
 avail+0.054
 site+0.054
izing+0.053
ize+0.053
Neg logit contributions
 de-0.047
 lear-0.038
ives-0.035
 love-0.035
 like-0.034
Token
Token ID198
Feature activation+1.43e+00

Pos logit contributions
lf+0.021
 site+0.019
ered+0.018
 avail+0.018
izing+0.018
Neg logit contributions
 de-0.016
 lear-0.014
ives-0.012
 love-0.011
bows-0.011
Token
Token ID198
Feature activation+2.99e+00

Pos logit contributions
lf+0.043
 site+0.039
 avail+0.039
ered+0.039
izing+0.038
Neg logit contributions
 de-0.022
 lear-0.018
ives-0.017
 love-0.014
rav-0.014
TokenWhen
Token ID2215
Feature activation+1.15e+00

Pos logit contributions
lf+0.071
 site+0.063
ered+0.062
 avail+0.060
izing+0.059
Neg logit contributions
 de-0.065
 lear-0.058
ives-0.054
 love-0.049
rav-0.048
Token his
Token ID465
Feature activation+1.74e+00

Pos logit contributions
lf+0.028
ered+0.024
izing+0.024
 site+0.024
 myself+0.023
Neg logit contributions
 de-0.024
 lear-0.022
joy-0.018
 love-0.018
 hope-0.017
Token mom
Token ID1995
Feature activation+1.33e+00

Pos logit contributions
lf+0.044
izing+0.041
 site+0.039
ered+0.038
 avail+0.038
Neg logit contributions
 de-0.034
 lear-0.029
 love-0.028
joy-0.026
 hope-0.026
Token answered
Token ID9373
Feature activation+1.76e+00

Pos logit contributions
lf+0.032
 site+0.030
 avail+0.030
izing+0.030
 happen+0.028
Neg logit contributions
 de-0.027
 lear-0.023
ives-0.023
 love-0.020
isc-0.019
Token,
Token ID11
Feature activation+6.17e-01

Pos logit contributions
lf+0.033
 site+0.028
 avail+0.027
ered+0.026
izing+0.026
Neg logit contributions
 de-0.047
 lear-0.043
ives-0.038
 love-0.038
joy-0.037
TokenOkay
Token ID16454
Feature activation+1.99e+00

Pos logit contributions
 de+0.076
 lear+0.060
ives+0.057
joy+0.055
bows+0.054
Neg logit contributions
lf-0.075
izing-0.071
 me-0.069
 myself-0.068
 prove-0.066
Token,
Token ID11
Feature activation-2.27e-01

Pos logit contributions
lf+0.040
 site+0.035
ered+0.033
izing+0.031
 avail+0.030
Neg logit contributions
 de-0.052
 lear-0.046
 love-0.041
joy-0.041
rav-0.039
Token you
Token ID345
Feature activation-1.97e+00

Pos logit contributions
 de+0.006
 lear+0.006
ives+0.006
bows+0.005
joy+0.005
Neg logit contributions
lf-0.004
 site-0.003
ered-0.003
 me-0.003
izing-0.003
Token can
Token ID460
Feature activation+2.00e+00

Pos logit contributions
 de+0.038
 lear+0.032
ives+0.028
joy+0.027
bows+0.026
Neg logit contributions
lf-0.052
ered-0.048
 site-0.047
 me-0.045
 avail-0.044
Token have
Token ID423
Feature activation+4.96e-01

Pos logit contributions
lf+0.048
 site+0.042
izing+0.040
ered+0.040
 avail+0.039
Neg logit contributions
 de-0.043
 lear-0.039
ives-0.036
 love-0.035
bows-0.033
Token the
Token ID262
Feature activation+3.03e-02

Pos logit contributions
lf+0.011
 site+0.010
izing+0.009
ered+0.009
web+0.008
Neg logit contributions
 de-0.012
 love-0.011
 lear-0.010
 hope-0.010
joy-0.009
Token shell
Token ID7582
Feature activation+1.17e+00

Pos logit contributions
lf+0.001
 site+0.000
ered+0.000
 avail+0.000
 me+0.000
Neg logit contributions
 de-0.001
 lear-0.001
ives-0.001
bows-0.001
rav-0.001
Token if
Token ID611
Feature activation+2.27e+00

Pos logit contributions
lf+0.024
 site+0.021
ered+0.020
 avail+0.020
izing+0.019
Neg logit contributions
 de-0.030
 lear-0.027
ives-0.025
 love-0.023
bows-0.023
Token you
Token ID345
Feature activation-1.46e+00

Pos logit contributions
lf+0.043
 site+0.037
izing+0.033
ered+0.032
ation+0.030
Neg logit contributions
 de-0.062
 love-0.052
 lear-0.052
rav-0.049
 hope-0.049
Token give
Token ID1577
Feature activation+2.06e+00

Pos logit contributions
 de+0.031
 lear+0.026
joy+0.024
ives+0.023
bows+0.022
Neg logit contributions
lf-0.034
ered-0.032
 site-0.031
 me-0.031
 happen-0.030
Token me
Token ID502
Feature activation+1.05e+00

Pos logit contributions
lf+0.043
 site+0.037
izing+0.032
 avail+0.032
ered+0.031
Neg logit contributions
 de-0.053
 love-0.046
 lear-0.045
 hope-0.043
ives-0.043
Token
Token ID198
Feature activation+1.15e+00

Pos logit contributions
 de+0.009
 lear+0.008
ives+0.007
rav+0.007
bows+0.007
Neg logit contributions
lf-0.011
 site-0.010
 me-0.010
ered-0.010
izing-0.009
Token
Token ID198
Feature activation+2.75e+00

Pos logit contributions
lf+0.035
 site+0.032
 avail+0.032
izing+0.032
ered+0.032
Neg logit contributions
 de-0.017
 lear-0.013
ives-0.012
 love-0.011
joy-0.010
TokenBut
Token ID1537
Feature activation+1.67e+00

Pos logit contributions
lf+0.065
 site+0.059
ered+0.057
 avail+0.057
 happen+0.055
Neg logit contributions
 de-0.058
 lear-0.053
ives-0.048
 love-0.045
Thank-0.043
Token then
Token ID788
Feature activation+1.31e+00

Pos logit contributions
lf+0.031
 site+0.028
izing+0.025
ered+0.024
 avail+0.024
Neg logit contributions
 de-0.045
 lear-0.042
 love-0.037
joy-0.036
 hope-0.036
Token,
Token ID11
Feature activation+8.04e-01

Pos logit contributions
lf+0.025
 site+0.022
izing+0.020
ered+0.020
 happen+0.019
Neg logit contributions
 de-0.035
 lear-0.031
 love-0.028
joy-0.028
ives-0.028
Token she
Token ID673
Feature activation+9.79e-01

Pos logit contributions
lf+0.016
 site+0.013
ered+0.013
 avail+0.013
 me+0.012
Neg logit contributions
 de-0.021
 lear-0.019
ives-0.019
rav-0.017
bows-0.017
Token saw
Token ID2497
Feature activation+2.90e-01

Pos logit contributions
lf+0.024
 site+0.022
ered+0.021
izing+0.021
 avail+0.021
Neg logit contributions
 de-0.020
 lear-0.018
ives-0.016
 love-0.015
joy-0.015
Token some
Token ID617
Feature activation-2.15e-01

Pos logit contributions
lf+0.007
 site+0.006
ered+0.006
izing+0.006
 avail+0.006
Neg logit contributions
 de-0.006
 lear-0.005
ives-0.005
 love-0.005
joy-0.004
Token funny
Token ID8258
Feature activation-2.40e-01

Pos logit contributions
 de+0.005
 lear+0.005
ives+0.005
rav+0.004
bows+0.004
Neg logit contributions
lf-0.004
 site-0.004
ered-0.004
izing-0.003
 avail-0.003
Token-
Token ID12
Feature activation-1.03e+00

Pos logit contributions
 de+0.006
 lear+0.006
ives+0.006
rav+0.005
Thank+0.005
Neg logit contributions
lf-0.004
 site-0.004
 me-0.004
ered-0.004
 happen-0.003
Tokenlooking
Token ID11534
Feature activation+3.52e-02

Pos logit contributions
 de+0.024
 lear+0.022
joy+0.020
rav+0.019
ives+0.019
Neg logit contributions
lf-0.021
ered-0.019
 site-0.019
 me-0.018
ggy-0.018
Token enjoyed
Token ID8359
Feature activation-1.61e+00

Pos logit contributions
 de+0.029
ives+0.027
 lear+0.026
joy+0.023
rav+0.022
Neg logit contributions
lf-0.029
ered-0.025
 me-0.024
 happen-0.023
 site-0.023
Token the
Token ID262
Feature activation-1.34e+00

Pos logit contributions
 de+0.044
ives+0.043
 lear+0.041
isc+0.038
bows+0.037
Neg logit contributions
lf-0.028
 site-0.022
 happen-0.022
ered-0.022
 me-0.021
Token lemon
Token ID18873
Feature activation-1.99e+00

Pos logit contributions
 de+0.034
 lear+0.033
ives+0.032
joy+0.028
rav+0.028
Neg logit contributions
lf-0.025
 happen-0.021
 me-0.021
 site-0.021
 wrong-0.020
Token juice
Token ID13135
Feature activation-2.40e+00

Pos logit contributions
 de+0.051
 lear+0.048
ives+0.043
joy+0.042
rav+0.041
Neg logit contributions
lf-0.040
 site-0.034
ered-0.032
izing-0.031
 happen-0.031
Token together
Token ID1978
Feature activation-8.61e-01

Pos logit contributions
 de+0.061
 lear+0.055
ives+0.051
joy+0.048
rav+0.048
Neg logit contributions
lf-0.049
 site-0.042
ered-0.040
izing-0.039
 happen-0.038
Token.
Token ID13
Feature activation-3.05e-01

Pos logit contributions
 de+0.021
 lear+0.019
ives+0.018
 love+0.017
bows+0.016
Neg logit contributions
lf-0.018
 site-0.016
ered-0.015
 avail-0.014
izing-0.014
Token<|endoftext|>
Token ID50256
Feature activation-1.85e+00

Pos logit contributions
 de+0.006
 lear+0.006
ives+0.005
bows+0.005
rav+0.005
Neg logit contributions
lf-0.007
 site-0.007
 me-0.006
ered-0.006
 avail-0.006
TokenOne
Token ID3198
Feature activation-1.53e-01

Pos logit contributions
 de+0.039
 lear+0.037
ives+0.031
rav+0.029
bows+0.029
Neg logit contributions
lf-0.045
 site-0.041
 me-0.038
ered-0.038
?"-0.036
Token day
Token ID1110
Feature activation+1.56e+00

Pos logit contributions
 de+0.003
 lear+0.003
 love+0.003
ives+0.002
rav+0.002
Neg logit contributions
lf-0.004
 site-0.004
izing-0.003
ered-0.003
 avail-0.003
Token,
Token ID11
Feature activation-3.68e-01

Pos logit contributions
lf+0.026
 site+0.022
ered+0.021
 avail+0.020
izing+0.020
Neg logit contributions
 de-0.045
 lear-0.041
ives-0.039
 love-0.037
rav-0.037
Token a
Token ID257
Feature activation-1.59e+00

Pos logit contributions
 de+0.008
 lear+0.008
ives+0.007
bows+0.007
joy+0.006
Neg logit contributions
lf-0.008
 site-0.007
ered-0.007
 me-0.007
izing-0.007
Token we
Token ID356
Feature activation-3.31e+00

Pos logit contributions
 de+0.028
ives+0.025
 lear+0.025
bows+0.024
joy+0.023
Neg logit contributions
lf-0.018
 me-0.015
 site-0.014
 my-0.013
 avail-0.013
Token love
Token ID1842
Feature activation+1.77e+00

Pos logit contributions
 de+0.058
 lear+0.045
joy+0.043
ives+0.040
pre+0.037
Neg logit contributions
 me-0.088
ered-0.087
lf-0.086
?"-0.085
 wrong-0.085
Token you
Token ID345
Feature activation-2.06e+00

Pos logit contributions
lf+0.025
ered+0.020
 site+0.020
 avail+0.018
izing+0.018
Neg logit contributions
 de-0.057
 lear-0.051
 love-0.049
ives-0.047
rav-0.046
Token,
Token ID11
Feature activation-1.22e+00

Pos logit contributions
 de+0.055
 lear+0.049
ives+0.046
 love+0.044
rav+0.042
Neg logit contributions
lf-0.041
 site-0.035
ered-0.034
 avail-0.033
izing-0.032
Token we
Token ID356
Feature activation-3.40e+00

Pos logit contributions
 de+0.034
ives+0.033
 lear+0.032
bows+0.031
free+0.029
Neg logit contributions
lf-0.019
 me-0.018
 my-0.017
 first-0.016
 site-0.015
Token love
Token ID1842
Feature activation+1.62e+00

Pos logit contributions
 de+0.055
 lear+0.043
joy+0.042
ives+0.037
pre+0.036
Neg logit contributions
 me-0.093
lf-0.092
ered-0.091
?"-0.091
 wrong-0.090
Token you
Token ID345
Feature activation-2.18e+00

Pos logit contributions
lf+0.023
 site+0.018
ered+0.018
 avail+0.016
izing+0.016
Neg logit contributions
 de-0.052
 lear-0.047
 love-0.045
ives-0.044
rav-0.043
Token."
Token ID526
Feature activation-7.98e-01

Pos logit contributions
 de+0.058
 lear+0.052
ives+0.048
 love+0.046
bows+0.045
Neg logit contributions
lf-0.043
 site-0.037
ered-0.036
 avail-0.034
izing-0.034
Token They
Token ID1119
Feature activation-2.42e+00

Pos logit contributions
 de+0.016
 lear+0.016
ives+0.014
bows+0.014
rav+0.013
Neg logit contributions
lf-0.018
 me-0.017
 site-0.016
?"-0.015
 my-0.015
Token decided
Token ID3066
Feature activation-2.21e-01

Pos logit contributions
 de+0.063
ives+0.055
bows+0.055
 lear+0.055
joy+0.055
Neg logit contributions
lf-0.047
ered-0.043
 me-0.040
 first-0.038
 replied-0.036
Token to
Token ID284
Feature activation+2.97e-01

Pos logit contributions
 de+0.006
 lear+0.005
ives+0.005
bows+0.004
rav+0.004
Neg logit contributions
lf-0.005
 site-0.004
ered-0.004
izing-0.004
 avail-0.004
Token took
Token ID1718
Feature activation-1.53e+00

Pos logit contributions
lf+0.000
ered+0.000
 site+0.000
 me+0.000
 avail+0.000
Neg logit contributions
 de-0.000
 lear-0.000
ives-0.000
joy-0.000
bows-0.000
Token out
Token ID503
Feature activation-1.96e+00

Pos logit contributions
 de+0.041
 lear+0.039
ives+0.035
joy+0.034
bows+0.034
Neg logit contributions
lf-0.026
ered-0.025
 me-0.023
 site-0.023
 avail-0.022
Token Spot
Token ID15899
Feature activation-1.16e+00

Pos logit contributions
 de+0.055
 lear+0.053
ives+0.050
rav+0.049
Thank+0.048
Neg logit contributions
lf-0.031
 site-0.028
ered-0.027
 me-0.025
 my-0.023
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
 de+0.028
 lear+0.025
ives+0.023
 love+0.021
bows+0.021
Neg logit contributions
lf-0.025
 site-0.022
ered-0.021
 avail-0.021
izing-0.021
Token He
Token ID679
Feature activation+7.06e-01

Pos logit contributions
 de+0.025
 lear+0.024
ives+0.021
bows+0.020
rav+0.020
Neg logit contributions
lf-0.030
 site-0.028
 me-0.027
 avail-0.026
ered-0.025
Token was
Token ID373
Feature activation-9.37e-02

Pos logit contributions
lf+0.015
 site+0.013
ered+0.012
 me+0.012
ggy+0.011
Neg logit contributions
 de-0.018
 lear-0.016
ives-0.015
joy-0.014
rav-0.014
Token happy
Token ID3772
Feature activation-1.97e-01

Pos logit contributions
 de+0.002
 lear+0.002
ives+0.002
 love+0.002
joy+0.002
Neg logit contributions
lf-0.002
 site-0.002
ered-0.002
izing-0.002
 avail-0.002
Token to
Token ID284
Feature activation+3.32e-02

Pos logit contributions
 de+0.005
 lear+0.004
ives+0.004
rav+0.004
bows+0.004
Neg logit contributions
lf-0.004
 site-0.004
ered-0.004
 me-0.004
izing-0.003
Token see
Token ID766
Feature activation-1.75e-01

Pos logit contributions
lf+0.001
izing+0.001
 site+0.001
ths+0.001
ering+0.001
Neg logit contributions
 lear-0.000
 de-0.000
 love-0.000
 hope-0.000
rav-0.000
Token them
Token ID606
Feature activation-1.48e+00

Pos logit contributions
 de+0.004
 lear+0.004
ives+0.004
rav+0.003
joy+0.003
Neg logit contributions
lf-0.004
 site-0.003
ered-0.003
 avail-0.003
 me-0.003
Token and
Token ID290
Feature activation-1.70e-01

Pos logit contributions
 de+0.032
 lear+0.028
ives+0.026
 love+0.024
bows+0.024
Neg logit contributions
lf-0.036
 site-0.032
ered-0.031
 avail-0.030
izing-0.030
Token he
Token ID339
Feature activation+9.21e-01

Pos logit contributions
lf+0.023
 site+0.021
ered+0.020
 avail+0.020
izing+0.019
Neg logit contributions
 de-0.028
 lear-0.025
ives-0.023
 love-0.022
rav-0.021
Token and
Token ID290
Feature activation+7.39e-01

Pos logit contributions
lf+0.022
 site+0.019
ered+0.019
 me+0.018
izing+0.018
Neg logit contributions
 de-0.020
 lear-0.018
ives-0.017
joy-0.015
bows-0.015
Token Amy
Token ID14235
Feature activation-1.33e+00

Pos logit contributions
lf+0.013
 site+0.012
ered+0.011
izing+0.011
 avail+0.011
Neg logit contributions
 de-0.020
 lear-0.018
ives-0.017
 love-0.016
joy-0.016
Token had
Token ID550
Feature activation-2.37e+00

Pos logit contributions
 de+0.029
 lear+0.024
ives+0.023
joy+0.022
rav+0.022
Neg logit contributions
lf-0.032
ered-0.029
 site-0.027
 me-0.027
 happen-0.026
Token lots
Token ID6041
Feature activation-1.61e+00

Pos logit contributions
 de+0.057
 lear+0.052
ives+0.049
rav+0.047
bows+0.045
Neg logit contributions
lf-0.048
 me-0.044
ered-0.043
 site-0.043
 wrong-0.042
Token of
Token ID286
Feature activation-2.35e+00

Pos logit contributions
 de+0.043
 lear+0.040
isc+0.037
rav+0.037
ksh+0.036
Neg logit contributions
lf-0.027
 site-0.022
ered-0.022
 wrong-0.022
 happen-0.022
Token fun
Token ID1257
Feature activation-1.35e+00

Pos logit contributions
 de+0.061
 lear+0.057
ives+0.053
rav+0.051
isc+0.051
Neg logit contributions
lf-0.043
ered-0.037
 me-0.037
 site-0.036
 wrong-0.035
Token playing
Token ID2712
Feature activation-1.42e+00

Pos logit contributions
 de+0.036
 lear+0.032
ives+0.031
rav+0.029
isc+0.028
Neg logit contributions
lf-0.026
 site-0.022
ered-0.021
izing-0.021
 happen-0.020
Token together
Token ID1978
Feature activation-4.96e-01

Pos logit contributions
 de+0.037
 lear+0.033
ives+0.031
rav+0.029
isc+0.028
Neg logit contributions
lf-0.029
 site-0.025
ered-0.024
izing-0.023
 me-0.023
Token in
Token ID287
Feature activation-1.30e+00

Pos logit contributions
 de+0.012
 lear+0.011
ives+0.010
rav+0.010
bows+0.009
Neg logit contributions
lf-0.010
 site-0.009
ered-0.009
izing-0.008
 avail-0.008
Token the
Token ID262
Feature activation-1.42e+00

Pos logit contributions
 de+0.032
 lear+0.029
 love+0.027
joy+0.026
ives+0.026
Neg logit contributions
lf-0.026
ered-0.023
 site-0.023
izing-0.022
 avail-0.021
Token on
Token ID319
Feature activation+2.56e-01

Pos logit contributions
lf+0.056
 site+0.050
ered+0.049
izing+0.049
 avail+0.048
Neg logit contributions
 de-0.049
 lear-0.044
ives-0.040
 love-0.037
joy-0.036
Token a
Token ID257
Feature activation+1.51e+00

Pos logit contributions
lf+0.006
 site+0.006
ered+0.005
 avail+0.005
izing+0.005
Neg logit contributions
 de-0.005
 lear-0.005
ives-0.004
 love-0.004
bows-0.004
Token trip
Token ID5296
Feature activation+2.48e+00

Pos logit contributions
ered+0.043
izing+0.041
 avail+0.039
rums+0.039
lf+0.038
Neg logit contributions
 de-0.026
 lear-0.019
 love-0.018
 fet-0.016
 Whe-0.015
Token with
Token ID351
Feature activation-5.20e-01

Pos logit contributions
lf+0.057
 site+0.050
ered+0.050
 avail+0.049
izing+0.048
Neg logit contributions
 de-0.055
 lear-0.050
ives-0.046
 love-0.043
bows-0.042
Token her
Token ID607
Feature activation-2.42e-01

Pos logit contributions
 de+0.014
 lear+0.012
ives+0.012
bows+0.011
rav+0.011
Neg logit contributions
lf-0.010
 site-0.009
ered-0.009
 avail-0.008
 me-0.008
Token mom
Token ID1995
Feature activation-3.66e-01

Pos logit contributions
 de+0.006
 lear+0.006
ives+0.006
bows+0.005
rav+0.005
Neg logit contributions
lf-0.005
 site-0.004
ered-0.004
 avail-0.004
izing-0.004
Token and
Token ID290
Feature activation+6.62e-01

Pos logit contributions
 de+0.008
 lear+0.008
ives+0.007
rav+0.006
joy+0.006
Neg logit contributions
lf-0.009
 site-0.007
 me-0.007
ered-0.007
 first-0.007
Token dad
Token ID9955
Feature activation-1.13e+00

Pos logit contributions
lf+0.014
 site+0.012
ered+0.011
 me+0.011
 avail+0.011
Neg logit contributions
 de-0.017
 lear-0.015
ives-0.015
bows-0.013
joy-0.013
Token.
Token ID13
Feature activation+7.82e-02

Pos logit contributions
 de+0.032
 lear+0.030
ives+0.028
joy+0.027
bows+0.026
Neg logit contributions
lf-0.019
 site-0.015
ered-0.015
 first-0.015
 me-0.014
Token They
Token ID1119
Feature activation+4.75e-02

Pos logit contributions
lf+0.002
 site+0.002
 me+0.002
ered+0.002
 avail+0.002
Neg logit contributions
 de-0.002
 lear-0.001
ives-0.001
rav-0.001
joy-0.001
Token had
Token ID550
Feature activation-2.15e+00

Pos logit contributions
lf+0.001
ered+0.001
 site+0.001
 me+0.001
 first+0.001
Neg logit contributions
 de-0.001
ives-0.001
 lear-0.001
bows-0.001
joy-0.001
Token "
Token ID366
Feature activation-3.65e+00

Pos logit contributions
 de+0.061
 lear+0.056
ives+0.055
bows+0.051
rav+0.049
Neg logit contributions
lf-0.051
 site-0.044
 me-0.042
izing-0.040
ered-0.040
TokenYes
Token ID5297
Feature activation+9.50e-01

Pos logit contributions
 de+0.079
joy+0.065
 lear+0.064
ives+0.057
bows+0.056
Neg logit contributions
lf-0.078
 myself-0.076
 me-0.075
izing-0.072
 prove-0.068
Token,
Token ID11
Feature activation-7.14e-01

Pos logit contributions
lf+0.020
 site+0.018
ered+0.017
izing+0.016
roads+0.016
Neg logit contributions
 de-0.024
 lear-0.021
 love-0.019
rav-0.019
joy-0.018
Token please
Token ID3387
Feature activation+1.44e+00

Pos logit contributions
 de+0.019
 lear+0.017
ives+0.016
bows+0.015
 love+0.015
Neg logit contributions
lf-0.014
 site-0.012
ered-0.011
 avail-0.011
izing-0.011
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
lf+0.027
 site+0.024
ered+0.023
 avail+0.022
izing+0.022
Neg logit contributions
 de-0.039
 lear-0.035
ives-0.033
 love-0.032
bows-0.031
Token I
Token ID314
Feature activation-2.71e+00

Pos logit contributions
 de+0.043
 lear+0.040
joy+0.037
ives+0.037
bows+0.036
Neg logit contributions
lf-0.027
 my-0.024
 me-0.024
 myself-0.023
 site-0.023
Token want
Token ID765
Feature activation+2.17e+00

Pos logit contributions
 de+0.062
joy+0.051
 lear+0.051
ksh+0.046
ives+0.046
Neg logit contributions
?"-0.054
 prove-0.053
lf-0.053
ered-0.052
 first-0.051
Token to
Token ID284
Feature activation+1.21e+00

Pos logit contributions
lf+0.058
ered+0.051
 site+0.051
izing+0.049
 avail+0.048
Neg logit contributions
 de-0.044
 love-0.036
 lear-0.036
ives-0.033
 hope-0.031
Token play
Token ID711
Feature activation+5.27e-01

Pos logit contributions
lf+0.029
 site+0.026
izing+0.025
ered+0.025
 avail+0.024
Neg logit contributions
 de-0.025
 lear-0.023
 love-0.022
ives-0.021
 hope-0.020
Token with
Token ID351
Feature activation+9.76e-01

Pos logit contributions
lf+0.013
ered+0.013
 avail+0.013
 site+0.012
ths+0.012
Neg logit contributions
 de-0.009
 lear-0.009
 love-0.008
 like-0.008
ives-0.007
Token the
Token ID262
Feature activation-4.75e-01

Pos logit contributions
lf+0.018
 site+0.017
ered+0.016
 avail+0.016
ation+0.014
Neg logit contributions
 de-0.026
 love-0.023
 lear-0.021
 rib-0.020
ives-0.020
Token They
Token ID1119
Feature activation-1.62e+00

Pos logit contributions
 de+0.000
 lear+0.000
ives+0.000
bows+0.000
rav+0.000
Neg logit contributions
lf-0.000
 site-0.000
ered-0.000
 me-0.000
 avail-0.000
Token raced
Token ID32897
Feature activation-1.27e-01

Pos logit contributions
 de+0.039
ives+0.033
 lear+0.033
joy+0.031
bows+0.030
Neg logit contributions
lf-0.037
ered-0.032
 me-0.030
 site-0.029
 first-0.028
Token the
Token ID262
Feature activation+1.48e-01

Pos logit contributions
 de+0.003
 lear+0.003
ives+0.003
bows+0.002
rav+0.002
Neg logit contributions
lf-0.003
 site-0.002
ered-0.002
 me-0.002
 avail-0.002
Token car
Token ID1097
Feature activation-1.63e-01

Pos logit contributions
lf+0.003
 site+0.002
ered+0.002
 avail+0.002
izing+0.002
Neg logit contributions
 de-0.004
 lear-0.004
ives-0.003
 love-0.003
joy-0.003
Token around
Token ID1088
Feature activation-1.74e+00

Pos logit contributions
 de+0.004
 lear+0.003
ives+0.003
bows+0.003
 love+0.003
Neg logit contributions
lf-0.004
 site-0.003
ered-0.003
 avail-0.003
izing-0.003
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
 de+0.048
ives+0.042
 lear+0.042
rav+0.039
bows+0.038
Neg logit contributions
lf-0.033
 site-0.028
ered-0.026
 me-0.025
izing-0.025
Token garden
Token ID11376
Feature activation-1.44e+00

Pos logit contributions
 de+0.033
 lear+0.030
ives+0.028
rav+0.026
bows+0.026
Neg logit contributions
lf-0.023
 site-0.021
 me-0.019
ered-0.019
izing-0.018
Token,
Token ID11
Feature activation-2.65e+00

Pos logit contributions
 de+0.039
 lear+0.035
ives+0.034
rav+0.032
 love+0.031
Neg logit contributions
lf-0.026
 site-0.023
ered-0.021
izing-0.020
 me-0.020
Token but
Token ID475
Feature activation+8.72e-01

Pos logit contributions
 de+0.072
ives+0.067
 lear+0.063
isc+0.061
rav+0.060
Neg logit contributions
lf-0.048
 site-0.040
 me-0.039
 first-0.038
ered-0.037
Token then
Token ID788
Feature activation+8.19e-01

Pos logit contributions
lf+0.019
 site+0.017
ered+0.016
 avail+0.016
izing+0.016
Neg logit contributions
 de-0.020
 lear-0.018
 love-0.016
ives-0.016
rav-0.015
Token Sam
Token ID3409
Feature activation+9.08e-01

Pos logit contributions
lf+0.017
 site+0.015
ered+0.014
izing+0.014
 avail+0.014
Neg logit contributions
 de-0.020
 lear-0.019
ives-0.017
joy-0.016
 love-0.016
Token Emily
Token ID17608
Feature activation-2.16e+00

Pos logit contributions
 de+0.066
 lear+0.061
ives+0.057
rav+0.054
bows+0.053
Neg logit contributions
lf-0.044
 site-0.037
ered-0.036
 me-0.034
 avail-0.034
Token's
Token ID338
Feature activation-1.47e+00

Pos logit contributions
 de+0.054
 lear+0.050
ives+0.046
rav+0.044
bows+0.043
Neg logit contributions
lf-0.043
 site-0.036
ered-0.035
 me-0.034
izing-0.034
Token m
Token ID285
Feature activation+8.67e-01

Pos logit contributions
 de+0.037
 lear+0.036
ives+0.034
bows+0.032
rav+0.030
Neg logit contributions
lf-0.027
 me-0.024
 site-0.023
ered-0.022
 first-0.021
Tokenittens
Token ID34978
Feature activation-1.99e+00

Pos logit contributions
lf+0.017
 site+0.014
ered+0.014
izing+0.013
 me+0.013
Neg logit contributions
 de-0.023
 lear-0.021
ives-0.019
joy-0.018
bows-0.018
Token.
Token ID13
Feature activation-8.17e-01

Pos logit contributions
 de+0.048
 lear+0.042
ives+0.039
 love+0.037
bows+0.037
Neg logit contributions
lf-0.043
 site-0.038
ered-0.037
 avail-0.036
izing-0.035
Token They
Token ID1119
Feature activation-2.02e+00

Pos logit contributions
 de+0.020
 lear+0.019
ives+0.017
pre+0.016
rav+0.016
Neg logit contributions
lf-0.015
 me-0.013
 site-0.013
 avail-0.012
izing-0.012
Token promised
Token ID8072
Feature activation-1.77e-01

Pos logit contributions
 de+0.047
ives+0.042
joy+0.040
 lear+0.039
rav+0.037
Neg logit contributions
lf-0.048
ered-0.042
 me-0.039
 first-0.036
ie-0.036
Token each
Token ID1123
Feature activation-2.00e+00

Pos logit contributions
 de+0.004
 lear+0.004
ives+0.004
rav+0.003
bows+0.003
Neg logit contributions
lf-0.004
 site-0.003
ered-0.003
 avail-0.003
izing-0.003
Token other
Token ID584
Feature activation-2.63e+00

Pos logit contributions
 de+0.057
 lear+0.050
ives+0.049
bows+0.046
joy+0.045
Neg logit contributions
lf-0.035
 site-0.029
ered-0.028
 me-0.027
 avail-0.027
Token that
Token ID326
Feature activation-4.28e-01

Pos logit contributions
 de+0.065
 lear+0.059
ives+0.056
joy+0.053
isc+0.051
Neg logit contributions
lf-0.054
ered-0.047
 site-0.045
 happen-0.044
 wrong-0.043
Token they
Token ID484
Feature activation-2.44e+00

Pos logit contributions
 de+0.011
 lear+0.010
ives+0.010
rav+0.009
bows+0.009
Neg logit contributions
lf-0.008
 site-0.007
ered-0.007
 avail-0.007
 happen-0.007
Token animals
Token ID4695
Feature activation-2.14e+00

Pos logit contributions
 de+0.012
 lear+0.011
ives+0.011
rav+0.010
isc+0.009
Neg logit contributions
lf-0.009
 site-0.008
 me-0.008
ered-0.007
 wrong-0.007
Token happy
Token ID3772
Feature activation+1.36e-01

Pos logit contributions
 de+0.045
 lear+0.040
ives+0.037
rav+0.033
joy+0.033
Neg logit contributions
lf-0.052
ered-0.047
 site-0.046
 happen-0.045
izing-0.044
Token.
Token ID13
Feature activation-1.68e-01

Pos logit contributions
lf+0.003
 site+0.002
ered+0.002
izing+0.002
 happen+0.002
Neg logit contributions
 de-0.003
 lear-0.003
ives-0.003
rav-0.003
joy-0.003
Token<|endoftext|>
Token ID50256
Feature activation-1.31e+00

Pos logit contributions
 lear+0.004
 de+0.004
ives+0.004
bows+0.004
pre+0.003
Neg logit contributions
lf-0.003
 me-0.003
 site-0.003
 myself-0.003
ered-0.002
TokenOnce
Token ID7454
Feature activation+8.67e-02

Pos logit contributions
 de+0.029
 lear+0.028
ives+0.024
rav+0.022
bows+0.022
Neg logit contributions
lf-0.030
 site-0.027
 me-0.026
ered-0.025
?"-0.024
Token upon
Token ID2402
Feature activation-4.18e+00

Pos logit contributions
lf+0.003
 site+0.003
izing+0.003
ered+0.003
 avail+0.003
Neg logit contributions
 de-0.001
 lear-0.001
 love-0.001
joy-0.001
ives-0.001
Token a
Token ID257
Feature activation-2.91e+00

Pos logit contributions
 de+0.109
 lear+0.109
ives+0.102
rav+0.094
isc+0.092
Neg logit contributions
lf-0.078
 me-0.063
ered-0.063
ggy-0.062
 happen-0.060
Token time
Token ID640
Feature activation-3.68e-01

Pos logit contributions
 de+0.075
 lear+0.075
ives+0.073
bows+0.069
rav+0.067
Neg logit contributions
lf-0.050
 me-0.042
 site-0.040
ggy-0.040
 wrong-0.036
Token,
Token ID11
Feature activation-1.12e-01

Pos logit contributions
 de+0.011
 lear+0.010
ives+0.009
 love+0.009
rav+0.009
Neg logit contributions
lf-0.006
 site-0.005
ered-0.005
 avail-0.005
izing-0.005
Token there
Token ID612
Feature activation-1.21e+00

Pos logit contributions
 de+0.003
 lear+0.003
ives+0.002
 love+0.002
joy+0.002
Neg logit contributions
lf-0.002
 site-0.002
 avail-0.002
ered-0.002
izing-0.002
Token was
Token ID373
Feature activation-1.25e+00

Pos logit contributions
 de+0.029
 lear+0.024
 love+0.023
ives+0.022
rav+0.021
Neg logit contributions
lf-0.026
 site-0.023
izing-0.023
 avail-0.023
ered-0.022
Token\n
Token ID198
Feature activation-1.13e-01

Pos logit contributions
 de+0.018
 lear+0.016
ives+0.015
bows+0.014
rav+0.013
Neg logit contributions
lf-0.025
 site-0.022
ered-0.021
 avail-0.021
 me-0.021
Token\n
Token ID198
Feature activation+1.13e+00

Pos logit contributions
 de+0.002
 lear+0.002
ives+0.002
 love+0.001
rav+0.001
Neg logit contributions
lf-0.003
 site-0.003
ered-0.003
 avail-0.003
izing-0.003
TokenBen
Token ID11696
Feature activation-1.18e+00

Pos logit contributions
lf+0.032
 site+0.029
ered+0.028
 me+0.027
izing+0.027
Neg logit contributions
 de-0.020
 lear-0.018
ives-0.016
bows-0.015
joy-0.014
Token nods
Token ID34833
Feature activation-1.41e+00

Pos logit contributions
 de+0.027
 lear+0.026
ives+0.023
free+0.022
rav+0.022
Neg logit contributions
lf-0.025
 me-0.023
 site-0.022
ered-0.022
?"-0.020
Token and
Token ID290
Feature activation-6.48e-01

Pos logit contributions
 de+0.028
 lear+0.024
ives+0.023
bows+0.021
rav+0.020
Neg logit contributions
lf-0.037
 site-0.032
ered-0.032
 me-0.031
 avail-0.031
Token smiles
Token ID21845
Feature activation-4.03e+00

Pos logit contributions
 de+0.018
 lear+0.017
ives+0.017
bows+0.016
rav+0.016
Neg logit contributions
lf-0.010
ered-0.008
 me-0.008
 site-0.008
 happen-0.008
Token.
Token ID13
Feature activation-2.00e+00

Pos logit contributions
 de+0.123
 lear+0.116
ives+0.111
bows+0.105
rav+0.103
Neg logit contributions
lf-0.048
 me-0.045
 wrong-0.042
 first-0.039
ered-0.038
Token He
Token ID679
Feature activation-7.91e-01

Pos logit contributions
 de+0.046
 lear+0.044
ives+0.041
bows+0.039
rav+0.038
Neg logit contributions
lf-0.041
 site-0.037
 me-0.036
 avail-0.033
ered-0.032
Token and
Token ID290
Feature activation-5.74e-01

Pos logit contributions
 lear+0.023
 de+0.023
free+0.021
joy+0.021
rav+0.021
Neg logit contributions
 me-0.010
lf-0.010
ered-0.008
?"-0.008
 site-0.008
Token his
Token ID465
Feature activation-1.22e+00

Pos logit contributions
 de+0.014
 lear+0.013
ives+0.013
bows+0.012
rav+0.012
Neg logit contributions
lf-0.011
 me-0.010
 site-0.009
 avail-0.009
ered-0.009
Token dad
Token ID9955
Feature activation-2.27e+00

Pos logit contributions
 de+0.031
 lear+0.030
ives+0.029
bows+0.027
rav+0.026
Neg logit contributions
lf-0.023
 site-0.019
 me-0.018
ered-0.018
 avail-0.018
Token to
Token ID284
Feature activation+5.17e-01

Pos logit contributions
 de+0.023
 lear+0.021
ives+0.020
rav+0.018
bows+0.018
Neg logit contributions
lf-0.016
 site-0.014
ered-0.014
 me-0.013
 avail-0.013
Token hope
Token ID2911
Feature activation+1.10e+00

Pos logit contributions
lf+0.012
 site+0.010
ered+0.010
izing+0.010
 avail+0.009
Neg logit contributions
 de-0.012
 lear-0.011
ives-0.010
 love-0.010
bows-0.009
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
lf+0.023
 site+0.020
ered+0.019
 avail+0.019
izing+0.018
Neg logit contributions
 de-0.027
 lear-0.024
ives-0.022
 love-0.022
joy-0.021
Token<|endoftext|>
Token ID50256
Feature activation-1.25e+00

Pos logit contributions
lf+0.040
 site+0.036
ered+0.034
 avail+0.033
izing+0.033
Neg logit contributions
 de-0.029
 lear-0.025
ives-0.023
bows-0.021
rav-0.020
TokenOnce
Token ID7454
Feature activation+3.43e-04

Pos logit contributions
 de+0.029
 lear+0.028
ives+0.023
rav+0.023
bows+0.022
Neg logit contributions
lf-0.028
 site-0.024
 me-0.023
ered-0.023
?"-0.022
Token upon
Token ID2402
Feature activation-4.00e+00

Pos logit contributions
lf+0.000
 site+0.000
ered+0.000
izing+0.000
 avail+0.000
Neg logit contributions
 de-0.000
 lear-0.000
ives-0.000
joy-0.000
 love-0.000
Token a
Token ID257
Feature activation-2.82e+00

Pos logit contributions
 lear+0.110
 de+0.109
ives+0.106
rav+0.098
isc+0.096
Neg logit contributions
lf-0.067
 me-0.055
ggy-0.053
 myself-0.051
ered-0.051
Token time
Token ID640
Feature activation-4.02e-01

Pos logit contributions
 de+0.074
 lear+0.074
ives+0.072
bows+0.070
free+0.067
Neg logit contributions
lf-0.048
 me-0.041
ggy-0.038
 site-0.036
 first-0.033
Token,
Token ID11
Feature activation-5.67e-01

Pos logit contributions
 de+0.012
 lear+0.011
ives+0.010
rav+0.009
 love+0.009
Neg logit contributions
lf-0.007
 site-0.006
ered-0.005
 avail-0.005
izing-0.005
Token there
Token ID612
Feature activation-1.33e+00

Pos logit contributions
 de+0.015
 lear+0.013
ives+0.012
 love+0.012
joy+0.012
Neg logit contributions
lf-0.011
 site-0.010
 avail-0.009
ered-0.009
izing-0.009
Token was
Token ID373
Feature activation-1.34e+00

Pos logit contributions
 de+0.033
 lear+0.029
ives+0.027
 love+0.026
rav+0.025
Neg logit contributions
lf-0.027
 site-0.024
ered-0.023
 avail-0.023
izing-0.023
Token back
Token ID736
Feature activation+1.65e+00

Pos logit contributions
lf+0.044
 site+0.038
ered+0.037
izing+0.036
 avail+0.035
Neg logit contributions
 de-0.048
 lear-0.043
ives-0.040
 love-0.037
joy-0.037
Token home
Token ID1363
Feature activation+1.30e+00

Pos logit contributions
lf+0.039
 site+0.034
izing+0.032
ered+0.032
 avail+0.031
Neg logit contributions
 de-0.037
 lear-0.033
ives-0.030
 love-0.029
joy-0.028
Token.
Token ID13
Feature activation+6.54e-01

Pos logit contributions
lf+0.028
 site+0.025
ered+0.024
 avail+0.024
izing+0.023
Neg logit contributions
 de-0.031
 lear-0.027
ives-0.025
 love-0.024
joy-0.023
Token<|endoftext|>
Token ID50256
Feature activation-1.57e+00

Pos logit contributions
lf+0.014
 me+0.013
 site+0.012
ered+0.011
 avail+0.011
Neg logit contributions
 de-0.014
 lear-0.014
ives-0.012
bows-0.012
pre-0.011
TokenOnce
Token ID7454
Feature activation-3.42e-01

Pos logit contributions
 de+0.037
 lear+0.036
ives+0.029
rav+0.028
bows+0.028
Neg logit contributions
lf-0.034
 site-0.031
 me-0.030
ered-0.029
?"-0.028
Token upon
Token ID2402
Feature activation-4.07e+00

Pos logit contributions
 de+0.004
 lear+0.003
ives+0.003
 love+0.002
joy+0.002
Neg logit contributions
lf-0.011
 site-0.011
ered-0.010
izing-0.010
 avail-0.010
Token a
Token ID257
Feature activation-3.14e+00

Pos logit contributions
 de+0.108
 lear+0.106
ives+0.100
rav+0.094
isc+0.092
Neg logit contributions
lf-0.074
 me-0.061
ered-0.059
ggy-0.058
 happen-0.058
Token time
Token ID640
Feature activation-6.02e-01

Pos logit contributions
 de+0.082
 lear+0.081
ives+0.079
bows+0.076
rav+0.073
Neg logit contributions
lf-0.054
 me-0.047
ggy-0.043
 site-0.042
 wrong-0.039
Token,
Token ID11
Feature activation-6.66e-01

Pos logit contributions
 de+0.018
 lear+0.016
ives+0.015
 love+0.014
rav+0.014
Neg logit contributions
lf-0.010
 site-0.008
ered-0.008
 avail-0.007
izing-0.007
Token there
Token ID612
Feature activation-1.35e+00

Pos logit contributions
 de+0.018
 lear+0.016
ives+0.014
 love+0.014
joy+0.014
Neg logit contributions
lf-0.013
 site-0.011
ered-0.011
 avail-0.011
izing-0.010
Token was
Token ID373
Feature activation-1.37e+00

Pos logit contributions
 de+0.033
 lear+0.028
ives+0.026
 love+0.026
rav+0.025
Neg logit contributions
lf-0.029
 site-0.026
ered-0.024
 avail-0.024
izing-0.024
Token toys
Token ID14958
Feature activation-1.29e+00

Pos logit contributions
 de+0.008
 lear+0.007
ives+0.006
rav+0.006
joy+0.006
Neg logit contributions
lf-0.007
 site-0.006
ered-0.005
 avail-0.005
izing-0.005
Token,"
Token ID553
Feature activation-6.29e-01

Pos logit contributions
 de+0.032
 lear+0.028
ives+0.027
 love+0.025
bows+0.025
Neg logit contributions
lf-0.027
 site-0.024
ered-0.023
 avail-0.023
izing-0.022
Token Mom
Token ID11254
Feature activation-1.24e-01

Pos logit contributions
 de+0.015
 lear+0.013
 love+0.012
ives+0.011
joy+0.011
Neg logit contributions
lf-0.014
 site-0.013
ered-0.012
izing-0.012
 avail-0.012
Token said
Token ID531
Feature activation-1.73e+00

Pos logit contributions
 de+0.003
 lear+0.003
ives+0.003
 love+0.002
bows+0.002
Neg logit contributions
lf-0.003
 site-0.002
 avail-0.002
izing-0.002
ered-0.002
Token.
Token ID13
Feature activation-2.32e+00

Pos logit contributions
 de+0.042
 lear+0.037
ives+0.035
joy+0.032
 love+0.032
Neg logit contributions
lf-0.038
 site-0.033
ered-0.031
 avail-0.031
izing-0.030
Token "
Token ID366
Feature activation-4.29e+00

Pos logit contributions
 de+0.049
 lear+0.045
ives+0.041
bows+0.040
rav+0.039
Neg logit contributions
lf-0.053
 site-0.048
 me-0.048
ered-0.045
izing-0.043
TokenThat
Token ID2504
Feature activation+7.08e-01

Pos logit contributions
 de+0.101
isc+0.089
 lear+0.088
ives+0.083
rav+0.081
Neg logit contributions
lf-0.088
izing-0.083
 myself-0.078
 site-0.076
ize-0.074
Token's
Token ID338
Feature activation+5.29e-01

Pos logit contributions
lf+0.013
 site+0.012
ered+0.011
 avail+0.011
izing+0.011
Neg logit contributions
 de-0.019
 lear-0.017
ives-0.016
 love-0.015
bows-0.015
Token not
Token ID407
Feature activation-3.32e-02

Pos logit contributions
lf+0.012
 site+0.011
ered+0.010
 avail+0.010
izing+0.010
Neg logit contributions
 de-0.012
 lear-0.011
ives-0.009
 love-0.009
bows-0.009
Token nice
Token ID3621
Feature activation+7.14e-01

Pos logit contributions
 de+0.001
 lear+0.001
ives+0.001
joy+0.001
rav+0.001
Neg logit contributions
lf-0.001
 site-0.001
ered-0.001
izing-0.001
 avail-0.001
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
lf+0.014
 site+0.012
ered+0.012
izing+0.011
 avail+0.011
Neg logit contributions
 de-0.019
 lear-0.017
ives-0.016
joy-0.015
rav-0.014