TOP ACTIVATIONS
MAX = 5.075

would take out a j igsaw puzzle and try to solve
It was a big j igsaw puzzle and she wanted to
box and saw a j igsaw puzzle inside . He paused
cried . " The ost ric hes are mean ! They
. They were called ost ric hes . Lily and Ben
dish that the restaurant was known for . But Sam said
a place full of rain bows and butterflies . Sara 's
the brave little boy was known as the mail man of
then on , Mary was known as the little girl who
was a young man , known to all as Jimmy .
a boy . They had known each other since they were
, the small crab was known as the brav est crab
then on , Sarah was known across the town as the
ie . Unlike other ost ric hes , his feathers gl
, the bar ber was known as the village 's best
too close . The ost ric hes might pe ck you
. This old number was known far and wide for his
on , the battery was known as the brilliant battery !
on , the man was known as the strongest man in
and looked at the ost ric hes . They saw one

MIN ACTIVATIONS
MIN = -3.958

do that again . I promise ." Mom and
. And you have to promise not to eat my cake
belongings , and ask for permission before using them . <|endoftext|>
. But you have to promise me something . You have
. They never broke their promise again . <|endoftext|> Tom and
. And you have to promise me to be careful and
will be careful . I promise ." Lily said .
. But only if you promise not to chew on them
something . You have to promise me that you will not
. And you have to promise not to tell Mom or
Now , you have to promise me something . You have
. But you have to promise me to never do this
We promise , promise , promise ." The teacher
will like him , I promise ." Ben nodded
Lily said , " We promise , mom . Thank you
. And you have to promise me you will never touch
. But you have to promise me not to touch it
. But you have to promise me you will never play
. But you have to promise me you will never lie
B ella nodded and whim pered . She was ashamed and

INTERVAL 2.817 - 5.075
CONTAINS 0.021%

on , the princess was known for being the brave ,
taught him . " No matter what happens , Joe ,
. He knew that no matter how many times he asked
of the street . Exc ited , John began to follow
seemed to fit . No matter how hard True tried ,

INTERVAL 0.559 - 2.817
CONTAINS 17.357%

she put it back on his head . She asked Tim
on the floor while wearing his bath robe and g iggle
to the shore of the beach . There , they saw
He realised it was ok to be scared of loud things
it off to everyone . Everyone compliment ed Molly on her

INTERVAL -1.700 - 0.559
CONTAINS 81.145%

a big yellow carpet on the floor and she had so
g lee as the water soaked her clothes and hair .
it was so nice , so she picked up the peanut
not wake up when the air inside became cold and thin
She flew and flew and flew until she was back home

INTERVAL -3.958 - -1.700
CONTAINS 1.477%

the station , but he couldn 't get out . He
eating some fuel . Max couldn 't believe it . Everyone
allowed him to draw outside from then on . Tim my
chest ." L ily felt happy that she could
." L ily felt proud of herself for
Token would
Token ID561
Feature activation+3.45e-01

Pos logit contributions
elle+0.007
burn+0.007
ky+0.007
ami+0.007
ah+0.007
Neg logit contributions
and-0.007
 contain-0.007
 Follow-0.007
 Madd-0.006
 Stay-0.006
Token take
Token ID1011
Feature activation+1.43e+00

Pos logit contributions
elle+0.014
burn+0.014
ky+0.013
ami+0.013
ah+0.013
Neg logit contributions
and-0.013
 contain-0.012
 Follow-0.011
 Madd-0.011
 Stay-0.010
Token out
Token ID503
Feature activation+9.43e-01

Pos logit contributions
burn+0.057
aman+0.055
ami+0.054
elle+0.054
ited+0.053
Neg logit contributions
and-0.057
 contain-0.050
 Follow-0.049
 Stay-0.046
engu-0.046
Token a
Token ID257
Feature activation-1.47e-01

Pos logit contributions
burn+0.042
elle+0.040
andra+0.039
aman+0.039
ky+0.038
Neg logit contributions
and-0.034
 contain-0.029
 chambers-0.028
 Follow-0.027
engu-0.026
Token j
Token ID474
Feature activation+3.50e-01

Pos logit contributions
and+0.006
 contain+0.005
 Follow+0.005
 Stay+0.005
 Madd+0.005
Neg logit contributions
elle-0.006
burn-0.006
ky-0.006
ami-0.005
ah-0.005
Tokenigsaw
Token ID45636
Feature activation+4.69e+00

Pos logit contributions
elle+0.016
burn+0.016
ky+0.015
ami+0.015
ah+0.015
Neg logit contributions
and-0.011
 contain-0.010
 Follow-0.010
 Madd-0.009
 Stay-0.009
Token puzzle
Token ID15027
Feature activation+6.65e-01

Pos logit contributions
elle+0.177
ky+0.171
ami+0.168
burn+0.168
ah+0.160
Neg logit contributions
and-0.181
 contain-0.171
 Follow-0.169
 Stay-0.162
 Madd-0.159
Token and
Token ID290
Feature activation-1.91e-01

Pos logit contributions
elle+0.027
burn+0.027
ky+0.026
ami+0.026
andra+0.025
Neg logit contributions
and-0.025
 contain-0.022
 Follow-0.022
 Madd-0.020
 Stay-0.020
Token try
Token ID1949
Feature activation+3.83e-01

Pos logit contributions
and+0.007
 contain+0.007
 Follow+0.006
 Stay+0.006
 Madd+0.006
Neg logit contributions
elle-0.008
burn-0.008
ky-0.008
ami-0.008
ah-0.007
Token to
Token ID284
Feature activation+7.60e-01

Pos logit contributions
ited+0.020
burn+0.020
elle+0.019
aman+0.019
andra+0.019
Neg logit contributions
and-0.012
 contain-0.008
oles-0.007
engu-0.007
 Follow-0.007
Token solve
Token ID8494
Feature activation-1.30e-01

Pos logit contributions
elle+0.034
burn+0.034
ky+0.033
ami+0.033
ah+0.032
Neg logit contributions
and-0.025
 contain-0.022
 Follow-0.022
 Madd-0.020
 Stay-0.020
Token It
Token ID632
Feature activation-8.72e-01

Pos logit contributions
elle+0.009
ky+0.009
ami+0.009
en+0.008
andra+0.008
Neg logit contributions
and-0.012
 contain-0.012
 Follow-0.011
oles-0.011
engu-0.011
Token was
Token ID373
Feature activation+1.25e-01

Pos logit contributions
and+0.023
 contain+0.021
 Follow+0.020
 Madd+0.020
 Stay+0.019
Neg logit contributions
elle-0.043
burn-0.043
ami-0.043
ky-0.042
andra-0.041
Token a
Token ID257
Feature activation-1.62e-01

Pos logit contributions
elle+0.005
ami+0.005
burn+0.005
ited+0.005
andra+0.005
Neg logit contributions
and-0.004
 Madd-0.004
 contain-0.004
 Stay-0.004
 tad-0.004
Token big
Token ID1263
Feature activation+6.12e-01

Pos logit contributions
and+0.006
 contain+0.006
 Follow+0.005
 Madd+0.005
 Stay+0.005
Neg logit contributions
elle-0.007
burn-0.006
ky-0.006
ami-0.006
andra-0.006
Token j
Token ID474
Feature activation+3.30e-01

Pos logit contributions
elle+0.024
burn+0.024
ami+0.023
ky+0.023
andra+0.022
Neg logit contributions
and-0.023
 contain-0.021
 Follow-0.021
 Madd-0.019
 Stay-0.019
Tokenigsaw
Token ID45636
Feature activation+4.61e+00

Pos logit contributions
burn+0.016
elle+0.016
ami+0.016
ky+0.015
ep+0.015
Neg logit contributions
and-0.009
 contain-0.008
 Follow-0.008
 Madd-0.007
oles-0.007
Token puzzle
Token ID15027
Feature activation+6.26e-01

Pos logit contributions
elle+0.178
burn+0.171
ky+0.171
ami+0.168
ah+0.161
Neg logit contributions
and-0.178
 contain-0.165
 Follow-0.163
 Stay-0.154
 Madd-0.152
Token and
Token ID290
Feature activation+5.41e-03

Pos logit contributions
burn+0.025
elle+0.025
ami+0.024
andra+0.023
ky+0.023
Neg logit contributions
and-0.024
 contain-0.022
 Follow-0.021
 Madd-0.019
 Stay-0.019
Token she
Token ID673
Feature activation+6.12e-01

Pos logit contributions
burn+0.000
elle+0.000
ami+0.000
ky+0.000
ah+0.000
Neg logit contributions
and-0.000
 contain-0.000
 Follow-0.000
 Madd-0.000
 Stay-0.000
Token wanted
Token ID2227
Feature activation-1.28e-01

Pos logit contributions
elle+0.027
burn+0.026
ky+0.026
ami+0.026
ah+0.025
Neg logit contributions
and-0.020
 contain-0.019
 Follow-0.018
 Madd-0.017
 Stay-0.017
Token to
Token ID284
Feature activation+7.83e-01

Pos logit contributions
and+0.004
 contain+0.004
 Follow+0.003
 Madd+0.003
 Stay+0.003
Neg logit contributions
burn-0.006
elle-0.006
ami-0.006
ky-0.006
ited-0.006
Token box
Token ID3091
Feature activation-5.99e-01

Pos logit contributions
elle+0.014
ky+0.014
burn+0.013
en+0.013
ami+0.013
Neg logit contributions
and-0.012
 contain-0.011
 Follow-0.010
oles-0.010
unn-0.010
Token and
Token ID290
Feature activation-9.99e-02

Pos logit contributions
and+0.026
 contain+0.024
 Follow+0.024
 Madd+0.022
 Stay+0.022
Neg logit contributions
elle-0.020
burn-0.020
ky-0.019
ami-0.019
andra-0.018
Token saw
Token ID2497
Feature activation-1.10e-01

Pos logit contributions
and+0.004
 contain+0.004
 Follow+0.004
 Stay+0.004
 Madd+0.004
Neg logit contributions
elle-0.003
burn-0.003
ky-0.003
ami-0.003
andra-0.003
Token a
Token ID257
Feature activation-3.11e-01

Pos logit contributions
and+0.005
 contain+0.005
 Follow+0.005
 Stay+0.005
unn+0.005
Neg logit contributions
elle-0.003
ami-0.003
ky-0.003
burn-0.003
ep-0.003
Token j
Token ID474
Feature activation+2.88e-01

Pos logit contributions
and+0.013
 contain+0.012
 Follow+0.011
unn+0.011
 Stay+0.010
Neg logit contributions
elle-0.012
burn-0.011
ky-0.011
ami-0.011
ah-0.011
Tokenigsaw
Token ID45636
Feature activation+4.59e+00

Pos logit contributions
elle+0.013
burn+0.013
ky+0.013
ami+0.013
andra+0.012
Neg logit contributions
and-0.009
 contain-0.008
 Follow-0.008
 Madd-0.007
 Stay-0.007
Token puzzle
Token ID15027
Feature activation+5.51e-01

Pos logit contributions
elle+0.168
ky+0.165
ami+0.158
burn+0.155
en+0.153
Neg logit contributions
and-0.186
 Follow-0.172
 contain-0.171
 Stay-0.168
 Madd-0.159
Token inside
Token ID2641
Feature activation-7.12e-01

Pos logit contributions
elle+0.019
burn+0.018
ky+0.018
ami+0.018
ah+0.017
Neg logit contributions
and-0.024
 contain-0.022
 Follow-0.022
 Madd-0.021
 Stay-0.020
Token.
Token ID13
Feature activation+2.68e-01

Pos logit contributions
and+0.030
 contain+0.028
 Follow+0.027
 Stay+0.026
 Madd+0.025
Neg logit contributions
burn-0.026
elle-0.025
ky-0.024
ami-0.023
ah-0.023
Token He
Token ID679
Feature activation-2.59e-02

Pos logit contributions
andra+0.009
ami+0.009
elle+0.009
ky+0.009
aman+0.009
Neg logit contributions
and-0.010
 contain-0.010
unn-0.009
oles-0.009
 plunge-0.009
Token paused
Token ID24487
Feature activation+1.25e-01

Pos logit contributions
and+0.001
 contain+0.001
 Follow+0.001
unn+0.001
 Stay+0.001
Neg logit contributions
 str-0.001
ep-0.001
ky-0.001
elle-0.001
ine-0.001
Token cried
Token ID16896
Feature activation-1.37e+00

Pos logit contributions
and+0.000
 Follow+0.000
 contain+0.000
 Madd+0.000
engu+0.000
Neg logit contributions
elle-0.000
burn-0.000
ky-0.000
ami-0.000
ine-0.000
Token.
Token ID13
Feature activation+1.54e-01

Pos logit contributions
and+0.064
 contain+0.061
 Follow+0.060
 Madd+0.056
 Stay+0.056
Neg logit contributions
elle-0.041
burn-0.040
ky-0.039
ami-0.038
ah-0.037
Token "
Token ID366
Feature activation-7.35e-01

Pos logit contributions
elle+0.008
burn+0.008
ky+0.007
ami+0.007
ah+0.007
Neg logit contributions
and-0.004
 contain-0.004
 Follow-0.004
 Stay-0.003
 Madd-0.003
TokenThe
Token ID464
Feature activation-3.98e-01

Pos logit contributions
and+0.032
 contain+0.030
 Follow+0.029
 Madd+0.027
 Stay+0.027
Neg logit contributions
elle-0.025
burn-0.024
ky-0.024
ami-0.023
ah-0.022
Token ost
Token ID23619
Feature activation-6.06e-01

Pos logit contributions
and+0.015
 contain+0.014
 Follow+0.013
 Madd+0.012
 Stay+0.012
Neg logit contributions
elle-0.016
burn-0.016
ky-0.015
ami-0.015
ah-0.015
Tokenric
Token ID1173
Feature activation+4.24e+00

Pos logit contributions
oles+0.021
and+0.019
 contain+0.018
engu+0.018
unn+0.017
Neg logit contributions
elle-0.024
andra-0.024
ited-0.023
 str-0.023
aman-0.023
Tokenhes
Token ID956
Feature activation+1.00e+00

Pos logit contributions
burn+0.137
elle+0.134
ky+0.132
aman+0.131
andra+0.126
Neg logit contributions
and-0.202
 Follow-0.176
oles-0.175
 contain-0.175
adders-0.171
Token are
Token ID389
Feature activation-8.49e-03

Pos logit contributions
elle+0.050
burn+0.049
ky+0.049
ami+0.048
andra+0.047
Neg logit contributions
and-0.027
 contain-0.024
 Follow-0.023
 Stay-0.021
 Madd-0.021
Token mean
Token ID1612
Feature activation+4.88e-01

Pos logit contributions
and+0.000
 contain+0.000
 Follow+0.000
 Stay+0.000
 Madd+0.000
Neg logit contributions
elle-0.000
ami-0.000
andra-0.000
burn-0.000
aman-0.000
Token!
Token ID0
Feature activation-2.82e-01

Pos logit contributions
elle+0.019
burn+0.019
ami+0.018
ky+0.018
andra+0.018
Neg logit contributions
and-0.019
 contain-0.017
 Follow-0.017
 Stay-0.016
oles-0.015
Token They
Token ID1119
Feature activation+4.52e-01

Pos logit contributions
and+0.008
 contain+0.007
 Follow+0.007
 Stay+0.007
 Madd+0.006
Neg logit contributions
elle-0.014
burn-0.013
ky-0.013
ami-0.013
andra-0.013
Token.
Token ID13
Feature activation-1.22e-01

Pos logit contributions
and+0.023
 contain+0.021
oles+0.019
 Stay+0.018
 Follow+0.018
Neg logit contributions
elle-0.025
andra-0.025
ami-0.024
ited-0.023
burn-0.023
Token They
Token ID1119
Feature activation+4.73e-01

Pos logit contributions
and+0.004
 contain+0.004
 Follow+0.004
unn+0.004
 plunge+0.004
Neg logit contributions
elle-0.005
ky-0.005
ami-0.005
ine-0.005
andra-0.004
Token were
Token ID547
Feature activation-7.77e-02

Pos logit contributions
elle+0.020
burn+0.020
ami+0.020
ky+0.019
andra+0.019
Neg logit contributions
and-0.016
 contain-0.015
 Follow-0.014
 Madd-0.013
 Stay-0.013
Token called
Token ID1444
Feature activation-3.45e-01

Pos logit contributions
and+0.003
 contain+0.002
 Follow+0.002
 Madd+0.002
 Stay+0.002
Neg logit contributions
elle-0.003
burn-0.003
ky-0.003
ami-0.003
andra-0.003
Token ost
Token ID23619
Feature activation-6.46e-01

Pos logit contributions
and+0.013
 contain+0.011
 Follow+0.011
 Madd+0.010
 Stay+0.010
Neg logit contributions
elle-0.014
burn-0.014
ky-0.013
ami-0.013
ah-0.013
Tokenric
Token ID1173
Feature activation+4.20e+00

Pos logit contributions
and+0.026
oles+0.025
 contain+0.024
engu+0.024
 Follow+0.023
Neg logit contributions
elle-0.023
burn-0.022
ited-0.021
 str-0.020
andra-0.020
Tokenhes
Token ID956
Feature activation+1.27e+00

Pos logit contributions
burn+0.134
elle+0.131
aman+0.125
ky+0.124
ited+0.121
Neg logit contributions
and-0.200
 contain-0.178
oles-0.177
 Follow-0.176
adders-0.172
Token.
Token ID13
Feature activation-6.75e-02

Pos logit contributions
elle+0.051
burn+0.050
ami+0.049
ky+0.049
andra+0.048
Neg logit contributions
and-0.046
 contain-0.043
 Follow-0.041
oles-0.040
 Stay-0.039
Token Lily
Token ID20037
Feature activation-8.97e-02

Pos logit contributions
and+0.002
 contain+0.002
 Follow+0.002
unn+0.002
 plunge+0.002
Neg logit contributions
elle-0.003
ami-0.003
ky-0.003
burn-0.003
ine-0.003
Token and
Token ID290
Feature activation-1.78e-01

Pos logit contributions
and+0.003
 contain+0.003
 Follow+0.003
 Stay+0.003
 Madd+0.003
Neg logit contributions
burn-0.004
elle-0.004
ami-0.003
ky-0.003
ah-0.003
Token Ben
Token ID3932
Feature activation-2.84e-02

Pos logit contributions
 Madd+0.005
 contain+0.005
and+0.005
 Stay+0.005
 Follow+0.005
Neg logit contributions
burn-0.008
ited-0.008
ky-0.008
elle-0.008
ep-0.007
Token dish
Token ID9433
Feature activation+4.11e-01

Pos logit contributions
elle+0.021
ky+0.021
en+0.020
ami+0.020
burn+0.020
Neg logit contributions
and-0.023
 Follow-0.021
 contain-0.021
oles-0.020
 Stay-0.020
Token that
Token ID326
Feature activation+3.29e-01

Pos logit contributions
elle+0.015
burn+0.015
ky+0.014
ami+0.014
ah+0.014
Neg logit contributions
and-0.016
 contain-0.015
 Follow-0.015
 Madd-0.014
 Stay-0.014
Token the
Token ID262
Feature activation+4.89e-01

Pos logit contributions
elle+0.013
burn+0.013
ky+0.012
ami+0.012
andra+0.012
Neg logit contributions
and-0.013
 contain-0.012
 Follow-0.011
 Madd-0.011
 Stay-0.011
Token restaurant
Token ID7072
Feature activation+1.49e-01

Pos logit contributions
elle+0.020
ky+0.019
ami+0.019
burn+0.019
ited+0.018
Neg logit contributions
and-0.018
 Follow-0.017
 contain-0.016
oles-0.015
 Madd-0.015
Token was
Token ID373
Feature activation+3.36e-01

Pos logit contributions
elle+0.006
burn+0.006
ky+0.006
ami+0.006
ah+0.005
Neg logit contributions
and-0.006
 contain-0.005
 Follow-0.005
 Madd-0.005
 Stay-0.005
Token known
Token ID1900
Feature activation+4.19e+00

Pos logit contributions
elle+0.015
burn+0.014
ami+0.014
ky+0.013
ah+0.013
Neg logit contributions
and-0.011
 contain-0.011
 Follow-0.010
 Stay-0.010
 Madd-0.010
Token for
Token ID329
Feature activation-7.15e-01

Pos logit contributions
elle+0.163
burn+0.161
ky+0.155
ami+0.155
andra+0.150
Neg logit contributions
and-0.163
 contain-0.151
 Follow-0.146
 Madd-0.136
 Stay-0.136
Token.
Token ID13
Feature activation+2.63e-01

Pos logit contributions
and+0.030
 contain+0.027
 Follow+0.027
 Madd+0.025
 Stay+0.025
Neg logit contributions
elle-0.026
burn-0.025
ky-0.024
ami-0.024
ah-0.023
Token But
Token ID887
Feature activation-5.83e-02

Pos logit contributions
elle+0.012
ky+0.011
ami+0.011
burn+0.011
ah+0.011
Neg logit contributions
and-0.009
 contain-0.008
 Follow-0.007
 Madd-0.007
oles-0.007
Token Sam
Token ID3409
Feature activation+1.06e+00

Pos logit contributions
and+0.002
 contain+0.002
 Follow+0.002
 Madd+0.002
 Stay+0.002
Neg logit contributions
burn-0.002
elle-0.002
ami-0.002
ky-0.002
andra-0.002
Token said
Token ID531
Feature activation+1.73e-01

Pos logit contributions
burn+0.046
elle+0.044
ami+0.043
ky+0.043
aman+0.040
Neg logit contributions
and-0.035
 contain-0.033
 Follow-0.032
 Stay-0.030
 Madd-0.029
Token a
Token ID257
Feature activation-1.49e-01

Pos logit contributions
and+0.017
 contain+0.016
 Follow+0.016
 Madd+0.015
 Stay+0.015
Neg logit contributions
elle-0.015
burn-0.015
ky-0.014
ami-0.014
ah-0.014
Token place
Token ID1295
Feature activation+9.69e-02

Pos logit contributions
and+0.006
 contain+0.005
 Follow+0.005
unn+0.005
 Stay+0.005
Neg logit contributions
elle-0.006
burn-0.006
ky-0.005
ah-0.005
ine-0.005
Token full
Token ID1336
Feature activation-8.90e-01

Pos logit contributions
elle+0.004
burn+0.004
ami+0.004
ky+0.004
andra+0.003
Neg logit contributions
and-0.004
 contain-0.004
 Follow-0.003
 Madd-0.003
 Stay-0.003
Token of
Token ID286
Feature activation-6.50e-02

Pos logit contributions
and+0.026
 Follow+0.025
 contain+0.022
engu+0.022
unn+0.021
Neg logit contributions
elle-0.039
ky-0.039
en-0.039
ami-0.038
burn-0.038
Token rain
Token ID6290
Feature activation-2.53e-01

Pos logit contributions
and+0.003
 Follow+0.002
 contain+0.002
unny+0.002
engu+0.002
Neg logit contributions
elle-0.002
en-0.002
 afar-0.002
ky-0.002
ep-0.002
Tokenbows
Token ID25435
Feature activation+4.11e+00

Pos logit contributions
and+0.011
 contain+0.010
 Follow+0.009
 Madd+0.009
oles+0.009
Neg logit contributions
elle-0.009
ami-0.009
andra-0.009
burn-0.008
ky-0.008
Token and
Token ID290
Feature activation-1.70e-01

Pos logit contributions
elle+0.170
burn+0.166
ami+0.166
andra+0.164
aman+0.159
Neg logit contributions
and-0.146
 contain-0.137
oles-0.123
 Madd-0.117
 Follow-0.116
Token butterflies
Token ID44571
Feature activation-3.60e-01

Pos logit contributions
and+0.006
 Follow+0.006
 contain+0.006
oles+0.005
engu+0.005
Neg logit contributions
elle-0.007
burn-0.007
ky-0.007
ine-0.006
ami-0.006
Token.
Token ID13
Feature activation-2.16e-01

Pos logit contributions
and+0.015
 contain+0.014
 Follow+0.013
 Madd+0.013
oles+0.013
Neg logit contributions
elle-0.013
ami-0.013
burn-0.013
andra-0.012
ky-0.012
Token Sara
Token ID24799
Feature activation-2.07e-01

Pos logit contributions
and+0.007
 contain+0.007
 Follow+0.007
oles+0.006
 tad+0.006
Neg logit contributions
elle-0.009
burn-0.009
ky-0.009
ami-0.008
ine-0.008
Token's
Token ID338
Feature activation-8.86e-02

Pos logit contributions
and+0.008
 contain+0.007
 Follow+0.007
 Madd+0.006
 Stay+0.006
Neg logit contributions
elle-0.008
burn-0.008
ky-0.008
ami-0.008
ah-0.008
Token the
Token ID262
Feature activation+6.67e-02

Pos logit contributions
and+0.006
 contain+0.006
 Madd+0.006
 Follow+0.006
 Stay+0.005
Neg logit contributions
burn-0.007
elle-0.007
ky-0.007
ami-0.007
ited-0.007
Token brave
Token ID14802
Feature activation-7.60e-01

Pos logit contributions
elle+0.003
ky+0.003
ami+0.003
burn+0.003
ah+0.002
Neg logit contributions
and-0.003
 contain-0.002
 Follow-0.002
unn-0.002
oles-0.002
Token little
Token ID1310
Feature activation+6.32e-01

Pos logit contributions
and+0.023
 contain+0.020
 Follow+0.020
 Madd+0.018
 Stay+0.018
Neg logit contributions
elle-0.037
ky-0.035
burn-0.035
ami-0.035
ah-0.034
Token boy
Token ID2933
Feature activation-5.28e-01

Pos logit contributions
elle+0.034
burn+0.033
ky+0.032
ami+0.032
ah+0.031
Neg logit contributions
and-0.016
 contain-0.014
 Follow-0.013
 Madd-0.012
 Stay-0.012
Token was
Token ID373
Feature activation+8.41e-02

Pos logit contributions
and+0.017
 Follow+0.016
 contain+0.016
 Stay+0.015
 tad+0.014
Neg logit contributions
burn-0.023
elle-0.022
ami-0.022
aman-0.022
ky-0.021
Token known
Token ID1900
Feature activation+4.10e+00

Pos logit contributions
ami+0.004
ky+0.004
elle+0.004
aman+0.004
andra+0.004
Neg logit contributions
and-0.003
 contain-0.003
 Madd-0.002
 Stay-0.002
 Follow-0.002
Token as
Token ID355
Feature activation+8.38e-01

Pos logit contributions
elle+0.141
burn+0.140
ami+0.134
ky+0.132
andra+0.130
Neg logit contributions
and-0.180
 contain-0.169
 Follow-0.163
 Madd-0.154
 Stay-0.153
Token the
Token ID262
Feature activation+1.60e-01

Pos logit contributions
burn+0.039
elle+0.036
ited+0.035
ky+0.035
ami+0.034
Neg logit contributions
and-0.028
 contain-0.026
 Follow-0.026
 Madd-0.026
 tad-0.025
Token mail
Token ID6920
Feature activation+1.57e-01

Pos logit contributions
elle+0.007
burn+0.007
ky+0.007
ami+0.007
andra+0.006
Neg logit contributions
and-0.005
 contain-0.005
 Follow-0.005
 Madd-0.004
 Stay-0.004
Tokenman
Token ID805
Feature activation+3.03e-01

Pos logit contributions
burn+0.007
elle+0.007
ky+0.007
ited+0.006
ami+0.006
Neg logit contributions
and-0.005
 contain-0.005
 Follow-0.005
 Stay-0.005
hog-0.004
Token of
Token ID286
Feature activation-2.18e-01

Pos logit contributions
burn+0.012
elle+0.012
aman+0.012
ky+0.011
ami+0.011
Neg logit contributions
and-0.011
 contain-0.010
 Follow-0.010
 Stay-0.009
hog-0.009
Token then
Token ID788
Feature activation-8.24e-01

Pos logit contributions
 contain+0.001
 Follow+0.001
and+0.001
 Madd+0.001
 Stay+0.001
Neg logit contributions
burn-0.002
elle-0.001
ami-0.001
andra-0.001
ine-0.001
Token on
Token ID319
Feature activation+5.73e-01

Pos logit contributions
and+0.024
 Follow+0.023
 Stay+0.022
 contain+0.021
 Go+0.019
Neg logit contributions
ami-0.039
andra-0.038
ky-0.038
aman-0.037
elle-0.036
Token,
Token ID11
Feature activation+2.17e-01

Pos logit contributions
burn+0.024
elle+0.024
ky+0.023
ami+0.023
andra+0.022
Neg logit contributions
and-0.020
 contain-0.018
 Follow-0.018
 Madd-0.017
 Stay-0.017
Token Mary
Token ID5335
Feature activation+8.13e-01

Pos logit contributions
burn+0.009
ky+0.009
elle+0.009
ine+0.009
ep+0.009
Neg logit contributions
and-0.008
 contain-0.007
 Follow-0.007
 Stay-0.007
 Madd-0.007
Token was
Token ID373
Feature activation+2.36e-01

Pos logit contributions
burn+0.035
aman+0.035
ami+0.034
elle+0.034
ky+0.034
Neg logit contributions
and-0.024
 Follow-0.022
 Stay-0.022
unn-0.020
 Madd-0.020
Token known
Token ID1900
Feature activation+4.05e+00

Pos logit contributions
ky+0.012
ami+0.011
aman+0.011
elle+0.011
andra+0.011
Neg logit contributions
and-0.007
 Stay-0.006
 Madd-0.006
 contain-0.006
 Follow-0.006
Token as
Token ID355
Feature activation+1.09e+00

Pos logit contributions
elle+0.125
burn+0.122
ky+0.118
ami+0.117
andra+0.111
Neg logit contributions
and-0.189
 contain-0.177
 Follow-0.174
 Madd-0.164
 Stay-0.163
Token the
Token ID262
Feature activation+3.86e-01

Pos logit contributions
burn+0.047
elle+0.044
ky+0.044
ep+0.042
ine+0.042
Neg logit contributions
and-0.040
 Madd-0.037
 contain-0.037
 Follow-0.036
 tad-0.036
Token little
Token ID1310
Feature activation+5.78e-01

Pos logit contributions
elle+0.017
burn+0.017
ky+0.016
ami+0.016
ine+0.016
Neg logit contributions
and-0.013
 contain-0.012
 Follow-0.012
 Stay-0.011
 tad-0.010
Token girl
Token ID2576
Feature activation-2.82e-01

Pos logit contributions
burn+0.032
elle+0.032
ine+0.031
ami+0.031
ky+0.031
Neg logit contributions
and-0.012
 contain-0.010
 Follow-0.010
 Stay-0.009
 tad-0.009
Token who
Token ID508
Feature activation+2.76e-01

Pos logit contributions
and+0.008
 Follow+0.007
 contain+0.007
 Stay+0.007
hog+0.006
Neg logit contributions
burn-0.014
aman-0.013
elle-0.013
ky-0.013
ami-0.013
Token was
Token ID373
Feature activation+1.54e-01

Pos logit contributions
and+0.049
 contain+0.048
 Follow+0.047
unn+0.045
 hind+0.043
Neg logit contributions
elle-0.036
ah-0.033
en-0.032
ine-0.032
burn-0.030
Token a
Token ID257
Feature activation+1.20e-02

Pos logit contributions
elle+0.005
burn+0.004
ky+0.004
ami+0.004
ah+0.004
Neg logit contributions
and-0.007
 contain-0.007
 Follow-0.007
 Madd-0.006
 Stay-0.006
Token young
Token ID1862
Feature activation+3.11e-01

Pos logit contributions
elle+0.001
burn+0.001
ky+0.001
ami+0.001
andra+0.001
Neg logit contributions
and-0.000
 contain-0.000
 Follow-0.000
 Madd-0.000
 Stay-0.000
Token man
Token ID582
Feature activation+5.71e-01

Pos logit contributions
elle+0.014
burn+0.014
ky+0.014
ami+0.014
ah+0.013
Neg logit contributions
and-0.010
 contain-0.009
 Follow-0.009
 Stay-0.008
 Madd-0.008
Token,
Token ID11
Feature activation+3.43e-01

Pos logit contributions
aman+0.026
andra+0.026
burn+0.025
elle+0.025
ited+0.025
Neg logit contributions
and-0.020
 contain-0.016
 Stay-0.016
 Follow-0.015
oles-0.014
Token known
Token ID1900
Feature activation+4.03e+00

Pos logit contributions
elle+0.014
burn+0.014
ami+0.014
ited+0.014
en+0.013
Neg logit contributions
and-0.012
 Madd-0.012
 contain-0.012
 Follow-0.011
 Stay-0.011
Token to
Token ID284
Feature activation+8.18e-01

Pos logit contributions
elle+0.148
burn+0.145
ky+0.143
ami+0.141
ah+0.135
Neg logit contributions
and-0.161
 contain-0.149
 Follow-0.148
 Stay-0.137
 Madd-0.136
Token all
Token ID477
Feature activation-2.25e-01

Pos logit contributions
elle+0.032
burn+0.031
ky+0.030
ami+0.030
andra+0.029
Neg logit contributions
and-0.032
 contain-0.030
 Follow-0.029
 Stay-0.028
 Madd-0.027
Token as
Token ID355
Feature activation+1.24e+00

Pos logit contributions
and+0.008
 contain+0.007
 Stay+0.007
 Follow+0.007
igators+0.007
Neg logit contributions
andra-0.010
burn-0.010
ky-0.009
ami-0.009
aman-0.009
Token Jimmy
Token ID12963
Feature activation+8.76e-02

Pos logit contributions
elle+0.049
burn+0.049
ky+0.047
ited+0.046
ami+0.046
Neg logit contributions
and-0.047
 contain-0.042
 Follow-0.041
 Madd-0.040
 tad-0.040
Token.
Token ID13
Feature activation+1.80e-01

Pos logit contributions
burn+0.004
elle+0.004
aman+0.004
ami+0.004
andra+0.004
Neg logit contributions
and-0.003
 contain-0.002
 Stay-0.002
 Follow-0.002
engu-0.002
Token a
Token ID257
Feature activation-2.15e-01

Pos logit contributions
ited+0.009
andra+0.009
ami+0.009
aman+0.009
elle+0.009
Neg logit contributions
and-0.006
 Madd-0.006
 Stay-0.005
 Follow-0.005
 contain-0.005
Token boy
Token ID2933
Feature activation-3.45e-01

Pos logit contributions
and+0.007
 Follow+0.006
 contain+0.006
 Stay+0.005
 tad+0.005
Neg logit contributions
elle-0.010
ami-0.010
burn-0.009
andra-0.009
ky-0.009
Token.
Token ID13
Feature activation-1.19e-03

Pos logit contributions
and+0.015
 Follow+0.013
 contain+0.013
 Stay+0.013
 Madd+0.012
Neg logit contributions
burn-0.012
elle-0.012
andra-0.012
aman-0.012
ited-0.012
Token They
Token ID1119
Feature activation+5.93e-01

Pos logit contributions
and+0.000
 contain+0.000
 Follow+0.000
oles+0.000
unn+0.000
Neg logit contributions
elle-0.000
ky-0.000
ami-0.000
burn-0.000
ine-0.000
Token had
Token ID550
Feature activation+1.50e-01

Pos logit contributions
elle+0.029
burn+0.028
ky+0.028
ami+0.028
andra+0.027
Neg logit contributions
and-0.017
 contain-0.015
 Follow-0.015
 Madd-0.013
 Stay-0.013
Token known
Token ID1900
Feature activation+4.03e+00

Pos logit contributions
elle+0.006
ami+0.005
burn+0.005
ky+0.005
ited+0.005
Neg logit contributions
and-0.006
 contain-0.006
 Follow-0.005
 Stay-0.005
 Madd-0.005
Token each
Token ID1123
Feature activation-3.71e-01

Pos logit contributions
elle+0.127
burn+0.124
ky+0.120
ami+0.118
ah+0.113
Neg logit contributions
and-0.183
 contain-0.170
 Follow-0.168
 Madd-0.158
 Stay-0.157
Token other
Token ID584
Feature activation-2.25e-01

Pos logit contributions
and+0.008
 contain+0.007
 Follow+0.007
 Stay+0.006
 Madd+0.006
Neg logit contributions
elle-0.021
burn-0.020
ky-0.020
ami-0.020
andra-0.019
Token since
Token ID1201
Feature activation-7.67e-01

Pos logit contributions
and+0.009
 contain+0.008
 Follow+0.008
 Stay+0.008
engu+0.008
Neg logit contributions
burn-0.008
elle-0.008
andra-0.008
ami-0.008
ited-0.008
Token they
Token ID484
Feature activation+1.70e-01

Pos logit contributions
and+0.026
 contain+0.024
 Stay+0.023
 Follow+0.023
 Madd+0.023
Neg logit contributions
elle-0.032
burn-0.032
ky-0.032
ami-0.031
ited-0.031
Token were
Token ID547
Feature activation+9.52e-02

Pos logit contributions
elle+0.010
burn+0.009
ami+0.009
aman+0.009
ky+0.009
Neg logit contributions
 contain-0.004
and-0.003
 Follow-0.003
 Stay-0.003
 tad-0.003
Token,
Token ID11
Feature activation+1.90e-01

Pos logit contributions
elle+0.023
burn+0.023
ami+0.022
ky+0.022
ah+0.021
Neg logit contributions
and-0.020
 contain-0.019
 Follow-0.018
 Madd-0.017
 Stay-0.017
Token the
Token ID262
Feature activation+2.80e-01

Pos logit contributions
elle+0.007
burn+0.007
ky+0.007
ami+0.007
andra+0.007
Neg logit contributions
and-0.007
 contain-0.007
 Follow-0.007
 Madd-0.007
 Stay-0.006
Token small
Token ID1402
Feature activation+6.54e-01

Pos logit contributions
elle+0.012
ited+0.012
ami+0.012
ky+0.012
ah+0.012
Neg logit contributions
and-0.009
 Follow-0.008
 contain-0.007
unn-0.007
 Stay-0.007
Token crab
Token ID32202
Feature activation-5.55e-01

Pos logit contributions
ky+0.031
elle+0.031
ami+0.031
ited+0.031
ah+0.031
Neg logit contributions
and-0.020
 Follow-0.017
 contain-0.016
adders-0.016
engu-0.015
Token was
Token ID373
Feature activation+1.97e-01

Pos logit contributions
and+0.019
 Follow+0.017
 contain+0.017
 Stay+0.016
oles+0.016
Neg logit contributions
burn-0.024
elle-0.023
ky-0.023
ami-0.022
aman-0.022
Token known
Token ID1900
Feature activation+4.02e+00

Pos logit contributions
ami+0.009
ky+0.009
elle+0.009
aman+0.009
andra+0.009
Neg logit contributions
and-0.006
 contain-0.005
 Madd-0.005
 Stay-0.005
 Follow-0.005
Token as
Token ID355
Feature activation+1.10e+00

Pos logit contributions
elle+0.118
burn+0.114
ky+0.112
ami+0.110
ah+0.104
Neg logit contributions
and-0.192
 contain-0.179
 Follow-0.177
 Stay-0.167
 Madd-0.167
Token the
Token ID262
Feature activation+3.18e-01

Pos logit contributions
burn+0.045
elle+0.045
ky+0.042
ited+0.041
ami+0.041
Neg logit contributions
and-0.041
 contain-0.038
 Follow-0.037
 Madd-0.037
 tad-0.037
Token brav
Token ID49025
Feature activation-6.76e-01

Pos logit contributions
elle+0.013
burn+0.013
ky+0.013
ami+0.013
ah+0.012
Neg logit contributions
and-0.011
 contain-0.010
 Follow-0.010
 Madd-0.009
 Stay-0.009
Tokenest
Token ID395
Feature activation+1.33e+00

Pos logit contributions
and+0.022
 contain+0.019
 Follow+0.019
 Stay+0.018
 tad+0.018
Neg logit contributions
burn-0.030
elle-0.029
ky-0.029
ami-0.029
andra-0.028
Token crab
Token ID32202
Feature activation-5.84e-01

Pos logit contributions
elle+0.075
burn+0.074
ky+0.073
ami+0.072
andra+0.071
Neg logit contributions
and-0.028
 contain-0.025
 Follow-0.023
 Madd-0.021
 tad-0.020
Token then
Token ID788
Feature activation-9.06e-01

Pos logit contributions
 contain+0.003
 Follow+0.003
and+0.003
 Madd+0.003
 Stay+0.003
Neg logit contributions
burn-0.004
ami-0.004
elle-0.004
andra-0.004
ah-0.003
Token on
Token ID319
Feature activation+5.70e-01

Pos logit contributions
and+0.027
 Follow+0.026
 Stay+0.025
 contain+0.025
 Go+0.022
Neg logit contributions
ami-0.043
andra-0.042
ky-0.041
aman-0.039
ited-0.039
Token,
Token ID11
Feature activation+1.97e-01

Pos logit contributions
burn+0.025
ami+0.025
elle+0.025
ky+0.025
ited+0.024
Neg logit contributions
and-0.018
 contain-0.016
 Madd-0.016
 Follow-0.016
oles-0.015
Token Sarah
Token ID10490
Feature activation+1.91e+00

Pos logit contributions
ky+0.008
ep+0.008
ami+0.008
burn+0.008
elle+0.008
Neg logit contributions
and-0.007
 Madd-0.006
 contain-0.006
 Stay-0.006
 Follow-0.006
Token was
Token ID373
Feature activation+2.46e-01

Pos logit contributions
ami+0.082
burn+0.081
aman+0.078
ky+0.078
elle+0.077
Neg logit contributions
and-0.057
 Follow-0.052
 Stay-0.051
 contain-0.048
 Madd-0.048
Token known
Token ID1900
Feature activation+4.02e+00

Pos logit contributions
ami+0.013
ky+0.013
aman+0.012
andra+0.012
elle+0.012
Neg logit contributions
and-0.007
 Stay-0.006
 Madd-0.006
 contain-0.006
 Follow-0.005
Token across
Token ID1973
Feature activation-3.49e-01

Pos logit contributions
elle+0.126
burn+0.123
ky+0.119
ami+0.118
ah+0.112
Neg logit contributions
and-0.185
 contain-0.174
 Follow-0.171
 Madd-0.161
 Stay-0.160
Token the
Token ID262
Feature activation+4.52e-01

Pos logit contributions
and+0.013
 Follow+0.012
 contain+0.012
 Madd+0.011
 Stay+0.011
Neg logit contributions
burn-0.015
andra-0.014
elle-0.014
ami-0.013
ah-0.013
Token town
Token ID3240
Feature activation-5.07e-02

Pos logit contributions
elle+0.020
ami+0.019
burn+0.019
ky+0.019
ine+0.018
Neg logit contributions
and-0.015
 contain-0.014
 Follow-0.014
 Stay-0.013
 Madd-0.012
Token as
Token ID355
Feature activation+1.08e+00

Pos logit contributions
and+0.002
 contain+0.002
 Follow+0.002
 Stay+0.002
 Madd+0.002
Neg logit contributions
burn-0.002
elle-0.002
ky-0.002
ami-0.002
andra-0.002
Token the
Token ID262
Feature activation+4.47e-01

Pos logit contributions
burn+0.042
ky+0.042
ited+0.042
elle+0.041
andra+0.040
Neg logit contributions
and-0.042
 Madd-0.038
 tad-0.035
 contain-0.035
 Follow-0.035
Tokenie
Token ID494
Feature activation+8.59e-02

Pos logit contributions
andra+0.016
ami+0.016
ited+0.015
 afar+0.015
elle+0.015
Neg logit contributions
and-0.016
oles-0.015
unn-0.014
 Madd-0.014
 contain-0.014
Token.
Token ID13
Feature activation-3.02e-02

Pos logit contributions
burn+0.003
elle+0.003
andra+0.003
ited+0.003
ami+0.003
Neg logit contributions
and-0.003
 contain-0.003
 Follow-0.003
 Stay-0.003
oles-0.003
Token Unlike
Token ID12101
Feature activation+1.50e+00

Pos logit contributions
 contain+0.001
and+0.001
 Follow+0.001
suits+0.001
vere+0.001
Neg logit contributions
ah-0.001
elle-0.001
ine-0.001
ky-0.001
en-0.001
Token other
Token ID584
Feature activation-3.02e-01

Pos logit contributions
burn+0.065
ami+0.063
andra+0.063
elle+0.061
ky+0.061
Neg logit contributions
and-0.051
 contain-0.049
 Madd-0.045
 Follow-0.044
oles-0.042
Token ost
Token ID23619
Feature activation-1.12e+00

Pos logit contributions
and+0.010
 contain+0.009
 Follow+0.009
 Madd+0.008
oles+0.008
Neg logit contributions
elle-0.014
burn-0.013
ky-0.013
ami-0.013
ah-0.013
Tokenric
Token ID1173
Feature activation+4.00e+00

Pos logit contributions
and+0.042
oles+0.040
engu+0.037
unn+0.036
 contain+0.036
Neg logit contributions
elle-0.042
burn-0.042
ited-0.040
 str-0.038
ky-0.038
Tokenhes
Token ID956
Feature activation+1.33e+00

Pos logit contributions
burn+0.131
elle+0.124
aman+0.120
ky+0.119
ited+0.114
Neg logit contributions
and-0.193
 contain-0.169
 Follow-0.168
oles-0.164
adders-0.163
Token,
Token ID11
Feature activation+7.30e-02

Pos logit contributions
burn+0.060
ami+0.060
elle+0.060
ky+0.059
andra+0.059
Neg logit contributions
and-0.041
 contain-0.038
 Follow-0.036
 Stay-0.035
oles-0.034
Token his
Token ID465
Feature activation+6.64e-01

Pos logit contributions
elle+0.003
burn+0.003
ky+0.003
ami+0.003
ah+0.002
Neg logit contributions
and-0.003
 contain-0.003
 Follow-0.003
 Stay-0.002
engu-0.002
Token feathers
Token ID27400
Feature activation-1.13e+00

Pos logit contributions
elle+0.027
burn+0.027
ky+0.026
ine+0.026
ami+0.026
Neg logit contributions
and-0.024
 contain-0.023
 Follow-0.022
 Madd-0.020
 Stay-0.020
Token gl
Token ID1278
Feature activation-8.71e-01

Pos logit contributions
and+0.026
 contain+0.025
oles+0.023
 Stay+0.021
 Follow+0.021
Neg logit contributions
ami-0.059
elle-0.059
burn-0.058
andra-0.057
ky-0.057
Token,
Token ID11
Feature activation+1.79e-01

Pos logit contributions
and+0.003
 contain+0.003
 Follow+0.003
 Stay+0.002
 Madd+0.002
Neg logit contributions
burn-0.003
elle-0.003
ky-0.003
ami-0.003
andra-0.003
Token the
Token ID262
Feature activation+2.71e-01

Pos logit contributions
burn+0.007
elle+0.007
ky+0.006
ami+0.006
andra+0.006
Neg logit contributions
and-0.007
 contain-0.007
 Follow-0.006
 Madd-0.006
 Stay-0.006
Token bar
Token ID2318
Feature activation+6.87e-01

Pos logit contributions
elle+0.011
burn+0.010
ky+0.010
ami+0.010
ah+0.010
Neg logit contributions
and-0.010
 contain-0.009
 Follow-0.009
 Madd-0.009
oles-0.009
Tokenber
Token ID527
Feature activation+1.26e-01

Pos logit contributions
burn+0.035
ky+0.033
elle+0.033
andra+0.032
ami+0.032
Neg logit contributions
and-0.019
 contain-0.016
 Follow-0.016
engu-0.015
 Stay-0.015
Token was
Token ID373
Feature activation+1.45e-01

Pos logit contributions
burn+0.005
ky+0.005
elle+0.005
aman+0.005
ami+0.005
Neg logit contributions
and-0.004
 Follow-0.004
 contain-0.004
 Stay-0.004
engu-0.003
Token known
Token ID1900
Feature activation+3.99e+00

Pos logit contributions
ky+0.007
ami+0.007
elle+0.007
aman+0.007
andra+0.006
Neg logit contributions
and-0.004
 contain-0.004
 Stay-0.003
 Madd-0.003
 of-0.003
Token as
Token ID355
Feature activation+1.13e+00

Pos logit contributions
elle+0.125
burn+0.122
ky+0.118
ami+0.117
andra+0.112
Neg logit contributions
and-0.184
 contain-0.173
 Follow-0.170
 Madd-0.160
 Stay-0.159
Token the
Token ID262
Feature activation+2.95e-01

Pos logit contributions
burn+0.049
elle+0.047
ky+0.046
ited+0.044
ami+0.044
Neg logit contributions
and-0.040
 contain-0.038
 Madd-0.036
 Follow-0.036
 tad-0.036
Token village
Token ID7404
Feature activation-5.45e-01

Pos logit contributions
elle+0.013
burn+0.013
ky+0.013
ami+0.013
andra+0.012
Neg logit contributions
and-0.009
 contain-0.009
 Follow-0.008
 Stay-0.008
 Madd-0.008
Token's
Token ID338
Feature activation+4.59e-02

Pos logit contributions
and+0.019
 contain+0.018
 Follow+0.017
 Stay+0.016
oles+0.015
Neg logit contributions
burn-0.024
elle-0.023
ky-0.022
ami-0.022
andra-0.022
Token best
Token ID1266
Feature activation+8.15e-01

Pos logit contributions
burn+0.002
elle+0.002
ky+0.002
ami+0.002
andra+0.002
Neg logit contributions
and-0.002
 contain-0.001
 Follow-0.001
 Stay-0.001
 Madd-0.001
Token too
Token ID1165
Feature activation+8.65e-02

Pos logit contributions
and+0.008
 contain+0.007
 Follow+0.007
 Stay+0.007
 Madd+0.006
Neg logit contributions
elle-0.011
ami-0.011
andra-0.010
burn-0.010
ky-0.010
Token close
Token ID1969
Feature activation-3.54e-01

Pos logit contributions
elle+0.003
burn+0.003
ky+0.003
ami+0.003
andra+0.003
Neg logit contributions
and-0.004
 contain-0.003
 Follow-0.003
 Stay-0.003
 Madd-0.003
Token.
Token ID13
Feature activation+3.28e-02

Pos logit contributions
and+0.011
 contain+0.010
 Follow+0.010
 Stay+0.009
 Madd+0.009
Neg logit contributions
elle-0.016
ami-0.016
burn-0.015
ky-0.015
ited-0.015
Token The
Token ID383
Feature activation-6.44e-01

Pos logit contributions
elle+0.001
burn+0.001
ky+0.001
ami+0.001
ine+0.001
Neg logit contributions
and-0.001
 contain-0.001
 Follow-0.001
oles-0.001
 Madd-0.001
Token ost
Token ID23619
Feature activation-1.03e+00

Pos logit contributions
and+0.024
 contain+0.021
 Follow+0.020
 Madd+0.019
unn+0.018
Neg logit contributions
elle-0.028
burn-0.027
ky-0.026
ep-0.026
ah-0.026
Tokenric
Token ID1173
Feature activation+3.98e+00

Pos logit contributions
and+0.043
oles+0.041
 contain+0.040
 Follow+0.039
engu+0.039
Neg logit contributions
elle-0.035
burn-0.034
ited-0.032
ky-0.032
 str-0.032
Tokenhes
Token ID956
Feature activation+9.98e-01

Pos logit contributions
burn+0.119
elle+0.117
ky+0.114
aman+0.108
ami+0.107
Neg logit contributions
and-0.197
 Follow-0.178
 contain-0.177
oles-0.170
 Stay-0.169
Token might
Token ID1244
Feature activation-7.78e-01

Pos logit contributions
elle+0.049
burn+0.048
ky+0.047
ami+0.047
andra+0.046
Neg logit contributions
and-0.028
 contain-0.025
 Follow-0.024
 Stay-0.022
 Madd-0.022
Token pe
Token ID613
Feature activation-8.50e-01

Pos logit contributions
and+0.024
 contain+0.023
 Follow+0.021
 Stay+0.020
 Madd+0.019
Neg logit contributions
elle-0.037
ami-0.036
ky-0.035
burn-0.035
andra-0.034
Tokenck
Token ID694
Feature activation+1.35e-01

Pos logit contributions
and+0.037
oles+0.035
 contain+0.034
 Follow+0.033
engu+0.032
Neg logit contributions
elle-0.028
burn-0.028
andra-0.028
ky-0.027
aman-0.027
Token you
Token ID345
Feature activation-3.77e-01

Pos logit contributions
elle+0.005
burn+0.005
ky+0.005
ami+0.005
ah+0.005
Neg logit contributions
and-0.005
 contain-0.005
 Follow-0.005
 Madd-0.005
 Stay-0.005
Token.
Token ID13
Feature activation+1.44e-01

Pos logit contributions
and+0.014
 contain+0.012
 Follow+0.012
 Madd+0.012
oles+0.012
Neg logit contributions
burn-0.011
andra-0.011
ited-0.011
elle-0.011
ami-0.011
Token This
Token ID770
Feature activation+6.46e-01

Pos logit contributions
elle+0.006
burn+0.006
ky+0.006
ami+0.006
ah+0.006
Neg logit contributions
and-0.005
 contain-0.004
 Follow-0.004
 Madd-0.004
 Stay-0.004
Token old
Token ID1468
Feature activation+2.45e-01

Pos logit contributions
burn+0.036
ited+0.031
ky+0.031
en+0.031
elle+0.030
Neg logit contributions
and-0.015
 Follow-0.015
 contain-0.014
 lid-0.013
engu-0.013
Token number
Token ID1271
Feature activation+1.66e+00

Pos logit contributions
burn+0.010
elle+0.010
ky+0.010
ami+0.010
andra+0.009
Neg logit contributions
and-0.008
 contain-0.008
 Follow-0.008
 Stay-0.007
 Madd-0.007
Token was
Token ID373
Feature activation+1.36e-01

Pos logit contributions
elle+0.077
burn+0.075
ky+0.074
ami+0.073
ah+0.071
Neg logit contributions
and-0.052
 contain-0.047
 Follow-0.046
 Madd-0.042
 Stay-0.042
Token known
Token ID1900
Feature activation+3.97e+00

Pos logit contributions
burn+0.006
elle+0.006
ited+0.006
ky+0.005
ami+0.005
Neg logit contributions
and-0.005
 contain-0.004
 Follow-0.004
 Stay-0.004
 Madd-0.004
Token far
Token ID1290
Feature activation+1.36e-02

Pos logit contributions
elle+0.149
burn+0.145
ky+0.142
ami+0.141
ah+0.135
Neg logit contributions
and-0.158
 contain-0.147
 Follow-0.144
 Madd-0.134
 Stay-0.134
Token and
Token ID290
Feature activation-8.27e-02

Pos logit contributions
burn+0.001
andra+0.001
ami+0.001
aman+0.001
Kelly+0.001
Neg logit contributions
 contain-0.000
and-0.000
 Follow-0.000
 Stay-0.000
 plunge-0.000
Token wide
Token ID3094
Feature activation-6.23e-02

Pos logit contributions
and+0.003
 contain+0.003
 Follow+0.003
 Madd+0.003
oles+0.003
Neg logit contributions
elle-0.003
burn-0.003
ky-0.003
ami-0.003
ep-0.003
Token for
Token ID329
Feature activation-7.44e-01

Pos logit contributions
and+0.002
 contain+0.002
 Follow+0.002
 Stay+0.002
engu+0.002
Neg logit contributions
burn-0.003
elle-0.002
ited-0.002
andra-0.002
ami-0.002
Token his
Token ID465
Feature activation+7.95e-01

Pos logit contributions
and+0.025
 contain+0.023
 Follow+0.021
 Stay+0.020
 Madd+0.020
Neg logit contributions
burn-0.034
elle-0.033
ky-0.032
ited-0.031
ami-0.031
Token on
Token ID319
Feature activation+5.10e-01

Pos logit contributions
and+0.029
 Follow+0.027
 contain+0.026
 Stay+0.025
oles+0.025
Neg logit contributions
ami-0.040
andra-0.039
ky-0.039
burn-0.039
elle-0.038
Token,
Token ID11
Feature activation-2.37e-02

Pos logit contributions
burn+0.025
ami+0.024
elle+0.023
ky+0.023
ited+0.023
Neg logit contributions
and-0.014
 contain-0.013
 Madd-0.013
 Follow-0.013
oles-0.013
Token the
Token ID262
Feature activation+3.08e-01

Pos logit contributions
and+0.001
 Madd+0.001
 contain+0.001
 Follow+0.001
 tad+0.001
Neg logit contributions
burn-0.001
ky-0.001
elle-0.001
ami-0.001
ited-0.001
Token battery
Token ID6555
Feature activation+1.75e+00

Pos logit contributions
elle+0.011
burn+0.010
ky+0.010
ami+0.010
ah+0.009
Neg logit contributions
and-0.014
 contain-0.013
 Follow-0.012
 Madd-0.011
oles-0.011
Token was
Token ID373
Feature activation+1.40e-01

Pos logit contributions
elle+0.074
burn+0.074
ky+0.072
ami+0.071
aman+0.069
Neg logit contributions
and-0.060
 contain-0.054
 Follow-0.053
 Madd-0.050
 Stay-0.050
Token known
Token ID1900
Feature activation+3.97e+00

Pos logit contributions
elle+0.006
ky+0.006
ami+0.006
burn+0.006
aman+0.006
Neg logit contributions
and-0.004
 Madd-0.004
 contain-0.004
 tum-0.003
 Stay-0.003
Token as
Token ID355
Feature activation+9.77e-01

Pos logit contributions
elle+0.128
burn+0.125
ky+0.120
ami+0.120
andra+0.114
Neg logit contributions
and-0.181
 contain-0.170
 Follow-0.166
 Madd-0.157
 Stay-0.156
Token the
Token ID262
Feature activation+3.37e-01

Pos logit contributions
burn+0.043
elle+0.042
ky+0.039
ited+0.039
ah+0.039
Neg logit contributions
and-0.035
 contain-0.033
 Madd-0.032
 tad-0.032
 Follow-0.031
Token brilliant
Token ID10457
Feature activation+5.37e-01

Pos logit contributions
elle+0.014
burn+0.014
ami+0.013
ky+0.013
andra+0.013
Neg logit contributions
and-0.012
 contain-0.011
 Follow-0.011
 Stay-0.010
 Madd-0.010
Token battery
Token ID6555
Feature activation+1.71e+00

Pos logit contributions
andra+0.023
aman+0.023
burn+0.023
ami+0.022
ited+0.022
Neg logit contributions
and-0.016
 Madd-0.015
 contain-0.015
 tad-0.015
 Follow-0.015
Token!
Token ID0
Feature activation-3.59e-01

Pos logit contributions
burn+0.074
elle+0.073
ami+0.070
ky+0.070
aman+0.069
Neg logit contributions
and-0.058
 contain-0.054
 Follow-0.052
 Madd-0.048
 Stay-0.048
Token on
Token ID319
Feature activation+5.41e-01

Pos logit contributions
and+0.024
 Follow+0.023
 contain+0.022
 Stay+0.022
 Go+0.019
Neg logit contributions
ami-0.040
andra-0.039
ky-0.039
burn-0.038
aman-0.037
Token,
Token ID11
Feature activation+1.57e-01

Pos logit contributions
burn+0.023
elle+0.023
ami+0.022
ky+0.022
ited+0.021
Neg logit contributions
and-0.018
 contain-0.017
 Follow-0.017
 Madd-0.016
oles-0.016
Token the
Token ID262
Feature activation+2.93e-01

Pos logit contributions
burn+0.006
elle+0.006
ky+0.006
ami+0.006
ited+0.006
Neg logit contributions
and-0.006
 contain-0.006
 Follow-0.005
 Madd-0.005
 Stay-0.005
Token man
Token ID582
Feature activation+3.51e-01

Pos logit contributions
burn+0.016
elle+0.015
en+0.015
ine+0.015
ami+0.015
Neg logit contributions
 contain-0.006
 Follow-0.006
 tad-0.006
and-0.006
 Stay-0.005
Token was
Token ID373
Feature activation+1.03e-01

Pos logit contributions
burn+0.015
elle+0.014
ky+0.014
ami+0.014
aman+0.014
Neg logit contributions
and-0.012
 Follow-0.011
 contain-0.010
 Stay-0.010
 tad-0.009
Token known
Token ID1900
Feature activation+3.96e+00

Pos logit contributions
ami+0.005
ky+0.005
aman+0.005
elle+0.005
andra+0.005
Neg logit contributions
and-0.003
 contain-0.003
 Stay-0.002
 Madd-0.002
 Follow-0.002
Token as
Token ID355
Feature activation+1.07e+00

Pos logit contributions
elle+0.120
burn+0.116
ky+0.112
ami+0.112
andra+0.106
Neg logit contributions
and-0.188
 contain-0.176
 Follow-0.173
 Madd-0.164
 Stay-0.163
Token the
Token ID262
Feature activation+3.08e-01

Pos logit contributions
burn+0.045
elle+0.044
ky+0.042
ami+0.041
ited+0.041
Neg logit contributions
and-0.039
 contain-0.038
 Follow-0.036
 Madd-0.036
 tad-0.035
Token strongest
Token ID12841
Feature activation+1.43e+00

Pos logit contributions
burn+0.014
elle+0.014
ami+0.013
en+0.013
aman+0.013
Neg logit contributions
and-0.009
 contain-0.009
 Follow-0.009
 Stay-0.008
 tad-0.008
Token man
Token ID582
Feature activation+3.62e-01

Pos logit contributions
burn+0.066
aman+0.062
ami+0.061
andra+0.061
ky+0.058
Neg logit contributions
and-0.043
 contain-0.041
 Follow-0.039
 tad-0.037
 Stay-0.036
Token in
Token ID287
Feature activation-1.48e+00

Pos logit contributions
burn+0.016
elle+0.015
aman+0.015
ami+0.015
ky+0.015
Neg logit contributions
and-0.012
 contain-0.011
 Follow-0.011
 Stay-0.010
oles-0.009
Token and
Token ID290
Feature activation+4.14e-02

Pos logit contributions
elle+0.056
ky+0.053
andra+0.053
burn+0.053
ited+0.052
Neg logit contributions
and-0.049
 contain-0.045
 Madd-0.043
 Stay-0.043
 Follow-0.042
Token looked
Token ID3114
Feature activation+9.10e-01

Pos logit contributions
ami+0.002
elle+0.002
burn+0.002
opl+0.002
isdom+0.002
Neg logit contributions
 contain-0.001
 Stay-0.001
 Follow-0.001
 Madd-0.001
 tad-0.001
Token at
Token ID379
Feature activation+7.83e-01

Pos logit contributions
elle+0.029
burn+0.028
ky+0.027
ami+0.027
ah+0.026
Neg logit contributions
and-0.042
 contain-0.039
 Follow-0.038
 Madd-0.036
 Stay-0.036
Token the
Token ID262
Feature activation+1.04e-01

Pos logit contributions
elle+0.031
burn+0.031
ky+0.030
ami+0.030
andra+0.029
Neg logit contributions
and-0.028
 contain-0.027
 Madd-0.025
 Follow-0.025
 Stay-0.025
Token ost
Token ID23619
Feature activation-1.14e+00

Pos logit contributions
elle+0.005
burn+0.005
ky+0.005
ami+0.005
andra+0.005
Neg logit contributions
and-0.003
 contain-0.003
 Follow-0.003
 tad-0.003
 Stay-0.003
Tokenric
Token ID1173
Feature activation+3.96e+00

Pos logit contributions
and+0.046
oles+0.044
engu+0.041
 contain+0.041
unn+0.040
Neg logit contributions
elle-0.041
burn-0.039
 str-0.038
ited-0.038
andra-0.037
Tokenhes
Token ID956
Feature activation+1.08e+00

Pos logit contributions
burn+0.127
elle+0.123
ky+0.121
aman+0.116
ited+0.114
Neg logit contributions
and-0.189
 Follow-0.168
oles-0.167
 contain-0.165
adders-0.163
Token.
Token ID13
Feature activation+2.68e-01

Pos logit contributions
elle+0.039
burn+0.037
ami+0.037
ky+0.037
andra+0.036
Neg logit contributions
and-0.044
 contain-0.042
 Follow-0.039
 Stay-0.038
oles-0.038
Token They
Token ID1119
Feature activation+6.04e-01

Pos logit contributions
elle+0.010
ky+0.010
ami+0.010
ine+0.010
andra+0.010
Neg logit contributions
and-0.010
 contain-0.010
 Follow-0.009
unn-0.008
 plunge-0.008
Token saw
Token ID2497
Feature activation-1.87e-01

Pos logit contributions
elle+0.023
burn+0.023
ky+0.022
ami+0.022
ah+0.021
Neg logit contributions
and-0.024
 contain-0.022
 Follow-0.022
 Madd-0.020
 Stay-0.020
Token one
Token ID530
Feature activation+3.55e-01

Pos logit contributions
and+0.008
 contain+0.008
 Follow+0.008
 Stay+0.007
 Madd+0.007
Neg logit contributions
elle-0.006
burn-0.006
ky-0.006
ami-0.006
ah-0.005
Token do
Token ID466
Feature activation-6.03e-01

Pos logit contributions
elle+0.023
burn+0.022
ky+0.022
ami+0.022
andra+0.021
Neg logit contributions
and-0.019
 contain-0.018
 Follow-0.017
 Stay-0.016
 Madd-0.016
Token that
Token ID326
Feature activation-8.39e-02

Pos logit contributions
and+0.017
 contain+0.015
 Stay+0.015
 Go+0.014
 Madd+0.013
Neg logit contributions
aman-0.030
ited-0.030
ami-0.029
ky-0.029
andra-0.029
Token again
Token ID757
Feature activation-1.51e+00

Pos logit contributions
and+0.003
 contain+0.003
 Follow+0.003
 Stay+0.003
 Madd+0.003
Neg logit contributions
elle-0.003
ami-0.003
andra-0.003
ky-0.003
burn-0.003
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
and+0.054
 Follow+0.047
 contain+0.046
 Stay+0.046
oles+0.043
Neg logit contributions
elle-0.063
burn-0.062
andra-0.061
ky-0.061
ami-0.061
Token I
Token ID314
Feature activation-2.39e-01

Pos logit contributions
elle+0.005
burn+0.005
ky+0.005
ami+0.005
ah+0.005
Neg logit contributions
and-0.003
 contain-0.003
oles-0.003
 tad-0.003
 Madd-0.003
Token promise
Token ID6991
Feature activation-3.83e+00

Pos logit contributions
and+0.008
 contain+0.007
 Follow+0.007
 Stay+0.007
 Madd+0.007
Neg logit contributions
elle-0.011
burn-0.010
ami-0.010
ky-0.010
andra-0.010
Token."
Token ID526
Feature activation+1.09e-01

Pos logit contributions
and+0.153
 contain+0.145
 Follow+0.138
 tad+0.134
oles+0.130
Neg logit contributions
burn-0.145
elle-0.142
ky-0.136
en-0.131
ami-0.131
Token
Token ID198
Feature activation-1.67e-02

Pos logit contributions
elle+0.004
burn+0.004
ami+0.004
ky+0.004
andra+0.004
Neg logit contributions
 contain-0.004
and-0.004
 Follow-0.004
 plunge-0.004
oles-0.003
Token
Token ID198
Feature activation+2.37e-02

Pos logit contributions
 contain+0.001
 Follow+0.001
 plunge+0.001
 tad+0.001
and+0.001
Neg logit contributions
ep-0.001
ami-0.001
burn-0.000
elle-0.000
ine-0.000
TokenMom
Token ID29252
Feature activation-1.39e+00

Pos logit contributions
elle+0.001
burn+0.001
ami+0.001
ky+0.001
ep+0.001
Neg logit contributions
and-0.001
 contain-0.001
 Follow-0.001
 Madd-0.001
 Stay-0.001
Token and
Token ID290
Feature activation-1.01e-01

Pos logit contributions
and+0.054
 contain+0.050
 Follow+0.049
 Stay+0.045
 Madd+0.045
Neg logit contributions
elle-0.054
burn-0.052
ky-0.050
ami-0.050
ah-0.048
Token.
Token ID13
Feature activation+1.30e-01

Pos logit contributions
and+0.085
 contain+0.077
 Follow+0.074
 Stay+0.071
 Madd+0.068
Neg logit contributions
elle-0.076
burn-0.075
ami-0.072
ky-0.071
andra-0.070
Token And
Token ID843
Feature activation+9.71e-02

Pos logit contributions
elle+0.006
burn+0.006
ky+0.006
ami+0.006
ah+0.006
Neg logit contributions
and-0.004
 Follow-0.004
 contain-0.004
 Stay-0.003
 Madd-0.003
Token you
Token ID345
Feature activation-5.49e-01

Pos logit contributions
elle+0.004
burn+0.004
ky+0.004
ami+0.004
ine+0.004
Neg logit contributions
and-0.003
 contain-0.003
 Follow-0.003
 Madd-0.003
oles-0.003
Token have
Token ID423
Feature activation+4.56e-01

Pos logit contributions
and+0.022
 contain+0.019
 Follow+0.019
 Madd+0.018
oles+0.018
Neg logit contributions
elle-0.020
ine-0.020
burn-0.020
ky-0.020
ah-0.020
Token to
Token ID284
Feature activation+5.58e-01

Pos logit contributions
elle+0.020
burn+0.019
ky+0.019
ami+0.019
ah+0.018
Neg logit contributions
and-0.016
 contain-0.014
 Follow-0.014
 Madd-0.013
 Stay-0.013
Token promise
Token ID6991
Feature activation-3.78e+00

Pos logit contributions
elle+0.029
burn+0.027
ami+0.027
ky+0.027
andra+0.026
Neg logit contributions
 contain-0.015
and-0.013
 Stay-0.013
 Follow-0.012
 Madd-0.011
Token not
Token ID407
Feature activation+1.32e+00

Pos logit contributions
and+0.146
 contain+0.135
 Follow+0.132
 Madd+0.123
 Stay+0.123
Neg logit contributions
elle-0.147
burn-0.143
ky-0.140
ami-0.139
ah-0.134
Token to
Token ID284
Feature activation+5.52e-01

Pos logit contributions
elle+0.061
ky+0.058
burn+0.058
ami+0.058
andra+0.057
Neg logit contributions
and-0.043
 contain-0.039
 Follow-0.037
 Stay-0.035
 Madd-0.034
Token eat
Token ID4483
Feature activation-1.65e+00

Pos logit contributions
elle+0.029
ami+0.027
andra+0.027
burn+0.027
ky+0.026
Neg logit contributions
 contain-0.014
and-0.014
 Stay-0.012
 hide-0.011
 Follow-0.011
Token my
Token ID616
Feature activation-1.09e-01

Pos logit contributions
and+0.069
 contain+0.064
 Follow+0.062
 Stay+0.058
 Madd+0.058
Neg logit contributions
elle-0.059
burn-0.058
ky-0.056
ami-0.056
ah-0.054
Token cake
Token ID12187
Feature activation-3.68e-01

Pos logit contributions
and+0.004
 contain+0.004
 Follow+0.004
 Madd+0.004
 Stay+0.003
Neg logit contributions
elle-0.004
burn-0.004
ami-0.004
ky-0.004
ep-0.004
Token belongings
Token ID30985
Feature activation+4.47e-01

Pos logit contributions
and+0.010
 contain+0.010
 Follow+0.009
 Stay+0.009
 Madd+0.009
Neg logit contributions
elle-0.012
burn-0.012
ky-0.012
ami-0.012
ah-0.011
Token,
Token ID11
Feature activation-2.56e-01

Pos logit contributions
elle+0.015
burn+0.015
ami+0.014
ky+0.014
andra+0.014
Neg logit contributions
and-0.020
 contain-0.018
 Follow-0.018
 Stay-0.017
 Madd-0.017
Token and
Token ID290
Feature activation-3.21e-01

Pos logit contributions
and+0.010
 contain+0.009
 Follow+0.009
 Madd+0.009
 Stay+0.009
Neg logit contributions
elle-0.010
burn-0.009
ky-0.009
ami-0.009
ine-0.009
Token ask
Token ID1265
Feature activation+6.71e-02

Pos logit contributions
and+0.012
 contain+0.012
 Follow+0.011
 Stay+0.011
 Madd+0.010
Neg logit contributions
elle-0.012
burn-0.012
ky-0.012
ami-0.012
ah-0.011
Token for
Token ID329
Feature activation-1.25e+00

Pos logit contributions
elle+0.003
burn+0.003
ky+0.002
ami+0.002
ah+0.002
Neg logit contributions
and-0.003
 contain-0.002
 Follow-0.002
 Madd-0.002
 Stay-0.002
Token permission
Token ID7170
Feature activation-3.78e+00

Pos logit contributions
and+0.038
 contain+0.035
 Follow+0.034
 Stay+0.031
 Madd+0.030
Neg logit contributions
elle-0.059
burn-0.058
ky-0.057
ami-0.056
ah-0.056
Token before
Token ID878
Feature activation-1.72e+00

Pos logit contributions
and+0.155
 contain+0.147
 Follow+0.144
 Madd+0.137
 tad+0.135
Neg logit contributions
elle-0.133
burn-0.128
ky-0.127
ami-0.125
ine-0.121
Token using
Token ID1262
Feature activation-6.76e-01

Pos logit contributions
and+0.067
 contain+0.063
 Follow+0.061
 Stay+0.060
oles+0.058
Neg logit contributions
elle-0.066
ky-0.063
burn-0.063
ami-0.062
ah-0.062
Token them
Token ID606
Feature activation-1.41e+00

Pos logit contributions
and+0.025
 contain+0.023
 Follow+0.022
 Stay+0.022
 Madd+0.022
Neg logit contributions
elle-0.027
burn-0.027
ami-0.027
ited-0.026
ah-0.025
Token.
Token ID13
Feature activation-2.46e-01

Pos logit contributions
and+0.056
 contain+0.052
 Follow+0.051
 Stay+0.048
 Madd+0.047
Neg logit contributions
elle-0.053
burn-0.052
ami-0.050
ky-0.050
ah-0.048
Token<|endoftext|>
Token ID50256
Feature activation+5.34e-01

Pos logit contributions
and+0.008
 contain+0.008
 Follow+0.007
 Madd+0.007
 Stay+0.007
Neg logit contributions
elle-0.011
burn-0.011
ami-0.010
ky-0.010
andra-0.010
Token.
Token ID13
Feature activation+1.64e-01

Pos logit contributions
 Follow+0.027
 Madd+0.027
and+0.026
 contain+0.026
 tad+0.026
Neg logit contributions
burn-0.008
ky-0.008
ine-0.008
elle-0.008
 dreaming-0.008
Token But
Token ID887
Feature activation-4.40e-01

Pos logit contributions
elle+0.008
burn+0.008
ky+0.008
ami+0.008
ah+0.008
Neg logit contributions
and-0.005
 Follow-0.004
 contain-0.004
 Stay-0.004
 Madd-0.004
Token you
Token ID345
Feature activation-5.06e-01

Pos logit contributions
and+0.016
 contain+0.014
 Follow+0.014
oles+0.013
engu+0.013
Neg logit contributions
elle-0.018
burn-0.018
ami-0.017
ky-0.017
 str-0.017
Token have
Token ID423
Feature activation+4.53e-01

Pos logit contributions
and+0.023
 Follow+0.019
 contain+0.019
 Madd+0.019
oles+0.019
Neg logit contributions
 str-0.016
ine-0.016
burn-0.016
ah-0.016
ami-0.015
Token to
Token ID284
Feature activation+6.06e-01

Pos logit contributions
elle+0.021
burn+0.020
ky+0.020
ami+0.020
ah+0.019
Neg logit contributions
and-0.015
 contain-0.013
 Follow-0.013
 Stay-0.012
 Madd-0.012
Token promise
Token ID6991
Feature activation-3.75e+00

Pos logit contributions
elle+0.033
ami+0.032
ky+0.031
ited+0.030
andra+0.030
Neg logit contributions
 contain-0.015
 Stay-0.014
and-0.014
 Follow-0.013
 Madd-0.011
Token me
Token ID502
Feature activation-1.31e+00

Pos logit contributions
and+0.138
 contain+0.127
 Follow+0.124
 Madd+0.115
 Stay+0.115
Neg logit contributions
elle-0.153
burn-0.149
ky-0.147
ami-0.146
ah-0.140
Token something
Token ID1223
Feature activation+7.23e-01

Pos logit contributions
and+0.057
 contain+0.053
 Follow+0.052
 Stay+0.049
 Madd+0.048
Neg logit contributions
elle-0.046
burn-0.044
ky-0.043
ami-0.043
andra-0.041
Token.
Token ID13
Feature activation+1.90e-01

Pos logit contributions
elle+0.023
burn+0.022
ky+0.022
ami+0.022
ine+0.021
Neg logit contributions
and-0.031
 contain-0.030
 Follow-0.029
 Madd-0.028
 tad-0.028
Token You
Token ID921
Feature activation+2.60e-01

Pos logit contributions
elle+0.009
burn+0.009
ky+0.008
ami+0.008
ine+0.008
Neg logit contributions
and-0.006
 contain-0.005
 Follow-0.005
 Madd-0.005
oles-0.005
Token have
Token ID423
Feature activation+4.74e-01

Pos logit contributions
 str+0.009
burn+0.009
elle+0.008
ep+0.008
 presses+0.008
Neg logit contributions
and-0.011
 Madd-0.009
oles-0.009
 contain-0.009
 Follow-0.009
Token.
Token ID13
Feature activation+2.69e-01

Pos logit contributions
and+0.059
 contain+0.056
 Follow+0.056
 Madd+0.053
 Stay+0.052
Neg logit contributions
elle-0.037
burn-0.035
ky-0.035
ami-0.034
ine-0.032
Token They
Token ID1119
Feature activation+2.25e-01

Pos logit contributions
elle+0.012
burn+0.012
ami+0.012
ky+0.012
ine+0.011
Neg logit contributions
and-0.008
 contain-0.008
 Follow-0.007
 Madd-0.007
 tad-0.007
Token never
Token ID1239
Feature activation+4.42e-01

Pos logit contributions
elle+0.008
burn+0.008
ky+0.008
ami+0.008
ah+0.007
Neg logit contributions
and-0.009
 contain-0.009
 Follow-0.008
 Stay-0.008
 Madd-0.008
Token broke
Token ID6265
Feature activation+2.14e-01

Pos logit contributions
elle+0.016
burn+0.015
ami+0.015
ky+0.015
aman+0.014
Neg logit contributions
and-0.018
 contain-0.017
 Follow-0.017
 Stay-0.016
 Madd-0.016
Token their
Token ID511
Feature activation-8.74e-02

Pos logit contributions
ited+0.009
andra+0.009
elle+0.008
ami+0.008
ah+0.008
Neg logit contributions
and-0.008
 Madd-0.007
 Stay-0.006
oles-0.006
 contain-0.006
Token promise
Token ID6991
Feature activation-3.73e+00

Pos logit contributions
and+0.003
 contain+0.003
 Follow+0.003
oles+0.003
 Madd+0.003
Neg logit contributions
burn-0.004
elle-0.003
aman-0.003
ky-0.003
ine-0.003
Token again
Token ID757
Feature activation-1.51e+00

Pos logit contributions
and+0.135
 contain+0.121
 Stay+0.113
 Follow+0.110
 Madd+0.104
Neg logit contributions
elle-0.161
ky-0.155
ami-0.154
ited-0.153
burn-0.147
Token.
Token ID13
Feature activation+2.79e-01

Pos logit contributions
and+0.059
 contain+0.050
 Stay+0.047
 Follow+0.047
oles+0.046
Neg logit contributions
elle-0.062
burn-0.060
ky-0.059
ited-0.058
ah-0.057
Token<|endoftext|>
Token ID50256
Feature activation+7.09e-01

Pos logit contributions
elle+0.012
ami+0.011
burn+0.011
ky+0.011
ine+0.011
Neg logit contributions
and-0.010
 contain-0.009
 Follow-0.008
 plunge-0.008
 Madd-0.008
TokenTom
Token ID13787
Feature activation+1.96e-02

Pos logit contributions
elle+0.018
burn+0.017
ami+0.017
ky+0.017
ah+0.016
Neg logit contributions
 contain-0.036
and-0.035
 Follow-0.033
 Madd-0.032
engu-0.031
Token and
Token ID290
Feature activation-4.05e-02

Pos logit contributions
burn+0.001
elle+0.001
ami+0.001
aman+0.001
ky+0.001
Neg logit contributions
and-0.001
 Stay-0.001
 Follow-0.001
suits-0.000
 contain-0.000
Token.
Token ID13
Feature activation+1.45e-01

Pos logit contributions
elle+0.028
ami+0.027
andra+0.027
burn+0.027
ky+0.026
Neg logit contributions
and-0.026
 contain-0.023
 Follow-0.023
 Stay-0.022
 Madd-0.022
Token And
Token ID843
Feature activation+9.63e-02

Pos logit contributions
elle+0.006
burn+0.006
ky+0.006
ami+0.006
ine+0.006
Neg logit contributions
and-0.005
 contain-0.004
oles-0.004
 Madd-0.004
 Follow-0.004
Token you
Token ID345
Feature activation-5.23e-01

Pos logit contributions
elle+0.004
burn+0.004
ky+0.004
ine+0.004
ami+0.004
Neg logit contributions
and-0.003
 contain-0.003
 Follow-0.003
oles-0.003
 Madd-0.003
Token have
Token ID423
Feature activation+4.58e-01

Pos logit contributions
and+0.019
 contain+0.017
 Madd+0.016
 Follow+0.016
oles+0.016
Neg logit contributions
burn-0.021
elle-0.021
ky-0.020
ah-0.020
ine-0.020
Token to
Token ID284
Feature activation+6.23e-01

Pos logit contributions
elle+0.020
burn+0.020
ky+0.019
ami+0.019
ah+0.018
Neg logit contributions
and-0.016
 contain-0.014
 Follow-0.014
 Stay-0.013
 Madd-0.013
Token promise
Token ID6991
Feature activation-3.72e+00

Pos logit contributions
elle+0.033
ami+0.030
burn+0.030
ky+0.030
ited+0.029
Neg logit contributions
 contain-0.016
 Stay-0.014
and-0.014
 Follow-0.014
 bury-0.013
Token me
Token ID502
Feature activation-1.26e+00

Pos logit contributions
and+0.146
 contain+0.135
 Follow+0.132
 Stay+0.124
 Madd+0.124
Neg logit contributions
elle-0.144
burn-0.139
ky-0.136
ami-0.136
ah-0.131
Token to
Token ID284
Feature activation+6.25e-01

Pos logit contributions
and+0.049
 contain+0.045
 Follow+0.044
 Stay+0.042
 Madd+0.040
Neg logit contributions
elle-0.050
ami-0.048
ky-0.047
burn-0.047
ited-0.047
Token be
Token ID307
Feature activation+6.39e-01

Pos logit contributions
elle+0.033
ami+0.031
burn+0.030
ky+0.030
ited+0.030
Neg logit contributions
 contain-0.016
and-0.015
 Stay-0.014
 Follow-0.014
 Madd-0.013
Token careful
Token ID8161
Feature activation-5.47e-01

Pos logit contributions
elle+0.035
ami+0.034
aman+0.034
ah+0.033
burn+0.033
Neg logit contributions
and-0.016
 Follow-0.016
 Stay-0.014
 contain-0.013
 welcome-0.012
Token and
Token ID290
Feature activation-1.20e-01

Pos logit contributions
 contain+0.025
and+0.025
 Follow+0.024
 tad+0.024
engu+0.024
Neg logit contributions
burn-0.016
elle-0.016
ky-0.015
en-0.015
ami-0.014
Token will
Token ID481
Feature activation-7.42e-01

Pos logit contributions
and+0.001
 contain+0.001
 Follow+0.001
 Stay+0.001
 Madd+0.001
Neg logit contributions
elle-0.002
ami-0.002
ky-0.002
burn-0.002
andra-0.002
Token be
Token ID307
Feature activation+6.32e-01

Pos logit contributions
 contain+0.023
and+0.021
 Stay+0.020
 Follow+0.019
 Madd+0.018
Neg logit contributions
ami-0.036
elle-0.036
ky-0.034
ited-0.034
ah-0.033
Token careful
Token ID8161
Feature activation-5.07e-01

Pos logit contributions
ami+0.035
ah+0.035
elle+0.034
aman+0.034
andra+0.033
Neg logit contributions
and-0.013
 welcome-0.011
 contain-0.011
 Stay-0.011
 Follow-0.011
Token.
Token ID13
Feature activation+1.77e-01

Pos logit contributions
and+0.022
 contain+0.021
 Follow+0.021
 tad+0.020
engu+0.020
Neg logit contributions
burn-0.016
elle-0.016
ky-0.016
ami-0.015
ine-0.015
Token I
Token ID314
Feature activation-8.13e-02

Pos logit contributions
elle+0.008
burn+0.008
ky+0.008
ine+0.008
ami+0.008
Neg logit contributions
and-0.006
 contain-0.005
oles-0.005
 tad-0.004
 Follow-0.004
Token promise
Token ID6991
Feature activation-3.72e+00

Pos logit contributions
and+0.003
 contain+0.002
 Follow+0.002
 Stay+0.002
 Madd+0.002
Neg logit contributions
elle-0.004
ami-0.004
burn-0.004
ky-0.004
andra-0.004
Token."
Token ID526
Feature activation+7.74e-01

Pos logit contributions
and+0.134
 contain+0.123
 Follow+0.120
 Madd+0.111
 tad+0.110
Neg logit contributions
elle-0.154
burn-0.152
ky-0.148
ami-0.146
ine-0.141
Token Lily
Token ID20037
Feature activation-4.62e-01

Pos logit contributions
elle+0.030
burn+0.029
ky+0.029
ami+0.028
andra+0.027
Neg logit contributions
and-0.030
 contain-0.028
 Follow-0.027
 Madd-0.025
 Stay-0.025
Token said
Token ID531
Feature activation-3.88e-01

Pos logit contributions
and+0.019
 contain+0.017
 Follow+0.017
 Stay+0.016
 Madd+0.016
Neg logit contributions
elle-0.017
burn-0.017
ky-0.016
ami-0.016
ah-0.016
Token.
Token ID13
Feature activation+2.87e-01

Pos logit contributions
and+0.018
 contain+0.017
 Follow+0.017
 Madd+0.016
 Stay+0.016
Neg logit contributions
elle-0.012
burn-0.012
ky-0.011
ami-0.011
ah-0.011
Token
Token ID198
Feature activation+3.41e-02

Pos logit contributions
elle+0.011
burn+0.011
ky+0.011
ami+0.011
andra+0.010
Neg logit contributions
and-0.010
 contain-0.010
 Follow-0.010
 tad-0.009
engu-0.009
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
and+0.015
 contain+0.014
 Follow+0.014
 Madd+0.013
 Stay+0.013
Neg logit contributions
elle-0.014
burn-0.013
ky-0.013
ami-0.013
andra-0.012
Token But
Token ID887
Feature activation-4.70e-01

Pos logit contributions
elle+0.007
burn+0.007
ami+0.007
ky+0.007
ine+0.006
Neg logit contributions
and-0.005
 contain-0.004
 Follow-0.004
oles-0.004
 Madd-0.004
Token only
Token ID691
Feature activation+5.68e-01

Pos logit contributions
and+0.017
 contain+0.016
 Follow+0.015
 Madd+0.014
oles+0.014
Neg logit contributions
elle-0.019
burn-0.019
ky-0.018
ami-0.018
ah-0.018
Token if
Token ID611
Feature activation-1.59e+00

Pos logit contributions
elle+0.023
burn+0.023
ky+0.022
ami+0.022
andra+0.022
Neg logit contributions
and-0.021
 contain-0.019
 Follow-0.019
 Stay-0.017
 Madd-0.017
Token you
Token ID345
Feature activation-4.78e-01

Pos logit contributions
and+0.054
 contain+0.049
 Follow+0.048
 Madd+0.042
 tad+0.042
Neg logit contributions
elle-0.070
burn-0.068
ky-0.066
ami-0.065
ine-0.064
Token promise
Token ID6991
Feature activation-3.72e+00

Pos logit contributions
and+0.016
 contain+0.015
 Follow+0.014
 Madd+0.013
 Stay+0.013
Neg logit contributions
elle-0.021
burn-0.020
ky-0.020
ami-0.020
ah-0.019
Token not
Token ID407
Feature activation+1.40e+00

Pos logit contributions
and+0.110
 contain+0.099
 Follow+0.097
 Stay+0.095
 Madd+0.089
Neg logit contributions
elle-0.179
ami-0.171
ky-0.169
burn-0.169
ited-0.167
Token to
Token ID284
Feature activation+5.86e-01

Pos logit contributions
elle+0.064
burn+0.061
ky+0.061
ami+0.061
andra+0.059
Neg logit contributions
and-0.046
 contain-0.042
 Follow-0.040
 Stay-0.038
 Madd-0.037
Token chew
Token ID34722
Feature activation+3.86e-01

Pos logit contributions
elle+0.031
ami+0.028
andra+0.028
ky+0.028
ited+0.027
Neg logit contributions
 contain-0.015
and-0.014
 Stay-0.014
 Madd-0.011
 Follow-0.011
Token on
Token ID319
Feature activation+3.56e-01

Pos logit contributions
elle+0.014
burn+0.014
ami+0.014
ky+0.013
andra+0.013
Neg logit contributions
and-0.015
 contain-0.014
 Follow-0.014
 Madd-0.013
 Stay-0.013
Token them
Token ID606
Feature activation-1.43e+00

Pos logit contributions
burn+0.014
ited+0.014
andra+0.014
elle+0.014
ami+0.014
Neg logit contributions
and-0.012
 Madd-0.011
oles-0.010
 Stay-0.010
 nuts-0.010
Token something
Token ID1223
Feature activation+7.23e-01

Pos logit contributions
and+0.057
 contain+0.053
 Follow+0.052
 Stay+0.049
 Madd+0.048
Neg logit contributions
elle-0.046
burn-0.044
ky-0.043
ami-0.043
andra-0.041
Token.
Token ID13
Feature activation+1.90e-01

Pos logit contributions
elle+0.023
burn+0.022
ky+0.022
ami+0.022
ine+0.021
Neg logit contributions
and-0.031
 contain-0.030
 Follow-0.029
 Madd-0.028
 tad-0.028
Token You
Token ID921
Feature activation+2.60e-01

Pos logit contributions
elle+0.009
burn+0.009
ky+0.008
ami+0.008
ine+0.008
Neg logit contributions
and-0.006
 contain-0.005
 Follow-0.005
 Madd-0.005
oles-0.005
Token have
Token ID423
Feature activation+4.74e-01

Pos logit contributions
 str+0.009
burn+0.009
elle+0.008
ep+0.008
 presses+0.008
Neg logit contributions
and-0.011
 Madd-0.009
oles-0.009
 contain-0.009
 Follow-0.009
Token to
Token ID284
Feature activation+6.26e-01

Pos logit contributions
elle+0.022
burn+0.021
ky+0.021
ami+0.021
ah+0.020
Neg logit contributions
and-0.015
 contain-0.014
 Follow-0.013
 Stay-0.013
 Madd-0.012
Token promise
Token ID6991
Feature activation-3.71e+00

Pos logit contributions
elle+0.032
ami+0.031
ky+0.031
burn+0.029
ited+0.029
Neg logit contributions
 contain-0.017
 Stay-0.015
and-0.015
 Follow-0.015
 Madd-0.013
Token me
Token ID502
Feature activation-1.28e+00

Pos logit contributions
and+0.147
 contain+0.137
 Follow+0.134
 Madd+0.125
 Stay+0.123
Neg logit contributions
elle-0.140
burn-0.138
ky-0.135
ami-0.133
ine-0.128
Token that
Token ID326
Feature activation+1.95e-04

Pos logit contributions
and+0.054
 contain+0.050
 Follow+0.049
 Stay+0.046
 Madd+0.045
Neg logit contributions
elle-0.046
burn-0.044
ky-0.043
ami-0.043
andra-0.041
Token you
Token ID345
Feature activation-4.79e-01

Pos logit contributions
elle+0.000
burn+0.000
ami+0.000
ky+0.000
 str+0.000
Neg logit contributions
and-0.000
 contain-0.000
 Follow-0.000
 Madd-0.000
engu-0.000
Token will
Token ID481
Feature activation-7.93e-01

Pos logit contributions
and+0.019
 Madd+0.017
 Follow+0.017
 contain+0.017
oles+0.016
Neg logit contributions
 str-0.017
burn-0.017
elle-0.016
ine-0.016
ah-0.016
Token not
Token ID407
Feature activation+1.37e+00

Pos logit contributions
 contain+0.023
and+0.022
 Stay+0.020
 Follow+0.019
 Madd+0.018
Neg logit contributions
elle-0.041
ami-0.039
ky-0.038
ited-0.037
burn-0.036
Token.
Token ID13
Feature activation+2.14e-01

Pos logit contributions
elle+0.012
burn+0.012
ky+0.012
ami+0.011
ah+0.011
Neg logit contributions
and-0.012
 contain-0.011
 Follow-0.011
 Madd-0.010
 Stay-0.010
Token And
Token ID843
Feature activation+1.49e-01

Pos logit contributions
elle+0.009
burn+0.009
ami+0.009
ky+0.009
ine+0.009
Neg logit contributions
and-0.007
 contain-0.007
 Follow-0.006
oles-0.006
 plunge-0.006
Token you
Token ID345
Feature activation-4.49e-01

Pos logit contributions
elle+0.006
burn+0.006
ky+0.006
ami+0.006
andra+0.006
Neg logit contributions
and-0.005
 contain-0.005
 Follow-0.005
 Madd-0.004
 Stay-0.004
Token have
Token ID423
Feature activation+4.96e-01

Pos logit contributions
and+0.017
 contain+0.014
 Follow+0.014
 Madd+0.014
oles+0.014
Neg logit contributions
elle-0.018
burn-0.018
ah-0.018
ky-0.017
ine-0.017
Token to
Token ID284
Feature activation+6.37e-01

Pos logit contributions
elle+0.022
burn+0.021
ami+0.021
ky+0.021
andra+0.020
Neg logit contributions
and-0.017
 contain-0.015
 Follow-0.015
 Stay-0.014
 Madd-0.013
Token promise
Token ID6991
Feature activation-3.71e+00

Pos logit contributions
elle+0.034
ami+0.033
ky+0.032
burn+0.032
ming+0.031
Neg logit contributions
 contain-0.016
and-0.014
 Stay-0.014
 Follow-0.013
 Madd-0.012
Token not
Token ID407
Feature activation+1.36e+00

Pos logit contributions
and+0.155
 contain+0.144
 Follow+0.141
 Madd+0.132
 Stay+0.132
Neg logit contributions
elle-0.133
burn-0.129
ky-0.126
ami-0.125
ah-0.120
Token to
Token ID284
Feature activation+6.33e-01

Pos logit contributions
elle+0.061
burn+0.059
ky+0.059
ami+0.058
andra+0.056
Neg logit contributions
and-0.045
 contain-0.041
 Follow-0.039
 Stay-0.037
 Madd-0.036
Token tell
Token ID1560
Feature activation-1.25e+00

Pos logit contributions
elle+0.033
ami+0.032
ky+0.031
andra+0.031
burn+0.031
Neg logit contributions
 contain-0.015
and-0.015
 Stay-0.014
 Follow-0.012
 Madd-0.012
Token Mom
Token ID11254
Feature activation-6.77e-02

Pos logit contributions
and+0.050
 contain+0.046
 Follow+0.045
 Stay+0.043
 Madd+0.042
Neg logit contributions
elle-0.046
burn-0.046
ky-0.045
ami-0.044
andra-0.043
Token or
Token ID393
Feature activation+2.75e-01

Pos logit contributions
and+0.003
 contain+0.003
 Follow+0.003
 Stay+0.003
 Madd+0.003
Neg logit contributions
elle-0.002
burn-0.002
ky-0.002
ami-0.002
andra-0.002
Token Now
Token ID2735
Feature activation-7.42e-01

Pos logit contributions
elle+0.008
burn+0.008
ky+0.007
ami+0.007
ine+0.007
Neg logit contributions
and-0.005
 contain-0.005
 Follow-0.004
oles-0.004
 Madd-0.004
Token,
Token ID11
Feature activation+7.31e-03

Pos logit contributions
and+0.023
 contain+0.022
 Follow+0.021
 Stay+0.020
 Madd+0.019
Neg logit contributions
elle-0.034
ami-0.033
ky-0.033
burn-0.033
ah-0.032
Token you
Token ID345
Feature activation-4.88e-01

Pos logit contributions
elle+0.000
burn+0.000
ky+0.000
ami+0.000
ah+0.000
Neg logit contributions
and-0.000
 contain-0.000
 Follow-0.000
 Stay-0.000
 Madd-0.000
Token have
Token ID423
Feature activation+5.26e-01

Pos logit contributions
and+0.015
 contain+0.013
 Follow+0.012
 Madd+0.012
oles+0.012
Neg logit contributions
elle-0.023
burn-0.023
ky-0.022
ami-0.022
ah-0.022
Token to
Token ID284
Feature activation+6.24e-01

Pos logit contributions
elle+0.024
ami+0.024
burn+0.024
ky+0.023
ah+0.023
Neg logit contributions
and-0.017
 contain-0.015
 Follow-0.014
 Stay-0.014
 Madd-0.013
Token promise
Token ID6991
Feature activation-3.70e+00

Pos logit contributions
elle+0.032
ami+0.030
ky+0.030
burn+0.029
ming+0.028
Neg logit contributions
 contain-0.018
and-0.015
 Stay-0.015
 Follow-0.015
 Madd-0.013
Token me
Token ID502
Feature activation-1.29e+00

Pos logit contributions
and+0.128
 contain+0.116
 Follow+0.111
 Stay+0.108
 Madd+0.105
Neg logit contributions
elle-0.162
ami-0.155
burn-0.153
ky-0.152
ited-0.152
Token something
Token ID1223
Feature activation+7.84e-01

Pos logit contributions
and+0.051
 contain+0.048
 Follow+0.045
 Stay+0.043
 Madd+0.041
Neg logit contributions
elle-0.051
ami-0.049
burn-0.048
ky-0.048
ited-0.048
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
elle+0.031
burn+0.030
ky+0.030
ami+0.029
ah+0.028
Neg logit contributions
and-0.030
 contain-0.027
 Follow-0.027
 Madd-0.025
 Stay-0.025
Token You
Token ID921
Feature activation+3.07e-01

Pos logit contributions
elle+0.007
burn+0.006
ky+0.006
ami+0.006
ine+0.006
Neg logit contributions
and-0.004
 contain-0.004
 Follow-0.004
oles-0.004
 Madd-0.003
Token have
Token ID423
Feature activation+5.41e-01

Pos logit contributions
burn+0.013
elle+0.013
ky+0.013
 str+0.013
ah+0.013
Neg logit contributions
and-0.010
 contain-0.009
 Follow-0.008
 Madd-0.008
oles-0.008
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
and+0.081
 contain+0.076
 Follow+0.073
 Stay+0.069
 Madd+0.067
Neg logit contributions
elle-0.070
burn-0.067
ami-0.066
ky-0.065
ah-0.063
Token But
Token ID887
Feature activation-4.25e-01

Pos logit contributions
elle+0.005
burn+0.005
ky+0.005
ami+0.005
ine+0.005
Neg logit contributions
and-0.004
 contain-0.004
oles-0.003
 Follow-0.003
 Madd-0.003
Token you
Token ID345
Feature activation-4.91e-01

Pos logit contributions
and+0.016
 contain+0.014
 Follow+0.014
oles+0.013
 Madd+0.013
Neg logit contributions
elle-0.017
burn-0.017
ami-0.016
ky-0.016
 str-0.016
Token have
Token ID423
Feature activation+4.89e-01

Pos logit contributions
and+0.022
 Madd+0.019
 Follow+0.019
 contain+0.019
oles+0.019
Neg logit contributions
ah-0.015
burn-0.015
ine-0.015
 str-0.015
elle-0.015
Token to
Token ID284
Feature activation+6.54e-01

Pos logit contributions
elle+0.019
burn+0.019
ky+0.018
ami+0.018
 str+0.017
Neg logit contributions
and-0.019
 contain-0.018
 Follow-0.017
 Madd-0.016
oles-0.016
Token promise
Token ID6991
Feature activation-3.70e+00

Pos logit contributions
elle+0.034
ami+0.033
ky+0.032
burn+0.032
andra+0.031
Neg logit contributions
 contain-0.017
and-0.017
 Stay-0.015
 Follow-0.015
 Madd-0.014
Token me
Token ID502
Feature activation-1.29e+00

Pos logit contributions
and+0.129
 contain+0.118
 Follow+0.115
 Stay+0.108
 Madd+0.107
Neg logit contributions
elle-0.159
burn-0.154
ky-0.152
ami-0.151
ah-0.147
Token to
Token ID284
Feature activation+6.51e-01

Pos logit contributions
and+0.055
 contain+0.052
 Follow+0.050
 Stay+0.048
 Madd+0.047
Neg logit contributions
elle-0.046
burn-0.044
ami-0.043
ky-0.043
andra-0.041
Token never
Token ID1239
Feature activation+4.22e-01

Pos logit contributions
elle+0.032
ami+0.031
ky+0.031
burn+0.030
ited+0.030
Neg logit contributions
and-0.018
 contain-0.018
 Stay-0.017
 Follow-0.016
 Madd-0.015
Token do
Token ID466
Feature activation-5.63e-01

Pos logit contributions
burn+0.016
elle+0.015
ky+0.014
ami+0.014
 str+0.014
Neg logit contributions
and-0.017
 Follow-0.015
 contain-0.015
 Madd-0.015
oles-0.014
Token this
Token ID428
Feature activation-1.57e+00

Pos logit contributions
and+0.019
 contain+0.018
 Stay+0.018
 Go+0.017
 tum+0.016
Neg logit contributions
ited-0.024
ami-0.024
aman-0.024
ky-0.024
ah-0.024
Token We
Token ID775
Feature activation+2.57e-01

Pos logit contributions
and+0.001
 contain+0.000
 Follow+0.000
 Madd+0.000
oles+0.000
Neg logit contributions
elle-0.001
burn-0.001
ky-0.001
ami-0.001
andra-0.001
Token promise
Token ID6991
Feature activation-3.64e+00

Pos logit contributions
elle+0.012
burn+0.012
ami+0.012
ky+0.012
andra+0.011
Neg logit contributions
and-0.008
 contain-0.007
 Follow-0.007
 Stay-0.006
 Madd-0.006
Token,
Token ID11
Feature activation-5.46e-02

Pos logit contributions
and+0.144
 contain+0.133
 Follow+0.131
 Madd+0.121
 tad+0.120
Neg logit contributions
elle-0.138
burn-0.137
ky-0.133
ami-0.131
andra-0.126
Token promise
Token ID6991
Feature activation-3.67e+00

Pos logit contributions
and+0.002
 Follow+0.002
 contain+0.002
unn+0.002
oles+0.001
Neg logit contributions
elle-0.002
burn-0.002
ami-0.002
 str-0.002
ky-0.002
Token,
Token ID11
Feature activation-8.68e-02

Pos logit contributions
and+0.176
 contain+0.166
 Follow+0.162
 tad+0.153
 Madd+0.151
Neg logit contributions
burn-0.111
elle-0.108
ky-0.103
ami-0.101
ine-0.099
Token promise
Token ID6991
Feature activation-3.70e+00

Pos logit contributions
and+0.003
 contain+0.003
 Follow+0.003
unn+0.003
oles+0.002
Neg logit contributions
burn-0.004
elle-0.004
ami-0.003
 str-0.003
ky-0.003
Token."
Token ID526
Feature activation-1.05e-01

Pos logit contributions
and+0.185
 contain+0.175
 Follow+0.171
 tad+0.162
 Madd+0.159
Neg logit contributions
burn-0.106
elle-0.101
ky-0.096
ami-0.094
ine-0.094
Token
Token ID198
Feature activation-7.80e-02

Pos logit contributions
and+0.004
 contain+0.003
 Follow+0.003
 Madd+0.003
 Stay+0.003
Neg logit contributions
elle-0.005
burn-0.004
ky-0.004
ami-0.004
andra-0.004
Token
Token ID198
Feature activation-1.47e-02

Pos logit contributions
 contain+0.003
and+0.003
 Follow+0.003
 plunge+0.003
unn+0.003
Neg logit contributions
ami-0.002
ep-0.002
elle-0.002
burn-0.002
andra-0.002
TokenThe
Token ID464
Feature activation-6.22e-01

Pos logit contributions
and+0.001
 contain+0.001
 Follow+0.001
 Stay+0.001
 Madd+0.001
Neg logit contributions
elle-0.001
burn-0.000
ky-0.000
ami-0.000
ah-0.000
Token teacher
Token ID4701
Feature activation+7.77e-01

Pos logit contributions
and+0.024
 contain+0.022
 Follow+0.022
 Madd+0.021
oles+0.020
Neg logit contributions
elle-0.024
ami-0.024
burn-0.023
ky-0.023
andra-0.022
Token will
Token ID481
Feature activation-7.60e-01

Pos logit contributions
elle+0.016
burn+0.016
ky+0.016
ami+0.016
andra+0.015
Neg logit contributions
and-0.009
 contain-0.008
 Follow-0.008
 Stay-0.007
 Madd-0.007
Token like
Token ID588
Feature activation-1.40e-01

Pos logit contributions
 contain+0.023
and+0.020
 Stay+0.019
 Follow+0.019
 Madd+0.019
Neg logit contributions
elle-0.037
ami-0.036
burn-0.035
ited-0.035
andra-0.034
Token him
Token ID683
Feature activation-2.25e+00

Pos logit contributions
and+0.005
 contain+0.005
 Follow+0.005
 Madd+0.005
 Stay+0.004
Neg logit contributions
elle-0.005
burn-0.005
ky-0.005
ami-0.005
ah-0.005
Token,
Token ID11
Feature activation-5.86e-02

Pos logit contributions
and+0.087
 contain+0.080
 Follow+0.078
 Stay+0.073
 Madd+0.072
Neg logit contributions
elle-0.086
burn-0.084
ami-0.083
ky-0.083
andra-0.081
Token I
Token ID314
Feature activation-1.15e-01

Pos logit contributions
and+0.002
 contain+0.002
 Follow+0.002
 Madd+0.002
 Stay+0.002
Neg logit contributions
elle-0.002
burn-0.002
ky-0.002
ami-0.002
ah-0.002
Token promise
Token ID6991
Feature activation-3.70e+00

Pos logit contributions
and+0.003
 contain+0.003
 Follow+0.003
 Stay+0.003
 Madd+0.003
Neg logit contributions
elle-0.005
ami-0.005
ky-0.005
burn-0.005
andra-0.005
Token."
Token ID526
Feature activation+2.69e-01

Pos logit contributions
and+0.129
 contain+0.118
 Follow+0.116
 Stay+0.109
 Madd+0.107
Neg logit contributions
elle-0.157
burn-0.153
ami-0.152
ky-0.151
ited-0.146
Token
Token ID198
Feature activation-4.92e-02

Pos logit contributions
elle+0.010
burn+0.010
ami+0.010
ky+0.010
andra+0.010
Neg logit contributions
and-0.010
 contain-0.010
 Follow-0.009
oles-0.009
 tad-0.009
Token
Token ID198
Feature activation+1.98e-03

Pos logit contributions
 contain+0.002
and+0.002
 Follow+0.002
 tad+0.002
 plunge+0.002
Neg logit contributions
elle-0.001
ami-0.001
andra-0.001
ine-0.001
 str-0.001
TokenBen
Token ID11696
Feature activation-7.60e-01

Pos logit contributions
burn+0.000
elle+0.000
ky+0.000
ami+0.000
en+0.000
Neg logit contributions
and-0.000
 contain-0.000
 Follow-0.000
 Stay-0.000
 Madd-0.000
Token nodded
Token ID14464
Feature activation+5.77e-01

Pos logit contributions
and+0.030
 contain+0.028
 Follow+0.028
 Stay+0.025
 Madd+0.025
Neg logit contributions
elle-0.029
burn-0.028
ami-0.027
ky-0.027
ah-0.026
Token Lily
Token ID20037
Feature activation-4.30e-01

Pos logit contributions
and+0.001
 contain+0.001
 Follow+0.001
 Madd+0.001
 Stay+0.001
Neg logit contributions
elle-0.001
burn-0.001
ky-0.001
ami-0.001
ah-0.001
Token said
Token ID531
Feature activation-3.13e-01

Pos logit contributions
and+0.019
 contain+0.017
 Follow+0.017
 Madd+0.016
oles+0.016
Neg logit contributions
elle-0.014
burn-0.014
ky-0.014
ami-0.013
andra-0.013
Token,
Token ID11
Feature activation+1.24e-01

Pos logit contributions
and+0.014
 contain+0.014
 Follow+0.013
 tad+0.013
engu+0.012
Neg logit contributions
burn-0.009
elle-0.009
 str-0.009
ming-0.008
ah-0.008
Token "
Token ID366
Feature activation-9.77e-01

Pos logit contributions
elle+0.006
burn+0.006
ky+0.006
ami+0.006
ah+0.006
Neg logit contributions
and-0.004
 contain-0.003
 Follow-0.003
 Madd-0.003
 Stay-0.003
TokenWe
Token ID1135
Feature activation-2.62e-01

Pos logit contributions
and+0.033
 Follow+0.030
 contain+0.029
 Stay+0.027
 Madd+0.026
Neg logit contributions
elle-0.044
burn-0.042
ky-0.041
ami-0.041
ine-0.040
Token promise
Token ID6991
Feature activation-3.70e+00

Pos logit contributions
and+0.009
 contain+0.008
 Follow+0.008
 Madd+0.007
oles+0.007
Neg logit contributions
elle-0.011
burn-0.011
ky-0.011
ah-0.011
ami-0.011
Token,
Token ID11
Feature activation+1.15e-01

Pos logit contributions
and+0.151
 contain+0.141
 Follow+0.138
 Madd+0.129
 Stay+0.128
Neg logit contributions
elle-0.135
burn-0.133
ky-0.129
ami-0.128
andra-0.122
Token mom
Token ID1995
Feature activation-3.82e-01

Pos logit contributions
burn+0.005
elle+0.005
 str+0.005
ine+0.005
ami+0.005
Neg logit contributions
and-0.004
 contain-0.004
 Follow-0.003
oles-0.003
unn-0.003
Token.
Token ID13
Feature activation+1.36e-01

Pos logit contributions
and+0.013
 Follow+0.011
 Stay+0.011
 contain+0.011
suits+0.010
Neg logit contributions
elle-0.016
ited-0.016
ky-0.016
burn-0.015
andra-0.015
Token Thank
Token ID6952
Feature activation-3.76e-01

Pos logit contributions
burn+0.006
elle+0.006
ky+0.006
ami+0.006
 presses+0.006
Neg logit contributions
and-0.005
 contain-0.004
oles-0.004
 tad-0.003
 Madd-0.003
Token you
Token ID345
Feature activation-5.67e-01

Pos logit contributions
and+0.013
 contain+0.012
 Follow+0.012
 Madd+0.011
engu+0.011
Neg logit contributions
elle-0.015
burn-0.015
ky-0.015
ami-0.015
andra-0.014
Token.
Token ID13
Feature activation+2.53e-01

Pos logit contributions
and+0.048
 contain+0.044
 Follow+0.043
 Stay+0.040
 Madd+0.040
Neg logit contributions
elle-0.042
burn-0.041
ami-0.040
ky-0.039
andra-0.038
Token And
Token ID843
Feature activation+1.65e-01

Pos logit contributions
elle+0.012
burn+0.012
ky+0.012
ami+0.011
ah+0.011
Neg logit contributions
and-0.008
 contain-0.007
 Follow-0.007
 Madd-0.006
 Stay-0.006
Token you
Token ID345
Feature activation-4.12e-01

Pos logit contributions
elle+0.007
burn+0.007
ky+0.007
ami+0.007
ine+0.007
Neg logit contributions
and-0.005
 contain-0.005
 Follow-0.005
 Madd-0.004
oles-0.004
Token have
Token ID423
Feature activation+5.21e-01

Pos logit contributions
and+0.016
 contain+0.015
 Madd+0.014
oles+0.014
 Follow+0.014
Neg logit contributions
elle-0.015
burn-0.015
ep-0.015
ah-0.015
 str-0.015
Token to
Token ID284
Feature activation+6.61e-01

Pos logit contributions
elle+0.023
burn+0.022
ky+0.022
ami+0.022
ah+0.021
Neg logit contributions
and-0.018
 contain-0.016
 Follow-0.016
 Stay-0.015
 Madd-0.015
Token promise
Token ID6991
Feature activation-3.70e+00

Pos logit contributions
elle+0.035
ami+0.033
ky+0.033
burn+0.033
ited+0.032
Neg logit contributions
 contain-0.017
 Stay-0.015
and-0.014
 Follow-0.014
 bury-0.013
Token me
Token ID502
Feature activation-1.19e+00

Pos logit contributions
and+0.148
 contain+0.137
 Follow+0.135
 Stay+0.126
 Madd+0.126
Neg logit contributions
elle-0.139
burn-0.135
ky-0.132
ami-0.131
ah-0.126
Token you
Token ID345
Feature activation-4.17e-01

Pos logit contributions
and+0.049
 contain+0.045
 Follow+0.043
 Stay+0.041
 Madd+0.039
Neg logit contributions
elle-0.045
burn-0.043
ami-0.043
ky-0.043
ited-0.041
Token will
Token ID481
Feature activation-7.04e-01

Pos logit contributions
and+0.017
 contain+0.016
 Madd+0.016
 Follow+0.016
 tad+0.016
Neg logit contributions
elle-0.014
burn-0.014
ky-0.014
 str-0.013
ine-0.013
Token never
Token ID1239
Feature activation+5.06e-01

Pos logit contributions
 contain+0.018
 Stay+0.015
and+0.015
 bury+0.014
 hide+0.014
Neg logit contributions
elle-0.040
ami-0.037
ky-0.036
ited-0.036
burn-0.035
Token touch
Token ID3638
Feature activation+7.80e-01

Pos logit contributions
elle+0.021
burn+0.020
ky+0.020
ami+0.020
andra+0.019
Neg logit contributions
and-0.018
 contain-0.017
 Follow-0.016
 Stay-0.015
 Madd-0.015
Token.
Token ID13
Feature activation+2.37e-01

Pos logit contributions
and+0.063
 contain+0.056
 Follow+0.055
 Stay+0.052
 Madd+0.050
Neg logit contributions
elle-0.053
burn-0.053
ami-0.050
ky-0.049
andra-0.049
Token But
Token ID887
Feature activation-3.95e-01

Pos logit contributions
elle+0.010
burn+0.010
ky+0.010
ami+0.010
ine+0.009
Neg logit contributions
and-0.008
 contain-0.007
oles-0.007
 tad-0.006
 Follow-0.006
Token you
Token ID345
Feature activation-5.04e-01

Pos logit contributions
and+0.014
 contain+0.013
 Follow+0.012
oles+0.012
engu+0.011
Neg logit contributions
elle-0.016
burn-0.016
ami-0.016
ky-0.015
 str-0.015
Token have
Token ID423
Feature activation+5.09e-01

Pos logit contributions
and+0.023
 contain+0.020
 Follow+0.020
 Madd+0.020
oles+0.020
Neg logit contributions
 str-0.015
ine-0.015
ah-0.015
burn-0.015
elle-0.014
Token to
Token ID284
Feature activation+6.36e-01

Pos logit contributions
elle+0.022
burn+0.021
ky+0.021
ami+0.021
ah+0.020
Neg logit contributions
and-0.018
 contain-0.016
 Follow-0.016
 Madd-0.015
 Stay-0.015
Token promise
Token ID6991
Feature activation-3.70e+00

Pos logit contributions
elle+0.034
ami+0.033
ky+0.033
burn+0.032
ah+0.032
Neg logit contributions
 contain-0.016
and-0.014
 Stay-0.014
 Follow-0.013
 Madd-0.012
Token me
Token ID502
Feature activation-1.30e+00

Pos logit contributions
and+0.137
 contain+0.126
 Follow+0.123
 Stay+0.116
 Madd+0.116
Neg logit contributions
elle-0.151
burn-0.145
ky-0.143
ami-0.142
ah-0.139
Token not
Token ID407
Feature activation+1.37e+00

Pos logit contributions
and+0.055
 contain+0.051
 Follow+0.049
 Stay+0.047
 Madd+0.046
Neg logit contributions
elle-0.046
burn-0.045
ami-0.045
ky-0.044
ited-0.043
Token to
Token ID284
Feature activation+6.50e-01

Pos logit contributions
elle+0.066
ami+0.063
ky+0.063
burn+0.062
andra+0.061
Neg logit contributions
and-0.043
 contain-0.039
 Follow-0.037
 Stay-0.035
 Madd-0.033
Token touch
Token ID3638
Feature activation+7.94e-01

Pos logit contributions
elle+0.032
ami+0.031
andra+0.029
ky+0.029
ah+0.029
Neg logit contributions
 contain-0.020
and-0.019
 Stay-0.017
 Follow-0.016
 Madd-0.015
Token it
Token ID340
Feature activation-6.39e-01

Pos logit contributions
ited+0.032
burn+0.032
andra+0.032
ami+0.032
elle+0.031
Neg logit contributions
and-0.028
 Stay-0.024
 Follow-0.024
 contain-0.024
 Madd-0.023
Token.
Token ID13
Feature activation-1.42e-01

Pos logit contributions
 Follow+0.031
 contain+0.031
 Madd+0.030
 tad+0.030
engu+0.030
Neg logit contributions
burn-0.007
ky-0.007
ine-0.007
 dreaming-0.007
en-0.007
Token But
Token ID887
Feature activation-4.71e-01

Pos logit contributions
and+0.005
 contain+0.004
oles+0.004
 tad+0.004
 Madd+0.004
Neg logit contributions
elle-0.006
ky-0.006
burn-0.006
ami-0.006
ah-0.006
Token you
Token ID345
Feature activation-5.34e-01

Pos logit contributions
and+0.019
 contain+0.017
suits+0.016
 Follow+0.016
oles+0.016
Neg logit contributions
elle-0.017
 str-0.017
burn-0.017
ami-0.016
ah-0.016
Token have
Token ID423
Feature activation+4.98e-01

Pos logit contributions
and+0.024
 contain+0.021
 Follow+0.021
oles+0.019
 hind+0.019
Neg logit contributions
 str-0.017
burn-0.016
ine-0.016
ah-0.016
en-0.016
Token to
Token ID284
Feature activation+4.87e-01

Pos logit contributions
elle+0.021
burn+0.020
ky+0.020
ami+0.020
andra+0.019
Neg logit contributions
and-0.018
 contain-0.017
 Follow-0.016
 Madd-0.015
oles-0.015
Token promise
Token ID6991
Feature activation-3.69e+00

Pos logit contributions
elle+0.026
ami+0.025
ky+0.024
burn+0.024
andra+0.023
Neg logit contributions
and-0.012
 contain-0.012
 Follow-0.011
 Stay-0.011
 Madd-0.010
Token me
Token ID502
Feature activation-1.31e+00

Pos logit contributions
and+0.159
 contain+0.151
 Follow+0.146
 tad+0.136
oles+0.135
Neg logit contributions
burn-0.126
elle-0.126
ky-0.122
ami-0.119
en-0.116
Token you
Token ID345
Feature activation-5.20e-01

Pos logit contributions
and+0.055
 contain+0.051
 Follow+0.050
 Stay+0.047
 Madd+0.047
Neg logit contributions
elle-0.047
burn-0.045
ky-0.044
ami-0.044
andra-0.042
Token will
Token ID481
Feature activation-8.01e-01

Pos logit contributions
and+0.023
 tad+0.022
 Madd+0.022
 Follow+0.022
 contain+0.022
Neg logit contributions
burn-0.015
 str-0.015
elle-0.014
ine-0.014
ky-0.014
Token never
Token ID1239
Feature activation+5.08e-01

Pos logit contributions
 contain+0.022
and+0.022
 Stay+0.020
 Follow+0.019
 Madd+0.019
Neg logit contributions
elle-0.042
ami-0.039
ky-0.038
ited-0.038
andra-0.037
Token play
Token ID711
Feature activation+1.51e-01

Pos logit contributions
burn+0.019
elle+0.017
 str+0.017
en+0.017
ky+0.017
Neg logit contributions
and-0.020
 Follow-0.019
 contain-0.018
oles-0.018
 Madd-0.018
Token.
Token ID13
Feature activation+4.44e-01

Pos logit contributions
and+0.013
 Follow+0.012
 Stay+0.012
 contain+0.011
 Madd+0.011
Neg logit contributions
elle-0.015
burn-0.015
ami-0.014
aman-0.014
ky-0.014
Token But
Token ID887
Feature activation-3.45e-01

Pos logit contributions
elle+0.020
ky+0.019
burn+0.019
ami+0.019
ah+0.019
Neg logit contributions
and-0.014
 contain-0.013
oles-0.011
 tad-0.011
engu-0.011
Token you
Token ID345
Feature activation-4.58e-01

Pos logit contributions
and+0.013
 contain+0.012
suits+0.011
 Follow+0.011
oles+0.010
Neg logit contributions
elle-0.014
 str-0.013
ami-0.013
ah-0.013
burn-0.013
Token have
Token ID423
Feature activation+5.92e-01

Pos logit contributions
and+0.022
 Follow+0.019
oles+0.018
 contain+0.018
 Madd+0.018
Neg logit contributions
 str-0.013
ah-0.013
ine-0.013
en-0.012
ep-0.012
Token to
Token ID284
Feature activation+6.69e-01

Pos logit contributions
elle+0.025
burn+0.024
ky+0.024
ami+0.023
ah+0.023
Neg logit contributions
and-0.021
 contain-0.020
 Follow-0.019
 Madd-0.018
 Stay-0.017
Token promise
Token ID6991
Feature activation-3.69e+00

Pos logit contributions
elle+0.034
ky+0.033
ami+0.033
burn+0.033
andra+0.031
Neg logit contributions
and-0.018
 contain-0.017
 Follow-0.016
 Stay-0.015
 Madd-0.014
Token me
Token ID502
Feature activation-1.27e+00

Pos logit contributions
and+0.137
 contain+0.126
 Follow+0.124
 Stay+0.115
 Madd+0.115
Neg logit contributions
elle-0.149
burn-0.146
ky-0.143
ami-0.142
ah-0.137
Token you
Token ID345
Feature activation-4.63e-01

Pos logit contributions
and+0.055
 contain+0.052
 Follow+0.050
 Stay+0.048
 Madd+0.047
Neg logit contributions
elle-0.043
burn-0.042
ky-0.041
ami-0.041
andra-0.039
Token will
Token ID481
Feature activation-6.78e-01

Pos logit contributions
and+0.020
 Follow+0.019
 tad+0.018
 Madd+0.018
 contain+0.018
Neg logit contributions
elle-0.014
 str-0.014
burn-0.014
ine-0.013
ky-0.013
Token never
Token ID1239
Feature activation+4.87e-01

Pos logit contributions
 contain+0.019
and+0.019
 Stay+0.017
 Follow+0.016
 Madd+0.015
Neg logit contributions
elle-0.035
ami-0.033
ky-0.033
burn-0.032
ited-0.031
Token lie
Token ID6486
Feature activation+1.35e-01

Pos logit contributions
burn+0.019
elle+0.018
ky+0.017
ami+0.017
en+0.017
Neg logit contributions
and-0.019
 Follow-0.017
 contain-0.017
 Madd-0.016
oles-0.016
TokenB
Token ID33
Feature activation-1.68e+00

Pos logit contributions
elle+0.005
ky+0.004
burn+0.004
ami+0.004
ah+0.004
Neg logit contributions
and-0.007
 contain-0.007
 Follow-0.007
engu-0.006
 Stay-0.006
Tokenella
Token ID12627
Feature activation-2.11e-01

Pos logit contributions
and+0.055
 contain+0.054
 Follow+0.049
 Stay+0.046
 Madd+0.045
Neg logit contributions
elle-0.074
burn-0.072
ky-0.071
ami-0.071
ah-0.070
Token nodded
Token ID14464
Feature activation+5.90e-01

Pos logit contributions
and+0.008
 contain+0.007
 Follow+0.007
 Stay+0.007
 Madd+0.006
Neg logit contributions
elle-0.009
burn-0.009
ami-0.008
ky-0.008
ah-0.008
Token and
Token ID290
Feature activation+1.39e-02

Pos logit contributions
elle+0.022
burn+0.021
ami+0.021
ky+0.021
ah+0.020
Neg logit contributions
and-0.024
 contain-0.022
 Follow-0.021
 Stay-0.021
 Madd-0.021
Token whim
Token ID29923
Feature activation+4.07e-01

Pos logit contributions
elle+0.001
burn+0.001
ami+0.001
ky+0.001
ah+0.001
Neg logit contributions
and-0.000
 contain-0.000
 Follow-0.000
 Madd-0.000
 Stay-0.000
Tokenpered
Token ID13653
Feature activation-3.69e+00

Pos logit contributions
elle+0.022
ky+0.022
ine+0.022
en+0.022
burn+0.022
Neg logit contributions
and-0.010
 contain-0.009
 Follow-0.008
engu-0.007
 plunge-0.007
Token.
Token ID13
Feature activation+2.98e-01

Pos logit contributions
and+0.156
 contain+0.144
 Follow+0.142
 Stay+0.136
 Madd+0.129
Neg logit contributions
elle-0.131
burn-0.131
ami-0.127
ky-0.121
andra-0.121
Token She
Token ID1375
Feature activation+1.19e-01

Pos logit contributions
elle+0.010
ky+0.009
burn+0.009
ami+0.009
ine+0.009
Neg logit contributions
and-0.013
 contain-0.013
 Follow-0.012
engu-0.011
oles-0.011
Token was
Token ID373
Feature activation-3.70e-03

Pos logit contributions
burn+0.005
elle+0.005
ami+0.005
ky+0.005
en+0.005
Neg logit contributions
and-0.004
 Follow-0.004
 contain-0.004
 Stay-0.003
 tad-0.003
Token ashamed
Token ID22461
Feature activation+1.61e-01

Pos logit contributions
and+0.000
 contain+0.000
 Follow+0.000
 Stay+0.000
 Madd+0.000
Neg logit contributions
elle-0.000
burn-0.000
ami-0.000
ky-0.000
ah-0.000
Token and
Token ID290
Feature activation+5.01e-02

Pos logit contributions
burn+0.006
elle+0.006
ky+0.006
ami+0.006
andra+0.006
Neg logit contributions
and-0.006
 contain-0.006
 Follow-0.005
 Stay-0.005
 Madd-0.005
Token on
Token ID319
Feature activation+4.66e-01

Pos logit contributions
elle+0.025
ami+0.025
ky+0.025
burn+0.024
andra+0.023
Neg logit contributions
and-0.022
 Follow-0.021
 contain-0.021
 Stay-0.019
oles-0.018
Token,
Token ID11
Feature activation+1.01e-01

Pos logit contributions
burn+0.020
elle+0.020
ky+0.020
ami+0.020
ited+0.019
Neg logit contributions
and-0.015
 contain-0.014
 Madd-0.014
 Follow-0.014
oles-0.013
Token the
Token ID262
Feature activation+2.58e-01

Pos logit contributions
burn+0.004
ky+0.004
elle+0.004
ep+0.004
ami+0.004
Neg logit contributions
and-0.004
 contain-0.004
 Madd-0.004
 Follow-0.004
 tad-0.003
Token princess
Token ID21752
Feature activation+2.52e-01

Pos logit contributions
burn+0.012
elle+0.012
ky+0.011
ami+0.011
aman+0.011
Neg logit contributions
and-0.008
 contain-0.008
 Follow-0.007
 tad-0.007
 Stay-0.007
Token was
Token ID373
Feature activation+2.28e-02

Pos logit contributions
burn+0.011
elle+0.010
ky+0.010
aman+0.010
ami+0.010
Neg logit contributions
and-0.009
 Follow-0.008
 contain-0.007
oles-0.007
 Stay-0.007
Token known
Token ID1900
Feature activation+3.72e+00

Pos logit contributions
ky+0.001
ami+0.001
elle+0.001
aman+0.001
en+0.001
Neg logit contributions
and-0.001
 contain-0.001
 Madd-0.001
 Stay-0.001
 Follow-0.001
Token for
Token ID329
Feature activation-8.71e-01

Pos logit contributions
elle+0.120
burn+0.118
ky+0.113
ami+0.113
andra+0.108
Neg logit contributions
and-0.169
 contain-0.160
 Follow-0.155
 Madd-0.148
 Stay-0.145
Token being
Token ID852
Feature activation-7.25e-01

Pos logit contributions
and+0.037
 contain+0.034
 Madd+0.033
 Follow+0.032
 tad+0.032
Neg logit contributions
burn-0.032
elle-0.030
ah-0.030
ky-0.029
andra-0.029
Token the
Token ID262
Feature activation+2.66e-01

Pos logit contributions
and+0.020
 Follow+0.017
 contain+0.017
 Madd+0.016
 Stay+0.016
Neg logit contributions
burn-0.037
aman-0.037
elle-0.036
ami-0.036
andra-0.036
Token brave
Token ID14802
Feature activation-8.39e-01

Pos logit contributions
elle+0.012
burn+0.012
ky+0.012
ami+0.011
andra+0.011
Neg logit contributions
and-0.009
 contain-0.008
 Follow-0.008
 Stay-0.007
 Madd-0.007
Token,
Token ID11
Feature activation+1.07e-01

Pos logit contributions
and+0.027
 contain+0.025
 Follow+0.024
 tad+0.022
 Stay+0.022
Neg logit contributions
burn-0.038
elle-0.036
ky-0.036
ami-0.035
ep-0.035
Token taught
Token ID7817
Feature activation-5.34e-02

Pos logit contributions
and+0.002
 contain+0.001
 Stay+0.001
 Follow+0.001
 Madd+0.001
Neg logit contributions
burn-0.002
elle-0.002
ami-0.002
ky-0.002
ited-0.002
Token him
Token ID683
Feature activation-2.33e+00

Pos logit contributions
and+0.002
 contain+0.002
 Madd+0.002
 Follow+0.002
 Stay+0.002
Neg logit contributions
burn-0.002
elle-0.002
ky-0.002
ited-0.002
ami-0.002
Token.
Token ID13
Feature activation-5.86e-01

Pos logit contributions
and+0.109
 contain+0.101
 Follow+0.097
 Stay+0.094
 Madd+0.093
Neg logit contributions
burn-0.075
elle-0.074
ky-0.072
ami-0.071
andra-0.068
Token "
Token ID366
Feature activation-8.50e-01

Pos logit contributions
and+0.018
 contain+0.016
 Follow+0.016
 Madd+0.014
 Stay+0.014
Neg logit contributions
elle-0.028
burn-0.027
ky-0.027
ami-0.026
ah-0.026
TokenNo
Token ID2949
Feature activation+2.19e+00

Pos logit contributions
and+0.033
 Follow+0.029
 contain+0.027
 Stay+0.026
 Go+0.024
Neg logit contributions
burn-0.035
elle-0.033
ine-0.032
ky-0.031
inkle-0.031
Token matter
Token ID2300
Feature activation+3.48e+00

Pos logit contributions
burn+0.086
elle+0.086
ky+0.084
ami+0.084
andra+0.080
Neg logit contributions
and-0.081
 contain-0.076
 Follow-0.073
 Madd-0.069
 Stay-0.069
Token what
Token ID644
Feature activation+7.19e-01

Pos logit contributions
elle+0.082
burn+0.075
ky+0.073
ami+0.072
ah+0.069
Neg logit contributions
and-0.187
 contain-0.180
 Follow-0.175
 Madd-0.167
 tad-0.167
Token happens
Token ID4325
Feature activation+5.29e-02

Pos logit contributions
burn+0.030
elle+0.030
ami+0.030
ky+0.029
andra+0.028
Neg logit contributions
and-0.025
 contain-0.023
 Follow-0.023
 Stay-0.021
oles-0.021
Token,
Token ID11
Feature activation-1.62e-02

Pos logit contributions
elle+0.002
burn+0.002
ami+0.002
ky+0.002
andra+0.002
Neg logit contributions
and-0.002
 contain-0.002
 Follow-0.002
 Madd-0.002
oles-0.001
Token Joe
Token ID5689
Feature activation+8.98e-02

Pos logit contributions
and+0.001
 contain+0.001
 Follow+0.000
 Madd+0.000
 Stay+0.000
Neg logit contributions
elle-0.001
burn-0.001
ky-0.001
ami-0.001
ah-0.001
Token,
Token ID11
Feature activation-2.51e-02

Pos logit contributions
burn+0.004
ami+0.004
elle+0.004
ky+0.004
aman+0.004
Neg logit contributions
and-0.003
 contain-0.002
 Follow-0.002
 Stay-0.002
oles-0.002
Token.
Token ID13
Feature activation-9.64e-01

Pos logit contributions
and+0.066
 Follow+0.063
 contain+0.062
engu+0.059
 Madd+0.058
Neg logit contributions
elle-0.043
burn-0.042
ky-0.039
ami-0.039
ah-0.039
Token He
Token ID679
Feature activation-4.19e-01

Pos logit contributions
and+0.033
 contain+0.031
 Follow+0.028
engu+0.027
unn+0.027
Neg logit contributions
elle-0.040
ky-0.039
burn-0.039
ami-0.038
ine-0.038
Token knew
Token ID2993
Feature activation-1.16e+00

Pos logit contributions
and+0.015
 contain+0.014
 Follow+0.014
 Stay+0.013
 Madd+0.013
Neg logit contributions
burn-0.017
elle-0.017
ami-0.016
ky-0.016
andra-0.015
Token that
Token ID326
Feature activation-1.40e-02

Pos logit contributions
and+0.050
 contain+0.047
 Follow+0.046
 Madd+0.045
 Stay+0.044
Neg logit contributions
elle-0.040
burn-0.039
ky-0.038
ami-0.038
ited-0.036
Token no
Token ID645
Feature activation+2.01e-02

Pos logit contributions
and+0.001
 contain+0.001
 Follow+0.001
unn+0.001
engu+0.001
Neg logit contributions
elle-0.000
burn-0.000
ami-0.000
ky-0.000
 str-0.000
Token matter
Token ID2300
Feature activation+3.43e+00

Pos logit contributions
burn+0.001
ky+0.001
ami+0.001
elle+0.001
andra+0.001
Neg logit contributions
and-0.001
 contain-0.001
 Follow-0.001
 Stay-0.001
 Madd-0.001
Token how
Token ID703
Feature activation-2.34e-01

Pos logit contributions
elle+0.091
burn+0.089
ky+0.087
ami+0.085
andra+0.081
Neg logit contributions
and-0.174
 contain-0.164
 Follow-0.161
 Stay-0.154
 Madd-0.153
Token many
Token ID867
Feature activation+1.16e+00

Pos logit contributions
and+0.009
 contain+0.009
 Follow+0.009
 Madd+0.008
 Stay+0.008
Neg logit contributions
elle-0.009
burn-0.008
ky-0.008
ami-0.008
ah-0.008
Token times
Token ID1661
Feature activation-4.02e-01

Pos logit contributions
elle+0.048
burn+0.046
ky+0.045
ami+0.045
en+0.044
Neg logit contributions
and-0.042
 contain-0.038
 Follow-0.038
 Madd-0.034
engu-0.034
Token he
Token ID339
Feature activation+2.05e-01

Pos logit contributions
and+0.016
 contain+0.015
 Follow+0.014
 Madd+0.013
 tad+0.013
Neg logit contributions
elle-0.015
burn-0.014
ky-0.014
ami-0.014
en-0.014
Token asked
Token ID1965
Feature activation+8.96e-01

Pos logit contributions
burn+0.009
elle+0.009
ky+0.008
ami+0.008
andra+0.008
Neg logit contributions
and-0.007
 contain-0.007
 Follow-0.006
 Stay-0.006
 Madd-0.006
Token of
Token ID286
Feature activation-4.87e-02

Pos logit contributions
 contain+0.026
 Follow+0.026
and+0.025
unn+0.023
 Stay+0.022
Neg logit contributions
elle-0.040
ami-0.040
ine-0.039
en-0.039
ky-0.039
Token the
Token ID262
Feature activation+3.21e-01

Pos logit contributions
and+0.002
 contain+0.002
 Follow+0.002
 Madd+0.002
 tad+0.002
Neg logit contributions
burn-0.002
elle-0.002
ky-0.002
ami-0.002
ah-0.002
Token street
Token ID4675
Feature activation+1.23e+00

Pos logit contributions
ky+0.011
elle+0.011
ami+0.010
burn+0.009
 str+0.009
Neg logit contributions
and-0.014
oles-0.013
 Madd-0.013
 contain-0.013
 Follow-0.013
Token.
Token ID13
Feature activation+3.11e-01

Pos logit contributions
elle+0.048
burn+0.048
ky+0.045
ami+0.045
andra+0.044
Neg logit contributions
and-0.048
 contain-0.044
 Follow-0.043
 Stay-0.040
 Madd-0.040
Token Exc
Token ID25268
Feature activation+2.11e-01

Pos logit contributions
elle+0.010
andra+0.010
aman+0.010
ky+0.010
ami+0.010
Neg logit contributions
and-0.013
 contain-0.013
unn-0.012
 Follow-0.012
 tad-0.012
Tokenited
Token ID863
Feature activation+2.92e+00

Pos logit contributions
burn+0.004
 dreaming+0.003
elle+0.003
ine+0.003
ami+0.003
Neg logit contributions
and-0.013
 contain-0.012
 Follow-0.011
engu-0.011
unn-0.011
Token,
Token ID11
Feature activation+2.51e-01

Pos logit contributions
burn+0.126
elle+0.124
ami+0.119
ky+0.118
ine+0.115
Neg logit contributions
and-0.099
 contain-0.091
 Follow-0.088
 Madd-0.082
 Stay-0.082
Token John
Token ID1757
Feature activation+4.00e-01

Pos logit contributions
burn+0.011
elle+0.011
ky+0.011
ami+0.011
ine+0.011
Neg logit contributions
and-0.008
 contain-0.008
 Follow-0.007
 Madd-0.007
 Stay-0.007
Token began
Token ID2540
Feature activation+3.29e-01

Pos logit contributions
burn+0.017
elle+0.016
ami+0.016
ky+0.015
aman+0.015
Neg logit contributions
and-0.014
 Follow-0.013
 contain-0.013
 Stay-0.012
 Madd-0.012
Token to
Token ID284
Feature activation+7.98e-01

Pos logit contributions
elle+0.017
burn+0.017
ky+0.016
ami+0.016
ah+0.016
Neg logit contributions
and-0.009
 contain-0.008
 Follow-0.007
 Stay-0.007
 Madd-0.007
Token follow
Token ID1061
Feature activation+7.83e-01

Pos logit contributions
elle+0.035
burn+0.034
ky+0.033
ami+0.033
ah+0.031
Neg logit contributions
and-0.027
 contain-0.026
 Follow-0.025
 Madd-0.023
 Stay-0.023
Token seemed
Token ID3947
Feature activation+9.97e-01

Pos logit contributions
elle+0.028
burn+0.027
ky+0.027
ah+0.026
ami+0.026
Neg logit contributions
and-0.023
 contain-0.021
 Follow-0.021
engu-0.019
 Stay-0.019
Token to
Token ID284
Feature activation+7.99e-01

Pos logit contributions
elle+0.050
ami+0.050
burn+0.050
andra+0.049
aman+0.048
Neg logit contributions
and-0.029
 contain-0.025
 Follow-0.023
 Stay-0.023
 Madd-0.023
Token fit
Token ID4197
Feature activation+1.49e-01

Pos logit contributions
elle+0.037
ky+0.036
ami+0.036
burn+0.036
andra+0.035
Neg logit contributions
 contain-0.024
and-0.024
 Madd-0.021
 Follow-0.021
 Stay-0.021
Token.
Token ID13
Feature activation+2.23e-01

Pos logit contributions
elle+0.006
burn+0.006
ami+0.006
ky+0.006
andra+0.006
Neg logit contributions
and-0.006
 contain-0.005
 Follow-0.005
 Madd-0.005
 Stay-0.005
Token No
Token ID1400
Feature activation+1.80e-01

Pos logit contributions
elle+0.008
ky+0.007
ah+0.007
 str+0.007
ami+0.006
Neg logit contributions
and-0.009
 contain-0.009
oles-0.009
 tad-0.009
 Madd-0.009
Token matter
Token ID2300
Feature activation+3.08e+00

Pos logit contributions
burn+0.005
elle+0.005
ami+0.005
ky+0.005
andra+0.005
Neg logit contributions
and-0.009
 contain-0.008
 Follow-0.008
 Stay-0.008
 Madd-0.007
Token how
Token ID703
Feature activation-2.87e-01

Pos logit contributions
elle+0.079
burn+0.077
ky+0.075
ami+0.074
andra+0.071
Neg logit contributions
and-0.160
 contain-0.150
 Follow-0.148
 Stay-0.141
 Madd-0.141
Token hard
Token ID1327
Feature activation-3.69e-01

Pos logit contributions
and+0.012
 contain+0.011
 Follow+0.011
engu+0.010
 Madd+0.010
Neg logit contributions
elle-0.010
burn-0.010
ky-0.009
ami-0.009
 str-0.009
Token True
Token ID6407
Feature activation+2.83e-01

Pos logit contributions
and+0.015
 contain+0.014
 Follow+0.014
 Stay+0.013
 Madd+0.013
Neg logit contributions
elle-0.013
burn-0.013
ami-0.013
ky-0.013
ah-0.012
Token tried
Token ID3088
Feature activation-4.15e-01

Pos logit contributions
elle+0.012
burn+0.012
ky+0.012
ami+0.011
ah+0.011
Neg logit contributions
and-0.010
 contain-0.009
 Follow-0.009
 Madd-0.008
 Stay-0.008
Token,
Token ID11
Feature activation+2.20e-01

Pos logit contributions
and+0.014
 contain+0.012
 Follow+0.012
 Madd+0.012
 Stay+0.011
Neg logit contributions
elle-0.018
burn-0.018
ami-0.018
ited-0.018
andra-0.018
Token she
Token ID673
Feature activation+4.13e-01

Pos logit contributions
and+0.007
 contain+0.007
 Follow+0.007
 Madd+0.007
 Stay+0.006
Neg logit contributions
ami-0.007
elle-0.007
burn-0.007
ky-0.007
andra-0.007
Token put
Token ID1234
Feature activation+1.04e+00

Pos logit contributions
elle+0.017
burn+0.017
ky+0.017
ami+0.016
ah+0.016
Neg logit contributions
and-0.015
 contain-0.014
 Follow-0.013
 Madd-0.012
 Stay-0.012
Token it
Token ID340
Feature activation-3.24e-01

Pos logit contributions
ami+0.044
elle+0.042
andra+0.042
burn+0.041
ky+0.041
Neg logit contributions
and-0.036
 contain-0.032
 Follow-0.032
 Madd-0.032
 Stay-0.031
Token back
Token ID736
Feature activation-1.43e+00

Pos logit contributions
and+0.011
 contain+0.010
 Follow+0.010
 Stay+0.009
 Madd+0.009
Neg logit contributions
elle-0.014
ami-0.014
ky-0.013
burn-0.013
andra-0.013
Token on
Token ID319
Feature activation+5.18e-01

Pos logit contributions
and+0.047
 contain+0.040
 Follow+0.040
 Stay+0.036
 Madd+0.035
Neg logit contributions
elle-0.066
ami-0.066
andra-0.064
burn-0.062
ky-0.061
Token his
Token ID465
Feature activation+6.22e-01

Pos logit contributions
burn+0.021
ited+0.020
elle+0.020
ami+0.020
ah+0.019
Neg logit contributions
and-0.019
 contain-0.017
 Madd-0.016
 Stay-0.016
 Follow-0.015
Token head
Token ID1182
Feature activation-6.33e-01

Pos logit contributions
elle+0.025
burn+0.025
ky+0.025
ep+0.024
ine+0.024
Neg logit contributions
and-0.023
 Follow-0.021
 Madd-0.020
 contain-0.020
oles-0.020
Token.
Token ID13
Feature activation+4.31e-02

Pos logit contributions
and+0.025
 contain+0.023
 Follow+0.021
 Stay+0.020
 Madd+0.020
Neg logit contributions
elle-0.025
ami-0.024
andra-0.023
burn-0.023
ited-0.023
Token She
Token ID1375
Feature activation+3.99e-01

Pos logit contributions
elle+0.002
burn+0.002
ky+0.002
aman+0.002
ami+0.002
Neg logit contributions
and-0.002
 contain-0.002
 Follow-0.001
 tad-0.001
engu-0.001
Token asked
Token ID1965
Feature activation+9.56e-01

Pos logit contributions
elle+0.016
burn+0.015
ky+0.015
ami+0.015
andra+0.014
Neg logit contributions
and-0.015
 contain-0.014
 Follow-0.014
 Madd-0.013
 Stay-0.013
Token Tim
Token ID5045
Feature activation-6.23e-01

Pos logit contributions
elle+0.031
burn+0.029
ami+0.028
ky+0.028
en+0.027
Neg logit contributions
and-0.043
 contain-0.042
 Follow-0.042
 tad-0.039
oles-0.038
Token on
Token ID319
Feature activation+5.84e-01

Pos logit contributions
elle+0.045
ky+0.044
burn+0.042
en+0.041
 str+0.040
Neg logit contributions
and-0.063
 contain-0.062
 Follow-0.061
oles-0.057
engu-0.057
Token the
Token ID262
Feature activation+3.64e-01

Pos logit contributions
ami+0.021
elle+0.021
burn+0.021
andra+0.021
ited+0.020
Neg logit contributions
and-0.024
 contain-0.021
 Madd-0.020
 Stay-0.020
 Follow-0.020
Token floor
Token ID4314
Feature activation-3.09e-02

Pos logit contributions
burn+0.013
ky+0.012
en+0.011
elle+0.011
ep+0.011
Neg logit contributions
 Follow-0.016
and-0.015
 contain-0.015
oles-0.015
engu-0.015
Token while
Token ID981
Feature activation+2.26e-01

Pos logit contributions
and+0.001
 contain+0.001
 Follow+0.001
 Madd+0.001
 Stay+0.001
Neg logit contributions
burn-0.001
elle-0.001
andra-0.001
ami-0.001
aman-0.001
Token wearing
Token ID5762
Feature activation-1.36e+00

Pos logit contributions
elle+0.008
burn+0.008
ky+0.008
ami+0.008
andra+0.008
Neg logit contributions
and-0.009
 contain-0.009
 Follow-0.008
 Madd-0.008
 Stay-0.008
Token his
Token ID465
Feature activation+8.10e-01

Pos logit contributions
and+0.040
 tum+0.037
 Madd+0.035
 hide+0.035
 contain+0.035
Neg logit contributions
ami-0.066
aman-0.062
andra-0.062
ited-0.061
elle-0.060
Token bath
Token ID7837
Feature activation-1.37e-01

Pos logit contributions
burn+0.032
elle+0.031
ky+0.030
ine+0.029
ep+0.029
Neg logit contributions
and-0.032
 contain-0.029
 Follow-0.029
engu-0.027
oles-0.026
Tokenrobe
Token ID25481
Feature activation-1.36e+00

Pos logit contributions
and+0.004
 contain+0.003
 Follow+0.003
 Stay+0.003
engu+0.003
Neg logit contributions
burn-0.007
elle-0.007
ky-0.007
ine-0.006
en-0.006
Token and
Token ID290
Feature activation-1.79e-01

Pos logit contributions
and+0.051
 contain+0.046
 Follow+0.043
 Stay+0.040
 Madd+0.039
Neg logit contributions
elle-0.056
ami-0.056
burn-0.055
andra-0.054
aman-0.052
Token g
Token ID308
Feature activation+5.04e-01

Pos logit contributions
and+0.007
 contain+0.007
 Follow+0.006
 Madd+0.006
 Stay+0.006
Neg logit contributions
elle-0.007
ami-0.007
ky-0.006
burn-0.006
andra-0.006
Tokeniggle
Token ID24082
Feature activation-6.68e-01

Pos logit contributions
elle+0.022
burn+0.021
ky+0.021
ami+0.020
ah+0.020
Neg logit contributions
and-0.018
 contain-0.016
 Follow-0.015
 Madd-0.015
oles-0.014
Token to
Token ID284
Feature activation+7.70e-01

Pos logit contributions
elle+0.019
burn+0.018
ami+0.018
ky+0.018
andra+0.017
Neg logit contributions
and-0.021
 contain-0.019
 Follow-0.019
 Stay-0.017
 Madd-0.017
Token the
Token ID262
Feature activation+2.72e-01

Pos logit contributions
elle+0.025
burn+0.024
ky+0.024
ami+0.023
ah+0.022
Neg logit contributions
and-0.035
 contain-0.032
 Follow-0.032
 Madd-0.030
 Stay-0.030
Token shore
Token ID15191
Feature activation-4.39e-01

Pos logit contributions
elle+0.012
burn+0.011
ky+0.011
ami+0.011
ah+0.011
Neg logit contributions
and-0.010
 contain-0.009
 Follow-0.008
 Madd-0.008
 Stay-0.008
Token of
Token ID286
Feature activation-9.14e-02

Pos logit contributions
and+0.019
 contain+0.018
 Follow+0.018
 Stay+0.017
 Madd+0.017
Neg logit contributions
elle-0.015
burn-0.014
ky-0.014
ami-0.014
andra-0.013
Token the
Token ID262
Feature activation+2.67e-01

Pos logit contributions
and+0.004
 contain+0.004
 Follow+0.003
 Madd+0.003
 tad+0.003
Neg logit contributions
elle-0.003
burn-0.003
ami-0.003
ky-0.003
ah-0.003
Token beach
Token ID10481
Feature activation+6.81e-01

Pos logit contributions
elle+0.011
ky+0.011
burn+0.011
ami+0.011
ep+0.010
Neg logit contributions
and-0.010
 contain-0.009
 Follow-0.009
 Madd-0.008
oles-0.008
Token.
Token ID13
Feature activation+2.29e-01

Pos logit contributions
elle+0.024
burn+0.024
ami+0.023
ky+0.022
andra+0.022
Neg logit contributions
and-0.029
 contain-0.027
 Follow-0.026
 Stay-0.025
oles-0.024
Token There
Token ID1318
Feature activation-1.84e-01

Pos logit contributions
ky+0.008
ine+0.008
andra+0.008
elle+0.008
 str+0.008
Neg logit contributions
and-0.009
 contain-0.009
unn-0.008
 Follow-0.008
 Madd-0.008
Token,
Token ID11
Feature activation+2.07e-01

Pos logit contributions
and+0.008
 contain+0.008
 hind+0.008
 Follow+0.008
unn+0.007
Neg logit contributions
en-0.006
elle-0.006
ah-0.005
aman-0.005
ine-0.005
Token they
Token ID484
Feature activation+3.89e-02

Pos logit contributions
elle+0.006
ky+0.006
burn+0.006
 str+0.006
ami+0.005
Neg logit contributions
and-0.010
 Follow-0.009
 contain-0.009
unn-0.009
engu-0.008
Token saw
Token ID2497
Feature activation-1.63e-02

Pos logit contributions
elle+0.001
burn+0.001
ky+0.001
ami+0.001
ah+0.001
Neg logit contributions
and-0.002
 contain-0.002
 Follow-0.002
 Madd-0.002
 Stay-0.002
Token He
Token ID679
Feature activation-4.46e-01

Pos logit contributions
and+0.026
 contain+0.025
 Follow+0.023
 tad+0.021
unn+0.021
Neg logit contributions
elle-0.033
burn-0.032
ky-0.032
ami-0.031
ine-0.031
Token realised
Token ID19169
Feature activation-9.56e-01

Pos logit contributions
and+0.016
 Follow+0.015
 contain+0.015
 Stay+0.014
 Madd+0.014
Neg logit contributions
burn-0.018
elle-0.018
ami-0.017
ky-0.017
andra-0.016
Token it
Token ID340
Feature activation-2.66e-01

Pos logit contributions
and+0.038
 Madd+0.035
 contain+0.034
oles+0.034
 Stay+0.033
Neg logit contributions
burn-0.036
ited-0.035
ky-0.034
elle-0.034
ami-0.033
Token was
Token ID373
Feature activation+4.14e-02

Pos logit contributions
and+0.007
 contain+0.006
 Follow+0.006
 Stay+0.006
 Madd+0.006
Neg logit contributions
burn-0.013
elle-0.013
ky-0.013
ami-0.013
andra-0.013
Token ok
Token ID12876
Feature activation+9.53e-02

Pos logit contributions
elle+0.002
burn+0.002
ami+0.002
ky+0.001
andra+0.001
Neg logit contributions
and-0.002
 contain-0.002
 Madd-0.001
 Follow-0.001
 Stay-0.001
Token to
Token ID284
Feature activation+6.06e-01

Pos logit contributions
elle+0.004
burn+0.003
ky+0.003
ami+0.003
ine+0.003
Neg logit contributions
and-0.004
 contain-0.004
 Follow-0.004
 tad-0.003
engu-0.003
Token be
Token ID307
Feature activation+9.09e-01

Pos logit contributions
elle+0.032
ami+0.031
ky+0.030
burn+0.030
ep+0.029
Neg logit contributions
 contain-0.017
 Madd-0.014
 Stay-0.014
and-0.014
 Follow-0.013
Token scared
Token ID12008
Feature activation+5.77e-01

Pos logit contributions
elle+0.043
ami+0.043
burn+0.043
aman+0.042
andra+0.042
Neg logit contributions
and-0.028
 Follow-0.024
 Madd-0.024
 Stay-0.024
 contain-0.023
Token of
Token ID286
Feature activation-1.33e-01

Pos logit contributions
elle+0.021
burn+0.020
ky+0.020
en+0.019
ami+0.019
Neg logit contributions
and-0.023
 contain-0.022
 Follow-0.022
engu-0.021
 tad-0.020
Token loud
Token ID7812
Feature activation+5.67e-01

Pos logit contributions
and+0.005
 contain+0.005
 Follow+0.005
 Madd+0.005
 Stay+0.005
Neg logit contributions
burn-0.005
elle-0.005
andra-0.005
ami-0.005
ah-0.005
Token things
Token ID1243
Feature activation+1.38e+00

Pos logit contributions
elle+0.021
burn+0.021
ami+0.020
ky+0.020
andra+0.020
Neg logit contributions
and-0.022
 contain-0.020
 Follow-0.020
 Stay-0.019
 Madd-0.019
Token it
Token ID340
Feature activation-4.63e-01

Pos logit contributions
and+0.044
 contain+0.041
 Madd+0.039
 Follow+0.039
 Stay+0.038
Neg logit contributions
ited-0.031
ky-0.031
burn-0.031
ami-0.030
ah-0.030
Token off
Token ID572
Feature activation+7.26e-01

Pos logit contributions
and+0.014
 contain+0.012
 Follow+0.011
 Madd+0.010
 Stay+0.010
Neg logit contributions
burn-0.023
ami-0.023
elle-0.023
ky-0.023
andra-0.023
Token to
Token ID284
Feature activation+4.49e-01

Pos logit contributions
burn+0.036
elle+0.035
ami+0.035
andra+0.035
ah+0.035
Neg logit contributions
and-0.022
 contain-0.019
 Madd-0.017
oles-0.016
 Follow-0.015
Token everyone
Token ID2506
Feature activation+9.98e-01

Pos logit contributions
burn+0.016
elle+0.015
ky+0.015
ami+0.015
ited+0.015
Neg logit contributions
and-0.019
 contain-0.019
 Follow-0.018
 Stay-0.017
 Madd-0.017
Token.
Token ID13
Feature activation-3.51e-01

Pos logit contributions
burn+0.040
elle+0.039
ky+0.038
ami+0.038
andra+0.038
Neg logit contributions
and-0.040
 contain-0.036
 Follow-0.033
 Madd-0.032
oles-0.032
Token Everyone
Token ID11075
Feature activation+1.51e+00

Pos logit contributions
and+0.012
 contain+0.011
 Follow+0.011
 Madd+0.010
 Stay+0.010
Neg logit contributions
elle-0.015
burn-0.015
ky-0.014
ami-0.014
ah-0.014
Token compliment
Token ID19370
Feature activation+8.16e-01

Pos logit contributions
burn+0.077
ami+0.072
aman+0.071
andra+0.071
ky+0.071
Neg logit contributions
and-0.044
 contain-0.040
 Follow-0.036
 Madd-0.034
 Stay-0.033
Tokened
Token ID276
Feature activation-8.19e-01

Pos logit contributions
burn+0.044
ami+0.042
elle+0.042
aman+0.042
ep+0.042
Neg logit contributions
and-0.020
 Follow-0.017
 contain-0.016
 Madd-0.015
 Stay-0.014
Token Molly
Token ID30236
Feature activation-2.54e-01

Pos logit contributions
and+0.040
 contain+0.036
 Follow+0.036
 Madd+0.035
 Stay+0.034
Neg logit contributions
burn-0.025
elle-0.024
ami-0.023
ky-0.022
ep-0.022
Token on
Token ID319
Feature activation+4.60e-01

Pos logit contributions
and+0.009
 contain+0.008
 Follow+0.008
 Madd+0.007
 Stay+0.007
Neg logit contributions
burn-0.011
ami-0.010
elle-0.010
ky-0.010
andra-0.010
Token her
Token ID607
Feature activation+3.15e-01

Pos logit contributions
burn+0.017
ited+0.016
elle+0.016
ah+0.015
ep+0.015
Neg logit contributions
and-0.020
 contain-0.018
 Madd-0.017
 Stay-0.016
 tad-0.016
Token a
Token ID257
Feature activation-2.03e-01

Pos logit contributions
elle+0.003
burn+0.003
ky+0.003
ami+0.003
ah+0.003
Neg logit contributions
and-0.005
 contain-0.005
 Follow-0.005
 Stay-0.005
 Madd-0.005
Token big
Token ID1263
Feature activation+6.28e-01

Pos logit contributions
and+0.008
 contain+0.007
 Follow+0.007
 Stay+0.006
unn+0.006
Neg logit contributions
elle-0.008
burn-0.008
ky-0.008
ami-0.007
ah-0.007
Token yellow
Token ID7872
Feature activation+8.92e-01

Pos logit contributions
elle+0.026
burn+0.026
ky+0.025
ami+0.025
ah+0.024
Neg logit contributions
and-0.022
 contain-0.020
 Follow-0.020
 Madd-0.018
 Stay-0.018
Token carpet
Token ID20710
Feature activation+8.73e-01

Pos logit contributions
elle+0.038
burn+0.037
ky+0.037
ami+0.036
ah+0.035
Neg logit contributions
and-0.031
 contain-0.028
 Follow-0.028
 Madd-0.025
 Stay-0.025
Token on
Token ID319
Feature activation+6.15e-01

Pos logit contributions
elle+0.039
burn+0.039
andra+0.037
ami+0.037
ky+0.036
Neg logit contributions
and-0.028
 contain-0.027
 Follow-0.025
 Madd-0.023
oles-0.023
Token the
Token ID262
Feature activation+2.45e-01

Pos logit contributions
burn+0.027
ited+0.026
elle+0.025
ah+0.025
ami+0.025
Neg logit contributions
 contain-0.020
 Madd-0.019
and-0.019
 Stay-0.017
 tad-0.017
Token floor
Token ID4314
Feature activation+6.67e-02

Pos logit contributions
ky+0.010
elle+0.010
burn+0.009
ine+0.009
en+0.009
Neg logit contributions
and-0.009
oles-0.009
 Follow-0.008
unn-0.008
 tad-0.008
Token and
Token ID290
Feature activation-4.24e-02

Pos logit contributions
burn+0.003
elle+0.003
andra+0.003
ah+0.003
ited+0.003
Neg logit contributions
and-0.002
 contain-0.002
oles-0.002
 Follow-0.002
 Stay-0.002
Token she
Token ID673
Feature activation+6.06e-01

Pos logit contributions
and+0.002
 contain+0.002
 Follow+0.002
 Madd+0.002
 Stay+0.002
Neg logit contributions
elle-0.001
burn-0.001
ky-0.001
ami-0.001
ah-0.001
Token had
Token ID550
Feature activation+5.76e-02

Pos logit contributions
elle+0.024
ky+0.024
ine+0.023
burn+0.023
ami+0.023
Neg logit contributions
and-0.022
 contain-0.021
 Follow-0.020
 Madd-0.019
oles-0.019
Token so
Token ID523
Feature activation-8.47e-01

Pos logit contributions
elle+0.002
burn+0.002
ami+0.002
ky+0.002
ited+0.002
Neg logit contributions
and-0.002
 contain-0.002
 Follow-0.002
 Stay-0.002
 Madd-0.002
Token g
Token ID308
Feature activation+3.85e-01

Pos logit contributions
and+0.011
 contain+0.010
 Follow+0.009
 Madd+0.009
 Stay+0.008
Neg logit contributions
elle-0.023
burn-0.022
ky-0.022
ami-0.022
ah-0.021
Tokenlee
Token ID7197
Feature activation-1.20e+00

Pos logit contributions
elle+0.015
burn+0.015
ky+0.014
ami+0.014
ine+0.014
Neg logit contributions
and-0.015
 contain-0.014
 Follow-0.014
 Stay-0.012
 Madd-0.012
Token as
Token ID355
Feature activation+1.05e+00

Pos logit contributions
and+0.051
 contain+0.048
 Follow+0.046
 Madd+0.044
oles+0.043
Neg logit contributions
elle-0.041
ami-0.040
burn-0.040
andra-0.039
ky-0.039
Token the
Token ID262
Feature activation+2.56e-01

Pos logit contributions
ky+0.036
elle+0.036
ited+0.035
ami+0.035
burn+0.034
Neg logit contributions
and-0.046
 Madd-0.043
 contain-0.042
 Follow-0.040
 Stay-0.039
Token water
Token ID1660
Feature activation-3.38e-01

Pos logit contributions
burn+0.009
elle+0.009
ky+0.008
ami+0.008
ah+0.008
Neg logit contributions
and-0.011
 Follow-0.010
 contain-0.010
engu-0.010
oles-0.009
Token soaked
Token ID33657
Feature activation-1.25e+00

Pos logit contributions
and+0.013
 contain+0.013
 Follow+0.012
 Madd+0.012
oles+0.012
Neg logit contributions
elle-0.013
ami-0.012
ky-0.012
burn-0.012
andra-0.012
Token her
Token ID607
Feature activation+2.48e-01

Pos logit contributions
and+0.061
 contain+0.057
 Follow+0.055
 Madd+0.055
 Stay+0.053
Neg logit contributions
elle-0.036
ami-0.034
burn-0.034
ky-0.033
andra-0.033
Token clothes
Token ID8242
Feature activation-5.91e-01

Pos logit contributions
burn+0.010
elle+0.010
ky+0.010
ine+0.009
en+0.009
Neg logit contributions
and-0.009
 Follow-0.009
 contain-0.008
engu-0.008
unn-0.008
Token and
Token ID290
Feature activation-2.25e-01

Pos logit contributions
 contain+0.022
and+0.022
 Madd+0.018
 plunge+0.018
oles+0.017
Neg logit contributions
elle-0.025
ami-0.024
andra-0.023
ah-0.022
isdom-0.022
Token hair
Token ID4190
Feature activation+1.43e+00

Pos logit contributions
and+0.008
 contain+0.008
 Follow+0.008
 Madd+0.007
 Stay+0.007
Neg logit contributions
elle-0.009
burn-0.009
ami-0.009
ky-0.009
ah-0.008
Token.
Token ID13
Feature activation+1.51e-03

Pos logit contributions
andra+0.065
ami+0.064
elle+0.063
isdom+0.063
aman+0.061
Neg logit contributions
and-0.048
 contain-0.045
oles-0.037
 Madd-0.037
balls-0.036
Token it
Token ID340
Feature activation-3.89e-01

Pos logit contributions
and+0.015
 Follow+0.014
 contain+0.014
 Stay+0.013
engu+0.013
Neg logit contributions
elle-0.009
ami-0.008
ky-0.008
ine-0.008
burn-0.008
Token was
Token ID373
Feature activation-1.89e-02

Pos logit contributions
and+0.011
 contain+0.010
 Follow+0.010
 Madd+0.009
 Stay+0.009
Neg logit contributions
burn-0.019
elle-0.019
ami-0.018
ky-0.018
andra-0.017
Token so
Token ID523
Feature activation-1.20e+00

Pos logit contributions
and+0.001
 contain+0.001
 Follow+0.001
 Madd+0.001
 Stay+0.001
Neg logit contributions
elle-0.001
burn-0.001
ky-0.001
ami-0.001
ah-0.001
Token nice
Token ID3621
Feature activation+1.20e-02

Pos logit contributions
and+0.044
 contain+0.041
 Follow+0.040
 Madd+0.037
 Stay+0.037
Neg logit contributions
elle-0.049
burn-0.049
ky-0.047
ami-0.047
ah-0.045
Token,
Token ID11
Feature activation-1.05e-01

Pos logit contributions
burn+0.001
elle+0.001
ep+0.001
ited+0.000
andra+0.000
Neg logit contributions
and-0.000
 contain-0.000
 Follow-0.000
 Madd-0.000
oles-0.000
Token so
Token ID523
Feature activation-1.20e+00

Pos logit contributions
and+0.004
 contain+0.003
 Follow+0.003
 Madd+0.003
 Stay+0.003
Neg logit contributions
elle-0.005
burn-0.004
ky-0.004
ami-0.004
ah-0.004
Token she
Token ID673
Feature activation+4.96e-01

Pos logit contributions
and+0.054
 contain+0.050
 Madd+0.049
 Follow+0.049
 Stay+0.047
Neg logit contributions
elle-0.039
burn-0.039
ky-0.038
ami-0.037
ah-0.036
Token picked
Token ID6497
Feature activation+8.51e-01

Pos logit contributions
elle+0.018
burn+0.017
ky+0.017
ami+0.017
ine+0.016
Neg logit contributions
and-0.021
 contain-0.019
 Follow-0.018
 Stay-0.017
 Madd-0.017
Token up
Token ID510
Feature activation+1.64e+00

Pos logit contributions
burn+0.041
ited+0.040
ep+0.040
elle+0.040
ky+0.039
Neg logit contributions
 Madd-0.022
and-0.022
 tad-0.021
 nuts-0.019
 plunge-0.018
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
ep+0.076
burn+0.075
elle+0.074
aman+0.073
ami+0.072
Neg logit contributions
 Madd-0.046
 nuts-0.045
 tad-0.044
and-0.044
 contain-0.041
Token peanut
Token ID26636
Feature activation+8.32e-01

Pos logit contributions
burn+0.004
elle+0.004
ky+0.004
ah+0.004
ami+0.004
Neg logit contributions
and-0.007
 Follow-0.007
 contain-0.007
adders-0.006
 Stay-0.006
Token not
Token ID407
Feature activation+1.61e+00

Pos logit contributions
and+0.007
 contain+0.007
 Stay+0.007
 hide+0.006
 Follow+0.006
Neg logit contributions
ami-0.017
ky-0.016
elle-0.016
ited-0.016
burn-0.016
Token wake
Token ID7765
Feature activation-5.41e-01

Pos logit contributions
elle+0.078
ami+0.076
aman+0.076
burn+0.075
ky+0.072
Neg logit contributions
 contain-0.049
and-0.046
 Stay-0.041
 Madd-0.039
 Follow-0.039
Token up
Token ID510
Feature activation+1.70e+00

Pos logit contributions
and+0.017
 contain+0.015
 Madd+0.014
 Stay+0.014
 Follow+0.014
Neg logit contributions
elle-0.024
ited-0.024
ami-0.023
ah-0.023
ky-0.023
Token when
Token ID618
Feature activation+4.35e-02

Pos logit contributions
elle+0.073
burn+0.069
ami+0.068
ah+0.068
andra+0.068
Neg logit contributions
and-0.061
 contain-0.057
 Stay-0.051
 Madd-0.050
oles-0.049
Token the
Token ID262
Feature activation+1.32e-01

Pos logit contributions
elle+0.001
burn+0.001
ami+0.001
ine+0.001
ky+0.001
Neg logit contributions
and-0.002
 Follow-0.002
 contain-0.002
unn-0.002
engu-0.002
Token air
Token ID1633
Feature activation+2.02e-01

Pos logit contributions
elle+0.004
burn+0.004
ky+0.004
ep+0.004
ah+0.004
Neg logit contributions
and-0.006
 contain-0.006
 Follow-0.006
 Madd-0.005
oles-0.005
Token inside
Token ID2641
Feature activation-8.17e-01

Pos logit contributions
elle+0.010
ami+0.009
ky+0.009
burn+0.009
andra+0.009
Neg logit contributions
 contain-0.005
and-0.005
 Stay-0.005
 Madd-0.005
oles-0.005
Token became
Token ID2627
Feature activation+1.35e+00

Pos logit contributions
and+0.027
 contain+0.025
 Follow+0.024
 Stay+0.023
 Madd+0.023
Neg logit contributions
elle-0.036
burn-0.035
ky-0.034
ami-0.034
ah-0.033
Token cold
Token ID4692
Feature activation-7.09e-01

Pos logit contributions
elle+0.061
ami+0.059
ky+0.056
andra+0.055
ah+0.055
Neg logit contributions
and-0.042
 Madd-0.038
 contain-0.038
 tum-0.037
 tad-0.037
Token and
Token ID290
Feature activation+9.09e-03

Pos logit contributions
and+0.029
 contain+0.027
 Follow+0.026
 Stay+0.025
 Madd+0.025
Neg logit contributions
elle-0.025
burn-0.024
ami-0.024
ky-0.024
andra-0.023
Token thin
Token ID7888
Feature activation-1.45e-01

Pos logit contributions
elle+0.000
ami+0.000
burn+0.000
ky+0.000
andra+0.000
Neg logit contributions
and-0.000
 contain-0.000
 Follow-0.000
 Madd-0.000
 Stay-0.000
Token She
Token ID1375
Feature activation+1.63e-01

Pos logit contributions
elle+0.004
ami+0.004
ky+0.004
andra+0.003
ine+0.003
Neg logit contributions
and-0.004
 contain-0.004
 Follow-0.004
unn-0.004
engu-0.004
Token flew
Token ID13112
Feature activation-2.60e-01

Pos logit contributions
burn+0.007
elle+0.007
ami+0.007
ky+0.007
aman+0.006
Neg logit contributions
and-0.006
 contain-0.005
 Follow-0.005
 tad-0.005
 Madd-0.005
Token and
Token ID290
Feature activation-2.57e-01

Pos logit contributions
and+0.009
 contain+0.008
 Follow+0.008
 Madd+0.008
 Stay+0.007
Neg logit contributions
elle-0.011
burn-0.011
ami-0.011
ky-0.010
andra-0.010
Token flew
Token ID13112
Feature activation-2.68e-01

Pos logit contributions
and+0.008
 contain+0.008
 Follow+0.008
 Madd+0.008
 tad+0.007
Neg logit contributions
ami-0.012
burn-0.011
aman-0.011
isdom-0.011
andra-0.011
Token and
Token ID290
Feature activation-2.74e-01

Pos logit contributions
and+0.010
 contain+0.009
 Follow+0.008
 Madd+0.008
 Stay+0.008
Neg logit contributions
elle-0.011
burn-0.011
ami-0.011
ky-0.011
andra-0.011
Token flew
Token ID13112
Feature activation-2.73e-01

Pos logit contributions
and+0.011
 contain+0.010
 Follow+0.010
 Madd+0.009
 tad+0.009
Neg logit contributions
ami-0.011
elle-0.010
burn-0.010
aman-0.010
andra-0.010
Token until
Token ID1566
Feature activation-5.46e-01

Pos logit contributions
and+0.010
 contain+0.009
 Follow+0.009
 Madd+0.008
 Stay+0.008
Neg logit contributions
elle-0.011
burn-0.011
ami-0.011
ky-0.011
andra-0.011
Token she
Token ID673
Feature activation+1.51e-01

Pos logit contributions
and+0.021
 Madd+0.020
 contain+0.020
 Follow+0.018
 tad+0.018
Neg logit contributions
burn-0.021
ky-0.021
ami-0.021
elle-0.020
ep-0.020
Token was
Token ID373
Feature activation-7.25e-02

Pos logit contributions
burn+0.007
elle+0.007
ky+0.006
ami+0.006
aman+0.006
Neg logit contributions
and-0.005
 contain-0.005
 Follow-0.004
 tad-0.004
 Madd-0.004
Token back
Token ID736
Feature activation-1.47e+00

Pos logit contributions
and+0.002
 contain+0.002
 Madd+0.002
 Stay+0.002
 Follow+0.002
Neg logit contributions
ami-0.004
elle-0.004
ky-0.003
aman-0.003
andra-0.003
Token home
Token ID1363
Feature activation-1.80e+00

Pos logit contributions
and+0.041
 contain+0.033
 Follow+0.030
 Madd+0.030
 Stay+0.029
Neg logit contributions
elle-0.080
ami-0.076
andra-0.076
ky-0.074
ah-0.073
Token the
Token ID262
Feature activation+2.17e-01

Pos logit contributions
burn+0.026
ami+0.026
elle+0.026
ited+0.025
ky+0.025
Neg logit contributions
and-0.025
 contain-0.024
 Follow-0.023
 Stay-0.023
 Madd-0.022
Token station
Token ID4429
Feature activation+1.57e-01

Pos logit contributions
elle+0.007
burn+0.007
ky+0.007
ami+0.007
ep+0.006
Neg logit contributions
and-0.010
 contain-0.009
 Follow-0.009
 Madd-0.009
oles-0.008
Token,
Token ID11
Feature activation-1.92e-03

Pos logit contributions
elle+0.006
burn+0.006
ami+0.005
ky+0.005
andra+0.005
Neg logit contributions
and-0.006
 contain-0.006
 Follow-0.006
 Stay-0.005
 Madd-0.005
Token but
Token ID475
Feature activation-4.25e-01

Pos logit contributions
and+0.000
 contain+0.000
 Follow+0.000
oles+0.000
engu+0.000
Neg logit contributions
elle-0.000
burn-0.000
ky-0.000
ami-0.000
ine-0.000
Token he
Token ID339
Feature activation+2.97e-03

Pos logit contributions
and+0.023
unn+0.020
 contain+0.020
 Follow+0.020
engu+0.020
Neg logit contributions
elle-0.011
ah-0.010
 str-0.009
ep-0.009
ine-0.009
Token couldn
Token ID3521
Feature activation-2.04e+00

Pos logit contributions
elle+0.000
ah+0.000
ine+0.000
burn+0.000
ky+0.000
Neg logit contributions
and-0.000
 contain-0.000
 Follow-0.000
engu-0.000
 Madd-0.000
Token't
Token ID470
Feature activation+6.45e-01

Pos logit contributions
and+0.084
 contain+0.077
 Follow+0.075
 tad+0.069
 Madd+0.069
Neg logit contributions
elle-0.076
burn-0.075
ami-0.070
ky-0.070
ah-0.069
Token get
Token ID651
Feature activation+1.81e-01

Pos logit contributions
elle+0.026
burn+0.025
ky+0.025
ami+0.024
ah+0.024
Neg logit contributions
and-0.024
 contain-0.022
 Follow-0.022
 Madd-0.020
 Stay-0.020
Token out
Token ID503
Feature activation+7.92e-01

Pos logit contributions
elle+0.008
ami+0.008
ky+0.007
andra+0.007
aman+0.007
Neg logit contributions
and-0.006
 contain-0.006
 Follow-0.006
 Stay-0.006
 Madd-0.006
Token.
Token ID13
Feature activation+4.13e-02

Pos logit contributions
elle+0.030
burn+0.030
ami+0.029
ky+0.029
andra+0.029
Neg logit contributions
and-0.031
 contain-0.029
 Follow-0.028
 Stay-0.026
 Madd-0.026
Token He
Token ID679
Feature activation-1.45e-01

Pos logit contributions
elle+0.001
aman+0.001
ah+0.001
andra+0.001
ky+0.001
Neg logit contributions
 contain-0.002
and-0.002
unn-0.002
suits-0.001
 plunge-0.001
Token eating
Token ID6600
Feature activation-1.42e+00

Pos logit contributions
elle+0.002
burn+0.002
ami+0.002
ky+0.002
andra+0.002
Neg logit contributions
and-0.002
 contain-0.002
 Follow-0.002
 Madd-0.002
 Stay-0.002
Token some
Token ID617
Feature activation+1.34e-01

Pos logit contributions
and+0.055
 contain+0.054
 Follow+0.052
engu+0.048
unn+0.047
Neg logit contributions
elle-0.052
ky-0.050
burn-0.049
ami-0.048
ah-0.048
Token fuel
Token ID5252
Feature activation+2.53e-01

Pos logit contributions
elle+0.005
ky+0.004
ited+0.004
en+0.004
burn+0.004
Neg logit contributions
and-0.005
 contain-0.005
 Follow-0.005
engu-0.005
stack-0.005
Token.
Token ID13
Feature activation+2.68e-01

Pos logit contributions
elle+0.007
burn+0.007
ky+0.007
ami+0.006
ine+0.006
Neg logit contributions
and-0.012
 contain-0.012
 Follow-0.012
 Stay-0.011
 Madd-0.011
Token Max
Token ID5436
Feature activation+6.14e-01

Pos logit contributions
elle+0.011
ky+0.010
burn+0.010
ami+0.010
andra+0.010
Neg logit contributions
and-0.010
 contain-0.010
 Follow-0.009
 tad-0.008
 Madd-0.008
Token couldn
Token ID3521
Feature activation-1.90e+00

Pos logit contributions
elle+0.025
burn+0.025
ky+0.024
ami+0.024
ah+0.023
Neg logit contributions
and-0.022
 contain-0.020
 Follow-0.020
 Madd-0.018
 Stay-0.018
Token't
Token ID470
Feature activation+7.52e-01

Pos logit contributions
and+0.082
 contain+0.075
 Follow+0.072
engu+0.068
 tad+0.067
Neg logit contributions
elle-0.067
burn-0.065
ah-0.062
ky-0.061
ami-0.061
Token believe
Token ID1975
Feature activation-7.28e-01

Pos logit contributions
elle+0.032
ky+0.032
burn+0.031
ah+0.031
ami+0.031
Neg logit contributions
and-0.026
 Follow-0.023
engu-0.021
oles-0.021
 contain-0.021
Token it
Token ID340
Feature activation-4.24e-01

Pos logit contributions
and+0.030
 contain+0.028
 Madd+0.026
 Follow+0.025
 Stay+0.025
Neg logit contributions
burn-0.027
elle-0.027
ami-0.026
ited-0.026
andra-0.025
Token.
Token ID13
Feature activation+2.25e-01

Pos logit contributions
and+0.014
 contain+0.013
 Follow+0.013
 Stay+0.012
oles+0.012
Neg logit contributions
burn-0.019
elle-0.018
ami-0.018
andra-0.017
ited-0.017
Token Everyone
Token ID11075
Feature activation+1.54e+00

Pos logit contributions
elle+0.009
ky+0.009
andra+0.009
burn+0.009
ami+0.009
Neg logit contributions
and-0.008
 contain-0.008
 Follow-0.007
oles-0.007
unn-0.007
Token allowed
Token ID3142
Feature activation-7.86e-01

Pos logit contributions
burn+0.012
elle+0.011
ami+0.011
ky+0.011
aman+0.011
Neg logit contributions
and-0.008
 contain-0.008
 Follow-0.008
 Madd-0.007
 Stay-0.007
Token him
Token ID683
Feature activation-2.11e+00

Pos logit contributions
and+0.031
 contain+0.029
 Follow+0.028
 Madd+0.026
 Stay+0.026
Neg logit contributions
elle-0.030
burn-0.029
ky-0.029
ami-0.028
ah-0.027
Token to
Token ID284
Feature activation+3.31e-01

Pos logit contributions
and+0.069
 contain+0.063
 Follow+0.061
 Stay+0.056
 Madd+0.056
Neg logit contributions
elle-0.095
burn-0.093
ky-0.091
ami-0.091
ah-0.088
Token draw
Token ID3197
Feature activation+1.12e+00

Pos logit contributions
elle+0.015
ky+0.014
ami+0.014
burn+0.014
ah+0.013
Neg logit contributions
and-0.011
 contain-0.010
 Follow-0.010
 Stay-0.009
 Madd-0.009
Token outside
Token ID2354
Feature activation-3.18e-01

Pos logit contributions
elle+0.037
burn+0.037
ky+0.035
ami+0.035
andra+0.034
Neg logit contributions
and-0.050
 contain-0.046
 Follow-0.045
 Madd-0.043
 Stay-0.042
Token from
Token ID422
Feature activation-3.10e+00

Pos logit contributions
and+0.013
 contain+0.012
 Follow+0.012
 Madd+0.011
 Stay+0.011
Neg logit contributions
elle-0.011
burn-0.011
ami-0.011
ky-0.011
andra-0.010
Token then
Token ID788
Feature activation-1.20e+00

Pos logit contributions
and+0.123
 contain+0.115
 Follow+0.109
 Madd+0.109
 Stay+0.105
Neg logit contributions
burn-0.114
elle-0.111
ami-0.107
ited-0.106
ah-0.106
Token on
Token ID319
Feature activation+3.72e-01

Pos logit contributions
and+0.042
 contain+0.040
 Follow+0.040
 Stay+0.037
 Madd+0.035
Neg logit contributions
elle-0.050
ami-0.048
burn-0.048
ited-0.047
ky-0.047
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
burn+0.015
ited+0.015
ami+0.015
elle+0.014
ah+0.014
Neg logit contributions
and-0.013
 contain-0.012
 Madd-0.012
 Stay-0.011
 Follow-0.011
Token Tim
Token ID5045
Feature activation-5.39e-01

Pos logit contributions
and+0.012
 contain+0.011
 Follow+0.011
 Madd+0.010
 Stay+0.010
Neg logit contributions
elle-0.016
burn-0.016
ky-0.016
ami-0.015
ah-0.015
Tokenmy
Token ID1820
Feature activation+9.52e-01

Pos logit contributions
and+0.025
 Follow+0.023
 Stay+0.022
 contain+0.021
unn+0.021
Neg logit contributions
burn-0.016
ami-0.015
elle-0.015
ky-0.015
aman-0.015
Token chest
Token ID7721
Feature activation-1.37e+00

Pos logit contributions
and+0.019
 contain+0.018
 Follow+0.017
 Madd+0.017
 Stay+0.017
Neg logit contributions
ami-0.025
elle-0.025
ky-0.025
andra-0.023
burn-0.023
Token."
Token ID526
Feature activation+1.17e-01

Pos logit contributions
and+0.044
 contain+0.040
 Follow+0.038
 Stay+0.035
 Madd+0.033
Neg logit contributions
elle-0.063
ami-0.061
andra-0.060
burn-0.060
ky-0.059
Token
Token ID198
Feature activation-1.03e-01

Pos logit contributions
elle+0.005
burn+0.005
ky+0.005
ami+0.005
ah+0.005
Neg logit contributions
and-0.004
 contain-0.003
 Follow-0.003
 Madd-0.003
 Stay-0.003
Token
Token ID198
Feature activation-3.69e-02

Pos logit contributions
 contain+0.004
 Follow+0.004
 plunge+0.004
and+0.004
 tad+0.004
Neg logit contributions
elle-0.004
ep-0.003
ami-0.003
ine-0.003
ky-0.003
TokenL
Token ID43
Feature activation+3.80e-01

Pos logit contributions
and+0.002
 contain+0.002
 Follow+0.002
 Madd+0.001
 Stay+0.001
Neg logit contributions
elle-0.001
burn-0.001
ky-0.001
ami-0.001
ah-0.001
Tokenily
Token ID813
Feature activation-2.08e+00

Pos logit contributions
elle+0.023
burn+0.022
ami+0.022
ky+0.022
ah+0.021
Neg logit contributions
and-0.007
 contain-0.006
 Follow-0.005
 Madd-0.005
 Stay-0.004
Token felt
Token ID2936
Feature activation+7.91e-01

Pos logit contributions
and+0.068
 contain+0.063
 Follow+0.063
 Stay+0.057
 Madd+0.057
Neg logit contributions
elle-0.089
burn-0.089
ami-0.086
ky-0.085
aman-0.082
Token happy
Token ID3772
Feature activation+4.43e-01

Pos logit contributions
elle+0.031
burn+0.030
ky+0.030
ami+0.029
andra+0.028
Neg logit contributions
and-0.030
 contain-0.027
 Follow-0.027
oles-0.025
 Madd-0.025
Token that
Token ID326
Feature activation+1.20e-01

Pos logit contributions
elle+0.017
burn+0.017
ky+0.017
ami+0.016
ah+0.016
Neg logit contributions
and-0.018
 contain-0.016
 Madd-0.015
 Follow-0.015
 Stay-0.015
Token she
Token ID673
Feature activation+2.38e-01

Pos logit contributions
elle+0.004
ky+0.004
burn+0.003
ami+0.003
ep+0.003
Neg logit contributions
and-0.006
 contain-0.005
 Follow-0.005
 Stay-0.005
 Madd-0.005
Token could
Token ID714
Feature activation-1.08e+00

Pos logit contributions
ami+0.012
aman+0.012
elle+0.012
burn+0.012
ky+0.012
Neg logit contributions
 Follow-0.006
and-0.006
 contain-0.005
 Stay-0.005
tted-0.004
Token."
Token ID526
Feature activation+2.22e-01

Pos logit contributions
and+0.041
 contain+0.038
 Follow+0.037
 Madd+0.034
 Stay+0.034
Neg logit contributions
elle-0.045
burn-0.044
ky-0.043
ami-0.042
ah-0.041
Token
Token ID220
Feature activation+6.70e-01

Pos logit contributions
elle+0.010
burn+0.010
ky+0.009
ami+0.009
andra+0.009
Neg logit contributions
and-0.007
 contain-0.007
 Follow-0.007
 Madd-0.006
 Stay-0.006
Token
Token ID198
Feature activation-1.40e-02

Pos logit contributions
burn+0.023
elle+0.022
ami+0.022
andra+0.021
aman+0.021
Neg logit contributions
 contain-0.028
and-0.025
 Follow-0.025
 tad-0.025
 Madd-0.023
Token
Token ID198
Feature activation+2.61e-02

Pos logit contributions
 contain+0.001
 Follow+0.001
 plunge+0.001
 tad+0.001
and+0.001
Neg logit contributions
burn-0.000
elle-0.000
ep-0.000
ami-0.000
ky-0.000
TokenL
Token ID43
Feature activation+2.31e-01

Pos logit contributions
elle+0.001
burn+0.001
ky+0.001
ami+0.001
ah+0.001
Neg logit contributions
and-0.001
 contain-0.001
 Follow-0.001
 Stay-0.001
 Madd-0.001
Tokenily
Token ID813
Feature activation-2.21e+00

Pos logit contributions
elle+0.013
ami+0.013
burn+0.013
ky+0.012
ah+0.012
Neg logit contributions
and-0.004
 contain-0.004
 Follow-0.004
engu-0.004
 Madd-0.004
Token felt
Token ID2936
Feature activation+7.18e-01

Pos logit contributions
and+0.089
 contain+0.083
 Follow+0.081
 Madd+0.075
 Stay+0.075
Neg logit contributions
elle-0.082
burn-0.079
ky-0.078
ami-0.077
andra-0.075
Token proud
Token ID6613
Feature activation+1.29e+00

Pos logit contributions
elle+0.029
burn+0.028
ky+0.028
ami+0.027
ah+0.026
Neg logit contributions
and-0.027
 contain-0.025
 Follow-0.024
 Madd-0.022
 Stay-0.022
Token of
Token ID286
Feature activation-2.43e-01

Pos logit contributions
elle+0.068
burn+0.067
ky+0.065
ami+0.064
andra+0.064
Neg logit contributions
and-0.034
 contain-0.029
 Follow-0.028
 Madd-0.025
 Stay-0.025
Token herself
Token ID5223
Feature activation-1.21e+00

Pos logit contributions
and+0.007
 contain+0.006
 Madd+0.006
 Stay+0.006
 Follow+0.005
Neg logit contributions
burn-0.012
ited-0.012
elle-0.012
ky-0.012
ah-0.012
Token for
Token ID329
Feature activation-8.74e-01

Pos logit contributions
and+0.043
 contain+0.039
 Follow+0.038
 Stay+0.035
 Madd+0.035
Neg logit contributions
elle-0.051
burn-0.050
ky-0.049
ami-0.049
andra-0.047