TOP ACTIVATIONS
MAX = 0.084

. The cater pillar happily mun ched on the mushroom ,
looked so happy ! It mun ched and mun ched until
in his hands . He mun ched on it happily .
eating ." Daisy smiled and mun ched on her apple .
! Little John mun ched away happily , welcoming
a honey sandwich . He mun ched away happily and soon
to eat . The bear mun ched the juicy green sa
! It mun ched and mun ched until the cereal was
mouse watched as she happily mun ched away on the y
The bunny happily mun ched on his carrot and
started to eat . They mun ched and cr un ched
grabbed a handful and happily mun ched away . The popcorn
. The two of them mun ched on the banana ,
little girl was thrilled and mun ched away happily .
" ch oo cho o cho o ". They
proud of her for being such an obedient and happy little
," Be good , Holly ,. I 'll be back soon
happy that they had seen such a wonderful sight . The
all applauded him for being such a good pup . <|endoftext|>
he would never have seen such a wonderful sight ! <|endoftext|>

MIN ACTIVATIONS
MIN = -0.093

would take out a j igsaw puzzle and try to solve
box and saw a j igsaw puzzle inside . He paused
As the farmer scattered the av oc ados , he noticed
and her mom sorted the av oc ados into piles according
, and all the other av oc ados were very happy
continued to enjoy eating her av oc ados with a smile
It turned out that the av oc ados were bad and
and try to grab the av oc ados . But the
so happy and brought the av oc ados home to share
. They forget about the av oc ados . They just
Everyday he would take the av oc ados and scatter them
tree was sad that the av oc ados were being taken
With a mis chie vous gl int in
looked at him with a mis chie vous grin . She
a small boy with a mis chie vous tw inkle in
It was a big j igsaw puzzle and she wanted to
. She loved to eat av oc ados . One day
" I will buy more av oc ados tomorrow . You
tree could see lots of av oc ados that the supplier
Suddenly , a mis chie vous squirrel came and

INTERVAL 0.040 - 0.084
CONTAINS 0.094%

said , " You 're such a good girl , Lily
� � That â € ™ s so cool ! â
realized he couldn â € ™ t take it home .
it . Tim my felt even more sad , but his
empty , he still felt like he had found something valuable

INTERVAL -0.005 - 0.040
CONTAINS 68.933%

to wash her hair with shampoo . Lily didn 't like
walked through the forest , the sun began to set ,
strange sound . T eddy stopped and looked
said it made him happy . Mom explained to
black and white squares . But they could not make a

INTERVAL -0.049 - -0.005
CONTAINS 30.970%

She did not know if they would hurt her or her
'm hungry . And I 'm curious . I want to
dad " Daddy , why are we in this tunnel ?"
carefully put the cheese , crack er and glass of milk
a sudden he noticed a rabb ith ole . Hop was

INTERVAL -0.093 - -0.049
CONTAINS 0.003%

'm sorry , my d arl ings . The tornado took
" Hello , my d arl ings . I 'm so
day on , Lily loved av oc ados and would ask
, and all the other av oc ados were very happy
kids . They liked to solve puzzles and r iddles .
Token.
Token ID13
Feature activation+2.13e-03

Pos logit contributions
rys+0.030
iny+0.028
 avail+0.028
alk+0.028
umper+0.028
Neg logit contributions
 how-0.015
 lots-0.015
 were-0.015
 no-0.014
 thank-0.014
Token The
Token ID383
Feature activation+5.81e-03

Pos logit contributions
 were+0.002
five+0.002
 YES+0.002
anges+0.002
 how+0.002
Neg logit contributions
rys-0.004
alk-0.004
iny-0.004
 diary-0.004
enna-0.004
Token cater
Token ID20825
Feature activation-6.13e-03

Pos logit contributions
 were+0.009
 lots+0.009
 thank+0.009
 no+0.009
Sure+0.009
Neg logit contributions
rys-0.009
alk-0.008
iny-0.008
 battlefield-0.008
enna-0.008
Tokenpillar
Token ID41643
Feature activation-1.29e-02

Pos logit contributions
 avail+0.008
rys+0.008
ishment+0.008
 diary+0.008
 battlefield+0.007
Neg logit contributions
 were-0.011
 of-0.010
Sure-0.010
Of-0.010
five-0.010
Token happily
Token ID18177
Feature activation+5.08e-03

Pos logit contributions
rys+0.028
iny+0.027
 diary+0.026
ita+0.026
alk+0.026
Neg logit contributions
five-0.012
anges-0.012
 YES-0.011
 Mike-0.011
Of-0.010
Token mun
Token ID29856
Feature activation+8.37e-02

Pos logit contributions
anges+0.006
 YES+0.005
five+0.005
 Mike+0.005
 oh+0.005
Neg logit contributions
rys-0.010
alk-0.009
zo-0.009
iny-0.009
ci-0.009
Tokenched
Token ID1740
Feature activation+8.42e-03

Pos logit contributions
 Mike+0.109
 moms+0.106
 oh+0.105
anges+0.104
 how+0.103
Neg logit contributions
rys-0.153
alk-0.143
umper-0.135
iny-0.135
ky-0.129
Token on
Token ID319
Feature activation+9.96e-03

Pos logit contributions
anges+0.010
 YES+0.010
five+0.010
Of+0.010
Sure+0.009
Neg logit contributions
rys-0.016
iny-0.015
alk-0.015
 diary-0.014
zo-0.014
Token the
Token ID262
Feature activation+3.78e-03

Pos logit contributions
five+0.009
 were+0.009
anges+0.009
 YES+0.009
Of+0.008
Neg logit contributions
rys-0.022
alk-0.021
iny-0.020
 diary-0.020
zo-0.020
Token mushroom
Token ID28520
Feature activation-3.10e-02

Pos logit contributions
 lots+0.008
 were+0.008
 thank+0.008
five+0.008
 how+0.007
Neg logit contributions
rys-0.004
iny-0.004
zo-0.004
alk-0.004
enna-0.004
Token,
Token ID11
Feature activation+1.23e-02

Pos logit contributions
rys+0.080
iny+0.075
umper+0.074
alk+0.074
enna+0.073
Neg logit contributions
 were-0.019
 lots-0.018
 how-0.018
 no-0.017
 thank-0.017
Token looked
Token ID3114
Feature activation-1.38e-02

Pos logit contributions
rys+0.019
iny+0.018
 diary+0.018
alk+0.018
zo+0.018
Neg logit contributions
five-0.008
anges-0.008
 YES-0.008
Of-0.008
Sure-0.007
Token so
Token ID523
Feature activation-4.34e-03

Pos logit contributions
rys+0.026
 diary+0.024
iny+0.024
alk+0.023
 weeping+0.023
Neg logit contributions
five-0.017
anges-0.017
 YES-0.016
 were-0.016
Sure-0.016
Token happy
Token ID3772
Feature activation+4.65e-03

Pos logit contributions
rys+0.009
iny+0.008
alk+0.008
 diary+0.008
zo+0.008
Neg logit contributions
five-0.005
 YES-0.005
 were-0.004
anges-0.004
 how-0.004
Token!
Token ID0
Feature activation+9.20e-03

Pos logit contributions
five+0.004
 YES+0.004
anges+0.004
 were+0.004
Sure+0.004
Neg logit contributions
rys-0.010
 diary-0.009
alk-0.009
iny-0.009
enna-0.009
Token It
Token ID632
Feature activation+1.36e-02

Pos logit contributions
five+0.012
 were+0.012
 YES+0.011
anges+0.011
 how+0.011
Neg logit contributions
rys-0.017
iny-0.016
alk-0.016
 diary-0.015
enna-0.015
Token mun
Token ID29856
Feature activation+8.01e-02

Pos logit contributions
five+0.014
anges+0.013
 YES+0.012
Sure+0.011
 thank+0.011
Neg logit contributions
rys-0.029
iny-0.028
zo-0.027
 diary-0.027
ita-0.027
Tokenched
Token ID1740
Feature activation+5.23e-03

Pos logit contributions
 moms+0.116
 Mike+0.115
 oh+0.109
 how+0.107
 Megan+0.106
Neg logit contributions
rys-0.136
alk-0.129
umper-0.120
iny-0.118
ky-0.118
Token and
Token ID290
Feature activation+7.48e-03

Pos logit contributions
five+0.006
 YES+0.006
anges+0.006
 thank+0.006
 were+0.006
Neg logit contributions
rys-0.010
iny-0.009
alk-0.009
 diary-0.009
ci-0.009
Token mun
Token ID29856
Feature activation+7.79e-02

Pos logit contributions
anges+0.008
five+0.008
Sure+0.008
Of+0.008
 YES+0.008
Neg logit contributions
rys-0.014
 bur-0.013
alk-0.013
ky-0.013
 diary-0.013
Tokenched
Token ID1740
Feature activation+1.66e-03

Pos logit contributions
 Mike+0.104
 moms+0.104
 oh+0.100
Of+0.099
 how+0.099
Neg logit contributions
rys-0.138
alk-0.131
iny-0.122
umper-0.121
ky-0.120
Token until
Token ID1566
Feature activation+1.71e-02

Pos logit contributions
five+0.002
 YES+0.002
anges+0.002
Of+0.002
Sure+0.002
Neg logit contributions
rys-0.003
iny-0.003
alk-0.003
 diary-0.003
ita-0.003
Token in
Token ID287
Feature activation-6.62e-03

Pos logit contributions
rys+0.005
iny+0.005
alk+0.005
 diary+0.005
ita+0.004
Neg logit contributions
five-0.003
 were-0.003
 YES-0.003
 thank-0.003
 how-0.003
Token his
Token ID465
Feature activation-3.69e-03

Pos logit contributions
rys+0.014
iny+0.013
alk+0.013
 diary+0.013
enna+0.013
Neg logit contributions
 were-0.007
five-0.006
 how-0.006
 YES-0.006
 lots-0.006
Token hands
Token ID2832
Feature activation-8.18e-03

Pos logit contributions
rys+0.007
 diary+0.006
alk+0.006
iny+0.006
ita+0.006
Neg logit contributions
 YES-0.005
anges-0.005
five-0.005
 were-0.005
 yes-0.005
Token.
Token ID13
Feature activation-1.84e-03

Pos logit contributions
rys+0.014
iny+0.013
alk+0.013
 diary+0.013
enna+0.013
Neg logit contributions
 were-0.011
five-0.011
 YES-0.011
 how-0.011
 thank-0.010
Token He
Token ID679
Feature activation+7.09e-03

Pos logit contributions
rys+0.003
alk+0.003
iny+0.003
 battlefield+0.003
zo+0.003
Neg logit contributions
five-0.002
anges-0.002
 yes-0.002
 were-0.002
 YES-0.002
Token mun
Token ID29856
Feature activation+7.96e-02

Pos logit contributions
five+0.009
anges+0.009
rav+0.009
Not+0.009
 YES+0.008
Neg logit contributions
 bur-0.012
zo-0.012
iny-0.011
ita-0.011
 pen-0.011
Tokenched
Token ID1740
Feature activation+6.36e-03

Pos logit contributions
 moms+0.130
 Mike+0.125
 oh+0.123
Of+0.118
 Megan+0.118
Neg logit contributions
rys-0.123
alk-0.116
umper-0.108
ky-0.106
iny-0.104
Token on
Token ID319
Feature activation+6.60e-03

Pos logit contributions
anges+0.009
 YES+0.009
Of+0.009
five+0.008
Sure+0.008
Neg logit contributions
iny-0.010
rys-0.010
ky-0.010
alk-0.009
zo-0.009
Token it
Token ID340
Feature activation-1.01e-03

Pos logit contributions
anges+0.007
Of+0.007
rav+0.007
 were+0.007
Sure+0.007
Neg logit contributions
rys-0.013
alk-0.012
iny-0.012
ky-0.011
 avail-0.011
Token happily
Token ID18177
Feature activation+1.09e-02

Pos logit contributions
rys+0.002
iny+0.002
alk+0.002
 diary+0.002
zo+0.002
Neg logit contributions
five-0.001
anges-0.001
 YES-0.001
 were-0.001
Of-0.001
Token.
Token ID13
Feature activation+3.10e-03

Pos logit contributions
 were+0.010
five+0.010
 how+0.009
 no+0.009
 lots+0.009
Neg logit contributions
rys-0.024
iny-0.023
alk-0.023
 diary-0.022
umper-0.022
Token eating
Token ID6600
Feature activation+1.48e-02

Pos logit contributions
five+0.013
 were+0.013
 YES+0.013
 thank+0.012
anges+0.012
Neg logit contributions
rys-0.021
iny-0.019
alk-0.019
 diary-0.019
ita-0.018
Token."
Token ID526
Feature activation+5.25e-03

Pos logit contributions
five+0.016
 YES+0.015
Of+0.015
anges+0.015
Sure+0.015
Neg logit contributions
rys-0.029
alk-0.028
iny-0.027
 diary-0.027
ky-0.026
Token Daisy
Token ID40355
Feature activation-5.65e-03

Pos logit contributions
 were+0.005
five+0.005
 how+0.005
 YES+0.005
 thank+0.005
Neg logit contributions
rys-0.011
iny-0.011
alk-0.011
 diary-0.011
zo-0.010
Token smiled
Token ID13541
Feature activation+1.02e-02

Pos logit contributions
ita+0.011
rys+0.011
iny+0.011
ci+0.010
zo+0.010
Neg logit contributions
anges-0.006
five-0.006
 YES-0.006
 Faster-0.005
rav-0.005
Token and
Token ID290
Feature activation+1.21e-02

Pos logit contributions
 were+0.007
five+0.007
 YES+0.007
 how+0.007
 thank+0.007
Neg logit contributions
rys-0.024
iny-0.023
alk-0.023
 diary-0.023
enna-0.022
Token mun
Token ID29856
Feature activation+7.88e-02

Pos logit contributions
anges+0.015
five+0.014
 YES+0.014
Eventually+0.013
 Faster+0.013
Neg logit contributions
rys-0.022
ci-0.020
ita-0.020
 diary-0.020
 weeping-0.019
Tokenched
Token ID1740
Feature activation+6.09e-03

Pos logit contributions
 Mike+0.099
 were+0.095
 how+0.095
 YES+0.094
Of+0.094
Neg logit contributions
rys-0.146
alk-0.136
iny-0.131
umper-0.129
ky-0.124
Token on
Token ID319
Feature activation+4.62e-03

Pos logit contributions
five+0.007
 YES+0.007
 were+0.007
anges+0.007
 thank+0.007
Neg logit contributions
rys-0.012
iny-0.011
alk-0.011
 diary-0.011
ita-0.011
Token her
Token ID607
Feature activation-7.54e-03

Pos logit contributions
five+0.005
 were+0.005
anges+0.005
 YES+0.005
Of+0.005
Neg logit contributions
rys-0.009
alk-0.009
iny-0.008
 diary-0.008
ita-0.008
Token apple
Token ID17180
Feature activation+1.42e-02

Pos logit contributions
rys+0.015
 diary+0.013
alk+0.013
iny+0.013
ita+0.013
Neg logit contributions
 were-0.009
five-0.009
anges-0.009
 Mike-0.009
 YES-0.009
Token.
Token ID13
Feature activation+4.68e-03

Pos logit contributions
 were+0.015
five+0.015
 how+0.015
 lots+0.014
 thank+0.014
Neg logit contributions
rys-0.029
iny-0.027
alk-0.027
enna-0.027
umper-0.027
Token!
Token ID0
Feature activation+1.60e-02

Pos logit contributions
rys+0.010
iny+0.009
alk+0.009
 diary+0.009
enna+0.009
Neg logit contributions
 were-0.005
 how-0.005
 lots-0.005
 no-0.005
 Mike-0.004
Token
Token ID198
Feature activation+1.23e-02

Pos logit contributions
five+0.016
anges+0.016
 YES+0.016
 were+0.016
 yes+0.015
Neg logit contributions
rys-0.033
iny-0.031
alk-0.031
enna-0.030
 diary-0.030
Token
Token ID198
Feature activation+3.97e-03

Pos logit contributions
five+0.016
anges+0.016
 thank+0.015
 yes+0.015
 YES+0.015
Neg logit contributions
iny-0.022
rys-0.021
alk-0.020
Harry-0.020
 diary-0.020
TokenLittle
Token ID22253
Feature activation+1.47e-02

Pos logit contributions
five+0.007
 thank+0.007
oops+0.006
anges+0.006
 were+0.006
Neg logit contributions
iny-0.006
Harry-0.006
rys-0.005
ita-0.005
zo-0.005
Token John
Token ID1757
Feature activation+5.87e-03

Pos logit contributions
 were+0.018
five+0.018
 YES+0.018
 how+0.018
 thank+0.017
Neg logit contributions
rys-0.028
alk-0.026
iny-0.026
 diary-0.025
umper-0.025
Token mun
Token ID29856
Feature activation+7.86e-02

Pos logit contributions
anges+0.006
five+0.005
 YES+0.005
Of+0.005
Sure+0.005
Neg logit contributions
iny-0.012
rys-0.012
ita-0.012
zo-0.012
ky-0.012
Tokenched
Token ID1740
Feature activation+9.20e-03

Pos logit contributions
 moms+0.112
 Mike+0.106
 oh+0.105
 how+0.104
Of+0.104
Neg logit contributions
rys-0.135
alk-0.126
umper-0.120
iny-0.117
ky-0.117
Token away
Token ID1497
Feature activation+8.37e-03

Pos logit contributions
anges+0.011
 YES+0.011
Of+0.011
five+0.011
Sure+0.011
Neg logit contributions
rys-0.016
iny-0.016
alk-0.016
 diary-0.015
ky-0.015
Token happily
Token ID18177
Feature activation+9.55e-03

Pos logit contributions
five+0.006
 YES+0.006
anges+0.006
 were+0.006
Sure+0.006
Neg logit contributions
rys-0.020
alk-0.019
iny-0.018
 diary-0.018
enna-0.018
Token,
Token ID11
Feature activation+9.91e-03

Pos logit contributions
 were+0.007
five+0.007
 how+0.007
 lots+0.007
 thank+0.007
Neg logit contributions
rys-0.022
iny-0.021
alk-0.021
 diary-0.021
umper-0.021
Token welcoming
Token ID22516
Feature activation+1.27e-02

Pos logit contributions
anges+0.011
five+0.011
 were+0.011
Sure+0.011
 YES+0.011
Neg logit contributions
rys-0.019
iny-0.017
 diary-0.017
 weeping-0.017
alk-0.017
Token a
Token ID257
Feature activation-1.01e-03

Pos logit contributions
 diary+0.014
rys+0.013
iny+0.013
ci+0.013
 universe+0.013
Neg logit contributions
five-0.007
anges-0.007
Sure-0.006
Eventually-0.006
 YES-0.006
Token honey
Token ID12498
Feature activation+5.44e-03

Pos logit contributions
rys+0.002
 diary+0.002
iny+0.002
alk+0.002
 violin+0.002
Neg logit contributions
five-0.001
anges-0.001
 how-0.001
 were-0.001
 Mike-0.001
Token sandwich
Token ID20433
Feature activation+1.47e-02

Pos logit contributions
five+0.006
anges+0.006
Of+0.006
 YES+0.006
Sure+0.006
Neg logit contributions
rys-0.011
iny-0.010
 diary-0.010
alk-0.010
ci-0.010
Token.
Token ID13
Feature activation-1.29e-03

Pos logit contributions
five+0.015
anges+0.015
Of+0.015
Sure+0.014
 YES+0.014
Neg logit contributions
rys-0.029
 diary-0.028
iny-0.028
alk-0.028
ci-0.027
Token He
Token ID679
Feature activation+8.59e-03

Pos logit contributions
rys+0.003
iny+0.003
alk+0.003
ita+0.003
enna+0.003
Neg logit contributions
five-0.001
 were-0.001
anges-0.001
 yes-0.001
 YES-0.001
Token mun
Token ID29856
Feature activation+7.81e-02

Pos logit contributions
five+0.009
anges+0.009
 Faster+0.009
 YES+0.008
Not+0.008
Neg logit contributions
ci-0.016
iny-0.016
ita-0.016
zo-0.016
rys-0.016
Tokenched
Token ID1740
Feature activation+1.12e-02

Pos logit contributions
 moms+0.118
 Mike+0.116
 oh+0.115
Of+0.112
Me+0.108
Neg logit contributions
rys-0.126
alk-0.123
umper-0.114
iny-0.112
ky-0.108
Token away
Token ID1497
Feature activation+8.16e-03

Pos logit contributions
Of+0.013
five+0.013
anges+0.012
 YES+0.012
Sure+0.012
Neg logit contributions
rys-0.021
iny-0.020
ky-0.020
alk-0.020
 diary-0.019
Token happily
Token ID18177
Feature activation+9.65e-03

Pos logit contributions
five+0.007
 YES+0.006
Of+0.006
anges+0.006
Sure+0.006
Neg logit contributions
rys-0.018
alk-0.018
iny-0.017
 diary-0.017
ita-0.017
Token and
Token ID290
Feature activation-9.39e-04

Pos logit contributions
 were+0.007
five+0.007
 how+0.007
 lots+0.007
 no+0.007
Neg logit contributions
rys-0.023
iny-0.022
alk-0.022
 diary-0.022
umper-0.021
Token soon
Token ID2582
Feature activation+9.54e-03

Pos logit contributions
rys+0.002
iny+0.002
 diary+0.002
alk+0.002
ci+0.002
Neg logit contributions
five-0.001
anges-0.001
 YES-0.001
Sure-0.001
 were-0.001
Token to
Token ID284
Feature activation-8.19e-03

Pos logit contributions
rys+0.007
iny+0.007
alk+0.007
 diary+0.006
enna+0.006
Neg logit contributions
 were-0.002
five-0.002
 YES-0.002
 how-0.002
 thank-0.002
Token eat
Token ID4483
Feature activation+5.77e-03

Pos logit contributions
rys+0.015
enna+0.014
 diary+0.014
iny+0.014
alk+0.014
Neg logit contributions
 were-0.011
 thank-0.010
 how-0.010
 lots-0.010
 no-0.010
Token.
Token ID13
Feature activation+3.64e-03

Pos logit contributions
five+0.007
Of+0.007
anges+0.006
rav+0.006
 YES+0.006
Neg logit contributions
rys-0.011
alk-0.011
iny-0.010
 diary-0.010
ky-0.010
Token The
Token ID383
Feature activation+8.77e-03

Pos logit contributions
 were+0.003
five+0.003
 YES+0.003
anges+0.003
 yes+0.003
Neg logit contributions
rys-0.008
alk-0.007
iny-0.007
 diary-0.007
 battlefield-0.007
Token bear
Token ID6842
Feature activation-1.37e-03

Pos logit contributions
 were+0.016
five+0.016
 YES+0.016
 how+0.016
 thank+0.015
Neg logit contributions
rys-0.011
iny-0.010
alk-0.010
 diary-0.010
enna-0.010
Token mun
Token ID29856
Feature activation+7.80e-02

Pos logit contributions
iny+0.003
zo+0.003
ita+0.003
rys+0.003
cker+0.003
Neg logit contributions
anges-0.001
five-0.001
 YES-0.001
Of-0.001
 Mike-0.001
Tokenched
Token ID1740
Feature activation+5.01e-03

Pos logit contributions
 moms+0.116
 Mike+0.115
 oh+0.109
Of+0.107
anges+0.107
Neg logit contributions
rys-0.132
alk-0.125
umper-0.115
iny-0.114
ky-0.112
Token the
Token ID262
Feature activation-1.13e-03

Pos logit contributions
anges+0.006
 YES+0.006
five+0.006
Of+0.006
 yes+0.006
Neg logit contributions
rys-0.009
iny-0.009
alk-0.009
 diary-0.008
ky-0.008
Token juicy
Token ID35075
Feature activation+2.47e-02

Pos logit contributions
rys+0.002
iny+0.002
alk+0.002
 diary+0.002
cker+0.002
Neg logit contributions
 how-0.002
 YES-0.002
 Mike-0.002
anges-0.002
Of-0.002
Token green
Token ID4077
Feature activation-1.81e-03

Pos logit contributions
Of+0.041
 YES+0.040
Sure+0.039
 yes+0.039
 Mike+0.038
Neg logit contributions
rys-0.038
alk-0.036
 bean-0.034
iny-0.034
ky-0.033
Token sa
Token ID473
Feature activation+1.41e-02

Pos logit contributions
rys+0.003
alk+0.003
iny+0.003
 diary+0.002
ky+0.002
Neg logit contributions
Of-0.003
 YES-0.003
anges-0.003
Sure-0.003
 Mike-0.003
Token!
Token ID0
Feature activation+9.20e-03

Pos logit contributions
five+0.004
 YES+0.004
anges+0.004
 were+0.004
Sure+0.004
Neg logit contributions
rys-0.010
 diary-0.009
alk-0.009
iny-0.009
enna-0.009
Token It
Token ID632
Feature activation+1.36e-02

Pos logit contributions
five+0.012
 were+0.012
 YES+0.011
anges+0.011
 how+0.011
Neg logit contributions
rys-0.017
iny-0.016
alk-0.016
 diary-0.015
enna-0.015
Token mun
Token ID29856
Feature activation+8.01e-02

Pos logit contributions
five+0.014
anges+0.013
 YES+0.012
Sure+0.011
 thank+0.011
Neg logit contributions
rys-0.029
iny-0.028
zo-0.027
 diary-0.027
ita-0.027
Tokenched
Token ID1740
Feature activation+5.23e-03

Pos logit contributions
 moms+0.116
 Mike+0.115
 oh+0.109
 how+0.107
 Megan+0.106
Neg logit contributions
rys-0.136
alk-0.129
umper-0.120
iny-0.118
ky-0.118
Token and
Token ID290
Feature activation+7.48e-03

Pos logit contributions
five+0.006
 YES+0.006
anges+0.006
 thank+0.006
 were+0.006
Neg logit contributions
rys-0.010
iny-0.009
alk-0.009
 diary-0.009
ci-0.009
Token mun
Token ID29856
Feature activation+7.79e-02

Pos logit contributions
anges+0.008
five+0.008
Sure+0.008
Of+0.008
 YES+0.008
Neg logit contributions
rys-0.014
 bur-0.013
alk-0.013
ky-0.013
 diary-0.013
Tokenched
Token ID1740
Feature activation+1.66e-03

Pos logit contributions
 Mike+0.104
 moms+0.104
 oh+0.100
Of+0.099
 how+0.099
Neg logit contributions
rys-0.138
alk-0.131
iny-0.122
umper-0.121
ky-0.120
Token until
Token ID1566
Feature activation+1.71e-02

Pos logit contributions
five+0.002
 YES+0.002
anges+0.002
Of+0.002
Sure+0.002
Neg logit contributions
rys-0.003
iny-0.003
alk-0.003
 diary-0.003
ita-0.003
Token the
Token ID262
Feature activation+3.73e-03

Pos logit contributions
 were+0.011
five+0.011
 YES+0.011
 how+0.010
 thank+0.010
Neg logit contributions
rys-0.042
alk-0.040
iny-0.040
 diary-0.039
enna-0.038
Token cereal
Token ID33158
Feature activation+2.03e-03

Pos logit contributions
 were+0.006
 thank+0.005
 no+0.005
 lots+0.005
five+0.005
Neg logit contributions
rys-0.006
alk-0.006
iny-0.006
enna-0.006
umper-0.006
Token was
Token ID373
Feature activation+7.29e-03

Pos logit contributions
Of+0.002
anges+0.002
five+0.002
Sure+0.002
 YES+0.002
Neg logit contributions
alk-0.004
rys-0.004
 diary-0.004
iny-0.004
ci-0.004
Token mouse
Token ID10211
Feature activation-9.72e-03

Pos logit contributions
 were+0.006
 thank+0.006
 lots+0.006
 no+0.006
 yes+0.006
Neg logit contributions
rys-0.005
alk-0.005
iny-0.005
enna-0.005
 diary-0.005
Token watched
Token ID7342
Feature activation+4.99e-03

Pos logit contributions
iny+0.020
rys+0.019
zo+0.019
ita+0.019
 diary+0.019
Neg logit contributions
anges-0.010
 YES-0.009
five-0.009
Of-0.009
rav-0.009
Token as
Token ID355
Feature activation+1.46e-02

Pos logit contributions
five+0.004
 were+0.004
 YES+0.004
anges+0.004
Of+0.004
Neg logit contributions
rys-0.011
iny-0.011
alk-0.011
 diary-0.011
zo-0.011
Token she
Token ID673
Feature activation-1.84e-02

Pos logit contributions
five+0.010
 were+0.010
 YES+0.009
 how+0.009
 thank+0.009
Neg logit contributions
rys-0.035
iny-0.034
alk-0.033
 diary-0.033
enna-0.032
Token happily
Token ID18177
Feature activation+3.04e-03

Pos logit contributions
rys+0.032
zo+0.030
iny+0.030
 diary+0.030
ita+0.029
Neg logit contributions
anges-0.023
five-0.023
 YES-0.023
Not-0.022
Of-0.022
Token mun
Token ID29856
Feature activation+7.77e-02

Pos logit contributions
 YES+0.003
anges+0.003
five+0.002
 were+0.002
 how+0.002
Neg logit contributions
rys-0.007
alk-0.006
iny-0.006
zo-0.006
 diary-0.006
Tokenched
Token ID1740
Feature activation+6.78e-03

Pos logit contributions
 were+0.085
 YES+0.084
anges+0.084
 Mike+0.083
 how+0.083
Neg logit contributions
rys-0.154
alk-0.145
iny-0.141
umper-0.137
zo-0.135
Token away
Token ID1497
Feature activation+6.74e-03

Pos logit contributions
five+0.007
 were+0.007
 YES+0.007
anges+0.007
 how+0.007
Neg logit contributions
rys-0.014
iny-0.013
alk-0.013
 diary-0.013
zo-0.012
Token on
Token ID319
Feature activation+1.01e-02

Pos logit contributions
 were+0.006
five+0.006
 how+0.006
 YES+0.006
 lots+0.006
Neg logit contributions
rys-0.015
iny-0.014
alk-0.014
 diary-0.014
enna-0.013
Token the
Token ID262
Feature activation+1.03e-02

Pos logit contributions
five+0.008
 were+0.007
 YES+0.007
anges+0.007
 thank+0.007
Neg logit contributions
rys-0.024
alk-0.022
iny-0.022
 diary-0.022
ita-0.022
Token y
Token ID331
Feature activation+1.13e-02

Pos logit contributions
five+0.015
 lots+0.015
 were+0.015
 thank+0.015
 YES+0.015
Neg logit contributions
rys-0.016
alk-0.016
enna-0.015
iny-0.015
umper-0.015
Token
Token ID198
Feature activation+5.54e-03

Pos logit contributions
 were+0.002
five+0.002
 YES+0.002
anges+0.002
 how+0.002
Neg logit contributions
rys-0.005
alk-0.005
iny-0.005
 diary-0.004
enna-0.004
Token
Token ID198
Feature activation-2.78e-03

Pos logit contributions
anges+0.006
 yes+0.006
 YES+0.006
 were+0.006
five+0.006
Neg logit contributions
iny-0.010
rys-0.010
alk-0.010
Harry-0.009
umper-0.009
TokenThe
Token ID464
Feature activation+1.29e-02

Pos logit contributions
rys+0.005
iny+0.005
Harry+0.005
zo+0.005
umper+0.005
Neg logit contributions
 were-0.003
 thank-0.003
 how-0.003
oops-0.003
five-0.003
Token bunny
Token ID44915
Feature activation-5.71e-03

Pos logit contributions
 thank+0.020
 were+0.020
 lots+0.020
 how+0.020
 no+0.020
Neg logit contributions
rys-0.020
alk-0.020
enna-0.019
iny-0.018
ci-0.018
Token happily
Token ID18177
Feature activation+3.52e-04

Pos logit contributions
rys+0.012
iny+0.012
ita+0.012
zo+0.012
alk+0.012
Neg logit contributions
anges-0.005
five-0.005
 YES-0.005
Of-0.005
 Mike-0.004
Token mun
Token ID29856
Feature activation+7.73e-02

Pos logit contributions
anges+0.000
 YES+0.000
 oh+0.000
 Faster+0.000
five+0.000
Neg logit contributions
rys-0.001
zo-0.001
ci-0.001
alk-0.001
ita-0.001
Tokenched
Token ID1740
Feature activation+8.51e-03

Pos logit contributions
 Mike+0.099
 oh+0.097
 how+0.096
 moms+0.095
anges+0.095
Neg logit contributions
rys-0.139
alk-0.132
umper-0.128
iny-0.125
zo-0.121
Token on
Token ID319
Feature activation+6.36e-03

Pos logit contributions
anges+0.010
 YES+0.009
five+0.009
Of+0.009
Sure+0.009
Neg logit contributions
rys-0.016
iny-0.015
alk-0.015
 diary-0.015
zo-0.015
Token his
Token ID465
Feature activation+6.99e-03

Pos logit contributions
anges+0.006
Of+0.006
 were+0.006
five+0.006
Sure+0.006
Neg logit contributions
rys-0.013
alk-0.012
iny-0.012
 diary-0.012
zo-0.012
Token carrot
Token ID40562
Feature activation+1.26e-02

Pos logit contributions
 were+0.011
five+0.011
anges+0.011
 Mike+0.011
 YES+0.010
Neg logit contributions
rys-0.011
iny-0.010
 diary-0.010
alk-0.010
zo-0.009
Token and
Token ID290
Feature activation+3.55e-03

Pos logit contributions
 were+0.011
five+0.010
 how+0.010
 lots+0.010
 YES+0.010
Neg logit contributions
rys-0.029
iny-0.027
alk-0.027
 diary-0.026
enna-0.026
Token started
Token ID2067
Feature activation-8.87e-03

Pos logit contributions
rys+0.004
alk+0.004
 diary+0.004
iny+0.004
ci+0.004
Neg logit contributions
 YES-0.002
five-0.002
anges-0.002
Sure-0.002
Of-0.002
Token to
Token ID284
Feature activation-7.88e-03

Pos logit contributions
rys+0.021
iny+0.020
alk+0.020
 diary+0.020
enna+0.020
Neg logit contributions
 how-0.007
 were-0.007
 lots-0.007
 no-0.006
five-0.006
Token eat
Token ID4483
Feature activation+6.98e-03

Pos logit contributions
rys+0.017
iny+0.017
enna+0.016
alk+0.016
umper+0.016
Neg logit contributions
 were-0.009
 thank-0.008
 lots-0.008
 how-0.008
 yes-0.008
Token.
Token ID13
Feature activation+3.13e-04

Pos logit contributions
five+0.010
Of+0.009
rav+0.009
 YES+0.009
anges+0.009
Neg logit contributions
rys-0.011
alk-0.011
 diary-0.010
 bean-0.010
 avail-0.010
Token They
Token ID1119
Feature activation+5.09e-03

Pos logit contributions
five+0.000
 YES+0.000
 were+0.000
 yes+0.000
anges+0.000
Neg logit contributions
rys-0.001
alk-0.001
iny-0.001
ita-0.001
enna-0.001
Token mun
Token ID29856
Feature activation+7.73e-02

Pos logit contributions
five+0.006
anges+0.005
 YES+0.005
Of+0.005
Sure+0.005
Neg logit contributions
rys-0.010
alk-0.009
zo-0.009
 diary-0.009
ci-0.009
Tokenched
Token ID1740
Feature activation+2.55e-03

Pos logit contributions
 Mike+0.118
 oh+0.117
 moms+0.115
Of+0.111
 how+0.109
Neg logit contributions
rys-0.124
alk-0.121
ky-0.111
iny-0.108
umper-0.106
Token and
Token ID290
Feature activation+7.54e-03

Pos logit contributions
 YES+0.003
anges+0.003
Of+0.003
five+0.003
Sure+0.003
Neg logit contributions
rys-0.005
iny-0.005
alk-0.004
ky-0.004
ci-0.004
Token cr
Token ID1067
Feature activation-1.25e-02

Pos logit contributions
anges+0.008
Of+0.008
Sure+0.008
 YES+0.008
 Mike+0.008
Neg logit contributions
rys-0.014
alk-0.013
ky-0.012
 bean-0.012
 bur-0.012
Tokenun
Token ID403
Feature activation-7.27e-03

Pos logit contributions
iny+0.019
alk+0.018
enna+0.018
rys+0.017
umper+0.017
Neg logit contributions
 oh-0.021
 maybe-0.020
 thank-0.020
 YES-0.020
Of-0.020
Tokenched
Token ID1740
Feature activation+7.44e-04

Pos logit contributions
zo+0.012
rys+0.012
ky+0.012
ita+0.012
iny+0.012
Neg logit contributions
 YES-0.009
Sure-0.009
Of-0.009
 maybe-0.009
 yes-0.009
Token grabbed
Token ID13176
Feature activation-1.08e-02

Pos logit contributions
anges+0.005
 how+0.005
 YES+0.005
 thank+0.005
Of+0.005
Neg logit contributions
iny-0.008
rys-0.008
ita-0.008
zo-0.008
ky-0.008
Token a
Token ID257
Feature activation-1.25e-03

Pos logit contributions
rys+0.020
 diary+0.019
ky+0.019
ishment+0.019
ci+0.019
Neg logit contributions
rav-0.012
 YES-0.011
Of-0.011
Sure-0.011
Not-0.011
Token handful
Token ID10089
Feature activation+2.03e-02

Pos logit contributions
rys+0.002
 diary+0.002
 pen+0.002
iny+0.002
 violin+0.002
Neg logit contributions
 how-0.002
 Mike-0.002
anges-0.002
 were-0.002
 YES-0.002
Token and
Token ID290
Feature activation+2.24e-04

Pos logit contributions
 YES+0.019
Of+0.017
 oh+0.016
Sure+0.016
Me+0.016
Neg logit contributions
alk-0.044
rys-0.044
ky-0.041
iny-0.041
 bean-0.040
Token happily
Token ID18177
Feature activation-4.01e-03

Pos logit contributions
rav+0.000
 YES+0.000
Of+0.000
Sure+0.000
five+0.000
Neg logit contributions
ky-0.000
 pen-0.000
ci-0.000
 bean-0.000
ita-0.000
Token mun
Token ID29856
Feature activation+7.71e-02

Pos logit contributions
rys+0.008
ci+0.007
zo+0.007
iny+0.007
alk+0.007
Neg logit contributions
 YES-0.004
Of-0.004
 Faster-0.004
anges-0.004
Sure-0.004
Tokenched
Token ID1740
Feature activation+8.55e-04

Pos logit contributions
Of+0.092
 Mike+0.092
 oh+0.092
 YES+0.091
 were+0.091
Neg logit contributions
rys-0.143
alk-0.136
iny-0.127
umper-0.126
ky-0.125
Token away
Token ID1497
Feature activation+3.23e-03

Pos logit contributions
 YES+0.001
Of+0.001
Sure+0.001
five+0.001
anges+0.001
Neg logit contributions
rys-0.002
alk-0.002
iny-0.002
 diary-0.002
zo-0.002
Token.
Token ID13
Feature activation+2.03e-03

Pos logit contributions
five+0.003
 YES+0.003
 were+0.003
anges+0.003
Sure+0.003
Neg logit contributions
rys-0.007
alk-0.006
iny-0.006
 diary-0.006
zo-0.006
Token The
Token ID383
Feature activation+9.91e-03

Pos logit contributions
anges+0.002
 thank+0.002
five+0.002
 yes+0.002
 YES+0.002
Neg logit contributions
alk-0.004
rys-0.004
enna-0.004
ishment-0.004
 battlefield-0.004
Token popcorn
Token ID38914
Feature activation+3.04e-02

Pos logit contributions
 YES+0.013
 were+0.013
 how+0.013
five+0.013
anges+0.013
Neg logit contributions
rys-0.016
iny-0.016
 diary-0.016
 battlefield-0.015
ita-0.015
Token.
Token ID13
Feature activation+1.24e-03

Pos logit contributions
rys+0.006
alk+0.006
iny+0.006
 diary+0.005
umper+0.005
Neg logit contributions
 were-0.004
 how-0.004
five-0.004
 lots-0.004
 thank-0.004
Token The
Token ID383
Feature activation+5.79e-03

Pos logit contributions
 were+0.001
 YES+0.001
 how+0.001
anges+0.001
five+0.001
Neg logit contributions
rys-0.003
alk-0.002
iny-0.002
enna-0.002
 diary-0.002
Token two
Token ID734
Feature activation-1.11e-02

Pos logit contributions
 YES+0.007
 were+0.007
anges+0.006
 how+0.006
five+0.006
Neg logit contributions
rys-0.011
iny-0.011
 diary-0.010
alk-0.010
ita-0.010
Token of
Token ID286
Feature activation-1.78e-02

Pos logit contributions
iny+0.023
rys+0.022
zo+0.022
ita+0.022
 diary+0.022
Neg logit contributions
 YES-0.010
five-0.009
Of-0.009
 Enjoy-0.009
anges-0.008
Token them
Token ID606
Feature activation-4.59e-03

Pos logit contributions
rys+0.037
iny+0.035
alk+0.035
ita+0.033
 diary+0.033
Neg logit contributions
five-0.018
 YES-0.018
 were-0.017
 yes-0.017
 thank-0.017
Token mun
Token ID29856
Feature activation+7.63e-02

Pos logit contributions
rys+0.009
zo+0.009
iny+0.009
alk+0.009
ci+0.009
Neg logit contributions
anges-0.004
five-0.004
 YES-0.004
Of-0.004
Sure-0.004
Tokenched
Token ID1740
Feature activation+1.05e-02

Pos logit contributions
Of+0.095
 Mike+0.095
 oh+0.094
 YES+0.093
 how+0.093
Neg logit contributions
rys-0.139
alk-0.134
iny-0.126
umper-0.123
ky-0.122
Token on
Token ID319
Feature activation+7.12e-03

Pos logit contributions
 YES+0.012
anges+0.012
five+0.012
 thank+0.011
Of+0.011
Neg logit contributions
rys-0.020
iny-0.019
alk-0.019
 diary-0.019
zo-0.018
Token the
Token ID262
Feature activation+4.28e-03

Pos logit contributions
Of+0.007
anges+0.007
 YES+0.007
 were+0.006
Sure+0.006
Neg logit contributions
rys-0.015
alk-0.014
iny-0.014
ky-0.013
 diary-0.013
Token banana
Token ID25996
Feature activation+9.68e-03

Pos logit contributions
 were+0.004
 how+0.004
anges+0.004
 YES+0.004
 Mike+0.004
Neg logit contributions
rys-0.009
iny-0.008
 diary-0.008
alk-0.008
 battlefield-0.008
Token,
Token ID11
Feature activation+7.48e-03

Pos logit contributions
 YES+0.009
anges+0.008
Of+0.008
five+0.008
 were+0.008
Neg logit contributions
rys-0.021
alk-0.020
iny-0.020
 diary-0.020
zo-0.019
Token little
Token ID1310
Feature activation-7.26e-03

Pos logit contributions
 YES+0.008
 how+0.008
 Mike+0.008
 were+0.008
five+0.008
Neg logit contributions
rys-0.010
 diary-0.010
iny-0.010
ita-0.009
 universe-0.009
Token girl
Token ID2576
Feature activation+1.31e-03

Pos logit contributions
rys+0.014
alk+0.013
iny+0.013
zo+0.012
umper+0.012
Neg logit contributions
 were-0.009
five-0.009
 no-0.009
 thank-0.009
 yes-0.009
Token was
Token ID373
Feature activation+2.39e-03

Pos logit contributions
anges+0.002
Of+0.001
Not+0.001
five+0.001
 YES+0.001
Neg logit contributions
ci-0.003
ita-0.002
iny-0.002
zo-0.002
cker-0.002
Token thrilled
Token ID20536
Feature activation+2.04e-02

Pos logit contributions
five+0.003
Sure+0.003
rav+0.003
anges+0.002
haw+0.002
Neg logit contributions
 weeping-0.005
 awake-0.004
iny-0.004
ita-0.004
rys-0.004
Token and
Token ID290
Feature activation-6.81e-03

Pos logit contributions
five+0.020
 YES+0.019
anges+0.018
 were+0.018
 thank+0.018
Neg logit contributions
rys-0.043
iny-0.042
 diary-0.041
alk-0.040
ita-0.040
Token mun
Token ID29856
Feature activation+7.62e-02

Pos logit contributions
rys+0.011
ci+0.011
 diary+0.011
 weeping+0.010
ita+0.010
Neg logit contributions
five-0.009
anges-0.009
 YES-0.009
Eventually-0.008
Sure-0.008
Tokenched
Token ID1740
Feature activation+3.55e-03

Pos logit contributions
 Mike+0.106
 moms+0.102
Of+0.100
 oh+0.099
 how+0.099
Neg logit contributions
rys-0.132
alk-0.122
umper-0.117
iny-0.117
ky-0.114
Token away
Token ID1497
Feature activation+5.01e-03

Pos logit contributions
five+0.004
 YES+0.004
anges+0.004
 were+0.004
 thank+0.004
Neg logit contributions
rys-0.007
iny-0.006
 diary-0.006
alk-0.006
ita-0.006
Token happily
Token ID18177
Feature activation+7.00e-03

Pos logit contributions
 were+0.004
five+0.004
 YES+0.004
 how+0.004
 thank+0.004
Neg logit contributions
rys-0.011
iny-0.011
alk-0.010
 diary-0.010
enna-0.010
Token.
Token ID13
Feature activation+7.01e-03

Pos logit contributions
 were+0.008
five+0.008
 how+0.008
 no+0.008
 lots+0.008
Neg logit contributions
rys-0.014
alk-0.013
iny-0.013
 diary-0.013
umper-0.013
Token
Token ID220
Feature activation-1.13e-02

Pos logit contributions
five+0.009
 were+0.009
anges+0.009
 YES+0.009
 how+0.009
Neg logit contributions
rys-0.012
alk-0.012
iny-0.012
 diary-0.011
enna-0.011
Token "
Token ID366
Feature activation+3.24e-03

Pos logit contributions
 lots+0.025
 how+0.025
 were+0.024
five+0.024
 no+0.024
Neg logit contributions
rys-0.059
iny-0.056
ita-0.055
alk-0.055
enna-0.055
Tokench
Token ID354
Feature activation+3.74e-02

Pos logit contributions
 were+0.004
 thank+0.004
 how+0.004
 lots+0.004
anges+0.004
Neg logit contributions
rys-0.006
iny-0.006
alk-0.005
umper-0.005
enna-0.005
Tokenoo
Token ID2238
Feature activation+3.71e-02

Pos logit contributions
Of+0.044
Sure+0.041
 YES+0.041
anges+0.040
 yes+0.039
Neg logit contributions
alk-0.073
rys-0.073
iny-0.071
zo-0.067
ita-0.065
Token cho
Token ID1727
Feature activation+5.77e-02

Pos logit contributions
 were+0.041
 YES+0.040
Of+0.040
five+0.040
anges+0.040
Neg logit contributions
rys-0.072
iny-0.069
alk-0.067
zo-0.066
ita-0.065
Tokeno
Token ID78
Feature activation+2.41e-02

Pos logit contributions
 were+0.068
 thank+0.068
 how+0.067
 YES+0.067
 lots+0.067
Neg logit contributions
rys-0.108
iny-0.107
alk-0.104
zo-0.102
umper-0.100
Token cho
Token ID1727
Feature activation+6.36e-02

Pos logit contributions
 YES+0.029
 were+0.029
five+0.029
anges+0.029
Of+0.028
Neg logit contributions
rys-0.045
iny-0.043
alk-0.042
zo-0.041
ita-0.040
Tokeno
Token ID78
Feature activation+2.76e-02

Pos logit contributions
 were+0.075
 thank+0.074
 how+0.074
 YES+0.074
 lots+0.073
Neg logit contributions
rys-0.120
iny-0.117
alk-0.114
zo-0.111
umper-0.110
Token".
Token ID1911
Feature activation-1.67e-03

Pos logit contributions
 YES+0.034
 were+0.034
anges+0.034
five+0.033
 how+0.033
Neg logit contributions
rys-0.050
iny-0.048
alk-0.047
zo-0.046
ita-0.045
Token
Token ID198
Feature activation-4.62e-03

Pos logit contributions
rys+0.004
iny+0.003
alk+0.003
 diary+0.003
enna+0.003
Neg logit contributions
 were-0.001
five-0.001
 YES-0.001
 how-0.001
 thank-0.001
Token
Token ID198
Feature activation-1.22e-02

Pos logit contributions
rys+0.010
iny+0.010
alk+0.010
 diary+0.009
enna+0.009
Neg logit contributions
 were-0.004
five-0.004
 YES-0.004
 how-0.004
 thank-0.004
TokenThey
Token ID2990
Feature activation+1.18e-02

Pos logit contributions
rys+0.026
alk+0.025
iny+0.025
 diary+0.024
enna+0.024
Neg logit contributions
 were-0.011
five-0.011
 YES-0.011
 how-0.011
 lots-0.011
Token proud
Token ID6613
Feature activation+6.71e-03

Pos logit contributions
five+0.014
 YES+0.014
anges+0.014
 Mike+0.013
 were+0.013
Neg logit contributions
rys-0.033
iny-0.031
 weeping-0.030
 diary-0.030
enna-0.030
Token of
Token ID286
Feature activation-8.64e-04

Pos logit contributions
five+0.003
 YES+0.003
anges+0.003
 were+0.003
 lots+0.002
Neg logit contributions
rys-0.018
iny-0.017
 diary-0.017
alk-0.017
enna-0.017
Token her
Token ID607
Feature activation-4.30e-03

Pos logit contributions
rys+0.001
alk+0.001
iny+0.001
 diary+0.001
enna+0.001
Neg logit contributions
five-0.001
 YES-0.001
anges-0.001
 thank-0.001
 were-0.001
Token for
Token ID329
Feature activation+1.46e-02

Pos logit contributions
rys+0.009
 diary+0.009
iny+0.009
alk+0.008
ci+0.008
Neg logit contributions
anges-0.004
five-0.004
 YES-0.004
 were-0.004
Sure-0.004
Token being
Token ID852
Feature activation+1.08e-02

Pos logit contributions
five+0.018
 were+0.018
 YES+0.018
anges+0.017
 thank+0.017
Neg logit contributions
rys-0.027
iny-0.025
 diary-0.025
alk-0.025
enna-0.024
Token such
Token ID884
Feature activation+6.23e-02

Pos logit contributions
five+0.005
 were+0.005
 YES+0.005
 thank+0.004
anges+0.004
Neg logit contributions
rys-0.029
iny-0.027
 diary-0.027
alk-0.027
ita-0.026
Token an
Token ID281
Feature activation+2.29e-02

Pos logit contributions
five+0.053
 YES+0.047
 were+0.046
 Mike+0.045
anges+0.044
Neg logit contributions
rys-0.146
 diary-0.138
iny-0.134
alk-0.131
ita-0.130
Token obedient
Token ID41652
Feature activation+1.02e-02

Pos logit contributions
 were+0.031
anges+0.031
 YES+0.029
five+0.028
 Mike+0.028
Neg logit contributions
rys-0.042
 diary-0.039
iny-0.039
ind-0.037
 alien-0.037
Token and
Token ID290
Feature activation+4.34e-03

Pos logit contributions
 were+0.012
five+0.012
 thank+0.011
 how+0.011
 YES+0.011
Neg logit contributions
rys-0.020
alk-0.019
iny-0.019
 diary-0.018
enna-0.018
Token happy
Token ID3772
Feature activation+9.35e-03

Pos logit contributions
five+0.004
 were+0.004
 YES+0.004
anges+0.004
 how+0.004
Neg logit contributions
rys-0.009
iny-0.009
alk-0.009
 diary-0.009
enna-0.008
Token little
Token ID1310
Feature activation+1.74e-03

Pos logit contributions
 were+0.010
 how+0.010
 thank+0.010
 no+0.010
 lots+0.010
Neg logit contributions
rys-0.020
iny-0.019
alk-0.019
umper-0.018
enna-0.018
Token,"
Token ID553
Feature activation-7.76e-03

Pos logit contributions
 were+0.037
five+0.037
 YES+0.037
anges+0.036
 thank+0.035
Neg logit contributions
rys-0.091
iny-0.087
alk-0.086
 diary-0.085
ita-0.084
TokenBe
Token ID3856
Feature activation+2.99e-02

Pos logit contributions
rys+0.013
 diary+0.013
 violin+0.012
ci+0.011
ita+0.011
Neg logit contributions
five-0.011
oops-0.011
 Mike-0.011
anges-0.010
 of-0.010
Token good
Token ID922
Feature activation+9.27e-03

Pos logit contributions
 were+0.037
five+0.037
 thank+0.036
 YES+0.035
Not+0.034
Neg logit contributions
iny-0.053
rys-0.052
ita-0.052
 diary-0.050
ci-0.049
Token,
Token ID11
Feature activation-3.14e-03

Pos logit contributions
five+0.007
 YES+0.007
 were+0.007
anges+0.007
 lots+0.007
Neg logit contributions
rys-0.021
iny-0.020
 diary-0.020
alk-0.020
ita-0.020
Token Holly
Token ID7764
Feature activation+3.97e-03

Pos logit contributions
rys+0.007
alk+0.006
iny+0.006
 diary+0.006
ita+0.006
Neg logit contributions
five-0.003
 were-0.003
anges-0.003
 YES-0.003
 lots-0.003
Token,.
Token ID38508
Feature activation+6.17e-02

Pos logit contributions
 were+0.004
five+0.004
 how+0.003
 YES+0.003
 thank+0.003
Neg logit contributions
rys-0.009
alk-0.008
iny-0.008
 diary-0.008
umper-0.008
Token I
Token ID314
Feature activation+3.14e-03

Pos logit contributions
five+0.074
 thank+0.073
 YES+0.071
Sure+0.071
Of+0.071
Neg logit contributions
rys-0.116
iny-0.114
alk-0.110
cker-0.110
ations-0.109
Token'll
Token ID1183
Feature activation+2.55e-02

Pos logit contributions
five+0.003
 YES+0.003
anges+0.003
 how+0.003
 were+0.003
Neg logit contributions
rys-0.007
iny-0.007
alk-0.007
 diary-0.006
zo-0.006
Token be
Token ID307
Feature activation+4.03e-03

Pos logit contributions
 were+0.022
five+0.022
 YES+0.022
 how+0.021
anges+0.021
Neg logit contributions
rys-0.056
iny-0.053
alk-0.053
 diary-0.052
enna-0.051
Token back
Token ID736
Feature activation+2.18e-02

Pos logit contributions
 were+0.004
five+0.004
 how+0.003
 lots+0.003
 YES+0.003
Neg logit contributions
rys-0.009
alk-0.009
iny-0.008
 diary-0.008
zo-0.008
Token soon
Token ID2582
Feature activation+1.45e-02

Pos logit contributions
five+0.016
 were+0.016
 YES+0.016
anges+0.015
 thank+0.015
Neg logit contributions
rys-0.050
iny-0.048
 diary-0.048
alk-0.048
ita-0.047
Token happy
Token ID3772
Feature activation+1.05e-02

Pos logit contributions
anges+0.009
 YES+0.008
five+0.008
Sure+0.008
 oh+0.008
Neg logit contributions
rys-0.014
iny-0.012
 weeping-0.012
zo-0.012
alk-0.012
Token that
Token ID326
Feature activation+6.68e-03

Pos logit contributions
five+0.009
 YES+0.008
anges+0.008
Sure+0.008
 were+0.008
Neg logit contributions
rys-0.024
alk-0.023
iny-0.022
 diary-0.022
zo-0.022
Token they
Token ID484
Feature activation-1.26e-02

Pos logit contributions
five+0.004
 were+0.004
 YES+0.004
 how+0.004
 thank+0.004
Neg logit contributions
rys-0.016
iny-0.016
alk-0.016
 diary-0.015
enna-0.015
Token had
Token ID550
Feature activation+4.44e-03

Pos logit contributions
rys+0.027
iny+0.025
enna+0.025
zo+0.025
alk+0.025
Neg logit contributions
five-0.012
anges-0.012
 YES-0.011
Sure-0.011
 how-0.010
Token seen
Token ID1775
Feature activation+2.20e-02

Pos logit contributions
Sure+0.005
five+0.005
anges+0.005
 YES+0.005
Of+0.005
Neg logit contributions
rys-0.008
iny-0.007
alk-0.007
enna-0.007
zo-0.007
Token such
Token ID884
Feature activation+6.14e-02

Pos logit contributions
five+0.017
 YES+0.016
anges+0.016
Sure+0.016
Of+0.015
Neg logit contributions
rys-0.051
iny-0.048
alk-0.048
 diary-0.048
zo-0.047
Token a
Token ID257
Feature activation+1.64e-02

Pos logit contributions
five+0.041
 were+0.039
 YES+0.038
anges+0.038
Sure+0.036
Neg logit contributions
rys-0.150
iny-0.142
alk-0.141
 diary-0.141
ita-0.137
Token wonderful
Token ID7932
Feature activation+1.31e-02

Pos logit contributions
 were+0.018
 thank+0.017
five+0.017
 lots+0.017
 no+0.017
Neg logit contributions
rys-0.034
alk-0.032
iny-0.032
enna-0.031
umper-0.031
Token sight
Token ID6504
Feature activation+1.62e-02

Pos logit contributions
 thank+0.020
 were+0.020
 how+0.020
 lots+0.020
five+0.020
Neg logit contributions
rys-0.021
umper-0.020
iny-0.020
enna-0.019
alk-0.019
Token.
Token ID13
Feature activation+6.62e-03

Pos logit contributions
 were+0.016
 how+0.015
five+0.015
 thank+0.015
 no+0.015
Neg logit contributions
rys-0.035
iny-0.033
alk-0.033
 diary-0.032
umper-0.032
Token The
Token ID383
Feature activation+1.66e-02

Pos logit contributions
five+0.006
 YES+0.006
anges+0.006
 yes+0.006
 thank+0.006
Neg logit contributions
rys-0.014
alk-0.013
iny-0.013
enna-0.013
 diary-0.013
Token all
Token ID477
Feature activation+2.07e-02

Pos logit contributions
rys+0.031
alk+0.029
 diary+0.029
iny+0.029
zo+0.029
Neg logit contributions
five-0.011
anges-0.011
 YES-0.011
Sure-0.010
Of-0.010
Token applauded
Token ID38911
Feature activation+7.83e-03

Pos logit contributions
 were+0.017
 how+0.016
 thank+0.016
five+0.016
 lots+0.016
Neg logit contributions
rys-0.048
iny-0.046
alk-0.045
 diary-0.045
enna-0.044
Token him
Token ID683
Feature activation+2.83e-03

Pos logit contributions
 were+0.006
five+0.006
 how+0.006
 YES+0.006
 thank+0.006
Neg logit contributions
rys-0.018
alk-0.017
iny-0.017
 diary-0.017
enna-0.017
Token for
Token ID329
Feature activation+1.62e-02

Pos logit contributions
 were+0.003
five+0.003
 YES+0.002
anges+0.002
 how+0.002
Neg logit contributions
rys-0.006
iny-0.006
alk-0.006
 diary-0.006
zo-0.006
Token being
Token ID852
Feature activation+1.07e-02

Pos logit contributions
five+0.016
 were+0.016
 YES+0.016
 how+0.016
 thank+0.016
Neg logit contributions
rys-0.034
iny-0.032
alk-0.032
 diary-0.031
enna-0.030
Token such
Token ID884
Feature activation+6.13e-02

Pos logit contributions
five+0.006
 YES+0.005
 were+0.005
Sure+0.005
anges+0.005
Neg logit contributions
rys-0.028
iny-0.026
 diary-0.026
zo-0.026
umper-0.026
Token a
Token ID257
Feature activation+2.02e-02

Pos logit contributions
five+0.059
Sure+0.054
 YES+0.054
anges+0.052
Of+0.051
Neg logit contributions
rys-0.133
 diary-0.126
iny-0.123
alk-0.120
 alien-0.119
Token good
Token ID922
Feature activation+8.84e-03

Pos logit contributions
anges+0.020
oops+0.020
five+0.020
 were+0.020
 YES+0.019
Neg logit contributions
rys-0.042
 diary-0.041
iny-0.039
 universe-0.039
 violin-0.038
Token pup
Token ID15552
Feature activation+2.43e-02

Pos logit contributions
 were+0.011
five+0.011
 thank+0.011
 no+0.011
 how+0.011
Neg logit contributions
rys-0.017
iny-0.016
alk-0.016
enna-0.015
 diary-0.015
Token.
Token ID13
Feature activation+5.63e-03

Pos logit contributions
 were+0.029
five+0.029
 YES+0.029
 how+0.029
 thank+0.028
Neg logit contributions
rys-0.045
iny-0.042
alk-0.042
 diary-0.042
enna-0.041
Token<|endoftext|>
Token ID50256
Feature activation-1.81e-02

Pos logit contributions
five+0.005
 were+0.005
 YES+0.005
 how+0.005
 thank+0.005
Neg logit contributions
rys-0.013
iny-0.012
alk-0.012
 diary-0.012
enna-0.011
Token he
Token ID339
Feature activation-9.73e-03

Pos logit contributions
rys+0.014
iny+0.013
alk+0.013
 diary+0.013
enna+0.013
Neg logit contributions
 were-0.004
five-0.004
 YES-0.004
 how-0.004
 thank-0.004
Token would
Token ID561
Feature activation+6.42e-04

Pos logit contributions
rys+0.023
iny+0.022
alk+0.021
 diary+0.021
zo+0.021
Neg logit contributions
five-0.007
anges-0.007
 YES-0.007
 thank-0.007
Sure-0.007
Token never
Token ID1239
Feature activation+2.32e-02

Pos logit contributions
 were+0.001
 thank+0.001
 no+0.001
five+0.001
 how+0.001
Neg logit contributions
rys-0.001
iny-0.001
alk-0.001
 diary-0.001
enna-0.001
Token have
Token ID423
Feature activation+5.06e-03

Pos logit contributions
 were+0.031
 thank+0.031
 no+0.030
 how+0.028
 so+0.028
Neg logit contributions
iny-0.043
rys-0.043
ind-0.041
 diary-0.041
alk-0.041
Token seen
Token ID1775
Feature activation+2.80e-02

Pos logit contributions
five+0.005
 YES+0.005
Sure+0.005
anges+0.005
 were+0.005
Neg logit contributions
rys-0.010
alk-0.009
 diary-0.009
iny-0.009
enna-0.009
Token such
Token ID884
Feature activation+6.13e-02

Pos logit contributions
five+0.025
 were+0.024
 YES+0.024
anges+0.023
 thank+0.023
Neg logit contributions
rys-0.062
iny-0.058
alk-0.058
 diary-0.058
enna-0.056
Token a
Token ID257
Feature activation+2.20e-02

Pos logit contributions
five+0.039
 were+0.039
 YES+0.037
anges+0.036
 thank+0.036
Neg logit contributions
rys-0.151
iny-0.143
alk-0.143
 diary-0.141
enna-0.138
Token wonderful
Token ID7932
Feature activation+1.61e-02

Pos logit contributions
 were+0.026
five+0.025
 YES+0.025
 how+0.025
 thank+0.025
Neg logit contributions
rys-0.043
iny-0.040
alk-0.040
 diary-0.039
enna-0.038
Token sight
Token ID6504
Feature activation+1.99e-02

Pos logit contributions
 were+0.024
five+0.024
 thank+0.024
 how+0.023
 YES+0.023
Neg logit contributions
rys-0.026
iny-0.025
alk-0.024
umper-0.024
enna-0.024
Token!
Token ID0
Feature activation+2.57e-02

Pos logit contributions
 how+0.016
 were+0.016
 no+0.016
 thank+0.015
 of+0.015
Neg logit contributions
rys-0.046
alk-0.044
iny-0.044
 diary-0.043
umper-0.043
Token<|endoftext|>
Token ID50256
Feature activation-1.41e-02

Pos logit contributions
anges+0.031
five+0.029
 yes+0.029
 how+0.029
 YES+0.029
Neg logit contributions
rys-0.046
enna-0.045
iny-0.044
alk-0.044
 diary-0.043
Token would
Token ID561
Feature activation-5.58e-03

Pos logit contributions
rys+0.033
iny+0.033
 diary+0.032
alk+0.032
zo+0.031
Neg logit contributions
five-0.014
 YES-0.014
anges-0.013
 thank-0.013
 were-0.013
Token take
Token ID1011
Feature activation-7.02e-03

Pos logit contributions
rys+0.010
alk+0.009
 diary+0.009
iny+0.009
 battlefield+0.009
Neg logit contributions
five-0.007
 YES-0.007
anges-0.007
 were-0.007
Of-0.007
Token out
Token ID503
Feature activation+4.66e-03

Pos logit contributions
rys+0.014
 diary+0.014
iny+0.014
alk+0.014
 battlefield+0.013
Neg logit contributions
 YES-0.007
Of-0.007
Sure-0.007
 were-0.007
 yes-0.007
Token a
Token ID257
Feature activation-3.34e-04

Pos logit contributions
 were+0.004
five+0.003
 how+0.003
 lots+0.003
 YES+0.003
Neg logit contributions
rys-0.011
iny-0.010
alk-0.010
 diary-0.010
enna-0.010
Token j
Token ID474
Feature activation-5.74e-03

Pos logit contributions
rys+0.001
umper+0.001
ishment+0.001
iny+0.001
ind+0.001
Neg logit contributions
 lots-0.000
 thank-0.000
 yes-0.000
 good-0.000
 were-0.000
Tokenigsaw
Token ID45636
Feature activation-7.95e-02

Pos logit contributions
rys+0.011
iny+0.010
alk+0.010
 diary+0.010
enna+0.010
Neg logit contributions
five-0.007
 thank-0.007
 were-0.007
 of-0.007
 lots-0.007
Token puzzle
Token ID15027
Feature activation-2.41e-02

Pos logit contributions
rys+0.117
umper+0.103
alk+0.102
iny+0.102
enna+0.099
Neg logit contributions
 were-0.134
five-0.133
 lots-0.132
 how-0.132
 of-0.130
Token and
Token ID290
Feature activation-2.52e-03

Pos logit contributions
rys+0.060
alk+0.056
iny+0.056
umper+0.055
enna+0.055
Neg logit contributions
 were-0.016
 how-0.016
 lots-0.015
five-0.015
 no-0.014
Token try
Token ID1949
Feature activation-5.50e-03

Pos logit contributions
rys+0.005
iny+0.005
alk+0.005
umper+0.004
ita+0.004
Neg logit contributions
 were-0.003
 how-0.003
 thank-0.003
 lots-0.003
five-0.003
Token to
Token ID284
Feature activation-9.33e-03

Pos logit contributions
rys+0.014
iny+0.013
alk+0.013
 diary+0.013
enna+0.013
Neg logit contributions
 how-0.004
 lots-0.004
 were-0.003
five-0.003
 no-0.003
Token solve
Token ID8494
Feature activation-4.35e-02

Pos logit contributions
rys+0.019
iny+0.017
alk+0.017
enna+0.017
umper+0.017
Neg logit contributions
 were-0.011
 how-0.011
 thank-0.011
 lots-0.011
five-0.010
Token box
Token ID3091
Feature activation-2.75e-02

Pos logit contributions
rys+0.012
iny+0.011
alk+0.011
zo+0.011
enna+0.011
Neg logit contributions
 were-0.012
five-0.012
 how-0.012
 lots-0.012
 YES-0.012
Token and
Token ID290
Feature activation-7.07e-03

Pos logit contributions
rys+0.060
iny+0.058
alk+0.058
 diary+0.058
enna+0.057
Neg logit contributions
five-0.023
 YES-0.022
anges-0.022
 thank-0.022
 were-0.021
Token saw
Token ID2497
Feature activation-9.88e-03

Pos logit contributions
rys+0.012
 diary+0.011
ci+0.011
enna+0.011
ky+0.011
Neg logit contributions
five-0.009
 YES-0.009
 thank-0.009
Eventually-0.008
anges-0.008
Token a
Token ID257
Feature activation+3.13e-03

Pos logit contributions
 diary+0.018
 alien+0.018
 pen+0.017
 weeping+0.017
 universe+0.017
Neg logit contributions
rav-0.012
five-0.011
 YES-0.011
Not-0.011
Of-0.011
Token j
Token ID474
Feature activation-4.03e-03

Pos logit contributions
 were+0.004
five+0.004
 YES+0.004
 how+0.004
 thank+0.004
Neg logit contributions
rys-0.005
iny-0.005
alk-0.005
 diary-0.005
enna-0.005
Tokenigsaw
Token ID45636
Feature activation-7.35e-02

Pos logit contributions
rys+0.008
iny+0.008
 diary+0.007
ations+0.007
ci+0.007
Neg logit contributions
five-0.005
 thank-0.004
 of-0.004
 Mike-0.004
 were-0.004
Token puzzle
Token ID15027
Feature activation-1.80e-02

Pos logit contributions
rys+0.127
iny+0.115
alk+0.114
umper+0.112
enna+0.111
Neg logit contributions
 were-0.106
five-0.103
 how-0.102
 thank-0.102
 lots-0.101
Token inside
Token ID2641
Feature activation-5.95e-03

Pos logit contributions
rys+0.036
iny+0.034
alk+0.034
 diary+0.033
umper+0.033
Neg logit contributions
 were-0.020
five-0.019
 how-0.019
 lots-0.019
 thank-0.019
Token.
Token ID13
Feature activation+1.93e-04

Pos logit contributions
rys+0.010
alk+0.009
iny+0.009
 diary+0.009
enna+0.009
Neg logit contributions
five-0.008
 YES-0.008
anges-0.008
Of-0.008
Sure-0.008
Token He
Token ID679
Feature activation+7.94e-03

Pos logit contributions
five+0.000
 were+0.000
anges+0.000
 no+0.000
 YES+0.000
Neg logit contributions
rys-0.000
alk-0.000
enna-0.000
 diary-0.000
 battlefield-0.000
Token paused
Token ID24487
Feature activation+2.05e-03

Pos logit contributions
five+0.011
Not+0.011
 Enjoy+0.011
ocolate+0.010
 goodbye+0.010
Neg logit contributions
 pen-0.012
 bur-0.012
zo-0.012
ky-0.012
ita-0.012
Token As
Token ID1081
Feature activation-5.02e-04

Pos logit contributions
rys+0.005
iny+0.004
alk+0.004
 diary+0.004
ci+0.004
Neg logit contributions
 were-0.002
five-0.002
 lots-0.002
 YES-0.002
 how-0.002
Token the
Token ID262
Feature activation-1.21e-02

Pos logit contributions
rys+0.001
alk+0.001
iny+0.001
 diary+0.001
umper+0.001
Neg logit contributions
 were-0.000
 lots-0.000
 no-0.000
 how-0.000
 thank-0.000
Token farmer
Token ID18739
Feature activation-5.38e-03

Pos logit contributions
rys+0.023
alk+0.021
iny+0.021
umper+0.020
enna+0.020
Neg logit contributions
 thank-0.017
 were-0.016
 lots-0.016
 yes-0.016
 no-0.016
Token scattered
Token ID16830
Feature activation-7.63e-05

Pos logit contributions
rys+0.012
iny+0.012
alk+0.012
 diary+0.011
enna+0.011
Neg logit contributions
 were-0.004
five-0.004
 YES-0.004
 how-0.004
 thank-0.004
Token the
Token ID262
Feature activation-1.28e-02

Pos logit contributions
rys+0.000
alk+0.000
iny+0.000
 diary+0.000
 battlefield+0.000
Neg logit contributions
five-0.000
 YES-0.000
Of-0.000
anges-0.000
Sure-0.000
Token av
Token ID1196
Feature activation-7.19e-02

Pos logit contributions
rys+0.021
iny+0.019
alk+0.019
ita+0.019
enna+0.019
Neg logit contributions
 lots-0.020
 were-0.019
 so-0.019
 thank-0.018
 no-0.018
Tokenoc
Token ID420
Feature activation+3.14e-02

Pos logit contributions
rys+0.150
 diary+0.139
 weeping+0.136
alk+0.136
 avail+0.132
Neg logit contributions
five-0.075
 were-0.073
 lots-0.072
 how-0.068
anges-0.066
Tokenados
Token ID22484
Feature activation-3.02e-02

Pos logit contributions
five+0.054
 were+0.054
 how+0.053
 YES+0.053
 thank+0.052
Neg logit contributions
rys-0.043
iny-0.039
alk-0.039
 diary-0.038
enna-0.037
Token,
Token ID11
Feature activation-2.28e-03

Pos logit contributions
rys+0.071
iny+0.067
alk+0.066
 diary+0.065
ita+0.064
Neg logit contributions
 were-0.024
 how-0.023
five-0.023
 lots-0.022
 YES-0.022
Token he
Token ID339
Feature activation-9.24e-03

Pos logit contributions
rys+0.005
iny+0.005
alk+0.005
 diary+0.005
enna+0.005
Neg logit contributions
five-0.002
 were-0.002
 YES-0.002
 how-0.002
 thank-0.002
Token noticed
Token ID6810
Feature activation+1.37e-02

Pos logit contributions
rys+0.019
iny+0.018
alk+0.018
 diary+0.017
zo+0.017
Neg logit contributions
five-0.010
 YES-0.009
anges-0.009
 thank-0.009
Of-0.009
Token and
Token ID290
Feature activation-3.24e-03

Pos logit contributions
iny+0.001
rys+0.001
alk+0.001
ita+0.001
ci+0.001
Neg logit contributions
 YES-0.001
Of-0.001
 thank-0.001
anges-0.001
five-0.001
Token her
Token ID607
Feature activation-1.37e-02

Pos logit contributions
rys+0.005
 diary+0.005
iny+0.005
alk+0.005
enna+0.004
Neg logit contributions
anges-0.005
 YES-0.005
five-0.005
Of-0.005
Eventually-0.005
Token mom
Token ID1995
Feature activation+6.24e-03

Pos logit contributions
rys+0.030
 diary+0.030
iny+0.029
alk+0.028
enna+0.028
Neg logit contributions
five-0.012
 YES-0.011
anges-0.011
 Mike-0.011
 were-0.011
Token sorted
Token ID23243
Feature activation-8.64e-03

Pos logit contributions
 YES+0.008
 thank+0.008
five+0.008
anges+0.008
Of+0.008
Neg logit contributions
enna-0.010
iny-0.010
ci-0.010
rys-0.010
ky-0.010
Token the
Token ID262
Feature activation-1.41e-02

Pos logit contributions
rys+0.018
alk+0.018
 diary+0.017
iny+0.017
ishment+0.017
Neg logit contributions
five-0.008
 YES-0.008
 thank-0.008
Of-0.008
 yes-0.008
Token av
Token ID1196
Feature activation-7.15e-02

Pos logit contributions
rys+0.022
iny+0.020
alk+0.020
zo+0.019
 diary+0.019
Neg logit contributions
 were-0.022
five-0.022
 how-0.022
 lots-0.022
 YES-0.022
Tokenoc
Token ID420
Feature activation+2.58e-02

Pos logit contributions
rys+0.129
alk+0.118
iny+0.117
 diary+0.117
zo+0.113
Neg logit contributions
five-0.093
 were-0.092
 lots-0.091
 how-0.090
anges-0.089
Tokenados
Token ID22484
Feature activation-3.27e-02

Pos logit contributions
 were+0.047
 YES+0.047
five+0.046
 lots+0.046
 yes+0.046
Neg logit contributions
rys-0.033
alk-0.029
iny-0.029
 diary-0.029
umper-0.028
Token into
Token ID656
Feature activation-1.22e-02

Pos logit contributions
rys+0.070
alk+0.066
iny+0.066
 diary+0.065
enna+0.063
Neg logit contributions
five-0.031
 were-0.031
 YES-0.031
 thank-0.030
anges-0.029
Token piles
Token ID30935
Feature activation+1.31e-02

Pos logit contributions
rys+0.030
iny+0.028
alk+0.028
zo+0.028
umper+0.027
Neg logit contributions
 were-0.010
 lots-0.009
 how-0.009
 no-0.009
anges-0.008
Token according
Token ID1864
Feature activation-7.12e-03

Pos logit contributions
five+0.021
 YES+0.021
Of+0.021
Sure+0.020
 thank+0.020
Neg logit contributions
alk-0.019
rys-0.018
iny-0.017
 diary-0.017
enna-0.017
Token,
Token ID11
Feature activation+3.37e-03

Pos logit contributions
rys+0.017
umper+0.016
iny+0.015
alk+0.015
 avail+0.015
Neg logit contributions
 were-0.008
 no-0.008
 lots-0.007
 how-0.007
 so-0.007
Token and
Token ID290
Feature activation-1.72e-02

Pos logit contributions
five+0.003
anges+0.003
 YES+0.003
 were+0.003
Sure+0.003
Neg logit contributions
rys-0.007
 diary-0.007
alk-0.007
iny-0.007
 weeping-0.007
Token all
Token ID477
Feature activation+4.28e-03

Pos logit contributions
rys+0.037
 diary+0.035
alk+0.035
iny+0.034
ci+0.034
Neg logit contributions
five-0.015
 YES-0.015
anges-0.015
 were-0.014
Sure-0.014
Token the
Token ID262
Feature activation-1.74e-02

Pos logit contributions
 were+0.003
 how+0.003
 thank+0.003
five+0.003
 lots+0.003
Neg logit contributions
rys-0.011
iny-0.010
alk-0.010
 diary-0.010
enna-0.010
Token other
Token ID584
Feature activation-2.90e-02

Pos logit contributions
rys+0.037
iny+0.034
alk+0.034
 diary+0.034
enna+0.033
Neg logit contributions
 were-0.017
five-0.017
 YES-0.017
 how-0.017
 thank-0.017
Token av
Token ID1196
Feature activation-7.03e-02

Pos logit contributions
rys+0.053
 diary+0.051
 universe+0.050
iny+0.049
alk+0.049
Neg logit contributions
 YES-0.036
five-0.036
anges-0.035
Sure-0.034
Of-0.033
Tokenoc
Token ID420
Feature activation+4.10e-02

Pos logit contributions
rys+0.127
alk+0.118
iny+0.117
 diary+0.115
enna+0.112
Neg logit contributions
 were-0.091
five-0.091
 how-0.089
 YES-0.088
 lots-0.088
Tokenados
Token ID22484
Feature activation-2.83e-02

Pos logit contributions
 were+0.082
 YES+0.081
 yes+0.081
 lots+0.080
Sure+0.079
Neg logit contributions
rys-0.044
alk-0.041
iny-0.039
umper-0.039
 diary-0.038
Token were
Token ID547
Feature activation+2.64e-02

Pos logit contributions
rys+0.067
alk+0.064
iny+0.064
 diary+0.063
enna+0.062
Neg logit contributions
five-0.020
 YES-0.019
 were-0.019
anges-0.019
 thank-0.018
Token very
Token ID845
Feature activation+1.84e-04

Pos logit contributions
five+0.026
Sure+0.024
 YES+0.024
Of+0.024
anges+0.024
Neg logit contributions
rys-0.056
iny-0.054
 diary-0.053
alk-0.053
 weeping-0.051
Token happy
Token ID3772
Feature activation+7.33e-03

Pos logit contributions
five+0.000
 YES+0.000
anges+0.000
 were+0.000
 how+0.000
Neg logit contributions
rys-0.000
iny-0.000
ind-0.000
zo-0.000
alk-0.000
Token continued
Token ID3767
Feature activation+6.43e-03

Pos logit contributions
rys+0.020
iny+0.020
zo+0.019
alk+0.019
 diary+0.019
Neg logit contributions
anges-0.008
 YES-0.008
five-0.008
 oh-0.008
 yes-0.008
Token to
Token ID284
Feature activation-5.78e-03

Pos logit contributions
 how+0.004
 lots+0.003
 no+0.003
 so+0.003
 thank+0.003
Neg logit contributions
rys-0.017
enna-0.017
iny-0.017
umper-0.017
ishment-0.016
Token enjoy
Token ID2883
Feature activation+9.38e-03

Pos logit contributions
rys+0.013
iny+0.012
enna+0.012
umper+0.012
ind+0.012
Neg logit contributions
 thank-0.006
 were-0.006
 no-0.006
 how-0.006
 are-0.005
Token eating
Token ID6600
Feature activation+1.32e-02

Pos logit contributions
 how+0.007
 lots+0.007
 no+0.006
 were+0.006
 so+0.006
Neg logit contributions
rys-0.024
iny-0.023
enna-0.023
alk-0.023
umper-0.022
Token her
Token ID607
Feature activation-1.38e-02

Pos logit contributions
five+0.012
 YES+0.012
 were+0.012
anges+0.012
Sure+0.011
Neg logit contributions
rys-0.028
alk-0.027
 diary-0.026
iny-0.026
ita-0.025
Token av
Token ID1196
Feature activation-7.00e-02

Pos logit contributions
rys+0.029
iny+0.028
alk+0.027
enna+0.027
umper+0.027
Neg logit contributions
 were-0.014
 lots-0.014
 how-0.014
five-0.014
 thank-0.013
Tokenoc
Token ID420
Feature activation+3.03e-02

Pos logit contributions
rys+0.142
alk+0.129
 diary+0.128
enna+0.125
 weeping+0.124
Neg logit contributions
five-0.078
 lots-0.076
 were-0.076
 how-0.074
 thank-0.072
Tokenados
Token ID22484
Feature activation-2.79e-02

Pos logit contributions
five+0.052
 were+0.051
 how+0.051
 YES+0.051
 thank+0.050
Neg logit contributions
rys-0.042
iny-0.038
alk-0.038
 diary-0.037
enna-0.036
Token with
Token ID351
Feature activation+4.44e-04

Pos logit contributions
rys+0.070
iny+0.065
alk+0.065
umper+0.063
 diary+0.063
Neg logit contributions
 were-0.020
 how-0.019
 lots-0.019
 no-0.018
 so-0.018
Token a
Token ID257
Feature activation+8.86e-03

Pos logit contributions
 were+0.000
 lots+0.000
 how+0.000
 no+0.000
 thank+0.000
Neg logit contributions
rys-0.001
iny-0.001
alk-0.001
 diary-0.001
zo-0.001
Token smile
Token ID8212
Feature activation+8.77e-03

Pos logit contributions
 were+0.010
 thank+0.010
 yes+0.009
 no+0.009
 lots+0.009
Neg logit contributions
rys-0.018
iny-0.017
alk-0.017
umper-0.017
ci-0.017
Token It
Token ID632
Feature activation+1.85e-02

Pos logit contributions
 Mike+0.002
 were+0.002
 lots+0.002
 YES+0.002
 how+0.002
Neg logit contributions
rys-0.008
iny-0.007
alk-0.007
 diary-0.007
ations-0.007
Token turned
Token ID2900
Feature activation-2.03e-03

Pos logit contributions
 were+0.016
 how+0.015
five+0.015
 no+0.015
 lots+0.015
Neg logit contributions
rys-0.042
alk-0.040
iny-0.039
 diary-0.039
enna-0.038
Token out
Token ID503
Feature activation+6.62e-04

Pos logit contributions
rys+0.004
alk+0.004
iny+0.004
 diary+0.004
enna+0.004
Neg logit contributions
 were-0.002
 lots-0.002
 how-0.002
 no-0.002
 thank-0.002
Token that
Token ID326
Feature activation-3.56e-03

Pos logit contributions
five+0.001
anges+0.000
 YES+0.000
 were+0.000
Sure+0.000
Neg logit contributions
rys-0.002
iny-0.001
alk-0.001
 diary-0.001
ita-0.001
Token the
Token ID262
Feature activation-1.02e-02

Pos logit contributions
rys+0.009
iny+0.008
alk+0.008
umper+0.008
enna+0.008
Neg logit contributions
 were-0.002
 how-0.002
 lots-0.002
 no-0.002
 thank-0.002
Token av
Token ID1196
Feature activation-6.98e-02

Pos logit contributions
rys+0.019
alk+0.018
iny+0.018
enna+0.018
umper+0.017
Neg logit contributions
 no-0.013
 were-0.013
 thank-0.013
 lots-0.013
 yes-0.013
Tokenoc
Token ID420
Feature activation+3.24e-02

Pos logit contributions
rys+0.145
alk+0.132
 diary+0.132
enna+0.129
iny+0.129
Neg logit contributions
five-0.075
 were-0.073
 lots-0.071
 how-0.068
 thank-0.067
Tokenados
Token ID22484
Feature activation-2.40e-02

Pos logit contributions
 were+0.058
 YES+0.057
five+0.057
 lots+0.057
 how+0.056
Neg logit contributions
rys-0.042
alk-0.038
iny-0.038
 diary-0.037
umper-0.036
Token were
Token ID547
Feature activation+1.83e-02

Pos logit contributions
rys+0.063
iny+0.060
alk+0.060
 diary+0.059
enna+0.058
Neg logit contributions
 were-0.011
five-0.011
 how-0.010
 YES-0.010
 thank-0.010
Token bad
Token ID2089
Feature activation-6.14e-03

Pos logit contributions
 were+0.023
five+0.023
 YES+0.023
 how+0.023
 thank+0.023
Neg logit contributions
rys-0.033
iny-0.031
alk-0.031
 diary-0.030
enna-0.029
Token and
Token ID290
Feature activation-2.06e-03

Pos logit contributions
rys+0.014
iny+0.013
alk+0.013
 diary+0.013
zo+0.012
Neg logit contributions
five-0.005
 YES-0.005
 were-0.005
anges-0.005
Sure-0.005
Token and
Token ID290
Feature activation+2.87e-03

Pos logit contributions
rys+0.026
iny+0.025
alk+0.025
 diary+0.024
zo+0.024
Neg logit contributions
five-0.009
 YES-0.009
anges-0.009
 were-0.009
 thank-0.009
Token try
Token ID1949
Feature activation-6.75e-04

Pos logit contributions
 thank+0.003
 were+0.003
five+0.003
 how+0.003
 no+0.003
Neg logit contributions
rys-0.006
iny-0.006
enna-0.006
umper-0.005
alk-0.005
Token to
Token ID284
Feature activation-9.60e-03

Pos logit contributions
rys+0.002
alk+0.001
iny+0.001
 diary+0.001
zo+0.001
Neg logit contributions
five-0.001
 were-0.001
 YES-0.001
 thank-0.001
anges-0.000
Token grab
Token ID5552
Feature activation-1.18e-02

Pos logit contributions
rys+0.021
enna+0.021
 diary+0.020
iny+0.020
ita+0.020
Neg logit contributions
 thank-0.009
 were-0.009
 no-0.009
five-0.009
 how-0.009
Token the
Token ID262
Feature activation-5.65e-03

Pos logit contributions
rys+0.024
alk+0.022
ishment+0.022
ky+0.022
iny+0.022
Neg logit contributions
 YES-0.012
Of-0.012
five-0.011
Eventually-0.011
Sure-0.011
Token av
Token ID1196
Feature activation-6.93e-02

Pos logit contributions
rys+0.009
iny+0.009
ita+0.009
enna+0.009
zo+0.009
Neg logit contributions
 were-0.008
 thank-0.008
 lots-0.008
 no-0.008
 how-0.008
Tokenoc
Token ID420
Feature activation+2.93e-02

Pos logit contributions
rys+0.134
 diary+0.123
alk+0.123
iny+0.120
enna+0.120
Neg logit contributions
five-0.083
 were-0.079
 lots-0.078
 how-0.078
anges-0.077
Tokenados
Token ID22484
Feature activation-2.49e-02

Pos logit contributions
 were+0.052
 YES+0.051
five+0.051
 how+0.051
 lots+0.051
Neg logit contributions
rys-0.039
alk-0.035
iny-0.035
 diary-0.034
umper-0.033
Token.
Token ID13
Feature activation-8.52e-04

Pos logit contributions
rys+0.057
iny+0.053
 diary+0.052
alk+0.052
enna+0.052
Neg logit contributions
 how-0.024
 were-0.023
 so-0.022
 no-0.022
 lots-0.022
Token But
Token ID887
Feature activation+7.81e-03

Pos logit contributions
rys+0.001
alk+0.001
iny+0.001
 battlefield+0.001
older+0.001
Neg logit contributions
five-0.001
 YES-0.001
 were-0.001
 how-0.001
 yes-0.001
Token the
Token ID262
Feature activation-1.15e-04

Pos logit contributions
five+0.009
Of+0.008
Sure+0.008
 YES+0.008
Eventually+0.008
Neg logit contributions
rys-0.015
 diary-0.014
 avail-0.014
alk-0.013
 bean-0.013
Token so
Token ID523
Feature activation+6.66e-03

Pos logit contributions
five+0.003
Sure+0.003
rav+0.003
Of+0.003
anges+0.003
Neg logit contributions
 weeping-0.006
rys-0.005
 awake-0.005
iny-0.005
ita-0.005
Token happy
Token ID3772
Feature activation+1.10e-02

Pos logit contributions
anges+0.009
 YES+0.008
five+0.008
Sure+0.008
Of+0.008
Neg logit contributions
rys-0.011
 weeping-0.011
iny-0.011
ita-0.010
enna-0.010
Token and
Token ID290
Feature activation-7.58e-03

Pos logit contributions
 were+0.007
five+0.007
 how+0.007
 YES+0.007
 thank+0.007
Neg logit contributions
rys-0.027
iny-0.025
alk-0.025
 diary-0.025
enna-0.024
Token brought
Token ID3181
Feature activation+4.84e-03

Pos logit contributions
rys+0.013
ci+0.012
 diary+0.012
ita+0.012
enna+0.012
Neg logit contributions
five-0.009
anges-0.009
 YES-0.009
Sure-0.008
 Faster-0.008
Token the
Token ID262
Feature activation-1.13e-02

Pos logit contributions
 YES+0.004
five+0.004
anges+0.004
 were+0.004
 yes+0.004
Neg logit contributions
rys-0.011
alk-0.010
iny-0.010
 diary-0.010
enna-0.010
Token av
Token ID1196
Feature activation-6.92e-02

Pos logit contributions
rys+0.017
iny+0.016
alk+0.016
 diary+0.015
ita+0.015
Neg logit contributions
 were-0.018
five-0.017
 thank-0.017
 how-0.017
 lots-0.017
Tokenoc
Token ID420
Feature activation+3.43e-02

Pos logit contributions
rys+0.133
alk+0.122
 diary+0.121
iny+0.119
enna+0.116
Neg logit contributions
five-0.083
 were-0.082
 lots-0.081
 how-0.080
 thank-0.079
Tokenados
Token ID22484
Feature activation-2.76e-02

Pos logit contributions
 were+0.060
five+0.060
 YES+0.059
 how+0.059
 thank+0.058
Neg logit contributions
rys-0.046
iny-0.042
alk-0.042
 diary-0.040
enna-0.039
Token home
Token ID1363
Feature activation+1.68e-02

Pos logit contributions
rys+0.062
iny+0.059
alk+0.058
 diary+0.057
enna+0.056
Neg logit contributions
 were-0.023
five-0.023
 YES-0.023
 how-0.022
 thank-0.022
Token to
Token ID284
Feature activation+1.28e-03

Pos logit contributions
five+0.024
 YES+0.023
anges+0.023
Sure+0.023
Of+0.023
Neg logit contributions
rys-0.027
alk-0.026
 diary-0.025
iny-0.025
ci-0.025
Token share
Token ID2648
Feature activation+9.13e-03

Pos logit contributions
 thank+0.001
 were+0.001
 how+0.001
 lots+0.001
 no+0.001
Neg logit contributions
rys-0.003
iny-0.003
zo-0.003
ita-0.003
alk-0.003
Token.
Token ID13
Feature activation-7.57e-03

Pos logit contributions
rys+0.001
alk+0.001
iny+0.001
 diary+0.001
ci+0.001
Neg logit contributions
anges-0.001
 YES-0.001
five-0.001
Of-0.001
 thank-0.001
Token They
Token ID1119
Feature activation+6.93e-03

Pos logit contributions
rys+0.011
alk+0.011
 battlefield+0.009
ishment+0.009
iny+0.009
Neg logit contributions
 YES-0.012
 yes-0.012
anges-0.011
five-0.011
 how-0.011
Token forget
Token ID6044
Feature activation+4.29e-03

Pos logit contributions
five+0.007
 YES+0.007
 were+0.007
 how+0.007
anges+0.007
Neg logit contributions
rys-0.014
alk-0.013
iny-0.013
 diary-0.013
zo-0.013
Token about
Token ID546
Feature activation+1.11e-02

Pos logit contributions
anges+0.004
five+0.004
Eventually+0.004
Of+0.004
Sure+0.004
Neg logit contributions
 diary-0.008
 weeping-0.008
alk-0.008
 battlefield-0.008
rys-0.008
Token the
Token ID262
Feature activation+3.61e-03

Pos logit contributions
five+0.009
 were+0.009
 YES+0.008
anges+0.008
 thank+0.008
Neg logit contributions
rys-0.026
alk-0.024
iny-0.024
 diary-0.024
enna-0.023
Token av
Token ID1196
Feature activation-6.90e-02

Pos logit contributions
 thank+0.005
 were+0.005
 good+0.005
 no+0.005
 ice+0.005
Neg logit contributions
enna-0.006
rys-0.006
iny-0.006
ita-0.006
ci-0.006
Tokenoc
Token ID420
Feature activation+3.67e-02

Pos logit contributions
rys+0.150
 diary+0.138
alk+0.138
enna+0.135
 weeping+0.133
Neg logit contributions
five-0.068
 were-0.067
 lots-0.062
 thank-0.062
anges-0.061
Tokenados
Token ID22484
Feature activation-2.38e-02

Pos logit contributions
 were+0.064
five+0.063
 YES+0.063
 how+0.062
 thank+0.062
Neg logit contributions
rys-0.050
iny-0.045
alk-0.045
 diary-0.044
enna-0.042
Token.
Token ID13
Feature activation-2.68e-03

Pos logit contributions
rys+0.051
iny+0.047
alk+0.046
enna+0.046
umper+0.046
Neg logit contributions
 were-0.027
 how-0.025
 no-0.025
 thank-0.024
 yes-0.024
Token They
Token ID1119
Feature activation+9.23e-03

Pos logit contributions
rys+0.004
alk+0.004
iny+0.003
 battlefield+0.003
 galaxy+0.003
Neg logit contributions
 YES-0.004
 yes-0.004
five-0.004
 how-0.004
anges-0.004
Token just
Token ID655
Feature activation+1.33e-02

Pos logit contributions
five+0.008
 YES+0.008
 were+0.008
anges+0.008
 how+0.008
Neg logit contributions
rys-0.020
alk-0.019
iny-0.019
 diary-0.018
zo-0.018
Token Everyday
Token ID48154
Feature activation-1.50e-02

Pos logit contributions
rys+0.020
iny+0.019
alk+0.019
enna+0.019
zo+0.019
Neg logit contributions
five-0.008
anges-0.008
 YES-0.007
oops-0.007
Sure-0.007
Token he
Token ID339
Feature activation-1.13e-02

Pos logit contributions
rys+0.036
alk+0.034
iny+0.034
umper+0.033
 diary+0.033
Neg logit contributions
 were-0.012
 how-0.011
 no-0.011
 lots-0.011
 thank-0.011
Token would
Token ID561
Feature activation+5.92e-03

Pos logit contributions
rys+0.025
iny+0.024
alk+0.024
 diary+0.023
zo+0.023
Neg logit contributions
five-0.010
 were-0.009
 YES-0.009
anges-0.009
 how-0.009
Token take
Token ID1011
Feature activation-3.44e-03

Pos logit contributions
 were+0.006
 thank+0.006
 how+0.006
 no+0.006
five+0.006
Neg logit contributions
rys-0.013
iny-0.012
alk-0.012
 diary-0.011
enna-0.011
Token the
Token ID262
Feature activation+4.26e-03

Pos logit contributions
rys+0.008
iny+0.008
alk+0.008
 diary+0.007
enna+0.007
Neg logit contributions
 were-0.003
five-0.003
 how-0.002
 YES-0.002
 lots-0.002
Token av
Token ID1196
Feature activation-6.82e-02

Pos logit contributions
 yes+0.006
 thank+0.006
 lots+0.006
 were+0.006
 YES+0.006
Neg logit contributions
rys-0.007
alk-0.007
iny-0.007
enna-0.006
umper-0.006
Tokenoc
Token ID420
Feature activation+3.40e-02

Pos logit contributions
rys+0.132
 diary+0.121
alk+0.120
 weeping+0.120
iny+0.117
Neg logit contributions
five-0.082
 lots-0.078
 how-0.077
 were-0.077
anges-0.075
Tokenados
Token ID22484
Feature activation-1.98e-02

Pos logit contributions
 were+0.059
five+0.058
 YES+0.058
 how+0.057
 thank+0.057
Neg logit contributions
rys-0.046
iny-0.042
alk-0.042
 diary-0.041
enna-0.040
Token and
Token ID290
Feature activation+2.87e-03

Pos logit contributions
rys+0.051
iny+0.047
alk+0.047
 diary+0.046
 avail+0.046
Neg logit contributions
 were-0.013
 how-0.013
 lots-0.012
 so-0.011
 no-0.011
Token scatter
Token ID41058
Feature activation-1.28e-03

Pos logit contributions
 were+0.003
five+0.003
 YES+0.003
 how+0.003
 thank+0.003
Neg logit contributions
rys-0.006
iny-0.005
alk-0.005
 diary-0.005
enna-0.005
Token them
Token ID606
Feature activation-3.49e-03

Pos logit contributions
rys+0.003
alk+0.003
iny+0.003
 diary+0.003
umper+0.003
Neg logit contributions
five-0.001
 YES-0.001
 were-0.001
anges-0.001
 thank-0.001
Token tree
Token ID5509
Feature activation-1.54e-02

Pos logit contributions
rys+0.012
iny+0.011
alk+0.011
 diary+0.011
enna+0.011
Neg logit contributions
 were-0.009
five-0.009
 thank-0.009
 lots-0.009
 YES-0.009
Token was
Token ID373
Feature activation+3.50e-04

Pos logit contributions
rys+0.037
iny+0.036
alk+0.035
 diary+0.035
zo+0.034
Neg logit contributions
five-0.010
 YES-0.010
anges-0.010
 thank-0.009
 were-0.009
Token sad
Token ID6507
Feature activation-5.02e-03

Pos logit contributions
five+0.000
Sure+0.000
anges+0.000
Of+0.000
 YES+0.000
Neg logit contributions
rys-0.001
iny-0.001
alk-0.001
 diary-0.001
ita-0.001
Token that
Token ID326
Feature activation-3.84e-04

Pos logit contributions
rys+0.012
iny+0.011
alk+0.011
 diary+0.011
enna+0.010
Neg logit contributions
 were-0.004
five-0.004
 YES-0.004
 how-0.004
 thank-0.004
Token the
Token ID262
Feature activation-5.46e-03

Pos logit contributions
rys+0.001
iny+0.001
alk+0.001
 diary+0.001
enna+0.001
Neg logit contributions
 were-0.000
five-0.000
 YES-0.000
 how-0.000
 thank-0.000
Token av
Token ID1196
Feature activation-6.81e-02

Pos logit contributions
rys+0.009
iny+0.009
alk+0.009
enna+0.008
ita+0.008
Neg logit contributions
 were-0.008
 thank-0.008
 lots-0.008
 no-0.008
five-0.008
Tokenoc
Token ID420
Feature activation+3.55e-02

Pos logit contributions
rys+0.140
 diary+0.128
alk+0.128
enna+0.124
iny+0.124
Neg logit contributions
 were-0.075
five-0.074
 lots-0.071
 how-0.069
 thank-0.067
Tokenados
Token ID22484
Feature activation-2.49e-02

Pos logit contributions
 were+0.066
 YES+0.065
five+0.064
 lots+0.064
 yes+0.064
Neg logit contributions
rys-0.044
alk-0.040
iny-0.040
 diary-0.038
umper-0.037
Token were
Token ID547
Feature activation+1.78e-02

Pos logit contributions
rys+0.069
iny+0.066
alk+0.065
 diary+0.064
enna+0.064
Neg logit contributions
 were-0.009
five-0.008
 how-0.008
 thank-0.008
 YES-0.007
Token being
Token ID852
Feature activation+1.79e-03

Pos logit contributions
 were+0.016
 how+0.016
 no+0.016
 lots+0.016
 thank+0.015
Neg logit contributions
rys-0.040
iny-0.038
alk-0.038
 diary-0.037
enna-0.036
Token taken
Token ID2077
Feature activation+8.47e-04

Pos logit contributions
 were+0.002
 how+0.002
five+0.002
 lots+0.002
 no+0.002
Neg logit contributions
rys-0.003
iny-0.003
alk-0.003
 diary-0.003
enna-0.003
Token
Token ID220
Feature activation-1.90e-02

Pos logit contributions
rys+0.003
alk+0.003
iny+0.003
 diary+0.002
zo+0.002
Neg logit contributions
 were-0.001
five-0.001
 how-0.001
 YES-0.001
 lots-0.001
Token
Token ID198
Feature activation+4.82e-03

Pos logit contributions
rys+0.042
iny+0.040
alk+0.039
 diary+0.039
enna+0.038
Neg logit contributions
five-0.017
 were-0.016
 YES-0.016
 thank-0.016
 how-0.016
Token
Token ID198
Feature activation-3.49e-03

Pos logit contributions
 were+0.004
five+0.004
 YES+0.004
 how+0.004
 thank+0.004
Neg logit contributions
rys-0.011
iny-0.010
alk-0.010
 diary-0.010
enna-0.010
TokenWith
Token ID3152
Feature activation-1.52e-02

Pos logit contributions
rys+0.007
alk+0.007
iny+0.006
 diary+0.006
zo+0.006
Neg logit contributions
 were-0.004
five-0.004
 how-0.004
 lots-0.004
 YES-0.004
Token a
Token ID257
Feature activation-1.38e-02

Pos logit contributions
rys+0.030
 diary+0.029
iny+0.027
enna+0.027
alk+0.027
Neg logit contributions
five-0.017
anges-0.016
 YES-0.016
Of-0.015
 were-0.015
Token mis
Token ID2984
Feature activation-6.80e-02

Pos logit contributions
rys+0.026
 diary+0.024
iny+0.024
alk+0.023
ita+0.023
Neg logit contributions
five-0.017
 how-0.017
 were-0.017
anges-0.017
 YES-0.016
Tokenchie
Token ID3043
Feature activation-9.25e-03

Pos logit contributions
rys+0.089
iny+0.081
alk+0.079
enna+0.076
ita+0.075
Neg logit contributions
 were-0.125
 YES-0.123
 thank-0.123
 how-0.123
 no-0.121
Tokenvous
Token ID31222
Feature activation+9.49e-03

Pos logit contributions
rys+0.013
iny+0.013
zo+0.012
ita+0.011
alk+0.011
Neg logit contributions
anges-0.016
 YES-0.016
 thank-0.015
 Mike-0.015
 were-0.015
Token gl
Token ID1278
Feature activation-1.18e-02

Pos logit contributions
 were+0.012
 how+0.012
five+0.012
anges+0.012
 YES+0.012
Neg logit contributions
rys-0.017
iny-0.016
alk-0.016
 diary-0.016
ita-0.015
Tokenint
Token ID600
Feature activation+5.45e-03

Pos logit contributions
rys+0.021
iny+0.020
alk+0.020
 diary+0.019
enna+0.019
Neg logit contributions
 were-0.016
five-0.016
 YES-0.015
 how-0.015
 lots-0.015
Token in
Token ID287
Feature activation-9.82e-03

Pos logit contributions
 YES+0.005
five+0.005
 were+0.005
anges+0.005
Of+0.004
Neg logit contributions
rys-0.012
iny-0.011
alk-0.011
 diary-0.011
ci-0.011
Token looked
Token ID3114
Feature activation-1.20e-02

Pos logit contributions
five+0.017
 were+0.017
 YES+0.017
 how+0.016
 thank+0.016
Neg logit contributions
rys-0.042
iny-0.040
alk-0.040
 diary-0.039
enna-0.038
Token at
Token ID379
Feature activation-2.10e-03

Pos logit contributions
rys+0.024
 diary+0.022
iny+0.022
alk+0.022
enna+0.021
Neg logit contributions
five-0.013
 were-0.013
 YES-0.013
anges-0.013
 thank-0.012
Token him
Token ID683
Feature activation-3.15e-03

Pos logit contributions
rys+0.005
iny+0.005
alk+0.005
 diary+0.004
enna+0.004
Neg logit contributions
five-0.002
 YES-0.002
anges-0.002
 were-0.002
 thank-0.002
Token with
Token ID351
Feature activation-6.20e-03

Pos logit contributions
rys+0.008
alk+0.007
iny+0.007
 diary+0.007
enna+0.007
Neg logit contributions
 were-0.002
 how-0.002
 lots-0.002
five-0.002
 no-0.002
Token a
Token ID257
Feature activation-1.43e-03

Pos logit contributions
rys+0.013
 diary+0.012
alk+0.012
iny+0.012
enna+0.012
Neg logit contributions
five-0.006
anges-0.006
 YES-0.006
 were-0.006
Sure-0.006
Token mis
Token ID2984
Feature activation-6.73e-02

Pos logit contributions
rys+0.003
iny+0.002
alk+0.002
 diary+0.002
enna+0.002
Neg logit contributions
 were-0.002
five-0.002
 YES-0.002
 thank-0.002
 how-0.002
Tokenchie
Token ID3043
Feature activation-6.64e-03

Pos logit contributions
rys+0.097
iny+0.088
alk+0.088
 diary+0.086
enna+0.083
Neg logit contributions
five-0.111
 were-0.111
 YES-0.109
 how-0.109
anges-0.108
Tokenvous
Token ID31222
Feature activation+1.48e-02

Pos logit contributions
rys+0.010
iny+0.009
alk+0.009
ita+0.008
enna+0.008
Neg logit contributions
anges-0.011
 YES-0.011
 were-0.011
 Mike-0.011
 yes-0.011
Token grin
Token ID16770
Feature activation+1.91e-02

Pos logit contributions
 were+0.019
five+0.019
 YES+0.018
 how+0.018
 thank+0.018
Neg logit contributions
rys-0.027
iny-0.025
alk-0.025
 diary-0.024
enna-0.024
Token.
Token ID13
Feature activation+1.80e-03

Pos logit contributions
 were+0.019
 how+0.019
 thank+0.018
 no+0.018
 yes+0.018
Neg logit contributions
rys-0.042
alk-0.040
iny-0.040
 battlefield-0.039
 galaxy-0.039
Token She
Token ID1375
Feature activation+5.09e-03

Pos logit contributions
five+0.002
 were+0.002
 YES+0.002
anges+0.002
 thank+0.002
Neg logit contributions
rys-0.003
iny-0.003
alk-0.003
 diary-0.003
enna-0.003
Token a
Token ID257
Feature activation-9.81e-04

Pos logit contributions
rys+0.016
 diary+0.016
iny+0.015
umper+0.015
 weeping+0.015
Neg logit contributions
five-0.007
 YES-0.006
Of-0.006
rav-0.006
anges-0.006
Token small
Token ID1402
Feature activation-8.92e-03

Pos logit contributions
rys+0.002
alk+0.002
iny+0.002
enna+0.002
umper+0.002
Neg logit contributions
 were-0.001
 thank-0.001
 YES-0.001
five-0.001
 lots-0.001
Token boy
Token ID2933
Feature activation+8.20e-03

Pos logit contributions
rys+0.016
alk+0.015
enna+0.015
iny+0.015
umper+0.015
Neg logit contributions
 were-0.012
 thank-0.012
 no-0.012
five-0.011
 how-0.011
Token with
Token ID351
Feature activation-9.32e-03

Pos logit contributions
 were+0.009
 how+0.009
 was+0.008
 of+0.008
 no+0.008
Neg logit contributions
rys-0.018
alk-0.018
enna-0.017
iny-0.017
 battlefield-0.016
Token a
Token ID257
Feature activation-4.75e-03

Pos logit contributions
rys+0.023
iny+0.021
alk+0.021
 diary+0.021
enna+0.021
Neg logit contributions
 were-0.006
five-0.006
 YES-0.006
 how-0.006
 thank-0.006
Token mis
Token ID2984
Feature activation-6.69e-02

Pos logit contributions
rys+0.009
alk+0.009
ations+0.008
iny+0.008
umper+0.008
Neg logit contributions
 no-0.006
 were-0.006
 thank-0.006
 lots-0.006
 yes-0.006
Tokenchie
Token ID3043
Feature activation-8.70e-03

Pos logit contributions
rys+0.100
alk+0.092
 diary+0.091
iny+0.091
 battlefield+0.087
Neg logit contributions
five-0.107
 were-0.105
anges-0.104
 YES-0.103
 how-0.102
Tokenvous
Token ID31222
Feature activation+1.29e-02

Pos logit contributions
rys+0.013
iny+0.012
alk+0.012
zo+0.011
enna+0.011
Neg logit contributions
anges-0.014
 YES-0.014
 were-0.014
 thank-0.014
 how-0.014
Token tw
Token ID665
Feature activation-8.56e-03

Pos logit contributions
 were+0.017
five+0.017
 thank+0.017
 YES+0.017
 how+0.016
Neg logit contributions
rys-0.023
iny-0.021
alk-0.021
 diary-0.021
enna-0.020
Tokeninkle
Token ID19894
Feature activation-2.04e-02

Pos logit contributions
 battlefield+0.014
enna+0.014
 galaxy+0.014
 universe+0.013
 diary+0.013
Neg logit contributions
five-0.011
oops-0.011
 no-0.011
 thank-0.011
 of-0.011
Token in
Token ID287
Feature activation-9.04e-03

Pos logit contributions
rys+0.048
alk+0.048
enna+0.046
 battlefield+0.046
iny+0.046
Neg logit contributions
 were-0.016
 how-0.016
 no-0.015
 of-0.015
 yes-0.015
Token It
Token ID632
Feature activation+1.06e-02

Pos logit contributions
rys+0.011
iny+0.010
alk+0.010
zo+0.010
 diary+0.010
Neg logit contributions
 were-0.007
 lots-0.007
 how-0.007
 YES-0.007
five-0.007
Token was
Token ID373
Feature activation-6.80e-03

Pos logit contributions
five+0.004
 YES+0.003
 were+0.003
 thank+0.003
anges+0.003
Neg logit contributions
rys-0.029
iny-0.028
alk-0.028
 diary-0.027
zo-0.027
Token a
Token ID257
Feature activation-2.88e-03

Pos logit contributions
rys+0.015
iny+0.014
alk+0.014
 diary+0.014
enna+0.013
Neg logit contributions
five-0.006
 were-0.006
 YES-0.006
 thank-0.006
anges-0.006
Token big
Token ID1263
Feature activation-8.30e-03

Pos logit contributions
rys+0.006
alk+0.005
iny+0.005
umper+0.005
enna+0.005
Neg logit contributions
 thank-0.003
 were-0.003
 no-0.003
 lots-0.003
 YES-0.003
Token j
Token ID474
Feature activation+1.23e-04

Pos logit contributions
rys+0.017
iny+0.016
alk+0.016
enna+0.015
umper+0.015
Neg logit contributions
 were-0.009
 thank-0.009
five-0.009
 how-0.008
 YES-0.008
Tokenigsaw
Token ID45636
Feature activation-6.67e-02

Pos logit contributions
 of+0.000
five+0.000
 thank+0.000
 Mike+0.000
Sure+0.000
Neg logit contributions
rys-0.000
iny-0.000
Harry-0.000
ations-0.000
cker-0.000
Token puzzle
Token ID15027
Feature activation-1.78e-02

Pos logit contributions
rys+0.111
alk+0.101
umper+0.100
iny+0.099
enna+0.097
Neg logit contributions
 were-0.100
five-0.098
 how-0.098
 lots-0.097
 thank-0.096
Token and
Token ID290
Feature activation-1.00e-02

Pos logit contributions
rys+0.041
alk+0.039
iny+0.038
umper+0.037
 diary+0.037
Neg logit contributions
 were-0.015
 how-0.014
 lots-0.014
five-0.014
 no-0.014
Token she
Token ID673
Feature activation-1.45e-02

Pos logit contributions
rys+0.019
 diary+0.019
iny+0.018
ita+0.018
ci+0.018
Neg logit contributions
five-0.012
 YES-0.010
 thank-0.010
Sure-0.010
anges-0.010
Token wanted
Token ID2227
Feature activation-1.09e-02

Pos logit contributions
iny+0.031
ci+0.031
rys+0.031
zo+0.031
 diary+0.030
Neg logit contributions
five-0.013
anges-0.012
 YES-0.012
 thank-0.012
Of-0.011
Token to
Token ID284
Feature activation+2.10e-03

Pos logit contributions
 diary+0.022
rys+0.022
iny+0.021
alk+0.021
enna+0.021
Neg logit contributions
five-0.010
 YES-0.010
 thank-0.010
Of-0.010
Sure-0.010
Token.
Token ID13
Feature activation+6.11e-04

Pos logit contributions
 were+0.012
 how+0.012
 no+0.011
 lots+0.011
five+0.011
Neg logit contributions
rys-0.021
alk-0.019
iny-0.018
 diary-0.018
 battlefield-0.018
Token She
Token ID1375
Feature activation+5.59e-03

Pos logit contributions
 Mike+0.001
 "+0.001
 were+0.001
 how+0.001
 Faster+0.001
Neg logit contributions
rys-0.001
cker-0.001
ations-0.001
zo-0.001
ci-0.001
Token loved
Token ID6151
Feature activation-1.86e-03

Pos logit contributions
five+0.004
anges+0.004
 YES+0.003
 yes+0.003
 thank+0.003
Neg logit contributions
rys-0.014
iny-0.013
 diary-0.013
alk-0.013
ita-0.013
Token to
Token ID284
Feature activation-8.72e-03

Pos logit contributions
rys+0.005
alk+0.004
iny+0.004
 diary+0.004
enna+0.004
Neg logit contributions
 how-0.001
 were-0.001
 lots-0.001
 thank-0.001
five-0.001
Token eat
Token ID4483
Feature activation+5.91e-04

Pos logit contributions
rys+0.019
ishment+0.019
enna+0.018
older+0.018
ations+0.018
Neg logit contributions
 thank-0.010
 were-0.009
 how-0.009
 lots-0.008
 yes-0.008
Token av
Token ID1196
Feature activation-6.67e-02

Pos logit contributions
five+0.001
 YES+0.001
Of+0.001
Sure+0.001
anges+0.001
Neg logit contributions
rys-0.001
iny-0.001
alk-0.001
 diary-0.001
umper-0.001
Tokenoc
Token ID420
Feature activation+3.50e-02

Pos logit contributions
rys+0.116
alk+0.107
iny+0.105
 diary+0.105
enna+0.102
Neg logit contributions
five-0.091
 were-0.091
 lots-0.089
 how-0.089
anges-0.087
Tokenados
Token ID22484
Feature activation-2.05e-02

Pos logit contributions
 were+0.072
 YES+0.070
 yes+0.070
Of+0.069
Sure+0.069
Neg logit contributions
rys-0.036
alk-0.033
iny-0.033
umper-0.033
 diary-0.031
Token.
Token ID13
Feature activation-2.34e-03

Pos logit contributions
rys+0.049
alk+0.045
iny+0.045
 diary+0.045
 battlefield+0.044
Neg logit contributions
 were-0.017
 how-0.016
 lots-0.016
 so-0.015
 no-0.014
Token One
Token ID1881
Feature activation-8.98e-05

Pos logit contributions
rys+0.005
zo+0.004
ations+0.004
iny+0.004
cker+0.004
Neg logit contributions
 were-0.003
 how-0.003
Sure-0.003
 Mike-0.003
 lots-0.003
Token day
Token ID1110
Feature activation-1.74e-02

Pos logit contributions
rys+0.000
alk+0.000
iny+0.000
 diary+0.000
 battlefield+0.000
Neg logit contributions
 were-0.000
 how-0.000
 lots-0.000
five-0.000
 no-0.000
Token "
Token ID366
Feature activation-8.93e-03

Pos logit contributions
five+0.004
 were+0.004
anges+0.004
 lots+0.004
oops+0.004
Neg logit contributions
 weeping-0.010
rys-0.010
iny-0.009
 diary-0.009
enna-0.009
TokenI
Token ID40
Feature activation+7.10e-03

Pos logit contributions
rys+0.020
iny+0.019
alk+0.019
 diary+0.018
enna+0.018
Neg logit contributions
 were-0.008
five-0.008
 YES-0.008
 how-0.008
 thank-0.008
Token will
Token ID481
Feature activation+4.30e-03

Pos logit contributions
 YES+0.008
anges+0.008
 how+0.008
 lots+0.008
five+0.008
Neg logit contributions
rys-0.013
iny-0.013
alk-0.012
enna-0.012
ci-0.012
Token buy
Token ID2822
Feature activation-9.18e-03

Pos logit contributions
 YES+0.005
five+0.005
anges+0.005
 lots+0.004
 were+0.004
Neg logit contributions
rys-0.008
alk-0.008
iny-0.008
 diary-0.008
 violin-0.008
Token more
Token ID517
Feature activation+2.26e-04

Pos logit contributions
 diary+0.015
rys+0.015
 avail+0.014
ishment+0.014
umper+0.014
Neg logit contributions
 YES-0.012
 were-0.011
Eventually-0.011
five-0.011
Of-0.011
Token av
Token ID1196
Feature activation-6.67e-02

Pos logit contributions
 YES+0.000
 were+0.000
Sure+0.000
five+0.000
 no+0.000
Neg logit contributions
rys-0.000
 diary-0.000
alk-0.000
iny-0.000
 violin-0.000
Tokenoc
Token ID420
Feature activation+3.65e-02

Pos logit contributions
rys+0.121
alk+0.112
iny+0.112
 diary+0.110
enna+0.107
Neg logit contributions
five-0.086
 were-0.085
 how-0.083
 YES-0.083
 thank-0.082
Tokenados
Token ID22484
Feature activation-2.00e-02

Pos logit contributions
 were+0.076
 YES+0.073
 yes+0.073
 lots+0.072
Sure+0.072
Neg logit contributions
rys-0.038
alk-0.035
iny-0.034
umper-0.034
zo-0.032
Token tomorrow
Token ID9439
Feature activation+2.37e-03

Pos logit contributions
rys+0.040
iny+0.039
alk+0.039
 diary+0.038
enna+0.037
Neg logit contributions
 YES-0.020
five-0.019
Sure-0.019
 were-0.019
Of-0.019
Token.
Token ID13
Feature activation-3.78e-03

Pos logit contributions
five+0.003
 YES+0.003
 were+0.003
anges+0.003
 how+0.003
Neg logit contributions
rys-0.004
iny-0.004
alk-0.004
 diary-0.004
ita-0.004
Token You
Token ID921
Feature activation+6.64e-03

Pos logit contributions
rys+0.007
iny+0.007
 diary+0.007
alk+0.007
enna+0.007
Neg logit contributions
 were-0.004
 YES-0.004
five-0.004
 lots-0.004
 how-0.004
Token tree
Token ID5509
Feature activation-1.33e-02

Pos logit contributions
rys+0.050
alk+0.047
iny+0.047
 diary+0.046
zo+0.045
Neg logit contributions
five-0.025
 YES-0.024
 were-0.024
anges-0.024
 how-0.024
Token could
Token ID714
Feature activation+4.78e-03

Pos logit contributions
rys+0.029
alk+0.028
iny+0.028
 diary+0.027
enna+0.027
Neg logit contributions
five-0.012
 were-0.011
 YES-0.011
anges-0.011
 how-0.011
Token see
Token ID766
Feature activation+7.77e-03

Pos logit contributions
 were+0.006
five+0.005
 how+0.005
 YES+0.005
 thank+0.005
Neg logit contributions
rys-0.009
iny-0.009
alk-0.009
 diary-0.009
enna-0.008
Token lots
Token ID6041
Feature activation+3.03e-02

Pos logit contributions
five+0.007
 YES+0.007
anges+0.006
Sure+0.006
Of+0.006
Neg logit contributions
rys-0.017
iny-0.017
alk-0.016
 diary-0.016
umper-0.016
Token of
Token ID286
Feature activation-5.45e-03

Pos logit contributions
 YES+0.014
five+0.012
anges+0.012
Sure+0.011
Of+0.011
Neg logit contributions
rys-0.081
iny-0.077
alk-0.077
 diary-0.075
zo-0.074
Token av
Token ID1196
Feature activation-6.61e-02

Pos logit contributions
rys+0.010
iny+0.009
alk+0.009
 diary+0.009
enna+0.009
Neg logit contributions
 were-0.007
 how-0.007
 lots-0.007
five-0.007
 thank-0.007
Tokenoc
Token ID420
Feature activation+4.57e-02

Pos logit contributions
rys+0.140
 diary+0.129
ci+0.128
alk+0.126
 violin+0.125
Neg logit contributions
 lots-0.067
 were-0.067
five-0.065
 how-0.062
 so-0.062
Tokenados
Token ID22484
Feature activation-2.02e-02

Pos logit contributions
 were+0.090
 YES+0.089
 yes+0.088
Sure+0.087
Of+0.087
Neg logit contributions
rys-0.052
alk-0.048
iny-0.046
umper-0.044
 diary-0.043
Token that
Token ID326
Feature activation+4.34e-03

Pos logit contributions
rys+0.047
iny+0.043
 diary+0.042
alk+0.041
umper+0.041
Neg logit contributions
 were-0.022
 how-0.021
 so-0.020
 lots-0.019
 of-0.019
Token the
Token ID262
Feature activation-3.95e-03

Pos logit contributions
five+0.003
 were+0.003
 YES+0.003
 how+0.003
anges+0.003
Neg logit contributions
rys-0.010
iny-0.010
alk-0.010
 diary-0.009
enna-0.009
Token supplier
Token ID22693
Feature activation+9.93e-03

Pos logit contributions
rys+0.007
iny+0.007
alk+0.006
enna+0.006
ci+0.006
Neg logit contributions
 were-0.006
 no-0.005
 thank-0.005
 lots-0.005
five-0.005
Token
Token ID198
Feature activation-4.71e-04

Pos logit contributions
rys+0.042
iny+0.040
alk+0.040
 diary+0.039
enna+0.038
Neg logit contributions
five-0.017
 YES-0.017
 were-0.017
 thank-0.016
 how-0.016
Token
Token ID198
Feature activation-8.86e-03

Pos logit contributions
rys+0.001
iny+0.001
alk+0.001
 diary+0.001
enna+0.001
Neg logit contributions
five-0.000
 YES-0.000
anges-0.000
 thank-0.000
 were-0.000
TokenSuddenly
Token ID38582
Feature activation-2.03e-03

Pos logit contributions
rys+0.019
iny+0.018
alk+0.018
 diary+0.018
ita+0.018
Neg logit contributions
five-0.008
 were-0.008
 thank-0.008
 YES-0.008
 how-0.008
Token,
Token ID11
Feature activation-1.47e-02

Pos logit contributions
rys+0.004
 diary+0.004
iny+0.004
alk+0.004
 weeping+0.004
Neg logit contributions
five-0.002
Of-0.002
oops-0.002
Sure-0.002
 YES-0.002
Token a
Token ID257
Feature activation-6.97e-03

Pos logit contributions
rys+0.028
 diary+0.027
 avail+0.025
 violin+0.025
alk+0.025
Neg logit contributions
five-0.018
oops-0.016
 YES-0.016
anges-0.015
Of-0.015
Token mis
Token ID2984
Feature activation-6.55e-02

Pos logit contributions
rys+0.014
alk+0.013
iny+0.013
enna+0.013
umper+0.013
Neg logit contributions
 thank-0.008
 were-0.008
 YES-0.007
 yes-0.007
 no-0.007
Tokenchie
Token ID3043
Feature activation-2.47e-03

Pos logit contributions
rys+0.094
iny+0.086
alk+0.085
 diary+0.083
enna+0.080
Neg logit contributions
 were-0.109
five-0.109
 YES-0.107
 how-0.107
 thank-0.106
Tokenvous
Token ID31222
Feature activation+1.38e-02

Pos logit contributions
rys+0.004
iny+0.004
alk+0.003
zo+0.003
 diary+0.003
Neg logit contributions
anges-0.004
 YES-0.004
 were-0.004
 how-0.004
 thank-0.004
Token squirrel
Token ID33039
Feature activation+3.82e-03

Pos logit contributions
 thank+0.018
 were+0.018
five+0.018
 YES+0.017
 how+0.017
Neg logit contributions
rys-0.025
alk-0.024
iny-0.023
ations-0.023
 diary-0.023
Token came
Token ID1625
Feature activation-1.39e-02

Pos logit contributions
 were+0.003
 was+0.003
 how+0.003
 of+0.003
Sure+0.003
Neg logit contributions
rys-0.009
alk-0.009
ishment-0.009
 avail-0.009
 former-0.009
Token and
Token ID290
Feature activation-8.22e-03

Pos logit contributions
rys+0.032
 diary+0.030
alk+0.030
 battlefield+0.030
 avail+0.029
Neg logit contributions
 how-0.012
 were-0.012
 lots-0.011
 so-0.011
 of-0.011
Token said
Token ID531
Feature activation-2.46e-03

Pos logit contributions
five+0.011
 YES+0.010
anges+0.010
Eventually+0.010
oops+0.010
Neg logit contributions
rys-0.012
ci-0.012
ita-0.011
enna-0.011
 diary-0.011
Token,
Token ID11
Feature activation+1.37e-02

Pos logit contributions
rys+0.004
 diary+0.004
 weeping+0.004
 bean+0.004
 violin+0.004
Neg logit contributions
five-0.003
Eventually-0.003
oops-0.003
haw-0.003
Me-0.003
Token "
Token ID366
Feature activation-5.79e-04

Pos logit contributions
five+0.011
 were+0.010
oops+0.010
anges+0.010
 Mike+0.010
Neg logit contributions
rys-0.029
 weeping-0.028
iny-0.028
 violin-0.027
 diary-0.027
TokenYou
Token ID1639
Feature activation+1.83e-02

Pos logit contributions
rys+0.001
iny+0.001
alk+0.001
 diary+0.001
enna+0.001
Neg logit contributions
 were-0.001
five-0.001
 YES-0.001
 how-0.001
 thank-0.001
Token're
Token ID821
Feature activation+1.25e-02

Pos logit contributions
five+0.015
 YES+0.014
anges+0.013
Sure+0.013
Me+0.013
Neg logit contributions
rys-0.040
 diary-0.040
enna-0.039
alk-0.039
zo-0.039
Token such
Token ID884
Feature activation+4.92e-02

Pos logit contributions
five+0.015
 YES+0.014
Sure+0.014
haw+0.013
 were+0.013
Neg logit contributions
rys-0.024
iny-0.023
 diary-0.023
 universe-0.022
umper-0.022
Token a
Token ID257
Feature activation+1.19e-02

Pos logit contributions
five+0.036
 were+0.034
 YES+0.034
anges+0.033
Sure+0.032
Neg logit contributions
rys-0.118
 diary-0.111
iny-0.109
alk-0.108
ita-0.106
Token good
Token ID922
Feature activation+2.53e-03

Pos logit contributions
 were+0.013
anges+0.013
 Mike+0.012
oops+0.012
five+0.012
Neg logit contributions
rys-0.024
 diary-0.023
 violin-0.022
 universe-0.022
 bean-0.021
Token girl
Token ID2576
Feature activation+1.21e-03

Pos logit contributions
 were+0.003
five+0.003
 thank+0.003
 how+0.003
 YES+0.003
Neg logit contributions
rys-0.005
alk-0.004
iny-0.004
 diary-0.004
zo-0.004
Token,
Token ID11
Feature activation+6.74e-03

Pos logit contributions
 YES+0.001
five+0.001
 were+0.001
anges+0.001
 lots+0.001
Neg logit contributions
rys-0.002
iny-0.002
 diary-0.002
ita-0.002
ci-0.002
Token Lily
Token ID20037
Feature activation-1.29e-03

Pos logit contributions
five+0.009
 YES+0.008
oops+0.008
 were+0.008
anges+0.008
Neg logit contributions
rys-0.013
 diary-0.012
iny-0.011
 violin-0.011
 universe-0.011
Token�
Token ID129
Feature activation-7.19e-03

Pos logit contributions
rys+0.001
iny+0.001
alk+0.001
 diary+0.001
enna+0.001
Neg logit contributions
five-0.001
 were-0.001
 how-0.001
 YES-0.001
 thank-0.001
Token�
Token ID241
Feature activation+6.75e-03

Pos logit contributions
rys+0.013
iny+0.012
alk+0.012
 diary+0.012
enna+0.011
Neg logit contributions
five-0.009
 were-0.009
 YES-0.009
 how-0.009
 thank-0.009
TokenThat
Token ID2504
Feature activation+2.33e-02

Pos logit contributions
 thank+0.008
 how+0.008
anges+0.008
 were+0.007
five+0.007
Neg logit contributions
rys-0.012
iny-0.012
alk-0.011
ita-0.011
zo-0.011
Tokenâ
Token ID22940
Feature activation-1.54e-02

Pos logit contributions
 were+0.021
five+0.021
 how+0.020
 YES+0.020
 lots+0.020
Neg logit contributions
rys-0.051
iny-0.049
alk-0.048
 diary-0.048
enna-0.047
Token€
Token ID26391
Feature activation-5.35e-03

Pos logit contributions
rys+0.026
 diary+0.025
alk+0.024
iny+0.024
enna+0.023
Neg logit contributions
five-0.022
 thank-0.021
 were-0.021
 YES-0.021
Sure-0.021
Tokenâ„¢
Token ID8151
Feature activation+4.16e-02

Pos logit contributions
rys+0.010
iny+0.009
alk+0.009
 diary+0.009
enna+0.009
Neg logit contributions
five-0.007
 were-0.006
 how-0.006
 YES-0.006
 thank-0.006
Tokens
Token ID82
Feature activation-1.86e-02

Pos logit contributions
five+0.050
Of+0.048
 how+0.048
 thank+0.047
anges+0.047
Neg logit contributions
rys-0.074
iny-0.073
ci-0.072
alk-0.071
enna-0.070
Token so
Token ID523
Feature activation+6.76e-03

Pos logit contributions
rys+0.040
iny+0.038
 universe+0.038
alk+0.038
 diary+0.038
Neg logit contributions
five-0.018
Sure-0.016
Of-0.016
 YES-0.015
anges-0.015
Token cool
Token ID3608
Feature activation-9.53e-04

Pos logit contributions
five+0.006
 YES+0.006
anges+0.006
 how+0.006
Of+0.006
Neg logit contributions
rys-0.015
alk-0.014
iny-0.014
 diary-0.013
zo-0.013
Token!
Token ID0
Feature activation+1.12e-02

Pos logit contributions
rys+0.002
iny+0.002
alk+0.002
 diary+0.002
ita+0.002
Neg logit contributions
five-0.001
 were-0.001
 YES-0.001
anges-0.001
 thank-0.001
Tokenâ
Token ID22940
Feature activation-8.53e-03

Pos logit contributions
 were+0.012
five+0.012
 YES+0.011
 how+0.011
 thank+0.011
Neg logit contributions
rys-0.023
iny-0.021
alk-0.021
 diary-0.021
enna-0.021
Token realized
Token ID6939
Feature activation+1.75e-02

Pos logit contributions
rys+0.028
iny+0.026
zo+0.026
alk+0.026
 diary+0.026
Neg logit contributions
five-0.016
 YES-0.016
Of-0.015
 how-0.015
anges-0.015
Token he
Token ID339
Feature activation-9.00e-03

Pos logit contributions
five+0.011
 YES+0.010
 were+0.010
anges+0.010
 thank+0.010
Neg logit contributions
rys-0.043
iny-0.041
alk-0.041
 diary-0.041
umper-0.040
Token couldn
Token ID3521
Feature activation+2.14e-02

Pos logit contributions
rys+0.019
iny+0.019
zo+0.019
 diary+0.018
alk+0.018
Neg logit contributions
five-0.008
Of-0.008
 YES-0.008
anges-0.008
 thank-0.008
Tokenâ
Token ID22940
Feature activation-8.58e-04

Pos logit contributions
 were+0.019
 no+0.017
five+0.017
 lots+0.017
 how+0.016
Neg logit contributions
rys-0.049
 diary-0.046
enna-0.046
ita-0.046
alk-0.046
Token€
Token ID26391
Feature activation+3.89e-03

Pos logit contributions
rys+0.002
iny+0.001
alk+0.001
 diary+0.001
enna+0.001
Neg logit contributions
five-0.001
 were-0.001
 thank-0.001
 YES-0.001
 how-0.001
Tokenâ„¢
Token ID8151
Feature activation+4.77e-02

Pos logit contributions
 were+0.005
five+0.005
 YES+0.005
 how+0.005
 lots+0.005
Neg logit contributions
rys-0.007
iny-0.007
alk-0.007
 diary-0.006
enna-0.006
Tokent
Token ID83
Feature activation-5.25e-03

Pos logit contributions
five+0.035
 thank+0.034
 YES+0.034
 how+0.034
anges+0.033
Neg logit contributions
rys-0.110
iny-0.107
alk-0.105
enna-0.103
umper-0.103
Token take
Token ID1011
Feature activation-3.46e-03

Pos logit contributions
rys+0.011
 diary+0.011
alk+0.011
iny+0.011
enna+0.011
Neg logit contributions
 were-0.005
 thank-0.005
five-0.005
 how-0.005
 no-0.005
Token it
Token ID340
Feature activation+2.81e-03

Pos logit contributions
rys+0.007
iny+0.007
 diary+0.007
 avail+0.007
umper+0.007
Neg logit contributions
 YES-0.003
Of-0.003
 yes-0.003
 were-0.003
five-0.003
Token home
Token ID1363
Feature activation+2.31e-02

Pos logit contributions
 lots+0.003
 no+0.003
 so+0.003
 how+0.003
 were+0.003
Neg logit contributions
rys-0.006
alk-0.006
ita-0.006
iny-0.005
enna-0.005
Token.
Token ID13
Feature activation+7.01e-03

Pos logit contributions
 were+0.029
 how+0.028
 lots+0.028
five+0.028
 no+0.028
Neg logit contributions
rys-0.043
alk-0.040
iny-0.040
 diary-0.039
enna-0.038
Token it
Token ID340
Feature activation+1.97e-03

Pos logit contributions
 YES+0.009
five+0.008
Of+0.008
 thank+0.008
 yes+0.008
Neg logit contributions
rys-0.020
iny-0.019
 diary-0.019
zo-0.018
 avail-0.018
Token.
Token ID13
Feature activation+3.91e-04

Pos logit contributions
 were+0.002
 how+0.002
 lots+0.002
 no+0.002
 yes+0.002
Neg logit contributions
rys-0.004
alk-0.004
iny-0.004
 diary-0.003
ita-0.003
Token Tim
Token ID5045
Feature activation+4.61e-03

Pos logit contributions
 were+0.000
 Mike+0.000
 lots+0.000
 YES+0.000
 how+0.000
Neg logit contributions
rys-0.001
alk-0.001
iny-0.001
 diary-0.001
ations-0.001
Tokenmy
Token ID1820
Feature activation-1.14e-02

Pos logit contributions
five+0.006
 were+0.006
 YES+0.006
 thank+0.006
 how+0.006
Neg logit contributions
rys-0.008
iny-0.007
alk-0.007
 diary-0.007
ita-0.007
Token felt
Token ID2936
Feature activation+1.65e-02

Pos logit contributions
iny+0.024
rys+0.023
ita+0.023
zo+0.023
 diary+0.022
Neg logit contributions
five-0.012
 YES-0.011
anges-0.011
 thank-0.011
Of-0.010
Token even
Token ID772
Feature activation+4.58e-02

Pos logit contributions
five+0.021
 YES+0.020
anges+0.020
Sure+0.020
Of+0.020
Neg logit contributions
rys-0.031
 diary-0.028
iny-0.028
alk-0.028
 weeping-0.028
Token more
Token ID517
Feature activation+1.52e-02

Pos logit contributions
five+0.045
 were+0.045
 YES+0.045
anges+0.044
 how+0.043
Neg logit contributions
rys-0.096
iny-0.090
alk-0.089
 diary-0.088
zo-0.086
Token sad
Token ID6507
Feature activation+3.09e-03

Pos logit contributions
 were+0.021
 no+0.020
 thank+0.020
 how+0.020
 so+0.020
Neg logit contributions
rys-0.027
alk-0.026
iny-0.025
enna-0.025
 diary-0.025
Token,
Token ID11
Feature activation+7.37e-03

Pos logit contributions
five+0.003
 YES+0.003
 were+0.003
anges+0.003
 thank+0.003
Neg logit contributions
rys-0.007
iny-0.006
alk-0.006
 diary-0.006
zo-0.006
Token but
Token ID475
Feature activation+9.84e-03

Pos logit contributions
five+0.006
 were+0.006
 YES+0.006
anges+0.005
 thank+0.005
Neg logit contributions
rys-0.017
iny-0.016
alk-0.016
 diary-0.016
zo-0.015
Token his
Token ID465
Feature activation-1.10e-05

Pos logit contributions
 were+0.007
 how+0.006
 no+0.006
 YES+0.006
 thank+0.006
Neg logit contributions
rys-0.024
alk-0.023
iny-0.023
 diary-0.023
enna-0.022
Token empty
Token ID6565
Feature activation+7.60e-03

Pos logit contributions
five+0.007
 were+0.007
 YES+0.007
anges+0.006
 thank+0.006
Neg logit contributions
rys-0.013
iny-0.012
alk-0.012
 diary-0.012
enna-0.012
Token,
Token ID11
Feature activation+7.59e-03

Pos logit contributions
five+0.006
 YES+0.006
Sure+0.006
anges+0.006
Of+0.006
Neg logit contributions
rys-0.017
alk-0.016
iny-0.016
 diary-0.016
enna-0.016
Token he
Token ID339
Feature activation-1.75e-03

Pos logit contributions
five+0.007
 YES+0.006
anges+0.006
Of+0.006
Sure+0.006
Neg logit contributions
rys-0.016
 diary-0.016
iny-0.015
alk-0.015
 avail-0.015
Token still
Token ID991
Feature activation+1.78e-02

Pos logit contributions
rys+0.004
iny+0.004
zo+0.004
 diary+0.004
ci+0.003
Neg logit contributions
five-0.002
anges-0.002
 YES-0.002
 thank-0.001
 Mike-0.001
Token felt
Token ID2936
Feature activation+2.11e-02

Pos logit contributions
five+0.017
anges+0.016
 YES+0.016
Sure+0.016
 were+0.015
Neg logit contributions
rys-0.038
 diary-0.036
alk-0.036
iny-0.036
ci-0.035
Token like
Token ID588
Feature activation+4.59e-02

Pos logit contributions
five+0.023
 YES+0.023
anges+0.023
rav+0.022
 Mike+0.022
Neg logit contributions
rys-0.041
 diary-0.039
 weeping-0.038
alk-0.037
iny-0.037
Token he
Token ID339
Feature activation-9.58e-03

Pos logit contributions
five+0.047
Sure+0.042
anges+0.042
oops+0.041
 YES+0.041
Neg logit contributions
 diary-0.092
rys-0.092
 weeping-0.088
alk-0.086
 universe-0.086
Token had
Token ID550
Feature activation+1.14e-02

Pos logit contributions
rys+0.021
iny+0.020
 diary+0.020
zo+0.020
alk+0.020
Neg logit contributions
five-0.008
anges-0.008
 YES-0.008
 thank-0.008
Sure-0.008
Token found
Token ID1043
Feature activation+1.56e-02

Pos logit contributions
five+0.012
 YES+0.012
Sure+0.012
Of+0.012
anges+0.012
Neg logit contributions
rys-0.022
alk-0.021
 diary-0.021
iny-0.020
enna-0.020
Token something
Token ID1223
Feature activation+8.80e-03

Pos logit contributions
Sure+0.018
 YES+0.018
five+0.018
Of+0.017
anges+0.017
Neg logit contributions
 diary-0.028
rys-0.028
iny-0.027
alk-0.027
umper-0.026
Token valuable
Token ID8119
Feature activation-1.98e-02

Pos logit contributions
five+0.010
 YES+0.010
anges+0.010
Eventually+0.009
Sure+0.009
Neg logit contributions
rys-0.016
 diary-0.016
 weeping-0.015
iny-0.015
alk-0.015
Token to
Token ID284
Feature activation+9.33e-03

Pos logit contributions
 YES+0.009
five+0.009
 were+0.009
Eventually+0.008
 Mike+0.008
Neg logit contributions
 diary-0.019
rys-0.018
enna-0.017
iny-0.017
alk-0.017
Token wash
Token ID13502
Feature activation+3.49e-03

Pos logit contributions
five+0.010
 YES+0.010
 were+0.010
anges+0.010
 how+0.010
Neg logit contributions
rys-0.018
iny-0.017
 diary-0.017
alk-0.017
enna-0.016
Token her
Token ID607
Feature activation-1.66e-02

Pos logit contributions
five+0.005
 YES+0.005
 thank+0.005
 were+0.005
 yes+0.004
Neg logit contributions
rys-0.006
iny-0.006
 diary-0.006
alk-0.006
ci-0.005
Token hair
Token ID4190
Feature activation+1.48e-02

Pos logit contributions
rys+0.031
iny+0.029
alk+0.029
 diary+0.028
ita+0.027
Neg logit contributions
five-0.020
 were-0.020
 YES-0.020
 thank-0.020
 how-0.020
Token with
Token ID351
Feature activation-1.20e-02

Pos logit contributions
five+0.017
 YES+0.016
 thank+0.016
 were+0.016
 Mike+0.016
Neg logit contributions
rys-0.029
iny-0.027
ci-0.027
alk-0.027
 diary-0.027
Token shampoo
Token ID47259
Feature activation+1.84e-03

Pos logit contributions
rys+0.019
 diary+0.018
alk+0.018
iny+0.018
umper+0.017
Neg logit contributions
five-0.018
 YES-0.017
anges-0.017
Of-0.017
Sure-0.017
Token.
Token ID13
Feature activation-2.88e-03

Pos logit contributions
five+0.003
 YES+0.003
 thank+0.002
Of+0.002
Sure+0.002
Neg logit contributions
rys-0.003
alk-0.003
iny-0.003
ci-0.003
 diary-0.003
Token Lily
Token ID20037
Feature activation-4.33e-03

Pos logit contributions
rys+0.006
alk+0.005
 diary+0.005
iny+0.005
enna+0.005
Neg logit contributions
five-0.003
 YES-0.003
anges-0.003
 thank-0.003
 no-0.003
Token didn
Token ID1422
Feature activation+3.20e-03

Pos logit contributions
ci+0.009
iny+0.008
ita+0.008
rys+0.008
 diary+0.008
Neg logit contributions
five-0.005
 YES-0.004
anges-0.004
 thank-0.004
 20-0.004
Token't
Token ID470
Feature activation-1.95e-02

Pos logit contributions
five+0.004
 YES+0.004
Of+0.004
 were+0.004
 thank+0.004
Neg logit contributions
rys-0.006
iny-0.005
alk-0.005
 diary-0.005
umper-0.005
Token like
Token ID588
Feature activation+3.52e-02

Pos logit contributions
rys+0.043
iny+0.041
alk+0.040
 diary+0.040
enna+0.039
Neg logit contributions
 were-0.018
five-0.017
 YES-0.017
 how-0.017
 thank-0.017
Token walked
Token ID6807
Feature activation-5.55e-03

Pos logit contributions
rys+0.049
iny+0.046
zo+0.045
alk+0.045
ita+0.045
Neg logit contributions
five-0.031
anges-0.031
 YES-0.030
 thank-0.029
 no-0.029
Token through
Token ID832
Feature activation+2.57e-03

Pos logit contributions
rys+0.011
alk+0.011
iny+0.011
 diary+0.011
zo+0.010
Neg logit contributions
 were-0.006
 YES-0.006
five-0.006
anges-0.006
 how-0.005
Token the
Token ID262
Feature activation-3.70e-03

Pos logit contributions
five+0.002
 were+0.002
 YES+0.002
anges+0.002
 how+0.002
Neg logit contributions
rys-0.006
alk-0.006
iny-0.006
 diary-0.006
zo-0.006
Token forest
Token ID8222
Feature activation-1.57e-02

Pos logit contributions
enna+0.006
umper+0.006
rys+0.006
iny+0.006
alk+0.006
Neg logit contributions
 ice-0.006
 lots-0.006
 so-0.005
 were-0.005
 of-0.005
Token,
Token ID11
Feature activation+3.01e-04

Pos logit contributions
rys+0.038
alk+0.037
iny+0.037
 diary+0.036
zo+0.036
Neg logit contributions
five-0.009
anges-0.009
 YES-0.009
Of-0.009
Sure-0.009
Token the
Token ID262
Feature activation-1.64e-03

Pos logit contributions
five+0.000
anges+0.000
Sure+0.000
Of+0.000
 YES+0.000
Neg logit contributions
rys-0.001
 diary-0.001
alk-0.001
iny-0.001
 avail-0.001
Token sun
Token ID4252
Feature activation+6.76e-04

Pos logit contributions
rys+0.003
 diary+0.002
iny+0.002
 universe+0.002
cker+0.002
Neg logit contributions
anges-0.002
 YES-0.002
 how-0.002
five-0.002
 were-0.002
Token began
Token ID2540
Feature activation-8.47e-03

Pos logit contributions
five+0.001
 were+0.001
 YES+0.001
anges+0.001
 how+0.001
Neg logit contributions
rys-0.001
iny-0.001
alk-0.001
 diary-0.001
zo-0.001
Token to
Token ID284
Feature activation-9.56e-03

Pos logit contributions
rys+0.021
iny+0.020
alk+0.020
 diary+0.019
enna+0.019
Neg logit contributions
 were-0.006
five-0.005
 how-0.005
 YES-0.005
 thank-0.005
Token set
Token ID900
Feature activation+1.50e-02

Pos logit contributions
rys+0.019
iny+0.018
 diary+0.017
alk+0.017
enna+0.017
Neg logit contributions
 were-0.012
 thank-0.011
 how-0.011
 no-0.011
 yes-0.011
Token,
Token ID11
Feature activation+3.24e-04

Pos logit contributions
 were+0.012
five+0.012
 how+0.012
 YES+0.012
 thank+0.012
Neg logit contributions
rys-0.034
iny-0.032
alk-0.032
 diary-0.032
enna-0.031
Token strange
Token ID6283
Feature activation+1.38e-02

Pos logit contributions
 were+0.016
five+0.016
 YES+0.015
 how+0.015
 thank+0.015
Neg logit contributions
rys-0.023
iny-0.021
alk-0.021
 diary-0.021
enna-0.020
Token sound
Token ID2128
Feature activation+6.07e-03

Pos logit contributions
 how+0.017
 thank+0.017
 were+0.017
 so+0.017
five+0.016
Neg logit contributions
rys-0.026
umper-0.025
enna-0.025
alk-0.024
iny-0.024
Token.
Token ID13
Feature activation-7.93e-04

Pos logit contributions
 were+0.007
 how+0.007
 thank+0.007
five+0.007
 no+0.007
Neg logit contributions
rys-0.012
alk-0.011
iny-0.011
 diary-0.011
enna-0.011
Token
Token ID220
Feature activation-1.81e-02

Pos logit contributions
rys+0.002
alk+0.002
iny+0.002
 diary+0.002
ita+0.002
Neg logit contributions
 were-0.001
 lots-0.001
 YES-0.001
five-0.001
 how-0.001
Token
Token ID198
Feature activation+5.06e-03

Pos logit contributions
rys+0.037
iny+0.036
alk+0.036
 diary+0.034
umper+0.034
Neg logit contributions
five-0.018
 YES-0.018
 thank-0.018
 yes-0.018
anges-0.017
Token
Token ID198
Feature activation-3.53e-03

Pos logit contributions
five+0.005
 were+0.005
 YES+0.005
 how+0.004
 thank+0.004
Neg logit contributions
rys-0.011
iny-0.010
alk-0.010
 diary-0.010
zo-0.010
TokenT
Token ID51
Feature activation-5.12e-03

Pos logit contributions
rys+0.007
alk+0.006
 diary+0.006
iny+0.006
ishment+0.006
Neg logit contributions
 lots-0.004
 were-0.004
anges-0.004
Sure-0.004
 how-0.004
Tokeneddy
Token ID21874
Feature activation+2.70e-02

Pos logit contributions
rys+0.007
alk+0.006
 diary+0.006
 battlefield+0.006
 avail+0.006
Neg logit contributions
 were-0.009
 how-0.009
five-0.009
 YES-0.009
 Mike-0.009
Token stopped
Token ID5025
Feature activation-1.76e-02

Pos logit contributions
 were+0.023
 how+0.023
five+0.021
 lots+0.021
 YES+0.021
Neg logit contributions
rys-0.062
alk-0.059
iny-0.057
 diary-0.057
enna-0.056
Token and
Token ID290
Feature activation-5.79e-03

Pos logit contributions
enna+0.046
 battlefield+0.046
rys+0.046
 diary+0.046
alk+0.046
Neg logit contributions
 how-0.011
 so-0.009
 now-0.008
 were-0.008
 was-0.008
Token looked
Token ID3114
Feature activation-2.13e-02

Pos logit contributions
rys+0.012
iny+0.011
alk+0.011
 diary+0.011
zo+0.011
Neg logit contributions
five-0.006
 were-0.006
 YES-0.006
anges-0.006
 thank-0.006
Token said
Token ID531
Feature activation+3.88e-03

Pos logit contributions
rys+0.011
iny+0.010
ita+0.010
 diary+0.010
alk+0.010
Neg logit contributions
five-0.006
anges-0.005
 YES-0.005
 thank-0.005
 were-0.005
Token it
Token ID340
Feature activation+1.56e-02

Pos logit contributions
five+0.004
oops+0.004
Eventually+0.004
Sure+0.004
Of+0.004
Neg logit contributions
 diary-0.007
 weeping-0.007
rys-0.007
 violin-0.007
zo-0.006
Token made
Token ID925
Feature activation+1.30e-02

Pos logit contributions
five+0.007
 were+0.007
 YES+0.007
anges+0.006
 thank+0.006
Neg logit contributions
rys-0.041
iny-0.039
alk-0.039
 diary-0.039
zo-0.038
Token him
Token ID683
Feature activation+2.57e-03

Pos logit contributions
 were+0.011
five+0.011
 YES+0.011
 how+0.011
 thank+0.011
Neg logit contributions
rys-0.029
iny-0.028
alk-0.027
 diary-0.027
enna-0.027
Token happy
Token ID3772
Feature activation+1.17e-02

Pos logit contributions
five+0.003
anges+0.003
 YES+0.003
Sure+0.003
 were+0.003
Neg logit contributions
rys-0.005
 diary-0.005
iny-0.005
alk-0.005
zo-0.005
Token.
Token ID13
Feature activation-8.17e-04

Pos logit contributions
five+0.013
 YES+0.013
anges+0.013
 were+0.012
 thank+0.012
Neg logit contributions
rys-0.023
iny-0.022
alk-0.022
 diary-0.022
zo-0.021
Token
Token ID198
Feature activation+4.42e-03

Pos logit contributions
rys+0.002
iny+0.002
alk+0.002
 diary+0.002
ita+0.002
Neg logit contributions
 were-0.001
five-0.001
 how-0.001
 YES-0.001
 lots-0.001
Token
Token ID198
Feature activation-3.90e-03

Pos logit contributions
five+0.004
 were+0.004
 YES+0.004
 how+0.004
 thank+0.004
Neg logit contributions
rys-0.010
iny-0.009
alk-0.009
 diary-0.009
enna-0.009
TokenMom
Token ID29252
Feature activation+1.37e-02

Pos logit contributions
rys+0.007
iny+0.007
alk+0.007
 diary+0.007
enna+0.007
Neg logit contributions
 were-0.005
five-0.005
 YES-0.005
 how-0.005
 thank-0.005
Token explained
Token ID4893
Feature activation+6.92e-03

Pos logit contributions
five+0.016
anges+0.015
 YES+0.015
oops+0.014
 oh+0.014
Neg logit contributions
iny-0.026
ita-0.026
enna-0.025
zo-0.025
rys-0.025
Token to
Token ID284
Feature activation+1.70e-03

Pos logit contributions
five+0.005
 YES+0.005
 were+0.005
Of+0.004
Me+0.004
Neg logit contributions
rys-0.016
 diary-0.016
iny-0.015
alk-0.015
zo-0.015
Token black
Token ID2042
Feature activation-1.53e-02

Pos logit contributions
rys+0.019
alk+0.019
 diary+0.018
iny+0.018
ky+0.018
Neg logit contributions
 YES-0.009
Of-0.009
five-0.009
Sure-0.009
Not-0.008
Token and
Token ID290
Feature activation-2.83e-03

Pos logit contributions
rys+0.026
alk+0.025
iny+0.024
 diary+0.024
enna+0.023
Neg logit contributions
 YES-0.021
five-0.021
Of-0.021
 were-0.021
anges-0.021
Token white
Token ID2330
Feature activation-1.62e-02

Pos logit contributions
rys+0.005
iny+0.005
alk+0.005
umper+0.005
enna+0.005
Neg logit contributions
 were-0.004
 lots-0.004
 thank-0.004
 no-0.004
 how-0.004
Token squares
Token ID24438
Feature activation-1.74e-02

Pos logit contributions
rys+0.030
iny+0.028
umper+0.027
enna+0.027
alk+0.026
Neg logit contributions
 were-0.022
five-0.021
 how-0.021
 lots-0.021
 thank-0.020
Token.
Token ID13
Feature activation-4.19e-03

Pos logit contributions
rys+0.033
iny+0.031
alk+0.030
 diary+0.030
enna+0.029
Neg logit contributions
 were-0.022
 how-0.021
five-0.021
 lots-0.021
 no-0.021
Token But
Token ID887
Feature activation+7.90e-03

Pos logit contributions
rys+0.008
alk+0.007
 battlefield+0.007
iny+0.007
ishment+0.007
Neg logit contributions
five-0.005
 thank-0.005
 YES-0.005
 were-0.005
 yes-0.005
Token they
Token ID484
Feature activation-8.13e-03

Pos logit contributions
five+0.009
Of+0.007
 YES+0.007
Sure+0.007
 thank+0.007
Neg logit contributions
rys-0.016
 diary-0.015
ishment-0.015
 avail-0.015
ci-0.015
Token could
Token ID714
Feature activation+4.06e-03

Pos logit contributions
rys+0.017
alk+0.016
enna+0.016
zo+0.016
 diary+0.016
Neg logit contributions
five-0.008
 YES-0.007
Of-0.007
 thank-0.007
rav-0.007
Token not
Token ID407
Feature activation-1.25e-02

Pos logit contributions
five+0.003
 YES+0.003
 were+0.003
anges+0.003
 lots+0.003
Neg logit contributions
rys-0.010
iny-0.009
alk-0.009
 diary-0.009
zo-0.009
Token make
Token ID787
Feature activation-5.97e-03

Pos logit contributions
rys+0.026
iny+0.024
 diary+0.024
alk+0.024
ita+0.024
Neg logit contributions
 were-0.014
 how-0.013
 thank-0.013
five-0.013
 no-0.013
Token a
Token ID257
Feature activation+5.68e-03

Pos logit contributions
rys+0.014
iny+0.013
alk+0.013
 diary+0.013
enna+0.013
Neg logit contributions
 were-0.004
five-0.004
 YES-0.004
 how-0.004
 thank-0.004
Token She
Token ID1375
Feature activation+2.35e-03

Pos logit contributions
rys+0.004
iny+0.004
zo+0.004
alk+0.003
ci+0.003
Neg logit contributions
 were-0.002
 Mike-0.002
 how-0.002
 no-0.002
 lots-0.002
Token did
Token ID750
Feature activation-3.21e-04

Pos logit contributions
five+0.002
 YES+0.002
anges+0.002
 how+0.002
 thank+0.002
Neg logit contributions
rys-0.005
iny-0.005
alk-0.005
 diary-0.005
ita-0.005
Token not
Token ID407
Feature activation-6.99e-03

Pos logit contributions
rys+0.001
iny+0.001
alk+0.001
 diary+0.001
enna+0.001
Neg logit contributions
 were-0.000
five-0.000
 how-0.000
 YES-0.000
 thank-0.000
Token know
Token ID760
Feature activation+9.94e-03

Pos logit contributions
rys+0.018
ci+0.017
alk+0.017
ations+0.016
 diary+0.016
Neg logit contributions
 thank-0.005
five-0.005
 were-0.005
 no-0.005
 how-0.005
Token if
Token ID611
Feature activation-1.63e-02

Pos logit contributions
five+0.008
 YES+0.007
Sure+0.007
Of+0.007
anges+0.006
Neg logit contributions
rys-0.023
 diary-0.022
iny-0.022
alk-0.022
enna-0.021
Token they
Token ID484
Feature activation-1.30e-02

Pos logit contributions
rys+0.034
iny+0.032
alk+0.032
 diary+0.031
enna+0.030
Neg logit contributions
 were-0.017
five-0.017
 how-0.016
 YES-0.016
 thank-0.016
Token would
Token ID561
Feature activation-3.09e-03

Pos logit contributions
rys+0.032
alk+0.031
iny+0.031
 diary+0.031
zo+0.031
Neg logit contributions
 YES-0.007
anges-0.007
five-0.007
Of-0.006
Sure-0.006
Token hurt
Token ID5938
Feature activation-1.19e-02

Pos logit contributions
rys+0.007
iny+0.007
umper+0.007
alk+0.007
enna+0.007
Neg logit contributions
 were-0.003
 thank-0.003
 no-0.003
 how-0.003
 yes-0.003
Token her
Token ID607
Feature activation-2.87e-03

Pos logit contributions
rys+0.027
alk+0.026
iny+0.025
 diary+0.025
enna+0.025
Neg logit contributions
 were-0.011
 no-0.011
 how-0.011
 lots-0.010
 Mike-0.010
Token or
Token ID393
Feature activation-4.54e-03

Pos logit contributions
rys+0.006
alk+0.006
iny+0.006
enna+0.005
umper+0.005
Neg logit contributions
 were-0.003
 no-0.003
 how-0.003
 yes-0.003
 thank-0.003
Token her
Token ID607
Feature activation-8.51e-03

Pos logit contributions
rys+0.010
iny+0.010
alk+0.010
umper+0.009
enna+0.009
Neg logit contributions
 how-0.005
 thank-0.005
 were-0.005
 no-0.005
 yes-0.004
Token'm
Token ID1101
Feature activation-1.15e-02

Pos logit contributions
 YES+0.004
 how+0.004
anges+0.004
five+0.004
 lots+0.004
Neg logit contributions
rys-0.007
iny-0.007
alk-0.007
 diary-0.007
zo-0.007
Token hungry
Token ID14720
Feature activation+3.26e-03

Pos logit contributions
rys+0.026
alk+0.025
iny+0.025
 diary+0.025
enna+0.024
Neg logit contributions
 how-0.009
 no-0.009
 thank-0.009
 were-0.009
five-0.009
Token.
Token ID13
Feature activation-4.64e-03

Pos logit contributions
 YES+0.005
five+0.005
 were+0.005
anges+0.005
Of+0.005
Neg logit contributions
rys-0.005
iny-0.005
alk-0.005
 diary-0.005
ita-0.005
Token And
Token ID843
Feature activation+6.88e-03

Pos logit contributions
rys+0.009
iny+0.008
alk+0.008
 diary+0.008
zo+0.008
Neg logit contributions
 were-0.005
 YES-0.005
 lots-0.005
five-0.005
 how-0.005
Token I
Token ID314
Feature activation-1.42e-03

Pos logit contributions
five+0.009
Eventually+0.009
Sure+0.009
 YES+0.008
 were+0.008
Neg logit contributions
 diary-0.011
rys-0.011
 universe-0.010
ci-0.010
 bean-0.010
Token'm
Token ID1101
Feature activation-1.75e-02

Pos logit contributions
rys+0.003
iny+0.003
alk+0.003
zo+0.003
 diary+0.003
Neg logit contributions
 YES-0.001
anges-0.001
 how-0.001
five-0.001
 lots-0.001
Token curious
Token ID11040
Feature activation-8.31e-03

Pos logit contributions
rys+0.038
alk+0.037
iny+0.037
 diary+0.036
enna+0.036
Neg logit contributions
 no-0.016
 how-0.016
 thank-0.016
 were-0.016
 lots-0.016
Token.
Token ID13
Feature activation-4.08e-03

Pos logit contributions
rys+0.013
iny+0.012
 diary+0.012
alk+0.012
zo+0.012
Neg logit contributions
five-0.012
 YES-0.012
anges-0.012
 were-0.012
Sure-0.012
Token I
Token ID314
Feature activation-3.70e-04

Pos logit contributions
rys+0.008
iny+0.007
 diary+0.007
alk+0.007
zo+0.007
Neg logit contributions
 were-0.005
 YES-0.004
five-0.004
 lots-0.004
 how-0.004
Token want
Token ID765
Feature activation+4.81e-04

Pos logit contributions
rys+0.001
iny+0.001
alk+0.001
 diary+0.001
zo+0.001
Neg logit contributions
 YES-0.000
five-0.000
 how-0.000
anges-0.000
 were-0.000
Token to
Token ID284
Feature activation-1.85e-03

Pos logit contributions
 YES+0.000
five+0.000
 were+0.000
Sure+0.000
Of+0.000
Neg logit contributions
rys-0.001
 diary-0.001
alk-0.001
iny-0.001
 battlefield-0.001
Token dad
Token ID9955
Feature activation+1.21e-02

Pos logit contributions
rys+0.003
alk+0.002
iny+0.002
 diary+0.002
umper+0.002
Neg logit contributions
 were-0.001
 how-0.001
 lots-0.001
 thank-0.001
five-0.001
Token "
Token ID366
Feature activation-1.62e-03

Pos logit contributions
five+0.007
 YES+0.007
anges+0.006
 were+0.006
Of+0.006
Neg logit contributions
rys-0.030
iny-0.029
alk-0.029
 diary-0.028
enna-0.028
TokenDaddy
Token ID48280
Feature activation-2.30e-03

Pos logit contributions
rys+0.003
alk+0.003
 diary+0.003
 battlefield+0.003
iny+0.003
Neg logit contributions
 were-0.002
Sure-0.002
 lots-0.002
 YES-0.002
five-0.002
Token,
Token ID11
Feature activation+5.90e-03

Pos logit contributions
rys+0.005
iny+0.005
alk+0.005
enna+0.005
ita+0.005
Neg logit contributions
five-0.002
 YES-0.002
Of-0.001
Sure-0.001
anges-0.001
Token why
Token ID1521
Feature activation+4.34e-03

Pos logit contributions
five+0.007
Eventually+0.006
anges+0.006
 YES+0.006
Sure+0.006
Neg logit contributions
rys-0.012
 diary-0.012
 universe-0.011
 alien-0.011
 violin-0.011
Token are
Token ID389
Feature activation-1.07e-02

Pos logit contributions
five+0.002
 YES+0.002
 were+0.002
anges+0.002
 how+0.002
Neg logit contributions
rys-0.011
iny-0.011
alk-0.011
 diary-0.010
zo-0.010
Token we
Token ID356
Feature activation-1.60e-02

Pos logit contributions
rys+0.022
iny+0.021
alk+0.020
 diary+0.020
zo+0.020
Neg logit contributions
five-0.011
 were-0.011
 YES-0.011
 thank-0.011
 how-0.010
Token in
Token ID287
Feature activation-1.38e-02

Pos logit contributions
rys+0.031
iny+0.029
alk+0.029
 diary+0.028
ita+0.028
Neg logit contributions
 were-0.019
 how-0.019
 no-0.018
 lots-0.018
 thank-0.018
Token this
Token ID428
Feature activation-7.61e-03

Pos logit contributions
rys+0.036
iny+0.036
enna+0.036
ita+0.035
alk+0.035
Neg logit contributions
 no-0.009
 how-0.008
 lots-0.008
 were-0.007
 so-0.007
Token tunnel
Token ID13275
Feature activation-3.53e-03

Pos logit contributions
rys+0.012
enna+0.012
umper+0.011
ita+0.011
iny+0.011
Neg logit contributions
 lots-0.013
 was-0.013
 were-0.013
 no-0.012
 good-0.012
Token?"
Token ID1701
Feature activation-6.09e-03

Pos logit contributions
rys+0.006
iny+0.005
alk+0.005
 diary+0.005
enna+0.005
Neg logit contributions
 were-0.005
five-0.005
 how-0.005
 YES-0.005
 lots-0.005
Token carefully
Token ID7773
Feature activation-3.47e-03

Pos logit contributions
iny+0.008
ita+0.007
zo+0.007
cker+0.007
ky+0.007
Neg logit contributions
 YES-0.005
Of-0.005
anges-0.005
five-0.005
 thank-0.005
Token put
Token ID1234
Feature activation+4.90e-03

Pos logit contributions
rys+0.006
alk+0.006
iny+0.006
ky+0.006
 diary+0.006
Neg logit contributions
 YES-0.004
five-0.004
Of-0.004
 thank-0.004
anges-0.004
Token the
Token ID262
Feature activation-1.51e-02

Pos logit contributions
rav+0.006
 YES+0.006
five+0.006
Of+0.006
Sure+0.005
Neg logit contributions
rys-0.009
iny-0.008
umper-0.008
 diary-0.008
ky-0.008
Token cheese
Token ID9891
Feature activation-1.38e-02

Pos logit contributions
rys+0.024
 diary+0.024
iny+0.023
alk+0.023
 pen+0.022
Neg logit contributions
 Mike-0.022
 YES-0.022
 how-0.021
five-0.021
Of-0.021
Token,
Token ID11
Feature activation-1.56e-03

Pos logit contributions
rys+0.027
alk+0.026
iny+0.026
 diary+0.025
umper+0.024
Neg logit contributions
five-0.015
 YES-0.015
 thank-0.015
anges-0.015
 were-0.015
Token crack
Token ID8469
Feature activation-1.03e-02

Pos logit contributions
rys+0.003
iny+0.003
 diary+0.003
alk+0.003
ky+0.003
Neg logit contributions
five-0.002
 YES-0.002
rav-0.002
Sure-0.002
 Mike-0.002
Tokener
Token ID263
Feature activation+7.34e-03

Pos logit contributions
iny+0.018
rys+0.018
alk+0.017
ky+0.017
ations+0.017
Neg logit contributions
Of-0.014
Sure-0.014
five-0.014
 thank-0.013
 YES-0.013
Token and
Token ID290
Feature activation-9.51e-03

Pos logit contributions
five+0.007
 YES+0.007
 thank+0.007
 were+0.007
anges+0.007
Neg logit contributions
rys-0.015
iny-0.015
alk-0.015
 diary-0.014
ci-0.014
Token glass
Token ID5405
Feature activation-3.77e-03

Pos logit contributions
rys+0.016
 diary+0.016
 pen+0.016
 bur+0.016
ci+0.015
Neg logit contributions
rav-0.011
five-0.011
 YES-0.011
Sure-0.011
Of-0.011
Token of
Token ID286
Feature activation-2.10e-02

Pos logit contributions
rys+0.006
alk+0.006
iny+0.006
 bean+0.006
 diary+0.006
Neg logit contributions
Of-0.005
 YES-0.005
five-0.005
Sure-0.005
rav-0.005
Token milk
Token ID7545
Feature activation-4.64e-04

Pos logit contributions
rys+0.034
alk+0.033
iny+0.032
 diary+0.031
 bean+0.031
Neg logit contributions
 YES-0.030
rav-0.028
five-0.028
Sure-0.028
Of-0.028
Token a
Token ID257
Feature activation-8.99e-03

Pos logit contributions
rys+0.049
iny+0.047
alk+0.047
 diary+0.045
umper+0.045
Neg logit contributions
five-0.018
 YES-0.017
 thank-0.016
anges-0.016
Of-0.016
Token sudden
Token ID4802
Feature activation-1.14e-02

Pos logit contributions
rys+0.015
iny+0.014
alk+0.014
 diary+0.013
enna+0.013
Neg logit contributions
 were-0.013
five-0.013
 YES-0.013
 how-0.013
 thank-0.013
Token he
Token ID339
Feature activation-1.60e-02

Pos logit contributions
rys+0.028
alk+0.027
iny+0.027
 diary+0.027
zo+0.026
Neg logit contributions
five-0.007
 were-0.007
 YES-0.007
 how-0.006
 thank-0.006
Token noticed
Token ID6810
Feature activation+9.76e-03

Pos logit contributions
rys+0.031
iny+0.030
alk+0.029
 diary+0.029
zo+0.029
Neg logit contributions
five-0.018
 YES-0.017
 thank-0.017
anges-0.017
 how-0.017
Token a
Token ID257
Feature activation+7.71e-03

Pos logit contributions
five+0.007
 YES+0.007
 thank+0.007
Of+0.006
 were+0.006
Neg logit contributions
rys-0.023
iny-0.022
alk-0.022
 diary-0.022
zo-0.021
Token rabb
Token ID19368
Feature activation-2.68e-02

Pos logit contributions
 were+0.011
 thank+0.010
 lots+0.010
 YES+0.010
 no+0.010
Neg logit contributions
rys-0.013
iny-0.012
alk-0.012
enna-0.012
umper-0.012
Tokenith
Token ID342
Feature activation-1.20e-03

Pos logit contributions
 diary+0.052
ci+0.052
 avail+0.052
rys+0.050
ishment+0.049
Neg logit contributions
 of-0.034
Mike-0.028
 were-0.027
anges-0.027
Sure-0.027
Tokenole
Token ID2305
Feature activation-1.76e-02

Pos logit contributions
rys+0.002
alk+0.002
iny+0.002
 battlefield+0.002
ishment+0.002
Neg logit contributions
 were-0.001
 how-0.001
 of-0.001
 no-0.001
 lots-0.001
Token.
Token ID13
Feature activation-2.97e-03

Pos logit contributions
rys+0.035
iny+0.033
alk+0.033
 diary+0.032
enna+0.031
Neg logit contributions
 were-0.020
five-0.019
 how-0.019
 YES-0.019
 thank-0.019
Token Hop
Token ID9996
Feature activation-1.47e-02

Pos logit contributions
rys+0.006
alk+0.006
iny+0.006
 diary+0.006
enna+0.006
Neg logit contributions
 were-0.003
five-0.003
 how-0.003
 YES-0.003
 thank-0.003
Token was
Token ID373
Feature activation-5.69e-04

Pos logit contributions
rys+0.032
iny+0.032
zo+0.032
ita+0.031
alk+0.031
Neg logit contributions
five-0.012
 thank-0.012
 YES-0.012
anges-0.011
Of-0.011
Token'm
Token ID1101
Feature activation-1.33e-02

Pos logit contributions
 YES+0.010
anges+0.009
five+0.009
 how+0.009
 lots+0.009
Neg logit contributions
rys-0.017
iny-0.017
alk-0.016
 diary-0.016
zo-0.016
Token sorry
Token ID7926
Feature activation-2.58e-02

Pos logit contributions
rys+0.029
iny+0.028
alk+0.028
 diary+0.027
enna+0.026
Neg logit contributions
 were-0.012
five-0.012
 YES-0.012
 how-0.012
 thank-0.012
Token,
Token ID11
Feature activation+1.73e-03

Pos logit contributions
rys+0.055
iny+0.054
 diary+0.052
ita+0.052
zo+0.051
Neg logit contributions
five-0.025
 YES-0.023
Eventually-0.021
Sure-0.021
 were-0.021
Token my
Token ID616
Feature activation+1.08e-02

Pos logit contributions
five+0.002
 Faster+0.002
 YES+0.002
Eventually+0.002
ival+0.002
Neg logit contributions
rys-0.003
ishment-0.003
yt-0.003
ita-0.003
 avail-0.003
Token d
Token ID288
Feature activation+9.64e-03

Pos logit contributions
anges+0.013
Eventually+0.012
 Faster+0.011
 YES+0.011
five+0.011
Neg logit contributions
 diary-0.020
 weeping-0.019
ci-0.019
 violin-0.019
 universe-0.018
Tokenarl
Token ID7063
Feature activation-5.49e-02

Pos logit contributions
five+0.018
 were+0.017
anges+0.017
 how+0.017
 thank+0.017
Neg logit contributions
rys-0.012
alk-0.011
iny-0.011
 diary-0.011
 avail-0.011
Tokenings
Token ID654
Feature activation-8.75e-03

Pos logit contributions
rys+0.099
iny+0.093
alk+0.092
 diary+0.090
enna+0.088
Neg logit contributions
 were-0.071
five-0.070
 YES-0.069
 how-0.069
 thank-0.068
Token.
Token ID13
Feature activation-6.71e-03

Pos logit contributions
rys+0.015
iny+0.014
alk+0.014
 diary+0.013
zo+0.013
Neg logit contributions
 YES-0.012
five-0.012
anges-0.012
 how-0.012
 were-0.011
Token The
Token ID383
Feature activation-1.12e-03

Pos logit contributions
rys+0.014
alk+0.014
iny+0.014
 diary+0.013
enna+0.013
Neg logit contributions
five-0.007
 were-0.007
 thank-0.007
 how-0.006
 no-0.006
Token tornado
Token ID33718
Feature activation-7.20e-03

Pos logit contributions
rys+0.002
iny+0.002
alk+0.002
enna+0.002
ita+0.002
Neg logit contributions
 were-0.001
 thank-0.001
five-0.001
 no-0.001
 how-0.001
Token took
Token ID1718
Feature activation-8.30e-03

Pos logit contributions
rys+0.017
iny+0.016
alk+0.016
 diary+0.016
ci+0.016
Neg logit contributions
 YES-0.005
anges-0.005
five-0.005
 lots-0.005
Of-0.005
Token"
Token ID1
Feature activation-3.01e-03

Pos logit contributions
rys+0.006
iny+0.006
alk+0.005
 diary+0.005
ita+0.005
Neg logit contributions
 YES-0.003
 were-0.003
 thank-0.003
five-0.003
 how-0.003
TokenHello
Token ID15496
Feature activation-1.65e-02

Pos logit contributions
rys+0.005
iny+0.005
enna+0.005
ita+0.004
umper+0.004
Neg logit contributions
 how-0.004
 thank-0.004
 were-0.004
oops-0.004
 no-0.004
Token,
Token ID11
Feature activation+6.36e-03

Pos logit contributions
rys+0.032
 diary+0.032
ita+0.032
iny+0.032
yt+0.032
Neg logit contributions
five-0.016
 YES-0.015
Eventually-0.015
Sure-0.014
Of-0.014
Token my
Token ID616
Feature activation+1.65e-02

Pos logit contributions
five+0.009
 YES+0.008
Eventually+0.008
 Faster+0.008
Again+0.007
Neg logit contributions
 diary-0.010
 universe-0.010
yt-0.009
 violin-0.009
 Margaret-0.009
Token d
Token ID288
Feature activation+1.02e-02

Pos logit contributions
anges+0.023
Eventually+0.021
 Faster+0.020
Sure+0.019
five+0.019
Neg logit contributions
ind-0.026
 diary-0.026
 universe-0.026
ci-0.025
ky-0.025
Tokenarl
Token ID7063
Feature activation-5.41e-02

Pos logit contributions
 were+0.020
five+0.020
 YES+0.019
 how+0.019
 thank+0.019
Neg logit contributions
rys-0.012
iny-0.010
alk-0.010
 diary-0.010
enna-0.010
Tokenings
Token ID654
Feature activation-1.02e-02

Pos logit contributions
rys+0.097
iny+0.091
alk+0.091
 diary+0.089
enna+0.086
Neg logit contributions
 were-0.070
five-0.069
 YES-0.069
 how-0.068
 thank-0.068
Token.
Token ID13
Feature activation-7.07e-03

Pos logit contributions
rys+0.021
alk+0.020
iny+0.019
 diary+0.019
enna+0.019
Neg logit contributions
 were-0.011
five-0.011
 how-0.011
 thank-0.011
 lots-0.010
Token I
Token ID314
Feature activation+1.34e-03

Pos logit contributions
rys+0.013
iny+0.012
 diary+0.012
alk+0.012
enna+0.012
Neg logit contributions
 were-0.008
five-0.008
 YES-0.008
 lots-0.008
 how-0.008
Token'm
Token ID1101
Feature activation-1.34e-02

Pos logit contributions
anges+0.001
 YES+0.001
 how+0.001
five+0.001
 lots+0.001
Neg logit contributions
rys-0.003
iny-0.003
alk-0.003
 diary-0.003
zo-0.003
Token so
Token ID523
Feature activation+4.92e-03

Pos logit contributions
rys+0.029
iny+0.027
alk+0.027
 diary+0.027
enna+0.026
Neg logit contributions
 were-0.013
five-0.013
 YES-0.012
 how-0.012
 thank-0.012
Token day
Token ID1110
Feature activation-1.08e-02

Pos logit contributions
five+0.015
anges+0.015
 were+0.015
 YES+0.015
 how+0.015
Neg logit contributions
rys-0.017
iny-0.016
alk-0.016
 diary-0.015
ita-0.015
Token on
Token ID319
Feature activation+2.24e-03

Pos logit contributions
rys+0.019
iny+0.018
 diary+0.018
alk+0.017
umper+0.017
Neg logit contributions
five-0.014
 Mike-0.014
 YES-0.014
 how-0.014
Of-0.014
Token,
Token ID11
Feature activation+3.53e-03

Pos logit contributions
five+0.001
 were+0.001
 YES+0.001
 how+0.001
anges+0.001
Neg logit contributions
rys-0.006
iny-0.006
alk-0.006
 diary-0.006
ita-0.005
Token Lily
Token ID20037
Feature activation-9.56e-04

Pos logit contributions
five+0.004
 YES+0.004
anges+0.004
 Mike+0.004
oops+0.004
Neg logit contributions
rys-0.007
 diary-0.006
iny-0.006
alk-0.006
 universe-0.006
Token loved
Token ID6151
Feature activation-2.30e-03

Pos logit contributions
rys+0.002
iny+0.002
ita+0.002
zo+0.002
 diary+0.002
Neg logit contributions
anges-0.001
five-0.001
 YES-0.001
 yes-0.001
Of-0.001
Token av
Token ID1196
Feature activation-5.13e-02

Pos logit contributions
rys+0.005
iny+0.005
 diary+0.004
alk+0.004
umper+0.004
Neg logit contributions
five-0.002
 were-0.002
 YES-0.002
anges-0.002
 yes-0.002
Tokenoc
Token ID420
Feature activation+4.36e-02

Pos logit contributions
rys+0.091
iny+0.085
alk+0.084
 diary+0.082
enna+0.080
Neg logit contributions
 were-0.068
five-0.068
 YES-0.067
 how-0.066
 thank-0.066
Tokenados
Token ID22484
Feature activation-1.29e-02

Pos logit contributions
 were+0.094
 YES+0.090
 yes+0.089
 lots+0.089
Of+0.088
Neg logit contributions
rys-0.043
umper-0.038
alk-0.037
 diary-0.037
iny-0.036
Token and
Token ID290
Feature activation-7.10e-03

Pos logit contributions
rys+0.027
iny+0.027
 diary+0.026
alk+0.026
ita+0.025
Neg logit contributions
 YES-0.012
five-0.012
anges-0.012
 Mike-0.011
Sure-0.011
Token would
Token ID561
Feature activation-8.51e-03

Pos logit contributions
rys+0.014
 diary+0.013
ci+0.013
 weeping+0.013
alk+0.013
Neg logit contributions
five-0.008
anges-0.008
 YES-0.007
Sure-0.007
Of-0.007
Token ask
Token ID1265
Feature activation+1.15e-03

Pos logit contributions
rys+0.018
alk+0.017
iny+0.017
 diary+0.016
zo+0.016
Neg logit contributions
five-0.008
 YES-0.008
anges-0.008
 were-0.008
 how-0.008
Token,
Token ID11
Feature activation+3.37e-03

Pos logit contributions
rys+0.017
umper+0.016
iny+0.015
alk+0.015
 avail+0.015
Neg logit contributions
 were-0.008
 no-0.008
 lots-0.007
 how-0.007
 so-0.007
Token and
Token ID290
Feature activation-1.72e-02

Pos logit contributions
five+0.003
anges+0.003
 YES+0.003
 were+0.003
Sure+0.003
Neg logit contributions
rys-0.007
 diary-0.007
alk-0.007
iny-0.007
 weeping-0.007
Token all
Token ID477
Feature activation+4.28e-03

Pos logit contributions
rys+0.037
 diary+0.035
alk+0.035
iny+0.034
ci+0.034
Neg logit contributions
five-0.015
 YES-0.015
anges-0.015
 were-0.014
Sure-0.014
Token the
Token ID262
Feature activation-1.74e-02

Pos logit contributions
 were+0.003
 how+0.003
 thank+0.003
five+0.003
 lots+0.003
Neg logit contributions
rys-0.011
iny-0.010
alk-0.010
 diary-0.010
enna-0.010
Token other
Token ID584
Feature activation-2.90e-02

Pos logit contributions
rys+0.037
iny+0.034
alk+0.034
 diary+0.034
enna+0.033
Neg logit contributions
 were-0.017
five-0.017
 YES-0.017
 how-0.017
 thank-0.017
Token av
Token ID1196
Feature activation-7.03e-02

Pos logit contributions
rys+0.053
 diary+0.051
 universe+0.050
iny+0.049
alk+0.049
Neg logit contributions
 YES-0.036
five-0.036
anges-0.035
Sure-0.034
Of-0.033
Tokenoc
Token ID420
Feature activation+4.10e-02

Pos logit contributions
rys+0.127
alk+0.118
iny+0.117
 diary+0.115
enna+0.112
Neg logit contributions
 were-0.091
five-0.091
 how-0.089
 YES-0.088
 lots-0.088
Tokenados
Token ID22484
Feature activation-2.83e-02

Pos logit contributions
 were+0.082
 YES+0.081
 yes+0.081
 lots+0.080
Sure+0.079
Neg logit contributions
rys-0.044
alk-0.041
iny-0.039
umper-0.039
 diary-0.038
Token were
Token ID547
Feature activation+2.64e-02

Pos logit contributions
rys+0.067
alk+0.064
iny+0.064
 diary+0.063
enna+0.062
Neg logit contributions
five-0.020
 YES-0.019
 were-0.019
anges-0.019
 thank-0.018
Token very
Token ID845
Feature activation+1.84e-04

Pos logit contributions
five+0.026
Sure+0.024
 YES+0.024
Of+0.024
anges+0.024
Neg logit contributions
rys-0.056
iny-0.054
 diary-0.053
alk-0.053
 weeping-0.051
Token happy
Token ID3772
Feature activation+7.33e-03

Pos logit contributions
five+0.000
 YES+0.000
anges+0.000
 were+0.000
 how+0.000
Neg logit contributions
rys-0.000
iny-0.000
ind-0.000
zo-0.000
alk-0.000
Token kids
Token ID3988
Feature activation+9.60e-03

Pos logit contributions
 were+0.004
 how+0.004
 yes+0.004
anges+0.004
 lots+0.004
Neg logit contributions
rys-0.009
alk-0.008
iny-0.008
 diary-0.008
enna-0.008
Token.
Token ID13
Feature activation+3.92e-03

Pos logit contributions
 were+0.011
 how+0.011
five+0.011
 no+0.011
 yes+0.011
Neg logit contributions
rys-0.019
alk-0.018
iny-0.018
 diary-0.017
enna-0.017
Token They
Token ID1119
Feature activation+8.59e-03

Pos logit contributions
 were+0.005
five+0.005
 YES+0.005
 how+0.005
 thank+0.005
Neg logit contributions
rys-0.007
iny-0.007
alk-0.007
 diary-0.006
enna-0.006
Token liked
Token ID8288
Feature activation-3.74e-03

Pos logit contributions
five+0.008
 YES+0.007
anges+0.007
 oh+0.007
oops+0.007
Neg logit contributions
rys-0.018
enna-0.018
zo-0.018
alk-0.018
iny-0.018
Token to
Token ID284
Feature activation-1.06e-02

Pos logit contributions
rys+0.008
alk+0.008
 diary+0.008
iny+0.008
 battlefield+0.008
Neg logit contributions
five-0.003
 YES-0.003
 were-0.003
anges-0.003
 thank-0.003
Token solve
Token ID8494
Feature activation-4.95e-02

Pos logit contributions
rys+0.021
iny+0.020
umper+0.020
ind+0.019
ishment+0.019
Neg logit contributions
 thank-0.013
 were-0.013
 how-0.013
 yes-0.013
 lots-0.013
Token puzzles
Token ID24367
Feature activation-1.27e-02

Pos logit contributions
rys+0.108
iny+0.101
alk+0.100
enna+0.099
umper+0.099
Neg logit contributions
 how-0.048
 were-0.048
 lots-0.047
 no-0.044
five-0.044
Token and
Token ID290
Feature activation-1.46e-02

Pos logit contributions
rys+0.025
iny+0.023
alk+0.022
 diary+0.022
umper+0.022
Neg logit contributions
 were-0.015
 how-0.015
five-0.015
 lots-0.015
 thank-0.015
Token r
Token ID374
Feature activation-2.93e-02

Pos logit contributions
rys+0.031
iny+0.029
umper+0.029
alk+0.028
ita+0.028
Neg logit contributions
 were-0.017
 how-0.016
 lots-0.016
 no-0.016
 thank-0.016
Tokeniddles
Token ID29319
Feature activation-2.06e-02

Pos logit contributions
rys+0.046
 diary+0.041
alk+0.040
iny+0.040
 battlefield+0.039
Neg logit contributions
 were-0.047
 how-0.046
 lots-0.046
five-0.046
anges-0.046
Token.
Token ID13
Feature activation-5.06e-03

Pos logit contributions
rys+0.031
alk+0.030
iny+0.030
 diary+0.029
zo+0.029
Neg logit contributions
 YES-0.031
Of-0.031
Sure-0.030
five-0.030
 thank-0.030