TOP ACTIVATIONS
MAX = 3.504

the rice back in its rightful place . John was proud
restore it back to its rightful place . Although Sammy was
to return it to its rightful owner if you can find
the stolen squares to their rightful place . The
he got it so the rightful owner could find it .
returned the towel to its rightful owner . The owner was
put them back in their rightful place . After
The boot returned to its rightful owners and the conflict was
I am already married to Mrs . Swan ."
, he accidentally knocked over Mr . Smith 's drink that
it ? What if her poems were gone ?
, Sara knew why the bl ouse was so enormous -
He felt bad for making Mr . Bear feel not good
wonderful colours of the animals ' marks . <|endoftext|> Jack went
her strong teeth to stitch Mr . Snow man back together
dust and dirt . The st alks were hidden and the
others made him feel less gr umpy . The moral of
He always remembered how polite Mr . Smith was and how
king ate too many ch ocol ates and his tum my
James wanted to enjoy the kay ak before it disappeared ,

MIN ACTIVATIONS
MIN = -3.898

Jake had never had por ridge before and he was not
saw . She used cr ay ons of different colors .
dreamed . She used cr ay ons and stickers to make
. She writes with cr ay ons , pens , pencil
stamps . He has cr ay ons and scissors . He
, papers , and cr ay ons everywhere . She saw
filthy ! She hadn 't seen it look like this before
art . We used cr ay ons and markers to draw
couldn 't un - m elt it .
scissors , glue and cr ay ons . They make their
. She had many cr ay ons , papers , scissors
the whale had ever sm elt . He couldn 't help
. She had many cr ay ons and papers in her
. She had many cr ay ons and papers to make
stage . They use cr ay ons and stickers to decor
He had glue and cr ay on on his fur and
She hadn 't seen it before but it was
. She used many cr ay ons and paper . She
You should not use cr ay ons like that ."
that the flag had sm elt really sour ! Bob thought

INTERVAL 1.653 - 3.504
CONTAINS 0.520%

, the moral of the story was that playing jokes can
to show how happy it was that the storm was over
spend this money on anything you want ." Exc
. Lily was happy to share her candy with Emma .
put the food in the right place . "

INTERVAL -0.197 - 1.653
CONTAINS 62.556%

take care of your responsibilities , even if you don 't
J ill was so excited that she decided to make a
J enny 's dad called for her and she scrambled back
mom said she would help . Lily felt happy and relieved
in the park . He saw something strange , it was

INTERVAL -2.047 - -0.197
CONTAINS 36.893%

day he would make new clay figures and dreams .
said . She took their hands and walked with them to
to play chess . He moved the pieces around and then
She is happy . She thinks she has a very sharp
. He watched helpless ly as the ash that he wanted

INTERVAL -3.898 - -2.047
CONTAINS 0.032%

. Tim my had never seen one before . His dad
the post . She scrub bed the dirt away and soon
With a to othy grin , Kenny led the
excited . She had never seen an aer opl ane before
worried . She had never seen fashion before and she was
Token the
Token ID262
Feature activation+3.90e-02

Pos logit contributions
 Care+0.004
 snack+0.004
 spaghetti+0.004
!+0.004
 Sure+0.004
Neg logit contributions
spe-0.009
work-0.009
ho-0.009
 peac-0.009
 prope-0.008
Token rice
Token ID11464
Feature activation+2.08e-03

Pos logit contributions
spe+0.002
work+0.002
ho+0.002
 peac+0.002
 prope+0.002
Neg logit contributions
 snack-0.001
 spaghetti-0.001
 rest-0.001
 chair-0.001
 Care-0.001
Token back
Token ID736
Feature activation-1.15e-01

Pos logit contributions
spe+0.000
 prope+0.000
 peac+0.000
ateful+0.000
 narrative+0.000
Neg logit contributions
!-0.000
 snack-0.000
 while-0.000
 lunch-0.000
 spaghetti-0.000
Token in
Token ID287
Feature activation+1.13e+00

Pos logit contributions
!+0.002
 while+0.002
 snack+0.002
.+0.002
 minutes+0.002
Neg logit contributions
 peac-0.005
spe-0.005
 prope-0.005
work-0.005
 narrative-0.005
Token its
Token ID663
Feature activation+8.27e-01

Pos logit contributions
spe+0.050
work+0.048
ho+0.048
 peac+0.046
 prope+0.045
Neg logit contributions
 snack-0.023
 Care-0.023
!-0.023
 spaghetti-0.022
 Sure-0.021
Token rightful
Token ID43133
Feature activation+3.50e+00

Pos logit contributions
spe+0.039
work+0.038
ho+0.036
 peac+0.035
 prope+0.035
Neg logit contributions
 snack-0.015
 Care-0.014
 Sure-0.014
!-0.014
 spaghetti-0.013
Token place
Token ID1295
Feature activation+6.71e-01

Pos logit contributions
spe+0.153
work+0.148
ho+0.143
 peac+0.140
 prope+0.139
Neg logit contributions
 Care-0.069
 Sure-0.068
 snack-0.066
 minutes-0.065
 spaghetti-0.064
Token.
Token ID13
Feature activation-4.44e-01

Pos logit contributions
spe+0.038
 prope+0.038
 peac+0.038
work+0.038
ho+0.037
Neg logit contributions
!-0.005
 while-0.003
 snack-0.003
 Care-0.002
 Sure-0.002
Token John
Token ID1757
Feature activation-4.97e-01

Pos logit contributions
 snack+0.008
 pharmacy+0.008
 minutes+0.008
 spaghetti+0.007
ittens+0.007
Neg logit contributions
spe-0.021
work-0.019
ho-0.018
 prope-0.017
 peac-0.017
Token was
Token ID373
Feature activation-8.58e-02

Pos logit contributions
!+0.009
 As+0.007
As+0.007
 chair+0.007
 while+0.007
Neg logit contributions
work-0.023
spe-0.023
 peac-0.023
 narrative-0.022
 prope-0.022
Token proud
Token ID6613
Feature activation-5.23e-02

Pos logit contributions
!+0.001
 quick+0.001
 Dr+0.001
 Care+0.001
 minutes+0.001
Neg logit contributions
ho-0.004
work-0.004
spe-0.004
 peac-0.004
 narrative-0.004
Token restore
Token ID11169
Feature activation+6.11e-01

Pos logit contributions
 snack+0.000
 Care+0.000
!+0.000
 while+0.000
 rest+0.000
Neg logit contributions
spe-0.000
work-0.000
 prope-0.000
ho-0.000
 peac-0.000
Token it
Token ID340
Feature activation+2.23e-01

Pos logit contributions
spe+0.029
ho+0.027
work+0.027
 peac+0.027
 prope+0.027
Neg logit contributions
!-0.012
 Care-0.010
 snack-0.010
 spaghetti-0.010
 while-0.010
Token back
Token ID736
Feature activation+3.20e-02

Pos logit contributions
spe+0.012
 peac+0.012
 prope+0.012
work+0.012
ho+0.012
Neg logit contributions
!-0.002
 Care-0.002
 snack-0.002
 while-0.002
 spaghetti-0.002
Token to
Token ID284
Feature activation+2.88e-01

Pos logit contributions
spe+0.002
 peac+0.002
work+0.002
 prope+0.002
ho+0.002
Neg logit contributions
!-0.000
 Care-0.000
 snack-0.000
 Sure-0.000
 while-0.000
Token its
Token ID663
Feature activation+5.04e-01

Pos logit contributions
spe+0.014
ho+0.013
work+0.013
 prope+0.013
 peac+0.013
Neg logit contributions
!-0.005
 snack-0.005
 Care-0.005
 Sure-0.004
 spaghetti-0.004
Token rightful
Token ID43133
Feature activation+3.42e+00

Pos logit contributions
spe+0.021
work+0.021
ho+0.019
 peac+0.019
 prope+0.019
Neg logit contributions
 snack-0.011
!-0.011
 Care-0.011
 Sure-0.010
 minutes-0.010
Token place
Token ID1295
Feature activation+7.99e-01

Pos logit contributions
spe+0.145
work+0.139
ho+0.134
 peac+0.132
 prope+0.131
Neg logit contributions
 Care-0.072
 Sure-0.071
 snack-0.069
 spaghetti-0.068
 minutes-0.067
Token.
Token ID13
Feature activation-3.48e-01

Pos logit contributions
 prope+0.047
spe+0.046
 peac+0.046
work+0.045
ho+0.045
Neg logit contributions
!-0.006
 Care-0.002
 snack-0.002
 while-0.002
 Sure-0.002
Token Although
Token ID4900
Feature activation+7.24e-01

Pos logit contributions
 minutes+0.006
 snack+0.005
 pharmacy+0.005
 Care+0.005
 spaghetti+0.005
Neg logit contributions
spe-0.016
work-0.016
ho-0.015
 peac-0.015
melon-0.015
Token Sammy
Token ID44067
Feature activation-8.63e-01

Pos logit contributions
spe+0.038
work+0.037
 prope+0.036
ho+0.035
melon+0.033
Neg logit contributions
!-0.011
 while-0.009
 snack-0.008
 spaghetti-0.008
 Care-0.008
Token was
Token ID373
Feature activation-5.54e-01

Pos logit contributions
!+0.016
 walks+0.014
 rest+0.013
 Care+0.013
 quick+0.012
Neg logit contributions
spe-0.040
 prope-0.039
 peac-0.039
work-0.039
ateful-0.038
Token to
Token ID284
Feature activation-2.57e-01

Pos logit contributions
spe+0.013
work+0.012
ho+0.012
 peac+0.012
 prope+0.012
Neg logit contributions
!-0.004
 Care-0.004
 spaghetti-0.004
 snack-0.004
 As-0.004
Token return
Token ID1441
Feature activation+8.06e-01

Pos logit contributions
 snack+0.004
 rest+0.004
!+0.004
 Care+0.004
 nap+0.004
Neg logit contributions
spe-0.013
work-0.013
ho-0.012
 prope-0.012
 peac-0.011
Token it
Token ID340
Feature activation+3.19e-01

Pos logit contributions
spe+0.034
work+0.033
ho+0.032
 peac+0.032
 prope+0.031
Neg logit contributions
 Sure-0.016
 Care-0.016
 snack-0.015
 pharmacy-0.015
 spaghetti-0.015
Token to
Token ID284
Feature activation+2.72e-01

Pos logit contributions
spe+0.017
work+0.016
ho+0.016
 peac+0.015
 prope+0.015
Neg logit contributions
!-0.004
 Care-0.004
 Sure-0.004
 snack-0.004
 minutes-0.003
Token its
Token ID663
Feature activation+5.55e-01

Pos logit contributions
spe+0.013
work+0.012
ho+0.012
 prope+0.012
 peac+0.012
Neg logit contributions
!-0.005
 Care-0.005
 snack-0.005
 spaghetti-0.004
 Sure-0.004
Token rightful
Token ID43133
Feature activation+3.35e+00

Pos logit contributions
spe+0.021
work+0.021
ho+0.019
 peac+0.019
 prope+0.019
Neg logit contributions
 Care-0.014
 snack-0.014
 Sure-0.014
!-0.013
 minutes-0.013
Token owner
Token ID4870
Feature activation-1.01e+00

Pos logit contributions
spe+0.132
work+0.124
ho+0.119
 peac+0.118
 prope+0.118
Neg logit contributions
 Sure-0.080
 Care-0.080
 spaghetti-0.079
 snack-0.078
 Dr-0.077
Token if
Token ID611
Feature activation+2.30e-01

Pos logit contributions
 Care+0.010
 Sure+0.010
!+0.010
 snack+0.010
 spaghetti+0.009
Neg logit contributions
spe-0.053
work-0.051
ho-0.050
 prope-0.050
 peac-0.050
Token you
Token ID345
Feature activation+1.10e+00

Pos logit contributions
spe+0.011
work+0.011
ho+0.010
 peac+0.010
 prope+0.010
Neg logit contributions
 Care-0.003
 Sure-0.003
 snack-0.003
!-0.003
 minutes-0.003
Token can
Token ID460
Feature activation+1.00e+00

Pos logit contributions
spe+0.054
 prope+0.051
work+0.050
 peac+0.049
ho+0.047
Neg logit contributions
!-0.021
 rest-0.020
 snack-0.018
 nap-0.017
 Care-0.017
Token find
Token ID1064
Feature activation+3.81e-01

Pos logit contributions
spe+0.047
work+0.045
ho+0.043
 peac+0.043
 prope+0.043
Neg logit contributions
 Care-0.017
 snack-0.016
 Sure-0.016
!-0.016
 pharmacy-0.016
Token the
Token ID262
Feature activation+3.91e-02

Pos logit contributions
spe+0.022
work+0.021
ho+0.020
 peac+0.020
 prope+0.019
Neg logit contributions
 Care-0.012
 Sure-0.012
 spaghetti-0.012
 minutes-0.012
 pharmacy-0.012
Token stolen
Token ID9909
Feature activation+7.00e-01

Pos logit contributions
spe+0.002
work+0.002
ho+0.002
 prope+0.002
 peac+0.002
Neg logit contributions
 Care-0.001
 snack-0.001
 Sure-0.001
 spaghetti-0.001
 minutes-0.001
Token squares
Token ID24438
Feature activation-2.22e-01

Pos logit contributions
spe+0.028
work+0.027
ho+0.026
 prope+0.026
 peac+0.026
Neg logit contributions
 Care-0.016
!-0.016
 snack-0.016
 Sure-0.016
 spaghetti-0.015
Token to
Token ID284
Feature activation-4.91e-01

Pos logit contributions
!+0.003
 snack+0.002
 while+0.002
 Care+0.002
 minutes+0.002
Neg logit contributions
spe-0.012
 prope-0.011
work-0.011
 peac-0.011
ho-0.011
Token their
Token ID511
Feature activation+2.17e-01

Pos logit contributions
 snack+0.010
!+0.010
 Care+0.010
 Sure+0.010
 Dr+0.009
Neg logit contributions
spe-0.022
work-0.021
 prope-0.021
ho-0.021
 peac-0.020
Token rightful
Token ID43133
Feature activation+3.32e+00

Pos logit contributions
spe+0.009
work+0.009
 prope+0.009
 peac+0.009
ho+0.009
Neg logit contributions
 snack-0.005
 spaghetti-0.004
!-0.004
 Sure-0.004
 Care-0.004
Token place
Token ID1295
Feature activation+4.81e-01

Pos logit contributions
spe+0.129
work+0.121
ho+0.117
 peac+0.115
 prope+0.113
Neg logit contributions
 Care-0.082
 Sure-0.082
 snack-0.079
 spaghetti-0.079
 Dr-0.078
Token.
Token ID13
Feature activation-1.29e-01

Pos logit contributions
spe+0.027
 prope+0.027
work+0.027
 peac+0.027
ho+0.026
Neg logit contributions
!-0.003
 Care-0.002
 snack-0.002
 Sure-0.002
 while-0.002
Token
Token ID198
Feature activation+4.00e-02

Pos logit contributions
 snack+0.002
 minutes+0.002
 spaghetti+0.002
 pharmacy+0.001
 Care+0.001
Neg logit contributions
spe-0.007
work-0.006
ho-0.006
 peac-0.006
 prope-0.006
Token
Token ID198
Feature activation-4.15e-01

Pos logit contributions
spe+0.002
ho+0.002
work+0.002
 peac+0.002
 prope+0.002
Neg logit contributions
 minutes-0.000
 spaghetti-0.000
 snack-0.000
 Care-0.000
 Sure-0.000
TokenThe
Token ID464
Feature activation-3.21e-01

Pos logit contributions
 Care+0.007
 Sure+0.007
 snack+0.007
 As+0.006
 minutes+0.006
Neg logit contributions
spe-0.019
work-0.019
ho-0.018
 peac-0.018
 prope-0.018
Token he
Token ID339
Feature activation+1.27e-01

Pos logit contributions
 prope+0.072
spe+0.071
work+0.070
ho+0.068
 peac+0.068
Neg logit contributions
 spaghetti-0.009
 rest-0.009
 Dr-0.008
 snack-0.008
!-0.008
Token got
Token ID1392
Feature activation+7.21e-01

Pos logit contributions
spe+0.005
work+0.005
 prope+0.005
 peac+0.005
ho+0.005
Neg logit contributions
!-0.003
 snack-0.003
 Care-0.003
 Sure-0.003
 walks-0.003
Token it
Token ID340
Feature activation+9.32e-01

Pos logit contributions
spe+0.036
work+0.035
 prope+0.035
 peac+0.033
ho+0.033
Neg logit contributions
!-0.013
 snack-0.011
 rest-0.011
 while-0.010
 Care-0.010
Token so
Token ID523
Feature activation+3.45e-02

Pos logit contributions
spe+0.047
 prope+0.046
 peac+0.045
work+0.045
ho+0.044
Neg logit contributions
!-0.015
 snack-0.013
 while-0.013
 spaghetti-0.012
 rest-0.011
Token the
Token ID262
Feature activation+3.02e-01

Pos logit contributions
work+0.002
spe+0.002
 prope+0.002
 peac+0.002
 narrative+0.001
Neg logit contributions
 quick-0.001
!-0.001
 while-0.001
 rest-0.001
 snack-0.001
Token rightful
Token ID43133
Feature activation+3.30e+00

Pos logit contributions
spe+0.013
work+0.013
 prope+0.013
ho+0.012
 peac+0.012
Neg logit contributions
 snack-0.006
 minutes-0.006
 rest-0.006
 pharmacy-0.005
 Care-0.005
Token owner
Token ID4870
Feature activation-3.19e-01

Pos logit contributions
spe+0.138
work+0.133
ho+0.128
 peac+0.126
 prope+0.126
Neg logit contributions
 Care-0.071
 Sure-0.071
 snack-0.069
 minutes-0.068
!-0.068
Token could
Token ID714
Feature activation+1.96e-01

Pos logit contributions
!+0.007
 snack+0.006
 while+0.006
 Care+0.006
 Sure+0.006
Neg logit contributions
 prope-0.014
spe-0.013
 peac-0.013
work-0.013
 narrative-0.013
Token find
Token ID1064
Feature activation-2.33e-01

Pos logit contributions
spe+0.010
work+0.009
 prope+0.009
 peac+0.009
ho+0.008
Neg logit contributions
!-0.003
 snack-0.003
 rest-0.003
 Care-0.003
 nap-0.003
Token it
Token ID340
Feature activation+4.06e-01

Pos logit contributions
 Care+0.005
 Sure+0.004
 snack+0.004
!+0.004
 minutes+0.004
Neg logit contributions
spe-0.010
work-0.010
ho-0.010
 peac-0.009
 prope-0.009
Token.
Token ID13
Feature activation+1.37e-01

Pos logit contributions
spe+0.023
 prope+0.023
work+0.022
 peac+0.022
ho+0.022
Neg logit contributions
!-0.004
 snack-0.003
 while-0.002
 Care-0.002
 Sure-0.002
Token returned
Token ID4504
Feature activation+3.84e-01

Pos logit contributions
spe+0.010
work+0.009
 prope+0.009
 peac+0.009
ho+0.009
Neg logit contributions
 Care-0.004
 snack-0.004
 Sure-0.004
!-0.004
 while-0.004
Token the
Token ID262
Feature activation-3.03e-02

Pos logit contributions
spe+0.017
work+0.016
ho+0.016
 peac+0.015
 prope+0.015
Neg logit contributions
 Care-0.008
 Sure-0.008
 snack-0.007
 minutes-0.007
!-0.007
Token towel
Token ID24808
Feature activation-3.49e-01

Pos logit contributions
 snack+0.001
 Care+0.001
 spaghetti+0.001
 minutes+0.000
 chair+0.000
Neg logit contributions
spe-0.001
work-0.001
ho-0.001
 peac-0.001
 prope-0.001
Token to
Token ID284
Feature activation-2.55e-01

Pos logit contributions
!+0.006
 while+0.005
.+0.005
 snack+0.005
 Care+0.004
Neg logit contributions
 prope-0.017
spe-0.017
 peac-0.017
 narrative-0.016
ateful-0.016
Token its
Token ID663
Feature activation+2.53e-01

Pos logit contributions
 Care+0.005
 Sure+0.005
 snack+0.004
 minutes+0.004
 spaghetti+0.004
Neg logit contributions
spe-0.012
work-0.011
ho-0.011
 peac-0.011
 prope-0.011
Token rightful
Token ID43133
Feature activation+3.26e+00

Pos logit contributions
spe+0.011
work+0.011
ho+0.010
 peac+0.010
 prope+0.010
Neg logit contributions
 snack-0.005
 Care-0.005
!-0.005
 Sure-0.005
 pharmacy-0.005
Token owner
Token ID4870
Feature activation-1.06e+00

Pos logit contributions
spe+0.128
work+0.121
ho+0.117
 peac+0.115
 prope+0.114
Neg logit contributions
 Care-0.079
 Sure-0.078
 snack-0.076
 spaghetti-0.076
 Dr-0.074
Token.
Token ID13
Feature activation+9.08e-02

Pos logit contributions
 Care+0.011
 Sure+0.011
 snack+0.010
 minutes+0.010
 spaghetti+0.010
Neg logit contributions
spe-0.056
work-0.055
ho-0.053
 peac-0.052
 prope-0.052
Token The
Token ID383
Feature activation-1.05e-01

Pos logit contributions
spe+0.004
work+0.004
ho+0.003
 peac+0.003
melon+0.003
Neg logit contributions
 minutes-0.002
 snack-0.002
 snowball-0.002
 spaghetti-0.002
!-0.002
Token owner
Token ID4870
Feature activation-3.36e-01

Pos logit contributions
 snack+0.002
 pharmacy+0.002
 Care+0.002
 spaghetti+0.002
 rest+0.002
Neg logit contributions
spe-0.004
work-0.004
ho-0.004
 prope-0.004
 peac-0.004
Token was
Token ID373
Feature activation-3.64e-01

Pos logit contributions
!+0.005
 Sure+0.005
 Care+0.005
 Dr+0.005
 snack+0.005
Neg logit contributions
 prope-0.016
work-0.015
 peac-0.015
spe-0.015
ho-0.014
Token put
Token ID1234
Feature activation-6.44e-01

Pos logit contributions
spe+0.001
work+0.001
ho+0.001
 peac+0.001
 prope+0.001
Neg logit contributions
 Care-0.000
 snack-0.000
 Sure-0.000
 spaghetti-0.000
!-0.000
Token them
Token ID606
Feature activation-2.19e-01

Pos logit contributions
 Care+0.015
 snack+0.014
!+0.014
 Sure+0.014
 spaghetti+0.014
Neg logit contributions
spe-0.027
work-0.026
ho-0.025
 peac-0.025
 prope-0.024
Token back
Token ID736
Feature activation-2.88e-01

Pos logit contributions
!+0.005
 snack+0.005
 while+0.004
 Care+0.004
 spaghetti+0.004
Neg logit contributions
 peac-0.010
spe-0.010
ho-0.010
work-0.009
 prope-0.009
Token in
Token ID287
Feature activation+7.63e-01

Pos logit contributions
!+0.006
 while+0.005
 Care+0.005
 snack+0.005
 minutes+0.005
Neg logit contributions
 peac-0.013
spe-0.013
 prope-0.012
ho-0.012
work-0.012
Token their
Token ID511
Feature activation+5.57e-01

Pos logit contributions
spe+0.035
work+0.034
ho+0.034
 peac+0.033
 prope+0.032
Neg logit contributions
!-0.014
 Care-0.014
 snack-0.013
 minutes-0.013
 spaghetti-0.013
Token rightful
Token ID43133
Feature activation+3.19e+00

Pos logit contributions
spe+0.027
work+0.026
ho+0.025
 peac+0.025
 prope+0.024
Neg logit contributions
 Care-0.009
 Sure-0.009
 snack-0.009
 minutes-0.008
 spaghetti-0.008
Token place
Token ID1295
Feature activation+4.05e-01

Pos logit contributions
spe+0.142
work+0.132
 prope+0.127
ho+0.127
 peac+0.126
Neg logit contributions
 Care-0.058
 Sure-0.058
 snack-0.058
 spaghetti-0.058
 Dr-0.057
Token.
Token ID13
Feature activation-3.69e-01

Pos logit contributions
spe+0.023
work+0.022
 peac+0.022
ho+0.022
 prope+0.022
Neg logit contributions
!-0.003
 Care-0.003
 snack-0.002
 Sure-0.002
 spaghetti-0.002
Token
Token ID198
Feature activation-3.06e-01

Pos logit contributions
 minutes+0.008
 snack+0.008
 pharmacy+0.008
 snowball+0.008
 walks+0.007
Neg logit contributions
spe-0.015
work-0.014
melon-0.013
ho-0.012
ull-0.012
Token
Token ID198
Feature activation-6.04e-01

Pos logit contributions
 minutes+0.006
 snack+0.006
 pharmacy+0.006
 procedure+0.006
 Sure+0.006
Neg logit contributions
spe-0.013
ull-0.011
ho-0.011
work-0.011
 peac-0.011
TokenAfter
Token ID3260
Feature activation-6.07e-01

Pos logit contributions
 Sure+0.012
 snack+0.011
 Care+0.011
 minutes+0.011
 pharmacy+0.011
Neg logit contributions
spe-0.027
work-0.025
ho-0.024
 peac-0.024
 prope-0.023
Token The
Token ID383
Feature activation-7.06e-01

Pos logit contributions
 Sure+0.009
 Care+0.009
 As+0.009
 Dr+0.008
 While+0.008
Neg logit contributions
spe-0.020
work-0.020
ho-0.020
 peac-0.019
 prope-0.019
Token boot
Token ID6297
Feature activation-2.25e-01

Pos logit contributions
 Care+0.016
 Sure+0.015
 snack+0.015
 minutes+0.015
 spaghetti+0.015
Neg logit contributions
spe-0.030
work-0.029
ho-0.027
 prope-0.027
 peac-0.027
Token returned
Token ID4504
Feature activation+4.12e-01

Pos logit contributions
!+0.004
 Care+0.004
 snack+0.004
 chair+0.003
 Sure+0.003
Neg logit contributions
work-0.010
ateful-0.010
spe-0.010
 narrative-0.010
 prope-0.010
Token to
Token ID284
Feature activation-3.66e-01

Pos logit contributions
spe+0.021
work+0.020
 narrative+0.020
 prope+0.020
 peac+0.020
Neg logit contributions
!-0.006
 Sure-0.006
 Care-0.006
 while-0.005
 As-0.005
Token its
Token ID663
Feature activation+2.82e-01

Pos logit contributions
 Care+0.008
 Sure+0.008
 snack+0.008
!+0.008
 spaghetti+0.008
Neg logit contributions
spe-0.016
work-0.015
ho-0.015
 prope-0.014
 peac-0.014
Token rightful
Token ID43133
Feature activation+3.13e+00

Pos logit contributions
spe+0.012
work+0.012
ho+0.011
 peac+0.011
 prope+0.011
Neg logit contributions
 Care-0.006
 Sure-0.006
 snack-0.006
 minutes-0.005
 spaghetti-0.005
Token owners
Token ID4393
Feature activation-8.45e-01

Pos logit contributions
spe+0.120
work+0.111
ho+0.108
 peac+0.108
 prope+0.106
Neg logit contributions
 Sure-0.078
 Care-0.077
 spaghetti-0.077
 snack-0.076
 Dr-0.075
Token and
Token ID290
Feature activation+2.92e-01

Pos logit contributions
!+0.012
 snack+0.010
 Care+0.010
 while+0.010
 Sure+0.010
Neg logit contributions
spe-0.043
work-0.041
 prope-0.041
 peac-0.041
ho-0.040
Token the
Token ID262
Feature activation-3.02e-01

Pos logit contributions
spe+0.013
 prope+0.013
work+0.012
ho+0.012
 narrative+0.012
Neg logit contributions
!-0.006
 snack-0.006
 bed-0.005
 Care-0.005
 rest-0.005
Token conflict
Token ID5358
Feature activation+7.34e-01

Pos logit contributions
 Care+0.006
 snack+0.006
 Sure+0.006
 minutes+0.006
!+0.006
Neg logit contributions
spe-0.013
work-0.012
ho-0.012
 prope-0.012
 peac-0.011
Token was
Token ID373
Feature activation+2.17e-02

Pos logit contributions
 prope+0.034
 peac+0.034
ho+0.033
spe+0.033
work+0.032
Neg logit contributions
!-0.014
 snack-0.012
 Care-0.011
 while-0.011
 Sure-0.011
Token I
Token ID314
Feature activation-6.69e-01

Pos logit contributions
 minutes+0.003
 snack+0.003
 Care+0.003
 Sure+0.003
 spaghetti+0.003
Neg logit contributions
spe-0.008
work-0.007
ho-0.007
 peac-0.007
 prope-0.007
Token am
Token ID716
Feature activation+7.70e-01

Pos logit contributions
!+0.013
 snack+0.013
 Care+0.012
 rest+0.012
 Sure+0.012
Neg logit contributions
spe-0.030
work-0.028
 prope-0.028
ho-0.027
 peac-0.027
Token already
Token ID1541
Feature activation-8.70e-01

Pos logit contributions
spe+0.036
ho+0.033
work+0.033
 prope+0.032
 narrative+0.031
Neg logit contributions
 Dr-0.015
!-0.015
 Care-0.015
 snack-0.014
 spaghetti-0.014
Token married
Token ID6405
Feature activation+1.21e-01

Pos logit contributions
!+0.020
 snack+0.018
 Care+0.018
 while+0.018
 quick+0.018
Neg logit contributions
spe-0.037
ho-0.035
 prope-0.034
work-0.034
 narrative-0.034
Token to
Token ID284
Feature activation-9.03e-02

Pos logit contributions
spe+0.006
work+0.006
ho+0.006
 peac+0.006
 prope+0.005
Neg logit contributions
 Care-0.002
 Sure-0.002
 snack-0.002
 minutes-0.002
 spaghetti-0.002
Token Mrs
Token ID9074
Feature activation+3.09e+00

Pos logit contributions
 Care+0.001
 snack+0.001
!+0.001
 Dr+0.001
 Sure+0.001
Neg logit contributions
spe-0.004
work-0.004
ho-0.004
 prope-0.004
 peac-0.004
Token.
Token ID13
Feature activation+2.86e-01

Pos logit contributions
spe+0.174
work+0.164
 prope+0.162
ho+0.161
 peac+0.156
Neg logit contributions
!-0.026
 Care-0.021
 Dr-0.021
 snack-0.021
 Sure-0.020
Token Swan
Token ID21491
Feature activation-7.81e-01

Pos logit contributions
spe+0.009
work+0.009
ho+0.008
 prope+0.008
 peac+0.008
Neg logit contributions
 Care-0.009
 Sure-0.009
!-0.008
 snack-0.008
 As-0.008
Token."
Token ID526
Feature activation+1.07e-01

Pos logit contributions
!+0.009
 snack+0.008
 Care+0.008
 Sure+0.007
 Dr+0.007
Neg logit contributions
spe-0.041
work-0.040
 prope-0.039
ho-0.038
 peac-0.038
Token
Token ID220
Feature activation-1.67e-01

Pos logit contributions
spe+0.004
work+0.004
 peac+0.004
ho+0.004
melon+0.004
Neg logit contributions
 minutes-0.002
 wanna-0.002
Michael-0.002
 Care-0.002
 snowball-0.002
Token
Token ID198
Feature activation+4.41e-01

Pos logit contributions
 minutes+0.003
 snack+0.003
 spaghetti+0.003
 pharmacy+0.003
 procedure+0.003
Neg logit contributions
spe-0.007
work-0.007
ho-0.007
 peac-0.006
ull-0.006
Token,
Token ID11
Feature activation+2.36e-01

Pos logit contributions
 Care+0.013
 Sure+0.013
 snack+0.012
!+0.012
 spaghetti+0.012
Neg logit contributions
spe-0.038
work-0.036
ho-0.035
 peac-0.035
 prope-0.035
Token he
Token ID339
Feature activation-8.11e-01

Pos logit contributions
spe+0.012
work+0.012
ho+0.012
 prope+0.011
 peac+0.011
Neg logit contributions
 Care-0.003
 snack-0.003
 Sure-0.003
 spaghetti-0.003
 Dr-0.003
Token accidentally
Token ID14716
Feature activation+1.95e-01

Pos logit contributions
!+0.012
 spaghetti+0.012
 Care+0.011
 minutes+0.011
 while+0.011
Neg logit contributions
spe-0.040
work-0.039
 prope-0.037
ho-0.037
 peac-0.036
Token knocked
Token ID13642
Feature activation-6.72e-01

Pos logit contributions
spe+0.007
work+0.007
ho+0.007
 peac+0.007
 prope+0.006
Neg logit contributions
 Care-0.005
 Sure-0.005
 snack-0.005
 spaghetti-0.005
!-0.005
Token over
Token ID625
Feature activation+3.66e-01

Pos logit contributions
!+0.017
 Care+0.017
 spaghetti+0.017
 snack+0.016
 As+0.016
Neg logit contributions
spe-0.027
work-0.025
 peac-0.025
 prope-0.025
ho-0.024
Token Mr
Token ID1770
Feature activation+3.06e+00

Pos logit contributions
spe+0.018
work+0.017
ho+0.017
 peac+0.016
 prope+0.016
Neg logit contributions
 Care-0.006
 Sure-0.005
 snack-0.005
 spaghetti-0.005
!-0.005
Token.
Token ID13
Feature activation+1.43e+00

Pos logit contributions
work+0.156
spe+0.155
ho+0.148
 peac+0.146
 prope+0.143
Neg logit contributions
 Care-0.039
 Sure-0.037
 minutes-0.036
 snack-0.034
 As-0.033
Token Smith
Token ID4176
Feature activation-2.38e-02

Pos logit contributions
spe+0.052
work+0.049
ho+0.048
 peac+0.047
 prope+0.046
Neg logit contributions
 Care-0.039
 Sure-0.039
 snack-0.038
 spaghetti-0.037
!-0.037
Token's
Token ID338
Feature activation+6.92e-01

Pos logit contributions
!+0.000
 Care+0.000
 snack+0.000
 spaghetti+0.000
 Sure+0.000
Neg logit contributions
spe-0.001
work-0.001
ho-0.001
 peac-0.001
 prope-0.001
Token drink
Token ID4144
Feature activation+9.41e-01

Pos logit contributions
spe+0.031
work+0.030
 peac+0.029
ho+0.029
 prope+0.028
Neg logit contributions
 snack-0.013
 spaghetti-0.013
 chair-0.013
!-0.013
 Care-0.013
Token that
Token ID326
Feature activation+4.89e-01

Pos logit contributions
spe+0.048
 peac+0.045
work+0.045
 prope+0.044
ho+0.042
Neg logit contributions
!-0.018
 spaghetti-0.017
 snack-0.015
 while-0.015
 Care-0.014
Token it
Token ID340
Feature activation+2.52e-01

Pos logit contributions
 Care+0.007
 Sure+0.007
 snack+0.006
 minutes+0.006
 pharmacy+0.006
Neg logit contributions
spe-0.014
work-0.014
ho-0.013
 peac-0.013
 prope-0.013
Token?
Token ID30
Feature activation-1.60e-02

Pos logit contributions
spe+0.013
work+0.013
ho+0.012
 prope+0.012
 peac+0.012
Neg logit contributions
 Care-0.003
!-0.003
 snack-0.003
 Sure-0.003
 spaghetti-0.003
Token What
Token ID1867
Feature activation+9.94e-01

Pos logit contributions
 minutes+0.000
 snack+0.000
 Care+0.000
 spaghetti+0.000
!+0.000
Neg logit contributions
spe-0.001
work-0.001
ho-0.001
 peac-0.001
 prope-0.001
Token if
Token ID611
Feature activation-2.20e-01

Pos logit contributions
spe+0.041
work+0.039
ho+0.038
 peac+0.037
 prope+0.037
Neg logit contributions
 Care-0.023
 snack-0.022
 Sure-0.022
 spaghetti-0.022
!-0.022
Token her
Token ID607
Feature activation+3.24e-02

Pos logit contributions
 snack+0.003
 Care+0.003
 spaghetti+0.003
!+0.003
 Sure+0.003
Neg logit contributions
spe-0.011
work-0.011
 prope-0.011
ho-0.010
 peac-0.010
Token poems
Token ID31888
Feature activation+3.04e+00

Pos logit contributions
spe+0.001
work+0.001
 peac+0.001
ho+0.001
 prope+0.001
Neg logit contributions
 Sure-0.001
 Care-0.001
 clearing-0.001
 As-0.001
 While-0.001
Token were
Token ID547
Feature activation+2.43e-01

Pos logit contributions
spe+0.144
 prope+0.140
 peac+0.135
work+0.130
ho+0.123
Neg logit contributions
!-0.066
 rest-0.059
 snack-0.057
 nap-0.057
 while-0.054
Token gone
Token ID3750
Feature activation+6.67e-01

Pos logit contributions
spe+0.012
ho+0.011
work+0.011
 prope+0.011
 peac+0.011
Neg logit contributions
!-0.005
 spaghetti-0.005
 snack-0.004
 minutes-0.004
 Dr-0.004
Token?
Token ID30
Feature activation-3.12e-02

Pos logit contributions
spe+0.035
work+0.034
ho+0.033
 peac+0.033
 prope+0.033
Neg logit contributions
!-0.007
 Care-0.007
 Sure-0.007
 snack-0.007
 spaghetti-0.007
Token
Token ID198
Feature activation+1.19e-01

Pos logit contributions
 minutes+0.000
 snack+0.000
 Care+0.000
 spaghetti+0.000
 Sure+0.000
Neg logit contributions
spe-0.002
work-0.002
ho-0.002
 peac-0.002
 prope-0.002
Token
Token ID198
Feature activation+6.83e-02

Pos logit contributions
spe+0.006
ho+0.006
work+0.005
 peac+0.005
 prope+0.005
Neg logit contributions
 snack-0.002
 minutes-0.002
 spaghetti-0.001
 pharmacy-0.001
 Care-0.001
Token,
Token ID11
Feature activation+1.78e-01

Pos logit contributions
 Care+0.024
 Sure+0.024
 snack+0.023
!+0.022
 minutes+0.022
Neg logit contributions
spe-0.078
work-0.076
ho-0.073
 peac-0.073
 prope-0.072
Token Sara
Token ID24799
Feature activation+2.16e-02

Pos logit contributions
spe+0.009
work+0.009
 peac+0.008
 prope+0.008
ho+0.008
Neg logit contributions
 Care-0.003
 Sure-0.002
 snack-0.002
 Dr-0.002
!-0.002
Token knew
Token ID2993
Feature activation-1.30e-01

Pos logit contributions
spe+0.001
work+0.001
 prope+0.001
ho+0.001
 peac+0.001
Neg logit contributions
!-0.000
 Care-0.000
 minutes-0.000
 spaghetti-0.000
As-0.000
Token why
Token ID1521
Feature activation+1.23e+00

Pos logit contributions
 Care+0.002
!+0.002
 Sure+0.002
 snack+0.002
 minutes+0.002
Neg logit contributions
spe-0.007
work-0.006
ho-0.006
 prope-0.006
 peac-0.006
Token the
Token ID262
Feature activation+7.12e-01

Pos logit contributions
spe+0.067
work+0.066
ho+0.063
 prope+0.063
 peac+0.062
Neg logit contributions
!-0.014
 snack-0.012
 Care-0.011
 Dr-0.011
 spaghetti-0.011
Token bl
Token ID698
Feature activation+3.04e+00

Pos logit contributions
spe+0.025
work+0.024
ho+0.023
 peac+0.022
 prope+0.022
Neg logit contributions
 Care-0.021
 snack-0.020
!-0.020
 Sure-0.020
 spaghetti-0.020
Tokenouse
Token ID1076
Feature activation+9.00e-01

Pos logit contributions
spe+0.103
work+0.098
ho+0.094
 peac+0.092
 prope+0.090
Neg logit contributions
 Care-0.090
 Sure-0.089
 snack-0.088
 minutes-0.087
 spaghetti-0.087
Token was
Token ID373
Feature activation+7.99e-01

Pos logit contributions
spe+0.045
 prope+0.045
 peac+0.044
work+0.043
ho+0.042
Neg logit contributions
!-0.014
 snack-0.012
 chair-0.011
 rest-0.010
 Care-0.010
Token so
Token ID523
Feature activation-1.91e-01

Pos logit contributions
spe+0.040
ho+0.038
work+0.037
 prope+0.035
 peac+0.035
Neg logit contributions
!-0.017
 spaghetti-0.014
 snack-0.014
 bed-0.013
 while-0.013
Token enormous
Token ID9812
Feature activation-3.83e-01

Pos logit contributions
 Care+0.003
!+0.003
 quick+0.003
 spaghetti+0.003
 snack+0.003
Neg logit contributions
work-0.010
spe-0.009
 prope-0.009
 peac-0.009
ho-0.009
Token -
Token ID532
Feature activation+7.30e-01

Pos logit contributions
 Care+0.006
 Sure+0.006
 snack+0.006
 pharmacy+0.005
 As+0.005
Neg logit contributions
spe-0.018
work-0.018
ho-0.017
 peac-0.017
 prope-0.017
Token He
Token ID679
Feature activation+2.98e-01

Pos logit contributions
spe+0.003
work+0.003
ho+0.003
melon+0.003
 peac+0.003
Neg logit contributions
 minutes-0.002
 snack-0.002
 Care-0.002
 snowball-0.002
!-0.002
Token felt
Token ID2936
Feature activation+8.22e-01

Pos logit contributions
work+0.014
spe+0.014
 prope+0.013
 narrative+0.013
ho+0.012
Neg logit contributions
!-0.005
As-0.005
 Care-0.005
 Dr-0.005
 walks-0.005
Token bad
Token ID2089
Feature activation+7.95e-01

Pos logit contributions
spe+0.034
work+0.032
ho+0.031
 peac+0.030
 prope+0.030
Neg logit contributions
 Care-0.020
 snack-0.019
 spaghetti-0.019
 Sure-0.019
!-0.018
Token for
Token ID329
Feature activation+1.09e+00

Pos logit contributions
spe+0.039
work+0.038
ho+0.036
 peac+0.036
 prope+0.036
Neg logit contributions
 Care-0.012
 Sure-0.012
 snack-0.011
!-0.011
 minutes-0.011
Token making
Token ID1642
Feature activation+4.49e-01

Pos logit contributions
spe+0.047
work+0.045
ho+0.043
 peac+0.043
 prope+0.042
Neg logit contributions
 Care-0.022
 Sure-0.022
 snack-0.021
 minutes-0.021
 spaghetti-0.021
Token Mr
Token ID1770
Feature activation+3.04e+00

Pos logit contributions
spe+0.020
work+0.019
ho+0.019
 peac+0.018
 prope+0.018
Neg logit contributions
 Care-0.009
 Sure-0.008
 snack-0.008
 minutes-0.008
 spaghetti-0.008
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
work+0.134
spe+0.130
ho+0.124
fly+0.120
 peac+0.120
Neg logit contributions
 Care-0.062
 Sure-0.060
 minutes-0.057
 Ready-0.053
When-0.053
Token Bear
Token ID14732
Feature activation+1.19e-01

Pos logit contributions
work+0.037
spe+0.035
 peac+0.033
 prope+0.033
ho+0.033
Neg logit contributions
 minutes-0.027
 Care-0.026
As-0.025
 Sure-0.025
 snack-0.025
Token feel
Token ID1254
Feature activation+5.40e-01

Pos logit contributions
 prope+0.006
spe+0.006
ho+0.006
 narrative+0.006
 peac+0.006
Neg logit contributions
!-0.002
 snack-0.002
 spaghetti-0.001
 while-0.001
 rest-0.001
Token not
Token ID407
Feature activation-2.62e-01

Pos logit contributions
spe+0.023
work+0.022
ho+0.021
 peac+0.020
 prope+0.020
Neg logit contributions
!-0.013
 Care-0.013
 spaghetti-0.012
 Dr-0.012
 snack-0.012
Token good
Token ID922
Feature activation+2.47e-01

Pos logit contributions
!+0.006
 while+0.005
 snack+0.005
 minutes+0.005
 spaghetti+0.005
Neg logit contributions
ho-0.012
spe-0.012
work-0.011
 prope-0.011
 narrative-0.010
Token wonderful
Token ID7932
Feature activation+1.03e+00

Pos logit contributions
spe+0.033
work+0.031
ho+0.030
 peac+0.029
 prope+0.029
Neg logit contributions
 Care-0.017
 snack-0.017
 spaghetti-0.017
 minutes-0.017
!-0.017
Token colours
Token ID18915
Feature activation+2.89e-01

Pos logit contributions
spe+0.039
work+0.038
 peac+0.036
ho+0.036
 prope+0.036
Neg logit contributions
 Sure-0.026
 Care-0.026
 pharmacy-0.025
 As-0.025
 Dr-0.025
Token of
Token ID286
Feature activation+9.65e-01

Pos logit contributions
spe+0.016
 narrative+0.016
 prope+0.016
ateful+0.016
ho+0.015
Neg logit contributions
!-0.004
 while-0.003
.-0.003
 snack-0.002
 spaghetti-0.002
Token the
Token ID262
Feature activation+9.14e-01

Pos logit contributions
spe+0.049
work+0.044
ho+0.044
edient+0.042
 narrative+0.041
Neg logit contributions
 spaghetti-0.019
 snack-0.018
!-0.018
 bed-0.018
 Care-0.017
Token animals
Token ID4695
Feature activation-2.34e-02

Pos logit contributions
spe+0.039
work+0.036
ho+0.035
 prope+0.034
 peac+0.033
Neg logit contributions
 snack-0.021
 Care-0.021
 spaghetti-0.021
 minutes-0.020
!-0.020
Token'
Token ID6
Feature activation+3.03e+00

Pos logit contributions
!+0.000
 while+0.000
.+0.000
 rest+0.000
 spaghetti+0.000
Neg logit contributions
spe-0.001
 prope-0.001
 narrative-0.001
ateful-0.001
ho-0.001
Token marks
Token ID8849
Feature activation+4.41e-01

Pos logit contributions
spe+0.129
work+0.121
ho+0.118
 prope+0.117
 peac+0.114
Neg logit contributions
 Care-0.066
 snack-0.065
 spaghetti-0.064
 minutes-0.064
 Sure-0.064
Token.
Token ID13
Feature activation-2.34e-01

Pos logit contributions
spe+0.024
 prope+0.022
ho+0.022
 peac+0.022
ateful+0.022
Neg logit contributions
!-0.007
 while-0.005
.-0.005
 spaghetti-0.005
 snack-0.004
Token<|endoftext|>
Token ID50256
Feature activation-5.42e-01

Pos logit contributions
 Care+0.003
 snack+0.003
 minutes+0.003
 Sure+0.003
 pharmacy+0.003
Neg logit contributions
spe-0.012
work-0.011
ho-0.011
 peac-0.011
 prope-0.010
TokenJack
Token ID14295
Feature activation+8.90e-01

Pos logit contributions
 pharmacy+0.012
 minutes+0.012
ittens+0.012
 Deal+0.012
 procedure+0.012
Neg logit contributions
ull-0.020
work-0.019
spe-0.019
ho-0.018
 peac-0.017
Token went
Token ID1816
Feature activation+1.89e-01

Pos logit contributions
spe+0.040
work+0.038
 prope+0.036
 peac+0.036
ho+0.036
Neg logit contributions
!-0.018
 Care-0.017
 spaghetti-0.017
 Sure-0.017
 snack-0.017
Token her
Token ID607
Feature activation-2.15e-01

Pos logit contributions
spe+0.009
work+0.009
 peac+0.008
ho+0.008
 narrative+0.008
Neg logit contributions
 Care-0.003
 snack-0.003
 spaghetti-0.002
 while-0.002
!-0.002
Token strong
Token ID1913
Feature activation+2.45e-01

Pos logit contributions
 snack+0.004
 Care+0.004
 Sure+0.004
!+0.004
 minutes+0.004
Neg logit contributions
spe-0.010
work-0.009
ho-0.009
 peac-0.009
 prope-0.009
Token teeth
Token ID9941
Feature activation-4.53e-01

Pos logit contributions
spe+0.011
work+0.011
ho+0.010
 peac+0.010
 prope+0.010
Neg logit contributions
 snack-0.005
 Care-0.005
!-0.005
 Sure-0.005
 spaghetti-0.005
Token to
Token ID284
Feature activation-2.31e-01

Pos logit contributions
!+0.006
 while+0.005
 Care+0.005
 As+0.005
 snack+0.004
Neg logit contributions
spe-0.025
work-0.024
 peac-0.023
 narrative-0.023
 prope-0.023
Token stitch
Token ID24695
Feature activation-4.45e-01

Pos logit contributions
 snack+0.004
!+0.004
 Care+0.004
 rest+0.004
 spaghetti+0.004
Neg logit contributions
spe-0.011
work-0.011
ho-0.010
 peac-0.010
 narrative-0.010
Token Mr
Token ID1770
Feature activation+3.03e+00

Pos logit contributions
!+0.008
 while+0.007
 snack+0.007
 spaghetti+0.007
 Care+0.007
Neg logit contributions
spe-0.022
 peac-0.021
work-0.021
 narrative-0.021
 prope-0.020
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
spe+0.147
work+0.147
ho+0.146
 peac+0.141
 prope+0.137
Neg logit contributions
 Care-0.044
 Sure-0.041
 minutes-0.038
 snack-0.037
 spaghetti-0.037
Token Snow
Token ID7967
Feature activation+5.74e-01

Pos logit contributions
spe+0.045
ho+0.044
work+0.044
 peac+0.044
 prope+0.042
Neg logit contributions
 Care-0.033
 minutes-0.033
 Sure-0.032
 spaghetti-0.031
 snack-0.031
Tokenman
Token ID805
Feature activation-9.49e-01

Pos logit contributions
spe+0.021
work+0.019
 prope+0.019
 peac+0.018
ho+0.018
Neg logit contributions
!-0.016
 snack-0.016
 Sure-0.015
 Care-0.015
 spaghetti-0.015
Token back
Token ID736
Feature activation+3.33e-02

Pos logit contributions
!+0.016
 snack+0.015
 Care+0.014
 while+0.014
 Sure+0.014
Neg logit contributions
spe-0.046
work-0.044
 peac-0.043
 prope-0.043
 narrative-0.043
Token together
Token ID1978
Feature activation+4.89e-01

Pos logit contributions
spe+0.002
 peac+0.002
work+0.002
 narrative+0.002
 prope+0.002
Neg logit contributions
!-0.000
 while-0.000
 Care-0.000
 snack-0.000
 Sure-0.000
Token dust
Token ID8977
Feature activation+1.67e-01

Pos logit contributions
spe+0.054
work+0.053
 narrative+0.047
ho+0.046
 peac+0.045
Neg logit contributions
 spaghetti-0.019
 snack-0.018
!-0.017
 lunch-0.017
 quick-0.016
Token and
Token ID290
Feature activation-1.94e-01

Pos logit contributions
spe+0.010
 peac+0.010
 narrative+0.009
ateful+0.009
work+0.009
Neg logit contributions
!-0.002
.-0.001
 while-0.001
 spaghetti-0.001
 chair-0.001
Token dirt
Token ID13647
Feature activation-2.70e-01

Pos logit contributions
 spaghetti+0.004
 minutes+0.004
 snack+0.004
 while+0.004
!+0.004
Neg logit contributions
spe-0.009
work-0.009
ho-0.008
 narrative-0.008
 peac-0.008
Token.
Token ID13
Feature activation-1.98e-02

Pos logit contributions
!+0.002
 while+0.001
 snack+0.001
 minutes+0.001
 spaghetti+0.001
Neg logit contributions
spe-0.016
work-0.016
 peac-0.015
 prope-0.015
 narrative-0.015
Token The
Token ID383
Feature activation-7.72e-02

Pos logit contributions
 Care+0.000
 Sure+0.000
 As+0.000
 spaghetti+0.000
 snack+0.000
Neg logit contributions
spe-0.001
work-0.001
ho-0.001
 peac-0.001
 prope-0.001
Token st
Token ID336
Feature activation+3.02e+00

Pos logit contributions
 Care+0.002
 Sure+0.002
 snack+0.002
 minutes+0.002
 spaghetti+0.002
Neg logit contributions
spe-0.003
work-0.003
ho-0.003
 peac-0.003
 prope-0.003
Tokenalks
Token ID23833
Feature activation+1.24e+00

Pos logit contributions
spe+0.094
work+0.089
 peac+0.083
ho+0.082
 prope+0.082
Neg logit contributions
 Sure-0.098
 Care-0.098
 As-0.096
!-0.095
 snack-0.094
Token were
Token ID547
Feature activation-4.01e-01

Pos logit contributions
work+0.056
spe+0.054
 prope+0.053
 peac+0.053
ateful+0.051
Neg logit contributions
!-0.025
 while-0.022
As-0.021
 rest-0.021
-0.021
Token hidden
Token ID7104
Feature activation-4.68e-02

Pos logit contributions
 quick+0.007
 while+0.006
 minutes+0.006
!+0.006
 snack+0.006
Neg logit contributions
spe-0.021
work-0.020
 narrative-0.019
ho-0.019
 peac-0.018
Token and
Token ID290
Feature activation-4.39e-01

Pos logit contributions
!+0.001
 minutes+0.001
 while+0.001
 Care+0.001
 spaghetti+0.001
Neg logit contributions
spe-0.002
work-0.002
 peac-0.002
 prope-0.002
ho-0.002
Token the
Token ID262
Feature activation+2.91e-01

Pos logit contributions
!+0.008
 while+0.007
 minutes+0.007
 quick+0.007
 spaghetti+0.007
Neg logit contributions
spe-0.021
work-0.021
 narrative-0.020
 peac-0.019
 prope-0.019
Token others
Token ID1854
Feature activation+1.20e+00

Pos logit contributions
spe+0.056
 prope+0.053
work+0.052
ho+0.051
 peac+0.049
Neg logit contributions
!-0.028
 rest-0.028
 snack-0.027
 Care-0.027
 while-0.027
Token made
Token ID925
Feature activation+9.89e-01

Pos logit contributions
spe+0.059
ho+0.056
 prope+0.055
 peac+0.055
work+0.053
Neg logit contributions
!-0.021
 Care-0.019
 while-0.018
 rest-0.018
 As-0.017
Token him
Token ID683
Feature activation+9.09e-01

Pos logit contributions
spe+0.043
work+0.041
ho+0.040
 peac+0.039
 prope+0.039
Neg logit contributions
 Care-0.020
 Sure-0.020
 snack-0.019
 minutes-0.019
 spaghetti-0.018
Token feel
Token ID1254
Feature activation+1.60e+00

Pos logit contributions
spe+0.043
ho+0.042
work+0.041
 peac+0.039
 narrative+0.039
Neg logit contributions
!-0.017
 snack-0.017
 while-0.017
 rest-0.016
 quick-0.016
Token less
Token ID1342
Feature activation+1.38e+00

Pos logit contributions
spe+0.080
work+0.077
ho+0.076
 peac+0.071
 prope+0.070
Neg logit contributions
 Care-0.027
!-0.027
 while-0.025
 quick-0.025
 spaghetti-0.024
Token gr
Token ID1036
Feature activation+3.01e+00

Pos logit contributions
ho+0.059
spe+0.059
work+0.057
 peac+0.053
melon+0.050
Neg logit contributions
!-0.034
 minutes-0.032
 while-0.032
 quick-0.031
 spaghetti-0.029
Tokenumpy
Token ID32152
Feature activation+1.51e+00

Pos logit contributions
 prope+0.115
 peac+0.113
ho+0.111
spe+0.108
 narrative+0.100
Neg logit contributions
!-0.074
care-0.066
 As-0.064
.-0.063
 while-0.060
Token.
Token ID13
Feature activation+4.21e-01

Pos logit contributions
spe+0.085
ho+0.082
 peac+0.081
work+0.080
 prope+0.079
Neg logit contributions
!-0.016
 while-0.012
 As-0.009
 minutes-0.009
 Care-0.009
Token The
Token ID383
Feature activation+9.69e-01

Pos logit contributions
spe+0.019
work+0.018
ho+0.017
 peac+0.017
 prope+0.017
Neg logit contributions
 Care-0.008
 Sure-0.008
 snack-0.008
 spaghetti-0.008
 As-0.008
Token moral
Token ID6573
Feature activation+2.64e-01

Pos logit contributions
spe+0.043
work+0.041
ho+0.040
fly+0.037
 prope+0.037
Neg logit contributions
 rest-0.023
 snack-0.021
 nap-0.020
 quick-0.020
 lunch-0.020
Token of
Token ID286
Feature activation+7.76e-01

Pos logit contributions
spe+0.013
work+0.012
ho+0.012
 prope+0.012
 peac+0.012
Neg logit contributions
 Care-0.004
 Sure-0.004
 snack-0.004
!-0.004
 As-0.004
Token He
Token ID679
Feature activation-3.71e-01

Pos logit contributions
spe+0.000
work+0.000
ho+0.000
 peac+0.000
 prope+0.000
Neg logit contributions
 minutes-0.000
 Care-0.000
 snack-0.000
 Sure-0.000
 pharmacy-0.000
Token always
Token ID1464
Feature activation+1.19e+00

Pos logit contributions
!+0.007
 walks+0.006
As+0.006
+0.006
 snowball+0.005
Neg logit contributions
work-0.017
spe-0.016
ateful-0.016
 prope-0.016
 peac-0.015
Token remembered
Token ID12086
Feature activation+7.45e-01

Pos logit contributions
spe+0.052
work+0.050
ho+0.049
 prope+0.048
 peac+0.046
Neg logit contributions
!-0.026
 minutes-0.024
 Care-0.023
 snack-0.022
 Sure-0.022
Token how
Token ID703
Feature activation+1.48e+00

Pos logit contributions
spe+0.038
work+0.037
ho+0.036
 prope+0.036
 peac+0.035
Neg logit contributions
!-0.011
 spaghetti-0.010
 snack-0.010
 while-0.010
 Care-0.010
Token polite
Token ID23507
Feature activation+1.96e+00

Pos logit contributions
work+0.070
spe+0.070
ho+0.067
 prope+0.067
 peac+0.066
Neg logit contributions
 Care-0.023
!-0.023
 Sure-0.021
 snack-0.021
 spaghetti-0.021
Token Mr
Token ID1770
Feature activation+3.01e+00

Pos logit contributions
spe+0.107
work+0.103
 prope+0.101
ho+0.100
 peac+0.100
Neg logit contributions
!-0.025
 while-0.023
 spaghetti-0.020
 snack-0.019
 Dr-0.016
Token.
Token ID13
Feature activation+6.41e-01

Pos logit contributions
spe+0.168
work+0.158
 prope+0.158
ho+0.158
 peac+0.156
Neg logit contributions
!-0.026
 snack-0.024
 Care-0.023
 Sure-0.023
 spaghetti-0.022
Token Smith
Token ID4176
Feature activation-1.37e-01

Pos logit contributions
spe+0.025
work+0.024
ho+0.022
 peac+0.022
 prope+0.022
Neg logit contributions
 Care-0.016
 Sure-0.016
 minutes-0.015
 snack-0.015
 spaghetti-0.015
Token was
Token ID373
Feature activation+1.04e+00

Pos logit contributions
!+0.002
 while+0.002
 snack+0.002
 spaghetti+0.002
 chair+0.002
Neg logit contributions
 prope-0.007
spe-0.007
 peac-0.007
 narrative-0.006
ho-0.006
Token and
Token ID290
Feature activation+1.38e-01

Pos logit contributions
spe+0.050
ho+0.050
 prope+0.048
work+0.047
 peac+0.045
Neg logit contributions
!-0.021
 while-0.017
 Care-0.017
 spaghetti-0.017
 minutes-0.017
Token how
Token ID703
Feature activation+1.70e+00

Pos logit contributions
spe+0.006
work+0.006
 narrative+0.006
 prope+0.006
ho+0.006
Neg logit contributions
!-0.002
 while-0.002
 Care-0.002
 snack-0.002
 spaghetti-0.002
Token king
Token ID5822
Feature activation+6.51e-02

Pos logit contributions
spe+0.008
work+0.007
ho+0.007
 peac+0.007
 prope+0.007
Neg logit contributions
 Care-0.010
 Sure-0.010
 snack-0.010
 minutes-0.010
 spaghetti-0.010
Token ate
Token ID15063
Feature activation-3.08e-01

Pos logit contributions
 prope+0.003
 narrative+0.003
work+0.003
 peac+0.003
spe+0.003
Neg logit contributions
!-0.001
 while-0.001
As-0.001
 chair-0.001
 snack-0.001
Token too
Token ID1165
Feature activation-4.14e-01

Pos logit contributions
 spaghetti+0.005
 while+0.005
 snack+0.005
 Dr+0.005
!+0.005
Neg logit contributions
spe-0.016
work-0.015
 peac-0.015
 prope-0.014
edient-0.014
Token many
Token ID867
Feature activation-5.20e-01

Pos logit contributions
 Sure+0.010
 Care+0.009
 pharmacy+0.009
 snack+0.009
 minutes+0.009
Neg logit contributions
spe-0.016
work-0.016
ho-0.015
 peac-0.015
 prope-0.015
Token ch
Token ID442
Feature activation+4.76e-01

Pos logit contributions
 Sure+0.013
 Care+0.012
 Deal+0.012
 As+0.011
 pharmacy+0.011
Neg logit contributions
spe-0.020
work-0.019
 prope-0.018
 peac-0.018
ho-0.018
Tokenocol
Token ID4668
Feature activation+3.00e+00

Pos logit contributions
spe+0.020
work+0.019
 peac+0.018
 prope+0.018
ho+0.017
Neg logit contributions
ittens-0.011
 minutes-0.011
!-0.010
 spaghetti-0.010
 walks-0.010
Tokenates
Token ID689
Feature activation-4.50e-01

Pos logit contributions
spe+0.134
 prope+0.119
 peac+0.118
work+0.114
ho+0.113
Neg logit contributions
!-0.059
ittens-0.057
 Care-0.057
 spaghetti-0.057
 snack-0.056
Token and
Token ID290
Feature activation-3.64e-01

Pos logit contributions
!+0.005
 while+0.004
 snack+0.003
 spaghetti+0.003
.+0.003
Neg logit contributions
spe-0.027
 prope-0.026
 peac-0.026
 narrative-0.026
ateful-0.024
Token his
Token ID465
Feature activation+3.97e-02

Pos logit contributions
 snack+0.008
 spaghetti+0.008
 while+0.008
!+0.008
 minutes+0.008
Neg logit contributions
spe-0.016
work-0.015
ho-0.015
 prope-0.014
 peac-0.014
Token tum
Token ID11814
Feature activation+8.79e-01

Pos logit contributions
spe+0.002
work+0.002
ho+0.002
 peac+0.002
 prope+0.002
Neg logit contributions
 snack-0.001
 Care-0.001
 Sure-0.001
!-0.001
 spaghetti-0.001
Tokenmy
Token ID1820
Feature activation+2.59e-02

Pos logit contributions
spe+0.043
work+0.042
ho+0.041
 peac+0.040
 prope+0.040
Neg logit contributions
 Care-0.012
 Sure-0.012
 snack-0.012
!-0.011
 spaghetti-0.011
Token James
Token ID3700
Feature activation+4.69e-01

Pos logit contributions
 snack+0.006
 minutes+0.005
 spaghetti+0.005
 pharmacy+0.005
ittens+0.005
Neg logit contributions
spe-0.015
work-0.014
ho-0.013
melon-0.013
 prope-0.012
Token wanted
Token ID2227
Feature activation+1.50e-01

Pos logit contributions
work+0.022
 prope+0.022
 peac+0.021
spe+0.021
ateful+0.021
Neg logit contributions
!-0.008
 while-0.007
As-0.007
 As-0.007
 quick-0.007
Token to
Token ID284
Feature activation-3.58e-01

Pos logit contributions
spe+0.007
work+0.007
ho+0.007
 prope+0.007
 peac+0.007
Neg logit contributions
 Care-0.002
 Sure-0.002
 snack-0.002
 spaghetti-0.002
!-0.002
Token enjoy
Token ID2883
Feature activation+7.30e-01

Pos logit contributions
 Care+0.006
 snack+0.006
!+0.006
 Sure+0.006
 As+0.005
Neg logit contributions
spe-0.017
work-0.016
ho-0.016
 peac-0.015
 prope-0.015
Token the
Token ID262
Feature activation+7.85e-01

Pos logit contributions
spe+0.033
work+0.032
ho+0.031
 peac+0.030
 prope+0.029
Neg logit contributions
 Care-0.014
!-0.014
 snack-0.014
 Sure-0.014
 spaghetti-0.013
Token kay
Token ID34681
Feature activation+2.95e+00

Pos logit contributions
spe+0.029
work+0.029
ho+0.028
 prope+0.027
 peac+0.026
Neg logit contributions
 snack-0.021
!-0.021
 Care-0.020
 rest-0.020
 minutes-0.019
Tokenak
Token ID461
Feature activation-7.94e-01

Pos logit contributions
spe+0.105
work+0.099
ho+0.095
 peac+0.093
 prope+0.092
Neg logit contributions
 Care-0.083
 Sure-0.083
 snack-0.081
 spaghetti-0.080
 minutes-0.080
Token before
Token ID878
Feature activation+1.61e+00

Pos logit contributions
!+0.011
 while+0.010
 snack+0.008
 spaghetti+0.007
 As+0.007
Neg logit contributions
 peac-0.042
spe-0.042
 prope-0.041
ho-0.041
 narrative-0.040
Token it
Token ID340
Feature activation+1.21e+00

Pos logit contributions
spe+0.073
work+0.070
ho+0.068
 peac+0.067
 prope+0.066
Neg logit contributions
 Care-0.029
 Sure-0.028
 minutes-0.027
 snack-0.027
 pharmacy-0.026
Token disappeared
Token ID12120
Feature activation+7.66e-01

Pos logit contributions
spe+0.051
 prope+0.050
work+0.050
ateful+0.049
 narrative+0.049
Neg logit contributions
!-0.028
 while-0.025
 snack-0.024
 As-0.023
 Care-0.022
Token,
Token ID11
Feature activation-2.76e-02

Pos logit contributions
spe+0.045
 peac+0.043
work+0.043
ho+0.043
 prope+0.042
Neg logit contributions
!-0.006
 while-0.003
 minutes-0.003
 Care-0.003
 snack-0.002
Token Jake
Token ID14757
Feature activation-2.17e-01

Pos logit contributions
 snack+0.027
 pharmacy+0.025
 snowball+0.025
 minutes+0.024
 rest+0.024
Neg logit contributions
spe-0.039
work-0.037
 prope-0.033
ho-0.032
 peac-0.031
Token had
Token ID550
Feature activation+9.86e-02

Pos logit contributions
!+0.004
 spaghetti+0.004
 Care+0.004
 minutes+0.004
 As+0.004
Neg logit contributions
spe-0.010
work-0.010
 prope-0.010
 peac-0.010
 narrative-0.009
Token never
Token ID1239
Feature activation+3.38e-01

Pos logit contributions
spe+0.004
work+0.004
ho+0.004
 peac+0.004
 prope+0.004
Neg logit contributions
 spaghetti-0.002
!-0.002
 snack-0.002
 Care-0.002
 minutes-0.002
Token had
Token ID550
Feature activation-4.32e-01

Pos logit contributions
spe+0.015
work+0.014
ho+0.014
 prope+0.014
 peac+0.013
Neg logit contributions
 snack-0.007
 spaghetti-0.007
 minutes-0.007
!-0.007
 Care-0.007
Token por
Token ID16964
Feature activation-6.28e-01

Pos logit contributions
 Care+0.008
 Sure+0.008
 snack+0.007
 pharmacy+0.007
 minutes+0.007
Neg logit contributions
spe-0.019
work-0.019
ho-0.018
 peac-0.018
 prope-0.018
Tokenridge
Token ID12818
Feature activation-3.07e+00

Pos logit contributions
 Sure+0.015
 Care+0.014
 clearing+0.014
 minutes+0.014
 chair+0.013
Neg logit contributions
work-0.023
ho-0.022
spe-0.022
melon-0.021
 prope-0.021
Token before
Token ID878
Feature activation+2.20e-01

Pos logit contributions
 Care+0.049
 Sure+0.048
 snack+0.047
!+0.046
 spaghetti+0.046
Neg logit contributions
spe-0.146
work-0.140
ho-0.136
 peac-0.135
 prope-0.134
Token and
Token ID290
Feature activation-4.70e-01

Pos logit contributions
spe+0.010
work+0.010
ho+0.010
 peac+0.009
 prope+0.009
Neg logit contributions
 Care-0.004
 Sure-0.004
 snack-0.003
 minutes-0.003
 pharmacy-0.003
Token he
Token ID339
Feature activation-7.41e-01

Pos logit contributions
 spaghetti+0.008
!+0.007
 Care+0.007
 snack+0.007
 minutes+0.007
Neg logit contributions
spe-0.023
work-0.022
 peac-0.021
ho-0.021
 prope-0.021
Token was
Token ID373
Feature activation-3.63e-01

Pos logit contributions
!+0.018
 spaghetti+0.017
 Care+0.017
 snack+0.017
 minutes+0.017
Neg logit contributions
spe-0.030
work-0.029
 peac-0.028
 prope-0.028
ho-0.027
Token not
Token ID407
Feature activation-6.17e-01

Pos logit contributions
 Care+0.008
!+0.008
 minutes+0.007
 spaghetti+0.007
 Dr+0.007
Neg logit contributions
spe-0.016
ho-0.015
work-0.015
 peac-0.015
 prope-0.015
Token saw
Token ID2497
Feature activation-1.39e+00

Pos logit contributions
 Care+0.028
 Sure+0.027
 snack+0.027
!+0.026
 spaghetti+0.026
Neg logit contributions
spe-0.036
work-0.034
ho-0.033
 prope-0.032
 peac-0.032
Token.
Token ID13
Feature activation-9.17e-03

Pos logit contributions
!+0.017
 Care+0.015
 spaghetti+0.015
 minutes+0.014
 snack+0.014
Neg logit contributions
spe-0.073
work-0.070
 prope-0.070
 peac-0.069
ho-0.069
Token She
Token ID1375
Feature activation-3.08e-01

Pos logit contributions
 snack+0.000
 minutes+0.000
!+0.000
 Care+0.000
 Sure+0.000
Neg logit contributions
spe-0.000
work-0.000
ho-0.000
 peac-0.000
 prope-0.000
Token used
Token ID973
Feature activation+5.45e-02

Pos logit contributions
!+0.005
 Care+0.005
As+0.005
 Sure+0.005
+0.005
Neg logit contributions
spe-0.014
work-0.014
ho-0.014
 prope-0.014
melon-0.013
Token cr
Token ID1067
Feature activation-4.34e-01

Pos logit contributions
spe+0.003
work+0.003
ho+0.003
 peac+0.002
edient+0.002
Neg logit contributions
 Care-0.001
 spaghetti-0.001
 snack-0.001
 minutes-0.001
!-0.001
Tokenay
Token ID323
Feature activation-3.07e+00

Pos logit contributions
 Care+0.009
 Sure+0.009
 snack+0.008
!+0.008
 spaghetti+0.008
Neg logit contributions
spe-0.019
work-0.018
ho-0.017
 peac-0.017
 prope-0.017
Tokenons
Token ID684
Feature activation-1.10e+00

Pos logit contributions
 Sure+0.071
 Care+0.069
 pharmacy+0.068
 snack+0.067
 Dr+0.065
Neg logit contributions
spe-0.120
work-0.119
 peac-0.113
ho-0.112
 prope-0.109
Token of
Token ID286
Feature activation-7.02e-01

Pos logit contributions
!+0.011
 while+0.010
 spaghetti+0.007
 snack+0.006
 Care+0.006
Neg logit contributions
spe-0.065
 prope-0.063
 peac-0.060
 narrative-0.060
work-0.059
Token different
Token ID1180
Feature activation-1.02e+00

Pos logit contributions
 Care+0.016
 spaghetti+0.016
 snack+0.016
!+0.016
 minutes+0.015
Neg logit contributions
spe-0.030
work-0.028
ho-0.027
 prope-0.026
 peac-0.026
Token colors
Token ID7577
Feature activation-3.65e-01

Pos logit contributions
 Care+0.030
 Sure+0.030
 snack+0.029
 minutes+0.029
 spaghetti+0.029
Neg logit contributions
spe-0.034
work-0.033
ho-0.031
 peac-0.031
 prope-0.030
Token.
Token ID13
Feature activation-6.87e-03

Pos logit contributions
!+0.004
 while+0.003
.+0.002
 spaghetti+0.002
 snack+0.002
Neg logit contributions
spe-0.022
 prope-0.021
 peac-0.020
 narrative-0.020
ho-0.020
Token dreamed
Token ID27947
Feature activation-1.06e+00

Pos logit contributions
 Care+0.023
!+0.023
 Sure+0.022
 snack+0.022
 minutes+0.022
Neg logit contributions
spe-0.031
work-0.029
 prope-0.028
ho-0.028
 peac-0.028
Token.
Token ID13
Feature activation-7.26e-02

Pos logit contributions
 Sure+0.021
 Care+0.021
 snack+0.020
 pharmacy+0.020
 spaghetti+0.020
Neg logit contributions
spe-0.045
work-0.044
ho-0.042
 peac-0.041
 prope-0.041
Token She
Token ID1375
Feature activation-2.89e-01

Pos logit contributions
 minutes+0.002
!+0.002
 snack+0.002
 quick+0.002
 snowball+0.002
Neg logit contributions
spe-0.003
work-0.003
melon-0.003
ho-0.002
 peac-0.002
Token used
Token ID973
Feature activation+2.24e-01

Pos logit contributions
!+0.005
 minutes+0.005
As+0.005
 walks+0.005
 chair+0.005
Neg logit contributions
melon-0.013
spe-0.013
ho-0.013
work-0.012
 peac-0.012
Token cr
Token ID1067
Feature activation-5.48e-01

Pos logit contributions
spe+0.011
work+0.011
ho+0.010
 peac+0.010
 prope+0.010
Neg logit contributions
 Care-0.004
 spaghetti-0.004
 snack-0.004
 minutes-0.003
!-0.003
Tokenay
Token ID323
Feature activation-3.06e+00

Pos logit contributions
 Care+0.012
 Sure+0.011
 snack+0.011
 minutes+0.011
 spaghetti+0.011
Neg logit contributions
spe-0.023
work-0.022
ho-0.021
 peac-0.021
 prope-0.021
Tokenons
Token ID684
Feature activation-1.50e+00

Pos logit contributions
 Sure+0.073
 Care+0.071
 pharmacy+0.070
 snack+0.069
 clearing+0.067
Neg logit contributions
spe-0.117
work-0.116
 peac-0.111
ho-0.108
 prope-0.108
Token and
Token ID290
Feature activation-6.99e-01

Pos logit contributions
!+0.016
 while+0.015
 Care+0.011
 snack+0.011
 spaghetti+0.011
Neg logit contributions
spe-0.086
 prope-0.082
 peac-0.079
work-0.079
 narrative-0.079
Token stickers
Token ID28568
Feature activation-8.68e-01

Pos logit contributions
 spaghetti+0.016
 minutes+0.016
 snack+0.015
 while+0.015
 Care+0.014
Neg logit contributions
spe-0.031
work-0.029
ho-0.028
melon-0.028
 peac-0.027
Token to
Token ID284
Feature activation-7.24e-01

Pos logit contributions
!+0.010
 while+0.009
 Care+0.006
 snack+0.006
 As+0.006
Neg logit contributions
spe-0.049
 prope-0.046
 peac-0.046
work-0.045
 narrative-0.045
Token make
Token ID787
Feature activation-3.51e-01

Pos logit contributions
 snack+0.013
 Care+0.012
!+0.012
 spaghetti+0.012
 rest+0.012
Neg logit contributions
spe-0.036
ho-0.033
work-0.033
melon-0.031
 peac-0.031
Token.
Token ID13
Feature activation-8.51e-02

Pos logit contributions
!+0.005
 Care+0.003
 spaghetti+0.003
 minutes+0.003
 Dr+0.003
Neg logit contributions
spe-0.030
ho-0.029
work-0.029
 peac-0.029
 prope-0.028
Token She
Token ID1375
Feature activation-7.81e-01

Pos logit contributions
 snack+0.001
 minutes+0.001
 Care+0.001
 spaghetti+0.001
 Sure+0.001
Neg logit contributions
spe-0.004
work-0.004
ho-0.004
 peac-0.004
 prope-0.004
Token writes
Token ID6797
Feature activation-9.06e-01

Pos logit contributions
 walks+0.015
 Care+0.014
!+0.014
 minutes+0.013
 Dr+0.013
Neg logit contributions
spe-0.036
ho-0.035
work-0.034
 peac-0.033
melon-0.033
Token with
Token ID351
Feature activation-5.53e-01

Pos logit contributions
!+0.013
 spaghetti+0.013
 while+0.013
 Dr+0.012
 Care+0.012
Neg logit contributions
spe-0.048
work-0.044
 peac-0.043
 prope-0.043
ho-0.043
Token cr
Token ID1067
Feature activation-4.85e-01

Pos logit contributions
 Care+0.008
 spaghetti+0.007
 snack+0.007
!+0.007
 minutes+0.007
Neg logit contributions
spe-0.030
work-0.028
ho-0.027
 peac-0.026
melon-0.026
Tokenay
Token ID323
Feature activation-3.03e+00

Pos logit contributions
 Care+0.010
 Sure+0.009
!+0.009
 snack+0.009
 spaghetti+0.009
Neg logit contributions
spe-0.021
work-0.020
ho-0.020
 peac-0.019
 prope-0.019
Tokenons
Token ID684
Feature activation-7.27e-01

Pos logit contributions
 Care+0.058
 Sure+0.057
 snack+0.055
 minutes+0.054
 spaghetti+0.054
Neg logit contributions
spe-0.134
work-0.129
ho-0.124
 peac-0.123
 prope-0.122
Token,
Token ID11
Feature activation+1.18e-01

Pos logit contributions
!+0.007
 while+0.005
.+0.004
 spaghetti+0.004
 snack+0.004
Neg logit contributions
spe-0.044
 prope-0.041
 peac-0.040
work-0.040
 narrative-0.040
Token pens
Token ID29707
Feature activation-2.44e-01

Pos logit contributions
spe+0.006
work+0.005
ho+0.005
edient+0.005
melon+0.005
Neg logit contributions
 spaghetti-0.002
 snack-0.002
 minutes-0.002
 while-0.002
 walks-0.002
Token,
Token ID11
Feature activation+8.73e-02

Pos logit contributions
!+0.002
 while+0.002
 snack+0.001
 spaghetti+0.001
.+0.001
Neg logit contributions
spe-0.015
work-0.014
 peac-0.013
ho-0.013
 prope-0.013
Token pencil
Token ID21613
Feature activation-1.49e-01

Pos logit contributions
spe+0.005
work+0.004
ho+0.004
edient+0.004
melon+0.004
Neg logit contributions
 spaghetti-0.002
 while-0.001
 snack-0.001
 minutes-0.001
 walks-0.001
Token stamps
Token ID25560
Feature activation-5.06e-03

Pos logit contributions
 minutes+0.009
!+0.008
 snack+0.008
 Care+0.008
 spaghetti+0.008
Neg logit contributions
spe-0.021
work-0.021
ho-0.020
 peac-0.019
 prope-0.019
Token.
Token ID13
Feature activation-1.64e-01

Pos logit contributions
!+0.000
 while+0.000
 snack+0.000
 Care+0.000
 spaghetti+0.000
Neg logit contributions
spe-0.000
work-0.000
 prope-0.000
ho-0.000
 peac-0.000
Token He
Token ID679
Feature activation-8.05e-01

Pos logit contributions
 Sure+0.003
 Care+0.003
 snack+0.003
!+0.003
 spaghetti+0.003
Neg logit contributions
spe-0.007
ho-0.007
work-0.007
 prope-0.006
melon-0.006
Token has
Token ID468
Feature activation-1.42e-01

Pos logit contributions
 walks+0.014
!+0.014
 Care+0.014
 snack+0.014
 pharmacy+0.014
Neg logit contributions
spe-0.037
work-0.035
 peac-0.035
 prope-0.034
ho-0.034
Token cr
Token ID1067
Feature activation-8.60e-03

Pos logit contributions
 minutes+0.002
 spaghetti+0.002
!+0.002
 snack+0.002
 Care+0.002
Neg logit contributions
spe-0.007
work-0.007
ho-0.007
 peac-0.007
 prope-0.007
Tokenay
Token ID323
Feature activation-3.02e+00

Pos logit contributions
 Care+0.000
 Sure+0.000
!+0.000
 As+0.000
 spaghetti+0.000
Neg logit contributions
spe-0.000
work-0.000
ho-0.000
 peac-0.000
 prope-0.000
Tokenons
Token ID684
Feature activation-3.63e-01

Pos logit contributions
 Care+0.054
 Sure+0.052
 snack+0.052
!+0.051
 minutes+0.051
Neg logit contributions
spe-0.138
work-0.132
ho-0.129
 peac-0.125
 prope-0.125
Token and
Token ID290
Feature activation-1.11e+00

Pos logit contributions
!+0.004
 while+0.003
.+0.003
 spaghetti+0.002
 snack+0.002
Neg logit contributions
spe-0.021
 prope-0.020
 peac-0.020
work-0.019
 narrative-0.019
Token scissors
Token ID41786
Feature activation+3.60e-01

Pos logit contributions
 spaghetti+0.024
 snack+0.023
 minutes+0.022
 Care+0.022
!+0.022
Neg logit contributions
spe-0.049
work-0.046
ho-0.045
 peac-0.043
 prope-0.042
Token.
Token ID13
Feature activation+3.82e-02

Pos logit contributions
spe+0.022
 peac+0.021
 prope+0.021
work+0.020
 narrative+0.020
Neg logit contributions
!-0.003
 while-0.001
.-0.001
-0.001
 As-0.000
Token He
Token ID679
Feature activation-5.54e-01

Pos logit contributions
spe+0.002
ho+0.001
work+0.001
melon+0.001
 prope+0.001
Neg logit contributions
 Sure-0.001
!-0.001
 snack-0.001
 Care-0.001
 spaghetti-0.001
Token,
Token ID11
Feature activation-8.73e-01

Pos logit contributions
!+0.007
 while+0.005
.+0.005
 snack+0.004
 chair+0.003
Neg logit contributions
spe-0.027
 peac-0.026
 prope-0.025
 narrative-0.025
ho-0.025
Token papers
Token ID9473
Feature activation+9.88e-03

Pos logit contributions
 spaghetti+0.015
 while+0.015
 snack+0.015
 minutes+0.014
 Care+0.014
Neg logit contributions
ho-0.044
spe-0.043
work-0.040
edient-0.039
melon-0.039
Token,
Token ID11
Feature activation-6.27e-01

Pos logit contributions
spe+0.001
work+0.001
 peac+0.001
ho+0.001
 prope+0.001
Neg logit contributions
!-0.000
 while-0.000
 snack-0.000
 minutes-0.000
 chair-0.000
Token and
Token ID290
Feature activation-9.21e-01

Pos logit contributions
 while+0.011
 snack+0.010
 spaghetti+0.010
 minutes+0.010
 bed+0.009
Neg logit contributions
spe-0.031
ho-0.030
work-0.029
edient-0.028
melon-0.028
Token cr
Token ID1067
Feature activation-5.95e-01

Pos logit contributions
 minutes+0.019
 spaghetti+0.019
 snack+0.018
 walks+0.017
 while+0.017
Neg logit contributions
ho-0.043
spe-0.042
work-0.040
melon-0.039
edient-0.038
Tokenay
Token ID323
Feature activation-3.02e+00

Pos logit contributions
 Care+0.012
 Sure+0.011
!+0.011
 snack+0.011
 spaghetti+0.011
Neg logit contributions
spe-0.026
work-0.025
ho-0.024
 peac-0.024
 prope-0.023
Tokenons
Token ID684
Feature activation-1.13e+00

Pos logit contributions
 Care+0.059
 Sure+0.058
 snack+0.056
 minutes+0.055
 spaghetti+0.055
Neg logit contributions
spe-0.133
work-0.128
ho-0.124
 peac-0.122
 prope-0.121
Token everywhere
Token ID8347
Feature activation-4.89e-01

Pos logit contributions
!+0.008
.+0.004
 while+0.003
 snack+0.002
 spaghetti+0.001
Neg logit contributions
spe-0.071
 peac-0.067
 prope-0.066
ho-0.066
 narrative-0.066
Token.
Token ID13
Feature activation-1.84e-01

Pos logit contributions
!+0.003
 Care+0.002
 spaghetti+0.002
 minutes+0.002
 snack+0.002
Neg logit contributions
spe-0.029
work-0.028
ho-0.028
 peac-0.027
 prope-0.027
Token She
Token ID1375
Feature activation-2.94e-01

Pos logit contributions
 snack+0.004
 minutes+0.003
!+0.003
 pharmacy+0.003
 snowball+0.003
Neg logit contributions
spe-0.008
work-0.007
ho-0.007
 peac-0.007
 prope-0.007
Token saw
Token ID2497
Feature activation-1.26e+00

Pos logit contributions
 Care+0.006
 minutes+0.006
 Sure+0.006
!+0.006
As+0.006
Neg logit contributions
spe-0.012
work-0.012
ho-0.012
 prope-0.012
 peac-0.011
Token filthy
Token ID35677
Feature activation+1.52e-01

Pos logit contributions
 quick+0.008
 spaghetti+0.007
!+0.007
 minutes+0.006
 Care+0.006
Neg logit contributions
work-0.024
spe-0.023
 prope-0.023
 peac-0.023
ho-0.022
Token!
Token ID0
Feature activation-1.16e+00

Pos logit contributions
spe+0.008
work+0.008
ho+0.008
 peac+0.008
 prope+0.008
Neg logit contributions
!-0.002
 Care-0.001
 spaghetti-0.001
 snack-0.001
 Sure-0.001
Token She
Token ID1375
Feature activation-1.72e-01

Pos logit contributions
 snack+0.020
 minutes+0.019
 pharmacy+0.019
!+0.019
 Care+0.018
Neg logit contributions
spe-0.052
work-0.050
ho-0.048
 prope-0.047
 peac-0.047
Token hadn
Token ID8020
Feature activation-9.94e-01

Pos logit contributions
!+0.004
 Care+0.004
 minutes+0.003
 spaghetti+0.003
 Sure+0.003
Neg logit contributions
spe-0.008
work-0.007
ho-0.007
 prope-0.007
 peac-0.007
Token't
Token ID470
Feature activation-4.52e-01

Pos logit contributions
 Care+0.021
 minutes+0.021
 Sure+0.021
 snack+0.020
 pharmacy+0.020
Neg logit contributions
spe-0.041
work-0.040
ho-0.039
 peac-0.038
 prope-0.037
Token seen
Token ID1775
Feature activation-3.02e+00

Pos logit contributions
 minutes+0.009
!+0.009
 snack+0.009
 spaghetti+0.009
 Care+0.009
Neg logit contributions
spe-0.020
ho-0.020
work-0.019
 prope-0.018
 peac-0.018
Token it
Token ID340
Feature activation-1.85e+00

Pos logit contributions
 Care+0.065
 Sure+0.062
 spaghetti+0.062
 snack+0.062
!+0.062
Neg logit contributions
spe-0.130
work-0.122
ho-0.120
 peac-0.117
 prope-0.115
Token look
Token ID804
Feature activation-3.74e-01

Pos logit contributions
!+0.034
 snack+0.032
 Care+0.032
 spaghetti+0.031
 Sure+0.031
Neg logit contributions
spe-0.087
work-0.082
 prope-0.081
 peac-0.080
ho-0.080
Token like
Token ID588
Feature activation-8.64e-01

Pos logit contributions
!+0.008
 spaghetti+0.008
 minutes+0.008
 while+0.007
 snack+0.007
Neg logit contributions
spe-0.017
ho-0.016
work-0.016
 peac-0.016
 prope-0.015
Token this
Token ID428
Feature activation-7.62e-02

Pos logit contributions
 spaghetti+0.015
 snack+0.015
!+0.014
 Care+0.014
 minutes+0.014
Neg logit contributions
spe-0.042
work-0.040
ho-0.039
 peac-0.038
 prope-0.038
Token before
Token ID878
Feature activation+8.74e-02

Pos logit contributions
 Care+0.001
 snack+0.001
!+0.001
 Sure+0.001
 spaghetti+0.001
Neg logit contributions
spe-0.004
work-0.004
ho-0.004
 peac-0.003
 prope-0.003
Token art
Token ID1242
Feature activation-1.09e+00

Pos logit contributions
 snack+0.004
 Care+0.004
 spaghetti+0.004
!+0.004
 minutes+0.004
Neg logit contributions
spe-0.010
work-0.009
ho-0.009
 peac-0.009
 prope-0.009
Token.
Token ID13
Feature activation-6.81e-02

Pos logit contributions
 Care+0.020
 snack+0.019
!+0.019
 Sure+0.019
 minutes+0.019
Neg logit contributions
spe-0.049
work-0.047
ho-0.047
 peac-0.045
 prope-0.045
Token We
Token ID775
Feature activation-9.28e-02

Pos logit contributions
 Care+0.001
 Sure+0.001
 snack+0.001
 minutes+0.001
 spaghetti+0.001
Neg logit contributions
spe-0.003
work-0.003
ho-0.003
 peac-0.003
 prope-0.003
Token used
Token ID973
Feature activation-1.05e-01

Pos logit contributions
!+0.002
 rest+0.002
!"+0.002
 Care+0.002
 hide+0.002
Neg logit contributions
spe-0.004
ho-0.004
work-0.004
 prope-0.004
melon-0.004
Token cr
Token ID1067
Feature activation-5.19e-01

Pos logit contributions
!+0.002
 Care+0.002
 spaghetti+0.002
 snack+0.002
 minutes+0.002
Neg logit contributions
spe-0.005
ho-0.005
work-0.005
 peac-0.005
 narrative-0.005
Tokenay
Token ID323
Feature activation-2.97e+00

Pos logit contributions
 Care+0.010
 Sure+0.010
!+0.010
 snack+0.010
 spaghetti+0.010
Neg logit contributions
spe-0.023
work-0.022
ho-0.021
 peac-0.021
 prope-0.021
Tokenons
Token ID684
Feature activation-9.54e-01

Pos logit contributions
 Care+0.055
 Sure+0.054
 snack+0.053
 minutes+0.052
 spaghetti+0.052
Neg logit contributions
spe-0.133
work-0.128
ho-0.124
 peac-0.122
 prope-0.121
Token and
Token ID290
Feature activation-2.42e-01

Pos logit contributions
!+0.012
 while+0.010
 spaghetti+0.009
 snack+0.009
!"+0.009
Neg logit contributions
spe-0.053
 prope-0.049
ho-0.049
 peac-0.048
work-0.048
Token markers
Token ID19736
Feature activation-1.02e+00

Pos logit contributions
 snack+0.005
 spaghetti+0.005
 minutes+0.005
!+0.005
 Care+0.005
Neg logit contributions
spe-0.011
ho-0.010
work-0.009
melon-0.009
edient-0.009
Token to
Token ID284
Feature activation-6.35e-01

Pos logit contributions
!+0.013
 while+0.010
!"+0.010
 snack+0.010
 spaghetti+0.009
Neg logit contributions
spe-0.056
ho-0.052
 peac-0.052
 prope-0.051
 narrative-0.050
Token draw
Token ID3197
Feature activation-6.72e-01

Pos logit contributions
 snack+0.011
!+0.010
 rest+0.010
 bed+0.010
 Care+0.010
Neg logit contributions
spe-0.032
ho-0.030
work-0.028
melon-0.028
 narrative-0.027
Token couldn
Token ID3521
Feature activation-7.67e-01

Pos logit contributions
!+0.010
 Care+0.010
 snack+0.009
 Sure+0.009
 spaghetti+0.009
Neg logit contributions
spe-0.015
 prope-0.015
work-0.015
 peac-0.015
ho-0.014
Token't
Token ID470
Feature activation-5.96e-02

Pos logit contributions
 Care+0.011
!+0.011
 Sure+0.011
 snack+0.011
 As+0.010
Neg logit contributions
spe-0.038
work-0.036
 peac-0.035
ho-0.035
 prope-0.035
Token un
Token ID555
Feature activation+2.30e-01

Pos logit contributions
!+0.001
 snack+0.001
 rest+0.001
 nap+0.001
 chew+0.001
Neg logit contributions
spe-0.003
work-0.003
ho-0.003
 prope-0.003
 peac-0.003
Token-
Token ID12
Feature activation-3.97e-01

Pos logit contributions
spe+0.011
work+0.010
ho+0.010
 peac+0.010
 prope+0.010
Neg logit contributions
 Care-0.004
 Sure-0.004
 snack-0.004
!-0.004
 pharmacy-0.004
Tokenm
Token ID76
Feature activation-8.41e-01

Pos logit contributions
 Care+0.008
 As+0.008
!+0.008
 Sure+0.008
As+0.008
Neg logit contributions
spe-0.016
work-0.016
 peac-0.016
 prope-0.016
ho-0.015
Tokenelt
Token ID2120
Feature activation-2.90e+00

Pos logit contributions
!+0.018
 spaghetti+0.016
 snack+0.015
 minutes+0.015
 Care+0.014
Neg logit contributions
spe-0.037
ho-0.037
 peac-0.037
work-0.037
 prope-0.035
Token it
Token ID340
Feature activation+2.38e-01

Pos logit contributions
!+0.049
 Care+0.049
 snack+0.049
 spaghetti+0.048
 minutes+0.047
Neg logit contributions
spe-0.135
work-0.131
ho-0.128
 peac-0.126
 prope-0.125
Token.
Token ID13
Feature activation-3.02e-01

Pos logit contributions
spe+0.013
 peac+0.012
work+0.012
 prope+0.012
ho+0.012
Neg logit contributions
!-0.003
 snack-0.003
 while-0.003
 spaghetti-0.003
 rest-0.002
Token
Token ID220
Feature activation-5.35e-01

Pos logit contributions
 snack+0.005
 minutes+0.005
 spaghetti+0.005
 pharmacy+0.005
!+0.004
Neg logit contributions
spe-0.014
work-0.013
ho-0.013
 peac-0.012
 prope-0.012
Token
Token ID198
Feature activation-1.59e-01

Pos logit contributions
 minutes+0.007
 snack+0.007
 spaghetti+0.006
 pharmacy+0.006
 cafĂ©+0.006
Neg logit contributions
spe-0.027
work-0.026
ho-0.025
 prope-0.024
melon-0.024
Token
Token ID198
Feature activation-3.55e-01

Pos logit contributions
 spaghetti+0.002
 snack+0.002
 minutes+0.002
 pharmacy+0.002
 Care+0.002
Neg logit contributions
spe-0.008
ho-0.007
work-0.007
 peac-0.007
 prope-0.007
Token scissors
Token ID41786
Feature activation-3.09e-01

Pos logit contributions
spe+0.005
ho+0.004
work+0.004
edient+0.004
 peac+0.004
Neg logit contributions
 spaghetti-0.002
 minutes-0.002
 snack-0.002
 chair-0.002
 bed-0.002
Token,
Token ID11
Feature activation-2.17e-01

Pos logit contributions
!+0.003
 while+0.002
.+0.002
 minutes+0.001
!"+0.001
Neg logit contributions
spe-0.019
 peac-0.018
 narrative-0.017
work-0.017
ho-0.017
Token glue
Token ID22749
Feature activation-6.12e-01

Pos logit contributions
 minutes+0.004
 spaghetti+0.004
 while+0.003
 snack+0.003
 chair+0.003
Neg logit contributions
spe-0.012
ho-0.010
work-0.010
edient-0.010
 peac-0.010
Token and
Token ID290
Feature activation-9.40e-01

Pos logit contributions
!+0.006
 while+0.004
 snack+0.004
.+0.004
 minutes+0.003
Neg logit contributions
spe-0.037
 peac-0.034
work-0.034
ho-0.033
 narrative-0.033
Token cr
Token ID1067
Feature activation-6.42e-01

Pos logit contributions
 minutes+0.018
 snack+0.017
 spaghetti+0.017
 walks+0.015
 while+0.015
Neg logit contributions
spe-0.046
ho-0.044
work-0.043
melon-0.042
 peac-0.040
Tokenay
Token ID323
Feature activation-2.89e+00

Pos logit contributions
 Care+0.013
 Sure+0.012
!+0.012
 snack+0.012
 spaghetti+0.012
Neg logit contributions
spe-0.028
work-0.027
ho-0.026
 peac-0.026
 prope-0.025
Tokenons
Token ID684
Feature activation-1.03e+00

Pos logit contributions
 Care+0.050
!+0.049
 snack+0.048
 minutes+0.048
 Sure+0.048
Neg logit contributions
spe-0.135
work-0.128
ho-0.126
 peac-0.123
 prope-0.122
Token.
Token ID13
Feature activation+2.71e-01

Pos logit contributions
!+0.004
.+0.002
 while+0.001
 snack-0.001
!"-0.001
Neg logit contributions
spe-0.068
 peac-0.064
 prope-0.063
ho-0.063
 narrative-0.063
Token They
Token ID1119
Feature activation-1.86e-01

Pos logit contributions
spe+0.011
work+0.011
ho+0.011
 peac+0.010
 prope+0.010
Neg logit contributions
 Care-0.006
 Sure-0.006
 snack-0.006
 As-0.006
 spaghetti-0.006
Token make
Token ID787
Feature activation-3.98e-01

Pos logit contributions
 rest+0.002
 nap+0.002
!+0.002
 snack+0.002
 chew+0.002
Neg logit contributions
spe-0.010
ho-0.009
 peac-0.009
ull-0.009
work-0.009
Token their
Token ID511
Feature activation-4.15e-01

Pos logit contributions
 spaghetti+0.007
 snack+0.006
!+0.006
 minutes+0.006
 Care+0.006
Neg logit contributions
spe-0.019
ho-0.019
work-0.018
 peac-0.018
 narrative-0.017
Token.
Token ID13
Feature activation+7.23e-01

Pos logit contributions
spe+0.020
ho+0.020
 peac+0.019
 prope+0.019
work+0.019
Neg logit contributions
!-0.002
 while-0.001
.-0.001
 snack-0.001
 Care-0.000
Token She
Token ID1375
Feature activation+3.65e-01

Pos logit contributions
spe+0.032
work+0.031
ho+0.030
 peac+0.029
 prope+0.029
Neg logit contributions
 minutes-0.013
 snack-0.013
!-0.012
 Sure-0.012
 Care-0.012
Token had
Token ID550
Feature activation+3.45e-01

Pos logit contributions
spe+0.016
work+0.016
 prope+0.016
ho+0.016
melon+0.016
Neg logit contributions
!-0.006
 minutes-0.006
-0.005
 spaghetti-0.005
 while-0.005
Token many
Token ID867
Feature activation-9.11e-01

Pos logit contributions
spe+0.018
work+0.018
ho+0.017
melon+0.017
 prope+0.017
Neg logit contributions
 minutes-0.005
 Care-0.004
 spaghetti-0.004
 snack-0.004
!-0.004
Token cr
Token ID1067
Feature activation+5.78e-02

Pos logit contributions
 minutes+0.016
 Care+0.016
 snack+0.016
!+0.015
 spaghetti+0.015
Neg logit contributions
spe-0.042
work-0.042
ho-0.040
 peac-0.039
 prope-0.038
Tokenay
Token ID323
Feature activation-2.89e+00

Pos logit contributions
spe+0.003
work+0.002
ho+0.002
 peac+0.002
 prope+0.002
Neg logit contributions
 Care-0.001
 Sure-0.001
!-0.001
 snack-0.001
 spaghetti-0.001
Tokenons
Token ID684
Feature activation-5.62e-01

Pos logit contributions
 Care+0.057
 Sure+0.056
 snack+0.054
 minutes+0.053
 pharmacy+0.052
Neg logit contributions
spe-0.126
work-0.122
ho-0.118
 peac-0.116
 prope-0.115
Token,
Token ID11
Feature activation+7.15e-01

Pos logit contributions
!+0.007
 while+0.005
.+0.004
 snack+0.004
 spaghetti+0.004
Neg logit contributions
spe-0.032
 prope-0.031
work-0.030
 peac-0.029
ho-0.029
Token papers
Token ID9473
Feature activation+5.42e-01

Pos logit contributions
spe+0.036
work+0.034
ho+0.034
melon+0.032
 prope+0.032
Neg logit contributions
 spaghetti-0.012
 while-0.011
 snack-0.011
 minutes-0.011
 Care-0.011
Token,
Token ID11
Feature activation+6.81e-01

Pos logit contributions
spe+0.031
 prope+0.029
work+0.029
 peac+0.028
ho+0.028
Neg logit contributions
!-0.006
 while-0.004
 Care-0.004
 snack-0.003
 minutes-0.003
Token scissors
Token ID41786
Feature activation+6.78e-02

Pos logit contributions
spe+0.034
ho+0.031
work+0.031
edient+0.030
melon+0.030
Neg logit contributions
 spaghetti-0.012
 while-0.012
 snack-0.011
 minutes-0.011
!-0.011
Token the
Token ID262
Feature activation+5.66e-01

Pos logit contributions
!+0.000
 while+0.000
 spaghetti+0.000
 snack+0.000
 Care+0.000
Neg logit contributions
spe-0.001
 prope-0.001
 narrative-0.001
 peac-0.001
ateful-0.001
Token whale
Token ID22206
Feature activation+9.30e-01

Pos logit contributions
spe+0.022
ho+0.020
work+0.019
 prope+0.019
ateful+0.018
Neg logit contributions
 minutes-0.015
 rest-0.015
 snack-0.014
 spaghetti-0.014
!-0.014
Token had
Token ID550
Feature activation+4.97e-01

Pos logit contributions
spe+0.043
 prope+0.042
 narrative+0.042
ateful+0.041
 peac+0.039
Neg logit contributions
!-0.021
 while-0.017
 snack-0.015
 spaghetti-0.015
.-0.015
Token ever
Token ID1683
Feature activation+4.47e-01

Pos logit contributions
spe+0.023
 prope+0.022
work+0.022
ho+0.021
 peac+0.020
Neg logit contributions
!-0.011
 minutes-0.009
 Care-0.008
 snack-0.008
 while-0.008
Token sm
Token ID895
Feature activation+8.29e-01

Pos logit contributions
spe+0.022
work+0.021
 prope+0.021
ho+0.021
 peac+0.020
Neg logit contributions
!-0.007
 minutes-0.007
 snack-0.007
 spaghetti-0.006
 Care-0.006
Tokenelt
Token ID2120
Feature activation-2.87e+00

Pos logit contributions
spe+0.030
work+0.029
ho+0.028
 peac+0.027
 prope+0.026
Neg logit contributions
 Care-0.023
 Sure-0.023
 snack-0.022
 minutes-0.022
 pharmacy-0.022
Token.
Token ID13
Feature activation-2.98e-01

Pos logit contributions
!+0.041
 snack+0.030
 minutes+0.030
 spaghetti+0.030
 while+0.030
Neg logit contributions
spe-0.154
work-0.148
 peac-0.146
 prope-0.145
ho-0.144
Token He
Token ID679
Feature activation-3.23e-02

Pos logit contributions
 minutes+0.006
 snack+0.006
 pharmacy+0.006
 spaghetti+0.006
 procedure+0.005
Neg logit contributions
spe-0.012
work-0.012
ho-0.011
melon-0.011
 peac-0.011
Token couldn
Token ID3521
Feature activation-7.43e-01

Pos logit contributions
!+0.001
As+0.001
 while+0.001
 cast+0.001
 minutes+0.001
Neg logit contributions
 narrative-0.002
work-0.001
spe-0.001
ateful-0.001
hips-0.001
Token't
Token ID470
Feature activation-5.37e-01

Pos logit contributions
 Care+0.010
!+0.010
 Sure+0.010
 As+0.009
 snack+0.009
Neg logit contributions
spe-0.038
work-0.036
 peac-0.035
 prope-0.035
ho-0.035
Token help
Token ID1037
Feature activation-6.26e-01

Pos logit contributions
 rest+0.011
 snack+0.011
!+0.010
 while+0.010
 nap+0.010
Neg logit contributions
spe-0.025
work-0.022
ho-0.022
 prope-0.022
melon-0.022
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
!+0.001
 Care+0.001
 spaghetti+0.001
 snack+0.001
 minutes+0.001
Neg logit contributions
spe-0.003
work-0.003
ho-0.003
 peac-0.003
 prope-0.003
Token She
Token ID1375
Feature activation-6.13e-02

Pos logit contributions
 snack+0.003
 minutes+0.003
 pharmacy+0.003
!+0.003
 spaghetti+0.003
Neg logit contributions
spe-0.006
work-0.006
ho-0.006
 peac-0.005
 prope-0.005
Token had
Token ID550
Feature activation+2.26e-01

Pos logit contributions
!+0.001
 Care+0.001
 minutes+0.001
 Sure+0.001
As+0.001
Neg logit contributions
spe-0.003
work-0.003
 prope-0.002
ho-0.002
 peac-0.002
Token many
Token ID867
Feature activation-9.06e-01

Pos logit contributions
spe+0.011
work+0.011
ho+0.011
melon+0.010
 peac+0.010
Neg logit contributions
 Care-0.004
 minutes-0.003
 spaghetti-0.003
!-0.003
 snack-0.003
Token cr
Token ID1067
Feature activation-2.77e-01

Pos logit contributions
 Sure+0.027
 Care+0.026
 pharmacy+0.025
 snack+0.025
 As+0.025
Neg logit contributions
spe-0.031
work-0.029
 peac-0.028
ho-0.028
 prope-0.028
Tokenay
Token ID323
Feature activation-2.87e+00

Pos logit contributions
 Care+0.006
 Sure+0.006
 snack+0.005
 spaghetti+0.005
 minutes+0.005
Neg logit contributions
spe-0.012
work-0.011
ho-0.011
 peac-0.011
 prope-0.011
Tokenons
Token ID684
Feature activation-6.30e-01

Pos logit contributions
 Sure+0.064
 Care+0.063
 pharmacy+0.061
 snack+0.060
 minutes+0.059
Neg logit contributions
spe-0.116
work-0.114
 peac-0.109
ho-0.108
 prope-0.105
Token and
Token ID290
Feature activation-7.74e-01

Pos logit contributions
!+0.008
 while+0.005
.+0.005
 spaghetti+0.004
 snack+0.004
Neg logit contributions
spe-0.036
 prope-0.035
ho-0.033
 narrative-0.033
work-0.033
Token papers
Token ID9473
Feature activation+6.94e-01

Pos logit contributions
 spaghetti+0.016
 Care+0.016
 snack+0.015
 minutes+0.015
!+0.015
Neg logit contributions
spe-0.034
work-0.033
ho-0.032
 prope-0.031
 peac-0.031
Token in
Token ID287
Feature activation+2.73e-01

Pos logit contributions
spe+0.040
 prope+0.038
ho+0.038
 peac+0.037
work+0.037
Neg logit contributions
!-0.007
 while-0.005
.-0.004
 Care-0.004
 snack-0.003
Token her
Token ID607
Feature activation-7.51e-01

Pos logit contributions
spe+0.013
work+0.012
ho+0.012
 peac+0.012
 prope+0.012
Neg logit contributions
 Care-0.005
!-0.005
 snack-0.005
 minutes-0.004
 Sure-0.004
Token.
Token ID13
Feature activation+4.99e-01

Pos logit contributions
spe+0.005
work+0.005
ho+0.005
 peac+0.005
 prope+0.005
Neg logit contributions
 Care-0.001
 Sure-0.001
 snack-0.001
!-0.001
 spaghetti-0.001
Token She
Token ID1375
Feature activation+1.56e-01

Pos logit contributions
spe+0.020
work+0.019
ho+0.018
melon+0.018
 peac+0.017
Neg logit contributions
 minutes-0.011
 snowball-0.010
Michael-0.010
 pharmacy-0.010
 snack-0.010
Token had
Token ID550
Feature activation+3.48e-01

Pos logit contributions
work+0.006
spe+0.006
 prope+0.006
 peac+0.006
ho+0.006
Neg logit contributions
!-0.003
 minutes-0.003
 Sure-0.003
 Care-0.003
 chair-0.003
Token many
Token ID867
Feature activation-8.33e-01

Pos logit contributions
spe+0.019
work+0.018
ho+0.018
melon+0.017
 peac+0.017
Neg logit contributions
 Care-0.004
 minutes-0.004
 spaghetti-0.004
 snack-0.004
!-0.004
Token cr
Token ID1067
Feature activation+1.02e-01

Pos logit contributions
 Sure+0.023
 Care+0.022
 snack+0.022
 pharmacy+0.021
 As+0.021
Neg logit contributions
spe-0.031
work-0.029
ho-0.028
 peac-0.028
 prope-0.027
Tokenay
Token ID323
Feature activation-2.87e+00

Pos logit contributions
spe+0.004
work+0.004
ho+0.004
 peac+0.004
 prope+0.004
Neg logit contributions
 Care-0.002
 Sure-0.002
 snack-0.002
 minutes-0.002
 spaghetti-0.002
Tokenons
Token ID684
Feature activation-5.39e-01

Pos logit contributions
 Sure+0.068
 Care+0.066
 pharmacy+0.065
 snack+0.063
 Dr+0.063
Neg logit contributions
spe-0.112
work-0.110
 peac-0.105
ho-0.104
 prope-0.101
Token and
Token ID290
Feature activation-9.31e-01

Pos logit contributions
!+0.007
 while+0.005
.+0.004
 snack+0.004
 spaghetti+0.004
Neg logit contributions
spe-0.031
 prope-0.029
work-0.028
 narrative-0.028
ho-0.028
Token papers
Token ID9473
Feature activation+6.32e-01

Pos logit contributions
 Care+0.021
 snack+0.020
 Sure+0.020
 spaghetti+0.020
 minutes+0.020
Neg logit contributions
spe-0.039
work-0.037
ho-0.036
 peac-0.035
 prope-0.035
Token to
Token ID284
Feature activation-4.61e-01

Pos logit contributions
spe+0.036
 prope+0.033
work+0.033
ho+0.033
 peac+0.033
Neg logit contributions
!-0.007
 while-0.005
 Care-0.005
 snack-0.004
 minutes-0.004
Token make
Token ID787
Feature activation+5.10e-02

Pos logit contributions
 snack+0.008
 Care+0.008
!+0.007
 spaghetti+0.007
 rest+0.007
Neg logit contributions
spe-0.023
work-0.021
ho-0.021
 prope-0.020
 peac-0.020
Token stage
Token ID3800
Feature activation+4.27e-01

Pos logit contributions
 Sure+0.003
 As+0.003
 Care+0.003
 Dr+0.003
!+0.003
Neg logit contributions
spe-0.005
work-0.005
ho-0.005
 prope-0.005
 peac-0.005
Token.
Token ID13
Feature activation+3.63e-01

Pos logit contributions
spe+0.023
work+0.023
ho+0.022
 peac+0.022
 prope+0.022
Neg logit contributions
 Care-0.003
!-0.003
 Sure-0.003
 snack-0.003
 minutes-0.003
Token They
Token ID1119
Feature activation+4.73e-01

Pos logit contributions
spe+0.015
work+0.015
ho+0.014
 prope+0.014
 peac+0.014
Neg logit contributions
 snack-0.007
 pharmacy-0.007
!-0.007
 minutes-0.007
 spaghetti-0.007
Token use
Token ID779
Feature activation+1.51e-01

Pos logit contributions
spe+0.024
work+0.023
ho+0.022
 peac+0.022
melon+0.022
Neg logit contributions
 Care-0.007
!-0.007
 rest-0.006
 walks-0.006
 search-0.006
Token cr
Token ID1067
Feature activation+1.22e-03

Pos logit contributions
spe+0.007
ho+0.007
work+0.007
 peac+0.007
melon+0.007
Neg logit contributions
 Care-0.003
 minutes-0.003
!-0.003
 spaghetti-0.002
 snack-0.002
Tokenay
Token ID323
Feature activation-2.86e+00

Pos logit contributions
spe+0.000
work+0.000
ho+0.000
 peac+0.000
 prope+0.000
Neg logit contributions
 Care-0.000
 Sure-0.000
 snack-0.000
!-0.000
 spaghetti-0.000
Tokenons
Token ID684
Feature activation-5.92e-01

Pos logit contributions
 Sure+0.064
 Care+0.063
 pharmacy+0.062
 snack+0.060
 Dr+0.059
Neg logit contributions
spe-0.115
work-0.113
 peac-0.108
ho-0.107
 prope-0.105
Token and
Token ID290
Feature activation-5.97e-01

Pos logit contributions
!+0.006
 while+0.004
.+0.003
 spaghetti+0.003
 snack+0.003
Neg logit contributions
spe-0.035
 prope-0.034
 narrative-0.033
 peac-0.032
ateful-0.032
Token stickers
Token ID28568
Feature activation-1.33e-01

Pos logit contributions
 minutes+0.013
 spaghetti+0.012
 snack+0.012
 walks+0.011
 Care+0.011
Neg logit contributions
spe-0.028
ho-0.026
work-0.025
melon-0.025
edient-0.024
Token to
Token ID284
Feature activation-4.52e-01

Pos logit contributions
!+0.001
.+0.001
 while+0.001
+0.000
!"+0.000
Neg logit contributions
spe-0.008
 peac-0.008
 prope-0.008
 narrative-0.008
 rob-0.007
Token decor
Token ID11705
Feature activation-3.33e-01

Pos logit contributions
 snack+0.008
 bed+0.007
!+0.007
 rest+0.007
 while+0.007
Neg logit contributions
spe-0.023
ho-0.022
work-0.021
 narrative-0.021
melon-0.020
Token He
Token ID679
Feature activation-3.41e-01

Pos logit contributions
 snack+0.008
 minutes+0.008
!+0.008
 Care+0.008
 Sure+0.008
Neg logit contributions
spe-0.021
 prope-0.020
work-0.019
melon-0.019
ho-0.019
Token had
Token ID550
Feature activation+3.42e-01

Pos logit contributions
!+0.006
 minutes+0.006
 Care+0.006
 spaghetti+0.006
 Dr+0.006
Neg logit contributions
spe-0.016
work-0.015
ho-0.014
 prope-0.014
 peac-0.014
Token glue
Token ID22749
Feature activation-5.28e-01

Pos logit contributions
spe+0.016
work+0.016
ho+0.015
 prope+0.014
fly+0.014
Neg logit contributions
 snack-0.007
 spaghetti-0.007
!-0.007
 minutes-0.006
 lunch-0.006
Token and
Token ID290
Feature activation-1.44e+00

Pos logit contributions
!+0.007
 while+0.005
.+0.004
 spaghetti+0.004
 snack+0.004
Neg logit contributions
spe-0.030
 peac-0.028
 narrative-0.028
 prope-0.028
work-0.028
Token cr
Token ID1067
Feature activation-8.92e-02

Pos logit contributions
 spaghetti+0.026
 snack+0.025
 minutes+0.023
 lunch+0.021
 meat+0.021
Neg logit contributions
spe-0.071
work-0.069
ho-0.068
 peac-0.065
 prope-0.063
Tokenay
Token ID323
Feature activation-2.85e+00

Pos logit contributions
 Care+0.002
 Sure+0.002
!+0.002
 As+0.002
 snack+0.002
Neg logit contributions
spe-0.004
work-0.004
ho-0.004
 peac-0.004
 prope-0.004
Tokenon
Token ID261
Feature activation-8.27e-01

Pos logit contributions
 Care+0.058
 Sure+0.058
 snack+0.055
 pharmacy+0.054
 minutes+0.054
Neg logit contributions
spe-0.122
work-0.119
ho-0.114
 peac-0.113
 prope-0.112
Token on
Token ID319
Feature activation-5.45e-01

Pos logit contributions
!+0.011
.+0.008
 snack+0.006
 while+0.006
 spaghetti+0.006
Neg logit contributions
spe-0.047
 prope-0.047
 narrative-0.045
 peac-0.045
work-0.043
Token his
Token ID465
Feature activation-9.61e-01

Pos logit contributions
!+0.009
 spaghetti+0.009
 snack+0.009
 Care+0.009
 chair+0.008
Neg logit contributions
spe-0.026
work-0.026
ho-0.025
 peac-0.025
 prope-0.024
Token fur
Token ID9230
Feature activation-1.28e+00

Pos logit contributions
 snack+0.018
 chair+0.017
 spaghetti+0.017
 bed+0.016
 lap+0.016
Neg logit contributions
spe-0.046
work-0.043
ho-0.042
 peac-0.041
ateful-0.040
Token and
Token ID290
Feature activation-7.89e-01

Pos logit contributions
!+0.006
 while+0.002
.+0.002
 snack+0.001
 spaghetti+0.000
Neg logit contributions
spe-0.083
 peac-0.080
 prope-0.078
work-0.078
 narrative-0.077
Token
Token ID198
Feature activation+1.56e-02

Pos logit contributions
 snowball+0.012
 minutes+0.012
 pharmacy+0.011
 snack+0.011
 camps+0.010
Neg logit contributions
spe-0.025
work-0.024
ho-0.023
 peac-0.023
 prope-0.022
Token
Token ID198
Feature activation-8.14e-01

Pos logit contributions
spe+0.001
ull+0.001
ho+0.001
 peac+0.000
work+0.000
Neg logit contributions
 pharmacy-0.000
 snack-0.000
 spaghetti-0.000
 minutes-0.000
 procedure-0.000
TokenShe
Token ID3347
Feature activation-4.26e-01

Pos logit contributions
 snack+0.017
 Sure+0.016
 pharmacy+0.016
 Dr+0.016
 spaghetti+0.016
Neg logit contributions
spe-0.035
 peac-0.032
work-0.032
ho-0.031
 prope-0.031
Token hadn
Token ID8020
Feature activation-1.05e+00

Pos logit contributions
 Care+0.009
 minutes+0.009
!+0.009
 Sure+0.008
 spaghetti+0.008
Neg logit contributions
spe-0.019
work-0.018
 prope-0.017
ho-0.017
 peac-0.017
Token't
Token ID470
Feature activation-5.19e-01

Pos logit contributions
 minutes+0.023
 Care+0.023
 Sure+0.022
 spaghetti+0.022
 snack+0.022
Neg logit contributions
spe-0.043
work-0.041
ho-0.041
 peac-0.040
 prope-0.039
Token seen
Token ID1775
Feature activation-2.85e+00

Pos logit contributions
 Care+0.010
 snack+0.010
!+0.010
 minutes+0.010
 Sure+0.009
Neg logit contributions
spe-0.024
work-0.023
ho-0.022
 prope-0.022
 peac-0.021
Token it
Token ID340
Feature activation-1.79e+00

Pos logit contributions
 Care+0.057
 Sure+0.056
 snack+0.055
 spaghetti+0.054
!+0.054
Neg logit contributions
spe-0.125
work-0.119
ho-0.116
 peac-0.113
 prope-0.112
Token before
Token ID878
Feature activation+2.42e-01

Pos logit contributions
 Care+0.034
 Sure+0.033
 snack+0.033
!+0.032
 minutes+0.032
Neg logit contributions
spe-0.080
work-0.077
ho-0.074
 peac-0.073
 prope-0.073
Token but
Token ID475
Feature activation-1.82e+00

Pos logit contributions
spe+0.011
work+0.011
ho+0.011
 peac+0.011
 prope+0.010
Neg logit contributions
 Care-0.004
 Sure-0.004
 snack-0.004
 pharmacy-0.003
 minutes-0.003
Token it
Token ID340
Feature activation-3.99e-01

Pos logit contributions
 Care+0.029
!+0.028
 Sure+0.027
 spaghetti+0.025
 minutes+0.025
Neg logit contributions
spe-0.094
work-0.089
 peac-0.087
 prope-0.086
ho-0.084
Token was
Token ID373
Feature activation-3.95e-01

Pos logit contributions
!+0.007
 Care+0.006
 snack+0.006
 minutes+0.006
 while+0.006
Neg logit contributions
spe-0.020
work-0.019
 prope-0.019
 peac-0.018
melon-0.017
Token.
Token ID13
Feature activation+1.90e-01

Pos logit contributions
spe+0.011
 peac+0.011
work+0.011
 prope+0.011
ho+0.011
Neg logit contributions
!-0.003
 Care-0.003
 Sure-0.003
 while-0.003
 snack-0.003
Token She
Token ID1375
Feature activation+1.28e-01

Pos logit contributions
spe+0.007
work+0.006
 prope+0.006
ho+0.006
ateful+0.006
Neg logit contributions
 snack-0.005
 snowball-0.005
 pharmacy-0.005
ittens-0.005
 clearing-0.004
Token used
Token ID973
Feature activation-1.73e-01

Pos logit contributions
spe+0.006
work+0.005
ho+0.005
 peac+0.005
melon+0.005
Neg logit contributions
 minutes-0.003
 Care-0.002
!-0.002
 Sure-0.002
 snack-0.002
Token many
Token ID867
Feature activation-1.58e+00

Pos logit contributions
 Care+0.003
 snack+0.003
 spaghetti+0.003
 minutes+0.003
!+0.003
Neg logit contributions
spe-0.008
work-0.008
ho-0.008
 peac-0.008
 prope-0.007
Token cr
Token ID1067
Feature activation-5.11e-01

Pos logit contributions
 Sure+0.044
 Care+0.042
 pharmacy+0.042
 As+0.041
 Dr+0.041
Neg logit contributions
spe-0.057
work-0.053
 prope-0.053
 peac-0.051
ho-0.050
Tokenay
Token ID323
Feature activation-2.84e+00

Pos logit contributions
 Care+0.011
 Sure+0.010
 snack+0.010
 minutes+0.010
 spaghetti+0.010
Neg logit contributions
spe-0.022
work-0.021
ho-0.020
 peac-0.020
 prope-0.020
Tokenons
Token ID684
Feature activation-1.08e+00

Pos logit contributions
 Sure+0.067
 Care+0.065
 pharmacy+0.065
 snack+0.063
 Dr+0.062
Neg logit contributions
spe-0.110
work-0.109
 peac-0.103
ho-0.102
 prope-0.102
Token and
Token ID290
Feature activation-7.22e-01

Pos logit contributions
!+0.011
 while+0.009
 Care+0.007
 snack+0.007
 spaghetti+0.007
Neg logit contributions
spe-0.062
work-0.059
 prope-0.059
 peac-0.058
ateful-0.056
Token paper
Token ID3348
Feature activation-6.71e-01

Pos logit contributions
 minutes+0.016
 spaghetti+0.015
 snack+0.015
 Care+0.015
 while+0.014
Neg logit contributions
spe-0.032
work-0.031
ho-0.030
melon-0.029
 peac-0.028
Token.
Token ID13
Feature activation+4.05e-01

Pos logit contributions
!+0.007
 while+0.005
 snack+0.005
.+0.005
 Care+0.005
Neg logit contributions
spe-0.039
work-0.035
 peac-0.035
 prope-0.035
ho-0.035
Token She
Token ID1375
Feature activation+7.68e-02

Pos logit contributions
spe+0.014
work+0.014
 prope+0.013
ho+0.013
 peac+0.013
Neg logit contributions
 snowball-0.010
 snack-0.010
 pharmacy-0.010
 clearing-0.010
!-0.009
Token You
Token ID921
Feature activation-1.20e+00

Pos logit contributions
 snack+0.030
 minutes+0.030
 Care+0.028
!+0.028
 spaghetti+0.027
Neg logit contributions
spe-0.057
work-0.054
ho-0.053
 peac-0.052
 prope-0.051
Token should
Token ID815
Feature activation-1.05e-01

Pos logit contributions
!+0.027
 rest+0.025
 snack+0.025
 Care+0.024
 nap+0.024
Neg logit contributions
spe-0.053
work-0.049
ho-0.048
 prope-0.047
melon-0.046
Token not
Token ID407
Feature activation+3.25e-01

Pos logit contributions
 Care+0.002
 snack+0.002
 Sure+0.002
!+0.002
 spaghetti+0.002
Neg logit contributions
spe-0.004
work-0.004
ho-0.004
 peac-0.004
 prope-0.004
Token use
Token ID779
Feature activation-1.39e-01

Pos logit contributions
spe+0.016
work+0.015
 prope+0.015
ho+0.015
 peac+0.014
Neg logit contributions
 snack-0.005
!-0.005
 Care-0.005
 spaghetti-0.005
 Sure-0.005
Token cr
Token ID1067
Feature activation-2.17e-01

Pos logit contributions
 Care+0.003
 Sure+0.003
 snack+0.003
 minutes+0.002
 spaghetti+0.002
Neg logit contributions
spe-0.006
work-0.006
ho-0.006
 peac-0.006
 prope-0.006
Tokenay
Token ID323
Feature activation-2.84e+00

Pos logit contributions
 Care+0.005
 Sure+0.005
 snack+0.005
 minutes+0.004
 pharmacy+0.004
Neg logit contributions
spe-0.009
work-0.009
ho-0.008
 peac-0.008
 prope-0.008
Tokenons
Token ID684
Feature activation-9.29e-01

Pos logit contributions
 Sure+0.061
 Care+0.060
 pharmacy+0.058
 snack+0.058
 As+0.056
Neg logit contributions
spe-0.118
work-0.115
ho-0.110
 peac-0.110
 prope-0.107
Token like
Token ID588
Feature activation-3.91e-01

Pos logit contributions
!+0.012
 snack+0.010
 spaghetti+0.010
 while+0.009
 minutes+0.009
Neg logit contributions
spe-0.050
work-0.047
 prope-0.046
 peac-0.046
ho-0.046
Token that
Token ID326
Feature activation+2.76e-02

Pos logit contributions
 spaghetti+0.007
 snack+0.006
!+0.006
 Care+0.006
 Dr+0.006
Neg logit contributions
spe-0.019
work-0.018
ho-0.017
 prope-0.017
 peac-0.017
Token."
Token ID526
Feature activation-1.51e-01

Pos logit contributions
spe+0.001
work+0.001
 prope+0.001
ho+0.001
 peac+0.001
Neg logit contributions
!-0.000
 snack-0.000
 spaghetti-0.000
 while-0.000
!"-0.000
Token
Token ID198
Feature activation+1.07e-02

Pos logit contributions
 minutes+0.003
 spaghetti+0.002
 pharmacy+0.002
 Care+0.002
 clearing+0.002
Neg logit contributions
spe-0.007
ho-0.006
work-0.006
melon-0.006
 peac-0.006
Token that
Token ID326
Feature activation+1.77e-01

Pos logit contributions
spe+0.002
work+0.002
ho+0.002
 prope+0.002
 peac+0.002
Neg logit contributions
!-0.001
 Dr-0.000
 Care-0.000
 spaghetti-0.000
 while-0.000
Token the
Token ID262
Feature activation-3.37e-01

Pos logit contributions
work+0.010
spe+0.010
ho+0.009
 prope+0.009
melon+0.009
Neg logit contributions
 spaghetti-0.002
 while-0.002
!-0.002
 snack-0.002
 Dr-0.002
Token flag
Token ID6056
Feature activation+2.42e-01

Pos logit contributions
 snack+0.009
 rest+0.009
!+0.008
 minutes+0.008
 chair+0.008
Neg logit contributions
spe-0.013
work-0.012
ho-0.012
ateful-0.011
edient-0.011
Token had
Token ID550
Feature activation+6.63e-01

Pos logit contributions
 prope+0.012
ateful+0.012
 peac+0.012
spe+0.012
 narrative+0.011
Neg logit contributions
!-0.004
.-0.002
 while-0.002
 rest-0.002
 snack-0.002
Token sm
Token ID895
Feature activation+7.87e-01

Pos logit contributions
spe+0.031
ho+0.030
work+0.029
 narrative+0.029
melon+0.028
Neg logit contributions
!-0.014
 snack-0.013
 rest-0.012
 walks-0.012
 minutes-0.012
Tokenelt
Token ID2120
Feature activation-2.84e+00

Pos logit contributions
spe+0.031
work+0.030
 prope+0.029
 peac+0.028
ho+0.028
Neg logit contributions
 Care-0.019
 Sure-0.019
 minutes-0.019
!-0.018
 spaghetti-0.018
Token really
Token ID1107
Feature activation+1.63e-01

Pos logit contributions
!+0.050
 spaghetti+0.041
 minutes+0.041
 while+0.040
 snack+0.040
Neg logit contributions
spe-0.143
work-0.138
 peac-0.137
 prope-0.133
ho-0.133
Token sour
Token ID11348
Feature activation-2.61e-01

Pos logit contributions
spe+0.008
work+0.008
 peac+0.008
 prope+0.007
ho+0.007
Neg logit contributions
!-0.003
 quick-0.003
 while-0.003
 minutes-0.003
 Care-0.003
Token!
Token ID0
Feature activation-8.62e-01

Pos logit contributions
!+0.003
 spaghetti+0.003
 snack+0.003
 while+0.002
 minutes+0.002
Neg logit contributions
spe-0.014
work-0.014
 prope-0.013
 peac-0.013
ho-0.013
Token Bob
Token ID5811
Feature activation-5.19e-01

Pos logit contributions
 snack+0.013
 minutes+0.013
 spaghetti+0.012
 pharmacy+0.012
 Care+0.012
Neg logit contributions
spe-0.041
work-0.039
ho-0.038
 peac-0.037
 prope-0.037
Token thought
Token ID1807
Feature activation-7.74e-01

Pos logit contributions
!+0.010
 quick+0.008
As+0.008
 As+0.008
+0.008
Neg logit contributions
work-0.024
 prope-0.024
spe-0.024
 narrative-0.023
 peac-0.023
Token,
Token ID11
Feature activation+2.51e-01

Pos logit contributions
spe+0.028
work+0.027
ho+0.026
 peac+0.026
 prope+0.026
Neg logit contributions
 Care-0.009
 Sure-0.009
 snack-0.009
 spaghetti-0.009
 minutes-0.009
Token the
Token ID262
Feature activation-3.03e-01

Pos logit contributions
work+0.012
ho+0.011
spe+0.011
 prope+0.011
 peac+0.011
Neg logit contributions
 while-0.004
!-0.004
 Care-0.004
 snack-0.004
 Sure-0.004
Token moral
Token ID6573
Feature activation+1.08e-01

Pos logit contributions
 Care+0.007
 snack+0.007
 Sure+0.007
 spaghetti+0.006
 minutes+0.006
Neg logit contributions
spe-0.013
work-0.012
ho-0.012
 prope-0.012
 peac-0.011
Token of
Token ID286
Feature activation+4.79e-01

Pos logit contributions
spe+0.005
work+0.005
ho+0.005
 prope+0.005
melon+0.005
Neg logit contributions
 Care-0.002
!-0.002
 Sure-0.002
 minutes-0.001
 snack-0.001
Token the
Token ID262
Feature activation+6.00e-03

Pos logit contributions
spe+0.023
work+0.021
ho+0.021
 prope+0.021
 peac+0.021
Neg logit contributions
 Dr-0.008
!-0.008
 snack-0.008
 Care-0.008
 spaghetti-0.008
Token story
Token ID1621
Feature activation+1.66e+00

Pos logit contributions
spe+0.000
work+0.000
ho+0.000
 prope+0.000
ateful+0.000
Neg logit contributions
 snack-0.000
 Care-0.000
 spaghetti-0.000
!-0.000
 rest-0.000
Token was
Token ID373
Feature activation+5.02e-01

Pos logit contributions
 prope+0.089
 peac+0.089
spe+0.084
work+0.083
ho+0.082
Neg logit contributions
!-0.023
 Sure-0.016
 Care-0.016
 As-0.016
-0.015
Token that
Token ID326
Feature activation+6.32e-01

Pos logit contributions
spe+0.027
 prope+0.027
work+0.025
ho+0.024
ateful+0.023
Neg logit contributions
!-0.007
 As-0.007
 Care-0.007
-0.007
 Dr-0.007
Token playing
Token ID2712
Feature activation+1.35e-01

Pos logit contributions
spe+0.031
work+0.029
ho+0.029
 prope+0.029
 peac+0.028
Neg logit contributions
!-0.011
 while-0.011
 Care-0.010
 spaghetti-0.010
 snack-0.010
Token jokes
Token ID14532
Feature activation+8.59e-01

Pos logit contributions
spe+0.006
work+0.006
ho+0.006
 peac+0.006
 prope+0.005
Neg logit contributions
 Care-0.003
 Sure-0.003
 snack-0.002
 spaghetti-0.002
 minutes-0.002
Token can
Token ID460
Feature activation+9.81e-01

Pos logit contributions
spe+0.041
 peac+0.040
 prope+0.039
ho+0.038
 rob+0.038
Neg logit contributions
!-0.018
 while-0.015
 spaghetti-0.013
 Care-0.013
 As-0.013
Token to
Token ID284
Feature activation+1.35e-01

Pos logit contributions
!+0.004
 while+0.002
 minutes+0.002
 Care+0.002
 snack+0.002
Neg logit contributions
spe-0.029
work-0.027
 narrative-0.027
ho-0.027
 peac-0.027
Token show
Token ID905
Feature activation-3.80e-01

Pos logit contributions
spe+0.007
work+0.006
ho+0.006
 peac+0.006
 prope+0.006
Neg logit contributions
 snack-0.002
 Care-0.002
!-0.002
 Sure-0.002
 rest-0.002
Token how
Token ID703
Feature activation+1.49e+00

Pos logit contributions
!+0.009
 Care+0.008
 As+0.007
.+0.007
 while+0.007
Neg logit contributions
spe-0.016
ho-0.016
work-0.016
melon-0.015
 peac-0.015
Token happy
Token ID3772
Feature activation+8.14e-01

Pos logit contributions
work+0.068
spe+0.067
ho+0.066
 prope+0.065
melon+0.064
Neg logit contributions
!-0.026
 Care-0.026
 quick-0.023
 minutes-0.023
 Sure-0.022
Token it
Token ID340
Feature activation+2.19e+00

Pos logit contributions
spe+0.042
work+0.040
ho+0.040
edient+0.039
 narrative+0.038
Neg logit contributions
!-0.014
 while-0.012
 snack-0.010
 minutes-0.010
 spaghetti-0.010
Token was
Token ID373
Feature activation+1.97e+00

Pos logit contributions
spe+0.110
ho+0.107
 prope+0.105
work+0.104
 peac+0.103
Neg logit contributions
!-0.037
 Care-0.027
 walks-0.027
 minutes-0.026
 rest-0.024
Token that
Token ID326
Feature activation+4.79e-01

Pos logit contributions
spe+0.114
work+0.109
ho+0.108
 prope+0.106
 peac+0.105
Neg logit contributions
!-0.015
 Care-0.012
 snack-0.011
 Sure-0.010
 minutes-0.010
Token the
Token ID262
Feature activation+4.52e-01

Pos logit contributions
spe+0.022
ho+0.022
work+0.021
ateful+0.021
 prope+0.021
Neg logit contributions
!-0.010
 rest-0.009
 while-0.008
 snack-0.008
.-0.008
Token storm
Token ID6388
Feature activation+1.59e+00

Pos logit contributions
spe+0.018
ho+0.017
work+0.017
ateful+0.016
melon+0.016
Neg logit contributions
 rest-0.012
 snack-0.011
 minutes-0.011
!-0.011
 clearing-0.010
Token was
Token ID373
Feature activation+1.54e+00

Pos logit contributions
ateful+0.075
 prope+0.073
 peac+0.073
spe+0.072
 narrative+0.070
Neg logit contributions
!-0.030
 while-0.022
 rest-0.021
.-0.020
 minutes-0.020
Token over
Token ID625
Feature activation+3.51e-01

Pos logit contributions
spe+0.078
ho+0.075
work+0.074
 narrative+0.072
ateful+0.071
Neg logit contributions
!-0.027
 while-0.021
 minutes-0.021
 Care-0.019
 snack-0.019
Token spend
Token ID4341
Feature activation+3.70e-01

Pos logit contributions
spe+0.023
work+0.022
ho+0.021
 peac+0.021
 prope+0.021
Neg logit contributions
 Care-0.010
 Sure-0.010
 snack-0.009
 minutes-0.009
 spaghetti-0.009
Token this
Token ID428
Feature activation+4.21e-01

Pos logit contributions
spe+0.015
work+0.015
 prope+0.014
 peac+0.014
ho+0.014
Neg logit contributions
 Sure-0.008
 Care-0.007
 pharmacy-0.007
 snack-0.007
 spaghetti-0.007
Token money
Token ID1637
Feature activation-3.31e-01

Pos logit contributions
spe+0.019
work+0.018
ho+0.018
 peac+0.018
 prope+0.018
Neg logit contributions
 snack-0.008
!-0.008
 Care-0.008
 Sure-0.007
 minutes-0.007
Token on
Token ID319
Feature activation+4.98e-01

Pos logit contributions
!+0.005
 Care+0.005
 snack+0.005
 Sure+0.005
 spaghetti+0.005
Neg logit contributions
spe-0.016
work-0.015
 peac-0.015
ho-0.015
 prope-0.015
Token anything
Token ID1997
Feature activation+1.04e+00

Pos logit contributions
spe+0.023
work+0.022
ho+0.021
 peac+0.021
 prope+0.021
Neg logit contributions
 Care-0.009
 snack-0.009
!-0.009
 spaghetti-0.009
 Sure-0.009
Token you
Token ID345
Feature activation+1.96e+00

Pos logit contributions
spe+0.052
work+0.050
ho+0.049
 peac+0.049
 prope+0.048
Neg logit contributions
 Care-0.014
!-0.014
 snack-0.014
 Sure-0.014
 minutes-0.013
Token want
Token ID765
Feature activation+3.70e-01

Pos logit contributions
spe+0.091
work+0.086
ho+0.084
 prope+0.084
 peac+0.083
Neg logit contributions
!-0.035
 Care-0.035
 snack-0.034
 minutes-0.033
 Sure-0.032
Token."
Token ID526
Feature activation+1.50e-01

Pos logit contributions
spe+0.019
work+0.018
ho+0.018
 peac+0.017
 prope+0.017
Neg logit contributions
 Care-0.005
 Sure-0.005
 snack-0.004
 minutes-0.004
 pharmacy-0.004
Token
Token ID198
Feature activation-1.16e-01

Pos logit contributions
spe+0.005
work+0.005
ho+0.005
melon+0.005
 peac+0.005
Neg logit contributions
 minutes-0.004
 pharmacy-0.004
 walks-0.004
 quick-0.003
 clearing-0.003
Token
Token ID198
Feature activation-4.33e-01

Pos logit contributions
 minutes+0.003
 pharmacy+0.003
 walks+0.003
 spaghetti+0.003
 Deal+0.003
Neg logit contributions
spe-0.005
ull-0.004
ho-0.004
work-0.004
 peac-0.004
TokenExc
Token ID40127
Feature activation-4.24e-01

Pos logit contributions
 Sure+0.009
 minutes+0.009
 pharmacy+0.009
 spaghetti+0.009
 Dr+0.009
Neg logit contributions
spe-0.019
ho-0.016
 peac-0.016
work-0.015
 prope-0.015
Token.
Token ID13
Feature activation-1.88e-01

Pos logit contributions
spe+0.012
 prope+0.011
work+0.011
 peac+0.011
ho+0.011
Neg logit contributions
!-0.002
 snack-0.002
 spaghetti-0.002
 Care-0.001
 while-0.001
Token Lily
Token ID20037
Feature activation-9.62e-03

Pos logit contributions
 minutes+0.003
 snack+0.003
!+0.003
 pharmacy+0.003
 Care+0.003
Neg logit contributions
spe-0.008
work-0.008
 peac-0.007
ho-0.007
melon-0.007
Token was
Token ID373
Feature activation-2.76e-01

Pos logit contributions
 Care+0.000
!+0.000
 Sure+0.000
 As+0.000
 pharmacy+0.000
Neg logit contributions
spe-0.000
work-0.000
 prope-0.000
 peac-0.000
ho-0.000
Token happy
Token ID3772
Feature activation+2.83e-01

Pos logit contributions
 Care+0.005
 Dr+0.005
!+0.005
 quick+0.005
 As+0.005
Neg logit contributions
spe-0.013
ho-0.013
work-0.012
 peac-0.012
 prope-0.012
Token to
Token ID284
Feature activation+1.33e+00

Pos logit contributions
work+0.011
spe+0.011
ho+0.010
 peac+0.010
melon+0.010
Neg logit contributions
 Care-0.007
 Sure-0.007
 pharmacy-0.007
 Ready-0.006
 snowball-0.006
Token share
Token ID2648
Feature activation+1.70e+00

Pos logit contributions
spe+0.064
work+0.061
ho+0.059
 prope+0.058
 peac+0.057
Neg logit contributions
 snack-0.022
 Care-0.022
!-0.021
 Sure-0.021
 As-0.020
Token her
Token ID607
Feature activation+5.05e-01

Pos logit contributions
spe+0.075
work+0.072
ho+0.069
 peac+0.068
 prope+0.068
Neg logit contributions
 Care-0.034
 Sure-0.033
 snack-0.032
 spaghetti-0.032
!-0.032
Token candy
Token ID18550
Feature activation+5.10e-02

Pos logit contributions
spe+0.025
work+0.024
ho+0.024
 prope+0.023
 peac+0.023
Neg logit contributions
 snack-0.007
 Care-0.007
 spaghetti-0.007
 Sure-0.007
!-0.006
Token with
Token ID351
Feature activation+1.07e+00

Pos logit contributions
spe+0.003
work+0.002
 prope+0.002
 peac+0.002
ho+0.002
Neg logit contributions
!-0.001
 snack-0.001
 spaghetti-0.001
 Care-0.001
 while-0.001
Token Emma
Token ID18966
Feature activation+5.78e-01

Pos logit contributions
spe+0.047
work+0.045
ho+0.044
 prope+0.043
 peac+0.043
Neg logit contributions
 Care-0.022
 Sure-0.021
 snack-0.021
!-0.021
 spaghetti-0.021
Token.
Token ID13
Feature activation+6.00e-02

Pos logit contributions
spe+0.029
work+0.028
ho+0.028
 peac+0.027
 prope+0.027
Neg logit contributions
 Care-0.007
 Sure-0.007
 snack-0.007
!-0.007
 spaghetti-0.007
Token put
Token ID1234
Feature activation-2.67e-01

Pos logit contributions
spe+0.001
work+0.001
 peac+0.001
ho+0.001
 prope+0.001
Neg logit contributions
 Care-0.000
 snack-0.000
!-0.000
 rest-0.000
 spaghetti-0.000
Token the
Token ID262
Feature activation+2.40e-01

Pos logit contributions
 Care+0.005
 snack+0.005
 spaghetti+0.005
!+0.005
 minutes+0.005
Neg logit contributions
spe-0.013
ho-0.012
work-0.011
 peac-0.011
 prope-0.011
Token food
Token ID2057
Feature activation-3.26e-01

Pos logit contributions
spe+0.012
work+0.011
ho+0.011
 prope+0.011
 peac+0.011
Neg logit contributions
 snack-0.004
 Care-0.004
 spaghetti-0.004
 minutes-0.004
!-0.004
Token in
Token ID287
Feature activation+7.71e-01

Pos logit contributions
 while+0.006
!+0.006
 snack+0.006
.+0.005
 spaghetti+0.005
Neg logit contributions
spe-0.016
 peac-0.015
 prope-0.015
work-0.014
ateful-0.014
Token the
Token ID262
Feature activation+4.59e-01

Pos logit contributions
spe+0.035
work+0.034
ho+0.033
 peac+0.032
 prope+0.032
Neg logit contributions
 Care-0.015
!-0.015
 snack-0.015
 minutes-0.014
 spaghetti-0.014
Token right
Token ID826
Feature activation+1.66e+00

Pos logit contributions
spe+0.022
work+0.021
ho+0.021
 peac+0.020
 prope+0.020
Neg logit contributions
 Care-0.007
 Sure-0.007
 snack-0.007
 minutes-0.007
 As-0.006
Token place
Token ID1295
Feature activation+5.58e-02

Pos logit contributions
spe+0.076
work+0.073
ho+0.070
 peac+0.069
 prope+0.069
Neg logit contributions
 Care-0.030
 Sure-0.029
 snack-0.028
 spaghetti-0.028
 Dr-0.027
Token.
Token ID13
Feature activation+1.67e-01

Pos logit contributions
spe+0.003
 peac+0.003
 prope+0.003
work+0.003
ho+0.003
Neg logit contributions
!-0.000
 while-0.000
 snack-0.000
 Care-0.000
 minutes-0.000
Token
Token ID198
Feature activation+4.56e-03

Pos logit contributions
spe+0.006
melon+0.006
ho+0.006
work+0.005
ateful+0.005
Neg logit contributions
 procedure-0.004
 pharmacy-0.004
 snack-0.004
 snowball-0.003
 bone-0.003
Token
Token ID198
Feature activation-4.60e-01

Pos logit contributions
spe+0.000
ho+0.000
ull+0.000
work+0.000
melon+0.000
Neg logit contributions
 procedure-0.000
 pharmacy-0.000
 snack-0.000
 spaghetti-0.000
amas-0.000
Token"
Token ID1
Feature activation+5.22e-01

Pos logit contributions
 Sure+0.010
 snack+0.009
 procedure+0.009
 pharmacy+0.009
 Care+0.009
Neg logit contributions
spe-0.019
ho-0.017
work-0.017
 peac-0.016
 narrative-0.016
Token take
Token ID1011
Feature activation-1.11e-01

Pos logit contributions
 snack+0.001
 Care+0.001
 rest+0.001
!+0.001
 while+0.001
Neg logit contributions
spe-0.002
work-0.002
 prope-0.002
ho-0.002
 peac-0.002
Token care
Token ID1337
Feature activation+2.91e-01

Pos logit contributions
 Care+0.002
 Sure+0.002
 snack+0.002
 minutes+0.002
!+0.002
Neg logit contributions
spe-0.005
work-0.005
ho-0.005
 peac-0.005
 prope-0.005
Token of
Token ID286
Feature activation+9.50e-02

Pos logit contributions
spe+0.013
work+0.013
ho+0.012
 peac+0.012
 prope+0.012
Neg logit contributions
 Care-0.005
 Sure-0.005
!-0.005
 snack-0.005
 As-0.005
Token your
Token ID534
Feature activation+5.58e-01

Pos logit contributions
spe+0.004
work+0.004
ho+0.004
 peac+0.004
 prope+0.004
Neg logit contributions
 Care-0.002
!-0.002
 snack-0.002
 Sure-0.002
 spaghetti-0.002
Token responsibilities
Token ID15171
Feature activation+3.49e-01

Pos logit contributions
spe+0.025
work+0.024
ho+0.023
 peac+0.023
 prope+0.023
Neg logit contributions
 Care-0.010
 Sure-0.010
 snack-0.010
 minutes-0.010
 spaghetti-0.010
Token,
Token ID11
Feature activation+1.30e-01

Pos logit contributions
spe+0.020
 peac+0.019
ho+0.019
 prope+0.019
work+0.019
Neg logit contributions
!-0.003
 while-0.003
 Care-0.002
 snack-0.002
 spaghetti-0.002
Token even
Token ID772
Feature activation+2.26e-01

Pos logit contributions
work+0.007
spe+0.006
ho+0.006
melon+0.006
 prope+0.006
Neg logit contributions
 while-0.002
!-0.002
 snack-0.002
 spaghetti-0.002
 Care-0.002
Token if
Token ID611
Feature activation-9.51e-02

Pos logit contributions
spe+0.011
work+0.010
ho+0.010
 peac+0.010
 prope+0.010
Neg logit contributions
 Care-0.004
 Sure-0.004
 snack-0.004
 minutes-0.004
 spaghetti-0.004
Token you
Token ID345
Feature activation+9.93e-01

Pos logit contributions
 Care+0.001
!+0.001
 snack+0.001
 Sure+0.001
 spaghetti+0.001
Neg logit contributions
spe-0.005
work-0.005
ho-0.005
 prope-0.005
 peac-0.005
Token don
Token ID836
Feature activation+9.77e-01

Pos logit contributions
spe+0.048
 prope+0.044
work+0.044
 peac+0.043
ho+0.043
Neg logit contributions
!-0.020
 rest-0.019
 snack-0.018
 nap-0.018
 Care-0.017
Token't
Token ID470
Feature activation+5.35e-01

Pos logit contributions
spe+0.050
work+0.048
ho+0.047
 peac+0.047
 prope+0.046
Neg logit contributions
!-0.013
 Care-0.013
 As-0.012
As-0.011
 snack-0.011
TokenJ
Token ID41
Feature activation-7.31e-01

Pos logit contributions
 snack+0.011
 pharmacy+0.011
 Sure+0.011
 Dr+0.010
 spaghetti+0.010
Neg logit contributions
spe-0.031
ho-0.028
 peac-0.028
work-0.027
 prope-0.026
Tokenill
Token ID359
Feature activation-2.88e-01

Pos logit contributions
 Care+0.024
 Sure+0.024
 snack+0.023
!+0.023
 minutes+0.023
Neg logit contributions
spe-0.022
work-0.021
ho-0.020
 prope-0.020
 peac-0.020
Token was
Token ID373
Feature activation-3.94e-01

Pos logit contributions
!+0.005
As+0.004
 walks+0.004
 minutes+0.004
 while+0.004
Neg logit contributions
work-0.014
spe-0.014
melon-0.013
ateful-0.013
ho-0.013
Token so
Token ID523
Feature activation-8.06e-01

Pos logit contributions
 Care+0.008
!+0.007
 quick+0.007
 As+0.007
 Dr+0.007
Neg logit contributions
spe-0.018
ho-0.018
work-0.018
 peac-0.017
 narrative-0.016
Token excited
Token ID6568
Feature activation-6.17e-01

Pos logit contributions
 quick+0.015
 Care+0.014
!+0.013
As+0.013
+0.013
Neg logit contributions
work-0.039
spe-0.038
ho-0.037
 peac-0.037
 prope-0.036
Token that
Token ID326
Feature activation+3.88e-01

Pos logit contributions
 Deal+0.017
 pharmacy+0.015
 Care+0.015
 snowball+0.015
 Ready+0.015
Neg logit contributions
work-0.023
spe-0.022
ho-0.022
 peac-0.021
melon-0.020
Token she
Token ID673
Feature activation-4.90e-01

Pos logit contributions
spe+0.022
work+0.021
 prope+0.021
ho+0.020
 peac+0.020
Neg logit contributions
!-0.004
 Care-0.003
 snack-0.003
 Sure-0.003
 spaghetti-0.003
Token decided
Token ID3066
Feature activation+7.83e-01

Pos logit contributions
 Care+0.012
!+0.012
 Sure+0.011
 snack+0.011
 minutes+0.011
Neg logit contributions
spe-0.019
work-0.019
ho-0.018
 peac-0.018
 prope-0.017
Token to
Token ID284
Feature activation+1.39e-01

Pos logit contributions
spe+0.040
work+0.039
ho+0.038
 prope+0.037
 peac+0.037
Neg logit contributions
 Care-0.010
!-0.010
 Sure-0.010
 snack-0.009
 As-0.009
Token make
Token ID787
Feature activation+3.04e-01

Pos logit contributions
spe+0.006
work+0.006
ho+0.006
 peac+0.006
 prope+0.006
Neg logit contributions
 Care-0.003
 Sure-0.002
 snack-0.002
!-0.002
 spaghetti-0.002
Token a
Token ID257
Feature activation-1.77e-01

Pos logit contributions
work+0.011
spe+0.011
 peac+0.011
 prope+0.010
 rob+0.010
Neg logit contributions
 Sure-0.008
 Care-0.007
 While-0.007
 minutes-0.007
 Deal-0.007
TokenJ
Token ID41
Feature activation-7.60e-01

Pos logit contributions
 Sure+0.014
 Care+0.014
 snack+0.014
 spaghetti+0.013
 minutes+0.013
Neg logit contributions
spe-0.037
work-0.035
ho-0.034
 peac-0.034
 prope-0.033
Tokenenny
Token ID11870
Feature activation-1.35e-01

Pos logit contributions
 Care+0.010
 Sure+0.010
!+0.009
 snack+0.009
 minutes+0.009
Neg logit contributions
spe-0.038
work-0.037
ho-0.036
 peac-0.036
 prope-0.036
Token's
Token ID338
Feature activation-7.01e-01

Pos logit contributions
!+0.003
As+0.003
 As+0.003
 walks+0.002
 minutes+0.002
Neg logit contributions
work-0.006
spe-0.006
ateful-0.006
 peac-0.006
 prope-0.006
Token dad
Token ID9955
Feature activation-4.52e-02

Pos logit contributions
 Care+0.017
 Sure+0.016
 snack+0.016
 minutes+0.016
 spaghetti+0.016
Neg logit contributions
spe-0.028
work-0.027
ho-0.026
 peac-0.026
 prope-0.025
Token called
Token ID1444
Feature activation-1.11e+00

Pos logit contributions
!+0.001
 Care+0.001
 Sure+0.001
 Dr+0.001
 minutes+0.001
Neg logit contributions
spe-0.002
work-0.002
 prope-0.002
 peac-0.002
ho-0.002
Token for
Token ID329
Feature activation-9.55e-02

Pos logit contributions
!+0.018
 Care+0.018
 Sure+0.017
 Dr+0.017
 snack+0.017
Neg logit contributions
spe-0.054
work-0.052
 prope-0.050
 peac-0.050
ho-0.049
Token her
Token ID607
Feature activation-2.66e-01

Pos logit contributions
 Sure+0.002
 Care+0.002
 As+0.002
 snowball+0.002
 While+0.002
Neg logit contributions
spe-0.004
 peac-0.004
work-0.004
ho-0.004
 prope-0.003
Token and
Token ID290
Feature activation+6.93e-02

Pos logit contributions
!+0.004
 snack+0.004
 Care+0.003
 Sure+0.003
 spaghetti+0.003
Neg logit contributions
spe-0.013
work-0.013
ho-0.013
 prope-0.013
 peac-0.013
Token she
Token ID673
Feature activation-7.27e-01

Pos logit contributions
spe+0.003
work+0.003
 prope+0.003
ho+0.003
 peac+0.003
Neg logit contributions
 Care-0.001
 Sure-0.001
!-0.001
 snack-0.001
 Dr-0.001
Token scrambled
Token ID34351
Feature activation-5.98e-01

Pos logit contributions
!+0.016
 Care+0.015
 Sure+0.015
 snack+0.015
 minutes+0.014
Neg logit contributions
work-0.030
spe-0.030
 prope-0.029
 peac-0.028
ho-0.028
Token back
Token ID736
Feature activation-5.30e-01

Pos logit contributions
 Care+0.013
 Sure+0.013
 snack+0.013
 minutes+0.013
 pharmacy+0.013
Neg logit contributions
spe-0.024
work-0.023
ho-0.023
 peac-0.022
 prope-0.022
Token mom
Token ID1995
Feature activation+1.48e-01

Pos logit contributions
spe+0.005
work+0.005
ho+0.005
 peac+0.004
 prope+0.004
Neg logit contributions
 snack-0.002
 Care-0.002
 spaghetti-0.002
!-0.002
 Sure-0.002
Token said
Token ID531
Feature activation+2.46e-02

Pos logit contributions
spe+0.006
work+0.006
 prope+0.006
ho+0.006
 peac+0.006
Neg logit contributions
!-0.003
 Care-0.003
 snack-0.003
 Dr-0.003
 Sure-0.003
Token she
Token ID673
Feature activation-2.22e-01

Pos logit contributions
spe+0.002
work+0.001
edient+0.001
 prope+0.001
 narrative+0.001
Neg logit contributions
!-0.000
 while-0.000
.-0.000
 she-0.000
,-0.000
Token would
Token ID561
Feature activation+6.64e-01

Pos logit contributions
 Care+0.006
!+0.006
 Sure+0.006
 spaghetti+0.006
 snack+0.006
Neg logit contributions
spe-0.008
work-0.008
 prope-0.008
ho-0.008
 peac-0.008
Token help
Token ID1037
Feature activation-2.39e-02

Pos logit contributions
spe+0.033
work+0.031
ho+0.030
 peac+0.029
 prope+0.029
Neg logit contributions
!-0.011
 snack-0.011
 Care-0.011
 rest-0.010
 while-0.010
Token.
Token ID13
Feature activation+7.20e-01

Pos logit contributions
 Care+0.000
 Sure+0.000
 minutes+0.000
 snack+0.000
 spaghetti+0.000
Neg logit contributions
spe-0.001
work-0.001
ho-0.001
 peac-0.001
 prope-0.001
Token Lily
Token ID20037
Feature activation+7.92e-01

Pos logit contributions
spe+0.031
work+0.030
ho+0.029
 peac+0.028
 prope+0.028
Neg logit contributions
 Care-0.014
 snack-0.014
 minutes-0.014
 Sure-0.014
!-0.014
Token felt
Token ID2936
Feature activation+5.91e-01

Pos logit contributions
spe+0.035
work+0.034
 prope+0.032
ho+0.032
 peac+0.032
Neg logit contributions
 Care-0.016
!-0.016
As-0.015
 As-0.015
 spaghetti-0.015
Token happy
Token ID3772
Feature activation+1.40e-01

Pos logit contributions
spe+0.023
work+0.022
ho+0.021
 peac+0.021
 prope+0.021
Neg logit contributions
 Care-0.015
 Sure-0.014
 snack-0.014
 minutes-0.014
 spaghetti-0.014
Token and
Token ID290
Feature activation+5.66e-02

Pos logit contributions
spe+0.006
work+0.006
ho+0.006
 peac+0.006
 prope+0.005
Neg logit contributions
 Care-0.003
 Sure-0.003
 pharmacy-0.003
 snack-0.003
 minutes-0.003
Token relieved
Token ID22598
Feature activation+4.13e-01

Pos logit contributions
spe+0.002
work+0.002
ho+0.002
 peac+0.002
 prope+0.002
Neg logit contributions
 Care-0.002
 Sure-0.002
 snack-0.001
 minutes-0.001
 spaghetti-0.001
Token in
Token ID287
Feature activation+1.00e-01

Pos logit contributions
 Care+0.021
 Sure+0.021
 snack+0.021
 minutes+0.020
 spaghetti+0.020
Neg logit contributions
spe-0.034
work-0.033
ho-0.031
 peac-0.031
 prope-0.030
Token the
Token ID262
Feature activation+1.99e-01

Pos logit contributions
spe+0.003
work+0.002
ho+0.002
 peac+0.002
 prope+0.002
Neg logit contributions
ittens-0.003
amas-0.003
 pharmacy-0.003
 Sure-0.003
 minutes-0.003
Token park
Token ID3952
Feature activation-2.44e-01

Pos logit contributions
spe+0.010
work+0.010
ho+0.010
 prope+0.010
 peac+0.009
Neg logit contributions
 Care-0.003
 Sure-0.002
 snack-0.002
 spaghetti-0.002
!-0.002
Token.
Token ID13
Feature activation-5.50e-01

Pos logit contributions
 Deal+0.004
 pharmacy+0.004
 snack+0.004
 Dr+0.004
 Sure+0.004
Neg logit contributions
spe-0.011
work-0.010
ho-0.010
 peac-0.010
melon-0.010
Token He
Token ID679
Feature activation-9.62e-01

Pos logit contributions
ittens+0.014
 snack+0.014
 minutes+0.012
 pharmacy+0.012
 snowball+0.012
Neg logit contributions
spe-0.020
ho-0.017
melon-0.017
work-0.016
 prope-0.015
Token saw
Token ID2497
Feature activation+9.85e-02

Pos logit contributions
 Care+0.015
 Sure+0.015
 snack+0.014
 minutes+0.014
 spaghetti+0.014
Neg logit contributions
spe-0.046
work-0.044
ho-0.043
 peac-0.042
 prope-0.042
Token something
Token ID1223
Feature activation+1.37e+00

Pos logit contributions
spe+0.006
work+0.005
ho+0.005
 prope+0.005
 peac+0.005
Neg logit contributions
 spaghetti-0.001
 Care-0.001
 snack-0.001
!-0.001
 Sure-0.001
Token strange
Token ID6283
Feature activation+1.26e+00

Pos logit contributions
spe+0.055
work+0.053
ho+0.050
 prope+0.050
 peac+0.049
Neg logit contributions
 Care-0.033
!-0.032
 snack-0.032
 Sure-0.031
 spaghetti-0.031
Token,
Token ID11
Feature activation+9.20e-01

Pos logit contributions
spe+0.066
work+0.063
 prope+0.062
 peac+0.062
ho+0.061
Neg logit contributions
!-0.016
 Care-0.014
 spaghetti-0.013
 snack-0.013
 minutes-0.013
Token it
Token ID340
Feature activation+9.00e-01

Pos logit contributions
spe+0.046
work+0.045
ho+0.043
 peac+0.042
 prope+0.042
Neg logit contributions
 spaghetti-0.015
!-0.015
 while-0.014
 snack-0.014
 Care-0.014
Token was
Token ID373
Feature activation+7.88e-01

Pos logit contributions
spe+0.044
work+0.043
 prope+0.042
 peac+0.040
ho+0.040
Neg logit contributions
!-0.016
 Care-0.014
 spaghetti-0.013
 snack-0.012
 clearing-0.012
Token day
Token ID1110
Feature activation-3.16e-01

Pos logit contributions
 Care+0.022
 snack+0.022
 Sure+0.022
 minutes+0.021
 spaghetti+0.021
Neg logit contributions
spe-0.037
work-0.036
ho-0.034
 peac-0.034
 prope-0.034
Token he
Token ID339
Feature activation+6.18e-02

Pos logit contributions
 while+0.003
!+0.003
 snack+0.003
 Dr+0.003
 Care+0.003
Neg logit contributions
spe-0.017
work-0.017
ho-0.016
 peac-0.016
 prope-0.016
Token would
Token ID561
Feature activation+5.76e-01

Pos logit contributions
spe+0.003
work+0.003
 peac+0.003
ho+0.003
 prope+0.003
Neg logit contributions
 spaghetti-0.001
 snack-0.001
 snowball-0.001
 while-0.001
 walks-0.001
Token make
Token ID787
Feature activation+3.29e-01

Pos logit contributions
spe+0.030
work+0.028
 narrative+0.027
ho+0.027
ateful+0.027
Neg logit contributions
 snack-0.009
 rest-0.008
!-0.008
 chew-0.008
 Care-0.008
Token new
Token ID649
Feature activation+6.05e-02

Pos logit contributions
spe+0.015
work+0.014
ho+0.014
 peac+0.014
 prope+0.014
Neg logit contributions
 Care-0.006
 snack-0.006
 Sure-0.006
 spaghetti-0.006
!-0.006
Token clay
Token ID21558
Feature activation-5.75e-01

Pos logit contributions
spe+0.003
ho+0.002
work+0.002
 peac+0.002
 prope+0.002
Neg logit contributions
 snack-0.001
 Care-0.001
!-0.001
 spaghetti-0.001
 Sure-0.001
Token figures
Token ID5538
Feature activation+1.17e-01

Pos logit contributions
!+0.010
 snack+0.009
 while+0.009
 spaghetti+0.009
 lunch+0.007
Neg logit contributions
spe-0.029
 prope-0.027
ho-0.027
 peac-0.026
work-0.026
Token and
Token ID290
Feature activation-2.08e-02

Pos logit contributions
spe+0.006
ho+0.006
 prope+0.006
work+0.006
 narrative+0.006
Neg logit contributions
!-0.002
 while-0.002
 snack-0.002
 Care-0.001
 spaghetti-0.001
Token dreams
Token ID10625
Feature activation-1.32e-01

Pos logit contributions
 snack+0.000
 while+0.000
 spaghetti+0.000
 rest+0.000
 Care+0.000
Neg logit contributions
spe-0.001
ho-0.001
work-0.001
 narrative-0.001
edient-0.001
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
!+0.002
 Care+0.001
 snack+0.001
 Sure+0.001
 while+0.001
Neg logit contributions
spe-0.007
ho-0.007
work-0.007
 peac-0.007
 prope-0.006
Token
Token ID198
Feature activation-2.60e-01

Pos logit contributions
spe+0.001
work+0.001
ho+0.001
melon+0.001
 prope+0.001
Neg logit contributions
 minutes-0.000
 pharmacy-0.000
 snack-0.000
 quick-0.000
!-0.000
Token said
Token ID531
Feature activation+4.01e-01

Pos logit contributions
spe+0.022
work+0.021
ho+0.020
melon+0.018
 peac+0.018
Neg logit contributions
 Sure-0.019
 snack-0.019
 pharmacy-0.018
 Care-0.018
 clearing-0.018
Token.
Token ID13
Feature activation-1.34e-01

Pos logit contributions
spe+0.019
work+0.019
ho+0.018
melon+0.017
 prope+0.017
Neg logit contributions
 clearing-0.006
 Care-0.006
 Sure-0.006
 pharmacy-0.006
As-0.006
Token She
Token ID1375
Feature activation+1.49e+00

Pos logit contributions
 snack+0.004
eh+0.004
 snowball+0.004
 clearing+0.004
 procedure+0.004
Neg logit contributions
spe-0.004
ho-0.004
work-0.004
melon-0.003
 prope-0.003
Token took
Token ID1718
Feature activation-6.64e-01

Pos logit contributions
spe+0.057
work+0.054
ho+0.052
 peac+0.051
 prope+0.050
Neg logit contributions
 Care-0.037
 Sure-0.037
 snack-0.036
 spaghetti-0.035
 minutes-0.035
Token their
Token ID511
Feature activation+2.62e-01

Pos logit contributions
 Sure+0.013
 Care+0.013
 snack+0.012
 minutes+0.012
!+0.012
Neg logit contributions
spe-0.028
work-0.028
ho-0.027
 prope-0.026
 peac-0.026
Token hands
Token ID2832
Feature activation-2.91e-01

Pos logit contributions
spe+0.010
ho+0.010
work+0.010
 prope+0.009
melon+0.009
Neg logit contributions
 Sure-0.006
 Care-0.006
 As-0.006
As-0.006
 clearing-0.006
Token and
Token ID290
Feature activation+3.59e-01

Pos logit contributions
 while+0.003
!+0.003
 As+0.003
 Care+0.002
 Sure+0.002
Neg logit contributions
spe-0.016
 peac-0.016
 narrative-0.015
 prope-0.015
work-0.015
Token walked
Token ID6807
Feature activation-1.58e-01

Pos logit contributions
spe+0.014
work+0.013
ho+0.013
 peac+0.013
 prope+0.012
Neg logit contributions
 Care-0.009
 Sure-0.009
 snack-0.008
 minutes-0.008
 pharmacy-0.008
Token with
Token ID351
Feature activation-3.96e-02

Pos logit contributions
 spaghetti+0.004
amas+0.004
 pharmacy+0.004
 Deal+0.004
ittens+0.004
Neg logit contributions
spe-0.006
work-0.006
ho-0.005
melon-0.005
 prope-0.005
Token them
Token ID606
Feature activation-3.76e-01

Pos logit contributions
 Sure+0.001
 pharmacy+0.001
 Care+0.001
 procedure+0.001
As+0.001
Neg logit contributions
work-0.001
spe-0.001
 prope-0.001
ho-0.001
melon-0.001
Token to
Token ID284
Feature activation+1.41e-02

Pos logit contributions
 Care+0.004
!+0.004
 Sure+0.003
 snack+0.003
 while+0.003
Neg logit contributions
spe-0.020
work-0.020
 peac-0.020
 prope-0.019
ho-0.019
Token to
Token ID284
Feature activation+9.52e-01

Pos logit contributions
spe+0.060
work+0.060
ho+0.058
 prope+0.058
 peac+0.058
Neg logit contributions
 Care-0.016
!-0.015
 Sure-0.015
 spaghetti-0.014
 snack-0.014
Token play
Token ID711
Feature activation+1.54e-01

Pos logit contributions
spe+0.045
work+0.043
ho+0.042
 peac+0.041
 prope+0.041
Neg logit contributions
 Care-0.016
!-0.016
 snack-0.016
 Sure-0.015
 spaghetti-0.015
Token chess
Token ID19780
Feature activation+1.16e+00

Pos logit contributions
spe+0.007
work+0.007
ho+0.007
 peac+0.007
 prope+0.006
Neg logit contributions
 Care-0.003
 Sure-0.003
 snack-0.003
 spaghetti-0.003
 minutes-0.003
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
spe+0.062
ho+0.060
 peac+0.060
work+0.059
 prope+0.059
Neg logit contributions
!-0.014
 Care-0.012
 spaghetti-0.011
 while-0.011
 minutes-0.011
Token He
Token ID679
Feature activation+5.12e-01

Pos logit contributions
spe+0.011
work+0.010
melon+0.010
ho+0.010
 prope+0.009
Neg logit contributions
 minutes-0.006
 snack-0.005
 pharmacy-0.005
 clearing-0.005
!-0.005
Token moved
Token ID3888
Feature activation-2.21e-01

Pos logit contributions
spe+0.021
work+0.021
ho+0.020
 peac+0.020
 prope+0.020
Neg logit contributions
 Care-0.012
!-0.011
 Sure-0.011
 As-0.011
 spaghetti-0.011
Token the
Token ID262
Feature activation-2.34e-01

Pos logit contributions
 Care+0.004
!+0.004
 minutes+0.004
 while+0.004
 Sure+0.004
Neg logit contributions
spe-0.010
work-0.010
ho-0.010
 peac-0.010
 prope-0.009
Token pieces
Token ID5207
Feature activation-1.14e-02

Pos logit contributions
 Sure+0.006
 Care+0.006
 snack+0.005
 minutes+0.005
 As+0.005
Neg logit contributions
spe-0.009
work-0.009
ho-0.008
 peac-0.008
 prope-0.008
Token around
Token ID1088
Feature activation-7.30e-01

Pos logit contributions
!+0.000
 while+0.000
 quick+0.000
 Care+0.000
 minutes+0.000
Neg logit contributions
spe-0.001
 peac-0.001
work-0.001
 prope-0.001
ho-0.000
Token and
Token ID290
Feature activation-3.15e-01

Pos logit contributions
!+0.013
 while+0.012
 spaghetti+0.011
 Sure+0.011
 Care+0.011
Neg logit contributions
spe-0.035
 peac-0.034
 prope-0.033
work-0.033
ho-0.033
Token then
Token ID788
Feature activation-2.76e-01

Pos logit contributions
 Sure+0.006
 while+0.006
!+0.006
 Care+0.006
 minutes+0.006
Neg logit contributions
spe-0.014
 peac-0.014
work-0.013
ho-0.013
 narrative-0.013
Token She
Token ID1375
Feature activation-3.54e-01

Pos logit contributions
spe+0.010
work+0.010
ho+0.009
 peac+0.009
 prope+0.009
Neg logit contributions
 Care-0.004
 Sure-0.004
 snack-0.004
 spaghetti-0.004
 minutes-0.003
Token is
Token ID318
Feature activation-1.73e-01

Pos logit contributions
 Care+0.007
!+0.007
 Sure+0.007
 spaghetti+0.006
 minutes+0.006
Neg logit contributions
spe-0.016
work-0.015
ho-0.015
 peac-0.014
melon-0.014
Token happy
Token ID3772
Feature activation+5.83e-02

Pos logit contributions
!+0.004
 Care+0.004
 minutes+0.003
 quick+0.003
 Dr+0.003
Neg logit contributions
spe-0.008
ho-0.008
work-0.007
 narrative-0.007
 peac-0.007
Token.
Token ID13
Feature activation+1.53e-01

Pos logit contributions
work+0.003
spe+0.003
ho+0.003
 peac+0.003
 prope+0.002
Neg logit contributions
 Sure-0.001
 Care-0.001
 pharmacy-0.001
 snack-0.001
 As-0.001
Token She
Token ID1375
Feature activation+3.86e-02

Pos logit contributions
spe+0.007
work+0.006
ho+0.006
 peac+0.006
 prope+0.006
Neg logit contributions
 snack-0.003
 minutes-0.003
 pharmacy-0.003
 Care-0.003
 spaghetti-0.003
Token thinks
Token ID6834
Feature activation-4.18e-01

Pos logit contributions
spe+0.002
work+0.002
ho+0.002
 peac+0.001
 prope+0.001
Neg logit contributions
 Care-0.001
 Sure-0.001
 snack-0.001
 minutes-0.001
 spaghetti-0.001
Token she
Token ID673
Feature activation-5.90e-01

Pos logit contributions
!+0.005
 Dr+0.005
 while+0.004
 spaghetti+0.004
 minutes+0.004
Neg logit contributions
spe-0.024
work-0.022
ho-0.022
edient-0.021
 peac-0.021
Token has
Token ID468
Feature activation+3.00e-01

Pos logit contributions
 Sure+0.016
 Care+0.015
 minutes+0.015
 As+0.015
 snack+0.015
Neg logit contributions
work-0.021
spe-0.021
ho-0.019
 peac-0.019
 prope-0.019
Token a
Token ID257
Feature activation-7.85e-01

Pos logit contributions
spe+0.016
ho+0.015
work+0.015
melon+0.014
 prope+0.014
Neg logit contributions
!-0.004
 minutes-0.004
 Care-0.004
 snack-0.004
 spaghetti-0.004
Token very
Token ID845
Feature activation-3.52e-01

Pos logit contributions
 Care+0.017
 snack+0.017
 Sure+0.017
!+0.016
 spaghetti+0.016
Neg logit contributions
spe-0.033
work-0.031
ho-0.030
 peac-0.029
 prope-0.029
Token sharp
Token ID7786
Feature activation-6.81e-01

Pos logit contributions
 Sure+0.008
 pharmacy+0.008
 Care+0.008
 Dr+0.008
 spaghetti+0.008
Neg logit contributions
spe-0.013
work-0.013
ho-0.012
 peac-0.012
 prope-0.012
Token.
Token ID13
Feature activation-4.21e-01

Pos logit contributions
spe+0.032
work+0.031
ho+0.030
 peac+0.030
 prope+0.029
Neg logit contributions
 Care-0.007
 Sure-0.007
 snack-0.007
 minutes-0.006
 spaghetti-0.006
Token He
Token ID679
Feature activation-3.71e-01

Pos logit contributions
 Care+0.006
 Sure+0.006
 snack+0.006
 minutes+0.006
 spaghetti+0.006
Neg logit contributions
spe-0.021
work-0.020
ho-0.019
 peac-0.019
 prope-0.019
Token watched
Token ID7342
Feature activation-9.28e-01

Pos logit contributions
!+0.008
 minutes+0.007
 Care+0.007
 Sure+0.007
 snowball+0.007
Neg logit contributions
spe-0.017
work-0.016
 peac-0.015
ho-0.015
 prope-0.015
Token helpless
Token ID21144
Feature activation-4.49e-01

Pos logit contributions
!+0.014
 while+0.013
 spaghetti+0.012
 Care+0.012
 snack+0.012
Neg logit contributions
spe-0.050
work-0.046
ho-0.045
 peac-0.045
 prope-0.044
Tokenly
Token ID306
Feature activation-1.21e+00

Pos logit contributions
!+0.009
 spaghetti+0.008
 Care+0.008
 minutes+0.008
 while+0.008
Neg logit contributions
spe-0.021
work-0.020
 prope-0.019
 peac-0.019
ho-0.019
Token as
Token ID355
Feature activation-3.18e-01

Pos logit contributions
!+0.022
 while+0.019
 snack+0.018
 Care+0.018
 spaghetti+0.017
Neg logit contributions
spe-0.058
work-0.056
 peac-0.056
 prope-0.055
 narrative-0.055
Token the
Token ID262
Feature activation-4.91e-01

Pos logit contributions
!+0.004
 Care+0.003
 Sure+0.003
 Dr+0.003
 spaghetti+0.003
Neg logit contributions
spe-0.017
work-0.017
ho-0.016
 peac-0.016
 prope-0.016
Token ash
Token ID12530
Feature activation-2.83e-01

Pos logit contributions
 Care+0.013
 snack+0.013
 Sure+0.013
 minutes+0.013
 spaghetti+0.012
Neg logit contributions
spe-0.019
work-0.018
ho-0.017
 prope-0.017
 peac-0.016
Token that
Token ID326
Feature activation+3.12e-01

Pos logit contributions
!+0.009
 snack+0.008
 Sure+0.008
 chair+0.008
 minutes+0.008
Neg logit contributions
spe-0.010
ateful-0.009
 peac-0.009
 prope-0.009
work-0.009
Token he
Token ID339
Feature activation-7.52e-01

Pos logit contributions
spe+0.017
work+0.016
 prope+0.016
ateful+0.015
ho+0.015
Neg logit contributions
 spaghetti-0.004
!-0.004
 he-0.004
 rest-0.003
 Dr-0.003
Token wanted
Token ID2227
Feature activation-3.50e-02

Pos logit contributions
!+0.017
 minutes+0.016
 spaghetti+0.015
 snack+0.015
 Care+0.015
Neg logit contributions
spe-0.031
work-0.031
 prope-0.029
 peac-0.029
ho-0.028
Token.
Token ID13
Feature activation+1.16e-01

Pos logit contributions
 Care+0.016
 Sure+0.015
 snack+0.015
!+0.014
 spaghetti+0.014
Neg logit contributions
spe-0.066
work-0.064
ho-0.062
 peac-0.061
 prope-0.061
Token Tim
Token ID5045
Feature activation-1.04e+00

Pos logit contributions
spe+0.005
work+0.004
melon+0.004
ho+0.004
 peac+0.004
Neg logit contributions
 minutes-0.003
 pharmacy-0.003
 snack-0.002
 Deal-0.002
Jeremy-0.002
Tokenmy
Token ID1820
Feature activation-2.88e-01

Pos logit contributions
 Sure+0.019
 Care+0.019
 minutes+0.018
 snack+0.018
 pharmacy+0.017
Neg logit contributions
spe-0.047
work-0.046
ho-0.044
 peac-0.043
 prope-0.043
Token had
Token ID550
Feature activation+4.14e-01

Pos logit contributions
!+0.005
As+0.005
 Care+0.005
 As+0.005
 spaghetti+0.005
Neg logit contributions
spe-0.013
work-0.012
ho-0.012
 narrative-0.012
 peac-0.012
Token never
Token ID1239
Feature activation+6.41e-01

Pos logit contributions
spe+0.019
ho+0.019
work+0.018
 narrative+0.018
melon+0.017
Neg logit contributions
 Care-0.008
!-0.008
 minutes-0.007
 spaghetti-0.007
 snack-0.007
Token seen
Token ID1775
Feature activation-2.24e+00

Pos logit contributions
spe+0.033
ho+0.032
work+0.031
 prope+0.031
 peac+0.030
Neg logit contributions
 minutes-0.008
!-0.008
 snack-0.008
 Care-0.008
 spaghetti-0.008
Token one
Token ID530
Feature activation-1.33e+00

Pos logit contributions
 Sure+0.046
 Care+0.044
 snack+0.043
 minutes+0.042
 pharmacy+0.042
Neg logit contributions
work-0.094
spe-0.093
 prope-0.088
 peac-0.088
ho-0.086
Token before
Token ID878
Feature activation+6.93e-01

Pos logit contributions
 Care+0.025
 Sure+0.025
 pharmacy+0.024
 snack+0.024
 minutes+0.023
Neg logit contributions
spe-0.058
work-0.058
ho-0.055
 peac-0.054
 prope-0.053
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
spe+0.033
work+0.033
ho+0.032
 peac+0.031
 prope+0.031
Neg logit contributions
 Care-0.010
 Sure-0.010
 minutes-0.009
 snack-0.009
 pharmacy-0.009
Token His
Token ID2399
Feature activation-7.58e-01

Pos logit contributions
 minutes+0.002
 pharmacy+0.002
 snack+0.002
 walks+0.002
!+0.002
Neg logit contributions
spe-0.004
work-0.004
melon-0.004
ho-0.004
 peac-0.004
Token dad
Token ID9955
Feature activation+1.42e-01

Pos logit contributions
 Sure+0.023
 Care+0.023
 clearing+0.023
 As+0.022
 Ready+0.022
Neg logit contributions
spe-0.024
work-0.024
ho-0.021
 prope-0.021
 peac-0.021
Token the
Token ID262
Feature activation+2.26e-01

Pos logit contributions
 Care+0.014
!+0.014
 snack+0.014
 Sure+0.013
 spaghetti+0.013
Neg logit contributions
spe-0.036
work-0.035
ho-0.034
 peac-0.033
 prope-0.032
Token post
Token ID1281
Feature activation-8.33e-01

Pos logit contributions
spe+0.009
work+0.009
ho+0.008
 peac+0.008
melon+0.008
Neg logit contributions
 Care-0.006
 snack-0.006
!-0.005
 minutes-0.005
 chair-0.005
Token.
Token ID13
Feature activation-1.83e-01

Pos logit contributions
!+0.008
 Care+0.008
 chair+0.007
 snack+0.007
 Sure+0.007
Neg logit contributions
spe-0.045
 peac-0.043
work-0.043
ho-0.043
 prope-0.043
Token She
Token ID1375
Feature activation-9.26e-02

Pos logit contributions
 snack+0.003
 spaghetti+0.003
 pharmacy+0.003
 minutes+0.003
 snowball+0.002
Neg logit contributions
spe-0.009
work-0.009
ho-0.008
 peac-0.008
 prope-0.008
Token scrub
Token ID27268
Feature activation+2.32e-01

Pos logit contributions
 Care+0.002
 minutes+0.002
!+0.002
 spaghetti+0.002
 snack+0.002
Neg logit contributions
spe-0.004
work-0.004
ho-0.004
 peac-0.004
 prope-0.003
Tokenbed
Token ID3077
Feature activation-2.05e+00

Pos logit contributions
spe+0.009
work+0.008
 peac+0.008
ho+0.008
 prope+0.008
Neg logit contributions
 Care-0.006
!-0.006
 Sure-0.006
 minutes-0.006
 spaghetti-0.006
Token the
Token ID262
Feature activation+1.75e-01

Pos logit contributions
 Care+0.036
!+0.035
 snack+0.034
 Sure+0.034
 spaghetti+0.034
Neg logit contributions
spe-0.097
work-0.093
ho-0.090
 peac-0.090
 prope-0.087
Token dirt
Token ID13647
Feature activation-7.01e-01

Pos logit contributions
spe+0.007
work+0.006
ho+0.006
 peac+0.006
 prope+0.006
Neg logit contributions
 Care-0.005
 snack-0.005
 Sure-0.004
 minutes-0.004
!-0.004
Token away
Token ID1497
Feature activation+2.31e-01

Pos logit contributions
 while+0.011
!+0.011
 chair+0.010
 snack+0.010
 Care+0.010
Neg logit contributions
spe-0.036
 peac-0.035
work-0.034
ateful-0.032
 prope-0.032
Token and
Token ID290
Feature activation-5.37e-01

Pos logit contributions
 peac+0.013
spe+0.012
ho+0.012
work+0.012
 narrative+0.012
Neg logit contributions
!-0.003
 while-0.003
 chair-0.002
 minutes-0.002
.-0.002
Token soon
Token ID2582
Feature activation-4.94e-01

Pos logit contributions
 minutes+0.012
 snack+0.011
 spaghetti+0.011
 Care+0.011
 while+0.011
Neg logit contributions
spe-0.024
work-0.023
ho-0.022
 peac-0.022
melon-0.021
Token
Token ID198
Feature activation-5.42e-01

Pos logit contributions
 minutes+0.008
 snack+0.008
 spaghetti+0.008
 pharmacy+0.007
 cafĂ©+0.007
Neg logit contributions
spe-0.028
work-0.026
ho-0.025
 prope-0.025
 peac-0.024
Token
Token ID198
Feature activation-8.57e-01

Pos logit contributions
 spaghetti+0.007
 minutes+0.007
 snack+0.007
 pharmacy+0.006
 Care+0.006
Neg logit contributions
spe-0.028
ho-0.025
work-0.025
 prope-0.025
 peac-0.025
TokenWith
Token ID3152
Feature activation-3.75e-01

Pos logit contributions
 Care+0.018
 Sure+0.018
 snack+0.018
 spaghetti+0.018
 minutes+0.017
Neg logit contributions
spe-0.036
work-0.035
ho-0.033
 peac-0.033
 prope-0.033
Token a
Token ID257
Feature activation-1.10e+00

Pos logit contributions
 Care+0.006
 Dr+0.006
!+0.006
 snack+0.006
 As+0.006
Neg logit contributions
spe-0.018
work-0.018
ho-0.017
 peac-0.016
 prope-0.016
Token to
Token ID284
Feature activation-5.39e-01

Pos logit contributions
 Care+0.024
 Sure+0.024
 spaghetti+0.023
 pharmacy+0.023
 As+0.022
Neg logit contributions
spe-0.046
work-0.043
ho-0.042
 peac-0.042
 prope-0.042
Tokenothy
Token ID14863
Feature activation-2.17e+00

Pos logit contributions
 snack+0.010
 Care+0.010
!+0.009
 Sure+0.009
 minutes+0.009
Neg logit contributions
spe-0.026
work-0.025
ho-0.024
 peac-0.023
 prope-0.023
Token grin
Token ID16770
Feature activation+7.54e-02

Pos logit contributions
 Care+0.045
 Sure+0.044
 spaghetti+0.044
 pharmacy+0.042
 snack+0.042
Neg logit contributions
spe-0.092
work-0.087
ho-0.085
 prope-0.084
 peac-0.084
Token,
Token ID11
Feature activation-1.98e-01

Pos logit contributions
spe+0.004
work+0.004
 narrative+0.004
 peac+0.004
ateful+0.004
Neg logit contributions
!-0.001
 while-0.001
 minutes-0.001
 As-0.001
 chair-0.001
Token Kenny
Token ID22102
Feature activation-9.16e-01

Pos logit contributions
 Care+0.003
 Dr+0.003
 Sure+0.003
!+0.003
 while+0.003
Neg logit contributions
work-0.009
spe-0.009
ho-0.009
 peac-0.009
melon-0.008
Token led
Token ID2957
Feature activation-3.95e-01

Pos logit contributions
!+0.021
 Care+0.020
 Sure+0.020
 As+0.020
 minutes+0.019
Neg logit contributions
spe-0.037
work-0.037
 peac-0.035
 prope-0.035
ho-0.035
Token the
Token ID262
Feature activation+6.63e-02

Pos logit contributions
 Care+0.008
 Sure+0.008
 spaghetti+0.008
 snack+0.008
 minutes+0.008
Neg logit contributions
spe-0.017
work-0.016
ho-0.016
 peac-0.015
 prope-0.015
Token excited
Token ID6568
Feature activation-7.27e-01

Pos logit contributions
 Care+0.007
 quick+0.007
 minutes+0.007
 spaghetti+0.006
 Sure+0.006
Neg logit contributions
spe-0.019
work-0.019
 peac-0.018
 prope-0.018
ho-0.018
Token.
Token ID13
Feature activation-2.74e-01

Pos logit contributions
 Care+0.011
 Sure+0.011
 pharmacy+0.011
 minutes+0.010
 snowball+0.010
Neg logit contributions
work-0.035
spe-0.034
ho-0.032
 peac-0.031
melon-0.031
Token She
Token ID1375
Feature activation-7.96e-01

Pos logit contributions
 minutes+0.005
 snack+0.005
 Care+0.005
 pharmacy+0.005
 spaghetti+0.005
Neg logit contributions
spe-0.012
work-0.011
ho-0.011
 peac-0.010
melon-0.010
Token had
Token ID550
Feature activation+3.49e-01

Pos logit contributions
!+0.019
 Care+0.019
 Sure+0.018
 minutes+0.018
 snack+0.018
Neg logit contributions
spe-0.032
work-0.031
ho-0.029
 peac-0.029
 prope-0.029
Token never
Token ID1239
Feature activation+7.43e-01

Pos logit contributions
spe+0.016
work+0.016
ho+0.016
 prope+0.015
 peac+0.015
Neg logit contributions
 Care-0.006
!-0.006
 minutes-0.006
 snack-0.006
 spaghetti-0.006
Token seen
Token ID1775
Feature activation-2.11e+00

Pos logit contributions
 prope+0.036
ho+0.036
work+0.034
spe+0.034
melon+0.033
Neg logit contributions
!-0.012
 minutes-0.012
 while-0.011
 snack-0.011
care-0.011
Token an
Token ID281
Feature activation-6.12e-01

Pos logit contributions
 spaghetti+0.031
!+0.030
 Care+0.030
 snack+0.029
 Dr+0.028
Neg logit contributions
spe-0.107
ho-0.101
work-0.100
 prope-0.096
 peac-0.096
Token aer
Token ID9551
Feature activation+1.87e-01

Pos logit contributions
!+0.015
 spaghetti+0.015
 snack+0.014
 Dr+0.013
 cafĂ©+0.013
Neg logit contributions
spe-0.026
 prope-0.024
ho-0.024
 narrative-0.022
work-0.022
Tokenopl
Token ID20106
Feature activation-5.55e-01

Pos logit contributions
spe+0.005
work+0.005
 peac+0.005
ho+0.005
 prope+0.005
Neg logit contributions
 Sure-0.007
 Care-0.007
 snack-0.006
 spaghetti-0.006
 pharmacy-0.006
Tokenane
Token ID1531
Feature activation-1.65e+00

Pos logit contributions
!+0.026
 Sure+0.026
 Care+0.026
 As+0.025
 Dr+0.025
Neg logit contributions
spe-0.009
work-0.008
 prope-0.007
ho-0.007
 peac-0.007
Token before
Token ID878
Feature activation+7.67e-01

Pos logit contributions
!+0.024
 snack+0.022
 while+0.021
 Care+0.021
 minutes+0.020
Neg logit contributions
spe-0.083
work-0.079
 prope-0.077
ho-0.077
 peac-0.077
Token worried
Token ID7960
Feature activation-2.63e-01

Pos logit contributions
 Care+0.014
 Sure+0.013
 snack+0.013
 minutes+0.013
 spaghetti+0.013
Neg logit contributions
spe-0.025
work-0.024
ho-0.023
 peac-0.023
 prope-0.023
Token.
Token ID13
Feature activation-3.00e-01

Pos logit contributions
 Care+0.003
!+0.003
 Sure+0.003
 snack+0.003
 minutes+0.003
Neg logit contributions
spe-0.014
work-0.013
ho-0.013
 peac-0.013
 prope-0.013
Token She
Token ID1375
Feature activation+1.26e-01

Pos logit contributions
 Care+0.005
 Sure+0.005
 snack+0.004
 minutes+0.004
 spaghetti+0.004
Neg logit contributions
spe-0.014
work-0.014
ho-0.013
 peac-0.013
 prope-0.013
Token had
Token ID550
Feature activation+1.78e-01

Pos logit contributions
spe+0.005
work+0.005
 prope+0.005
 peac+0.005
melon+0.005
Neg logit contributions
 minutes-0.003
 Care-0.003
!-0.003
 Sure-0.003
 walks-0.003
Token never
Token ID1239
Feature activation+7.22e-01

Pos logit contributions
work+0.009
spe+0.009
ho+0.008
melon+0.008
 prope+0.008
Neg logit contributions
 minutes-0.003
!-0.003
 Care-0.003
 walks-0.003
 snack-0.003
Token seen
Token ID1775
Feature activation-2.07e+00

Pos logit contributions
spe+0.036
work+0.035
ho+0.034
 prope+0.034
 peac+0.033
Neg logit contributions
 minutes-0.011
!-0.010
 snack-0.010
 Care-0.010
 spaghetti-0.010
Token fashion
Token ID6977
Feature activation-1.26e+00

Pos logit contributions
 Care+0.037
!+0.036
 spaghetti+0.036
 snack+0.035
 Dr+0.035
Neg logit contributions
spe-0.098
ho-0.092
work-0.091
 peac-0.087
 prope-0.087
Token before
Token ID878
Feature activation+5.82e-01

Pos logit contributions
!+0.019
 Care+0.018
 snack+0.017
 minutes+0.017
 Sure+0.017
Neg logit contributions
spe-0.062
work-0.059
ho-0.058
 peac-0.058
 prope-0.058
Token and
Token ID290
Feature activation-4.73e-01

Pos logit contributions
spe+0.029
work+0.028
ho+0.027
 peac+0.027
 prope+0.027
Neg logit contributions
 Care-0.008
 Sure-0.007
 snack-0.007
 minutes-0.007
 pharmacy-0.007
Token she
Token ID673
Feature activation-8.93e-01

Pos logit contributions
!+0.009
 while+0.008
 minutes+0.008
 Care+0.008
 spaghetti+0.007
Neg logit contributions
spe-0.022
work-0.022
 prope-0.021
 peac-0.021
ho-0.020
Token was
Token ID373
Feature activation-4.81e-01

Pos logit contributions
!+0.020
 minutes+0.018
 Care+0.018
 Sure+0.017
 snack+0.017
Neg logit contributions
work-0.038
spe-0.037
 prope-0.036
 peac-0.035
melon-0.034