TOP ACTIVATIONS
MAX = 0.074

was a grand door . Bes ide the door , there
. " What did you wish for ?" her mom my
girl . What do you wish to gain ?" Jane
" What did you wish for ?" Dad asked .
" What do you wish for ?" The mel on
, with all its different col ours , shapes and textures
" What do you wish for ?" L ila asks
, " What have you seen , John ?" He pointed
did something special . On Wed nes days she liked to
one night . It was dark , and their shadows were
, " Hi ! What 's that ?" The
Wed nes days because on Wed nes days she did something
what the rocket discovered in outer space . It was a
, " What is your dream ?" The
came by . " What 's wrong , little bird ie
What 's so special about Wed nes days ?"
her mum : " What 's that ?". Her mum said
. " What 's going on here ?" She
. " What 's wrong , kids ?" mom
her mom , " What 's that ?" Her

MIN ACTIVATIONS
MIN = -0.081

and try to grab the av oc ados . But the
they had never seen the av oc ados . They wished
so happy and brought the av oc ados home to share
going to get a l oll ipop for being so organized
As the farmer scattered the av oc ados , he noticed
won 't get a l oll ipop ." Lu
a dinosaur and a l oll ipop with a cherry .
wanted to win a l oll ipop too . But he
ongo didn 't like wet av oc ados , so he
down and spotted a l oll ipop on the ground .
hug and a big l oll ipop . Tim smiled and
and saw a three - year - old child . The
and she had a l oll ipop to show for it
. She wanted a l oll ipop now . She saw
backpack and grabbed the l oll ipop . He ran away
still couldn 't find any av oc ados . Finally ,
a sticker and a l oll ipop . L
girl who dropped a l oll ipop from her pocket .
the cheese and add the topp ings . Lily and Ben
she handed him a l oll ipop . She smiled and

INTERVAL 0.035 - 0.074
CONTAINS 0.107%

mom my , " What is this ?" "
to himself , " I wish I could stay here forever
thought carefully about what to wish for . Suddenly , he
the horizon . In the distance , she saw something shining
out proudly and say � � € � � Look mummy

INTERVAL -0.004 - 0.035
CONTAINS 65.505%

was glad that she improved her knitting . She was glad
Suddenly , his wife started running faster , and
and Lily liked to decor ate the pizza . They made
M ia says , " No , Tom . This is
her adventure ! <|endoftext|> Once upon a time , there was

INTERVAL -0.042 - -0.004
CONTAINS 34.375%

time , a little girl wanted to remember something really important
man . He had a big box with the deep pizza
proud of her daughter . One day , Molly
weigh the apples . " Wow , these apples are heavy
have a game and play . The dad heard

INTERVAL -0.081 - -0.042
CONTAINS 0.013%

the rip est , ju ic iest pear in the garden
the robot and found a loose screw . He used a
Anna . You are a mean mom . You don 't
wanted to win a l oll ipop too . But he
a funny video about a talking cat . They laughed and
Token was
Token ID373
Feature activation+6.40e-03

Pos logit contributions
bi+0.009
 edge+0.008
Larry+0.008
ling+0.008
Charles+0.008
Neg logit contributions
 cr-0.007
?-0.007
 sew-0.007
 forgiveness-0.006
 report-0.006
Token a
Token ID257
Feature activation+6.44e-03

Pos logit contributions
 cr+0.005
?+0.005
 sew+0.004
 forgiveness+0.004
 L+0.004
Neg logit contributions
bi-0.014
ling-0.013
 edge-0.013
acking-0.013
Larry-0.013
Token grand
Token ID4490
Feature activation+9.61e-03

Pos logit contributions
 cr+0.009
?+0.008
 sew+0.008
 forgive+0.008
 grandma+0.008
Neg logit contributions
bi-0.011
arted-0.011
eled-0.011
mo-0.011
umbling-0.011
Token door
Token ID3420
Feature activation+4.91e-03

Pos logit contributions
 cr+0.013
 sew+0.012
?+0.012
 forgiveness+0.012
 forgive+0.011
Neg logit contributions
bi-0.016
 edge-0.015
ling-0.014
Larry-0.014
Charles-0.014
Token.
Token ID13
Feature activation+6.32e-03

Pos logit contributions
 cr+0.005
 forgiveness+0.005
 sew+0.005
?+0.005
 heaven+0.005
Neg logit contributions
bi-0.009
 edge-0.009
ling-0.008
 distance-0.008
 across-0.008
Token Bes
Token ID30837
Feature activation+5.82e-02

Pos logit contributions
 cr+0.011
?+0.010
 sew+0.010
 forgiveness+0.010
 report+0.010
Neg logit contributions
bi-0.008
 edge-0.008
Larry-0.007
 valleys-0.007
Charles-0.007
Tokenide
Token ID485
Feature activation-4.48e-03

Pos logit contributions
 cr+0.087
?+0.084
 sew+0.083
 forgiveness+0.081
 report+0.078
Neg logit contributions
bi-0.085
 edge-0.079
Larry-0.075
ling-0.074
Charles-0.072
Token the
Token ID262
Feature activation+8.90e-03

Pos logit contributions
Phil+0.009
bi+0.009
 sudden+0.009
Chuck+0.008
Charles+0.008
Neg logit contributions
?-0.005
 cr-0.004
?"-0.004
 if-0.004
 whose-0.004
Token door
Token ID3420
Feature activation+1.01e-02

Pos logit contributions
 cr+0.015
?+0.014
 sew+0.014
 report+0.014
 grandma+0.013
Neg logit contributions
bi-0.012
Larry-0.011
ling-0.011
eled-0.010
acking-0.010
Token,
Token ID11
Feature activation+7.12e-03

Pos logit contributions
 cr+0.013
 forgiveness+0.012
 sew+0.012
?+0.012
 heaven+0.011
Neg logit contributions
bi-0.017
 edge-0.016
ling-0.015
 valleys-0.015
 distance-0.015
Token there
Token ID612
Feature activation-5.38e-03

Pos logit contributions
 cr+0.008
?+0.007
 sew+0.007
 forgiveness+0.007
 report+0.006
Neg logit contributions
bi-0.014
 edge-0.013
ling-0.012
Larry-0.012
acking-0.012
Token.
Token ID13
Feature activation+7.32e-03

Pos logit contributions
 cr+0.001
?+0.001
 report+0.001
 sew+0.001
 forgiveness+0.001
Neg logit contributions
bi-0.002
 edge-0.002
ling-0.002
 across-0.002
Larry-0.002
Token "
Token ID366
Feature activation-3.84e-04

Pos logit contributions
 cr+0.012
 forgiveness+0.012
?+0.012
 report+0.012
 heaven+0.012
Neg logit contributions
bi-0.009
 edge-0.008
acking-0.007
arted-0.007
 sudden-0.007
TokenWhat
Token ID2061
Feature activation+6.13e-03

Pos logit contributions
bi+0.000
 edge+0.000
Larry+0.000
Frank+0.000
Charles+0.000
Neg logit contributions
 cr-0.001
?-0.001
 sew-0.001
 forgiveness-0.001
 report-0.001
Token did
Token ID750
Feature activation+2.80e-02

Pos logit contributions
 extingu+0.009
 report+0.009
ands+0.009
 L+0.009
 sew+0.009
Neg logit contributions
Larry-0.007
bi-0.007
 giant-0.007
 across-0.007
 whole-0.006
Token you
Token ID345
Feature activation+1.83e-02

Pos logit contributions
 cr+0.028
/+0.028
 sew+0.028
 forgive+0.028
 report+0.026
Neg logit contributions
 edge-0.045
Larry-0.045
Charles-0.045
Frank-0.044
bi-0.044
Token wish
Token ID4601
Feature activation+5.69e-02

Pos logit contributions
 cr+0.023
 forgiveness+0.022
 heaven+0.021
 sew+0.020
 L+0.020
Neg logit contributions
 edge-0.029
bi-0.029
ling-0.027
 bottom-0.027
 distance-0.026
Token for
Token ID329
Feature activation+1.44e-02

Pos logit contributions
 cr+0.040
 sew+0.036
 forgiveness+0.035
?+0.033
 heaven+0.032
Neg logit contributions
bi-0.125
 edge-0.121
ling-0.116
Larry-0.115
Charles-0.113
Token?"
Token ID1701
Feature activation+6.19e-03

Pos logit contributions
/+0.014
 cr+0.012
 forgiveness+0.012
ird+0.012
 report+0.012
Neg logit contributions
 edge-0.027
 bottom-0.024
 across-0.024
Larry-0.024
Charles-0.024
Token her
Token ID607
Feature activation-2.53e-03

Pos logit contributions
 cr+0.009
 report+0.009
/+0.008
?+0.008
 forgiveness+0.008
Neg logit contributions
bi-0.008
 sudden-0.008
 edge-0.008
He-0.008
 across-0.007
Token mom
Token ID1995
Feature activation+1.35e-02

Pos logit contributions
bi+0.004
 edge+0.004
 bottom+0.003
 sudden+0.003
eled+0.003
Neg logit contributions
 sew-0.004
 heaven-0.003
?-0.003
 report-0.003
 cr-0.003
Tokenmy
Token ID1820
Feature activation+5.14e-04

Pos logit contributions
 cr+0.025
?+0.024
 sew+0.024
 forgiveness+0.023
 forgive+0.022
Neg logit contributions
bi-0.015
 edge-0.015
Larry-0.014
 valleys-0.014
Charles-0.013
Token girl
Token ID2576
Feature activation+8.04e-03

Pos logit contributions
 cr+0.030
?+0.028
 L+0.026
?"+0.025
 heaven+0.025
Neg logit contributions
bi-0.036
 edge-0.035
Back-0.034
Phil-0.034
eled-0.034
Token.
Token ID13
Feature activation+2.40e-02

Pos logit contributions
?+0.007
 cr+0.006
 sew+0.006
 forgiveness+0.005
?"+0.005
Neg logit contributions
bi-0.017
 edge-0.017
 sudden-0.016
Larry-0.016
ling-0.016
Token What
Token ID1867
Feature activation+9.08e-03

Pos logit contributions
 cr+0.028
?+0.026
 sew+0.026
 forgiveness+0.025
 report+0.024
Neg logit contributions
bi-0.043
 edge-0.041
Larry-0.039
ling-0.038
Charles-0.038
Token do
Token ID466
Feature activation+3.03e-02

Pos logit contributions
 sew+0.012
 report+0.011
 extingu+0.011
 L+0.010
 forgiveness+0.010
Neg logit contributions
bi-0.014
Larry-0.013
 across-0.011
Frank-0.011
Charles-0.011
Token you
Token ID345
Feature activation+2.16e-02

Pos logit contributions
 cr+0.023
 sew+0.020
?+0.019
 report+0.019
 forgiveness+0.018
Neg logit contributions
bi-0.066
Larry-0.062
 edge-0.062
Frank-0.060
Charles-0.060
Token wish
Token ID4601
Feature activation+5.68e-02

Pos logit contributions
 cr+0.025
 forgiveness+0.023
 sew+0.023
?+0.023
 report+0.021
Neg logit contributions
bi-0.039
 edge-0.036
ling-0.035
Larry-0.035
 distance-0.034
Token to
Token ID284
Feature activation+2.68e-02

Pos logit contributions
 cr+0.037
?+0.035
 sew+0.033
 forgiveness+0.032
 report+0.029
Neg logit contributions
bi-0.131
 edge-0.125
Larry-0.121
ling-0.120
eled-0.119
Token gain
Token ID4461
Feature activation+2.56e-02

Pos logit contributions
/+0.032
 cr+0.031
 whose+0.030
 forgiveness+0.029
ems+0.028
Neg logit contributions
 edge-0.045
Larry-0.041
 bottom-0.041
Frank-0.039
 peak-0.039
Token?"
Token ID1701
Feature activation+6.85e-03

Pos logit contributions
 cr+0.024
 sew+0.024
 report+0.022
 forgiveness+0.022
/+0.022
Neg logit contributions
 edge-0.047
bi-0.046
 across-0.045
Larry-0.044
ling-0.044
Token
Token ID198
Feature activation+2.92e-03

Pos logit contributions
 cr+0.009
?+0.009
 sew+0.008
 forgiveness+0.008
 L+0.008
Neg logit contributions
bi-0.011
 edge-0.011
ling-0.011
Larry-0.010
Charles-0.010
TokenJane
Token ID41083
Feature activation+5.35e-03

Pos logit contributions
?+0.003
 cr+0.003
 sew+0.003
 forgive+0.003
?"+0.003
Neg logit contributions
bi-0.006
ling-0.005
 edge-0.005
mo-0.005
ging-0.005
Token
Token ID198
Feature activation+5.01e-03

Pos logit contributions
 cr+0.001
?+0.001
 sew+0.001
 forgiveness+0.001
 report+0.001
Neg logit contributions
bi-0.001
 edge-0.001
Larry-0.001
ling-0.001
Charles-0.001
Token"
Token ID1
Feature activation+1.39e-02

Pos logit contributions
 cr+0.007
?+0.007
 sew+0.007
 forgiveness+0.006
 grandma+0.006
Neg logit contributions
bi-0.008
mo-0.007
ling-0.007
 edge-0.007
arted-0.007
TokenWhat
Token ID2061
Feature activation+7.09e-03

Pos logit contributions
 cr+0.019
?+0.019
 sew+0.018
 forgiveness+0.018
 report+0.018
Neg logit contributions
bi-0.021
 edge-0.020
Larry-0.020
Frank-0.019
 sudden-0.019
Token did
Token ID750
Feature activation+2.75e-02

Pos logit contributions
ands+0.010
 sew+0.010
 heaven+0.010
isher+0.009
ist+0.009
Neg logit contributions
Larry-0.009
 across-0.008
 giant-0.008
 whole-0.008
He-0.008
Token you
Token ID345
Feature activation+1.96e-02

Pos logit contributions
 sew+0.028
 forgive+0.028
 cr+0.027
ird+0.026
/+0.026
Neg logit contributions
Charles-0.045
 edge-0.045
Larry-0.044
Frank-0.043
 streets-0.042
Token wish
Token ID4601
Feature activation+5.65e-02

Pos logit contributions
 forgiveness+0.021
 cr+0.020
 sew+0.020
 heaven+0.020
 whose+0.019
Neg logit contributions
bi-0.034
 edge-0.033
ling-0.032
Larry-0.031
 bottom-0.031
Token for
Token ID329
Feature activation+1.37e-02

Pos logit contributions
 cr+0.039
 sew+0.035
 forgiveness+0.034
?+0.033
 heaven+0.030
Neg logit contributions
bi-0.126
 edge-0.122
ling-0.117
Larry-0.117
Charles-0.114
Token?"
Token ID1701
Feature activation+6.67e-03

Pos logit contributions
/+0.013
ems+0.012
pre+0.012
ird+0.012
 forgiveness+0.011
Neg logit contributions
 edge-0.025
 bottom-0.023
Larry-0.022
Charles-0.022
Frank-0.022
Token Dad
Token ID17415
Feature activation-1.01e-04

Pos logit contributions
 cr+0.008
 sew+0.007
?+0.007
 forgiveness+0.007
 report+0.007
Neg logit contributions
bi-0.012
 edge-0.011
Larry-0.011
Charles-0.011
ling-0.011
Token asked
Token ID1965
Feature activation+8.23e-03

Pos logit contributions
bi+0.000
 edge+0.000
ling+0.000
Larry+0.000
Charles+0.000
Neg logit contributions
 cr-0.000
 sew-0.000
?-0.000
 forgiveness-0.000
 report-0.000
Token.
Token ID13
Feature activation-3.30e-03

Pos logit contributions
 cr+0.005
?+0.004
 sew+0.004
 forgiveness+0.004
 forgive+0.003
Neg logit contributions
bi-0.020
 edge-0.019
Larry-0.018
Charles-0.018
ling-0.018
Token
Token ID198
Feature activation+2.92e-03

Pos logit contributions
bi+0.002
 edge+0.002
Larry+0.002
ling+0.002
Charles+0.002
Neg logit contributions
 cr-0.002
?-0.002
 sew-0.002
 forgiveness-0.002
 report-0.001
Token"
Token ID1
Feature activation+9.01e-03

Pos logit contributions
 cr+0.004
?+0.004
 sew+0.004
 L+0.003
 forgive+0.003
Neg logit contributions
bi-0.005
arted-0.004
zi-0.004
 peak-0.004
 summit-0.004
TokenWhat
Token ID2061
Feature activation+5.34e-03

Pos logit contributions
 cr+0.015
?+0.015
 sew+0.014
 forgiveness+0.014
 report+0.014
Neg logit contributions
bi-0.011
 edge-0.011
Larry-0.010
Frank-0.010
Charles-0.010
Token do
Token ID466
Feature activation+2.61e-02

Pos logit contributions
 sew+0.007
ands+0.007
 extingu+0.007
ist+0.007
isher+0.007
Neg logit contributions
Larry-0.007
 giant-0.006
bi-0.006
 across-0.006
He-0.006
Token you
Token ID345
Feature activation+2.04e-02

Pos logit contributions
/+0.027
 scrib+0.026
 heaven+0.025
 sew+0.025
 cr+0.025
Neg logit contributions
Larry-0.046
Charles-0.043
Frank-0.042
bi-0.041
 edge-0.041
Token wish
Token ID4601
Feature activation+5.43e-02

Pos logit contributions
 forgiveness+0.022
 cr+0.022
 sew+0.021
 report+0.020
 heaven+0.020
Neg logit contributions
bi-0.036
 edge-0.034
Larry-0.033
ling-0.033
 bottom-0.032
Token for
Token ID329
Feature activation+1.24e-02

Pos logit contributions
 cr+0.037
 sew+0.035
 forgiveness+0.033
 heaven+0.032
 scrib+0.030
Neg logit contributions
bi-0.114
 edge-0.114
ling-0.108
 across-0.107
Larry-0.106
Token?"
Token ID1701
Feature activation+4.64e-03

Pos logit contributions
/+0.012
pre+0.011
ird+0.011
ems+0.011
udd+0.010
Neg logit contributions
 edge-0.023
 middle-0.020
Larry-0.020
 bottom-0.020
 across-0.020
Token The
Token ID383
Feature activation-1.02e-02

Pos logit contributions
 cr+0.003
?+0.003
 sew+0.003
 forgiveness+0.003
 L+0.003
Neg logit contributions
bi-0.011
 edge-0.010
ling-0.010
Larry-0.010
acking-0.010
Token mel
Token ID7758
Feature activation-1.87e-02

Pos logit contributions
bi+0.021
Larry+0.019
 onward+0.019
 edge+0.019
Charles+0.019
Neg logit contributions
 cr-0.011
?-0.010
 forgiveness-0.009
 sew-0.009
?"-0.008
Tokenon
Token ID261
Feature activation-9.18e-03

Pos logit contributions
bi+0.032
 edge+0.030
Larry+0.028
ling+0.028
Charles+0.028
Neg logit contributions
 cr-0.024
 sew-0.022
?-0.022
 forgiveness-0.022
 report-0.021
Token,
Token ID11
Feature activation-2.21e-03

Pos logit contributions
 sew+0.032
 cr+0.032
 forgiveness+0.032
 report+0.031
 heaven+0.029
Neg logit contributions
bi-0.046
 edge-0.045
ling-0.044
 across-0.043
Larry-0.042
Token with
Token ID351
Feature activation-3.37e-03

Pos logit contributions
bi+0.004
 edge+0.004
ling+0.003
Larry+0.003
 distance+0.003
Neg logit contributions
 cr-0.003
?-0.003
 sew-0.003
 forgiveness-0.003
 report-0.002
Token all
Token ID477
Feature activation-1.03e-02

Pos logit contributions
bi+0.006
 edge+0.006
ling+0.006
Larry+0.006
acking+0.006
Neg logit contributions
 cr-0.004
?-0.004
 sew-0.004
 forgiveness-0.004
 forgive-0.003
Token its
Token ID663
Feature activation+4.48e-03

Pos logit contributions
bi+0.020
 edge+0.019
Larry+0.018
ling+0.018
Charles+0.018
Neg logit contributions
 cr-0.011
?-0.010
 sew-0.010
 forgiveness-0.009
 report-0.009
Token different
Token ID1180
Feature activation+1.25e-02

Pos logit contributions
 cr+0.006
?+0.005
 sew+0.005
 forgiveness+0.005
 report+0.005
Neg logit contributions
bi-0.008
 edge-0.007
Larry-0.007
ling-0.007
Charles-0.007
Tokencol
Token ID4033
Feature activation+5.42e-02

Pos logit contributions
 cr+0.016
?+0.015
 sew+0.015
 forgiveness+0.014
 report+0.014
Neg logit contributions
bi-0.021
 edge-0.020
Larry-0.019
ling-0.019
Charles-0.019
Tokenours
Token ID4662
Feature activation-3.52e-03

Pos logit contributions
?+0.085
 forgive+0.085
 forgiveness+0.085
 cr+0.083
 sew+0.080
Neg logit contributions
bi-0.075
arted-0.070
acking-0.070
eled-0.066
ling-0.066
Token,
Token ID11
Feature activation+4.48e-04

Pos logit contributions
bi+0.007
 edge+0.006
Larry+0.006
ling+0.006
Charles+0.006
Neg logit contributions
 cr-0.004
?-0.004
 sew-0.004
 forgiveness-0.003
 report-0.003
Token shapes
Token ID15268
Feature activation-6.10e-03

Pos logit contributions
 cr+0.001
?+0.001
 sew+0.001
 forgiveness+0.001
 report+0.000
Neg logit contributions
bi-0.001
 edge-0.001
Larry-0.001
ling-0.001
 valleys-0.001
Token and
Token ID290
Feature activation+5.60e-03

Pos logit contributions
bi+0.013
Charles+0.012
 edge+0.012
Larry+0.012
Frank+0.012
Neg logit contributions
 cr-0.005
?-0.005
 sew-0.005
 forgiveness-0.004
 L-0.004
Token textures
Token ID20028
Feature activation+1.18e-03

Pos logit contributions
 cr+0.007
 sew+0.006
 forgiveness+0.006
?+0.006
 scrib+0.006
Neg logit contributions
bi-0.010
 edge-0.010
Larry-0.009
ling-0.009
Charles-0.009
Token
Token ID198
Feature activation+3.10e-03

Pos logit contributions
bi+0.003
 edge+0.003
Larry+0.003
ling+0.002
Frank+0.002
Neg logit contributions
 cr-0.003
 sew-0.003
 forgiveness-0.003
?-0.003
 report-0.003
Token"
Token ID1
Feature activation+9.32e-03

Pos logit contributions
 cr+0.005
?+0.005
 sew+0.005
 forgiveness+0.005
 report+0.004
Neg logit contributions
bi-0.004
 edge-0.004
Larry-0.004
ling-0.004
Charles-0.004
TokenWhat
Token ID2061
Feature activation+6.21e-03

Pos logit contributions
 cr+0.018
?+0.018
 sew+0.017
 forgiveness+0.017
 report+0.017
Neg logit contributions
bi-0.009
 edge-0.009
Larry-0.008
Frank-0.008
 distance-0.008
Token do
Token ID466
Feature activation+2.68e-02

Pos logit contributions
 sew+0.009
ands+0.009
 Bian+0.009
ist+0.009
 extingu+0.009
Neg logit contributions
Larry-0.007
 giant-0.007
 across-0.006
bi-0.006
 bottom-0.006
Token you
Token ID345
Feature activation+2.02e-02

Pos logit contributions
 scrib+0.025
/+0.024
 cr+0.024
 sew+0.024
 heaven+0.024
Neg logit contributions
Larry-0.048
Charles-0.046
 edge-0.045
Frank-0.044
Bob-0.042
Token wish
Token ID4601
Feature activation+5.38e-02

Pos logit contributions
 forgiveness+0.022
 cr+0.022
 sew+0.022
 report+0.021
 heaven+0.021
Neg logit contributions
bi-0.036
 edge-0.034
Larry-0.032
ling-0.032
 bottom-0.032
Token for
Token ID329
Feature activation+1.25e-02

Pos logit contributions
 cr+0.045
 sew+0.043
 forgiveness+0.041
 heaven+0.040
 scrib+0.037
Neg logit contributions
 edge-0.106
bi-0.105
ling-0.100
 across-0.099
 bottom-0.098
Token?"
Token ID1701
Feature activation+1.95e-03

Pos logit contributions
/+0.009
ird+0.009
 forgiveness+0.009
pre+0.009
ems+0.009
Neg logit contributions
 edge-0.025
Frank-0.022
Charles-0.022
Larry-0.022
 bottom-0.022
Token L
Token ID406
Feature activation+8.11e-03

Pos logit contributions
 cr+0.001
 sew+0.001
 forgiveness+0.001
?+0.001
 report+0.001
Neg logit contributions
bi-0.005
 edge-0.005
Larry-0.005
 valleys-0.005
 sudden-0.005
Tokenila
Token ID10102
Feature activation+7.09e-03

Pos logit contributions
 cr+0.010
 sew+0.010
 forgiveness+0.010
 report+0.009
 heaven+0.009
Neg logit contributions
bi-0.014
 edge-0.013
ling-0.013
Larry-0.013
arted-0.012
Token asks
Token ID7893
Feature activation+2.35e-03

Pos logit contributions
?+0.008
 cr+0.008
 sew+0.007
?"+0.007
 forgiveness+0.007
Neg logit contributions
bi-0.014
 edge-0.013
Larry-0.012
ling-0.012
 distance-0.012
Token,
Token ID11
Feature activation-6.27e-03

Pos logit contributions
?+0.000
 cr+0.000
 if+0.000
 forgiveness+0.000
 forgive+0.000
Neg logit contributions
bi-0.000
 edge-0.000
umbling-0.000
 distance-0.000
acking-0.000
Token "
Token ID366
Feature activation-1.77e-02

Pos logit contributions
bi+0.012
ling+0.011
acking+0.011
eled+0.011
umbling+0.011
Neg logit contributions
?-0.007
 L-0.007
 cr-0.006
 forgive-0.006
 if-0.006
TokenWhat
Token ID2061
Feature activation+4.29e-03

Pos logit contributions
bi+0.027
ling+0.025
pered+0.024
acking+0.024
 edge+0.024
Neg logit contributions
?-0.027
 cr-0.026
 whose-0.025
 forgive-0.024
 someday-0.024
Token have
Token ID423
Feature activation+3.69e-02

Pos logit contributions
 sew+0.005
 report+0.005
 forgiveness+0.005
 cr+0.005
 heaven+0.004
Neg logit contributions
bi-0.007
Larry-0.007
 edge-0.007
 across-0.006
Charles-0.006
Token you
Token ID345
Feature activation+2.35e-02

Pos logit contributions
 sew+0.033
 cr+0.029
 report+0.029
 forgiveness+0.029
 forgive+0.027
Neg logit contributions
bi-0.073
 edge-0.070
 streets-0.069
Larry-0.069
Frank-0.069
Token seen
Token ID1775
Feature activation+5.26e-02

Pos logit contributions
 cr+0.031
?+0.030
 sew+0.028
?"+0.027
 report+0.026
Neg logit contributions
bi-0.040
 edge-0.038
Larry-0.037
Charles-0.036
ling-0.036
Token,
Token ID11
Feature activation+3.87e-03

Pos logit contributions
 cr+0.029
 sew+0.026
 forgiveness+0.024
?+0.024
 report+0.021
Neg logit contributions
bi-0.125
 edge-0.121
ling-0.116
Larry-0.116
Charles-0.114
Token John
Token ID1757
Feature activation-7.82e-03

Pos logit contributions
 cr+0.005
?+0.005
 grandma+0.004
 forgiveness+0.004
?"+0.004
Neg logit contributions
bi-0.007
 edge-0.007
 valleys-0.007
ling-0.006
acking-0.006
Token?"
Token ID1701
Feature activation-8.86e-04

Pos logit contributions
 edge+0.023
 sudden+0.023
 valleys+0.023
 summit+0.023
 distance+0.023
Neg logit contributions
?-0.002
?"-0.001
 cr+0.000
 tonight+0.001
/+0.001
Token He
Token ID679
Feature activation-1.42e-02

Pos logit contributions
bi+0.002
ling+0.002
 edge+0.002
 valleys+0.001
mo+0.001
Neg logit contributions
?-0.001
 cr-0.001
 grandma-0.001
 sew-0.001
 forgiveness-0.001
Token pointed
Token ID6235
Feature activation-2.57e-02

Pos logit contributions
bi+0.033
 edge+0.031
 valleys+0.031
 distance+0.031
umbling+0.030
Neg logit contributions
 cr-0.010
?-0.010
 sew-0.009
 whose-0.008
?"-0.008
Token did
Token ID750
Feature activation+4.32e-03

Pos logit contributions
 cr+0.013
 sew+0.012
 forgiveness+0.012
?+0.012
 report+0.011
Neg logit contributions
bi-0.015
 edge-0.014
ling-0.014
Larry-0.013
 valleys-0.013
Token something
Token ID1223
Feature activation+1.60e-02

Pos logit contributions
 cr+0.005
?+0.005
 sew+0.004
 forgiveness+0.004
 report+0.004
Neg logit contributions
bi-0.008
 edge-0.008
Larry-0.007
ling-0.007
Charles-0.007
Token special
Token ID2041
Feature activation+2.60e-02

Pos logit contributions
 cr+0.024
 sew+0.024
 forgiveness+0.024
 heaven+0.023
/+0.023
Neg logit contributions
bi-0.020
 edge-0.019
ling-0.019
 across-0.019
 bottom-0.017
Token.
Token ID13
Feature activation-3.91e-03

Pos logit contributions
 sew+0.035
 forgiveness+0.032
endant+0.031
 scrib+0.031
 heaven+0.031
Neg logit contributions
 across-0.041
 edge-0.034
bi-0.034
 onward-0.034
 onwards-0.034
Token On
Token ID1550
Feature activation+7.97e-03

Pos logit contributions
bi+0.006
ling+0.006
 edge+0.006
Larry+0.006
ging+0.006
Neg logit contributions
?-0.005
 cr-0.005
 sew-0.005
 forgiveness-0.005
 L-0.004
Token Wed
Token ID3300
Feature activation+5.22e-02

Pos logit contributions
 cr+0.009
 sew+0.009
 forgiveness+0.009
 forgive+0.008
?+0.008
Neg logit contributions
 edge-0.014
bi-0.014
 bottom-0.013
Larry-0.012
 valleys-0.012
Tokennes
Token ID2516
Feature activation+8.52e-03

Pos logit contributions
 cr+0.064
 forgiveness+0.062
?+0.061
 sew+0.059
 report+0.056
Neg logit contributions
bi-0.090
Larry-0.085
Charles-0.083
 edge-0.082
Frank-0.082
Tokendays
Token ID12545
Feature activation+1.45e-02

Pos logit contributions
 cr+0.007
?+0.007
 sew+0.007
 forgiveness+0.006
 report+0.006
Neg logit contributions
bi-0.018
 edge-0.017
Larry-0.017
ling-0.016
Charles-0.016
Token she
Token ID673
Feature activation+7.00e-03

Pos logit contributions
 cr+0.015
 sew+0.015
 forgiveness+0.015
?+0.014
 report+0.013
Neg logit contributions
bi-0.027
 edge-0.025
 streets-0.024
acking-0.024
ling-0.024
Token liked
Token ID8288
Feature activation+1.77e-02

Pos logit contributions
 forgiveness+0.010
 cr+0.009
 sew+0.009
?+0.009
 report+0.009
Neg logit contributions
bi-0.011
 edge-0.010
ling-0.009
mo-0.009
eled-0.009
Token to
Token ID284
Feature activation+1.13e-03

Pos logit contributions
 cr+0.017
?+0.015
 sew+0.015
 forgiveness+0.014
 L+0.014
Neg logit contributions
bi-0.037
 edge-0.035
Larry-0.034
ling-0.034
Charles-0.033
Token one
Token ID530
Feature activation+1.65e-02

Pos logit contributions
?+0.007
 cr+0.006
 sew+0.004
 whose+0.004
 someday+0.004
Neg logit contributions
bi-0.033
Larry-0.032
Charles-0.031
Frank-0.031
die-0.031
Token night
Token ID1755
Feature activation+2.46e-02

Pos logit contributions
?+0.022
 cr+0.022
 someday+0.019
 sew+0.018
 forgiveness+0.018
Neg logit contributions
bi-0.027
Larry-0.026
 edge-0.026
Charles-0.026
 valleys-0.025
Token.
Token ID13
Feature activation+2.28e-03

Pos logit contributions
 cr+0.016
?+0.015
 sew+0.013
 L+0.012
 grandma+0.011
Neg logit contributions
bi-0.059
Larry-0.055
eled-0.054
mo-0.054
Charles-0.054
Token It
Token ID632
Feature activation+1.39e-02

Pos logit contributions
 Lily+0.003
 Grand+0.002
 L+0.002
 grandma+0.002
?+0.002
Neg logit contributions
bi-0.004
ling-0.004
 eagle-0.004
mo-0.004
Larry-0.004
Token was
Token ID373
Feature activation+2.08e-02

Pos logit contributions
 cr+0.009
?+0.009
 sew+0.008
 L+0.007
?"+0.007
Neg logit contributions
bi-0.033
ging-0.032
pering-0.032
pered-0.031
 reins-0.031
Token dark
Token ID3223
Feature activation+5.22e-02

Pos logit contributions
 cr+0.019
?+0.017
 L+0.015
 sew+0.014
 grandma+0.014
Neg logit contributions
bi-0.046
mo-0.043
ling-0.043
Larry-0.043
acking-0.042
Token,
Token ID11
Feature activation+5.45e-03

Pos logit contributions
 cr+0.044
?+0.044
 sew+0.039
 report+0.034
?"+0.034
Neg logit contributions
bi-0.114
Larry-0.105
 edge-0.105
Charles-0.102
eled-0.101
Token and
Token ID290
Feature activation+2.45e-03

Pos logit contributions
 cr+0.004
?+0.004
 sew+0.004
 L+0.004
 tonight+0.003
Neg logit contributions
bi-0.013
Larry-0.011
mo-0.011
eled-0.011
Charles-0.011
Token their
Token ID511
Feature activation-3.34e-03

Pos logit contributions
 cr+0.002
?+0.002
 tonight+0.002
 we+0.002
 grandma+0.002
Neg logit contributions
bi-0.005
acking-0.005
pering-0.005
pered-0.005
Larry-0.005
Token shadows
Token ID16187
Feature activation+1.29e-02

Pos logit contributions
bi+0.006
acking+0.006
Frank+0.006
Larry+0.006
Charles+0.006
Neg logit contributions
 cr-0.004
 grandma-0.004
 grand-0.003
?-0.003
 L-0.003
Token were
Token ID547
Feature activation-4.94e-03

Pos logit contributions
 cr+0.015
?+0.014
 sew+0.013
 forgiveness+0.012
 report+0.012
Neg logit contributions
bi-0.024
Larry-0.023
Charles-0.023
Frank-0.022
Phil-0.022
Token,
Token ID11
Feature activation+4.38e-03

Pos logit contributions
?+0.002
 cr+0.002
 forgive+0.002
 L+0.002
 sew+0.002
Neg logit contributions
bi-0.009
 distance-0.008
 valleys-0.008
eled-0.008
Larry-0.008
Token "
Token ID366
Feature activation-3.42e-03

Pos logit contributions
 L+0.003
?+0.003
 don+0.003
 cr+0.003
 forgive+0.003
Neg logit contributions
bi-0.010
 depths-0.009
eled-0.009
umbling-0.009
 summit-0.009
TokenHi
Token ID17250
Feature activation-1.65e-03

Pos logit contributions
bi+0.006
 onward+0.006
pered+0.006
 onwards+0.006
pering+0.006
Neg logit contributions
L-0.004
?-0.004
 cr-0.004
 don-0.003
 L-0.003
Token!
Token ID0
Feature activation+1.05e-02

Pos logit contributions
bi+0.003
 edge+0.003
ling+0.003
Larry+0.003
 distance+0.003
Neg logit contributions
 cr-0.002
?-0.002
 sew-0.002
 forgiveness-0.002
?"-0.001
Token What
Token ID1867
Feature activation+7.63e-03

Pos logit contributions
 cr+0.014
?+0.014
 sew+0.013
 forgiveness+0.013
 forgive+0.012
Neg logit contributions
bi-0.017
 edge-0.016
ling-0.015
Larry-0.015
 distance-0.015
Token's
Token ID338
Feature activation+5.21e-02

Pos logit contributions
 sew+0.009
 report+0.009
 forgiveness+0.008
 heaven+0.008
 cr+0.008
Neg logit contributions
bi-0.012
Larry-0.012
 edge-0.011
 across-0.011
Frank-0.011
Token that
Token ID326
Feature activation+1.68e-02

Pos logit contributions
ird+0.066
 cr+0.064
 forgiveness+0.063
ems+0.063
 sew+0.062
Neg logit contributions
 across-0.081
 edge-0.078
 crashing-0.072
bi-0.072
 bottom-0.071
Token?"
Token ID1701
Feature activation+6.56e-03

Pos logit contributions
 cr+0.015
 sew+0.015
 heaven+0.015
 forgiveness+0.014
 whose+0.013
Neg logit contributions
 edge-0.033
bi-0.031
 bottom-0.031
 across-0.030
ling-0.030
Token
Token ID198
Feature activation+1.36e-03

Pos logit contributions
?+0.007
 cr+0.006
 sew+0.006
 forgive+0.006
 forgiveness+0.006
Neg logit contributions
bi-0.013
ling-0.012
ging-0.012
eled-0.012
mo-0.012
Token
Token ID198
Feature activation+5.71e-03

Pos logit contributions
?+0.001
 forgive+0.001
?"+0.001
 cr+0.001
 sew+0.001
Neg logit contributions
bi-0.003
ling-0.003
mo-0.003
eled-0.003
ging-0.003
TokenThe
Token ID464
Feature activation-8.79e-03

Pos logit contributions
?+0.006
 forgive+0.006
 cr+0.005
 whose+0.005
?"+0.005
Neg logit contributions
bi-0.012
mo-0.011
ling-0.010
eled-0.010
arted-0.010
Token Wed
Token ID3300
Feature activation+4.16e-02

Pos logit contributions
 cr+0.014
?+0.014
 sew+0.013
 forgiveness+0.013
 report+0.012
Neg logit contributions
bi-0.022
 edge-0.021
Larry-0.020
ling-0.020
 streets-0.019
Tokennes
Token ID2516
Feature activation+1.55e-03

Pos logit contributions
 cr+0.061
?+0.060
 forgiveness+0.060
 cleans+0.054
 heaven+0.053
Neg logit contributions
bi-0.061
 edge-0.057
Frank-0.057
rolling-0.056
Larry-0.056
Tokendays
Token ID12545
Feature activation+6.55e-03

Pos logit contributions
 forgiveness+0.002
 cr+0.002
 sew+0.002
?+0.002
 report+0.002
Neg logit contributions
bi-0.003
ling-0.002
 edge-0.002
mo-0.002
acking-0.002
Token because
Token ID780
Feature activation+2.48e-03

Pos logit contributions
 forgiveness+0.009
endant+0.008
 sew+0.008
prise+0.008
 cr+0.008
Neg logit contributions
 across-0.010
 streets-0.010
acking-0.009
 edge-0.009
 forward-0.009
Token on
Token ID319
Feature activation-3.18e-03

Pos logit contributions
 sew+0.003
 cr+0.003
 report+0.003
/+0.003
 forgiveness+0.003
Neg logit contributions
bi-0.004
 edge-0.004
 streets-0.004
 across-0.004
 bottom-0.004
Token Wed
Token ID3300
Feature activation+5.19e-02

Pos logit contributions
bi+0.006
 edge+0.006
Larry+0.005
ling+0.005
 valleys+0.005
Neg logit contributions
 cr-0.004
 sew-0.003
 forgiveness-0.003
?-0.003
 report-0.003
Tokennes
Token ID2516
Feature activation+6.40e-03

Pos logit contributions
 cr+0.071
 forgiveness+0.067
?+0.066
 sew+0.061
 L+0.059
Neg logit contributions
bi-0.082
Larry-0.078
 edge-0.078
Frank-0.077
Charles-0.076
Tokendays
Token ID12545
Feature activation+1.39e-02

Pos logit contributions
 cr+0.006
 sew+0.006
?+0.005
 forgiveness+0.005
 report+0.005
Neg logit contributions
bi-0.013
 edge-0.012
Larry-0.012
ling-0.012
acking-0.011
Token she
Token ID673
Feature activation+9.42e-03

Pos logit contributions
 forgiveness+0.018
 cr+0.017
 sew+0.017
 heaven+0.016
 report+0.016
Neg logit contributions
bi-0.021
 across-0.021
 streets-0.021
 edge-0.020
 bottom-0.020
Token did
Token ID750
Feature activation+4.32e-03

Pos logit contributions
 cr+0.013
 sew+0.012
 forgiveness+0.012
?+0.012
 report+0.011
Neg logit contributions
bi-0.015
 edge-0.014
ling-0.014
Larry-0.013
 valleys-0.013
Token something
Token ID1223
Feature activation+1.60e-02

Pos logit contributions
 cr+0.005
?+0.005
 sew+0.004
 forgiveness+0.004
 report+0.004
Neg logit contributions
bi-0.008
 edge-0.008
Larry-0.007
ling-0.007
Charles-0.007
Token what
Token ID644
Feature activation-9.21e-03

Pos logit contributions
bi+0.016
 edge+0.015
acking+0.015
ling+0.015
Larry+0.014
Neg logit contributions
 cr-0.009
?-0.008
 forgiveness-0.008
 sew-0.008
 heaven-0.008
Token the
Token ID262
Feature activation+6.12e-03

Pos logit contributions
bi+0.020
 edge+0.019
acking+0.018
ling+0.018
eled+0.018
Neg logit contributions
 cr-0.008
?-0.008
 forgiveness-0.007
 sew-0.007
 L-0.006
Token rocket
Token ID10701
Feature activation+1.82e-02

Pos logit contributions
 cr+0.009
?+0.008
 forgiveness+0.008
 grandma+0.008
 report+0.007
Neg logit contributions
bi-0.011
acking-0.010
eled-0.010
arted-0.010
zi-0.009
Token discovered
Token ID5071
Feature activation+1.45e-02

Pos logit contributions
 cr+0.022
?+0.021
 sew+0.020
 forgiveness+0.020
 report+0.019
Neg logit contributions
bi-0.032
 edge-0.030
Larry-0.029
acking-0.028
Charles-0.028
Token in
Token ID287
Feature activation+5.96e-03

Pos logit contributions
 cr+0.013
?+0.013
 sew+0.012
 forgiveness+0.012
 report+0.011
Neg logit contributions
bi-0.030
 edge-0.028
Larry-0.027
ling-0.027
Charles-0.027
Token outer
Token ID12076
Feature activation+5.19e-02

Pos logit contributions
 cr+0.006
?+0.006
 sew+0.006
 forgiveness+0.006
 report+0.005
Neg logit contributions
bi-0.011
 edge-0.011
ling-0.010
Larry-0.010
acking-0.010
Token space
Token ID2272
Feature activation+3.00e-02

Pos logit contributions
 sew+0.047
 forgiveness+0.044
 cr+0.043
 report+0.043
 don+0.041
Neg logit contributions
 edge-0.103
 valleys-0.102
 distance-0.101
 streets-0.098
 depths-0.098
Token.
Token ID13
Feature activation+7.73e-03

Pos logit contributions
 cr+0.024
?+0.023
 sew+0.019
 forgiveness+0.019
 someday+0.019
Neg logit contributions
bi-0.067
 edge-0.061
Larry-0.061
eled-0.061
arted-0.060
Token It
Token ID632
Feature activation+8.98e-03

Pos logit contributions
 L+0.008
?+0.008
 cr+0.008
 Sp+0.007
 grandma+0.007
Neg logit contributions
bi-0.014
ling-0.014
 edge-0.014
ging-0.013
eled-0.013
Token was
Token ID373
Feature activation+1.51e-02

Pos logit contributions
 cr+0.009
?+0.008
 someday+0.007
 sew+0.007
 don+0.007
Neg logit contributions
bi-0.018
pering-0.016
ging-0.016
acking-0.016
 edge-0.016
Token a
Token ID257
Feature activation+1.01e-02

Pos logit contributions
?+0.014
 cr+0.013
 L+0.011
 grandma+0.011
 where+0.011
Neg logit contributions
bi-0.032
acking-0.030
 edge-0.030
ling-0.029
umbling-0.029
Token,
Token ID11
Feature activation+1.03e-03

Pos logit contributions
 cr+0.007
?+0.007
 forgiveness+0.006
 sew+0.006
 forgive+0.006
Neg logit contributions
bi-0.018
 edge-0.017
ling-0.016
Larry-0.016
acking-0.016
Token "
Token ID366
Feature activation-1.13e-02

Pos logit contributions
 cr+0.001
?+0.001
 sew+0.001
 forgiveness+0.001
 forgive+0.001
Neg logit contributions
bi-0.002
 edge-0.002
ling-0.002
Larry-0.002
 distance-0.001
TokenWhat
Token ID2061
Feature activation+7.09e-03

Pos logit contributions
bi+0.016
ling+0.014
acking+0.014
 edge+0.013
 onward+0.013
Neg logit contributions
 cr-0.018
?-0.018
 sew-0.018
 forgiveness-0.017
 forgive-0.017
Token is
Token ID318
Feature activation+3.59e-02

Pos logit contributions
 sew+0.009
 report+0.009
 heaven+0.008
 forgiveness+0.008
 L+0.008
Neg logit contributions
bi-0.011
Larry-0.010
Charles-0.009
 across-0.009
Frank-0.009
Token your
Token ID534
Feature activation+1.09e-02

Pos logit contributions
 sew+0.044
 cr+0.043
 forgiveness+0.042
 forgive+0.042
ird+0.041
Neg logit contributions
 across-0.058
 edge-0.057
Charles-0.055
Frank-0.054
 streets-0.053
Token dream
Token ID4320
Feature activation+5.16e-02

Pos logit contributions
 cr+0.015
 sew+0.014
?+0.014
 forgiveness+0.014
 report+0.013
Neg logit contributions
bi-0.017
 edge-0.016
Larry-0.015
ling-0.015
Charles-0.015
Token?"
Token ID1701
Feature activation+3.65e-03

Pos logit contributions
 cr+0.032
 sew+0.028
 forgiveness+0.028
 L+0.024
 heaven+0.023
Neg logit contributions
bi-0.117
 edge-0.115
ling-0.111
 distance-0.109
Larry-0.108
Token
Token ID220
Feature activation-2.23e-02

Pos logit contributions
 cr+0.005
 forgiveness+0.005
?+0.005
 sew+0.005
 report+0.005
Neg logit contributions
bi-0.005
 edge-0.005
Larry-0.005
 valleys-0.005
Frank-0.005
Token
Token ID198
Feature activation-1.60e-03

Pos logit contributions
bi+0.038
 edge+0.035
ling+0.034
Larry+0.034
 distance+0.033
Neg logit contributions
 cr-0.028
?-0.027
 sew-0.027
 forgiveness-0.026
 report-0.025
Token
Token ID198
Feature activation+3.48e-03

Pos logit contributions
bi+0.003
 edge+0.003
ling+0.003
eled+0.003
mo+0.003
Neg logit contributions
 cr-0.002
?-0.002
 sew-0.002
 forgiveness-0.002
 forgive-0.002
TokenThe
Token ID464
Feature activation-9.81e-03

Pos logit contributions
 sew+0.005
?+0.005
 cr+0.005
 forgive+0.005
 forgiveness+0.004
Neg logit contributions
bi-0.006
mo-0.005
eled-0.005
 edge-0.005
arted-0.005
Token came
Token ID1625
Feature activation-7.53e-03

Pos logit contributions
 summit+0.017
 streets+0.016
Back+0.016
umbling+0.016
bi+0.016
Neg logit contributions
?-0.007
 cr-0.007
?"-0.006
 L-0.005
 whose-0.005
Token by
Token ID416
Feature activation-5.20e-03

Pos logit contributions
 summit+0.014
bi+0.013
 bottom+0.013
Charles+0.013
 sudden+0.013
Neg logit contributions
?-0.009
 cr-0.008
 if-0.008
?"-0.008
 sew-0.007
Token.
Token ID13
Feature activation+6.16e-03

Pos logit contributions
bi+0.011
 edge+0.010
Larry+0.010
ling+0.010
 distance+0.010
Neg logit contributions
 cr-0.005
 sew-0.004
 forgiveness-0.004
?-0.004
 report-0.004
Token "
Token ID366
Feature activation-2.84e-04

Pos logit contributions
 cr+0.009
?+0.009
 sew+0.009
 forgiveness+0.009
 report+0.008
Neg logit contributions
bi-0.009
 edge-0.008
Larry-0.008
ling-0.008
Charles-0.008
TokenWhat
Token ID2061
Feature activation+6.72e-03

Pos logit contributions
bi+0.000
 edge+0.000
ling+0.000
Larry+0.000
acking+0.000
Neg logit contributions
 cr-0.000
 sew-0.000
?-0.000
 forgiveness-0.000
 forgive-0.000
Token's
Token ID338
Feature activation+5.15e-02

Pos logit contributions
 sew+0.009
 report+0.009
 forgiveness+0.009
 heaven+0.008
 someday+0.008
Neg logit contributions
bi-0.010
Larry-0.009
 across-0.009
 edge-0.008
Frank-0.008
Token wrong
Token ID2642
Feature activation+1.74e-02

Pos logit contributions
ird+0.093
 sew+0.090
 forgiveness+0.090
 someday+0.089
 cr+0.088
Neg logit contributions
 across-0.055
 edge-0.050
bi-0.050
ling-0.047
 bottom-0.046
Token,
Token ID11
Feature activation+9.22e-03

Pos logit contributions
 cr+0.013
 sew+0.012
 forgiveness+0.012
?+0.011
 report+0.011
Neg logit contributions
bi-0.038
 edge-0.037
Larry-0.035
ling-0.035
Frank-0.034
Token little
Token ID1310
Feature activation+1.48e-02

Pos logit contributions
 cr+0.011
 sew+0.011
 forgiveness+0.011
?+0.010
 report+0.010
Neg logit contributions
bi-0.015
 edge-0.015
Frank-0.014
Larry-0.014
Charles-0.014
Token bird
Token ID6512
Feature activation+8.28e-03

Pos logit contributions
 report+0.026
 sew+0.026
 cr+0.025
 forgiveness+0.025
?+0.024
Neg logit contributions
bi-0.016
 edge-0.016
Larry-0.014
ling-0.014
 bottom-0.014
Tokenie
Token ID494
Feature activation+5.48e-03

Pos logit contributions
 cr+0.010
?+0.009
 sew+0.009
 forgiveness+0.009
 report+0.008
Neg logit contributions
bi-0.015
 edge-0.014
Larry-0.013
ling-0.013
Charles-0.013
TokenWhat
Token ID2061
Feature activation+2.48e-03

Pos logit contributions
bi+0.015
ling+0.013
 edge+0.013
acking+0.012
arted+0.012
Neg logit contributions
 cr-0.015
?-0.014
 sew-0.014
 whose-0.013
 forgiveness-0.013
Token's
Token ID338
Feature activation+4.31e-02

Pos logit contributions
 heaven+0.003
 sew+0.003
 report+0.003
 L+0.003
 extingu+0.003
Neg logit contributions
Larry-0.003
bi-0.003
He-0.003
 bulls-0.003
Charles-0.003
Token so
Token ID523
Feature activation+1.38e-02

Pos logit contributions
 forgiveness+0.048
ird+0.048
 cr+0.046
 forgive+0.046
/+0.046
Neg logit contributions
 across-0.068
 edge-0.066
bi-0.064
Charles-0.064
Phil-0.061
Token special
Token ID2041
Feature activation+2.66e-02

Pos logit contributions
 cr+0.018
 sew+0.017
?+0.017
 forgiveness+0.017
 report+0.016
Neg logit contributions
bi-0.023
 edge-0.021
Larry-0.021
ling-0.020
Charles-0.020
Token about
Token ID546
Feature activation+4.56e-03

Pos logit contributions
 sew+0.015
 cr+0.015
 forgiveness+0.014
 heaven+0.014
/+0.012
Neg logit contributions
bi-0.059
 edge-0.057
 across-0.057
ling-0.056
Larry-0.054
Token Wed
Token ID3300
Feature activation+5.13e-02

Pos logit contributions
 cr+0.004
?+0.004
 sew+0.004
 forgiveness+0.004
?"+0.004
Neg logit contributions
bi-0.010
 edge-0.009
ling-0.009
Larry-0.009
 distance-0.008
Tokennes
Token ID2516
Feature activation+1.66e-02

Pos logit contributions
 cr+0.075
 forgiveness+0.071
?+0.069
 L+0.065
 sew+0.065
Neg logit contributions
bi-0.075
Larry-0.073
Frank-0.072
acking-0.071
Charles-0.071
Tokendays
Token ID12545
Feature activation+2.49e-02

Pos logit contributions
 cr+0.011
?+0.010
 sew+0.010
 forgiveness+0.010
 report+0.009
Neg logit contributions
bi-0.038
 edge-0.036
Larry-0.035
ling-0.035
Charles-0.034
Token?"
Token ID1701
Feature activation+3.29e-03

Pos logit contributions
 cr+0.015
 forgiveness+0.015
 sew+0.014
 heaven+0.014
 whose+0.012
Neg logit contributions
 edge-0.053
bi-0.053
ling-0.052
 streets-0.051
 across-0.051
Token
Token ID198
Feature activation-8.82e-04

Pos logit contributions
 cr+0.005
 sew+0.004
?+0.004
 forgiveness+0.004
 report+0.004
Neg logit contributions
bi-0.005
 edge-0.005
Larry-0.005
 valleys-0.004
ling-0.004
Token
Token ID198
Feature activation+3.84e-03

Pos logit contributions
bi+0.001
 edge+0.001
Larry+0.001
ling+0.001
Charles+0.001
Neg logit contributions
 cr-0.001
?-0.001
 sew-0.001
 forgiveness-0.001
 report-0.001
Token her
Token ID607
Feature activation+3.69e-03

Pos logit contributions
 cr+0.003
 sew+0.003
?+0.003
 forgiveness+0.003
 report+0.003
Neg logit contributions
bi-0.006
 edge-0.005
Larry-0.005
ling-0.005
Charles-0.005
Token mum
Token ID25682
Feature activation+2.89e-03

Pos logit contributions
 sew+0.005
 cr+0.005
?+0.005
 heaven+0.005
 forgiveness+0.005
Neg logit contributions
bi-0.006
 edge-0.005
Larry-0.005
 sudden-0.005
ling-0.005
Token:
Token ID25
Feature activation-2.42e-02

Pos logit contributions
 cr+0.001
?+0.001
 sew+0.001
 forgiveness+0.001
 forgive+0.001
Neg logit contributions
bi-0.007
 edge-0.007
Larry-0.007
Charles-0.007
 sudden-0.007
Token "
Token ID366
Feature activation-1.28e-02

Pos logit contributions
bi+0.049
ling+0.048
umbling+0.046
eled+0.046
arted+0.045
Neg logit contributions
?-0.025
 cr-0.023
 L-0.023
 forgive-0.022
 if-0.022
TokenWhat
Token ID2061
Feature activation+8.26e-03

Pos logit contributions
bi+0.018
ling+0.016
arted+0.016
pered+0.016
 edge+0.016
Neg logit contributions
 cr-0.020
?-0.020
 sew-0.019
 tonight-0.018
 whose-0.018
Token's
Token ID338
Feature activation+5.12e-02

Pos logit contributions
 sew+0.010
 heaven+0.010
 forgiveness+0.010
 report+0.010
 cr+0.009
Neg logit contributions
bi-0.012
Larry-0.011
 across-0.011
 edge-0.011
Frank-0.011
Token that
Token ID326
Feature activation+1.82e-02

Pos logit contributions
ird+0.070
 forgiveness+0.068
 cr+0.067
/+0.066
 forgive+0.066
Neg logit contributions
 across-0.073
 edge-0.067
 streets-0.064
bi-0.063
 middle-0.061
Token?".
Token ID43634
Feature activation+3.26e-03

Pos logit contributions
 cr+0.012
 sew+0.011
 forgiveness+0.011
?+0.009
 heaven+0.009
Neg logit contributions
bi-0.041
 edge-0.040
ling-0.038
Larry-0.038
 distance-0.038
Token Her
Token ID2332
Feature activation+1.52e-03

Pos logit contributions
 cr+0.004
?+0.004
 sew+0.004
 forgiveness+0.004
 report+0.004
Neg logit contributions
bi-0.006
 edge-0.005
Larry-0.005
ling-0.005
Charles-0.005
Token mum
Token ID25682
Feature activation+1.54e-02

Pos logit contributions
 cr+0.002
?+0.002
 grandma+0.002
 forgiveness+0.002
?"+0.002
Neg logit contributions
 valleys-0.002
bi-0.002
ling-0.002
acking-0.002
Charles-0.002
Token said
Token ID531
Feature activation+1.26e-03

Pos logit contributions
 cr+0.011
?+0.010
 sew+0.010
 forgiveness+0.009
 forgive+0.008
Neg logit contributions
bi-0.035
 edge-0.034
Larry-0.032
 valleys-0.032
ling-0.032
Token.
Token ID13
Feature activation-3.93e-03

Pos logit contributions
 edge+0.006
bi+0.006
ling+0.006
 across+0.005
 distance+0.005
Neg logit contributions
 cr-0.003
 sew-0.002
 forgiveness-0.002
?-0.002
 report-0.002
Token
Token ID198
Feature activation-3.19e-03

Pos logit contributions
bi+0.004
 edge+0.004
 streets+0.003
 valleys+0.003
Larry+0.003
Neg logit contributions
 cr-0.008
?-0.007
 forgiveness-0.007
 sew-0.007
 report-0.007
Token
Token ID198
Feature activation+2.62e-03

Pos logit contributions
bi+0.006
 edge+0.005
ling+0.005
Larry+0.005
eled+0.005
Neg logit contributions
 cr-0.004
?-0.004
 sew-0.004
 forgiveness-0.004
 report-0.003
Token"
Token ID1
Feature activation+1.25e-02

Pos logit contributions
 cr+0.004
 sew+0.004
?+0.004
 forgiveness+0.004
 report+0.004
Neg logit contributions
bi-0.004
 edge-0.003
ling-0.003
Larry-0.003
eled-0.003
TokenWhat
Token ID2061
Feature activation+6.30e-03

Pos logit contributions
 cr+0.021
 sew+0.021
?+0.020
 forgiveness+0.020
 report+0.019
Neg logit contributions
bi-0.016
 edge-0.015
ling-0.014
Larry-0.014
Charles-0.013
Token's
Token ID338
Feature activation+5.09e-02

Pos logit contributions
 sew+0.008
 heaven+0.008
ist+0.008
 report+0.007
 lav+0.007
Neg logit contributions
bi-0.009
Larry-0.008
He-0.008
Charlie-0.008
Steve-0.008
Token going
Token ID1016
Feature activation-1.34e-03

Pos logit contributions
 cr+0.085
 forgiveness+0.081
 sew+0.080
 heaven+0.076
 someday+0.076
Neg logit contributions
 across-0.058
bi-0.055
 edge-0.055
 streets-0.053
Phil-0.053
Token on
Token ID319
Feature activation+2.43e-02

Pos logit contributions
bi+0.002
 edge+0.002
Larry+0.002
ling+0.002
Charles+0.002
Neg logit contributions
 cr-0.002
?-0.002
 sew-0.002
 forgiveness-0.002
 report-0.002
Token here
Token ID994
Feature activation+1.11e-02

Pos logit contributions
 cr+0.020
 sew+0.019
 forgiveness+0.018
?+0.018
 report+0.016
Neg logit contributions
bi-0.051
 edge-0.049
Larry-0.047
ling-0.047
Charles-0.046
Token?"
Token ID1701
Feature activation+5.63e-03

Pos logit contributions
 cr+0.005
 sew+0.004
 forgiveness+0.004
 heaven+0.004
 whose+0.003
Neg logit contributions
bi-0.026
 edge-0.026
 across-0.025
Larry-0.025
ling-0.025
Token She
Token ID1375
Feature activation-1.18e-02

Pos logit contributions
 cr+0.006
 sew+0.006
 forgiveness+0.006
?+0.006
 report+0.005
Neg logit contributions
bi-0.010
 edge-0.010
 sudden-0.009
Larry-0.009
 streets-0.009
Token.
Token ID13
Feature activation+1.58e-03

Pos logit contributions
bi+0.011
 edge+0.011
ling+0.011
acking+0.010
 distance+0.010
Neg logit contributions
 cr-0.007
 forgiveness-0.007
 sew-0.006
?-0.006
 report-0.006
Token
Token ID198
Feature activation-1.85e-03

Pos logit contributions
 cr+0.002
?+0.002
 sew+0.002
 forgiveness+0.002
 report+0.002
Neg logit contributions
bi-0.003
 edge-0.002
Larry-0.002
ling-0.002
Charles-0.002
Token
Token ID198
Feature activation+3.06e-03

Pos logit contributions
bi+0.003
 edge+0.003
ling+0.003
Larry+0.003
eled+0.003
Neg logit contributions
 cr-0.002
?-0.002
 sew-0.002
 forgiveness-0.002
 report-0.002
Token"
Token ID1
Feature activation+1.19e-02

Pos logit contributions
 cr+0.005
?+0.004
 sew+0.004
 forgiveness+0.004
 forgive+0.004
Neg logit contributions
bi-0.005
 edge-0.004
Larry-0.004
ling-0.004
mo-0.004
TokenWhat
Token ID2061
Feature activation+8.09e-03

Pos logit contributions
 cr+0.022
?+0.021
 sew+0.021
 forgiveness+0.021
 report+0.020
Neg logit contributions
bi-0.013
 edge-0.012
Larry-0.011
ling-0.011
Charles-0.011
Token's
Token ID338
Feature activation+5.09e-02

Pos logit contributions
 sew+0.011
ist+0.011
 report+0.010
ands+0.010
endant+0.010
Neg logit contributions
bi-0.010
Larry-0.010
 across-0.010
Charlie-0.009
 bottom-0.009
Token wrong
Token ID2642
Feature activation+1.64e-02

Pos logit contributions
 cr+0.094
 sew+0.090
 forgiveness+0.086
ird+0.086
 opener+0.085
Neg logit contributions
bi-0.051
 across-0.049
ling-0.049
 edge-0.046
ging-0.046
Token,
Token ID11
Feature activation+8.57e-03

Pos logit contributions
 cr+0.015
 sew+0.014
 forgiveness+0.013
 heaven+0.013
 report+0.012
Neg logit contributions
bi-0.033
 edge-0.032
ling-0.030
Larry-0.030
Charles-0.030
Token kids
Token ID3988
Feature activation-2.31e-03

Pos logit contributions
 sew+0.013
 cr+0.013
 forgiveness+0.012
?+0.012
 report+0.012
Neg logit contributions
bi-0.011
 edge-0.011
ling-0.010
Larry-0.010
 sudden-0.010
Token?"
Token ID1701
Feature activation+4.39e-03

Pos logit contributions
bi+0.005
ling+0.005
 edge+0.005
 distance+0.005
 across+0.005
Neg logit contributions
 cr-0.002
 forgiveness-0.001
 Sp-0.001
 sew-0.001
mm-0.001
Token mom
Token ID1995
Feature activation+8.94e-03

Pos logit contributions
 cr+0.004
 sew+0.004
 forgiveness+0.003
?+0.003
 report+0.003
Neg logit contributions
bi-0.009
 edge-0.009
Larry-0.008
 sudden-0.008
Frank-0.008
Token her
Token ID607
Feature activation+3.95e-03

Pos logit contributions
 cr+0.010
 sew+0.010
?+0.010
 forgiveness+0.009
 report+0.009
Neg logit contributions
bi-0.015
 edge-0.014
Larry-0.013
Frank-0.013
ling-0.013
Token mom
Token ID1995
Feature activation+8.33e-03

Pos logit contributions
 sew+0.006
?+0.005
 heaven+0.005
 cr+0.005
 report+0.005
Neg logit contributions
bi-0.006
 edge-0.005
Larry-0.005
 sudden-0.005
ling-0.005
Token,
Token ID11
Feature activation-4.92e-03

Pos logit contributions
 cr+0.009
?+0.009
 sew+0.008
 forgiveness+0.008
 forgive+0.008
Neg logit contributions
bi-0.015
 edge-0.015
Larry-0.015
 valleys-0.015
 sudden-0.015
Token "
Token ID366
Feature activation-1.34e-02

Pos logit contributions
bi+0.007
 edge+0.006
Larry+0.006
Frank+0.006
ling+0.006
Neg logit contributions
 cr-0.008
 sew-0.007
?-0.007
 forgiveness-0.007
 report-0.007
TokenWhat
Token ID2061
Feature activation+6.13e-03

Pos logit contributions
bi+0.016
ling+0.014
 edge+0.014
Larry+0.014
acking+0.013
Neg logit contributions
 cr-0.025
?-0.024
 sew-0.023
 whose-0.022
 grandma-0.022
Token's
Token ID338
Feature activation+5.06e-02

Pos logit contributions
 forgiveness+0.009
 sew+0.009
ands+0.009
 report+0.009
 heaven+0.009
Neg logit contributions
bi-0.008
 across-0.007
Larry-0.006
 bottom-0.006
 giant-0.006
Token that
Token ID326
Feature activation+1.65e-02

Pos logit contributions
 forgiveness+0.077
 forgive+0.072
 opener+0.071
 someday+0.070
ird+0.067
Neg logit contributions
 across-0.070
bi-0.062
 edge-0.061
 middle-0.057
 back-0.057
Token?"
Token ID1701
Feature activation-3.00e-03

Pos logit contributions
 forgiveness+0.010
 cr+0.009
 sew+0.009
 forgive+0.008
 whose+0.008
Neg logit contributions
bi-0.037
 edge-0.037
ling-0.035
 bottom-0.035
 distance-0.034
Token
Token ID198
Feature activation-5.59e-03

Pos logit contributions
bi+0.004
 edge+0.004
Frank+0.004
Larry+0.004
eled+0.004
Neg logit contributions
 cr-0.004
 forgiveness-0.004
?-0.004
 sew-0.004
 report-0.004
Token
Token ID198
Feature activation-3.60e-04

Pos logit contributions
bi+0.008
 edge+0.008
Larry+0.008
Frank+0.007
Charles+0.007
Neg logit contributions
 cr-0.008
 sew-0.008
 forgiveness-0.008
?-0.007
 report-0.007
TokenHer
Token ID9360
Feature activation-2.06e-03

Pos logit contributions
bi+0.001
 edge+0.000
Larry+0.000
ling+0.000
 valleys+0.000
Neg logit contributions
 cr-0.001
?-0.001
 sew-0.001
 grandma-0.000
 forgiveness-0.000
Token and
Token ID290
Feature activation-4.60e-03

Pos logit contributions
 cr+0.003
?+0.002
 sew+0.002
 forgive+0.002
 someday+0.002
Neg logit contributions
bi-0.008
Larry-0.007
 edge-0.007
Charles-0.007
eled-0.007
Token try
Token ID1949
Feature activation+5.06e-03

Pos logit contributions
bi+0.008
Frank+0.007
eled+0.007
umbling+0.007
 edge+0.007
Neg logit contributions
 cr-0.007
 sew-0.006
?-0.006
 don-0.006
 L-0.006
Token to
Token ID284
Feature activation+1.27e-03

Pos logit contributions
 cr+0.003
?+0.003
 if+0.003
 L+0.003
 where+0.003
Neg logit contributions
bi-0.012
eled-0.012
umbling-0.011
Frank-0.011
Charles-0.011
Token grab
Token ID5552
Feature activation-2.25e-03

Pos logit contributions
 cr+0.002
 sew+0.002
?+0.002
 forgive+0.001
 forgiveness+0.001
Neg logit contributions
bi-0.002
eled-0.002
 edge-0.002
Larry-0.002
ling-0.002
Token the
Token ID262
Feature activation-5.88e-03

Pos logit contributions
bi+0.005
 edge+0.004
acking+0.004
Larry+0.004
Frank+0.004
Neg logit contributions
 cr-0.002
?-0.002
 sew-0.002
 L-0.002
 forgiveness-0.002
Token av
Token ID1196
Feature activation-8.10e-02

Pos logit contributions
bi+0.011
Charles+0.011
acking+0.011
Mary+0.011
Pat+0.010
Neg logit contributions
 cr-0.008
?-0.007
 glue-0.007
 forgiveness-0.006
 thread-0.006
Tokenoc
Token ID420
Feature activation-1.02e-02

Pos logit contributions
bi+0.150
 edge+0.144
Larry+0.138
 crashing+0.136
 sudden+0.136
Neg logit contributions
 cr-0.089
?-0.080
 sew-0.080
/-0.072
 someday-0.071
Tokenados
Token ID22484
Feature activation-3.63e-02

Pos logit contributions
bi+0.016
 distance+0.015
Larry+0.015
 sudden+0.015
 valleys+0.014
Neg logit contributions
 cr-0.015
 forgiveness-0.013
/-0.013
?-0.013
 sew-0.012
Token.
Token ID13
Feature activation-4.89e-03

Pos logit contributions
bi+0.074
 edge+0.073
ling+0.069
Larry+0.068
 across+0.068
Neg logit contributions
 cr-0.031
 forgiveness-0.030
 sew-0.030
?-0.028
 report-0.026
Token But
Token ID887
Feature activation-5.87e-03

Pos logit contributions
bi+0.008
 edge+0.007
Larry+0.007
 valleys+0.007
Charles+0.007
Neg logit contributions
 cr-0.007
?-0.006
 forgiveness-0.006
 sew-0.006
 report-0.006
Token the
Token ID262
Feature activation-7.52e-03

Pos logit contributions
bi+0.013
 edge+0.012
ling+0.012
Larry+0.011
acking+0.011
Neg logit contributions
 cr-0.005
?-0.005
 sew-0.004
 forgiveness-0.004
 L-0.004
Token they
Token ID484
Feature activation-1.19e-02

Pos logit contributions
 cr+0.007
?+0.007
 forgiveness+0.006
 sew+0.006
 report+0.006
Neg logit contributions
bi-0.025
 edge-0.024
Larry-0.023
ling-0.023
acking-0.023
Token had
Token ID550
Feature activation+7.32e-03

Pos logit contributions
bi+0.018
 edge+0.017
Larry+0.016
ling+0.016
acking+0.016
Neg logit contributions
 cr-0.017
?-0.016
 sew-0.016
 forgiveness-0.016
 forgive-0.015
Token never
Token ID1239
Feature activation+1.40e-03

Pos logit contributions
 cr+0.010
 sew+0.010
?+0.010
 forgiveness+0.009
 report+0.009
Neg logit contributions
bi-0.011
 edge-0.010
Larry-0.010
Frank-0.010
Charles-0.010
Token seen
Token ID1775
Feature activation+2.60e-02

Pos logit contributions
 sew+0.002
 cr+0.002
 report+0.002
 forgiveness+0.002
?+0.002
Neg logit contributions
bi-0.002
 distance-0.002
 across-0.002
 edge-0.002
 scrambled-0.001
Token the
Token ID262
Feature activation-8.68e-03

Pos logit contributions
 cr+0.029
?+0.028
 sew+0.027
 forgiveness+0.027
 heaven+0.024
Neg logit contributions
bi-0.048
 edge-0.046
Larry-0.044
ling-0.044
acking-0.043
Token av
Token ID1196
Feature activation-6.97e-02

Pos logit contributions
bi+0.014
acking+0.013
arted+0.013
eled+0.013
Larry+0.013
Neg logit contributions
 cr-0.013
?-0.012
 forgiveness-0.012
 report-0.012
 grandma-0.011
Tokenoc
Token ID420
Feature activation-5.85e-03

Pos logit contributions
bi+0.144
 edge+0.141
Larry+0.136
arted+0.135
 crashing+0.135
Neg logit contributions
 cr-0.061
?-0.054
 sew-0.054
/-0.051
 someday-0.048
Tokenados
Token ID22484
Feature activation-2.87e-02

Pos logit contributions
bi+0.009
Larry+0.008
 sudden+0.008
 distance+0.008
 edge+0.008
Neg logit contributions
 cr-0.008
 forgiveness-0.008
?-0.008
/-0.008
 report-0.008
Token.
Token ID13
Feature activation-4.69e-03

Pos logit contributions
bi+0.059
 edge+0.056
Larry+0.054
ling+0.054
 distance+0.052
Neg logit contributions
 cr-0.026
 sew-0.024
?-0.024
 forgiveness-0.023
 report-0.022
Token They
Token ID1119
Feature activation+5.52e-03

Pos logit contributions
bi+0.007
 edge+0.007
Larry+0.006
 valleys+0.006
 streets+0.006
Neg logit contributions
 cr-0.007
 sew-0.007
?-0.007
 forgiveness-0.007
 report-0.006
Token wished
Token ID16555
Feature activation+1.52e-02

Pos logit contributions
 cr+0.007
?+0.006
 sew+0.006
 forgiveness+0.006
 report+0.006
Neg logit contributions
bi-0.010
 edge-0.009
Larry-0.009
ling-0.009
 distance-0.008
Token so
Token ID523
Feature activation-2.71e-03

Pos logit contributions
 terrain+0.013
ling+0.013
bi+0.013
 edge+0.013
 slope+0.012
Neg logit contributions
 where-0.005
 L-0.005
?-0.005
 if-0.005
 grandma-0.005
Token happy
Token ID3772
Feature activation-5.58e-03

Pos logit contributions
 edge+0.005
bi+0.005
 terrain+0.005
 boulder+0.005
mo+0.005
Neg logit contributions
 cr-0.003
?-0.003
 forgive-0.003
 don-0.003
 sew-0.003
Token and
Token ID290
Feature activation-3.48e-03

Pos logit contributions
bi+0.012
 edge+0.011
Larry+0.011
ling+0.011
Charles+0.011
Neg logit contributions
 cr-0.005
?-0.004
 sew-0.004
 forgiveness-0.004
 report-0.004
Token brought
Token ID3181
Feature activation-1.08e-02

Pos logit contributions
bi+0.006
 edge+0.006
mo+0.006
Larry+0.006
ling+0.006
Neg logit contributions
 cr-0.004
 sew-0.004
?-0.003
 L-0.003
 forgiveness-0.003
Token the
Token ID262
Feature activation-4.65e-03

Pos logit contributions
umbling+0.024
 edge+0.022
Charles+0.022
bi+0.022
 slope+0.022
Neg logit contributions
 cr-0.010
?-0.008
 sew-0.008
 grandma-0.007
 forgiveness-0.007
Token av
Token ID1196
Feature activation-6.84e-02

Pos logit contributions
umbling+0.009
eled+0.009
acking+0.009
Charles+0.009
 crashing+0.009
Neg logit contributions
 cr-0.005
?-0.004
 forgive-0.004
 glue-0.004
?"-0.004
Tokenoc
Token ID420
Feature activation-2.32e-03

Pos logit contributions
 edge+0.145
 crashing+0.143
 bottom+0.139
bi+0.138
arted+0.135
Neg logit contributions
 cr-0.058
 sew-0.048
?-0.043
 someday-0.041
/-0.039
Tokenados
Token ID22484
Feature activation-3.18e-02

Pos logit contributions
 distance+0.004
Larry+0.004
 sudden+0.004
 crashing+0.004
Back+0.004
Neg logit contributions
 cr-0.003
 forgiveness-0.003
/-0.003
mm-0.002
?-0.002
Token home
Token ID1363
Feature activation+6.31e-03

Pos logit contributions
bi+0.053
 edge+0.050
Larry+0.049
Charles+0.048
 sudden+0.048
Neg logit contributions
 cr-0.042
?-0.039
 sew-0.037
 forgiveness-0.037
 forgive-0.035
Token to
Token ID284
Feature activation-1.00e-03

Pos logit contributions
 cr+0.004
?+0.003
 sew+0.003
 forgiveness+0.003
 forgive+0.003
Neg logit contributions
bi-0.015
 edge-0.014
Larry-0.014
Charles-0.014
 sudden-0.014
Token share
Token ID2648
Feature activation-3.44e-04

Pos logit contributions
bi+0.002
 edge+0.002
Larry+0.002
 distance+0.002
 bottom+0.002
Neg logit contributions
 cr-0.001
 sew-0.001
?-0.001
 forgiveness-0.001
/-0.001
Token going
Token ID1016
Feature activation-3.15e-02

Pos logit contributions
bi+0.009
Larry+0.009
 edge+0.009
ling+0.009
Charles+0.008
Neg logit contributions
 cr-0.006
?-0.005
 L-0.005
 sew-0.005
 grandma-0.005
Token to
Token ID284
Feature activation+4.05e-03

Pos logit contributions
bi+0.072
Larry+0.070
eled+0.067
 edge+0.067
mo+0.066
Neg logit contributions
 cr-0.026
?-0.025
?"-0.021
 report-0.021
 sew-0.020
Token get
Token ID651
Feature activation-5.80e-03

Pos logit contributions
 cr+0.004
?+0.004
 sew+0.004
 forgiveness+0.004
 report+0.004
Neg logit contributions
bi-0.008
 edge-0.007
ling-0.007
Larry-0.007
Charles-0.007
Token a
Token ID257
Feature activation-1.79e-02

Pos logit contributions
bi+0.013
 edge+0.013
Larry+0.012
acking+0.012
ling+0.012
Neg logit contributions
 cr-0.005
?-0.004
 sew-0.004
 grandma-0.004
 forgiveness-0.004
Token l
Token ID300
Feature activation-1.63e-02

Pos logit contributions
bi+0.031
 edge+0.029
Larry+0.027
ling+0.027
 distance+0.027
Neg logit contributions
 cr-0.021
 sew-0.021
?-0.021
 forgiveness-0.020
 heaven-0.019
Tokenoll
Token ID692
Feature activation-6.74e-02

Pos logit contributions
bi+0.028
 edge+0.027
ling+0.025
acking+0.025
Larry+0.025
Neg logit contributions
 cr-0.020
 forgiveness-0.019
 sew-0.019
?-0.018
 report-0.018
Tokenipop
Token ID42800
Feature activation-1.94e-02

Pos logit contributions
bi+0.107
ling+0.098
 edge+0.091
mo+0.091
ging+0.089
Neg logit contributions
 sew-0.091
 forgiveness-0.089
 heaven-0.088
 cr-0.088
 forgive-0.084
Token for
Token ID329
Feature activation-9.36e-03

Pos logit contributions
bi+0.038
 edge+0.038
ling+0.036
 bottom+0.035
 across+0.035
Neg logit contributions
 cr-0.017
 forgiveness-0.017
 sew-0.016
?-0.015
 forgive-0.015
Token being
Token ID852
Feature activation-1.83e-02

Pos logit contributions
bi+0.017
 edge+0.017
Larry+0.016
ling+0.016
Charles+0.015
Neg logit contributions
 cr-0.010
?-0.010
 sew-0.009
 forgiveness-0.009
 report-0.009
Token so
Token ID523
Feature activation+2.15e-03

Pos logit contributions
 edge+0.035
bi+0.035
 summit+0.032
 valleys+0.032
 bottom+0.032
Neg logit contributions
 cr-0.022
?-0.019
 sew-0.018
 L-0.018
 forgiveness-0.017
Token organized
Token ID8389
Feature activation-1.77e-03

Pos logit contributions
 cr+0.003
 sew+0.002
 L+0.002
?+0.002
 forgiveness+0.002
Neg logit contributions
bi-0.004
 edge-0.004
 distance-0.004
Larry-0.004
ling-0.004
Token As
Token ID1081
Feature activation+2.80e-03

Pos logit contributions
bi+0.001
ling+0.001
 edge+0.001
eled+0.001
mo+0.001
Neg logit contributions
 cr-0.001
?-0.001
 L-0.001
 sew-0.001
 heaven-0.001
Token the
Token ID262
Feature activation-1.15e-02

Pos logit contributions
 cr+0.003
 L+0.002
?+0.002
 we+0.002
 later+0.002
Neg logit contributions
bi-0.006
umbling-0.006
acking-0.006
 edge-0.006
eled-0.006
Token farmer
Token ID18739
Feature activation+4.21e-03

Pos logit contributions
Pat+0.020
bi+0.019
Phil+0.019
acking+0.019
 valleys+0.019
Neg logit contributions
 cr-0.017
 forgiveness-0.014
 ink-0.014
 glue-0.013
 wine-0.013
Token scattered
Token ID16830
Feature activation-9.29e-03

Pos logit contributions
 cr+0.005
?+0.005
 sew+0.004
 later+0.004
 scrib+0.004
Neg logit contributions
bi-0.008
Phil-0.007
Charles-0.007
pering-0.007
 edge-0.007
Token the
Token ID262
Feature activation-2.10e-03

Pos logit contributions
bi+0.019
umbling+0.018
Charles+0.017
rolling+0.017
 edge+0.017
Neg logit contributions
 cr-0.011
?-0.009
 ink-0.008
 instead-0.008
 forgiveness-0.008
Token av
Token ID1196
Feature activation-6.54e-02

Pos logit contributions
Pat+0.004
Phil+0.004
Charles+0.004
eled+0.004
Josh+0.004
Neg logit contributions
 cr-0.003
?-0.002
 glue-0.002
 pens-0.002
 art-0.002
Tokenoc
Token ID420
Feature activation+4.84e-03

Pos logit contributions
 edge+0.140
bi+0.138
 crashing+0.137
 bottom+0.133
Charles+0.131
Neg logit contributions
 cr-0.053
 sew-0.042
?-0.037
/-0.035
 scrib-0.033
Tokenados
Token ID22484
Feature activation-2.60e-02

Pos logit contributions
 cr+0.006
 forgiveness+0.006
/+0.006
?+0.006
 sew+0.005
Neg logit contributions
 distance-0.008
bi-0.008
Larry-0.008
Back-0.007
 sudden-0.007
Token,
Token ID11
Feature activation+4.84e-03

Pos logit contributions
bi+0.051
Charles+0.046
 edge+0.045
Larry+0.045
eled+0.044
Neg logit contributions
 cr-0.030
?-0.028
 sew-0.026
 don-0.024
 later-0.024
Token he
Token ID339
Feature activation+1.56e-03

Pos logit contributions
 cr+0.005
 L+0.004
 if+0.004
?+0.004
 don+0.004
Neg logit contributions
bi-0.011
acking-0.011
umbling-0.010
eled-0.010
arted-0.010
Token noticed
Token ID6810
Feature activation-5.05e-03

Pos logit contributions
 cr+0.002
?+0.002
 scrib+0.002
 drew+0.002
 sew+0.002
Neg logit contributions
bi-0.003
Frank-0.003
Charles-0.003
 distance-0.003
umbling-0.003
Token won
Token ID1839
Feature activation-6.25e-03

Pos logit contributions
 cr+0.000
?+0.000
 sew+0.000
 forgiveness+0.000
 report+0.000
Neg logit contributions
bi-0.001
 edge-0.001
Larry-0.001
ling-0.001
 valleys-0.001
Token't
Token ID470
Feature activation-1.23e-02

Pos logit contributions
bi+0.012
eled+0.010
 distance+0.010
 valleys+0.010
Larry+0.010
Neg logit contributions
?-0.008
 sew-0.008
?"-0.008
 cr-0.008
 forgiveness-0.007
Token get
Token ID651
Feature activation-9.16e-03

Pos logit contributions
bi+0.026
 valleys+0.025
eled+0.025
Charles+0.024
Larry+0.024
Neg logit contributions
?-0.012
 forgive-0.011
 sew-0.011
?"-0.011
 don-0.010
Token a
Token ID257
Feature activation-1.59e-02

Pos logit contributions
bi+0.018
 valleys+0.016
Larry+0.016
 edge+0.016
Charles+0.016
Neg logit contributions
?-0.011
 cr-0.011
?"-0.010
 forgiveness-0.010
 sew-0.009
Token l
Token ID300
Feature activation-2.15e-02

Pos logit contributions
bi+0.028
 edge+0.026
Larry+0.025
ling+0.025
Charles+0.025
Neg logit contributions
 cr-0.019
?-0.018
 sew-0.018
 forgiveness-0.017
 report-0.016
Tokenoll
Token ID692
Feature activation-6.54e-02

Pos logit contributions
bi+0.037
 edge+0.035
ling+0.033
Larry+0.033
mo+0.032
Neg logit contributions
 cr-0.026
 sew-0.025
 forgiveness-0.024
 report-0.024
?-0.023
Tokenipop
Token ID42800
Feature activation-1.93e-02

Pos logit contributions
bi+0.118
 edge+0.108
ling+0.106
Larry+0.104
Charles+0.101
Neg logit contributions
 cr-0.075
 sew-0.073
 forgiveness-0.070
?-0.070
 report-0.067
Token."
Token ID526
Feature activation+3.19e-03

Pos logit contributions
bi+0.040
 edge+0.038
ling+0.036
Larry+0.036
 distance+0.035
Neg logit contributions
 cr-0.017
 sew-0.015
 forgiveness-0.015
?-0.015
 report-0.014
Token
Token ID198
Feature activation-9.76e-04

Pos logit contributions
?+0.004
 cr+0.004
 sew+0.004
 L+0.004
 forgiveness+0.004
Neg logit contributions
bi-0.005
 edge-0.005
ling-0.005
 valleys-0.005
 depths-0.005
Token
Token ID198
Feature activation+4.09e-03

Pos logit contributions
bi+0.002
 edge+0.002
 valleys+0.002
 summit+0.002
ling+0.002
Neg logit contributions
?-0.001
 forgive-0.001
 cr-0.001
 forgiveness-0.001
 sew-0.001
TokenLu
Token ID25596
Feature activation-4.69e-03

Pos logit contributions
?+0.005
 forgive+0.005
 sew+0.005
 forgiveness+0.005
 later+0.005
Neg logit contributions
bi-0.007
 edge-0.007
 valleys-0.006
 bottom-0.006
 peak-0.006
Token a
Token ID257
Feature activation+1.05e-02

Pos logit contributions
bi+0.022
 edge+0.020
ling+0.019
Larry+0.019
acking+0.019
Neg logit contributions
 cr-0.009
 sew-0.007
?-0.007
 L-0.007
 forgiveness-0.007
Token dinosaur
Token ID28257
Feature activation-1.24e-02

Pos logit contributions
 cr+0.009
 sew+0.007
?+0.007
 forgiveness+0.006
 scrib+0.006
Neg logit contributions
bi-0.024
arted-0.022
Larry-0.022
 edge-0.022
ling-0.022
Token and
Token ID290
Feature activation-2.06e-03

Pos logit contributions
bi+0.026
 edge+0.024
Larry+0.024
ling+0.023
Charles+0.023
Neg logit contributions
 cr-0.011
?-0.010
 sew-0.010
 forgiveness-0.010
 report-0.009
Token a
Token ID257
Feature activation+4.50e-03

Pos logit contributions
bi+0.004
 edge+0.004
Larry+0.003
 valleys+0.003
 sudden+0.003
Neg logit contributions
?-0.002
 forgiveness-0.002
 cr-0.002
 sew-0.002
 report-0.002
Token l
Token ID300
Feature activation-2.02e-02

Pos logit contributions
 cr+0.004
 sew+0.004
?+0.003
 forgiveness+0.003
 scrib+0.003
Neg logit contributions
bi-0.010
arted-0.009
Larry-0.009
ling-0.009
umbling-0.009
Tokenoll
Token ID692
Feature activation-6.54e-02

Pos logit contributions
bi+0.030
 edge+0.028
He+0.027
 onward+0.027
mo+0.027
Neg logit contributions
 report-0.027
 cr-0.026
 forgiveness-0.026
 sew-0.026
 tonight-0.026
Tokenipop
Token ID42800
Feature activation-1.95e-02

Pos logit contributions
bi+0.099
ling+0.088
mo+0.086
ging+0.082
eled+0.081
Neg logit contributions
 sew-0.094
 forgiveness-0.093
 heaven-0.091
 report-0.090
 cr-0.087
Token with
Token ID351
Feature activation-1.23e-02

Pos logit contributions
bi+0.037
 edge+0.037
ling+0.035
 across+0.035
 bottom+0.034
Neg logit contributions
 forgiveness-0.017
 cr-0.017
 sew-0.017
?-0.017
 report-0.016
Token a
Token ID257
Feature activation+9.60e-03

Pos logit contributions
bi+0.027
umbling+0.025
ling+0.025
 edge+0.025
acking+0.024
Neg logit contributions
 cr-0.013
 L-0.010
 scrib-0.010
 sew-0.009
?-0.009
Token cherry
Token ID23612
Feature activation-2.10e-02

Pos logit contributions
 cr+0.012
 sew+0.009
?+0.009
 scrib+0.009
 card+0.008
Neg logit contributions
bi-0.021
arted-0.020
umbling-0.019
eled-0.019
Larry-0.018
Token.
Token ID13
Feature activation-1.85e-03

Pos logit contributions
bi+0.045
Larry+0.043
Charles+0.042
Frank+0.042
 edge+0.042
Neg logit contributions
 cr-0.019
?-0.017
 sew-0.016
 forgiveness-0.014
 L-0.013
Token wanted
Token ID2227
Feature activation-1.32e-03

Pos logit contributions
bi+0.005
 edge+0.005
Larry+0.005
ling+0.004
 distance+0.004
Neg logit contributions
 cr-0.004
?-0.004
 sew-0.004
 forgiveness-0.004
 report-0.004
Token to
Token ID284
Feature activation+1.21e-03

Pos logit contributions
bi+0.003
 edge+0.003
ling+0.002
Larry+0.002
acking+0.002
Neg logit contributions
 cr-0.001
?-0.001
 forgiveness-0.001
 sew-0.001
 report-0.001
Token win
Token ID1592
Feature activation+2.60e-03

Pos logit contributions
 cr+0.001
/+0.001
 forgiveness+0.001
?+0.001
 whose+0.001
Neg logit contributions
 edge-0.002
bi-0.002
Larry-0.002
 bottom-0.002
 distance-0.002
Token a
Token ID257
Feature activation-3.67e-03

Pos logit contributions
 cr+0.003
 sew+0.002
?+0.002
 forgiveness+0.002
 report+0.002
Neg logit contributions
bi-0.005
 edge-0.005
Larry-0.005
ling-0.005
 valleys-0.005
Token l
Token ID300
Feature activation-1.75e-02

Pos logit contributions
bi+0.006
 edge+0.006
Larry+0.006
ling+0.006
 distance+0.006
Neg logit contributions
 cr-0.004
 sew-0.004
?-0.004
 forgiveness-0.004
 heaven-0.004
Tokenoll
Token ID692
Feature activation-6.53e-02

Pos logit contributions
bi+0.024
 edge+0.023
 onward+0.023
pering+0.022
acking+0.022
Neg logit contributions
 cr-0.025
 forgiveness-0.025
 sew-0.024
 Acc-0.022
 report-0.022
Tokenipop
Token ID42800
Feature activation-1.64e-02

Pos logit contributions
bi+0.098
ling+0.090
mo+0.084
ging+0.082
 edge+0.080
Neg logit contributions
 sew-0.093
 forgiveness-0.090
 heaven-0.090
 cr-0.087
 forgive-0.087
Token too
Token ID1165
Feature activation-6.07e-03

Pos logit contributions
 edge+0.029
bi+0.028
 across+0.028
ling+0.027
 bottom+0.027
Neg logit contributions
 forgiveness-0.016
 sew-0.016
 cr-0.016
 Acc-0.015
 forgive-0.014
Token.
Token ID13
Feature activation-3.55e-03

Pos logit contributions
bi+0.011
 edge+0.010
 across+0.010
ling+0.010
Frank+0.010
Neg logit contributions
 forgiveness-0.007
 sew-0.006
 cr-0.006
 forgive-0.005
 report-0.005
Token But
Token ID887
Feature activation-6.00e-03

Pos logit contributions
bi+0.004
 edge+0.004
Larry+0.004
ling+0.004
Charles+0.004
Neg logit contributions
 cr-0.006
?-0.006
 sew-0.006
 forgiveness-0.006
 report-0.006
Token he
Token ID339
Feature activation-1.01e-02

Pos logit contributions
bi+0.013
 edge+0.012
Larry+0.012
ling+0.012
 valleys+0.011
Neg logit contributions
 cr-0.005
?-0.005
 sew-0.005
 forgiveness-0.005
 report-0.004
Tokenongo
Token ID25162
Feature activation-6.47e-03

Pos logit contributions
 cr+0.022
?+0.021
 sew+0.021
 forgiveness+0.021
 report+0.020
Neg logit contributions
bi-0.014
 edge-0.012
ling-0.011
Larry-0.011
acking-0.011
Token didn
Token ID1422
Feature activation-1.39e-04

Pos logit contributions
bi+0.012
 edge+0.012
Larry+0.011
 valleys+0.011
Charles+0.011
Neg logit contributions
 cr-0.007
?-0.007
 sew-0.006
 forgiveness-0.006
 scrib-0.006
Token't
Token ID470
Feature activation-9.16e-03

Pos logit contributions
bi+0.000
 edge+0.000
Larry+0.000
ling+0.000
 bottom+0.000
Neg logit contributions
 cr-0.000
 sew-0.000
 forgiveness-0.000
?-0.000
 heaven-0.000
Token like
Token ID588
Feature activation-3.06e-03

Pos logit contributions
bi+0.015
ling+0.014
 distance+0.013
 edge+0.013
 across+0.013
Neg logit contributions
 cr-0.012
 forgiveness-0.011
?-0.011
 sew-0.011
/-0.010
Token wet
Token ID9583
Feature activation+6.34e-03

Pos logit contributions
bi+0.006
 edge+0.006
Larry+0.006
ling+0.006
Charles+0.006
Neg logit contributions
 cr-0.003
?-0.003
 forgiveness-0.003
 sew-0.003
 report-0.003
Token av
Token ID1196
Feature activation-6.45e-02

Pos logit contributions
 cr+0.007
?+0.007
?"+0.006
 Lily+0.006
 ink+0.006
Neg logit contributions
Charles-0.013
Larry-0.013
bi-0.013
rolling-0.012
Flash-0.012
Tokenoc
Token ID420
Feature activation+8.98e-03

Pos logit contributions
bi+0.116
 edge+0.111
Larry+0.106
Charles+0.103
ling+0.102
Neg logit contributions
 cr-0.076
 sew-0.071
?-0.070
 forgiveness-0.065
/-0.064
Tokenados
Token ID22484
Feature activation-2.15e-02

Pos logit contributions
 cr+0.014
 forgiveness+0.013
?+0.013
 sew+0.013
/+0.013
Neg logit contributions
bi-0.013
 edge-0.012
 distance-0.011
Larry-0.011
 valleys-0.011
Token,
Token ID11
Feature activation-1.01e-02

Pos logit contributions
bi+0.040
 edge+0.039
ling+0.039
 across+0.037
 distance+0.037
Neg logit contributions
 cr-0.020
 forgiveness-0.020
 sew-0.019
?-0.019
 report-0.018
Token so
Token ID523
Feature activation-1.26e-02

Pos logit contributions
bi+0.021
Larry+0.019
Charles+0.019
acking+0.019
 edge+0.019
Neg logit contributions
 cr-0.010
 L-0.009
?-0.009
 don-0.009
 sew-0.008
Token he
Token ID339
Feature activation-1.33e-02

Pos logit contributions
bi+0.020
 edge+0.019
Larry+0.018
ling+0.018
Charles+0.018
Neg logit contributions
 cr-0.017
?-0.017
 sew-0.016
 forgiveness-0.016
 L-0.015
Token down
Token ID866
Feature activation-2.53e-03

Pos logit contributions
bi+0.008
 edge+0.007
Larry+0.007
Frank+0.007
Charles+0.007
Neg logit contributions
?-0.012
 cr-0.012
 sew-0.011
?"-0.011
 forgiveness-0.011
Token and
Token ID290
Feature activation-3.68e-03

Pos logit contributions
 edge+0.004
bi+0.004
 across+0.003
 distance+0.003
ling+0.003
Neg logit contributions
 forgiveness-0.004
 cr-0.004
 sew-0.004
/-0.003
 report-0.003
Token spotted
Token ID13489
Feature activation+5.14e-03

Pos logit contributions
bi+0.006
 edge+0.006
Larry+0.005
ling+0.005
Charles+0.005
Neg logit contributions
 cr-0.005
?-0.005
 sew-0.005
 forgiveness-0.005
 report-0.004
Token a
Token ID257
Feature activation-8.09e-05

Pos logit contributions
 cr+0.005
?+0.004
 sew+0.004
 forgiveness+0.004
 report+0.004
Neg logit contributions
bi-0.011
 edge-0.010
Larry-0.010
ling-0.010
 valleys-0.010
Token l
Token ID300
Feature activation-1.55e-02

Pos logit contributions
bi+0.000
arted+0.000
pered+0.000
eled+0.000
mo+0.000
Neg logit contributions
 cr-0.000
?-0.000
 sew-0.000
 grandma-0.000
?"-0.000
Tokenoll
Token ID692
Feature activation-6.42e-02

Pos logit contributions
bi+0.031
 edge+0.029
Larry+0.028
ling+0.028
Charles+0.027
Neg logit contributions
 cr-0.015
?-0.014
 sew-0.014
 forgiveness-0.014
 report-0.013
Tokenipop
Token ID42800
Feature activation-1.77e-02

Pos logit contributions
bi+0.106
ling+0.096
 edge+0.093
mo+0.091
ging+0.089
Neg logit contributions
 sew-0.084
 forgiveness-0.081
 cr-0.080
 heaven-0.079
 report-0.078
Token on
Token ID319
Feature activation-1.64e-02

Pos logit contributions
bi+0.034
Larry+0.032
 edge+0.032
Charles+0.031
Frank+0.031
Neg logit contributions
 cr-0.019
?-0.018
 sew-0.017
 forgiveness-0.016
 report-0.016
Token the
Token ID262
Feature activation+2.17e-03

Pos logit contributions
bi+0.031
 edge+0.029
Larry+0.028
ling+0.028
Charles+0.027
Neg logit contributions
 cr-0.018
?-0.017
 sew-0.016
 forgiveness-0.016
 report-0.015
Token ground
Token ID2323
Feature activation-3.54e-03

Pos logit contributions
 cr+0.004
 sew+0.004
?+0.004
 report+0.003
 grandma+0.003
Neg logit contributions
bi-0.003
eled-0.003
Larry-0.003
arted-0.003
Frank-0.003
Token.
Token ID13
Feature activation+2.75e-03

Pos logit contributions
bi+0.009
Larry+0.009
Charles+0.009
Frank+0.009
Phil+0.009
Neg logit contributions
 cr-0.002
?-0.001
 sew-0.001
 L-0.001
 heaven-0.001
Token hug
Token ID16225
Feature activation-1.20e-02

Pos logit contributions
bi+0.005
 edge+0.005
 distance+0.004
ling+0.004
 sudden+0.004
Neg logit contributions
 someday-0.003
 tonight-0.003
 cr-0.003
?-0.003
 sew-0.003
Token and
Token ID290
Feature activation+1.59e-03

Pos logit contributions
bi+0.018
 edge+0.017
ling+0.017
 across+0.017
 distance+0.017
Neg logit contributions
 cr-0.015
 sew-0.015
 report-0.014
 forgiveness-0.014
?-0.014
Token a
Token ID257
Feature activation+1.13e-03

Pos logit contributions
?+0.002
 report+0.002
arium+0.002
 cr+0.002
 tonight+0.002
Neg logit contributions
bi-0.003
 sudden-0.002
 scrambled-0.002
ling-0.002
Larry-0.002
Token big
Token ID1263
Feature activation-1.27e-02

Pos logit contributions
 cr+0.001
?+0.001
 sew+0.001
 forgiveness+0.001
 report+0.001
Neg logit contributions
bi-0.002
 edge-0.002
Larry-0.002
 distance-0.002
ling-0.002
Token l
Token ID300
Feature activation-1.61e-02

Pos logit contributions
bi+0.023
 edge+0.022
ling+0.021
Larry+0.021
 distance+0.021
Neg logit contributions
 cr-0.014
?-0.013
 sew-0.013
 forgiveness-0.012
 report-0.012
Tokenoll
Token ID692
Feature activation-6.34e-02

Pos logit contributions
bi+0.023
ling+0.021
mo+0.021
 edge+0.021
umbling+0.021
Neg logit contributions
 sew-0.023
 report-0.023
 cr-0.023
 tonight-0.022
 we-0.022
Tokenipop
Token ID42800
Feature activation-1.85e-02

Pos logit contributions
bi+0.093
ling+0.083
mo+0.080
eled+0.074
ging+0.073
Neg logit contributions
 sew-0.100
 heaven-0.097
 report-0.095
 forgiveness-0.092
 art-0.091
Token.
Token ID13
Feature activation-1.87e-03

Pos logit contributions
 edge+0.031
 across+0.030
bi+0.030
ling+0.030
 bottom+0.028
Neg logit contributions
?-0.019
 forgiveness-0.019
 report-0.019
 sew-0.018
 heaven-0.018
Token Tim
Token ID5045
Feature activation+6.31e-03

Pos logit contributions
bi+0.003
 edge+0.003
Larry+0.003
Frank+0.003
 sudden+0.003
Neg logit contributions
 cr-0.002
 sew-0.002
?-0.002
 forgiveness-0.002
 report-0.002
Token smiled
Token ID13541
Feature activation-1.08e-02

Pos logit contributions
 cr+0.008
 sew+0.007
?+0.007
 forgiveness+0.007
 forgive+0.007
Neg logit contributions
bi-0.011
 edge-0.010
Larry-0.010
 valleys-0.010
Frank-0.010
Token and
Token ID290
Feature activation+6.12e-03

Pos logit contributions
bi+0.018
 edge+0.017
ling+0.017
 across+0.016
Larry+0.016
Neg logit contributions
 cr-0.014
 sew-0.013
 forgiveness-0.013
 report-0.013
?-0.013
Token and
Token ID290
Feature activation-1.24e-03

Pos logit contributions
bi+0.036
 edge+0.033
Larry+0.033
Charles+0.033
mo+0.032
Neg logit contributions
 cr-0.013
?-0.013
 sew-0.010
 L-0.010
 forgiveness-0.010
Token saw
Token ID2497
Feature activation-8.44e-03

Pos logit contributions
bi+0.002
 edge+0.002
umbling+0.002
 bottom+0.002
acking+0.002
Neg logit contributions
 cr-0.001
?-0.001
 sew-0.001
 L-0.001
 whose-0.001
Token a
Token ID257
Feature activation-4.45e-03

Pos logit contributions
bi+0.017
 edge+0.016
Larry+0.016
ling+0.015
acking+0.015
Neg logit contributions
 cr-0.008
?-0.008
 sew-0.007
 forgiveness-0.007
 report-0.007
Token three
Token ID1115
Feature activation-3.21e-02

Pos logit contributions
arted+0.008
bi+0.008
eled+0.008
mo+0.008
pered+0.008
Neg logit contributions
 cr-0.006
 forgive-0.005
?-0.005
 sew-0.005
 grandma-0.005
Token-
Token ID12
Feature activation-2.25e-02

Pos logit contributions
Back+0.061
arted+0.060
 summit+0.059
appointed+0.059
bi+0.059
Neg logit contributions
 cr-0.040
?-0.034
?"-0.030
,"-0.028
/-0.028
Tokenyear
Token ID1941
Feature activation-6.31e-02

Pos logit contributions
bi+0.040
 edge+0.039
 bottom+0.038
 streets+0.037
 valleys+0.036
Neg logit contributions
 cr-0.027
 forgive-0.024
?-0.024
 sew-0.024
 forgiveness-0.023
Token-
Token ID12
Feature activation-1.70e-02

Pos logit contributions
bi+0.130
 edge+0.128
Larry+0.127
Back+0.125
 sudden+0.125
Neg logit contributions
 cr-0.052
?-0.043
 report-0.039
 forgiveness-0.038
/-0.037
Tokenold
Token ID727
Feature activation-7.16e-04

Pos logit contributions
 edge+0.035
bi+0.035
 bottom+0.034
 valleys+0.033
 summit+0.033
Neg logit contributions
 cr-0.015
 forgiveness-0.013
 sew-0.013
?-0.013
 forgive-0.013
Token child
Token ID1200
Feature activation-1.60e-02

Pos logit contributions
 summit+0.001
Back+0.001
bi+0.001
 edge+0.001
appointed+0.001
Neg logit contributions
 cr-0.001
?-0.001
 L-0.001
 forgiveness-0.001
 whose-0.001
Token.
Token ID13
Feature activation+9.63e-04

Pos logit contributions
bi+0.035
 summit+0.034
Larry+0.034
Charles+0.033
Phil+0.033
Neg logit contributions
 cr-0.014
?-0.014
 whose-0.012
 sew-0.012
 forgiveness-0.011
Token The
Token ID383
Feature activation-1.20e-02

Pos logit contributions
 cr+0.001
?+0.001
 sew+0.001
 L+0.001
 forgiveness+0.001
Neg logit contributions
bi-0.002
ling-0.002
 edge-0.002
Larry-0.001
eled-0.001
Token and
Token ID290
Feature activation-7.03e-03

Pos logit contributions
 cr+0.001
?+0.001
 sew+0.001
 forgiveness+0.001
 someday+0.001
Neg logit contributions
bi-0.003
 edge-0.003
Larry-0.003
eled-0.003
Charles-0.003
Token she
Token ID673
Feature activation+8.65e-03

Pos logit contributions
bi+0.014
 edge+0.013
Larry+0.013
ling+0.013
Charles+0.013
Neg logit contributions
 cr-0.007
 sew-0.007
?-0.007
 forgiveness-0.007
 L-0.006
Token had
Token ID550
Feature activation+3.75e-03

Pos logit contributions
 cr+0.010
?+0.010
 sew+0.009
 forgiveness+0.009
/+0.009
Neg logit contributions
bi-0.016
 edge-0.015
Larry-0.014
Frank-0.014
Charles-0.014
Token a
Token ID257
Feature activation+1.84e-03

Pos logit contributions
 cr+0.004
?+0.003
 sew+0.003
 L+0.003
 forgiveness+0.003
Neg logit contributions
bi-0.008
 edge-0.008
ling-0.007
mo-0.007
acking-0.007
Token l
Token ID300
Feature activation-1.69e-02

Pos logit contributions
 cr+0.003
?+0.002
 sew+0.002
 forgiveness+0.002
 forgive+0.002
Neg logit contributions
bi-0.003
 edge-0.003
ling-0.003
umbling-0.003
arted-0.003
Tokenoll
Token ID692
Feature activation-6.29e-02

Pos logit contributions
bi+0.034
 edge+0.032
Larry+0.031
ling+0.031
Charles+0.030
Neg logit contributions
 cr-0.016
?-0.015
 sew-0.015
 forgiveness-0.014
 report-0.013
Tokenipop
Token ID42800
Feature activation-1.68e-02

Pos logit contributions
bi+0.112
ling+0.100
 edge+0.100
Larry+0.097
mo+0.096
Neg logit contributions
 cr-0.074
 sew-0.073
?-0.071
 forgiveness-0.070
 report-0.069
Token to
Token ID284
Feature activation-1.18e-02

Pos logit contributions
bi+0.035
 edge+0.033
Larry+0.032
ling+0.032
Charles+0.031
Neg logit contributions
 cr-0.015
?-0.014
 sew-0.014
 forgiveness-0.013
 report-0.012
Token show
Token ID905
Feature activation+2.40e-03

Pos logit contributions
bi+0.023
 edge+0.022
Larry+0.021
ling+0.021
 distance+0.021
Neg logit contributions
 cr-0.011
?-0.010
 sew-0.010
 forgiveness-0.010
 report-0.009
Token for
Token ID329
Feature activation-7.55e-03

Pos logit contributions
 cr+0.003
?+0.003
 sew+0.002
 forgiveness+0.002
 L+0.002
Neg logit contributions
bi-0.005
 edge-0.004
Larry-0.004
ling-0.004
 valleys-0.004
Token it
Token ID340
Feature activation+6.91e-03

Pos logit contributions
bi+0.014
 edge+0.013
Larry+0.012
ling+0.012
 valleys+0.012
Neg logit contributions
 cr-0.009
?-0.008
 sew-0.008
 forgiveness-0.008
 report-0.008
Token.
Token ID13
Feature activation-3.85e-03

Pos logit contributions
bi+0.013
 edge+0.013
ling+0.012
 distance+0.012
 across+0.012
Neg logit contributions
 cr-0.006
 sew-0.006
 forgiveness-0.006
?-0.005
 report-0.005
Token She
Token ID1375
Feature activation-4.79e-03

Pos logit contributions
bi+0.005
 edge+0.005
ling+0.005
Larry+0.005
Charles+0.004
Neg logit contributions
 cr-0.006
?-0.006
 sew-0.006
 forgiveness-0.006
 L-0.006
Token wanted
Token ID2227
Feature activation-2.05e-03

Pos logit contributions
bi+0.008
 edge+0.008
Larry+0.008
ling+0.008
 valleys+0.008
Neg logit contributions
 cr-0.006
?-0.005
 sew-0.005
 forgiveness-0.005
 report-0.005
Token a
Token ID257
Feature activation-9.56e-03

Pos logit contributions
bi+0.004
 edge+0.004
ling+0.004
Larry+0.004
acking+0.004
Neg logit contributions
 cr-0.002
?-0.002
 sew-0.002
 forgiveness-0.002
 report-0.002
Token l
Token ID300
Feature activation-1.57e-02

Pos logit contributions
bi+0.017
 edge+0.016
Larry+0.015
ling+0.015
 distance+0.015
Neg logit contributions
 cr-0.011
 sew-0.010
?-0.010
 forgiveness-0.010
 report-0.009
Tokenoll
Token ID692
Feature activation-6.29e-02

Pos logit contributions
bi+0.022
 edge+0.020
 onward+0.020
mo+0.020
Larry+0.020
Neg logit contributions
 sew-0.022
 report-0.022
 cr-0.022
 forgiveness-0.021
 grandma-0.021
Tokenipop
Token ID42800
Feature activation-1.35e-02

Pos logit contributions
bi+0.088
ling+0.074
mo+0.072
ging+0.068
 onward+0.066
Neg logit contributions
 sew-0.102
 heaven-0.100
 forgiveness-0.096
 report-0.095
 scrib-0.094
Token now
Token ID783
Feature activation-2.00e-03

Pos logit contributions
bi+0.024
 edge+0.023
 across+0.022
 bottom+0.022
 distance+0.021
Neg logit contributions
 sew-0.014
 cr-0.014
 forgiveness-0.014
 scrib-0.013
 heaven-0.013
Token.
Token ID13
Feature activation-7.11e-03

Pos logit contributions
bi+0.003
 edge+0.003
ling+0.003
 bottom+0.003
 across+0.003
Neg logit contributions
 cr-0.002
 sew-0.002
 forgiveness-0.002
 Sp-0.002
 heaven-0.002
Token She
Token ID1375
Feature activation-7.13e-03

Pos logit contributions
bi+0.008
 edge+0.008
Larry+0.007
Charles+0.007
 valleys+0.007
Neg logit contributions
 cr-0.013
 forgiveness-0.012
 sew-0.012
?-0.012
 report-0.012
Token saw
Token ID2497
Feature activation-1.62e-03

Pos logit contributions
bi+0.012
 edge+0.012
Larry+0.011
ling+0.011
Charles+0.011
Neg logit contributions
 cr-0.009
?-0.008
 sew-0.008
 forgiveness-0.008
 report-0.008
Token backpack
Token ID22526
Feature activation+7.76e-03

Pos logit contributions
bi+0.024
Larry+0.022
 edge+0.022
ling+0.022
Frank+0.022
Neg logit contributions
 cr-0.015
?-0.014
 sew-0.014
 forgiveness-0.013
 report-0.013
Token and
Token ID290
Feature activation-9.26e-03

Pos logit contributions
 cr+0.009
 forgiveness+0.008
 sew+0.008
?+0.008
 report+0.008
Neg logit contributions
bi-0.014
 edge-0.013
 distance-0.013
ling-0.012
 across-0.012
Token grabbed
Token ID13176
Feature activation-1.50e-02

Pos logit contributions
bi+0.016
 edge+0.015
Larry+0.014
ling+0.014
Charles+0.014
Neg logit contributions
 cr-0.011
?-0.011
 sew-0.011
 forgiveness-0.010
 report-0.010
Token the
Token ID262
Feature activation-6.34e-03

Pos logit contributions
bi+0.031
 edge+0.029
Larry+0.028
ling+0.028
Charles+0.028
Neg logit contributions
 cr-0.014
?-0.013
 sew-0.013
 forgiveness-0.012
 L-0.012
Token l
Token ID300
Feature activation-2.11e-02

Pos logit contributions
bi+0.014
Larry+0.013
 valleys+0.013
arted+0.013
acking+0.013
Neg logit contributions
 cr-0.007
 sew-0.005
?-0.005
 L-0.005
 thread-0.005
Tokenoll
Token ID692
Feature activation-6.26e-02

Pos logit contributions
bi+0.043
 edge+0.040
Larry+0.039
ling+0.039
 valleys+0.038
Neg logit contributions
 cr-0.020
?-0.019
 sew-0.018
 forgiveness-0.018
 report-0.016
Tokenipop
Token ID42800
Feature activation-2.28e-02

Pos logit contributions
bi+0.113
 edge+0.104
ling+0.101
Larry+0.099
Charles+0.097
Neg logit contributions
 cr-0.072
 sew-0.070
 forgiveness-0.068
?-0.068
 report-0.065
Token.
Token ID13
Feature activation-7.03e-03

Pos logit contributions
bi+0.049
 edge+0.047
Larry+0.045
ling+0.045
Charles+0.044
Neg logit contributions
 cr-0.018
?-0.016
 sew-0.016
 forgiveness-0.016
 report-0.014
Token He
Token ID679
Feature activation-2.17e-03

Pos logit contributions
bi+0.011
 edge+0.010
Larry+0.010
ling+0.010
Charles+0.009
Neg logit contributions
 cr-0.010
?-0.009
 sew-0.009
 forgiveness-0.009
 report-0.009
Token ran
Token ID4966
Feature activation-9.65e-03

Pos logit contributions
bi+0.004
 edge+0.003
Larry+0.003
ling+0.003
Charles+0.003
Neg logit contributions
 cr-0.003
?-0.003
 forgiveness-0.003
 sew-0.003
 report-0.003
Token away
Token ID1497
Feature activation-8.41e-06

Pos logit contributions
bi+0.020
Larry+0.018
ling+0.018
 sudden+0.018
eled+0.018
Neg logit contributions
 cr-0.010
 L-0.010
 where-0.009
 later-0.009
?-0.009
Token still
Token ID991
Feature activation-1.08e-02

Pos logit contributions
acking+0.022
bi+0.021
umbling+0.021
ling+0.020
 edge+0.020
Neg logit contributions
 cr-0.007
 L-0.007
 don-0.006
?-0.006
 if-0.006
Token couldn
Token ID3521
Feature activation-1.08e-02

Pos logit contributions
bi+0.021
acking+0.021
 edge+0.021
Frank+0.020
Charles+0.020
Neg logit contributions
?-0.011
 cr-0.010
 don-0.010
 if-0.010
 L-0.009
Token't
Token ID470
Feature activation-2.26e-02

Pos logit contributions
bi+0.021
ling+0.019
eled+0.019
acking+0.019
 depths+0.018
Neg logit contributions
?-0.013
 we-0.012
 sew-0.011
?"-0.011
 cr-0.011
Token find
Token ID1064
Feature activation-3.24e-03

Pos logit contributions
bi+0.039
ling+0.035
acking+0.035
 edge+0.034
eled+0.034
Neg logit contributions
?-0.029
 cr-0.028
 sew-0.028
 forgiveness-0.026
 forgive-0.026
Token any
Token ID597
Feature activation-1.74e-02

Pos logit contributions
bi+0.006
acking+0.006
ling+0.006
 edge+0.006
Larry+0.006
Neg logit contributions
 cr-0.003
?-0.003
 sew-0.003
 L-0.003
 forgiveness-0.003
Token av
Token ID1196
Feature activation-6.24e-02

Pos logit contributions
bi+0.032
acking+0.031
Charles+0.031
Larry+0.030
Phil+0.030
Neg logit contributions
 cr-0.021
?-0.020
 forgiveness-0.019
 if-0.017
 forgive-0.017
Tokenoc
Token ID420
Feature activation+6.45e-03

Pos logit contributions
 edge+0.121
bi+0.119
 crashing+0.113
Larry+0.112
 bottom+0.112
Neg logit contributions
 cr-0.063
?-0.056
 sew-0.054
/-0.051
 someday-0.051
Tokenados
Token ID22484
Feature activation-2.58e-02

Pos logit contributions
 cr+0.009
 forgiveness+0.009
/+0.008
?+0.008
 sew+0.008
Neg logit contributions
bi-0.010
 distance-0.010
 edge-0.010
Larry-0.009
 sudden-0.009
Token.
Token ID13
Feature activation+2.01e-04

Pos logit contributions
bi+0.054
 edge+0.052
Larry+0.050
ling+0.050
 valleys+0.048
Neg logit contributions
 cr-0.022
 sew-0.020
?-0.020
 forgiveness-0.019
 report-0.018
Token Finally
Token ID9461
Feature activation-1.78e-02

Pos logit contributions
 L+0.000
 Lily+0.000
 Sp+0.000
 grandma+0.000
?+0.000
Neg logit contributions
ling-0.000
lling-0.000
bi-0.000
ging-0.000
 crashing-0.000
Token,
Token ID11
Feature activation-6.74e-03

Pos logit contributions
bi+0.041
 edge+0.039
ling+0.038
acking+0.038
Larry+0.038
Neg logit contributions
 cr-0.013
?-0.012
 sew-0.011
 forgiveness-0.011
 L-0.010
Token a
Token ID257
Feature activation-1.30e-02

Pos logit contributions
bi+0.000
 edge+0.000
Larry+0.000
ling+0.000
Charles+0.000
Neg logit contributions
 cr-0.000
?-0.000
 sew-0.000
 forgiveness-0.000
 report-0.000
Token sticker
Token ID27735
Feature activation+1.03e-03

Pos logit contributions
 edge+0.022
bi+0.022
 distance+0.022
 sudden+0.021
 bottom+0.021
Neg logit contributions
 cr-0.014
?-0.013
mm-0.013
 sew-0.013
/-0.013
Token and
Token ID290
Feature activation-6.91e-03

Pos logit contributions
 forgiveness+0.001
 sew+0.001
 cr+0.001
 Acc+0.001
 whose+0.001
Neg logit contributions
bi-0.002
ling-0.002
 across-0.002
 edge-0.002
 distance-0.002
Token a
Token ID257
Feature activation+2.19e-03

Pos logit contributions
bi+0.012
 edge+0.012
 distance+0.011
 across+0.011
 streets+0.011
Neg logit contributions
 cr-0.007
?-0.007
/-0.006
 forgiveness-0.006
arium-0.006
Token l
Token ID300
Feature activation-1.61e-02

Pos logit contributions
 cr+0.002
?+0.002
 sew+0.002
 forgiveness+0.002
 report+0.002
Neg logit contributions
bi-0.004
 edge-0.004
Larry-0.004
ling-0.004
 distance-0.004
Tokenoll
Token ID692
Feature activation-6.24e-02

Pos logit contributions
pered+0.016
 onward+0.016
mo+0.015
 forward+0.015
umbling+0.015
Neg logit contributions
 we-0.028
 cr-0.027
 tonight-0.027
 disl-0.027
 report-0.027
Tokenipop
Token ID42800
Feature activation-1.62e-02

Pos logit contributions
bi+0.077
mo+0.069
ling+0.069
ging+0.060
eled+0.056
Neg logit contributions
 sew-0.105
 forgiveness-0.102
 express-0.102
 heaven-0.100
 art-0.100
Token.
Token ID13
Feature activation-8.54e-03

Pos logit contributions
bi+0.027
 edge+0.027
 across+0.026
ling+0.026
 onward+0.026
Neg logit contributions
 forgiveness-0.016
 sew-0.016
 cr-0.015
 Acc-0.014
 scrib-0.014
Token
Token ID198
Feature activation-4.54e-03

Pos logit contributions
 edge+0.011
bi+0.011
He+0.011
 onward+0.011
 sudden+0.010
Neg logit contributions
 cr-0.013
 report-0.012
 forgiveness-0.012
?-0.012
 sew-0.011
Token
Token ID198
Feature activation+1.15e-03

Pos logit contributions
bi+0.006
 edge+0.006
Larry+0.005
 sudden+0.005
ling+0.005
Neg logit contributions
 cr-0.007
 forgiveness-0.007
 sew-0.007
?-0.007
 report-0.006
TokenL
Token ID43
Feature activation+1.74e-02

Pos logit contributions
 cr+0.001
?+0.001
 sew+0.001
 forgiveness+0.001
 report+0.001
Neg logit contributions
bi-0.003
 edge-0.002
Larry-0.002
ling-0.002
 distance-0.002
Token girl
Token ID2576
Feature activation-1.11e-02

Pos logit contributions
 cr+0.008
 sew+0.007
?+0.007
 forgiveness+0.006
 heaven+0.006
Neg logit contributions
bi-0.010
Larry-0.010
Charles-0.010
Frank-0.009
Phil-0.009
Token who
Token ID508
Feature activation-1.85e-03

Pos logit contributions
bi+0.023
Larry+0.021
 edge+0.021
Charles+0.021
Frank+0.021
Neg logit contributions
 cr-0.011
?-0.011
 sew-0.010
 heaven-0.009
 forgiveness-0.009
Token dropped
Token ID5710
Feature activation-9.45e-03

Pos logit contributions
bi+0.003
 edge+0.003
Larry+0.003
ling+0.003
Charles+0.003
Neg logit contributions
 cr-0.002
?-0.002
 sew-0.002
 forgiveness-0.002
 report-0.002
Token a
Token ID257
Feature activation+5.48e-04

Pos logit contributions
bi+0.021
acking+0.020
Larry+0.020
Charles+0.020
 edge+0.020
Neg logit contributions
 cr-0.009
?-0.007
 L-0.007
 forgiveness-0.006
 sew-0.006
Token l
Token ID300
Feature activation-1.44e-02

Pos logit contributions
 cr+0.001
?+0.001
 sew+0.001
 forgiveness+0.000
 thread+0.000
Neg logit contributions
bi-0.001
arted-0.001
Larry-0.001
 valleys-0.001
eled-0.001
Tokenoll
Token ID692
Feature activation-6.22e-02

Pos logit contributions
bi+0.027
 edge+0.026
Larry+0.025
ling+0.025
acking+0.024
Neg logit contributions
 cr-0.015
 sew-0.014
?-0.014
 forgiveness-0.014
 report-0.014
Tokenipop
Token ID42800
Feature activation-1.71e-02

Pos logit contributions
bi+0.100
ling+0.090
mo+0.086
 edge+0.085
ging+0.083
Neg logit contributions
 sew-0.084
 forgiveness-0.081
 heaven-0.079
 cr-0.079
 report-0.079
Token from
Token ID422
Feature activation-8.08e-03

Pos logit contributions
bi+0.036
 edge+0.034
Larry+0.033
ling+0.033
Charles+0.032
Neg logit contributions
 cr-0.015
?-0.014
 sew-0.013
 forgiveness-0.013
 report-0.012
Token her
Token ID607
Feature activation+7.92e-04

Pos logit contributions
bi+0.017
 edge+0.015
Larry+0.015
ling+0.015
acking+0.015
Neg logit contributions
 cr-0.008
?-0.007
 sew-0.007
 forgiveness-0.007
 heaven-0.007
Token pocket
Token ID10000
Feature activation+1.69e-02

Pos logit contributions
 cr+0.001
?+0.001
 sew+0.001
 forgiveness+0.001
 grandma+0.001
Neg logit contributions
bi-0.001
Larry-0.001
ling-0.001
 edge-0.001
Frank-0.001
Token.
Token ID13
Feature activation+2.87e-03

Pos logit contributions
 cr+0.011
?+0.010
 sew+0.010
 forgiveness+0.010
 report+0.009
Neg logit contributions
bi-0.038
 edge-0.037
Larry-0.035
ling-0.035
Charles-0.035
Token the
Token ID262
Feature activation-2.91e-03

Pos logit contributions
bi+0.036
Charles+0.033
Frank+0.032
Larry+0.032
 edge+0.032
Neg logit contributions
 cr-0.017
?-0.015
 sew-0.014
 L-0.013
?"-0.013
Token cheese
Token ID9891
Feature activation-2.50e-02

Pos logit contributions
bi+0.007
Phil+0.006
Frank+0.006
Charles+0.006
Pat+0.006
Neg logit contributions
 cr-0.003
?-0.002
 sew-0.002
 ink-0.002
 glue-0.002
Token and
Token ID290
Feature activation-2.38e-03

Pos logit contributions
bi+0.052
Charles+0.047
eled+0.047
Frank+0.047
Larry+0.046
Neg logit contributions
 cr-0.027
?-0.024
 sew-0.023
 report-0.020
 later-0.020
Token add
Token ID751
Feature activation-1.26e-02

Pos logit contributions
bi+0.005
 edge+0.005
Larry+0.004
ling+0.004
Charles+0.004
Neg logit contributions
 cr-0.002
 sew-0.002
?-0.002
 forgiveness-0.002
 report-0.002
Token the
Token ID262
Feature activation+1.80e-03

Pos logit contributions
bi+0.026
Charles+0.024
 edge+0.024
Larry+0.024
Frank+0.024
Neg logit contributions
 cr-0.013
?-0.012
 sew-0.011
 L-0.010
 forgiveness-0.010
Token topp
Token ID23126
Feature activation-6.22e-02

Pos logit contributions
 cr+0.002
?+0.002
 sew+0.001
 report+0.001
 glue+0.001
Neg logit contributions
bi-0.004
Phil-0.004
arted-0.004
eled-0.004
Pat-0.004
Tokenings
Token ID654
Feature activation+2.89e-03

Pos logit contributions
bi+0.117
 distance+0.113
arted+0.112
Frank+0.112
 valleys+0.111
Neg logit contributions
 cr-0.077
?-0.065
 sew-0.057
/-0.057
?"-0.056
Token.
Token ID13
Feature activation-2.49e-03

Pos logit contributions
 cr+0.002
?+0.002
 sew+0.002
 L+0.001
 forgiveness+0.001
Neg logit contributions
bi-0.007
 edge-0.006
 sudden-0.006
Charles-0.006
Larry-0.006
Token Lily
Token ID20037
Feature activation+4.86e-03

Pos logit contributions
bi+0.006
 edge+0.005
Larry+0.005
ling+0.005
Charles+0.005
Neg logit contributions
 cr-0.002
?-0.002
 sew-0.002
 forgiveness-0.002
 report-0.001
Token and
Token ID290
Feature activation+1.24e-02

Pos logit contributions
 cr+0.005
?+0.005
 sew+0.005
 forgiveness+0.004
 report+0.004
Neg logit contributions
bi-0.010
 edge-0.009
Larry-0.009
ling-0.009
 valleys-0.009
Token Ben
Token ID3932
Feature activation+8.36e-03

Pos logit contributions
 cr+0.009
?+0.008
 sew+0.008
 forgiveness+0.007
 report+0.007
Neg logit contributions
bi-0.028
 edge-0.027
Larry-0.026
ling-0.026
 valleys-0.026
Token she
Token ID673
Feature activation-8.63e-03

Pos logit contributions
bi+0.009
 edge+0.009
 valleys+0.008
mo+0.008
Charles+0.008
Neg logit contributions
?-0.003
 cr-0.003
 forgiveness-0.002
 L-0.002
 forgive-0.002
Token handed
Token ID10158
Feature activation-6.73e-03

Pos logit contributions
bi+0.016
 edge+0.015
Larry+0.014
 valleys+0.014
 distance+0.014
Neg logit contributions
 cr-0.010
 sew-0.010
?-0.010
 forgiveness-0.009
 forgive-0.009
Token him
Token ID683
Feature activation-9.44e-03

Pos logit contributions
bi+0.012
 edge+0.012
Charles+0.011
 valleys+0.011
Larry+0.011
Neg logit contributions
 cr-0.009
?-0.008
 forgiveness-0.008
 sew-0.007
 forgive-0.007
Token a
Token ID257
Feature activation-2.93e-03

Pos logit contributions
bi+0.018
 edge+0.017
ling+0.016
Larry+0.016
 distance+0.016
Neg logit contributions
 cr-0.010
 sew-0.009
?-0.009
 forgiveness-0.009
 report-0.009
Token l
Token ID300
Feature activation-1.24e-02

Pos logit contributions
bi+0.005
 edge+0.005
Larry+0.005
ling+0.005
 valleys+0.005
Neg logit contributions
 cr-0.004
?-0.003
 sew-0.003
 forgiveness-0.003
 report-0.003
Tokenoll
Token ID692
Feature activation-6.19e-02

Pos logit contributions
bi+0.021
 edge+0.020
ling+0.019
Larry+0.019
mo+0.019
Neg logit contributions
 sew-0.015
 cr-0.014
 report-0.014
 forgiveness-0.014
?-0.014
Tokenipop
Token ID42800
Feature activation-1.91e-02

Pos logit contributions
bi+0.098
ling+0.087
mo+0.083
ging+0.080
 edge+0.079
Neg logit contributions
 sew-0.088
 heaven-0.084
 forgiveness-0.082
 report-0.081
 cr-0.080
Token.
Token ID13
Feature activation-6.85e-03

Pos logit contributions
bi+0.037
 edge+0.037
ling+0.035
 across+0.034
 bottom+0.034
Neg logit contributions
 cr-0.017
 sew-0.017
 forgiveness-0.017
?-0.016
 heaven-0.015
Token She
Token ID1375
Feature activation-2.46e-03

Pos logit contributions
bi+0.013
ling+0.012
 edge+0.012
mo+0.012
Charles+0.012
Neg logit contributions
?-0.007
 cr-0.007
 sew-0.007
 L-0.006
 forgiveness-0.006
Token smiled
Token ID13541
Feature activation-8.64e-03

Pos logit contributions
bi+0.005
 edge+0.004
ling+0.004
eled+0.004
 sudden+0.004
Neg logit contributions
?-0.003
 cr-0.002
 forgiveness-0.002
 report-0.002
 heaven-0.002
Token and
Token ID290
Feature activation+6.33e-03

Pos logit contributions
bi+0.017
 edge+0.016
Larry+0.016
ling+0.016
Charles+0.015
Neg logit contributions
 cr-0.008
?-0.008
 sew-0.008
 forgiveness-0.007
 report-0.007
Token mom
Token ID1995
Feature activation+4.05e-03

Pos logit contributions
 sew+0.001
 heaven+0.001
/+0.001
ist+0.001
 cr+0.001
Neg logit contributions
bi-0.001
 edge-0.001
Larry-0.001
 bottom-0.001
 across-0.001
Tokenmy
Token ID1820
Feature activation-8.32e-03

Pos logit contributions
 cr+0.007
?+0.007
 sew+0.007
 forgive+0.006
 forgiveness+0.006
Neg logit contributions
bi-0.005
 edge-0.005
Larry-0.005
Charles-0.005
 valleys-0.005
Token,
Token ID11
Feature activation-7.29e-03

Pos logit contributions
bi+0.021
 edge+0.020
Larry+0.020
 sudden+0.020
Charles+0.020
Neg logit contributions
 cr-0.004
?-0.004
 sew-0.003
 forgiveness-0.003
 forgive-0.003
Token "
Token ID366
Feature activation-1.34e-02

Pos logit contributions
bi+0.011
 edge+0.010
Larry+0.010
ling+0.009
Charles+0.009
Neg logit contributions
 cr-0.011
?-0.010
 sew-0.010
 forgiveness-0.010
 report-0.010
TokenWhat
Token ID2061
Feature activation+6.61e-03

Pos logit contributions
bi+0.015
 edge+0.014
ling+0.013
Larry+0.013
Charles+0.012
Neg logit contributions
 cr-0.025
?-0.024
 sew-0.024
 forgiveness-0.023
 report-0.022
Token is
Token ID318
Feature activation+3.50e-02

Pos logit contributions
 sew+0.010
 report+0.010
 forgiveness+0.009
ands+0.009
 heaven+0.009
Neg logit contributions
bi-0.008
Larry-0.008
 across-0.007
 giant-0.007
 bulls-0.007
Token this
Token ID428
Feature activation+3.52e-02

Pos logit contributions
 forgiveness+0.047
 forgive+0.046
 sew+0.044
ird+0.042
 cr+0.041
Neg logit contributions
 edge-0.053
bi-0.051
 across-0.051
 bottom-0.049
Larry-0.049
Token?"
Token ID1701
Feature activation-2.16e-03

Pos logit contributions
?+0.019
 cr+0.017
?"+0.016
 sew+0.013
 tonight+0.012
Neg logit contributions
bi-0.087
arted-0.083
eled-0.082
 edge-0.082
Larry-0.082
Token
Token ID198
Feature activation-5.44e-03

Pos logit contributions
bi+0.002
 edge+0.002
 sudden+0.002
eled+0.002
 middle+0.002
Neg logit contributions
 cr-0.004
 forgiveness-0.004
 report-0.004
 heaven-0.004
/-0.004
Token
Token ID198
Feature activation-1.89e-04

Pos logit contributions
bi+0.007
 edge+0.007
Larry+0.006
Frank+0.006
ling+0.006
Neg logit contributions
 cr-0.009
 sew-0.008
 forgiveness-0.008
 report-0.008
 heaven-0.008
Token"
Token ID1
Feature activation+3.84e-03

Pos logit contributions
bi+0.000
 edge+0.000
Larry+0.000
ling+0.000
Frank+0.000
Neg logit contributions
 cr-0.000
?-0.000
 forgiveness-0.000
 sew-0.000
 report-0.000
Token to
Token ID284
Feature activation+7.81e-03

Pos logit contributions
 cr+0.006
?+0.006
 sew+0.006
 forgiveness+0.006
 report+0.006
Neg logit contributions
bi-0.011
 edge-0.011
Larry-0.010
ling-0.010
Charles-0.010
Token himself
Token ID2241
Feature activation-1.43e-02

Pos logit contributions
 cr+0.015
?+0.015
 sew+0.015
 forgiveness+0.014
 report+0.014
Neg logit contributions
bi-0.009
 edge-0.007
ling-0.007
Larry-0.007
acking-0.007
Token,
Token ID11
Feature activation+1.63e-04

Pos logit contributions
bi+0.036
 edge+0.034
 distance+0.034
 valleys+0.034
 sudden+0.034
Neg logit contributions
?-0.007
 cr-0.006
 if-0.006
 whose-0.005
 don-0.005
Token "
Token ID366
Feature activation-5.99e-03

Pos logit contributions
 cr+0.000
 L+0.000
 if+0.000
 whose+0.000
 don+0.000
Neg logit contributions
bi-0.000
Larry-0.000
 edge-0.000
 depths-0.000
 distance-0.000
TokenI
Token ID40
Feature activation-6.48e-03

Pos logit contributions
bi+0.009
 edge+0.009
pering+0.008
 boulder+0.008
 bottom+0.008
Neg logit contributions
 cr-0.008
 sew-0.008
?-0.008
 someday-0.008
 whose-0.007
Token wish
Token ID4601
Feature activation+3.64e-02

Pos logit contributions
bi+0.011
 edge+0.010
Larry+0.010
ling+0.010
Charles+0.010
Neg logit contributions
 cr-0.008
 sew-0.007
 forgiveness-0.007
?-0.007
 report-0.007
Token I
Token ID314
Feature activation-4.90e-03

Pos logit contributions
 cr+0.023
?+0.023
 forgiveness+0.019
 sew+0.019
?"+0.017
Neg logit contributions
bi-0.087
 edge-0.081
eled-0.080
Larry-0.079
ling-0.079
Token could
Token ID714
Feature activation-7.37e-04

Pos logit contributions
bi+0.008
 edge+0.008
Larry+0.008
 distance+0.007
 across+0.007
Neg logit contributions
 cr-0.006
 forgiveness-0.006
 sew-0.006
?-0.005
 report-0.005
Token stay
Token ID2652
Feature activation+4.13e-03

Pos logit contributions
bi+0.002
ling+0.001
eled+0.001
 valleys+0.001
arted+0.001
Neg logit contributions
?-0.001
 sew-0.001
 cr-0.001
 forgive-0.001
 report-0.001
Token here
Token ID994
Feature activation-7.23e-03

Pos logit contributions
?+0.004
 cr+0.004
 sew+0.004
?"+0.004
 forgiveness+0.004
Neg logit contributions
bi-0.008
 edge-0.007
Larry-0.007
 valleys-0.007
eled-0.007
Token forever
Token ID8097
Feature activation+1.47e-02

Pos logit contributions
bi+0.016
eled+0.015
 edge+0.015
Larry+0.015
 sudden+0.014
Neg logit contributions
?-0.006
 cr-0.006
?"-0.005
 tonight-0.005
 someday-0.005
Token thought
Token ID1807
Feature activation+5.03e-03

Pos logit contributions
 cr+0.005
 forgiveness+0.005
?+0.005
 sew+0.005
 report+0.005
Neg logit contributions
bi-0.006
 edge-0.006
ling-0.005
Larry-0.005
 valleys-0.005
Token carefully
Token ID7773
Feature activation-1.42e-02

Pos logit contributions
 cr+0.005
 sew+0.005
 forgiveness+0.004
?+0.004
 heaven+0.004
Neg logit contributions
 edge-0.009
bi-0.009
ling-0.009
 across-0.009
 crashing-0.009
Token about
Token ID546
Feature activation+3.54e-03

Pos logit contributions
 edge+0.024
bi+0.024
 across+0.023
ling+0.023
 onward+0.022
Neg logit contributions
 cr-0.014
 forgiveness-0.014
 sew-0.014
 heaven-0.014
 report-0.013
Token what
Token ID644
Feature activation-9.92e-03

Pos logit contributions
 cr+0.004
?+0.004
 sew+0.004
 forgiveness+0.003
 report+0.003
Neg logit contributions
bi-0.007
 edge-0.006
Larry-0.006
ling-0.006
Charles-0.006
Token to
Token ID284
Feature activation+7.12e-03

Pos logit contributions
bi+0.015
 edge+0.014
Larry+0.013
ling+0.013
Charles+0.013
Neg logit contributions
 cr-0.014
 sew-0.014
?-0.014
 forgiveness-0.013
 report-0.013
Token wish
Token ID4601
Feature activation+3.98e-02

Pos logit contributions
 cr+0.008
/+0.008
 heaven+0.008
 whose+0.008
arium+0.008
Neg logit contributions
 edge-0.012
 bottom-0.011
bi-0.011
ling-0.010
 distance-0.010
Token for
Token ID329
Feature activation+5.03e-03

Pos logit contributions
 cr+0.036
?+0.034
 sew+0.033
 forgiveness+0.032
 report+0.030
Neg logit contributions
bi-0.081
 edge-0.077
Larry-0.075
ling-0.074
Charles-0.073
Token.
Token ID13
Feature activation+7.27e-03

Pos logit contributions
 sew+0.006
 cr+0.006
/+0.005
 report+0.005
ems+0.005
Neg logit contributions
 edge-0.008
Larry-0.007
 bottom-0.007
bi-0.007
 across-0.007
Token Suddenly
Token ID24975
Feature activation+6.72e-03

Pos logit contributions
?+0.011
 cr+0.011
 grandma+0.011
 sew+0.011
 forgiveness+0.011
Neg logit contributions
bi-0.010
ling-0.009
 edge-0.009
Larry-0.009
ging-0.009
Token,
Token ID11
Feature activation+4.44e-04

Pos logit contributions
 cr+0.005
?+0.005
 sew+0.004
 forgiveness+0.004
 grandma+0.004
Neg logit contributions
bi-0.015
 edge-0.014
acking-0.014
Larry-0.014
ling-0.014
Token he
Token ID339
Feature activation+5.03e-04

Pos logit contributions
 cr+0.001
?+0.001
 forgiveness+0.001
 sew+0.000
 grandma+0.000
Neg logit contributions
bi-0.001
 edge-0.001
acking-0.001
ling-0.001
Larry-0.001
Token the
Token ID262
Feature activation-5.88e-03

Pos logit contributions
bi+0.026
Larry+0.023
acking+0.023
Charles+0.023
 edge+0.023
Neg logit contributions
 cr-0.010
 sew-0.009
?-0.009
 L-0.008
 forgiveness-0.008
Token horizon
Token ID17810
Feature activation+3.59e-03

Pos logit contributions
bi+0.010
acking+0.009
Larry+0.009
Phil+0.009
 onward+0.009
Neg logit contributions
 cr-0.009
 grandma-0.008
?-0.008
 sew-0.008
 art-0.007
Token.
Token ID13
Feature activation+3.40e-03

Pos logit contributions
 cr+0.002
 sew+0.002
?+0.002
 someday+0.002
 later+0.002
Neg logit contributions
bi-0.009
Phil-0.008
Larry-0.008
Frank-0.008
Charles-0.008
Token In
Token ID554
Feature activation-3.70e-05

Pos logit contributions
 L+0.004
?+0.004
 Sp+0.004
 sew+0.003
 Lily+0.003
Neg logit contributions
ling-0.007
bi-0.006
lling-0.006
ging-0.006
bling-0.006
Token the
Token ID262
Feature activation+7.13e-03

Pos logit contributions
bi+0.000
ling+0.000
acking+0.000
Larry+0.000
umbling+0.000
Neg logit contributions
 cr-0.000
?-0.000
 grandma-0.000
 forgiveness-0.000
 L-0.000
Token distance
Token ID5253
Feature activation+3.76e-02

Pos logit contributions
 cr+0.009
?+0.008
 grandma+0.007
 ink+0.007
 paper+0.007
Neg logit contributions
bi-0.013
arted-0.013
pered-0.013
Frank-0.013
Phil-0.013
Token,
Token ID11
Feature activation+2.20e-03

Pos logit contributions
 cr+0.020
?+0.020
 L+0.018
 if+0.016
 we+0.014
Neg logit contributions
bi-0.092
acking-0.090
Charles-0.089
Larry-0.089
Phil-0.089
Token she
Token ID673
Feature activation+4.45e-03

Pos logit contributions
 cr+0.002
?+0.002
 sew+0.002
 L+0.002
 don+0.002
Neg logit contributions
bi-0.005
acking-0.005
 edge-0.005
eled-0.005
ling-0.005
Token saw
Token ID2497
Feature activation-7.29e-04

Pos logit contributions
 cr+0.006
?+0.005
 sew+0.005
 scrib+0.005
 L+0.005
Neg logit contributions
bi-0.008
 edge-0.007
Larry-0.007
Frank-0.007
acking-0.007
Token something
Token ID1223
Feature activation+1.40e-02

Pos logit contributions
bi+0.002
acking+0.002
ling+0.002
Larry+0.002
 edge+0.002
Neg logit contributions
 cr-0.001
?-0.000
 forgiveness-0.000
 grandma-0.000
 whose-0.000
Token shining
Token ID22751
Feature activation-1.62e-02

Pos logit contributions
 cr+0.015
?+0.015
 sew+0.013
 whose+0.013
 forgiveness+0.012
Neg logit contributions
bi-0.028
acking-0.026
Charles-0.025
Larry-0.025
eled-0.025
Token out
Token ID503
Feature activation+9.14e-03

Pos logit contributions
 cr+0.016
?+0.015
 sew+0.014
 someday+0.013
 instead+0.013
Neg logit contributions
bi-0.024
Frank-0.021
eled-0.021
Larry-0.021
Charles-0.021
Token proudly
Token ID23017
Feature activation+5.95e-03

Pos logit contributions
 cr+0.009
?+0.008
 forgiveness+0.007
 sew+0.007
 L+0.007
Neg logit contributions
bi-0.019
Charles-0.017
Larry-0.017
Frank-0.017
 edge-0.017
Token and
Token ID290
Feature activation-2.19e-03

Pos logit contributions
 cr+0.005
?+0.005
 sew+0.005
 forgiveness+0.004
 someday+0.004
Neg logit contributions
bi-0.013
 edge-0.012
Larry-0.012
Frank-0.012
Charles-0.012
Token say
Token ID910
Feature activation-7.15e-03

Pos logit contributions
bi+0.004
acking+0.004
 edge+0.004
Frank+0.004
umbling+0.004
Neg logit contributions
 cr-0.002
 sew-0.002
?-0.002
 forgiveness-0.002
 forgive-0.002
Token �
Token ID6184
Feature activation+1.82e-02

Pos logit contributions
bi+0.014
 edge+0.013
ling+0.013
 distance+0.013
eled+0.013
Neg logit contributions
 cr-0.008
?-0.007
 sew-0.007
 forgiveness-0.007
 L-0.006
Token�
Token ID95
Feature activation+3.60e-02

Pos logit contributions
 cr+0.029
 forgiveness+0.027
 grandma+0.025
?+0.025
 if+0.024
Neg logit contributions
 edge-0.024
acking-0.024
pering-0.023
bi-0.022
 peak-0.022
Token€
Token ID26391
Feature activation+2.56e-03

Pos logit contributions
 cr+0.061
 forgiveness+0.057
?+0.054
 grandma+0.054
 if+0.052
Neg logit contributions
 edge-0.043
acking-0.042
bi-0.042
Larry-0.040
zi-0.039
Token�
Token ID129
Feature activation+1.01e-02

Pos logit contributions
 cr+0.004
?+0.004
 sew+0.003
 forgiveness+0.003
 report+0.003
Neg logit contributions
bi-0.004
 edge-0.004
Larry-0.003
ling-0.003
Charles-0.003
Token�
Token ID241
Feature activation+2.12e-02

Pos logit contributions
 sew+0.015
?+0.015
 cr+0.015
 report+0.015
/+0.014
Neg logit contributions
bi-0.014
Larry-0.012
 distance-0.012
 edge-0.012
 valleys-0.012
TokenLook
Token ID8567
Feature activation-7.33e-03

Pos logit contributions
 cr+0.027
?+0.027
 L+0.026
 sew+0.024
 whose+0.024
Neg logit contributions
bi-0.036
 edge-0.034
ling-0.034
acking-0.032
Larry-0.031
Token mummy
Token ID38567
Feature activation+8.09e-03

Pos logit contributions
bi+0.015
 edge+0.014
 distance+0.014
Larry+0.014
 valleys+0.014
Neg logit contributions
 cr-0.007
?-0.007
 sew-0.006
 forgiveness-0.006
 L-0.006
Token was
Token ID373
Feature activation+7.88e-04

Pos logit contributions
 report+0.005
 forgiveness+0.004
 heaven+0.004
?+0.004
 sew+0.004
Neg logit contributions
bi-0.003
He-0.003
 scrambled-0.003
 edge-0.003
eled-0.003
Token glad
Token ID9675
Feature activation-4.91e-03

Pos logit contributions
 cr+0.001
?+0.001
 sew+0.001
 forgiveness+0.001
 report+0.001
Neg logit contributions
bi-0.001
 edge-0.001
Larry-0.001
ling-0.001
Charles-0.001
Token that
Token ID326
Feature activation-1.26e-02

Pos logit contributions
bi+0.009
 edge+0.008
Larry+0.008
ling+0.008
Frank+0.008
Neg logit contributions
 cr-0.005
 sew-0.005
 report-0.005
?-0.005
ird-0.005
Token she
Token ID673
Feature activation+3.10e-03

Pos logit contributions
bi+0.023
 edge+0.021
Larry+0.021
 bottom+0.021
 onward+0.020
Neg logit contributions
 cr-0.013
 sew-0.012
 report-0.012
?-0.012
/-0.011
Token improved
Token ID6596
Feature activation-5.24e-04

Pos logit contributions
 forgiveness+0.005
 report+0.005
 cr+0.005
 heaven+0.005
 charity+0.005
Neg logit contributions
ling-0.003
bi-0.003
 across-0.003
amed-0.003
He-0.003
Token her
Token ID607
Feature activation+1.38e-03

Pos logit contributions
bi+0.001
 edge+0.001
ling+0.001
Larry+0.001
 distance+0.001
Neg logit contributions
 cr-0.001
 sew-0.001
?-0.001
 forgiveness-0.001
 report-0.001
Token knitting
Token ID41577
Feature activation-3.62e-03

Pos logit contributions
ird+0.002
 tonight+0.002
aund+0.002
?+0.002
 L+0.001
Neg logit contributions
 edge-0.002
bi-0.002
 bottom-0.002
ling-0.002
 distance-0.002
Token.
Token ID13
Feature activation-6.66e-03

Pos logit contributions
bi+0.007
 edge+0.007
ling+0.006
 distance+0.006
Larry+0.006
Neg logit contributions
 cr-0.004
?-0.004
 sew-0.003
 forgiveness-0.003
 report-0.003
Token She
Token ID1375
Feature activation+3.33e-03

Pos logit contributions
bi+0.007
He+0.006
Larry+0.006
 edge+0.006
Pat+0.006
Neg logit contributions
 cr-0.012
 report-0.012
 forgiveness-0.012
 sew-0.012
 scrib-0.011
Token was
Token ID373
Feature activation+7.71e-04

Pos logit contributions
 report+0.006
 forgiveness+0.006
 heaven+0.005
 tonight+0.005
 charity+0.005
Neg logit contributions
bi-0.004
He-0.004
 scrambled-0.003
eled-0.003
 edge-0.003
Token glad
Token ID9675
Feature activation-5.94e-03

Pos logit contributions
 cr+0.001
 sew+0.001
?+0.001
 forgiveness+0.001
 report+0.001
Neg logit contributions
bi-0.001
 edge-0.001
Larry-0.001
ling-0.001
Charles-0.001
Token
Token ID198
Feature activation-3.30e-03

Pos logit contributions
ling+0.015
ging+0.014
lling+0.014
bling+0.014
bi+0.014
Neg logit contributions
 L-0.012
 Lily-0.012
 Grand-0.010
 Sp-0.010
?-0.010
Token
Token ID198
Feature activation+1.54e-03

Pos logit contributions
bi+0.007
eled+0.006
 edge+0.006
ling+0.006
mo+0.006
Neg logit contributions
?-0.003
 cr-0.003
 sew-0.003
?"-0.003
 L-0.003
TokenSuddenly
Token ID38582
Feature activation+3.45e-03

Pos logit contributions
 sew+0.002
?+0.002
L+0.002
 cr+0.002
 forgive+0.002
Neg logit contributions
bi-0.003
eled-0.002
mo-0.002
zi-0.002
arted-0.002
Token,
Token ID11
Feature activation-2.74e-03

Pos logit contributions
 cr+0.002
?+0.002
 sew+0.002
 forgiveness+0.002
 L+0.002
Neg logit contributions
bi-0.008
 edge-0.008
ling-0.007
Larry-0.007
acking-0.007
Token his
Token ID465
Feature activation-7.29e-03

Pos logit contributions
bi+0.006
 edge+0.005
ling+0.005
Larry+0.005
acking+0.005
Neg logit contributions
 cr-0.003
?-0.003
 sew-0.002
 forgiveness-0.002
 report-0.002
Token wife
Token ID3656
Feature activation+1.04e-02

Pos logit contributions
bi+0.013
 valleys+0.013
Frank+0.013
mo+0.013
Larry+0.013
Neg logit contributions
 cr-0.008
 grandma-0.008
?-0.008
 report-0.007
 forgiveness-0.007
Token started
Token ID2067
Feature activation-1.10e-02

Pos logit contributions
 cr+0.010
 L+0.009
?+0.009
 scrib+0.009
 drew+0.008
Neg logit contributions
bi-0.022
 depths-0.021
umbling-0.021
 edge-0.021
Larry-0.021
Token running
Token ID2491
Feature activation-1.61e-02

Pos logit contributions
bi+0.025
 summit+0.024
Larry+0.024
 valleys+0.024
Frank+0.024
Neg logit contributions
 cr-0.009
 scrib-0.008
 L-0.008
?-0.008
 sew-0.007
Token faster
Token ID5443
Feature activation-8.98e-03

Pos logit contributions
bi+0.030
Larry+0.028
eled+0.027
Frank+0.027
Charles+0.027
Neg logit contributions
?-0.019
 cr-0.018
 L-0.018
 instead-0.017
 if-0.017
Token,
Token ID11
Feature activation-7.57e-03

Pos logit contributions
bi+0.022
Larry+0.020
Frank+0.020
Charles+0.020
 edge+0.019
Neg logit contributions
?-0.006
 cr-0.006
 L-0.005
 someday-0.005
 if-0.004
Token and
Token ID290
Feature activation-3.85e-03

Pos logit contributions
bi+0.017
Larry+0.016
eled+0.016
Frank+0.015
Charles+0.015
Neg logit contributions
 L-0.007
 cr-0.007
 where-0.006
 don-0.006
 instead-0.006
Token and
Token ID290
Feature activation+5.65e-03

Pos logit contributions
 forgiveness+0.015
 cr+0.015
 sew+0.015
?+0.015
 report+0.014
Neg logit contributions
bi-0.023
 edge-0.021
Larry-0.020
ling-0.020
acking-0.020
Token Lily
Token ID20037
Feature activation+2.47e-04

Pos logit contributions
 cr+0.003
?+0.003
 sew+0.003
 forgiveness+0.002
 report+0.002
Neg logit contributions
bi-0.014
 edge-0.013
Larry-0.013
ling-0.013
 distance-0.012
Token liked
Token ID8288
Feature activation+1.74e-03

Pos logit contributions
 forgiveness+0.000
?+0.000
 cr+0.000
 heaven+0.000
 report+0.000
Neg logit contributions
bi-0.000
 edge-0.000
ling-0.000
 scrambled-0.000
Larry-0.000
Token to
Token ID284
Feature activation-8.89e-03

Pos logit contributions
 cr+0.002
?+0.002
 sew+0.002
 forgiveness+0.001
 report+0.001
Neg logit contributions
bi-0.004
 edge-0.003
ling-0.003
Larry-0.003
Charles-0.003
Token decor
Token ID11705
Feature activation-3.32e-06

Pos logit contributions
bi+0.016
 edge+0.016
 bottom+0.015
Larry+0.014
 peak+0.014
Neg logit contributions
 cr-0.009
 forgiveness-0.009
?-0.008
/-0.008
arium-0.008
Tokenate
Token ID378
Feature activation+3.41e-03

Feature not active on prev token;
no predictions were affected.
Token the
Token ID262
Feature activation+2.33e-03

Pos logit contributions
 cr+0.003
?+0.003
 sew+0.003
 forgiveness+0.003
 report+0.003
Neg logit contributions
bi-0.007
 edge-0.007
Larry-0.006
Charles-0.006
Frank-0.006
Token pizza
Token ID14256
Feature activation-2.47e-02

Pos logit contributions
 cr+0.002
?+0.002
 report+0.002
 grandma+0.002
 sew+0.002
Neg logit contributions
bi-0.005
arted-0.005
Pat-0.005
Larry-0.005
eled-0.005
Token.
Token ID13
Feature activation-9.13e-03

Pos logit contributions
bi+0.049
 edge+0.047
ling+0.045
Larry+0.045
 distance+0.044
Neg logit contributions
 cr-0.023
 sew-0.022
 forgiveness-0.022
?-0.022
 report-0.020
Token They
Token ID1119
Feature activation+3.78e-03

Pos logit contributions
bi+0.016
 edge+0.015
ling+0.014
Larry+0.014
Charles+0.014
Neg logit contributions
 cr-0.011
?-0.011
 sew-0.011
 forgiveness-0.010
 report-0.010
Token made
Token ID925
Feature activation-3.19e-03

Pos logit contributions
 forgiveness+0.005
?+0.005
 report+0.004
 heaven+0.004
 sew+0.004
Neg logit contributions
bi-0.006
 edge-0.006
Larry-0.005
ling-0.005
eled-0.005
TokenM
Token ID44
Feature activation-3.42e-03

Pos logit contributions
bi+0.008
ling+0.007
arted+0.007
mo+0.007
ging+0.007
Neg logit contributions
 cr-0.006
?-0.006
 L-0.006
 sew-0.006
 later-0.005
Tokenia
Token ID544
Feature activation+7.86e-03

Pos logit contributions
bi+0.006
 edge+0.006
Larry+0.005
ling+0.005
 valleys+0.005
Neg logit contributions
 cr-0.004
?-0.004
 sew-0.004
 forgiveness-0.004
 report-0.004
Token says
Token ID1139
Feature activation-9.99e-03

Pos logit contributions
 cr+0.006
?+0.006
 sew+0.005
/+0.005
 forgiveness+0.005
Neg logit contributions
bi-0.017
Frank-0.016
 distance-0.016
Larry-0.016
 edge-0.016
Token,
Token ID11
Feature activation+9.46e-04

Pos logit contributions
bi+0.018
 edge+0.018
Larry+0.017
ling+0.017
 onward+0.017
Neg logit contributions
 sew-0.010
 cr-0.010
 heaven-0.010
 report-0.010
?-0.009
Token "
Token ID366
Feature activation-1.02e-02

Pos logit contributions
 cr+0.001
?+0.001
 sew+0.001
 forgiveness+0.001
 L+0.001
Neg logit contributions
bi-0.002
 edge-0.001
ling-0.001
Larry-0.001
 distance-0.001
TokenNo
Token ID2949
Feature activation-2.90e-03

Pos logit contributions
bi+0.017
mo+0.015
ling+0.015
 edge+0.015
acking+0.014
Neg logit contributions
 cr-0.014
 sew-0.014
?-0.013
 don-0.013
 forgiveness-0.013
Token,
Token ID11
Feature activation+3.09e-03

Pos logit contributions
bi+0.006
 edge+0.006
Larry+0.005
ling+0.005
 bottom+0.005
Neg logit contributions
 cr-0.003
 sew-0.003
?-0.002
 forgiveness-0.002
 heaven-0.002
Token Tom
Token ID4186
Feature activation-2.31e-03

Pos logit contributions
 cr+0.003
 sew+0.003
?+0.003
 forgiveness+0.003
 report+0.003
Neg logit contributions
bi-0.006
 edge-0.006
Larry-0.006
ling-0.005
Charles-0.005
Token.
Token ID13
Feature activation-3.21e-04

Pos logit contributions
bi+0.005
 edge+0.004
ling+0.004
Larry+0.004
acking+0.004
Neg logit contributions
 cr-0.002
 sew-0.002
 forgiveness-0.002
 report-0.002
 heaven-0.002
Token This
Token ID770
Feature activation-1.62e-02

Pos logit contributions
bi+0.001
 edge+0.001
Larry+0.000
Charles+0.000
ling+0.000
Neg logit contributions
 cr-0.000
?-0.000
 sew-0.000
 forgiveness-0.000
 report-0.000
Token is
Token ID318
Feature activation+1.40e-02

Pos logit contributions
bi+0.031
 edge+0.030
Larry+0.029
ling+0.029
 valleys+0.028
Neg logit contributions
 cr-0.016
 sew-0.015
 forgiveness-0.015
?-0.015
 heaven-0.014
Token her
Token ID607
Feature activation-2.95e-03

Pos logit contributions
bi+0.008
 edge+0.008
Larry+0.008
ling+0.007
eled+0.007
Neg logit contributions
 cr-0.006
 sew-0.005
?-0.005
 report-0.005
 forgiveness-0.005
Token adventure
Token ID8855
Feature activation+5.87e-03

Pos logit contributions
bi+0.004
 edge+0.004
 bottom+0.004
ling+0.004
 sudden+0.004
Neg logit contributions
 sew-0.004
 cr-0.004
?-0.004
 heaven-0.004
ems-0.004
Token!
Token ID0
Feature activation+4.17e-03

Pos logit contributions
 cr+0.004
?+0.003
 sew+0.003
 forgiveness+0.003
 report+0.003
Neg logit contributions
bi-0.014
 edge-0.013
Larry-0.013
Charles-0.013
ling-0.013
Token<|endoftext|>
Token ID50256
Feature activation-3.65e-03

Pos logit contributions
 cr+0.006
 sew+0.006
?+0.006
 forgiveness+0.006
 report+0.006
Neg logit contributions
bi-0.006
 edge-0.006
Larry-0.006
Frank-0.005
Charles-0.005
TokenOnce
Token ID7454
Feature activation+8.68e-03

Pos logit contributions
bi+0.007
ling+0.006
 edge+0.006
eled+0.006
 sudden+0.006
Neg logit contributions
?-0.004
 cr-0.004
 L-0.004
?"-0.004
 sew-0.003
Token upon
Token ID2402
Feature activation+1.43e-02

Pos logit contributions
 cr+0.014
 sew+0.013
 forgiveness+0.013
?+0.013
 report+0.013
Neg logit contributions
bi-0.012
 edge-0.011
ling-0.010
Larry-0.010
 distance-0.010
Token a
Token ID257
Feature activation+1.73e-03

Pos logit contributions
 cr+0.015
 forgiveness+0.015
 sew+0.015
 report+0.014
?+0.014
Neg logit contributions
bi-0.025
 edge-0.024
Larry-0.024
He-0.023
Frank-0.023
Token time
Token ID640
Feature activation-4.70e-03

Pos logit contributions
 cr+0.003
 sew+0.003
 forgiveness+0.003
?+0.003
 report+0.002
Neg logit contributions
bi-0.002
 edge-0.002
Larry-0.002
ling-0.002
Charles-0.002
Token,
Token ID11
Feature activation-7.14e-03

Pos logit contributions
bi+0.009
 edge+0.009
Larry+0.009
ling+0.009
Charles+0.008
Neg logit contributions
 cr-0.004
 sew-0.004
 forgiveness-0.004
?-0.004
 report-0.004
Token there
Token ID612
Feature activation-1.21e-02

Pos logit contributions
bi+0.014
 edge+0.013
acking+0.013
 streets+0.013
Charles+0.013
Neg logit contributions
?-0.007
 cr-0.007
 sew-0.006
?"-0.006
 L-0.006
Token was
Token ID373
Feature activation+8.34e-03

Pos logit contributions
bi+0.021
 edge+0.019
Larry+0.018
ling+0.018
Charles+0.018
Neg logit contributions
 cr-0.015
?-0.015
 sew-0.014
 forgiveness-0.014
 report-0.013
Token time
Token ID640
Feature activation-5.89e-03

Pos logit contributions
bi+0.002
 edge+0.002
Larry+0.002
ling+0.002
 valleys+0.002
Neg logit contributions
 cr-0.002
?-0.002
 sew-0.002
 forgiveness-0.002
 heaven-0.002
Token,
Token ID11
Feature activation-6.72e-03

Pos logit contributions
bi+0.012
 edge+0.011
Larry+0.011
ling+0.011
Charles+0.011
Neg logit contributions
 cr-0.005
 sew-0.005
 forgiveness-0.005
?-0.005
 report-0.005
Token a
Token ID257
Feature activation+4.09e-05

Pos logit contributions
bi+0.014
 streets+0.013
 edge+0.013
 distance+0.013
acking+0.013
Neg logit contributions
?-0.006
 cr-0.006
 whose-0.005
?"-0.005
 L-0.005
Token little
Token ID1310
Feature activation+9.59e-03

Pos logit contributions
 cr+0.000
?+0.000
 sew+0.000
 grandma+0.000
 L+0.000
Neg logit contributions
bi-0.000
 edge-0.000
arted-0.000
 valleys-0.000
eled-0.000
Token girl
Token ID2576
Feature activation-4.69e-03

Pos logit contributions
 cr+0.014
?+0.013
 sew+0.012
 L+0.012
 grandma+0.012
Neg logit contributions
bi-0.015
 edge-0.015
 summit-0.014
Larry-0.014
arted-0.014
Token wanted
Token ID2227
Feature activation-4.86e-03

Pos logit contributions
bi+0.010
 edge+0.009
eled+0.009
 sudden+0.009
ling+0.009
Neg logit contributions
 forgiveness-0.004
 cr-0.004
 sew-0.004
 report-0.003
?-0.003
Token to
Token ID284
Feature activation-3.98e-03

Pos logit contributions
bi+0.009
 edge+0.008
Larry+0.008
ling+0.008
 distance+0.008
Neg logit contributions
 cr-0.005
?-0.005
 sew-0.005
 forgiveness-0.005
 report-0.005
Token remember
Token ID3505
Feature activation+1.44e-03

Pos logit contributions
 edge+0.007
bi+0.007
 bottom+0.006
Larry+0.006
 distance+0.006
Neg logit contributions
 cr-0.004
 forgiveness-0.004
/-0.004
?-0.004
 whose-0.004
Token something
Token ID1223
Feature activation+1.38e-02

Pos logit contributions
 cr+0.001
 sew+0.001
/+0.001
 forgiveness+0.001
?+0.001
Neg logit contributions
bi-0.003
 edge-0.003
Larry-0.002
 across-0.002
Charles-0.002
Token really
Token ID1107
Feature activation-1.09e-02

Pos logit contributions
 cr+0.016
 sew+0.015
?+0.015
 forgiveness+0.015
 report+0.014
Neg logit contributions
bi-0.025
 edge-0.023
ling-0.022
Larry-0.022
 distance-0.022
Token important
Token ID1593
Feature activation+3.82e-04

Pos logit contributions
bi+0.018
 edge+0.016
ling+0.016
Larry+0.015
 across+0.015
Neg logit contributions
 cr-0.014
 forgiveness-0.014
 sew-0.014
?-0.013
 heaven-0.013
Token man
Token ID582
Feature activation+1.35e-03

Pos logit contributions
bi+0.046
 edge+0.043
Larry+0.042
ling+0.042
Charles+0.041
Neg logit contributions
 cr-0.019
?-0.018
 sew-0.017
 forgiveness-0.016
 report-0.016
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
 cr+0.001
?+0.001
 sew+0.001
 forgiveness+0.001
 report+0.001
Neg logit contributions
bi-0.003
 edge-0.002
Larry-0.002
ling-0.002
Charles-0.002
Token He
Token ID679
Feature activation-3.37e-03

Pos logit contributions
bi+0.009
 edge+0.008
He+0.008
Larry+0.007
Charles+0.007
Neg logit contributions
 cr-0.025
 sew-0.025
 forgiveness-0.025
?-0.024
 report-0.024
Token had
Token ID550
Feature activation+7.23e-03

Pos logit contributions
bi+0.005
 edge+0.005
Larry+0.005
ling+0.005
Charles+0.005
Neg logit contributions
 cr-0.005
 sew-0.004
?-0.004
 forgiveness-0.004
 report-0.004
Token a
Token ID257
Feature activation+3.29e-03

Pos logit contributions
 cr+0.008
?+0.007
 sew+0.007
 forgiveness+0.007
 report+0.006
Neg logit contributions
bi-0.014
 edge-0.013
Larry-0.013
ling-0.012
Charles-0.012
Token big
Token ID1263
Feature activation-1.10e-02

Pos logit contributions
 cr+0.005
?+0.004
 report+0.004
 sew+0.004
 forgiveness+0.004
Neg logit contributions
bi-0.006
arted-0.005
ling-0.005
eled-0.005
 edge-0.005
Token box
Token ID3091
Feature activation-2.96e-03

Pos logit contributions
bi+0.019
 edge+0.018
Larry+0.017
ling+0.017
 valleys+0.017
Neg logit contributions
 cr-0.014
?-0.013
 sew-0.013
 forgiveness-0.012
 report-0.012
Token with
Token ID351
Feature activation-1.19e-02

Pos logit contributions
bi+0.006
 edge+0.006
ling+0.005
Larry+0.005
 distance+0.005
Neg logit contributions
 cr-0.003
 sew-0.003
 forgiveness-0.003
?-0.003
 report-0.002
Token the
Token ID262
Feature activation-1.64e-03

Pos logit contributions
bi+0.025
 edge+0.023
ling+0.023
Larry+0.023
acking+0.023
Neg logit contributions
 cr-0.012
?-0.010
 forgiveness-0.009
 sew-0.009
 L-0.009
Token deep
Token ID2769
Feature activation+1.16e-02

Pos logit contributions
bi+0.003
arted+0.003
Larry+0.003
Pat+0.003
eled+0.003
Neg logit contributions
 cr-0.002
?-0.002
 forgiveness-0.001
 report-0.001
 grandma-0.001
Token pizza
Token ID14256
Feature activation-2.63e-02

Pos logit contributions
 cr+0.015
?+0.013
 heaven+0.012
 sew+0.012
 report+0.012
Neg logit contributions
bi-0.021
Larry-0.020
Phil-0.020
Pat-0.020
Frank-0.020
Token proud
Token ID6613
Feature activation-1.55e-03

Pos logit contributions
bi+0.007
 edge+0.006
ling+0.006
Larry+0.006
 sudden+0.006
Neg logit contributions
 cr-0.008
 sew-0.008
 forgiveness-0.008
?-0.008
 heaven-0.008
Token of
Token ID286
Feature activation-2.74e-02

Pos logit contributions
bi+0.002
 across+0.002
 sudden+0.002
ling+0.002
 streets+0.002
Neg logit contributions
 sew-0.002
 report-0.002
 forgiveness-0.002
 cr-0.002
 heaven-0.002
Token her
Token ID607
Feature activation-6.08e-03

Pos logit contributions
bi+0.051
 edge+0.048
Larry+0.047
ling+0.046
Charles+0.045
Neg logit contributions
 cr-0.030
 sew-0.028
?-0.028
 forgiveness-0.027
 report-0.026
Token daughter
Token ID4957
Feature activation-1.41e-02

Pos logit contributions
bi+0.011
 edge+0.010
ling+0.010
 streets+0.010
 bottom+0.010
Neg logit contributions
 sew-0.006
 cr-0.006
?-0.006
 forgiveness-0.006
 L-0.005
Token.
Token ID13
Feature activation-8.59e-03

Pos logit contributions
bi+0.029
 edge+0.028
ling+0.027
Larry+0.026
 across+0.026
Neg logit contributions
 cr-0.012
 sew-0.011
?-0.011
 forgiveness-0.011
 report-0.010
Token
Token ID198
Feature activation-7.59e-03

Pos logit contributions
bi+0.014
 edge+0.013
ling+0.013
Larry+0.013
Charles+0.013
Neg logit contributions
 cr-0.011
?-0.011
 sew-0.010
 forgiveness-0.010
 L-0.010
Token
Token ID198
Feature activation-2.69e-03

Pos logit contributions
bi+0.013
 edge+0.012
Larry+0.011
ling+0.011
Charles+0.011
Neg logit contributions
 cr-0.010
?-0.009
 sew-0.009
 forgiveness-0.009
 report-0.009
TokenOne
Token ID3198
Feature activation-3.03e-03

Pos logit contributions
bi+0.004
 edge+0.003
ling+0.003
Larry+0.003
eled+0.003
Neg logit contributions
 cr-0.004
?-0.004
 sew-0.004
 forgiveness-0.004
 heaven-0.004
Token day
Token ID1110
Feature activation-4.41e-03

Pos logit contributions
bi+0.003
ling+0.003
 edge+0.003
 distance+0.003
Larry+0.003
Neg logit contributions
 sew-0.006
 cr-0.005
 forgiveness-0.005
/-0.005
 report-0.005
Token,
Token ID11
Feature activation+2.16e-03

Pos logit contributions
bi+0.007
 edge+0.007
ling+0.007
 across+0.006
 sudden+0.006
Neg logit contributions
 sew-0.006
 report-0.006
 forgiveness-0.006
 heaven-0.005
 cr-0.005
Token Molly
Token ID30236
Feature activation+2.73e-03

Pos logit contributions
 report+0.003
 sew+0.003
 forgiveness+0.003
/+0.003
 fixes+0.003
Neg logit contributions
bi-0.003
 edge-0.003
Larry-0.003
 sudden-0.003
ling-0.003
Token weigh
Token ID10164
Feature activation-1.32e-02

Pos logit contributions
 sew+0.005
 cr+0.005
?+0.005
 if+0.004
 report+0.004
Neg logit contributions
Phil-0.012
Charles-0.012
Larry-0.012
Pat-0.012
Frank-0.012
Token the
Token ID262
Feature activation-6.87e-03

Pos logit contributions
bi+0.027
rolling+0.027
 distance+0.026
Larry+0.026
Charles+0.026
Neg logit contributions
 cr-0.012
?-0.011
 later-0.010
 if-0.010
 L-0.010
Token apples
Token ID22514
Feature activation-3.26e-02

Pos logit contributions
Phil+0.015
Pat+0.014
arted+0.014
Larry+0.013
Steve+0.013
Neg logit contributions
 cr-0.009
 glue-0.007
 tape-0.007
?-0.007
 sew-0.006
Token.
Token ID13
Feature activation+5.25e-03

Pos logit contributions
bi+0.080
Larry+0.077
Charles+0.076
 sudden+0.076
eled+0.075
Neg logit contributions
 cr-0.018
?-0.016
 sew-0.015
 later-0.013
 instead-0.013
Token "
Token ID366
Feature activation-2.61e-03

Pos logit contributions
 cr+0.007
 sew+0.007
?+0.007
 L+0.007
 forgiveness+0.006
Neg logit contributions
bi-0.008
ling-0.008
Larry-0.008
 edge-0.008
Charles-0.008
TokenWow
Token ID22017
Feature activation-8.88e-03

Pos logit contributions
bi+0.004
 edge+0.004
ling+0.004
arted+0.003
 boulder+0.003
Neg logit contributions
 cr-0.004
 sew-0.004
?-0.004
 forgive-0.003
 whose-0.003
Token,
Token ID11
Feature activation+8.66e-03

Pos logit contributions
bi+0.019
 edge+0.018
Larry+0.018
 distance+0.018
ling+0.018
Neg logit contributions
 cr-0.007
?-0.007
 sew-0.006
 forgiveness-0.006
 report-0.006
Token these
Token ID777
Feature activation-2.72e-03

Pos logit contributions
 cr+0.007
?+0.006
 sew+0.006
 forgiveness+0.006
 report+0.005
Neg logit contributions
bi-0.019
 edge-0.018
Larry-0.017
Charles-0.017
ling-0.017
Token apples
Token ID22514
Feature activation-7.73e-03

Pos logit contributions
eled+0.006
bi+0.006
Phil+0.006
 distance+0.006
Larry+0.006
Neg logit contributions
 cr-0.002
?-0.002
?"-0.002
 sew-0.002
 don-0.002
Token are
Token ID389
Feature activation+4.76e-03

Pos logit contributions
bi+0.018
Larry+0.017
 edge+0.017
eled+0.017
Phil+0.017
Neg logit contributions
 cr-0.005
?-0.005
 sew-0.004
?"-0.004
 don-0.004
Token heavy
Token ID4334
Feature activation+1.83e-02

Pos logit contributions
 cr+0.006
?+0.005
?"+0.005
 sew+0.005
/+0.005
Neg logit contributions
bi-0.009
eled-0.009
 edge-0.009
arted-0.009
umbling-0.008
Token have
Token ID423
Feature activation+5.50e-03

Pos logit contributions
bi+0.021
 edge+0.020
Larry+0.019
ling+0.019
 distance+0.019
Neg logit contributions
 cr-0.011
 sew-0.010
?-0.010
 forgiveness-0.010
 report-0.010
Token a
Token ID257
Feature activation-3.06e-03

Pos logit contributions
 cr+0.006
 sew+0.005
?+0.005
 forgiveness+0.005
 report+0.005
Neg logit contributions
bi-0.010
 edge-0.010
Larry-0.009
ling-0.009
 valleys-0.009
Token game
Token ID983
Feature activation-6.57e-03

Pos logit contributions
bi+0.005
Larry+0.005
arted+0.005
ling+0.005
 edge+0.005
Neg logit contributions
 cr-0.004
?-0.004
 forgiveness-0.004
 sew-0.004
 report-0.003
Token and
Token ID290
Feature activation-5.27e-03

Pos logit contributions
 across+0.011
 edge+0.011
bi+0.011
 bottom+0.010
 onward+0.010
Neg logit contributions
 sew-0.007
 forgiveness-0.007
 cr-0.007
mm-0.007
 Acc-0.006
Token play
Token ID711
Feature activation-2.12e-03

Pos logit contributions
bi+0.010
 edge+0.009
Larry+0.009
ling+0.009
Charles+0.009
Neg logit contributions
 cr-0.006
?-0.005
 sew-0.005
 forgiveness-0.005
 report-0.005
Token.
Token ID13
Feature activation-5.45e-03

Pos logit contributions
 edge+0.004
 across+0.003
ling+0.003
 bottom+0.003
 middle+0.003
Neg logit contributions
endant-0.003
grand-0.003
 cr-0.003
 sew-0.003
 forgiveness-0.002
Token
Token ID198
Feature activation-5.27e-03

Pos logit contributions
bi+0.008
 edge+0.008
ling+0.008
Larry+0.008
 valleys+0.007
Neg logit contributions
 cr-0.008
?-0.008
 sew-0.007
 forgiveness-0.007
 report-0.007
Token
Token ID198
Feature activation-3.13e-04

Pos logit contributions
bi+0.009
 edge+0.008
Larry+0.008
ling+0.008
Charles+0.008
Neg logit contributions
 cr-0.007
?-0.007
 sew-0.007
 forgiveness-0.006
 report-0.006
TokenThe
Token ID464
Feature activation-1.43e-02

Pos logit contributions
bi+0.000
arted+0.000
eled+0.000
 edge+0.000
ging+0.000
Neg logit contributions
?-0.000
 cr-0.000
 sew-0.000
 whose-0.000
 forgive-0.000
Token dad
Token ID9955
Feature activation+1.27e-03

Pos logit contributions
bi+0.023
 edge+0.021
Larry+0.020
ling+0.020
Charles+0.020
Neg logit contributions
 cr-0.020
?-0.019
 sew-0.019
 forgiveness-0.018
 report-0.018
Token heard
Token ID2982
Feature activation+5.61e-03

Pos logit contributions
 cr+0.001
?+0.001
 sew+0.001
 forgiveness+0.001
 report+0.001
Neg logit contributions
bi-0.002
 edge-0.002
Larry-0.002
ling-0.002
Charles-0.002
Token the
Token ID262
Feature activation-1.05e-02

Pos logit contributions
 cr+0.012
?+0.010
 L+0.010
 sew+0.010
 heaven+0.009
Neg logit contributions
bi-0.028
 edge-0.027
ling-0.026
acking-0.026
eled-0.026
Token rip
Token ID18344
Feature activation-8.19e-03

Pos logit contributions
acking+0.017
bi+0.017
ging+0.016
arted+0.016
eled+0.016
Neg logit contributions
 cr-0.016
?-0.014
 forgive-0.013
 forgiveness-0.013
 sew-0.013
Tokenest
Token ID395
Feature activation-1.33e-02

Pos logit contributions
bi+0.016
Phil+0.016
Charles+0.016
Frank+0.016
Larry+0.016
Neg logit contributions
 cr-0.009
?-0.008
 sew-0.007
 L-0.007
 if-0.007
Token,
Token ID11
Feature activation-2.81e-03

Pos logit contributions
bi+0.024
Charles+0.023
Larry+0.023
Phil+0.023
Frank+0.022
Neg logit contributions
 cr-0.018
?-0.015
 sew-0.015
 forgiveness-0.014
 L-0.014
Token ju
Token ID7544
Feature activation-2.80e-02

Pos logit contributions
bi+0.005
 edge+0.005
Larry+0.005
ling+0.004
Charles+0.004
Neg logit contributions
 cr-0.003
?-0.003
 sew-0.003
 forgiveness-0.003
 report-0.003
Tokenic
Token ID291
Feature activation-4.37e-02

Pos logit contributions
bi+0.042
 edge+0.040
Larry+0.038
Charles+0.037
 distance+0.037
Neg logit contributions
 cr-0.040
?-0.039
 sew-0.038
 forgiveness-0.037
/-0.035
Tokeniest
Token ID6386
Feature activation-1.13e-02

Pos logit contributions
bi+0.082
 edge+0.080
 distance+0.076
Charles+0.076
 valleys+0.075
Neg logit contributions
 cr-0.051
?-0.048
 sew-0.043
 forgiveness-0.043
 later-0.041
Token pear
Token ID25286
Feature activation-2.86e-03

Pos logit contributions
Charles+0.020
bi+0.020
Larry+0.020
Phil+0.020
Frank+0.019
Neg logit contributions
 cr-0.015
?-0.013
 L-0.012
 sew-0.012
 forgiveness-0.012
Token in
Token ID287
Feature activation-1.04e-02

Pos logit contributions
bi+0.006
 edge+0.005
Charles+0.005
Larry+0.005
eled+0.005
Neg logit contributions
 cr-0.003
?-0.003
 sew-0.002
 forgiveness-0.002
 L-0.002
Token the
Token ID262
Feature activation+7.55e-03

Pos logit contributions
bi+0.021
 edge+0.019
ling+0.019
Larry+0.019
acking+0.018
Neg logit contributions
 cr-0.011
?-0.010
 sew-0.010
 forgiveness-0.009
 heaven-0.009
Token garden
Token ID11376
Feature activation+8.46e-03

Pos logit contributions
 cr+0.012
?+0.011
 sew+0.010
 heaven+0.010
 glass+0.010
Neg logit contributions
ging-0.012
Charles-0.012
Frank-0.012
arted-0.012
Phil-0.012
Token the
Token ID262
Feature activation-1.21e-02

Pos logit contributions
bi+0.005
 edge+0.005
 distance+0.004
 bottom+0.004
 valleys+0.004
Neg logit contributions
 sew-0.004
 cr-0.004
?-0.003
 forgiveness-0.003
 report-0.003
Token robot
Token ID9379
Feature activation-8.02e-03

Pos logit contributions
bi+0.021
 edge+0.021
ling+0.020
Larry+0.020
 distance+0.019
Neg logit contributions
 cr-0.013
 sew-0.013
?-0.013
 forgiveness-0.012
 report-0.012
Token and
Token ID290
Feature activation-3.52e-03

Pos logit contributions
bi+0.015
 edge+0.015
ling+0.014
Larry+0.014
 valleys+0.013
Neg logit contributions
 cr-0.008
?-0.008
 sew-0.008
 forgiveness-0.008
 report-0.007
Token found
Token ID1043
Feature activation-1.10e-02

Pos logit contributions
bi+0.006
 edge+0.006
Larry+0.006
ling+0.006
eled+0.006
Neg logit contributions
?-0.004
 heaven-0.003
 report-0.003
 sew-0.003
 forgiveness-0.003
Token a
Token ID257
Feature activation+2.10e-03

Pos logit contributions
bi+0.021
 edge+0.020
Larry+0.019
ling+0.019
Charles+0.018
Neg logit contributions
 cr-0.012
?-0.011
 sew-0.011
 forgiveness-0.011
 report-0.010
Token loose
Token ID9155
Feature activation-4.29e-02

Pos logit contributions
 cr+0.003
 sew+0.003
?+0.003
 forgiveness+0.002
 forgive+0.002
Neg logit contributions
bi-0.004
ling-0.003
Larry-0.003
 edge-0.003
arted-0.003
Token screw
Token ID9580
Feature activation-2.29e-02

Pos logit contributions
 edge+0.068
bi+0.064
 bottom+0.063
 across+0.062
ling+0.062
Neg logit contributions
 forgiveness-0.053
 cr-0.052
 sew-0.051
?-0.051
 report-0.050
Token.
Token ID13
Feature activation-1.25e-03

Pos logit contributions
 edge+0.039
bi+0.037
 across+0.036
ling+0.036
 bottom+0.036
Neg logit contributions
 forgiveness-0.026
 cr-0.025
 sew-0.025
?-0.024
 heaven-0.024
Token He
Token ID679
Feature activation-2.90e-03

Pos logit contributions
bi+0.002
 edge+0.002
Larry+0.002
acking+0.002
Frank+0.002
Neg logit contributions
 cr-0.002
 forgiveness-0.001
 sew-0.001
?-0.001
 heaven-0.001
Token used
Token ID973
Feature activation-7.26e-03

Pos logit contributions
bi+0.005
 edge+0.004
eled+0.004
 scrambled+0.004
 sudden+0.004
Neg logit contributions
 heaven-0.004
?-0.004
 forgiveness-0.004
 report-0.003
 parties-0.003
Token a
Token ID257
Feature activation+1.05e-02

Pos logit contributions
bi+0.013
 edge+0.012
Larry+0.011
ling+0.011
 valleys+0.011
Neg logit contributions
 cr-0.008
?-0.008
 sew-0.008
 forgiveness-0.008
 report-0.008
Token Anna
Token ID11735
Feature activation-3.75e-03

Pos logit contributions
bi+0.006
 edge+0.005
ling+0.005
eled+0.005
Larry+0.005
Neg logit contributions
 cr-0.002
?-0.002
 sew-0.002
 forgiveness-0.002
 report-0.002
Token.
Token ID13
Feature activation-1.13e-02

Pos logit contributions
bi+0.008
 edge+0.008
Larry+0.008
ling+0.008
Charles+0.007
Neg logit contributions
 cr-0.003
?-0.003
 sew-0.003
 forgiveness-0.002
 report-0.002
Token You
Token ID921
Feature activation-5.70e-03

Pos logit contributions
bi+0.022
 edge+0.021
ling+0.020
Larry+0.020
eled+0.020
Neg logit contributions
 cr-0.012
?-0.011
 sew-0.011
 forgiveness-0.011
 L-0.010
Token are
Token ID389
Feature activation-8.24e-03

Pos logit contributions
bi+0.009
 edge+0.009
Larry+0.008
 across+0.008
 summit+0.008
Neg logit contributions
 forgiveness-0.007
 cr-0.006
?-0.006
 report-0.006
 sew-0.006
Token a
Token ID257
Feature activation-2.47e-03

Pos logit contributions
bi+0.014
 edge+0.013
Larry+0.013
ling+0.013
Charles+0.013
Neg logit contributions
 cr-0.010
 sew-0.009
?-0.009
 forgiveness-0.009
 report-0.009
Token mean
Token ID1612
Feature activation-4.48e-02

Pos logit contributions
bi+0.004
 edge+0.004
Larry+0.003
ling+0.003
 valleys+0.003
Neg logit contributions
 cr-0.003
?-0.003
 sew-0.003
 forgiveness-0.003
 report-0.003
Token mom
Token ID1995
Feature activation+5.52e-03

Pos logit contributions
bi+0.072
 edge+0.067
ling+0.064
Larry+0.062
mo+0.061
Neg logit contributions
 cr-0.058
 sew-0.057
 forgiveness-0.054
?-0.052
 report-0.052
Token.
Token ID13
Feature activation-1.47e-02

Pos logit contributions
 cr+0.006
 sew+0.006
 forgiveness+0.006
 report+0.005
 Sp+0.005
Neg logit contributions
bi-0.010
ling-0.009
 edge-0.009
acking-0.009
mo-0.009
Token You
Token ID921
Feature activation-8.19e-03

Pos logit contributions
bi+0.025
 edge+0.023
Larry+0.022
Charles+0.022
Frank+0.022
Neg logit contributions
 cr-0.018
 sew-0.017
 forgiveness-0.017
?-0.017
 report-0.016
Token don
Token ID836
Feature activation+1.77e-04

Pos logit contributions
bi+0.011
 edge+0.010
ny+0.010
 across+0.010
ling+0.010
Neg logit contributions
 forgiveness-0.011
 cr-0.010
mm-0.010
arium-0.010
?-0.010
Token't
Token ID470
Feature activation-1.44e-02

Pos logit contributions
 cr+0.000
 forgiveness+0.000
 sew+0.000
 heaven+0.000
?+0.000
Neg logit contributions
 edge-0.000
bi-0.000
acking-0.000
 bottom-0.000
ling-0.000
Token wanted
Token ID2227
Feature activation-1.32e-03

Pos logit contributions
bi+0.005
 edge+0.005
Larry+0.005
ling+0.004
 distance+0.004
Neg logit contributions
 cr-0.004
?-0.004
 sew-0.004
 forgiveness-0.004
 report-0.004
Token to
Token ID284
Feature activation+1.21e-03

Pos logit contributions
bi+0.003
 edge+0.003
ling+0.002
Larry+0.002
acking+0.002
Neg logit contributions
 cr-0.001
?-0.001
 forgiveness-0.001
 sew-0.001
 report-0.001
Token win
Token ID1592
Feature activation+2.60e-03

Pos logit contributions
 cr+0.001
/+0.001
 forgiveness+0.001
?+0.001
 whose+0.001
Neg logit contributions
 edge-0.002
bi-0.002
Larry-0.002
 bottom-0.002
 distance-0.002
Token a
Token ID257
Feature activation-3.67e-03

Pos logit contributions
 cr+0.003
 sew+0.002
?+0.002
 forgiveness+0.002
 report+0.002
Neg logit contributions
bi-0.005
 edge-0.005
Larry-0.005
ling-0.005
 valleys-0.005
Token l
Token ID300
Feature activation-1.75e-02

Pos logit contributions
bi+0.006
 edge+0.006
Larry+0.006
ling+0.006
 distance+0.006
Neg logit contributions
 cr-0.004
 sew-0.004
?-0.004
 forgiveness-0.004
 heaven-0.004
Tokenoll
Token ID692
Feature activation-6.53e-02

Pos logit contributions
bi+0.024
 edge+0.023
 onward+0.023
pering+0.022
acking+0.022
Neg logit contributions
 cr-0.025
 forgiveness-0.025
 sew-0.024
 Acc-0.022
 report-0.022
Tokenipop
Token ID42800
Feature activation-1.64e-02

Pos logit contributions
bi+0.098
ling+0.090
mo+0.084
ging+0.082
 edge+0.080
Neg logit contributions
 sew-0.093
 forgiveness-0.090
 heaven-0.090
 cr-0.087
 forgive-0.087
Token too
Token ID1165
Feature activation-6.07e-03

Pos logit contributions
 edge+0.029
bi+0.028
 across+0.028
ling+0.027
 bottom+0.027
Neg logit contributions
 forgiveness-0.016
 sew-0.016
 cr-0.016
 Acc-0.015
 forgive-0.014
Token.
Token ID13
Feature activation-3.55e-03

Pos logit contributions
bi+0.011
 edge+0.010
 across+0.010
ling+0.010
Frank+0.010
Neg logit contributions
 forgiveness-0.007
 sew-0.006
 cr-0.006
 forgive-0.005
 report-0.005
Token But
Token ID887
Feature activation-6.00e-03

Pos logit contributions
bi+0.004
 edge+0.004
Larry+0.004
ling+0.004
Charles+0.004
Neg logit contributions
 cr-0.006
?-0.006
 sew-0.006
 forgiveness-0.006
 report-0.006
Token he
Token ID339
Feature activation-1.01e-02

Pos logit contributions
bi+0.013
 edge+0.012
Larry+0.012
ling+0.012
 valleys+0.011
Neg logit contributions
 cr-0.005
?-0.005
 sew-0.005
 forgiveness-0.005
 report-0.004
Token a
Token ID257
Feature activation+3.53e-03

Pos logit contributions
bi+0.015
acking+0.015
ling+0.015
umbling+0.014
 edge+0.014
Neg logit contributions
 cr-0.004
 L-0.003
 whose-0.003
 grandma-0.003
 sew-0.003
Token funny
Token ID8258
Feature activation-2.35e-02

Pos logit contributions
 cr+0.004
?+0.004
 sew+0.004
 forgiveness+0.004
 report+0.004
Neg logit contributions
bi-0.006
 edge-0.006
Larry-0.006
ling-0.006
 valleys-0.006
Token video
Token ID2008
Feature activation-8.88e-03

Pos logit contributions
bi+0.045
Larry+0.042
 edge+0.042
 valleys+0.042
arted+0.041
Neg logit contributions
 cr-0.027
?-0.024
 sew-0.023
 forgiveness-0.022
 report-0.022
Token about
Token ID546
Feature activation-6.15e-03

Pos logit contributions
bi+0.019
 edge+0.018
Larry+0.017
ling+0.017
Charles+0.017
Neg logit contributions
 cr-0.007
?-0.007
 sew-0.007
 forgiveness-0.006
 report-0.006
Token a
Token ID257
Feature activation+1.37e-02

Pos logit contributions
bi+0.013
 edge+0.011
acking+0.011
ling+0.011
umbling+0.011
Neg logit contributions
 cr-0.007
?-0.006
 forgiveness-0.006
 L-0.006
 whose-0.006
Token talking
Token ID3375
Feature activation-4.41e-02

Pos logit contributions
 cr+0.017
?+0.015
 sew+0.015
 report+0.014
 grandma+0.014
Neg logit contributions
bi-0.026
eled-0.024
arted-0.023
umbling-0.023
Larry-0.023
Token cat
Token ID3797
Feature activation+2.34e-03

Pos logit contributions
bi+0.085
 edge+0.078
Larry+0.078
 valleys+0.076
eled+0.076
Neg logit contributions
 cr-0.049
?-0.045
 sew-0.043
 forgiveness-0.042
 report-0.040
Token.
Token ID13
Feature activation+5.55e-03

Pos logit contributions
 cr+0.002
?+0.001
 sew+0.001
 heaven+0.001
 forgiveness+0.001
Neg logit contributions
bi-0.006
 edge-0.005
Larry-0.005
eled-0.005
 valleys-0.005
Token They
Token ID1119
Feature activation+3.89e-03

Pos logit contributions
 cr+0.007
?+0.007
 L+0.007
 sew+0.006
 Sp+0.006
Neg logit contributions
bi-0.009
ling-0.009
 edge-0.009
ging-0.009
Charles-0.009
Token laughed
Token ID13818
Feature activation-1.06e-02

Pos logit contributions
 cr+0.005
 sew+0.004
?+0.004
 forgiveness+0.004
 forgive+0.004
Neg logit contributions
bi-0.007
 edge-0.007
Larry-0.006
ling-0.006
Charles-0.006
Token and
Token ID290
Feature activation+1.18e-02

Pos logit contributions
bi+0.020
 edge+0.019
ling+0.018
Larry+0.018
 valleys+0.018
Neg logit contributions
 cr-0.011
 sew-0.011
?-0.010
 forgiveness-0.010
 report-0.010