TOP ACTIVATIONS
MAX = 0.045

and Daddy were not happy when they drew on the wall
smiled and hugged Suz ie when she saw the painting .
present . Mom was sad when she saw what Jill did
how happy the children were when they got a toy .
wall . Lily is glad when they say they like her
. People were so impressed when they saw her creation and
She told the children that when it was their turn they
and bread . She smiled when she saw the paper animals
about to give - up when a little rabbit stepped forward
and dad were so glad when they saw the print and
and Dad are not happy when they see the mess .
the kingdom was very happy when the jail disappeared !
not listen to the teacher when she gave a lecture .
what her mom had said when she was scared -
toys . She was happy when she drew .
he could . Everyone cheered when he managed to cross the
kissed her . She remembers when the little girl cl apped
But the children were scared when they saw the large cop
the tree . Everyone cheered when they finished , and it
She says , " B oys , boys . You have

MIN ACTIVATIONS
MIN = -0.043

ed down and snatched Tim my up in his be ak
voice . But then , mirac ulously , it returned and
knew how to prevent Tim my from forgetting about his toy
. Let 's put the top back on the box and
hill and put her or nam ents all around . She
scary man who stole Tim my 's shirt . They asked
the rest of the or nam ents onto the tree .
help her mom put or nam ents on the tree .
she could carry the or nam ents back with her .
The big boy stole Tim my 's toy truck !
They packed a picnic basket full of food and went to
tail and l icked Tim my 's face . Tim my
ictionary ' written on the front . She was so curious
mom where a trip to an orange place could be found
here and explore the or nam ents ." Da
. From that day on , the giant and
mom packed a picnic basket full of y ummy food .
her to go and mail an important letter . So Sally
they brought a picnic basket full of y ummy snacks .
so that no more mis chie vous mice could come in

INTERVAL 0.023 - 0.045
CONTAINS 0.201%

for now , you can enjoy their gentle light and wonder
playing in the bathroom . They saw a shiny thing on
. His mom saw this and wanted to help . She
to work , building and patch ing , fixing the well
. They did not know if it was a good idea

INTERVAL 0.001 - 0.023
CONTAINS 45.548%

to his neighbor , Mrs . Smith . Jack realized that
catch the wind . Together , they were able to catch
" What are they M ummy ?" asked Joe
she went to the park with her mom and saw a
Bo was so happy and danced around with his tw ig

INTERVAL -0.021 - 0.001
CONTAINS 53.691%

sat on the slide . But he was scared . He
broke . And the pins hurt me ." She sniff ed
, sir . I won 't do it again . Please
" I 'm sorry Tom , but I need to sell
. The monkey was about to win the race

INTERVAL -0.043 - -0.021
CONTAINS 0.561%

time there was a naughty boy . His name was Ben
decided that was the best plan - to try again and
in the garden and took care of the flowers . <|endoftext|>
nodded . They hugged their mom and thanked her . They
taken the shiny magnet . It is not mine . It

PER-TOKEN DETAILS (TOP ACTIVATIONS)

Token and
Token ID290
Feature activation+9.98e-03

Pos logit contributions
 friends+0.005
 there+0.005
 here+0.005
 thinking+0.005
 things+0.004
Neg logit contributions
ifying-0.007
attered-0.006
tr-0.005
rant-0.005
amy-0.005
Token Daddy
Token ID26926
Feature activation+6.29e-04

Pos logit contributions
 friends+0.014
 here+0.013
 thinking+0.013
 there+0.012
 things+0.012
Neg logit contributions
ifying-0.021
attered-0.018
amy-0.018
ksh-0.017
rant-0.017
Token were
Token ID547
Feature activation+8.76e-03

Pos logit contributions
 friends+0.001
 there+0.001
 here+0.001
 thinking+0.001
 things+0.001
Neg logit contributions
ifying-0.001
attered-0.001
tr-0.001
rant-0.001
amy-0.001
Token not
Token ID407
Feature activation+1.16e-02

Pos logit contributions
 friends+0.012
 there+0.012
 thinking+0.012
 here+0.012
 things+0.011
Neg logit contributions
ifying-0.018
attered-0.015
amy-0.014
tr-0.014
rant-0.014
Token happy
Token ID3772
Feature activation-1.23e-02

Pos logit contributions
 friends+0.010
 here+0.010
 there+0.009
 it+0.009
 things+0.009
Neg logit contributions
ifying-0.029
attered-0.025
amy-0.023
rant-0.023
work-0.023
Token when
Token ID618
Feature activation+4.53e-02

Pos logit contributions
ifying+0.026
attered+0.022
lier+0.021
work+0.020
tr+0.020
Neg logit contributions
 friends-0.016
 here-0.015
 there-0.015
 thinking-0.015
 upon-0.014
Token they
Token ID484
Feature activation-1.16e-02

Pos logit contributions
 thinking+0.052
 here+0.051
 friends+0.048
 upon+0.046
Once+0.044
Neg logit contributions
ifying-0.088
attered-0.073
amy-0.072
work-0.069
lier-0.068
Token drew
Token ID9859
Feature activation+4.49e-03

Pos logit contributions
ifying+0.026
attered+0.022
ksh+0.021
amy+0.021
tr+0.020
Neg logit contributions
 friends-0.017
 there-0.017
 thinking-0.016
 it-0.016
 here-0.015
Token on
Token ID319
Feature activation+7.48e-03

Pos logit contributions
 friends+0.005
 thinking+0.005
 there+0.005
 here+0.005
 it+0.004
Neg logit contributions
ifying-0.011
attered-0.010
amy-0.009
rant-0.009
tr-0.009
Token the
Token ID262
Feature activation-7.29e-03

Pos logit contributions
 friends+0.007
 there+0.007
 thinking+0.006
 here+0.006
 things+0.006
Neg logit contributions
ifying-0.021
attered-0.018
amy-0.017
rant-0.017
tr-0.017
Token wall
Token ID3355
Feature activation+5.33e-04

Pos logit contributions
ifying+0.016
attered+0.013
amy+0.013
lier+0.012
rant+0.012
Neg logit contributions
 friends-0.012
 things-0.011
 thinking-0.011
 next-0.011
 kids-0.011
Token smiled
Token ID13541
Feature activation+3.07e-03

Pos logit contributions
 friends+0.005
 thinking+0.005
 there+0.005
 here+0.005
 things+0.005
Neg logit contributions
ifying-0.007
attered-0.006
amy-0.005
tr-0.005
rant-0.005
Token and
Token ID290
Feature activation+3.70e-03

Pos logit contributions
 friends+0.002
 there+0.002
 thinking+0.002
 here+0.002
 kids+0.002
Neg logit contributions
ifying-0.009
attered-0.008
amy-0.008
rant-0.008
tr-0.008
Token hugged
Token ID41130
Feature activation+1.25e-02

Pos logit contributions
 friends+0.005
 there+0.005
 thinking+0.005
 it+0.005
 here+0.004
Neg logit contributions
ifying-0.009
attered-0.008
rant-0.007
tr-0.007
amy-0.007
Token Suz
Token ID22691
Feature activation-1.23e-02

Pos logit contributions
 friends+0.014
 there+0.012
 thinking+0.012
 here+0.012
 kids+0.011
Neg logit contributions
ifying-0.032
attered-0.028
amy-0.027
rant-0.027
tr-0.026
Tokenie
Token ID494
Feature activation-1.05e-02

Pos logit contributions
ifying+0.027
attered+0.022
tr+0.021
amy+0.021
rant+0.020
Neg logit contributions
 friends-0.020
 there-0.018
 thinking-0.018
 here-0.018
 things-0.017
Token when
Token ID618
Feature activation+4.42e-02

Pos logit contributions
ifying+0.025
attered+0.021
amy+0.020
rant+0.020
tr+0.020
Neg logit contributions
 friends-0.015
 there-0.014
 thinking-0.013
 here-0.013
 it-0.013
Token she
Token ID673
Feature activation-6.90e-03

Pos logit contributions
 here+0.037
 friends+0.036
 thinking+0.034
 cowboy+0.031
 kids+0.030
Neg logit contributions
ifying-0.104
attered-0.091
shaped-0.086
amy-0.085
piece-0.083
Token saw
Token ID2497
Feature activation+1.59e-03

Pos logit contributions
ifying+0.017
attered+0.014
ksh+0.013
rant+0.013
amy+0.013
Neg logit contributions
 friends-0.010
 there-0.009
 thinking-0.009
 it-0.009
 here-0.008
Token the
Token ID262
Feature activation+1.19e-02

Pos logit contributions
 friends+0.001
 there+0.001
 thinking+0.001
 here+0.001
 kids+0.001
Neg logit contributions
ifying-0.004
attered-0.004
amy-0.004
rant-0.004
shaped-0.004
Token painting
Token ID12036
Feature activation-9.78e-03

Pos logit contributions
 friends+0.018
 there+0.017
 here+0.017
 thinking+0.017
 it+0.016
Neg logit contributions
ifying-0.025
attered-0.020
tr-0.020
amy-0.019
shaped-0.019
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
ifying+0.027
attered+0.023
amy+0.023
rant+0.022
ksh+0.022
Neg logit contributions
 friends-0.010
 there-0.009
 thinking-0.008
 here-0.008
 it-0.008
Token present
Token ID1944
Feature activation-9.26e-03

Pos logit contributions
ifying+0.023
tr+0.018
attered+0.018
amy+0.017
 unison+0.016
Neg logit contributions
 there-0.016
 friends-0.016
 here-0.016
 thinking-0.015
 upon-0.015
Token.
Token ID13
Feature activation+9.45e-03

Pos logit contributions
ifying+0.024
attered+0.020
tr+0.019
rant+0.019
amy+0.019
Neg logit contributions
 friends-0.011
 there-0.010
 thinking-0.010
 here-0.010
 things-0.009
Token Mom
Token ID11254
Feature activation-7.31e-03

Pos logit contributions
 friends+0.015
 thinking+0.014
 there+0.013
 here+0.013
 things+0.013
Neg logit contributions
ifying-0.021
attered-0.017
amy-0.016
rant-0.015
piece-0.015
Token was
Token ID373
Feature activation-3.38e-03

Pos logit contributions
ifying+0.016
attered+0.014
tr+0.013
amy+0.013
rant+0.013
Neg logit contributions
 friends-0.011
 there-0.010
 thinking-0.010
 here-0.010
 things-0.009
Token sad
Token ID6507
Feature activation-6.86e-04

Pos logit contributions
ifying+0.008
attered+0.007
tr+0.006
amy+0.006
rant+0.006
Neg logit contributions
 friends-0.004
 there-0.004
 thinking-0.004
 things-0.004
 here-0.004
Token when
Token ID618
Feature activation+4.17e-02

Pos logit contributions
ifying+0.001
attered+0.001
tr+0.001
rant+0.001
lier+0.001
Neg logit contributions
 friends-0.001
 thinking-0.001
 here-0.001
 there-0.001
 upon-0.001
Token she
Token ID673
Feature activation-1.52e-02

Pos logit contributions
 friends+0.048
 here+0.048
ittens+0.046
 thinking+0.046
 cowboy+0.046
Neg logit contributions
ifying-0.078
amy-0.063
attered-0.062
piece-0.059
work-0.059
Token saw
Token ID2497
Feature activation+1.14e-03

Pos logit contributions
ifying+0.037
attered+0.032
ksh+0.030
amy+0.030
rant+0.029
Neg logit contributions
 friends-0.020
 there-0.020
 here-0.018
 it-0.018
 thinking-0.018
Token what
Token ID644
Feature activation+1.75e-02

Pos logit contributions
 friends+0.001
 thinking+0.001
 here+0.001
 there+0.001
 upon+0.001
Neg logit contributions
ifying-0.003
attered-0.003
amy-0.003
rant-0.003
shaped-0.003
Token Jill
Token ID22772
Feature activation-9.31e-03

Pos logit contributions
 friends+0.020
 thinking+0.018
 here+0.018
 there+0.017
 things+0.016
Neg logit contributions
ifying-0.045
attered-0.039
amy-0.037
tr-0.036
rant-0.036
Token did
Token ID750
Feature activation+1.38e-03

Pos logit contributions
ifying+0.019
attered+0.016
rant+0.015
amy+0.015
tr+0.015
Neg logit contributions
 friends-0.015
 thinking-0.014
 here-0.013
 there-0.013
 things-0.013
Token how
Token ID703
Feature activation+8.94e-03

Pos logit contributions
ifying+0.019
attered+0.016
amy+0.016
rant+0.015
tr+0.015
Neg logit contributions
 friends-0.007
 there-0.006
 thinking-0.006
 here-0.006
 it-0.005
Token happy
Token ID3772
Feature activation+1.24e-04

Pos logit contributions
 friends+0.009
 there+0.008
 thinking+0.007
 here+0.007
 it+0.007
Neg logit contributions
ifying-0.025
attered-0.022
amy-0.021
rant-0.021
tr-0.021
Token the
Token ID262
Feature activation+4.49e-04

Pos logit contributions
 thinking+0.000
 friends+0.000
 here+0.000
 upon+0.000
 there+0.000
Neg logit contributions
ifying-0.000
attered-0.000
amy-0.000
lier-0.000
tr-0.000
Token children
Token ID1751
Feature activation-7.92e-03

Pos logit contributions
 there+0.000
 here+0.000
 it+0.000
 friends+0.000
 upon+0.000
Neg logit contributions
ifying-0.001
attered-0.001
tr-0.001
amy-0.001
rant-0.001
Token were
Token ID547
Feature activation+1.14e-02

Pos logit contributions
ifying+0.017
attered+0.014
amy+0.013
rant+0.013
tr+0.013
Neg logit contributions
 friends-0.013
 there-0.012
 thinking-0.012
 here-0.012
 it-0.011
Token when
Token ID618
Feature activation+4.14e-02

Pos logit contributions
 friends+0.011
 upon+0.010
 thinking+0.010
 here+0.010
 there+0.009
Neg logit contributions
ifying-0.028
attered-0.024
lier-0.023
rant-0.023
amy-0.023
Token they
Token ID484
Feature activation-6.97e-03

Pos logit contributions
 here+0.043
 friends+0.042
 thinking+0.042
 upon+0.037
 there+0.036
Neg logit contributions
ifying-0.094
attered-0.076
shaped-0.073
amy-0.073
work-0.072
Token got
Token ID1392
Feature activation+1.18e-02

Pos logit contributions
ifying+0.017
attered+0.014
tr+0.014
amy+0.013
ksh+0.013
Neg logit contributions
 friends-0.010
 there-0.009
 thinking-0.009
 here-0.008
 it-0.008
Token a
Token ID257
Feature activation+9.27e-03

Pos logit contributions
 friends+0.013
 thinking+0.012
 there+0.012
 here+0.012
 things+0.011
Neg logit contributions
ifying-0.030
attered-0.026
amy-0.025
rant-0.024
shaped-0.024
Token toy
Token ID13373
Feature activation-6.79e-03

Pos logit contributions
 friends+0.010
 there+0.009
 thinking+0.009
 here+0.009
 it+0.008
Neg logit contributions
ifying-0.025
attered-0.021
amy-0.020
tr-0.020
rant-0.020
Token.
Token ID13
Feature activation+7.34e-03

Pos logit contributions
ifying+0.019
attered+0.016
amy+0.016
rant+0.015
tr+0.015
Neg logit contributions
 friends-0.007
 there-0.006
 thinking-0.006
 here-0.006
 things-0.005
Token wall
Token ID3355
Feature activation-5.68e-03

Pos logit contributions
ifying+0.039
attered+0.035
lier+0.032
amy+0.032
ksh+0.031
Neg logit contributions
 friends-0.022
 next-0.021
 mom-0.021
 things-0.021
 there-0.020
Token.
Token ID13
Feature activation+4.87e-03

Pos logit contributions
ifying+0.016
attered+0.013
amy+0.013
rant+0.013
ksh+0.013
Neg logit contributions
 friends-0.006
 there-0.005
 here-0.005
 thinking-0.005
 it-0.005
Token Lily
Token ID20037
Feature activation+1.54e-02

Pos logit contributions
 friends+0.004
 thinking+0.004
 here+0.004
 there+0.004
 things+0.003
Neg logit contributions
ifying-0.014
attered-0.012
tr-0.011
rant-0.011
amy-0.011
Token is
Token ID318
Feature activation+1.25e-02

Pos logit contributions
 friends+0.022
 there+0.020
 thinking+0.019
 it+0.019
 here+0.019
Neg logit contributions
ifying-0.038
attered-0.031
rant-0.029
amy-0.029
tr-0.028
Token glad
Token ID9675
Feature activation+2.10e-03

Pos logit contributions
 friends+0.017
 there+0.016
 thinking+0.016
 here+0.016
 things+0.016
Neg logit contributions
ifying-0.027
attered-0.022
amy-0.021
tr-0.021
shaped-0.021
Token when
Token ID618
Feature activation+3.99e-02

Pos logit contributions
 friends+0.002
 here+0.002
 thinking+0.002
 upon+0.002
 idea+0.002
Neg logit contributions
ifying-0.005
attered-0.004
tr-0.004
work-0.003
amy-0.003
Token they
Token ID484
Feature activation-7.82e-03

Pos logit contributions
 here+0.034
 friends+0.031
 upon+0.030
 idea+0.029
 sure+0.029
Neg logit contributions
ifying-0.087
attered-0.072
nam-0.072
shaped-0.071
amy-0.070
Token say
Token ID910
Feature activation+3.64e-03

Pos logit contributions
ifying+0.020
attered+0.018
ksh+0.016
tr+0.016
amy+0.016
Neg logit contributions
 friends-0.009
 there-0.009
 thinking-0.008
 here-0.008
 it-0.008
Token they
Token ID484
Feature activation-2.37e-03

Pos logit contributions
 friends+0.003
 there+0.003
 thinking+0.003
 here+0.003
 upon+0.003
Neg logit contributions
ifying-0.010
attered-0.008
tr-0.008
rant-0.008
amy-0.008
Token like
Token ID588
Feature activation-2.67e-03

Pos logit contributions
ifying+0.006
attered+0.005
amy+0.005
tr+0.005
rant+0.005
Neg logit contributions
 friends-0.003
 there-0.003
 here-0.003
 thinking-0.003
 it-0.003
Token her
Token ID607
Feature activation+1.09e-02

Pos logit contributions
ifying+0.006
attered+0.005
shaped+0.005
tr+0.005
rant+0.005
Neg logit contributions
 friends-0.003
 there-0.003
 here-0.003
 thinking-0.003
 upon-0.003
Token.
Token ID13
Feature activation+7.15e-03

Pos logit contributions
 friends+0.016
 there+0.014
 thinking+0.014
 here+0.014
 kids+0.012
Neg logit contributions
ifying-0.040
attered-0.034
rant-0.032
amy-0.032
tr-0.032
Token People
Token ID4380
Feature activation-3.96e-03

Pos logit contributions
 friends+0.009
 there+0.009
 thinking+0.008
 here+0.008
 things+0.008
Neg logit contributions
ifying-0.018
attered-0.015
tr-0.014
amy-0.014
rant-0.014
Token were
Token ID547
Feature activation+7.64e-03

Pos logit contributions
ifying+0.008
attered+0.007
rant+0.006
amy+0.006
tr+0.006
Neg logit contributions
 friends-0.006
 there-0.006
 thinking-0.006
 here-0.006
 things-0.006
Token so
Token ID523
Feature activation+8.63e-03

Pos logit contributions
 friends+0.010
 there+0.009
 here+0.009
 thinking+0.009
 things+0.009
Neg logit contributions
ifying-0.017
attered-0.015
amy-0.014
rant-0.013
shaped-0.013
Token impressed
Token ID12617
Feature activation-4.34e-03

Pos logit contributions
 friends+0.008
 there+0.008
 thinking+0.008
 here+0.007
 things+0.007
Neg logit contributions
ifying-0.023
attered-0.020
amy-0.019
shaped-0.019
rant-0.019
Token when
Token ID618
Feature activation+3.98e-02

Pos logit contributions
ifying+0.011
lier+0.009
attered+0.009
amy+0.009
work+0.009
Neg logit contributions
 friends-0.004
 thinking-0.004
 there-0.004
 mom-0.004
 here-0.004
Token they
Token ID484
Feature activation-1.36e-03

Pos logit contributions
 here+0.034
 friends+0.031
 thinking+0.031
 there+0.030
 upon+0.029
Neg logit contributions
ifying-0.098
amy-0.079
attered-0.078
 unison-0.078
shaped-0.077
Token saw
Token ID2497
Feature activation-5.65e-03

Pos logit contributions
ifying+0.003
attered+0.002
amy+0.002
rant+0.002
tr+0.002
Neg logit contributions
 friends-0.002
 there-0.002
 thinking-0.002
 here-0.002
 things-0.002
Token her
Token ID607
Feature activation+1.03e-02

Pos logit contributions
ifying+0.016
attered+0.014
amy+0.013
rant+0.013
tr+0.013
Neg logit contributions
 friends-0.005
 there-0.004
 thinking-0.004
 here-0.004
 kids-0.004
Token creation
Token ID6282
Feature activation-2.96e-04

Pos logit contributions
 friends+0.010
 here+0.010
 there+0.010
 thinking+0.009
 it+0.009
Neg logit contributions
ifying-0.026
attered-0.022
tr-0.021
amy-0.021
work-0.020
Token and
Token ID290
Feature activation+1.26e-02

Pos logit contributions
ifying+0.001
attered+0.001
amy+0.001
tr+0.001
rant+0.001
Neg logit contributions
 friends-0.000
 there-0.000
 thinking-0.000
 here-0.000
 things-0.000
Token She
Token ID1375
Feature activation+8.94e-03

Pos logit contributions
 friends+0.000
 here+0.000
 there+0.000
 thinking+0.000
 things+0.000
Neg logit contributions
ifying-0.000
attered-0.000
tr-0.000
rant-0.000
amy-0.000
Token told
Token ID1297
Feature activation+1.30e-03

Pos logit contributions
 friends+0.011
 there+0.011
 here+0.011
 thinking+0.010
 things+0.010
Neg logit contributions
ifying-0.021
attered-0.018
tr-0.017
amy-0.017
rant-0.017
Token the
Token ID262
Feature activation+9.59e-03

Pos logit contributions
 friends+0.002
 there+0.002
 here+0.002
 upon+0.001
 thinking+0.001
Neg logit contributions
ifying-0.003
rant-0.002
shaped-0.002
attered-0.002
amy-0.002
Token children
Token ID1751
Feature activation-2.97e-03

Pos logit contributions
 here+0.013
 there+0.012
 friends+0.012
 it+0.011
 upon+0.011
Neg logit contributions
ifying-0.019
tr-0.017
attered-0.015
amy-0.015
piece-0.015
Token that
Token ID326
Feature activation+8.99e-03

Pos logit contributions
ifying+0.008
attered+0.007
amy+0.006
tr+0.006
rant+0.006
Neg logit contributions
 friends-0.003
 thinking-0.003
 there-0.003
 here-0.003
 kids-0.002
Token when
Token ID618
Feature activation+3.95e-02

Pos logit contributions
 friends+0.008
 thinking+0.007
 here+0.007
 upon+0.007
 there+0.007
Neg logit contributions
ifying-0.023
attered-0.020
tr-0.020
amy-0.019
lier-0.019
Token it
Token ID340
Feature activation+1.35e-02

Pos logit contributions
 thinking+0.038
 here+0.038
 friends+0.037
 upon+0.034
"+0.032
Neg logit contributions
ifying-0.093
attered-0.074
work-0.071
amy-0.071
lier-0.070
Token was
Token ID373
Feature activation+3.10e-03

Pos logit contributions
 friends+0.021
 thinking+0.020
 here+0.019
 there+0.019
 kids+0.018
Neg logit contributions
ifying-0.028
attered-0.023
tr-0.023
amy-0.022
rant-0.021
Token their
Token ID511
Feature activation+1.24e-02

Pos logit contributions
 friends+0.003
 there+0.003
 thinking+0.003
 here+0.003
 things+0.003
Neg logit contributions
ifying-0.008
attered-0.007
rant-0.007
tr-0.007
amy-0.007
Token turn
Token ID1210
Feature activation+2.03e-03

Pos logit contributions
 there+0.014
 friends+0.013
 here+0.013
 thinking+0.013
 it+0.013
Neg logit contributions
ifying-0.030
attered-0.025
tr-0.025
work-0.023
shaped-0.023
Token they
Token ID484
Feature activation-6.38e-04

Pos logit contributions
 friends+0.002
 there+0.002
 thinking+0.002
 here+0.002
 it+0.002
Neg logit contributions
ifying-0.006
attered-0.005
amy-0.005
rant-0.005
tr-0.005
Token and
Token ID290
Feature activation+1.55e-03

Pos logit contributions
 friends+0.011
 there+0.010
 thinking+0.010
 here+0.010
 things+0.009
Neg logit contributions
ifying-0.025
attered-0.021
amy-0.020
rant-0.020
tr-0.020
Token bread
Token ID8509
Feature activation+1.39e-02

Pos logit contributions
 friends+0.002
 there+0.002
 thinking+0.001
 it+0.001
 here+0.001
Neg logit contributions
ifying-0.004
attered-0.004
amy-0.004
rant-0.003
tr-0.003
Token.
Token ID13
Feature activation+1.56e-02

Pos logit contributions
 friends+0.016
 thinking+0.015
 there+0.014
 here+0.014
 kids+0.013
Neg logit contributions
ifying-0.035
attered-0.029
rant-0.028
tr-0.028
amy-0.028
Token She
Token ID1375
Feature activation+9.72e-03

Pos logit contributions
 friends+0.023
 thinking+0.020
 here+0.020
 there+0.019
 things+0.019
Neg logit contributions
ifying-0.034
attered-0.029
amy-0.027
tr-0.026
piece-0.026
Token smiled
Token ID13541
Feature activation+8.74e-03

Pos logit contributions
 friends+0.014
 there+0.013
 thinking+0.013
 here+0.013
 things+0.012
Neg logit contributions
ifying-0.023
attered-0.019
amy-0.018
rant-0.018
tr-0.018
Token when
Token ID618
Feature activation+3.94e-02

Pos logit contributions
 friends+0.008
 thinking+0.008
 there+0.007
 here+0.007
 things+0.007
Neg logit contributions
ifying-0.023
attered-0.020
amy-0.018
lier-0.018
rant-0.018
Token she
Token ID673
Feature activation-2.22e-03

Pos logit contributions
 friends+0.040
 here+0.039
 thinking+0.038
ittens+0.034
 friend+0.034
Neg logit contributions
ifying-0.086
attered-0.071
amy-0.069
piece-0.067
shaped-0.066
Token saw
Token ID2497
Feature activation+1.29e-03

Pos logit contributions
ifying+0.006
attered+0.005
amy+0.004
rant+0.004
tr+0.004
Neg logit contributions
 friends-0.003
 there-0.003
 thinking-0.003
 here-0.003
 it-0.002
Token the
Token ID262
Feature activation+1.15e-02

Pos logit contributions
 friends+0.001
 thinking+0.001
 there+0.001
 here+0.001
 things+0.001
Neg logit contributions
ifying-0.004
attered-0.003
amy-0.003
rant-0.003
shaped-0.003
Token paper
Token ID3348
Feature activation+4.75e-03

Pos logit contributions
 friends+0.018
 there+0.018
 here+0.018
 thinking+0.017
"+0.017
Neg logit contributions
ifying-0.022
attered-0.017
tr-0.016
rant-0.016
amy-0.016
Token animals
Token ID4695
Feature activation-4.11e-03

Pos logit contributions
 friends+0.005
 there+0.004
 thinking+0.004
 here+0.004
 kids+0.004
Neg logit contributions
ifying-0.013
attered-0.011
rant-0.010
tr-0.010
amy-0.010
Token about
Token ID546
Feature activation+1.58e-02

Pos logit contributions
ifying+0.011
attered+0.009
amy+0.009
rant+0.009
tr+0.009
Neg logit contributions
 friends-0.006
 there-0.006
 thinking-0.006
 here-0.006
 things-0.005
Token to
Token ID284
Feature activation-3.46e-04

Pos logit contributions
 friends+0.019
 kids+0.017
 here+0.017
 there+0.016
 thinking+0.016
Neg logit contributions
ifying-0.035
shaped-0.027
attered-0.027
amy-0.026
work-0.026
Token give
Token ID1577
Feature activation+1.80e-02

Pos logit contributions
ifying+0.001
attered+0.001
amy+0.001
rant+0.001
tr+0.001
Neg logit contributions
 friends-0.000
 there-0.000
 thinking-0.000
 here-0.000
 things-0.000
Token-
Token ID12
Feature activation+5.34e-03

Pos logit contributions
 friends+0.020
 thinking+0.019
 there+0.019
 here+0.018
 kids+0.017
Neg logit contributions
ifying-0.046
attered-0.039
amy-0.037
rant-0.037
tr-0.036
Tokenup
Token ID929
Feature activation+5.68e-03

Pos logit contributions
 friends+0.010
 there+0.009
 it+0.009
 here+0.009
 things+0.009
Neg logit contributions
ifying-0.011
attered-0.008
tr-0.008
rant-0.008
ksh-0.008
Token when
Token ID618
Feature activation+3.94e-02

Pos logit contributions
 friends+0.005
 thinking+0.005
 there+0.005
 here+0.005
 kids+0.004
Neg logit contributions
ifying-0.016
attered-0.013
amy-0.013
rant-0.013
tr-0.012
Token a
Token ID257
Feature activation+1.45e-02

Pos logit contributions
 friends+0.029
 here+0.029
 thinking+0.028
 there+0.024
 kids+0.023
Neg logit contributions
ifying-0.105
attered-0.085
amy-0.085
shaped-0.084
lier-0.083
Token little
Token ID1310
Feature activation+9.93e-04

Pos logit contributions
 friends+0.016
 there+0.015
 here+0.015
 thinking+0.014
 upon+0.013
Neg logit contributions
ifying-0.037
attered-0.030
tr-0.029
amy-0.029
rant-0.029
Token rabbit
Token ID22746
Feature activation+1.92e-03

Pos logit contributions
 friends+0.001
 there+0.001
 thinking+0.001
 here+0.001
 it+0.001
Neg logit contributions
ifying-0.003
attered-0.002
amy-0.002
rant-0.002
tr-0.002
Token stepped
Token ID10764
Feature activation-2.06e-05

Pos logit contributions
 friends+0.003
 there+0.003
 thinking+0.003
 here+0.003
 things+0.003
Neg logit contributions
ifying-0.004
attered-0.003
amy-0.003
rant-0.003
tr-0.003
Token forward
Token ID2651
Feature activation+1.31e-03

Pos logit contributions
ifying+0.000
attered+0.000
tr+0.000
amy+0.000
rant+0.000
Neg logit contributions
 friends-0.000
 there-0.000
 here-0.000
 thinking-0.000
 it-0.000
Token and
Token ID290
Feature activation+8.02e-03

Pos logit contributions
 things+0.003
 here+0.003
 friends+0.003
 thinking+0.003
 there+0.003
Neg logit contributions
ifying-0.006
tr-0.005
bug-0.005
attered-0.005
amy-0.005
Token dad
Token ID9955
Feature activation+1.09e-03

Pos logit contributions
 here+0.007
't+0.007
 friends+0.007
 there+0.007
 thinking+0.007
Neg logit contributions
ifying-0.019
attered-0.017
nam-0.016
amy-0.015
work-0.015
Token were
Token ID547
Feature activation+3.44e-03

Pos logit contributions
 friends+0.002
 there+0.002
 thinking+0.002
 here+0.002
 things+0.002
Neg logit contributions
ifying-0.002
attered-0.002
amy-0.002
rant-0.002
tr-0.002
Token so
Token ID523
Feature activation+8.40e-03

Pos logit contributions
 things+0.005
 friends+0.005
 thinking+0.005
 upon+0.004
 there+0.004
Neg logit contributions
ifying-0.007
attered-0.005
amy-0.005
rant-0.005
shaped-0.005
Token glad
Token ID9675
Feature activation+7.14e-04

Pos logit contributions
 friends+0.008
 thinking+0.008
 there+0.007
 things+0.007
 here+0.007
Neg logit contributions
ifying-0.021
attered-0.019
amy-0.018
shaped-0.017
rant-0.017
Token when
Token ID618
Feature activation+3.93e-02

Pos logit contributions
 friends+0.001
 thinking+0.001
 here+0.001
 upon+0.001
 there+0.001
Neg logit contributions
ifying-0.002
attered-0.001
lier-0.001
amy-0.001
work-0.001
Token they
Token ID484
Feature activation-4.23e-03

Pos logit contributions
 here+0.046
 upon+0.044
 friends+0.041
"+0.041
 idea+0.040
Neg logit contributions
ifying-0.070
amy-0.058
work-0.055
attered-0.054
 unison-0.054
Token saw
Token ID2497
Feature activation-5.86e-03

Pos logit contributions
ifying+0.010
attered+0.009
amy+0.008
tr+0.008
rant+0.008
Neg logit contributions
 friends-0.005
 there-0.005
 thinking-0.005
 here-0.005
 it-0.005
Token the
Token ID262
Feature activation+1.13e-02

Pos logit contributions
ifying+0.016
attered+0.014
amy+0.013
rant+0.013
shaped+0.013
Neg logit contributions
 friends-0.005
 thinking-0.005
 there-0.005
 here-0.005
 upon-0.004
Token print
Token ID3601
Feature activation+1.38e-04

Pos logit contributions
 friends+0.019
 there+0.018
 here+0.018
 thinking+0.018
 it+0.017
Neg logit contributions
ifying-0.022
attered-0.018
tr-0.017
amy-0.017
rant-0.016
Token and
Token ID290
Feature activation+5.40e-03

Pos logit contributions
 friends+0.000
 there+0.000
 thinking+0.000
 here+0.000
 things+0.000
Neg logit contributions
ifying-0.000
attered-0.000
amy-0.000
rant-0.000
tr-0.000
Token and
Token ID290
Feature activation+4.26e-03

Pos logit contributions
 friends+0.002
 thinking+0.002
 there+0.002
 here+0.002
 things+0.002
Neg logit contributions
ifying-0.003
attered-0.003
rant-0.003
amy-0.003
tr-0.003
Token Dad
Token ID17415
Feature activation+2.41e-03

Pos logit contributions
 friends+0.005
 thinking+0.005
 there+0.005
 here+0.005
 things+0.004
Neg logit contributions
ifying-0.010
attered-0.009
amy-0.008
tr-0.008
rant-0.008
Token are
Token ID389
Feature activation+1.00e-02

Pos logit contributions
 friends+0.003
 there+0.003
 thinking+0.003
 here+0.003
 things+0.003
Neg logit contributions
ifying-0.005
attered-0.005
rant-0.004
amy-0.004
tr-0.004
Token not
Token ID407
Feature activation+1.26e-02

Pos logit contributions
 friends+0.012
 thinking+0.011
 it+0.011
 things+0.011
 there+0.011
Neg logit contributions
ifying-0.022
attered-0.019
amy-0.018
tr-0.017
rant-0.017
Token happy
Token ID3772
Feature activation-1.45e-02

Pos logit contributions
Next+0.010
 friends+0.009
 it+0.009
't+0.009
"+0.009
Neg logit contributions
ifying-0.029
attered-0.026
nam-0.025
lier-0.024
rant-0.024
Token when
Token ID618
Feature activation+3.91e-02

Pos logit contributions
ifying+0.030
attered+0.025
lier+0.024
tr+0.023
amy+0.023
Neg logit contributions
 friends-0.019
 thinking-0.018
 there-0.018
 here-0.018
 upon-0.018
Token they
Token ID484
Feature activation-5.98e-03

Pos logit contributions
 thinking+0.044
 friends+0.042
 here+0.040
 upon+0.040
 friend+0.036
Neg logit contributions
ifying-0.078
attered-0.065
amy-0.063
lier-0.060
work-0.060
Token see
Token ID766
Feature activation-2.89e-03

Pos logit contributions
ifying+0.016
attered+0.014
amy+0.013
ksh+0.013
tr+0.013
Neg logit contributions
 friends-0.007
 there-0.006
 thinking-0.006
 here-0.006
 it-0.006
Token the
Token ID262
Feature activation+1.19e-02

Pos logit contributions
ifying+0.008
attered+0.007
amy+0.006
rant+0.006
tr+0.006
Neg logit contributions
 friends-0.003
 there-0.003
 thinking-0.003
 here-0.003
 upon-0.002
Token mess
Token ID2085
Feature activation+6.58e-03

Pos logit contributions
 friends+0.015
 there+0.015
 thinking+0.015
 here+0.014
 it+0.014
Neg logit contributions
ifying-0.028
attered-0.023
tr-0.023
amy-0.021
rant-0.021
Token.
Token ID13
Feature activation+5.67e-03

Pos logit contributions
 friends+0.008
 there+0.007
 thinking+0.007
 here+0.007
 things+0.006
Neg logit contributions
ifying-0.017
attered-0.014
amy-0.014
rant-0.014
tr-0.013
Token the
Token ID262
Feature activation-8.96e-04

Pos logit contributions
ifying+0.014
attered+0.011
 unison+0.011
amy+0.011
rant+0.011
Neg logit contributions
 friends-0.005
 here-0.005
 thinking-0.005
 there-0.004
 things-0.004
Token kingdom
Token ID13239
Feature activation+5.18e-03

Pos logit contributions
ifying+0.002
attered+0.002
rant+0.002
amy+0.002
ksh+0.002
Neg logit contributions
 friends-0.001
 there-0.001
 thinking-0.001
 things-0.001
 here-0.001
Token was
Token ID373
Feature activation+7.84e-03

Pos logit contributions
 friends+0.008
 here+0.008
 there+0.008
 thinking+0.008
 things+0.007
Neg logit contributions
ifying-0.010
attered-0.009
amy-0.008
rant-0.008
tr-0.008
Token very
Token ID845
Feature activation+1.91e-03

Pos logit contributions
 friends+0.011
 there+0.010
 thinking+0.010
 here+0.010
 things+0.009
Neg logit contributions
ifying-0.018
attered-0.015
amy-0.014
rant-0.014
tr-0.014
Token happy
Token ID3772
Feature activation+1.95e-04

Pos logit contributions
 friends+0.001
 thinking+0.001
 there+0.001
 here+0.001
 things+0.001
Neg logit contributions
ifying-0.006
attered-0.005
amy-0.005
rant-0.005
tr-0.005
Token when
Token ID618
Feature activation+3.90e-02

Pos logit contributions
 friends+0.000
 thinking+0.000
 upon+0.000
 here+0.000
 there+0.000
Neg logit contributions
ifying-0.000
attered-0.000
work-0.000
lier-0.000
amy-0.000
Token the
Token ID262
Feature activation+6.42e-03

Pos logit contributions
 here+0.037
 friends+0.036
ittens+0.036
 thinking+0.034
"+0.033
Neg logit contributions
ifying-0.086
amy-0.071
attered-0.070
work-0.070
 unison-0.068
Token jail
Token ID7356
Feature activation+7.86e-03

Pos logit contributions
 friends+0.009
 there+0.009
 thinking+0.009
 here+0.009
 upon+0.009
Neg logit contributions
ifying-0.014
attered-0.011
tr-0.011
amy-0.011
rant-0.010
Token disappeared
Token ID12120
Feature activation+7.41e-03

Pos logit contributions
 friends+0.011
 there+0.010
 thinking+0.010
 here+0.010
 things+0.009
Neg logit contributions
ifying-0.019
attered-0.015
amy-0.015
rant-0.015
tr-0.014
Token!
Token ID0
Feature activation-1.65e-03

Pos logit contributions
 friends+0.008
 there+0.008
 here+0.007
 thinking+0.007
 things+0.007
Neg logit contributions
ifying-0.020
attered-0.017
amy-0.017
tr-0.016
rant-0.016
Token
Token ID198
Feature activation-6.05e-03

Pos logit contributions
ifying+0.005
attered+0.005
amy+0.005
rant+0.005
tr+0.005
Neg logit contributions
 friends-0.001
 there-0.001
 thinking-0.001
 here-0.001
 it-0.000
Token not
Token ID407
Feature activation-6.25e-03

Pos logit contributions
 friends+0.015
 thinking+0.013
 here+0.013
 there+0.012
 kids+0.012
Neg logit contributions
ifying-0.033
tr-0.029
attered-0.028
rant-0.027
shaped-0.027
Token listen
Token ID6004
Feature activation+4.70e-04

Pos logit contributions
ifying+0.015
attered+0.013
rant+0.012
tr+0.012
amy+0.012
Neg logit contributions
 friends-0.008
 there-0.007
 here-0.007
 thinking-0.007
 things-0.006
Token to
Token ID284
Feature activation-6.24e-03

Pos logit contributions
 friends+0.001
 thinking+0.001
 here+0.001
 there+0.001
 kids+0.000
Neg logit contributions
ifying-0.001
attered-0.001
rant-0.001
lier-0.001
tr-0.001
Token the
Token ID262
Feature activation+5.18e-03

Pos logit contributions
ifying+0.015
attered+0.012
rant+0.012
tr+0.012
amy+0.012
Neg logit contributions
 friends-0.008
 there-0.008
 here-0.007
 thinking-0.007
 upon-0.007
Token teacher
Token ID4701
Feature activation-1.68e-02

Pos logit contributions
 there+0.005
 here+0.005
 friends+0.005
 thinking+0.005
 it+0.005
Neg logit contributions
ifying-0.013
tr-0.011
attered-0.011
amy-0.011
rant-0.010
Token when
Token ID618
Feature activation+3.88e-02

Pos logit contributions
ifying+0.041
attered+0.034
tr+0.033
rant+0.033
amy+0.033
Neg logit contributions
 friends-0.021
 there-0.019
 thinking-0.019
 here-0.018
 things-0.017
Token she
Token ID673
Feature activation-1.21e-02

Pos logit contributions
 friends+0.036
 thinking+0.035
 here+0.035
 upon+0.030
 there+0.029
Neg logit contributions
ifying-0.094
attered-0.077
shaped-0.074
tr-0.073
work-0.073
Token gave
Token ID2921
Feature activation+9.63e-03

Pos logit contributions
ifying+0.030
attered+0.025
ksh+0.024
amy+0.024
rant+0.023
Neg logit contributions
 friends-0.017
 there-0.016
 thinking-0.015
 here-0.014
 kids-0.014
Token a
Token ID257
Feature activation+5.79e-03

Pos logit contributions
 friends+0.009
 thinking+0.009
 there+0.008
 here+0.008
 upon+0.007
Neg logit contributions
ifying-0.026
attered-0.022
amy-0.021
rant-0.021
tr-0.021
Token lecture
Token ID19143
Feature activation-2.01e-03

Pos logit contributions
 friends+0.007
 there+0.006
 thinking+0.006
 here+0.006
 it+0.005
Neg logit contributions
ifying-0.015
attered-0.013
amy-0.012
rant-0.012
tr-0.012
Token.
Token ID13
Feature activation-4.65e-03

Pos logit contributions
ifying+0.005
attered+0.004
tr+0.004
rant+0.004
amy+0.004
Neg logit contributions
 friends-0.002
 there-0.002
 thinking-0.002
 here-0.002
 things-0.002
Token what
Token ID644
Feature activation+1.36e-02

Pos logit contributions
 friends+0.002
 here+0.002
 kids+0.002
 things+0.002
 thinking+0.002
Neg logit contributions
ifying-0.005
amy-0.004
shaped-0.004
rant-0.004
attered-0.004
Token her
Token ID607
Feature activation+3.14e-03

Pos logit contributions
 friends+0.011
 here+0.011
 thinking+0.010
 there+0.010
 kids+0.009
Neg logit contributions
ifying-0.037
attered-0.032
tr-0.031
amy-0.030
shaped-0.030
Token mom
Token ID1995
Feature activation+1.93e-03

Pos logit contributions
 here+0.003
"+0.003
 there+0.003
 thinking+0.003
't+0.002
Neg logit contributions
ifying-0.008
attered-0.006
tr-0.006
work-0.006
 unison-0.006
Token had
Token ID550
Feature activation-6.01e-03

Pos logit contributions
 friends+0.003
 here+0.002
 things+0.002
 thinking+0.002
 there+0.002
Neg logit contributions
ifying-0.004
attered-0.003
rant-0.003
tr-0.003
bug-0.003
Token said
Token ID531
Feature activation+8.02e-03

Pos logit contributions
ifying+0.014
attered+0.013
rant+0.012
tr+0.012
amy+0.012
Neg logit contributions
 friends-0.006
 here-0.006
 there-0.005
 things-0.005
 kids-0.005
Token when
Token ID618
Feature activation+3.88e-02

Pos logit contributions
 friends+0.008
 here+0.006
 there+0.006
 thinking+0.006
 kids+0.006
Neg logit contributions
ifying-0.022
attered-0.018
rant-0.017
lier-0.017
tr-0.017
Token she
Token ID673
Feature activation+2.42e-03

Pos logit contributions
 friends+0.030
 here+0.027
 thinking+0.026
 kids+0.022
 upon+0.021
Neg logit contributions
ifying-0.101
attered-0.084
amy-0.083
shaped-0.082
lier-0.080
Token was
Token ID373
Feature activation+3.41e-03

Pos logit contributions
 friends+0.003
 there+0.003
 thinking+0.003
 here+0.003
 it+0.003
Neg logit contributions
ifying-0.006
attered-0.005
amy-0.005
rant-0.005
tr-0.005
Token scared
Token ID12008
Feature activation+3.34e-04

Pos logit contributions
 things+0.004
 friends+0.004
 it+0.003
 upon+0.003
"+0.003
Neg logit contributions
ifying-0.008
rant-0.006
amy-0.006
lier-0.006
attered-0.006
Token -
Token ID532
Feature activation+1.20e-02

Pos logit contributions
 friends+0.000
 thinking+0.000
 there+0.000
 here+0.000
 things+0.000
Neg logit contributions
ifying-0.001
attered-0.001
amy-0.001
rant-0.001
lier-0.001
Token
Token ID6184
Feature activation+4.50e-03

Pos logit contributions
 friends+0.010
 thinking+0.009
 kids+0.008
 here+0.008
 upon+0.008
Neg logit contributions
ifying-0.031
tr-0.025
attered-0.025
shaped-0.024
bug-0.024
Token toys
Token ID14958
Feature activation+9.79e-03

Pos logit contributions
 there+0.010
 here+0.009
 thinking+0.008
 it+0.008
 friends+0.008
Neg logit contributions
ifying-0.030
attered-0.026
tr-0.026
amy-0.025
work-0.025
Token.
Token ID13
Feature activation+6.38e-03

Pos logit contributions
 friends+0.011
 there+0.010
 thinking+0.010
 here+0.009
 it+0.009
Neg logit contributions
ifying-0.026
attered-0.022
amy-0.021
rant-0.021
tr-0.021
Token She
Token ID1375
Feature activation+1.15e-02

Pos logit contributions
 friends+0.007
 thinking+0.007
 there+0.007
 here+0.006
 things+0.006
Neg logit contributions
ifying-0.017
attered-0.014
tr-0.013
amy-0.013
rant-0.013
Token was
Token ID373
Feature activation+1.18e-02

Pos logit contributions
 friends+0.018
 there+0.017
 thinking+0.017
 here+0.016
 it+0.016
Neg logit contributions
ifying-0.025
attered-0.021
amy-0.019
rant-0.019
tr-0.019
Token happy
Token ID3772
Feature activation+1.36e-03

Pos logit contributions
 friends+0.016
 there+0.015
 thinking+0.015
 here+0.015
 things+0.015
Neg logit contributions
ifying-0.025
attered-0.021
rant-0.020
amy-0.020
tr-0.020
Token when
Token ID618
Feature activation+3.88e-02

Pos logit contributions
 friends+0.002
 thinking+0.001
 here+0.001
Next+0.001
 upon+0.001
Neg logit contributions
ifying-0.003
attered-0.002
work-0.002
tr-0.002
lier-0.002
Token she
Token ID673
Feature activation-4.46e-03

Pos logit contributions
 here+0.032
 thinking+0.031
 friends+0.030
 upon+0.027
 sure+0.027
Neg logit contributions
ifying-0.092
attered-0.077
amy-0.074
shaped-0.074
lier-0.072
Token drew
Token ID9859
Feature activation+1.05e-02

Pos logit contributions
ifying+0.010
attered+0.008
ksh+0.008
rant+0.008
amy+0.008
Neg logit contributions
 friends-0.007
 there-0.007
 thinking-0.006
 here-0.006
 it-0.006
Token.
Token ID13
Feature activation+4.49e-03

Pos logit contributions
 friends+0.011
 thinking+0.010
 here+0.010
 there+0.010
 upon+0.009
Neg logit contributions
ifying-0.027
attered-0.022
tr-0.022
rant-0.022
amy-0.021
Token
Token ID198
Feature activation+4.15e-03

Pos logit contributions
 friends+0.004
 thinking+0.003
 here+0.003
 there+0.003
 things+0.003
Neg logit contributions
ifying-0.013
attered-0.011
tr-0.011
amy-0.011
rant-0.011
Token
Token ID198
Feature activation+4.65e-03

Pos logit contributions
 friends+0.005
 thinking+0.004
 things+0.004
 kids+0.004
 here+0.003
Neg logit contributions
ifying-0.010
tr-0.009
shaped-0.009
rant-0.009
attered-0.009
Token he
Token ID339
Feature activation-4.28e-04

Pos logit contributions
 friends+0.014
 thinking+0.013
 here+0.012
 there+0.012
 kids+0.011
Neg logit contributions
ifying-0.051
attered-0.043
rant-0.042
amy-0.042
shaped-0.041
Token could
Token ID714
Feature activation+1.87e-02

Pos logit contributions
ifying+0.001
attered+0.001
amy+0.001
tr+0.001
rant+0.001
Neg logit contributions
 friends-0.001
 thinking-0.001
 here-0.001
 there-0.001
 things-0.001
Token.
Token ID13
Feature activation+7.85e-03

Pos logit contributions
 friends+0.023
 thinking+0.022
 here+0.021
 things+0.021
 there+0.021
Neg logit contributions
ifying-0.043
attered-0.036
lier-0.034
rant-0.033
tr-0.033
Token Everyone
Token ID11075
Feature activation-1.04e-02

Pos logit contributions
 friends+0.010
 thinking+0.009
 here+0.009
 there+0.009
 things+0.008
Neg logit contributions
ifying-0.019
attered-0.016
amy-0.015
tr-0.015
ksh-0.015
Token cheered
Token ID34767
Feature activation+1.26e-02

Pos logit contributions
ifying+0.023
attered+0.019
amy+0.018
rant+0.017
tr+0.017
Neg logit contributions
 friends-0.016
 there-0.016
 thinking-0.015
 here-0.015
 it-0.014
Token when
Token ID618
Feature activation+3.87e-02

Pos logit contributions
 friends+0.013
Next+0.011
 there+0.011
 thinking+0.011
ittens+0.010
Neg logit contributions
ifying-0.029
lier-0.025
attered-0.025
rant-0.025
work-0.025
Token he
Token ID339
Feature activation+7.60e-03

Pos logit contributions
 here+0.040
 friends+0.039
 thinking+0.037
 upon+0.034
 there+0.034
Neg logit contributions
ifying-0.092
amy-0.074
 unison-0.072
attered-0.072
lier-0.071
Token managed
Token ID5257
Feature activation+1.04e-02

Pos logit contributions
 friends+0.012
 thinking+0.011
 here+0.011
 there+0.011
 things+0.011
Neg logit contributions
ifying-0.015
attered-0.013
amy-0.012
rant-0.012
lier-0.012
Token to
Token ID284
Feature activation+6.83e-03

Pos logit contributions
 friends+0.012
 thinking+0.011
 here+0.011
 kids+0.011
 there+0.010
Neg logit contributions
ifying-0.024
attered-0.020
rant-0.020
lier-0.019
shaped-0.019
Token cross
Token ID3272
Feature activation+1.81e-02

Pos logit contributions
 friends+0.009
 thinking+0.009
 there+0.009
 here+0.008
 things+0.008
Neg logit contributions
ifying-0.016
attered-0.013
amy-0.012
rant-0.012
tr-0.012
Token the
Token ID262
Feature activation-1.95e-03

Pos logit contributions
 friends+0.016
 thinking+0.014
 there+0.014
 here+0.013
 kids+0.011
Neg logit contributions
ifying-0.051
attered-0.044
amy-0.042
rant-0.042
tr-0.042
Token kissed
Token ID27946
Feature activation+1.96e-03

Pos logit contributions
 friends+0.008
 there+0.007
 thinking+0.007
 here+0.007
 it+0.007
Neg logit contributions
ifying-0.019
attered-0.016
amy-0.016
rant-0.015
tr-0.015
Token her
Token ID607
Feature activation-4.12e-03

Pos logit contributions
 friends+0.001
 there+0.001
 here+0.001
 it+0.001
 thinking+0.001
Neg logit contributions
ifying-0.006
attered-0.005
amy-0.005
rant-0.005
tr-0.005
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
ifying+0.010
tr+0.008
attered+0.008
rant+0.008
amy+0.008
Neg logit contributions
 friends-0.005
 there-0.005
 thinking-0.005
 here-0.005
 it-0.004
Token She
Token ID1375
Feature activation+8.47e-03

Pos logit contributions
 friends+0.013
 there+0.012
 thinking+0.012
 here+0.012
 things+0.011
Neg logit contributions
ifying-0.027
attered-0.022
tr-0.022
amy-0.021
rant-0.021
Token remembers
Token ID18140
Feature activation-4.46e-03

Pos logit contributions
 friends+0.011
 there+0.011
 thinking+0.010
 here+0.010
 it+0.010
Neg logit contributions
ifying-0.021
attered-0.018
amy-0.017
rant-0.016
tr-0.016
Token when
Token ID618
Feature activation+3.86e-02

Pos logit contributions
ifying+0.011
attered+0.008
amy+0.008
rant+0.008
shaped+0.008
Neg logit contributions
 friends-0.005
 thinking-0.004
 here-0.004
 there-0.004
 kids-0.004
Token the
Token ID262
Feature activation+1.18e-02

Pos logit contributions
 friends+0.031
 thinking+0.031
 here+0.031
 upon+0.027
 idea+0.026
Neg logit contributions
ifying-0.092
shaped-0.073
amy-0.073
attered-0.072
nam-0.071
Token little
Token ID1310
Feature activation+1.75e-02

Pos logit contributions
 friends+0.014
 here+0.014
 there+0.013
 thinking+0.013
 it+0.013
Neg logit contributions
ifying-0.028
tr-0.024
attered-0.023
amy-0.022
rant-0.021
Token girl
Token ID2576
Feature activation-3.00e-03

Pos logit contributions
 friends+0.017
 thinking+0.015
 there+0.015
 here+0.015
 it+0.013
Neg logit contributions
ifying-0.048
attered-0.041
amy-0.039
tr-0.039
rant-0.039
Token cl
Token ID537
Feature activation-1.12e-02

Pos logit contributions
ifying+0.007
attered+0.006
amy+0.005
rant+0.005
tr+0.005
Neg logit contributions
 friends-0.004
 there-0.004
 thinking-0.004
 here-0.004
 things-0.004
Tokenapped
Token ID6320
Feature activation+6.75e-03

Pos logit contributions
ifying+0.025
attered+0.019
rant+0.019
tr+0.019
amy+0.019
Neg logit contributions
 friends-0.018
 there-0.017
 here-0.016
 thinking-0.016
 kids-0.015
Token But
Token ID887
Feature activation+8.21e-03

Pos logit contributions
 friends+0.010
 thinking+0.010
 here+0.010
 there+0.009
 things+0.009
Neg logit contributions
ifying-0.015
attered-0.012
tr-0.012
ksh-0.011
rant-0.011
Token the
Token ID262
Feature activation-2.16e-03

Pos logit contributions
 friends+0.008
 thinking+0.008
 here+0.007
 kids+0.006
 upon+0.006
Neg logit contributions
ifying-0.020
attered-0.017
tr-0.016
rant-0.016
work-0.016
Token children
Token ID1751
Feature activation-6.21e-03

Pos logit contributions
ifying+0.005
attered+0.004
tr+0.004
amy+0.003
rant+0.003
Neg logit contributions
 thinking-0.003
 there-0.003
 friends-0.003
 here-0.003
 it-0.003
Token were
Token ID547
Feature activation-2.16e-03

Pos logit contributions
ifying+0.013
attered+0.011
amy+0.010
tr+0.010
rant+0.010
Neg logit contributions
 friends-0.009
 thinking-0.009
 here-0.009
 there-0.009
 things-0.008
Token scared
Token ID12008
Feature activation-6.16e-03

Pos logit contributions
ifying+0.005
attered+0.004
amy+0.004
rant+0.004
tr+0.004
Neg logit contributions
 friends-0.002
 thinking-0.002
 things-0.002
 it-0.002
 there-0.002
Token when
Token ID618
Feature activation+3.86e-02

Pos logit contributions
ifying+0.017
attered+0.014
lier+0.014
amy+0.013
rant+0.013
Neg logit contributions
 friends-0.005
 thinking-0.005
 here-0.005
 there-0.005
 upon-0.004
Token they
Token ID484
Feature activation-2.90e-03

Pos logit contributions
 friends+0.040
 here+0.040
 thinking+0.039
 upon+0.035
ittens+0.033
Neg logit contributions
ifying-0.082
 unison-0.065
amy-0.065
attered-0.064
shaped-0.063
Token saw
Token ID2497
Feature activation-9.27e-03

Pos logit contributions
ifying+0.007
attered+0.006
amy+0.006
tr+0.006
rant+0.006
Neg logit contributions
 friends-0.003
 there-0.003
 thinking-0.003
 here-0.003
 it-0.003
Token the
Token ID262
Feature activation+5.77e-03

Pos logit contributions
ifying+0.027
attered+0.023
amy+0.022
rant+0.022
tr+0.022
Neg logit contributions
 friends-0.008
 there-0.007
 thinking-0.007
 here-0.007
 things-0.006
Token large
Token ID1588
Feature activation+2.40e-03

Pos logit contributions
 there+0.006
 thinking+0.006
 friends+0.006
 here+0.006
 it+0.006
Neg logit contributions
ifying-0.014
attered-0.012
tr-0.012
amy-0.011
work-0.011
Token cop
Token ID2243
Feature activation-1.08e-02

Pos logit contributions
 friends+0.002
 there+0.002
 thinking+0.002
 here+0.002
 it+0.002
Neg logit contributions
ifying-0.007
attered-0.006
amy-0.005
tr-0.005
rant-0.005
Token the
Token ID262
Feature activation+2.88e-04

Pos logit contributions
 friends+0.002
 there+0.002
 it+0.001
 things+0.001
 here+0.001
Neg logit contributions
ifying-0.008
attered-0.007
amy-0.006
tr-0.006
rant-0.006
Token tree
Token ID5509
Feature activation+7.30e-03

Pos logit contributions
 friends+0.000
 thinking+0.000
 here+0.000
 there+0.000
 it+0.000
Neg logit contributions
ifying-0.001
attered-0.001
tr-0.001
amy-0.000
shaped-0.000
Token.
Token ID13
Feature activation+5.44e-04

Pos logit contributions
 friends+0.007
 there+0.007
 it+0.006
 here+0.006
 things+0.006
Neg logit contributions
ifying-0.021
amy-0.018
attered-0.018
rant-0.017
ksh-0.017
Token Everyone
Token ID11075
Feature activation-1.31e-02

Pos logit contributions
 friends+0.001
 there+0.001
 thinking+0.001
 here+0.001
 it+0.001
Neg logit contributions
ifying-0.001
attered-0.001
amy-0.001
rant-0.001
tr-0.001
Token cheered
Token ID34767
Feature activation+8.52e-03

Pos logit contributions
ifying+0.033
attered+0.027
tr+0.025
amy+0.025
rant+0.025
Neg logit contributions
 friends-0.018
 there-0.017
 thinking-0.016
 here-0.016
 it-0.016
Token when
Token ID618
Feature activation+3.86e-02

Pos logit contributions
 friends+0.009
 there+0.007
 thinking+0.007
Next+0.007
 kids+0.007
Neg logit contributions
ifying-0.020
attered-0.017
lier-0.017
work-0.017
rant-0.017
Token they
Token ID484
Feature activation+3.58e-03

Pos logit contributions
 here+0.042
 friends+0.040
 thinking+0.039
 upon+0.037
 there+0.035
Neg logit contributions
ifying-0.087
amy-0.068
attered-0.068
shaped-0.067
work-0.067
Token finished
Token ID5201
Feature activation+8.19e-03

Pos logit contributions
 friends+0.006
 there+0.006
 thinking+0.005
 here+0.005
 it+0.005
Neg logit contributions
ifying-0.008
attered-0.007
amy-0.006
tr-0.006
rant-0.006
Token,
Token ID11
Feature activation+5.03e-03

Pos logit contributions
 friends+0.010
 here+0.009
 there+0.008
 kids+0.008
 thinking+0.008
Neg logit contributions
ifying-0.020
attered-0.017
amy-0.016
rant-0.016
shaped-0.015
Token and
Token ID290
Feature activation-3.06e-03

Pos logit contributions
 friends+0.004
 here+0.004
 there+0.004
 things+0.004
 upon+0.003
Neg logit contributions
ifying-0.013
attered-0.011
tr-0.011
rant-0.011
amy-0.010
Token it
Token ID340
Feature activation+6.34e-05

Pos logit contributions
ifying+0.009
attered+0.007
amy+0.007
rant+0.007
tr+0.007
Neg logit contributions
 friends-0.003
 thinking-0.002
 there-0.002
 here-0.002
 things-0.002
Token She
Token ID1375
Feature activation+5.22e-03

Pos logit contributions
 friends+0.009
 there+0.008
 thinking+0.008
 here+0.008
 things+0.008
Neg logit contributions
ifying-0.018
attered-0.015
amy-0.014
tr-0.014
rant-0.014
Token says
Token ID1139
Feature activation+2.23e-02

Pos logit contributions
 friends+0.005
 there+0.005
 thinking+0.005
 here+0.005
 it+0.004
Neg logit contributions
ifying-0.015
attered-0.012
amy-0.012
rant-0.012
tr-0.012
Token,
Token ID11
Feature activation+1.86e-04

Pos logit contributions
 friends+0.019
 thinking+0.017
 here+0.016
 upon+0.016
 there+0.015
Neg logit contributions
ifying-0.057
amy-0.049
attered-0.048
shaped-0.047
lier-0.047
Token "
Token ID366
Feature activation+5.99e-03

Pos logit contributions
 friends+0.000
 things+0.000
 here+0.000
 discovery+0.000
 upon+0.000
Neg logit contributions
ifying-0.000
tr-0.000
piece-0.000
amy-0.000
attered-0.000
TokenB
Token ID33
Feature activation+1.74e-02

Pos logit contributions
 friends+0.009
 thinking+0.008
 there+0.008
 here+0.008
 things+0.007
Neg logit contributions
ifying-0.014
attered-0.011
amy-0.011
tr-0.011
rant-0.011
Tokenoys
Token ID19417
Feature activation+3.85e-02

Pos logit contributions
 friends+0.030
 there+0.029
 thinking+0.028
 here+0.028
 it+0.026
Neg logit contributions
ifying-0.036
attered-0.029
amy-0.027
tr-0.027
rant-0.027
Token,
Token ID11
Feature activation-1.05e-02

Pos logit contributions
 friends+0.033
 thinking+0.031
 there+0.030
 here+0.030
Next+0.029
Neg logit contributions
ifying-0.090
attered-0.080
amy-0.078
lier-0.077
rant-0.076
Token boys
Token ID6510
Feature activation+6.38e-03

Pos logit contributions
ifying+0.024
tr+0.019
amy+0.019
attered+0.018
work+0.018
Neg logit contributions
 thinking-0.011
 friends-0.011
 upon-0.011
Next-0.010
 here-0.010
Token.
Token ID13
Feature activation+6.80e-03

Pos logit contributions
 friends+0.005
 there+0.004
 thinking+0.004
 here+0.004
 things+0.004
Neg logit contributions
ifying-0.018
attered-0.016
amy-0.015
rant-0.015
tr-0.015
Token You
Token ID921
Feature activation+5.73e-03

Pos logit contributions
 friends+0.009
 there+0.008
 thinking+0.008
 here+0.008
 things+0.007
Neg logit contributions
ifying-0.017
attered-0.014
amy-0.014
tr-0.014
rant-0.013
Token have
Token ID423
Feature activation+2.36e-03

Pos logit contributions
 friends+0.008
 there+0.008
 here+0.007
 thinking+0.007
 it+0.007
Neg logit contributions
ifying-0.013
attered-0.011
amy-0.011
rant-0.011
tr-0.011
Tokened
Token ID276
Feature activation-5.56e-03

Pos logit contributions
ifying+0.002
attered+0.002
amy+0.002
rant+0.002
tr+0.002
Neg logit contributions
 friends-0.003
 there-0.002
 thinking-0.002
 here-0.002
 it-0.002
Token down
Token ID866
Feature activation-1.49e-02

Pos logit contributions
ifying+0.013
attered+0.010
amy+0.010
rant+0.010
tr+0.010
Neg logit contributions
 friends-0.008
 there-0.008
 thinking-0.008
 here-0.008
 things-0.007
Token and
Token ID290
Feature activation+2.75e-03

Pos logit contributions
ifying+0.043
attered+0.037
amy+0.035
rant+0.035
tr+0.035
Neg logit contributions
 friends-0.014
 there-0.012
 thinking-0.012
 here-0.011
 things-0.010
Token snatched
Token ID43276
Feature activation-1.19e-02

Pos logit contributions
 friends+0.004
 there+0.004
 thinking+0.004
 it+0.004
 here+0.004
Neg logit contributions
ifying-0.006
attered-0.005
amy-0.005
rant-0.005
tr-0.005
Token Tim
Token ID5045
Feature activation-2.92e-02

Pos logit contributions
ifying+0.036
attered+0.032
tr+0.031
amy+0.030
rant+0.030
Neg logit contributions
 friends-0.010
 there-0.009
 it-0.008
 here-0.008
 things-0.007
Tokenmy
Token ID1820
Feature activation-4.34e-02

Pos logit contributions
ifying+0.063
attered+0.051
rant+0.048
amy+0.048
tr+0.047
Neg logit contributions
 friends-0.047
 there-0.045
 here-0.043
 thinking-0.042
 things-0.041
Token up
Token ID510
Feature activation-6.29e-03

Pos logit contributions
ifying+0.090
attered+0.076
amy+0.071
rant+0.071
tr+0.069
Neg logit contributions
 there-0.071
 friends-0.071
 here-0.068
 it-0.066
 things-0.065
Token in
Token ID287
Feature activation-1.19e-02

Pos logit contributions
ifying+0.016
attered+0.013
rant+0.012
amy+0.012
tr+0.012
Neg logit contributions
 friends-0.008
 thinking-0.007
 there-0.007
 here-0.006
 kids-0.006
Token his
Token ID465
Feature activation-1.61e-03

Pos logit contributions
ifying+0.031
attered+0.024
amy+0.024
 unison+0.023
rant+0.023
Neg logit contributions
 friends-0.013
 thinking-0.012
 there-0.011
 upon-0.011
 here-0.011
Token be
Token ID307
Feature activation+7.28e-03

Pos logit contributions
ifying+0.004
attered+0.004
rant+0.003
amy+0.003
lier+0.003
Neg logit contributions
 friends-0.002
 there-0.002
 things-0.002
 here-0.002
 thinking-0.002
Tokenak
Token ID461
Feature activation+4.17e-03

Pos logit contributions
 friends+0.015
 there+0.015
 thinking+0.015
 here+0.014
 things+0.014
Neg logit contributions
ifying-0.012
attered-0.009
amy-0.008
rant-0.008
tr-0.008
Token voice
Token ID3809
Feature activation-1.69e-02

Pos logit contributions
 friends+0.008
 here+0.008
 there+0.008
"+0.007
 thinking+0.007
Neg logit contributions
ifying-0.015
attered-0.012
tr-0.012
rant-0.012
amy-0.011
Token.
Token ID13
Feature activation+2.75e-03

Pos logit contributions
ifying+0.047
attered+0.040
amy+0.039
rant+0.038
tr+0.038
Neg logit contributions
 friends-0.017
 there-0.017
 here-0.015
 thinking-0.015
 it-0.015
Token But
Token ID887
Feature activation+3.52e-04

Pos logit contributions
 friends+0.004
 there+0.003
 thinking+0.003
 here+0.003
 it+0.003
Neg logit contributions
ifying-0.007
attered-0.006
amy-0.005
rant-0.005
tr-0.005
Token then
Token ID788
Feature activation+4.91e-03

Pos logit contributions
 friends+0.000
 thinking+0.000
 here+0.000
 kids+0.000
 there+0.000
Neg logit contributions
ifying-0.001
attered-0.001
rant-0.001
amy-0.001
tr-0.001
Token,
Token ID11
Feature activation-6.17e-03

Pos logit contributions
 friends+0.004
 here+0.003
 thinking+0.003
 upon+0.003
 kids+0.003
Neg logit contributions
ifying-0.013
attered-0.011
amy-0.011
lier-0.011
shaped-0.011
Token mirac
Token ID30352
Feature activation-4.22e-02

Pos logit contributions
ifying+0.016
attered+0.013
tr+0.013
shaped+0.013
amy+0.012
Neg logit contributions
 friends-0.006
 thinking-0.005
 here-0.005
 kids-0.005
 there-0.005
Tokenulously
Token ID18117
Feature activation+1.78e-02

Pos logit contributions
ifying+0.087
attered+0.062
tr+0.061
ksh+0.060
amy+0.059
Neg logit contributions
 friends-0.075
 there-0.073
 it-0.071
 here-0.071
 things-0.070
Token,
Token ID11
Feature activation+4.79e-03

Pos logit contributions
 friends+0.017
 upon+0.015
 here+0.015
 kids+0.014
 next+0.014
Neg logit contributions
ifying-0.042
lier-0.035
shaped-0.034
amy-0.034
rant-0.034
Token it
Token ID340
Feature activation+9.18e-03

Pos logit contributions
 friends+0.004
 thinking+0.004
 here+0.004
 kids+0.004
 there+0.004
Neg logit contributions
ifying-0.013
attered-0.010
shaped-0.010
tr-0.010
amy-0.010
Token returned
Token ID4504
Feature activation+1.48e-02

Pos logit contributions
 friends+0.015
 there+0.014
 thinking+0.014
 here+0.014
 things+0.013
Neg logit contributions
ifying-0.020
attered-0.016
amy-0.015
tr-0.015
rant-0.015
Token and
Token ID290
Feature activation+2.05e-03

Pos logit contributions
 friends+0.015
 thinking+0.013
 there+0.012
 kids+0.012
 here+0.012
Neg logit contributions
ifying-0.039
attered-0.034
tr-0.032
rant-0.031
amy-0.031
Token knew
Token ID2993
Feature activation-1.73e-03

Pos logit contributions
ifying+0.002
attered+0.002
amy+0.002
tr+0.002
rant+0.002
Neg logit contributions
 friends-0.002
 there-0.002
 here-0.002
 thinking-0.002
 things-0.002
Token how
Token ID703
Feature activation+3.05e-03

Pos logit contributions
ifying+0.005
attered+0.004
tr+0.004
amy+0.004
lier+0.004
Neg logit contributions
 friends-0.001
 thinking-0.001
 here-0.001
 upon-0.001
 there-0.001
Token to
Token ID284
Feature activation-5.12e-03

Pos logit contributions
 friends+0.003
 there+0.003
 thinking+0.003
 here+0.002
 kids+0.002
Neg logit contributions
ifying-0.008
attered-0.007
amy-0.007
rant-0.007
tr-0.007
Token prevent
Token ID2948
Feature activation-7.18e-03

Pos logit contributions
ifying+0.010
attered+0.008
tr+0.008
rant+0.008
amy+0.008
Neg logit contributions
 friends-0.008
 thinking-0.008
 there-0.008
 here-0.007
 things-0.007
Token Tim
Token ID5045
Feature activation-2.95e-02

Pos logit contributions
ifying+0.017
attered+0.015
shaped+0.014
amy+0.014
lier+0.014
Neg logit contributions
 upon-0.007
 there-0.007
 friends-0.007
 here-0.007
 thinking-0.007
Tokenmy
Token ID1820
Feature activation-4.20e-02

Pos logit contributions
ifying+0.064
attered+0.053
amy+0.050
tr+0.049
rant+0.049
Neg logit contributions
 friends-0.046
 there-0.044
 thinking-0.043
 here-0.042
 it-0.040
Token from
Token ID422
Feature activation+1.44e-02

Pos logit contributions
ifying+0.103
attered+0.087
amy+0.084
rant+0.082
tr+0.081
Neg logit contributions
 friends-0.055
 there-0.053
 here-0.050
 thinking-0.050
 things-0.048
Token forgetting
Token ID32012
Feature activation-6.83e-04

Pos logit contributions
 upon+0.020
 there+0.017
"+0.016
 friends+0.016
 name+0.016
Neg logit contributions
ifying-0.024
attered-0.020
amy-0.020
shaped-0.020
rant-0.019
Token about
Token ID546
Feature activation+6.66e-03

Pos logit contributions
ifying+0.002
attered+0.002
amy+0.001
rant+0.001
tr+0.001
Neg logit contributions
 friends-0.001
 there-0.001
 thinking-0.001
 here-0.001
 kids-0.001
Token his
Token ID465
Feature activation+1.98e-03

Pos logit contributions
 friends+0.007
 here+0.006
 upon+0.006
 there+0.006
"+0.006
Neg logit contributions
ifying-0.016
attered-0.012
rant-0.012
shaped-0.012
amy-0.012
Token toy
Token ID13373
Feature activation-4.73e-03

Pos logit contributions
 there+0.003
"+0.003
 upon+0.003
Next+0.003
 here+0.003
Neg logit contributions
ifying-0.004
attered-0.003
tr-0.003
work-0.003
shaped-0.003
Token.
Token ID13
Feature activation-9.52e-03

Pos logit contributions
 friends+0.014
 there+0.014
 thinking+0.013
 here+0.013
 upon+0.012
Neg logit contributions
ifying-0.026
attered-0.022
amy-0.020
lier-0.020
tr-0.020
Token Let
Token ID3914
Feature activation-4.99e-03

Pos logit contributions
ifying+0.024
attered+0.021
amy+0.020
rant+0.020
tr+0.019
Neg logit contributions
 friends-0.011
 there-0.011
 here-0.010
 thinking-0.010
 it-0.010
Token's
Token ID338
Feature activation-1.04e-02

Pos logit contributions
ifying+0.009
attered+0.007
amy+0.007
rant+0.007
tr+0.006
Neg logit contributions
 friends-0.010
 there-0.009
 thinking-0.009
 here-0.009
 it-0.009
Token put
Token ID1234
Feature activation-7.40e-03

Pos logit contributions
ifying+0.023
attered+0.019
amy+0.018
tr+0.018
rant+0.018
Neg logit contributions
 friends-0.015
 there-0.014
 thinking-0.014
 here-0.014
 things-0.013
Token the
Token ID262
Feature activation-6.67e-03

Pos logit contributions
ifying+0.020
attered+0.016
rant+0.016
amy+0.016
tr+0.016
Neg logit contributions
 friends-0.007
 thinking-0.007
 there-0.006
 upon-0.006
 here-0.006
Token top
Token ID1353
Feature activation-4.19e-02

Pos logit contributions
ifying+0.013
tr+0.010
attered+0.010
amy+0.010
rant+0.010
Neg logit contributions
 friends-0.011
 thinking-0.011
 there-0.010
 here-0.010
 it-0.010
Token back
Token ID736
Feature activation-1.50e-02

Pos logit contributions
ifying+0.113
attered+0.097
amy+0.093
rant+0.092
tr+0.091
Neg logit contributions
 friends-0.044
 there-0.041
 here-0.039
 thinking-0.038
 it-0.035
Token on
Token ID319
Feature activation-1.04e-02

Pos logit contributions
ifying+0.038
attered+0.032
amy+0.030
rant+0.030
tr+0.030
Neg logit contributions
 friends-0.018
 thinking-0.017
 there-0.017
 here-0.016
 it-0.015
Token the
Token ID262
Feature activation-1.58e-02

Pos logit contributions
ifying+0.029
attered+0.023
amy+0.023
rant+0.023
shaped+0.022
Neg logit contributions
 friends-0.010
 thinking-0.010
 there-0.009
 upon-0.008
 here-0.008
Token box
Token ID3091
Feature activation-1.96e-03

Pos logit contributions
ifying+0.040
attered+0.034
amy+0.032
lier+0.032
rant+0.031
Neg logit contributions
 friends-0.021
 there-0.020
 next-0.019
 here-0.019
 thinking-0.019
Token and
Token ID290
Feature activation-1.81e-02

Pos logit contributions
ifying+0.006
attered+0.005
tr+0.005
rant+0.005
amy+0.004
Neg logit contributions
 friends-0.002
 thinking-0.002
 there-0.001
 here-0.001
 things-0.001
Token hill
Token ID12788
Feature activation+1.02e-03

Pos logit contributions
 friends+0.003
 there+0.003
 thinking+0.003
 here+0.002
 things+0.002
Neg logit contributions
ifying-0.006
attered-0.005
amy-0.005
tr-0.005
rant-0.005
Token and
Token ID290
Feature activation-4.64e-05

Pos logit contributions
 friends+0.001
 there+0.001
 thinking+0.001
 here+0.001
 things+0.000
Neg logit contributions
ifying-0.003
attered-0.003
amy-0.003
rant-0.003
tr-0.003
Token put
Token ID1234
Feature activation+4.77e-03

Pos logit contributions
ifying+0.000
attered+0.000
amy+0.000
rant+0.000
tr+0.000
Neg logit contributions
 friends-0.000
 there-0.000
 thinking-0.000
 here-0.000
 things-0.000
Token her
Token ID607
Feature activation-7.00e-03

Pos logit contributions
 friends+0.004
 thinking+0.004
 here+0.003
 upon+0.003
 there+0.003
Neg logit contributions
ifying-0.013
attered-0.011
rant-0.011
amy-0.011
shaped-0.011
Token or
Token ID393
Feature activation+3.73e-03

Pos logit contributions
ifying+0.017
attered+0.014
tr+0.013
amy+0.013
rant+0.013
Neg logit contributions
 friends-0.009
 thinking-0.009
 there-0.009
 here-0.008
 it-0.008
Tokennam
Token ID7402
Feature activation-4.13e-02

Pos logit contributions
 friends+0.009
 there+0.009
 thinking+0.009
 here+0.009
 it+0.009
Neg logit contributions
ifying-0.005
attered-0.003
amy-0.003
rant-0.003
tr-0.003
Tokenents
Token ID658
Feature activation+7.62e-04

Pos logit contributions
ifying+0.078
rant+0.062
attered+0.061
tr+0.061
amy+0.060
Neg logit contributions
 friends-0.076
 there-0.072
 here-0.069
 thinking-0.067
 it-0.066
Token all
Token ID477
Feature activation+5.40e-03

Pos logit contributions
 friends+0.001
 there+0.001
 thinking+0.001
 here+0.001
 things+0.001
Neg logit contributions
ifying-0.002
attered-0.002
amy-0.002
rant-0.002
tr-0.002
Token around
Token ID1088
Feature activation+1.03e-02

Pos logit contributions
 friends+0.005
 there+0.005
 here+0.004
 thinking+0.004
 it+0.004
Neg logit contributions
ifying-0.015
attered-0.013
amy-0.013
tr-0.012
rant-0.012
Token.
Token ID13
Feature activation+2.37e-05

Pos logit contributions
 friends+0.012
 thinking+0.010
 here+0.010
 there+0.010
 kids+0.009
Neg logit contributions
ifying-0.026
attered-0.022
amy-0.021
rant-0.021
shaped-0.021
Token She
Token ID1375
Feature activation+1.06e-03

Pos logit contributions
 friends+0.000
 there+0.000
 thinking+0.000
 here+0.000
 things+0.000
Neg logit contributions
ifying-0.000
attered-0.000
tr-0.000
amy-0.000
rant-0.000
Token scary
Token ID14343
Feature activation-1.13e-03

Pos logit contributions
ifying+0.007
attered+0.006
tr+0.006
amy+0.005
shaped+0.005
Neg logit contributions
 friends-0.005
 here-0.005
 there-0.005
 thinking-0.005
"-0.004
Token man
Token ID582
Feature activation-1.60e-02

Pos logit contributions
ifying+0.003
attered+0.002
tr+0.002
amy+0.002
rant+0.002
Neg logit contributions
 friends-0.001
 here-0.001
 there-0.001
 upon-0.001
 thinking-0.001
Token who
Token ID508
Feature activation-1.29e-03

Pos logit contributions
ifying+0.039
attered+0.032
amy+0.031
rant+0.030
tr+0.030
Neg logit contributions
 friends-0.022
 there-0.021
 thinking-0.020
 here-0.020
 it-0.018
Token stole
Token ID18240
Feature activation-2.40e-03

Pos logit contributions
ifying+0.003
attered+0.002
amy+0.002
rant+0.002
tr+0.002
Neg logit contributions
 friends-0.002
 there-0.002
 thinking-0.002
 here-0.002
 it-0.002
Token Tim
Token ID5045
Feature activation-2.46e-02

Pos logit contributions
ifying+0.007
attered+0.006
amy+0.006
rant+0.006
tr+0.006
Neg logit contributions
 friends-0.002
 there-0.002
 thinking-0.001
 here-0.001
 it-0.001
Tokenmy
Token ID1820
Feature activation-4.06e-02

Pos logit contributions
ifying+0.059
attered+0.048
amy+0.046
rant+0.045
tr+0.045
Neg logit contributions
 friends-0.035
 there-0.033
 here-0.031
 thinking-0.031
 things-0.030
Token's
Token ID338
Feature activation-7.74e-03

Pos logit contributions
ifying+0.097
attered+0.081
amy+0.079
rant+0.077
tr+0.075
Neg logit contributions
 there-0.055
 friends-0.055
 here-0.052
 it-0.051
 things-0.050
Token shirt
Token ID10147
Feature activation-7.37e-03

Pos logit contributions
ifying+0.014
attered+0.011
tr+0.010
amy+0.010
shaped+0.010
Neg logit contributions
 friends-0.014
 there-0.013
 here-0.013
 thinking-0.013
 kids-0.012
Token.
Token ID13
Feature activation+2.56e-05

Pos logit contributions
ifying+0.019
attered+0.016
amy+0.016
rant+0.015
tr+0.015
Neg logit contributions
 friends-0.008
 there-0.008
 thinking-0.007
 here-0.007
 things-0.007
Token They
Token ID1119
Feature activation+2.34e-04

Pos logit contributions
 friends+0.000
 there+0.000
 thinking+0.000
 here+0.000
 things+0.000
Neg logit contributions
ifying-0.000
attered-0.000
tr-0.000
amy-0.000
rant-0.000
Token asked
Token ID1965
Feature activation+2.21e-03

Pos logit contributions
 friends+0.000
 there+0.000
 it+0.000
 thinking+0.000
 mom+0.000
Neg logit contributions
ifying-0.001
attered-0.000
amy-0.000
ksh-0.000
rant-0.000
Token the
Token ID262
Feature activation-1.25e-03

Pos logit contributions
ifying+0.029
attered+0.025
amy+0.024
rant+0.024
tr+0.024
Neg logit contributions
 friends-0.008
 there-0.007
 here-0.006
 thinking-0.006
 it-0.006
Token rest
Token ID1334
Feature activation-2.77e-02

Pos logit contributions
ifying+0.002
attered+0.002
tr+0.002
amy+0.002
shaped+0.002
Neg logit contributions
 friends-0.002
 thinking-0.002
 there-0.002
 here-0.002
 it-0.002
Token of
Token ID286
Feature activation-5.23e-03

Pos logit contributions
ifying+0.084
attered+0.072
amy+0.072
tr+0.072
ksh+0.071
Neg logit contributions
 friends-0.022
 there-0.021
 here-0.018
 it-0.017
 things-0.016
Token the
Token ID262
Feature activation-7.10e-03

Pos logit contributions
ifying+0.015
attered+0.013
amy+0.012
rant+0.012
tr+0.012
Neg logit contributions
 friends-0.004
 there-0.004
 thinking-0.004
 here-0.004
 things-0.003
Token or
Token ID393
Feature activation-8.21e-03

Pos logit contributions
ifying+0.014
attered+0.011
tr+0.011
amy+0.011
rant+0.010
Neg logit contributions
 friends-0.012
 there-0.012
 thinking-0.012
 here-0.011
 it-0.011
Tokennam
Token ID7402
Feature activation-4.05e-02

Pos logit contributions
ifying+0.014
attered+0.011
amy+0.010
rant+0.010
ksh+0.010
Neg logit contributions
 friends-0.018
 there-0.018
 it-0.017
 thinking-0.017
 here-0.016
Tokenents
Token ID658
Feature activation-4.34e-03

Pos logit contributions
ifying+0.079
rant+0.063
tr+0.062
amy+0.061
attered+0.061
Neg logit contributions
 friends-0.072
 there-0.068
 here-0.065
 upon-0.063
 thinking-0.062
Token onto
Token ID4291
Feature activation-8.32e-03

Pos logit contributions
ifying+0.012
attered+0.010
amy+0.009
rant+0.009
tr+0.009
Neg logit contributions
 friends-0.005
 there-0.005
 here-0.004
 thinking-0.004
 it-0.004
Token the
Token ID262
Feature activation-1.59e-02

Pos logit contributions
ifying+0.026
attered+0.023
amy+0.022
tr+0.022
rant+0.022
Neg logit contributions
 friends-0.005
 there-0.005
 it-0.005
 thinking-0.004
 here-0.004
Token tree
Token ID5509
Feature activation-4.49e-03

Pos logit contributions
ifying+0.040
attered+0.034
amy+0.033
rant+0.032
ksh+0.031
Neg logit contributions
 friends-0.021
 there-0.020
 things-0.019
 next-0.019
 here-0.019
Token.
Token ID13
Feature activation-7.12e-03

Pos logit contributions
ifying+0.012
attered+0.010
amy+0.010
rant+0.010
ksh+0.009
Neg logit contributions
 friends-0.005
 there-0.005
 here-0.005
 thinking-0.004
 it-0.004
Token help
Token ID1037
Feature activation-6.74e-03

Pos logit contributions
 friends+0.009
 there+0.008
 thinking+0.008
 here+0.008
 it+0.008
Neg logit contributions
ifying-0.018
attered-0.015
amy-0.015
rant-0.014
tr-0.014
Token her
Token ID607
Feature activation-3.96e-03

Pos logit contributions
ifying+0.018
attered+0.015
tr+0.015
amy+0.015
rant+0.015
Neg logit contributions
 friends-0.008
 there-0.007
 it-0.007
 thinking-0.007
 here-0.007
Token mom
Token ID1995
Feature activation-6.27e-03

Pos logit contributions
ifying+0.012
attered+0.011
tr+0.011
amy+0.010
shaped+0.010
Neg logit contributions
 friends-0.002
 thinking-0.002
 there-0.002
 here-0.002
 upon-0.001
Token put
Token ID1234
Feature activation+9.74e-04

Pos logit contributions
ifying+0.018
attered+0.015
amy+0.015
rant+0.014
tr+0.014
Neg logit contributions
 friends-0.006
 there-0.005
 thinking-0.005
 here-0.005
 it-0.005
Token or
Token ID393
Feature activation+1.42e-02

Pos logit contributions
 friends+0.001
 there+0.001
 thinking+0.001
 here+0.001
 it+0.001
Neg logit contributions
ifying-0.003
attered-0.002
amy-0.002
rant-0.002
tr-0.002
Tokennam
Token ID7402
Feature activation-4.01e-02

Pos logit contributions
 friends+0.017
 there+0.016
 thinking+0.016
 here+0.015
 it+0.014
Neg logit contributions
ifying-0.036
attered-0.030
amy-0.029
rant-0.028
tr-0.028
Tokenents
Token ID658
Feature activation+8.37e-03

Pos logit contributions
ifying+0.077
rant+0.060
attered+0.059
tr+0.059
amy+0.059
Neg logit contributions
 friends-0.074
 there-0.070
 here-0.067
 thinking-0.064
 it-0.064
Token on
Token ID319
Feature activation+8.46e-03

Pos logit contributions
 friends+0.009
 there+0.008
 thinking+0.008
 here+0.008
 things+0.007
Neg logit contributions
ifying-0.022
attered-0.019
amy-0.018
rant-0.018
tr-0.018
Token the
Token ID262
Feature activation-4.81e-03

Pos logit contributions
 friends+0.008
 there+0.008
 thinking+0.007
 here+0.007
 it+0.007
Neg logit contributions
ifying-0.024
attered-0.020
amy-0.019
rant-0.019
tr-0.019
Token tree
Token ID5509
Feature activation-3.06e-03

Pos logit contributions
ifying+0.012
attered+0.010
amy+0.010
rant+0.010
ksh+0.009
Neg logit contributions
 friends-0.007
 there-0.006
 things-0.006
 next-0.006
 here-0.006
Token.
Token ID13
Feature activation+6.17e-03

Pos logit contributions
ifying+0.009
attered+0.008
amy+0.008
rant+0.008
ksh+0.007
Neg logit contributions
 there-0.003
 friends-0.003
 here-0.002
 it-0.002
 things-0.002
Token she
Token ID673
Feature activation-1.63e-02

Pos logit contributions
 friends+0.024
 here+0.022
 idea+0.021
 man+0.021
 friend+0.021
Neg logit contributions
ifying-0.076
attered-0.061
amy-0.060
shaped-0.060
piece-0.059
Token could
Token ID714
Feature activation+3.68e-03

Pos logit contributions
ifying+0.038
attered+0.032
amy+0.030
tr+0.029
rant+0.029
Neg logit contributions
 friends-0.024
 there-0.023
 thinking-0.022
 here-0.022
 it-0.021
Token carry
Token ID3283
Feature activation+8.83e-03

Pos logit contributions
 friends+0.005
 there+0.004
 thinking+0.004
 here+0.004
 things+0.004
Neg logit contributions
ifying-0.009
attered-0.007
tr-0.007
rant-0.007
shaped-0.007
Token the
Token ID262
Feature activation+1.69e-03

Pos logit contributions
 friends+0.007
 thinking+0.006
 there+0.006
 here+0.006
 upon+0.005
Neg logit contributions
ifying-0.025
attered-0.021
rant-0.021
amy-0.021
shaped-0.021
Token or
Token ID393
Feature activation-7.95e-03

Pos logit contributions
 thinking+0.002
 friends+0.002
 there+0.002
 here+0.002
 it+0.002
Neg logit contributions
ifying-0.004
attered-0.003
tr-0.003
rant-0.003
amy-0.003
Tokennam
Token ID7402
Feature activation-4.00e-02

Pos logit contributions
ifying+0.013
attered+0.010
amy+0.010
rant+0.009
tr+0.009
Neg logit contributions
 friends-0.017
 there-0.016
 thinking-0.016
 here-0.016
 it-0.015
Tokenents
Token ID658
Feature activation+2.61e-03

Pos logit contributions
ifying+0.075
attered+0.059
rant+0.057
amy+0.057
tr+0.056
Neg logit contributions
 friends-0.075
 there-0.072
 here-0.069
 thinking-0.067
 it-0.066
Token back
Token ID736
Feature activation+2.28e-03

Pos logit contributions
 friends+0.003
 thinking+0.003
 there+0.003
 here+0.003
 kids+0.003
Neg logit contributions
ifying-0.007
attered-0.006
rant-0.005
amy-0.005
tr-0.005
Token with
Token ID351
Feature activation+5.26e-03

Pos logit contributions
 friends+0.002
 thinking+0.002
 there+0.002
 here+0.002
 kids+0.002
Neg logit contributions
ifying-0.006
attered-0.005
tr-0.005
rant-0.005
amy-0.005
Token her
Token ID607
Feature activation-9.72e-04

Pos logit contributions
 friends+0.005
 thinking+0.004
 here+0.004
 there+0.004
 upon+0.004
Neg logit contributions
ifying-0.014
attered-0.012
rant-0.011
shaped-0.011
amy-0.011
Token.
Token ID13
Feature activation-6.68e-04

Pos logit contributions
ifying+0.003
attered+0.002
tr+0.002
amy+0.002
rant+0.002
Neg logit contributions
 thinking-0.001
 there-0.001
 friends-0.001
 here-0.001
 it-0.001
Token The
Token ID383
Feature activation+1.23e-03

Pos logit contributions
 friends+0.003
 there+0.003
 thinking+0.003
 here+0.003
 things+0.003
Neg logit contributions
ifying-0.006
attered-0.005
amy-0.005
tr-0.005
rant-0.005
Token big
Token ID1263
Feature activation-1.20e-03

Pos logit contributions
 friends+0.001
 there+0.001
 here+0.001
 thinking+0.001
 it+0.001
Neg logit contributions
ifying-0.003
attered-0.003
tr-0.003
amy-0.003
rant-0.003
Token boy
Token ID2933
Feature activation-9.48e-03

Pos logit contributions
ifying+0.003
attered+0.003
amy+0.003
tr+0.002
rant+0.002
Neg logit contributions
 friends-0.001
 here-0.001
 thinking-0.001
 there-0.001
 it-0.001
Token stole
Token ID18240
Feature activation-3.76e-03

Pos logit contributions
ifying+0.025
attered+0.021
amy+0.020
rant+0.020
tr+0.020
Neg logit contributions
 friends-0.012
 there-0.011
 thinking-0.010
 here-0.010
 it-0.010
Token Tim
Token ID5045
Feature activation-2.05e-02

Pos logit contributions
ifying+0.011
attered+0.010
amy+0.009
rant+0.009
tr+0.009
Neg logit contributions
 friends-0.003
 there-0.003
 thinking-0.003
 here-0.003
 it-0.002
Tokenmy
Token ID1820
Feature activation-3.98e-02

Pos logit contributions
ifying+0.049
attered+0.040
amy+0.038
rant+0.038
tr+0.037
Neg logit contributions
 friends-0.029
 there-0.027
 thinking-0.026
 here-0.026
 things-0.025
Token's
Token ID338
Feature activation-1.82e-03

Pos logit contributions
ifying+0.103
attered+0.087
amy+0.084
rant+0.083
ksh+0.081
Neg logit contributions
 friends-0.048
 there-0.047
 here-0.044
 it-0.043
 things-0.042
Token toy
Token ID13373
Feature activation-1.45e-02

Pos logit contributions
ifying+0.004
attered+0.004
tr+0.003
rant+0.003
shaped+0.003
Neg logit contributions
 friends-0.002
 there-0.002
 thinking-0.002
 here-0.002
 it-0.002
Token truck
Token ID7779
Feature activation-1.57e-02

Pos logit contributions
ifying+0.041
attered+0.035
amy+0.035
rant+0.034
tr+0.033
Neg logit contributions
 friends-0.014
 there-0.013
 here-0.012
 thinking-0.012
 it-0.011
Token!
Token ID0
Feature activation+3.91e-03

Pos logit contributions
ifying+0.048
attered+0.041
amy+0.040
rant+0.040
ksh+0.039
Neg logit contributions
 friends-0.012
 there-0.011
 here-0.010
 thinking-0.010
 it-0.009
Token
Token ID220
Feature activation-1.13e-02

Pos logit contributions
 friends+0.005
 thinking+0.004
 here+0.004
 there+0.004
 things+0.004
Neg logit contributions
ifying-0.010
attered-0.008
tr-0.008
amy-0.008
ksh-0.008
TokenThey
Token ID2990
Feature activation-6.80e-03

Pos logit contributions
 friends+0.015
 thinking+0.014
 here+0.013
 things+0.013
 there+0.013
Neg logit contributions
ifying-0.019
attered-0.016
amy-0.015
tr-0.015
lier-0.014
Token packed
Token ID11856
Feature activation+2.30e-03

Pos logit contributions
ifying+0.015
attered+0.012
tr+0.012
amy+0.012
rant+0.011
Neg logit contributions
 friends-0.011
 there-0.010
 thinking-0.010
 here-0.010
 it-0.009
Token a
Token ID257
Feature activation-2.41e-02

Pos logit contributions
 friends+0.003
 there+0.003
 things+0.002
 it+0.002
 here+0.002
Neg logit contributions
ifying-0.006
attered-0.005
tr-0.005
amy-0.005
rant-0.005
Token picnic
Token ID35715
Feature activation+1.78e-03

Pos logit contributions
ifying+0.062
attered+0.053
amy+0.050
rant+0.050
tr+0.049
Neg logit contributions
 friends-0.029
 there-0.027
 thinking-0.026
 here-0.026
 things-0.024
Token basket
Token ID7988
Feature activation-5.39e-03

Pos logit contributions
 friends+0.002
 thinking+0.002
 there+0.002
 here+0.002
 it+0.002
Neg logit contributions
ifying-0.005
attered-0.004
amy-0.004
tr-0.004
rant-0.004
Token full
Token ID1336
Feature activation-3.97e-02

Pos logit contributions
ifying+0.016
attered+0.014
amy+0.014
rant+0.013
tr+0.013
Neg logit contributions
 friends-0.004
 there-0.004
 thinking-0.003
 here-0.003
 things-0.003
Token of
Token ID286
Feature activation-2.48e-02

Pos logit contributions
ifying+0.119
attered+0.104
tr+0.101
amy+0.101
rant+0.100
Neg logit contributions
 friends-0.034
 there-0.030
 here-0.026
 thinking-0.025
 it-0.024
Token food
Token ID2057
Feature activation+5.51e-03

Pos logit contributions
ifying+0.050
attered+0.040
amy+0.040
rant+0.037
shaped+0.036
Neg logit contributions
 thinking-0.037
't-0.034
 here-0.034
 there-0.033
 friends-0.033
Token and
Token ID290
Feature activation-1.14e-02

Pos logit contributions
 friends+0.004
 thinking+0.004
 here+0.004
 there+0.004
 kids+0.003
Neg logit contributions
ifying-0.015
attered-0.013
tr-0.013
rant-0.013
amy-0.013
Token went
Token ID1816
Feature activation-1.49e-02

Pos logit contributions
ifying+0.028
attered+0.024
amy+0.023
rant+0.022
tr+0.022
Neg logit contributions
 friends-0.015
 there-0.014
 thinking-0.013
 here-0.013
 it-0.013
Token to
Token ID284
Feature activation-1.44e-02

Pos logit contributions
ifying+0.039
attered+0.033
rant+0.031
amy+0.031
tr+0.031
Neg logit contributions
 friends-0.017
 there-0.015
 thinking-0.015
 here-0.015
 things-0.014
Token tail
Token ID7894
Feature activation+8.37e-03

Pos logit contributions
ifying+0.012
attered+0.010
amy+0.010
rant+0.010
tr+0.010
Neg logit contributions
 friends-0.007
 there-0.007
 thinking-0.007
 here-0.007
 it-0.006
Token and
Token ID290
Feature activation+1.57e-03

Pos logit contributions
 friends+0.008
 there+0.007
 thinking+0.007
 here+0.007
 things+0.006
Neg logit contributions
ifying-0.023
attered-0.020
amy-0.019
rant-0.019
tr-0.019
Token l
Token ID300
Feature activation-1.01e-02

Pos logit contributions
 friends+0.002
 there+0.002
 here+0.002
 thinking+0.002
 things+0.002
Neg logit contributions
ifying-0.004
attered-0.003
tr-0.003
amy-0.003
rant-0.003
Tokenicked
Token ID9484
Feature activation-2.65e-02

Pos logit contributions
ifying+0.019
ksh+0.015
rant+0.015
tr+0.015
attered+0.014
Neg logit contributions
 friends-0.020
 there-0.018
 here-0.017
 thinking-0.017
 kids-0.016
Token Tim
Token ID5045
Feature activation-2.57e-02

Pos logit contributions
ifying+0.082
attered+0.072
tr+0.070
amy+0.069
rant+0.069
Neg logit contributions
 friends-0.021
 there-0.018
 it-0.018
 here-0.015
 thinking-0.015
Tokenmy
Token ID1820
Feature activation-3.96e-02

Pos logit contributions
ifying+0.067
attered+0.055
rant+0.053
amy+0.052
tr+0.051
Neg logit contributions
 friends-0.033
 there-0.030
 thinking-0.028
 here-0.028
 things-0.027
Token's
Token ID338
Feature activation-1.22e-02

Pos logit contributions
ifying+0.088
attered+0.073
amy+0.070
rant+0.070
tr+0.066
Neg logit contributions
 friends-0.066
 there-0.060
 it-0.058
 things-0.057
 thinking-0.056
Token face
Token ID1986
Feature activation-9.47e-03

Pos logit contributions
ifying+0.028
attered+0.023
amy+0.022
rant+0.022
tr+0.021
Neg logit contributions
 friends-0.019
 thinking-0.017
 there-0.017
 here-0.017
 things-0.016
Token.
Token ID13
Feature activation-7.67e-03

Pos logit contributions
ifying+0.027
attered+0.024
amy+0.023
rant+0.022
tr+0.022
Neg logit contributions
 friends-0.009
 there-0.008
 thinking-0.007
 it-0.007
 here-0.007
Token Tim
Token ID5045
Feature activation+1.73e-04

Pos logit contributions
ifying+0.018
attered+0.015
tr+0.014
rant+0.014
ksh+0.014
Neg logit contributions
 friends-0.011
 there-0.010
 here-0.010
 thinking-0.010
 things-0.009
Tokenmy
Token ID1820
Feature activation-4.15e-03

Pos logit contributions
 friends+0.000
 there+0.000
 thinking+0.000
 here+0.000
 things+0.000
Neg logit contributions
ifying-0.000
attered-0.000
amy-0.000
rant-0.000
tr-0.000
Tokenictionary
Token ID14188
Feature activation-3.21e-03

Pos logit contributions
ifying+0.026
attered+0.021
amy+0.019
rant+0.019
tr+0.019
Neg logit contributions
 friends-0.022
 there-0.021
 thinking-0.020
 here-0.020
 things-0.019
Token'
Token ID6
Feature activation-1.31e-02

Pos logit contributions
ifying+0.007
attered+0.005
amy+0.005
tr+0.005
rant+0.005
Neg logit contributions
 friends-0.006
 there-0.005
 thinking-0.005
 here-0.005
 things-0.005
Token written
Token ID3194
Feature activation-1.75e-02

Pos logit contributions
ifying+0.034
attered+0.029
tr+0.027
rant+0.027
amy+0.027
Neg logit contributions
 friends-0.015
 there-0.014
 thinking-0.014
 here-0.013
 it-0.012
Token on
Token ID319
Feature activation-1.55e-02

Pos logit contributions
ifying+0.041
attered+0.034
amy+0.032
rant+0.032
tr+0.032
Neg logit contributions
 friends-0.025
 there-0.024
 thinking-0.023
 here-0.023
 things-0.021
Token the
Token ID262
Feature activation-2.27e-02

Pos logit contributions
ifying+0.045
attered+0.037
amy+0.036
rant+0.036
tr+0.036
Neg logit contributions
 friends-0.014
 there-0.012
 thinking-0.012
 here-0.011
 upon-0.010
Token front
Token ID2166
Feature activation-3.96e-02

Pos logit contributions
ifying+0.048
attered+0.040
amy+0.038
rant+0.037
shaped+0.036
Neg logit contributions
 friends-0.037
 there-0.035
 thinking-0.035
 here-0.034
 things-0.033
Token.
Token ID13
Feature activation-4.39e-03

Pos logit contributions
ifying+0.091
attered+0.079
amy+0.075
rant+0.073
ksh+0.071
Neg logit contributions
 friends-0.057
 there-0.054
 here-0.052
 thinking-0.051
 it-0.050
Token She
Token ID1375
Feature activation-2.11e-03

Pos logit contributions
ifying+0.010
attered+0.008
amy+0.008
tr+0.007
rant+0.007
Neg logit contributions
 friends-0.007
 thinking-0.006
 there-0.006
 things-0.006
 here-0.006
Token was
Token ID373
Feature activation-1.33e-02

Pos logit contributions
ifying+0.005
attered+0.004
amy+0.004
rant+0.004
tr+0.004
Neg logit contributions
 friends-0.003
 there-0.003
 thinking-0.003
 here-0.003
 it-0.003
Token so
Token ID523
Feature activation+4.51e-03

Pos logit contributions
ifying+0.031
attered+0.025
tr+0.024
rant+0.024
amy+0.024
Neg logit contributions
 friends-0.018
 there-0.017
 things-0.016
 thinking-0.016
 here-0.016
Token curious
Token ID11040
Feature activation-7.03e-05

Pos logit contributions
 friends+0.004
 thinking+0.004
 there+0.004
 things+0.004
 here+0.004
Neg logit contributions
ifying-0.012
attered-0.010
tr-0.010
shaped-0.010
amy-0.009
Token mom
Token ID1995
Feature activation-1.56e-03

Pos logit contributions
ifying+0.016
tr+0.015
attered+0.014
amy+0.014
shaped+0.014
Neg logit contributions
 thinking-0.003
 here-0.003
 there-0.003
't-0.003
 it-0.003
Token where
Token ID810
Feature activation+1.60e-03

Pos logit contributions
ifying+0.004
rant+0.003
tr+0.003
attered+0.003
amy+0.003
Neg logit contributions
 friends-0.002
 thinking-0.002
 here-0.001
 there-0.001
 kids-0.001
Token a
Token ID257
Feature activation-8.60e-04

Pos logit contributions
 friends+0.001
 thinking+0.001
 here+0.001
 there+0.001
 kids+0.001
Neg logit contributions
ifying-0.005
attered-0.004
tr-0.004
amy-0.004
shaped-0.004
Token trip
Token ID5296
Feature activation+5.48e-03

Pos logit contributions
ifying+0.002
attered+0.002
amy+0.002
rant+0.002
tr+0.002
Neg logit contributions
 friends-0.001
 there-0.001
 thinking-0.001
 here-0.001
 it-0.001
Token to
Token ID284
Feature activation-1.75e-02

Pos logit contributions
 friends+0.007
 there+0.006
 here+0.006
 thinking+0.006
 it+0.006
Neg logit contributions
ifying-0.014
attered-0.012
amy-0.012
rant-0.011
ksh-0.011
Token an
Token ID281
Feature activation-3.96e-02

Pos logit contributions
ifying+0.039
attered+0.031
rant+0.029
amy+0.029
tr+0.029
Neg logit contributions
 friends-0.026
 there-0.025
 thinking-0.024
 here-0.024
 things-0.022
Token orange
Token ID10912
Feature activation-2.64e-02

Pos logit contributions
ifying+0.096
attered+0.081
amy+0.077
rant+0.076
tr+0.076
Neg logit contributions
 friends-0.053
 there-0.049
 thinking-0.048
 here-0.047
 things-0.044
Token place
Token ID1295
Feature activation-1.26e-03

Pos logit contributions
ifying+0.065
attered+0.054
tr+0.052
amy+0.051
rant+0.051
Neg logit contributions
 friends-0.033
 there-0.031
 thinking-0.031
 here-0.030
 it-0.027
Token could
Token ID714
Feature activation-5.34e-03

Pos logit contributions
ifying+0.003
attered+0.003
amy+0.003
rant+0.003
tr+0.003
Neg logit contributions
 friends-0.001
 there-0.001
 thinking-0.001
 here-0.001
 it-0.001
Token be
Token ID307
Feature activation-2.09e-03

Pos logit contributions
ifying+0.012
attered+0.010
tr+0.009
rant+0.009
amy+0.009
Neg logit contributions
 friends-0.008
 there-0.007
 thinking-0.007
 here-0.007
 things-0.007
Token found
Token ID1043
Feature activation-4.48e-03

Pos logit contributions
ifying+0.005
attered+0.004
amy+0.004
rant+0.004
tr+0.004
Neg logit contributions
 friends-0.003
 there-0.002
 thinking-0.002
 here-0.002
 it-0.002
Token here
Token ID994
Feature activation+1.55e-02

Pos logit contributions
 kids+0.011
 thinking+0.010
 friends+0.010
Next+0.010
't+0.010
Neg logit contributions
ifying-0.028
rant-0.022
attered-0.022
amy-0.022
ist-0.021
Token and
Token ID290
Feature activation-1.65e-02

Pos logit contributions
 thinking+0.017
 friends+0.016
 kids+0.015
 things+0.015
Next+0.014
Neg logit contributions
ifying-0.037
lier-0.029
rant-0.029
attered-0.029
work-0.028
Token explore
Token ID7301
Feature activation+2.24e-03

Pos logit contributions
ifying+0.040
attered+0.032
tr+0.031
rant+0.030
amy+0.030
Neg logit contributions
 friends-0.021
 thinking-0.020
 there-0.020
 here-0.019
 things-0.018
Token the
Token ID262
Feature activation-2.26e-03

Pos logit contributions
 friends+0.002
 thinking+0.002
 there+0.002
 here+0.002
 kids+0.002
Neg logit contributions
ifying-0.006
rant-0.005
attered-0.005
amy-0.005
lier-0.005
Token or
Token ID393
Feature activation-1.25e-02

Pos logit contributions
ifying+0.005
amy+0.004
tr+0.004
attered+0.004
rant+0.004
Neg logit contributions
 thinking-0.003
 friends-0.003
 there-0.002
't-0.002
 here-0.002
Tokennam
Token ID7402
Feature activation-3.93e-02

Pos logit contributions
ifying+0.033
attered+0.027
amy+0.026
rant+0.026
tr+0.026
Neg logit contributions
 friends-0.014
 thinking-0.012
 there-0.012
 here-0.012
 things-0.011
Tokenents
Token ID658
Feature activation+3.85e-03

Pos logit contributions
ifying+0.074
attered+0.058
rant+0.056
tr+0.056
amy+0.056
Neg logit contributions
 friends-0.073
 there-0.069
 here-0.067
 thinking-0.065
 upon-0.064
Token."
Token ID526
Feature activation-6.51e-03

Pos logit contributions
 friends+0.004
 thinking+0.004
 there+0.004
 here+0.003
 kids+0.003
Neg logit contributions
ifying-0.010
rant-0.008
attered-0.008
tr-0.008
lier-0.008
Token
Token ID198
Feature activation-7.06e-03

Pos logit contributions
ifying+0.019
attered+0.016
amy+0.016
rant+0.016
tr+0.016
Neg logit contributions
 friends-0.005
 thinking-0.004
 there-0.004
 here-0.004
 things-0.004
Token
Token ID198
Feature activation-7.68e-03

Pos logit contributions
ifying+0.022
attered+0.019
amy+0.019
tr+0.018
rant+0.018
Neg logit contributions
 thinking-0.004
 friends-0.004
 there-0.003
 here-0.003
 things-0.003
TokenDa
Token ID26531
Feature activation-2.33e-02

Pos logit contributions
ifying+0.017
attered+0.014
rant+0.013
amy+0.013
tr+0.013
Neg logit contributions
 friends-0.012
 there-0.011
 here-0.011
 thinking-0.011
 it-0.010
Token.
Token ID13
Feature activation-1.16e-03

Pos logit contributions
ifying+0.041
attered+0.035
amy+0.033
rant+0.033
tr+0.033
Neg logit contributions
 friends-0.019
 there-0.017
 thinking-0.017
 here-0.016
 it-0.015
Token
Token ID198
Feature activation-2.51e-03

Pos logit contributions
ifying+0.003
tr+0.003
attered+0.003
ksh+0.003
piece+0.003
Neg logit contributions
 thinking-0.001
 friends-0.001
 things-0.001
 here-0.001
 there-0.001
Token
Token ID198
Feature activation-1.53e-03

Pos logit contributions
ifying+0.007
tr+0.006
attered+0.006
amy+0.006
rant+0.006
Neg logit contributions
 thinking-0.002
 friends-0.002
 here-0.002
 there-0.002
 things-0.002
TokenFrom
Token ID4863
Feature activation-1.78e-02

Pos logit contributions
ifying+0.004
attered+0.003
amy+0.003
rant+0.003
tr+0.003
Neg logit contributions
 friends-0.002
 there-0.002
 thinking-0.002
 here-0.002
 things-0.002
Token that
Token ID326
Feature activation-1.26e-02

Pos logit contributions
ifying+0.040
rant+0.034
attered+0.034
amy+0.034
shaped+0.033
Neg logit contributions
 friends-0.022
 thinking-0.021
 kids-0.019
 upon-0.019
 there-0.018
Token day
Token ID1110
Feature activation-3.91e-02

Pos logit contributions
ifying+0.029
tr+0.026
attered+0.024
rant+0.024
amy+0.024
Neg logit contributions
 friends-0.013
 here-0.012
 there-0.012
 kids-0.011
 thinking-0.011
Token on
Token ID319
Feature activation-1.54e-03

Pos logit contributions
ifying+0.107
attered+0.091
amy+0.087
tr+0.087
rant+0.086
Neg logit contributions
 friends-0.039
 there-0.035
 thinking-0.034
 here-0.034
 things-0.031
Token,
Token ID11
Feature activation-1.04e-02

Pos logit contributions
ifying+0.004
attered+0.003
amy+0.003
tr+0.003
rant+0.003
Neg logit contributions
 thinking-0.001
 friends-0.001
 here-0.001
 upon-0.001
 kids-0.001
Token the
Token ID262
Feature activation+1.66e-03

Pos logit contributions
ifying+0.026
attered+0.021
amy+0.021
tr+0.021
lier+0.020
Neg logit contributions
 thinking-0.011
 friends-0.011
 here-0.011
 there-0.010
 kids-0.009
Token giant
Token ID6175
Feature activation-8.95e-03

Pos logit contributions
 friends+0.002
 there+0.002
 here+0.002
 thinking+0.002
 it+0.002
Neg logit contributions
ifying-0.004
attered-0.003
amy-0.003
tr-0.003
rant-0.003
Token and
Token ID290
Feature activation-3.39e-04

Pos logit contributions
ifying+0.019
attered+0.016
tr+0.015
amy+0.015
rant+0.015
Neg logit contributions
 friends-0.013
 there-0.013
 here-0.013
 thinking-0.013
 things-0.012
Token mom
Token ID1995
Feature activation+1.88e-04

Pos logit contributions
 there+0.003
 it+0.002
 here+0.002
 thinking+0.002
 upon+0.002
Neg logit contributions
ifying-0.015
tr-0.013
attered-0.013
amy-0.013
 unison-0.012
Token packed
Token ID11856
Feature activation+7.58e-03

Pos logit contributions
 friends+0.000
 things+0.000
 here+0.000
 thinking+0.000
 there+0.000
Neg logit contributions
ifying-0.000
attered-0.000
tr-0.000
rant-0.000
amy-0.000
Token a
Token ID257
Feature activation-1.72e-02

Pos logit contributions
 friends+0.008
 there+0.007
 thinking+0.007
 here+0.007
 things+0.006
Neg logit contributions
ifying-0.021
attered-0.018
amy-0.017
rant-0.017
tr-0.017
Token picnic
Token ID35715
Feature activation-1.26e-04

Pos logit contributions
ifying+0.043
attered+0.036
amy+0.035
rant+0.035
tr+0.034
Neg logit contributions
 friends-0.022
 there-0.021
 here-0.020
 thinking-0.019
 things-0.019
Token basket
Token ID7988
Feature activation-9.57e-03

Pos logit contributions
ifying+0.000
attered+0.000
amy+0.000
tr+0.000
rant+0.000
Neg logit contributions
 friends-0.000
 there-0.000
 thinking-0.000
 here-0.000
 it-0.000
Token full
Token ID1336
Feature activation-3.89e-02

Pos logit contributions
ifying+0.029
attered+0.026
amy+0.025
rant+0.025
tr+0.024
Neg logit contributions
 friends-0.006
 there-0.006
 thinking-0.005
 here-0.005
 it-0.004
Token of
Token ID286
Feature activation-2.39e-02

Pos logit contributions
ifying+0.116
attered+0.101
amy+0.097
tr+0.097
rant+0.097
Neg logit contributions
 friends-0.033
 there-0.030
 here-0.027
 thinking-0.026
 it-0.025
Token y
Token ID331
Feature activation-1.13e-02

Pos logit contributions
ifying+0.048
amy+0.038
attered+0.038
shaped+0.035
lier+0.035
Neg logit contributions
 thinking-0.035
't-0.033
 here-0.032
 upon-0.032
 there-0.032
Tokenummy
Token ID13513
Feature activation-9.30e-03

Pos logit contributions
ifying+0.029
tr+0.025
attered+0.024
rant+0.024
amy+0.023
Neg logit contributions
 friends-0.014
 there-0.012
 here-0.012
 things-0.012
 thinking-0.012
Token food
Token ID2057
Feature activation+7.15e-04

Pos logit contributions
ifying+0.021
attered+0.017
amy+0.017
rant+0.016
lier+0.016
Neg logit contributions
 thinking-0.012
 friends-0.012
 there-0.012
 here-0.012
 upon-0.011
Token.
Token ID13
Feature activation+3.36e-03

Pos logit contributions
 friends+0.001
 thinking+0.001
 there+0.001
 here+0.001
 kids+0.001
Neg logit contributions
ifying-0.002
attered-0.002
tr-0.002
rant-0.002
amy-0.002
Token her
Token ID607
Feature activation-4.35e-05

Pos logit contributions
 friends+0.004
 thinking+0.003
 here+0.003
 there+0.003
 things+0.003
Neg logit contributions
ifying-0.009
attered-0.008
amy-0.007
rant-0.007
lier-0.007
Token to
Token ID284
Feature activation-1.02e-02

Pos logit contributions
ifying+0.000
tr+0.000
attered+0.000
shaped+0.000
bug+0.000
Neg logit contributions
 friends-0.000
 thinking-0.000
 here-0.000
 there-0.000
 kids-0.000
Token go
Token ID467
Feature activation+2.90e-03

Pos logit contributions
ifying+0.025
attered+0.022
amy+0.020
rant+0.020
tr+0.020
Neg logit contributions
 friends-0.013
 there-0.012
 here-0.012
 thinking-0.012
 it-0.011
Token and
Token ID290
Feature activation-2.49e-03

Pos logit contributions
 friends+0.004
 thinking+0.003
 kids+0.003
 there+0.003
 things+0.003
Neg logit contributions
ifying-0.007
attered-0.006
rant-0.005
amy-0.005
tr-0.005
Token mail
Token ID6920
Feature activation+9.24e-03

Pos logit contributions
ifying+0.006
attered+0.005
tr+0.005
amy+0.005
rant+0.005
Neg logit contributions
 friends-0.003
 thinking-0.003
 there-0.003
 here-0.003
 things-0.003
Token an
Token ID281
Feature activation-3.89e-02

Pos logit contributions
 friends+0.009
 there+0.008
 thinking+0.007
 here+0.007
 it+0.007
Neg logit contributions
ifying-0.026
attered-0.023
amy-0.022
rant-0.022
tr-0.021
Token important
Token ID1593
Feature activation+6.31e-03

Pos logit contributions
ifying+0.086
attered+0.068
amy+0.067
rant+0.064
ksh+0.063
Neg logit contributions
 friends-0.066
 there-0.061
 here-0.061
 thinking-0.059
 things-0.057
Token letter
Token ID3850
Feature activation-8.35e-03

Pos logit contributions
 friends+0.010
 there+0.009
 thinking+0.009
 here+0.009
 it+0.008
Neg logit contributions
ifying-0.014
attered-0.011
tr-0.011
amy-0.011
rant-0.011
Token.
Token ID13
Feature activation+2.28e-03

Pos logit contributions
ifying+0.022
attered+0.019
tr+0.018
rant+0.018
amy+0.018
Neg logit contributions
 friends-0.009
 thinking-0.008
 there-0.008
 here-0.008
 things-0.007
Token So
Token ID1406
Feature activation-1.40e-02

Pos logit contributions
 friends+0.003
 there+0.003
 thinking+0.002
 here+0.002
 things+0.002
Neg logit contributions
ifying-0.006
attered-0.005
amy-0.005
tr-0.005
rant-0.005
Token Sally
Token ID25737
Feature activation-4.70e-03

Pos logit contributions
ifying+0.039
attered+0.034
amy+0.032
rant+0.032
tr+0.031
Neg logit contributions
 friends-0.011
 thinking-0.010
 here-0.009
 there-0.009
 kids-0.008
Token they
Token ID484
Feature activation-4.31e-03

Pos logit contributions
 friends+0.002
 thinking+0.002
 things+0.002
 here+0.002
 there+0.002
Neg logit contributions
ifying-0.006
attered-0.005
amy-0.005
tr-0.005
nam-0.005
Token brought
Token ID3181
Feature activation+5.25e-04

Pos logit contributions
ifying+0.009
attered+0.007
amy+0.007
rant+0.007
tr+0.007
Neg logit contributions
 friends-0.007
 here-0.006
 thinking-0.006
 there-0.006
 things-0.006
Token a
Token ID257
Feature activation-5.37e-03

Pos logit contributions
 friends+0.001
 there+0.000
 here+0.000
 thinking+0.000
 it+0.000
Neg logit contributions
ifying-0.002
attered-0.001
rant-0.001
amy-0.001
tr-0.001
Token picnic
Token ID35715
Feature activation-2.59e-03

Pos logit contributions
ifying+0.015
attered+0.013
amy+0.012
rant+0.012
tr+0.012
Neg logit contributions
 friends-0.005
 there-0.005
 thinking-0.005
 here-0.005
 things-0.004
Token basket
Token ID7988
Feature activation-8.93e-03

Pos logit contributions
ifying+0.007
attered+0.006
amy+0.006
rant+0.006
tr+0.006
Neg logit contributions
 friends-0.003
 there-0.003
 thinking-0.003
 here-0.003
 things-0.002
Token full
Token ID1336
Feature activation-3.85e-02

Pos logit contributions
ifying+0.025
attered+0.022
amy+0.021
rant+0.021
tr+0.021
Neg logit contributions
 friends-0.008
 there-0.008
 here-0.007
 thinking-0.007
 it-0.007
Token of
Token ID286
Feature activation-2.81e-02

Pos logit contributions
ifying+0.121
attered+0.107
rant+0.105
tr+0.105
amy+0.105
Neg logit contributions
 friends-0.029
 there-0.024
 here-0.020
 kids-0.019
 it-0.018
Token y
Token ID331
Feature activation-1.22e-02

Pos logit contributions
ifying+0.056
attered+0.045
amy+0.045
shaped+0.041
rant+0.041
Neg logit contributions
 thinking-0.043
 here-0.039
 friends-0.039
 there-0.038
 upon-0.038
Tokenummy
Token ID13513
Feature activation-1.48e-02

Pos logit contributions
ifying+0.032
tr+0.027
attered+0.027
rant+0.026
amy+0.025
Neg logit contributions
 friends-0.015
 there-0.013
 things-0.012
 here-0.012
 thinking-0.012
Token snacks
Token ID28487
Feature activation-4.17e-03

Pos logit contributions
ifying+0.035
attered+0.029
amy+0.028
rant+0.027
tr+0.027
Neg logit contributions
 friends-0.019
 thinking-0.019
 there-0.018
 here-0.018
 it-0.016
Token.
Token ID13
Feature activation-7.40e-03

Pos logit contributions
ifying+0.011
attered+0.009
amy+0.009
rant+0.009
tr+0.009
Neg logit contributions
 friends-0.005
 there-0.004
 thinking-0.004
 here-0.004
 things-0.004
Token so
Token ID523
Feature activation-9.50e-03

Pos logit contributions
 friends+0.013
 there+0.012
 thinking+0.012
 here+0.012
 kids+0.011
Neg logit contributions
ifying-0.027
attered-0.022
rant-0.022
tr-0.021
amy-0.021
Token that
Token ID326
Feature activation+6.91e-03

Pos logit contributions
ifying+0.029
attered+0.025
rant+0.023
tr+0.023
amy+0.023
Neg logit contributions
 friends-0.007
 thinking-0.006
 there-0.005
 here-0.005
 kids-0.005
Token no
Token ID645
Feature activation-5.42e-03

Pos logit contributions
 friends+0.005
 thinking+0.005
 here+0.005
 kids+0.004
 there+0.004
Neg logit contributions
ifying-0.019
attered-0.016
tr-0.016
rant-0.015
lier-0.015
Token more
Token ID517
Feature activation-1.03e-02

Pos logit contributions
ifying+0.015
attered+0.013
tr+0.012
rant+0.012
amy+0.012
Neg logit contributions
 friends-0.005
 there-0.004
 here-0.004
 thinking-0.004
 upon-0.004
Token mis
Token ID2984
Feature activation-1.56e-02

Pos logit contributions
ifying+0.026
attered+0.021
tr+0.020
lier+0.020
rant+0.020
Neg logit contributions
 friends-0.011
 there-0.010
 here-0.010
 thinking-0.010
 upon-0.010
Tokenchie
Token ID3043
Feature activation-3.83e-02

Pos logit contributions
ifying+0.025
tr+0.019
rant+0.019
attered+0.018
amy+0.017
Neg logit contributions
 friends-0.033
 there-0.031
 here-0.031
 kids-0.030
 thinking-0.029
Tokenvous
Token ID31222
Feature activation+6.98e-03

Pos logit contributions
ifying+0.067
attered+0.050
amy+0.049
rant+0.047
ksh+0.044
Neg logit contributions
 friends-0.079
 there-0.077
 here-0.073
 thinking-0.072
 it-0.071
Token mice
Token ID10693
Feature activation-8.30e-03

Pos logit contributions
 friends+0.007
 there+0.007
 here+0.007
 upon+0.007
 thinking+0.007
Neg logit contributions
ifying-0.017
attered-0.014
tr-0.014
lier-0.013
amy-0.013
Token could
Token ID714
Feature activation+7.86e-03

Pos logit contributions
ifying+0.018
attered+0.014
rant+0.013
tr+0.013
amy+0.013
Neg logit contributions
 friends-0.013
 thinking-0.012
 there-0.012
 here-0.011
 things-0.011
Token come
Token ID1282
Feature activation+8.22e-03

Pos logit contributions
 friends+0.011
 thinking+0.010
 there+0.010
 here+0.010
 things+0.010
Neg logit contributions
ifying-0.017
tr-0.014
attered-0.014
rant-0.014
shaped-0.013
Token in
Token ID287
Feature activation+9.74e-03

Pos logit contributions
 friends+0.011
 thinking+0.010
 there+0.010
 kids+0.009
 here+0.009
Neg logit contributions
ifying-0.020
attered-0.016
tr-0.015
rant-0.015
amy-0.015
Token for
Token ID329
Feature activation-8.43e-03

Pos logit contributions
 friends+0.005
 thinking+0.005
 upon+0.004
Next+0.004
"+0.004
Neg logit contributions
ifying-0.011
tr-0.009
work-0.009
rant-0.009
amy-0.009
Token now
Token ID783
Feature activation+8.00e-04

Pos logit contributions
ifying+0.020
rant+0.016
 unison+0.016
work+0.015
tr+0.015
Neg logit contributions
 friends-0.009
 thinking-0.008
 upon-0.008
 there-0.008
 man-0.008
Token,
Token ID11
Feature activation-1.41e-02

Pos logit contributions
 upon+0.001
 friends+0.001
 thinking+0.001
Next+0.001
 there+0.001
Neg logit contributions
ifying-0.002
lier-0.002
amy-0.002
rant-0.002
attered-0.002
Token you
Token ID345
Feature activation+1.71e-03

Pos logit contributions
ifying+0.034
tr+0.026
 unison+0.026
rant+0.025
amy+0.025
Neg logit contributions
 friends-0.014
 thinking-0.014
 upon-0.013
Next-0.012
't-0.012
Token can
Token ID460
Feature activation-7.29e-04

Pos logit contributions
 thinking+0.002
 friends+0.002
 things+0.002
Next+0.002
 there+0.002
Neg logit contributions
ifying-0.004
tr-0.003
rant-0.003
attered-0.003
lier-0.003
Token enjoy
Token ID2883
Feature activation+2.39e-02

Pos logit contributions
ifying+0.002
rant+0.001
tr+0.001
attered+0.001
shaped+0.001
Neg logit contributions
 friends-0.001
 thinking-0.001
 there-0.001
 things-0.001
 here-0.001
Token their
Token ID511
Feature activation-6.91e-03

Pos logit contributions
 friends+0.025
 thinking+0.024
 man+0.023
ittens+0.022
 kids+0.022
Neg logit contributions
ifying-0.052
rant-0.042
amy-0.040
work-0.040
nam-0.039
Token gentle
Token ID10296
Feature activation-7.76e-03

Pos logit contributions
ifying+0.016
attered+0.013
tr+0.012
amy+0.012
rant+0.012
Neg logit contributions
 thinking-0.009
 friends-0.009
 there-0.008
 here-0.008
 it-0.008
Token light
Token ID1657
Feature activation-9.17e-04

Pos logit contributions
ifying+0.019
attered+0.015
tr+0.015
amy+0.015
rant+0.015
Neg logit contributions
 friends-0.009
 there-0.009
 thinking-0.009
 here-0.009
 it-0.008
Token and
Token ID290
Feature activation-4.72e-03

Pos logit contributions
ifying+0.002
attered+0.002
rant+0.002
tr+0.002
amy+0.002
Neg logit contributions
 friends-0.001
 thinking-0.001
 there-0.001
 here-0.001
 kids-0.001
Token wonder
Token ID4240
Feature activation+1.78e-03

Pos logit contributions
ifying+0.012
attered+0.009
rant+0.009
tr+0.009
amy+0.009
Neg logit contributions
 friends-0.006
 there-0.005
 thinking-0.005
 here-0.005
 kids-0.005
Token playing
Token ID2712
Feature activation+1.78e-03

Pos logit contributions
ifying+0.014
attered+0.012
amy+0.012
tr+0.011
rant+0.011
Neg logit contributions
 it-0.005
 things-0.005
 thinking-0.004
 there-0.004
 man-0.004
Token in
Token ID287
Feature activation+4.55e-03

Pos logit contributions
 friends+0.002
 there+0.002
 thinking+0.002
 here+0.002
 it+0.002
Neg logit contributions
ifying-0.004
attered-0.004
amy-0.003
rant-0.003
tr-0.003
Token the
Token ID262
Feature activation-4.76e-03

Pos logit contributions
 friends+0.005
 thinking+0.005
 here+0.004
 there+0.004
 sure+0.004
Neg logit contributions
ifying-0.011
amy-0.009
attered-0.009
shaped-0.009
rant-0.009
Token bathroom
Token ID12436
Feature activation+1.52e-02

Pos logit contributions
ifying+0.013
attered+0.011
rant+0.011
amy+0.011
tr+0.011
Neg logit contributions
 friends-0.005
 there-0.005
 here-0.004
 thinking-0.004
 things-0.004
Token.
Token ID13
Feature activation+4.90e-03

Pos logit contributions
 friends+0.018
 there+0.016
 thinking+0.016
 here+0.016
 it+0.015
Neg logit contributions
ifying-0.038
attered-0.033
amy-0.031
rant-0.031
tr-0.031
Token They
Token ID1119
Feature activation+2.45e-02

Pos logit contributions
 friends+0.007
 there+0.006
 thinking+0.006
 here+0.006
 things+0.006
Neg logit contributions
ifying-0.012
attered-0.010
amy-0.009
rant-0.009
tr-0.009
Token saw
Token ID2497
Feature activation-8.39e-03

Pos logit contributions
 friends+0.038
 there+0.035
 thinking+0.035
 here+0.035
 things+0.033
Neg logit contributions
ifying-0.053
attered-0.044
amy-0.041
rant-0.041
tr-0.040
Token a
Token ID257
Feature activation+8.42e-03

Pos logit contributions
ifying+0.026
attered+0.022
rant+0.022
ksh+0.022
tr+0.022
Neg logit contributions
 friends-0.007
 there-0.007
 it-0.006
 things-0.006
 kids-0.006
Token shiny
Token ID22441
Feature activation-4.21e-03

Pos logit contributions
 friends+0.008
 there+0.008
 here+0.007
 thinking+0.007
 things+0.006
Neg logit contributions
ifying-0.024
attered-0.021
rant-0.020
amy-0.020
shaped-0.019
Token thing
Token ID1517
Feature activation-2.05e-02

Pos logit contributions
ifying+0.011
attered+0.009
amy+0.009
rant+0.008
ksh+0.008
Neg logit contributions
 friends-0.005
 there-0.005
 here-0.005
 thinking-0.005
 things-0.005
Token on
Token ID319
Feature activation-1.36e-02

Pos logit contributions
ifying+0.056
attered+0.048
amy+0.046
ksh+0.045
rant+0.045
Neg logit contributions
 there-0.022
 friends-0.022
 here-0.020
 it-0.019
 thinking-0.019
Token.
Token ID13
Feature activation-6.06e-05

Pos logit contributions
ifying+0.010
attered+0.008
tr+0.008
rant+0.008
lier+0.008
Neg logit contributions
 friends-0.005
 thinking-0.005
 there-0.005
 here-0.005
 upon-0.005
Token His
Token ID2399
Feature activation+6.20e-03

Pos logit contributions
ifying+0.000
attered+0.000
tr+0.000
amy+0.000
rant+0.000
Neg logit contributions
 friends-0.000
 there-0.000
 here-0.000
 thinking-0.000
 things-0.000
Token mom
Token ID1995
Feature activation+2.52e-03

Pos logit contributions
 there+0.004
 here+0.004
 thinking+0.004
 friends+0.003
 it+0.003
Neg logit contributions
ifying-0.018
attered-0.016
tr-0.016
amy-0.015
shaped-0.015
Token saw
Token ID2497
Feature activation-9.55e-03

Pos logit contributions
 friends+0.004
 things+0.003
 here+0.003
 thinking+0.003
 there+0.003
Neg logit contributions
ifying-0.005
attered-0.004
rant-0.004
amy-0.004
tr-0.004
Token this
Token ID428
Feature activation+6.13e-03

Pos logit contributions
ifying+0.021
rant+0.017
amy+0.017
shaped+0.017
attered+0.017
Neg logit contributions
 friends-0.013
 thinking-0.011
 friend-0.010
 here-0.010
 there-0.010
Token and
Token ID290
Feature activation+2.45e-02

Pos logit contributions
 friends+0.004
 there+0.003
 thinking+0.003
 here+0.003
 it+0.002
Neg logit contributions
ifying-0.019
attered-0.017
tr-0.016
amy-0.016
rant-0.016
Token wanted
Token ID2227
Feature activation+2.94e-03

Pos logit contributions
 friends+0.026
 here+0.025
 thinking+0.024
 there+0.024
 things+0.023
Neg logit contributions
ifying-0.060
attered-0.053
amy-0.050
tr-0.049
rant-0.049
Token to
Token ID284
Feature activation+8.85e-03

Pos logit contributions
 friends+0.003
 there+0.003
 thinking+0.003
 here+0.003
 upon+0.002
Neg logit contributions
ifying-0.008
attered-0.006
amy-0.006
rant-0.006
shaped-0.006
Token help
Token ID1037
Feature activation+7.07e-03

Pos logit contributions
 friends+0.010
 there+0.009
 thinking+0.008
 here+0.008
 it+0.008
Neg logit contributions
ifying-0.024
attered-0.021
amy-0.020
rant-0.020
tr-0.019
Token.
Token ID13
Feature activation+9.72e-03

Pos logit contributions
 friends+0.008
 there+0.007
 thinking+0.007
 here+0.007
 upon+0.006
Neg logit contributions
ifying-0.018
attered-0.015
amy-0.015
rant-0.014
tr-0.014
Token She
Token ID1375
Feature activation+6.42e-03

Pos logit contributions
 friends+0.014
 thinking+0.013
 here+0.013
 there+0.013
 things+0.012
Neg logit contributions
ifying-0.021
attered-0.018
tr-0.017
amy-0.017
ksh-0.016
Token to
Token ID284
Feature activation+8.14e-03

Pos logit contributions
 friends+0.012
 thinking+0.011
 there+0.010
 kids+0.010
 here+0.010
Neg logit contributions
ifying-0.025
shaped-0.021
rant-0.021
attered-0.021
amy-0.020
Token work
Token ID670
Feature activation+1.79e-02

Pos logit contributions
 friends+0.012
 there+0.011
 thinking+0.011
 here+0.010
 things+0.010
Neg logit contributions
ifying-0.019
attered-0.016
amy-0.015
rant-0.015
tr-0.015
Token,
Token ID11
Feature activation+1.19e-02

Pos logit contributions
 friends+0.020
 thinking+0.017
 there+0.017
 kids+0.017
 here+0.017
Neg logit contributions
ifying-0.046
attered-0.038
rant-0.038
amy-0.037
tr-0.037
Token building
Token ID2615
Feature activation+2.39e-02

Pos logit contributions
 friends+0.013
 there+0.011
 here+0.011
 kids+0.010
 things+0.010
Neg logit contributions
ifying-0.031
attered-0.026
rant-0.025
amy-0.025
tr-0.025
Token and
Token ID290
Feature activation+1.14e-02

Pos logit contributions
 friends+0.024
 thinking+0.022
 here+0.021
 there+0.021
 kids+0.019
Neg logit contributions
ifying-0.063
attered-0.054
rant-0.052
amy-0.051
tr-0.051
Token patch
Token ID8529
Feature activation+2.37e-02

Pos logit contributions
 friends+0.016
 there+0.015
 here+0.015
 thinking+0.015
 things+0.014
Neg logit contributions
ifying-0.026
attered-0.021
amy-0.020
rant-0.020
tr-0.020
Tokening
Token ID278
Feature activation+1.16e-02

Pos logit contributions
 friends+0.036
 thinking+0.034
 there+0.033
 here+0.032
 kids+0.031
Neg logit contributions
ifying-0.052
attered-0.043
amy-0.040
rant-0.040
tr-0.040
Token,
Token ID11
Feature activation+1.64e-02

Pos logit contributions
 friends+0.014
 there+0.013
 thinking+0.013
 here+0.013
 it+0.012
Neg logit contributions
ifying-0.029
attered-0.025
amy-0.024
rant-0.023
tr-0.023
Token fixing
Token ID18682
Feature activation+1.09e-02

Pos logit contributions
 friends+0.018
 here+0.016
 there+0.016
 kids+0.015
 thinking+0.015
Neg logit contributions
ifying-0.040
attered-0.034
tr-0.032
rant-0.032
amy-0.032
Token the
Token ID262
Feature activation+3.31e-03

Pos logit contributions
 friends+0.011
 there+0.010
 thinking+0.010
 here+0.009
 it+0.008
Neg logit contributions
ifying-0.030
attered-0.026
amy-0.025
rant-0.025
tr-0.024
Token well
Token ID880
Feature activation+3.86e-03

Pos logit contributions
 friends+0.005
 there+0.004
 thinking+0.004
 here+0.004
 things+0.004
Neg logit contributions
ifying-0.008
attered-0.007
amy-0.006
rant-0.006
tr-0.006
Token.
Token ID13
Feature activation+8.83e-03

Pos logit contributions
 friends+0.002
 here+0.002
 thinking+0.001
 there+0.001
 upon+0.001
Neg logit contributions
ifying-0.006
attered-0.005
lier-0.005
rant-0.005
 unison-0.005
Token They
Token ID1119
Feature activation+5.81e-03

Pos logit contributions
 friends+0.014
 there+0.013
 thinking+0.013
 here+0.013
 things+0.012
Neg logit contributions
ifying-0.019
attered-0.016
tr-0.014
amy-0.014
rant-0.014
Token did
Token ID750
Feature activation+9.83e-03

Pos logit contributions
 friends+0.007
 there+0.007
 here+0.007
 thinking+0.007
 things+0.006
Neg logit contributions
ifying-0.014
attered-0.012
rant-0.011
amy-0.011
tr-0.011
Token not
Token ID407
Feature activation-1.51e-03

Pos logit contributions
 friends+0.009
 there+0.008
 kids+0.008
 here+0.008
 thinking+0.008
Neg logit contributions
ifying-0.025
 unison-0.021
rant-0.020
lier-0.020
shaped-0.020
Token know
Token ID760
Feature activation-7.16e-03

Pos logit contributions
ifying+0.003
rant+0.003
attered+0.003
lier+0.003
amy+0.003
Neg logit contributions
 friends-0.002
 there-0.002
 here-0.002
 things-0.002
 thinking-0.002
Token if
Token ID611
Feature activation+2.37e-02

Pos logit contributions
ifying+0.018
attered+0.015
rant+0.014
amy+0.014
lier+0.014
Neg logit contributions
 friends-0.008
 thinking-0.007
 here-0.007
 there-0.007
 upon-0.006
Token it
Token ID340
Feature activation-2.69e-04

Pos logit contributions
 friends+0.021
 thinking+0.020
 here+0.019
 there+0.018
 upon+0.017
Neg logit contributions
ifying-0.061
attered-0.050
amy-0.049
rant-0.048
shaped-0.048
Token was
Token ID373
Feature activation-4.05e-03

Pos logit contributions
ifying+0.001
attered+0.001
amy+0.001
rant+0.001
tr+0.001
Neg logit contributions
 friends-0.000
 there-0.000
 thinking-0.000
 here-0.000
 things-0.000
Token a
Token ID257
Feature activation+1.01e-02

Pos logit contributions
ifying+0.011
attered+0.009
rant+0.009
amy+0.008
tr+0.008
Neg logit contributions
 friends-0.004
 there-0.004
 thinking-0.004
 here-0.004
 kids-0.003
Token good
Token ID922
Feature activation+4.18e-03

Pos logit contributions
 friends+0.012
 there+0.012
 thinking+0.011
 here+0.011
 it+0.010
Neg logit contributions
ifying-0.025
attered-0.021
tr-0.020
amy-0.020
rant-0.020
Token idea
Token ID2126
Feature activation-1.27e-02

Pos logit contributions
 friends+0.003
 there+0.003
 here+0.003
 thinking+0.003
 upon+0.003
Neg logit contributions
ifying-0.011
attered-0.010
tr-0.009
rant-0.009
lier-0.009
Token to
Token ID284
Feature activation+3.80e-03

Pos logit contributions
 friends+0.005
 thinking+0.005
 there+0.005
 here+0.004
 kids+0.004
Neg logit contributions
ifying-0.015
attered-0.013
amy-0.012
tr-0.012
rant-0.012
Token his
Token ID465
Feature activation-2.48e-04

Pos logit contributions
 friends+0.004
 thinking+0.004
 there+0.004
 upon+0.004
 things+0.004
Neg logit contributions
ifying-0.008
 unison-0.007
lier-0.006
amy-0.006
rant-0.006
Token neighbor
Token ID4780
Feature activation-2.42e-03

Pos logit contributions
ifying+0.001
attered+0.001
tr+0.001
amy+0.001
 unison+0.001
Neg logit contributions
 thinking-0.000
 there-0.000
 here-0.000
 it-0.000
Next-0.000
Token,
Token ID11
Feature activation+5.34e-03

Pos logit contributions
ifying+0.006
attered+0.005
amy+0.005
tr+0.005
rant+0.005
Neg logit contributions
 friends-0.002
 thinking-0.002
 there-0.002
 here-0.002
 things-0.002
Token Mrs
Token ID9074
Feature activation+6.94e-04

Pos logit contributions
 friends+0.006
 thinking+0.006
 there+0.006
 here+0.006
 things+0.005
Neg logit contributions
ifying-0.013
attered-0.011
tr-0.010
amy-0.010
rant-0.010
Token.
Token ID13
Feature activation+1.24e-02

Pos logit contributions
 friends+0.001
 there+0.001
 thinking+0.001
 here+0.001
 things+0.001
Neg logit contributions
ifying-0.002
attered-0.002
amy-0.002
tr-0.002
rant-0.002
Token Smith
Token ID4176
Feature activation+3.18e-03

Pos logit contributions
 friends+0.024
 things+0.022
 here+0.021
 there+0.021
 thinking+0.021
Neg logit contributions
ifying-0.023
tr-0.017
amy-0.016
ksh-0.016
rant-0.015
Token.
Token ID13
Feature activation+6.77e-03

Pos logit contributions
 friends+0.004
 there+0.004
 thinking+0.004
 here+0.003
 things+0.003
Neg logit contributions
ifying-0.008
attered-0.007
amy-0.006
rant-0.006
tr-0.006
Token Jack
Token ID3619
Feature activation+1.84e-03

Pos logit contributions
 friends+0.008
 things+0.007
 thinking+0.007
 there+0.007
 here+0.007
Neg logit contributions
ifying-0.017
attered-0.014
tr-0.014
rant-0.013
ksh-0.013
Token realized
Token ID6939
Feature activation-1.90e-03

Pos logit contributions
 friends+0.002
 there+0.002
 thinking+0.002
 here+0.002
 it+0.002
Neg logit contributions
ifying-0.004
attered-0.004
amy-0.004
rant-0.004
tr-0.003
Token that
Token ID326
Feature activation-5.07e-03

Pos logit contributions
ifying+0.005
attered+0.004
lier+0.004
rant+0.004
tr+0.004
Neg logit contributions
 friends-0.002
 thinking-0.002
 here-0.001
 upon-0.001
 there-0.001
Token catch
Token ID4929
Feature activation+9.41e-03

Pos logit contributions
 friends+0.001
 there+0.001
 thinking+0.001
 here+0.001
 things+0.001
Neg logit contributions
ifying-0.002
attered-0.001
amy-0.001
rant-0.001
tr-0.001
Token the
Token ID262
Feature activation-4.92e-03

Pos logit contributions
 friends+0.008
 thinking+0.008
 there+0.008
 here+0.007
 upon+0.006
Neg logit contributions
ifying-0.026
attered-0.022
amy-0.022
rant-0.021
shaped-0.021
Token wind
Token ID2344
Feature activation-3.05e-03

Pos logit contributions
ifying+0.011
attered+0.009
tr+0.009
amy+0.008
shaped+0.008
Neg logit contributions
 friends-0.007
 thinking-0.007
 there-0.007
 here-0.006
 upon-0.006
Token.
Token ID13
Feature activation+6.43e-03

Pos logit contributions
ifying+0.008
attered+0.007
rant+0.007
tr+0.007
amy+0.007
Neg logit contributions
 friends-0.003
 there-0.003
 thinking-0.003
 here-0.003
 things-0.003
Token Together
Token ID17083
Feature activation-2.62e-03

Pos logit contributions
 friends+0.008
 thinking+0.008
 things+0.008
 here+0.008
 there+0.007
Neg logit contributions
ifying-0.015
attered-0.012
tr-0.012
ksh-0.012
lier-0.012
Token,
Token ID11
Feature activation+7.76e-03

Pos logit contributions
ifying+0.007
attered+0.006
amy+0.006
tr+0.006
rant+0.006
Neg logit contributions
 friends-0.002
 here-0.002
 thinking-0.002
 there-0.002
 upon-0.002
Token they
Token ID484
Feature activation-7.71e-04

Pos logit contributions
 friends+0.008
 thinking+0.008
 here+0.007
 upon+0.007
 things+0.006
Neg logit contributions
ifying-0.019
amy-0.015
shaped-0.015
attered-0.015
lier-0.015
Token were
Token ID547
Feature activation+3.32e-03

Pos logit contributions
ifying+0.002
attered+0.001
amy+0.001
rant+0.001
tr+0.001
Neg logit contributions
 friends-0.001
 there-0.001
 thinking-0.001
 here-0.001
 it-0.001
Token able
Token ID1498
Feature activation+2.11e-03

Pos logit contributions
 friends+0.004
 there+0.004
 thinking+0.004
 here+0.004
 things+0.004
Neg logit contributions
ifying-0.007
attered-0.006
amy-0.006
rant-0.006
lier-0.006
Token to
Token ID284
Feature activation-6.51e-04

Pos logit contributions
 friends+0.002
 thinking+0.002
 here+0.002
 there+0.002
 kids+0.002
Neg logit contributions
ifying-0.005
attered-0.004
shaped-0.004
amy-0.004
lier-0.004
Token catch
Token ID4929
Feature activation+9.51e-03

Pos logit contributions
ifying+0.002
attered+0.001
amy+0.001
rant+0.001
tr+0.001
Neg logit contributions
 friends-0.001
 there-0.001
 thinking-0.001
 here-0.001
 it-0.001
Token\n
Token ID198
Feature activation+4.39e-03

Pos logit contributions
ifying+0.003
attered+0.002
amy+0.002
rant+0.002
tr+0.002
Neg logit contributions
 friends-0.001
 there-0.001
 thinking-0.001
 here-0.001
 things-0.001
Token\n
Token ID198
Feature activation+1.50e-03

Pos logit contributions
 thinking+0.005
 friends+0.004
 things+0.004
 here+0.004
 kids+0.004
Neg logit contributions
ifying-0.010
attered-0.009
amy-0.009
piece-0.009
rant-0.008
Token"
Token ID1
Feature activation+4.39e-03

Pos logit contributions
 friends+0.002
 thinking+0.002
 here+0.002
 there+0.002
 things+0.002
Neg logit contributions
ifying-0.004
attered-0.003
amy-0.003
rant-0.003
shaped-0.003
TokenWhat
Token ID2061
Feature activation-5.46e-03

Pos logit contributions
 friends+0.007
 there+0.006
 thinking+0.006
 here+0.006
 things+0.006
Neg logit contributions
ifying-0.010
attered-0.008
amy-0.008
rant-0.008
tr-0.008
Token are
Token ID389
Feature activation-2.88e-03

Pos logit contributions
ifying+0.013
attered+0.012
rant+0.011
amy+0.011
ksh+0.011
Neg logit contributions
 friends-0.008
 there-0.008
 here-0.007
 it-0.007
 things-0.007
Token they
Token ID484
Feature activation+7.03e-03

Pos logit contributions
ifying+0.008
attered+0.007
amy+0.007
rant+0.007
tr+0.007
Neg logit contributions
 friends-0.002
 thinking-0.002
 there-0.002
 here-0.002
 kids-0.002
Token M
Token ID337
Feature activation+9.17e-03

Pos logit contributions
 friends+0.008
Next+0.007
 things+0.007
 mom+0.007
 it+0.007
Neg logit contributions
ifying-0.015
rant-0.012
work-0.012
attered-0.011
amy-0.011
Tokenummy
Token ID13513
Feature activation+3.75e-03

Pos logit contributions
 there+0.016
 friends+0.016
 it+0.015
 thinking+0.015
 here+0.015
Neg logit contributions
ifying-0.017
amy-0.014
attered-0.014
rant-0.013
ksh-0.013
Token?"
Token ID1701
Feature activation-5.44e-03

Pos logit contributions
 friends+0.005
 thinking+0.004
 things+0.004
 it+0.004
Next+0.004
Neg logit contributions
ifying-0.008
attered-0.007
rant-0.007
tr-0.007
work-0.007
Token asked
Token ID1965
Feature activation-7.18e-03

Pos logit contributions
ifying+0.012
attered+0.010
amy+0.010
rant+0.009
piece+0.009
Neg logit contributions
 friends-0.006
 here-0.006
 thinking-0.006
 things-0.006
 upon-0.005
Token Joe
Token ID5689
Feature activation-6.06e-03

Pos logit contributions
ifying+0.018
attered+0.015
amy+0.014
rant+0.014
tr+0.014
Neg logit contributions
 friends-0.009
 thinking-0.008
 here-0.008
 there-0.008
 things-0.007
Token she
Token ID673
Feature activation+9.63e-03

Pos logit contributions
ifying+0.007
attered+0.006
tr+0.006
amy+0.006
shaped+0.006
Neg logit contributions
 friends-0.003
 thinking-0.002
 here-0.002
 there-0.002
 kids-0.002
Token went
Token ID1816
Feature activation-2.32e-03

Pos logit contributions
 friends+0.013
 there+0.013
 here+0.012
 thinking+0.012
 it+0.011
Neg logit contributions
ifying-0.024
attered-0.019
rant-0.019
amy-0.019
ksh-0.018
Token to
Token ID284
Feature activation+1.82e-03

Pos logit contributions
ifying+0.006
attered+0.005
tr+0.005
amy+0.005
rant+0.005
Neg logit contributions
 friends-0.002
 there-0.002
 thinking-0.002
 here-0.002
 kids-0.002
Token the
Token ID262
Feature activation-7.14e-03

Pos logit contributions
 friends+0.002
 thinking+0.002
 there+0.002
 here+0.001
 things+0.001
Neg logit contributions
ifying-0.005
attered-0.004
amy-0.004
tr-0.004
shaped-0.004
Token park
Token ID3952
Feature activation-4.67e-03

Pos logit contributions
ifying+0.023
attered+0.020
rant+0.019
ksh+0.019
amy+0.019
Neg logit contributions
 friends-0.005
 there-0.004
 here-0.004
 things-0.004
 thinking-0.004
Token with
Token ID351
Feature activation+2.19e-03

Pos logit contributions
ifying+0.012
attered+0.010
tr+0.010
amy+0.010
shaped+0.009
Neg logit contributions
 friends-0.005
 thinking-0.004
 here-0.004
 there-0.004
 things-0.004
Token her
Token ID607
Feature activation+7.48e-03

Pos logit contributions
 friends+0.002
 there+0.002
 thinking+0.001
 here+0.001
 it+0.001
Neg logit contributions
ifying-0.007
attered-0.006
amy-0.005
rant-0.005
tr-0.005
Token mom
Token ID1995
Feature activation-1.55e-03

Pos logit contributions
 thinking+0.005
 here+0.005
 there+0.004
 it+0.004
 upon+0.003
Neg logit contributions
ifying-0.021
tr-0.019
attered-0.019
amy-0.017
shaped-0.017
Token and
Token ID290
Feature activation+1.17e-03

Pos logit contributions
ifying+0.004
attered+0.003
amy+0.003
tr+0.003
rant+0.003
Neg logit contributions
 friends-0.002
 thinking-0.002
 there-0.002
 here-0.002
 things-0.002
Token saw
Token ID2497
Feature activation-1.74e-02

Pos logit contributions
 friends+0.001
 thinking+0.001
 here+0.001
 things+0.001
 there+0.001
Neg logit contributions
ifying-0.003
attered-0.002
tr-0.002
amy-0.002
shaped-0.002
Token a
Token ID257
Feature activation+9.58e-03

Pos logit contributions
ifying+0.056
attered+0.049
rant+0.047
ksh+0.047
amy+0.046
Neg logit contributions
 there-0.013
 friends-0.013
 it-0.011
 here-0.010
 things-0.010
Token Bo
Token ID3248
Feature activation+1.10e-02

Pos logit contributions
ifying+0.019
attered+0.015
amy+0.014
rant+0.014
tr+0.014
Neg logit contributions
 friends-0.012
 thinking-0.012
 here-0.012
 there-0.012
 things-0.011
Token was
Token ID373
Feature activation-4.36e-04

Pos logit contributions
 friends+0.017
 thinking+0.017
 there+0.017
 here+0.016
 things+0.016
Neg logit contributions
ifying-0.023
attered-0.019
amy-0.018
rant-0.018
shaped-0.017
Token so
Token ID523
Feature activation+4.26e-03

Pos logit contributions
ifying+0.001
attered+0.001
rant+0.001
amy+0.001
shaped+0.001
Neg logit contributions
 friends-0.001
 things-0.001
 there-0.001
 thinking-0.001
 here-0.001
Token happy
Token ID3772
Feature activation-8.42e-04

Pos logit contributions
 friends+0.004
 things+0.004
 thinking+0.004
 there+0.003
 kids+0.003
Neg logit contributions
ifying-0.011
attered-0.010
shaped-0.009
amy-0.009
rant-0.009
Token and
Token ID290
Feature activation+3.61e-03

Pos logit contributions
ifying+0.002
work+0.001
attered+0.001
lier+0.001
amy+0.001
Neg logit contributions
 friends-0.001
 here-0.001
 thinking-0.001
 kids-0.001
Next-0.001
Token danced
Token ID39480
Feature activation+1.24e-02

Pos logit contributions
 friends+0.004
 there+0.004
 here+0.004
 thinking+0.004
 things+0.003
Neg logit contributions
ifying-0.009
attered-0.008
amy-0.007
rant-0.007
shaped-0.007
Token around
Token ID1088
Feature activation+2.20e-03

Pos logit contributions
 friends+0.016
 thinking+0.015
 here+0.015
 kids+0.015
 there+0.014
Neg logit contributions
ifying-0.028
attered-0.023
rant-0.022
lier-0.022
amy-0.022
Token with
Token ID351
Feature activation+6.05e-03

Pos logit contributions
 friends+0.003
 there+0.002
 kids+0.002
 here+0.002
 thinking+0.002
Neg logit contributions
ifying-0.005
rant-0.004
amy-0.004
attered-0.004
work-0.004
Token his
Token ID465
Feature activation+1.09e-02

Pos logit contributions
 here+0.007
 there+0.006
 upon+0.006
 thinking+0.006
"+0.006
Neg logit contributions
ifying-0.015
shaped-0.012
 unison-0.012
rant-0.012
attered-0.011
Token tw
Token ID665
Feature activation+5.37e-03

Pos logit contributions
 here+0.015
 there+0.015
 thinking+0.015
't+0.014
 kids+0.013
Neg logit contributions
ifying-0.021
amy-0.016
attered-0.016
tr-0.015
shaped-0.015
Tokenig
Token ID328
Feature activation+8.76e-03

Pos logit contributions
 there+0.010
 thinking+0.010
 friends+0.010
 kids+0.009
 here+0.009
Neg logit contributions
ifying-0.009
attered-0.008
piece-0.007
amy-0.007
shaped-0.007
Token sat
Token ID3332
Feature activation+1.19e-02

Pos logit contributions
 friends+0.002
 there+0.002
 here+0.002
 it+0.002
 thinking+0.002
Neg logit contributions
ifying-0.003
attered-0.002
amy-0.002
rant-0.002
tr-0.002
Token on
Token ID319
Feature activation-6.17e-03

Pos logit contributions
 friends+0.020
 thinking+0.019
 kids+0.017
 here+0.017
 discovery+0.017
Neg logit contributions
ifying-0.022
attered-0.018
amy-0.016
rant-0.016
piece-0.016
Token the
Token ID262
Feature activation-6.35e-03

Pos logit contributions
ifying+0.016
amy+0.012
shaped+0.012
attered+0.012
rant+0.012
Neg logit contributions
 friends-0.007
 thinking-0.006
 here-0.006
 there-0.005
 upon-0.005
Token slide
Token ID10649
Feature activation-7.57e-03

Pos logit contributions
ifying+0.017
attered+0.014
amy+0.014
rant+0.014
tr+0.013
Neg logit contributions
 friends-0.007
 there-0.007
 thinking-0.007
 it-0.006
 here-0.006
Token.
Token ID13
Feature activation-3.63e-03

Pos logit contributions
ifying+0.020
attered+0.017
amy+0.016
rant+0.016
tr+0.016
Neg logit contributions
 friends-0.009
 there-0.008
 thinking-0.008
 here-0.008
 things-0.007
Token But
Token ID887
Feature activation-9.89e-05

Pos logit contributions
ifying+0.009
attered+0.008
rant+0.007
amy+0.007
tr+0.007
Neg logit contributions
 friends-0.005
 there-0.004
 thinking-0.004
 here-0.004
 things-0.004
Token he
Token ID339
Feature activation+4.42e-03

Pos logit contributions
ifying+0.000
attered+0.000
rant+0.000
piece+0.000
amy+0.000
Neg logit contributions
 friends-0.000
 thinking-0.000
 here-0.000
 discovery-0.000
 kids-0.000
Token was
Token ID373
Feature activation-4.59e-03

Pos logit contributions
 friends+0.006
 thinking+0.006
 here+0.006
 there+0.006
 things+0.006
Neg logit contributions
ifying-0.009
attered-0.008
tr-0.007
amy-0.007
rant-0.007
Token scared
Token ID12008
Feature activation-6.79e-03

Pos logit contributions
ifying+0.012
attered+0.010
rant+0.009
amy+0.009
tr+0.009
Neg logit contributions
 friends-0.005
 thinking-0.004
 things-0.004
 there-0.004
 here-0.004
Token.
Token ID13
Feature activation-4.52e-03

Pos logit contributions
ifying+0.017
attered+0.014
amy+0.013
rant+0.013
lier+0.013
Neg logit contributions
 friends-0.008
 thinking-0.008
 there-0.007
 here-0.007
 things-0.007
Token He
Token ID679
Feature activation+3.86e-03

Pos logit contributions
ifying+0.010
attered+0.009
amy+0.008
ksh+0.008
tr+0.008
Neg logit contributions
 friends-0.006
 thinking-0.006
 here-0.006
 there-0.006
 things-0.005
Token broke
Token ID6265
Feature activation-7.59e-04

Pos logit contributions
ifying+0.048
attered+0.040
amy+0.038
rant+0.038
shaped+0.037
Neg logit contributions
 friends-0.024
 there-0.022
 here-0.021
 it-0.020
 things-0.020
Token.
Token ID13
Feature activation-1.41e-02

Pos logit contributions
ifying+0.002
attered+0.002
amy+0.002
rant+0.002
tr+0.002
Neg logit contributions
 friends-0.001
 there-0.001
 here-0.001
 thinking-0.001
 it-0.001
Token And
Token ID843
Feature activation-1.35e-02

Pos logit contributions
ifying+0.035
attered+0.030
amy+0.028
rant+0.028
tr+0.028
Neg logit contributions
 friends-0.018
 there-0.017
 thinking-0.016
 here-0.016
 it-0.015
Token the
Token ID262
Feature activation+5.58e-03

Pos logit contributions
ifying+0.033
tr+0.026
attered+0.025
bug+0.025
nam+0.025
Neg logit contributions
 thinking-0.014
 friends-0.014
 upon-0.013
 things-0.011
Next-0.011
Token pins
Token ID20567
Feature activation+3.79e-03

Pos logit contributions
 here+0.008
 there+0.008
 friends+0.007
 thinking+0.007
 it+0.007
Neg logit contributions
ifying-0.012
tr-0.010
attered-0.009
amy-0.009
bo-0.008
Token hurt
Token ID5938
Feature activation-4.26e-03

Pos logit contributions
 friends+0.006
 thinking+0.005
 there+0.005
 here+0.005
 kids+0.005
Neg logit contributions
ifying-0.008
attered-0.007
rant-0.006
tr-0.006
amy-0.006
Token me
Token ID502
Feature activation-1.26e-02

Pos logit contributions
ifying+0.012
attered+0.010
amy+0.010
rant+0.010
tr+0.010
Neg logit contributions
 friends-0.004
 there-0.004
 here-0.004
 it-0.004
 things-0.004
Token."
Token ID526
Feature activation+1.52e-03

Pos logit contributions
ifying+0.035
attered+0.030
amy+0.029
rant+0.028
shaped+0.028
Neg logit contributions
 friends-0.013
 there-0.012
 here-0.012
 thinking-0.011
 it-0.011
Token She
Token ID1375
Feature activation-3.54e-03

Pos logit contributions
 friends+0.001
 here+0.001
 thinking+0.001
 there+0.001
 things+0.001
Neg logit contributions
ifying-0.004
attered-0.004
tr-0.004
amy-0.004
shaped-0.004
Token sniff
Token ID26300
Feature activation-2.80e-03

Pos logit contributions
ifying+0.010
attered+0.008
rant+0.007
amy+0.007
shaped+0.007
Neg logit contributions
 friends-0.005
 there-0.004
 it-0.004
 thinking-0.004
 mom-0.004
Tokened
Token ID276
Feature activation-2.94e-04

Pos logit contributions
ifying+0.006
tr+0.004
attered+0.004
rant+0.004
 unison+0.004
Neg logit contributions
 friends-0.005
 there-0.005
 it-0.005
 things-0.005
 kids-0.005
Token,
Token ID11
Feature activation+2.46e-03

Pos logit contributions
 thinking+0.007
 friends+0.007
Next+0.007
 there+0.006
 here+0.006
Neg logit contributions
ifying-0.020
lier-0.017
attered-0.016
tr-0.016
ksh-0.015
Token sir
Token ID15967
Feature activation+1.40e-02

Pos logit contributions
 friends+0.003
 thinking+0.003
Next+0.003
 upon+0.003
 there+0.003
Neg logit contributions
ifying-0.006
tr-0.004
work-0.004
 unison-0.004
lier-0.004
Token.
Token ID13
Feature activation-7.91e-03

Pos logit contributions
 friends+0.018
 thinking+0.017
 there+0.016
 here+0.015
 things+0.015
Neg logit contributions
ifying-0.033
attered-0.027
lier-0.026
tr-0.026
amy-0.026
Token I
Token ID314
Feature activation-5.79e-03

Pos logit contributions
ifying+0.021
attered+0.018
amy+0.017
rant+0.017
tr+0.016
Neg logit contributions
 friends-0.009
 there-0.009
 thinking-0.008
 here-0.008
 it-0.008
Token won
Token ID1839
Feature activation+3.82e-03

Pos logit contributions
ifying+0.014
attered+0.012
tr+0.011
amy+0.011
rant+0.011
Neg logit contributions
 friends-0.008
 thinking-0.007
 there-0.007
 here-0.007
 things-0.007
Token't
Token ID470
Feature activation-1.10e-02

Pos logit contributions
 friends+0.003
 thinking+0.002
 upon+0.002
 there+0.002
 here+0.002
Neg logit contributions
ifying-0.011
amy-0.009
tr-0.009
shaped-0.009
attered-0.009
Token do
Token ID466
Feature activation-3.33e-03

Pos logit contributions
ifying+0.028
attered+0.024
amy+0.023
rant+0.022
tr+0.022
Neg logit contributions
 friends-0.013
 there-0.012
 thinking-0.012
 here-0.012
 things-0.011
Token it
Token ID340
Feature activation-1.04e-02

Pos logit contributions
ifying+0.010
attered+0.008
shaped+0.008
lier+0.008
amy+0.008
Neg logit contributions
 friends-0.002
 upon-0.002
 thinking-0.002
 there-0.002
 here-0.002
Token again
Token ID757
Feature activation+7.60e-03

Pos logit contributions
ifying+0.023
attered+0.019
tr+0.018
rant+0.018
amy+0.018
Neg logit contributions
 friends-0.015
 thinking-0.014
 there-0.014
 here-0.013
 kids-0.013
Token.
Token ID13
Feature activation-9.26e-03

Pos logit contributions
 friends+0.010
 thinking+0.009
 there+0.009
 upon+0.009
 kids+0.008
Neg logit contributions
ifying-0.017
lier-0.014
attered-0.014
rant-0.013
amy-0.013
Token Please
Token ID4222
Feature activation+8.77e-04

Pos logit contributions
ifying+0.024
attered+0.021
amy+0.020
rant+0.019
tr+0.019
Neg logit contributions
 friends-0.011
 there-0.010
 here-0.010
 thinking-0.010
 it-0.009
Token "
Token ID366
Feature activation+2.43e-03

Pos logit contributions
ifying+0.019
tr+0.016
attered+0.016
ksh+0.015
rant+0.015
Neg logit contributions
 friends-0.008
 here-0.007
 things-0.006
 there-0.006
 upon-0.006
TokenI
Token ID40
Feature activation+7.58e-03

Pos logit contributions
 friends+0.004
 thinking+0.004
 here+0.004
 there+0.004
 things+0.003
Neg logit contributions
ifying-0.005
attered-0.004
amy-0.004
lier-0.004
shaped-0.004
Token'm
Token ID1101
Feature activation+1.57e-03

Pos logit contributions
Next+0.012
 thinking+0.012
 friends+0.011
 things+0.011
 kids+0.011
Neg logit contributions
ifying-0.013
tr-0.011
attered-0.011
lier-0.011
bug-0.010
Token sorry
Token ID7926
Feature activation+4.98e-03

Pos logit contributions
 upon+0.002
 friends+0.002
 things+0.002
Next+0.002
 discovery+0.002
Neg logit contributions
ifying-0.003
rant-0.002
attered-0.002
lier-0.002
shaped-0.002
Token Tom
Token ID4186
Feature activation+4.76e-03

Pos logit contributions
 thinking+0.005
 friends+0.004
Next+0.004
 there+0.004
 upon+0.004
Neg logit contributions
ifying-0.012
lier-0.010
attered-0.010
tr-0.009
ksh-0.009
Token,
Token ID11
Feature activation-1.03e-02

Pos logit contributions
 friends+0.006
 thinking+0.006
 there+0.005
 here+0.005
 things+0.005
Neg logit contributions
ifying-0.011
attered-0.009
lier-0.009
tr-0.009
amy-0.009
Token but
Token ID475
Feature activation+8.50e-03

Pos logit contributions
ifying+0.024
lier+0.020
tr+0.019
bug+0.019
ksh+0.018
Neg logit contributions
 friends-0.010
 thinking-0.010
 upon-0.009
Next-0.008
 home-0.008
Token I
Token ID314
Feature activation-1.03e-03

Pos logit contributions
 thinking+0.010
 friends+0.009
 upon+0.008
Next+0.008
 here+0.008
Neg logit contributions
ifying-0.018
tr-0.015
work-0.014
nam-0.014
attered-0.014
Token need
Token ID761
Feature activation-2.04e-03

Pos logit contributions
ifying+0.002
attered+0.002
tr+0.002
rant+0.002
lier+0.002
Neg logit contributions
 thinking-0.001
 friends-0.001
 things-0.001
 there-0.001
Next-0.001
Token to
Token ID284
Feature activation-3.63e-03

Pos logit contributions
ifying+0.006
attered+0.005
amy+0.005
rant+0.005
tr+0.005
Neg logit contributions
 friends-0.002
 thinking-0.002
 there-0.002
 here-0.002
 upon-0.002
Token sell
Token ID3677
Feature activation+3.71e-03

Pos logit contributions
ifying+0.008
attered+0.006
tr+0.006
rant+0.006
amy+0.006
Neg logit contributions
 friends-0.005
 there-0.005
 thinking-0.005
 here-0.005
 upon-0.005
Token.
Token ID13
Feature activation+1.15e-04

Pos logit contributions
 friends+0.014
 there+0.013
 thinking+0.013
 here+0.013
 kids+0.012
Neg logit contributions
ifying-0.029
attered-0.025
tr-0.023
amy-0.023
rant-0.023
Token
Token ID198
Feature activation+2.68e-03

Pos logit contributions
 friends+0.000
 thinking+0.000
 here+0.000
 there+0.000
 things+0.000
Neg logit contributions
ifying-0.000
attered-0.000
tr-0.000
ksh-0.000
amy-0.000
Token
Token ID198
Feature activation+2.66e-03

Pos logit contributions
 thinking+0.003
 friends+0.003
 things+0.003
 here+0.003
 kids+0.003
Neg logit contributions
ifying-0.006
tr-0.005
shaped-0.005
bo-0.005
amy-0.005
TokenThe
Token ID464
Feature activation+1.23e-03

Pos logit contributions
 friends+0.004
 thinking+0.004
 here+0.004
 there+0.004
 things+0.003
Neg logit contributions
ifying-0.006
attered-0.005
amy-0.005
tr-0.005
shaped-0.005
Token monkey
Token ID21657
Feature activation+2.19e-03

Pos logit contributions
 there+0.002
 it+0.002
 here+0.002
 upon+0.002
 thinking+0.002
Neg logit contributions
ifying-0.002
tr-0.002
amy-0.002
attered-0.002
bug-0.002
Token was
Token ID373
Feature activation+8.59e-04

Pos logit contributions
 thinking+0.003
 here+0.003
 friends+0.003
 there+0.003
 things+0.003
Neg logit contributions
ifying-0.004
attered-0.004
tr-0.004
amy-0.004
lier-0.004
Token about
Token ID546
Feature activation+2.10e-02

Pos logit contributions
 friends+0.001
 things+0.001
 upon+0.001
 thinking+0.001
 there+0.001
Neg logit contributions
ifying-0.002
attered-0.002
tr-0.002
rant-0.001
amy-0.001
Token to
Token ID284
Feature activation+8.60e-03

Pos logit contributions
 friends+0.027
 thinking+0.025
 here+0.025
 there+0.025
 kids+0.024
Neg logit contributions
ifying-0.038
tr-0.030
shaped-0.029
attered-0.029
nam-0.028
Token win
Token ID1592
Feature activation+9.94e-03

Pos logit contributions
 friends+0.012
 there+0.012
 thinking+0.011
 upon+0.011
 here+0.011
Neg logit contributions
ifying-0.018
rant-0.014
tr-0.014
amy-0.014
attered-0.014
Token the
Token ID262
Feature activation+1.87e-03

Pos logit contributions
 friends+0.010
 thinking+0.009
 there+0.009
 here+0.009
 kids+0.008
Neg logit contributions
ifying-0.026
attered-0.022
amy-0.021
rant-0.021
lier-0.021
Token race
Token ID3234
Feature activation+3.09e-03

Pos logit contributions
 there+0.003
 friends+0.003
 thinking+0.003
 here+0.003
 upon+0.003
Neg logit contributions
ifying-0.004
attered-0.003
tr-0.003
amy-0.003
rant-0.003
Token time
Token ID640
Feature activation+5.34e-03

Pos logit contributions
ifying+0.046
attered+0.040
amy+0.039
tr+0.039
rant+0.039
Neg logit contributions
 friends-0.010
 there-0.009
 thinking-0.008
 here-0.008
 things-0.007
Token there
Token ID612
Feature activation+2.67e-04

Pos logit contributions
 friends+0.006
 thinking+0.006
 man+0.005
Next+0.005
 kids+0.005
Neg logit contributions
ifying-0.012
tr-0.009
attered-0.009
amy-0.009
bug-0.009
Token was
Token ID373
Feature activation-2.89e-03

Pos logit contributions
 friends+0.000
 thinking+0.000
 here+0.000
 there+0.000
 things+0.000
Neg logit contributions
ifying-0.001
attered-0.000
amy-0.000
tr-0.000
rant-0.000
Token a
Token ID257
Feature activation+3.51e-03

Pos logit contributions
ifying+0.008
attered+0.007
rant+0.007
amy+0.007
tr+0.007
Neg logit contributions
 friends-0.003
 there-0.003
 thinking-0.002
 here-0.002
 it-0.002
Token naughty
Token ID45128
Feature activation+1.41e-02

Pos logit contributions
 friends+0.003
 there+0.002
 thinking+0.002
 here+0.002
 things+0.002
Neg logit contributions
ifying-0.010
attered-0.009
amy-0.009
tr-0.009
rant-0.009
Token boy
Token ID2933
Feature activation-2.36e-02

Pos logit contributions
 friends+0.012
 there+0.012
 here+0.011
 thinking+0.011
 it+0.010
Neg logit contributions
ifying-0.039
attered-0.034
tr-0.032
amy-0.032
rant-0.032
Token.
Token ID13
Feature activation-2.67e-03

Pos logit contributions
ifying+0.064
attered+0.051
ksh+0.051
rant+0.050
amy+0.049
Neg logit contributions
 friends-0.032
 there-0.031
 thinking-0.028
 it-0.027
 here-0.026
Token His
Token ID2399
Feature activation+1.31e-02

Pos logit contributions
ifying+0.008
attered+0.007
amy+0.007
rant+0.007
shaped+0.007
Neg logit contributions
 friends-0.002
 there-0.002
 kids-0.002
-0.002
 thinking-0.002
Token name
Token ID1438
Feature activation-9.10e-03

Pos logit contributions
 there+0.012
 here+0.011
 it+0.011
 thinking+0.010
 friends+0.010
Neg logit contributions
ifying-0.035
tr-0.031
attered-0.031
amy-0.028
ksh-0.028
Token was
Token ID373
Feature activation-2.02e-02

Pos logit contributions
ifying+0.028
attered+0.023
rant+0.023
ksh+0.022
amy+0.022
Neg logit contributions
 friends-0.009
 there-0.008
 it-0.007
 things-0.007
 kids-0.007
Token Ben
Token ID3932
Feature activation+3.94e-03

Pos logit contributions
ifying+0.062
attered+0.054
rant+0.051
amy+0.051
ksh+0.050
Neg logit contributions
 friends-0.017
 there-0.016
 thinking-0.015
 here-0.013
 it-0.013
Token decided
Token ID3066
Feature activation+1.03e-03

Pos logit contributions
ifying+0.003
attered+0.002
amy+0.002
rant+0.002
tr+0.002
Neg logit contributions
 friends-0.002
 there-0.002
 thinking-0.002
 here-0.002
 things-0.002
Token that
Token ID326
Feature activation+3.26e-03

Pos logit contributions
 friends+0.001
 kids+0.001
 here+0.001
 upon+0.001
 thinking+0.001
Neg logit contributions
ifying-0.002
attered-0.002
shaped-0.002
tr-0.002
piece-0.002
Token was
Token ID373
Feature activation-8.01e-03

Pos logit contributions
 friends+0.002
 here+0.002
 thinking+0.002
 upon+0.002
 there+0.002
Neg logit contributions
ifying-0.009
attered-0.007
tr-0.007
lier-0.007
work-0.007
Token the
Token ID262
Feature activation+6.64e-03

Pos logit contributions
ifying+0.020
attered+0.017
rant+0.016
amy+0.016
tr+0.016
Neg logit contributions
 friends-0.009
 there-0.008
 thinking-0.008
 here-0.008
 kids-0.008
Token best
Token ID1266
Feature activation-2.88e-03

Pos logit contributions
 friends+0.008
 there+0.008
 here+0.008
 thinking+0.008
 it+0.007
Neg logit contributions
ifying-0.016
tr-0.013
attered-0.013
amy-0.012
rant-0.012
Token plan
Token ID1410
Feature activation-2.14e-02

Pos logit contributions
ifying+0.006
attered+0.005
tr+0.005
work+0.005
lier+0.005
Neg logit contributions
 friends-0.004
 thinking-0.004
 there-0.004
 here-0.004
 upon-0.004
Token -
Token ID532
Feature activation+3.53e-03

Pos logit contributions
ifying+0.050
attered+0.042
tr+0.039
rant+0.039
amy+0.039
Neg logit contributions
 friends-0.029
 there-0.027
 thinking-0.027
 here-0.026
 things-0.025
Token to
Token ID284
Feature activation-6.09e-03

Pos logit contributions
 friends+0.003
 thinking+0.003
 there+0.003
 upon+0.003
 here+0.003
Neg logit contributions
ifying-0.009
tr-0.007
attered-0.007
lier-0.007
work-0.007
Token try
Token ID1949
Feature activation+9.39e-03

Pos logit contributions
ifying+0.014
attered+0.011
tr+0.011
rant+0.011
amy+0.011
Neg logit contributions
 friends-0.008
 there-0.008
 thinking-0.008
 here-0.008
 things-0.007
Token again
Token ID757
Feature activation+9.22e-03

Pos logit contributions
 friends+0.009
 thinking+0.008
 there+0.008
 here+0.007
 kids+0.007
Neg logit contributions
ifying-0.025
rant-0.020
attered-0.020
lier-0.020
amy-0.020
Token and
Token ID290
Feature activation-3.10e-03

Pos logit contributions
 friends+0.008
 thinking+0.008
 there+0.007
 here+0.007
 kids+0.007
Neg logit contributions
ifying-0.025
attered-0.021
amy-0.020
rant-0.020
lier-0.020
Token in
Token ID287
Feature activation+7.04e-03

Pos logit contributions
ifying+0.012
attered+0.011
amy+0.010
rant+0.010
tr+0.010
Neg logit contributions
 friends-0.004
 there-0.004
 thinking-0.003
 here-0.003
 it-0.003
Token the
Token ID262
Feature activation+1.55e-03

Pos logit contributions
 friends+0.006
 there+0.005
 thinking+0.005
 here+0.005
 things+0.004
Neg logit contributions
ifying-0.020
attered-0.018
amy-0.017
rant-0.017
tr-0.017
Token garden
Token ID11376
Feature activation+1.79e-02

Pos logit contributions
 friends+0.002
 there+0.002
 things+0.001
 here+0.001
 thinking+0.001
Neg logit contributions
ifying-0.004
attered-0.004
ksh-0.004
rant-0.004
tr-0.004
Token and
Token ID290
Feature activation+1.12e-03

Pos logit contributions
 friends+0.015
 there+0.013
 thinking+0.013
 here+0.013
 it+0.011
Neg logit contributions
ifying-0.051
attered-0.044
amy-0.043
tr-0.042
rant-0.042
Token took
Token ID1718
Feature activation+3.08e-03

Pos logit contributions
 friends+0.002
 there+0.002
 thinking+0.002
 here+0.002
 it+0.001
Neg logit contributions
ifying-0.003
attered-0.002
amy-0.002
rant-0.002
tr-0.002
Token care
Token ID1337
Feature activation-2.38e-02

Pos logit contributions
 friends+0.003
 there+0.003
 things+0.003
 it+0.002
 thinking+0.002
Neg logit contributions
ifying-0.009
attered-0.008
tr-0.008
amy-0.007
rant-0.007
Token of
Token ID286
Feature activation+5.67e-03

Pos logit contributions
ifying+0.069
attered+0.060
amy+0.058
rant+0.057
tr+0.057
Neg logit contributions
 friends-0.022
 there-0.020
 here-0.018
 thinking-0.017
 it-0.017
Token the
Token ID262
Feature activation+1.10e-02

Pos logit contributions
 friends+0.006
 there+0.006
 thinking+0.005
 here+0.005
 it+0.005
Neg logit contributions
ifying-0.016
attered-0.013
tr-0.013
amy-0.013
rant-0.013
Token flowers
Token ID12734
Feature activation-4.03e-03

Pos logit contributions
 thinking+0.014
 friends+0.013
 here+0.013
 there+0.013
 it+0.012
Neg logit contributions
ifying-0.024
attered-0.021
amy-0.020
rant-0.020
tr-0.019
Token.
Token ID13
Feature activation+7.78e-04

Pos logit contributions
ifying+0.011
attered+0.010
amy+0.009
rant+0.009
tr+0.009
Neg logit contributions
 friends-0.004
 there-0.004
 here-0.004
 thinking-0.004
 it-0.003
Token<|endoftext|>
Token ID50256
Feature activation-4.54e-04

Pos logit contributions
 friends+0.001
 here+0.001
 there+0.001
 thinking+0.001
 things+0.000
Neg logit contributions
ifying-0.002
attered-0.002
amy-0.002
tr-0.002
rant-0.002
Token nodded
Token ID14464
Feature activation-9.35e-03

Pos logit contributions
ifying+0.004
attered+0.004
amy+0.003
rant+0.003
tr+0.003
Neg logit contributions
 friends-0.002
 there-0.002
 thinking-0.002
 kids-0.002
 mom-0.002
Token.
Token ID13
Feature activation-2.26e-03

Pos logit contributions
ifying+0.028
attered+0.024
amy+0.023
rant+0.023
tr+0.023
Neg logit contributions
 friends-0.008
 there-0.007
 thinking-0.007
 here-0.006
 it-0.006
Token They
Token ID1119
Feature activation-4.74e-03

Pos logit contributions
ifying+0.005
attered+0.004
amy+0.004
rant+0.004
tr+0.004
Neg logit contributions
 friends-0.003
 there-0.003
 thinking-0.003
 here-0.003
 it-0.003
Token hugged
Token ID41130
Feature activation-1.02e-02

Pos logit contributions
ifying+0.013
attered+0.011
rant+0.010
amy+0.010
ksh+0.010
Neg logit contributions
 friends-0.006
 there-0.005
 thinking-0.005
 mom-0.005
 kids-0.005
Token their
Token ID511
Feature activation+1.67e-03

Pos logit contributions
ifying+0.028
attered+0.024
amy+0.023
rant+0.023
tr+0.022
Neg logit contributions
 friends-0.011
 there-0.010
 thinking-0.010
 here-0.009
 it-0.009
Token mom
Token ID1995
Feature activation-2.26e-02

Pos logit contributions
 there+0.001
 friends+0.001
 thinking+0.001
 here+0.001
 it+0.000
Neg logit contributions
ifying-0.005
attered-0.005
tr-0.005
amy-0.005
rant-0.004
Token and
Token ID290
Feature activation-3.16e-03

Pos logit contributions
ifying+0.077
attered+0.069
amy+0.066
rant+0.065
tr+0.065
Neg logit contributions
 friends-0.011
 there-0.008
 thinking-0.007
 things-0.006
 it-0.006
Token thanked
Token ID26280
Feature activation-4.88e-04

Pos logit contributions
ifying+0.010
attered+0.008
rant+0.008
amy+0.008
ksh+0.008
Neg logit contributions
 friends-0.004
 mom-0.003
 there-0.003
 thinking-0.003
 it-0.003
Token her
Token ID607
Feature activation+8.21e-04

Pos logit contributions
ifying+0.001
attered+0.001
rant+0.001
amy+0.001
tr+0.001
Neg logit contributions
 friends-0.000
 there-0.000
 thinking-0.000
 here-0.000
 upon-0.000
Token.
Token ID13
Feature activation+2.47e-03

Pos logit contributions
 friends+0.001
 there+0.001
 here+0.001
 thinking+0.001
 kids+0.001
Neg logit contributions
ifying-0.002
tr-0.002
attered-0.002
rant-0.002
ksh-0.002
Token They
Token ID1119
Feature activation-6.76e-03

Pos logit contributions
 friends+0.004
 here+0.004
 thinking+0.004
 there+0.004
 things+0.004
Neg logit contributions
ifying-0.005
attered-0.004
tr-0.004
amy-0.003
rant-0.003
Token taken
Token ID2077
Feature activation+4.03e-03

Pos logit contributions
ifying+0.037
attered+0.031
tr+0.030
rant+0.030
amy+0.029
Neg logit contributions
 friends-0.018
 there-0.018
 here-0.016
 thinking-0.016
 it-0.016
Token the
Token ID262
Feature activation-3.48e-03

Pos logit contributions
 friends+0.004
 thinking+0.003
 there+0.003
 here+0.003
 upon+0.003
Neg logit contributions
ifying-0.011
attered-0.009
amy-0.009
rant-0.009
shaped-0.009
Token shiny
Token ID22441
Feature activation-1.37e-02

Pos logit contributions
ifying+0.007
attered+0.005
tr+0.005
amy+0.005
rant+0.005
Neg logit contributions
 friends-0.006
 there-0.006
 thinking-0.006
 here-0.006
 it-0.006
Token magnet
Token ID19972
Feature activation-2.05e-02

Pos logit contributions
ifying+0.030
attered+0.025
amy+0.024
tr+0.023
rant+0.023
Neg logit contributions
 friends-0.022
 there-0.020
 here-0.020
 thinking-0.019
 things-0.019
Token.
Token ID13
Feature activation-2.39e-03

Pos logit contributions
ifying+0.056
attered+0.047
amy+0.046
rant+0.045
tr+0.045
Neg logit contributions
 friends-0.022
 there-0.020
 here-0.019
 thinking-0.019
 it-0.017
Token It
Token ID632
Feature activation-2.22e-02

Pos logit contributions
ifying+0.006
attered+0.005
rant+0.005
amy+0.005
shaped+0.005
Neg logit contributions
 friends-0.003
 there-0.003
 here-0.002
 thinking-0.002
 it-0.002
Token is
Token ID318
Feature activation-1.77e-02

Pos logit contributions
ifying+0.064
attered+0.054
amy+0.052
 unison+0.052
rant+0.051
Neg logit contributions
 friends-0.022
 there-0.021
 here-0.020
 it-0.019
 things-0.018
Token not
Token ID407
Feature activation-1.20e-02

Pos logit contributions
ifying+0.050
attered+0.043
amy+0.041
rant+0.041
tr+0.040
Neg logit contributions
 friends-0.017
 there-0.016
 here-0.015
 thinking-0.015
 it-0.013
Token mine
Token ID6164
Feature activation+2.10e-04

Pos logit contributions
ifying+0.033
attered+0.029
amy+0.028
rant+0.027
tr+0.027
Neg logit contributions
 friends-0.013
 there-0.012
 here-0.011
 thinking-0.011
 it-0.010
Token.
Token ID13
Feature activation-5.27e-03

Pos logit contributions
 friends+0.000
 thinking+0.000
 there+0.000
 here+0.000
 things+0.000
Neg logit contributions
ifying-0.001
attered-0.000
tr-0.000
rant-0.000
amy-0.000
Token It
Token ID632
Feature activation-2.35e-02

Pos logit contributions
ifying+0.014
attered+0.012
rant+0.012
amy+0.012
shaped+0.011
Neg logit contributions
 friends-0.006
 there-0.005
 here-0.005
 it-0.005
 thinking-0.005