TOP ACTIVATIONS
MAX = 3.075

and his friend tasted the delic iously sweet treats the restaurant
away . The popcorn was delic iously crunch y and tasted
see the bird and have pic n ics . <|endoftext|> Once
got lost , the l ibr arians would help him find
away from dangerous things like volcan oes . <|endoftext|> Once upon
treat . The pastry was delic iously sweet and she knew
P oppy sees Roc co and bar ks .
gone . Days went by and Max 's
big book with pictures of volcan oes . Vol can oes
the ingredients to make her delic iously sweet past ries .
big book with pictures of volcan oes . Vol can oes
day , a chocolate - col oured whip named C heddar
a strange , raspberry - col oured object hiding under a
reliable that it had out sm arted the big scary shark
also added some spicy - sm elling flowers to her dress
just wanted to write about volcan oes . We like volcan
sunshine . Molly liked the delic ately pet alled yellow ones
that !" Charl otte thanked her mother for
replied Jim . Days passed and Bob noticed something
she got rid of the intr uders . She did not

MIN ACTIVATIONS
MIN = -3.013

It was a big , red barn and it had lots
y ummy . Just a little bite ." She
It will tick le a little . You can watch yourself
they want a new hairst yle or when their hair is
. He asked the zo oke eper if he could borrow
Yes , but only a little . The oats are for
asis ! â The little boy was very excited and
the kitchen . Wait a little bit ," mom says .
there was a loud * be ep * from the oven
the cake . Just a little bit . Please , Anna
would pour it from the blue cup into the green cup
One day a trou l be came to the bridge .
at him with a mis chie vous grin . She said
is more shiny . The red one is old and boring
. Ben asked the zo oke eper if he could touch
the blue cup into the green cup . One
With a mis chie vous gl int in her
was bright and warm , and Jos ie was happy !
yawn . The little boy yawn ed and stretched
to have a cool hairst yle , but he didn 't

INTERVAL 1.553 - 3.075
CONTAINS 0.244%

her mom would always be there to help her . <|endoftext|>
others . If you do something wrong , you should say
, a little girl came over to the t eddy bear
Hey , dog , return our ball !" Sara shouts .
." Anna forg ives Tom . She says he

INTERVAL 0.031 - 1.553
CONTAINS 46.194%

trouble . <|endoftext|> Once upon a time there was a cool
friends . One day , Tim my and his friends were
, and finally stopped to rest . With a final scatter
can 't . This is our garden . You are not
. They loved playing together and having y ummy snacks afterwards

INTERVAL -1.491 - 0.031
CONTAINS 53.368%

not okay . My knee is red and sore . It
and get help from a grown up . Belle ran as
liked pink and flowers and dolls . Tom liked blue and
Thank you for being so nice and helpful ."
Your words are hurt ful and rude . You have no

INTERVAL -3.013 - -1.491
CONTAINS 0.193%

ran to the bathroom but the door was tight and she
!" The little girl was so excited and
heard L ila and Ben . She came to them and
her mom said . Instead , Lily gave the spider a
boat sailed on the big blue sea . The wind blew
Token and
Token ID290
Feature activation-6.36e-02

Pos logit contributions
ach+0.003
aman+0.003
oun+0.003
 Kel+0.003
ait+0.003
Neg logit contributions
 very-0.001
 too-0.001
 a-0.001
 so-0.001
 called-0.001
Token his
Token ID465
Feature activation+4.02e-01

Pos logit contributions
 too+0.002
 very+0.002
 a+0.001
 so+0.001
 called+0.001
Neg logit contributions
ach-0.003
aman-0.003
oun-0.003
liv-0.003
z-0.003
Token friend
Token ID1545
Feature activation+1.19e-01

Pos logit contributions
aman+0.022
ach+0.021
oun+0.021
 Kel+0.021
liv+0.021
Neg logit contributions
 very-0.011
 too-0.011
 a-0.009
 so-0.008
 all-0.007
Token tasted
Token ID29187
Feature activation-2.49e-01

Pos logit contributions
ach+0.007
aman+0.007
oun+0.007
liv+0.007
 Kel+0.007
Neg logit contributions
 very-0.003
 too-0.003
 a-0.002
 so-0.002
 called-0.002
Token the
Token ID262
Feature activation+2.04e-01

Pos logit contributions
 too+0.006
 very+0.006
 a+0.004
 called+0.004
 bored+0.004
Neg logit contributions
ach-0.013
aman-0.013
oun-0.012
 Kel-0.012
liv-0.012
Token delic
Token ID8675
Feature activation+3.07e+00

Pos logit contributions
ach+0.009
aman+0.009
oun+0.009
 Kel+0.009
liv+0.009
Neg logit contributions
 very-0.007
 too-0.007
 a-0.006
 so-0.005
 hungry-0.005
Tokeniously
Token ID6819
Feature activation+4.63e-01

Pos logit contributions
liv+0.097
Aud+0.095
aman+0.090
 Kel+0.086
 stret+0.085
Neg logit contributions
 too-0.152
 very-0.135
 a-0.122
 all-0.118
 so-0.116
Token sweet
Token ID6029
Feature activation-1.32e-01

Pos logit contributions
liv+0.030
aman+0.030
oun+0.030
 Kel+0.030
ach+0.030
Neg logit contributions
 too-0.008
 very-0.008
 a-0.005
 so-0.004
 hungry-0.003
Token treats
Token ID18432
Feature activation+7.49e-01

Pos logit contributions
 too+0.003
 very+0.003
 a+0.002
 so+0.002
 all+0.002
Neg logit contributions
liv-0.007
oun-0.007
 Kel-0.007
aman-0.007
z-0.007
Token the
Token ID262
Feature activation+1.22e-01

Pos logit contributions
aman+0.046
ach+0.046
oun+0.046
liv+0.045
 Kel+0.045
Neg logit contributions
 too-0.016
 very-0.015
 a-0.010
 so-0.009
 all-0.009
Token restaurant
Token ID7072
Feature activation+3.40e-01

Pos logit contributions
aman+0.005
ach+0.005
oun+0.005
liv+0.005
 Kel+0.005
Neg logit contributions
 very-0.004
 too-0.004
 a-0.004
 so-0.003
 hungry-0.003
Token away
Token ID1497
Feature activation+4.69e-01

Pos logit contributions
 too+0.012
 very+0.012
 a+0.010
 so+0.007
 all+0.007
Neg logit contributions
aman-0.038
ach-0.037
liv-0.036
oun-0.036
becca-0.036
Token.
Token ID13
Feature activation+6.43e-01

Pos logit contributions
aman+0.029
ach+0.029
oun+0.029
liv+0.028
 Kel+0.028
Neg logit contributions
 too-0.008
 very-0.008
 a-0.006
 so-0.004
 all-0.004
Token The
Token ID383
Feature activation-3.06e-01

Pos logit contributions
ach+0.030
aman+0.030
oun+0.029
 Kel+0.029
z+0.028
Neg logit contributions
 too-0.021
 very-0.020
 a-0.017
 so-0.016
 called-0.015
Token popcorn
Token ID38914
Feature activation-2.97e-01

Pos logit contributions
 too+0.011
 very+0.011
 a+0.010
 so+0.009
 all+0.008
Neg logit contributions
ach-0.014
aman-0.014
 Kel-0.013
oun-0.013
z-0.013
Token was
Token ID373
Feature activation-7.51e-02

Pos logit contributions
 very+0.007
 too+0.007
 a+0.005
 all+0.005
 so+0.005
Neg logit contributions
aman-0.018
ach-0.018
 Kel-0.018
ait-0.017
oun-0.017
Token delic
Token ID8675
Feature activation+2.99e+00

Pos logit contributions
 very+0.001
 too+0.001
 a+0.001
 so+0.001
 all+0.001
Neg logit contributions
ach-0.005
aman-0.005
oun-0.005
 Kel-0.005
liv-0.005
Tokeniously
Token ID6819
Feature activation+1.96e-02

Pos logit contributions
Aud+0.085
liv+0.080
aman+0.080
oun+0.076
 stret+0.075
Neg logit contributions
 too-0.151
 very-0.136
 a-0.128
 so-0.123
 all-0.123
Token crunch
Token ID23527
Feature activation+1.87e-01

Pos logit contributions
liv+0.001
aman+0.001
ach+0.001
oun+0.001
 Kel+0.001
Neg logit contributions
 very-0.001
 too-0.001
 a-0.001
 so-0.000
 hungry-0.000
Tokeny
Token ID88
Feature activation-7.91e-01

Pos logit contributions
aman+0.008
ach+0.008
 Kel+0.008
oun+0.008
liv+0.008
Neg logit contributions
 too-0.007
 very-0.007
 a-0.006
 so-0.005
 all-0.005
Token and
Token ID290
Feature activation-1.76e-01

Pos logit contributions
 too+0.014
 very+0.012
 so+0.007
 a+0.007
 all+0.005
Neg logit contributions
liv-0.055
aman-0.054
 Kel-0.054
oun-0.053
ach-0.053
Token tasted
Token ID29187
Feature activation+2.06e-01

Pos logit contributions
 too+0.006
 very+0.006
 a+0.005
 so+0.004
 called+0.004
Neg logit contributions
ach-0.009
aman-0.009
oun-0.009
ait-0.008
z-0.008
Token see
Token ID766
Feature activation-9.16e-03

Pos logit contributions
aman+0.018
ach+0.018
 Kel+0.017
oun+0.017
z+0.017
Neg logit contributions
 too-0.017
 very-0.016
 bored-0.013
 a-0.013
 so-0.013
Token the
Token ID262
Feature activation+8.77e-01

Pos logit contributions
 too+0.000
 very+0.000
 a+0.000
 so+0.000
 all+0.000
Neg logit contributions
ach-0.001
aman-0.001
oun-0.001
liv-0.001
z-0.001
Token bird
Token ID6512
Feature activation+5.09e-02

Pos logit contributions
ach+0.044
aman+0.043
liv+0.043
oun+0.043
 Kel+0.043
Neg logit contributions
 too-0.027
 very-0.027
 a-0.022
 so-0.019
 all-0.019
Token and
Token ID290
Feature activation-1.55e-01

Pos logit contributions
ach+0.003
aman+0.003
oun+0.003
 Kel+0.003
z+0.003
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.001
 called-0.001
Token have
Token ID423
Feature activation-4.54e-01

Pos logit contributions
 too+0.004
 very+0.004
 a+0.003
 so+0.003
 bored+0.003
Neg logit contributions
aman-0.008
ach-0.008
 Kel-0.008
oun-0.008
liv-0.007
Token pic
Token ID8301
Feature activation+2.96e+00

Pos logit contributions
 too+0.010
 very+0.009
 a+0.007
 so+0.006
 called+0.005
Neg logit contributions
aman-0.026
ach-0.026
oun-0.025
 Kel-0.025
liv-0.025
Tokenn
Token ID77
Feature activation+8.85e-01

Pos logit contributions
aman+0.088
ach+0.088
oun+0.085
 Kel+0.084
ait+0.079
Neg logit contributions
 too-0.144
 very-0.141
 a-0.127
 so-0.119
 all-0.115
Tokenics
Token ID873
Feature activation+5.62e-01

Pos logit contributions
aman+0.029
ach+0.028
 Kel+0.026
oun+0.026
umm+0.026
Neg logit contributions
 too-0.041
 very-0.041
 a-0.036
 all-0.034
 so-0.034
Token.
Token ID13
Feature activation+3.97e-01

Pos logit contributions
aman+0.034
oun+0.034
 Kel+0.033
ach+0.033
liv+0.033
Neg logit contributions
 too-0.013
 very-0.012
 a-0.009
 so-0.008
 all-0.007
Token<|endoftext|>
Token ID50256
Feature activation-5.41e-01

Pos logit contributions
oun+0.022
aman+0.022
 Kel+0.021
ach+0.021
liv+0.021
Neg logit contributions
 too-0.009
 very-0.009
 a-0.006
 bored-0.006
 called-0.005
TokenOnce
Token ID7454
Feature activation+4.49e-01

Pos logit contributions
 too+0.012
 very+0.012
 bored+0.008
 a+0.008
 so+0.008
Neg logit contributions
ach-0.029
aman-0.029
oun-0.028
 Kel-0.028
z-0.028
Token got
Token ID1392
Feature activation-1.75e-01

Pos logit contributions
 too+0.015
 very+0.015
 a+0.011
 so+0.010
 called+0.009
Neg logit contributions
ach-0.036
aman-0.036
oun-0.035
 Kel-0.034
liv-0.034
Token lost
Token ID2626
Feature activation-4.07e-03

Pos logit contributions
 too+0.004
 very+0.004
 a+0.003
 so+0.002
 called+0.002
Neg logit contributions
aman-0.010
ach-0.010
oun-0.010
 Kel-0.010
liv-0.009
Token,
Token ID11
Feature activation-1.66e-01

Pos logit contributions
 too+0.000
 very+0.000
 a+0.000
 so+0.000
 all+0.000
Neg logit contributions
oun-0.000
aman-0.000
liv-0.000
 Kel-0.000
ach-0.000
Token the
Token ID262
Feature activation-5.69e-01

Pos logit contributions
 too+0.003
 very+0.003
 a+0.003
 so+0.002
 all+0.002
Neg logit contributions
oun-0.011
aman-0.011
ach-0.011
z-0.010
liv-0.010
Token l
Token ID300
Feature activation+6.13e-02

Pos logit contributions
 too+0.020
 very+0.020
 a+0.017
 so+0.016
 called+0.015
Neg logit contributions
ach-0.025
aman-0.025
oun-0.025
 Kel-0.024
z-0.023
Tokenibr
Token ID2889
Feature activation+2.96e+00

Pos logit contributions
aman+0.002
ach+0.002
oun+0.002
 Kel+0.002
z+0.002
Neg logit contributions
 too-0.002
 very-0.002
 a-0.002
 called-0.002
 so-0.002
Tokenarians
Token ID13517
Feature activation-3.48e-01

Pos logit contributions
 Kel+0.071
aman+0.070
oun+0.063
liv+0.062
less+0.059
Neg logit contributions
 too-0.150
 very-0.140
 called-0.124
 named-0.122
 a-0.121
Token would
Token ID561
Feature activation-4.23e-01

Pos logit contributions
 too+0.009
 very+0.009
 a+0.007
 so+0.006
 all+0.006
Neg logit contributions
aman-0.020
ach-0.020
liv-0.019
oun-0.019
 Kel-0.019
Token help
Token ID1037
Feature activation+6.23e-01

Pos logit contributions
 too+0.011
 very+0.011
 a+0.008
 so+0.008
 all+0.007
Neg logit contributions
ach-0.023
aman-0.023
oun-0.022
z-0.022
liv-0.022
Token him
Token ID683
Feature activation-3.84e-01

Pos logit contributions
oun+0.039
liv+0.038
aman+0.038
umm+0.037
z+0.037
Neg logit contributions
 too-0.015
 very-0.015
 a-0.012
 so-0.010
 all-0.009
Token find
Token ID1064
Feature activation+2.56e-01

Pos logit contributions
 too+0.010
 very+0.010
 a+0.008
 so+0.007
 called+0.007
Neg logit contributions
aman-0.020
ach-0.020
oun-0.019
 Kel-0.019
liv-0.019
Token away
Token ID1497
Feature activation+1.17e+00

Pos logit contributions
ach+0.012
aman+0.012
 Kel+0.012
oun+0.012
z+0.011
Neg logit contributions
 too-0.007
 very-0.007
 a-0.006
 so-0.005
 called-0.005
Token from
Token ID422
Feature activation+1.52e-01

Pos logit contributions
aman+0.062
ach+0.062
oun+0.060
liv+0.060
 Kel+0.059
Neg logit contributions
 too-0.032
 very-0.032
 a-0.025
 so-0.023
 all-0.021
Token dangerous
Token ID4923
Feature activation+8.70e-01

Pos logit contributions
aman+0.008
ach+0.008
oun+0.008
 Kel+0.008
liv+0.008
Neg logit contributions
 too-0.004
 very-0.004
 a-0.003
 so-0.003
 called-0.003
Token things
Token ID1243
Feature activation+9.14e-01

Pos logit contributions
aman+0.041
liv+0.041
oun+0.040
 Kel+0.038
becca+0.038
Neg logit contributions
 too-0.032
 very-0.031
 a-0.027
 so-0.025
 all-0.024
Token like
Token ID588
Feature activation+3.20e-01

Pos logit contributions
aman+0.061
ach+0.060
 Kel+0.059
liv+0.059
oun+0.059
Neg logit contributions
 too-0.016
 very-0.015
 a-0.009
 so-0.008
 all-0.007
Token volcan
Token ID17516
Feature activation+2.94e+00

Pos logit contributions
aman+0.018
oun+0.018
liv+0.018
ach+0.018
z+0.017
Neg logit contributions
 too-0.009
 very-0.009
 a-0.007
 so-0.006
 all-0.006
Tokenoes
Token ID3028
Feature activation+3.10e-02

Pos logit contributions
aman+0.122
 Kel+0.118
ach+0.118
liv+0.117
oun+0.116
Neg logit contributions
 too-0.113
 very-0.110
 a-0.091
 so-0.087
 called-0.083
Token.
Token ID13
Feature activation-1.97e-01

Pos logit contributions
aman+0.002
ach+0.002
oun+0.002
liv+0.002
becca+0.002
Neg logit contributions
 too-0.001
 very-0.001
 a-0.000
 so-0.000
 all-0.000
Token<|endoftext|>
Token ID50256
Feature activation-2.67e-01

Pos logit contributions
 very+0.005
 too+0.005
 a+0.004
 bored+0.003
 mean+0.003
Neg logit contributions
aman-0.010
oun-0.010
 Kel-0.010
ach-0.010
liv-0.010
TokenOnce
Token ID7454
Feature activation+7.16e-01

Pos logit contributions
 too+0.006
 very+0.006
 bored+0.004
 a+0.004
 so+0.004
Neg logit contributions
ach-0.014
aman-0.014
oun-0.014
 Kel-0.014
z-0.014
Token upon
Token ID2402
Feature activation-4.78e-01

Pos logit contributions
aman+0.029
ach+0.029
oun+0.028
 Kel+0.028
liv+0.027
Neg logit contributions
 too-0.026
 very-0.026
 a-0.022
 so-0.021
 called-0.020
Token treat
Token ID2190
Feature activation-7.30e-03

Pos logit contributions
aman+0.043
ach+0.043
 Kel+0.042
ait+0.041
oun+0.040
Neg logit contributions
 very-0.048
 too-0.047
 a-0.041
 so-0.037
 all-0.036
Token.
Token ID13
Feature activation+4.18e-01

Pos logit contributions
 too+0.000
 very+0.000
 a+0.000
 so+0.000
 all+0.000
Neg logit contributions
aman-0.000
ach-0.000
oun-0.000
 Kel-0.000
ait-0.000
Token The
Token ID383
Feature activation+1.65e-02

Pos logit contributions
aman+0.022
ach+0.021
 Kel+0.021
oun+0.021
liv+0.020
Neg logit contributions
 too-0.011
 very-0.010
 a-0.008
 so-0.007
 called-0.007
Token pastry
Token ID41738
Feature activation-5.50e-02

Pos logit contributions
ach+0.001
aman+0.001
 Kel+0.001
oun+0.001
ait+0.001
Neg logit contributions
 very-0.001
 too-0.001
 a-0.000
 so-0.000
 all-0.000
Token was
Token ID373
Feature activation+1.81e-01

Pos logit contributions
 too+0.001
 very+0.001
 a+0.001
 so+0.001
 all+0.001
Neg logit contributions
ach-0.003
oun-0.003
aman-0.003
ait-0.003
 Kel-0.003
Token delic
Token ID8675
Feature activation+2.92e+00

Pos logit contributions
ach+0.013
oun+0.012
aman+0.012
ait+0.012
 Kel+0.012
Neg logit contributions
 very-0.002
 too-0.002
 a-0.001
 so-0.001
 all-0.001
Tokeniously
Token ID6819
Feature activation+3.13e-01

Pos logit contributions
 stret+0.083
Aud+0.083
oun+0.082
liv+0.079
aman+0.077
Neg logit contributions
 too-0.142
 very-0.126
 a-0.118
 so-0.116
 all-0.115
Token sweet
Token ID6029
Feature activation-6.33e-01

Pos logit contributions
oun+0.019
ach+0.018
liv+0.018
aman+0.018
 Kel+0.018
Neg logit contributions
 too-0.008
 very-0.008
 a-0.007
 so-0.006
 all-0.005
Token and
Token ID290
Feature activation-6.42e-01

Pos logit contributions
 too+0.012
 very+0.010
 a+0.007
 so+0.007
 all+0.005
Neg logit contributions
 Kel-0.041
oun-0.041
ach-0.041
liv-0.040
aman-0.040
Token she
Token ID673
Feature activation+3.81e-01

Pos logit contributions
 too+0.018
 very+0.018
 a+0.014
 so+0.013
 called+0.012
Neg logit contributions
ach-0.035
oun-0.034
aman-0.034
ait-0.032
z-0.032
Token knew
Token ID2993
Feature activation-3.22e-01

Pos logit contributions
aman+0.023
ach+0.022
oun+0.022
 Kel+0.021
ait+0.021
Neg logit contributions
 very-0.009
 too-0.009
 a-0.007
 so-0.006
 called-0.006
Token
Token ID198
Feature activation+4.75e-02

Pos logit contributions
 very+0.009
 too+0.009
 a+0.006
 so+0.006
 bored+0.006
Neg logit contributions
ach-0.017
oun-0.017
 Kel-0.017
aman-0.016
liv-0.016
Token
Token ID198
Feature activation+2.86e-02

Pos logit contributions
ach+0.003
aman+0.003
oun+0.003
 Kel+0.003
liv+0.003
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.000
 called-0.000
TokenP
Token ID47
Feature activation+6.22e-01

Pos logit contributions
ach+0.001
aman+0.001
oun+0.001
 Kel+0.001
liv+0.001
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.001
 called-0.001
Tokenoppy
Token ID26696
Feature activation+6.06e-01

Pos logit contributions
ach+0.020
aman+0.020
 Kel+0.020
oun+0.020
liv+0.018
Neg logit contributions
 too-0.029
 very-0.028
 a-0.025
 so-0.023
 called-0.023
Token sees
Token ID7224
Feature activation+3.37e-02

Pos logit contributions
aman+0.027
ach+0.027
oun+0.026
 Kel+0.026
liv+0.025
Neg logit contributions
 too-0.021
 very-0.021
 a-0.018
 so-0.016
 called-0.016
Token Roc
Token ID13545
Feature activation+2.91e+00

Pos logit contributions
 Kel+0.001
ach+0.001
oun+0.001
con+0.001
aman+0.001
Neg logit contributions
 very-0.001
 too-0.001
 decided-0.001
 Amen-0.001
 bored-0.001
Tokenco
Token ID1073
Feature activation-3.13e-01

Pos logit contributions
 Kel+0.102
aman+0.100
oun+0.098
ach+0.098
liv+0.090
Neg logit contributions
 too-0.130
 very-0.124
 a-0.104
 called-0.102
 so-0.098
Token and
Token ID290
Feature activation-3.61e-01

Pos logit contributions
 too+0.011
 very+0.011
 a+0.009
 so+0.009
 bored+0.008
Neg logit contributions
ach-0.013
aman-0.013
oun-0.013
 Kel-0.013
liv-0.012
Token bar
Token ID2318
Feature activation+4.86e-01

Pos logit contributions
 too+0.012
 very+0.012
 a+0.009
 so+0.009
 called+0.009
Neg logit contributions
 Kel-0.015
oun-0.015
ach-0.014
aman-0.014
z-0.014
Tokenks
Token ID591
Feature activation+1.31e-01

Pos logit contributions
ach+0.018
z+0.016
oun+0.016
ait+0.016
ge+0.015
Neg logit contributions
 very-0.018
 too-0.018
 a-0.016
 so-0.016
 called-0.015
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
ach+0.007
z+0.006
oun+0.006
aman+0.006
 Kel+0.006
Neg logit contributions
 very-0.003
 too-0.003
 called-0.002
 bored-0.002
 a-0.002
Token gone
Token ID3750
Feature activation+2.95e-01

Pos logit contributions
 very+0.006
 Amen+0.005
 +0.005
 Yesterday+0.005
 too+0.004
Neg logit contributions
 Kel-0.048
ge-0.047
less-0.046
oun-0.045
z-0.045
Token.
Token ID13
Feature activation+2.94e-01

Pos logit contributions
ach+0.017
aman+0.017
oun+0.017
 Kel+0.017
liv+0.017
Neg logit contributions
 too-0.006
 very-0.005
 a-0.004
 so-0.003
 called-0.003
Token
Token ID220
Feature activation+6.24e-01

Pos logit contributions
aman+0.014
ach+0.014
oun+0.014
 Kel+0.014
liv+0.014
Neg logit contributions
 too-0.008
 very-0.008
 a-0.006
 so-0.006
 bored-0.005
Token
Token ID198
Feature activation+4.63e-01

Pos logit contributions
ach+0.040
aman+0.040
oun+0.038
 Kel+0.038
liv+0.037
Neg logit contributions
 too-0.010
 very-0.009
 a-0.006
 so-0.005
 called-0.004
Token
Token ID198
Feature activation+9.36e-01

Pos logit contributions
ach+0.028
aman+0.028
oun+0.028
 Kel+0.027
liv+0.027
Neg logit contributions
 too-0.008
 very-0.008
 a-0.005
 so-0.004
 called-0.004
TokenDays
Token ID38770
Feature activation+2.91e+00

Pos logit contributions
aman+0.033
ach+0.032
oun+0.030
 Kel+0.030
ait+0.030
Neg logit contributions
 very-0.042
 too-0.042
 a-0.037
 so-0.034
 called-0.034
Token went
Token ID1816
Feature activation+4.40e-01

Pos logit contributions
ach+0.159
aman+0.159
oun+0.153
 Kel+0.151
liv+0.150
Neg logit contributions
 too-0.072
 very-0.070
 a-0.054
 so-0.047
 called-0.045
Token by
Token ID416
Feature activation+1.32e+00

Pos logit contributions
aman+0.022
ach+0.022
oun+0.022
 Kel+0.021
liv+0.021
Neg logit contributions
 too-0.013
 very-0.013
 a-0.010
 so-0.009
 called-0.009
Token and
Token ID290
Feature activation+2.95e-01

Pos logit contributions
aman+0.082
ach+0.081
oun+0.080
 Kel+0.078
liv+0.078
Neg logit contributions
 too-0.024
 very-0.024
 a-0.017
 so-0.014
 called-0.012
Token Max
Token ID5436
Feature activation+2.64e-01

Pos logit contributions
ach+0.014
aman+0.014
oun+0.013
 Kel+0.013
liv+0.013
Neg logit contributions
 too-0.010
 very-0.010
 a-0.008
 so-0.007
 called-0.007
Token's
Token ID338
Feature activation+6.12e-01

Pos logit contributions
ach+0.015
aman+0.014
oun+0.014
 Kel+0.014
liv+0.014
Neg logit contributions
 too-0.007
 very-0.007
 a-0.005
 so-0.004
 called-0.004
Token big
Token ID1263
Feature activation+2.14e-01

Pos logit contributions
ach+0.016
aman+0.016
oun+0.016
 Kel+0.015
liv+0.015
Neg logit contributions
 very-0.010
 too-0.010
 a-0.008
 so-0.007
 called-0.007
Token book
Token ID1492
Feature activation+2.70e-01

Pos logit contributions
ach+0.009
aman+0.009
oun+0.009
 Kel+0.009
liv+0.009
Neg logit contributions
 too-0.007
 very-0.007
 a-0.006
 so-0.006
 called-0.005
Token with
Token ID351
Feature activation+1.59e-01

Pos logit contributions
ach+0.013
aman+0.013
oun+0.013
z+0.013
 Kel+0.013
Neg logit contributions
 too-0.007
 very-0.007
 a-0.006
 bored-0.005
 so-0.005
Token pictures
Token ID5986
Feature activation+9.54e-02

Pos logit contributions
ach+0.009
aman+0.009
oun+0.009
 Kel+0.009
liv+0.009
Neg logit contributions
 too-0.003
 very-0.003
 a-0.002
 so-0.002
 called-0.001
Token of
Token ID286
Feature activation+5.51e-01

Pos logit contributions
ach+0.005
aman+0.005
oun+0.004
z+0.004
 Kel+0.004
Neg logit contributions
 very-0.002
 too-0.002
 a-0.002
 bored-0.002
 hungry-0.002
Token volcan
Token ID17516
Feature activation+2.91e+00

Pos logit contributions
aman+0.030
ach+0.030
oun+0.029
 Kel+0.029
liv+0.029
Neg logit contributions
 too-0.013
 very-0.013
 a-0.010
 so-0.009
 called-0.008
Tokenoes
Token ID3028
Feature activation+2.01e-01

Pos logit contributions
aman+0.111
ach+0.110
oun+0.106
 Kel+0.105
liv+0.103
Neg logit contributions
 too-0.118
 very-0.116
 a-0.099
 so-0.093
 called-0.090
Token.
Token ID13
Feature activation-6.53e-01

Pos logit contributions
ach+0.011
aman+0.011
oun+0.011
 Kel+0.011
z+0.011
Neg logit contributions
 very-0.004
 too-0.004
 a-0.002
 bored-0.002
 tired-0.002
Token Vol
Token ID4709
Feature activation+9.98e-01

Pos logit contributions
 too+0.015
 very+0.015
 a+0.011
 so+0.010
 called+0.009
Neg logit contributions
ach-0.036
aman-0.036
oun-0.035
 Kel-0.035
liv-0.034
Tokencan
Token ID5171
Feature activation-9.43e-01

Pos logit contributions
aman+0.035
ach+0.035
oun+0.033
 Kel+0.033
liv+0.032
Neg logit contributions
 too-0.043
 very-0.043
 a-0.037
 so-0.035
 called-0.034
Tokenoes
Token ID3028
Feature activation-5.57e-01

Pos logit contributions
 too+0.029
 very+0.029
 a+0.023
 so+0.021
 called+0.020
Neg logit contributions
ach-0.044
aman-0.044
oun-0.043
 Kel-0.042
liv-0.041
Token the
Token ID262
Feature activation-2.52e-01

Pos logit contributions
ach+0.010
aman+0.010
oun+0.009
 Kel+0.009
liv+0.009
Neg logit contributions
 too-0.004
 very-0.004
 a-0.003
 so-0.003
 all-0.003
Token ingredients
Token ID9391
Feature activation-1.63e-01

Pos logit contributions
 very+0.011
 too+0.010
 a+0.009
 so+0.009
 all+0.008
Neg logit contributions
liv-0.010
oun-0.010
aman-0.010
 Kel-0.010
ach-0.010
Token to
Token ID284
Feature activation-4.18e-01

Pos logit contributions
 too+0.003
 very+0.003
 a+0.002
 so+0.002
 all+0.002
Neg logit contributions
aman-0.010
ach-0.010
oun-0.010
 Kel-0.010
liv-0.010
Token make
Token ID787
Feature activation-7.55e-01

Pos logit contributions
 too+0.016
 very+0.015
 a+0.013
 so+0.012
 called+0.012
Neg logit contributions
ach-0.017
aman-0.017
oun-0.017
 Kel-0.016
liv-0.016
Token her
Token ID607
Feature activation-8.14e-01

Pos logit contributions
 very+0.022
 too+0.022
 a+0.019
 so+0.016
 all+0.015
Neg logit contributions
oun-0.040
aman-0.039
ach-0.039
liv-0.038
 Kel-0.038
Token delic
Token ID8675
Feature activation+2.90e+00

Pos logit contributions
 very+0.030
 too+0.030
 a+0.025
 so+0.023
 hungry+0.022
Neg logit contributions
aman-0.036
ach-0.036
oun-0.036
 Kel-0.035
liv-0.035
Tokeniously
Token ID6819
Feature activation-2.27e-03

Pos logit contributions
aman+0.083
ach+0.081
oun+0.079
 Kel+0.078
liv+0.077
Neg logit contributions
 too-0.144
 very-0.140
 a-0.123
 so-0.119
 called-0.117
Token sweet
Token ID6029
Feature activation-7.28e-01

Pos logit contributions
 very+0.000
 too+0.000
 a+0.000
 so+0.000
 hungry+0.000
Neg logit contributions
aman-0.000
liv-0.000
oun-0.000
ach-0.000
 Kel-0.000
Token past
Token ID1613
Feature activation+5.20e-01

Pos logit contributions
 too+0.014
 very+0.013
 a+0.009
 so+0.008
 called+0.007
Neg logit contributions
aman-0.044
ach-0.044
oun-0.043
 Kel-0.042
liv-0.042
Tokenries
Token ID1678
Feature activation-3.28e-01

Pos logit contributions
aman+0.015
ach+0.015
oun+0.015
 Kel+0.014
liv+0.014
Neg logit contributions
 too-0.026
 very-0.026
 a-0.023
 so-0.022
 all-0.021
Token.
Token ID13
Feature activation-6.84e-01

Pos logit contributions
 too+0.005
 very+0.005
 a+0.003
 so+0.003
 called+0.002
Neg logit contributions
ach-0.020
aman-0.020
oun-0.020
 Kel-0.019
liv-0.019
Token big
Token ID1263
Feature activation+1.96e-01

Pos logit contributions
ach+0.015
aman+0.015
oun+0.014
 Kel+0.014
liv+0.014
Neg logit contributions
 very-0.009
 too-0.009
 a-0.007
 so-0.006
 called-0.006
Token book
Token ID1492
Feature activation+2.37e-01

Pos logit contributions
ach+0.009
aman+0.009
oun+0.008
 Kel+0.008
liv+0.008
Neg logit contributions
 too-0.007
 very-0.007
 a-0.005
 so-0.005
 called-0.005
Token with
Token ID351
Feature activation+1.56e-01

Pos logit contributions
ach+0.011
aman+0.011
oun+0.011
z+0.011
 Kel+0.011
Neg logit contributions
 too-0.007
 very-0.007
 a-0.005
 bored-0.005
 so-0.005
Token pictures
Token ID5986
Feature activation+1.04e-01

Pos logit contributions
ach+0.009
aman+0.009
oun+0.009
 Kel+0.009
liv+0.009
Neg logit contributions
 too-0.003
 very-0.003
 a-0.002
 so-0.001
 called-0.001
Token of
Token ID286
Feature activation+5.63e-01

Pos logit contributions
ach+0.005
aman+0.005
oun+0.005
z+0.005
 Kel+0.005
Neg logit contributions
 very-0.003
 too-0.003
 a-0.002
 bored-0.002
 hungry-0.002
Token volcan
Token ID17516
Feature activation+2.86e+00

Pos logit contributions
ach+0.031
aman+0.031
oun+0.030
 Kel+0.029
liv+0.029
Neg logit contributions
 too-0.014
 very-0.013
 a-0.010
 so-0.009
 called-0.008
Tokenoes
Token ID3028
Feature activation+2.04e-01

Pos logit contributions
aman+0.108
ach+0.108
oun+0.104
 Kel+0.103
liv+0.100
Neg logit contributions
 too-0.117
 very-0.115
 a-0.098
 so-0.092
 called-0.090
Token.
Token ID13
Feature activation-6.28e-01

Pos logit contributions
ach+0.012
aman+0.012
oun+0.011
 Kel+0.011
z+0.011
Neg logit contributions
 very-0.004
 too-0.004
 a-0.002
 bored-0.002
 so-0.002
Token Vol
Token ID4709
Feature activation+1.03e+00

Pos logit contributions
 too+0.015
 very+0.015
 a+0.011
 so+0.010
 called+0.009
Neg logit contributions
ach-0.034
aman-0.034
oun-0.033
 Kel-0.033
liv-0.033
Tokencan
Token ID5171
Feature activation-9.23e-01

Pos logit contributions
aman+0.036
ach+0.036
oun+0.034
 Kel+0.034
liv+0.033
Neg logit contributions
 too-0.044
 very-0.044
 a-0.038
 so-0.036
 called-0.035
Tokenoes
Token ID3028
Feature activation-5.23e-01

Pos logit contributions
 too+0.027
 very+0.027
 a+0.022
 so+0.020
 called+0.019
Neg logit contributions
ach-0.045
aman-0.045
oun-0.043
 Kel-0.043
liv-0.042
Token day
Token ID1110
Feature activation+2.68e-01

Pos logit contributions
ach+0.015
aman+0.015
oun+0.014
 Kel+0.014
liv+0.014
Neg logit contributions
 too-0.009
 very-0.009
 a-0.007
 so-0.007
 called-0.007
Token,
Token ID11
Feature activation+4.65e-01

Pos logit contributions
ach+0.016
aman+0.016
oun+0.015
 Kel+0.015
liv+0.015
Neg logit contributions
 too-0.005
 very-0.005
 a-0.003
 so-0.003
 called-0.003
Token a
Token ID257
Feature activation-7.36e-02

Pos logit contributions
 Kel+0.015
ach+0.015
aman+0.015
oun+0.014
liv+0.014
Neg logit contributions
 too-0.020
 very-0.020
 a-0.017
 bored-0.016
 so-0.016
Token chocolate
Token ID11311
Feature activation-6.92e-02

Pos logit contributions
 too+0.003
 very+0.002
 a+0.002
 so+0.002
 bored+0.002
Neg logit contributions
aman-0.003
ach-0.003
oun-0.003
 Kel-0.003
liv-0.003
Token-
Token ID12
Feature activation+3.63e-01

Pos logit contributions
 very+0.003
 too+0.003
 a+0.002
 called+0.002
 so+0.002
Neg logit contributions
ach-0.003
aman-0.003
 Kel-0.003
oun-0.003
liv-0.003
Tokencol
Token ID4033
Feature activation+2.85e+00

Pos logit contributions
aman+0.012
ach+0.012
oun+0.011
 Kel+0.010
ait+0.010
Neg logit contributions
 very-0.017
 too-0.017
 a-0.015
 so-0.014
 called-0.014
Tokenoured
Token ID8167
Feature activation-6.42e-01

Pos logit contributions
 policy+0.078
ach+0.077
 occasions+0.076
 Kel+0.074
liv+0.073
Neg logit contributions
 too-0.137
 very-0.120
 a-0.109
 all-0.107
 tired-0.104
Token whip
Token ID22972
Feature activation-8.97e-01

Pos logit contributions
 too+0.024
 very+0.023
 a+0.020
 so+0.018
 called+0.017
Neg logit contributions
liv-0.029
ach-0.028
aman-0.028
 Kel-0.028
oun-0.028
Token named
Token ID3706
Feature activation+1.14e+00

Pos logit contributions
 too+0.021
 very+0.021
 a+0.015
 bored+0.014
 tired+0.014
Neg logit contributions
ach-0.044
aman-0.043
z-0.043
oun-0.042
 Kel-0.042
Token C
Token ID327
Feature activation+2.09e-01

Pos logit contributions
ach+0.047
aman+0.046
z+0.043
oun+0.043
liv+0.043
Neg logit contributions
 too-0.045
 very-0.044
 a-0.040
 so-0.036
 called-0.034
Tokenheddar
Token ID44937
Feature activation+9.48e-01

Pos logit contributions
ach+0.005
aman+0.005
oun+0.005
 Kel+0.005
liv+0.005
Neg logit contributions
 too-0.011
 very-0.011
 a-0.010
 so-0.009
 called-0.009
Token a
Token ID257
Feature activation-3.61e-03

Pos logit contributions
 too+0.004
 very+0.004
 a+0.002
 bored+0.002
 so+0.002
Neg logit contributions
aman-0.014
ach-0.014
 Kel-0.014
oun-0.013
liv-0.013
Token strange
Token ID6283
Feature activation-8.98e-02

Pos logit contributions
 very+0.000
 too+0.000
 a+0.000
 so+0.000
 called+0.000
Neg logit contributions
ach-0.000
aman-0.000
oun-0.000
 Kel-0.000
liv-0.000
Token,
Token ID11
Feature activation+6.48e-01

Pos logit contributions
 too+0.003
 very+0.003
 a+0.003
 so+0.002
 called+0.002
Neg logit contributions
ach-0.004
aman-0.004
oun-0.004
 Kel-0.004
liv-0.004
Token raspberry
Token ID38973
Feature activation+5.94e-01

Pos logit contributions
ach+0.030
aman+0.030
 Kel+0.029
oun+0.028
liv+0.028
Neg logit contributions
 too-0.020
 very-0.019
 a-0.016
 so-0.015
 called-0.014
Token-
Token ID12
Feature activation+7.80e-01

Pos logit contributions
aman+0.030
oun+0.030
ach+0.030
 Kel+0.029
ait+0.029
Neg logit contributions
 too-0.018
 very-0.017
 a-0.014
 so-0.013
 called-0.012
Tokencol
Token ID4033
Feature activation+2.85e+00

Pos logit contributions
aman+0.023
ach+0.023
oun+0.022
 Kel+0.021
liv+0.021
Neg logit contributions
 too-0.038
 very-0.038
 a-0.034
 so-0.032
 called-0.031
Tokenoured
Token ID8167
Feature activation-8.24e-01

Pos logit contributions
 Kel+0.073
ach+0.070
oun+0.067
liv+0.066
 policy+0.064
Neg logit contributions
 too-0.145
 very-0.131
 a-0.119
 bored-0.115
 all-0.115
Token object
Token ID2134
Feature activation+5.71e-02

Pos logit contributions
 too+0.030
 very+0.030
 a+0.025
 so+0.023
 called+0.022
Neg logit contributions
aman-0.036
ach-0.036
oun-0.035
 Kel-0.035
liv-0.035
Token hiding
Token ID11816
Feature activation+3.28e-01

Pos logit contributions
aman+0.003
ach+0.003
oun+0.003
 Kel+0.003
liv+0.003
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.001
 called-0.001
Token under
Token ID739
Feature activation+4.30e-01

Pos logit contributions
ach+0.014
aman+0.014
 Kel+0.013
oun+0.013
z+0.013
Neg logit contributions
 too-0.011
 very-0.011
 a-0.009
 so-0.008
 bored-0.008
Token a
Token ID257
Feature activation-7.78e-01

Pos logit contributions
aman+0.027
ach+0.027
 Kel+0.026
oun+0.026
liv+0.025
Neg logit contributions
 too-0.007
 very-0.006
 a-0.003
 so-0.003
 bored-0.003
Token reliable
Token ID9314
Feature activation+2.01e-01

Pos logit contributions
ach+0.036
aman+0.034
oun+0.034
 Kel+0.034
liv+0.033
Neg logit contributions
 too-0.016
 very-0.015
 a-0.011
 called-0.010
 so-0.010
Token that
Token ID326
Feature activation-2.00e-01

Pos logit contributions
aman+0.013
oun+0.013
liv+0.012
 Kel+0.012
ait+0.012
Neg logit contributions
 too-0.005
 very-0.004
 a-0.003
 so-0.003
 all-0.002
Token it
Token ID340
Feature activation+3.24e-01

Pos logit contributions
 too+0.005
 very+0.005
 a+0.003
 so+0.003
 all+0.003
Neg logit contributions
ach-0.011
aman-0.011
oun-0.011
 Kel-0.011
liv-0.011
Token had
Token ID550
Feature activation+4.40e-02

Pos logit contributions
aman+0.019
ach+0.019
oun+0.018
 Kel+0.018
liv+0.018
Neg logit contributions
 too-0.007
 very-0.007
 a-0.005
 so-0.005
 called-0.004
Token out
Token ID503
Feature activation+1.69e+00

Pos logit contributions
ach+0.002
aman+0.002
oun+0.002
 Kel+0.002
z+0.002
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.001
 all-0.001
Tokensm
Token ID5796
Feature activation+2.85e+00

Pos logit contributions
aman+0.089
ach+0.088
oun+0.087
liv+0.084
 Kel+0.084
Neg logit contributions
 too-0.048
 very-0.048
 a-0.038
 so-0.034
 all-0.032
Tokenarted
Token ID19112
Feature activation+4.78e-01

Pos logit contributions
ach+0.087
aman+0.087
oun+0.083
 Kel+0.081
liv+0.079
Neg logit contributions
 too-0.137
 very-0.136
 a-0.119
 so-0.113
 called-0.111
Token the
Token ID262
Feature activation-1.73e-02

Pos logit contributions
aman+0.030
oun+0.030
liv+0.030
ach+0.030
becca+0.029
Neg logit contributions
 too-0.010
 very-0.010
 a-0.007
 so-0.006
 all-0.006
Token big
Token ID1263
Feature activation+6.40e-01

Pos logit contributions
 too+0.001
 very+0.001
 a+0.000
 so+0.000
 all+0.000
Neg logit contributions
aman-0.001
ach-0.001
 Kel-0.001
liv-0.001
oun-0.001
Token scary
Token ID14343
Feature activation+8.05e-01

Pos logit contributions
aman+0.033
ach+0.032
oun+0.032
liv+0.032
 Kel+0.031
Neg logit contributions
 too-0.019
 very-0.018
 a-0.015
 so-0.013
 all-0.013
Token shark
Token ID21027
Feature activation-6.94e-01

Pos logit contributions
aman+0.041
liv+0.040
 Kel+0.040
oun+0.039
ait+0.039
Neg logit contributions
 too-0.026
 very-0.025
 a-0.021
 so-0.019
 all-0.018
Token also
Token ID635
Feature activation-5.01e-01

Pos logit contributions
ach+0.002
aman+0.002
oun+0.002
 Kel+0.002
liv+0.002
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.001
 called-0.001
Token added
Token ID2087
Feature activation-2.14e-01

Pos logit contributions
 too+0.015
 very+0.014
 a+0.011
 so+0.011
 bored+0.010
Neg logit contributions
aman-0.023
ach-0.023
z-0.022
 Kel-0.022
oun-0.022
Token some
Token ID617
Feature activation+1.43e+00

Pos logit contributions
 too+0.003
 very+0.003
 a+0.002
 so+0.002
 all+0.001
Neg logit contributions
aman-0.014
ach-0.014
oun-0.014
liv-0.014
 Kel-0.014
Token spicy
Token ID26880
Feature activation+1.33e+00

Pos logit contributions
aman+0.069
ach+0.067
liv+0.067
oun+0.066
becca+0.064
Neg logit contributions
 too-0.052
 very-0.052
 a-0.045
 so-0.041
 called-0.038
Token-
Token ID12
Feature activation+6.20e-01

Pos logit contributions
aman+0.061
liv+0.060
becca+0.057
oun+0.056
ait+0.056
Neg logit contributions
 too-0.053
 very-0.051
 a-0.046
 so-0.040
 all-0.039
Tokensm
Token ID5796
Feature activation+2.85e+00

Pos logit contributions
aman+0.023
ach+0.022
oun+0.021
 Kel+0.021
liv+0.020
Neg logit contributions
 very-0.027
 too-0.027
 a-0.025
 so-0.022
 called-0.021
Tokenelling
Token ID9417
Feature activation+8.70e-01

Pos logit contributions
aman+0.088
ach+0.088
oun+0.083
liv+0.082
 Kel+0.081
Neg logit contributions
 too-0.137
 very-0.134
 a-0.117
 so-0.112
 called-0.109
Token flowers
Token ID12734
Feature activation+5.54e-01

Pos logit contributions
aman+0.039
ach+0.039
liv+0.039
oun+0.038
becca+0.038
Neg logit contributions
 too-0.033
 very-0.032
 a-0.029
 so-0.025
 called-0.024
Token to
Token ID284
Feature activation+2.74e-01

Pos logit contributions
aman+0.032
ach+0.032
oun+0.032
liv+0.031
 Kel+0.031
Neg logit contributions
 too-0.013
 very-0.013
 a-0.009
 so-0.008
 called-0.007
Token her
Token ID607
Feature activation-1.84e-01

Pos logit contributions
ach+0.014
aman+0.014
oun+0.014
 Kel+0.013
z+0.013
Neg logit contributions
 too-0.008
 very-0.007
 a-0.006
 so-0.005
 called-0.005
Token dress
Token ID6576
Feature activation-1.57e-01

Pos logit contributions
 very+0.006
 too+0.006
 a+0.005
 so+0.004
 all+0.004
Neg logit contributions
ach-0.010
oun-0.010
liv-0.010
aman-0.009
 Kel-0.009
Token just
Token ID655
Feature activation-6.50e-01

Pos logit contributions
 too+0.026
 very+0.026
 a+0.021
 so+0.020
 hungry+0.019
Neg logit contributions
ach-0.041
aman-0.041
oun-0.040
 Kel-0.040
liv-0.039
Token wanted
Token ID2227
Feature activation-4.33e-02

Pos logit contributions
 too+0.017
 very+0.016
 a+0.012
 so+0.012
 bored+0.011
Neg logit contributions
ach-0.030
 Kel-0.029
aman-0.029
oun-0.029
less-0.028
Token to
Token ID284
Feature activation+7.66e-01

Pos logit contributions
 too+0.001
 very+0.001
 a+0.001
 so+0.001
 all+0.001
Neg logit contributions
aman-0.003
oun-0.003
liv-0.002
ach-0.002
z-0.002
Token write
Token ID3551
Feature activation-1.92e-01

Pos logit contributions
aman+0.042
ach+0.041
oun+0.041
liv+0.040
z+0.040
Neg logit contributions
 too-0.020
 very-0.020
 a-0.017
 so-0.014
 called-0.013
Token about
Token ID546
Feature activation+5.94e-01

Pos logit contributions
 too+0.004
 very+0.004
 a+0.003
 so+0.003
 all+0.002
Neg logit contributions
aman-0.011
ach-0.011
oun-0.011
 Kel-0.011
liv-0.011
Token volcan
Token ID17516
Feature activation+2.85e+00

Pos logit contributions
aman+0.031
ach+0.031
oun+0.030
 Kel+0.030
liv+0.029
Neg logit contributions
 too-0.016
 very-0.016
 a-0.013
 so-0.011
 called-0.010
Tokenoes
Token ID3028
Feature activation+2.39e-01

Pos logit contributions
aman+0.107
ach+0.107
oun+0.103
 Kel+0.102
liv+0.099
Neg logit contributions
 too-0.116
 very-0.115
 a-0.098
 so-0.092
 called-0.090
Token.
Token ID13
Feature activation+1.47e-01

Pos logit contributions
ach+0.014
aman+0.014
oun+0.013
 Kel+0.013
z+0.013
Neg logit contributions
 too-0.005
 very-0.005
 a-0.003
 so-0.003
 called-0.003
Token We
Token ID775
Feature activation-1.22e+00

Pos logit contributions
ach+0.005
oun+0.005
aman+0.005
 Kel+0.005
ait+0.004
Neg logit contributions
 very-0.006
 too-0.006
 a-0.005
 all-0.005
 so-0.005
Token like
Token ID588
Feature activation-8.82e-01

Pos logit contributions
 too+0.035
 very+0.035
 a+0.027
 so+0.026
 hungry+0.024
Neg logit contributions
ach-0.057
aman-0.056
oun-0.056
 Kel-0.054
liv-0.053
Token volcan
Token ID17516
Feature activation+2.40e+00

Pos logit contributions
 too+0.028
 very+0.028
 a+0.023
 so+0.021
 called+0.020
Neg logit contributions
ach-0.041
aman-0.040
oun-0.039
 Kel-0.039
liv-0.038
Token sunshine
Token ID34488
Feature activation-7.50e-01

Pos logit contributions
ach+0.004
aman+0.004
oun+0.004
 Kel+0.004
z+0.004
Neg logit contributions
 very-0.003
 too-0.003
 a-0.002
 so-0.002
 called-0.002
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
 too+0.012
 very+0.012
 a+0.007
 so+0.006
 called+0.005
Neg logit contributions
ach-0.047
aman-0.047
oun-0.046
 Kel-0.045
liv-0.045
Token Molly
Token ID30236
Feature activation+6.86e-01

Pos logit contributions
aman+0.001
oun+0.001
ach+0.001
 Kel+0.001
liv+0.001
Neg logit contributions
 very-0.000
 too-0.000
 a-0.000
 so-0.000
 bored-0.000
Token liked
Token ID8288
Feature activation-4.18e-01

Pos logit contributions
ach+0.043
aman+0.042
oun+0.041
umm+0.041
 Kel+0.040
Neg logit contributions
 very-0.015
 too-0.014
 a-0.010
 called-0.010
 so-0.009
Token the
Token ID262
Feature activation+4.07e-02

Pos logit contributions
 too+0.012
 very+0.012
 a+0.010
 so+0.009
 all+0.008
Neg logit contributions
ach-0.022
oun-0.022
aman-0.022
liv-0.021
z-0.021
Token delic
Token ID8675
Feature activation+2.85e+00

Pos logit contributions
ach+0.002
aman+0.002
oun+0.002
 Kel+0.002
liv+0.002
Neg logit contributions
 very-0.001
 too-0.001
 a-0.001
 so-0.001
 called-0.001
Tokenately
Token ID1286
Feature activation+1.96e-01

Pos logit contributions
liv+0.076
oun+0.075
aman+0.075
 Kel+0.073
 stret+0.071
Neg logit contributions
 too-0.141
 very-0.135
 a-0.120
 so-0.117
 called-0.116
Token pet
Token ID4273
Feature activation-6.91e-01

Pos logit contributions
oun+0.010
ach+0.010
aman+0.010
ait+0.009
liv+0.009
Neg logit contributions
 very-0.006
 too-0.006
 a-0.005
 so-0.005
 called-0.004
Tokenalled
Token ID4262
Feature activation-5.34e-01

Pos logit contributions
 too+0.022
 very+0.022
 a+0.017
 so+0.016
 called+0.016
Neg logit contributions
ach-0.034
aman-0.033
oun-0.033
liv-0.032
 Kel-0.032
Token yellow
Token ID7872
Feature activation-9.05e-01

Pos logit contributions
 very+0.012
 too+0.012
 a+0.009
 so+0.008
 all+0.007
Neg logit contributions
aman-0.032
oun-0.032
ach-0.032
liv-0.031
ait-0.031
Token ones
Token ID3392
Feature activation+6.06e-01

Pos logit contributions
 too+0.030
 very+0.029
 a+0.024
 so+0.022
 all+0.021
Neg logit contributions
liv-0.043
ach-0.043
aman-0.043
oun-0.043
ait-0.042
Token that
Token ID326
Feature activation+6.89e-02

Pos logit contributions
oun+0.015
aman+0.015
 Kel+0.015
liv+0.015
ach+0.015
Neg logit contributions
 too-0.008
 very-0.008
 a-0.007
 so-0.006
 all-0.006
Token!"
Token ID2474
Feature activation+1.96e-01

Pos logit contributions
ach+0.003
aman+0.003
oun+0.003
 Kel+0.003
liv+0.003
Neg logit contributions
 too-0.002
 very-0.002
 a-0.002
 so-0.002
 called-0.002
Token
Token ID220
Feature activation+5.73e-01

Pos logit contributions
aman+0.011
ach+0.011
oun+0.011
 Kel+0.011
liv+0.011
Neg logit contributions
 too-0.004
 very-0.004
 a-0.002
 bored-0.002
 so-0.002
Token
Token ID198
Feature activation+8.52e-01

Pos logit contributions
ach+0.040
aman+0.039
umm+0.039
 occasions+0.038
 Kel+0.038
Neg logit contributions
 too-0.006
 very-0.006
 a-0.003
 -0.001
 so-0.001
Token
Token ID198
Feature activation+1.04e+00

Pos logit contributions
ach+0.055
aman+0.055
oun+0.053
 Kel+0.053
ait+0.052
Neg logit contributions
 very-0.013
 too-0.013
 a-0.008
 so-0.006
 called-0.005
TokenCharl
Token ID24453
Feature activation+2.85e+00

Pos logit contributions
ach+0.035
oun+0.033
aman+0.032
 Kel+0.032
umm+0.031
Neg logit contributions
 very-0.050
 too-0.049
 a-0.044
 so-0.041
 called-0.039
Tokenotte
Token ID11404
Feature activation+6.01e-02

Pos logit contributions
umm+0.091
ach+0.089
con+0.089
aman+0.086
oun+0.085
Neg logit contributions
 too-0.119
 very-0.115
 a-0.097
 so-0.092
 called-0.092
Token thanked
Token ID26280
Feature activation+4.84e-01

Pos logit contributions
ach+0.003
aman+0.003
oun+0.003
 Kel+0.003
ait+0.003
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.001
 called-0.001
Token her
Token ID607
Feature activation+1.47e-01

Pos logit contributions
aman+0.025
ach+0.024
 Kel+0.024
oun+0.023
liv+0.023
Neg logit contributions
 too-0.013
 very-0.013
 bored-0.009
 a-0.009
 hungry-0.009
Token mother
Token ID2802
Feature activation-4.04e-01

Pos logit contributions
aman+0.005
ach+0.005
oun+0.005
 Kel+0.005
becca+0.004
Neg logit contributions
 too-0.006
 very-0.006
 a-0.005
 so-0.005
 bored-0.005
Token for
Token ID329
Feature activation-5.86e-01

Pos logit contributions
 too+0.009
 very+0.008
 a+0.006
 so+0.005
 called+0.005
Neg logit contributions
aman-0.023
ach-0.023
oun-0.022
 Kel-0.022
liv-0.022
Token replied
Token ID8712
Feature activation+1.05e+00

Pos logit contributions
ach+0.008
aman+0.008
oun+0.008
 Kel+0.008
liv+0.008
Neg logit contributions
 too-0.006
 very-0.006
 a-0.005
 so-0.004
 called-0.004
Token Jim
Token ID5395
Feature activation+4.96e-01

Pos logit contributions
ach+0.039
aman+0.039
oun+0.037
 Kel+0.037
liv+0.036
Neg logit contributions
 too-0.044
 very-0.043
 a-0.037
 so-0.035
 called-0.034
Token.
Token ID13
Feature activation-6.82e-01

Pos logit contributions
ach+0.030
aman+0.030
oun+0.030
 Kel+0.029
liv+0.029
Neg logit contributions
 too-0.008
 very-0.008
 a-0.005
 so-0.004
 called-0.004
Token
Token ID198
Feature activation+3.27e-01

Pos logit contributions
 too+0.016
 very+0.016
 a+0.012
 so+0.011
 bored+0.011
Neg logit contributions
aman-0.036
ach-0.036
oun-0.035
 Kel-0.035
liv-0.034
Token
Token ID198
Feature activation+4.32e-01

Pos logit contributions
aman+0.021
ach+0.021
 Kel+0.020
oun+0.020
ait+0.020
Neg logit contributions
 too-0.005
 very-0.005
 a-0.003
 so-0.002
 called-0.002
TokenDays
Token ID38770
Feature activation+2.83e+00

Pos logit contributions
aman+0.015
 Kel+0.015
oun+0.014
ach+0.014
ait+0.014
Neg logit contributions
 too-0.019
 very-0.019
 a-0.017
 so-0.016
 called-0.015
Token passed
Token ID3804
Feature activation-2.34e-02

Pos logit contributions
aman+0.157
ach+0.156
oun+0.152
 Kel+0.151
liv+0.148
Neg logit contributions
 too-0.069
 very-0.066
 a-0.051
 so-0.044
 called-0.042
Token and
Token ID290
Feature activation+1.59e-02

Pos logit contributions
 too+0.001
 very+0.001
 a+0.000
 so+0.000
 called+0.000
Neg logit contributions
aman-0.001
ach-0.001
oun-0.001
 Kel-0.001
liv-0.001
Token Bob
Token ID5811
Feature activation+6.46e-01

Pos logit contributions
ach+0.001
oun+0.001
aman+0.001
 Kel+0.001
z+0.001
Neg logit contributions
 too-0.000
 very-0.000
 a-0.000
 so-0.000
 all-0.000
Token noticed
Token ID6810
Feature activation+2.33e-01

Pos logit contributions
aman+0.037
ach+0.037
oun+0.037
 Kel+0.036
liv+0.035
Neg logit contributions
 too-0.015
 very-0.015
 a-0.011
 so-0.010
 called-0.010
Token something
Token ID1223
Feature activation+9.34e-01

Pos logit contributions
ach+0.013
aman+0.013
oun+0.013
 Kel+0.013
liv+0.012
Neg logit contributions
 too-0.005
 very-0.005
 a-0.003
 so-0.003
 called-0.003
Token she
Token ID673
Feature activation+5.98e-01

Pos logit contributions
 too+0.014
 very+0.014
 a+0.011
 so+0.009
 all+0.009
Neg logit contributions
ach-0.030
oun-0.030
aman-0.030
z-0.029
liv-0.028
Token got
Token ID1392
Feature activation+3.17e-01

Pos logit contributions
aman+0.036
ach+0.036
oun+0.035
 Kel+0.034
liv+0.034
Neg logit contributions
 too-0.013
 very-0.013
 a-0.009
 so-0.008
 called-0.008
Token rid
Token ID5755
Feature activation+3.08e-01

Pos logit contributions
aman+0.018
ach+0.018
oun+0.018
liv+0.017
 Kel+0.017
Neg logit contributions
 too-0.008
 very-0.008
 a-0.006
 so-0.005
 bored-0.005
Token of
Token ID286
Feature activation+3.96e-02

Pos logit contributions
aman+0.017
ach+0.016
oun+0.016
becca+0.016
liv+0.016
Neg logit contributions
 too-0.009
 very-0.008
 a-0.007
 so-0.006
 all-0.006
Token the
Token ID262
Feature activation-1.16e-01

Pos logit contributions
oun+0.002
ach+0.002
aman+0.002
z+0.002
liv+0.002
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.000
 all-0.000
Token intr
Token ID9913
Feature activation+2.81e+00

Pos logit contributions
 very+0.004
 too+0.004
 a+0.003
 hungry+0.003
 mean+0.003
Neg logit contributions
oun-0.006
liv-0.006
aman-0.006
ach-0.006
 Kel-0.006
Tokenuders
Token ID48739
Feature activation-9.95e-02

Pos logit contributions
 Kel+0.074
oun+0.073
aman+0.071
liv+0.071
less+0.067
Neg logit contributions
 too-0.145
 very-0.141
 a-0.125
 called-0.122
 so-0.120
Token.
Token ID13
Feature activation+4.33e-02

Pos logit contributions
 too+0.002
 very+0.002
 a+0.001
 so+0.001
 all+0.001
Neg logit contributions
aman-0.006
ach-0.006
oun-0.006
liv-0.006
becca-0.006
Token She
Token ID1375
Feature activation-3.41e-02

Pos logit contributions
aman+0.002
ach+0.002
 Kel+0.002
oun+0.002
liv+0.002
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.001
 bored-0.001
Token did
Token ID750
Feature activation-1.06e-01

Pos logit contributions
 too+0.001
 very+0.001
 a+0.001
 so+0.001
 called+0.001
Neg logit contributions
ach-0.002
aman-0.002
oun-0.002
 Kel-0.002
liv-0.002
Token not
Token ID407
Feature activation-3.58e-01

Pos logit contributions
 too+0.002
 very+0.002
 a+0.001
 so+0.001
 called+0.001
Neg logit contributions
ach-0.006
aman-0.006
oun-0.006
 Kel-0.006
liv-0.006
Token It
Token ID632
Feature activation-9.26e-01

Pos logit contributions
 very+0.005
 too+0.005
 bored+0.004
 so+0.004
 tired+0.004
Neg logit contributions
ach-0.007
 Kel-0.007
aman-0.007
oun-0.007
ait-0.007
Token was
Token ID373
Feature activation-7.55e-01

Pos logit contributions
 too+0.032
 very+0.029
 bored+0.026
 a+0.025
 tired+0.024
Neg logit contributions
 Kel-0.037
oun-0.037
z-0.036
aman-0.035
liv-0.035
Token a
Token ID257
Feature activation-1.32e+00

Pos logit contributions
 too+0.014
 very+0.011
 bored+0.008
 a+0.006
 decided+0.006
Neg logit contributions
ach-0.041
aman-0.041
 Kel-0.041
oun-0.039
z-0.039
Token big
Token ID1263
Feature activation-1.00e+00

Pos logit contributions
 too+0.037
 very+0.033
 a+0.028
 so+0.025
 bored+0.024
Neg logit contributions
aman-0.064
ach-0.063
oun-0.062
 Kel-0.061
ait-0.059
Token,
Token ID11
Feature activation-3.09e-01

Pos logit contributions
 too+0.027
 very+0.026
 a+0.022
 bored+0.020
 decided+0.019
Neg logit contributions
aman-0.047
ach-0.045
oun-0.044
 Kel-0.044
ait-0.043
Token red
Token ID2266
Feature activation-3.01e+00

Pos logit contributions
 too+0.009
 very+0.009
 a+0.007
 so+0.007
 bored+0.007
Neg logit contributions
ach-0.014
aman-0.014
 Kel-0.014
oun-0.013
z-0.013
Token barn
Token ID25203
Feature activation+6.58e-01

Pos logit contributions
 too+0.138
 very+0.135
 a+0.119
 bored+0.118
 so+0.115
Neg logit contributions
aman-0.096
ach-0.089
oun-0.089
 Kel-0.082
ait-0.081
Token and
Token ID290
Feature activation-1.34e+00

Pos logit contributions
ach+0.037
aman+0.036
oun+0.035
 Kel+0.035
z+0.035
Neg logit contributions
 too-0.013
 very-0.012
 a-0.009
 bored-0.008
 hungry-0.008
Token it
Token ID340
Feature activation-8.93e-01

Pos logit contributions
 too+0.029
 very+0.024
 bored+0.021
 tired+0.018
 mean+0.017
Neg logit contributions
 Kel-0.067
z-0.060
liv-0.060
aman-0.060
oun-0.059
Token had
Token ID550
Feature activation-8.80e-01

Pos logit contributions
 too+0.026
 very+0.024
 a+0.020
 so+0.018
 bored+0.018
Neg logit contributions
 Kel-0.041
aman-0.040
ach-0.040
oun-0.040
z-0.040
Token lots
Token ID6041
Feature activation-1.24e-01

Pos logit contributions
 too+0.016
 very+0.015
 a+0.009
 bored+0.008
 so+0.008
Neg logit contributions
aman-0.051
ach-0.051
 Kel-0.050
oun-0.049
z-0.049
Token y
Token ID331
Feature activation+6.05e-02

Pos logit contributions
 too+0.013
 very+0.013
 a+0.011
 so+0.010
 called+0.009
Neg logit contributions
ach-0.019
aman-0.019
oun-0.018
 Kel-0.018
liv-0.018
Tokenummy
Token ID13513
Feature activation-7.00e-01

Pos logit contributions
aman+0.001
ach+0.001
oun+0.001
 Kel+0.001
liv+0.001
Neg logit contributions
 too-0.004
 very-0.004
 a-0.003
 so-0.003
 called-0.003
Token.
Token ID13
Feature activation+2.36e-01

Pos logit contributions
 too+0.019
 very+0.018
 a+0.014
 so+0.013
 called+0.012
Neg logit contributions
aman-0.038
ach-0.037
 Kel-0.037
oun-0.036
liv-0.036
Token Just
Token ID2329
Feature activation-1.08e+00

Pos logit contributions
aman+0.010
ach+0.010
oun+0.010
 Kel+0.010
ait+0.009
Neg logit contributions
 very-0.008
 too-0.008
 a-0.006
 so-0.006
 bored-0.006
Token a
Token ID257
Feature activation-7.87e-01

Pos logit contributions
 too+0.026
 very+0.026
 a+0.019
 so+0.017
 called+0.016
Neg logit contributions
ach-0.058
aman-0.058
oun-0.057
 Kel-0.056
liv-0.055
Token little
Token ID1310
Feature activation-2.78e+00

Pos logit contributions
 very+0.027
 too+0.027
 a+0.022
 so+0.020
 hungry+0.019
Neg logit contributions
ach-0.037
aman-0.036
 Kel-0.036
liv-0.036
oun-0.035
Token bite
Token ID13197
Feature activation-5.23e-01

Pos logit contributions
 too+0.089
 very+0.088
 a+0.071
 so+0.066
 called+0.063
Neg logit contributions
ach-0.129
aman-0.129
oun-0.125
 Kel-0.123
liv-0.122
Token."
Token ID526
Feature activation-3.81e-01

Pos logit contributions
 too+0.013
 very+0.012
 a+0.009
 so+0.008
 called+0.008
Neg logit contributions
aman-0.030
ach-0.030
oun-0.029
 Kel-0.029
liv-0.028
Token
Token ID198
Feature activation+1.26e-01

Pos logit contributions
 too+0.008
 very+0.008
 a+0.005
 so+0.005
 called+0.005
Neg logit contributions
aman-0.022
ach-0.021
oun-0.021
 Kel-0.021
liv-0.020
Token
Token ID198
Feature activation+1.02e-01

Pos logit contributions
ach+0.008
aman+0.008
oun+0.007
 Kel+0.007
liv+0.007
Neg logit contributions
 too-0.002
 very-0.002
 a-0.001
 so-0.001
 called-0.001
TokenShe
Token ID3347
Feature activation-1.51e-01

Pos logit contributions
aman+0.005
ach+0.005
 Kel+0.005
oun+0.005
liv+0.005
Neg logit contributions
 too-0.003
 very-0.003
 a-0.003
 so-0.002
 called-0.002
Token It
Token ID632
Feature activation-1.16e+00

Pos logit contributions
ach+0.027
aman+0.026
oun+0.026
 Kel+0.026
ait+0.025
Neg logit contributions
 very-0.021
 too-0.020
 a-0.016
 all-0.016
 bored-0.016
Token will
Token ID481
Feature activation-5.14e-01

Pos logit contributions
 too+0.040
 very+0.039
 a+0.032
 so+0.030
 called+0.029
Neg logit contributions
ach-0.053
aman-0.053
oun-0.050
 Kel-0.050
liv-0.049
Token tick
Token ID4378
Feature activation-8.50e-02

Pos logit contributions
 too+0.015
 very+0.015
 a+0.012
 so+0.011
 called+0.010
Neg logit contributions
aman-0.027
ach-0.027
oun-0.026
liv-0.025
 Kel-0.025
Tokenle
Token ID293
Feature activation-7.38e-01

Pos logit contributions
 too+0.004
 very+0.004
 a+0.004
 so+0.003
 all+0.003
Neg logit contributions
aman-0.003
oun-0.003
 Kel-0.003
ach-0.003
liv-0.003
Token a
Token ID257
Feature activation-7.42e-01

Pos logit contributions
 too+0.021
 very+0.020
 a+0.016
 so+0.014
 all+0.013
Neg logit contributions
aman-0.040
oun-0.039
ach-0.039
liv-0.038
 Kel-0.038
Token little
Token ID1310
Feature activation-2.69e+00

Pos logit contributions
 very+0.027
 too+0.027
 a+0.022
 so+0.020
 called+0.019
Neg logit contributions
ach-0.033
aman-0.032
 Kel-0.032
liv-0.031
oun-0.031
Token.
Token ID13
Feature activation+6.11e-01

Pos logit contributions
 too+0.077
 very+0.076
 a+0.061
 so+0.055
 called+0.054
Neg logit contributions
ach-0.130
aman-0.130
oun-0.126
 Kel-0.124
z-0.122
Token You
Token ID921
Feature activation-1.49e+00

Pos logit contributions
ach+0.026
aman+0.025
oun+0.025
 Kel+0.025
ait+0.024
Neg logit contributions
 very-0.021
 too-0.020
 all-0.016
 bored-0.016
 a-0.016
Token can
Token ID460
Feature activation-1.34e-01

Pos logit contributions
 too+0.044
 very+0.044
 a+0.035
 so+0.033
 hungry+0.032
Neg logit contributions
ach-0.067
aman-0.066
oun-0.066
 Kel-0.064
ait-0.063
Token watch
Token ID2342
Feature activation-6.29e-02

Pos logit contributions
 too+0.004
 very+0.004
 a+0.003
 so+0.003
 called+0.003
Neg logit contributions
aman-0.007
ach-0.007
oun-0.006
 Kel-0.006
liv-0.006
Token yourself
Token ID3511
Feature activation-4.85e-01

Pos logit contributions
 too+0.002
 very+0.002
 a+0.001
 so+0.001
 all+0.001
Neg logit contributions
aman-0.003
ach-0.003
oun-0.003
liv-0.003
 Kel-0.003
Token they
Token ID484
Feature activation-3.71e-01

Pos logit contributions
aman+0.041
ach+0.041
oun+0.040
liv+0.040
z+0.040
Neg logit contributions
 too-0.022
 very-0.021
 a-0.017
 so-0.015
 all-0.014
Token want
Token ID765
Feature activation-1.78e-01

Pos logit contributions
 too+0.010
 very+0.010
 a+0.008
 so+0.007
 called+0.007
Neg logit contributions
ach-0.019
aman-0.019
oun-0.018
 Kel-0.018
z-0.017
Token a
Token ID257
Feature activation+3.41e-01

Pos logit contributions
 too+0.005
 very+0.005
 a+0.004
 so+0.004
 all+0.003
Neg logit contributions
liv-0.010
aman-0.010
ach-0.010
 Kel-0.010
z-0.010
Token new
Token ID649
Feature activation-2.22e-01

Pos logit contributions
ach+0.014
liv+0.014
aman+0.013
 Kel+0.013
z+0.012
Neg logit contributions
 very-0.015
 too-0.015
 a-0.012
 so-0.011
 hungry-0.011
Token hairst
Token ID42658
Feature activation-3.39e-01

Pos logit contributions
 too+0.009
 very+0.009
 a+0.008
 so+0.007
 called+0.007
Neg logit contributions
ach-0.009
liv-0.009
aman-0.009
 Kel-0.009
oun-0.009
Tokenyle
Token ID2349
Feature activation-2.69e+00

Pos logit contributions
 too+0.019
 very+0.019
 a+0.017
 so+0.016
 bored+0.016
Neg logit contributions
aman-0.008
ach-0.008
oun-0.007
 Kel-0.007
liv-0.007
Token or
Token ID393
Feature activation+2.24e-01

Pos logit contributions
 too+0.072
 very+0.071
 a+0.055
 so+0.050
 bored+0.049
Neg logit contributions
ach-0.135
aman-0.132
oun-0.131
z-0.127
 Kel-0.126
Token when
Token ID618
Feature activation+9.56e-01

Pos logit contributions
aman+0.013
ach+0.013
oun+0.013
liv+0.012
z+0.012
Neg logit contributions
 too-0.005
 very-0.005
 a-0.004
 so-0.004
 called-0.003
Token their
Token ID511
Feature activation-9.05e-03

Pos logit contributions
aman+0.053
ach+0.051
liv+0.051
oun+0.050
z+0.049
Neg logit contributions
 too-0.027
 very-0.027
 a-0.023
 so-0.019
 all-0.018
Token hair
Token ID4190
Feature activation-5.49e-01

Pos logit contributions
 very+0.000
 too+0.000
 a+0.000
 bored+0.000
 so+0.000
Neg logit contributions
liv-0.000
aman-0.000
 Kel-0.000
oun-0.000
ach-0.000
Token is
Token ID318
Feature activation-3.37e-01

Pos logit contributions
 too+0.017
 very+0.016
 a+0.013
 so+0.012
 called+0.012
Neg logit contributions
aman-0.028
ach-0.027
 Kel-0.027
oun-0.027
liv-0.026
Token.
Token ID13
Feature activation-1.74e-01

Pos logit contributions
 very+0.007
 too+0.007
 a+0.005
 bored+0.005
 hungry+0.004
Neg logit contributions
ach-0.020
aman-0.019
z-0.019
oun-0.019
 Kel-0.019
Token He
Token ID679
Feature activation-2.26e-01

Pos logit contributions
 too+0.004
 very+0.004
 a+0.003
 bored+0.003
 so+0.003
Neg logit contributions
ach-0.009
aman-0.009
oun-0.009
 Kel-0.009
z-0.009
Token asked
Token ID1965
Feature activation-4.58e-01

Pos logit contributions
 too+0.006
 very+0.006
 a+0.004
 so+0.004
 bored+0.004
Neg logit contributions
aman-0.012
ach-0.012
oun-0.012
 Kel-0.011
liv-0.011
Token the
Token ID262
Feature activation-1.23e+00

Pos logit contributions
 too+0.011
 very+0.010
 bored+0.008
 tired+0.007
 hungry+0.007
Neg logit contributions
ach-0.023
 Kel-0.023
aman-0.022
oun-0.022
ait-0.021
Token zo
Token ID40565
Feature activation-5.46e-01

Pos logit contributions
 too+0.058
 very+0.056
 a+0.049
 so+0.047
 bored+0.047
Neg logit contributions
ach-0.038
aman-0.038
oun-0.036
 Kel-0.035
z-0.033
Tokenoke
Token ID2088
Feature activation-2.63e+00

Pos logit contributions
 too+0.032
 very+0.032
 a+0.029
 so+0.028
 called+0.027
Neg logit contributions
ach-0.011
aman-0.010
oun-0.010
 Kel-0.009
z-0.009
Tokeneper
Token ID5723
Feature activation+2.31e-01

Pos logit contributions
 too+0.103
 very+0.099
 bored+0.094
 tired+0.092
 hungry+0.091
Neg logit contributions
ach-0.091
z-0.090
 Kel-0.082
aman-0.079
ge-0.077
Token if
Token ID611
Feature activation-2.91e-01

Pos logit contributions
ach+0.011
aman+0.011
oun+0.011
 Kel+0.011
z+0.010
Neg logit contributions
 too-0.006
 very-0.006
 a-0.005
 so-0.005
 bored-0.005
Token he
Token ID339
Feature activation+5.13e-01

Pos logit contributions
 too+0.007
 very+0.006
 a+0.005
 so+0.004
 called+0.004
Neg logit contributions
ach-0.016
aman-0.016
oun-0.016
 Kel-0.016
liv-0.015
Token could
Token ID714
Feature activation-3.47e-02

Pos logit contributions
aman+0.032
ach+0.032
oun+0.031
 Kel+0.031
liv+0.031
Neg logit contributions
 too-0.009
 very-0.009
 a-0.006
 so-0.005
 called-0.004
Token borrow
Token ID8804
Feature activation-5.38e-01

Pos logit contributions
 too+0.001
 very+0.001
 a+0.001
 so+0.001
 called+0.001
Neg logit contributions
ach-0.001
aman-0.001
oun-0.001
 Kel-0.001
liv-0.001
TokenYes
Token ID5297
Feature activation-3.66e-01

Pos logit contributions
ach+0.002
aman+0.002
oun+0.002
 Kel+0.002
liv+0.002
Neg logit contributions
 too-0.002
 very-0.002
 a-0.002
 so-0.002
 called-0.002
Token,
Token ID11
Feature activation-8.96e-02

Pos logit contributions
 too+0.006
 very+0.005
 a+0.003
 so+0.003
 all+0.002
Neg logit contributions
ach-0.024
aman-0.023
oun-0.023
z-0.023
liv-0.023
Token but
Token ID475
Feature activation-4.83e-01

Pos logit contributions
 too+0.003
 very+0.002
 a+0.002
 so+0.002
 called+0.002
Neg logit contributions
aman-0.005
ach-0.005
z-0.005
oun-0.005
liv-0.005
Token only
Token ID691
Feature activation-4.27e-01

Pos logit contributions
 too+0.017
 very+0.017
 a+0.014
 so+0.013
 bored+0.013
Neg logit contributions
ach-0.021
aman-0.021
oun-0.020
 Kel-0.020
liv-0.019
Token a
Token ID257
Feature activation-1.10e+00

Pos logit contributions
 too+0.011
 very+0.010
 a+0.008
 so+0.007
 called+0.007
Neg logit contributions
aman-0.023
ach-0.023
oun-0.023
liv-0.022
 Kel-0.022
Token little
Token ID1310
Feature activation-2.62e+00

Pos logit contributions
 very+0.040
 too+0.039
 a+0.031
 so+0.030
 hungry+0.028
Neg logit contributions
ach-0.051
aman-0.049
liv-0.048
 Kel-0.048
oun-0.047
Token.
Token ID13
Feature activation-4.23e-01

Pos logit contributions
 too+0.079
 very+0.079
 a+0.064
 so+0.059
 called+0.056
Neg logit contributions
aman-0.121
ach-0.121
oun-0.118
 Kel-0.116
z-0.113
Token The
Token ID383
Feature activation-1.31e+00

Pos logit contributions
 very+0.014
 too+0.014
 a+0.011
 so+0.011
 all+0.011
Neg logit contributions
ach-0.018
oun-0.018
aman-0.018
 Kel-0.018
ait-0.017
Token oats
Token ID47009
Feature activation-3.38e-01

Pos logit contributions
 too+0.058
 very+0.057
 a+0.049
 so+0.047
 called+0.045
Neg logit contributions
ach-0.047
aman-0.046
oun-0.044
 Kel-0.043
liv-0.043
Token are
Token ID389
Feature activation-1.89e+00

Pos logit contributions
 too+0.009
 very+0.009
 a+0.007
 so+0.006
 all+0.006
Neg logit contributions
aman-0.018
ach-0.018
oun-0.018
 Kel-0.017
liv-0.017
Token for
Token ID329
Feature activation+2.70e-02

Pos logit contributions
 too+0.036
 very+0.035
 a+0.024
 so+0.020
 called+0.019
Neg logit contributions
ach-0.112
aman-0.112
oun-0.109
 Kel-0.108
liv-0.106
Tokenasis
Token ID17765
Feature activation+6.85e-01

Pos logit contributions
 too+0.015
 very+0.015
 a+0.014
 so+0.013
 bored+0.013
Neg logit contributions
ach-0.008
aman-0.008
oun-0.007
z-0.007
liv-0.007
Token!
Token ID0
Feature activation-4.69e-01

Pos logit contributions
aman+0.034
ach+0.033
 Kel+0.033
oun+0.033
liv+0.032
Neg logit contributions
 too-0.021
 very-0.020
 a-0.017
 so-0.016
 called-0.014
Tokenâ
Token ID22940
Feature activation-3.03e-01

Pos logit contributions
 too+0.014
 very+0.014
 a+0.011
 so+0.010
 called+0.010
Neg logit contributions
ach-0.022
aman-0.022
oun-0.022
 Kel-0.021
liv-0.021
Token
Token ID26391
Feature activation-1.00e+00

Pos logit contributions
 too+0.015
 very+0.015
 a+0.013
 so+0.012
 bored+0.012
Neg logit contributions
ach-0.009
aman-0.009
oun-0.009
liv-0.009
 Kel-0.008
Token The
Token ID383
Feature activation-1.70e+00

Pos logit contributions
 too+0.041
 very+0.040
 a+0.034
 so+0.032
 called+0.031
Neg logit contributions
ach-0.038
aman-0.037
oun-0.036
 Kel-0.035
z-0.035
Token little
Token ID1310
Feature activation-2.61e+00

Pos logit contributions
 too+0.060
 very+0.059
 a+0.049
 so+0.045
 called+0.045
Neg logit contributions
aman-0.073
ach-0.072
oun-0.070
 Kel-0.068
liv-0.067
Token boy
Token ID2933
Feature activation-1.32e+00

Pos logit contributions
 very+0.088
 too+0.088
 a+0.071
 so+0.067
 called+0.066
Neg logit contributions
aman-0.112
ach-0.112
oun-0.108
 Kel-0.105
z-0.105
Token was
Token ID373
Feature activation-7.67e-01

Pos logit contributions
 very+0.030
 too+0.030
 a+0.023
 called+0.022
 so+0.020
Neg logit contributions
aman-0.081
ach-0.077
umm-0.075
liv-0.075
oun-0.075
Token very
Token ID845
Feature activation-6.32e-01

Pos logit contributions
 too+0.009
 very+0.007
 a+0.003
 decided+0.002
 bored+0.002
Neg logit contributions
ach-0.047
 Kel-0.047
aman-0.046
oun-0.046
z-0.045
Token excited
Token ID6568
Feature activation-8.89e-01

Pos logit contributions
 too+0.012
 very+0.012
 a+0.008
 so+0.007
 called+0.006
Neg logit contributions
aman-0.038
ach-0.038
oun-0.037
 Kel-0.036
liv-0.036
Token and
Token ID290
Feature activation-1.02e+00

Pos logit contributions
 too+0.021
 very+0.019
 a+0.015
 so+0.014
 all+0.012
Neg logit contributions
aman-0.052
 Kel-0.051
oun-0.051
ach-0.051
liv-0.050
Token the
Token ID262
Feature activation-3.38e-01

Pos logit contributions
 too+0.004
 very+0.004
 a+0.003
 so+0.002
 all+0.002
Neg logit contributions
aman-0.011
ach-0.011
oun-0.011
 Kel-0.011
liv-0.011
Token kitchen
Token ID9592
Feature activation+1.42e-01

Pos logit contributions
 too+0.012
 very+0.012
 a+0.010
 so+0.009
 called+0.009
Neg logit contributions
aman-0.014
ach-0.014
oun-0.014
 Kel-0.014
liv-0.013
Token.
Token ID13
Feature activation+2.05e-01

Pos logit contributions
aman+0.009
oun+0.009
ach+0.009
 Kel+0.009
liv+0.009
Neg logit contributions
 too-0.003
 very-0.003
 a-0.002
 so-0.002
 called-0.001
Token Wait
Token ID16314
Feature activation+2.32e-01

Pos logit contributions
ach+0.009
aman+0.009
oun+0.009
 Kel+0.009
ait+0.008
Neg logit contributions
 very-0.007
 too-0.007
 a-0.006
 so-0.005
 all-0.005
Token a
Token ID257
Feature activation-7.67e-01

Pos logit contributions
aman+0.012
oun+0.012
ach+0.012
liv+0.012
 Kel+0.012
Neg logit contributions
 too-0.007
 very-0.006
 a-0.005
 so-0.005
 called-0.004
Token little
Token ID1310
Feature activation-2.61e+00

Pos logit contributions
 too+0.028
 very+0.028
 a+0.023
 so+0.021
 called+0.021
Neg logit contributions
ach-0.033
aman-0.033
 Kel-0.032
oun-0.032
liv-0.031
Token bit
Token ID1643
Feature activation-9.04e-03

Pos logit contributions
 very+0.078
 too+0.077
 a+0.063
 so+0.057
 called+0.056
Neg logit contributions
aman-0.120
ach-0.120
oun-0.116
 Kel-0.113
z-0.112
Token,"
Token ID553
Feature activation-1.23e-01

Pos logit contributions
 too+0.000
 very+0.000
 a+0.000
 so+0.000
 called+0.000
Neg logit contributions
aman-0.001
ach-0.001
oun-0.000
 Kel-0.000
liv-0.000
Token mom
Token ID1995
Feature activation+5.65e-01

Pos logit contributions
 too+0.004
 very+0.004
 a+0.003
 so+0.003
 bored+0.003
Neg logit contributions
ach-0.005
aman-0.005
 Kel-0.005
oun-0.005
liv-0.005
Token says
Token ID1139
Feature activation+8.29e-01

Pos logit contributions
ach+0.025
aman+0.025
oun+0.025
 Kel+0.024
liv+0.024
Neg logit contributions
 too-0.019
 very-0.018
 a-0.015
 so-0.014
 bored-0.013
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
ach+0.049
aman+0.048
oun+0.047
 Kel+0.047
z+0.046
Neg logit contributions
 very-0.014
 too-0.014
 a-0.009
 so-0.008
 called-0.008
Token there
Token ID612
Feature activation+5.71e-01

Pos logit contributions
 too+0.016
 very+0.016
 a+0.010
 so+0.009
 called+0.009
Neg logit contributions
aman-0.052
ach-0.052
 Kel-0.050
oun-0.050
liv-0.049
Token was
Token ID373
Feature activation-5.78e-01

Pos logit contributions
ach+0.033
oun+0.033
aman+0.033
becca+0.031
liv+0.031
Neg logit contributions
 too-0.014
 very-0.014
 a-0.011
 so-0.009
 all-0.009
Token a
Token ID257
Feature activation-3.49e-01

Pos logit contributions
 too+0.005
 very+0.005
 a+0.002
 so+0.001
 all-0.001
Neg logit contributions
ach-0.041
aman-0.041
oun-0.040
 Kel-0.040
z-0.039
Token loud
Token ID7812
Feature activation+6.35e-01

Pos logit contributions
 too+0.011
 very+0.010
 a+0.008
 so+0.008
 called+0.007
Neg logit contributions
ach-0.017
aman-0.017
oun-0.017
 Kel-0.016
liv-0.016
Token *
Token ID1635
Feature activation-9.43e-01

Pos logit contributions
ach+0.031
aman+0.031
oun+0.030
liv+0.029
 Kel+0.029
Neg logit contributions
 too-0.020
 very-0.019
 a-0.016
 so-0.014
 called-0.013
Tokenbe
Token ID1350
Feature activation-2.59e+00

Pos logit contributions
 too+0.043
 very+0.043
 a+0.037
 so+0.035
 bored+0.034
Neg logit contributions
ach-0.031
aman-0.030
oun-0.029
 Kel-0.028
z-0.028
Tokenep
Token ID538
Feature activation-6.26e-01

Pos logit contributions
 too+0.111
 very+0.110
 a+0.095
 so+0.090
 called+0.087
Neg logit contributions
aman-0.092
ach-0.092
oun-0.088
 Kel-0.087
liv-0.085
Token*
Token ID9
Feature activation-4.19e-01

Pos logit contributions
 too+0.018
 very+0.018
 a+0.014
 so+0.013
 called+0.012
Neg logit contributions
aman-0.032
oun-0.032
ach-0.032
liv-0.031
becca-0.031
Token from
Token ID422
Feature activation+3.37e-01

Pos logit contributions
 too+0.012
 very+0.011
 a+0.009
 so+0.008
 all+0.007
Neg logit contributions
ach-0.022
aman-0.022
oun-0.021
 Kel-0.021
liv-0.021
Token the
Token ID262
Feature activation-7.38e-01

Pos logit contributions
aman+0.020
ach+0.020
oun+0.019
liv+0.019
 Kel+0.019
Neg logit contributions
 too-0.007
 very-0.007
 a-0.005
 so-0.004
 all-0.004
Token oven
Token ID14361
Feature activation+3.96e-01

Pos logit contributions
 very+0.027
 too+0.027
 a+0.022
 so+0.021
 hungry+0.020
Neg logit contributions
ach-0.034
 Kel-0.033
oun-0.033
aman-0.033
liv-0.032
Token the
Token ID262
Feature activation-2.18e-01

Pos logit contributions
 too+0.014
 very+0.014
 a+0.010
 so+0.009
 called+0.009
Neg logit contributions
ach-0.031
aman-0.031
oun-0.030
 Kel-0.030
liv-0.029
Token cake
Token ID12187
Feature activation-5.48e-01

Pos logit contributions
 very+0.008
 too+0.008
 a+0.007
 so+0.006
 all+0.006
Neg logit contributions
 Kel-0.010
ach-0.010
oun-0.010
aman-0.010
liv-0.009
Token.
Token ID13
Feature activation+2.36e-01

Pos logit contributions
 too+0.010
 very+0.010
 a+0.007
 so+0.006
 called+0.006
Neg logit contributions
ach-0.032
aman-0.032
oun-0.031
 Kel-0.031
z-0.030
Token Just
Token ID2329
Feature activation-6.69e-01

Pos logit contributions
ach+0.010
aman+0.009
oun+0.009
 Kel+0.009
ait+0.009
Neg logit contributions
 very-0.008
 too-0.008
 all-0.006
 a-0.006
 bored-0.006
Token a
Token ID257
Feature activation-4.39e-01

Pos logit contributions
 too+0.016
 very+0.016
 a+0.012
 so+0.011
 bored+0.011
Neg logit contributions
ach-0.035
aman-0.035
oun-0.033
 Kel-0.033
z-0.032
Token little
Token ID1310
Feature activation-2.58e+00

Pos logit contributions
 very+0.016
 too+0.016
 a+0.013
 so+0.012
 hungry+0.011
Neg logit contributions
ach-0.020
aman-0.019
 Kel-0.019
liv-0.019
oun-0.019
Token bit
Token ID1643
Feature activation-4.02e-01

Pos logit contributions
 very+0.081
 too+0.081
 a+0.067
 so+0.061
 called+0.061
Neg logit contributions
ach-0.112
aman-0.112
oun-0.108
 Kel-0.105
z-0.104
Token.
Token ID13
Feature activation+1.07e-01

Pos logit contributions
 very+0.009
 too+0.009
 a+0.007
 so+0.006
 called+0.005
Neg logit contributions
ach-0.022
aman-0.022
oun-0.021
 Kel-0.021
ait-0.020
Token Please
Token ID4222
Feature activation+2.65e-01

Pos logit contributions
ach+0.004
aman+0.004
 Kel+0.004
oun+0.004
ait+0.004
Neg logit contributions
 very-0.004
 too-0.004
 all-0.003
 bored-0.003
 a-0.003
Token,
Token ID11
Feature activation+5.58e-01

Pos logit contributions
ach+0.014
aman+0.014
oun+0.013
 Kel+0.013
z+0.013
Neg logit contributions
 very-0.006
 too-0.006
 a-0.005
 so-0.004
 bored-0.004
Token Anna
Token ID11735
Feature activation-1.00e+00

Pos logit contributions
ach+0.031
aman+0.030
oun+0.029
z+0.029
liv+0.029
Neg logit contributions
 too-0.016
 very-0.015
 a-0.013
 so-0.010
 called-0.010
Token would
Token ID561
Feature activation-1.06e+00

Pos logit contributions
 too+0.007
 very+0.007
 a+0.005
 so+0.005
 called+0.005
Neg logit contributions
ach-0.013
aman-0.013
oun-0.012
 Kel-0.012
liv-0.012
Token pour
Token ID12797
Feature activation-1.05e+00

Pos logit contributions
 too+0.038
 very+0.037
 a+0.031
 so+0.029
 bored+0.028
Neg logit contributions
ach-0.043
aman-0.043
oun-0.042
 Kel-0.041
z-0.040
Token it
Token ID340
Feature activation+3.86e-02

Pos logit contributions
 too+0.023
 very+0.022
 a+0.016
 so+0.014
 called+0.013
Neg logit contributions
ach-0.059
aman-0.059
oun-0.058
 Kel-0.057
liv-0.056
Token from
Token ID422
Feature activation+2.81e-01

Pos logit contributions
ach+0.002
aman+0.002
oun+0.002
z+0.002
 Kel+0.002
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.001
 called-0.001
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
ach+0.016
aman+0.016
oun+0.016
 Kel+0.015
z+0.015
Neg logit contributions
 too-0.006
 very-0.006
 a-0.004
 so-0.003
 bored-0.003
Token blue
Token ID4171
Feature activation-2.55e+00

Pos logit contributions
 very+0.069
 too+0.068
 a+0.056
 so+0.053
 called+0.050
Neg logit contributions
ach-0.087
aman-0.087
oun-0.084
 Kel-0.082
liv-0.081
Token cup
Token ID6508
Feature activation-5.91e-01

Pos logit contributions
 too+0.095
 very+0.093
 a+0.079
 so+0.073
 called+0.071
Neg logit contributions
ach-0.106
aman-0.106
oun-0.103
 Kel-0.101
liv-0.100
Token into
Token ID656
Feature activation-7.49e-01

Pos logit contributions
 too+0.014
 very+0.013
 a+0.010
 so+0.009
 bored+0.009
Neg logit contributions
ach-0.032
aman-0.032
oun-0.031
 Kel-0.030
liv-0.030
Token the
Token ID262
Feature activation-1.77e+00

Pos logit contributions
 too+0.015
 very+0.015
 a+0.010
 so+0.009
 called+0.008
Neg logit contributions
ach-0.044
aman-0.043
oun-0.042
 Kel-0.042
liv-0.041
Token green
Token ID4077
Feature activation-2.51e+00

Pos logit contributions
 very+0.064
 too+0.063
 a+0.052
 so+0.049
 called+0.047
Neg logit contributions
ach-0.079
aman-0.078
oun-0.076
 Kel-0.074
liv-0.073
Token cup
Token ID6508
Feature activation-7.76e-01

Pos logit contributions
 too+0.090
 very+0.088
 a+0.074
 so+0.069
 called+0.066
Neg logit contributions
ach-0.109
aman-0.109
oun-0.106
 Kel-0.105
liv-0.103
Token One
Token ID1881
Feature activation-4.72e-01

Pos logit contributions
 too+0.008
 very+0.008
 a+0.006
 so+0.005
 bored+0.005
Neg logit contributions
aman-0.019
ach-0.019
oun-0.018
 Kel-0.018
liv-0.018
Token day
Token ID1110
Feature activation-4.87e-01

Pos logit contributions
 too+0.016
 very+0.015
 bored+0.012
 a+0.012
 so+0.011
Neg logit contributions
 Kel-0.019
aman-0.019
ach-0.019
oun-0.018
z-0.018
Token a
Token ID257
Feature activation-4.79e-01

Pos logit contributions
 too+0.008
 very+0.008
 a+0.006
 so+0.004
 called+0.004
Neg logit contributions
aman-0.031
ach-0.031
oun-0.031
liv-0.029
ait-0.029
Token trou
Token ID4057
Feature activation-7.61e-01

Pos logit contributions
 too+0.017
 very+0.016
 a+0.014
 so+0.013
 bored+0.012
Neg logit contributions
ach-0.020
aman-0.020
oun-0.019
 Kel-0.019
z-0.018
Tokenl
Token ID75
Feature activation+5.90e-01

Pos logit contributions
 too+0.034
 very+0.033
 a+0.029
 so+0.028
 bored+0.027
Neg logit contributions
ach-0.026
aman-0.025
 Kel-0.024
oun-0.024
z-0.024
Tokenbe
Token ID1350
Feature activation-2.54e+00

Pos logit contributions
ach+0.025
aman+0.024
oun+0.024
 Kel+0.023
liv+0.023
Neg logit contributions
 too-0.022
 very-0.022
 a-0.018
 so-0.017
 called-0.016
Token came
Token ID1625
Feature activation+5.16e-01

Pos logit contributions
 too+0.115
 very+0.112
 a+0.098
 so+0.093
 bored+0.092
Neg logit contributions
ach-0.083
aman-0.078
z-0.077
oun-0.076
 Kel-0.076
Token to
Token ID284
Feature activation+7.02e-02

Pos logit contributions
aman+0.027
ach+0.027
oun+0.027
liv+0.026
 Kel+0.026
Neg logit contributions
 too-0.014
 very-0.014
 a-0.011
 so-0.010
 called-0.009
Token the
Token ID262
Feature activation-9.90e-01

Pos logit contributions
ach+0.004
aman+0.004
oun+0.004
liv+0.004
 Kel+0.004
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.001
 called-0.001
Token bridge
Token ID7696
Feature activation-5.96e-01

Pos logit contributions
 too+0.034
 very+0.033
 a+0.028
 so+0.026
 called+0.025
Neg logit contributions
ach-0.043
aman-0.043
oun-0.041
 Kel-0.041
liv-0.040
Token.
Token ID13
Feature activation-1.47e-01

Pos logit contributions
 too+0.012
 very+0.012
 a+0.008
 so+0.007
 bored+0.007
Neg logit contributions
ach-0.034
aman-0.034
oun-0.033
 Kel-0.033
z-0.032
Token at
Token ID379
Feature activation-3.56e-01

Pos logit contributions
ach+0.020
aman+0.020
oun+0.020
 Kel+0.019
liv+0.019
Neg logit contributions
 too-0.011
 very-0.010
 a-0.008
 so-0.007
 called-0.007
Token him
Token ID683
Feature activation-3.13e-01

Pos logit contributions
 too+0.008
 very+0.008
 a+0.006
 so+0.005
 all+0.005
Neg logit contributions
ach-0.021
aman-0.020
oun-0.020
z-0.019
liv-0.019
Token with
Token ID351
Feature activation+2.21e-01

Pos logit contributions
 too+0.006
 very+0.006
 a+0.004
 so+0.004
 called+0.003
Neg logit contributions
aman-0.019
ach-0.019
oun-0.018
 Kel-0.018
liv-0.018
Token a
Token ID257
Feature activation+7.54e-01

Pos logit contributions
ach+0.012
aman+0.012
oun+0.012
 Kel+0.012
liv+0.012
Neg logit contributions
 too-0.005
 very-0.005
 a-0.004
 so-0.003
 called-0.003
Token mis
Token ID2984
Feature activation+2.44e-02

Pos logit contributions
aman+0.036
ach+0.036
oun+0.035
 Kel+0.035
liv+0.034
Neg logit contributions
 too-0.022
 very-0.022
 a-0.017
 so-0.016
 called-0.016
Tokenchie
Token ID3043
Feature activation-2.54e+00

Pos logit contributions
ach+0.001
aman+0.001
oun+0.001
z+0.001
 Kel+0.001
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.001
 bored-0.001
Tokenvous
Token ID31222
Feature activation+3.88e-01

Pos logit contributions
 too+0.115
 very+0.108
 bored+0.096
 a+0.095
 tired+0.094
Neg logit contributions
ach-0.076
aman-0.073
z-0.071
liv-0.070
becca-0.067
Token grin
Token ID16770
Feature activation-8.86e-01

Pos logit contributions
ach+0.017
aman+0.017
oun+0.016
 Kel+0.016
liv+0.016
Neg logit contributions
 too-0.013
 very-0.013
 a-0.011
 so-0.010
 called-0.010
Token.
Token ID13
Feature activation-1.23e-02

Pos logit contributions
 very+0.016
 too+0.016
 a+0.011
 so+0.010
 bored+0.009
Neg logit contributions
ach-0.052
aman-0.051
oun-0.050
 Kel-0.049
z-0.049
Token She
Token ID1375
Feature activation-1.71e-01

Pos logit contributions
 too+0.000
 very+0.000
 a+0.000
 so+0.000
 called+0.000
Neg logit contributions
ach-0.001
aman-0.001
oun-0.001
 Kel-0.001
liv-0.001
Token said
Token ID531
Feature activation-1.01e-01

Pos logit contributions
 too+0.004
 very+0.004
 a+0.003
 so+0.003
 called+0.003
Neg logit contributions
ach-0.010
aman-0.009
oun-0.009
 Kel-0.009
liv-0.009
Token is
Token ID318
Feature activation-4.12e-01

Pos logit contributions
 too+0.011
 very+0.011
 a+0.009
 so+0.008
 bored+0.008
Neg logit contributions
ach-0.014
aman-0.014
oun-0.014
 Kel-0.014
z-0.013
Token more
Token ID517
Feature activation-3.66e-01

Pos logit contributions
 too+0.009
 very+0.008
 a+0.006
 bored+0.005
 so+0.005
Neg logit contributions
ach-0.022
aman-0.022
 Kel-0.021
oun-0.021
liv-0.021
Token shiny
Token ID22441
Feature activation-8.66e-01

Pos logit contributions
 too+0.012
 very+0.012
 a+0.010
 so+0.009
 called+0.009
Neg logit contributions
aman-0.016
ach-0.016
oun-0.016
 Kel-0.016
liv-0.015
Token.
Token ID13
Feature activation+2.95e-01

Pos logit contributions
 very+0.022
 too+0.021
 a+0.017
 bored+0.015
 tired+0.015
Neg logit contributions
ach-0.044
aman-0.041
oun-0.041
z-0.040
 Kel-0.039
Token The
Token ID383
Feature activation-1.17e+00

Pos logit contributions
ach+0.012
oun+0.011
 Kel+0.011
aman+0.011
umm+0.011
Neg logit contributions
 very-0.011
 too-0.010
 bored-0.008
 tired-0.008
 all-0.008
Token red
Token ID2266
Feature activation-2.52e+00

Pos logit contributions
 too+0.036
 very+0.035
 a+0.028
 so+0.026
 called+0.025
Neg logit contributions
ach-0.055
aman-0.055
oun-0.053
 Kel-0.053
liv-0.052
Token one
Token ID530
Feature activation-7.00e-01

Pos logit contributions
 too+0.085
 very+0.085
 a+0.069
 bored+0.066
 so+0.066
Neg logit contributions
ach-0.109
aman-0.106
oun-0.103
z-0.101
 Kel-0.101
Token is
Token ID318
Feature activation-4.72e-01

Pos logit contributions
 too+0.021
 very+0.021
 a+0.016
 so+0.015
 called+0.014
Neg logit contributions
aman-0.036
ach-0.036
oun-0.035
 Kel-0.034
liv-0.034
Token old
Token ID1468
Feature activation-1.19e+00

Pos logit contributions
 too+0.013
 very+0.012
 a+0.010
 so+0.009
 called+0.008
Neg logit contributions
aman-0.025
ach-0.025
oun-0.024
 Kel-0.024
liv-0.023
Token and
Token ID290
Feature activation-3.70e-01

Pos logit contributions
 very+0.025
 too+0.025
 a+0.018
 bored+0.015
 so+0.015
Neg logit contributions
ach-0.065
aman-0.063
oun-0.062
 Kel-0.061
z-0.060
Token boring
Token ID14262
Feature activation-7.42e-01

Pos logit contributions
 too+0.011
 very+0.011
 a+0.009
 called+0.008
 so+0.008
Neg logit contributions
ach-0.016
 Kel-0.016
aman-0.016
oun-0.016
z-0.015
Token.
Token ID13
Feature activation+2.83e-01

Pos logit contributions
 too+0.012
 very+0.012
 a+0.008
 bored+0.007
 so+0.007
Neg logit contributions
ach-0.035
aman-0.034
oun-0.033
z-0.033
 Kel-0.033
Token Ben
Token ID3932
Feature activation-3.39e-02

Pos logit contributions
ach+0.013
oun+0.013
aman+0.013
ait+0.012
becca+0.012
Neg logit contributions
 very-0.008
 too-0.008
 a-0.006
 so-0.006
 called-0.006
Token asked
Token ID1965
Feature activation-4.88e-01

Pos logit contributions
 too+0.001
 very+0.001
 a+0.001
 so+0.001
 called+0.001
Neg logit contributions
ach-0.002
aman-0.002
oun-0.002
 Kel-0.002
liv-0.002
Token the
Token ID262
Feature activation-8.33e-01

Pos logit contributions
 too+0.011
 very+0.011
 bored+0.008
 so+0.008
 called+0.007
Neg logit contributions
ach-0.025
aman-0.024
oun-0.023
 Kel-0.023
ait-0.023
Token zo
Token ID40565
Feature activation-2.88e-01

Pos logit contributions
 too+0.028
 very+0.028
 a+0.023
 so+0.022
 called+0.020
Neg logit contributions
ach-0.037
aman-0.036
oun-0.036
 Kel-0.035
z-0.034
Tokenoke
Token ID2088
Feature activation-2.51e+00

Pos logit contributions
 too+0.017
 very+0.016
 a+0.015
 so+0.014
 called+0.014
Neg logit contributions
ach-0.006
aman-0.006
oun-0.006
 Kel-0.005
liv-0.005
Tokeneper
Token ID5723
Feature activation+4.34e-01

Pos logit contributions
 too+0.097
 very+0.094
 bored+0.084
 tired+0.082
 so+0.081
Neg logit contributions
ach-0.093
z-0.089
aman-0.085
 Kel-0.083
oun-0.081
Token if
Token ID611
Feature activation-5.40e-01

Pos logit contributions
ach+0.021
aman+0.021
oun+0.021
 Kel+0.020
z+0.020
Neg logit contributions
 too-0.011
 very-0.011
 a-0.009
 so-0.008
 bored-0.008
Token he
Token ID339
Feature activation+6.00e-01

Pos logit contributions
 too+0.013
 very+0.013
 a+0.010
 so+0.009
 called+0.008
Neg logit contributions
ach-0.029
aman-0.029
oun-0.028
 Kel-0.028
liv-0.028
Token could
Token ID714
Feature activation+3.81e-01

Pos logit contributions
ach+0.035
aman+0.035
oun+0.034
 Kel+0.034
liv+0.034
Neg logit contributions
 too-0.012
 very-0.011
 a-0.008
 so-0.007
 called-0.006
Token touch
Token ID3638
Feature activation-3.03e-01

Pos logit contributions
ach+0.017
aman+0.017
oun+0.016
 Kel+0.016
z+0.016
Neg logit contributions
 too-0.012
 very-0.012
 a-0.010
 so-0.009
 bored-0.009
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
ach+0.016
aman+0.016
oun+0.016
 Kel+0.015
z+0.015
Neg logit contributions
 too-0.006
 very-0.006
 a-0.004
 so-0.003
 bored-0.003
Token blue
Token ID4171
Feature activation-2.55e+00

Pos logit contributions
 very+0.069
 too+0.068
 a+0.056
 so+0.053
 called+0.050
Neg logit contributions
ach-0.087
aman-0.087
oun-0.084
 Kel-0.082
liv-0.081
Token cup
Token ID6508
Feature activation-5.91e-01

Pos logit contributions
 too+0.095
 very+0.093
 a+0.079
 so+0.073
 called+0.071
Neg logit contributions
ach-0.106
aman-0.106
oun-0.103
 Kel-0.101
liv-0.100
Token into
Token ID656
Feature activation-7.49e-01

Pos logit contributions
 too+0.014
 very+0.013
 a+0.010
 so+0.009
 bored+0.009
Neg logit contributions
ach-0.032
aman-0.032
oun-0.031
 Kel-0.030
liv-0.030
Token the
Token ID262
Feature activation-1.77e+00

Pos logit contributions
 too+0.015
 very+0.015
 a+0.010
 so+0.009
 called+0.008
Neg logit contributions
ach-0.044
aman-0.043
oun-0.042
 Kel-0.042
liv-0.041
Token green
Token ID4077
Feature activation-2.51e+00

Pos logit contributions
 very+0.064
 too+0.063
 a+0.052
 so+0.049
 called+0.047
Neg logit contributions
ach-0.079
aman-0.078
oun-0.076
 Kel-0.074
liv-0.073
Token cup
Token ID6508
Feature activation-7.76e-01

Pos logit contributions
 too+0.090
 very+0.088
 a+0.074
 so+0.069
 called+0.066
Neg logit contributions
ach-0.109
aman-0.109
oun-0.106
 Kel-0.105
liv-0.103
Token.
Token ID13
Feature activation-6.66e-02

Pos logit contributions
 too+0.017
 very+0.017
 a+0.013
 bored+0.011
 hungry+0.011
Neg logit contributions
aman-0.042
ach-0.042
oun-0.041
liv-0.040
 Kel-0.039
Token
Token ID198
Feature activation-1.10e-01

Pos logit contributions
 very+0.001
 too+0.001
 a+0.001
 bored+0.001
 so+0.001
Neg logit contributions
oun-0.004
aman-0.004
ach-0.004
 Kel-0.004
z-0.003
Token
Token ID198
Feature activation+4.80e-01

Pos logit contributions
 too+0.002
 very+0.002
 a+0.002
 so+0.001
 bored+0.001
Neg logit contributions
ach-0.006
aman-0.006
oun-0.006
 Kel-0.006
z-0.006
TokenOne
Token ID3198
Feature activation-1.26e-01

Pos logit contributions
ach+0.024
aman+0.024
oun+0.024
 Kel+0.023
liv+0.023
Neg logit contributions
 too-0.013
 very-0.013
 a-0.010
 so-0.009
 called-0.009
Token
Token ID198
Feature activation+7.35e-03

Pos logit contributions
ach+0.007
aman+0.007
oun+0.006
 Kel+0.006
liv+0.006
Neg logit contributions
 too-0.002
 very-0.002
 a-0.001
 so-0.001
 called-0.001
Token
Token ID198
Feature activation+2.22e-01

Pos logit contributions
ach+0.000
aman+0.000
oun+0.000
 Kel+0.000
liv+0.000
Neg logit contributions
 too-0.000
 very-0.000
 a-0.000
 so-0.000
 bored-0.000
TokenWith
Token ID3152
Feature activation-1.13e+00

Pos logit contributions
ach+0.008
aman+0.008
oun+0.007
 Kel+0.007
liv+0.007
Neg logit contributions
 too-0.010
 very-0.009
 a-0.008
 so-0.008
 bored-0.008
Token a
Token ID257
Feature activation-2.09e-01

Pos logit contributions
 too+0.023
 very+0.022
 a+0.014
 bored+0.013
 so+0.013
Neg logit contributions
aman-0.065
ach-0.064
 Kel-0.063
oun-0.062
liv-0.061
Token mis
Token ID2984
Feature activation-7.09e-02

Pos logit contributions
 too+0.007
 very+0.007
 a+0.006
 so+0.005
 called+0.005
Neg logit contributions
aman-0.009
ach-0.009
oun-0.009
 Kel-0.009
z-0.008
Tokenchie
Token ID3043
Feature activation-2.50e+00

Pos logit contributions
 too+0.004
 very+0.004
 a+0.004
 all+0.003
 so+0.003
Neg logit contributions
ach-0.002
aman-0.002
oun-0.001
z-0.001
ait-0.001
Tokenvous
Token ID31222
Feature activation-1.68e-01

Pos logit contributions
 too+0.120
 very+0.108
 a+0.100
 bored+0.099
 all+0.099
Neg logit contributions
z-0.066
ach-0.066
liv-0.061
aman-0.060
becca-0.059
Token gl
Token ID1278
Feature activation+4.03e-01

Pos logit contributions
 too+0.005
 very+0.005
 a+0.004
 so+0.004
 called+0.004
Neg logit contributions
ach-0.008
aman-0.008
oun-0.008
 Kel-0.008
liv-0.008
Tokenint
Token ID600
Feature activation-2.26e-01

Pos logit contributions
ach+0.011
aman+0.011
oun+0.011
 Kel+0.011
liv+0.010
Neg logit contributions
 too-0.020
 very-0.020
 a-0.018
 so-0.017
 called-0.016
Token in
Token ID287
Feature activation+1.07e-01

Pos logit contributions
 too+0.009
 very+0.009
 a+0.008
 so+0.007
 called+0.007
Neg logit contributions
aman-0.009
ach-0.009
oun-0.008
liv-0.008
 Kel-0.008
Token her
Token ID607
Feature activation-1.25e+00

Pos logit contributions
oun+0.007
ach+0.007
aman+0.006
liv+0.006
z+0.006
Neg logit contributions
 too-0.002
 very-0.002
 a-0.002
 so-0.001
 all-0.001
Token was
Token ID373
Feature activation+8.41e-01

Pos logit contributions
ach+0.048
aman+0.046
oun+0.045
becca+0.045
ait+0.044
Neg logit contributions
 too-0.021
 very-0.020
 a-0.015
 called-0.013
 so-0.013
Token bright
Token ID6016
Feature activation+2.31e-01

Pos logit contributions
ach+0.042
aman+0.041
oun+0.040
 Kel+0.039
liv+0.039
Neg logit contributions
 too-0.026
 very-0.025
 a-0.020
 so-0.019
 called-0.017
Token and
Token ID290
Feature activation-1.42e+00

Pos logit contributions
ach+0.014
aman+0.014
oun+0.014
 Kel+0.014
liv+0.013
Neg logit contributions
 too-0.004
 very-0.004
 a-0.003
 so-0.002
 called-0.002
Token warm
Token ID5814
Feature activation+1.03e-01

Pos logit contributions
 too+0.038
 very+0.036
 a+0.027
 bored+0.027
 hungry+0.026
Neg logit contributions
 Kel-0.068
aman-0.067
ach-0.065
liv-0.064
z-0.064
Token,
Token ID11
Feature activation-5.71e-01

Pos logit contributions
ach+0.006
aman+0.006
oun+0.006
 Kel+0.006
liv+0.006
Neg logit contributions
 too-0.002
 very-0.002
 a-0.001
 so-0.001
 all-0.001
Token and
Token ID290
Feature activation-2.50e+00

Pos logit contributions
 too+0.012
 very+0.011
 a+0.008
 so+0.007
 called+0.006
Neg logit contributions
ach-0.033
aman-0.033
oun-0.032
 Kel-0.032
liv-0.031
Token Jos
Token ID22568
Feature activation-4.86e-01

Pos logit contributions
 too+0.077
 very+0.074
 bored+0.058
 hungry+0.058
 a+0.057
Neg logit contributions
 Kel-0.107
aman-0.106
ach-0.105
z-0.101
liv-0.101
Tokenie
Token ID494
Feature activation-1.15e-01

Pos logit contributions
 too+0.020
 very+0.020
 a+0.017
 so+0.016
 called+0.015
Neg logit contributions
ach-0.018
aman-0.018
oun-0.017
 Kel-0.017
liv-0.017
Token was
Token ID373
Feature activation+5.51e-02

Pos logit contributions
 too+0.003
 very+0.003
 a+0.002
 so+0.002
 called+0.002
Neg logit contributions
ach-0.006
aman-0.006
oun-0.006
 Kel-0.006
liv-0.006
Token happy
Token ID3772
Feature activation-6.69e-01

Pos logit contributions
aman+0.003
ach+0.003
 Kel+0.003
oun+0.003
liv+0.003
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 called-0.000
 bored-0.000
Token!
Token ID0
Feature activation-8.11e-01

Pos logit contributions
 too+0.016
 very+0.016
 a+0.012
 so+0.010
 called+0.010
Neg logit contributions
ach-0.037
aman-0.037
oun-0.036
 Kel-0.036
liv-0.035
Token yawn
Token ID40908
Feature activation-3.39e-03

Pos logit contributions
ach+0.011
aman+0.011
oun+0.011
 Kel+0.011
liv+0.011
Neg logit contributions
 too-0.008
 very-0.008
 a-0.006
 so-0.006
 called-0.006
Token.
Token ID13
Feature activation-8.88e-01

Pos logit contributions
 too+0.000
 very+0.000
 a+0.000
 called+0.000
 bored+0.000
Neg logit contributions
ach-0.000
aman-0.000
oun-0.000
z-0.000
liv-0.000
Token
Token ID198
Feature activation-2.46e-01

Pos logit contributions
 too+0.021
 very+0.021
 a+0.014
 mean+0.014
 bored+0.014
Neg logit contributions
oun-0.047
ach-0.047
 Kel-0.046
aman-0.046
z-0.044
Token
Token ID198
Feature activation-2.13e-01

Pos logit contributions
 too+0.005
 very+0.005
 a+0.003
 so+0.003
 bored+0.003
Neg logit contributions
ach-0.014
aman-0.014
oun-0.014
 Kel-0.013
liv-0.013
TokenThe
Token ID464
Feature activation-7.23e-01

Pos logit contributions
 too+0.007
 very+0.007
 a+0.006
 so+0.005
 bored+0.005
Neg logit contributions
ach-0.010
aman-0.009
oun-0.009
 Kel-0.009
z-0.009
Token little
Token ID1310
Feature activation-2.50e+00

Pos logit contributions
 too+0.025
 very+0.025
 a+0.021
 so+0.019
 called+0.019
Neg logit contributions
ach-0.031
aman-0.031
oun-0.030
 Kel-0.030
liv-0.029
Token boy
Token ID2933
Feature activation-7.38e-01

Pos logit contributions
 very+0.097
 too+0.096
 Amen+0.082
 a+0.082
 called+0.078
Neg logit contributions
ach-0.080
 Kel-0.076
oun-0.076
aman-0.075
z-0.073
Token yawn
Token ID40908
Feature activation+1.98e-01

Pos logit contributions
 too+0.019
 very+0.019
 a+0.014
 so+0.013
 bored+0.012
Neg logit contributions
aman-0.037
ach-0.037
 Kel-0.035
oun-0.035
z-0.035
Tokened
Token ID276
Feature activation-2.32e-01

Pos logit contributions
ach+0.005
aman+0.005
oun+0.005
z+0.005
liv+0.005
Neg logit contributions
 too-0.010
 very-0.010
 a-0.009
 bored-0.008
 so-0.008
Token and
Token ID290
Feature activation-6.45e-01

Pos logit contributions
 very+0.005
 too+0.005
 a+0.003
 bored+0.003
 called+0.003
Neg logit contributions
ach-0.012
aman-0.012
 Kel-0.011
oun-0.011
z-0.011
Token stretched
Token ID19110
Feature activation-5.03e-01

Pos logit contributions
 too+0.016
 very+0.016
 a+0.012
 so+0.011
 bored+0.011
Neg logit contributions
aman-0.032
ach-0.032
 Kel-0.031
oun-0.031
liv-0.031
Token to
Token ID284
Feature activation+5.78e-02

Pos logit contributions
 too+0.025
 very+0.024
 a+0.020
 so+0.018
 all+0.017
Neg logit contributions
ach-0.041
aman-0.041
oun-0.040
liv-0.039
 Kel-0.039
Token have
Token ID423
Feature activation-5.10e-01

Pos logit contributions
ach+0.002
aman+0.002
oun+0.002
 Kel+0.002
liv+0.002
Neg logit contributions
 too-0.002
 very-0.002
 a-0.002
 so-0.002
 bored-0.002
Token a
Token ID257
Feature activation+5.03e-01

Pos logit contributions
 too+0.008
 very+0.008
 a+0.005
 so+0.004
 called+0.003
Neg logit contributions
ach-0.032
aman-0.032
oun-0.031
 Kel-0.031
liv-0.031
Token cool
Token ID3608
Feature activation-5.06e-01

Pos logit contributions
ach+0.022
aman+0.021
oun+0.021
 Kel+0.021
liv+0.020
Neg logit contributions
 very-0.018
 too-0.018
 a-0.015
 so-0.014
 bored-0.014
Token hairst
Token ID42658
Feature activation-1.82e-01

Pos logit contributions
 too+0.018
 very+0.018
 a+0.015
 so+0.014
 all+0.013
Neg logit contributions
ach-0.023
aman-0.022
liv-0.022
 Kel-0.022
oun-0.022
Tokenyle
Token ID2349
Feature activation-2.47e+00

Pos logit contributions
 too+0.010
 very+0.010
 a+0.009
 so+0.008
 called+0.008
Neg logit contributions
ach-0.004
aman-0.004
oun-0.004
 Kel-0.004
liv-0.004
Token,
Token ID11
Feature activation+4.51e-01

Pos logit contributions
 too+0.056
 very+0.055
 a+0.042
 bored+0.041
 tired+0.039
Neg logit contributions
ach-0.125
oun-0.124
z-0.119
aman-0.119
 Kel-0.116
Token but
Token ID475
Feature activation+2.37e-01

Pos logit contributions
ach+0.025
aman+0.025
oun+0.024
 Kel+0.024
liv+0.023
Neg logit contributions
 too-0.010
 very-0.010
 a-0.007
 so-0.006
 called-0.006
Token he
Token ID339
Feature activation-2.54e-01

Pos logit contributions
aman+0.012
ach+0.012
 Kel+0.012
oun+0.012
liv+0.012
Neg logit contributions
 too-0.006
 very-0.006
 a-0.004
 bored-0.004
 hungry-0.004
Token didn
Token ID1422
Feature activation-2.87e-01

Pos logit contributions
 too+0.007
 very+0.007
 a+0.005
 so+0.005
 bored+0.005
Neg logit contributions
ach-0.012
aman-0.012
liv-0.012
oun-0.012
 Kel-0.012
Token't
Token ID470
Feature activation-6.12e-01

Pos logit contributions
 too+0.011
 very+0.011
 bored+0.009
 hungry+0.009
 called+0.009
Neg logit contributions
oun-0.011
ach-0.011
aman-0.011
 Kel-0.010
liv-0.010
Token her
Token ID607
Feature activation+1.66e-01

Pos logit contributions
 too+0.008
 very+0.007
 a+0.006
 so+0.005
 called+0.005
Neg logit contributions
aman-0.015
ach-0.015
oun-0.015
 Kel-0.015
liv-0.014
Token mom
Token ID1995
Feature activation-5.20e-01

Pos logit contributions
aman+0.009
ach+0.009
oun+0.009
 Kel+0.009
liv+0.008
Neg logit contributions
 too-0.005
 very-0.005
 a-0.004
 all-0.003
 bored-0.003
Token would
Token ID561
Feature activation+1.65e-01

Pos logit contributions
 too+0.013
 very+0.013
 a+0.010
 so+0.009
 called+0.009
Neg logit contributions
ach-0.028
aman-0.028
oun-0.027
 Kel-0.027
liv-0.026
Token always
Token ID1464
Feature activation-1.24e-01

Pos logit contributions
aman+0.008
ach+0.008
oun+0.008
 Kel+0.008
liv+0.008
Neg logit contributions
 too-0.005
 very-0.004
 a-0.003
 bored-0.003
 so-0.003
Token be
Token ID307
Feature activation-7.17e-01

Pos logit contributions
 too+0.004
 very+0.004
 a+0.003
 so+0.003
 called+0.003
Neg logit contributions
ach-0.006
aman-0.006
oun-0.006
 Kel-0.006
liv-0.006
Token there
Token ID612
Feature activation+1.60e+00

Pos logit contributions
 too+0.016
 very+0.016
 a+0.011
 called+0.011
 bored+0.010
Neg logit contributions
ach-0.036
aman-0.035
 Kel-0.034
z-0.033
ait-0.033
Token to
Token ID284
Feature activation+4.50e-01

Pos logit contributions
aman+0.092
ach+0.092
oun+0.091
 Kel+0.090
liv+0.089
Neg logit contributions
 too-0.042
 very-0.042
 a-0.032
 so-0.029
 all-0.025
Token help
Token ID1037
Feature activation+5.97e-01

Pos logit contributions
aman+0.021
ach+0.021
oun+0.021
 Kel+0.020
liv+0.020
Neg logit contributions
 too-0.014
 very-0.014
 a-0.012
 so-0.011
 called-0.010
Token her
Token ID607
Feature activation-3.50e-01

Pos logit contributions
oun+0.041
aman+0.040
liv+0.040
z+0.040
ach+0.039
Neg logit contributions
 very-0.011
 too-0.011
 a-0.008
 so-0.007
 all-0.006
Token.
Token ID13
Feature activation+8.57e-02

Pos logit contributions
 very+0.009
 too+0.009
 a+0.007
 so+0.006
 all+0.005
Neg logit contributions
aman-0.020
ach-0.020
oun-0.019
liv-0.019
 Kel-0.019
Token<|endoftext|>
Token ID50256
Feature activation-4.49e-01

Pos logit contributions
aman+0.004
oun+0.004
 Kel+0.004
ach+0.004
liv+0.004
Neg logit contributions
 too-0.002
 very-0.002
 a-0.002
 bored-0.001
 mean-0.001
Token others
Token ID1854
Feature activation-1.67e-01

Pos logit contributions
aman+0.027
liv+0.026
oun+0.026
z+0.026
ach+0.026
Neg logit contributions
 too-0.017
 very-0.017
 a-0.014
 so-0.013
 all-0.012
Token.
Token ID13
Feature activation+7.95e-01

Pos logit contributions
 too+0.003
 very+0.003
 a+0.002
 so+0.002
 called+0.001
Neg logit contributions
aman-0.010
ach-0.010
oun-0.010
 Kel-0.010
liv-0.010
Token If
Token ID1002
Feature activation+4.03e-01

Pos logit contributions
ach+0.033
aman+0.032
oun+0.032
 Kel+0.031
ait+0.029
Neg logit contributions
 very-0.029
 too-0.028
 a-0.023
 bored-0.023
 hungry-0.022
Token you
Token ID345
Feature activation-7.38e-01

Pos logit contributions
aman+0.020
oun+0.019
ach+0.019
liv+0.019
z+0.019
Neg logit contributions
 too-0.014
 very-0.013
 a-0.011
 so-0.010
 all-0.010
Token do
Token ID466
Feature activation+2.22e-02

Pos logit contributions
 too+0.023
 very+0.023
 a+0.018
 so+0.017
 called+0.016
Neg logit contributions
ach-0.034
aman-0.034
oun-0.033
 Kel-0.032
liv-0.032
Token something
Token ID1223
Feature activation+1.62e+00

Pos logit contributions
ach+0.001
aman+0.001
oun+0.001
 Kel+0.001
liv+0.001
Neg logit contributions
 too-0.001
 very-0.001
 a-0.000
 so-0.000
 called-0.000
Token wrong
Token ID2642
Feature activation-1.48e-01

Pos logit contributions
aman+0.106
z+0.106
ait+0.105
ach+0.103
liv+0.103
Neg logit contributions
 too-0.037
 very-0.036
 a-0.026
 so-0.023
 mean-0.019
Token,
Token ID11
Feature activation+2.19e-01

Pos logit contributions
 too+0.003
 very+0.002
 a+0.002
 so+0.001
 called+0.001
Neg logit contributions
aman-0.010
oun-0.009
liv-0.009
ach-0.009
 Kel-0.009
Token you
Token ID345
Feature activation-9.23e-01

Pos logit contributions
oun+0.012
liv+0.012
aman+0.012
z+0.012
ach+0.011
Neg logit contributions
 too-0.007
 very-0.006
 a-0.006
 so-0.005
 all-0.005
Token should
Token ID815
Feature activation+3.65e-01

Pos logit contributions
 too+0.026
 very+0.026
 a+0.021
 so+0.019
 bored+0.018
Neg logit contributions
ach-0.045
aman-0.044
oun-0.043
 Kel-0.042
liv-0.041
Token say
Token ID910
Feature activation-1.37e-01

Pos logit contributions
ach+0.019
aman+0.019
oun+0.019
 Kel+0.019
z+0.018
Neg logit contributions
 too-0.009
 very-0.009
 a-0.007
 so-0.006
 called-0.006
Token,
Token ID11
Feature activation-2.29e-01

Pos logit contributions
ach+0.006
aman+0.006
oun+0.006
 Kel+0.006
z+0.006
Neg logit contributions
 too-0.003
 very-0.003
 a-0.002
 so-0.002
 bored-0.002
Token a
Token ID257
Feature activation-8.75e-02

Pos logit contributions
 too+0.005
 very+0.005
 bored+0.003
 so+0.003
 called+0.003
Neg logit contributions
ach-0.012
 Kel-0.012
aman-0.012
oun-0.012
z-0.011
Token little
Token ID1310
Feature activation-1.60e+00

Pos logit contributions
 too+0.003
 very+0.003
 a+0.002
 so+0.002
 all+0.002
Neg logit contributions
aman-0.004
ach-0.004
oun-0.003
 Kel-0.003
becca-0.003
Token girl
Token ID2576
Feature activation-9.86e-01

Pos logit contributions
 very+0.057
 too+0.057
 a+0.049
 so+0.046
 all+0.045
Neg logit contributions
ach-0.060
aman-0.060
oun-0.059
 Kel-0.058
z-0.055
Token came
Token ID1625
Feature activation+1.08e+00

Pos logit contributions
 too+0.018
 very+0.017
 a+0.012
 so+0.010
 called+0.009
Neg logit contributions
ach-0.059
aman-0.059
oun-0.058
 Kel-0.057
liv-0.057
Token over
Token ID625
Feature activation+1.72e+00

Pos logit contributions
ach+0.045
aman+0.043
z+0.043
oun+0.042
 Kel+0.042
Neg logit contributions
 too-0.036
 very-0.035
 a-0.029
 bored-0.028
 so-0.027
Token to
Token ID284
Feature activation+4.74e-02

Pos logit contributions
aman+0.091
ach+0.090
oun+0.090
 Kel+0.088
liv+0.087
Neg logit contributions
 too-0.047
 very-0.046
 a-0.037
 so-0.032
 called-0.030
Token the
Token ID262
Feature activation-6.96e-01

Pos logit contributions
ach+0.002
aman+0.002
oun+0.002
 Kel+0.002
liv+0.002
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.001
 called-0.001
Token t
Token ID256
Feature activation+2.69e-01

Pos logit contributions
 too+0.024
 very+0.023
 a+0.019
 so+0.018
 called+0.017
Neg logit contributions
ach-0.032
aman-0.032
oun-0.031
 Kel-0.030
liv-0.030
Tokeneddy
Token ID21874
Feature activation-1.28e+00

Pos logit contributions
aman+0.004
ach+0.004
 Kel+0.004
oun+0.004
liv+0.004
Neg logit contributions
 too-0.017
 very-0.016
 a-0.015
 so-0.014
 called-0.014
Token bear
Token ID6842
Feature activation-6.57e-01

Pos logit contributions
 very+0.047
 too+0.044
 bored+0.041
 mean+0.039
 much+0.039
Neg logit contributions
aman-0.047
con-0.044
z-0.044
ach-0.043
oun-0.043
TokenHey
Token ID10814
Feature activation+1.01e+00

Pos logit contributions
aman+0.041
ach+0.040
oun+0.039
liv+0.038
ait+0.038
Neg logit contributions
 too-0.038
 very-0.037
 a-0.033
 decided-0.029
 so-0.028
Token,
Token ID11
Feature activation+7.92e-01

Pos logit contributions
aman+0.060
ach+0.060
oun+0.059
 Kel+0.058
liv+0.058
Neg logit contributions
 too-0.020
 very-0.019
 a-0.013
 so-0.011
 called-0.010
Token dog
Token ID3290
Feature activation+3.06e-01

Pos logit contributions
ach+0.041
aman+0.041
z+0.039
oun+0.039
liv+0.039
Neg logit contributions
 too-0.024
 very-0.023
 a-0.019
 so-0.017
 all-0.016
Token,
Token ID11
Feature activation+1.22e+00

Pos logit contributions
aman+0.017
ach+0.017
oun+0.017
 Kel+0.017
liv+0.017
Neg logit contributions
 too-0.007
 very-0.007
 a-0.005
 so-0.005
 called-0.005
Token return
Token ID1441
Feature activation+6.61e-01

Pos logit contributions
aman+0.066
ach+0.066
oun+0.064
liv+0.063
z+0.063
Neg logit contributions
 too-0.034
 very-0.033
 a-0.027
 so-0.024
 all-0.022
Token our
Token ID674
Feature activation+1.58e+00

Pos logit contributions
ach+0.035
aman+0.035
oun+0.034
 Kel+0.034
liv+0.034
Neg logit contributions
 too-0.020
 very-0.018
 a-0.016
 so-0.014
 all-0.013
Token ball
Token ID2613
Feature activation+2.54e-01

Pos logit contributions
aman+0.074
ach+0.072
 Kel+0.070
liv+0.069
oun+0.069
Neg logit contributions
 too-0.057
 very-0.055
 a-0.046
 so-0.042
 all-0.041
Token!"
Token ID2474
Feature activation+3.24e-01

Pos logit contributions
aman+0.013
 Kel+0.012
ach+0.012
liv+0.012
oun+0.012
Neg logit contributions
 too-0.009
 very-0.008
 a-0.007
 so-0.006
 all-0.006
Token Sara
Token ID24799
Feature activation+8.82e-01

Pos logit contributions
ach+0.016
aman+0.015
 Kel+0.015
oun+0.015
liv+0.014
Neg logit contributions
 too-0.009
 very-0.009
 a-0.007
 so-0.007
 bored-0.006
Token shouts
Token ID34757
Feature activation+5.83e-01

Pos logit contributions
ach+0.033
aman+0.031
oun+0.031
 Kel+0.030
z+0.029
Neg logit contributions
 very-0.033
 too-0.032
 a-0.028
 so-0.026
 bored-0.025
Token.
Token ID13
Feature activation-2.01e-01

Pos logit contributions
ach+0.031
oun+0.030
 Kel+0.029
z+0.029
less+0.029
Neg logit contributions
 very-0.012
 too-0.011
 called-0.008
 so-0.008
 a-0.008
Token."
Token ID526
Feature activation+3.56e-02

Pos logit contributions
 very+0.010
 too+0.010
 bored+0.008
 a+0.008
 hungry+0.008
Neg logit contributions
ach-0.019
oun-0.019
ait-0.018
aman-0.018
less-0.017
Token
Token ID198
Feature activation+3.13e-01

Pos logit contributions
ach+0.002
oun+0.002
aman+0.002
 Kel+0.002
liv+0.002
Neg logit contributions
 very-0.001
 too-0.001
 a-0.001
 so-0.001
 bored-0.000
Token
Token ID198
Feature activation+2.65e-01

Pos logit contributions
ach+0.018
oun+0.017
aman+0.017
 Kel+0.017
z+0.017
Neg logit contributions
 too-0.007
 very-0.006
 a-0.004
 so-0.004
 bored-0.004
TokenAnna
Token ID31160
Feature activation+5.26e-01

Pos logit contributions
aman+0.010
ach+0.009
oun+0.009
 Kel+0.009
liv+0.009
Neg logit contributions
 too-0.012
 very-0.012
 a-0.010
 so-0.009
 called-0.009
Token forg
Token ID5784
Feature activation+3.97e-01

Pos logit contributions
aman+0.020
ach+0.020
oun+0.020
 Kel+0.019
z+0.019
Neg logit contributions
 very-0.019
 too-0.019
 a-0.015
 so-0.015
 called-0.015
Tokenives
Token ID1083
Feature activation+1.83e+00

Pos logit contributions
oun+0.008
ach+0.008
z+0.007
liv+0.007
aman+0.007
Neg logit contributions
 very-0.022
 too-0.022
 a-0.019
 so-0.019
 hungry-0.019
Token Tom
Token ID4186
Feature activation+2.77e-01

Pos logit contributions
ach+0.084
aman+0.083
oun+0.082
 Kel+0.080
liv+0.078
Neg logit contributions
 very-0.056
 too-0.055
 a-0.044
 so-0.043
 bored-0.041
Token.
Token ID13
Feature activation-1.50e-01

Pos logit contributions
ach+0.015
aman+0.015
oun+0.015
z+0.014
 Kel+0.014
Neg logit contributions
 very-0.005
 too-0.005
 a-0.003
 so-0.003
 bored-0.003
Token She
Token ID1375
Feature activation+5.62e-01

Pos logit contributions
 very+0.004
 too+0.004
 a+0.003
 so+0.003
 called+0.003
Neg logit contributions
aman-0.007
oun-0.007
ach-0.007
 Kel-0.007
liv-0.007
Token says
Token ID1139
Feature activation-2.40e-01

Pos logit contributions
aman+0.021
ach+0.021
oun+0.021
 Kel+0.020
liv+0.020
Neg logit contributions
 too-0.021
 very-0.021
 a-0.017
 so-0.017
 bored-0.016
Token he
Token ID339
Feature activation+5.34e-01

Pos logit contributions
 very+0.007
 too+0.006
 called+0.005
 a+0.005
 so+0.005
Neg logit contributions
ach-0.011
oun-0.011
aman-0.011
 Kel-0.011
z-0.011
Token trouble
Token ID5876
Feature activation+6.55e-01

Pos logit contributions
oun+0.033
aman+0.033
ach+0.032
liv+0.032
 Kel+0.031
Neg logit contributions
 too-0.022
 very-0.022
 a-0.019
 so-0.016
 all-0.016
Token.
Token ID13
Feature activation+5.31e-01

Pos logit contributions
aman+0.043
oun+0.042
liv+0.042
 Kel+0.041
ach+0.041
Neg logit contributions
 too-0.012
 very-0.011
 a-0.007
 so-0.007
 all-0.005
Token<|endoftext|>
Token ID50256
Feature activation-2.76e-01

Pos logit contributions
ach+0.026
aman+0.026
oun+0.026
 Kel+0.025
ait+0.024
Neg logit contributions
 very-0.015
 too-0.014
 a-0.011
 bored-0.011
 all-0.010
TokenOnce
Token ID7454
Feature activation+6.87e-01

Pos logit contributions
 too+0.007
 very+0.007
 a+0.005
 so+0.005
 bored+0.005
Neg logit contributions
ach-0.014
oun-0.013
aman-0.013
z-0.013
 Kel-0.013
Token upon
Token ID2402
Feature activation-5.17e-01

Pos logit contributions
ach+0.028
aman+0.028
oun+0.028
 Kel+0.027
liv+0.027
Neg logit contributions
 too-0.025
 very-0.025
 a-0.021
 so-0.019
 called-0.019
Token a
Token ID257
Feature activation+6.62e-01

Pos logit contributions
 too+0.008
 very+0.007
 bored+0.006
 called+0.006
 tired+0.005
Neg logit contributions
ach-0.027
oun-0.026
z-0.026
 Kel-0.026
liv-0.025
Token time
Token ID640
Feature activation+2.15e-01

Pos logit contributions
ach+0.030
aman+0.030
 Kel+0.030
oun+0.029
liv+0.028
Neg logit contributions
 too-0.022
 very-0.022
 a-0.018
 so-0.016
 called-0.016
Token there
Token ID612
Feature activation+1.10e+00

Pos logit contributions
aman+0.013
ach+0.013
oun+0.013
 Kel+0.013
liv+0.012
Neg logit contributions
 too-0.004
 very-0.004
 a-0.003
 so-0.002
 called-0.002
Token was
Token ID373
Feature activation-4.51e-02

Pos logit contributions
aman+0.064
ach+0.064
oun+0.062
ait+0.061
becca+0.061
Neg logit contributions
 very-0.025
 too-0.025
 a-0.020
 so-0.016
 called-0.015
Token a
Token ID257
Feature activation+2.16e-01

Pos logit contributions
 too+0.001
 very+0.000
 bored+0.000
 called+0.000
 tired+0.000
Neg logit contributions
ach-0.003
aman-0.003
oun-0.003
 Kel-0.003
liv-0.003
Token cool
Token ID3608
Feature activation-3.80e-01

Pos logit contributions
oun+0.008
ach+0.008
aman+0.008
 Kel+0.008
becca+0.008
Neg logit contributions
 too-0.008
 very-0.008
 a-0.007
 so-0.006
 called-0.006
Token friends
Token ID2460
Feature activation-6.63e-01

Pos logit contributions
 very+0.004
 too+0.004
 a+0.003
 so+0.003
 all+0.003
Neg logit contributions
aman-0.008
ach-0.008
oun-0.008
 Kel-0.008
liv-0.008
Token.
Token ID13
Feature activation-2.51e-01

Pos logit contributions
 too+0.011
 very+0.011
 a+0.007
 so+0.006
 bored+0.006
Neg logit contributions
ach-0.040
aman-0.040
oun-0.039
 Kel-0.038
z-0.038
Token One
Token ID1881
Feature activation-1.24e-01

Pos logit contributions
 very+0.005
 too+0.005
 a+0.003
 bored+0.003
 so+0.003
Neg logit contributions
aman-0.014
oun-0.014
ach-0.014
 Kel-0.014
liv-0.014
Token day
Token ID1110
Feature activation+8.36e-02

Pos logit contributions
 too+0.004
 very+0.004
 bored+0.003
 a+0.003
 so+0.003
Neg logit contributions
aman-0.005
ach-0.005
oun-0.005
 Kel-0.005
liv-0.005
Token,
Token ID11
Feature activation+9.68e-02

Pos logit contributions
ach+0.005
aman+0.005
oun+0.005
 Kel+0.005
z+0.004
Neg logit contributions
 too-0.002
 very-0.002
 a-0.001
 so-0.001
 bored-0.001
Token Tim
Token ID5045
Feature activation+5.94e-01

Pos logit contributions
aman+0.004
ach+0.004
 Kel+0.004
oun+0.004
liv+0.004
Neg logit contributions
 too-0.003
 very-0.003
 a-0.002
 bored-0.002
 so-0.002
Tokenmy
Token ID1820
Feature activation-1.16e-02

Pos logit contributions
aman+0.029
ach+0.029
oun+0.028
z+0.028
liv+0.027
Neg logit contributions
 too-0.016
 very-0.016
 a-0.012
 so-0.011
 bored-0.011
Token and
Token ID290
Feature activation-5.88e-01

Pos logit contributions
 too+0.000
 very+0.000
 a+0.000
 so+0.000
 called+0.000
Neg logit contributions
ach-0.001
aman-0.001
oun-0.001
 Kel-0.001
liv-0.001
Token his
Token ID465
Feature activation+4.88e-01

Pos logit contributions
 too+0.013
 very+0.013
 a+0.008
 so+0.008
 bored+0.008
Neg logit contributions
aman-0.032
 Kel-0.032
ach-0.032
oun-0.032
liv-0.030
Token friends
Token ID2460
Feature activation-5.13e-01

Pos logit contributions
aman+0.026
ach+0.026
oun+0.025
 Kel+0.025
liv+0.025
Neg logit contributions
 too-0.013
 very-0.013
 a-0.010
 so-0.009
 called-0.008
Token were
Token ID547
Feature activation-5.40e-01

Pos logit contributions
 too+0.011
 very+0.010
 a+0.007
 so+0.006
 bored+0.006
Neg logit contributions
aman-0.029
ach-0.029
oun-0.028
 Kel-0.028
liv-0.027
Token,
Token ID11
Feature activation+2.87e-01

Pos logit contributions
aman+0.013
ach+0.013
oun+0.013
 Kel+0.012
liv+0.012
Neg logit contributions
 too-0.006
 very-0.006
 a-0.004
 so-0.004
 all-0.004
Token and
Token ID290
Feature activation-1.28e+00

Pos logit contributions
ach+0.017
oun+0.017
aman+0.017
liv+0.016
z+0.016
Neg logit contributions
 too-0.006
 very-0.006
 a-0.005
 so-0.004
 called-0.004
Token finally
Token ID3443
Feature activation-2.24e-01

Pos logit contributions
 too+0.027
 very+0.026
 a+0.020
 so+0.017
 called+0.016
Neg logit contributions
ach-0.076
aman-0.076
oun-0.075
liv-0.073
 Kel-0.072
Token stopped
Token ID5025
Feature activation+1.93e-01

Pos logit contributions
 too+0.005
 very+0.005
 a+0.004
 so+0.004
 bored+0.003
Neg logit contributions
ach-0.012
aman-0.012
oun-0.012
 Kel-0.012
liv-0.011
Token to
Token ID284
Feature activation+4.41e-02

Pos logit contributions
aman+0.011
ach+0.010
oun+0.010
liv+0.010
 Kel+0.010
Neg logit contributions
 too-0.005
 very-0.005
 a-0.004
 so-0.003
 called-0.003
Token rest
Token ID1334
Feature activation+2.66e-01

Pos logit contributions
aman+0.002
ach+0.002
oun+0.002
 Kel+0.002
liv+0.002
Neg logit contributions
 too-0.002
 very-0.002
 a-0.001
 so-0.001
 bored-0.001
Token.
Token ID13
Feature activation+4.40e-02

Pos logit contributions
aman+0.017
oun+0.017
ach+0.017
 Kel+0.016
liv+0.016
Neg logit contributions
 too-0.004
 very-0.004
 a-0.003
 so-0.002
 all-0.002
Token With
Token ID2080
Feature activation-9.55e-01

Pos logit contributions
ach+0.003
aman+0.002
oun+0.002
 Kel+0.002
liv+0.002
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.001
 called-0.001
Token a
Token ID257
Feature activation-6.24e-02

Pos logit contributions
 too+0.016
 very+0.016
 a+0.010
 so+0.008
 called+0.008
Neg logit contributions
aman-0.058
ach-0.058
oun-0.056
 Kel-0.056
liv-0.055
Token final
Token ID2457
Feature activation+1.90e-01

Pos logit contributions
 very+0.002
 too+0.002
 a+0.002
 so+0.001
 called+0.001
Neg logit contributions
ach-0.003
aman-0.003
oun-0.003
 Kel-0.003
liv-0.003
Token scatter
Token ID41058
Feature activation+9.80e-02

Pos logit contributions
ach+0.009
aman+0.009
oun+0.009
liv+0.009
 Kel+0.009
Neg logit contributions
 too-0.006
 very-0.006
 a-0.005
 so-0.005
 hungry-0.004
Token can
Token ID460
Feature activation+3.36e-01

Pos logit contributions
 very+0.021
 too+0.021
 a+0.016
 bored+0.015
 called+0.015
Neg logit contributions
oun-0.034
ach-0.034
aman-0.034
 Kel-0.033
liv-0.032
Token't
Token ID470
Feature activation-3.05e-01

Pos logit contributions
aman+0.017
ach+0.017
oun+0.016
 Kel+0.016
ait+0.016
Neg logit contributions
 too-0.010
 very-0.010
 a-0.008
 so-0.007
 all-0.007
Token.
Token ID13
Feature activation+8.84e-01

Pos logit contributions
 too+0.006
 very+0.006
 a+0.004
 so+0.004
 called+0.003
Neg logit contributions
aman-0.019
ach-0.018
oun-0.018
 Kel-0.018
liv-0.018
Token This
Token ID770
Feature activation+1.55e-01

Pos logit contributions
aman+0.040
ach+0.040
oun+0.038
 Kel+0.038
liv+0.038
Neg logit contributions
 too-0.030
 very-0.029
 a-0.024
 so-0.022
 called-0.021
Token is
Token ID318
Feature activation-5.73e-01

Pos logit contributions
ach+0.008
aman+0.008
 Kel+0.007
oun+0.007
ait+0.007
Neg logit contributions
 too-0.005
 very-0.005
 a-0.004
 so-0.004
 called-0.003
Token our
Token ID674
Feature activation+7.83e-01

Pos logit contributions
 too+0.019
 very+0.019
 a+0.015
 so+0.014
 bored+0.014
Neg logit contributions
ach-0.026
aman-0.025
oun-0.025
 Kel-0.024
liv-0.024
Token garden
Token ID11376
Feature activation-1.51e-02

Pos logit contributions
liv+0.038
aman+0.038
 Kel+0.038
z+0.037
ach+0.037
Neg logit contributions
 very-0.028
 too-0.027
 a-0.023
 all-0.021
 so-0.019
Token.
Token ID13
Feature activation+3.07e-01

Pos logit contributions
 too+0.000
 very+0.000
 a+0.000
 so+0.000
 called+0.000
Neg logit contributions
aman-0.001
ach-0.001
oun-0.001
 Kel-0.001
liv-0.001
Token You
Token ID921
Feature activation-1.06e+00

Pos logit contributions
ach+0.013
aman+0.013
oun+0.013
 Kel+0.013
liv+0.012
Neg logit contributions
 very-0.011
 too-0.011
 a-0.008
 so-0.008
 bored-0.008
Token are
Token ID389
Feature activation-6.17e-01

Pos logit contributions
 too+0.025
 very+0.025
 a+0.019
 so+0.016
 called+0.015
Neg logit contributions
ach-0.058
aman-0.058
oun-0.056
 Kel-0.056
liv-0.055
Token not
Token ID407
Feature activation-2.29e-01

Pos logit contributions
 too+0.013
 very+0.013
 a+0.009
 so+0.008
 bored+0.007
Neg logit contributions
ach-0.034
aman-0.033
oun-0.033
 Kel-0.033
liv-0.032
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
oun+0.002
ach+0.002
aman+0.002
liv+0.002
 Kel+0.002
Neg logit contributions
 too-0.001
 very-0.001
 a-0.000
 so-0.000
 all-0.000
Token They
Token ID1119
Feature activation-6.67e-01

Pos logit contributions
oun+0.057
aman+0.056
ach+0.056
 Kel+0.055
liv+0.054
Neg logit contributions
 too-0.025
 very-0.024
 a-0.018
 bored-0.016
 called-0.016
Token loved
Token ID6151
Feature activation-2.13e-01

Pos logit contributions
 too+0.020
 very+0.018
 a+0.015
 bored+0.014
 hungry+0.014
Neg logit contributions
aman-0.030
oun-0.029
 Kel-0.029
ach-0.028
ait-0.028
Token playing
Token ID2712
Feature activation-3.63e-01

Pos logit contributions
 very+0.006
 too+0.006
 a+0.004
 so+0.004
 all+0.004
Neg logit contributions
aman-0.012
ach-0.012
oun-0.012
liv-0.012
 Kel-0.011
Token together
Token ID1978
Feature activation+6.04e-02

Pos logit contributions
 too+0.011
 very+0.011
 a+0.009
 so+0.008
 all+0.008
Neg logit contributions
aman-0.018
ach-0.017
oun-0.017
 Kel-0.017
liv-0.016
Token and
Token ID290
Feature activation+3.96e-01

Pos logit contributions
aman+0.004
oun+0.004
 Kel+0.004
ach+0.004
liv+0.004
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 so-0.001
 all-0.001
Token having
Token ID1719
Feature activation-1.20e+00

Pos logit contributions
ach+0.019
aman+0.019
oun+0.019
 Kel+0.019
liv+0.018
Neg logit contributions
 too-0.011
 very-0.011
 a-0.009
 so-0.008
 called-0.008
Token y
Token ID331
Feature activation+8.89e-01

Pos logit contributions
 too+0.031
 very+0.029
 a+0.021
 called+0.020
 bored+0.020
Neg logit contributions
ach-0.061
aman-0.061
oun-0.058
 Kel-0.058
liv-0.058
Tokenummy
Token ID13513
Feature activation+3.70e-01

Pos logit contributions
 Kel+0.021
aman+0.020
oun+0.019
liv+0.019
z+0.018
Neg logit contributions
 too-0.049
 very-0.048
 a-0.043
 so-0.041
 all-0.041
Token snacks
Token ID28487
Feature activation+6.44e-01

Pos logit contributions
aman+0.014
ach+0.014
oun+0.014
 Kel+0.014
liv+0.014
Neg logit contributions
 too-0.016
 very-0.016
 a-0.014
 so-0.013
 all-0.012
Token afterwards
Token ID12979
Feature activation+2.85e-01

Pos logit contributions
aman+0.041
ach+0.041
 Kel+0.041
oun+0.040
liv+0.039
Neg logit contributions
 too-0.012
 very-0.011
 a-0.007
 so-0.007
 all-0.006
Token not
Token ID407
Feature activation-2.52e-01

Pos logit contributions
ach+0.000
aman+0.000
 Kel+0.000
oun+0.000
liv+0.000
Neg logit contributions
 very-0.000
 too-0.000
 a-0.000
 decided-0.000
 called-0.000
Token okay
Token ID8788
Feature activation+1.04e+00

Pos logit contributions
 very+0.006
 too+0.006
 a+0.005
 called+0.005
 so+0.004
Neg logit contributions
ach-0.013
aman-0.012
oun-0.012
z-0.012
 Kel-0.012
Token.
Token ID13
Feature activation+6.61e-01

Pos logit contributions
ach+0.060
aman+0.057
oun+0.056
z+0.056
 Kel+0.055
Neg logit contributions
 very-0.020
 too-0.019
 a-0.013
 called-0.013
 so-0.012
Token My
Token ID2011
Feature activation+2.82e-01

Pos logit contributions
aman+0.035
ach+0.034
oun+0.032
liv+0.032
z+0.032
Neg logit contributions
 too-0.019
 very-0.018
 a-0.015
 so-0.013
 bored-0.012
Token knee
Token ID10329
Feature activation-5.66e-01

Pos logit contributions
aman+0.013
ach+0.012
oun+0.012
 Kel+0.012
liv+0.012
Neg logit contributions
 too-0.010
 very-0.010
 a-0.008
 so-0.008
 bored-0.007
Token is
Token ID318
Feature activation+7.19e-03

Pos logit contributions
 too+0.021
 very+0.021
 a+0.018
 so+0.017
 called+0.016
Neg logit contributions
ach-0.022
aman-0.022
oun-0.022
 Kel-0.021
z-0.021
Token red
Token ID2266
Feature activation-1.05e+00

Pos logit contributions
ach+0.000
aman+0.000
oun+0.000
 Kel+0.000
liv+0.000
Neg logit contributions
 too-0.000
 very-0.000
 a-0.000
 so-0.000
 called-0.000
Token and
Token ID290
Feature activation+5.73e-01

Pos logit contributions
 very+0.020
 too+0.019
 a+0.014
 called+0.014
 bored+0.013
Neg logit contributions
ach-0.057
oun-0.056
aman-0.055
z-0.054
less-0.052
Token sore
Token ID19597
Feature activation-1.00e-01

Pos logit contributions
aman+0.024
ach+0.024
oun+0.024
 Kel+0.024
liv+0.023
Neg logit contributions
 too-0.019
 very-0.019
 a-0.015
 so-0.015
 called-0.014
Token.
Token ID13
Feature activation+7.91e-01

Pos logit contributions
 very+0.002
 too+0.002
 named+0.002
 called+0.002
 bored+0.002
Neg logit contributions
ach-0.005
oun-0.005
less-0.005
z-0.005
 degrees-0.004
Token It
Token ID632
Feature activation-2.08e-02

Pos logit contributions
ach+0.038
oun+0.037
aman+0.037
 Kel+0.037
liv+0.035
Neg logit contributions
 very-0.023
 too-0.023
 a-0.018
 so-0.017
 called-0.016
Token and
Token ID290
Feature activation-5.06e-01

Pos logit contributions
 too+0.006
 very+0.006
 a+0.004
 so+0.004
 called+0.004
Neg logit contributions
aman-0.012
ach-0.012
oun-0.012
liv-0.012
 Kel-0.011
Token get
Token ID651
Feature activation-4.50e-01

Pos logit contributions
 too+0.014
 very+0.014
 a+0.011
 so+0.010
 called+0.010
Neg logit contributions
ach-0.028
aman-0.027
oun-0.027
umm-0.026
liv-0.026
Token help
Token ID1037
Feature activation+6.10e-01

Pos logit contributions
 too+0.010
 very+0.010
 a+0.008
 so+0.007
 tired+0.006
Neg logit contributions
aman-0.027
ach-0.027
oun-0.026
liv-0.026
becca-0.025
Token from
Token ID422
Feature activation-1.34e-01

Pos logit contributions
oun+0.044
aman+0.044
liv+0.044
z+0.043
becca+0.043
Neg logit contributions
 too-0.008
 very-0.007
 a-0.004
 so-0.003
.-0.002
Token a
Token ID257
Feature activation+4.96e-02

Pos logit contributions
 too+0.003
 very+0.003
 a+0.003
 so+0.002
 all+0.002
Neg logit contributions
oun-0.008
aman-0.008
ach-0.008
z-0.008
liv-0.007
Token grown
Token ID7334
Feature activation-8.76e-01

Pos logit contributions
ach+0.002
oun+0.002
liv+0.002
aman+0.002
 Kel+0.002
Neg logit contributions
 very-0.002
 too-0.002
 a-0.001
 so-0.001
 tired-0.001
Token up
Token ID510
Feature activation+5.56e-01

Pos logit contributions
 too+0.040
 very+0.040
 a+0.035
 so+0.033
 called+0.032
Neg logit contributions
aman-0.029
ach-0.029
oun-0.027
 Kel-0.027
liv-0.026
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
oun+0.035
aman+0.034
ach+0.034
liv+0.034
becca+0.033
Neg logit contributions
 too-0.012
 very-0.012
 a-0.009
 so-0.008
 all-0.007
Token Belle
Token ID31974
Feature activation+6.82e-01

Pos logit contributions
 too+0.007
 very+0.007
 a+0.005
 so+0.004
 called+0.004
Neg logit contributions
aman-0.019
ach-0.019
oun-0.018
 Kel-0.018
liv-0.018
Token ran
Token ID4966
Feature activation-1.65e-01

Pos logit contributions
ach+0.040
umm+0.040
aman+0.040
oun+0.039
liv+0.037
Neg logit contributions
 very-0.016
 too-0.016
 a-0.012
 called-0.011
 so-0.011
Token as
Token ID355
Feature activation+1.49e-01

Pos logit contributions
 too+0.005
 very+0.005
 a+0.003
 so+0.003
 all+0.003
Neg logit contributions
aman-0.010
ach-0.009
umm-0.009
liv-0.009
 degrees-0.009
Token liked
Token ID8288
Feature activation-5.58e-01

Pos logit contributions
ach+0.025
aman+0.024
oun+0.024
umm+0.024
 Kel+0.023
Neg logit contributions
 very-0.008
 too-0.007
 a-0.005
 called-0.005
 so-0.004
Token pink
Token ID11398
Feature activation-1.09e+00

Pos logit contributions
 too+0.017
 very+0.017
 a+0.014
 so+0.012
 all+0.012
Neg logit contributions
aman-0.028
ach-0.028
oun-0.028
liv-0.027
z-0.027
Token and
Token ID290
Feature activation+5.17e-01

Pos logit contributions
 too+0.038
 very+0.037
 a+0.031
 so+0.029
 called+0.028
Neg logit contributions
ach-0.048
aman-0.047
oun-0.046
 Kel-0.045
liv-0.044
Token flowers
Token ID12734
Feature activation+2.62e-01

Pos logit contributions
 Kel+0.022
oun+0.022
liv+0.021
ach+0.021
becca+0.021
Neg logit contributions
 too-0.017
 very-0.015
 so-0.012
 a-0.012
 bored-0.012
Token and
Token ID290
Feature activation+1.38e-01

Pos logit contributions
ach+0.012
z+0.011
 Kel+0.011
aman+0.011
oun+0.011
Neg logit contributions
 too-0.005
 Amen-0.005
 bored-0.005
 very-0.005
 afraid-0.004
Token dolls
Token ID36062
Feature activation-3.16e-01

Pos logit contributions
 Kel+0.006
aman+0.006
ach+0.006
oun+0.006
becca+0.006
Neg logit contributions
 too-0.004
 very-0.004
 so-0.003
 bored-0.003
 tired-0.003
Token.
Token ID13
Feature activation-1.83e-02

Pos logit contributions
 too+0.006
 very+0.006
 a+0.004
 bored+0.004
 hungry+0.004
Neg logit contributions
ach-0.017
aman-0.016
z-0.016
oun-0.016
ait-0.016
Token Tom
Token ID4186
Feature activation+7.56e-01

Pos logit contributions
 too+0.000
 very+0.000
 a+0.000
 so+0.000
 called+0.000
Neg logit contributions
aman-0.001
ach-0.001
oun-0.001
 Kel-0.001
liv-0.001
Token liked
Token ID8288
Feature activation-4.66e-01

Pos logit contributions
ach+0.047
aman+0.047
umm+0.044
oun+0.044
 Kel+0.044
Neg logit contributions
 very-0.016
 too-0.015
 a-0.011
 called-0.009
 so-0.009
Token blue
Token ID4171
Feature activation-8.33e-01

Pos logit contributions
 too+0.014
 very+0.014
 a+0.011
 so+0.010
 all+0.009
Neg logit contributions
aman-0.024
ach-0.024
oun-0.023
liv-0.023
 Kel-0.022
Token and
Token ID290
Feature activation+3.46e-01

Pos logit contributions
 very+0.031
 too+0.031
 a+0.026
 bored+0.025
 so+0.025
Neg logit contributions
ach-0.033
aman-0.031
oun-0.030
 Kel-0.030
z-0.029
Token Thank
Token ID6952
Feature activation-8.24e-01

Pos logit contributions
ach+0.034
aman+0.033
oun+0.033
 Kel+0.032
ait+0.031
Neg logit contributions
 very-0.023
 too-0.023
 a-0.018
 so-0.017
 bored-0.017
Token you
Token ID345
Feature activation-1.40e-01

Pos logit contributions
 too+0.031
 very+0.030
 a+0.025
 so+0.024
 bored+0.023
Neg logit contributions
ach-0.034
aman-0.033
oun-0.032
 Kel-0.031
z-0.031
Token for
Token ID329
Feature activation+4.71e-01

Pos logit contributions
 very+0.004
 too+0.004
 bored+0.003
 a+0.003
 hungry+0.003
Neg logit contributions
ach-0.006
aman-0.006
oun-0.006
ait-0.006
 Kel-0.006
Token being
Token ID852
Feature activation-6.84e-01

Pos logit contributions
ach+0.021
aman+0.021
 Kel+0.020
oun+0.020
liv+0.019
Neg logit contributions
 too-0.015
 very-0.015
 a-0.012
 bored-0.011
 so-0.011
Token so
Token ID523
Feature activation+6.84e-01

Pos logit contributions
 too+0.018
 very+0.018
 a+0.014
 so+0.012
 called+0.012
Neg logit contributions
ach-0.035
aman-0.035
oun-0.034
 Kel-0.033
liv-0.033
Token nice
Token ID3621
Feature activation-8.23e-01

Pos logit contributions
aman+0.036
ach+0.036
oun+0.035
 Kel+0.035
liv+0.034
Neg logit contributions
 too-0.018
 very-0.017
 a-0.013
 so-0.012
 called-0.011
Token and
Token ID290
Feature activation+7.18e-01

Pos logit contributions
 too+0.017
 very+0.016
 a+0.012
 so+0.010
 called+0.009
Neg logit contributions
ach-0.048
aman-0.048
oun-0.047
 Kel-0.046
liv-0.045
Token helpful
Token ID7613
Feature activation-3.64e-02

Pos logit contributions
aman+0.034
ach+0.033
oun+0.033
 Kel+0.032
liv+0.032
Neg logit contributions
 too-0.023
 very-0.023
 a-0.019
 so-0.017
 called-0.017
Token."
Token ID526
Feature activation-6.52e-01

Pos logit contributions
 too+0.001
 very+0.001
 a+0.000
 so+0.000
 called+0.000
Neg logit contributions
aman-0.002
liv-0.002
 Kel-0.002
ach-0.002
oun-0.002
Token
Token ID198
Feature activation-2.28e-01

Pos logit contributions
 very+0.016
 too+0.016
 mean+0.011
 a+0.011
 so+0.011
Neg logit contributions
aman-0.033
ach-0.032
oun-0.032
 Kel-0.031
ait-0.030
Token
Token ID198
Feature activation-9.81e-02

Pos logit contributions
 too+0.005
 very+0.004
 a+0.003
 so+0.003
 bored+0.002
Neg logit contributions
ach-0.013
aman-0.013
oun-0.013
 Kel-0.013
liv-0.013
Token Your
Token ID3406
Feature activation-9.49e-01

Pos logit contributions
aman+0.042
ach+0.042
oun+0.041
 Kel+0.040
liv+0.040
Neg logit contributions
 too-0.028
 very-0.028
 a-0.023
 so-0.021
 called-0.020
Token words
Token ID2456
Feature activation-4.07e-01

Pos logit contributions
 too+0.036
 very+0.036
 a+0.031
 bored+0.028
 so+0.028
Neg logit contributions
aman-0.042
ach-0.041
oun-0.041
liv-0.039
 Kel-0.039
Token are
Token ID389
Feature activation-5.72e-01

Pos logit contributions
 too+0.010
 very+0.010
 a+0.008
 so+0.007
 called+0.006
Neg logit contributions
aman-0.022
ach-0.022
oun-0.021
 Kel-0.021
liv-0.021
Token hurt
Token ID5938
Feature activation-6.04e-01

Pos logit contributions
 too+0.012
 very+0.012
 a+0.008
 so+0.007
 called+0.007
Neg logit contributions
ach-0.033
aman-0.032
oun-0.032
 Kel-0.031
liv-0.031
Tokenful
Token ID913
Feature activation-5.65e-01

Pos logit contributions
 too+0.012
 very+0.012
 a+0.008
 bored+0.007
 hungry+0.007
Neg logit contributions
ach-0.034
aman-0.033
oun-0.033
 Kel-0.032
z-0.032
Token and
Token ID290
Feature activation-1.17e-01

Pos logit contributions
 very+0.011
 too+0.011
 a+0.008
 called+0.007
 bored+0.007
Neg logit contributions
ach-0.031
oun-0.030
aman-0.030
 Kel-0.029
z-0.029
Token rude
Token ID22066
Feature activation-7.86e-01

Pos logit contributions
 too+0.003
 very+0.003
 a+0.003
 so+0.003
 called+0.002
Neg logit contributions
ach-0.005
aman-0.005
oun-0.005
 Kel-0.005
liv-0.005
Token.
Token ID13
Feature activation+8.80e-01

Pos logit contributions
 too+0.014
 very+0.014
 a+0.009
 so+0.007
 called+0.007
Neg logit contributions
ach-0.047
aman-0.047
oun-0.046
 Kel-0.045
liv-0.045
Token You
Token ID921
Feature activation-1.00e+00

Pos logit contributions
ach+0.038
aman+0.037
oun+0.037
 Kel+0.037
liv+0.035
Neg logit contributions
 too-0.030
 very-0.030
 a-0.025
 so-0.023
 bored-0.022
Token have
Token ID423
Feature activation-1.01e-01

Pos logit contributions
 too+0.026
 very+0.025
 a+0.019
 so+0.018
 hungry+0.017
Neg logit contributions
ach-0.050
oun-0.049
aman-0.048
 Kel-0.048
ait-0.046
Token no
Token ID645
Feature activation+2.82e-02

Pos logit contributions
 too+0.003
 very+0.003
 a+0.002
 so+0.002
 all+0.002
Neg logit contributions
ach-0.005
aman-0.005
oun-0.005
 Kel-0.005
liv-0.005
Token ran
Token ID4966
Feature activation-6.53e-01

Pos logit contributions
 too+0.011
 very+0.010
 a+0.008
 so+0.007
 called+0.006
Neg logit contributions
ach-0.025
aman-0.025
oun-0.024
 Kel-0.024
liv-0.024
Token to
Token ID284
Feature activation-5.87e-01

Pos logit contributions
 too+0.019
 very+0.019
 a+0.015
 so+0.014
 all+0.013
Neg logit contributions
aman-0.034
ach-0.034
oun-0.033
 Kel-0.032
liv-0.032
Token the
Token ID262
Feature activation-1.31e+00

Pos logit contributions
 too+0.015
 very+0.015
 bored+0.011
 so+0.010
 hungry+0.010
Neg logit contributions
aman-0.029
 Kel-0.029
ach-0.029
oun-0.028
ait-0.027
Token bathroom
Token ID12436
Feature activation-2.63e-01

Pos logit contributions
 very+0.048
 too+0.046
 a+0.039
 so+0.036
 called+0.034
Neg logit contributions
ach-0.060
aman-0.060
oun-0.057
z-0.056
 Kel-0.056
Token but
Token ID475
Feature activation+7.50e-02

Pos logit contributions
 too+0.005
 very+0.005
 a+0.004
 so+0.003
 called+0.003
Neg logit contributions
ach-0.016
aman-0.015
oun-0.015
 Kel-0.015
liv-0.015
Token the
Token ID262
Feature activation-1.49e+00

Pos logit contributions
ach+0.004
aman+0.004
oun+0.004
 Kel+0.004
z+0.004
Neg logit contributions
 too-0.002
 very-0.002
 a-0.001
 so-0.001
 called-0.001
Token door
Token ID3420
Feature activation+1.65e-01

Pos logit contributions
 too+0.055
 very+0.055
 a+0.045
 so+0.042
 called+0.041
Neg logit contributions
aman-0.065
ach-0.064
oun-0.061
 Kel-0.060
liv-0.059
Token was
Token ID373
Feature activation-1.32e+00

Pos logit contributions
ach+0.009
aman+0.009
oun+0.009
becca+0.008
liv+0.008
Neg logit contributions
 too-0.004
 very-0.004
 a-0.003
 called-0.003
 so-0.003
Token tight
Token ID5381
Feature activation+1.72e-01

Pos logit contributions
 too+0.033
 very+0.032
 a+0.026
 bored+0.025
 named+0.025
Neg logit contributions
aman-0.059
 Kel-0.059
liv-0.056
ach-0.056
less-0.056
Token and
Token ID290
Feature activation-1.23e+00

Pos logit contributions
ach+0.011
aman+0.011
oun+0.011
 Kel+0.010
liv+0.010
Neg logit contributions
 too-0.003
 very-0.003
 a-0.002
 so-0.001
 called-0.001
Token she
Token ID673
Feature activation-6.22e-01

Pos logit contributions
 too+0.032
 very+0.032
 a+0.024
 so+0.022
 bored+0.022
Neg logit contributions
aman-0.060
 Kel-0.060
ach-0.058
liv-0.058
oun-0.057
Token!"
Token ID2474
Feature activation-7.97e-01

Pos logit contributions
 too+0.036
 very+0.034
 a+0.028
 so+0.025
 called+0.024
Neg logit contributions
aman-0.065
ach-0.064
oun-0.063
 Kel-0.062
liv-0.061
Token
Token ID220
Feature activation+9.50e-02

Pos logit contributions
 too+0.011
 very+0.011
 a+0.006
 so+0.005
 called+0.004
Neg logit contributions
ach-0.052
aman-0.052
oun-0.051
 Kel-0.050
liv-0.050
Token
Token ID198
Feature activation-3.82e-01

Pos logit contributions
ach+0.007
aman+0.007
becca+0.006
umm+0.006
 occasions+0.006
Neg logit contributions
 too-0.001
 very-0.001
 a-0.001
 -0.000
 so-0.000
Token
Token ID198
Feature activation-2.24e-01

Pos logit contributions
 very+0.004
 too+0.004
 a+0.002
 +0.001
 so+0.001
Neg logit contributions
ach-0.027
aman-0.027
umm-0.026
oun-0.026
ait-0.025
TokenThe
Token ID464
Feature activation-8.27e-01

Pos logit contributions
 very+0.007
 too+0.007
 a+0.006
 so+0.005
 decided+0.005
Neg logit contributions
ach-0.011
aman-0.011
umm-0.011
oun-0.011
ait-0.010
Token little
Token ID1310
Feature activation-1.59e+00

Pos logit contributions
 very+0.029
 too+0.029
 a+0.025
 so+0.022
 hungry+0.021
Neg logit contributions
ach-0.038
aman-0.037
 Kel-0.036
oun-0.036
liv-0.036
Token girl
Token ID2576
Feature activation-4.36e-01

Pos logit contributions
 too+0.051
 very+0.049
 a+0.041
 so+0.036
 called+0.035
Neg logit contributions
ach-0.078
aman-0.076
oun-0.075
liv-0.075
 Kel-0.074
Token was
Token ID373
Feature activation-5.17e-01

Pos logit contributions
 too+0.011
 very+0.011
 a+0.008
 so+0.007
 called+0.007
Neg logit contributions
ach-0.025
aman-0.024
oun-0.024
 Kel-0.024
ait-0.023
Token so
Token ID523
Feature activation-7.34e-02

Pos logit contributions
 too+0.006
 very+0.005
 a+0.003
 bored+0.002
 called+0.002
Neg logit contributions
aman-0.032
ach-0.032
 Kel-0.031
liv-0.031
oun-0.031
Token excited
Token ID6568
Feature activation+9.76e-02

Pos logit contributions
 too+0.001
 very+0.001
 a+0.001
 so+0.001
 called+0.001
Neg logit contributions
ach-0.004
aman-0.004
oun-0.004
 Kel-0.004
liv-0.004
Token and
Token ID290
Feature activation-5.95e-01

Pos logit contributions
oun+0.006
z+0.006
ait+0.006
 Kel+0.006
liv+0.006
Neg logit contributions
 too-0.002
 very-0.002
 a-0.001
 so-0.001
 all-0.001
Token heard
Token ID2982
Feature activation+6.26e-01

Pos logit contributions
aman+0.039
ach+0.037
z+0.037
oun+0.037
ait+0.037
Neg logit contributions
 very-0.021
 too-0.020
 a-0.015
 bored-0.015
 so-0.014
Token L
Token ID406
Feature activation+5.75e-02

Pos logit contributions
ach+0.032
aman+0.032
oun+0.031
 Kel+0.031
liv+0.031
Neg logit contributions
 too-0.017
 very-0.016
 a-0.013
 so-0.012
 called-0.011
Tokenila
Token ID10102
Feature activation-6.56e-01

Pos logit contributions
ach+0.002
aman+0.002
oun+0.002
 Kel+0.002
liv+0.002
Neg logit contributions
 too-0.002
 very-0.002
 a-0.002
 so-0.002
 called-0.002
Token and
Token ID290
Feature activation-1.89e-01

Pos logit contributions
 too+0.015
 very+0.015
 a+0.011
 so+0.010
 bored+0.009
Neg logit contributions
ach-0.035
aman-0.034
 Kel-0.033
oun-0.033
z-0.033
Token Ben
Token ID3932
Feature activation-9.26e-01

Pos logit contributions
 very+0.005
 too+0.005
 so+0.004
 a+0.004
 bored+0.004
Neg logit contributions
 Kel-0.009
aman-0.009
ait-0.008
ach-0.008
oun-0.008
Token.
Token ID13
Feature activation-1.62e+00

Pos logit contributions
 very+0.025
 too+0.025
 a+0.019
 so+0.018
 called+0.017
Neg logit contributions
ach-0.043
aman-0.041
 Kel-0.041
z-0.040
oun-0.039
Token She
Token ID1375
Feature activation-1.20e-01

Pos logit contributions
 too+0.049
 very+0.048
 Amen+0.047
?"+0.041
 so+0.040
Neg logit contributions
 Kel-0.062
ach-0.051
less-0.051
aman-0.050
isc-0.050
Token came
Token ID1625
Feature activation+4.09e-01

Pos logit contributions
 too+0.003
 very+0.003
 bored+0.002
 a+0.002
 so+0.002
Neg logit contributions
aman-0.006
 Kel-0.006
ach-0.006
z-0.006
liv-0.006
Token to
Token ID284
Feature activation+2.56e-01

Pos logit contributions
ach+0.018
z+0.017
 Kel+0.016
aman+0.016
oun+0.016
Neg logit contributions
 too-0.012
 very-0.012
 a-0.010
 bored-0.010
 hungry-0.010
Token them
Token ID606
Feature activation-6.61e-02

Pos logit contributions
aman+0.015
oun+0.015
ach+0.015
liv+0.015
z+0.015
Neg logit contributions
 too-0.005
 very-0.005
 a-0.004
 so-0.003
 called-0.003
Token and
Token ID290
Feature activation+4.68e-02

Pos logit contributions
 very+0.001
 too+0.001
 a+0.001
 bored+0.001
 mean+0.001
Neg logit contributions
ach-0.003
aman-0.003
 Kel-0.003
z-0.003
ge-0.003
Token her
Token ID607
Feature activation+6.68e-01

Pos logit contributions
aman+0.018
ach+0.018
 Kel+0.018
oun+0.018
liv+0.018
Neg logit contributions
 too-0.009
 very-0.009
 a-0.006
 so-0.006
 bored-0.006
Token mom
Token ID1995
Feature activation+3.34e-01

Pos logit contributions
ach+0.035
aman+0.035
oun+0.034
 Kel+0.034
liv+0.033
Neg logit contributions
 too-0.019
 very-0.019
 a-0.015
 so-0.013
 all-0.012
Token said
Token ID531
Feature activation-1.55e-01

Pos logit contributions
ach+0.019
aman+0.019
oun+0.018
 Kel+0.018
liv+0.018
Neg logit contributions
 too-0.008
 very-0.007
 a-0.005
 so-0.005
 called-0.004
Token.
Token ID13
Feature activation-8.96e-01

Pos logit contributions
 too+0.003
 very+0.003
 a+0.002
 so+0.001
 bored+0.001
Neg logit contributions
aman-0.009
ach-0.009
oun-0.009
 Kel-0.009
liv-0.009
Token Instead
Token ID5455
Feature activation+1.31e-01

Pos logit contributions
 too+0.024
 very+0.023
 a+0.017
 mean+0.016
 bored+0.016
Neg logit contributions
aman-0.045
oun-0.044
ach-0.044
 Kel-0.044
liv-0.042
Token,
Token ID11
Feature activation-1.55e+00

Pos logit contributions
ach+0.009
oun+0.008
aman+0.008
z+0.008
liv+0.008
Neg logit contributions
 too-0.002
 very-0.002
 a-0.001
 so-0.001
 called-0.001
Token Lily
Token ID20037
Feature activation-2.81e-01

Pos logit contributions
 too+0.045
 very+0.045
 a+0.033
 called+0.033
 so+0.032
Neg logit contributions
 Kel-0.069
aman-0.068
ach-0.067
liv-0.065
oun-0.065
Token gave
Token ID2921
Feature activation-1.34e+00

Pos logit contributions
 too+0.007
 very+0.007
 a+0.005
 bored+0.005
 so+0.005
Neg logit contributions
aman-0.014
ach-0.014
z-0.013
 Kel-0.013
liv-0.013
Token the
Token ID262
Feature activation+6.96e-01

Pos logit contributions
 too+0.031
 very+0.031
 hungry+0.021
 bored+0.021
 so+0.021
Neg logit contributions
aman-0.067
 Kel-0.067
ach-0.066
z-0.065
liv-0.064
Token spider
Token ID19230
Feature activation-1.67e-01

Pos logit contributions
aman+0.022
ach+0.021
 Kel+0.021
oun+0.021
z+0.020
Neg logit contributions
 too-0.031
 very-0.031
 a-0.026
 so-0.026
 called-0.025
Token a
Token ID257
Feature activation+7.68e-01

Pos logit contributions
 too+0.002
 very+0.002
 bored+0.001
 called+0.001
 so+0.001
Neg logit contributions
aman-0.010
ach-0.010
 Kel-0.010
z-0.010
oun-0.010
Token boat
Token ID8848
Feature activation+1.37e-01

Pos logit contributions
 too+0.016
 very+0.016
 bored+0.013
 a+0.013
 called+0.013
Neg logit contributions
oun-0.017
ach-0.017
aman-0.016
z-0.015
 Kel-0.015
Token sailed
Token ID33835
Feature activation-6.49e-01

Pos logit contributions
ach+0.006
z+0.006
oun+0.006
 Kel+0.006
aman+0.006
Neg logit contributions
 too-0.004
 very-0.004
 a-0.003
 bored-0.003
 so-0.003
Token on
Token ID319
Feature activation-1.69e-01

Pos logit contributions
 too+0.017
 very+0.016
 bored+0.013
 hungry+0.012
 called+0.012
Neg logit contributions
ach-0.031
z-0.029
oun-0.028
aman-0.028
less-0.027
Token the
Token ID262
Feature activation-7.02e-01

Pos logit contributions
 too+0.004
 very+0.004
 bored+0.003
 hungry+0.003
 tired+0.003
Neg logit contributions
ach-0.009
aman-0.009
 Kel-0.008
oun-0.008
z-0.008
Token big
Token ID1263
Feature activation-1.45e-01

Pos logit contributions
 very+0.025
 too+0.025
 a+0.020
 so+0.019
 called+0.018
Neg logit contributions
aman-0.031
ach-0.031
oun-0.029
 Kel-0.029
liv-0.029
Token blue
Token ID4171
Feature activation-1.55e+00

Pos logit contributions
 too+0.005
 very+0.005
 a+0.004
 bored+0.004
 so+0.004
Neg logit contributions
ach-0.006
aman-0.006
oun-0.006
 Kel-0.005
z-0.005
Token sea
Token ID5417
Feature activation-1.32e+00

Pos logit contributions
 very+0.052
 too+0.051
 bored+0.042
 hungry+0.041
 afraid+0.041
Neg logit contributions
ach-0.069
aman-0.065
oun-0.065
z-0.060
ait-0.059
Token.
Token ID13
Feature activation-3.95e-03

Pos logit contributions
 too+0.031
 very+0.029
 bored+0.022
 a+0.022
 mean+0.021
Neg logit contributions
ach-0.067
z-0.064
oun-0.061
aman-0.060
less-0.059
Token The
Token ID383
Feature activation-7.46e-01

Pos logit contributions
 very+0.000
 too+0.000
 mean+0.000
 bored+0.000
 afraid+0.000
Neg logit contributions
oun-0.000
 Kel-0.000
aman-0.000
ach-0.000
ait-0.000
Token wind
Token ID2344
Feature activation+9.09e-02

Pos logit contributions
 too+0.027
 very+0.026
 a+0.021
 so+0.021
 called+0.021
Neg logit contributions
ach-0.030
oun-0.029
aman-0.028
z-0.027
 Kel-0.026
Token blew
Token ID17948
Feature activation-1.00e+00

Pos logit contributions
ach+0.004
oun+0.004
z+0.004
aman+0.004
 Kel+0.004
Neg logit contributions
 too-0.003
 very-0.003
 a-0.002
 so-0.002
 bored-0.002